Query lcl|NC_019538.1_cdsid_YP_007010341.1 [gene=F485_gp338] [protein=tail sheath protein] [protein_id=YP_007010341.1] [location=complement(156469..158505)] Match_columns 678 No_of_seqs 213 out of 801 Neff 9.1 Searched_HMMs 1612 Date Thu Nov 7 18:54:11 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_315 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_315_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:106427 Length: 679 100.0 1E-171 8E-175 957.3 54.5 678 1-678 1-678 (679) 2 protein:vir:101804 Length: 663 100.0 1E-162 7E-166 908.5 54.7 659 1-678 1-661 (663) 3 protein:vir:101187 Length: 663 100.0 2E-162 1E-165 907.3 55.1 659 1-678 1-661 (663) 4 protein:vir:6894 Length: 660 # 100.0 1E-162 9E-166 907.9 54.0 658 1-678 1-659 (660) 5 protein:vir:100539 Length: 663 100.0 4E-162 2E-165 905.4 54.3 659 1-678 1-661 (663) 6 protein:vir:108052 Length: 660 100.0 3E-160 2E-163 895.3 55.8 658 1-678 1-660 (660) 7 protein:vir:98263 Length: 664 100.0 2E-159 1E-162 890.4 54.9 658 1-678 1-663 (664) 8 protein:vir:6594 Length: 666 # 100.0 2E-159 1E-162 890.8 53.9 655 1-678 1-664 (666) 9 protein:vir:80984 Length: 666 100.0 2E-159 1E-162 890.5 51.8 655 1-678 1-664 (666) 10 protein:vir:103456 Length: 659 100.0 5E-157 3E-160 877.6 55.2 657 1-678 1-659 (659) 11 protein:vir:7206 Length: 659 # 100.0 3E-156 2E-159 873.1 55.7 658 1-678 1-659 (659) 12 protein:vir:5663 Length: 671 # 100.0 8E-153 5E-156 854.3 53.5 663 1-675 1-671 (671) 13 protein:vir:106984 Length: 743 100.0 6E-141 4E-144 789.4 53.7 655 1-676 1-743 (743) 14 protein:vir:104477 Length: 749 100.0 1E-140 7E-144 787.7 54.2 642 1-675 1-749 (749) 15 protein:vir:104858 Length: 729 100.0 3E-139 2E-142 780.1 52.1 637 1-677 3-729 (729) 16 protein:vir:79092 Length: 477 100.0 2E-108 1E-111 610.9 42.5 470 1-678 1-477 (477) 17 protein:vir:107865 Length: 477 100.0 3E-107 2E-110 605.0 41.8 470 1-678 1-477 (477) 18 protein:vir:98824 Length: 774 100.0 5E-105 3E-108 592.2 39.6 482 1-672 281-774 (774) 19 protein:vir:103168 Length: 641 100.0 3.7E-96 2.3E-99 543.8 37.7 533 1-559 3-641 (641) 20 protein:vir:78206 Length: 390 100.0 2.6E-95 1.6E-98 539.1 34.5 379 1-678 2-389 (390) 21 protein:vir:103993 Length: 390 100.0 2.6E-95 1.6E-98 539.1 34.5 379 1-678 2-389 (390) 22 protein:vir:5711 Length: 396 # 100.0 9.4E-95 5.8E-98 536.1 37.4 385 1-678 1-394 (396) 23 protein:vir:6079 Length: 396 # 100.0 1.1E-94 6.8E-98 535.7 36.9 385 1-678 1-394 (396) 24 protein:vir:79181 Length: 390 100.0 2.6E-94 1.6E-97 533.7 35.6 379 1-678 2-389 (390) 25 protein:vir:79141 Length: 391 100.0 2.1E-94 1.3E-97 534.2 35.1 379 1-676 2-391 (391) 26 protein:vir:2035 Length: 396 # 100.0 1.2E-93 7.5E-97 530.0 34.9 385 1-678 1-394 (396) 27 protein:vir:98553 Length: 395 100.0 2.5E-93 1.5E-96 528.3 36.6 385 1-678 1-394 (395) 28 protein:vir:1172 Length: 391 # 100.0 1.3E-93 7.8E-97 529.9 33.6 379 1-678 3-390 (391) 29 protein:vir:1845 Length: 392 # 100.0 1.8E-92 1.1E-95 523.5 35.9 382 1-678 1-391 (392) 30 protein:vir:100323 Length: 393 100.0 6.4E-92 4E-95 520.5 34.5 377 1-678 4-391 (393) 31 protein:vir:10336 Length: 386 100.0 6.9E-90 4.3E-93 509.4 35.0 376 1-674 1-386 (386) 32 protein:vir:96740 Length: 388 100.0 8.6E-90 5.3E-93 508.9 34.1 375 1-674 4-388 (388) 33 protein:vir:5833 Length: 742 # 100.0 3.8E-76 2.4E-79 434.0 45.5 600 5-671 1-742 (742) 34 protein:vir:63742 Length: 562 100.0 4.9E-66 3E-69 378.6 39.2 534 1-670 9-562 (562) 35 protein:vir:102819 Length: 648 100.0 5.5E-66 3.4E-69 378.4 37.2 589 1-671 1-648 (648) 36 protein:vir:80488 Length: 562 100.0 1.4E-64 8.9E-68 370.6 40.2 535 1-670 9-562 (562) 37 protein:vir:79798 Length: 717 100.0 1.7E-64 1E-67 370.2 38.1 627 1-665 1-717 (717) 38 protein:vir:95741 Length: 587 100.0 2E-62 1.2E-65 358.9 38.0 561 1-670 9-587 (587) 39 protein:vir:80779 Length: 569 100.0 1.8E-61 1.1E-64 353.7 39.6 541 1-670 1-569 (569) 40 protein:vir:99306 Length: 587 100.0 1.8E-60 1.1E-63 348.1 39.5 557 1-670 9-587 (587) 41 protein:vir:96586 Length: 587 100.0 6.9E-58 4.3E-61 333.9 41.1 548 1-670 9-587 (587) 42 protein:vir:100829 Length: 607 100.0 9.1E-53 5.6E-56 305.9 37.4 563 1-676 18-607 (607) 43 protein:vir:102957 Length: 437 100.0 6E-52 3.7E-55 301.4 34.9 417 1-664 9-437 (437) 44 protein:vir:101326 Length: 529 100.0 1.3E-42 7.8E-46 250.3 32.1 487 1-665 1-529 (529) 45 protein:vir:105470 Length: 451 100.0 6.9E-41 4.3E-44 240.8 35.9 423 1-664 9-451 (451) 46 protein:vir:107310 Length: 581 100.0 4.3E-39 2.7E-42 230.9 33.6 530 79-678 1-579 (581) 47 protein:vir:7653 Length: 581 # 100.0 2.3E-38 1.4E-41 227.0 34.1 522 97-678 1-579 (581) 48 protein:vir:78986 Length: 436 100.0 3.4E-29 2.1E-32 176.7 31.7 406 1-664 11-436 (436) 49 protein:vir:102359 Length: 356 99.2 7.6E-13 4.7E-16 87.0 16.1 321 255-663 1-356 (356) 50 protein:vir:3788 Length: 376 # 99.0 1.4E-09 9E-13 69.0 24.6 350 255-672 1-376 (376) 51 protein:vir:3751 Length: 376 # 98.9 1.1E-08 6.6E-12 64.3 27.2 350 255-672 1-376 (376) 52 protein:vir:276 Length: 369 # 98.9 3.2E-09 2E-12 67.2 23.8 345 274-668 1-369 (369) 53 protein:vir:489 Length: 498 # 98.9 9.3E-09 5.8E-12 64.6 25.0 441 1-668 10-498 (498) 54 protein:vir:4517 Length: 498 # 98.9 1.4E-08 8.5E-12 63.7 25.1 445 1-668 10-498 (498) 55 protein:vir:4463 Length: 498 # 98.8 4.5E-08 2.8E-11 60.9 24.8 449 1-675 10-498 (498) 56 protein:vir:78782 Length: 370 98.8 4.4E-08 2.7E-11 60.9 23.9 346 235-675 1-370 (370) 57 protein:vir:1996 Length: 495 # 98.4 6.2E-07 3.8E-10 54.6 28.6 441 1-665 11-495 (495) 58 protein:vir:95263 Length: 450 98.3 1.6E-06 1E-09 52.4 23.4 407 161-671 1-450 (450) 59 protein:vir:80052 Length: 331 98.0 9.1E-06 5.6E-09 48.2 24.8 311 284-665 1-331 (331) 60 protein:vir:5260 Length: 502 # 97.8 2.1E-05 1.3E-08 46.2 32.5 462 1-665 1-502 (502) 61 protein:vir:3165 Length: 426 # 94.0 0.0056 3.5E-06 32.9 20.0 367 235-665 1-426 (426) 62 protein:vir:96104 Length: 504 93.6 0.0068 4.2E-06 32.5 19.2 427 149-664 1-504 (504) 63 protein:vir:99586 Length: 507 87.4 0.039 2.4E-05 28.3 20.8 437 133-664 1-507 (507) 64 protein:vir:106730 Length: 501 81.5 0.085 5.3E-05 26.4 28.4 433 136-674 1-501 (501) 65 protein:vir:101576 Length: 501 77.2 0.13 7.9E-05 25.5 28.0 440 136-674 1-501 (501) 66 protein:vir:3636 Length: 501 # 76.1 0.14 8.6E-05 25.3 28.3 444 136-674 1-501 (501) 67 protein:vir:78611 Length: 501 57.8 0.43 0.00027 22.6 28.5 442 136-674 1-501 (501) No 1 >protein:vir:106427 Length: 679 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944106;genbank:gi:38640150;genbank:GeneID:2658188 Probab=100.00 E-value=1.4e-171 Score=957.34 Aligned_cols=678 Identities=69% Similarity=1.162 Sum_probs=561.8 Q ss_pred CceecCceEEEEcCCCcccccCCccceeEEecccCCCCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHHhcCCeEEE Q lcl|NC_019538. 1 MALLSPGVESKENNMQTTIARSSTGRAALAGKFQWGPAYQISQLVSETDLIDRFGRPDNQTADSVLSAINFLKYGNDLRT 80 (678) Q Consensus 1 ~~~~~PGVyveEv~~~~~i~~v~tsv~afvG~~~~Gpv~~pv~i~s~~~~~~~FG~~~~~~~~~~~v~~~f~ngG~~~~v 80 (678) |+|+||||||||+|++++|+||+||++||||+|+|||+|+|++|+||.||++.||++++.+|++|++++||+|||++||| T Consensus 1 ~~~~~Pgvyv~e~~~~~~i~~~~t~~~~~vg~~~~gp~~~p~~i~s~~~~~~~fg~~~~~~~~~~~~~~~f~~gg~~~~v 80 (679) T protein:vir:10 1 MTLLSPGVETKEINLQTTIARSSTGRAALVGKFNWGPAYQISQVVSEVDLVDKFGRPDDQTADSFFSGVNFLNYGNDLRL 80 (679) T ss_pred CceecCceEEEeecCCcccccCccccceeeecccCCCCccCEEecCHHHHHHHcCCcccccchHHHHHHHHHhCCCeEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEcCCcccccccccccccceeeeecccccccccceeeecccccccccccccccccccccceeeecccccccccceeeccc Q lcl|NC_019538. 81 VRILDEDTARNSSPFFETIDYTITSPGVDYRIGDDVKILQNGATITTGKVSALNSVGGITFVRFSTAEVVKKAKELNDYP 160 (678) Q Consensus 81 vRv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~a~~~~~~~ 160 (678) |||.+++..+++.++++.+..++.+++.+..+++.+.+..............+..++......++.......+......+ T Consensus 81 vrv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~s~~~~~~~~~~~~~~~~~~~~v~~~~~~~~a~~~~~~~ 160 (679) T protein:vir:10 81 VRVLNETKSRNSSALYQSLSYTITSPGVDYKVGDVVNVLQGGNVIATGKVTVVNASGGIVAFYVPTAAIIDKAKSLNDYP 160 (679) T ss_pred EEccCcccccccccccccccccccccccccccccceeeeeCCCcccceeEEEeeccCceeeeeecccccccccccccccc Confidence 99999988888888888888888999988888998887766655555555555555554444444444444444444444 Q ss_pred ccccceeeeeeecccccccccceeeeeeccccceeeecccccccccccccccccchhccccccccceeeeccccccccce Q lcl|NC_019538. 161 ALQNGWQIQFTSGGPGSGQSATAVLNGIRQDSKIYIRNDEYSRESLLRRDETTETYIDMCESYGIPVVASRYAGLTGDNI 240 (678) Q Consensus 161 ~~~~~~~~~~~s~~~~~g~~a~~~~~~~~~~~~i~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~i~A~~~G~~gn~i 240 (678) .................+.........+..+..........+.......................+...+...|.+++.+ T Consensus 161 ~l~~a~~~~~~~~t~~~g~~~~~~v~~v~~~~~~~~~~~~~a~~~i~~~~~~~~t~~~~~~~~~~~~~~A~~~g~~gn~i 240 (679) T protein:vir:10 161 ALDNAWQIQFAAGGPGAGQAATATVVGINLDSTIFVPNDEYAMSAISERSETKRTFIDICEEMKVPAIVARYAGTYGDNI 240 (679) T ss_pred eecccceeeeeeccccccceeeeeeeeeccCCceeeccccccccccccccccchhhhhhhhccccceeeeecccccCCcc Confidence 44444444444444444444444444444444444444444333333222333334444455566777888999999999 Q ss_pred eEEeccccccccccccccccccccccccccccccceeeeeccccccccccccccceeeeccCCeeeeeeeeecccccccc Q lcl|NC_019538. 241 QVAFIAYKDYYKFGVDGKISSVNTVNLKTFPSGLSFGNITPSSYLEYGPQTKDQFAMIVFVGGSAVESRILSVKENDRDI 320 (678) Q Consensus 241 ~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~ 320 (678) ++.+.................................................++.+++..++...+++.++...+.... T Consensus 241 ~v~~va~~~~~~~~~~~a~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vvv~~~g~~~~~~~~~~~~~~~~~ 320 (679) T protein:vir:10 241 KVLMIAYKDYYKFNEAGKIVSVNTINPKVFPTGLDYGNVTPSSYLEFGPQNESQFAFIVFNNGVAVESKILSTKPGDRDI 320 (679) T ss_pred eEEEEeecccccccccccccccccccccccccccccccceeeeecccccccccceeeEEecccccccceeeecccccccc Confidence 88766555444333222222111111111111111222222233334455666788888888888899998888888777 Q ss_pred ccchhhhhhhhcCCcceEEEEecCCCccccceeeeeccCcCCccccchhHHHhhhhhhhccchhccccccccccccCccc Q lcl|NC_019538. 321 YGSSIYVDEFFINGYSTFIQGVAESWPTEYSGILTFGGGNSGNSTASAGDWIEGWDMFSDREHVDVNLFIAGSCAGEGVE 400 (678) Q Consensus 321 ~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~l~gg~dg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 400 (678) .....+......++.+.++.......+...+..++++||.++......+++..++++++..+...+++++++.......+ T Consensus 321 ~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~p~~~~~~~~ 400 (679) T protein:vir:10 321 YGTSIYINEYFGNGYSSFVQGVAESWPVGYTGVLAFGGGQSSNTDISAAEFMKGWDMFADREHTDVNLFIAGAVAGEGAQ 400 (679) T ss_pred cchhhhhhhhhcCcccceeeeccccccccccceeeccCCccCCCccchhhhhhhhhhhhcccccccceEEecCCCCCchh Confidence 77788888888888889988888888888888999999999998888999999999999998888999988888777777 Q ss_pred chhHHHHHHHHHHHhcCCeEEEEccccchhccccccCCHHHHHHHHhcccccccchhhccccccceEEEEcCeEEEeccc Q lcl|NC_019538. 401 IASTVQKSVAAICDERQDCLGWISPPREYMVNLPVATAVKKMVEWRRGVTDSGVVVDDNMNIGTTYSSTSANYKLQYDKY 480 (678) Q Consensus 401 ~~~~v~~~l~~~~~~~~~~~~i~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~v~d~~ 480 (678) +..+|+.+|++||+++++||+|+|+|+....+.+..++.+++.+||+.+...+....+..+++|+|+++||||++++|+. T Consensus 401 ~~~~v~~~l~~~~~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~ 480 (679) T protein:vir:10 401 IASTVQKAVVAIADERRDCLVLISPPREYMINQPAASVVRKLVDWRRGVNQAGISLDDNMNIGTTYASVDGNYKYQYDKY 480 (679) T ss_pred hhHHHHHHHHHHHHhhCCeEEEEeccccccccccccchHHHHHHHHhhcccccchhhhhhccCcceEEEEccceeeeccc Confidence 88899999999999999999999999999999999999999999999998888888899999999999999999999999 Q ss_pred CCceeEechHHHHHHHHHHHhhcCCceECcCCcchhheeecccceecCChhhhhhhhhCCcEEEEEecCCcEEEeccccC Q lcl|NC_019538. 481 NDTNRWIPLSADMAGLCARTDTVAQPWMSPAGFNRGQILDVRKLAIETRQAHRDELYQNSMNPVVGFPGQGFILYGDKTM 560 (678) Q Consensus 481 ~~~~~~~pps~~vAG~~a~~d~~~g~~~sPan~~~~~v~~~~~~~~~~~~~e~~~L~~~gIn~i~~~~~~G~~~wG~rT~ 560 (678) +++.+++||||++||+|||+|.++||||||+|+++++|.|+.++++.+++.|++.||++||||||+|+++|+++||+||+ T Consensus 481 ~~~~~~~p~sg~vAGl~Ar~D~~~g~~~sPan~~~~~i~g~~~~~~~~~~~~~~~Ln~~gin~i~~~~g~G~~~wG~rT~ 560 (679) T protein:vir:10 481 NDVNRWIPLAADIAGLCARTDTVGQPWQSPAGFNRGQIVNVIKLAVDTRQAHRDEMYTNGINPIVGFAGQGYILYGDKTA 560 (679) T ss_pred CCceEEechHHHHHHHHHHhhccCCcEECcCCeeeccccccccceeecChhhHHhhhhCCceEEEEecCCeEEEEccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCccccceeehhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhcCCeeeeEEEEccCCCCHHHhhCC Q lcl|NC_019538. 561 SLQPTPFDRINVRRLFNLLKKSISESAKYKLFENNDAFTRNSFRSEVNSYLDSIKSLGGIYDFRVVCDETNNTPAVIDRN 640 (678) Q Consensus 561 ~~~~~~~~~i~vrR~~~~i~~si~~~~~~~vfepn~~~~~~~v~~~i~~~L~~l~~~gal~g~~V~~d~~~nt~~~i~~G 640 (678) ++++++|+||||||||+|||++|+++++|+||||||+.+|++|+++|++||++||++|+|.||+|+||+++||++||++| T Consensus 561 ~~~~s~~~~i~vrR~~~~i~~si~~~~~~~v~epn~~~~~~~i~~~i~~fL~~l~~~gal~gf~v~~d~~~nt~~~i~~G 640 (679) T protein:vir:10 561 SQAPTPFDRINVRRLFNLLKKSISESAKYKLFELNDAFTRSSFRSEVGSYLDTIRSLGGIYDFRVVCDESNNTPAVIDRN 640 (679) T ss_pred CCCCcccceEehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEcCCCCCHHHhhCC Confidence 99988999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEEEEEEecCCceEEEEEEEEeecCceeeeccccCC Q lcl|NC_019538. 641 EFVATILIKPARSINYVSLNFSAVGTSANFDELVGPQN 678 (678) Q Consensus 641 ~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~~~~~ 678 (678) +|+++|+++|++|||||+|||+|++++++|+|+++++| T Consensus 641 ~~~~~i~~~p~~pae~i~~~~~~~~~~~~~~e~~~~~~ 678 (679) T protein:vir:10 641 EFVATILIKPARSINYITLSFVATSTGADFDELVGSFQ 678 (679) T ss_pred eEEEEEEEEecCCccEEEEEEEEeecCccHHHHHHHhc Confidence 99999999999999999999999999999999999999 No 2 >protein:vir:101804 Length: 663 # NCBI annotation: gp18 # Family: family:all:661 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238881;genbank:gi:66391956;genbank:GeneID:3416631 Probab=100.00 E-value=1.1e-162 Score=908.45 Aligned_cols=659 Identities=52% Similarity=0.921 Sum_probs=528.7 Q ss_pred CceecCceEEEEcCCCcccccCCccceeEEecccCCCCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHHhcCCeEEE Q lcl|NC_019538. 1 MALLSPGVESKENNMQTTIARSSTGRAALAGKFQWGPAYQISQLVSETDLIDRFGRPDNQTADSVLSAINFLKYGNDLRT 80 (678) Q Consensus 1 ~~~~~PGVyveEv~~~~~i~~v~tsv~afvG~~~~Gpv~~pv~i~s~~~~~~~FG~~~~~~~~~~~v~~~f~ngG~~~~v 80 (678) |+||||||||||+|++++|+||+||++||||+|+|||+|+|++|+||.||++.||++++.+|++|++++||+|||++||| T Consensus 1 ~~~~~Pgvyv~e~~~~~~i~~~~t~~~~~vG~~~~Gp~~~p~~v~~~~~~~~~fg~~~~~~~~~~~~~~~f~ngg~~~~v 80 (663) T protein:vir:10 1 MALLSPGIEMKETSINSTVVRSATGRAAIVGKFAWGPAYEVRQVTNEVELVDMFGSPDNVTAPYFMSAMNFLQYGNDLRL 80 (663) T ss_pred CceecCceEEEEecCCccccccCcccceeEeecccCCCCccEEecCHHHHHHhcCCcCCcchhHHHHHHHHHhCCCeEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEcCCcccccccccccccceeeeecccccccccceeeeccccccccc-ccccccccccccceeeecccccccccceeecc Q lcl|NC_019538. 81 VRILDEDTARNSSPFFETIDYTITSPGVDYRIGDDVKILQNGATITT-GKVSALNSVGGITFVRFSTAEVVKKAKELNDY 159 (678) Q Consensus 81 vRv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~a~~~~~a~~~~~~ 159 (678) |||.+.++.+++..+.+....+...++....+|+.+.+......... +........+.........+.....+.....+ T Consensus 81 vRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~i~v~~~~~~~~~~~~~~~~~~~~~~~~v~~~ta~~~~~~~~v~~~ 160 (663) T protein:vir:10 81 VRVIDMEKAKNASPLVNQVSVTITTEGQGYTVGDAITVKYNNATITEAGKVTAVDSDGKIKSLFVPTAEIIAKTRQLGTY 160 (663) T ss_pred EEccCCccccccccccccceeEEeecccccccccccccccccccccccccceeeecccceEEEeeccccccccccccccc Confidence 99999888888888888888888888888888888877555443322 22222222222233322222222222222333 Q ss_pred cccccceeeeeeecccccccccceeeeeeccccceeeecccccccccccccccccchhccccccccceeeeccccccccc Q lcl|NC_019538. 160 PALQNGWQIQFTSGGPGSGQSATAVLNGIRQDSKIYIRNDEYSRESLLRRDETTETYIDMCESYGIPVVASRYAGLTGDN 239 (678) Q Consensus 160 ~~~~~~~~~~~~s~~~~~g~~a~~~~~~~~~~~~i~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~i~A~~~G~~gn~ 239 (678) +.....+......... ..........+..+............. ..+.............+.+.+...|.|++. T Consensus 161 ~~~~~~~~~~~s~~s~--~~~~a~~v~~v~~d~~~~v~~~~~a~~-----~~t~~~~~~~~~~~~~~~i~A~~~G~~Gn~ 233 (663) T protein:vir:10 161 PTLGDNWRIDVSGASG--GSAAALALGNIVVDSGVTFGNSEDAPA-----VMTSPAVMEKYAKFGMPLISAVYPGEIGST 233 (663) T ss_pred eeeccceeeEeeeccC--ccccccccceeccccceEEeecccccc-----ccccccccccccccccceEEeccCCcccce Confidence 2222222221111111 111111122222222222222111111 111122223334445567889999999999 Q ss_pred eeEEeccccccccccccccccccccccccccccccceeeeeccccccccccccccceeeeccCCeeeeeeeeeccccccc Q lcl|NC_019538. 240 IQVAFIAYKDYYKFGVDGKISSVNTVNLKTFPSGLSFGNITPSSYLEYGPQTKDQFAMIVFVGGSAVESRILSVKENDRD 319 (678) Q Consensus 240 i~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~ 319 (678) +++.+............. ..+..... ...............+.+.+++..++...+.+.++...+... T Consensus 234 i~V~i~~~~~~~~~~~~~-----------~~~~~~~~-~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~ 301 (663) T protein:vir:10 234 VEVEIVSKTAFNSGAQQT-----------IYPFGGTR-TSNARGVIQYGPMTDDQFAIIVRRDGIVVESTVLSTRKGDRD 301 (663) T ss_pred eeeeeccccccccccccc-----------eecccccc-cccccceeecccccccceeeEeecCCcceeeecccccccccc Confidence 998776544332211110 00111110 011122223344556677888888888889999988888888 Q ss_pred cccchhhhhhhhcCCcceEEEEecCCCccccceeeeeccCcCCccccchhHHHhhhhhhhccchhccccccccccccCcc Q lcl|NC_019538. 320 IYGSSIYVDEFFINGYSTFIQGVAESWPTEYSGILTFGGGNSGNSTASAGDWIEGWDMFSDREHVDVNLFIAGSCAGEGV 399 (678) Q Consensus 320 ~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~l~gg~dg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 399 (678) ......+....+.++.+.++.......+......++++||.|+.+..+..+++++++.+...+.+.+++++++....... T Consensus 302 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~gg~d~~~~~~~~~~~~~~~~l~~~~~~~~~~~i~~~~~~~~~ 381 (663) T protein:vir:10 302 VYGSNIFMDDYFRNGGSNFIFASSEGWPAGFTGIIQLGGGTSANADVGADELIKGWDLFSDREALHVNLMIAGACGSDGA 381 (663) T ss_pred cccchhhhhhhhcCCcceEEEEeecccCccccceeEeccccCCccccchhhhHHHHHhhhcccccceeEEEeccCCCCch Confidence 88788888888888888888888777788888889999999999988899999999999988888888888877666666 Q ss_pred cchhHHHHHHHHHHHhcCCeEEEEccccchhccccccCCHHHHHHHHhcccccccchhhccccccceEEEEcCeEEEecc Q lcl|NC_019538. 400 EIASTVQKSVAAICDERQDCLGWISPPREYMVNLPVATAVKKMVEWRRGVTDSGVVVDDNMNIGTTYSSTSANYKLQYDK 479 (678) Q Consensus 400 ~~~~~v~~~l~~~~~~~~~~~~i~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~v~d~ 479 (678) +...+++.+|++||+++++||+|+|+|+.........++.+++++|++.+...........+++|+|+++||||++++|+ T Consensus 382 ~~~~~v~~~l~~~a~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~ 461 (663) T protein:vir:10 382 EIASTVQKYVVSLADDRQDCVAIVNPPAELMVGIPTSTAVKNIVEWRNGMTGSGEVVDNNMNISSTYAFIIGNYKYQYDK 461 (663) T ss_pred hhHHHHHHHHHHHHHhhCCEEEEEecCcccccccccccchHHHHHHHHhccccccchhhhcccCccceEEEcCceEEecc Confidence 67788999999999999999999999999888888889999999999999888888888999999999999999999999 Q ss_pred cCCceeEechHHHHHHHHHHHhhcCCceECcCCcchhheeecccceecCChhhhhhhhhCCcEEEEEecC-CcEEEeccc Q lcl|NC_019538. 480 YNDTNRWIPLSADMAGLCARTDTVAQPWMSPAGFNRGQILDVRKLAIETRQAHRDELYQNSMNPVVGFPG-QGFILYGDK 558 (678) Q Consensus 480 ~~~~~~~~pps~~vAG~~a~~d~~~g~~~sPan~~~~~v~~~~~~~~~~~~~e~~~L~~~gIn~i~~~~~-~G~~~wG~r 558 (678) .+++.+++||||++||+|||+|.++||||||+|+++++|.|+.++++.+++.|++.||++|||||++|++ +|+++||+| T Consensus 462 ~~~~~~~~p~sg~vAGl~Ar~D~~~g~~~sPan~~~~~i~g~~~~~~~~~~~e~~~Ln~~gin~i~~~~~~~G~~~wG~r 541 (663) T protein:vir:10 462 YNDINRWVPLAADIAGLCAYTDQVSHPWMSPAGYRRGQIRNCIKLAIEPKQSMRDTMYQVAINPVTGFAGGDGFVLFGDK 541 (663) T ss_pred cCCceEEechhHHHHHHHHHhhccCCceEccCCceeccccccccceeecChhHHHHHhhCCceEEEEEeCCCcEEEEccc Confidence 9999999999999999999999999999999999999999999999999999999999999999999997 799999999 Q ss_pred cCCCCccccceeehhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhcCCeeeeEEEEccCCCCHHHhh Q lcl|NC_019538. 559 TMSLQPTPFDRINVRRLFNLLKKSISESAKYKLFENNDAFTRNSFRSEVNSYLDSIKSLGGIYDFRVVCDETNNTPAVID 638 (678) Q Consensus 559 T~~~~~~~~~~i~vrR~~~~i~~si~~~~~~~vfepn~~~~~~~v~~~i~~~L~~l~~~gal~g~~V~~d~~~nt~~~i~ 638 (678) |+++++++|+||||||||+|||++|+++++|+||||||+.+|++|+++|+.||++||++|+|.||+|+||+++||+++|+ T Consensus 542 T~~~~~s~~~~i~vrR~~~~i~~si~~~~~~~v~epn~~~l~~~i~~~i~~~L~~l~~~gal~g~~v~~d~~~nt~~~i~ 621 (663) T protein:vir:10 542 MATQVPSPFDRINVRRLFNMLKKNIGDTSKYELFENNDAFTRQSFRMETSQYLDGIRSLGGCYDFRVVCDTTNNTPNVID 621 (663) T ss_pred ccCCCCcccceEehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEcCCCCCHHHhh Confidence 99999889999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCEEEEEEEEEecCCceEEEEEEEEeecCceeeeccccCC Q lcl|NC_019538. 639 RNEFVATILIKPARSINYVSLNFSAVGTSANFDELVGPQN 678 (678) Q Consensus 639 ~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~~~~~ 678 (678) +|+|+++|+++|++|+|||+|||+|++++.+|+|++++++ T Consensus 622 ~G~~~~~i~~~p~~pae~i~~~~~~~~~~~~~~e~~~~~~ 661 (663) T protein:vir:10 622 RNEFVGTIYVKPPRSINYITLNMVATSTGANFDELIGPMQ 661 (663) T ss_pred CCeEEEEEEEEecCCcceEEEEEEEeecCccHHHHHHHHh Confidence 9999999999999999999999999999999999999999 No 3 >protein:vir:101187 Length: 663 # NCBI annotation: tail sheath protein # Family: family:all:661 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932509;genbank:gi:37651635;genbank:GeneID:2610680 Probab=100.00 E-value=1.8e-162 Score=907.26 Aligned_cols=659 Identities=52% Similarity=0.923 Sum_probs=525.5 Q ss_pred CceecCceEEEEcCCCcccccCCccceeEEecccCCCCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHHhcCCeEEE Q lcl|NC_019538. 1 MALLSPGVESKENNMQTTIARSSTGRAALAGKFQWGPAYQISQLVSETDLIDRFGRPDNQTADSVLSAINFLKYGNDLRT 80 (678) Q Consensus 1 ~~~~~PGVyveEv~~~~~i~~v~tsv~afvG~~~~Gpv~~pv~i~s~~~~~~~FG~~~~~~~~~~~v~~~f~ngG~~~~v 80 (678) |+||||||||||+|++++|+||+||++||||+|+|||+|+|++|+||.||++.||++++.+|++|+|++||+|||++||| T Consensus 1 ~~~~~PgVyv~e~~~~~~i~~v~ts~~~fvG~~~~Gp~~~p~~i~s~~d~~~~fG~~~~~~~~~~~v~~~f~ngg~~~~v 80 (663) T protein:vir:10 1 MALLSPGIEMKETSINSTVVRSATGRAAIVGKFAWGPAYEVRQVTNEVELVDMFGSPDNVTAPYFMSAMNFLQYGNDLRL 80 (663) T ss_pred CceecCceEEEEecCcccccccCccceeEEeeeccCCCCccEEecCHHHHHHHhCCcCccchhHHHHHHHHHhCCCeEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEcCCcccccccccccccceeeeecccccccccceeeeccccccccc-ccccccccccccceeeecccccccccceeecc Q lcl|NC_019538. 81 VRILDEDTARNSSPFFETIDYTITSPGVDYRIGDDVKILQNGATITT-GKVSALNSVGGITFVRFSTAEVVKKAKELNDY 159 (678) Q Consensus 81 vRv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~a~~~~~a~~~~~~ 159 (678) |||.+.++.+++..+.+....+...++....+|+.+.+......... +.......++.........+.....+...... T Consensus 81 vRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~v~~~~~~~~~~~~~~~~~~n~~~~~v~~~~a~~~~~~~~v~~~ 160 (663) T protein:vir:10 81 VRVIDMEKAKNASPLVNQVSVTITTEGQGYTVGDAITVKYNNATITEAGKVTAVDSDGKIKSLFVPTAEIIAKTRQLGTY 160 (663) T ss_pred EEccCCcccccccccCCcceeeeeccccCccccccccccccccccccccceeeeccCCceEEEEecccccccccccccee Confidence 99999888888888888888888888888888888877554433322 22222333333333333333222222222222 Q ss_pred cccccceeeeeeecccccccccceeeeeeccccceeeecccccccccccccccccchhccccccccceeeeccccccccc Q lcl|NC_019538. 160 PALQNGWQIQFTSGGPGSGQSATAVLNGIRQDSKIYIRNDEYSRESLLRRDETTETYIDMCESYGIPVVASRYAGLTGDN 239 (678) Q Consensus 160 ~~~~~~~~~~~~s~~~~~g~~a~~~~~~~~~~~~i~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~i~A~~~G~~gn~ 239 (678) .................. ........+..+............. ..+.............+.+.+.++|.+++. T Consensus 161 ~~~~~~~~~~~~~~~~~~--~~~~~v~~vv~~~~~~~~~~~~a~~-----~~~~~~~~~~~~~~~~~~~~a~~~G~~Gn~ 233 (663) T protein:vir:10 161 PTLGDNWRIDVSGASGGS--AAALALGNIVVDSGVTFGNSEDAPA-----VMTSPAVMEKYAKFGMPLVSAVYPGEIGST 233 (663) T ss_pred eeccccceeEeeeccccc--cccccccceecccceeeEeeccccc-----cccccchhhhcccccceeeeeecccccccc Confidence 222222222211111111 1111111122222111111111111 111122222334445567788999999999 Q ss_pred eeEEeccccccccccccccccccccccccccccccceeeeeccccccccccccccceeeeccCCeeeeeeeeeccccccc Q lcl|NC_019538. 240 IQVAFIAYKDYYKFGVDGKISSVNTVNLKTFPSGLSFGNITPSSYLEYGPQTKDQFAMIVFVGGSAVESRILSVKENDRD 319 (678) Q Consensus 240 i~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~ 319 (678) +++.+............ . ....... ..................+.+++..++...+.+.++...+... T Consensus 234 i~v~i~~~~~~~~~~~~-----~------v~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~ 301 (663) T protein:vir:10 234 VEVEIVSKTAFNSGAQQ-----T------IYPFGGT-RTSNARGVIQYGPMTDDQFAIIVRRDGIVVESTVLSTRKGDRD 301 (663) T ss_pred eeEEecccccccccccc-----c------ccccccc-cccccceeeeeccccccceeEEEecCCcceeeeeeeecccccc Confidence 98876553322111000 0 0000000 0111112222334455667788888888888888888888887 Q ss_pred cccchhhhhhhhcCCcceEEEEecCCCccccceeeeeccCcCCccccchhHHHhhhhhhhccchhccccccccccccCcc Q lcl|NC_019538. 320 IYGSSIYVDEFFINGYSTFIQGVAESWPTEYSGILTFGGGNSGNSTASAGDWIEGWDMFSDREHVDVNLFIAGSCAGEGV 399 (678) Q Consensus 320 ~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~l~gg~dg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 399 (678) ......+....+.++.+.++.......+...+..+++.||.|+.+..+..+++++++.+...+.+.+++++++....... T Consensus 302 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~gg~d~~~~~~~~~~~~~~~~l~~~~~~~~~~~i~~~~~~~~~ 381 (663) T protein:vir:10 302 VYGSNIFMDDYFRNGGSNFIFASSEGWPAGFTGIIQLGGGTSANADVGADELIKGWDLFSDREALHVNLMIAGACGSDGA 381 (663) T ss_pred cchhhhhhhhhhccCcceEEEEeecccCccccceeEcccccCCCccccchhhHHHHHhhhcccccceeEEEeccCCCCch Confidence 77777777777778888888888777787778889999999999988999999999999888888888888877766666 Q ss_pred cchhHHHHHHHHHHHhcCCeEEEEccccchhccccccCCHHHHHHHHhcccccccchhhccccccceEEEEcCeEEEecc Q lcl|NC_019538. 400 EIASTVQKSVAAICDERQDCLGWISPPREYMVNLPVATAVKKMVEWRRGVTDSGVVVDDNMNIGTTYSSTSANYKLQYDK 479 (678) Q Consensus 400 ~~~~~v~~~l~~~~~~~~~~~~i~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~v~d~ 479 (678) +...+|+.+|++||+++++||+|+|+|...........+.+++.+|++.+...........+++|+|+++||||++++|+ T Consensus 382 ~~~~~v~~al~~~a~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~l~~P~~~~~d~ 461 (663) T protein:vir:10 382 EIASTVQKYVVSLADDRQDCVAIVNPPAELMVGIPTSTAVKNIVEWRNGMTGSGEVVDNNMNISSTYAFIIGNYKYQYDK 461 (663) T ss_pred hhHHHHHHHHHHHHHhhCCEEEEEecCcccccccccccchHHHHHHHHhccccccchhhccccCcceEEEEccceEEecc Confidence 67789999999999999999999999998888888889999999999999888888888999999999999999999999 Q ss_pred cCCceeEechHHHHHHHHHHHhhcCCceECcCCcchhheeecccceecCChhhhhhhhhCCcEEEEEecC-CcEEEeccc Q lcl|NC_019538. 480 YNDTNRWIPLSADMAGLCARTDTVAQPWMSPAGFNRGQILDVRKLAIETRQAHRDELYQNSMNPVVGFPG-QGFILYGDK 558 (678) Q Consensus 480 ~~~~~~~~pps~~vAG~~a~~d~~~g~~~sPan~~~~~v~~~~~~~~~~~~~e~~~L~~~gIn~i~~~~~-~G~~~wG~r 558 (678) .+++.+++||||++||+|||+|.++||||||+|+++++|.|+.++++.+++.|++.||++|||||++|++ +|+++||+| T Consensus 462 ~~~~~~~~p~s~~vAGl~Ar~D~~~g~~~sPan~~~~~i~g~~~~~~~~~~~~~~~Ln~~gin~i~~~~~~~G~~~wG~r 541 (663) T protein:vir:10 462 YNDINRWVPLAADIAGLCAYTDQVSHPWMSPAGYRRGQIRNCIKLAIEPKQSMRDTMYQVAINPVTGFAGGDGFVLFGDK 541 (663) T ss_pred cCCceEEechhHHHHHHHHHhhccCCceEccCCceeccccccccceeccChhHHHHHhhCCceEEEEEeCCCcEEEEccc Confidence 9999999999999999999999999999999999999999999999999999999999999999999997 799999999 Q ss_pred cCCCCccccceeehhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhcCCeeeeEEEEccCCCCHHHhh Q lcl|NC_019538. 559 TMSLQPTPFDRINVRRLFNLLKKSISESAKYKLFENNDAFTRNSFRSEVNSYLDSIKSLGGIYDFRVVCDETNNTPAVID 638 (678) Q Consensus 559 T~~~~~~~~~~i~vrR~~~~i~~si~~~~~~~vfepn~~~~~~~v~~~i~~~L~~l~~~gal~g~~V~~d~~~nt~~~i~ 638 (678) |+++++++|+||||||||+|||++|+++++|+||||||+.+|++|+++|+.||++||++|+|.||+|+||+++||++||+ T Consensus 542 T~s~~~s~~~~i~vrR~~~~i~~si~~~~~~~v~e~n~~~l~~~i~~~i~~~L~~l~~~gal~g~~v~~d~~~nt~~~i~ 621 (663) T protein:vir:10 542 MATQVPSPFDRINVRRLFNMLKKNIGDTSKYELFENNDAFTRQSFRMETSQYLDGIRSLGGCYDFRVVCDTTNNTPNVID 621 (663) T ss_pred ccCCCCcccceEehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhcCceeeeEEEEcCCCCCHHHhh Confidence 99999889999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCEEEEEEEEEecCCceEEEEEEEEeecCceeeeccccCC Q lcl|NC_019538. 639 RNEFVATILIKPARSINYVSLNFSAVGTSANFDELVGPQN 678 (678) Q Consensus 639 ~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~~~~~ 678 (678) +|+|+++|+++|++|+|||+|||+|++++++|+|++++++ T Consensus 622 ~G~~~~~i~~~p~~pae~i~~~~~~~~~~~~~~e~~~~~~ 661 (663) T protein:vir:10 622 RNEFVGTIYVKPPRSINYITLNMVATSTGANFDELIGPMQ 661 (663) T ss_pred CCeEEEEEEEEecCCcceEEEEEEEeecCccHHHHHHHHh Confidence 9999999999999999999999999999999999999999 No 4 >protein:vir:6894 Length: 660 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861870;genbank:gi:32453661;genbank:GeneID:1494296 Probab=100.00 E-value=1.4e-162 Score=907.88 Aligned_cols=658 Identities=53% Similarity=0.918 Sum_probs=525.0 Q ss_pred CceecCceEEEEcCCCcccccCCccceeEEecccCCCCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHHhcCCeEEE Q lcl|NC_019538. 1 MALLSPGVESKENNMQTTIARSSTGRAALAGKFQWGPAYQISQLVSETDLIDRFGRPDNQTADSVLSAINFLKYGNDLRT 80 (678) Q Consensus 1 ~~~~~PGVyveEv~~~~~i~~v~tsv~afvG~~~~Gpv~~pv~i~s~~~~~~~FG~~~~~~~~~~~v~~~f~ngG~~~~v 80 (678) |+||||||||||+|++++|+||+||++||||+|+|||+|+|++|+||.||++.||++++.+|++|++++||+|||++||| T Consensus 1 ~~~~~PgVyv~e~~~~~~i~~v~ts~~~fvG~~~~Gp~~~p~~i~s~~~~~~~fG~~~~~~~~~~~~~~~f~~~g~~~~v 80 (660) T protein:vir:68 1 MALLSPGVELKETTVQSTVVNNSTGTAALAGKFQWGPAFQIKQITDEVALVDMFGTPNTDTADYFMSAMNFLQYGNDLRV 80 (660) T ss_pred CccccCceEEEEecCCcccccCCCcceeEEecccCCCCccCEEecCHHHHHHhcCCccCccchhHHHHHHHHhCCCeEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEcCCcccccccccccccceeeeecccccccccceeeeccccccc-ccccccccccccccceeeecccccccccceeecc Q lcl|NC_019538. 81 VRILDEDTARNSSPFFETIDYTITSPGVDYRIGDDVKILQNGATI-TTGKVSALNSVGGITFVRFSTAEVVKKAKELNDY 159 (678) Q Consensus 81 vRv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~a~~~~~a~~~~~~ 159 (678) ||+.+.++.+++....+.+..+...++....+|+.+.+....... ..+....+...+........++.....+.....+ T Consensus 81 vRv~~~~~~~~~~~~~~~~~~t~~~~g~~~~~g~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~ta~~~~~a~~~~~~ 160 (660) T protein:vir:68 81 VRAVDRDTAKNSSPVAGNINFTISSAGTNYRVGDKVVVKYSTDIIEPDGEVTSVDSDGKILNIFIPSGKIIAKAKEIGEY 160 (660) T ss_pred EEecccccccccccccccceeeeeccCcceeeeeeeeeecccccccccccceeeeecCceeeeeeccccccccceeeccc Confidence 999988888888877888888888888877888888776554332 2233333444443333333333333333333333 Q ss_pred cccccceeeeeeecccccccccceeeeeeccccceeeecccccccccccccccccchhccccccccceeeeccccccccc Q lcl|NC_019538. 160 PALQNGWQIQFTSGGPGSGQSATAVLNGIRQDSKIYIRNDEYSRESLLRRDETTETYIDMCESYGIPVVASRYAGLTGDN 239 (678) Q Consensus 160 ~~~~~~~~~~~~s~~~~~g~~a~~~~~~~~~~~~i~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~i~A~~~G~~gn~ 239 (678) +................. ...........+..........++.. .................+.+...|.+++. T Consensus 161 ~~~~~~~~~~v~~~~~~~--~~~~~v~~~~~d~~~~~~~~~ta~~~-----~~~~~~~~~~~~~~~~~~~A~~~g~~G~~ 233 (660) T protein:vir:68 161 PELGSNWTAEMSGSSSGL--SAVITIDSVVMDSGILLTEVETSEEA-----ITSLTFQESIKKYGVPGVVALYPGELGDQ 233 (660) T ss_pred cccccceeEEeecccccc--eeeeeeccccccccceeeeecccccc-----ccccceeeeecccCccccccccccccccc Confidence 333332222222211111 11111112222222222111111111 11111222223344455667888999999 Q ss_pred eeEEeccccccccccccccccccccccccccccccceeeeeccccccccccccccceeeeccCCeeeeeeeeeccccccc Q lcl|NC_019538. 240 IQVAFIAYKDYYKFGVDGKISSVNTVNLKTFPSGLSFGNITPSSYLEYGPQTKDQFAMIVFVGGSAVESRILSVKENDRD 319 (678) Q Consensus 240 i~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~ 319 (678) +++.+............ ............. .....+.........+.+++..++...+++.++...+... T Consensus 234 i~v~~~~~a~~~~~~~~---------~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~ 303 (660) T protein:vir:68 234 LEIEIVSKADYDKGASA---------QLKIYPDGGTRYS-TAKAIFGYGPQTDDQYAIIVRRNDSVVQSVVLSTKRGERD 303 (660) T ss_pred eEEEEeccccccccccc---------cceeeeccccccc-ceeeEeecccccccceeeeeecCCcceeeeeeeccccccc Confidence 98877554443222111 1111111111111 1122223344455677888888899999999888887777 Q ss_pred cccchhhhhhhhcCCcceEEEEecCCCccccceeeeeccCcCCccccchhHHHhhhhhhhccchhccccccccccccCcc Q lcl|NC_019538. 320 IYGSSIYVDEFFINGYSTFIQGVAESWPTEYSGILTFGGGNSGNSTASAGDWIEGWDMFSDREHVDVNLFIAGSCAGEGV 399 (678) Q Consensus 320 ~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~l~gg~dg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 399 (678) ......++.....++.+.++.......+....+..++.||.++....+.++++.+++++...+.+.+.+++++....+.. T Consensus 304 ~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~ 383 (660) T protein:vir:68 304 IYGSNIFIDDFFAKGASNYIFATAQGWPKGFSGVIKLNGGLSSNETVEAGDLMEAWDLFADRESVNAQLFIAGSCAGESL 383 (660) T ss_pred ccccceeeehhhccCcccEEEEeecCCCccccceeeeccccccccccccchhhhHHHHhhhhhccccceeeccccCCCch Confidence 66777777777788888888888888888888889999999999988999999999999999998888899888888888 Q ss_pred cchhHHHHHHHHHHHhcCCeEEEEccccchhccccccCCHHHHHHHHhcccccccchhhccccccceEEEEcCeEEEecc Q lcl|NC_019538. 400 EIASTVQKSVAAICDERQDCLGWISPPREYMVNLPVATAVKKMVEWRRGVTDSGVVVDDNMNIGTTYSSTSANYKLQYDK 479 (678) Q Consensus 400 ~~~~~v~~~l~~~~~~~~~~~~i~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~v~d~ 479 (678) ++..+++.+|++||+++++||+++|+|+.++++.+.+++.+++++||+.... ......+++|+|+++||||++++|+ T Consensus 384 ~~~~~v~~~l~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~s~~~~~~~p~~~~~d~ 460 (660) T protein:vir:68 384 EVASTVQKHVVAIGDSRQDCLVLCSPPRAAVVGIPVNRAVDNLVDWRTASGT---YTDNNFNISSTYAAIDGNYKYQYDK 460 (660) T ss_pred HHHHHHHHHHHHHHHhhCCeEEEEcccceeEecCCCCCCHHHHHHHHhhccc---ccccccccCcceEEEEcCceEEecc Confidence 8888999999999999999999999999999999999999999999975432 3344567899999999999999999 Q ss_pred cCCceeEechHHHHHHHHHHHhhcCCceECcCCcchhheeecccceecCChhhhhhhhhCCcEEEEEecCCcEEEecccc Q lcl|NC_019538. 480 YNDTNRWIPLSADMAGLCARTDTVAQPWMSPAGFNRGQILDVRKLAIETRQAHRDELYQNSMNPVVGFPGQGFILYGDKT 559 (678) Q Consensus 480 ~~~~~~~~pps~~vAG~~a~~d~~~g~~~sPan~~~~~v~~~~~~~~~~~~~e~~~L~~~gIn~i~~~~~~G~~~wG~rT 559 (678) .+++.+++||||++||+|||+|.++||||||+|+++.+|.|+.++++.++++|++.||++||||||+|+++|+++||+|| T Consensus 461 ~~~~~~~~p~sg~~AGl~Ar~d~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT 540 (660) T protein:vir:68 461 YNDVNRWVPLAADIAGLCARTDNISQPWMSPAGYNRGQILNVIKLAIETRQAQRDRLYQEAINPVTGTGGDGYVLYGDKT 540 (660) T ss_pred cCCceEEechhHHHHHHHHHHhccCCcEEccCCeeeceeeccceeeecCChhHHHHHhhCCceEEEEecCCeEEEEccee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCCccccceeehhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhcCCeeeeEEEEccCCCCHHHhhC Q lcl|NC_019538. 560 MSLQPTPFDRINVRRLFNLLKKSISESAKYKLFENNDAFTRNSFRSEVNSYLDSIKSLGGIYDFRVVCDETNNTPAVIDR 639 (678) Q Consensus 560 ~~~~~~~~~~i~vrR~~~~i~~si~~~~~~~vfepn~~~~~~~v~~~i~~~L~~l~~~gal~g~~V~~d~~~nt~~~i~~ 639 (678) +++++++|+||||||||+|||++|+++++|+||||||+.+|++|+++|++||++||++|+|.||+|+||+++||++||++ T Consensus 541 ~~~~~s~~~~i~vrR~~~~i~~si~~~~~~~v~epn~~~~~~~i~~~i~~~L~~l~~~gal~gf~V~~d~~~nt~~~i~~ 620 (660) T protein:vir:68 541 ATSVPSPFDRINVRRLFNMVKTNIGSASKYRLFELNNAFTRSSFRTETSQYLQGIKALGGVYNFKVVCDTTNNTPAVIDR 620 (660) T ss_pred cCCCCcccceEehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhcCceeeeEEEEecCCCCHHHhhC Confidence 99998899999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CEEEEEEEEEecCCceEEEEEEEEeecCceeeeccccCC Q lcl|NC_019538. 640 NEFVATILIKPARSINYVSLNFSAVGTSANFDELVGPQN 678 (678) Q Consensus 640 G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~~~~~ 678 (678) |+|+++|+++|++|+|||+|||+|++++++|+|+++++= T Consensus 621 G~~~~~i~~~p~~pae~i~l~~~~~~~~~~~~e~~~~v~ 659 (660) T protein:vir:68 621 NEFVATFYLQPARSINYITLNFVATATGADFDELIGAVG 659 (660) T ss_pred CeEEEEEEEEecCCcceEEEEEEEeecCccHHHHHHhhc Confidence 999999999999999999999999999999999999999 No 5 >protein:vir:100539 Length: 663 # NCBI annotation: gp18 tail sheath monomer # Family: family:all:661 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656380;genbank:gi:109290131;genbank:GeneID:4156517 Probab=100.00 E-value=4e-162 Score=905.43 Aligned_cols=659 Identities=52% Similarity=0.931 Sum_probs=533.0 Q ss_pred CceecCceEEEEcCCCcccccCCccceeEEecccCCCCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHHhcCCeEEE Q lcl|NC_019538. 1 MALLSPGVESKENNMQTTIARSSTGRAALAGKFQWGPAYQISQLVSETDLIDRFGRPDNQTADSVLSAINFLKYGNDLRT 80 (678) Q Consensus 1 ~~~~~PGVyveEv~~~~~i~~v~tsv~afvG~~~~Gpv~~pv~i~s~~~~~~~FG~~~~~~~~~~~v~~~f~ngG~~~~v 80 (678) |+|+||||||||+|++++|+||+||++||||+|+|||+|+|++|+||.||++.||++++.+|++|+|++||+|||++||| T Consensus 1 ~~~~~Pgvyv~e~~~~~~~~~v~t~~~~fvG~~~~gp~~~p~~i~s~~~~~~~fG~~~~~~~~~~~v~~~f~ngg~~~~v 80 (663) T protein:vir:10 1 MALLSPGIEMKETSINSTVVRSATGRAALVGKFAWGPAYEIRQVTNEVELVDMFGSPDNVTAPYFMSAMNFLQYGNDLRL 80 (663) T ss_pred CccccCceEEEEecCcccccccccccceeeeccccCCCCcCEEecCHHHHHHHcCCcccccchHHHHHHHHHhCCCeEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEcCCcccccccccccccceeeeecccccccccceeeeccccccccc-ccccccccccccceeeecccccccccceeecc Q lcl|NC_019538. 81 VRILDEDTARNSSPFFETIDYTITSPGVDYRIGDDVKILQNGATITT-GKVSALNSVGGITFVRFSTAEVVKKAKELNDY 159 (678) Q Consensus 81 vRv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~a~~~~~a~~~~~~ 159 (678) |||.+.++.++++++++..+.+...++..+.+|+.+.+......... .........++........+.....+...... T Consensus 81 vRv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~a~~~~~a~~~~~~ 160 (663) T protein:vir:10 81 VRVIDMEQAKNASPLFNQIEVTITTEGQGYTVGDTVSIKHNTTTVTEEGKVTKVDADGKIKALFVPSSAVIAKAKQLGTY 160 (663) T ss_pred EecCCcccccccccccccceeeEeecccCccccceeeecccccccccCcceeeeccCCceeEEEeccccccccccccccc Confidence 99999888888999998888888888888889998887665444322 22233333444444333333333333333333 Q ss_pred cccccceeeeeeecccccccccceeeeeeccccceeeecccccccccccccccccchhccccccccceeeeccccccccc Q lcl|NC_019538. 160 PALQNGWQIQFTSGGPGSGQSATAVLNGIRQDSKIYIRNDEYSRESLLRRDETTETYIDMCESYGIPVVASRYAGLTGDN 239 (678) Q Consensus 160 ~~~~~~~~~~~~s~~~~~g~~a~~~~~~~~~~~~i~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~i~A~~~G~~gn~ 239 (678) +.....+............ .......+..+..+.......+..... .............+.+.+...|.+++. T Consensus 161 ~~~~~a~~~~v~~~~~~~~--~a~av~~i~~dg~vt~~~~~~a~~~~~-----~~~~~~~~~~~~~~~~~a~~~g~~G~~ 233 (663) T protein:vir:10 161 PVLGDNWRAEVSGASGGSA--ATLTLGGIVVDSGVTFGNSEEAPDVMT-----STKVLANFAKYGMPLISAVYPGEIGST 233 (663) T ss_pred cccccceeeEEeecccccc--ccceeEeeecCCceeEEeeeccccccc-----cceeeeeccccccceeeeecccccCcc Confidence 3333333322222211111 111122222222222222222221111 111222233445566778889999999 Q ss_pred eeEEeccccccccccccccccccccccccccccccceeeeeccccccccccccccceeeeccCCeeeeeeeeeccccccc Q lcl|NC_019538. 240 IQVAFIAYKDYYKFGVDGKISSVNTVNLKTFPSGLSFGNITPSSYLEYGPQTKDQFAMIVFVGGSAVESRILSVKENDRD 319 (678) Q Consensus 240 i~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~ 319 (678) +.+.+............. ..+.... ...........+.....++.+++..++...+++.++...+... T Consensus 234 i~v~~~~~~~~~~~~~~~-----------v~~~~g~-~~~~~~~~~~~g~~~~~~~~~~~~~~g~~~e~~~ls~~~~~~~ 301 (663) T protein:vir:10 234 VEVEVISKTAFQSGAAQP-----------IYPFGGT-RASNARSVIQYGPMTDDQFAIIVRRDGIVVESTVLSTRRGDRD 301 (663) T ss_pred eeEeecccccccccceee-----------ecccCcc-cccccccccccccccchhhcccccCCCcccceeeeeccccccc Confidence 888765543322211100 0001110 1111122233344555677888888889999999998888877 Q ss_pred cccchhhhhhhhcCCcceEEEEecCCCccccceeeeeccCcCCccccchhHHHhhhhhhhccchhccccccccccccCcc Q lcl|NC_019538. 320 IYGSSIYVDEFFINGYSTFIQGVAESWPTEYSGILTFGGGNSGNSTASAGDWIEGWDMFSDREHVDVNLFIAGSCAGEGV 399 (678) Q Consensus 320 ~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~l~gg~dg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 399 (678) ......++.....++.+.++.......+......+++++|.++....+..+++++++++...+.+++..++++....+++ T Consensus 302 ~~~~~~~~~~~~~~~~s~~v~~~~~~~~~~~~~~~~l~gg~~~~~~~~~~d~~~~~~~l~~~~~~~~~~~i~~~~~~~~~ 381 (663) T protein:vir:10 302 VYGNNIFMDDYFRNGSSNFIYASSVNWPAGFTGIIQLGGGASANNAVGSDELIAGWDLFADREALHVNLMIAGACKSDGV 381 (663) T ss_pred cchhhhhhhhhhcCcccceeEeeccccCcccceeEEecccccCcccchhhhhhhHHhhhccccccCceEEEeecCCCCch Confidence 77777777777778888888887777777777788999999999888999999999999888888888888888888888 Q ss_pred cchhHHHHHHHHHHHhcCCeEEEEccccchhccccccCCHHHHHHHHhcccccccchhhccccccceEEEEcCeEEEecc Q lcl|NC_019538. 400 EIASTVQKSVAAICDERQDCLGWISPPREYMVNLPVATAVKKMVEWRRGVTDSGVVVDDNMNIGTTYSSTSANYKLQYDK 479 (678) Q Consensus 400 ~~~~~v~~~l~~~~~~~~~~~~i~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~v~d~ 479 (678) ++..+|+++|++||+++++||+|+|+|.....+.......+++.+|++.............+++|+|+++||||++++|+ T Consensus 382 ~~~~~v~~~l~~~~~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~l~~p~~~~~d~ 461 (663) T protein:vir:10 382 AVASTVQKHVVALADDRQDCVAFVNPPSELLVGVPTTQAVKNIVEWRNGVTTGGEVVDNNMNISSTYAFISGNYKYQYDK 461 (663) T ss_pred hhHHHHHHHHHHHHHhhCCEEEEEecCcccccccchhhhHHHHHHHhhhccccchhhhhhcccCcceEEEEecceeEecc Confidence 88999999999999999999999999999888888888899999999998888888888899999999999999999999 Q ss_pred cCCceeEechHHHHHHHHHHHhhcCCceECcCCcchhheeecccceecCChhhhhhhhhCCcEEEEEecC-CcEEEeccc Q lcl|NC_019538. 480 YNDTNRWIPLSADMAGLCARTDTVAQPWMSPAGFNRGQILDVRKLAIETRQAHRDELYQNSMNPVVGFPG-QGFILYGDK 558 (678) Q Consensus 480 ~~~~~~~~pps~~vAG~~a~~d~~~g~~~sPan~~~~~v~~~~~~~~~~~~~e~~~L~~~gIn~i~~~~~-~G~~~wG~r 558 (678) .+++.+++||||++||+|||+|.++||||||+|+++++|.|+.++.+.+++.|++.||++|||+|+.|++ +|+++||+| T Consensus 462 ~~~~~~~~p~s~~vAGl~Ar~D~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gin~i~~~~~~~G~~~wG~r 541 (663) T protein:vir:10 462 YNDINRWVPLSADIAGLCAYTDQVGHPWMSPAGYRRGQLRNTIKLAIEPKQSLRDTMYQVSINPVTGFAGGDGFVLFGDK 541 (663) T ss_pred cCCceEEechHHHHHHHHHHhhccCCcEEccCCeeecceeccccceeecCchhHHHHHhCCCcEEEEeeCCCcEEEEccc Confidence 9999999999999999999999999999999999999999999999999999999999999999999997 799999999 Q ss_pred cCCCCccccceeehhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhcCCeeeeEEEEccCCCCHHHhh Q lcl|NC_019538. 559 TMSLQPTPFDRINVRRLFNLLKKSISESAKYKLFENNDAFTRNSFRSEVNSYLDSIKSLGGIYDFRVVCDETNNTPAVID 638 (678) Q Consensus 559 T~~~~~~~~~~i~vrR~~~~i~~si~~~~~~~vfepn~~~~~~~v~~~i~~~L~~l~~~gal~g~~V~~d~~~nt~~~i~ 638 (678) |+++++++|+||||||||+||+++|+++++|+||||||+.+|++|+++|++||++||++|+|.||+|+||+++||++||+ T Consensus 542 T~s~~~s~~~~i~vrR~~~~i~~si~~~~~~~v~epn~~~l~~~i~~~i~~~L~~l~~~gal~gf~V~~d~~~nt~~~i~ 621 (663) T protein:vir:10 542 MATQVPSPFDRINVRRLFNMLKKNIGDTSKYELFENNDAFTRQSFRMEVSQYLDNIRSLGGVYDFRVVCDTTNNTPQVID 621 (663) T ss_pred ccCCCCcccceEehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEcCCCCCHHHhh Confidence 99999889999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCEEEEEEEEEecCCceEEEEEEEEeecCceeeeccccCC Q lcl|NC_019538. 639 RNEFVATILIKPARSINYVSLNFSAVGTSANFDELVGPQN 678 (678) Q Consensus 639 ~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~~~~~ 678 (678) +|+|+++|+++|++|+|||+|||+|++++++|+|+++++| T Consensus 622 ~G~~~~~i~~~p~~pae~I~~~~~~~~~~~~f~e~~~~~~ 661 (663) T protein:vir:10 622 SNEFVATIYIKAPRSINYITLNFVATSTGANFDELIGPAQ 661 (663) T ss_pred CCeEEEEEEEEecCCcceEEEEEEEEecCccHHHHHHHHh Confidence 9999999999999999999999999999999999999999 No 6 >protein:vir:108052 Length: 660 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595294;genbank:gi:161622600;genbank:GeneID:5783771 Probab=100.00 E-value=2.8e-160 Score=895.29 Aligned_cols=658 Identities=51% Similarity=0.910 Sum_probs=520.0 Q ss_pred CceecCceEEEEcCCCcccccCCccceeEEecccCCCCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHHhcCCeEEE Q lcl|NC_019538. 1 MALLSPGVESKENNMQTTIARSSTGRAALAGKFQWGPAYQISQLVSETDLIDRFGRPDNQTADSVLSAINFLKYGNDLRT 80 (678) Q Consensus 1 ~~~~~PGVyveEv~~~~~i~~v~tsv~afvG~~~~Gpv~~pv~i~s~~~~~~~FG~~~~~~~~~~~v~~~f~ngG~~~~v 80 (678) |+|+||||||||+|++++|+||+||++||||+|+|||+++|++|+||.||++.||++++.+|++|++++||+|||++||| T Consensus 1 ~~~~~Pgvyv~e~~~~~~i~~~~t~~~~~vg~~~~gp~~~p~~v~s~~~~~~~fg~~~~~~~~~~~~~~~f~~~g~~~~v 80 (660) T protein:vir:10 1 MALLSPGIELKETSVQSTVVRNATGRAALVGKFQWGPAFQVTQITNEVELVDLFGGPNNEVADYFMSGMNFLQYGNDLRT 80 (660) T ss_pred CceecCceEEEeecCCccccCCCcccceEEeecCCCCCccCeEcCCHHHHHHHcCCcCCCchhHHHHHHHHHhCCceEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEcCCcccccccccccccceeeeecccccccccceeeeccccccccc-ccccccccccccceeeecccccccccceeecc Q lcl|NC_019538. 81 VRILDEDTARNSSPFFETIDYTITSPGVDYRIGDDVKILQNGATITT-GKVSALNSVGGITFVRFSTAEVVKKAKELNDY 159 (678) Q Consensus 81 vRv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~a~~~~~a~~~~~~ 159 (678) ||+.+.+..++++...+.+..++..++..+.+|+.+.+......... +........+........++.....+...... T Consensus 81 vrv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~v~~~~~~~~~~~~v~~~~a~g~~~~~~~~ta~~~~~a~~v~~~ 160 (660) T protein:vir:10 81 VRVVSREFAKNASPIAGNIETTITTAGSNYAVGDKINIKYNQTVVESEGRVTSVDTDGKILSVFIPSAKIIAYARSLNQY 160 (660) T ss_pred EEecccccccccccccccceeEEeeccccccccceeeEeeccccccccccceeeccccceeeeccccccccccccccccc Confidence 99999888888888888899999999988899999887665543322 23333334444444443333333333333333 Q ss_pred cccccceeeeeeecccccccccceeeeeeccccceeeecccccccccccccccccchhccccccccceeeeccccccccc Q lcl|NC_019538. 160 PALQNGWQIQFTSGGPGSGQSATAVLNGIRQDSKIYIRNDEYSRESLLRRDETTETYIDMCESYGIPVVASRYAGLTGDN 239 (678) Q Consensus 160 ~~~~~~~~~~~~s~~~~~g~~a~~~~~~~~~~~~i~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~i~A~~~G~~gn~ 239 (678) +.....+............ ..........+.............. ...............+.+.+...|.+++. T Consensus 161 ~~~~~~~~~~~~~~~~~~~--~a~sv~~~v~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~a~~~g~~G~~ 233 (660) T protein:vir:10 161 PTLGPAWTAEVTSASSGVS--GTITVGKIVTDSGILLTEAENSEEA-----ITSLEFQAALKKFAMPGVVALYPGEIGST 233 (660) T ss_pred cccccceeEEEecccCccc--cceeeeeeeccCcceEEeeeccccc-----cccccceeeccccccceeeeecccccCcc Confidence 3333333222222111111 1111122222222222111111110 01111122223344455677788888888 Q ss_pred eeEEeccccccccccccccccccccccccccccccceeeeeccccccccccccccceeeeccCCeeeeeeeeeccccccc Q lcl|NC_019538. 240 IQVAFIAYKDYYKFGVDGKISSVNTVNLKTFPSGLSFGNITPSSYLEYGPQTKDQFAMIVFVGGSAVESRILSVKENDRD 319 (678) Q Consensus 240 i~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~ 319 (678) +.+.+....+...... ......+.+..... .............+.+..++..++...+++.++...+... T Consensus 234 i~v~i~~~~~~~~~~~---------~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~v~~~g~~~~~~~~~~~~~~~~ 303 (660) T protein:vir:10 234 LEVEIVSKAAYEAGSS---------KMLDVYPGGGTRAS-IAKAVFNYGPQTDDQYAIIVRRDGAIVESVVLSTKEGEKD 303 (660) T ss_pred eeEEEeeccccCCcce---------eEEeeeeccceeeE-EeeeecccccccccccccccccCCcccceeeeeccccccc Confidence 8776654332211100 00111111111111 1111222234455667778888888999999888887777 Q ss_pred cccchhhhhhhhcCCcceEEEEecCCCccccceeeeeccCcCCccccchhHHHhhhhhhhccchhccccccccccccCcc Q lcl|NC_019538. 320 IYGSSIYVDEFFINGYSTFIQGVAESWPTEYSGILTFGGGNSGNSTASAGDWIEGWDMFSDREHVDVNLFIAGSCAGEGV 399 (678) Q Consensus 320 ~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~l~gg~dg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 399 (678) ......+..+...++.+.++.......+.......++.||.++....+.+++..+++++...+.+.+++++.+....... T Consensus 304 ~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~l~gg~~~~~~~t~~~~~~~~~~~~~~~~~~~~~l~~p~~~~~~~ 383 (660) T protein:vir:10 304 VYGNNIYLDDYFAKGTSNYIYATSLNWPKGFSGIINLSGGISANDKVTAGDLMQGWDLFADREALHINLLIAGAVAGEGD 383 (660) T ss_pred cccceeeeehhhcCCCccEEEEEeccCCCCcccceeeeccccCccccccchhhhhhhhhhhhhhcccceEEEcCcCCCch Confidence 66666777777788888888888777777778889999999998888889999999999988888888888777666556 Q ss_pred cchhHHHHHHHHHHHhcCCeEEEEccccchhccccccCCHHHHHHHHhcccccccchhhccccccceEEEEcCeEEEecc Q lcl|NC_019538. 400 EIASTVQKSVAAICDERQDCLGWISPPREYMVNLPVATAVKKMVEWRRGVTDSGVVVDDNMNIGTTYSSTSANYKLQYDK 479 (678) Q Consensus 400 ~~~~~v~~~l~~~~~~~~~~~~i~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~v~d~ 479 (678) .+.++|+++|++||+++++||+++|+|....++.....+.+++.+||+..... .....+++|+|+++||||++++|+ T Consensus 384 ~~~~~v~~al~~~~~~~~~~~aiid~p~~~~~~~~~~~~~~~~~~~r~~~~~~---~~~~~~~~s~~~~~~~p~~~~~d~ 460 (660) T protein:vir:10 384 EVASTVQKHVVSIADERQDCLAFISPPKGLLVNVPLTRAVDNLIDWRTGAGTF---DANNMNISTTYAAIDGNYKYQYDK 460 (660) T ss_pred hhhHHHHHHHHHHHHhhCCEEEEEecCcccccccccccCHHHHHHHHhhcccc---cccccccCcceEEEEcCceEEecc Confidence 67788999999999999999999999999989999999999999999854322 233567899999999999999999 Q ss_pred cCCceeEechHHHHHHHHHHHhhcCCceECcCCcchhheeecccceecCChhhhhhhhhCCcEEEEEecC-CcEEEeccc Q lcl|NC_019538. 480 YNDTNRWIPLSADMAGLCARTDTVAQPWMSPAGFNRGQILDVRKLAIETRQAHRDELYQNSMNPVVGFPG-QGFILYGDK 558 (678) Q Consensus 480 ~~~~~~~~pps~~vAG~~a~~d~~~g~~~sPan~~~~~v~~~~~~~~~~~~~e~~~L~~~gIn~i~~~~~-~G~~~wG~r 558 (678) .+++.+++|||+++||+|||+|.++||||||+|+++++|.|+.++++.+++.|++.||++|||||++|++ +|+++||+| T Consensus 461 ~~~~~~~~p~sg~~AGl~Ar~D~~~g~~~sPan~~~~~i~g~~~~~~~~~~~e~~~Ln~~gIn~i~~~~~~~G~~~wG~r 540 (660) T protein:vir:10 461 YNDVNRWVPLAADLAGLCARTDDVSQPWMSPAGYNRGQILNVLKLAIEPRQAQRDRMYQEAINPVVGFAGGDGFVLFGDK 540 (660) T ss_pred cCCceeEechhHHHHHHHHHhhccCCcEEccCCeeeceeeccceeeecCChhhHHhHhhCCceEEEEeeCCCcEEEEccc Confidence 9999999999999999999999999999999999999999999999999999999999999999999997 799999999 Q ss_pred cCCCCccccceeehhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhcCCeeeeEEEEccCCCCHHHhh Q lcl|NC_019538. 559 TMSLQPTPFDRINVRRLFNLLKKSISESAKYKLFENNDAFTRNSFRSEVNSYLDSIKSLGGIYDFRVVCDETNNTPAVID 638 (678) Q Consensus 559 T~~~~~~~~~~i~vrR~~~~i~~si~~~~~~~vfepn~~~~~~~v~~~i~~~L~~l~~~gal~g~~V~~d~~~nt~~~i~ 638 (678) |+++++++|+||||||||+||+++|+++++|+||||||+.||++|+.+|+.||++||++|+|.||+|+||+++||++||+ T Consensus 541 T~~~~~s~~~~i~vrR~~~~i~~si~~~~~~~v~epn~~~l~~~i~~~i~~fL~~l~~~gal~g~~V~~d~~~nt~~di~ 620 (660) T protein:vir:10 541 TATKVPSPMDHINVRRLFNMLKKNIGDASKYKLFELNDNFTRSSFRMEVSQYLDGIKALGGIYEGRVVCDTTVNTPAVID 620 (660) T ss_pred ccCCCCcccceEehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEcCCCCCHHHhh Confidence 99999989999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCEEEEEEEEEecCCceEEEEEEEEeecCceeeeccccCC Q lcl|NC_019538. 639 RNEFVATILIKPARSINYVSLNFSAVGTSANFDELVGPQN 678 (678) Q Consensus 639 ~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~~~~~ 678 (678) +|+|+|+|+++|++|||||+|||+|++++++|+|+++++= T Consensus 621 ~G~~~~~i~~~P~~pae~I~~~~~~~~~~~~~~e~~~~~~ 660 (660) T protein:vir:10 621 RNEFIANIYVKPARSINYITLNFVATSTGADFDELIGPLV 660 (660) T ss_pred CCeEEEEEEEEecCCccEEEEEEEEeecCccHHHHhhhcC Confidence 9999999999999999999999999999999999999999 No 7 >protein:vir:98263 Length: 664 # NCBI annotation: gp18 tail sheath monomer # Family: family:all:661 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239196;genbank:gi:66391671;genbank:GeneID:3416365 Probab=100.00 E-value=2.2e-159 Score=890.39 Aligned_cols=658 Identities=52% Similarity=0.894 Sum_probs=513.3 Q ss_pred CceecCceEEEEcCCCcccccCCccceeEEecccCCCCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHHhcCCeEEE Q lcl|NC_019538. 1 MALLSPGVESKENNMQTTIARSSTGRAALAGKFQWGPAYQISQLVSETDLIDRFGRPDNQTADSVLSAINFLKYGNDLRT 80 (678) Q Consensus 1 ~~~~~PGVyveEv~~~~~i~~v~tsv~afvG~~~~Gpv~~pv~i~s~~~~~~~FG~~~~~~~~~~~v~~~f~ngG~~~~v 80 (678) |+||||||||||+|++++|+||+||++||||+|+|||+|+|++|+||.||++.||++++.+|++|+|++||+|||++||| T Consensus 1 ma~~~PgVyv~E~~~~~~i~~~~ts~~~~vG~~~~Gp~~~p~~i~s~~d~~~~fG~~~~~~~~~~~v~~~f~ngg~~~~v 80 (664) T protein:vir:98 1 MALQSPGIETKETSVQSTVVRNSTGRAAIVGKFSWGPAYQIRQISNEVELVNYFGAPDNLTADYFMSAVNFLQYGNDLRL 80 (664) T ss_pred CceecCceEEEecCCCcccccccccceEEEeeccCCCCCccEEecCHHHHHHhcCCccccchhHHHHHHHHHhcCCeEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEcCCcccccccccccccceeeeecccccccccceeeecccccccccc-cccccccccccceeeecccccccc--cceee Q lcl|NC_019538. 81 VRILDEDTARNSSPFFETIDYTITSPGVDYRIGDDVKILQNGATITTG-KVSALNSVGGITFVRFSTAEVVKK--AKELN 157 (678) Q Consensus 81 vRv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~a~~~~~--a~~~~ 157 (678) |||.+.++.++++.+.+.+..+...++.....|+.+.+.+........ ........++.............. +.... T Consensus 81 vRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~gn~~~v~i~~~~~~~~~~~~~~~ 160 (664) T protein:vir:98 81 VRVVDKDAAKNASAIFNQIKTTIASQGSNYNVGDVIKVKYSNTVVEENGKVTQVDSNGKILAVTIPKRKKSLLVLNRSVL 160 (664) T ss_pred EEecCccccccccccccccceeeccCCcccccccceeEeecCcccccceeeccCCCCCceeeEeeccCccceeecccccc Confidence 999998888888888888888888888777788888876655443322 112222233322222111110000 00000 Q ss_pred cccccccceeeeeeecccccccccceeeeeeccccceeeecccccccccccccccccchhccccccccceeeeccccccc Q lcl|NC_019538. 158 DYPALQNGWQIQFTSGGPGSGQSATAVLNGIRQDSKIYIRNDEYSRESLLRRDETTETYIDMCESYGIPVVASRYAGLTG 237 (678) Q Consensus 158 ~~~~~~~~~~~~~~s~~~~~g~~a~~~~~~~~~~~~i~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~i~A~~~G~~g 237 (678) ...... ....... ....+............+............... ..............+.+.+..+|.++ T Consensus 161 ~~~~~~-~~~~s~~--~~s~g~a~a~~v~~v~~d~~~~~~~~~~a~~~i-----~~~~~~~~~~~~~~~~~~a~~~G~~G 232 (664) T protein:vir:98 161 TQIFLL-VGTTEIV--SQSSGVSASITIDGIESDSGITLLNLDIAKETI-----QGTSFQTLTQKYQIPSVVALYPGELG 232 (664) T ss_pred ccccee-cccceee--eeecccceeeecccccccceeeccccceeeecc-----ccccceeeeeccccceeeeeeccccc Confidence 000000 0000000 000111111111122222222211111111110 11111222233445667788899999 Q ss_pred cceeEEeccccccccccccccccccccccccccccccceeeeeccccccccccccccceeeeccCCeeeeeeeeeccccc Q lcl|NC_019538. 238 DNIQVAFIAYKDYYKFGVDGKISSVNTVNLKTFPSGLSFGNITPSSYLEYGPQTKDQFAMIVFVGGSAVESRILSVKEND 317 (678) Q Consensus 238 n~i~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~ 317 (678) +.+++.+....+....... ........ ............+...+++.+.+..++.+.+++.++...+. T Consensus 233 n~isv~i~s~~~~~~~~~i-----------~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~ 300 (664) T protein:vir:98 233 STVQVEIISKAAYDTGAMI-----------SGYPSGIS-VKNSGRSVMTYGPQTDNQYAFVVRRGGIVQESFIVSTDKTD 300 (664) T ss_pred ceeeeeecccccccCcceE-----------eeccCcee-cccceeeeeeccccCccceeEEEecCCceeeeEEeecccCc Confidence 9998877654433221110 00000000 11111223333455567778888899999999999988888 Q ss_pred cccccchhhhhhhhcCCcceEEEEecCCCccccceeeeeccCcCCccccchhHHHhhhhhhhccchhccccccccccccC Q lcl|NC_019538. 318 RDIYGSSIYVDEFFINGYSTFIQGVAESWPTEYSGILTFGGGNSGNSTASAGDWIEGWDMFSDREHVDVNLFIAGSCAGE 397 (678) Q Consensus 318 ~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~l~gg~dg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 397 (678) .+......+......++.+.++.+.....+........+.||.+....++..+...++.++...+++.+++++++..... T Consensus 301 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~~~~~~g~~~~~tgl~~l~~~~~~~~~ll~~p~~~~~ 380 (664) T protein:vir:98 301 KDIYGVNIYMDDFFANGGSQYVFGTSMNWPKGFSGILEFGGGLSSNDTVGADELMTGWDMFADREALHVPLLIAGGCAGE 380 (664) T ss_pred ccceeeeeechhheecccceeeeeecccCCcccceeEeccCccccccccCchhHHHHHHhhhcccccccceEEecCCCCC Confidence 88777777777777788888888888788877788889999998776677777888899888888888888887776655 Q ss_pred cccchhHHHHHHHHHHHhcCCeEEEEccccchhccccccCCHHHHHHHHhccccc-ccchhhccccccceEEEEcCeEEE Q lcl|NC_019538. 398 GVEIASTVQKSVAAICDERQDCLGWISPPREYMVNLPVATAVKKMVEWRRGVTDS-GVVVDDNMNIGTTYSSTSANYKLQ 476 (678) Q Consensus 398 ~~~~~~~v~~~l~~~~~~~~~~~~i~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~s~~~~~~~p~~~v 476 (678) ..+...+++.+|++||+++++||+++|+|+..+++.+..++.+++++|++..... ........+++|+|+++||||+++ T Consensus 381 ~~~~~~~v~~al~~~a~~~~~~~a~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~l~~p~~~~ 460 (664) T protein:vir:98 381 SVEIASTVQKHVISIGDERQDCTVFVSPPRSLLVNIPLATAVDNIVEWRTGYKISGGTPVDNNLNVSSSYGFLDGNYKYQ 460 (664) T ss_pred cHHHHHHHHHHHHHHHHhcCCeEEEEccccceeccCCccccHHHHHHHhhhccccccchhhhhcCCccceEEEEcCeEEE Confidence 5566778999999999999999999999999999999999999999999876543 334555678999999999999999 Q ss_pred ecccCCceeEechHHHHHHHHHHHhhcCCceECcCCcchhheeecccceecCChhhhhhhhhCCcEEEEEecC-CcEEEe Q lcl|NC_019538. 477 YDKYNDTNRWIPLSADMAGLCARTDTVAQPWMSPAGFNRGQILDVRKLAIETRQAHRDELYQNSMNPVVGFPG-QGFILY 555 (678) Q Consensus 477 ~d~~~~~~~~~pps~~vAG~~a~~d~~~g~~~sPan~~~~~v~~~~~~~~~~~~~e~~~L~~~gIn~i~~~~~-~G~~~w 555 (678) +|+.+++.+++||||++||+|||+|.++||||||+|+++.+|.|+.++.+.+++.|++.||++|||||+.|++ +|+++| T Consensus 461 ~d~~~~~~~~~p~sg~~AGl~A~~D~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~~G~~~w 540 (664) T protein:vir:98 461 YDKYNDVNRWVPLAGDIAGLCVYTDSVANPWMSPAGYNRGQIRNCIKLAIEPRTAHRDAMYQVQINPVTGFAGGSGFVLY 540 (664) T ss_pred ecccCCceEEechHHHHHHHHHHhhhcCCcEECcCCceeeeeeccccceeecChhhHHHHHhCCCeEEEEeeCCCcEEEE Confidence 9999999999999999999999999999999999999999999999999999999999999999999999998 799999 Q ss_pred ccccCCCCccccceeehhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhcCCeeeeEEEEccCCCCHH Q lcl|NC_019538. 556 GDKTMSLQPTPFDRINVRRLFNLLKKSISESAKYKLFENNDAFTRNSFRSEVNSYLDSIKSLGGIYDFRVVCDETNNTPA 635 (678) Q Consensus 556 G~rT~~~~~~~~~~i~vrR~~~~i~~si~~~~~~~vfepn~~~~~~~v~~~i~~~L~~l~~~gal~g~~V~~d~~~nt~~ 635 (678) |+||+++++++|+||||||||+||+++|+++++|+||||||+.+|++|+.+|+.||++||++|+|.||+|+||+++||++ T Consensus 541 G~rT~~~~~s~~~~i~vrR~~~~i~~si~~~~~~~v~epn~~~l~~~i~~~i~~~L~~l~~~gal~g~~V~~d~~~nt~~ 620 (664) T protein:vir:98 541 GDKTLTSVPSPFDRINVRRLFNMIKKDIGDNAKYKLFENNDDFTRASFRMDTGQYMTNIRALGGCYDYRVICDTTNNTPD 620 (664) T ss_pred cccccCCCCcccceEeehhHHHHHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHHHHHHHhcCceeeeEEEEcCCCCCHH Confidence 99999999889999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HhhCCEEEEEEEEEecCCceEEEEEEEEeecCceeeeccccCC Q lcl|NC_019538. 636 VIDRNEFVATILIKPARSINYVSLNFSAVGTSANFDELVGPQN 678 (678) Q Consensus 636 ~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~~~~~ 678 (678) ||++|+|+++|+++|++|||||+|||+|++++++|+|+++++= T Consensus 621 ~i~~G~~~~~i~~~p~~pae~I~~~~~q~~~~~~~~e~~~~~~ 663 (664) T protein:vir:98 621 VIDRNEFVATVYVKPPRSINYITLNFVATSTGADFDELVGPQA 663 (664) T ss_pred HhhCCeEEEEEEEEecCCcceEEEEEEEeecCcchhHhccccc Confidence 9999999999999999999999999999999999999999999 No 8 >protein:vir:6594 Length: 666 # NCBI annotation: tail sheath protein # Family: family:all:661 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891725;genbank:gi:33620670;genbank:GeneID:1725344 Probab=100.00 E-value=1.9e-159 Score=890.81 Aligned_cols=655 Identities=52% Similarity=0.924 Sum_probs=504.9 Q ss_pred CceecCceEEEEcCCCcccccCCccceeEEecccCCCCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHHhcCCeEEE Q lcl|NC_019538. 1 MALLSPGVESKENNMQTTIARSSTGRAALAGKFQWGPAYQISQLVSETDLIDRFGRPDNQTADSVLSAINFLKYGNDLRT 80 (678) Q Consensus 1 ~~~~~PGVyveEv~~~~~i~~v~tsv~afvG~~~~Gpv~~pv~i~s~~~~~~~FG~~~~~~~~~~~v~~~f~ngG~~~~v 80 (678) |+||||||||||+|++++|+||+||++||||+|+|||+|+|++|+||.||+++||++++.+|++|++++||+|||++||| T Consensus 1 ~~~~~PgVyv~e~~~~~~i~~v~ts~~~fvG~~~~Gp~~~p~~v~s~~~~~~~fG~~~~~~~~~~~~~~~f~ngg~~~~v 80 (666) T protein:vir:65 1 MTLLSPGFETKETTLSTTIVQSETGRAALVGKFQWGPAFQIIQVTNEVELVNKFGQPDNNTADYFMSGANFLQYGNDLRV 80 (666) T ss_pred CceecCceEEEEecCcccccccCcccceEEecccCCCCccCEEecCHHHHHHHcCCccccchhHHHHHHHHHhcCceEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEcCCcccccccccccccceeeeecccccccccceeeecccccccccc-cccccccccccceeeecccccccccceeecc Q lcl|NC_019538. 81 VRILDEDTARNSSPFFETIDYTITSPGVDYRIGDDVKILQNGATITTG-KVSALNSVGGITFVRFSTAEVVKKAKELNDY 159 (678) Q Consensus 81 vRv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~a~~~~~a~~~~~~ 159 (678) ||+.+.+..+++..+++....+...++..+.+|+.+.+.+........ ........+.......++............. T Consensus 81 vrv~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~~~~V~~~~~~~~~~~~~~~~~~~~~~~g~~~~t~~~~~~~~~~g~~ 160 (666) T protein:vir:65 81 VRVLNKEKAKNATALAGNVEFEITNEGSNYEVGDTIKIKHNRQDIETAGKVTKVDGDGKVKGVFIPTGKIIAHAKAIGVY 160 (666) T ss_pred EEccCcccccccccccCceeeeEeeccccccccceEEEEecccccccccccccccccccccccccccceeeccccccCcc Confidence 999998888888888888888888888888889999887766544332 1222222222222222222222111111111 Q ss_pred cccccceeeeeeecccccccccceeeeeeccccceeeecccccccccccccccccchhccccccccceeeeccccccccc Q lcl|NC_019538. 160 PALQNGWQIQFTSGGPGSGQSATAVLNGIRQDSKIYIRNDEYSRESLLRRDETTETYIDMCESYGIPVVASRYAGLTGDN 239 (678) Q Consensus 160 ~~~~~~~~~~~~s~~~~~g~~a~~~~~~~~~~~~i~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~i~A~~~G~~gn~ 239 (678) +.....+....... ..+............++.+............ +...+.........+...+...|.+++. T Consensus 161 ~~l~~~~~~~~~~~--~~~~~~a~sv~~~~~~g~~~~~~~~~a~~~~-----~~~~~~~~~~~~~~~a~~A~~~g~~g~~ 233 (666) T protein:vir:65 161 PELDGGWTAEFTSS--SGNGSAALSVTKIVTDSGLLLTDLETSRANI-----TNQTFLTKLKKYDMPAVSAIYAGEIGNS 233 (666) T ss_pred eeEeeccceeeccc--Ccccccceeeeecccccceeeeeeccccccc-----ccccccccccccccceeeeeeccccccc Confidence 11111111111111 1111111112222222222222211111110 1111111122233445667788889988 Q ss_pred eeEEeccccccccccccccccccccccccccccccceeeeeccccccccccccccceeeeccCCeeeeeeeeeccccccc Q lcl|NC_019538. 240 IQVAFIAYKDYYKFGVDGKISSVNTVNLKTFPSGLSFGNITPSSYLEYGPQTKDQFAMIVFVGGSAVESRILSVKENDRD 319 (678) Q Consensus 240 i~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~ 319 (678) +++.+............ ......... .......+.........+.+++..++...|+|.++...+... T Consensus 234 i~v~i~~~~~~~~~~~~----------l~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~v~~~g~~~e~~~~~~~~~~~~ 301 (666) T protein:vir:65 234 LEVEILARSAFKNTAPD----------LTMYPYGGE--RTAARNLIPYAPQNDNQYAFIVRRDGVVVESYVLSTLKGDKD 301 (666) T ss_pred eeEEeeccccccccccc----------ccccccccc--cccceeeecccccccccceeeeecCCcccceeecccCccccc Confidence 88766544333221110 000000000 001111222233445667888888999999999888888877 Q ss_pred cccchhhhhhhhcCCcceEEEEecCCCccccceeeeeccCcCCcccc--------chhHHHhhhhhhhccchhccccccc Q lcl|NC_019538. 320 IYGSSIYVDEFFINGYSTFIQGVAESWPTEYSGILTFGGGNSGNSTA--------SAGDWIEGWDMFSDREHVDVNLFIA 391 (678) Q Consensus 320 ~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~l~gg~dg~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~ 391 (678) ......+..++..++.+.++..............+++.+|.++.... ..++..++++++...+...+++++. T Consensus 302 ~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~l~~ 381 (666) T protein:vir:65 302 VYGNSIYMDDFFARGSSQYIYATAQGWVDGFSGIISLAGGVSANEATTGGVGADPFIGAMMQGWDLFAERESIHVNLLIA 381 (666) T ss_pred ccchhhhhhhhhcccccceeeeecccccccccceEEccCCCCcCcccccccccccccccHHHHHHHHhhhhhccCCceee Confidence 77777788888788888888777665555566678888888865432 2244567788888887778888877 Q ss_pred cccccCcccchhHHHHHHHHHHHhcCCeEEEEccccchhccccccCCHHHHHHHHhcccccccchhhccccccceEEEEc Q lcl|NC_019538. 392 GSCAGEGVEIASTVQKSVAAICDERQDCLGWISPPREYMVNLPVATAVKKMVEWRRGVTDSGVVVDDNMNIGTTYSSTSA 471 (678) Q Consensus 392 ~~~~~~~~~~~~~v~~~l~~~~~~~~~~~~i~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~ 471 (678) +....++ ++..+++.+|++||+++++||+++|+|+..+++.++.++.+++++||+..... .....+++|+|+++|| T Consensus 382 p~~~~~~-~~~~~v~~~l~~~~~~~~~~~a~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~s~~~~~~~ 457 (666) T protein:vir:65 382 GACAGEG-DAFSTVQKHAVSIGDERQDCLVMVSPPRSTVVNIPVTTAIDNLIAWREGSGNY---NENNMNINTTYAVIDG 457 (666) T ss_pred cCcCCcc-chhHHHHHHHHHHHhhccceEEEeccccceeeecCCCCCHHHHHHHHHhcccc---cccccccCcceEEEEc Confidence 7665544 35678999999999999999999999999999999999999999999876532 2345678999999999 Q ss_pred CeEEEecccCCceeEechHHHHHHHHHHHhhcCCceECcCCcchhheeecccceecCChhhhhhhhhCCcEEEEEecCCc Q lcl|NC_019538. 472 NYKLQYDKYNDTNRWIPLSADMAGLCARTDTVAQPWMSPAGFNRGQILDVRKLAIETRQAHRDELYQNSMNPVVGFPGQG 551 (678) Q Consensus 472 p~~~v~d~~~~~~~~~pps~~vAG~~a~~d~~~g~~~sPan~~~~~v~~~~~~~~~~~~~e~~~L~~~gIn~i~~~~~~G 551 (678) ||++++|+.+++.+++|||+++||+|||+|.++||||||+|+++.+|.|+.++++.++++|++.||++|||||++|+++| T Consensus 458 p~~~~~d~~~~~~~~~p~sg~vAGl~Ar~D~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G 537 (666) T protein:vir:65 458 NYKYQYDKYNDVNRWVPLAADIAGLCARTDAVSQPWMSPAGYNRGQIMNVVKLAIEPRKAHRDRLYQAAINPVIGAGGEG 537 (666) T ss_pred CceEEecccCCceeEechHHHHHHHHHHHhccCCcEEccCCeecceeeccccceeecChhHHHhhhhCCceEEEEeCCCe Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEeccccCCCCccccceeehhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhcCCeeeeEEEEccCC Q lcl|NC_019538. 552 FILYGDKTMSLQPTPFDRINVRRLFNLLKKSISESAKYKLFENNDAFTRNSFRSEVNSYLDSIKSLGGIYDFRVVCDETN 631 (678) Q Consensus 552 ~~~wG~rT~~~~~~~~~~i~vrR~~~~i~~si~~~~~~~vfepn~~~~~~~v~~~i~~~L~~l~~~gal~g~~V~~d~~~ 631 (678) +++||+||+++++++|+||||||||+|||++|+++++|+||||||+.+|++|+++|+.||++||++|+|.||+|+||+++ T Consensus 538 ~~~wG~rT~~~~~s~~~~i~vrR~~~~i~~si~~~~~~~v~epn~~~l~~~i~~~i~~~L~~l~~~gal~g~~V~~d~~~ 617 (666) T protein:vir:65 538 FILMGDKTATTVPSPFDRINVRRLFNMLKKNIGDSSKYKLFENNDNFTRASFRMEVSQYLSTIRSLGGIYDFRVQCDTTN 617 (666) T ss_pred EEEEecccCCCCCcccceEehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEcCCC Confidence 99999999999988999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCHHHhhCCEEEEEEEEEecCCceEEEEEEEEeecCceeeeccccCC Q lcl|NC_019538. 632 NTPAVIDRNEFVATILIKPARSINYVSLNFSAVGTSANFDELVGPQN 678 (678) Q Consensus 632 nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~~~~~ 678 (678) ||+++|++|+|+++|+++|++|||||+|||+|++++++|+|+++++| T Consensus 618 nt~~~i~~G~~~~~i~~~p~~pae~i~~~~~~~~~~~~~~e~~~~~~ 664 (666) T protein:vir:65 618 NTPDVIDRNEFVASMFIKPAKSINYIMLNFTAVATGSDFDEIIGPAN 664 (666) T ss_pred CCHHHhhCCeEEEEEEEEecCCcceEEEEEEEeecCccHHHHHHHHh Confidence 99999999999999999999999999999999999999999999999 No 9 >protein:vir:80984 Length: 666 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469499;genbank:gi:157311456;genbank:GeneID:5602117 Probab=100.00 E-value=2.1e-159 Score=890.47 Aligned_cols=655 Identities=52% Similarity=0.915 Sum_probs=504.3 Q ss_pred CceecCceEEEEcCCCcccccCCccceeEEecccCCCCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHHhcCCeEEE Q lcl|NC_019538. 1 MALLSPGVESKENNMQTTIARSSTGRAALAGKFQWGPAYQISQLVSETDLIDRFGRPDNQTADSVLSAINFLKYGNDLRT 80 (678) Q Consensus 1 ~~~~~PGVyveEv~~~~~i~~v~tsv~afvG~~~~Gpv~~pv~i~s~~~~~~~FG~~~~~~~~~~~v~~~f~ngG~~~~v 80 (678) |+||||||||||+|++++|++|+||++||||+|+|||+|+|++|+||.||+++||++.+.+|++|++++||+|||++||| T Consensus 1 ~~~~~Pgvyv~e~~~~~~~~~~~t~~~~~vg~~~~gp~~~p~~i~~~~~~~~~fg~~~~~~~~~~~~~~~f~~~g~~~~v 80 (666) T protein:vir:80 1 MTLLSPGFETKETTLSTTIVQSATGRAALVGKFQWGPAFQIIQVTNEVELVNKFGQPDNNTADYFMSGANFLQYGNDLRV 80 (666) T ss_pred CceecCceEEEEecCCccccccCcccceEEeccccCCCccceEecCHHHHHHhcCCccCccchHHHHHHHHhcCCCeEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEcCCcccccccccccccceeeeecccccccccceeeeccccccccc-ccccccccccccceeeecccccccccceeecc Q lcl|NC_019538. 81 VRILDEDTARNSSPFFETIDYTITSPGVDYRIGDDVKILQNGATITT-GKVSALNSVGGITFVRFSTAEVVKKAKELNDY 159 (678) Q Consensus 81 vRv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~a~~~~~a~~~~~~ 159 (678) |||.+.++.+++....+.+......++....+++.+.+.+....... .........+...........+...+...... T Consensus 81 ~R~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~i~~~~~~~~~~~~ta~~~~~a~~~~~~ 160 (666) T protein:vir:80 81 VRVLNKEKAKNATALAGNIEFEITNEGSNYEVGDTIKIKHNRQDIETAGKVTKVDGDGKVKGVFIPTGKIIAHAKAIGVY 160 (666) T ss_pred EEecCccccccccccccceeEEEeeccccccccccccccccCcccccCcceEEEeecceeeeeecchhhhcccccccccc Confidence 99999888888888888888888888877777877776655443322 22222233333333333333332222222222 Q ss_pred cccccceeeeeeecccccccccceeeeeeccccceeeecccccccccccccccccchhccccccccceeeeccccccccc Q lcl|NC_019538. 160 PALQNGWQIQFTSGGPGSGQSATAVLNGIRQDSKIYIRNDEYSRESLLRRDETTETYIDMCESYGIPVVASRYAGLTGDN 239 (678) Q Consensus 160 ~~~~~~~~~~~~s~~~~~g~~a~~~~~~~~~~~~i~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~i~A~~~G~~gn~ 239 (678) +.....+....... ..+............+............ ...+...+.........+...+...|.+++. T Consensus 161 ~~v~~~~~~~~~~~--~~~~~~a~~V~~~~~~~~~~~~~~~~a~-----~~~t~~~~~~~~~~~~~~a~~a~~~g~~g~~ 233 (666) T protein:vir:80 161 PELDGDWTAEFTSS--SGNGSAALSVTKIVTDSGLLLTDLETSR-----ANITNQTFLTKLQKYDMPAVSAIYAGEIGNS 233 (666) T ss_pred ceeeccceeeeccc--cccceeeeeeeeeecCCccceeeecccc-----ccccccccccccccccchhhhhhcccccccc Confidence 22222222111111 1111111112122222222211111110 0111122222333344555667788888888 Q ss_pred eeEEeccccccccccccccccccccccccccccccceeeeeccccccccccccccceeeeccCCeeeeeeeeeccccccc Q lcl|NC_019538. 240 IQVAFIAYKDYYKFGVDGKISSVNTVNLKTFPSGLSFGNITPSSYLEYGPQTKDQFAMIVFVGGSAVESRILSVKENDRD 319 (678) Q Consensus 240 i~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~ 319 (678) +++.+............ ....+... .................++.+++...+.++|+|.++...+... T Consensus 234 l~v~i~~~~~~~~~~~~----------~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~e~~~~~~~~~~~~ 301 (666) T protein:vir:80 234 LEVEILARSAFKNTAPD----------LTMYPYGG--ERTAARNLIPYAPQNDNQYAFIVRRDGVVVESYVLSTLKGDKD 301 (666) T ss_pred eeeeecccccccccccc----------ceeeeccc--cccccceeeeeccccccceeeEeccCCccceeeeccccccccc Confidence 87765443322111000 00000000 0011112222234455677888889999999999988888887 Q ss_pred cccchhhhhhhhcCCcceEEEEecCCCccccceeeeeccCcCCcccc--------chhHHHhhhhhhhccchhccccccc Q lcl|NC_019538. 320 IYGSSIYVDEFFINGYSTFIQGVAESWPTEYSGILTFGGGNSGNSTA--------SAGDWIEGWDMFSDREHVDVNLFIA 391 (678) Q Consensus 320 ~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~l~gg~dg~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~ 391 (678) ......++.+++.++.+.++...............++.+|.++.... ..+++.+++++++..+.+.++++++ T Consensus 302 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~l~~ 381 (666) T protein:vir:80 302 VYGNSIYMDDFFGRGSSQYIYATAQGWVDGFSGIISLAGGVSANEATTGGVGADPFIGAMMQGWGLFAERESIHVNLLIA 381 (666) T ss_pred ccchhhhhhhhhccccceeeeecccccccccceEEEecCCCCcccccccccccccccccchhhhhhhhhhcccccceEee Confidence 77778888888778888877776666666666777888887754421 2234556777888888888888887 Q ss_pred cccccCcccchhHHHHHHHHHHHhcCCeEEEEccccchhccccccCCHHHHHHHHhcccccccchhhccccccceEEEEc Q lcl|NC_019538. 392 GSCAGEGVEIASTVQKSVAAICDERQDCLGWISPPREYMVNLPVATAVKKMVEWRRGVTDSGVVVDDNMNIGTTYSSTSA 471 (678) Q Consensus 392 ~~~~~~~~~~~~~v~~~l~~~~~~~~~~~~i~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~ 471 (678) +....++. +...++.+|++||+++++||+++|+|+..+++.++.++++++++||+.... ......+++|+|+++|| T Consensus 382 p~~~~~~~-~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~vd~~~~~~~~~~~~~~~~~~~---~~~~~~~~~s~~~~l~~ 457 (666) T protein:vir:80 382 GACAGEGD-AFSTVQKHAVSIGDERQDCLVMVSPPRSTVVNIPVTTAIDNLIAWREGSGN---YNENNMNINTTYAVIDG 457 (666) T ss_pred cCcCCccc-chHHHHHHHHHHHHhhcceEEEeecceeEEeecCCCCCHHHHHHHHHhccc---chhhhcccCcceEEEEc Confidence 77665543 457899999999999999999999999999999999999999999986533 23345678999999999 Q ss_pred CeEEEecccCCceeEechHHHHHHHHHHHhhcCCceECcCCcchhheeecccceecCChhhhhhhhhCCcEEEEEecCCc Q lcl|NC_019538. 472 NYKLQYDKYNDTNRWIPLSADMAGLCARTDTVAQPWMSPAGFNRGQILDVRKLAIETRQAHRDELYQNSMNPVVGFPGQG 551 (678) Q Consensus 472 p~~~v~d~~~~~~~~~pps~~vAG~~a~~d~~~g~~~sPan~~~~~v~~~~~~~~~~~~~e~~~L~~~gIn~i~~~~~~G 551 (678) ||++++|+.+++.+++||||++||+|||+|.++||||||+|+++++|.|+.++++.+++.|++.||++||||||+|+++| T Consensus 458 p~~~~~d~~~~~~~~~p~sg~~AGl~Ar~D~~~g~~~sPan~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~g~G 537 (666) T protein:vir:80 458 NYKYQYDKYNDVNRWVPLAADIAGLCARTDAVSQPWMSPAGYNRGQIMNVVKLAIEPRKAHRDRLYQAAINPVIGAGGEG 537 (666) T ss_pred CceEEecccCCceeEechHHHHHHHHHHHhhcCCceEccCCeecceeeccccceeecChhHHHhhhhCCeeEEEEeCCCe Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEeccccCCCCccccceeehhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhcCCeeeeEEEEccCC Q lcl|NC_019538. 552 FILYGDKTMSLQPTPFDRINVRRLFNLLKKSISESAKYKLFENNDAFTRNSFRSEVNSYLDSIKSLGGIYDFRVVCDETN 631 (678) Q Consensus 552 ~~~wG~rT~~~~~~~~~~i~vrR~~~~i~~si~~~~~~~vfepn~~~~~~~v~~~i~~~L~~l~~~gal~g~~V~~d~~~ 631 (678) +++||+||+++++++|+||||||||+|||++|+++++|+||||||+.+|++|+++|++||++||++|+|.||+|+||+++ T Consensus 538 ~~~wG~rT~~~~~s~~~~i~vRRl~~~i~~si~~~~~~~v~epn~~~l~~~i~~~i~~~L~~l~~~gal~g~~V~~d~~~ 617 (666) T protein:vir:80 538 FILMGDKTATTVPSPFDRINVRRLFNMLKKNIGDSSKYKLFENNDNFTRASFRMEVSQYLSTIRSLGGIYDFRVQCDTTN 617 (666) T ss_pred EEEEccccCCCCCcccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhcCceeeeEEEEcCCC Confidence 99999999999988999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCHHHhhCCEEEEEEEEEecCCceEEEEEEEEeecCceeeeccccCC Q lcl|NC_019538. 632 NTPAVIDRNEFVATILIKPARSINYVSLNFSAVGTSANFDELVGPQN 678 (678) Q Consensus 632 nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~~~~~ 678 (678) ||++||++|+|+++|+++|++|||||+|||+|++++.+|+|++++|| T Consensus 618 nt~~di~~G~~~~~i~~~P~~Pae~I~~~~~~~~~~~~~~e~~~~~~ 664 (666) T protein:vir:80 618 NTPDVIDRNEFVASMFIKPAKSINYIMLNFTAVATGSDFDEIIGPVN 664 (666) T ss_pred CCHHHhhCCeEEEEEEEEecCCcceEEEEEEEeecCccHHHHHHHHh Confidence 99999999999999999999999999999999999999999999999 No 10 >protein:vir:103456 Length: 659 # NCBI annotation: tail sheath monomer # Family: family:all:661 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803108;genbank:gi:116326388;genbank:GeneID:4405485 Probab=100.00 E-value=4.7e-157 Score=877.65 Aligned_cols=657 Identities=53% Similarity=0.902 Sum_probs=501.0 Q ss_pred CceecCceEEEEcC-CCcccccCCccceeEEecccCCCCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHHhcCCeEE Q lcl|NC_019538. 1 MALLSPGVESKENN-MQTTIARSSTGRAALAGKFQWGPAYQISQLVSETDLIDRFGRPDNQTADSVLSAINFLKYGNDLR 79 (678) Q Consensus 1 ~~~~~PGVyveEv~-~~~~i~~v~tsv~afvG~~~~Gpv~~pv~i~s~~~~~~~FG~~~~~~~~~~~v~~~f~ngG~~~~ 79 (678) |+||||||||||++ ++++|+ ++||++||||+|+|||+|+|++|+||.||++.||++++.+|++|+|++||+|||++|| T Consensus 1 ~~~~~PgVyv~e~~~~~~~~~-~~ts~~~fvG~~~~Gp~~~p~~i~s~~d~~~~fG~~~~~~~~~~~v~~~f~ngg~~~~ 79 (659) T protein:vir:10 1 MTLLSPGIELKETTVQSTVVN-NSTGTAALAGKFQWGPAFQIKQVTNEVDLVNTFGQPTAETADYFMSAMNFLQYGNDLR 79 (659) T ss_pred CceecCceEEEEecCCceecc-cCccceEEEecccCCCCCccEEecCHHHHHHHcCCcCCCcchhHHHHHHHhhCCCeEE Confidence 99999999999996 555565 5899999999999999999999999999999999999999999999999999999999 Q ss_pred EEEcCCcccccccccccccceeeeecccccccccceeeeccccc-ccccccccccccccccceeeecccccccccceeec Q lcl|NC_019538. 80 TVRILDEDTARNSSPFFETIDYTITSPGVDYRIGDDVKILQNGA-TITTGKVSALNSVGGITFVRFSTAEVVKKAKELND 158 (678) Q Consensus 80 vvRv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~a~~~~~a~~~~~ 158 (678) |||+.+.+++.++....+.+..+...++.....++.+.+..... .........+...+..................... T Consensus 80 vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~v~~vd~~~~~~~~~i~~~~~~~~~~~~g~ 159 (659) T protein:vir:10 80 VVRAVDRDTAKNSSPIAGNIEYTISTPGSNYAVGDKITVKYVSDAIETEGKITEVDTDGKIKKINIPTAKIIAKAKEVGE 159 (659) T ss_pred EEEccCcccccccccccccceeeEeecccccccccceeeeecCCCccccceeeEEecccccceeeecccccccccccccc Confidence 99999988888888777777777777766555555555443221 12222222222222222222222222222222211 Q ss_pred ccccccceeeeeeecccccccccceeeeeeccccceeeecccccccccccccccccchhccccccccceeeecccccccc Q lcl|NC_019538. 159 YPALQNGWQIQFTSGGPGSGQSATAVLNGIRQDSKIYIRNDEYSRESLLRRDETTETYIDMCESYGIPVVASRYAGLTGD 238 (678) Q Consensus 159 ~~~~~~~~~~~~~s~~~~~g~~a~~~~~~~~~~~~i~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~i~A~~~G~~gn 238 (678) .+...........+... +............+.............. .+...............+.+...|.+++ T Consensus 160 ~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~v~~~~~a-----~t~~~~~~~~~~~~~~~v~a~~~G~~g~ 232 (659) T protein:vir:10 160 YPTLGSNWTAEISSSSS--GLAAVITLGKIITDSGILLAEIENAEAA-----MTAVDFQANLKKYGIPGVVALYPGELGD 232 (659) T ss_pred cceeeeeeeeeeeeecc--ccceeeEEeeeecCCceeEEeecccccc-----ccccccccceeecccccccccccceecc Confidence 11111111111111100 0011111112222222222111111111 0111122222233344556777888888 Q ss_pred ceeEEeccccccccccccccccccccccccccccccceeeeeccccccccccccccceeeeccCCeeeeeeeeecccccc Q lcl|NC_019538. 239 NIQVAFIAYKDYYKFGVDGKISSVNTVNLKTFPSGLSFGNITPSSYLEYGPQTKDQFAMIVFVGGSAVESRILSVKENDR 318 (678) Q Consensus 239 ~i~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~ 318 (678) .+++.+....+...... ...... .... ...................+.+.+...+.+++++.++...+.. T Consensus 233 ~~tv~~~~~a~~~~~~~-----v~v~~~----~~~~-~~a~~~t~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~ 302 (659) T protein:vir:10 233 KIEIEIVSKADYAKGAS-----ALLPIY----PGGG-TRASTAKAVFGYGPQTDSQYAIIVRRNDAIVQSVVLSTKRGEK 302 (659) T ss_pred cceEEEechhhccccce-----eeeeee----eecc-cccccceeeeeeccccccchhhccccccceeeeeeeecccccc Confidence 88876654433321110 000000 0000 0011111122222333445566667777888888888877777 Q ss_pred ccccchhhhhhhhcCCcceEEEEecCCCccccceeeeeccCcCCccccchhHHHhhhhhhhccchhccccccccccccCc Q lcl|NC_019538. 319 DIYGSSIYVDEFFINGYSTFIQGVAESWPTEYSGILTFGGGNSGNSTASAGDWIEGWDMFSDREHVDVNLFIAGSCAGEG 398 (678) Q Consensus 319 ~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~l~gg~dg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 398 (678) .......+....+.++++.++.......+....+...+.||.|+.+..+.++++.+++++...+.+++++++++....+. T Consensus 303 ~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~l~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~il~~p~~~~~~ 382 (659) T protein:vir:10 303 DIYDSNIYIDDFFAKGGSEYIFATAQNWPEGFSGILTLSGGLSSNAEVTAGDLMEAWDFFADRESVDVQLFIAGSCAGES 382 (659) T ss_pred ccccchhhhhhhhccCcccEEEEeecccCCCccceeeecccccccccccchhHHHHHHHhhhccccceeEEEecCCCCcc Confidence 77777777788888888889888777777777888899999999888889999999999998888889999888887777 Q ss_pred ccchhHHHHHHHHHHHhcCCeEEEEccccchhccccccCCHHHHHHHHhcccccccchhhccccccceEEEEcCeEEEec Q lcl|NC_019538. 399 VEIASTVQKSVAAICDERQDCLGWISPPREYMVNLPVATAVKKMVEWRRGVTDSGVVVDDNMNIGTTYSSTSANYKLQYD 478 (678) Q Consensus 399 ~~~~~~v~~~l~~~~~~~~~~~~i~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~v~d 478 (678) .+...+|+.+|++||+++++||+++|+|+...++.+.+.+.+++++||+.... ......+++|+|+++||||++++| T Consensus 383 ~~~~~~v~~al~~~~~~~~~~~~~~d~p~~~~~~~~~~~~~~~~~~~r~~~~~---~~~~~~~~~s~~~~l~~p~~~~~d 459 (659) T protein:vir:10 383 LETASTVQKHVVSIGDARQDCLVLCSPPRETVVGIPVTRAVDNLVNWRTAAGS---YTDNNFNISSTYAAIDGNYKYQYD 459 (659) T ss_pred hhhhHHHHHHHHHHHHhhCCeEEEEcCccccccCCCcccCHHHHHHHHHhccc---ccccccccCcceEEEEeCcEEEec Confidence 77788999999999999999999999999999999999999999999985432 334456889999999999999999 Q ss_pred ccCCceeEechHHHHHHHHHHHhhcCCceECcCCcchhheeecccceecCChhhhhhhhhCCcEEEEEecCCcEEEeccc Q lcl|NC_019538. 479 KYNDTNRWIPLSADMAGLCARTDTVAQPWMSPAGFNRGQILDVRKLAIETRQAHRDELYQNSMNPVVGFPGQGFILYGDK 558 (678) Q Consensus 479 ~~~~~~~~~pps~~vAG~~a~~d~~~g~~~sPan~~~~~v~~~~~~~~~~~~~e~~~L~~~gIn~i~~~~~~G~~~wG~r 558 (678) +.+++.+++|||+++||+|||+|.++||||||+|+++++|.|+.++++.++++|++.||++||||||+|+++|+++||+| T Consensus 460 ~~~~~~~~~p~sg~~AGl~Ar~D~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~r 539 (659) T protein:vir:10 460 KYNDVNRWVPLAADIAGLCARTDNVSQTWMSPAGYNRGQILNVIKLAIETRQAQRDRLYQEAINPVTGTGGDGYVLYGDK 539 (659) T ss_pred ccCCceEEechHHHHHHHHHHHhccCCceEccCCceeeeeeccccceecCCHhHHHHHhhCCeeEEEEeCCCeEEEEccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cCCCCccccceeehhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhcCCeeeeEEEEccCCCCHHHhh Q lcl|NC_019538. 559 TMSLQPTPFDRINVRRLFNLLKKSISESAKYKLFENNDAFTRNSFRSEVNSYLDSIKSLGGIYDFRVVCDETNNTPAVID 638 (678) Q Consensus 559 T~~~~~~~~~~i~vrR~~~~i~~si~~~~~~~vfepn~~~~~~~v~~~i~~~L~~l~~~gal~g~~V~~d~~~nt~~~i~ 638 (678) |+++++++|+||||||||+|||++|+++++|+||||||+.||++|+.+|+.||++||++|+|+||+|+||+++||+++|+ T Consensus 540 T~~~~~s~~~~i~vrR~~~~i~~si~~~~~~~v~e~n~~~l~~~i~~~i~~fL~~l~~~gal~~~~V~~d~~~nt~~~i~ 619 (659) T protein:vir:10 540 TATSVPSPFDRINVRRLFNMLKTNIGRSSKYRLFELNNAFTRSSFRTETAQYLQGIKALGGIYEYRVVCDTTNNTPSVID 619 (659) T ss_pred ccCCCCcccceEehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhcCceeeEEEEEcCCCCCHHHhh Confidence 99999889999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCEEEEEEEEEecCCceEEEEEEEEeecCceeeeccccCC Q lcl|NC_019538. 639 RNEFVATILIKPARSINYVSLNFSAVGTSANFDELVGPQN 678 (678) Q Consensus 639 ~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~~~~~ 678 (678) +|+|+++|+++|++|+|||+|||+|++++++|+|++++.- T Consensus 620 ~G~~~~~i~~~p~~pae~i~~~~~~~~~~~~~~e~~~~~~ 659 (659) T protein:vir:10 620 RNEFVATFYIQPARSINYITLNFVATATGADFDELTGLAG 659 (659) T ss_pred CCeEEEEEEEEecCCcceEEEEEEEEecCcchHHhhccCC Confidence 9999999999999999999999999999999999999999 No 11 >protein:vir:7206 Length: 659 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049780;genbank:gi:9632592;genbank:GeneID:1258597 Probab=100.00 E-value=3.2e-156 Score=873.08 Aligned_cols=658 Identities=53% Similarity=0.894 Sum_probs=495.4 Q ss_pred CceecCceEEEEcCCCcccccCCccceeEEecccCCCCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHHhcCCeEEE Q lcl|NC_019538. 1 MALLSPGVESKENNMQTTIARSSTGRAALAGKFQWGPAYQISQLVSETDLIDRFGRPDNQTADSVLSAINFLKYGNDLRT 80 (678) Q Consensus 1 ~~~~~PGVyveEv~~~~~i~~v~tsv~afvG~~~~Gpv~~pv~i~s~~~~~~~FG~~~~~~~~~~~v~~~f~ngG~~~~v 80 (678) |+|+||||||||++....+.+++|||+||||+|+|||+|+|++|+||.||+++||++++.+|++|++++||+|||++||| T Consensus 1 ~~~~~PgVyvee~~~~~~~~~~~ts~~~fvG~~~~Gp~~~p~~i~s~~~~~~~fG~~~~~~~~~~~~~~~f~ngg~~~~v 80 (659) T protein:vir:72 1 MTLLSPGIELKETTVQSTVVNNSTGTAALAGKFQWGPAFQIKQVTNEVDLVNTFGQPTAETADYFMSAMNFLQYGNDLRV 80 (659) T ss_pred CceecCceEEEEecCCcccccCCCcceEEEeecCCCCCcccEEecCHHHHHHHcCCcCCCCchhHHHHHHHHhCCceEEE Confidence 99999999999996443445569999999999999999999999999999999999999999999999999999999999 Q ss_pred EEcCCcccccccccccccceeeeecccccccccceeeeccccc-ccccccccccccccccceeeecccccccccceeecc Q lcl|NC_019538. 81 VRILDEDTARNSSPFFETIDYTITSPGVDYRIGDDVKILQNGA-TITTGKVSALNSVGGITFVRFSTAEVVKKAKELNDY 159 (678) Q Consensus 81 vRv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~a~~~~~a~~~~~~ 159 (678) |||.+.+++.++......+..+...++.....+.+........ ...+........++.....................+ T Consensus 81 vRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~g~v~~~~~~~~~~~~~v~t~~~~a~~~~~~~~ 160 (659) T protein:vir:72 81 VRAVDRDTAKNSSPIAGNIDYTISTPGSNYAVGDKITVKYVSDDIETEGKITEVDADGKIKKINIPTGKNYAKAKEVGEY 160 (659) T ss_pred EEccCCcccccccccccccceeecccccccccceeeeeeeccccccccceEEEeeccccceeeeeccccccccccccccc Confidence 9999888777777776666666666665555555544443321 111222222222221111111111111111111111 Q ss_pred cccccceeeeeeecccccccccceeeeeeccccceeeecccccccccccccccccchhccccccccceeeeccccccccc Q lcl|NC_019538. 160 PALQNGWQIQFTSGGPGSGQSATAVLNGIRQDSKIYIRNDEYSRESLLRRDETTETYIDMCESYGIPVVASRYAGLTGDN 239 (678) Q Consensus 160 ~~~~~~~~~~~~s~~~~~g~~a~~~~~~~~~~~~i~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~i~A~~~G~~gn~ 239 (678) +............... +............+.............. .....+...........+.+..+|.+++. T Consensus 161 ~~~~~~~~~~~~~~~~--~~a~~~~~v~v~~~~~~~~~~v~~~~~a-----~~~~~~~~~v~~~~~~~~~a~~~gt~g~~ 233 (659) T protein:vir:72 161 PTLGSNWTAEISSSSS--GLAAVITLGKIITDSGILLAEIENAEAA-----MTAVDFQANLKKYGIPGVVALYPGELGDK 233 (659) T ss_pred cccccceeeEEeeccc--cccceEEEEEeecCcceeeeeccccchh-----hhcccccccccccccceeeeccccccccc Confidence 1111111111111111 0011111122222222222111111111 11111222222233344566777888888 Q ss_pred eeEEeccccccccccccccccccccccccccccccceeeeeccccccccccccccceeeeccCCeeeeeeeeeccccccc Q lcl|NC_019538. 240 IQVAFIAYKDYYKFGVDGKISSVNTVNLKTFPSGLSFGNITPSSYLEYGPQTKDQFAMIVFVGGSAVESRILSVKENDRD 319 (678) Q Consensus 240 i~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~ 319 (678) +++.+....+....... ............ ................+.+.+...+..++.++++...+... T Consensus 234 ~tv~i~~~~~~~~~~~~---------~v~~~~~~~~~a-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 303 (659) T protein:vir:72 234 IEIEIVSKADYAKGASA---------LLPIYPGGGTRA-STAKAVFGYGPQTDSQYAIIVRRNDAIVQSVVLSTKRGEKD 303 (659) T ss_pred eeEEEccccccccceee---------eeeccccccccc-ccceeeeeeecccccccceeeecccceeeeeeeeecccccc Confidence 87766543332211100 000000000000 00111111222333445566666677788888877777777 Q ss_pred cccchhhhhhhhcCCcceEEEEecCCCccccceeeeeccCcCCccccchhHHHhhhhhhhccchhccccccccccccCcc Q lcl|NC_019538. 320 IYGSSIYVDEFFINGYSTFIQGVAESWPTEYSGILTFGGGNSGNSTASAGDWIEGWDMFSDREHVDVNLFIAGSCAGEGV 399 (678) Q Consensus 320 ~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~l~gg~dg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 399 (678) ......+....+.++++.++.......+....+..++.||.++.+..+..+++++++++...+..++++++++....... T Consensus 304 ~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~l~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~p~~~~~~~ 383 (659) T protein:vir:72 304 IYDSNIYIDDFFAKGGSEYIFATAQNWPEGFSGILTLSGGLSSNAEVTAGDLMEAWDFFADRESVDVQLFIAGSCAGESL 383 (659) T ss_pred ccchhhhhhhhhhcCCceEEEEEecccCCcccccccccccccccccccchhHHHHHHHhhhccccceeEEEecCCCCcch Confidence 77777788888888889998888777777788888999999998888899999999999988888899998888877777 Q ss_pred cchhHHHHHHHHHHHhcCCeEEEEccccchhccccccCCHHHHHHHHhcccccccchhhccccccceEEEEcCeEEEecc Q lcl|NC_019538. 400 EIASTVQKSVAAICDERQDCLGWISPPREYMVNLPVATAVKKMVEWRRGVTDSGVVVDDNMNIGTTYSSTSANYKLQYDK 479 (678) Q Consensus 400 ~~~~~v~~~l~~~~~~~~~~~~i~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~v~d~ 479 (678) +...+++.+|++||+++++||+++|+|+...++.+...+.+++++||+.... ......+++|+|+++||||++++|+ T Consensus 384 ~~~~~v~~~l~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~r~~~~~---~~~~~~~~~s~~~~~~~p~~~~~d~ 460 (659) T protein:vir:72 384 ETASTVQKHVVSIGDARQDCLVLCSPPRETVVGIPVTRAVDNLVNWRTAAGS---YTDNNFNISSTYAAIDGNHKYQYDK 460 (659) T ss_pred hhhHHHHHHHHHHHhhhCCEEEEEcCccccccCCCcccCHHHHHHHHhhccc---cccccccccceeEEEEcCceeeccc Confidence 7788899999999999999999999999999999999999999999986543 3344668899999999999999999 Q ss_pred cCCceeEechHHHHHHHHHHHhhcCCceECcCCcchhheeecccceecCChhhhhhhhhCCcEEEEEecCCcEEEecccc Q lcl|NC_019538. 480 YNDTNRWIPLSADMAGLCARTDTVAQPWMSPAGFNRGQILDVRKLAIETRQAHRDELYQNSMNPVVGFPGQGFILYGDKT 559 (678) Q Consensus 480 ~~~~~~~~pps~~vAG~~a~~d~~~g~~~sPan~~~~~v~~~~~~~~~~~~~e~~~L~~~gIn~i~~~~~~G~~~wG~rT 559 (678) .+++.+++||||++||+|||+|.++||||||+|+++++|.|+.++++.++++|++.||++||||||+|+++|+++||+|| T Consensus 461 ~~~~~~~~p~sg~vAGl~Ar~D~~~G~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~g~G~~~wG~rT 540 (659) T protein:vir:72 461 YNDVNRWVPLAADIAGLCARTDNVSQTWMSPAGYNRGQILNVIKLAIETRQAQRDRLYQEAINPVTGTGGDGYVLYGDKT 540 (659) T ss_pred cCCceEEechHHHHHHHHHHhhccCCcEEccCCeeeceeeccccccccCChhHHHHHhhCCceEEEEecCCeEEEEcccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCCccccceeehhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhcCCeeeeEEEEccCCCCHHHhhC Q lcl|NC_019538. 560 MSLQPTPFDRINVRRLFNLLKKSISESAKYKLFENNDAFTRNSFRSEVNSYLDSIKSLGGIYDFRVVCDETNNTPAVIDR 639 (678) Q Consensus 560 ~~~~~~~~~~i~vrR~~~~i~~si~~~~~~~vfepn~~~~~~~v~~~i~~~L~~l~~~gal~g~~V~~d~~~nt~~~i~~ 639 (678) +++++++|+||||||||+|||++|+++++|+||||||+.+|++|+.+|++||++||++|+|.||+|+||+++||++||++ T Consensus 541 ~~~~~s~~~~i~vrR~~~~i~~si~~~~~~~v~e~n~~~l~~~i~~~i~~fL~~l~~~gal~~~~V~~d~~~nt~~~i~~ 620 (659) T protein:vir:72 541 ATSVPSPFDRINVRRLFNMLKTNIGRSSKYRLFELNNAFTRSSFRTETAQYLQGNKALGGIYEYRVVCDTTNNTPSVIDR 620 (659) T ss_pred cCCCCcccceEeehhHHHHHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHHHHHHHhcCceeeEEEEEcCCCCCHHHhhC Confidence 99998899999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CEEEEEEEEEecCCceEEEEEEEEeecCceeeeccccCC Q lcl|NC_019538. 640 NEFVATILIKPARSINYVSLNFSAVGTSANFDELVGPQN 678 (678) Q Consensus 640 G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~~~~~ 678 (678) |+|+++|+|+|++|||||+|||+|++++++|+|++|..- T Consensus 621 G~~~~~i~~~p~~pae~I~~~~~~~~~~~~~~e~~~~~~ 659 (659) T protein:vir:72 621 NEFVATFYIQPARSINYITLNFVATATGADFDELTGLAG 659 (659) T ss_pred CeEEEEEEEEecCCccEEEEEEEEeecCcchHHhcccCC Confidence 999999999999999999999999999999999999999 No 12 >protein:vir:5663 Length: 671 # NCBI annotation: tail sheath protein # Family: family:all:661 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899602;genbank:gi:34419589;genbank:GeneID:2545776 Probab=100.00 E-value=8.4e-153 Score=854.32 Aligned_cols=663 Identities=49% Similarity=0.844 Sum_probs=483.6 Q ss_pred CceecCceEEEEcCCCcccccCCccceeEEecccCCCCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHHhcCCeEEE Q lcl|NC_019538. 1 MALLSPGVESKENNMQTTIARSSTGRAALAGKFQWGPAYQISQLVSETDLIDRFGRPDNQTADSVLSAINFLKYGNDLRT 80 (678) Q Consensus 1 ~~~~~PGVyveEv~~~~~i~~v~tsv~afvG~~~~Gpv~~pv~i~s~~~~~~~FG~~~~~~~~~~~v~~~f~ngG~~~~v 80 (678) |+||||||||||+|++++|+||+||++||||+|+|||+|+|++|+||.||+++||++++.+|++|+|++||+|||++||| T Consensus 1 ~~~~~Pgvyv~e~~~~~~i~~v~t~~~~fvG~~~~Gp~~~p~~v~s~~~~~~~fG~~~~~~~~~~~v~~~f~ngg~~~~v 80 (671) T protein:vir:56 1 MTLLSPGIENKEINLASAIGRAATGRAAMVGKFEWGPAYSITQVTSESDLVTIFGRPNDYTAASFMTANNFLKYGNDLRL 80 (671) T ss_pred CceecCceEEEeecCcccccccCcccceEEecccCCCCccCEEcCCHHHHHHHcCCcCCCcchhHHHHHHHHhcCCeEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEcCCcccccccccccccceeeeecccccccccceeeeccccccccc--ccccccccc--cccceeeeccccccccccee Q lcl|NC_019538. 81 VRILDEDTARNSSPFFETIDYTITSPGVDYRIGDDVKILQNGATITT--GKVSALNSV--GGITFVRFSTAEVVKKAKEL 156 (678) Q Consensus 81 vRv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~~~~~--~~~~~~~~~--~~~~~~~~~~a~~~~~a~~~ 156 (678) |||.+.+..+++..+++....+..+ +....+++.+.+......... ......... +...............+... T Consensus 81 vrv~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~v~~~~~~ 159 (671) T protein:vir:56 81 VRICDATTAQNATPLYNAVEYTIGA-SNGCVVGDDITITYSGVGALTAKGKVLEVDAGNNNAASKIFLPSAEIVAAAKSD 159 (671) T ss_pred EEecCccccccchhhcccccccccc-CcceeeceeeeeecCcccccccCcceeEEeeeccceeeeeeccceeEEEeeecc Confidence 9999988888777777665544333 233345555554322211110 111111000 00000000110011111111 Q ss_pred ecccccccceeeeeeecccccccccceeeeeeccccceeeeccccccccc--ccccccccchhccccccccceeeecccc Q lcl|NC_019538. 157 NDYPALQNGWQIQFTSGGPGSGQSATAVLNGIRQDSKIYIRNDEYSRESL--LRRDETTETYIDMCESYGIPVVASRYAG 234 (678) Q Consensus 157 ~~~~~~~~~~~~~~~s~~~~~g~~a~~~~~~~~~~~~i~~~~~~~a~~~~--~~~~~~~~~~~~~~~~~~~~~i~A~~~G 234 (678) ..+.... .....+..+... ........................ .........+......+..+.+.+.+.+ T Consensus 160 ~~~~~~~------~~t~~~~~~~~~-v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g 232 (671) T protein:vir:56 160 GNYPSVG------TITLQPTQGDIA-LTNIEIIDTGSVYFPNIELAFDALTAIETEGGALKYADLIEKQGFPRLSARYVG 232 (671) T ss_pred ccccccc------ccccccccccee-eeeecccccceEEEeccccccccccccccccccccchhhhhccccccccccccc Confidence 1110000 000011110000 000000011111111111111110 0111122334444555666777888899 Q ss_pred ccccceeEEeccccccccccccccccccccccccccccccceeeeeccccccccccccccceeeeccCCeeeeeeeeecc Q lcl|NC_019538. 235 LTGDNIQVAFIAYKDYYKFGVDGKISSVNTVNLKTFPSGLSFGNITPSSYLEYGPQTKDQFAMIVFVGGSAVESRILSVK 314 (678) Q Consensus 235 ~~gn~i~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~ 314 (678) .+++.+++.+............................+.. ...........+....+++..++..++...+++.++.. T Consensus 233 ~~g~~~~v~v~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~~~~~~~ 311 (671) T protein:vir:56 233 DFGDAISVEIINYADYQTAFAFAAGHTLGDIELPIYPDGGT-RSINLSSYFTFGPSNSNQYAVIVRVSGEVEEAFIVSTN 311 (671) T ss_pred ccCcceEEEEecccccccccccccceeeeeccccccccccc-cccccceeecccccccccceeEEeecCccceeEEEeec Confidence 99999988776554443322222111111111111111111 12222233344556667778888888889999988888 Q ss_pred ccccccccchhhhhhhhcCCcceEEEEecCCCccccceeeeeccCcCCccccchhHHHhhhhhhhccchhcccccccccc Q lcl|NC_019538. 315 ENDRDIYGSSIYVDEFFINGYSTFIQGVAESWPTEYSGILTFGGGNSGNSTASAGDWIEGWDMFSDREHVDVNLFIAGSC 394 (678) Q Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~l~gg~dg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 394 (678) .+.........+......++.+.++.......+ .......+.||.|+.. ...++..+++.+...+.+.+++++++.. T Consensus 312 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~gg~d~~~--~~~~~~~~~~~~~~~~~~~~~~~~a~~~ 388 (671) T protein:vir:56 312 PGDKDVNGQSIFIDEYFENSGSAYITAIAEGWK-TESGAYNFGGGSDANA--GADDWMFGLDMLSDPEVLYTNLVIAGNA 388 (671) T ss_pred ccccccchhhhhhhhhhcccCceEEEecCcccC-CccccccccCcccccc--chhHHHHHHHhhhhccccceeEEEcCCC Confidence 887777777777777777777776655544333 3445567888888753 4556778888888777777776666554 Q ss_pred ccCcccchhHHH-HHHHHHHHhcCCeEEEEccccchhccccccCCHHHHHHHHhccccc-ccchhhccccccceEEEEcC Q lcl|NC_019538. 395 AGEGVEIASTVQ-KSVAAICDERQDCLGWISPPREYMVNLPVATAVKKMVEWRRGVTDS-GVVVDDNMNIGTTYSSTSAN 472 (678) Q Consensus 395 ~~~~~~~~~~v~-~~l~~~~~~~~~~~~i~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~s~~~~~~~p 472 (678) ..........++ .++..+|+.+++|++++|+|+...++.+...+.+++.+|+...... ........+++|+|+++||| T Consensus 389 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p 468 (671) T protein:vir:56 389 AAEEVSIASTVQKYAIDSVGNVRQDCVVFVSPPQSLIVNKQAGTAVANIQGWRTGIDPTNGQAVVDNLNVSTTYAVIDGN 468 (671) T ss_pred CCccchhHHHHHHHHHHHHHhhcCCEEEEEeccccccccccccccHHHHHHHhhhccccchhhhhhhccCCcceEEEecC Confidence 443333333444 4466666788999999999999999999999999999999876533 34456678899999999999 Q ss_pred eEEEecccCCceeEechHHHHHHHHHHHhhcCCceECcCCcchhheeecccceecCChhhhhhhhhCCcEEEEEecCCcE Q lcl|NC_019538. 473 YKLQYDKYNDTNRWIPLSADMAGLCARTDTVAQPWMSPAGFNRGQILDVRKLAIETRQAHRDELYQNSMNPVVGFPGQGF 552 (678) Q Consensus 473 ~~~v~d~~~~~~~~~pps~~vAG~~a~~d~~~g~~~sPan~~~~~v~~~~~~~~~~~~~e~~~L~~~gIn~i~~~~~~G~ 552 (678) |++++|+.+++.+++|||+++||+|||+|.++||||||+|+++++|.|+.++++.+++.|++.||++||||||+|+++|+ T Consensus 469 ~~~~~d~~~~~~~~~p~s~~~AGl~Ar~D~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~ 548 (671) T protein:vir:56 469 YKYQYDKYNDRNRWVPLAGDIAGLCAYTDQVSQPWMSPAGFNRGQIKGVNRLAVDLRRAHRDALYQIGINPVVGFAGQGF 548 (671) T ss_pred ceEEecccCCceeEechHHHHHHHHHHhhccCCcEECcCCceeccccccccceeecChhHHHHHhhCCceEEEEecCCeE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEeccccCCCCccccceeehhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhcCCeeeeEEEEccCCC Q lcl|NC_019538. 553 ILYGDKTMSLQPTPFDRINVRRLFNLLKKSISESAKYKLFENNDAFTRNSFRSEVNSYLDSIKSLGGIYDFRVVCDETNN 632 (678) Q Consensus 553 ~~wG~rT~~~~~~~~~~i~vrR~~~~i~~si~~~~~~~vfepn~~~~~~~v~~~i~~~L~~l~~~gal~g~~V~~d~~~n 632 (678) ++||+||+++++++|+||||||||+|||++|+++++|+||||||+.||++|+++|++||++||++|+|.||+|+||+++| T Consensus 549 ~~wG~rT~~~~~~~~~~i~vrR~~~~i~~si~~~~~~~v~epn~~~~~~~i~~~i~~fL~~l~~~gal~g~~v~~d~~~n 628 (671) T protein:vir:56 549 VLYGDKTATQQASAFDRINVRRLFNLLKKAISDAAKYRLFELNDEFTRSSFKSEIDAYLTNIQDLGGVYDFRVVCDETNN 628 (671) T ss_pred EEEcceecCCCCcccceEehhhHHHHHHHHHHHHHHHhcCCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEcCCCC Confidence 99999999998889999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CHHHhhCCEEEEEEEEEecCCceEEEEEEEEeecCceeeeccc Q lcl|NC_019538. 633 TPAVIDRNEFVATILIKPARSINYVSLNFSAVGTSANFDELVG 675 (678) Q Consensus 633 t~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~~ 675 (678) |++||++|+|+++|+++|++|+|||+|||+|++++++|+|++| T Consensus 629 t~~~i~~G~~~~~i~~~p~~Pae~I~~~~~~~~~~~~f~e~~~ 671 (671) T protein:vir:56 629 PGSVIDRNEFVASIYVKPAKSINFITLNFVATSTDADFAEIIG 671 (671) T ss_pred CHHHhhCCeEEEEEEEEecCCcceEEEEEEEeecCcchhhhcC Confidence 9999999999999999999999999999999999999999999 No 13 >protein:vir:106984 Length: 743 # NCBI annotation: contractile sheath protein gp18 # Family: family:all:661 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195136;genbank:gi:58532913;uniprot:Q5GQN6;genbank:GeneID:3260481 Probab=100.00 E-value=5.7e-141 Score=789.43 Aligned_cols=655 Identities=29% Similarity=0.426 Sum_probs=401.7 Q ss_pred Cc-eecCceEEEEcC-CCcccccCCccceeEEecccCCCCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHHhcCCeE Q lcl|NC_019538. 1 MA-LLSPGVESKENN-MQTTIARSSTGRAALAGKFQWGPAYQISQLVSETDLIDRFGRPDNQTADSVLSAINFLKYGNDL 78 (678) Q Consensus 1 ~~-~~~PGVyveEv~-~~~~i~~v~tsv~afvG~~~~Gpv~~pv~i~s~~~~~~~FG~~~~~~~~~~~v~~~f~ngG~~~ 78 (678) |. ||||||||||++ ++++|+||+||++||||+|+|||+|+|++|+||.||+++||++++.+|++|+|++||+|||++| T Consensus 1 m~~~~~Pgvyv~e~~~~~~~i~~~~t~~~~~vg~~~~Gp~~~p~~i~s~~~~~~~fG~~~~~~~~~~~v~~~f~ngg~~~ 80 (743) T protein:vir:10 1 MASQVSPGILIKERDLTNAVVTGALQIRAAHASTFAKGPIGDIVNINTQKELVSVFGEPKEDNAEDWMVASEFLNYGGRL 80 (743) T ss_pred CccccCCceEEEEecCCCceeccCCcceeEEEEeccCCCCCcCEEecCHHHHHHHcCCccCCcchHHHHHHHHHhCCceE Confidence 86 899999999996 7899999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEcCCccccccccc---------------ccccceeeeecccccccccceeeecccccccccccccccccccccceee Q lcl|NC_019538. 79 RTVRILDEDTARNSSP---------------FFETIDYTITSPGVDYRIGDDVKILQNGATITTGKVSALNSVGGITFVR 143 (678) Q Consensus 79 ~vvRv~~~~~~~~~~~---------------~~~~~~~~~~~~~~~~~~g~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 143 (678) |||||.+++...++.. ....+...+..||.|++ +.++.+.+...+................... T Consensus 81 ~vvrv~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~G~~gN-~i~V~v~~~~~d~~~~~~~~~~~~~~~~~~~ 159 (743) T protein:vir:10 81 AVVRAETTGVLNATTGSAGVLVKNRESWDAGSGNGEVFVARTAGSWGN-SLMGVLVDRGADYIVTFAATPTDTAVGTQLL 159 (743) T ss_pred EEEEccCccccccccccccccccccccccccccceeEEEEeecccccc-ceEEEEecCCCcceeeeeccccccccceeee Confidence 9999987653222211 11123344555665443 1233333322221110000000000000000 Q ss_pred ecccccccccceeecccccccceee--------eeeecccccccccceee----eeecccccee-eeccccc---ccc-c Q lcl|NC_019538. 144 FSTAEVVKKAKELNDYPALQNGWQI--------QFTSGGPGSGQSATAVL----NGIRQDSKIY-IRNDEYS---RES-L 206 (678) Q Consensus 144 ~~~a~~~~~a~~~~~~~~~~~~~~~--------~~~s~~~~~g~~a~~~~----~~~~~~~~i~-~~~~~~a---~~~-~ 206 (678) ........ ................ .........+....... .......... ....... ... . T Consensus 160 ~~~~~~~~-~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tv 238 (743) T protein:vir:10 160 FSYSGTLV-TGEILSYDSATNTATITASGTLTSQYLLDTPEQGLIGSFTDNSTTEVGRTPGTYSNVPASGGTGTGATFNV 238 (743) T ss_pred eccccccc-ccceeeeeecCcceeeeeccccceeeecccccccccccccccccccccccccceeeEEecccccccccccc Confidence 00000000 0000000000000000 00000000000000000 0000000000 0000000 000 0 Q ss_pred ccccccccchhccccccccceeeeccccccccceeEEeccccccc-----ccccc-ccc---cccccccccccccc-c-- Q lcl|NC_019538. 207 LRRDETTETYIDMCESYGIPVVASRYAGLTGDNIQVAFIAYKDYY-----KFGVD-GKI---SSVNTVNLKTFPSG-L-- 274 (678) Q Consensus 207 ~~~~~~~~~~~~~~~~~~~~~i~A~~~G~~gn~i~v~v~~~~~~~-----~~~~~-~~~---~~~~~~~~~~~~~~-~-- 274 (678) .....+.. . .................++.+.+......... ..... ... .............. . T Consensus 239 ~v~~~~~~-v---g~~v~~~~~~~~~~~~~~~~~~v~~~~~~~~t~~~~~~~~~~~g~~~~~a~~~~~~~~~~~~~~~~~ 314 (743) T protein:vir:10 239 VVADAGGG-V---GGSVVVTLANPGTGYNQGETLTIASAATGDGTDILVTVATLSDGTIAITELKDWYLNTEIGSTGIKL 314 (743) T ss_pred cccccccc-c---ccccccccccccceeeeccccccccccccccccchhheecccccceeeeecccccccchhhcccccc Confidence 00000000 0 00000000000000000000000000000000 00000 000 00000000000000 0 Q ss_pred --ceeeeeccccccccccccccce--------eeeccCCeeeeeee-eeccccccccccchhhhhhhhcCCcceEEEE-- Q lcl|NC_019538. 275 --SFGNITPSSYLEYGPQTKDQFA--------MIVFVGGSAVESRI-LSVKENDRDIYGSSIYVDEFFINGYSTFIQG-- 341 (678) Q Consensus 275 --~~~~~~~~~~~~~~~~~~~~~~--------~~v~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~-- 341 (678) ..................+.+. .+....+..++++. ++...+.....+...+...+. ++.+.++.. T Consensus 315 ~~~~~~~~t~~~~~~~~~~~d~~~v~v~~~~~~~~~~~~~v~~~~~~~s~~~~~~~~~~~~~~~~~~~-~~~s~~~~~~~ 393 (743) T protein:vir:10 315 GDIGPRPGTSQFATDNGITDDQVHFAVIDTTGELTGTANTIVERLTYLSKLSDARSEENANIYYKNVI-NEQSAYLYHGN 393 (743) T ss_pred ccccccceeeeccccccccccceEEEEecCcceeeeccCceeEEEeeeecccccccccCcceeeccee-ccccceeeccC Confidence 0000000000000000011111 11223344556654 344343333333333322221 112221110 Q ss_pred -----------------------ecCCCccccceeeeeccCcCCccccchhHHHhhhhhhhccchhccccccccccccCc Q lcl|NC_019538. 342 -----------------------VAESWPTEYSGILTFGGGNSGNSTASAGDWIEGWDMFSDREHVDVNLFIAGSCAGEG 398 (678) Q Consensus 342 -----------------------~~~~~~~~~~~~~~l~gg~dg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 398 (678) ..............+.||.|+.. .+..++..+++++...+.+++++++++..... T Consensus 394 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gG~d~~~-~~~~~~~~~~~~~~~~~~~~~~ll~~p~~~~~- 471 (743) T protein:vir:10 394 DAAVQIAASGEAWGQSSDQVLADAGTAFSRTTGYWVNLAGGNDDFA-YDAGEFGAAMDLFLDTEETEIDFVLMGGSMAD- 471 (743) T ss_pred cccceeeeccccCccccceeeeecccccccccceEEEeecCccccc-cchhHHHHHHHHhhhccccCcceEEecCcccC- Confidence 01111122234567888988754 46677888899988888888888877665443 Q ss_pred ccchhHHHHHHHHHHHhcCCeEEEEccccchhcc------ccccCCHHHHHHHHhcccccccchhhccccccceEEEEcC Q lcl|NC_019538. 399 VEIASTVQKSVAAICDERQDCLGWISPPREYMVN------LPVATAVKKMVEWRRGVTDSGVVVDDNMNIGTTYSSTSAN 472 (678) Q Consensus 399 ~~~~~~v~~~l~~~~~~~~~~~~i~d~p~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p 472 (678) ..+..+++++|++||+++++||+|+|+|...... .....+.+++..|++ .+++|+|+++||| T Consensus 472 ~~~~~~v~~a~~~~~~~~~~~~a~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~------------~~~~s~~~~~~~p 539 (743) T protein:vir:10 472 EADTKSKATKVIAIAASRKDALAFVSPHKGNQIASTGNVALSSAQQKENTIAFFS------------DLTSTSYAVFDSG 539 (743) T ss_pred ccchHHHHHHHHHHHHhhCCeEEEEecCCCccccccccccccccccchHHHHHHH------------hccCCeeEEEEcc Confidence 3456789999999999999999999999754322 233345566666654 3467899999999 Q ss_pred eEEEecccCCceeEechHHHHHHHHHHHhhcCCceECcCCcchhheeecccceecCChhhhhhhhhCCcEEEEEecCCcE Q lcl|NC_019538. 473 YKLQYDKYNDTNRWIPLSADMAGLCARTDTVAQPWMSPAGFNRGQILDVRKLAIETRQAHRDELYQNSMNPVVGFPGQGF 552 (678) Q Consensus 473 ~~~v~d~~~~~~~~~pps~~vAG~~a~~d~~~g~~~sPan~~~~~v~~~~~~~~~~~~~e~~~L~~~gIn~i~~~~~~G~ 552 (678) |++++|+.+++.+++|||+++||+|||+|.++||||||+|+++.+|.|+.++++.++++|++.||++|||||++|+++|+ T Consensus 540 ~~~~~d~~~~~~~~~p~s~~~AGl~a~~D~~~g~~~span~~~~gi~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~ 619 (743) T protein:vir:10 540 YKYVYDRFTDKYRYIPCNGDVAGLCVQTSNQLDDWYSPAGLNRGGILNAVKLAYNPNKADRDELYQNRINPVVSLRGQGI 619 (743) T ss_pred ceeeeccccCceeEechhHHHHHHHHHhhccCCcEEccCCeeeeeeeccccceecCChhHHHhHhhCCceEEEEecCCeE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEeccccCCCCccccceeehhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhcCCeeeeEEEEccCCC Q lcl|NC_019538. 553 ILYGDKTMSLQPTPFDRINVRRLFNLLKKSISESAKYKLFENNDAFTRNSFRSEVNSYLDSIKSLGGIYDFRVVCDETNN 632 (678) Q Consensus 553 ~~wG~rT~~~~~~~~~~i~vrR~~~~i~~si~~~~~~~vfepn~~~~~~~v~~~i~~~L~~l~~~gal~g~~V~~d~~~n 632 (678) ++||+||+++++++|+||||||||+|||++|+++++|+||||||+.+|++|+++|++||++||++|+|.||+|+||+++| T Consensus 620 ~~wG~rT~~s~d~~~~~i~vrR~~~~i~~si~~~~~~~v~e~n~~~~~~~i~~~i~~fL~~l~~~gal~~~~V~~d~~~n 699 (743) T protein:vir:10 620 TLFGDKTALAAPSAFDRINVRRLFLNLEKRARRLAEGVLFEQNDATTRAGFSSALNSYLSEVQARRGVTDYLVICDESNN 699 (743) T ss_pred EEEcccccCCCCcccceEeehhhHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhcCceeeeEEEEcCCCC Confidence 99999999877789999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CHHHhhCCEEEEEEEEEecCCceEEEEEEEEeecCceeeecccc Q lcl|NC_019538. 633 TPAVIDRNEFVATILIKPARSINYVSLNFSAVGTSANFDELVGP 676 (678) Q Consensus 633 t~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~~~ 676 (678) |+++|++|+|+++|+++|++|||||+|||+|++++.+|+|++++ T Consensus 700 t~~~i~~G~~~~~i~~~p~~pae~I~~~~~~~~~~~~~~e~~~~ 743 (743) T protein:vir:10 700 TPDIIDRNEFVAEVYVKPTRSINFITITFTATKTGVTFSEVVGR 743 (743) T ss_pred CHHHhhCCeEEEEEEEEecCCcceEEEEEEEeecCcchHhhhcC Confidence 99999999999999999999999999999999999999999999 No 14 >protein:vir:104477 Length: 749 # NCBI annotation: gp18 # Family: family:all:661 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214663;genbank:gi:61806304;genbank:GeneID:3294532 Probab=100.00 E-value=1.2e-140 Score=787.73 Aligned_cols=642 Identities=29% Similarity=0.442 Sum_probs=398.1 Q ss_pred Cc--eecCceEEEEcCCCcccccCCccceeEEecccCCCCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHHhcCCeE Q lcl|NC_019538. 1 MA--LLSPGVESKENNMQTTIARSSTGRAALAGKFQWGPAYQISQLVSETDLIDRFGRPDNQTADSVLSAINFLKYGNDL 78 (678) Q Consensus 1 ~~--~~~PGVyveEv~~~~~i~~v~tsv~afvG~~~~Gpv~~pv~i~s~~~~~~~FG~~~~~~~~~~~v~~~f~ngG~~~ 78 (678) |+ ||||||||||++..++|++|+||++||||.|+|||+|+|++|+||.||+++||++++.+|++|+|++||+|||++| T Consensus 1 M~~~~~~PgVyv~e~~~~~~~~~~~t~~~~fvG~~~~Gp~~~p~~v~s~~~~~~~fG~~~~~~~~~~~v~~~F~ngg~~~ 80 (749) T protein:vir:10 1 MATNQSSPGVVIQERDLTTVSTIPTANVGVIAAPFTKGPVEEVIEITSERQLAEKFGEPNESNYEYWFSAAQFLSYGGLL 80 (749) T ss_pred CCccccCCeeEEEEecCCcccccccCceeEEEeccCCCCCccCEEcCCHHHHHHHcCCccCCcccHHHHHHHHhhcCCeE Confidence 87 8999999999987788999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEcCCccccccccc-----cc--------------ccceeeeecccccccccceeee--ccccccccccccccccccc Q lcl|NC_019538. 79 RTVRILDEDTARNSSP-----FF--------------ETIDYTITSPGVDYRIGDDVKI--LQNGATITTGKVSALNSVG 137 (678) Q Consensus 79 ~vvRv~~~~~~~~~~~-----~~--------------~~~~~~~~~~~~~~~~g~~i~~--~~~~~~~~~~~~~~~~~~~ 137 (678) |||||.+.+. +++.. .. ..+...+..||. ||+.+.+ .+.+++......... .. T Consensus 81 ~vvRv~~~~~-~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~pG~---~gn~l~v~v~~~~~~~~~~~~~~~--~~ 154 (749) T protein:vir:10 81 KTIRVNSSSL-KNAVDTGTAPLVKNLQDYETSIEDASNNFSWVARTPGD---TGNSIGIFVTDAGADQVVVVPAPG--SG 154 (749) T ss_pred EEEEccCccc-cccccccccccccccccccccccccccceEEEeccCCC---cCCceEEEEEcCCCceeeeeecCC--cc Confidence 9999976542 11110 10 112233445554 4554443 333332111111000 00 Q ss_pred ccceeeecccccccccceeecccccccceeeeeeeccccccc--------c--cc-----------eeeeeeccccceee Q lcl|NC_019538. 138 GITFVRFSTAEVVKKAKELNDYPALQNGWQIQFTSGGPGSGQ--------S--AT-----------AVLNGIRQDSKIYI 196 (678) Q Consensus 138 ~~~~~~~~~a~~~~~a~~~~~~~~~~~~~~~~~~s~~~~~g~--------~--a~-----------~~~~~~~~~~~i~~ 196 (678) ....... .................+..............+. . +. .............. T Consensus 155 ~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~a~~~~~~~~~~~~~~~~~~s~~~~~~~ 233 (749) T protein:vir:10 155 NEHEFVA-DAAVSAASGAAGKVFKYSIILTIDDVVGTFAPGSATTITIGGSAESVNVLAYDATNKKLEIGLPSGGVTGIL 233 (749) T ss_pred ceeeEEe-eecccccccccccccccceeeeeccccceeecccceeeeccCcccccccccccCCcceEEEeeeccccccee Confidence 0000000 0000000000000000000000000000000000 0 00 00000000000000 Q ss_pred eccccc--cccccccccc--ccchhccccccccceeeeccccccccceeEEecccccccccccccccccccccccccccc Q lcl|NC_019538. 197 RNDEYS--RESLLRRDET--TETYIDMCESYGIPVVASRYAGLTGDNIQVAFIAYKDYYKFGVDGKISSVNTVNLKTFPS 272 (678) Q Consensus 197 ~~~~~a--~~~~~~~~~~--~~~~~~~~~~~~~~~i~A~~~G~~gn~i~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 272 (678) ...... .......... .....................+..+..+.+....... ...... ...........+. T Consensus 234 a~~~~v~~~~~~~~~~~~i~~~~~~~~~~~~~~~a~~~~~~~~~g~~~~it~v~~~~--~~~~~~--t~~~~~~~a~~~g 309 (749) T protein:vir:10 234 ADNQVITQGTNTAKINVTIERKLLVALNKSSIEFAASDVVQDTNSTNITITSVRDEY--TEREYL--PGVKWINVAPRPG 309 (749) T ss_pred eeeecccccccccccccccccchhhhhccccceeeccccccCCccceeEEEeeeccc--cccccc--cceeecccccccc Confidence 000000 0000000000 0000000000000000001111111221111110000 000000 0000000000010 Q ss_pred ccceeeeeccccccccccccccceeeec--------cCCeeeeeeee-eccccccccccchhhhhhhhcCCcceEEEEec Q lcl|NC_019538. 273 GLSFGNITPSSYLEYGPQTKDQFAMIVF--------VGGSAVESRIL-SVKENDRDIYGSSIYVDEFFINGYSTFIQGVA 343 (678) Q Consensus 273 ~~~~~~~~~~~~~~~~~~~~~~~~~~v~--------~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~ 343 (678) ...+.. . .....+.+.+.+. ..++++|.+.. +...+.........+...... ..+.++.... T Consensus 310 t~~~~~-------~-~~g~~D~~~v~v~~~~g~~~~~~g~v~e~~~~~~~~~~~~~~~~~~~~~~~~~~-~~s~~v~~~~ 380 (749) T protein:vir:10 310 TSLYAN-------G-VGGHRDEMHVILVDIDGGVTGTVGALLERYIDVSKASDAKTSVGETNYYAEVIK-QKSEFIYWAE 380 (749) T ss_pred ceeeee-------c-ccCCCCceEEEEecCCCeeeecccceeeeeeeccccccccccccccchhhhhhc-cCCCEEEEEe Confidence 000000 0 0001111111111 12344555543 222233333333323322221 1222221100 Q ss_pred CC--------------------------------------------CccccceeeeeccCcCCcc-----ccchhHHHhh Q lcl|NC_019538. 344 ES--------------------------------------------WPTEYSGILTFGGGNSGNS-----TASAGDWIEG 374 (678) Q Consensus 344 ~~--------------------------------------------~~~~~~~~~~l~gg~dg~~-----~~~~~~~~~~ 374 (678) .. ........+.+.+|.|+.. .....++.++ T Consensus 381 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~d~~~~~~~~~~~~~~~~~~ 460 (749) T protein:vir:10 381 HESTLYAATSSASDGLFGQTAANRQFNLFRSAAGSVDYPAGVTTLGSKNNATYYYRLSGGVNYTVSAGQYTITNTDIGSA 460 (749) T ss_pred cccccccccccccccccccccccceeeccccccccceeccccccccccCCcEEEEEccCCcccccccccccccchhHHHH Confidence 00 0001112345666666432 3455778889 Q ss_pred hhhhhccchhccccccccccccCcccchhHHHHHHHHHHHhcCCeEEEEccccchhcccccc-CCHHHHHHHHhcccccc Q lcl|NC_019538. 375 WDMFSDREHVDVNLFIAGSCAGEGVEIASTVQKSVAAICDERQDCLGWISPPREYMVNLPVA-TAVKKMVEWRRGVTDSG 453 (678) Q Consensus 375 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~l~~~~~~~~~~~~i~d~p~~~~~~~~~~-~~~~~~~~~~~~~~~~~ 453 (678) ++++..++...+++++.+....+ .++..+++.+|++||++|++||+++|+|.....+.... ....++..|+.+ T Consensus 461 ~~~l~~~~~~~~~~li~~~~~~~-~~~~~~v~~al~~~~~~~~~~~~~~d~p~~~~~~~~~~~~~~~~~~~~~~~----- 534 (749) T protein:vir:10 461 YELIGDPESQIVDFIISGPSGTS-DANALAKITSLVNIAEERRDCMVFVSPRRGNVIGISNTTTITTNIVDFFKK----- 534 (749) T ss_pred HHHhhhhhhcccceEEEecCCCC-cchhHHHHHHHHHHHhhcCCEEEEEcCCCCcccccccchhhhhHHHHHHhh----- Confidence 99998888888887766543332 34567899999999999999999999998766554333 345667777654 Q ss_pred cchhhccccccceEEEEcCeEEEecccCCceeEechHHHHHHHHHHHhhcCCceECcCCcchhheeecccceecCChhhh Q lcl|NC_019538. 454 VVVDDNMNIGTTYSSTSANYKLQYDKYNDTNRWIPLSADMAGLCARTDTVAQPWMSPAGFNRGQILDVRKLAIETRQAHR 533 (678) Q Consensus 454 ~~~~~~~~~~s~~~~~~~p~~~v~d~~~~~~~~~pps~~vAG~~a~~d~~~g~~~sPan~~~~~v~~~~~~~~~~~~~e~ 533 (678) .++|+|+++||||++++|+.+++.+++|||+++||+|||+|.++||||||+|+++.+|+|+.++++.++++|+ T Consensus 535 -------~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~vAGl~Ar~D~~~g~~~SPan~~~~~i~g~~~~~~~~~~~e~ 607 (749) T protein:vir:10 535 -------LPSSSYMVFDSGYKYIYDKYNDVYRYIPCNGDTAGLCLQTNEISEPWFSPAGFQRGVLRNAIKLAYTPNKAQR 607 (749) T ss_pred -------ccCceeEEEEccceeeeccccCceEEechHHHHHHHHHHhhccCCcEECcCCceeeeeeccccceeecChhHH Confidence 3568899999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhhCCcEEEEEecCCcEEEeccccCCCCccccceeehhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHH Q lcl|NC_019538. 534 DELYQNSMNPVVGFPGQGFILYGDKTMSLQPTPFDRINVRRLFNLLKKSISESAKYKLFENNDAFTRNSFRSEVNSYLDS 613 (678) Q Consensus 534 ~~L~~~gIn~i~~~~~~G~~~wG~rT~~~~~~~~~~i~vrR~~~~i~~si~~~~~~~vfepn~~~~~~~v~~~i~~~L~~ 613 (678) +.||++|||||++|+++|+++||+||+++.+++|+||||||||+|||++|+++++|+||||||+.+|++|+++|+.||++ T Consensus 608 ~~Ln~~gIn~i~~~~g~G~~~wG~rT~~s~d~~~~~i~vRRl~~~ie~si~~~~~~~v~epn~~~l~~~i~~~i~~fL~~ 687 (749) T protein:vir:10 608 DQLYANRVNPIVSFPGQGVVLYGDKTALGFASAFDRINIRRLFLTVERVISTAAKAQLFEQNDEAQRSLFINIVEPYLRD 687 (749) T ss_pred HhhhhCCceEEEEecCCeEEEEcceecCCCCcccceeehhhhHHHHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999977677999999999999999999999999999999999999999999999999 Q ss_pred HHhcCCeeeeEEEEccCCCCHHHhhCCEEEEEEEEEecCCceEEEEEEEEeecCceeeeccc Q lcl|NC_019538. 614 IKSLGGIYDFRVVCDETNNTPAVIDRNEFVATILIKPARSINYVSLNFSAVGTSANFDELVG 675 (678) Q Consensus 614 l~~~gal~g~~V~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~~ 675 (678) ||++|+|.||+|+||+++||+++|++|+|+++|+++|++|||||+|||+|++++.+|+|+++ T Consensus 688 l~~~G~i~~f~V~~d~~~Nt~~~i~~G~~~~~i~~~P~~pae~I~~~~~~~~~~~~~~e~~s 749 (749) T protein:vir:10 688 VQGRRGVVDFLVKCDSTNNTPEAVDRGEFYAEVFLKPTRTINYVQLTFVATRTGVSFAEVAS 749 (749) T ss_pred HHhcCCeeeeEEEEcCCCCCHHHhhCCEEEEEEEEEecCCccEEEEEEEEeecCcchHHHhC Confidence 99999999999999999999999999999999999999999999999999999999999999 No 15 >protein:vir:104858 Length: 729 # NCBI annotation: T4-like tail sheath protein # Family: family:all:661 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214361;genbank:gi:61806001;genbank:GeneID:3294378 Probab=100.00 E-value=2.9e-139 Score=780.09 Aligned_cols=637 Identities=31% Similarity=0.461 Sum_probs=382.5 Q ss_pred CceecCceEEEEcC-CCcccccCCccceeEEecccCCCCCccEEecCHHHHHHHcCCc--CccchhHHHHHHHHHhcCCe Q lcl|NC_019538. 1 MALLSPGVESKENN-MQTTIARSSTGRAALAGKFQWGPAYQISQLVSETDLIDRFGRP--DNQTADSVLSAINFLKYGND 77 (678) Q Consensus 1 ~~~~~PGVyveEv~-~~~~i~~v~tsv~afvG~~~~Gpv~~pv~i~s~~~~~~~FG~~--~~~~~~~~~v~~~f~ngG~~ 77 (678) |+|+||||||||++ ++++|+||+||++||||+|+|||+|+|++|+||.||+++||+| ++.++++|++++||+|||++ T Consensus 3 ~~~~~PgVyv~e~~~~~~~i~~v~ts~~~fvG~~~~Gp~~~p~~i~s~~~~~~~fG~~~~~~~~~~~~~~~~~f~ngg~~ 82 (729) T protein:vir:10 3 LNLASPGIVVREVDLTIGRVDPTSGSIGALVAPFAKGPVNDPQLIESEEDLLQTFGQPYSTDKHYEYWMVASSYLAYGGT 82 (729) T ss_pred ccccCCceEEEEecCCCcccccccccceeEEeccccCCCccCeEcCCHHHHHHHcCccccCCcchhHHHHHHHHHhCCce Confidence 67999999999995 7899999999999999999999999999999999999999998 46778999999999999999 Q ss_pred EEEEEcCCccccccccccccccee---------------------------------eeecccccccccceeeec--ccc Q lcl|NC_019538. 78 LRTVRILDEDTARNSSPFFETIDY---------------------------------TITSPGVDYRIGDDVKIL--QNG 122 (678) Q Consensus 78 ~~vvRv~~~~~~~~~~~~~~~~~~---------------------------------~~~~~~~~~~~g~~i~~~--~~~ 122 (678) |||||+.+.++..++.. .++... .+..++ .||+.+.+. +.. T Consensus 83 ~~vvRv~~~~~~~a~~~-~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~G---~~gn~~~v~v~~~~ 158 (729) T protein:vir:10 83 MQVVRADDYNTQTGVGL-KNAFVTGGVGVGATMLRITSNTHYNQLGYDENTISGVSVAAKNPG---TWANGIKVAIIDGK 158 (729) T ss_pred EEEEecCcccccccccc-cccccccccccccccccccccccccccccccCCCcceEEEEeccC---ccccceeeEEeccc Confidence 99999988765443322 111110 111111 112211111 100 Q ss_pred cccccccccccccccccceeeecccccccccceeecccccccceeeeeeecccccccc-cceeeeeeccccceeeecccc Q lcl|NC_019538. 123 ATITTGKVSALNSVGGITFVRFSTAEVVKKAKELNDYPALQNGWQIQFTSGGPGSGQS-ATAVLNGIRQDSKIYIRNDEY 201 (678) Q Consensus 123 ~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~a~~~~~~~~~~~~~~~~~~s~~~~~g~~-a~~~~~~~~~~~~i~~~~~~~ 201 (678) .+..... ............. ............................ ......... ......... T Consensus 159 ~~~~~~~---~~~~~~~~~t~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~s---~~~~~~~~~ 225 (729) T protein:vir:10 159 ADQILTV---ASGNTTAVGSAVT-------QSISKTIGTATGTTTIDGVLKGIVTGSTDTTLEVKVIS---HISAAGVET 225 (729) T ss_pred Ccceeee---eccccccceeeee-------eeccccccccccceeeeeeecccccccccccccceecc---cccccccce Confidence 0000000 0000000000000 0000000000000000000000000000 000000000 000000000 Q ss_pred cccccccccccccchhccccccccceeeeccccccccceeEEeccccccccccccccccccc-cccccccccccceeeee Q lcl|NC_019538. 202 SRESLLRRDETTETYIDMCESYGIPVVASRYAGLTGDNIQVAFIAYKDYYKFGVDGKISSVN-TVNLKTFPSGLSFGNIT 280 (678) Q Consensus 202 a~~~~~~~~~~~~~~~~~~~~~~~~~i~A~~~G~~gn~i~v~v~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~ 280 (678) ..... .....................+ ...........+.............. .......+......... T Consensus 226 ~~~~~------~~~~~~~~~~~s~~~~a~~~~~---~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~ 296 (729) T protein:vir:10 226 AVEYQ------QNGTYTFDNSGSVNVIAAGSSG---SGSAKSYTAQTDWFESQNIVLSNSTLEWDSIADAPGTSTYVSTR 296 (729) T ss_pred ecccc------ccceeeecccCccceeeecccc---ccccccceeeeccccccccccccccccccccccccccccccccc Confidence 00000 0000000000000000000000 00000000000000000000000000 00000000000000000 Q ss_pred ccccccccccccccceeeeccCCeeeeeee-eeccccccccccchhhhhhhhcCCcceEEEEe----------------- Q lcl|NC_019538. 281 PSSYLEYGPQTKDQFAMIVFVGGSAVESRI-LSVKENDRDIYGSSIYVDEFFINGYSTFIQGV----------------- 342 (678) Q Consensus 281 ~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~----------------- 342 (678) ............+.....+...+.+++.+. ++...+.........+..... +..+.++... T Consensus 297 ~~~~d~~~~~~~d~~~~~~~~~g~vve~~~~~s~~~~~~~~~~~~~~~~~vi-~~~s~~~~~~~~~~~~~~~~~~~~~~~ 375 (729) T protein:vir:10 297 GGKNDEIHVLVIDDKGTITGNSGTILEKHLSLSKAKDAEYSVGSSSYWRDFL-ATNSKYIFGGGATSGITTTGYSVSSTN 375 (729) T ss_pred cccccccceeeeccccccccCcccceeeeeeeeeccccccccccccccceee-ccccceeeecccccccccccccccccc Confidence 000000000000000111122223333332 222222222111111111111 1111111100 Q ss_pred -------------cCCCccccceeeeeccCcCCccc----------cchhHHHhhhhhhhccchhccccccccccccCcc Q lcl|NC_019538. 343 -------------AESWPTEYSGILTFGGGNSGNST----------ASAGDWIEGWDMFSDREHVDVNLFIAGSCAGEGV 399 (678) Q Consensus 343 -------------~~~~~~~~~~~~~l~gg~dg~~~----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 399 (678) ...........+.+++|.++... ....++..++.++...+.+.++.++.+....++ T Consensus 376 ~~~~~~~~~~~a~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~g~~~l~~~~~~~~~~~~~~~~~~~~- 454 (729) T protein:vir:10 376 TLDTDSGWDQNAEGVNFGASGVATLTLAGGTNYGDKTDLTTSGALSSGVDDIISGYTLFENTEEIEVDFILMGAAHHPK- 454 (729) T ss_pred eeccccccccccccccccccceeEEEeecccccccccccccccccccchhHHHHHHHHhhcccccccceeeecCCCCCc- Confidence 00111222345667777665332 223456678888887777777666655444443 Q ss_pred cchhHHHHHHHHHHHhcCCeEEEEccccchhccccc---------cCCHHHHHHHHhcccccccchhhccccccceEEEE Q lcl|NC_019538. 400 EIASTVQKSVAAICDERQDCLGWISPPREYMVNLPV---------ATAVKKMVEWRRGVTDSGVVVDDNMNIGTTYSSTS 470 (678) Q Consensus 400 ~~~~~v~~~l~~~~~~~~~~~~i~d~p~~~~~~~~~---------~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~ 470 (678) .+...++.+|++||+++++|++++|+|+...+.... .+..+++.+|++.+ ..++|+++| T Consensus 455 ~~~~~v~~a~~~~~~~~~~~~a~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~------------~~~~~~~~~ 522 (729) T protein:vir:10 455 EQSQAVAEKVTAVAEARKDAVAFISPYRQAFLNDTVSGTVTVSNIDQTTENVVGFYAPL------------SSSTYSVFD 522 (729) T ss_pred cchHHHHHHHHHHHHhcCCeEEEecccccccccccccccccccccchhhHHHHHHHhhc------------cCCceEEEE Confidence 456789999999999999999999999766544322 23445566666533 347899999 Q ss_pred cCeEEEecccCCceeEechHHHHHHHHHHHhhcCCceECcCCcchhheeecccceecCChhhhhhhhhCCcEEEEEecCC Q lcl|NC_019538. 471 ANYKLQYDKYNDTNRWIPLSADMAGLCARTDTVAQPWMSPAGFNRGQILDVRKLAIETRQAHRDELYQNSMNPVVGFPGQ 550 (678) Q Consensus 471 ~p~~~v~d~~~~~~~~~pps~~vAG~~a~~d~~~g~~~sPan~~~~~v~~~~~~~~~~~~~e~~~L~~~gIn~i~~~~~~ 550 (678) |||++++|+.++..+++|||+++||+|||+|.++||||||+|+++.+|.|+.++++.++++|++.||++|||||++|+++ T Consensus 523 ~p~~~~~d~~~~~~~~~p~s~~~aGl~a~~d~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~ 602 (729) T protein:vir:10 523 SGYKYMFDRFNNTFRYVPLNGDIAGTCARTDIEQFPWFSPAGTARGPILNSVKLVYNPGKKQRDILYSNRINPVILSPGA 602 (729) T ss_pred cCeeEEecccCCceEEechhHHHHHHHHHhhccCCcEEccCCccccceecccceeeecChhhHhhhhhCCceEEEEecCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cEEEeccccCCCCccccceeehhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhcCCeeeeEEEEccC Q lcl|NC_019538. 551 GFILYGDKTMSLQPTPFDRINVRRLFNLLKKSISESAKYKLFENNDAFTRNSFRSEVNSYLDSIKSLGGIYDFRVVCDET 630 (678) Q Consensus 551 G~~~wG~rT~~~~~~~~~~i~vrR~~~~i~~si~~~~~~~vfepn~~~~~~~v~~~i~~~L~~l~~~gal~g~~V~~d~~ 630 (678) |+++||+||+++.+++|+|||||||++||+++|+++++|+||||||+.+|++|+++|++||++||++|+|.||+|+||++ T Consensus 603 G~~~wG~rT~~~~d~~~~~i~vrR~~~~i~~si~~~~~~~v~epn~~~~~~~i~~~i~~~L~~l~~~g~l~g~~v~~d~~ 682 (729) T protein:vir:10 603 GIILFGDKTGFGKSSAFDRINVRRLFIYLEDAISAAAKDQLFEFNDELTRTNFVNIVEPFLRDVQAKRGIFDFVVICDET 682 (729) T ss_pred eEEEEcceecCCCCcccceeehhhhHHHHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHHHHHHHhccceeeeEEEEcCC Confidence 99999999997667899999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCHHHhhCCEEEEEEEEEecCCceEEEEEEEEeecCceeeeccccC Q lcl|NC_019538. 631 NNTPAVIDRNEFVATILIKPARSINYVSLNFSAVGTSANFDELVGPQ 677 (678) Q Consensus 631 ~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~~~~ 677 (678) +||++||++|+|+++|+++|++|+|||+|||+|++++++|+|+++++ T Consensus 683 ~nt~~~i~~G~~~~~v~~~p~~p~e~i~~~~~~~~~~~~~~e~~~~~ 729 (729) T protein:vir:10 683 NNTAAVIDSNEFVADIFIKPARSINFIGLTFVATRTGVAFEEVIGSV 729 (729) T ss_pred CCCHHHhhCCeEEEEEEEEecCCccEEEEEEEEeecCccHHHHHhcC Confidence 99999999999999999999999999999999999999999999999 No 16 >protein:vir:79092 Length: 477 # NCBI annotation: gp13, phage tail sheath protein # Family: family:all:115 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111213;genbank:gi:134288804;genbank:GeneID:4960766 Probab=100.00 E-value=2.1e-108 Score=610.89 Aligned_cols=470 Identities=17% Similarity=0.174 Sum_probs=328.3 Q ss_pred Cc-eecCceEEEEc-CCCcccccCCccceeEEecccCCCCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHHhcCCeE Q lcl|NC_019538. 1 MA-LLSPGVESKEN-NMQTTIARSSTGRAALAGKFQWGPAYQISQLVSETDLIDRFGRPDNQTADSVLSAINFLKYGNDL 78 (678) Q Consensus 1 ~~-~~~PGVyveEv-~~~~~i~~v~tsv~afvG~~~~Gpv~~pv~i~s~~~~~~~FG~~~~~~~~~~~v~~~f~ngG~~~ 78 (678) |+ |++|||||||+ +++++|++|+|+|++|||++++||+|+|++|+||.||++ ||+.....+++++++.||.|||++| T Consensus 1 M~~~~~pGVyv~E~~~g~~~I~~v~Tsv~~~VG~a~~~p~n~pv~its~~d~~~-~g~~~~~~tL~~Av~~~f~ngg~~~ 79 (477) T protein:vir:79 1 MAANYLHGVETIEKETGSRPVKVVKSAVIGLIGTAPIGPVNTPVQSLSDVDAAQ-FGPQLAGFTIPQALDAVYDYGSGTV 79 (477) T ss_pred CcCCCCCCeEEEEecCCcccccccCCceEEEEeecccCCCcccEEEccHHHHHH-hcCCCCCCcHHHHHHHHhhcCCceE Confidence 76 67999999999 589999999999999999999999999999999999986 7888888899999999999999999 Q ss_pred EEEEcCCcccccccccccccceeeeecccccccccceeeecccccccccccccccccccccceeeecccccccccceeec Q lcl|NC_019538. 79 RTVRILDEDTARNSSPFFETIDYTITSPGVDYRIGDDVKILQNGATITTGKVSALNSVGGITFVRFSTAEVVKKAKELND 158 (678) Q Consensus 79 ~vvRv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~a~~~~~ 158 (678) |||||.++++.......... . ....... ..... .... ... T Consensus 80 ~vvrV~~~~~~~~~~a~~~~---------~---------~~~~~~~--------~~~~~-~~~~------------~~~- 119 (477) T protein:vir:79 80 IVINVLDPAVHKSNAASESV---------T---------FDAATGR--------AKLAH-PAAA------------NLV- 119 (477) T ss_pred EEEeccCCcccccccccccc---------c---------ccccccc--------ccccc-cccc------------eeE- Confidence 99999876543322111000 0 0000000 00000 0000 000 Q ss_pred ccccccceeeeeeecccccccccceeeeeeccccceeeecccccccccccccccccchhccccccccceeeecccccccc Q lcl|NC_019538. 159 YPALQNGWQIQFTSGGPGSGQSATAVLNGIRQDSKIYIRNDEYSRESLLRRDETTETYIDMCESYGIPVVASRYAGLTGD 238 (678) Q Consensus 159 ~~~~~~~~~~~~~s~~~~~g~~a~~~~~~~~~~~~i~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~i~A~~~G~~gn 238 (678) .... ... .. .....+... .... ....... .+..+. T Consensus 120 -----------v~~~---~~~-~~---~~~~~~~~~-----------------------~~~~-~~~~~~~---~~~~~~ 154 (477) T protein:vir:79 120 -----------LKND---SGG-TT---YTEGTDYAV-----------------------DLIN-GVITRIK---TGTIPA 154 (477) T ss_pred -----------Eeec---ccc-cc---cccCccccc-----------------------cccc-hhhhhhh---cccccc Confidence 0000 000 00 000000000 0000 0000000 000000 Q ss_pred ceeEEeccccccccccccccccccccccccccccccceeeeeccccccccccccccceeeeccCCeeeeeeeeecccccc Q lcl|NC_019538. 239 NIQVAFIAYKDYYKFGVDGKISSVNTVNLKTFPSGLSFGNITPSSYLEYGPQTKDQFAMIVFVGGSAVESRILSVKENDR 318 (678) Q Consensus 239 ~i~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~ 318 (678) ... ............ ... T Consensus 155 ~~~--------------------------------------~~~~~~~~~~~~------------~~~------------ 172 (477) T protein:vir:79 155 AAT--------------------------------------AAKATYDYADPT------------KVT------------ 172 (477) T ss_pred ccc--------------------------------------eeeceeccCCcc------------cce------------ Confidence 000 000000000000 000 Q ss_pred ccccchhhhhhhhcCCcceEEEEecCCCccccceeeeeccCcCCccccchhHHHhhhhhhhccchhccccccccccccCc Q lcl|NC_019538. 319 DIYGSSIYVDEFFINGYSTFIQGVAESWPTEYSGILTFGGGNSGNSTASAGDWIEGWDMFSDREHVDVNLFIAGSCAGEG 398 (678) Q Consensus 319 ~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~l~gg~dg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 398 (678) . ..+ .+..+. .... .+...+ ........+.+.....|+ T Consensus 173 ----~-------------~~~-----------------~g~~~a-~~~~-----tg~~al--~~~~~~~~~~~~iv~apg 210 (477) T protein:vir:79 173 ----A-------------ADI-----------------IGAVNA-AGMR-----TGMKAL--KDTYNLYGYFSKILIAPA 210 (477) T ss_pred ----e-------------eee-----------------cccccc-cccc-----hhhhhh--hhhhhhcccccceeeccc Confidence 0 000 000000 0000 001111 111112223344556788 Q ss_pred ccchhHHHHHHHHHHHhcCCeEEEEccccchhccccccCCHHHHHHHHhcccccccchhhccccccceEEEEcCeEEEec Q lcl|NC_019538. 399 VEIASTVQKSVAAICDERQDCLGWISPPREYMVNLPVATAVKKMVEWRRGVTDSGVVVDDNMNIGTTYSSTSANYKLQYD 478 (678) Q Consensus 399 ~~~~~~v~~~l~~~~~~~~~~~~i~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~v~d 478 (678) +++...|+.+|.++|+++ .||+++|+|. +.+.+++.+|++.... ...+++|+|+++||||++++| T Consensus 211 ~~~~~~v~~~l~~~~~~~-~~~a~~d~p~--------~~~~~~~~~~~~~~~~------~~~~~~s~~~~~~~p~~~~~~ 275 (477) T protein:vir:79 211 YCTQNSVSVELEAMAVQL-GAIAYIDAPI--------GTTLAQALAGRGPAGT------INFNTSSDRVRLCYPHVKVYD 275 (477) T ss_pred cccchhHHHHHHHHHhhc-CeEEEEecCC--------CCChHHHhhhhhhccc------cccccccceEEEEcCeeEEec Confidence 888889999999999987 5889999874 4567888888865432 345788999999999999999 Q ss_pred ccCCceeEechHHHHHHHHHHHhhcCCceECcCCcchhheeeccc---ceecCChhhhhhhhhCCcEEEEEecCCcEEEe Q lcl|NC_019538. 479 KYNDTNRWIPLSADMAGLCARTDTVAQPWMSPAGFNRGQILDVRK---LAIETRQAHRDELYQNSMNPVVGFPGQGFILY 555 (678) Q Consensus 479 ~~~~~~~~~pps~~vAG~~a~~d~~~g~~~sPan~~~~~v~~~~~---~~~~~~~~e~~~L~~~gIn~i~~~~~~G~~~w 555 (678) +.++..+++|||+++||+|||+|.++||||||+|+++.+|.++.. ....++++|++.||++|||+|++|+++|+++| T Consensus 276 ~~~~~~~~~p~s~~~ag~~a~~d~~~g~~~span~~~~gv~~~~~~~~~~~~~~~~~~~~L~~~~i~~i~~~~~~G~~~w 355 (477) T protein:vir:79 276 IATNAERLEPLSSRAAGLRARVDLDKGYWWSSSNQQLVGVTGVERPLSAMIDDPQSDVNMLNEQGITTVFSSYGSGLRLW 355 (477) T ss_pred ccCCceeeechHHHHHHHHHHhhccCCceEccCCceeecceecccccccccCCChhhHHHHhhCCceEEEEecCCcEEEE Confidence 999999999999999999999999999999999999988877643 23445778999999999999999999999999 Q ss_pred ccccCC--CCccccceeehhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhcCCeeeeEEEEccCCCC Q lcl|NC_019538. 556 GDKTMS--LQPTPFDRINVRRLFNLLKKSISESAKYKLFENNDAFTRNSFRSEVNSYLDSIKSLGGIYDFRVVCDETNNT 633 (678) Q Consensus 556 G~rT~~--~~~~~~~~i~vrR~~~~i~~si~~~~~~~vfepn~~~~~~~v~~~i~~~L~~l~~~gal~g~~V~~d~~~nt 633 (678) |+||++ +++++|+||||||+|++|+++|++.++|+|||||++.+|++|+.+|++||++||++|+|.||+|+||+++|| T Consensus 356 G~rT~~~~~~~~~~~~i~vrR~~~~i~~~~~~~~~~~v~e~~~~~~~~~i~~~i~~~l~~l~~~g~l~g~~v~~~~~~nt 435 (477) T protein:vir:79 356 GNRTAAWPTVTHMRNFENVRRTGDVINESLRYFSQQFVDAPIDQGLIDSLVESVNGFGRKLIGDGALLGFKAWFDPARNP 435 (477) T ss_pred cccccCCCCCCccceeeehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCCCC Confidence 999996 344579999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHhhCCEEEEEEEEEecCCceEEEEEEEEeecCceeeeccccCC Q lcl|NC_019538. 634 PAVIDRNEFVATILIKPARSINYVSLNFSAVGTSANFDELVGPQN 678 (678) Q Consensus 634 ~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~~~~~ 678 (678) ++||++|+|+++|+++|++|+|||+|+|++.... |+++.+= | T Consensus 436 ~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~--~~~~~~~-~ 477 (477) T protein:vir:79 436 KEELAAGHLLINYKYTVPPPLERLTYETEITSEY--LLTLKGG-N 477 (477) T ss_pred HHHhhCCeEEEEEEEEecCCceeEEEEEEEechH--HhhhccC-C Confidence 9999999999999999999999999999886664 6555444 4 No 17 >protein:vir:107865 Length: 477 # NCBI annotation: gp39 # Family: family:all:115 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024712;genbank:gi:48696949;genbank:GeneID:2845953 Probab=100.00 E-value=2.5e-107 Score=604.99 Aligned_cols=470 Identities=16% Similarity=0.157 Sum_probs=332.3 Q ss_pred Cc-eecCceEEEEc-CCCcccccCCccceeEEecccCCCCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHHhcCCeE Q lcl|NC_019538. 1 MA-LLSPGVESKEN-NMQTTIARSSTGRAALAGKFQWGPAYQISQLVSETDLIDRFGRPDNQTADSVLSAINFLKYGNDL 78 (678) Q Consensus 1 ~~-~~~PGVyveEv-~~~~~i~~v~tsv~afvG~~~~Gpv~~pv~i~s~~~~~~~FG~~~~~~~~~~~v~~~f~ngG~~~ 78 (678) |+ |++|||||||+ +++++|++|+|+|++|||++++||+|+|++|+||.|| +.||+.....++.+++++||.|||++| T Consensus 1 M~~~~~pGVyv~E~~~~~~~i~~v~T~v~~~VG~a~~gp~n~pv~its~~d~-~~~g~~~~~~tL~~Av~~~f~nGg~~~ 79 (477) T protein:vir:10 1 MAANYLHGVETIEKETGSRPVKVVKSAVIGLIGTAPIGPVNTPVQSLSDVDA-AQFGPQLAGFTIPQALDAVYDYGSGTV 79 (477) T ss_pred CcccCCCCeEEEEccCCcccccccCCceeEEEecccCCCCCcCEEEccHHHH-HHhccCCCCCcHHHHHHHHHhccceEE Confidence 77 67899999999 5899999999999999999999999999999999999 569999999999999999999999999 Q ss_pred EEEEcCCcccccccccccccceeeeecccccccccceeeecccccccccccccccccccccceeeecccccccccceeec Q lcl|NC_019538. 79 RTVRILDEDTARNSSPFFETIDYTITSPGVDYRIGDDVKILQNGATITTGKVSALNSVGGITFVRFSTAEVVKKAKELND 158 (678) Q Consensus 79 ~vvRv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~a~~~~~ 158 (678) |||||.+++......... .... .. ... . T Consensus 80 ~vVrV~~~~~~~~~~~~~------------------~~~~----------------~~--------------~~~----~ 107 (477) T protein:vir:10 80 IVINVLDPAVHKSNAANE------------------PVTF----------------DA--------------ATG----R 107 (477) T ss_pred EEEecCcccccccccccc------------------cccc----------------cc--------------ccc----e Confidence 999997653221110000 0000 00 000 0 Q ss_pred ccccccceeeeeeecccccccccceeeeeeccccceeeecccccccccccccccccchhccccccccceeeecccccccc Q lcl|NC_019538. 159 YPALQNGWQIQFTSGGPGSGQSATAVLNGIRQDSKIYIRNDEYSRESLLRRDETTETYIDMCESYGIPVVASRYAGLTGD 238 (678) Q Consensus 159 ~~~~~~~~~~~~~s~~~~~g~~a~~~~~~~~~~~~i~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~i~A~~~G~~gn 238 (678) . ... +.+.+ T Consensus 108 ~------------------------------~~~-----------------------------------------~~~~~ 116 (477) T protein:vir:10 108 A------------------------------KLA-----------------------------------------HPAAA 116 (477) T ss_pred e------------------------------ccc-----------------------------------------ccccc Confidence 0 000 00000 Q ss_pred ceeEEeccccccccccccccccccccccccccccccceeeeeccccccccccccccceeeeccCCeeeeeeeeecccccc Q lcl|NC_019538. 239 NIQVAFIAYKDYYKFGVDGKISSVNTVNLKTFPSGLSFGNITPSSYLEYGPQTKDQFAMIVFVGGSAVESRILSVKENDR 318 (678) Q Consensus 239 ~i~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~ 318 (678) ...+. .... ........ ..... ..... .... ...... T Consensus 117 ~~~v~--~~a~-----------------~~~~~~~~-~~~~~-------~~~~~---~~~~---------~~~~~~---- 153 (477) T protein:vir:10 117 NLVLK--NDSG-----------------GTTYAEGT-DYAVD-------LINGV---ITRI---------KTGTIP---- 153 (477) T ss_pred ccccc--cccc-----------------ccccccch-hhhhh-------hcccc---ceec---------cccccc---- Confidence 00000 0000 00000000 00000 00000 0000 000000 Q ss_pred ccccchhhhhhhhcCCcceEEEEecCCCccccceeeeeccCcCCccccchhHHHhhhhhhhccchhccccccccccccCc Q lcl|NC_019538. 319 DIYGSSIYVDEFFINGYSTFIQGVAESWPTEYSGILTFGGGNSGNSTASAGDWIEGWDMFSDREHVDVNLFIAGSCAGEG 398 (678) Q Consensus 319 ~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~l~gg~dg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 398 (678) ..............+.. .......+..+. +...+ ++..+ .+.+....+.+.....|+ T Consensus 154 --------------~~~~~~~~~~~~~~~~~-~~~~~~~g~~~~-~~~~t-----Gl~al--~~~~~~~~~~~~~l~apg 210 (477) T protein:vir:10 154 --------------PGATAAKATYDYADPTK-VTAADIIGAVNA-AGMRT-----GMKAL--KDTYNLYGYFSKILIAPA 210 (477) T ss_pred --------------ccceeeeeccccccccc-cccccccccccc-cchhh-----hhhhh--hhhhhhcchhcccccccc Confidence 00000000000000000 000011111111 11111 12222 222223334556667788 Q ss_pred ccchhHHHHHHHHHHHhcCCeEEEEccccchhccccccCCHHHHHHHHhcccccccchhhccccccceEEEEcCeEEEec Q lcl|NC_019538. 399 VEIASTVQKSVAAICDERQDCLGWISPPREYMVNLPVATAVKKMVEWRRGVTDSGVVVDDNMNIGTTYSSTSANYKLQYD 478 (678) Q Consensus 399 ~~~~~~v~~~l~~~~~~~~~~~~i~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~v~d 478 (678) +++...|+.+|.++|+++ .|++++|.|. ..+.+++.+|++.... ...+++|+|++++|||++++| T Consensus 211 ~~~~~~v~~~l~~~~~~~-~~~~~~d~p~--------~~~~~~~~~~~~~~~~------~~~~~~s~~~~~~~p~~~~~d 275 (477) T protein:vir:10 211 YCTQNSVSVELEAMAVQL-GAIAYIDAPI--------GTTLAQALAGRGPAGT------INFNTSSDRVRLCYPHVKVYD 275 (477) T ss_pred cccchhhHHHHHHHHhhC-CEEEEEecCC--------CCCHHHHHhhhhhccc------cccccccceEEEEcCeEEEec Confidence 888889999999999987 5789988873 4567889999875432 345788999999999999999 Q ss_pred ccCCceeEechHHHHHHHHHHHhhcCCceECcCCcchhheeeccc---ceecCChhhhhhhhhCCcEEEEEecCCcEEEe Q lcl|NC_019538. 479 KYNDTNRWIPLSADMAGLCARTDTVAQPWMSPAGFNRGQILDVRK---LAIETRQAHRDELYQNSMNPVVGFPGQGFILY 555 (678) Q Consensus 479 ~~~~~~~~~pps~~vAG~~a~~d~~~g~~~sPan~~~~~v~~~~~---~~~~~~~~e~~~L~~~gIn~i~~~~~~G~~~w 555 (678) +.++..+++|||+++||++||+|.++||||||+|+++.+|.++.. ....++++|++.||++|||+|++|+++|+++| T Consensus 276 ~~~~~~~~~p~s~~~ag~~a~~d~~~g~~~span~~~~gi~~~~~~~~~~~~~~~~~~~~L~~~gi~~i~~~~~~G~~~w 355 (477) T protein:vir:10 276 TATNAERLEPLSSRAAGLRARVDLDKGYWWSSSNQQLVGVTGVERPLSAMIDDPQSDVNMLNEQGITTVFSSYGSGLRLW 355 (477) T ss_pred ccCCceeEEchHHHHHHHHHHhhhcCCceeccCCceeccccccccccccccCCChhhHHHHhhCCceEEEEecCCcEEEE Confidence 999999999999999999999999999999999999988888653 23445778999999999999999999999999 Q ss_pred ccccCCC--CccccceeehhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhcCCeeeeEEEEccCCCC Q lcl|NC_019538. 556 GDKTMSL--QPTPFDRINVRRLFNLLKKSISESAKYKLFENNDAFTRNSFRSEVNSYLDSIKSLGGIYDFRVVCDETNNT 633 (678) Q Consensus 556 G~rT~~~--~~~~~~~i~vrR~~~~i~~si~~~~~~~vfepn~~~~~~~v~~~i~~~L~~l~~~gal~g~~V~~d~~~nt 633 (678) |+||++. +++.|+||+|||++++|+++|+++++|+|||||++.+|++|+++|++||++||++|+|.||+|+||+++|| T Consensus 356 G~rT~~~~~~~~~~~~~~vrR~~~~i~~~~~~~~~~~v~~~~~~~~~~~i~~~i~~~l~~l~~~g~l~g~~v~~~~~~nt 435 (477) T protein:vir:10 356 GNRTAAWPTVTHMRNFENVRRTGDVINESLRYFSQQFVDAPIDQGLIDSLVESVNGFGRKLIGDGALLGFKAWFDPARNP 435 (477) T ss_pred cccccCCCCCCcccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCCCC Confidence 9999954 34579999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHhhCCEEEEEEEEEecCCceEEEEEEEEeecCceeeeccccCC Q lcl|NC_019538. 634 PAVIDRNEFVATILIKPARSINYVSLNFSAVGTSANFDELVGPQN 678 (678) Q Consensus 634 ~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~~~~~ 678 (678) ++||++|+|+++|+++|++|+|||+|++++... .|+|+.+= | T Consensus 436 ~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~--~~~~~~~g-~ 477 (477) T protein:vir:10 436 KEELAAGHLLINYKYTVPPPLERLTYETEITSE--YLLTLKGG-N 477 (477) T ss_pred HHHhhCCeEEEEEEEEecCCcceEEEEEEEcch--HHhhhhcC-C Confidence 999999999999999999999999999987555 36665544 4 No 18 >protein:vir:98824 Length: 774 # NCBI annotation: putative phage tail sheath protein # Family: family:all:661 # MgeID: mge:1530 # MgeName: Ma-LMM01 # Cross-refs: genbank:acc:YP_851105;genbank:gi:117530262;genbank:GeneID:4484488 Probab=100.00 E-value=5.4e-105 Score=592.20 Aligned_cols=482 Identities=15% Similarity=0.130 Sum_probs=314.4 Q ss_pred CceecCceEEEEc-CCCccccc-CCccceeEEecccCCCCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHHhcCCeE Q lcl|NC_019538. 1 MALLSPGVESKEN-NMQTTIAR-SSTGRAALAGKFQWGPAYQISQLVSETDLIDRFGRPDNQTADSVLSAINFLKYGNDL 78 (678) Q Consensus 1 ~~~~~PGVyveEv-~~~~~i~~-v~tsv~afvG~~~~Gpv~~pv~i~s~~~~~~~FG~~~~~~~~~~~v~~~f~ngG~~~ 78 (678) -+|-.|||||||+ +++++|+| |+||++||||.++|||+|+|++|+||.||.+.||..... ++|+++| T Consensus 281 ~~v~~~GVYVEEVpSGvrtIeGGV~TSVAAFVG~A~rGPvn~PvlITS~aD~~~~Fg~~~GG-----------l~GassA 349 (774) T protein:vir:98 281 RNVEDNGVVIQLEPALTGSISNRFSFYVTANDNTANRGFTTSPALVTTIPDPAIHFTSFQGG-----------LDGPRSA 349 (774) T ss_pred EEEecCceEEEEeCCCCccccccccceeeeecccccCCCCCcCEEEeehhHhhhhhccccCC-----------cccccee Confidence 5678999999999 68999998 999999999999999999999999999988777655321 2344444 Q ss_pred EEEEcCCcccccccccccccceeeeecccccccccceeeecccccccccccccccccccccceeeecccccccccceeec Q lcl|NC_019538. 79 RTVRILDEDTARNSSPFFETIDYTITSPGVDYRIGDDVKILQNGATITTGKVSALNSVGGITFVRFSTAEVVKKAKELND 158 (678) Q Consensus 79 ~vvRv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~a~~~~~ 158 (678) |.+..... + T Consensus 350 ~r~~~~~s--------------------G--------------------------------------------------- 358 (774) T protein:vir:98 350 FRDFYTFN--------------------G--------------------------------------------------- 358 (774) T ss_pred eeeeeeec--------------------c--------------------------------------------------- Confidence 32110000 0 Q ss_pred ccccccceeeeeeecccccccccceeeeeeccccceeeecccccccccccccccccchhccccccccceeeecccccccc Q lcl|NC_019538. 159 YPALQNGWQIQFTSGGPGSGQSATAVLNGIRQDSKIYIRNDEYSRESLLRRDETTETYIDMCESYGIPVVASRYAGLTGD 238 (678) Q Consensus 159 ~~~~~~~~~~~~~s~~~~~g~~a~~~~~~~~~~~~i~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~i~A~~~G~~gn 238 (678) .....+.|..+|.||| T Consensus 359 ----------------------------------------------------------------~~~L~i~A~~pGawGN 374 (774) T protein:vir:98 359 ----------------------------------------------------------------TPLLRLQAVSEGNWGN 374 (774) T ss_pred ----------------------------------------------------------------cceEEEEEeecCcCCC Confidence 0011223444455555 Q ss_pred ceeEEeccccccccccccccccccccccccccccccceeeeeccccccccccccccceeeeccCCeeeeeeeeecccccc Q lcl|NC_019538. 239 NIQVAFIAYKDYYKFGVDGKISSVNTVNLKTFPSGLSFGNITPSSYLEYGPQTKDQFAMIVFVGGSAVESRILSVKENDR 318 (678) Q Consensus 239 ~i~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~ 318 (678) .+++.+................... .......... .+......... .....+......+..... .....+. .. T Consensus 375 ~ItV~I~~~t~~~~~l~v~~~~~s~---f~~~~a~e~~-tv~~~~~~~~~-~v~e~~dn~~i~~~~~~~-~~~~in~-vs 447 (774) T protein:vir:98 375 QVTVSIYPVNNSEFRLNVQDLNGSA---FNPPLADEVY-TVKLGDTNESG-ELNALLDSKFIRGFFLPK-SIDSINY-DA 447 (774) T ss_pred ceEEEEEecCCceeEEEEEecCCcc---ccccccceeE-EEecccccccc-eeeeeeceeeEeeccccc-ccccccc-cc Confidence 5554443221100000000000000 0000000000 00000000000 000000000000000000 0000000 00 Q ss_pred ccccchhhhhhhhcCCcceE-EEEecCCCccccceeeeeccCcCCccccchhHHHhhhhhhhccchhccccccccccccC Q lcl|NC_019538. 319 DIYGSSIYVDEFFINGYSTF-IQGVAESWPTEYSGILTFGGGNSGNSTASAGDWIEGWDMFSDREHVDVNLFIAGSCAGE 397 (678) Q Consensus 319 ~~~~~~~~~~~~~~~~~~~~-v~~~~~~~~~~~~~~~~l~gg~dg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 397 (678) .................... ...............+++++|.|+.+. +..++....+.... ..+..++.+. T Consensus 448 ~lv~~~~~~~a~~d~~~~~~~~~~~~~~~~~~~~v~v~lagG~Dg~~t-t~~~igg~~~~~~~---tgi~aLl~a~---- 519 (774) T protein:vir:98 448 ALVRQSPLRLAPPDESETDVENPAHVDFYGPNVLVDVTLENGYDGPPV-TNDDYVSIIRTLEN---QPVHILLVGT---- 519 (774) T ss_pred cccccchhcccccccccccccccccccccCCcceEEEeecCCCCcccc-cchheecccccccc---cceeEEEcCc---- Confidence 00000000000000000000 000000011122345678899887553 34444433333322 2333343321 Q ss_pred cccchhHHHHHHHHHHHhc----CCeEEEEccccchhccccccCCHHHHHHHHhcccccccchhhccccccceEEEEcCe Q lcl|NC_019538. 398 GVEIASTVQKSVAAICDER----QDCLGWISPPREYMVNLPVATAVKKMVEWRRGVTDSGVVVDDNMNIGTTYSSTSANY 473 (678) Q Consensus 398 ~~~~~~~v~~~l~~~~~~~----~~~~~i~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~ 473 (678) ....++.+|+.||+.+ ++|++++|.| ++.+.+++++|++ +++|+|+++|||| T Consensus 520 ---~~~~V~~aii~~~e~~~~~~~~r~avid~p--------~g~t~~~Ai~~r~-------------~f~S~~aal~~Pw 575 (774) T protein:vir:98 520 ---TNVGVQQALITEAERASDSDGLRIAVLAAP--------PRTTPTLAASVTR-------------GFNSTRAVMVAGW 575 (774) T ss_pred ---cchhhHHHHHHHHHHhhhcccceEEEEECC--------CCCCHHHHHHHHh-------------ccCCceEEEEeCc Confidence 2244667777777664 7899999876 3567889999986 3668899999999 Q ss_pred EEEecccCCceeEechHHHHHHHHHHHhhcCCceECcCCcchhheeecc---cceecCChhhhhhhhhCCcEEEE-EecC Q lcl|NC_019538. 474 KLQYDKYNDTNRWIPLSADMAGLCARTDTVAQPWMSPAGFNRGQILDVR---KLAIETRQAHRDELYQNSMNPVV-GFPG 549 (678) Q Consensus 474 ~~v~d~~~~~~~~~pps~~vAG~~a~~d~~~g~~~sPan~~~~~v~~~~---~~~~~~~~~e~~~L~~~gIn~i~-~~~~ 549 (678) ++++|+.+++.+++|||+++||++||+| +||||+|+++.|+.+.. .+....++.|++.|++++||+++ .+++ T Consensus 576 vkv~D~~~g~~~~vPpSg~vAGl~ArtD----v~kSPANk~I~Givg~ai~~~l~~~~t~ae~d~Ln~~gIN~i~itt~g 651 (774) T protein:vir:98 576 FTYAGQPNSSRYGVPGAAVYAGKLAAID----FFVSPAARSLVGPLFNIIESDTDNYTSRSNQDIYSAARLEVLSLDTVD 651 (774) T ss_pred EEEeccCCCceeecChhHHHHHHHHhcC----cccccCCceeecceeccccccccccccchhhhhhcccccceeEEEEcC Confidence 9999999999999999999999999999 99999999988887653 23455688999999999999998 6889 Q ss_pred CcEEEeccccCCCCccccceeehhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhcCCeeeeE-EEEc Q lcl|NC_019538. 550 QGFILYGDKTMSLQPTPFDRINVRRLFNLLKKSISESAKYKLFENNDAFTRNSFRSEVNSYLDSIKSLGGIYDFR-VVCD 628 (678) Q Consensus 550 ~G~~~wG~rT~~~~~~~~~~i~vrR~~~~i~~si~~~~~~~vfepn~~~~~~~v~~~i~~~L~~l~~~gal~g~~-V~~d 628 (678) +|+++||+||+++|+ +|+||++||||+||+++|+++++|+||||||+.+|++|+++++.||++||++|+|.||+ |+|| T Consensus 652 ~G~rvWG~RTlssDp-~wr~InVRRlfd~Ie~SI~~~~~~~VfEPNd~~l~~~I~~sI~~fL~~L~~~GaL~G~~~V~~D 730 (774) T protein:vir:98 652 RTYRFASGVTLSTDP-AWERIYLRRVHDVVRQGAHAILRNYVAMPNSRLVRNQIAAALNAFMGELKRNGNIVSFRPAIID 730 (774) T ss_pred CcEEEEcccccCCCc-ccceEeehhhHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceecceEEEEc Confidence 999999999998875 79999999999999999999999999999999999999999999999999999999997 8999 Q ss_pred cCCCCHHHhhCCEEEEEEEEEecCCceEEEEEEEEeecCceeee Q lcl|NC_019538. 629 ETNNTPAVIDRNEFVATILIKPARSINYVSLNFSAVGTSANFDE 672 (678) Q Consensus 629 ~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~e 672 (678) +++||+++|++|+|+++|+++|++|+|||+|||+|..++.+|+| T Consensus 731 ~etNt~~dI~~G~l~i~I~vaP~~PAEfIilri~q~t~~~~l~E 774 (774) T protein:vir:98 731 GSNNSTAAYFSRELYVSLQFQPLYSADYIYVTISRDTETSPLGE 774 (774) T ss_pred CCCCCHHHhhCCEEEEEEEEEecCCcceEEEEEEEeecceeccC Confidence 99999999999999999999999999999999999999999999 No 19 >protein:vir:103168 Length: 641 # NCBI annotation: gp122 # Family: family:all:661 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717789;genbank:gi:113200626;genbank:GeneID:4239177 Probab=100.00 E-value=3.7e-96 Score=543.79 Aligned_cols=533 Identities=21% Similarity=0.312 Sum_probs=305.1 Q ss_pred Cc-eecCceEEEEcCCCcccccCCccceeEEecccCCCCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHHhcCCeEE Q lcl|NC_019538. 1 MA-LLSPGVESKENNMQTTIARSSTGRAALAGKFQWGPAYQISQLVSETDLIDRFGRPDNQTADSVLSAINFLKYGNDLR 79 (678) Q Consensus 1 ~~-~~~PGVyveEv~~~~~i~~v~tsv~afvG~~~~Gpv~~pv~i~s~~~~~~~FG~~~~~~~~~~~v~~~f~ngG~~~~ 79 (678) |. |+||||||||++.+++|+||+||++||||+|+|||+|+|++|+||.||+++||++++.+|++|+|++||+|||++|| T Consensus 3 m~~~~sPGVyv~E~~~~~~i~~v~tsvaafvG~~~~GP~~~p~~v~s~~d~~~~FG~~~~~~~l~~av~~fF~ngG~~~~ 82 (641) T protein:vir:10 3 VSNQLSPGVVIQERDLTAVTTPIGLNVGVLAAPFTKGPVEEIFEVSTERDLASVFGEPNDYNYEYWFTASQFLSYGGVLK 82 (641) T ss_pred CccccCCceEEEEecCCCcccccCCccceEEecccCCCCCccEEecCHHHHHHHcCCcCCCcchHHHHHHHHHhcCCEEE Confidence 76 99999999999877899999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEcCCcccccccc----c---------------ccccceeeeecccccccccceee--ecccccccccccccccccccc Q lcl|NC_019538. 80 TVRILDEDTARNSS----P---------------FFETIDYTITSPGVDYRIGDDVK--ILQNGATITTGKVSALNSVGG 138 (678) Q Consensus 80 vvRv~~~~~~~~~~----~---------------~~~~~~~~~~~~~~~~~~g~~i~--~~~~~~~~~~~~~~~~~~~~~ 138 (678) ||||.+.+...+.. . ..+.+...+..||.| |+.+. +.+.++............... T Consensus 83 vvRv~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~A~~pG~~---gn~l~v~i~~~~~~~~~~~~~~~~~~~~ 159 (641) T protein:vir:10 83 AIRLNAASLKNSVDSGTAPLIKNLQEYETTYESSNSNTFKFASRDAGAL---GNSVGIFITDAGPDQIAVLPAPGTGNEW 159 (641) T ss_pred EEEecCccccccccccchhhccccccccccccCcCccccEEEeccCCCc---CCceEEEEEcCCCcceeeeecccccccc Confidence 99997654222111 0 011222334445544 55444 333333221111111100000 Q ss_pred cceeeecccccccccceeecccccccc-eeeeeeeccccc--------------c-cccce--eeeeecccc-ceeeecc Q lcl|NC_019538. 139 ITFVRFSTAEVVKKAKELNDYPALQNG-WQIQFTSGGPGS--------------G-QSATA--VLNGIRQDS-KIYIRND 199 (678) Q Consensus 139 ~~~~~~~~a~~~~~a~~~~~~~~~~~~-~~~~~~s~~~~~--------------g-~~a~~--~~~~~~~~~-~i~~~~~ 199 (678) ....................+...... .........+.. . ..... .......+. ....... T Consensus 160 ~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~~~~~~~~~~a~~~~~~i~~~~~g~~g~~~~~~ 239 (641) T protein:vir:10 160 EFVADEAVTAASGASAKVYRYSIRLTLTNVVGTFVPGGATTISISGSDESVDVLAWDAGNKYLEIALPAGGVTGIFADAQ 239 (641) T ss_pred eeccceeeeeccCcccccccccccccccceeeecccCCcceeEecccccccccccccCCcceeeeeecCCcceeeeeeee Confidence 000000000000000000000000000 000000000000 0 00000 000000000 0000000 Q ss_pred ccc-ccccccccccccc-hhccccccccceeeeccccccccceeEEecccccccccccccccccccccccccccccccee Q lcl|NC_019538. 200 EYS-RESLLRRDETTET-YIDMCESYGIPVVASRYAGLTGDNIQVAFIAYKDYYKFGVDGKISSVNTVNLKTFPSGLSFG 277 (678) Q Consensus 200 ~~a-~~~~~~~~~~~~~-~~~~~~~~~~~~i~A~~~G~~gn~i~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 277 (678) ... +.........+.. ............+.+...+..+....+......+.......... .........+.... T Consensus 240 ~~t~gt~~~t~a~~g~~~~~~~~~~~~~ia~aat~ag~~g~~~~v~~~~v~d~~~a~~~~~g--~~~~~va~~~gts~-- 315 (641) T protein:vir:10 240 VVTQGTNTAAIASSGIERRLYIGKDSGSINFAATDAVVDTNATSATISSVRNEYAEREYLPG--SKWVNVAARPGTSL-- 315 (641) T ss_pred eccCCccceeeecccchhhhhhccccccceeeeecccccccceeeEeeeeeeeecccccccc--cccccccccchhhh-- Confidence 000 0000000000000 00001111122334444555555544433322222111110000 00000000000000 Q ss_pred eeeccccccccccccccceeee-------ccCCeeeeeee-eeccccccccccchhhhhhhhcCCcceEEEEecC----- Q lcl|NC_019538. 278 NITPSSYLEYGPQTKDQFAMIV-------FVGGSAVESRI-LSVKENDRDIYGSSIYVDEFFINGYSTFIQGVAE----- 344 (678) Q Consensus 278 ~~~~~~~~~~~~~~~~~~~~~v-------~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~----- 344 (678) -....+...+....+++ ...++++|++. ++...+..+..+...+....+ +..++|+..... T Consensus 316 -----~a~~~g~~~D~~~~lv~d~~~~~~g~~g~v~e~~~~~s~~~~~~~~~~~~~~~~~~~-~~~s~~v~~~~~~~~~~ 389 (641) T protein:vir:10 316 -----YANSVGGVNDELHVLVIDVDGKITGNPGSVLERFIGVSKASDAKTSIGEVNYYKEVI-KQQSAYVYWGSHETAPF 389 (641) T ss_pred -----hhhhcCCcccceEEEEEeecceeeccccceeeeeecccccCCcccccccceeeeeee-ccccceEEEeccccccc Confidence 00111222222222322 23456778886 555555555554444444433 334444432100 Q ss_pred ----------------------------------------CCccccceeeeeccCcCCcccc-----chhHHHhhhhhhh Q lcl|NC_019538. 345 ----------------------------------------SWPTEYSGILTFGGGNSGNSTA-----SAGDWIEGWDMFS 379 (678) Q Consensus 345 ----------------------------------------~~~~~~~~~~~l~gg~dg~~~~-----~~~~~~~~~~~~~ 379 (678) ...........|.||.|+.... ...+...+++++. T Consensus 390 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~gG~d~~~~~~~~~~~~~~~~tg~~~~~ 469 (641) T protein:vir:10 390 LGTAANAAAGDWGASALNRRYNLLRSTAGTTSFPAGAVTVGSKNNATHYYRLANGADYSASGALYNLSNVDIATAYELIE 469 (641) T ss_pred ccccccccccccccccccccccccccccccccccccccccCCCCcceeEEEeecCcccccccccccccchhHHHHHHHhh Confidence 0001122346788998875432 3456678899999 Q ss_pred ccchhccccccccccccCcccchhHHHHHHHHHHHhcCCeEEEEccccchhcccccc-CCHHHHHHHHhcccccccchhh Q lcl|NC_019538. 380 DREHVDVNLFIAGSCAGEGVEIASTVQKSVAAICDERQDCLGWISPPREYMVNLPVA-TAVKKMVEWRRGVTDSGVVVDD 458 (678) Q Consensus 380 ~~~~~~~~~~~~~~~~~~~~~~~~~v~~~l~~~~~~~~~~~~i~d~p~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~ 458 (678) ..+++.+++++++....+ ..+..+++.+|++|||+||+||+|+|+|+...++.+.. ...+++++||.. T Consensus 470 ~~e~~~i~~l~~~~~~~~-~~~~~~v~~~~i~~ce~~~d~~ailD~p~~~~~~~~~~~~~~~~~~~~~~~---------- 538 (641) T protein:vir:10 470 DPESQVIDYVLSGPAGAD-EAAAIAKATTITTIVESRKDCMAFLSPLRSDVIGVSNTTTVTENLVNYFNQ---------- 538 (641) T ss_pred hhhhhccceeeecCCCCC-cchhHHHHHHHHHHHHhcCCEEEEEcCCcccccCCCchhhHHHHHHHHHhh---------- Confidence 999999988888776544 34567899999999999999999999998877766544 356888999853 Q ss_pred ccccccceEEEEcCeEEEecccCCceeEechHHHHHHHHHHHhhcCCceECcCCcchhheeecccceecCChhhhhhhhh Q lcl|NC_019538. 459 NMNIGTTYSSTSANYKLQYDKYNDTNRWIPLSADMAGLCARTDTVAQPWMSPAGFNRGQILDVRKLAIETRQAHRDELYQ 538 (678) Q Consensus 459 ~~~~~s~~~~~~~p~~~v~d~~~~~~~~~pps~~vAG~~a~~d~~~g~~~sPan~~~~~v~~~~~~~~~~~~~e~~~L~~ 538 (678) +++|+|+++||||++++||.+++.+++||||++||+|||+|.+|||||||||.++..|.|+.++++.+++.|++.||+ T Consensus 539 --~~~s~yaa~y~P~~~v~dp~~~~~~~vPpsG~iAGv~ArtD~~rGvwkAPAn~~~~~i~gv~~l~~~~~~~e~~~Lnp 616 (641) T protein:vir:10 539 --LPSSNYVVFDSGYKYIYDKYNDVYRYVPCNGDIAGLILETGLEEEPWFSPAGFQRGVLRNAVKLAYSPNKTQRDRLYA 616 (641) T ss_pred --cCCCceEEEEeceeEeecccCCceeEecCCHHHHHHHHhhhccCCceECcCCcccceeeeeeeeeEecChhHHhhhhh Confidence 457899999999999999999999999999999999999999999999999999888999999999999999999999 Q ss_pred CCcEEEEEecCCcEEE----ecccc Q lcl|NC_019538. 539 NSMNPVVGFPGQGFIL----YGDKT 559 (678) Q Consensus 539 ~gIn~i~~~~~~G~~~----wG~rT 559 (678) +||||||.|||+|++- ++.+- T Consensus 617 ~gIN~ir~fpg~G~v~~~~~~~~~~ 641 (641) T protein:vir:10 617 NRINPVVSFPGHAMINNNIAFHTKL 641 (641) T ss_pred cccceEEecCCceeecceeeeeecC Confidence 9999999999999742 11111 No 20 >protein:vir:78206 Length: 390 # NCBI annotation: gp33, phage tail sheath protein # Family: family:all:115 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111183;genbank:gi:134288694;genbank:GeneID:4960666 Probab=100.00 E-value=2.6e-95 Score=539.14 Aligned_cols=379 Identities=15% Similarity=0.134 Sum_probs=305.7 Q ss_pred CceecCceEEEEc-CCCcccccCCccceeEEecccCC-----CCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHHhc Q lcl|NC_019538. 1 MALLSPGVESKEN-NMQTTIARSSTGRAALAGKFQWG-----PAYQISQLVSETDLIDRFGRPDNQTADSVLSAINFLKY 74 (678) Q Consensus 1 ~~~~~PGVyveEv-~~~~~i~~v~tsv~afvG~~~~G-----pv~~pv~i~s~~~~~~~FG~~~~~~~~~~~v~~~f~ng 74 (678) ++|.+|||||+|+ +++++|+.++|++.+|||+++++ |+|+|++|+|+.+|...||. ..++.+++..+|.|| T Consensus 2 ~~~~~~Gv~v~e~~~g~~~i~~~~tav~g~vg~a~~ad~~~~pln~pv~i~s~~~~~~~~g~---~gtL~~al~~~~~~g 78 (390) T protein:vir:78 2 PQDYHHGVRVIEINEGGRPIRSVSTAVLGVVCTAADADASAFPLNTPVLLTNVVAALGKAGK---KGTLRRTLDAIGKQT 78 (390) T ss_pred cccccCCeEEEEcCCCcccccccCcceeEEEEcccCcCccccccccceEeccHHHHHhhcCC---Cceehhhhhhhcccc Confidence 7788999999999 68999999999999999999876 99999999999999999995 467788999999999 Q ss_pred CCeEEEEEcCCcccccccccccccceeeeecccccccccceeeecccccccccccccccccccccceeeecccccccccc Q lcl|NC_019538. 75 GNDLRTVRILDEDTARNSSPFFETIDYTITSPGVDYRIGDDVKILQNGATITTGKVSALNSVGGITFVRFSTAEVVKKAK 154 (678) Q Consensus 75 G~~~~vvRv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~a~ 154 (678) |..||+||+...++...... .. T Consensus 79 g~~~~vv~v~~~~~~~~~~~-----------------------------------------------------~~----- 100 (390) T protein:vir:78 79 KPLTVVVRVAEGKDADETTS-----------------------------------------------------NV----- 100 (390) T ss_pred CceEEEEEeccccccccccc-----------------------------------------------------cc----- Confidence 99999999854321100000 00 Q ss_pred eeecccccccceeeeeeecccccccccceeeeeeccccceeeecccccccccccccccccchhccccccccceeeecccc Q lcl|NC_019538. 155 ELNDYPALQNGWQIQFTSGGPGSGQSATAVLNGIRQDSKIYIRNDEYSRESLLRRDETTETYIDMCESYGIPVVASRYAG 234 (678) Q Consensus 155 ~~~~~~~~~~~~~~~~~s~~~~~g~~a~~~~~~~~~~~~i~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~i~A~~~G 234 (678) ..... ..+ ...| T Consensus 101 -ig~~~---------------~~~----------------------------------------------------~~tg 112 (390) T protein:vir:78 101 -IGTVT---------------PDG----------------------------------------------------KYTG 112 (390) T ss_pred -ccccc---------------ccc----------------------------------------------------ccch Confidence 00000 000 0000 Q ss_pred ccccceeEEeccccccccccccccccccccccccccccccceeeeeccccccccccccccceeeeccCCeeeeeeeeecc Q lcl|NC_019538. 235 LTGDNIQVAFIAYKDYYKFGVDGKISSVNTVNLKTFPSGLSFGNITPSSYLEYGPQTKDQFAMIVFVGGSAVESRILSVK 314 (678) Q Consensus 235 ~~gn~i~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~ 314 (678) +.. T Consensus 113 -----~~a------------------------------------------------------------------------ 115 (390) T protein:vir:78 113 -----IKA------------------------------------------------------------------------ 115 (390) T ss_pred -----hhh------------------------------------------------------------------------ Confidence 000 Q ss_pred ccccccccchhhhhhhhcCCcceEEEEecCCCccccceeeeeccCcCCccccchhHHHhhhhhhhccchhcccccccccc Q lcl|NC_019538. 315 ENDRDIYGSSIYVDEFFINGYSTFIQGVAESWPTEYSGILTFGGGNSGNSTASAGDWIEGWDMFSDREHVDVNLFIAGSC 394 (678) Q Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~l~gg~dg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 394 (678) +... . ......+... T Consensus 116 ------------l~~~----~-------------------------------------------------~~~~~~p~il 130 (390) T protein:vir:78 116 ------------LLAA----Q-------------------------------------------------GALGVKPRIL 130 (390) T ss_pred ------------hhhh----h-------------------------------------------------hhhcceehhh Confidence 0000 0 0000000011 Q ss_pred ccCcccchhHHHHHHHHHHHhcCCeEEEEccccchhccccccCCHHHHHHHHhcccccccchhhccccccceEEEEcCeE Q lcl|NC_019538. 395 AGEGVEIASTVQKSVAAICDERQDCLGWISPPREYMVNLPVATAVKKMVEWRRGVTDSGVVVDDNMNIGTTYSSTSANYK 474 (678) Q Consensus 395 ~~~~~~~~~~v~~~l~~~~~~~~~~~~i~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~ 474 (678) ..|++++ .+++.+|+.+|++++ +++++|.| .+.+.+++.+|+++ ++|+|+++||||+ T Consensus 131 ~ap~~~~-~~v~~~l~~~a~~~~-~~aivD~p--------~~~t~~~a~~~~~~-------------~~s~~~~~~~p~~ 187 (390) T protein:vir:78 131 AAPGLDT-QPVAAALAATAQSLR-AMAYVSAS--------GCKTKEEAAAYRKQ-------------FGQREIMVIWPDW 187 (390) T ss_pred cccccch-HHHHHHHHHhhcccc-eEEEEecC--------CCCCHHHHHHHhhc-------------cCCceEEEEcCce Confidence 1222222 457889999999886 57888876 35678899999863 5688999999999 Q ss_pred EEecccCCceeEechHHHHHHHHHHHhhcCCceECcCCcchhheeeccc---ceecCChhhhhhhhhCCcEEEEEecCCc Q lcl|NC_019538. 475 LQYDKYNDTNRWIPLSADMAGLCARTDTVAQPWMSPAGFNRGQILDVRK---LAIETRQAHRDELYQNSMNPVVGFPGQG 551 (678) Q Consensus 475 ~v~d~~~~~~~~~pps~~vAG~~a~~d~~~g~~~sPan~~~~~v~~~~~---~~~~~~~~e~~~L~~~gIn~i~~~~~~G 551 (678) +++|+.++..+++|||+++||++||+|.++||||||+|+.+.++.++.. ........|.+.||++||+++++ ++| T Consensus 188 ~~~d~~~~~~~~~p~s~~~Agl~a~~D~~~g~~~spaN~~l~gi~~~~~~~~~~~~~~~~~~~~ln~~gi~t~~~--~~G 265 (390) T protein:vir:78 188 LGWDDTTNSTAVIPAPAIAAGLRAKIDNDIGWHKTISNVVVNGVSGISADVSWDLQDPATDAGYLNEHEVTTLVN--RNG 265 (390) T ss_pred EeecccCCcccccchHHHHHHHHHHhhcCCCcEECcCCceeeceeecceecccccccccchhhhhhhcCcEEEEc--CCC Confidence 9999999999999999999999999999999999999999988877543 23445667889999999999965 689 Q ss_pred EEEeccccCCCCccccceeehhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhcCCeeeeEEEEccCC Q lcl|NC_019538. 552 FILYGDKTMSLQPTPFDRINVRRLFNLLKKSISESAKYKLFENNDAFTRNSFRSEVNSYLDSIKSLGGIYDFRVVCDETN 631 (678) Q Consensus 552 ~~~wG~rT~~~~~~~~~~i~vrR~~~~i~~si~~~~~~~vfepn~~~~~~~v~~~i~~~L~~l~~~gal~g~~V~~d~~~ 631 (678) +++||+||+++|+ +|+||++||++++|+++|+++++|+|||||++.+|++|+++++.||++||++|+|.||+|+||+++ T Consensus 266 ~~~wG~rT~s~d~-~~~~i~~rR~~~~i~~~i~~~~~~~v~e~n~~~~~~~i~~~i~~~L~~l~~~g~l~g~~v~~d~~~ 344 (390) T protein:vir:78 266 FRFWGERTCSDDP-KFAFENYTRTAQVAGDSIAEAQMPVVDGPLNPSLARDIVESINGWFRQQVANGYLIGGSAWIDPEP 344 (390) T ss_pred EEEEcccccCCCc-ccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEccCC Confidence 9999999998875 799999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCHHHhhCCEEEEEEEEEecCCceEEEEEEEEeecCceeeeccccCC Q lcl|NC_019538. 632 NTPAVIDRNEFVATILIKPARSINYVSLNFSAVGTSANFDELVGPQN 678 (678) Q Consensus 632 nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~~~~~ 678 (678) ||++||++|+|+++|+++|++|+|||+|+++.... .+++++++++ T Consensus 345 nt~~~i~~G~~~~~v~~~p~~pae~I~~~~~~~~~--~~~~~~~~~~ 389 (390) T protein:vir:78 345 NTADILASGKAYIDYDYTPVPPLENLVLRQRITDR--FLADFPARVA 389 (390) T ss_pred CCHHHhhCCeEEEEEEEEecCCcceEEEEEEEchH--HHHHHHHHhc Confidence 99999999999999999999999999999875444 5899999999 No 21 >protein:vir:103993 Length: 390 # NCBI annotation: phage tail sheath protein # Family: family:all:115 # MgeID: mge:1665 # MgeName: phi52237 # Cross-refs: genbank:acc:YP_293729;genbank:gi:72537699;genbank:GeneID:3608123 Probab=100.00 E-value=2.6e-95 Score=539.14 Aligned_cols=379 Identities=15% Similarity=0.134 Sum_probs=305.7 Q ss_pred CceecCceEEEEc-CCCcccccCCccceeEEecccCC-----CCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHHhc Q lcl|NC_019538. 1 MALLSPGVESKEN-NMQTTIARSSTGRAALAGKFQWG-----PAYQISQLVSETDLIDRFGRPDNQTADSVLSAINFLKY 74 (678) Q Consensus 1 ~~~~~PGVyveEv-~~~~~i~~v~tsv~afvG~~~~G-----pv~~pv~i~s~~~~~~~FG~~~~~~~~~~~v~~~f~ng 74 (678) ++|.+|||||+|+ +++++|+.++|++.+|||+++++ |+|+|++|+|+.+|...||. ..++.+++..+|.|| T Consensus 2 ~~~~~~Gv~v~e~~~g~~~i~~~~tav~g~vg~a~~ad~~~~pln~pv~i~s~~~~~~~~g~---~gtL~~al~~~~~~g 78 (390) T protein:vir:10 2 PQDYHHGVRVIEINEGGRPIRSVSTAVLGVVCTAADADASAFPLNTPVLLTNVVAALGKAGK---KGTLRRTLDAIGKQT 78 (390) T ss_pred cccccCCeEEEEcCCCcccccccCcceeEEEEcccCcCccccccccceEeccHHHHHhhcCC---Cceehhhhhhhcccc Confidence 7788999999999 68999999999999999999876 99999999999999999995 467788999999999 Q ss_pred CCeEEEEEcCCcccccccccccccceeeeecccccccccceeeecccccccccccccccccccccceeeecccccccccc Q lcl|NC_019538. 75 GNDLRTVRILDEDTARNSSPFFETIDYTITSPGVDYRIGDDVKILQNGATITTGKVSALNSVGGITFVRFSTAEVVKKAK 154 (678) Q Consensus 75 G~~~~vvRv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~a~ 154 (678) |..||+||+...++...... .. T Consensus 79 g~~~~vv~v~~~~~~~~~~~-----------------------------------------------------~~----- 100 (390) T protein:vir:10 79 KPLTVVVRVAEGKDADETTS-----------------------------------------------------NV----- 100 (390) T ss_pred CceEEEEEeccccccccccc-----------------------------------------------------cc----- Confidence 99999999854321100000 00 Q ss_pred eeecccccccceeeeeeecccccccccceeeeeeccccceeeecccccccccccccccccchhccccccccceeeecccc Q lcl|NC_019538. 155 ELNDYPALQNGWQIQFTSGGPGSGQSATAVLNGIRQDSKIYIRNDEYSRESLLRRDETTETYIDMCESYGIPVVASRYAG 234 (678) Q Consensus 155 ~~~~~~~~~~~~~~~~~s~~~~~g~~a~~~~~~~~~~~~i~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~i~A~~~G 234 (678) ..... ..+ ...| T Consensus 101 -ig~~~---------------~~~----------------------------------------------------~~tg 112 (390) T protein:vir:10 101 -IGTVT---------------PDG----------------------------------------------------KYTG 112 (390) T ss_pred -ccccc---------------ccc----------------------------------------------------ccch Confidence 00000 000 0000 Q ss_pred ccccceeEEeccccccccccccccccccccccccccccccceeeeeccccccccccccccceeeeccCCeeeeeeeeecc Q lcl|NC_019538. 235 LTGDNIQVAFIAYKDYYKFGVDGKISSVNTVNLKTFPSGLSFGNITPSSYLEYGPQTKDQFAMIVFVGGSAVESRILSVK 314 (678) Q Consensus 235 ~~gn~i~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~ 314 (678) +.. T Consensus 113 -----~~a------------------------------------------------------------------------ 115 (390) T protein:vir:10 113 -----IKA------------------------------------------------------------------------ 115 (390) T ss_pred -----hhh------------------------------------------------------------------------ Confidence 000 Q ss_pred ccccccccchhhhhhhhcCCcceEEEEecCCCccccceeeeeccCcCCccccchhHHHhhhhhhhccchhcccccccccc Q lcl|NC_019538. 315 ENDRDIYGSSIYVDEFFINGYSTFIQGVAESWPTEYSGILTFGGGNSGNSTASAGDWIEGWDMFSDREHVDVNLFIAGSC 394 (678) Q Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~l~gg~dg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 394 (678) +... . ......+... T Consensus 116 ------------l~~~----~-------------------------------------------------~~~~~~p~il 130 (390) T protein:vir:10 116 ------------LLAA----Q-------------------------------------------------GALGVKPRIL 130 (390) T ss_pred ------------hhhh----h-------------------------------------------------hhhcceehhh Confidence 0000 0 0000000011 Q ss_pred ccCcccchhHHHHHHHHHHHhcCCeEEEEccccchhccccccCCHHHHHHHHhcccccccchhhccccccceEEEEcCeE Q lcl|NC_019538. 395 AGEGVEIASTVQKSVAAICDERQDCLGWISPPREYMVNLPVATAVKKMVEWRRGVTDSGVVVDDNMNIGTTYSSTSANYK 474 (678) Q Consensus 395 ~~~~~~~~~~v~~~l~~~~~~~~~~~~i~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~ 474 (678) ..|++++ .+++.+|+.+|++++ +++++|.| .+.+.+++.+|+++ ++|+|+++||||+ T Consensus 131 ~ap~~~~-~~v~~~l~~~a~~~~-~~aivD~p--------~~~t~~~a~~~~~~-------------~~s~~~~~~~p~~ 187 (390) T protein:vir:10 131 AAPGLDT-QPVAAALAATAQSLR-AMAYVSAS--------GCKTKEEAAAYRKQ-------------FGQREIMVIWPDW 187 (390) T ss_pred cccccch-HHHHHHHHHhhcccc-eEEEEecC--------CCCCHHHHHHHhhc-------------cCCceEEEEcCce Confidence 1222222 457889999999886 57888876 35678899999863 5688999999999 Q ss_pred EEecccCCceeEechHHHHHHHHHHHhhcCCceECcCCcchhheeeccc---ceecCChhhhhhhhhCCcEEEEEecCCc Q lcl|NC_019538. 475 LQYDKYNDTNRWIPLSADMAGLCARTDTVAQPWMSPAGFNRGQILDVRK---LAIETRQAHRDELYQNSMNPVVGFPGQG 551 (678) Q Consensus 475 ~v~d~~~~~~~~~pps~~vAG~~a~~d~~~g~~~sPan~~~~~v~~~~~---~~~~~~~~e~~~L~~~gIn~i~~~~~~G 551 (678) +++|+.++..+++|||+++||++||+|.++||||||+|+.+.++.++.. ........|.+.||++||+++++ ++| T Consensus 188 ~~~d~~~~~~~~~p~s~~~Agl~a~~D~~~g~~~spaN~~l~gi~~~~~~~~~~~~~~~~~~~~ln~~gi~t~~~--~~G 265 (390) T protein:vir:10 188 LGWDDTTNSTAVIPAPAIAAGLRAKIDNDIGWHKTISNVVVNGVSGISADVSWDLQDPATDAGYLNEHEVTTLVN--RNG 265 (390) T ss_pred EeecccCCcccccchHHHHHHHHHHhhcCCCcEECcCCceeeceeecceecccccccccchhhhhhhcCcEEEEc--CCC Confidence 9999999999999999999999999999999999999999988877543 23445667889999999999965 689 Q ss_pred EEEeccccCCCCccccceeehhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhcCCeeeeEEEEccCC Q lcl|NC_019538. 552 FILYGDKTMSLQPTPFDRINVRRLFNLLKKSISESAKYKLFENNDAFTRNSFRSEVNSYLDSIKSLGGIYDFRVVCDETN 631 (678) Q Consensus 552 ~~~wG~rT~~~~~~~~~~i~vrR~~~~i~~si~~~~~~~vfepn~~~~~~~v~~~i~~~L~~l~~~gal~g~~V~~d~~~ 631 (678) +++||+||+++|+ +|+||++||++++|+++|+++++|+|||||++.+|++|+++++.||++||++|+|.||+|+||+++ T Consensus 266 ~~~wG~rT~s~d~-~~~~i~~rR~~~~i~~~i~~~~~~~v~e~n~~~~~~~i~~~i~~~L~~l~~~g~l~g~~v~~d~~~ 344 (390) T protein:vir:10 266 FRFWGERTCSDDP-KFAFENYTRTAQVAGDSIAEAQMPVVDGPLNPSLARDIVESINGWFRQQVANGYLIGGSAWIDPEP 344 (390) T ss_pred EEEEcccccCCCc-ccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEccCC Confidence 9999999998875 799999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCHHHhhCCEEEEEEEEEecCCceEEEEEEEEeecCceeeeccccCC Q lcl|NC_019538. 632 NTPAVIDRNEFVATILIKPARSINYVSLNFSAVGTSANFDELVGPQN 678 (678) Q Consensus 632 nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~~~~~ 678 (678) ||++||++|+|+++|+++|++|+|||+|+++.... .+++++++++ T Consensus 345 nt~~~i~~G~~~~~v~~~p~~pae~I~~~~~~~~~--~~~~~~~~~~ 389 (390) T protein:vir:10 345 NTADILASGKAYIDYDYTPVPPLENLVLRQRITDR--FLADFPARVA 389 (390) T ss_pred CCHHHhhCCeEEEEEEEEecCCcceEEEEEEEchH--HHHHHHHHhc Confidence 99999999999999999999999999999875444 5899999999 No 22 >protein:vir:5711 Length: 396 # NCBI annotation: gpFI # Family: family:all:115 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839870;genbank:gi:30065725;genbank:GeneID:1260618 Probab=100.00 E-value=9.4e-95 Score=536.07 Aligned_cols=385 Identities=15% Similarity=0.163 Sum_probs=307.9 Q ss_pred CceecCceEEEEc-CCCcccccCCccceeEEecccCC-----CCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHHhc Q lcl|NC_019538. 1 MALLSPGVESKEN-NMQTTIARSSTGRAALAGKFQWG-----PAYQISQLVSETDLIDRFGRPDNQTADSVLSAINFLKY 74 (678) Q Consensus 1 ~~~~~PGVyveEv-~~~~~i~~v~tsv~afvG~~~~G-----pv~~pv~i~s~~~~~~~FG~~~~~~~~~~~v~~~f~ng 74 (678) |+.-+|||||+|+ +++++|.++.|++++|||.++++ |.++|++|+|+.||...||. ..++.+++..+|.|+ T Consensus 1 m~~~~~GV~v~e~~~g~~~i~~v~tav~~~vg~a~~~d~~~~~~~~pv~i~s~~~~~~~~g~---~~tl~~al~~~~~~~ 77 (396) T protein:vir:57 1 MSDYHHGVQVLEINDGTRVISTVSTAIVGMVCTASDADAETFPLNKPVLITNVQSAIAKAGK---KGTLAASLQAIADQS 77 (396) T ss_pred CCCCCCceEEEEcCCCcccccccCCceEEEEEeccCCCcccccCccCeEeecchhhhhhccc---ccchHHHHHHhhhcC Confidence 9999999999999 59999999999999999999877 78999999999999999986 457889999999999 Q ss_pred CCeEEEEEcCCcccccccccccccceeeeecccccccccceeeecccccccccccccccccccccceeeecccccccccc Q lcl|NC_019538. 75 GNDLRTVRILDEDTARNSSPFFETIDYTITSPGVDYRIGDDVKILQNGATITTGKVSALNSVGGITFVRFSTAEVVKKAK 154 (678) Q Consensus 75 G~~~~vvRv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~a~ 154 (678) |..||++|+............ .. T Consensus 78 ~~~~~vv~~~~~~~~~~~~~~--------------------------------------------------------a~- 100 (396) T protein:vir:57 78 KPVTVVVRVEDGTGDDEETKL--------------------------------------------------------AQ- 100 (396) T ss_pred CceeEeeeccccccccccccc--------------------------------------------------------cc- Confidence 999999987533110000000 00 Q ss_pred eeecccccccceeeeeeecccccccccceeeeeeccccceeeecccccccccccccccccchhccccccccceeeecccc Q lcl|NC_019538. 155 ELNDYPALQNGWQIQFTSGGPGSGQSATAVLNGIRQDSKIYIRNDEYSRESLLRRDETTETYIDMCESYGIPVVASRYAG 234 (678) Q Consensus 155 ~~~~~~~~~~~~~~~~~s~~~~~g~~a~~~~~~~~~~~~i~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~i~A~~~G 234 (678) +....... ... .+ T Consensus 101 -------------------------t~~~iiG~-~~~-----------------------------------------~~ 113 (396) T protein:vir:57 101 -------------------------TVSNIIGT-TDE-----------------------------------------NG 113 (396) T ss_pred -------------------------cceeeeee-ccc-----------------------------------------cc Confidence 00000000 000 00 Q ss_pred ccccceeEEeccccccccccccccccccccccccccccccceeeeeccccccccccccccceeeeccCCeeeeeeeeecc Q lcl|NC_019538. 235 LTGDNIQVAFIAYKDYYKFGVDGKISSVNTVNLKTFPSGLSFGNITPSSYLEYGPQTKDQFAMIVFVGGSAVESRILSVK 314 (678) Q Consensus 235 ~~gn~i~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~ 314 (678) . ..++. T Consensus 114 ~-~tgl~------------------------------------------------------------------------- 119 (396) T protein:vir:57 114 Q-YTGLK------------------------------------------------------------------------- 119 (396) T ss_pred c-chhhh------------------------------------------------------------------------- Confidence 0 00000 Q ss_pred ccccccccchhhhhhhhcCCcceEEEEecCCCccccceeeeeccCcCCccccchhHHHhhhhhhhccchhcccccccccc Q lcl|NC_019538. 315 ENDRDIYGSSIYVDEFFINGYSTFIQGVAESWPTEYSGILTFGGGNSGNSTASAGDWIEGWDMFSDREHVDVNLFIAGSC 394 (678) Q Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~l~gg~dg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 394 (678) .+ .+...+..+++..+ T Consensus 120 -----------al-----------------------------------------------------~~~~~~~~~~p~i~ 135 (396) T protein:vir:57 120 -----------AL-----------------------------------------------------MGAESVTGVKPRIL 135 (396) T ss_pred -----------hh-----------------------------------------------------hhcccceeEEeccc Confidence 00 00000000111122 Q ss_pred ccCcccchhHHHHHHHHHHHhcCCeEEEEccccchhccccccCCHHHHHHHHhcccccccchhhccccccceEEEEcCeE Q lcl|NC_019538. 395 AGEGVEIASTVQKSVAAICDERQDCLGWISPPREYMVNLPVATAVKKMVEWRRGVTDSGVVVDDNMNIGTTYSSTSANYK 474 (678) Q Consensus 395 ~~~~~~~~~~v~~~l~~~~~~~~~~~~i~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~ 474 (678) ..|++++ ..++.+|.++|+++ .|++++|.|. +.+++++++||+. ++|+|+++||||+ T Consensus 136 ~ap~~~~-~~v~~al~~~~~~~-~~~~~~d~p~--------~~~~~~~~~~~~~-------------~~s~~~~~~~p~~ 192 (396) T protein:vir:57 136 GVPGLDT-KEVAVALASVCQEL-NAFGYISAWG--------CKTISEVKAYRQN-------------FSQRELMVIWPDF 192 (396) T ss_pred cCcccch-hHHHHHHHHHhhhC-ceEEEEcCCC--------CCCHHHHHHHHhc-------------cCCceEEEEccee Confidence 2333322 46889999999876 6888888763 5678999999863 5688999999999 Q ss_pred EEecccCCceeEechHHHHHHHHHHHhhcCCceECcCCcchhheeeccc---ceecCChhhhhhhhhCCcEEEEEecCCc Q lcl|NC_019538. 475 LQYDKYNDTNRWIPLSADMAGLCARTDTVAQPWMSPAGFNRGQILDVRK---LAIETRQAHRDELYQNSMNPVVGFPGQG 551 (678) Q Consensus 475 ~v~d~~~~~~~~~pps~~vAG~~a~~d~~~g~~~sPan~~~~~v~~~~~---~~~~~~~~e~~~L~~~gIn~i~~~~~~G 551 (678) +++|+.++..+++|||+++||++||+|.++|+||||+|+++.||.+... .....+++|++.||++|||++++ ++| T Consensus 193 ~~~d~~~~~~~~~p~s~~~Ag~~a~~d~~~g~~~spaN~~l~gi~~~~~~~~~~~~~~~~~~~~Ln~~gi~t~~~--~~G 270 (396) T protein:vir:57 193 LAWDTVTSTTATAYATARALGLRAKIDQEQGWHKTLSNVGVNGVTGISASVFWDLQKPGTDADLLNEAGVTTLVR--RDG 270 (396) T ss_pred eeecccCCceeEEehhHHHHHHHHHhhhccCcEeccCCceeccccccceecccccCCcchhhhhhhhcCcEEEEc--CCC Confidence 9999999999999999999999999999999999999999888877643 23445778999999999999965 689 Q ss_pred EEEeccccCCCCccccceeehhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhcCCeeeeEEEEccCC Q lcl|NC_019538. 552 FILYGDKTMSLQPTPFDRINVRRLFNLLKKSISESAKYKLFENNDAFTRNSFRSEVNSYLDSIKSLGGIYDFRVVCDETN 631 (678) Q Consensus 552 ~~~wG~rT~~~~~~~~~~i~vrR~~~~i~~si~~~~~~~vfepn~~~~~~~v~~~i~~~L~~l~~~gal~g~~V~~d~~~ 631 (678) +++||+||+++|+ +|+||+|||++++|+++|+++++|+|||||++.+|++|+++|+.||++||++|+|.||+|+||+++ T Consensus 271 ~~~wG~rT~~~d~-~~~~i~vrR~~~~i~~~i~~~~~~~v~e~n~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~ 349 (396) T protein:vir:57 271 FRFWGNRTCSDDP-LFLFESYTRTAQVLADTMAEAHMWAIDKPITATLIRDIIDGINAKFRELKNNGYIVDGTCWFSEES 349 (396) T ss_pred EEEEcccccCCCc-ccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeceEEEEecCC Confidence 9999999998875 799999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCHHHhhCCEEEEEEEEEecCCceEEEEEEEEeecCceeeeccccCC Q lcl|NC_019538. 632 NTPAVIDRNEFVATILIKPARSINYVSLNFSAVGTSANFDELVGPQN 678 (678) Q Consensus 632 nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~~~~~ 678 (678) ||+++|++|+|+++|+++|++|+|||+|++++... .++++++.++ T Consensus 350 n~~~~i~~G~~~~~v~~~p~~p~e~I~~~~~~~~~--~~~~~~~~~~ 394 (396) T protein:vir:57 350 NDAETLKAGKLYIDYDYTPVPPLENLTLRQRITSR--YLASLVTSVN 394 (396) T ss_pred CCHHHhhCCeEEEEEEEEecCCcceEEEEEEEchH--HHHHHHHHhh Confidence 99999999999999999999999999999986665 5888998888 No 23 >protein:vir:6079 Length: 396 # NCBI annotation: gpFI # Family: family:all:115 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878220;genbank:gi:33438919;genbank:GeneID:1457754 Probab=100.00 E-value=1.1e-94 Score=535.72 Aligned_cols=385 Identities=16% Similarity=0.162 Sum_probs=309.3 Q ss_pred CceecCceEEEEc-CCCcccccCCccceeEEecccCC-----CCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHHhc Q lcl|NC_019538. 1 MALLSPGVESKEN-NMQTTIARSSTGRAALAGKFQWG-----PAYQISQLVSETDLIDRFGRPDNQTADSVLSAINFLKY 74 (678) Q Consensus 1 ~~~~~PGVyveEv-~~~~~i~~v~tsv~afvG~~~~G-----pv~~pv~i~s~~~~~~~FG~~~~~~~~~~~v~~~f~ng 74 (678) |+.-.|||||+|+ +++++|.+++|++++|||.++++ |.++|++|+|+.+|...||. ...+.+++..+|.|| T Consensus 1 m~~~~~Gv~v~e~~~~~~~v~~~~tav~~fvGta~~~~~~~~p~~~p~~v~s~~~~~~~~g~---~~tl~~a~~~~~~~g 77 (396) T protein:vir:60 1 MSDYHHGVQVLEINEGTRVISTVSTAIVGMVCTASDADAEIFPLNKPVLITNVQSAIAKAGK---KGTLAASLQAIADQS 77 (396) T ss_pred CCCCCCCeEEEEcCCCcccccccCceeEEEEecccccccccccCccCeEeechHHHHHhhcC---cchhHHHHHHHhhcc Confidence 9998899999999 69999999999999999998654 88999999999999999985 456788999999999 Q ss_pred CCeEEEEEcCCcccccccccccccceeeeecccccccccceeeecccccccccccccccccccccceeeecccccccccc Q lcl|NC_019538. 75 GNDLRTVRILDEDTARNSSPFFETIDYTITSPGVDYRIGDDVKILQNGATITTGKVSALNSVGGITFVRFSTAEVVKKAK 154 (678) Q Consensus 75 G~~~~vvRv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~a~ 154 (678) |..||++|+.......... . .+ T Consensus 78 g~~~~vv~~~~~~~~~~~~-----------------------------------------------~----------~~- 99 (396) T protein:vir:60 78 KPVTVVVRVEDGTGEDEET-----------------------------------------------K----------LA- 99 (396) T ss_pred CceEEEEeccccccccccc-----------------------------------------------c----------cc- Confidence 9999999885320000000 0 00 Q ss_pred eeecccccccceeeeeeecccccccccceeeeeeccccceeeecccccccccccccccccchhccccccccceeeecccc Q lcl|NC_019538. 155 ELNDYPALQNGWQIQFTSGGPGSGQSATAVLNGIRQDSKIYIRNDEYSRESLLRRDETTETYIDMCESYGIPVVASRYAG 234 (678) Q Consensus 155 ~~~~~~~~~~~~~~~~~s~~~~~g~~a~~~~~~~~~~~~i~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~i~A~~~G 234 (678) .....+ .+ T Consensus 100 ----------------------------------~~~~~~--------------------------------------~~ 107 (396) T protein:vir:60 100 ----------------------------------QTVSNI--------------------------------------IG 107 (396) T ss_pred ----------------------------------cccccc--------------------------------------cc Confidence 000000 00 Q ss_pred ccccceeEEeccccccccccccccccccccccccccccccceeeeeccccccccccccccceeeeccCCeeeeeeeeecc Q lcl|NC_019538. 235 LTGDNIQVAFIAYKDYYKFGVDGKISSVNTVNLKTFPSGLSFGNITPSSYLEYGPQTKDQFAMIVFVGGSAVESRILSVK 314 (678) Q Consensus 235 ~~gn~i~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~ 314 (678) .. .... . T Consensus 108 ----~~-----d~~~-------------------------~--------------------------------------- 114 (396) T protein:vir:60 108 ----TT-----DENG-------------------------Q--------------------------------------- 114 (396) T ss_pred ----cc-----cccc-------------------------c--------------------------------------- Confidence 00 0000 0 Q ss_pred ccccccccchhhhhhhhcCCcceEEEEecCCCccccceeeeeccCcCCccccchhHHHhhhhhhhccchhcccccccccc Q lcl|NC_019538. 315 ENDRDIYGSSIYVDEFFINGYSTFIQGVAESWPTEYSGILTFGGGNSGNSTASAGDWIEGWDMFSDREHVDVNLFIAGSC 394 (678) Q Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~l~gg~dg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 394 (678) .. ++.++. +...+....+... T Consensus 115 ------------------~t---------------------------------------g~~al~--~~~~~~~~~~~il 135 (396) T protein:vir:60 115 ------------------YT---------------------------------------GLKALL--AAESVTGVKPRIL 135 (396) T ss_pred ------------------cc---------------------------------------chhhhh--hcccceeeeeeec Confidence 00 000000 0000001122223 Q ss_pred ccCcccchhHHHHHHHHHHHhcCCeEEEEccccchhccccccCCHHHHHHHHhcccccccchhhccccccceEEEEcCeE Q lcl|NC_019538. 395 AGEGVEIASTVQKSVAAICDERQDCLGWISPPREYMVNLPVATAVKKMVEWRRGVTDSGVVVDDNMNIGTTYSSTSANYK 474 (678) Q Consensus 395 ~~~~~~~~~~v~~~l~~~~~~~~~~~~i~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~ 474 (678) .+|+++ ...++.+|.++|++++ +++++|.|. +.+.+++.+||++ ++|+|+++||||+ T Consensus 136 ~ap~~~-~~~v~~al~~~~~~~~-~~~i~d~p~--------~~~~~~a~~~~~~-------------~~s~~~~~~~p~~ 192 (396) T protein:vir:60 136 GVPGLD-TKEVAVALASVCQKLR-AFGYISAWG--------CKTISEVKAYRQN-------------FSQRELMVIWPDF 192 (396) T ss_pred cccccc-cHHHHHHHHHHhccCC-eEEEEeCCC--------CCCHHHHHHHHhh-------------cCCceEEEEeCce Confidence 344443 3678999999999875 678888763 5688899999864 4688999999999 Q ss_pred EEecccCCceeEechHHHHHHHHHHHhhcCCceECcCCcchhheeeccc---ceecCChhhhhhhhhCCcEEEEEecCCc Q lcl|NC_019538. 475 LQYDKYNDTNRWIPLSADMAGLCARTDTVAQPWMSPAGFNRGQILDVRK---LAIETRQAHRDELYQNSMNPVVGFPGQG 551 (678) Q Consensus 475 ~v~d~~~~~~~~~pps~~vAG~~a~~d~~~g~~~sPan~~~~~v~~~~~---~~~~~~~~e~~~L~~~gIn~i~~~~~~G 551 (678) +++|+.++..+++|||+++||++||+|.++|+||||+|+++.||.+... ....++++|++.||++|||+++ +++| T Consensus 193 ~~~d~~~~~~~~~p~s~~~AG~~a~~d~~~g~~~spaN~~l~gi~~~~~~~~~~~~~~~~~~~~Ln~~gI~~~~--~~~G 270 (396) T protein:vir:60 193 LAWDTVASTTATAYATARALGLRAKIDQEQGWHKTLSNVGVNGVTGISASVFWDLQESGTDADLLNESGVTTLI--RRDG 270 (396) T ss_pred eeecccCCceeEEchhHHHHHHHHHhhhccCcEeCcCCceecceeeceeecccccCCCcchhhhhhhcCcEEEE--cCCC Confidence 9999999999999999999999999999999999999999888877643 2345678899999999999995 4789 Q ss_pred EEEeccccCCCCccccceeehhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhcCCeeeeEEEEccCC Q lcl|NC_019538. 552 FILYGDKTMSLQPTPFDRINVRRLFNLLKKSISESAKYKLFENNDAFTRNSFRSEVNSYLDSIKSLGGIYDFRVVCDETN 631 (678) Q Consensus 552 ~~~wG~rT~~~~~~~~~~i~vrR~~~~i~~si~~~~~~~vfepn~~~~~~~v~~~i~~~L~~l~~~gal~g~~V~~d~~~ 631 (678) +++||+||+++|+ +|+||++||+++||+++|+++++|+|||||++.+|++|+.+|++||++||++|+|.||+|+||+++ T Consensus 271 ~~~wG~rT~~~d~-~~~~i~~rR~~~~i~~~i~~~~~~~v~e~n~~~~~~~i~~~i~~~l~~l~~~gal~g~~~~~d~~~ 349 (396) T protein:vir:60 271 FRFWGNRTCSDDP-LFLFENYTRTAQVLADTMAEAHMWAVDKPITATLIRDIVDGINAKFRELKTNGYIVDATCWFSEES 349 (396) T ss_pred EEEEcccccCCCc-ccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeceEEEEecCC Confidence 9999999999875 799999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCHHHhhCCEEEEEEEEEecCCceEEEEEEEEeecCceeeeccccCC Q lcl|NC_019538. 632 NTPAVIDRNEFVATILIKPARSINYVSLNFSAVGTSANFDELVGPQN 678 (678) Q Consensus 632 nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~~~~~ 678 (678) ||+++|++|+|+++|+++|++|+|||+|++..... .++++++.+| T Consensus 350 nt~~~i~~G~~~~~i~~~p~~pae~I~~~~~~~~~--~~~~~~~~~~ 394 (396) T protein:vir:60 350 NDAETLKAGKLYIDYDYTPVPPLENLTLRQRITDK--YLANLVTSVN 394 (396) T ss_pred CCHHHhhCCEEEEEEEEEecCCcceEEEEEEEchH--HHHHHHHHhh Confidence 99999999999999999999999999999986555 6889999998 No 24 >protein:vir:79181 Length: 390 # NCBI annotation: gp30, phage tail sheath protein # Family: family:all:115 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111061;genbank:gi:134288772;genbank:GeneID:4960700 Probab=100.00 E-value=2.6e-94 Score=533.68 Aligned_cols=379 Identities=14% Similarity=0.124 Sum_probs=303.9 Q ss_pred CceecCceEEEEc-CCCcccccCCccceeEEecccCC-----CCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHHhc Q lcl|NC_019538. 1 MALLSPGVESKEN-NMQTTIARSSTGRAALAGKFQWG-----PAYQISQLVSETDLIDRFGRPDNQTADSVLSAINFLKY 74 (678) Q Consensus 1 ~~~~~PGVyveEv-~~~~~i~~v~tsv~afvG~~~~G-----pv~~pv~i~s~~~~~~~FG~~~~~~~~~~~v~~~f~ng 74 (678) |++++|||||+|+ +++++|..++|++.+|+|+++++ |+|+|++|+|+.+|.+.||. ..++.+++..+|.|| T Consensus 2 ~~~~~~Gv~v~e~~~~~~~i~~~~tav~~~vg~a~dad~~~~p~n~pv~its~~~~~~~~g~---~~tL~~al~~~~~~~ 78 (390) T protein:vir:79 2 PQDYHHGVRVIEINEGGRPIRSVSTAVLGVVCTAADADASAFPLNTPVLLTNVVAALGKAGK---KGTLRRTLDAIGKQT 78 (390) T ss_pred ccccCCCeEEEEcCCCcccccccCCceeEEEEecCCCCccccccccceEeecHHHHHHhcCC---Cccchhhhhhhcccc Confidence 8899999999999 69999999999999999999987 89999999999999999985 455778899999999 Q ss_pred CCeEEEEEcCCcccccccccccccceeeeecccccccccceeeecccccccccccccccccccccceeeecccccccccc Q lcl|NC_019538. 75 GNDLRTVRILDEDTARNSSPFFETIDYTITSPGVDYRIGDDVKILQNGATITTGKVSALNSVGGITFVRFSTAEVVKKAK 154 (678) Q Consensus 75 G~~~~vvRv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~a~ 154 (678) |+.||+||+....+..... .. T Consensus 79 ~~~~~vv~v~~~~~~~~~~-----------------------------------------------------------~~ 99 (390) T protein:vir:79 79 KPLTVVVRVAEGKDADETT-----------------------------------------------------------SN 99 (390) T ss_pred cceEEEEeecccccccccc-----------------------------------------------------------ce Confidence 9999999985321100000 00 Q ss_pred eeecccccccceeeeeeecccccccccceeeeeeccccceeeecccccccccccccccccchhccccccccceeeecccc Q lcl|NC_019538. 155 ELNDYPALQNGWQIQFTSGGPGSGQSATAVLNGIRQDSKIYIRNDEYSRESLLRRDETTETYIDMCESYGIPVVASRYAG 234 (678) Q Consensus 155 ~~~~~~~~~~~~~~~~~s~~~~~g~~a~~~~~~~~~~~~i~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~i~A~~~G 234 (678) ..... .. .... T Consensus 100 ~ig~~------------------------------~~-------------------------------------~~~~-- 110 (390) T protein:vir:79 100 VIGTV------------------------------TP-------------------------------------DGKY-- 110 (390) T ss_pred eeecc------------------------------cc-------------------------------------cccc-- Confidence 00000 00 0000 Q ss_pred ccccceeEEeccccccccccccccccccccccccccccccceeeeeccccccccccccccceeeeccCCeeeeeeeeecc Q lcl|NC_019538. 235 LTGDNIQVAFIAYKDYYKFGVDGKISSVNTVNLKTFPSGLSFGNITPSSYLEYGPQTKDQFAMIVFVGGSAVESRILSVK 314 (678) Q Consensus 235 ~~gn~i~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~ 314 (678) .++.. T Consensus 111 ---tgl~a------------------------------------------------------------------------ 115 (390) T protein:vir:79 111 ---TGIKA------------------------------------------------------------------------ 115 (390) T ss_pred ---hhhhh------------------------------------------------------------------------ Confidence 00000 Q ss_pred ccccccccchhhhhhhhcCCcceEEEEecCCCccccceeeeeccCcCCccccchhHHHhhhhhhhccchhcccccccccc Q lcl|NC_019538. 315 ENDRDIYGSSIYVDEFFINGYSTFIQGVAESWPTEYSGILTFGGGNSGNSTASAGDWIEGWDMFSDREHVDVNLFIAGSC 394 (678) Q Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~l~gg~dg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 394 (678) +.. ......+.+... T Consensus 116 ------------l~~-----------------------------------------------------~~~~~~~~p~il 130 (390) T protein:vir:79 116 ------------LLA-----------------------------------------------------AQGALGVKPRIL 130 (390) T ss_pred ------------hhh-----------------------------------------------------hhhhhccccccc Confidence 000 000000011111 Q ss_pred ccCcccchhHHHHHHHHHHHhcCCeEEEEccccchhccccccCCHHHHHHHHhcccccccchhhccccccceEEEEcCeE Q lcl|NC_019538. 395 AGEGVEIASTVQKSVAAICDERQDCLGWISPPREYMVNLPVATAVKKMVEWRRGVTDSGVVVDDNMNIGTTYSSTSANYK 474 (678) Q Consensus 395 ~~~~~~~~~~v~~~l~~~~~~~~~~~~i~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~ 474 (678) .+|+++ ..+++.+|..+|++++ +++++|.| .+.+.+++.+|++ +++|+|+++||||+ T Consensus 131 ~ap~~~-~~~v~~~l~~~a~~~~-~~ai~D~p--------~~~t~~~a~~~~~-------------~~~s~~~~~~~p~~ 187 (390) T protein:vir:79 131 AAPGLD-TQPVAAALAATAQSLR-AMAYVSAS--------GCKTKEEAAAYRR-------------QFGQREIMVIWPDW 187 (390) T ss_pred cCCccc-chHHHHHHHHhhhhcc-eEEEEEcc--------CCCCHHHHHHHhc-------------CCCCceEEEEcCce Confidence 222332 2457888999998774 78898876 3557888999985 35688999999999 Q ss_pred EEecccCCceeEechHHHHHHHHHHHhhcCCceECcCCcchhheeeccc---ceecCChhhhhhhhhCCcEEEEEecCCc Q lcl|NC_019538. 475 LQYDKYNDTNRWIPLSADMAGLCARTDTVAQPWMSPAGFNRGQILDVRK---LAIETRQAHRDELYQNSMNPVVGFPGQG 551 (678) Q Consensus 475 ~v~d~~~~~~~~~pps~~vAG~~a~~d~~~g~~~sPan~~~~~v~~~~~---~~~~~~~~e~~~L~~~gIn~i~~~~~~G 551 (678) +++|+.++..+++|||+++||++||+|.++||||||+|+++.++.++.. .......+|++.||++||+++++ ++| T Consensus 188 ~~~d~~~~~~~~~p~s~~~Ag~~a~~D~~~g~~~spsN~~i~gi~~~~~~~~~~~~~~~~~a~~Ln~~gi~t~~~--~~G 265 (390) T protein:vir:79 188 LGWDDTTNSTAVIPAPAIAAGLRAKIDNDIGWHKTISNVVVNGVSGISADVSWDLQDPATDAGYLNEHEVTTLVN--RNG 265 (390) T ss_pred eecccccCceeEeehHHHHHHHHHhhhccCCcEEccCCceeeccceeeeeccccccccchhhhhhhhcCcEEEEc--CCC Confidence 9999999999999999999999999999999999999999887777643 22344567889999999999864 789 Q ss_pred EEEeccccCCCCccccceeehhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhcCCeeeeEEEEccCC Q lcl|NC_019538. 552 FILYGDKTMSLQPTPFDRINVRRLFNLLKKSISESAKYKLFENNDAFTRNSFRSEVNSYLDSIKSLGGIYDFRVVCDETN 631 (678) Q Consensus 552 ~~~wG~rT~~~~~~~~~~i~vrR~~~~i~~si~~~~~~~vfepn~~~~~~~v~~~i~~~L~~l~~~gal~g~~V~~d~~~ 631 (678) +++||+||+++|+ +|+||+||||+++|+++|+++++|+|||||++.+|++|+++++.||++||++|+|.||+|+||+++ T Consensus 266 ~~~wG~rT~~~d~-~~~~i~vrR~~~~i~~~i~~~~~~~v~e~~~~~~~~~i~~~i~~~L~~l~~~gal~g~~v~~d~~~ 344 (390) T protein:vir:79 266 FRFWGERTCSDDP-KFAFENYTRTAQVAADSIAEAQMPVVDGPLNPSLARDIVESINGWFRQQVANGYLIGGSAWIDPEP 344 (390) T ss_pred EEEEeccccCCCc-ccceeeehhhHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCC Confidence 9999999998875 799999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCHHHhhCCEEEEEEEEEecCCceEEEEEEEEeecCceeeeccccCC Q lcl|NC_019538. 632 NTPAVIDRNEFVATILIKPARSINYVSLNFSAVGTSANFDELVGPQN 678 (678) Q Consensus 632 nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~~~~~ 678 (678) ||++||++|+|+++|+++|++|+|||+|++..... .++|+++.++ T Consensus 345 nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~--~~~~~~~~v~ 389 (390) T protein:vir:79 345 NTADILASGKAYIDYDYTPVPPLENLVLRQRITDR--FLADFPARVA 389 (390) T ss_pred CCHHHhhCCEEEEEEEEEecCCcceEEEEEEEchH--HHHHHHHHhc Confidence 99999999999999999999999999999986444 5788888888 No 25 >protein:vir:79141 Length: 391 # NCBI annotation: major tail seath protein gpFI # Family: family:all:115 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165276;genbank:gi:145708101;genbank:GeneID:5247152 Probab=100.00 E-value=2.1e-94 Score=534.21 Aligned_cols=379 Identities=13% Similarity=0.112 Sum_probs=304.0 Q ss_pred CceecCceEEEEc-CCCcccccCCccceeEEeccc-----CCCCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHHhc Q lcl|NC_019538. 1 MALLSPGVESKEN-NMQTTIARSSTGRAALAGKFQ-----WGPAYQISQLVSETDLIDRFGRPDNQTADSVLSAINFLKY 74 (678) Q Consensus 1 ~~~~~PGVyveEv-~~~~~i~~v~tsv~afvG~~~-----~Gpv~~pv~i~s~~~~~~~FG~~~~~~~~~~~v~~~f~ng 74 (678) |++.+|||||+|+ +++++|.+++|++++|||+++ .+|+|+|++|+|+.||...||. ..++.+++..+|.|| T Consensus 2 ~~~~~pGv~v~e~~~~~~~i~~~~tav~~~vg~a~~a~~~~~p~n~pv~iss~~~~~~~~g~---~gtl~~al~~~~~~g 78 (391) T protein:vir:79 2 PTDYHHGVRVVELNDGTRPIRTIETAVAGIVCTADDADAATFPLDTPVLLTNPQAYIGKAGD---KGTLAHTLDAITDQT 78 (391) T ss_pred CCCCCCCeEEEECCCCcccccccCCceEEEEeecccccccccccccCEEeccHHHHHHhcCC---ccccchhhhhhhccc Confidence 8889999999999 699999999999999999986 6899999999999999999995 456778899999999 Q ss_pred CCeEEEEEcCCcccccccccccccceeeeecccccccccceeeecccccccccccccccccccccceeeecccccccccc Q lcl|NC_019538. 75 GNDLRTVRILDEDTARNSSPFFETIDYTITSPGVDYRIGDDVKILQNGATITTGKVSALNSVGGITFVRFSTAEVVKKAK 154 (678) Q Consensus 75 G~~~~vvRv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~a~ 154 (678) |..||+|++.......... .. . T Consensus 79 g~~~~vv~~~~~~~~~~~~------------------------------------------------~~-----~----- 100 (391) T protein:vir:79 79 NPLTVVVRVAGGASEAETT------------------------------------------------SN-----L----- 100 (391) T ss_pred ccceeeecccccccccccc------------------------------------------------cc-----c----- Confidence 9999999875321000000 00 0 Q ss_pred eeecccccccceeeeeeecccccccccceeeeeeccccceeeecccccccccccccccccchhccccccccceeeecccc Q lcl|NC_019538. 155 ELNDYPALQNGWQIQFTSGGPGSGQSATAVLNGIRQDSKIYIRNDEYSRESLLRRDETTETYIDMCESYGIPVVASRYAG 234 (678) Q Consensus 155 ~~~~~~~~~~~~~~~~~s~~~~~g~~a~~~~~~~~~~~~i~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~i~A~~~G 234 (678) .+. ... + .... T Consensus 101 -----------------------------------~g~-~~~---------------~-----------------~~~t- 111 (391) T protein:vir:79 101 -----------------------------------IGT-TNA---------------A-----------------GRYT- 111 (391) T ss_pred -----------------------------------ccc-ccc---------------h-----------------hhhH- Confidence 000 000 0 0000 Q ss_pred ccccceeEEeccccccccccccccccccccccccccccccceeeeeccccccccccccccceeeeccCCeeeeeeeeecc Q lcl|NC_019538. 235 LTGDNIQVAFIAYKDYYKFGVDGKISSVNTVNLKTFPSGLSFGNITPSSYLEYGPQTKDQFAMIVFVGGSAVESRILSVK 314 (678) Q Consensus 235 ~~gn~i~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~ 314 (678) ++.. T Consensus 112 ----Gl~~------------------------------------------------------------------------ 115 (391) T protein:vir:79 112 ----GMKA------------------------------------------------------------------------ 115 (391) T ss_pred ----HHhh------------------------------------------------------------------------ Confidence 0000 Q ss_pred ccccccccchhhhhhhhcCCcceEEEEecCCCccccceeeeeccCcCCccccchhHHHhhhhhhhccchhcccccccccc Q lcl|NC_019538. 315 ENDRDIYGSSIYVDEFFINGYSTFIQGVAESWPTEYSGILTFGGGNSGNSTASAGDWIEGWDMFSDREHVDVNLFIAGSC 394 (678) Q Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~l~gg~dg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 394 (678) +.. ...+..+.+... T Consensus 116 ------------l~~-----------------------------------------------------~~~~~~~~p~~l 130 (391) T protein:vir:79 116 ------------LLT-----------------------------------------------------ARNRFGVAPRIL 130 (391) T ss_pred ------------hhh-----------------------------------------------------hhhhhcccchhh Confidence 000 000000011111 Q ss_pred ccCcccchhHHHHHHHHHHHhcCCeEEEEccccchhccccccCCHHHHHHHHhcccccccchhhccccccceEEEEcCeE Q lcl|NC_019538. 395 AGEGVEIASTVQKSVAAICDERQDCLGWISPPREYMVNLPVATAVKKMVEWRRGVTDSGVVVDDNMNIGTTYSSTSANYK 474 (678) Q Consensus 395 ~~~~~~~~~~v~~~l~~~~~~~~~~~~i~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~ 474 (678) ..|+.+ ..+++.+|.++|++++ +++++|.| .+.+.+++.+|++. ++|+|+++||||+ T Consensus 131 ~~p~~~-~~~v~~al~~~~~~~~-~~ai~d~p--------~~~t~~~a~~~~~~-------------~~s~~~a~~~P~~ 187 (391) T protein:vir:79 131 AVPGLD-SLPVGTELVTIAQKLR-AFAYLSAY--------GCQTKEEAVAYRSN-------------FGQREAMVMWPDF 187 (391) T ss_pred cCCccc-hhHHHHHHHHHHhhcC-cEEEEECC--------CCCCHHHHHHHHhc-------------cCCceeEEeccee Confidence 122322 3457889999999987 46788876 35678899999864 5678999999999 Q ss_pred EEecccCCceeEechHHHHHHHHHHHhhcCCceECcCCcchhheeeccc---ceecCChhhhhhhhhCCcEEEEEecCCc Q lcl|NC_019538. 475 LQYDKYNDTNRWIPLSADMAGLCARTDTVAQPWMSPAGFNRGQILDVRK---LAIETRQAHRDELYQNSMNPVVGFPGQG 551 (678) Q Consensus 475 ~v~d~~~~~~~~~pps~~vAG~~a~~d~~~g~~~sPan~~~~~v~~~~~---~~~~~~~~e~~~L~~~gIn~i~~~~~~G 551 (678) +++|+.++..+++|||+++||++||+|.++||||||+|+++.+|.+... ........|.+.||++|||++++ ++| T Consensus 188 ~~~d~~~~~~~~~p~s~~~AG~~a~~D~~~g~~~spaN~~l~gi~~~~~~~~~~~~~~~~~~~~Ln~~~I~t~~~--~~G 265 (391) T protein:vir:79 188 VGWDTAANAETTLWATARAVGLRAKIDNDTGWHKTLSNVAVGGVTGLSRDVFWDLQDPATDAGYLNANEVTTLVH--RDG 265 (391) T ss_pred eeecCcCCceeeechHHHHHHHHHHhhhcccceeccCCceehhhhccccccccccccccchhhhhhhcCceEEEC--CCc Confidence 9999999999999999999999999999999999999999887777553 22345667899999999999964 789 Q ss_pred EEEeccccCCCCccccceeehhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhcCCeeeeEEEEccCC Q lcl|NC_019538. 552 FILYGDKTMSLQPTPFDRINVRRLFNLLKKSISESAKYKLFENNDAFTRNSFRSEVNSYLDSIKSLGGIYDFRVVCDETN 631 (678) Q Consensus 552 ~~~wG~rT~~~~~~~~~~i~vrR~~~~i~~si~~~~~~~vfepn~~~~~~~v~~~i~~~L~~l~~~gal~g~~V~~d~~~ 631 (678) +++||+||+++|+ +|+||++||++++|+++|+++++|+|||||++.+|++|+++|++||++||++|+|.||+|+||+++ T Consensus 266 ~~~wG~rT~~~d~-~~~~i~~rR~~~~i~~~i~~~~~~~v~epn~~~~~~~i~~~i~~~l~~l~~~g~l~g~~v~~~~~~ 344 (391) T protein:vir:79 266 YRFWGSRTCSADP-LFAFENYTRTAQVLADTMAEAHMWANDLPMTPTLVRDLLEGINAKLRMLTRNGYLLGGAAWFDADA 344 (391) T ss_pred EEEEcccccCCCc-ccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeceEEEEecCC Confidence 9999999998875 799999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCHHHhhCCEEEEEEEEEecCCceEEEEEEEEeecCce--eeecccc Q lcl|NC_019538. 632 NTPAVIDRNEFVATILIKPARSINYVSLNFSAVGTSAN--FDELVGP 676 (678) Q Consensus 632 nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~--~~e~~~~ 676 (678) ||+++|++|+|+++|+++|++|+|||+|++.......+ ++|+.++ T Consensus 345 nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~~~~v~~a 391 (391) T protein:vir:79 345 NSKDTLKAGQLAIDYDYTPVPPLENLTFRQRITDRYLMQFAEAVKAA 391 (391) T ss_pred CCHHHhhCCEEEEEEEEEecCCcceEEEEEEEchHHHHHHHHHhhcC Confidence 99999999999999999999999999999998888766 6666666 No 26 >protein:vir:2035 Length: 396 # NCBI annotation: gpFI # Family: family:all:115 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046778;genbank:gi:9630349;genbank:GeneID:1261516 Probab=100.00 E-value=1.2e-93 Score=530.00 Aligned_cols=385 Identities=15% Similarity=0.142 Sum_probs=306.9 Q ss_pred CceecCceEEEEc-CCCcccccCCccceeEEecccCC-----CCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHHhc Q lcl|NC_019538. 1 MALLSPGVESKEN-NMQTTIARSSTGRAALAGKFQWG-----PAYQISQLVSETDLIDRFGRPDNQTADSVLSAINFLKY 74 (678) Q Consensus 1 ~~~~~PGVyveEv-~~~~~i~~v~tsv~afvG~~~~G-----pv~~pv~i~s~~~~~~~FG~~~~~~~~~~~v~~~f~ng 74 (678) |+.-.|||||+|+ +++++|.+++|++++|||.++++ |+++|++|+|+.+|...||+. ..+..++.++|.|| T Consensus 1 m~~~~~GV~v~e~~~g~~~i~~v~tav~~~vg~a~~a~~~~~~l~~pvlvts~~~~~~~~g~~---~tL~~al~~~~~ng 77 (396) T protein:vir:20 1 MSDYHHGVQVLEINEGTRVISTVSTAIVGMVCTASDADAETFPLNKPVLITNVQSAISKAGKK---GTLAASLQAIADQS 77 (396) T ss_pred CCCCCCCeEEEEcCCCcceeeecCCceeEEEeeeccCCCccccCccCEEeechHHHHhhcccc---cchhhhhhhhhccC Confidence 9988899999999 59999999999999999998765 789999999999999999963 45678899999999 Q ss_pred CCeEEEEEcCCcccccccccccccceeeeecccccccccceeeecccccccccccccccccccccceeeecccccccccc Q lcl|NC_019538. 75 GNDLRTVRILDEDTARNSSPFFETIDYTITSPGVDYRIGDDVKILQNGATITTGKVSALNSVGGITFVRFSTAEVVKKAK 154 (678) Q Consensus 75 G~~~~vvRv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~a~ 154 (678) |..||++|+.......... . .+ T Consensus 78 g~~~~v~~~~~~~~~~~~~-----------------------------------------------~----------~a- 99 (396) T protein:vir:20 78 KPVTVVMRVEDGTGDDEET-----------------------------------------------K----------LA- 99 (396) T ss_pred ceeEEEEeccccccccccc-----------------------------------------------c----------cc- Confidence 9999999874321000000 0 00 Q ss_pred eeecccccccceeeeeeecccccccccceeeeeeccccceeeecccccccccccccccccchhccccccccceeeecccc Q lcl|NC_019538. 155 ELNDYPALQNGWQIQFTSGGPGSGQSATAVLNGIRQDSKIYIRNDEYSRESLLRRDETTETYIDMCESYGIPVVASRYAG 234 (678) Q Consensus 155 ~~~~~~~~~~~~~~~~~s~~~~~g~~a~~~~~~~~~~~~i~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~i~A~~~G 234 (678) . +.. T Consensus 100 --------------------------~------------------------------t~~-------------------- 103 (396) T protein:vir:20 100 --------------------------Q------------------------------TVS-------------------- 103 (396) T ss_pred --------------------------c------------------------------ccc-------------------- Confidence 0 000 Q ss_pred ccccceeEEeccccccccccccccccccccccccccccccceeeeeccccccccccccccceeeeccCCeeeeeeeeecc Q lcl|NC_019538. 235 LTGDNIQVAFIAYKDYYKFGVDGKISSVNTVNLKTFPSGLSFGNITPSSYLEYGPQTKDQFAMIVFVGGSAVESRILSVK 314 (678) Q Consensus 235 ~~gn~i~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~ 314 (678) .+...... .. T Consensus 104 ----~~~~~~~~-~~----------------------------------------------------------------- 113 (396) T protein:vir:20 104 ----NIIGTTDE-NG----------------------------------------------------------------- 113 (396) T ss_pred ----cccccccc-cc----------------------------------------------------------------- Confidence 00000000 00 Q ss_pred ccccccccchhhhhhhhcCCcceEEEEecCCCccccceeeeeccCcCCccccchhHHHhhhhhhhccchhcccccccccc Q lcl|NC_019538. 315 ENDRDIYGSSIYVDEFFINGYSTFIQGVAESWPTEYSGILTFGGGNSGNSTASAGDWIEGWDMFSDREHVDVNLFIAGSC 394 (678) Q Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~l~gg~dg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 394 (678) .. .. ...+.. ...+....+... T Consensus 114 --~~--tg-~~al~~-----------------------------------------------------~~~~~~~~p~i~ 135 (396) T protein:vir:20 114 --QY--TG-LKAMLA-----------------------------------------------------AESVTGVKPRIL 135 (396) T ss_pred --cc--ch-hhhhhh-----------------------------------------------------hccccccchhhh Confidence 00 00 000000 000000111112 Q ss_pred ccCcccchhHHHHHHHHHHHhcCCeEEEEccccchhccccccCCHHHHHHHHhcccccccchhhccccccceEEEEcCeE Q lcl|NC_019538. 395 AGEGVEIASTVQKSVAAICDERQDCLGWISPPREYMVNLPVATAVKKMVEWRRGVTDSGVVVDDNMNIGTTYSSTSANYK 474 (678) Q Consensus 395 ~~~~~~~~~~v~~~l~~~~~~~~~~~~i~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~ 474 (678) ..|++. ...++.+|.++|++++ +++++|.|. ..+++++.+||++ ++|+|+++||||+ T Consensus 136 ~ap~~~-~~~v~~al~~~~~~~~-~~~~iD~p~--------~~~~~~a~~~r~~-------------~~s~~~~~~~P~~ 192 (396) T protein:vir:20 136 GVPGLD-TKEVAVALASVCQKLR-AFGYISAWG--------CKTISEVKAYRQN-------------FSQRELMVIWPDF 192 (396) T ss_pred hhhhhc-cHHHHHHHHHHHhcCC-cEEEEecCC--------CCCHHHHHHHhhC-------------CCCceEEEEcCcc Confidence 233333 3568999999998876 577888774 4678999999863 5688999999999 Q ss_pred EEecccCCceeEechHHHHHHHHHHHhhcCCceECcCCcchhheeecccc---eecCChhhhhhhhhCCcEEEEEecCCc Q lcl|NC_019538. 475 LQYDKYNDTNRWIPLSADMAGLCARTDTVAQPWMSPAGFNRGQILDVRKL---AIETRQAHRDELYQNSMNPVVGFPGQG 551 (678) Q Consensus 475 ~v~d~~~~~~~~~pps~~vAG~~a~~d~~~g~~~sPan~~~~~v~~~~~~---~~~~~~~e~~~L~~~gIn~i~~~~~~G 551 (678) +++|+.++..+++|||+++||++||+|.++|+||||+|+++.||.+.... ...++++|++.||++|||++++ ++| T Consensus 193 ~~~d~~~~~~~~~p~s~~~Ag~~a~~d~~~g~~~spaN~~l~gi~~~~~~~~~~~~~~~~~~~~Ln~~gi~~~~~--~~G 270 (396) T protein:vir:20 193 LAWDTVTSTTATAYATARALGLRAKIDQEQGWHKTLSNVGVNGVTGISASVFWDLQESGTDADLLNESGVTTLIR--RDG 270 (396) T ss_pred ccccCcCCcceeechhHHHHHHHHHhhhhcCcEeccCCceeccceecceecccccCCCcchhhhhhhcCcEEEEc--CCC Confidence 99999999999999999999999999999999999999999888886543 3446788999999999999954 789 Q ss_pred EEEeccccCCCCccccceeehhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhcCCeeeeEEEEccCC Q lcl|NC_019538. 552 FILYGDKTMSLQPTPFDRINVRRLFNLLKKSISESAKYKLFENNDAFTRNSFRSEVNSYLDSIKSLGGIYDFRVVCDETN 631 (678) Q Consensus 552 ~~~wG~rT~~~~~~~~~~i~vrR~~~~i~~si~~~~~~~vfepn~~~~~~~v~~~i~~~L~~l~~~gal~g~~V~~d~~~ 631 (678) +++||+||+++|+ +|+||++||+++||+++|++.++|+|||||++.+|++|+.+++.||++||++|+|.||+|+||+++ T Consensus 271 ~~~wG~rT~s~d~-~~~~i~~rR~~~~i~~~~~~~~~~~v~e~~~~~~~~~i~~~i~~~L~~l~~~G~l~g~~v~~d~~~ 349 (396) T protein:vir:20 271 FRFWGNRTCSDDP-LFLFENYTRTAQVVADTMAEAHMWAVDKPITATLIRDIVDGINAKFRELKTNGYIVDATCWFSEES 349 (396) T ss_pred EEEEcccccCCCc-ccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCcceeceEEEEecCC Confidence 9999999998864 799999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCHHHhhCCEEEEEEEEEecCCceEEEEEEEEeecCceeeeccccCC Q lcl|NC_019538. 632 NTPAVIDRNEFVATILIKPARSINYVSLNFSAVGTSANFDELVGPQN 678 (678) Q Consensus 632 nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~~~~~ 678 (678) ||+++|++|+|+++|+++|++|+|||+|+++.... +|+|++++++ T Consensus 350 nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~--~~~~~~~~~~ 394 (396) T protein:vir:20 350 NDAETLKAGKLYIDYDYTPVPPLENLTLRQRITDK--YLANLVTSVN 394 (396) T ss_pred CCHHHhhCCEEEEEEEEEecCCcceEEEEEEEchH--HHHHHHHHhh Confidence 99999999999999999999999999999885544 6999999999 No 27 >protein:vir:98553 Length: 395 # NCBI annotation: gp21 # Family: family:all:115 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958076;genbank:gi:41057373;genbank:GeneID:2744224 Probab=100.00 E-value=2.5e-93 Score=528.26 Aligned_cols=385 Identities=15% Similarity=0.138 Sum_probs=307.7 Q ss_pred CceecCceEEEEc-CCCcccccCCccceeEEecccCC-----CCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHHhc Q lcl|NC_019538. 1 MALLSPGVESKEN-NMQTTIARSSTGRAALAGKFQWG-----PAYQISQLVSETDLIDRFGRPDNQTADSVLSAINFLKY 74 (678) Q Consensus 1 ~~~~~PGVyveEv-~~~~~i~~v~tsv~afvG~~~~G-----pv~~pv~i~s~~~~~~~FG~~~~~~~~~~~v~~~f~ng 74 (678) |+.-+|||||+|+ ++++++.+++|++.+|||+++.+ |.++|++|+|+.+|+..||. ..++..++..+|.|| T Consensus 1 m~~~~~GV~v~e~~~g~~~v~~v~tav~~~vgta~~~~~~~~p~~~pv~v~s~~~~~~~~g~---~~tl~~al~~~~~~~ 77 (395) T protein:vir:98 1 MSDFHHGTQVIEINDGTRVISTVATAVVGMVCTASDADATLFPLNEPVLITNVQSAIAKAGK---KGTLAASLQAIADQS 77 (395) T ss_pred CCCCCCCeEEEEcCCCcccccccCcceEEEEeeccCCCccccccccceEeechHHhHhhccc---ccchhhHHHHHhhcc Confidence 9998999999999 69999999999999999999865 78999999999999999996 356788999999999 Q ss_pred CCeEEEEEcCCcccccccccccccceeeeecccccccccceeeecccccccccccccccccccccceeeecccccccccc Q lcl|NC_019538. 75 GNDLRTVRILDEDTARNSSPFFETIDYTITSPGVDYRIGDDVKILQNGATITTGKVSALNSVGGITFVRFSTAEVVKKAK 154 (678) Q Consensus 75 G~~~~vvRv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~a~ 154 (678) |+.|+++|+.......... . .+ T Consensus 78 ~~~~~vv~~~~~~~~~~~~-------------------------------------------------~--------~a- 99 (395) T protein:vir:98 78 KPVTVVVRVEDGTGDDEEA-------------------------------------------------A--------LA- 99 (395) T ss_pred CceEEEeeccccccccccc-------------------------------------------------c--------cc- Confidence 9999999885321000000 0 00 Q ss_pred eeecccccccceeeeeeecccccccccceeeeeeccccceeeecccccccccccccccccchhccccccccceeeecccc Q lcl|NC_019538. 155 ELNDYPALQNGWQIQFTSGGPGSGQSATAVLNGIRQDSKIYIRNDEYSRESLLRRDETTETYIDMCESYGIPVVASRYAG 234 (678) Q Consensus 155 ~~~~~~~~~~~~~~~~~s~~~~~g~~a~~~~~~~~~~~~i~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~i~A~~~G 234 (678) . +...+ .| T Consensus 100 --------------------------~--------~~~~i--------------------------------------~g 107 (395) T protein:vir:98 100 --------------------------Q--------TVSNI--------------------------------------IG 107 (395) T ss_pred --------------------------c--------ccccc--------------------------------------cc Confidence 0 00000 00 Q ss_pred ccccceeEEeccccccccccccccccccccccccccccccceeeeeccccccccccccccceeeeccCCeeeeeeeeecc Q lcl|NC_019538. 235 LTGDNIQVAFIAYKDYYKFGVDGKISSVNTVNLKTFPSGLSFGNITPSSYLEYGPQTKDQFAMIVFVGGSAVESRILSVK 314 (678) Q Consensus 235 ~~gn~i~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~ 314 (678) .. . . .. T Consensus 108 ~~--~------~-~~----------------------------------------------------------------- 113 (395) T protein:vir:98 108 GT--D------E-NG----------------------------------------------------------------- 113 (395) T ss_pred cc--c------c-cc----------------------------------------------------------------- Confidence 00 0 0 00 Q ss_pred ccccccccchhhhhhhhcCCcceEEEEecCCCccccceeeeeccCcCCccccchhHHHhhhhhhhccchhcccccccccc Q lcl|NC_019538. 315 ENDRDIYGSSIYVDEFFINGYSTFIQGVAESWPTEYSGILTFGGGNSGNSTASAGDWIEGWDMFSDREHVDVNLFIAGSC 394 (678) Q Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~l~gg~dg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 394 (678) .. .++.++. +......+.+... T Consensus 114 ---------------------------------------------------~~-----Tgl~al~--~~~~~~~~~p~il 135 (395) T protein:vir:98 114 ---------------------------------------------------KY-----TGIKALL--TAQAVTGVKPRIL 135 (395) T ss_pred ---------------------------------------------------ch-----hHHHHHh--hhhhhhccchhhc Confidence 00 0000000 0000001122223 Q ss_pred ccCcccchhHHHHHHHHHHHhcCCeEEEEccccchhccccccCCHHHHHHHHhcccccccchhhccccccceEEEEcCeE Q lcl|NC_019538. 395 AGEGVEIASTVQKSVAAICDERQDCLGWISPPREYMVNLPVATAVKKMVEWRRGVTDSGVVVDDNMNIGTTYSSTSANYK 474 (678) Q Consensus 395 ~~~~~~~~~~v~~~l~~~~~~~~~~~~i~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~ 474 (678) ..|++++ .+++.+|..+|++++ +++++|.|. +.+.+++.+|+++ ++|+|+++||||+ T Consensus 136 ~ap~~~~-~~v~~al~~~~~~~~-~~~~~d~p~--------~~t~~~a~~~~~~-------------~~s~~~~~~~p~~ 192 (395) T protein:vir:98 136 GVPGLDT-KEVAVALASAAIKLR-AFAYVSAWG--------CKTISEAMEYRKN-------------FSQRELMVIWPDF 192 (395) T ss_pred ccccccc-cHHHHHHHHHhhhcC-cEEEEEcCC--------CCCHHHHHHHHhc-------------cCCceEEEEecce Confidence 3444443 568899999999875 688888763 5678999999864 5688999999999 Q ss_pred EEecccCCceeEechHHHHHHHHHHHhhcCCceECcCCcchhheeecccc---eecCChhhhhhhhhCCcEEEEEecCCc Q lcl|NC_019538. 475 LQYDKYNDTNRWIPLSADMAGLCARTDTVAQPWMSPAGFNRGQILDVRKL---AIETRQAHRDELYQNSMNPVVGFPGQG 551 (678) Q Consensus 475 ~v~d~~~~~~~~~pps~~vAG~~a~~d~~~g~~~sPan~~~~~v~~~~~~---~~~~~~~e~~~L~~~gIn~i~~~~~~G 551 (678) +++|+.++..+++|||+++||++||+|.++|+||||+|+++.+|.++... ...++++|++.||++|||+++ +++| T Consensus 193 ~~~d~~~~~~~~~p~s~~~AG~~a~~d~~~g~~~spaN~~i~gi~~~~~~~~~~~~~~~~~~~~Ln~~gI~~~~--~~~G 270 (395) T protein:vir:98 193 LAWDTVKNTTATAYATARALGLRAYIDQTVGWHKTLSNVGVQGVTGISASVFWDLQASGTDADLLNEAGVTTLV--RKDG 270 (395) T ss_pred eEecccCCceeeechHHHHHHHHHHhhcccCcEeccCCceeecccccceecccccCCCcchHHhhhhcCcEEEE--cCCC Confidence 99999999999999999999999999999999999999998888776532 355678999999999999995 4789 Q ss_pred EEEeccccCCCCccccceeehhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhcCCeeeeEEEEccCC Q lcl|NC_019538. 552 FILYGDKTMSLQPTPFDRINVRRLFNLLKKSISESAKYKLFENNDAFTRNSFRSEVNSYLDSIKSLGGIYDFRVVCDETN 631 (678) Q Consensus 552 ~~~wG~rT~~~~~~~~~~i~vrR~~~~i~~si~~~~~~~vfepn~~~~~~~v~~~i~~~L~~l~~~gal~g~~V~~d~~~ 631 (678) +++||+||+++|+ +|+||++||++++|+++|++.++|+||||||+.+|++|+++|+.||++||++|+|.||+|+||+++ T Consensus 271 ~~~wG~rT~s~d~-~~~~i~~rR~~~~i~~~i~~~~~~~v~e~~~~~~~~~i~~~i~~~L~~l~~~g~l~g~~v~~d~~~ 349 (395) T protein:vir:98 271 FRFWGNRTCSDDP-LFLFENYTRTAQVLADTMAEAHMWAVDKPITATLIRDIVDGINAKFRELKSNGYIVEGKCWFDEES 349 (395) T ss_pred EEEEcccccCCCc-ccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeceEEEEecCC Confidence 9999999998874 899999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCHHHhhCCEEEEEEEEEecCCceEEEEEEEEeecCceeeeccccCC Q lcl|NC_019538. 632 NTPAVIDRNEFVATILIKPARSINYVSLNFSAVGTSANFDELVGPQN 678 (678) Q Consensus 632 nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~~~~~ 678 (678) ||+++|++|+|+++|+++|++|+|||+|++....+. |+++++.++ T Consensus 350 nt~~~i~~G~~~~~i~~~p~~p~e~I~~~~~~~~~~--~~~~~~~~~ 394 (395) T protein:vir:98 350 NDKETLKAGKLYIDYDYTPVPPLESLTLRQRITDKY--LVNLAESVN 394 (395) T ss_pred CCHHHhhCCeEEEEEEEEecCCcceEEEEEEEchHH--HHHHHHHhc Confidence 999999999999999999999999999999866665 666666666 No 28 >protein:vir:1172 Length: 391 # NCBI annotation: hypothetical protein # Family: family:all:115 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490621;genbank:gi:17313241;genbank:GeneID:927300 Probab=100.00 E-value=1.3e-93 Score=529.90 Aligned_cols=379 Identities=14% Similarity=0.132 Sum_probs=307.8 Q ss_pred CceecCceEEEEc-CCCcccccCCccceeEEeccc-----CCCCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHHhc Q lcl|NC_019538. 1 MALLSPGVESKEN-NMQTTIARSSTGRAALAGKFQ-----WGPAYQISQLVSETDLIDRFGRPDNQTADSVLSAINFLKY 74 (678) Q Consensus 1 ~~~~~PGVyveEv-~~~~~i~~v~tsv~afvG~~~-----~Gpv~~pv~i~s~~~~~~~FG~~~~~~~~~~~v~~~f~ng 74 (678) |++.+|||||+|+ +++++|..+.|++.+|+|.++ .+|+++|++|+|+.+|...||. ..++.+++..+|.|+ T Consensus 3 ~~~~~~GV~v~e~~~~~~~i~~v~tavig~vg~a~~a~~~~~~~~~p~~v~s~~~~~~~~g~---~~tl~~al~~~~~~~ 79 (391) T protein:vir:11 3 ADQYHHGVRVQEINDGTRPIRTIATAIIGLVATAEDADATAFPLDTPVLITNVQAAIGKAGT---SGTLPASLQAIADQA 79 (391) T ss_pred CCcCCCcEEEEECCCCcceecccCCceeEEEEecCCCCCccccccccEEEecchhhheecCC---Cccchhhhhhhhccc Confidence 8899999999999 699999999999999999998 5699999999999999999885 345778899999999 Q ss_pred CCeEEEEEcCCcccccccccccccceeeeecccccccccceeeecccccccccccccccccccccceeeecccccccccc Q lcl|NC_019538. 75 GNDLRTVRILDEDTARNSSPFFETIDYTITSPGVDYRIGDDVKILQNGATITTGKVSALNSVGGITFVRFSTAEVVKKAK 154 (678) Q Consensus 75 G~~~~vvRv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~a~ 154 (678) |+.||+||+...+..... .. T Consensus 80 g~~~~vv~~~~~~~~~~t-----------------------------------------------------~~------- 99 (391) T protein:vir:11 80 NAATVVVRVKPGEDEAAT-----------------------------------------------------NS------- 99 (391) T ss_pred cceeEEeeeccccccccc-----------------------------------------------------ch------- Confidence 999999988421100000 00 Q ss_pred eeecccccccceeeeeeecccccccccceeeeeeccccceeeecccccccccccccccccchhccccccccceeeecccc Q lcl|NC_019538. 155 ELNDYPALQNGWQIQFTSGGPGSGQSATAVLNGIRQDSKIYIRNDEYSRESLLRRDETTETYIDMCESYGIPVVASRYAG 234 (678) Q Consensus 155 ~~~~~~~~~~~~~~~~~s~~~~~g~~a~~~~~~~~~~~~i~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~i~A~~~G 234 (678) + + .| T Consensus 100 ------------------------------------d--~--------------------------------------~g 103 (391) T protein:vir:11 100 ------------------------------------A--V--------------------------------------IG 103 (391) T ss_pred ------------------------------------h--h--------------------------------------hc Confidence 0 0 00 Q ss_pred ccccceeEEeccccccccccccccccccccccccccccccceeeeeccccccccccccccceeeeccCCeeeeeeeeecc Q lcl|NC_019538. 235 LTGDNIQVAFIAYKDYYKFGVDGKISSVNTVNLKTFPSGLSFGNITPSSYLEYGPQTKDQFAMIVFVGGSAVESRILSVK 314 (678) Q Consensus 235 ~~gn~i~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~ 314 (678) .. +. .. .... T Consensus 104 ~~-~a-~~---~~~g----------------------------------------------------------------- 113 (391) T protein:vir:11 104 GV-SA-DG---KYTG----------------------------------------------------------------- 113 (391) T ss_pred cc-cc-cc---chhh----------------------------------------------------------------- Confidence 00 00 00 0000 Q ss_pred ccccccccchhhhhhhhcCCcceEEEEecCCCccccceeeeeccCcCCccccchhHHHhhhhhhhccchhcccccccccc Q lcl|NC_019538. 315 ENDRDIYGSSIYVDEFFINGYSTFIQGVAESWPTEYSGILTFGGGNSGNSTASAGDWIEGWDMFSDREHVDVNLFIAGSC 394 (678) Q Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~l~gg~dg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 394 (678) .....+. .....+.+... T Consensus 114 ---------~~a~~~~-----------------------------------------------------~~~~~~~p~~~ 131 (391) T protein:vir:11 114 ---------MKALLAA-----------------------------------------------------KARLGVVPRIL 131 (391) T ss_pred ---------hhhhhhh-----------------------------------------------------hhhheeccccc Confidence 0000000 00000111112 Q ss_pred ccCcccchhHHHHHHHHHHHhcCCeEEEEccccchhccccccCCHHHHHHHHhcccccccchhhccccccceEEEEcCeE Q lcl|NC_019538. 395 AGEGVEIASTVQKSVAAICDERQDCLGWISPPREYMVNLPVATAVKKMVEWRRGVTDSGVVVDDNMNIGTTYSSTSANYK 474 (678) Q Consensus 395 ~~~~~~~~~~v~~~l~~~~~~~~~~~~i~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~ 474 (678) ..|++++ .+++.+|+++|+++ +|++++|.|. +.+.+++.+||+. ++|+|+++||||+ T Consensus 132 ~ap~~~~-~~v~~al~~~~~~~-~~~~i~D~p~--------~~t~~~a~~~r~~-------------~~s~~~~~~~p~~ 188 (391) T protein:vir:11 132 GVPGLDT-QPVATALIAIAQQL-RAFAYVSASG--------CKTKEEATAYREN-------------FAAREAMVIWPDF 188 (391) T ss_pred ccccccc-HHHHHHHHHhhccc-ceEEEEEcCC--------CCCHHHHHHHhhh-------------cCCceEEEEcCcc Confidence 2334333 46899999999887 6888998763 4578999999863 5688999999999 Q ss_pred EEecccCCceeEechHHHHHHHHHHHhhcCCceECcCCcchhheeeccc---ceecCChhhhhhhhhCCcEEEEEecCCc Q lcl|NC_019538. 475 LQYDKYNDTNRWIPLSADMAGLCARTDTVAQPWMSPAGFNRGQILDVRK---LAIETRQAHRDELYQNSMNPVVGFPGQG 551 (678) Q Consensus 475 ~v~d~~~~~~~~~pps~~vAG~~a~~d~~~g~~~sPan~~~~~v~~~~~---~~~~~~~~e~~~L~~~gIn~i~~~~~~G 551 (678) +++|+.++..+++|||+++||++||+|.++||||||||+++.+|.++.. .+..+++.|++.||++|||+++ +++| T Consensus 189 ~~~~~~~~~~~~~p~s~~~ag~~a~~d~~~g~~~span~~l~gi~~~~~~~~~~~~~~~~~~~~Ln~~gi~~~~--~~~G 266 (391) T protein:vir:11 189 LTWSTVVNQTVPAPAVAQALGLRARIDQEVGWHKTLSNVAVNGVTGISADVFWDLQSPSTDANYLNENEVTTLV--QEGG 266 (391) T ss_pred eecccccCceEEechHHHHHHHHHHhhccCCcEEccCCceeeceeecccccccccCCCcchhhhhhhcCcEEEE--cCCC Confidence 9999999999999999999999999999999999999999988887654 2344578899999999999985 5789 Q ss_pred EEEeccccCCCCccccceeehhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhcCCeeeeEEEEccCC Q lcl|NC_019538. 552 FILYGDKTMSLQPTPFDRINVRRLFNLLKKSISESAKYKLFENNDAFTRNSFRSEVNSYLDSIKSLGGIYDFRVVCDETN 631 (678) Q Consensus 552 ~~~wG~rT~~~~~~~~~~i~vrR~~~~i~~si~~~~~~~vfepn~~~~~~~v~~~i~~~L~~l~~~gal~g~~V~~d~~~ 631 (678) +++||+||++.|+ +|+||+|||+|++|+++|++.++|+|||||++.+|++|+++|+.||++||++|+|+||+|+||+++ T Consensus 267 ~~~wG~rT~~~d~-~~~~i~vrR~~~~i~~~~~~~~~~~v~e~n~~~~~~~i~~~i~~~l~~l~~~g~l~g~~~~~~~~~ 345 (391) T protein:vir:11 267 FRFWGSRTCSDDP-LFAFENYTRTAQVLADTIAEAHMWAVDKPMHPSLVRDILEGVNAKFRELKGLGLIIDAQAWYDPNV 345 (391) T ss_pred EEEEcccccCCCc-ccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhccceeceEEEEecCC Confidence 9999999998875 799999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCHHHhhCCEEEEEEEEEecCCceEEEEEEEEeecCceeeeccccCC Q lcl|NC_019538. 632 NTPAVIDRNEFVATILIKPARSINYVSLNFSAVGTSANFDELVGPQN 678 (678) Q Consensus 632 nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~~~~~ 678 (678) ||+++|++|+|+++|+++|++|+|||+|+++.... .|+|+++.+| T Consensus 346 n~~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~--~~~~~~~~~~ 390 (391) T protein:vir:11 346 NDKDTLKAGKLRITYDYTPVPPLEDLTFFQKITDS--YLVDFASRVN 390 (391) T ss_pred CCHHHhhCCeEEEEEEEEecCCcceEEEEEEEchH--HHHHHHHHhc Confidence 99999999999999999999999999999986555 5899999999 No 29 >protein:vir:1845 Length: 392 # NCBI annotation: tail sheath protein FI # Family: family:all:115 # MgeID: mge:324 # MgeName: 186 # Cross-refs: genbank:acc:NP_052269;genbank:gi:9634076;genbank:GeneID:1262448 Probab=100.00 E-value=1.8e-92 Score=523.55 Aligned_cols=382 Identities=15% Similarity=0.143 Sum_probs=305.1 Q ss_pred CceecCceEEEEc-CCCcccccCCccceeEEecccCC-----CCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHHhc Q lcl|NC_019538. 1 MALLSPGVESKEN-NMQTTIARSSTGRAALAGKFQWG-----PAYQISQLVSETDLIDRFGRPDNQTADSVLSAINFLKY 74 (678) Q Consensus 1 ~~~~~PGVyveEv-~~~~~i~~v~tsv~afvG~~~~G-----pv~~pv~i~s~~~~~~~FG~~~~~~~~~~~v~~~f~ng 74 (678) |+--.|||||+|+ +++++|.+++|++.+|||.++++ |+++|++|+|+.+|...||. ...+.+++..+|.|| T Consensus 1 m~~~~~Gv~v~e~~~g~~~i~~~~tav~g~vgta~~~~~~~~~~~~p~~its~~~~~~~~g~---~gtl~~al~~~~~ng 77 (392) T protein:vir:18 1 MSDFHHGTKVIEINDGTRVISTVATAIVGMVWTASDADAETFPLNEPVLITNVQSAIAKAGK---KGTLSASLQAIADQS 77 (392) T ss_pred CCCCCCCeEEEEcCCCceeeeccCcceeEEEEeccCCCCcccccccceEeechHHHHhhcCC---CcchHHHHHHhhccc Confidence 8776799999999 69999999999999999999876 89999999999999999986 446788999999999 Q ss_pred CCeEEEEEcCCcccccccccccccceeeeecccccccccceeeecccccccccccccccccccccceeeecccccccccc Q lcl|NC_019538. 75 GNDLRTVRILDEDTARNSSPFFETIDYTITSPGVDYRIGDDVKILQNGATITTGKVSALNSVGGITFVRFSTAEVVKKAK 154 (678) Q Consensus 75 G~~~~vvRv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~a~ 154 (678) |..|+++|+.......... .....+ T Consensus 78 g~~~~vv~v~~~~~~~~~~--------------------------------------------------~t~~dl----- 102 (392) T protein:vir:18 78 KPVTVVVRVAEGTGDDAEA--------------------------------------------------QTTSNI----- 102 (392) T ss_pred CceEEEecccccccccccc--------------------------------------------------cchhhh----- Confidence 9999999874321000000 000000 Q ss_pred eeecccccccceeeeeeecccccccccceeeeeeccccceeeecccccccccccccccccchhccccccccceeeecccc Q lcl|NC_019538. 155 ELNDYPALQNGWQIQFTSGGPGSGQSATAVLNGIRQDSKIYIRNDEYSRESLLRRDETTETYIDMCESYGIPVVASRYAG 234 (678) Q Consensus 155 ~~~~~~~~~~~~~~~~~s~~~~~g~~a~~~~~~~~~~~~i~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~i~A~~~G 234 (678) .| T Consensus 103 ------------------------------------------------------------------------------iG 104 (392) T protein:vir:18 103 ------------------------------------------------------------------------------IG 104 (392) T ss_pred ------------------------------------------------------------------------------ee Confidence 00 Q ss_pred ccccceeEEeccccccccccccccccccccccccccccccceeeeeccccccccccccccceeeeccCCeeeeeeeeecc Q lcl|NC_019538. 235 LTGDNIQVAFIAYKDYYKFGVDGKISSVNTVNLKTFPSGLSFGNITPSSYLEYGPQTKDQFAMIVFVGGSAVESRILSVK 314 (678) Q Consensus 235 ~~gn~i~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~ 314 (678) ...... .... T Consensus 105 ~~~~~~-----~~tg----------------------------------------------------------------- 114 (392) T protein:vir:18 105 GTDENG-----KYTG----------------------------------------------------------------- 114 (392) T ss_pred cccccc-----hhhh----------------------------------------------------------------- Confidence 000000 0000 Q ss_pred ccccccccchhhhhhhhcCCcceEEEEecCCCccccceeeeeccCcCCccccchhHHHhhhhhhhccchhcccccccccc Q lcl|NC_019538. 315 ENDRDIYGSSIYVDEFFINGYSTFIQGVAESWPTEYSGILTFGGGNSGNSTASAGDWIEGWDMFSDREHVDVNLFIAGSC 394 (678) Q Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~l~gg~dg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 394 (678) ...+.+ ......+.+..+ T Consensus 115 ---------~~al~~-----------------------------------------------------~~~~~~~~p~il 132 (392) T protein:vir:18 115 ---------IKALLT-----------------------------------------------------AEAVTGVKPRIL 132 (392) T ss_pred ---------HHHHHh-----------------------------------------------------hhhhhceeehhc Confidence 000000 000000001111 Q ss_pred ccCcccchhHHHHHHHHHHHhcCCeEEEEccccchhccccccCCHHHHHHHHhcccccccchhhccccccceEEEEcCeE Q lcl|NC_019538. 395 AGEGVEIASTVQKSVAAICDERQDCLGWISPPREYMVNLPVATAVKKMVEWRRGVTDSGVVVDDNMNIGTTYSSTSANYK 474 (678) Q Consensus 395 ~~~~~~~~~~v~~~l~~~~~~~~~~~~i~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~ 474 (678) .+|++++ .+++.+|.++|++++ +++++|.| ++.+.+++.+|++. ++|+|+++||||+ T Consensus 133 ~ap~~~~-~~v~~~l~~~~~~~~-~~~~~d~~--------~~~~~~~a~~~~~~-------------~~s~~~~~~~p~~ 189 (392) T protein:vir:18 133 GVPGLDT-QEVATALASVCISLR-AFGYVSAW--------GCKTISEAMAYREN-------------FSQRELMVIWPDF 189 (392) T ss_pred ccCccch-HHHHHHHHHHHhhcC-cEEEEecC--------CCCCHHHHHHHHhh-------------ccCceEEEEeCce Confidence 2233332 468899999999876 57777765 46788999999863 5688999999999 Q ss_pred EEecccCCceeEechHHHHHHHHHHHhhcCCceECcCCcchhheeecccc---eecCChhhhhhhhhCCcEEEEEecCCc Q lcl|NC_019538. 475 LQYDKYNDTNRWIPLSADMAGLCARTDTVAQPWMSPAGFNRGQILDVRKL---AIETRQAHRDELYQNSMNPVVGFPGQG 551 (678) Q Consensus 475 ~v~d~~~~~~~~~pps~~vAG~~a~~d~~~g~~~sPan~~~~~v~~~~~~---~~~~~~~e~~~L~~~gIn~i~~~~~~G 551 (678) +++|+.++..+++|||+++||++||+|.++||||||+|+++.+|.++... +..++++|++.||++|||+++ +++| T Consensus 190 ~~~d~~~~~~~~~p~s~~~AG~~a~~d~~~g~~~spaN~~l~gi~~~~~~~~~~~~~~~~~~~~Ln~~gI~t~~--~~~G 267 (392) T protein:vir:18 190 LAWDTTANATATAYATARALGLRAYIDQTIGWHKTLSNVGVQGVTGISASVFWDLQASGTDADLLNEAGVTTLV--RKDG 267 (392) T ss_pred eeecccCCceEEechHHHHHHHHHhhhccCCceEccCCceeeceeecceecccccCCCcchhhhhhhcCceEEE--cCCC Confidence 99999999999999999999999999999999999999999888776532 345678899999999999995 4789 Q ss_pred EEEeccccCCCCccccceeehhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhcCCeeeeEEEEccCC Q lcl|NC_019538. 552 FILYGDKTMSLQPTPFDRINVRRLFNLLKKSISESAKYKLFENNDAFTRNSFRSEVNSYLDSIKSLGGIYDFRVVCDETN 631 (678) Q Consensus 552 ~~~wG~rT~~~~~~~~~~i~vrR~~~~i~~si~~~~~~~vfepn~~~~~~~v~~~i~~~L~~l~~~gal~g~~V~~d~~~ 631 (678) +++||+||+++|+ +|+||++||++++|+++|+++++|+|||||++.+|++|+++|++||++||++|+|.||+|+||+++ T Consensus 268 ~~~wG~rT~~~d~-~~~~i~~rR~~~~i~~~i~~~~~~~v~e~n~~~~~~~i~~~i~~~L~~l~~~gal~g~~v~~d~~~ 346 (392) T protein:vir:18 268 FRFWGNRTCSDDP-LFLFENYTRTAQVLADTMAEAHMWAVDKPITASLIRDIVDGINAKFRELKSNGYIVDGECWFDEES 346 (392) T ss_pred EEEEcccccCCCc-ccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhcCcccceEEEEecCC Confidence 9999999998875 899999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCHHHhhCCEEEEEEEEEecCCceEEEEEEEEeecCceeeeccccCC Q lcl|NC_019538. 632 NTPAVIDRNEFVATILIKPARSINYVSLNFSAVGTSANFDELVGPQN 678 (678) Q Consensus 632 nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~~~~~ 678 (678) ||+++|++|+|+++|+++|++|+|||+|+++.... .++++++.++ T Consensus 347 nt~~~i~~G~~~~~v~~~p~~p~e~I~~~~~~~~~--~~~~~~~~~~ 391 (392) T protein:vir:18 347 NDKETLKAGKLYIDYDYTPVPPLESLTLRQRITDK--YLVNLAESVN 391 (392) T ss_pred CCHHHhhCCeEEEEEEEEecCCcceEEEEEEEchH--HHHHHHHHhc Confidence 99999999999999999999999999999986555 4777777777 No 30 >protein:vir:100323 Length: 393 # NCBI annotation: tail sheath protein FI # Family: family:all:115 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655492;genbank:gi:109289960;genbank:GeneID:4157366 Probab=100.00 E-value=6.4e-92 Score=520.54 Aligned_cols=377 Identities=14% Similarity=0.087 Sum_probs=304.0 Q ss_pred CceecCceEEEEc-CCCcccccCCccceeEEecccCC-----CCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHHhc Q lcl|NC_019538. 1 MALLSPGVESKEN-NMQTTIARSSTGRAALAGKFQWG-----PAYQISQLVSETDLIDRFGRPDNQTADSVLSAINFLKY 74 (678) Q Consensus 1 ~~~~~PGVyveEv-~~~~~i~~v~tsv~afvG~~~~G-----pv~~pv~i~s~~~~~~~FG~~~~~~~~~~~v~~~f~ng 74 (678) |++.+|||||+|+ +++++|.+++|++.+|||+++++ |+|+|++|+|+.||.+.||. ..++.+++..+|.|+ T Consensus 4 ~~~~~~GV~v~e~~~g~~~i~~~~tav~~~vgta~~~~~~~~pln~pv~i~s~~~~~~~~g~---~g~L~~al~~~~~~~ 80 (393) T protein:vir:10 4 LDTYLHGVEVVEVNAGGVTISTAATSVIGVVCTGDQADAETFPLNTPVLITNPLNYLEKAGS---TGTLRRTLNSIGSIV 80 (393) T ss_pred CCccCCCeEEEEcCCCcceecccCcceeEEEeeccCcCcccccCccceEecchHHHHHhhCC---ccchhhhhhhhhccc Confidence 5666799999999 69999999999999999999988 99999999999999999995 457788999999999 Q ss_pred CCeEEEEEcCCcccccccccccccceeeeecccccccccceeeecccccccccccccccccccccceeeecccccccccc Q lcl|NC_019538. 75 GNDLRTVRILDEDTARNSSPFFETIDYTITSPGVDYRIGDDVKILQNGATITTGKVSALNSVGGITFVRFSTAEVVKKAK 154 (678) Q Consensus 75 G~~~~vvRv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~a~ 154 (678) |+.||+||+...+....... . . T Consensus 81 ~~~~~vv~v~~~~~~~~t~~--------------------------------------------------~---i----- 102 (393) T protein:vir:10 81 KTPTVIVRVAESDDSDTLTA--------------------------------------------------N---I----- 102 (393) T ss_pred CceEEEeecccCcccccccc--------------------------------------------------c---c----- Confidence 99999999853321000000 0 0 Q ss_pred eeecccccccceeeeeeecccccccccceeeeeeccccceeeecccccccccccccccccchhccccccccceeeecccc Q lcl|NC_019538. 155 ELNDYPALQNGWQIQFTSGGPGSGQSATAVLNGIRQDSKIYIRNDEYSRESLLRRDETTETYIDMCESYGIPVVASRYAG 234 (678) Q Consensus 155 ~~~~~~~~~~~~~~~~~s~~~~~g~~a~~~~~~~~~~~~i~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~i~A~~~G 234 (678) . + ...++ ..+ T Consensus 103 -i---------------------g--------~~~~~------------------~~t---------------------- 112 (393) T protein:vir:10 103 -V---------------------G--------TQENG------------------KFT---------------------- 112 (393) T ss_pred -c---------------------c--------ccccc------------------hhh---------------------- Confidence 0 0 00000 000 Q ss_pred ccccceeEEeccccccccccccccccccccccccccccccceeeeeccccccccccccccceeeeccCCeeeeeeeeecc Q lcl|NC_019538. 235 LTGDNIQVAFIAYKDYYKFGVDGKISSVNTVNLKTFPSGLSFGNITPSSYLEYGPQTKDQFAMIVFVGGSAVESRILSVK 314 (678) Q Consensus 235 ~~gn~i~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~ 314 (678) ++.. T Consensus 113 ----gl~a------------------------------------------------------------------------ 116 (393) T protein:vir:10 113 ----GIKA------------------------------------------------------------------------ 116 (393) T ss_pred ----HHHH------------------------------------------------------------------------ Confidence 0000 Q ss_pred ccccccccchhhhhhhhcCCcceEEEEecCCCccccceeeeeccCcCCccccchhHHHhhhhhhhccchhcccccccccc Q lcl|NC_019538. 315 ENDRDIYGSSIYVDEFFINGYSTFIQGVAESWPTEYSGILTFGGGNSGNSTASAGDWIEGWDMFSDREHVDVNLFIAGSC 394 (678) Q Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~l~gg~dg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 394 (678) +..... . ..+.+..+ T Consensus 117 ------------l~~~~~----~-------------------------------------------------~~~~p~li 131 (393) T protein:vir:10 117 ------------LLTAQS----T-------------------------------------------------VFVKPKLL 131 (393) T ss_pred ------------HHhhhh----h-------------------------------------------------cceeeeee Confidence 000000 0 00000011 Q ss_pred ccCcccchhHHHHHHHHHHHhcCCeEEEEccccchhccccccCCHHHHHHHHhcccccccchhhccccccceEEEEcCeE Q lcl|NC_019538. 395 AGEGVEIASTVQKSVAAICDERQDCLGWISPPREYMVNLPVATAVKKMVEWRRGVTDSGVVVDDNMNIGTTYSSTSANYK 474 (678) Q Consensus 395 ~~~~~~~~~~v~~~l~~~~~~~~~~~~i~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~ 474 (678) .+|++++ .+++.+|.++|+++++++++.|+| .++.+++++|++. ++|+|.++||||+ T Consensus 132 ~apg~~~-~~~~~al~~~~~~~~~~~~v~d~~---------~~t~~~ai~~~~~-------------~~s~~~~~~~P~~ 188 (393) T protein:vir:10 132 CVPQHDN-QAVATELLSVAKKLNAFAFISDNG---------ATTKEQAYTYRQN-------------FSQREGMMIFGDW 188 (393) T ss_pred eeccccc-hHHHHHHHHHhhccCcEEEEEcCC---------CCCHHHHHHHhhh-------------cCCceEEEEeccc Confidence 1223322 356788999999999999887765 4578889999863 5678999999999 Q ss_pred EEecccCCceeEechHHHHHHHHHHHhhcCCceECcCCcchhheeeccc---ceecCChhhhhhhhhCCcEEEEEecCCc Q lcl|NC_019538. 475 LQYDKYNDTNRWIPLSADMAGLCARTDTVAQPWMSPAGFNRGQILDVRK---LAIETRQAHRDELYQNSMNPVVGFPGQG 551 (678) Q Consensus 475 ~v~d~~~~~~~~~pps~~vAG~~a~~d~~~g~~~sPan~~~~~v~~~~~---~~~~~~~~e~~~L~~~gIn~i~~~~~~G 551 (678) +++|+.++..+++|||+++||++||+|.++||||||+|+.+.+|.+... ....++++|++.||++|||+|+ +++| T Consensus 189 ~~~d~~~~~~~~~p~s~~~Ag~~a~~d~~~G~~~spaN~~l~gi~~~~~~~~~~~~~~~~~~~~Ln~~gI~t~~--~~~G 266 (393) T protein:vir:10 189 KSYNTDKKAYDTDYAVARACALQAYIDKTVGWHKNISNVELDGVTGITKAVEFDINESSTEANYLNEKGITICL--NHNG 266 (393) T ss_pred ccccccCCceeEeehhHHHHHHHHHhhcCCCcEEccCCceeeceeecceecccccCCCcchhHhHhhcCceEEE--cCCC Confidence 9999999999999999999999999999999999999999988887654 3355678999999999999995 4789 Q ss_pred EEEeccccCCCCccccceeehhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhcC--CeeeeEEEEcc Q lcl|NC_019538. 552 FILYGDKTMSLQPTPFDRINVRRLFNLLKKSISESAKYKLFENNDAFTRNSFRSEVNSYLDSIKSLG--GIYDFRVVCDE 629 (678) Q Consensus 552 ~~~wG~rT~~~~~~~~~~i~vrR~~~~i~~si~~~~~~~vfepn~~~~~~~v~~~i~~~L~~l~~~g--al~g~~V~~d~ 629 (678) +++||+||++.|+ +|+||++|||+++|+++|++.++|+|||||++.+|++|+++++.||++||++| +|.||+|+||+ T Consensus 267 ~~~wG~rT~s~d~-~~~~i~vrR~~~~i~~~i~~~~~~~v~e~~~~~~~~~i~~~i~~~L~~l~~~g~~al~g~~v~~~~ 345 (393) T protein:vir:10 267 FRYWGSRTLATDT-RWAFQQSVRTAQIIKETIGAGLAWAVDMPLTPLRVKTMLEAINNKLRSWASGDDPRILGARVWVAE 345 (393) T ss_pred EEEEcccccCCCc-ccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhccccccccceEEecC Confidence 9999999998874 89999999999999999999999999999999999999999999999999865 89999999987 Q ss_pred CCCCHHHhhCCEEEEEEEEEecCCceEEEEEEEEeecCceeeeccccCC Q lcl|NC_019538. 630 TNNTPAVIDRNEFVATILIKPARSINYVSLNFSAVGTSANFDELVGPQN 678 (678) Q Consensus 630 ~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~~~~~ 678 (678) + ||++||++|+|+++|+++|++|+|||+|++..... +|+|+++.+. T Consensus 346 ~-nt~~~i~~G~~~~~i~~~p~~p~e~I~~~~~~~~~--~~~~l~~~v~ 391 (393) T protein:vir:10 346 E-ITADIIKSGKFVIKYDYHWIPSLESLGLEQRVNDE--YVVDLVNTLK 391 (393) T ss_pred C-CCHHHhhCCEEEEEEEEEecCCcceEEEEEEEchH--HHHHHHHHHh Confidence 5 88899999999999999999999999999876554 5999999999 No 31 >protein:vir:10336 Length: 386 # NCBI annotation: ORF39 # Family: family:all:115 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758931;genbank:gi:27311206;genbank:GeneID:956116 Probab=100.00 E-value=6.9e-90 Score=509.40 Aligned_cols=376 Identities=17% Similarity=0.176 Sum_probs=298.3 Q ss_pred Cc-eecCceEEEEc-CCCcccccCCccceeEEecccCC-----CCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHHh Q lcl|NC_019538. 1 MA-LLSPGVESKEN-NMQTTIARSSTGRAALAGKFQWG-----PAYQISQLVSETDLIDRFGRPDNQTADSVLSAINFLK 73 (678) Q Consensus 1 ~~-~~~PGVyveEv-~~~~~i~~v~tsv~afvG~~~~G-----pv~~pv~i~s~~~~~~~FG~~~~~~~~~~~v~~~f~n 73 (678) |. +.+|||||+|+ +++++|.+++|++.+|||.++.+ |+++|++++|+.++...||+. ..+..++.++|.| T Consensus 1 M~~~~~~Gv~v~ev~~~~~~i~~v~tav~~~vg~a~~a~~~~~~~~~pv~i~s~~~~~~~~g~~---~tl~~a~~~~~~~ 77 (386) T protein:vir:10 1 MAEQYLHGAEVVEIDNGARPIRTAQSGVIGLVGTAPDADATAFPLNTPVLIAGSRREAAKLGAG---GTLPQAIDGIFDQ 77 (386) T ss_pred CccccCCCeEEEEcCCCcccccccCcceeEEEEecCCCCCcccccccceEecchHHHHhhcCCC---cchhHHHHHHhcc Confidence 55 78999999999 69999999999999999998875 899999999999999999864 4567889999999 Q ss_pred cCCeEEEEEcCCcccccccccccccceeeeecccccccccceeeecccccccccccccccccccccceeeeccccccccc Q lcl|NC_019538. 74 YGNDLRTVRILDEDTARNSSPFFETIDYTITSPGVDYRIGDDVKILQNGATITTGKVSALNSVGGITFVRFSTAEVVKKA 153 (678) Q Consensus 74 gG~~~~vvRv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~a 153 (678) ||+.||++++....+..... ... T Consensus 78 gg~~~~vv~~~~~~~~~~t~-----------------------------------------------------~~~---- 100 (386) T protein:vir:10 78 TGAVVVVIRVDEGVDSAATQ-----------------------------------------------------SNV---- 100 (386) T ss_pred CceeEEEeeccccccccccc-----------------------------------------------------hhh---- Confidence 99999999874321100000 000 Q ss_pred ceeecccccccceeeeeeecccccccccceeeeeeccccceeeecccccccccccccccccchhccccccccceeeeccc Q lcl|NC_019538. 154 KELNDYPALQNGWQIQFTSGGPGSGQSATAVLNGIRQDSKIYIRNDEYSRESLLRRDETTETYIDMCESYGIPVVASRYA 233 (678) Q Consensus 154 ~~~~~~~~~~~~~~~~~~s~~~~~g~~a~~~~~~~~~~~~i~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~i~A~~~ 233 (678) .... ... + .... T Consensus 101 --ig~~-----------------------------~~~--------------------t-----------------~~~t 112 (386) T protein:vir:10 101 --IGKV-----------------------------DAD--------------------T-----------------EQYT 112 (386) T ss_pred --hccc-----------------------------ccc--------------------c-----------------chhh Confidence 0000 000 0 0000 Q ss_pred cccccceeEEeccccccccccccccccccccccccccccccceeeeeccccccccccccccceeeeccCCeeeeeeeeec Q lcl|NC_019538. 234 GLTGDNIQVAFIAYKDYYKFGVDGKISSVNTVNLKTFPSGLSFGNITPSSYLEYGPQTKDQFAMIVFVGGSAVESRILSV 313 (678) Q Consensus 234 G~~gn~i~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~ 313 (678) ++.. T Consensus 113 -----gl~~----------------------------------------------------------------------- 116 (386) T protein:vir:10 113 -----GILA----------------------------------------------------------------------- 116 (386) T ss_pred -----hhHH----------------------------------------------------------------------- Confidence 0000 Q ss_pred cccccccccchhhhhhhhcCCcceEEEEecCCCccccceeeeeccCcCCccccchhHHHhhhhhhhccchhccccccccc Q lcl|NC_019538. 314 KENDRDIYGSSIYVDEFFINGYSTFIQGVAESWPTEYSGILTFGGGNSGNSTASAGDWIEGWDMFSDREHVDVNLFIAGS 393 (678) Q Consensus 314 ~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~l~gg~dg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 393 (678) +... ...+ .+.+.. T Consensus 117 -------------l~~~----~~~~-------------------------------------------------~~~p~i 130 (386) T protein:vir:10 117 -------------LLSA----ENTV-------------------------------------------------KVQPRI 130 (386) T ss_pred -------------hhhh----cccc-------------------------------------------------cccccc Confidence 0000 0000 000111 Q ss_pred cccCcccchhHHHHHHHHHHHhcCCeEEEEccccchhccccccCCHHHHHHHHhcccccccchhhccccccceEEEEcCe Q lcl|NC_019538. 394 CAGEGVEIASTVQKSVAAICDERQDCLGWISPPREYMVNLPVATAVKKMVEWRRGVTDSGVVVDDNMNIGTTYSSTSANY 473 (678) Q Consensus 394 ~~~~~~~~~~~v~~~l~~~~~~~~~~~~i~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~ 473 (678) ..+|++++...+.++|..+|++++. +.+.+. ...+.+++.+|+.. ++|+|+++|||| T Consensus 131 ~~ap~~~~~~~v~~~l~~~~~~~~~-~~~~~~---------~~~~~~~a~~~~~~-------------~~s~~~~~~~p~ 187 (386) T protein:vir:10 131 LIAPGFSNQKAVADQLVSVADTAAW-LCHSGW---------SNTTDAAAITYREL-------------FGSRRCEVVDPW 187 (386) T ss_pred cccccccchhHHHHHHHHhhcceEE-EEEeCC---------CCCchHHHHHhhhc-------------ccccceEEecCc Confidence 1223444556677888888877653 334443 35667888888753 568899999999 Q ss_pred EEEecccCCceeEechHHHHHHHHHHHhhcCCceECcCCcchhheeeccc---ceecCChhhhhhhhhCCcEEEEEecCC Q lcl|NC_019538. 474 KLQYDKYNDTNRWIPLSADMAGLCARTDTVAQPWMSPAGFNRGQILDVRK---LAIETRQAHRDELYQNSMNPVVGFPGQ 550 (678) Q Consensus 474 ~~v~d~~~~~~~~~pps~~vAG~~a~~d~~~g~~~sPan~~~~~v~~~~~---~~~~~~~~e~~~L~~~gIn~i~~~~~~ 550 (678) ++++|+.++..+++|||+++||++||+|.++||||||+|+++.+|.++.. .+..+++.|++.||++||+++ |+++ T Consensus 188 ~~v~~~~~~~~~~~p~s~~~ag~~a~~D~~~G~~~spaN~~l~gv~~~~~~~~~~~~~~~~~~~~l~~~gi~~~--~~~~ 265 (386) T protein:vir:10 188 YKVWDVETSAHIIQPPSARHAGVMAKVHNTLGFWWSNSNQEILGIDGLCRPVDFKLDDPTCRANLLNAKEVTTT--IQQN 265 (386) T ss_pred eeeeccccccceeechHHHHHHHHHHhhhcCCcEEccCCceeecccccceecccccccCcchhhhhhhcCcEEE--EcCC Confidence 99999999999999999999999999999999999999999988888653 345567899999999999987 4689 Q ss_pred cEEEeccccCCCCccccceeehhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhcCCeeeeEEEEccC Q lcl|NC_019538. 551 GFILYGDKTMSLQPTPFDRINVRRLFNLLKKSISESAKYKLFENNDAFTRNSFRSEVNSYLDSIKSLGGIYDFRVVCDET 630 (678) Q Consensus 551 G~~~wG~rT~~~~~~~~~~i~vrR~~~~i~~si~~~~~~~vfepn~~~~~~~v~~~i~~~L~~l~~~gal~g~~V~~d~~ 630 (678) |+++||+||++.| ++|+||+|||++++|+++|+++++|+|||||++.+|.+|+++|++||++||++|+|.||+|+||++ T Consensus 266 G~~~wG~rT~~~d-~~~~~i~vrR~~~~i~~~~~~~~~~~v~e~~~~~~~~~i~~~i~~~L~~l~~~g~l~g~~v~~d~~ 344 (386) T protein:vir:10 266 GFRVWGDRTCSAD-SKWAFKNVVITNDMIADSLVRNHLWAVDRNITKTYVEDVTEGVNNYLRHLKNIGAIAGGECWVDPE 344 (386) T ss_pred CEEEEcccccCCC-cccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEccc Confidence 9999999999876 489999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCHHHhhCCEEEEEEEEEecCCceEEEEEEEEeecCceeeecc Q lcl|NC_019538. 631 NNTPAVIDRNEFVATILIKPARSINYVSLNFSAVGTSANFDELV 674 (678) Q Consensus 631 ~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~ 674 (678) +||+++|++|+|+++|+++|++|+|||+|++++... .|+|+| T Consensus 345 ~nt~~~~~~G~~~~~i~~~p~~p~e~i~~~~~~~~~--~~~~~~ 386 (386) T protein:vir:10 345 LNSPDQIQQGKVYFDYDFSAYAPAEHITFRSHMVNG--YLTEVV 386 (386) T ss_pred CCCHHHhhCCeEEEEEEEEecCCceeEEEEEEEehh--HHHhhC Confidence 999999999999999999999999999999987666 599999 No 32 >protein:vir:96740 Length: 388 # NCBI annotation: phage tail sheath protein FI-like # Family: family:all:115 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039839;genbank:gi:126010913;genbank:GeneID:5076250 Probab=100.00 E-value=8.6e-90 Score=508.87 Aligned_cols=375 Identities=13% Similarity=0.105 Sum_probs=305.0 Q ss_pred CceecCceEEEEc-CCCcccccCCccceeEEecccCC----CCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHHhcC Q lcl|NC_019538. 1 MALLSPGVESKEN-NMQTTIARSSTGRAALAGKFQWG----PAYQISQLVSETDLIDRFGRPDNQTADSVLSAINFLKYG 75 (678) Q Consensus 1 ~~~~~PGVyveEv-~~~~~i~~v~tsv~afvG~~~~G----pv~~pv~i~s~~~~~~~FG~~~~~~~~~~~v~~~f~ngG 75 (678) |+.-+|||||+|+ +++++|+.++|++.+|||+++.+ |.++|++|.++.|+...||.......+..++..+|.++| T Consensus 4 ~~~~~hGv~v~ev~~g~~~i~~~~tavi~~Vgta~~ad~~~p~~~~~~i~~~~d~~~~~~~~~~~gtl~~al~~~~~~~~ 83 (388) T protein:vir:96 4 IDQFEHNGISIETHEPPPPMGPPGDNVVAWVVTAPDKHADVAFSVPFRVANTADAQYLDSTGNELGTGWHAASETLKKTS 83 (388) T ss_pred CCCCCCceEEEEcCCCcccccccCcceeEEEEecCCCccccccccceeeecchhhhhhhccccccccchhhhHhhhccCC Confidence 6566689999999 59999999999999999999775 899999999999999999988888888899999999999 Q ss_pred CeEEEEEcCCcccccccccccccceeeeecccccccccceeeecccccccccccccccccccccceeeecccccccccce Q lcl|NC_019538. 76 NDLRTVRILDEDTARNSSPFFETIDYTITSPGVDYRIGDDVKILQNGATITTGKVSALNSVGGITFVRFSTAEVVKKAKE 155 (678) Q Consensus 76 ~~~~vvRv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~a~~ 155 (678) ..|+++|+...++... ..+.+ T Consensus 84 ~~~~vv~v~~g~~~~a-----------------------------------------------------t~a~i------ 104 (388) T protein:vir:96 84 VPQYFIVVPEGADDAA-----------------------------------------------------TMANI------ 104 (388) T ss_pred ceEEEEEecccccccc-----------------------------------------------------cccee------ Confidence 9999999853210000 00000 Q ss_pred eecccccccceeeeeeecccccccccceeeeeeccccceeeecccccccccccccccccchhccccccccceeeeccccc Q lcl|NC_019538. 156 LNDYPALQNGWQIQFTSGGPGSGQSATAVLNGIRQDSKIYIRNDEYSRESLLRRDETTETYIDMCESYGIPVVASRYAGL 235 (678) Q Consensus 156 ~~~~~~~~~~~~~~~~s~~~~~g~~a~~~~~~~~~~~~i~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~i~A~~~G~ 235 (678) .|. T Consensus 105 -----------------------------------------------------------------------------ig~ 107 (388) T protein:vir:96 105 -----------------------------------------------------------------------------IGG 107 (388) T ss_pred -----------------------------------------------------------------------------eee Confidence 000 Q ss_pred cccceeEEeccccccccccccccccccccccccccccccceeeeeccccccccccccccceeeeccCCeeeeeeeeeccc Q lcl|NC_019538. 236 TGDNIQVAFIAYKDYYKFGVDGKISSVNTVNLKTFPSGLSFGNITPSSYLEYGPQTKDQFAMIVFVGGSAVESRILSVKE 315 (678) Q Consensus 236 ~gn~i~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~ 315 (678) ++.. + .. T Consensus 108 ~~~~-t--------------------------------g~---------------------------------------- 114 (388) T protein:vir:96 108 IDPT-T--------------------------------GR---------------------------------------- 114 (388) T ss_pred cccc-c--------------------------------ch---------------------------------------- Confidence 0000 0 00 Q ss_pred cccccccchhhhhhhhcCCcceEEEEecCCCccccceeeeeccCcCCccccchhHHHhhhhhhhccchhccccccccccc Q lcl|NC_019538. 316 NDRDIYGSSIYVDEFFINGYSTFIQGVAESWPTEYSGILTFGGGNSGNSTASAGDWIEGWDMFSDREHVDVNLFIAGSCA 395 (678) Q Consensus 316 ~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~l~gg~dg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 395 (678) ..++.++...+. .+..+. T Consensus 115 --------------------------------------------------------~~gl~al~~~~~------~p~il~ 132 (388) T protein:vir:96 115 --------------------------------------------------------RTGIAALTECTE------RPTLIG 132 (388) T ss_pred --------------------------------------------------------hhHHHHhhhccc------ceeEEE Confidence 000000000000 011123 Q ss_pred cCcccchhHHHHHHHHHHHhcCCeEEEEccccchhccccccCCHHHHHHHHhcccccccchhhccccccceEEEEcCeEE Q lcl|NC_019538. 396 GEGVEIASTVQKSVAAICDERQDCLGWISPPREYMVNLPVATAVKKMVEWRRGVTDSGVVVDDNMNIGTTYSSTSANYKL 475 (678) Q Consensus 396 ~~~~~~~~~v~~~l~~~~~~~~~~~~i~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~ 475 (678) .|++++.++|+++|+++|++++ ||+++|.|. .+.+++.+|+... ...+++|+|.++||||++ T Consensus 133 aPg~s~~~~v~~al~~~~~~~~-~~~i~D~p~---------~~~~~~~~~~~~~--------~~~~~~s~~~~~~~P~~~ 194 (388) T protein:vir:96 133 APGFSQNKAVIDALASMAKRLK-CRAVIDGPS---------GSTQDAIDLSGLL--------GGEGTGHDRVYMVDPMPA 194 (388) T ss_pred eeccccchHHHHHHHHHHhhcC-cEEEEeccC---------CchhHHHHHHhhh--------hccCcCcceEEEEeCcee Confidence 3556677889999999999885 799999883 3445566665432 245688999999999999 Q ss_pred EecccCCceeEechHHHHHHHHHHHhhcCCceECcCCcchhheeecc---cceecCChhhhhhhhhCCcEEEEEecCCcE Q lcl|NC_019538. 476 QYDKYNDTNRWIPLSADMAGLCARTDTVAQPWMSPAGFNRGQILDVR---KLAIETRQAHRDELYQNSMNPVVGFPGQGF 552 (678) Q Consensus 476 v~d~~~~~~~~~pps~~vAG~~a~~d~~~g~~~sPan~~~~~v~~~~---~~~~~~~~~e~~~L~~~gIn~i~~~~~~G~ 552 (678) ++|+.++..+++|||+++||++||+| +||||+|+++ ++.|+. .....++++|++.||++|||+|++|+++|+ T Consensus 195 ~~d~~~~~~~~~p~s~~~AG~~a~~D----~~~spaN~~i-~i~g~~~~~~~~~~~~~~~~~~Ln~~gI~~i~~~~~~G~ 269 (388) T protein:vir:96 195 IYSRKAQGNIYVPPSTIAMGAVAAVK----PWESPGNQGV-LIQDVARVIDYNILDKSTEGDLLNRNGVSYFARTSMGGF 269 (388) T ss_pred eecccCCceeeechHHHHHHHHHhhc----CcccccCeeE-EeeeecccccccccCChhhHHhhhhcCceEEEEecCCcE Confidence 99999999999999999999999999 5999999998 466654 345667889999999999999999999999 Q ss_pred EEeccccCCCCccccceeehhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhcCCeeeeEEEEccCCC Q lcl|NC_019538. 553 ILYGDKTMSLQPTPFDRINVRRLFNLLKKSISESAKYKLFENNDAFTRNSFRSEVNSYLDSIKSLGGIYDFRVVCDETNN 632 (678) Q Consensus 553 ~~wG~rT~~~~~~~~~~i~vrR~~~~i~~si~~~~~~~vfepn~~~~~~~v~~~i~~~L~~l~~~gal~g~~V~~d~~~n 632 (678) ++||+||++ |+||+|||+++||+++|+++++|+|||||++.+|.+|+++|+.||++||++|+|.||+|+||+++| T Consensus 270 ~~wG~rT~~-----~~~i~vrR~~~~i~~si~~~~~~~v~epn~~~~~~~i~~~i~~fL~~l~~~Gal~g~~~~~d~~~n 344 (388) T protein:vir:96 270 SLIGNRTVT-----GKFISFVGLEDAIARKLEAASQRAMSKQLTKSFMEQEIKKINLFMQDLVAAEIIPGGEVYLHPTLN 344 (388) T ss_pred EEEcccccC-----CcceeehhhHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCCC Confidence 999999984 999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CHHHhhCCEEEEEEEEEecCCceEEEEEEEEeecCce--eeecc Q lcl|NC_019538. 633 TPAVIDRNEFVATILIKPARSINYVSLNFSAVGTSAN--FDELV 674 (678) Q Consensus 633 t~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~--~~e~~ 674 (678) |+++|++|+|+++|+++|++|+|||+||++...+..+ |+|++ T Consensus 345 t~~~i~~G~~~~~i~~~p~~pae~I~~~~~~~~~~~~~~~~~~~ 388 (388) T protein:vir:96 345 TVERYKNGSWYIVIDYGRYSPNEHMIFHLNAVDRIVEEFIEEVL 388 (388) T ss_pred CHHHhhCCEEEEEEEEEecCCcceEEEEEEEchHHHHHHHHHhC Confidence 9999999999999999999999999999998777666 77777 No 33 >protein:vir:5833 Length: 742 # NCBI annotation: similar to tail sheath protein # Family: family:all:661 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835619;genbank:gi:30044022 Probab=100.00 E-value=3.8e-76 Score=434.04 Aligned_cols=600 Identities=21% Similarity=0.234 Sum_probs=287.8 Q ss_pred cCceEEEEcCCCcccccCCccc-eeEEecccCC-CCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHHhcCCeEEEEE Q lcl|NC_019538. 5 SPGVESKENNMQTTIARSSTGR-AALAGKFQWG-PAYQISQLVSETDLIDRFGRPDNQTADSVLSAINFLKYGNDLRTVR 82 (678) Q Consensus 5 ~PGVyveEv~~~~~i~~v~tsv-~afvG~~~~G-pv~~pv~i~s~~~~~~~FG~~~~~~~~~~~v~~~f~ngG~~~~vvR 82 (678) .=-|-|+|+|++..- -|+|-+ .++||.|.-- |-.-||.++ ..||.| -|.-.-+-.. --+-||..+-|+| T Consensus 1 ~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~-~~~~~~~~~~------~~~~~~~~~~~~~ 71 (742) T protein:vir:58 1 MYRVNVKEVDLSITP-EVGTPVQTALVGAFDLPIPSELPVSVT-PDEFRR-VGSTELSLIA------DSLVGGQEVTVIR 71 (742) T ss_pred Ceeeeeeeeeeeecc-ccCCchhhheeeeecCCCCccccceec-hhHHhh-cccceeeehh------hhhcCcceEEEEc Confidence 235778999876432 234443 5788887653 456777775 567754 4443222111 1233667777776 Q ss_pred cCCcccccccccc----------------------------cc--------------cceeeeecccccccccceeeecc Q lcl|NC_019538. 83 ILDEDTARNSSPF----------------------------FE--------------TIDYTITSPGVDYRIGDDVKILQ 120 (678) Q Consensus 83 v~~~~~~~~~~~~----------------------------~~--------------~~~~~~~~~~~~~~~g~~i~~~~ 120 (678) --+.+...++.-. .+ .+-..-.-.|+.......|++.+ T Consensus 72 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 151 (742) T protein:vir:58 72 PRGETQSLNAAFVVVGGYNVTLGAFNVFYLMFLGYDPQKGYTDVSYVDVQLAGTPTDTILFSYSLDGSSTTHSLTINLNA 151 (742) T ss_pred cCCcccccceeEEEEecceeeehhhheeeeeeeecccCCCcccceeEEEEEccCCCeeEEEeeecCCCcceeEEEEEeec Confidence 5433221111000 00 00000000011111111111111 Q ss_pred ccccccc-------------ccc--------cccccccccceee----------ecccccccccceeecccccc--cce- Q lcl|NC_019538. 121 NGATITT-------------GKV--------SALNSVGGITFVR----------FSTAEVVKKAKELNDYPALQ--NGW- 166 (678) Q Consensus 121 ~~~~~~~-------------~~~--------~~~~~~~~~~~~~----------~~~a~~~~~a~~~~~~~~~~--~~~- 166 (678) ....... +.. +....+.+....+ .+...+.-..++...-..+. +.. T Consensus 152 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 231 (742) T protein:vir:58 152 PSVTLPSNIVPLFFYYEPYTGSITLQSSVNYSGLTLNYTVSKATTPWVYFAEYGTPTSSLTLYKGFYLEGIDLNSFNKQF 231 (742) T ss_pred eeEeeccccceeeeEeccccceEEEeeecccCCCcccceeeeeecCcccccccCCCccceeeeecccccccccCccccee Confidence 1000000 000 0000000000000 00000000000000000000 000 Q ss_pred --eeeeeecccccccccceeeeeec----------cccceeeecccccccccccccccccchhccccccccceeeec--- Q lcl|NC_019538. 167 --QIQFTSGGPGSGQSATAVLNGIR----------QDSKIYIRNDEYSRESLLRRDETTETYIDMCESYGIPVVASR--- 231 (678) Q Consensus 167 --~~~~~s~~~~~g~~a~~~~~~~~----------~~~~i~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~i~A~--- 231 (678) ..+.....-..|.-.-....++. ....+.+....+.+.. ..+........+...... T Consensus 232 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~~~~~~~ 303 (742) T protein:vir:58 232 VVSIENITVNREKGQVLYPSFDVVVHFRDIRGVSANTEYIRFRQVNLNPES--------PNYIERVIGNMTFEFDGERIV 303 (742) T ss_pred eEEEeeeeecccCCceeccceeEEEEEeeccCCCCCccceeeeeeecCCCC--------cceeeecccceeeeeccceee Confidence 00000000000100000000000 0011111111111110 000000000000000000 Q ss_pred cccccccc---eeEEeccccccccccc---------------ccc--------cccccccccc-ccccccceeeeecccc Q lcl|NC_019538. 232 YAGLTGDN---IQVAFIAYKDYYKFGV---------------DGK--------ISSVNTVNLK-TFPSGLSFGNITPSSY 284 (678) Q Consensus 232 ~~G~~gn~---i~v~v~~~~~~~~~~~---------------~~~--------~~~~~~~~~~-~~~~~~~~~~~~~~~~ 284 (678) ..|..-|. +.+.+ ..+..+... .+. ..+....... ..+ ...+...+.... T Consensus 304 ~g~~~~n~~~~~~~~~--~~~~~~~~~~~s~~~~~~~~~~~~v~d~~~~~~~~~~v~~~~t~~~~~p-p~~~~~~e~v~~ 380 (742) T protein:vir:58 304 TGGEYPNQVPFLRVVV--SQDIKQNVAGVEKWVPVGFEGIYSVGDFTVIVNELTNVSIPVTDSAIIP-PMRFTRIEQITL 380 (742) T ss_pred ecccccccccceeeEe--ccccCcCccceeEEEeccccccccccceeeeccccccceeeccccccCC-cccccccceeec Confidence 01111111 00000 000000000 000 0000000000 000 000000000000 Q ss_pred ccccccccccceeeecc--CCeee---eeeeeeccccccccccchhhhhhhhcCCcceEEEEecCCCccccceeeeeccC Q lcl|NC_019538. 285 LEYGPQTKDQFAMIVFV--GGSAV---ESRILSVKENDRDIYGSSIYVDEFFINGYSTFIQGVAESWPTEYSGILTFGGG 359 (678) Q Consensus 285 ~~~~~~~~~~~~~~v~~--~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~l~gg 359 (678) .....|.++... +..+. .++.++...+.....+. .+.... ......+.................+.|| T Consensus 381 -----ngG~~f~v~s~~~~g~~i~~~~as~~~s~ln~~~~V~Gt-~aa~~~-~d~~t~~~v~s~~~alp~~a~sv~laGG 453 (742) T protein:vir:58 381 -----SGGASFSVISNQPYGFNIQDSRHSYWLSPFKDDELIIGT-ELVLPA-LDVSTEFGVSSWEEALPEFSFLMPFQGG 453 (742) T ss_pred -----ccCcceEEEEecccCcceeccCcceEEeccCCceEEEee-hhhccc-cccchheeccccccccceeeEEEeecCC Confidence 000001110000 00000 01111111111110000 000000 0011111111111222222334555666 Q ss_pred cCCccccch----------------hHHHhhhhhhhccchhccccccccccccCcccchhHHHHHHHHHHHhcCCeEE-E Q lcl|NC_019538. 360 NSGNSTASA----------------GDWIEGWDMFSDREHVDVNLFIAGSCAGEGVEIASTVQKSVAAICDERQDCLG-W 422 (678) Q Consensus 360 ~dg~~~~~~----------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~l~~~~~~~~~~~~-i 422 (678) .++...... ++. .++.++. +..++++++ +|++++ ..++.++.++|+..++|+. + T Consensus 454 ~dg~v~v~~~~~D~iG~~~~~d~~~adr-TGL~ALl--ev~eVtILi-----APG~t~-~~v~aav~A~la~a~~Rl~vL 524 (742) T protein:vir:58 454 SDGYIRVDENEPDTIGRVKITPALLANY-ERLLPLL--TEDQFDLVL-----TPYLTF-ADHAGTVNAFINRAENRFLYL 524 (742) T ss_pred ccccccccCCCcccccccccccccccch-hHHHHhh--hcCCCcEEE-----EcCCCc-hHHHHHHHHHHHhhcCCeEEE Confidence 655321100 111 1233332 223444444 445543 3467778888887766654 4 Q ss_pred EccccchhccccccCCHHHHHHHHhcccccccchhhccccccceEEEEcCeEEEecccCCceeEechHHHHHHHHHHHhh Q lcl|NC_019538. 423 ISPPREYMVNLPVATAVKKMVEWRRGVTDSGVVVDDNMNIGTTYSSTSANYKLQYDKYNDTNRWIPLSADMAGLCARTDT 502 (678) Q Consensus 423 ~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~v~d~~~~~~~~~pps~~vAG~~a~~d~ 502 (678) +|+|. .....+++.+|++ +++|+|+++||||+++.+ ++..+++|||+++||++||+|. T Consensus 525 ~D~P~-------~~tt~~~A~a~r~-------------~~nSsraaly~PwVkv~d--~~~~r~vPpSgaIAGL~ARtD~ 582 (742) T protein:vir:58 525 FDIAG-------DDDTENLAISLAG-------------YINSSFATTFFPWVRRLT--NKGMRTVPASLAAYRSIRTTDP 582 (742) T ss_pred EecCC-------CCchHHHHHHHHh-------------ccCCceEEEEeceeeecc--CCcceeechHHHHHHHHHHhcc Confidence 45553 1234466777764 456889999999999876 5678999999999999999999 Q ss_pred cCCceECcCCcchhheeecccceecCChhhhhhhhhCCcEEEEEecCCcEEEeccccCCCCccccceeehhhHHHHHHHH Q lcl|NC_019538. 503 VAQPWMSPAGFNRGQILDVRKLAIETRQAHRDELYQNSMNPVVGFPGQGFILYGDKTMSLQPTPFDRINVRRLFNLLKKS 582 (678) Q Consensus 503 ~~g~~~sPan~~~~~v~~~~~~~~~~~~~e~~~L~~~gIn~i~~~~~~G~~~wG~rT~~~~~~~~~~i~vrR~~~~i~~s 582 (678) ++|+|+||+|+.+. .+ ...++.|++.||++|||+|++| ++|+++||+||+++.+++|+||||||||+||+++ T Consensus 583 erGvw~SPANrgii--~~-----~~~s~se~d~LN~~GINtIrsf-G~G~rlWGnRTlassDs~wryInVRRlfd~Ie~S 654 (742) T protein:vir:58 583 ETGLAPVGARRGVV--TG-----EPVRQVDWEDLYNNRINPIVRV-GNDVLLFGQKTMLNVNSALNRINVRRLLIVMRNR 654 (742) T ss_pred CCceEecCCcceee--ec-----cccchhhHHHHhhCCceEEEEC-CCcEEEEcceecCCCCcccceEeehhhHHHHHHH Confidence 99999999997542 11 2467889999999999999998 7899999999997767799999999999999999 Q ss_pred HHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhcCCeeeeEEEEccCCCCHHHhhCCEEEEEEEEEecCCceEEEEEEE Q lcl|NC_019538. 583 ISESAKYKLFENNDAFTRNSFRSEVNSYLDSIKSLGGIYDFRVVCDETNNTPAVIDRNEFVATILIKPARSINYVSLNFS 662 (678) Q Consensus 583 i~~~~~~~vfepn~~~~~~~v~~~i~~~L~~l~~~gal~g~~V~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~ 662 (678) |+++++|+||||||+.+|++|+.+|++||++||++|+|.||+|+||+ +||++||++|+|+++|+++|++|||||+|||+ T Consensus 655 I~~a~q~~VfEPNd~~L~~sIk~sInafL~~L~aqGALlGfrV~lDe-tNTpeDI~~Gklvv~I~vAP~~PAEfI~lrf~ 733 (742) T protein:vir:58 655 ISQILSSYLFENNTSENRLRAEALVRQYLESLRLRGAVTDYEVAIDS-VTTPTDIDNNTLRARVTVQPARSIEYIDITFV 733 (742) T ss_pred HHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEcC-CCCHHHhhCCEEEEEEEEEccCCcceEEEEEE Confidence 99999999999999999999999999999999999999999999995 68999999999999999999999999999999 Q ss_pred EeecCceee Q lcl|NC_019538. 663 AVGTSANFD 671 (678) Q Consensus 663 ~~~~~~~~~ 671 (678) +.+++++|+ T Consensus 734 it~tga~Fs 742 (742) T protein:vir:58 734 ITPTGVEIT 742 (742) T ss_pred EEecccccC Confidence 999999999 No 34 >protein:vir:63742 Length: 562 # NCBI annotation: Tsh # Family: family:all:2449 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547629;genbank:GeneID:3783475 Probab=100.00 E-value=4.9e-66 Score=378.64 Aligned_cols=534 Identities=12% Similarity=0.072 Sum_probs=320.5 Q ss_pred CceecCceEEEEc-CCCcccccCCccceeEEecccCCCCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHHhcCCeEE Q lcl|NC_019538. 1 MALLSPGVESKEN-NMQTTIARSSTGRAALAGKFQWGPAYQISQLVSETDLIDRFGRPDNQTADSVLSAINFLKYGNDLR 79 (678) Q Consensus 1 ~~~~~PGVyveEv-~~~~~i~~v~tsv~afvG~~~~Gpv~~pv~i~s~~~~~~~FG~~~~~~~~~~~v~~~f~ngG~~~~ 79 (678) =-|++|||||||. |+.+++++++|++++|||.+++||++.|++|+||.||++.||+..--.+..+|+..+|.|||++|| T Consensus 9 ~~~~~Pgv~~~~~~s~~~~~~~~~~~~~~~iG~a~~G~~~~~~~~~~~~~~~~~fg~g~l~~~i~~a~~~~~~~g~~~~~ 88 (562) T protein:vir:63 9 KPVSRPHTEISVDTSGIGGSSSGSEKILCLVGSATGGKPNAVYKVRNYSQAKSVFRSGELLDAIERAWNPGEGTGAGDIL 88 (562) T ss_pred CcccCCceEEEEecCCCcccCCCCCceEEEEEEeCCCCCceeEEEccHHHHHHHhcCCchHHHHHHhccccccCCceEEE Confidence 2358999999999 688999999999999999999999999999999999999999866444555666667789999999 Q ss_pred EEEcCCcccccccccccccceeeeecccccccccceeeecccccccccccccccccccccceeeecccccccccceeecc Q lcl|NC_019538. 80 TVRILDEDTARNSSPFFETIDYTITSPGVDYRIGDDVKILQNGATITTGKVSALNSVGGITFVRFSTAEVVKKAKELNDY 159 (678) Q Consensus 80 vvRv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~a~~~~~~ 159 (678) +|||.+. ..++...+.+..++..+|. |++++.+.....+....+...+..... .....+ T Consensus 89 ~~rv~~a---~~a~~~~~~~~~~a~~~g~---~~n~i~v~~~~~~~~~~~~~~v~~~~~---------------~~~ev~ 147 (562) T protein:vir:63 89 AMRVEEA---KEATFEAEGVKVSSTIYGA---DANDIQVALEDNTITGTKRLSIVFAKE---------------RVNQVY 147 (562) T ss_pred EEEcCCC---ccceeEecceeEEEeeccc---CCCeEEEEEecCCCCCCcceEEEecCC---------------Ccchhh Confidence 9999442 3333334555566666665 456776654333222111111101000 000111 Q ss_pred cccccceeeeeeecccccccccceeeeeecccccee-eeccc-cccccccc-ccccccchhccc-cccccceeeeccccc Q lcl|NC_019538. 160 PALQNGWQIQFTSGGPGSGQSATAVLNGIRQDSKIY-IRNDE-YSRESLLR-RDETTETYIDMC-ESYGIPVVASRYAGL 235 (678) Q Consensus 160 ~~~~~~~~~~~~s~~~~~g~~a~~~~~~~~~~~~i~-~~~~~-~a~~~~~~-~~~~~~~~~~~~-~~~~~~~i~A~~~G~ 235 (678) ...+...... .... .. .+...+.+........ +.... .......+ ............ .-.....+.+.+.|. T Consensus 148 ~~~g~V~~i~-y~g~--~~-~~~~~v~~~~~~~~a~~l~~~~g~~~v~~~~L~~g~~~~~~~l~~~in~~~~~~aky~~~ 223 (562) T protein:vir:63 148 DNLGSIFSIK-YKGT--EA-SATFTVAVDPVTFKATKLTLKAGDKTVKEYDLGSGAYAETNVLISDINNLPDFEAKFFPI 223 (562) T ss_pred hhccceeeee-eecc--cc-cceEEEEecCcceeEEEEEeecCCcceeEEEecCCccchhHHHHHhhccccceEEEeecc Confidence 1111000000 0000 00 0000000000000000 00000 00000000 000000000000 001122345556655 Q ss_pred cccceeEEeccccccccccccccccccccccccccccccceeeeeccccccccccccccceeeeccCCeeeeeeeeeccc Q lcl|NC_019538. 236 TGDNIQVAFIAYKDYYKFGVDGKISSVNTVNLKTFPSGLSFGNITPSSYLEYGPQTKDQFAMIVFVGGSAVESRILSVKE 315 (678) Q Consensus 236 ~gn~i~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~ 315 (678) -++.+++...+.... ..... ...+ + + + T Consensus 224 ~gn~i~~~~~d~~~~--------~~vkt---~~~~--------------v----------------~-t----------- 250 (562) T protein:vir:63 224 GDKNLTTDNFDAQID--------VDIKT---KEAY--------------V----------------K-A----------- 250 (562) T ss_pred CCceeeeeccccccc--------cchhh---hhhh--------------h----------------h-h----------- Confidence 555554322111000 00000 0000 0 0 0 Q ss_pred cccccccchhhhhhhhcCCcceEEEEecC--CCccccceeeeeccCcCCccccchhHHHhhhhhhhccchhccccccccc Q lcl|NC_019538. 316 NDRDIYGSSIYVDEFFINGYSTFIQGVAE--SWPTEYSGILTFGGGNSGNSTASAGDWIEGWDMFSDREHVDVNLFIAGS 393 (678) Q Consensus 316 ~~~~~~~~~~~~~~~~~~~~~~~v~~~~~--~~~~~~~~~~~l~gg~dg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 393 (678) ...+....+....|+..... ...+ ......|+||.||.... ++..+++.++..+ .+.+.+ T Consensus 251 ---------~~~d~~~~~~~~~~v~~~~~~~~~la-~~~~~~LtGG~dGt~~~---~~~~al~ale~~~---~~~i~~-- 312 (562) T protein:vir:63 251 ---------VGGDIEKQTAYNGYVDFEFDRSKEIA-NFPLTKLTGGDNGTIPE---SWADKFSYFANEG---GYYLVP-- 312 (562) T ss_pred ---------hhhhhhhcccccceeeeeecccccee-cccceeeecCCCCCchh---hHHHHHHHHHhCC---cEEEEe-- Confidence 00000000111112111110 1111 12345789999986542 3556666665443 233332 Q ss_pred cccCcccchhHHHHHHHHHHHhcCC----eEEEEccccchhccccccCCHHHHHHHHhcccccccchhhccccccceEEE Q lcl|NC_019538. 394 CAGEGVEIASTVQKSVAAICDERQD----CLGWISPPREYMVNLPVATAVKKMVEWRRGVTDSGVVVDDNMNIGTTYSST 469 (678) Q Consensus 394 ~~~~~~~~~~~v~~~l~~~~~~~~~----~~~i~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~ 469 (678) .++..+++.++.+||+++++ ++++++.+ .+.+++++..+.. .+++.+.++ T Consensus 313 -----~t~d~av~~~l~a~vkr~~~~g~~~~aVlg~~--------~~~~~~~~~~~a~-------------~~n~ervv~ 366 (562) T protein:vir:63 313 -----LTSKQAVHAEALQFVRDCSYNGNPMRVFVGGG--------IGESMEQLFTRAI-------------GLQNERAGL 366 (562) T ss_pred -----cCCCHHHHHHHHHHHHHHHhCCCcEEEEecCC--------CCCCHHHHHHHhh-------------hcCCCcEEE Confidence 13456788999999987775 78887765 3557777776543 356778889 Q ss_pred EcCeEEEecccCCceeEech---HHHHHHHHHHHhhcCCceECcCCcchhheeecccceecCChhhhhhhhhCCcEEEEE Q lcl|NC_019538. 470 SANYKLQYDKYNDTNRWIPL---SADMAGLCARTDTVAQPWMSPAGFNRGQILDVRKLAIETRQAHRDELYQNSMNPVVG 546 (678) Q Consensus 470 ~~p~~~v~d~~~~~~~~~pp---s~~vAG~~a~~d~~~g~~~sPan~~~~~v~~~~~~~~~~~~~e~~~L~~~gIn~i~~ 546 (678) ++|+..+.+. .+..+..|+ ++++||++|.+| +++||.|+.+. . .++...+++.|++.|+++|+++++. T Consensus 367 v~~~~~~~~~-~~~~~~~~~~~~aa~vAGl~A~~~----~~~SlT~~~i~-~---~~v~~~~t~~e~~~li~~Gv~~l~~ 437 (562) T protein:vir:63 367 IGFSGTVKMD-DGRSLKMPGYMFAAQVAGLTCGLE----IGEAITFKNIA-I---ETLDTIYEGSQLDQLNESGIITAEF 437 (562) T ss_pred EecCeeEECC-CCceeeechhHHHHHHHHHhhcCc----hhcCccceeec-c---ccccccCCHHHHHHHHhCCeEEEEE Confidence 9998776654 456677777 789999999886 88999998763 2 4566789999999999999999999 Q ss_pred ecCCcEEEecc-ccC----CCCccccceeehhhHHHHHHHHHHHHHH-HHhcCCCCHHHHHHHHHHHHHHHHHHHhcCCe Q lcl|NC_019538. 547 FPGQGFILYGD-KTM----SLQPTPFDRINVRRLFNLLKKSISESAK-YKLFENNDAFTRNSFRSEVNSYLDSIKSLGGI 620 (678) Q Consensus 547 ~~~~G~~~wG~-rT~----~~~~~~~~~i~vrR~~~~i~~si~~~~~-~~vfepn~~~~~~~v~~~i~~~L~~l~~~gal 620 (678) +++++.++|.. +++ ..++..|++|+++|++|+|++.|++.+. ||+++||+...|..|+..|..||.+||++|+| T Consensus 438 ~~~~~v~~~~iv~~itT~t~~~~~~~~ki~viRv~D~i~~dir~~~~~~yiGk~Nn~~~r~~v~~~i~~~L~~l~~~gaI 517 (562) T protein:vir:63 438 VRNRAVTNFRIVDDVTTFNDKTDPVKSEIGVGEANDFLVSELKISLDNEYIGTKIIDTSASLVKNFVQSFLDRKKLAKEI 517 (562) T ss_pred ecCCcEEEEEeeccceecCCCCCchhhhhhhhHHHHHHHHHHHHHHHhcCCccccChHHHHHHHHHHHHHHHHHHhCCcc Confidence 88888888754 322 2234579999999999999999988765 99999999999999999999999999999999 Q ss_pred eeeEEEEccCCCCHHHhhCCEEEEEEEEEecCCceEEEEEEEEeecCcee Q lcl|NC_019538. 621 YDFRVVCDETNNTPAVIDRNEFVATILIKPARSINYVSLNFSAVGTSANF 670 (678) Q Consensus 621 ~g~~V~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~ 670 (678) .||... +-+.++..++++|++.++|+.|+|||++++.......+- T Consensus 518 ~~~~~~-----dv~v~~~~d~~~v~~~v~pv~~mekIy~ti~~~~~~~~~ 562 (562) T protein:vir:63 518 QDYSPE-----EVQVVIEGDVARISLTVFPIRSMKKIEVSLVYRQQILTA 562 (562) T ss_pred cCCCcc-----ceEEEecCCEEEEEEEEEEcccceEEEEEEEEeeeeecC Confidence 998532 123345667899999999999999999999877665444 No 35 >protein:vir:102819 Length: 648 # NCBI annotation: tail sheath protein # Family: family:all:2449 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874082;genbank:gi:118197689;genbank:GeneID:4496011 Probab=100.00 E-value=5.5e-66 Score=378.37 Aligned_cols=589 Identities=13% Similarity=0.080 Sum_probs=297.7 Q ss_pred Ccee---------cCceEEEEc-CCCcccccCCccceeEEecccCCCCCccEEecCHHHHHHHcCCcCccchhHHHHHHH Q lcl|NC_019538. 1 MALL---------SPGVESKEN-NMQTTIARSSTGRAALAGKFQWGPAYQISQLVSETDLIDRFGRPDNQTADSVLSAIN 70 (678) Q Consensus 1 ~~~~---------~PGVyveEv-~~~~~i~~v~tsv~afvG~~~~Gpv~~pv~i~s~~~~~~~FG~~~~~~~~~~~v~~~ 70 (678) |+.. +|||||||+ ++.++|+||+||+++|||.+++||+|+|++|+||.||++.||+ .++.|++++| T Consensus 1 ma~~~yf~~~~~~~PGVyvee~~sg~~~i~gv~tsva~fvG~a~~Gp~~~p~~v~s~~~~~~~fgg----g~l~~av~~~ 76 (648) T protein:vir:10 1 MAISVYFDGKLIKQLGAYVKTDLSAVKQINGVGTGIVALLGLAEGGETYKPYRLTSFAEAVSIFKG----GPLLEHIKAA 76 (648) T ss_pred CeeeeeeCCCCccCCceEEEEeccccccccCCCCceEEEEEeeCCCCCceeEEecCHHHHHHHhcC----ccHHHHHHHH Confidence 6654 499999999 5889999999999999999999999999999999999999997 3599999999 Q ss_pred HHhcCCeEEEEEcCCcccccccccccccceeeeecccccccccceeeeccccc--ccccccccccccccccceeeecccc Q lcl|NC_019538. 71 FLKYGNDLRTVRILDEDTARNSSPFFETIDYTITSPGVDYRIGDDVKILQNGA--TITTGKVSALNSVGGITFVRFSTAE 148 (678) Q Consensus 71 f~ngG~~~~vvRv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~a~ 148 (678) |+|||++||+|||.+.+. ++..++.+..++..++.| |+++.+..... .........+.......... .- T Consensus 77 F~nGg~~~~~vRv~~~~~---a~~~~~~~~~~a~~~g~~---gn~i~~~v~~~~~~~~~~~~l~v~~~~~~~~~d--~~- 147 (648) T protein:vir:10 77 FIGGAGEVVAVRIGNPTT---ASVSIPVAQNTSDTSPAN---LNFVSYEASTRSNQIYVSFDLDENFTSANEADD--TI- 147 (648) T ss_pred HhCCCcEEEEEEcCCCcc---cceecceeEEeecccCCC---CCceEEEEEEcCCCcCceeEEEEEecCCCcccc--ee- Confidence 999999999999976543 333345566666666655 55554322111 11111000000000000000 00 Q ss_pred cccccceeeccccccccee--eeeeecccccccccceeeeeeccccceeeecccccccccccccccccchhccccccccc Q lcl|NC_019538. 149 VVKKAKELNDYPALQNGWQ--IQFTSGGPGSGQSATAVLNGIRQDSKIYIRNDEYSRESLLRRDETTETYIDMCESYGIP 226 (678) Q Consensus 149 ~~~~a~~~~~~~~~~~~~~--~~~~s~~~~~g~~a~~~~~~~~~~~~i~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~ 226 (678) ..........+.......+ ............ .......+...+.......... .............+.. T Consensus 148 v~~i~~~~~~y~gt~~~~t~~v~~~~~~~~~~~----~~~~~~~~~~~v~~~~~~~~~~----~~~~v~~~~~~~~~~~- 218 (648) T protein:vir:10 148 IFTIYQKHPDFSVTRETFTFPRKFTTPTVLVKR----GSTLFFVDRSIVNAALAAGPAF----QTALINLLKEQLQPTD- 218 (648) T ss_pred EEEeccCCCcccccceecccccccccccccccc----ccceeecCccchhhhhccCccc----hhhhhhchhhhhhhhh- Confidence 0000000000000000000 000000000000 0000000000000000000000 0000000000000000 Q ss_pred eeeeccccccccceeEEeccccccccccccccccccccccccccccccceeeeecccccccccccccc--ceeee----- Q lcl|NC_019538. 227 VVASRYAGLTGDNIQVAFIAYKDYYKFGVDGKISSVNTVNLKTFPSGLSFGNITPSSYLEYGPQTKDQ--FAMIV----- 299 (678) Q Consensus 227 ~i~A~~~G~~gn~i~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~v----- 299 (678) .....+.. +...+.+.. ........ ..................+.... ....+ T Consensus 219 -~~~~~~~s--~~~~~d~~~-------------~~~~~~a~----~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~~t 278 (648) T protein:vir:10 219 -VVQIFDAS--DTNPVDIPL-------------GLFVYEVL----YGGLFGFTKSRLVKTSFGTVDDLLSNPLLFNLSAT 278 (648) T ss_pred -hheecccc--ccccccccc-------------cccccccc----chhhhcCCcchhhhhhhccccccccccceeccccc Confidence 00000000 000000000 00000000 00000000000000000000000 00000 Q ss_pred --ccCCeeeeeeeeeccccccccccchhhhhhhhcCCcceEEEEecCCCccccceeeeeccCcCCcccc---------ch Q lcl|NC_019538. 300 --FVGGSAVESRILSVKENDRDIYGSSIYVDEFFINGYSTFIQGVAESWPTEYSGILTFGGGNSGNSTA---------SA 368 (678) Q Consensus 300 --~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~l~gg~dg~~~~---------~~ 368 (678) ..+...... +...+..++...+... .+.++.......... .....|+||.||..+. +. T Consensus 279 p~~~~~~~~~~---~~~~~~~~~~~v~~~~-------~~~l~~~~~~p~~~~-~~~t~L~GGtdG~~p~s~~~~~~~~~~ 347 (648) T protein:vir:10 279 PFFDGSDYQDY---TSLSDPANWFAKDAYT-------INHLVDTTINPHILA-TRIFSLSGGTNGDDGTGYYQTAVSNYI 347 (648) T ss_pred ccccccceeee---eccccccceeeeeccc-------hhhcccccccCcccc-cccceecccccCCCcccccccccccch Confidence 000000000 0000000111000000 111111111110000 1112589999997753 45 Q ss_pred hHHHhhhhhhhccchhccc---cccccccccCcccchhHHHHHHHHHHHhcC---------CeEEEEccccchhcccccc Q lcl|NC_019538. 369 GDWIEGWDMFSDREHVDVN---LFIAGSCAGEGVEIASTVQKSVAAICDERQ---------DCLGWISPPREYMVNLPVA 436 (678) Q Consensus 369 ~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~v~~~l~~~~~~~~---------~~~~i~d~p~~~~~~~~~~ 436 (678) .+|.++++++...+.+-+- .+.+........++.++++.++++||.++. ..++++.+ .++ T Consensus 348 ~d~~d~l~~~~~~~~~~ivp~~~~~~~~~~~~~lt~~q~i~a~a~shv~~~s~~~~~~~r~~~~~~vg~--------~~~ 419 (648) T protein:vir:10 348 NIWSQGLATLEEEEVNFVIPAYKFTNVTQLNDRLTIFKGIASTFLSHVQTMSQVNRRKARVGVFGLPAP--------SPN 419 (648) T ss_pred hhHHHHhhhccCCCceEEEeecccccccccccccCCccchHHHHHHHHHHhhhccccccccCeEEEeCC--------CCc Confidence 7888888888776654321 112222223335567889999999997552 12333332 233 Q ss_pred CCHHHHHHHH--hcccccccchhhccccccceEEEEcCeEEEecccCCceeEech---HHHHHHHHHHHhhcCCceECcC Q lcl|NC_019538. 437 TAVKKMVEWR--RGVTDSGVVVDDNMNIGTTYSSTSANYKLQYDKYNDTNRWIPL---SADMAGLCARTDTVAQPWMSPA 511 (678) Q Consensus 437 ~~~~~~~~~~--~~~~~~~~~~~~~~~~~s~~~~~~~p~~~v~d~~~~~~~~~pp---s~~vAG~~a~~d~~~g~~~sPa 511 (678) ++..+....+ ..+...... ....-..+.+...+.+.+. .-+++...+|| .+++||+++++ .+++||. T Consensus 420 es~~~se~~~~~~~~~~~~a~--~~~~d~~~~~~~~~~~~~~--~~~G~~~~~p~~~~Aa~VAGl~a~l----~~~~s~T 491 (648) T protein:vir:10 420 ESVTASEYLYNRNILNTISAM--FGGTDRAQAVVFPFYSNVF--NDEGKVELLGGEFFASYVAGMHANR----EPQDSIT 491 (648) T ss_pred hhHHHHHHHhhhhccccccee--eeecCCceEEeecccceeE--CCCCcEEecchhhHHHHHHhhhhcc----ccccCcc Confidence 4433322111 111111100 0001112333444444333 23567778898 67788888875 6999999 Q ss_pred CcchhheeecccceecCChhhhhhhhhCCcEEEEEecCC----cEEEeccccCCC--CccccceeehhhHHHHHHHHHHH Q lcl|NC_019538. 512 GFNRGQILDVRKLAIETRQAHRDELYQNSMNPVVGFPGQ----GFILYGDKTMSL--QPTPFDRINVRRLFNLLKKSISE 585 (678) Q Consensus 512 n~~~~~v~~~~~~~~~~~~~e~~~L~~~gIn~i~~~~~~----G~~~wG~rT~~~--~~~~~~~i~vrR~~~~i~~si~~ 585 (678) ||++.++ ++ +..+.+++.|++.|+++||+||.+++++ ++++-..-|..+ ++..|+.|+++|++||+.+.+++ T Consensus 492 ~k~i~~~-~i-d~~~~~t~~qld~L~~~Gv~~ie~~~~~~~~~~~rvv~gITT~~~~~~~~~~eisv~ri~D~l~~~vr~ 569 (648) T protein:vir:10 492 FLPISGI-GA-EPLYNWTYTQKDDLISNRVLFVEKVKTSFGGIVYRIHHNPTTWLGPVTQGFQEFVLRRIDDFLQSYVYK 569 (648) T ss_pred cceeecc-cc-ccccCCCHHHHHHHhcCCcEEEEEecCCcceeeEEEeccceeecCCCCcceeeeeeeehhhHHHHHHHH Confidence 9988633 22 2336899999999999999999988775 355555555432 34579999999999999999987 Q ss_pred HHH-HHhcCCCCHHHHHHHHHHHHHHHHHHHhcCCeeeeE---EEEccCCCCHHHhhCCEEEEEEEEEecCCceEEEEEE Q lcl|NC_019538. 586 SAK-YKLFENNDAFTRNSFRSEVNSYLDSIKSLGGIYDFR---VVCDETNNTPAVIDRNEFVATILIKPARSINYVSLNF 661 (678) Q Consensus 586 ~~~-~~vfepn~~~~~~~v~~~i~~~L~~l~~~gal~g~~---V~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~ 661 (678) .++ +|+++||++..|..||+.|.+||.++++.++|++|. |.++ ++.+++++++.++|++|++||.+++ T Consensus 570 ~l~~~fIG~~n~~~~~~~ik~~i~~~L~~~~~~~~I~~y~~~~v~~~--------~~~~vv~V~~~v~Pv~~i~~I~vti 641 (648) T protein:vir:10 570 NLQEQFIGRKSYGRKTENDIKVYTEALLSNLVGKQIVAYKDVKVTSN--------EDKTVYYVEFFYQPVTEIKFILVTM 641 (648) T ss_pred HHhhhcCcccccHHHHHHHHHHHHHHHhhHhhcCcccCcccceEEEE--------ecCCEEEEEEEEEecceeeEEEEEE Confidence 555 999999999999999999999999999999999974 4332 3458999999999999999999998 Q ss_pred EEeecCceee Q lcl|NC_019538. 662 SAVGTSANFD 671 (678) Q Consensus 662 ~~~~~~~~~~ 671 (678) .-.-. +| T Consensus 642 ~it~~---~~ 648 (648) T protein:vir:10 642 KVTFD---LE 648 (648) T ss_pred EEEec---cC Confidence 64433 22 No 36 >protein:vir:80488 Length: 562 # NCBI annotation: Tsh # Family: family:all:2449 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468473;genbank:gi:157325048;genbank:GeneID:5601446 Probab=100.00 E-value=1.4e-64 Score=370.59 Aligned_cols=535 Identities=12% Similarity=0.068 Sum_probs=321.8 Q ss_pred CceecCceEEEEc-CCCcccccCCccceeEEecccCCCCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHHhcCCeEE Q lcl|NC_019538. 1 MALLSPGVESKEN-NMQTTIARSSTGRAALAGKFQWGPAYQISQLVSETDLIDRFGRPDNQTADSVLSAINFLKYGNDLR 79 (678) Q Consensus 1 ~~~~~PGVyveEv-~~~~~i~~v~tsv~afvG~~~~Gpv~~pv~i~s~~~~~~~FG~~~~~~~~~~~v~~~f~ngG~~~~ 79 (678) =-+..|||||||. |+.+++++++|++++|||.+++||++.|++|+||.||++.||+..--.+..+|+..||.|||++|| T Consensus 9 ~~~~~pgv~~~~~~s~~~~~~~~~~~~~~~ig~a~~G~~~~~~~~~~~~~~~~~f~~g~l~~~i~~a~~~~~~~g~~~~~ 88 (562) T protein:vir:80 9 KPVSRPHTEISVDTSGIGGSSSGSEKILCLVGSATGGKPNAVYKVRNYSQAKSVFRSGELLDAIERAWNPGEGTGAGDIL 88 (562) T ss_pred CcccCCceEEEEecCCcccCCCCCCceEEEEEEeCCCCcceeEEEccHHHHHHHhcCCChHHHHHHhcccccccCceEEE Confidence 2357899999999 688999999999999999999999999999999999999999876555566777778899999999 Q ss_pred EEEcCCcccccccccccccceeeeecccccccccceeeecccccccccccccccccccccceeeecccccccccceeecc Q lcl|NC_019538. 80 TVRILDEDTARNSSPFFETIDYTITSPGVDYRIGDDVKILQNGATITTGKVSALNSVGGITFVRFSTAEVVKKAKELNDY 159 (678) Q Consensus 80 vvRv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~a~~~~~~ 159 (678) +|||.+. .+++...+.+..++...|.| ++++.+..........+...+..... .....+ T Consensus 89 ~~rv~~a---~~a~~~~~~~~~~~~~~g~~---~n~i~v~~~~~~~~~~~~~~v~~~~~---------------~~~ev~ 147 (562) T protein:vir:80 89 AMRVEEA---KEATFEAEGVKVSSTIYGAD---ANDIQVALEDNTITGTKRLSIVFAKE---------------RVNQVY 147 (562) T ss_pred EEEcCCC---CcceEEecceEEEEeecccC---CCceEEEEecCCCCCCcceEEEecCC---------------cceEEe Confidence 9999543 23333344555666666554 55666544332221111111111000 001111 Q ss_pred cccccceeeeeeecccccccccceeeeeecccccee-eec-cccccccccc-ccccccchhccc-cccccceeeeccccc Q lcl|NC_019538. 160 PALQNGWQIQFTSGGPGSGQSATAVLNGIRQDSKIY-IRN-DEYSRESLLR-RDETTETYIDMC-ESYGIPVVASRYAGL 235 (678) Q Consensus 160 ~~~~~~~~~~~~s~~~~~g~~a~~~~~~~~~~~~i~-~~~-~~~a~~~~~~-~~~~~~~~~~~~-~~~~~~~i~A~~~G~ 235 (678) ...+...... +.. ....+...+.+........ +.. .........+ ............ .-.....+.|.+.|. T Consensus 148 ~~~g~v~~i~-y~g---~~~~a~~~i~~~~~~~~a~~l~~~~g~~~v~~~~l~~g~~~~~~~l~~~i~~~~~~tAky~g~ 223 (562) T protein:vir:80 148 DNLGSIFSIK-YKG---TEASATFTVAVDPVTFKATKLTLKAGDKTVKEYDLGSGAYAETNVLISDINNLPDFEAKFFPI 223 (562) T ss_pred eccCceeeee-ecc---ccccceeEEEecCccceEEEEEEecCCcceeEEEeCCCccchhhhhhhhhccccceEEEeccc Confidence 1111100000 000 0000000000000000000 000 0000000000 000000000000 001112345555555 Q ss_pred cccceeEEeccccccccccccccccccccccccccccccceeeeeccccccccccccccceeeeccCCeeeeeeeeeccc Q lcl|NC_019538. 236 TGDNIQVAFIAYKDYYKFGVDGKISSVNTVNLKTFPSGLSFGNITPSSYLEYGPQTKDQFAMIVFVGGSAVESRILSVKE 315 (678) Q Consensus 236 ~gn~i~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~ 315 (678) -++.+++........ . +... . .. ++ .... T Consensus 224 ~~n~i~~~~~d~~~~--~------~~kt---~----------------------------~~-----------~v-~~~~ 252 (562) T protein:vir:80 224 GDKNLTTDNFDAQID--V------DIKT---K----------------------------EA-----------YV-KAVG 252 (562) T ss_pred CCceeeecccccchh--h------hccc---c----------------------------ee-----------ee-eehh Confidence 555554321110000 0 0000 0 00 00 0000 Q ss_pred cccccccchhhhhhhhcCCcceEEEEecCC-CccccceeeeeccCcCCccccchhHHHhhhhhhhccchhcccccccccc Q lcl|NC_019538. 316 NDRDIYGSSIYVDEFFINGYSTFIQGVAES-WPTEYSGILTFGGGNSGNSTASAGDWIEGWDMFSDREHVDVNLFIAGSC 394 (678) Q Consensus 316 ~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~-~~~~~~~~~~l~gg~dg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 394 (678) ++ ....+....++...... ..........|+||.||.... ++..+++.+...+. +.+++. T Consensus 253 ~d-----------~~~~~~~n~~v~~~~~~~~~la~~~~~~LtGG~dG~~~~---~~~dal~~Le~~~~---~~i~~~-- 313 (562) T protein:vir:80 253 GD-----------IEKQTAYNGYVEFEFDRSKEIANFPLTKLTGGDNGTIPE---SWADKFSYFANEGG---YYLVPL-- 313 (562) T ss_pred hh-----------hhhcccccceEEEEeccCccccccceeeeeCCCCCCccc---cHHHHHHHHHhCCc---EEEEec-- Confidence 00 00001111222111110 111122346799999986543 35556666654432 223221 Q ss_pred ccCcccchhHHHHHHHHHHHhcCC----eEEEEccccchhccccccCCHHHHHHHHhcccccccchhhccccccceEEEE Q lcl|NC_019538. 395 AGEGVEIASTVQKSVAAICDERQD----CLGWISPPREYMVNLPVATAVKKMVEWRRGVTDSGVVVDDNMNIGTTYSSTS 470 (678) Q Consensus 395 ~~~~~~~~~~v~~~l~~~~~~~~~----~~~i~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~ 470 (678) ++.++++..+.+||+++++ ++++++.+ .+.+++++..+.. .+++.+..++ T Consensus 314 -----t~d~ai~~~~~a~vkr~r~~g~~~~aVvg~~--------~~~~~~~~~~~a~-------------~~n~e~vv~v 367 (562) T protein:vir:80 314 -----TSKQAVHAEALQFVRDCSYNGNPMRVFVGGG--------IGESMEQLFTRAI-------------GLQNERAGLI 367 (562) T ss_pred -----CCChHHHHHHHHHHHHHHhCCCeEEEEecCC--------CCCCHHHHHHHhh-------------hcCCCeEEEE Confidence 2446789999999988876 78887654 4567778777654 3567788889 Q ss_pred cCeEEEecccCCceeEech---HHHHHHHHHHHhhcCCceECcCCcchhheeecccceecCChhhhhhhhhCCcEEEEEe Q lcl|NC_019538. 471 ANYKLQYDKYNDTNRWIPL---SADMAGLCARTDTVAQPWMSPAGFNRGQILDVRKLAIETRQAHRDELYQNSMNPVVGF 547 (678) Q Consensus 471 ~p~~~v~d~~~~~~~~~pp---s~~vAG~~a~~d~~~g~~~sPan~~~~~v~~~~~~~~~~~~~e~~~L~~~gIn~i~~~ 547 (678) +|+..+.+. .+..+..|| ++++||++|.+| +++||.|+.+.+ .++...+++.|++.|+++|+++++.+ T Consensus 368 ~~~~~~~~~-~~~~~~~~~~~~aa~vAGl~Ag~~----~~~S~T~~~i~~----~~v~~~lt~~e~~~li~~G~l~l~~~ 438 (562) T protein:vir:80 368 GFSGTVKMD-DGRSLKMPGYMFAAQVAGLTCGLE----IGEAITFKNIAI----ETLDTIYEGSQLDQLNESGIITAEFV 438 (562) T ss_pred ecCeeEECC-CCceeeechhHHHHHHHHHHhcCc----cccCccceeecc----ccccccCCHHHHHHHHhCCeEEEEEe Confidence 998776654 455666666 889999999886 889999988643 35667899999999999999999998 Q ss_pred cCCcEEEecc-c---cC-CCCccccceeehhhHHHHHHHHHHHHH-HHHhcCCCCHHHHHHHHHHHHHHHHHHHhcCCee Q lcl|NC_019538. 548 PGQGFILYGD-K---TM-SLQPTPFDRINVRRLFNLLKKSISESA-KYKLFENNDAFTRNSFRSEVNSYLDSIKSLGGIY 621 (678) Q Consensus 548 ~~~G~~~wG~-r---T~-~~~~~~~~~i~vrR~~~~i~~si~~~~-~~~vfepn~~~~~~~v~~~i~~~L~~l~~~gal~ 621 (678) ++++.++|.. + |. ..++..|++|+++|++|+|++.|++.+ +||+++||+...|..|+..|..||.+||++|+|. T Consensus 439 ~~~~v~~~riv~~itT~t~~~~~~~~ki~viRv~D~i~~dir~~~~~~yIGk~Nn~~~r~~v~~~i~~~L~~l~~~gaI~ 518 (562) T protein:vir:80 439 RNRAVTNFRIVDDVTTFNDKTDPVKSEIGVGEANDFLVSELKISLDNEYIGTKIIDTSASLVKNFVQSFLDRKKLAKEIQ 518 (562) T ss_pred cCCcEEEEEeeccceeccCCCCchhhhhhhhHHHHHHHHHHHHHHHhcCCccccChHHHHHHHHHHHHHHHHHHhCCccc Confidence 8887777732 2 22 233568999999999999999999887 5999999999999999999999999999999999 Q ss_pred eeEEEEccCCCCHHHhhCCEEEEEEEEEecCCceEEEEEEEEeecCcee Q lcl|NC_019538. 622 DFRVVCDETNNTPAVIDRNEFVATILIKPARSINYVSLNFSAVGTSANF 670 (678) Q Consensus 622 g~~V~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~ 670 (678) +|... +-+.++++++++|++.++|+.|+|||++++.......+- T Consensus 519 ~~~~~-----dv~v~~~~d~~~v~~~v~Pv~~mekIy~ti~~~~~~~~~ 562 (562) T protein:vir:80 519 DYSPE-----EVQVVIEGDIARISLTVFPIRSMKKIEVSLVYRQQILTA 562 (562) T ss_pred CCCcc-----ceEEEecCCEEEEEEEEEEcccceEEEEEEEEEeeeecC Confidence 98532 123345678899999999999999999999877665444 No 37 >protein:vir:79798 Length: 717 # NCBI annotation: tail sheath subunit # Family: family:all:12069 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429630;genbank:gi:156564120;genbank:GeneID:5525563 Probab=100.00 E-value=1.7e-64 Score=370.23 Aligned_cols=627 Identities=13% Similarity=0.107 Sum_probs=304.3 Q ss_pred Cc----e-ecCceEEEEcCCC----cccccCCccceeEEecccCCCCCccEEecCHHHHHHHcCCcCccch------hHH Q lcl|NC_019538. 1 MA----L-LSPGVESKENNMQ----TTIARSSTGRAALAGKFQWGPAYQISQLVSETDLIDRFGRPDNQTA------DSV 65 (678) Q Consensus 1 ~~----~-~~PGVyveEv~~~----~~i~~v~tsv~afvG~~~~Gpv~~pv~i~s~~~~~~~FG~~~~~~~------~~~ 65 (678) |. | --||+-+.--|+- +..+...|--..+.|.+-.|||.+||+|+-... +..||+-...+. +-- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~ 79 (717) T protein:vir:79 1 MAGFDQYQAIPGHNARFKDGNLNLKSDPNPRETESVVLLGTATDGPVMQPVRVTPETA-YNIFGKVAHENGVYNGATLLP 79 (717) T ss_pred CCchhhhhcCCCceeeeecCceecCCCCCccccceEEEEeeccCCcccCceeeChhHH-HhhhhhhhhhcccccchhhhH Confidence 44 3 3699999877652 357788888899999999999999999995554 589998654432 223 Q ss_pred HHHHHHHhcCCeEEEEEcCCcccccccccccccceeeeeccccccccc-ceeeecccccccccccccccccccccceeee Q lcl|NC_019538. 66 LSAINFLKYGNDLRTVRILDEDTARNSSPFFETIDYTITSPGVDYRIG-DDVKILQNGATITTGKVSALNSVGGITFVRF 144 (678) Q Consensus 66 ~v~~~f~ngG~~~~vvRv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g-~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 144 (678) +....+..|..+....|+.+-+..+ +.+...-.-....+. ....| +.+.. . + ..+..+..++++..++ T Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~-~~~~~~~~~~~----~-~---~~~~~~~~~~~~~~~~ 148 (717) T protein:vir:79 80 KFEELWAAGNRDIRLMRTTGVNAVS--SLLGTSYSKNSKEVA-EDKLGGAQARG----N-V---AATFTLPNGGIVEATF 148 (717) T ss_pred HHHHHHhcCCcceEEEEecchhHHH--HHhhcccccchhhHH-HHhhccccccc----c-e---EEEEEcCCCceeeeee Confidence 4455666788889999996543211 111100000000000 00000 00000 0 0 0011112222222221 Q ss_pred cccccccccceeecccccccceeeeeeecccccccccceeeeeeccccceeeecccccccccccccccccchhccccccc Q lcl|NC_019538. 145 STAEVVKKAKELNDYPALQNGWQIQFTSGGPGSGQSATAVLNGIRQDSKIYIRNDEYSRESLLRRDETTETYIDMCESYG 224 (678) Q Consensus 145 ~~a~~~~~a~~~~~~~~~~~~~~~~~~s~~~~~g~~a~~~~~~~~~~~~i~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~ 224 (678) +.- +....+-...+.-...+.++.................+.....-...+.............+.......+.+.+ T Consensus 149 ~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 225 (717) T protein:vir:79 149 LLK---ARGVIIPPNNYTLDVGTEEDMKAGTQPTFAQVLLNENVADMESEITVSYEFTYKDAQGETKTSEVLDNNTDKDG 225 (717) T ss_pred eee---ecceEeCCCcceEeccChhhhhcCCCchhhhhhhccchhhccceeEEEEEEEeecccCcchhhhhhcCCCCCCC Confidence 100 00000000000000011111110000000000000000000000001111110000000011111111222222 Q ss_pred cceeeec-----------ccc--ccccceeEEecccccccccccccccccc--------ccccccccccccceeeeeccc Q lcl|NC_019538. 225 IPVVASR-----------YAG--LTGDNIQVAFIAYKDYYKFGVDGKISSV--------NTVNLKTFPSGLSFGNITPSS 283 (678) Q Consensus 225 ~~~i~A~-----------~~G--~~gn~i~v~v~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~~~~~~~ 283 (678) .+.+... +.| ...++++|. +.+.+......-.++.. .-.+...........-....+ T Consensus 226 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 303 (717) T protein:vir:79 226 KPMIAKGADVTIKLEHVALAGLKLYADGIEVV--DAKAFTVAGDQLTIHSNSKMKLGASLEAQYAYNLVEVIQPVIELES 303 (717) T ss_pred ceeEEecccceeehhhhhhhhhHHhhcchhhh--hhhheeeecceEEEEecCCcccchhhHHHHHhhHHHhhccceEEee Confidence 2222110 000 112222221 11111111000000000 000000000000001111111 Q ss_pred cccccccccccceeeeccCCe-eeeeeeeecccc-----------ccccccchhhhhhhhcCCcceEEEE---------e Q lcl|NC_019538. 284 YLEYGPQTKDQFAMIVFVGGS-AVESRILSVKEN-----------DRDIYGSSIYVDEFFINGYSTFIQG---------V 342 (678) Q Consensus 284 ~~~~~~~~~~~~~~~v~~~~~-~~~~~~~~~~~~-----------~~~~~~~~~~~~~~~~~~~~~~v~~---------~ 342 (678) ...+. ..+.++..|...+. ..-++....... ..++......++....++..+.... . T Consensus 304 ~~~g~--~~n~~~~~v~~~D~~~~~~~t~~~~~~g~~~~~pl~~ts~dy~~~~~~vdgI~~~~~~~V~~~g~~s~a~a~~ 381 (717) T protein:vir:79 304 IFGGG--VYNDIMRKVESKDGAVTVTITKPESKRGMISEDPLVFKSGDYTNFKMLVDAINNHPFNNVVRARTKPEFEATF 381 (717) T ss_pred cccCc--eeeeeeeEEecCCceEEEEEecccccCcceeccccccccCceeeeeeeecccccCchhheeeeecccccceee Confidence 11111 11222233322221 111111000000 0011111111111110111111000 0 Q ss_pred cCCCccccceeeeeccCcCCccccchhHHHh--h----------hhhhhccchhccccccccccccC--cccchhHHHHH Q lcl|NC_019538. 343 AESWPTEYSGILTFGGGNSGNSTASAGDWIE--G----------WDMFSDREHVDVNLFIAGSCAGE--GVEIASTVQKS 408 (678) Q Consensus 343 ~~~~~~~~~~~~~l~gg~dg~~~~~~~~~~~--~----------~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~v~~~ 408 (678) ..... ......+.||.|+.......-+.. + -.++...+.+++++++++....+ .......++.+ T Consensus 382 ~~g~~--s~d~a~f~Gg~dgl~~~~ee~Y~~lGgk~~d~g~lt~~aays~LE~~dVDlVil~ga~adtt~ga~~d~va~a 459 (717) T protein:vir:79 382 TSTLQ--AAADAKFSGGKDELSLDKEEMYKRLGGEKNEEGFVTKQGAYQYLENYEVDYVIPLGVHADTKLIGKYDDFAYQ 459 (717) T ss_pred eeccc--CchhhccCCCccccccchhhhhccccccccccccccchhhhhhcCcceeEEEEecCccccccccchhhhHHHH Confidence 00000 111223556666654433322210 0 01222233445666665543322 12344567888 Q ss_pred HHHHHHhcC----CeEEEEccccchhccccccCCHHHHHHHHhccccccc--------------chhhccccccceEEEE Q lcl|NC_019538. 409 VAAICDERQ----DCLGWISPPREYMVNLPVATAVKKMVEWRRGVTDSGV--------------VVDDNMNIGTTYSSTS 470 (678) Q Consensus 409 l~~~~~~~~----~~~~i~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------------~~~~~~~~~s~~~~~~ 470 (678) +++||+.+. .++.+++... +.........+|+..+..... .......+ +.|..++ T Consensus 460 lad~caalSal~r~ai~VI~l~s------p~D~~~AtVe~~~~kLs~~Aaa~~~~d~~~a~a~~~~~~~idi-s~y~~vv 532 (717) T protein:vir:79 460 LALACAVMSHYNSVTIGIIPTTT------PSDISLAGVEEHVKKLENYANEFYMRDRFGNIIFDADRNKIDL-GQFIEVV 532 (717) T ss_pred HHHHHHHhhhccccceeeecccc------ccccchhhHHHHHHHHHhhhhhhhhhcchhccccccccccccc-cceeeee Confidence 999997652 3344443211 112222333333332211000 00001122 2344444 Q ss_pred cCeEEEecccCCceeEechHHHHHHHHHHHhhcCCceECcCCcchhheeecccceecCChhhhhhhhhCCcEEEEEecCC Q lcl|NC_019538. 471 ANYKLQYDKYNDTNRWIPLSADMAGLCARTDTVAQPWMSPAGFNRGQILDVRKLAIETRQAHRDELYQNSMNPVVGFPGQ 550 (678) Q Consensus 471 ~p~~~v~d~~~~~~~~~pps~~vAG~~a~~d~~~g~~~sPan~~~~~v~~~~~~~~~~~~~e~~~L~~~gIn~i~~~~~~ 550 (678) +++..++.+.++..++.||+|++||+ |.++|+||||+|+++.++ .++...+++.|++.||++|||||+.++++ T Consensus 533 ~~~~~iv~~~~~~~~~~p~AG~vAGl----dA~rGVwkSPANk~I~GV---vgLa~~lT~sE~d~Ln~aGIntIr~~~Gr 605 (717) T protein:vir:79 533 AGPDFIVRNTRLGQMASTPDASYIGM----VSQLKTQSAPTNKPLPSV---TALRYTYSANQLNRLTKARFATFKYKQDG 605 (717) T ss_pred ecceeEEEcCCCceeecCHHHHHHHH----HhcCCcccccccceeccc---ccCcccCCHHHHHHHhhCCeEEEEEeCCc Confidence 44444555566677888887666555 556899999999876555 56778899999999999999999999999 Q ss_pred cEEEeccccCCCCccccceeehhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhcCCeeeeEEEEccC Q lcl|NC_019538. 551 GFILYGDKTMSLQPTPFDRINVRRLFNLLKKSISESAKYKLFENNDAFTRNSFRSEVNSYLDSIKSLGGIYDFRVVCDET 630 (678) Q Consensus 551 G~~~wG~rT~~~~~~~~~~i~vrR~~~~i~~si~~~~~~~vfepn~~~~~~~v~~~i~~~L~~l~~~gal~g~~V~~d~~ 630 (678) |+++||+||++++++.|+||+|||++++|+++|+++++|+|||||++.+|.+|+.+|++||++||++|+|.||++++ T Consensus 606 GirVWGaRTtasd~sdWryInVRRl~D~Ie~sIr~al~~yVgEPNd~~tr~~Ik~sI~afL~~L~r~GAI~Gykvdv--- 682 (717) T protein:vir:79 606 SIGVVDAPTSAHAGSDYTRLSTARIVKEAVNAVREVADPFIGEPNDTGNRNALTAAVDKRLSKMIENKALLGFDFRL--- 682 (717) T ss_pred eEEEEeeeecCCCCcccceeehhhhHHHHHHHHHHHHHHhccccCCHHHHHHHHHHHHHHHHHHHhcCceecceeeE--- Confidence 99999999999988899999999999999999999999999999999999999999999999999999999999876 Q ss_pred CCCHHHhhCCEEEEEEEEEecCCceEEEEEEEEee Q lcl|NC_019538. 631 NNTPAVIDRNEFVATILIKPARSINYVSLNFSAVG 665 (678) Q Consensus 631 ~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~ 665 (678) +||++++++|+|+|+|+++|++|+|||+|+++... T Consensus 683 tnT~~di~~G~l~V~I~vaPv~PaEfI~ititITA 717 (717) T protein:vir:79 683 VVTPQQELLGEGSIELSLEAPNELRRLTTIVSLSA 717 (717) T ss_pred ecChhHhhCCEEEEEEEEEecCcccEEEEEEEEeC Confidence 89999999999999999999999999999998766 No 38 >protein:vir:95741 Length: 587 # NCBI annotation: ORF011 # Family: family:all:2449 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240910;genbank:gi:66394960;genbank:GeneID:5132682 Probab=100.00 E-value=2e-62 Score=358.89 Aligned_cols=561 Identities=12% Similarity=0.082 Sum_probs=322.2 Q ss_pred CceecCceEEEEc-CCCcccccCCccceeEEecccCCCCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHHhcCCeEE Q lcl|NC_019538. 1 MALLSPGVESKEN-NMQTTIARSSTGRAALAGKFQWGPAYQISQLVSETDLIDRFGRPDNQTADSVLSAINFLKYGNDLR 79 (678) Q Consensus 1 ~~~~~PGVyveEv-~~~~~i~~v~tsv~afvG~~~~Gpv~~pv~i~s~~~~~~~FG~~~~~~~~~~~v~~~f~ngG~~~~ 79 (678) =-+.+|||||||. |+.+++++++|++++|||.+++||+++|++++||.||++.||+.+--....+|...||.|||++|| T Consensus 9 ~~~~~pgv~~~~~~~~~~~~~~~~~~~~~~~g~a~~G~~~~~~~~~~~~~~~~~~~~g~l~~~~~~a~~~~~~~g~~~~~ 88 (587) T protein:vir:95 9 RPITRPHASIEVDTSGIGGSAGSSEKVFCLIGQAEGGEPNTVYELRNYSQAKRLFRSGELLDAIELAWGSNPNYTAGRIL 88 (587) T ss_pred cccccCceEEEEecCCccccCCCCCceEEEEEEeCCCCCceeEEeccHHHHHHHhcCcchHHHHHHHhccccCCCceEEE Confidence 2357899999999 688999999999999999999999999999999999999998866444445555666689999999 Q ss_pred EEEcCCcccccccccccccceeeeecccccccccceeeeccccccccccc-ccccccccccceeeecccccccccceeec Q lcl|NC_019538. 80 TVRILDEDTARNSSPFFETIDYTITSPGVDYRIGDDVKILQNGATITTGK-VSALNSVGGITFVRFSTAEVVKKAKELND 158 (678) Q Consensus 80 vvRv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~a~~~~~a~~~~~ 158 (678) ++||.+.+ +++.....+..++..||.| |+.|.++.......... .+.....++....+.+.... .. T Consensus 89 ~~rv~~~~---~a~~~~~~l~~~a~~~G~~---gN~i~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~v-------~s 155 (587) T protein:vir:95 89 AMRIEDAK---PASAEIGGLKITSKIYGNV---ANNIQVGLEKNTLSDSLRLRVIFQDDRFNEVYDNIGNI-------FT 155 (587) T ss_pred EEEcCCCc---eeEEEecCeEEEEeccccc---ccceEEEEecCCCCCceeEEEEEecccceeeeeeccce-------ee Confidence 99995543 3344455677777777765 56777654433222211 11111111111111000000 00 Q ss_pred ccccccceeeeee-ecccccccccceeeeeeccccceeeeccccccccccccc-ccccchhccc-cccccceeeeccccc Q lcl|NC_019538. 159 YPALQNGWQIQFT-SGGPGSGQSATAVLNGIRQDSKIYIRNDEYSRESLLRRD-ETTETYIDMC-ESYGIPVVASRYAGL 235 (678) Q Consensus 159 ~~~~~~~~~~~~~-s~~~~~g~~a~~~~~~~~~~~~i~~~~~~~a~~~~~~~~-~~~~~~~~~~-~~~~~~~i~A~~~G~ 235 (678) ..+.......... .............+.+ +.. .....+-. .......... .-...+.+.|.+.|. T Consensus 156 i~y~g~~~~~~~~v~~~~~t~~a~~~~l~~---g~~---------~v~~yrL~~g~~~~~~~~~~~in~~~~~tAky~g~ 223 (587) T protein:vir:95 156 IKYKGEEANATFSVEHDEETQKASRLVLKV---GDQ---------EVKSYDLTGGAYDYTNAIITDINQLPDFEAKLSPF 223 (587) T ss_pred eeeeccccccceeeeecccceeeeeeeeec---CCc---------eEEEEEecCCchHHHHHHHHhhccccceEEEEecc Confidence 0000000000000 0000000000000000 000 00000000 0000010110 111233456777776 Q ss_pred cccceeEEeccccccccccccccccccccccccccccccceeeeeccccccccccccccceeeeccCCeeeeeeeeeccc Q lcl|NC_019538. 236 TGDNIQVAFIAYKDYYKFGVDGKISSVNTVNLKTFPSGLSFGNITPSSYLEYGPQTKDQFAMIVFVGGSAVESRILSVKE 315 (678) Q Consensus 236 ~gn~i~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~ 315 (678) -++.+.+...+... .... ............+.... .. ...+.......+...+..... T Consensus 224 ~~~~i~~~~~~~~~--~~~v----~~~~~~v~a~~~d~~~~------------~~-~~~~v~~~~~~g~~~~~~~~~--- 281 (587) T protein:vir:95 224 GDKNLESSKLDKIE--NANI----KDKAVYVKAVFGDLEKQ------------TA-YNGIVSFEQLNAEGEVPSNVE--- 281 (587) T ss_pred cCceeEEeecCccc--ccce----ehhhhhhhhhhcceeee------------ee-ceeeeeeecccccceeccchh--- Confidence 66666543211100 0000 00000000000000000 00 000000000000000000000 Q ss_pred cccccccchhhhhhhhcCCcceEEEEecCCCccccceeeeeccCcCCccccchhHHHhhhhhhhccchhccccccccccc Q lcl|NC_019538. 316 NDRDIYGSSIYVDEFFINGYSTFIQGVAESWPTEYSGILTFGGGNSGNSTASAGDWIEGWDMFSDREHVDVNLFIAGSCA 395 (678) Q Consensus 316 ~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~l~gg~dg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 395 (678) .. .......+................|+||.||... .++..+++++...+ .+.+++. T Consensus 282 ----------~~----~~~~~a~~~~~~~~~~~a~~~~t~LtGG~dG~~~---~~y~~~l~ale~~~---~~~i~~~--- 338 (587) T protein:vir:95 282 ----------VE----AGEESATVTATSPIKTIEPFELTKLKGGTNGEPP---ATWADKLDKFAHEG---GYYIVPL--- 338 (587) T ss_pred ----------hh----hcccchheeccccccceeccceeeeecCCCCCCc---ccHHHHHHHHHhCC---cEEEEec--- Confidence 00 0000000000000001111122458999998654 34666677665443 3333321 Q ss_pred cCcccchhHHHHHHHHHHHhcCC----eEEEEccccchhccccccCCHHHHHHHHhcccccccchhhccccccceEEEEc Q lcl|NC_019538. 396 GEGVEIASTVQKSVAAICDERQD----CLGWISPPREYMVNLPVATAVKKMVEWRRGVTDSGVVVDDNMNIGTTYSSTSA 471 (678) Q Consensus 396 ~~~~~~~~~v~~~l~~~~~~~~~----~~~i~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~ 471 (678) ++.++++.++.+||+++++ ++++++.+ .+.+.+++..... .+++.+..+++ T Consensus 339 ----t~d~~v~a~l~a~vk~~~~~g~~~~aVvg~~--------~~~~~~~~~~~a~-------------~~n~ervi~v~ 393 (587) T protein:vir:95 339 ----SSKQSVHAEVASFVKERSDAGEPMRAIVGGG--------FNESKEQLFGRQE-------------SLSNPRVSLVA 393 (587) T ss_pred ----CCCHHHHHHHHHHHHHHHhCCCcEEEEEcCC--------CCCCHHHHHHHHh-------------hcCCCcEEEec Confidence 2456789999999988865 78877654 3567788777654 35567788888 Q ss_pred CeEEEecccCCceeEech---HHHHHHHHHHHhhcCCceECcCCcchhheeecccceecCChhhhhhhhhCCcEEEEEec Q lcl|NC_019538. 472 NYKLQYDKYNDTNRWIPL---SADMAGLCARTDTVAQPWMSPAGFNRGQILDVRKLAIETRQAHRDELYQNSMNPVVGFP 548 (678) Q Consensus 472 p~~~v~d~~~~~~~~~pp---s~~vAG~~a~~d~~~g~~~sPan~~~~~v~~~~~~~~~~~~~e~~~L~~~gIn~i~~~~ 548 (678) |+.++. ..++..+.+|| ++++||++|.+| +++||.|+.+. ..++...+++.|++.|+++|+++++.++ T Consensus 394 ~~~~~~-~~dg~~~~~~~~~~aa~vAGl~Ag~~----~~~SlT~~~i~----~~~v~~~~t~~e~e~ai~~Gvl~l~~~~ 464 (587) T protein:vir:95 394 NSGTFV-MDDGRKNHVPAYMVAVALGGLASGLE----IGESITFKPLR----VSSLDQIYESIDLDELNENGIISIEFVR 464 (587) T ss_pred ccceEe-cCCCceeeechHHHHHHHHHHHhcCc----hhcCccceeee----cccccccCCHHHHHHHHhCCeEEEEEec Confidence 876544 33566677887 789999999886 78899998764 2456678999999999999999999887 Q ss_pred CCcEEE----eccccCC-CCccccceeehhhHHHHHHHHHHHHH-HHHhcCCCCHHHHHHHHHHHHHHHHHHHhcCCeee Q lcl|NC_019538. 549 GQGFIL----YGDKTMS-LQPTPFDRINVRRLFNLLKKSISESA-KYKLFENNDAFTRNSFRSEVNSYLDSIKSLGGIYD 622 (678) Q Consensus 549 ~~G~~~----wG~rT~~-~~~~~~~~i~vrR~~~~i~~si~~~~-~~~vfepn~~~~~~~v~~~i~~~L~~l~~~gal~g 622 (678) +++... .+-.|.. .++..|++|+++|++|+|++.|++.+ +||++|||++..|..|+..|..||.+||++|+|.+ T Consensus 465 ~~~~~~vriv~~itT~t~~~d~~~~~i~viRv~D~i~~dir~~~~~~~iGk~nn~~~r~~v~~~i~~~L~~l~~~gaI~~ 544 (587) T protein:vir:95 465 NRTNTFFRIVDDVTTFNDKSDPVKAEMAVGEANDFLVSELKVQLEDQFIGTRTINTSASIIKDFIQSYLGRKKRDNEIQD 544 (587) T ss_pred CCcceEEEEeecceeccCCCCcchhhhhhhhhHHHHHHHHHHHHHhhCCccccchHHHHHHHHHHHHHHHHHHhCCcccC Confidence 765333 3444432 34467999999999999999999886 59999999999999999999999999999999999 Q ss_pred eEEEEccCCCCHHHhhCCEEEEEEEEEecCCceEEEEEEEEeecCcee Q lcl|NC_019538. 623 FRVVCDETNNTPAVIDRNEFVATILIKPARSINYVSLNFSAVGTSANF 670 (678) Q Consensus 623 ~~V~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~ 670 (678) |... +.+.++...+++|++.+.|+.|+|+|.++++......+- T Consensus 545 ~~~~-----dv~v~~~~d~~~v~~~v~Pv~~mekI~vt~~~~~~~~~~ 587 (587) T protein:vir:95 545 FPAE-----DVQVIVEGNEARISMTVYPIRSFKKISVSLVYKQQTLQA 587 (587) T ss_pred CCcc-----ceEEEecCCEEEEEEEEEEcccceEEEEEEEEeeeeecC Confidence 8552 222334556899999999999999999999876665443 No 39 >protein:vir:80779 Length: 569 # NCBI annotation: putative tail sheath protein # Family: family:all:2449 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504132;genbank:gi:158079319;genbank:GeneID:5666428 Probab=100.00 E-value=1.8e-61 Score=353.67 Aligned_cols=541 Identities=13% Similarity=0.091 Sum_probs=310.8 Q ss_pred Cc--------eecCceEEEEc-CCCcccccCCccceeEEecccCCCCCccEEecCHHHHHHHcCC--cCccchhHHHHHH Q lcl|NC_019538. 1 MA--------LLSPGVESKEN-NMQTTIARSSTGRAALAGKFQWGPAYQISQLVSETDLIDRFGR--PDNQTADSVLSAI 69 (678) Q Consensus 1 ~~--------~~~PGVyveEv-~~~~~i~~v~tsv~afvG~~~~Gpv~~pv~i~s~~~~~~~FG~--~~~~~~~~~~v~~ 69 (678) |+ +..||||+||. ++.+++++++|++++|||.+++||+|+|++|++|.||.+.||+ +.+-.++.|.... T Consensus 1 ~~~~~~~~~~~~~Pgv~~~~~~~~~~~~~~~~~~~~~~ig~a~~G~~~~~~~~~~~~~~~~~f~~g~l~~a~~~a~~~~~ 80 (569) T protein:vir:80 1 MAVEQFPRKKVSRPHTEITVDTSGIGGSSSSSDKTLMLVGSAKGGKPDTVYRFRNYQQAKQVLRSGDLLDAIELAWNASD 80 (569) T ss_pred CeeeeecCCccccCceEEEEecCCCcCCCCCCceeEEEEEEeCCCCCceeEEecCHHHHHHHhcCCchhHHHHhhccCcc Confidence 43 46899999999 5889999999999999999999999999999999999999966 3444556666677 Q ss_pred HHHhcCCeEEEEEcCCcccccccccccccceeeeecccccccccceeeecccccccccccccccc-cccccceeeecccc Q lcl|NC_019538. 70 NFLKYGNDLRTVRILDEDTARNSSPFFETIDYTITSPGVDYRIGDDVKILQNGATITTGKVSALN-SVGGITFVRFSTAE 148 (678) Q Consensus 70 ~f~ngG~~~~vvRv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~a~ 148 (678) +|.|||++||++|+.+.. +++.....+..++...+. |++++.++-............+. ..+ T Consensus 81 ~~~~~~~~~~~~rv~~a~---~a~~~~~~~~~~a~~~g~---~~n~i~v~l~~~~~~~~~~~~v~~~~~----------- 143 (569) T protein:vir:80 81 VNTASAGDILAVRVEDAK---NATLTKGGLTFASTIYGV---DANEIQVALEDNNLTHTKRLTVAFSKD----------- 143 (569) T ss_pred ccccCceEEEEEEcCCCe---eeeeeccceeeeeeeccC---CCceEEEEEecCcCCcceeeEEeeecC----------- Confidence 789999999999995432 233333344444444444 46666654322211110010000 000 Q ss_pred cccccceeecccccccceeeeeeecccccccccceeeeeeccccceeeecccccccccccccccccchhccc--cccccc Q lcl|NC_019538. 149 VVKKAKELNDYPALQNGWQIQFTSGGPGSGQSATAVLNGIRQDSKIYIRNDEYSRESLLRRDETTETYIDMC--ESYGIP 226 (678) Q Consensus 149 ~~~~a~~~~~~~~~~~~~~~~~~s~~~~~g~~a~~~~~~~~~~~~i~~~~~~~a~~~~~~~~~~~~~~~~~~--~~~~~~ 226 (678) .....++.++..... .++... ..+..+.....................-. ........+. ..+... T Consensus 144 -----~~~~~~~~ig~v~si-~ytg~~---~~a~~~~~~~~~~~~a~~l~~~~g~~~~~---~~~v~~~~~~~~~~~~~~ 211 (569) T protein:vir:80 144 -----GYKKVFDNLGKIFSI-QYKGSE---AQANFTIAQDSISKKATTLTLNVGSEPES---TTEVMKYELGQGVYSETN 211 (569) T ss_pred -----CCccccccccceeeE-EEeecc---ccceEEeecCcCcceeEEEEEEecCCcce---eEEEEeeccCCccchhhh Confidence 000011111110000 000000 00000000000000000000000000000 0000000000 000000 Q ss_pred eeeeccccccccceeEEeccccccccccccccccccccccccccccccceeeeeccccccccccccccceeeeccCCeee Q lcl|NC_019538. 227 VVASRYAGLTGDNIQVAFIAYKDYYKFGVDGKISSVNTVNLKTFPSGLSFGNITPSSYLEYGPQTKDQFAMIVFVGGSAV 306 (678) Q Consensus 227 ~i~A~~~G~~gn~i~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~ 306 (678) .+...+++..+...++.......... .... ......+..... T Consensus 212 ~lv~~~~~~~~f~a~~~~~~~~~~~~------------------------------~~~d------~~~~~~~~t~~~-- 253 (569) T protein:vir:80 212 VLVSAINSLPDWEAKFFPIGDKNLPT------------------------------DALE------AVTKVDVKTEAV-- 253 (569) T ss_pred hhhhhcCCccCceEEEEecCCCccee------------------------------hhcc------chhheeccccce-- Confidence 00111111111110000000000000 0000 000000000000 Q ss_pred eeeeeeccccccccccchhhhhhhhcCCcceEEEEecCC-CccccceeeeeccCcCCccccchhHHHhhhhhhhccchhc Q lcl|NC_019538. 307 ESRILSVKENDRDIYGSSIYVDEFFINGYSTFIQGVAES-WPTEYSGILTFGGGNSGNSTASAGDWIEGWDMFSDREHVD 385 (678) Q Consensus 307 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~-~~~~~~~~~~l~gg~dg~~~~~~~~~~~~~~~~~~~~~~~ 385 (678) .+.. ........ .....|+.....+ .+........|+||.||... .++..+++.+...+ T Consensus 254 ---~~~~--------~~~di~~~---~~~~~~v~~~~~~~~~l~~~~~~~LtGG~dG~~~---~~~~~~l~~le~~~--- 313 (569) T protein:vir:80 254 ---FVGA--------LAGDIAKQ---LEYNDYVTVAVDATKPVEDFELTNLTGGSDGTAP---ESWANKFPLLANEG--- 313 (569) T ss_pred ---eeeh--------hHHHHHHh---hcCCceEEEEecCCcceeeecceeecCCCCCCcc---chHHHHHHHHhhCC--- Confidence 0000 00000000 0111222221111 11122234579999998543 34666677665432 Q ss_pred cccccccccccCcccchhHHHHHHHHHHHhcCC----eEEEEccccchhccccccCCHHHHHHHHhcccccccchhhccc Q lcl|NC_019538. 386 VNLFIAGSCAGEGVEIASTVQKSVAAICDERQD----CLGWISPPREYMVNLPVATAVKKMVEWRRGVTDSGVVVDDNMN 461 (678) Q Consensus 386 ~~~~~~~~~~~~~~~~~~~v~~~l~~~~~~~~~----~~~i~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 461 (678) .+.+++. ++.++++.++.+||+++++ ++++++.+ .+.+++++.+++. + T Consensus 314 ~~~i~~~-------t~d~av~~~l~a~vkr~r~~g~~~~aVvg~~--------~~~~~~~~~~~a~-------------~ 365 (569) T protein:vir:80 314 GYYLVPL-------TDKQAVHSEALAFVKDRTDNGDPMRIIVGGG--------TNETVEESITRAT-------------N 365 (569) T ss_pred cEEEEec-------CCChHHHHHHHHHHHHHHhCCCcEEEEecCC--------CCCCHHHHHHHHh-------------h Confidence 3333321 2456799999999999865 78888765 3567888877654 4 Q ss_pred cccceEEEEcCeEEEecccCCceeEech---HHHHHHHHHHHhhcCCceECcCCcchhheeecccceecCChhhhhhhhh Q lcl|NC_019538. 462 IGTTYSSTSANYKLQYDKYNDTNRWIPL---SADMAGLCARTDTVAQPWMSPAGFNRGQILDVRKLAIETRQAHRDELYQ 538 (678) Q Consensus 462 ~~s~~~~~~~p~~~v~d~~~~~~~~~pp---s~~vAG~~a~~d~~~g~~~sPan~~~~~v~~~~~~~~~~~~~e~~~L~~ 538 (678) +++.+.++++||..+++. .+..+.+|+ ++++||++|.++ +++||.|+.+. + .++...+++.|++.|++ T Consensus 366 ~n~e~vv~v~~~~~~~~~-~g~~~~~~~~~~aa~vAG~~A~~~----~~~S~T~k~i~-~---~~i~~~lt~~e~~~li~ 436 (569) T protein:vir:80 366 LRDPRASLVGFSGTRKMD-DGRLLKLPGYMMASQIAGIASGLE----VGEAITFKHFN-V---TSVDRVFESSQLDMLNE 436 (569) T ss_pred cCCCeEEEEecCceeecC-CCcceeechhhHHHHHHHHHhcCc----cccCccceeec-c---ccccccCCHHHHHHHHh Confidence 678899999999988874 455556665 678888887664 89999998764 2 35667899999999999 Q ss_pred CCcEEEEEecCCcEEEecc-c---cC-CCCccccceeehhhHHHHHHHHHHHHH-HHHhcCCCCHHHHHHHHHHHHHHHH Q lcl|NC_019538. 539 NSMNPVVGFPGQGFILYGD-K---TM-SLQPTPFDRINVRRLFNLLKKSISESA-KYKLFENNDAFTRNSFRSEVNSYLD 612 (678) Q Consensus 539 ~gIn~i~~~~~~G~~~wG~-r---T~-~~~~~~~~~i~vrR~~~~i~~si~~~~-~~~vfepn~~~~~~~v~~~i~~~L~ 612 (678) +|+++++.+++++.++|.. + |. ..++..|++|+++|++|+|++.|++.+ +||+++||+...|..|+..|+.||. T Consensus 437 ~G~~~l~~~~~~~~~v~~~vn~itT~t~~~~~~~~~i~viRv~D~i~~dir~~~~~~yiGk~nn~~~r~~v~~~i~~~L~ 516 (569) T protein:vir:80 437 SGVISIEFVRNRTLTAFRVVQDVTTYNDKSDPVKNEMSVGEANDFLVSELKIELDNNFIGTKVIDTSASLIKNFIQSFLD 516 (569) T ss_pred CCeEEEEEecCceEEEEEEeccceecCCCCCchhhhhhhhHHHHHHHHHHHHHHHhhcCcccCChhHHHHHHHHHHHHHH Confidence 9999999988887777744 2 21 233467999999999999999999876 5999999999999999999999999 Q ss_pred HHHhcCCeeeeEEEEccCCCCHHHhhCCEEEEEEEEEecCCceEEEEEEEEeecCcee Q lcl|NC_019538. 613 SIKSLGGIYDFRVVCDETNNTPAVIDRNEFVATILIKPARSINYVSLNFSAVGTSANF 670 (678) Q Consensus 613 ~l~~~gal~g~~V~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~ 670 (678) +||++|+|.||... +-+.++..++++|++.++|+.|+|||++|++......+- T Consensus 517 ~l~~~gaI~~~~~~-----dv~v~~~~d~~~v~~~v~Pv~~~ekI~~ti~~~~~~~~~ 569 (569) T protein:vir:80 517 NKKRAREIQDYTPE-----EVQVVLEGDVASISMTVMPIRSLNKITVQLVYKQQILTA 569 (569) T ss_pred HHHhCCcccCCCcc-----ceEEEecCCEEEEEEEEEEcccccEEEEEEEEeeeeecC Confidence 99999999998531 123345678999999999999999999999877776544 No 40 >protein:vir:99306 Length: 587 # NCBI annotation: putative major tail sheath protein # Family: family:all:2449 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024479;genbank:gi:48696438;genbank:GeneID:2948034 Probab=100.00 E-value=1.8e-60 Score=348.06 Aligned_cols=557 Identities=12% Similarity=0.095 Sum_probs=317.0 Q ss_pred CceecCceEEEEc-CCCcccccCCccceeEEecccCCCCCccEEecCHHHHHHHcCCcCccchhHHHHHHHH----HhcC Q lcl|NC_019538. 1 MALLSPGVESKEN-NMQTTIARSSTGRAALAGKFQWGPAYQISQLVSETDLIDRFGRPDNQTADSVLSAINF----LKYG 75 (678) Q Consensus 1 ~~~~~PGVyveEv-~~~~~i~~v~tsv~afvG~~~~Gpv~~pv~i~s~~~~~~~FG~~~~~~~~~~~v~~~f----~ngG 75 (678) =-+.+|||||||. |+.++++++++++++|||.++|||+++|++++||.||.+.||+.+ +..++.++| .||| T Consensus 9 ~~~~~pgv~~~~~~~~~~~~~~~~~~~~~~~g~a~~G~~~~~~~~~~~~~~~~~~~~g~----l~~~~~~a~~~~~~~g~ 84 (587) T protein:vir:99 9 RPITRPHASIEVDTSGIGGSAGSSEKVFCLIGQAEGGEPNTVYELRNYSQAKRLFRSGE----LLDAIELAWGSNPNYTA 84 (587) T ss_pred cccccCceEEEEecCCccccCCCCCceEEEEEEecCCccceeEEeccHHHHHHHhcCcc----hHHHHHHHhccccCCCc Confidence 3357899999999 578999999999999999999999999999999999999998854 445555555 7999 Q ss_pred CeEEEEEcCCcccccccccccccceeeeecccccccccceeeeccccccccccccccc-ccccccceeeecccccccccc Q lcl|NC_019538. 76 NDLRTVRILDEDTARNSSPFFETIDYTITSPGVDYRIGDDVKILQNGATITTGKVSAL-NSVGGITFVRFSTAEVVKKAK 154 (678) Q Consensus 76 ~~~~vvRv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~a~~~~~a~ 154 (678) ++||++||.+.+ +++.....+..++..+|.| |+.|.+.-............+ ...++....+.+.. . T Consensus 85 ~~~~~~rv~~~~---~a~~~~~~l~~~a~~~G~~---gN~i~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g------~ 152 (587) T protein:vir:99 85 GRILAMRIEDAK---PASAEIGGLKITSKIYGNV---ANNIQVGLEKNTLSDSLRLRVIFQDDRFNEVYDNIG------N 152 (587) T ss_pred eEEEEEEcCCCc---eeEEEecCeEEEEeecccc---ccceEEEEccCCCCcceeEEEEEecccceeeeeecc------c Confidence 999999995543 3444455677777777754 666766444333222111111 11111111110000 0 Q ss_pred eeecccccccceeeeee-ecccccccccceeeeeeccccceeeeccccccccccccccc-ccchhccc-cccccceeeec Q lcl|NC_019538. 155 ELNDYPALQNGWQIQFT-SGGPGSGQSATAVLNGIRQDSKIYIRNDEYSRESLLRRDET-TETYIDMC-ESYGIPVVASR 231 (678) Q Consensus 155 ~~~~~~~~~~~~~~~~~-s~~~~~g~~a~~~~~~~~~~~~i~~~~~~~a~~~~~~~~~~-~~~~~~~~-~~~~~~~i~A~ 231 (678) +....+.......... .............+.+ +.. .....+-... ........ .-...+.+.|. T Consensus 153 -v~~i~y~g~~~~a~~~v~~~~~t~~a~~~~l~~---g~~---------~v~~yrL~~g~~~~~~~~~~~i~~~~~~tAk 219 (587) T protein:vir:99 153 -IFTIKYKGEEANATFSVEHDEETQKASRLVLKV---GDQ---------EVKSYDLTGGAYDYTNAIITDINQLPDFEAK 219 (587) T ss_pred -eeeEEeecccccceeeEeecCcceeeeeeeeec---CCc---------eeEEEEecCCchHHHHHHHhhhccccceeEE Confidence 0000000000000000 0000000000000000 000 0000000000 00000000 11122335666 Q ss_pred cccccccceeEEeccccccccccccccccccccccccccccccceeeeeccccccccccccccceeeeccCCeeeeeeee Q lcl|NC_019538. 232 YAGLTGDNIQVAFIAYKDYYKFGVDGKISSVNTVNLKTFPSGLSFGNITPSSYLEYGPQTKDQFAMIVFVGGSAVESRIL 311 (678) Q Consensus 232 ~~G~~gn~i~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~ 311 (678) +.|.-++++....... ....... +.......... +... . ....++... T Consensus 220 y~~~~~~~i~~~~~~~--~~~~~v~----~~~~~v~a~~~-----------D~~~-~-~~~~~~~~~------------- 267 (587) T protein:vir:99 220 LSPFGDKNLESSKLDK--IENANIK----DKAVYVKAVFG-----------DLEK-Q-TAYNGIVSF------------- 267 (587) T ss_pred eeccCCceeEeecccc--cccceee----eeeeeeehhcc-----------ceee-e-cccceeeee------------- Confidence 6666666554321110 0000000 00000000000 0000 0 000000000 Q ss_pred eccccccccccchhhhhhhhcCCcceEEEEecCCCccccceeeeeccCcCCccccchhHHHhhhhhhhccchhccccccc Q lcl|NC_019538. 312 SVKENDRDIYGSSIYVDEFFINGYSTFIQGVAESWPTEYSGILTFGGGNSGNSTASAGDWIEGWDMFSDREHVDVNLFIA 391 (678) Q Consensus 312 ~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~l~gg~dg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 391 (678) ....+....... . ..........+................|+||.||... .++..+++++...+ .+.+++ T Consensus 268 ~~~~g~~~~~~~---~-~~~~~~~~~~~~~~~~~~~~a~~~~t~LtGG~dG~~~---~sy~~al~ale~~~---~~~i~~ 337 (587) T protein:vir:99 268 EQLNAEGEVPSN---V-EVEAGEESATVTATSPIKTIEPFELTKLKGGTNGEPP---ATWADKLDKFAHEG---GYYIVP 337 (587) T ss_pred eecccccchhhh---h-hhhhccccceeeeeccccceecccceeeecCCCCCcc---ccHHHHHHHHhhCC---cEEEEe Confidence 000000000000 0 0000000001111111111111223458999998654 34666677665443 333332 Q ss_pred cccccCcccchhHHHHHHHHHHHhcCC----eEEEEccccchhccccccCCHHHHHHHHhcccccccchhhccccccceE Q lcl|NC_019538. 392 GSCAGEGVEIASTVQKSVAAICDERQD----CLGWISPPREYMVNLPVATAVKKMVEWRRGVTDSGVVVDDNMNIGTTYS 467 (678) Q Consensus 392 ~~~~~~~~~~~~~v~~~l~~~~~~~~~----~~~i~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~ 467 (678) . +..++++.++.+||+++++ ++++++.+ .+.+.+++..+.. .+++.+. T Consensus 338 ~-------t~d~~i~a~l~a~vk~~r~~g~~~~aVlg~~--------~~~~~~~~~~~a~-------------~~n~e~v 389 (587) T protein:vir:99 338 L-------SSKQSVHAEVASFVKERSDAGEPMRAIVGGG--------FNESKEQLFGRQA-------------SLSNPRV 389 (587) T ss_pred c-------CCCHHHHHHHHHHHHHHHhCCCcEEEEecCC--------CCCCHHHHHHHhh-------------hcCCCcE Confidence 1 2456789999999988865 78887754 3567888877654 3456677 Q ss_pred EEEcCeEEEecccCCceeEech---HHHHHHHHHHHhhcCCceECcCCcchhheeecccceecCChhhhhhhhhCCcEEE Q lcl|NC_019538. 468 STSANYKLQYDKYNDTNRWIPL---SADMAGLCARTDTVAQPWMSPAGFNRGQILDVRKLAIETRQAHRDELYQNSMNPV 544 (678) Q Consensus 468 ~~~~p~~~v~d~~~~~~~~~pp---s~~vAG~~a~~d~~~g~~~sPan~~~~~v~~~~~~~~~~~~~e~~~L~~~gIn~i 544 (678) .+++++..+. ..++..+.+|| ++++||++|.+| +++||.|+.+. ..++...+++.|++.|+++|++++ T Consensus 390 i~v~~~~~~~-~~dg~~~~~~~~~~aa~vAGl~Ag~~----~~~SlT~~~i~----~~~v~~~~t~~e~e~li~~Gvl~l 460 (587) T protein:vir:99 390 SLVANSGTFV-MDDGRKNHVPAYMVAVALGGLASGLE----IGESITFKPLR----VSSLDQIYESIDLDELNENGIISI 460 (587) T ss_pred EEEeccceEe-cCCCceeeechHHHHHHHHHHHhcCc----hhcCccceeee----cccccccCCHHHHHHHHhCCeEEE Confidence 8888876544 23456677777 789999999886 88999998763 245667899999999999999999 Q ss_pred EEecCCc---EEE-eccccCC-CCccccceeehhhHHHHHHHHHHHHH-HHHhcCCCCHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_019538. 545 VGFPGQG---FIL-YGDKTMS-LQPTPFDRINVRRLFNLLKKSISESA-KYKLFENNDAFTRNSFRSEVNSYLDSIKSLG 618 (678) Q Consensus 545 ~~~~~~G---~~~-wG~rT~~-~~~~~~~~i~vrR~~~~i~~si~~~~-~~~vfepn~~~~~~~v~~~i~~~L~~l~~~g 618 (678) +.+++++ +++ .+-.|.. .++..|++|+++|++|+|++.|++.+ ++|+++||++..|..|+..|..||.+||+.| T Consensus 461 ~~~~~~~~~~vriv~~ItT~t~~~~~~~~~i~viRv~D~i~~di~~~~~~~yiGk~Nn~~~r~~i~~~i~~~L~~l~~~g 540 (587) T protein:vir:99 461 EFVRNRTNTFFRIVDDVTTFNDKSDPVKAEMAVGEANDFLVSELKVQLEDQFIGTRTINTSASIIKDFIQSYLGRKKRDN 540 (587) T ss_pred EEecCCcceEEEEeeceeeccCCCCchhhhhhhhhhHHHHHHHHHHHHHhhCCccccchHHHHHHHHHHHHHHHHHHhCC Confidence 9887764 333 3444432 34467999999999999999999886 5999999999999999999999999999999 Q ss_pred CeeeeEEEEccCCCCHHHhhCCEEEEEEEEEecCCceEEEEEEEEeecCcee Q lcl|NC_019538. 619 GIYDFRVVCDETNNTPAVIDRNEFVATILIKPARSINYVSLNFSAVGTSANF 670 (678) Q Consensus 619 al~g~~V~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~ 670 (678) +|.+|... ..+-+....+++|++.++|+.|+|+|.++++......+- T Consensus 541 aI~~~~~~-----dv~v~~~~d~~~v~~~v~Pv~~mekIy~tv~~~~~~~~~ 587 (587) T protein:vir:99 541 EIQDFPAE-----DVQVIVEGNEARISMTVYPIRSFKKISVSLVYKQQTLQA 587 (587) T ss_pred cccCCCcc-----ceEEEecCCEEEEEEEEEEcccceEEEEEEEEEeeeecC Confidence 99998642 111223445799999999999999999999877665444 No 41 >protein:vir:96586 Length: 587 # NCBI annotation: ORF011 # Family: family:all:2449 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238553;genbank:gi:66391266;genbank:GeneID:5130368 Probab=100.00 E-value=6.9e-58 Score=333.95 Aligned_cols=548 Identities=14% Similarity=0.094 Sum_probs=319.7 Q ss_pred CceecCceEEEEc-CCCcccccCCccceeEEecccCCCCCccEEecCHHHHHHHcCCcCccchhHHHHHHHH----HhcC Q lcl|NC_019538. 1 MALLSPGVESKEN-NMQTTIARSSTGRAALAGKFQWGPAYQISQLVSETDLIDRFGRPDNQTADSVLSAINF----LKYG 75 (678) Q Consensus 1 ~~~~~PGVyveEv-~~~~~i~~v~tsv~afvG~~~~Gpv~~pv~i~s~~~~~~~FG~~~~~~~~~~~v~~~f----~ngG 75 (678) =.|.+||||||+. ++..+++++++++.+|||.+++||+++|++|++|.||.+.||+.. +..|+.++| .||| T Consensus 9 ~~~~~Pgv~~~~~~~~~~~~~~~~~~~~~~iG~a~~g~~~~~~~~~~~~~~~~~~g~G~----l~~ai~~a~~~~~~~g~ 84 (587) T protein:vir:96 9 RPIQRPHASIEVDSSGIGGSASNSEKILCLIGKAEGGEPNTVYQVRNYAQAKSVFRSGE----LLDAIELAWGSNPQYTA 84 (587) T ss_pred CcccCCceEEEEecCCccCCCCCCCceEEEEEEecCCCCceeEEEcChHHHHHhhcCCc----HHHHHHHHhccCcCCCc Confidence 3467999999999 577889999999999999999999999999999999999998864 556666666 7999 Q ss_pred CeEEEEEcCCcccccccccccccceeeeecccccccccceeeecccccccccccccc-cccccccceeeecccccccccc Q lcl|NC_019538. 76 NDLRTVRILDEDTARNSSPFFETIDYTITSPGVDYRIGDDVKILQNGATITTGKVSA-LNSVGGITFVRFSTAEVVKKAK 154 (678) Q Consensus 76 ~~~~vvRv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~a~~~~~a~ 154 (678) +.||+|||.++. +++..+..+..++...+ +||+.+.+.-...+........ ....++. T Consensus 85 ~~~~a~rv~~~~---~a~~~~~~~~~~~~~~g---~~~n~i~v~~~~~~~~~~~~~~~~~~~~~~--------------- 143 (587) T protein:vir:96 85 GKILAMRVEDAK---ASQLEKGGLRVTSKIFG---SVSNDIQVALEKNTITDSLRLRVVFQKDNY--------------- 143 (587) T ss_pred eEEEEEecCCCc---cceeecccccccccccC---CCCceEEEEEEeccCCCccceEEEEecCCc--------------- Confidence 999999995432 34444555555554444 4567777655333221111111 1111111 Q ss_pred eeecccccccceee------e---eee-cccccccccceeeeeeccccceeeeccccccccccccccc--ccchhccccc Q lcl|NC_019538. 155 ELNDYPALQNGWQI------Q---FTS-GGPGSGQSATAVLNGIRQDSKIYIRNDEYSRESLLRRDET--TETYIDMCES 222 (678) Q Consensus 155 ~~~~~~~~~~~~~~------~---~~s-~~~~~g~~a~~~~~~~~~~~~i~~~~~~~a~~~~~~~~~~--~~~~~~~~~~ 222 (678) ...+..++..... . ... ...........++.+ +.. .....+.... .......... T Consensus 144 -~~~~~n~G~v~~i~y~g~~~~a~~~~~~~~~~~~A~~l~l~g---g~~---------~v~~yrl~~g~~~~~~~~~~~~ 210 (587) T protein:vir:96 144 -QEVFDNLGNIFSINYKGEGEKATFSVEKDKETQEAKRLVLKV---DEK---------EVKAYELNGGAYSFTNEIITDI 210 (587) T ss_pred -eeeccccCceEEEEecccccceeEeeccCcccceeeeeEEEe---cCc---------eEEEEEeCCCchhhhhhhhhhh Confidence 1111111110000 0 000 000000000000000 000 0000000000 0000001111 Q ss_pred cccceeeeccccccccceeEEeccccccccccccccccccccccccccccccceeeeeccccccccccccccceeeeccC Q lcl|NC_019538. 223 YGIPVVASRYAGLTGDNIQVAFIAYKDYYKFGVDGKISSVNTVNLKTFPSGLSFGNITPSSYLEYGPQTKDQFAMIVFVG 302 (678) Q Consensus 223 ~~~~~i~A~~~G~~gn~i~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~ 302 (678) ...+.+.+.++|.-++.+++.+.+...... .. ....... +.. ....... .+...+... T Consensus 211 ~~~~~~tAky~g~~~n~~~v~v~d~~~~~~--~k----~~~~y~~-t~~-~di~~~~--------------~~~~~~~~~ 268 (587) T protein:vir:96 211 NELPDFEAKLSPFGDKNLESRKLDEATDVD--IK----GKAVYVK-AVF-GDIENQT--------------QYNQYVKFE 268 (587) T ss_pred ccccceEEEeecccCceeEEEeeccccccc--cc----eEEEeeh-hhh-hhhhhhh--------------ccccceeec Confidence 223456788888888887764422110000 00 0000000 000 0000000 000000000 Q ss_pred CeeeeeeeeeccccccccccchhhhhhhhcCCcceEEEEecCCCccccceeeeeccCcCCccccchhHHHhhhhhhhccc Q lcl|NC_019538. 303 GSAVESRILSVKENDRDIYGSSIYVDEFFINGYSTFIQGVAESWPTEYSGILTFGGGNSGNSTASAGDWIEGWDMFSDRE 382 (678) Q Consensus 303 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~l~gg~dg~~~~~~~~~~~~~~~~~~~~ 382 (678) +... ....... .... ........................|+||.||..+. ++...+++++.. T Consensus 269 ~~~~----------~~~~~~~-~~v~---~~~~~~~~~~~~~~~~~~~~~~~aLtGG~dG~~~~---~y~~~l~ale~~- 330 (587) T protein:vir:96 269 QLPE----------QASEPSD-VEVH---AETESATVTATSKPKAIEPFELTKLSGGTNGEPPT---SWSAKLEKFKNE- 330 (587) T ss_pred cccc----------hhhhhhc-cccc---ccccceeeeecccccccccccceeeecCCCCCCcc---cHHHHHHHHhhC- Confidence 0000 0000000 0000 00000000011111111112234589999986643 455566666543 Q ss_pred hhccccccccccccCcccchhHHHHHHHHHHHhcCC----eEEEEccccchhccccccCCHHHHHHHHhcccccccchhh Q lcl|NC_019538. 383 HVDVNLFIAGSCAGEGVEIASTVQKSVAAICDERQD----CLGWISPPREYMVNLPVATAVKKMVEWRRGVTDSGVVVDD 458 (678) Q Consensus 383 ~~~~~~~~~~~~~~~~~~~~~~v~~~l~~~~~~~~~----~~~i~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 458 (678) +.+.+++. ++.++++..+.+||+++++ ++++++.+ .+.+++++.+.+. T Consensus 331 --~~~~i~~~-------t~d~ai~~~l~a~vk~~r~~gk~~~aVlg~~--------~~~~~~~~~~~a~----------- 382 (587) T protein:vir:96 331 --GGYYIVPL-------TDRQSVHSEVATFVKNRSDAGEPMRAIVGGG--------TSETKEKLFGRQA----------- 382 (587) T ss_pred --CcEEEEec-------CCCHHHHHHHHHHHHHHHhCCCeEEEEecCC--------CCCCHHHHHHHHh----------- Confidence 33333331 2446789999999988865 78877654 3567777776654 Q ss_pred ccccccceEEEEcCeEEEecccCCceeEech---HHHHHHHHHHHhhcCCceECcCCcchhheeecccceecCChhhhhh Q lcl|NC_019538. 459 NMNIGTTYSSTSANYKLQYDKYNDTNRWIPL---SADMAGLCARTDTVAQPWMSPAGFNRGQILDVRKLAIETRQAHRDE 535 (678) Q Consensus 459 ~~~~~s~~~~~~~p~~~v~d~~~~~~~~~pp---s~~vAG~~a~~d~~~g~~~sPan~~~~~v~~~~~~~~~~~~~e~~~ 535 (678) .+++.+..+++++..+++.. +.....|+ ++++||++|.++ +++||.|+.+.+ .++...+++.|++. T Consensus 383 --~~n~e~vi~v~~~~~~~~~~-~~~~~~~~~~~aa~vAG~~Ag~~----~~~S~T~~~~~~----~~v~~~~t~~e~~~ 451 (587) T protein:vir:96 383 --ILNNPRVALVANSGKFVMGN-GRILQAPAYMVASAVAGLVSGLD----IGESITFKPLFV----NSLDKVYESEELDE 451 (587) T ss_pred --hcCCCcEEEEecceEEecCC-CceeeechhhHHHHHHHHHhcCc----cccCccceeeec----ccccccCCHHHHHH Confidence 35677888888888877653 44444443 688999999775 889999987642 34667899999999 Q ss_pred hhhCCcEEEEEecCCcEEEecc-ccC----CCCccccceeehhhHHHHHHHHHHHHH-HHHhcCCCCHHHHHHHHHHHHH Q lcl|NC_019538. 536 LYQNSMNPVVGFPGQGFILYGD-KTM----SLQPTPFDRINVRRLFNLLKKSISESA-KYKLFENNDAFTRNSFRSEVNS 609 (678) Q Consensus 536 L~~~gIn~i~~~~~~G~~~wG~-rT~----~~~~~~~~~i~vrR~~~~i~~si~~~~-~~~vfepn~~~~~~~v~~~i~~ 609 (678) |.++|+.+++.+.+++.++|.. +++ ..++..|++|+++|++|+|++.|++.+ ++|+++||+...|..|+..|.. T Consensus 452 ~i~~G~~~l~~~~~~~~~v~~~vnsitT~t~~~~~~~~~i~virv~D~i~~di~~~~~~~yiGk~nn~~~r~~v~~~i~~ 531 (587) T protein:vir:96 452 LNENGIITIEFVRNRMTTMFRIVDDVTTFPDKNDPVKSEMALGEANDFLVSELKILLEEQYIGTRTINTSASQIKDFVQS 531 (587) T ss_pred HHhCCeEEEEEecCCcEEEEEeeccceecCCCCCchhhhhhhHHHHHHHHHHHHHHHHhcCCccccCHHHHHHHHHHHHH Confidence 9999999999988887777744 332 233457999999999999999999987 5899999999999999999999 Q ss_pred HHHHHHhcCCeeeeEEEEccCCCCHHHhhCCEEEEEEEEEecCCceEEEEEEEEeecCcee Q lcl|NC_019538. 610 YLDSIKSLGGIYDFRVVCDETNNTPAVIDRNEFVATILIKPARSINYVSLNFSAVGTSANF 670 (678) Q Consensus 610 ~L~~l~~~gal~g~~V~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~ 670 (678) ||.+|+++|+|.+|... +.+-++...+++|++.++|+.|+|||.++++.....++- T Consensus 532 ~L~~l~~~g~I~~~~~~-----dv~v~~~~D~~~v~~~v~Pv~~mekIy~tv~~~~~~~~~ 587 (587) T protein:vir:96 532 YLGRKKRDNEIQDFPPE-----DVQVIIEGNEARISLTIFPIRALKKISVSLVYRQQTLQA 587 (587) T ss_pred HHHHHHhCCcccCCCcc-----ceEEEecCCEEEEEEEEEEcccceEEEEEEEEEeeeecC Confidence 99999999999998542 122223455799999999999999999998865554333 No 42 >protein:vir:100829 Length: 607 # NCBI annotation: hypothetical protein # Family: family:all:2449 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164738;genbank:gi:56693151;genbank:GeneID:3197462 Probab=100.00 E-value=9.1e-53 Score=305.90 Aligned_cols=563 Identities=13% Similarity=0.129 Sum_probs=309.7 Q ss_pred CceecCceEEEEc-CCCcccccCCccceeEEecccCCCCCccEEecCHHHHHHHcCCc--CccchhHHHHHHHHHhcCCe Q lcl|NC_019538. 1 MALLSPGVESKEN-NMQTTIARSSTGRAALAGKFQWGPAYQISQLVSETDLIDRFGRP--DNQTADSVLSAINFLKYGND 77 (678) Q Consensus 1 ~~~~~PGVyveEv-~~~~~i~~v~tsv~afvG~~~~Gpv~~pv~i~s~~~~~~~FG~~--~~~~~~~~~v~~~f~ngG~~ 77 (678) =-+.+||||+++. |+..+++++++++.+|||.+++||+++|++++||.|+.+.||+. .+--.+.|.+..||.|||+. T Consensus 18 ~~~~~pgv~~~~~~~~~~~~~~~~~~~~~~iG~a~~G~~~~~~~~~~~~~a~~~f~~g~l~~a~~~a~~~~~~~~~g~~~ 97 (607) T protein:vir:10 18 FYDSRPHVETNFDDSRLSNTASDSAKNIFMLGSATNGDPTKVYEIRTSQQATKIFGSGDLVDGIKLAFDPTGNSVTNGGT 97 (607) T ss_pred CCccCCceEEEEecCcCcCCCCCCcceEEEEEEeCCCCCceEEEEcchhHHHHhhcCcchHHHHHHhhccccCCccCCce Confidence 2346999999999 57889999999999999999999999999999999999999663 34556777788888999999 Q ss_pred EEEEEcCCcccccccccccccceeeeecccccccccceeeecccccccccccccccccc-cccceeeeccccccccccee Q lcl|NC_019538. 78 LRTVRILDEDTARNSSPFFETIDYTITSPGVDYRIGDDVKILQNGATITTGKVSALNSV-GGITFVRFSTAEVVKKAKEL 156 (678) Q Consensus 78 ~~vvRv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~a~~~~~a~~~ 156 (678) ||+|||.+... ++..++.+..+....+ .+++.+.+.-. .+.+..+....... .+.....-+-... +. T Consensus 98 ~~~~rv~~~~~---a~~~~~~~~~~~~~~~---~~~~~i~~~l~-~~~~~~~~~~~~~~~d~~~~~~~n~g~~-----~~ 165 (607) T protein:vir:10 98 VYALRVDNAKQ---ASLVKDGLTFTSSIFG---TNANQVSVALD-NDVFGVPRITVNYSPDNYERTYTNIGQM-----FS 165 (607) T ss_pred EEEEeCCCccc---cceecccccccccccc---cCCCceEEEEE-ecCCCccceeEEeecccceeeeeeccce-----ee Confidence 99999955432 3333333333333333 44555554331 11111111111000 0000000000000 00 Q ss_pred ecccccccceeeeeeecccccccccceeeeeeccccceeeecccccccccccccccccchhcccccc-------ccceee Q lcl|NC_019538. 157 NDYPALQNGWQIQFTSGGPGSGQSATAVLNGIRQDSKIYIRNDEYSRESLLRRDETTETYIDMCESY-------GIPVVA 229 (678) Q Consensus 157 ~~~~~~~~~~~~~~~s~~~~~g~~a~~~~~~~~~~~~i~~~~~~~a~~~~~~~~~~~~~~~~~~~~~-------~~~~i~ 229 (678) ..+............ ..+ .+.....++........ ...+...............-.+.+...+ +...+. T Consensus 166 i~y~g~~~~a~~~v~-~~~-~g~~~~lt~~~~~~~~~--~~~V~~~~l~~~~~~t~~~l~~din~~~~~~A~~~g~~~i~ 241 (607) T protein:vir:10 166 ITYSGKSASAGYTVS-HDT-DGKAILLTLGSGDSIDK--LTNVATFDLTMSKYDTIAKLMQAISATPNFSASVVGSPSVN 241 (607) T ss_pred cccCcccccccceee-ecC-CCceeEEEecCCCccce--eeeeecccccccccchHHHHHHHhhcCCceEEEEeccccee Confidence 000000000000000 000 01111111111000000 0000000000000000000000000000 011122 Q ss_pred eccccccccceeEEeccccccccccccccccccccccccccccccceeeeeccccccccccccccceeeeccCCeeeeee Q lcl|NC_019538. 230 SRYAGLTGDNIQVAFIAYKDYYKFGVDGKISSVNTVNLKTFPSGLSFGNITPSSYLEYGPQTKDQFAMIVFVGGSAVESR 309 (678) Q Consensus 230 A~~~G~~gn~i~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~ 309 (678) +.+.+..++.+.+.+..... . .... +... .... .++......++. .+. T Consensus 242 tky~d~~~~~i~V~~~~~iv--~------------a~~~--------------D~~~-~~~~-~~~~~~t~~~~~-~~~- 289 (607) T protein:vir:10 242 TSYLDEVTSPVDVKTAPAVV--T------------AKIG--------------DAIS-KLGY-DPYVVVTQTSNN-KPI- 289 (607) T ss_pred eeccccccceeEEEEeeeee--c------------hhhh--------------hhhh-cccc-cceEEeeecccc-hhh- Confidence 22222222222221110000 0 0000 0000 0000 000000000000 000 Q ss_pred eeeccccccccccchhhhhhhhcCCcceEEEEecCCCccccceeeeeccCcCCccccchhHHHhhhhhhhccchhccccc Q lcl|NC_019538. 310 ILSVKENDRDIYGSSIYVDEFFINGYSTFIQGVAESWPTEYSGILTFGGGNSGNSTASAGDWIEGWDMFSDREHVDVNLF 389 (678) Q Consensus 310 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~l~gg~dg~~~~~~~~~~~~~~~~~~~~~~~~~~~ 389 (678) ......+ ..+.. ..........++. .....|+||.||... .++...++.+...+ .+.+ T Consensus 290 ~~~~~~~-------~~~~~--------~~~~~~~~~~~a~-~a~~~LtGGtdG~~~---~ty~dal~aLe~~e---~~~i 347 (607) T protein:vir:10 290 VNGVSAG-------TGSAT--------ASVTTAPESFPAN-FDTAFLTGGSTGDVP---VSWADKFNGAIGNN---VYYI 347 (607) T ss_pred hhhhhcc-------cccee--------eeeeccccccccc-cceeeeeCCCCCCch---hhHHHHHHHHhhcC---ceEE Confidence 0000000 00000 0000011111111 223568999999653 34555666665543 2333 Q ss_pred cccccccCcccchhHHHHHHHHHHHhcCC----eEEEEccccchhccccccCCHHHHHHHHhcccccccchhhccccccc Q lcl|NC_019538. 390 IAGSCAGEGVEIASTVQKSVAAICDERQD----CLGWISPPREYMVNLPVATAVKKMVEWRRGVTDSGVVVDDNMNIGTT 465 (678) Q Consensus 390 ~~~~~~~~~~~~~~~v~~~l~~~~~~~~~----~~~i~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~ 465 (678) .+. +..++++.++.+||+++++ +++++..+ .+.+++++.++... +++. T Consensus 348 ~~~-------t~d~ai~~~l~a~vkr~~~~g~~~~aVlg~~--------~~~t~~~~~t~a~~-------------~N~e 399 (607) T protein:vir:10 348 IPL-------TSEENIHAELQAFIDEQHVLGYNYHAFVGGG--------FAEPLEQILSRQVN-------------INDS 399 (607) T ss_pred Eec-------CCCHHHHHHHHHHHHHHHhCCCcEEEEecCC--------CCCCHHHHHHHHHh-------------hCCC Confidence 221 2346789999999988875 77776654 45678888877653 5567 Q ss_pred eEEEEcCeEEEecccCCceeEech---HHHHHHHHHHHhhcCCceECcCCcchhheeecccceecCChhhhhhhhhCCcE Q lcl|NC_019538. 466 YSSTSANYKLQYDKYNDTNRWIPL---SADMAGLCARTDTVAQPWMSPAGFNRGQILDVRKLAIETRQAHRDELYQNSMN 542 (678) Q Consensus 466 ~~~~~~p~~~v~d~~~~~~~~~pp---s~~vAG~~a~~d~~~g~~~sPan~~~~~v~~~~~~~~~~~~~e~~~L~~~gIn 542 (678) +..++.|+.++.| .+..+..|+ ++++||++|.++ +.+||.|+.+. . .++...+++.|++.|.++|+. T Consensus 400 rvv~V~~~~~~~~--~G~~~~~~~~~~Aa~vAGl~Ag~~----~~~SlT~k~i~-~---~~v~~~lt~~e~e~ai~~Gv~ 469 (607) T protein:vir:10 400 RFGLVGQSGHVQE--GGESVHVPAYLMAAYVGGLSSSLG----VAVPITNKKLA-L---VDLDQNFSGDDLNTLNQNGVI 469 (607) T ss_pred cEEEEecCeeEee--CCcceeccHHHHHHHHHHHHhcCc----cccCcccceec-c---ccccccCCHHHHHHHHhCCeE Confidence 8888999887765 355566665 688999999876 78899998764 2 356678999999999999999 Q ss_pred EEEEecC----CcEEEeccccC--CCCccccceeehhhHHHHHHHHHHHHH-HHHhcCCCCHHHHHHHHHHHHHHHHHHH Q lcl|NC_019538. 543 PVVGFPG----QGFILYGDKTM--SLQPTPFDRINVRRLFNLLKKSISESA-KYKLFENNDAFTRNSFRSEVNSYLDSIK 615 (678) Q Consensus 543 ~i~~~~~----~G~~~wG~rT~--~~~~~~~~~i~vrR~~~~i~~si~~~~-~~~vfepn~~~~~~~v~~~i~~~L~~l~ 615 (678) ++...++ ++++++.+.|. ..++..|++|+++|++|+|.+.|++.+ ++|++++|++..|.+++..+..||..+| T Consensus 470 ~l~~~~~~~~~~~vrIv~~ItT~t~~~~~~~~~i~viRv~D~i~~dir~~~~~~yIGk~nnd~~~~~vk~~i~~~L~~~~ 549 (607) T protein:vir:10 470 GIEHLVNRNATGGYYIVQDVSTNTVSSSHVDGSLYLGELTDFLFDNLRFVLRDTYIGSNIRSTSADDIKSTVASYLYSEM 549 (607) T ss_pred EEEEccCccccceEEEeeeeeeccCCCCcchheeehhhhHHHHHHHHHHHHhhcCCcccCCcchHHHHHHHHHHHHHHHH Confidence 9976554 36888777765 234568999999999999999999886 4899999999999999999999997665 Q ss_pred h--cCCeeeeEEEEccCCCCHHHhhCCEEEEEEEEEecCCceEEEEEEEEeecCceeeecccc Q lcl|NC_019538. 616 S--LGGIYDFRVVCDETNNTPAVIDRNEFVATILIKPARSINYVSLNFSAVGTSANFDELVGP 676 (678) Q Consensus 616 ~--~gal~g~~V~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~~~ 676 (678) + .|+|.+|..+ +-+-..+..+++|++.++|+.++|+|.+++.......+-++-.-. T Consensus 550 l~~~gaI~df~~e-----dv~v~~~~D~v~v~~~v~Pv~~iekIyvtv~v~~~~~~~~~~~~~ 607 (607) T protein:vir:10 550 NNDDGLIVDFSES-----DIVVTISGTVVYIQFAVAPTQEIKNIVVSGTYSNYSATSEDNTTK 607 (607) T ss_pred HHhcCceeCCCcc-----ccEEeeCCCEEEEEEEEEEcccceEEEEEEEEEEEEEeeccCCCC Confidence 4 6899998421 112223456899999999999999999999888877665543222 No 43 >protein:vir:102957 Length: 437 # NCBI annotation: sheath tail protein # Family: family:all:632 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945292;genbank:gi:39653727;uniprot:Q708M0;genbank:GeneID:2672871 Probab=100.00 E-value=6e-52 Score=301.40 Aligned_cols=417 Identities=12% Similarity=0.153 Sum_probs=270.2 Q ss_pred CceecCceEEEEcC-CCcccccCCccceeEEecccCCCCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHHhcCCeEE Q lcl|NC_019538. 1 MALLSPGVESKENN-MQTTIARSSTGRAALAGKFQWGPAYQISQLVSETDLIDRFGRPDNQTADSVLSAINFLKYGNDLR 79 (678) Q Consensus 1 ~~~~~PGVyveEv~-~~~~i~~v~tsv~afvG~~~~Gpv~~pv~i~s~~~~~~~FG~~~~~~~~~~~v~~~f~ngG~~~~ 79 (678) ....-|||||||++ +.++|++++|+++||+|.++|||+++|++|+||.||++.||.... +..+.+..+|++||++|| T Consensus 9 ~~k~~PGvYi~~~~~~~~~i~~~~~~~~a~~~~~~~Gp~~~~~~i~s~~d~~~~fG~~~~--~~~~~~~~~~~~g~~~~~ 86 (437) T protein:vir:10 9 QNKVRPGAYINVKSKDIAMTRLGGDGVVTVPLALSFGQSKKLMKIRRGEDLFKKLGYEQE--SPQLLLLNEAFKRVSEVL 86 (437) T ss_pred cceecCceeEEEecCCcceeeccCCcEEEEEEEecCCCCceeEEEecHHHHHHHcCCccc--hhHHHHHHHHhcCCCEEE Confidence 77889999999995 778999999999999999999999999999999999999997543 445556677789999999 Q ss_pred EEEcCCcccccccccccccceeeeecccccccccceeeecccccccccccccccccccccceeeecccccccccceeecc Q lcl|NC_019538. 80 TVRILDEDTARNSSPFFETIDYTITSPGVDYRIGDDVKILQNGATITTGKVSALNSVGGITFVRFSTAEVVKKAKELNDY 159 (678) Q Consensus 80 vvRv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~a~~~~~~ 159 (678) ++|+.++. +++..+.+.+ T Consensus 87 ~~R~~~g~--~a~~tl~~~~------------------------------------------------------------ 104 (437) T protein:vir:10 87 LYRLNTGE--KANVSLSDNV------------------------------------------------------------ 104 (437) T ss_pred EEECCCCc--eeeEeeccce------------------------------------------------------------ Confidence 99995421 0000000000 Q ss_pred cccccceeeeeeecccccccccceeeeeeccccceeeecccccccccccccccccchhccccccccceeeeccccccccc Q lcl|NC_019538. 160 PALQNGWQIQFTSGGPGSGQSATAVLNGIRQDSKIYIRNDEYSRESLLRRDETTETYIDMCESYGIPVVASRYAGLTGDN 239 (678) Q Consensus 160 ~~~~~~~~~~~~s~~~~~g~~a~~~~~~~~~~~~i~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~i~A~~~G~~gn~ 239 (678) .+.|.++|.|||. T Consensus 105 -------------------------------------------------------------------~~~A~~~G~~gn~ 117 (437) T protein:vir:10 105 -------------------------------------------------------------------TAQAKYSGVRGND 117 (437) T ss_pred -------------------------------------------------------------------EEEeccCCcccce Confidence 1245667777777 Q ss_pred eeEEeccccccccccccccccccccccccccccccceeeeeccccccccccccccceeeeccCCeeeeeeeeeccccccc Q lcl|NC_019538. 240 IQVAFIAYKDYYKFGVDGKISSVNTVNLKTFPSGLSFGNITPSSYLEYGPQTKDQFAMIVFVGGSAVESRILSVKENDRD 319 (678) Q Consensus 240 i~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~ 319 (678) +++.+....+... .+.+.+..+....+...+....+.. T Consensus 118 i~v~v~~~~~d~~-----------------------------------------~~~v~~~~~~~~~d~~~v~~~~~~~- 155 (437) T protein:vir:10 118 ITVTVKTNVDDPS-----------------------------------------SFDVVTFLDTVVMDLQTVKVLADLK- 155 (437) T ss_pred eEEEEeeccCCcc-----------------------------------------ceEEEEecCcceeeeeehhhhhhhh- Confidence 7665443211000 0000111111111111111000000 Q ss_pred cccchhhhhhhhcCCcceEEEEecCCCccccceeeeeccCcCCccccchhHHHhhhhhhhccchhccccccccccccCcc Q lcl|NC_019538. 320 IYGSSIYVDEFFINGYSTFIQGVAESWPTEYSGILTFGGGNSGNSTASAGDWIEGWDMFSDREHVDVNLFIAGSCAGEGV 399 (678) Q Consensus 320 ~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~l~gg~dg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 399 (678) ... ++.......++ ......|+||.||. .+..+|..+++.++.. .++.++++. T Consensus 156 ---~n~------------~v~~~~~~~l~-~~a~~~LtGG~dg~--~t~~dy~~al~~le~~---~~n~l~~~~------ 208 (437) T protein:vir:10 156 ---NNA------------LVEFSGTGELQ-PVAGAKLTGGTDGA--ISTQDYLEYFKALETV---EFNYMALPV------ 208 (437) T ss_pred ---hhc------------ccccccccccc-cccceeeeccccCC--CChhHHHHHHHHhccC---cceEEEecC------ Confidence 000 00000011111 12235788999985 3566788777776543 455555432 Q ss_pred cchhHHHHHHHHHHHhcCCe-----EEEEccccchhccccccCCHHHHHHHHhcccccccchhhccccccceEEEEcCeE Q lcl|NC_019538. 400 EIASTVQKSVAAICDERQDC-----LGWISPPREYMVNLPVATAVKKMVEWRRGVTDSGVVVDDNMNIGTTYSSTSANYK 474 (678) Q Consensus 400 ~~~~~v~~~l~~~~~~~~~~-----~~i~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~ 474 (678) ...++++++.+||+++++. .+++..+ ..+. ....-+.+-. T Consensus 209 -~d~~~~t~~~~~ik~~r~~~g~~~~~V~~~~---------~~d~-------------------------e~Iin~~n~~ 253 (437) T protein:vir:10 209 -EDASIKKAAINFIKRMREDEGLGAQLVVADS---------DADS-------------------------EAVINVKNGV 253 (437) T ss_pred -CChhHHHHHHHHHHHHHhccCceEEEEeCCC---------CCCC-------------------------ceEEEeecce Confidence 2346788899999887642 2333221 0000 0111111111 Q ss_pred EEecccCCceeEechHHHHHHHHHHHhhcCCceECcCCcchhheeecccceecCChhhhhhhhhCCcEEEEEecCCcEEE Q lcl|NC_019538. 475 LQYDKYNDTNRWIPLSADMAGLCARTDTVAQPWMSPAGFNRGQILDVRKLAIETRQAHRDELYQNSMNPVVGFPGQGFIL 554 (678) Q Consensus 475 ~v~d~~~~~~~~~pps~~vAG~~a~~d~~~g~~~sPan~~~~~v~~~~~~~~~~~~~e~~~L~~~gIn~i~~~~~~G~~~ 554 (678) ...| ....--.-..+++||++|.+ ++++|+.|+.+.++ ..+...+++.|++.|.++|+.++.+..++-+++ T Consensus 254 ~~~~--~~~~~~~~~~a~vAG~~Ag~----~~~~S~t~~~~~~~---~~v~~~~t~~e~~~~i~~G~~vl~~~~~~v~i~ 324 (437) T protein:vir:10 254 ILSD--KTVIDKTKATVWVAAASANA----GVEKSLTYEKYEDS---VDVVGRLSHTETEDALLKGQFVFTARRGRAVVE 324 (437) T ss_pred eecC--cceechhhHHHHHHHHhccC----ccccCccccccCCc---ccccccCCHHHHHHHHhCCcEEEEEeCCeEEEE Confidence 1111 00011122357889999877 57889999776554 456668999999999999999997754333445 Q ss_pred eccccCC----CCccccceeehhhHHHHHHHHHHHHHH-HHhcC-CCCHHHHHHHHHHHHHHHHHHHhcCCeeeeEEEEc Q lcl|NC_019538. 555 YGDKTMS----LQPTPFDRINVRRLFNLLKKSISESAK-YKLFE-NNDAFTRNSFRSEVNSYLDSIKSLGGIYDFRVVCD 628 (678) Q Consensus 555 wG~rT~~----~~~~~~~~i~vrR~~~~i~~si~~~~~-~~vfe-pn~~~~~~~v~~~i~~~L~~l~~~gal~g~~V~~d 628 (678) +|-.|+. ..+..|++|.++|++|+|.+.|++.++ +|+++ |||...|..++..|..||++|+++|+|.+|.++.. T Consensus 325 ~gInTltt~~~~~~~~~~ki~vir~~D~i~~di~~~~~~~yiGk~~N~~~~r~~~~~~i~~yl~~l~~~g~I~~~~~~d~ 404 (437) T protein:vir:10 325 QDINSHVSFTIEKNQDFRKNRILRTLDDIVNDTRYAFSEYFLGKVSNNEDGRQAFKANRIRYFKDLEARGAIEDFKVEDI 404 (437) T ss_pred EccccccccCCCCCchhhhhhHHHHHHHHHHHHHHHHHhccccccCCCHHHHHHHHHHHHHHHHHHHhCCCccCCCceeE Confidence 7776753 334689999999999999999999877 59997 79999999999999999999999999999988766 Q ss_pred cCCCCHHHhhCCEEEEEEEEEecCCceEEEEEEEEe Q lcl|NC_019538. 629 ETNNTPAVIDRNEFVATILIKPARSINYVSLNFSAV 664 (678) Q Consensus 629 ~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~ 664 (678) ...+. .....+++.+.++|+.++|+|.+++.-. T Consensus 405 ~v~~~---~~~~~v~v~~~v~~~dame~iy~ti~v~ 437 (437) T protein:vir:10 405 EVLRG---ELKESVVVNVKVKPVDSMEKLYMTVTVE 437 (437) T ss_pred EeecC---CCCCEEEEEEEEEEeeeeeeEEEEEEec Confidence 44322 1346889999999999999999998644 No 44 >protein:vir:101326 Length: 529 # NCBI annotation: gp22 # Family: family:all:9453 # MgeID: mge:1512 # MgeName: P1 # Cross-refs: genbank:acc:YP_006525;genbank:gi:46401679;genbank:GeneID:2777386 Probab=100.00 E-value=1.3e-42 Score=250.30 Aligned_cols=487 Identities=14% Similarity=0.125 Sum_probs=313.4 Q ss_pred Cc-e-------ecCceEEEEcC--CCc-ccccCCccceeEEecccCCCCCccEEec--CHHHHHHHcCCcCccchhHHHH Q lcl|NC_019538. 1 MA-L-------LSPGVESKENN--MQT-TIARSSTGRAALAGKFQWGPAYQISQLV--SETDLIDRFGRPDNQTADSVLS 67 (678) Q Consensus 1 ~~-~-------~~PGVyveEv~--~~~-~i~~v~tsv~afvG~~~~Gpv~~pv~i~--s~~~~~~~FG~~~~~~~~~~~v 67 (678) |. | -..||.|.+++ .+. .-.|+++++.|+||.|+||++++|++|+ .|.+|+-.++++....+..+++ T Consensus 1 ~~~ysi~q~ig~aSGvav~pi~~d~t~~~~~g~g~~v~a~Vgif~RG~i~k~~~Vt~~n~~~~LGep~~~~~ga~~E~~~ 80 (529) T protein:vir:10 1 MSQYSIQQSLGNASGVAVSPINADATLSTGVALNSSLWAGIGVFARGKPFTVLAVTESNYEDVLGEPLKPSSGSQFEPIR 80 (529) T ss_pred CCceehhhhhhhhcccccCCcCcccccchheecCceEEEEEEEeecCCCcceEEEchhHHHHHhccccCCCcchhhhhHh Confidence 32 2 25799999984 443 3578899999999999999999999999 7999999999999999999999 Q ss_pred HHHHHhcCCeEEEEEcCCcccccccccccccceeeeecccccccccceeeecccccccccccccccccccccceeeeccc Q lcl|NC_019538. 68 AINFLKYGNDLRTVRILDEDTARNSSPFFETIDYTITSPGVDYRIGDDVKILQNGATITTGKVSALNSVGGITFVRFSTA 147 (678) Q Consensus 68 ~~~f~ngG~~~~vvRv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a 147 (678) ..|+.-+++.|||||++..+ ++... +.+..... .....+.. T Consensus 81 h~~eA~~~~s~yVVRvv~~d-ak~p~----------------------i~~~~~~~---------------~~~s~~~~- 121 (529) T protein:vir:10 81 HVYEAIQQTSGYVVRAVPDD-AKFPI----------------------IMFDESGE---------------PAYSALPY- 121 (529) T ss_pred hhhhhhcCCceEEEEEcccc-cCCce----------------------EEecCCcc---------------chhhcccc- Confidence 99999888889999987543 11110 00000000 00000000 Q ss_pred ccccccceeecccccccceeeeeeecccccccccceeeeeeccccceeeecccccccccccccccccchhccccccccce Q lcl|NC_019538. 148 EVVKKAKELNDYPALQNGWQIQFTSGGPGSGQSATAVLNGIRQDSKIYIRNDEYSRESLLRRDETTETYIDMCESYGIPV 227 (678) Q Consensus 148 ~~~~~a~~~~~~~~~~~~~~~~~~s~~~~~g~~a~~~~~~~~~~~~i~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~ 227 (678) .....+. . +.... ....++ . T Consensus 122 ---------s~~~~l~-------------~--G~~~~--iy~~Dg---------------------d------------- 141 (529) T protein:vir:10 122 ---------GSEIELD-------------S--GEAFA--IYVDDG---------------------D------------- 141 (529) T ss_pred ---------ccccccc-------------c--cceEE--EEEecC---------------------c------------- Confidence 0000000 0 00000 000000 0 Q ss_pred eeeccccccccceeEEeccccccccccccccccccccccccccccccceeeeeccccccccccccccceeeecc-CCeee Q lcl|NC_019538. 228 VASRYAGLTGDNIQVAFIAYKDYYKFGVDGKISSVNTVNLKTFPSGLSFGNITPSSYLEYGPQTKDQFAMIVFV-GGSAV 306 (678) Q Consensus 228 i~A~~~G~~gn~i~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~-~~~~~ 306 (678) ++ ..++-++.+. ..+... .. .+ .....+...+.. .+..+ T Consensus 142 -----~~-~s~~~~l~i~-~~~ads---------------------~g------~e------~~~l~~~~~~~~g~~~~l 181 (529) T protein:vir:10 142 -----PC-ISPTRELTIE-TATADS---------------------AG------NE------RFLLKLTQTTSLGVVTTL 181 (529) T ss_pred -----Cc-cCCceEEEEE-eecccc---------------------CC------Cc------cceeeEEEEeecCCceEE Confidence 00 0011111110 000000 00 00 000111222222 24566 Q ss_pred eeeeeeccccccccccchhhhhhhhcCCcceEEEEecC----CCccccceeeeeccCcCCccc-cchhHHHhhhhhhhcc Q lcl|NC_019538. 307 ESRILSVKENDRDIYGSSIYVDEFFINGYSTFIQGVAE----SWPTEYSGILTFGGGNSGNST-ASAGDWIEGWDMFSDR 381 (678) Q Consensus 307 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~----~~~~~~~~~~~l~gg~dg~~~-~~~~~~~~~~~~~~~~ 381 (678) |+|..++..+..+..+...|+...+......++.-... ..+-......++++|.||..+ ..+.+|..++.++... T Consensus 182 et~~~sl~~~a~dd~G~~~yl~svle~~s~~l~ai~~~e~~~t~~~~t~~d~~f~~GtdG~~~~i~s~~y~~A~~~L~n~ 261 (529) T protein:vir:10 182 ETHTVSLAEEAKDDMGRLCYLPTALEARSKYLRAVVNEELISTAKVTNKKSLAFTGGTNGDQSKISTAAYLRAVKVLNNA 261 (529) T ss_pred EEEEeeeeechhhhcCCccchhHHHhhccCceeeeeeeccccccchhhhhhhhccCCccccccccchHHHHHHHHHhcCC Confidence 89999999999988888888877766544443321111 111111234588999998653 4667898888887644 Q ss_pred chhccccccccccccCcccchhHHHHHHHHHHHhc-CCeEEEEccccchhccccccCCHHHHHHHHhcccccccchhhcc Q lcl|NC_019538. 382 EHVDVNLFIAGSCAGEGVEIASTVQKSVAAICDER-QDCLGWISPPREYMVNLPVATAVKKMVEWRRGVTDSGVVVDDNM 460 (678) Q Consensus 382 ~~~~~~~~~~~~~~~~~~~~~~~v~~~l~~~~~~~-~~~~~i~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 460 (678) .++...++...+ ..+++..+|+.+|+++ ++|| .|. ++..+++++++|.++....... T Consensus 262 -p~d~~~il~~g~------y~~a~I~~L~~ic~~~~~d~f--~DV--------~~~LT~~aA~~~~e~~gl~~~~----- 319 (529) T protein:vir:10 262 -PYMYTAVLGLGC------YDNAAITALGKICADRLIDGF--FDV--------KPTLTYAEALPAVEDTGLLGTD----- 319 (529) T ss_pred -cceeeeeeccCC------ccHHHHHHHHHHHhhhhhcEE--EcC--------CCCcCHHHHHHHHHhcCccccC----- Confidence 334444443322 2357789999999654 5554 254 5678999999999876542211 Q ss_pred ccccceEEEEcCeEEEecccCCceeEechHHHHHHHHHHHhhcCC---------ceECcCCcchhheeeccc--ceecCC Q lcl|NC_019538. 461 NIGTTYSSTSANYKLQYDKYNDTNRWIPLSADMAGLCARTDTVAQ---------PWMSPAGFNRGQILDVRK--LAIETR 529 (678) Q Consensus 461 ~~~s~~~~~~~p~~~v~d~~~~~~~~~pps~~vAG~~a~~d~~~g---------~~~sPan~~~~~v~~~~~--~~~~~~ 529 (678) .+.. ..+||||. ..||.++....+++||. |+.+..|| +|++|||..+..|.. .+ +-+..+ T Consensus 320 ~~~~--s~y~~P~~-~~D~~tg~k~~~GlsG~-----A~~akargv~~na~v~g~hY~pAGe~r~~inr-~~I~~ly~~d 390 (529) T protein:vir:10 320 YVSC--SVYHYPFS-CKDKWTQSRVVFGLSGV-----AYAAKARGVKKNSDVGGWHYSPAGEERAVIAR-ASIQPLYPED 390 (529) T ss_pred ceee--EEEEccee-eccccccCceeeCCCcc-----eeeccccceeecccccccccccCCCccceeec-ccceeccCCC Confidence 1111 35889998 88999999999999994 44444555 599999998754432 32 334566 Q ss_pred hhhhhhhhhCCcEEEEEecCCcEEEeccccCCCCccccceeehhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHH Q lcl|NC_019538. 530 QAHRDELYQNSMNPVVGFPGQGFILYGDKTMSLQPTPFDRINVRRLFNLLKKSISESAKYKLFENNDAFTRNSFRSEVNS 609 (678) Q Consensus 530 ~~e~~~L~~~gIn~i~~~~~~G~~~wG~rT~~~~~~~~~~i~vrR~~~~i~~si~~~~~~~vfepn~~~~~~~v~~~i~~ 609 (678) +.|...|-.++||++..-.++++.+-.+-|+...++.|||+|+++|+++|++.+.+..+|.+|||++..+|. ++..++. T Consensus 391 ~~e~~~lv~~riNPV~~~~~g~~~idDsLt~~~knny~R~~hv~~lmn~I~~~~~k~a~~~~~~Pd~it~~g-l~~~l~~ 469 (529) T protein:vir:10 391 TPDEEAMVKGRLNKVSVGTSGQMIIDDALTCCTQDNYLHFQHVPSLMNAISRFFVQLARQMKHSPDGITAAG-LTKGMTK 469 (529) T ss_pred ccCHHHHHhhccCeeeeeccCcceeeeeeceeeeCCchhhhhHHHHHHHHHHHHHHHHHHHhhCCChHHHHH-HHHhHHH Confidence 677778888889888765555544433334443467999999999999999999999999999999999987 9999999 Q ss_pred HHHHHHhcCCeee-----------eEEEEccCCCCHHHhhCCEEEEEEEEEecCCceEEEEEEEEee Q lcl|NC_019538. 610 YLDSIKSLGGIYD-----------FRVVCDETNNTPAVIDRNEFVATILIKPARSINYVSLNFSAVG 665 (678) Q Consensus 610 ~L~~l~~~gal~g-----------~~V~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~ 665 (678) +|..+|+.|+|.+ |++++ +|. +.++|.+++.++|.-.+.+|...-.-.+ T Consensus 470 ~L~r~~asgalv~prdp~~~G~epy~~~V-----~q~--d~D~~~v~~~~~ptGv~Rri~~~p~l~~ 529 (529) T protein:vir:10 470 LLDRFVASGALVAPRDPDADGTEPYVLKV-----TQA--EFDKWEVVWACCPTGVARRIQGVPLLIK 529 (529) T ss_pred HHHHHHhcCceecccCccCCCCCceEEEE-----eec--ccCeEEEEEEeecCCceeeEEeeeeecC Confidence 9999999999975 77776 343 3489999999999999999987633333 No 45 >protein:vir:105470 Length: 451 # NCBI annotation: putative phage sheath tail protein # Family: family:all:632 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529880;genbank:gi:90592620;genbank:GeneID:3974534 Probab=100.00 E-value=6.9e-41 Score=240.78 Aligned_cols=423 Identities=17% Similarity=0.161 Sum_probs=258.2 Q ss_pred CceecCceEEEEcC-CCcccccCCccceeEEecc-cCCCCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHHhcCCeE Q lcl|NC_019538. 1 MALLSPGVESKENN-MQTTIARSSTGRAALAGKF-QWGPAYQISQLVSETDLIDRFGRPDNQTADSVLSAINFLKYGNDL 78 (678) Q Consensus 1 ~~~~~PGVyveEv~-~~~~i~~v~tsv~afvG~~-~~Gpv~~pv~i~s~~~~~~~FG~~~~~~~~~~~v~~~f~ngG~~~ 78 (678) +.=.-|||||||++ +.++|.|++|++++|+|.+ .||| ++|+.|.|+.||++.||..... ..+....+|++||++| T Consensus 9 ~~K~~PGvYi~~~~~~~~~~~~~~~~~~~~i~~~~~~g~-~~~v~i~~~~d~~~~fG~~~~~--~~~~~~~~~~~g~~~v 85 (451) T protein:vir:10 9 QDKRRPGTYINVVGNGQREAASSLGRVLLIRDKGLGWGK-NGVIEVEANSDFTKKLGTTLDD--PSLTALKETLKGASKV 85 (451) T ss_pred ceeecCceEEEEeccCcceeeccCCcEEEEEeeecCCCC-cccEEeecHHHHHHHcCCcccc--hhHHHHHHHhcCCcEE Confidence 55578999999995 7899999999999999965 5666 7899999999999999975433 3333445566799999 Q ss_pred EEEEcCCcccccccccccccceeeeecccccccccceeeecccccccccccccccccccccceeeecccccccccceeec Q lcl|NC_019538. 79 RTVRILDEDTARNSSPFFETIDYTITSPGVDYRIGDDVKILQNGATITTGKVSALNSVGGITFVRFSTAEVVKKAKELND 158 (678) Q Consensus 79 ~vvRv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~a~~~~~ 158 (678) |+.|+.+.. .+.++...+.+ T Consensus 86 ~~yrl~~g~-~a~~t~~~~~~----------------------------------------------------------- 105 (451) T protein:vir:10 86 LVLNPNEGT-AATLTKEGLPW----------------------------------------------------------- 105 (451) T ss_pred EEEEcCCCc-eEEEEeecCce----------------------------------------------------------- Confidence 999995431 10000000000 Q ss_pred ccccccceeeeeeecccccccccceeeeeeccccceeeecccccccccccccccccchhccccccccceeeecccccccc Q lcl|NC_019538. 159 YPALQNGWQIQFTSGGPGSGQSATAVLNGIRQDSKIYIRNDEYSRESLLRRDETTETYIDMCESYGIPVVASRYAGLTGD 238 (678) Q Consensus 159 ~~~~~~~~~~~~~s~~~~~g~~a~~~~~~~~~~~~i~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~i~A~~~G~~gn 238 (678) .+.|.++|.+|| T Consensus 106 --------------------------------------------------------------------~~~Aky~G~~Gn 117 (451) T protein:vir:10 106 --------------------------------------------------------------------TVTANYPGEKGN 117 (451) T ss_pred --------------------------------------------------------------------EEEEeeCCcCCc Confidence 123667777777 Q ss_pred ceeEEeccccccccccccccccccccccccccccccceeeeeccccccccccccccceeeeccCCeeeeeeeeecccccc Q lcl|NC_019538. 239 NIQVAFIAYKDYYKFGVDGKISSVNTVNLKTFPSGLSFGNITPSSYLEYGPQTKDQFAMIVFVGGSAVESRILSVKENDR 318 (678) Q Consensus 239 ~i~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~ 318 (678) .+++.+....+.. ..+.+.+..++..++........ T Consensus 118 ~i~v~v~~~~~d~-----------------------------------------~~~~v~t~~g~~~vd~qtv~~~~--- 153 (451) T protein:vir:10 118 QITVSVEVSPADQ-----------------------------------------NAATVSTIFGTKLVDEQSIKFNE--- 153 (451) T ss_pred eEEEEEecccCCc-----------------------------------------CceEEEEEECCeEEEEEEeeccc--- Confidence 7777553321100 00111112222222221110000 Q ss_pred ccccchhhhhhhhcCCcceEEEEec--CCCccccceeeeeccCcCCcc-ccchhHHHhhhhhhhccchhccccccccccc Q lcl|NC_019538. 319 DIYGSSIYVDEFFINGYSTFIQGVA--ESWPTEYSGILTFGGGNSGNS-TASAGDWIEGWDMFSDREHVDVNLFIAGSCA 395 (678) Q Consensus 319 ~~~~~~~~~~~~~~~~~~~~v~~~~--~~~~~~~~~~~~l~gg~dg~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 395 (678) ..+ ...+.|+.... .+.+ .......+++|.+|.. ..+..++...++.+ +.+..+.+.+ T Consensus 154 --------~~e---l~~nd~V~a~~~~~g~~-~~~~~~~l~~~~~gg~~~~~~~~~~~~l~~~---e~~~~n~l~~---- 214 (451) T protein:vir:10 154 --------LDK---FKGNDYITAKVVEEGSS-KPVAFTNVSGTLTGGTTTESNKVESLLNDAL---ENEEYAVVTT---- 214 (451) T ss_pred --------hhh---ccCCceEEEEecccccc-cceeeeecccccccccccCCccchHHHHHHh---ccceeeEEEE---- Confidence 000 00111222111 1111 1112233444433322 22334454444443 3344444433 Q ss_pred cCcccchhHHHHHHHHHHHhcCC-----eEEEEccccchhccccccCCHHHHHHHHhcccccccchhhccccccceEEEE Q lcl|NC_019538. 396 GEGVEIASTVQKSVAAICDERQD-----CLGWISPPREYMVNLPVATAVKKMVEWRRGVTDSGVVVDDNMNIGTTYSSTS 470 (678) Q Consensus 396 ~~~~~~~~~v~~~l~~~~~~~~~-----~~~i~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~ 470 (678) ++.+....++..+.+||+++|+ +.+++..+... . ++......+ T Consensus 215 -~~~~~~~~i~~~~~a~ik~~r~~~g~~~~aVl~~~~~~--------~-----------------------~d~egiinv 262 (451) T protein:vir:10 215 -AGFEPSSNMNKLVVEAVKRLRENEGRKVRGVIPTDADT--------T-----------------------YNYEGISTV 262 (451) T ss_pred -ccCCCchHHHHHHHHHHHHHHHhcCCeEEEEecCccCC--------C-----------------------CCCcceEEe Confidence 2333345678888899988753 34555432110 0 001111111 Q ss_pred cCeEEEecccCCceeEech---HHHHHHHHHHHhhcCCceECcCCcchhheeecccceecCChhhhhhhhhCCcEEEEEe Q lcl|NC_019538. 471 ANYKLQYDKYNDTNRWIPL---SADMAGLCARTDTVAQPWMSPAGFNRGQILDVRKLAIETRQAHRDELYQNSMNPVVGF 547 (678) Q Consensus 471 ~p~~~v~d~~~~~~~~~pp---s~~vAG~~a~~d~~~g~~~sPan~~~~~v~~~~~~~~~~~~~e~~~L~~~gIn~i~~~ 547 (678) .......| . +.+++ .+++||++|.+ ++.+|+.|+.+.+ +..+...+++.|++.+.++|..+++.. T Consensus 263 ~n~~~~~d----g-~~~~~~~~~~~vAG~~Ag~----~~~~S~T~~~~~~---~~~v~~~~t~~e~~~~i~~G~lvl~~~ 330 (451) T protein:vir:10 263 VNGYTLSD----G-TNVDVKDATGYFAGISASA----DVATSLTYFEVED---AVSAYPKFDNEKTIKALDAGQIVFTTR 330 (451) T ss_pred ecceEecC----c-eeechhhhHHHHHHHHccc----ccccCccceecCC---ceeeeeeCCHHHHHHHHhCCeEEEEEE Confidence 22222111 1 12233 48899999887 4778999977654 455667899999999999999988766 Q ss_pred cCCcEEE-eccccCC----CCccccceeehhhHHHHHHHHHHHHHHH-Hhc-CCCCHHHHHHHHHHHHHHHHHHHhcCCe Q lcl|NC_019538. 548 PGQGFIL-YGDKTMS----LQPTPFDRINVRRLFNLLKKSISESAKY-KLF-ENNDAFTRNSFRSEVNSYLDSIKSLGGI 620 (678) Q Consensus 548 ~~~G~~~-wG~rT~~----~~~~~~~~i~vrR~~~~i~~si~~~~~~-~vf-epn~~~~~~~v~~~i~~~L~~l~~~gal 620 (678) .++++++ +|-.|+. ..+..|+.|.++|++|+|.+.|++.... |++ .|||..-|..++..|..||.+|+++|+| T Consensus 331 ~g~~v~i~~~INTltt~~~~k~~~~~ki~vir~~D~i~~di~~~~~~~yiGk~~N~~~gr~~~~~~i~~yl~~l~~~g~i 410 (451) T protein:vir:10 331 PGQRVVIEQDINSLHKFTAEKPQAFSKNRVIRTLDEIATNTENTFERTYLGNVGNNAAGRDLFKADRIAYLTSLQNRNMI 410 (451) T ss_pred cCCeEEEEEccccceecCCCCCcchhhhhHHHHHHHHHHHHHHHhhhccceecCCCHHHHHHHHHHHHHHHHHHHhCCCc Confidence 6777665 7777763 2346799999999999999999999874 888 4699999999999999999999999999 Q ss_pred eeeEEEEccCCCCHHHhhCCEEEEEEEEEecCCceEEEEEEEEe Q lcl|NC_019538. 621 YDFRVVCDETNNTPAVIDRNEFVATILIKPARSINYVSLNFSAV 664 (678) Q Consensus 621 ~g~~V~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~ 664 (678) ..|... |.+. ..-.....+++.+.++|+..+|+|.+++.-. T Consensus 411 ~~~~~~-d~~v--~~~~~~~~v~v~~~v~pvdame~iy~t~~v~ 451 (451) T protein:vir:10 411 QSFANT-DITV--EAGNDMDSIVVNLAVTPVDAMEKLYMTMVVR 451 (451) T ss_pred cCCCcc-ceEE--eecCCCCEEEEEEEEEEEeeeeeEEEEEEEc Confidence 998632 2111 0111356799999999999999999998655 No 46 >protein:vir:107310 Length: 581 # NCBI annotation: gp123 # Family: family:all:1196 # MgeID: mge:1550 # MgeName: Catera # Cross-refs: genbank:acc:YP_656131;genbank:gi:109393333;genbank:GeneID:4156791 Probab=100.00 E-value=4.3e-39 Score=230.91 Aligned_cols=530 Identities=12% Similarity=0.033 Sum_probs=241.6 Q ss_pred EEEEcCCcccccccccccccceeeeecccccccccceeeecccccccccccccccccccccceeeecccccccccceeec Q lcl|NC_019538. 79 RTVRILDEDTARNSSPFFETIDYTITSPGVDYRIGDDVKILQNGATITTGKVSALNSVGGITFVRFSTAEVVKKAKELND 158 (678) Q Consensus 79 ~vvRv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~a~~~~~ 158 (678) .-||+-.+.++. .+...++. ... |-+-++ ..+. ....+.. + .....-.... T Consensus 1 ~~~~~~~~~~~~---~~~~~~~~--~~~------g~~~~~--~~~~----~i~g~~~-g-----------~~g~~~s~~~ 51 (581) T protein:vir:10 1 MAIDFSQYQTPG---VYTEAVGA--PQL------GIRSSV--PTAV----AIFGTAV-G-----------YQTYRESIRI 51 (581) T ss_pred Ceeeeccccccc---hhhhhccc--ccc------ceeeee--cccc----ccccccc-c-----------cccccccccc Confidence 223332222111 11000000 000 000000 0000 0000000 0 0000000000 Q ss_pred ccccccceeeeeeeccc-ccccccceeeeeeccccceeee----ccccccccccc---------ccccccchhccccccc Q lcl|NC_019538. 159 YPALQNGWQIQFTSGGP-GSGQSATAVLNGIRQDSKIYIR----NDEYSRESLLR---------RDETTETYIDMCESYG 224 (678) Q Consensus 159 ~~~~~~~~~~~~~s~~~-~~g~~a~~~~~~~~~~~~i~~~----~~~~a~~~~~~---------~~~~~~~~~~~~~~~~ 224 (678) +|........+...... ..++.-++...+..+. .+... ....+...+.. ..........+..... T Consensus 52 ~p~~~~~~e~q~v~~~~~~t~GtFtLsf~G~tT~-~I~~~asa~~v~~AL~~L~~i~~~~v~v~g~~g~~~~VtF~g~~~ 130 (581) T protein:vir:10 52 NPDTGETITTQILALVGEPTGGSFKLSLAGEPTG-NIPFNATQGQVQSALRALPNVEDDEVTVLGDPGGPWTVTFTKAVA 130 (581) T ss_pred CCCCCCccceEEEEEEecCCCceEEEEeCceecc-cccccCCHHHHHHHHhccCCCCcceEEEECCCCceEEEEEcCCcc Confidence 11000000000000000 0000000001100000 00000 00000000000 0000000000000000 Q ss_pred cceeeecccccc-ccceeEEecccccccccc-cccc---cccccccccc---c-cccccceeeeecccc--ccccccccc Q lcl|NC_019538. 225 IPVVASRYAGLT-GDNIQVAFIAYKDYYKFG-VDGK---ISSVNTVNLK---T-FPSGLSFGNITPSSY--LEYGPQTKD 293 (678) Q Consensus 225 ~~~i~A~~~G~~-gn~i~v~v~~~~~~~~~~-~~~~---~~~~~~~~~~---~-~~~~~~~~~~~~~~~--~~~~~~~~~ 293 (678) . +.+...... +....+.+.....-.... .... +......... . ......+........ ........+ T Consensus 131 ~--l~~~~~~lt~g~~~~vtV~~~~~g~~~~~~~~s~~gi~~~~~~l~~~~~~~~~~~gsd~~~~~~~~~~~~~~~~~~D 208 (581) T protein:vir:10 131 A--LTKDVTGLTGGDDPDLNIASEQTGVPAMNRALAKKGIKTDTIRVVNPNSGQVYVLGTDYVVTRVNAGEDGEANTRDD 208 (581) T ss_pred c--eeeeeceecCCCceeEEEeccccCcccccccccccccccccccccccccCcceeccccceeeecccCcccccccccc Confidence 0 000000000 011111111000000000 0000 0000000000 0 000000000000000 000000000 Q ss_pred cceeeeccCCeeeeeee---eeccccccccc---------cchhhh---hhhhcCCcceEEEEecCCCccccceeeeecc Q lcl|NC_019538. 294 QFAMIVFVGGSAVESRI---LSVKENDRDIY---------GSSIYV---DEFFINGYSTFIQGVAESWPTEYSGILTFGG 358 (678) Q Consensus 294 ~~~~~v~~~~~~~~~~~---~~~~~~~~~~~---------~~~~~~---~~~~~~~~~~~v~~~~~~~~~~~~~~~~l~g 358 (678) -...+...++.+.++.. ++.+..++.+. ....++ .+...+..+.+..... ..........|++ T Consensus 209 ~~t~~~~~~g~~~~~~~v~~~~~~~~d~~~~~~v~~~~~~~~~~~~~~~~~~~g~~~~~~t~~~~--~~~tn~~~~~l~~ 286 (581) T protein:vir:10 209 LYTIQRVVDGGHIDPGDIVQLSYRYTDPNYHEVIRFTDPDDIQDFYGPAFDEAGNVQSEITLCAQ--LAITNGASTILAC 286 (581) T ss_pred ceeeeeeecccccccceEEEEEEEeecCCcceeEEeecCcchhhhhhhhhhccCccccchhhhhe--eeeecccceeEEe Confidence 01122212222221110 11111111000 000000 0000011111111000 0011122345667 Q ss_pred CcCCcc-ccchhHHHhhhhhhhccchhccccccccccccCcccchhHHHHHHHHHHHhcC----CeEEEEccccchhccc Q lcl|NC_019538. 359 GNSGNS-TASAGDWIEGWDMFSDREHVDVNLFIAGSCAGEGVEIASTVQKSVAAICDERQ----DCLGWISPPREYMVNL 433 (678) Q Consensus 359 g~dg~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~l~~~~~~~~----~~~~i~d~p~~~~~~~ 433 (678) |.++.. .++.++|..+++++...+...+ ++|. ++..+++.+|.+||+++. .+.+++..+. . T Consensus 287 gvd~~g~tvt~~dy~~Al~ale~~~~~~i--vv~~-------t~~~~v~a~l~ahv~~~s~~~~~~ravigV~g-----~ 352 (581) T protein:vir:10 287 AVDPEGDTVTMGDYQNALNKFRDEDEIAI--IVAG-------TGAQPIQALVQQHVSAQSNNKYERRAILGMDG-----S 352 (581) T ss_pred eccCCCCccchHHHHHHHHHHhcCCceEE--EEeC-------CCCHHHHHHHHHHHHHHHhccCCcEEEEEecC-----C Confidence 777643 3678899999988877654332 2321 344678899999997763 3455544321 1 Q ss_pred cccCCHHHHHHHHhcccccccchhhccccccceEEEEcCeEEEecccCC-ceeEechHHHHHHHHHHHhhcCCceECcCC Q lcl|NC_019538. 434 PVATAVKKMVEWRRGVTDSGVVVDDNMNIGTTYSSTSANYKLQYDKYND-TNRWIPLSADMAGLCARTDTVAQPWMSPAG 512 (678) Q Consensus 434 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~v~d~~~~-~~~~~pps~~vAG~~a~~d~~~g~~~sPan 512 (678) +...+.+.+.+. ..++++.+..+++|+.++++...+ ....+|+ .++|+.+|.+.....+++||.| T Consensus 353 ~~~~~~~~~~~~-------------a~~~n~~Rvvlv~p~~~~~~g~~~~~~v~lp~-y~~AA~vAGl~a~~~~~~slT~ 418 (581) T protein:vir:10 353 VTPVPSATRIAN-------------AQSIKDQRVALISPSSFVYYAPELNREVVLGG-QFMAAAVAGKSVSAIAAMPLTR 418 (581) T ss_pred CCCccHHHHHHh-------------hccCCCceEEEEecCceeecCcccCceeccch-hhHHHHHHHHhhccccccCccc Confidence 222334444432 235678899999999998877544 4444555 3444444444455568999999 Q ss_pred cchhheeecccceecCChhhhhhhhhCCcEEEEEecCCcEEE-eccccCCCCccccceeehhhHHHHHHHHHHHHHH--H Q lcl|NC_019538. 513 FNRGQILDVRKLAIETRQAHRDELYQNSMNPVVGFPGQGFIL-YGDKTMSLQPTPFDRINVRRLFNLLKKSISESAK--Y 589 (678) Q Consensus 513 ~~~~~v~~~~~~~~~~~~~e~~~L~~~gIn~i~~~~~~G~~~-wG~rT~~~~~~~~~~i~vrR~~~~i~~si~~~~~--~ 589 (678) +++.++ ..+...+++.|++.|+++|+++++.++++++++ ||-+|+..+ .+|++|++||+++++++.+++.++ + T Consensus 419 ~~i~gi---~~l~~~~s~~e~e~ll~~Gv~~l~~~~~~~v~Iv~gItT~~s~-~~~~~i~~iR~~D~v~~~ir~~~~~~~ 494 (581) T protein:vir:10 419 KVIRGF---SGPAEVQRDGEKSRESSEGLMVIEKTPRNLVHVRHGVTTDPTS-LHTREWNIIGQQDVMVYRIRDYLDADG 494 (581) T ss_pred cccccc---ccccccCCHHHHHHHHhCCeEEEEEecCCeEEEEeeeecCCCC-CcceeeeeehhhhHHHHHHHHHhhhhc Confidence 887666 456788999999999999999999999999987 555666655 479999999999999999999985 5 Q ss_pred HhcCCCCHHHHHHHHHHHHHHHHHHHhcCCeeeeEEEEccCCCCHHHhhCCEEEEEEEEEecCCceEEEEEEEEeecCce Q lcl|NC_019538. 590 KLFENNDAFTRNSFRSEVNSYLDSIKSLGGIYDFRVVCDETNNTPAVIDRNEFVATILIKPARSINYVSLNFSAVGTSAN 669 (678) Q Consensus 590 ~vfepn~~~~~~~v~~~i~~~L~~l~~~gal~g~~V~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~ 669 (678) |++|||++.+|.+||..+.+||.+||+.|+|.||+.. ..++.+.+.++++++|.++|++|+|||.+|++....+.. T Consensus 495 fIG~~n~~~~r~~ik~~i~~~L~~l~~~g~I~~~~~~----~~~~~~~~~d~v~V~i~v~Pv~~i~~I~vti~~~p~~~~ 570 (581) T protein:vir:10 495 LIGMPIYDTTIVQVKASAEAALVWLVDNNIIRGYRNL----KARQIERQPDVIEVRYEWRPAYPLNYIVVRYSIAPETGD 570 (581) T ss_pred CCCcccCHHHHHHHHHHHHHHHHHHHhcCcccCCccc----eeeeeecCCCEEEEEEEEEecccceEEEEEEEEecCCCc Confidence 8889999999999999999999999999999999642 235667888999999999999999999999999999998 Q ss_pred eeeccccCC Q lcl|NC_019538. 670 FDELVGPQN 678 (678) Q Consensus 670 ~~e~~~~~~ 678 (678) |+--++--- T Consensus 571 ~~~~~~~~~ 579 (581) T protein:vir:10 571 ITSTIEGTT 579 (581) T ss_pred eEEEEeccc Confidence 877666554 No 47 >protein:vir:7653 Length: 581 # NCBI annotation: gp124 # Family: family:all:1196 # MgeID: mge:148 # MgeName: Bxz1 # Cross-refs: genbank:acc:NP_818197;genbank:gi:29566631;genbank:GeneID:1259809 Probab=100.00 E-value=2.3e-38 Score=226.98 Aligned_cols=522 Identities=13% Similarity=0.079 Sum_probs=243.2 Q ss_pred ccceeeeec-cccccc------ccceeeecccccccccccccccccccccceeeecccccccccceeecccccc---cce Q lcl|NC_019538. 97 ETIDYTITS-PGVDYR------IGDDVKILQNGATITTGKVSALNSVGGITFVRFSTAEVVKKAKELNDYPALQ---NGW 166 (678) Q Consensus 97 ~~~~~~~~~-~~~~~~------~g~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~a~~~~~~~~~~---~~~ 166 (678) ..++....+ ++.|-. .|.+..+ ..+ .....+.. + + ....-....++... ..+ T Consensus 1 ~~~~~~~~~~~~~~t~~~~~~~~g~~~~~--~~~----~~i~g~~~-g----~-------~g~~~s~r~~p~~~~~~evq 62 (581) T protein:vir:76 1 MAIDFSQYQTPGVYTEAVGAPQLGIRSSV--PTA----VAIFGTAV-G----Y-------QTYRESIRINPDTGETITTQ 62 (581) T ss_pred CcccccccccchhhhhhccccccCcceee--eee----eeeccccc-c----c-------ccccceeeecCCCCCCCceE Confidence 112221111 111111 0100000 000 00000000 0 0 00000000000000 000 Q ss_pred eeeeeecccccccccceeeeeeccccceeeec----ccccccccccc---------cccccchhccccccccceeeeccc Q lcl|NC_019538. 167 QIQFTSGGPGSGQSATAVLNGIRQDSKIYIRN----DEYSRESLLRR---------DETTETYIDMCESYGIPVVASRYA 233 (678) Q Consensus 167 ~~~~~s~~~~~g~~a~~~~~~~~~~~~i~~~~----~~~a~~~~~~~---------~~~~~~~~~~~~~~~~~~i~A~~~ 233 (678) ....+. .+ .++.-++...+..+. .+.... ...+...+... .........+..... .+..... T Consensus 63 ~v~~~~-~~-t~G~ftLt~~g~tT~-~I~~~asa~~v~~AL~~L~~i~~~~v~vtg~~~~~~~V~F~g~~~--~~~~~~~ 137 (581) T protein:vir:76 63 ILALVG-EP-TGGSFKLSLAGEPTG-NIPFNATQGQVQSALRALPNVEDDEVTVLGDPGGPWTVTFTKAVA--ALTKDVT 137 (581) T ss_pred EEEEee-cC-CcceEEEEeCceecc-ccccCCCHHHHHHHHhhccCCCCceEEEEcCCCceEEEEEcCCcc--ceeEeee Confidence 000000 00 000001111110000 000000 00000000000 000000000000000 0000000 Q ss_pred ---cccccceeEEecc----cccccccc--c-cccccccccccccccccccceeeeecccccccccccccc-ceee-ecc Q lcl|NC_019538. 234 ---GLTGDNIQVAFIA----YKDYYKFG--V-DGKISSVNTVNLKTFPSGLSFGNITPSSYLEYGPQTKDQ-FAMI-VFV 301 (678) Q Consensus 234 ---G~~gn~i~v~v~~----~~~~~~~~--~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~-v~~ 301 (678) |..+-.+++.... ...+.... . ..............++.+..+.................. +... +.. T Consensus 138 ~ltg~~~~~~~V~~~~~G~~~~~~~l~~~g~~~~~~~~~~s~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~ 217 (581) T protein:vir:76 138 GLTGGDNPDLNIASEQTGVPAMNRALAKKGIKTDTIRVVNPNSGQVYVLGTDYVVTRVNAGEDGEANTRDDLYTIQRVVD 217 (581) T ss_pred eeecCCcceeEEEEEecCcCCcCceeeeccccccccceeecCCcceeeecccccceeeccCcccceeeeeeeeeeEeecc Confidence 0000011111000 00000000 0 000000000000000000000000000000000000000 0111 111 Q ss_pred CCeeeeeee--eeccccccccccchhh------------hhhhhcCCcceEEEEecCCCccccceeeeeccCcCCcc-cc Q lcl|NC_019538. 302 GGSAVESRI--LSVKENDRDIYGSSIY------------VDEFFINGYSTFIQGVAESWPTEYSGILTFGGGNSGNS-TA 366 (678) Q Consensus 302 ~~~~~~~~~--~~~~~~~~~~~~~~~~------------~~~~~~~~~~~~v~~~~~~~~~~~~~~~~l~gg~dg~~-~~ 366 (678) ++......+ ++....+......-.+ ..+...+..+.+...+. ..........|++|.|+.. .+ T Consensus 218 g~~~~~~~i~~~~~~~~D~~~~~~v~~~~~~~~~~~~~~~~~~~g~~~~e~~~~~~--~~~t~~~~~~l~~gvd~~g~tv 295 (581) T protein:vir:76 218 GGHIDPGDIVQLSYRYTDPNYHEVIRFTDPDDIQDFYGPAFDEAGNVQSEITLCAQ--LAITNGASTILACAVDPEGDTV 295 (581) T ss_pred cccccceeEEEEEEEeecCCccceEEEecccccccceeeehhhcCccccchhhhhh--eeeccccceEEEeeecCCCCcc Confidence 111110000 0100000000000000 00000011111111100 0011122345677777633 46 Q ss_pred chhHHHhhhhhhhccchhccccccccccccCcccchhHHHHHHHHHHHhcC----CeEEEEccccchhccccccCCHHHH Q lcl|NC_019538. 367 SAGDWIEGWDMFSDREHVDVNLFIAGSCAGEGVEIASTVQKSVAAICDERQ----DCLGWISPPREYMVNLPVATAVKKM 442 (678) Q Consensus 367 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~l~~~~~~~~----~~~~i~d~p~~~~~~~~~~~~~~~~ 442 (678) +.++|..+++++...+...+ ++|. +...+++..+.+||++++ .+.+++..+. .+...+.+.+ T Consensus 296 t~~dy~~aL~ale~~~~~~i--vvp~-------t~~~~i~a~l~ahv~~~s~~~~~~ra~igv~g-----~~~~~~~~~~ 361 (581) T protein:vir:76 296 TMGDYQNALNKFRDEDEIAI--IVAG-------TGAQPIQALVQQHVSAQSNNKYERRAILGMDG-----SVTPVPSATR 361 (581) T ss_pred chHHHHHHHHHHhcCCeEEE--EEec-------CCChHHHHHHHHHHHHHHhccCCceEEEEeeC-----CCCCchHHHH Confidence 78899999988877654432 2221 244678888889887663 3444444321 1122333444 Q ss_pred HHHHhcccccccchhhccccccceEEEEcCeEEEecccCCceeEechHHHHHHHHHHHhhcCCceECcCCcchhheeecc Q lcl|NC_019538. 443 VEWRRGVTDSGVVVDDNMNIGTTYSSTSANYKLQYDKYNDTNRWIPLSADMAGLCARTDTVAQPWMSPAGFNRGQILDVR 522 (678) Q Consensus 443 ~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~v~d~~~~~~~~~pps~~vAG~~a~~d~~~g~~~sPan~~~~~v~~~~ 522 (678) .+. ...+++.|..+++||.++++...+......|..++|+.+|.+.....+++||.|+++.++ . T Consensus 362 ~~~-------------a~~~ns~Rvvlv~p~~~~~~g~~~~~~~~lp~~~~AA~vAG~~a~~~~~~slT~~~i~g~---~ 425 (581) T protein:vir:76 362 IAN-------------AQSIKDQRVALISPSSFVYYAPELNREVVLGGQFMAAAVAGKSVSAIAAMPLTRKVIRGF---S 425 (581) T ss_pred HHh-------------hcccCCCcEEEEEcCceEeccccCCcceecchhhhhhhHHhhhhccccccCccccccccc---c Confidence 332 235678999999999999987655444444556667777777777889999999887665 5 Q ss_pred cceecCChhhhhhhhhCCcEEEEEecCCcEEE-eccccCCCCccccceeehhhHHHHHHHHHHHHHH--HHhcCCCCHHH Q lcl|NC_019538. 523 KLAIETRQAHRDELYQNSMNPVVGFPGQGFIL-YGDKTMSLQPTPFDRINVRRLFNLLKKSISESAK--YKLFENNDAFT 599 (678) Q Consensus 523 ~~~~~~~~~e~~~L~~~gIn~i~~~~~~G~~~-wG~rT~~~~~~~~~~i~vrR~~~~i~~si~~~~~--~~vfepn~~~~ 599 (678) .+...+++.|++.|+++|+++++.++++++++ ||-+|+..+ .+|++|++||+++++++.+++.++ +|++|||++.+ T Consensus 426 ~~~~~~s~~e~e~ll~~Gv~~l~~~~~~~v~Iv~gItT~~s~-~~~k~i~viR~~D~v~~~vr~~~~~~~fiG~~n~~~~ 504 (581) T protein:vir:76 426 GPAEVQRDGEKSRESSEGLMVIEKTPRNLVHVRHGVTTDPTS-LHTREWNIIGQQDVMVYRIRDYLDADGLIGMPIYDTT 504 (581) T ss_pred cccccCCHHHHHHHHhCCeEEEEEecCCeEEEEEeeecCCCC-CccceeeehhhhHHHHHHHHHHHhhhcCCCcccChHH Confidence 67788999999999999999999999999875 777787665 479999999999999999999986 57889999999 Q ss_pred HHHHHHHHHHHHHHHHhcCCeeeeEEEEccCCCCHHHhhCCEEEEEEEEEecCCceEEEEEEEEeecCceeeeccccCC Q lcl|NC_019538. 600 RNSFRSEVNSYLDSIKSLGGIYDFRVVCDETNNTPAVIDRNEFVATILIKPARSINYVSLNFSAVGTSANFDELVGPQN 678 (678) Q Consensus 600 ~~~v~~~i~~~L~~l~~~gal~g~~V~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~~~~~ 678 (678) |.+||..+..||.+||+.|+|.||... ..++.+++.+++++++.++|++|+|||.++++.+..+..|+--++--- T Consensus 505 r~~ik~~i~~~L~~l~~~g~I~g~~~~----~~~~~~~~~d~v~V~i~v~Pv~~ie~I~vt~~~~p~~~~~~~~~~~~~ 579 (581) T protein:vir:76 505 IVQVKASAEAALVWLVDNNIIRGYRNL----KARQIERQPDVIEVRYEWRPAYPLNYIVVRYSIAPETGDITSTIEGTT 579 (581) T ss_pred HHHHHHHHHHHHHHHHhcCcccCcccc----eeeEEecCCCEEEEEEEEEecccceEEEEEEEEeeCCCceEEEEeccc Confidence 999999999999999999999999632 345667788999999999999999999999999999998877665544 No 48 >protein:vir:78986 Length: 436 # NCBI annotation: putative sheath tail protein # Family: family:all:632 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110731;genbank:gi:134287348;genbank:GeneID:4955160 Probab=99.96 E-value=3.4e-29 Score=176.68 Aligned_cols=406 Identities=17% Similarity=0.156 Sum_probs=251.9 Q ss_pred CceecCceEEEEcCCC-cccccCCccceeEEecccCCCCCccEEecC---HHHHHHHcCCcCccchhHHHHHHHHHhcCC Q lcl|NC_019538. 1 MALLSPGVESKENNMQ-TTIARSSTGRAALAGKFQWGPAYQISQLVS---ETDLIDRFGRPDNQTADSVLSAINFLKYGN 76 (678) Q Consensus 1 ~~~~~PGVyveEv~~~-~~i~~v~tsv~afvG~~~~Gpv~~pv~i~s---~~~~~~~FG~~~~~~~~~~~v~~~f~ngG~ 76 (678) ..=.-||+||+-++.. ..+++....+.++...+.|||+++++.|++ ..++...||.. ...+....++..| .|++ T Consensus 11 ~~K~~PG~Y~n~~~~~~~~~~~~~rGi~a~p~~~~wGp~~~v~~i~~~~~~~~~~~~~G~~-~~~~~~~~l~~~~-~~~~ 88 (436) T protein:vir:78 11 QNKVLPGSYINFVSATRATSSLSDRGIVAMPLELDWGIDEEVFQVTSDDFEKYSTKYFGYD-YTHEKLKGLRDLF-KNIR 88 (436) T ss_pred ceeecCceEEEEEecCcceeeccCCeEEEEEEEecCCCCceeEEeecccchHHHHHHhcCc-cchHHHHHHHHHh-cCCC Confidence 3335799999999754 568999999999999999999999999998 56899999964 2222334566655 6678 Q ss_pred eEEEEEcCCcccccccccccccceeeeecccccccccceeeecccccccccccccccccccccceeeeccccccccccee Q lcl|NC_019538. 77 DLRTVRILDEDTARNSSPFFETIDYTITSPGVDYRIGDDVKILQNGATITTGKVSALNSVGGITFVRFSTAEVVKKAKEL 156 (678) Q Consensus 77 ~~~vvRv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~a~~~ 156 (678) .+|+.|+.+...+.+. -. T Consensus 89 tv~~yrl~~G~~a~~~-------v~------------------------------------------------------- 106 (436) T protein:vir:78 89 LGYFYKLNKGVKASCS-------IA------------------------------------------------------- 106 (436) T ss_pred EEEEEECCCcceeeee-------ee------------------------------------------------------- Confidence 8999999643111100 00 Q ss_pred ecccccccceeeeeeecccccccccceeeeeeccccceeeecccccccccccccccccchhccccccccceeeecccccc Q lcl|NC_019538. 157 NDYPALQNGWQIQFTSGGPGSGQSATAVLNGIRQDSKIYIRNDEYSRESLLRRDETTETYIDMCESYGIPVVASRYAGLT 236 (678) Q Consensus 157 ~~~~~~~~~~~~~~~s~~~~~g~~a~~~~~~~~~~~~i~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~i~A~~~G~~ 236 (678) .|.++|.. T Consensus 107 ------------------------------------------------------------------------~Aky~g~~ 114 (436) T protein:vir:78 107 ------------------------------------------------------------------------TARCSGIR 114 (436) T ss_pred ------------------------------------------------------------------------eeecCCCC Confidence 12233334 Q ss_pred ccceeEEeccccccccccccccccccccccccccccccceeeeeccccccccccccccceeeeccCCeeeeeeeeecccc Q lcl|NC_019538. 237 GDNIQVAFIAYKDYYKFGVDGKISSVNTVNLKTFPSGLSFGNITPSSYLEYGPQTKDQFAMIVFVGGSAVESRILSVKEN 316 (678) Q Consensus 237 gn~i~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~ 316 (678) ||.+++.+....+... .+-+.+..+...++...+... T Consensus 115 gn~i~v~v~~~~~d~~-----------------------------------------~~dv~~~~g~~~~d~~~~~~~-- 151 (436) T protein:vir:78 115 GNDLKVIVTTNIDDNA-----------------------------------------KFDVVTLLDNKKVDTQIAKVI-- 151 (436) T ss_pred CcEEEEEecccccccC-----------------------------------------ceEEEEEecchhhhhhhHHHH-- Confidence 4444332221110000 000000001111100000000 Q ss_pred ccccccchhhhhhhhcCCcceEEEEecCCCccccceeeeeccCcCCccccchhHHHhhhhhhhccchhcccccccccccc Q lcl|NC_019538. 317 DRDIYGSSIYVDEFFINGYSTFIQGVAESWPTEYSGILTFGGGNSGNSTASAGDWIEGWDMFSDREHVDVNLFIAGSCAG 396 (678) Q Consensus 317 ~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~l~gg~dg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 396 (678) .+. ..+.|+.....+..+. .....|+||.||.+ ++..+|..+++.++.. .++.++++. T Consensus 152 -----------~~l---~~n~~V~~~~~g~la~-~a~~~LtGG~dG~~-~T~~dy~~al~~le~~---~fn~l~~~~--- 209 (436) T protein:vir:78 152 -----------TEL---QDNDYVTWKKEATLEA-TAGLTFTNGTNGEA-VTGTEYQAFLDKIESY---SFNALGCLA--- 209 (436) T ss_pred -----------hhc---cCCceEEEEecccccc-cceeeeeccccccc-cchHHHHHHHHHHccc---ceeEEEecC--- Confidence 000 0011222211112221 22366899999864 5678898888776544 455554432 Q ss_pred CcccchhHHHHHHHHHHHhcCCe-----EEEEccccchhccccccCCHHHHHHHHhcccccccchhhccccccceEEEEc Q lcl|NC_019538. 397 EGVEIASTVQKSVAAICDERQDC-----LGWISPPREYMVNLPVATAVKKMVEWRRGVTDSGVVVDDNMNIGTTYSSTSA 471 (678) Q Consensus 397 ~~~~~~~~v~~~l~~~~~~~~~~-----~~i~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~ 471 (678) ...++++.+.+++.++|+. -+++... ...+.+. .+. T Consensus 210 ----~d~~~~~~~~a~ikr~re~~g~~~~aV~~~~--------~~~d~Eg--------------------------IIn- 250 (436) T protein:vir:78 210 ----TTAEIKSLFVEFTKRMRDKVGAKFQTVLYKK--------NDADYEG--------------------------VVS- 250 (436) T ss_pred ----CChHHHHHHHHHHHHHHhhcCCeEEEEecCC--------CCCCCce--------------------------EEE- Confidence 2356788899999888742 1222110 0011111 111 Q ss_pred CeEEEecccCCce-eEechHHHHHHHHHHHhhcCCceECcCCcchhheeecccceecCChhhhhhhhhCCcEEEEEecCC Q lcl|NC_019538. 472 NYKLQYDKYNDTN-RWIPLSADMAGLCARTDTVAQPWMSPAGFNRGQILDVRKLAIETRQAHRDELYQNSMNPVVGFPGQ 550 (678) Q Consensus 472 p~~~v~d~~~~~~-~~~pps~~vAG~~a~~d~~~g~~~sPan~~~~~v~~~~~~~~~~~~~e~~~L~~~gIn~i~~~~~~ 550 (678) +-+...+.. -..-..+++||++|.++ +.+|+.|+.+.++ ..+...+++.|.+.+.++|..++.+. ++ T Consensus 251 ----v~n~v~g~~~~~~~~~a~vAG~~Ag~~----~~~S~T~~~~~~~---~~v~~~~t~~e~~~ai~~G~lvl~~d-~~ 318 (436) T protein:vir:78 251 ----VENKIKDTGLLESSLIYWTTGAIAGCD----INKSNTNKRYDGE---FDVDVNYTQIHLEEALKTGKFIFHKV-GD 318 (436) T ss_pred ----eecccCCceechhHHHHHHHHHHhcCc----cccCccceecCcc---ccccccCCHHHHHHHHhCCeEEEEEe-CC Confidence 111112211 11125688899998775 6668888776543 45667899999999999999999764 56 Q ss_pred cEEEeccc-cC----CCCccccceeehhhHHHHHHHHHHHHHH-HHhcC-CCCHHHHHHHHHHHHHHHHHHHhcCCeeee Q lcl|NC_019538. 551 GFILYGDK-TM----SLQPTPFDRINVRRLFNLLKKSISESAK-YKLFE-NNDAFTRNSFRSEVNSYLDSIKSLGGIYDF 623 (678) Q Consensus 551 G~~~wG~r-T~----~~~~~~~~~i~vrR~~~~i~~si~~~~~-~~vfe-pn~~~~~~~v~~~i~~~L~~l~~~gal~g~ 623 (678) ++++--+- |+ ...+..|+.|.++|++|+|.+.|++... .|+++ ||+..-|..++..|+.||++|.++|+|..| T Consensus 319 ~v~I~~~VNTltt~~~~k~~~~~kI~vir~~D~i~~di~~~~~~~yiGKv~N~~dgr~~l~~~i~~yl~~L~~~g~I~~f 398 (436) T protein:vir:78 319 EVHVLEDINTFVSFTDEKNDDFSSNQSVRVLDQIANDIATLFNTKYLGEVPNDKSGRISFWNDVVKHHEQLQNMRAIEDF 398 (436) T ss_pred eEEEEEccccceecCCCCCcchhhhhHHHHHHHHHHHHHHHhhhccccccCCCHHHHHHHHHHHHHHHHHHHhCCcccCC Confidence 66665554 33 2334689999999999999999999876 59995 699999999999999999999999999998 Q ss_pred E---EEEccCCCCHHHhhCCEEEEEEEEEecCCceEEEEEEEEe Q lcl|NC_019538. 624 R---VVCDETNNTPAVIDRNEFVATILIKPARSINYVSLNFSAV 664 (678) Q Consensus 624 ~---V~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~ 664 (678) . +.+++. + ....+++.+.++|+..+|+|.+++.-. T Consensus 399 ~~~Dv~v~~~-~-----~~~~v~v~~~v~pvdamekiy~ti~v~ 436 (436) T protein:vir:78 399 KADDVSVEPG-S-----DKKTVVVSDAVKVISAMSKLYMTVSVS 436 (436) T ss_pred CCcceEEeec-C-----CCCEEEEEEEEEEEEeeeeEEEEEEEC Confidence 6 444321 1 356788999999999999999997643 No 49 >protein:vir:102359 Length: 356 # NCBI annotation: XkdK protein # Family: family:all:632 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529565;genbank:gi:90592650;genbank:GeneID:3974491 Probab=99.24 E-value=7.6e-13 Score=87.02 Aligned_cols=321 Identities=12% Similarity=0.088 Sum_probs=168.4 Q ss_pred ccccccccccccccccccc-cceeeeeccccccccccccccceeeeccCCeeeeeeeeecccccc-cc-ccchhhhhhhh Q lcl|NC_019538. 255 VDGKISSVNTVNLKTFPSG-LSFGNITPSSYLEYGPQTKDQFAMIVFVGGSAVESRILSVKENDR-DI-YGSSIYVDEFF 331 (678) Q Consensus 255 ~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~-~~-~~~~~~~~~~~ 331 (678) .. .. |.- ..+.........++... ...++...++-. -..++....-. .+ .....++...+ T Consensus 1 ~~---------gl---p~i~i~f~~~a~ta~~~g~rG----iv~~il~d~~~~-~~~~~~~~~v~~~~~~~n~~~i~~~~ 63 (356) T protein:vir:10 1 MA---------GL---VNINIEFKELATSFIQRSKAG----IVAIILKDTTKM-YKELTSEDDIPISLSADNKKYIKYGF 63 (356) T ss_pred CC---------CC---CceeEEEeecceeeccCCccc----eEEEEEecCCcc-eeEEeccccchhHHHHHHHHHHHHHh Confidence 00 00 000 01111111000000000 011111111100 00000000000 00 00111222211 Q ss_pred cCCcceEEEEecCCCccccceeeeeccCcCCccccchhHHHhhhhhhhccchhccccccccccccCcccchhHHHHHHHH Q lcl|NC_019538. 332 INGYSTFIQGVAESWPTEYSGILTFGGGNSGNSTASAGDWIEGWDMFSDREHVDVNLFIAGSCAGEGVEIASTVQKSVAA 411 (678) Q Consensus 332 ~~~~~~~v~~~~~~~~~~~~~~~~l~gg~dg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~l~~ 411 (678) ..+... ... ..+..++..+ ..+.+++..+++.+.. +.++.+.++ +. ..++++.+.+ T Consensus 64 ~g~~~~----~~~--------~~p~~~~~~~--~~t~~~y~~aL~~le~---~~fn~l~~~-----~~--d~~~~~~~~a 119 (356) T protein:vir:10 64 VGATDN----EKV--------LRPSKVIIST--FTEDGKVEDILEELES---VEFNYLCMP-----EA--IEAEKTKIVT 119 (356) T ss_pred hccccc----ccc--------ccceeeeeec--ccCchhHHHHHHHhcC---ccceEEEec-----CC--ChHHHHHHHH Confidence 111000 000 0001111111 1134577777776643 345544433 22 2467888889 Q ss_pred HHHhcCCe----EEEEccccchhccccccCCHHHHHHHHhcccccccchhhccccccceEEEEcCeEEEecccCCceeE- Q lcl|NC_019538. 412 ICDERQDC----LGWISPPREYMVNLPVATAVKKMVEWRRGVTDSGVVVDDNMNIGTTYSSTSANYKLQYDKYNDTNRW- 486 (678) Q Consensus 412 ~~~~~~~~----~~i~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~v~d~~~~~~~~- 486 (678) ++.++|+- +..+-+. ...+.+.++.+ . .. .++ ++ ..+ T Consensus 120 ~ikr~r~~~~~~~~~V~~~--------~~aD~EgIInv-----------------~--------n~-~~~---~g-~~~t 161 (356) T protein:vir:10 120 WIKKIREEESTEAKAVLAN--------IKADNEAIINF-----------------T--------EN-VVV---DG-EEIT 161 (356) T ss_pred HHHHHHhcCCcEEEEEecC--------CCCCCceeEEe-----------------e--------cC-eEe---cc-eeec Confidence 99887742 2221110 00111111110 0 00 111 11 111 Q ss_pred -echHHHHHHHHHHHhhcCCceECcCCcchhheeecccceecCChhhhhhhhhCCcEEEEEecCCcEEEecc-ccC---- Q lcl|NC_019538. 487 -IPLSADMAGLCARTDTVAQPWMSPAGFNRGQILDVRKLAIETRQAHRDELYQNSMNPVVGFPGQGFILYGD-KTM---- 560 (678) Q Consensus 487 -~pps~~vAG~~a~~d~~~g~~~sPan~~~~~v~~~~~~~~~~~~~e~~~L~~~gIn~i~~~~~~G~~~wG~-rT~---- 560 (678) .-..+++||++|.+. +.+|+.|+.+.++. ....+++.|.+.+.++|--++.+. ++.+++--+ .|+ T Consensus 162 ~~~~~~~vAG~~Ag~~----~n~S~T~~~~~~~~----~~~~~t~~e~~~ai~~G~lvl~~d-~~~V~I~~~VNSltt~t 232 (356) T protein:vir:10 162 AEKYTTRVASLIASTP----NTQSITYAPLDEVE----SIVKIDKASADAKVQAGELILRRL-SGKIRIARGINSLTTLT 232 (356) T ss_pred hhHHHHHHHHHHhccc----hhccccceecCCcc----ccccCCHHHHHHHHhCCeEEEEEE-cCeEEEEecCccceecC Confidence 223578999999885 56688887765432 223588999999999999999775 444555444 343 Q ss_pred CCCccccceeehhhHHHHHHHHHHHHHH-HHhcCC-CCHHHHHHHHHHHHHHHHHHHhcCCee-eeEEEEccCCC----- Q lcl|NC_019538. 561 SLQPTPFDRINVRRLFNLLKKSISESAK-YKLFEN-NDAFTRNSFRSEVNSYLDSIKSLGGIY-DFRVVCDETNN----- 632 (678) Q Consensus 561 ~~~~~~~~~i~vrR~~~~i~~si~~~~~-~~vfep-n~~~~~~~v~~~i~~~L~~l~~~gal~-g~~V~~d~~~n----- 632 (678) ...+..|+.|.+.|++|.|.+.|++... .|+++- |+..-|..++..++.||.+|.++|+|. +|.+..|.+.. T Consensus 233 ~~k~~~f~Kirvvr~~D~i~~Di~~~f~~~yiGKv~N~~dgr~~l~~ai~~y~~~L~~~~~I~~~~~~eid~e~q~~~~~ 312 (356) T protein:vir:10 233 AEKGEIFQKIKLVDTKDLISKDIKNIYVEKYLRKCPNTYDNKCLFIVAVQSYLTELAKQELIDSNFTVEIDLEKQKEYLE 312 (356) T ss_pred CCCCcchhhhHHHHHHHHHHHHHHHHHhhccccccCCCHHHHHHHHHHHHHHHHHHHhCCccccCceeEecccchHHHhh Confidence 2334679999999999999999999986 699985 999999999999999999999999995 67777775431 Q ss_pred ---------CHHHhh----CCEEEEEEEEEecCCceEEEEEEEE Q lcl|NC_019538. 633 ---------TPAVID----RNEFVATILIKPARSINYVSLNFSA 663 (678) Q Consensus 633 ---------t~~~i~----~G~~~~~i~~~p~~p~e~i~~~~~~ 663 (678) +...+. .-.+.+.+.++|+-.+|.|.+++.- T Consensus 313 ~~g~d~~~~~d~~v~~~~~~~~v~~~~~v~~vdamE~iy~ti~v 356 (356) T protein:vir:10 313 GKKIAVSKMKENEIKEANTGSNGFYLINLKLVDAMEDINIRVQM 356 (356) T ss_pred hccccccccccceeecccCCcEEEEEEEEEEEeeeeeEEeEEeC Confidence 111111 2347799999999999999999875 No 50 >protein:vir:3788 Length: 376 # NCBI annotation: tail sheath # Family: family:all:669 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536828;genbank:gi:17981837;genbank:GeneID:929216 Probab=99.02 E-value=1.4e-09 Score=69.05 Aligned_cols=350 Identities=11% Similarity=0.070 Sum_probs=179.7 Q ss_pred ccccccccccccccccccccceeeeeccccccccccccccceeeeccCCeeeeeeeeecc--------ccccccccchhh Q lcl|NC_019538. 255 VDGKISSVNTVNLKTFPSGLSFGNITPSSYLEYGPQTKDQFAMIVFVGGSAVESRILSVK--------ENDRDIYGSSIY 326 (678) Q Consensus 255 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~--------~~~~~~~~~~~~ 326 (678) ..+........+. .+..+..+.+|..+- .+.+-.. .+++.+ .+..+... ... T Consensus 1 ~~~~v~vn~~n~~-----------------~g~~~~~er~~Lfig-~~~~~~~-~~~~~~~~sdld~~lg~~~~~l-k~~ 60 (376) T protein:vir:37 1 MFPSVQINALNQL-----------------SGETKEIERHALFVG-VGTTNQG-KLLALTPDSDFDKVFGETDTDL-KKQ 60 (376) T ss_pred CCCeEEEeccccc-----------------CCCcccccceEEeec-ccccccc-ceeeecCccchHhhhCCCchHH-HHH Confidence 0000000000000 001111122222221 1111110 111111 01111111 122 Q ss_pred hhhhhcCCcceEEEEecCCCccccceeeeeccCcCCccccchhHHHhhhhhhhccchhccccc-cccccccCcccchhHH Q lcl|NC_019538. 327 VDEFFINGYSTFIQGVAESWPTEYSGILTFGGGNSGNSTASAGDWIEGWDMFSDREHVDVNLF-IAGSCAGEGVEIASTV 405 (678) Q Consensus 327 ~~~~~~~~~~~~v~~~~~~~~~~~~~~~~l~gg~dg~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~v 405 (678) +.....|++..|...+... ..+..++.++++... +.+.+..+ +++... ..-....+. T Consensus 61 v~aa~~naG~~~~~~~~~~-------------------~~~~~~~~~Av~~a~--~~~s~E~V~v~~pv~-t~~a~i~aa 118 (376) T protein:vir:37 61 VRAAMLNAGQNWFAHVYIA-------------------QEDGYDFVECVKKAN--QTASFEYCVNTRYLG-VDKASIGKL 118 (376) T ss_pred HHHHHhCCCCcEEEEEEee-------------------cCCchHHHHHHHHhh--hhcCceEEEEecccc-ccHHHHHHH Confidence 3334556666654433211 012234666665532 22333222 222211 011122333 Q ss_pred HHHHHHHHHh-cCCeEEEEccccchhccccccCCHHHHHHHHhcccccccchhhccccccceEEEEcCeEEEecccCCce Q lcl|NC_019538. 406 QKSVAAICDE-RQDCLGWISPPREYMVNLPVATAVKKMVEWRRGVTDSGVVVDDNMNIGTTYSSTSANYKLQYDKYNDTN 484 (678) Q Consensus 406 ~~~l~~~~~~-~~~~~~i~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~v~d~~~~~~ 484 (678) ++....+..+ +|-++.|+..+... .+...+ ++..+|....... ..++++.+..+. |. .+ T Consensus 119 ~~~a~el~~~~~Rpv~file~r~~~-~~~~~~---e~w~~y~~~~~al------~~gia~~~V~~V-~~--~~------- 178 (376) T protein:vir:37 119 QECYAELLAKFGRRTFFIQAVQGIN-HDQSDG---ETWDQYVQKLTTL------QQTIVADHVCLV-PL--LF------- 178 (376) T ss_pred HHHHHHHHHhcCCeEEEEEeccCcC-cccccc---cCHHHHHHHHHHh------hcccccccceee-ee--eh------- Confidence 3334344444 36677777765211 011112 3333443322211 122333333221 10 00 Q ss_pred eEechHHHHHHHHHHHhhcCCceECcCCcchhheeecccce-------ecCChhhhhhhhhCCcEEEEEecCC-cEEEec Q lcl|NC_019538. 485 RWIPLSADMAGLCARTDTVAQPWMSPAGFNRGQILDVRKLA-------IETRQAHRDELYQNSMNPVVGFPGQ-GFILYG 556 (678) Q Consensus 485 ~~~pps~~vAG~~a~~d~~~g~~~sPan~~~~~v~~~~~~~-------~~~~~~e~~~L~~~gIn~i~~~~~~-G~~~wG 556 (678) -..-|.+||.+|+- ..-++.||.-+..+.|.+....+ ..++....+.|..+|..+.+.++|+ |+.+-+ T Consensus 179 --gn~~G~~aGRl~~a--aVsVadspgRV~tG~l~gl~~~~lp~d~~~~~l~~a~l~aLd~agy~vp~~Y~gy~G~Y~~d 254 (376) T protein:vir:37 179 --GNETGVLAGRLANR--AVTVADSPARVQTGALVSLGSANKPLDKDRNELTLAHLKSLETARYSVPMWYPDYDGYYWAD 254 (376) T ss_pred --hhhHHHHHHHHhhc--ccchhhCccceeccccccccccccccCcCcccCCHHHHHHHHhCCCeEEEeeCCCCceEEeC Confidence 02358888887643 33468899887777776653322 3467888999999999999999985 888888 Q ss_pred cccCCCCccccceeehhhHHHHHHHHHHHHHHHHhcCCC---CHHHHHHHHHHHHHHHHHHHhcCCeeee----EEEEcc Q lcl|NC_019538. 557 DKTMSLQPTPFDRINVRRLFNLLKKSISESAKYKLFENN---DAFTRNSFRSEVNSYLDSIKSLGGIYDF----RVVCDE 629 (678) Q Consensus 557 ~rT~~~~~~~~~~i~vrR~~~~i~~si~~~~~~~vfepn---~~~~~~~v~~~i~~~L~~l~~~gal~g~----~V~~d~ 629 (678) .|||+...++|+||..+|..+-+.|.++..+-..+.... .+.-.+..+.-+..=|++|.+..-+.|. +|...+ T Consensus 255 ~~tl~~~gsDY~~ie~~RVvdKa~R~vR~~ai~~i~D~~lnst~~sia~~~~yi~~pLr~M~~s~~i~g~~fpGeI~~p~ 334 (376) T protein:vir:37 255 GRTLDVEGGDYQVIENLRVVDKVARKVRLLAIGKIADRSFNSTTSSTEYHKNYFAKPLRDMSKSATINGKDFPGECMPPK 334 (376) T ss_pred ceEeccCCCChhhhhhhhHHHHHHHHHHHHHHHHhCCcccCcchhhHHHHHHHHHHHHHHHHhcchhccccccceeecCC Confidence 899988778999999999999999999888777766432 3344556666677789999998888882 465544 Q ss_pred CC-CCHHHhhCCEEEEEEEEEecCCceEEEEEEEEeecCceeee Q lcl|NC_019538. 630 TN-NTPAVIDRNEFVATILIKPARSINYVSLNFSAVGTSANFDE 672 (678) Q Consensus 630 ~~-nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~e 672 (678) +. -+..-+...++.|.+-+.|.--..+|+..|.-.-. ...| T Consensus 335 d~Di~i~w~s~~~V~I~~~v~P~~~pk~Itv~I~Ldls--n~~~ 376 (376) T protein:vir:37 335 DDAITIVWQSKTKVTIYIKVRPYDCPKEITANIFLDLD--SLGE 376 (376) T ss_pred CCCceEEeeccceEEEEEEEEeccCCceEEEEEEeecC--CCCC Confidence 32 22223477889999999999989998866432222 1222 No 51 >protein:vir:3751 Length: 376 # NCBI annotation: orf23 # Family: family:all:669 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043492;genbank:gi:9628627;genbank:GeneID:1261129 Probab=98.95 E-value=1.1e-08 Score=64.28 Aligned_cols=350 Identities=11% Similarity=0.089 Sum_probs=183.4 Q ss_pred ccccccccccccccccccccceeeeeccccccccccccccceeeeccCCeeeeeeeeeccc--------cccccccchhh Q lcl|NC_019538. 255 VDGKISSVNTVNLKTFPSGLSFGNITPSSYLEYGPQTKDQFAMIVFVGGSAVESRILSVKE--------NDRDIYGSSIY 326 (678) Q Consensus 255 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~--------~~~~~~~~~~~ 326 (678) ..+...+....+.. +..+....+|..+- .+..- +..+++.+. +..+...... T Consensus 1 ~~~~v~vn~ln~~q-----------------g~~~~ver~~lfig-~~~~~-~~~~~~~~~~sdld~~lg~~ds~lk~~- 60 (376) T protein:vir:37 1 MFPSVQINALNQLS-----------------GETKEIERHALFVG-VGTTN-QGKLLALTPDSDFDKVFGETDTDLKKQ- 60 (376) T ss_pred CCCeEEEeeeeccC-----------------CCcccccceEEEee-ccccc-cCceEEecCCCChHHhhCCCchhHHHH- Confidence 00000001000000 00111112222111 11110 011111111 1111111111 Q ss_pred hhhhhcCCcceEEEEecCCCccccceeeeeccCcCCccccchhHHHhhhhhhhccchhcccccc-ccccccCcccchhHH Q lcl|NC_019538. 327 VDEFFINGYSTFIQGVAESWPTEYSGILTFGGGNSGNSTASAGDWIEGWDMFSDREHVDVNLFI-AGSCAGEGVEIASTV 405 (678) Q Consensus 327 ~~~~~~~~~~~~v~~~~~~~~~~~~~~~~l~gg~dg~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~v 405 (678) +.....|++..|...+... ..+..++.++++... +.+++..+. ++.. ..+.....+. T Consensus 61 v~aa~~naG~~w~a~~~~p-------------------~~~~~~~~~Av~~a~--~~~s~E~V~v~~p~-~t~~a~i~a~ 118 (376) T protein:vir:37 61 VRAAMLNAGQNWFAHVYIA-------------------QEDGYDFVECVKKAN--QTASFEYCVNTRYL-GVDKASIGKL 118 (376) T ss_pred HHHHHhCCCCceEEEEEec-------------------CCChhhHHHHHHHHH--hhCCeeEEEEecCc-chhHHHHHHH Confidence 2233445555553322110 112345777776653 333333322 2211 1111222333 Q ss_pred HHHHHHHHHhc-CCeEEEEccccchhccccccCCHHHHHHHHhcccccccchhhccccccceEEEEcCeEEEecccCCce Q lcl|NC_019538. 406 QKSVAAICDER-QDCLGWISPPREYMVNLPVATAVKKMVEWRRGVTDSGVVVDDNMNIGTTYSSTSANYKLQYDKYNDTN 484 (678) Q Consensus 406 ~~~l~~~~~~~-~~~~~i~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~v~d~~~~~~ 484 (678) +.....+..+. |-+|.++..+... .+...+++ ..+|...+.. ...++.+.+..++.. + + + T Consensus 119 qa~a~el~~~~~R~vffile~~g~d-~~~~~ge~---w~~y~~~l~a------~~~gia~~~V~vV~~-~--~----g-- 179 (376) T protein:vir:37 119 QECYAELLAKFGRRTFFIQAVQGIN-HDQSDGET---WDQYVQKLTT------LQQTIVADHVCLVPL-L--F----G-- 179 (376) T ss_pred HHHHHHHHHhcCCeEEEEEeccCCC-CcccccCC---HHHHHHHHHH------Hhccccccceeeeee-e--c----c-- Confidence 33333344443 5677777765211 01112223 3334332221 123444555554422 1 1 0 Q ss_pred eEechHHHHHHHHHHHhhcCCceECcCCcchhheeecccce-------ecCChhhhhhhhhCCcEEEEEecCC-cEEEec Q lcl|NC_019538. 485 RWIPLSADMAGLCARTDTVAQPWMSPAGFNRGQILDVRKLA-------IETRQAHRDELYQNSMNPVVGFPGQ-GFILYG 556 (678) Q Consensus 485 ~~~pps~~vAG~~a~~d~~~g~~~sPan~~~~~v~~~~~~~-------~~~~~~e~~~L~~~gIn~i~~~~~~-G~~~wG 556 (678) -..|.+||.+|+ ...-++.||.-+.-+.|.|....+ ..++....+.|..+|..+.+.++++ |+.+-+ T Consensus 180 ---n~~G~~aGRl~n--aaVsVadspgRV~tGai~gl~~~~~p~d~~g~el~~a~l~aLd~arysvpr~Y~gydG~Yw~d 254 (376) T protein:vir:37 180 ---NETGVLAGRLAN--RAVTVADSPARVQTGALVSLGSANKPLDKDGNELTLAHLKSLETARYSVPMWYPDYDGYYRAD 254 (376) T ss_pred ---chHHHHHHHHHh--CCcchhcCccceeecccccccccccccccCCcccchHHHHHHHhCCCeEEEeeCCCCceEEeC Confidence 245888998875 234468999988877777754432 2356678999999999999999984 888888 Q ss_pred cccCCCCccccceeehhhHHHHHHHHHHHHHHHHhcCC---CCHHHHHHHHHHHHHHHHHHHhcCCeeee----EEEEcc Q lcl|NC_019538. 557 DKTMSLQPTPFDRINVRRLFNLLKKSISESAKYKLFEN---NDAFTRNSFRSEVNSYLDSIKSLGGIYDF----RVVCDE 629 (678) Q Consensus 557 ~rT~~~~~~~~~~i~vrR~~~~i~~si~~~~~~~vfep---n~~~~~~~v~~~i~~~L~~l~~~gal~g~----~V~~d~ 629 (678) .||++...++|++|..+|.++-+.|.++...-..+..+ .++.-.+..+..++.=|+.|.+.+-|.|. +|+..+ T Consensus 255 g~tl~~~gsDYq~ie~~RVvdKa~R~vR~~Ai~~i~Dr~lnstp~sia~~~~~~~~pLr~M~ks~ei~g~~fpgei~~P~ 334 (376) T protein:vir:37 255 GRTLDVEGGDYQVIENLRVVDKVARKVRLLAIGKIADRSFNSTTSSTEYHKNYFAKPLRDMSKSATINGKDFPGECMPPK 334 (376) T ss_pred CeEeccCCCCeeeehhchHHHHHHHHHHHHHHHHhcCccccCChhHHHHHHHHHhHHHHHHHhhhhhccccccceeecCC Confidence 99998888899999999999999998887776666543 36667788899999999999999999994 355432 Q ss_pred CC-CCHHHhhCCEEEEEEEEEecCCceEEEEEEEEeecCceeee Q lcl|NC_019538. 630 TN-NTPAVIDRNEFVATILIKPARSINYVSLNFSAVGTSANFDE 672 (678) Q Consensus 630 ~~-nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~e 672 (678) +. -+..-....++.|-+-++|.--.+.|+..|.-... ...| T Consensus 335 d~dI~i~w~sk~~V~I~~~vrPy~cpk~i~~~I~LDls--~~~~ 376 (376) T protein:vir:37 335 DDAITIVWQSKTKVTIYIKVRPYDCPKEITANIFLDLD--SLGE 376 (376) T ss_pred CCceEEEeccCceEEEEEEEeeecCcceeEEEEEEecC--CCCC Confidence 21 01111135667777888888778888877653333 2333 No 52 >protein:vir:276 Length: 369 # NCBI annotation: putative tail sheath protein # Family: family:all:669 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536656;genbank:gi:17975134;genbank:GeneID:929090 Probab=98.95 E-value=3.2e-09 Score=67.19 Aligned_cols=345 Identities=12% Similarity=0.122 Sum_probs=176.9 Q ss_pred cceeeeecccc---ccccccccccceeeeccCCe-eeeeeeeeccc--------cccccccchhhhhhhhcCCcceEEEE Q lcl|NC_019538. 274 LSFGNITPSSY---LEYGPQTKDQFAMIVFVGGS-AVESRILSVKE--------NDRDIYGSSIYVDEFFINGYSTFIQG 341 (678) Q Consensus 274 ~~~~~~~~~~~---~~~~~~~~~~~~~~v~~~~~-~~~~~~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~v~~ 341 (678) .+.+.+..... .+..+..+.+|..+ ..|.. ..+..+++.+. +..+.... .-+.....|++..|-.. T Consensus 1 m~~~~V~in~~n~~qg~~~~ver~~lfi-g~g~~~~~~g~~~~~~~~sdld~~lg~~ds~lk-~~v~aa~~naG~~w~a~ 78 (369) T protein:vir:27 1 MAWPTVIIKILNLMNGPIADIECHFLFV-IRGTVSGEVRNLIMVDSTSDLDDVLAEASAEGL-AIVKAAQLNGKQAWTAG 78 (369) T ss_pred CCCCceEEecccccCCCcccccceEEEE-EeccccccccceEEecCccchHhhcCCcChhHH-HHHHHHHhCCCCceEEE Confidence 11111111000 00011111222222 11111 00111111111 11111111 11233444555554322 Q ss_pred ecCCCccccceeeeeccCcCCccccchhHHHhhhhhhhccchhcccccc-ccccccCcccchhHHHHHHHHHHHhc-CCe Q lcl|NC_019538. 342 VAESWPTEYSGILTFGGGNSGNSTASAGDWIEGWDMFSDREHVDVNLFI-AGSCAGEGVEIASTVQKSVAAICDER-QDC 419 (678) Q Consensus 342 ~~~~~~~~~~~~~~l~gg~dg~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~v~~~l~~~~~~~-~~~ 419 (678) +. ++ .+.+++.++++... +.+++..+. ++.. .......+.++....+..+. |-+ T Consensus 79 ~~-----------p~---------~~~~~~~~Av~~a~--~~~s~E~V~v~~p~--t~~a~i~aaq~~a~el~~~~~R~v 134 (369) T protein:vir:27 79 VM-----------IL---------SEEDNWQDAVKKAN--EVSSFEFVVLGFDA--ETKAMIEDAITLRTELKNSLGREV 134 (369) T ss_pred EE-----------Ee---------CCchhHHHHHHhhh--hhCCccEEEEecCc--ccHHHHHHHHHHHHHHHHhcCCeE Confidence 21 11 12345666666443 223332222 1110 11112233344444444443 567 Q ss_pred EEEEccccchhccccccCCHHHHHHHHhcccccccchhhccccccceEEEEcCeEEEecccCCceeEechHHHHHHHHHH Q lcl|NC_019538. 420 LGWISPPREYMVNLPVATAVKKMVEWRRGVTDSGVVVDDNMNIGTTYSSTSANYKLQYDKYNDTNRWIPLSADMAGLCAR 499 (678) Q Consensus 420 ~~i~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~v~d~~~~~~~~~pps~~vAG~~a~ 499 (678) |.++..+... .....-+...+|...+.. ...++.+.+..++.-+.... .-.|.+||.+|. T Consensus 135 ffi~e~~~~~----~~~~~~e~w~dy~a~l~a------l~~g~a~~~V~vv~~~~~~g----------n~~G~~aGRl~n 194 (369) T protein:vir:27 135 GVLCQLPAIN----NDPTNGQTWSEWLADTVD------IPKDVASEYISVVPNVHAAG----------DTLGKYAGRLAN 194 (369) T ss_pred EEEEeccccC----CCccccCCHHHHHHHHHH------HhhccCcccceeeeeecccc----------chHHHHHHHHHh Confidence 7777654211 011122233334332221 12345556666552222211 235778888875 Q ss_pred HhhcCCceECcCCcchhheeecccce-----ecCChhhhhhhhhCCcEEEEEecCC-cEEEeccccCCCCccccceeehh Q lcl|NC_019538. 500 TDTVAQPWMSPAGFNRGQILDVRKLA-----IETRQAHRDELYQNSMNPVVGFPGQ-GFILYGDKTMSLQPTPFDRINVR 573 (678) Q Consensus 500 ~d~~~g~~~sPan~~~~~v~~~~~~~-----~~~~~~e~~~L~~~gIn~i~~~~~~-G~~~wG~rT~~~~~~~~~~i~vr 573 (678) - ..-++.||.-+..+.+.|...+. ..++.+.++.|..+|..+.+.++|+ |+.+-+.|||+...++|+||..+ T Consensus 195 ~--aVsIadsp~RVktG~l~g~~~~p~d~~g~~l~~a~l~aLd~agysvp~~Y~gy~G~Yw~d~~tl~~~gsDYq~iE~~ 272 (369) T protein:vir:27 195 K--EVSIADSPARVQTGSVLGNTELMKDKAGKALDLATLKALESNRIAVPMWYPDYPGQYWTTGRTLDVPGGDYQDIRHI 272 (369) T ss_pred c--ccchhcCcceeeecccccccccccCCCCcccCHHHHHHHHhCCCeEEEeeCCCCceEEeCceEeccCCCCeehhhhh Confidence 2 33468889887777777754332 2356678999999999999999984 88888889998888899999999 Q ss_pred hHHHHHHHHHHHHHHHHhcCCC---CHHHHHHHHHHHHHHHHHHHhcCCeeeeEEEEccCC-CCHHHhhCCEEEEEEEEE Q lcl|NC_019538. 574 RLFNLLKKSISESAKYKLFENN---DAFTRNSFRSEVNSYLDSIKSLGGIYDFRVVCDETN-NTPAVIDRNEFVATILIK 649 (678) Q Consensus 574 R~~~~i~~si~~~~~~~vfepn---~~~~~~~v~~~i~~~L~~l~~~gal~g~~V~~d~~~-nt~~~i~~G~~~~~i~~~ 649 (678) |..+-+.|.++..+-..+..|- ++.-.+..+..++.=|++|.+.+ + -++|...++. -+..-....++.|-+-+. T Consensus 273 RVvdKa~R~vR~~Ai~~i~Dr~lnstp~sia~~~~~~~~pLr~M~ks~-f-pgei~~P~d~dI~i~w~~k~~V~I~~~vr 350 (369) T protein:vir:27 273 RVAMKAARKVRIRAIARIADRTLNSTPQSIAAAKLYFTQDLRTMALTG-V-PGEIYPPEDEDIQIKWVNSTDVEIYMSVQ 350 (369) T ss_pred hHHHHHHHHHHHHHHHHhcCcccccChhHHHHHHHHHhhHHHHHHhhc-C-CeEEecCCCCceEEEeeccceEEEEEEEe Confidence 9999999988877776666543 45556667778888888886653 2 2244433211 000111455778888888 Q ss_pred ecCCceEEEEEEEEeecCc Q lcl|NC_019538. 650 PARSINYVSLNFSAVGTSA 668 (678) Q Consensus 650 p~~p~e~i~~~~~~~~~~~ 668 (678) |.--...|+..|.-.-... T Consensus 351 P~~~pk~it~~I~ldl~~~ 369 (369) T protein:vir:27 351 PYECPVKITIAISVKQGDY 369 (369) T ss_pred eccCCceEEEEEEEeccCC Confidence 8888889998887665554 No 53 >protein:vir:489 Length: 498 # NCBI annotation: putative sheath protein # Family: family:all:369 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543096;swissprot:trembl:q8w623;genbank:gi:18249908;uniprot:Q8W623;genbank:GeneID:929698 Probab=98.91 E-value=9.3e-09 Score=64.62 Aligned_cols=441 Identities=10% Similarity=0.039 Sum_probs=209.9 Q ss_pred CceecCceEEEEcCCCcccccCCccceeEEeccc---CCCCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHHhcC-C Q lcl|NC_019538. 1 MALLSPGVESKENNMQTTIARSSTGRAALAGKFQ---WGPAYQISQLVSETDLIDRFGRPDNQTADSVLSAINFLKYG-N 76 (678) Q Consensus 1 ~~~~~PGVyveEv~~~~~i~~v~tsv~afvG~~~---~Gpv~~pv~i~s~~~~~~~FG~~~~~~~~~~~v~~~f~ngG-~ 76 (678) =.+..||+|+| ++.+....+..+.-.-+||..- ..|.++|++|+|..|-...||.- +.+.-+++.|..+.- . T Consensus 10 ~~iRvP~~y~E-~dns~A~~~~~~qrvLiiGq~la~gt~~~~~~v~v~s~~~a~~~fG~G---S~l~~M~~a~~~~n~~~ 85 (498) T protein:vir:48 10 SDTLVPLFYAE-MDNSAANTAVTSAPALLIGHASNDAAIEVNSLVLMPSADYARQICGAG---SQLARMVDVYRQTDPFG 85 (498) T ss_pred cccccceEEEE-EecCCCccccCCcceEEEeecCccccccccceEEecCHHHHHHhcCcc---cHHHHHHHHHHHhCCCc Confidence 34568999998 4434444455556788888744 34789999999999999999965 444455677776544 7 Q ss_pred eEEEEEcCCcccccccccccccceeeeecccccccccceeeecccccccccccccccccccccceeeeccccccccccee Q lcl|NC_019538. 77 DLRTVRILDEDTARNSSPFFETIDYTITSPGVDYRIGDDVKILQNGATITTGKVSALNSVGGITFVRFSTAEVVKKAKEL 156 (678) Q Consensus 77 ~~~vvRv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~a~~~ 156 (678) ++|++-+.+....++++ .+..+... .. .| .+.+-. .+......+...+ T Consensus 86 ~l~~i~~~D~ag~aA~g----~it~tg~a-t~---~G-~l~l~I---------------gg~~v~v~V~~gd-------- 133 (498) T protein:vir:48 86 ELYVIAVPEARGAAATV----RVTVTGEA-EE---SG-TLSLYV---------------GRSSVQVPVVNGD-------- 133 (498) T ss_pred eeEEEeeCCcccceeEE----EEEecccc-cC---Cc-eEEEEE---------------CCEEEEEeecCCC-------- Confidence 89999997632222111 11111110 00 01 011000 0000011000000 Q ss_pred ecccccccceeeeeeecccccccccceeeeee--ccccceeeecccccccccccccccccchhccccccccceeeecccc Q lcl|NC_019538. 157 NDYPALQNGWQIQFTSGGPGSGQSATAVLNGI--RQDSKIYIRNDEYSRESLLRRDETTETYIDMCESYGIPVVASRYAG 234 (678) Q Consensus 157 ~~~~~~~~~~~~~~~s~~~~~g~~a~~~~~~~--~~~~~i~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~i~A~~~G 234 (678) +....+. .....+ ..+..+... . ..+...++|+..| T Consensus 134 --------------Taa~vA~-----al~aai~a~~~lPVTA~------------~-----------~~~~VtlTAr~kG 171 (498) T protein:vir:48 134 --------------DATAVAT-----AIKEAVNGVITLPFAAS------------S-----------DAGVVTLTARHKG 171 (498) T ss_pred --------------CHHHHHH-----HHHHHHhCCCCcceEEE------------e-----------cCcEEEEEeeecc Confidence 0000000 000000 001111000 0 0011234677778 Q ss_pred ccccceeEEeccccccccccccccccccccccccccccccceeeeeccccccccccccccceeeeccCCeeeeeeeeecc Q lcl|NC_019538. 235 LTGDNIQVAFIAYKDYYKFGVDGKISSVNTVNLKTFPSGLSFGNITPSSYLEYGPQTKDQFAMIVFVGGSAVESRILSVK 314 (678) Q Consensus 235 ~~gn~i~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~ 314 (678) ..||.|.+.+.-+... .. +.. T Consensus 172 ~~GN~I~l~~~~~~~~-~g-----------------------------e~~----------------------------- 192 (498) T protein:vir:48 172 LYGNELPVCLNYYGSG-GG-----------------------------EIL----------------------------- 192 (498) T ss_pred cccccceeeeeeccCc-cc-----------------------------ccc----------------------------- Confidence 7888776643211100 00 000 Q ss_pred ccccccccchhhhhhhhcCCcceEEEEecCCCccccceeeeeccCcCCccccchhHHHhhhhhhhccchhcccccccccc Q lcl|NC_019538. 315 ENDRDIYGSSIYVDEFFINGYSTFIQGVAESWPTEYSGILTFGGGNSGNSTASAGDWIEGWDMFSDREHVDVNLFIAGSC 394 (678) Q Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~l~gg~dg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 394 (678) ..+-.+ ....++||.. ..|+..++..+..+. .+.++++.. T Consensus 193 ------------------p~Glt~-------------~itamsgGag------~PDia~aLaal~~~~---~~~I~~p~~ 232 (498) T protein:vir:48 193 ------------------PAGLQV-------------VTEAGTAGSG------APDLTAAVAAMGDEA---FDFIGLPFN 232 (498) T ss_pred ------------------cceeeE-------------EEEcccCCcc------CcchHHHHHhhccCC---ccEEEEeec Confidence 000000 0011223321 113333444443332 233444332 Q ss_pred ccCcccchhHHHHHHHHHHH---------hcCCeEEEEccccchhccccccCCHHHHHHHHhcccccccchhhccccccc Q lcl|NC_019538. 395 AGEGVEIASTVQKSVAAICD---------ERQDCLGWISPPREYMVNLPVATAVKKMVEWRRGVTDSGVVVDDNMNIGTT 465 (678) Q Consensus 395 ~~~~~~~~~~v~~~l~~~~~---------~~~~~~~i~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~ 465 (678) +... ..++.+|++ +++++.++.- ...+..++.+|-.. .++. T Consensus 233 ------D~as-l~al~~~L~~~sgRw~~~~q~~g~~~~a----------~~gT~~~l~t~g~~-------------~N~~ 282 (498) T protein:vir:48 233 ------DAAS-INMMMTEMNDSSGRWSYARQLYGHVYTA----------KLGTLSELVNAGDM-------------HNQQ 282 (498) T ss_pred ------CHHH-HHHHHHHHhhhhhhhhHHhhcCeEEEEe----------ccCCHHHHHHhhhc-------------cCCc Confidence 2222 233444443 2334433322 12356666666543 3344 Q ss_pred eEEEEcCeEEEecccCCceeEechHHHH---HHHHH---HHhhcCCceECcCCcchhheeecccceecCChhhhhhhhhC Q lcl|NC_019538. 466 YSSTSANYKLQYDKYNDTNRWIPLSADM---AGLCA---RTDTVAQPWMSPAGFNRGQILDVRKLAIETRQAHRDELYQN 539 (678) Q Consensus 466 ~~~~~~p~~~v~d~~~~~~~~~pps~~v---AG~~a---~~d~~~g~~~sPan~~~~~v~~~~~~~~~~~~~e~~~L~~~ 539 (678) |..+.+ + ++ ...-|+.... |++.| +.|..| .--...+.||..+ .+...++..|+|.|.-+ T Consensus 283 ~it~~~-----~---~~-~~~~p~~~~AAa~a~~aA~~l~~DPAr----PLqtl~L~Gi~~p-~~~~r~~~~ern~LL~~ 348 (498) T protein:vir:48 283 HITLAG-----Y---EK-ETQSPVDELVASRLAREAVFIRNDPAR----PTQTGELVGMLPA-PKGKRFIMTEQQTLLSH 348 (498) T ss_pred eEEEEe-----c---CC-CCCChHHHHHHHHHHHHHHhhhccccc----cccceeeeccccC-CchhcCChHHHHHHHhc Confidence 554432 1 11 1112333323 33332 334322 2223444555433 34566789999999999 Q ss_pred CcEEEEEecCCcEEEeccccC-----C-CCccccceeehhhHHHHHHHHHHHHHHH-HhcCCCCHH-----------HHH Q lcl|NC_019538. 540 SMNPVVGFPGQGFILYGDKTM-----S-LQPTPFDRINVRRLFNLLKKSISESAKY-KLFENNDAF-----------TRN 601 (678) Q Consensus 540 gIn~i~~~~~~G~~~wG~rT~-----~-~~~~~~~~i~vrR~~~~i~~si~~~~~~-~vfepn~~~-----------~~~ 601 (678) ||.++.- .++-..+--..|. . ..|..|..|+..|+.+|+++.++..+.. |--+..-++ |-. T Consensus 349 Gist~~V-~~G~V~I~R~ITTY~~n~~G~~D~syLDi~T~~tl~yvr~~~r~~i~~kfpR~KLa~dg~~~~~gq~IvTp~ 427 (498) T protein:vir:48 349 GVATAYV-EGGTLRIQRSVTTYKKNAYGVADNSYLDSETLHTSAYVLRKLKSVITSKYGRHKLANDGTRFGPGQAIVTPA 427 (498) T ss_pred CcceEEE-cCCeEEEEeeeeeeeecCCCCcchhhhhhhhHHHHHHHHHHHHHHhhhhcCCceecccCcccCCCCcccchH Confidence 9999966 5544666555554 1 2234599999999999999999987763 222222222 678 Q ss_pred HHHHHHHHHHHHHHhcCCeeee---E--EEEccCCCCHHHhhCCEEEEEEEEEecCCceEEE----EEEEEeecCc Q lcl|NC_019538. 602 SFRSEVNSYLDSIKSLGGIYDF---R--VVCDETNNTPAVIDRNEFVATILIKPARSINYVS----LNFSAVGTSA 668 (678) Q Consensus 602 ~v~~~i~~~L~~l~~~gal~g~---~--V~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~----~~~~~~~~~~ 668 (678) .||..+-.-+++|..+|-+..+ + +.|.++-+- ..|+.+.+-...+-+..-+- |+++-....+ T Consensus 428 ~ir~eli~~y~~le~~given~~~~~~~LiVerd~~d-----pnRln~~~p~d~vn~L~V~A~~~~f~lq~~~~~~ 498 (498) T protein:vir:48 428 VIKGELLATYRQMERAGIVENYDLFKQYLIVERDADN-----PNRLNTLFPPDYVNQLRVFAVVNQFRLQYSEESA 498 (498) T ss_pred HHHHHHHHHHHhhhhhccccChhhhcceeEEEECCCC-----CcEEEEEecccccCchhhhhhhhhhhhhhhhcCC Confidence 8999999999999999999873 2 334322211 24566555444444433222 2222111111 No 54 >protein:vir:4517 Length: 498 # NCBI annotation: tail sheath protein # Family: family:all:369 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599043;genbank:gi:19549001;genbank:GeneID:935236 Probab=98.88 E-value=1.4e-08 Score=63.70 Aligned_cols=445 Identities=11% Similarity=0.044 Sum_probs=209.3 Q ss_pred CceecCceEEEEcCCCcccccCCccceeEEeccc---CCCCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHHhcC-C Q lcl|NC_019538. 1 MALLSPGVESKENNMQTTIARSSTGRAALAGKFQ---WGPAYQISQLVSETDLIDRFGRPDNQTADSVLSAINFLKYG-N 76 (678) Q Consensus 1 ~~~~~PGVyveEv~~~~~i~~v~tsv~afvG~~~---~Gpv~~pv~i~s~~~~~~~FG~~~~~~~~~~~v~~~f~ngG-~ 76 (678) =.+..||+|+| ++.+....+....-.-+||..- ..+.++|++|+|..|-...||.- +.+.-+++.|..+.- . T Consensus 10 ~~iRvP~~y~E-~dns~A~~~~~~q~vLiiGq~la~gs~~~~~~v~v~s~~~a~~lfG~G---Sml~~M~~a~~~~n~~~ 85 (498) T protein:vir:45 10 SNTLVPLFYAE-MDNQAANTAQDSGASLLIGHANNGAEIVANSLVLMPSADYARQICGAG---SQLARMVEAYRQTDPFG 85 (498) T ss_pred cccccCeEEEE-EeCCCCCCCCCCcceEEEEecCCccccccceeEEecCHHHHHHhcCcC---cHHHHHHHHHHHhCCcc Confidence 33568999999 5433333444455678888753 34779999999999999999975 445556777777544 7 Q ss_pred eEEEEEcCCcccccccccccccceeeeecccccccccceeeecccccccccccccccccccccceeeeccccccccccee Q lcl|NC_019538. 77 DLRTVRILDEDTARNSSPFFETIDYTITSPGVDYRIGDDVKILQNGATITTGKVSALNSVGGITFVRFSTAEVVKKAKEL 156 (678) Q Consensus 77 ~~~vvRv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~a~~~ 156 (678) ++|++-+.+....++++ .+..+... .. .| .+.+-. .+..+...+... T Consensus 86 ~l~~i~~~d~aG~aA~g----~it~tg~a-t~---~G-~l~l~I---------------gg~~v~v~V~~g--------- 132 (498) T protein:vir:45 86 ELYVIAVPEATGAAATV----TLTVTGEA-TE---SG-TVNVYV---------------GRTRVQAPVTNG--------- 132 (498) T ss_pred eEEEEeeCCcccceeEE----EEEeeccc-CC---Cc-EEEEEE---------------CCEEEEEEecCC--------- Confidence 89999886532211111 11111100 00 00 000000 000000000000 Q ss_pred ecccccccceeeeeeecccccccccceeeeee--ccccceeeecccccccccccccccccchhccccccccceeeecccc Q lcl|NC_019538. 157 NDYPALQNGWQIQFTSGGPGSGQSATAVLNGI--RQDSKIYIRNDEYSRESLLRRDETTETYIDMCESYGIPVVASRYAG 234 (678) Q Consensus 157 ~~~~~~~~~~~~~~~s~~~~~g~~a~~~~~~~--~~~~~i~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~i~A~~~G 234 (678) ++....+ ......+ ..+..+... . ..+...++|+..| T Consensus 133 -------------dTaa~vA-----~al~aaina~~~lPVTA~------------~-----------~~~~VtlTAr~kG 171 (498) T protein:vir:45 133 -------------DNVTTIA-----SSIQDAINAVPTLPFTAS------------S-----------SAGVVTLTARHKG 171 (498) T ss_pred -------------CCHHHHH-----HHHHHHHhCCCCCceEEE------------e-----------cCceEEEEeeccC Confidence 0000000 0000000 001111000 0 0012334677778 Q ss_pred ccccceeEEeccccccccccccccccccccccccccccccceeeeeccccccccccccccceeeeccCCeeeeeeeeecc Q lcl|NC_019538. 235 LTGDNIQVAFIAYKDYYKFGVDGKISSVNTVNLKTFPSGLSFGNITPSSYLEYGPQTKDQFAMIVFVGGSAVESRILSVK 314 (678) Q Consensus 235 ~~gn~i~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~ 314 (678) ..||.|.+.+.-+..... +.. T Consensus 172 ~~GN~I~l~~~~~~~~~g------------------------------e~~----------------------------- 192 (498) T protein:vir:45 172 LCGNEIPVSLNYYGFGGG------------------------------EVL----------------------------- 192 (498) T ss_pred ccccceeEEEeecccccc------------------------------ccc----------------------------- Confidence 888877664321110000 000 Q ss_pred ccccccccchhhhhhhhcCCcceEEEEecCCCccccceeeeeccCcCCccccchhHHHhhhhhhhccchhcccccccccc Q lcl|NC_019538. 315 ENDRDIYGSSIYVDEFFINGYSTFIQGVAESWPTEYSGILTFGGGNSGNSTASAGDWIEGWDMFSDREHVDVNLFIAGSC 394 (678) Q Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~l~gg~dg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 394 (678) ..+-+ .....++||. ...|+..++..+..+. .+.++++.. T Consensus 193 ------------------p~Glt-------------~~itamagGa------g~PD~a~alaal~~~~---~~~I~~p~~ 232 (498) T protein:vir:45 193 ------------------PAGVQ-------------IAVATGTAGT------GAPVLTGAVAAMADEP---FDYIGLPFN 232 (498) T ss_pred ------------------cceee-------------EEEEccCCCc------cCchhHHHHHHhccCC---ccEEEEeeC Confidence 00000 0001122232 1123334444444332 233444332 Q ss_pred ccCcccchhHHHHHHHHHHH---------hcCCeEEEEccccchhccccccCCHHHHHHHHhcccccccchhhccccccc Q lcl|NC_019538. 395 AGEGVEIASTVQKSVAAICD---------ERQDCLGWISPPREYMVNLPVATAVKKMVEWRRGVTDSGVVVDDNMNIGTT 465 (678) Q Consensus 395 ~~~~~~~~~~v~~~l~~~~~---------~~~~~~~i~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~ 465 (678) +... ..++.+|++ +++++.++.-. ..+..++.+|-.. +++. T Consensus 233 ------D~as-L~al~~~L~~~sgRw~~~~q~~g~~~~a~----------~gT~~~l~t~g~~-------------~N~~ 282 (498) T protein:vir:45 233 ------DTAS-VNTLVTEMNDTSGRWSYARQLYGHVYTAK----------TGTLSELVNAGDQ-------------FNQQ 282 (498) T ss_pred ------CHHH-HHHHHHHHhhhhhhhhHHhhcCeEEEEec----------cCCHHHHHHhhhc-------------cCCc Confidence 2222 233444443 23344433221 2356677666543 3444 Q ss_pred eEEEEcCeEEEecccCCceeEechHHHHHHHHHHHhh--cCCceECcCCcchhheeecccceecCChhhhhhhhhCCcEE Q lcl|NC_019538. 466 YSSTSANYKLQYDKYNDTNRWIPLSADMAGLCARTDT--VAQPWMSPAGFNRGQILDVRKLAIETRQAHRDELYQNSMNP 543 (678) Q Consensus 466 ~~~~~~p~~~v~d~~~~~~~~~pps~~vAG~~a~~d~--~~g~~~sPan~~~~~v~~~~~~~~~~~~~e~~~L~~~gIn~ 543 (678) |..+.+.. + ...-||-...|++.++.-. +..|-..--...+.||..+ .+...++..|+|.|.-+||.+ T Consensus 283 ~it~~~~~-----~----~~~sp~~~~AAa~aa~~A~~l~~DPArPL~tl~L~Gi~~p-~~~~r~~~~ern~LL~~Gist 352 (498) T protein:vir:45 283 HITLAGYE-----K----ETQTPADELAASRTARAAVFIRNDPARPTQTGELVGMLPA-PKGKRFTMTEQQTLLSHGVAT 352 (498) T ss_pred eEEEEecC-----C----CCCChHHHHHHHHHHHHHHHhhcccccccCceeecceecC-CchhcCChHHHHHHHhCCcce Confidence 55443211 0 1112333333333333321 2233222233455555433 345678899999999999999 Q ss_pred EEEecCCcEEEeccccC-----C-CCccccceeehhhHHHHHHHHHHHHHHHH-hcCCCCHH-----------HHHHHHH Q lcl|NC_019538. 544 VVGFPGQGFILYGDKTM-----S-LQPTPFDRINVRRLFNLLKKSISESAKYK-LFENNDAF-----------TRNSFRS 605 (678) Q Consensus 544 i~~~~~~G~~~wG~rT~-----~-~~~~~~~~i~vrR~~~~i~~si~~~~~~~-vfepn~~~-----------~~~~v~~ 605 (678) +.--.|+ ..+--..|. . ..|..|..|+..|+.+|+++.++..+... --+..-.+ |-..||. T Consensus 353 ~~V~~G~-V~I~R~ITTY~~n~~G~~D~syLDi~T~~tl~yvr~~~r~~i~~kfpR~KLa~dg~~~~~gq~IvTp~~ir~ 431 (498) T protein:vir:45 353 AYVESGV-LRIQRDVTTYRKNAYGVADNSYLDSETLHTSAYVLRKLKSVITSKYGRHKLASDGTRFGPGQAIVTPAVIKG 431 (498) T ss_pred EEEcCCe-EEEEeeeeeeeecCCCCcchhhhhhhhHHHHHHHHHHHHHHhhhhcCCeeecccCcccCCCCcccchHHHHH Confidence 9764332 444444443 1 22346999999999999999999877643 22222222 6788999 Q ss_pred HHHHHHHHHHhcCCeeee---E--EEEccCCCCHHHhhCCEEEEEEEEEecCCceEE----EEEEEEeecCc Q lcl|NC_019538. 606 EVNSYLDSIKSLGGIYDF---R--VVCDETNNTPAVIDRNEFVATILIKPARSINYV----SLNFSAVGTSA 668 (678) Q Consensus 606 ~i~~~L~~l~~~gal~g~---~--V~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i----~~~~~~~~~~~ 668 (678) .+-+-+++|..+|-+..+ + +.|.++-+- ..|+.+.+-...+-+..-+ .|+++-....+ T Consensus 432 ell~~y~~le~~givEn~~~~~~~LiVerd~~d-----pnRln~~~p~d~vn~L~V~A~~~~f~lq~~~~~~ 498 (498) T protein:vir:45 432 ELLATYRQLERAGIVENYELFKQYLVVERDASV-----PNRLNTLFPPDYVNQLRVFAVVNQFRLQYSEESA 498 (498) T ss_pred HHHHHHHhhhhhccccChhhhcceeEEEECCCC-----CcEEEEEecccccCchhhhhhhhhhheehhhcCC Confidence 999999999999999873 2 334322211 2456655544444443322 22222112211 No 55 >protein:vir:4463 Length: 498 # NCBI annotation: Tail Sheath protein # Family: family:all:369 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700386;genbank:gi:23505458;genbank:GeneID:955665 Probab=98.78 E-value=4.5e-08 Score=60.89 Aligned_cols=449 Identities=11% Similarity=0.056 Sum_probs=205.9 Q ss_pred CceecCceEEEEcCCCcccccCCccceeEEeccc---CCCCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHHhcC-C Q lcl|NC_019538. 1 MALLSPGVESKENNMQTTIARSSTGRAALAGKFQ---WGPAYQISQLVSETDLIDRFGRPDNQTADSVLSAINFLKYG-N 76 (678) Q Consensus 1 ~~~~~PGVyveEv~~~~~i~~v~tsv~afvG~~~---~Gpv~~pv~i~s~~~~~~~FG~~~~~~~~~~~v~~~f~ngG-~ 76 (678) =.+..||+|+| ++.+..-......-.-+||..- ..|.++|++|+|..|-...||.- +.+.-+++.|..+.- . T Consensus 10 ~~iRvP~~y~E-~dns~A~~~~~~q~vLiiGq~la~gs~~~~~~v~v~s~~~a~~~fG~G---Sml~~M~~a~~~~n~~~ 85 (498) T protein:vir:44 10 SDTRVPLFYAE-MDNSAANTARDSGASLLIGHASNDASIAVNSLVLVSSVDYARQICGAG---SQLARMVGAYRKTDPFG 85 (498) T ss_pred cccccCeEEEE-EeCCCCCCCcCCcceEEEEecCcccccccceeEeecCHHHHHHhcCcc---cHHHHHHHHHHHhCCCc Confidence 23458999999 4433222223334577888744 34789999999999999999965 445556777777654 7 Q ss_pred eEEEEEcCCcccccccccccccceeeeecccccccccceeeecccccccccccccccccccccceeeeccccccccccee Q lcl|NC_019538. 77 DLRTVRILDEDTARNSSPFFETIDYTITSPGVDYRIGDDVKILQNGATITTGKVSALNSVGGITFVRFSTAEVVKKAKEL 156 (678) Q Consensus 77 ~~~vvRv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~a~~~ 156 (678) ++|++-+.+....++++ .+..+..... .| .+.+-. .+......+... T Consensus 86 ~l~~i~~~D~aG~aAtg----~it~tg~at~----~G-~l~l~I---------------gg~~v~v~V~~g--------- 132 (498) T protein:vir:44 86 ELYVIAVPESTGAAATV----ALTVTGEATE----TG-TVNVYT---------------GRTRVQAPVTSG--------- 132 (498) T ss_pred eeEEEecCCcccceeEE----EEEeecccCC----Cc-EEEEEE---------------CCEEEEEEecCC--------- Confidence 89999886532211111 1111111000 00 000000 000000000000 Q ss_pred ecccccccceeeeeeecccccccccceeeeeec--cccceeeecccccccccccccccccchhccccccccceeeecccc Q lcl|NC_019538. 157 NDYPALQNGWQIQFTSGGPGSGQSATAVLNGIR--QDSKIYIRNDEYSRESLLRRDETTETYIDMCESYGIPVVASRYAG 234 (678) Q Consensus 157 ~~~~~~~~~~~~~~~s~~~~~g~~a~~~~~~~~--~~~~i~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~i~A~~~G 234 (678) ++....+ ......++ .+..+... . ..+...++|+..| T Consensus 133 -------------dTaa~vA-----~al~aaina~~~lPVTA~------------~-----------~~~~vtlTAr~kG 171 (498) T protein:vir:44 133 -------------DDAAAVA-----VSIKDAVNANPDLPFTAT------------S-----------EAGVVTLTARHKG 171 (498) T ss_pred -------------CCHHHHH-----HHHHHHHhCCCCCceEEe------------e-----------ccceEEEEEeccC Confidence 0000000 00000000 01111000 0 0011234677777 Q ss_pred ccccceeEEeccccccccccccccccccccccccccccccceeeeeccccccccccccccceeeeccCCeeeeeeeeecc Q lcl|NC_019538. 235 LTGDNIQVAFIAYKDYYKFGVDGKISSVNTVNLKTFPSGLSFGNITPSSYLEYGPQTKDQFAMIVFVGGSAVESRILSVK 314 (678) Q Consensus 235 ~~gn~i~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~ 314 (678) ..||.|.+.+.-+... .. ...+ T Consensus 172 ~~GN~I~l~~~~~~~~-~g--------------e~~p------------------------------------------- 193 (498) T protein:vir:44 172 LYGNEIPVTLNYYGFG-GG--------------EVLP------------------------------------------- 193 (498) T ss_pred cccCcceEEEeeccCc-cc--------------cccc------------------------------------------- Confidence 7777776643211000 00 0000 Q ss_pred ccccccccchhhhhhhhcCCcceEEEEecCCCccccceeeeeccCcCCccccchhHHHhhhhhhhccchhcccccccccc Q lcl|NC_019538. 315 ENDRDIYGSSIYVDEFFINGYSTFIQGVAESWPTEYSGILTFGGGNSGNSTASAGDWIEGWDMFSDREHVDVNLFIAGSC 394 (678) Q Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~l~gg~dg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 394 (678) .+-. .....++||. ...|+..++..+..+. .+.++++. T Consensus 194 -------------------~Glt-------------~titamsgGa------g~PDia~alaal~~~~---~~~i~~p~- 231 (498) T protein:vir:44 194 -------------------AGVN-------------ITVASGVKGA------GAPALNDAVAAMGDEP---FDYIGLPF- 231 (498) T ss_pred -------------------ccee-------------EEEEcccCCc------cCchhHHHHHhhccCC---ccEEEEee- Confidence 0000 0001122222 1223444444444332 23344432 Q ss_pred ccCcccchhHHHHHHHHHHHh---------cCCeEEEEccccchhccccccCCHHHHHHHHhcccccccchhhccccccc Q lcl|NC_019538. 395 AGEGVEIASTVQKSVAAICDE---------RQDCLGWISPPREYMVNLPVATAVKKMVEWRRGVTDSGVVVDDNMNIGTT 465 (678) Q Consensus 395 ~~~~~~~~~~v~~~l~~~~~~---------~~~~~~i~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~ 465 (678) ++... ..+|.+|++. +++..++... ..+..++.+|-.. +++. T Consensus 232 -----~D~as-l~al~~~L~~~sgRw~~~~q~~g~~~~a~----------~gT~a~l~t~g~~-------------~N~~ 282 (498) T protein:vir:44 232 -----NDTAS-VNSMATEMNDSSGRWSYVRQLYGHVYTAK----------TGTLSELVAAGDQ-------------FNLQ 282 (498) T ss_pred -----cCHHH-HHHHHHHHhhhhcchHHHhhcCeEEEEec----------cCCHHHHHHhhhc-------------cCCc Confidence 22222 2334444432 2333333221 2346666666543 3344 Q ss_pred eEEEEcCeEEEecccCCceeEechHHHHHHHHHHHhh--cCCceECcCCcchhheeecccceecCChhhhhhhhhCCcEE Q lcl|NC_019538. 466 YSSTSANYKLQYDKYNDTNRWIPLSADMAGLCARTDT--VAQPWMSPAGFNRGQILDVRKLAIETRQAHRDELYQNSMNP 543 (678) Q Consensus 466 ~~~~~~p~~~v~d~~~~~~~~~pps~~vAG~~a~~d~--~~g~~~sPan~~~~~v~~~~~~~~~~~~~e~~~L~~~gIn~ 543 (678) |..+.+... ...-||-...|++.++.-. +..|-..--...+.||..+ .+...++..|+|.|.-+||.+ T Consensus 283 ~it~~~~~~---------~~~sp~~~~AAa~a~~aA~~l~~DPArPL~tl~L~Gi~~p-~~~~r~~~~ern~LL~~Gist 352 (498) T protein:vir:44 283 HITLAGYEK---------DTQTPADELAASRTARAAVFIRNDPARPTQTGELVDMLPA-PKGKRFTTTEQQTLLSHGVAT 352 (498) T ss_pred eEEEEecCC---------CCCCHHHHHHHHHHHHHHHHhhcccccccCceeecccccC-CchhcCChHHHHHHHhcCcce Confidence 444432110 0112333333333333221 2233222233444555432 345678899999999999999 Q ss_pred EEEecCCcEEEeccccC-----C-CCccccceeehhhHHHHHHHHHHHHHHH-HhcCCCCH-----------HHHHHHHH Q lcl|NC_019538. 544 VVGFPGQGFILYGDKTM-----S-LQPTPFDRINVRRLFNLLKKSISESAKY-KLFENNDA-----------FTRNSFRS 605 (678) Q Consensus 544 i~~~~~~G~~~wG~rT~-----~-~~~~~~~~i~vrR~~~~i~~si~~~~~~-~vfepn~~-----------~~~~~v~~ 605 (678) +.--.|+ ..+--..|. . ..|..|..|+..|+.+|+++.++..+.. |--+..-+ -+-..||. T Consensus 353 ~~V~~G~-V~I~R~ITTY~~n~~G~~D~syLDi~T~~tl~yvr~~~r~~i~~kfpR~KLa~d~~~~~~gq~IvTp~~ir~ 431 (498) T protein:vir:44 353 AYVESGV-LRIQRDITTYRKNAYGVADNSYLDSETLHTSAYVLRRLKSVITSKYGRHKLANDGTRFGSGQAIVTPAVIRG 431 (498) T ss_pred EEEcCCe-EEEEeeeeeeeecCCCCcchhhhhhhhHHHHHHHHHHHHHHhhhhcCCcccccCCcccCCCcccccHHHHHH Confidence 9764332 444444443 1 2234599999999999999999988763 22222111 26788999 Q ss_pred HHHHHHHHHHhcCCeeee---E--EEEccCCCCHHHhhCCEEEEEEEEEecCCceEEEEEEEEeecCceeeeccc Q lcl|NC_019538. 606 EVNSYLDSIKSLGGIYDF---R--VVCDETNNTPAVIDRNEFVATILIKPARSINYVSLNFSAVGTSANFDELVG 675 (678) Q Consensus 606 ~i~~~L~~l~~~gal~g~---~--V~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~~ 675 (678) .+-+-+++|..+|-+..+ + +.|.++-+ +..|+.+.+-...+-+..-+-.+++-. ..++|--+ T Consensus 432 eli~~y~~le~~givEn~~~~~~~LiVerd~~-----dpnRln~~~p~d~vn~L~V~A~~~~f~---lq~~~~~~ 498 (498) T protein:vir:44 432 ELGSTYRQMEREGIVENFDLFQQHLIVERNAN-----DSNRLDVLFPPDYVNQLRVFAVLNQFR---LQYSEEAA 498 (498) T ss_pred HHHHHHHhhhhhccccChhhhcceeEEEECCC-----CCcEEEEEecccccCchhhhhhhhhhh---hhhhhhcC Confidence 999999999999999873 2 33432211 124566655544444443222111100 01111111 No 56 >protein:vir:78782 Length: 370 # NCBI annotation: putative tail sheath protein # Family: family:all:669 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285652;genbank:gi:148727158;genbank:GeneID:5220132 Probab=98.75 E-value=4.4e-08 Score=60.91 Aligned_cols=346 Identities=10% Similarity=0.041 Sum_probs=169.4 Q ss_pred ccccceeEEeccccccccccccccccccccccccccccccceeeeeccccccccccccccceeeeccCCeeeeeeeeecc Q lcl|NC_019538. 235 LTGDNIQVAFIAYKDYYKFGVDGKISSVNTVNLKTFPSGLSFGNITPSSYLEYGPQTKDQFAMIVFVGGSAVESRILSVK 314 (678) Q Consensus 235 ~~gn~i~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~ 314 (678) -| .........+. .+..+..+.+|..+ ..+..-. ..+++.+ T Consensus 1 ~~--------------------~~v~vn~~n~~-----------------~g~~~~~er~~lfi-g~~~~~~-g~~~~~~ 41 (370) T protein:vir:78 1 MW--------------------PYVQIYNLNQM-----------------QGPVTEVERHLLFI-GSAASNT-GKLLSLN 41 (370) T ss_pred CC--------------------ceEEEeecccc-----------------CCCcCccceeEEEE-ecccccc-cceEeec Confidence 00 00000000000 01111112222222 1111100 0111111 Q ss_pred c--------cccccccchhhhhhhhcCCcceEEEEecCCCccccceeeeeccCcCCccccchhHHHhhhhhhhccchhcc Q lcl|NC_019538. 315 E--------NDRDIYGSSIYVDEFFINGYSTFIQGVAESWPTEYSGILTFGGGNSGNSTASAGDWIEGWDMFSDREHVDV 386 (678) Q Consensus 315 ~--------~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~l~gg~dg~~~~~~~~~~~~~~~~~~~~~~~~ 386 (678) . +..+..... -+.....|++..|-... .++ .+..++.++++.... .+++ T Consensus 42 ~~sdld~~l~~~ds~lk~-~v~aa~~naG~~~~~~~-----------~p~---------~~~~d~~~Av~~a~~--~~s~ 98 (370) T protein:vir:78 42 AQSDFDQLLGAADSELKA-NLLAARDNAGQNWSAAA-----------YVL---------PTDKPWLDAARDAQQ--TQSF 98 (370) T ss_pred CccCHHHhcCCcChhHHH-HHHHHHhCCCCceEEEE-----------EEe---------cCchhHHHHHHHHHh--hCCc Confidence 1 111111111 12233445554442211 111 133467777765532 2222 Q ss_pred cccc-ccccccCcccchhHHHHHHHHHHHhc-CCeEEEEccccchhccccccCCHHHHHHHHhcccccccchhhcccccc Q lcl|NC_019538. 387 NLFI-AGSCAGEGVEIASTVQKSVAAICDER-QDCLGWISPPREYMVNLPVATAVKKMVEWRRGVTDSGVVVDDNMNIGT 464 (678) Q Consensus 387 ~~~~-~~~~~~~~~~~~~~v~~~l~~~~~~~-~~~~~i~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s 464 (678) ..+. ++.. .+-....++++....+..+. |-++.++..+... .++ ...+|...+.. ...++++ T Consensus 99 E~V~v~~~~--s~~a~~~a~~~~a~el~n~~~Rpv~file~~~~~-----~~e---~w~~y~~~l~a------l~~gia~ 162 (370) T protein:vir:78 99 EGVVVLGQE--WHQAAINAAHALNQELIAKWGRWQFMLLAVPAIA-----DEQ---DWATYEAELAT------LQDGIAA 162 (370) T ss_pred cEEEEecCc--chHHHHHHHHHHHHHHHHhcCCeEEEEEeecCCC-----CcC---CHHHHHHHHHH------hhhcccc Confidence 2221 1110 01112223333333333444 5677777765321 222 33333322211 1223444 Q ss_pred ceEEEEcCeEEEecccCCceeEechHHHHHHHHHHHhhcCCceECcCCcchhheeecccc-----eecCChhhhhhhhhC Q lcl|NC_019538. 465 TYSSTSANYKLQYDKYNDTNRWIPLSADMAGLCARTDTVAQPWMSPAGFNRGQILDVRKL-----AIETRQAHRDELYQN 539 (678) Q Consensus 465 ~~~~~~~p~~~v~d~~~~~~~~~pps~~vAG~~a~~d~~~g~~~sPan~~~~~v~~~~~~-----~~~~~~~e~~~L~~~ 539 (678) .+..++--|.. -.-|.+||.+|. +.--+..+|.-+..+.+.|...+ ...++.+.++.|..+ T Consensus 163 ~~V~vvp~~~g------------~~~G~~aGRL~n--aavsVadsP~Rv~tG~l~gl~~~p~d~~~~~l~~a~l~aLd~a 228 (370) T protein:vir:78 163 SSVSLIPQLWP------------TLAGAYAGRLCN--RAVSIADSPCRVKTGALVGLGNKPVGKDGIPLPLATLQTLEAN 228 (370) T ss_pred ccceEEeeecc------------ccHHHHHHHHhc--CeeeecccceeeeccccccccccccccCCcccCHHHHHHHHhC Confidence 45544432211 113778887653 22237788887776666664322 234677889999999 Q ss_pred CcEEEEEecCC-cEEEeccccCCCCccccceeehhhHHHHHHHHHHHHHH-HHhcCCCC--HHHHHHHHHHHHHHHHHHH Q lcl|NC_019538. 540 SMNPVVGFPGQ-GFILYGDKTMSLQPTPFDRINVRRLFNLLKKSISESAK-YKLFENND--AFTRNSFRSEVNSYLDSIK 615 (678) Q Consensus 540 gIn~i~~~~~~-G~~~wG~rT~~~~~~~~~~i~vrR~~~~i~~si~~~~~-~~vfepn~--~~~~~~v~~~i~~~L~~l~ 615 (678) |..+.+.++|+ |+.+-..|||+...++|+||..+|..+-+.|.++..+- ...+|--| +...+..+..+..=|+++- T Consensus 229 gy~vp~~Y~gy~G~Y~~d~~tl~~~gsDYq~ie~~RVvdKa~R~vR~~ai~~i~D~~lnst~gsia~~~~~~~~~L~ema 308 (370) T protein:vir:78 229 RYSVPMWYPDYDGIYWADGRTLDAEGGDYQVIENLRIAYKVARRMRLRAIARIGDRSFNSTPGSTAAAITYFGKDLREMA 308 (370) T ss_pred CCeEEEeeCCCCceEEeCceEeccCCCChhhhhhhhHHHHHHHHHHHHHHHHhCCcccCCCCcchhHHHHHHHhhHHHHH Confidence 99999999985 88888889998877889999999999999999995444 44443222 2222334444555566666 Q ss_pred hcCCeee--eEEEEc--cC-CCCHHHhhCCEEEEEEEEEecCCceEEEEEEEEeecCceeeeccc Q lcl|NC_019538. 616 SLGGIYD--FRVVCD--ET-NNTPAVIDRNEFVATILIKPARSINYVSLNFSAVGTSANFDELVG 675 (678) Q Consensus 616 ~~gal~g--~~V~~d--~~-~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~~ 675 (678) ..+.+.| |.-.|. ++ .-+..-....++.|.+.+.|.--...|+..|.-... +|+==+ T Consensus 309 ~s~~i~~~~fpgeI~~p~d~Di~i~w~s~~~v~I~~~v~P~~~pk~Itv~I~LDls---~e~~~~ 370 (370) T protein:vir:78 309 KSTTINGQPFPGDIASPQDGDIRIQWVAKNLVSVFVVVRTVDCPKGITVNIMLDLS---LNNGEG 370 (370) T ss_pred hhhhhcccccceeEeccCCCcceEEeeccceEEEEEEEEeccCCceEEEEEEEeec---cccCCC Confidence 6787776 433443 21 112223467889999999999888999888743222 332222 No 57 >protein:vir:1996 Length: 495 # NCBI annotation: major tail subunit # Family: family:all:369 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050643;genbank:gi:9633530;genbank:GeneID:2636269 Probab=98.45 E-value=6.2e-07 Score=54.62 Aligned_cols=441 Identities=12% Similarity=0.086 Sum_probs=208.5 Q ss_pred CceecCceEEEEcCCCcccccCCc--cceeEEecc---cCCCCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHHh-c Q lcl|NC_019538. 1 MALLSPGVESKENNMQTTIARSST--GRAALAGKF---QWGPAYQISQLVSETDLIDRFGRPDNQTADSVLSAINFLK-Y 74 (678) Q Consensus 1 ~~~~~PGVyveEv~~~~~i~~v~t--sv~afvG~~---~~Gpv~~pv~i~s~~~~~~~FG~~~~~~~~~~~v~~~f~n-g 74 (678) =.+..||+|+| ++.+.-+.|.++ .-.-+||.. ...|.++|++|+|..|-...||.- +.+.-+++.|..+ - T Consensus 11 ~~iRvP~~y~E-~dns~A~~g~~~~~q~vLiiGq~la~gs~~~~~pv~v~s~~~a~~~fG~G---S~la~M~~a~~~~n~ 86 (495) T protein:vir:19 11 SDVRVPLTYIE-FDNSNAVSGTPAPRQRVLMFGQSGSKASAAPNVPVRIRSGSQASAAFGQG---SMLALMADAFLNANR 86 (495) T ss_pred cccccCeEEEE-EccCCCCcCCcCCCceEEEEEecCcccccccceeEEecCHHHHHHhcCcC---cHHHHHHHHHHHhCC Confidence 45678999998 544444444444 445788873 344789999999999999999975 4444556666665 4 Q ss_pred CCeEEEEEcCCcccccccccccccceeeeecccccccccceeeecccccccccccccccccccccceeeecccccccccc Q lcl|NC_019538. 75 GNDLRTVRILDEDTARNSSPFFETIDYTITSPGVDYRIGDDVKILQNGATITTGKVSALNSVGGITFVRFSTAEVVKKAK 154 (678) Q Consensus 75 G~~~~vvRv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~a~ 154 (678) -.++|++-+.+....++++ .+..+.... .. | .+.+-. .+..+...+ T Consensus 87 ~~~l~~i~~~D~aG~aA~g----~it~tg~at-~~---G-~l~l~I---------------~g~~v~v~V---------- 132 (495) T protein:vir:19 87 VAELWCIPQGNGTGNAAVG----EISLSGTAG-EN---G-SLVTYI---------------AGQRLAVSV---------- 132 (495) T ss_pred cceEEEEeeCChhhceeEE----EEEEeecCC-CC---c-EEEEEE---------------CCEEEEEEe---------- Confidence 4789999886532211111 111111110 00 0 000000 000000000 Q ss_pred eeecccccccceeeeeeeccccccccccee----eeee--ccccceeeecccccccccccccccccchhcccccccccee Q lcl|NC_019538. 155 ELNDYPALQNGWQIQFTSGGPGSGQSATAV----LNGI--RQDSKIYIRNDEYSRESLLRRDETTETYIDMCESYGIPVV 228 (678) Q Consensus 155 ~~~~~~~~~~~~~~~~~s~~~~~g~~a~~~----~~~~--~~~~~i~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~i 228 (678) ..+.++... ...+ ..+..+...... ..+ .........+ T Consensus 133 ---------------------~~gdTaa~vA~al~aaina~~~lPvTA~~~~---------~~~------~~~a~~~Vtl 176 (495) T protein:vir:19 133 ---------------------AAGATGAALADLLVARIKGQPDLPVTAEVRA---------DSG------DDDTHADVVL 176 (495) T ss_pred ---------------------cCCCCHHHHHHHHHHHhcCCccCceEEEeec---------cCC------CCcCceeEEE Confidence 000000000 0000 000000000000 000 0000001112 Q ss_pred eeccccccccceeEEeccccccccccccccccccccccccccccccceeeeeccccccccccccccceeeeccCCeeeee Q lcl|NC_019538. 229 ASRYAGLTGDNIQVAFIAYKDYYKFGVDGKISSVNTVNLKTFPSGLSFGNITPSSYLEYGPQTKDQFAMIVFVGGSAVES 308 (678) Q Consensus 229 ~A~~~G~~gn~i~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~ 308 (678) +|+..|. +|++.+...-+. T Consensus 177 TAr~kG~-~n~idi~~~~~~------------------------------------------------------------ 195 (495) T protein:vir:19 177 SAKFTGA-LSAVDVRWNYYA------------------------------------------------------------ 195 (495) T ss_pred EEeeccc-cccceeEEEeec------------------------------------------------------------ Confidence 2333332 122222110000 Q ss_pred eeeeccccccccccchhhhhhhhcCCcceEEEEecCCCccccc-eeeeeccCcCCccccchhHHHhhhhhhhccchhccc Q lcl|NC_019538. 309 RILSVKENDRDIYGSSIYVDEFFINGYSTFIQGVAESWPTEYS-GILTFGGGNSGNSTASAGDWIEGWDMFSDREHVDVN 387 (678) Q Consensus 309 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~-~~~~l~gg~dg~~~~~~~~~~~~~~~~~~~~~~~~~ 387 (678) ....|.... ....++||.. ..|+..++..+... ..+ T Consensus 196 ----------------------------------ge~~p~Glt~titamsgGag------~PDia~alaal~~~---~~~ 232 (495) T protein:vir:19 196 ----------------------------------GETTPYGIITAFKAASGKNG------NPDISASIAGMGDL---QYK 232 (495) T ss_pred ----------------------------------ccccccceeEEEEecCCCCC------CcchHHHHHHhccC---CCc Confidence 000000000 0111233331 12333344444422 233 Q ss_pred cccccccccCcccchhHHHHHHHHHHHhcC------CeEEEEccccchhccccccCCHHHHHHHHhcccccccchhhccc Q lcl|NC_019538. 388 LFIAGSCAGEGVEIASTVQKSVAAICDERQ------DCLGWISPPREYMVNLPVATAVKKMVEWRRGVTDSGVVVDDNMN 461 (678) Q Consensus 388 ~~~~~~~~~~~~~~~~~v~~~l~~~~~~~~------~~~~i~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 461 (678) .++++. ++... ..+|.+|++.|- ++.++.- ...+..++.+|-.. T Consensus 233 ~I~~P~------tD~as-L~al~~~l~~rw~~~~q~~g~~~~a----------~~gT~~~l~t~g~~------------- 282 (495) T protein:vir:19 233 YIVMPY------TDEPN-LNLLRTELQERWGPVNQADGFAVTV----------LSGTYGDISTFGVS------------- 282 (495) T ss_pred EEEEec------CcHHH-HHHHHHHHHHhhhHHHhcCeEEEEe----------ecCCHHHHHHhhhc------------- Confidence 344432 33333 255666665532 3333221 12355666666443 Q ss_pred cccceEEEEcCeEEEecccCCceeEechHHHHHHHHHHHh--hcCCceECcCCcchhheeecccceecCChhhhhhhhhC Q lcl|NC_019538. 462 IGTTYSSTSANYKLQYDKYNDTNRWIPLSADMAGLCARTD--TVAQPWMSPAGFNRGQILDVRKLAIETRQAHRDELYQN 539 (678) Q Consensus 462 ~~s~~~~~~~p~~~v~d~~~~~~~~~pps~~vAG~~a~~d--~~~g~~~sPan~~~~~v~~~~~~~~~~~~~e~~~L~~~ 539 (678) .++.|..+.+ . ++ -.-||....|++.++.- .+..|-..--...+.||..+ .+...++..|+|.|.-+ T Consensus 283 ~N~~~it~~~--~------~g--sp~~~~~~AAA~aa~~A~~l~~DPArPL~tl~L~Gi~~p-~~~~r~~~~ern~LL~~ 351 (495) T protein:vir:19 283 RNDHLISCMG--I------AG--APEPSYLYAATLCAVASQALSIDPARPLQTLTLPGRMPP-AVGDRFTWSERNALLFD 351 (495) T ss_pred cCCceEEEEe--c------CC--CCCcHHHHHHHHHHHHHHHhhcccccccCceeecceecC-CccccCChHHHHHHHhC Confidence 3344444431 1 11 12344444444443332 12234333334555565443 34667889999999999 Q ss_pred CcEEEEEecCCcEEEeccccC-----C-CCccccceeehhhHHHHHHHHHHHHHHHHhc-CCCCHH-----------HHH Q lcl|NC_019538. 540 SMNPVVGFPGQGFILYGDKTM-----S-LQPTPFDRINVRRLFNLLKKSISESAKYKLF-ENNDAF-----------TRN 601 (678) Q Consensus 540 gIn~i~~~~~~G~~~wG~rT~-----~-~~~~~~~~i~vrR~~~~i~~si~~~~~~~vf-epn~~~-----------~~~ 601 (678) ||.++.--+++=..+--..|. . ..|..|..|+.-|+.+|+++.++..+...-. +..-++ |-. T Consensus 352 Gist~~V~~~G~V~I~R~ITTY~~n~~G~~D~syLDi~T~~tl~yvr~~~r~~i~~kfpR~KLa~d~~~~~~gq~IvTp~ 431 (495) T protein:vir:19 352 GISTFNVNDGGEMQIERMITMYRTNKYGDSDPSYLNVNTIATLSYLRYSLRTRITQKFPNYKLASDGTRFATGQAVVTPS 431 (495) T ss_pred CcceEEECCCCeEEEEeeeeeeeecCCCCcchhhhhhHHHHHHHHHHHHHHHHHhhhcCCcccccCCCCCCCcccccChH Confidence 999997654444555555553 1 2234599999999999999999987763222 222221 667 Q ss_pred HHHHHHHHHHHHHHhcCCeeee---E--EEEccCCCCHHHhhCCEEEEEEEEEecCCceEEEEEEEEee Q lcl|NC_019538. 602 SFRSEVNSYLDSIKSLGGIYDF---R--VVCDETNNTPAVIDRNEFVATILIKPARSINYVSLNFSAVG 665 (678) Q Consensus 602 ~v~~~i~~~L~~l~~~gal~g~---~--V~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~ 665 (678) .||..+-+-+++|..+|-+..+ + +.|.++-+ +.+|+.+.+-...+-+..-+-.+++-.- T Consensus 432 ~ir~ell~~~~~le~~given~~~~~~~LiVerd~~-----dpnRln~~~p~d~vn~L~V~A~~i~f~L 495 (495) T protein:vir:19 432 VIKTELLALFEEWENAGLVEDFDTFKEELYVARNKD-----DKDRLDVLCGPNLINQFRIFAAQVQFIL 495 (495) T ss_pred HHHHHHHHHHHhhhhhccccChhhhcceeEEEECCC-----CCcEEEEEecceeeCceeeeeeeeeeeC Confidence 8999999999999999999873 2 33432221 1256777776666666554444433333 No 58 >protein:vir:95263 Length: 450 # NCBI annotation: Phage conserved protein # Family: family:all:396 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944896;genbank:gi:38707836;genbank:GeneID:2744049 Probab=98.29 E-value=1.6e-06 Score=52.35 Aligned_cols=407 Identities=10% Similarity=0.012 Sum_probs=162.8 Q ss_pred ccccceeeeeeeccccccc----ccceeeeee-ccccceeeeccccccc-----ccccccc-cccchhccccccccceee Q lcl|NC_019538. 161 ALQNGWQIQFTSGGPGSGQ----SATAVLNGI-RQDSKIYIRNDEYSRE-----SLLRRDE-TTETYIDMCESYGIPVVA 229 (678) Q Consensus 161 ~~~~~~~~~~~s~~~~~g~----~a~~~~~~~-~~~~~i~~~~~~~a~~-----~~~~~~~-~~~~~~~~~~~~~~~~i~ 229 (678) -++....+.... .+.... ......... +....+ .....+.+ ....+.+ ....+-+..++|.. T Consensus 1 ~~s~iVnV~i~~-~~~a~~~~~f~~~l~~~~~~~~~~r~--~~yss~~~V~~~FG~~S~ey~aA~~yF~q~p~p~~---- 73 (450) T protein:vir:95 1 MWNPIVNVDITL-NTAGTTREGFGLPLFLASTDNFEERV--RGYTSLTEVAEDFDENTAAYKAAKQLWSQTPKVTQ---- 73 (450) T ss_pred CCCceEEEeecc-cccccccccceeEEEEcCCCCCccce--eeecCHHHHHHhcCCCcHHHHHHHHHHhCCCcccE---- Confidence 111111111110 000000 000000000 000000 00000000 0000000 00001111111111 Q ss_pred eccccccccceeEEeccccccccccccccccccccccccccccccceeeeeccccccccccccccceeeeccCCeeeeee Q lcl|NC_019538. 230 SRYAGLTGDNIQVAFIAYKDYYKFGVDGKISSVNTVNLKTFPSGLSFGNITPSSYLEYGPQTKDQFAMIVFVGGSAVESR 309 (678) Q Consensus 230 A~~~G~~gn~i~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~ 309 (678) .+.|.|...-+... ......... ..+.+..++...... T Consensus 74 -l~igr~~~~~t~~~-------~~~~~~~~~----------------------------------g~lt~tv~G~~~~~~ 111 (450) T protein:vir:95 74 -LYIGRRAMQYTVSI-------PDAVTESTD----------------------------------YSITVAAGGGISQPY 111 (450) T ss_pred -EEEEeeccchhhhh-------hhhhccccc----------------------------------eeEEEEecceeeeee Confidence 11122211100000 000000000 000011111111100 Q ss_pred e--eeccccccccccchhhhhhhhcCCcceE---------------EEEecCCCccc-----cceeeeeccCcCCccccc Q lcl|NC_019538. 310 I--LSVKENDRDIYGSSIYVDEFFINGYSTF---------------IQGVAESWPTE-----YSGILTFGGGNSGNSTAS 367 (678) Q Consensus 310 ~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------------v~~~~~~~~~~-----~~~~~~l~gg~dg~~~~~ 367 (678) . ++...... .....+ .....+.... ........... .........|.+. T Consensus 112 ~i~~s~a~s~~---~va~~~-~tai~~~~~~~~~~~~~s~g~~~~~t~~~~~~~~~~~~~l~~~~~~~~~~g~~a----- 182 (450) T protein:vir:95 112 QYTAQSSDTAE---NVLQQF-KTQIEADPTIKDKVSVNVTGSNGSATMIIAKAGDNDFVKVTTTAQTVYIASTTA----- 182 (450) T ss_pred EEEEEecCChh---hHHHHh-hhhhcccceeeeeeeeeeecccceeeeeeeccccchhhccccccceeEeccccc----- Confidence 0 00000000 000000 0000000000 00000000000 0000011111111 Q ss_pred hhHHHhhhhhhhccchhccccccccccccCcccchhHHHHHHHHHHHhcCCeEEEEccccchhccccccCCHHHHHHHHh Q lcl|NC_019538. 368 AGDWIEGWDMFSDREHVDVNLFIAGSCAGEGVEIASTVQKSVAAICDERQDCLGWISPPREYMVNLPVATAVKKMVEWRR 447 (678) Q Consensus 368 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~l~~~~~~~~~~~~i~d~p~~~~~~~~~~~~~~~~~~~~~ 447 (678) .....++..+..... ..-.+. .+..++ .-..+|..+++....+|.....-... ..........++- . T Consensus 183 -et~~~a~~a~~~~~~-~w~~~~-----~~~~~~--~~i~a~a~w~~a~~~~f~~~~~~~~~-~~~~~~~~~~~i~---~ 249 (450) T protein:vir:95 183 -DTASTALAAIEAYST-DWYFIA-----AEDRTQ--QFVLAMASEIQARKKIFFTANSDVTA-LQGTELASANDVP---A 249 (450) T ss_pred -ccHHHHHHHHHHhhC-CeEEEE-----ecCCCH--HHHHHHHHHHhhcCcEEEEEcCCchh-hhhhhhhcccchH---H Confidence 111122222221110 011111 111111 22344666666655544432221000 0000000000000 0 Q ss_pred cccccccchhhccccccceEEEEcCeEEEecccCCceeEechHHHHHHHHHHHhhcCCceECcCCcchhheeecc--cce Q lcl|NC_019538. 448 GVTDSGVVVDDNMNIGTTYSSTSANYKLQYDKYNDTNRWIPLSADMAGLCARTDTVAQPWMSPAGFNRGQILDVR--KLA 525 (678) Q Consensus 448 ~~~~~~~~~~~~~~~~s~~~~~~~p~~~v~d~~~~~~~~~pps~~vAG~~a~~d~~~g~~~sPan~~~~~v~~~~--~~~ 525 (678) .+. ..++ .+. +-+|++.+.. -.|.+.++|.....+..+--| .+|.+.||..-. ... T Consensus 250 ~l~--------~~~~--~~t------~~~y~~~~~~---~~~~aa~~g~~~~~~~g~~T~---~fk~l~Gv~~~v~~~~~ 307 (450) T protein:vir:95 250 QLA--------KNMY--TRT------VCLWHHAAAE---DYPEMAYIAYGAPYDAGSIAW---GNAQLTGVAASLQPSNQ 307 (450) T ss_pred HHH--------hccC--Cee------EEEeeCCCch---hHHHHHHHHHhhhcccceeee---ccccccceeeeccCccc Confidence 000 0010 111 1122211111 225566666655444333233 356666664322 122 Q ss_pred ecCChhhhhhhhhCCcEEEEEecCCcEEEeccccCCCCccccceeehhhHHHHHHHHHHHHHHHHhc------CCCCHHH Q lcl|NC_019538. 526 IETRQAHRDELYQNSMNPVVGFPGQGFILYGDKTMSLQPTPFDRINVRRLFNLLKKSISESAKYKLF------ENNDAFT 599 (678) Q Consensus 526 ~~~~~~e~~~L~~~gIn~i~~~~~~G~~~wG~rT~~~~~~~~~~i~vrR~~~~i~~si~~~~~~~vf------epn~~~~ 599 (678) ..++..|.+.|..+++|++.++.+.++ ++..+|++++ ||-++|-.+|++..|++.+...+- =|-|+.- T Consensus 308 ~~lt~~~~~al~~~~~n~y~~~~~~~~-~~~G~~~~G~-----~iD~~~~~~wl~~~iq~~l~~ll~~~~~~KiPy~~~G 381 (450) T protein:vir:95 308 RPLTSIQKSALDVRHCNFIDLDGGVPV-VRRGITSGGE-----WIDIIRGVDWLESDLKTSLRDLLINQKGGKITYDDTG 381 (450) T ss_pred cccchHHHHHHHhCCcEEEEEecCcee-eeCCeeeCcc-----hhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCccChhh Confidence 468899999999999999999877654 7788888762 688999999999999999887652 2667787 Q ss_pred HHHHHHHHHHHHHHHHhcCCeeeeEEEEc-cCCCCHHHhhCCEE-EEEEEEEecCCceEEEEEEEEeecCceee Q lcl|NC_019538. 600 RNSFRSEVNSYLDSIKSLGGIYDFRVVCD-ETNNTPAVIDRNEF-VATILIKPARSINYVSLNFSAVGTSANFD 671 (678) Q Consensus 600 ~~~v~~~i~~~L~~l~~~gal~g~~V~~d-~~~nt~~~i~~G~~-~~~i~~~p~~p~e~i~~~~~~~~~~~~~~ 671 (678) ...|+..|+.-|++..++|.|.||+|.+. .+..++.|+.++++ -+.+.++....++++.+++.- +.| T Consensus 382 ~~~i~a~i~~~l~~a~~~G~Ia~~~V~~~~~~~~~~~dr~~R~~~~i~~~~~laGAIh~~~i~~~v-----~~~ 450 (450) T protein:vir:95 382 ITRIRQVIETSLQRAVNRNFLSSYTVNVPKASQVALADKKARILKDVTFAGILAGAILDVDLKGTV-----AYE 450 (450) T ss_pred HHHHHHHHHHHHHHHHhcCcccceeEecCChHhcCHHHHhccCCCCeeEEEEEccceEEEEEEEEE-----EeC Confidence 88899999999999999999999999997 58889999999887 488999999999999887653 333 No 59 >protein:vir:80052 Length: 331 # NCBI annotation: gp14 # Family: family:all:396 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468718;genbank:gi:157325298;genbank:GeneID:5601743 Probab=97.96 E-value=9.1e-06 Score=48.22 Aligned_cols=311 Identities=14% Similarity=0.066 Sum_probs=151.1 Q ss_pred cccccccccccceeeeccCCee----e-eeeeeeccccccccccchhhhhhhhcCCcceEEEEecCCCccccceeeeecc Q lcl|NC_019538. 284 YLEYGPQTKDQFAMIVFVGGSA----V-ESRILSVKENDRDIYGSSIYVDEFFINGYSTFIQGVAESWPTEYSGILTFGG 358 (678) Q Consensus 284 ~~~~~~~~~~~~~~~v~~~~~~----~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~l~g 358 (678) ++....+...+.. +..+... . ..++.........+........++..+ ...|.. +............+.. T Consensus 1 ~~~~iv~V~v~~~--~~~~~~~~~~~~~~~~~~~t~~~~~~y~s~~~v~~d~~~~-~~~Yka--a~~~f~Q~~~~~~i~v 75 (331) T protein:vir:80 1 MVETITDVRVHIS--VLYPSPRIGLGRPAIFVKGTAMGYKEYTTLEELKDTFADN-TEVYAK--AKAVFLQKDRPDTVAV 75 (331) T ss_pred Cccceecceeeec--ccccccccccCcceeEEeccccceEEEechhhhccCCCCC-cHHHHH--HHHHHhccCccceEEE Confidence 1111111100000 0000000 0 000000000000000000000000000 000000 0000000000011111 Q ss_pred CcCCccccchhHHHhhhhhhhccchhccccccccccccCcccchhHHHHHHHHHHHhcCCeEEEEccccchhccccccCC Q lcl|NC_019538. 359 GNSGNSTASAGDWIEGWDMFSDREHVDVNLFIAGSCAGEGVEIASTVQKSVAAICDERQDCLGWISPPREYMVNLPVATA 438 (678) Q Consensus 359 g~dg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~l~~~~~~~~~~~~i~d~p~~~~~~~~~~~~ 438 (678) +... .+. ...+....-... .-.+. ....+ ..-..++..+++..+.+|.+++.. + T Consensus 76 ~~~~----~~~-~~~a~~a~~~~~---w~~~~-----~~~~~--~~~~~a~a~~~~a~~~~f~~~~~~-----------~ 129 (331) T protein:vir:80 76 ITYE----DTK-LLEAAEAYFLKS---WHFAL-----LAEFK--AADALALSNLIEEQKFKFAVFQVT-----------A 129 (331) T ss_pred eccc----hHH-HHHHHHHhccCc---eeEEE-----eecCC--HHHHHHHHHHHhhCCcEEEEEecC-----------c Confidence 1110 000 011111100000 00111 11111 122345667777777766554421 1 Q ss_pred HHHHHHHHhcccccccchhhccccccceEEEEcCeEEEecccCCceeEechHHHHHHHHHHHhhcCCceECcCCcc-hhh Q lcl|NC_019538. 439 VKKMVEWRRGVTDSGVVVDDNMNIGTTYSSTSANYKLQYDKYNDTNRWIPLSADMAGLCARTDTVAQPWMSPAGFN-RGQ 517 (678) Q Consensus 439 ~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~v~d~~~~~~~~~pps~~vAG~~a~~d~~~g~~~sPan~~-~~~ 517 (678) ..++..... .+....++++ ..+. . +.+.++|.++..|..+--| .++. +.| T Consensus 130 ~~~~~~~~~---------------~~~t~~~~~~-------~~~~---~-~~aa~~g~~~~~~~g~~t~---~fk~~l~G 180 (331) T protein:vir:80 130 VADITPLAK---------------NTRTIAIVHS-------KTGE---K-LDAALIGNVASLPVGSATW---KGRHGLAG 180 (331) T ss_pred hHHHHHhhc---------------cccEEEEEcC-------Cccc---h-hHHHHHHHHHhcCccceee---eeecccCC Confidence 122211100 0112223322 1111 1 3455666666666433223 2332 334 Q ss_pred eeecccceecCChhhhhhhhhCCcEEEEEecCCcEEEeccccCCCCccccceeehhhHHHHHHHHHHHHHHHHhcC---- Q lcl|NC_019538. 518 ILDVRKLAIETRQAHRDELYQNSMNPVVGFPGQGFILYGDKTMSLQPTPFDRINVRRLFNLLKKSISESAKYKLFE---- 593 (678) Q Consensus 518 v~~~~~~~~~~~~~e~~~L~~~gIn~i~~~~~~G~~~wG~rT~~~~~~~~~~i~vrR~~~~i~~si~~~~~~~vfe---- 593 (678) |. ...++..|++.|..+++|++.++.+.. .++...|++++ ||.+.+-.+|++..|++.+...+-. T Consensus 181 V~-----~~~lt~t~~~al~~~~~N~y~~~~~~~-~~~~G~~~~G~-----~iD~~~~~dWl~~~lq~~l~~ll~~~~ki 249 (331) T protein:vir:80 181 IT-----SEELKVSEIDAIQKAGGMCYIEKAGIA-QTSEGKTVSGE-----FIDSIHGDDWIKATIETRLQKLLTETDKL 249 (331) T ss_pred CC-----CCCCCHHHHHHHHhcCceEEEEecCee-EEecceEeCch-----hHHHHHHHHHHHHHHHHHHHHHHHhCCCC Confidence 42 235789999999999999999987654 46777787763 6899999999999999988865543 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHhcCCee--------eeEEEEc-cCCCCHHHhhCCEEE-EEEEEEecCCceEEEEEEEE Q lcl|NC_019538. 594 NNDAFTRNSFRSEVNSYLDSIKSLGGIY--------DFRVVCD-ETNNTPAVIDRNEFV-ATILIKPARSINYVSLNFSA 663 (678) Q Consensus 594 pn~~~~~~~v~~~i~~~L~~l~~~gal~--------g~~V~~d-~~~nt~~~i~~G~~~-~~i~~~p~~p~e~i~~~~~~ 663 (678) |-|+.=...|+..++.-|++-.++|.|. ||+|.+. .++.+++|+.++++. +.+.+.+...+++|.+++.- T Consensus 250 Py~~~G~~~l~a~i~~~~~~av~~G~I~~g~~~~~~~~~v~~~~~~~~s~~dr~~R~~~~~~~~~~~~gaI~~v~i~~~v 329 (331) T protein:vir:80 250 TFDARGIALLQSELTTVLNEGFANGIIDSNDETGEPNFSITALQRSDLNDDDIAKRNYKGLSFRYKRSGAIHSVDVYGEV 329 (331) T ss_pred ccChhhHHHHHHHHHHHHHHHHhCCceecCccCCCcceEEEeCchhcCCHHHHhccCCCCeEEEEEEcceEEEEEEEEEE Confidence 3466677889999999999999999996 6899987 578899999999985 88899999999999987653 Q ss_pred ee Q lcl|NC_019538. 664 VG 665 (678) Q Consensus 664 ~~ 665 (678) .- T Consensus 330 ~~ 331 (331) T protein:vir:80 330 EV 331 (331) T ss_pred eC Confidence 33 No 60 >protein:vir:5260 Length: 502 # NCBI annotation: hypothetical protein # Family: family:all:396 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852765;genbank:gi:31544040;uniprot:Q7Y5T5;genbank:GeneID:2753559 Probab=97.76 E-value=2.1e-05 Score=46.18 Aligned_cols=462 Identities=11% Similarity=0.023 Sum_probs=215.3 Q ss_pred Cce-ecCceEEEEcCCCcccccCCccceeEEec-ccCCCCC--ccEE-ecCHHHHHHHcCCcCccchhHHHHHHHHHhcC Q lcl|NC_019538. 1 MAL-LSPGVESKENNMQTTIARSSTGRAALAGK-FQWGPAY--QISQ-LVSETDLIDRFGRPDNQTADSVLSAINFLKYG 75 (678) Q Consensus 1 ~~~-~~PGVyveEv~~~~~i~~v~tsv~afvG~-~~~Gpv~--~pv~-i~s~~~~~~~FG~~~~~~~~~~~v~~~f~ngG 75 (678) |.+ ++.=|.|.---.+..+.+..=+...|+|. +..-|.. +.++ .+|..|-...||. .+.++.+.+.+|- +- T Consensus 1 msip~s~ivnV~i~~~~~a~~~~~f~~~l~l~~~~~~~~~~~~~r~~~~~s~~~V~~~FG~---~s~ey~aA~~yF~-q~ 76 (502) T protein:vir:52 1 MALSISHIVNVQLNTVPKSAARKSFGIVALFTPEAGQAFADEKTRYVYVENQRDVEQLFGT---NSETAKAAQPFFA-QS 76 (502) T ss_pred CCCCccceeEEeeccccccccccccCceEEEeeccCccccCCccceEEecCHHHHHHhcCC---ChHHHHHHHHHhc-CC Confidence 998 67766676333456677777788899987 3444433 3333 4688999999994 4566677788884 32 Q ss_pred ---CeEEEEEcCCcccccccccccccceeeeecccccccccceeeecccccccccc--cccccccccccceeeecccccc Q lcl|NC_019538. 76 ---NDLRTVRILDEDTARNSSPFFETIDYTITSPGVDYRIGDDVKILQNGATITTG--KVSALNSVGGITFVRFSTAEVV 150 (678) Q Consensus 76 ---~~~~vvRv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~a~~~ 150 (678) +++||-|-....... ....+.+ . +...... .... ..+|.. .. T Consensus 77 p~P~~l~igR~~~~~~~~--~~~~~~~-----------------~----~~~~~~~~~~~~~-~~~G~l-~i-------- 123 (502) T protein:vir:52 77 PRAKQLIVARWQKSASTI--EATKNTL-----------------S----GATLSDDLERFKS-VVNGRF-SL-------- 123 (502) T ss_pred CccceEEEEeccccccce--eechhhh-----------------h----hhhhHHhHHHhhh-hcCcee-EE-------- Confidence 447777754321110 0000000 0 0000000 0000 000000 00 Q ss_pred cccceeecccccccceeeeeeecccccccccceeeeeeccccceeeecccccccccccccccccchhccccccccceeee Q lcl|NC_019538. 151 KKAKELNDYPALQNGWQIQFTSGGPGSGQSATAVLNGIRQDSKIYIRNDEYSRESLLRRDETTETYIDMCESYGIPVVAS 230 (678) Q Consensus 151 ~~a~~~~~~~~~~~~~~~~~~s~~~~~g~~a~~~~~~~~~~~~i~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~i~A 230 (678) +... + ..+...++ +........... .+.... T Consensus 124 -------~i~g---------~----------~~t~~~i~------lS~~ts~~~vA~-------~i~~~l---------- 154 (502) T protein:vir:52 124 -------TIGG---------D----------VKKVDGLS------FARLADFNAVAT-------KIQEKL---------- 154 (502) T ss_pred -------Eecc---------e----------eeeeeccc------cccccchhHHHH-------HHHhhh---------- Confidence 0000 0 00000000 000000000000 000000 Q ss_pred ccccccccceeEEeccccccccccccccccccccccccccccccceeeeeccccccccccccccceeeeccCCee-eeee Q lcl|NC_019538. 231 RYAGLTGDNIQVAFIAYKDYYKFGVDGKISSVNTVNLKTFPSGLSFGNITPSSYLEYGPQTKDQFAMIVFVGGSA-VESR 309 (678) Q Consensus 231 ~~~G~~gn~i~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~-~~~~ 309 (678) +..+...++.... .. ..|.+.....+.. .-++ T Consensus 155 ---~~~~~~~tv~~d~--------------------------~~------------------~~F~i~s~ttg~~~~~~~ 187 (502) T protein:vir:52 155 ---TTLSVAVSIAYDE--------------------------TG------------------NRFIVSANVAGEDKKTEI 187 (502) T ss_pred ---cccccceEEEEec--------------------------CC------------------ceEEEEeccCCCcceeEE Confidence 0000000000000 00 0000000000000 0000 Q ss_pred eeeccccccccccchhhhhhhhcCCcceEEEEecCCCccccceeeeeccCcCCccccchhHHHhhhhhhhccc-hhcccc Q lcl|NC_019538. 310 ILSVKENDRDIYGSSIYVDEFFINGYSTFIQGVAESWPTEYSGILTFGGGNSGNSTASAGDWIEGWDMFSDRE-HVDVNL 388 (678) Q Consensus 310 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~l~gg~dg~~~~~~~~~~~~~~~~~~~~-~~~~~~ 388 (678) ....... .++.......... ...+..++... ..|. +. ..+..++..+.... .+-. + T Consensus 188 ~~a~~~~-~~gt~~a~~l~l~-~~~~av~v~~~--------------~~g~---~a---et~~~al~a~~~~~~~w~~-~ 244 (502) T protein:vir:52 188 DYAIDEG-GEGEYIGALLKLE-NGQASRKVGKN--------------SVSL---KK---ETLGEALFNVAEVNNTWYG-F 244 (502) T ss_pred EEeecCC-cchhHHHHHhccc-cccceeeeeee--------------cccc---cc---cCHHHHHHHHHhccCceEE-E Confidence 0000000 0000000000000 00000011000 0111 11 11222233222211 1111 1 Q ss_pred ccccccccCcccchhHHHHHHHHHHHhcCCeEEEEccccchhccccccCCHHHHHHHHhcccccccchhhccccccceEE Q lcl|NC_019538. 389 FIAGSCAGEGVEIASTVQKSVAAICDERQDCLGWISPPREYMVNLPVATAVKKMVEWRRGVTDSGVVVDDNMNIGTTYSS 468 (678) Q Consensus 389 ~~~~~~~~~~~~~~~~v~~~l~~~~~~~~~~~~i~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~ 468 (678) ++ ....+ ..-..++..+++..+.+|....... ...+.. ..++..-.. ..++ .+.. T Consensus 245 ~~-----a~~~~--~~~~la~a~~iea~~~~f~~~~~d~-~~~~~~----~~~i~~~l~-----------a~~~--~~t~ 299 (502) T protein:vir:52 245 TV-----AAQLT--DSEVEAAAKYAQANTKLFGANVIRA-EQIEWS----ADNIYKKLY-----------DAGL--DHTL 299 (502) T ss_pred EE-----eecCC--hhHHHHHHHHHhhcCcEEEEEecCc-ceeccc----cchHHHHHH-----------hccC--ceeE Confidence 11 11111 2234567777887776665432211 111111 111111000 1111 1111 Q ss_pred EEcCeEEEecccCCceeEechHHHHHHHHHHHhhcCC-ceECcCCcchhheeecccceecCChhhhhhhhhCCcEEEEEe Q lcl|NC_019538. 469 TSANYKLQYDKYNDTNRWIPLSADMAGLCARTDTVAQ-PWMSPAGFNRGQILDVRKLAIETRQAHRDELYQNSMNPVVGF 547 (678) Q Consensus 469 ~~~p~~~v~d~~~~~~~~~pps~~vAG~~a~~d~~~g-~~~sPan~~~~~v~~~~~~~~~~~~~e~~~L~~~gIn~i~~~ 547 (678) -+|++.+ -.|.+.+.|.++.+|-.+- -...-.+|.+.||. ...++..|.+.|..+++|++.++ T Consensus 300 ------~~y~~~~-----~~~~aa~~g~~as~~f~~~~g~iT~~fk~l~GV~-----~~~lt~t~~~al~~~~~N~y~~~ 363 (502) T protein:vir:52 300 ------AMFDKND-----MYPVSSALARLLSTNFAANNSTLTLKFKQQPTIT-----ADEITATEFAKAKRLGINVYTYF 363 (502) T ss_pred ------EEecCCc-----chhHHHHHHHHHhcCCCcCcceeeecccccCCcc-----cCcCCHHHHHHHHhcCceEEEEe Confidence 1222211 1255667788888774332 12233455555553 23588999999999999999998 Q ss_pred cCCcEEEeccccCCCCccccceeehhhHHHHHHHHHHHHHHHHhcC-----CCCHHHHHHHHHHHHHHHHHHHhcCCee- Q lcl|NC_019538. 548 PGQGFILYGDKTMSLQPTPFDRINVRRLFNLLKKSISESAKYKLFE-----NNDAFTRNSFRSEVNSYLDSIKSLGGIY- 621 (678) Q Consensus 548 ~~~G~~~wG~rT~~~~~~~~~~i~vrR~~~~i~~si~~~~~~~vfe-----pn~~~~~~~v~~~i~~~L~~l~~~gal~- 621 (678) .+.++ +...++++++ ||-+.+-.+|++..|++.+...++. |-|+.=...|+..|+.-|++-.++|.|. T Consensus 364 ~~~~~-~~~G~~~~G~-----~iD~~~~~~Wl~~~lq~~l~~~L~~s~~kIPy~~~G~~~l~a~i~~~l~~a~~~G~I~~ 437 (502) T protein:vir:52 364 DDVAM-IAEGTVIGGK-----FADEIVILDWFVDAVQKEVFARLYKSPTKIPLTDKGQAILIAAVEKVCLEGINNGAFAP 437 (502) T ss_pred cCeeE-EecCeeeCCc-----hhhHHHHHHHHHHHHHHHHHHHHHhcCCCcccChhHHHHHHHHHHHHHHHHHhcCcccc Confidence 66654 6777787763 6778899999999999998876652 4477778899999999999999999984 Q ss_pred -------------------eeEEEEc-cCCCCHHHhhCCEE-EEEEEEEecCCceEEEEEEEEee Q lcl|NC_019538. 622 -------------------DFRVVCD-ETNNTPAVIDRNEF-VATILIKPARSINYVSLNFSAVG 665 (678) Q Consensus 622 -------------------g~~V~~d-~~~nt~~~i~~G~~-~~~i~~~p~~p~e~i~~~~~~~~ 665 (678) ||+|.+. .++.++.|+.++++ -+.+.+++...+++|.|.+.-.+ T Consensus 438 G~~~~~~~g~~~~~d~~~~gy~v~~~~~~~~s~~dr~~R~~~~~~~~~~~aGaIh~v~i~~nv~~ 502 (502) T protein:vir:52 438 GKWTGAGFGNLSTGDYLDKGFYVWAAPMDTLSDSDRQARRATPIQTAVKLAGAIHSSDVIVNYNR 502 (502) T ss_pred ccccCcccceeeecccccCceEEEeCchhhCCHHHHHcccCCCeEEEEEECceEEEEEEEEEEeC Confidence 6899987 57889999999999 89999999999999999876555 No 61 >protein:vir:3165 Length: 426 # NCBI annotation: capsid protein CP67 # Family: family:all:28419 # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665936;genbank:gi:22091122;genbank:GeneID:951267 Probab=94.02 E-value=0.0056 Score=32.91 Aligned_cols=367 Identities=11% Similarity=0.035 Sum_probs=137.0 Q ss_pred ccccceeEEeccccccccccccccccccccccccccccccceeeeeccccccccccccccceeeeccCCeee-------- Q lcl|NC_019538. 235 LTGDNIQVAFIAYKDYYKFGVDGKISSVNTVNLKTFPSGLSFGNITPSSYLEYGPQTKDQFAMIVFVGGSAV-------- 306 (678) Q Consensus 235 ~~gn~i~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~-------- 306 (678) =+-+-+.|.+. ..+. .-....|......+.+-. T Consensus 1 m~~~iVnV~Is---------------------~~t~------------------A~~~~~Fg~~liigs~~~~~p~~~f~ 41 (426) T protein:vir:31 1 MPKQIVEIELT---------------------AEIA------------------DRPQETFTDAAIVGTAEEEPPDAEFG 41 (426) T ss_pred CCcceEEEEee---------------------cccc------------------cccccccceeeeeeeccccccccccc Confidence 00000001000 0000 000001111110110000 Q ss_pred e--eee----ee--ccccccccccchhhhhhhhcCCcceEEEE-----ecCCCccccceeeeeccCcCCccccchhHHHh Q lcl|NC_019538. 307 E--SRI----LS--VKENDRDIYGSSIYVDEFFINGYSTFIQG-----VAESWPTEYSGILTFGGGNSGNSTASAGDWIE 373 (678) Q Consensus 307 ~--~~~----~~--~~~~~~~~~~~~~~~~~~~~~~~~~~v~~-----~~~~~~~~~~~~~~l~gg~dg~~~~~~~~~~~ 373 (678) + .|. +. ...+...+.....+..+..-.+. ..+.. .....+......+...+ ..+.. ....++.. T Consensus 42 ~~~~Yss~~~V~~Dfg~~s~~Y~AA~~~f~Q~~~~~r-~~v~~at~~~~~~~t~~~tv~g~~~s~-~a~~~-~~a~~i~~ 118 (426) T protein:vir:31 42 EVNQYSTSTSVGDDYGEDSDVYTASEAIEEMGAEQWR-VMVLEATEVTEEELSDGDTIDKVPILG-NHEVE-SPDGDIEF 118 (426) T ss_pred hhhhhhhHHHHHhcCCCChHHHHHHHHHHhCCceeEE-eeccccceeeeccCCcceeecceeeee-cccCc-chHHHHHH Confidence 0 000 00 00000011111111111000000 00000 00000000000011111 11111 11122222 Q ss_pred hhhhhhccchhcccccccc---cc---------ccC--cccchhHHH--HHHHHHHHhcCCeEEEEccccchhccccccC Q lcl|NC_019538. 374 GWDMFSDREHVDVNLFIAG---SC---------AGE--GVEIASTVQ--KSVAAICDERQDCLGWISPPREYMVNLPVAT 437 (678) Q Consensus 374 ~~~~~~~~~~~~~~~~~~~---~~---------~~~--~~~~~~~v~--~~l~~~~~~~~~~~~i~d~p~~~~~~~~~~~ 437 (678) ++..............+.. .+ ... +|.....+. ..+..+++...+..++.+ T Consensus 119 ~~~~~~~~~~~~~~~~~~t~~g~~t~~~~~~~~~~s~~dw~~~~~~~s~~~~~~ia~~~~~~~~~~~------------- 185 (426) T protein:vir:31 119 TTDDDPDVEDFDAEIVINSATGDVATSEDSIELTYFHADWSQLDEFPSDVNNFAVADRRFDLKGVGV------------- 185 (426) T ss_pred hhccccccccceeeeEeccccceeeccccceeeeeccCcchhhhcccccchhhhhhccccchhhhhh------------- Confidence 2111000000000000000 00 000 000000000 000001111111000000 Q ss_pred CHHHHHHHHhcccccccch-hh-------ccccccceE-EEEcCeEEEecccCCceeEechHHHHHHHHHHHhhcCCceE Q lcl|NC_019538. 438 AVKKMVEWRRGVTDSGVVV-DD-------NMNIGTTYS-STSANYKLQYDKYNDTNRWIPLSADMAGLCARTDTVAQPWM 508 (678) Q Consensus 438 ~~~~~~~~~~~~~~~~~~~-~~-------~~~~~s~~~-~~~~p~~~v~d~~~~~~~~~pps~~vAG~~a~~d~~~g~~~ 508 (678) .++..+|.........+. .. ...+.+++. .-|.|-..+... .-+. .--..+++++.|+..+ ||. T Consensus 186 -~~~~~~wa~~~~i~~va~~~e~~~~~~~~~~~a~~~~~~~y~p~~~~~~~-~~~~-~~~~~~~~~~~~aa~~----~~~ 258 (426) T protein:vir:31 186 -LDETHSWASDEDMGMIANGVNVDDYDSVDEAMDVAHEVAGYVPSGDLMMI-VDAS-DDDLAAYQLGKFAVSE----PWY 258 (426) T ss_pred -hHhhhhhhhhcceeeeeeccchhhhcchhhhhhhhhcccccccchhheee-hhcc-ccchhhHHhhhhhhhc----ccc Confidence 011111110000000000 00 000111111 112222111000 0000 0012567888888776 566 Q ss_pred CcCCcchhheeecc------cceecCChhhhhhhhhCCcEEEEEecCCcEEEeccccCCCCccccceeehhhHHHHHHHH Q lcl|NC_019538. 509 SPAGFNRGQILDVR------KLAIETRQAHRDELYQNSMNPVVGFPGQGFILYGDKTMSLQPTPFDRINVRRLFNLLKKS 582 (678) Q Consensus 509 sPan~~~~~v~~~~------~~~~~~~~~e~~~L~~~gIn~i~~~~~~G~~~wG~rT~~~~~~~~~~i~vrR~~~~i~~s 582 (678) .|.-+...+...+. +....+...++-.++ +..|.++.+. ++..+|-.-|..+....-.||-++|..+||+.. T Consensus 259 ~~~~~~~~~~~~~~~~~~~~gv~~t~~~~~~A~~~-~~~n~~~~~~-~~~~i~~~~~~~G~~~~G~~iD~~~g~dwl~~~ 336 (426) T protein:vir:31 259 NPLWNELPAGETVSKNVGDPEEQGTFEGGDEAEGE-GPVNVLIDVS-DANRVSNAVTTAGADSDTSFFDIRRTKVYTAEM 336 (426) T ss_pred chhhhhccccccceeeccccccccccchhhhhhhc-CCceEEEEec-CceeeecceeecccccchhhhhhHHHHHHHHHH Confidence 66432222111111 112222233444555 7789999985 467777777776655556799999999999999 Q ss_pred HHHHHHHHhcC----CCCHHHHHHHHHHHHHHHHHHHhcCC--eeeeEEEEccCCCCHHHhhCCEEE-EEEEEEecCCce Q lcl|NC_019538. 583 ISESAKYKLFE----NNDAFTRNSFRSEVNSYLDSIKSLGG--IYDFRVVCDETNNTPAVIDRNEFV-ATILIKPARSIN 655 (678) Q Consensus 583 i~~~~~~~vfe----pn~~~~~~~v~~~i~~~L~~l~~~ga--l~g~~V~~d~~~nt~~~i~~G~~~-~~i~~~p~~p~e 655 (678) |+..++..+=. |-+..=...|+..|+.-|++..+.|. +.+|.|...+-..++.|..+.++. +++..+..-.++ T Consensus 337 iq~~l~~ll~~~~KIpyt~~Gi~~I~~~i~~~L~~~v~~~g~~~~~y~v~~P~~~~~~~dra~R~~~~i~~~~~laGAIh 416 (426) T protein:vir:31 337 LELDLESLQVSDDDVPFTEDGQAMIEDAIKGTMSGLTGSVGQPLAEYEVDVPEWDDDDVDRVNRNWGGIDLDARLAQRAH 416 (426) T ss_pred HHHHHHHHhhcCCCCccchhHHHHHHHHHHHHHHHHhcCCCccccceeecCCCccccchhhhhhccCCceEEEEEeCcEE Confidence 99999866532 56777888999999999999998754 557998876544466688887775 889999999999 Q ss_pred EEEEEEEEee Q lcl|NC_019538. 656 YVSLNFSAVG 665 (678) Q Consensus 656 ~i~~~~~~~~ 665 (678) ++.|++.-.- T Consensus 417 ~v~I~g~v~v 426 (426) T protein:vir:31 417 TFSLGLNVSV 426 (426) T ss_pred EEEEEEEEeC Confidence 9998865333 No 62 >protein:vir:96104 Length: 504 # NCBI annotation: hypothetical protein ORF029 # Family: family:all:396 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294446;genbank:gi:149408343;genbank:GeneID:5237223 Probab=93.65 E-value=0.0068 Score=32.45 Aligned_cols=427 Identities=11% Similarity=0.069 Sum_probs=144.1 Q ss_pred cccccceeecccccccceeeeeeecccccccccc-eeeeeeccccceeeecccccccccccccccccchhccccccccce Q lcl|NC_019538. 149 VVKKAKELNDYPALQNGWQIQFTSGGPGSGQSAT-AVLNGIRQDSKIYIRNDEYSRESLLRRDETTETYIDMCESYGIPV 227 (678) Q Consensus 149 ~~~~a~~~~~~~~~~~~~~~~~~s~~~~~g~~a~-~~~~~~~~~~~i~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~ 227 (678) .....+.+...+.+ .+..+.... ...... +.......+-.........+ ...|....+.+.. T Consensus 1 mip~s~iV~V~~~v-----------~~~~~~~~~~~~~l~l-~~~~~~~~~r~~~y~s~~~V---~~~FG~~S~ey~a-- 63 (504) T protein:vir:96 1 MISQSRYIRIISGV-----------GAGAPVAGRKLILRVM-TTNNVIPPGIVIEFDNANAV---LSYFGAQSEEYQR-- 63 (504) T ss_pred CCCccceeEeeecc-----------cccccccccccceeEe-ecccCCCccceEEecCHHHH---HHhcCCChHHHHH-- Confidence 11111111000000 000000000 000000 00000000000000000000 0001111111100 Q ss_pred eeeccccccc----cceeEEeccccccccccccccccccccccccccccccceeeeeccccccccccccccceeeeccCC Q lcl|NC_019538. 228 VASRYAGLTG----DNIQVAFIAYKDYYKFGVDGKISSVNTVNLKTFPSGLSFGNITPSSYLEYGPQTKDQFAMIVFVGG 303 (678) Q Consensus 228 i~A~~~G~~g----n~i~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~ 303 (678) +..+-.... .--++.+..... ........+....... .....+ . .-.+.+..++ T Consensus 64 -A~~yF~~~~~~~~~P~~l~igR~~~---~a~~~~l~g~~~~~~~-----~~~~~i-----------~--~G~lsitv~G 121 (504) T protein:vir:96 64 -AAAYFKFISKSVNSPSSISFARWVN---TAIAPMVVGDNLPKTI-----ADFAGF-----------S--AGVLTIMVGA 121 (504) T ss_pred -HHHHhhcCCCCCccccEEEEEeecC---cCccceEEechhHHHH-----HHHhhh-----------h--ceEEEEEEcc Confidence 000000000 000111111000 0000000000000000 000000 0 0001111111 Q ss_pred eeeeee--eeeccccccccccchhhhhhh---------------hcCCcceEEEEecC-CC----------ccccceeee Q lcl|NC_019538. 304 SAVESR--ILSVKENDRDIYGSSIYVDEF---------------FINGYSTFIQGVAE-SW----------PTEYSGILT 355 (678) Q Consensus 304 ~~~~~~--~~~~~~~~~~~~~~~~~~~~~---------------~~~~~~~~v~~~~~-~~----------~~~~~~~~~ 355 (678) ...... .++.. +........+... .....++++..... +. ....+..+. T Consensus 122 ~~~~~~~i~~S~~---ts~~~vA~~i~~al~~~~~~~~~~~tv~~d~~~~~f~its~~tg~~~~~~~~~a~~~~~~~~lg 198 (504) T protein:vir:96 122 AEKNITAIDTSAA---TSMDNVASIIQTEIRKNTDPQLAQATVTWNPNTNQFTLVGATIGTGVLAVAKSADPQDMSTALG 198 (504) T ss_pred eeeeecccccccc---cchHHHHHHHHhhhhcccccccccceEEEeccCCeEEEEeeccccceeEEEeeccccchhhhhh Confidence 111000 00000 0000000000000 00011111111100 00 000000000 Q ss_pred eccCc----CCccccchhHHHhhhhhhhccchhccccccccccccCcccchhHHHHHHHHHHHhcCCeEEEEccccchhc Q lcl|NC_019538. 356 FGGGN----SGNSTASAGDWIEGWDMFSDREHVDVNLFIAGSCAGEGVEIASTVQKSVAAICDERQDCLGWISPPREYMV 431 (678) Q Consensus 356 l~gg~----dg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~l~~~~~~~~~~~~i~d~p~~~~~ 431 (678) +..+. .|....+ ...++..+......=.-+..+ ....+ ..-..++..+++....++.+...- T Consensus 199 l~~~~~~~v~g~~aet---~~~al~al~~~~~~Wy~f~~a----~~~~~--dd~ilalA~w~ea~~~~~~~~~~~----- 264 (504) T protein:vir:96 199 WSTSNVVNVAGQAADL---PDAAVAKSTNVSNNFGSFLFA----GATLD--NDQIKAVSAWNAAQNNQFIYTVAT----- 264 (504) T ss_pred cccccceEEeeccccc---HHHHHHHHHhhcCCeEEEEEE----eccCC--HHHHHHHHHHHhhcCceEEEEEee----- Confidence 00000 0000000 011111111111000000000 00001 111233444554443333211110 Q ss_pred cccccCCHHHHHHHHhcccccccchhhccccccceEEEEcCeEEEecccCCceeEechHHHHHHHHHHHhhcC--CceEC Q lcl|NC_019538. 432 NLPVATAVKKMVEWRRGVTDSGVVVDDNMNIGTTYSSTSANYKLQYDKYNDTNRWIPLSADMAGLCARTDTVA--QPWMS 509 (678) Q Consensus 432 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~v~d~~~~~~~~~pps~~vAG~~a~~d~~~--g~~~s 509 (678) ..+.. ........ .+.+.. ..++... .... -++.+..|.++.+|-.+ | -.. T Consensus 265 --~~~~~-~~~~~~~~---------------~~~~~~-----~~~~~~~--~~~~-~~~~~~~~~~as~~f~~~ng-~~T 317 (504) T protein:vir:96 265 --SLANL-GALFDLVK---------------GNSGTA-----LNVLSAT--ASND-FVEQCPSEILAATNYDEPGA-SQN 317 (504) T ss_pred --cccch-hhHHHhhh---------------hcceeE-----EEEeecC--ccch-hHHHHHHHHHHhcCcCcccc-ccc Confidence 00000 00000000 000000 0011000 0111 24455667777776333 2 011 Q ss_pred cCCcchhheeecccceecCChhhhhhhhhCCcEEEEEecCCc--EEEe-ccccCCCCccccceeehhhHHHHHHHHHHHH Q lcl|NC_019538. 510 PAGFNRGQILDVRKLAIETRQAHRDELYQNSMNPVVGFPGQG--FILY-GDKTMSLQPTPFDRINVRRLFNLLKKSISES 586 (678) Q Consensus 510 Pan~~~~~v~~~~~~~~~~~~~e~~~L~~~gIn~i~~~~~~G--~~~w-G~rT~~~~~~~~~~i~vrR~~~~i~~si~~~ 586 (678) -..|.+.||. ...++..|.+.|..+++|++..|-+.| +.+| ...++++. -+|.+|.+-+-.+|++..|+.. T Consensus 318 ~~fk~l~GVt-----a~~lt~t~~~aL~~~~~N~y~~~a~~~~~~~~~~~G~~~gG~-~~~~wiDv~~~~~WL~~~lq~~ 391 (504) T protein:vir:96 318 YMYYQFPGRN-----ITVSDDTAANTVDKSRGNYIGVTQANGQQLAFYQRGILCGGP-TDAVDMNVYANEIWLKSAIAQA 391 (504) T ss_pred ccccccCCcC-----cccCCHHHHHHHHhcCCeEEEEeecccceeeEEecCeeeCCc-cccchhhhhhhHHHHHHHHHHH Confidence 2334444442 235799999999999999999887655 4555 45555553 2477899999999999999999 Q ss_pred HHHHhcC----CCCHHHHHHHHHHHHHHHHHHHhcCCee-----------------------------eeEEEEc-cCCC Q lcl|NC_019538. 587 AKYKLFE----NNDAFTRNSFRSEVNSYLDSIKSLGGIY-----------------------------DFRVVCD-ETNN 632 (678) Q Consensus 587 ~~~~vfe----pn~~~~~~~v~~~i~~~L~~l~~~gal~-----------------------------g~~V~~d-~~~n 632 (678) +....-. |-|+.=...|+..|+.-|++-+++|.|. ||+|.++ .++- T Consensus 392 l~~l~~~~~kIPyt~~Gi~~l~a~i~~vl~~av~~G~I~~Gv~~~~~q~~~I~~~~~~d~~~~~~~~~GYyv~~~~~s~~ 471 (504) T protein:vir:96 392 LLDLFLNVNAVPASMVGEAMTLAVLQPVLDKATSNGTFTYGKDISAVQQQYITQITGDRRAWRQVQTLGYWINITFSSYT 471 (504) T ss_pred HHHHHhcCCCcccCHhhHHHHHHHHHHHHHHHHhcceeccCccCCccchheecccccccccccceeccceEEEecChhcc Confidence 8764433 4477888899999999999999999872 4888886 3444 Q ss_pred CHH-HhhCCEEEEEEEEEecCCceEEEEEEEEe Q lcl|NC_019538. 633 TPA-VIDRNEFVATILIKPARSINYVSLNFSAV 664 (678) Q Consensus 633 t~~-~i~~G~~~~~i~~~p~~p~e~i~~~~~~~ 664 (678) +++ .-.++-..+.+.++---.+++|++.-.-. T Consensus 472 s~~~r~~R~~~~~~~~y~~~gaI~~v~~~~~~v 504 (504) T protein:vir:96 472 NSNTGLTEWKANYTLIYSKGDAIRFVEGSDVMI 504 (504) T ss_pred ChhHhhhccccceEEEEEECCeEEEEEeccccC Confidence 544 44455567778888888888887763322 No 63 >protein:vir:99586 Length: 507 # NCBI annotation: hypothetical protein # Family: family:all:396 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039795;genbank:gi:126011045;genbank:GeneID:4818281 Probab=87.38 E-value=0.039 Score=28.30 Aligned_cols=437 Identities=10% Similarity=0.013 Sum_probs=152.7 Q ss_pred cccccccceeeecccccccccceeecccccccceeeeeeecccccccccceeeeeeccccceeeecccccccccccccc- Q lcl|NC_019538. 133 LNSVGGITFVRFSTAEVVKKAKELNDYPALQNGWQIQFTSGGPGSGQSATAVLNGIRQDSKIYIRNDEYSRESLLRRDE- 211 (678) Q Consensus 133 ~~~~~~~~~~~~~~a~~~~~a~~~~~~~~~~~~~~~~~~s~~~~~g~~a~~~~~~~~~~~~i~~~~~~~a~~~~~~~~~- 211 (678) .....+.+.+......--..+. .......-....... .-......... .+ ...-....+++ T Consensus 1 mip~s~iVnV~~~v~~~a~~~~---------~~~~~lilt~~~~~~---~~r~~~y~s~~-~V-----~~~FG~~S~ey~ 62 (507) T protein:vir:99 1 MISQSRYVRIVSGVGAGAPVAQ---------RRLIMRVMTTNAVLP---PGVVFESSSAD-AV-----GAYFGMASEEYK 62 (507) T ss_pred CCCccceeEEeeeccccCcccc---------cccceeeeccccCCC---ccceEeecCHH-HH-----HHhcCCChHHHH Confidence 1111111111100000000000 000000000000000 00000000000 00 00000011110 Q ss_pred cccchhcccc----ccccceeeeccccccccceeEEecccccccccc--ccccccccccccccccccccceeeeeccccc Q lcl|NC_019538. 212 TTETYIDMCE----SYGIPVVASRYAGLTGDNIQVAFIAYKDYYKFG--VDGKISSVNTVNLKTFPSGLSFGNITPSSYL 285 (678) Q Consensus 212 ~~~~~~~~~~----~~~~~~i~A~~~G~~gn~i~v~v~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 285 (678) ....+-+..+ .|.... .|.|.+................ .....++....... +.....+..+.. T Consensus 63 aA~~yFsq~p~~~~~P~~L~-----igR~~~~~~~a~l~g~~~~~~l~~~~~~~~G~lti~v~-----G~~~t~~~i~lS 132 (507) T protein:vir:99 63 RAKAYMSFISKSINSPSYIS-----FARWVNAAIASMIVGDSLVKNLPALKAVATPTLSLSIG-----GTVVPIAGIDLT 132 (507) T ss_pred HHHHHhccCCCCCcccceEE-----EEeecCccccceeecchhhhhHHHHhhhcceeEEEEEc-----CceeEecccccc Confidence 0111111111 122211 1222111000000000000000 00000000000000 000000000000 Q ss_pred ccccccc--------------cc-ceeeeccCCeeeeeeeeeccccccccccchhhhhhhhcCCcceEEEEecCCCc--- Q lcl|NC_019538. 286 EYGPQTK--------------DQ-FAMIVFVGGSAVESRILSVKENDRDIYGSSIYVDEFFINGYSTFIQGVAESWP--- 347 (678) Q Consensus 286 ~~~~~~~--------------~~-~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~--- 347 (678) ....... .+ +...+..+ .....|.+.....+... ..-........ T Consensus 133 ~~ts~~~vAs~i~~~l~a~~~~~~~~~tv~~d-~~~~~F~v~s~~tG~~s----------------~i~~at~~~~gt~~ 195 (507) T protein:vir:99 133 AALTLTDVAATLQTKIRASANAELATATVTFN-TTTNQFVLNGTTTGALA----------------PTITAVRTDPATDI 195 (507) T ss_pred ccCCHHHHHHHHHHhhhccccccccceEEEEe-cCCceEEEEeeeccccc----------------eeEEEEcCCchhhH Confidence 0000000 00 00000000 00011211111111000 00000000000 Q ss_pred -----cccceeeeeccCcCCccccchhHHHhhhhhhhcc-chhccccccccccccCcccchhHHHHHHHHHHHhcCCeEE Q lcl|NC_019538. 348 -----TEYSGILTFGGGNSGNSTASAGDWIEGWDMFSDR-EHVDVNLFIAGSCAGEGVEIASTVQKSVAAICDERQDCLG 421 (678) Q Consensus 348 -----~~~~~~~~l~gg~dg~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~v~~~l~~~~~~~~~~~~ 421 (678) ....+. ....|.+. .+ ...+...+... ..+- .+.. ...+..+ ..-..+|..+++....++. T Consensus 196 s~l~~~~~~~a-~~~~g~~a---et---~~~a~~a~~~~~~nW~--~~~~--a~~~~~t--d~~~lalA~wiea~~~~f~ 262 (507) T protein:vir:99 196 SSLLGWTNTGT-VFVKGQAA---ET---PDTSISKSAAISTNFG--SFIY--TSTPALT--NDQITAVASWNASQNNMYM 262 (507) T ss_pred HHHhccccccc-eEeecccc---cC---HHHHHHHHHhhcCCeE--EEEE--EeccccC--hHHHHHHHHHHhhcCcEEE Confidence 000010 11112211 11 11122222211 1110 0000 0111112 2234556777777766664 Q ss_pred EEccccchhccccccCCHHHHHHHHhcccccccchhhccccccceEEEEcCeEEEecccCCceeEechHHHHHHHHHHHh Q lcl|NC_019538. 422 WISPPREYMVNLPVATAVKKMVEWRRGVTDSGVVVDDNMNIGTTYSSTSANYKLQYDKYNDTNRWIPLSADMAGLCARTD 501 (678) Q Consensus 422 i~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~v~d~~~~~~~~~pps~~vAG~~a~~d 501 (678) ....-. .. ........ ......+...++.+ ..-.-.+.+.+.|.++.+| T Consensus 263 ~~~~~~-------~a----~~~~~~~~-----------~~~~~~~~~~~~~~---------~~~~~~~~aa~~g~~as~n 311 (507) T protein:vir:99 263 YSVPTT-------IA----NIGTLYAA-----------VKGFSGCALNITSD---------SLPVDYIEQSPCEILAATD 311 (507) T ss_pred EEEecC-------ch----hhhhhhhh-----------hhhcceeEEEeecc---------cccchhHHHHHHHHHHhhc Confidence 332110 00 00000000 00000111111111 1111224567777777776 Q ss_pred hcC--CceECcCCcchhheeecccceecCChhhhhhhhhCCcEEEEEecCCc--EEEeccccCCCCccccceeehhhHHH Q lcl|NC_019538. 502 TVA--QPWMSPAGFNRGQILDVRKLAIETRQAHRDELYQNSMNPVVGFPGQG--FILYGDKTMSLQPTPFDRINVRRLFN 577 (678) Q Consensus 502 ~~~--g~~~sPan~~~~~v~~~~~~~~~~~~~e~~~L~~~gIn~i~~~~~~G--~~~wG~rT~~~~~~~~~~i~vrR~~~ 577 (678) -++ | -..-..|.+.||. ...++..|.+.|..+++|+...+.+.| +.+|-.-.+++-..+|.++.+-+=.+ T Consensus 312 f~~~ng-~~T~~fk~l~GV~-----a~~lt~t~a~al~~~n~N~y~~~a~~~~~~~~~~~G~~~gG~~~fid~d~~~~~~ 385 (507) T protein:vir:99 312 YTRVNA-TQNYMYYQFPSRN-----ITVSDDTTANLVDANRGNYIGQTQSAGQSLAFYQRGILCGGPNDAVDMNIYANEI 385 (507) T ss_pred cCcCcc-ceeecccccCCcc-----cccCCHHHHHHHHhcCCeEEEEeccccceeeEEecCeeeCCcccceeeeeecchH Confidence 432 2 0112233334442 235899999999999999999987644 66666555544322477777767677 Q ss_pred HHHHHHHHHHHHHhcC----CCCHHHHHHHHHHHHHHHHHHHhcCCee-----------------------------eeE Q lcl|NC_019538. 578 LLKKSISESAKYKLFE----NNDAFTRNSFRSEVNSYLDSIKSLGGIY-----------------------------DFR 624 (678) Q Consensus 578 ~i~~si~~~~~~~vfe----pn~~~~~~~v~~~i~~~L~~l~~~gal~-----------------------------g~~ 624 (678) ||+..|+..+....-. |-|..=...|+..|+.-|++-+++|.|. ||+ T Consensus 386 WL~~~iq~~l~~l~~~~~kIPyt~~G~~~l~a~i~~~l~~av~nG~I~~Gvtl~~~q~~~in~~~~~~~~~~~~~~~Gyy 465 (507) T protein:vir:99 386 WLKSAISAQILSLFLNVPRVPANETGESMLLSVIQSVVNTAKNNGTISAGKNLNVIQQQYITQISGDANAWRQVANIGYW 465 (507) T ss_pred HHHHHHHHHHHHHHhcCCCCccChhhHHHHHHHHHHHHHHHHhccccccCCcccccchheecccccccccccceeccceE Confidence 8888888888764322 4577888889999999999999998883 277 Q ss_pred EEEc-cCCCCH-HHhhCCEEEEEEEEEecCCceEEEEEEEEe Q lcl|NC_019538. 625 VVCD-ETNNTP-AVIDRNEFVATILIKPARSINYVSLNFSAV 664 (678) Q Consensus 625 V~~d-~~~nt~-~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~ 664 (678) |.++ .++.++ +...++...+.+.+.---.+++|++.-.-+ T Consensus 466 ~~~~~~s~~~~~~r~~r~~~~~~~~y~~~gaI~~v~~~~~~v 507 (507) T protein:vir:99 466 LNITFSSYTNPNTQLTEWKASYQLIYSKDDAIRFVEGTDTLI 507 (507) T ss_pred EEeCChHhcChhhhhccccceEEEEEEeCCeEEEEEeeeecC Confidence 7775 344444 444577777888888888888887764433 No 64 >protein:vir:106730 Length: 501 # NCBI annotation: gp04 # Family: family:all:396 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944312;genbank:gi:38638611;genbank:GeneID:2657359 Probab=81.50 E-value=0.085 Score=26.45 Aligned_cols=433 Identities=8% Similarity=-0.023 Sum_probs=150.2 Q ss_pred ccccceeeecccccccccceeeccccccc----c--eeeeeeecccccccccceeeeeeccccceeeecccccccccccc Q lcl|NC_019538. 136 VGGITFVRFSTAEVVKKAKELNDYPALQN----G--WQIQFTSGGPGSGQSATAVLNGIRQDSKIYIRNDEYSRESLLRR 209 (678) Q Consensus 136 ~~~~~~~~~~~a~~~~~a~~~~~~~~~~~----~--~~~~~~s~~~~~g~~a~~~~~~~~~~~~i~~~~~~~a~~~~~~~ 209 (678) |+.. ..+ ..+.+...+.+.. . ............... .......- ......-....+ T Consensus 1 m~~~---~ip------~s~iV~V~~~v~~~~~~~~~f~~lll~~~~~~~~~---r~~~y~s~------~~V~~~FG~~S~ 62 (501) T protein:vir:10 1 MPTT---TIP------IDQIVQMLPGVIGAGGAPGRLTGLVLTQDTSVQPG---QLADFFQK------TDVENWFGALSN 62 (501) T ss_pred CCcC---ccc------cceEEEEeeecccCCCcccccceEEEecccCCCcc---ceeeecCH------HHHHHhcCCChH Confidence 1100 000 0000000000000 0 000000000000000 00000000 000000001111 Q ss_pred cc-cccchhc----cccccccceeeeccccccccce--------eEEeccccccccccccccccccccccccccccccce Q lcl|NC_019538. 210 DE-TTETYID----MCESYGIPVVASRYAGLTGDNI--------QVAFIAYKDYYKFGVDGKISSVNTVNLKTFPSGLSF 276 (678) Q Consensus 210 ~~-~~~~~~~----~~~~~~~~~i~A~~~G~~gn~i--------~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 276 (678) ++ ....+-+ -.++|...-+ |.|.+.- .+.......+.. ..+....... .... ..+... T Consensus 63 ey~aA~~yFsg~~~q~p~P~~l~i-----gR~~~~~~~~~l~g~~l~~~~la~~~~--~~g~l~i~i~-g~~~-~~~i~~ 133 (501) T protein:vir:10 63 EAKIADAYFPGIVNGGQLPYDLKF-----ARYVAADAPASVYGIPLTGITLAQLQG--YSGTLTVTTA-AQHV-SANISL 133 (501) T ss_pred HHHHHHHHhhhhcCCCccccEEEE-----EeecccCccceeeeceehhhhhhhhhh--eeeEEEEeec-ccee-eecccc Confidence 11 0111111 1233333222 1221110 000000000000 0000000000 0000 000000 Q ss_pred ee-eeccccccccccccc-cceeeeccCCeeeeeeeeeccccccccccchhhhhhhhcCCcceEEEEecCCCccccceee Q lcl|NC_019538. 277 GN-ITPSSYLEYGPQTKD-QFAMIVFVGGSAVESRILSVKENDRDIYGSSIYVDEFFINGYSTFIQGVAESWPTEYSGIL 354 (678) Q Consensus 277 ~~-~~~~~~~~~~~~~~~-~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~ 354 (678) .. .+...... .-.... .....+. -+.....|+++....+... ........ .+.+..+ T Consensus 134 s~ats~~~vA~-~i~~al~~~~~tv~-~d~~~~~f~i~~~t~G~~~----------------~i~~~t~~---~d~a~~l 192 (501) T protein:vir:10 134 AAATSFANAAT-LIEAAFTSPDFVVA-YDALRNRFTVVTNTTGTAA----------------AISAVTGT---NNLADEL 192 (501) T ss_pred ccccCHHHHHH-HHHHhhcCCceEEE-EecccceEEEEecccCcce----------------eEEEeecc---ccchhhh Confidence 00 00000000 000000 0000000 0011111222211111100 00000000 0001111 Q ss_pred eeccCc------CCccccchhHHHhhhhhhhccc-hhccccccccccccCcccchhHHHHHHHHHHHhcCCeEEEE--cc Q lcl|NC_019538. 355 TFGGGN------SGNSTASAGDWIEGWDMFSDRE-HVDVNLFIAGSCAGEGVEIASTVQKSVAAICDERQDCLGWI--SP 425 (678) Q Consensus 355 ~l~gg~------dg~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~v~~~l~~~~~~~~~~~~i~--d~ 425 (678) .|..+. .|... .....++..+.... .+-. +. .... ....-..++..+++....++.+. |. T Consensus 193 ~Lt~~~~a~v~~~g~~a---et~~~Al~a~~~~~~~Wy~-f~-----~a~~--~~~~~~la~A~wi~a~~~~f~~~~~~~ 261 (501) T protein:vir:10 193 GLSAAAGATLQAAGVAA---DTPASAMNRAVGLSRNWAT-FT-----TAWT--AVIADRLAFAAWNSGQAYKYMYVAPDL 261 (501) T ss_pred cccccCceeEEecCccc---ccHHHHHHHHHhcccceEE-EE-----EEec--CChHHHHHHHHHHHhcCceEEEEEecC Confidence 111110 00011 11122222222211 1100 00 1111 11222345667777665544222 21 Q ss_pred ccchhccccccCCHHHHHHHHhcccccccchhhccccccceEEEEcCeEEEecccCCceeEechHHHHHHHHHHHhhcCC Q lcl|NC_019538. 426 PREYMVNLPVATAVKKMVEWRRGVTDSGVVVDDNMNIGTTYSSTSANYKLQYDKYNDTNRWIPLSADMAGLCARTDTVAQ 505 (678) Q Consensus 426 p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~v~d~~~~~~~~~pps~~vAG~~a~~d~~~g 505 (678) .. ...+.. ...++..... ..++ .+....|+ . -.|.+.+.|..+.+|-++- T Consensus 262 ~~-~~~~~~---~~~~i~~~l~-----------~~~y--~~t~~~y~------~-------~~~~aa~~g~~as~nf~~~ 311 (501) T protein:vir:10 262 EA-ASIVTN---NAASFGAQVF-----------AAPY--QGTLPLYG------D-------QATAGAVMGYAASINFQLR 311 (501) T ss_pred cc-eeeecc---cchhHHHHHH-----------hcCC--CceEEECC------C-------CCHHHHHHHHHHhcCcccC Confidence 10 000000 0011100000 1111 12222221 1 2366777888888875431 Q ss_pred c-eECcCCcch-hheeecccceecCChhhhhhhhhCCcEEEEEecCCc--EEEeccccCCCCccccceeehhhHHHHHHH Q lcl|NC_019538. 506 P-WMSPAGFNR-GQILDVRKLAIETRQAHRDELYQNSMNPVVGFPGQG--FILYGDKTMSLQPTPFDRINVRRLFNLLKK 581 (678) Q Consensus 506 ~-~~sPan~~~-~~v~~~~~~~~~~~~~e~~~L~~~gIn~i~~~~~~G--~~~wG~rT~~~~~~~~~~i~vrR~~~~i~~ 581 (678) . -..-..|.+ .|| ....++..|.+.|..+|+|++..|.+.| +.+|-.-+++++ |.+|.+.+-.+|++. T Consensus 312 ~g~~T~~fkql~~Gv-----~a~~l~~t~a~al~~~~~N~y~~~~~~~~~~~~~~~G~~sG~---~~wiD~~~g~dWl~~ 383 (501) T protein:vir:10 312 NGRTVLAFRQFNAGV-----PATAHDLPTANALRSNNYTYIGAYANAANNYTIAYDGKLSGK---FLWVDTYLDQIYLNA 383 (501) T ss_pred cceeeeeecccCCCc-----CcccCCHHHHHHHHhcCCeEEEEEecccceeeEEEcceeecc---ceehhhHhhHHHHHH Confidence 1 011122322 222 1235788999999999999999987655 888866667764 567888888899999 Q ss_pred HHHHHHHHHhcC----CCCHHHHHHHHHHHHHHHHHHHhcCCee-----------------------------eeEEEEc Q lcl|NC_019538. 582 SISESAKYKLFE----NNDAFTRNSFRSEVNSYLDSIKSLGGIY-----------------------------DFRVVCD 628 (678) Q Consensus 582 si~~~~~~~vfe----pn~~~~~~~v~~~i~~~L~~l~~~gal~-----------------------------g~~V~~d 628 (678) .|+..+...+-. |-|..=...|+..|+.-|++-+++|.|. ||++.++ T Consensus 384 ~iq~~l~~ll~~~~kIPyt~~G~~~l~a~i~~~l~~av~nG~Ia~Gv~l~~~q~~~i~~~~g~~~~~~~v~~~Gyy~~~~ 463 (501) T protein:vir:10 384 ELQRAEFEAMLAYNSLPYNEDGYTANYRAGVDVIDAAVTSGIIRAGVTLTNSQLQQIDAVAGVAGAGALVQTRGWYFLIG 463 (501) T ss_pred HHHHHHHHHHhcCCCcccCHHHHHHHHHHHHHHHHHHHhCcceecCcccCcccceeecccccccccccceeccceEEeeC Confidence 999888764432 4467778889999999999999999883 3667766 Q ss_pred cC-CCCHHHhhCCEEEEEEEEEecCCceEEEEEEEEeecCceeeecc Q lcl|NC_019538. 629 ET-NNTPAVIDRNEFVATILIKPARSINYVSLNFSAVGTSANFDELV 674 (678) Q Consensus 629 ~~-~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~ 674 (678) .. +.++..-.+....+.+.++--..+++|++-. .||+ T Consensus 464 ~~~~~~~~R~~R~~p~~~~~y~~~gaIh~v~i~s---------~~v~ 501 (501) T protein:vir:10 464 NPANPGQARQNRTSPACTLWYSDGGSIQELTIGS---------NAVI 501 (501) T ss_pred cccCChhhhhhcccCceEEEEEeCCceeEEEeee---------eecC Confidence 43 3333333344446666666666666665532 2233 No 65 >protein:vir:101576 Length: 501 # NCBI annotation: gp04 # Family: family:all:396 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958108;genbank:gi:41057654;genbank:GeneID:2716834 Probab=77.18 E-value=0.13 Score=25.49 Aligned_cols=440 Identities=9% Similarity=-0.011 Sum_probs=147.0 Q ss_pred ccccceeeecccccccccceeeccccccc-ceeeeeeecccccccccceeeeeeccccceeee--cccccccccccccc- Q lcl|NC_019538. 136 VGGITFVRFSTAEVVKKAKELNDYPALQN-GWQIQFTSGGPGSGQSATAVLNGIRQDSKIYIR--NDEYSRESLLRRDE- 211 (678) Q Consensus 136 ~~~~~~~~~~~a~~~~~a~~~~~~~~~~~-~~~~~~~s~~~~~g~~a~~~~~~~~~~~~i~~~--~~~~a~~~~~~~~~- 211 (678) |+. ...+. .+.+...+.+.. ................ .+..+...... ......-....+++ T Consensus 1 m~~---~~ip~------s~iV~V~~~v~~~~~~~~~~~~l~l~~~~------~~~~~~~~~~~s~~~V~~~FG~~S~ey~ 65 (501) T protein:vir:10 1 MPT---TTIPI------DQIVQMLPGVIGAGGAPGRLTGLVLTQDT------SVQPGQLADFFQETDVENWFGALSNEAK 65 (501) T ss_pred CCC---CCccc------ceEEEEeeecccCCCccccceeEEEeccC------CCCccceEEecCHHHHHHhcCCChHHHH Confidence 110 00000 000000000000 0000000000000000 00000000000 00000000111111 Q ss_pred cccchhc----cccccccceeeeccccccccce-eEEecc--cccccccccccccccccccccccc--ccccceee-eec Q lcl|NC_019538. 212 TTETYID----MCESYGIPVVASRYAGLTGDNI-QVAFIA--YKDYYKFGVDGKISSVNTVNLKTF--PSGLSFGN-ITP 281 (678) Q Consensus 212 ~~~~~~~----~~~~~~~~~i~A~~~G~~gn~i-~v~v~~--~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~-~~~ 281 (678) ....+-+ -.++|....+ |.|...- ...+.. .+..... .-....+......... ..+..... ++. T Consensus 66 aA~~yFsg~~~q~p~P~~l~i-----gR~~~~~~~~~l~g~~l~~~~la-~~~~~sg~l~vti~g~~~~~~i~ls~ats~ 139 (501) T protein:vir:10 66 IADAYFPGIVNGGQLPYDLKF-----ARYVAADAPASVYGIPLTGVTLA-QLQGYSGTLTVTTAAQHVSANISLAAATSF 139 (501) T ss_pred HHHHHhhhhcCCCccccEEEE-----EeecCCCccceEeccchhhhhhh-hcceeeeEEEEeeccceeecccccccccCH Confidence 0111111 1233332222 1121100 000000 0000000 0000000000000000 00000000 000 Q ss_pred ccccccccccc-ccceeeeccCCeeeeeeeeeccccccccccchhhhhhhhcCCcceEEEEecCCCccccceeeeeccCc Q lcl|NC_019538. 282 SSYLEYGPQTK-DQFAMIVFVGGSAVESRILSVKENDRDIYGSSIYVDEFFINGYSTFIQGVAESWPTEYSGILTFGGGN 360 (678) Q Consensus 282 ~~~~~~~~~~~-~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~l~gg~ 360 (678) ..... .-... ....+.+.. +.....|++.....+... . +..... ....+..+.|..+. T Consensus 140 ~~vAs-~i~~al~~~~~tv~~-d~~~~~f~its~ttG~~~----------------~-i~~~~~--~~~la~~l~Lt~~~ 198 (501) T protein:vir:10 140 ANAAT-LIEAAFTSPDFVVAY-DALRNRFTVVTNATGTAA----------------A-ISAVTG--TNNLADELGLSAAA 198 (501) T ss_pred HHHHH-HHhhhccCCceEEEE-cccCceEEEEeeccCCce----------------e-EEEeeC--chhhhhhcCccccc Confidence 00000 00000 000000000 001111111111111100 0 000000 00000001111110 Q ss_pred ------CCccccchhHHHhhhhhhhccc-hhccccccccccccCcccchhHHHHHHHHHHHhcCCeEEEEccccc-hhcc Q lcl|NC_019538. 361 ------SGNSTASAGDWIEGWDMFSDRE-HVDVNLFIAGSCAGEGVEIASTVQKSVAAICDERQDCLGWISPPRE-YMVN 432 (678) Q Consensus 361 ------dg~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~v~~~l~~~~~~~~~~~~i~d~p~~-~~~~ 432 (678) .|... .....++..+.... .+-. ....... ...-..++..+++....++.+...-.. ...+ T Consensus 199 ~a~v~~~g~~a---et~~~a~~a~~~~~~~Wy~------f~~a~~~--~~~~~la~A~wiea~~~~f~~~~~~~~~~~~~ 267 (501) T protein:vir:10 199 GATLQAAGVAA---DTPASAMNRAVGLSRNWAT------FTTAWTA--VIADRLAFAAWNSGQAYKYMYVAPDLEAASIV 267 (501) T ss_pred cceEEecCccc---ccHHHHHHHHHhccCceEE------EEEecCC--ChHHHHHHHHHHHhcCceEEEEEecCchhhhh Confidence 00011 11112222222111 1100 0011111 122344567777776655433221100 0000 Q ss_pred ccccCCHHHHHHHHhcccccccchhhccccccceEEEEcCeEEEecccCCceeEechHHHHHHHHHHHhhcCCc-eECcC Q lcl|NC_019538. 433 LPVATAVKKMVEWRRGVTDSGVVVDDNMNIGTTYSSTSANYKLQYDKYNDTNRWIPLSADMAGLCARTDTVAQP-WMSPA 511 (678) Q Consensus 433 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~v~d~~~~~~~~~pps~~vAG~~a~~d~~~g~-~~sPa 511 (678) .....++.... ...++ .+....|+. ..+.+.+.|.++.+|-++-. -..-. T Consensus 268 ---~~~~~~i~~~l-----------~~~~y--~~t~~~y~~-------------~~~~aa~~g~~as~nf~~~~g~~T~~ 318 (501) T protein:vir:10 268 ---TNNAASFGAQV-----------FAAPY--QGTLPLYGD-------------QATAGAVMGYAASINFQLRNGRTVLA 318 (501) T ss_pred ---hhhhhhHHHHH-----------HhcCC--CceEEECCC-------------CcHHHHHHHHHHhhCcccCccceeee Confidence 00001110000 01111 122222221 12456777788877754321 01112 Q ss_pred Ccch-hheeecccceecCChhhhhhhhhCCcEEEEEecCCc--EEEeccccCCCCccccceeehhhHHHHHHHHHHHHHH Q lcl|NC_019538. 512 GFNR-GQILDVRKLAIETRQAHRDELYQNSMNPVVGFPGQG--FILYGDKTMSLQPTPFDRINVRRLFNLLKKSISESAK 588 (678) Q Consensus 512 n~~~-~~v~~~~~~~~~~~~~e~~~L~~~gIn~i~~~~~~G--~~~wG~rT~~~~~~~~~~i~vrR~~~~i~~si~~~~~ 588 (678) +|.+ .|| ....++..|.+.|..+++|+...+.+.| +.+|-.-+++++ |.+|.+-+-.+|++..|+..+. T Consensus 319 fkq~~~Gi-----~a~~lt~t~a~al~~~~~N~y~~~~~~~~~~~~~~~G~~sG~---~~wiD~~~~~~Wl~~~iq~~l~ 390 (501) T protein:vir:10 319 FRQFNAGV-----PATAHDLPTANALRSNNYTYIGAYANAANNYTIAYDGKLSGK---FLWVDTYLDQIYLNAELQRAEF 390 (501) T ss_pred ccccCCCc-----CcccCCHHHHHHHHhcCCeEEEEeccccceeeEEecCeeecc---ceeehhhhhHHHHHHHHHHHHH Confidence 2222 122 1235789999999999999999986544 788866677764 5678888877888888887776 Q ss_pred HHhcC----CCCHHHHHHHHHHHHHHHHHHHhcCCee-----------------------------eeEEEEccC-CCCH Q lcl|NC_019538. 589 YKLFE----NNDAFTRNSFRSEVNSYLDSIKSLGGIY-----------------------------DFRVVCDET-NNTP 634 (678) Q Consensus 589 ~~vfe----pn~~~~~~~v~~~i~~~L~~l~~~gal~-----------------------------g~~V~~d~~-~nt~ 634 (678) ..+-. |-|..=...|+..|+.-|++-+++|.|. ||++.++.. +.++ T Consensus 391 ~ll~~~~kIPyt~~G~~~l~a~v~~~l~~av~nG~I~~Gv~l~~~q~~~i~~~~g~~~~~~~v~~~Gyy~~~~~~~~~~~ 470 (501) T protein:vir:10 391 EAMLAYNSLPYNEDGYTALYRAGVDVIDAAVTSGIIRAGVTLTNSQLQQIDAAAGVAGAGQLVQTRGWYFLIGDPANPGQ 470 (501) T ss_pred HHHHhcCCcccCHHHHHHHHHHHHHHHHHHHhCceeecCCCCCcccceeeccccCccccccceeccceeEeeccccCChh Confidence 54332 5577888889999999999999999883 366666633 3333 Q ss_pred HHhhCCEEEEEEEEEecCCceEEEEEEEEeecCceeeecc Q lcl|NC_019538. 635 AVIDRNEFVATILIKPARSINYVSLNFSAVGTSANFDELV 674 (678) Q Consensus 635 ~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~ 674 (678) +...+....+.+.++--..+++|++-. .||+ T Consensus 471 ~R~~R~~p~~~~~y~~~gaIh~v~i~s---------~~v~ 501 (501) T protein:vir:10 471 ARQNRTTPACTLWYSDGGSIQQLTIGS---------NAVI 501 (501) T ss_pred hhhhccccceEEEEEeCCceeEEEeee---------eecC Confidence 333344446666666666666665532 2233 No 66 >protein:vir:3636 Length: 501 # NCBI annotation: gp04 # Family: family:all:396 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705631;genbank:gi:23752316;genbank:GeneID:955753 Probab=76.14 E-value=0.14 Score=25.29 Aligned_cols=444 Identities=9% Similarity=-0.020 Sum_probs=152.0 Q ss_pred ccccceeeecccccccccceeeccccccc-ceeeeeeeccccccccccee--eeeeccccceeeecccccccccccccc- Q lcl|NC_019538. 136 VGGITFVRFSTAEVVKKAKELNDYPALQN-GWQIQFTSGGPGSGQSATAV--LNGIRQDSKIYIRNDEYSRESLLRRDE- 211 (678) Q Consensus 136 ~~~~~~~~~~~a~~~~~a~~~~~~~~~~~-~~~~~~~s~~~~~g~~a~~~--~~~~~~~~~i~~~~~~~a~~~~~~~~~- 211 (678) |+.. ..+ ..+.+...+.+.. ..................+. ......- ......-....+++ T Consensus 1 m~~~---~ip------~s~iV~V~~~v~~~~~~~~~~~~lllt~~~~~~~~r~~~y~s~------~~V~~~FG~~S~ey~ 65 (501) T protein:vir:36 1 MPTT---TIP------IDQIVQMLPGVIGAGGAPGRLTGLVLTQDTSVQPGQLADFFQE------TDVENWFGALSNEAK 65 (501) T ss_pred CCcC---Ccc------cceEEEEeeeeccCCCcceeeeeEEEeccCCCCCcceeeecCH------HHHHHhcCCChHHHH Confidence 1100 000 0000000000000 00000000000000000000 0000000 00000000011110 Q ss_pred cccchhc----cccccccceeeeccccccccceeE-Eecc--ccccccccccccccccccccccccccccceeeeecccc Q lcl|NC_019538. 212 TTETYID----MCESYGIPVVASRYAGLTGDNIQV-AFIA--YKDYYKFGVDGKISSVNTVNLKTFPSGLSFGNITPSSY 284 (678) Q Consensus 212 ~~~~~~~----~~~~~~~~~i~A~~~G~~gn~i~v-~v~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 284 (678) ....+-+ -.++|....+ |.|.+.-.. .+.. .+..... ......+.......... ....+..+.. T Consensus 66 aA~~yFs~~~~q~~~P~~l~i-----gR~~~~a~~~~l~g~~l~~~~~a-~~~~~sg~l~vti~g~~---~~~~i~lS~~ 136 (501) T protein:vir:36 66 IADAYFPGIVNGGQLPYDLKF-----ARYVAADAPASVYGIPLTGVTLA-QLQGYSGTLTVTTAAQH---VSANISLAAA 136 (501) T ss_pred HHHHHhhcccCCCccccEEEE-----EeecCcCcceeEeccchhhhhhh-hccceeEEEEEEeccee---eeeecccccc Confidence 0111111 1223332222 112111000 0000 0000000 00000000000000000 0000000000 Q ss_pred cc-----cccccc-ccceeeeccCCeeeeeeeeeccccccccccchhhhhhhhcCCcceEEEEecCCCccccceeeeecc Q lcl|NC_019538. 285 LE-----YGPQTK-DQFAMIVFVGGSAVESRILSVKENDRDIYGSSIYVDEFFINGYSTFIQGVAESWPTEYSGILTFGG 358 (678) Q Consensus 285 ~~-----~~~~~~-~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~l~g 358 (678) .. ..-... ......|.. +.....|++.....+......... + +..+ +..... ....+...... T Consensus 137 ts~~~vA~~i~~al~~~~~tv~~-d~~~~~f~i~s~t~G~~~~i~~~t-------~-~~~i-a~~l~L-t~~~~a~v~~~ 205 (501) T protein:vir:36 137 TSFANAATLIEAAFTSPDFVVAY-DALRNRFTVVTNATGTAAAISAVT-------G-TNNF-ADEIGL-SAAAGATLQAA 205 (501) T ss_pred cCHHHHHHHHhhhhcCcceEEEE-cCcceeEEEEeccCCcceeeEeee-------c-ccch-hhhhcc-cccCcceEEec Confidence 00 000000 000000100 011111222222111100000000 0 0000 000000 00000001111 Q ss_pred CcCCccccchhHHHhhhhhhhccc-hhccccccccccccCcccchhHHHHHHHHHHHhcCCeEEEEccccchhccccccC Q lcl|NC_019538. 359 GNSGNSTASAGDWIEGWDMFSDRE-HVDVNLFIAGSCAGEGVEIASTVQKSVAAICDERQDCLGWISPPREYMVNLPVAT 437 (678) Q Consensus 359 g~dg~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~v~~~l~~~~~~~~~~~~i~d~p~~~~~~~~~~~ 437 (678) |.+. .+ ...++..+.... .+-. +. .....+ ..-..+|..+++....++.+...-.... ..... T Consensus 206 g~~~---et---~~~al~a~~~~s~~Wy~-f~-----~a~~~~--~~~~la~A~wiea~~~~f~~~~~~~~~~--~~~~~ 269 (501) T protein:vir:36 206 GVAA---DT---PASAMNRAVGLSRNWAT-FT-----TAWTAV--IADRLAFASWNSGQAYKYMYVAPDLEAA--SIVSN 269 (501) T ss_pred cccc---cc---HHHHHHHHHhccCceEE-EE-----EecCCC--hHHHHHHHHHHhhcCceEEEEEecCchh--hhhcc Confidence 1111 11 112222222221 1100 01 111111 2234467777777766543322111000 00000 Q ss_pred CHHHHHHHHhcccccccchhhccccccceEEEEcCeEEEecccCCceeEechHHHHHHHHHHHhhcC--CceECcCCcch Q lcl|NC_019538. 438 AVKKMVEWRRGVTDSGVVVDDNMNIGTTYSSTSANYKLQYDKYNDTNRWIPLSADMAGLCARTDTVA--QPWMSPAGFNR 515 (678) Q Consensus 438 ~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~v~d~~~~~~~~~pps~~vAG~~a~~d~~~--g~~~sPan~~~ 515 (678) ...++-.... ..++ .+....| ++ ..|.+++.|..+.+|-++ | -..-.+|.+ T Consensus 270 ~~~~i~~~l~-----------~~~y--~~t~~~y------~~-------~~~~aa~~g~~as~nf~~~~g-~~T~~fkq~ 322 (501) T protein:vir:36 270 NAASFGAQVF-----------AAPY--QGTLPLY------GD-------QATAGAVMGYAASINFQLRNG-RTVLAFRQF 322 (501) T ss_pred chhhHHHHHH-----------hcCC--CcEEEEc------CC-------CCHHHHHHHHHHhcCcccCcc-eeeeecccc Confidence 1111111100 1111 1222221 11 224566778888777433 2 011122332 Q ss_pred -hheeecccceecCChhhhhhhhhCCcEEEEEecCCc--EEEeccccCCCCccccceeehhhHHHHHHHHHHHHHHHHhc Q lcl|NC_019538. 516 -GQILDVRKLAIETRQAHRDELYQNSMNPVVGFPGQG--FILYGDKTMSLQPTPFDRINVRRLFNLLKKSISESAKYKLF 592 (678) Q Consensus 516 -~~v~~~~~~~~~~~~~e~~~L~~~gIn~i~~~~~~G--~~~wG~rT~~~~~~~~~~i~vrR~~~~i~~si~~~~~~~vf 592 (678) .|| ....++..|.+.|..+|+|++..|.+.+ +.+|-.-+++++ |.||.+.+-.+|++..|+..+...+- T Consensus 323 ~~Gi-----~a~~l~~t~a~al~~~~~N~y~~~~~~~~~~~~~~~G~~sG~---~~wiD~~~g~dWL~~~iq~~l~~ll~ 394 (501) T protein:vir:36 323 NAGV-----PATVHDLPTANALRSNNYTYIGAYANAANNYTIAYDGKLSGK---FLWVDTYLDQIYLNAELQRAEFEAML 394 (501) T ss_pred CCCc-----CcCcCCHHHHHHHHhcCCcEEEEEecccceeeEEEcCeeecc---chhhhHHHhHHHHHHHHHHHHHHHHh Confidence 222 1235788999999999999998886544 788876677775 56789999999999999998886543 Q ss_pred C----CCCHHHHHHHHHHHHHHHHHHHhcCCee-----------------------------eeEEEEccCCCC-HHHhh Q lcl|NC_019538. 593 E----NNDAFTRNSFRSEVNSYLDSIKSLGGIY-----------------------------DFRVVCDETNNT-PAVID 638 (678) Q Consensus 593 e----pn~~~~~~~v~~~i~~~L~~l~~~gal~-----------------------------g~~V~~d~~~nt-~~~i~ 638 (678) . |-|..=...|+..|+.-|++-+++|.|. ||++.++....+ ++.-. T Consensus 395 ~~~KIPytd~G~~~l~a~i~~~l~~av~nG~I~~Gv~~~~~q~~~I~~~~g~~~~~~~v~~~Gyy~~~~~~~~~~~~R~~ 474 (501) T protein:vir:36 395 AYNSLPYNEDGYTGLYRAGVDVIDAAVTSGIIRAGVTLTNSQLQQIDKAARVAGAGQLVQTRGWYFLIGDPANPGQARQN 474 (501) T ss_pred cCCCCccChhhHHHHHHHHHHHHHHHHhCceeecCCCCCcccceeecccccccccccceeccceEEeeCcccCChhhhhh Confidence 3 4577778889999999999999999883 366777643333 33333 Q ss_pred CCEEEEEEEEEecCCceEEEEEEEEeecCceeeecc Q lcl|NC_019538. 639 RNEFVATILIKPARSINYVSLNFSAVGTSANFDELV 674 (678) Q Consensus 639 ~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~ 674 (678) +....+.+.++--..+++|++-. .||+ T Consensus 475 R~~p~~~~~y~~~gaIh~v~i~s---------~~v~ 501 (501) T protein:vir:36 475 RTTPACTLWYSDGGSIQSLTIGS---------NAVI 501 (501) T ss_pred cccCcEEEEEEeCCceeEEEeee---------eeeC Confidence 44446666666666677666532 2233 No 67 >protein:vir:78611 Length: 501 # NCBI annotation: BcepNY3gp03 # Family: family:all:396 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294840;genbank:gi:149882903;genbank:GeneID:5291082 Probab=57.76 E-value=0.43 Score=22.61 Aligned_cols=442 Identities=9% Similarity=0.002 Sum_probs=146.2 Q ss_pred ccccceeeecccccccccceeeccccccc-ceeeeeeeccccccccccee--eeeeccccceeeecccccccccccccc- Q lcl|NC_019538. 136 VGGITFVRFSTAEVVKKAKELNDYPALQN-GWQIQFTSGGPGSGQSATAV--LNGIRQDSKIYIRNDEYSRESLLRRDE- 211 (678) Q Consensus 136 ~~~~~~~~~~~a~~~~~a~~~~~~~~~~~-~~~~~~~s~~~~~g~~a~~~--~~~~~~~~~i~~~~~~~a~~~~~~~~~- 211 (678) |+.. ..+ ..+.+...+.+.. ..................+. ......- ......-....+++ T Consensus 1 m~~~---~ip------~s~iV~V~~~v~~~~~~~~~~~~lll~~~~~~~~~r~~~y~s~------~~V~~~FG~~S~ey~ 65 (501) T protein:vir:78 1 MPTT---TIP------IDQIVQMLPGVIGAGGAPGRLTGLVLTQDTSIQPGQLADFFQK------TDVENWFGGLSNEAV 65 (501) T ss_pred CCcC---ccc------cceEEEEeeecccCCCcceeeeeEEEecCCCCCccceeeecCH------HHHHHhcCCChHHHH Confidence 1100 000 0000000100000 00000000000000000000 0000000 00000000011110 Q ss_pred cccchhc----cccccccceeeeccccccccce--------eEEeccccccccccccccccccccccccccccccceee- Q lcl|NC_019538. 212 TTETYID----MCESYGIPVVASRYAGLTGDNI--------QVAFIAYKDYYKFGVDGKISSVNTVNLKTFPSGLSFGN- 278 (678) Q Consensus 212 ~~~~~~~----~~~~~~~~~i~A~~~G~~gn~i--------~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~- 278 (678) ....+-+ -.++|...-+ |.|...- .+.......+... .+....+... ... ..+....+ T Consensus 66 aA~~yFs~~~~q~~~P~~l~i-----gR~~~~a~~~~l~g~~l~~~~la~~~~~--~G~l~iti~g-~~~-~~~i~~S~~ 136 (501) T protein:vir:78 66 IADAYFPGIVNGGQLPYDLKF-----ARYVAADAPASVYGIPLTGVTLTQLQGY--SGTLTVTTAA-QHV-SSNISLAAA 136 (501) T ss_pred HHHHHhhcCCCCCcccceEEE-----EeecccCcceeEeccceeccchhhhcee--eeEEEEEecc-cee-eeccccccc Confidence 0111111 1223322221 1221110 0000000000000 0000000000 000 00000000 Q ss_pred eeccccccccccccccceeeeccCCeeeeeeeeeccccccccccchhhhhhhhcCCcceEEEEecCCCc-cccceeeeec Q lcl|NC_019538. 279 ITPSSYLEYGPQTKDQFAMIVFVGGSAVESRILSVKENDRDIYGSSIYVDEFFINGYSTFIQGVAESWP-TEYSGILTFG 357 (678) Q Consensus 279 ~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~-~~~~~~~~l~ 357 (678) .+...........-......+.. +.....|+++....+....... .. + +.++ +.... ....+..... T Consensus 137 ts~~~vA~~i~~al~a~~~tv~~-ds~~~~f~its~t~G~~~~i~~--~t-----~-~~~~---a~~l~Lt~~~~a~v~~ 204 (501) T protein:vir:78 137 TSFANAATLIEAAFTSPDFVVSY-DALRNRFVVNTNATGTAAAISA--VT-----G-TNNL---ADELGLSAAAGASLQA 204 (501) T ss_pred cCHHHHHHHHHhhhcCcceEEEE-ccccceEEEEeeecCCceeEEE--Ee-----c-ccch---hhhhcccccCceeeEe Confidence 00000000000000000000000 0011112211111111000000 00 0 0000 00000 0000000111 Q ss_pred cCcCCccccchhHHHhhhhhhhccc-hhccccccccccccCcccchhHHHHHHHHHHHhcCCeEEEE--ccccchhcccc Q lcl|NC_019538. 358 GGNSGNSTASAGDWIEGWDMFSDRE-HVDVNLFIAGSCAGEGVEIASTVQKSVAAICDERQDCLGWI--SPPREYMVNLP 434 (678) Q Consensus 358 gg~dg~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~v~~~l~~~~~~~~~~~~i~--d~p~~~~~~~~ 434 (678) .|.+. .+ ...++..+.... .+-. +. ..... ...-..++..+++....++.+. |... ...+.. T Consensus 205 ~g~~a---et---~~~a~~a~~~~~~~Wy~-f~-----~a~~~--~~~~~lalA~wiea~~~~f~~~~~~~~~-~~~~~~ 269 (501) T protein:vir:78 205 AGVAA---DT---PASAMNRAVGLSRNWAT-FT-----TAWTA--VIADRLALASWNSGQAYKYMYVAPDLEP-ASIVTN 269 (501) T ss_pred ccccc---cC---HHHHHHHHHhccCceEE-EE-----EecCC--CHHHHHHHHHHHHhcCceEEEEEecCCc-ceeecc Confidence 11111 11 112222222211 1110 00 11111 1223445777777766554332 1110 000000 Q ss_pred ccCCHHHHHHHHhcccccccchhhccccccceEEEEcCeEEEecccCCceeEechHHHHHHHHHHHhhcCCc-eECcCCc Q lcl|NC_019538. 435 VATAVKKMVEWRRGVTDSGVVVDDNMNIGTTYSSTSANYKLQYDKYNDTNRWIPLSADMAGLCARTDTVAQP-WMSPAGF 513 (678) Q Consensus 435 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~v~d~~~~~~~~~pps~~vAG~~a~~d~~~g~-~~sPan~ 513 (678) ...++-.... ..++ .+....|+. -.+.+.+.|..+.+|-++-. -..-..| T Consensus 270 ---~~~~i~~~l~-----------a~~y--~~t~~~y~~-------------~~~~aa~~g~~as~nf~~~~g~~T~~fk 320 (501) T protein:vir:78 270 ---NSASFGAQVF-----------AAPY--QGTLPLYGD-------------QATAGAVMGYAASINFQLRNGRTVLAFR 320 (501) T ss_pred ---cchhHHHHHh-----------hcCC--CceEEEcCC-------------cchHHHHHHHHHhcCcccCcceeeeecc Confidence 0011100000 1111 122222211 11445667777777644321 0111222 Q ss_pred ch-hheeecccceecCChhhhhhhhhCCcEEEEEecCCc--EEEeccccCCCCccccceeehhhHHHHHHHHHHHHHHHH Q lcl|NC_019538. 514 NR-GQILDVRKLAIETRQAHRDELYQNSMNPVVGFPGQG--FILYGDKTMSLQPTPFDRINVRRLFNLLKKSISESAKYK 590 (678) Q Consensus 514 ~~-~~v~~~~~~~~~~~~~e~~~L~~~gIn~i~~~~~~G--~~~wG~rT~~~~~~~~~~i~vrR~~~~i~~si~~~~~~~ 590 (678) .+ .|| ....++..|.+.|..+|+|++..|.+.| +.+|-.-+++++ |.+|.+-+-.+|++..|+..+... T Consensus 321 q~~~Gv-----~a~~l~~t~a~al~~~~~N~y~~~~~~~~~~~~~~~G~~sG~---~~wiD~~~~~~Wl~~~iq~~l~~l 392 (501) T protein:vir:78 321 QFNAGV-----PATAHDLGTANALRSNNYTYIGAYANAANNYTIAYDGKLSGK---FLWVDTYLDQIYLNAELQRAEFEA 392 (501) T ss_pred ccCCCc-----CcccCCHHHHHHHHhcCCeEEEEEecccceeeEEEcCeeecc---ceeehhhhhHHHHHHHHHHHHHHH Confidence 22 222 1235788999999999999999887655 888866677764 566888887788888888877754 Q ss_pred hcC----CCCHHHHHHHHHHHHHHHHHHHhcCCee-----------------------------eeEEEEccC-CCCHHH Q lcl|NC_019538. 591 LFE----NNDAFTRNSFRSEVNSYLDSIKSLGGIY-----------------------------DFRVVCDET-NNTPAV 636 (678) Q Consensus 591 vfe----pn~~~~~~~v~~~i~~~L~~l~~~gal~-----------------------------g~~V~~d~~-~nt~~~ 636 (678) +-. |-|..=...|+..|+.-|++-+++|.|. ||++.++.. +.++.. T Consensus 393 l~~~~kIPyt~~G~~~l~a~v~~~l~~av~nG~I~~Gv~l~~~q~~~I~~~~g~~~~~~~~~~~Gyy~~~~~~~~~~~~R 472 (501) T protein:vir:78 393 MLAYNSLPYNEDGYTALYRAGVDVIDAAVTSGIIRAGVTLTNSQLQQIDAAAGVAGAGQLVQMRGWYFLIGDPANPGQAR 472 (501) T ss_pred HHhCCCcccCHHHHHHHHHHHHHHHHHHHhCceeecCCCCCCccceeeccccCccccccceeccceEEeeccccCChhhh Confidence 322 5577788889999999999999999883 366666633 333333 Q ss_pred hhCCEEEEEEEEEecCCceEEEEEEEEeecCceeeecc Q lcl|NC_019538. 637 IDRNEFVATILIKPARSINYVSLNFSAVGTSANFDELV 674 (678) Q Consensus 637 i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~ 674 (678) ..+....+.+.++--..+++|++-. .||+ T Consensus 473 ~~R~~p~~~~~y~~~gaIh~v~i~s---------~~v~ 501 (501) T protein:vir:78 473 QNRTTPTCTLWYSDGGSIQELTIGS---------NAVI 501 (501) T ss_pred hhcccCcEEEEEEeCCceeEEEeee---------eecC Confidence 3344445566666666666665432 2233 Done!