Query lcl|NC_020841.1_cdsid_YP_007673367.1 [gene=PSYG_00056] [protein=hypothetical protein] [protein_id=YP_007673367.1] [location=29112..30215] Match_columns 367 No_of_seqs 136 out of 236 Neff 8.7 Searched_HMMs 1612 Date Thu Nov 7 16:53:30 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_56 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_56_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:5260 Length: 502 # 100.0 4E-106 2E-109 598.6 40.0 359 5-367 1-502 (502) 2 protein:vir:3636 Length: 501 # 100.0 2.4E-98 1E-101 555.8 37.5 360 3-366 1-501 (501) 3 protein:vir:96104 Length: 504 100.0 7.4E-98 5E-101 553.1 37.5 359 6-365 1-504 (504) 4 protein:vir:106730 Length: 501 100.0 6.7E-98 4E-101 553.3 37.2 360 3-366 1-501 (501) 5 protein:vir:101576 Length: 501 100.0 1.3E-97 8E-101 551.8 36.7 360 3-366 1-501 (501) 6 protein:vir:78611 Length: 501 100.0 2.1E-97 1E-100 550.6 36.4 360 3-366 1-501 (501) 7 protein:vir:99586 Length: 507 100.0 4.7E-97 3E-100 548.7 37.3 360 6-366 1-507 (507) 8 protein:vir:94073 Length: 494 100.0 1.7E-93 1.1E-96 529.1 35.0 357 5-367 1-494 (494) 9 protein:vir:80052 Length: 331 100.0 5.2E-90 3.2E-93 510.1 36.5 327 8-367 1-331 (331) 10 protein:vir:95263 Length: 450 100.0 7.7E-90 4.8E-93 509.1 36.0 332 8-367 1-449 (450) 11 protein:vir:107720 Length: 515 100.0 6.2E-87 3.9E-90 493.2 33.0 357 7-366 1-515 (515) 12 protein:vir:3165 Length: 426 # 100.0 2.3E-64 1.4E-67 369.5 26.4 339 7-367 1-426 (426) 13 protein:vir:6079 Length: 396 # 99.3 9.5E-11 5.9E-14 75.5 28.6 330 5-367 1-383 (396) 14 protein:vir:1845 Length: 392 # 99.2 1.9E-10 1.2E-13 73.8 28.8 330 5-367 1-380 (392) 15 protein:vir:1172 Length: 391 # 99.2 1.6E-10 1E-13 74.3 27.5 330 1-367 1-379 (391) 16 protein:vir:2035 Length: 396 # 99.2 1.5E-10 9.4E-14 74.4 27.2 331 5-367 1-383 (396) 17 protein:vir:98553 Length: 395 99.2 2.8E-10 1.7E-13 73.0 28.7 329 5-367 1-383 (395) 18 protein:vir:5711 Length: 396 # 99.2 3.5E-10 2.2E-13 72.4 28.3 328 5-367 1-383 (396) 19 protein:vir:100323 Length: 393 99.1 1.3E-09 8.3E-13 69.2 29.9 331 3-367 1-380 (393) 20 protein:vir:102359 Length: 356 99.1 7.9E-11 4.9E-14 76.0 22.1 333 1-365 1-356 (356) 21 protein:vir:96740 Length: 388 99.1 1.6E-09 9.7E-13 68.9 30.4 331 1-367 1-377 (388) 22 protein:vir:103993 Length: 390 99.0 3.8E-09 2.3E-12 66.8 27.2 329 1-367 1-378 (390) 23 protein:vir:78206 Length: 390 99.0 3.8E-09 2.3E-12 66.8 27.2 329 1-367 1-378 (390) 24 protein:vir:79141 Length: 391 99.0 3.2E-09 2E-12 67.1 26.8 329 1-367 1-378 (391) 25 protein:vir:79181 Length: 390 98.9 1.5E-08 9.6E-12 63.4 27.2 328 1-367 1-378 (390) 26 protein:vir:10336 Length: 386 98.8 2.9E-08 1.8E-11 61.9 27.4 329 1-367 1-379 (386) 27 protein:vir:107310 Length: 581 98.7 2.6E-08 1.6E-11 62.1 22.5 326 1-367 177-568 (581) 28 protein:vir:99306 Length: 587 98.7 5.2E-08 3.2E-11 60.5 24.0 326 1-367 191-582 (587) 29 protein:vir:102957 Length: 437 98.7 4E-08 2.5E-11 61.1 22.0 311 1-366 86-437 (437) 30 protein:vir:95741 Length: 587 98.7 5E-08 3.1E-11 60.6 22.5 327 1-367 191-582 (587) 31 protein:vir:96586 Length: 587 98.6 1.7E-07 1E-10 57.7 24.3 328 1-367 191-582 (587) 32 protein:vir:7653 Length: 581 # 98.5 4.4E-07 2.8E-10 55.4 24.1 320 1-367 198-568 (581) 33 protein:vir:107865 Length: 477 98.5 4.6E-07 2.9E-10 55.3 28.6 328 1-367 1-467 (477) 34 protein:vir:105470 Length: 451 98.4 1.4E-07 8.9E-11 58.1 18.2 315 1-366 86-451 (451) 35 protein:vir:79092 Length: 477 98.4 8E-07 4.9E-10 54.0 29.8 328 1-367 1-467 (477) 36 protein:vir:104858 Length: 729 98.4 1E-06 6.5E-10 53.4 25.3 335 1-367 292-717 (729) 37 protein:vir:79798 Length: 717 98.3 4.8E-07 3E-10 55.2 19.3 314 1-367 335-717 (717) 38 protein:vir:78986 Length: 436 98.3 2.2E-07 1.3E-10 57.1 16.7 307 1-366 107-436 (436) 39 protein:vir:63742 Length: 562 98.3 1.3E-06 8.2E-10 52.8 20.9 322 1-367 177-557 (562) 40 protein:vir:80488 Length: 562 98.2 2.7E-06 1.7E-09 51.1 20.6 324 1-367 177-557 (562) 41 protein:vir:104477 Length: 749 98.2 3.4E-06 2.1E-09 50.6 25.8 334 1-367 331-739 (749) 42 protein:vir:6894 Length: 660 # 98.1 3.7E-06 2.3E-09 50.4 22.9 314 1-367 257-646 (660) 43 protein:vir:103456 Length: 659 98.0 6.4E-06 4E-09 49.1 25.1 332 1-367 236-646 (659) 44 protein:vir:106984 Length: 743 98.0 7E-06 4.3E-09 48.8 25.3 335 1-367 326-732 (743) 45 protein:vir:6594 Length: 666 # 98.0 8.3E-06 5.1E-09 48.4 25.5 330 1-367 220-651 (666) 46 protein:vir:7206 Length: 659 # 98.0 8.9E-06 5.5E-09 48.3 24.1 330 1-367 223-646 (659) 47 protein:vir:80779 Length: 569 98.0 9.5E-06 5.9E-09 48.1 22.8 307 1-367 227-564 (569) 48 protein:vir:100829 Length: 607 97.9 9.7E-06 6E-09 48.1 22.2 319 1-367 234-596 (607) 49 protein:vir:80984 Length: 666 97.9 1.1E-05 7.1E-09 47.7 22.8 308 1-367 277-651 (666) 50 protein:vir:98824 Length: 774 97.8 2.1E-05 1.3E-08 46.2 22.4 330 1-367 376-767 (774) 51 protein:vir:5833 Length: 742 # 97.8 2.2E-05 1.4E-08 46.1 25.9 324 1-367 371-736 (742) 52 protein:vir:5663 Length: 671 # 97.7 2.3E-05 1.4E-08 46.0 20.6 326 1-367 270-661 (671) 53 protein:vir:108052 Length: 660 97.7 3.2E-05 2E-08 45.2 27.1 330 1-367 245-647 (660) 54 protein:vir:101187 Length: 663 97.5 5.6E-05 3.5E-08 43.9 25.9 332 1-367 220-648 (663) 55 protein:vir:98263 Length: 664 97.5 6.2E-05 3.8E-08 43.7 24.9 328 1-367 245-650 (664) 56 protein:vir:106427 Length: 679 97.3 0.0001 6.4E-08 42.5 25.2 328 1-367 232-665 (679) 57 protein:vir:102819 Length: 648 97.1 0.00016 9.6E-08 41.5 18.4 323 1-367 248-645 (648) 58 protein:vir:101804 Length: 663 96.9 0.00028 1.7E-07 40.1 27.6 325 1-367 261-648 (663) 59 protein:vir:100539 Length: 663 96.9 0.00028 1.8E-07 40.0 25.6 319 1-367 293-648 (663) 60 protein:vir:4517 Length: 498 # 96.0 0.0011 6.7E-07 36.9 25.3 344 1-355 1-498 (498) 61 protein:vir:489 Length: 498 # 95.8 0.0015 9.1E-07 36.1 24.1 344 1-355 1-498 (498) 62 protein:vir:276 Length: 369 # 95.6 0.0018 1.1E-06 35.6 29.8 331 5-367 1-366 (369) 63 protein:vir:3788 Length: 376 # 93.6 0.007 4.3E-06 32.4 28.6 333 6-367 1-371 (376) 64 protein:vir:4463 Length: 498 # 89.3 0.027 1.7E-05 29.2 25.4 344 1-355 1-498 (498) 65 protein:vir:3751 Length: 376 # 86.5 0.045 2.8E-05 28.0 27.8 329 6-367 1-373 (376) 66 protein:vir:78782 Length: 370 84.9 0.057 3.5E-05 27.4 29.6 332 6-367 1-363 (370) 67 protein:vir:1996 Length: 495 # 66.2 0.27 0.00017 23.7 24.1 326 1-360 115-495 (495) No 1 >protein:vir:5260 Length: 502 # NCBI annotation: hypothetical protein # Family: family:all:396 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852765;genbank:gi:31544040;uniprot:Q7Y5T5;genbank:GeneID:2753559 Probab=100.00 E-value=3.7e-106 Score=598.57 Aligned_cols=359 Identities=25% Similarity=0.405 Sum_probs=329.3 Q ss_pred ccccccceEEEEEeeeccccccccccceEEEeecc---ccCcccceEEEecHHHHHhccCCCcHHHHHHHHHhccCcccc Q lcl|NC_020841. 5 LTLPINMLVNVSIEYQAKLLSRDAFNRLLIVGSTA---PNGRATDTGIYTSIDGVKLDYGVEADEYKIAQKYFSQNPKPR 81 (367) Q Consensus 5 ~~l~i~~iv~V~i~~~~~~~~~~~fg~~li~~~~~---~~~~~~~~~~yts~~~v~~df~~~s~~ykaA~~~F~Q~p~p~ 81 (367) |++|||+||+|+|++++.+++.++||.+|||++++ +.+..+|++.|++.++|..+||.+||+||+|+.||+|+|+|. T Consensus 1 msip~s~ivnV~i~~~~~a~~~~~f~~~l~l~~~~~~~~~~~~~r~~~~~s~~~V~~~FG~~s~ey~aA~~yF~q~p~P~ 80 (502) T protein:vir:52 1 MALSISHIVNVQLNTVPKSAARKSFGIVALFTPEAGQAFADEKTRYVYVENQRDVEQLFGTNSETAKAAQPFFAQSPRAK 80 (502) T ss_pred CCCCccceeEEeeccccccccccccCceEEEeeccCccccCCccceEEecCHHHHHHhcCCChHHHHHHHHHhcCCCccc Confidence 99999999999999999999999999999999876 455678999999999999999999999999999999999999 Q ss_pred eEEEEeccCc---------------------------------------------------------------------- Q lcl|NC_020841. 82 DLMIATVTAL---------------------------------------------------------------------- 91 (367) Q Consensus 82 ~v~v~~~~~~---------------------------------------------------------------------- 91 (367) +|+|+||..+ T Consensus 81 ~l~igR~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~l~i~i~g~~~t~~~i~lS~~ts~~~vA~~i~~~l~~~~~~ 160 (502) T protein:vir:52 81 QLIVARWQKSASTIEATKNTLSGATLSDDLERFKSVVNGRFSLTIGGDVKKVDGLSFARLADFNAVATKIQEKLTTLSVA 160 (502) T ss_pred eEEEEeccccccceeechhhhhhhhhHHhHHHhhhhcCceeEEEecceeeeeeccccccccchhHHHHHHHhhhcccccc Confidence 9998776311 Q ss_pred ----------------------------------------------------------------cchHHHHHHHHhcccC Q lcl|NC_020841. 92 ----------------------------------------------------------------TDPLASIGEVAAKTLG 107 (367) Q Consensus 92 ----------------------------------------------------------------~t~~~~l~~~~~~~~~ 107 (367) +++.+++.++.+.+.+ T Consensus 161 ~tv~~d~~~~~F~i~s~ttg~~~~~~~~~a~~~~~~gt~~a~~l~l~~~~~av~v~~~~~g~~aet~~~al~a~~~~~~~ 240 (502) T protein:vir:52 161 VSIAYDETGNRFIVSANVAGEDKKTEIDYAIDEGGEGEYIGALLKLENGQASRKVGKNSVSLKKETLGEALFNVAEVNNT 240 (502) T ss_pred eEEEEecCCceEEEEeccCCCcceeEEEEeecCCcchhHHHHHhccccccceeeeeeecccccccCHHHHHHHHHhccCc Confidence 1223345567778889 Q ss_pred cEEEEEEecCCHHHHHHHHHHhhccCcEEEEEEeCchh----hhHHHHHHHHhccccceeecCCch-hHHHHHHHHHHHc Q lcl|NC_020841. 108 FYAFCFASEVAAADIQGLAEWAQSNNRMFMTVMTDDTE----AVTTGNALKELGQYHYCITYHEDY-ATVGAVAGMALDQ 182 (367) Q Consensus 108 w~~~~~~~~~~~~~~~ala~~~ea~~~~~~~~~~d~~~----~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~ 182 (367) ||+|+++++.++++++++|+|+|+++|+|+++..|... ..++.+.++..++.|+++.||++. ...++++|++++. T Consensus 241 w~~~~~a~~~~~~~~la~a~~iea~~~~f~~~~~d~~~~~~~~~~i~~~l~a~~~~~t~~~y~~~~~~~~aa~~g~~as~ 320 (502) T protein:vir:52 241 WYGFTVAAQLTDSEVEAAAKYAQANTKLFGANVIRAEQIEWSADNIYKKLYDAGLDHTLAMFDKNDMYPVSSALARLLST 320 (502) T ss_pred eEEEEEeecCChhHHHHHHHHHhhcCcEEEEEecCcceeccccchHHHHHHhccCceeEEEecCCcchhHHHHHHHHHhc Confidence 99999999999999999999999999999988876533 345678889999999999999764 4556789999999 Q ss_pred ccccCcceeeeeeeecCcccccCCCHHHHHHHHhCCceEEEEeeccccceEEEecCEeeCCchhhHHHHHHHHHHHHHHH Q lcl|NC_020841. 183 RYDKTDGVKTLHLKSLVSVVSTDISQTQAASLKAACINYYSDYGNPDNSLPIFANGHAGGGKFFDFVMGFDWLRNVIETN 262 (367) Q Consensus 183 ~~~~~~g~~t~~~k~l~Gv~~~~~t~t~~~~l~~~~~n~y~~~~~~~~~~~~~~~G~~~~G~~iD~~~~~dwl~~~lq~~ 262 (367) +|++.+|++|||||+++||+|++++.+|+++|+++|||||+++.+ +.++++|++++|+|||++||+|||+++||++ T Consensus 321 ~f~~~~g~iT~~fk~l~GV~~~~lt~t~~~al~~~~~N~y~~~~~----~~~~~~G~~~~G~~iD~~~~~~Wl~~~lq~~ 396 (502) T protein:vir:52 321 NFAANNSTLTLKFKQQPTITADEITATEFAKAKRLGINVYTYFDD----VAMIAEGTVIGGKFADEIVILDWFVDAVQKE 396 (502) T ss_pred CCCcCcceeeecccccCCcccCcCCHHHHHHHHhcCceEEEEecC----eeEEecCeeeCCchhhHHHHHHHHHHHHHHH Confidence 999999999999999999999999999999999999999999853 4589999999999999999999999999999 Q ss_pred HHHHHHh-cCCCCcCHhHHHHHHHHHHHHHHHHHhcCceecccccCccccccccccccccceeEEcCchHhCCHHHHhcc Q lcl|NC_020841. 263 VFNGQRL-RRLTPQTDRGMMMIKADIVNGLEEAVKAGLVAAGTWNGAALGEIETYDYLPTGYYVYNESIRDQAQVIREQR 341 (367) Q Consensus 263 l~~ll~~-~~kipy~~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~g~~~~~~~~~~gy~v~~~~~~~~~~~dr~~R 341 (367) |+++|.+ ++|||||+.|+++|+++|+++|+++++||+|+||+|+++++|.+..+|++.+||||+.|++++|+++||++| T Consensus 397 l~~~L~~s~~kIPy~~~G~~~l~a~i~~~l~~a~~~G~I~~G~~~~~~~g~~~~~d~~~~gy~v~~~~~~~~s~~dr~~R 476 (502) T protein:vir:52 397 VFARLYKSPTKIPLTDKGQAILIAAVEKVCLEGINNGAFAPGKWTGAGFGNLSTGDYLDKGFYVWAAPMDTLSDSDRQAR 476 (502) T ss_pred HHHHHHhcCCCcccChhHHHHHHHHHHHHHHHHHhcCccccccccCcccceeeecccccCceEEEeCchhhCCHHHHHcc Confidence 9998865 589999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccCCeEEEEEECceEEEEEEEEEecC Q lcl|NC_020841. 342 IAPPFIILVKGAGAIHDTDITLIPEA 367 (367) Q Consensus 342 ~~~~~~~~~~~agaIh~v~i~~~v~~ 367 (367) ++|+++|+|+++||||+|+|++||+- T Consensus 477 ~~~~~~~~~~~aGaIh~v~i~~nv~~ 502 (502) T protein:vir:52 477 RATPIQTAVKLAGAIHSSDVIVNYNR 502 (502) T ss_pred cCCCeEEEEEECceEEEEEEEEEEeC Confidence 99999999999999999999999999 No 2 >protein:vir:3636 Length: 501 # NCBI annotation: gp04 # Family: family:all:396 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705631;genbank:gi:23752316;genbank:GeneID:955753 Probab=100.00 E-value=2.4e-98 Score=555.76 Aligned_cols=360 Identities=19% Similarity=0.233 Sum_probs=319.2 Q ss_pred cc-ccccccceEEEEEeeeccccccccccceEEEeeccccCcccceEEEecHHHHHhccCCCcHHHHHHHHHhc----cC Q lcl|NC_020841. 3 GS-LTLPINMLVNVSIEYQAKLLSRDAFNRLLIVGSTAPNGRATDTGIYTSIDGVKLDYGVEADEYKIAQKYFS----QN 77 (367) Q Consensus 3 ~~-~~l~i~~iv~V~i~~~~~~~~~~~fg~~li~~~~~~~~~~~~~~~yts~~~v~~df~~~s~~ykaA~~~F~----Q~ 77 (367) |+ .++|+|+||+|++.+.+++.+.++|+ .|+|+..+.++ .+|.+.|+++++|..+||.+||+||+|+.||+ |+ T Consensus 1 m~~~~ip~s~iV~V~~~v~~~~~~~~~~~-~lllt~~~~~~-~~r~~~y~s~~~V~~~FG~~S~ey~aA~~yFs~~~~q~ 78 (501) T protein:vir:36 1 MPTTTIPIDQIVQMLPGVIGAGGAPGRLT-GLVLTQDTSVQ-PGQLADFFQETDVENWFGALSNEAKIADAYFPGIVNGG 78 (501) T ss_pred CCcCCcccceEEEEeeeeccCCCcceeee-eEEEeccCCCC-CcceeeecCHHHHHHhcCCChHHHHHHHHHhhcccCCC Confidence 55 46999999999999999999999998 67788777765 47999999999999999999999999999998 99 Q ss_pred cccceEEEEeccCc------------------------------------------------------------------ Q lcl|NC_020841. 78 PKPRDLMIATVTAL------------------------------------------------------------------ 91 (367) Q Consensus 78 p~p~~v~v~~~~~~------------------------------------------------------------------ 91 (367) |+|.+|+|+||..+ T Consensus 79 ~~P~~l~igR~~~~a~~~~l~g~~l~~~~~a~~~~~sg~l~vti~g~~~~~~i~lS~~ts~~~vA~~i~~al~~~~~tv~ 158 (501) T protein:vir:36 79 QLPYDLKFARYVAADAPASVYGIPLTGVTLAQLQGYSGTLTVTTAAQHVSANISLAAATSFANAATLIEAAFTSPDFVVA 158 (501) T ss_pred ccccEEEEEeecCcCcceeEeccchhhhhhhhccceeEEEEEEecceeeeeecccccccCHHHHHHHHhhhhcCcceEEE Confidence 99999998865311 Q ss_pred ---------------------------------------------------cchHHHHHHHHhcccCcEEEEEEecCCHH Q lcl|NC_020841. 92 ---------------------------------------------------TDPLASIGEVAAKTLGFYAFCFASEVAAA 120 (367) Q Consensus 92 ---------------------------------------------------~t~~~~l~~~~~~~~~w~~~~~~~~~~~~ 120 (367) ++|.+++.++.+.+.+||+|.++++.+++ T Consensus 159 ~d~~~~~f~i~s~t~G~~~~i~~~t~~~~ia~~l~Lt~~~~a~v~~~g~~~et~~~al~a~~~~s~~Wy~f~~a~~~~~~ 238 (501) T protein:vir:36 159 YDALRNRFTVVTNATGTAAAISAVTGTNNFADEIGLSAAAGATLQAAGVAADTPASAMNRAVGLSRNWATFTTAWTAVIA 238 (501) T ss_pred EcCcceeEEEEeccCCcceeeEeeecccchhhhhcccccCcceEEecccccccHHHHHHHHHhccCceEEEEEecCCChH Confidence 12333457788899999999999999999 Q ss_pred HHHHHHHHhhccCcEEEEEEeCch-------hhhHHHHHHHHhccccceeecCCchhHHHHHHHHHHHcccccCcceeee Q lcl|NC_020841. 121 DIQGLAEWAQSNNRMFMTVMTDDT-------EAVTTGNALKELGQYHYCITYHEDYATVGAVAGMALDQRYDKTDGVKTL 193 (367) Q Consensus 121 ~~~ala~~~ea~~~~~~~~~~d~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~t~ 193 (367) +++++|+|+|+++++|++..++.+ ...++++.++..++.|++..||+.. ..++++|++++.||++.+|++|| T Consensus 239 ~~la~A~wiea~~~~f~~~~~~~~~~~~~~~~~~~i~~~l~~~~y~~t~~~y~~~~-~~aa~~g~~as~nf~~~~g~~T~ 317 (501) T protein:vir:36 239 DRLAFASWNSGQAYKYMYVAPDLEAASIVSNNAASFGAQVFAAPYQGTLPLYGDQA-TAGAVMGYAASINFQLRNGRTVL 317 (501) T ss_pred HHHHHHHHHhhcCceEEEEEecCchhhhhccchhhHHHHHHhcCCCcEEEEcCCCC-HHHHHHHHHHhcCcccCcceeee Confidence 999999999999999988877543 2345778889888888888887544 44578999999999999999999 Q ss_pred eeeec-CcccccCCCHHHHHHHHhCCceEEEEeeccccceEEEecCEeeCC-chhhHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_020841. 194 HLKSL-VSVVSTDISQTQAASLKAACINYYSDYGNPDNSLPIFANGHAGGG-KFFDFVMGFDWLRNVIETNVFNGQRLRR 271 (367) Q Consensus 194 ~~k~l-~Gv~~~~~t~t~~~~l~~~~~n~y~~~~~~~~~~~~~~~G~~~~G-~~iD~~~~~dwl~~~lq~~l~~ll~~~~ 271 (367) |||++ +||+|++++++|+++|+++|||||+.+++.++.+.|+++|+|+|| +|||+++|+|||+++||++|++||.+++ T Consensus 318 ~fkq~~~Gi~a~~l~~t~a~al~~~~~N~y~~~~~~~~~~~~~~~G~~sG~~~wiD~~~g~dWL~~~iq~~l~~ll~~~~ 397 (501) T protein:vir:36 318 AFRQFNAGVPATVHDLPTANALRSNNYTYIGAYANAANNYTIAYDGKLSGKFLWVDTYLDQIYLNAELQRAEFEAMLAYN 397 (501) T ss_pred eccccCCCcCcCcCCHHHHHHHHhcCCcEEEEEecccceeeEEEcCeeeccchhhhHHHhHHHHHHHHHHHHHHHHhcCC Confidence 99997 799999999999999999999999999999999999999999988 7999999999999999999999999999 Q ss_pred CCCcCHhHHHHHHHHHHHHHHHHHhcCceecccccCccc---------cccccccccccceeEEcCchHhCCHHHHhccc Q lcl|NC_020841. 272 LTPQTDRGMMMIKADIVNGLEEAVKAGLVAAGTWNGAAL---------GEIETYDYLPTGYYVYNESIRDQAQVIREQRI 342 (367) Q Consensus 272 kipy~~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~---------g~~~~~~~~~~gy~v~~~~~~~~~~~dr~~R~ 342 (367) |||||+.|+++|+++|+++|+++++||+|+||+|++..+ +....++.+++|||+++++++ .+++||++|+ T Consensus 398 KIPytd~G~~~l~a~i~~~l~~av~nG~I~~Gv~~~~~q~~~I~~~~g~~~~~~~v~~~Gyy~~~~~~~-~~~~~R~~R~ 476 (501) T protein:vir:36 398 SLPYNEDGYTGLYRAGVDVIDAAVTSGIIRAGVTLTNSQLQQIDKAARVAGAGQLVQTRGWYFLIGDPA-NPGQARQNRT 476 (501) T ss_pred CCccChhhHHHHHHHHHHHHHHHHhCceeecCCCCCcccceeecccccccccccceeccceEEeeCccc-CChhhhhhcc Confidence 999999999999999999999999999999999976543 345556789999999988877 4667999999 Q ss_pred cCCeEEEEEECceEEEEEE-EEEec Q lcl|NC_020841. 343 APPFIILVKGAGAIHDTDI-TLIPE 366 (367) Q Consensus 343 ~~~~~~~~~~agaIh~v~i-~~~v~ 366 (367) +|+++|+|+++||||+|+| +++|= T Consensus 477 ~p~~~~~y~~~gaIh~v~i~s~~v~ 501 (501) T protein:vir:36 477 TPACTLWYSDGGSIQSLTIGSNAVI 501 (501) T ss_pred cCcEEEEEEeCCceeEEEeeeeeeC Confidence 9999999999999999998 55555 No 3 >protein:vir:96104 Length: 504 # NCBI annotation: hypothetical protein ORF029 # Family: family:all:396 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294446;genbank:gi:149408343;genbank:GeneID:5237223 Probab=100.00 E-value=7.4e-98 Score=553.10 Aligned_cols=359 Identities=14% Similarity=0.121 Sum_probs=315.0 Q ss_pred cccccceEEEEEeeeccccccccccceEEEeeccccCcccceEEEecHHHHHhccCCCcHHHHHHHHHhccCc----ccc Q lcl|NC_020841. 6 TLPINMLVNVSIEYQAKLLSRDAFNRLLIVGSTAPNGRATDTGIYTSIDGVKLDYGVEADEYKIAQKYFSQNP----KPR 81 (367) Q Consensus 6 ~l~i~~iv~V~i~~~~~~~~~~~fg~~li~~~~~~~~~~~~~~~yts~~~v~~df~~~s~~ykaA~~~F~Q~p----~p~ 81 (367) =+|+|+||+|++++.+++....+|+.+|+|+.++.++. +|.+.|++.++|..+||.+||+||+|..||+|.| +|. T Consensus 1 mip~s~iV~V~~~v~~~~~~~~~~~~~l~l~~~~~~~~-~r~~~y~s~~~V~~~FG~~S~ey~aA~~yF~~~~~~~~~P~ 79 (504) T protein:vir:96 1 MISQSRYIRIISGVGAGAPVAGRKLILRVMTTNNVIPP-GIVIEFDNANAVLSYFGAQSEEYQRAAAYFKFISKSVNSPS 79 (504) T ss_pred CCCccceeEeeecccccccccccccceeEeecccCCCc-cceEEecCHHHHHHhcCCChHHHHHHHHHhhcCCCCCcccc Confidence 37999999999999999999999999999999998764 8999999999999999999999999999999987 999 Q ss_pred eEEEEeccC----------------------------------------------------------------------- Q lcl|NC_020841. 82 DLMIATVTA----------------------------------------------------------------------- 90 (367) Q Consensus 82 ~v~v~~~~~----------------------------------------------------------------------- 90 (367) +|+|+||.. T Consensus 80 ~l~igR~~~~a~~~~l~g~~~~~~~~~~~~i~~G~lsitv~G~~~~~~~i~~S~~ts~~~vA~~i~~al~~~~~~~~~~~ 159 (504) T protein:vir:96 80 SISFARWVNTAIAPMVVGDNLPKTIADFAGFSAGVLTIMVGAAEKNITAIDTSAATSMDNVASIIQTEIRKNTDPQLAQA 159 (504) T ss_pred EEEEEeecCcCccceEEechhHHHHHHHhhhhceEEEEEEcceeeeecccccccccchHHHHHHHHhhhhcccccccccc Confidence 999998641 Q ss_pred ----------------------------------------------------ccchHHHHHHHHhcccCcEEEEEEec-C Q lcl|NC_020841. 91 ----------------------------------------------------LTDPLASIGEVAAKTLGFYAFCFASE-V 117 (367) Q Consensus 91 ----------------------------------------------------~~t~~~~l~~~~~~~~~w~~~~~~~~-~ 117 (367) .+++.+++.++.+.+.+||+|.++++ . T Consensus 160 tv~~d~~~~~f~its~~tg~~~~~~~~~a~~~~~~~~lgl~~~~~~~v~g~~aet~~~al~al~~~~~~Wy~f~~a~~~~ 239 (504) T protein:vir:96 160 TVTWNPNTNQFTLVGATIGTGVLAVAKSADPQDMSTALGWSTSNVVNVAGQAADLPDAAVAKSTNVSNNFGSFLFAGATL 239 (504) T ss_pred eEEEeccCCeEEEEeeccccceeEEEeeccccchhhhhhcccccceEEeecccccHHHHHHHHHhhcCCeEEEEEEeccC Confidence 12233467778888999999988765 6 Q ss_pred CHHHHHHHHHHhhccCcEEEEEEeCchhhhHHHHHHHHhccccceeecCCc---hhHHHHHHHHHHHcccccCcceeeee Q lcl|NC_020841. 118 AAADIQGLAEWAQSNNRMFMTVMTDDTEAVTTGNALKELGQYHYCITYHED---YATVGAVAGMALDQRYDKTDGVKTLH 194 (367) Q Consensus 118 ~~~~~~ala~~~ea~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~g~~t~~ 194 (367) ++++++++|+|+|+++++|+++.++................++....++.. .....++++++++.+|++.+|++||| T Consensus 240 ~dd~ilalA~w~ea~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~as~~f~~~ng~~T~~ 319 (504) T protein:vir:96 240 DNDQIKAVSAWNAAQNNQFIYTVATSLANLGALFDLVKGNSGTALNVLSATASNDFVEQCPSEILAATNYDEPGASQNYM 319 (504) T ss_pred CHHHHHHHHHHHhhcCceEEEEEeecccchhhHHHhhhhcceeEEEEeecCccchhHHHHHHHHHHhcCcCccccccccc Confidence 788999999999999999998888654433332333344445555555533 33456778999999999999999999 Q ss_pred eeecCcccccCCCHHHHHHHHhCCceEEEEeeccccceEEEecCEeeCCc----hhhHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_020841. 195 LKSLVSVVSTDISQTQAASLKAACINYYSDYGNPDNSLPIFANGHAGGGK----FFDFVMGFDWLRNVIETNVFNGQRLR 270 (367) Q Consensus 195 ~k~l~Gv~~~~~t~t~~~~l~~~~~n~y~~~~~~~~~~~~~~~G~~~~G~----~iD~~~~~dwl~~~lq~~l~~ll~~~ 270 (367) ||+++||+|++++++|+++|+++|||||+.+++.++++.|+++|+|+||+ |||++++++||+++||++|++||.++ T Consensus 320 fk~l~GVta~~lt~t~~~aL~~~~~N~y~~~a~~~~~~~~~~~G~~~gG~~~~~wiDv~~~~~WL~~~lq~~l~~l~~~~ 399 (504) T protein:vir:96 320 YYQFPGRNITVSDDTAANTVDKSRGNYIGVTQANGQQLAFYQRGILCGGPTDAVDMNVYANEIWLKSAIAQALLDLFLNV 399 (504) T ss_pred ccccCCcCcccCCHHHHHHHHhcCCeEEEEeecccceeeEEecCeeeCCccccchhhhhhhHHHHHHHHHHHHHHHHhcC Confidence 99999999999999999999999999999999999999999999999997 79999999999999999999999999 Q ss_pred CCCCcCHhHHHHHHHHHHHHHHHHHhcCceecccccCccccc---------cccccccccceeEEcCchHhCCHHHHhcc Q lcl|NC_020841. 271 RLTPQTDRGMMMIKADIVNGLEEAVKAGLVAAGTWNGAALGE---------IETYDYLPTGYYVYNESIRDQAQVIREQR 341 (367) Q Consensus 271 ~kipy~~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~g~---------~~~~~~~~~gy~v~~~~~~~~~~~dr~~R 341 (367) +|||||+.|+++|+++|+++|++|++||+|+||+|++....+ ...++++.+||||++|++++++++||++| T Consensus 400 ~kIPyt~~Gi~~l~a~i~~vl~~av~~G~I~~Gv~~~~~q~~~I~~~~~~d~~~~~~~~~GYyv~~~~~s~~s~~~r~~R 479 (504) T protein:vir:96 400 NAVPASMVGEAMTLAVLQPVLDKATSNGTFTYGKDISAVQQQYITQITGDRRAWRQVQTLGYWINITFSSYTNSNTGLTE 479 (504) T ss_pred CCcccCHhhHHHHHHHHHHHHHHHHhcceeccCccCCccchheecccccccccccceeccceEEEecChhccChhHhhhc Confidence 999999999999999999999999999999999997754322 33468889999999999999999999999 Q ss_pred ccCCeEEEEEECceEEEEEE-EEEe Q lcl|NC_020841. 342 IAPPFIILVKGAGAIHDTDI-TLIP 365 (367) Q Consensus 342 ~~~~~~~~~~~agaIh~v~i-~~~v 365 (367) ++|+++|+|+++||||+|+| ++.| T Consensus 480 ~~~~~~~~y~~~gaI~~v~~~~~~v 504 (504) T protein:vir:96 480 WKANYTLIYSKGDAIRFVEGSDVMI 504 (504) T ss_pred cccceEEEEEECCeEEEEEeccccC Confidence 99999999999999999999 5556 No 4 >protein:vir:106730 Length: 501 # NCBI annotation: gp04 # Family: family:all:396 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944312;genbank:gi:38638611;genbank:GeneID:2657359 Probab=100.00 E-value=6.7e-98 Score=553.31 Aligned_cols=360 Identities=19% Similarity=0.234 Sum_probs=317.8 Q ss_pred cc-ccccccceEEEEEeeeccccccccccceEEEeeccccCcccceEEEecHHHHHhccCCCcHHHHHHHHHhc----cC Q lcl|NC_020841. 3 GS-LTLPINMLVNVSIEYQAKLLSRDAFNRLLIVGSTAPNGRATDTGIYTSIDGVKLDYGVEADEYKIAQKYFS----QN 77 (367) Q Consensus 3 ~~-~~l~i~~iv~V~i~~~~~~~~~~~fg~~li~~~~~~~~~~~~~~~yts~~~v~~df~~~s~~ykaA~~~F~----Q~ 77 (367) |+ .++|+|+||+|++++.+++...++|+ .|+|+..+.++. +|.+.|+++++|..+||.+||+||+|..||+ |+ T Consensus 1 m~~~~ip~s~iV~V~~~v~~~~~~~~~f~-~lll~~~~~~~~-~r~~~y~s~~~V~~~FG~~S~ey~aA~~yFsg~~~q~ 78 (501) T protein:vir:10 1 MPTTTIPIDQIVQMLPGVIGAGGAPGRLT-GLVLTQDTSVQP-GQLADFFQKTDVENWFGALSNEAKIADAYFPGIVNGG 78 (501) T ss_pred CCcCccccceEEEEeeecccCCCcccccc-eEEEecccCCCc-cceeeecCHHHHHHhcCCChHHHHHHHHHhhhhcCCC Confidence 55 46999999999999999999999998 556777776554 7999999999999999999999999999998 99 Q ss_pred cccceEEEEeccCcc----------------------------------------------------------------- Q lcl|NC_020841. 78 PKPRDLMIATVTALT----------------------------------------------------------------- 92 (367) Q Consensus 78 p~p~~v~v~~~~~~~----------------------------------------------------------------- 92 (367) |+|.+|+|+||..+. T Consensus 79 p~P~~l~igR~~~~~~~~~l~g~~l~~~~la~~~~~~g~l~i~i~g~~~~~~i~~s~ats~~~vA~~i~~al~~~~~tv~ 158 (501) T protein:vir:10 79 QLPYDLKFARYVAADAPASVYGIPLTGITLAQLQGYSGTLTVTTAAQHVSANISLAAATSFANAATLIEAAFTSPDFVVA 158 (501) T ss_pred ccccEEEEEeecccCccceeeeceehhhhhhhhhheeeEEEEeeccceeeeccccccccCHHHHHHHHHHhhcCCceEEE Confidence 999999998864111 Q ss_pred ----------------------------------------------------chHHHHHHHHhcccCcEEEEEEecCCHH Q lcl|NC_020841. 93 ----------------------------------------------------DPLASIGEVAAKTLGFYAFCFASEVAAA 120 (367) Q Consensus 93 ----------------------------------------------------t~~~~l~~~~~~~~~w~~~~~~~~~~~~ 120 (367) +|.+++.++.+.+.+||+|.++++.+++ T Consensus 159 ~d~~~~~f~i~~~t~G~~~~i~~~t~~~d~a~~l~Lt~~~~a~v~~~g~~aet~~~Al~a~~~~~~~Wy~f~~a~~~~~~ 238 (501) T protein:vir:10 159 YDALRNRFTVVTNTTGTAAAISAVTGTNNLADELGLSAAAGATLQAAGVAADTPASAMNRAVGLSRNWATFTTAWTAVIA 238 (501) T ss_pred EecccceEEEEecccCcceeEEEeeccccchhhhcccccCceeEEecCcccccHHHHHHHHHhcccceEEEEEEecCChH Confidence 1222446777889999999999999999 Q ss_pred HHHHHHHHhhccCcEEEEEEeCch-------hhhHHHHHHHHhccccceeecCCchhHHHHHHHHHHHcccccCcceeee Q lcl|NC_020841. 121 DIQGLAEWAQSNNRMFMTVMTDDT-------EAVTTGNALKELGQYHYCITYHEDYATVGAVAGMALDQRYDKTDGVKTL 193 (367) Q Consensus 121 ~~~ala~~~ea~~~~~~~~~~d~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~t~ 193 (367) +++++|+|+|+++++|++..++.+ ...++.+.++..++.|++..||+.. ..++++|++++.||++.+|++|| T Consensus 239 ~~la~A~wi~a~~~~f~~~~~~~~~~~~~~~~~~~i~~~l~~~~y~~t~~~y~~~~-~~aa~~g~~as~nf~~~~g~~T~ 317 (501) T protein:vir:10 239 DRLAFAAWNSGQAYKYMYVAPDLEAASIVTNNAASFGAQVFAAPYQGTLPLYGDQA-TAGAVMGYAASINFQLRNGRTVL 317 (501) T ss_pred HHHHHHHHHHhcCceEEEEEecCcceeeecccchhHHHHHHhcCCCceEEECCCCC-HHHHHHHHHHhcCcccCcceeee Confidence 999999999999999988877543 2345677888888888888776544 45578999999999999999999 Q ss_pred eeeec-CcccccCCCHHHHHHHHhCCceEEEEeeccccceEEEecCEeeCC-chhhHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_020841. 194 HLKSL-VSVVSTDISQTQAASLKAACINYYSDYGNPDNSLPIFANGHAGGG-KFFDFVMGFDWLRNVIETNVFNGQRLRR 271 (367) Q Consensus 194 ~~k~l-~Gv~~~~~t~t~~~~l~~~~~n~y~~~~~~~~~~~~~~~G~~~~G-~~iD~~~~~dwl~~~lq~~l~~ll~~~~ 271 (367) |||++ +||+|++++++|+++|+++|||||+.+++.++.+.|+++|+|+|| +|||+++|+|||+++||.+|++||.+++ T Consensus 318 ~fkql~~Gv~a~~l~~t~a~al~~~~~N~y~~~~~~~~~~~~~~~G~~sG~~~wiD~~~g~dWl~~~iq~~l~~ll~~~~ 397 (501) T protein:vir:10 318 AFRQFNAGVPATAHDLPTANALRSNNYTYIGAYANAANNYTIAYDGKLSGKFLWVDTYLDQIYLNAELQRAEFEAMLAYN 397 (501) T ss_pred eecccCCCcCcccCCHHHHHHHHhcCCeEEEEEecccceeeEEEcceeeccceehhhHhhHHHHHHHHHHHHHHHHhcCC Confidence 99996 899999999999999999999999999999999999999999988 7999999999999999999999999999 Q ss_pred CCCcCHhHHHHHHHHHHHHHHHHHhcCceecccccCc---------cccccccccccccceeEEcCchHhCCHHHHhccc Q lcl|NC_020841. 272 LTPQTDRGMMMIKADIVNGLEEAVKAGLVAAGTWNGA---------ALGEIETYDYLPTGYYVYNESIRDQAQVIREQRI 342 (367) Q Consensus 272 kipy~~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~---------~~g~~~~~~~~~~gy~v~~~~~~~~~~~dr~~R~ 342 (367) |||||+.|+++|++.|+++|+++++||+|+||++++. ..+....++.+++|||+++++++.+ +++|++|+ T Consensus 398 kIPyt~~G~~~l~a~i~~~l~~av~nG~Ia~Gv~l~~~q~~~i~~~~g~~~~~~~v~~~Gyy~~~~~~~~~-~~~R~~R~ 476 (501) T protein:vir:10 398 SLPYNEDGYTANYRAGVDVIDAAVTSGIIRAGVTLTNSQLQQIDAVAGVAGAGALVQTRGWYFLIGNPANP-GQARQNRT 476 (501) T ss_pred CcccCHHHHHHHHHHHHHHHHHHHhCcceecCcccCcccceeecccccccccccceeccceEEeeCcccCC-hhhhhhcc Confidence 9999999999999999999999999999999996444 4445666789999999999988865 46799999 Q ss_pred cCCeEEEEEECceEEEEEE-EEEec Q lcl|NC_020841. 343 APPFIILVKGAGAIHDTDI-TLIPE 366 (367) Q Consensus 343 ~~~~~~~~~~agaIh~v~i-~~~v~ 366 (367) +|+++|+|+++||||+|+| +.+|= T Consensus 477 ~p~~~~~y~~~gaIh~v~i~s~~v~ 501 (501) T protein:vir:10 477 SPACTLWYSDGGSIQELTIGSNAVI 501 (501) T ss_pred cCceEEEEEeCCceeEEEeeeeecC Confidence 9999999999999999998 55555 No 5 >protein:vir:101576 Length: 501 # NCBI annotation: gp04 # Family: family:all:396 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958108;genbank:gi:41057654;genbank:GeneID:2716834 Probab=100.00 E-value=1.3e-97 Score=551.78 Aligned_cols=360 Identities=19% Similarity=0.235 Sum_probs=317.8 Q ss_pred cc-ccccccceEEEEEeeeccccccccccceEEEeeccccCcccceEEEecHHHHHhccCCCcHHHHHHHHHhc----cC Q lcl|NC_020841. 3 GS-LTLPINMLVNVSIEYQAKLLSRDAFNRLLIVGSTAPNGRATDTGIYTSIDGVKLDYGVEADEYKIAQKYFS----QN 77 (367) Q Consensus 3 ~~-~~l~i~~iv~V~i~~~~~~~~~~~fg~~li~~~~~~~~~~~~~~~yts~~~v~~df~~~s~~ykaA~~~F~----Q~ 77 (367) |+ .++|+|+||+|++++.+++.+.++|+ .|+|+..+.++..++ +.|++.++|..+|+.+||+||+|..||+ |+ T Consensus 1 m~~~~ip~s~iV~V~~~v~~~~~~~~~~~-~l~l~~~~~~~~~~~-~~~~s~~~V~~~FG~~S~ey~aA~~yFsg~~~q~ 78 (501) T protein:vir:10 1 MPTTTIPIDQIVQMLPGVIGAGGAPGRLT-GLVLTQDTSVQPGQL-ADFFQETDVENWFGALSNEAKIADAYFPGIVNGG 78 (501) T ss_pred CCCCCcccceEEEEeeecccCCCccccce-eEEEeccCCCCccce-EEecCHHHHHHhcCCChHHHHHHHHHhhhhcCCC Confidence 66 67999999999999999999999997 668888887766555 6699999999999999999999999999 99 Q ss_pred cccceEEEEeccCc------------------------------------------------------------------ Q lcl|NC_020841. 78 PKPRDLMIATVTAL------------------------------------------------------------------ 91 (367) Q Consensus 78 p~p~~v~v~~~~~~------------------------------------------------------------------ 91 (367) |+|.+|+|+||..+ T Consensus 79 p~P~~l~igR~~~~~~~~~l~g~~l~~~~la~~~~~sg~l~vti~g~~~~~~i~ls~ats~~~vAs~i~~al~~~~~tv~ 158 (501) T protein:vir:10 79 QLPYDLKFARYVAADAPASVYGIPLTGVTLAQLQGYSGTLTVTTAAQHVSANISLAAATSFANAATLIEAAFTSPDFVVA 158 (501) T ss_pred ccccEEEEEeecCCCccceEeccchhhhhhhhcceeeeEEEEeeccceeecccccccccCHHHHHHHHhhhccCCceEEE Confidence 99999998875311 Q ss_pred ---------------------------------------------------cchHHHHHHHHhcccCcEEEEEEecCCHH Q lcl|NC_020841. 92 ---------------------------------------------------TDPLASIGEVAAKTLGFYAFCFASEVAAA 120 (367) Q Consensus 92 ---------------------------------------------------~t~~~~l~~~~~~~~~w~~~~~~~~~~~~ 120 (367) +++.+++.++.+.+.+||+|.++++.+++ T Consensus 159 ~d~~~~~f~its~ttG~~~~i~~~~~~~~la~~l~Lt~~~~a~v~~~g~~aet~~~a~~a~~~~~~~Wy~f~~a~~~~~~ 238 (501) T protein:vir:10 159 YDALRNRFTVVTNATGTAAAISAVTGTNNLADELGLSAAAGATLQAAGVAADTPASAMNRAVGLSRNWATFTTAWTAVIA 238 (501) T ss_pred EcccCceEEEEeeccCCceeEEEeeCchhhhhhcCccccccceEEecCcccccHHHHHHHHHhccCceEEEEEecCCChH Confidence 11233456778889999999999999999 Q ss_pred HHHHHHHHhhccCcEEEEEEeCchh-------hhHHHHHHHHhccccceeecCCchhHHHHHHHHHHHcccccCcceeee Q lcl|NC_020841. 121 DIQGLAEWAQSNNRMFMTVMTDDTE-------AVTTGNALKELGQYHYCITYHEDYATVGAVAGMALDQRYDKTDGVKTL 193 (367) Q Consensus 121 ~~~ala~~~ea~~~~~~~~~~d~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~t~ 193 (367) +++++|+|+|+++++|++..++.+. ..++...++..++.|++..||+ ....++++|++++.+|++.+|++|| T Consensus 239 ~~la~A~wiea~~~~f~~~~~~~~~~~~~~~~~~~i~~~l~~~~y~~t~~~y~~-~~~~aa~~g~~as~nf~~~~g~~T~ 317 (501) T protein:vir:10 239 DRLAFAAWNSGQAYKYMYVAPDLEAASIVTNNAASFGAQVFAAPYQGTLPLYGD-QATAGAVMGYAASINFQLRNGRTVL 317 (501) T ss_pred HHHHHHHHHHhcCceEEEEEecCchhhhhhhhhhhHHHHHHhcCCCceEEECCC-CcHHHHHHHHHHhhCcccCccceee Confidence 9999999999999999888776542 3446678888888888777764 4455688999999999999999999 Q ss_pred eeeecC-cccccCCCHHHHHHHHhCCceEEEEeeccccceEEEecCEeeCC-chhhHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_020841. 194 HLKSLV-SVVSTDISQTQAASLKAACINYYSDYGNPDNSLPIFANGHAGGG-KFFDFVMGFDWLRNVIETNVFNGQRLRR 271 (367) Q Consensus 194 ~~k~l~-Gv~~~~~t~t~~~~l~~~~~n~y~~~~~~~~~~~~~~~G~~~~G-~~iD~~~~~dwl~~~lq~~l~~ll~~~~ 271 (367) |||+++ ||+|++++++|+++|+++|||||+.+++.++++.|+++|+|+|+ +|||.++|+|||+++||.+|++||.+++ T Consensus 318 ~fkq~~~Gi~a~~lt~t~a~al~~~~~N~y~~~~~~~~~~~~~~~G~~sG~~~wiD~~~~~~Wl~~~iq~~l~~ll~~~~ 397 (501) T protein:vir:10 318 AFRQFNAGVPATAHDLPTANALRSNNYTYIGAYANAANNYTIAYDGKLSGKFLWVDTYLDQIYLNAELQRAEFEAMLAYN 397 (501) T ss_pred eccccCCCcCcccCCHHHHHHHHhcCCeEEEEeccccceeeEEecCeeeccceeehhhhhHHHHHHHHHHHHHHHHHhcC Confidence 999986 89999999999999999999999999999999999999999988 7999999999999999999999999999 Q ss_pred CCCcCHhHHHHHHHHHHHHHHHHHhcCceecccccCccc---------cccccccccccceeEEcCchHhCCHHHHhccc Q lcl|NC_020841. 272 LTPQTDRGMMMIKADIVNGLEEAVKAGLVAAGTWNGAAL---------GEIETYDYLPTGYYVYNESIRDQAQVIREQRI 342 (367) Q Consensus 272 kipy~~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~---------g~~~~~~~~~~gy~v~~~~~~~~~~~dr~~R~ 342 (367) |||||+.|+++|++.|+++|+++++||+|+||+|+++.+ +....++.+++|||+++++++. ++++|++|+ T Consensus 398 kIPyt~~G~~~l~a~v~~~l~~av~nG~I~~Gv~l~~~q~~~i~~~~g~~~~~~~v~~~Gyy~~~~~~~~-~~~~R~~R~ 476 (501) T protein:vir:10 398 SLPYNEDGYTALYRAGVDVIDAAVTSGIIRAGVTLTNSQLQQIDAAAGVAGAGQLVQTRGWYFLIGDPAN-PGQARQNRT 476 (501) T ss_pred CcccCHHHHHHHHHHHHHHHHHHHhCceeecCCCCCcccceeeccccCccccccceeccceeEeeccccC-Chhhhhhcc Confidence 999999999999999999999999999999999977654 3355568899999999988874 567999999 Q ss_pred cCCeEEEEEECceEEEEEE-EEEec Q lcl|NC_020841. 343 APPFIILVKGAGAIHDTDI-TLIPE 366 (367) Q Consensus 343 ~~~~~~~~~~agaIh~v~i-~~~v~ 366 (367) +|+++|+|+++||||+|+| +.+|= T Consensus 477 ~p~~~~~y~~~gaIh~v~i~s~~v~ 501 (501) T protein:vir:10 477 TPACTLWYSDGGSIQQLTIGSNAVI 501 (501) T ss_pred ccceEEEEEeCCceeEEEeeeeecC Confidence 9999999999999999998 55555 No 6 >protein:vir:78611 Length: 501 # NCBI annotation: BcepNY3gp03 # Family: family:all:396 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294840;genbank:gi:149882903;genbank:GeneID:5291082 Probab=100.00 E-value=2.1e-97 Score=550.58 Aligned_cols=360 Identities=19% Similarity=0.234 Sum_probs=319.0 Q ss_pred cc-ccccccceEEEEEeeeccccccccccceEEEeeccccCcccceEEEecHHHHHhccCCCcHHHHHHHHHhc----cC Q lcl|NC_020841. 3 GS-LTLPINMLVNVSIEYQAKLLSRDAFNRLLIVGSTAPNGRATDTGIYTSIDGVKLDYGVEADEYKIAQKYFS----QN 77 (367) Q Consensus 3 ~~-~~l~i~~iv~V~i~~~~~~~~~~~fg~~li~~~~~~~~~~~~~~~yts~~~v~~df~~~s~~ykaA~~~F~----Q~ 77 (367) |+ .++|+|+||+|++++.+++.+.++|+ .|+|+....++ .+|.+.|+++++|..+||.+||+||+|..||+ |+ T Consensus 1 m~~~~ip~s~iV~V~~~v~~~~~~~~~~~-~lll~~~~~~~-~~r~~~y~s~~~V~~~FG~~S~ey~aA~~yFs~~~~q~ 78 (501) T protein:vir:78 1 MPTTTIPIDQIVQMLPGVIGAGGAPGRLT-GLVLTQDTSIQ-PGQLADFFQKTDVENWFGGLSNEAVIADAYFPGIVNGG 78 (501) T ss_pred CCcCccccceEEEEeeecccCCCcceeee-eEEEecCCCCC-ccceeeecCHHHHHHhcCCChHHHHHHHHHhhcCCCCC Confidence 55 46999999999999999999999998 56777777655 58999999999999999999999999999999 99 Q ss_pred cccceEEEEeccCc------------------------------------------------------------------ Q lcl|NC_020841. 78 PKPRDLMIATVTAL------------------------------------------------------------------ 91 (367) Q Consensus 78 p~p~~v~v~~~~~~------------------------------------------------------------------ 91 (367) |+|.+|+|+||..+ T Consensus 79 ~~P~~l~igR~~~~a~~~~l~g~~l~~~~la~~~~~~G~l~iti~g~~~~~~i~~S~~ts~~~vA~~i~~al~a~~~tv~ 158 (501) T protein:vir:78 79 QLPYDLKFARYVAADAPASVYGIPLTGVTLTQLQGYSGTLTVTTAAQHVSSNISLAAATSFANAATLIEAAFTSPDFVVS 158 (501) T ss_pred cccceEEEEeecccCcceeEeccceeccchhhhceeeeEEEEEeccceeeeccccccccCHHHHHHHHHhhhcCcceEEE Confidence 99999988875411 Q ss_pred ---------------------------------------------------cchHHHHHHHHhcccCcEEEEEEecCCHH Q lcl|NC_020841. 92 ---------------------------------------------------TDPLASIGEVAAKTLGFYAFCFASEVAAA 120 (367) Q Consensus 92 ---------------------------------------------------~t~~~~l~~~~~~~~~w~~~~~~~~~~~~ 120 (367) +++.+++.++.+.+.+||+|.++++.+++ T Consensus 159 ~ds~~~~f~its~t~G~~~~i~~~t~~~~~a~~l~Lt~~~~a~v~~~g~~aet~~~a~~a~~~~~~~Wy~f~~a~~~~~~ 238 (501) T protein:vir:78 159 YDALRNRFVVNTNATGTAAAISAVTGTNNLADELGLSAAAGASLQAAGVAADTPASAMNRAVGLSRNWATFTTAWTAVIA 238 (501) T ss_pred EccccceEEEEeeecCCceeEEEEecccchhhhhcccccCceeeEeccccccCHHHHHHHHHhccCceEEEEEecCCCHH Confidence 11223456778889999999999999999 Q ss_pred HHHHHHHHhhccCcEEEEEEeCch-------hhhHHHHHHHHhccccceeecCCchhHHHHHHHHHHHcccccCcceeee Q lcl|NC_020841. 121 DIQGLAEWAQSNNRMFMTVMTDDT-------EAVTTGNALKELGQYHYCITYHEDYATVGAVAGMALDQRYDKTDGVKTL 193 (367) Q Consensus 121 ~~~ala~~~ea~~~~~~~~~~d~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~t~ 193 (367) +++++|+|+|+++++|++..++.+ ...++++.++..++.|++..|| +.+..++++|++++.+|++.+|++|| T Consensus 239 ~~lalA~wiea~~~~f~~~~~~~~~~~~~~~~~~~i~~~l~a~~y~~t~~~y~-~~~~~aa~~g~~as~nf~~~~g~~T~ 317 (501) T protein:vir:78 239 DRLALASWNSGQAYKYMYVAPDLEPASIVTNNSASFGAQVFAAPYQGTLPLYG-DQATAGAVMGYAASINFQLRNGRTVL 317 (501) T ss_pred HHHHHHHHHHhcCceEEEEEecCCcceeecccchhHHHHHhhcCCCceEEEcC-CcchHHHHHHHHHhcCcccCcceeee Confidence 999999999999999988876543 2345678899888888887776 55666788999999999999999999 Q ss_pred eeeec-CcccccCCCHHHHHHHHhCCceEEEEeeccccceEEEecCEeeCC-chhhHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_020841. 194 HLKSL-VSVVSTDISQTQAASLKAACINYYSDYGNPDNSLPIFANGHAGGG-KFFDFVMGFDWLRNVIETNVFNGQRLRR 271 (367) Q Consensus 194 ~~k~l-~Gv~~~~~t~t~~~~l~~~~~n~y~~~~~~~~~~~~~~~G~~~~G-~~iD~~~~~dwl~~~lq~~l~~ll~~~~ 271 (367) |||++ +||+|++++++|+++|+++|||||+.+++.++.+.|+++|+|+|+ +|||.++|+|||+++||.+|++||.+++ T Consensus 318 ~fkq~~~Gv~a~~l~~t~a~al~~~~~N~y~~~~~~~~~~~~~~~G~~sG~~~wiD~~~~~~Wl~~~iq~~l~~ll~~~~ 397 (501) T protein:vir:78 318 AFRQFNAGVPATAHDLGTANALRSNNYTYIGAYANAANNYTIAYDGKLSGKFLWVDTYLDQIYLNAELQRAEFEAMLAYN 397 (501) T ss_pred eccccCCCcCcccCCHHHHHHHHhcCCeEEEEEecccceeeEEEcCeeeccceeehhhhhHHHHHHHHHHHHHHHHHhCC Confidence 99996 899999999999999999999999999999999999999999988 7999999999999999999999999999 Q ss_pred CCCcCHhHHHHHHHHHHHHHHHHHhcCceecccccCccc---------cccccccccccceeEEcCchHhCCHHHHhccc Q lcl|NC_020841. 272 LTPQTDRGMMMIKADIVNGLEEAVKAGLVAAGTWNGAAL---------GEIETYDYLPTGYYVYNESIRDQAQVIREQRI 342 (367) Q Consensus 272 kipy~~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~---------g~~~~~~~~~~gy~v~~~~~~~~~~~dr~~R~ 342 (367) |||||+.|+++|++.|+++|+++++||+|+||+|+++.+ +....++++++|||+++++++.+ +++|++|+ T Consensus 398 kIPyt~~G~~~l~a~v~~~l~~av~nG~I~~Gv~l~~~q~~~I~~~~g~~~~~~~~~~~Gyy~~~~~~~~~-~~~R~~R~ 476 (501) T protein:vir:78 398 SLPYNEDGYTALYRAGVDVIDAAVTSGIIRAGVTLTNSQLQQIDAAAGVAGAGQLVQMRGWYFLIGDPANP-GQARQNRT 476 (501) T ss_pred CcccCHHHHHHHHHHHHHHHHHHHhCceeecCCCCCCccceeeccccCccccccceeccceEEeeccccCC-hhhhhhcc Confidence 999999999999999999999999999999999977654 34556788999999999988865 46799999 Q ss_pred cCCeEEEEEECceEEEEEE-EEEec Q lcl|NC_020841. 343 APPFIILVKGAGAIHDTDI-TLIPE 366 (367) Q Consensus 343 ~~~~~~~~~~agaIh~v~i-~~~v~ 366 (367) +|+++|+|+++||||+|+| +.+|= T Consensus 477 ~p~~~~~y~~~gaIh~v~i~s~~v~ 501 (501) T protein:vir:78 477 TPTCTLWYSDGGSIQELTIGSNAVI 501 (501) T ss_pred cCcEEEEEEeCCceeEEEeeeeecC Confidence 9999999999999999998 55555 No 7 >protein:vir:99586 Length: 507 # NCBI annotation: hypothetical protein # Family: family:all:396 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039795;genbank:gi:126011045;genbank:GeneID:4818281 Probab=100.00 E-value=4.7e-97 Score=548.70 Aligned_cols=360 Identities=18% Similarity=0.148 Sum_probs=313.9 Q ss_pred cccccceEEEEEeeeccccccccccceEEEeeccccCcccceEEEecHHHHHhccCCCcHHHHHHHHHhccCc----ccc Q lcl|NC_020841. 6 TLPINMLVNVSIEYQAKLLSRDAFNRLLIVGSTAPNGRATDTGIYTSIDGVKLDYGVEADEYKIAQKYFSQNP----KPR 81 (367) Q Consensus 6 ~l~i~~iv~V~i~~~~~~~~~~~fg~~li~~~~~~~~~~~~~~~yts~~~v~~df~~~s~~ykaA~~~F~Q~p----~p~ 81 (367) =+|||+||+|++++.+.+.++++|+.+|||+.++.++ .+|.+.|+++++|..+||.+||+||+|..||+|.| +|. T Consensus 1 mip~s~iVnV~~~v~~~a~~~~~~~~~lilt~~~~~~-~~r~~~y~s~~~V~~~FG~~S~ey~aA~~yFsq~p~~~~~P~ 79 (507) T protein:vir:99 1 MISQSRYVRIVSGVGAGAPVAQRRLIMRVMTTNAVLP-PGVVFESSSADAVGAYFGMASEEYKRAKAYMSFISKSINSPS 79 (507) T ss_pred CCCccceeEEeeeccccCcccccccceeeeccccCCC-ccceEeecCHHHHHHhcCCChHHHHHHHHHhccCCCCCcccc Confidence 3799999999999999999999999999999998764 48999999999999999999999999999999999 799 Q ss_pred eEEEEeccCc---------------------------------------------------------------------- Q lcl|NC_020841. 82 DLMIATVTAL---------------------------------------------------------------------- 91 (367) Q Consensus 82 ~v~v~~~~~~---------------------------------------------------------------------- 91 (367) +|+|+||.++ T Consensus 80 ~L~igR~~~~~~~a~l~g~~~~~~l~~~~~~~~G~lti~v~G~~~t~~~i~lS~~ts~~~vAs~i~~~l~a~~~~~~~~~ 159 (507) T protein:vir:99 80 YISFARWVNAAIASMIVGDSLVKNLPALKAVATPTLSLSIGGTVVPIAGIDLTAALTLTDVAATLQTKIRASANAELATA 159 (507) T ss_pred eEEEEeecCccccceeecchhhhhHHHHhhhcceeEEEEEcCceeEeccccccccCCHHHHHHHHHHhhhccccccccce Confidence 9999886310 Q ss_pred -------------------------------------------------------cchHHHHHHHHhcccCcEEEEEEec Q lcl|NC_020841. 92 -------------------------------------------------------TDPLASIGEVAAKTLGFYAFCFASE 116 (367) Q Consensus 92 -------------------------------------------------------~t~~~~l~~~~~~~~~w~~~~~~~~ 116 (367) +++.+++.++.+.+.+||+|++.+. T Consensus 160 tv~~d~~~~~F~v~s~~tG~~s~i~~at~~~~gt~~s~l~~~~~~~a~~~~g~~aet~~~a~~a~~~~~~nW~~~~~a~~ 239 (507) T protein:vir:99 160 TVTFNTTTNQFVLNGTTTGALAPTITAVRTDPATDISSLLGWTNTGTVFVKGQAAETPDTSISKSAAISTNFGSFIYTST 239 (507) T ss_pred EEEEecCCceEEEEeeeccccceeEEEEcCCchhhHHHHhccccccceEeecccccCHHHHHHHHHhhcCCeEEEEEEec Confidence 1123345677778999999988764 Q ss_pred --CCHHHHHHHHHHhhccCcEEEEEEeCchhhhHHHHHHHHhc-cccceeec--CCchhHHHHHHHHHHHcccccCccee Q lcl|NC_020841. 117 --VAAADIQGLAEWAQSNNRMFMTVMTDDTEAVTTGNALKELG-QYHYCITY--HEDYATVGAVAGMALDQRYDKTDGVK 191 (367) Q Consensus 117 --~~~~~~~ala~~~ea~~~~~~~~~~d~~~~~~~~~~~~~~~-~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~g~~ 191 (367) +++++++++|+|+|+++++|++..++.....+.....+... ........ .+....++++++++++++|++.+|++ T Consensus 240 ~~~td~~~lalA~wiea~~~~f~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~aa~~g~~as~nf~~~ng~~ 319 (507) T protein:vir:99 240 PALTNDQITAVASWNASQNNMYMYSVPTTIANIGTLYAAVKGFSGCALNITSDSLPVDYIEQSPCEILAATDYTRVNATQ 319 (507) T ss_pred cccChHHHHHHHHHHhhcCcEEEEEEecCchhhhhhhhhhhhcceeEEEeecccccchhHHHHHHHHHHhhccCcCccce Confidence 58899999999999999999988887655443322222222 22211111 23345678899999999999999999 Q ss_pred eeeeeecCcccccCCCHHHHHHHHhCCceEEEEeeccccceEEEecCEeeCCc----hhhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020841. 192 TLHLKSLVSVVSTDISQTQAASLKAACINYYSDYGNPDNSLPIFANGHAGGGK----FFDFVMGFDWLRNVIETNVFNGQ 267 (367) Q Consensus 192 t~~~k~l~Gv~~~~~t~t~~~~l~~~~~n~y~~~~~~~~~~~~~~~G~~~~G~----~iD~~~~~dwl~~~lq~~l~~ll 267 (367) |||||+++||+|++++++|+++|+++|||||+.+++.++++.|+++|+|+||+ |+|.++++|||+++||++|++|| T Consensus 320 T~~fk~l~GV~a~~lt~t~a~al~~~n~N~y~~~a~~~~~~~~~~~G~~~gG~~~fid~d~~~~~~WL~~~iq~~l~~l~ 399 (507) T protein:vir:99 320 NYMYYQFPSRNITVSDDTTANLVDANRGNYIGQTQSAGQSLAFYQRGILCGGPNDAVDMNIYANEIWLKSAISAQILSLF 399 (507) T ss_pred eecccccCCcccccCCHHHHHHHHhcCCeEEEEeccccceeeEEecCeeeCCcccceeeeeecchHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999995 67778899999999999999999 Q ss_pred HhcCCCCcCHhHHHHHHHHHHHHHHHHHhcCceeccc---------ccCccccccccccccccceeEEcCchHhCCHHHH Q lcl|NC_020841. 268 RLRRLTPQTDRGMMMIKADIVNGLEEAVKAGLVAAGT---------WNGAALGEIETYDYLPTGYYVYNESIRDQAQVIR 338 (367) Q Consensus 268 ~~~~kipy~~~G~~~l~~~v~~vl~~a~~~G~I~~g~---------~~~~~~g~~~~~~~~~~gy~v~~~~~~~~~~~dr 338 (367) .+++|||||+.|+++|+++|+++|+++++||+|+||+ |++.+.+.+..++++.+|||++.++++.+++++| T Consensus 400 ~~~~kIPyt~~G~~~l~a~i~~~l~~av~nG~I~~Gvtl~~~q~~~in~~~~~~~~~~~~~~~Gyy~~~~~~s~~~~~~r 479 (507) T protein:vir:99 400 LNVPRVPANETGESMLLSVIQSVVNTAKNNGTISAGKNLNVIQQQYITQISGDANAWRQVANIGYWLNITFSSYTNPNTQ 479 (507) T ss_pred hcCCCCccChhhHHHHHHHHHHHHHHHHhccccccCCcccccchheecccccccccccceeccceEEEeCChHhcChhhh Confidence 9999999999999999999999999999999999999 6666777889999999999999999999999999 Q ss_pred hccccCCeEEEEEECceEEEEEEEEEec Q lcl|NC_020841. 339 EQRIAPPFIILVKGAGAIHDTDITLIPE 366 (367) Q Consensus 339 ~~R~~~~~~~~~~~agaIh~v~i~~~v~ 366 (367) ++|++|+++|+|+++|+||+|+|+.+.= T Consensus 480 ~~r~~~~~~~~y~~~gaI~~v~~~~~~v 507 (507) T protein:vir:99 480 LTEWKASYQLIYSKDDAIRFVEGTDTLI 507 (507) T ss_pred hccccceEEEEEEeCCeEEEEEeeeecC Confidence 9999999999999999999999965544 No 8 >protein:vir:94073 Length: 494 # NCBI annotation: hypothetical protein # Family: family:all:396 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453619;genbank:gi:84662655;genbank:GeneID:5142583 Probab=100.00 E-value=1.7e-93 Score=529.11 Aligned_cols=357 Identities=20% Similarity=0.275 Sum_probs=317.7 Q ss_pred cc-ccccceEEEEEeeeccccccccccceEEEeeccccCcccceEEEecHHHHHhccCCCcHHHHHHHHHhc----cCcc Q lcl|NC_020841. 5 LT-LPINMLVNVSIEYQAKLLSRDAFNRLLIVGSTAPNGRATDTGIYTSIDGVKLDYGVEADEYKIAQKYFS----QNPK 79 (367) Q Consensus 5 ~~-l~i~~iv~V~i~~~~~~~~~~~fg~~li~~~~~~~~~~~~~~~yts~~~v~~df~~~s~~ykaA~~~F~----Q~p~ 79 (367) |. +|+|+||+|++++.+++.+.++|+.+|+++.+. ...+|.+.|+++++|..+|+.+||+||+|..||+ |+|+ T Consensus 1 m~~ip~s~iV~V~~~v~~~~~~~~~f~~~l~~~~~~--~~~~r~~~y~s~~~V~~~FG~~S~ey~aA~~yFs~~~~q~p~ 78 (494) T protein:vir:94 1 MPNIPISQIVSINPQVVSAGGTQGTLDGLLLTQATG--FPVTQPQVYFSAADVGTAFGLTSDEYNAALVYFAGILGGGQQ 78 (494) T ss_pred CCCCCcccEEEeeeeccccCCcccccceeEeecCcc--CCccceeeecCHHHHHHhcCCChHHHHHHHHHhhhccCCCcc Confidence 66 999999999999999999999999888777654 3468999999999999999999999999999999 9999 Q ss_pred cceEEEEeccCc-------------------------------------------------------------------- Q lcl|NC_020841. 80 PRDLMIATVTAL-------------------------------------------------------------------- 91 (367) Q Consensus 80 p~~v~v~~~~~~-------------------------------------------------------------------- 91 (367) |.+|+|+||..+ T Consensus 79 P~~l~igR~~~~a~~~~l~g~~~~~tl~~~~~~~g~l~iti~g~~~~~~i~lS~~ts~~~vA~~i~~ai~~a~~~v~~d~ 158 (494) T protein:vir:94 79 PASLTIGRYASAATSAAVFGAPLTLSLAQLQTLSGTLIVTTDTQRTSAAINLSGATSFANAASLMTSGFTTPNFAITYDA 158 (494) T ss_pred ccEEEEEeecCccccceeeccchhhhHHhhhhcceEEEEEEcceEEEeeecccccCChhhHHHHHhhhhccccceEEEcc Confidence 999999875421 Q ss_pred ----------------------------------------------cchHHHHHHHHhcccCcEEEEEEecCCHHHHHHH Q lcl|NC_020841. 92 ----------------------------------------------TDPLASIGEVAAKTLGFYAFCFASEVAAADIQGL 125 (367) Q Consensus 92 ----------------------------------------------~t~~~~l~~~~~~~~~w~~~~~~~~~~~~~~~al 125 (367) +++.+++.++.+.+.+||+|.++++.++++++++ T Consensus 159 ~~~~f~v~s~ttG~~s~is~~t~~~a~~l~lt~~~~a~v~~~g~~aet~~~a~~a~~~~~~~Wy~f~~~~~~~~~~ilal 238 (494) T protein:vir:94 159 QRRRFVLSTTATGTTASVSAVTGTLADGVGLSTASGAYVEGSGLAADTAASALDRLAASSSTWAIFTTAWAASLSDRTAL 238 (494) T ss_pred cCcEEEEEEccCCceeEEEEeccchhhhhhhhccccceEeecCcccccHHHHHHHHHhccCceEEEEEecCCCHHHHHHH Confidence 1233355778888999999999999999999999 Q ss_pred HHHhhccCcEEEEEEeCch-------hhhHHHHHHHHhccccceeecCCchhHHHHHHHHHHHcccccCcceeeeeee-e Q lcl|NC_020841. 126 AEWAQSNNRMFMTVMTDDT-------EAVTTGNALKELGQYHYCITYHEDYATVGAVAGMALDQRYDKTDGVKTLHLK-S 197 (367) Q Consensus 126 a~~~ea~~~~~~~~~~d~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~t~~~k-~ 197 (367) |+|+|+++++|++..++.+ ...++.+.++..++.|++..||++... ++++++.++.+|+..+|++||+|| + T Consensus 239 A~wiea~~~~~~~~~~~~d~~~~~~~~~~~i~~~l~~~~y~~t~~~y~~~~~~-aa~~g~~aa~~~~~~~g~~T~~~k~q 317 (494) T protein:vir:94 239 AQWTSDQVFRRIYAAWDQDAAGLSVNNVSSFGNIVKTTPFSNTIPVYGLLANA-MIVLAWGASTNLQIAEGRTTLALRSP 317 (494) T ss_pred HHHHhhcCccEEEEEecCCcceeecccchhHHHHHHhhcCCceEEEcCCCChH-HHHHHHHHhccccccCcceeEEeecc Confidence 9999999998887765432 245677889999999999988876654 577899999999999999999999 6 Q ss_pred cCcccccCCCHHHHHHHHhCCceEEEEeeccccceEEEecCEeeCCc--hhhHHHHHHHHHHHHHHHHHHHHHhcCCCCc Q lcl|NC_020841. 198 LVSVVSTDISQTQAASLKAACINYYSDYGNPDNSLPIFANGHAGGGK--FFDFVMGFDWLRNVIETNVFNGQRLRRLTPQ 275 (367) Q Consensus 198 l~Gv~~~~~t~t~~~~l~~~~~n~y~~~~~~~~~~~~~~~G~~~~G~--~iD~~~~~dwl~~~lq~~l~~ll~~~~kipy 275 (367) ++|++|++++.+|+++|+++|||||+.+++.++.+.|+++|++ +|+ |||.+++++||+++||.+|++||.+++|||| T Consensus 318 ~~gi~~~~l~~t~a~al~~~~~N~y~~~~~~~~~~~~~~gg~~-sG~~~~id~~~~~~WL~~~iq~~l~~ll~~~~KIPy 396 (494) T protein:vir:94 318 VSSAGVRVDNLANANALLSNGYTYLGKYASATNTYTVTYNGAI-GGQFLWADTALGWIALRRNLQQALFETLLAYRSLPY 396 (494) T ss_pred CCCCCCccCCHHHHHHHHhcCCeEEEEecccCceEEEecCcee-ccccceeeeeccHHHHHHHHHHHHHHHHHhCCCccc Confidence 8999999999999999999999999999998888877777765 554 7999999999999999999999999999999 Q ss_pred CHhHHHHHHHHHHHHHHHHHhcCceecccccCcc--------ccccccccccccceeEEcCchHhCCHHHHhccccCCeE Q lcl|NC_020841. 276 TDRGMMMIKADIVNGLEEAVKAGLVAAGTWNGAA--------LGEIETYDYLPTGYYVYNESIRDQAQVIREQRIAPPFI 347 (367) Q Consensus 276 ~~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~--------~g~~~~~~~~~~gy~v~~~~~~~~~~~dr~~R~~~~~~ 347 (367) |+.|+++|+++|+++|+++++||+|+||+|+++. +|....++++++|||++. ...+++++|++|.+|+++ T Consensus 397 td~G~~~l~a~i~~~l~~av~nG~I~~Gv~~~~~q~~~i~~~~G~~~~~~~~~kGyy~~~--~~~~s~~~ra~R~~~~~~ 474 (494) T protein:vir:94 397 NADGYNALYQGAQDVVSQFVAAGVIRAGVALSASQRAQIDQAAGVPISGDVVDKGWYLQV--IDPITTTVRTDRGSPTVN 474 (494) T ss_pred ChhhHHHHHHHHHHHHHHHHhCceeecccccCcchhhhhhhhhcCccccceeccceeeec--cCCCChhhhhccccCCce Confidence 9999999999999999999999999999999987 888999999999999995 345789999999999999 Q ss_pred EEEEECceEEEEEEEEEecC Q lcl|NC_020841. 348 ILVKGAGAIHDTDITLIPEA 367 (367) Q Consensus 348 ~~~~~agaIh~v~i~~~v~~ 367 (367) |+|+++||||+|+|+.++== T Consensus 475 ~~y~~~GAIh~v~i~~~~v~ 494 (494) T protein:vir:94 475 FWYCDGGSIQRVVVSATTVI 494 (494) T ss_pred EEEEecCcEEEEEEeeEEeC Confidence 99999999999999665444 No 9 >protein:vir:80052 Length: 331 # NCBI annotation: gp14 # Family: family:all:396 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468718;genbank:gi:157325298;genbank:GeneID:5601743 Probab=100.00 E-value=5.2e-90 Score=510.07 Aligned_cols=327 Identities=17% Similarity=0.221 Sum_probs=281.9 Q ss_pred cccceEEEEEeeec-cccccccccceEEEeeccccCcccceEEEecHHHHHhccCCCcHHHHHHHHHhccCcccceEEEE Q lcl|NC_020841. 8 PINMLVNVSIEYQA-KLLSRDAFNRLLIVGSTAPNGRATDTGIYTSIDGVKLDYGVEADEYKIAQKYFSQNPKPRDLMIA 86 (367) Q Consensus 8 ~i~~iv~V~i~~~~-~~~~~~~fg~~li~~~~~~~~~~~~~~~yts~~~v~~df~~~s~~ykaA~~~F~Q~p~p~~v~v~ 86 (367) =|+|||+|+|++.. ++.+..+||.+++|...+ .+|+|+|+++++|+.||++++++||+|.++|+|+|+|.+++++ T Consensus 1 ~~~~iv~V~v~~~~~~~~~~~~~~~~~~~~~~t----~~~~~~y~s~~~v~~d~~~~~~~Ykaa~~~f~Q~~~~~~i~v~ 76 (331) T protein:vir:80 1 MVETITDVRVHISVLYPSPRIGLGRPAIFVKGT----AMGYKEYTTLEELKDTFADNTEVYAKAKAVFLQKDRPDTVAVI 76 (331) T ss_pred CccceecceeeecccccccccccCcceeEEecc----ccceEEEechhhhccCCCCCcHHHHHHHHHHhccCccceEEEe Confidence 38999999999984 566677788777776543 4789999999999999999999999999999999999999999 Q ss_pred eccCccchHHHHHHHHhcccCcEEEEEEecCCHHHHHHHHHHhhccCcEEEEEEeCchhhhHHHHHHHHhccccceeecC Q lcl|NC_020841. 87 TVTALTDPLASIGEVAAKTLGFYAFCFASEVAAADIQGLAEWAQSNNRMFMTVMTDDTEAVTTGNALKELGQYHYCITYH 166 (367) Q Consensus 87 ~~~~~~t~~~~l~~~~~~~~~w~~~~~~~~~~~~~~~ala~~~ea~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~ 166 (367) ++..... +.++.+..++||+|+.+.+.++++++++|+|+|+++++|+++.+++.. +..+..++.+++..+| T Consensus 77 ~~~~~~~----~~a~~a~~~~~w~~~~~~~~~~~~~~a~a~~~~a~~~~f~~~~~~~~~-----~~~~~~~~~~t~~~~~ 147 (331) T protein:vir:80 77 TYEDTKL----LEAAEAYFLKSWHFALLAEFKAADALALSNLIEEQKFKFAVFQVTAVA-----DITPLAKNTRTIAIVH 147 (331) T ss_pred ccchHHH----HHHHHHhccCceeEEEeecCCHHHHHHHHHHHhhCCcEEEEEecCchH-----HHHHhhccccEEEEEc Confidence 8866433 344444445555566777889999999999999999999988765433 3344556677777776 Q ss_pred Cchh--HHHHHHHHHHHcccccCcceeeeeeee-cCcccccCCCHHHHHHHHhCCceEEEEeeccccceEEEecCEeeCC Q lcl|NC_020841. 167 EDYA--TVGAVAGMALDQRYDKTDGVKTLHLKS-LVSVVSTDISQTQAASLKAACINYYSDYGNPDNSLPIFANGHAGGG 243 (367) Q Consensus 167 ~~~~--~~~~~~~~~~~~~~~~~~g~~t~~~k~-l~Gv~~~~~t~t~~~~l~~~~~n~y~~~~~~~~~~~~~~~G~~~~G 243 (367) +... ..++++|++ ++..+|++||+||+ |+||+|++++.+|+++|+++|||||+++++ +.++++|++++| T Consensus 148 ~~~~~~~~aa~~g~~----~~~~~g~~t~~fk~~l~GV~~~~lt~t~~~al~~~~~N~y~~~~~----~~~~~~G~~~~G 219 (331) T protein:vir:80 148 SKTGEKLDAALIGNV----ASLPVGSATWKGRHGLAGITSEELKVSEIDAIQKAGGMCYIEKAG----IAQTSEGKTVSG 219 (331) T ss_pred CCccchhHHHHHHHH----HhcCccceeeeeecccCCCCCCCCCHHHHHHHHhcCceEEEEecC----eeEEecceEeCc Confidence 5432 334444444 45667999999996 899999999999999999999999999864 468999999999 Q ss_pred chhhHHHHHHHHHHHHHHHHHHHHHhcCCCCcCHhHHHHHHHHHHHHHHHHHhcCceecccccCccccccccccccccce Q lcl|NC_020841. 244 KFFDFVMGFDWLRNVIETNVFNGQRLRRLTPQTDRGMMMIKADIVNGLEEAVKAGLVAAGTWNGAALGEIETYDYLPTGY 323 (367) Q Consensus 244 ~~iD~~~~~dwl~~~lq~~l~~ll~~~~kipy~~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~g~~~~~~~~~~gy 323 (367) +|||++||+|||+++||++|++||.+++|||||+.|+++|+++|+++|++++++|+|+||++++++ || T Consensus 220 ~~iD~~~~~dWl~~~lq~~l~~ll~~~~kiPy~~~G~~~l~a~i~~~~~~av~~G~I~~g~~~~~~------------~~ 287 (331) T protein:vir:80 220 EFIDSIHGDDWIKATIETRLQKLLTETDKLTFDARGIALLQSELTTVLNEGFANGIIDSNDETGEP------------NF 287 (331) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHhCCCCccChhhHHHHHHHHHHHHHHHHhCCceecCccCCCc------------ce Confidence 999999999999999999999999999999999999999999999999999999999999998764 69 Q ss_pred eEEcCchHhCCHHHHhccccCCeEEEEEECceEEEEEEEEEecC Q lcl|NC_020841. 324 YVYNESIRDQAQVIREQRIAPPFIILVKGAGAIHDTDITLIPEA 367 (367) Q Consensus 324 ~v~~~~~~~~~~~dr~~R~~~~~~~~~~~agaIh~v~i~~~v~~ 367 (367) +|+.|++++++++||++|++||++|+|+++||||+|+|++||+= T Consensus 288 ~v~~~~~~~~s~~dr~~R~~~~~~~~~~~~gaI~~v~i~~~v~~ 331 (331) T protein:vir:80 288 SITALQRSDLNDDDIAKRNYKGLSFRYKRSGAIHSVDVYGEVEV 331 (331) T ss_pred EEEeCchhcCCHHHHhccCCCCeEEEEEEcceEEEEEEEEEEeC Confidence 99999999999999999999999999999999999999999888 No 10 >protein:vir:95263 Length: 450 # NCBI annotation: Phage conserved protein # Family: family:all:396 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944896;genbank:gi:38707836;genbank:GeneID:2744049 Probab=100.00 E-value=7.7e-90 Score=509.13 Aligned_cols=332 Identities=22% Similarity=0.262 Sum_probs=297.0 Q ss_pred cccceEEEEEeeeccccccccccceEEEeeccccCcccceEEEecHHHHHhccCCCcHHHHHHHHHhccCcccceEEEEe Q lcl|NC_020841. 8 PINMLVNVSIEYQAKLLSRDAFNRLLIVGSTAPNGRATDTGIYTSIDGVKLDYGVEADEYKIAQKYFSQNPKPRDLMIAT 87 (367) Q Consensus 8 ~i~~iv~V~i~~~~~~~~~~~fg~~li~~~~~~~~~~~~~~~yts~~~v~~df~~~s~~ykaA~~~F~Q~p~p~~v~v~~ 87 (367) --|+||||+|++.+.+++.++|+.+|+++++.. ..+|.+.|+++++|+.+||.+||+||+|..||+|.|+|.+++|+| T Consensus 1 ~~s~iVnV~i~~~~~a~~~~~f~~~l~~~~~~~--~~~r~~~yss~~~V~~~FG~~S~ey~aA~~yF~q~p~p~~l~igr 78 (450) T protein:vir:95 1 MWNPIVNVDITLNTAGTTREGFGLPLFLASTDN--FEERVRGYTSLTEVAEDFDENTAAYKAAKQLWSQTPKVTQLYIGR 78 (450) T ss_pred CCCceEEEeecccccccccccceeEEEEcCCCC--CccceeeecCHHHHHHhcCCCcHHHHHHHHHHhCCCcccEEEEEe Confidence 149999999999999999999999999999875 459999999999999999999999999999999999999999986 Q ss_pred ccCc---------------------------------------------------------------------------- Q lcl|NC_020841. 88 VTAL---------------------------------------------------------------------------- 91 (367) Q Consensus 88 ~~~~---------------------------------------------------------------------------- 91 (367) |..+ T Consensus 79 ~~~~~t~~~~~~~~~~~~g~lt~tv~G~~~~~~~i~~s~a~s~~~va~~~~tai~~~~~~~~~~~~~s~g~~~~~t~~~~ 158 (450) T protein:vir:95 79 RAMQYTVSIPDAVTESTDYSITVAAGGGISQPYQYTAQSSDTAENVLQQFKTQIEADPTIKDKVSVNVTGSNGSATMIIA 158 (450) T ss_pred eccchhhhhhhhhccccceeEEEEecceeeeeeEEEEEecCChhhHHHHhhhhhcccceeeeeeeeeeecccceeeeeee Confidence 5211 Q ss_pred ------------------------cchHHHHHHHHhcccCcEEEEEEecCCHHHHHHHHHHhhccCcEEEEEEeCchh-- Q lcl|NC_020841. 92 ------------------------TDPLASIGEVAAKTLGFYAFCFASEVAAADIQGLAEWAQSNNRMFMTVMTDDTE-- 145 (367) Q Consensus 92 ------------------------~t~~~~l~~~~~~~~~w~~~~~~~~~~~~~~~ala~~~ea~~~~~~~~~~d~~~-- 145 (367) +++.+++.++.+.+.+||+|+ +.+.++++++++|+|+++++++|+++..+... T Consensus 159 ~~~~~~~~~l~~~~~~~~~~g~~aet~~~a~~a~~~~~~~w~~~~-~~~~~~~~i~a~a~w~~a~~~~f~~~~~~~~~~~ 237 (450) T protein:vir:95 159 KAGDNDFVKVTTTAQTVYIASTTADTASTALAAIEAYSTDWYFIA-AEDRTQQFVLAMASEIQARKKIFFTANSDVTALQ 237 (450) T ss_pred ccccchhhccccccceeEecccccccHHHHHHHHHHhhCCeEEEE-ecCCCHHHHHHHHHHHhhcCcEEEEEcCCchhhh Confidence 123445567888899999876 45678999999999999999999998876442 Q ss_pred ------hhHHHHHHHHhccccceeecCCchhHHHHHHHHHHHcccccCcceeeeeeeecCccccc-------CCCHHHHH Q lcl|NC_020841. 146 ------AVTTGNALKELGQYHYCITYHEDYATVGAVAGMALDQRYDKTDGVKTLHLKSLVSVVST-------DISQTQAA 212 (367) Q Consensus 146 ------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~t~~~k~l~Gv~~~-------~~t~t~~~ 212 (367) ..++++.++..++.|++..||++... .++++++++.+|+..||++|||||+++||+|+ +|+.+|++ T Consensus 238 ~~~~~~~~~i~~~l~~~~~~~t~~~y~~~~~~-~~~~aa~~g~~~~~~~g~~T~~fk~l~Gv~~~v~~~~~~~lt~~~~~ 316 (450) T protein:vir:95 238 GTELASANDVPAQLAKNMYTRTVCLWHHAAAE-DYPEMAYIAYGAPYDAGSIAWGNAQLTGVAASLQPSNQRPLTSIQKS 316 (450) T ss_pred hhhhhcccchHHHHHhccCCeeEEEeeCCCch-hHHHHHHHHHhhhcccceeeeccccccceeeeccCccccccchHHHH Confidence 34567889999999999999875433 35678888899999999999999999999996 58999999 Q ss_pred HHHhCCceEEEEeeccccceEEEecCEeeCCchhhHHHHHHHHHHHHHHHHHHHHHhc--CCCCcCHhHHHHHHHHHHHH Q lcl|NC_020841. 213 SLKAACINYYSDYGNPDNSLPIFANGHAGGGKFFDFVMGFDWLRNVIETNVFNGQRLR--RLTPQTDRGMMMIKADIVNG 290 (367) Q Consensus 213 ~l~~~~~n~y~~~~~~~~~~~~~~~G~~~~G~~iD~~~~~dwl~~~lq~~l~~ll~~~--~kipy~~~G~~~l~~~v~~v 290 (367) +|+++|||||+.+.+ +.++++|++++|+|||++||+|||+++||++|++||.++ +|||||+.|+++|+++|+++ T Consensus 317 al~~~~~n~y~~~~~----~~~~~~G~~~~G~~iD~~~~~~wl~~~iq~~l~~ll~~~~~~KiPy~~~G~~~i~a~i~~~ 392 (450) T protein:vir:95 317 ALDVRHCNFIDLDGG----VPVVRRGITSGGEWIDIIRGVDWLESDLKTSLRDLLINQKGGKITYDDTGITRIRQVIETS 392 (450) T ss_pred HHHhCCcEEEEEecC----ceeeeCCeeeCcchhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCccChhhHHHHHHHHHHH Confidence 999999999999854 458999999999999999999999999999999999875 48999999999999999999 Q ss_pred HHHHHhcCceecccccCccccccccccccccceeEEcCchHhCCHHHHhccccCCeEEEEEECceEEEEEEEEEecC Q lcl|NC_020841. 291 LEEAVKAGLVAAGTWNGAALGEIETYDYLPTGYYVYNESIRDQAQVIREQRIAPPFIILVKGAGAIHDTDITLIPEA 367 (367) Q Consensus 291 l~~a~~~G~I~~g~~~~~~~g~~~~~~~~~~gy~v~~~~~~~~~~~dr~~R~~~~~~~~~~~agaIh~v~i~~~v~~ 367 (367) |+++++||+|+ ||+|+.|++++++++||++|++|+++|+|+|+||||.++|++||.= T Consensus 393 l~~a~~~G~Ia--------------------~~~V~~~~~~~~~~~dr~~R~~~~i~~~~~laGAIh~~~i~~~v~~ 449 (450) T protein:vir:95 393 LQRAVNRNFLS--------------------SYTVNVPKASQVALADKKARILKDVTFAGILAGAILDVDLKGTVAY 449 (450) T ss_pred HHHHHhcCccc--------------------ceeEecCChHhcCHHHHhccCCCCeeEEEEEccceEEEEEEEEEEe Confidence 99999999995 5899999999999999999999999999999999999999999988 No 11 >protein:vir:107720 Length: 515 # NCBI annotation: gp17 # Family: family:all:396 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024865;genbank:gi:48697507;genbank:GeneID:2948336 Probab=100.00 E-value=6.2e-87 Score=493.19 Aligned_cols=357 Identities=15% Similarity=0.137 Sum_probs=293.4 Q ss_pred ccccceEEEEEeeecc---ccccccccceEEEeeccccCcccceEEEecHHHHHhccCCCcHHHHHHHHHhc----cCcc Q lcl|NC_020841. 7 LPINMLVNVSIEYQAK---LLSRDAFNRLLIVGSTAPNGRATDTGIYTSIDGVKLDYGVEADEYKIAQKYFS----QNPK 79 (367) Q Consensus 7 l~i~~iv~V~i~~~~~---~~~~~~fg~~li~~~~~~~~~~~~~~~yts~~~v~~df~~~s~~ykaA~~~F~----Q~p~ 79 (367) |||+.+++|+|++++. ++..++|+ +|+|++.+.++. +|++.|+++++|+.+||.+||+||+|+.||+ |.|+ T Consensus 1 m~I~~~~~V~i~~~v~aa~~~~~~~f~-~li~t~~~~~p~-~r~~~y~s~~~V~~~FG~~S~ey~aA~~yFsg~~~q~p~ 78 (515) T protein:vir:10 1 MPISFDKYVAITSGVAAQQQIAARSFA-IRVYTPNPMVSV-DRLITATSAADVGAYFGTASEEYKRAVKNFGFISKKTRR 78 (515) T ss_pred CCCCceeEEEeecccccCCccccccce-eeeeecccCCCc-cceeeecCHHHHHHhcCCChHHHHHHHHHhhhccCCccc Confidence 7889999999997764 44457888 578888887755 8999999999999999999999999999999 9999 Q ss_pred cceEEEEeccCc-------------------------------------------------------------------- Q lcl|NC_020841. 80 PRDLMIATVTAL-------------------------------------------------------------------- 91 (367) Q Consensus 80 p~~v~v~~~~~~-------------------------------------------------------------------- 91 (367) |.+|+|+||..+ T Consensus 79 P~~L~igR~~~~a~~~~l~g~~~~~~~l~~~~~is~G~ltitidG~~~~t~s~i~~S~ats~~~vAs~i~tal~~~~~~~ 158 (515) T protein:vir:10 79 PTSIQFARWQREAGPVAIYGGAKKAAALATLQAVTAGAISFLFGGATTVTVSGISFSAATSLADVASELQTALRANADAN 158 (515) T ss_pred ccEEEEEeccCcccceEEEeccchhhhHHhhhcccceeEEEEEcceEEEEeeccccccccCHHHHHHHHHhhhccccccc Confidence 999998763100 Q ss_pred --------------------------------------------------------------cchHHHHHHHHhcccCcE Q lcl|NC_020841. 92 --------------------------------------------------------------TDPLASIGEVAAKTLGFY 109 (367) Q Consensus 92 --------------------------------------------------------------~t~~~~l~~~~~~~~~w~ 109 (367) +++.+++.++.+.+.||| T Consensus 159 ~~~~tv~~d~~~~~F~v~s~~tG~~~~is~~~~t~~~~~t~~a~~lglt~~~~av~~~g~aaet~~~a~~a~~~~s~nWy 238 (515) T protein:vir:10 159 LATCTVSYDPVGARFNFAGSPSDDTVQESISIVPQSNPAIDVAQLLGWNSAQGASYIAASPVVSPVDTLIASVAGNNNFG 238 (515) T ss_pred cceeEEEEecCCCeEEEEEeecCCceeEEEEEecCCCchhhHHHHhccccccceEEecccccccHHHHHHHHHhccCCeE Confidence 013456778888899999 Q ss_pred EEEEEec----CCHHHHHHHHHHhhccCcEEEEEEeCchhhhH-HHHHHHHhcccc-ceeecC-CchhHHHHHHHHHHHc Q lcl|NC_020841. 110 AFCFASE----VAAADIQGLAEWAQSNNRMFMTVMTDDTEAVT-TGNALKELGQYH-YCITYH-EDYATVGAVAGMALDQ 182 (367) Q Consensus 110 ~~~~~~~----~~~~~~~ala~~~ea~~~~~~~~~~d~~~~~~-~~~~~~~~~~~~-~~~~~~-~~~~~~~~~~~~~~~~ 182 (367) +|++.++ .+++++++++.|+++++++|++...+...... .....+....+. ....++ ...+++++++|+++++ T Consensus 239 ~f~~a~~~~~~~~~a~~~a~a~~~e~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~asv 318 (515) T protein:vir:10 239 SILFTKNGGTGITLSDAEAIALQNQSYNVAYKFQVGVDDTTYSSWQAALAAIGGVNMIYSPVALAAEYHDMQDGIIEAAT 318 (515) T ss_pred EEEEeecCccccchhHHHHHHHHHhhcCceEEEEeccCccceechhhhhhhhhhcCceEEEEeccCcchHHHHHHHHHhc Confidence 9998764 35789999999999999999887765433221 111222223222 222222 2234566889999999 Q ss_pred ccccCcceeeeeeeecCcccccCCCHHHHHHHHhCCceEEEEeeccccceEEEecCEeeCCc----hhhHHHHHHHHHHH Q lcl|NC_020841. 183 RYDKTDGVKTLHLKSLVSVVSTDISQTQAASLKAACINYYSDYGNPDNSLPIFANGHAGGGK----FFDFVMGFDWLRNV 258 (367) Q Consensus 183 ~~~~~~g~~t~~~k~l~Gv~~~~~t~t~~~~l~~~~~n~y~~~~~~~~~~~~~~~G~~~~G~----~iD~~~~~dwl~~~ 258 (367) +|++.+|++|||||+++||+|++++++|+++|+++|||||+.|++.++++.|++||+|+||+ |||++||+|||+++ T Consensus 319 nf~~~ng~iT~kfKq~~Gita~~lt~t~a~al~~~~~N~Y~~~~~~~~~~~~~~~G~~~gG~~~~~WiD~~~g~~WL~~~ 398 (515) T protein:vir:10 319 DFTQQGGATGYMYVQFNNQTPAVNDDTLSGILDDLNINYYGQTQVNGTNLSFYQDGVMMGGPTDPRDSNVYANEQWLKSY 398 (515) T ss_pred CCCccchhheeccccCCCCccccCCHHHHHHHHhcCCeEEEEEeccCceEEEEeCCeeeCCccchhHHHHHhhHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999996 79999999999999 Q ss_pred HHHHHHHHHHhcCCCCcCHhHHHHHHHHH-HHHHHHHHhcCceecccccCcc--------cccc-ccccccccceeEEcC Q lcl|NC_020841. 259 IETNVFNGQRLRRLTPQTDRGMMMIKADI-VNGLEEAVKAGLVAAGTWNGAA--------LGEI-ETYDYLPTGYYVYNE 328 (367) Q Consensus 259 lq~~l~~ll~~~~kipy~~~G~~~l~~~v-~~vl~~a~~~G~I~~g~~~~~~--------~g~~-~~~~~~~~gy~v~~~ 328 (367) ||++|++||.+++|||||+.|+++|++.| +++|++|++||+|+||++++.. .|.. ..++++.+|||++++ T Consensus 399 iq~~l~~L~~s~~KIPytd~G~a~i~a~v~q~vl~~av~nG~I~~Gv~ls~~Q~~~i~~~~g~d~~~~~~~~~Gyy~~~~ 478 (515) T protein:vir:10 399 AGASFMSLQLAQGKIPANIEGRGLLLGKMTKDIIPAAKLNGTFSIGKTLTVDQQLFVTELTGDDTAWQKVQNLGYWYDVQ 478 (515) T ss_pred HHHHHHHHHhcCCCCccChhhHHHHHHHHHHHHHHHHHhCCeeecCcccchhHHHHHHhhhcCcccccchhhcceeEecC Confidence 99999999999999999999999999987 5799999999999999998876 3444 456899999999999 Q ss_pred chHhCCHHHHhccccCCeEEEEEECceEEEEEEEEEec Q lcl|NC_020841. 329 SIRDQAQVIREQRIAPPFIILVKGAGAIHDTDITLIPE 366 (367) Q Consensus 329 ~~~~~~~~dr~~R~~~~~~~~~~~agaIh~v~i~~~v~ 366 (367) +++.+...+|..+ .+++.|||+++|+||+|+++.+.= T Consensus 479 ~~~~~~~~~r~~~-~~~~~~~y~~g~~i~~i~~~~~~v 515 (515) T protein:vir:10 479 ISSFVDTGGTTKY-QAVYSLVYSKDDLIRKVVGTHTLI 515 (515) T ss_pred cCCCCCccccccc-CceeEEEEEcCceEEEEEeeeecC Confidence 8765554443333 234679999999999999976655 No 12 >protein:vir:3165 Length: 426 # NCBI annotation: capsid protein CP67 # Family: family:all:28419 # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665936;genbank:gi:22091122;genbank:GeneID:951267 Probab=100.00 E-value=2.3e-64 Score=369.45 Aligned_cols=339 Identities=15% Similarity=0.096 Sum_probs=241.9 Q ss_pred ccccceEEEEEeeeccccccccccceEEEeeccccCc---ccceEEEecHHHHHhccCCCcHHHHHHHHHhccCcccceE Q lcl|NC_020841. 7 LPINMLVNVSIEYQAKLLSRDAFNRLLIVGSTAPNGR---ATDTGIYTSIDGVKLDYGVEADEYKIAQKYFSQNPKPRDL 83 (367) Q Consensus 7 l~i~~iv~V~i~~~~~~~~~~~fg~~li~~~~~~~~~---~~~~~~yts~~~v~~df~~~s~~ykaA~~~F~Q~p~p~~v 83 (367) || ++||+|+|+++++++..++||.|||||.|+..+. .+|.+-|+++++|..||++++|+||+|.++|+|++...+. T Consensus 1 m~-~~iVnV~Is~~t~A~~~~~Fg~~liigs~~~~~p~~~f~~~~~Yss~~~V~~Dfg~~s~~Y~AA~~~f~Q~~~~~r~ 79 (426) T protein:vir:31 1 MP-KQIVEIELTAEIADRPQETFTDAAIVGTAEEEPPDAEFGEVNQYSTSTSVGDDYGEDSDVYTASEAIEEMGAEQWRV 79 (426) T ss_pred CC-cceEEEEeecccccccccccceeeeeeeccccccccccchhhhhhhHHHHHhcCCCChHHHHHHHHHHhCCceeEEe Confidence 67 8999999999999999999999999999987653 4466779999999999999999999999999999765554 Q ss_pred EEEecc----------------------C-ccchHHHH-------------------------------HHHHhcccCcE Q lcl|NC_020841. 84 MIATVT----------------------A-LTDPLASI-------------------------------GEVAAKTLGFY 109 (367) Q Consensus 84 ~v~~~~----------------------~-~~t~~~~l-------------------------------~~~~~~~~~w~ 109 (367) .+.... . ..+..... ..+.....+|+ T Consensus 80 ~v~~at~~~~~~~t~~~tv~g~~~s~~a~~~~~a~~i~~~~~~~~~~~~~~~~~~~~t~~g~~t~~~~~~~~~~s~~dw~ 159 (426) T protein:vir:31 80 MVLEATEVTEEELSDGDTIDKVPILGNHEVESPDGDIEFTTDDDPDVEDFDAEIVINSATGDVATSEDSIELTYFHADWS 159 (426) T ss_pred eccccceeeeccCCcceeecceeeeecccCcchHHHHHHhhccccccccceeeeEeccccceeeccccceeeeeccCcch Confidence 222110 0 00111111 11223455676 Q ss_pred EEEEEe-cCC-------------HHHHHHHHHHhhccCcEEEEEEeCch----hhhHHHHHHHHhccccc--eeecC--C Q lcl|NC_020841. 110 AFCFAS-EVA-------------AADIQGLAEWAQSNNRMFMTVMTDDT----EAVTTGNALKELGQYHY--CITYH--E 167 (367) Q Consensus 110 ~~~~~~-~~~-------------~~~~~ala~~~ea~~~~~~~~~~d~~----~~~~~~~~~~~~~~~~~--~~~~~--~ 167 (367) .+.-+. ..+ ..+..++..|.+++.++++....+.. ...+...+.+..++... ...+. . T Consensus 160 ~~~~~~s~~~~~~ia~~~~~~~~~~~~~~~~~wa~~~~i~~va~~~e~~~~~~~~~~~a~~~~~~~y~p~~~~~~~~~~~ 239 (426) T protein:vir:31 160 QLDEFPSDVNNFAVADRRFDLKGVGVLDETHSWASDEDMGMIANGVNVDDYDSVDEAMDVAHEVAGYVPSGDLMMIVDAS 239 (426) T ss_pred hhhcccccchhhhhhccccchhhhhhhHhhhhhhhhcceeeeeeccchhhhcchhhhhhhhhcccccccchhheeehhcc Confidence 653211 111 11223466777777777665554432 12334555555555322 11111 2 Q ss_pred chhHHHHHHHHHHHccc-------ccCcceeeeeeeecCcccccCCCHHHHHHHHhCCceEEEEeecccc-ceEEEecCE Q lcl|NC_020841. 168 DYATVGAVAGMALDQRY-------DKTDGVKTLHLKSLVSVVSTDISQTQAASLKAACINYYSDYGNPDN-SLPIFANGH 239 (367) Q Consensus 168 ~~~~~~~~~~~~~~~~~-------~~~~g~~t~~~k~l~Gv~~~~~t~t~~~~l~~~~~n~y~~~~~~~~-~~~~~~~G~ 239 (367) .....+..++..++..+ +...+...++|++.+|+... +..+++..+ .+++|.|+.+.+.+. ...+..+|+ T Consensus 240 ~~~~~~~~~~~~aa~~~~~~~~~~~~~~~~~~~~~~~~~gv~~t-~~~~~~A~~-~~~~n~~~~~~~~~~i~~~~~~~G~ 317 (426) T protein:vir:31 240 DDDLAAYQLGKFAVSEPWYNPLWNELPAGETVSKNVGDPEEQGT-FEGGDEAEG-EGPVNVLIDVSDANRVSNAVTTAGA 317 (426) T ss_pred ccchhhHHhhhhhhhccccchhhhhccccccceeeccccccccc-cchhhhhhh-cCCceEEEEecCceeeecceeeccc Confidence 23334455565555543 23344566788888888733 333344444 588899998865431 334577899 Q ss_pred eeCCchhhHHHHHHHHHHHHHHHHHHHHHhcCCCCcCHhHHHHHHHHHHHHHHHHHhcCceecccccCcccccccccccc Q lcl|NC_020841. 240 AGGGKFFDFVMGFDWLRNVIETNVFNGQRLRRLTPQTDRGMMMIKADIVNGLEEAVKAGLVAAGTWNGAALGEIETYDYL 319 (367) Q Consensus 240 ~~~G~~iD~~~~~dwl~~~lq~~l~~ll~~~~kipy~~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~g~~~~~~~~ 319 (367) +++|+|||++|++|||+++||++|++||.|.+|||||+.|+++|++.|+.+|+++++.|. .+ T Consensus 318 ~~~G~~iD~~~g~dwl~~~iq~~l~~ll~~~~KIpyt~~Gi~~I~~~i~~~L~~~v~~~g------------------~~ 379 (426) T protein:vir:31 318 DSDTSFFDIRRTKVYTAEMLELDLESLQVSDDDVPFTEDGQAMIEDAIKGTMSGLTGSVG------------------QP 379 (426) T ss_pred ccchhhhhhHHHHHHHHHHHHHHHHHHhhcCCCCccchhHHHHHHHHHHHHHHHHhcCCC------------------cc Confidence 999999999999999999999999999999999999999999999999999999997553 23 Q ss_pred ccceeEEcCchHhCCHHHHhccccCCeEEEEEECceEEEEEEEEEecC Q lcl|NC_020841. 320 PTGYYVYNESIRDQAQVIREQRIAPPFIILVKGAGAIHDTDITLIPEA 367 (367) Q Consensus 320 ~~gy~v~~~~~~~~~~~dr~~R~~~~~~~~~~~agaIh~v~i~~~v~~ 367 (367) .++|+|..|.+++++ +||++|++++++|.|+|+||||.++|.++|+= T Consensus 380 ~~~y~v~~P~~~~~~-~dra~R~~~~i~~~~~laGAIh~v~I~g~v~v 426 (426) T protein:vir:31 380 LAEYEVDVPEWDDDD-VDRVNRNWGGIDLDARLAQRAHTFSLGLNVSV 426 (426) T ss_pred ccceeecCCCccccc-hhhhhhccCCceEEEEEeCcEEEEEEEEEEeC Confidence 457999999888865 69999999999999999999999999999888 No 13 >protein:vir:6079 Length: 396 # NCBI annotation: gpFI # Family: family:all:115 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878220;genbank:gi:33438919;genbank:GeneID:1457754 Probab=99.26 E-value=9.5e-11 Score=75.52 Aligned_cols=330 Identities=10% Similarity=-0.025 Sum_probs=200.4 Q ss_pred ccccccceEEEE-EeeeccccccccccceEEEeecccc----CcccceEEEecHHHHHhccCCCcHHHHHHHHHhccCcc Q lcl|NC_020841. 5 LTLPINMLVNVS-IEYQAKLLSRDAFNRLLIVGSTAPN----GRATDTGIYTSIDGVKLDYGVEADEYKIAQKYFSQNPK 79 (367) Q Consensus 5 ~~l~i~~iv~V~-i~~~~~~~~~~~fg~~li~~~~~~~----~~~~~~~~yts~~~v~~df~~~s~~ykaA~~~F~Q~p~ 79 (367) |+-+.-. |.|. ++-.+.++.....+.+.++|..+-. .........++..+....|+.++....+...+|.++.. T Consensus 1 m~~~~~G-v~v~e~~~~~~~v~~~~tav~~fvGta~~~~~~~~p~~~p~~v~s~~~~~~~~g~~~tl~~a~~~~~~~gg~ 79 (396) T protein:vir:60 1 MSDYHHG-VQVLEINEGTRVISTVSTAIVGMVCTASDADAEIFPLNKPVLITNVQSAIAKAGKKGTLAASLQAIADQSKP 79 (396) T ss_pred CCCCCCC-eEEEEcCCCcccccccCceeEEEEecccccccccccCccCeEeechHHHHHhhcCcchhHHHHHHHhhccCc Confidence 5555444 3333 3445678888898999988865421 11123355677777777789999999999999998754 Q ss_pred cceEEEEecc---Cc-----------------cchHHHHHHHHhcccC---cEEEEEE-ecCCHHHHHHHHHHhhccCcE Q lcl|NC_020841. 80 PRDLMIATVT---AL-----------------TDPLASIGEVAAKTLG---FYAFCFA-SEVAAADIQGLAEWAQSNNRM 135 (367) Q Consensus 80 p~~v~v~~~~---~~-----------------~t~~~~l~~~~~~~~~---w~~~~~~-~~~~~~~~~ala~~~ea~~~~ 135 (367) ...+.-.... .. ....+.+..+.....- ....... .........++...++..+.+ T Consensus 80 ~~~vv~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~tg~~al~~~~~~~~~~~~il~ap~~~~~~v~~al~~~~~~~~~~ 159 (396) T protein:vir:60 80 VTVVVRVEDGTGEDEETKLAQTVSNIIGTTDENGQYTGLKALLAAESVTGVKPRILGVPGLDTKEVAVALASVCQKLRAF 159 (396) T ss_pred eEEEEecccccccccccccccccccccccccccccccchhhhhhcccceeeeeeeccccccccHHHHHHHHHHhccCCeE Confidence 4322211100 00 0000111222221111 1111111 122334556666666655544 Q ss_pred EEEEEeCchhhhHHHHHHHHhccccceeecCC-----c-------hhHHHHHHHHHHHcccccCcceeeeeeeecCcccc Q lcl|NC_020841. 136 FMTVMTDDTEAVTTGNALKELGQYHYCITYHE-----D-------YATVGAVAGMALDQRYDKTDGVKTLHLKSLVSVVS 203 (367) Q Consensus 136 ~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~-----~-------~~~~~~~~~~~~~~~~~~~~g~~t~~~k~l~Gv~~ 203 (367) ++.-........+.......++..+..+.+.. . .+..+.++|..+..+...-+ ......|.+.||.. T Consensus 160 ~i~d~p~~~~~~~a~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~AG~~a~~d~~~g~-~~spaN~~l~gi~~ 238 (396) T protein:vir:60 160 GYISAWGCKTISEVKAYRQNFSQRELMVIWPDFLAWDTVASTTATAYATARALGLRAKIDQEQGW-HKTLSNVGVNGVTG 238 (396) T ss_pred EEEeCCCCCCHHHHHHHHhhcCCceEEEEeCceeeecccCCceeEEchhHHHHHHHHHhhhccCc-EeCcCCceecceee Confidence 33322222222222223334444443332221 1 13345667776665544311 22234556666642 Q ss_pred --------cCCCHHHHHHHHhCCceEEEEeeccccceEEEecCEeeCCc----hhhHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_020841. 204 --------TDISQTQAASLKAACINYYSDYGNPDNSLPIFANGHAGGGK----FFDFVMGFDWLRNVIETNVFNGQRLRR 271 (367) Q Consensus 204 --------~~~t~t~~~~l~~~~~n~y~~~~~~~~~~~~~~~G~~~~G~----~iD~~~~~dwl~~~lq~~l~~ll~~~~ 271 (367) ...+.+|++.|..+|+|+.... .++ .+..+.+++++ ||-+.+-.+|++..|+..+...+-. T Consensus 239 ~~~~~~~~~~~~~~~~~~Ln~~gI~~~~~~----~G~-~~wG~rT~~~d~~~~~i~~rR~~~~i~~~i~~~~~~~v~e-- 311 (396) T protein:vir:60 239 ISASVFWDLQESGTDADLLNESGVTTLIRR----DGF-RFWGNRTCSDDPLFLFENYTRTAQVLADTMAEAHMWAVDK-- 311 (396) T ss_pred ceeecccccCCCcchhhhhhhcCcEEEEcC----CCE-EEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhccC-- Confidence 2356789999999999998542 233 46677787773 7889999999999999988876653 Q ss_pred CCCcCHhHHHHHHHHHHHHHHHHHhcCceecccccCccccccccccccccceeEEcCchHhCCHHHHhccccCCeEEEEE Q lcl|NC_020841. 272 LTPQTDRGMMMIKADIVNGLEEAVKAGLVAAGTWNGAALGEIETYDYLPTGYYVYNESIRDQAQVIREQRIAPPFIILVK 351 (367) Q Consensus 272 kipy~~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~g~~~~~~~~~~gy~v~~~~~~~~~~~dr~~R~~~~~~~~~~ 351 (367) |-+..-...|+..|+.-|+...++|.|.. |.++.. .+..+++++.+.+. -+.+.+. T Consensus 312 --~n~~~~~~~i~~~i~~~l~~l~~~gal~g--------------------~~~~~d-~~~nt~~~i~~G~~-~~~i~~~ 367 (396) T protein:vir:60 312 --PITATLIRDIVDGINAKFRELKTNGYIVD--------------------ATCWFS-EESNDAETLKAGKL-YIDYDYT 367 (396) T ss_pred --CCCHHHHHHHHHHHHHHHHHHHhCCceec--------------------eEEEEe-cCCCCHHHhhCCEE-EEEEEEE Confidence 66888899999999999999999999952 445543 56889999999888 4899999 Q ss_pred ECceEEEEEEEEEecC Q lcl|NC_020841. 352 GAGAIHDTDITLIPEA 367 (367) Q Consensus 352 ~agaIh~v~i~~~v~~ 367 (367) ....++.|.+.+..+. T Consensus 368 p~~pae~I~~~~~~~~ 383 (396) T protein:vir:60 368 PVPPLENLTLRQRITD 383 (396) T ss_pred ecCCcceEEEEEEEch Confidence 9999999998777666 No 14 >protein:vir:1845 Length: 392 # NCBI annotation: tail sheath protein FI # Family: family:all:115 # MgeID: mge:324 # MgeName: 186 # Cross-refs: genbank:acc:NP_052269;genbank:gi:9634076;genbank:GeneID:1262448 Probab=99.23 E-value=1.9e-10 Score=73.83 Aligned_cols=330 Identities=11% Similarity=-0.013 Sum_probs=199.4 Q ss_pred ccccccceEEEEEeeeccccccccccceEEEeeccccC----cccceEEEecHHHHHhccCCCcHHHHHHHHHhccCccc Q lcl|NC_020841. 5 LTLPINMLVNVSIEYQAKLLSRDAFNRLLIVGSTAPNG----RATDTGIYTSIDGVKLDYGVEADEYKIAQKYFSQNPKP 80 (367) Q Consensus 5 ~~l~i~~iv~V~i~~~~~~~~~~~fg~~li~~~~~~~~----~~~~~~~yts~~~v~~df~~~s~~ykaA~~~F~Q~p~p 80 (367) |+-++--+--+.++-.+.++...+.+.+-+++..+..+ ........++..+....|+.+.....+...+|.++..+ T Consensus 1 m~~~~~Gv~v~e~~~g~~~i~~~~tav~g~vgta~~~~~~~~~~~~p~~its~~~~~~~~g~~gtl~~al~~~~~ngg~~ 80 (392) T protein:vir:18 1 MSDFHHGTKVIEINDGTRVISTVATAIVGMVWTASDADAETFPLNEPVLITNVQSAIAKAGKKGTLSASLQAIADQSKPV 80 (392) T ss_pred CCCCCCCeEEEEcCCCceeeeccCcceeEEEEeccCCCCcccccccceEeechHHHHhhcCCCcchHHHHHHhhcccCce Confidence 55555554444455556777777777777777654211 11233556787777777888888888899999987654 Q ss_pred ceEEEEec-c----Cc------------cchHHHHHHHHhccc---CcEEEEEEecC-CHHHHHHHHHHhhccCcEEEEE Q lcl|NC_020841. 81 RDLMIATV-T----AL------------TDPLASIGEVAAKTL---GFYAFCFASEV-AAADIQGLAEWAQSNNRMFMTV 139 (367) Q Consensus 81 ~~v~v~~~-~----~~------------~t~~~~l~~~~~~~~---~w~~~~~~~~~-~~~~~~ala~~~ea~~~~~~~~ 139 (367) ..+..... . .. ....+.+.++.+... .-..++.+.+. ......++...++.-. .+++. T Consensus 81 ~~vv~v~~~~~~~~~~~t~~dliG~~~~~~~~tg~~al~~~~~~~~~~p~il~ap~~~~~~v~~~l~~~~~~~~-~~~~~ 159 (392) T protein:vir:18 81 TVVVRVAEGTGDDAEAQTTSNIIGGTDENGKYTGIKALLTAEAVTGVKPRILGVPGLDTQEVATALASVCISLR-AFGYV 159 (392) T ss_pred EEEecccccccccccccchhhheecccccchhhhHHHHHhhhhhhceeehhcccCccchHHHHHHHHHHHhhcC-cEEEE Confidence 43321110 0 00 001111112222111 11112222222 2334445555554333 33333 Q ss_pred EeCch-hhhHHHHHHHHhccccceeecCC----c--------hhHHHHHHHHHHHcccccCcceeeeeeeecCcccc--- Q lcl|NC_020841. 140 MTDDT-EAVTTGNALKELGQYHYCITYHE----D--------YATVGAVAGMALDQRYDKTDGVKTLHLKSLVSVVS--- 203 (367) Q Consensus 140 ~~d~~-~~~~~~~~~~~~~~~~~~~~~~~----~--------~~~~~~~~~~~~~~~~~~~~g~~t~~~k~l~Gv~~--- 203 (367) ...+. ...........++..+..+.+.. + .+..+.++|..+..+...-+ ......|.+.||.. T Consensus 160 d~~~~~~~~~a~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~AG~~a~~d~~~g~-~~spaN~~l~gi~~~~~ 238 (392) T protein:vir:18 160 SAWGCKTISEAMAYRENFSQRELMVIWPDFLAWDTTANATATAYATARALGLRAYIDQTIGW-HKTLSNVGVQGVTGISA 238 (392) T ss_pred ecCCCCCHHHHHHHHhhccCceEEEEeCceeeecccCCceEEechHHHHHHHHHhhhccCCc-eEccCCceeeceeecce Confidence 33222 22222222333444443333221 0 13356667776666533311 23334566666552 Q ss_pred -----cCCCHHHHHHHHhCCceEEEEeeccccceEEEecCEeeCCc----hhhHHHHHHHHHHHHHHHHHHHHHhcCCCC Q lcl|NC_020841. 204 -----TDISQTQAASLKAACINYYSDYGNPDNSLPIFANGHAGGGK----FFDFVMGFDWLRNVIETNVFNGQRLRRLTP 274 (367) Q Consensus 204 -----~~~t~t~~~~l~~~~~n~y~~~~~~~~~~~~~~~G~~~~G~----~iD~~~~~dwl~~~lq~~l~~ll~~~~kip 274 (367) ...+..|++.|..+|+|++... .+ ..+..+.+++++ ||-+.+-.+|++..|+..+...+-. | T Consensus 239 ~~~~~~~~~~~~~~~Ln~~gI~t~~~~----~G-~~~wG~rT~~~d~~~~~i~~rR~~~~i~~~i~~~~~~~v~e----~ 309 (392) T protein:vir:18 239 SVFWDLQASGTDADLLNEAGVTTLVRK----DG-FRFWGNRTCSDDPLFLFENYTRTAQVLADTMAEAHMWAVDK----P 309 (392) T ss_pred ecccccCCCcchhhhhhhcCceEEEcC----CC-EEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhccC----C Confidence 2346789999999999998542 23 356677887773 7889999999999999888876543 6 Q ss_pred cCHhHHHHHHHHHHHHHHHHHhcCceecccccCccccccccccccccceeEEcCchHhCCHHHHhccccCCeEEEEEECc Q lcl|NC_020841. 275 QTDRGMMMIKADIVNGLEEAVKAGLVAAGTWNGAALGEIETYDYLPTGYYVYNESIRDQAQVIREQRIAPPFIILVKGAG 354 (367) Q Consensus 275 y~~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~g~~~~~~~~~~gy~v~~~~~~~~~~~dr~~R~~~~~~~~~~~ag 354 (367) .++.-...|+..++.-|++.++.|.|.. |.+++ +.+..+++++.+++.. +.+.+.... T Consensus 310 n~~~~~~~i~~~i~~~L~~l~~~gal~g--------------------~~v~~-d~~~nt~~~i~~G~~~-~~v~~~p~~ 367 (392) T protein:vir:18 310 ITASLIRDIVDGINAKFRELKSNGYIVD--------------------GECWF-DEESNDKETLKAGKLY-IDYDYTPVP 367 (392) T ss_pred CCHHHHHHHHHHHHHHHHHHHhcCcccc--------------------eEEEE-ecCCCCHHHhhCCeEE-EEEEEEecC Confidence 7899999999999999999999999953 44554 3567899999999884 889999999 Q ss_pred eEEEEEEEEEecC Q lcl|NC_020841. 355 AIHDTDITLIPEA 367 (367) Q Consensus 355 aIh~v~i~~~v~~ 367 (367) .+++|++.+..+- T Consensus 368 p~e~I~~~~~~~~ 380 (392) T protein:vir:18 368 PLESLTLRQRITD 380 (392) T ss_pred CcceEEEEEEEch Confidence 9999999888777 No 15 >protein:vir:1172 Length: 391 # NCBI annotation: hypothetical protein # Family: family:all:115 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490621;genbank:gi:17313241;genbank:GeneID:927300 Probab=99.21 E-value=1.6e-10 Score=74.28 Aligned_cols=330 Identities=11% Similarity=0.016 Sum_probs=195.0 Q ss_pred CcccccccccceEEEEEeeeccccccccccceEEEeecccc-----CcccceEEEecHHHHHhccCCCcHHHHHHHHHhc Q lcl|NC_020841. 1 MAGSLTLPINMLVNVSIEYQAKLLSRDAFNRLLIVGSTAPN-----GRATDTGIYTSIDGVKLDYGVEADEYKIAQKYFS 75 (367) Q Consensus 1 ~~~~~~l~i~~iv~V~i~~~~~~~~~~~fg~~li~~~~~~~-----~~~~~~~~yts~~~v~~df~~~s~~ykaA~~~F~ 75 (367) |+|+ .+.--+-=+.+.-.++++.....+.+.+++..+.. +..+. ...++..+....|+.....+.+...+|. T Consensus 1 M~~~--~~~~GV~v~e~~~~~~~i~~v~tavig~vg~a~~a~~~~~~~~~p-~~v~s~~~~~~~~g~~~tl~~al~~~~~ 77 (391) T protein:vir:11 1 MAAD--QYHHGVRVQEINDGTRPIRTIATAIIGLVATAEDADATAFPLDTP-VLITNVQAAIGKAGTSGTLPASLQAIAD 77 (391) T ss_pred CCCC--cCCCcEEEEECCCCcceecccCCceeEEEEecCCCCCcccccccc-EEEecchhhheecCCCccchhhhhhhhc Confidence 4444 44344332334444567777788888887766521 12223 4556666666678888889999999999 Q ss_pred cCcccceEEEEeccCc--cchH------------HHHHHHHhcccCc---EEEEEEecC-CHHHHHHHHHHhhccCcEEE Q lcl|NC_020841. 76 QNPKPRDLMIATVTAL--TDPL------------ASIGEVAAKTLGF---YAFCFASEV-AAADIQGLAEWAQSNNRMFM 137 (367) Q Consensus 76 Q~p~p~~v~v~~~~~~--~t~~------------~~l~~~~~~~~~w---~~~~~~~~~-~~~~~~ala~~~ea~~~~~~ 137 (367) ++.....+.-...... .+.. ..+.++.+..... -..+..... ......++...++.. +.|. T Consensus 78 ~~g~~~~vv~~~~~~~~~~t~~d~~g~~~a~~~~~g~~a~~~~~~~~~~~p~~~~ap~~~~~~v~~al~~~~~~~-~~~~ 156 (391) T protein:vir:11 78 QANAATVVVRVKPGEDEAATNSAVIGGVSADGKYTGMKALLAAKARLGVVPRILGVPGLDTQPVATALIAIAQQL-RAFA 156 (391) T ss_pred cccceeEEeeecccccccccchhhhcccccccchhhhhhhhhhhhhheeccccccccccccHHHHHHHHHhhccc-ceEE Confidence 8866543332221110 0000 0111111111110 011111111 233444555544333 3343 Q ss_pred EEEeCch-hhhHHHHHHHHhccccceeecCC-----c-------hhHHHHHHHHHHHcccccCcc-eeeeeeeecCcccc Q lcl|NC_020841. 138 TVMTDDT-EAVTTGNALKELGQYHYCITYHE-----D-------YATVGAVAGMALDQRYDKTDG-VKTLHLKSLVSVVS 203 (367) Q Consensus 138 ~~~~d~~-~~~~~~~~~~~~~~~~~~~~~~~-----~-------~~~~~~~~~~~~~~~~~~~~g-~~t~~~k~l~Gv~~ 203 (367) ++-.... ...+.......++..+..+.+.. + .+..+.++|..+..+... | ......|.+.||.. T Consensus 157 i~D~p~~~t~~~a~~~r~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~s~~~ag~~a~~d~~~--g~~~span~~l~gi~~ 234 (391) T protein:vir:11 157 YVSASGCKTKEEATAYRENFAAREAMVIWPDFLTWSTVVNQTVPAPAVAQALGLRARIDQEV--GWHKTLSNVAVNGVTG 234 (391) T ss_pred EEEcCCCCCHHHHHHHhhhcCCceEEEEcCcceecccccCceEEechHHHHHHHHHHhhccC--CcEEccCCceeeceee Confidence 3332221 22222222334444444333321 0 233455666666554322 2 22223456665552 Q ss_pred --------cCCCHHHHHHHHhCCceEEEEeeccccceEEEecCEeeCCc----hhhHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_020841. 204 --------TDISQTQAASLKAACINYYSDYGNPDNSLPIFANGHAGGGK----FFDFVMGFDWLRNVIETNVFNGQRLRR 271 (367) Q Consensus 204 --------~~~t~t~~~~l~~~~~n~y~~~~~~~~~~~~~~~G~~~~G~----~iD~~~~~dwl~~~lq~~l~~ll~~~~ 271 (367) ...++.|.+.|..+|+|.... + .+ ..+..+++++++ ||-+.+-.+|++..|+..+...+-. T Consensus 235 ~~~~~~~~~~~~~~~~~~Ln~~gi~~~~~--~--~G-~~~wG~rT~~~d~~~~~i~vrR~~~~i~~~~~~~~~~~v~e-- 307 (391) T protein:vir:11 235 ISADVFWDLQSPSTDANYLNENEVTTLVQ--E--GG-FRFWGSRTCSDDPLFAFENYTRTAQVLADTIAEAHMWAVDK-- 307 (391) T ss_pred cccccccccCCCcchhhhhhhcCcEEEEc--C--CC-EEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhccC-- Confidence 234678999999999999843 2 23 356677777773 7889999999999999888765543 Q ss_pred CCCcCHhHHHHHHHHHHHHHHHHHhcCceecccccCccccccccccccccceeEEcCchHhCCHHHHhccccCCeEEEEE Q lcl|NC_020841. 272 LTPQTDRGMMMIKADIVNGLEEAVKAGLVAAGTWNGAALGEIETYDYLPTGYYVYNESIRDQAQVIREQRIAPPFIILVK 351 (367) Q Consensus 272 kipy~~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~g~~~~~~~~~~gy~v~~~~~~~~~~~dr~~R~~~~~~~~~~ 351 (367) |-+..=...|+..++.-|+..+++|.|.. |.+.. +.++.+++++.+.+. -+.+.+. T Consensus 308 --~n~~~~~~~i~~~i~~~l~~l~~~g~l~g--------------------~~~~~-~~~~n~~~~i~~G~~-~~~i~~~ 363 (391) T protein:vir:11 308 --PMHPSLVRDILEGVNAKFRELKGLGLIID--------------------AQAWY-DPNVNDKDTLKAGKL-RITYDYT 363 (391) T ss_pred --CCCHHHHHHHHHHHHHHHHHHHhccceec--------------------eEEEE-ecCCCCHHHhhCCeE-EEEEEEE Confidence 56788889999999999999999999853 34444 356788999999988 4999999 Q ss_pred ECceEEEEEEEEEecC Q lcl|NC_020841. 352 GAGAIHDTDITLIPEA 367 (367) Q Consensus 352 ~agaIh~v~i~~~v~~ 367 (367) ....++.|.+.+..+. T Consensus 364 p~~p~e~i~~~~~~~~ 379 (391) T protein:vir:11 364 PVPPLEDLTFFQKITD 379 (391) T ss_pred ecCCcceEEEEEEEch Confidence 9999999999888777 No 16 >protein:vir:2035 Length: 396 # NCBI annotation: gpFI # Family: family:all:115 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046778;genbank:gi:9630349;genbank:GeneID:1261516 Probab=99.21 E-value=1.5e-10 Score=74.43 Aligned_cols=331 Identities=9% Similarity=-0.035 Sum_probs=197.6 Q ss_pred ccccccceEEEEEeeeccccccccccceEEEeeccccC----cccceEEEecHHHHHhccCCCcHHHHHHHHHhccCccc Q lcl|NC_020841. 5 LTLPINMLVNVSIEYQAKLLSRDAFNRLLIVGSTAPNG----RATDTGIYTSIDGVKLDYGVEADEYKIAQKYFSQNPKP 80 (367) Q Consensus 5 ~~l~i~~iv~V~i~~~~~~~~~~~fg~~li~~~~~~~~----~~~~~~~yts~~~v~~df~~~s~~ykaA~~~F~Q~p~p 80 (367) |+-+.-.+-=+.++-.+.++.....+.+.++|..+..+ ........++..+....|+.+...+.+...+|.++-.. T Consensus 1 m~~~~~GV~v~e~~~g~~~i~~v~tav~~~vg~a~~a~~~~~~l~~pvlvts~~~~~~~~g~~~tL~~al~~~~~ngg~~ 80 (396) T protein:vir:20 1 MSDYHHGVQVLEINEGTRVISTVSTAIVGMVCTASDADAETFPLNKPVLITNVQSAISKAGKKGTLAASLQAIADQSKPV 80 (396) T ss_pred CCCCCCCeEEEEcCCCcceeeecCCceeEEEeeeccCCCccccCccCEEeechHHHHhhcccccchhhhhhhhhccCcee Confidence 66665443323344445666667777777777553211 11234567888888778899888888888999887443 Q ss_pred ceEEEEec---cCc--------------------cchHHHHHHHHhcccCcEEEEEE-ecCCHHHHHHHHHHhhccCcEE Q lcl|NC_020841. 81 RDLMIATV---TAL--------------------TDPLASIGEVAAKTLGFYAFCFA-SEVAAADIQGLAEWAQSNNRMF 136 (367) Q Consensus 81 ~~v~v~~~---~~~--------------------~t~~~~l~~~~~~~~~w~~~~~~-~~~~~~~~~ala~~~ea~~~~~ 136 (367) ..+..... ... .+..+++..........-..... .........++...++....++ T Consensus 81 ~~v~~~~~~~~~~~~~~~a~t~~~~~~~~~~~~~~tg~~al~~~~~~~~~~p~i~~ap~~~~~~v~~al~~~~~~~~~~~ 160 (396) T protein:vir:20 81 TVVMRVEDGTGDDEETKLAQTVSNIIGTTDENGQYTGLKAMLAAESVTGVKPRILGVPGLDTKEVAVALASVCQKLRAFG 160 (396) T ss_pred EEEEeccccccccccccccccccccccccccccccchhhhhhhhccccccchhhhhhhhhccHHHHHHHHHHHhcCCcEE Confidence 32221110 000 01111122111111110011111 1223445666666665544333 Q ss_pred EEEEeCchhhhHHHHHHHHhccccceeecCC-----c-------hhHHHHHHHHHHHcccccCcceeeeeeeecCcccc- Q lcl|NC_020841. 137 MTVMTDDTEAVTTGNALKELGQYHYCITYHE-----D-------YATVGAVAGMALDQRYDKTDGVKTLHLKSLVSVVS- 203 (367) Q Consensus 137 ~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~-----~-------~~~~~~~~~~~~~~~~~~~~g~~t~~~k~l~Gv~~- 203 (367) +.-........+.......++..+..+.+.. + ....+.++|..+..+..+- -......|.+.||.. T Consensus 161 ~iD~p~~~~~~~a~~~r~~~~s~~~~~~~P~~~~~d~~~~~~~~~p~s~~~Ag~~a~~d~~~g-~~~spaN~~l~gi~~~ 239 (396) T protein:vir:20 161 YISAWGCKTISEVKAYRQNFSQRELMVIWPDFLAWDTVTSTTATAYATARALGLRAKIDQEQG-WHKTLSNVGVNGVTGI 239 (396) T ss_pred EEecCCCCCHHHHHHHhhCCCCceEEEEcCccccccCcCCcceeechhHHHHHHHHHhhhhcC-cEeccCCceeccceec Confidence 2222222122222222334444443333320 0 2344566666665543331 122334556666642 Q ss_pred -------cCCCHHHHHHHHhCCceEEEEeeccccceEEEecCEeeCCc----hhhHHHHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_020841. 204 -------TDISQTQAASLKAACINYYSDYGNPDNSLPIFANGHAGGGK----FFDFVMGFDWLRNVIETNVFNGQRLRRL 272 (367) Q Consensus 204 -------~~~t~t~~~~l~~~~~n~y~~~~~~~~~~~~~~~G~~~~G~----~iD~~~~~dwl~~~lq~~l~~ll~~~~k 272 (367) ...+.+|++.|..+|+|..... .+ ..+..+.+++++ ||-+.+-.+|+...|+..+...+-. T Consensus 240 ~~~~~~~~~~~~~~~~~Ln~~gi~~~~~~----~G-~~~wG~rT~s~d~~~~~i~~rR~~~~i~~~~~~~~~~~v~e--- 311 (396) T protein:vir:20 240 SASVFWDLQESGTDADLLNESGVTTLIRR----DG-FRFWGNRTCSDDPLFLFENYTRTAQVVADTMAEAHMWAVDK--- 311 (396) T ss_pred ceecccccCCCcchhhhhhhcCcEEEEcC----CC-EEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhccC--- Confidence 2356789999999999998542 23 356677777773 7889999999999999888875543 Q ss_pred CCcCHhHHHHHHHHHHHHHHHHHhcCceecccccCccccccccccccccceeEEcCchHhCCHHHHhccccCCeEEEEEE Q lcl|NC_020841. 273 TPQTDRGMMMIKADIVNGLEEAVKAGLVAAGTWNGAALGEIETYDYLPTGYYVYNESIRDQAQVIREQRIAPPFIILVKG 352 (367) Q Consensus 273 ipy~~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~g~~~~~~~~~~gy~v~~~~~~~~~~~dr~~R~~~~~~~~~~~ 352 (367) |.+..=...|+..++.-|++..+.|.|. ||.+.+. +++.|++++.+++. -+.+.+.. T Consensus 312 -~~~~~~~~~i~~~i~~~L~~l~~~G~l~--------------------g~~v~~d-~~~nt~~~i~~G~~-~~~i~~~p 368 (396) T protein:vir:20 312 -PITATLIRDIVDGINAKFRELKTNGYIV--------------------DATCWFS-EESNDAETLKAGKL-YIDYDYTP 368 (396) T ss_pred -CCCHHHHHHHHHHHHHHHHHHHhCccee--------------------ceEEEEe-cCCCCHHHhhCCEE-EEEEEEEe Confidence 5688888999999999999999999994 3555553 57889999999988 48999999 Q ss_pred CceEEEEEEEEEecC Q lcl|NC_020841. 353 AGAIHDTDITLIPEA 367 (367) Q Consensus 353 agaIh~v~i~~~v~~ 367 (367) ...++.|.+.+..+. T Consensus 369 ~~p~e~i~~~~~~~~ 383 (396) T protein:vir:20 369 VPPLENLTLRQRITD 383 (396) T ss_pred cCCcceEEEEEEEch Confidence 999999998877666 No 17 >protein:vir:98553 Length: 395 # NCBI annotation: gp21 # Family: family:all:115 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958076;genbank:gi:41057373;genbank:GeneID:2744224 Probab=99.21 E-value=2.8e-10 Score=72.98 Aligned_cols=329 Identities=11% Similarity=-0.025 Sum_probs=200.2 Q ss_pred ccccccceEEEEEeeeccccccccccceEEEeecccc-----CcccceEEEecHHHHHhccCCCcHHHHHHHHHhccCcc Q lcl|NC_020841. 5 LTLPINMLVNVSIEYQAKLLSRDAFNRLLIVGSTAPN-----GRATDTGIYTSIDGVKLDYGVEADEYKIAQKYFSQNPK 79 (367) Q Consensus 5 ~~l~i~~iv~V~i~~~~~~~~~~~fg~~li~~~~~~~-----~~~~~~~~yts~~~v~~df~~~s~~ykaA~~~F~Q~p~ 79 (367) |+-++-.+-=+.++-.+.++.....+.+.++|..+-. +..+. ...++..+....|+.+...+.+...+|.++.. T Consensus 1 m~~~~~GV~v~e~~~g~~~v~~v~tav~~~vgta~~~~~~~~p~~~p-v~v~s~~~~~~~~g~~~tl~~al~~~~~~~~~ 79 (395) T protein:vir:98 1 MSDFHHGTQVIEINDGTRVISTVATAVVGMVCTASDADATLFPLNEP-VLITNVQSAIAKAGKKGTLAASLQAIADQSKP 79 (395) T ss_pred CCCCCCCeEEEEcCCCcccccccCcceEEEEeeccCCCccccccccc-eEeechHHhHhhcccccchhhHHHHHhhccCc Confidence 6666555333334445667777788888888765421 12223 45677777777788888888899999999865 Q ss_pred cceEEEEeccC--------------------ccchHHHHHHHHhcccC---cEEEEEEec-CCHHHHHHHHHHhhccCcE Q lcl|NC_020841. 80 PRDLMIATVTA--------------------LTDPLASIGEVAAKTLG---FYAFCFASE-VAAADIQGLAEWAQSNNRM 135 (367) Q Consensus 80 p~~v~v~~~~~--------------------~~t~~~~l~~~~~~~~~---w~~~~~~~~-~~~~~~~ala~~~ea~~~~ 135 (367) +..+....... .....+.+.++.+.... -..++.... .+.....++...++.-. . T Consensus 80 ~~~vv~~~~~~~~~~~~~~a~~~~~i~g~~~~~~~~Tgl~al~~~~~~~~~~p~il~ap~~~~~~v~~al~~~~~~~~-~ 158 (395) T protein:vir:98 80 VTVVVRVEDGTGDDEEAALAQTVSNIIGGTDENGKYTGIKALLTAQAVTGVKPRILGVPGLDTKEVAVALASAAIKLR-A 158 (395) T ss_pred eEEEeeccccccccccccccccccccccccccccchhHHHHHhhhhhhhccchhhcccccccccHHHHHHHHHhhhcC-c Confidence 54333211100 00011122233322221 112222222 23445556666665433 3 Q ss_pred EEEEEeCc-hhhhHHHHHHHHhccccceeecCC----c--------hhHHHHHHHHHHHcccccCcceeeeeeeecCccc Q lcl|NC_020841. 136 FMTVMTDD-TEAVTTGNALKELGQYHYCITYHE----D--------YATVGAVAGMALDQRYDKTDGVKTLHLKSLVSVV 202 (367) Q Consensus 136 ~~~~~~d~-~~~~~~~~~~~~~~~~~~~~~~~~----~--------~~~~~~~~~~~~~~~~~~~~g~~t~~~k~l~Gv~ 202 (367) +.++-... ............++..+..+.+.. + .+..+.++|..+..+...-+ ......|.+.||. T Consensus 159 ~~~~d~p~~~t~~~a~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~AG~~a~~d~~~g~-~~spaN~~i~gi~ 237 (395) T protein:vir:98 159 FAYVSAWGCKTISEAMEYRKNFSQRELMVIWPDFLAWDTVKNTTATAYATARALGLRAYIDQTVGW-HKTLSNVGVQGVT 237 (395) T ss_pred EEEEEcCCCCCHHHHHHHHhccCCceEEEEecceeEecccCCceeeechHHHHHHHHHHhhcccCc-EeccCCceeeccc Confidence 33333222 222232333334444444433321 1 12445666666655433311 1222345555554 Q ss_pred c--------cCCCHHHHHHHHhCCceEEEEeeccccceEEEecCEeeCC----chhhHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_020841. 203 S--------TDISQTQAASLKAACINYYSDYGNPDNSLPIFANGHAGGG----KFFDFVMGFDWLRNVIETNVFNGQRLR 270 (367) Q Consensus 203 ~--------~~~t~t~~~~l~~~~~n~y~~~~~~~~~~~~~~~G~~~~G----~~iD~~~~~dwl~~~lq~~l~~ll~~~ 270 (367) . ...+.+|++.|..+|+|.+... .+ ..+..+.++++ .||-+.+-.+|++..|+..+...+-. T Consensus 238 ~~~~~~~~~~~~~~~~~~~Ln~~gI~~~~~~----~G-~~~wG~rT~s~d~~~~~i~~rR~~~~i~~~i~~~~~~~v~e- 311 (395) T protein:vir:98 238 GISASVFWDLQASGTDADLLNEAGVTTLVRK----DG-FRFWGNRTCSDDPLFLFENYTRTAQVLADTMAEAHMWAVDK- 311 (395) T ss_pred ccceecccccCCCcchHHhhhhcCcEEEEcC----CC-EEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhccC- Confidence 2 2346889999999999999542 23 35667777777 37889999999999999888876543 Q ss_pred CCCCcCHhHHHHHHHHHHHHHHHHHhcCceecccccCccccccccccccccceeEEcCchHhCCHHHHhccccCCeEEEE Q lcl|NC_020841. 271 RLTPQTDRGMMMIKADIVNGLEEAVKAGLVAAGTWNGAALGEIETYDYLPTGYYVYNESIRDQAQVIREQRIAPPFIILV 350 (367) Q Consensus 271 ~kipy~~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~g~~~~~~~~~~gy~v~~~~~~~~~~~dr~~R~~~~~~~~~ 350 (367) |.++.=...|+..++.-|++.+++|.|.. |.+.. +.++.+++++.+++.. +.+.+ T Consensus 312 ---~~~~~~~~~i~~~i~~~L~~l~~~g~l~g--------------------~~v~~-d~~~nt~~~i~~G~~~-~~i~~ 366 (395) T protein:vir:98 312 ---PITATLIRDIVDGINAKFRELKSNGYIVE--------------------GKCWF-DEESNDKETLKAGKLY-IDYDY 366 (395) T ss_pred ---CCCHHHHHHHHHHHHHHHHHHHhCCceec--------------------eEEEE-ecCCCCHHHhhCCeEE-EEEEE Confidence 66788889999999999999999999953 45554 3467889999999884 99999 Q ss_pred EECceEEEEEEEEEecC Q lcl|NC_020841. 351 KGAGAIHDTDITLIPEA 367 (367) Q Consensus 351 ~~agaIh~v~i~~~v~~ 367 (367) .....++.|++.+..+. T Consensus 367 ~p~~p~e~I~~~~~~~~ 383 (395) T protein:vir:98 367 TPVPPLESLTLRQRITD 383 (395) T ss_pred EecCCcceEEEEEEEch Confidence 99999999999887777 No 18 >protein:vir:5711 Length: 396 # NCBI annotation: gpFI # Family: family:all:115 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839870;genbank:gi:30065725;genbank:GeneID:1260618 Probab=99.19 E-value=3.5e-10 Score=72.44 Aligned_cols=328 Identities=10% Similarity=0.006 Sum_probs=200.5 Q ss_pred ccccccceEEEEEeeeccccccccccceEEEeeccccC-----cccceEEEecHHHHHhccCCCcHHHHHHHHHhccCcc Q lcl|NC_020841. 5 LTLPINMLVNVSIEYQAKLLSRDAFNRLLIVGSTAPNG-----RATDTGIYTSIDGVKLDYGVEADEYKIAQKYFSQNPK 79 (367) Q Consensus 5 ~~l~i~~iv~V~i~~~~~~~~~~~fg~~li~~~~~~~~-----~~~~~~~yts~~~v~~df~~~s~~ykaA~~~F~Q~p~ 79 (367) |+-+.-.+.=+.++-.+.++.....+.+.+++.....+ ..+. ...++..+....|+.+...+.+...+|.++.. T Consensus 1 m~~~~~GV~v~e~~~g~~~i~~v~tav~~~vg~a~~~d~~~~~~~~p-v~i~s~~~~~~~~g~~~tl~~al~~~~~~~~~ 79 (396) T protein:vir:57 1 MSDYHHGVQVLEINDGTRVISTVSTAIVGMVCTASDADAETFPLNKP-VLITNVQSAIAKAGKKGTLAASLQAIADQSKP 79 (396) T ss_pred CCCCCCceEEEEcCCCcccccccCCceEEEEEeccCCCcccccCccC-eEeecchhhhhhcccccchHHHHHHhhhcCCc Confidence 66676664434455556788888888888888664321 1233 44567777767788888888888889988754 Q ss_pred cceEEE---------------------EeccCccchHHHHHHHHhcccCc---EEEEEEecC-CHHHHHHHHHHhhccCc Q lcl|NC_020841. 80 PRDLMI---------------------ATVTALTDPLASIGEVAAKTLGF---YAFCFASEV-AAADIQGLAEWAQSNNR 134 (367) Q Consensus 80 p~~v~v---------------------~~~~~~~t~~~~l~~~~~~~~~w---~~~~~~~~~-~~~~~~ala~~~ea~~~ 134 (367) ...+.. +.... ....+.+.++.....-. -..+..... ......++...++.. . T Consensus 80 ~~~vv~~~~~~~~~~~~~~a~t~~~iiG~~~~-~~~~tgl~al~~~~~~~~~~p~i~~ap~~~~~~v~~al~~~~~~~-~ 157 (396) T protein:vir:57 80 VTVVVRVEDGTGDDEETKLAQTVSNIIGTTDE-NGQYTGLKALMGAESVTGVKPRILGVPGLDTKEVAVALASVCQEL-N 157 (396) T ss_pred eeEeeeccccccccccccccccceeeeeeccc-cccchhhhhhhhcccceeEEeccccCcccchhHHHHHHHHHhhhC-c Confidence 433221 11111 11112222332222211 111112221 233445555555533 3 Q ss_pred EEEEEEeCc-hhhhHHHHHHHHhccccceeecCC----c--------hhHHHHHHHHHHHcccccCcceeeeeeeecCcc Q lcl|NC_020841. 135 MFMTVMTDD-TEAVTTGNALKELGQYHYCITYHE----D--------YATVGAVAGMALDQRYDKTDGVKTLHLKSLVSV 201 (367) Q Consensus 135 ~~~~~~~d~-~~~~~~~~~~~~~~~~~~~~~~~~----~--------~~~~~~~~~~~~~~~~~~~~g~~t~~~k~l~Gv 201 (367) .+++..... ............++..+..+.+.. + .+..+.++|..+..+...- -......|.+.|| T Consensus 158 ~~~~~d~p~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~Ag~~a~~d~~~g-~~~spaN~~l~gi 236 (396) T protein:vir:57 158 AFGYISAWGCKTISEVKAYRQNFSQRELMVIWPDFLAWDTVTSTTATAYATARALGLRAKIDQEQG-WHKTLSNVGVNGV 236 (396) T ss_pred eEEEEcCCCCCCHHHHHHHHhccCCceEEEEcceeeeecccCCceeEEehhHHHHHHHHHhhhccC-cEeccCCceeccc Confidence 343333322 122222223334444444333320 0 2345666777666554331 1333445667776 Q ss_pred ccc--------CCCHHHHHHHHhCCceEEEEeeccccceEEEecCEeeCCc----hhhHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_020841. 202 VST--------DISQTQAASLKAACINYYSDYGNPDNSLPIFANGHAGGGK----FFDFVMGFDWLRNVIETNVFNGQRL 269 (367) Q Consensus 202 ~~~--------~~t~t~~~~l~~~~~n~y~~~~~~~~~~~~~~~G~~~~G~----~iD~~~~~dwl~~~lq~~l~~ll~~ 269 (367) ..- ..+.+|++.|..+|+|+.... .++ .+..+.+++++ ||-+.+-.+|++..|+..+...+-. T Consensus 237 ~~~~~~~~~~~~~~~~~~~~Ln~~gi~t~~~~----~G~-~~wG~rT~~~d~~~~~i~vrR~~~~i~~~i~~~~~~~v~e 311 (396) T protein:vir:57 237 TGISASVFWDLQKPGTDADLLNEAGVTTLVRR----DGF-RFWGNRTCSDDPLFLFESYTRTAQVLADTMAEAHMWAIDK 311 (396) T ss_pred cccceecccccCCcchhhhhhhhcCcEEEEcC----CCE-EEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhccC Confidence 532 246789999999999998542 233 56677787773 7888999999999999888875543 Q ss_pred cCCCCcCHhHHHHHHHHHHHHHHHHHhcCceecccccCccccccccccccccceeEEcCchHhCCHHHHhccccCCeEEE Q lcl|NC_020841. 270 RRLTPQTDRGMMMIKADIVNGLEEAVKAGLVAAGTWNGAALGEIETYDYLPTGYYVYNESIRDQAQVIREQRIAPPFIIL 349 (367) Q Consensus 270 ~~kipy~~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~g~~~~~~~~~~gy~v~~~~~~~~~~~dr~~R~~~~~~~~ 349 (367) |-+..=...|+..|+.-|+..+++|.|.. |.+... .+..+++++.+++. -+.+. T Consensus 312 ----~n~~~~~~~i~~~i~~~l~~l~~~gal~g--------------------~~v~~d-~~~n~~~~i~~G~~-~~~v~ 365 (396) T protein:vir:57 312 ----PITATLIRDIIDGINAKFRELKNNGYIVD--------------------GTCWFS-EESNDAETLKAGKL-YIDYD 365 (396) T ss_pred ----CCCHHHHHHHHHHHHHHHHHHHhCCceec--------------------eEEEEe-cCCCCHHHhhCCeE-EEEEE Confidence 56888889999999999999999999953 445543 56788999999988 48999 Q ss_pred EEECceEEEEEEEEEecC Q lcl|NC_020841. 350 VKGAGAIHDTDITLIPEA 367 (367) Q Consensus 350 ~~~agaIh~v~i~~~v~~ 367 (367) +.....++.|.+.+..+. T Consensus 366 ~~p~~p~e~I~~~~~~~~ 383 (396) T protein:vir:57 366 YTPVPPLENLTLRQRITS 383 (396) T ss_pred EEecCCcceEEEEEEEch Confidence 999999999988777666 No 19 >protein:vir:100323 Length: 393 # NCBI annotation: tail sheath protein FI # Family: family:all:115 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655492;genbank:gi:109289960;genbank:GeneID:4157366 Probab=99.14 E-value=1.3e-09 Score=69.23 Aligned_cols=331 Identities=12% Similarity=0.009 Sum_probs=196.9 Q ss_pred ccccccc-cceEEEEEeeeccccccccccceEEEeeccccCc----ccceEEEecHHHHHhccCCCcHHHHHHHHHhccC Q lcl|NC_020841. 3 GSLTLPI-NMLVNVSIEYQAKLLSRDAFNRLLIVGSTAPNGR----ATDTGIYTSIDGVKLDYGVEADEYKIAQKYFSQN 77 (367) Q Consensus 3 ~~~~l~i-~~iv~V~i~~~~~~~~~~~fg~~li~~~~~~~~~----~~~~~~yts~~~v~~df~~~s~~ykaA~~~F~Q~ 77 (367) |+|+.+. .-+--+.++-.+.++.....+.+.++|..+..+. .+.....++..+....|+.....+.+...+|.|. T Consensus 1 m~m~~~~~~GV~v~e~~~g~~~i~~~~tav~~~vgta~~~~~~~~pln~pv~i~s~~~~~~~~g~~g~L~~al~~~~~~~ 80 (393) T protein:vir:10 1 MSILDTYLHGVEVVEVNAGGVTISTAATSVIGVVCTGDQADAETFPLNTPVLITNPLNYLEKAGSTGTLRRTLNSIGSIV 80 (393) T ss_pred CCCCCccCCCeEEEEcCCCcceecccCcceeEEEeeccCcCcccccCccceEecchHHHHHhhCCccchhhhhhhhhccc Confidence 4454443 3433333444456777777777777776542211 1223455777777778898888999999999987 Q ss_pred cccceEEEEeccCc-------------cchHHHHHHHHhcccC---cEEEEEEec-CCHHHHHHHHHHhhccCcEEEEEE Q lcl|NC_020841. 78 PKPRDLMIATVTAL-------------TDPLASIGEVAAKTLG---FYAFCFASE-VAAADIQGLAEWAQSNNRMFMTVM 140 (367) Q Consensus 78 p~p~~v~v~~~~~~-------------~t~~~~l~~~~~~~~~---w~~~~~~~~-~~~~~~~ala~~~ea~~~~~~~~~ 140 (367) .....+........ ....+.+.++...... --.++...+ .+.....++...++.-+.++++.. T Consensus 81 ~~~~~vv~v~~~~~~~~t~~~iig~~~~~~~tgl~al~~~~~~~~~~p~li~apg~~~~~~~~al~~~~~~~~~~~~v~d 160 (393) T protein:vir:10 81 KTPTVIVRVAESDDSDTLTANIVGTQENGKFTGIKALLTAQSTVFVKPKLLCVPQHDNQAVATELLSVAKKLNAFAFISD 160 (393) T ss_pred CceEEEeecccCccccccccccccccccchhhHHHHHHhhhhhcceeeeeeeeccccchHHHHHHHHHhhccCcEEEEEc Confidence 55443322211100 0111223333222111 112223322 245566677777776665555443 Q ss_pred eCchhhhHHHHHHHHhccccceeecC------Cc------hhHHHHHHHHHHHcccccCcc-eeeeeeeecCccccc--- Q lcl|NC_020841. 141 TDDTEAVTTGNALKELGQYHYCITYH------ED------YATVGAVAGMALDQRYDKTDG-VKTLHLKSLVSVVST--- 204 (367) Q Consensus 141 ~d~~~~~~~~~~~~~~~~~~~~~~~~------~~------~~~~~~~~~~~~~~~~~~~~g-~~t~~~k~l~Gv~~~--- 204 (367) ...+...........++..+..+.+. .. ....+.++|..+..+-.. | ......|.+.||..- T Consensus 161 ~~~~t~~~ai~~~~~~~s~~~~~~~P~~~~~d~~~~~~~~~p~s~~~Ag~~a~~d~~~--G~~~spaN~~l~gi~~~~~~ 238 (393) T protein:vir:10 161 NGATTKEQAYTYRQNFSQREGMMIFGDWKSYNTDKKAYDTDYAVARACALQAYIDKTV--GWHKNISNVELDGVTGITKA 238 (393) T ss_pred CCCCCHHHHHHHhhhcCCceEEEEecccccccccCCceeEeehhHHHHHHHHHhhcCC--CcEEccCCceeeceeeccee Confidence 32222222223333344433333222 00 223456666666654322 2 233345566666532 Q ss_pred -----CCCHHHHHHHHhCCceEEEEeeccccceEEEecCEeeCC----chhhHHHHHHHHHHHHHHHHHHHHHhcCCCCc Q lcl|NC_020841. 205 -----DISQTQAASLKAACINYYSDYGNPDNSLPIFANGHAGGG----KFFDFVMGFDWLRNVIETNVFNGQRLRRLTPQ 275 (367) Q Consensus 205 -----~~t~t~~~~l~~~~~n~y~~~~~~~~~~~~~~~G~~~~G----~~iD~~~~~dwl~~~lq~~l~~ll~~~~kipy 275 (367) ..+++|++.|..+|+|++... .++ .+..+.++++ .||-+.+-.+|++..|+..+...+-. |. T Consensus 239 ~~~~~~~~~~~~~~Ln~~gI~t~~~~----~G~-~~wG~rT~s~d~~~~~i~vrR~~~~i~~~i~~~~~~~v~e----~~ 309 (393) T protein:vir:10 239 VEFDINESSTEANYLNEKGITICLNH----NGF-RYWGSRTLATDTRWAFQQSVRTAQIIKETIGAGLAWAVDM----PL 309 (393) T ss_pred cccccCCCcchhHhHhhcCceEEEcC----CCE-EEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhccC----CC Confidence 346789999999999998542 233 3556667666 37888999999999998887765542 66 Q ss_pred CHhHHHHHHHHHHHHHHHHHhcC--ceecccccCccccccccccccccceeEEcCchHhCCHHHHhccccCCeEEEEEEC Q lcl|NC_020841. 276 TDRGMMMIKADIVNGLEEAVKAG--LVAAGTWNGAALGEIETYDYLPTGYYVYNESIRDQAQVIREQRIAPPFIILVKGA 353 (367) Q Consensus 276 ~~~G~~~l~~~v~~vl~~a~~~G--~I~~g~~~~~~~g~~~~~~~~~~gy~v~~~~~~~~~~~dr~~R~~~~~~~~~~~a 353 (367) ++.=...++..++.-|+..++.| .|. |+.+..+ ++.+++|..+.+.. +.+.+... T Consensus 310 ~~~~~~~i~~~i~~~L~~l~~~g~~al~--------------------g~~v~~~--~~nt~~~i~~G~~~-~~i~~~p~ 366 (393) T protein:vir:10 310 TPLRVKTMLEAINNKLRSWASGDDPRIL--------------------GARVWVA--EEITADIIKSGKFV-IKYDYHWI 366 (393) T ss_pred CHHHHHHHHHHHHHHHHHHHhccccccc--------------------cceEEec--CCCCHHHhhCCEEE-EEEEEEec Confidence 88888899999999999888866 332 2445443 35778888888774 89999999 Q ss_pred ceEEEEEEEEEecC Q lcl|NC_020841. 354 GAIHDTDITLIPEA 367 (367) Q Consensus 354 gaIh~v~i~~~v~~ 367 (367) ..++.|++.+..+. T Consensus 367 ~p~e~I~~~~~~~~ 380 (393) T protein:vir:10 367 PSLESLGLEQRVND 380 (393) T ss_pred CCcceEEEEEEEch Confidence 99999999888776 No 20 >protein:vir:102359 Length: 356 # NCBI annotation: XkdK protein # Family: family:all:632 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529565;genbank:gi:90592650;genbank:GeneID:3974491 Probab=99.13 E-value=7.9e-11 Score=75.97 Aligned_cols=333 Identities=15% Similarity=0.116 Sum_probs=192.9 Q ss_pred CcccccccccceEEEEEeee-ccccccccccceEEEeeccccCcccceEEEecHHHHHhccCCCcHHHHHHHHHhccC-- Q lcl|NC_020841. 1 MAGSLTLPINMLVNVSIEYQ-AKLLSRDAFNRLLIVGSTAPNGRATDTGIYTSIDGVKLDYGVEADEYKIAQKYFSQN-- 77 (367) Q Consensus 1 ~~~~~~l~i~~iv~V~i~~~-~~~~~~~~fg~~li~~~~~~~~~~~~~~~yts~~~v~~df~~~s~~ykaA~~~F~Q~-- 77 (367) ||+ || . +++.+.-. ..+..+..-|...++-..+ ...+..|++.+++...+.... .+.....|..+ T Consensus 1 ~~g---lp--~-i~i~f~~~a~ta~~~g~rGiv~~il~d~----~~~~~~~~~~~~v~~~~~~~n--~~~i~~~~~g~~~ 68 (356) T protein:vir:10 1 MAG---LV--N-INIEFKELATSFIQRSKAGIVAIILKDT----TKMYKELTSEDDIPISLSADN--KKYIKYGFVGATD 68 (356) T ss_pred CCC---CC--c-eeEEEeecceeeccCCccceEEEEEecC----CcceeEEeccccchhHHHHHH--HHHHHHHhhcccc Confidence 664 55 1 33333222 2333333335544443332 345788999988866554433 34445556443 Q ss_pred ----cccceEEEEeccCccchHHHHHHHHhcccCcEEEEEEecCCHHHHHHHHHHhhc----cCcEEEEEEeCchhhh-H Q lcl|NC_020841. 78 ----PKPRDLMIATVTALTDPLASIGEVAAKTLGFYAFCFASEVAAADIQGLAEWAQS----NNRMFMTVMTDDTEAV-T 148 (367) Q Consensus 78 ----p~p~~v~v~~~~~~~t~~~~l~~~~~~~~~w~~~~~~~~~~~~~~~ala~~~ea----~~~~~~~~~~d~~~~~-~ 148 (367) ..|.++.++...+.++..+++.+++....|| +++ ...+.+++..++.|+.. .++++-.+.....+.. - T Consensus 69 ~~~~~~p~~~~~~~~~t~~~y~~aL~~le~~~fn~--l~~-~~~d~~~~~~~~a~ikr~r~~~~~~~~~V~~~~~aD~Eg 145 (356) T protein:vir:10 69 NEKVLRPSKVIISTFTEDGKVEDILEELESVEFNY--LCM-PEAIEAEKTKIVTWIKKIREEESTEAKAVLANIKADNEA 145 (356) T ss_pred ccccccceeeeeecccCchhHHHHHHHhcCccceE--EEe-cCCChHHHHHHHHHHHHHHhcCCcEEEEEecCCCCCCce Confidence 2477888887777888999999998776665 443 34677889999999984 3455554443321100 0 Q ss_pred HHHHHHHhccccceeecCCchhHHHHHHHHHHHcccccCcceeeeeeeecCccccc-CCCHHHHHHHHhCCceEEEEeec Q lcl|NC_020841. 149 TGNALKELGQYHYCITYHEDYATVGAVAGMALDQRYDKTDGVKTLHLKSLVSVVST-DISQTQAASLKAACINYYSDYGN 227 (367) Q Consensus 149 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~t~~~k~l~Gv~~~-~~t~t~~~~l~~~~~n~y~~~~~ 227 (367) +.+.... .......+.....+++++|+.++...++ |. -|+.++++... .++.+|++...++|.-.+..-++ T Consensus 146 IInv~n~---~~~~g~~~t~~~~~~~vAG~~Ag~~~n~---S~--T~~~~~~~~~~~~~t~~e~~~ai~~G~lvl~~d~~ 217 (356) T protein:vir:10 146 IINFTEN---VVVDGEEITAEKYTTRVASLIASTPNTQ---SI--TYAPLDEVESIVKIDKASADAKVQAGELILRRLSG 217 (356) T ss_pred eEEeecC---eEecceeechhHHHHHHHHHHhccchhc---cc--cceecCCccccccCCHHHHHHHHhCCeEEEEEEcC Confidence 0000000 0001112233445667778777765554 33 34566666533 58899999999999888755432 Q ss_pred cccceEEEecCEee----CCc----h--hhHHHHHHHHHHHHHHHHHHHHHhcCCCCcCHhHHHHHHHHHHHHHHHHHhc Q lcl|NC_020841. 228 PDNSLPIFANGHAG----GGK----F--FDFVMGFDWLRNVIETNVFNGQRLRRLTPQTDRGMMMIKADIVNGLEEAVKA 297 (367) Q Consensus 228 ~~~~~~~~~~G~~~----~G~----~--iD~~~~~dwl~~~lq~~l~~ll~~~~kipy~~~G~~~l~~~v~~vl~~a~~~ 297 (367) ...+.+|.-+ +.+ | |-.++..|-+.+.++..+-+.++ +|+|=+..|..++.+.++.-+++..+. T Consensus 218 ----~V~I~~~VNSltt~t~~k~~~f~Kirvvr~~D~i~~Di~~~f~~~yi--GKv~N~~dgr~~l~~ai~~y~~~L~~~ 291 (356) T protein:vir:10 218 ----KIRIARGINSLTTLTAEKGEIFQKIKLVDTKDLISKDIKNIYVEKYL--RKCPNTYDNKCLFIVAVQSYLTELAKQ 291 (356) T ss_pred ----eEEEEecCccceecCCCCCcchhhhHHHHHHHHHHHHHHHHHhhccc--cccCCCHHHHHHHHHHHHHHHHHHHhC Confidence 2356667522 333 2 88888888888877765443333 689999999999999999999999999 Q ss_pred CceecccccCccccccccccccccceeEEcCchHhCCHHHHhccccCCeEEEEEECceEEEEEEEEEe Q lcl|NC_020841. 298 GLVAAGTWNGAALGEIETYDYLPTGYYVYNESIRDQAQVIREQRIAPPFIILVKGAGAIHDTDITLIP 365 (367) Q Consensus 298 G~I~~g~~~~~~~g~~~~~~~~~~gy~v~~~~~~~~~~~dr~~R~~~~~~~~~~~agaIh~v~i~~~v 365 (367) |.|.++..-.-+.- -+..-...+|- -+..+.++.-....-+..--++..++.-.|+..+.++++| T Consensus 292 ~~I~~~~~~eid~e-~q~~~~~~~g~--d~~~~~d~~v~~~~~~~~v~~~~~v~~vdamE~iy~ti~v 356 (356) T protein:vir:10 292 ELIDSNFTVEIDLE-KQKEYLEGKKI--AVSKMKENEIKEANTGSNGFYLINLKLVDAMEDINIRVQM 356 (356) T ss_pred CccccCceeEeccc-chHHHhhhccc--cccccccceeecccCCcEEEEEEEEEEEeeeeeEEeEEeC Confidence 99976532110000 00000001110 0111111111111122233377788999999999999999 No 21 >protein:vir:96740 Length: 388 # NCBI annotation: phage tail sheath protein FI-like # Family: family:all:115 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039839;genbank:gi:126010913;genbank:GeneID:5076250 Probab=99.13 E-value=1.6e-09 Score=68.87 Aligned_cols=331 Identities=11% Similarity=0.002 Sum_probs=194.9 Q ss_pred CcccccccccceEEEEEeeeccccccccccceEEEeecccc----CcccceEEEecHHHH---HhccCCCcHHHHHHHHH Q lcl|NC_020841. 1 MAGSLTLPINMLVNVSIEYQAKLLSRDAFNRLLIVGSTAPN----GRATDTGIYTSIDGV---KLDYGVEADEYKIAQKY 73 (367) Q Consensus 1 ~~~~~~l~i~~iv~V~i~~~~~~~~~~~fg~~li~~~~~~~----~~~~~~~~yts~~~v---~~df~~~s~~ykaA~~~ 73 (367) |+- |+-..-.+.=+.++-.++++.....+.+.+++..+.. +..+. ....+..+. ..........+.+...+ T Consensus 1 m~~-~~~~~hGv~v~ev~~g~~~i~~~~tavi~~Vgta~~ad~~~p~~~~-~~i~~~~d~~~~~~~~~~~gtl~~al~~~ 78 (388) T protein:vir:96 1 MPV-IDQFEHNGISIETHEPPPPMGPPGDNVVAWVVTAPDKHADVAFSVP-FRVANTADAQYLDSTGNELGTGWHAASET 78 (388) T ss_pred CCC-CCCCCCceEEEEcCCCcccccccCcceeEEEEecCCCccccccccc-eeeecchhhhhhhccccccccchhhhHhh Confidence 321 2222234333445555677888888888887765321 11122 222333332 22334456678889999 Q ss_pred hccCcccceEEEEe---------------ccCccchHHHHHHHHhcccCcEEEEEEecC--CHHHHHHHHHHhhccCcEE Q lcl|NC_020841. 74 FSQNPKPRDLMIAT---------------VTALTDPLASIGEVAAKTLGFYAFCFASEV--AAADIQGLAEWAQSNNRMF 136 (367) Q Consensus 74 F~Q~p~p~~v~v~~---------------~~~~~t~~~~l~~~~~~~~~w~~~~~~~~~--~~~~~~ala~~~ea~~~~~ 136 (367) |.|...+..+.... .+......+.+.++..... ...++.+.+. ......++...++.-+ .| T Consensus 79 ~~~~~~~~~vv~v~~g~~~~at~a~iig~~~~~tg~~~gl~al~~~~~-~p~il~aPg~s~~~~v~~al~~~~~~~~-~~ 156 (388) T protein:vir:96 79 LKKTSVPQYFIVVPEGADDAATMANIIGGIDPTTGRRTGIAALTECTE-RPTLIGAPGFSQNKAVIDALASMAKRLK-CR 156 (388) T ss_pred hccCCceEEEEEeccccccccccceeeeecccccchhhHHHHhhhccc-ceeEEEeeccccchHHHHHHHHHHhhcC-cE Confidence 99986554333211 1111111123333333222 2233333332 2344556666665433 33 Q ss_pred EEEEeCchhhhHH--HHH-HHH--hccccceeecC------C------chhHHHHHHHHHHHcccccCcceeeeeeeecC Q lcl|NC_020841. 137 MTVMTDDTEAVTT--GNA-LKE--LGQYHYCITYH------E------DYATVGAVAGMALDQRYDKTDGVKTLHLKSLV 199 (367) Q Consensus 137 ~~~~~d~~~~~~~--~~~-~~~--~~~~~~~~~~~------~------~~~~~~~~~~~~~~~~~~~~~g~~t~~~k~l~ 199 (367) .++-......... ... ... +...+..+.+. + ..+..+.++|..+..++-..|.-..+. +. T Consensus 157 ~i~D~p~~~~~~~~~~~~~~~~~~~~s~~~~~~~P~~~~~d~~~~~~~~~p~s~~~AG~~a~~D~~~spaN~~i~---i~ 233 (388) T protein:vir:96 157 AVIDGPSGSTQDAIDLSGLLGGEGTGHDRVYMVDPMPAIYSRKAQGNIYVPPSTIAMGAVAAVKPWESPGNQGVL---IQ 233 (388) T ss_pred EEEeccCCchhHHHHHHhhhhccCcCcceEEEEeCceeeecccCCceeeechHHHHHHHHHhhcCcccccCeeEE---ee Confidence 3333222111111 111 111 12223333332 0 123456777777777765555433332 34 Q ss_pred cccc-----cCCCHHHHHHHHhCCceEEEEeeccccceEEEecCEeeCCchhhHHHHHHHHHHHHHHHHHHHHHhcCCCC Q lcl|NC_020841. 200 SVVS-----TDISQTQAASLKAACINYYSDYGNPDNSLPIFANGHAGGGKFFDFVMGFDWLRNVIETNVFNGQRLRRLTP 274 (367) Q Consensus 200 Gv~~-----~~~t~t~~~~l~~~~~n~y~~~~~~~~~~~~~~~G~~~~G~~iD~~~~~dwl~~~lq~~l~~ll~~~~kip 274 (367) |+.- ...+.+|++.|..+|+|++.++.+.+ ..+..+.+++..||-+.+-.+|++..|+..+...+-. | T Consensus 234 g~~~~~~~~~~~~~~~~~~Ln~~gI~~i~~~~~~G---~~~wG~rT~~~~~i~vrR~~~~i~~si~~~~~~~v~e----p 306 (388) T protein:vir:96 234 DVARVIDYNILDKSTEGDLLNRNGVSYFARTSMGG---FSLIGNRTVTGKFISFVGLEDAIARKLEAASQRAMSK----Q 306 (388) T ss_pred eecccccccccCChhhHHhhhhcCceEEEEecCCc---EEEEcccccCCcceeehhhHHHHHHHHHHHHHHhccC----C Confidence 4431 24477899999999999999886543 2467777888889999999999999999888765442 6 Q ss_pred cCHhHHHHHHHHHHHHHHHHHhcCceecccccCccccccccccccccceeEEcCchHhCCHHHHhccccCCeEEEEEECc Q lcl|NC_020841. 275 QTDRGMMMIKADIVNGLEEAVKAGLVAAGTWNGAALGEIETYDYLPTGYYVYNESIRDQAQVIREQRIAPPFIILVKGAG 354 (367) Q Consensus 275 y~~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~g~~~~~~~~~~gy~v~~~~~~~~~~~dr~~R~~~~~~~~~~~ag 354 (367) .+..=...|+..|+.-|+...+.|.|.. |.+++ +.+..+++|+.+.+. -+.+.+.... T Consensus 307 n~~~~~~~i~~~i~~fL~~l~~~Gal~g--------------------~~~~~-d~~~nt~~~i~~G~~-~~~i~~~p~~ 364 (388) T protein:vir:96 307 LTKSFMEQEIKKINLFMQDLVAAEIIPG--------------------GEVYL-HPTLNTVERYKNGSW-YIVIDYGRYS 364 (388) T ss_pred CCHHHHHHHHHHHHHHHHHHHhCCceee--------------------eEEEE-ecCCCCHHHhhCCEE-EEEEEEEecC Confidence 6888889999999999999999999953 33444 467789999999888 4889999999 Q ss_pred eEEEEEEEEEecC Q lcl|NC_020841. 355 AIHDTDITLIPEA 367 (367) Q Consensus 355 aIh~v~i~~~v~~ 367 (367) .++.|++.+..+. T Consensus 365 pae~I~~~~~~~~ 377 (388) T protein:vir:96 365 PNEHMIFHLNAVD 377 (388) T ss_pred CcceEEEEEEEch Confidence 9999998777766 No 22 >protein:vir:103993 Length: 390 # NCBI annotation: phage tail sheath protein # Family: family:all:115 # MgeID: mge:1665 # MgeName: phi52237 # Cross-refs: genbank:acc:YP_293729;genbank:gi:72537699;genbank:GeneID:3608123 Probab=99.03 E-value=3.8e-09 Score=66.76 Aligned_cols=329 Identities=14% Similarity=0.028 Sum_probs=192.9 Q ss_pred CcccccccccceEEEE-EeeeccccccccccceEEEeecccc----CcccceEEEecHHHHHhccCCCcHHHHHHHHHhc Q lcl|NC_020841. 1 MAGSLTLPINMLVNVS-IEYQAKLLSRDAFNRLLIVGSTAPN----GRATDTGIYTSIDGVKLDYGVEADEYKIAQKYFS 75 (367) Q Consensus 1 ~~~~~~l~i~~iv~V~-i~~~~~~~~~~~fg~~li~~~~~~~----~~~~~~~~yts~~~v~~df~~~s~~ykaA~~~F~ 75 (367) |+=.. + .. |.|. ++-.+.++.......+.+++..+-- ...+.....++..+....|+.....+.+...+|. T Consensus 1 M~~~~-~--~G-v~v~e~~~g~~~i~~~~tav~g~vg~a~~ad~~~~pln~pv~i~s~~~~~~~~g~~gtL~~al~~~~~ 76 (390) T protein:vir:10 1 MPQDY-H--HG-VRVIEINEGGRPIRSVSTAVLGVVCTAADADASAFPLNTPVLLTNVVAALGKAGKKGTLRRTLDAIGK 76 (390) T ss_pred Ccccc-c--CC-eEEEEcCCCcccccccCcceeEEEEcccCcCccccccccceEeccHHHHHhhcCCCceehhhhhhhcc Confidence 33211 1 12 3333 3334567777788888888755311 1112234457777777789998889999999999 Q ss_pred cCcccceEEEEec--cCccch------------HHHHHHHHhcccC---cEEEEEEecCC-HHHHHHHHHHhhccCcEEE Q lcl|NC_020841. 76 QNPKPRDLMIATV--TALTDP------------LASIGEVAAKTLG---FYAFCFASEVA-AADIQGLAEWAQSNNRMFM 137 (367) Q Consensus 76 Q~p~p~~v~v~~~--~~~~t~------------~~~l~~~~~~~~~---w~~~~~~~~~~-~~~~~ala~~~ea~~~~~~ 137 (367) ++..+..++-... +...+. .+.+.++...... --..+.....+ .....++...++. -+.++ T Consensus 77 ~gg~~~~vv~v~~~~~~~~~~~~~ig~~~~~~~~tg~~al~~~~~~~~~~p~il~ap~~~~~~v~~~l~~~a~~-~~~~a 155 (390) T protein:vir:10 77 QTKPLTVVVRVAEGKDADETTSNVIGTVTPDGKYTGIKALLAAQGALGVKPRILAAPGLDTQPVAAALAATAQS-LRAMA 155 (390) T ss_pred ccCceEEEEEecccccccccccccccccccccccchhhhhhhhhhhhcceehhhcccccchHHHHHHHHHhhcc-cceEE Confidence 9876543322211 000000 0111122111111 01111222222 2233334443332 23333 Q ss_pred EEEeCc-hhhhHHHHHHHHhccccceeecC------C------chhHHHHHHHHHHHcccccCcc-eeeeeeeecCcccc Q lcl|NC_020841. 138 TVMTDD-TEAVTTGNALKELGQYHYCITYH------E------DYATVGAVAGMALDQRYDKTDG-VKTLHLKSLVSVVS 203 (367) Q Consensus 138 ~~~~d~-~~~~~~~~~~~~~~~~~~~~~~~------~------~~~~~~~~~~~~~~~~~~~~~g-~~t~~~k~l~Gv~~ 203 (367) ++-... ............++..+..+.+. + ..+..+.++|..+..+... | ......|.+.|+.- T Consensus 156 ivD~p~~~t~~~a~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~Agl~a~~D~~~--g~~~spaN~~l~gi~~ 233 (390) T protein:vir:10 156 YVSASGCKTKEEAAAYRKQFGQREIMVIWPDWLGWDDTTNSTAVIPAPAIAAGLRAKIDNDI--GWHKTISNVVVNGVSG 233 (390) T ss_pred EEecCCCCCHHHHHHHhhccCCceEEEEcCceEeecccCCcccccchHHHHHHHHHHhhcCC--CcEECcCCceeeceee Confidence 333222 22222222223344444333322 1 0233566777777665432 3 23334566666552 Q ss_pred --------cCCCHHHHHHHHhCCceEEEEeeccccceEEEecCEeeCCc----hhhHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_020841. 204 --------TDISQTQAASLKAACINYYSDYGNPDNSLPIFANGHAGGGK----FFDFVMGFDWLRNVIETNVFNGQRLRR 271 (367) Q Consensus 204 --------~~~t~t~~~~l~~~~~n~y~~~~~~~~~~~~~~~G~~~~G~----~iD~~~~~dwl~~~lq~~l~~ll~~~~ 271 (367) ......|.+.|..+|+|...... ++ .+..+.+++++ ||-+.+-.+|++..|+..+...+-. T Consensus 234 ~~~~~~~~~~~~~~~~~~ln~~gi~t~~~~~----G~-~~wG~rT~s~d~~~~~i~~rR~~~~i~~~i~~~~~~~v~e-- 306 (390) T protein:vir:10 234 ISADVSWDLQDPATDAGYLNEHEVTTLVNRN----GF-RFWGERTCSDDPKFAFENYTRTAQVAGDSIAEAQMPVVDG-- 306 (390) T ss_pred cceecccccccccchhhhhhhcCcEEEEcCC----CE-EEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhccC-- Confidence 23456678889999999986532 23 45677777663 7889999999999999888865543 Q ss_pred CCCcCHhHHHHHHHHHHHHHHHHHhcCceecccccCccccccccccccccceeEEcCchHhCCHHHHhccccCCeEEEEE Q lcl|NC_020841. 272 LTPQTDRGMMMIKADIVNGLEEAVKAGLVAAGTWNGAALGEIETYDYLPTGYYVYNESIRDQAQVIREQRIAPPFIILVK 351 (367) Q Consensus 272 kipy~~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~g~~~~~~~~~~gy~v~~~~~~~~~~~dr~~R~~~~~~~~~~ 351 (367) |.+..-...|+..++.-|+..+++|.|. ||.+.+. .++.+++|+.+.+.. +.+.+. T Consensus 307 --~n~~~~~~~i~~~i~~~L~~l~~~g~l~--------------------g~~v~~d-~~~nt~~~i~~G~~~-~~v~~~ 362 (390) T protein:vir:10 307 --PLNPSLARDIVESINGWFRQQVANGYLI--------------------GGSAWID-PEPNTADILASGKAY-IDYDYT 362 (390) T ss_pred --CCCHHHHHHHHHHHHHHHHHHHhCCcee--------------------eeEEEEc-cCCCCHHHhhCCeEE-EEEEEE Confidence 6789999999999999999999999984 3666655 567899999998884 899999 Q ss_pred ECceEEEEEEEEEecC Q lcl|NC_020841. 352 GAGAIHDTDITLIPEA 367 (367) Q Consensus 352 ~agaIh~v~i~~~v~~ 367 (367) ....++.|++.+..+. T Consensus 363 p~~pae~I~~~~~~~~ 378 (390) T protein:vir:10 363 PVPPLENLVLRQRITD 378 (390) T ss_pred ecCCcceEEEEEEEch Confidence 9999999988777666 No 23 >protein:vir:78206 Length: 390 # NCBI annotation: gp33, phage tail sheath protein # Family: family:all:115 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111183;genbank:gi:134288694;genbank:GeneID:4960666 Probab=99.03 E-value=3.8e-09 Score=66.76 Aligned_cols=329 Identities=14% Similarity=0.028 Sum_probs=192.9 Q ss_pred CcccccccccceEEEE-EeeeccccccccccceEEEeecccc----CcccceEEEecHHHHHhccCCCcHHHHHHHHHhc Q lcl|NC_020841. 1 MAGSLTLPINMLVNVS-IEYQAKLLSRDAFNRLLIVGSTAPN----GRATDTGIYTSIDGVKLDYGVEADEYKIAQKYFS 75 (367) Q Consensus 1 ~~~~~~l~i~~iv~V~-i~~~~~~~~~~~fg~~li~~~~~~~----~~~~~~~~yts~~~v~~df~~~s~~ykaA~~~F~ 75 (367) |+=.. + .. |.|. ++-.+.++.......+.+++..+-- ...+.....++..+....|+.....+.+...+|. T Consensus 1 M~~~~-~--~G-v~v~e~~~g~~~i~~~~tav~g~vg~a~~ad~~~~pln~pv~i~s~~~~~~~~g~~gtL~~al~~~~~ 76 (390) T protein:vir:78 1 MPQDY-H--HG-VRVIEINEGGRPIRSVSTAVLGVVCTAADADASAFPLNTPVLLTNVVAALGKAGKKGTLRRTLDAIGK 76 (390) T ss_pred Ccccc-c--CC-eEEEEcCCCcccccccCcceeEEEEcccCcCccccccccceEeccHHHHHhhcCCCceehhhhhhhcc Confidence 33211 1 12 3333 3334567777788888888755311 1112234457777777789998889999999999 Q ss_pred cCcccceEEEEec--cCccch------------HHHHHHHHhcccC---cEEEEEEecCC-HHHHHHHHHHhhccCcEEE Q lcl|NC_020841. 76 QNPKPRDLMIATV--TALTDP------------LASIGEVAAKTLG---FYAFCFASEVA-AADIQGLAEWAQSNNRMFM 137 (367) Q Consensus 76 Q~p~p~~v~v~~~--~~~~t~------------~~~l~~~~~~~~~---w~~~~~~~~~~-~~~~~ala~~~ea~~~~~~ 137 (367) ++..+..++-... +...+. .+.+.++...... --..+.....+ .....++...++. -+.++ T Consensus 77 ~gg~~~~vv~v~~~~~~~~~~~~~ig~~~~~~~~tg~~al~~~~~~~~~~p~il~ap~~~~~~v~~~l~~~a~~-~~~~a 155 (390) T protein:vir:78 77 QTKPLTVVVRVAEGKDADETTSNVIGTVTPDGKYTGIKALLAAQGALGVKPRILAAPGLDTQPVAAALAATAQS-LRAMA 155 (390) T ss_pred ccCceEEEEEecccccccccccccccccccccccchhhhhhhhhhhhcceehhhcccccchHHHHHHHHHhhcc-cceEE Confidence 9876543322211 000000 0111122111111 01111222222 2233334443332 23333 Q ss_pred EEEeCc-hhhhHHHHHHHHhccccceeecC------C------chhHHHHHHHHHHHcccccCcc-eeeeeeeecCcccc Q lcl|NC_020841. 138 TVMTDD-TEAVTTGNALKELGQYHYCITYH------E------DYATVGAVAGMALDQRYDKTDG-VKTLHLKSLVSVVS 203 (367) Q Consensus 138 ~~~~d~-~~~~~~~~~~~~~~~~~~~~~~~------~------~~~~~~~~~~~~~~~~~~~~~g-~~t~~~k~l~Gv~~ 203 (367) ++-... ............++..+..+.+. + ..+..+.++|..+..+... | ......|.+.|+.- T Consensus 156 ivD~p~~~t~~~a~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~Agl~a~~D~~~--g~~~spaN~~l~gi~~ 233 (390) T protein:vir:78 156 YVSASGCKTKEEAAAYRKQFGQREIMVIWPDWLGWDDTTNSTAVIPAPAIAAGLRAKIDNDI--GWHKTISNVVVNGVSG 233 (390) T ss_pred EEecCCCCCHHHHHHHhhccCCceEEEEcCceEeecccCCcccccchHHHHHHHHHHhhcCC--CcEECcCCceeeceee Confidence 333222 22222222223344444333322 1 0233566777777665432 3 23334566666552 Q ss_pred --------cCCCHHHHHHHHhCCceEEEEeeccccceEEEecCEeeCCc----hhhHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_020841. 204 --------TDISQTQAASLKAACINYYSDYGNPDNSLPIFANGHAGGGK----FFDFVMGFDWLRNVIETNVFNGQRLRR 271 (367) Q Consensus 204 --------~~~t~t~~~~l~~~~~n~y~~~~~~~~~~~~~~~G~~~~G~----~iD~~~~~dwl~~~lq~~l~~ll~~~~ 271 (367) ......|.+.|..+|+|...... ++ .+..+.+++++ ||-+.+-.+|++..|+..+...+-. T Consensus 234 ~~~~~~~~~~~~~~~~~~ln~~gi~t~~~~~----G~-~~wG~rT~s~d~~~~~i~~rR~~~~i~~~i~~~~~~~v~e-- 306 (390) T protein:vir:78 234 ISADVSWDLQDPATDAGYLNEHEVTTLVNRN----GF-RFWGERTCSDDPKFAFENYTRTAQVAGDSIAEAQMPVVDG-- 306 (390) T ss_pred cceecccccccccchhhhhhhcCcEEEEcCC----CE-EEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhccC-- Confidence 23456678889999999986532 23 45677777663 7889999999999999888865543 Q ss_pred CCCcCHhHHHHHHHHHHHHHHHHHhcCceecccccCccccccccccccccceeEEcCchHhCCHHHHhccccCCeEEEEE Q lcl|NC_020841. 272 LTPQTDRGMMMIKADIVNGLEEAVKAGLVAAGTWNGAALGEIETYDYLPTGYYVYNESIRDQAQVIREQRIAPPFIILVK 351 (367) Q Consensus 272 kipy~~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~g~~~~~~~~~~gy~v~~~~~~~~~~~dr~~R~~~~~~~~~~ 351 (367) |.+..-...|+..++.-|+..+++|.|. ||.+.+. .++.+++|+.+.+.. +.+.+. T Consensus 307 --~n~~~~~~~i~~~i~~~L~~l~~~g~l~--------------------g~~v~~d-~~~nt~~~i~~G~~~-~~v~~~ 362 (390) T protein:vir:78 307 --PLNPSLARDIVESINGWFRQQVANGYLI--------------------GGSAWID-PEPNTADILASGKAY-IDYDYT 362 (390) T ss_pred --CCCHHHHHHHHHHHHHHHHHHHhCCcee--------------------eeEEEEc-cCCCCHHHhhCCeEE-EEEEEE Confidence 6789999999999999999999999984 3666655 567899999998884 899999 Q ss_pred ECceEEEEEEEEEecC Q lcl|NC_020841. 352 GAGAIHDTDITLIPEA 367 (367) Q Consensus 352 ~agaIh~v~i~~~v~~ 367 (367) ....++.|++.+..+. T Consensus 363 p~~pae~I~~~~~~~~ 378 (390) T protein:vir:78 363 PVPPLENLVLRQRITD 378 (390) T ss_pred ecCCcceEEEEEEEch Confidence 9999999988777666 No 24 >protein:vir:79141 Length: 391 # NCBI annotation: major tail seath protein gpFI # Family: family:all:115 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165276;genbank:gi:145708101;genbank:GeneID:5247152 Probab=99.03 E-value=3.2e-09 Score=67.14 Aligned_cols=329 Identities=11% Similarity=-0.007 Sum_probs=196.5 Q ss_pred CcccccccccceEEEE-EeeeccccccccccceEEEeeccc-----cCcccceEEEecHHHHHhccCCCcHHHHHHHHHh Q lcl|NC_020841. 1 MAGSLTLPINMLVNVS-IEYQAKLLSRDAFNRLLIVGSTAP-----NGRATDTGIYTSIDGVKLDYGVEADEYKIAQKYF 74 (367) Q Consensus 1 ~~~~~~l~i~~iv~V~-i~~~~~~~~~~~fg~~li~~~~~~-----~~~~~~~~~yts~~~v~~df~~~s~~ykaA~~~F 74 (367) |+-. +..- |.|. ++-.+.++.......+.+++..+. .+..+. ...++..+-...|+...-.+.+...+| T Consensus 1 M~~~---~~pG-v~v~e~~~~~~~i~~~~tav~~~vg~a~~a~~~~~p~n~p-v~iss~~~~~~~~g~~gtl~~al~~~~ 75 (391) T protein:vir:79 1 MPTD---YHHG-VRVVELNDGTRPIRTIETAVAGIVCTADDADAATFPLDTP-VLLTNPQAYIGKAGDKGTLAHTLDAIT 75 (391) T ss_pred CCCC---CCCC-eEEEECCCCcccccccCCceEEEEeecccccccccccccC-EEeccHHHHHHhcCCccccchhhhhhh Confidence 4321 2122 3332 333456677777778878776531 122233 466888777777888888888899999 Q ss_pred ccCcccceEEEEeccC--------------ccchHHHHHHHHhcccCcE---EEEE-EecCCHHHHHHHHHHhhccCcEE Q lcl|NC_020841. 75 SQNPKPRDLMIATVTA--------------LTDPLASIGEVAAKTLGFY---AFCF-ASEVAAADIQGLAEWAQSNNRMF 136 (367) Q Consensus 75 ~Q~p~p~~v~v~~~~~--------------~~t~~~~l~~~~~~~~~w~---~~~~-~~~~~~~~~~ala~~~ea~~~~~ 136 (367) .++-.+..++...... .....+.+..+........ .... ..........++...++....+. T Consensus 76 ~~gg~~~~vv~~~~~~~~~~~~~~~~g~~~~~~~~tGl~~l~~~~~~~~~~p~~l~~p~~~~~~v~~al~~~~~~~~~~a 155 (391) T protein:vir:79 76 DQTNPLTVVVRVAGGASEAETTSNLIGTTNAAGRYTGMKALLTARNRFGVAPRILAVPGLDSLPVGTELVTIAQKLRAFA 155 (391) T ss_pred cccccceeeeccccccccccccccccccccchhhhHHHhhhhhhhhhhcccchhhcCCccchhHHHHHHHHHHhhcCcEE Confidence 9986655443322110 0111122222222221110 0111 11223445556666666544333 Q ss_pred EEEEeCchhhhHHHHHHHHhccccceeecC------C------chhHHHHHHHHHHHcccccCcc-eeeeeeeecCcccc Q lcl|NC_020841. 137 MTVMTDDTEAVTTGNALKELGQYHYCITYH------E------DYATVGAVAGMALDQRYDKTDG-VKTLHLKSLVSVVS 203 (367) Q Consensus 137 ~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~------~------~~~~~~~~~~~~~~~~~~~~~g-~~t~~~k~l~Gv~~ 203 (367) +.-....+...........++..+....+. + ..+..+.++|..+..+..+ | ......|.+.||.. T Consensus 156 i~d~p~~~t~~~a~~~~~~~~s~~~a~~~P~~~~~d~~~~~~~~~p~s~~~AG~~a~~D~~~--g~~~spaN~~l~gi~~ 233 (391) T protein:vir:79 156 YLSAYGCQTKEEAVAYRSNFGQREAMVMWPDFVGWDTAANAETTLWATARAVGLRAKIDNDT--GWHKTLSNVAVGGVTG 233 (391) T ss_pred EEECCCCCCHHHHHHHHhccCCceeEEecceeeeecCcCCceeeechHHHHHHHHHHhhhcc--cceeccCCceehhhhc Confidence 332222222222222233334333332222 1 0123456677777665332 3 22333456666542 Q ss_pred --------cCCCHHHHHHHHhCCceEEEEeeccccceEEEecCEeeCCc----hhhHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_020841. 204 --------TDISQTQAASLKAACINYYSDYGNPDNSLPIFANGHAGGGK----FFDFVMGFDWLRNVIETNVFNGQRLRR 271 (367) Q Consensus 204 --------~~~t~t~~~~l~~~~~n~y~~~~~~~~~~~~~~~G~~~~G~----~iD~~~~~dwl~~~lq~~l~~ll~~~~ 271 (367) ...+.++.+.|..+|+|.+... .++ .+..+.+++++ ||-+.+-.+|+...|+..+...+-. T Consensus 234 ~~~~~~~~~~~~~~~~~~Ln~~~I~t~~~~----~G~-~~wG~rT~~~d~~~~~i~~rR~~~~i~~~i~~~~~~~v~e-- 306 (391) T protein:vir:79 234 LSRDVFWDLQDPATDAGYLNANEVTTLVHR----DGY-RFWGSRTCSADPLFAFENYTRTAQVLADTMAEAHMWANDL-- 306 (391) T ss_pred cccccccccccccchhhhhhhcCceEEECC----CcE-EEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhccC-- Confidence 2345567888999999998542 233 46677777774 7989999999999999988876543 Q ss_pred CCCcCHhHHHHHHHHHHHHHHHHHhcCceecccccCccccccccccccccceeEEcCchHhCCHHHHhccccCCeEEEEE Q lcl|NC_020841. 272 LTPQTDRGMMMIKADIVNGLEEAVKAGLVAAGTWNGAALGEIETYDYLPTGYYVYNESIRDQAQVIREQRIAPPFIILVK 351 (367) Q Consensus 272 kipy~~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~g~~~~~~~~~~gy~v~~~~~~~~~~~dr~~R~~~~~~~~~~ 351 (367) |.++.-...|+..++.-|+..+++|.|.. |.+.. ..+..+++++.+.+. -+.+.+. T Consensus 307 --pn~~~~~~~i~~~i~~~l~~l~~~g~l~g--------------------~~v~~-~~~~nt~~~i~~G~~-~~~i~~~ 362 (391) T protein:vir:79 307 --PMTPTLVRDLLEGINAKLRMLTRNGYLLG--------------------GAAWF-DADANSKDTLKAGQL-AIDYDYT 362 (391) T ss_pred --CCCHHHHHHHHHHHHHHHHHHHhCCceec--------------------eEEEE-ecCCCCHHHhhCCEE-EEEEEEE Confidence 67889999999999999999999999953 44444 356788999988887 4889999 Q ss_pred ECceEEEEEEEEEecC Q lcl|NC_020841. 352 GAGAIHDTDITLIPEA 367 (367) Q Consensus 352 ~agaIh~v~i~~~v~~ 367 (367) ....++.|++.+..+. T Consensus 363 p~~p~e~i~~~~~~~~ 378 (391) T protein:vir:79 363 PVPPLENLTFRQRITD 378 (391) T ss_pred ecCCcceEEEEEEEch Confidence 9999999999887777 No 25 >protein:vir:79181 Length: 390 # NCBI annotation: gp30, phage tail sheath protein # Family: family:all:115 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111061;genbank:gi:134288772;genbank:GeneID:4960700 Probab=98.91 E-value=1.5e-08 Score=63.40 Aligned_cols=328 Identities=13% Similarity=0.030 Sum_probs=190.9 Q ss_pred CcccccccccceEEE-EEeeeccccccccccceEEEeecccc-----CcccceEEEecHHHHHhccCCCcHHHHHHHHHh Q lcl|NC_020841. 1 MAGSLTLPINMLVNV-SIEYQAKLLSRDAFNRLLIVGSTAPN-----GRATDTGIYTSIDGVKLDYGVEADEYKIAQKYF 74 (367) Q Consensus 1 ~~~~~~l~i~~iv~V-~i~~~~~~~~~~~fg~~li~~~~~~~-----~~~~~~~~yts~~~v~~df~~~s~~ykaA~~~F 74 (367) |+-.. -.=|.| .++-.+.++.....+.+.+++..+-- ...+. ...++..+...-|+.+...+.+...+| T Consensus 1 M~~~~----~~Gv~v~e~~~~~~~i~~~~tav~~~vg~a~dad~~~~p~n~p-v~its~~~~~~~~g~~~tL~~al~~~~ 75 (390) T protein:vir:79 1 MPQDY----HHGVRVIEINEGGRPIRSVSTAVLGVVCTAADADASAFPLNTP-VLLTNVVAALGKAGKKGTLRRTLDAIG 75 (390) T ss_pred Ccccc----CCCeEEEEcCCCcccccccCCceeEEEEecCCCCccccccccc-eEeecHHHHHHhcCCCccchhhhhhhc Confidence 33222 222334 33444567777777777777755421 12233 344666666556888888888999999 Q ss_pred ccCcccceEEEEec-cCc-------------cchHHHHHHHHhc---ccCcEEEEEEecC-CHHHHHHHHHHhhccCcEE Q lcl|NC_020841. 75 SQNPKPRDLMIATV-TAL-------------TDPLASIGEVAAK---TLGFYAFCFASEV-AAADIQGLAEWAQSNNRMF 136 (367) Q Consensus 75 ~Q~p~p~~v~v~~~-~~~-------------~t~~~~l~~~~~~---~~~w~~~~~~~~~-~~~~~~ala~~~ea~~~~~ 136 (367) .|+.....+..... ... ....+.+.++... .......+.+... ......++...++.. +.+ T Consensus 76 ~~~~~~~~vv~v~~~~~~~~~~~~~ig~~~~~~~~tgl~al~~~~~~~~~~p~il~ap~~~~~~v~~~l~~~a~~~-~~~ 154 (390) T protein:vir:79 76 KQTKPLTVVVRVAEGKDADETTSNVIGTVTPDGKYTGIKALLAAQGALGVKPRILAAPGLDTQPVAAALAATAQSL-RAM 154 (390) T ss_pred ccccceEEEEeeccccccccccceeeecccccccchhhhhhhhhhhhhccccccccCCcccchHHHHHHHHhhhhc-ceE Confidence 99866543322211 000 0011112222221 1111122222222 223344454545433 333 Q ss_pred EEEEeCch-hhhHHHHHHHHhccccceeecC------Cc------hhHHHHHHHHHHHcccccCcc-eeeeeeeecCccc Q lcl|NC_020841. 137 MTVMTDDT-EAVTTGNALKELGQYHYCITYH------ED------YATVGAVAGMALDQRYDKTDG-VKTLHLKSLVSVV 202 (367) Q Consensus 137 ~~~~~d~~-~~~~~~~~~~~~~~~~~~~~~~------~~------~~~~~~~~~~~~~~~~~~~~g-~~t~~~k~l~Gv~ 202 (367) .+.-.... ...........++..+..+.+. +. .+..+.++|..+..+... | ......|.+.|+. T Consensus 155 ai~D~p~~~t~~~a~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~Ag~~a~~D~~~--g~~~spsN~~i~gi~ 232 (390) T protein:vir:79 155 AYVSASGCKTKEEAAAYRRQFGQREIMVIWPDWLGWDDTTNSTAVIPAPAIAAGLRAKIDNDI--GWHKTISNVVVNGVS 232 (390) T ss_pred EEEEccCCCCHHHHHHHhcCCCCceEEEEcCceeecccccCceeEeehHHHHHHHHHhhhccC--CcEEccCCceeeccc Confidence 33332221 2222222233444444443332 00 134556667766665322 2 1222355555654 Q ss_pred c--------cCCCHHHHHHHHhCCceEEEEeeccccceEEEecCEeeCCc----hhhHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_020841. 203 S--------TDISQTQAASLKAACINYYSDYGNPDNSLPIFANGHAGGGK----FFDFVMGFDWLRNVIETNVFNGQRLR 270 (367) Q Consensus 203 ~--------~~~t~t~~~~l~~~~~n~y~~~~~~~~~~~~~~~G~~~~G~----~iD~~~~~dwl~~~lq~~l~~ll~~~ 270 (367) . ...+..|++.|..+|+|..... .++ .+..+.+++++ ||-+.+-.+|++..|+..+...+-. T Consensus 233 ~~~~~~~~~~~~~~~~a~~Ln~~gi~t~~~~----~G~-~~wG~rT~~~d~~~~~i~vrR~~~~i~~~i~~~~~~~v~e- 306 (390) T protein:vir:79 233 GISADVSWDLQDPATDAGYLNEHEVTTLVNR----NGF-RFWGERTCSDDPKFAFENYTRTAQVAADSIAEAQMPVVDG- 306 (390) T ss_pred eeeeeccccccccchhhhhhhhcCcEEEEcC----CCE-EEEeccccCCCcccceeeehhhHHHHHHHHHHHHHHhccC- Confidence 1 2335667888999999998542 223 46677777773 7889999999999999888875543 Q ss_pred CCCCcCHhHHHHHHHHHHHHHHHHHhcCceecccccCccccccccccccccceeEEcCchHhCCHHHHhccccCCeEEEE Q lcl|NC_020841. 271 RLTPQTDRGMMMIKADIVNGLEEAVKAGLVAAGTWNGAALGEIETYDYLPTGYYVYNESIRDQAQVIREQRIAPPFIILV 350 (367) Q Consensus 271 ~kipy~~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~g~~~~~~~~~~gy~v~~~~~~~~~~~dr~~R~~~~~~~~~ 350 (367) |.+..=...|+..++.-|+..+++|.|.. |.+.+. .++.+++|+.+.+.. +.+.+ T Consensus 307 ---~~~~~~~~~i~~~i~~~L~~l~~~gal~g--------------------~~v~~d-~~~nt~~~i~~G~~~-~~i~~ 361 (390) T protein:vir:79 307 ---PLNPSLARDIVESINGWFRQQVANGYLIG--------------------GSAWID-PEPNTADILASGKAY-IDYDY 361 (390) T ss_pred ---CCCHHHHHHHHHHHHHHHHHHHhCCceee--------------------eEEEEe-cCCCCHHHhhCCEEE-EEEEE Confidence 66888889999999999999999999853 556654 567889999998884 88999 Q ss_pred EECceEEEEEEEEEecC Q lcl|NC_020841. 351 KGAGAIHDTDITLIPEA 367 (367) Q Consensus 351 ~~agaIh~v~i~~~v~~ 367 (367) .....++.|++.+..+- T Consensus 362 ~p~~p~e~i~~~~~~~~ 378 (390) T protein:vir:79 362 TPVPPLENLVLRQRITD 378 (390) T ss_pred EecCCcceEEEEEEEch Confidence 99999999988777666 No 26 >protein:vir:10336 Length: 386 # NCBI annotation: ORF39 # Family: family:all:115 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758931;genbank:gi:27311206;genbank:GeneID:956116 Probab=98.84 E-value=2.9e-08 Score=61.91 Aligned_cols=329 Identities=11% Similarity=-0.007 Sum_probs=188.8 Q ss_pred CcccccccccceEEEE-EeeeccccccccccceEEEeecccc----CcccceEEEecHHHHHhccCCCcHHHHHHHHHhc Q lcl|NC_020841. 1 MAGSLTLPINMLVNVS-IEYQAKLLSRDAFNRLLIVGSTAPN----GRATDTGIYTSIDGVKLDYGVEADEYKIAQKYFS 75 (367) Q Consensus 1 ~~~~~~l~i~~iv~V~-i~~~~~~~~~~~fg~~li~~~~~~~----~~~~~~~~yts~~~v~~df~~~s~~ykaA~~~F~ 75 (367) |+-. -.| . |-|. ++-.+.++....-+.+.+++..+.- .........++..+...-|+.....+.+...+|. T Consensus 1 M~~~-~~~--G-v~v~ev~~~~~~i~~v~tav~~~vg~a~~a~~~~~~~~~pv~i~s~~~~~~~~g~~~tl~~a~~~~~~ 76 (386) T protein:vir:10 1 MAEQ-YLH--G-AEVVEIDNGARPIRTAQSGVIGLVGTAPDADATAFPLNTPVLIAGSRREAAKLGAGGTLPQAIDGIFD 76 (386) T ss_pred Cccc-cCC--C-eEEEEcCCCcccccccCcceeEEEEecCCCCCcccccccceEecchHHHHhhcCCCcchhHHHHHHhc Confidence 4422 122 2 3233 2233455666666777777755321 1112234556666665667888889999999999 Q ss_pred cCcccceEEEEeccC---------------ccchHHHHHHHHhcccCcEEEE--EEecCCHHHHHHHHHHhhccCcEE-E Q lcl|NC_020841. 76 QNPKPRDLMIATVTA---------------LTDPLASIGEVAAKTLGFYAFC--FASEVAAADIQGLAEWAQSNNRMF-M 137 (367) Q Consensus 76 Q~p~p~~v~v~~~~~---------------~~t~~~~l~~~~~~~~~w~~~~--~~~~~~~~~~~ala~~~ea~~~~~-~ 137 (367) ++..+..+....... .......+..+...... +.+. +...........+++-++...+++ . T Consensus 77 ~gg~~~~vv~~~~~~~~~~t~~~~ig~~~~~t~~~tgl~~l~~~~~~-~~~~p~i~~ap~~~~~~~v~~~l~~~~~~~~~ 155 (386) T protein:vir:10 77 QTGAVVVVIRVDEGVDSAATQSNVIGKVDADTEQYTGILALLSAENT-VKVQPRILIAPGFSNQKAVADQLVSVADTAAW 155 (386) T ss_pred cCceeEEEeeccccccccccchhhhcccccccchhhhhHHhhhhccc-ccccccccccccccchhHHHHHHHHhhcceEE Confidence 986544332221100 00011122222222111 1110 001111122333333334332222 2 Q ss_pred EEEeCc--hhhhHHHHHHHHhccccceeecC------------CchhHHHHHHHHHHHcccccCcc-eeeeeeeecCccc Q lcl|NC_020841. 138 TVMTDD--TEAVTTGNALKELGQYHYCITYH------------EDYATVGAVAGMALDQRYDKTDG-VKTLHLKSLVSVV 202 (367) Q Consensus 138 ~~~~d~--~~~~~~~~~~~~~~~~~~~~~~~------------~~~~~~~~~~~~~~~~~~~~~~g-~~t~~~k~l~Gv~ 202 (367) ....+. +...........++..+..+.+. ...+..+.++|..+..+... | ......|.+.||. T Consensus 156 ~~~~~~~~~~~~~a~~~~~~~~s~~~~~~~p~~~v~~~~~~~~~~~p~s~~~ag~~a~~D~~~--G~~~spaN~~l~gv~ 233 (386) T protein:vir:10 156 LCHSGWSNTTDAAAITYRELFGSRRCEVVDPWYKVWDVETSAHIIQPPSARHAGVMAKVHNTL--GFWWSNSNQEILGID 233 (386) T ss_pred EEEeCCCCCchHHHHHhhhcccccceEEecCceeeeccccccceeechHHHHHHHHHHhhhcC--CcEEccCCceeeccc Confidence 222221 11111112222333333333221 01133456666666665433 3 2333455666664 Q ss_pred c--------cCCCHHHHHHHHhCCceEEEEeeccccceEEEecCEeeCCc----hhhHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_020841. 203 S--------TDISQTQAASLKAACINYYSDYGNPDNSLPIFANGHAGGGK----FFDFVMGFDWLRNVIETNVFNGQRLR 270 (367) Q Consensus 203 ~--------~~~t~t~~~~l~~~~~n~y~~~~~~~~~~~~~~~G~~~~G~----~iD~~~~~dwl~~~lq~~l~~ll~~~ 270 (367) - ...+..|.+.|..+|+|.... + .+ ..+..+.+++.+ ||-+.+-.+|++..|+..+...+-. T Consensus 234 ~~~~~~~~~~~~~~~~~~~l~~~gi~~~~~--~--~G-~~~wG~rT~~~d~~~~~i~vrR~~~~i~~~~~~~~~~~v~e- 307 (386) T protein:vir:10 234 GLCRPVDFKLDDPTCRANLLNAKEVTTTIQ--Q--NG-FRVWGDRTCSADSKWAFKNVVITNDMIADSLVRNHLWAVDR- 307 (386) T ss_pred ccceecccccccCcchhhhhhhcCcEEEEc--C--CC-EEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhccC- Confidence 2 234678999999999998853 2 23 346677776663 7888889999999999888875543 Q ss_pred CCCCcCHhHHHHHHHHHHHHHHHHHhcCceecccccCccccccccccccccceeEEcCchHhCCHHHHhccccCCeEEEE Q lcl|NC_020841. 271 RLTPQTDRGMMMIKADIVNGLEEAVKAGLVAAGTWNGAALGEIETYDYLPTGYYVYNESIRDQAQVIREQRIAPPFIILV 350 (367) Q Consensus 271 ~kipy~~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~g~~~~~~~~~~gy~v~~~~~~~~~~~dr~~R~~~~~~~~~ 350 (367) |.+..=...|+..++.-|+...++|.|. ||.|.+. .++.+++|+.+++.. +.+.+ T Consensus 308 ---~~~~~~~~~i~~~i~~~L~~l~~~g~l~--------------------g~~v~~d-~~~nt~~~~~~G~~~-~~i~~ 362 (386) T protein:vir:10 308 ---NITKTYVEDVTEGVNNYLRHLKNIGAIA--------------------GGECWVD-PELNSPDQIQQGKVY-FDYDF 362 (386) T ss_pred ---CCCHHHHHHHHHHHHHHHHHHHhCCcee--------------------eeEEEEc-ccCCCHHHhhCCeEE-EEEEE Confidence 6688889999999999999999999984 3667766 678999999999884 99999 Q ss_pred EECceEEEEEEEEEecC Q lcl|NC_020841. 351 KGAGAIHDTDITLIPEA 367 (367) Q Consensus 351 ~~agaIh~v~i~~~v~~ 367 (367) ....-++.+.+.+..+. T Consensus 363 ~p~~p~e~i~~~~~~~~ 379 (386) T protein:vir:10 363 SAYAPAEHITFRSHMVN 379 (386) T ss_pred EecCCceeEEEEEEEeh Confidence 99999999999887777 No 27 >protein:vir:107310 Length: 581 # NCBI annotation: gp123 # Family: family:all:1196 # MgeID: mge:1550 # MgeName: Catera # Cross-refs: genbank:acc:YP_656131;genbank:gi:109393333;genbank:GeneID:4156791 Probab=98.75 E-value=2.6e-08 Score=62.15 Aligned_cols=326 Identities=12% Similarity=0.018 Sum_probs=164.4 Q ss_pred Cccc-cccccc-c------------eEEEEEeeec-cccccccccce-EEE-eecccc-CcccceEEEecHHHHHhcc-- Q lcl|NC_020841. 1 MAGS-LTLPIN-M------------LVNVSIEYQA-KLLSRDAFNRL-LIV-GSTAPN-GRATDTGIYTSIDGVKLDY-- 60 (367) Q Consensus 1 ~~~~-~~l~i~-~------------iv~V~i~~~~-~~~~~~~fg~~-li~-~~~~~~-~~~~~~~~yts~~~v~~df-- 60 (367) +.-. -..+.. + -.++.-++.+ ......+...+ -++ .+.... +.......|...+++..-+ T Consensus 177 ~~~~~~~~~~gsd~~~~~~~~~~~~~~~~~~D~~t~~~~~~g~~~~~~~v~~~~~~~~d~~~~~~v~~~~~~~~~~~~~~ 256 (581) T protein:vir:10 177 NPNSGQVYVLGTDYVVTRVNAGEDGEANTRDDLYTIQRVVDGGHIDPGDIVQLSYRYTDPNYHEVIRFTDPDDIQDFYGP 256 (581) T ss_pred ccccCcceeccccceeeecccCccccccccccceeeeeeecccccccceEEEEEEEeecCCcceeEEeecCcchhhhhhh Confidence 1100 000000 0 0011111100 00000000011 111 011000 1111234444444443211 Q ss_pred ------CCCcHHHHHHHHHhccCcccceEEEEeccC------ccchHHHHHHHHhcccCcEEEEEEecCCHHHHHHHHHH Q lcl|NC_020841. 61 ------GVEADEYKIAQKYFSQNPKPRDLMIATVTA------LTDPLASIGEVAAKTLGFYAFCFASEVAAADIQGLAEW 128 (367) Q Consensus 61 ------~~~s~~ykaA~~~F~Q~p~p~~v~v~~~~~------~~t~~~~l~~~~~~~~~w~~~~~~~~~~~~~~~ala~~ 128 (367) +..++.-+.++..+.. .+.....++.+. ..++.+++++++.+..+ .+++....+.+-+.++..| T Consensus 257 ~~~~~g~~~~~~t~~~~~~~tn--~~~~~l~~gvd~~g~tvt~~dy~~Al~ale~~~~~--~ivv~~t~~~~v~a~l~ah 332 (581) T protein:vir:10 257 AFDEAGNVQSEITLCAQLAITN--GASTILACAVDPEGDTVTMGDYQNALNKFRDEDEI--AIIVAGTGAQPIQALVQQH 332 (581) T ss_pred hhhccCccccchhhhheeeeec--ccceeEEeeccCCCCccchHHHHHHHHHHhcCCce--EEEEeCCCCHHHHHHHHHH Confidence 2233344444433333 333444444432 22466777777764333 3344444444445668888 Q ss_pred hhccC----cEE--EEEEe--CchhhhHHHHHHHHhccccceeecC------Cc----------hhHHHHHHHHHHHccc Q lcl|NC_020841. 129 AQSNN----RMF--MTVMT--DDTEAVTTGNALKELGQYHYCITYH------ED----------YATVGAVAGMALDQRY 184 (367) Q Consensus 129 ~ea~~----~~~--~~~~~--d~~~~~~~~~~~~~~~~~~~~~~~~------~~----------~~~~~~~~~~~~~~~~ 184 (367) ++..+ .+. ..+.- ..............++..|....++ ++ ...+++++|..++. T Consensus 333 v~~~s~~~~~~ravigV~g~~~~~~~~~~~~~a~~~n~~Rvvlv~p~~~~~~g~~~~~~v~lp~y~~AA~vAGl~a~~-- 410 (581) T protein:vir:10 333 VSAQSNNKYERRAILGMDGSVTPVPSATRIANAQSIKDQRVALISPSSFVYYAPELNREVVLGGQFMAAAVAGKSVSA-- 410 (581) T ss_pred HHHHHhccCCcEEEEEecCCCCCccHHHHHHhhccCCCceEEEEecCceeecCcccCceeccchhhHHHHHHHHhhcc-- Confidence 86531 222 22221 1111112223334444444443332 11 11234444544443 Q ss_pred ccCcceeeeeeeecCcccc--cCCCHHHHHHHHhCCceEEEEeeccccceEEEecCEee---CC--chhhHHHHHHHHHH Q lcl|NC_020841. 185 DKTDGVKTLHLKSLVSVVS--TDISQTQAASLKAACINYYSDYGNPDNSLPIFANGHAG---GG--KFFDFVMGFDWLRN 257 (367) Q Consensus 185 ~~~~g~~t~~~k~l~Gv~~--~~~t~t~~~~l~~~~~n~y~~~~~~~~~~~~~~~G~~~---~G--~~iD~~~~~dwl~~ 257 (367) +-...+-||.++|+.. ..++.+|++.|..+|++.+....+.. ..+-+|..+ ++ ..|..++-.|.+.. T Consensus 411 ---~~~~slT~~~i~gi~~l~~~~s~~e~e~ll~~Gv~~l~~~~~~~---v~Iv~gItT~~s~~~~~~i~~iR~~D~v~~ 484 (581) T protein:vir:10 411 ---IAAMPLTRKVIRGFSGPAEVQRDGEKSRESSEGLMVIEKTPRNL---VHVRHGVTTDPTSLHTREWNIIGQQDVMVY 484 (581) T ss_pred ---ccccCcccccccccccccccCCHHHHHHHHhCCeEEEEEecCCe---EEEEeeeecCCCCCcceeeeeehhhhHHHH Confidence 3345667888888873 46899999999999999998654321 234566654 22 46889999999999 Q ss_pred HHHHHHH-HHHHhcCCCCcCHhHHHHHHHHHHHHHHHHHhcCceecccccCccccccccccccccceeEEcCchHhCCHH Q lcl|NC_020841. 258 VIETNVF-NGQRLRRLTPQTDRGMMMIKADIVNGLEEAVKAGLVAAGTWNGAALGEIETYDYLPTGYYVYNESIRDQAQV 336 (367) Q Consensus 258 ~lq~~l~-~ll~~~~kipy~~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~g~~~~~~~~~~gy~v~~~~~~~~~~~ 336 (367) .+++.+. ..|+. + |=++.|.+.|++.+++.|.+..++|+|...... +.+.. T Consensus 485 ~ir~~~~~~~fIG--~-~n~~~~r~~ik~~i~~~L~~l~~~g~I~~~~~~-------------------------~~~~~ 536 (581) T protein:vir:10 485 RIRDYLDADGLIG--M-PIYDTTIVQVKASAEAALVWLVDNNIIRGYRNL-------------------------KARQI 536 (581) T ss_pred HHHHHhhhhcCCC--c-ccCHHHHHHHHHHHHHHHHHHHhcCcccCCccc-------------------------eeeee Confidence 9999986 34553 2 558899999999999999999999999642110 01122 Q ss_pred HHhccccCCeEEEEEECceEEEEEEEE--EecC Q lcl|NC_020841. 337 IREQRIAPPFIILVKGAGAIHDTDITL--IPEA 367 (367) Q Consensus 337 dr~~R~~~~~~~~~~~agaIh~v~i~~--~v~~ 367 (367) ++.. ..--+.|.+....+|+.|.+++ ++|. T Consensus 537 ~~~~-d~v~V~i~v~Pv~~i~~I~vti~~~p~~ 568 (581) T protein:vir:10 537 ERQP-DVIEVRYEWRPAYPLNYIVVRYSIAPET 568 (581) T ss_pred ecCC-CEEEEEEEEEecccceEEEEEEEEecCC Confidence 2221 1224888899999999877754 4554 No 28 >protein:vir:99306 Length: 587 # NCBI annotation: putative major tail sheath protein # Family: family:all:2449 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024479;genbank:gi:48696438;genbank:GeneID:2948034 Probab=98.74 E-value=5.2e-08 Score=60.50 Aligned_cols=326 Identities=13% Similarity=0.029 Sum_probs=159.2 Q ss_pred Ccccccc-----------cccceEEEEEeeecc---------ccccccccc---eEEEeeccccCcccceE------EEe Q lcl|NC_020841. 1 MAGSLTL-----------PINMLVNVSIEYQAK---------LLSRDAFNR---LLIVGSTAPNGRATDTG------IYT 51 (367) Q Consensus 1 ~~~~~~l-----------~i~~iv~V~i~~~~~---------~~~~~~fg~---~li~~~~~~~~~~~~~~------~yt 51 (367) ++-+++- .|+.+++++-...+. .-...+|.. ..++... ..+...+. .++ T Consensus 191 ~~yrL~~g~~~~~~~~~~~i~~~~~~tAky~~~~~~~i~~~~~~~~~~~~v~~~~~~v~a~--~~D~~~~~~~~~~~~~~ 268 (587) T protein:vir:99 191 KSYDLTGGAYDYTNAIITDINQLPDFEAKLSPFGDKNLESSKLDKIENANIKDKAVYVKAV--FGDLEKQTAYNGIVSFE 268 (587) T ss_pred EEEEecCCchHHHHHHHhhhccccceeEEeeccCCceeEeecccccccceeeeeeeeeehh--ccceeeecccceeeeee Confidence 1111100 111222222111000 000001110 0000000 00000000 001 Q ss_pred cHHHH---HhccCCCcHHHHHHHHHhccC----cccceEEEEecc--CccchHHHHHHHHhcccCcEEEEEEecCCHHHH Q lcl|NC_020841. 52 SIDGV---KLDYGVEADEYKIAQKYFSQN----PKPRDLMIATVT--ALTDPLASIGEVAAKTLGFYAFCFASEVAAADI 122 (367) Q Consensus 52 s~~~v---~~df~~~s~~ykaA~~~F~Q~----p~p~~v~v~~~~--~~~t~~~~l~~~~~~~~~w~~~~~~~~~~~~~~ 122 (367) .+.+. ..........+.+....+.-. +.+.....|+.+ ...+..+++++++.. +|..++ ....+...+ T Consensus 269 ~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~t~LtGG~dG~~~~sy~~al~ale~~--~~~~i~-~~t~d~~i~ 345 (587) T protein:vir:99 269 QLNAEGEVPSNVEVEAGEESATVTATSPIKTIEPFELTKLKGGTNGEPPATWADKLDKFAHE--GGYYIV-PLSSKQSVH 345 (587) T ss_pred ecccccchhhhhhhhhccccceeeeeccccceecccceeeecCCCCCccccHHHHHHHHhhC--CcEEEE-ecCCCHHHH Confidence 11000 000000000111111111111 112222334332 233567788888764 455543 334455666 Q ss_pred HHHHHHhhcc---Cc-EEEEEEeCc-hhhhHHHHHHHHhccccceeecC--------------CchhHHHHHHHHHHHcc Q lcl|NC_020841. 123 QGLAEWAQSN---NR-MFMTVMTDD-TEAVTTGNALKELGQYHYCITYH--------------EDYATVGAVAGMALDQR 183 (367) Q Consensus 123 ~ala~~~ea~---~~-~~~~~~~d~-~~~~~~~~~~~~~~~~~~~~~~~--------------~~~~~~~~~~~~~~~~~ 183 (367) .++..|++.. .+ +..++.... ....+.....+.+++.+.....+ +....+++++|..++.. T Consensus 346 a~l~a~vk~~r~~g~~~~aVlg~~~~~~~~~~~~~a~~~n~e~vi~v~~~~~~~~~dg~~~~~~~~~~aa~vAGl~Ag~~ 425 (587) T protein:vir:99 346 AEVASFVKERSDAGEPMRAIVGGGFNESKEQLFGRQASLSNPRVSLVANSGTFVMDDGRKNHVPAYMVAVALGGLASGLE 425 (587) T ss_pred HHHHHHHHHHHhCCCcEEEEecCCCCCCHHHHHHHhhhcCCCcEEEEeccceEecCCCceeeechHHHHHHHHHHHhcCc Confidence 7799998753 23 333332221 12222333444555544332221 11223556677777665 Q ss_pred cccCcceeeeeeeecC--cccccCCCHHHHHHHHhCCceEEEEeeccccceEEEecCEeeC----C-ch--hhHHHHHHH Q lcl|NC_020841. 184 YDKTDGVKTLHLKSLV--SVVSTDISQTQAASLKAACINYYSDYGNPDNSLPIFANGHAGG----G-KF--FDFVMGFDW 254 (367) Q Consensus 184 ~~~~~g~~t~~~k~l~--Gv~~~~~t~t~~~~l~~~~~n~y~~~~~~~~~~~~~~~G~~~~----G-~~--iD~~~~~dw 254 (367) ....+ | ||.++ ++. ..++.+|++.+..+|++.+....+........-+|.++- + .| |-.++-.|. T Consensus 426 ~~~Sl---T--~~~i~~~~v~-~~~t~~e~e~li~~Gvl~l~~~~~~~~~~vriv~~ItT~t~~~~~~~~~i~viRv~D~ 499 (587) T protein:vir:99 426 IGESI---T--FKPLRVSSLD-QIYESIDLDELNENGIISIEFVRNRTNTFFRIVDDVTTFNDKSDPVKAEMAVGEANDF 499 (587) T ss_pred hhcCc---c--ceeeeccccc-ccCCHHHHHHHHhCCeEEEEEecCCcceEEEEeeceeeccCCCCchhhhhhhhhhHHH Confidence 55433 3 33443 443 378999999999999999876554433222333555432 1 25 779999999 Q ss_pred HHHHHHHHHHHHHHhcCCCCcCHhHHHHHHHHHHHHHHHHHhcCceecccccCccccccccccccccceeEEcCchHhCC Q lcl|NC_020841. 255 LRNVIETNVFNGQRLRRLTPQTDRGMMMIKADIVNGLEEAVKAGLVAAGTWNGAALGEIETYDYLPTGYYVYNESIRDQA 334 (367) Q Consensus 255 l~~~lq~~l~~ll~~~~kipy~~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~g~~~~~~~~~~gy~v~~~~~~~~~ 334 (367) +...++..+-+.+.-. |=++.|...|++.+.+.|++..+.|.|..... ++ +.+. + T Consensus 500 i~~di~~~~~~~yiGk---~Nn~~~r~~i~~~i~~~L~~l~~~gaI~~~~~--~d---------------v~v~-~---- 554 (587) T protein:vir:99 500 LVSELKVQLEDQFIGT---RTINTSASIIKDFIQSYLGRKKRDNEIQDFPA--ED---------------VQVI-V---- 554 (587) T ss_pred HHHHHHHHHHhhCCcc---ccchHHHHHHHHHHHHHHHHHHhCCcccCCCc--cc---------------eEEE-e---- Confidence 9999998888777653 44788999999999999999999999964211 10 1111 0 Q ss_pred HHHHhccccCCeEEEEEECceEEEEEEEEEecC Q lcl|NC_020841. 335 QVIREQRIAPPFIILVKGAGAIHDTDITLIPEA 367 (367) Q Consensus 335 ~~dr~~R~~~~~~~~~~~agaIh~v~i~~~v~~ 367 (367) ..| |. -+++.+...-+|++|.+++.+.. T Consensus 555 ~~d---~~--~v~~~v~Pv~~mekIy~tv~~~~ 582 (587) T protein:vir:99 555 EGN---EA--RISMTVYPIRSFKKISVSLVYKQ 582 (587) T ss_pred cCC---EE--EEEEEEEEcccceEEEEEEEEEe Confidence 111 22 37888999999999999998866 No 29 >protein:vir:102957 Length: 437 # NCBI annotation: sheath tail protein # Family: family:all:632 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945292;genbank:gi:39653727;uniprot:Q708M0;genbank:GeneID:2672871 Probab=98.70 E-value=4e-08 Score=61.11 Aligned_cols=311 Identities=10% Similarity=0.015 Sum_probs=156.9 Q ss_pred CcccccccccceEEEEEe--eeccccccccccceEEEeeccccCcccce--E-----------EEecHHHHHhccCCCcH Q lcl|NC_020841. 1 MAGSLTLPINMLVNVSIE--YQAKLLSRDAFNRLLIVGSTAPNGRATDT--G-----------IYTSIDGVKLDYGVEAD 65 (367) Q Consensus 1 ~~~~~~l~i~~iv~V~i~--~~~~~~~~~~fg~~li~~~~~~~~~~~~~--~-----------~yts~~~v~~df~~~s~ 65 (367) ...++.-.- --.+++. +...+..+...|+-+-+.-.+.......+ . ++...+++.. . T Consensus 86 ~~~R~~~g~--~a~~tl~~~~~~~A~~~G~~gn~i~v~v~~~~~d~~~~~v~~~~~~~~~d~~~v~~~~~~~~------n 157 (437) T protein:vir:10 86 LLYRLNTGE--KANVSLSDNVTAQAKYSGVRGNDITVTVKTNVDDPSSFDVVTFLDTVVMDLQTVKVLADLKN------N 157 (437) T ss_pred EEEECCCCc--eeeEeeccceEEEeccCCcccceeEEEEeeccCCccceEEEEecCcceeeeeehhhhhhhhh------h Confidence 111110000 0000000 00011111111211111111100111111 1 1111222110 0 Q ss_pred HHHHHHHHhccCccc-ceE-EEEecc---CccchHHHHHHHHhcccCcEEEEEEecCCHHHHHHHHHHhhcc----CcEE Q lcl|NC_020841. 66 EYKIAQKYFSQNPKP-RDL-MIATVT---ALTDPLASIGEVAAKTLGFYAFCFASEVAAADIQGLAEWAQSN----NRMF 136 (367) Q Consensus 66 ~ykaA~~~F~Q~p~p-~~v-~v~~~~---~~~t~~~~l~~~~~~~~~w~~~~~~~~~~~~~~~ala~~~ea~----~~~~ 136 (367) .|-.+.. ...+.+ ... ..++.+ +.+++.+++.+++... |.++++ ...+.+.+.++..|++.. ++++ T Consensus 158 ~~v~~~~--~~~l~~~a~~~LtGG~dg~~t~~dy~~al~~le~~~--~n~l~~-~~~d~~~~t~~~~~ik~~r~~~g~~~ 232 (437) T protein:vir:10 158 ALVEFSG--TGELQPVAGAKLTGGTDGAISTQDYLEYFKALETVE--FNYMAL-PVEDASIKKAAINFIKRMREDEGLGA 232 (437) T ss_pred ccccccc--ccccccccceeeeccccCCCChhHHHHHHHHhccCc--ceEEEe-cCCChhHHHHHHHHHHHHHhccCceE Confidence 1111100 001111 111 222222 2234677788887654 444443 445778899999998742 4455 Q ss_pred EEEEeCchhhhHHHHHHHHhc-cccceeec----CCchhHHHHHHHHHHHcccccCcceeeeeeeecCccc-c-cCCCHH Q lcl|NC_020841. 137 MTVMTDDTEAVTTGNALKELG-QYHYCITY----HEDYATVGAVAGMALDQRYDKTDGVKTLHLKSLVSVV-S-TDISQT 209 (367) Q Consensus 137 ~~~~~d~~~~~~~~~~~~~~~-~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~g~~t~~~k~l~Gv~-~-~~~t~t 209 (367) ..+...... +. .... ........ +.....++.++|..++...++ ..-||.++|+. . ..++.+ T Consensus 233 ~~V~~~~~~--d~----e~Iin~~n~~~~~~~~~~~~~~~~a~vAG~~Ag~~~~~-----S~t~~~~~~~~~v~~~~t~~ 301 (437) T protein:vir:10 233 QLVVADSDA--DS----EAVINVKNGVILSDKTVIDKTKATVWVAAASANAGVEK-----SLTYEKYEDSVDVVGRLSHT 301 (437) T ss_pred EEEeCCCCC--CC----ceEEEeecceeecCcceechhhHHHHHHHHhccCcccc-----CccccccCCcccccccCCHH Confidence 444332211 00 0000 00111111 223344566677776654443 33477888874 3 478999 Q ss_pred HHHHHHhCCceEEEEeeccccceEEEecCEee--------CCc--hhhHHHHHHHHHHHHHHHHHHHHHhcCCCCcCHhH Q lcl|NC_020841. 210 QAASLKAACINYYSDYGNPDNSLPIFANGHAG--------GGK--FFDFVMGFDWLRNVIETNVFNGQRLRRLTPQTDRG 279 (367) Q Consensus 210 ~~~~l~~~~~n~y~~~~~~~~~~~~~~~G~~~--------~G~--~iD~~~~~dwl~~~lq~~l~~ll~~~~kipy~~~G 279 (367) |++.+..+|...+...++ .....+|..+ +-+ .|-.++-.|.+...++..+-+.++ .|+|=+..| T Consensus 302 e~~~~i~~G~~vl~~~~~----~v~i~~gInTltt~~~~~~~~~~ki~vir~~D~i~~di~~~~~~~yi--Gk~~N~~~~ 375 (437) T protein:vir:10 302 ETEDALLKGQFVFTARRG----RAVVEQDINSHVSFTIEKNQDFRKNRILRTLDDIVNDTRYAFSEYFL--GKVSNNEDG 375 (437) T ss_pred HHHHHHhCCcEEEEEeCC----eEEEEEccccccccCCCCCchhhhhhHHHHHHHHHHHHHHHHHhccc--cccCCCHHH Confidence 999999999988865432 2345566532 112 466888888888888876666554 478888999 Q ss_pred HHHHHHHHHHHHHHHHhcCceecccccCccccccccccccccceeEEcCchHhCCHHHHhccccCCeEEEEEECceEEEE Q lcl|NC_020841. 280 MMMIKADIVNGLEEAVKAGLVAAGTWNGAALGEIETYDYLPTGYYVYNESIRDQAQVIREQRIAPPFIILVKGAGAIHDT 359 (367) Q Consensus 280 ~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~g~~~~~~~~~~gy~v~~~~~~~~~~~dr~~R~~~~~~~~~~~agaIh~v 359 (367) ...+++.+...|++..+.|.|.+....+ +.+ .+ .+ .+..--+++.++.-.++..+ T Consensus 376 r~~~~~~i~~yl~~l~~~g~I~~~~~~d-----------------~~v---~~--~~---~~~~v~v~~~v~~~dame~i 430 (437) T protein:vir:10 376 RQAFKANRIRYFKDLEARGAIEDFKVED-----------------IEV---LR--GE---LKESVVVNVKVKPVDSMEKL 430 (437) T ss_pred HHHHHHHHHHHHHHHHhCCCccCCCcee-----------------EEe---ec--CC---CCCEEEEEEEEEEeeeeeeE Confidence 9999999999999999999997543211 110 00 01 11223388999999999999 Q ss_pred EEEEEec Q lcl|NC_020841. 360 DITLIPE 366 (367) Q Consensus 360 ~i~~~v~ 366 (367) .++++|| T Consensus 431 y~ti~v~ 437 (437) T protein:vir:10 431 YMTVTVE 437 (437) T ss_pred EEEEEec Confidence 9999999 No 30 >protein:vir:95741 Length: 587 # NCBI annotation: ORF011 # Family: family:all:2449 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240910;genbank:gi:66394960;genbank:GeneID:5132682 Probab=98.69 E-value=5e-08 Score=60.59 Aligned_cols=327 Identities=13% Similarity=0.011 Sum_probs=160.0 Q ss_pred Ccccccc-----------cccceEEEEEeee----------ccccccccccceE--EEeeccccC--cccceEEEecHHH Q lcl|NC_020841. 1 MAGSLTL-----------PINMLVNVSIEYQ----------AKLLSRDAFNRLL--IVGSTAPNG--RATDTGIYTSIDG 55 (367) Q Consensus 1 ~~~~~~l-----------~i~~iv~V~i~~~----------~~~~~~~~fg~~l--i~~~~~~~~--~~~~~~~yts~~~ 55 (367) ++-+++- .|+.+++++.... ..+ ...+|..-- .+......+ ..+.++.+..... T Consensus 191 ~~yrL~~g~~~~~~~~~~~in~~~~~tAky~g~~~~~i~~~~~~-~~~~~~v~~~~~~v~a~~~d~~~~~~~~~~v~~~~ 269 (587) T protein:vir:95 191 KSYDLTGGAYDYTNAIITDINQLPDFEAKLSPFGDKNLESSKLD-KIENANIKDKAVYVKAVFGDLEKQTAYNGIVSFEQ 269 (587) T ss_pred EEEEecCCchHHHHHHHHhhccccceEEEEecccCceeEEeecC-cccccceehhhhhhhhhhcceeeeeeceeeeeeec Confidence 1111110 1111111111110 000 000111000 000000000 0000011111100 Q ss_pred H------HhccCCCcHHHHHHHHHhccC----cccceEEEEecc--CccchHHHHHHHHhcccCcEEEEEEecCCHHHHH Q lcl|NC_020841. 56 V------KLDYGVEADEYKIAQKYFSQN----PKPRDLMIATVT--ALTDPLASIGEVAAKTLGFYAFCFASEVAAADIQ 123 (367) Q Consensus 56 v------~~df~~~s~~ykaA~~~F~Q~----p~p~~v~v~~~~--~~~t~~~~l~~~~~~~~~w~~~~~~~~~~~~~~~ 123 (367) + ...+......+.++...+.-. +.+.....|+.+ ...+..+++++++.. +|..++ ....+...+. T Consensus 270 ~~g~~~~~~~~~~~~~~~~a~~~~~~~~~~~a~~~~t~LtGG~dG~~~~~y~~~l~ale~~--~~~~i~-~~t~d~~v~a 346 (587) T protein:vir:95 270 LNAEGEVPSNVEVEAGEESATVTATSPIKTIEPFELTKLKGGTNGEPPATWADKLDKFAHE--GGYYIV-PLSSKQSVHA 346 (587) T ss_pred ccccceeccchhhhhcccchheeccccccceeccceeeeecCCCCCCcccHHHHHHHHHhC--CcEEEE-ecCCCHHHHH Confidence 0 000011111122222222211 111222334332 233567788888764 455553 3334556667 Q ss_pred HHHHHhhcc---CcEE-EEEEeCc-hhhhHHHHHHHHhccccceeecCC--------------chhHHHHHHHHHHHccc Q lcl|NC_020841. 124 GLAEWAQSN---NRMF-MTVMTDD-TEAVTTGNALKELGQYHYCITYHE--------------DYATVGAVAGMALDQRY 184 (367) Q Consensus 124 ala~~~ea~---~~~~-~~~~~d~-~~~~~~~~~~~~~~~~~~~~~~~~--------------~~~~~~~~~~~~~~~~~ 184 (367) ++..|++.. .++. .++.... ....+.....+.+++.+.....++ ....+++++|..++... T Consensus 347 ~l~a~vk~~~~~g~~~~aVvg~~~~~~~~~~~~~a~~~n~ervi~v~~~~~~~~~dg~~~~~~~~~~aa~vAGl~Ag~~~ 426 (587) T protein:vir:95 347 EVASFVKERSDAGEPMRAIVGGGFNESKEQLFGRQESLSNPRVSLVANSGTFVMDDGRKNHVPAYMVAVALGGLASGLEI 426 (587) T ss_pred HHHHHHHHHHhCCCcEEEEEcCCCCCCHHHHHHHHhhcCCCcEEEecccceEecCCCceeeechHHHHHHHHHHHhcCch Confidence 799998643 3333 3332221 122233334445555544333221 12234566777776655 Q ss_pred ccCcceeeeeeeecC--cccccCCCHHHHHHHHhCCceEEEEeeccccceEEEecCEee-----CCch--hhHHHHHHHH Q lcl|NC_020841. 185 DKTDGVKTLHLKSLV--SVVSTDISQTQAASLKAACINYYSDYGNPDNSLPIFANGHAG-----GGKF--FDFVMGFDWL 255 (367) Q Consensus 185 ~~~~g~~t~~~k~l~--Gv~~~~~t~t~~~~l~~~~~n~y~~~~~~~~~~~~~~~G~~~-----~G~~--iD~~~~~dwl 255 (367) ...+ | ||.++ ++. ..++.+|++.+..+|++......+........-+|.++ +-.| |-.++-.|.+ T Consensus 427 ~~Sl---T--~~~i~~~~v~-~~~t~~e~e~ai~~Gvl~l~~~~~~~~~~vriv~~itT~t~~~d~~~~~i~viRv~D~i 500 (587) T protein:vir:95 427 GESI---T--FKPLRVSSLD-QIYESIDLDELNENGIISIEFVRNRTNTFFRIVDDVTTFNDKSDPVKAEMAVGEANDFL 500 (587) T ss_pred hcCc---c--ceeeeccccc-ccCCHHHHHHHHhCCeEEEEEecCCcceEEEEeecceeccCCCCcchhhhhhhhhHHHH Confidence 5433 2 33433 443 37899999999999999987654433322223345443 1135 7799999999 Q ss_pred HHHHHHHHHHHHHhcCCCCcCHhHHHHHHHHHHHHHHHHHhcCceecccccCccccccccccccccceeEEcCchHhCCH Q lcl|NC_020841. 256 RNVIETNVFNGQRLRRLTPQTDRGMMMIKADIVNGLEEAVKAGLVAAGTWNGAALGEIETYDYLPTGYYVYNESIRDQAQ 335 (367) Q Consensus 256 ~~~lq~~l~~ll~~~~kipy~~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~g~~~~~~~~~~gy~v~~~~~~~~~~ 335 (367) ...++..+-+.+.-. |=++.|...|++.+...|++..+.|.|..... +. +.+. .. T Consensus 501 ~~dir~~~~~~~iGk---~nn~~~r~~v~~~i~~~L~~l~~~gaI~~~~~--~d---------------v~v~-----~~ 555 (587) T protein:vir:95 501 VSELKVQLEDQFIGT---RTINTSASIIKDFIQSYLGRKKRDNEIQDFPA--ED---------------VQVI-----VE 555 (587) T ss_pred HHHHHHHHHhhCCcc---ccchHHHHHHHHHHHHHHHHHHhCCcccCCCc--cc---------------eEEE-----ec Confidence 999998887776553 45788999999999999999999999964311 00 1110 01 Q ss_pred HHHhccccCCeEEEEEECceEEEEEEEEEecC Q lcl|NC_020841. 336 VIREQRIAPPFIILVKGAGAIHDTDITLIPEA 367 (367) Q Consensus 336 ~dr~~R~~~~~~~~~~~agaIh~v~i~~~v~~ 367 (367) .| | --++|.+...-++++|.+++.+.. T Consensus 556 ~d---~--~~v~~~v~Pv~~mekI~vt~~~~~ 582 (587) T protein:vir:95 556 GN---E--ARISMTVYPIRSFKKISVSLVYKQ 582 (587) T ss_pred CC---E--EEEEEEEEEcccceEEEEEEEEee Confidence 11 2 247888999999999999999755 No 31 >protein:vir:96586 Length: 587 # NCBI annotation: ORF011 # Family: family:all:2449 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238553;genbank:gi:66391266;genbank:GeneID:5130368 Probab=98.63 E-value=1.7e-07 Score=57.74 Aligned_cols=328 Identities=11% Similarity=0.001 Sum_probs=160.0 Q ss_pred Cccccccc--------ccceEEEEEeeeccccccccccceEEEeeccccCcccceEEEecHH--HHH--h------ccCC Q lcl|NC_020841. 1 MAGSLTLP--------INMLVNVSIEYQAKLLSRDAFNRLLIVGSTAPNGRATDTGIYTSID--GVK--L------DYGV 62 (367) Q Consensus 1 ~~~~~~l~--------i~~iv~V~i~~~~~~~~~~~fg~~li~~~~~~~~~~~~~~~yts~~--~v~--~------df~~ 62 (367) ++-++.-+ +.+ ++-...+..+-.+.++++...-..............+|...- ++. . ++.. T Consensus 191 ~~yrl~~g~~~~~~~~~~~-~~~~~~~tAky~g~~~n~~~v~v~d~~~~~~~k~~~~y~~t~~~di~~~~~~~~~~~~~~ 269 (587) T protein:vir:96 191 KAYELNGGAYSFTNEIITD-INELPDFEAKLSPFGDKNLESRKLDEATDVDIKGKAVYVKAVFGDIENQTQYNQYVKFEQ 269 (587) T ss_pred EEEEeCCCchhhhhhhhhh-hccccceEEEeecccCceeEEEeeccccccccceEEEeehhhhhhhhhhhccccceeecc Confidence 11111000 001 000011122222333333222111111000111112222110 000 0 0000 Q ss_pred -Cc--HHHHHHHHH----------h---ccC-cccceEEEEecc--CccchHHHHHHHHhcccCcEEEEEEecCCHHHHH Q lcl|NC_020841. 63 -EA--DEYKIAQKY----------F---SQN-PKPRDLMIATVT--ALTDPLASIGEVAAKTLGFYAFCFASEVAAADIQ 123 (367) Q Consensus 63 -~s--~~ykaA~~~----------F---~Q~-p~p~~v~v~~~~--~~~t~~~~l~~~~~~~~~w~~~~~~~~~~~~~~~ 123 (367) .. ........- . ... +.+..-..|+.+ ...+..+++++++.. +|.+++ ....+.+.+. T Consensus 270 ~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~aLtGG~dG~~~~~y~~~l~ale~~--~~~~i~-~~t~d~ai~~ 346 (587) T protein:vir:96 270 LPEQASEPSDVEVHAETESATVTATSKPKAIEPFELTKLSGGTNGEPPTSWSAKLEKFKNE--GGYYIV-PLTDRQSVHS 346 (587) T ss_pred ccchhhhhhcccccccccceeeeecccccccccccceeeecCCCCCCcccHHHHHHHHhhC--CcEEEE-ecCCCHHHHH Confidence 00 000000000 0 000 111111223321 123567788888765 454443 3444556677 Q ss_pred HHHHHhhcc---CcEEEEEE-eCc-hhhhHHHHHHHHhccccceeecCC--------------chhHHHHHHHHHHHccc Q lcl|NC_020841. 124 GLAEWAQSN---NRMFMTVM-TDD-TEAVTTGNALKELGQYHYCITYHE--------------DYATVGAVAGMALDQRY 184 (367) Q Consensus 124 ala~~~ea~---~~~~~~~~-~d~-~~~~~~~~~~~~~~~~~~~~~~~~--------------~~~~~~~~~~~~~~~~~ 184 (367) .+..|++.. .+++..+. ... ..........+.+++.+.....++ ....+++++|..++... T Consensus 347 ~l~a~vk~~r~~gk~~~aVlg~~~~~~~~~~~~~a~~~n~e~vi~v~~~~~~~~~~~~~~~~~~~~~aa~vAG~~Ag~~~ 426 (587) T protein:vir:96 347 EVATFVKNRSDAGEPMRAIVGGGTSETKEKLFGRQAILNNPRVALVANSGKFVMGNGRILQAPAYMVASAVAGLVSGLDI 426 (587) T ss_pred HHHHHHHHHHhCCCeEEEEecCCCCCCHHHHHHHHhhcCCCcEEEEecceEEecCCCceeeechhhHHHHHHHHHhcCcc Confidence 799999643 33343333 222 122233334445555444333221 12235567777776655 Q ss_pred ccCcceeeeeeeecCccc-ccCCCHHHHHHHHhCCceEEEEeeccccceEEEecCEeeCC-----c--hhhHHHHHHHHH Q lcl|NC_020841. 185 DKTDGVKTLHLKSLVSVV-STDISQTQAASLKAACINYYSDYGNPDNSLPIFANGHAGGG-----K--FFDFVMGFDWLR 256 (367) Q Consensus 185 ~~~~g~~t~~~k~l~Gv~-~~~~t~t~~~~l~~~~~n~y~~~~~~~~~~~~~~~G~~~~G-----~--~iD~~~~~dwl~ 256 (367) +..+ | ||.++++. ...++.+|++.+..+|...+....+......-.-++.++-. . .|-.++-.|.+. T Consensus 427 ~~S~---T--~~~~~~~~v~~~~t~~e~~~~i~~G~~~l~~~~~~~~~v~~~vnsitT~t~~~~~~~~~i~virv~D~i~ 501 (587) T protein:vir:96 427 GESI---T--FKPLFVNSLDKVYESEELDELNENGIITIEFVRNRMTTMFRIVDDVTTFPDKNDPVKSEMALGEANDFLV 501 (587) T ss_pred ccCc---c--ceeeecccccccCCHHHHHHHHhCCeEEEEEecCCcEEEEEeeccceecCCCCCchhhhhhhHHHHHHHH Confidence 5433 3 34444332 23789999999999999998765443221111223444322 2 477889999999 Q ss_pred HHHHHHHHHHHHhcCCCCcCHhHHHHHHHHHHHHHHHHHhcCceecccccCccccccccccccccceeEEcCchHhCCHH Q lcl|NC_020841. 257 NVIETNVFNGQRLRRLTPQTDRGMMMIKADIVNGLEEAVKAGLVAAGTWNGAALGEIETYDYLPTGYYVYNESIRDQAQV 336 (367) Q Consensus 257 ~~lq~~l~~ll~~~~kipy~~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~g~~~~~~~~~~gy~v~~~~~~~~~~~ 336 (367) ..++..+-+.++- | |=++.|...|++.++..|++..+.|.|..... +. ..+.. .+ T Consensus 502 ~di~~~~~~~yiG--k-~nn~~~r~~v~~~i~~~L~~l~~~g~I~~~~~--~d-------------v~v~~-------~~ 556 (587) T protein:vir:96 502 SELKILLEEQYIG--T-RTINTSASQIKDFVQSYLGRKKRDNEIQDFPP--ED-------------VQVII-------EG 556 (587) T ss_pred HHHHHHHHhcCCc--c-ccCHHHHHHHHHHHHHHHHHHHhCCcccCCCc--cc-------------eEEEe-------cC Confidence 9988887766654 3 44788999999999999999999999953211 11 11111 11 Q ss_pred HHhccccCCeEEEEEECceEEEEEEEEEecC Q lcl|NC_020841. 337 IREQRIAPPFIILVKGAGAIHDTDITLIPEA 367 (367) Q Consensus 337 dr~~R~~~~~~~~~~~agaIh~v~i~~~v~~ 367 (367) | +. -+.+.+...-+|++|.+++.+.. T Consensus 557 D---~~--~v~~~v~Pv~~mekIy~tv~~~~ 582 (587) T protein:vir:96 557 N---EA--RISLTIFPIRALKKISVSLVYRQ 582 (587) T ss_pred C---EE--EEEEEEEEcccceEEEEEEEEEe Confidence 2 22 37889999999999999999755 No 32 >protein:vir:7653 Length: 581 # NCBI annotation: gp124 # Family: family:all:1196 # MgeID: mge:148 # MgeName: Bxz1 # Cross-refs: genbank:acc:NP_818197;genbank:gi:29566631;genbank:GeneID:1259809 Probab=98.50 E-value=4.4e-07 Score=55.41 Aligned_cols=320 Identities=12% Similarity=0.037 Sum_probs=161.9 Q ss_pred CcccccccccceEEEEEeeeccccccccccceEEEeecc--ccCcccceEEEecHHHHHhcc--------CCCcHHHHHH Q lcl|NC_020841. 1 MAGSLTLPINMLVNVSIEYQAKLLSRDAFNRLLIVGSTA--PNGRATDTGIYTSIDGVKLDY--------GVEADEYKIA 70 (367) Q Consensus 1 ~~~~~~l~i~~iv~V~i~~~~~~~~~~~fg~~li~~~~~--~~~~~~~~~~yts~~~v~~df--------~~~s~~ykaA 70 (367) ++...+ ++++.+ +++ . ..+....-....++..+- ..+.......|.+.++...-| ...++.-+.+ T Consensus 198 ~~~~~~-~~~~~~-~t~--~-~~~~g~~~~~~~i~~~~~~~~D~~~~~~v~~~~~~~~~~~~~~~~~~~g~~~~e~~~~~ 272 (581) T protein:vir:76 198 GEDGEA-NTRDDL-YTI--Q-RVVDGGHIDPGDIVQLSYRYTDPNYHEVIRFTDPDDIQDFYGPAFDEAGNVQSEITLCA 272 (581) T ss_pred Ccccce-eeeeee-eee--E-eecccccccceeEEEEEEEeecCCccceEEEecccccccceeeehhhcCccccchhhhh Confidence 111100 111100 011 0 011111111122221110 011122234455554443222 2233444445 Q ss_pred HHHhccCcccceEEEEeccC------ccchHHHHHHHHhcccCcEEEEEEecCCHHHHHHHHHHhhccC---c-EE--EE Q lcl|NC_020841. 71 QKYFSQNPKPRDLMIATVTA------LTDPLASIGEVAAKTLGFYAFCFASEVAAADIQGLAEWAQSNN---R-MF--MT 138 (367) Q Consensus 71 ~~~F~Q~p~p~~v~v~~~~~------~~t~~~~l~~~~~~~~~w~~~~~~~~~~~~~~~ala~~~ea~~---~-~~--~~ 138 (367) +..| .+.+.....++.+. ..++.+++++++.+.. ..+++....+..-+..+..|++..+ + +. +. T Consensus 273 ~~~~--t~~~~~~l~~gvd~~g~tvt~~dy~~aL~ale~~~~--~~ivvp~t~~~~i~a~l~ahv~~~s~~~~~~ra~ig 348 (581) T protein:vir:76 273 QLAI--TNGASTILACAVDPEGDTVTMGDYQNALNKFRDEDE--IAIIVAGTGAQPIQALVQQHVSAQSNNKYERRAILG 348 (581) T ss_pred heee--ccccceEEEeeecCCCCccchHHHHHHHHHHhcCCe--EEEEEecCCChHHHHHHHHHHHHHHhccCCceEEEE Confidence 4433 33344455555443 2245667777776433 3334444434343455777775432 2 22 22 Q ss_pred EEeC--chhhhHHHHHHHHhccccceeecC------Cc----------hhHHHHHHHHHHHcccccCcceeeeeeeecCc Q lcl|NC_020841. 139 VMTD--DTEAVTTGNALKELGQYHYCITYH------ED----------YATVGAVAGMALDQRYDKTDGVKTLHLKSLVS 200 (367) Q Consensus 139 ~~~d--~~~~~~~~~~~~~~~~~~~~~~~~------~~----------~~~~~~~~~~~~~~~~~~~~g~~t~~~k~l~G 200 (367) +... .............++..|....++ .+ ...+++++|..++.. -...+-||.++| T Consensus 349 v~g~~~~~~~~~~~~~a~~~ns~Rvvlv~p~~~~~~g~~~~~~~~lp~~~~AA~vAG~~a~~~-----~~~slT~~~i~g 423 (581) T protein:vir:76 349 MDGSVTPVPSATRIANAQSIKDQRVALISPSSFVYYAPELNREVVLGGQFMAAAVAGKSVSAI-----AAMPLTRKVIRG 423 (581) T ss_pred eeCCCCCchHHHHHHhhcccCCCcEEEEEcCceEeccccCCcceecchhhhhhhHHhhhhccc-----cccCcccccccc Confidence 2211 111112222333444444433331 11 112334444444433 344556888888 Q ss_pred ccc--cCCCHHHHHHHHhCCceEEEEeeccccceEEEecCEee---CC--chhhHHHHHHHHHHHHHHHHHH-HHHhcCC Q lcl|NC_020841. 201 VVS--TDISQTQAASLKAACINYYSDYGNPDNSLPIFANGHAG---GG--KFFDFVMGFDWLRNVIETNVFN-GQRLRRL 272 (367) Q Consensus 201 v~~--~~~t~t~~~~l~~~~~n~y~~~~~~~~~~~~~~~G~~~---~G--~~iD~~~~~dwl~~~lq~~l~~-ll~~~~k 272 (367) +.. ..++.+|++.+..+|++.+....+.. ..+-+|..+ +. ..|-.++-.|.+...+++.+.. .|.. + T Consensus 424 ~~~~~~~~s~~e~e~ll~~Gv~~l~~~~~~~---v~Iv~gItT~~s~~~~k~i~viR~~D~v~~~vr~~~~~~~fiG--~ 498 (581) T protein:vir:76 424 FSGPAEVQRDGEKSRESSEGLMVIEKTPRNL---VHVRHGVTTDPTSLHTREWNIIGQQDVMVYRIRDYLDADGLIG--M 498 (581) T ss_pred cccccccCCHHHHHHHHhCCeEEEEEecCCe---EEEEEeeecCCCCCccceeeehhhhHHHHHHHHHHHhhhcCCC--c Confidence 874 46899999999999999998654322 224566543 22 4588999999999999998864 4553 2 Q ss_pred CCcCHhHHHHHHHHHHHHHHHHHhcCceecccccCccccccccccccccceeEEcCchHhCCHHHHh-ccccCCeEEEEE Q lcl|NC_020841. 273 TPQTDRGMMMIKADIVNGLEEAVKAGLVAAGTWNGAALGEIETYDYLPTGYYVYNESIRDQAQVIRE-QRIAPPFIILVK 351 (367) Q Consensus 273 ipy~~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~g~~~~~~~~~~gy~v~~~~~~~~~~~dr~-~R~~~~~~~~~~ 351 (367) |=++.|...|++.+.+.|.+..++|+|....... . +..++. .| --+.+.+. T Consensus 499 -~n~~~~r~~ik~~i~~~L~~l~~~g~I~g~~~~~-----------------~--------~~~~~~~d~--v~V~i~v~ 550 (581) T protein:vir:76 499 -PIYDTTIVQVKASAEAALVWLVDNNIIRGYRNLK-----------------A--------RQIERQPDV--IEVRYEWR 550 (581) T ss_pred -ccChHHHHHHHHHHHHHHHHHHhcCcccCcccce-----------------e--------eEEecCCCE--EEEEEEEE Confidence 5588999999999999999999999996422100 0 011111 12 23678888 Q ss_pred ECceEEEEEE--EEEecC Q lcl|NC_020841. 352 GAGAIHDTDI--TLIPEA 367 (367) Q Consensus 352 ~agaIh~v~i--~~~v~~ 367 (367) ..-+|.+|.+ .++++. T Consensus 551 Pv~~ie~I~vt~~~~p~~ 568 (581) T protein:vir:76 551 PAYPLNYIVVRYSIAPET 568 (581) T ss_pred ecccceEEEEEEEEeeCC Confidence 8888987766 555665 No 33 >protein:vir:107865 Length: 477 # NCBI annotation: gp39 # Family: family:all:115 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024712;genbank:gi:48696949;genbank:GeneID:2845953 Probab=98.49 E-value=4.6e-07 Score=55.32 Aligned_cols=328 Identities=9% Similarity=-0.061 Sum_probs=182.9 Q ss_pred CcccccccccceEEEEE-eeeccccccccccceEEEeeccccCcccceEEEecHHHHHh--ccCCCcHHHHHHHHHhccC Q lcl|NC_020841. 1 MAGSLTLPINMLVNVSI-EYQAKLLSRDAFNRLLIVGSTAPNGRATDTGIYTSIDGVKL--DYGVEADEYKIAQKYFSQN 77 (367) Q Consensus 1 ~~~~~~l~i~~iv~V~i-~~~~~~~~~~~fg~~li~~~~~~~~~~~~~~~yts~~~v~~--df~~~s~~ykaA~~~F~Q~ 77 (367) |+-. + .-=|-|.- +-.++++....-+.+.++|..+..+.....+ .+|..+... +......+..++..+|.++ T Consensus 1 M~~~--~--~pGVyv~E~~~~~~~i~~v~T~v~~~VG~a~~gp~n~pv~-its~~d~~~~g~~~~~~tL~~Av~~~f~nG 75 (477) T protein:vir:10 1 MAAN--Y--LHGVETIEKETGSRPVKVVKSAVIGLIGTAPIGPVNTPVQ-SLSDVDAAQFGPQLAGFTIPQALDAVYDYG 75 (477) T ss_pred Cccc--C--CCCeEEEEccCCcccccccCCceeEEEecccCCCCCcCEE-EccHHHHHHhccCCCCCcHHHHHHHHHhcc Confidence 3321 2 22344442 3335677777788888998766544444444 466655532 2345677899999999988 Q ss_pred cccceEEEEeccCccc-----------------------------------------h--HHHHHHHHhcccCc------ Q lcl|NC_020841. 78 PKPRDLMIATVTALTD-----------------------------------------P--LASIGEVAAKTLGF------ 108 (367) Q Consensus 78 p~p~~v~v~~~~~~~t-----------------------------------------~--~~~l~~~~~~~~~w------ 108 (367) .++.. +.+...... . ...+.........+ T Consensus 76 g~~~~--vVrV~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 153 (477) T protein:vir:10 76 SGTVI--VINVLDPAVHKSNAANEPVTFDAATGRAKLAHPAAANLVLKNDSGGTTYAEGTDYAVDLINGVITRIKTGTIP 153 (477) T ss_pred ceEEE--EEecCccccccccccccccccccccceecccccccccccccccccccccccchhhhhhhccccceeccccccc Confidence 65432 222110000 0 00000000000000 Q ss_pred ---------------------------------------------E----EEEEEe--cCCHHHHHHHHHHhhccCcEEE Q lcl|NC_020841. 109 ---------------------------------------------Y----AFCFAS--EVAAADIQGLAEWAQSNNRMFM 137 (367) Q Consensus 109 ---------------------------------------------~----~~~~~~--~~~~~~~~ala~~~ea~~~~~~ 137 (367) + ..+... ....+...+|...++.. +.+. T Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~tGl~al~~~~~~~~~~~~~l~apg~~~~~~v~~~l~~~~~~~-~~~~ 232 (477) T protein:vir:10 154 PGATAAKATYDYADPTKVTAADIIGAVNAAGMRTGMKALKDTYNLYGYFSKILIAPAYCTQNSVSVELEAMAVQL-GAIA 232 (477) T ss_pred ccceeeeeccccccccccccccccccccccchhhhhhhhhhhhhhcchhcccccccccccchhhHHHHHHHHhhC-CEEE Confidence 0 000000 00111223333333322 2333 Q ss_pred EEEeCchh-hhHHHHHHHH-------hccccceeecC------C------chhHHHHHHHHHHHcccccCcc-eeeeeee Q lcl|NC_020841. 138 TVMTDDTE-AVTTGNALKE-------LGQYHYCITYH------E------DYATVGAVAGMALDQRYDKTDG-VKTLHLK 196 (367) Q Consensus 138 ~~~~d~~~-~~~~~~~~~~-------~~~~~~~~~~~------~------~~~~~~~~~~~~~~~~~~~~~g-~~t~~~k 196 (367) ++-..... .......... +...+..+.+. . ..+..+.++|..+..+-.+ | ..+...| T Consensus 233 ~~d~p~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~ag~~a~~d~~~--g~~~span~ 310 (477) T protein:vir:10 233 YIDAPIGTTLAQALAGRGPAGTINFNTSSDRVRLCYPHVKVYDTATNAERLEPLSSRAAGLRARVDLDK--GYWWSSSNQ 310 (477) T ss_pred EEecCCCCCHHHHHhhhhhccccccccccceEEEEcCeEEEecccCCceeEEchHHHHHHHHHHhhhcC--CceeccCCc Confidence 33222111 1111111111 11222222221 0 1133456666666554222 3 2333445 Q ss_pred ecCcccc---c-----CCCHHHHHHHHhCCceEEEEeeccccceEEEecCEeeCC-------chhhHHHHHHHHHHHHHH Q lcl|NC_020841. 197 SLVSVVS---T-----DISQTQAASLKAACINYYSDYGNPDNSLPIFANGHAGGG-------KFFDFVMGFDWLRNVIET 261 (367) Q Consensus 197 ~l~Gv~~---~-----~~t~t~~~~l~~~~~n~y~~~~~~~~~~~~~~~G~~~~G-------~~iD~~~~~dwl~~~lq~ 261 (367) .+.||.. . ..+++|.+.|..+++|++.++.+.+ .++..++++.+ .||-+.+-.+|++..|+. T Consensus 311 ~~~gi~~~~~~~~~~~~~~~~~~~~L~~~gi~~i~~~~~~G---~~~wG~rT~~~~~~~~~~~~~~vrR~~~~i~~~~~~ 387 (477) T protein:vir:10 311 QLVGVTGVERPLSAMIDDPQSDVNMLNEQGITTVFSSYGSG---LRLWGNRTAAWPTVTHMRNFENVRRTGDVINESLRY 387 (477) T ss_pred eeccccccccccccccCCChhhHHHHhhCCceEEEEecCCc---EEEEcccccCCCCCCcccceeehhhHHHHHHHHHHH Confidence 5555442 2 2356899999999999999886543 24666676633 267888899999999988 Q ss_pred HHHHHHHhcCCCCcCHhHHHHHHHHHHHHHHHHHhcCceecccccCccccccccccccccceeEEcCchHhCCHHHHhcc Q lcl|NC_020841. 262 NVFNGQRLRRLTPQTDRGMMMIKADIVNGLEEAVKAGLVAAGTWNGAALGEIETYDYLPTGYYVYNESIRDQAQVIREQR 341 (367) Q Consensus 262 ~l~~ll~~~~kipy~~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~g~~~~~~~~~~gy~v~~~~~~~~~~~dr~~R 341 (367) .+...+-. |.+..-...|+..++.-|+..++.|.|. ||.+.+ +.++.|++|+.++ T Consensus 388 ~~~~~v~~----~~~~~~~~~i~~~i~~~l~~l~~~g~l~--------------------g~~v~~-~~~~nt~~~i~~G 442 (477) T protein:vir:10 388 FSQQFVDA----PIDQGLIDSLVESVNGFGRKLIGDGALL--------------------GFKAWF-DPARNPKEELAAG 442 (477) T ss_pred HHHHhccC----CCCHHHHHHHHHHHHHHHHHHHhCCcee--------------------eeEEEE-ecCCCCHHHhhCC Confidence 88875542 4577788999999999999999999994 366776 4678899999999 Q ss_pred ccCCeEEEEEECceEEEEEEEEEecC Q lcl|NC_020841. 342 IAPPFIILVKGAGAIHDTDITLIPEA 367 (367) Q Consensus 342 ~~~~~~~~~~~agaIh~v~i~~~v~~ 367 (367) +. -+.+.+.....++.|.+.+..+- T Consensus 443 ~~-~~~i~~~p~~p~e~i~~~~~~~~ 467 (477) T protein:vir:10 443 HL-LINYKYTVPPPLERLTYETEITS 467 (477) T ss_pred eE-EEEEEEEecCCcceEEEEEEEcc Confidence 98 49999999999999888776666 No 34 >protein:vir:105470 Length: 451 # NCBI annotation: putative phage sheath tail protein # Family: family:all:632 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529880;genbank:gi:90592620;genbank:GeneID:3974534 Probab=98.42 E-value=1.4e-07 Score=58.10 Aligned_cols=315 Identities=11% Similarity=0.039 Sum_probs=153.1 Q ss_pred Cccccc--------cccc-------------ceEEEEEeeeccccccccccceEEEeeccccCcccceEEEecHHHHHhc Q lcl|NC_020841. 1 MAGSLT--------LPIN-------------MLVNVSIEYQAKLLSRDAFNRLLIVGSTAPNGRATDTGIYTSIDGVKLD 59 (367) Q Consensus 1 ~~~~~~--------l~i~-------------~iv~V~i~~~~~~~~~~~fg~~li~~~~~~~~~~~~~~~yts~~~v~~d 59 (367) +..++. +..+ +-+.|+|.-+ ......|-..+.++...+ . .....+.+..++.. T Consensus 86 ~~yrl~~g~~a~~t~~~~~~~~~Aky~G~~Gn~i~v~v~~~--~~d~~~~~v~t~~g~~~v-d--~qtv~~~~~~el~~- 159 (451) T protein:vir:10 86 LVLNPNEGTAATLTKEGLPWTVTANYPGEKGNQITVSVEVS--PADQNAATVSTIFGTKLV-D--EQSIKFNELDKFKG- 159 (451) T ss_pred EEEEcCCCceEEEEeecCceEEEEeeCCcCCceEEEEEecc--cCCcCceEEEEEECCeEE-E--EEEeeccchhhccC- Confidence 111110 0000 0111211111 111111111111111100 0 00001112222211 Q ss_pred cCCCcHHHHHHHHHhccCcccceEE-E-Ee------ccCccchHHHHHHHHhcccCcEEEEEEe-cCCHHHHHHHHHHhh Q lcl|NC_020841. 60 YGVEADEYKIAQKYFSQNPKPRDLM-I-AT------VTALTDPLASIGEVAAKTLGFYAFCFAS-EVAAADIQGLAEWAQ 130 (367) Q Consensus 60 f~~~s~~ykaA~~~F~Q~p~p~~v~-v-~~------~~~~~t~~~~l~~~~~~~~~w~~~~~~~-~~~~~~~~ala~~~e 130 (367) ..|..+...+...+.+.... . +. ..+.+...+++.+++....+|. ++.. +.+.+.+..+..|+. T Consensus 160 -----nd~V~a~~~~~g~~~~~~~~~l~~~~~gg~~~~~~~~~~~~l~~~e~~~~n~l--~~~~~~~~~~i~~~~~a~ik 232 (451) T protein:vir:10 160 -----NDYITAKVVEEGSSKPVAFTNVSGTLTGGTTTESNKVESLLNDALENEEYAVV--TTAGFEPSSNMNKLVVEAVK 232 (451) T ss_pred -----CceEEEEecccccccceeeeecccccccccccCCccchHHHHHHhccceeeEE--EEccCCCchHHHHHHHHHHH Confidence 11111111122222111111 0 00 0123345566777766555543 3332 333456777899998 Q ss_pred c----cCcEEEEEEeC-chhhhHHHHHHHHhccccceee----cCCchhHHHHHHHHHHHcccccCcceeeeeeeecCcc Q lcl|NC_020841. 131 S----NNRMFMTVMTD-DTEAVTTGNALKELGQYHYCIT----YHEDYATVGAVAGMALDQRYDKTDGVKTLHLKSLVSV 201 (367) Q Consensus 131 a----~~~~~~~~~~d-~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~g~~t~~~k~l~Gv 201 (367) . .++++-.+... .....+-. .. ......... .+.....+++++|..++...++ ..-|+.++|+ T Consensus 233 ~~r~~~g~~~~aVl~~~~~~~~d~e-gi--inv~n~~~~~dg~~~~~~~~~~~vAG~~Ag~~~~~-----S~T~~~~~~~ 304 (451) T protein:vir:10 233 RLRENEGRKVRGVIPTDADTTYNYE-GI--STVVNGYTLSDGTNVDVKDATGYFAGISASADVAT-----SLTYFEVEDA 304 (451) T ss_pred HHHHhcCCeEEEEecCccCCCCCCc-ce--EEeecceEecCceeechhhhHHHHHHHHccccccc-----CccceecCCc Confidence 5 24555333321 11111100 00 001111111 1233445667777777654443 3346677776 Q ss_pred c-c-cCCCHHHHHHHHhCCceEEEEeeccccceEEEecCEee--------CCc--hhhHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_020841. 202 V-S-TDISQTQAASLKAACINYYSDYGNPDNSLPIFANGHAG--------GGK--FFDFVMGFDWLRNVIETNVFNGQRL 269 (367) Q Consensus 202 ~-~-~~~t~t~~~~l~~~~~n~y~~~~~~~~~~~~~~~G~~~--------~G~--~iD~~~~~dwl~~~lq~~l~~ll~~ 269 (367) . . ..++.+|+..+.++|...+....+ .. ..+.+|..+ +-. .|-.++-.|-+...++..+-+.++ T Consensus 305 ~~v~~~~t~~e~~~~i~~G~lvl~~~~g--~~-v~i~~~INTltt~~~~k~~~~~ki~vir~~D~i~~di~~~~~~~yi- 380 (451) T protein:vir:10 305 VSAYPKFDNEKTIKALDAGQIVFTTRPG--QR-VVIEQDINSLHKFTAEKPQAFSKNRVIRTLDEIATNTENTFERTYL- 380 (451) T ss_pred eeeeeeCCHHHHHHHHhCCeEEEEEEcC--Ce-EEEEEccccceecCCCCCcchhhhhHHHHHHHHHHHHHHHhhhccc- Confidence 4 3 478999999999999877643222 11 234456533 112 377888888888887765544333 Q ss_pred cCCCCcCHhHHHHHHHHHHHHHHHHHhcCceecccccCccccccccccccccceeEEcCchHhCCHHHHhccccCCeEEE Q lcl|NC_020841. 270 RRLTPQTDRGMMMIKADIVNGLEEAVKAGLVAAGTWNGAALGEIETYDYLPTGYYVYNESIRDQAQVIREQRIAPPFIIL 349 (367) Q Consensus 270 ~~kipy~~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~g~~~~~~~~~~gy~v~~~~~~~~~~~dr~~R~~~~~~~~ 349 (367) .|+|=+..|..++++.|..-|++..+.|.|..+...+ -.+.. . ..+..--+++. T Consensus 381 -Gk~~N~~~gr~~~~~~i~~yl~~l~~~g~i~~~~~~d---------------~~v~~-------~---~~~~~v~v~~~ 434 (451) T protein:vir:10 381 -GNVGNNAAGRDLFKADRIAYLTSLQNRNMIQSFANTD---------------ITVEA-------G---NDMDSIVVNLA 434 (451) T ss_pred -eecCCCHHHHHHHHHHHHHHHHHHHhCCCccCCCccc---------------eEEee-------c---CCCCEEEEEEE Confidence 5889899999999999999999999999996543210 01110 0 01233348889 Q ss_pred EEECceEEEEEEEEEec Q lcl|NC_020841. 350 VKGAGAIHDTDITLIPE 366 (367) Q Consensus 350 ~~~agaIh~v~i~~~v~ 366 (367) ++.-.||..+.+++.|+ T Consensus 435 v~pvdame~iy~t~~v~ 451 (451) T protein:vir:10 435 VTPVDAMEKLYMTMVVR 451 (451) T ss_pred EEEEeeeeeEEEEEEEc Confidence 99999999999999999 No 35 >protein:vir:79092 Length: 477 # NCBI annotation: gp13, phage tail sheath protein # Family: family:all:115 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111213;genbank:gi:134288804;genbank:GeneID:4960766 Probab=98.41 E-value=8e-07 Score=54.02 Aligned_cols=328 Identities=9% Similarity=-0.060 Sum_probs=184.0 Q ss_pred CcccccccccceEEEE-EeeeccccccccccceEEEeeccccCcccceEEEecHHHHHh--ccCCCcHHHHHHHHHhccC Q lcl|NC_020841. 1 MAGSLTLPINMLVNVS-IEYQAKLLSRDAFNRLLIVGSTAPNGRATDTGIYTSIDGVKL--DYGVEADEYKIAQKYFSQN 77 (367) Q Consensus 1 ~~~~~~l~i~~iv~V~-i~~~~~~~~~~~fg~~li~~~~~~~~~~~~~~~yts~~~v~~--df~~~s~~ykaA~~~F~Q~ 77 (367) ||- .-+| =|-|. ++-.++++....-+.+.++|.....+..+.. ..+++.+.+. +.......+.++..+|.++ T Consensus 1 M~~-~~~p---GVyv~E~~~g~~~I~~v~Tsv~~~VG~a~~~p~n~pv-~its~~d~~~~g~~~~~~tL~~Av~~~f~ng 75 (477) T protein:vir:79 1 MAA-NYLH---GVETIEKETGSRPVKVVKSAVIGLIGTAPIGPVNTPV-QSLSDVDAAQFGPQLAGFTIPQALDAVYDYG 75 (477) T ss_pred CcC-CCCC---CeEEEEecCCcccccccCCceEEEEeecccCCCcccE-EEccHHHHHHhcCCCCCCcHHHHHHHHhhcC Confidence 542 2233 34444 3334567777788888899877655544554 4466666543 2335677889999999886 Q ss_pred cccceEEEEeccCccc----------------------------------------------hH---------------- Q lcl|NC_020841. 78 PKPRDLMIATVTALTD----------------------------------------------PL---------------- 95 (367) Q Consensus 78 p~p~~v~v~~~~~~~t----------------------------------------------~~---------------- 95 (367) -.+. ++.+...... .. T Consensus 76 g~~~--~vvrV~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 153 (477) T protein:vir:79 76 SGTV--IVINVLDPAVHKSNAASESVTFDAATGRAKLAHPAAANLVLKNDSGGTTYTEGTDYAVDLINGVITRIKTGTIP 153 (477) T ss_pred CceE--EEEeccCCccccccccccccccccccccccccccccceeEEeecccccccccCccccccccchhhhhhhccccc Confidence 4332 2222110000 00 Q ss_pred ------------------------------------HHHHHHHhcccCcEEEEEEec--CCHHHHHHHHHHhhccCcEEE Q lcl|NC_020841. 96 ------------------------------------ASIGEVAAKTLGFYAFCFASE--VAAADIQGLAEWAQSNNRMFM 137 (367) Q Consensus 96 ------------------------------------~~l~~~~~~~~~w~~~~~~~~--~~~~~~~ala~~~ea~~~~~~ 137 (367) +++.........-...+.... .......++...++.. +.+. T Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~g~~~a~~~~tg~~al~~~~~~~~~~~~iv~apg~~~~~~v~~~l~~~~~~~-~~~a 232 (477) T protein:vir:79 154 AAATAAKATYDYADPTKVTAADIIGAVNAAGMRTGMKALKDTYNLYGYFSKILIAPAYCTQNSVSVELEAMAVQL-GAIA 232 (477) T ss_pred cccceeeceeccCCcccceeeeecccccccccchhhhhhhhhhhhcccccceeeccccccchhHHHHHHHHHhhc-CeEE Confidence 000000000000000000000 1112233333333322 2233 Q ss_pred EEEeCchh-hhHHHHHHHH-------hccccceeecC------C------chhHHHHHHHHHHHcccccCcc-eeeeeee Q lcl|NC_020841. 138 TVMTDDTE-AVTTGNALKE-------LGQYHYCITYH------E------DYATVGAVAGMALDQRYDKTDG-VKTLHLK 196 (367) Q Consensus 138 ~~~~d~~~-~~~~~~~~~~-------~~~~~~~~~~~------~------~~~~~~~~~~~~~~~~~~~~~g-~~t~~~k 196 (367) ++-..... .......... +...+..+.+. . ..+..+.++|..+..+.. .| ..+...| T Consensus 233 ~~d~p~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~s~~~ag~~a~~d~~--~g~~~span~ 310 (477) T protein:vir:79 233 YIDAPIGTTLAQALAGRGPAGTINFNTSSDRVRLCYPHVKVYDIATNAERLEPLSSRAAGLRARVDLD--KGYWWSSSNQ 310 (477) T ss_pred EEecCCCCChHHHhhhhhhccccccccccceEEEEcCeeEEecccCCceeeechHHHHHHHHHHhhcc--CCceEccCCc Confidence 32221111 1111111111 11122222221 0 013345666666655432 23 2333445 Q ss_pred ecCcccc---c-----CCCHHHHHHHHhCCceEEEEeeccccceEEEecCEeeCC-------chhhHHHHHHHHHHHHHH Q lcl|NC_020841. 197 SLVSVVS---T-----DISQTQAASLKAACINYYSDYGNPDNSLPIFANGHAGGG-------KFFDFVMGFDWLRNVIET 261 (367) Q Consensus 197 ~l~Gv~~---~-----~~t~t~~~~l~~~~~n~y~~~~~~~~~~~~~~~G~~~~G-------~~iD~~~~~dwl~~~lq~ 261 (367) .+.||.. . ..+++|.+.|.++|+|.+..+.+.+ ..+..++++.+ .||-+.+-.+|++..|+. T Consensus 311 ~~~gv~~~~~~~~~~~~~~~~~~~~L~~~~i~~i~~~~~~G---~~~wG~rT~~~~~~~~~~~~i~vrR~~~~i~~~~~~ 387 (477) T protein:vir:79 311 QLVGVTGVERPLSAMIDDPQSDVNMLNEQGITTVFSSYGSG---LRLWGNRTAAWPTVTHMRNFENVRRTGDVINESLRY 387 (477) T ss_pred eeecceecccccccccCCChhhHHHHhhCCceEEEEecCCc---EEEEcccccCCCCCCccceeeehhhHHHHHHHHHHH Confidence 5555542 1 2356899999999999999886543 25666776531 268888999999999999 Q ss_pred HHHHHHHhcCCCCcCHhHHHHHHHHHHHHHHHHHhcCceecccccCccccccccccccccceeEEcCchHhCCHHHHhcc Q lcl|NC_020841. 262 NVFNGQRLRRLTPQTDRGMMMIKADIVNGLEEAVKAGLVAAGTWNGAALGEIETYDYLPTGYYVYNESIRDQAQVIREQR 341 (367) Q Consensus 262 ~l~~ll~~~~kipy~~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~g~~~~~~~~~~gy~v~~~~~~~~~~~dr~~R 341 (367) .+..++-. |-+..-...|+..|+.-|++.++.|.|. ||.+.+ +.++.+++|+.++ T Consensus 388 ~~~~~v~e----~~~~~~~~~i~~~i~~~l~~l~~~g~l~--------------------g~~v~~-~~~~nt~~~i~~G 442 (477) T protein:vir:79 388 FSQQFVDA----PIDQGLIDSLVESVNGFGRKLIGDGALL--------------------GFKAWF-DPARNPKEELAAG 442 (477) T ss_pred HHHHhccC----CCCHHHHHHHHHHHHHHHHHHHhCCcee--------------------eeEEEE-ecCCCCHHHhhCC Confidence 88876543 4477778999999999999999999984 366766 5678899999999 Q ss_pred ccCCeEEEEEECceEEEEEEEEEecC Q lcl|NC_020841. 342 IAPPFIILVKGAGAIHDTDITLIPEA 367 (367) Q Consensus 342 ~~~~~~~~~~~agaIh~v~i~~~v~~ 367 (367) +. -+.+.+.....++.|.+.+..+- T Consensus 443 ~~-~~~i~~~p~~p~e~i~~~~~~~~ 467 (477) T protein:vir:79 443 HL-LINYKYTVPPPLERLTYETEITS 467 (477) T ss_pred eE-EEEEEEEecCCceeEEEEEEEec Confidence 98 59999999999999999877776 No 36 >protein:vir:104858 Length: 729 # NCBI annotation: T4-like tail sheath protein # Family: family:all:661 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214361;genbank:gi:61806001;genbank:GeneID:3294378 Probab=98.36 E-value=1e-06 Score=53.38 Aligned_cols=335 Identities=8% Similarity=-0.025 Sum_probs=158.8 Q ss_pred Ccccccccc------------------cceEEEEEeeeccccccccccceEEEeeccccCcccceEEEe----------- Q lcl|NC_020841. 1 MAGSLTLPI------------------NMLVNVSIEYQAKLLSRDAFNRLLIVGSTAPNGRATDTGIYT----------- 51 (367) Q Consensus 1 ~~~~~~l~i------------------~~iv~V~i~~~~~~~~~~~fg~~li~~~~~~~~~~~~~~~yt----------- 51 (367) ++...+.-- ..++...+.+++.......-+.+..+ ..+.....++..-. T Consensus 292 ~~~~~~~~~d~~~~~~~d~~~~~~~~~g~vve~~~~~s~~~~~~~~~~~~~~~--~~vi~~~s~~~~~~~~~~~~~~~~~ 369 (729) T protein:vir:10 292 YVSTRGGKNDEIHVLVIDDKGTITGNSGTILEKHLSLSKAKDAEYSVGSSSYW--RDFLATNSKYIFGGGATSGITTTGY 369 (729) T ss_pred ccccccccccccceeeeccccccccCcccceeeeeeeeecccccccccccccc--ceeeccccceeeecccccccccccc Confidence 000000000 01111111111000000000000000 00000000000000 Q ss_pred cHHHHHhccCCCcHHHHHHHHHhccCcccceEEEEeccC---------------ccchHHHHHHHHhccc-CcEEEEEE- Q lcl|NC_020841. 52 SIDGVKLDYGVEADEYKIAQKYFSQNPKPRDLMIATVTA---------------LTDPLASIGEVAAKTL-GFYAFCFA- 114 (367) Q Consensus 52 s~~~v~~df~~~s~~ykaA~~~F~Q~p~p~~v~v~~~~~---------------~~t~~~~l~~~~~~~~-~w~~~~~~- 114 (367) ........-......+-+....+...........++.+. .......+.++.+... ........ T Consensus 370 ~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~g~~~l~~~~~~~~~~~~~~~ 449 (729) T protein:vir:10 370 SVSSTNTLDTDSGWDQNAEGVNFGASGVATLTLAGGTNYGDKTDLTTSGALSSGVDDIISGYTLFENTEEIEVDFILMGA 449 (729) T ss_pred cccccceeccccccccccccccccccceeEEEeecccccccccccccccccccchhHHHHHHHHhhcccccccceeeecC Confidence 000000000000000000000011110000000111110 0112234444443211 11111111 Q ss_pred ----ecCCHHHHHHHHHHhhccCcEEEEEEeC----------------c--hhhhHHHHHHHHhccccceeecCC----- Q lcl|NC_020841. 115 ----SEVAAADIQGLAEWAQSNNRMFMTVMTD----------------D--TEAVTTGNALKELGQYHYCITYHE----- 167 (367) Q Consensus 115 ----~~~~~~~~~ala~~~ea~~~~~~~~~~d----------------~--~~~~~~~~~~~~~~~~~~~~~~~~----- 167 (367) .........++..+++.....+.+.... . +...........+........+++ T Consensus 450 ~~~~~~~~~~v~~a~~~~~~~~~~~~a~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~ 529 (729) T protein:vir:10 450 AHHPKEQSQAVAEKVTAVAEARKDAVAFISPYRQAFLNDTVSGTVTVSNIDQTTENVVGFYAPLSSSTYSVFDSGYKYMF 529 (729) T ss_pred CCCCccchHHHHHHHHHHHHhcCCeEEEecccccccccccccccccccccchhhHHHHHHHhhccCCceEEEEcCeeEEe Confidence 1123445567777788766555443211 0 001111111222222222222221 Q ss_pred --------chhHHHHHHHHHHHcccccCcceeeeeeeecCccc-----ccCCCHHHHHHHHhCCceEEEEeeccccceEE Q lcl|NC_020841. 168 --------DYATVGAVAGMALDQRYDKTDGVKTLHLKSLVSVV-----STDISQTQAASLKAACINYYSDYGNPDNSLPI 234 (367) Q Consensus 168 --------~~~~~~~~~~~~~~~~~~~~~g~~t~~~k~l~Gv~-----~~~~t~t~~~~l~~~~~n~y~~~~~~~~~~~~ 234 (367) ..+..+.++|..+..+..+-+ ......|.+.||. ...+++.|.+.|..+|+|++..+.+.+ + . T Consensus 530 d~~~~~~~~~p~s~~~aGl~a~~d~~~g~-~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G--~-~ 605 (729) T protein:vir:10 530 DRFNNTFRYVPLNGDIAGTCARTDIEQFP-WFSPAGTARGPILNSVKLVYNPGKKQRDILYSNRINPVILSPGAG--I-I 605 (729) T ss_pred cccCCceEEechhHHHHHHHHHhhccCCc-EEccCCccccceecccceeeecChhhHhhhhhCCceEEEEecCCe--E-E Confidence 123345667777766544421 2333444444443 135788999999999999999886543 2 4 Q ss_pred EecCEeeCC-----chhhHHHHHHHHHHHHHHHHHHHHHhcCCCCcCHhHHHHHHHHHHHHHHHHHhcCceecccccCcc Q lcl|NC_020841. 235 FANGHAGGG-----KFFDFVMGFDWLRNVIETNVFNGQRLRRLTPQTDRGMMMIKADIVNGLEEAVKAGLVAAGTWNGAA 309 (367) Q Consensus 235 ~~~G~~~~G-----~~iD~~~~~dwl~~~lq~~l~~ll~~~~kipy~~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~ 309 (367) +..++++.+ .||-+.+-.+|++..|+..++..+-. |.+..=...|+..|+.-|+..+++|.|. T Consensus 606 ~wG~rT~~~~d~~~~~i~vrR~~~~i~~si~~~~~~~v~e----pn~~~~~~~i~~~i~~~L~~l~~~g~l~-------- 673 (729) T protein:vir:10 606 LFGDKTGFGKSSAFDRINVRRLFIYLEDAISAAAKDQLFE----FNDELTRTNFVNIVEPFLRDVQAKRGIF-------- 673 (729) T ss_pred EEcceecCCCCcccceeehhhhHHHHHHHHHHHHHHhhcC----CCCHHHHHHHHHHHHHHHHHHHhcccee-------- Confidence 566666533 47889999999999999988876543 5578888999999999999999999984 Q ss_pred ccccccccccccceeEEcCchHhCCHHHHhccccCCeEEEEEECceEEEEEEEEEecC Q lcl|NC_020841. 310 LGEIETYDYLPTGYYVYNESIRDQAQVIREQRIAPPFIILVKGAGAIHDTDITLIPEA 367 (367) Q Consensus 310 ~g~~~~~~~~~~gy~v~~~~~~~~~~~dr~~R~~~~~~~~~~~agaIh~v~i~~~v~~ 367 (367) ||.|.+. .++.+++|+.+.+.. +.+.+.....+++|.+++.-.. T Consensus 674 ------------g~~v~~d-~~~nt~~~i~~G~~~-~~v~~~p~~p~e~i~~~~~~~~ 717 (729) T protein:vir:10 674 ------------DFVVICD-ETNNTAAVIDSNEFV-ADIFIKPARSINFIGLTFVATR 717 (729) T ss_pred ------------eeEEEEc-CCCCCHHHhhCCeEE-EEEEEEecCCccEEEEEEEEee Confidence 4778775 778899999999884 9999999999999999876665 No 37 >protein:vir:79798 Length: 717 # NCBI annotation: tail sheath subunit # Family: family:all:12069 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429630;genbank:gi:156564120;genbank:GeneID:5525563 Probab=98.34 E-value=4.8e-07 Score=55.22 Aligned_cols=314 Identities=9% Similarity=0.011 Sum_probs=147.6 Q ss_pred Cccccccccc----ceEEEEEeeeccccccccccceEEEeeccccCcccceEEEecHHHHHhccCCCcHH-HHHHHHHhc Q lcl|NC_020841. 1 MAGSLTLPIN----MLVNVSIEYQAKLLSRDAFNRLLIVGSTAPNGRATDTGIYTSIDGVKLDYGVEADE-YKIAQKYFS 75 (367) Q Consensus 1 ~~~~~~l~i~----~iv~V~i~~~~~~~~~~~fg~~li~~~~~~~~~~~~~~~yts~~~v~~df~~~s~~-ykaA~~~F~ 75 (367) -.|.-.-|++ +-+++.. -...+....++.+.+++..+..... + .-..+.+-...|+..+.. -.....++. T Consensus 335 ~g~~~~~pl~~ts~dy~~~~~--~vdgI~~~~~~~V~~~g~~s~a~a~--~-~~g~~s~d~a~f~Gg~dgl~~~~ee~Y~ 409 (717) T protein:vir:79 335 RGMISEDPLVFKSGDYTNFKM--LVDAINNHPFNNVVRARTKPEFEAT--F-TSTLQAAADAKFSGGKDELSLDKEEMYK 409 (717) T ss_pred CcceeccccccccCceeeeee--eecccccCchhheeeeeccccccee--e-eecccCchhhccCCCccccccchhhhhc Confidence 1111111211 2111111 1122222234444444432211000 0 000000001111111100 000000000 Q ss_pred cCcccceEEEEec-cCcc--chHHHHHHHHhcccCcEEEEEEec---------CCHHHHHHHHHHhhccCcE----EEEE Q lcl|NC_020841. 76 QNPKPRDLMIATV-TALT--DPLASIGEVAAKTLGFYAFCFASE---------VAAADIQGLAEWAQSNNRM----FMTV 139 (367) Q Consensus 76 Q~p~p~~v~v~~~-~~~~--t~~~~l~~~~~~~~~w~~~~~~~~---------~~~~~~~ala~~~ea~~~~----~~~~ 139 (367) -.++. .... +...+...++...-++ ++..+ ..++...+++.++++.+.. +.+. T Consensus 410 --------~lGgk~~d~g~lt~~aays~LE~~dVDl---Vil~ga~adtt~ga~~d~va~alad~caalSal~r~ai~VI 478 (717) T protein:vir:79 410 --------RLGGEKNEEGFVTKQGAYQYLENYEVDY---VIPLGVHADTKLIGKYDDFAYQLALACAVMSHYNSVTIGII 478 (717) T ss_pred --------cccccccccccccchhhhhhcCcceeEE---EEecCccccccccchhhhHHHHHHHHHHHhhhccccceeee Confidence 00000 0000 0111222222211111 11111 0122344555565544211 1111 Q ss_pred Ee---CchhhhHHHH---HHHHh------------------------cccccee------ecCC-----chhHHHHHHHH Q lcl|NC_020841. 140 MT---DDTEAVTTGN---ALKEL------------------------GQYHYCI------TYHE-----DYATVGAVAGM 178 (367) Q Consensus 140 ~~---d~~~~~~~~~---~~~~~------------------------~~~~~~~------~~~~-----~~~~~~~~~~~ 178 (367) .. .+........ .+... +.+...+ .... ....++.+++. T Consensus 479 ~l~sp~D~~~AtVe~~~~kLs~~Aaa~~~~d~~~a~a~~~~~~~idis~y~~vv~~~~~iv~~~~~~~~~~p~AG~vAGl 558 (717) T protein:vir:79 479 PTTTPSDISLAGVEEHVKKLENYANEFYMRDRFGNIIFDADRNKIDLGQFIEVVAGPDFIVRNTRLGQMASTPDASYIGM 558 (717) T ss_pred ccccccccchhhHHHHHHHHHhhhhhhhhhcchhccccccccccccccceeeeeecceeEEEcCCCceeecCHHHHHHHH Confidence 00 0000000000 00000 0000000 0000 01123455555 Q ss_pred HHHcccccCcceeeeeeeecCccc--ccCCCHHHHHHHHhCCceEEEEeeccccceEEEecCEeeCC---c--hhhHHHH Q lcl|NC_020841. 179 ALDQRYDKTDGVKTLHLKSLVSVV--STDISQTQAASLKAACINYYSDYGNPDNSLPIFANGHAGGG---K--FFDFVMG 251 (367) Q Consensus 179 ~~~~~~~~~~g~~t~~~k~l~Gv~--~~~~t~t~~~~l~~~~~n~y~~~~~~~~~~~~~~~G~~~~G---~--~iD~~~~ 251 (367) .+...+...| .+|.+.|+. ...++..|++.|..+|+|++..+.+.+ + .+..+.++++ + ||-+.+- T Consensus 559 dA~rGVwkSP-----ANk~I~GVvgLa~~lT~sE~d~Ln~aGIntIr~~~GrG--i-rVWGaRTtasd~sdWryInVRRl 630 (717) T protein:vir:79 559 VSQLKTQSAP-----TNKPLPSVTALRYTYSANQLNRLTKARFATFKYKQDGS--I-GVVDAPTSAHAGSDYTRLSTARI 630 (717) T ss_pred HhcCCccccc-----ccceecccccCcccCCHHHHHHHhhCCeEEEEEeCCce--E-EEEeeeecCCCCcccceeehhhh Confidence 5554444433 356677766 446899999999999999998776543 3 4567777654 2 6889999 Q ss_pred HHHHHHHHHHHHHHHHHhcCCCCcCHhHHHHHHHHHHHHHHHHHhcCceecccccCccccccccccccccceeEEcCchH Q lcl|NC_020841. 252 FDWLRNVIETNVFNGQRLRRLTPQTDRGMMMIKADIVNGLEEAVKAGLVAAGTWNGAALGEIETYDYLPTGYYVYNESIR 331 (367) Q Consensus 252 ~dwl~~~lq~~l~~ll~~~~kipy~~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~g~~~~~~~~~~gy~v~~~~~~ 331 (367) .|++...|+..+..++- + |-+..+...|++.|+.-|++..+.|.|.- |.+.+ T Consensus 631 ~D~Ie~sIr~al~~yVg---E-PNd~~tr~~Ik~sI~afL~~L~r~GAI~G--------------------ykvdv---- 682 (717) T protein:vir:79 631 VKEAVNAVREVADPFIG---E-PNDTGNRNALTAAVDKRLSKMIENKALLG--------------------FDFRL---- 682 (717) T ss_pred HHHHHHHHHHHHHHhcc---c-cCCHHHHHHHHHHHHHHHHHHHhcCceec--------------------ceeeE---- Confidence 99999999888876432 2 66888999999999999999999999952 33321 Q ss_pred hCCHHHHhccccCCeEEEEEECceEEEEEEEEEecC Q lcl|NC_020841. 332 DQAQVIREQRIAPPFIILVKGAGAIHDTDITLIPEA 367 (367) Q Consensus 332 ~~~~~dr~~R~~~~~~~~~~~agaIh~v~i~~~v~~ 367 (367) .++++|..+.+. -+.+.+.....++.|.|++.++| T Consensus 683 tnT~~di~~G~l-~V~I~vaPv~PaEfI~ititITA 717 (717) T protein:vir:79 683 VVTPQQELLGEG-SIELSLEAPNELRRLTTIVSLSA 717 (717) T ss_pred ecChhHhhCCEE-EEEEEEEecCcccEEEEEEEEeC Confidence 356777766544 38899999999999999999999 No 38 >protein:vir:78986 Length: 436 # NCBI annotation: putative sheath tail protein # Family: family:all:632 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110731;genbank:gi:134287348;genbank:GeneID:4955160 Probab=98.31 E-value=2.2e-07 Score=57.12 Aligned_cols=307 Identities=13% Similarity=0.029 Sum_probs=157.1 Q ss_pred CcccccccccceEEEEEeeeccccccccccceEEEeeccccCcccceEEEecHHHHHh-ccCCCcHHHHHHHHHhccCcc Q lcl|NC_020841. 1 MAGSLTLPINMLVNVSIEYQAKLLSRDAFNRLLIVGSTAPNGRATDTGIYTSIDGVKL-DYGVEADEYKIAQKYFSQNPK 79 (367) Q Consensus 1 ~~~~~~l~i~~iv~V~i~~~~~~~~~~~fg~~li~~~~~~~~~~~~~~~yts~~~v~~-df~~~s~~ykaA~~~F~Q~p~ 79 (367) .|-..+.+=++ +.|+|.- .......|...+.++...+ + -.....++++.. +|-.-... .-... .. T Consensus 107 ~Aky~g~~gn~-i~v~v~~--~~~d~~~~dv~~~~g~~~~-d----~~~~~~~~~l~~n~~V~~~~~-----g~la~-~a 172 (436) T protein:vir:78 107 TARCSGIRGND-LKVIVTT--NIDDNAKFDVVTLLDNKKV-D----TQIAKVITELQDNDYVTWKKE-----ATLEA-TA 172 (436) T ss_pred eeecCCCCCcE-EEEEecc--cccccCceEEEEEecchhh-h----hhhHHHHhhccCCceEEEEec-----ccccc-cc Confidence 11111111011 1122211 1111122221122211110 0 001111222111 11000000 00000 00 Q ss_pred cceEEEEec----cCccchHHHHHHHHhcccCcEEEEEEecCCHHHHHHHHHHhhc----cCcEEEEEEeCchhhh--HH Q lcl|NC_020841. 80 PRDLMIATV----TALTDPLASIGEVAAKTLGFYAFCFASEVAAADIQGLAEWAQS----NNRMFMTVMTDDTEAV--TT 149 (367) Q Consensus 80 p~~v~v~~~----~~~~t~~~~l~~~~~~~~~w~~~~~~~~~~~~~~~ala~~~ea----~~~~~~~~~~d~~~~~--~~ 149 (367) -..+ .++. .+.+++.+++.+++... |.++++. ..+.+.+..++.|+.. .++++-.+........ -+ T Consensus 173 ~~~L-tGG~dG~~~T~~dy~~al~~le~~~--fn~l~~~-~~d~~~~~~~~a~ikr~re~~g~~~~aV~~~~~~~d~EgI 248 (436) T protein:vir:78 173 GLTF-TNGTNGEAVTGTEYQAFLDKIESYS--FNALGCL-ATTAEIKSLFVEFTKRMRDKVGAKFQTVLYKKNDADYEGV 248 (436) T ss_pred eeee-eccccccccchHHHHHHHHHHcccc--eeEEEec-CCChHHHHHHHHHHHHHHhhcCCeEEEEecCCCCCCCceE Confidence 0112 2221 13345677788777664 5555444 4577888999999984 3455544443321111 00 Q ss_pred HHHHHHhccccceeecCCchhHHHHHHHHHHHcccccCcceeeeeeeecCccc-c-cCCCHHHHHHHHhCCceEEEEeec Q lcl|NC_020841. 150 GNALKELGQYHYCITYHEDYATVGAVAGMALDQRYDKTDGVKTLHLKSLVSVV-S-TDISQTQAASLKAACINYYSDYGN 227 (367) Q Consensus 150 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~t~~~k~l~Gv~-~-~~~t~t~~~~l~~~~~n~y~~~~~ 227 (367) .+.... .. ...+.....+++++|..++...++ |+ -|+.++|+. . ..++.+|++.+..+|...+..-+ T Consensus 249 Inv~n~--v~---g~~~~~~~~~a~vAG~~Ag~~~~~---S~--T~~~~~~~~~v~~~~t~~e~~~ai~~G~lvl~~d~- 317 (436) T protein:vir:78 249 VSVENK--IK---DTGLLESSLIYWTTGAIAGCDINK---SN--TNKRYDGEFDVDVNYTQIHLEEALKTGKFIFHKVG- 317 (436) T ss_pred EEeecc--cC---CceechhHHHHHHHHHHhcCcccc---Cc--cceecCccccccccCCHHHHHHHHhCCeEEEEEeC- Confidence 000000 00 001223345566777777654444 33 377888774 3 46899999999999987775432 Q ss_pred cccceEEEecCEee--------CCc--hhhHHHHHHHHHHHHHHHHHHHHHhcCCCCcCHhHHHHHHHHHHHHHHHHHhc Q lcl|NC_020841. 228 PDNSLPIFANGHAG--------GGK--FFDFVMGFDWLRNVIETNVFNGQRLRRLTPQTDRGMMMIKADIVNGLEEAVKA 297 (367) Q Consensus 228 ~~~~~~~~~~G~~~--------~G~--~iD~~~~~dwl~~~lq~~l~~ll~~~~kipy~~~G~~~l~~~v~~vl~~a~~~ 297 (367) + ...+.+|..+ +.. -|-.++..|-+...++..+-+.++ .|+|=+..|..++.+.+..-|++..+. T Consensus 318 --~-~v~I~~~VNTltt~~~~k~~~~~kI~vir~~D~i~~di~~~~~~~yi--GKv~N~~dgr~~l~~~i~~yl~~L~~~ 392 (436) T protein:vir:78 318 --D-EVHVLEDINTFVSFTDEKNDDFSSNQSVRVLDQIANDIATLFNTKYL--GEVPNDKSGRISFWNDVVKHHEQLQNM 392 (436) T ss_pred --C-eEEEEEccccceecCCCCCcchhhhhHHHHHHHHHHHHHHHhhhccc--cccCCCHHHHHHHHHHHHHHHHHHHhC Confidence 2 2356666633 112 377888888888888776555444 589999999999999999999999999 Q ss_pred CceecccccCccccccccccccccceeEEcCchHhCCHHHHhccccCCeEEEEEECceEEEEEEEEEec Q lcl|NC_020841. 298 GLVAAGTWNGAALGEIETYDYLPTGYYVYNESIRDQAQVIREQRIAPPFIILVKGAGAIHDTDITLIPE 366 (367) Q Consensus 298 G~I~~g~~~~~~~g~~~~~~~~~~gy~v~~~~~~~~~~~dr~~R~~~~~~~~~~~agaIh~v~i~~~v~ 366 (367) |.|.+.... + +.+ .+.+ .+.+--+++.++.-.||..+.++++|. T Consensus 393 g~I~~f~~~--D---------------v~v------~~~~--~~~~v~v~~~v~pvdamekiy~ti~v~ 436 (436) T protein:vir:78 393 RAIEDFKAD--D---------------VSV------EPGS--DKKTVVVSDAVKVISAMSKLYMTVSVS 436 (436) T ss_pred CcccCCCCc--c---------------eEE------eecC--CCCEEEEEEEEEEEEeeeeEEEEEEEC Confidence 999642210 0 111 1111 122334888899999999999999999 No 39 >protein:vir:63742 Length: 562 # NCBI annotation: Tsh # Family: family:all:2449 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547629;genbank:GeneID:3783475 Probab=98.31 E-value=1.3e-06 Score=52.82 Aligned_cols=322 Identities=14% Similarity=0.002 Sum_probs=152.1 Q ss_pred Cccccccccc--ceEEEEEee-------------------eccccccccccceEEEeeccccCcccceEEEecHHHH--- Q lcl|NC_020841. 1 MAGSLTLPIN--MLVNVSIEY-------------------QAKLLSRDAFNRLLIVGSTAPNGRATDTGIYTSIDGV--- 56 (367) Q Consensus 1 ~~~~~~l~i~--~iv~V~i~~-------------------~~~~~~~~~fg~~li~~~~~~~~~~~~~~~yts~~~v--- 56 (367) =|.+..+... .+....+.. ...-...+++...+- ..+......+++...-+ T Consensus 177 ~a~~l~~~~g~~~v~~~~L~~g~~~~~~~l~~~in~~~~~~aky~~~~gn~i~~~-----~~d~~~~~~vkt~~~~v~t~ 251 (562) T protein:vir:63 177 KATKLTLKAGDKTVKEYDLGSGAYAETNVLISDINNLPDFEAKFFPIGDKNLTTD-----NFDAQIDVDIKTKEAYVKAV 251 (562) T ss_pred eEEEEEeecCCcceeEEEecCCccchhHHHHHhhccccceEEEeeccCCceeeee-----ccccccccchhhhhhhhhhh Confidence 0000000000 000000000 000011111111110 00011111111110000 Q ss_pred Hhcc--CCCcHHHHHHHHHhccC----cccceEEEEeccC--ccchHHHHHHHHhcccCcEEEEEEecCCHHHHHHHHHH Q lcl|NC_020841. 57 KLDY--GVEADEYKIAQKYFSQN----PKPRDLMIATVTA--LTDPLASIGEVAAKTLGFYAFCFASEVAAADIQGLAEW 128 (367) Q Consensus 57 ~~df--~~~s~~ykaA~~~F~Q~----p~p~~v~v~~~~~--~~t~~~~l~~~~~~~~~w~~~~~~~~~~~~~~~ala~~ 128 (367) ..|- ......|-.+. +.-. .-+..-..++.+. ..+..+++++++.. +|+.++ +...+...+.++..| T Consensus 252 ~~d~~~~~~~~~~v~~~--~~~~~~la~~~~~~LtGG~dGt~~~~~~~al~ale~~--~~~~i~-~~t~d~av~~~l~a~ 326 (562) T protein:vir:63 252 GGDIEKQTAYNGYVDFE--FDRSKEIANFPLTKLTGGDNGTIPESWADKFSYFANE--GGYYLV-PLTSKQAVHAEALQF 326 (562) T ss_pred hhhhhhcccccceeeee--eccccceecccceeeecCCCCCchhhHHHHHHHHHhC--CcEEEE-ecCCCHHHHHHHHHH Confidence 0000 00000000000 0000 0011222232221 22456677777754 455543 334455566779999 Q ss_pred hhcc---Cc-EEEEEEeCc-hhhhHHHHHHHHhccccceeecCC--------------chhHHHHHHHHHHHcccccCcc Q lcl|NC_020841. 129 AQSN---NR-MFMTVMTDD-TEAVTTGNALKELGQYHYCITYHE--------------DYATVGAVAGMALDQRYDKTDG 189 (367) Q Consensus 129 ~ea~---~~-~~~~~~~d~-~~~~~~~~~~~~~~~~~~~~~~~~--------------~~~~~~~~~~~~~~~~~~~~~g 189 (367) ++.. .+ +..++.... ..........+.+++.+.....++ ....+++++|..++..... T Consensus 327 vkr~~~~g~~~~aVlg~~~~~~~~~~~~~a~~~n~ervv~v~~~~~~~~~~~~~~~~~~~~~aa~vAGl~A~~~~~~--- 403 (562) T protein:vir:63 327 VRDCSYNGNPMRVFVGGGIGESMEQLFTRAIGLQNERAGLIGFSGTVKMDDGRSLKMPGYMFAAQVAGLTCGLEIGE--- 403 (562) T ss_pred HHHHHhCCCcEEEEecCCCCCCHHHHHHHhhhcCCCcEEEEecCeeEECCCCceeeechhHHHHHHHHHhhcCchhc--- Confidence 9642 33 333332221 122233334445555544333221 1223456666666554443 Q ss_pred eeeeeeeecCccc-ccCCCHHHHHHHHhCCceEEEEeeccccceEEEecCEeeC-----Cc--hhhHHHHHHHHHHHHHH Q lcl|NC_020841. 190 VKTLHLKSLVSVV-STDISQTQAASLKAACINYYSDYGNPDNSLPIFANGHAGG-----GK--FFDFVMGFDWLRNVIET 261 (367) Q Consensus 190 ~~t~~~k~l~Gv~-~~~~t~t~~~~l~~~~~n~y~~~~~~~~~~~~~~~G~~~~-----G~--~iD~~~~~dwl~~~lq~ 261 (367) +.| ||.++++. ...++.+|++.+..+|.+.+....+......-.-++.++- -. +|-.++-.|.+...++. T Consensus 404 SlT--~~~i~~~~v~~~~t~~e~~~li~~Gv~~l~~~~~~~v~~~~iv~~itT~t~~~~~~~~ki~viRv~D~i~~dir~ 481 (562) T protein:vir:63 404 AIT--FKNIAIETLDTIYEGSQLDQLNESGIITAEFVRNRAVTNFRIVDDVTTFNDKTDPVKSEIGVGEANDFLVSELKI 481 (562) T ss_pred Ccc--ceeeccccccccCCHHHHHHHHhCCeEEEEEecCCcEEEEEeeccceecCCCCCchhhhhhhhHHHHHHHHHHHH Confidence 333 34444332 2478999999999999999876544322211122444332 12 47789999999998888 Q ss_pred HHHHHHHhcCCCCcCHhHHHHHHHHHHHHHHHHHhcCceecccccCccccccccccccccceeEEcCchHhCCHHHHhcc Q lcl|NC_020841. 262 NVFNGQRLRRLTPQTDRGMMMIKADIVNGLEEAVKAGLVAAGTWNGAALGEIETYDYLPTGYYVYNESIRDQAQVIREQR 341 (367) Q Consensus 262 ~l~~ll~~~~kipy~~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~g~~~~~~~~~~gy~v~~~~~~~~~~~dr~~R 341 (367) .+-+.+.- | |=++.|...|++.+...|++..+.|.|..... .. ..+. + ..| + T Consensus 482 ~~~~~yiG--k-~Nn~~~r~~v~~~i~~~L~~l~~~gaI~~~~~--~d-------------v~v~---~----~~d---~ 533 (562) T protein:vir:63 482 SLDNEYIG--T-KIIDTSASLVKNFVQSFLDRKKLAKEIQDYSP--EE-------------VQVV---I----EGD---V 533 (562) T ss_pred HHHhcCCc--c-ccChHHHHHHHHHHHHHHHHHHhCCcccCCCc--cc-------------eEEE---e----cCC---E Confidence 87766654 3 45788999999999999999999999953210 00 0111 0 112 2 Q ss_pred ccCCeEEEEEECceEEEEEEEEEecC Q lcl|NC_020841. 342 IAPPFIILVKGAGAIHDTDITLIPEA 367 (367) Q Consensus 342 ~~~~~~~~~~~agaIh~v~i~~~v~~ 367 (367) . -+.+.+...-++|+|.+++.+.. T Consensus 534 ~--~v~~~v~pv~~mekIy~ti~~~~ 557 (562) T protein:vir:63 534 A--RISLTVFPIRSMKKIEVSLVYRQ 557 (562) T ss_pred E--EEEEEEEEcccceEEEEEEEEee Confidence 2 36788899999999999998766 No 40 >protein:vir:80488 Length: 562 # NCBI annotation: Tsh # Family: family:all:2449 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468473;genbank:gi:157325048;genbank:GeneID:5601446 Probab=98.20 E-value=2.7e-06 Score=51.10 Aligned_cols=324 Identities=11% Similarity=-0.010 Sum_probs=154.4 Q ss_pred Ccccccccc--cceEEEEEeee-------------------ccccccccccceEEEeeccccCcc-cceEEEec--HHHH Q lcl|NC_020841. 1 MAGSLTLPI--NMLVNVSIEYQ-------------------AKLLSRDAFNRLLIVGSTAPNGRA-TDTGIYTS--IDGV 56 (367) Q Consensus 1 ~~~~~~l~i--~~iv~V~i~~~-------------------~~~~~~~~fg~~li~~~~~~~~~~-~~~~~yts--~~~v 56 (367) =|.+.++.. +.+....+.-. .+-...++....+-.... ..... .....|.. ..++ T Consensus 177 ~a~~l~~~~g~~~v~~~~l~~g~~~~~~~l~~~i~~~~~~tAky~g~~~n~i~~~~~d~-~~~~~~kt~~~~v~~~~~d~ 255 (562) T protein:vir:80 177 KATKLTLKAGDKTVKEYDLGSGAYAETNVLISDINNLPDFEAKFFPIGDKNLTTDNFDA-QIDVDIKTKEAYVKAVGGDI 255 (562) T ss_pred eEEEEEEecCCcceeEEEeCCCccchhhhhhhhhccccceEEEecccCCceeeeccccc-chhhhcccceeeeeehhhhh Confidence 011111110 00111111000 000001111100000000 00000 00000100 0000 Q ss_pred HhccCCCcHHHHHHHHHhccCcc----cceEEEEecc--CccchHHHHHHHHhcccCcEEEEEEecCCHHHHHHHHHHhh Q lcl|NC_020841. 57 KLDYGVEADEYKIAQKYFSQNPK----PRDLMIATVT--ALTDPLASIGEVAAKTLGFYAFCFASEVAAADIQGLAEWAQ 130 (367) Q Consensus 57 ~~df~~~s~~ykaA~~~F~Q~p~----p~~v~v~~~~--~~~t~~~~l~~~~~~~~~w~~~~~~~~~~~~~~~ala~~~e 130 (367) .. ......|..+. +.-.-. +..-..|+.+ ...+..+++++++.. +|++++ +...+.+.+..+..|++ T Consensus 256 ~~--~~~~n~~v~~~--~~~~~~la~~~~~~LtGG~dG~~~~~~~dal~~Le~~--~~~~i~-~~t~d~ai~~~~~a~vk 328 (562) T protein:vir:80 256 EK--QTAYNGYVEFE--FDRSKEIANFPLTKLTGGDNGTIPESWADKFSYFANE--GGYYLV-PLTSKQAVHAEALQFVR 328 (562) T ss_pred hh--cccccceEEEE--eccCccccccceeeeeCCCCCCccccHHHHHHHHHhC--CcEEEE-ecCCChHHHHHHHHHHH Confidence 00 00000000000 000000 1111122222 123567788888764 555554 33345566788999996 Q ss_pred cc---CcEEEEEEe-Cc-hhhhHHHHHHHHhccccceeecCC--------------chhHHHHHHHHHHHcccccCccee Q lcl|NC_020841. 131 SN---NRMFMTVMT-DD-TEAVTTGNALKELGQYHYCITYHE--------------DYATVGAVAGMALDQRYDKTDGVK 191 (367) Q Consensus 131 a~---~~~~~~~~~-d~-~~~~~~~~~~~~~~~~~~~~~~~~--------------~~~~~~~~~~~~~~~~~~~~~g~~ 191 (367) .. +++...+.. .. ..........+.+++.+.....++ ....+++++|..++..... + T Consensus 329 r~r~~g~~~~aVvg~~~~~~~~~~~~~a~~~n~e~vv~v~~~~~~~~~~~~~~~~~~~~~aa~vAGl~Ag~~~~~---S- 404 (562) T protein:vir:80 329 DCSYNGNPMRVFVGGGIGESMEQLFTRAIGLQNERAGLIGFSGTVKMDDGRSLKMPGYMFAAQVAGLTCGLEIGE---A- 404 (562) T ss_pred HHHhCCCeEEEEecCCCCCCHHHHHHHhhhcCCCeEEEEecCeeEECCCCceeeechhHHHHHHHHHHhcCcccc---C- Confidence 43 343433332 21 122233334445555544333221 1223556666666655443 3 Q ss_pred eeeeeecCcccc-cCCCHHHHHHHHhCCceEEEEeeccccceEEEecCEeeC-----Cc--hhhHHHHHHHHHHHHHHHH Q lcl|NC_020841. 192 TLHLKSLVSVVS-TDISQTQAASLKAACINYYSDYGNPDNSLPIFANGHAGG-----GK--FFDFVMGFDWLRNVIETNV 263 (367) Q Consensus 192 t~~~k~l~Gv~~-~~~t~t~~~~l~~~~~n~y~~~~~~~~~~~~~~~G~~~~-----G~--~iD~~~~~dwl~~~lq~~l 263 (367) ..||.++++.. ..++.+|++.+..+|.+.+....+........-++.++- -. +|-.++-.|.+...++..+ T Consensus 405 -~T~~~i~~~~v~~~lt~~e~~~li~~G~l~l~~~~~~~v~~~riv~~itT~t~~~~~~~~ki~viRv~D~i~~dir~~~ 483 (562) T protein:vir:80 405 -ITFKNIAIETLDTIYEGSQLDQLNESGIITAEFVRNRAVTNFRIVDDVTTFNDKTDPVKSEIGVGEANDFLVSELKISL 483 (562) T ss_pred -ccceeeccccccccCCHHHHHHHHhCCeEEEEEecCCcEEEEEeeccceeccCCCCchhhhhhhhHHHHHHHHHHHHHH Confidence 33455655432 368999999999999999876544322221123444432 12 4778889999998888887 Q ss_pred HHHHHhcCCCCcCHhHHHHHHHHHHHHHHHHHhcCceecccccCccccccccccccccceeEEcCchHhCCHHHHhcccc Q lcl|NC_020841. 264 FNGQRLRRLTPQTDRGMMMIKADIVNGLEEAVKAGLVAAGTWNGAALGEIETYDYLPTGYYVYNESIRDQAQVIREQRIA 343 (367) Q Consensus 264 ~~ll~~~~kipy~~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~g~~~~~~~~~~gy~v~~~~~~~~~~~dr~~R~~ 343 (367) -+.++-. |=++.|...|++.+...|++..+.|.|..... .. +.+. ...|+ . T Consensus 484 ~~~yIGk---~Nn~~~r~~v~~~i~~~L~~l~~~gaI~~~~~--~d---------------v~v~-----~~~d~---~- 534 (562) T protein:vir:80 484 DNEYIGT---KIIDTSASLVKNFVQSFLDRKKLAKEIQDYSP--EE---------------VQVV-----IEGDI---A- 534 (562) T ss_pred HhcCCcc---ccChHHHHHHHHHHHHHHHHHHhCCcccCCCc--cc---------------eEEE-----ecCCE---E- Confidence 7766543 34788999999999999999999999953210 00 1111 12222 2 Q ss_pred CCeEEEEEECceEEEEEEEEEecC Q lcl|NC_020841. 344 PPFIILVKGAGAIHDTDITLIPEA 367 (367) Q Consensus 344 ~~~~~~~~~agaIh~v~i~~~v~~ 367 (367) -+.+.+...-++++|.+++.+.. T Consensus 535 -~v~~~v~Pv~~mekIy~ti~~~~ 557 (562) T protein:vir:80 535 -RISLTVFPIRSMKKIEVSLVYRQ 557 (562) T ss_pred -EEEEEEEEcccceEEEEEEEEEe Confidence 37889999999999999998766 No 41 >protein:vir:104477 Length: 749 # NCBI annotation: gp18 # Family: family:all:661 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214663;genbank:gi:61806304;genbank:GeneID:3294532 Probab=98.16 E-value=3.4e-06 Score=50.57 Aligned_cols=334 Identities=12% Similarity=0.068 Sum_probs=160.2 Q ss_pred CcccccccccceEEEEEeeecccccccccc------------ceEE-EeeccccCcccceEEEecHHH--------HH-h Q lcl|NC_020841. 1 MAGSLTLPINMLVNVSIEYQAKLLSRDAFN------------RLLI-VGSTAPNGRATDTGIYTSIDG--------VK-L 58 (367) Q Consensus 1 ~~~~~~l~i~~iv~V~i~~~~~~~~~~~fg------------~~li-~~~~~~~~~~~~~~~yts~~~--------v~-~ 58 (367) -.++.+-....++...+.++........-+ ...+ .+.+..... . ...+..+. +. . T Consensus 331 ~~g~~~~~~g~v~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~v~~~~~~~~~~-~--~~~~~~~~~~~~~~~~~~~~ 407 (749) T protein:vir:10 331 IDGGVTGTVGALLERYIDVSKASDAKTSVGETNYYAEVIKQKSEFIYWAEHESTLY-A--ATSSASDGLFGQTAANRQFN 407 (749) T ss_pred CCCeeeecccceeeeeeeccccccccccccccchhhhhhccCCCEEEEEecccccc-c--ccccccccccccccccceee Confidence 000000000111111111111000000000 0011 111100000 0 00000000 00 0 Q ss_pred cc--CCCcHHHHHHHHHhccCcccce-EEEEecc-----------CccchHHHHHHHHhcccCcEEEEEEe--cC----C Q lcl|NC_020841. 59 DY--GVEADEYKIAQKYFSQNPKPRD-LMIATVT-----------ALTDPLASIGEVAAKTLGFYAFCFAS--EV----A 118 (367) Q Consensus 59 df--~~~s~~ykaA~~~F~Q~p~p~~-v~v~~~~-----------~~~t~~~~l~~~~~~~~~w~~~~~~~--~~----~ 118 (367) .+ ...+..|-.....+...+.... +.+.+.. ........+..+......-.-+++.. .. . T Consensus 408 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~d~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~li~~~~~~~~~~~ 487 (749) T protein:vir:10 408 LFRSAAGSVDYPAGVTTLGSKNNATYYYRLSGGVNYTVSAGQYTITNTDIGSAYELIGDPESQIVDFIISGPSGTSDANA 487 (749) T ss_pred ccccccccceeccccccccccCCcEEEEEccCCcccccccccccccchhHHHHHHHhhhhhhcccceEEEecCCCCcchh Confidence 00 0001111111111111111111 1111100 11122333444443332222222221 11 2 Q ss_pred HHHHHHHHHHhhccCcEEEEEEeCch-------h---hhHHHHHHHHhccccceeecCC-------------chhHHHHH Q lcl|NC_020841. 119 AADIQGLAEWAQSNNRMFMTVMTDDT-------E---AVTTGNALKELGQYHYCITYHE-------------DYATVGAV 175 (367) Q Consensus 119 ~~~~~ala~~~ea~~~~~~~~~~d~~-------~---~~~~~~~~~~~~~~~~~~~~~~-------------~~~~~~~~ 175 (367) .....++...++....++++...... . ........+..........+++ ..+..+.+ T Consensus 488 ~~v~~al~~~~~~~~~~~~~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~v 567 (749) T protein:vir:10 488 LAKITSLVNIAEERRDCMVFVSPRRGNVIGISNTTTITTNIVDFFKKLPSSSYMVFDSGYKYIYDKYNDVYRYIPCNGDT 567 (749) T ss_pred HHHHHHHHHHHhhcCCEEEEEcCCCCcccccccchhhhhHHHHHHhhccCceeEEEEccceeeeccccCceEEechHHHH Confidence 34566777788877776655432111 0 0111111111111122222221 12345667 Q ss_pred HHHHHHcccccCcceeeeeeeecC---ccc--ccCCCHHHHHHHHhCCceEEEEeeccccceEEEecCEeeCC-----ch Q lcl|NC_020841. 176 AGMALDQRYDKTDGVKTLHLKSLV---SVV--STDISQTQAASLKAACINYYSDYGNPDNSLPIFANGHAGGG-----KF 245 (367) Q Consensus 176 ~~~~~~~~~~~~~g~~t~~~k~l~---Gv~--~~~~t~t~~~~l~~~~~n~y~~~~~~~~~~~~~~~G~~~~G-----~~ 245 (367) +|..+..+..+-+ ......|.+. |+. ...+++.|.+.|..+|+|....+.+.+ ..+...+++.+ .| T Consensus 568 AGl~Ar~D~~~g~-~~SPan~~~~~i~g~~~~~~~~~~~e~~~Ln~~gIn~i~~~~g~G---~~~wG~rT~~s~d~~~~~ 643 (749) T protein:vir:10 568 AGLCLQTNEISEP-WFSPAGFQRGVLRNAIKLAYTPNKAQRDQLYANRVNPIVSFPGQG---VVLYGDKTALGFASAFDR 643 (749) T ss_pred HHHHHHhhccCCc-EECcCCceeeeeeccccceeecChhHHHhhhhCCceEEEEecCCe---EEEEcceecCCCCcccce Confidence 7777766544311 1111244433 332 235788999999999999999886543 25666676533 36 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHhcCCCCcCHhHHHHHHHHHHHHHHHHHhcCceecccccCccccccccccccccceeE Q lcl|NC_020841. 246 FDFVMGFDWLRNVIETNVFNGQRLRRLTPQTDRGMMMIKADIVNGLEEAVKAGLVAAGTWNGAALGEIETYDYLPTGYYV 325 (367) Q Consensus 246 iD~~~~~dwl~~~lq~~l~~ll~~~~kipy~~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~g~~~~~~~~~~gy~v 325 (367) |-+.+-.+|++..|+..+...+-. |.++.=...|+..|+.-|+..++.|.|. ||.| T Consensus 644 i~vRRl~~~ie~si~~~~~~~v~e----pn~~~l~~~i~~~i~~fL~~l~~~G~i~--------------------~f~V 699 (749) T protein:vir:10 644 INIRRLFLTVERVISTAAKAQLFE----QNDEAQRSLFINIVEPYLRDVQGRRGVV--------------------DFLV 699 (749) T ss_pred eehhhhHHHHHHHHHHHHHHhhcC----CCCHHHHHHHHHHHHHHHHHHHhcCCee--------------------eeEE Confidence 888899999999988887765543 5578888999999999999999999873 4788 Q ss_pred EcCchHhCCHHHHhccccCCeEEEEEECceEEEEEEEEEecC Q lcl|NC_020841. 326 YNESIRDQAQVIREQRIAPPFIILVKGAGAIHDTDITLIPEA 367 (367) Q Consensus 326 ~~~~~~~~~~~dr~~R~~~~~~~~~~~agaIh~v~i~~~v~~ 367 (367) .+. .+..+++++.+.+. -+.+.+.....++.|++++.-.+ T Consensus 700 ~~d-~~~Nt~~~i~~G~~-~~~i~~~P~~pae~I~~~~~~~~ 739 (749) T protein:vir:10 700 KCD-STNNTPEAVDRGEF-YAEVFLKPTRTINYVQLTFVATR 739 (749) T ss_pred EEc-CCCCCHHHhhCCEE-EEEEEEEecCCccEEEEEEEEee Confidence 876 77889999999888 59999999999999999877443 No 42 >protein:vir:6894 Length: 660 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861870;genbank:gi:32453661;genbank:GeneID:1494296 Probab=98.14 E-value=3.7e-06 Score=50.37 Aligned_cols=314 Identities=8% Similarity=-0.028 Sum_probs=148.0 Q ss_pred Cc-----------ccccccccceEEEEEeeecccccccccc-----------ceEEEeeccccCcccceEEEecHHHHHh Q lcl|NC_020841. 1 MA-----------GSLTLPINMLVNVSIEYQAKLLSRDAFN-----------RLLIVGSTAPNGRATDTGIYTSIDGVKL 58 (367) Q Consensus 1 ~~-----------~~~~l~i~~iv~V~i~~~~~~~~~~~fg-----------~~li~~~~~~~~~~~~~~~yts~~~v~~ 58 (367) .. .....+ ++...+.+...... ...|. ...+...........++.. ... T Consensus 257 ~~~~~~~~~~~~~~~~~~~-~~~~~~~v~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~-~~~----- 327 (660) T protein:vir:68 257 DGGTRYSTAKAIFGYGPQT-DDQYAIIVRRNDSV--VQSVVLSTKRGERDIYGSNIFIDDFFAKGASNYIF-ATA----- 327 (660) T ss_pred cccccccceeeEeeccccc-ccceeeeeecCCcc--eeeeeeecccccccccccceeeehhhccCcccEEE-Eee----- Confidence 00 000011 12222222111000 00000 0000000000000011100 000 Q ss_pred ccCCCcHHHHHHHHHhccCcccceEEEEeccC-----ccchHHHHHHHHhcccCcEEEEEEec---CCH----HHHHHHH Q lcl|NC_020841. 59 DYGVEADEYKIAQKYFSQNPKPRDLMIATVTA-----LTDPLASIGEVAAKTLGFYAFCFASE---VAA----ADIQGLA 126 (367) Q Consensus 59 df~~~s~~ykaA~~~F~Q~p~p~~v~v~~~~~-----~~t~~~~l~~~~~~~~~w~~~~~~~~---~~~----~~~~ala 126 (367) ...+. ......-..++.+. ..+...++..+.....-...+++... .+. .-+.++. T Consensus 328 ---~~~~~----------~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~v~~~l~ 394 (660) T protein:vir:68 328 ---QGWPK----------GFSGVIKLNGGLSSNETVEAGDLMEAWDLFADRESVNAQLFIAGSCAGESLEVASTVQKHVV 394 (660) T ss_pred ---cCCCc----------cccceeeeccccccccccccchhhhHHHHhhhhhccccceeeccccCCCchHHHHHHHHHHH Confidence 00000 00000001111110 01111222222211111111111111 111 1233444 Q ss_pred HHhhccCcEEEEEEe------Cch---hhhHHHHHHHH----------hccccceeecC------C------chhHHHHH Q lcl|NC_020841. 127 EWAQSNNRMFMTVMT------DDT---EAVTTGNALKE----------LGQYHYCITYH------E------DYATVGAV 175 (367) Q Consensus 127 ~~~ea~~~~~~~~~~------d~~---~~~~~~~~~~~----------~~~~~~~~~~~------~------~~~~~~~~ 175 (367) .+++....+|.++-. +.. ...++...... +...+..+.+. + ..+..+.+ T Consensus 395 ~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~sg~~ 474 (660) T protein:vir:68 395 AIGDSRQDCLVLCSPPRAAVVGIPVNRAVDNLVDWRTASGTYTDNNFNISSTYAAIDGNYKYQYDKYNDVNRWVPLAADI 474 (660) T ss_pred HHHHhhCCeEEEEcccceeEecCCCCCCHHHHHHHHhhcccccccccccCcceEEEEcCceEEecccCCceEEechhHHH Confidence 555554444433211 110 11111111110 01111111111 0 12345667 Q ss_pred HHHHHHcccccCcc-eeeeeeeecCccc-----ccCCCHHHHHHHHhCCceEEEEeeccccceEEEecCEeeCC-----c Q lcl|NC_020841. 176 AGMALDQRYDKTDG-VKTLHLKSLVSVV-----STDISQTQAASLKAACINYYSDYGNPDNSLPIFANGHAGGG-----K 244 (367) Q Consensus 176 ~~~~~~~~~~~~~g-~~t~~~k~l~Gv~-----~~~~t~t~~~~l~~~~~n~y~~~~~~~~~~~~~~~G~~~~G-----~ 244 (367) +|..+..+..+ | ......|.+.||. ...++++|.+.|..+++|+...+.+.+ ..+...+++++ . T Consensus 475 AGl~Ar~d~~~--g~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G---~~~wG~rT~~~~~s~~~ 549 (660) T protein:vir:68 475 AGLCARTDNIS--QPWMSPAGYNRGQILNVIKLAIETRQAQRDRLYQEAINPVTGTGGDG---YVLYGDKTATSVPSPFD 549 (660) T ss_pred HHHHHHHhccC--CcEEccCCeeeceeeccceeeecCChhHHHHHhhCCceEEEEecCCe---EEEEcceecCCCCcccc Confidence 77777665433 3 1222344444432 124789999999999999998886543 35677777655 2 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHhcCCCCcCHhHHHHHHHHHHHHHHHHHhcCceecccccCcccccccccccccccee Q lcl|NC_020841. 245 FFDFVMGFDWLRNVIETNVFNGQRLRRLTPQTDRGMMMIKADIVNGLEEAVKAGLVAAGTWNGAALGEIETYDYLPTGYY 324 (367) Q Consensus 245 ~iD~~~~~dwl~~~lq~~l~~ll~~~~kipy~~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~g~~~~~~~~~~gy~ 324 (367) ||-+.+-.+|++..|+..+...+-. |.+..=...|+..|+.-|+..+++|.|. ||. T Consensus 550 ~i~vrR~~~~i~~si~~~~~~~v~e----pn~~~~~~~i~~~i~~~L~~l~~~gal~--------------------gf~ 605 (660) T protein:vir:68 550 RINVRRLFNMVKTNIGSASKYRLFE----LNNAFTRSSFRTETSQYLQGIKALGGVY--------------------NFK 605 (660) T ss_pred eEehhhHHHHHHHHHHHHHHHhccC----CCCHHHHHHHHHHHHHHHHHHHhcCcee--------------------eeE Confidence 6778888899998888888765543 4477777899999999999999999984 377 Q ss_pred EEcCchHhCCHHHHhccccCCeEEEEEECceEEEEEEEEEecC Q lcl|NC_020841. 325 VYNESIRDQAQVIREQRIAPPFIILVKGAGAIHDTDITLIPEA 367 (367) Q Consensus 325 v~~~~~~~~~~~dr~~R~~~~~~~~~~~agaIh~v~i~~~v~~ 367 (367) |.. +.++.|++|+.+.+. -+.+.+.....++.|.+++.=.. T Consensus 606 V~~-d~~~nt~~~i~~G~~-~~~i~~~p~~pae~i~l~~~~~~ 646 (660) T protein:vir:68 606 VVC-DTTNNTPAVIDRNEF-VATFYLQPARSINYITLNFVATA 646 (660) T ss_pred EEE-ecCCCCHHHhhCCeE-EEEEEEEecCCcceEEEEEEEee Confidence 876 578899999999988 49999999999999999876443 No 43 >protein:vir:103456 Length: 659 # NCBI annotation: tail sheath monomer # Family: family:all:661 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803108;genbank:gi:116326388;genbank:GeneID:4405485 Probab=98.03 E-value=6.4e-06 Score=49.06 Aligned_cols=332 Identities=9% Similarity=0.012 Sum_probs=159.8 Q ss_pred CcccccccccceEEEEEeeeccccccccccc-eEEEeeccc----------cCcccceEEEecHHHHHhccCCCcHHHHH Q lcl|NC_020841. 1 MAGSLTLPINMLVNVSIEYQAKLLSRDAFNR-LLIVGSTAP----------NGRATDTGIYTSIDGVKLDYGVEADEYKI 69 (367) Q Consensus 1 ~~~~~~l~i~~iv~V~i~~~~~~~~~~~fg~-~li~~~~~~----------~~~~~~~~~yts~~~v~~df~~~s~~yka 69 (367) .+.......+..+.+.+...........-.. ..-.+.... ....+.+...... +-+......+. T Consensus 236 v~~~~~a~~~~~~~v~v~~~~~~~~~a~~~t~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~-----~~~~~~~~~~~ 310 (659) T protein:vir:10 236 IEIVSKADYAKGASALLPIYPGGGTRASTAKAVFGYGPQTDSQYAIIVRRNDAIVQSVVLSTKR-----GEKDIYDSNIY 310 (659) T ss_pred EEEechhhccccceeeeeeeeecccccccceeeeeeccccccchhhccccccceeeeeeeeccc-----cccccccchhh Confidence 1100000011111111111100000000000 000000000 0000000000000 00111111111 Q ss_pred HHHHhccCcc--------------cceE-EEEeccCc-----cchHHHHHHHHhcccCcEEEEEEecC-------CHHHH Q lcl|NC_020841. 70 AQKYFSQNPK--------------PRDL-MIATVTAL-----TDPLASIGEVAAKTLGFYAFCFASEV-------AAADI 122 (367) Q Consensus 70 A~~~F~Q~p~--------------p~~v-~v~~~~~~-----~t~~~~l~~~~~~~~~w~~~~~~~~~-------~~~~~ 122 (367) ....|..... ...+ ..++.+.. .+....+..+.....-...++++... ...-. T Consensus 311 ~~~~~~~~~~~~v~~~~~~~~~~~~~~~~l~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~il~~p~~~~~~~~~~~~v~ 390 (659) T protein:vir:10 311 IDDFFAKGGSEYIFATAQNWPEGFSGILTLSGGLSSNAEVTAGDLMEAWDFFADRESVDVQLFIAGSCAGESLETASTVQ 390 (659) T ss_pred hhhhhccCcccEEEEeecccCCCccceeeecccccccccccchhHHHHHHHhhhccccceeEEEecCCCCcchhhhHHHH Confidence 2222322110 0001 11111111 11223333333322212233333221 12345 Q ss_pred HHHHHHhhccCcEEEEEEeC------c---hhhhHHHHHHHH----------hccccceeecC------C------chhH Q lcl|NC_020841. 123 QGLAEWAQSNNRMFMTVMTD------D---TEAVTTGNALKE----------LGQYHYCITYH------E------DYAT 171 (367) Q Consensus 123 ~ala~~~ea~~~~~~~~~~d------~---~~~~~~~~~~~~----------~~~~~~~~~~~------~------~~~~ 171 (367) .++...++....+|.+.... . ....+....... +...+..+.+. + ..+. T Consensus 391 ~al~~~~~~~~~~~~~~d~p~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~s~~~~l~~p~~~~~d~~~~~~~~~p~ 470 (659) T protein:vir:10 391 KHVVSIGDARQDCLVLCSPPRETVVGIPVTRAVDNLVNWRTAAGSYTDNNFNISSTYAAIDGNYKYQYDKYNDVNRWVPL 470 (659) T ss_pred HHHHHHHHhhCCeEEEEcCccccccCCCcccCHHHHHHHHHhcccccccccccCcceEEEEeCcEEEecccCCceEEech Confidence 55667777777666554321 0 111111111111 11112222111 1 1234 Q ss_pred HHHHHHHHHHcccccCcceeeeeeeecC---ccc--ccCCCHHHHHHHHhCCceEEEEeeccccceEEEecCEeeCC--- Q lcl|NC_020841. 172 VGAVAGMALDQRYDKTDGVKTLHLKSLV---SVV--STDISQTQAASLKAACINYYSDYGNPDNSLPIFANGHAGGG--- 243 (367) Q Consensus 172 ~~~~~~~~~~~~~~~~~g~~t~~~k~l~---Gv~--~~~~t~t~~~~l~~~~~n~y~~~~~~~~~~~~~~~G~~~~G--- 243 (367) .+.++|..+-.+..+-+ ......|.+. |+. ...+++.|.+.|..+++|++..+.+.+ ..+....++.+ T Consensus 471 sg~~AGl~Ar~D~~~g~-~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G---~~~wG~rT~~~~~s 546 (659) T protein:vir:10 471 AADIAGLCARTDNVSQT-WMSPAGYNRGQILNVIKLAIETRQAQRDRLYQEAINPVTGTGGDG---YVLYGDKTATSVPS 546 (659) T ss_pred HHHHHHHHHHHhccCCc-eEccCCceeeeeeccccceecCCHhHHHHHhhCCeeEEEEeCCCe---EEEEcccccCCCCc Confidence 56777777766544421 1222333333 332 235789999999999999998876543 24566666554 Q ss_pred --chhhHHHHHHHHHHHHHHHHHHHHHhcCCCCcCHhHHHHHHHHHHHHHHHHHhcCceecccccCcccccccccccccc Q lcl|NC_020841. 244 --KFFDFVMGFDWLRNVIETNVFNGQRLRRLTPQTDRGMMMIKADIVNGLEEAVKAGLVAAGTWNGAALGEIETYDYLPT 321 (367) Q Consensus 244 --~~iD~~~~~dwl~~~lq~~l~~ll~~~~kipy~~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~g~~~~~~~~~~ 321 (367) .||-+.+-.+|+...|+..+...+-. |.++.=...|+..|+.-|+..+++|.|. T Consensus 547 ~~~~i~vrR~~~~i~~si~~~~~~~v~e----~n~~~l~~~i~~~i~~fL~~l~~~gal~-------------------- 602 (659) T protein:vir:10 547 PFDRINVRRLFNMLKTNIGRSSKYRLFE----LNNAFTRSSFRTETAQYLQGIKALGGIY-------------------- 602 (659) T ss_pred ccceEehhhHHHHHHHHHHHHHHHhccC----CCCHHHHHHHHHHHHHHHHHHHhcCcee-------------------- Confidence 36888889999999988888765543 5577778899999999999999999994 Q ss_pred ceeEEcCchHhCCHHHHhccccCCeEEEEEECceEEEEEEEEEecC Q lcl|NC_020841. 322 GYYVYNESIRDQAQVIREQRIAPPFIILVKGAGAIHDTDITLIPEA 367 (367) Q Consensus 322 gy~v~~~~~~~~~~~dr~~R~~~~~~~~~~~agaIh~v~i~~~v~~ 367 (367) ||.|.++ .++.|++|+.+.+. -+.+.+...-.++.|.+++.=.. T Consensus 603 ~~~V~~d-~~~nt~~~i~~G~~-~~~i~~~p~~pae~i~~~~~~~~ 646 (659) T protein:vir:10 603 EYRVVCD-TTNNTPSVIDRNEF-VATFYIQPARSINYITLNFVATA 646 (659) T ss_pred eEEEEEc-CCCCCHHHhhCCeE-EEEEEEEecCCcceEEEEEEEEe Confidence 4788876 57899999999988 49999999999999999877554 No 44 >protein:vir:106984 Length: 743 # NCBI annotation: contractile sheath protein gp18 # Family: family:all:661 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195136;genbank:gi:58532913;uniprot:Q5GQN6;genbank:GeneID:3260481 Probab=98.02 E-value=7e-06 Score=48.84 Aligned_cols=335 Identities=12% Similarity=0.022 Sum_probs=161.2 Q ss_pred CcccccccccceEEEEEeeeccccccccccceE---EEeeccccCc--ccceEEEe-cHHHHH--hccCCCcHHH-HHHH Q lcl|NC_020841. 1 MAGSLTLPINMLVNVSIEYQAKLLSRDAFNRLL---IVGSTAPNGR--ATDTGIYT-SIDGVK--LDYGVEADEY-KIAQ 71 (367) Q Consensus 1 ~~~~~~l~i~~iv~V~i~~~~~~~~~~~fg~~l---i~~~~~~~~~--~~~~~~yt-s~~~v~--~df~~~s~~y-kaA~ 71 (367) .+.......+.+.-+.++.. .......+..+ .+.......+ ......+. -+.+-. ..+....... -+.. T Consensus 326 ~~~~~~~~~d~~~v~v~~~~--~~~~~~~~~v~~~~~~~s~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~~~~~~~~~~~ 403 (743) T protein:vir:10 326 FATDNGITDDQVHFAVIDTT--GELTGTANTIVERLTYLSKLSDARSEENANIYYKNVINEQSAYLYHGNDAAVQIAASG 403 (743) T ss_pred ccccccccccceEEEEecCc--ceeeeccCceeEEEeeeecccccccccCcceeecceeccccceeeccCcccceeeecc Confidence 11111111111111111100 00000000000 0000000000 00000000 000000 0000000000 0000 Q ss_pred HHhccC--------------cccceE-EEEeccCc----cchHHHHHHHHhcccCcEEEEEEec------CCHHHHHHHH Q lcl|NC_020841. 72 KYFSQN--------------PKPRDL-MIATVTAL----TDPLASIGEVAAKTLGFYAFCFASE------VAAADIQGLA 126 (367) Q Consensus 72 ~~F~Q~--------------p~p~~v-~v~~~~~~----~t~~~~l~~~~~~~~~w~~~~~~~~------~~~~~~~ala 126 (367) ..+.+. +....+ .+++.+.. ......+..+.....-...++++.. .......++. T Consensus 404 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gG~d~~~~~~~~~~~~~~~~~~~~~~~~~ll~~p~~~~~~~~~~~v~~a~~ 483 (743) T protein:vir:10 404 EAWGQSSDQVLADAGTAFSRTTGYWVNLAGGNDDFAYDAGEFGAAMDLFLDTEETEIDFVLMGGSMADEADTKSKATKVI 483 (743) T ss_pred ccCccccceeeeecccccccccceEEEeecCccccccchhHHHHHHHHhhhccccCcceEEecCcccCccchHHHHHHHH Confidence 111111 011111 22222211 1112233333332222223333322 1234466677 Q ss_pred HHhhccCcEEEEEEeCch---------------hhhHHHHHHHHhccccceeecCC-------------chhHHHHHHHH Q lcl|NC_020841. 127 EWAQSNNRMFMTVMTDDT---------------EAVTTGNALKELGQYHYCITYHE-------------DYATVGAVAGM 178 (367) Q Consensus 127 ~~~ea~~~~~~~~~~d~~---------------~~~~~~~~~~~~~~~~~~~~~~~-------------~~~~~~~~~~~ 178 (367) ..++...+++.++-.... .........+..........+++ ..+..+.++|. T Consensus 484 ~~~~~~~~~~a~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~AGl 563 (743) T protein:vir:10 484 AIAASRKDALAFVSPHKGNQIASTGNVALSSAQQKENTIAFFSDLTSTSYAVFDSGYKYVYDRFTDKYRYIPCNGDVAGL 563 (743) T ss_pred HHHHhhCCeEEEEecCCCccccccccccccccccchHHHHHHHhccCCeeEEEEccceeeeccccCceeEechhHHHHHH Confidence 777776666655432210 00111111111111111112211 02344667777 Q ss_pred HHHcccccCcceeeeeeeecCccc-----ccCCCHHHHHHHHhCCceEEEEeeccccceEEEecCEeeCC-----chhhH Q lcl|NC_020841. 179 ALDQRYDKTDGVKTLHLKSLVSVV-----STDISQTQAASLKAACINYYSDYGNPDNSLPIFANGHAGGG-----KFFDF 248 (367) Q Consensus 179 ~~~~~~~~~~g~~t~~~k~l~Gv~-----~~~~t~t~~~~l~~~~~n~y~~~~~~~~~~~~~~~G~~~~G-----~~iD~ 248 (367) .+..+..+-+ ......|.+.||. ...+++.|.+.|..+++|++..+.+.+ + .+...+++.+ .||-+ T Consensus 564 ~a~~D~~~g~-~~span~~~~gi~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G--~-~~wG~rT~~s~d~~~~~i~v 639 (743) T protein:vir:10 564 CVQTSNQLDD-WYSPAGLNRGGILNAVKLAYNPNKADRDELYQNRINPVVSLRGQG--I-TLFGDKTALAAPSAFDRINV 639 (743) T ss_pred HHHhhccCCc-EEccCCeeeeeeeccccceecCChhHHHhHhhCCceEEEEecCCe--E-EEEcccccCCCCcccceEee Confidence 7766544321 2233445555543 135788999999999999999886543 3 4556666544 27888 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCCcCHhHHHHHHHHHHHHHHHHHhcCceecccccCccccccccccccccceeEEcC Q lcl|NC_020841. 249 VMGFDWLRNVIETNVFNGQRLRRLTPQTDRGMMMIKADIVNGLEEAVKAGLVAAGTWNGAALGEIETYDYLPTGYYVYNE 328 (367) Q Consensus 249 ~~~~dwl~~~lq~~l~~ll~~~~kipy~~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~g~~~~~~~~~~gy~v~~~ 328 (367) .+-.+|++..|+..++..+-. |.+..=.+.|+..|+.-|+..+++|.|. ||.|.+. T Consensus 640 rR~~~~i~~si~~~~~~~v~e----~n~~~~~~~i~~~i~~fL~~l~~~gal~--------------------~~~V~~d 695 (743) T protein:vir:10 640 RRLFLNLEKRARRLAEGVLFE----QNDATTRAGFSSALNSYLSEVQARRGVT--------------------DYLVICD 695 (743) T ss_pred hhhHHHHHHHHHHHHHHhccC----CCCHHHHHHHHHHHHHHHHHHHhcCcee--------------------eeEEEEc Confidence 899999999999988876543 4478888999999999999999999883 4788886 Q ss_pred chHhCCHHHHhccccCCeEEEEEECceEEEEEEEEEecC Q lcl|NC_020841. 329 SIRDQAQVIREQRIAPPFIILVKGAGAIHDTDITLIPEA 367 (367) Q Consensus 329 ~~~~~~~~dr~~R~~~~~~~~~~~agaIh~v~i~~~v~~ 367 (367) .++.+++|+.+.+. -+.+.+.....+++|.+++.=.. T Consensus 696 -~~~nt~~~i~~G~~-~~~i~~~p~~pae~I~~~~~~~~ 732 (743) T protein:vir:10 696 -ESNNTPDIIDRNEF-VAEVYVKPTRSINFITITFTATK 732 (743) T ss_pred -CCCCCHHHhhCCeE-EEEEEEEecCCcceEEEEEEEee Confidence 68899999999988 49999999999999999886332 No 45 >protein:vir:6594 Length: 666 # NCBI annotation: tail sheath protein # Family: family:all:661 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891725;genbank:gi:33620670;genbank:GeneID:1725344 Probab=97.98 E-value=8.3e-06 Score=48.44 Aligned_cols=330 Identities=11% Similarity=0.004 Sum_probs=162.7 Q ss_pred Cccccccc--ccceEEEEEeeecc------ccccccc-----c--------------ceEEEeeccccCcccceEEEecH Q lcl|NC_020841. 1 MAGSLTLP--INMLVNVSIEYQAK------LLSRDAF-----N--------------RLLIVGSTAPNGRATDTGIYTSI 53 (367) Q Consensus 1 ~~~~~~l~--i~~iv~V~i~~~~~------~~~~~~f-----g--------------~~li~~~~~~~~~~~~~~~yts~ 53 (367) .+.....+ ..+-+.|.+.-... .+....+ . -.++++... ...|+|..-... T Consensus 220 ~a~~A~~~g~~g~~i~v~i~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g--~~~e~~~~~~~~ 297 (666) T protein:vir:65 220 PAVSAIYAGEIGNSLEVEILARSAFKNTAPDLTMYPYGGERTAARNLIPYAPQNDNQYAFIVRRDG--VVVESYVLSTLK 297 (666) T ss_pred ceeeeeeccccccceeEEeecccccccccccccccccccccccceeeecccccccccceeeeecCC--cccceeecccCc Confidence 00000000 00111121111000 0000000 0 001111000 011111110000 Q ss_pred HHHHhccCCCcHHHHHHHHHhccC-------------cccceE---------------EEEeccCccchHHHHHHHHhcc Q lcl|NC_020841. 54 DGVKLDYGVEADEYKIAQKYFSQN-------------PKPRDL---------------MIATVTALTDPLASIGEVAAKT 105 (367) Q Consensus 54 ~~v~~df~~~s~~ykaA~~~F~Q~-------------p~p~~v---------------~v~~~~~~~t~~~~l~~~~~~~ 105 (367) .+ + ....+..|.. .++... ..+..+ .++...........+..+.+.. T Consensus 298 ~~-~--~~~~~~~~~~--~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~ 372 (666) T protein:vir:65 298 GD-K--DVYGNSIYMD--DFFARGSSQYIYATAQGWVDGFSGIISLAGGVSANEATTGGVGADPFIGAMMQGWDLFAERE 372 (666) T ss_pred cc-c--cccchhhhhh--hhhcccccceeeeecccccccccceEEccCCCCcCcccccccccccccccHHHHHHHHhhhh Confidence 00 0 0001111111 111100 000000 1111111122334445554443 Q ss_pred cCcEEEEEEec------CCHHHHHHHHHHhhccCcEEEEEEe------Cc---hhhhHHHHHHHHh----------cccc Q lcl|NC_020841. 106 LGFYAFCFASE------VAAADIQGLAEWAQSNNRMFMTVMT------DD---TEAVTTGNALKEL----------GQYH 160 (367) Q Consensus 106 ~~w~~~~~~~~------~~~~~~~ala~~~ea~~~~~~~~~~------d~---~~~~~~~~~~~~~----------~~~~ 160 (367) .....+++... ....-..++..+++....+|..... +. +...+.......+ ...+ T Consensus 373 ~~~~~~l~~p~~~~~~~~~~~v~~~l~~~~~~~~~~~a~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~ 452 (666) T protein:vir:65 373 SIHVNLLIAGACAGEGDAFSTVQKHAVSIGDERQDCLVMVSPPRSTVVNIPVTTAIDNLIAWREGSGNYNENNMNINTTY 452 (666) T ss_pred hccCCceeecCcCCccchhHHHHHHHHHHHhhccceEEEeccccceeeecCCCCCHHHHHHHHHhcccccccccccCcce Confidence 32222333221 1345566677778877666544321 11 1111111111111 1112 Q ss_pred ceeecC------C------chhHHHHHHHHHHHcccccCcceeeeeeeecCccc-----ccCCCHHHHHHHHhCCceEEE Q lcl|NC_020841. 161 YCITYH------E------DYATVGAVAGMALDQRYDKTDGVKTLHLKSLVSVV-----STDISQTQAASLKAACINYYS 223 (367) Q Consensus 161 ~~~~~~------~------~~~~~~~~~~~~~~~~~~~~~g~~t~~~k~l~Gv~-----~~~~t~t~~~~l~~~~~n~y~ 223 (367) ..+.+. + ..+..+.++|..+..+..+-+ ......|.+.||. ...+++.|.+.|..+|+|++. T Consensus 453 ~~~~~p~~~~~d~~~~~~~~~p~sg~vAGl~Ar~D~~~g~-~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~ 531 (666) T protein:vir:65 453 AVIDGNYKYQYDKYNDVNRWVPLAADIAGLCARTDAVSQP-WMSPAGYNRGQIMNVVKLAIEPRKAHRDRLYQAAINPVI 531 (666) T ss_pred EEEEcCceEEecccCCceeEechHHHHHHHHHHHhccCCc-EEccCCeecceeeccccceeecChhHHHhhhhCCceEEE Confidence 222111 1 113456667777766543311 1222344444332 235788999999999999998 Q ss_pred EeeccccceEEEecCEeeCC-----chhhHHHHHHHHHHHHHHHHHHHHHhcCCCCcCHhHHHHHHHHHHHHHHHHHhcC Q lcl|NC_020841. 224 DYGNPDNSLPIFANGHAGGG-----KFFDFVMGFDWLRNVIETNVFNGQRLRRLTPQTDRGMMMIKADIVNGLEEAVKAG 298 (367) Q Consensus 224 ~~~~~~~~~~~~~~G~~~~G-----~~iD~~~~~dwl~~~lq~~l~~ll~~~~kipy~~~G~~~l~~~v~~vl~~a~~~G 298 (367) .+.+.+ ..+..++++++ .||-+.+-.+|++..|+...+..+-. |.+..=...|+..|+.-|++.+++| T Consensus 532 ~~~~~G---~~~wG~rT~~~~~s~~~~i~vrR~~~~i~~si~~~~~~~v~e----pn~~~l~~~i~~~i~~~L~~l~~~g 604 (666) T protein:vir:65 532 GAGGEG---FILMGDKTATTVPSPFDRINVRRLFNMLKKNIGDSSKYKLFE----NNDNFTRASFRMEVSQYLSTIRSLG 604 (666) T ss_pred EeCCCe---EEEEecccCCCCCcccceEehhhHHHHHHHHHHHHHHHhccC----CCCHHHHHHHHHHHHHHHHHHHhCC Confidence 886543 25667777655 26888889999999999888876543 5578888999999999999999999 Q ss_pred ceecccccCccccccccccccccceeEEcCchHhCCHHHHhccccCCeEEEEEECceEEEEEEEEEecC Q lcl|NC_020841. 299 LVAAGTWNGAALGEIETYDYLPTGYYVYNESIRDQAQVIREQRIAPPFIILVKGAGAIHDTDITLIPEA 367 (367) Q Consensus 299 ~I~~g~~~~~~~g~~~~~~~~~~gy~v~~~~~~~~~~~dr~~R~~~~~~~~~~~agaIh~v~i~~~v~~ 367 (367) .|. ||.|.+. .++.|++|+.+.+. -+.+.+.....++.|.+++.=.. T Consensus 605 al~--------------------g~~V~~d-~~~nt~~~i~~G~~-~~~i~~~p~~pae~i~~~~~~~~ 651 (666) T protein:vir:65 605 GIY--------------------DFRVQCD-TTNNTPDVIDRNEF-VASMFIKPAKSINYIMLNFTAVA 651 (666) T ss_pred cee--------------------eeEEEEc-CCCCCHHHhhCCeE-EEEEEEEecCCcceEEEEEEEee Confidence 994 4778876 67899999999988 59999999999999999887554 No 46 >protein:vir:7206 Length: 659 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049780;genbank:gi:9632592;genbank:GeneID:1258597 Probab=97.97 E-value=8.9e-06 Score=48.29 Aligned_cols=330 Identities=9% Similarity=0.002 Sum_probs=162.6 Q ss_pred Ccccccccc-------------cceEEEEEeeecccccccccc-ceEEEeecccc----------CcccceEEEecHHHH Q lcl|NC_020841. 1 MAGSLTLPI-------------NMLVNVSIEYQAKLLSRDAFN-RLLIVGSTAPN----------GRATDTGIYTSIDGV 56 (367) Q Consensus 1 ~~~~~~l~i-------------~~iv~V~i~~~~~~~~~~~fg-~~li~~~~~~~----------~~~~~~~~yts~~~v 56 (367) .+...+-.. ...+.+.+............. ..+.++..... ...+.+..-. T Consensus 223 ~a~~~gt~g~~~tv~i~~~~~~~~~~~~~v~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----- 297 (659) T protein:vir:72 223 VALYPGELGDKIEIEIVSKADYAKGASALLPIYPGGGTRASTAKAVFGYGPQTDSQYAIIVRRNDAIVQSVVLST----- 297 (659) T ss_pred eeccccccccceeEEEccccccccceeeeeecccccccccccceeeeeeecccccccceeeecccceeeeeeeee----- Confidence 111110000 111111111111000000000 00000000000 0000111000 Q ss_pred HhccCCCcHHHHHHHHHhccCcccceEE----------------EEeccC-----ccchHHHHHHHHhcccCcEEEEEEe Q lcl|NC_020841. 57 KLDYGVEADEYKIAQKYFSQNPKPRDLM----------------IATVTA-----LTDPLASIGEVAAKTLGFYAFCFAS 115 (367) Q Consensus 57 ~~df~~~s~~ykaA~~~F~Q~p~p~~v~----------------v~~~~~-----~~t~~~~l~~~~~~~~~w~~~~~~~ 115 (367) ..+.+.....-......|...... .+. .++.+. ..+....+..+.....-...+++.. T Consensus 298 ~~~~~~~~~~~~~~~~~~~~~~~~-~v~~~~~~~~~~~~~~~~l~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~p 376 (659) T protein:vir:72 298 KRGEKDIYDSNIYIDDFFAKGGSE-YIFATAQNWPEGFSGILTLSGGLSSNAEVTAGDLMEAWDFFADRESVDVQLFIAG 376 (659) T ss_pred ccccccccchhhhhhhhhhcCCce-EEEEEecccCCcccccccccccccccccccchhHHHHHHHhhhccccceeEEEec Confidence 001111111122223333221110 111 111111 1112334444433222122333332 Q ss_pred cC---C----HHHHHHHHHHhhccCcEEEEEEeCc---------hhhhHHHHHHHHh----------ccccceeecC--- Q lcl|NC_020841. 116 EV---A----AADIQGLAEWAQSNNRMFMTVMTDD---------TEAVTTGNALKEL----------GQYHYCITYH--- 166 (367) Q Consensus 116 ~~---~----~~~~~ala~~~ea~~~~~~~~~~d~---------~~~~~~~~~~~~~----------~~~~~~~~~~--- 166 (367) .. . ..-..++...++....+|.+.-... ............. ...+..+.+. T Consensus 377 ~~~~~~~~~~~~v~~~l~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~s~~~~~~~p~~~ 456 (659) T protein:vir:72 377 SCAGESLETASTVQKHVVSIGDARQDCLVLCSPPRETVVGIPVTRAVDNLVNWRTAAGSYTDNNFNISSTYAAIDGNHKY 456 (659) T ss_pred CCCCcchhhhHHHHHHHHHHHhhhCCEEEEEcCccccccCCCcccCHHHHHHHHhhccccccccccccceeEEEEcCcee Confidence 21 1 1234556677777766665543210 1111111111111 1112222111 Q ss_pred ---C------chhHHHHHHHHHHHcccccCcc-eeeeeeeecCccc-----ccCCCHHHHHHHHhCCceEEEEeeccccc Q lcl|NC_020841. 167 ---E------DYATVGAVAGMALDQRYDKTDG-VKTLHLKSLVSVV-----STDISQTQAASLKAACINYYSDYGNPDNS 231 (367) Q Consensus 167 ---~------~~~~~~~~~~~~~~~~~~~~~g-~~t~~~k~l~Gv~-----~~~~t~t~~~~l~~~~~n~y~~~~~~~~~ 231 (367) + ..+..+.++|..+-.+..+ | ......|.+.||. ...+++.|.+.|..+++|+...+.+.+ T Consensus 457 ~~d~~~~~~~~~p~sg~vAGl~Ar~D~~~--G~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~g~G-- 532 (659) T protein:vir:72 457 QYDKYNDVNRWVPLAADIAGLCARTDNVS--QTWMSPAGYNRGQILNVIKLAIETRQAQRDRLYQEAINPVTGTGGDG-- 532 (659) T ss_pred eccccCCceEEechHHHHHHHHHHhhccC--CcEEccCCeeeceeeccccccccCChhHHHHHhhCCceEEEEecCCe-- Confidence 0 0134567777777665433 3 2222344443332 235788999999999999999886543 Q ss_pred eEEEecCEeeCC-----chhhHHHHHHHHHHHHHHHHHHHHHhcCCCCcCHhHHHHHHHHHHHHHHHHHhcCceeccccc Q lcl|NC_020841. 232 LPIFANGHAGGG-----KFFDFVMGFDWLRNVIETNVFNGQRLRRLTPQTDRGMMMIKADIVNGLEEAVKAGLVAAGTWN 306 (367) Q Consensus 232 ~~~~~~G~~~~G-----~~iD~~~~~dwl~~~lq~~l~~ll~~~~kipy~~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~ 306 (367) + .+...+++++ .||-+.+-.+|+...|+..+...+-. |.++.=...|+..|+.-|++.+++|.|. T Consensus 533 ~-~~wG~rT~~~~~s~~~~i~vrR~~~~i~~si~~~~~~~v~e----~n~~~l~~~i~~~i~~fL~~l~~~gal~----- 602 (659) T protein:vir:72 533 Y-VLYGDKTATSVPSPFDRINVRRLFNMLKTNIGRSSKYRLFE----LNNAFTRSSFRTETAQYLQGNKALGGIY----- 602 (659) T ss_pred E-EEEcccccCCCCcccceEeehhHHHHHHHHHHHHHHHhhcC----CCCHHHHHHHHHHHHHHHHHHHhcCcee----- Confidence 2 4566666554 36888889999999998888765443 5577888999999999999999999993 Q ss_pred CccccccccccccccceeEEcCchHhCCHHHHhccccCCeEEEEEECceEEEEEEEEEecC Q lcl|NC_020841. 307 GAALGEIETYDYLPTGYYVYNESIRDQAQVIREQRIAPPFIILVKGAGAIHDTDITLIPEA 367 (367) Q Consensus 307 ~~~~g~~~~~~~~~~gy~v~~~~~~~~~~~dr~~R~~~~~~~~~~~agaIh~v~i~~~v~~ 367 (367) ||.|.++ .++.+++|+.+.+. -+.+.+.....++.|.+++.=.. T Consensus 603 ---------------~~~V~~d-~~~nt~~~i~~G~~-~~~i~~~p~~pae~I~~~~~~~~ 646 (659) T protein:vir:72 603 ---------------EYRVVCD-TTNNTPSVIDRNEF-VATFYIQPARSINYITLNFVATA 646 (659) T ss_pred ---------------eEEEEEc-CCCCCHHHhhCCeE-EEEEEEEecCCccEEEEEEEEee Confidence 4888876 77899999999988 49999999999999999987433 No 47 >protein:vir:80779 Length: 569 # NCBI annotation: putative tail sheath protein # Family: family:all:2449 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504132;genbank:gi:158079319;genbank:GeneID:5666428 Probab=97.95 E-value=9.5e-06 Score=48.12 Aligned_cols=307 Identities=12% Similarity=0.006 Sum_probs=154.1 Q ss_pred CcccccccccceEEEEEeeeccccccccccceEEEeecc--ccC--cccceEEEecHHHHHhccCCCcHHHHHHHHHhcc Q lcl|NC_020841. 1 MAGSLTLPINMLVNVSIEYQAKLLSRDAFNRLLIVGSTA--PNG--RATDTGIYTSIDGVKLDYGVEADEYKIAQKYFSQ 76 (367) Q Consensus 1 ~~~~~~l~i~~iv~V~i~~~~~~~~~~~fg~~li~~~~~--~~~--~~~~~~~yts~~~v~~df~~~s~~ykaA~~~F~Q 76 (367) + +...-+ .+...+++..+. .....+ +..+.... ... ..+.|..... ..+.+.-..+...|.. T Consensus 227 ~-~~~~~~--~~~~~~~d~~~~-~~~~t~--~~~~~~~~~di~~~~~~~~~v~~~~--------~~~~~l~~~~~~~LtG 292 (569) T protein:vir:80 227 F-FPIGDK--NLPTDALEAVTK-VDVKTE--AVFVGALAGDIAKQLEYNDYVTVAV--------DATKPVEDFELTNLTG 292 (569) T ss_pred E-EecCCC--cceehhccchhh-eecccc--ceeeehhHHHHHHhhcCCceEEEEe--------cCCcceeeecceeecC Confidence 1 111111 111111221111 101111 11111100 000 0111111110 0000000011111221 Q ss_pred CcccceEEEEeccCccchHHHHHHHHhcccCcEEEEEEecCCHHHHHHHHHHhhcc---Cc-EEEEEEeCch-hhhHHHH Q lcl|NC_020841. 77 NPKPRDLMIATVTALTDPLASIGEVAAKTLGFYAFCFASEVAAADIQGLAEWAQSN---NR-MFMTVMTDDT-EAVTTGN 151 (367) Q Consensus 77 ~p~p~~v~v~~~~~~~t~~~~l~~~~~~~~~w~~~~~~~~~~~~~~~ala~~~ea~---~~-~~~~~~~d~~-~~~~~~~ 151 (367) +.-+ ....+..+++++++.. +|.+++ +...+.+.+.++..|++.. ++ ++.++..... ....... T Consensus 293 G~dG--------~~~~~~~~~l~~le~~--~~~~i~-~~t~d~av~~~l~a~vkr~r~~g~~~~aVvg~~~~~~~~~~~~ 361 (569) T protein:vir:80 293 GSDG--------TAPESWANKFPLLANE--GGYYLV-PLTDKQAVHSEALAFVKDRTDNGDPMRIIVGGGTNETVEESIT 361 (569) T ss_pred CCCC--------CccchHHHHHHHHhhC--CcEEEE-ecCCChHHHHHHHHHHHHHHhCCCcEEEEecCCCCCCHHHHHH Confidence 1110 1123467788888764 344443 3444566778899999854 23 3333332221 1222233 Q ss_pred HHHHhccccceeecC--------------CchhHHHHHHHHHHHcccccCcceeeeeeeecCcccc-cCCCHHHHHHHHh Q lcl|NC_020841. 152 ALKELGQYHYCITYH--------------EDYATVGAVAGMALDQRYDKTDGVKTLHLKSLVSVVS-TDISQTQAASLKA 216 (367) Q Consensus 152 ~~~~~~~~~~~~~~~--------------~~~~~~~~~~~~~~~~~~~~~~g~~t~~~k~l~Gv~~-~~~t~t~~~~l~~ 216 (367) ..+.+++.+...... +....+++++|..++..... +.| ||.++++.. ..++.+|++.+.. T Consensus 362 ~a~~~n~e~vv~v~~~~~~~~~~g~~~~~~~~~~aa~vAG~~A~~~~~~---S~T--~k~i~~~~i~~~lt~~e~~~li~ 436 (569) T protein:vir:80 362 RATNLRDPRASLVGFSGTRKMDDGRLLKLPGYMMASQIAGIASGLEVGE---AIT--FKHFNVTSVDRVFESSQLDMLNE 436 (569) T ss_pred HHhhcCCCeEEEEecCceeecCCCcceeechhhHHHHHHHHHhcCcccc---Ccc--ceeeccccccccCCHHHHHHHHh Confidence 344444444322221 11233556666666655444 333 445543332 3689999999999 Q ss_pred CCceEEEEeeccccceEEEecCEeeCC-----ch--hhHHHHHHHHHHHHHHHHHHHHHhcCCCCcCHhHHHHHHHHHHH Q lcl|NC_020841. 217 ACINYYSDYGNPDNSLPIFANGHAGGG-----KF--FDFVMGFDWLRNVIETNVFNGQRLRRLTPQTDRGMMMIKADIVN 289 (367) Q Consensus 217 ~~~n~y~~~~~~~~~~~~~~~G~~~~G-----~~--iD~~~~~dwl~~~lq~~l~~ll~~~~kipy~~~G~~~l~~~v~~ 289 (367) +|.+.+....+........-++.++-. .| |-.++-.|.+...++..+-+.+.-. |=++.|...|++.++. T Consensus 437 ~G~~~l~~~~~~~~~v~~~vn~itT~t~~~~~~~~~i~viRv~D~i~~dir~~~~~~yiGk---~nn~~~r~~v~~~i~~ 513 (569) T protein:vir:80 437 SGVISIEFVRNRTLTAFRVVQDVTTYNDKSDPVKNEMSVGEANDFLVSELKIELDNNFIGT---KVIDTSASLIKNFIQS 513 (569) T ss_pred CCeEEEEEecCceEEEEEEeccceecCCCCCchhhhhhhhHHHHHHHHHHHHHHHhhcCcc---cCChhHHHHHHHHHHH Confidence 999998765433222212224554422 24 8889999999999988877766543 5578899999999999 Q ss_pred HHHHHHhcCceecccccCccccccccccccccceeEEcCchHhCCHHHHhccccCCeEEEEEECceEEEEEEEEEecC Q lcl|NC_020841. 290 GLEEAVKAGLVAAGTWNGAALGEIETYDYLPTGYYVYNESIRDQAQVIREQRIAPPFIILVKGAGAIHDTDITLIPEA 367 (367) Q Consensus 290 vl~~a~~~G~I~~g~~~~~~~g~~~~~~~~~~gy~v~~~~~~~~~~~dr~~R~~~~~~~~~~~agaIh~v~i~~~v~~ 367 (367) .|++..+.|.|..... .. ..+.. ..| |. -+.|.+...-++++|.+++.+.. T Consensus 514 ~L~~l~~~gaI~~~~~--~d-------------v~v~~-------~~d---~~--~v~~~v~Pv~~~ekI~~ti~~~~ 564 (569) T protein:vir:80 514 FLDNKKRAREIQDYTP--EE-------------VQVVL-------EGD---VA--SISMTVMPIRSLNKITVQLVYKQ 564 (569) T ss_pred HHHHHHhCCcccCCCc--cc-------------eEEEe-------cCC---EE--EEEEEEEEcccccEEEEEEEEee Confidence 9999999999953210 00 01110 111 22 37888999999999999998776 No 48 >protein:vir:100829 Length: 607 # NCBI annotation: hypothetical protein # Family: family:all:2449 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164738;genbank:gi:56693151;genbank:GeneID:3197462 Probab=97.95 E-value=9.7e-06 Score=48.06 Aligned_cols=319 Identities=9% Similarity=-0.020 Sum_probs=149.2 Q ss_pred Ccccccc------cccceEEEEEeeeccccccccccceEEEeeccccCcccceEEE---ecHHHHHhccCCCcHHHHHHH Q lcl|NC_020841. 1 MAGSLTL------PINMLVNVSIEYQAKLLSRDAFNRLLIVGSTAPNGRATDTGIY---TSIDGVKLDYGVEADEYKIAQ 71 (367) Q Consensus 1 ~~~~~~l------~i~~iv~V~i~~~~~~~~~~~fg~~li~~~~~~~~~~~~~~~y---ts~~~v~~df~~~s~~ykaA~ 71 (367) +.+..++ |..+-++|.+........... .++ ... ...|... ...+++............+.. T Consensus 234 ~~g~~~i~tky~d~~~~~i~V~~~~~iv~a~~~D--~~~---~~~----~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~ 304 (607) T protein:vir:10 234 VVGSPSVNTSYLDEVTSPVDVKTAPAVVTAKIGD--AIS---KLG----YDPYVVVTQTSNNKPIVNGVSAGTGSATASV 304 (607) T ss_pred EecccceeeeccccccceeEEEEeeeeechhhhh--hhh---ccc----ccceEEeeecccchhhhhhhhccccceeeee Confidence 1111111 111111222111100000000 000 000 0001110 111111111011111111111 Q ss_pred HHhccCcccc----eEEEEeccC--ccchHHHHHHHHhcccCcEEEEEEecCCHHHHHHHHHHhhcc---CcEEEEEEeC Q lcl|NC_020841. 72 KYFSQNPKPR----DLMIATVTA--LTDPLASIGEVAAKTLGFYAFCFASEVAAADIQGLAEWAQSN---NRMFMTVMTD 142 (367) Q Consensus 72 ~~F~Q~p~p~----~v~v~~~~~--~~t~~~~l~~~~~~~~~w~~~~~~~~~~~~~~~ala~~~ea~---~~~~~~~~~d 142 (367) ..++- -.|. ....|+-+. ..+..+++++++.. +|+.+.. ...+...+.++..|++.. .+++..+... T Consensus 305 ~~~~~-~~~a~~a~~~LtGGtdG~~~~ty~dal~aLe~~--e~~~i~~-~t~d~ai~~~l~a~vkr~~~~g~~~~aVlg~ 380 (607) T protein:vir:10 305 TTAPE-SFPANFDTAFLTGGSTGDVPVSWADKFNGAIGN--NVYYIIP-LTSEENIHAELQAFIDEQHVLGYNYHAFVGG 380 (607) T ss_pred ecccc-ccccccceeeeeCCCCCCchhhHHHHHHHHhhc--CceEEEe-cCCCHHHHHHHHHHHHHHHhCCCcEEEEecC Confidence 11111 1111 113333222 23456778887764 4555543 344556678899999643 4444433322 Q ss_pred -c-hhhhHHHHHHHHhccccceeecC-------------CchhHHHHHHHHHHHcccccCcceeeeeeeecCccc-ccCC Q lcl|NC_020841. 143 -D-TEAVTTGNALKELGQYHYCITYH-------------EDYATVGAVAGMALDQRYDKTDGVKTLHLKSLVSVV-STDI 206 (367) Q Consensus 143 -~-~~~~~~~~~~~~~~~~~~~~~~~-------------~~~~~~~~~~~~~~~~~~~~~~g~~t~~~k~l~Gv~-~~~~ 206 (367) . ....+.....+.+++.+...... +....+++++|..++...+. +.| ||.++++. ...+ T Consensus 381 ~~~~t~~~~~t~a~~~N~ervv~V~~~~~~~~~G~~~~~~~~~~Aa~vAGl~Ag~~~~~---SlT--~k~i~~~~v~~~l 455 (607) T protein:vir:10 381 GFAEPLEQILSRQVNINDSRFGLVGQSGHVQEGGESVHVPAYLMAAYVGGLSSSLGVAV---PIT--NKKLALVDLDQNF 455 (607) T ss_pred CCCCCHHHHHHHHHhhCCCcEEEEecCeeEeeCCcceeccHHHHHHHHHHHHhcCcccc---Ccc--cceeccccccccC Confidence 1 12223334445555554432221 11223456666666654443 333 34444333 2369 Q ss_pred CHHHHHHHHhCCceEEEEeecccc-ceEEEecCEeeCC-----ch--hhHHHHHHHHHHHHHHHHHHHHHhcCCCCcCHh Q lcl|NC_020841. 207 SQTQAASLKAACINYYSDYGNPDN-SLPIFANGHAGGG-----KF--FDFVMGFDWLRNVIETNVFNGQRLRRLTPQTDR 278 (367) Q Consensus 207 t~t~~~~l~~~~~n~y~~~~~~~~-~~~~~~~G~~~~G-----~~--iD~~~~~dwl~~~lq~~l~~ll~~~~kipy~~~ 278 (367) +.+|++.+..+|+..+....+... +.....+|.++-+ .| |-.++-.|.+...++..+-+.++- |++. +. T Consensus 456 t~~e~e~ai~~Gv~~l~~~~~~~~~~~vrIv~~ItT~t~~~~~~~~~i~viRv~D~i~~dir~~~~~~yIG--k~nn-d~ 532 (607) T protein:vir:10 456 SGDDLNTLNQNGVIGIEHLVNRNATGGYYIVQDVSTNTVSSSHVDGSLYLGELTDFLFDNLRFVLRDTYIG--SNIR-ST 532 (607) T ss_pred CHHHHHHHHhCCeEEEEEccCccccceEEEeeeeeeccCCCCcchheeehhhhHHHHHHHHHHHHhhcCCc--ccCC-cc Confidence 999999999999998865433211 1122445555422 24 778999999999888887776653 3333 45 Q ss_pred HHHHHHHHHHHHHHH--HHhcCceecccccCccccccccccccccceeEEcCchHhCCHHHHhccccCCeEEEEEECceE Q lcl|NC_020841. 279 GMMMIKADIVNGLEE--AVKAGLVAAGTWNGAALGEIETYDYLPTGYYVYNESIRDQAQVIREQRIAPPFIILVKGAGAI 356 (367) Q Consensus 279 G~~~l~~~v~~vl~~--a~~~G~I~~g~~~~~~~g~~~~~~~~~~gy~v~~~~~~~~~~~dr~~R~~~~~~~~~~~agaI 356 (367) ....++..+...|.. -...|.|... .. . + +.++ ...| + --+.+.+..-.+| T Consensus 533 ~~~~vk~~i~~~L~~~~l~~~gaI~df-~~-e--------d-------v~v~-----~~~D---~--v~v~~~v~Pv~~i 585 (607) T protein:vir:10 533 SADDIKSTVASYLYSEMNNDDGLIVDF-SE-S--------D-------IVVT-----ISGT---V--VYIQFAVAPTQEI 585 (607) T ss_pred hHHHHHHHHHHHHHHHHHHhcCceeCC-Cc-c--------c-------cEEe-----eCCC---E--EEEEEEEEEcccc Confidence 667888888888743 4456777421 00 0 0 1111 0112 2 2378899999999 Q ss_pred EEEEEEEEecC Q lcl|NC_020841. 357 HDTDITLIPEA 367 (367) Q Consensus 357 h~v~i~~~v~~ 367 (367) +.|.+++.+.. T Consensus 586 ekIyvtv~v~~ 596 (607) T protein:vir:10 586 KNIVVSGTYSN 596 (607) T ss_pred eEEEEEEEEEE Confidence 99999998887 No 49 >protein:vir:80984 Length: 666 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469499;genbank:gi:157311456;genbank:GeneID:5602117 Probab=97.91 E-value=1.1e-05 Score=47.67 Aligned_cols=308 Identities=8% Similarity=-0.032 Sum_probs=145.5 Q ss_pred CcccccccccceEEEEEeeeccc-ccccc------------ccceEEEeeccc-cCcccceEEEecH-HHHHh-----cc Q lcl|NC_020841. 1 MAGSLTLPINMLVNVSIEYQAKL-LSRDA------------FNRLLIVGSTAP-NGRATDTGIYTSI-DGVKL-----DY 60 (367) Q Consensus 1 ~~~~~~l~i~~iv~V~i~~~~~~-~~~~~------------fg~~li~~~~~~-~~~~~~~~~yts~-~~v~~-----df 60 (367) .......+ ..++.. ..++... ..... -...++...... .........+... +.... +. T Consensus 277 ~~~~v~~~-g~~~e~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~ 354 (666) T protein:vir:80 277 YAFIVRRD-GVVVES-YVLSTLKGDKDVYGNSIYMDDFFGRGSSQYIYATAQGWVDGFSGIISLAGGVSANEATTGGVGA 354 (666) T ss_pred eeeEeccC-Ccccee-eecccccccccccchhhhhhhhhccccceeeeecccccccccceEEEecCCCCccccccccccc Confidence 00000111 001100 0111100 00000 000011110000 0000000000000 00000 00 Q ss_pred CCCcHHHHHHHHHhccCcccceEEEEeccCccchHHHHHHHHhcccCcEEEEEEec------CCHHHHHHHHHHhhccCc Q lcl|NC_020841. 61 GVEADEYKIAQKYFSQNPKPRDLMIATVTALTDPLASIGEVAAKTLGFYAFCFASE------VAAADIQGLAEWAQSNNR 134 (367) Q Consensus 61 ~~~s~~ykaA~~~F~Q~p~p~~v~v~~~~~~~t~~~~l~~~~~~~~~w~~~~~~~~------~~~~~~~ala~~~ea~~~ 134 (367) ......+..+..++ ++.+. .+. .+++... .......++...++.... T Consensus 355 ~~~~g~~~~~~~~~-------------------------~~~~~-~~~-~~l~~p~~~~~~~~~~~v~~~~~~~~~~~~~ 407 (666) T protein:vir:80 355 DPFIGAMMQGWGLF-------------------------AERES-IHV-NLLIAGACAGEGDAFSTVQKHAVSIGDERQD 407 (666) T ss_pred ccccccchhhhhhh-------------------------hhhcc-ccc-ceEeecCcCCcccchHHHHHHHHHHHHhhcc Confidence 00000011111111 11111 111 1122111 112233444455554443 Q ss_pred EEEEEEe------C---chhhhHHHHHHHH----------hccccceeecC------C------chhHHHHHHHHHHHcc Q lcl|NC_020841. 135 MFMTVMT------D---DTEAVTTGNALKE----------LGQYHYCITYH------E------DYATVGAVAGMALDQR 183 (367) Q Consensus 135 ~~~~~~~------d---~~~~~~~~~~~~~----------~~~~~~~~~~~------~------~~~~~~~~~~~~~~~~ 183 (367) ++.+... + .+...+....... +...+..+.+. + -.+..+.++|..+..+ T Consensus 408 ~~~~~~~~~~~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~l~~p~~~~~d~~~~~~~~~p~sg~~AGl~Ar~D 487 (666) T protein:vir:80 408 CLVMVSPPRSTVVNIPVTTAIDNLIAWREGSGNYNENNMNINTTYAVIDGNYKYQYDKYNDVNRWVPLAADIAGLCARTD 487 (666) T ss_pred eEEEeecceeEEeecCCCCCHHHHHHHHHhcccchhhhcccCcceEEEEcCceEEecccCCceeEechHHHHHHHHHHHh Confidence 3322210 1 0111111111111 11111111111 0 0133556677766655 Q ss_pred cccCcceeeeeeeecCccc-----ccCCCHHHHHHHHhCCceEEEEeeccccceEEEecCEeeCC-----chhhHHHHHH Q lcl|NC_020841. 184 YDKTDGVKTLHLKSLVSVV-----STDISQTQAASLKAACINYYSDYGNPDNSLPIFANGHAGGG-----KFFDFVMGFD 253 (367) Q Consensus 184 ~~~~~g~~t~~~k~l~Gv~-----~~~~t~t~~~~l~~~~~n~y~~~~~~~~~~~~~~~G~~~~G-----~~iD~~~~~d 253 (367) ..+-+ ......|.+.||. ...+++.|.+.|..+|+|++..+.+.+ ..+..++++++ .||-+.+-.+ T Consensus 488 ~~~g~-~~sPan~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~g~G---~~~wG~rT~~~~~s~~~~i~vRRl~~ 563 (666) T protein:vir:80 488 AVSQP-WMSPAGYNRGQIMNVVKLAIEPRKAHRDRLYQAAINPVIGAGGEG---FILMGDKTATTVPSPFDRINVRRLFN 563 (666) T ss_pred hcCCc-eEccCCeecceeeccccceeecChhHHHhhhhCCeeEEEEeCCCe---EEEEccccCCCCCcccceeehhhHHH Confidence 43311 1112334433332 235688999999999999998886543 35667777655 2588888899 Q ss_pred HHHHHHHHHHHHHHHhcCCCCcCHhHHHHHHHHHHHHHHHHHhcCceecccccCccccccccccccccceeEEcCchHhC Q lcl|NC_020841. 254 WLRNVIETNVFNGQRLRRLTPQTDRGMMMIKADIVNGLEEAVKAGLVAAGTWNGAALGEIETYDYLPTGYYVYNESIRDQ 333 (367) Q Consensus 254 wl~~~lq~~l~~ll~~~~kipy~~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~g~~~~~~~~~~gy~v~~~~~~~~ 333 (367) |++..|+...+..+-. |.|..=.+.|+..|+.-|++.+++|.|. ||.|.+. .++. T Consensus 564 ~i~~si~~~~~~~v~e----pn~~~l~~~i~~~i~~~L~~l~~~gal~--------------------g~~V~~d-~~~n 618 (666) T protein:vir:80 564 MLKKNIGDSSKYKLFE----NNDNFTRASFRMEVSQYLSTIRSLGGIY--------------------DFRVQCD-TTNN 618 (666) T ss_pred HHHHHHHHHHHHhccC----CCCHHHHHHHHHHHHHHHHHHHhcCcee--------------------eeEEEEc-CCCC Confidence 9999998888765543 5577778999999999999999999984 3778876 7789 Q ss_pred CHHHHhccccCCeEEEEEECceEEEEEEEEEecC Q lcl|NC_020841. 334 AQVIREQRIAPPFIILVKGAGAIHDTDITLIPEA 367 (367) Q Consensus 334 ~~~dr~~R~~~~~~~~~~~agaIh~v~i~~~v~~ 367 (367) |++|+.+.+. -+.+.+.....++.|.+++.=.+ T Consensus 619 t~~di~~G~~-~~~i~~~P~~Pae~I~~~~~~~~ 651 (666) T protein:vir:80 619 TPDVIDRNEF-VASMFIKPAKSINYIMLNFTAVA 651 (666) T ss_pred CHHHhhCCeE-EEEEEEEecCCcceEEEEEEEee Confidence 9999999988 59999999999999999987443 No 50 >protein:vir:98824 Length: 774 # NCBI annotation: putative phage tail sheath protein # Family: family:all:661 # MgeID: mge:1530 # MgeName: Ma-LMM01 # Cross-refs: genbank:acc:YP_851105;genbank:gi:117530262;genbank:GeneID:4484488 Probab=97.76 E-value=2.1e-05 Score=46.19 Aligned_cols=330 Identities=11% Similarity=0.023 Sum_probs=154.9 Q ss_pred Cccccccc---------------------ccceEEEEEee-eccccccccccceEEEeeccccC-cccce-EEEecHHHH Q lcl|NC_020841. 1 MAGSLTLP---------------------INMLVNVSIEY-QAKLLSRDAFNRLLIVGSTAPNG-RATDT-GIYTSIDGV 56 (367) Q Consensus 1 ~~~~~~l~---------------------i~~iv~V~i~~-~~~~~~~~~fg~~li~~~~~~~~-~~~~~-~~yts~~~v 56 (367) +..+..-+ ..+...|+..- ...+.....|....++....... ....+ ........+ T Consensus 376 ItV~I~~~t~~~~~l~v~~~~~s~f~~~~a~e~~tv~~~~~~~~~~v~e~~dn~~i~~~~~~~~~~~in~vs~lv~~~~~ 455 (774) T protein:vir:98 376 VTVSIYPVNNSEFRLNVQDLNGSAFNPPLADEVYTVKLGDTNESGELNALLDSKFIRGFFLPKSIDSINYDAALVRQSPL 455 (774) T ss_pred eEEEEEecCCceeEEEEEecCCccccccccceeEEEecccccccceeeeeeceeeEeecccccccccccccccccccchh Confidence 11111000 01111111000 00011111222222211110000 00000 000000000 Q ss_pred HhccCCCcHHHHHHHHHhccC-cccceEEEEeccCc----cchHHHHHHHHh--cccCcEEEEEEecCCHHHHHHHHHHh Q lcl|NC_020841. 57 KLDYGVEADEYKIAQKYFSQN-PKPRDLMIATVTAL----TDPLASIGEVAA--KTLGFYAFCFASEVAAADIQGLAEWA 129 (367) Q Consensus 57 ~~df~~~s~~ykaA~~~F~Q~-p~p~~v~v~~~~~~----~t~~~~l~~~~~--~~~~w~~~~~~~~~~~~~~~ala~~~ 129 (367) ... .. ++.-..+...-.+. -.+..+.+...... .+..+.+..... ....++.+ ...........++..++ T Consensus 456 ~~a-~~-d~~~~~~~~~~~~~~~~~~~~v~v~lagG~Dg~~tt~~~igg~~~~~~~tgi~aL-l~a~~~~~V~~aii~~~ 532 (774) T protein:vir:98 456 RLA-PP-DESETDVENPAHVDFYGPNVLVDVTLENGYDGPPVTNDDYVSIIRTLENQPVHIL-LVGTTNVGVQQALITEA 532 (774) T ss_pred ccc-cc-ccccccccccccccccCCcceEEEeecCCCCcccccchheecccccccccceeEE-EcCccchhhHHHHHHHH Confidence 000 00 00000000000000 01112222221111 111111111111 11233333 23333444555555555 Q ss_pred hc----cCcEEEEEEeCch-hhhHHHHHHHHhccccceeecCC----c--------hhHHHHHHHHHHHcccccCcceee Q lcl|NC_020841. 130 QS----NNRMFMTVMTDDT-EAVTTGNALKELGQYHYCITYHE----D--------YATVGAVAGMALDQRYDKTDGVKT 192 (367) Q Consensus 130 ea----~~~~~~~~~~d~~-~~~~~~~~~~~~~~~~~~~~~~~----~--------~~~~~~~~~~~~~~~~~~~~g~~t 192 (367) +. ...++.+...... ...........++..+..+.+.. + .+..+.++|..+..++...|. T Consensus 533 e~~~~~~~~r~avid~p~g~t~~~Ai~~r~~f~S~~aal~~Pwvkv~D~~~g~~~~vPpSg~vAGl~ArtDv~kSPA--- 609 (774) T protein:vir:98 533 ERASDSDGLRIAVLAAPPRTTPTLAASVTRGFNSTRAVMVAGWFTYAGQPNSSRYGVPGAAVYAGKLAAIDFFVSPA--- 609 (774) T ss_pred HHhhhcccceEEEEECCCCCCHHHHHHHHhccCCceEEEEeCcEEEeccCCCceeecChhHHHHHHHHhcCcccccC--- Confidence 53 2344443332221 11222223334444443333321 1 233567777777766544333 Q ss_pred eeeeecCccc--------ccCCCHHHHHHHHhCCceEEE-EeeccccceEEEecCEeeCC----chhhHHHHHHHHHHHH Q lcl|NC_020841. 193 LHLKSLVSVV--------STDISQTQAASLKAACINYYS-DYGNPDNSLPIFANGHAGGG----KFFDFVMGFDWLRNVI 259 (367) Q Consensus 193 ~~~k~l~Gv~--------~~~~t~t~~~~l~~~~~n~y~-~~~~~~~~~~~~~~G~~~~G----~~iD~~~~~dwl~~~l 259 (367) .|.+.|+. .+..++.+.+.|..+++|..+ ...+ +++ .+..+.++++ .||-+.+-.+|++..| T Consensus 610 --Nk~I~Givg~ai~~~l~~~~t~ae~d~Ln~~gIN~i~itt~g--~G~-rvWG~RTlssDp~wr~InVRRlfd~Ie~SI 684 (774) T protein:vir:98 610 --ARSLVGPLFNIIESDTDNYTSRSNQDIYSAARLEVLSLDTVD--RTY-RFASGVTLSTDPAWERIYLRRVHDVVRQGA 684 (774) T ss_pred --CceeecceeccccccccccccchhhhhhcccccceeEEEEcC--CcE-EEEcccccCCCcccceEeehhhHHHHHHHH Confidence 45555553 223467888889999999876 3333 233 3445555555 3788999999999999 Q ss_pred HHHHHHHHHhcCCCCcCHhHHHHHHHHHHHHHHHHHhcCceecccccCcccccccccccccccee-EEcCchHhCCHHHH Q lcl|NC_020841. 260 ETNVFNGQRLRRLTPQTDRGMMMIKADIVNGLEEAVKAGLVAAGTWNGAALGEIETYDYLPTGYY-VYNESIRDQAQVIR 338 (367) Q Consensus 260 q~~l~~ll~~~~kipy~~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~g~~~~~~~~~~gy~-v~~~~~~~~~~~dr 338 (367) +..+...+- + |.++.....|+..++.-|+..++.|.|.- |. +.. +.+..+++++ T Consensus 685 ~~~~~~~Vf---E-PNd~~l~~~I~~sI~~fL~~L~~~GaL~G--------------------~~~V~~-D~etNt~~dI 739 (774) T protein:vir:98 685 HAILRNYVA---M-PNSRLVRNQIAAALNAFMGELKRNGNIVS--------------------FRPAII-DGSNNSTAAY 739 (774) T ss_pred HHHHHHhcc---C-CCCHHHHHHHHHHHHHHHHHHHhCCceec--------------------ceEEEE-cCCCCCHHHh Confidence 988777543 2 67999999999999999999999999953 32 332 3666788898 Q ss_pred hccccCCeEEEEEECceEEEEEEEEEecC Q lcl|NC_020841. 339 EQRIAPPFIILVKGAGAIHDTDITLIPEA 367 (367) Q Consensus 339 ~~R~~~~~~~~~~~agaIh~v~i~~~v~~ 367 (367) .+.+. -+.+.+.....++.|.+++.-+. T Consensus 740 ~~G~l-~i~I~vaP~~PAEfIilri~q~t 767 (774) T protein:vir:98 740 FSREL-YVSLQFQPLYSADYIYVTISRDT 767 (774) T ss_pred hCCEE-EEEEEEEecCCcceEEEEEEEee Confidence 88877 48899999999999999988777 No 51 >protein:vir:5833 Length: 742 # NCBI annotation: similar to tail sheath protein # Family: family:all:661 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835619;genbank:gi:30044022 Probab=97.75 E-value=2.2e-05 Score=46.09 Aligned_cols=324 Identities=12% Similarity=0.025 Sum_probs=149.9 Q ss_pred Cccccc-cccc----------ceEEEEEeeeccccccccccce-EEEeeccccCcccceEEEecHHHHHhccCCCcHHHH Q lcl|NC_020841. 1 MAGSLT-LPIN----------MLVNVSIEYQAKLLSRDAFNRL-LIVGSTAPNGRATDTGIYTSIDGVKLDYGVEADEYK 68 (367) Q Consensus 1 ~~~~~~-l~i~----------~iv~V~i~~~~~~~~~~~fg~~-li~~~~~~~~~~~~~~~yts~~~v~~df~~~s~~yk 68 (367) +.+... .|++ ...+.++...-.+....-++.+ ++++.....++......+.. .......... T Consensus 371 ~~~~~e~v~~ngG~~f~v~s~~~~g~~i~~~~as~~~s~ln~~~~V~Gt~aa~~~~d~~t~~~v------~s~~~alp~~ 444 (742) T protein:vir:58 371 RFTRIEQITLSGGASFSVISNQPYGFNIQDSRHSYWLSPFKDDELIIGTELVLPALDVSTEFGV------SSWEEALPEF 444 (742) T ss_pred cccccceeecccCcceEEEEecccCcceeccCcceEEeccCCceEEEeehhhccccccchheec------ccccccccee Confidence 111110 1222 1111222221111111122222 33433222111110000000 0000000000 Q ss_pred HHHHHhccCccc-------ceEEEEeccCcc---chHHHHHHHHhcccCcEEEEEEecCC-HHHHHHHHHHhhccCcEEE Q lcl|NC_020841. 69 IAQKYFSQNPKP-------RDLMIATVTALT---DPLASIGEVAAKTLGFYAFCFASEVA-AADIQGLAEWAQSNNRMFM 137 (367) Q Consensus 69 aA~~~F~Q~p~p-------~~v~v~~~~~~~---t~~~~l~~~~~~~~~w~~~~~~~~~~-~~~~~ala~~~ea~~~~~~ 137 (367) .....|+.+..+ ..-.++.....+ ..-+.+.++.+. .+ ..++.+...+ .+...++...++.-.+++. T Consensus 445 a~sv~laGG~dg~v~v~~~~~D~iG~~~~~d~~~adrTGL~ALlev-~e-VtILiAPG~t~~~v~aav~A~la~a~~Rl~ 522 (742) T protein:vir:58 445 SFLMPFQGGSDGYIRVDENEPDTIGRVKITPALLANYERLLPLLTE-DQ-FDLVLTPYLTFADHAGTVNAFINRAENRFL 522 (742) T ss_pred eEEEeecCCccccccccCCCcccccccccccccccchhHHHHhhhc-CC-CcEEEEcCCCchHHHHHHHHHHHhhcCCeE Confidence 000111111000 000011110000 001112222221 11 1223332222 2333344444443333332 Q ss_pred EEE-eCch--hhhHHHHHHHHhccccceeecC------C----chhHHHHHHHHHHHcccccCcce-eeeeeeecCcccc Q lcl|NC_020841. 138 TVM-TDDT--EAVTTGNALKELGQYHYCITYH------E----DYATVGAVAGMALDQRYDKTDGV-KTLHLKSLVSVVS 203 (367) Q Consensus 138 ~~~-~d~~--~~~~~~~~~~~~~~~~~~~~~~------~----~~~~~~~~~~~~~~~~~~~~~g~-~t~~~k~l~Gv~~ 203 (367) ... .+.. ...........++..+..+.+. . ..+..+.++|..+..+.. +|- ..-..|.+ +.. T Consensus 523 vL~D~P~~~tt~~~A~a~r~~~nSsraaly~PwVkv~d~~~~r~vPpSgaIAGL~ARtD~e--rGvw~SPANrgi--i~~ 598 (742) T protein:vir:58 523 YLFDIAGDDDTENLAISLAGYINSSFATTFFPWVRRLTNKGMRTVPASLAAYRSIRTTDPE--TGLAPVGARRGV--VTG 598 (742) T ss_pred EEEecCCCCchHHHHHHHHhccCCceEEEEeceeeeccCCcceeechHHHHHHHHHHhccC--CceEecCCccee--eec Confidence 222 1111 1111111222223333332221 0 112345666666665432 331 11112222 223 Q ss_pred cCCCHHHHHHHHhCCceEEEEeeccccceEEEecCEeeCC-----chhhHHHHHHHHHHHHHHHHHHHHHhcCCCCcCHh Q lcl|NC_020841. 204 TDISQTQAASLKAACINYYSDYGNPDNSLPIFANGHAGGG-----KFFDFVMGFDWLRNVIETNVFNGQRLRRLTPQTDR 278 (367) Q Consensus 204 ~~~t~t~~~~l~~~~~n~y~~~~~~~~~~~~~~~G~~~~G-----~~iD~~~~~dwl~~~lq~~l~~ll~~~~kipy~~~ 278 (367) ...+++|.+.|..+++|+...++ +++ .+..+.++.+ .||-+.+-.+|++..|+..+...+-. |-|+. T Consensus 599 ~~~s~se~d~LN~~GINtIrsfG---~G~-rlWGnRTlassDs~wryInVRRlfd~Ie~SI~~a~q~~VfE----PNd~~ 670 (742) T protein:vir:58 599 EPVRQVDWEDLYNNRINPIVRVG---NDV-LLFGQKTMLNVNSALNRINVRRLLIVMRNRISQILSSYLFE----NNTSE 670 (742) T ss_pred cccchhhHHHHhhCCceEEEECC---CcE-EEEcceecCCCCcccceEeehhhHHHHHHHHHHHHHHhccC----CCCHH Confidence 45678999999999999998773 233 4556676543 37889999999999998888765433 66888 Q ss_pred HHHHHHHHHHHHHHHHHhcCceecccccCccccccccccccccceeEEcCchHhCCHHHHhccccCCeEEEEEECceEEE Q lcl|NC_020841. 279 GMMMIKADIVNGLEEAVKAGLVAAGTWNGAALGEIETYDYLPTGYYVYNESIRDQAQVIREQRIAPPFIILVKGAGAIHD 358 (367) Q Consensus 279 G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~g~~~~~~~~~~gy~v~~~~~~~~~~~dr~~R~~~~~~~~~~~agaIh~ 358 (367) -...|+..|+.-|+...++|.|. ||.|... ++.+++|+.+.+. -+.+.+.....++. T Consensus 671 L~~sIk~sInafL~~L~aqGALl--------------------GfrV~lD--etNTpeDI~~Gkl-vv~I~vAP~~PAEf 727 (742) T protein:vir:58 671 NRLRAEALVRQYLESLRLRGAVT--------------------DYEVAID--SVTTPTDIDNNTL-RARVTVQPARSIEY 727 (742) T ss_pred HHHHHHHHHHHHHHHHHhCCcee--------------------eeEEEEc--CCCCHHHhhCCEE-EEEEEEEccCCcce Confidence 88999999999999999999984 3667765 3588889988877 48999999999999 Q ss_pred EEEEEEecC Q lcl|NC_020841. 359 TDITLIPEA 367 (367) Q Consensus 359 v~i~~~v~~ 367 (367) |++++.-.. T Consensus 728 I~lrf~it~ 736 (742) T protein:vir:58 728 IDITFVITP 736 (742) T ss_pred EEEEEEEEe Confidence 998776544 No 52 >protein:vir:5663 Length: 671 # NCBI annotation: tail sheath protein # Family: family:all:661 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899602;genbank:gi:34419589;genbank:GeneID:2545776 Probab=97.74 E-value=2.3e-05 Score=46.03 Aligned_cols=326 Identities=11% Similarity=0.057 Sum_probs=148.4 Q ss_pred Cccccccc----------ccceEEEEEeee----------ccccccccccceEEEeeccccCcccceEE--------Eec Q lcl|NC_020841. 1 MAGSLTLP----------INMLVNVSIEYQ----------AKLLSRDAFNRLLIVGSTAPNGRATDTGI--------YTS 52 (367) Q Consensus 1 ~~~~~~l~----------i~~iv~V~i~~~----------~~~~~~~~fg~~li~~~~~~~~~~~~~~~--------yts 52 (367) ...+-..+ .++...+.+... ........-+...++..... .....+.. ... T Consensus 270 ~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~ 348 (671) T protein:vir:56 270 DGGTRSINLSSYFTFGPSNSNQYAVIVRVSGEVEEAFIVSTNPGDKDVNGQSIFIDEYFE-NSGSAYITAIAEGWKTESG 348 (671) T ss_pred cccccccccceeecccccccccceeEEeecCccceeEEEeecccccccchhhhhhhhhhc-ccCceEEEecCcccCCccc Confidence 00000000 001011111110 00000000000000000000 00000000 000 Q ss_pred HHHHHh--ccCCCcHHHHHHHHHhccCc--ccceEEEEeccCcc-c-----hHHHHHHHHhcccCcEEEEEEe------c Q lcl|NC_020841. 53 IDGVKL--DYGVEADEYKIAQKYFSQNP--KPRDLMIATVTALT-D-----PLASIGEVAAKTLGFYAFCFAS------E 116 (367) Q Consensus 53 ~~~v~~--df~~~s~~ykaA~~~F~Q~p--~p~~v~v~~~~~~~-t-----~~~~l~~~~~~~~~w~~~~~~~------~ 116 (367) ...+.. |-......++.+-..|.... .|..+......... . ....+..+.+...+...++-.. . T Consensus 349 ~~~~~gg~d~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~ 428 (671) T protein:vir:56 349 AYNFGGGSDANAGADDWMFGLDMLSDPEVLYTNLVIAGNAAAEEVSIASTVQKYAIDSVGNVRQDCVVFVSPPQSLIVNK 428 (671) T ss_pred cccccCccccccchhHHHHHHHhhhhccccceeEEEcCCCCCccchhHHHHHHHHHHHHHhhcCCEEEEEeccccccccc Confidence 001111 11112223333333333221 12111111111100 0 0112233333333322221100 0 Q ss_pred CCHHHHHHHHHHhhccCcEEEEEEeCchhhhHHHHHHHHhccccceeecC------C------chhHHHHHHHHHHHccc Q lcl|NC_020841. 117 VAAADIQGLAEWAQSNNRMFMTVMTDDTEAVTTGNALKELGQYHYCITYH------E------DYATVGAVAGMALDQRY 184 (367) Q Consensus 117 ~~~~~~~ala~~~ea~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~------~------~~~~~~~~~~~~~~~~~ 184 (367) ........+.+|.+..... +... ......+...+..+.+. + ..+..+.++|..+..+. T Consensus 429 ~~~~~~~~~~~~~~~~~~~------~~~~----~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~AGl~Ar~D~ 498 (671) T protein:vir:56 429 QAGTAVANIQGWRTGIDPT------NGQA----VVDNLNVSTTYAVIDGNYKYQYDKYNDRNRWVPLAGDIAGLCAYTDQ 498 (671) T ss_pred cccccHHHHHHHhhhcccc------chhh----hhhhccCCcceEEEecCceEEecccCCceeEechHHHHHHHHHHhhc Confidence 0112333344444322100 0000 00000001111111110 0 11234566777776654 Q ss_pred ccCcceeeeeeeecC---ccc--ccCCCHHHHHHHHhCCceEEEEeeccccceEEEecCEeeCC-----chhhHHHHHHH Q lcl|NC_020841. 185 DKTDGVKTLHLKSLV---SVV--STDISQTQAASLKAACINYYSDYGNPDNSLPIFANGHAGGG-----KFFDFVMGFDW 254 (367) Q Consensus 185 ~~~~g~~t~~~k~l~---Gv~--~~~~t~t~~~~l~~~~~n~y~~~~~~~~~~~~~~~G~~~~G-----~~iD~~~~~dw 254 (367) .+-+ ......|.+. |+. ...+++.|.+.|..+|+|+...+.+.+ ..+...+++++ .||-+.+-.+| T Consensus 499 ~~g~-~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G---~~~wG~rT~~~~~~~~~~i~vrR~~~~ 574 (671) T protein:vir:56 499 VSQP-WMSPAGFNRGQIKGVNRLAVDLRRAHRDALYQIGINPVVGFAGQG---FVLYGDKTATQQASAFDRINVRRLFNL 574 (671) T ss_pred cCCc-EECcCCceeccccccccceeecChhHHHHHhhCCceEEEEecCCe---EEEEcceecCCCCcccceEehhhHHHH Confidence 4311 1111233333 332 235788999999999999999886543 24566676554 37888999999 Q ss_pred HHHHHHHHHHHHHHhcCCCCcCHhHHHHHHHHHHHHHHHHHhcCceecccccCccccccccccccccceeEEcCchHhCC Q lcl|NC_020841. 255 LRNVIETNVFNGQRLRRLTPQTDRGMMMIKADIVNGLEEAVKAGLVAAGTWNGAALGEIETYDYLPTGYYVYNESIRDQA 334 (367) Q Consensus 255 l~~~lq~~l~~ll~~~~kipy~~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~g~~~~~~~~~~gy~v~~~~~~~~~ 334 (367) ++..|+..++..+-. |.+..=...|+..|+.-|+..++.|.|. ||.|.+. .++.| T Consensus 575 i~~si~~~~~~~v~e----pn~~~~~~~i~~~i~~fL~~l~~~gal~--------------------g~~v~~d-~~~nt 629 (671) T protein:vir:56 575 LKKAISDAAKYRLFE----LNDEFTRSSFKSEIDAYLTNIQDLGGVY--------------------DFRVVCD-ETNNP 629 (671) T ss_pred HHHHHHHHHHHhcCC----CCCHHHHHHHHHHHHHHHHHHHhCCcee--------------------eeEEEEc-CCCCC Confidence 999999888865443 5577777899999999999999999984 4788877 77899 Q ss_pred HHHHhccccCCeEEEEEECceEEEEEEEEEecC Q lcl|NC_020841. 335 QVIREQRIAPPFIILVKGAGAIHDTDITLIPEA 367 (367) Q Consensus 335 ~~dr~~R~~~~~~~~~~~agaIh~v~i~~~v~~ 367 (367) ++|+.+.+. -+.+.+.....++.|++++.=.. T Consensus 630 ~~~i~~G~~-~~~i~~~p~~Pae~I~~~~~~~~ 661 (671) T protein:vir:56 630 GSVIDRNEF-VASIYVKPAKSINFITLNFVATS 661 (671) T ss_pred HHHhhCCeE-EEEEEEEecCCcceEEEEEEEee Confidence 999999988 59999999999999999886333 No 53 >protein:vir:108052 Length: 660 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595294;genbank:gi:161622600;genbank:GeneID:5783771 Probab=97.66 E-value=3.2e-05 Score=45.22 Aligned_cols=330 Identities=11% Similarity=0.015 Sum_probs=159.9 Q ss_pred Ccc----cccccccceEEEEEee-eccccccccccceEEEeeccccCcccceEEEecHHHHHhccCCCcHHHHHHHHHhc Q lcl|NC_020841. 1 MAG----SLTLPINMLVNVSIEY-QAKLLSRDAFNRLLIVGSTAPNGRATDTGIYTSIDGVKLDYGVEADEYKIAQKYFS 75 (367) Q Consensus 1 ~~~----~~~l~i~~iv~V~i~~-~~~~~~~~~fg~~li~~~~~~~~~~~~~~~yts~~~v~~df~~~s~~ykaA~~~F~ 75 (367) .+. ...+|........... ........+.- .+++. ... ...+.+...+...+.. ...+..+ ....+. T Consensus 245 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~v~-~~g-~~~~~~~~~~~~~~~~---~~~~~~~--~~~~~~ 316 (660) T protein:vir:10 245 EAGSSKMLDVYPGGGTRASIAKAVFNYGPQTDDQY-AIIVR-RDG-AIVESVVLSTKEGEKD---VYGNNIY--LDDYFA 316 (660) T ss_pred CCcceeEEeeeeccceeeEEeeeeccccccccccc-ccccc-cCC-cccceeeeeccccccc---cccceee--eehhhc Confidence 110 0011111100000000 00011111100 11111 110 0111121111111100 0000000 000111 Q ss_pred cC------------c---ccceEEEEecc-----CccchHHHHHHHHhcccCcEEEEEEecC-------CHHHHHHHHHH Q lcl|NC_020841. 76 QN------------P---KPRDLMIATVT-----ALTDPLASIGEVAAKTLGFYAFCFASEV-------AAADIQGLAEW 128 (367) Q Consensus 76 Q~------------p---~p~~v~v~~~~-----~~~t~~~~l~~~~~~~~~w~~~~~~~~~-------~~~~~~ala~~ 128 (367) .+ | .+..-..++.+ ...+....+..+.........+++.... ...-..++..+ T Consensus 317 ~~~~~~v~~~~~~~~~~~~~~~~l~gg~~~~~~~t~~~~~~~~~~~~~~~~~~~~~l~~p~~~~~~~~~~~~v~~al~~~ 396 (660) T protein:vir:10 317 KGTSNYIYATSLNWPKGFSGIINLSGGISANDKVTAGDLMQGWDLFADREALHINLLIAGAVAGEGDEVASTVQKHVVSI 396 (660) T ss_pred CCCccEEEEEeccCCCCcccceeeeccccCccccccchhhhhhhhhhhhhhcccceEEEcCcCCCchhhhHHHHHHHHHH Confidence 00 0 11111111111 1112223344443332222223332211 12345556677 Q ss_pred hhccCcEEEEEEeC------ch---hhhHHHHHHHH----------hccccceeecC------C------chhHHHHHHH Q lcl|NC_020841. 129 AQSNNRMFMTVMTD------DT---EAVTTGNALKE----------LGQYHYCITYH------E------DYATVGAVAG 177 (367) Q Consensus 129 ~ea~~~~~~~~~~d------~~---~~~~~~~~~~~----------~~~~~~~~~~~------~------~~~~~~~~~~ 177 (367) ++....+|.++-.. .. ........... +...+..+.+. + -.+..+.++| T Consensus 397 ~~~~~~~~aiid~p~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~sg~~AG 476 (660) T protein:vir:10 397 ADERQDCLAFISPPKGLLVNVPLTRAVDNLIDWRTGAGTFDANNMNISTTYAAIDGNYKYQYDKYNDVNRWVPLAADLAG 476 (660) T ss_pred HHhhCCEEEEEecCcccccccccccCHHHHHHHHhhcccccccccccCcceEEEEcCceEEecccCCceeEechhHHHHH Confidence 77776666554321 00 11111111111 11111111111 0 0234567777 Q ss_pred HHHHcccccCcceeeeeeeecCccc-----ccCCCHHHHHHHHhCCceEEEEeeccccceEEEecCEeeCC-----chhh Q lcl|NC_020841. 178 MALDQRYDKTDGVKTLHLKSLVSVV-----STDISQTQAASLKAACINYYSDYGNPDNSLPIFANGHAGGG-----KFFD 247 (367) Q Consensus 178 ~~~~~~~~~~~g~~t~~~k~l~Gv~-----~~~~t~t~~~~l~~~~~n~y~~~~~~~~~~~~~~~G~~~~G-----~~iD 247 (367) ..+..+..+-+ ...-.+|.+.||. ...+++.|.+.|..+|+|++.++-+. +++ .+...+++.+ .||- T Consensus 477 l~Ar~D~~~g~-~~sPan~~~~~i~g~~~~~~~~~~~e~~~Ln~~gIn~i~~~~~~-~G~-~~wG~rT~~~~~s~~~~i~ 553 (660) T protein:vir:10 477 LCARTDDVSQP-WMSPAGYNRGQILNVLKLAIEPRQAQRDRMYQEAINPVVGFAGG-DGF-VLFGDKTATKVPSPMDHIN 553 (660) T ss_pred HHHHhhccCCc-EEccCCeeeceeeccceeeecCChhhHHhHhhCCceEEEEeeCC-CcE-EEEcccccCCCCcccceEe Confidence 77766543311 1122344444332 23578999999999999999887432 222 4556666544 2678 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCCcCHhHHHHHHHHHHHHHHHHHhcCceecccccCccccccccccccccceeEEc Q lcl|NC_020841. 248 FVMGFDWLRNVIETNVFNGQRLRRLTPQTDRGMMMIKADIVNGLEEAVKAGLVAAGTWNGAALGEIETYDYLPTGYYVYN 327 (367) Q Consensus 248 ~~~~~dwl~~~lq~~l~~ll~~~~kipy~~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~g~~~~~~~~~~gy~v~~ 327 (367) +.+-.+|++..|+...+..+-. |.++.-.+.|+..++.-|+..+++|.|. ||.|.+ T Consensus 554 vrR~~~~i~~si~~~~~~~v~e----pn~~~l~~~i~~~i~~fL~~l~~~gal~--------------------g~~V~~ 609 (660) T protein:vir:10 554 VRRLFNMLKKNIGDASKYKLFE----LNDNFTRSSFRMEVSQYLDGIKALGGIY--------------------EGRVVC 609 (660) T ss_pred hhhHHHHHHHHHHHHHHHhccC----CCCHHHHHHHHHHHHHHHHHHHhCCcee--------------------eeEEEE Confidence 8889999999999888876543 5588889999999999999999999995 377877 Q ss_pred CchHhCCHHHHhccccCCeEEEEEECceEEEEEEEEEecC Q lcl|NC_020841. 328 ESIRDQAQVIREQRIAPPFIILVKGAGAIHDTDITLIPEA 367 (367) Q Consensus 328 ~~~~~~~~~dr~~R~~~~~~~~~~~agaIh~v~i~~~v~~ 367 (367) + .++.+++|+.+.+.. +.+.+.....++.|.+++.-.. T Consensus 610 d-~~~nt~~di~~G~~~-~~i~~~P~~pae~I~~~~~~~~ 647 (660) T protein:vir:10 610 D-TTVNTPAVIDRNEFI-ANIYVKPARSINYITLNFVATS 647 (660) T ss_pred c-CCCCCHHHhhCCeEE-EEEEEEecCCccEEEEEEEEee Confidence 6 678899999999884 9999999999999999877554 No 54 >protein:vir:101187 Length: 663 # NCBI annotation: tail sheath protein # Family: family:all:661 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932509;genbank:gi:37651635;genbank:GeneID:2610680 Probab=97.49 E-value=5.6e-05 Score=43.88 Aligned_cols=332 Identities=9% Similarity=-0.010 Sum_probs=157.5 Q ss_pred Cccccccc--ccceEEEEEeeeccccccccccceEE-Eeecc-c-----------cCcccce----------EEEecHHH Q lcl|NC_020841. 1 MAGSLTLP--INMLVNVSIEYQAKLLSRDAFNRLLI-VGSTA-P-----------NGRATDT----------GIYTSIDG 55 (367) Q Consensus 1 ~~~~~~l~--i~~iv~V~i~~~~~~~~~~~fg~~li-~~~~~-~-----------~~~~~~~----------~~yts~~~ 55 (367) .......| ..+.+.|.+.. ... ...+..+. +.... . ......+ ..+..+. T Consensus 220 ~~~~a~~~G~~Gn~i~v~i~~--~~~--~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s- 294 (663) T protein:vir:10 220 PLVSAVYPGEIGSTVEVEIVS--KTA--FNSGAQQTIYPFGGTRTSNARGVIQYGPMTDDQFAIIVRRDGIVVESTVLS- 294 (663) T ss_pred eeeeeecccccccceeEEecc--ccc--ccccccccccccccccccccceeeeeccccccceeEEEecCCcceeeeeee- Confidence 00000000 00111121111 100 00000000 00000 0 0000000 0000000 Q ss_pred HHhccCCCcHHHHHHHHHhccCcccc--------------eEEEE-eccC-----ccchHHHHHHHHhcccCcEEEEEEe Q lcl|NC_020841. 56 VKLDYGVEADEYKIAQKYFSQNPKPR--------------DLMIA-TVTA-----LTDPLASIGEVAAKTLGFYAFCFAS 115 (367) Q Consensus 56 v~~df~~~s~~ykaA~~~F~Q~p~p~--------------~v~v~-~~~~-----~~t~~~~l~~~~~~~~~w~~~~~~~ 115 (367) +..+-......-......|..+..+. .+.+. +.+. ..+....+..+.+...-...++++. T Consensus 295 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~gg~d~~~~~~~~~~~~~~~~l~~~~~~~~~~~i~~ 374 (663) T protein:vir:10 295 TRKGDRDVYGSNIFMDDYFRNGGSNFIFASSEGWPAGFTGIIQLGGGTSANADVGADELIKGWDLFSDREALHVNLMIAG 374 (663) T ss_pred ecccccccchhhhhhhhhhccCcceEEEEeecccCccccceeEcccccCCCccccchhhHHHHHhhhcccccceeEEEec Confidence 00000000000011222333321110 01111 1111 1122233444433322122223322 Q ss_pred c--C-C----HHHHHHHHHHhhccCcEEEEEEeCc---------hhhhHHHHHHHH-------------hccccceeecC Q lcl|NC_020841. 116 E--V-A----AADIQGLAEWAQSNNRMFMTVMTDD---------TEAVTTGNALKE-------------LGQYHYCITYH 166 (367) Q Consensus 116 ~--~-~----~~~~~ala~~~ea~~~~~~~~~~d~---------~~~~~~~~~~~~-------------~~~~~~~~~~~ 166 (367) . . + ..-..++...++....+|.+.-... ....+....... +...+.. .++ T Consensus 375 ~~~~~~~~~~~~v~~al~~~a~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~-l~~ 453 (663) T protein:vir:10 375 ACGSDGAEIASTVQKYVVSLADDRQDCVAIVNPPAELMVGIPTSTAVKNIVEWRNGMTGSGEVVDNNMNISSTYAF-IIG 453 (663) T ss_pred cCCCCchhhHHHHHHHHHHHHHhhCCEEEEEecCcccccccccccchHHHHHHHHhccccccchhhccccCcceEE-EEc Confidence 1 1 1 2234455566666655555443211 001111111111 0111111 111 Q ss_pred C-------------chhHHHHHHHHHHHcccccCcceeeeeeeecC---ccc--ccCCCHHHHHHHHhCCceEEEEeecc Q lcl|NC_020841. 167 E-------------DYATVGAVAGMALDQRYDKTDGVKTLHLKSLV---SVV--STDISQTQAASLKAACINYYSDYGNP 228 (367) Q Consensus 167 ~-------------~~~~~~~~~~~~~~~~~~~~~g~~t~~~k~l~---Gv~--~~~~t~t~~~~l~~~~~n~y~~~~~~ 228 (367) + ..+..+.++|..+..+..+-+ ......|.+. |+. ...+++.|.+.|..+|+|.+..+-+. T Consensus 454 P~~~~~d~~~~~~~~~p~s~~vAGl~Ar~D~~~g~-~~sPan~~~~~i~g~~~~~~~~~~~~~~~Ln~~gin~i~~~~~~ 532 (663) T protein:vir:10 454 NYKYQYDKYNDINRWVPLAADIAGLCAYTDQVSHP-WMSPAGYRRGQIRNCIKLAIEPKQSMRDTMYQVAINPVTGFAGG 532 (663) T ss_pred cceEEecccCCceEEechhHHHHHHHHHhhccCCc-eEccCCceeccccccccceeccChhHHHHHhhCCceEEEEEeCC Confidence 1 123456777777766544411 1122233332 322 24688999999999999999887542 Q ss_pred ccceEEEecCEeeCC-----chhhHHHHHHHHHHHHHHHHHHHHHhcCCCCcCHhHHHHHHHHHHHHHHHHHhcCceecc Q lcl|NC_020841. 229 DNSLPIFANGHAGGG-----KFFDFVMGFDWLRNVIETNVFNGQRLRRLTPQTDRGMMMIKADIVNGLEEAVKAGLVAAG 303 (367) Q Consensus 229 ~~~~~~~~~G~~~~G-----~~iD~~~~~dwl~~~lq~~l~~ll~~~~kipy~~~G~~~l~~~v~~vl~~a~~~G~I~~g 303 (367) +++ .+...+++++ .||-+.+-.+|++..|+......+-. |.+..-...|+..|+.-|++.+++|.|. T Consensus 533 -~G~-~~wG~rT~s~~~s~~~~i~vrR~~~~i~~si~~~~~~~v~e----~n~~~l~~~i~~~i~~~L~~l~~~gal~-- 604 (663) T protein:vir:10 533 -DGF-VLFGDKMATQVPSPFDRINVRRLFNMLKKNIGDTSKYELFE----NNDAFTRQSFRMETSQYLDGIRSLGGCY-- 604 (663) T ss_pred -CcE-EEEcccccCCCCcccceEehhhHHHHHHHHHHHHHHHhccC----CCCHHHHHHHHHHHHHHHHHHHhcCcee-- Confidence 222 3555566544 25888899999999998888875443 5688888999999999999999999984 Q ss_pred cccCccccccccccccccceeEEcCchHhCCHHHHhccccCCeEEEEEECceEEEEEEEEEecC Q lcl|NC_020841. 304 TWNGAALGEIETYDYLPTGYYVYNESIRDQAQVIREQRIAPPFIILVKGAGAIHDTDITLIPEA 367 (367) Q Consensus 304 ~~~~~~~g~~~~~~~~~~gy~v~~~~~~~~~~~dr~~R~~~~~~~~~~~agaIh~v~i~~~v~~ 367 (367) ||.|.+. .++.|++|+.+.+. -+.+.+.....++.|.+++.-.. T Consensus 605 ------------------g~~v~~d-~~~nt~~~i~~G~~-~~~i~~~p~~pae~i~~~~~~~~ 648 (663) T protein:vir:10 605 ------------------DFRVVCD-TTNNTPNVIDRNEF-VGTIYVKPPRSINYITLNMVATS 648 (663) T ss_pred ------------------eeEEEEc-CCCCCHHHhhCCeE-EEEEEEEecCCcceEEEEEEEee Confidence 3778876 77889999999988 49999999999999999887555 No 55 >protein:vir:98263 Length: 664 # NCBI annotation: gp18 tail sheath monomer # Family: family:all:661 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239196;genbank:gi:66391671;genbank:GeneID:3416365 Probab=97.46 E-value=6.2e-05 Score=43.65 Aligned_cols=328 Identities=8% Similarity=0.021 Sum_probs=153.3 Q ss_pred Cccccccc-------ccceEEEEEeeeccccccccccceEEEeeccccCcccceEEEe------------cHHHHHhccC Q lcl|NC_020841. 1 MAGSLTLP-------INMLVNVSIEYQAKLLSRDAFNRLLIVGSTAPNGRATDTGIYT------------SIDGVKLDYG 61 (367) Q Consensus 1 ~~~~~~l~-------i~~iv~V~i~~~~~~~~~~~fg~~li~~~~~~~~~~~~~~~yt------------s~~~v~~df~ 61 (367) -.-...++ +.......+...++ ....+. +++- ... ...+.+.... ...... . T Consensus 245 ~~~~~~i~~~~~~~~~~~~~~~~~~~~~~--~~~~~~--~~~~-~~~-~~~e~~~~~~~~~~~~~~~~~~~~~~~~---~ 315 (664) T protein:vir:98 245 YDTGAMISGYPSGISVKNSGRSVMTYGPQ--TDNQYA--FVVR-RGG-IVQESFIVSTDKTDKDIYGVNIYMDDFF---A 315 (664) T ss_pred ccCcceEeeccCceecccceeeeeecccc--Ccccee--EEEe-cCC-ceeeeEEeecccCcccceeeeeechhhe---e Confidence 00000011 00000000100000 000000 0000 000 0001111000 000000 0 Q ss_pred CCcHHHHHHHHHhccCcccce-EE-EEeccC-----ccchHHHHHHHHhcccCcEEEEEEecC---CH----HHHHHHHH Q lcl|NC_020841. 62 VEADEYKIAQKYFSQNPKPRD-LM-IATVTA-----LTDPLASIGEVAAKTLGFYAFCFASEV---AA----ADIQGLAE 127 (367) Q Consensus 62 ~~s~~ykaA~~~F~Q~p~p~~-v~-v~~~~~-----~~t~~~~l~~~~~~~~~w~~~~~~~~~---~~----~~~~ala~ 127 (367) .....|..+... ........ +. .++.+. .....+.+.++.+...-.-.+++.... +. .-..++.. T Consensus 316 ~~~~~~~~~~~~-~~~~~~~~~~~~~gg~~~~~~~g~~~~~tgl~~l~~~~~~~~~ll~~p~~~~~~~~~~~~v~~al~~ 394 (664) T protein:vir:98 316 NGGSQYVFGTSM-NWPKGFSGILEFGGGLSSNDTVGADELMTGWDMFADREALHVPLLIAGGCAGESVEIASTVQKHVIS 394 (664) T ss_pred cccceeeeeecc-cCCcccceeEeccCccccccccCchhHHHHHHhhhcccccccceEEecCCCCCcHHHHHHHHHHHHH Confidence 000011110000 00000000 01 111111 112223344443322111122333221 11 23444555 Q ss_pred HhhccCcEEEEEEe------Cch---hhhHHHHHHH--------------HhccccceeecC------C------chhHH Q lcl|NC_020841. 128 WAQSNNRMFMTVMT------DDT---EAVTTGNALK--------------ELGQYHYCITYH------E------DYATV 172 (367) Q Consensus 128 ~~ea~~~~~~~~~~------d~~---~~~~~~~~~~--------------~~~~~~~~~~~~------~------~~~~~ 172 (367) .++....+|.+.-. +.. ...+...... .+...+..+.+. + ..+.. T Consensus 395 ~a~~~~~~~a~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~l~~p~~~~~d~~~~~~~~~p~s 474 (664) T protein:vir:98 395 IGDERQDCTVFVSPPRSLLVNIPLATAVDNIVEWRTGYKISGGTPVDNNLNVSSSYGFLDGNYKYQYDKYNDVNRWVPLA 474 (664) T ss_pred HHHhcCCeEEEEccccceeccCCccccHHHHHHHhhhccccccchhhhhcCCccceEEEEcCeEEEecccCCceEEechH Confidence 66665555543321 111 1111100000 011111122111 1 02345 Q ss_pred HHHHHHHHHcccccCcceeeeeeeecCccc-----ccCCCHHHHHHHHhCCceEEEEeeccccceEEEecCEeeCC---c Q lcl|NC_020841. 173 GAVAGMALDQRYDKTDGVKTLHLKSLVSVV-----STDISQTQAASLKAACINYYSDYGNPDNSLPIFANGHAGGG---K 244 (367) Q Consensus 173 ~~~~~~~~~~~~~~~~g~~t~~~k~l~Gv~-----~~~~t~t~~~~l~~~~~n~y~~~~~~~~~~~~~~~G~~~~G---~ 244 (367) +.++|..+..+..+-+ ......|.+.||. ...+++.|.+.|..+|+|.+..+.+. +++ .+...+++.+ + T Consensus 475 g~~AGl~A~~D~~~g~-~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~-~G~-~~wG~rT~~~~~s~ 551 (664) T protein:vir:98 475 GDIAGLCVYTDSVANP-WMSPAGYNRGQIRNCIKLAIEPRTAHRDAMYQVQINPVTGFAGG-SGF-VLYGDKTLTSVPSP 551 (664) T ss_pred HHHHHHHHHhhhcCCc-EECcCCceeeeeeccccceeecChhhHHHHHhCCCeEEEEeeCC-CcE-EEEcccccCCCCcc Confidence 6677777766543311 1222334333332 24578899999999999999887542 222 4566666554 2 Q ss_pred --hhhHHHHHHHHHHHHHHHHHHHHHhcCCCCcCHhHHHHHHHHHHHHHHHHHhcCceecccccCccccccccccccccc Q lcl|NC_020841. 245 --FFDFVMGFDWLRNVIETNVFNGQRLRRLTPQTDRGMMMIKADIVNGLEEAVKAGLVAAGTWNGAALGEIETYDYLPTG 322 (367) Q Consensus 245 --~iD~~~~~dwl~~~lq~~l~~ll~~~~kipy~~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~g~~~~~~~~~~g 322 (367) ||-+.+-.+|++..|+..++..+-. |.+..=...|+..|+.-|+..+++|.|. | T Consensus 552 ~~~i~vrR~~~~i~~si~~~~~~~v~e----pn~~~l~~~i~~~i~~~L~~l~~~gal~--------------------g 607 (664) T protein:vir:98 552 FDRINVRRLFNMIKKDIGDNAKYKLFE----NNDDFTRASFRMDTGQYMTNIRALGGCY--------------------D 607 (664) T ss_pred cceEeehhHHHHHHHHHHHHHHHhhcC----CCCHHHHHHHHHHHHHHHHHHHhcCcee--------------------e Confidence 5888888899999888888765543 5678888999999999999999999984 4 Q ss_pred eeEEcCchHhCCHHHHhccccCCeEEEEEECceEEEEEEEEEecC Q lcl|NC_020841. 323 YYVYNESIRDQAQVIREQRIAPPFIILVKGAGAIHDTDITLIPEA 367 (367) Q Consensus 323 y~v~~~~~~~~~~~dr~~R~~~~~~~~~~~agaIh~v~i~~~v~~ 367 (367) |.|.+. .++.|++|+.+.+. -+.+.+...-.++.|.+++.-.. T Consensus 608 ~~V~~d-~~~nt~~~i~~G~~-~~~i~~~p~~pae~I~~~~~q~~ 650 (664) T protein:vir:98 608 YRVICD-TTNNTPDVIDRNEF-VATVYVKPPRSINYITLNFVATS 650 (664) T ss_pred eEEEEc-CCCCCHHHhhCCeE-EEEEEEEecCCcceEEEEEEEee Confidence 788887 77899999999988 59999999999999999876544 No 56 >protein:vir:106427 Length: 679 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944106;genbank:gi:38640150;genbank:GeneID:2658188 Probab=97.30 E-value=0.0001 Score=42.45 Aligned_cols=328 Identities=11% Similarity=0.069 Sum_probs=156.4 Q ss_pred Ccccc---------ccc-------ccceEE-----------------EEEeeecc-ccccccccceEEEeeccccCcccc Q lcl|NC_020841. 1 MAGSL---------TLP-------INMLVN-----------------VSIEYQAK-LLSRDAFNRLLIVGSTAPNGRATD 46 (367) Q Consensus 1 ~~~~~---------~l~-------i~~iv~-----------------V~i~~~~~-~~~~~~fg~~li~~~~~~~~~~~~ 46 (367) -.... ..| ...+++ +....... +..... ...+++ .... ...+. T Consensus 232 ~~g~~gn~i~v~~va~~~~~~~~~~~a~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~vvv-~~~g-~~~~~ 308 (679) T protein:vir:10 232 YAGTYGDNIKVLMIAYKDYYKFNEAGKIVSVNTINPKVFPTGLDYGNVTPSSYLEFGPQNES-QFAFIV-FNNG-VAVES 308 (679) T ss_pred cccccCCcceEEEEeecccccccccccccccccccccccccccccccceeeeeccccccccc-ceeeEE-eccc-ccccc Confidence 00000 000 000000 00000000 000000 000000 0000 00111 Q ss_pred eEEEecHHHHHhccCCCcHHHHHHHHHhccC------------cc-cce-EEE-EeccCc-----cchHHHHHHHHhccc Q lcl|NC_020841. 47 TGIYTSIDGVKLDYGVEADEYKIAQKYFSQN------------PK-PRD-LMI-ATVTAL-----TDPLASIGEVAAKTL 106 (367) Q Consensus 47 ~~~yts~~~v~~df~~~s~~ykaA~~~F~Q~------------p~-p~~-v~v-~~~~~~-----~t~~~~l~~~~~~~~ 106 (367) +...+...+. +... .......+|..+ |. +.. +-+ ++.+.. ......+..+..... T Consensus 309 ~~~~~~~~~~---~~~~--~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~~~~ 383 (679) T protein:vir:10 309 KILSTKPGDR---DIYG--TSIYINEYFGNGYSSFVQGVAESWPVGYTGVLAFGGGQSSNTDISAAEFMKGWDMFADREH 383 (679) T ss_pred eeeecccccc---cccc--hhhhhhhhhcCcccceeeeccccccccccceeeccCCccCCCccchhhhhhhhhhhhcccc Confidence 1111111110 0000 001111112111 00 000 111 111110 111112222222111 Q ss_pred CcEEEEEEecC-------CHHHHHHHHHHhhccCcEEEEEEeC------ch--hhhHHHHHHHH--------------hc Q lcl|NC_020841. 107 GFYAFCFASEV-------AAADIQGLAEWAQSNNRMFMTVMTD------DT--EAVTTGNALKE--------------LG 157 (367) Q Consensus 107 ~w~~~~~~~~~-------~~~~~~ala~~~ea~~~~~~~~~~d------~~--~~~~~~~~~~~--------------~~ 157 (367) ...-+++.... ...-..++-..++....+|.++..- .. ...+.....+. +. T Consensus 384 ~~~~~l~~p~~~~~~~~~~~~v~~~l~~~~~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~ 463 (679) T protein:vir:10 384 TDVNLFIAGAVAGEGAQIASTVQKAVVAIADERRDCLVLISPPREYMINQPAASVVRKLVDWRRGVNQAGISLDDNMNIG 463 (679) T ss_pred cccceEEecCCCCCchhhhHHHHHHHHHHHHhhCCeEEEEeccccccccccccchHHHHHHHHhhcccccchhhhhhccC Confidence 12223333221 1234556667777777676654321 00 01111111111 00 Q ss_pred cccceeecCC-------------chhHHHHHHHHHHHcccccCcceeeeeeeecCccc-----ccCCCHHHHHHHHhCCc Q lcl|NC_020841. 158 QYHYCITYHE-------------DYATVGAVAGMALDQRYDKTDGVKTLHLKSLVSVV-----STDISQTQAASLKAACI 219 (367) Q Consensus 158 ~~~~~~~~~~-------------~~~~~~~~~~~~~~~~~~~~~g~~t~~~k~l~Gv~-----~~~~t~t~~~~l~~~~~ 219 (367) ..+.. .+++ ..+..+.++|..+-.+..+-+ ......|.+.||. .-.+++.|.+.|..+|+ T Consensus 464 s~~~~-~~~p~~~~~d~~~~~~~~~p~sg~vAGl~Ar~D~~~g~-~~sPan~~~~~i~g~~~~~~~~~~~~~~~Ln~~gi 541 (679) T protein:vir:10 464 TTYAS-VDGNYKYQYDKYNDVNRWIPLAADIAGLCARTDTVGQP-WQSPAGFNRGQIVNVIKLAVDTRQAHRDEMYTNGI 541 (679) T ss_pred cceEE-EEccceeeecccCCceEEechHHHHHHHHHHhhccCCc-EECcCCeeeccccccccceeecChhhHHhhhhCCc Confidence 11111 1111 023356677777766543311 1222344444432 23578899999999999 Q ss_pred eEEEEeeccccceEEEecCEeeCC---c--hhhHHHHHHHHHHHHHHHHHHHHHhcCCCCcCHhHHHHHHHHHHHHHHHH Q lcl|NC_020841. 220 NYYSDYGNPDNSLPIFANGHAGGG---K--FFDFVMGFDWLRNVIETNVFNGQRLRRLTPQTDRGMMMIKADIVNGLEEA 294 (367) Q Consensus 220 n~y~~~~~~~~~~~~~~~G~~~~G---~--~iD~~~~~dwl~~~lq~~l~~ll~~~~kipy~~~G~~~l~~~v~~vl~~a 294 (367) |....+.+.+ + .+...+++++ + ||-+.+-.+|++..|+......+-. |.|..=.+.|+..|+.-|.+. T Consensus 542 n~i~~~~g~G--~-~~wG~rT~~~~~s~~~~i~vrR~~~~i~~si~~~~~~~v~e----pn~~~~~~~i~~~i~~fL~~l 614 (679) T protein:vir:10 542 NPIVGFAGQG--Y-ILYGDKTASQAPTPFDRINVRRLFNLLKKSISESAKYKLFE----LNDAFTRSSFRSEVGSYLDTI 614 (679) T ss_pred eEEEEecCCe--E-EEEcccccCCCCcccceEehhhHHHHHHHHHHHHHHHhccC----CCCHHHHHHHHHHHHHHHHHH Confidence 9999886543 2 4566677655 2 6778888999998888888765543 557777899999999999999 Q ss_pred HhcCceecccccCccccccccccccccceeEEcCchHhCCHHHHhccccCCeEEEEEECceEEEEEEEEEecC Q lcl|NC_020841. 295 VKAGLVAAGTWNGAALGEIETYDYLPTGYYVYNESIRDQAQVIREQRIAPPFIILVKGAGAIHDTDITLIPEA 367 (367) Q Consensus 295 ~~~G~I~~g~~~~~~~g~~~~~~~~~~gy~v~~~~~~~~~~~dr~~R~~~~~~~~~~~agaIh~v~i~~~v~~ 367 (367) .++|.|. ||.|.+. .++.+++|+.+.+. -+.+.+...-.++.|.+++.=.. T Consensus 615 ~~~gal~--------------------gf~v~~d-~~~nt~~~i~~G~~-~~~i~~~p~~pae~i~~~~~~~~ 665 (679) T protein:vir:10 615 RSLGGIY--------------------DFRVVCD-ESNNTPAVIDRNEF-VATILIKPARSINYITLSFVATS 665 (679) T ss_pred HhCCcee--------------------eeEEEEc-CCCCCHHHhhCCeE-EEEEEEEecCCccEEEEEEEEee Confidence 9999984 4888877 68899999999988 59999999999999999887554 No 57 >protein:vir:102819 Length: 648 # NCBI annotation: tail sheath protein # Family: family:all:2449 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874082;genbank:gi:118197689;genbank:GeneID:4496011 Probab=97.14 E-value=0.00016 Score=41.47 Aligned_cols=323 Identities=10% Similarity=-0.015 Sum_probs=145.4 Q ss_pred Ccccccccc---c-ce---E--EEEEeeeccccccccccceEEEeeccccCcccceEEEe-cHHHHHhccCCCcHHHHHH Q lcl|NC_020841. 1 MAGSLTLPI---N-ML---V--NVSIEYQAKLLSRDAFNRLLIVGSTAPNGRATDTGIYT-SIDGVKLDYGVEADEYKIA 70 (367) Q Consensus 1 ~~~~~~l~i---~-~i---v--~V~i~~~~~~~~~~~fg~~li~~~~~~~~~~~~~~~yt-s~~~v~~df~~~s~~ykaA 70 (367) +...|.+-+ . ++ . +-.+.+...+... + +..+. .+...+..++..+-. ..+++.. .+..|. --+ T Consensus 248 ~~~~~~~~~~~~~gd~~~~~~~~~~~~~~~tp~~~-~-~~~~~--~~~~~~~~~~~~v~~~~~~~l~~--~~~~p~-~~~ 320 (648) T protein:vir:10 248 FGFTKSRLVKTSFGTVDDLLSNPLLFNLSATPFFD-G-SDYQD--YTSLSDPANWFAKDAYTINHLVD--TTINPH-ILA 320 (648) T ss_pred hcCCcchhhhhhhccccccccccceeccccccccc-c-cceee--eeccccccceeeeeccchhhccc--ccccCc-ccc Confidence 111111110 0 10 0 0011111111110 0 00000 000001111111100 0011100 111111 001 Q ss_pred HHHh---ccCc--ccce-EEEEeccCccchHHHHHHHHhcccCcEEEEEE----------ecC-CHHHHHHHH-HHhhcc Q lcl|NC_020841. 71 QKYF---SQNP--KPRD-LMIATVTALTDPLASIGEVAAKTLGFYAFCFA----------SEV-AAADIQGLA-EWAQSN 132 (367) Q Consensus 71 ~~~F---~Q~p--~p~~-v~v~~~~~~~t~~~~l~~~~~~~~~w~~~~~~----------~~~-~~~~~~ala-~~~ea~ 132 (367) ..+| +++. .|.. -.-+......++.++++.+++....| .+.. +.. +..-+.+.+ .|+... T Consensus 321 ~~~t~L~GGtdG~~p~s~~~~~~~~~~~d~~d~l~~~~~~~~~~--ivp~~~~~~~~~~~~~lt~~q~i~a~a~shv~~~ 398 (648) T protein:vir:10 321 TRIFSLSGGTNGDDGTGYYQTAVSNYINIWSQGLATLEEEEVNF--VIPAYKFTNVTQLNDRLTIFKGIASTFLSHVQTM 398 (648) T ss_pred cccceecccccCCCcccccccccccchhhHHHHhhhccCCCceE--EEeecccccccccccccCCccchHHHHHHHHHHh Confidence 1111 1221 1110 00011113344566666665543322 2210 111 111222222 455432 Q ss_pred C------cE--EEEEEeCchhhh-HHHHHHH---Hhccccc------------------------eeecCCchhHHHHHH Q lcl|NC_020841. 133 N------RM--FMTVMTDDTEAV-TTGNALK---ELGQYHY------------------------CITYHEDYATVGAVA 176 (367) Q Consensus 133 ~------~~--~~~~~~d~~~~~-~~~~~~~---~~~~~~~------------------------~~~~~~~~~~~~~~~ 176 (367) + -+ +.........+. .....+. .++..+. -....+....+++++ T Consensus 399 s~~~~~~~r~~~~~~vg~~~~es~~~se~~~~~~~~~~~~a~~~~~d~~~~~~~~~~~~~~~~~G~~~~~p~~~~Aa~VA 478 (648) T protein:vir:10 399 SQVNRRKARVGVFGLPAPSPNESVTASEYLYNRNILNTISAMFGGTDRAQAVVFPFYSNVFNDEGKVELLGGEFFASYVA 478 (648) T ss_pred hhccccccccCeEEEeCCCCchhHHHHHHHhhhhcccccceeeeecCCceEEeecccceeECCCCcEEecchhhHHHHHH Confidence 1 11 222222211111 1110110 0111010 001112233355666 Q ss_pred HHHHHcccccCcceeeeeeeecCcc--cc-cCCCHHHHHHHHhCCceEEEEeeccccc-eEEEecCEeeCC-------ch Q lcl|NC_020841. 177 GMALDQRYDKTDGVKTLHLKSLVSV--VS-TDISQTQAASLKAACINYYSDYGNPDNS-LPIFANGHAGGG-------KF 245 (367) Q Consensus 177 ~~~~~~~~~~~~g~~t~~~k~l~Gv--~~-~~~t~t~~~~l~~~~~n~y~~~~~~~~~-~~~~~~G~~~~G-------~~ 245 (367) |..++...... .-||.+.++ .+ ..++++|++.|..+|++++....+.+.. ....-.|.++.+ .- T Consensus 479 Gl~a~l~~~~s-----~T~k~i~~~~id~~~~~t~~qld~L~~~Gv~~ie~~~~~~~~~~~rvv~gITT~~~~~~~~~~e 553 (648) T protein:vir:10 479 GMHANREPQDS-----ITFLPISGIGAEPLYNWTYTQKDDLISNRVLFVEKVKTSFGGIVYRIHHNPTTWLGPVTQGFQE 553 (648) T ss_pred hhhhccccccC-----cccceeeccccccccCCCHHHHHHHhcCCcEEEEEecCCcceeeEEEeccceeecCCCCcceee Confidence 66665444443 345565544 33 4789999999999999999887654332 233456777665 25 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHhcCCCCcCHhHHHHHHHHHHHHHHHHHhcCceecccccCccccccccccccccceeE Q lcl|NC_020841. 246 FDFVMGFDWLRNVIETNVFNGQRLRRLTPQTDRGMMMIKADIVNGLEEAVKAGLVAAGTWNGAALGEIETYDYLPTGYYV 325 (367) Q Consensus 246 iD~~~~~dwl~~~lq~~l~~ll~~~~kipy~~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~g~~~~~~~~~~gy~v 325 (367) |-..+-.|++...++..+.+.|+-... ++.....|++.+..-|.+-++.+-|.+-.. ..+ T Consensus 554 isv~ri~D~l~~~vr~~l~~~fIG~~n---~~~~~~~ik~~i~~~L~~~~~~~~I~~y~~-----------------~~v 613 (648) T protein:vir:10 554 FVLRRIDDFLQSYVYKNLQEQFIGRKS---YGRKTENDIKVYTEALLSNLVGKQIVAYKD-----------------VKV 613 (648) T ss_pred eeeeehhhHHHHHHHHHHhhhcCcccc---cHHHHHHHHHHHHHHHhhHhhcCcccCccc-----------------ceE Confidence 778999999999999999999987533 777889999999999888888777763211 112 Q ss_pred EcCchHhCCHHHHhccccCCeEEEEEECceEEEEEEEEEecC Q lcl|NC_020841. 326 YNESIRDQAQVIREQRIAPPFIILVKGAGAIHDTDITLIPEA 367 (367) Q Consensus 326 ~~~~~~~~~~~dr~~R~~~~~~~~~~~agaIh~v~i~~~v~~ 367 (367) ++.. ...|. -+.|.+....+|+.|.+++.|.- T Consensus 614 ~~~~--------~~~vv--~V~~~v~Pv~~i~~I~vti~it~ 645 (648) T protein:vir:10 614 TSNE--------DKTVY--YVEFFYQPVTEIKFILVTMKVTF 645 (648) T ss_pred EEEe--------cCCEE--EEEEEEEecceeeEEEEEEEEEe Confidence 2110 11332 58999999999999999888877 No 58 >protein:vir:101804 Length: 663 # NCBI annotation: gp18 # Family: family:all:661 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238881;genbank:gi:66391956;genbank:GeneID:3416631 Probab=96.89 E-value=0.00028 Score=40.10 Aligned_cols=325 Identities=8% Similarity=0.002 Sum_probs=150.4 Q ss_pred Cccccccc----ccceEEEEEeeeccccccccccceEEEeeccccCcccceEEEecHHHHHhccCCCcHHHHHHHHHhcc Q lcl|NC_020841. 1 MAGSLTLP----INMLVNVSIEYQAKLLSRDAFNRLLIVGSTAPNGRATDTGIYTSIDGVKLDYGVEADEYKIAQKYFSQ 76 (367) Q Consensus 1 ~~~~~~l~----i~~iv~V~i~~~~~~~~~~~fg~~li~~~~~~~~~~~~~~~yts~~~v~~df~~~s~~ykaA~~~F~Q 76 (367) -...-..+ .++...+.+... ......|...+.-+.. ..+.+..-+..-+......+..+.. .. T Consensus 261 ~~~~~v~~~~~~~~~~~~~~~~~~--~~~~~~~~~s~~~~~~---------~~~~~~~~~~~~~~~~~~~~~~~~~--~~ 327 (663) T protein:vir:10 261 SNARGVIQYGPMTDDQFAIIVRRD--GIVVESTVLSTRKGDR---------DVYGSNIFMDDYFRNGGSNFIFASS--EG 327 (663) T ss_pred ccccceeecccccccceeeEeecC--Ccceeeeccccccccc---------ccccchhhhhhhhcCCcceEEEEee--cc Confidence 00000000 011111111111 1111111100000000 0000000000000000000000000 00 Q ss_pred Cccc-ce-EEE-EeccCc-----cchHHHHHHHHhcccCcEEEEEEec--C-C----HHHHHHHHHHhhccCcEEEEEEe Q lcl|NC_020841. 77 NPKP-RD-LMI-ATVTAL-----TDPLASIGEVAAKTLGFYAFCFASE--V-A----AADIQGLAEWAQSNNRMFMTVMT 141 (367) Q Consensus 77 ~p~p-~~-v~v-~~~~~~-----~t~~~~l~~~~~~~~~w~~~~~~~~--~-~----~~~~~ala~~~ea~~~~~~~~~~ 141 (367) .|.+ .. +.+ ++.+.. .+....+..+.+...-.-.++++.. . . ..-..++..+++....+|.+... T Consensus 328 ~~~~~~~~~~l~gg~d~~~~~~~~~~~~~~~~l~~~~~~~~~~~i~~~~~~~~~~~~~~v~~~l~~~a~~~~~~~ai~d~ 407 (663) T protein:vir:10 328 WPAGFTGIIQLGGGTSANADVGADELIKGWDLFSDREALHVNLMIAGACGSDGAEIASTVQKYVVSLADDRQDCVAIVNP 407 (663) T ss_pred cCccccceeEeccccCCccccchhhhHHHHHhhhcccccceeEEEeccCCCCchhhHHHHHHHHHHHHHhhCCEEEEEec Confidence 0000 00 111 111111 1112222223222111111222211 1 1 11233344555544444443322 Q ss_pred Cc---------hhhhHHHHHHHH-------------hccccceeecC------C------chhHHHHHHHHHHHcccccC Q lcl|NC_020841. 142 DD---------TEAVTTGNALKE-------------LGQYHYCITYH------E------DYATVGAVAGMALDQRYDKT 187 (367) Q Consensus 142 d~---------~~~~~~~~~~~~-------------~~~~~~~~~~~------~------~~~~~~~~~~~~~~~~~~~~ 187 (367) .. ....+....... +...+..+.+. + ..+..+.++|..+..+..+- T Consensus 408 p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~sg~vAGl~Ar~D~~~g 487 (663) T protein:vir:10 408 PAELMVGIPTSTAVKNIVEWRNGMTGSGEVVDNNMNISSTYAFIIGNYKYQYDKYNDINRWVPLAADIAGLCAYTDQVSH 487 (663) T ss_pred CcccccccccccchHHHHHHHHhccccccchhhhcccCccceEEEcCceEEecccCCceEEechhHHHHHHHHHhhccCC Confidence 10 000011000000 11111111111 0 02345677777777665442 Q ss_pred cceeeeeeeecC---ccc--ccCCCHHHHHHHHhCCceEEEEeeccccceEEEecCEeeCC---c--hhhHHHHHHHHHH Q lcl|NC_020841. 188 DGVKTLHLKSLV---SVV--STDISQTQAASLKAACINYYSDYGNPDNSLPIFANGHAGGG---K--FFDFVMGFDWLRN 257 (367) Q Consensus 188 ~g~~t~~~k~l~---Gv~--~~~~t~t~~~~l~~~~~n~y~~~~~~~~~~~~~~~G~~~~G---~--~iD~~~~~dwl~~ 257 (367) + ......|.+. |+. ...+++.|.+.|..+|+|++..+-+. ++ ..+...+++.+ + ||-+.+-.+|++. T Consensus 488 ~-~~sPan~~~~~i~g~~~~~~~~~~~e~~~Ln~~gin~i~~~~~~-~G-~~~wG~rT~~~~~s~~~~i~vrR~~~~i~~ 564 (663) T protein:vir:10 488 P-WMSPAGYRRGQIRNCIKLAIEPKQSMRDTMYQVAINPVTGFAGG-DG-FVLFGDKMATQVPSPFDRINVRRLFNMLKK 564 (663) T ss_pred c-eEccCCceeccccccccceeecChhHHHHHhhCCceEEEEEeCC-Cc-EEEEcccccCCCCcccceEehhhHHHHHHH Confidence 1 1222334333 332 24578999999999999998876432 12 24555666543 2 5888899999999 Q ss_pred HHHHHHHHHHHhcCCCCcCHhHHHHHHHHHHHHHHHHHhcCceecccccCccccccccccccccceeEEcCchHhCCHHH Q lcl|NC_020841. 258 VIETNVFNGQRLRRLTPQTDRGMMMIKADIVNGLEEAVKAGLVAAGTWNGAALGEIETYDYLPTGYYVYNESIRDQAQVI 337 (367) Q Consensus 258 ~lq~~l~~ll~~~~kipy~~~G~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~g~~~~~~~~~~gy~v~~~~~~~~~~~d 337 (367) .|+......+-. |.+..=...|+..|+.-|++.+++|.|. ||.|.++ .++.|++| T Consensus 565 si~~~~~~~v~e----pn~~~l~~~i~~~i~~~L~~l~~~gal~--------------------g~~v~~d-~~~nt~~~ 619 (663) T protein:vir:10 565 NIGDTSKYELFE----NNDAFTRQSFRMETSQYLDGIRSLGGCY--------------------DFRVVCD-TTNNTPNV 619 (663) T ss_pred HHHHHHHHhccC----CCCHHHHHHHHHHHHHHHHHHHhCCcee--------------------eeEEEEc-CCCCCHHH Confidence 999888875543 5688888999999999999999999984 4778876 77899999 Q ss_pred HhccccCCeEEEEEECceEEEEEEEEEecC Q lcl|NC_020841. 338 REQRIAPPFIILVKGAGAIHDTDITLIPEA 367 (367) Q Consensus 338 r~~R~~~~~~~~~~~agaIh~v~i~~~v~~ 367 (367) +.+.+. -+.+.+.....++.|.+++.-.. T Consensus 620 i~~G~~-~~~i~~~p~~pae~i~~~~~~~~ 648 (663) T protein:vir:10 620 IDRNEF-VGTIYVKPPRSINYITLNMVATS 648 (663) T ss_pred hhCCeE-EEEEEEEecCCcceEEEEEEEee Confidence 999988 59999999999999999888655 No 59 >protein:vir:100539 Length: 663 # NCBI annotation: gp18 tail sheath monomer # Family: family:all:661 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656380;genbank:gi:109290131;genbank:GeneID:4156517 Probab=96.88 E-value=0.00028 Score=40.04 Aligned_cols=319 Identities=11% Similarity=0.074 Sum_probs=149.0 Q ss_pred CcccccccccceEEEEEeeeccccccccccceEEEeeccccCcccceEEEecHHHHHhccCCCcHHHHHHHHHhccCcc- Q lcl|NC_020841. 1 MAGSLTLPINMLVNVSIEYQAKLLSRDAFNRLLIVGSTAPNGRATDTGIYTSIDGVKLDYGVEADEYKIAQKYFSQNPK- 79 (367) Q Consensus 1 ~~~~~~l~i~~iv~V~i~~~~~~~~~~~fg~~li~~~~~~~~~~~~~~~yts~~~v~~df~~~s~~ykaA~~~F~Q~p~- 79 (367) +.+.... .......+.+. .... .+ ...++........... -..++-......+-..+...|..+-..|..... T Consensus 293 ls~~~~~--~~~~~~~~~~~-~~~~-~~-~s~~v~~~~~~~~~~~-~~~~~l~gg~~~~~~~~~~d~~~~~~~l~~~~~~ 366 (663) T protein:vir:10 293 LSTRRGD--RDVYGNNIFMD-DYFR-NG-SSNFIYASSVNWPAGF-TGIIQLGGGASANNAVGSDELIAGWDLFADREAL 366 (663) T ss_pred eeccccc--cccchhhhhhh-hhhc-Cc-ccceeEeeccccCccc-ceeEEecccccCcccchhhhhhhHHhhhcccccc Confidence 2221111 11110000010 0000 01 1111111110000000 000100000000011122223333222222111 Q ss_pred cceEEE-EeccCc-----cchHHHHHHHHhcccCcEEEEEEecC--------CHHHHHHHHHHhhccCcEEEEEEeCchh Q lcl|NC_020841. 80 PRDLMI-ATVTAL-----TDPLASIGEVAAKTLGFYAFCFASEV--------AAADIQGLAEWAQSNNRMFMTVMTDDTE 145 (367) Q Consensus 80 p~~v~v-~~~~~~-----~t~~~~l~~~~~~~~~w~~~~~~~~~--------~~~~~~ala~~~ea~~~~~~~~~~d~~~ 145 (367) +..+.+ +..... .....++..+.+...+-+ .+.+.. ....+..+..|-+..... T Consensus 367 ~~~~~i~~~~~~~~~~~~~~v~~~l~~~~~~~~~~~--ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~---------- 434 (663) T protein:vir:10 367 HVNLMIAGACKSDGVAVASTVQKHVVALADDRQDCV--AFVNPPSELLVGVPTTQAVKNIVEWRNGVTTG---------- 434 (663) T ss_pred CceEEEeecCCCCchhhHHHHHHHHHHHHHhhCCEE--EEEecCcccccccchhhhHHHHHHHhhhcccc---------- Confidence 111111 111111 111222333333333322 222221 112233333443321000 Q ss_pred hhHHHHHHHHhccccceeecC------C------chhHHHHHHHHHHHcccccCcceeeeeeeecCccc-----ccCCCH Q lcl|NC_020841. 146 AVTTGNALKELGQYHYCITYH------E------DYATVGAVAGMALDQRYDKTDGVKTLHLKSLVSVV-----STDISQ 208 (367) Q Consensus 146 ~~~~~~~~~~~~~~~~~~~~~------~------~~~~~~~~~~~~~~~~~~~~~g~~t~~~k~l~Gv~-----~~~~t~ 208 (367) .........+...+..+.+. + -.+..+.++|..+..+..+-+ ......|.+.||. ...+++ T Consensus 435 -~~~~~~~~~~~s~~~~l~~p~~~~~d~~~~~~~~~p~s~~vAGl~Ar~D~~~g~-~~span~~~~~i~g~~~~~~~~~~ 512 (663) T protein:vir:10 435 -GEVVDNNMNISSTYAFISGNYKYQYDKYNDINRWVPLSADIAGLCAYTDQVGHP-WMSPAGYRRGQLRNTIKLAIEPKQ 512 (663) T ss_pred -chhhhhhcccCcceEEEEecceeEecccCCceEEechHHHHHHHHHHhhccCCc-EEccCCeeecceeccccceeecCc Confidence 00000000011111111110 0 113445667777665543311 1222334333333 235788 Q ss_pred HHHHHHHhCCceEEEEeeccccceEEEecCEeeCC-----chhhHHHHHHHHHHHHHHHHHHHHHhcCCCCcCHhHHHHH Q lcl|NC_020841. 209 TQAASLKAACINYYSDYGNPDNSLPIFANGHAGGG-----KFFDFVMGFDWLRNVIETNVFNGQRLRRLTPQTDRGMMMI 283 (367) Q Consensus 209 t~~~~l~~~~~n~y~~~~~~~~~~~~~~~G~~~~G-----~~iD~~~~~dwl~~~lq~~l~~ll~~~~kipy~~~G~~~l 283 (367) .|.+.|..+|+|.+..+-+. ++ ..+...+++++ .||-+.+-.+|++..|+..+...+-. |.+..-...| T Consensus 513 ~~~~~Ln~~gin~i~~~~~~-~G-~~~wG~rT~s~~~s~~~~i~vrR~~~~i~~si~~~~~~~v~e----pn~~~l~~~i 586 (663) T protein:vir:10 513 SLRDTMYQVSINPVTGFAGG-DG-FVLFGDKMATQVPSPFDRINVRRLFNMLKKNIGDTSKYELFE----NNDAFTRQSF 586 (663) T ss_pred hhHHHHHhCCCcEEEEeeCC-Cc-EEEEcccccCCCCcccceEehhhHHHHHHHHHHHHHHHhccC----CCCHHHHHHH Confidence 99999999999999887542 12 24566666554 26888888899998888887765443 6688888999 Q ss_pred HHHHHHHHHHHHhcCceecccccCccccccccccccccceeEEcCchHhCCHHHHhccccCCeEEEEEECceEEEEEEEE Q lcl|NC_020841. 284 KADIVNGLEEAVKAGLVAAGTWNGAALGEIETYDYLPTGYYVYNESIRDQAQVIREQRIAPPFIILVKGAGAIHDTDITL 363 (367) Q Consensus 284 ~~~v~~vl~~a~~~G~I~~g~~~~~~~g~~~~~~~~~~gy~v~~~~~~~~~~~dr~~R~~~~~~~~~~~agaIh~v~i~~ 363 (367) +..|+.-|++.+++|.|. ||.|.+. .++.+++|+.+.+. -+.+.+.....++.|.+++ T Consensus 587 ~~~i~~~L~~l~~~gal~--------------------gf~V~~d-~~~nt~~~i~~G~~-~~~i~~~p~~pae~I~~~~ 644 (663) T protein:vir:10 587 RMEVSQYLDNIRSLGGVY--------------------DFRVVCD-TTNNTPQVIDSNEF-VATIYIKAPRSINYITLNF 644 (663) T ss_pred HHHHHHHHHHHHhCCcee--------------------eeEEEEc-CCCCCHHHhhCCeE-EEEEEEEecCCcceEEEEE Confidence 999999999999999984 3778876 77889999999888 5999999999999999976 Q ss_pred EecC Q lcl|NC_020841. 364 IPEA 367 (367) Q Consensus 364 ~v~~ 367 (367) .=.. T Consensus 645 ~~~~ 648 (663) T protein:vir:10 645 VATS 648 (663) T ss_pred EEEe Confidence 6443 No 60 >protein:vir:4517 Length: 498 # NCBI annotation: tail sheath protein # Family: family:all:369 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599043;genbank:gi:19549001;genbank:GeneID:935236 Probab=96.05 E-value=0.0011 Score=36.86 Aligned_cols=344 Identities=14% Similarity=0.094 Sum_probs=170.4 Q ss_pred Cccc-cccccc-ceEEEEEeeec-cccccccccceEEEeecccc---CcccceEEEecHHHHHhccCCCcHHHHHHHHHh Q lcl|NC_020841. 1 MAGS-LTLPIN-MLVNVSIEYQA-KLLSRDAFNRLLIVGSTAPN---GRATDTGIYTSIDGVKLDYGVEADEYKIAQKYF 74 (367) Q Consensus 1 ~~~~-~~l~i~-~iv~V~i~~~~-~~~~~~~fg~~li~~~~~~~---~~~~~~~~yts~~~v~~df~~~s~~ykaA~~~F 74 (367) |.++ ...|-| ++=-+-+.+++ .+-...+-...|++|..... ......++ .|.++..+-||..|-.-.+++.+. T Consensus 1 M~IsF~~IP~~iRvP~~y~E~dns~A~~~~~~q~vLiiGq~la~gs~~~~~~v~v-~s~~~a~~lfG~GSml~~M~~a~~ 79 (498) T protein:vir:45 1 MTISFNTIPSNTLVPLFYAEMDNQAANTAQDSGASLLIGHANNGAEIVANSLVLM-PSADYARQICGAGSQLARMVEAYR 79 (498) T ss_pred CCCchhhcCcccccCeEEEEEeCCCCCCCCCCcceEEEEecCCccccccceeEEe-cCHHHHHHhcCcCcHHHHHHHHHH Confidence 6665 445533 11134455543 33344455678899876322 33445555 577777778999998888888776 Q ss_pred ccCc------------------------------ccceEEEEec------cCccchHH---H------------------ Q lcl|NC_020841. 75 SQNP------------------------------KPRDLMIATV------TALTDPLA---S------------------ 97 (367) Q Consensus 75 ~Q~p------------------------------~p~~v~v~~~------~~~~t~~~---~------------------ 97 (367) .-.| .+-.++|++. .+.+++.+ + T Consensus 80 ~~n~~~~l~~i~~~d~aG~aA~g~it~tg~at~~G~l~l~Igg~~v~v~V~~gdTaa~vA~al~aaina~~~lPVTA~~~ 159 (498) T protein:vir:45 80 QTDPFGELYVIAVPEATGAAATVTLTVTGEATESGTVNVYVGRTRVQAPVTNGDNVTTIASSIQDAINAVPTLPFTASSS 159 (498) T ss_pred HhCCcceEEEEeeCCcccceeEEEEEeecccCCCcEEEEEECCEEEEEEecCCCCHHHHHHHHHHHHhCCCCCceEEEec Confidence 6322 1113334321 11112111 1 Q ss_pred -----------------------------------------------------HHHHHhcccCcEEEEEEecCCHHHHHH Q lcl|NC_020841. 98 -----------------------------------------------------IGEVAAKTLGFYAFCFASEVAAADIQG 124 (367) Q Consensus 98 -----------------------------------------------------l~~~~~~~~~w~~~~~~~~~~~~~~~a 124 (367) ..++......||.+++..-.+.+.+.+ T Consensus 160 ~~~VtlTAr~kG~~GN~I~l~~~~~~~~~ge~~p~Glt~~itamagGag~PD~a~alaal~~~~~~~I~~p~~D~asL~a 239 (498) T protein:vir:45 160 AGVVTLTARHKGLCGNEIPVSLNYYGFGGGEVLPAGVQIAVATGTAGTGAPVLTGAVAAMADEPFDYIGLPFNDTASVNT 239 (498) T ss_pred CceEEEEeeccCccccceeEEEeeccccccccccceeeEEEEccCCCccCchhHHHHHHhccCCccEEEEeeCCHHHHHH Confidence 122333334455444444445556666 Q ss_pred HHHHhhc-------cCcEEEEEEeCch-hhhHHHHHHHHhccccceeecC-C---chh-HHHHHHHHHHHcccccCccee Q lcl|NC_020841. 125 LAEWAQS-------NNRMFMTVMTDDT-EAVTTGNALKELGQYHYCITYH-E---DYA-TVGAVAGMALDQRYDKTDGVK 191 (367) Q Consensus 125 la~~~ea-------~~~~~~~~~~d~~-~~~~~~~~~~~~~~~~~~~~~~-~---~~~-~~~~~~~~~~~~~~~~~~g~~ 191 (367) +..+++. -+.++.....-.. +.......-...+..|..+..+ + ++. ..++..+..++.....+|.+ T Consensus 240 l~~~L~~~sgRw~~~~q~~g~~~~a~~gT~~~l~t~g~~~N~~~it~~~~~~~~~sp~~~~AAa~aa~~A~~l~~DPAr- 318 (498) T protein:vir:45 240 LVTEMNDTSGRWSYARQLYGHVYTAKTGTLSELVNAGDQFNQQHITLAGYEKETQTPADELAASRTARAAVFIRNDPAR- 318 (498) T ss_pred HHHHHhhhhhhhhHHhhcCeEEEEeccCCHHHHHHhhhccCCceEEEEecCCCCCChHHHHHHHHHHHHHHHhhccccc- Confidence 6666653 1222222221111 1111112222334455444333 2 222 22222223333333455533 Q ss_pred eeeeeecCccccc----CCCHHHHHHHHhCCceEEEEeeccccceEEEecCEee-----CC----chhh--HHHHHHHHH Q lcl|NC_020841. 192 TLHLKSLVSVVST----DISQTQAASLKAACINYYSDYGNPDNSLPIFANGHAG-----GG----KFFD--FVMGFDWLR 256 (367) Q Consensus 192 t~~~k~l~Gv~~~----~~t~t~~~~l~~~~~n~y~~~~~~~~~~~~~~~G~~~-----~G----~~iD--~~~~~dwl~ 256 (367) ++.--.|+||.|. .++.+|.+.|..+|+..+.. . . +...+.+..++ .| .|.| .++-.++++ T Consensus 319 PL~tl~L~Gi~~p~~~~r~~~~ern~LL~~Gist~~V-~-~--G~V~I~R~ITTY~~n~~G~~D~syLDi~T~~tl~yvr 394 (498) T protein:vir:45 319 PTQTGELVGMLPAPKGKRFTMTEQQTLLSHGVATAYV-E-S--GVLRIQRDVTTYRKNAYGVADNSYLDSETLHTSAYVL 394 (498) T ss_pred ccCceeecceecCCchhcCChHHHHHHHhCCcceEEE-c-C--CeEEEEeeeeeeeecCCCCcchhhhhhhhHHHHHHHH Confidence 3333467888854 46889999999999999854 2 2 23556777663 34 3655 899999999 Q ss_pred HHHHHHHHHHHHhcCCCCcCHhH---------HHHHHHHHHHHHHHHHhcCceecccccCccccccccccccccceeEEc Q lcl|NC_020841. 257 NVIETNVFNGQRLRRLTPQTDRG---------MMMIKADIVNGLEEAVKAGLVAAGTWNGAALGEIETYDYLPTGYYVYN 327 (367) Q Consensus 257 ~~lq~~l~~ll~~~~kipy~~~G---------~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~g~~~~~~~~~~gy~v~~ 327 (367) ..++..+..-+- ..|+.-+... -..|++.+-..+++....|++..-+..-...-.....+...+ ..+.. T Consensus 395 ~~~r~~i~~kfp-R~KLa~dg~~~~~gq~IvTp~~ir~ell~~y~~le~~givEn~~~~~~~LiVerd~~dpnR-ln~~~ 472 (498) T protein:vir:45 395 RKLKSVITSKYG-RHKLASDGTRFGPGQAIVTPAVIKGELLATYRQLERAGIVENYELFKQYLVVERDASVPNR-LNTLF 472 (498) T ss_pred HHHHHHhhhhcC-CeeecccCcccCCCCcccchHHHHHHHHHHHHhhhhhccccChhhhcceeEEEECCCCCcE-EEEEe Confidence 999999987763 3343333221 268999999999999999998543211111000000000000 11111 Q ss_pred -CchHhCCHHHHhccccCCeEEEEEECce Q lcl|NC_020841. 328 -ESIRDQAQVIREQRIAPPFIILVKGAGA 355 (367) Q Consensus 328 -~~~~~~~~~dr~~R~~~~~~~~~~~aga 355 (367) ++..++ =|---..-.+.+.|.-++| T Consensus 473 p~d~vn~---L~V~A~~~~f~lq~~~~~~ 498 (498) T protein:vir:45 473 PPDYVNQ---LRVFAVVNQFRLQYSEESA 498 (498) T ss_pred cccccCc---hhhhhhhhhhheehhhcCC Confidence 111111 1111111234444555555 No 61 >protein:vir:489 Length: 498 # NCBI annotation: putative sheath protein # Family: family:all:369 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543096;swissprot:trembl:q8w623;genbank:gi:18249908;uniprot:Q8W623;genbank:GeneID:929698 Probab=95.78 E-value=0.0015 Score=36.11 Aligned_cols=344 Identities=15% Similarity=0.087 Sum_probs=168.1 Q ss_pred Cccc-ccccccceE-EEEEeeec-cccccccccceEEEeecccc---CcccceEEEecHHHHHhccCCCcHHHHHHHHHh Q lcl|NC_020841. 1 MAGS-LTLPINMLV-NVSIEYQA-KLLSRDAFNRLLIVGSTAPN---GRATDTGIYTSIDGVKLDYGVEADEYKIAQKYF 74 (367) Q Consensus 1 ~~~~-~~l~i~~iv-~V~i~~~~-~~~~~~~fg~~li~~~~~~~---~~~~~~~~yts~~~v~~df~~~s~~ykaA~~~F 74 (367) |.++ ...|-|=.| -+-+.+++ .+....--...|++|..... ......++ .|.++..+-||..|-.-.+++.+. T Consensus 1 M~IsF~~IP~~iRvP~~y~E~dns~A~~~~~~qrvLiiGq~la~gt~~~~~~v~v-~s~~~a~~~fG~GS~l~~M~~a~~ 79 (498) T protein:vir:48 1 MTISFSAVPSDTLVPLFYAEMDNSAANTAVTSAPALLIGHASNDAAIEVNSLVLM-PSADYARQICGAGSQLARMVDVYR 79 (498) T ss_pred CCccccccCcccccceEEEEEecCCCccccCCcceEEEeecCccccccccceEEe-cCHHHHHHhcCcccHHHHHHHHHH Confidence 5555 344533111 23344433 22222222468888876432 33445555 566777778999999888888876 Q ss_pred ccCc------------------------------ccceEEEEec------cCccchHH---H------------------ Q lcl|NC_020841. 75 SQNP------------------------------KPRDLMIATV------TALTDPLA---S------------------ 97 (367) Q Consensus 75 ~Q~p------------------------------~p~~v~v~~~------~~~~t~~~---~------------------ 97 (367) ...| .+-.++|++. .+.+++.+ + T Consensus 80 ~~n~~~~l~~i~~~D~ag~aA~g~it~tg~at~~G~l~l~Igg~~v~v~V~~gdTaa~vA~al~aai~a~~~lPVTA~~~ 159 (498) T protein:vir:48 80 QTDPFGELYVIAVPEARGAAATVRVTVTGEAEESGTLSLYVGRSSVQVPVVNGDDATAVATAIKEAVNGVITLPFAASSD 159 (498) T ss_pred HhCCCceeEEEeeCCcccceeEEEEEecccccCCceEEEEECCEEEEEeecCCCCHHHHHHHHHHHHhCCCCcceEEEec Confidence 6332 1112344321 11112111 1 Q ss_pred -----------------------------------------------------HHHHHhcccCcEEEEEEecCCHHHHHH Q lcl|NC_020841. 98 -----------------------------------------------------IGEVAAKTLGFYAFCFASEVAAADIQG 124 (367) Q Consensus 98 -----------------------------------------------------l~~~~~~~~~w~~~~~~~~~~~~~~~a 124 (367) -.++......||.++++.-.+.+.+.+ T Consensus 160 ~~~VtlTAr~kG~~GN~I~l~~~~~~~~~ge~~p~Glt~~itamsgGag~PDia~aLaal~~~~~~~I~~p~~D~asl~a 239 (498) T protein:vir:48 160 AGVVTLTARHKGLYGNELPVCLNYYGSGGGEILPAGLQVVTEAGTAGSGAPDLTAAVAAMGDEAFDFIGLPFNDAASINM 239 (498) T ss_pred CcEEEEEeeecccccccceeeeeeccCcccccccceeeEEEEcccCCccCcchHHHHHhhccCCccEEEEeecCHHHHHH Confidence 122333334555555444445566666 Q ss_pred HHHHhhc-------cCcEEEEEEeCch-hhhHHHHHHHHhccccceee-cCCc---hh-HHHHHHHHHHHcccccCccee Q lcl|NC_020841. 125 LAEWAQS-------NNRMFMTVMTDDT-EAVTTGNALKELGQYHYCIT-YHED---YA-TVGAVAGMALDQRYDKTDGVK 191 (367) Q Consensus 125 la~~~ea-------~~~~~~~~~~d~~-~~~~~~~~~~~~~~~~~~~~-~~~~---~~-~~~~~~~~~~~~~~~~~~g~~ 191 (367) +.+|++. -+.++.....-.. +.......-...+..|..+. +.+. +. ..++..+..++.....+|.+ T Consensus 240 l~~~L~~~sgRw~~~~q~~g~~~~a~~gT~~~l~t~g~~~N~~~it~~~~~~~~~~p~~~~AAa~a~~aA~~l~~DPAr- 318 (498) T protein:vir:48 240 MMTEMNDSSGRWSYARQLYGHVYTAKLGTLSELVNAGDMHNQQHITLAGYEKETQSPVDELVASRLAREAVFIRNDPAR- 318 (498) T ss_pred HHHHHhhhhhhhhHHhhcCeEEEEeccCCHHHHHHhhhccCCceEEEEecCCCCCChHHHHHHHHHHHHHHhhhccccc- Confidence 6666643 1223322221111 11111112223344554433 3322 21 22222333333333455533 Q ss_pred eeeeeecCccccc----CCCHHHHHHHHhCCceEEEEeeccccceEEEecCEee-----CC----chhh--HHHHHHHHH Q lcl|NC_020841. 192 TLHLKSLVSVVST----DISQTQAASLKAACINYYSDYGNPDNSLPIFANGHAG-----GG----KFFD--FVMGFDWLR 256 (367) Q Consensus 192 t~~~k~l~Gv~~~----~~t~t~~~~l~~~~~n~y~~~~~~~~~~~~~~~G~~~-----~G----~~iD--~~~~~dwl~ 256 (367) ++.--.|+||.|. .++.+|.+.|..+|+..+..-+ +...+.+..++ .| .|.| .++-.++++ T Consensus 319 PLqtl~L~Gi~~p~~~~r~~~~ern~LL~~Gist~~V~~----G~V~I~R~ITTY~~n~~G~~D~syLDi~T~~tl~yvr 394 (498) T protein:vir:48 319 PTQTGELVGMLPAPKGKRFIMTEQQTLLSHGVATAYVEG----GTLRIQRSVTTYKKNAYGVADNSYLDSETLHTSAYVL 394 (498) T ss_pred cccceeeeccccCCchhcCChHHHHHHHhcCcceEEEcC----CeEEEEeeeeeeeecCCCCcchhhhhhhhHHHHHHHH Confidence 3333467888854 3588999999999999985422 23456666663 34 3655 899999999 Q ss_pred HHHHHHHHHHHHhcCCCCcCHhH---------HHHHHHHHHHHHHHHHhcCceecccccCccccccccccccccceeEEc Q lcl|NC_020841. 257 NVIETNVFNGQRLRRLTPQTDRG---------MMMIKADIVNGLEEAVKAGLVAAGTWNGAALGEIETYDYLPTGYYVYN 327 (367) Q Consensus 257 ~~lq~~l~~ll~~~~kipy~~~G---------~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~g~~~~~~~~~~gy~v~~ 327 (367) ..++..+..-+- ..|+.-+..+ -..|++.+-..+++....|++..-+..-...-.....+...+ ..+.. T Consensus 395 ~~~r~~i~~kfp-R~KLa~dg~~~~~gq~IvTp~~ir~eli~~y~~le~~given~~~~~~~LiVerd~~dpnR-ln~~~ 472 (498) T protein:vir:48 395 RKLKSVITSKYG-RHKLANDGTRFGPGQAIVTPAVIKGELLATYRQMERAGIVENYDLFKQYLIVERDADNPNR-LNTLF 472 (498) T ss_pred HHHHHHhhhhcC-CceecccCcccCCCCcccchHHHHHHHHHHHHhhhhhccccChhhhcceeEEEECCCCCcE-EEEEe Confidence 999999987774 3344333221 268999999999999999998643221111000000000000 11111 Q ss_pred -CchHhCCHHHHhccccCCeEEEEEECce Q lcl|NC_020841. 328 -ESIRDQAQVIREQRIAPPFIILVKGAGA 355 (367) Q Consensus 328 -~~~~~~~~~dr~~R~~~~~~~~~~~aga 355 (367) ++..++ =|---..-.+.+.|.-++| T Consensus 473 p~d~vn~---L~V~A~~~~f~lq~~~~~~ 498 (498) T protein:vir:48 473 PPDYVNQ---LRVFAVVNQFRLQYSEESA 498 (498) T ss_pred cccccCc---hhhhhhhhhhhhhhhhcCC Confidence 111111 1111111223444455555 No 62 >protein:vir:276 Length: 369 # NCBI annotation: putative tail sheath protein # Family: family:all:669 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536656;genbank:gi:17975134;genbank:GeneID:929090 Probab=95.57 E-value=0.0018 Score=35.60 Aligned_cols=331 Identities=12% Similarity=0.042 Sum_probs=154.7 Q ss_pred ccccccceEEEEEeeec-cccccccccceEEEeeccccCcccceEEEecHHHHHhccCCCcHHHHHHHHHhccCcccc-e Q lcl|NC_020841. 5 LTLPINMLVNVSIEYQA-KLLSRDAFNRLLIVGSTAPNGRATDTGIYTSIDGVKLDYGVEADEYKIAQKYFSQNPKPR-D 82 (367) Q Consensus 5 ~~l~i~~iv~V~i~~~~-~~~~~~~fg~~li~~~~~~~~~~~~~~~yts~~~v~~df~~~s~~ykaA~~~F~Q~p~p~-~ 82 (367) |++| .|+|+ +++. ++....-=...|++|..+...........++-+++..-+++.+...|.=-..|....... . T Consensus 1 m~~~---~V~in-~~n~~qg~~~~ver~~lfig~g~~~~~~g~~~~~~~~sdld~~lg~~ds~lk~~v~aa~~naG~~w~ 76 (369) T protein:vir:27 1 MAWP---TVIIK-ILNLMNGPIADIECHFLFVIRGTVSGEVRNLIMVDSTSDLDDVLAEASAEGLAIVKAAQLNGKQAWT 76 (369) T ss_pred CCCC---ceEEe-cccccCCCcccccceEEEEEeccccccccceEEecCccchHhhcCCcChhHHHHHHHHHhCCCCceE Confidence 7776 23343 2221 222222123456775443211111122234444444445665555555333333332222 1 Q ss_pred EEEEeccCccchHHHHHHHHhcccCcEEEEEEecC-CHHHHHHHHHHhh---cc-CcEEEEEEe----C--chhh----- Q lcl|NC_020841. 83 LMIATVTALTDPLASIGEVAAKTLGFYAFCFASEV-AAADIQGLAEWAQ---SN-NRMFMTVMT----D--DTEA----- 146 (367) Q Consensus 83 v~v~~~~~~~t~~~~l~~~~~~~~~w~~~~~~~~~-~~~~~~ala~~~e---a~-~~~~~~~~~----d--~~~~----- 146 (367) .++.-..+.+++.+++... +....+-+..++... +.+++.++....+ .. .+..|+... + .... T Consensus 77 a~~~p~~~~~~~~~Av~~a-~~~~s~E~V~v~~p~t~~a~i~aaq~~a~el~~~~~R~vffi~e~~~~~~~~~~~e~w~d 155 (369) T protein:vir:27 77 AGVMILSEEDNWQDAVKKA-NEVSSFEFVVLGFDAETKAMIEDAITLRTELKNSLGREVGVLCQLPAINNDPTNGQTWSE 155 (369) T ss_pred EEEEEeCCchhHHHHHHhh-hhhCCccEEEEecCcccHHHHHHHHHHHHHHHHhcCCeEEEEEeccccCCCccccCCHHH Confidence 2223334555666666544 344455445555543 3455555443332 32 333344332 1 1111 Q ss_pred --hHHHHHHHHhccccce--eecCCchhHHHHHHHHHHH--cccccCcceeeeeeeecCccc-----c--cCCCHHHHHH Q lcl|NC_020841. 147 --VTTGNALKELGQYHYC--ITYHEDYATVGAVAGMALD--QRYDKTDGVKTLHLKSLVSVV-----S--TDISQTQAAS 213 (367) Q Consensus 147 --~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~--~~~~~~~g~~t~~~k~l~Gv~-----~--~~~t~t~~~~ 213 (367) .......+.+...+.. ..++......+-++|+++. +.-...||++-- -.+.|+. + ..++.+.+.+ T Consensus 156 y~a~l~al~~g~a~~~V~vv~~~~~~gn~~G~~aGRl~n~aVsIadsp~RVkt--G~l~g~~~~p~d~~g~~l~~a~l~a 233 (369) T protein:vir:27 156 WLADTVDIPKDVASEYISVVPNVHAAGDTLGKYAGRLANKEVSIADSPARVQT--GSVLGNTELMKDKAGKALDLATLKA 233 (369) T ss_pred HHHHHHHHhhccCcccceeeeeeccccchHHHHHHHHHhcccchhcCcceeee--cccccccccccCCCCcccCHHHHHH Confidence 1111112223333322 2233223345666777642 222344554211 0122322 1 1378899999 Q ss_pred HHhCCceEEEEeeccccceEEEecCEee---CCchhhHHHHHHHHHHHHHHHHHHHH-HhcCCCCcCHhHHHHHHHHHHH Q lcl|NC_020841. 214 LKAACINYYSDYGNPDNSLPIFANGHAG---GGKFFDFVMGFDWLRNVIETNVFNGQ-RLRRLTPQTDRGMMMIKADIVN 289 (367) Q Consensus 214 l~~~~~n~y~~~~~~~~~~~~~~~G~~~---~G~~iD~~~~~dwl~~~lq~~l~~ll-~~~~kipy~~~G~~~l~~~v~~ 289 (367) |+++|+.+.-.|.+-. ..++.+|.++ +|+|=-.-+.+.+-|..=+.++..+- +....+.=+..+++..+..+.. T Consensus 234 Ld~agysvp~~Y~gy~--G~Yw~d~~tl~~~gsDYq~iE~~RVvdKa~R~vR~~Ai~~i~Dr~lnstp~sia~~~~~~~~ 311 (369) T protein:vir:27 234 LESNRIAVPMWYPDYP--GQYWTTGRTLDVPGGDYQDIRHIRVAMKAARKVRIRAIARIADRTLNSTPQSIAAAKLYFTQ 311 (369) T ss_pred HHhCCCeEEEeeCCCC--ceEEeCceEeccCCCCeehhhhhhHHHHHHHHHHHHHHHHhcCcccccChhHHHHHHHHHhh Confidence 9999999999998754 3479999996 34554444444454544444444331 2344577889999999999999 Q ss_pred HHHHHHhcCceecccccCccccccccccccccceeEEcCchHhCCHHHHhccccCCeEEEEEECceEEEEEEEEEecC Q lcl|NC_020841. 290 GLEEAVKAGLVAAGTWNGAALGEIETYDYLPTGYYVYNESIRDQAQVIREQRIAPPFIILVKGAGAIHDTDITLIPEA 367 (367) Q Consensus 290 vl~~a~~~G~I~~g~~~~~~~g~~~~~~~~~~gy~v~~~~~~~~~~~dr~~R~~~~~~~~~~~agaIh~v~i~~~v~~ 367 (367) +|++..+.+ -||+...+..+ ++.-+ ...+.--.|.+..+.=+-=..+++.|-++- T Consensus 312 pLr~M~ks~--fpgei~~P~d~--------------------dI~i~-w~~k~~V~I~~~vrP~~~pk~it~~I~ldl 366 (369) T protein:vir:27 312 DLRTMALTG--VPGEIYPPEDE--------------------DIQIK-WVNSTDVEIYMSVQPYECPVKITIAISVKQ 366 (369) T ss_pred HHHHHHhhc--CCeEEecCCCC--------------------ceEEE-eeccceEEEEEEEeeccCCceEEEEEEEec Confidence 999997664 35554333222 22111 011111123233222233334444444444 No 63 >protein:vir:3788 Length: 376 # NCBI annotation: tail sheath # Family: family:all:669 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536828;genbank:gi:17981837;genbank:GeneID:929216 Probab=93.61 E-value=0.007 Score=32.41 Aligned_cols=333 Identities=13% Similarity=0.059 Sum_probs=152.2 Q ss_pred cccccceEEEEEeeec-cccccccccceEEEeeccccCcccceEEEecHHHHHhccCCCcHHHHHHHHHhccCcccceEE Q lcl|NC_020841. 6 TLPINMLVNVSIEYQA-KLLSRDAFNRLLIVGSTAPNGRATDTGIYTSIDGVKLDYGVEADEYKIAQKYFSQNPKPRDLM 84 (367) Q Consensus 6 ~l~i~~iv~V~i~~~~-~~~~~~~fg~~li~~~~~~~~~~~~~~~yts~~~v~~df~~~s~~ykaA~~~F~Q~p~p~~v~ 84 (367) -+| .|.|+ .++. ++....-=-..|++|..+.. .. ......+.+++..-+++.+...|.=-..+.........+ T Consensus 1 ~~~---~v~vn-~~n~~~g~~~~~er~~Lfig~~~~~-~~-~~~~~~~~sdld~~lg~~~~~lk~~v~aa~~naG~~~~~ 74 (376) T protein:vir:37 1 MFP---SVQIN-ALNQLSGETKEIERHALFVGVGTTN-QG-KLLALTPDSDFDKVFGETDTDLKKQVRAAMLNAGQNWFA 74 (376) T ss_pred CCC---eEEEe-cccccCCCcccccceEEeecccccc-cc-ceeeecCccchHhhhCCCchHHHHHHHHHHhCCCCcEEE Confidence 122 23333 1121 22111111235667665532 11 112223333333334555555554333333332333222 Q ss_pred EEe-c-cCccchHHHHHHHHhcccCcEEEEEEec--CCHHHHHH---HHHHhhccC-cEEEEEEeCc------hhhhH-- Q lcl|NC_020841. 85 IAT-V-TALTDPLASIGEVAAKTLGFYAFCFASE--VAAADIQG---LAEWAQSNN-RMFMTVMTDD------TEAVT-- 148 (367) Q Consensus 85 v~~-~-~~~~t~~~~l~~~~~~~~~w~~~~~~~~--~~~~~~~a---la~~~ea~~-~~~~~~~~d~------~~~~~-- 148 (367) ... . ....+..+++... +...++-++.++.. .+.+++.+ ++.....+- +..++..... ....+ T Consensus 75 ~~~~~~~~~~~~~~Av~~a-~~~~s~E~V~v~~pv~t~~a~i~aa~~~a~el~~~~~Rpv~file~r~~~~~~~~~e~w~ 153 (376) T protein:vir:37 75 HVYIAQEDGYDFVECVKKA-NQTASFEYCVNTRYLGVDKASIGKLQECYAELLAKFGRRTFFIQAVQGINHDQSDGETWD 153 (376) T ss_pred EEEeecCCchHHHHHHHHh-hhhcCceEEEEeccccccHHHHHHHHHHHHHHHHhcCCeEEEEEeccCcCcccccccCHH Confidence 211 1 2334566666543 34455544444443 34545444 444444442 3333333211 11111 Q ss_pred -HHHHHHH----hcccc-c-eeecCCchhHHHHHHHHHH--HcccccCcceeee-eeee------cCcccccCCCHHHHH Q lcl|NC_020841. 149 -TGNALKE----LGQYH-Y-CITYHEDYATVGAVAGMAL--DQRYDKTDGVKTL-HLKS------LVSVVSTDISQTQAA 212 (367) Q Consensus 149 -~~~~~~~----~~~~~-~-~~~~~~~~~~~~~~~~~~~--~~~~~~~~g~~t~-~~k~------l~Gv~~~~~t~t~~~ 212 (367) -...+.. +...+ . +...|++ ..+-++|+.+ ++.-...+|++.- .... ...-....++...+. T Consensus 154 ~y~~~~~al~~gia~~~V~~V~~~~gn--~~G~~aGRl~~aaVsVadspgRV~tG~l~gl~~~~lp~d~~~~~l~~a~l~ 231 (376) T protein:vir:37 154 QYVQKLTTLQQTIVADHVCLVPLLFGN--ETGVLAGRLANRAVTVADSPARVQTGALVSLGSANKPLDKDRNELTLAHLK 231 (376) T ss_pred HHHHHHHHhhcccccccceeeeeehhh--hHHHHHHHHhhcccchhhCccceeccccccccccccccCcCcccCCHHHHH Confidence 1112222 12222 1 2222432 3556677653 3333455665321 1111 122333568899999 Q ss_pred HHHhCCceEEEEeeccccceEEEecCEeeC---Cch--hhHHHHHHHHHHHHHHHHHHHHHhcCCCCcCHhHHHHHHHHH Q lcl|NC_020841. 213 SLKAACINYYSDYGNPDNSLPIFANGHAGG---GKF--FDFVMGFDWLRNVIETNVFNGQRLRRLTPQTDRGMMMIKADI 287 (367) Q Consensus 213 ~l~~~~~n~y~~~~~~~~~~~~~~~G~~~~---G~~--iD~~~~~dwl~~~lq~~l~~ll~~~~kipy~~~G~~~l~~~v 287 (367) +|+++||.+...|.+-. ..++.+|.++. |+| |-..+=.|=....+.......+... .+==+..+++..+..+ T Consensus 232 aLd~agy~vp~~Y~gy~--G~Y~~d~~tl~~~gsDY~~ie~~RVvdKa~R~vR~~ai~~i~D~-~lnst~~sia~~~~yi 308 (376) T protein:vir:37 232 SLETARYSVPMWYPDYD--GYYWADGRTLDVEGGDYQVIENLRVVDKVARKVRLLAIGKIADR-SFNSTTSSTEYHKNYF 308 (376) T ss_pred HHHhCCCeEEEeeCCCC--ceEEeCceEeccCCCChhhhhhhhHHHHHHHHHHHHHHHHhCCc-ccCcchhhHHHHHHHH Confidence 99999999999998765 34799999864 344 4444444444334443333333322 2223566788888889 Q ss_pred HHHHHHHHhcCceecccccCccccccccccccccceeEEcCchHhCCHHHHhccccCCeEEEEEECceEEEEEEEEEecC Q lcl|NC_020841. 288 VNGLEEAVKAGLVAAGTWNGAALGEIETYDYLPTGYYVYNESIRDQAQVIREQRIAPPFIILVKGAGAIHDTDITLIPEA 367 (367) Q Consensus 288 ~~vl~~a~~~G~I~~g~~~~~~~g~~~~~~~~~~gy~v~~~~~~~~~~~dr~~R~~~~~~~~~~~agaIh~v~i~~~v~~ 367 (367) ..+|++..+.+.|..-. .+| .|..|.-.++..+-....+. .|++..+.=|--..+++.|-.+- T Consensus 309 ~~pLr~M~~s~~i~g~~---------------fpG-eI~~p~d~Di~i~w~s~~~V-~I~~~v~P~~~pk~Itv~I~Ldl 371 (376) T protein:vir:37 309 AKPLRDMSKSATINGKD---------------FPG-ECMPPKDDAITIVWQSKTKV-TIYIKVRPYDCPKEITANIFLDL 371 (376) T ss_pred HHHHHHHHhcchhcccc---------------ccc-eeecCCCCCceEEeeccceE-EEEEEEEeccCCceEEEEEEeec Confidence 99999887776664210 112 24444433333332233322 35555555555555554444444 No 64 >protein:vir:4463 Length: 498 # NCBI annotation: Tail Sheath protein # Family: family:all:369 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700386;genbank:gi:23505458;genbank:GeneID:955665 Probab=89.28 E-value=0.027 Score=29.16 Aligned_cols=344 Identities=13% Similarity=0.079 Sum_probs=170.0 Q ss_pred Cccc-cccccc-ceEEEEEeeec-cccccccccceEEEeecccc---CcccceEEEecHHHHHhccCCCcHHHHHHHHHh Q lcl|NC_020841. 1 MAGS-LTLPIN-MLVNVSIEYQA-KLLSRDAFNRLLIVGSTAPN---GRATDTGIYTSIDGVKLDYGVEADEYKIAQKYF 74 (367) Q Consensus 1 ~~~~-~~l~i~-~iv~V~i~~~~-~~~~~~~fg~~li~~~~~~~---~~~~~~~~yts~~~v~~df~~~s~~ykaA~~~F 74 (367) |.++ ...|-| ++=-+-+.++. .+-...+-...|++|..... ......++ +|.++...-||..|-.-.+++.+. T Consensus 1 M~IsF~~IP~~iRvP~~y~E~dns~A~~~~~~q~vLiiGq~la~gs~~~~~~v~v-~s~~~a~~~fG~GSml~~M~~a~~ 79 (498) T protein:vir:44 1 MAISFNSIPSDTRVPLFYAEMDNSAANTARDSGASLLIGHASNDASIAVNSLVLV-SSVDYARQICGAGSQLARMVGAYR 79 (498) T ss_pred CCCchhhcCcccccCeEEEEEeCCCCCCCcCCcceEEEEecCcccccccceeEee-cCHHHHHHhcCcccHHHHHHHHHH Confidence 6665 444533 11133455543 34444555678899876432 23445455 577777788999999888888877 Q ss_pred ccCc------------------------------ccceEEEEec------cCccchHH---------------------- Q lcl|NC_020841. 75 SQNP------------------------------KPRDLMIATV------TALTDPLA---------------------- 96 (367) Q Consensus 75 ~Q~p------------------------------~p~~v~v~~~------~~~~t~~~---------------------- 96 (367) .-.| .+-.++|++. .+.+++.+ T Consensus 80 ~~n~~~~l~~i~~~D~aG~aAtg~it~tg~at~~G~l~l~Igg~~v~v~V~~gdTaa~vA~al~aaina~~~lPVTA~~~ 159 (498) T protein:vir:44 80 KTDPFGELYVIAVPESTGAAATVALTVTGEATETGTVNVYTGRTRVQAPVTSGDDAAAVAVSIKDAVNANPDLPFTATSE 159 (498) T ss_pred HhCCCceeEEEecCCcccceeEEEEEeecccCCCcEEEEEECCEEEEEEecCCCCHHHHHHHHHHHHhCCCCCceEEeec Confidence 6322 1112334321 11111111 Q ss_pred ----------------------------------------------------HHHHHHhcccCcEEEEEEecCCHHHHHH Q lcl|NC_020841. 97 ----------------------------------------------------SIGEVAAKTLGFYAFCFASEVAAADIQG 124 (367) Q Consensus 97 ----------------------------------------------------~l~~~~~~~~~w~~~~~~~~~~~~~~~a 124 (367) .-.++......||.++++.-.+.+.+.+ T Consensus 160 ~~~vtlTAr~kG~~GN~I~l~~~~~~~~~ge~~p~Glt~titamsgGag~PDia~alaal~~~~~~~i~~p~~D~asl~a 239 (498) T protein:vir:44 160 AGVVTLTARHKGLYGNEIPVTLNYYGFGGGEVLPAGVNITVASGVKGAGAPALNDAVAAMGDEPFDYIGLPFNDTASVNS 239 (498) T ss_pred cceEEEEEeccCcccCcceEEEeeccCccccccccceeEEEEcccCCccCchhHHHHHhhccCCccEEEEeecCHHHHHH Confidence 1123333344555555544445566666 Q ss_pred HHHHhhc-------cCcEEEEEEeCchh-hhHHHHHHHHhccccceeecC-C---chh-HHHHHHHHHHHcccccCccee Q lcl|NC_020841. 125 LAEWAQS-------NNRMFMTVMTDDTE-AVTTGNALKELGQYHYCITYH-E---DYA-TVGAVAGMALDQRYDKTDGVK 191 (367) Q Consensus 125 la~~~ea-------~~~~~~~~~~d~~~-~~~~~~~~~~~~~~~~~~~~~-~---~~~-~~~~~~~~~~~~~~~~~~g~~ 191 (367) +..+++. -..++......... .......-...+..|..+..+ + ++. ..++..+..++.....+|.+ T Consensus 240 l~~~L~~~sgRw~~~~q~~g~~~~a~~gT~a~l~t~g~~~N~~~it~~~~~~~~~sp~~~~AAa~a~~aA~~l~~DPAr- 318 (498) T protein:vir:44 240 MATEMNDSSGRWSYVRQLYGHVYTAKTGTLSELVAAGDQFNLQHITLAGYEKDTQTPADELAASRTARAAVFIRNDPAR- 318 (498) T ss_pred HHHHHhhhhcchHHHhhcCeEEEEeccCCHHHHHHhhhccCCceEEEEecCCCCCCHHHHHHHHHHHHHHHHhhccccc- Confidence 7766643 12233222221111 111111222334455544433 2 222 22222333333333555533 Q ss_pred eeeeeecCccccc----CCCHHHHHHHHhCCceEEEEeeccccceEEEecCEee-----CC----chhh--HHHHHHHHH Q lcl|NC_020841. 192 TLHLKSLVSVVST----DISQTQAASLKAACINYYSDYGNPDNSLPIFANGHAG-----GG----KFFD--FVMGFDWLR 256 (367) Q Consensus 192 t~~~k~l~Gv~~~----~~t~t~~~~l~~~~~n~y~~~~~~~~~~~~~~~G~~~-----~G----~~iD--~~~~~dwl~ 256 (367) ++.--.|+||.|. .++.+|.+.|..+|+..+..- . +...+.+..++ .| .|.| .++-.++++ T Consensus 319 PL~tl~L~Gi~~p~~~~r~~~~ern~LL~~Gist~~V~--~--G~V~I~R~ITTY~~n~~G~~D~syLDi~T~~tl~yvr 394 (498) T protein:vir:44 319 PTQTGELVDMLPAPKGKRFTTTEQQTLLSHGVATAYVE--S--GVLRIQRDITTYRKNAYGVADNSYLDSETLHTSAYVL 394 (498) T ss_pred ccCceeecccccCCchhcCChHHHHHHHhcCcceEEEc--C--CeEEEEeeeeeeeecCCCCcchhhhhhhhHHHHHHHH Confidence 3333467888854 468899999999999998542 2 23456677663 34 3655 899999999 Q ss_pred HHHHHHHHHHHHhcCCCCcCH----hH-----HHHHHHHHHHHHHHHHhcCceecccccCccccccccccccccceeEEc Q lcl|NC_020841. 257 NVIETNVFNGQRLRRLTPQTD----RG-----MMMIKADIVNGLEEAVKAGLVAAGTWNGAALGEIETYDYLPTGYYVYN 327 (367) Q Consensus 257 ~~lq~~l~~ll~~~~kipy~~----~G-----~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~g~~~~~~~~~~gy~v~~ 327 (367) ..++..+..-+- ..|+.=++ .| -..|++.+-..+++....|++..-+..-...-.....+...+ ..+.. T Consensus 395 ~~~r~~i~~kfp-R~KLa~d~~~~~~gq~IvTp~~ir~eli~~y~~le~~givEn~~~~~~~LiVerd~~dpnR-ln~~~ 472 (498) T protein:vir:44 395 RRLKSVITSKYG-RHKLANDGTRFGSGQAIVTPAVIRGELGSTYRQMEREGIVENFDLFQQHLIVERNANDSNR-LDVLF 472 (498) T ss_pred HHHHHHhhhhcC-CcccccCCcccCCCcccccHHHHHHHHHHHHHhhhhhccccChhhhcceeEEEECCCCCcE-EEEEe Confidence 999999976663 33333221 12 257899999999999999998543211111000000000000 11111 Q ss_pred -CchHhCCHHHHhccccCCeEEEEEECce Q lcl|NC_020841. 328 -ESIRDQAQVIREQRIAPPFIILVKGAGA 355 (367) Q Consensus 328 -~~~~~~~~~dr~~R~~~~~~~~~~~aga 355 (367) ++..++ =|---..-.+.+.|.-++| T Consensus 473 p~d~vn~---L~V~A~~~~f~lq~~~~~~ 498 (498) T protein:vir:44 473 PPDYVNQ---LRVFAVLNQFRLQYSEEAA 498 (498) T ss_pred cccccCc---hhhhhhhhhhhhhhhhhcC Confidence 111111 1111111123334444444 No 65 >protein:vir:3751 Length: 376 # NCBI annotation: orf23 # Family: family:all:669 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043492;genbank:gi:9628627;genbank:GeneID:1261129 Probab=86.51 E-value=0.045 Score=27.96 Aligned_cols=329 Identities=15% Similarity=0.065 Sum_probs=152.8 Q ss_pred cccccceEEEEEeeec-cccccccccceEEEeeccccCcccceEEEecHHHHHhccCCCcHHHHHHHHHhccCcccce-E Q lcl|NC_020841. 6 TLPINMLVNVSIEYQA-KLLSRDAFNRLLIVGSTAPNGRATDTGIYTSIDGVKLDYGVEADEYKIAQKYFSQNPKPRD-L 83 (367) Q Consensus 6 ~l~i~~iv~V~i~~~~-~~~~~~~fg~~li~~~~~~~~~~~~~~~yts~~~v~~df~~~s~~ykaA~~~F~Q~p~p~~-v 83 (367) -+| .|+|+ +++. ++..+.-=-..|++|..+... ....-...-+++..-+++.+...|.=-........... . T Consensus 1 ~~~---~v~vn-~ln~~qg~~~~ver~~lfig~~~~~~--~~~~~~~~~sdld~~lg~~ds~lk~~v~aa~~naG~~w~a 74 (376) T protein:vir:37 1 MFP---SVQIN-ALNQLSGETKEIERHALFVGVGTTNQ--GKLLALTPDSDFDKVFGETDTDLKKQVRAAMLNAGQNWFA 74 (376) T ss_pred CCC---eEEEe-eeeccCCCcccccceEEEeecccccc--CceEEecCCCChHHhhCCCchhHHHHHHHHHhCCCCceEE Confidence 122 23343 1221 222222112457777766421 22222333344444456665555542222222212222 1 Q ss_pred EEEec-cCccchHHHHHHHHhcccCcEEEEEEec--CCHHHHHHHH---HHhhcc-CcEEEEEEeC------chhhhH-- Q lcl|NC_020841. 84 MIATV-TALTDPLASIGEVAAKTLGFYAFCFASE--VAAADIQGLA---EWAQSN-NRMFMTVMTD------DTEAVT-- 148 (367) Q Consensus 84 ~v~~~-~~~~t~~~~l~~~~~~~~~w~~~~~~~~--~~~~~~~ala---~~~ea~-~~~~~~~~~d------~~~~~~-- 148 (367) .+-.. .+..+..+++... +....+-+..++.. ++.+++.++. .-..+. .+..|+...- .....+ T Consensus 75 ~~~~p~~~~~~~~~Av~~a-~~~~s~E~V~v~~p~~t~~a~i~a~qa~a~el~~~~~R~vffile~~g~d~~~~~ge~w~ 153 (376) T protein:vir:37 75 HVYIAQEDGYDFVECVKKA-NQTASFEYCVNTRYLGVDKASIGKLQECYAELLAKFGRRTFFIQAVQGINHDQSDGETWD 153 (376) T ss_pred EEEecCCChhhHHHHHHHH-HhhCCeeEEEEecCcchhHHHHHHHHHHHHHHHHhcCCeEEEEEeccCCCCcccccCCHH Confidence 11111 2345666777655 44555555555553 3456655553 333333 3333333321 111111 Q ss_pred -HHHHHH----Hhccccc--eeecCCchhHHHHHHHHHH--HcccccCcceee-eeeeecCccc-c-----cCCCHHHHH Q lcl|NC_020841. 149 -TGNALK----ELGQYHY--CITYHEDYATVGAVAGMAL--DQRYDKTDGVKT-LHLKSLVSVV-S-----TDISQTQAA 212 (367) Q Consensus 149 -~~~~~~----~~~~~~~--~~~~~~~~~~~~~~~~~~~--~~~~~~~~g~~t-~~~k~l~Gv~-~-----~~~t~t~~~ 212 (367) -...+. .+...+. +..++++ ..+-++|+++ ++.-...||++- -..+.+.-++ | ..++.+.+. T Consensus 154 ~y~~~l~a~~~gia~~~V~vV~~~~gn--~~G~~aGRl~naaVsVadspgRV~tGai~gl~~~~~p~d~~g~el~~a~l~ 231 (376) T protein:vir:37 154 QYVQKLTTLQQTIVADHVCLVPLLFGN--ETGVLAGRLANRAVTVADSPARVQTGALVSLGSANKPLDKDGNELTLAHLK 231 (376) T ss_pred HHHHHHHHHhccccccceeeeeeeccc--hHHHHHHHHHhCCcchhcCccceeecccccccccccccccCCcccchHHHH Confidence 112222 2223322 2333443 4556677764 333345666532 2222222121 1 236788999 Q ss_pred HHHhCCceEEEEeeccccceEEEecCEeeC---Cc--hhhHHHHHHHHHHHHHHHHHHHHHhcCCCCcCHhHHHHHHHHH Q lcl|NC_020841. 213 SLKAACINYYSDYGNPDNSLPIFANGHAGG---GK--FFDFVMGFDWLRNVIETNVFNGQRLRRLTPQTDRGMMMIKADI 287 (367) Q Consensus 213 ~l~~~~~n~y~~~~~~~~~~~~~~~G~~~~---G~--~iD~~~~~dwl~~~lq~~l~~ll~~~~kipy~~~G~~~l~~~v 287 (367) +|+++|+.+.-.|.+-. ..++.+|.++. |+ +|-.++=.|=...+++.....-+. ...+--+..+++..+..+ T Consensus 232 aLd~arysvpr~Y~gyd--G~Yw~dg~tl~~~gsDYq~ie~~RVvdKa~R~vR~~Ai~~i~-Dr~lnstp~sia~~~~~~ 308 (376) T protein:vir:37 232 SLETARYSVPMWYPDYD--GYYRADGRTLDVEGGDYQVIENLRVVDKVARKVRLLAIGKIA-DRSFNSTTSSTEYHKNYF 308 (376) T ss_pred HHHhCCCeEEEeeCCCC--ceEEeCCeEeccCCCCeeeehhchHHHHHHHHHHHHHHHHhc-CccccCChhHHHHHHHHH Confidence 99999999999998754 34799999963 44 576666666666666655444443 334677889999999999 Q ss_pred HHHHHHHHhcC----ceecccccCccccccccccccccceeEEcCchHhCCHHHHhccccCCeEEEEEECceEEEE--EE Q lcl|NC_020841. 288 VNGLEEAVKAG----LVAAGTWNGAALGEIETYDYLPTGYYVYNESIRDQAQVIREQRIAPPFIILVKGAGAIHDT--DI 361 (367) Q Consensus 288 ~~vl~~a~~~G----~I~~g~~~~~~~g~~~~~~~~~~gy~v~~~~~~~~~~~dr~~R~~~~~~~~~~~agaIh~v--~i 361 (367) ..+|++..+.+ ..-||+...+..+... +.- ..|..-.|.+..+.=+-=..+ +| T Consensus 309 ~~pLr~M~ks~ei~g~~fpgei~~P~d~dI~----------i~w-----------~sk~~V~I~~~vrPy~cpk~i~~~I 367 (376) T protein:vir:37 309 AKPLRDMSKSATINGKDFPGECMPPKDDAIT----------IVW-----------QSKTKVTIYIKVRPYDCPKEITANI 367 (376) T ss_pred hHHHHHHHhhhhhccccccceeecCCCCceE----------EEe-----------ccCceEEEEEEEeeecCcceeEEEE Confidence 99999986654 3344444433322210 000 001111122222111111222 22 Q ss_pred EEEecC Q lcl|NC_020841. 362 TLIPEA 367 (367) Q Consensus 362 ~~~v~~ 367 (367) -+.... T Consensus 368 ~LDls~ 373 (376) T protein:vir:37 368 FLDLDS 373 (376) T ss_pred EEecCC Confidence 222222 No 66 >protein:vir:78782 Length: 370 # NCBI annotation: putative tail sheath protein # Family: family:all:669 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285652;genbank:gi:148727158;genbank:GeneID:5220132 Probab=84.86 E-value=0.057 Score=27.40 Aligned_cols=332 Identities=13% Similarity=0.059 Sum_probs=156.3 Q ss_pred cccccceEEEEEeeec-cccccccccceEEEeeccccCcccceEEEecHHHHHhccCCCcHHHHHHHHHhccCcccce-E Q lcl|NC_020841. 6 TLPINMLVNVSIEYQA-KLLSRDAFNRLLIVGSTAPNGRATDTGIYTSIDGVKLDYGVEADEYKIAQKYFSQNPKPRD-L 83 (367) Q Consensus 6 ~l~i~~iv~V~i~~~~-~~~~~~~fg~~li~~~~~~~~~~~~~~~yts~~~v~~df~~~s~~ykaA~~~F~Q~p~p~~-v 83 (367) -.| .|.|+ .++. ++....-=-..|++|..+.. . ...-...+.+++..-++..+...|.=-..+........ . T Consensus 1 ~~~---~v~vn-~~n~~~g~~~~~er~~lfig~~~~~-~-g~~~~~~~~sdld~~l~~~ds~lk~~v~aa~~naG~~~~~ 74 (370) T protein:vir:78 1 MWP---YVQIY-NLNQMQGPVTEVERHLLFIGSAASN-T-GKLLSLNAQSDFDQLLGAADSELKANLLAARDNAGQNWSA 74 (370) T ss_pred CCc---eEEEe-eccccCCCcCccceeEEEEeccccc-c-cceEeecCccCHHHhcCCcChhHHHHHHHHHhCCCCceEE Confidence 122 23343 1221 22111111234666655421 1 11222333344444456665555553333333322221 2 Q ss_pred EEEeccCccchHHHHHHHHhcccCcEEEEEEecC-CHHHHHHHHHHhhc----cCcEEEEEEe--CchhhhH---HHHHH Q lcl|NC_020841. 84 MIATVTALTDPLASIGEVAAKTLGFYAFCFASEV-AAADIQGLAEWAQS----NNRMFMTVMT--DDTEAVT---TGNAL 153 (367) Q Consensus 84 ~v~~~~~~~t~~~~l~~~~~~~~~w~~~~~~~~~-~~~~~~ala~~~ea----~~~~~~~~~~--d~~~~~~---~~~~~ 153 (367) ++.-..+.++..+++... +....+-+..++... +.+++.++...++. ..+..++... ......+ -...+ T Consensus 75 ~~~p~~~~~d~~~Av~~a-~~~~s~E~V~v~~~~s~~a~~~a~~~~a~el~n~~~Rpv~file~~~~~~~e~w~~y~~~l 153 (370) T protein:vir:78 75 AAYVLPTDKPWLDAARDA-QQTQSFEGVVVLGQEWHQAAINAAHALNQELIAKWGRWQFMLLAVPAIADEQDWATYEAEL 153 (370) T ss_pred EEEEecCchhHHHHHHHH-HhhCCccEEEEecCcchHHHHHHHHHHHHHHHHhcCCeEEEEEeecCCCCcCCHHHHHHHH Confidence 233344556677777555 344555445555543 34666665444432 2233333332 1111111 11112 Q ss_pred ----HHhcccc--ceeecCCchhHHHHHHHHHH--HcccccCccee-eeeeeecCccc-----ccCCCHHHHHHHHhCCc Q lcl|NC_020841. 154 ----KELGQYH--YCITYHEDYATVGAVAGMAL--DQRYDKTDGVK-TLHLKSLVSVV-----STDISQTQAASLKAACI 219 (367) Q Consensus 154 ----~~~~~~~--~~~~~~~~~~~~~~~~~~~~--~~~~~~~~g~~-t~~~k~l~Gv~-----~~~~t~t~~~~l~~~~~ 219 (367) +.+...+ .+..+|++ ..+.++|+++ ++.-...|+++ |-..+.+ |.. ...++.+.+++|+++|| T Consensus 154 ~al~~gia~~~V~vvp~~~g~--~~G~~aGRL~naavsVadsP~Rv~tG~l~gl-~~~p~d~~~~~l~~a~l~aLd~agy 230 (370) T protein:vir:78 154 ATLQDGIAASSVSLIPQLWPT--LAGAYAGRLCNRAVSIADSPCRVKTGALVGL-GNKPVGKDGIPLPLATLQTLEANRY 230 (370) T ss_pred HHhhhccccccceEEeeeccc--cHHHHHHHHhcCeeeecccceeeeccccccc-cccccccCCcccCHHHHHHHHhCCC Confidence 2222322 23333433 2345566543 22222333332 1122221 111 23478899999999999 Q ss_pred eEEEEeeccccceEEEecCEeeC---Cch--hhHHHHHHHHHHHHHHHHHHHHHhcCCCCcCHhHHHHHHHHHHHHHHHH Q lcl|NC_020841. 220 NYYSDYGNPDNSLPIFANGHAGG---GKF--FDFVMGFDWLRNVIETNVFNGQRLRRLTPQTDRGMMMIKADIVNGLEEA 294 (367) Q Consensus 220 n~y~~~~~~~~~~~~~~~G~~~~---G~~--iD~~~~~dwl~~~lq~~l~~ll~~~~kipy~~~G~~~l~~~v~~vl~~a 294 (367) .+...|.+-. -.++.+|.++. |+| |-..+=.|=....+.......+.+. ++==++..++..+......|++. T Consensus 231 ~vp~~Y~gy~--G~Y~~d~~tl~~~gsDYq~ie~~RVvdKa~R~vR~~ai~~i~D~-~lnst~gsia~~~~~~~~~L~em 307 (370) T protein:vir:78 231 SVPMWYPDYD--GIYWADGRTLDAEGGDYQVIENLRIAYKVARRMRLRAIARIGDR-SFNSTPGSTAAAITYFGKDLREM 307 (370) T ss_pred eEEEeeCCCC--ceEEeCceEeccCCCChhhhhhhhHHHHHHHHHHHHHHHHhCCc-ccCCCCcchhHHHHHHHhhHHHH Confidence 9999998765 34799999863 355 4444444444444443344434332 22224467888888899999998 Q ss_pred HhcCceecccccCccccccccccccccceeEEcCchHhCCHHHHhccccCCeEEEEEECceEEEEEEEEEecC Q lcl|NC_020841. 295 VKAGLVAAGTWNGAALGEIETYDYLPTGYYVYNESIRDQAQVIREQRIAPPFIILVKGAGAIHDTDITLIPEA 367 (367) Q Consensus 295 ~~~G~I~~g~~~~~~~g~~~~~~~~~~gy~v~~~~~~~~~~~dr~~R~~~~~~~~~~~agaIh~v~i~~~v~~ 367 (367) .+.+-|..-+ -+| +|.+|.-.++..+-...++. .|++....=|-=..+++.|-.+- T Consensus 308 a~s~~i~~~~---------------fpg-eI~~p~d~Di~i~w~s~~~v-~I~~~v~P~~~pk~Itv~I~LDl 363 (370) T protein:vir:78 308 AKSTTINGQP---------------FPG-DIASPQDGDIRIQWVAKNLV-SVFVVVRTVDCPKGITVNIMLDL 363 (370) T ss_pred Hhhhhhcccc---------------cce-eEeccCCCcceEEeeccceE-EEEEEEEeccCCceEEEEEEEee Confidence 8888875311 112 34444433333333333333 35566555555555555554444 No 67 >protein:vir:1996 Length: 495 # NCBI annotation: major tail subunit # Family: family:all:369 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050643;genbank:gi:9633530;genbank:GeneID:2636269 Probab=66.18 E-value=0.27 Score=23.69 Aligned_cols=326 Identities=14% Similarity=0.072 Sum_probs=144.7 Q ss_pred Cccccccccc-ceEEEEEeeecccccc---------ccccceEEEeecc-ccC--cccceEEEecHHHHHhccCCCcHHH Q lcl|NC_020841. 1 MAGSLTLPIN-MLVNVSIEYQAKLLSR---------DAFNRLLIVGSTA-PNG--RATDTGIYTSIDGVKLDYGVEADEY 67 (367) Q Consensus 1 ~~~~~~l~i~-~iv~V~i~~~~~~~~~---------~~fg~~li~~~~~-~~~--~~~~~~~yts~~~v~~df~~~s~~y 67 (367) =++.++|-|- +.|.|.|.-...+..- ..-..|..-.... +.+ .......-..-++-..|.... T Consensus 115 ~~G~l~l~I~g~~v~v~V~~gdTaa~vA~al~aaina~~~lPvTA~~~~~~~~~~a~~~VtlTAr~kG~~n~idi~---- 190 (495) T protein:vir:19 115 ENGSLVTYIAGQRLAVSVAAGATGAALADLLVARIKGQPDLPVTAEVRADSGDDDTHADVVLSAKFTGALSAVDVR---- 190 (495) T ss_pred CCcEEEEEECCEEEEEEecCCCCHHHHHHHHHHHhcCCccCceEEEeeccCCCCcCceeEEEEEeeccccccceeE---- Confidence 5555555555 3445555443322211 1111122111100 000 000000000000000000000 Q ss_pred HHHHHHhccCccc--ceEEEEec---cCccchHHHHHHHHhcccCcEEEEEEecCCHHHHHHHHHHhhcc----CcEE-- Q lcl|NC_020841. 68 KIAQKYFSQNPKP--RDLMIATV---TALTDPLASIGEVAAKTLGFYAFCFASEVAAADIQGLAEWAQSN----NRMF-- 136 (367) Q Consensus 68 kaA~~~F~Q~p~p--~~v~v~~~---~~~~t~~~~l~~~~~~~~~w~~~~~~~~~~~~~~~ala~~~ea~----~~~~-- 136 (367) -.++.....| -.+.+-.. ....+...++.++. ..||.++++.-.+.+.+.+|.++++.. +.++ T Consensus 191 ---~~~~~ge~~p~Glt~titamsgGag~PDia~alaal~---~~~~~~I~~P~tD~asL~al~~~l~~rw~~~~q~~g~ 264 (495) T protein:vir:19 191 ---WNYYAGETTPYGIITAFKAASGKNGNPDISASIAGMG---DLQYKYIVMPYTDEPNLNLLRTELQERWGPVNQADGF 264 (495) T ss_pred ---EEeecccccccceeEEEEecCCCCCCcchHHHHHHhc---cCCCcEEEEecCcHHHHHHHHHHHHHhhhHHHhcCeE Confidence 0111122223 33333332 23334555555555 556666665545667778888888753 2222 Q ss_pred -EEEEeCchhhhHHHHHHHHhccccceeecCC---chh-HHHHHHHHHHHcccccCcceeeeeeeecCccccc----CCC Q lcl|NC_020841. 137 -MTVMTDDTEAVTTGNALKELGQYHYCITYHE---DYA-TVGAVAGMALDQRYDKTDGVKTLHLKSLVSVVST----DIS 207 (367) Q Consensus 137 -~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~---~~~-~~~~~~~~~~~~~~~~~~g~~t~~~k~l~Gv~~~----~~t 207 (367) +......-.... ..-...+..|..+..++ ++. ..++..+.+++.....+| +-++.--.|+||.|. .++ T Consensus 265 ~~~a~~gT~~~l~--t~g~~~N~~~it~~~~~gsp~~~~~~AAA~aa~~A~~l~~DP-ArPL~tl~L~Gi~~p~~~~r~~ 341 (495) T protein:vir:19 265 AVTVLSGTYGDIS--TFGVSRNDHLISCMGIAGAPEPSYLYAATLCAVASQALSIDP-ARPLQTLTLPGRMPPAVGDRFT 341 (495) T ss_pred EEEeecCCHHHHH--HhhhccCCceEEEEecCCCCCcHHHHHHHHHHHHHHHhhccc-ccccCceeecceecCCccccCC Confidence 222222222221 22223345554444432 222 222222223333334445 334444467888854 368 Q ss_pred HHHHHHHHhCCceEEEEeeccccceEEEecCEee-----CC----chhh--HHHHHHHHHHHHHHHHHHHHHhcCCCCcC Q lcl|NC_020841. 208 QTQAASLKAACINYYSDYGNPDNSLPIFANGHAG-----GG----KFFD--FVMGFDWLRNVIETNVFNGQRLRRLTPQT 276 (367) Q Consensus 208 ~t~~~~l~~~~~n~y~~~~~~~~~~~~~~~G~~~-----~G----~~iD--~~~~~dwl~~~lq~~l~~ll~~~~kipy~ 276 (367) .+|.+.|..+|+..+.--.+ +...+.+..++ .| .|.| .++-.++++..++..+..-+-. .|+.-+ T Consensus 342 ~~ern~LL~~Gist~~V~~~---G~V~I~R~ITTY~~n~~G~~D~syLDi~T~~tl~yvr~~~r~~i~~kfpR-~KLa~d 417 (495) T protein:vir:19 342 WSERNALLFDGISTFNVNDG---GEMQIERMITMYRTNKYGDSDPSYLNVNTIATLSYLRYSLRTRITQKFPN-YKLASD 417 (495) T ss_pred hHHHHHHHhCCcceEEECCC---CeEEEEeeeeeeeecCCCCcchhhhhhHHHHHHHHHHHHHHHHHhhhcCC-cccccC Confidence 89999999999998864321 22345555553 35 3655 8999999999999999877653 333322 Q ss_pred HhH---------HHHHHHHHHHHHHHHHhcCceecccccCccccccccccccccceeEEcCchHhCCHHHHhccccCCeE Q lcl|NC_020841. 277 DRG---------MMMIKADIVNGLEEAVKAGLVAAGTWNGAALGEIETYDYLPTGYYVYNESIRDQAQVIREQRIAPPFI 347 (367) Q Consensus 277 ~~G---------~~~l~~~v~~vl~~a~~~G~I~~g~~~~~~~g~~~~~~~~~~gy~v~~~~~~~~~~~dr~~R~~~~~~ 347 (367) ..+ -..|++.+-+++++....|++..-+..-.. +.+ .|+...+ .|-+=..|+-- T Consensus 418 ~~~~~~gq~IvTp~~ir~ell~~~~~le~~given~~~~~~~---------------LiV-erd~~dp-nRln~~~p~d~ 480 (495) T protein:vir:19 418 GTRFATGQAVVTPSVIKTELLALFEEWENAGLVEDFDTFKEE---------------LYV-ARNKDDK-DRLDVLCGPNL 480 (495) T ss_pred CCCCCCcccccChHHHHHHHHHHHHhhhhhccccChhhhcce---------------eEE-EECCCCC-cEEEEEeccee Confidence 221 257999999999999999998543211110 111 0111111 12221121110 Q ss_pred E--EEEECceEEEEE Q lcl|NC_020841. 348 I--LVKGAGAIHDTD 360 (367) Q Consensus 348 ~--~~~~agaIh~v~ 360 (367) + ....+|.|+.+= T Consensus 481 vn~L~V~A~~i~f~L 495 (495) T protein:vir:19 481 INQFRIFAAQVQFIL 495 (495) T ss_pred eCceeeeeeeeeeeC Confidence 0 011222222211 Done!