Query lcl|Aclame:protein:vir:80052|NCBI_annot:gp14|genbank:acc:YP_001468718;genbank:gi:157325298;genbank:GeneID:5601743 Match_columns 331 No_of_seqs 134 out of 242 Neff 8.4 Searched_HMMs 1612 Date Tue Dec 3 01:52:36 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_25 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_25_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:80052 Length: 331 100.0 3E-107 2E-110 604.7 39.0 331 1-331 1-331 (331) 2 protein:vir:5260 Length: 502 # 100.0 2.4E-92 1.5E-95 522.9 38.2 328 1-331 4-502 (502) 3 protein:vir:95263 Length: 450 100.0 3.5E-91 2.2E-94 516.5 36.7 321 1-331 1-449 (450) 4 protein:vir:3636 Length: 501 # 100.0 7.2E-85 4.5E-88 481.9 36.6 324 1-331 7-500 (501) 5 protein:vir:106730 Length: 501 100.0 1.5E-84 9.5E-88 480.1 36.6 324 1-331 7-500 (501) 6 protein:vir:78611 Length: 501 100.0 5.6E-84 3.4E-87 477.0 36.5 324 1-331 7-500 (501) 7 protein:vir:101576 Length: 501 100.0 1.8E-83 1.1E-86 474.2 36.5 324 1-331 7-500 (501) 8 protein:vir:99586 Length: 507 100.0 1.3E-83 7.9E-87 475.1 34.5 327 1-330 3-507 (507) 9 protein:vir:96104 Length: 504 100.0 4.9E-83 3.1E-86 471.8 35.6 328 1-330 3-504 (504) 10 protein:vir:94073 Length: 494 100.0 1.1E-81 6.6E-85 464.5 35.8 325 1-331 5-494 (494) 11 protein:vir:107720 Length: 515 100.0 1.2E-74 7.2E-78 425.9 34.1 326 1-330 2-515 (515) 12 protein:vir:3165 Length: 426 # 100.0 4.1E-62 2.5E-65 357.1 26.3 320 1-331 1-426 (426) 13 protein:vir:102359 Length: 356 99.5 2.7E-13 1.7E-16 89.4 26.1 313 1-329 1-356 (356) 14 protein:vir:102957 Length: 437 99.3 6.6E-12 4.1E-15 81.9 22.9 302 1-330 90-437 (437) 15 protein:vir:78986 Length: 436 99.2 1.7E-11 1.1E-14 79.6 20.3 300 1-330 111-436 (436) 16 protein:vir:6079 Length: 396 # 99.0 4E-09 2.5E-12 66.6 27.0 313 1-331 1-383 (396) 17 protein:vir:1845 Length: 392 # 99.0 4.2E-09 2.6E-12 66.5 26.2 314 1-331 1-380 (392) 18 protein:vir:103993 Length: 390 99.0 5.1E-09 3.1E-12 66.1 25.5 315 1-331 1-378 (390) 19 protein:vir:78206 Length: 390 99.0 5.1E-09 3.1E-12 66.1 25.5 315 1-331 1-378 (390) 20 protein:vir:98553 Length: 395 98.9 5.7E-09 3.5E-12 65.8 25.3 314 1-331 1-383 (395) 21 protein:vir:96586 Length: 587 98.9 4.1E-09 2.6E-12 66.5 24.3 309 1-331 206-582 (587) 22 protein:vir:5711 Length: 396 # 98.9 7.9E-09 4.9E-12 65.0 25.7 313 1-331 1-383 (396) 23 protein:vir:79181 Length: 390 98.9 1E-08 6.4E-12 64.4 26.0 313 1-331 1-378 (390) 24 protein:vir:2035 Length: 396 # 98.9 1.1E-08 6.8E-12 64.2 26.0 313 1-331 1-383 (396) 25 protein:vir:107310 Length: 581 98.9 1.9E-08 1.2E-11 62.9 26.4 310 1-331 175-566 (581) 26 protein:vir:95741 Length: 587 98.9 9.3E-09 5.8E-12 64.6 23.7 309 1-331 206-582 (587) 27 protein:vir:10336 Length: 386 98.9 1.8E-08 1.1E-11 63.0 25.0 312 1-331 1-379 (386) 28 protein:vir:79141 Length: 391 98.9 1.8E-08 1.1E-11 63.1 24.8 314 1-331 1-378 (391) 29 protein:vir:99306 Length: 587 98.8 2.3E-08 1.4E-11 62.5 25.0 310 1-331 206-582 (587) 30 protein:vir:105470 Length: 451 98.8 3.5E-09 2.2E-12 66.9 20.4 306 1-330 86-451 (451) 31 protein:vir:7653 Length: 581 # 98.8 3.6E-08 2.2E-11 61.4 28.0 307 1-331 201-566 (581) 32 protein:vir:1172 Length: 391 # 98.8 4.2E-08 2.6E-11 61.0 26.2 313 1-331 1-379 (391) 33 protein:vir:80488 Length: 562 98.8 1.6E-08 9.6E-12 63.4 22.3 301 1-331 206-557 (562) 34 protein:vir:63742 Length: 562 98.7 1.7E-08 1.1E-11 63.2 21.0 304 1-331 206-557 (562) 35 protein:vir:5833 Length: 742 # 98.7 1.1E-07 6.8E-11 58.7 27.3 308 1-331 389-736 (742) 36 protein:vir:80779 Length: 569 98.6 1.1E-07 6.9E-11 58.7 20.8 303 1-331 209-564 (569) 37 protein:vir:100323 Length: 393 98.6 3E-07 1.9E-10 56.3 28.3 314 1-331 3-380 (393) 38 protein:vir:79092 Length: 477 98.4 3.6E-07 2.2E-10 55.9 20.8 304 1-331 109-467 (477) 39 protein:vir:96740 Length: 388 98.4 8.4E-07 5.2E-10 53.9 29.4 314 1-331 4-377 (388) 40 protein:vir:100829 Length: 607 98.4 9.2E-07 5.7E-10 53.7 21.4 310 1-331 220-596 (607) 41 protein:vir:107865 Length: 477 98.4 1.1E-06 6.6E-10 53.3 23.9 302 1-331 109-467 (477) 42 protein:vir:80984 Length: 666 98.3 8E-07 4.9E-10 54.0 19.4 302 1-331 272-651 (666) 43 protein:vir:104858 Length: 729 98.2 2.9E-06 1.8E-09 51.0 25.1 314 1-331 317-717 (729) 44 protein:vir:108052 Length: 660 98.1 4.8E-06 3E-09 49.7 26.2 309 1-331 245-647 (660) 45 protein:vir:106984 Length: 743 98.0 7.7E-06 4.7E-09 48.6 26.3 291 1-331 351-732 (743) 46 protein:vir:103456 Length: 659 98.0 7.8E-06 4.8E-09 48.6 23.9 313 1-331 243-646 (659) 47 protein:vir:98824 Length: 774 98.0 9.4E-06 5.8E-09 48.1 21.0 310 1-331 371-767 (774) 48 protein:vir:102819 Length: 648 97.9 1.1E-05 7E-09 47.7 20.0 310 1-331 206-645 (648) 49 protein:vir:3788 Length: 376 # 97.8 2.2E-05 1.3E-08 46.2 28.4 318 1-331 1-371 (376) 50 protein:vir:79798 Length: 717 97.6 3.5E-05 2.2E-08 45.0 24.3 307 1-331 309-717 (717) 51 protein:vir:6894 Length: 660 # 97.6 3.7E-05 2.3E-08 44.9 26.1 291 1-331 274-646 (660) 52 protein:vir:104477 Length: 749 97.6 3.7E-05 2.3E-08 44.9 25.8 315 1-331 365-739 (749) 53 protein:vir:6594 Length: 666 # 97.6 4.4E-05 2.7E-08 44.5 25.0 296 1-331 272-651 (666) 54 protein:vir:5663 Length: 671 # 97.4 8.3E-05 5.1E-08 43.0 26.4 309 1-331 287-661 (671) 55 protein:vir:7206 Length: 659 # 97.2 0.00012 7.4E-08 42.1 30.6 313 1-331 227-646 (659) 56 protein:vir:489 Length: 498 # 97.2 0.00012 7.6E-08 42.0 21.6 313 1-319 80-498 (498) 57 protein:vir:98263 Length: 664 97.1 0.00016 9.8E-08 41.4 24.9 303 1-331 273-650 (664) 58 protein:vir:4517 Length: 498 # 97.0 0.0002 1.2E-07 40.9 24.4 313 1-319 80-498 (498) 59 protein:vir:100539 Length: 663 96.9 0.00028 1.7E-07 40.1 26.1 303 1-331 272-648 (663) 60 protein:vir:101187 Length: 663 96.9 0.00029 1.8E-07 39.9 28.6 310 1-331 229-648 (663) 61 protein:vir:78782 Length: 370 96.7 0.00041 2.5E-07 39.2 28.7 319 1-331 1-363 (370) 62 protein:vir:106427 Length: 679 96.5 0.0006 3.7E-07 38.3 26.9 308 1-331 235-665 (679) 63 protein:vir:4463 Length: 498 # 96.4 0.00071 4.4E-07 37.8 23.3 313 1-319 80-498 (498) 64 protein:vir:101804 Length: 663 96.3 0.00073 4.5E-07 37.8 26.5 306 1-331 272-648 (663) 65 protein:vir:276 Length: 369 # 95.9 0.0012 7.6E-07 36.6 29.2 317 1-331 1-366 (369) 66 protein:vir:1996 Length: 495 # 93.0 0.0093 5.8E-06 31.7 26.4 318 1-331 83-495 (495) 67 protein:vir:3751 Length: 376 # 88.6 0.031 1.9E-05 28.8 29.2 318 1-331 1-371 (376) No 1 >protein:vir:80052 Length: 331 # NCBI annotation: gp14 # Family: family:all:396 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468718;genbank:gi:157325298;genbank:GeneID:5601743 Probab=100.00 E-value=2.9e-107 Score=604.69 Aligned_cols=331 Identities=100% Similarity=1.380 Sum_probs=325.8 Q ss_pred CCCceeeEEEEEeecccccccccceeEEEEccCCcceEEEechhhhccCCCCChHHHHHHHHHHccCCCcceEEEEeccc Q lcl|Aclame:pro 1 MVETITDVRVHISVLYPSPRIGLGRPAIFVKGTAMGYKEYTTLEELKDTFADNTEVYAKAKAVFLQKDRPDTVAVITYED 80 (331) Q Consensus 1 ~v~~i~dV~v~i~~~~~~~~~~fg~~li~~~~~~~~~~~yts~~~v~~~f~~~s~~ykaA~~~fsQ~~~p~~v~v~~~~~ 80 (331) ||+||+||+|++++.+|+++.+++.+++|..++++++|+|+++++|+.||++++++||+|+++|+|+|+|.+++|+++.+ T Consensus 1 ~~~~iv~V~v~~~~~~~~~~~~~~~~~~~~~~t~~~~~~y~s~~~v~~d~~~~~~~Ykaa~~~f~Q~~~~~~i~v~~~~~ 80 (331) T protein:vir:80 1 MVETITDVRVHISVLYPSPRIGLGRPAIFVKGTAMGYKEYTTLEELKDTFADNTEVYAKAKAVFLQKDRPDTVAVITYED 80 (331) T ss_pred CccceecceeeecccccccccccCcceeEEeccccceEEEechhhhccCCCCCcHHHHHHHHHHhccCccceEEEeccch Confidence 99999999999999999989999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhHHHHHHHhhcCceeEEEEecCCHHHHHHHHHHHHhcCcEEEEEEeCChHHHHhhcccceEEEEEeCCCchhHHHHHHH Q lcl|Aclame:pro 81 TKLLEAAEAYFLKSWHFALLAEFKAADALALSNLIEEQKFKFAVFQVTAVADITPLAKNTRTIAIVHSKTGEKLDAALIG 160 (331) Q Consensus 81 ~~~~~al~~~~~~~~~f~~~~~~~~~~i~alA~w~ea~~~~~~~~~~t~~~~~~~~~~~~~t~~~~~~~~~~~~~aa~~g 160 (331) ++.+.++.+..+++|||+++.++++++++++|+|+|+++++|+++.+++++++.+.+++.|+++++|+.+++|++++++| T Consensus 81 ~~~~~a~~a~~~~~w~~~~~~~~~~~~~~a~a~~~~a~~~~f~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~aa~~g 160 (331) T protein:vir:80 81 TKLLEAAEAYFLKSWHFALLAEFKAADALALSNLIEEQKFKFAVFQVTAVADITPLAKNTRTIAIVHSKTGEKLDAALIG 160 (331) T ss_pred HHHHHHHHHhccCceeEEEeecCCHHHHHHHHHHHhhCCcEEEEEecCchHHHHHhhccccEEEEEcCCccchhHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHhcCCccceeeeeeeccCCcCCCCCCHHHHHHHHhCCCeEEEEEcCeeEEecCEEeCCceehhhHHHHHHHHHHHHHHH Q lcl|Aclame:pro 161 NVASLPVGSATWKGRHGLAGITSEELKVSEIDAIQKAGGMCYIEKAGIAQTSEGKTVSGEFIDSIHGDDWIKATIETRLQ 240 (331) Q Consensus 161 ~~~~~~~G~~t~~~k~~l~gv~~~~~t~t~~~~l~~~~~n~y~~~~g~~~~~~G~~~~G~~iD~~~~~dwl~~~iq~~l~ 240 (331) ++++.+||++||+||++|+||+|++++.+|+++|+++|||||++++|..++++|++++|+|||++||+|||+++||++|+ T Consensus 161 ~~~~~~~g~~t~~fk~~l~GV~~~~lt~t~~~al~~~~~N~y~~~~~~~~~~~G~~~~G~~iD~~~~~dWl~~~lq~~l~ 240 (331) T protein:vir:80 161 NVASLPVGSATWKGRHGLAGITSEELKVSEIDAIQKAGGMCYIEKAGIAQTSEGKTVSGEFIDSIHGDDWIKATIETRLQ 240 (331) T ss_pred HHHhcCccceeeeeecccCCCCCCCCCHHHHHHHHhcCceEEEEecCeeEEecceEeCchhHHHHHHHHHHHHHHHHHHH Confidence 99999999999999988999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHhcCCCCCcCHHHHHHHHHHHHHHHHHHHhcCcccccccCCCcceEEEccchhcCCHHHHHhcccCCeEEEEEEcceE Q lcl|Aclame:pro 241 KLLTETDKLTFDARGIALLQSELTTVLNEGFANGIIDSNDETGEPNFSITALQRSDLNDDDIAKRNYKGLSFRYKRSGAI 320 (331) Q Consensus 241 ~l~~~~~kipyt~~G~~~i~~~v~~vl~~~~~~G~I~~g~~~~~~~~~v~~~~~~~~~~~dr~~R~~~~i~~~~~~aGaI 320 (331) ++|++++|||||+.|+++|+++|+++|++++++|+|+||+++++++|+|+.|++++++++||++|++||++|+|+++||| T Consensus 241 ~ll~~~~kiPy~~~G~~~l~a~i~~~~~~av~~G~I~~g~~~~~~~~~v~~~~~~~~s~~dr~~R~~~~~~~~~~~~gaI 320 (331) T protein:vir:80 241 KLLTETDKLTFDARGIALLQSELTTVLNEGFANGIIDSNDETGEPNFSITALQRSDLNDDDIAKRNYKGLSFRYKRSGAI 320 (331) T ss_pred HHHHhCCCCccChhhHHHHHHHHHHHHHHHHhCCceecCccCCCcceEEEeCchhcCCHHHHhccCCCCeEEEEEEcceE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEEEEEEeC Q lcl|Aclame:pro 321 HSVDVYGEVEV 331 (331) Q Consensus 321 h~v~i~~~v~~ 331 (331) |.|+|+++|+| T Consensus 321 ~~v~i~~~v~~ 331 (331) T protein:vir:80 321 HSVDVYGEVEV 331 (331) T ss_pred EEEEEEEEEeC Confidence 99999999999 No 2 >protein:vir:5260 Length: 502 # NCBI annotation: hypothetical protein # Family: family:all:396 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852765;genbank:gi:31544040;uniprot:Q7Y5T5;genbank:GeneID:2753559 Probab=100.00 E-value=2.4e-92 Score=522.85 Aligned_cols=328 Identities=21% Similarity=0.282 Sum_probs=289.2 Q ss_pred CCCceeeEEEEEeecccccccccceeEEEEccC-------CcceEEEechhhhccCCCCChHHHHHHHHHHccCCCcceE Q lcl|Aclame:pro 1 MVETITDVRVHISVLYPSPRIGLGRPAIFVKGT-------AMGYKEYTTLEELKDTFADNTEVYAKAKAVFLQKDRPDTV 73 (331) Q Consensus 1 ~v~~i~dV~v~i~~~~~~~~~~fg~~li~~~~~-------~~~~~~yts~~~v~~~f~~~s~~ykaA~~~fsQ~~~p~~v 73 (331) =+|+|++|++++.+.+ .++++|+.+|||++.+ .+++++|+++++|..+|+.++||||+|.+||+|.|+|.+| T Consensus 4 p~s~ivnV~i~~~~~a-~~~~~f~~~l~l~~~~~~~~~~~~~r~~~~~s~~~V~~~FG~~s~ey~aA~~yF~q~p~P~~l 82 (502) T protein:vir:52 4 SISHIVNVQLNTVPKS-AARKSFGIVALFTPEAGQAFADEKTRYVYVENQRDVEQLFGTNSETAKAAQPFFAQSPRAKQL 82 (502) T ss_pred CccceeEEeecccccc-ccccccCceEEEeeccCccccCCccceEEecCHHHHHHhcCCChHHHHHHHHHhcCCCccceE Confidence 4799999999999655 5567999999998654 3579999999999999999999999999999999999999 Q ss_pred EEEeccchh----------------------------------------------------------------------- Q lcl|Aclame:pro 74 AVITYEDTK----------------------------------------------------------------------- 82 (331) Q Consensus 74 ~v~~~~~~~----------------------------------------------------------------------- 82 (331) +|++|..+. T Consensus 83 ~igR~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~l~i~i~g~~~t~~~i~lS~~ts~~~vA~~i~~~l~~~~~~~t 162 (502) T protein:vir:52 83 IVARWQKSASTIEATKNTLSGATLSDDLERFKSVVNGRFSLTIGGDVKKVDGLSFARLADFNAVATKIQEKLTTLSVAVS 162 (502) T ss_pred EEEeccccccceeechhhhhhhhhHHhHHHhhhhcCceeEEEecceeeeeeccccccccchhHHHHHHHhhhcccccceE Confidence 998774210 Q ss_pred ----------------------------------------------------------------HHHHHHHh--hcCcee Q lcl|Aclame:pro 83 ----------------------------------------------------------------LLEAAEAY--FLKSWH 96 (331) Q Consensus 83 ----------------------------------------------------------------~~~al~~~--~~~~~~ 96 (331) ..+++.++ .+++|| T Consensus 163 v~~d~~~~~F~i~s~ttg~~~~~~~~~a~~~~~~gt~~a~~l~l~~~~~av~v~~~~~g~~aet~~~al~a~~~~~~~w~ 242 (502) T protein:vir:52 163 IAYDETGNRFIVSANVAGEDKKTEIDYAIDEGGEGEYIGALLKLENGQASRKVGKNSVSLKKETLGEALFNVAEVNNTWY 242 (502) T ss_pred EEEecCCceEEEEeccCCCcceeEEEEeecCCcchhHHHHHhccccccceeeeeeecccccccCHHHHHHHHHhccCceE Confidence 00111122 134566 Q ss_pred -EEEEecCCHHHHHHHHHHHHhcCcEEEEEEe---------CChHHHHhhcccceEEEEEeCCCchhHHHHHHHHHhcCC Q lcl|Aclame:pro 97 -FALLAEFKAADALALSNLIEEQKFKFAVFQV---------TAVADITPLAKNTRTIAIVHSKTGEKLDAALIGNVASLP 166 (331) Q Consensus 97 -f~~~~~~~~~~i~alA~w~ea~~~~~~~~~~---------t~~~~~~~~~~~~~t~~~~~~~~~~~~~aa~~g~~~~~~ 166 (331) |++..+.++++++++|+|+|+|+++|+++.. +++...++..+|.|++++||++ ++|++++++|++++.+ T Consensus 243 ~~~~a~~~~~~~~la~a~~iea~~~~f~~~~~d~~~~~~~~~~i~~~l~a~~~~~t~~~y~~~-~~~~~aa~~g~~as~~ 321 (502) T protein:vir:52 243 GFTVAAQLTDSEVEAAAKYAQANTKLFGANVIRAEQIEWSADNIYKKLYDAGLDHTLAMFDKN-DMYPVSSALARLLSTN 321 (502) T ss_pred EEEEeecCChhHHHHHHHHHhhcCcEEEEEecCcceeccccchHHHHHHhccCceeEEEecCC-cchhHHHHHHHHHhcC Confidence 5555667899999999999999999987642 3455667788999999999985 6899999999998764 Q ss_pred ----ccceeeeeeeccCCcCCCCCCHHHHHHHHhCCCeEEEEEcCeeEEecCEEeCCceehhhHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 167 ----VGSATWKGRHGLAGITSEELKVSEIDAIQKAGGMCYIEKAGIAQTSEGKTVSGEFIDSIHGDDWIKATIETRLQKL 242 (331) Q Consensus 167 ----~G~~t~~~k~~l~gv~~~~~t~t~~~~l~~~~~n~y~~~~g~~~~~~G~~~~G~~iD~~~~~dwl~~~iq~~l~~l 242 (331) +|++|||||+ ++||+|++++.+|+++|+++|||||+.+++..++++|++++|+|||++|++|||+++||++|+++ T Consensus 322 f~~~~g~iT~~fk~-l~GV~~~~lt~t~~~al~~~~~N~y~~~~~~~~~~~G~~~~G~~iD~~~~~~Wl~~~lq~~l~~~ 400 (502) T protein:vir:52 322 FAANNSTLTLKFKQ-QPTITADEITATEFAKAKRLGINVYTYFDDVAMIAEGTVIGGKFADEIVILDWFVDAVQKEVFAR 400 (502) T ss_pred CCcCcceeeecccc-cCCcccCcCCHHHHHHHHhcCceEEEEecCeeEEecCeeeCCchhhHHHHHHHHHHHHHHHHHHH Confidence 6899999996 89999999999999999999999999999999999999999999999999999999999999998 Q ss_pred Hh-cCCCCCcCHHHHHHHHHHHHHHHHHHHhcCcccccccCCC------------cceEEEccchhcCCHHHHHhcccCC Q lcl|Aclame:pro 243 LT-ETDKLTFDARGIALLQSELTTVLNEGFANGIIDSNDETGE------------PNFSITALQRSDLNDDDIAKRNYKG 309 (331) Q Consensus 243 ~~-~~~kipyt~~G~~~i~~~v~~vl~~~~~~G~I~~g~~~~~------------~~~~v~~~~~~~~~~~dr~~R~~~~ 309 (331) |. +++|||||+.|+++|+++|+++|+++++||+|+||+++++ +||+|+.|++++++++||++|++|+ T Consensus 401 L~~s~~kIPy~~~G~~~l~a~i~~~l~~a~~~G~I~~G~~~~~~~g~~~~~d~~~~gy~v~~~~~~~~s~~dr~~R~~~~ 480 (502) T protein:vir:52 401 LYKSPTKIPLTDKGQAILIAAVEKVCLEGINNGAFAPGKWTGAGFGNLSTGDYLDKGFYVWAAPMDTLSDSDRQARRATP 480 (502) T ss_pred HHhcCCCcccChhHHHHHHHHHHHHHHHHHhcCccccccccCcccceeeecccccCceEEEeCchhhCCHHHHHcccCCC Confidence 75 4689999999999999999999999999999999999875 4799999999999999999999999 Q ss_pred eEEEEEEcceEEEEEEEEEEeC Q lcl|Aclame:pro 310 LSFRYKRSGAIHSVDVYGEVEV 331 (331) Q Consensus 310 i~~~~~~aGaIh~v~i~~~v~~ 331 (331) ++|+|+++||||+|+|+++|+= T Consensus 481 ~~~~~~~aGaIh~v~i~~nv~~ 502 (502) T protein:vir:52 481 IQTAVKLAGAIHSSDVIVNYNR 502 (502) T ss_pred eEEEEEECceEEEEEEEEEEeC Confidence 9999999999999999888888 No 3 >protein:vir:95263 Length: 450 # NCBI annotation: Phage conserved protein # Family: family:all:396 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944896;genbank:gi:38707836;genbank:GeneID:2744049 Probab=100.00 E-value=3.5e-91 Score=516.47 Aligned_cols=321 Identities=24% Similarity=0.353 Sum_probs=290.3 Q ss_pred CCCceeeEEEEEeecccccccccceeEEEEccCC--cceEEEechhhhccCCCCChHHHHHHHHHHccCCCcceEEEEec Q lcl|Aclame:pro 1 MVETITDVRVHISVLYPSPRIGLGRPAIFVKGTA--MGYKEYTTLEELKDTFADNTEVYAKAKAVFLQKDRPDTVAVITY 78 (331) Q Consensus 1 ~v~~i~dV~v~i~~~~~~~~~~fg~~li~~~~~~--~~~~~yts~~~v~~~f~~~s~~ykaA~~~fsQ~~~p~~v~v~~~ 78 (331) |.|+|++|++++.. .+.++++|+.+||++++.. ++.++|+++++|+++|+.++|+||+|.++|+|.|+|.+++|++| T Consensus 1 ~~s~iVnV~i~~~~-~a~~~~~f~~~l~~~~~~~~~~r~~~yss~~~V~~~FG~~S~ey~aA~~yF~q~p~p~~l~igr~ 79 (450) T protein:vir:95 1 MWNPIVNVDITLNT-AGTTREGFGLPLFLASTDNFEERVRGYTSLTEVAEDFDENTAAYKAAKQLWSQTPKVTQLYIGRR 79 (450) T ss_pred CCCceEEEeecccc-cccccccceeEEEEcCCCCCccceeeecCHHHHHHhcCCCcHHHHHHHHHHhCCCcccEEEEEee Confidence 99999999999985 5677789999999998754 68999999999999999999999999999999999999999877 Q ss_pred cch----------------------------------------------------------------------------- Q lcl|Aclame:pro 79 EDT----------------------------------------------------------------------------- 81 (331) Q Consensus 79 ~~~----------------------------------------------------------------------------- 81 (331) ..+ T Consensus 80 ~~~~t~~~~~~~~~~~~g~lt~tv~G~~~~~~~i~~s~a~s~~~va~~~~tai~~~~~~~~~~~~~s~g~~~~~t~~~~~ 159 (450) T protein:vir:95 80 AMQYTVSIPDAVTESTDYSITVAAGGGISQPYQYTAQSSDTAENVLQQFKTQIEADPTIKDKVSVNVTGSNGSATMIIAK 159 (450) T ss_pred ccchhhhhhhhhccccceeEEEEecceeeeeeEEEEEecCChhhHHHHhhhhhcccceeeeeeeeeeecccceeeeeeec Confidence 421 Q ss_pred ------------------------hHHHHHHHhh--cCceeEEEEecCCHHHHHHHHHHHHhcCcEEEEEEe-------- Q lcl|Aclame:pro 82 ------------------------KLLEAAEAYF--LKSWHFALLAEFKAADALALSNLIEEQKFKFAVFQV-------- 127 (331) Q Consensus 82 ------------------------~~~~al~~~~--~~~~~f~~~~~~~~~~i~alA~w~ea~~~~~~~~~~-------- 127 (331) +..+++.++. +++||++++.+.++++++++|+|+|+|+++|+++.+ T Consensus 160 ~~~~~~~~l~~~~~~~~~~g~~aet~~~a~~a~~~~~~~w~~~~~~~~~~~~i~a~a~w~~a~~~~f~~~~~~~~~~~~~ 239 (450) T protein:vir:95 160 AGDNDFVKVTTTAQTVYIASTTADTASTALAAIEAYSTDWYFIAAEDRTQQFVLAMASEIQARKKIFFTANSDVTALQGT 239 (450) T ss_pred cccchhhccccccceeEecccccccHHHHHHHHHHhhCCeEEEEecCCCHHHHHHHHHHHhhcCcEEEEEcCCchhhhhh Confidence 0122222222 567999999999999999999999999999998653 Q ss_pred -----CChHHHHhhcccceEEEEEeCCC-chhHHHHHHHHHhcCCccceeeeeeeccCCcCCC-------CCCHHHHHHH Q lcl|Aclame:pro 128 -----TAVADITPLAKNTRTIAIVHSKT-GEKLDAALIGNVASLPVGSATWKGRHGLAGITSE-------ELKVSEIDAI 194 (331) Q Consensus 128 -----t~~~~~~~~~~~~~t~~~~~~~~-~~~~~aa~~g~~~~~~~G~~t~~~k~~l~gv~~~-------~~t~t~~~~l 194 (331) +++...++..+|.||+++||+.. .+|++++++|++++.++|++|||||+ ++||+|+ .|+++|+++| T Consensus 240 ~~~~~~~i~~~l~~~~~~~t~~~y~~~~~~~~~~aa~~g~~~~~~~g~~T~~fk~-l~Gv~~~v~~~~~~~lt~~~~~al 318 (450) T protein:vir:95 240 ELASANDVPAQLAKNMYTRTVCLWHHAAAEDYPEMAYIAYGAPYDAGSIAWGNAQ-LTGVAASLQPSNQRPLTSIQKSAL 318 (450) T ss_pred hhhcccchHHHHHhccCCeeEEEeeCCCchhHHHHHHHHHhhhcccceeeecccc-ccceeeeccCccccccchHHHHHH Confidence 33455677788899999999875 46999999999999999999999996 8999996 5899999999 Q ss_pred HhCCCeEEEEEcCeeEEecCEEeCCceehhhHHHHHHHHHHHHHHHHHHhcC--CCCCcCHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 195 QKAGGMCYIEKAGIAQTSEGKTVSGEFIDSIHGDDWIKATIETRLQKLLTET--DKLTFDARGIALLQSELTTVLNEGFA 272 (331) Q Consensus 195 ~~~~~n~y~~~~g~~~~~~G~~~~G~~iD~~~~~dwl~~~iq~~l~~l~~~~--~kipyt~~G~~~i~~~v~~vl~~~~~ 272 (331) +++|||||+.+.+..++++|++++|+|||++||+|||+++||++|++||.++ +|||||+.|+++|+++|+++|+++++ T Consensus 319 ~~~~~n~y~~~~~~~~~~~G~~~~G~~iD~~~~~~wl~~~iq~~l~~ll~~~~~~KiPy~~~G~~~i~a~i~~~l~~a~~ 398 (450) T protein:vir:95 319 DVRHCNFIDLDGGVPVVRRGITSGGEWIDIIRGVDWLESDLKTSLRDLLINQKGGKITYDDTGITRIRQVIETSLQRAVN 398 (450) T ss_pred HhCCcEEEEEecCceeeeCCeeeCcchhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCccChhhHHHHHHHHHHHHHHHHh Confidence 9999999999999999999999999999999999999999999999999765 58999999999999999999999999 Q ss_pred cCcccccccCCCcceEEEccchhcCCHHHHHhcccCCeEEEEEEcceEEEEEEEEEEeC Q lcl|Aclame:pro 273 NGIIDSNDETGEPNFSITALQRSDLNDDDIAKRNYKGLSFRYKRSGAIHSVDVYGEVEV 331 (331) Q Consensus 273 ~G~I~~g~~~~~~~~~v~~~~~~~~~~~dr~~R~~~~i~~~~~~aGaIh~v~i~~~v~~ 331 (331) ||+|+ +|+|+.|++++++++||++|++|+++|+|+|+||||.++|+|+|+- T Consensus 399 ~G~Ia--------~~~V~~~~~~~~~~~dr~~R~~~~i~~~~~laGAIh~~~i~~~v~~ 449 (450) T protein:vir:95 399 RNFLS--------SYTVNVPKASQVALADKKARILKDVTFAGILAGAILDVDLKGTVAY 449 (450) T ss_pred cCccc--------ceeEecCChHhcCHHHHhccCCCCeeEEEEEccceEEEEEEEEEEe Confidence 99996 6899999999999999999999999999999999999999999999 No 4 >protein:vir:3636 Length: 501 # NCBI annotation: gp04 # Family: family:all:396 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705631;genbank:gi:23752316;genbank:GeneID:955753 Probab=100.00 E-value=7.2e-85 Score=481.89 Aligned_cols=324 Identities=15% Similarity=0.166 Sum_probs=274.7 Q ss_pred CCCceeeEEEEEeecccccccccceeEEEEccC---CcceEEEechhhhccCCCCChHHHHHHHHHHc----cCCCcceE Q lcl|Aclame:pro 1 MVETITDVRVHISVLYPSPRIGLGRPAIFVKGT---AMGYKEYTTLEELKDTFADNTEVYAKAKAVFL----QKDRPDTV 73 (331) Q Consensus 1 ~v~~i~dV~v~i~~~~~~~~~~fg~~li~~~~~---~~~~~~yts~~~v~~~f~~~s~~ykaA~~~fs----Q~~~p~~v 73 (331) =+|+|++|++++....+. ..+|+ .|+|+... .++.++|+++++|.++|+.++|||++|.+||+ |.|+|.+| T Consensus 7 p~s~iV~V~~~v~~~~~~-~~~~~-~lllt~~~~~~~~r~~~y~s~~~V~~~FG~~S~ey~aA~~yFs~~~~q~~~P~~l 84 (501) T protein:vir:36 7 PIDQIVQMLPGVIGAGGA-PGRLT-GLVLTQDTSVQPGQLADFFQETDVENWFGALSNEAKIADAYFPGIVNGGQLPYDL 84 (501) T ss_pred ccceEEEEeeeeccCCCc-ceeee-eEEEeccCCCCCcceeeecCHHHHHHhcCCChHHHHHHHHHhhcccCCCccccEE Confidence 378999999999876655 46788 55666543 46899999999999999999999999999998 99999999 Q ss_pred EEEeccch------------------------------------------------------------------------ Q lcl|Aclame:pro 74 AVITYEDT------------------------------------------------------------------------ 81 (331) Q Consensus 74 ~v~~~~~~------------------------------------------------------------------------ 81 (331) +|++|..+ T Consensus 85 ~igR~~~~a~~~~l~g~~l~~~~~a~~~~~sg~l~vti~g~~~~~~i~lS~~ts~~~vA~~i~~al~~~~~tv~~d~~~~ 164 (501) T protein:vir:36 85 KFARYVAADAPASVYGIPLTGVTLAQLQGYSGTLTVTTAAQHVSANISLAAATSFANAATLIEAAFTSPDFVVAYDALRN 164 (501) T ss_pred EEEeecCcCcceeEeccchhhhhhhhccceeEEEEEEecceeeeeecccccccCHHHHHHHHhhhhcCcceEEEEcCcce Confidence 99987521 Q ss_pred ----------------------------------------------hHHHHHHHh--hcCcee-EEEEecCCHHHHHHHH Q lcl|Aclame:pro 82 ----------------------------------------------KLLEAAEAY--FLKSWH-FALLAEFKAADALALS 112 (331) Q Consensus 82 ----------------------------------------------~~~~al~~~--~~~~~~-f~~~~~~~~~~i~alA 112 (331) +..+++.++ .+++|| |.++.++++++++++| T Consensus 165 ~f~i~s~t~G~~~~i~~~t~~~~ia~~l~Lt~~~~a~v~~~g~~~et~~~al~a~~~~s~~Wy~f~~a~~~~~~~~la~A 244 (501) T protein:vir:36 165 RFTVVTNATGTAAAISAVTGTNNFADEIGLSAAAGATLQAAGVAADTPASAMNRAVGLSRNWATFTTAWTAVIADRLAFA 244 (501) T ss_pred eEEEEeccCCcceeeEeeecccchhhhhcccccCcceEEecccccccHHHHHHHHHhccCceEEEEEecCCChHHHHHHH Confidence 001222222 234566 6777889999999999 Q ss_pred HHHHhcCcEEEEEEeCCh------------HHHHhhcccceEEEEEeCCCchhHHHHHHHHHhcC----Cccceeeeeee Q lcl|Aclame:pro 113 NLIEEQKFKFAVFQVTAV------------ADITPLAKNTRTIAIVHSKTGEKLDAALIGNVASL----PVGSATWKGRH 176 (331) Q Consensus 113 ~w~ea~~~~~~~~~~t~~------------~~~~~~~~~~~t~~~~~~~~~~~~~aa~~g~~~~~----~~G~~t~~~k~ 176 (331) +|+|+|+++|+++.++.. ...++..+|.|++++||+. +++++++|++++. .+|++|||||+ T Consensus 245 ~wiea~~~~f~~~~~~~~~~~~~~~~~~~i~~~l~~~~y~~t~~~y~~~---~~~aa~~g~~as~nf~~~~g~~T~~fkq 321 (501) T protein:vir:36 245 SWNSGQAYKYMYVAPDLEAASIVSNNAASFGAQVFAAPYQGTLPLYGDQ---ATAGAVMGYAASINFQLRNGRTVLAFRQ 321 (501) T ss_pred HHHhhcCceEEEEEecCchhhhhccchhhHHHHHHhcCCCcEEEEcCCC---CHHHHHHHHHHhcCcccCcceeeeeccc Confidence 999999999988765443 3345667899999999863 4567888888766 67999999997 Q ss_pred ccCCcCCCCCCHHHHHHHHhCCCeEEEEEcC----eeEEecCEEeCC-ceehhhHHHHHHHHHHHHHHHHHHhcCCCCCc Q lcl|Aclame:pro 177 GLAGITSEELKVSEIDAIQKAGGMCYIEKAG----IAQTSEGKTVSG-EFIDSIHGDDWIKATIETRLQKLLTETDKLTF 251 (331) Q Consensus 177 ~l~gv~~~~~t~t~~~~l~~~~~n~y~~~~g----~~~~~~G~~~~G-~~iD~~~~~dwl~~~iq~~l~~l~~~~~kipy 251 (331) ..+||+|++++.+|+++|+++|+|||+.+.+ ..++++|+++|| +|||+++|+|||+++||+++++||++++|||| T Consensus 322 ~~~Gi~a~~l~~t~a~al~~~~~N~y~~~~~~~~~~~~~~~G~~sG~~~wiD~~~g~dWL~~~iq~~l~~ll~~~~KIPy 401 (501) T protein:vir:36 322 FNAGVPATVHDLPTANALRSNNYTYIGAYANAANNYTIAYDGKLSGKFLWVDTYLDQIYLNAELQRAEFEAMLAYNSLPY 401 (501) T ss_pred cCCCcCcCcCCHHHHHHHHhcCCcEEEEEecccceeeEEEcCeeeccchhhhHHHhHHHHHHHHHHHHHHHHhcCCCCcc Confidence 3389999999999999999999999999875 568999999887 89999999999999999999999999999999 Q ss_pred CHHHHHHHHHHHHHHHHHHHhcCcccccccCC---------------------CcceEEEccchhcCCHHHHHhcccCCe Q lcl|Aclame:pro 252 DARGIALLQSELTTVLNEGFANGIIDSNDETG---------------------EPNFSITALQRSDLNDDDIAKRNYKGL 310 (331) Q Consensus 252 t~~G~~~i~~~v~~vl~~~~~~G~I~~g~~~~---------------------~~~~~v~~~~~~~~~~~dr~~R~~~~i 310 (331) |+.|+++|+++|+++|+++++||+|+||+|++ +.||+++.++++ .++++|++|++|++ T Consensus 402 td~G~~~l~a~i~~~l~~av~nG~I~~Gv~~~~~q~~~I~~~~g~~~~~~~v~~~Gyy~~~~~~~-~~~~~R~~R~~p~~ 480 (501) T protein:vir:36 402 NEDGYTGLYRAGVDVIDAAVTSGIIRAGVTLTNSQLQQIDKAARVAGAGQLVQTRGWYFLIGDPA-NPGQARQNRTTPAC 480 (501) T ss_pred ChhhHHHHHHHHHHHHHHHHhCceeecCCCCCcccceeecccccccccccceeccceEEeeCccc-CChhhhhhcccCcE Confidence 99999999999999999999999999998844 137999988776 46679999999999 Q ss_pred EEEEEEcceEEEEEEEEEEeC Q lcl|Aclame:pro 311 SFRYKRSGAIHSVDVYGEVEV 331 (331) Q Consensus 311 ~~~~~~aGaIh~v~i~~~v~~ 331 (331) +|+|+++||||+|+| ++++| T Consensus 481 ~~~y~~~gaIh~v~i-~s~~v 500 (501) T protein:vir:36 481 TLWYSDGGSIQSLTI-GSNAV 500 (501) T ss_pred EEEEEeCCceeEEEe-eeeee Confidence 999999999999999 77777 No 5 >protein:vir:106730 Length: 501 # NCBI annotation: gp04 # Family: family:all:396 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944312;genbank:gi:38638611;genbank:GeneID:2657359 Probab=100.00 E-value=1.5e-84 Score=480.09 Aligned_cols=324 Identities=14% Similarity=0.167 Sum_probs=274.0 Q ss_pred CCCceeeEEEEEeecccccccccceeEEEEccC---CcceEEEechhhhccCCCCChHHHHHHHHHHc----cCCCcceE Q lcl|Aclame:pro 1 MVETITDVRVHISVLYPSPRIGLGRPAIFVKGT---AMGYKEYTTLEELKDTFADNTEVYAKAKAVFL----QKDRPDTV 73 (331) Q Consensus 1 ~v~~i~dV~v~i~~~~~~~~~~fg~~li~~~~~---~~~~~~yts~~~v~~~f~~~s~~ykaA~~~fs----Q~~~p~~v 73 (331) =+|+|++|++++....+. ..+|+.+ +|+... .++.++|+++++|.++|+.++||||+|.+||+ |.|+|.+| T Consensus 7 p~s~iV~V~~~v~~~~~~-~~~f~~l-ll~~~~~~~~~r~~~y~s~~~V~~~FG~~S~ey~aA~~yFsg~~~q~p~P~~l 84 (501) T protein:vir:10 7 PIDQIVQMLPGVIGAGGA-PGRLTGL-VLTQDTSVQPGQLADFFQKTDVENWFGALSNEAKIADAYFPGIVNGGQLPYDL 84 (501) T ss_pred ccceEEEEeeecccCCCc-ccccceE-EEecccCCCccceeeecCHHHHHHhcCCChHHHHHHHHHhhhhcCCCccccEE Confidence 378999999999865554 5689854 555443 36899999999999999999999999999998 99999999 Q ss_pred EEEeccchh----------------------------------------------------------------------- Q lcl|Aclame:pro 74 AVITYEDTK----------------------------------------------------------------------- 82 (331) Q Consensus 74 ~v~~~~~~~----------------------------------------------------------------------- 82 (331) +|++|..+. T Consensus 85 ~igR~~~~~~~~~l~g~~l~~~~la~~~~~~g~l~i~i~g~~~~~~i~~s~ats~~~vA~~i~~al~~~~~tv~~d~~~~ 164 (501) T protein:vir:10 85 KFARYVAADAPASVYGIPLTGITLAQLQGYSGTLTVTTAAQHVSANISLAAATSFANAATLIEAAFTSPDFVVAYDALRN 164 (501) T ss_pred EEEeecccCccceeeeceehhhhhhhhhheeeEEEEeeccceeeeccccccccCHHHHHHHHHHhhcCCceEEEEecccc Confidence 999876220 Q ss_pred -----------------------------------------------HHHHHHHh--hcCcee-EEEEecCCHHHHHHHH Q lcl|Aclame:pro 83 -----------------------------------------------LLEAAEAY--FLKSWH-FALLAEFKAADALALS 112 (331) Q Consensus 83 -----------------------------------------------~~~al~~~--~~~~~~-f~~~~~~~~~~i~alA 112 (331) ..+++.++ .+++|| |.++.++++++++++| T Consensus 165 ~f~i~~~t~G~~~~i~~~t~~~d~a~~l~Lt~~~~a~v~~~g~~aet~~~Al~a~~~~~~~Wy~f~~a~~~~~~~~la~A 244 (501) T protein:vir:10 165 RFTVVTNTTGTAAAISAVTGTNNLADELGLSAAAGATLQAAGVAADTPASAMNRAVGLSRNWATFTTAWTAVIADRLAFA 244 (501) T ss_pred eEEEEecccCcceeEEEeeccccchhhhcccccCceeEEecCcccccHHHHHHHHHhcccceEEEEEEecCChHHHHHHH Confidence 01122221 234566 6667889999999999 Q ss_pred HHHHhcCcEEEEEEeCC------------hHHHHhhcccceEEEEEeCCCchhHHHHHHHHHhcC----Cccceeeeeee Q lcl|Aclame:pro 113 NLIEEQKFKFAVFQVTA------------VADITPLAKNTRTIAIVHSKTGEKLDAALIGNVASL----PVGSATWKGRH 176 (331) Q Consensus 113 ~w~ea~~~~~~~~~~t~------------~~~~~~~~~~~~t~~~~~~~~~~~~~aa~~g~~~~~----~~G~~t~~~k~ 176 (331) +|+|+|+++|+++.++. +...++..+|.|++++||+. +++++++|++++. .+|++|||||+ T Consensus 245 ~wi~a~~~~f~~~~~~~~~~~~~~~~~~~i~~~l~~~~y~~t~~~y~~~---~~~aa~~g~~as~nf~~~~g~~T~~fkq 321 (501) T protein:vir:10 245 AWNSGQAYKYMYVAPDLEAASIVTNNAASFGAQVFAAPYQGTLPLYGDQ---ATAGAVMGYAASINFQLRNGRTVLAFRQ 321 (501) T ss_pred HHHHhcCceEEEEEecCcceeeecccchhHHHHHHhcCCCceEEECCCC---CHHHHHHHHHHhcCcccCcceeeeeecc Confidence 99999999998775433 33456777899999999863 4677888888776 56999999997 Q ss_pred ccCCcCCCCCCHHHHHHHHhCCCeEEEEEcC----eeEEecCEEeCC-ceehhhHHHHHHHHHHHHHHHHHHhcCCCCCc Q lcl|Aclame:pro 177 GLAGITSEELKVSEIDAIQKAGGMCYIEKAG----IAQTSEGKTVSG-EFIDSIHGDDWIKATIETRLQKLLTETDKLTF 251 (331) Q Consensus 177 ~l~gv~~~~~t~t~~~~l~~~~~n~y~~~~g----~~~~~~G~~~~G-~~iD~~~~~dwl~~~iq~~l~~l~~~~~kipy 251 (331) ..+||+|++++.+|+++|+++|+|||+.+.+ ..++++|+++|| +|||+++|+|||+++||+++++||++++|||| T Consensus 322 l~~Gv~a~~l~~t~a~al~~~~~N~y~~~~~~~~~~~~~~~G~~sG~~~wiD~~~g~dWl~~~iq~~l~~ll~~~~kIPy 401 (501) T protein:vir:10 322 FNAGVPATAHDLPTANALRSNNYTYIGAYANAANNYTIAYDGKLSGKFLWVDTYLDQIYLNAELQRAEFEAMLAYNSLPY 401 (501) T ss_pred cCCCcCcccCCHHHHHHHHhcCCeEEEEEecccceeeEEEcceeeccceehhhHhhHHHHHHHHHHHHHHHHhcCCCccc Confidence 3389999999999999999999999999976 568999999988 89999999999999999999999999999999 Q ss_pred CHHHHHHHHHHHHHHHHHHHhcCcccccccCC---------------------CcceEEEccchhcCCHHHHHhcccCCe Q lcl|Aclame:pro 252 DARGIALLQSELTTVLNEGFANGIIDSNDETG---------------------EPNFSITALQRSDLNDDDIAKRNYKGL 310 (331) Q Consensus 252 t~~G~~~i~~~v~~vl~~~~~~G~I~~g~~~~---------------------~~~~~v~~~~~~~~~~~dr~~R~~~~i 310 (331) |+.|+++|++.|+++|+++++||+|+||++++ +.||+++.++++.+ +++|++|++|++ T Consensus 402 t~~G~~~l~a~i~~~l~~av~nG~Ia~Gv~l~~~q~~~i~~~~g~~~~~~~v~~~Gyy~~~~~~~~~-~~~R~~R~~p~~ 480 (501) T protein:vir:10 402 NEDGYTANYRAGVDVIDAAVTSGIIRAGVTLTNSQLQQIDAVAGVAGAGALVQTRGWYFLIGNPANP-GQARQNRTSPAC 480 (501) T ss_pred CHHHHHHHHHHHHHHHHHHHhCcceecCcccCcccceeecccccccccccceeccceEEeeCcccCC-hhhhhhcccCce Confidence 99999999999999999999999999997532 24799999888764 467999999999 Q ss_pred EEEEEEcceEEEEEEEEEEeC Q lcl|Aclame:pro 311 SFRYKRSGAIHSVDVYGEVEV 331 (331) Q Consensus 311 ~~~~~~aGaIh~v~i~~~v~~ 331 (331) +|+|+++||||+|+| +.++| T Consensus 481 ~~~y~~~gaIh~v~i-~s~~v 500 (501) T protein:vir:10 481 TLWYSDGGSIQELTI-GSNAV 500 (501) T ss_pred EEEEEeCCceeEEEe-eeeec Confidence 999999999999999 77777 No 6 >protein:vir:78611 Length: 501 # NCBI annotation: BcepNY3gp03 # Family: family:all:396 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294840;genbank:gi:149882903;genbank:GeneID:5291082 Probab=100.00 E-value=5.6e-84 Score=477.02 Aligned_cols=324 Identities=15% Similarity=0.177 Sum_probs=274.9 Q ss_pred CCCceeeEEEEEeecccccccccceeEEEEccC---CcceEEEechhhhccCCCCChHHHHHHHHHHc----cCCCcceE Q lcl|Aclame:pro 1 MVETITDVRVHISVLYPSPRIGLGRPAIFVKGT---AMGYKEYTTLEELKDTFADNTEVYAKAKAVFL----QKDRPDTV 73 (331) Q Consensus 1 ~v~~i~dV~v~i~~~~~~~~~~fg~~li~~~~~---~~~~~~yts~~~v~~~f~~~s~~ykaA~~~fs----Q~~~p~~v 73 (331) =+|+|++|++++....+. ..+|+ .|+|+... .++.++|+++++|.++|+.++|||++|.++|+ |.|+|.+| T Consensus 7 p~s~iV~V~~~v~~~~~~-~~~~~-~lll~~~~~~~~~r~~~y~s~~~V~~~FG~~S~ey~aA~~yFs~~~~q~~~P~~l 84 (501) T protein:vir:78 7 PIDQIVQMLPGVIGAGGA-PGRLT-GLVLTQDTSIQPGQLADFFQKTDVENWFGGLSNEAVIADAYFPGIVNGGQLPYDL 84 (501) T ss_pred ccceEEEEeeecccCCCc-ceeee-eEEEecCCCCCccceeeecCHHHHHHhcCCChHHHHHHHHHhhcCCCCCcccceE Confidence 378999999999866555 46798 45666443 46899999999999999999999999999999 99999999 Q ss_pred EEEeccch------------------------------------------------------------------------ Q lcl|Aclame:pro 74 AVITYEDT------------------------------------------------------------------------ 81 (331) Q Consensus 74 ~v~~~~~~------------------------------------------------------------------------ 81 (331) +|++|..+ T Consensus 85 ~igR~~~~a~~~~l~g~~l~~~~la~~~~~~G~l~iti~g~~~~~~i~~S~~ts~~~vA~~i~~al~a~~~tv~~ds~~~ 164 (501) T protein:vir:78 85 KFARYVAADAPASVYGIPLTGVTLTQLQGYSGTLTVTTAAQHVSSNISLAAATSFANAATLIEAAFTSPDFVVSYDALRN 164 (501) T ss_pred EEEeecccCcceeEeccceeccchhhhceeeeEEEEEeccceeeeccccccccCHHHHHHHHHhhhcCcceEEEEccccc Confidence 99986521 Q ss_pred ----------------------------------------------hHHHHHHHh--hcCcee-EEEEecCCHHHHHHHH Q lcl|Aclame:pro 82 ----------------------------------------------KLLEAAEAY--FLKSWH-FALLAEFKAADALALS 112 (331) Q Consensus 82 ----------------------------------------------~~~~al~~~--~~~~~~-f~~~~~~~~~~i~alA 112 (331) +..+++.++ .+++|| |.+++++++++++++| T Consensus 165 ~f~its~t~G~~~~i~~~t~~~~~a~~l~Lt~~~~a~v~~~g~~aet~~~a~~a~~~~~~~Wy~f~~a~~~~~~~~lalA 244 (501) T protein:vir:78 165 RFVVNTNATGTAAAISAVTGTNNLADELGLSAAAGASLQAAGVAADTPASAMNRAVGLSRNWATFTTAWTAVIADRLALA 244 (501) T ss_pred eEEEEeeecCCceeEEEEecccchhhhhcccccCceeeEeccccccCHHHHHHHHHhccCceEEEEEecCCCHHHHHHHH Confidence 001111121 134555 6677889999999999 Q ss_pred HHHHhcCcEEEEEEeCC------------hHHHHhhcccceEEEEEeCCCchhHHHHHHHHHhcC----Cccceeeeeee Q lcl|Aclame:pro 113 NLIEEQKFKFAVFQVTA------------VADITPLAKNTRTIAIVHSKTGEKLDAALIGNVASL----PVGSATWKGRH 176 (331) Q Consensus 113 ~w~ea~~~~~~~~~~t~------------~~~~~~~~~~~~t~~~~~~~~~~~~~aa~~g~~~~~----~~G~~t~~~k~ 176 (331) +|+|+|+++|+++.++. +...++..+|.||+++||+ ++++++++|++++. .+|++|||||+ T Consensus 245 ~wiea~~~~f~~~~~~~~~~~~~~~~~~~i~~~l~a~~y~~t~~~y~~---~~~~aa~~g~~as~nf~~~~g~~T~~fkq 321 (501) T protein:vir:78 245 SWNSGQAYKYMYVAPDLEPASIVTNNSASFGAQVFAAPYQGTLPLYGD---QATAGAVMGYAASINFQLRNGRTVLAFRQ 321 (501) T ss_pred HHHHhcCceEEEEEecCCcceeecccchhHHHHHhhcCCCceEEEcCC---cchHHHHHHHHHhcCcccCcceeeeeccc Confidence 99999999998775433 3345667789999999984 56788999999766 57999999997 Q ss_pred ccCCcCCCCCCHHHHHHHHhCCCeEEEEEcC----eeEEecCEEeCC-ceehhhHHHHHHHHHHHHHHHHHHhcCCCCCc Q lcl|Aclame:pro 177 GLAGITSEELKVSEIDAIQKAGGMCYIEKAG----IAQTSEGKTVSG-EFIDSIHGDDWIKATIETRLQKLLTETDKLTF 251 (331) Q Consensus 177 ~l~gv~~~~~t~t~~~~l~~~~~n~y~~~~g----~~~~~~G~~~~G-~~iD~~~~~dwl~~~iq~~l~~l~~~~~kipy 251 (331) ..+||+|++++++|+++|+++|+|||+.+.+ ..++++|+++|+ +|||.++|+|||+++||+++++||++++|||| T Consensus 322 ~~~Gv~a~~l~~t~a~al~~~~~N~y~~~~~~~~~~~~~~~G~~sG~~~wiD~~~~~~Wl~~~iq~~l~~ll~~~~kIPy 401 (501) T protein:vir:78 322 FNAGVPATAHDLGTANALRSNNYTYIGAYANAANNYTIAYDGKLSGKFLWVDTYLDQIYLNAELQRAEFEAMLAYNSLPY 401 (501) T ss_pred cCCCcCcccCCHHHHHHHHhcCCeEEEEEecccceeeEEEcCeeeccceeehhhhhHHHHHHHHHHHHHHHHHhCCCccc Confidence 3389999999999999999999999999976 468999999887 79999999999999999999999999999999 Q ss_pred CHHHHHHHHHHHHHHHHHHHhcCcccccccCC---------------------CcceEEEccchhcCCHHHHHhcccCCe Q lcl|Aclame:pro 252 DARGIALLQSELTTVLNEGFANGIIDSNDETG---------------------EPNFSITALQRSDLNDDDIAKRNYKGL 310 (331) Q Consensus 252 t~~G~~~i~~~v~~vl~~~~~~G~I~~g~~~~---------------------~~~~~v~~~~~~~~~~~dr~~R~~~~i 310 (331) |+.|+++|+++|+++|+++++||+|+||+|++ +.||+++.++++. ++++|++|++|++ T Consensus 402 t~~G~~~l~a~v~~~l~~av~nG~I~~Gv~l~~~q~~~I~~~~g~~~~~~~~~~~Gyy~~~~~~~~-~~~~R~~R~~p~~ 480 (501) T protein:vir:78 402 NEDGYTALYRAGVDVIDAAVTSGIIRAGVTLTNSQLQQIDAAAGVAGAGQLVQMRGWYFLIGDPAN-PGQARQNRTTPTC 480 (501) T ss_pred CHHHHHHHHHHHHHHHHHHHhCceeecCCCCCCccceeeccccCccccccceeccceEEeeccccC-ChhhhhhcccCcE Confidence 99999999999999999999999999998843 2379999988876 4467999999999 Q ss_pred EEEEEEcceEEEEEEEEEEeC Q lcl|Aclame:pro 311 SFRYKRSGAIHSVDVYGEVEV 331 (331) Q Consensus 311 ~~~~~~aGaIh~v~i~~~v~~ 331 (331) +|+|+++||||+|+| +.++| T Consensus 481 ~~~y~~~gaIh~v~i-~s~~v 500 (501) T protein:vir:78 481 TLWYSDGGSIQELTI-GSNAV 500 (501) T ss_pred EEEEEeCCceeEEEe-eeeec Confidence 999999999999999 77777 No 7 >protein:vir:101576 Length: 501 # NCBI annotation: gp04 # Family: family:all:396 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958108;genbank:gi:41057654;genbank:GeneID:2716834 Probab=100.00 E-value=1.8e-83 Score=474.20 Aligned_cols=324 Identities=15% Similarity=0.175 Sum_probs=273.5 Q ss_pred CCCceeeEEEEEeecccccccccceeEEEEccCC---cceEEEechhhhccCCCCChHHHHHHHHHHc----cCCCcceE Q lcl|Aclame:pro 1 MVETITDVRVHISVLYPSPRIGLGRPAIFVKGTA---MGYKEYTTLEELKDTFADNTEVYAKAKAVFL----QKDRPDTV 73 (331) Q Consensus 1 ~v~~i~dV~v~i~~~~~~~~~~fg~~li~~~~~~---~~~~~yts~~~v~~~f~~~s~~ykaA~~~fs----Q~~~p~~v 73 (331) =+|+|++|++++....+. +.+|+ .|+|+.... .+..+|+|+++|.++|+.++||||+|.++|+ |.|+|.+| T Consensus 7 p~s~iV~V~~~v~~~~~~-~~~~~-~l~l~~~~~~~~~~~~~~~s~~~V~~~FG~~S~ey~aA~~yFsg~~~q~p~P~~l 84 (501) T protein:vir:10 7 PIDQIVQMLPGVIGAGGA-PGRLT-GLVLTQDTSVQPGQLADFFQETDVENWFGALSNEAKIADAYFPGIVNGGQLPYDL 84 (501) T ss_pred ccceEEEEeeecccCCCc-cccce-eEEEeccCCCCccceEEecCHHHHHHhcCCChHHHHHHHHHhhhhcCCCccccEE Confidence 378999999999865554 56787 557776544 4567799999999999999999999999999 99999999 Q ss_pred EEEeccchh----------------------------------------------------------------------- Q lcl|Aclame:pro 74 AVITYEDTK----------------------------------------------------------------------- 82 (331) Q Consensus 74 ~v~~~~~~~----------------------------------------------------------------------- 82 (331) +|++|..+. T Consensus 85 ~igR~~~~~~~~~l~g~~l~~~~la~~~~~sg~l~vti~g~~~~~~i~ls~ats~~~vAs~i~~al~~~~~tv~~d~~~~ 164 (501) T protein:vir:10 85 KFARYVAADAPASVYGIPLTGVTLAQLQGYSGTLTVTTAAQHVSANISLAAATSFANAATLIEAAFTSPDFVVAYDALRN 164 (501) T ss_pred EEEeecCCCccceEeccchhhhhhhhcceeeeEEEEeeccceeecccccccccCHHHHHHHHhhhccCCceEEEEcccCc Confidence 999875220 Q ss_pred -----------------------------------------------HHHHHHHh--hcCcee-EEEEecCCHHHHHHHH Q lcl|Aclame:pro 83 -----------------------------------------------LLEAAEAY--FLKSWH-FALLAEFKAADALALS 112 (331) Q Consensus 83 -----------------------------------------------~~~al~~~--~~~~~~-f~~~~~~~~~~i~alA 112 (331) ..+++.++ .+++|| |.+++++++++++++| T Consensus 165 ~f~its~ttG~~~~i~~~~~~~~la~~l~Lt~~~~a~v~~~g~~aet~~~a~~a~~~~~~~Wy~f~~a~~~~~~~~la~A 244 (501) T protein:vir:10 165 RFTVVTNATGTAAAISAVTGTNNLADELGLSAAAGATLQAAGVAADTPASAMNRAVGLSRNWATFTTAWTAVIADRLAFA 244 (501) T ss_pred eEEEEeeccCCceeEEEeeCchhhhhhcCccccccceEEecCcccccHHHHHHHHHhccCceEEEEEecCCChHHHHHHH Confidence 01111121 234566 6677889999999999 Q ss_pred HHHHhcCcEEEEEEeCCh------------HHHHhhcccceEEEEEeCCCchhHHHHHHHHHhcC----Cccceeeeeee Q lcl|Aclame:pro 113 NLIEEQKFKFAVFQVTAV------------ADITPLAKNTRTIAIVHSKTGEKLDAALIGNVASL----PVGSATWKGRH 176 (331) Q Consensus 113 ~w~ea~~~~~~~~~~t~~------------~~~~~~~~~~~t~~~~~~~~~~~~~aa~~g~~~~~----~~G~~t~~~k~ 176 (331) +|+|+|+++|+++.++.. ...++..+|.|++++||+ ++++++++|++++. .+|++|||||+ T Consensus 245 ~wiea~~~~f~~~~~~~~~~~~~~~~~~~i~~~l~~~~y~~t~~~y~~---~~~~aa~~g~~as~nf~~~~g~~T~~fkq 321 (501) T protein:vir:10 245 AWNSGQAYKYMYVAPDLEAASIVTNNAASFGAQVFAAPYQGTLPLYGD---QATAGAVMGYAASINFQLRNGRTVLAFRQ 321 (501) T ss_pred HHHHhcCceEEEEEecCchhhhhhhhhhhHHHHHHhcCCCceEEECCC---CcHHHHHHHHHHhhCcccCccceeeeccc Confidence 999999999988765443 334566789999999985 55778889998766 57999999997 Q ss_pred ccCCcCCCCCCHHHHHHHHhCCCeEEEEEcCe----eEEecCEEeCC-ceehhhHHHHHHHHHHHHHHHHHHhcCCCCCc Q lcl|Aclame:pro 177 GLAGITSEELKVSEIDAIQKAGGMCYIEKAGI----AQTSEGKTVSG-EFIDSIHGDDWIKATIETRLQKLLTETDKLTF 251 (331) Q Consensus 177 ~l~gv~~~~~t~t~~~~l~~~~~n~y~~~~g~----~~~~~G~~~~G-~~iD~~~~~dwl~~~iq~~l~~l~~~~~kipy 251 (331) ..+||+|++++++|+++|+++|||||+.+++. .++++|+++|+ +|||.++|+|||+++||+++++||.+++|||| T Consensus 322 ~~~Gi~a~~lt~t~a~al~~~~~N~y~~~~~~~~~~~~~~~G~~sG~~~wiD~~~~~~Wl~~~iq~~l~~ll~~~~kIPy 401 (501) T protein:vir:10 322 FNAGVPATAHDLPTANALRSNNYTYIGAYANAANNYTIAYDGKLSGKFLWVDTYLDQIYLNAELQRAEFEAMLAYNSLPY 401 (501) T ss_pred cCCCcCcccCCHHHHHHHHhcCCeEEEEeccccceeeEEecCeeeccceeehhhhhHHHHHHHHHHHHHHHHHhcCCccc Confidence 32499999999999999999999999999763 58899999988 79999999999999999999999999999999 Q ss_pred CHHHHHHHHHHHHHHHHHHHhcCcccccccCC---------------------CcceEEEccchhcCCHHHHHhcccCCe Q lcl|Aclame:pro 252 DARGIALLQSELTTVLNEGFANGIIDSNDETG---------------------EPNFSITALQRSDLNDDDIAKRNYKGL 310 (331) Q Consensus 252 t~~G~~~i~~~v~~vl~~~~~~G~I~~g~~~~---------------------~~~~~v~~~~~~~~~~~dr~~R~~~~i 310 (331) |+.|+++|+++|+++|+++++||+|+||+|++ +.||+++.++++. ++++|++|++|++ T Consensus 402 t~~G~~~l~a~v~~~l~~av~nG~I~~Gv~l~~~q~~~i~~~~g~~~~~~~v~~~Gyy~~~~~~~~-~~~~R~~R~~p~~ 480 (501) T protein:vir:10 402 NEDGYTALYRAGVDVIDAAVTSGIIRAGVTLTNSQLQQIDAAAGVAGAGQLVQTRGWYFLIGDPAN-PGQARQNRTTPAC 480 (501) T ss_pred CHHHHHHHHHHHHHHHHHHHhCceeecCCCCCcccceeeccccCccccccceeccceeEeeccccC-Chhhhhhccccce Confidence 99999999999999999999999999998843 2379999887774 6679999999999 Q ss_pred EEEEEEcceEEEEEEEEEEeC Q lcl|Aclame:pro 311 SFRYKRSGAIHSVDVYGEVEV 331 (331) Q Consensus 311 ~~~~~~aGaIh~v~i~~~v~~ 331 (331) +|+|+++||||+|+| +.++| T Consensus 481 ~~~y~~~gaIh~v~i-~s~~v 500 (501) T protein:vir:10 481 TLWYSDGGSIQQLTI-GSNAV 500 (501) T ss_pred EEEEEeCCceeEEEe-eeeec Confidence 999999999999999 77777 No 8 >protein:vir:99586 Length: 507 # NCBI annotation: hypothetical protein # Family: family:all:396 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039795;genbank:gi:126011045;genbank:GeneID:4818281 Probab=100.00 E-value=1.3e-83 Score=475.05 Aligned_cols=327 Identities=12% Similarity=0.103 Sum_probs=272.3 Q ss_pred CCCceeeEEEEEeecccccccccceeEEEEccCC---cceEEEechhhhccCCCCChHHHHHHHHHHccCC----CcceE Q lcl|Aclame:pro 1 MVETITDVRVHISVLYPSPRIGLGRPAIFVKGTA---MGYKEYTTLEELKDTFADNTEVYAKAKAVFLQKD----RPDTV 73 (331) Q Consensus 1 ~v~~i~dV~v~i~~~~~~~~~~fg~~li~~~~~~---~~~~~yts~~~v~~~f~~~s~~ykaA~~~fsQ~~----~p~~v 73 (331) =+|+|++|++++...+ ..+.+|+.+|||+.++. ++.++|+++++|.++|+.++||||+|.+||+|.| +|.+| T Consensus 3 p~s~iVnV~~~v~~~a-~~~~~~~~~lilt~~~~~~~~r~~~y~s~~~V~~~FG~~S~ey~aA~~yFsq~p~~~~~P~~L 81 (507) T protein:vir:99 3 SQSRYVRIVSGVGAGA-PVAQRRLIMRVMTTNAVLPPGVVFESSSADAVGAYFGMASEEYKRAKAYMSFISKSINSPSYI 81 (507) T ss_pred CccceeEEeeeccccC-cccccccceeeeccccCCCccceEeecCHHHHHHhcCCChHHHHHHHHHhccCCCCCcccceE Confidence 3578888888887544 34568999999987654 6789999999999999999999999999999999 79999 Q ss_pred EEEeccchh----------------------------------------------------------------------- Q lcl|Aclame:pro 74 AVITYEDTK----------------------------------------------------------------------- 82 (331) Q Consensus 74 ~v~~~~~~~----------------------------------------------------------------------- 82 (331) +|++|..+. T Consensus 82 ~igR~~~~~~~a~l~g~~~~~~l~~~~~~~~G~lti~v~G~~~t~~~i~lS~~ts~~~vAs~i~~~l~a~~~~~~~~~tv 161 (507) T protein:vir:99 82 SFARWVNAAIASMIVGDSLVKNLPALKAVATPTLSLSIGGTVVPIAGIDLTAALTLTDVAATLQTKIRASANAELATATV 161 (507) T ss_pred EEEeecCccccceeecchhhhhHHHHhhhcceeEEEEEcCceeEeccccccccCCHHHHHHHHHHhhhccccccccceEE Confidence 999885210 Q ss_pred -------------------------------------------------------HHHHHHHh--hcCceeEEE-Ee--c Q lcl|Aclame:pro 83 -------------------------------------------------------LLEAAEAY--FLKSWHFAL-LA--E 102 (331) Q Consensus 83 -------------------------------------------------------~~~al~~~--~~~~~~f~~-~~--~ 102 (331) ..+++.++ .+++||+++ .. + T Consensus 162 ~~d~~~~~F~v~s~~tG~~s~i~~at~~~~gt~~s~l~~~~~~~a~~~~g~~aet~~~a~~a~~~~~~nW~~~~~a~~~~ 241 (507) T protein:vir:99 162 TFNTTTNQFVLNGTTTGALAPTITAVRTDPATDISSLLGWTNTGTVFVKGQAAETPDTSISKSAAISTNFGSFIYTSTPA 241 (507) T ss_pred EEecCCceEEEEeeeccccceeEEEEcCCchhhHHHHhccccccceEeecccccCHHHHHHHHHhhcCCeEEEEEEeccc Confidence 00111111 245666544 33 3 Q ss_pred CCHHHHHHHHHHHHhcCcEEEEEEeCChHHHHhhcc-------cceEEEEEeCCCchhHHHHHHHHHhcC----Ccccee Q lcl|Aclame:pro 103 FKAADALALSNLIEEQKFKFAVFQVTAVADITPLAK-------NTRTIAIVHSKTGEKLDAALIGNVASL----PVGSAT 171 (331) Q Consensus 103 ~~~~~i~alA~w~ea~~~~~~~~~~t~~~~~~~~~~-------~~~t~~~~~~~~~~~~~aa~~g~~~~~----~~G~~t 171 (331) +++++++++|+|+|+|+++|++..+++.+....... ..++. ..+..+.+|++++++|++++. .+|++| T Consensus 242 ~td~~~lalA~wiea~~~~f~~~~~~~~a~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~aa~~g~~as~nf~~~ng~~T 320 (507) T protein:vir:99 242 LTNDQITAVASWNASQNNMYMYSVPTTIANIGTLYAAVKGFSGCALNI-TSDSLPVDYIEQSPCEILAATDYTRVNATQN 320 (507) T ss_pred cChHHHHHHHHHHhhcCcEEEEEEecCchhhhhhhhhhhhcceeEEEe-ecccccchhHHHHHHHHHHhhccCcCcccee Confidence 588999999999999999999887776554332221 11222 123445678999999999764 679999 Q ss_pred eeeeeccCCcCCCCCCHHHHHHHHhCCCeEEEEEcC----eeEEecCEEeCCc----eehhhHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 172 WKGRHGLAGITSEELKVSEIDAIQKAGGMCYIEKAG----IAQTSEGKTVSGE----FIDSIHGDDWIKATIETRLQKLL 243 (331) Q Consensus 172 ~~~k~~l~gv~~~~~t~t~~~~l~~~~~n~y~~~~g----~~~~~~G~~~~G~----~iD~~~~~dwl~~~iq~~l~~l~ 243 (331) ||||+ ++||+|++++++|+++|+++|||||+.+.| ..++++|++++|+ |||.++++|||+++||++|++|| T Consensus 321 ~~fk~-l~GV~a~~lt~t~a~al~~~n~N~y~~~a~~~~~~~~~~~G~~~gG~~~fid~d~~~~~~WL~~~iq~~l~~l~ 399 (507) T protein:vir:99 321 YMYYQ-FPSRNITVSDDTTANLVDANRGNYIGQTQSAGQSLAFYQRGILCGGPNDAVDMNIYANEIWLKSAISAQILSLF 399 (507) T ss_pred ecccc-cCCcccccCCHHHHHHHHhcCCeEEEEeccccceeeEEecCeeeCCcccceeeeeecchHHHHHHHHHHHHHHH Confidence 99996 899999999999999999999999999976 5689999999995 67789999999999999999999 Q ss_pred hcCCCCCcCHHHHHHHHHHHHHHHHHHHhcCcccccc---------cCC------------CcceEEEccchhcCCHHHH Q lcl|Aclame:pro 244 TETDKLTFDARGIALLQSELTTVLNEGFANGIIDSND---------ETG------------EPNFSITALQRSDLNDDDI 302 (331) Q Consensus 244 ~~~~kipyt~~G~~~i~~~v~~vl~~~~~~G~I~~g~---------~~~------------~~~~~v~~~~~~~~~~~dr 302 (331) .+++|||||+.|+++|+++|+++|+++++||+|+||+ +++ ..||+++.|+++.+++++| T Consensus 400 ~~~~kIPyt~~G~~~l~a~i~~~l~~av~nG~I~~Gvtl~~~q~~~in~~~~~~~~~~~~~~~Gyy~~~~~~s~~~~~~r 479 (507) T protein:vir:99 400 LNVPRVPANETGESMLLSVIQSVVNTAKNNGTISAGKNLNVIQQQYITQISGDANAWRQVANIGYWLNITFSSYTNPNTQ 479 (507) T ss_pred hcCCCCccChhhHHHHHHHHHHHHHHHHhccccccCCcccccchheecccccccccccceeccceEEEeCChHhcChhhh Confidence 9999999999999999999999999999999999997 333 2479999999999999999 Q ss_pred HhcccCCeEEEEEEcceEEEEEEEEEEe Q lcl|Aclame:pro 303 AKRNYKGLSFRYKRSGAIHSVDVYGEVE 330 (331) Q Consensus 303 ~~R~~~~i~~~~~~aGaIh~v~i~~~v~ 330 (331) ++|++|+++|+|+++|+||+|+|++++= T Consensus 480 ~~r~~~~~~~~y~~~gaI~~v~~~~~~v 507 (507) T protein:vir:99 480 LTEWKASYQLIYSKDDAIRFVEGTDTLI 507 (507) T ss_pred hccccceEEEEEEeCCeEEEEEeeeecC Confidence 9999999999999999999999988876 No 9 >protein:vir:96104 Length: 504 # NCBI annotation: hypothetical protein ORF029 # Family: family:all:396 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294446;genbank:gi:149408343;genbank:GeneID:5237223 Probab=100.00 E-value=4.9e-83 Score=471.82 Aligned_cols=328 Identities=14% Similarity=0.126 Sum_probs=273.6 Q ss_pred CCCceeeEEEEEeecccccccccceeEEEEccCC---cceEEEechhhhccCCCCChHHHHHHHHHHccCC----CcceE Q lcl|Aclame:pro 1 MVETITDVRVHISVLYPSPRIGLGRPAIFVKGTA---MGYKEYTTLEELKDTFADNTEVYAKAKAVFLQKD----RPDTV 73 (331) Q Consensus 1 ~v~~i~dV~v~i~~~~~~~~~~fg~~li~~~~~~---~~~~~yts~~~v~~~f~~~s~~ykaA~~~fsQ~~----~p~~v 73 (331) =+|+|++|++++... +....+|+.+|||+.++. ++.++|+++++|.++|+.++||||+|.+||+|.| +|.+| T Consensus 3 p~s~iV~V~~~v~~~-~~~~~~~~~~l~l~~~~~~~~~r~~~y~s~~~V~~~FG~~S~ey~aA~~yF~~~~~~~~~P~~l 81 (504) T protein:vir:96 3 SQSRYIRIISGVGAG-APVAGRKLILRVMTTNNVIPPGIVIEFDNANAVLSYFGAQSEEYQRAAAYFKFISKSVNSPSSI 81 (504) T ss_pred CccceeEeeeccccc-ccccccccceeEeecccCCCccceEEecCHHHHHHhcCCChHHHHHHHHHhhcCCCCCccccEE Confidence 357888888888754 344568999999998864 6789999999999999999999999999999987 99999 Q ss_pred EEEeccchh----------------------------------------------------------------------- Q lcl|Aclame:pro 74 AVITYEDTK----------------------------------------------------------------------- 82 (331) Q Consensus 74 ~v~~~~~~~----------------------------------------------------------------------- 82 (331) +|++|..+. T Consensus 82 ~igR~~~~a~~~~l~g~~~~~~~~~~~~i~~G~lsitv~G~~~~~~~i~~S~~ts~~~vA~~i~~al~~~~~~~~~~~tv 161 (504) T protein:vir:96 82 SFARWVNTAIAPMVVGDNLPKTIADFAGFSAGVLTIMVGAAEKNITAIDTSAATSMDNVASIIQTEIRKNTDPQLAQATV 161 (504) T ss_pred EEEeecCcCccceEEechhHHHHHHHhhhhceEEEEEEcceeeeecccccccccchHHHHHHHHhhhhcccccccccceE Confidence 999875210 Q ss_pred -----------------------------------------------------HHHHHHHh--hcCceeEEEE-ec-CCH Q lcl|Aclame:pro 83 -----------------------------------------------------LLEAAEAY--FLKSWHFALL-AE-FKA 105 (331) Q Consensus 83 -----------------------------------------------------~~~al~~~--~~~~~~f~~~-~~-~~~ 105 (331) ..+++.++ .+++||+++. .+ .++ T Consensus 162 ~~d~~~~~f~its~~tg~~~~~~~~~a~~~~~~~~lgl~~~~~~~v~g~~aet~~~al~al~~~~~~Wy~f~~a~~~~~d 241 (504) T protein:vir:96 162 TWNPNTNQFTLVGATIGTGVLAVAKSADPQDMSTALGWSTSNVVNVAGQAADLPDAAVAKSTNVSNNFGSFLFAGATLDN 241 (504) T ss_pred EEeccCCeEEEEeeccccceeEEEeeccccchhhhhhcccccceEEeecccccHHHHHHHHHhhcCCeEEEEEEeccCCH Confidence 00111111 2346775544 33 678 Q ss_pred HHHHHHHHHHHhcCcEEEEEEeCChHH-----HHhhcccceEEEEEeCC-CchhHHHHHHHHHh----cCCccceeeeee Q lcl|Aclame:pro 106 ADALALSNLIEEQKFKFAVFQVTAVAD-----ITPLAKNTRTIAIVHSK-TGEKLDAALIGNVA----SLPVGSATWKGR 175 (331) Q Consensus 106 ~~i~alA~w~ea~~~~~~~~~~t~~~~-----~~~~~~~~~t~~~~~~~-~~~~~~aa~~g~~~----~~~~G~~t~~~k 175 (331) ++++++|+|+|+|+++|++..++..+. .....++.+++.++|.. ..+++++..+++++ +..+|++||||| T Consensus 242 d~ilalA~w~ea~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~as~~f~~~ng~~T~~fk 321 (504) T protein:vir:96 242 DQIKAVSAWNAAQNNQFIYTVATSLANLGALFDLVKGNSGTALNVLSATASNDFVEQCPSEILAATNYDEPGASQNYMYY 321 (504) T ss_pred HHHHHHHHHHhhcCceEEEEEeecccchhhHHHhhhhcceeEEEEeecCccchhHHHHHHHHHHhcCcCccccccccccc Confidence 999999999999999998776644322 22334556676766654 44677766666664 556899999999 Q ss_pred eccCCcCCCCCCHHHHHHHHhCCCeEEEEEcCe----eEEecCEEeCCc----eehhhHHHHHHHHHHHHHHHHHHhcCC Q lcl|Aclame:pro 176 HGLAGITSEELKVSEIDAIQKAGGMCYIEKAGI----AQTSEGKTVSGE----FIDSIHGDDWIKATIETRLQKLLTETD 247 (331) Q Consensus 176 ~~l~gv~~~~~t~t~~~~l~~~~~n~y~~~~g~----~~~~~G~~~~G~----~iD~~~~~dwl~~~iq~~l~~l~~~~~ 247 (331) + ++||+|++++++|+++|+++|||||+.+.+. .++++|++++|+ |||+++++|||+++||++|++||.+++ T Consensus 322 ~-l~GVta~~lt~t~~~aL~~~~~N~y~~~a~~~~~~~~~~~G~~~gG~~~~~wiDv~~~~~WL~~~lq~~l~~l~~~~~ 400 (504) T protein:vir:96 322 Q-FPGRNITVSDDTAANTVDKSRGNYIGVTQANGQQLAFYQRGILCGGPTDAVDMNVYANEIWLKSAIAQALLDLFLNVN 400 (504) T ss_pred c-cCCcCcccCCHHHHHHHHhcCCeEEEEeecccceeeEEecCeeeCCccccchhhhhhhHHHHHHHHHHHHHHHHhcCC Confidence 6 8999999999999999999999999999763 588999999997 799999999999999999999999999 Q ss_pred CCCcCHHHHHHHHHHHHHHHHHHHhcCcccccccCC---------------------CcceEEEccchhcCCHHHHHhcc Q lcl|Aclame:pro 248 KLTFDARGIALLQSELTTVLNEGFANGIIDSNDETG---------------------EPNFSITALQRSDLNDDDIAKRN 306 (331) Q Consensus 248 kipyt~~G~~~i~~~v~~vl~~~~~~G~I~~g~~~~---------------------~~~~~v~~~~~~~~~~~dr~~R~ 306 (331) |||||+.|+++|+++|+++|+++++||+|+||+|++ +.||+++.|++++++++||++|+ T Consensus 401 kIPyt~~Gi~~l~a~i~~vl~~av~~G~I~~Gv~~~~~q~~~I~~~~~~d~~~~~~~~~GYyv~~~~~s~~s~~~r~~R~ 480 (504) T protein:vir:96 401 AVPASMVGEAMTLAVLQPVLDKATSNGTFTYGKDISAVQQQYITQITGDRRAWRQVQTLGYWINITFSSYTNSNTGLTEW 480 (504) T ss_pred CcccCHhhHHHHHHHHHHHHHHHHhcceeccCccCCccchheecccccccccccceeccceEEEecChhccChhHhhhcc Confidence 999999999999999999999999999999998844 24799999999999999999999 Q ss_pred cCCeEEEEEEcceEEEEEEEEEEe Q lcl|Aclame:pro 307 YKGLSFRYKRSGAIHSVDVYGEVE 330 (331) Q Consensus 307 ~~~i~~~~~~aGaIh~v~i~~~v~ 330 (331) +|+++|+|+++||||+|+|++++= T Consensus 481 ~~~~~~~y~~~gaI~~v~~~~~~v 504 (504) T protein:vir:96 481 KANYTLIYSKGDAIRFVEGSDVMI 504 (504) T ss_pred ccceEEEEEECCeEEEEEeccccC Confidence 999999999999999999988876 No 10 >protein:vir:94073 Length: 494 # NCBI annotation: hypothetical protein # Family: family:all:396 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453619;genbank:gi:84662655;genbank:GeneID:5142583 Probab=100.00 E-value=1.1e-81 Score=464.51 Aligned_cols=325 Identities=15% Similarity=0.171 Sum_probs=273.4 Q ss_pred CCCceeeEEEEEeecccccccccceeEEEEcc--CCcceEEEechhhhccCCCCChHHHHHHHHHHc----cCCCcceEE Q lcl|Aclame:pro 1 MVETITDVRVHISVLYPSPRIGLGRPAIFVKG--TAMGYKEYTTLEELKDTFADNTEVYAKAKAVFL----QKDRPDTVA 74 (331) Q Consensus 1 ~v~~i~dV~v~i~~~~~~~~~~fg~~li~~~~--~~~~~~~yts~~~v~~~f~~~s~~ykaA~~~fs----Q~~~p~~v~ 74 (331) =+|+|++|++++... +..+++|+.+|++..+ +.++.++|+++++|.++|+.++||||+|.++|+ |.|+|.+|+ T Consensus 5 p~s~iV~V~~~v~~~-~~~~~~f~~~l~~~~~~~~~~r~~~y~s~~~V~~~FG~~S~ey~aA~~yFs~~~~q~p~P~~l~ 83 (494) T protein:vir:94 5 PISQIVSINPQVVSA-GGTQGTLDGLLLTQATGFPVTQPQVYFSAADVGTAFGLTSDEYNAALVYFAGILGGGQQPASLT 83 (494) T ss_pred CcccEEEeeeecccc-CCcccccceeEeecCccCCccceeeecCHHHHHHhcCCChHHHHHHHHHhhhccCCCccccEEE Confidence 349999999999854 5667899988887765 347899999999999999999999999999999 999999999 Q ss_pred EEeccch------------------------------------------------------------------------- Q lcl|Aclame:pro 75 VITYEDT------------------------------------------------------------------------- 81 (331) Q Consensus 75 v~~~~~~------------------------------------------------------------------------- 81 (331) |++|..+ T Consensus 84 igR~~~~a~~~~l~g~~~~~tl~~~~~~~g~l~iti~g~~~~~~i~lS~~ts~~~vA~~i~~ai~~a~~~v~~d~~~~~f 163 (494) T protein:vir:94 84 IGRYASAATSAAVFGAPLTLSLAQLQTLSGTLIVTTDTQRTSAAINLSGATSFANAASLMTSGFTTPNFAITYDAQRRRF 163 (494) T ss_pred EEeecCccccceeeccchhhhHHhhhhcceEEEEEEcceEEEeeecccccCChhhHHHHHhhhhccccceEEEcccCcEE Confidence 9976521 Q ss_pred ------------------------------------------hHHHHHHHh--hcCcee-EEEEecCCHHHHHHHHHHHH Q lcl|Aclame:pro 82 ------------------------------------------KLLEAAEAY--FLKSWH-FALLAEFKAADALALSNLIE 116 (331) Q Consensus 82 ------------------------------------------~~~~al~~~--~~~~~~-f~~~~~~~~~~i~alA~w~e 116 (331) +..+++.++ .+++|| |.+..+.++++++++|+|+| T Consensus 164 ~v~s~ttG~~s~is~~t~~~a~~l~lt~~~~a~v~~~g~~aet~~~a~~a~~~~~~~Wy~f~~~~~~~~~~ilalA~wie 243 (494) T protein:vir:94 164 VLSTTATGTTASVSAVTGTLADGVGLSTASGAYVEGSGLAADTAASALDRLAASSSTWAIFTTAWAASLSDRTALAQWTS 243 (494) T ss_pred EEEEccCCceeEEEEeccchhhhhhhhccccceEeecCcccccHHHHHHHHHhccCceEEEEEecCCCHHHHHHHHHHHh Confidence 011222222 234555 66667789999999999999 Q ss_pred hcCcEEEEEEeC------------ChHHHHhhcccceEEEEEeCCCchhHHHHHHHHHhcCC----ccceeeeeeeccCC Q lcl|Aclame:pro 117 EQKFKFAVFQVT------------AVADITPLAKNTRTIAIVHSKTGEKLDAALIGNVASLP----VGSATWKGRHGLAG 180 (331) Q Consensus 117 a~~~~~~~~~~t------------~~~~~~~~~~~~~t~~~~~~~~~~~~~aa~~g~~~~~~----~G~~t~~~k~~l~g 180 (331) +|+++|+++.++ ++...++..+|.||+++||+.. ++++++|++++.+ +|++||+||.+++| T Consensus 244 a~~~~~~~~~~~~d~~~~~~~~~~~i~~~l~~~~y~~t~~~y~~~~---~~aa~~g~~aa~~~~~~~g~~T~~~k~q~~g 320 (494) T protein:vir:94 244 DQVFRRIYAAWDQDAAGLSVNNVSSFGNIVKTTPFSNTIPVYGLLA---NAMIVLAWGASTNLQIAEGRTTLALRSPVSS 320 (494) T ss_pred hcCccEEEEEecCCcceeecccchhHHHHHHhhcCCceEEEcCCCC---hHHHHHHHHHhccccccCcceeEEeeccCCC Confidence 999988876543 3445677789999999999854 3567777776655 49999999966899 Q ss_pred cCCCCCCHHHHHHHHhCCCeEEEEEcCe---eEEecCEEeCCc--eehhhHHHHHHHHHHHHHHHHHHhcCCCCCcCHHH Q lcl|Aclame:pro 181 ITSEELKVSEIDAIQKAGGMCYIEKAGI---AQTSEGKTVSGE--FIDSIHGDDWIKATIETRLQKLLTETDKLTFDARG 255 (331) Q Consensus 181 v~~~~~t~t~~~~l~~~~~n~y~~~~g~---~~~~~G~~~~G~--~iD~~~~~dwl~~~iq~~l~~l~~~~~kipyt~~G 255 (331) ++|++++.+|+++|+++|||||+.+++. ..+.+|.+++|+ |||.+++++||+++||++|++||.+++|||||+.| T Consensus 321 i~~~~l~~t~a~al~~~~~N~y~~~~~~~~~~~~~~gg~~sG~~~~id~~~~~~WL~~~iq~~l~~ll~~~~KIPytd~G 400 (494) T protein:vir:94 321 AGVRVDNLANANALLSNGYTYLGKYASATNTYTVTYNGAIGGQFLWADTALGWIALRRNLQQALFETLLAYRSLPYNADG 400 (494) T ss_pred CCCccCCHHHHHHHHhcCCeEEEEecccCceEEEecCceeccccceeeeeccHHHHHHHHHHHHHHHHHhCCCcccChhh Confidence 9999999999999999999999999864 344456667776 79999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHhcCcccccccCCCc--------------------ceEEEccchhcCCHHHHHhcccCCeEEEEE Q lcl|Aclame:pro 256 IALLQSELTTVLNEGFANGIIDSNDETGEP--------------------NFSITALQRSDLNDDDIAKRNYKGLSFRYK 315 (331) Q Consensus 256 ~~~i~~~v~~vl~~~~~~G~I~~g~~~~~~--------------------~~~v~~~~~~~~~~~dr~~R~~~~i~~~~~ 315 (331) +++|+++|+++|+++++||+|+||+|+++. ||+++. ...+++++|++|.+|+++|+|+ T Consensus 401 ~~~l~a~i~~~l~~av~nG~I~~Gv~~~~~q~~~i~~~~G~~~~~~~~~kGyy~~~--~~~~s~~~ra~R~~~~~~~~y~ 478 (494) T protein:vir:94 401 YNALYQGAQDVVSQFVAAGVIRAGVALSASQRAQIDQAAGVPISGDVVDKGWYLQV--IDPITTTVRTDRGSPTVNFWYC 478 (494) T ss_pred HHHHHHHHHHHHHHHHhCceeecccccCcchhhhhhhhhcCccccceeccceeeec--cCCCChhhhhccccCCceEEEE Confidence 999999999999999999999999998753 455543 2458999999999999999999 Q ss_pred EcceEEEEEEEEEEeC Q lcl|Aclame:pro 316 RSGAIHSVDVYGEVEV 331 (331) Q Consensus 316 ~aGaIh~v~i~~~v~~ 331 (331) ++||||.|+|++++=+ T Consensus 479 ~~GAIh~v~i~~~~v~ 494 (494) T protein:vir:94 479 DGGSIQRVVVSATTVI 494 (494) T ss_pred ecCcEEEEEEeeEEeC Confidence 9999999999999888 No 11 >protein:vir:107720 Length: 515 # NCBI annotation: gp17 # Family: family:all:396 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024865;genbank:gi:48697507;genbank:GeneID:2948336 Probab=100.00 E-value=1.2e-74 Score=425.92 Aligned_cols=326 Identities=12% Similarity=0.096 Sum_probs=263.7 Q ss_pred CCCceeeEEEEEeeccccc--ccccceeEEEEccC---CcceEEEechhhhccCCCCChHHHHHHHHHHc----cCCCcc Q lcl|Aclame:pro 1 MVETITDVRVHISVLYPSP--RIGLGRPAIFVKGT---AMGYKEYTTLEELKDTFADNTEVYAKAKAVFL----QKDRPD 71 (331) Q Consensus 1 ~v~~i~dV~v~i~~~~~~~--~~~fg~~li~~~~~---~~~~~~yts~~~v~~~f~~~s~~ykaA~~~fs----Q~~~p~ 71 (331) =|+++++|.|++.+.+|.+ .+.|+ +|||+... .+++++|+++++|..+|+.++||||+|.+||+ |.|+|. T Consensus 2 ~I~~~~~V~i~~~v~aa~~~~~~~f~-~li~t~~~~~p~~r~~~y~s~~~V~~~FG~~S~ey~aA~~yFsg~~~q~p~P~ 80 (515) T protein:vir:10 2 PISFDKYVAITSGVAAQQQIAARSFA-IRVYTPNPMVSVDRLITATSAADVGAYFGTASEEYKRAVKNFGFISKKTRRPT 80 (515) T ss_pred CCCceeEEEeecccccCCccccccce-eeeeecccCCCccceeeecCHHHHHHhcCCChHHHHHHHHHhhhccCCccccc Confidence 6899999999998766653 45788 56666543 46899999999999999999999999999999 999999 Q ss_pred eEEEEeccchh--------------------------------------------------------------------- Q lcl|Aclame:pro 72 TVAVITYEDTK--------------------------------------------------------------------- 82 (331) Q Consensus 72 ~v~v~~~~~~~--------------------------------------------------------------------- 82 (331) +|+|++|..+. T Consensus 81 ~L~igR~~~~a~~~~l~g~~~~~~~l~~~~~is~G~ltitidG~~~~t~s~i~~S~ats~~~vAs~i~tal~~~~~~~~~ 160 (515) T protein:vir:10 81 SIQFARWQREAGPVAIYGGAKKAAALATLQAVTAGAISFLFGGATTVTVSGISFSAATSLADVASELQTALRANADANLA 160 (515) T ss_pred EEEEEeccCcccceEEEeccchhhhHHhhhcccceeEEEEEcceEEEEeeccccccccCHHHHHHHHHhhhccccccccc Confidence 99998754210 Q ss_pred --------------------------------------------------------------HHHHHHHhh--cCceeEE Q lcl|Aclame:pro 83 --------------------------------------------------------------LLEAAEAYF--LKSWHFA 98 (331) Q Consensus 83 --------------------------------------------------------------~~~al~~~~--~~~~~f~ 98 (331) ..+++.++. +++||++ T Consensus 161 ~~tv~~d~~~~~F~v~s~~tG~~~~is~~~~t~~~~~t~~a~~lglt~~~~av~~~g~aaet~~~a~~a~~~~s~nWy~f 240 (515) T protein:vir:10 161 TCTVSYDPVGARFNFAGSPSDDTVQESISIVPQSNPAIDVAQLLGWNSAQGASYIAASPVVSPVDTLIASVAGNNNFGSI 240 (515) T ss_pred eeEEEEecCCCeEEEEEeecCCceeEEEEEecCCCchhhHHHHhccccccceEEecccccccHHHHHHHHHhccCCeEEE Confidence 011222222 4677766 Q ss_pred EEec-----CCHHHHHHHHHHHHhcCcEEEEEEeCChH-------HHHhhcccceEEEEEeCCCchhHHHHHHHHHhcC- Q lcl|Aclame:pro 99 LLAE-----FKAADALALSNLIEEQKFKFAVFQVTAVA-------DITPLAKNTRTIAIVHSKTGEKLDAALIGNVASL- 165 (331) Q Consensus 99 ~~~~-----~~~~~i~alA~w~ea~~~~~~~~~~t~~~-------~~~~~~~~~~t~~~~~~~~~~~~~aa~~g~~~~~- 165 (331) ++.+ .++++++++++|+|+++++|++...++.. ......++.++...++. ..+|++++++|++++. T Consensus 241 ~~a~~~~~~~~~a~~~a~a~~~e~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~-~~~~~~a~~~g~~asvn 319 (515) T protein:vir:10 241 LFTKNGGTGITLSDAEAIALQNQSYNVAYKFQVGVDDTTYSSWQAALAAIGGVNMIYSPVAL-AAEYHDMQDGIIEAATD 319 (515) T ss_pred EEeecCccccchhHHHHHHHHHhhcCceEEEEeccCccceechhhhhhhhhhcCceEEEEec-cCcchHHHHHHHHHhcC Confidence 6653 35789999999999999999876543321 22334455666666555 4567888999998775 Q ss_pred ---CccceeeeeeeccCCcCCCCCCHHHHHHHHhCCCeEEEEEcC----eeEEecCEEeCCc----eehhhHHHHHHHHH Q lcl|Aclame:pro 166 ---PVGSATWKGRHGLAGITSEELKVSEIDAIQKAGGMCYIEKAG----IAQTSEGKTVSGE----FIDSIHGDDWIKAT 234 (331) Q Consensus 166 ---~~G~~t~~~k~~l~gv~~~~~t~t~~~~l~~~~~n~y~~~~g----~~~~~~G~~~~G~----~iD~~~~~dwl~~~ 234 (331) .+|++|||||+ ++||+|++++++|+++|++||||||+.|.+ ..|++||+++||+ |||++||+|||+++ T Consensus 320 f~~~ng~iT~kfKq-~~Gita~~lt~t~a~al~~~~~N~Y~~~~~~~~~~~~~~~G~~~gG~~~~~WiD~~~g~~WL~~~ 398 (515) T protein:vir:10 320 FTQQGGATGYMYVQ-FNNQTPAVNDDTLSGILDDLNINYYGQTQVNGTNLSFYQDGVMMGGPTDPRDSNVYANEQWLKSY 398 (515) T ss_pred CCccchhheecccc-CCCCccccCCHHHHHHHHhcCCeEEEEEeccCceEEEEeCCeeeCCccchhHHHHHhhHHHHHHH Confidence 46999999997 799999999999999999999999999965 5799999999996 79999999999999 Q ss_pred HHHHHHHHHhcCCCCCcCHHHHHHHHHHH-HHHHHHHHhcCcccccccCC---------------------CcceEEEcc Q lcl|Aclame:pro 235 IETRLQKLLTETDKLTFDARGIALLQSEL-TTVLNEGFANGIIDSNDETG---------------------EPNFSITAL 292 (331) Q Consensus 235 iq~~l~~l~~~~~kipyt~~G~~~i~~~v-~~vl~~~~~~G~I~~g~~~~---------------------~~~~~v~~~ 292 (331) ||++|++||++++|||||+.|+++|++.| +++|+++++||+|+||++++ ..||+++.| T Consensus 399 iq~~l~~L~~s~~KIPytd~G~a~i~a~v~q~vl~~av~nG~I~~Gv~ls~~Q~~~i~~~~g~d~~~~~~~~~Gyy~~~~ 478 (515) T protein:vir:10 399 AGASFMSLQLAQGKIPANIEGRGLLLGKMTKDIIPAAKLNGTFSIGKTLTVDQQLFVTELTGDDTAWQKVQNLGYWYDVQ 478 (515) T ss_pred HHHHHHHHHhcCCCCccChhhHHHHHHHHHHHHHHHHHhCCeeecCcccchhHHHHHHhhhcCcccccchhhcceeEecC Confidence 99999999999999999999999999987 57999999999999999744 247999998 Q ss_pred chhcCCHHHHHhcccCCeEEEEEEcceEEEEEEEEEEe Q lcl|Aclame:pro 293 QRSDLNDDDIAKRNYKGLSFRYKRSGAIHSVDVYGEVE 330 (331) Q Consensus 293 ~~~~~~~~dr~~R~~~~i~~~~~~aGaIh~v~i~~~v~ 330 (331) +.+.++..+|..+ .+++.|||+++|+||+|+++.++= T Consensus 479 ~~~~~~~~~r~~~-~~~~~~~y~~g~~i~~i~~~~~~v 515 (515) T protein:vir:10 479 ISSFVDTGGTTKY-QAVYSLVYSKDDLIRKVVGTHTLI 515 (515) T ss_pred cCCCCCccccccc-CceeEEEEEcCceEEEEEeeeecC Confidence 8765555544444 234689999999999999988876 No 12 >protein:vir:3165 Length: 426 # NCBI annotation: capsid protein CP67 # Family: family:all:28419 # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665936;genbank:gi:22091122;genbank:GeneID:951267 Probab=100.00 E-value=4.1e-62 Score=357.15 Aligned_cols=320 Identities=14% Similarity=0.179 Sum_probs=234.0 Q ss_pred CCCceeeEEEEEeecccccccccceeEEEEccCCc-------ceEEEechhhhccCCCCChHHHHHHHHHHccCCCcceE Q lcl|Aclame:pro 1 MVETITDVRVHISVLYPSPRIGLGRPAIFVKGTAM-------GYKEYTTLEELKDTFADNTEVYAKAKAVFLQKDRPDTV 73 (331) Q Consensus 1 ~v~~i~dV~v~i~~~~~~~~~~fg~~li~~~~~~~-------~~~~yts~~~v~~~f~~~s~~ykaA~~~fsQ~~~p~~v 73 (331) |-++|++|+|+++. +|.++.+||.+|||++++.+ +.+.|+|+++|++||++++|+||+|.++|+|++...+. T Consensus 1 m~~~iVnV~Is~~t-~A~~~~~Fg~~liigs~~~~~p~~~f~~~~~Yss~~~V~~Dfg~~s~~Y~AA~~~f~Q~~~~~r~ 79 (426) T protein:vir:31 1 MPKQIVEIELTAEI-ADRPQETFTDAAIVGTAEEEPPDAEFGEVNQYSTSTSVGDDYGEDSDVYTASEAIEEMGAEQWRV 79 (426) T ss_pred CCcceEEEEeeccc-ccccccccceeeeeeeccccccccccchhhhhhhHHHHHhcCCCChHHHHHHHHHHhCCceeEEe Confidence 99999999999994 67778899999999988764 45569999999999999999999999999999776553 Q ss_pred EEEecc-----------------------chhHHHHHH----H------------------------------hhcCcee Q lcl|Aclame:pro 74 AVITYE-----------------------DTKLLEAAE----A------------------------------YFLKSWH 96 (331) Q Consensus 74 ~v~~~~-----------------------~~~~~~al~----~------------------------------~~~~~~~ 96 (331) .+...+ .......+. + ....+|+ T Consensus 80 ~v~~at~~~~~~~t~~~tv~g~~~s~~a~~~~~a~~i~~~~~~~~~~~~~~~~~~~~t~~g~~t~~~~~~~~~~s~~dw~ 159 (426) T protein:vir:31 80 MVLEATEVTEEELSDGDTIDKVPILGNHEVESPDGDIEFTTDDDPDVEDFDAEIVINSATGDVATSEDSIELTYFHADWS 159 (426) T ss_pred eccccceeeeccCCcceeecceeeeecccCcchHHHHHHhhccccccccceeeeEeccccceeeccccceeeeeccCcch Confidence 222110 000111111 0 1233566 Q ss_pred EEEEe--cCC----------HHHH---HHHHHHHHhcCcEEEEEEe---------CChHHHHhhcccceEEEEEe--CCC Q lcl|Aclame:pro 97 FALLA--EFK----------AADA---LALSNLIEEQKFKFAVFQV---------TAVADITPLAKNTRTIAIVH--SKT 150 (331) Q Consensus 97 f~~~~--~~~----------~~~i---~alA~w~ea~~~~~~~~~~---------t~~~~~~~~~~~~~t~~~~~--~~~ 150 (331) .+... ..+ .+.+ .++..|.+++.++++.... ++.+...+.++|.++.++++ ... T Consensus 160 ~~~~~~s~~~~~~ia~~~~~~~~~~~~~~~~~wa~~~~i~~va~~~e~~~~~~~~~~~a~~~~~~~y~p~~~~~~~~~~~ 239 (426) T protein:vir:31 160 QLDEFPSDVNNFAVADRRFDLKGVGVLDETHSWASDEDMGMIANGVNVDDYDSVDEAMDVAHEVAGYVPSGDLMMIVDAS 239 (426) T ss_pred hhhcccccchhhhhhccccchhhhhhhHhhhhhhhhcceeeeeeccchhhhcchhhhhhhhhcccccccchhheeehhcc Confidence 44311 111 1122 2367788888877765432 22344455566654544444 333 Q ss_pred chhHHHHHHHHHhcCCc-----------cceeeeeeeccCCcCCCCCCHHHHHHHHhCCCeEEEEEcCee-----EEecC Q lcl|Aclame:pro 151 GEKLDAALIGNVASLPV-----------GSATWKGRHGLAGITSEELKVSEIDAIQKAGGMCYIEKAGIA-----QTSEG 214 (331) Q Consensus 151 ~~~~~aa~~g~~~~~~~-----------G~~t~~~k~~l~gv~~~~~t~t~~~~l~~~~~n~y~~~~g~~-----~~~~G 214 (331) .....+.+++.++...| ++..++|+++ +|+... +..++.. ..++++|.|..+.|.+ ++++| T Consensus 240 ~~~~~~~~~~~~aa~~~~~~~~~~~~~~~~~~~~~~~~-~gv~~t-~~~~~~A-~~~~~~n~~~~~~~~~~i~~~~~~~G 316 (426) T protein:vir:31 240 DDDLAAYQLGKFAVSEPWYNPLWNELPAGETVSKNVGD-PEEQGT-FEGGDEA-EGEGPVNVLIDVSDANRVSNAVTTAG 316 (426) T ss_pred ccchhhHHhhhhhhhccccchhhhhccccccceeeccc-cccccc-cchhhhh-hhcCCceEEEEecCceeeecceeecc Confidence 34445666666655432 3445566653 777532 3333333 4458889999988754 56779 Q ss_pred EEeCCceehhhHHHHHHHHHHHHHHHHHHhcCCCCCcCHHHHHHHHHHHHHHHHHHHhcCcccccccCCCcceEEEccch Q lcl|Aclame:pro 215 KTVSGEFIDSIHGDDWIKATIETRLQKLLTETDKLTFDARGIALLQSELTTVLNEGFANGIIDSNDETGEPNFSITALQR 294 (331) Q Consensus 215 ~~~~G~~iD~~~~~dwl~~~iq~~l~~l~~~~~kipyt~~G~~~i~~~v~~vl~~~~~~G~I~~g~~~~~~~~~v~~~~~ 294 (331) ++++|+|||++|++|||+++||++|++||+|.+|||||+.|++||++.|+.+|+++++.|. +..++|+|..|.+ T Consensus 317 ~~~~G~~iD~~~g~dwl~~~iq~~l~~ll~~~~KIpyt~~Gi~~I~~~i~~~L~~~v~~~g------~~~~~y~v~~P~~ 390 (426) T protein:vir:31 317 ADSDTSFFDIRRTKVYTAEMLELDLESLQVSDDDVPFTEDGQAMIEDAIKGTMSGLTGSVG------QPLAEYEVDVPEW 390 (426) T ss_pred cccchhhhhhHHHHHHHHHHHHHHHHHHhhcCCCCccchhHHHHHHHHHHHHHHHHhcCCC------ccccceeecCCCc Confidence 9999999999999999999999999999999999999999999999999999999998654 2234799999998 Q ss_pred hcCCHHHHHhcccCCeEEEEEEcceEEEEEEEEEEeC Q lcl|Aclame:pro 295 SDLNDDDIAKRNYKGLSFRYKRSGAIHSVDVYGEVEV 331 (331) Q Consensus 295 ~~~~~~dr~~R~~~~i~~~~~~aGaIh~v~i~~~v~~ 331 (331) ++++ +||++|++++++|.++|+||||.++|+|+|+| T Consensus 391 ~~~~-~dra~R~~~~i~~~~~laGAIh~v~I~g~v~v 426 (426) T protein:vir:31 391 DDDD-VDRVNRNWGGIDLDARLAQRAHTFSLGLNVSV 426 (426) T ss_pred cccc-hhhhhhccCCceEEEEEeCcEEEEEEEEEEeC Confidence 8865 69999999999999999999999999999999 No 13 >protein:vir:102359 Length: 356 # NCBI annotation: XkdK protein # Family: family:all:632 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529565;genbank:gi:90592650;genbank:GeneID:3974491 Probab=99.46 E-value=2.7e-13 Score=89.45 Aligned_cols=313 Identities=20% Similarity=0.249 Sum_probs=194.7 Q ss_pred CCCceeeEEEEEeeccc--ccccccceeEEEEccCCcceEEEechhhhccCCCCChHHHHHHHHHHccC------CCcce Q lcl|Aclame:pro 1 MVETITDVRVHISVLYP--SPRIGLGRPAIFVKGTAMGYKEYTTLEELKDTFADNTEVYAKAKAVFLQK------DRPDT 72 (331) Q Consensus 1 ~v~~i~dV~v~i~~~~~--~~~~~fg~~li~~~~~~~~~~~yts~~~v~~~f~~~s~~ykaA~~~fsQ~------~~p~~ 72 (331) |.= +=++.|.+.-.++ ..+..=|...++-.+++..+++|++.+++...+......|.. ..|..+ ..|.+ T Consensus 1 ~~g-lp~i~i~f~~~a~ta~~~g~rGiv~~il~d~~~~~~~~~~~~~v~~~~~~~n~~~i~--~~~~g~~~~~~~~~p~~ 77 (356) T protein:vir:10 1 MAG-LVNINIEFKELATSFIQRSKAGIVAIILKDTTKMYKELTSEDDIPISLSADNKKYIK--YGFVGATDNEKVLRPSK 77 (356) T ss_pred CCC-CCceeEEEeecceeeccCCccceEEEEEecCCcceeEEeccccchhHHHHHHHHHHH--HHhhcccccccccccee Confidence 433 4455555542222 222112455555566677899999999988777665555554 444332 12555 Q ss_pred EEEEec-cchhHHHHHHHhhcCceeEEEEecCCHHHHHHHHHHHHh----cCcEEEEEEeCChHHHHhhcccceEEEEEe Q lcl|Aclame:pro 73 VAVITY-EDTKLLEAAEAYFLKSWHFALLAEFKAADALALSNLIEE----QKFKFAVFQVTAVADITPLAKNTRTIAIVH 147 (331) Q Consensus 73 v~v~~~-~~~~~~~al~~~~~~~~~f~~~~~~~~~~i~alA~w~ea----~~~~~~~~~~t~~~~~~~~~~~~~t~~~~~ 147 (331) +.+... +.++..++|..+-...|-++++...+.++...++.|+.. .++++..+.....++.....+..+. +.+ T Consensus 78 ~~~~~~~t~~~y~~aL~~le~~~fn~l~~~~~d~~~~~~~~a~ikr~r~~~~~~~~~V~~~~~aD~EgIInv~n~-~~~- 155 (356) T protein:vir:10 78 VIISTFTEDGKVEDILEELESVEFNYLCMPEAIEAEKTKIVTWIKKIREEESTEAKAVLANIKADNEAIINFTEN-VVV- 155 (356) T ss_pred eeeecccCchhHHHHHHHhcCccceEEEecCCChHHHHHHHHHHHHHHhcCCcEEEEEecCCCCCCceeEEeecC-eEe- Confidence 544433 345677778777677777888888888999999999964 3456655543322222222221111 111 Q ss_pred CCCchhH----HHHHHHHHhcCCc-cceeeeeeeccCCcCCC-CCCHHHHHHHHhCCCeEEEEEcCeeEEecCE----Ee Q lcl|Aclame:pro 148 SKTGEKL----DAALIGNVASLPV-GSATWKGRHGLAGITSE-ELKVSEIDAIQKAGGMCYIEKAGIAQTSEGK----TV 217 (331) Q Consensus 148 ~~~~~~~----~aa~~g~~~~~~~-G~~t~~~k~~l~gv~~~-~~t~t~~~~l~~~~~n~y~~~~g~~~~~~G~----~~ 217 (331) ...++. ++.++|..++... -|.|. +. ++++... .++.+|++.+.++|...+...++...+.+|. +. T Consensus 156 -~g~~~t~~~~~~~vAG~~Ag~~~n~S~T~--~~-~~~~~~~~~~t~~e~~~ai~~G~lvl~~d~~~V~I~~~VNSltt~ 231 (356) T protein:vir:10 156 -DGEEITAEKYTTRVASLIASTPNTQSITY--AP-LDEVESIVKIDKASADAKVQAGELILRRLSGKIRIARGINSLTTL 231 (356) T ss_pred -cceeechhHHHHHHHHHHhccchhccccc--ee-cCCccccccCCHHHHHHHHhCCeEEEEEEcCeEEEEecCccceec Confidence 112333 3344444444443 35564 32 5665432 5899999999999999998888877788886 22 Q ss_pred CCc----e--ehhhHHHHHHHHHHHHHHHHHHhcCCCCCcCHHHHHHHHHHHHHHHHHHHhcCcccccccCCCcceEEEc Q lcl|Aclame:pro 218 SGE----F--IDSIHGDDWIKATIETRLQKLLTETDKLTFDARGIALLQSELTTVLNEGFANGIIDSNDETGEPNFSITA 291 (331) Q Consensus 218 ~G~----~--iD~~~~~dwl~~~iq~~l~~l~~~~~kipyt~~G~~~i~~~v~~vl~~~~~~G~I~~g~~~~~~~~~v~~ 291 (331) +.+ | |-.++..|-+.+.++...-+.++ +|+|=+..|..++.+.++..+++..+.|.|.++.+ ..+.. T Consensus 232 t~~k~~~f~Kirvvr~~D~i~~Di~~~f~~~yi--GKv~N~~dgr~~l~~ai~~y~~~L~~~~~I~~~~~-----~eid~ 304 (356) T protein:vir:10 232 TAEKGEIFQKIKLVDTKDLISKDIKNIYVEKYL--RKCPNTYDNKCLFIVAVQSYLTELAKQELIDSNFT-----VEIDL 304 (356) T ss_pred CCCCCcchhhhHHHHHHHHHHHHHHHHHhhccc--cccCCCHHHHHHHHHHHHHHHHHHHhCCccccCce-----eEecc Confidence 333 3 89999999999999875554555 79999999999999999999999999999976421 01111 Q ss_pred c-----------chhcCCHHHHHh---cccCCeEEEEEEcceEEEEEEEEEE Q lcl|Aclame:pro 292 L-----------QRSDLNDDDIAK---RNYKGLSFRYKRSGAIHSVDVYGEV 329 (331) Q Consensus 292 ~-----------~~~~~~~~dr~~---R~~~~i~~~~~~aGaIh~v~i~~~v 329 (331) . ..+++++.+... +..-=+++.+++-.|+..+.++.+| T Consensus 305 e~q~~~~~~~g~d~~~~~d~~v~~~~~~~~v~~~~~v~~vdamE~iy~ti~v 356 (356) T protein:vir:10 305 EKQKEYLEGKKIAVSKMKENEIKEANTGSNGFYLINLKLVDAMEDINIRVQM 356 (356) T ss_pred cchHHHhhhccccccccccceeecccCCcEEEEEEEEEEEeeeeeEEeEEeC Confidence 0 011122222111 1122266777889999999988888 No 14 >protein:vir:102957 Length: 437 # NCBI annotation: sheath tail protein # Family: family:all:632 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945292;genbank:gi:39653727;uniprot:Q708M0;genbank:GeneID:2672871 Probab=99.28 E-value=6.6e-12 Score=81.87 Aligned_cols=302 Identities=14% Similarity=0.122 Sum_probs=163.8 Q ss_pred CCC------ceeeEEEEEeecccccccccceeEEEEccC---------------Cc-ceEEEechhhhcc-CCCC---Ch Q lcl|Aclame:pro 1 MVE------TITDVRVHISVLYPSPRIGLGRPAIFVKGT---------------AM-GYKEYTTLEELKD-TFAD---NT 54 (331) Q Consensus 1 ~v~------~i~dV~v~i~~~~~~~~~~fg~~li~~~~~---------------~~-~~~~yts~~~v~~-~f~~---~s 54 (331) +.+ .+.|+ +.+....|... -+.+-+..... .. .-.++..++++.. ++.. +. T Consensus 90 ~~~g~~a~~tl~~~-~~~~A~~~G~~--gn~i~v~v~~~~~d~~~~~v~~~~~~~~~d~~~v~~~~~~~~n~~v~~~~~~ 166 (437) T protein:vir:10 90 LNTGEKANVSLSDN-VTAQAKYSGVR--GNDITVTVKTNVDDPSSFDVVTFLDTVVMDLQTVKVLADLKNNALVEFSGTG 166 (437) T ss_pred CCCCceeeEeeccc-eEEEeccCCcc--cceeEEEEeeccCCccceEEEEecCcceeeeeehhhhhhhhhhccccccccc Confidence 000 01111 11111112111 00111111110 00 0111222222111 1100 00 Q ss_pred HHHHHHHHHHccCCCcceEEEEeccchhHHHHHHHhhcCceeEEEEecCCHHHHHHHHHHHHh----cCcEEEEEEeCCh Q lcl|Aclame:pro 55 EVYAKAKAVFLQKDRPDTVAVITYEDTKLLEAAEAYFLKSWHFALLAEFKAADALALSNLIEE----QKFKFAVFQVTAV 130 (331) Q Consensus 55 ~~ykaA~~~fsQ~~~p~~v~v~~~~~~~~~~al~~~~~~~~~f~~~~~~~~~~i~alA~w~ea----~~~~~~~~~~t~~ 130 (331) .+-..+...|..+..+. .+.++..++|..+....|.++++...+.+.+.++..|++. ..+++..+..... T Consensus 167 ~l~~~a~~~LtGG~dg~------~t~~dy~~al~~le~~~~n~l~~~~~d~~~~t~~~~~ik~~r~~~g~~~~~V~~~~~ 240 (437) T protein:vir:10 167 ELQPVAGAKLTGGTDGA------ISTQDYLEYFKALETVEFNYMALPVEDASIKKAAINFIKRMREDEGLGAQLVVADSD 240 (437) T ss_pred ccccccceeeeccccCC------CChhHHHHHHHHhccCcceEEEecCCChhHHHHHHHHHHHHHhccCceEEEEeCCCC Confidence 00000011111111110 1224556777777777788888888888889999999874 2445544432221 Q ss_pred HHHHhhcccceEEEEEeCCCch----hHHHHHHHHHhcCCccceeeeeeeccCCcC-C-CCCCHHHHHHHHhCCCeEEEE Q lcl|Aclame:pro 131 ADITPLAKNTRTIAIVHSKTGE----KLDAALIGNVASLPVGSATWKGRHGLAGIT-S-EELKVSEIDAIQKAGGMCYIE 204 (331) Q Consensus 131 ~~~~~~~~~~~t~~~~~~~~~~----~~~aa~~g~~~~~~~G~~t~~~k~~l~gv~-~-~~~t~t~~~~l~~~~~n~y~~ 204 (331) ++.....+..+... ...... ..++.++|.+++..+. ...-++. ++|+. . ..++.+|++.+.++|...+.. T Consensus 241 ~d~e~Iin~~n~~~--~~~~~~~~~~~~~a~vAG~~Ag~~~~-~S~t~~~-~~~~~~v~~~~t~~e~~~~i~~G~~vl~~ 316 (437) T protein:vir:10 241 ADSEAVINVKNGVI--LSDKTVIDKTKATVWVAAASANAGVE-KSLTYEK-YEDSVDVVGRLSHTETEDALLKGQFVFTA 316 (437) T ss_pred CCCceEEEeeccee--ecCcceechhhHHHHHHHHhccCccc-cCccccc-cCCcccccccCCHHHHHHHHhCCcEEEEE Confidence 11111111111111 111112 2344455555555443 2333554 78863 3 478999999999999999988 Q ss_pred EcCeeEEecCEEe----C---C-c--eehhhHHHHHHHHHHHHHHHHHHhcCCCCCcCHHHHHHHHHHHHHHHHHHHhcC Q lcl|Aclame:pro 205 KAGIAQTSEGKTV----S---G-E--FIDSIHGDDWIKATIETRLQKLLTETDKLTFDARGIALLQSELTTVLNEGFANG 274 (331) Q Consensus 205 ~~g~~~~~~G~~~----~---G-~--~iD~~~~~dwl~~~iq~~l~~l~~~~~kipyt~~G~~~i~~~v~~vl~~~~~~G 274 (331) .++...+.+|..+ + + + .|-+++-.|.+...++..+-+.++ +|+|=+..|..++++.++..|++..+.| T Consensus 317 ~~~~v~i~~gInTltt~~~~~~~~~~ki~vir~~D~i~~di~~~~~~~yi--Gk~~N~~~~r~~~~~~i~~yl~~l~~~g 394 (437) T protein:vir:10 317 RRGRAVVEQDINSHVSFTIEKNQDFRKNRILRTLDDIVNDTRYAFSEYFL--GKVSNNEDGRQAFKANRIRYFKDLEARG 394 (437) T ss_pred eCCeEEEEEccccccccCCCCCchhhhhhHHHHHHHHHHHHHHHHHhccc--cccCCCHHHHHHHHHHHHHHHHHHHhCC Confidence 8877777777632 1 1 2 477999999999999987777666 6899999999999999999999999999 Q ss_pred cccccccCCCcceEEEccchhcCCHHHHHhcccCCeEEEEEEcceEEEEEEEEEEe Q lcl|Aclame:pro 275 IIDSNDETGEPNFSITALQRSDLNDDDIAKRNYKGLSFRYKRSGAIHSVDVYGEVE 330 (331) Q Consensus 275 ~I~~g~~~~~~~~~v~~~~~~~~~~~dr~~R~~~~i~~~~~~aGaIh~v~i~~~v~ 330 (331) .|.+.... .+.... - ++ .... -+++.++...++..+.++.+|. T Consensus 395 ~I~~~~~~-----d~~v~~---~--~~-~~~v--~v~~~v~~~dame~iy~ti~v~ 437 (437) T protein:vir:10 395 AIEDFKVE-----DIEVLR---G--EL-KESV--VVNVKVKPVDSMEKLYMTVTVE 437 (437) T ss_pred CccCCCce-----eEEeec---C--CC-CCEE--EEEEEEEEeeeeeeEEEEEEec Confidence 99754321 111111 0 11 1222 3889999999999999999999 No 15 >protein:vir:78986 Length: 436 # NCBI annotation: putative sheath tail protein # Family: family:all:632 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110731;genbank:gi:134287348;genbank:GeneID:4955160 Probab=99.17 E-value=1.7e-11 Score=79.59 Aligned_cols=300 Identities=12% Similarity=0.056 Sum_probs=167.4 Q ss_pred CCCceeeEEEEEeecccccccccceeEEEEccCCcceEEEechhhhcc-CC---CCChHHHHHHHHHHccCCCcceEEEE Q lcl|Aclame:pro 1 MVETITDVRVHISVLYPSPRIGLGRPAIFVKGTAMGYKEYTTLEELKD-TF---ADNTEVYAKAKAVFLQKDRPDTVAVI 76 (331) Q Consensus 1 ~v~~i~dV~v~i~~~~~~~~~~fg~~li~~~~~~~~~~~yts~~~v~~-~f---~~~s~~ykaA~~~fsQ~~~p~~v~v~ 76 (331) .-.+=-+++|.+.. .+.....|-..+.++.... +-.....++++.. +| ..+..+-.-+...|..|..... T Consensus 111 ~g~~gn~i~v~v~~-~~~d~~~~dv~~~~g~~~~-d~~~~~~~~~l~~n~~V~~~~~g~la~~a~~~LtGG~dG~~---- 184 (436) T protein:vir:78 111 SGIRGNDLKVIVTT-NIDDNAKFDVVTLLDNKKV-DTQIAKVITELQDNDYVTWKKEATLEATAGLTFTNGTNGEA---- 184 (436) T ss_pred CCCCCcEEEEEecc-cccccCceEEEEEecchhh-hhhhHHHHhhccCCceEEEEecccccccceeeeeccccccc---- Confidence 11111112222221 1111122322222221110 0111123333322 11 1010111111122222222111 Q ss_pred eccchhHHHHHHHhhcCceeEEEEecCCHHHHHHHHHHHHh----cCcEEEEEEeCChHHHHhhcccceEEEEEeC-CCc Q lcl|Aclame:pro 77 TYEDTKLLEAAEAYFLKSWHFALLAEFKAADALALSNLIEE----QKFKFAVFQVTAVADITPLAKNTRTIAIVHS-KTG 151 (331) Q Consensus 77 ~~~~~~~~~al~~~~~~~~~f~~~~~~~~~~i~alA~w~ea----~~~~~~~~~~t~~~~~~~~~~~~~t~~~~~~-~~~ 151 (331) .+.++..++|..+-...|.++++...+.+...+++.|+.. .++++..+..... ...+....-+-.. ... T Consensus 185 -~T~~dy~~al~~le~~~fn~l~~~~~d~~~~~~~~a~ikr~re~~g~~~~aV~~~~~-----~~d~EgIInv~n~v~g~ 258 (436) T protein:vir:78 185 -VTGTEYQAFLDKIESYSFNALGCLATTAEIKSLFVEFTKRMRDKVGAKFQTVLYKKN-----DADYEGVVSVENKIKDT 258 (436) T ss_pred -cchHHHHHHHHHHcccceeEEEecCCChHHHHHHHHHHHHHHhhcCCeEEEEecCCC-----CCCCceEEEeecccCCc Confidence 1235667788887778888888888888888999999964 3455554432211 0111111111111 111 Q ss_pred hh----HHHHHHHHHhcCCc-cceeeeeeeccCCcC-C-CCCCHHHHHHHHhCCCeEEEEEcCeeEEecCEEe----CC- Q lcl|Aclame:pro 152 EK----LDAALIGNVASLPV-GSATWKGRHGLAGIT-S-EELKVSEIDAIQKAGGMCYIEKAGIAQTSEGKTV----SG- 219 (331) Q Consensus 152 ~~----~~aa~~g~~~~~~~-G~~t~~~k~~l~gv~-~-~~~t~t~~~~l~~~~~n~y~~~~g~~~~~~G~~~----~G- 219 (331) .+ .++.++|..++... -|.| |+. ++|+. . ..++.+|++.+.++|..++...++...+.+|..+ +. T Consensus 259 ~~~~~~~~a~vAG~~Ag~~~~~S~T--~~~-~~~~~~v~~~~t~~e~~~ai~~G~lvl~~d~~~v~I~~~VNTltt~~~~ 335 (436) T protein:vir:78 259 GLLESSLIYWTTGAIAGCDINKSNT--NKR-YDGEFDVDVNYTQIHLEEALKTGKFIFHKVGDEVHVLEDINTFVSFTDE 335 (436) T ss_pred eechhHHHHHHHHHHhcCccccCcc--cee-cCccccccccCCHHHHHHHHhCCeEEEEEeCCeEEEEEccccceecCCC Confidence 22 23344444444443 3455 553 67762 3 3599999999999999998877777777777633 21 Q ss_pred ---c--eehhhHHHHHHHHHHHHHHHHHHhcCCCCCcCHHHHHHHHHHHHHHHHHHHhcCcccccccCCCcceEEEccch Q lcl|Aclame:pro 220 ---E--FIDSIHGDDWIKATIETRLQKLLTETDKLTFDARGIALLQSELTTVLNEGFANGIIDSNDETGEPNFSITALQR 294 (331) Q Consensus 220 ---~--~iD~~~~~dwl~~~iq~~l~~l~~~~~kipyt~~G~~~i~~~v~~vl~~~~~~G~I~~g~~~~~~~~~v~~~~~ 294 (331) + -|-.++..|-+.+.++..+-+.++ +|+|=+..|..++.+.++..|++..+.|.|.+.+.. .+.+. T Consensus 336 k~~~~~kI~vir~~D~i~~di~~~~~~~yi--GKv~N~~dgr~~l~~~i~~yl~~L~~~g~I~~f~~~-----Dv~v~-- 406 (436) T protein:vir:78 336 KNDDFSSNQSVRVLDQIANDIATLFNTKYL--GEVPNDKSGRISFWNDVVKHHEQLQNMRAIEDFKAD-----DVSVE-- 406 (436) T ss_pred CCcchhhhhHHHHHHHHHHHHHHHhhhccc--cccCCCHHHHHHHHHHHHHHHHHHHhCCcccCCCCc-----ceEEe-- Confidence 2 488999999999999877666666 699999999999999999999999999999743211 12211 Q ss_pred hcCCHHHHHhcccCCeEEEEEEcceEEEEEEEEEEe Q lcl|Aclame:pro 295 SDLNDDDIAKRNYKGLSFRYKRSGAIHSVDVYGEVE 330 (331) Q Consensus 295 ~~~~~~dr~~R~~~~i~~~~~~aGaIh~v~i~~~v~ 330 (331) +.+ ..+ .-=+++.+++-.|+..+.++.+|. T Consensus 407 ----~~~-~~~-~v~v~~~v~pvdamekiy~ti~v~ 436 (436) T protein:vir:78 407 ----PGS-DKK-TVVVSDAVKVISAMSKLYMTVSVS 436 (436) T ss_pred ----ecC-CCC-EEEEEEEEEEEEeeeeEEEEEEEC Confidence 111 122 223788889999999999999999 No 16 >protein:vir:6079 Length: 396 # NCBI annotation: gpFI # Family: family:all:115 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878220;genbank:gi:33438919;genbank:GeneID:1457754 Probab=99.02 E-value=4e-09 Score=66.61 Aligned_cols=313 Identities=12% Similarity=0.094 Sum_probs=197.1 Q ss_pred CCCceeeEEEEEeecc--cccccccceeEEEEccCCc--------ceEEEechhhhccCCCCChHHHHHHHHHHccCCCc Q lcl|Aclame:pro 1 MVETITDVRVHISVLY--PSPRIGLGRPAIFVKGTAM--------GYKEYTTLEELKDTFADNTEVYAKAKAVFLQKDRP 70 (331) Q Consensus 1 ~v~~i~dV~v~i~~~~--~~~~~~fg~~li~~~~~~~--------~~~~yts~~~v~~~f~~~s~~ykaA~~~fsQ~~~p 70 (331) |.+-.=.|.|.--... ++..+....+.+++..... ....+++..+-...|+.+..+..+...+|.++... T Consensus 1 m~~~~~Gv~v~e~~~~~~~v~~~~tav~~fvGta~~~~~~~~p~~~p~~v~s~~~~~~~~g~~~tl~~a~~~~~~~gg~~ 80 (396) T protein:vir:60 1 MSDYHHGVQVLEINEGTRVISTVSTAIVGMVCTASDADAEIFPLNKPVLITNVQSAIAKAGKKGTLAASLQAIADQSKPV 80 (396) T ss_pred CCCCCCCeEEEEcCCCcccccccCceeEEEEecccccccccccCccCeEeechHHHHHhhcCcchhHHHHHHHhhccCce Confidence 8887666777653333 3445677888888865332 23456777776677888888888888999887544 Q ss_pred ceEEEEecc-ch-----------------------hHHHHHHHhhcCcee---EEEEecCC-HHHHHHHHHHHHhcCcEE Q lcl|Aclame:pro 71 DTVAVITYE-DT-----------------------KLLEAAEAYFLKSWH---FALLAEFK-AADALALSNLIEEQKFKF 122 (331) Q Consensus 71 ~~v~v~~~~-~~-----------------------~~~~al~~~~~~~~~---f~~~~~~~-~~~i~alA~w~ea~~~~~ 122 (331) ..+.-.... .. +...++.+....... .+...+.+ ..-..++...++..+. + T Consensus 81 ~~vv~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~tg~~al~~~~~~~~~~~~il~ap~~~~~~v~~al~~~~~~~~~-~ 159 (396) T protein:vir:60 81 TVVVRVEDGTGEDEETKLAQTVSNIIGTTDENGQYTGLKALLAAESVTGVKPRILGVPGLDTKEVAVALASVCQKLRA-F 159 (396) T ss_pred EEEEecccccccccccccccccccccccccccccccchhhhhhcccceeeeeeeccccccccHHHHHHHHHHhccCCe-E Confidence 332111000 00 011222222111111 11112222 2333456666655443 4 Q ss_pred EEEEe---CChHHHHhh---cccceEEEEEeCC-------Cc----hhHHHHHHHHHhcCCcccee---eeeeeccCCcC Q lcl|Aclame:pro 123 AVFQV---TAVADITPL---AKNTRTIAIVHSK-------TG----EKLDAALIGNVASLPVGSAT---WKGRHGLAGIT 182 (331) Q Consensus 123 ~~~~~---t~~~~~~~~---~~~~~t~~~~~~~-------~~----~~~~aa~~g~~~~~~~G~~t---~~~k~~l~gv~ 182 (331) +++.. ++.+.+... .+..+.. +|++. .. ..+.+.++|.++..+.-.-- ...| .+.|+. T Consensus 160 ~i~d~p~~~~~~~a~~~~~~~~s~~~~-~~~p~~~~~d~~~~~~~~~p~s~~~AG~~a~~d~~~g~~~spaN~-~l~gi~ 237 (396) T protein:vir:60 160 GYISAWGCKTISEVKAYRQNFSQRELM-VIWPDFLAWDTVASTTATAYATARALGLRAKIDQEQGWHKTLSNV-GVNGVT 237 (396) T ss_pred EEEeCCCCCCHHHHHHHHhhcCCceEE-EEeCceeeecccCCceeEEchhHHHHHHHHHhhhccCcEeCcCCc-eeccee Confidence 44432 222333222 2333443 33331 11 13466677766655432212 2334 355653 Q ss_pred C--------CCCCHHHHHHHHhCCCeEEEEEcCeeEEecCEEeCCc----eehhhHHHHHHHHHHHHHHHHHHhcCCCCC Q lcl|Aclame:pro 183 S--------EELKVSEIDAIQKAGGMCYIEKAGIAQTSEGKTVSGE----FIDSIHGDDWIKATIETRLQKLLTETDKLT 250 (331) Q Consensus 183 ~--------~~~t~t~~~~l~~~~~n~y~~~~g~~~~~~G~~~~G~----~iD~~~~~dwl~~~iq~~l~~l~~~~~kip 250 (331) . ..++.+|.+.|..+|+|+.....| ..+..+++++++ ||-+.+-.+|+...|+..+...+-. | T Consensus 238 ~~~~~~~~~~~~~~~~~~~Ln~~gI~~~~~~~G-~~~wG~rT~~~d~~~~~i~~rR~~~~i~~~i~~~~~~~v~e----~ 312 (396) T protein:vir:60 238 GISASVFWDLQESGTDADLLNESGVTTLIRRDG-FRFWGNRTCSDDPLFLFENYTRTAQVLADTMAEAHMWAVDK----P 312 (396) T ss_pred eceeecccccCCCcchhhhhhhcCcEEEEcCCC-EEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhccC----C Confidence 3 235678999999999999977555 356677888884 8999999999999999988765443 6 Q ss_pred cCHHHHHHHHHHHHHHHHHHHhcCcccccccCCCcceEEEccchhcCCHHHHHhcccCCeEEEEEEcceEEEEEEEEEEe Q lcl|Aclame:pro 251 FDARGIALLQSELTTVLNEGFANGIIDSNDETGEPNFSITALQRSDLNDDDIAKRNYKGLSFRYKRSGAIHSVDVYGEVE 330 (331) Q Consensus 251 yt~~G~~~i~~~v~~vl~~~~~~G~I~~g~~~~~~~~~v~~~~~~~~~~~dr~~R~~~~i~~~~~~aGaIh~v~i~~~v~ 330 (331) .++.-...|+..++.-|+..+++|.|. ||++++.. +..|++++.+.+.. +.+.+.....++.|.+....+ T Consensus 313 n~~~~~~~i~~~i~~~l~~l~~~gal~--------g~~~~~d~-~~nt~~~i~~G~~~-~~i~~~p~~pae~I~~~~~~~ 382 (396) T protein:vir:60 313 ITATLIRDIVDGINAKFRELKTNGYIV--------DATCWFSE-ESNDAETLKAGKLY-IDYDYTPVPPLENLTLRQRIT 382 (396) T ss_pred CCHHHHHHHHHHHHHHHHHHHhCCcee--------ceEEEEec-CCCCHHHhhCCEEE-EEEEEEecCCcceEEEEEEEc Confidence 688889999999999999999999996 36777753 67899999888885 889999999999999999999 Q ss_pred C Q lcl|Aclame:pro 331 V 331 (331) Q Consensus 331 ~ 331 (331) . T Consensus 383 ~ 383 (396) T protein:vir:60 383 D 383 (396) T ss_pred h Confidence 8 No 17 >protein:vir:1845 Length: 392 # NCBI annotation: tail sheath protein FI # Family: family:all:115 # MgeID: mge:324 # MgeName: 186 # Cross-refs: genbank:acc:NP_052269;genbank:gi:9634076;genbank:GeneID:1262448 Probab=99.00 E-value=4.2e-09 Score=66.50 Aligned_cols=314 Identities=12% Similarity=0.061 Sum_probs=193.4 Q ss_pred CCCceeeEEEEEeecc--cccccccceeEEEEccCCc--------ceEEEechhhhccCCCCChHHHHHHHHHHccCCCc Q lcl|Aclame:pro 1 MVETITDVRVHISVLY--PSPRIGLGRPAIFVKGTAM--------GYKEYTTLEELKDTFADNTEVYAKAKAVFLQKDRP 70 (331) Q Consensus 1 ~v~~i~dV~v~i~~~~--~~~~~~fg~~li~~~~~~~--------~~~~yts~~~v~~~f~~~s~~ykaA~~~fsQ~~~p 70 (331) |.+-+-.|.|.--... |....+...+.++++.... ....+++..+....|+.+..+..+...+|.++..+ T Consensus 1 m~~~~~Gv~v~e~~~g~~~i~~~~tav~g~vgta~~~~~~~~~~~~p~~its~~~~~~~~g~~gtl~~al~~~~~ngg~~ 80 (392) T protein:vir:18 1 MSDFHHGTKVIEINDGTRVISTVATAIVGMVWTASDADAETFPLNEPVLITNVQSAIAKAGKKGTLSASLQAIADQSKPV 80 (392) T ss_pred CCCCCCCeEEEEcCCCceeeeccCcceeEEEEeccCCCCcccccccceEeechHHHHhhcCCCcchHHHHHHhhcccCce Confidence 8887666766553222 3344555566666654321 23456777776666777777788888999887654 Q ss_pred ceEE-EEe--cc-c-----------------hhHHHHHHHhhcCcee---EEEEecCCHHHH-HHHHHHHHhcCcEEEEE Q lcl|Aclame:pro 71 DTVA-VIT--YE-D-----------------TKLLEAAEAYFLKSWH---FALLAEFKAADA-LALSNLIEEQKFKFAVF 125 (331) Q Consensus 71 ~~v~-v~~--~~-~-----------------~~~~~al~~~~~~~~~---f~~~~~~~~~~i-~alA~w~ea~~~~~~~~ 125 (331) ..+. +.. .. . .+...++.+.....+. .+.+.+.+...+ .++...++. -+.++++ T Consensus 81 ~~vv~v~~~~~~~~~~~t~~dliG~~~~~~~~tg~~al~~~~~~~~~~p~il~ap~~~~~~v~~~l~~~~~~-~~~~~~~ 159 (392) T protein:vir:18 81 TVVVRVAEGTGDDAEAQTTSNIIGGTDENGKYTGIKALLTAEAVTGVKPRILGVPGLDTQEVATALASVCIS-LRAFGYV 159 (392) T ss_pred EEEecccccccccccccchhhheecccccchhhhHHHHHhhhhhhceeehhcccCccchHHHHHHHHHHHhh-cCcEEEE Confidence 3321 110 00 0 0112223332222111 222333333333 344454443 3345555 Q ss_pred Ee---CChHHHHh---hcccceEEEEEe-----C-CCc----hhHHHHHHHHHhcCCccc---eeeeeeeccCCcCC--- Q lcl|Aclame:pro 126 QV---TAVADITP---LAKNTRTIAIVH-----S-KTG----EKLDAALIGNVASLPVGS---ATWKGRHGLAGITS--- 183 (331) Q Consensus 126 ~~---t~~~~~~~---~~~~~~t~~~~~-----~-~~~----~~~~aa~~g~~~~~~~G~---~t~~~k~~l~gv~~--- 183 (331) .. ++...+.. ..+..|..+.+- + ..+ ..|.+.++|..+..+... .....| .+.||.. T Consensus 160 d~~~~~~~~~a~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~AG~~a~~d~~~g~~~spaN~-~l~gi~~~~~ 238 (392) T protein:vir:18 160 SAWGCKTISEAMAYRENFSQRELMVIWPDFLAWDTTANATATAYATARALGLRAYIDQTIGWHKTLSNV-GVQGVTGISA 238 (392) T ss_pred ecCCCCCHHHHHHHHhhccCceEEEEeCceeeecccCCceEEechHHHHHHHHHhhhccCCceEccCCc-eeeceeecce Confidence 43 12222222 122334433321 1 111 134566666665544322 223344 3556532 Q ss_pred -----CCCCHHHHHHHHhCCCeEEEEEcCeeEEecCEEeCCc----eehhhHHHHHHHHHHHHHHHHHHhcCCCCCcCHH Q lcl|Aclame:pro 184 -----EELKVSEIDAIQKAGGMCYIEKAGIAQTSEGKTVSGE----FIDSIHGDDWIKATIETRLQKLLTETDKLTFDAR 254 (331) Q Consensus 184 -----~~~t~t~~~~l~~~~~n~y~~~~g~~~~~~G~~~~G~----~iD~~~~~dwl~~~iq~~l~~l~~~~~kipyt~~ 254 (331) ..++..|.+.|..+|+|+.....| ..+..+++++++ ||-+.+-.+|++..|+..+...+=. |.++. T Consensus 239 ~~~~~~~~~~~~~~~Ln~~gI~t~~~~~G-~~~wG~rT~~~d~~~~~i~~rR~~~~i~~~i~~~~~~~v~e----~n~~~ 313 (392) T protein:vir:18 239 SVFWDLQASGTDADLLNEAGVTTLVRKDG-FRFWGNRTCSDDPLFLFENYTRTAQVLADTMAEAHMWAVDK----PITAS 313 (392) T ss_pred ecccccCCCcchhhhhhhcCceEEEcCCC-EEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhccC----CCCHH Confidence 124678999999999999976555 466677888874 8999999999999999888765432 77899 Q ss_pred HHHHHHHHHHHHHHHHHhcCcccccccCCCcceEEEccchhcCCHHHHHhcccCCeEEEEEEcceEEEEEEEEEEeC Q lcl|Aclame:pro 255 GIALLQSELTTVLNEGFANGIIDSNDETGEPNFSITALQRSDLNDDDIAKRNYKGLSFRYKRSGAIHSVDVYGEVEV 331 (331) Q Consensus 255 G~~~i~~~v~~vl~~~~~~G~I~~g~~~~~~~~~v~~~~~~~~~~~dr~~R~~~~i~~~~~~aGaIh~v~i~~~v~~ 331 (331) -...|+..++.-|++.+++|.|.. |++++.. ...+++++.++++. +.+.+.....+++|.+....+. T Consensus 314 ~~~~i~~~i~~~L~~l~~~gal~g--------~~v~~d~-~~nt~~~i~~G~~~-~~v~~~p~~p~e~I~~~~~~~~ 380 (392) T protein:vir:18 314 LIRDIVDGINAKFRELKSNGYIVD--------GECWFDE-ESNDKETLKAGKLY-IDYDYTPVPPLESLTLRQRITD 380 (392) T ss_pred HHHHHHHHHHHHHHHHHhcCcccc--------eEEEEec-CCCCHHHhhCCeEE-EEEEEEecCCcceEEEEEEEch Confidence 999999999999999999999963 6777753 67789999998885 8899999999999999999988 No 18 >protein:vir:103993 Length: 390 # NCBI annotation: phage tail sheath protein # Family: family:all:115 # MgeID: mge:1665 # MgeName: phi52237 # Cross-refs: genbank:acc:YP_293729;genbank:gi:72537699;genbank:GeneID:3608123 Probab=98.96 E-value=5.1e-09 Score=66.06 Aligned_cols=315 Identities=11% Similarity=0.022 Sum_probs=193.6 Q ss_pred CCCcee-eEEEEEeecc--cccccccceeEEEEccCCc--------ceEEEechhhhccCCCCChHHHHHHHHHHccCCC Q lcl|Aclame:pro 1 MVETIT-DVRVHISVLY--PSPRIGLGRPAIFVKGTAM--------GYKEYTTLEELKDTFADNTEVYAKAKAVFLQKDR 69 (331) Q Consensus 1 ~v~~i~-dV~v~i~~~~--~~~~~~fg~~li~~~~~~~--------~~~~yts~~~v~~~f~~~s~~ykaA~~~fsQ~~~ 69 (331) |.++.. .|.|.--... +........+.+++..... .....++..+....|+.+..++.+...+|.|+.. T Consensus 1 M~~~~~~Gv~v~e~~~g~~~i~~~~tav~g~vg~a~~ad~~~~pln~pv~i~s~~~~~~~~g~~gtL~~al~~~~~~gg~ 80 (390) T protein:vir:10 1 MPQDYHHGVRVIEINEGGRPIRSVSTAVLGVVCTAADADASAFPLNTPVLLTNVVAALGKAGKKGTLRRTLDAIGKQTKP 80 (390) T ss_pred CcccccCCeEEEEcCCCcccccccCcceeEEEEcccCcCccccccccceEeccHHHHHhhcCCCceehhhhhhhccccCc Confidence 888775 4665443222 3345567777777754332 1234667777777888888889999999999876 Q ss_pred cceEE-EEeccch-----------------hHHHHHHHhhcCc---eeEEEEecCCHHHHHHHHHHHHhcCcEEEEEEeC Q lcl|Aclame:pro 70 PDTVA-VITYEDT-----------------KLLEAAEAYFLKS---WHFALLAEFKAADALALSNLIEEQKFKFAVFQVT 128 (331) Q Consensus 70 p~~v~-v~~~~~~-----------------~~~~al~~~~~~~---~~f~~~~~~~~~~i~alA~w~ea~~~~~~~~~~t 128 (331) +..+. +...... +...++....... ---++....+...+.+....+..+-+.++++... T Consensus 81 ~~~vv~v~~~~~~~~~~~~~ig~~~~~~~~tg~~al~~~~~~~~~~p~il~ap~~~~~~v~~~l~~~a~~~~~~aivD~p 160 (390) T protein:vir:10 81 LTVVVRVAEGKDADETTSNVIGTVTPDGKYTGIKALLAAQGALGVKPRILAAPGLDTQPVAAALAATAQSLRAMAYVSAS 160 (390) T ss_pred eEEEEEecccccccccccccccccccccccchhhhhhhhhhhhcceehhhcccccchHHHHHHHHHhhcccceEEEEecC Confidence 54331 1111111 0111121111110 0111222233333433333333334455555432 Q ss_pred ---ChHHHHh---hcccceEEEEEe------CCCc----hhHHHHHHHHHhcCCccc---eeeeeeeccCCcCC------ Q lcl|Aclame:pro 129 ---AVADITP---LAKNTRTIAIVH------SKTG----EKLDAALIGNVASLPVGS---ATWKGRHGLAGITS------ 183 (331) Q Consensus 129 ---~~~~~~~---~~~~~~t~~~~~------~~~~----~~~~aa~~g~~~~~~~G~---~t~~~k~~l~gv~~------ 183 (331) +...+.. ..+..|....|. +... ..+.+.++|.++..+.-. .....| .+.|+.. T Consensus 161 ~~~t~~~a~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~Agl~a~~D~~~g~~~spaN~-~l~gi~~~~~~~~ 239 (390) T protein:vir:10 161 GCKTKEEAAAYRKQFGQREIMVIWPDWLGWDDTTNSTAVIPAPAIAAGLRAKIDNDIGWHKTISNV-VVNGVSGISADVS 239 (390) T ss_pred CCCCHHHHHHHhhccCCceEEEEcCceEeecccCCcccccchHHHHHHHHHHhhcCCCcEECcCCc-eeeceeecceecc Confidence 2222222 122334443331 1111 134566666665554322 222334 2555543 Q ss_pred --CCCCHHHHHHHHhCCCeEEEEEcCeeEEecCEEeCCc----eehhhHHHHHHHHHHHHHHHHHHhcCCCCCcCHHHHH Q lcl|Aclame:pro 184 --EELKVSEIDAIQKAGGMCYIEKAGIAQTSEGKTVSGE----FIDSIHGDDWIKATIETRLQKLLTETDKLTFDARGIA 257 (331) Q Consensus 184 --~~~t~t~~~~l~~~~~n~y~~~~g~~~~~~G~~~~G~----~iD~~~~~dwl~~~iq~~l~~l~~~~~kipyt~~G~~ 257 (331) ...+..|.+.|..+|++...+..|. .+..+++++++ ||-+.+-.+|+...|+..+...+= + |.++.-.. T Consensus 240 ~~~~~~~~~~~~ln~~gi~t~~~~~G~-~~wG~rT~s~d~~~~~i~~rR~~~~i~~~i~~~~~~~v~---e-~n~~~~~~ 314 (390) T protein:vir:10 240 WDLQDPATDAGYLNEHEVTTLVNRNGF-RFWGERTCSDDPKFAFENYTRTAQVAGDSIAEAQMPVVD---G-PLNPSLAR 314 (390) T ss_pred cccccccchhhhhhhcCcEEEEcCCCE-EEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhcc---C-CCCHHHHH Confidence 2345667888999999999876664 55677787774 899999999999999988875443 2 77899999 Q ss_pred HHHHHHHHHHHHHHhcCcccccccCCCcceEEEccchhcCCHHHHHhcccCCeEEEEEEcceEEEEEEEEEEeC Q lcl|Aclame:pro 258 LLQSELTTVLNEGFANGIIDSNDETGEPNFSITALQRSDLNDDDIAKRNYKGLSFRYKRSGAIHSVDVYGEVEV 331 (331) Q Consensus 258 ~i~~~v~~vl~~~~~~G~I~~g~~~~~~~~~v~~~~~~~~~~~dr~~R~~~~i~~~~~~aGaIh~v~i~~~v~~ 331 (331) .|+..++.-|+..+++|.|. ||+|.+.. +..|++|+.+.++. +.+.+.....+++|.+....+. T Consensus 315 ~i~~~i~~~L~~l~~~g~l~--------g~~v~~d~-~~nt~~~i~~G~~~-~~v~~~p~~pae~I~~~~~~~~ 378 (390) T protein:vir:10 315 DIVESINGWFRQQVANGYLI--------GGSAWIDP-EPNTADILASGKAY-IDYDYTPVPPLENLVLRQRITD 378 (390) T ss_pred HHHHHHHHHHHHHHhCCcee--------eeEEEEcc-CCCCHHHhhCCeEE-EEEEEEecCCcceEEEEEEEch Confidence 99999999999999999996 47788763 57899999888885 8899999999999999999998 No 19 >protein:vir:78206 Length: 390 # NCBI annotation: gp33, phage tail sheath protein # Family: family:all:115 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111183;genbank:gi:134288694;genbank:GeneID:4960666 Probab=98.96 E-value=5.1e-09 Score=66.06 Aligned_cols=315 Identities=11% Similarity=0.022 Sum_probs=193.6 Q ss_pred CCCcee-eEEEEEeecc--cccccccceeEEEEccCCc--------ceEEEechhhhccCCCCChHHHHHHHHHHccCCC Q lcl|Aclame:pro 1 MVETIT-DVRVHISVLY--PSPRIGLGRPAIFVKGTAM--------GYKEYTTLEELKDTFADNTEVYAKAKAVFLQKDR 69 (331) Q Consensus 1 ~v~~i~-dV~v~i~~~~--~~~~~~fg~~li~~~~~~~--------~~~~yts~~~v~~~f~~~s~~ykaA~~~fsQ~~~ 69 (331) |.++.. .|.|.--... +........+.+++..... .....++..+....|+.+..++.+...+|.|+.. T Consensus 1 M~~~~~~Gv~v~e~~~g~~~i~~~~tav~g~vg~a~~ad~~~~pln~pv~i~s~~~~~~~~g~~gtL~~al~~~~~~gg~ 80 (390) T protein:vir:78 1 MPQDYHHGVRVIEINEGGRPIRSVSTAVLGVVCTAADADASAFPLNTPVLLTNVVAALGKAGKKGTLRRTLDAIGKQTKP 80 (390) T ss_pred CcccccCCeEEEEcCCCcccccccCcceeEEEEcccCcCccccccccceEeccHHHHHhhcCCCceehhhhhhhccccCc Confidence 888775 4665443222 3345567777777754332 1234667777777888888889999999999876 Q ss_pred cceEE-EEeccch-----------------hHHHHHHHhhcCc---eeEEEEecCCHHHHHHHHHHHHhcCcEEEEEEeC Q lcl|Aclame:pro 70 PDTVA-VITYEDT-----------------KLLEAAEAYFLKS---WHFALLAEFKAADALALSNLIEEQKFKFAVFQVT 128 (331) Q Consensus 70 p~~v~-v~~~~~~-----------------~~~~al~~~~~~~---~~f~~~~~~~~~~i~alA~w~ea~~~~~~~~~~t 128 (331) +..+. +...... +...++....... ---++....+...+.+....+..+-+.++++... T Consensus 81 ~~~vv~v~~~~~~~~~~~~~ig~~~~~~~~tg~~al~~~~~~~~~~p~il~ap~~~~~~v~~~l~~~a~~~~~~aivD~p 160 (390) T protein:vir:78 81 LTVVVRVAEGKDADETTSNVIGTVTPDGKYTGIKALLAAQGALGVKPRILAAPGLDTQPVAAALAATAQSLRAMAYVSAS 160 (390) T ss_pred eEEEEEecccccccccccccccccccccccchhhhhhhhhhhhcceehhhcccccchHHHHHHHHHhhcccceEEEEecC Confidence 54331 1111111 0111121111110 0111222233333433333333334455555432 Q ss_pred ---ChHHHHh---hcccceEEEEEe------CCCc----hhHHHHHHHHHhcCCccc---eeeeeeeccCCcCC------ Q lcl|Aclame:pro 129 ---AVADITP---LAKNTRTIAIVH------SKTG----EKLDAALIGNVASLPVGS---ATWKGRHGLAGITS------ 183 (331) Q Consensus 129 ---~~~~~~~---~~~~~~t~~~~~------~~~~----~~~~aa~~g~~~~~~~G~---~t~~~k~~l~gv~~------ 183 (331) +...+.. ..+..|....|. +... ..+.+.++|.++..+.-. .....| .+.|+.. T Consensus 161 ~~~t~~~a~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~Agl~a~~D~~~g~~~spaN~-~l~gi~~~~~~~~ 239 (390) T protein:vir:78 161 GCKTKEEAAAYRKQFGQREIMVIWPDWLGWDDTTNSTAVIPAPAIAAGLRAKIDNDIGWHKTISNV-VVNGVSGISADVS 239 (390) T ss_pred CCCCHHHHHHHhhccCCceEEEEcCceEeecccCCcccccchHHHHHHHHHHhhcCCCcEECcCCc-eeeceeecceecc Confidence 2222222 122334443331 1111 134566666665554322 222334 2555543 Q ss_pred --CCCCHHHHHHHHhCCCeEEEEEcCeeEEecCEEeCCc----eehhhHHHHHHHHHHHHHHHHHHhcCCCCCcCHHHHH Q lcl|Aclame:pro 184 --EELKVSEIDAIQKAGGMCYIEKAGIAQTSEGKTVSGE----FIDSIHGDDWIKATIETRLQKLLTETDKLTFDARGIA 257 (331) Q Consensus 184 --~~~t~t~~~~l~~~~~n~y~~~~g~~~~~~G~~~~G~----~iD~~~~~dwl~~~iq~~l~~l~~~~~kipyt~~G~~ 257 (331) ...+..|.+.|..+|++...+..|. .+..+++++++ ||-+.+-.+|+...|+..+...+= + |.++.-.. T Consensus 240 ~~~~~~~~~~~~ln~~gi~t~~~~~G~-~~wG~rT~s~d~~~~~i~~rR~~~~i~~~i~~~~~~~v~---e-~n~~~~~~ 314 (390) T protein:vir:78 240 WDLQDPATDAGYLNEHEVTTLVNRNGF-RFWGERTCSDDPKFAFENYTRTAQVAGDSIAEAQMPVVD---G-PLNPSLAR 314 (390) T ss_pred cccccccchhhhhhhcCcEEEEcCCCE-EEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhcc---C-CCCHHHHH Confidence 2345667888999999999876664 55677787774 899999999999999988875443 2 77899999 Q ss_pred HHHHHHHHHHHHHHhcCcccccccCCCcceEEEccchhcCCHHHHHhcccCCeEEEEEEcceEEEEEEEEEEeC Q lcl|Aclame:pro 258 LLQSELTTVLNEGFANGIIDSNDETGEPNFSITALQRSDLNDDDIAKRNYKGLSFRYKRSGAIHSVDVYGEVEV 331 (331) Q Consensus 258 ~i~~~v~~vl~~~~~~G~I~~g~~~~~~~~~v~~~~~~~~~~~dr~~R~~~~i~~~~~~aGaIh~v~i~~~v~~ 331 (331) .|+..++.-|+..+++|.|. ||+|.+.. +..|++|+.+.++. +.+.+.....+++|.+....+. T Consensus 315 ~i~~~i~~~L~~l~~~g~l~--------g~~v~~d~-~~nt~~~i~~G~~~-~~v~~~p~~pae~I~~~~~~~~ 378 (390) T protein:vir:78 315 DIVESINGWFRQQVANGYLI--------GGSAWIDP-EPNTADILASGKAY-IDYDYTPVPPLENLVLRQRITD 378 (390) T ss_pred HHHHHHHHHHHHHHhCCcee--------eeEEEEcc-CCCCHHHhhCCeEE-EEEEEEecCCcceEEEEEEEch Confidence 99999999999999999996 47788763 57899999888885 8899999999999999999998 No 20 >protein:vir:98553 Length: 395 # NCBI annotation: gp21 # Family: family:all:115 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958076;genbank:gi:41057373;genbank:GeneID:2744224 Probab=98.95 E-value=5.7e-09 Score=65.77 Aligned_cols=314 Identities=12% Similarity=0.056 Sum_probs=195.8 Q ss_pred CCCceeeEEEEEeeccc--ccccccceeEEEEccCCc--------ceEEEechhhhccCCCCChHHHHHHHHHHccCCCc Q lcl|Aclame:pro 1 MVETITDVRVHISVLYP--SPRIGLGRPAIFVKGTAM--------GYKEYTTLEELKDTFADNTEVYAKAKAVFLQKDRP 70 (331) Q Consensus 1 ~v~~i~dV~v~i~~~~~--~~~~~fg~~li~~~~~~~--------~~~~yts~~~v~~~f~~~s~~ykaA~~~fsQ~~~p 70 (331) |.+-.-.|.|.-....+ +..+..+.+.+++..... .....++..+....|+.+..++.+...+|.++..+ T Consensus 1 m~~~~~GV~v~e~~~g~~~v~~v~tav~~~vgta~~~~~~~~p~~~pv~v~s~~~~~~~~g~~~tl~~al~~~~~~~~~~ 80 (395) T protein:vir:98 1 MSDFHHGTQVIEINDGTRVISTVATAVVGMVCTASDADATLFPLNEPVLITNVQSAIAKAGKKGTLAASLQAIADQSKPV 80 (395) T ss_pred CCCCCCCeEEEEcCCCcccccccCcceEEEEeeccCCCccccccccceEeechHHhHhhcccccchhhHHHHHhhccCce Confidence 88877777775533332 344567777777755321 12446777777777888878888888999998665 Q ss_pred ceEEEEeccc------------------------hhHHHHHHHhhcC---ceeEEEEecC-CHHHHHHHHHHHHhcCcEE Q lcl|Aclame:pro 71 DTVAVITYED------------------------TKLLEAAEAYFLK---SWHFALLAEF-KAADALALSNLIEEQKFKF 122 (331) Q Consensus 71 ~~v~v~~~~~------------------------~~~~~al~~~~~~---~~~f~~~~~~-~~~~i~alA~w~ea~~~~~ 122 (331) ..+....... .+.+.++.+.... .-..+...+. +.....++..-++.- +.| T Consensus 81 ~~vv~~~~~~~~~~~~~~a~~~~~i~g~~~~~~~~Tgl~al~~~~~~~~~~p~il~ap~~~~~~v~~al~~~~~~~-~~~ 159 (395) T protein:vir:98 81 TVVVRVEDGTGDDEEAALAQTVSNIIGGTDENGKYTGIKALLTAQAVTGVKPRILGVPGLDTKEVAVALASAAIKL-RAF 159 (395) T ss_pred EEEeeccccccccccccccccccccccccccccchhHHHHHhhhhhhhccchhhcccccccccHHHHHHHHHhhhc-CcE Confidence 5433221100 0112233332211 1112222222 233344555555433 345 Q ss_pred EEEEeC---ChHHHHhh---cccceEEEEEe-----CC-Cc----hhHHHHHHHHHhcCCccceee---eeeeccCCcCC Q lcl|Aclame:pro 123 AVFQVT---AVADITPL---AKNTRTIAIVH-----SK-TG----EKLDAALIGNVASLPVGSATW---KGRHGLAGITS 183 (331) Q Consensus 123 ~~~~~t---~~~~~~~~---~~~~~t~~~~~-----~~-~~----~~~~aa~~g~~~~~~~G~~t~---~~k~~l~gv~~ 183 (331) +++... +...+... .+..|.++.|- +. .+ ..+++.++|.++..+.-.--| ..| .+.|+.. T Consensus 160 ~~~d~p~~~t~~~a~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~AG~~a~~d~~~g~~~spaN~-~i~gi~~ 238 (395) T protein:vir:98 160 AYVSAWGCKTISEAMEYRKNFSQRELMVIWPDFLAWDTVKNTTATAYATARALGLRAYIDQTVGWHKTLSNV-GVQGVTG 238 (395) T ss_pred EEEEcCCCCCHHHHHHHHhccCCceEEEEecceeEecccCCceeeechHHHHHHHHHHhhcccCcEeccCCc-eeecccc Confidence 555432 22332222 22334443331 11 11 125566666665444321122 233 2444422 Q ss_pred --------CCCCHHHHHHHHhCCCeEEEEEcCeeEEecCEEeCCc----eehhhHHHHHHHHHHHHHHHHHHhcCCCCCc Q lcl|Aclame:pro 184 --------EELKVSEIDAIQKAGGMCYIEKAGIAQTSEGKTVSGE----FIDSIHGDDWIKATIETRLQKLLTETDKLTF 251 (331) Q Consensus 184 --------~~~t~t~~~~l~~~~~n~y~~~~g~~~~~~G~~~~G~----~iD~~~~~dwl~~~iq~~l~~l~~~~~kipy 251 (331) ..++.+|++.|.++|+|++....| -.+..+++++++ ||-+.+-.+|+...|+..+...+-. |. T Consensus 239 ~~~~~~~~~~~~~~~~~~Ln~~gI~~~~~~~G-~~~wG~rT~s~d~~~~~i~~rR~~~~i~~~i~~~~~~~v~e----~~ 313 (395) T protein:vir:98 239 ISASVFWDLQASGTDADLLNEAGVTTLVRKDG-FRFWGNRTCSDDPLFLFENYTRTAQVLADTMAEAHMWAVDK----PI 313 (395) T ss_pred cceecccccCCCcchHHhhhhcCcEEEEcCCC-EEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhccC----CC Confidence 124688999999999999976555 456677788773 8999999999999999888765432 66 Q ss_pred CHHHHHHHHHHHHHHHHHHHhcCcccccccCCCcceEEEccchhcCCHHHHHhcccCCeEEEEEEcceEEEEEEEEEEeC Q lcl|Aclame:pro 252 DARGIALLQSELTTVLNEGFANGIIDSNDETGEPNFSITALQRSDLNDDDIAKRNYKGLSFRYKRSGAIHSVDVYGEVEV 331 (331) Q Consensus 252 t~~G~~~i~~~v~~vl~~~~~~G~I~~g~~~~~~~~~v~~~~~~~~~~~dr~~R~~~~i~~~~~~aGaIh~v~i~~~v~~ 331 (331) ++.=...|+..++.-|++.+++|.|. ||++++.. +..|++++.++++. +.+.+.....+++|++....+. T Consensus 314 ~~~~~~~i~~~i~~~L~~l~~~g~l~--------g~~v~~d~-~~nt~~~i~~G~~~-~~i~~~p~~p~e~I~~~~~~~~ 383 (395) T protein:vir:98 314 TATLIRDIVDGINAKFRELKSNGYIV--------EGKCWFDE-ESNDKETLKAGKLY-IDYDYTPVPPLESLTLRQRITD 383 (395) T ss_pred CHHHHHHHHHHHHHHHHHHHhCCcee--------ceEEEEec-CCCCHHHhhCCeEE-EEEEEEecCCcceEEEEEEEch Confidence 78878999999999999999999996 36787754 56789999999885 8999999999999999999999 No 21 >protein:vir:96586 Length: 587 # NCBI annotation: ORF011 # Family: family:all:2449 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238553;genbank:gi:66391266;genbank:GeneID:5130368 Probab=98.94 E-value=4.1e-09 Score=66.54 Aligned_cols=309 Identities=10% Similarity=0.074 Sum_probs=157.8 Q ss_pred CCCceeeEEEEEeecccccccccceeEEEE---ccCC-cceEEEech-----------------hhhccCCCC------- Q lcl|Aclame:pro 1 MVETITDVRVHISVLYPSPRIGLGRPAIFV---KGTA-MGYKEYTTL-----------------EELKDTFAD------- 52 (331) Q Consensus 1 ~v~~i~dV~v~i~~~~~~~~~~fg~~li~~---~~~~-~~~~~yts~-----------------~~v~~~f~~------- 52 (331) ++.++-.+ ..++..-|.. .++..-.-.. ...+ .....|... ..+...+.. T Consensus 206 ~~~~~~~~-~~~tAky~g~-~~n~~~v~v~d~~~~~~~k~~~~y~~t~~~di~~~~~~~~~~~~~~~~~~~~~~~~~~v~ 283 (587) T protein:vir:96 206 IITDINEL-PDFEAKLSPF-GDKNLESRKLDEATDVDIKGKAVYVKAVFGDIENQTQYNQYVKFEQLPEQASEPSDVEVH 283 (587) T ss_pred hhhhhccc-cceEEEeecc-cCceeEEEeeccccccccceEEEeehhhhhhhhhhhccccceeeccccchhhhhhccccc Confidence 33333222 1122111211 1111111000 0000 001111110 000000000 Q ss_pred ---ChHHHHHHHHHHccCCCcceEEEEecc---chhHHHHHHHhhcCceeEEEEecCCHHHHHHHHHHHHhc----CcEE Q lcl|Aclame:pro 53 ---NTEVYAKAKAVFLQKDRPDTVAVITYE---DTKLLEAAEAYFLKSWHFALLAEFKAADALALSNLIEEQ----KFKF 122 (331) Q Consensus 53 ---~s~~ykaA~~~fsQ~~~p~~v~v~~~~---~~~~~~al~~~~~~~~~f~~~~~~~~~~i~alA~w~ea~----~~~~ 122 (331) ....++.....-.-.+.+..-..++.+ +.+..+++.+....+|+++.+...+.+.+.++..|++.. .+++ T Consensus 284 ~~~~~~~~~~~~~~~~~~~~~~~aLtGG~dG~~~~~y~~~l~ale~~~~~~i~~~t~d~ai~~~l~a~vk~~r~~gk~~~ 363 (587) T protein:vir:96 284 AETESATVTATSKPKAIEPFELTKLSGGTNGEPPTSWSAKLEKFKNEGGYYIVPLTDRQSVHSEVATFVKNRSDAGEPMR 363 (587) T ss_pred ccccceeeeecccccccccccceeeecCCCCCCcccHHHHHHHHhhCCcEEEEecCCCHHHHHHHHHHHHHHHhCCCeEE Confidence 000000000000000001110111111 224556666666677777777666667677899999542 3344 Q ss_pred EEEE---eCChHHHH---hhcccceEEEEEeCC------------CchhHHHHHHHHHhcCCcc-ceeeeeeeccCCcCC Q lcl|Aclame:pro 123 AVFQ---VTAVADIT---PLAKNTRTIAIVHSK------------TGEKLDAALIGNVASLPVG-SATWKGRHGLAGITS 183 (331) Q Consensus 123 ~~~~---~t~~~~~~---~~~~~~~t~~~~~~~------------~~~~~~aa~~g~~~~~~~G-~~t~~~k~~l~gv~~ 183 (331) .++. ..+.+... ...++.|...+.+.- +..+.++.++|..++..+. +.|++=.. +.++. T Consensus 364 aVlg~~~~~~~~~~~~~a~~~n~e~vi~v~~~~~~~~~~~~~~~~~~~~~aa~vAG~~Ag~~~~~S~T~~~~~-~~~v~- 441 (587) T protein:vir:96 364 AIVGGGTSETKEKLFGRQAILNNPRVALVANSGKFVMGNGRILQAPAYMVASAVAGLVSGLDIGESITFKPLF-VNSLD- 441 (587) T ss_pred EEecCCCCCCHHHHHHHHhhcCCCcEEEEecceEEecCCCceeeechhhHHHHHHHHHhcCccccCccceeee-ccccc- Confidence 4442 23333322 334556655443321 1112344455555665553 45543221 23443 Q ss_pred CCCCHHHHHHHHhCCCeEEEEEcCee----EEecCEEeCC-----c--eehhhHHHHHHHHHHHHHHHHHHhcCCCCCcC Q lcl|Aclame:pro 184 EELKVSEIDAIQKAGGMCYIEKAGIA----QTSEGKTVSG-----E--FIDSIHGDDWIKATIETRLQKLLTETDKLTFD 252 (331) Q Consensus 184 ~~~t~t~~~~l~~~~~n~y~~~~g~~----~~~~G~~~~G-----~--~iD~~~~~dwl~~~iq~~l~~l~~~~~kipyt 252 (331) ..++.+|++.+.++|..++....+.. ..-++.+.-. . .|-.++-.|.+...++..+-+.++. | |-+ T Consensus 442 ~~~t~~e~~~~i~~G~~~l~~~~~~~~~v~~~vnsitT~t~~~~~~~~~i~virv~D~i~~di~~~~~~~yiG--k-~nn 518 (587) T protein:vir:96 442 KVYESEELDELNENGIITIEFVRNRMTTMFRIVDDVTTFPDKNDPVKSEMALGEANDFLVSELKILLEEQYIG--T-RTI 518 (587) T ss_pred ccCCHHHHHHHHhCCeEEEEEecCCcEEEEEeeccceecCCCCCchhhhhhhHHHHHHHHHHHHHHHHhcCCc--c-ccC Confidence 36999999999999999987765432 1223444322 1 4789999999999999888777764 5 568 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCcccccccCCCcceEEEccchhcCCHHHHHhcccCCeEEEEEEcceEEEEEEEEEEeC Q lcl|Aclame:pro 253 ARGIALLQSELTTVLNEGFANGIIDSNDETGEPNFSITALQRSDLNDDDIAKRNYKGLSFRYKRSGAIHSVDVYGEVEV 331 (331) Q Consensus 253 ~~G~~~i~~~v~~vl~~~~~~G~I~~g~~~~~~~~~v~~~~~~~~~~~dr~~R~~~~i~~~~~~aGaIh~v~i~~~v~~ 331 (331) +.|...|++.+++.|++..+.|.|..... +..++.. .+| +. -+++.+++.-++++|.++.++.- T Consensus 519 ~~~r~~v~~~i~~~L~~l~~~g~I~~~~~---~dv~v~~-------~~D---~~--~v~~~v~Pv~~mekIy~tv~~~~ 582 (587) T protein:vir:96 519 NTSASQIKDFVQSYLGRKKRDNEIQDFPP---EDVQVII-------EGN---EA--RISLTIFPIRALKKISVSLVYRQ 582 (587) T ss_pred HHHHHHHHHHHHHHHHHHHhCCcccCCCc---cceEEEe-------cCC---EE--EEEEEEEEcccceEEEEEEEEEe Confidence 89999999999999999999999963211 1122221 112 22 37888999999999999998866 No 22 >protein:vir:5711 Length: 396 # NCBI annotation: gpFI # Family: family:all:115 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839870;genbank:gi:30065725;genbank:GeneID:1260618 Probab=98.94 E-value=7.9e-09 Score=65.02 Aligned_cols=313 Identities=12% Similarity=0.095 Sum_probs=194.7 Q ss_pred CCCceeeEEEEEeecc--cccccccceeEEEEccCCc--------ceEEEechhhhccCCCCChHHHHHHHHHHccCCCc Q lcl|Aclame:pro 1 MVETITDVRVHISVLY--PSPRIGLGRPAIFVKGTAM--------GYKEYTTLEELKDTFADNTEVYAKAKAVFLQKDRP 70 (331) Q Consensus 1 ~v~~i~dV~v~i~~~~--~~~~~~fg~~li~~~~~~~--------~~~~yts~~~v~~~f~~~s~~ykaA~~~fsQ~~~p 70 (331) |.+-.-.|.|.--... ++..+....+.+++..... .....++..+....++.+..++.+...+|.++... T Consensus 1 m~~~~~GV~v~e~~~g~~~i~~v~tav~~~vg~a~~~d~~~~~~~~pv~i~s~~~~~~~~g~~~tl~~al~~~~~~~~~~ 80 (396) T protein:vir:57 1 MSDYHHGVQVLEINDGTRVISTVSTAIVGMVCTASDADAETFPLNKPVLITNVQSAIAKAGKKGTLAASLQAIADQSKPV 80 (396) T ss_pred CCCCCCceEEEEcCCCcccccccCCceEEEEEeccCCCcccccCccCeEeecchhhhhhcccccchHHHHHHhhhcCCce Confidence 8887777777553332 3445667777777765432 23446677776667777777888888888886554 Q ss_pred ceEEEEe---------------------ccc---hhHHHHHHHhhcCcee---EEEEecCCH-HHHHHHHHHHHhcCcEE Q lcl|Aclame:pro 71 DTVAVIT---------------------YED---TKLLEAAEAYFLKSWH---FALLAEFKA-ADALALSNLIEEQKFKF 122 (331) Q Consensus 71 ~~v~v~~---------------------~~~---~~~~~al~~~~~~~~~---f~~~~~~~~-~~i~alA~w~ea~~~~~ 122 (331) ..+.-.. ... .+...++.+....... .+.....+. .-..+|...++. -..+ T Consensus 81 ~~vv~~~~~~~~~~~~~~a~t~~~iiG~~~~~~~~tgl~al~~~~~~~~~~p~i~~ap~~~~~~v~~al~~~~~~-~~~~ 159 (396) T protein:vir:57 81 TVVVRVEDGTGDDEETKLAQTVSNIIGTTDENGQYTGLKALMGAESVTGVKPRILGVPGLDTKEVAVALASVCQE-LNAF 159 (396) T ss_pred eEeeeccccccccccccccccceeeeeeccccccchhhhhhhhcccceeEEeccccCcccchhHHHHHHHHHhhh-CceE Confidence 3321100 000 0112333332221111 111122222 223445555543 3455 Q ss_pred EEEEeC---ChHHHHhh---cccceEEEEEeCC-------Cc----hhHHHHHHHHHhcCCccc---eeeeeeeccCCcC Q lcl|Aclame:pro 123 AVFQVT---AVADITPL---AKNTRTIAIVHSK-------TG----EKLDAALIGNVASLPVGS---ATWKGRHGLAGIT 182 (331) Q Consensus 123 ~~~~~t---~~~~~~~~---~~~~~t~~~~~~~-------~~----~~~~aa~~g~~~~~~~G~---~t~~~k~~l~gv~ 182 (331) .+.... +...+... .+..|..+. ++. .+ ..+++.++|.++..+.-+ .....+ .+.||. T Consensus 160 ~~~d~p~~~~~~~~~~~~~~~~s~~~~~~-~p~~~~~d~~~~~~~~~p~s~~~Ag~~a~~d~~~g~~~spaN~-~l~gi~ 237 (396) T protein:vir:57 160 GYISAWGCKTISEVKAYRQNFSQRELMVI-WPDFLAWDTVTSTTATAYATARALGLRAKIDQEQGWHKTLSNV-GVNGVT 237 (396) T ss_pred EEEcCCCCCCHHHHHHHHhccCCceEEEE-cceeeeecccCCceeEEehhHHHHHHHHHhhhccCcEeccCCc-eecccc Confidence 555432 22333222 233444433 321 11 134566666665444321 223344 366654 Q ss_pred CC--------CCCHHHHHHHHhCCCeEEEEEcCeeEEecCEEeCCc----eehhhHHHHHHHHHHHHHHHHHHhcCCCCC Q lcl|Aclame:pro 183 SE--------ELKVSEIDAIQKAGGMCYIEKAGIAQTSEGKTVSGE----FIDSIHGDDWIKATIETRLQKLLTETDKLT 250 (331) Q Consensus 183 ~~--------~~t~t~~~~l~~~~~n~y~~~~g~~~~~~G~~~~G~----~iD~~~~~dwl~~~iq~~l~~l~~~~~kip 250 (331) .. .++.+|.+.|..+|+|+.....| ..+..+++++++ ||-+.+-.+|++..|+..+...+=. | T Consensus 238 ~~~~~~~~~~~~~~~~~~~Ln~~gi~t~~~~~G-~~~wG~rT~~~d~~~~~i~vrR~~~~i~~~i~~~~~~~v~e----~ 312 (396) T protein:vir:57 238 GISASVFWDLQKPGTDADLLNEAGVTTLVRRDG-FRFWGNRTCSDDPLFLFESYTRTAQVLADTMAEAHMWAIDK----P 312 (396) T ss_pred ccceecccccCCcchhhhhhhhcCcEEEEcCCC-EEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhccC----C Confidence 32 24678999999999999977555 456677888874 8999999999999999888765432 6 Q ss_pred cCHHHHHHHHHHHHHHHHHHHhcCcccccccCCCcceEEEccchhcCCHHHHHhcccCCeEEEEEEcceEEEEEEEEEEe Q lcl|Aclame:pro 251 FDARGIALLQSELTTVLNEGFANGIIDSNDETGEPNFSITALQRSDLNDDDIAKRNYKGLSFRYKRSGAIHSVDVYGEVE 330 (331) Q Consensus 251 yt~~G~~~i~~~v~~vl~~~~~~G~I~~g~~~~~~~~~v~~~~~~~~~~~dr~~R~~~~i~~~~~~aGaIh~v~i~~~v~ 330 (331) .++.=...|+..++..|+..+++|.|. +|++.+.. +..+++++.+.++. +.+.+.....+++|.+....+ T Consensus 313 n~~~~~~~i~~~i~~~l~~l~~~gal~--------g~~v~~d~-~~n~~~~i~~G~~~-~~v~~~p~~p~e~I~~~~~~~ 382 (396) T protein:vir:57 313 ITATLIRDIIDGINAKFRELKNNGYIV--------DGTCWFSE-ESNDAETLKAGKLY-IDYDYTPVPPLENLTLRQRIT 382 (396) T ss_pred CCHHHHHHHHHHHHHHHHHHHhCCcee--------ceEEEEec-CCCCHHHhhCCeEE-EEEEEEecCCcceEEEEEEEc Confidence 688888999999999999999999996 36777754 56789999888885 899999999999999999999 Q ss_pred C Q lcl|Aclame:pro 331 V 331 (331) Q Consensus 331 ~ 331 (331) . T Consensus 383 ~ 383 (396) T protein:vir:57 383 S 383 (396) T ss_pred h Confidence 8 No 23 >protein:vir:79181 Length: 390 # NCBI annotation: gp30, phage tail sheath protein # Family: family:all:115 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111061;genbank:gi:134288772;genbank:GeneID:4960700 Probab=98.93 E-value=1e-08 Score=64.37 Aligned_cols=313 Identities=11% Similarity=0.029 Sum_probs=190.0 Q ss_pred CCCcee-eEEEEEe--ecccccccccceeEEEEccCCc--------ceEEEechhhhccCCCCChHHHHHHHHHHccCCC Q lcl|Aclame:pro 1 MVETIT-DVRVHIS--VLYPSPRIGLGRPAIFVKGTAM--------GYKEYTTLEELKDTFADNTEVYAKAKAVFLQKDR 69 (331) Q Consensus 1 ~v~~i~-dV~v~i~--~~~~~~~~~fg~~li~~~~~~~--------~~~~yts~~~v~~~f~~~s~~ykaA~~~fsQ~~~ 69 (331) |.++.- .|.|.-. ...+...+....+.++++.... .....++..+....|+.+..++.+...+|.|+.. T Consensus 1 M~~~~~~Gv~v~e~~~~~~~i~~~~tav~~~vg~a~dad~~~~p~n~pv~its~~~~~~~~g~~~tL~~al~~~~~~~~~ 80 (390) T protein:vir:79 1 MPQDYHHGVRVIEINEGGRPIRSVSTAVLGVVCTAADADASAFPLNTPVLLTNVVAALGKAGKKGTLRRTLDAIGKQTKP 80 (390) T ss_pred CccccCCCeEEEEcCCCcccccccCCceeEEEEecCCCCccccccccceEeecHHHHHHhcCCCccchhhhhhhcccccc Confidence 888775 3666442 2233445556677777755432 1234566666666677777788888889998776 Q ss_pred cceEEEEecc-ch-----------------hHHHHHHHhhcCce---eEEEEecCCHHHHH-HHHHHHHhcCcEEEEEEe Q lcl|Aclame:pro 70 PDTVAVITYE-DT-----------------KLLEAAEAYFLKSW---HFALLAEFKAADAL-ALSNLIEEQKFKFAVFQV 127 (331) Q Consensus 70 p~~v~v~~~~-~~-----------------~~~~al~~~~~~~~---~f~~~~~~~~~~i~-alA~w~ea~~~~~~~~~~ 127 (331) +..+...... .. +.+.++........ --+.....+...+. ++..-++ +-+.+.++.. T Consensus 81 ~~~vv~v~~~~~~~~~~~~~ig~~~~~~~~tgl~al~~~~~~~~~~p~il~ap~~~~~~v~~~l~~~a~-~~~~~ai~D~ 159 (390) T protein:vir:79 81 LTVVVRVAEGKDADETTSNVIGTVTPDGKYTGIKALLAAQGALGVKPRILAAPGLDTQPVAAALAATAQ-SLRAMAYVSA 159 (390) T ss_pred eEEEEeeccccccccccceeeecccccccchhhhhhhhhhhhhccccccccCCcccchHHHHHHHHhhh-hcceEEEEEc Confidence 5432221111 00 11122222211100 01111122222233 3333343 3445665543 Q ss_pred C---ChHHHHhh---cccceEEEEEeCC-------Cc----hhHHHHHHHHHhcCCccceeee---eeeccCCcCC---- Q lcl|Aclame:pro 128 T---AVADITPL---AKNTRTIAIVHSK-------TG----EKLDAALIGNVASLPVGSATWK---GRHGLAGITS---- 183 (331) Q Consensus 128 t---~~~~~~~~---~~~~~t~~~~~~~-------~~----~~~~aa~~g~~~~~~~G~~t~~---~k~~l~gv~~---- 183 (331) . ....+... .+..|.++.+ +. .. ..+.+.++|.++..+.-.-.|+ .| .+.|+.. T Consensus 160 p~~~t~~~a~~~~~~~~s~~~~~~~-p~~~~~d~~~~~~~~~p~s~~~Ag~~a~~D~~~g~~~spsN~-~i~gi~~~~~~ 237 (390) T protein:vir:79 160 SGCKTKEEAAAYRRQFGQREIMVIW-PDWLGWDDTTNSTAVIPAPAIAAGLRAKIDNDIGWHKTISNV-VVNGVSGISAD 237 (390) T ss_pred cCCCCHHHHHHHhcCCCCceEEEEc-CceeecccccCceeEeehHHHHHHHHHhhhccCCcEEccCCc-eeeccceeeee Confidence 2 22222211 2334444433 21 11 1356667776665553222332 33 2445422 Q ss_pred ----CCCCHHHHHHHHhCCCeEEEEEcCeeEEecCEEeCCc----eehhhHHHHHHHHHHHHHHHHHHhcCCCCCcCHHH Q lcl|Aclame:pro 184 ----EELKVSEIDAIQKAGGMCYIEKAGIAQTSEGKTVSGE----FIDSIHGDDWIKATIETRLQKLLTETDKLTFDARG 255 (331) Q Consensus 184 ----~~~t~t~~~~l~~~~~n~y~~~~g~~~~~~G~~~~G~----~iD~~~~~dwl~~~iq~~l~~l~~~~~kipyt~~G 255 (331) ...+..|.+.|..+|++......| ..+..+++++++ ||-+.+-.+|+...|+..+...+=. |.++.= T Consensus 238 ~~~~~~~~~~~a~~Ln~~gi~t~~~~~G-~~~wG~rT~~~d~~~~~i~vrR~~~~i~~~i~~~~~~~v~e----~~~~~~ 312 (390) T protein:vir:79 238 VSWDLQDPATDAGYLNEHEVTTLVNRNG-FRFWGERTCSDDPKFAFENYTRTAQVAADSIAEAQMPVVDG----PLNPSL 312 (390) T ss_pred ccccccccchhhhhhhhcCcEEEEcCCC-EEEEeccccCCCcccceeeehhhHHHHHHHHHHHHHHhccC----CCCHHH Confidence 224566788899999999876555 345677777774 8999999999999999888754432 668888 Q ss_pred HHHHHHHHHHHHHHHHhcCcccccccCCCcceEEEccchhcCCHHHHHhcccCCeEEEEEEcceEEEEEEEEEEeC Q lcl|Aclame:pro 256 IALLQSELTTVLNEGFANGIIDSNDETGEPNFSITALQRSDLNDDDIAKRNYKGLSFRYKRSGAIHSVDVYGEVEV 331 (331) Q Consensus 256 ~~~i~~~v~~vl~~~~~~G~I~~g~~~~~~~~~v~~~~~~~~~~~dr~~R~~~~i~~~~~~aGaIh~v~i~~~v~~ 331 (331) ...|+..++..|+..+++|.|. ||++.+.. +.-|++|+.+.++. +.+.+.....+++|++....+. T Consensus 313 ~~~i~~~i~~~L~~l~~~gal~--------g~~v~~d~-~~nt~~~i~~G~~~-~~i~~~p~~p~e~i~~~~~~~~ 378 (390) T protein:vir:79 313 ARDIVESINGWFRQQVANGYLI--------GGSAWIDP-EPNTADILASGKAY-IDYDYTPVPPLENLVLRQRITD 378 (390) T ss_pred HHHHHHHHHHHHHHHHhCCcee--------eeEEEEec-CCCCHHHhhCCEEE-EEEEEEecCCcceEEEEEEEch Confidence 8999999999999999999997 36788764 56789999988885 8899999999999999999998 No 24 >protein:vir:2035 Length: 396 # NCBI annotation: gpFI # Family: family:all:115 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046778;genbank:gi:9630349;genbank:GeneID:1261516 Probab=98.93 E-value=1.1e-08 Score=64.21 Aligned_cols=313 Identities=13% Similarity=0.113 Sum_probs=194.2 Q ss_pred CCCceeeEEEEEeecccc--cccccceeEEEEccCCc--------ceEEEechhhhccCCCCChHHHHHHHHHHccCCCc Q lcl|Aclame:pro 1 MVETITDVRVHISVLYPS--PRIGLGRPAIFVKGTAM--------GYKEYTTLEELKDTFADNTEVYAKAKAVFLQKDRP 70 (331) Q Consensus 1 ~v~~i~dV~v~i~~~~~~--~~~~fg~~li~~~~~~~--------~~~~yts~~~v~~~f~~~s~~ykaA~~~fsQ~~~p 70 (331) |.+-.=.|.|.-....+. ..+....+.++++.... ....+++..+....|+.+..++.+...+|.++... T Consensus 1 m~~~~~GV~v~e~~~g~~~i~~v~tav~~~vg~a~~a~~~~~~l~~pvlvts~~~~~~~~g~~~tL~~al~~~~~ngg~~ 80 (396) T protein:vir:20 1 MSDYHHGVQVLEINEGTRVISTVSTAIVGMVCTASDADAETFPLNKPVLITNVQSAISKAGKKGTLAASLQAIADQSKPV 80 (396) T ss_pred CCCCCCCeEEEEcCCCcceeeecCCceeEEEeeeccCCCccccCccCEEeechHHHHhhcccccchhhhhhhhhccCcee Confidence 888766677655433333 33445666666654321 23457788777777888888888888888886544 Q ss_pred ceEEEEec---cc---------------------hhHHHHHHHhhcCc-ee--EEEEec-CCHHHHHHHHHHHHhcCcEE Q lcl|Aclame:pro 71 DTVAVITY---ED---------------------TKLLEAAEAYFLKS-WH--FALLAE-FKAADALALSNLIEEQKFKF 122 (331) Q Consensus 71 ~~v~v~~~---~~---------------------~~~~~al~~~~~~~-~~--f~~~~~-~~~~~i~alA~w~ea~~~~~ 122 (331) ..+..... .. .+...++....... +. .+.... .+...+.+|...++... .+ T Consensus 81 ~~v~~~~~~~~~~~~~~~a~t~~~~~~~~~~~~~~tg~~al~~~~~~~~~~p~i~~ap~~~~~~v~~al~~~~~~~~-~~ 159 (396) T protein:vir:20 81 TVVMRVEDGTGDDEETKLAQTVSNIIGTTDENGQYTGLKAMLAAESVTGVKPRILGVPGLDTKEVAVALASVCQKLR-AF 159 (396) T ss_pred EEEEeccccccccccccccccccccccccccccccchhhhhhhhccccccchhhhhhhhhccHHHHHHHHHHHhcCC-cE Confidence 32211100 00 01112222221110 00 111111 22344556666665543 44 Q ss_pred EEEEeC---ChHHHHhhc---ccceEEEEEeCC-------Cc----hhHHHHHHHHHhcCCc--c-ceeeeeeeccCCcC Q lcl|Aclame:pro 123 AVFQVT---AVADITPLA---KNTRTIAIVHSK-------TG----EKLDAALIGNVASLPV--G-SATWKGRHGLAGIT 182 (331) Q Consensus 123 ~~~~~t---~~~~~~~~~---~~~~t~~~~~~~-------~~----~~~~aa~~g~~~~~~~--G-~~t~~~k~~l~gv~ 182 (331) +++... +.+.+.... +..+. .+|++. .. ..+++.++|.++..+. | -.....+ .+.||. T Consensus 160 ~~iD~p~~~~~~~a~~~r~~~~s~~~-~~~~P~~~~~d~~~~~~~~~p~s~~~Ag~~a~~d~~~g~~~spaN~-~l~gi~ 237 (396) T protein:vir:20 160 GYISAWGCKTISEVKAYRQNFSQREL-MVIWPDFLAWDTVTSTTATAYATARALGLRAKIDQEQGWHKTLSNV-GVNGVT 237 (396) T ss_pred EEEecCCCCCHHHHHHHhhCCCCceE-EEEcCccccccCcCCcceeechhHHHHHHHHHhhhhcCcEeccCCc-eeccce Confidence 444332 333333222 23333 344331 11 1346666666654442 2 1223333 355653 Q ss_pred CC--------CCCHHHHHHHHhCCCeEEEEEcCeeEEecCEEeCCc----eehhhHHHHHHHHHHHHHHHHHHhcCCCCC Q lcl|Aclame:pro 183 SE--------ELKVSEIDAIQKAGGMCYIEKAGIAQTSEGKTVSGE----FIDSIHGDDWIKATIETRLQKLLTETDKLT 250 (331) Q Consensus 183 ~~--------~~t~t~~~~l~~~~~n~y~~~~g~~~~~~G~~~~G~----~iD~~~~~dwl~~~iq~~l~~l~~~~~kip 250 (331) .. .++.+|.+.|..+|+|+.....| -.+..+++++++ ||-+.+-.+|+...|+..+...+-. | T Consensus 238 ~~~~~~~~~~~~~~~~~~~Ln~~gi~~~~~~~G-~~~wG~rT~s~d~~~~~i~~rR~~~~i~~~~~~~~~~~v~e----~ 312 (396) T protein:vir:20 238 GISASVFWDLQESGTDADLLNESGVTTLIRRDG-FRFWGNRTCSDDPLFLFENYTRTAQVVADTMAEAHMWAVDK----P 312 (396) T ss_pred ecceecccccCCCcchhhhhhhcCcEEEEcCCC-EEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhccC----C Confidence 32 35678999999999999977555 456677788874 8999999999999999888765432 6 Q ss_pred cCHHHHHHHHHHHHHHHHHHHhcCcccccccCCCcceEEEccchhcCCHHHHHhcccCCeEEEEEEcceEEEEEEEEEEe Q lcl|Aclame:pro 251 FDARGIALLQSELTTVLNEGFANGIIDSNDETGEPNFSITALQRSDLNDDDIAKRNYKGLSFRYKRSGAIHSVDVYGEVE 330 (331) Q Consensus 251 yt~~G~~~i~~~v~~vl~~~~~~G~I~~g~~~~~~~~~v~~~~~~~~~~~dr~~R~~~~i~~~~~~aGaIh~v~i~~~v~ 330 (331) .++.=...|+..++..|+..++.|.|. ||++++. .++.|++++.++++. +.+.+.....++.|.+....+ T Consensus 313 ~~~~~~~~i~~~i~~~L~~l~~~G~l~--------g~~v~~d-~~~nt~~~i~~G~~~-~~i~~~p~~p~e~i~~~~~~~ 382 (396) T protein:vir:20 313 ITATLIRDIVDGINAKFRELKTNGYIV--------DATCWFS-EESNDAETLKAGKLY-IDYDYTPVPPLENLTLRQRIT 382 (396) T ss_pred CCHHHHHHHHHHHHHHHHHHHhCccee--------ceEEEEe-cCCCCHHHhhCCEEE-EEEEEEecCCcceEEEEEEEc Confidence 688888999999999999999999996 3678875 467889999999885 899999999999999999999 Q ss_pred C Q lcl|Aclame:pro 331 V 331 (331) Q Consensus 331 ~ 331 (331) . T Consensus 383 ~ 383 (396) T protein:vir:20 383 D 383 (396) T ss_pred h Confidence 8 No 25 >protein:vir:107310 Length: 581 # NCBI annotation: gp123 # Family: family:all:1196 # MgeID: mge:1550 # MgeName: Catera # Cross-refs: genbank:acc:YP_656131;genbank:gi:109393333;genbank:GeneID:4156791 Probab=98.89 E-value=1.9e-08 Score=62.87 Aligned_cols=310 Identities=14% Similarity=0.079 Sum_probs=162.0 Q ss_pred CCCce-e-------eEEEEE---e-------------ecc--cccccccceeEEEE---ccCC-cceEEEechhhhccCC Q lcl|Aclame:pro 1 MVETI-T-------DVRVHI---S-------------VLY--PSPRIGLGRPAIFV---KGTA-MGYKEYTTLEELKDTF 50 (331) Q Consensus 1 ~v~~i-~-------dV~v~i---~-------------~~~--~~~~~~fg~~li~~---~~~~-~~~~~yts~~~v~~~f 50 (331) +++-. - |+.+.. . ... .....+-+..+=+. .+++ .....|..-+++..-+ T Consensus 175 l~~~~~~~~~~~gsd~~~~~~~~~~~~~~~~~~D~~t~~~~~~g~~~~~~~v~~~~~~~~d~~~~~~v~~~~~~~~~~~~ 254 (581) T protein:vir:10 175 VVNPNSGQVYVLGTDYVVTRVNAGEDGEANTRDDLYTIQRVVDGGHIDPGDIVQLSYRYTDPNYHEVIRFTDPDDIQDFY 254 (581) T ss_pred ccccccCcceeccccceeeecccCccccccccccceeeeeeecccccccceEEEEEEEeecCCcceeEEeecCcchhhhh Confidence 10000 0 000000 0 000 00000000000000 0111 1122233323222111 Q ss_pred --------CCChHHHHHHHHHHccCCCcceEEEEecc-------chhHHHHHHHhhcCceeEEEEecCCHHHH-HHHHHH Q lcl|Aclame:pro 51 --------ADNTEVYAKAKAVFLQKDRPDTVAVITYE-------DTKLLEAAEAYFLKSWHFALLAEFKAADA-LALSNL 114 (331) Q Consensus 51 --------~~~s~~ykaA~~~fsQ~~~p~~v~v~~~~-------~~~~~~al~~~~~~~~~f~~~~~~~~~~i-~alA~w 114 (331) ...++..+.++..+..++ ......+.+ ..+..++|.++.......+++...+.+.+ .++..| T Consensus 255 ~~~~~~~g~~~~~~t~~~~~~~tn~~--~~~l~~gvd~~g~tvt~~dy~~Al~ale~~~~~~ivv~~t~~~~v~a~l~ah 332 (581) T protein:vir:10 255 GPAFDEAGNVQSEITLCAQLAITNGA--STILACAVDPEGDTVTMGDYQNALNKFRDEDEIAIIVAGTGAQPIQALVQQH 332 (581) T ss_pred hhhhhccCccccchhhhheeeeeccc--ceeEEeeccCCCCccchHHHHHHHHHHhcCCceEEEEeCCCCHHHHHHHHHH Confidence 122334444443333333 222222222 23566778777766655444555555554 457788 Q ss_pred HHhc----CcEEE---EEE---eCChHHH---HhhcccceEEEEEeC------C--------CchhHHHHHHHHHhcCCc Q lcl|Aclame:pro 115 IEEQ----KFKFA---VFQ---VTAVADI---TPLAKNTRTIAIVHS------K--------TGEKLDAALIGNVASLPV 167 (331) Q Consensus 115 ~ea~----~~~~~---~~~---~t~~~~~---~~~~~~~~t~~~~~~------~--------~~~~~~aa~~g~~~~~~~ 167 (331) ++.. +.+.. +.. ..+.... ....++.|....+.. + +.++.++.++|..+...+ T Consensus 333 v~~~s~~~~~~ravigV~g~~~~~~~~~~~~~a~~~n~~Rvvlv~p~~~~~~g~~~~~~v~lp~y~~AA~vAGl~a~~~~ 412 (581) T protein:vir:10 333 VSAQSNNKYERRAILGMDGSVTPVPSATRIANAQSIKDQRVALISPSSFVYYAPELNREVVLGGQFMAAAVAGKSVSAIA 412 (581) T ss_pred HHHHHhccCCcEEEEEecCCCCCccHHHHHHhhccCCCceEEEEecCceeecCcccCceeccchhhHHHHHHHHhhcccc Confidence 7653 12222 211 1122222 222345565544421 1 112334555565666655 Q ss_pred cceeeeeeeccCCcCC--CCCCHHHHHHHHhCCCeEEEEEcCe-eEEecCEEeC---C--ceehhhHHHHHHHHHHHHHH Q lcl|Aclame:pro 168 GSATWKGRHGLAGITS--EELKVSEIDAIQKAGGMCYIEKAGI-AQTSEGKTVS---G--EFIDSIHGDDWIKATIETRL 239 (331) Q Consensus 168 G~~t~~~k~~l~gv~~--~~~t~t~~~~l~~~~~n~y~~~~g~-~~~~~G~~~~---G--~~iD~~~~~dwl~~~iq~~l 239 (331) . ..+-|| .++|+.. ..++.+|++.|.++|++.+....+. ..+-+|.+.- + +.|-.++-.|.+...+++.+ T Consensus 413 ~-~slT~~-~i~gi~~l~~~~s~~e~e~ll~~Gv~~l~~~~~~~v~Iv~gItT~~s~~~~~~i~~iR~~D~v~~~ir~~~ 490 (581) T protein:vir:10 413 A-MPLTRK-VIRGFSGPAEVQRDGEKSRESSEGLMVIEKTPRNLVHVRHGVTTDPTSLHTREWNIIGQQDVMVYRIRDYL 490 (581) T ss_pred c-cCcccc-cccccccccccCCHHHHHHHHhCCeEEEEEecCCeEEEEeeeecCCCCCcceeeeeehhhhHHHHHHHHHh Confidence 3 455566 4787754 4689999999999999999876544 4455666442 2 46889999999999999998 Q ss_pred H-HHHhcCCCCCcCHHHHHHHHHHHHHHHHHHHhcCcccccccCCCcceEEEccchhcCCHHHHHhcccCCeEEEEEEcc Q lcl|Aclame:pro 240 Q-KLLTETDKLTFDARGIALLQSELTTVLNEGFANGIIDSNDETGEPNFSITALQRSDLNDDDIAKRNYKGLSFRYKRSG 318 (331) Q Consensus 240 ~-~l~~~~~kipyt~~G~~~i~~~v~~vl~~~~~~G~I~~g~~~~~~~~~v~~~~~~~~~~~dr~~R~~~~i~~~~~~aG 318 (331) . ..|+. + |-++.|...|++.++++|++.+++|.|...... . .++.++..- .--+.|.+...- T Consensus 491 ~~~~fIG--~-~n~~~~r~~ik~~i~~~L~~l~~~g~I~~~~~~-------~------~~~~~~~~d-~v~V~i~v~Pv~ 553 (581) T protein:vir:10 491 DADGLIG--M-PIYDTTIVQVKASAEAALVWLVDNNIIRGYRNL-------K------ARQIERQPD-VIEVRYEWRPAY 553 (581) T ss_pred hhhcCCC--c-ccCHHHHHHHHHHHHHHHHHHHhcCcccCCccc-------e------eeeeecCCC-EEEEEEEEEecc Confidence 6 44663 3 668899999999999999999999999753210 0 122222221 223778888899 Q ss_pred eEEEEEEEEEEeC Q lcl|Aclame:pro 319 AIHSVDVYGEVEV 331 (331) Q Consensus 319 aIh~v~i~~~v~~ 331 (331) +|++|.++..+.= T Consensus 554 ~i~~I~vti~~~p 566 (581) T protein:vir:10 554 PLNYIVVRYSIAP 566 (581) T ss_pred cceEEEEEEEEec Confidence 9999888766655 No 26 >protein:vir:95741 Length: 587 # NCBI annotation: ORF011 # Family: family:all:2449 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240910;genbank:gi:66394960;genbank:GeneID:5132682 Probab=98.87 E-value=9.3e-09 Score=64.61 Aligned_cols=309 Identities=10% Similarity=0.094 Sum_probs=156.0 Q ss_pred CCCceee---EE----------EEEeecccccccccceeE--EEEccCCc------ceEEEechhhhc------cCCCCC Q lcl|Aclame:pro 1 MVETITD---VR----------VHISVLYPSPRIGLGRPA--IFVKGTAM------GYKEYTTLEELK------DTFADN 53 (331) Q Consensus 1 ~v~~i~d---V~----------v~i~~~~~~~~~~fg~~l--i~~~~~~~------~~~~yts~~~v~------~~f~~~ 53 (331) .+++|-+ ++ ++++. .+.. ..+..-- .+...... .++.+.....+. ...... T Consensus 206 ~~~~in~~~~~tAky~g~~~~~i~~~~-~~~~-~~~~v~~~~~~v~a~~~d~~~~~~~~~~v~~~~~~g~~~~~~~~~~~ 283 (587) T protein:vir:95 206 IITDINQLPDFEAKLSPFGDKNLESSK-LDKI-ENANIKDKAVYVKAVFGDLEKQTAYNGIVSFEQLNAEGEVPSNVEVE 283 (587) T ss_pred HHHhhccccceEEEEecccCceeEEee-cCcc-cccceehhhhhhhhhhcceeeeeeceeeeeeecccccceeccchhhh Confidence 1111111 11 11110 0000 0000000 00000000 000111100000 000001 Q ss_pred hHHHHHHHHHHccCC----CcceEEEEecc---chhHHHHHHHhhcCceeEEEEecCCHHHHHHHHHHHHhc----CcEE Q lcl|Aclame:pro 54 TEVYAKAKAVFLQKD----RPDTVAVITYE---DTKLLEAAEAYFLKSWHFALLAEFKAADALALSNLIEEQ----KFKF 122 (331) Q Consensus 54 s~~ykaA~~~fsQ~~----~p~~v~v~~~~---~~~~~~al~~~~~~~~~f~~~~~~~~~~i~alA~w~ea~----~~~~ 122 (331) ...+.++...+.-.. .+.+...++.+ +.+..+++.++...+|.++.+...+.+.+.++..|++.. .+++ T Consensus 284 ~~~~~a~~~~~~~~~~~a~~~~t~LtGG~dG~~~~~y~~~l~ale~~~~~~i~~~t~d~~v~a~l~a~vk~~~~~g~~~~ 363 (587) T protein:vir:95 284 AGEESATVTATSPIKTIEPFELTKLKGGTNGEPPATWADKLDKFAHEGGYYIVPLSSKQSVHAEVASFVKERSDAGEPMR 363 (587) T ss_pred hcccchheeccccccceeccceeeeecCCCCCCcccHHHHHHHHHhCCcEEEEecCCCHHHHHHHHHHHHHHHhCCCcEE Confidence 111111111111110 01111222222 234566677766677777776666666667899998643 2344 Q ss_pred EEEE---eCChHHHH---hhcccceEEEEEeC------CC--chh---H-HHHHHHHHhcCCcc-ceeeeeeeccCCcCC Q lcl|Aclame:pro 123 AVFQ---VTAVADIT---PLAKNTRTIAIVHS------KT--GEK---L-DAALIGNVASLPVG-SATWKGRHGLAGITS 183 (331) Q Consensus 123 ~~~~---~t~~~~~~---~~~~~~~t~~~~~~------~~--~~~---~-~aa~~g~~~~~~~G-~~t~~~k~~l~gv~~ 183 (331) .++. ..+.+... ...++.|...+.+. +. ..+ . ++.++|..++..+. +.|++=.. +.++. T Consensus 364 aVvg~~~~~~~~~~~~~a~~~n~ervi~v~~~~~~~~~dg~~~~~~~~~~aa~vAGl~Ag~~~~~SlT~~~i~-~~~v~- 441 (587) T protein:vir:95 364 AIVGGGFNESKEQLFGRQESLSNPRVSLVANSGTFVMDDGRKNHVPAYMVAVALGGLASGLEIGESITFKPLR-VSSLD- 441 (587) T ss_pred EEEcCCCCCCHHHHHHHHhhcCCCcEEEecccceEecCCCceeeechHHHHHHHHHHHhcCchhcCccceeee-ccccc- Confidence 4442 23333322 23345665544322 11 112 2 33444555555553 45543222 34554 Q ss_pred CCCCHHHHHHHHhCCCeEEEEEcCe----eEEecCEEeC----C-ce--ehhhHHHHHHHHHHHHHHHHHHhcCCCCCcC Q lcl|Aclame:pro 184 EELKVSEIDAIQKAGGMCYIEKAGI----AQTSEGKTVS----G-EF--IDSIHGDDWIKATIETRLQKLLTETDKLTFD 252 (331) Q Consensus 184 ~~~t~t~~~~l~~~~~n~y~~~~g~----~~~~~G~~~~----G-~~--iD~~~~~dwl~~~iq~~l~~l~~~~~kipyt 252 (331) ..++.+|++.+..+|.+.+....+. -..-+|.+.- + .| |-.++-.|.+...++..+-+.++. | |-+ T Consensus 442 ~~~t~~e~e~ai~~Gvl~l~~~~~~~~~~vriv~~itT~t~~~d~~~~~i~viRv~D~i~~dir~~~~~~~iG--k-~nn 518 (587) T protein:vir:95 442 QIYESIDLDELNENGIISIEFVRNRTNTFFRIVDDVTTFNDKSDPVKAEMAVGEANDFLVSELKVQLEDQFIG--T-RTI 518 (587) T ss_pred ccCCHHHHHHHHhCCeEEEEEecCCcceEEEEeecceeccCCCCcchhhhhhhhhHHHHHHHHHHHHHhhCCc--c-ccc Confidence 3689999999999999988765442 1233444431 1 25 779999999999999988877764 4 568 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCcccccccCCCcceEEEccchhcCCHHHHHhcccCCeEEEEEEcceEEEEEEEEEEeC Q lcl|Aclame:pro 253 ARGIALLQSELTTVLNEGFANGIIDSNDETGEPNFSITALQRSDLNDDDIAKRNYKGLSFRYKRSGAIHSVDVYGEVEV 331 (331) Q Consensus 253 ~~G~~~i~~~v~~vl~~~~~~G~I~~g~~~~~~~~~v~~~~~~~~~~~dr~~R~~~~i~~~~~~aGaIh~v~i~~~v~~ 331 (331) +.|...|++.+++.|++..+.|.|..... .+ ..+. ...| | --++|.+++.-++++|.++.++.- T Consensus 519 ~~~r~~v~~~i~~~L~~l~~~gaI~~~~~-~d--v~v~-------~~~d---~--~~v~~~v~Pv~~mekI~vt~~~~~ 582 (587) T protein:vir:95 519 NTSASIIKDFIQSYLGRKKRDNEIQDFPA-ED--VQVI-------VEGN---E--ARISMTVYPIRSFKKISVSLVYKQ 582 (587) T ss_pred hHHHHHHHHHHHHHHHHHHhCCcccCCCc-cc--eEEE-------ecCC---E--EEEEEEEEEcccceEEEEEEEEee Confidence 89999999999999999999999964321 11 1111 0111 2 237888899999999999988866 No 27 >protein:vir:10336 Length: 386 # NCBI annotation: ORF39 # Family: family:all:115 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758931;genbank:gi:27311206;genbank:GeneID:956116 Probab=98.86 E-value=1.8e-08 Score=63.03 Aligned_cols=312 Identities=10% Similarity=0.001 Sum_probs=185.7 Q ss_pred CCCceee-EEEEEe--ecccccccccceeEEEEccCCc--------ceEEEechhhhccCCCCChHHHHHHHHHHccCCC Q lcl|Aclame:pro 1 MVETITD-VRVHIS--VLYPSPRIGLGRPAIFVKGTAM--------GYKEYTTLEELKDTFADNTEVYAKAKAVFLQKDR 69 (331) Q Consensus 1 ~v~~i~d-V~v~i~--~~~~~~~~~fg~~li~~~~~~~--------~~~~yts~~~v~~~f~~~s~~ykaA~~~fsQ~~~ 69 (331) |.++... |.|.-. ...|...+....+.++++.... .....++..+...-++.+..++.+...+|.++.. T Consensus 1 M~~~~~~Gv~v~ev~~~~~~i~~v~tav~~~vg~a~~a~~~~~~~~~pv~i~s~~~~~~~~g~~~tl~~a~~~~~~~gg~ 80 (386) T protein:vir:10 1 MAEQYLHGAEVVEIDNGARPIRTAQSGVIGLVGTAPDADATAFPLNTPVLIAGSRREAAKLGAGGTLPQAIDGIFDQTGA 80 (386) T ss_pred CccccCCCeEEEEcCCCcccccccCcceeEEEEecCCCCCcccccccceEecchHHHHhhcCCCcchhHHHHHHhccCce Confidence 8877644 544332 2223444566777777754321 1234555555555677777889999999998865 Q ss_pred cceEEEEeccch-------------------hHHHHHHHhhcCceeEEEEec----CCHHHHHHHHHHHHhcCc--EEEE Q lcl|Aclame:pro 70 PDTVAVITYEDT-------------------KLLEAAEAYFLKSWHFALLAE----FKAADALALSNLIEEQKF--KFAV 124 (331) Q Consensus 70 p~~v~v~~~~~~-------------------~~~~al~~~~~~~~~f~~~~~----~~~~~i~alA~w~ea~~~--~~~~ 124 (331) +..+........ +...++...... +..... ....+..++++-+++... +.+. T Consensus 81 ~~~vv~~~~~~~~~~t~~~~ig~~~~~t~~~tgl~~l~~~~~~---~~~~p~i~~ap~~~~~~~v~~~l~~~~~~~~~~~ 157 (386) T protein:vir:10 81 VVVVIRVDEGVDSAATQSNVIGKVDADTEQYTGILALLSAENT---VKVQPRILIAPGFSNQKAVADQLVSVADTAAWLC 157 (386) T ss_pred eEEEeeccccccccccchhhhcccccccchhhhhHHhhhhccc---ccccccccccccccchhHHHHHHHHhhcceEEEE Confidence 543322211100 111112211111 111111 111222333333333222 2222 Q ss_pred -EEe--CChHHHH---hhcccceEEEEEe------CC--Cch--hHHHHHHHHHhcCCc--c-ceeeeeeeccCCcCCC- Q lcl|Aclame:pro 125 -FQV--TAVADIT---PLAKNTRTIAIVH------SK--TGE--KLDAALIGNVASLPV--G-SATWKGRHGLAGITSE- 184 (331) Q Consensus 125 -~~~--t~~~~~~---~~~~~~~t~~~~~------~~--~~~--~~~aa~~g~~~~~~~--G-~~t~~~k~~l~gv~~~- 184 (331) ... +..+.+. ...+..+...++- +. ... .+++.++|.++..+. | -.....| .+.||..- T Consensus 158 ~~~~~~~~~~~a~~~~~~~~s~~~~~~~p~~~v~~~~~~~~~~~p~s~~~ag~~a~~D~~~G~~~spaN~-~l~gv~~~~ 236 (386) T protein:vir:10 158 HSGWSNTTDAAAITYRELFGSRRCEVVDPWYKVWDVETSAHIIQPPSARHAGVMAKVHNTLGFWWSNSNQ-EILGIDGLC 236 (386) T ss_pred EeCCCCCchHHHHHhhhcccccceEEecCceeeeccccccceeechHHHHHHHHHHhhhcCCcEEccCCc-eeecccccc Confidence 211 1122211 1112333333221 11 111 345666666654442 3 1223334 35555321 Q ss_pred -------CCCHHHHHHHHhCCCeEEEEEcCeeEEecCEEeCCc----eehhhHHHHHHHHHHHHHHHHHHhcCCCCCcCH Q lcl|Aclame:pro 185 -------ELKVSEIDAIQKAGGMCYIEKAGIAQTSEGKTVSGE----FIDSIHGDDWIKATIETRLQKLLTETDKLTFDA 253 (331) Q Consensus 185 -------~~t~t~~~~l~~~~~n~y~~~~g~~~~~~G~~~~G~----~iD~~~~~dwl~~~iq~~l~~l~~~~~kipyt~ 253 (331) ..+..|.+.|.++|++....-.| ..+..+++++++ ||-+.+-.+|+...|+..+...+-. |.++ T Consensus 237 ~~~~~~~~~~~~~~~~l~~~gi~~~~~~~G-~~~wG~rT~~~d~~~~~i~vrR~~~~i~~~~~~~~~~~v~e----~~~~ 311 (386) T protein:vir:10 237 RPVDFKLDDPTCRANLLNAKEVTTTIQQNG-FRVWGDRTCSADSKWAFKNVVITNDMIADSLVRNHLWAVDR----NITK 311 (386) T ss_pred eecccccccCcchhhhhhhcCcEEEEcCCC-EEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhccC----CCCH Confidence 24688999999999999876544 455677777764 7889999999999999888765432 6688 Q ss_pred HHHHHHHHHHHHHHHHHHhcCcccccccCCCcceEEEccchhcCCHHHHHhcccCCeEEEEEEcceEEEEEEEEEEeC Q lcl|Aclame:pro 254 RGIALLQSELTTVLNEGFANGIIDSNDETGEPNFSITALQRSDLNDDDIAKRNYKGLSFRYKRSGAIHSVDVYGEVEV 331 (331) Q Consensus 254 ~G~~~i~~~v~~vl~~~~~~G~I~~g~~~~~~~~~v~~~~~~~~~~~dr~~R~~~~i~~~~~~aGaIh~v~i~~~v~~ 331 (331) .=...|+..++.-|+..+++|.|. ||+|.+. .+..|++|+.++++. +.+.+....-+++|.++...+. T Consensus 312 ~~~~~i~~~i~~~L~~l~~~g~l~--------g~~v~~d-~~~nt~~~~~~G~~~-~~i~~~p~~p~e~i~~~~~~~~ 379 (386) T protein:vir:10 312 TYVEDVTEGVNNYLRHLKNIGAIA--------GGECWVD-PELNSPDQIQQGKVY-FDYDFSAYAPAEHITFRSHMVN 379 (386) T ss_pred HHHHHHHHHHHHHHHHHHhCCcee--------eeEEEEc-ccCCCHHHhhCCeEE-EEEEEEecCCceeEEEEEEEeh Confidence 888999999999999999999996 4788887 567899999999886 8999999999999999999998 No 28 >protein:vir:79141 Length: 391 # NCBI annotation: major tail seath protein gpFI # Family: family:all:115 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165276;genbank:gi:145708101;genbank:GeneID:5247152 Probab=98.85 E-value=1.8e-08 Score=63.08 Aligned_cols=314 Identities=11% Similarity=0.039 Sum_probs=192.6 Q ss_pred CCCceee-EEEEEe--ecccccccccceeEEEEccCC--------cceEEEechhhhccCCCCChHHHHHHHHHHccCCC Q lcl|Aclame:pro 1 MVETITD-VRVHIS--VLYPSPRIGLGRPAIFVKGTA--------MGYKEYTTLEELKDTFADNTEVYAKAKAVFLQKDR 69 (331) Q Consensus 1 ~v~~i~d-V~v~i~--~~~~~~~~~fg~~li~~~~~~--------~~~~~yts~~~v~~~f~~~s~~ykaA~~~fsQ~~~ 69 (331) |-++... |.|.-. ...++..+....+.++++... .....+++..+....|+....++.+...+|.++.. T Consensus 1 M~~~~~pGv~v~e~~~~~~~i~~~~tav~~~vg~a~~a~~~~~p~n~pv~iss~~~~~~~~g~~gtl~~al~~~~~~gg~ 80 (391) T protein:vir:79 1 MPTDYHHGVRVVELNDGTRPIRTIETAVAGIVCTADDADAATFPLDTPVLLTNPQAYIGKAGDKGTLAHTLDAITDQTNP 80 (391) T ss_pred CCCCCCCCeEEEECCCCcccccccCCceEEEEeecccccccccccccCEEeccHHHHHHhcCCccccchhhhhhhccccc Confidence 7776533 555432 222344566777777775432 12345788777777788877778888889988866 Q ss_pred cceEEEEeccc---------------h---hHHHHHHHhhcCcee---EEEEecC-CHHHHHHHHHHHHhcCcEEEEEEe Q lcl|Aclame:pro 70 PDTVAVITYED---------------T---KLLEAAEAYFLKSWH---FALLAEF-KAADALALSNLIEEQKFKFAVFQV 127 (331) Q Consensus 70 p~~v~v~~~~~---------------~---~~~~al~~~~~~~~~---f~~~~~~-~~~~i~alA~w~ea~~~~~~~~~~ 127 (331) +..+....... . +...++.+....... .+..... ......++...++... .++++.. T Consensus 81 ~~~vv~~~~~~~~~~~~~~~~g~~~~~~~~tGl~~l~~~~~~~~~~p~~l~~p~~~~~~v~~al~~~~~~~~-~~ai~d~ 159 (391) T protein:vir:79 81 LTVVVRVAGGASEAETTSNLIGTTNAAGRYTGMKALLTARNRFGVAPRILAVPGLDSLPVGTELVTIAQKLR-AFAYLSA 159 (391) T ss_pred ceeeeccccccccccccccccccccchhhhHHHhhhhhhhhhhcccchhhcCCccchhHHHHHHHHHHhhcC-cEEEEEC Confidence 54432221110 0 111222222111000 1111111 2333445556665443 3444433 Q ss_pred ---CChHHHHhh---cccceEEEEEe------CCCc----hhHHHHHHHHHhcCCccceeee---eeeccCCcCC----- Q lcl|Aclame:pro 128 ---TAVADITPL---AKNTRTIAIVH------SKTG----EKLDAALIGNVASLPVGSATWK---GRHGLAGITS----- 183 (331) Q Consensus 128 ---t~~~~~~~~---~~~~~t~~~~~------~~~~----~~~~aa~~g~~~~~~~G~~t~~---~k~~l~gv~~----- 183 (331) ++...+... .+..+.++.|- +... ..+.+.++|.++..+.-.-.|+ .+ .+.|+.. T Consensus 160 p~~~t~~~a~~~~~~~~s~~~a~~~P~~~~~d~~~~~~~~~p~s~~~AG~~a~~D~~~g~~~spaN~-~l~gi~~~~~~~ 238 (391) T protein:vir:79 160 YGCQTKEEAVAYRSNFGQREAMVMWPDFVGWDTAANAETTLWATARAVGLRAKIDNDTGWHKTLSNV-AVGGVTGLSRDV 238 (391) T ss_pred CCCCCHHHHHHHHhccCCceeEEecceeeeecCcCCceeeechHHHHHHHHHHhhhcccceeccCCc-eehhhhcccccc Confidence 222332221 23334443332 1111 1356777777665553222332 23 3555532 Q ss_pred ---CCCCHHHHHHHHhCCCeEEEEEcCeeEEecCEEeCCc----eehhhHHHHHHHHHHHHHHHHHHhcCCCCCcCHHHH Q lcl|Aclame:pro 184 ---EELKVSEIDAIQKAGGMCYIEKAGIAQTSEGKTVSGE----FIDSIHGDDWIKATIETRLQKLLTETDKLTFDARGI 256 (331) Q Consensus 184 ---~~~t~t~~~~l~~~~~n~y~~~~g~~~~~~G~~~~G~----~iD~~~~~dwl~~~iq~~l~~l~~~~~kipyt~~G~ 256 (331) ...+.+|.+.|..+|+|++..-.| ..+..+++++++ ||-+.+-.+|+...|+..+...+-. |.++.-. T Consensus 239 ~~~~~~~~~~~~~Ln~~~I~t~~~~~G-~~~wG~rT~~~d~~~~~i~~rR~~~~i~~~i~~~~~~~v~e----pn~~~~~ 313 (391) T protein:vir:79 239 FWDLQDPATDAGYLNANEVTTLVHRDG-YRFWGSRTCSADPLFAFENYTRTAQVLADTMAEAHMWANDL----PMTPTLV 313 (391) T ss_pred ccccccccchhhhhhhcCceEEECCCc-EEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhccC----CCCHHHH Confidence 224566788899999999876544 356677788875 8999999999999999988765432 7789889 Q ss_pred HHHHHHHHHHHHHHHhcCcccccccCCCcceEEEccchhcCCHHHHHhcccCCeEEEEEEcceEEEEEEEEEEeC Q lcl|Aclame:pro 257 ALLQSELTTVLNEGFANGIIDSNDETGEPNFSITALQRSDLNDDDIAKRNYKGLSFRYKRSGAIHSVDVYGEVEV 331 (331) Q Consensus 257 ~~i~~~v~~vl~~~~~~G~I~~g~~~~~~~~~v~~~~~~~~~~~dr~~R~~~~i~~~~~~aGaIh~v~i~~~v~~ 331 (331) ..|+..++.-|+..+++|.|.. |++++. .+..|++++.+.+.. +.+.+...-.+++|++....+. T Consensus 314 ~~i~~~i~~~l~~l~~~g~l~g--------~~v~~~-~~~nt~~~i~~G~~~-~~i~~~p~~p~e~i~~~~~~~~ 378 (391) T protein:vir:79 314 RDLLEGINAKLRMLTRNGYLLG--------GAAWFD-ADANSKDTLKAGQLA-IDYDYTPVPPLENLTFRQRITD 378 (391) T ss_pred HHHHHHHHHHHHHHHhCCceec--------eEEEEe-cCCCCHHHhhCCEEE-EEEEEEecCCcceEEEEEEEch Confidence 9999999999999999999963 567765 467789998888875 8899999999999999999988 No 29 >protein:vir:99306 Length: 587 # NCBI annotation: putative major tail sheath protein # Family: family:all:2449 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024479;genbank:gi:48696438;genbank:GeneID:2948034 Probab=98.84 E-value=2.3e-08 Score=62.47 Aligned_cols=310 Identities=9% Similarity=0.060 Sum_probs=155.4 Q ss_pred CCCceee---EEEEEee--------cccccccccce---eEEEEccCCcceEEEechhh------hcc------CCCCCh Q lcl|Aclame:pro 1 MVETITD---VRVHISV--------LYPSPRIGLGR---PAIFVKGTAMGYKEYTTLEE------LKD------TFADNT 54 (331) Q Consensus 1 ~v~~i~d---V~v~i~~--------~~~~~~~~fg~---~li~~~~~~~~~~~yts~~~------v~~------~f~~~s 54 (331) .++.|-+ ++-.... ..+-....+.. ..++. ....+...|..... +.. ...... T Consensus 206 ~~~~i~~~~~~tAky~~~~~~~i~~~~~~~~~~~~v~~~~~~v~-a~~~D~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~ 284 (587) T protein:vir:99 206 IITDINQLPDFEAKLSPFGDKNLESSKLDKIENANIKDKAVYVK-AVFGDLEKQTAYNGIVSFEQLNAEGEVPSNVEVEA 284 (587) T ss_pred HHhhhccccceeEEeeccCCceeEeecccccccceeeeeeeeee-hhccceeeecccceeeeeeecccccchhhhhhhhh Confidence 2222221 1111100 00000000000 00000 00000000000000 000 000000 Q ss_pred HHHHHHHHHHccC----CCcceEEEEecc---chhHHHHHHHhhcCceeEEEEecCCHHHHHHHHHHHHhc----CcEEE Q lcl|Aclame:pro 55 EVYAKAKAVFLQK----DRPDTVAVITYE---DTKLLEAAEAYFLKSWHFALLAEFKAADALALSNLIEEQ----KFKFA 123 (331) Q Consensus 55 ~~ykaA~~~fsQ~----~~p~~v~v~~~~---~~~~~~al~~~~~~~~~f~~~~~~~~~~i~alA~w~ea~----~~~~~ 123 (331) ..+.+....+.-. +.+.....++.+ +.+..+++.++...+|+++.+...+.+.+.++..|++.. .+++. T Consensus 285 ~~~~~~~~~~~~~~~~a~~~~t~LtGG~dG~~~~sy~~al~ale~~~~~~i~~~t~d~~i~a~l~a~vk~~r~~g~~~~a 364 (587) T protein:vir:99 285 GEESATVTATSPIKTIEPFELTKLKGGTNGEPPATWADKLDKFAHEGGYYIVPLSSKQSVHAEVASFVKERSDAGEPMRA 364 (587) T ss_pred ccccceeeeeccccceecccceeeecCCCCCccccHHHHHHHHhhCCcEEEEecCCCHHHHHHHHHHHHHHHhCCCcEEE Confidence 0000000001000 011111222222 224556677666677777776666666667899998643 23444 Q ss_pred EEE---eCChHHH---HhhcccceEEEEEeC------CC------chhHHHHHHHHHhcCCcc-ceeeeeeeccCCcCCC Q lcl|Aclame:pro 124 VFQ---VTAVADI---TPLAKNTRTIAIVHS------KT------GEKLDAALIGNVASLPVG-SATWKGRHGLAGITSE 184 (331) Q Consensus 124 ~~~---~t~~~~~---~~~~~~~~t~~~~~~------~~------~~~~~aa~~g~~~~~~~G-~~t~~~k~~l~gv~~~ 184 (331) ++. ..+...+ ....++.|...+... +. .++.++.++|..++..+. +.|++=.. +.++. . T Consensus 365 Vlg~~~~~~~~~~~~~a~~~n~e~vi~v~~~~~~~~~dg~~~~~~~~~~aa~vAGl~Ag~~~~~SlT~~~i~-~~~v~-~ 442 (587) T protein:vir:99 365 IVGGGFNESKEQLFGRQASLSNPRVSLVANSGTFVMDDGRKNHVPAYMVAVALGGLASGLEIGESITFKPLR-VSSLD-Q 442 (587) T ss_pred EecCCCCCCHHHHHHHhhhcCCCcEEEEeccceEecCCCceeeechHHHHHHHHHHHhcCchhcCccceeee-ccccc-c Confidence 442 2233322 233455555443221 11 112234444555555553 45543222 34554 3 Q ss_pred CCCHHHHHHHHhCCCeEEEEEcCe----eEEecCEEeC----C-ce--ehhhHHHHHHHHHHHHHHHHHHhcCCCCCcCH Q lcl|Aclame:pro 185 ELKVSEIDAIQKAGGMCYIEKAGI----AQTSEGKTVS----G-EF--IDSIHGDDWIKATIETRLQKLLTETDKLTFDA 253 (331) Q Consensus 185 ~~t~t~~~~l~~~~~n~y~~~~g~----~~~~~G~~~~----G-~~--iD~~~~~dwl~~~iq~~l~~l~~~~~kipyt~ 253 (331) .++.+|++.+..+|.+.+....+. -..-+|.+.- + .| |-.++-.|.+...++..+-+.++. | |=++ T Consensus 443 ~~t~~e~e~li~~Gvl~l~~~~~~~~~~vriv~~ItT~t~~~~~~~~~i~viRv~D~i~~di~~~~~~~yiG--k-~Nn~ 519 (587) T protein:vir:99 443 IYESIDLDELNENGIISIEFVRNRTNTFFRIVDDVTTFNDKSDPVKAEMAVGEANDFLVSELKVQLEDQFIG--T-RTIN 519 (587) T ss_pred cCCHHHHHHHHhCCeEEEEEecCCcceEEEEeeceeeccCCCCchhhhhhhhhhHHHHHHHHHHHHHhhCCc--c-ccch Confidence 689999999999999988765432 2334455432 1 24 779999999999999988877765 3 5578 Q ss_pred HHHHHHHHHHHHHHHHHHhcCcccccccCCCcceEEEccchhcCCHHHHHhcccCCeEEEEEEcceEEEEEEEEEEeC Q lcl|Aclame:pro 254 RGIALLQSELTTVLNEGFANGIIDSNDETGEPNFSITALQRSDLNDDDIAKRNYKGLSFRYKRSGAIHSVDVYGEVEV 331 (331) Q Consensus 254 ~G~~~i~~~v~~vl~~~~~~G~I~~g~~~~~~~~~v~~~~~~~~~~~dr~~R~~~~i~~~~~~aGaIh~v~i~~~v~~ 331 (331) .|...|++.+++.|++..+.|.|..... . ...+.. ..| |. -+++.+++.-++++|.++.++.- T Consensus 520 ~~r~~i~~~i~~~L~~l~~~gaI~~~~~-~--dv~v~~-------~~d---~~--~v~~~v~Pv~~mekIy~tv~~~~ 582 (587) T protein:vir:99 520 TSASIIKDFIQSYLGRKKRDNEIQDFPA-E--DVQVIV-------EGN---EA--RISMTVYPIRSFKKISVSLVYKQ 582 (587) T ss_pred HHHHHHHHHHHHHHHHHHhCCcccCCCc-c--ceEEEe-------cCC---EE--EEEEEEEEcccceEEEEEEEEEe Confidence 9999999999999999999999964311 1 111111 111 22 37888999999999999988866 No 30 >protein:vir:105470 Length: 451 # NCBI annotation: putative phage sheath tail protein # Family: family:all:632 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529880;genbank:gi:90592620;genbank:GeneID:3974534 Probab=98.84 E-value=3.5e-09 Score=66.92 Aligned_cols=306 Identities=13% Similarity=0.082 Sum_probs=155.8 Q ss_pred CCCceee---EE-------EEEeeccccc---------------ccccceeEEEEccCCcceE-EEechhhhcc-CCC-- Q lcl|Aclame:pro 1 MVETITD---VR-------VHISVLYPSP---------------RIGLGRPAIFVKGTAMGYK-EYTTLEELKD-TFA-- 51 (331) Q Consensus 1 ~v~~i~d---V~-------v~i~~~~~~~---------------~~~fg~~li~~~~~~~~~~-~yts~~~v~~-~f~-- 51 (331) .+=|+.+ .. +.++...|.. ...|-..+.++........ .+.++.++.. +|- T Consensus 86 ~~yrl~~g~~a~~t~~~~~~~~~Aky~G~~Gn~i~v~v~~~~~d~~~~~v~t~~g~~~vd~qtv~~~~~~el~~nd~V~a 165 (451) T protein:vir:10 86 LVLNPNEGTAATLTKEGLPWTVTANYPGEKGNQITVSVEVSPADQNAATVSTIFGTKLVDEQSIKFNELDKFKGNDYITA 165 (451) T ss_pred EEEEcCCCceEEEEeecCceEEEEeeCCcCCceEEEEEecccCCcCceEEEEEECCeEEEEEEeeccchhhccCCceEEE Confidence 0000100 00 1111111211 1122222222211111111 1223443322 110 Q ss_pred ---CChHHHHHHHHHHccCCCcceEEEEeccchhHHHHHHHhhcCceeEEEEecCC--HHHHHHHHHHHHh----cCcEE Q lcl|Aclame:pro 52 ---DNTEVYAKAKAVFLQKDRPDTVAVITYEDTKLLEAAEAYFLKSWHFALLAEFK--AADALALSNLIEE----QKFKF 122 (331) Q Consensus 52 ---~~s~~ykaA~~~fsQ~~~p~~v~v~~~~~~~~~~al~~~~~~~~~f~~~~~~~--~~~i~alA~w~ea----~~~~~ 122 (331) .+....+.+...++.+..... ...+.++..+++.......|.++++...+ .+....+.+|+.. .++++ T Consensus 166 ~~~~~g~~~~~~~~~l~~~~~gg~---~~~~~~~~~~~l~~~e~~~~n~l~~~~~~~~~~i~~~~~a~ik~~r~~~g~~~ 242 (451) T protein:vir:10 166 KVVEEGSSKPVAFTNVSGTLTGGT---TTESNKVESLLNDALENEEYAVVTTAGFEPSSNMNKLVVEAVKRLRENEGRKV 242 (451) T ss_pred Eecccccccceeeeeccccccccc---ccCCccchHHHHHHhccceeeEEEEccCCCchHHHHHHHHHHHHHHHhcCCeE Confidence 001111111111111111000 01123344556666555666666665443 4456678999975 24555 Q ss_pred EEEEeCChHHHHhhcccceEEEEEeC----CC----chhHHHHHHHHHhcCCcc-ceeeeeeeccCCcC-C-CCCCHHHH Q lcl|Aclame:pro 123 AVFQVTAVADITPLAKNTRTIAIVHS----KT----GEKLDAALIGNVASLPVG-SATWKGRHGLAGIT-S-EELKVSEI 191 (331) Q Consensus 123 ~~~~~t~~~~~~~~~~~~~t~~~~~~----~~----~~~~~aa~~g~~~~~~~G-~~t~~~k~~l~gv~-~-~~~t~t~~ 191 (331) ..+...... ...++.+..-+.+. +. ....++.++|.+++.... |.| |+. ++|+. . ..++.+|+ T Consensus 243 ~aVl~~~~~---~~~d~egiinv~n~~~~~dg~~~~~~~~~~~vAG~~Ag~~~~~S~T--~~~-~~~~~~v~~~~t~~e~ 316 (451) T protein:vir:10 243 RGVIPTDAD---TTYNYEGISTVVNGYTLSDGTNVDVKDATGYFAGISASADVATSLT--YFE-VEDAVSAYPKFDNEKT 316 (451) T ss_pred EEEecCccC---CCCCCcceEEeecceEecCceeechhhhHHHHHHHHcccccccCcc--cee-cCCceeeeeeCCHHHH Confidence 433211100 01122222211111 11 122345555555555442 455 443 67752 2 46999999 Q ss_pred HHHHhCCCeEEEEEcC-eeEEecCEEe----C---C---ceehhhHHHHHHHHHHHHHHHHHHhcCCCCCcCHHHHHHHH Q lcl|Aclame:pro 192 DAIQKAGGMCYIEKAG-IAQTSEGKTV----S---G---EFIDSIHGDDWIKATIETRLQKLLTETDKLTFDARGIALLQ 260 (331) Q Consensus 192 ~~l~~~~~n~y~~~~g-~~~~~~G~~~----~---G---~~iD~~~~~dwl~~~iq~~l~~l~~~~~kipyt~~G~~~i~ 260 (331) +.+.++|..++....| ...+.+|..+ + + ..|-.++..|-+.++++..+-+.++ +|+|=+..|..++. T Consensus 317 ~~~i~~G~lvl~~~~g~~v~i~~~INTltt~~~~k~~~~~ki~vir~~D~i~~di~~~~~~~yi--Gk~~N~~~gr~~~~ 394 (451) T protein:vir:10 317 IKALDAGQIVFTTRPGQRVVIEQDINSLHKFTAEKPQAFSKNRVIRTLDEIATNTENTFERTYL--GNVGNNAAGRDLFK 394 (451) T ss_pred HHHHhCCeEEEEEEcCCeEEEEEccccceecCCCCCcchhhhhHHHHHHHHHHHHHHHhhhccc--eecCCCHHHHHHHH Confidence 9999999987754444 4555667532 1 1 2488999999999999876655555 69999999999999 Q ss_pred HHHHHHHHHHHhcCcccccccCCCcceEEEccchhcCCHHHHHhcccCCeEEEEEEcceEEEEEEEEEEe Q lcl|Aclame:pro 261 SELTTVLNEGFANGIIDSNDETGEPNFSITALQRSDLNDDDIAKRNYKGLSFRYKRSGAIHSVDVYGEVE 330 (331) Q Consensus 261 ~~v~~vl~~~~~~G~I~~g~~~~~~~~~v~~~~~~~~~~~dr~~R~~~~i~~~~~~aGaIh~v~i~~~v~ 330 (331) +.++..|++..+.|.|.++... .+.... . ..+..--+++.+++-.|+..+.++..+. T Consensus 395 ~~i~~yl~~l~~~g~i~~~~~~-----d~~v~~------~--~~~~~v~v~~~v~pvdame~iy~t~~v~ 451 (451) T protein:vir:10 395 ADRIAYLTSLQNRNMIQSFANT-----DITVEA------G--NDMDSIVVNLAVTPVDAMEKLYMTMVVR 451 (451) T ss_pred HHHHHHHHHHHhCCCccCCCcc-----ceEEee------c--CCCCEEEEEEEEEEEeeeeeEEEEEEEc Confidence 9999999999999999764321 121111 0 0122234888899999999999999999 No 31 >protein:vir:7653 Length: 581 # NCBI annotation: gp124 # Family: family:all:1196 # MgeID: mge:148 # MgeName: Bxz1 # Cross-refs: genbank:acc:NP_818197;genbank:gi:29566631;genbank:GeneID:1259809 Probab=98.82 E-value=3.6e-08 Score=61.42 Aligned_cols=307 Identities=13% Similarity=0.069 Sum_probs=161.9 Q ss_pred CCCceeeEEEEEeecccccccccceeEEEE-----ccCC-cceEEEechhhhccCC--------CCChHHHHHHHHHHcc Q lcl|Aclame:pro 1 MVETITDVRVHISVLYPSPRIGLGRPAIFV-----KGTA-MGYKEYTTLEELKDTF--------ADNTEVYAKAKAVFLQ 66 (331) Q Consensus 1 ~v~~i~dV~v~i~~~~~~~~~~fg~~li~~-----~~~~-~~~~~yts~~~v~~~f--------~~~s~~ykaA~~~fsQ 66 (331) ---++.+....+...-.... -....|+. .++. .....|.+-++....+ ...++.-+.++..|.. T Consensus 201 ~~~~~~~~~~t~~~~~~g~~--~~~~~i~~~~~~~~D~~~~~~v~~~~~~~~~~~~~~~~~~~g~~~~e~~~~~~~~~t~ 278 (581) T protein:vir:76 201 GEANTRDDLYTIQRVVDGGH--IDPGDIVQLSYRYTDPNYHEVIRFTDPDDIQDFYGPAFDEAGNVQSEITLCAQLAITN 278 (581) T ss_pred cceeeeeeeeeeEeeccccc--ccceeEEEEEEEeecCCccceEEEecccccccceeeehhhcCccccchhhhhheeecc Confidence 00001111111111000000 01111111 1111 1222333333332222 1223444444433333 Q ss_pred CCCcceEEEEeccc-------hhHHHHHHHhhcCceeEEEEecCCHHHHH-HHHHHHHhcC----cEE---EEEEe---C Q lcl|Aclame:pro 67 KDRPDTVAVITYED-------TKLLEAAEAYFLKSWHFALLAEFKAADAL-ALSNLIEEQK----FKF---AVFQV---T 128 (331) Q Consensus 67 ~~~p~~v~v~~~~~-------~~~~~al~~~~~~~~~f~~~~~~~~~~i~-alA~w~ea~~----~~~---~~~~~---t 128 (331) ++ ......+.++ .+..++|.++......++++...+.+.+. ++..|++... .+. .+... . T Consensus 279 ~~--~~~l~~gvd~~g~tvt~~dy~~aL~ale~~~~~~ivvp~t~~~~i~a~l~ahv~~~s~~~~~~ra~igv~g~~~~~ 356 (581) T protein:vir:76 279 GA--STILACAVDPEGDTVTMGDYQNALNKFRDEDEIAIIVAGTGAQPIQALVQQHVSAQSNNKYERRAILGMDGSVTPV 356 (581) T ss_pred cc--ceEEEeeecCCCCccchHHHHHHHHHHhcCCeEEEEEecCCChHHHHHHHHHHHHHHhccCCceEEEEeeCCCCCc Confidence 33 3333333332 35667787777776666556555555554 4777775432 222 22211 1 Q ss_pred ChHHH---HhhcccceEEEEEe------CC--------CchhHHHHHHHHHhcCCccceeeeeeeccCCcCC--CCCCHH Q lcl|Aclame:pro 129 AVADI---TPLAKNTRTIAIVH------SK--------TGEKLDAALIGNVASLPVGSATWKGRHGLAGITS--EELKVS 189 (331) Q Consensus 129 ~~~~~---~~~~~~~~t~~~~~------~~--------~~~~~~aa~~g~~~~~~~G~~t~~~k~~l~gv~~--~~~t~t 189 (331) +.... ....++.|....+. .. +.++.++.++|..+...+. ...-+| .++|+.. ..++.+ T Consensus 357 ~~~~~~~~a~~~ns~Rvvlv~p~~~~~~g~~~~~~~~lp~~~~AA~vAG~~a~~~~~-~slT~~-~i~g~~~~~~~~s~~ 434 (581) T protein:vir:76 357 PSATRIANAQSIKDQRVALISPSSFVYYAPELNREVVLGGQFMAAAVAGKSVSAIAA-MPLTRK-VIRGFSGPAEVQRDG 434 (581) T ss_pred hHHHHHHhhcccCCCcEEEEEcCceEeccccCCcceecchhhhhhhHHhhhhccccc-cCcccc-cccccccccccCCHH Confidence 22222 12234556654442 11 1122344455555555553 344466 4788754 368999 Q ss_pred HHHHHHhCCCeEEEEEcCe-eEEecCEEeC---C--ceehhhHHHHHHHHHHHHHHHH-HHhcCCCCCcCHHHHHHHHHH Q lcl|Aclame:pro 190 EIDAIQKAGGMCYIEKAGI-AQTSEGKTVS---G--EFIDSIHGDDWIKATIETRLQK-LLTETDKLTFDARGIALLQSE 262 (331) Q Consensus 190 ~~~~l~~~~~n~y~~~~g~-~~~~~G~~~~---G--~~iD~~~~~dwl~~~iq~~l~~-l~~~~~kipyt~~G~~~i~~~ 262 (331) |++.+..+|++.+....+. ..+-+|.+.- . +.|-+++-.|.+...+++.+.. .|.. + |-++.|...|++. T Consensus 435 e~e~ll~~Gv~~l~~~~~~~v~Iv~gItT~~s~~~~k~i~viR~~D~v~~~vr~~~~~~~fiG--~-~n~~~~r~~ik~~ 511 (581) T protein:vir:76 435 EKSRESSEGLMVIEKTPRNLVHVRHGVTTDPTSLHTREWNIIGQQDVMVYRIRDYLDADGLIG--M-PIYDTTIVQVKAS 511 (581) T ss_pred HHHHHHhCCeEEEEEecCCeEEEEEeeecCCCCCccceeeehhhhHHHHHHHHHHHhhhcCCC--c-ccChHHHHHHHHH Confidence 9999999999999876554 3455666432 2 4688999999999999998864 4663 3 6689999999999 Q ss_pred HHHHHHHHHhcCcccccccCCCcceEEEccchhcCCHHHHH-hcccCCeEEEEEEcceEEEEEEEEEEeC Q lcl|Aclame:pro 263 LTTVLNEGFANGIIDSNDETGEPNFSITALQRSDLNDDDIA-KRNYKGLSFRYKRSGAIHSVDVYGEVEV 331 (331) Q Consensus 263 v~~vl~~~~~~G~I~~g~~~~~~~~~v~~~~~~~~~~~dr~-~R~~~~i~~~~~~aGaIh~v~i~~~v~~ 331 (331) ++++|++.+++|.|........ +..+++ .|. -+.+.++..-+|.+|.++..+.= T Consensus 512 i~~~L~~l~~~g~I~g~~~~~~-------------~~~~~~~d~v--~V~i~v~Pv~~ie~I~vt~~~~p 566 (581) T protein:vir:76 512 AEAALVWLVDNNIIRGYRNLKA-------------RQIERQPDVI--EVRYEWRPAYPLNYIVVRYSIAP 566 (581) T ss_pred HHHHHHHHHhcCcccCccccee-------------eEEecCCCEE--EEEEEEEecccceEEEEEEEEee Confidence 9999999999999974321111 111111 122 25677777888888777665544 No 32 >protein:vir:1172 Length: 391 # NCBI annotation: hypothetical protein # Family: family:all:115 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490621;genbank:gi:17313241;genbank:GeneID:927300 Probab=98.80 E-value=4.2e-08 Score=61.03 Aligned_cols=313 Identities=12% Similarity=0.086 Sum_probs=186.9 Q ss_pred CCC-ce-eeEEEEEe--ecccccccccceeEEEEccCCc--------ceEEEechhhhccCCCCChHHHHHHHHHHccCC Q lcl|Aclame:pro 1 MVE-TI-TDVRVHIS--VLYPSPRIGLGRPAIFVKGTAM--------GYKEYTTLEELKDTFADNTEVYAKAKAVFLQKD 68 (331) Q Consensus 1 ~v~-~i-~dV~v~i~--~~~~~~~~~fg~~li~~~~~~~--------~~~~yts~~~v~~~f~~~s~~ykaA~~~fsQ~~ 68 (331) |-. .. -.|.|.-- ...++..+....+.++++.... .....++..+....|+.+..++.+...+|.++. T Consensus 1 M~~~~~~~GV~v~e~~~~~~~i~~v~tavig~vg~a~~a~~~~~~~~~p~~v~s~~~~~~~~g~~~tl~~al~~~~~~~g 80 (391) T protein:vir:11 1 MAADQYHHGVRVQEINDGTRPIRTIATAIIGLVATAEDADATAFPLDTPVLITNVQAAIGKAGTSGTLPASLQAIADQAN 80 (391) T ss_pred CCCCcCCCcEEEEECCCCcceecccCCceeEEEEecCCCCCccccccccEEEecchhhheecCCCccchhhhhhhhcccc Confidence 333 22 23544332 2223344566666666655421 124566666666678888888889999999887 Q ss_pred CcceEE-EEeccchh-----------------HHHHHHHhhcCc---eeEEEEecCC-HHHHHHHHHHHHhcCcEEEEEE Q lcl|Aclame:pro 69 RPDTVA-VITYEDTK-----------------LLEAAEAYFLKS---WHFALLAEFK-AADALALSNLIEEQKFKFAVFQ 126 (331) Q Consensus 69 ~p~~v~-v~~~~~~~-----------------~~~al~~~~~~~---~~f~~~~~~~-~~~i~alA~w~ea~~~~~~~~~ 126 (331) ....+. +...+... ...++.+..... -..+.....+ .+...++...++. -+.|.++. T Consensus 81 ~~~~vv~~~~~~~~~~t~~d~~g~~~a~~~~~g~~a~~~~~~~~~~~p~~~~ap~~~~~~v~~al~~~~~~-~~~~~i~D 159 (391) T protein:vir:11 81 AATVVVRVKPGEDEAATNSAVIGGVSADGKYTGMKALLAAKARLGVVPRILGVPGLDTQPVATALIAIAQQ-LRAFAYVS 159 (391) T ss_pred ceeEEeeecccccccccchhhhcccccccchhhhhhhhhhhhhheeccccccccccccHHHHHHHHHhhcc-cceEEEEE Confidence 664332 21111110 011111111100 0111111222 2233344444433 34555554 Q ss_pred eC---ChHHHHhh---cccceEEEEEeCC-------Cc----hhHHHHHHHHHhcCCccceee---eeeeccCCcCCC-- Q lcl|Aclame:pro 127 VT---AVADITPL---AKNTRTIAIVHSK-------TG----EKLDAALIGNVASLPVGSATW---KGRHGLAGITSE-- 184 (331) Q Consensus 127 ~t---~~~~~~~~---~~~~~t~~~~~~~-------~~----~~~~aa~~g~~~~~~~G~~t~---~~k~~l~gv~~~-- 184 (331) .. +...+... .+..+.++.+ +. .+ ..+.+.++|..+..+.-.-.| ..| ++.||..- T Consensus 160 ~p~~~t~~~a~~~r~~~~s~~~~~~~-p~~~~~~~~~~~~~~~p~s~~~ag~~a~~d~~~g~~~span~-~l~gi~~~~~ 237 (391) T protein:vir:11 160 ASGCKTKEEATAYRENFAAREAMVIW-PDFLTWSTVVNQTVPAPAVAQALGLRARIDQEVGWHKTLSNV-AVNGVTGISA 237 (391) T ss_pred cCCCCCHHHHHHHhhhcCCceEEEEc-CcceecccccCceEEechHHHHHHHHHHhhccCCcEEccCCc-eeeceeeccc Confidence 22 22333222 2334444333 21 11 124566666665444322122 233 34554331 Q ss_pred ------CCCHHHHHHHHhCCCeEEEEEcCeeEEecCEEeCCc----eehhhHHHHHHHHHHHHHHHHHHhcCCCCCcCHH Q lcl|Aclame:pro 185 ------ELKVSEIDAIQKAGGMCYIEKAGIAQTSEGKTVSGE----FIDSIHGDDWIKATIETRLQKLLTETDKLTFDAR 254 (331) Q Consensus 185 ------~~t~t~~~~l~~~~~n~y~~~~g~~~~~~G~~~~G~----~iD~~~~~dwl~~~iq~~l~~l~~~~~kipyt~~ 254 (331) .++..|.+.|..+|+|......| -.+..+++++++ ||-+.+-.+|+...|+..+...+=. |.++. T Consensus 238 ~~~~~~~~~~~~~~~Ln~~gi~~~~~~~G-~~~wG~rT~~~d~~~~~i~vrR~~~~i~~~~~~~~~~~v~e----~n~~~ 312 (391) T protein:vir:11 238 DVFWDLQSPSTDANYLNENEVTTLVQEGG-FRFWGSRTCSDDPLFAFENYTRTAQVLADTIAEAHMWAVDK----PMHPS 312 (391) T ss_pred ccccccCCCcchhhhhhhcCcEEEEcCCC-EEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhccC----CCCHH Confidence 24678999999999999866444 456677788774 8999999999999999887754432 66888 Q ss_pred HHHHHHHHHHHHHHHHHhcCcccccccCCCcceEEEccchhcCCHHHHHhcccCCeEEEEEEcceEEEEEEEEEEeC Q lcl|Aclame:pro 255 GIALLQSELTTVLNEGFANGIIDSNDETGEPNFSITALQRSDLNDDDIAKRNYKGLSFRYKRSGAIHSVDVYGEVEV 331 (331) Q Consensus 255 G~~~i~~~v~~vl~~~~~~G~I~~g~~~~~~~~~v~~~~~~~~~~~dr~~R~~~~i~~~~~~aGaIh~v~i~~~v~~ 331 (331) =...|+..++..|+..+++|.|.. |++.+.. +..|++++.+.+.. +.+.+.....++.|.+....+. T Consensus 313 ~~~~i~~~i~~~l~~l~~~g~l~g--------~~~~~~~-~~n~~~~i~~G~~~-~~i~~~p~~p~e~i~~~~~~~~ 379 (391) T protein:vir:11 313 LVRDILEGVNAKFRELKGLGLIID--------AQAWYDP-NVNDKDTLKAGKLR-ITYDYTPVPPLEDLTFFQKITD 379 (391) T ss_pred HHHHHHHHHHHHHHHHHhccceec--------eEEEEec-CCCCHHHhhCCeEE-EEEEEEecCCcceEEEEEEEch Confidence 889999999999999999999963 5677643 67789999988885 8999999999999999999998 No 33 >protein:vir:80488 Length: 562 # NCBI annotation: Tsh # Family: family:all:2449 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468473;genbank:gi:157325048;genbank:GeneID:5601446 Probab=98.78 E-value=1.6e-08 Score=63.40 Aligned_cols=301 Identities=11% Similarity=0.072 Sum_probs=155.5 Q ss_pred CCCceeeEEEEEeecccccccccceeEEEEc-cCCcce---EEEec--hhhhc-----cCCC-----CChHHHHHHHHHH Q lcl|Aclame:pro 1 MVETITDVRVHISVLYPSPRIGLGRPAIFVK-GTAMGY---KEYTT--LEELK-----DTFA-----DNTEVYAKAKAVF 64 (331) Q Consensus 1 ~v~~i~dV~v~i~~~~~~~~~~fg~~li~~~-~~~~~~---~~yts--~~~v~-----~~f~-----~~s~~ykaA~~~f 64 (331) .+..|-.+ ..+...-+. ..+....+-... -..... ..|-. ..++. .+|- .+..+-.-+...| T Consensus 206 l~~~i~~~-~~~tAky~g-~~~n~i~~~~~d~~~~~~~kt~~~~v~~~~~d~~~~~~~n~~v~~~~~~~~~la~~~~~~L 283 (562) T protein:vir:80 206 LISDINNL-PDFEAKFFP-IGDKNLTTDNFDAQIDVDIKTKEAYVKAVGGDIEKQTAYNGYVEFEFDRSKEIANFPLTKL 283 (562) T ss_pred hhhhhccc-cceEEEecc-cCCceeeecccccchhhhcccceeeeeehhhhhhhcccccceEEEEeccCccccccceeee Confidence 11111110 001100010 011110000000 000000 00100 00000 0000 0000000011222 Q ss_pred ccCCCcceEEEEeccchhHHHHHHHhhcCceeEEEEecCCHHHHHHHHHHHHhc----CcEEEEEE---eCChHHHH--- Q lcl|Aclame:pro 65 LQKDRPDTVAVITYEDTKLLEAAEAYFLKSWHFALLAEFKAADALALSNLIEEQ----KFKFAVFQ---VTAVADIT--- 134 (331) Q Consensus 65 sQ~~~p~~v~v~~~~~~~~~~al~~~~~~~~~f~~~~~~~~~~i~alA~w~ea~----~~~~~~~~---~t~~~~~~--- 134 (331) ..+..+. .+.+..+++..+....|+++.+...+.+.+.+++.|++.. .+++.++. ..+..... T Consensus 284 tGG~dG~-------~~~~~~dal~~Le~~~~~~i~~~t~d~ai~~~~~a~vkr~r~~g~~~~aVvg~~~~~~~~~~~~~a 356 (562) T protein:vir:80 284 TGGDNGT-------IPESWADKFSYFANEGGYYLVPLTSKQAVHAEALQFVRDCSYNGNPMRVFVGGGIGESMEQLFTRA 356 (562) T ss_pred eCCCCCC-------ccccHHHHHHHHHhCCcEEEEecCCChHHHHHHHHHHHHHHhCCCeEEEEecCCCCCCHHHHHHHh Confidence 2222211 1224456666666667777777666667678899999642 23444442 23333332 Q ss_pred hhcccceEEEEEeCC------------CchhHHHHHHHHHhcCCcc-ceeeeeeeccCCcC-CCCCCHHHHHHHHhCCCe Q lcl|Aclame:pro 135 PLAKNTRTIAIVHSK------------TGEKLDAALIGNVASLPVG-SATWKGRHGLAGIT-SEELKVSEIDAIQKAGGM 200 (331) Q Consensus 135 ~~~~~~~t~~~~~~~------------~~~~~~aa~~g~~~~~~~G-~~t~~~k~~l~gv~-~~~~t~t~~~~l~~~~~n 200 (331) ...++.|...+.++. +....++.++|..++..+. +.|+ |. ++++. ...++.+|++.+.++|.+ T Consensus 357 ~~~n~e~vv~v~~~~~~~~~~~~~~~~~~~~~aa~vAGl~Ag~~~~~S~T~--~~-i~~~~v~~~lt~~e~~~li~~G~l 433 (562) T protein:vir:80 357 IGLQNERAGLIGFSGTVKMDDGRSLKMPGYMFAAQVAGLTCGLEIGEAITF--KN-IAIETLDTIYEGSQLDQLNESGII 433 (562) T ss_pred hhcCCCeEEEEecCeeEECCCCceeeechhHHHHHHHHHHhcCccccCccc--ee-eccccccccCCHHHHHHHHhCCeE Confidence 223456665543321 1112344555555555543 4454 43 45432 246899999999999999 Q ss_pred EEEEEcCee----EEecCEEeCC-----c--eehhhHHHHHHHHHHHHHHHHHHhcCCCCCcCHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 201 CYIEKAGIA----QTSEGKTVSG-----E--FIDSIHGDDWIKATIETRLQKLLTETDKLTFDARGIALLQSELTTVLNE 269 (331) Q Consensus 201 ~y~~~~g~~----~~~~G~~~~G-----~--~iD~~~~~dwl~~~iq~~l~~l~~~~~kipyt~~G~~~i~~~v~~vl~~ 269 (331) ++....+.. ..-++.+.-. . .|-+++-.|.+...++..+-+.++. | |-++.|...|++.++..|++ T Consensus 434 ~l~~~~~~~v~~~riv~~itT~t~~~~~~~~ki~viRv~D~i~~dir~~~~~~yIG--k-~Nn~~~r~~v~~~i~~~L~~ 510 (562) T protein:vir:80 434 TAEFVRNRAVTNFRIVDDVTTFNDKTDPVKSEIGVGEANDFLVSELKISLDNEYIG--T-KIIDTSASLVKNFVQSFLDR 510 (562) T ss_pred EEEEecCCcEEEEEeeccceeccCCCCchhhhhhhhHHHHHHHHHHHHHHHhcCCc--c-ccChHHHHHHHHHHHHHHHH Confidence 997755432 2234443321 2 4789999999999999888777774 4 56889999999999999999 Q ss_pred HHhcCcccccccCCCcceEEEccchhcCCHHHHHhcccCCeEEEEEEcceEEEEEEEEEEeC Q lcl|Aclame:pro 270 GFANGIIDSNDETGEPNFSITALQRSDLNDDDIAKRNYKGLSFRYKRSGAIHSVDVYGEVEV 331 (331) Q Consensus 270 ~~~~G~I~~g~~~~~~~~~v~~~~~~~~~~~dr~~R~~~~i~~~~~~aGaIh~v~i~~~v~~ 331 (331) ..+.|.|..... . ..++. .++| +. -+.+.+...-++++|.++.++.- T Consensus 511 l~~~gaI~~~~~-~--dv~v~-------~~~d---~~--~v~~~v~Pv~~mekIy~ti~~~~ 557 (562) T protein:vir:80 511 KKLAKEIQDYSP-E--EVQVV-------IEGD---IA--RISLTVFPIRSMKKIEVSLVYRQ 557 (562) T ss_pred HHhCCcccCCCc-c--ceEEE-------ecCC---EE--EEEEEEEEcccceEEEEEEEEEe Confidence 999999964321 1 11121 1122 22 37888999999999999998887 No 34 >protein:vir:63742 Length: 562 # NCBI annotation: Tsh # Family: family:all:2449 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547629;genbank:GeneID:3783475 Probab=98.73 E-value=1.7e-08 Score=63.18 Aligned_cols=304 Identities=11% Similarity=0.061 Sum_probs=152.3 Q ss_pred CCCceeeEEEEEeecccccccccceeEEEEcc-CCcceE---EEec--hhhhc-----cCCCCChHHHHHHHHHHccCCC Q lcl|Aclame:pro 1 MVETITDVRVHISVLYPSPRIGLGRPAIFVKG-TAMGYK---EYTT--LEELK-----DTFADNTEVYAKAKAVFLQKDR 69 (331) Q Consensus 1 ~v~~i~dV~v~i~~~~~~~~~~fg~~li~~~~-~~~~~~---~yts--~~~v~-----~~f~~~s~~ykaA~~~fsQ~~~ 69 (331) +++.|-.+ ..++..-+.. .++...+-.... .....+ .|.. ..++. .+|-..... ....+. .- T Consensus 206 l~~~in~~-~~~~aky~~~-~gn~i~~~~~d~~~~~~vkt~~~~v~t~~~d~~~~~~~~~~v~~~~~---~~~~la--~~ 278 (562) T protein:vir:63 206 LISDINNL-PDFEAKFFPI-GDKNLTTDNFDAQIDVDIKTKEAYVKAVGGDIEKQTAYNGYVDFEFD---RSKEIA--NF 278 (562) T ss_pred HHHhhccc-cceEEEeecc-CCceeeeeccccccccchhhhhhhhhhhhhhhhhcccccceeeeeec---ccccee--cc Confidence 11111111 0011111110 111111000000 000000 0100 00000 000000000 000000 00 Q ss_pred cceEEEEecc---chhHHHHHHHhhcCceeEEEEecCCHHHHHHHHHHHHh----cCcEEEEEE---eCChHHHH---hh Q lcl|Aclame:pro 70 PDTVAVITYE---DTKLLEAAEAYFLKSWHFALLAEFKAADALALSNLIEE----QKFKFAVFQ---VTAVADIT---PL 136 (331) Q Consensus 70 p~~v~v~~~~---~~~~~~al~~~~~~~~~f~~~~~~~~~~i~alA~w~ea----~~~~~~~~~---~t~~~~~~---~~ 136 (331) +.....++.+ +.+..+++..+....|+++.+...+.+-+.++..|++. ..+++.++. ..+..... .. T Consensus 279 ~~~~LtGG~dGt~~~~~~~al~ale~~~~~~i~~~t~d~av~~~l~a~vkr~~~~g~~~~aVlg~~~~~~~~~~~~~a~~ 358 (562) T protein:vir:63 279 PLTKLTGGDNGTIPESWADKFSYFANEGGYYLVPLTSKQAVHAEALQFVRDCSYNGNPMRVFVGGGIGESMEQLFTRAIG 358 (562) T ss_pred cceeeecCCCCCchhhHHHHHHHHHhCCcEEEEecCCCHHHHHHHHHHHHHHHhCCCcEEEEecCCCCCCHHHHHHHhhh Confidence 1111112222 12334556655556677776665566666789999954 233455442 23333332 22 Q ss_pred cccceEEEEEeCC-----C---chh----HHHHHHHHHhcCCcc-ceeeeeeeccCCcCCCCCCHHHHHHHHhCCCeEEE Q lcl|Aclame:pro 137 AKNTRTIAIVHSK-----T---GEK----LDAALIGNVASLPVG-SATWKGRHGLAGITSEELKVSEIDAIQKAGGMCYI 203 (331) Q Consensus 137 ~~~~~t~~~~~~~-----~---~~~----~~aa~~g~~~~~~~G-~~t~~~k~~l~gv~~~~~t~t~~~~l~~~~~n~y~ 203 (331) .++.|...+.+.. . ..+ .++.++|..+...+. +.|++=.. +.++. ..++.+|++.+..+|.+.+. T Consensus 359 ~n~ervv~v~~~~~~~~~~~~~~~~~~~~~aa~vAGl~A~~~~~~SlT~~~i~-~~~v~-~~~t~~e~~~li~~Gv~~l~ 436 (562) T protein:vir:63 359 LQNERAGLIGFSGTVKMDDGRSLKMPGYMFAAQVAGLTCGLEIGEAITFKNIA-IETLD-TIYEGSQLDQLNESGIITAE 436 (562) T ss_pred cCCCcEEEEecCeeEECCCCceeeechhHHHHHHHHHhhcCchhcCccceeec-ccccc-ccCCHHHHHHHHhCCeEEEE Confidence 3456665544321 0 112 234455545555443 45543322 33443 47999999999999999997 Q ss_pred EEcCee----EEecCEEeCC-----c--eehhhHHHHHHHHHHHHHHHHHHhcCCCCCcCHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 204 EKAGIA----QTSEGKTVSG-----E--FIDSIHGDDWIKATIETRLQKLLTETDKLTFDARGIALLQSELTTVLNEGFA 272 (331) Q Consensus 204 ~~~g~~----~~~~G~~~~G-----~--~iD~~~~~dwl~~~iq~~l~~l~~~~~kipyt~~G~~~i~~~v~~vl~~~~~ 272 (331) ...+.. ..-++.+.-+ . .|-+++-.|.+...++..+-+.++. | |-++.|...|++.+++.|++..+ T Consensus 437 ~~~~~~v~~~~iv~~itT~t~~~~~~~~ki~viRv~D~i~~dir~~~~~~yiG--k-~Nn~~~r~~v~~~i~~~L~~l~~ 513 (562) T protein:vir:63 437 FVRNRAVTNFRIVDDVTTFNDKTDPVKSEIGVGEANDFLVSELKISLDNEYIG--T-KIIDTSASLVKNFVQSFLDRKKL 513 (562) T ss_pred EecCCcEEEEEeeccceecCCCCCchhhhhhhhHHHHHHHHHHHHHHHhcCCc--c-ccChHHHHHHHHHHHHHHHHHHh Confidence 755432 1223443321 2 4789999999999999888777764 4 56889999999999999999999 Q ss_pred cCcccccccCCCcceEEEccchhcCCHHHHHhcccCCeEEEEEEcceEEEEEEEEEEeC Q lcl|Aclame:pro 273 NGIIDSNDETGEPNFSITALQRSDLNDDDIAKRNYKGLSFRYKRSGAIHSVDVYGEVEV 331 (331) Q Consensus 273 ~G~I~~g~~~~~~~~~v~~~~~~~~~~~dr~~R~~~~i~~~~~~aGaIh~v~i~~~v~~ 331 (331) .|.|..... .+ ..+.. ..| +. -+.+.+...-++++|.++.++.- T Consensus 514 ~gaI~~~~~-~d--v~v~~-------~~d---~~--~v~~~v~pv~~mekIy~ti~~~~ 557 (562) T protein:vir:63 514 AKEIQDYSP-EE--VQVVI-------EGD---VA--RISLTVFPIRSMKKIEVSLVYRQ 557 (562) T ss_pred CCcccCCCc-cc--eEEEe-------cCC---EE--EEEEEEEEcccceEEEEEEEEee Confidence 999963211 11 11111 112 22 36788899999999999998887 No 35 >protein:vir:5833 Length: 742 # NCBI annotation: similar to tail sheath protein # Family: family:all:661 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835619;genbank:gi:30044022 Probab=98.69 E-value=1.1e-07 Score=58.75 Aligned_cols=308 Identities=11% Similarity=0.064 Sum_probs=147.1 Q ss_pred CCCceeeEEEEEeecccccccccc-eeEEEEccCCcceE----EE--echhh-h-----ccCCCCChHHHHHHHHHHccC Q lcl|Aclame:pro 1 MVETITDVRVHISVLYPSPRIGLG-RPAIFVKGTAMGYK----EY--TTLEE-L-----KDTFADNTEVYAKAKAVFLQK 67 (331) Q Consensus 1 ~v~~i~dV~v~i~~~~~~~~~~fg-~~li~~~~~~~~~~----~y--ts~~~-v-----~~~f~~~s~~ykaA~~~fsQ~ 67 (331) +++.-.+.++......-.. .-++ .-+++++....... .+ ++.++ + ...|..+...+.... . T Consensus 389 ~s~~~~g~~i~~~~as~~~-s~ln~~~~V~Gt~aa~~~~d~~t~~~v~s~~~alp~~a~sv~laGG~dg~v~v~-----~ 462 (742) T protein:vir:58 389 ISNQPYGFNIQDSRHSYWL-SPFKDDELIIGTELVLPALDVSTEFGVSSWEEALPEFSFLMPFQGGSDGYIRVD-----E 462 (742) T ss_pred EEecccCcceeccCcceEE-eccCCceEEEeehhhccccccchheeccccccccceeeEEEeecCCcccccccc-----C Confidence 3333333333222111000 0111 12222221110000 00 00000 0 000000000000000 0 Q ss_pred CCcceEEE-Eecc----chhHHHHHHHhhcCceeEEEEecCCHHH-HHHHHHHHHh-cCcEEEEEEeC---ChHH-HH-- Q lcl|Aclame:pro 68 DRPDTVAV-ITYE----DTKLLEAAEAYFLKSWHFALLAEFKAAD-ALALSNLIEE-QKFKFAVFQVT---AVAD-IT-- 134 (331) Q Consensus 68 ~~p~~v~v-~~~~----~~~~~~al~~~~~~~~~f~~~~~~~~~~-i~alA~w~ea-~~~~~~~~~~t---~~~~-~~-- 134 (331) ..+..+.. ...+ .-+-+.++... ...-.+++.+.+..+ ..++.+.++. +++++...... +... +. T Consensus 463 ~~~D~iG~~~~~d~~~adrTGL~ALlev--~eVtILiAPG~t~~~v~aav~A~la~a~~Rl~vL~D~P~~~tt~~~A~a~ 540 (742) T protein:vir:58 463 NEPDTIGRVKITPALLANYERLLPLLTE--DQFDLVLTPYLTFADHAGTVNAFINRAENRFLYLFDIAGDDDTENLAISL 540 (742) T ss_pred CCcccccccccccccccchhHHHHhhhc--CCCcEEEEcCCCchHHHHHHHHHHHhhcCCeEEEEecCCCCchHHHHHHH Confidence 00000000 0000 00111222211 111223333333222 2333444443 22222222111 1111 11 Q ss_pred -hhcccceEEEEEe----CCC--ch--hHHHHHHHHHhcCCccceeeeeeeccCCcCCCCCCHHHHHHHHhCCCeEEEEE Q lcl|Aclame:pro 135 -PLAKNTRTIAIVH----SKT--GE--KLDAALIGNVASLPVGSATWKGRHGLAGITSEELKVSEIDAIQKAGGMCYIEK 205 (331) Q Consensus 135 -~~~~~~~t~~~~~----~~~--~~--~~~aa~~g~~~~~~~G~~t~~~k~~l~gv~~~~~t~t~~~~l~~~~~n~y~~~ 205 (331) ...+..+.++.|. ... .. -+.++++|.++..+.-.--|+--.+...+.....+++|.+.|..+|+|+..+. T Consensus 541 r~~~nSsraaly~PwVkv~d~~~~r~vPpSgaIAGL~ARtD~erGvw~SPANrgii~~~~~s~se~d~LN~~GINtIrsf 620 (742) T protein:vir:58 541 AGYINSSFATTFFPWVRRLTNKGMRTVPASLAAYRSIRTTDPETGLAPVGARRGVVTGEPVRQVDWEDLYNNRINPIVRV 620 (742) T ss_pred HhccCCceEEEEeceeeeccCCcceeechHHHHHHHHHHhccCCceEecCCcceeeeccccchhhHHHHhhCCceEEEEC Confidence 1112233332221 000 11 23555666665544321123221121223334577899999999999999887 Q ss_pred cCeeEEecCEEeCC-----ceehhhHHHHHHHHHHHHHHHHHHhcCCCCCcCHHHHHHHHHHHHHHHHHHHhcCcccccc Q lcl|Aclame:pro 206 AGIAQTSEGKTVSG-----EFIDSIHGDDWIKATIETRLQKLLTETDKLTFDARGIALLQSELTTVLNEGFANGIIDSND 280 (331) Q Consensus 206 ~g~~~~~~G~~~~G-----~~iD~~~~~dwl~~~iq~~l~~l~~~~~kipyt~~G~~~i~~~v~~vl~~~~~~G~I~~g~ 280 (331) ++--.+..++++.+ .||-+.|-.+|++..|+..+...+-. |.++.-...|+..++.-|+..+++|.|. T Consensus 621 G~G~rlWGnRTlassDs~wryInVRRlfd~Ie~SI~~a~q~~VfE----PNd~~L~~sIk~sInafL~~L~aqGALl--- 693 (742) T protein:vir:58 621 GNDVLLFGQKTMLNVNSALNRINVRRLLIVMRNRISQILSSYLFE----NNTSENRLRAEALVRQYLESLRLRGAVT--- 693 (742) T ss_pred CCcEEEEcceecCCCCcccceEeehhhHHHHHHHHHHHHHHhccC----CCCHHHHHHHHHHHHHHHHHHHhCCcee--- Confidence 54446667777644 37999999999999999888755332 6688888999999999999999999996 Q ss_pred cCCCcceEEEccchhcCCHHHHHhcccCCeEEEEEEcceEEEEEEEEEEeC Q lcl|Aclame:pro 281 ETGEPNFSITALQRSDLNDDDIAKRNYKGLSFRYKRSGAIHSVDVYGEVEV 331 (331) Q Consensus 281 ~~~~~~~~v~~~~~~~~~~~dr~~R~~~~i~~~~~~aGaIh~v~i~~~v~~ 331 (331) ||.|++. ++.+++|+.+.++. +.+.+...-.++.|+++..++- T Consensus 694 -----GfrV~lD--etNTpeDI~~Gklv-v~I~vAP~~PAEfI~lrf~it~ 736 (742) T protein:vir:58 694 -----DYEVAID--SVTTPTDIDNNTLR-ARVTVQPARSIEYIDITFVITP 736 (742) T ss_pred -----eeEEEEc--CCCCHHHhhCCEEE-EEEEEEccCCcceEEEEEEEEe Confidence 3889886 35778888888875 8888899999999998887765 No 36 >protein:vir:80779 Length: 569 # NCBI annotation: putative tail sheath protein # Family: family:all:2449 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504132;genbank:gi:158079319;genbank:GeneID:5666428 Probab=98.56 E-value=1.1e-07 Score=58.71 Aligned_cols=303 Identities=10% Similarity=0.069 Sum_probs=153.7 Q ss_pred CCCcee-------eEEEEEeec--ccccccccceeEEEEccCCcceEEEechhhhcc--------CCC--CChHHHHHHH Q lcl|Aclame:pro 1 MVETIT-------DVRVHISVL--YPSPRIGLGRPAIFVKGTAMGYKEYTTLEELKD--------TFA--DNTEVYAKAK 61 (331) Q Consensus 1 ~v~~i~-------dV~v~i~~~--~~~~~~~fg~~li~~~~~~~~~~~yts~~~v~~--------~f~--~~s~~ykaA~ 61 (331) -+..++ +.+...... ........-....+...+...+.. ....++.. ++. .+.++-..+. T Consensus 209 ~~~~lv~~~~~~~~f~a~~~~~~~~~~~~~~~d~~~~~~~~t~~~~~~-~~~~di~~~~~~~~~v~~~~~~~~~l~~~~~ 287 (569) T protein:vir:80 209 ETNVLVSAINSLPDWEAKFFPIGDKNLPTDALEAVTKVDVKTEAVFVG-ALAGDIAKQLEYNDYVTVAVDATKPVEDFEL 287 (569) T ss_pred hhhhhhhhcCCccCceEEEEecCCCcceehhccchhheeccccceeee-hhHHHHHHhhcCCceEEEEecCCcceeeecc Confidence 011111 011111000 000000000000000000000000 00011110 000 0001111111 Q ss_pred HHHccCCCcceEEEEeccchhHHHHHHHhhcCceeEEEEecCCHHHHHHHHHHHHhc----CcEEEEEE---eCChHHHH Q lcl|Aclame:pro 62 AVFLQKDRPDTVAVITYEDTKLLEAAEAYFLKSWHFALLAEFKAADALALSNLIEEQ----KFKFAVFQ---VTAVADIT 134 (331) Q Consensus 62 ~~fsQ~~~p~~v~v~~~~~~~~~~al~~~~~~~~~f~~~~~~~~~~i~alA~w~ea~----~~~~~~~~---~t~~~~~~ 134 (331) ..|..+..+. ...+..+++..+....|.++.+...+.+.+.++..|++.. ++++.++. ..+.+... T Consensus 288 ~~LtGG~dG~-------~~~~~~~~l~~le~~~~~~i~~~t~d~av~~~l~a~vkr~r~~g~~~~aVvg~~~~~~~~~~~ 360 (569) T protein:vir:80 288 TNLTGGSDGT-------APESWANKFPLLANEGGYYLVPLTDKQAVHSEALAFVKDRTDNGDPMRIIVGGGTNETVEESI 360 (569) T ss_pred eeecCCCCCC-------ccchHHHHHHHHhhCCcEEEEecCCChHHHHHHHHHHHHHHhCCCcEEEEecCCCCCCHHHHH Confidence 2222222211 1124455666665667777777666677778899999753 34555543 23333332 Q ss_pred ---hhcccceEEEEEeC------CC--chh----HHHHHHHHHhcCCcc-ceeeeeeeccCCcCCCCCCHHHHHHHHhCC Q lcl|Aclame:pro 135 ---PLAKNTRTIAIVHS------KT--GEK----LDAALIGNVASLPVG-SATWKGRHGLAGITSEELKVSEIDAIQKAG 198 (331) Q Consensus 135 ---~~~~~~~t~~~~~~------~~--~~~----~~aa~~g~~~~~~~G-~~t~~~k~~l~gv~~~~~t~t~~~~l~~~~ 198 (331) ...++.|...+... +. ..+ .++.++|..++..+. +.|++=.. +.++. ..++.+|++.+.++| T Consensus 361 ~~a~~~n~e~vv~v~~~~~~~~~~g~~~~~~~~~~aa~vAG~~A~~~~~~S~T~k~i~-~~~i~-~~lt~~e~~~li~~G 438 (569) T protein:vir:80 361 TRATNLRDPRASLVGFSGTRKMDDGRLLKLPGYMMASQIAGIASGLEVGEAITFKHFN-VTSVD-RVFESSQLDMLNESG 438 (569) T ss_pred HHHhhcCCCeEEEEecCceeecCCCcceeechhhHHHHHHHHHhcCccccCccceeec-ccccc-ccCCHHHHHHHHhCC Confidence 22344554333211 00 122 234445555555543 45543221 23443 368999999999999 Q ss_pred CeEEEEEcCee----EEecCEEeCC-----ce--ehhhHHHHHHHHHHHHHHHHHHhcCCCCCcCHHHHHHHHHHHHHHH Q lcl|Aclame:pro 199 GMCYIEKAGIA----QTSEGKTVSG-----EF--IDSIHGDDWIKATIETRLQKLLTETDKLTFDARGIALLQSELTTVL 267 (331) Q Consensus 199 ~n~y~~~~g~~----~~~~G~~~~G-----~~--iD~~~~~dwl~~~iq~~l~~l~~~~~kipyt~~G~~~i~~~v~~vl 267 (331) .+++....+.. ..-++.+.-+ .| |-+++-.|.+...++..+-+.++. | |-++.|...|++.+++.| T Consensus 439 ~~~l~~~~~~~~~v~~~vn~itT~t~~~~~~~~~i~viRv~D~i~~dir~~~~~~yiG--k-~nn~~~r~~v~~~i~~~L 515 (569) T protein:vir:80 439 VISIEFVRNRTLTAFRVVQDVTTYNDKSDPVKNEMSVGEANDFLVSELKIELDNNFIG--T-KVIDTSASLIKNFIQSFL 515 (569) T ss_pred eEEEEEecCceEEEEEEeccceecCCCCCchhhhhhhhHHHHHHHHHHHHHHHhhcCc--c-cCChhHHHHHHHHHHHHH Confidence 99997755432 2234554422 24 889999999999999888777764 4 568889999999999999 Q ss_pred HHHHhcCcccccccCCCcceEEEccchhcCCHHHHHhcccCCeEEEEEEcceEEEEEEEEEEeC Q lcl|Aclame:pro 268 NEGFANGIIDSNDETGEPNFSITALQRSDLNDDDIAKRNYKGLSFRYKRSGAIHSVDVYGEVEV 331 (331) Q Consensus 268 ~~~~~~G~I~~g~~~~~~~~~v~~~~~~~~~~~dr~~R~~~~i~~~~~~aGaIh~v~i~~~v~~ 331 (331) ++..+.|.|..... . ...+.. ..| |. -+.|.+.+.-++++|.++.++.- T Consensus 516 ~~l~~~gaI~~~~~-~--dv~v~~-------~~d---~~--~v~~~v~Pv~~~ekI~~ti~~~~ 564 (569) T protein:vir:80 516 DNKKRAREIQDYTP-E--EVQVVL-------EGD---VA--SISMTVMPIRSLNKITVQLVYKQ 564 (569) T ss_pred HHHHhCCcccCCCc-c--ceEEEe-------cCC---EE--EEEEEEEEcccccEEEEEEEEee Confidence 99999999963211 1 111111 111 22 37888899999999999988877 No 37 >protein:vir:100323 Length: 393 # NCBI annotation: tail sheath protein FI # Family: family:all:115 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655492;genbank:gi:109289960;genbank:GeneID:4157366 Probab=98.55 E-value=3e-07 Score=56.35 Aligned_cols=314 Identities=11% Similarity=0.062 Sum_probs=193.5 Q ss_pred CCCcee-eEEEEEeeccc--ccccccceeEEEEccCCc--------ceEEEechhhhccCCCCChHHHHHHHHHHccCCC Q lcl|Aclame:pro 1 MVETIT-DVRVHISVLYP--SPRIGLGRPAIFVKGTAM--------GYKEYTTLEELKDTFADNTEVYAKAKAVFLQKDR 69 (331) Q Consensus 1 ~v~~i~-dV~v~i~~~~~--~~~~~fg~~li~~~~~~~--------~~~~yts~~~v~~~f~~~s~~ykaA~~~fsQ~~~ 69 (331) |.+... .|.|.-....+ ........+.++++.+.. .....++..+....|+....++.+...+|.|+.. T Consensus 3 m~~~~~~GV~v~e~~~g~~~i~~~~tav~~~vgta~~~~~~~~pln~pv~i~s~~~~~~~~g~~g~L~~al~~~~~~~~~ 82 (393) T protein:vir:10 3 ILDTYLHGVEVVEVNAGGVTISTAATSVIGVVCTGDQADAETFPLNTPVLITNPLNYLEKAGSTGTLRRTLNSIGSIVKT 82 (393) T ss_pred CCCccCCCeEEEEcCCCcceecccCcceeEEEeeccCcCcccccCccceEecchHHHHHhhCCccchhhhhhhhhcccCc Confidence 888764 67775543333 233455666666654432 2344667777777788888888888889988765 Q ss_pred cceEEEEec-cch----------------hHHHHHHHhhcCc-e--eEEEEecC-CHHHHHHHHHHHHhcCcEEEEEEeC Q lcl|Aclame:pro 70 PDTVAVITY-EDT----------------KLLEAAEAYFLKS-W--HFALLAEF-KAADALALSNLIEEQKFKFAVFQVT 128 (331) Q Consensus 70 p~~v~v~~~-~~~----------------~~~~al~~~~~~~-~--~f~~~~~~-~~~~i~alA~w~ea~~~~~~~~~~t 128 (331) ...+.-... ..+ +.+.++....... . ..+...+. +.....++...++.-+.++++.... T Consensus 83 ~~~vv~v~~~~~~~~t~~~iig~~~~~~~tgl~al~~~~~~~~~~p~li~apg~~~~~~~~al~~~~~~~~~~~~v~d~~ 162 (393) T protein:vir:10 83 PTVIVRVAESDDSDTLTANIVGTQENGKFTGIKALLTAQSTVFVKPKLLCVPQHDNQAVATELLSVAKKLNAFAFISDNG 162 (393) T ss_pred eEEEeecccCccccccccccccccccchhhHHHHHHhhhhhcceeeeeeeeccccchHHHHHHHHHhhccCcEEEEEcCC Confidence 443222111 111 1233333332211 1 12222333 3455567777777666666554322 Q ss_pred --ChHHHHh---hcccceEEEEEeC------CCc----hhHHHHHHHHHhcCCcccee---eeeeeccCCcCCC------ Q lcl|Aclame:pro 129 --AVADITP---LAKNTRTIAIVHS------KTG----EKLDAALIGNVASLPVGSAT---WKGRHGLAGITSE------ 184 (331) Q Consensus 129 --~~~~~~~---~~~~~~t~~~~~~------~~~----~~~~aa~~g~~~~~~~G~~t---~~~k~~l~gv~~~------ 184 (331) ....+.. ..+..+..+++-. ... ..+.+.++|.++..+.-.-- ...| .|.|+... T Consensus 163 ~~t~~~ai~~~~~~~s~~~~~~~P~~~~~d~~~~~~~~~p~s~~~Ag~~a~~d~~~G~~~spaN~-~l~gi~~~~~~~~~ 241 (393) T protein:vir:10 163 ATTKEQAYTYRQNFSQREGMMIFGDWKSYNTDKKAYDTDYAVARACALQAYIDKTVGWHKNISNV-ELDGVTGITKAVEF 241 (393) T ss_pred CCCHHHHHHHhhhcCCceEEEEecccccccccCCceeEeehhHHHHHHHHHhhcCCCcEEccCCc-eeeceeecceeccc Confidence 2222222 1223344443321 011 23566677766655542222 2333 35555432 Q ss_pred --CCCHHHHHHHHhCCCeEEEEEcCeeEEecCEEeCC----ceehhhHHHHHHHHHHHHHHHHHHhcCCCCCcCHHHHHH Q lcl|Aclame:pro 185 --ELKVSEIDAIQKAGGMCYIEKAGIAQTSEGKTVSG----EFIDSIHGDDWIKATIETRLQKLLTETDKLTFDARGIAL 258 (331) Q Consensus 185 --~~t~t~~~~l~~~~~n~y~~~~g~~~~~~G~~~~G----~~iD~~~~~dwl~~~iq~~l~~l~~~~~kipyt~~G~~~ 258 (331) .++++|.+.|..+|+|++.+..|. .+..++++++ .||-+.+-.+|++..|+..+...+- + |.++.=... T Consensus 242 ~~~~~~~~~~~Ln~~gI~t~~~~~G~-~~wG~rT~s~d~~~~~i~vrR~~~~i~~~i~~~~~~~v~---e-~~~~~~~~~ 316 (393) T protein:vir:10 242 DINESSTEANYLNEKGITICLNHNGF-RYWGSRTLATDTRWAFQQSVRTAQIIKETIGAGLAWAVD---M-PLTPLRVKT 316 (393) T ss_pred ccCCCcchhHhHhhcCceEEEcCCCE-EEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhcc---C-CCCHHHHHH Confidence 245789999999999998765443 4556677776 3889999999999999988775443 2 668888889 Q ss_pred HHHHHHHHHHHHHhcC--cccccccCCCcceEEEccchhcCCHHHHHhcccCCeEEEEEEcceEEEEEEEEEEeC Q lcl|Aclame:pro 259 LQSELTTVLNEGFANG--IIDSNDETGEPNFSITALQRSDLNDDDIAKRNYKGLSFRYKRSGAIHSVDVYGEVEV 331 (331) Q Consensus 259 i~~~v~~vl~~~~~~G--~I~~g~~~~~~~~~v~~~~~~~~~~~dr~~R~~~~i~~~~~~aGaIh~v~i~~~v~~ 331 (331) ++..++.-|+..++.| .|. ++++.+.+ +.+++|..+.+.. +.+.+...-.+++|++....+. T Consensus 317 i~~~i~~~L~~l~~~g~~al~--------g~~v~~~~--~nt~~~i~~G~~~-~~i~~~p~~p~e~I~~~~~~~~ 380 (393) T protein:vir:10 317 MLEAINNKLRSWASGDDPRIL--------GARVWVAE--EITADIIKSGKFV-IKYDYHWIPSLESLGLEQRVND 380 (393) T ss_pred HHHHHHHHHHHHHhccccccc--------cceEEecC--CCCHHHhhCCEEE-EEEEEEecCCcceEEEEEEEch Confidence 9999999999988866 343 35676654 4778888887775 8899999999999999999998 No 38 >protein:vir:79092 Length: 477 # NCBI annotation: gp13, phage tail sheath protein # Family: family:all:115 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111213;genbank:gi:134288804;genbank:GeneID:4960766 Probab=98.44 E-value=3.6e-07 Score=55.90 Aligned_cols=304 Identities=12% Similarity=0.099 Sum_probs=145.8 Q ss_pred CCCceeeEEEEEeecccccccccceeEEEE--ccCCcceEEEechhhhccCCCCChHHHHHHHHHHccCC-C-cc-eEEE Q lcl|Aclame:pro 1 MVETITDVRVHISVLYPSPRIGLGRPAIFV--KGTAMGYKEYTTLEELKDTFADNTEVYAKAKAVFLQKD-R-PD-TVAV 75 (331) Q Consensus 1 ~v~~i~dV~v~i~~~~~~~~~~fg~~li~~--~~~~~~~~~yts~~~v~~~f~~~s~~ykaA~~~fsQ~~-~-p~-~v~v 75 (331) -........+.+...........+....+. .......... +.+.. .......+.+.. . +. ...+ T Consensus 109 ~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~---~~~~~~~~~~~~~~~~~~~~~~ 177 (477) T protein:vir:79 109 KLAHPAAANLVLKNDSGGTTYTEGTDYAVDLINGVITRIKTG--------TIPAA---ATAAKATYDYADPTKVTAADII 177 (477) T ss_pred cccccccceeEEeecccccccccCccccccccchhhhhhhcc--------ccccc---cceeeceeccCCcccceeeeec Confidence 001111111111111100000001100000 0000000000 00000 000000000000 0 00 0011 Q ss_pred Eeccc---hhHHHHHHHhhcC---ceeEEEEecC--CHHHHHHHHHHHHhcCcEEEEEEeCC---hHHHHhh-------- Q lcl|Aclame:pro 76 ITYED---TKLLEAAEAYFLK---SWHFALLAEF--KAADALALSNLIEEQKFKFAVFQVTA---VADITPL-------- 136 (331) Q Consensus 76 ~~~~~---~~~~~al~~~~~~---~~~f~~~~~~--~~~~i~alA~w~ea~~~~~~~~~~t~---~~~~~~~-------- 136 (331) +..+. .+...++...... ....+..... ...-..++...++.. +.+.+++... ...+... T Consensus 178 g~~~a~~~~tg~~al~~~~~~~~~~~~iv~apg~~~~~~v~~~l~~~~~~~-~~~a~~d~p~~~~~~~~~~~~~~~~~~~ 256 (477) T protein:vir:79 178 GAVNAAGMRTGMKALKDTYNLYGYFSKILIAPAYCTQNSVSVELEAMAVQL-GAIAYIDAPIGTTLAQALAGRGPAGTIN 256 (477) T ss_pred ccccccccchhhhhhhhhhhhcccccceeeccccccchhHHHHHHHHHhhc-CeEEEEecCCCCChHHHhhhhhhccccc Confidence 11000 1111222211110 0011111111 112223333333322 3344443221 1111100 Q ss_pred --cccceEEEEEe------CCCc---h-hHHHHHHHHHhcCCc--c-ceeeeeeeccCCcCC---C-----CCCHHHHHH Q lcl|Aclame:pro 137 --AKNTRTIAIVH------SKTG---E-KLDAALIGNVASLPV--G-SATWKGRHGLAGITS---E-----ELKVSEIDA 193 (331) Q Consensus 137 --~~~~~t~~~~~------~~~~---~-~~~aa~~g~~~~~~~--G-~~t~~~k~~l~gv~~---~-----~~t~t~~~~ 193 (331) .+..|..+.|. +..+ . .+.+.++|.++..+. | ......+ .+.||.. + ..+++|.+. T Consensus 257 ~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~s~~~ag~~a~~d~~~g~~~span~-~~~gv~~~~~~~~~~~~~~~~~~~~ 335 (477) T protein:vir:79 257 FNTSSDRVRLCYPHVKVYDIATNAERLEPLSSRAAGLRARVDLDKGYWWSSSNQ-QLVGVTGVERPLSAMIDDPQSDVNM 335 (477) T ss_pred cccccceEEEEcCeeEEecccCCceeeechHHHHHHHHHHhhccCCceEccCCc-eeecceecccccccccCCChhhHHH Confidence 11223333221 1111 1 345667776655443 3 1122333 2445432 1 235789999 Q ss_pred HHhCCCeEEEEEcCee-EEecCEEeCC-------ceehhhHHHHHHHHHHHHHHHHHHhcCCCCCcCHHHHHHHHHHHHH Q lcl|Aclame:pro 194 IQKAGGMCYIEKAGIA-QTSEGKTVSG-------EFIDSIHGDDWIKATIETRLQKLLTETDKLTFDARGIALLQSELTT 265 (331) Q Consensus 194 l~~~~~n~y~~~~g~~-~~~~G~~~~G-------~~iD~~~~~dwl~~~iq~~l~~l~~~~~kipyt~~G~~~i~~~v~~ 265 (331) |.++|+|.+.++.|.. .+..++++.+ .||-+.+-.+|+...|+..+...+-. |.+..-...|+..++. T Consensus 336 L~~~~i~~i~~~~~~G~~~wG~rT~~~~~~~~~~~~i~vrR~~~~i~~~~~~~~~~~v~e----~~~~~~~~~i~~~i~~ 411 (477) T protein:vir:79 336 LNEQGITTVFSSYGSGLRLWGNRTAAWPTVTHMRNFENVRRTGDVINESLRYFSQQFVDA----PIDQGLIDSLVESVNG 411 (477) T ss_pred HhhCCceEEEEecCCcEEEEcccccCCCCCCccceeeehhhHHHHHHHHHHHHHHHhccC----CCCHHHHHHHHHHHHH Confidence 9999999999987654 5566777632 26889999999999999988865432 4477778999999999 Q ss_pred HHHHHHhcCcccccccCCCcceEEEccchhcCCHHHHHhcccCCeEEEEEEcceEEEEEEEEEEeC Q lcl|Aclame:pro 266 VLNEGFANGIIDSNDETGEPNFSITALQRSDLNDDDIAKRNYKGLSFRYKRSGAIHSVDVYGEVEV 331 (331) Q Consensus 266 vl~~~~~~G~I~~g~~~~~~~~~v~~~~~~~~~~~dr~~R~~~~i~~~~~~aGaIh~v~i~~~v~~ 331 (331) -|++.++.|.|. +|+|++ ..++.|++|+.+.++. +.+.+.....+++|.+....+. T Consensus 412 ~l~~l~~~g~l~--------g~~v~~-~~~~nt~~~i~~G~~~-~~i~~~p~~p~e~i~~~~~~~~ 467 (477) T protein:vir:79 412 FGRKLIGDGALL--------GFKAWF-DPARNPKEELAAGHLL-INYKYTVPPPLERLTYETEITS 467 (477) T ss_pred HHHHHHhCCcee--------eeEEEE-ecCCCCHHHhhCCeEE-EEEEEEecCCceeEEEEEEEec Confidence 999999999996 478887 4467899999999885 8999999999999999999988 No 39 >protein:vir:96740 Length: 388 # NCBI annotation: phage tail sheath protein FI-like # Family: family:all:115 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039839;genbank:gi:126010913;genbank:GeneID:5076250 Probab=98.40 E-value=8.4e-07 Score=53.90 Aligned_cols=314 Identities=14% Similarity=0.051 Sum_probs=186.9 Q ss_pred CCCceeeEEEEEeecc--cccccccceeEEEEccCCcc-------eEEEechhh---hccCCCCChHHHHHHHHHHccCC Q lcl|Aclame:pro 1 MVETITDVRVHISVLY--PSPRIGLGRPAIFVKGTAMG-------YKEYTTLEE---LKDTFADNTEVYAKAKAVFLQKD 68 (331) Q Consensus 1 ~v~~i~dV~v~i~~~~--~~~~~~fg~~li~~~~~~~~-------~~~yts~~~---v~~~f~~~s~~ykaA~~~fsQ~~ 68 (331) |-+=.-.|.|.--... ++..+....+.++++....+ .....+..+ +.........++.+...+|.|+. T Consensus 4 ~~~~~hGv~v~ev~~g~~~i~~~~tavi~~Vgta~~ad~~~p~~~~~~i~~~~d~~~~~~~~~~~gtl~~al~~~~~~~~ 83 (388) T protein:vir:96 4 IDQFEHNGISIETHEPPPPMGPPGDNVVAWVVTAPDKHADVAFSVPFRVANTADAQYLDSTGNELGTGWHAASETLKKTS 83 (388) T ss_pred CCCCCCceEEEEcCCCcccccccCcceeEEEEecCCCccccccccceeeecchhhhhhhccccccccchhhhHhhhccCC Confidence 3332345555442222 34445677777777653321 122223322 33334455567788888999886 Q ss_pred CcceEEEEec-cch------------------hHHHHHHHhhcCceeEEEEecCCH--HHHHHHHHHHHhcCcEEEEEEe Q lcl|Aclame:pro 69 RPDTVAVITY-EDT------------------KLLEAAEAYFLKSWHFALLAEFKA--ADALALSNLIEEQKFKFAVFQV 127 (331) Q Consensus 69 ~p~~v~v~~~-~~~------------------~~~~al~~~~~~~~~f~~~~~~~~--~~i~alA~w~ea~~~~~~~~~~ 127 (331) .+..+..... ..+ +...++... ...--.+++.+.+. .-..++...++.- +.|.+++. T Consensus 84 ~~~~vv~v~~g~~~~at~a~iig~~~~~tg~~~gl~al~~~-~~~p~il~aPg~s~~~~v~~al~~~~~~~-~~~~i~D~ 161 (388) T protein:vir:96 84 VPQYFIVVPEGADDAATMANIIGGIDPTTGRRTGIAALTEC-TERPTLIGAPGFSQNKAVIDALASMAKRL-KCRAVIDG 161 (388) T ss_pred ceEEEEEeccccccccccceeeeecccccchhhHHHHhhhc-ccceeEEEeeccccchHHHHHHHHHHhhc-CcEEEEec Confidence 5543322111 110 011112111 11112333334322 2334566666543 34555543 Q ss_pred CC--hHHHH---hh-----cccceEEEEEe------CCCc----hhHHHHHHHHHhcCCccceeeeeee-ccCCcCC--- Q lcl|Aclame:pro 128 TA--VADIT---PL-----AKNTRTIAIVH------SKTG----EKLDAALIGNVASLPVGSATWKGRH-GLAGITS--- 183 (331) Q Consensus 128 t~--~~~~~---~~-----~~~~~t~~~~~------~~~~----~~~~aa~~g~~~~~~~G~~t~~~k~-~l~gv~~--- 183 (331) .. ..... .. .+..|.+..|- +... ..+++.++|..+..++ ......|. .+.|+.. T Consensus 162 p~~~~~~~~~~~~~~~~~~~~s~~~~~~~P~~~~~d~~~~~~~~~p~s~~~AG~~a~~D~-~~spaN~~i~i~g~~~~~~ 240 (388) T protein:vir:96 162 PSGSTQDAIDLSGLLGGEGTGHDRVYMVDPMPAIYSRKAQGNIYVPPSTIAMGAVAAVKP-WESPGNQGVLIQDVARVID 240 (388) T ss_pred cCCchhHHHHHHhhhhccCcCcceEEEEeCceeeecccCCceeeechHHHHHHHHHhhcC-cccccCeeEEeeeeccccc Confidence 21 11111 11 12234443331 1111 1456777777776665 22222321 1234322 Q ss_pred --CCCCHHHHHHHHhCCCeEEEEEcCe-eEEecCEEeCCceehhhHHHHHHHHHHHHHHHHHHhcCCCCCcCHHHHHHHH Q lcl|Aclame:pro 184 --EELKVSEIDAIQKAGGMCYIEKAGI-AQTSEGKTVSGEFIDSIHGDDWIKATIETRLQKLLTETDKLTFDARGIALLQ 260 (331) Q Consensus 184 --~~~t~t~~~~l~~~~~n~y~~~~g~-~~~~~G~~~~G~~iD~~~~~dwl~~~iq~~l~~l~~~~~kipyt~~G~~~i~ 260 (331) ..++.+|.+.|..+|+|++.++.+. ..+..+++++..||-+.+..+|++..|+..+...+- + |.++.=...|+ T Consensus 241 ~~~~~~~~~~~~Ln~~gI~~i~~~~~~G~~~wG~rT~~~~~i~vrR~~~~i~~si~~~~~~~v~---e-pn~~~~~~~i~ 316 (388) T protein:vir:96 241 YNILDKSTEGDLLNRNGVSYFARTSMGGFSLIGNRTVTGKFISFVGLEDAIARKLEAASQRAMS---K-QLTKSFMEQEI 316 (388) T ss_pred ccccCChhhHHhhhhcCceEEEEecCCcEEEEcccccCCcceeehhhHHHHHHHHHHHHHHhcc---C-CCCHHHHHHHH Confidence 2347789999999999999998654 456777888889999999999999999988775432 2 66888889999 Q ss_pred HHHHHHHHHHHhcCcccccccCCCcceEEEccchhcCCHHHHHhcccCCeEEEEEEcceEEEEEEEEEEeC Q lcl|Aclame:pro 261 SELTTVLNEGFANGIIDSNDETGEPNFSITALQRSDLNDDDIAKRNYKGLSFRYKRSGAIHSVDVYGEVEV 331 (331) Q Consensus 261 ~~v~~vl~~~~~~G~I~~g~~~~~~~~~v~~~~~~~~~~~dr~~R~~~~i~~~~~~aGaIh~v~i~~~v~~ 331 (331) ..++.-|+..+++|.|.. |++++ ..+..|++|+.+.+.. +.+.+.....+++|++...++. T Consensus 317 ~~i~~fL~~l~~~Gal~g--------~~~~~-d~~~nt~~~i~~G~~~-~~i~~~p~~pae~I~~~~~~~~ 377 (388) T protein:vir:96 317 KKINLFMQDLVAAEIIPG--------GEVYL-HPTLNTVERYKNGSWY-IVIDYGRYSPNEHMIFHLNAVD 377 (388) T ss_pred HHHHHHHHHHHhCCceee--------eEEEE-ecCCCCHHHhhCCEEE-EEEEEEecCCcceEEEEEEEch Confidence 999999999999999963 56776 4467899999988885 8899999999999999999988 No 40 >protein:vir:100829 Length: 607 # NCBI annotation: hypothetical protein # Family: family:all:2449 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164738;genbank:gi:56693151;genbank:GeneID:3197462 Probab=98.37 E-value=9.2e-07 Score=53.68 Aligned_cols=310 Identities=12% Similarity=0.077 Sum_probs=145.7 Q ss_pred CCCc-----------eeeEEEEEeecccccc-cccce-eEEEEccCC--------cceEEEe---chhhhccCCCCChHH Q lcl|Aclame:pro 1 MVET-----------ITDVRVHISVLYPSPR-IGLGR-PAIFVKGTA--------MGYKEYT---TLEELKDTFADNTEV 56 (331) Q Consensus 1 ~v~~-----------i~dV~v~i~~~~~~~~-~~fg~-~li~~~~~~--------~~~~~yt---s~~~v~~~f~~~s~~ 56 (331) ++.+ +-+.+++..-.-+... ..... .-++..... ..+...+ ..+++....... .+ T Consensus 220 l~~din~~~~~~A~~~g~~~i~tky~d~~~~~i~V~~~~~iv~a~~~D~~~~~~~~~~~~~t~~~~~~~~~~~~~~~-~~ 298 (607) T protein:vir:10 220 LMQAISATPNFSASVVGSPSVNTSYLDEVTSPVDVKTAPAVVTAKIGDAISKLGYDPYVVVTQTSNNKPIVNGVSAG-TG 298 (607) T ss_pred HHHHhhcCCceEEEEecccceeeeccccccceeEEEEeeeeechhhhhhhhcccccceEEeeecccchhhhhhhhcc-cc Confidence 0000 0011111110000000 00000 000000000 0000000 000010000000 00 Q ss_pred HHHHHHHHccCCCcce----EEEEeccc---hhHHHHHHHhhcCceeEEEEecCCHHHHHHHHHHHHhc---Cc-EEEEE Q lcl|Aclame:pro 57 YAKAKAVFLQKDRPDT----VAVITYED---TKLLEAAEAYFLKSWHFALLAEFKAADALALSNLIEEQ---KF-KFAVF 125 (331) Q Consensus 57 ykaA~~~fsQ~~~p~~----v~v~~~~~---~~~~~al~~~~~~~~~f~~~~~~~~~~i~alA~w~ea~---~~-~~~~~ 125 (331) +..+.....-...|.. ...++.+. .+..+++..+....|+++.+...+.+.+.++.+|++.. .+ +..++ T Consensus 299 ~~~~~~~~~~~~~~a~~a~~~LtGGtdG~~~~ty~dal~aLe~~e~~~i~~~t~d~ai~~~l~a~vkr~~~~g~~~~aVl 378 (607) T protein:vir:10 299 SATASVTTAPESFPANFDTAFLTGGSTGDVPVSWADKFNGAIGNNVYYIIPLTSEENIHAELQAFIDEQHVLGYNYHAFV 378 (607) T ss_pred ceeeeeeccccccccccceeeeeCCCCCCchhhHHHHHHHHhhcCceEEEecCCCHHHHHHHHHHHHHHHhCCCcEEEEe Confidence 0000000000011111 11122221 23455666655566777776666667778899999642 33 33333 Q ss_pred E---eCChHHH---HhhcccceEEEEEeC----CC---ch---h-HHHHHHHHHhcCCcc-ceeeeeeeccCCcCCCCCC Q lcl|Aclame:pro 126 Q---VTAVADI---TPLAKNTRTIAIVHS----KT---GE---K-LDAALIGNVASLPVG-SATWKGRHGLAGITSEELK 187 (331) Q Consensus 126 ~---~t~~~~~---~~~~~~~~t~~~~~~----~~---~~---~-~~aa~~g~~~~~~~G-~~t~~~k~~l~gv~~~~~t 187 (331) . ..+...+ ....++.|...+... +. .. | .++.++|..++..+. +.|++=.. +.++.+ .++ T Consensus 379 g~~~~~t~~~~~t~a~~~N~ervv~V~~~~~~~~~G~~~~~~~~~~Aa~vAGl~Ag~~~~~SlT~k~i~-~~~v~~-~lt 456 (607) T protein:vir:10 379 GGGFAEPLEQILSRQVNINDSRFGLVGQSGHVQEGGESVHVPAYLMAAYVGGLSSSLGVAVPITNKKLA-LVDLDQ-NFS 456 (607) T ss_pred cCCCCCCHHHHHHHHHhhCCCcEEEEecCeeEeeCCcceeccHHHHHHHHHHHHhcCccccCcccceec-cccccc-cCC Confidence 2 2233322 233456655433221 11 11 2 234444555555543 55544332 345543 699 Q ss_pred HHHHHHHHhCCCeEEEEEcC-----eeEEecCEEeCC-----c--eehhhHHHHHHHHHHHHHHHHHHhcCCCCCcCHHH Q lcl|Aclame:pro 188 VSEIDAIQKAGGMCYIEKAG-----IAQTSEGKTVSG-----E--FIDSIHGDDWIKATIETRLQKLLTETDKLTFDARG 255 (331) Q Consensus 188 ~t~~~~l~~~~~n~y~~~~g-----~~~~~~G~~~~G-----~--~iD~~~~~dwl~~~iq~~l~~l~~~~~kipyt~~G 255 (331) .+|++.+..+|..++....+ .....+|.+.-+ . .|-.++-+|.+...++..+-+.++ +|++. +.. T Consensus 457 ~~e~e~ai~~Gv~~l~~~~~~~~~~~vrIv~~ItT~t~~~~~~~~~i~viRv~D~i~~dir~~~~~~yI--Gk~nn-d~~ 533 (607) T protein:vir:10 457 GDDLNTLNQNGVIGIEHLVNRNATGGYYIVQDVSTNTVSSSHVDGSLYLGELTDFLFDNLRFVLRDTYI--GSNIR-STS 533 (607) T ss_pred HHHHHHHHhCCeEEEEEccCccccceEEEeeeeeeccCCCCcchheeehhhhHHHHHHHHHHHHhhcCC--cccCC-cch Confidence 99999999999998865432 234556655432 2 488999999999999988887776 44444 455 Q ss_pred HHHHHHHHHHHHH--HHHhcCcccccccCCCcceEEEccchhcCCHHHHHhcccCCeEEEEEEcceEEEEEEEEEEeC Q lcl|Aclame:pro 256 IALLQSELTTVLN--EGFANGIIDSNDETGEPNFSITALQRSDLNDDDIAKRNYKGLSFRYKRSGAIHSVDVYGEVEV 331 (331) Q Consensus 256 ~~~i~~~v~~vl~--~~~~~G~I~~g~~~~~~~~~v~~~~~~~~~~~dr~~R~~~~i~~~~~~aGaIh~v~i~~~v~~ 331 (331) ...++..+...|. +....|.|.. .+..+ +.+. .+.| +. -+.+.++..-+|++|.++..+.= T Consensus 534 ~~~vk~~i~~~L~~~~l~~~gaI~d-f~~ed----v~v~-----~~~D---~v--~v~~~v~Pv~~iekIyvtv~v~~ 596 (607) T protein:vir:10 534 ADDIKSTVASYLYSEMNNDDGLIVD-FSESD----IVVT-----ISGT---VV--YIQFAVAPTQEIKNIVVSGTYSN 596 (607) T ss_pred HHHHHHHHHHHHHHHHHHhcCceeC-CCccc----cEEe-----eCCC---EE--EEEEEEEEcccceEEEEEEEEEE Confidence 6778888888874 3455688852 11111 1111 1112 22 37888999999999988877765 No 41 >protein:vir:107865 Length: 477 # NCBI annotation: gp39 # Family: family:all:115 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024712;genbank:gi:48696949;genbank:GeneID:2845953 Probab=98.36 E-value=1.1e-06 Score=53.32 Aligned_cols=302 Identities=10% Similarity=0.053 Sum_probs=140.9 Q ss_pred CCCceeeEEEEEeecccccccccceeEEEEc-------------c--CCcceEEEech-------hhhccCCC-----CC Q lcl|Aclame:pro 1 MVETITDVRVHISVLYPSPRIGLGRPAIFVK-------------G--TAMGYKEYTTL-------EELKDTFA-----DN 53 (331) Q Consensus 1 ~v~~i~dV~v~i~~~~~~~~~~fg~~li~~~-------------~--~~~~~~~yts~-------~~v~~~f~-----~~ 53 (331) ....+......+................+.. . .......+... .++..+.. ++ T Consensus 109 ~~~~~~~~~~~v~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~tG 188 (477) T protein:vir:10 109 KLAHPAAANLVLKNDSGGTTYAEGTDYAVDLINGVITRIKTGTIPPGATAAKATYDYADPTKVTAADIIGAVNAAGMRTG 188 (477) T ss_pred cccccccccccccccccccccccchhhhhhhccccceecccccccccceeeeeccccccccccccccccccccccchhhh Confidence 0000000000010000000000000000000 0 00000000000 00000000 00 Q ss_pred hHHHHHHHHHHccCCCcceEEEEeccch-hHHHHHHHhhcCceeEEEEecCCHHHHHHHHHHHHhcCcEEEEEEeCChHH Q lcl|Aclame:pro 54 TEVYAKAKAVFLQKDRPDTVAVITYEDT-KLLEAAEAYFLKSWHFALLAEFKAADALALSNLIEEQKFKFAVFQVTAVAD 132 (331) Q Consensus 54 s~~ykaA~~~fsQ~~~p~~v~v~~~~~~-~~~~al~~~~~~~~~f~~~~~~~~~~i~alA~w~ea~~~~~~~~~~t~~~~ 132 (331) ......+...+. ..|..+.+-.+... ...+++.......--+..++........++-+|-+...+. T Consensus 189 l~al~~~~~~~~--~~~~~l~apg~~~~~~v~~~l~~~~~~~~~~~~~d~p~~~~~~~~~~~~~~~~~~----------- 255 (477) T protein:vir:10 189 MKALKDTYNLYG--YFSKILIAPAYCTQNSVSVELEAMAVQLGAIAYIDAPIGTTLAQALAGRGPAGTI----------- 255 (477) T ss_pred hhhhhhhhhhcc--hhcccccccccccchhhHHHHHHHHhhCCEEEEEecCCCCCHHHHHhhhhhcccc----------- Confidence 000011111111 11112221111111 1111121111111111122211111111122222211000 Q ss_pred HHhhcccceEEEEEe------CCCc---h-hHHHHHHHHHhcCCccc---eeeeeeeccCCcCC---C-----CCCHHHH Q lcl|Aclame:pro 133 ITPLAKNTRTIAIVH------SKTG---E-KLDAALIGNVASLPVGS---ATWKGRHGLAGITS---E-----ELKVSEI 191 (331) Q Consensus 133 ~~~~~~~~~t~~~~~------~~~~---~-~~~aa~~g~~~~~~~G~---~t~~~k~~l~gv~~---~-----~~t~t~~ 191 (331) ....+..|..+.|. +..+ . .+++.++|.++..+.-. .+...+ .+.||.. . ..+++|. T Consensus 256 -~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~ag~~a~~d~~~g~~~span~-~~~gi~~~~~~~~~~~~~~~~~~ 333 (477) T protein:vir:10 256 -NFNTSSDRVRLCYPHVKVYDTATNAERLEPLSSRAAGLRARVDLDKGYWWSSSNQ-QLVGVTGVERPLSAMIDDPQSDV 333 (477) T ss_pred -ccccccceEEEEcCeEEEecccCCceeEEchHHHHHHHHHHhhhcCCceeccCCc-eeccccccccccccccCCChhhH Confidence 00011222222221 1111 1 24566666665444322 222333 2444322 2 2367899 Q ss_pred HHHHhCCCeEEEEEcCee-EEecCEEeCC-------ceehhhHHHHHHHHHHHHHHHHHHhcCCCCCcCHHHHHHHHHHH Q lcl|Aclame:pro 192 DAIQKAGGMCYIEKAGIA-QTSEGKTVSG-------EFIDSIHGDDWIKATIETRLQKLLTETDKLTFDARGIALLQSEL 263 (331) Q Consensus 192 ~~l~~~~~n~y~~~~g~~-~~~~G~~~~G-------~~iD~~~~~dwl~~~iq~~l~~l~~~~~kipyt~~G~~~i~~~v 263 (331) +.|.++|+|++.++.|.. .+..++++.+ .||-+.+-.+|+...|+..+...+-. |.+..-...|+..+ T Consensus 334 ~~L~~~gi~~i~~~~~~G~~~wG~rT~~~~~~~~~~~~~~vrR~~~~i~~~~~~~~~~~v~~----~~~~~~~~~i~~~i 409 (477) T protein:vir:10 334 NMLNEQGITTVFSSYGSGLRLWGNRTAAWPTVTHMRNFENVRRTGDVINESLRYFSQQFVDA----PIDQGLIDSLVESV 409 (477) T ss_pred HHHhhCCceEEEEecCCcEEEEcccccCCCCCCcccceeehhhHHHHHHHHHHHHHHHhccC----CCCHHHHHHHHHHH Confidence 999999999999987654 5566677643 26888999999999999888764432 45777789999999 Q ss_pred HHHHHHHHhcCcccccccCCCcceEEEccchhcCCHHHHHhcccCCeEEEEEEcceEEEEEEEEEEeC Q lcl|Aclame:pro 264 TTVLNEGFANGIIDSNDETGEPNFSITALQRSDLNDDDIAKRNYKGLSFRYKRSGAIHSVDVYGEVEV 331 (331) Q Consensus 264 ~~vl~~~~~~G~I~~g~~~~~~~~~v~~~~~~~~~~~dr~~R~~~~i~~~~~~aGaIh~v~i~~~v~~ 331 (331) +..|+..++.|.|. +|+|++. .+..|++|+.++++. +.+.+.....+++|.+....+. T Consensus 410 ~~~l~~l~~~g~l~--------g~~v~~~-~~~nt~~~i~~G~~~-~~i~~~p~~p~e~i~~~~~~~~ 467 (477) T protein:vir:10 410 NGFGRKLIGDGALL--------GFKAWFD-PARNPKEELAAGHLL-INYKYTVPPPLERLTYETEITS 467 (477) T ss_pred HHHHHHHHhCCcee--------eeEEEEe-cCCCCHHHhhCCeEE-EEEEEEecCCcceEEEEEEEcc Confidence 99999999999996 4788884 467899999999995 8999999999999999999988 No 42 >protein:vir:80984 Length: 666 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469499;genbank:gi:157311456;genbank:GeneID:5602117 Probab=98.29 E-value=8e-07 Score=54.02 Aligned_cols=302 Identities=15% Similarity=0.055 Sum_probs=145.4 Q ss_pred CCCceeeE----------EEEEeecccc-cccc-----------cceeEEEEccCCc-ceEEEechhhhccCCCCChHHH Q lcl|Aclame:pro 1 MVETITDV----------RVHISVLYPS-PRIG-----------LGRPAIFVKGTAM-GYKEYTTLEELKDTFADNTEVY 57 (331) Q Consensus 1 ~v~~i~dV----------~v~i~~~~~~-~~~~-----------fg~~li~~~~~~~-~~~~yts~~~v~~~f~~~s~~y 57 (331) ..++-..+ +..++..... ...+ ....++....... ....+. -.+.++-....... T Consensus 272 ~~~~~~~~~v~~~g~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~g~~~~~~~~ 349 (666) T protein:vir:80 272 QNDNQYAFIVRRDGVVVESYVLSTLKGDKDVYGNSIYMDDFFGRGSSQYIYATAQGWVDGFSGI--ISLAGGVSANEATT 349 (666) T ss_pred ccccceeeEeccCCccceeeecccccccccccchhhhhhhhhccccceeeeecccccccccceE--EEecCCCCcccccc Confidence 11110000 0011100000 0000 0011111111000 000000 00000000000000 Q ss_pred HHHHHHHccCCCcceEEEEeccchhHHHHHHHhhcCceeEEEEecC------CHHHHHHHHHHHHhcCcEEEEEE----- Q lcl|Aclame:pro 58 AKAKAVFLQKDRPDTVAVITYEDTKLLEAAEAYFLKSWHFALLAEF------KAADALALSNLIEEQKFKFAVFQ----- 126 (331) Q Consensus 58 kaA~~~fsQ~~~p~~v~v~~~~~~~~~~al~~~~~~~~~f~~~~~~------~~~~i~alA~w~ea~~~~~~~~~----- 126 (331) . ..+..+ .++....-..+.++.+. ....+++.... ..+-..++...++....+|.++. T Consensus 350 ~------~~~~~~---~~g~~~~~~~~~~~~~~--~~~~~l~~p~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~ 418 (666) T protein:vir:80 350 G------GVGADP---FIGAMMQGWGLFAERES--IHVNLLIAGACAGEGDAFSTVQKHAVSIGDERQDCLVMVSPPRST 418 (666) T ss_pred c------cccccc---ccccchhhhhhhhhhcc--cccceEeecCcCCcccchHHHHHHHHHHHHhhcceEEEeecceeE Confidence 0 000000 00000000001111111 11122222221 11222344555555443333221 Q ss_pred ------eCChHHHHhhc-------------ccceEEEEEeCC-------Cch----hHHHHHHHHHhcCCccceee---e Q lcl|Aclame:pro 127 ------VTAVADITPLA-------------KNTRTIAIVHSK-------TGE----KLDAALIGNVASLPVGSATW---K 173 (331) Q Consensus 127 ------~t~~~~~~~~~-------------~~~~t~~~~~~~-------~~~----~~~aa~~g~~~~~~~G~~t~---~ 173 (331) .++..++.... +..+ ..+|++- .+. -+++.++|.++..+.-+-.| . T Consensus 419 ~vd~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~-~~l~~p~~~~~d~~~~~~~~~p~sg~~AGl~Ar~D~~~g~~~sPa 497 (666) T protein:vir:80 419 VVNIPVTTAIDNLIAWREGSGNYNENNMNINTTY-AVIDGNYKYQYDKYNDVNRWVPLAADIAGLCARTDAVSQPWMSPA 497 (666) T ss_pred EeecCCCCCHHHHHHHHHhcccchhhhcccCcce-EEEEcCceEEecccCCceeEechHHHHHHHHHHHhhcCCceEccC Confidence 11222221111 1122 2233321 111 24555666655443211122 2 Q ss_pred eeeccCCcC---C--CCCCHHHHHHHHhCCCeEEEEEcCe-eEEecCEEeCC-----ceehhhHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 174 GRHGLAGIT---S--EELKVSEIDAIQKAGGMCYIEKAGI-AQTSEGKTVSG-----EFIDSIHGDDWIKATIETRLQKL 242 (331) Q Consensus 174 ~k~~l~gv~---~--~~~t~t~~~~l~~~~~n~y~~~~g~-~~~~~G~~~~G-----~~iD~~~~~dwl~~~iq~~l~~l 242 (331) .|. +.|+. . -.+++.|.+.|..+|+|+..++.|. ..+..++++++ .||-+.+-.+|+...|++.+... T Consensus 498 n~~-~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~g~G~~~wG~rT~~~~~s~~~~i~vRRl~~~i~~si~~~~~~~ 576 (666) T protein:vir:80 498 GYN-RGQIMNVVKLAIEPRKAHRDRLYQAAINPVIGAGGEGFILMGDKTATTVPSPFDRINVRRLFNMLKKNIGDSSKYK 576 (666) T ss_pred Cee-cceeeccccceeecChhHHHhhhhCCeeEEEEeCCCeEEEEccccCCCCCcccceeehhhHHHHHHHHHHHHHHHh Confidence 332 33332 1 3578999999999999999998775 46677787765 26889999999999999888765 Q ss_pred HhcCCCCCcCHHHHHHHHHHHHHHHHHHHhcCcccccccCCCcceEEEccchhcCCHHHHHhcccCCeEEEEEEcceEEE Q lcl|Aclame:pro 243 LTETDKLTFDARGIALLQSELTTVLNEGFANGIIDSNDETGEPNFSITALQRSDLNDDDIAKRNYKGLSFRYKRSGAIHS 322 (331) Q Consensus 243 ~~~~~kipyt~~G~~~i~~~v~~vl~~~~~~G~I~~g~~~~~~~~~v~~~~~~~~~~~dr~~R~~~~i~~~~~~aGaIh~ 322 (331) +-. |.++.=...|+..+..-|++.+++|.|. +|.|++. .++.|++|+.+.++. +.+.+...-.+++ T Consensus 577 v~e----pn~~~l~~~i~~~i~~~L~~l~~~gal~--------g~~V~~d-~~~nt~~di~~G~~~-~~i~~~P~~Pae~ 642 (666) T protein:vir:80 577 LFE----NNDNFTRASFRMEVSQYLSTIRSLGGIY--------DFRVQCD-TTNNTPDVIDRNEFV-ASMFIKPAKSINY 642 (666) T ss_pred ccC----CCCHHHHHHHHHHHHHHHHHHHhcCcee--------eeEEEEc-CCCCCHHHhhCCeEE-EEEEEEecCCcce Confidence 432 4577778899999999999999999997 3889988 567899999999885 8999999999999 Q ss_pred EEEEEEEeC Q lcl|Aclame:pro 323 VDVYGEVEV 331 (331) Q Consensus 323 v~i~~~v~~ 331 (331) |.++..-.= T Consensus 643 I~~~~~~~~ 651 (666) T protein:vir:80 643 IMLNFTAVA 651 (666) T ss_pred EEEEEEEee Confidence 998865332 No 43 >protein:vir:104858 Length: 729 # NCBI annotation: T4-like tail sheath protein # Family: family:all:661 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214361;genbank:gi:61806001;genbank:GeneID:3294378 Probab=98.19 E-value=2.9e-06 Score=50.96 Aligned_cols=314 Identities=13% Similarity=0.063 Sum_probs=155.2 Q ss_pred CCCceeeEEEEEeecccccccccc------------eeEEEEccCCcceEE-EechhhhccCCCCChHHHHHHHHHHccC Q lcl|Aclame:pro 1 MVETITDVRVHISVLYPSPRIGLG------------RPAIFVKGTAMGYKE-YTTLEELKDTFADNTEVYAKAKAVFLQK 67 (331) Q Consensus 1 ~v~~i~dV~v~i~~~~~~~~~~fg------------~~li~~~~~~~~~~~-yts~~~v~~~f~~~s~~ykaA~~~fsQ~ 67 (331) --..++...++++.. +.....-+ ...++.......... -.........-......+.+....+. . T Consensus 317 ~~g~vve~~~~~s~~-~~~~~~~~~~~~~~~vi~~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~-~ 394 (729) T protein:vir:10 317 NSGTILEKHLSLSKA-KDAEYSVGSSSYWRDFLATNSKYIFGGGATSGITTTGYSVSSTNTLDTDSGWDQNAEGVNFG-A 394 (729) T ss_pred Ccccceeeeeeeeec-cccccccccccccceeeccccceeeecccccccccccccccccceecccccccccccccccc-c Confidence 001111111222110 00000000 001111111000000 00000000000000000000000000 0 Q ss_pred CCcceEEEE-ecc----------------chhHHHHHHHhhcC---ceeEEEEec------CCHHHHHHHHHHHHhcCcE Q lcl|Aclame:pro 68 DRPDTVAVI-TYE----------------DTKLLEAAEAYFLK---SWHFALLAE------FKAADALALSNLIEEQKFK 121 (331) Q Consensus 68 ~~p~~v~v~-~~~----------------~~~~~~al~~~~~~---~~~f~~~~~------~~~~~i~alA~w~ea~~~~ 121 (331) ..+..+... ..+ ..+...++..+.+. ....+.... .+.....++...++..... T Consensus 395 ~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~g~~~l~~~~~~~~~~~~~~~~~~~~~~~~~v~~a~~~~~~~~~~~ 474 (729) T protein:vir:10 395 SGVATLTLAGGTNYGDKTDLTTSGALSSGVDDIISGYTLFENTEEIEVDFILMGAAHHPKEQSQAVAEKVTAVAEARKDA 474 (729) T ss_pred cceeEEEeecccccccccccccccccccchhHHHHHHHHhhcccccccceeeecCCCCCccchHHHHHHHHHHHHhcCCe Confidence 000111000 000 01122333333221 122222221 2234445777778777666 Q ss_pred EEEEEeC----------------C----hHHHHh---hcccceEEEEEeCC-------Cc---h-hHHHHHHHHHhcCCc Q lcl|Aclame:pro 122 FAVFQVT----------------A----VADITP---LAKNTRTIAIVHSK-------TG---E-KLDAALIGNVASLPV 167 (331) Q Consensus 122 ~~~~~~t----------------~----~~~~~~---~~~~~~t~~~~~~~-------~~---~-~~~aa~~g~~~~~~~ 167 (331) +.++... . ...... .....+-..++++. .+ . .+.+.++|.++-.+. T Consensus 475 ~a~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~d~~~~~~~~~p~s~~~aGl~a~~d~ 554 (729) T protein:vir:10 475 VAFISPYRQAFLNDTVSGTVTVSNIDQTTENVVGFYAPLSSSTYSVFDSGYKYMFDRFNNTFRYVPLNGDIAGTCARTDI 554 (729) T ss_pred EEEecccccccccccccccccccccchhhHHHHHHHhhccCCceEEEEcCeeEEecccCCceEEechhHHHHHHHHHhhc Confidence 6554311 0 011111 11112223344321 11 1 245566666655443 Q ss_pred cceeee---eeeccCCcCC-----CCCCHHHHHHHHhCCCeEEEEEcCe-eEEecCEEeCC-----ceehhhHHHHHHHH Q lcl|Aclame:pro 168 GSATWK---GRHGLAGITS-----EELKVSEIDAIQKAGGMCYIEKAGI-AQTSEGKTVSG-----EFIDSIHGDDWIKA 233 (331) Q Consensus 168 G~~t~~---~k~~l~gv~~-----~~~t~t~~~~l~~~~~n~y~~~~g~-~~~~~G~~~~G-----~~iD~~~~~dwl~~ 233 (331) -+-.|+ .| ++.||.. ..+++.|.+.|..+|+|++.++.|. ..+..++++.+ .||-+.|..+|++. T Consensus 555 ~~g~~~span~-~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~~~d~~~~~i~vrR~~~~i~~ 633 (729) T protein:vir:10 555 EQFPWFSPAGT-ARGPILNSVKLVYNPGKKQRDILYSNRINPVILSPGAGIILFGDKTGFGKSSAFDRINVRRLFIYLED 633 (729) T ss_pred cCCcEEccCCc-cccceecccceeeecChhhHhhhhhCCceEEEEecCCeEEEEcceecCCCCcccceeehhhhHHHHHH Confidence 222332 23 2333322 3478899999999999999998765 45666677644 38999999999999 Q ss_pred HHHHHHHHHHhcCCCCCcCHHHHHHHHHHHHHHHHHHHhcCcccccccCCCcceEEEccchhcCCHHHHHhcccCCeEEE Q lcl|Aclame:pro 234 TIETRLQKLLTETDKLTFDARGIALLQSELTTVLNEGFANGIIDSNDETGEPNFSITALQRSDLNDDDIAKRNYKGLSFR 313 (331) Q Consensus 234 ~iq~~l~~l~~~~~kipyt~~G~~~i~~~v~~vl~~~~~~G~I~~g~~~~~~~~~v~~~~~~~~~~~dr~~R~~~~i~~~ 313 (331) .|++.+...+-. |.++.=...|+..|+.-|+..+++|.|. +|.|++. .+..|++|+.+.++. +.+. T Consensus 634 si~~~~~~~v~e----pn~~~~~~~i~~~i~~~L~~l~~~g~l~--------g~~v~~d-~~~nt~~~i~~G~~~-~~v~ 699 (729) T protein:vir:10 634 AISAAAKDQLFE----FNDELTRTNFVNIVEPFLRDVQAKRGIF--------DFVVICD-ETNNTAAVIDSNEFV-ADIF 699 (729) T ss_pred HHHHHHHHhhcC----CCCHHHHHHHHHHHHHHHHHHHhcccee--------eeEEEEc-CCCCCHHHhhCCeEE-EEEE Confidence 999988765432 5578888999999999999999999996 4899987 577899999999885 8999 Q ss_pred EEEcceEEEEEEEEEEeC Q lcl|Aclame:pro 314 YKRSGAIHSVDVYGEVEV 331 (331) Q Consensus 314 ~~~aGaIh~v~i~~~v~~ 331 (331) +.....+++|.++..-.- T Consensus 700 ~~p~~p~e~i~~~~~~~~ 717 (729) T protein:vir:10 700 IKPARSINFIGLTFVATR 717 (729) T ss_pred EEecCCccEEEEEEEEee Confidence 999999999998754444 No 44 >protein:vir:108052 Length: 660 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595294;genbank:gi:161622600;genbank:GeneID:5783771 Probab=98.09 E-value=4.8e-06 Score=49.73 Aligned_cols=309 Identities=14% Similarity=0.064 Sum_probs=155.8 Q ss_pred CCCceeeEEEE---------Eee--cccccc-cccceeEEEEccCC-cceEEEechhhhccCCCCChHHHHHHHHHHccC Q lcl|Aclame:pro 1 MVETITDVRVH---------ISV--LYPSPR-IGLGRPAIFVKGTA-MGYKEYTTLEELKDTFADNTEVYAKAKAVFLQK 67 (331) Q Consensus 1 ~v~~i~dV~v~---------i~~--~~~~~~-~~fg~~li~~~~~~-~~~~~yts~~~v~~~f~~~s~~ykaA~~~fsQ~ 67 (331) .......+++- ... ..+... ..+. ..+...+.. +.+...+...+ .....+..+. ...+..+ T Consensus 245 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~v~~~g~~~~~~~~~~~~~~---~~~~~~~~~~--~~~~~~~ 318 (660) T protein:vir:10 245 EAGSSKMLDVYPGGGTRASIAKAVFNYGPQTDDQYA-IIVRRDGAIVESVVLSTKEGE---KDVYGNNIYL--DDYFAKG 318 (660) T ss_pred CCcceeEEeeeeccceeeEEeeeecccccccccccc-cccccCCcccceeeeeccccc---cccccceeee--ehhhcCC Confidence 11111111110 000 000000 0011 000001110 01100000000 0000000000 0000000 Q ss_pred ------------CC--cceEEEE-ecc------chhHHHHHHHh---hcCceeEEEEecC-------CHHHHHHHHHHHH Q lcl|Aclame:pro 68 ------------DR--PDTVAVI-TYE------DTKLLEAAEAY---FLKSWHFALLAEF-------KAADALALSNLIE 116 (331) Q Consensus 68 ------------~~--p~~v~v~-~~~------~~~~~~al~~~---~~~~~~f~~~~~~-------~~~~i~alA~w~e 116 (331) +. ...+... ..+ ..+...++..+ .....-++++... ..+-..++...+| T Consensus 319 ~~~~v~~~~~~~~~~~~~~~~l~gg~~~~~~~t~~~~~~~~~~~~~~~~~~~~~l~~p~~~~~~~~~~~~v~~al~~~~~ 398 (660) T protein:vir:10 319 TSNYIYATSLNWPKGFSGIINLSGGISANDKVTAGDLMQGWDLFADREALHINLLIAGAVAGEGDEVASTVQKHVVSIAD 398 (660) T ss_pred CccEEEEEeccCCCCcccceeeeccccCccccccchhhhhhhhhhhhhhcccceEEEcCcCCCchhhhHHHHHHHHHHHH Confidence 00 0000000 000 01111222211 1122334444332 1122346667777 Q ss_pred hcCcEEEEEEeC-----------ChHHHHhhcc-------------cceEEEEEeCC-------Cc----hhHHHHHHHH Q lcl|Aclame:pro 117 EQKFKFAVFQVT-----------AVADITPLAK-------------NTRTIAIVHSK-------TG----EKLDAALIGN 161 (331) Q Consensus 117 a~~~~~~~~~~t-----------~~~~~~~~~~-------------~~~t~~~~~~~-------~~----~~~~aa~~g~ 161 (331) ....+|.+++.. ..+.+..... ..+ ..+|++. .+ ..+.+.++|. T Consensus 399 ~~~~~~aiid~p~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~s~~-~~~~~p~~~~~d~~~~~~~~~p~sg~~AGl 477 (660) T protein:vir:10 399 ERQDCLAFISPPKGLLVNVPLTRAVDNLIDWRTGAGTFDANNMNISTTY-AAIDGNYKYQYDKYNDVNRWVPLAADLAGL 477 (660) T ss_pred hhCCEEEEEecCcccccccccccCHHHHHHHHhhcccccccccccCcce-EEEEcCceEEecccCCceeEechhHHHHHH Confidence 777777766421 2222222111 112 2333332 11 1345666666 Q ss_pred HhcCCccceee---eeeeccCCcC-----CCCCCHHHHHHHHhCCCeEEEEEcC-ee-EEecCEEeCC-----ceehhhH Q lcl|Aclame:pro 162 VASLPVGSATW---KGRHGLAGIT-----SEELKVSEIDAIQKAGGMCYIEKAG-IA-QTSEGKTVSG-----EFIDSIH 226 (331) Q Consensus 162 ~~~~~~G~~t~---~~k~~l~gv~-----~~~~t~t~~~~l~~~~~n~y~~~~g-~~-~~~~G~~~~G-----~~iD~~~ 226 (331) ++-.+.-+--| .+|. +.|+. .-.+++.|.+.|..+|+|+...+-+ .. .+...+++++ .||-+.+ T Consensus 478 ~Ar~D~~~g~~~sPan~~-~~~i~g~~~~~~~~~~~e~~~Ln~~gIn~i~~~~~~~G~~~wG~rT~~~~~s~~~~i~vrR 556 (660) T protein:vir:10 478 CARTDDVSQPWMSPAGYN-RGQILNVLKLAIEPRQAQRDRMYQEAINPVVGFAGGDGFVLFGDKTATKVPSPMDHINVRR 556 (660) T ss_pred HHHhhccCCcEEccCCee-eceeeccceeeecCChhhHHhHhhCCceEEEEeeCCCcEEEEcccccCCCCcccceEehhh Confidence 65444322122 2332 33331 1358999999999999999988644 33 4566677655 2688889 Q ss_pred HHHHHHHHHHHHHHHHHhcCCCCCcCHHHHHHHHHHHHHHHHHHHhcCcccccccCCCcceEEEccchhcCCHHHHHhcc Q lcl|Aclame:pro 227 GDDWIKATIETRLQKLLTETDKLTFDARGIALLQSELTTVLNEGFANGIIDSNDETGEPNFSITALQRSDLNDDDIAKRN 306 (331) Q Consensus 227 ~~dwl~~~iq~~l~~l~~~~~kipyt~~G~~~i~~~v~~vl~~~~~~G~I~~g~~~~~~~~~v~~~~~~~~~~~dr~~R~ 306 (331) -.+|+...|+......+-. |.++.-...|+..++.-|+..+++|.|.. |.|++. .++.|++|+.+.+ T Consensus 557 ~~~~i~~si~~~~~~~v~e----pn~~~l~~~i~~~i~~fL~~l~~~gal~g--------~~V~~d-~~~nt~~di~~G~ 623 (660) T protein:vir:10 557 LFNMLKKNIGDASKYKLFE----LNDNFTRSSFRMEVSQYLDGIKALGGIYE--------GRVVCD-TTVNTPAVIDRNE 623 (660) T ss_pred HHHHHHHHHHHHHHHhccC----CCCHHHHHHHHHHHHHHHHHHHhCCceee--------eEEEEc-CCCCCHHHhhCCe Confidence 9999999999888765433 55888889999999999999999999973 889887 5678999999998 Q ss_pred cCCeEEEEEEcceEEEEEEEEEEeC Q lcl|Aclame:pro 307 YKGLSFRYKRSGAIHSVDVYGEVEV 331 (331) Q Consensus 307 ~~~i~~~~~~aGaIh~v~i~~~v~~ 331 (331) +. +.+.+...-.+++|.++..-+- T Consensus 624 ~~-~~i~~~P~~pae~I~~~~~~~~ 647 (660) T protein:vir:10 624 FI-ANIYVKPARSINYITLNFVATS 647 (660) T ss_pred EE-EEEEEEecCCccEEEEEEEEee Confidence 86 9999999999999999877664 No 45 >protein:vir:106984 Length: 743 # NCBI annotation: contractile sheath protein gp18 # Family: family:all:661 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195136;genbank:gi:58532913;uniprot:Q5GQN6;genbank:GeneID:3260481 Probab=98.00 E-value=7.7e-06 Score=48.63 Aligned_cols=291 Identities=15% Similarity=0.113 Sum_probs=149.6 Q ss_pred CCCceeeEEEEEeecccccccccce------------eEEEEccCCcc-------------eEEEechhh---------- Q lcl|Aclame:pro 1 MVETITDVRVHISVLYPSPRIGLGR------------PAIFVKGTAMG-------------YKEYTTLEE---------- 45 (331) Q Consensus 1 ~v~~i~dV~v~i~~~~~~~~~~fg~------------~li~~~~~~~~-------------~~~yts~~~---------- 45 (331) -...++.....++. .+......+. .++........ -..+.+... T Consensus 351 ~~~~v~~~~~~~s~-~~~~~~~~~~~~~~~~~~~~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 429 (743) T protein:vir:10 351 TANTIVERLTYLSK-LSDARSEENANIYYKNVINEQSAYLYHGNDAAVQIAASGEAWGQSSDQVLADAGTAFSRTTGYWV 429 (743) T ss_pred ccCceeEEEeeeec-ccccccccCcceeecceeccccceeeccCcccceeeeccccCccccceeeeecccccccccceEE Confidence 11111111111110 0000000000 00110000000 000000000 Q ss_pred -hcc---CCCCChHHHHHHHHHHccCCCcceEEEEeccchhHHHHHHHhhcCceeEEEEec------CCHHHHHHHHHHH Q lcl|Aclame:pro 46 -LKD---TFADNTEVYAKAKAVFLQKDRPDTVAVITYEDTKLLEAAEAYFLKSWHFALLAE------FKAADALALSNLI 115 (331) Q Consensus 46 -v~~---~f~~~s~~ykaA~~~fsQ~~~p~~v~v~~~~~~~~~~al~~~~~~~~~f~~~~~------~~~~~i~alA~w~ 115 (331) +.+ +...+...+..+-..|..... ...-++++.. ....-..++.+.+ T Consensus 430 ~~~gG~d~~~~~~~~~~~~~~~~~~~~~-----------------------~~~~ll~~p~~~~~~~~~~~v~~a~~~~~ 486 (743) T protein:vir:10 430 NLAGGNDDFAYDAGEFGAAMDLFLDTEE-----------------------TEIDFVLMGGSMADEADTKSKATKVIAIA 486 (743) T ss_pred EeecCccccccchhHHHHHHHHhhhccc-----------------------cCcceEEecCcccCccchHHHHHHHHHHH Confidence 000 111222233333333322111 0111222221 1122334455555 Q ss_pred HhcCcEEEEEEeCC-----------------hHHHH----hhcccceEEEEEeCC-------Cc----hhHHHHHHHHHh Q lcl|Aclame:pro 116 EEQKFKFAVFQVTA-----------------VADIT----PLAKNTRTIAIVHSK-------TG----EKLDAALIGNVA 163 (331) Q Consensus 116 ea~~~~~~~~~~t~-----------------~~~~~----~~~~~~~t~~~~~~~-------~~----~~~~aa~~g~~~ 163 (331) +.....|.+++... ..... ...+..+.. +|++- .+ ..+++.++|.++ T Consensus 487 ~~~~~~~a~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~-~~~p~~~~~d~~~~~~~~~p~s~~~AGl~a 565 (743) T protein:vir:10 487 ASRKDALAFVSPHKGNQIASTGNVALSSAQQKENTIAFFSDLTSTSYAV-FDSGYKYVYDRFTDKYRYIPCNGDVAGLCV 565 (743) T ss_pred HhhCCeEEEEecCCCccccccccccccccccchHHHHHHHhccCCeeEE-EEccceeeeccccCceeEechhHHHHHHHH Confidence 55444555443210 00111 011112222 22221 11 123555666655 Q ss_pred cCCccceee---eeeeccCCcCC-----CCCCHHHHHHHHhCCCeEEEEEcCe-eEEecCEEeCC-----ceehhhHHHH Q lcl|Aclame:pro 164 SLPVGSATW---KGRHGLAGITS-----EELKVSEIDAIQKAGGMCYIEKAGI-AQTSEGKTVSG-----EFIDSIHGDD 229 (331) Q Consensus 164 ~~~~G~~t~---~~k~~l~gv~~-----~~~t~t~~~~l~~~~~n~y~~~~g~-~~~~~G~~~~G-----~~iD~~~~~d 229 (331) ..+.-+-.| ..| ++.||.- -.++++|.+.|..+|+|+...+.+. ..+...+++.+ .||-+.+-.+ T Consensus 566 ~~D~~~g~~~span~-~~~gi~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~s~d~~~~~i~vrR~~~ 644 (743) T protein:vir:10 566 QTSNQLDDWYSPAGL-NRGGILNAVKLAYNPNKADRDELYQNRINPVVSLRGQGITLFGDKTALAAPSAFDRINVRRLFL 644 (743) T ss_pred HhhccCCcEEccCCe-eeeeeeccccceecCChhHHHhHhhCCceEEEEecCCeEEEEcccccCCCCcccceEeehhhHH Confidence 444321122 233 2344422 2478999999999999999998665 45566677654 2788999999 Q ss_pred HHHHHHHHHHHHHHhcCCCCCcCHHHHHHHHHHHHHHHHHHHhcCcccccccCCCcceEEEccchhcCCHHHHHhcccCC Q lcl|Aclame:pro 230 WIKATIETRLQKLLTETDKLTFDARGIALLQSELTTVLNEGFANGIIDSNDETGEPNFSITALQRSDLNDDDIAKRNYKG 309 (331) Q Consensus 230 wl~~~iq~~l~~l~~~~~kipyt~~G~~~i~~~v~~vl~~~~~~G~I~~g~~~~~~~~~v~~~~~~~~~~~dr~~R~~~~ 309 (331) ||+..|+..+...+-. |.++.=...|+..|+.-|+..+++|.|. +|.|++. .+..|++++.+.++. T Consensus 645 ~i~~si~~~~~~~v~e----~n~~~~~~~i~~~i~~fL~~l~~~gal~--------~~~V~~d-~~~nt~~~i~~G~~~- 710 (743) T protein:vir:10 645 NLEKRARRLAEGVLFE----QNDATTRAGFSSALNSYLSEVQARRGVT--------DYLVICD-ESNNTPDIIDRNEFV- 710 (743) T ss_pred HHHHHHHHHHHHhccC----CCCHHHHHHHHHHHHHHHHHHHhcCcee--------eeEEEEc-CCCCCHHHhhCCeEE- Confidence 9999999988765432 4478888999999999999999999984 6889997 578899999999886 Q ss_pred eEEEEEEcceEEEEEEEEEEeC Q lcl|Aclame:pro 310 LSFRYKRSGAIHSVDVYGEVEV 331 (331) Q Consensus 310 i~~~~~~aGaIh~v~i~~~v~~ 331 (331) +.+.+.....+++|.++..-.- T Consensus 711 ~~i~~~p~~pae~I~~~~~~~~ 732 (743) T protein:vir:10 711 AEVYVKPTRSINFITITFTATK 732 (743) T ss_pred EEEEEEecCCcceEEEEEEEee Confidence 8999999999999988766443 No 46 >protein:vir:103456 Length: 659 # NCBI annotation: tail sheath monomer # Family: family:all:661 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803108;genbank:gi:116326388;genbank:GeneID:4405485 Probab=97.99 E-value=7.8e-06 Score=48.60 Aligned_cols=313 Identities=13% Similarity=0.051 Sum_probs=158.1 Q ss_pred CCCceeeEEEEEeecccccccccceeEEEE-ccCC----------cceE-EEechhhhccCCCCChHHHHHHHHHHccCC Q lcl|Aclame:pro 1 MVETITDVRVHISVLYPSPRIGLGRPAIFV-KGTA----------MGYK-EYTTLEELKDTFADNTEVYAKAKAVFLQKD 68 (331) Q Consensus 1 ~v~~i~dV~v~i~~~~~~~~~~fg~~li~~-~~~~----------~~~~-~yts~~~v~~~f~~~s~~ykaA~~~fsQ~~ 68 (331) -.+....+.+.+.... ..........+-. .... .... .+..... .+-+............|..+. T Consensus 243 ~~~~~~~v~v~~~~~~-~~~a~~~t~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~ 319 (659) T protein:vir:10 243 DYAKGASALLPIYPGG-GTRASTAKAVFGYGPQTDSQYAIIVRRNDAIVQSVVLSTK--RGEKDIYDSNIYIDDFFAKGG 319 (659) T ss_pred hccccceeeeeeeeec-ccccccceeeeeeccccccchhhccccccceeeeeeeecc--ccccccccchhhhhhhhccCc Confidence 0111111111111100 0000000000000 0000 0000 0000000 000000011111112222111 Q ss_pred C--------------cceEEEEe-cc------chhHHHHHHHhh---cCceeEEEEecCC-------HHHHHHHHHHHHh Q lcl|Aclame:pro 69 R--------------PDTVAVIT-YE------DTKLLEAAEAYF---LKSWHFALLAEFK-------AADALALSNLIEE 117 (331) Q Consensus 69 ~--------------p~~v~v~~-~~------~~~~~~al~~~~---~~~~~f~~~~~~~-------~~~i~alA~w~ea 117 (331) . ...+.+.+ .+ ..+...++..+. ..+..++++.... .+-..++...++. T Consensus 320 ~~~v~~~~~~~~~~~~~~~~l~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~il~~p~~~~~~~~~~~~v~~al~~~~~~ 399 (659) T protein:vir:10 320 SEYIFATAQNWPEGFSGILTLSGGLSSNAEVTAGDLMEAWDFFADRESVDVQLFIAGSCAGESLETASTVQKHVVSIGDA 399 (659) T ss_pred ccEEEEeecccCCCccceeeecccccccccccchhHHHHHHHhhhccccceeEEEecCCCCcchhhhHHHHHHHHHHHHh Confidence 0 00111111 00 011122222221 1233444444321 1224556677777 Q ss_pred cCcEEEEEEeC-----------ChHHHHhhc-------------ccceEEEEEeCC-------Cc----hhHHHHHHHHH Q lcl|Aclame:pro 118 QKFKFAVFQVT-----------AVADITPLA-------------KNTRTIAIVHSK-------TG----EKLDAALIGNV 162 (331) Q Consensus 118 ~~~~~~~~~~t-----------~~~~~~~~~-------------~~~~t~~~~~~~-------~~----~~~~aa~~g~~ 162 (331) ...+|+++... ....+.... +..+ ..+|++. .+ ..|.+.++|.+ T Consensus 400 ~~~~~~~~d~p~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~s~~-~~l~~p~~~~~d~~~~~~~~~p~sg~~AGl~ 478 (659) T protein:vir:10 400 RQDCLVLCSPPRETVVGIPVTRAVDNLVNWRTAAGSYTDNNFNISSTY-AAIDGNYKYQYDKYNDVNRWVPLAADIAGLC 478 (659) T ss_pred hCCeEEEEcCccccccCCCcccCHHHHHHHHHhcccccccccccCcce-EEEEeCcEEEecccCCceEEechHHHHHHHH Confidence 77777765421 112221111 1223 3444431 11 13456666666 Q ss_pred hcCCccceee---eeee--ccCCcCC--CCCCHHHHHHHHhCCCeEEEEEcCe-eEEecCEEeCC-----ceehhhHHHH Q lcl|Aclame:pro 163 ASLPVGSATW---KGRH--GLAGITS--EELKVSEIDAIQKAGGMCYIEKAGI-AQTSEGKTVSG-----EFIDSIHGDD 229 (331) Q Consensus 163 ~~~~~G~~t~---~~k~--~l~gv~~--~~~t~t~~~~l~~~~~n~y~~~~g~-~~~~~G~~~~G-----~~iD~~~~~d 229 (331) +-.+.-+-.| ..|. .+.|+.. ..+++.|.+.|..+|+|+...+.|. ..+...+++++ .||-+.+-.+ T Consensus 479 Ar~D~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~~~~s~~~~i~vrR~~~ 558 (659) T protein:vir:10 479 ARTDNVSQTWMSPAGYNRGQILNVIKLAIETRQAQRDRLYQEAINPVTGTGGDGYVLYGDKTATSVPSPFDRINVRRLFN 558 (659) T ss_pred HHHhccCCceEccCCceeeeeeccccceecCCHhHHHHHhhCCeeEEEEeCCCeEEEEcccccCCCCcccceEehhhHHH Confidence 5444321122 2221 1223321 3578999999999999999998765 45666677665 3788999999 Q ss_pred HHHHHHHHHHHHHHhcCCCCCcCHHHHHHHHHHHHHHHHHHHhcCcccccccCCCcceEEEccchhcCCHHHHHhcccCC Q lcl|Aclame:pro 230 WIKATIETRLQKLLTETDKLTFDARGIALLQSELTTVLNEGFANGIIDSNDETGEPNFSITALQRSDLNDDDIAKRNYKG 309 (331) Q Consensus 230 wl~~~iq~~l~~l~~~~~kipyt~~G~~~i~~~v~~vl~~~~~~G~I~~g~~~~~~~~~v~~~~~~~~~~~dr~~R~~~~ 309 (331) |+...|++.+...+-. |.++.=...|+..|+.-|+..+++|.|. +|.|++.. +..|++++.+.++. T Consensus 559 ~i~~si~~~~~~~v~e----~n~~~l~~~i~~~i~~fL~~l~~~gal~--------~~~V~~d~-~~nt~~~i~~G~~~- 624 (659) T protein:vir:10 559 MLKTNIGRSSKYRLFE----LNNAFTRSSFRTETAQYLQGIKALGGIY--------EYRVVCDT-TNNTPSVIDRNEFV- 624 (659) T ss_pred HHHHHHHHHHHHhccC----CCCHHHHHHHHHHHHHHHHHHHhcCcee--------eEEEEEcC-CCCCHHHhhCCeEE- Confidence 9999999888764432 5577778899999999999999999996 58999875 77899999999885 Q ss_pred eEEEEEEcceEEEEEEEEEEeC Q lcl|Aclame:pro 310 LSFRYKRSGAIHSVDVYGEVEV 331 (331) Q Consensus 310 i~~~~~~aGaIh~v~i~~~v~~ 331 (331) +.+.+...-.+++|.++...+- T Consensus 625 ~~i~~~p~~pae~i~~~~~~~~ 646 (659) T protein:vir:10 625 ATFYIQPARSINYITLNFVATA 646 (659) T ss_pred EEEEEEecCCcceEEEEEEEEe Confidence 8999999999999998876654 No 47 >protein:vir:98824 Length: 774 # NCBI annotation: putative phage tail sheath protein # Family: family:all:661 # MgeID: mge:1530 # MgeName: Ma-LMM01 # Cross-refs: genbank:acc:YP_851105;genbank:gi:117530262;genbank:GeneID:4484488 Probab=97.95 E-value=9.4e-06 Score=48.15 Aligned_cols=310 Identities=15% Similarity=0.073 Sum_probs=153.4 Q ss_pred CCCceeeEEEEEeec------------cc-----------------------ccccccceeEEEEccCCcceEE------ Q lcl|Aclame:pro 1 MVETITDVRVHISVL------------YP-----------------------SPRIGLGRPAIFVKGTAMGYKE------ 39 (331) Q Consensus 1 ~v~~i~dV~v~i~~~------------~~-----------------------~~~~~fg~~li~~~~~~~~~~~------ 39 (331) ...+= ++|.+... .+ .-...+....+........... T Consensus 371 awGN~--ItV~I~~~t~~~~~l~v~~~~~s~f~~~~a~e~~tv~~~~~~~~~~v~e~~dn~~i~~~~~~~~~~~in~vs~ 448 (774) T protein:vir:98 371 NWGNQ--VTVSIYPVNNSEFRLNVQDLNGSAFNPPLADEVYTVKLGDTNESGELNALLDSKFIRGFFLPKSIDSINYDAA 448 (774) T ss_pred cCCCc--eEEEEEecCCceeEEEEEecCCccccccccceeEEEecccccccceeeeeeceeeEeeccccccccccccccc Confidence 11110 11111100 00 0001111111111000000000 Q ss_pred EechhhhccC-CCCChHHHHHHHHHHccCCCcceEEEEe----ccc-hhHHHHH----HHhhcCceeEEEEecCCHHHHH Q lcl|Aclame:pro 40 YTTLEELKDT-FADNTEVYAKAKAVFLQKDRPDTVAVIT----YED-TKLLEAA----EAYFLKSWHFALLAEFKAADAL 109 (331) Q Consensus 40 yts~~~v~~~-f~~~s~~ykaA~~~fsQ~~~p~~v~v~~----~~~-~~~~~al----~~~~~~~~~f~~~~~~~~~~i~ 109 (331) ....+.+... -....+........-.+. +..+.+.. .+. +...+.. ...-...++.+.....+..... T Consensus 449 lv~~~~~~~a~~d~~~~~~~~~~~~~~~~--~~~~v~v~lagG~Dg~~tt~~~igg~~~~~~~tgi~aLl~a~~~~~V~~ 526 (774) T protein:vir:98 449 LVRQSPLRLAPPDESETDVENPAHVDFYG--PNVLVDVTLENGYDGPPVTNDDYVSIIRTLENQPVHILLVGTTNVGVQQ 526 (774) T ss_pred ccccchhcccccccccccccccccccccC--CcceEEEeecCCCCcccccchheecccccccccceeEEEcCccchhhHH Confidence 0000000000 000000000000000001 11111111 111 1111111 1111244555554444555555 Q ss_pred HHHHHHHh----cCcEEEEEEe---CChHHHHhhc---ccceEEEEEeCC-------Cc----hhHHHHHHHHHhcCCcc Q lcl|Aclame:pro 110 ALSNLIEE----QKFKFAVFQV---TAVADITPLA---KNTRTIAIVHSK-------TG----EKLDAALIGNVASLPVG 168 (331) Q Consensus 110 alA~w~ea----~~~~~~~~~~---t~~~~~~~~~---~~~~t~~~~~~~-------~~----~~~~aa~~g~~~~~~~G 168 (331) ++..++|. ...+++++.. .+...+.+.. +..+. .++++. .. ..|++.++|.++..++ T Consensus 527 aii~~~e~~~~~~~~r~avid~p~g~t~~~Ai~~r~~f~S~~a-al~~Pwvkv~D~~~g~~~~vPpSg~vAGl~ArtDv- 604 (774) T protein:vir:98 527 ALITEAERASDSDGLRIAVLAAPPRTTPTLAASVTRGFNSTRA-VMVAGWFTYAGQPNSSRYGVPGAAVYAGKLAAIDF- 604 (774) T ss_pred HHHHHHHHhhhcccceEEEEECCCCCCHHHHHHHHhccCCceE-EEEeCcEEEeccCCCceeecChhHHHHHHHHhcCc- Confidence 66666654 2445555542 1122222221 22333 344431 11 1356677777776664 Q ss_pred ceeeeeeeccCCcC--------CCCCCHHHHHHHHhCCCeEEE-EEcCee-EEecCEEeCCc----eehhhHHHHHHHHH Q lcl|Aclame:pro 169 SATWKGRHGLAGIT--------SEELKVSEIDAIQKAGGMCYI-EKAGIA-QTSEGKTVSGE----FIDSIHGDDWIKAT 234 (331) Q Consensus 169 ~~t~~~k~~l~gv~--------~~~~t~t~~~~l~~~~~n~y~-~~~g~~-~~~~G~~~~G~----~iD~~~~~dwl~~~ 234 (331) ......+ .+.|+. ....++.+.+.|..+++|... ...+.+ .+..+++++++ ||-+.+-.+|++.. T Consensus 605 ~kSPANk-~I~Givg~ai~~~l~~~~t~ae~d~Ln~~gIN~i~itt~g~G~rvWG~RTlssDp~wr~InVRRlfd~Ie~S 683 (774) T protein:vir:98 605 FVSPAAR-SLVGPLFNIIESDTDNYTSRSNQDIYSAARLEVLSLDTVDRTYRFASGVTLSTDPAWERIYLRRVHDVVRQG 683 (774) T ss_pred ccccCCc-eeecceeccccccccccccchhhhhhcccccceeEEEEcCCcEEEEcccccCCCcccceEeehhhHHHHHHH Confidence 2333444 355653 223567888889999999876 333333 44555666663 78999999999999 Q ss_pred HHHHHHHHHhcCCCCCcCHHHHHHHHHHHHHHHHHHHhcCcccccccCCCcceE-EEccchhcCCHHHHHhcccCCeEEE Q lcl|Aclame:pro 235 IETRLQKLLTETDKLTFDARGIALLQSELTTVLNEGFANGIIDSNDETGEPNFS-ITALQRSDLNDDDIAKRNYKGLSFR 313 (331) Q Consensus 235 iq~~l~~l~~~~~kipyt~~G~~~i~~~v~~vl~~~~~~G~I~~g~~~~~~~~~-v~~~~~~~~~~~dr~~R~~~~i~~~ 313 (331) |+..+...+ .+ |.++.....|+..++..|+..++.|.|.. |+ ++. +.+.-|++++.+.++. +.+. T Consensus 684 I~~~~~~~V---fE-PNd~~l~~~I~~sI~~fL~~L~~~GaL~G--------~~~V~~-D~etNt~~dI~~G~l~-i~I~ 749 (774) T protein:vir:98 684 AHAILRNYV---AM-PNSRLVRNQIAAALNAFMGELKRNGNIVS--------FRPAII-DGSNNSTAAYFSRELY-VSLQ 749 (774) T ss_pred HHHHHHHhc---cC-CCCHHHHHHHHHHHHHHHHHHHhCCceec--------ceEEEE-cCCCCCHHHhhCCEEE-EEEE Confidence 998877644 33 77999999999999999999999999963 33 444 3466788888888775 8889 Q ss_pred EEEcceEEEEEEEEEEeC Q lcl|Aclame:pro 314 YKRSGAIHSVDVYGEVEV 331 (331) Q Consensus 314 ~~~aGaIh~v~i~~~v~~ 331 (331) +.....+++|.++..-+- T Consensus 750 vaP~~PAEfIilri~q~t 767 (774) T protein:vir:98 750 FQPLYSADYIYVTISRDT 767 (774) T ss_pred EEecCCcceEEEEEEEee Confidence 999999999998776666 No 48 >protein:vir:102819 Length: 648 # NCBI annotation: tail sheath protein # Family: family:all:2449 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874082;genbank:gi:118197689;genbank:GeneID:4496011 Probab=97.91 E-value=1.1e-05 Score=47.70 Aligned_cols=310 Identities=11% Similarity=0.049 Sum_probs=151.0 Q ss_pred CCCceeeE------------------EEEEeecc--ccccccc------------cee---------EEEEccCCcc--- Q lcl|Aclame:pro 1 MVETITDV------------------RVHISVLY--PSPRIGL------------GRP---------AIFVKGTAMG--- 36 (331) Q Consensus 1 ~v~~i~dV------------------~v~i~~~~--~~~~~~f------------g~~---------li~~~~~~~~--- 36 (331) +++.|..+ -+.+.... +.....+ |.. ..+......+ T Consensus 206 ~v~~~~~~~~~~~~~~~~~~s~~~~~d~~~~~~~~~a~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~~tp~~~~~~ 285 (648) T protein:vir:10 206 LINLLKEQLQPTDVVQIFDASDTNPVDIPLGLFVYEVLYGGLFGFTKSRLVKTSFGTVDDLLSNPLLFNLSATPFFDGSD 285 (648) T ss_pred hhhchhhhhhhhhhheecccccccccccccccccccccchhhhcCCcchhhhhhhccccccccccceecccccccccccc Confidence 22221111 11111000 0000000 000 0000000000 Q ss_pred eEEEechhhhccCCCC------------ChHHHHHHHHH--Hcc---CCCcceEE--EEeccchhHHHHHHHhhcCceeE Q lcl|Aclame:pro 37 YKEYTTLEELKDTFAD------------NTEVYAKAKAV--FLQ---KDRPDTVA--VITYEDTKLLEAAEAYFLKSWHF 97 (331) Q Consensus 37 ~~~yts~~~v~~~f~~------------~s~~ykaA~~~--fsQ---~~~p~~v~--v~~~~~~~~~~al~~~~~~~~~f 97 (331) +...+.+.+...-+.. ..|+-- +..+ ++. +..|..-+ .......++.+++....+.+-|| T Consensus 286 ~~~~~~~~~~~~~~~v~~~~~~~l~~~~~~p~~~-~~~~t~L~GGtdG~~p~s~~~~~~~~~~~d~~d~l~~~~~~~~~~ 364 (648) T protein:vir:10 286 YQDYTSLSDPANWFAKDAYTINHLVDTTINPHIL-ATRIFSLSGGTNGDDGTGYYQTAVSNYINIWSQGLATLEEEEVNF 364 (648) T ss_pred eeeeeccccccceeeeeccchhhcccccccCccc-ccccceecccccCCCcccccccccccchhhHHHHhhhccCCCceE Confidence 0001111110000000 000000 0000 111 11221100 00011234667777766777777 Q ss_pred EEEec-----------CCH-HHHHHHH-HHHHhcC---------cEEEEEE-eCChH--H---HH--hhcccceEEE--- Q lcl|Aclame:pro 98 ALLAE-----------FKA-ADALALS-NLIEEQK---------FKFAVFQ-VTAVA--D---IT--PLAKNTRTIA--- 144 (331) Q Consensus 98 ~~~~~-----------~~~-~~i~alA-~w~ea~~---------~~~~~~~-~t~~~--~---~~--~~~~~~~t~~--- 144 (331) ++... .+. .-|.+.+ .|+.... .++..+. ..+.+ . +. ...+..|... T Consensus 365 ivp~~~~~~~~~~~~~lt~~q~i~a~a~shv~~~s~~~~~~~r~~~~~~vg~~~~es~~~se~~~~~~~~~~~~a~~~~~ 444 (648) T protein:vir:10 365 VIPAYKFTNVTQLNDRLTIFKGIASTFLSHVQTMSQVNRRKARVGVFGLPAPSPNESVTASEYLYNRNILNTISAMFGGT 444 (648) T ss_pred EEeecccccccccccccCCccchHHHHHHHHHHhhhccccccccCeEEEeCCCCchhHHHHHHHhhhhcccccceeeeec Confidence 77622 111 2233333 5554321 1233322 11111 1 10 0011111111 Q ss_pred -------------EEeCC------CchhHHHHHHHHHhcCCccceeeeeeeccCCcC--C-CCCCHHHHHHHHhCCCeEE Q lcl|Aclame:pro 145 -------------IVHSK------TGEKLDAALIGNVASLPVGSATWKGRHGLAGIT--S-EELKVSEIDAIQKAGGMCY 202 (331) Q Consensus 145 -------------~~~~~------~~~~~~aa~~g~~~~~~~G~~t~~~k~~l~gv~--~-~~~t~t~~~~l~~~~~n~y 202 (331) .+++. +.++.+++++|..+...++ ...-||. +.++. + ..+++.|++.|.++|++++ T Consensus 445 d~~~~~~~~~~~~~~~~~G~~~~~p~~~~Aa~VAGl~a~l~~~-~s~T~k~-i~~~~id~~~~~t~~qld~L~~~Gv~~i 522 (648) T protein:vir:10 445 DRAQAVVFPFYSNVFNDEGKVELLGGEFFASYVAGMHANREPQ-DSITFLP-ISGIGAEPLYNWTYTQKDDLISNRVLFV 522 (648) T ss_pred CCceEEeecccceeECCCCcEEecchhhHHHHHHhhhhccccc-cCcccce-eeccccccccCCCHHHHHHHhcCCcEEE Confidence 11111 3345678888888888775 2233553 55543 3 3789999999999999999 Q ss_pred EEEcCe-----eEEecCEEeCC-------ceehhhHHHHHHHHHHHHHHHHHHhcCCCCCcCHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 203 IEKAGI-----AQTSEGKTVSG-------EFIDSIHGDDWIKATIETRLQKLLTETDKLTFDARGIALLQSELTTVLNEG 270 (331) Q Consensus 203 ~~~~g~-----~~~~~G~~~~G-------~~iD~~~~~dwl~~~iq~~l~~l~~~~~kipyt~~G~~~i~~~v~~vl~~~ 270 (331) .+..+. ...-.|.+..+ +-|-+++..|.+...+++.+.+.|+.. |=++.....|++.+.+-|.+- T Consensus 523 e~~~~~~~~~~~rvv~gITT~~~~~~~~~~eisv~ri~D~l~~~vr~~l~~~fIG~---~n~~~~~~~ik~~i~~~L~~~ 599 (648) T protein:vir:10 523 EKVKTSFGGIVYRIHHNPTTWLGPVTQGFQEFVLRRIDDFLQSYVYKNLQEQFIGR---KSYGRKTENDIKVYTEALLSN 599 (648) T ss_pred EEecCCcceeeEEEeccceeecCCCCcceeeeeeeehhhHHHHHHHHHHhhhcCcc---cccHHHHHHHHHHHHHHHhhH Confidence 876542 23455666655 268899999999999999999999864 446778899999999998888 Q ss_pred HhcCcccccccCCCcceEEEccchhcCCHHHHHhcccCCeEEEEEEcceEEEEEEEEEEeC Q lcl|Aclame:pro 271 FANGIIDSNDETGEPNFSITALQRSDLNDDDIAKRNYKGLSFRYKRSGAIHSVDVYGEVEV 331 (331) Q Consensus 271 ~~~G~I~~g~~~~~~~~~v~~~~~~~~~~~dr~~R~~~~i~~~~~~aGaIh~v~i~~~v~~ 331 (331) ++.+-|.+-. + .++++- . ...|. -|.|.+....+|+.|.++..|+- T Consensus 600 ~~~~~I~~y~-~----~~v~~~------~--~~~vv--~V~~~v~Pv~~i~~I~vti~it~ 645 (648) T protein:vir:10 600 LVGKQIVAYK-D----VKVTSN------E--DKTVY--YVEFFYQPVTEIKFILVTMKVTF 645 (648) T ss_pred hhcCcccCcc-c----ceEEEE------e--cCCEE--EEEEEEEecceeeEEEEEEEEEe Confidence 8888886422 1 123321 1 12333 48899999999999988877777 No 49 >protein:vir:3788 Length: 376 # NCBI annotation: tail sheath # Family: family:all:669 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536828;genbank:gi:17981837;genbank:GeneID:929216 Probab=97.76 E-value=2.2e-05 Score=46.15 Aligned_cols=318 Identities=9% Similarity=0.010 Sum_probs=164.0 Q ss_pred CCCceeeEEEEEeecccccccccceeEEEEccCCcceEEE-----echhhhccCCCCChHHHHH-HHHHHccCCCcceEE Q lcl|Aclame:pro 1 MVETITDVRVHISVLYPSPRIGLGRPAIFVKGTAMGYKEY-----TTLEELKDTFADNTEVYAK-AKAVFLQKDRPDTVA 74 (331) Q Consensus 1 ~v~~i~dV~v~i~~~~~~~~~~fg~~li~~~~~~~~~~~y-----ts~~~v~~~f~~~s~~yka-A~~~fsQ~~~p~~v~ 74 (331) |..++.==+.|+. +.+.. .---..|+++.++....+.. ++++.+.+. .+...|. .++...++.+-=... T Consensus 1 ~~~~v~vn~~n~~-~g~~~-~~er~~Lfig~~~~~~~~~~~~~~~sdld~~lg~---~~~~lk~~v~aa~~naG~~~~~~ 75 (376) T protein:vir:37 1 MFPSVQINALNQL-SGETK-EIERHALFVGVGTTNQGKLLALTPDSDFDKVFGE---TDTDLKKQVRAAMLNAGQNWFAH 75 (376) T ss_pred CCCeEEEeccccc-CCCcc-cccceEEeeccccccccceeeecCccchHhhhCC---CchHHHHHHHHHHhCCCCcEEEE Confidence 7775432222222 22222 23446677776655443443 566665543 2222232 222222222221222 Q ss_pred E--EeccchhHHHHHHHhh-cCceeEEEEecC---CHHHHHH---HHHHHHhcCc--EEEEEEeC----------ChHHH Q lcl|Aclame:pro 75 V--ITYEDTKLLEAAEAYF-LKSWHFALLAEF---KAADALA---LSNLIEEQKF--KFAVFQVT----------AVADI 133 (331) Q Consensus 75 v--~~~~~~~~~~al~~~~-~~~~~f~~~~~~---~~~~i~a---lA~w~ea~~~--~~~~~~~t----------~~~~~ 133 (331) + -..+..+..+++.... .-.+.|..+... +.+++.+ ++....++-. .|+.+... +..+. T Consensus 76 ~~~~~~~~~~~~~Av~~a~~~~s~E~V~v~~pv~t~~a~i~aa~~~a~el~~~~~Rpv~file~r~~~~~~~~~e~w~~y 155 (376) T protein:vir:37 76 VYIAQEDGYDFVECVKKANQTASFEYCVNTRYLGVDKASIGKLQECYAELLAKFGRRTFFIQAVQGINHDQSDGETWDQY 155 (376) T ss_pred EEeecCCchHHHHHHHHhhhhcCceEEEEeccccccHHHHHHHHHHHHHHHHhcCCeEEEEEeccCcCcccccccCHHHH Confidence 2 2223356777776543 234555555553 3556554 4444444432 34433322 11111 Q ss_pred ----Hhh---cccceEEEEEeCCCchhHHHHHHHHHh--c----CCccceee-eee----ecc-CCcCCCCCCHHHHHHH Q lcl|Aclame:pro 134 ----TPL---AKNTRTIAIVHSKTGEKLDAALIGNVA--S----LPVGSATW-KGR----HGL-AGITSEELKVSEIDAI 194 (331) Q Consensus 134 ----~~~---~~~~~t~~~~~~~~~~~~~aa~~g~~~--~----~~~G~~t~-~~k----~~l-~gv~~~~~t~t~~~~l 194 (331) ... ....++.++..-- .+....++||.. + ..||++.- ... ..+ ..-....++.+.+.+| T Consensus 156 ~~~~~al~~gia~~~V~~V~~~~--gn~~G~~aGRl~~aaVsVadspgRV~tG~l~gl~~~~lp~d~~~~~l~~a~l~aL 233 (376) T protein:vir:37 156 VQKLTTLQQTIVADHVCLVPLLF--GNETGVLAGRLANRAVTVADSPARVQTGALVSLGSANKPLDKDRNELTLAHLKSL 233 (376) T ss_pred HHHHHHhhcccccccceeeeeeh--hhhHHHHHHHHhhcccchhhCccceeccccccccccccccCcCcccCCHHHHHHH Confidence 111 1122332222111 123667788752 2 24555421 111 001 2233457899999999 Q ss_pred HhCCCeEEEEEcCe--eEEecCEEeC---CceehhhHHHHHHHH--HHHHHHHHHHhcCCCCCcCHHHHHHHHHHHHHHH Q lcl|Aclame:pro 195 QKAGGMCYIEKAGI--AQTSEGKTVS---GEFIDSIHGDDWIKA--TIETRLQKLLTETDKLTFDARGIALLQSELTTVL 267 (331) Q Consensus 195 ~~~~~n~y~~~~g~--~~~~~G~~~~---G~~iD~~~~~dwl~~--~iq~~l~~l~~~~~kipyt~~G~~~i~~~v~~vl 267 (331) +++|+.+...|.|. .++.+|.++. |+|=-.-+.+.+-|. +++......+. ...+--++.+++..+..+..+| T Consensus 234 d~agy~vp~~Y~gy~G~Y~~d~~tl~~~gsDY~~ie~~RVvdKa~R~vR~~ai~~i~-D~~lnst~~sia~~~~yi~~pL 312 (376) T protein:vir:37 234 ETARYSVPMWYPDYDGYYWADGRTLDVEGGDYQVIENLRVVDKVARKVRLLAIGKIA-DRSFNSTTSSTEYHKNYFAKPL 312 (376) T ss_pred HhCCCeEEEeeCCCCceEEeCceEeccCCCChhhhhhhhHHHHHHHHHHHHHHHHhC-CcccCcchhhHHHHHHHHHHHH Confidence 99999999999985 4778899875 455444444444444 44433333232 2334446777889999999999 Q ss_pred HHHHhcCcccccccCCCcceEEEccchhcCCHHHHHhcccCCeEEEEEEcceEEEEEEEEEEeC Q lcl|Aclame:pro 268 NEGFANGIIDSNDETGEPNFSITALQRSDLNDDDIAKRNYKGLSFRYKRSGAIHSVDVYGEVEV 331 (331) Q Consensus 268 ~~~~~~G~I~~g~~~~~~~~~v~~~~~~~~~~~dr~~R~~~~i~~~~~~aGaIh~v~i~~~v~~ 331 (331) ++..+.+.|..- ..|| +|..|+=.++..+-....+. .|.+..+.=|.-..|+++--|.+ T Consensus 313 r~M~~s~~i~g~---~fpG-eI~~p~d~Di~i~w~s~~~V-~I~~~v~P~~~pk~Itv~I~Ldl 371 (376) T protein:vir:37 313 RDMSKSATINGK---DFPG-ECMPPKDDAITIVWQSKTKV-TIYIKVRPYDCPKEITANIFLDL 371 (376) T ss_pred HHHHhcchhccc---cccc-eeecCCCCCceEEeeccceE-EEEEEEEeccCCceEEEEEEeec Confidence 999888887531 1222 47766644555444344444 36776777777777777766777 No 50 >protein:vir:79798 Length: 717 # NCBI annotation: tail sheath subunit # Family: family:all:12069 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429630;genbank:gi:156564120;genbank:GeneID:5525563 Probab=97.63 E-value=3.5e-05 Score=45.03 Aligned_cols=307 Identities=12% Similarity=0.139 Sum_probs=143.2 Q ss_pred CCCce------eeEEEEEeecccccccccc--eeEEEEccCCcceEEEec------------------hhhhccCCCCCh Q lcl|Aclame:pro 1 MVETI------TDVRVHISVLYPSPRIGLG--RPAIFVKGTAMGYKEYTT------------------LEELKDTFADNT 54 (331) Q Consensus 1 ~v~~i------~dV~v~i~~~~~~~~~~fg--~~li~~~~~~~~~~~yts------------------~~~v~~~f~~~s 54 (331) +-+++ .|..+..+++.|....++- .+|.+++ .+|..+.. ..+....|..+ T Consensus 309 ~~n~~~~~v~~~D~~~~~~~t~~~~~~g~~~~~pl~~ts---~dy~~~~~~vdgI~~~~~~~V~~~g~~s~a~a~~~~g- 384 (717) T protein:vir:79 309 VYNDIMRKVESKDGAVTVTITKPESKRGMISEDPLVFKS---GDYTNFKMLVDAINNHPFNNVVRARTKPEFEATFTST- 384 (717) T ss_pred eeeeeeeEEecCCceEEEEEecccccCcceecccccccc---CceeeeeeeecccccCchhheeeeecccccceeeeec- Confidence 11111 1122222222221111110 1111111 00111110 00000000000 Q ss_pred HHHHHHHHHHccCCCcceEE-------EEec-cch-h-HH-HHHHHhhcCceeEEEEecCC---------HHHHHHHHHH Q lcl|Aclame:pro 55 EVYAKAKAVFLQKDRPDTVA-------VITY-EDT-K-LL-EAAEAYFLKSWHFALLAEFK---------AADALALSNL 114 (331) Q Consensus 55 ~~ykaA~~~fsQ~~~p~~v~-------v~~~-~~~-~-~~-~al~~~~~~~~~f~~~~~~~---------~~~i~alA~w 114 (331) .-.+..+.|+.+.....+. .+.. ..+ . +. .+.........-++...+.. ++...+++++ T Consensus 385 -~~s~d~a~f~Gg~dgl~~~~ee~Y~~lGgk~~d~g~lt~~aays~LE~~dVDlVil~ga~adtt~ga~~d~va~alad~ 463 (717) T protein:vir:79 385 -LQAAADAKFSGGKDELSLDKEEMYKRLGGEKNEEGFVTKQGAYQYLENYEVDYVIPLGVHADTKLIGKYDDFAYQLALA 463 (717) T ss_pred -ccCchhhccCCCccccccchhhhhccccccccccccccchhhhhhcCcceeEEEEecCccccccccchhhhHHHHHHHH Confidence 0001111222221111100 0000 000 0 00 01111111122222222210 1223455555 Q ss_pred HHhcCc----EEEEEEe---CCh-----HH-HHhh---------------------------cccc-----eEEEEEeCC Q lcl|Aclame:pro 115 IEEQKF----KFAVFQV---TAV-----AD-ITPL---------------------------AKNT-----RTIAIVHSK 149 (331) Q Consensus 115 ~ea~~~----~~~~~~~---t~~-----~~-~~~~---------------------------~~~~-----~t~~~~~~~ 149 (331) |+++.. .+.+... .+. +. .... ..+. +...+.... T Consensus 464 caalSal~r~ai~VI~l~sp~D~~~AtVe~~~~kLs~~Aaa~~~~d~~~a~a~~~~~~~idis~y~~vv~~~~~iv~~~~ 543 (717) T protein:vir:79 464 CAVMSHYNSVTIGIIPTTTPSDISLAGVEEHVKKLENYANEFYMRDRFGNIIFDADRNKIDLGQFIEVVAGPDFIVRNTR 543 (717) T ss_pred HHHhhhccccceeeeccccccccchhhHHHHHHHHHhhhhhhhhhcchhccccccccccccccceeeeeecceeEEEcCC Confidence 554321 1111110 000 00 0000 0000 001111111 Q ss_pred Cch---hHHHHHHHHHhcCCccceeeeeeeccCCcC--CCCCCHHHHHHHHhCCCeEEEEEcCee-EEecCEEeCC---c Q lcl|Aclame:pro 150 TGE---KLDAALIGNVASLPVGSATWKGRHGLAGIT--SEELKVSEIDAIQKAGGMCYIEKAGIA-QTSEGKTVSG---E 220 (331) Q Consensus 150 ~~~---~~~aa~~g~~~~~~~G~~t~~~k~~l~gv~--~~~~t~t~~~~l~~~~~n~y~~~~g~~-~~~~G~~~~G---~ 220 (331) ... .+++.++|......+. ....++ .+.|+. ...++..|++.|..+|+|++..+.|.. .+..++++++ + T Consensus 544 ~~~~~~p~AG~vAGldA~rGVw-kSPANk-~I~GVvgLa~~lT~sE~d~Ln~aGIntIr~~~GrGirVWGaRTtasd~sd 621 (717) T protein:vir:79 544 LGQMASTPDASYIGMVSQLKTQ-SAPTNK-PLPSVTALRYTYSANQLNRLTKARFATFKYKQDGSIGVVDAPTSAHAGSD 621 (717) T ss_pred CceeecCHHHHHHHHHhcCCcc-cccccc-eecccccCcccCCHHHHHHHhhCCeEEEEEeCCceEEEEeeeecCCCCcc Confidence 111 2344555555554432 223355 366653 346899999999999999998876644 5677887765 2 Q ss_pred --eehhhHHHHHHHHHHHHHHHHHHhcCCCCCcCHHHHHHHHHHHHHHHHHHHhcCcccccccCCCcceEEEccchhcCC Q lcl|Aclame:pro 221 --FIDSIHGDDWIKATIETRLQKLLTETDKLTFDARGIALLQSELTTVLNEGFANGIIDSNDETGEPNFSITALQRSDLN 298 (331) Q Consensus 221 --~iD~~~~~dwl~~~iq~~l~~l~~~~~kipyt~~G~~~i~~~v~~vl~~~~~~G~I~~g~~~~~~~~~v~~~~~~~~~ 298 (331) +|-+.+-.|++...|+..+... + .+ |-++.+...|++.|+..|++..+.|.|.. |+++. ..+ T Consensus 622 WryInVRRl~D~Ie~sIr~al~~y-V--gE-PNd~~tr~~Ik~sI~afL~~L~r~GAI~G--------ykvdv----tnT 685 (717) T protein:vir:79 622 YTRLSTARIVKEAVNAVREVADPF-I--GE-PNDTGNRNALTAAVDKRLSKMIENKALLG--------FDFRL----VVT 685 (717) T ss_pred cceeehhhhHHHHHHHHHHHHHHh-c--cc-cCCHHHHHHHHHHHHHHHHHHHhcCceec--------ceeeE----ecC Confidence 6899999999999999887753 3 22 67888999999999999999999999963 44433 356 Q ss_pred HHHHHhcccCCeEEEEEEcceEEEEEEEEEEeC Q lcl|Aclame:pro 299 DDDIAKRNYKGLSFRYKRSGAIHSVDVYGEVEV 331 (331) Q Consensus 299 ~~dr~~R~~~~i~~~~~~aGaIh~v~i~~~v~~ 331 (331) ++|..+-+.. +.+.+.....+++|.++.+|+= T Consensus 686 ~~di~~G~l~-V~I~vaPv~PaEfI~ititITA 717 (717) T protein:vir:79 686 PQQELLGEGS-IELSLEAPNELRRLTTIVSLSA 717 (717) T ss_pred hhHhhCCEEE-EEEEEEecCcccEEEEEEEEeC Confidence 7776665543 7888999999999999988888 No 51 >protein:vir:6894 Length: 660 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861870;genbank:gi:32453661;genbank:GeneID:1494296 Probab=97.62 E-value=3.7e-05 Score=44.88 Aligned_cols=291 Identities=14% Similarity=0.059 Sum_probs=146.9 Q ss_pred CCCceeeEEEEEeecc---------cccccccceeEEEEc---cCCcceEEEechhh----------hccC----CCCCh Q lcl|Aclame:pro 1 MVETITDVRVHISVLY---------PSPRIGLGRPAIFVK---GTAMGYKEYTTLEE----------LKDT----FADNT 54 (331) Q Consensus 1 ~v~~i~dV~v~i~~~~---------~~~~~~fg~~li~~~---~~~~~~~~yts~~~----------v~~~----f~~~s 54 (331) ..++..++.+...-.. +......+....+.. .....+ .+..... +.++ .+.+. T Consensus 274 ~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-v~~~~~~~~~~~~~~~~~~gg~~~~~~~~~ 352 (660) T protein:vir:68 274 QTDDQYAIIVRRNDSVVQSVVLSTKRGERDIYGSNIFIDDFFAKGASNY-IFATAQGWPKGFSGVIKLNGGLSSNETVEA 352 (660) T ss_pred ccccceeeeeecCCcceeeeeeecccccccccccceeeehhhccCcccE-EEEeecCCCccccceeeecccccccccccc Confidence 2222222222211000 000000000000000 000011 0000000 0000 00011 Q ss_pred HHHHHHHHHHccCCCcceEEEEeccchhHHHHHHHhhcCceeEEEEec---CCHH----HHHHHHHHHHhcCcEEEEEEe Q lcl|Aclame:pro 55 EVYAKAKAVFLQKDRPDTVAVITYEDTKLLEAAEAYFLKSWHFALLAE---FKAA----DALALSNLIEEQKFKFAVFQV 127 (331) Q Consensus 55 ~~ykaA~~~fsQ~~~p~~v~v~~~~~~~~~~al~~~~~~~~~f~~~~~---~~~~----~i~alA~w~ea~~~~~~~~~~ 127 (331) ..+..+-.. +.........++.+.. .+.+ -+.++...++....+|.+++. T Consensus 353 ~~~~~~~~~-----------------------~~~~~~~~~~~l~~~~~~~~~~~~~~~v~~~l~~~~~~~~~~~~~~d~ 409 (660) T protein:vir:68 353 GDLMEAWDL-----------------------FADRESVNAQLFIAGSCAGESLEVASTVQKHVVAIGDSRQDCLVLCSP 409 (660) T ss_pred chhhhHHHH-----------------------hhhhhccccceeeccccCCCchHHHHHHHHHHHHHHHhhCCeEEEEcc Confidence 111111111 1111111122222211 1111 123444555554444443321 Q ss_pred -----------CChHHHHhhc-------------ccceEEEEEeCC-------Cch----hHHHHHHHHHhcCCccceee Q lcl|Aclame:pro 128 -----------TAVADITPLA-------------KNTRTIAIVHSK-------TGE----KLDAALIGNVASLPVGSATW 172 (331) Q Consensus 128 -----------t~~~~~~~~~-------------~~~~t~~~~~~~-------~~~----~~~aa~~g~~~~~~~G~~t~ 172 (331) +....+.... +..+ ..+|++. .+. -|.+.++|.++-.+.-+-.| T Consensus 410 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~-~~~~~p~~~~~d~~~~~~~~~p~sg~~AGl~Ar~d~~~g~~ 488 (660) T protein:vir:68 410 PRAAVVGIPVNRAVDNLVDWRTASGTYTDNNFNISSTY-AAIDGNYKYQYDKYNDVNRWVPLAADIAGLCARTDNISQPW 488 (660) T ss_pred cceeEecCCCCCCHHHHHHHHhhcccccccccccCcce-EEEEcCceEEecccCCceEEechhHHHHHHHHHHhccCCcE Confidence 1111111110 1112 2333321 111 24566666665443211122 Q ss_pred ---eeeeccCCcC---C--CCCCHHHHHHHHhCCCeEEEEEcCe-eEEecCEEeCC-----ceehhhHHHHHHHHHHHHH Q lcl|Aclame:pro 173 ---KGRHGLAGIT---S--EELKVSEIDAIQKAGGMCYIEKAGI-AQTSEGKTVSG-----EFIDSIHGDDWIKATIETR 238 (331) Q Consensus 173 ---~~k~~l~gv~---~--~~~t~t~~~~l~~~~~n~y~~~~g~-~~~~~G~~~~G-----~~iD~~~~~dwl~~~iq~~ 238 (331) ..+. +.||. . -.++++|.+.|..+|+|+...+.|. ..+...+++++ .||-+.|-.+|+...|++. T Consensus 489 ~span~~-~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~~~~s~~~~i~vrR~~~~i~~si~~~ 567 (660) T protein:vir:68 489 MSPAGYN-RGQILNVIKLAIETRQAQRDRLYQEAINPVTGTGGDGYVLYGDKTATSVPSPFDRINVRRLFNMVKTNIGSA 567 (660) T ss_pred EccCCee-eceeeccceeeecCChhHHHHHhhCCceEEEEecCCeEEEEcceecCCCCcccceEehhhHHHHHHHHHHHH Confidence 2332 33332 1 2478999999999999999998876 46667777765 2688889999999999988 Q ss_pred HHHHHhcCCCCCcCHHHHHHHHHHHHHHHHHHHhcCcccccccCCCcceEEEccchhcCCHHHHHhcccCCeEEEEEEcc Q lcl|Aclame:pro 239 LQKLLTETDKLTFDARGIALLQSELTTVLNEGFANGIIDSNDETGEPNFSITALQRSDLNDDDIAKRNYKGLSFRYKRSG 318 (331) Q Consensus 239 l~~l~~~~~kipyt~~G~~~i~~~v~~vl~~~~~~G~I~~g~~~~~~~~~v~~~~~~~~~~~dr~~R~~~~i~~~~~~aG 318 (331) +...+-. |.++.=...|+..|+.-|+..+++|.|. +|.|++ ..+..|++|+.+.++. +.+.+.... T Consensus 568 ~~~~v~e----pn~~~~~~~i~~~i~~~L~~l~~~gal~--------gf~V~~-d~~~nt~~~i~~G~~~-~~i~~~p~~ 633 (660) T protein:vir:68 568 SKYRLFE----LNNAFTRSSFRTETSQYLQGIKALGGVY--------NFKVVC-DTTNNTPAVIDRNEFV-ATFYLQPAR 633 (660) T ss_pred HHHhccC----CCCHHHHHHHHHHHHHHHHHHHhcCcee--------eeEEEE-ecCCCCHHHhhCCeEE-EEEEEEecC Confidence 8765432 4467667899999999999999999997 388988 4678899999999886 899999999 Q ss_pred eEEEEEEEEEEeC Q lcl|Aclame:pro 319 AIHSVDVYGEVEV 331 (331) Q Consensus 319 aIh~v~i~~~v~~ 331 (331) .+++|.++..-.- T Consensus 634 pae~i~l~~~~~~ 646 (660) T protein:vir:68 634 SINYITLNFVATA 646 (660) T ss_pred CcceEEEEEEEee Confidence 9999988776554 No 52 >protein:vir:104477 Length: 749 # NCBI annotation: gp18 # Family: family:all:661 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214663;genbank:gi:61806304;genbank:GeneID:3294532 Probab=97.61 E-value=3.7e-05 Score=44.85 Aligned_cols=315 Identities=11% Similarity=0.024 Sum_probs=151.3 Q ss_pred CCCcee---e-EEEEEeecc-cccccccceeEEEEccCCcceEEEechhhhccCCCCChHHH--HHHHHHH---ccCCCc Q lcl|Aclame:pro 1 MVETIT---D-VRVHISVLY-PSPRIGLGRPAIFVKGTAMGYKEYTTLEELKDTFADNTEVY--AKAKAVF---LQKDRP 70 (331) Q Consensus 1 ~v~~i~---d-V~v~i~~~~-~~~~~~fg~~li~~~~~~~~~~~yts~~~v~~~f~~~s~~y--kaA~~~f---sQ~~~p 70 (331) +.+.+. . |.+--.... ....................+-.+.+.... ..+......+ ..+..++ +.+.. T Consensus 365 ~~~~~~~~s~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~gg~d- 442 (749) T protein:vir:10 365 YAEVIKQKSEFIYWAEHESTLYAATSSASDGLFGQTAANRQFNLFRSAAGS-VDYPAGVTTLGSKNNATYYYRLSGGVN- 442 (749) T ss_pred hhhhhccCCCEEEEEecccccccccccccccccccccccceeecccccccc-ceeccccccccccCCcEEEEEccCCcc- Confidence 211111 1 111000000 000000000000000000000000000000 0000000000 0000000 00000 Q ss_pred ceEEEE--eccc---hhHHHHHHHhhcCceeEEEEec--C----CHHHHHHHHHHHHhcCcEEEEEEeCC---------- Q lcl|Aclame:pro 71 DTVAVI--TYED---TKLLEAAEAYFLKSWHFALLAE--F----KAADALALSNLIEEQKFKFAVFQVTA---------- 129 (331) Q Consensus 71 ~~v~v~--~~~~---~~~~~al~~~~~~~~~f~~~~~--~----~~~~i~alA~w~ea~~~~~~~~~~t~---------- 129 (331) ..+... .... ....+++........-.+++.. . ......++...+|....++.+..... T Consensus 443 ~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~li~~~~~~~~~~~~~v~~al~~~~~~~~~~~~~~d~p~~~~~~~~~~~ 522 (749) T protein:vir:10 443 YTVSAGQYTITNTDIGSAYELIGDPESQIVDFIISGPSGTSDANALAKITSLVNIAEERRDCMVFVSPRRGNVIGISNTT 522 (749) T ss_pred cccccccccccchhHHHHHHHhhhhhhcccceEEEecCCCCcchhHHHHHHHHHHHhhcCCEEEEEcCCCCcccccccch Confidence 000000 0000 1112222211112222222221 1 12344566677777666666553210 Q ss_pred --hHHHHh---hcccceEEEEEeCC-------Cch----hHHHHHHHHHhcCCccceeee---eee--ccCCcC--CCCC Q lcl|Aclame:pro 130 --VADITP---LAKNTRTIAIVHSK-------TGE----KLDAALIGNVASLPVGSATWK---GRH--GLAGIT--SEEL 186 (331) Q Consensus 130 --~~~~~~---~~~~~~t~~~~~~~-------~~~----~~~aa~~g~~~~~~~G~~t~~---~k~--~l~gv~--~~~~ 186 (331) ...... .....+-..+|++. .+. -+++.++|.++..+.-.--|+ .|. .+.|+. ...+ T Consensus 523 ~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~vAGl~Ar~D~~~g~~~SPan~~~~~i~g~~~~~~~~ 602 (749) T protein:vir:10 523 TITTNIVDFFKKLPSSSYMVFDSGYKYIYDKYNDVYRYIPCNGDTAGLCLQTNEISEPWFSPAGFQRGVLRNAIKLAYTP 602 (749) T ss_pred hhhhHHHHHHhhccCceeEEEEccceeeeccccCceEEechHHHHHHHHHHhhccCCcEECcCCceeeeeeccccceeec Confidence 001110 01112223333321 111 245666666665543222332 331 123331 2357 Q ss_pred CHHHHHHHHhCCCeEEEEEcCee-EEecCEEeCC-----ceehhhHHHHHHHHHHHHHHHHHHhcCCCCCcCHHHHHHHH Q lcl|Aclame:pro 187 KVSEIDAIQKAGGMCYIEKAGIA-QTSEGKTVSG-----EFIDSIHGDDWIKATIETRLQKLLTETDKLTFDARGIALLQ 260 (331) Q Consensus 187 t~t~~~~l~~~~~n~y~~~~g~~-~~~~G~~~~G-----~~iD~~~~~dwl~~~iq~~l~~l~~~~~kipyt~~G~~~i~ 260 (331) ++.|.+.|..+|+|+...+.|.. .+...+++.+ .||-+.|-.+|++..|+..+...+-. |.++.=...|+ T Consensus 603 ~~~e~~~Ln~~gIn~i~~~~g~G~~~wG~rT~~s~d~~~~~i~vRRl~~~ie~si~~~~~~~v~e----pn~~~l~~~i~ 678 (749) T protein:vir:10 603 NKAQRDQLYANRVNPIVSFPGQGVVLYGDKTALGFASAFDRINIRRLFLTVERVISTAAKAQLFE----QNDEAQRSLFI 678 (749) T ss_pred ChhHHHhhhhCCceEEEEecCCeEEEEcceecCCCCcccceeehhhhHHHHHHHHHHHHHHhhcC----CCCHHHHHHHH Confidence 89999999999999999987754 5566677643 37889999999999999887754432 55777789999 Q ss_pred HHHHHHHHHHHhcCcccccccCCCcceEEEccchhcCCHHHHHhcccCCeEEEEEEcceEEEEEEEEEEeC Q lcl|Aclame:pro 261 SELTTVLNEGFANGIIDSNDETGEPNFSITALQRSDLNDDDIAKRNYKGLSFRYKRSGAIHSVDVYGEVEV 331 (331) Q Consensus 261 ~~v~~vl~~~~~~G~I~~g~~~~~~~~~v~~~~~~~~~~~dr~~R~~~~i~~~~~~aGaIh~v~i~~~v~~ 331 (331) ..++.-|+..+++|.|. +|.|++. .+..|++++.+.++. +.+.+.....+++|.++..-+- T Consensus 679 ~~i~~fL~~l~~~G~i~--------~f~V~~d-~~~Nt~~~i~~G~~~-~~i~~~P~~pae~I~~~~~~~~ 739 (749) T protein:vir:10 679 NIVEPYLRDVQGRRGVV--------DFLVKCD-STNNTPEAVDRGEFY-AEVFLKPTRTINYVQLTFVATR 739 (749) T ss_pred HHHHHHHHHHHhcCCee--------eeEEEEc-CCCCCHHHhhCCEEE-EEEEEEecCCccEEEEEEEEee Confidence 99999999999999884 5889987 577899999998885 8999999999999998866443 No 53 >protein:vir:6594 Length: 666 # NCBI annotation: tail sheath protein # Family: family:all:661 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891725;genbank:gi:33620670;genbank:GeneID:1725344 Probab=97.56 E-value=4.4e-05 Score=44.45 Aligned_cols=296 Identities=16% Similarity=0.064 Sum_probs=148.0 Q ss_pred CCCceeeEEEE----------Eeecc-cccc-----------cccceeEEEEccCCc--c-eEEEechhhhccCCCCChH Q lcl|Aclame:pro 1 MVETITDVRVH----------ISVLY-PSPR-----------IGLGRPAIFVKGTAM--G-YKEYTTLEELKDTFADNTE 55 (331) Q Consensus 1 ~v~~i~dV~v~----------i~~~~-~~~~-----------~~fg~~li~~~~~~~--~-~~~yts~~~v~~~f~~~s~ 55 (331) .-++-..+.+. +.... +... ......++....... . -..+. +..+.... T Consensus 272 ~~~~~~~~~v~~~g~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~----~~~g~~~~-- 345 (666) T protein:vir:65 272 QNDNQYAFIVRRDGVVVESYVLSTLKGDKDVYGNSIYMDDFFARGSSQYIYATAQGWVDGFSGIIS----LAGGVSAN-- 345 (666) T ss_pred cccccceeeeecCCcccceeecccCcccccccchhhhhhhhhcccccceeeeecccccccccceEE----ccCCCCcC-- Confidence 11110011110 00000 0000 000111111111000 0 00000 00000000 Q ss_pred HHHHHHHHHccCCCcceEEEEecc----chhHHHHHHHhhcCceeEEEEecC------CHHHHHHHHHHHHhcCcEEEEE Q lcl|Aclame:pro 56 VYAKAKAVFLQKDRPDTVAVITYE----DTKLLEAAEAYFLKSWHFALLAEF------KAADALALSNLIEEQKFKFAVF 125 (331) Q Consensus 56 ~ykaA~~~fsQ~~~p~~v~v~~~~----~~~~~~al~~~~~~~~~f~~~~~~------~~~~i~alA~w~ea~~~~~~~~ 125 (331) .+..-.++..+ ..+..+++........-.++.... ...-..++...++....+|..+ T Consensus 346 -------------~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~l~~p~~~~~~~~~~~v~~~l~~~~~~~~~~~a~~ 412 (666) T protein:vir:65 346 -------------EATTGGVGADPFIGAMMQGWDLFAERESIHVNLLIAGACAGEGDAFSTVQKHAVSIGDERQDCLVMV 412 (666) T ss_pred -------------cccccccccccccccHHHHHHHHhhhhhccCCceeecCcCCccchhHHHHHHHHHHHhhccceEEEe Confidence 00000000000 011112222111111112222111 1233345555555555444433 Q ss_pred Ee-----------CChHHHHhhc-------------ccceEEEEEeCC-------Cch----hHHHHHHHHHhcCCccce Q lcl|Aclame:pro 126 QV-----------TAVADITPLA-------------KNTRTIAIVHSK-------TGE----KLDAALIGNVASLPVGSA 170 (331) Q Consensus 126 ~~-----------t~~~~~~~~~-------------~~~~t~~~~~~~-------~~~----~~~aa~~g~~~~~~~G~~ 170 (331) .. ++...+.... +..+ ..+|++. .+. -+++.++|.++..+.-+- T Consensus 413 d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~-~~~~~p~~~~~d~~~~~~~~~p~sg~vAGl~Ar~D~~~g 491 (666) T protein:vir:65 413 SPPRSTVVNIPVTTAIDNLIAWREGSGNYNENNMNINTTY-AVIDGNYKYQYDKYNDVNRWVPLAADIAGLCARTDAVSQ 491 (666) T ss_pred ccccceeeecCCCCCHHHHHHHHHhcccccccccccCcce-EEEEcCceEEecccCCceeEechHHHHHHHHHHHhccCC Confidence 21 1222221111 1112 2333321 111 245556666554432111 Q ss_pred ee---eeeeccCCc---CC--CCCCHHHHHHHHhCCCeEEEEEcCe-eEEecCEEeCC-----ceehhhHHHHHHHHHHH Q lcl|Aclame:pro 171 TW---KGRHGLAGI---TS--EELKVSEIDAIQKAGGMCYIEKAGI-AQTSEGKTVSG-----EFIDSIHGDDWIKATIE 236 (331) Q Consensus 171 t~---~~k~~l~gv---~~--~~~t~t~~~~l~~~~~n~y~~~~g~-~~~~~G~~~~G-----~~iD~~~~~dwl~~~iq 236 (331) .| ..|. +.|| .. -.+++.|.+.|..+|+|++..+.|. ..+..++++++ .||-+.+-.+|++..|+ T Consensus 492 ~~~span~~-~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~~~~s~~~~i~vrR~~~~i~~si~ 570 (666) T protein:vir:65 492 PWMSPAGYN-RGQIMNVVKLAIEPRKAHRDRLYQAAINPVIGAGGEGFILMGDKTATTVPSPFDRINVRRLFNMLKKNIG 570 (666) T ss_pred cEEccCCee-cceeeccccceeecChhHHHhhhhCCceEEEEeCCCeEEEEecccCCCCCcccceEehhhHHHHHHHHHH Confidence 22 2332 2332 11 2478899999999999999998775 46667777766 27889999999999999 Q ss_pred HHHHHHHhcCCCCCcCHHHHHHHHHHHHHHHHHHHhcCcccccccCCCcceEEEccchhcCCHHHHHhcccCCeEEEEEE Q lcl|Aclame:pro 237 TRLQKLLTETDKLTFDARGIALLQSELTTVLNEGFANGIIDSNDETGEPNFSITALQRSDLNDDDIAKRNYKGLSFRYKR 316 (331) Q Consensus 237 ~~l~~l~~~~~kipyt~~G~~~i~~~v~~vl~~~~~~G~I~~g~~~~~~~~~v~~~~~~~~~~~dr~~R~~~~i~~~~~~ 316 (331) +.....+=. |.++.=...|+..++.-|++.+++|.|. +|.|++. .++.|++|+.+.++. +.+.+.. T Consensus 571 ~~~~~~v~e----pn~~~l~~~i~~~i~~~L~~l~~~gal~--------g~~V~~d-~~~nt~~~i~~G~~~-~~i~~~p 636 (666) T protein:vir:65 571 DSSKYKLFE----NNDNFTRASFRMEVSQYLSTIRSLGGIY--------DFRVQCD-TTNNTPDVIDRNEFV-ASMFIKP 636 (666) T ss_pred HHHHHhccC----CCCHHHHHHHHHHHHHHHHHHHhCCcee--------eeEEEEc-CCCCCHHHhhCCeEE-EEEEEEe Confidence 888765432 5577878999999999999999999996 3899987 567899999999885 8999999 Q ss_pred cceEEEEEEEEEEeC Q lcl|Aclame:pro 317 SGAIHSVDVYGEVEV 331 (331) Q Consensus 317 aGaIh~v~i~~~v~~ 331 (331) ...+++|.++..-.= T Consensus 637 ~~pae~i~~~~~~~~ 651 (666) T protein:vir:65 637 AKSINYIMLNFTAVA 651 (666) T ss_pred cCCcceEEEEEEEee Confidence 999999998865443 No 54 >protein:vir:5663 Length: 671 # NCBI annotation: tail sheath protein # Family: family:all:661 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899602;genbank:gi:34419589;genbank:GeneID:2545776 Probab=97.37 E-value=8.3e-05 Score=42.96 Aligned_cols=309 Identities=13% Similarity=0.063 Sum_probs=148.3 Q ss_pred CCCceeeEEEEEeec---------ccccccccceeEEEEc---cCCcceE--------EEechhhhccC--CCCChHHHH Q lcl|Aclame:pro 1 MVETITDVRVHISVL---------YPSPRIGLGRPAIFVK---GTAMGYK--------EYTTLEELKDT--FADNTEVYA 58 (331) Q Consensus 1 ~v~~i~dV~v~i~~~---------~~~~~~~fg~~li~~~---~~~~~~~--------~yts~~~v~~~--f~~~s~~yk 58 (331) +.++...+.+...-. .+......+...++.. .....+. .......+.++ ...+...++ T Consensus 287 ~~~~~~~~~v~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~d~~~~~~~~~ 366 (671) T protein:vir:56 287 SNSNQYAVIVRVSGEVEEAFIVSTNPGDKDVNGQSIFIDEYFENSGSAYITAIAEGWKTESGAYNFGGGSDANAGADDWM 366 (671) T ss_pred cccccceeEEeecCccceeEEEeecccccccchhhhhhhhhhcccCceEEEecCcccCCccccccccCccccccchhHHH Confidence 222222222221100 0000000000000000 0000000 00001111111 112223333 Q ss_pred HHHHHHccCC--CcceEEEEeccchh------H-HHHHHHhh--cCceeEEEEec-------CCHHHHHHHHHHHHhcCc Q lcl|Aclame:pro 59 KAKAVFLQKD--RPDTVAVITYEDTK------L-LEAAEAYF--LKSWHFALLAE-------FKAADALALSNLIEEQKF 120 (331) Q Consensus 59 aA~~~fsQ~~--~p~~v~v~~~~~~~------~-~~al~~~~--~~~~~f~~~~~-------~~~~~i~alA~w~ea~~~ 120 (331) .+-..|.... .|..+......... . ..++..+. ......++... ........+.+|.+..+. T Consensus 367 ~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~ 446 (671) T protein:vir:56 367 FGLDMLSDPEVLYTNLVIAGNAAAEEVSIASTVQKYAIDSVGNVRQDCVVFVSPPQSLIVNKQAGTAVANIQGWRTGIDP 446 (671) T ss_pred HHHHhhhhccccceeEEEcCCCCCccchhHHHHHHHHHHHHHhhcCCEEEEEeccccccccccccccHHHHHHHhhhccc Confidence 3323332211 11111111100000 0 11111111 11111111100 011223334444443211 Q ss_pred EEEEEEeCChHHHH--hhcccceEEEEEeCC-------Cch----hHHHHHHHHHhcCCccceeee---eee--ccCCcC Q lcl|Aclame:pro 121 KFAVFQVTAVADIT--PLAKNTRTIAIVHSK-------TGE----KLDAALIGNVASLPVGSATWK---GRH--GLAGIT 182 (331) Q Consensus 121 ~~~~~~~t~~~~~~--~~~~~~~t~~~~~~~-------~~~----~~~aa~~g~~~~~~~G~~t~~---~k~--~l~gv~ 182 (331) ++-.... ...+..+. .++++. .+. -+.+.++|.++..+.-+--|+ .|. .+.|+. T Consensus 447 -------~~~~~~~~~~~~~s~~~-~~~~p~~~~~d~~~~~~~~~p~s~~~AGl~Ar~D~~~g~~~span~~~~~i~g~~ 518 (671) T protein:vir:56 447 -------TNGQAVVDNLNVSTTYA-VIDGNYKYQYDKYNDRNRWVPLAGDIAGLCAYTDQVSQPWMSPAGFNRGQIKGVN 518 (671) T ss_pred -------cchhhhhhhccCCcceE-EEecCceEEecccCCceeEechHHHHHHHHHHhhccCCcEECcCCceeccccccc Confidence 0000000 00111222 222221 111 245666666654442221222 221 122332 Q ss_pred --CCCCCHHHHHHHHhCCCeEEEEEcCe-eEEecCEEeCC-----ceehhhHHHHHHHHHHHHHHHHHHhcCCCCCcCHH Q lcl|Aclame:pro 183 --SEELKVSEIDAIQKAGGMCYIEKAGI-AQTSEGKTVSG-----EFIDSIHGDDWIKATIETRLQKLLTETDKLTFDAR 254 (331) Q Consensus 183 --~~~~t~t~~~~l~~~~~n~y~~~~g~-~~~~~G~~~~G-----~~iD~~~~~dwl~~~iq~~l~~l~~~~~kipyt~~ 254 (331) ...+++.|.+.|..+|+|+..++.|. ..+...+++++ .||-+.|-.+|+...|+..+...+-. |.++. T Consensus 519 ~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~~~~~~~~~i~vrR~~~~i~~si~~~~~~~v~e----pn~~~ 594 (671) T protein:vir:56 519 RLAVDLRRAHRDALYQIGINPVVGFAGQGFVLYGDKTATQQASAFDRINVRRLFNLLKKAISDAAKYRLFE----LNDEF 594 (671) T ss_pred cceeecChhHHHHHhhCCceEEEEecCCeEEEEcceecCCCCcccceEehhhHHHHHHHHHHHHHHHhcCC----CCCHH Confidence 23478999999999999999998665 45566677665 27899999999999999888764432 55777 Q ss_pred HHHHHHHHHHHHHHHHHhcCcccccccCCCcceEEEccchhcCCHHHHHhcccCCeEEEEEEcceEEEEEEEEEEeC Q lcl|Aclame:pro 255 GIALLQSELTTVLNEGFANGIIDSNDETGEPNFSITALQRSDLNDDDIAKRNYKGLSFRYKRSGAIHSVDVYGEVEV 331 (331) Q Consensus 255 G~~~i~~~v~~vl~~~~~~G~I~~g~~~~~~~~~v~~~~~~~~~~~dr~~R~~~~i~~~~~~aGaIh~v~i~~~v~~ 331 (331) =...|+..|..-|+..+++|.|. +|.|++. .++.|++|+.+.++. +.+.+.....+++|.++..-+- T Consensus 595 ~~~~i~~~i~~fL~~l~~~gal~--------g~~v~~d-~~~nt~~~i~~G~~~-~~i~~~p~~Pae~I~~~~~~~~ 661 (671) T protein:vir:56 595 TRSSFKSEIDAYLTNIQDLGGVY--------DFRVVCD-ETNNPGSVIDRNEFV-ASIYVKPAKSINFITLNFVATS 661 (671) T ss_pred HHHHHHHHHHHHHHHHHhCCcee--------eeEEEEc-CCCCCHHHhhCCeEE-EEEEEEecCCcceEEEEEEEee Confidence 77899999999999999999996 4899988 578899999999885 8999999999999999876555 No 55 >protein:vir:7206 Length: 659 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049780;genbank:gi:9632592;genbank:GeneID:1258597 Probab=97.24 E-value=0.00012 Score=42.11 Aligned_cols=313 Identities=14% Similarity=0.068 Sum_probs=158.2 Q ss_pred CCCceeeEEEEEeecc---------------cccccccceeEEE-EccCCcceE-----------EEechhhhccCCCCC Q lcl|Aclame:pro 1 MVETITDVRVHISVLY---------------PSPRIGLGRPAIF-VKGTAMGYK-----------EYTTLEELKDTFADN 53 (331) Q Consensus 1 ~v~~i~dV~v~i~~~~---------------~~~~~~fg~~li~-~~~~~~~~~-----------~yts~~~v~~~f~~~ 53 (331) ...-...+++.+.... .....+.....+. .......+. .+... ...+.+.. T Consensus 227 ~gt~g~~~tv~i~~~~~~~~~~~~~v~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~ 304 (659) T protein:vir:72 227 PGELGDKIEIEIVSKADYAKGASALLPIYPGGGTRASTAKAVFGYGPQTDSQYAIIVRRNDAIVQSVVLS--TKRGEKDI 304 (659) T ss_pred ccccccceeEEEccccccccceeeeeecccccccccccceeeeeeecccccccceeeecccceeeeeeee--eccccccc Confidence 0000001111111000 0000000000000 000000000 00000 00011111 Q ss_pred hHHHHHHHHHHccCCCcc----------------eEEEEec-----cchhHHHHHHHhhc---CceeEEEEecCC----- Q lcl|Aclame:pro 54 TEVYAKAKAVFLQKDRPD----------------TVAVITY-----EDTKLLEAAEAYFL---KSWHFALLAEFK----- 104 (331) Q Consensus 54 s~~ykaA~~~fsQ~~~p~----------------~v~v~~~-----~~~~~~~al~~~~~---~~~~f~~~~~~~----- 104 (331) ...-......|..+...- .+.-+.. ...+...++..+.. -+...+++.... T Consensus 305 ~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~l~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~p~~~~~~~~ 384 (659) T protein:vir:72 305 YDSNIYIDDFFAKGGSEYIFATAQNWPEGFSGILTLSGGLSSNAEVTAGDLMEAWDFFADRESVDVQLFIAGSCAGESLE 384 (659) T ss_pred cchhhhhhhhhhcCCceEEEEEecccCCcccccccccccccccccccchhHHHHHHHhhhccccceeEEEecCCCCcchh Confidence 111122222332221110 0100000 01112233333221 123344444321 Q ss_pred --HHHHHHHHHHHHhcCcEEEEEEeC-----------ChHHHHhhc-------------ccceEEEEEeCC-------Cc Q lcl|Aclame:pro 105 --AADALALSNLIEEQKFKFAVFQVT-----------AVADITPLA-------------KNTRTIAIVHSK-------TG 151 (331) Q Consensus 105 --~~~i~alA~w~ea~~~~~~~~~~t-----------~~~~~~~~~-------------~~~~t~~~~~~~-------~~ 151 (331) ..-..++...+|....+|+++... ....+.... +..+ ..+|++. .+ T Consensus 385 ~~~~v~~~l~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~s~~-~~~~~p~~~~~d~~~~ 463 (659) T protein:vir:72 385 TASTVQKHVVSIGDARQDCLVLCSPPRETVVGIPVTRAVDNLVNWRTAAGSYTDNNFNISSTY-AAIDGNHKYQYDKYND 463 (659) T ss_pred hhHHHHHHHHHHHhhhCCEEEEEcCccccccCCCcccCHHHHHHHHhhcccccccccccccee-EEEEcCceeeccccCC Confidence 122345667777777777766421 122221111 1222 3444432 11 Q ss_pred h----hHHHHHHHHHhcCCccceee---eeeeccCCcC-----CCCCCHHHHHHHHhCCCeEEEEEcCee-EEecCEEeC Q lcl|Aclame:pro 152 E----KLDAALIGNVASLPVGSATW---KGRHGLAGIT-----SEELKVSEIDAIQKAGGMCYIEKAGIA-QTSEGKTVS 218 (331) Q Consensus 152 ~----~~~aa~~g~~~~~~~G~~t~---~~k~~l~gv~-----~~~~t~t~~~~l~~~~~n~y~~~~g~~-~~~~G~~~~ 218 (331) . -|.+.++|.++-.+.-+-.| .++ ++.|+. .-.+++.|.+.|..+++|+..++.|.. .+...++++ T Consensus 464 ~~~~~p~sg~vAGl~Ar~D~~~G~~~span~-~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~g~G~~~wG~rT~~ 542 (659) T protein:vir:72 464 VNRWVPLAADIAGLCARTDNVSQTWMSPAGY-NRGQILNVIKLAIETRQAQRDRLYQEAINPVTGTGGDGYVLYGDKTAT 542 (659) T ss_pred ceEEechHHHHHHHHHHhhccCCcEEccCCe-eeceeeccccccccCChhHHHHHhhCCceEEEEecCCeEEEEcccccC Confidence 1 24566666665444321122 233 223322 134789999999999999999987653 556667766 Q ss_pred C-----ceehhhHHHHHHHHHHHHHHHHHHhcCCCCCcCHHHHHHHHHHHHHHHHHHHhcCcccccccCCCcceEEEccc Q lcl|Aclame:pro 219 G-----EFIDSIHGDDWIKATIETRLQKLLTETDKLTFDARGIALLQSELTTVLNEGFANGIIDSNDETGEPNFSITALQ 293 (331) Q Consensus 219 G-----~~iD~~~~~dwl~~~iq~~l~~l~~~~~kipyt~~G~~~i~~~v~~vl~~~~~~G~I~~g~~~~~~~~~v~~~~ 293 (331) + .||-+.+-.+|+...|++.+...+-. |.++.=...|+..|+.-|+..+++|.|. +|.|++. T Consensus 543 ~~~s~~~~i~vrR~~~~i~~si~~~~~~~v~e----~n~~~l~~~i~~~i~~fL~~l~~~gal~--------~~~V~~d- 609 (659) T protein:vir:72 543 SVPSPFDRINVRRLFNMLKTNIGRSSKYRLFE----LNNAFTRSSFRTETAQYLQGNKALGGIY--------EYRVVCD- 609 (659) T ss_pred CCCcccceEeehhHHHHHHHHHHHHHHHhhcC----CCCHHHHHHHHHHHHHHHHHHHhcCcee--------eEEEEEc- Confidence 5 27889999999999999888764432 5577778899999999999999999994 5899987 Q ss_pred hhcCCHHHHHhcccCCeEEEEEEcceEEEEEEEEEEeC Q lcl|Aclame:pro 294 RSDLNDDDIAKRNYKGLSFRYKRSGAIHSVDVYGEVEV 331 (331) Q Consensus 294 ~~~~~~~dr~~R~~~~i~~~~~~aGaIh~v~i~~~v~~ 331 (331) .++.|++|+.+.++. +.+.+.....+++|.++..-+- T Consensus 610 ~~~nt~~~i~~G~~~-~~i~~~p~~pae~I~~~~~~~~ 646 (659) T protein:vir:72 610 TTNNTPSVIDRNEFV-ATFYIQPARSINYITLNFVATA 646 (659) T ss_pred CCCCCHHHhhCCeEE-EEEEEEecCCccEEEEEEEEee Confidence 577899999999885 9999999999999999866443 No 56 >protein:vir:489 Length: 498 # NCBI annotation: putative sheath protein # Family: family:all:369 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543096;swissprot:trembl:q8w623;genbank:gi:18249908;uniprot:Q8W623;genbank:GeneID:929698 Probab=97.23 E-value=0.00012 Score=42.04 Aligned_cols=313 Identities=13% Similarity=0.104 Sum_probs=152.0 Q ss_pred CCCceeeEEEEEeecccccccccceeEEEEccCCcc-eEEEech----hhhccCCCCChHHHHHHHHHHccCCCc----- Q lcl|Aclame:pro 1 MVETITDVRVHISVLYPSPRIGLGRPAIFVKGTAMG-YKEYTTL----EELKDTFADNTEVYAKAKAVFLQKDRP----- 70 (331) Q Consensus 1 ~v~~i~dV~v~i~~~~~~~~~~fg~~li~~~~~~~~-~~~yts~----~~v~~~f~~~s~~ykaA~~~fsQ~~~p----- 70 (331) =.+...++.+. .+.-+.+...-|+.-+-++.+... ...|=.- -.|..+-+...-.-+.++++=++..-| T Consensus 80 ~~n~~~~l~~i-~~~D~ag~aA~g~it~tg~at~~G~l~l~Igg~~v~v~V~~gdTaa~vA~al~aai~a~~~lPVTA~~ 158 (498) T protein:vir:48 80 QTDPFGELYVI-AVPEARGAAATVRVTVTGEAEESGTLSLYVGRSSVQVPVVNGDDATAVATAIKEAVNGVITLPFAASS 158 (498) T ss_pred HhCCCceeEEE-eeCCcccceeEEEEEecccccCCceEEEEECCEEEEEeecCCCCHHHHHHHHHHHHhCCCCcceEEEe Confidence 12223333321 111122223333333333222111 0000000 000111011111111111111111111 Q ss_pred -----------------------------------c--eEEEE----eccchhHHHHHHHhhcCceeEEEEecCCHHHHH Q lcl|Aclame:pro 71 -----------------------------------D--TVAVI----TYEDTKLLEAAEAYFLKSWHFALLAEFKAADAL 109 (331) Q Consensus 71 -----------------------------------~--~v~v~----~~~~~~~~~al~~~~~~~~~f~~~~~~~~~~i~ 109 (331) . ++.+. +....+...+|.+..+.+|.|+++.-.|.+.+. T Consensus 159 ~~~~VtlTAr~kG~~GN~I~l~~~~~~~~~ge~~p~Glt~~itamsgGag~PDia~aLaal~~~~~~~I~~p~~D~asl~ 238 (498) T protein:vir:48 159 DAGVVTLTARHKGLYGNELPVCLNYYGSGGGEILPAGLQVVTEAGTAGSGAPDLTAAVAAMGDEAFDFIGLPFNDAASIN 238 (498) T ss_pred cCcEEEEEeeecccccccceeeeeeccCcccccccceeeEEEEcccCCccCcchHHHHHhhccCCccEEEEeecCHHHHH Confidence 1 11111 111235667777777888889999888888888 Q ss_pred HHHHHHHhcC-------cE--EEEEEe-CChHHHH---hhcccceEEEEEeCCCch---h-HHHHHHHHHh---cCCccc Q lcl|Aclame:pro 110 ALSNLIEEQK-------FK--FAVFQV-TAVADIT---PLAKNTRTIAIVHSKTGE---K-LDAALIGNVA---SLPVGS 169 (331) Q Consensus 110 alA~w~ea~~-------~~--~~~~~~-t~~~~~~---~~~~~~~t~~~~~~~~~~---~-~~aa~~g~~~---~~~~G~ 169 (331) ++.++++.-. .+ +.+... ...+.+. ...+..|..++++..... + .+++++++++ ..+|.+ T Consensus 239 al~~~L~~~sgRw~~~~q~~g~~~~a~~gT~~~l~t~g~~~N~~~it~~~~~~~~~~p~~~~AAa~a~~aA~~l~~DPAr 318 (498) T protein:vir:48 239 MMMTEMNDSSGRWSYARQLYGHVYTAKLGTLSELVNAGDMHNQQHITLAGYEKETQSPVDELVASRLAREAVFIRNDPAR 318 (498) T ss_pred HHHHHHhhhhhhhhHHhhcCeEEEEeccCCHHHHHHhhhccCCceEEEEecCCCCCChHHHHHHHHHHHHHHhhhccccc Confidence 8888886422 22 223322 2233333 334556666666543322 2 3455666554 556643 Q ss_pred eeeeeeeccCCcCCCC----CCHHHHHHHHhCCCeEEEEEcCeeEEecCEEe-----CC----ceeh--hhHHHHHHHHH Q lcl|Aclame:pro 170 ATWKGRHGLAGITSEE----LKVSEIDAIQKAGGMCYIEKAGIAQTSEGKTV-----SG----EFID--SIHGDDWIKAT 234 (331) Q Consensus 170 ~t~~~k~~l~gv~~~~----~t~t~~~~l~~~~~n~y~~~~g~~~~~~G~~~-----~G----~~iD--~~~~~dwl~~~ 234 (331) .+.-. .|+||.|.. ++.+|.+.|..+|+..+.-.+|...+.+..+. .| .|.| +++-.++++.. T Consensus 319 -PLqtl-~L~Gi~~p~~~~r~~~~ern~LL~~Gist~~V~~G~V~I~R~ITTY~~n~~G~~D~syLDi~T~~tl~yvr~~ 396 (498) T protein:vir:48 319 -PTQTG-ELVGMLPAPKGKRFIMTEQQTLLSHGVATAYVEGGTLRIQRSVTTYKKNAYGVADNSYLDSETLHTSAYVLRK 396 (498) T ss_pred -cccce-eeeccccCCchhcCChHHHHHHHhcCcceEEEcCCeEEEEeeeeeeeecCCCCcchhhhhhhhHHHHHHHHHH Confidence 22222 377887654 68899999999999999777777666666543 35 2665 89999999999 Q ss_pred HHHHHHHHHhcCCCCCcCHH---------HHHHHHHHHHHHHHHHHhcCcccccccC---------CCcceEE--Eccch Q lcl|Aclame:pro 235 IETRLQKLLTETDKLTFDAR---------GIALLQSELTTVLNEGFANGIIDSNDET---------GEPNFSI--TALQR 294 (331) Q Consensus 235 iq~~l~~l~~~~~kipyt~~---------G~~~i~~~v~~vl~~~~~~G~I~~g~~~---------~~~~~~v--~~~~~ 294 (331) ++..+..-|-. .|+.=+.. --..|++.+-.++++....|++..-+.. ++.-.++ ..|+ T Consensus 397 ~r~~i~~kfpR-~KLa~dg~~~~~gq~IvTp~~ir~eli~~y~~le~~given~~~~~~~LiVerd~~dpnRln~~~p~- 474 (498) T protein:vir:48 397 LKSVITSKYGR-HKLANDGTRFGPGQAIVTPAVIKGELLATYRQMERAGIVENYDLFKQYLIVERDADNPNRLNTLFPP- 474 (498) T ss_pred HHHHhhhhcCC-ceecccCcccCCCCcccchHHHHHHHHHHHHhhhhhccccChhhhcceeEEEECCCCCcEEEEEecc- Confidence 99999876622 23321111 1268999999999999999999642211 1111111 1111 Q ss_pred hcCCHHHHHhcccCCeEEEEEEcce Q lcl|Aclame:pro 295 SDLNDDDIAKRNYKGLSFRYKRSGA 319 (331) Q Consensus 295 ~~~~~~dr~~R~~~~i~~~~~~aGa 319 (331) +.-..=|---..-.+.+.|.-++| T Consensus 475 -d~vn~L~V~A~~~~f~lq~~~~~~ 498 (498) T protein:vir:48 475 -DYVNQLRVFAVVNQFRLQYSEESA 498 (498) T ss_pred -cccCchhhhhhhhhhhhhhhhcCC Confidence 111111111111123333334444 No 57 >protein:vir:98263 Length: 664 # NCBI annotation: gp18 tail sheath monomer # Family: family:all:661 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239196;genbank:gi:66391671;genbank:GeneID:3416365 Probab=97.13 E-value=0.00016 Score=41.42 Aligned_cols=303 Identities=12% Similarity=0.036 Sum_probs=150.5 Q ss_pred CCCceeeEEEEEeecccccccccceeEEEEccCC-cce-EEEechhhhccCCCCChHHHHHHHHHHccCCCcce--EEE- Q lcl|Aclame:pro 1 MVETITDVRVHISVLYPSPRIGLGRPAIFVKGTA-MGY-KEYTTLEELKDTFADNTEVYAKAKAVFLQKDRPDT--VAV- 75 (331) Q Consensus 1 ~v~~i~dV~v~i~~~~~~~~~~fg~~li~~~~~~-~~~-~~yts~~~v~~~f~~~s~~ykaA~~~fsQ~~~p~~--v~v- 75 (331) ..++...+.+.... .. ...| .+..... .++ .......+.. ......|..+... ..|.... +.. T Consensus 273 ~~~~~~~~~~~~~~--~~-~e~~----~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~--~~~~~~~~~~~~~ 340 (664) T protein:vir:98 273 QTDNQYAFVVRRGG--IV-QESF----IVSTDKTDKDIYGVNIYMDDFF---ANGGSQYVFGTSM--NWPKGFSGILEFG 340 (664) T ss_pred cCccceeEEEecCC--ce-eeeE----EeecccCcccceeeeeechhhe---ecccceeeeeecc--cCCcccceeEecc Confidence 11111111111100 00 0000 0111110 000 0000000000 0000000000000 0000000 000 Q ss_pred Eec------cchhHHHHHHHhhc---CceeEEEEecCC---H----HHHHHHHHHHHhcCcEEEEEEe-----------C Q lcl|Aclame:pro 76 ITY------EDTKLLEAAEAYFL---KSWHFALLAEFK---A----ADALALSNLIEEQKFKFAVFQV-----------T 128 (331) Q Consensus 76 ~~~------~~~~~~~al~~~~~---~~~~f~~~~~~~---~----~~i~alA~w~ea~~~~~~~~~~-----------t 128 (331) ++. ...+....+..+.. -..-.+++...+ . +-..++...++....+|.+++. + T Consensus 341 gg~~~~~~~g~~~~~tgl~~l~~~~~~~~~ll~~p~~~~~~~~~~~~v~~al~~~a~~~~~~~a~~d~~~~~~~~~~~~~ 420 (664) T protein:vir:98 341 GGLSSNDTVGADELMTGWDMFADREALHVPLLIAGGCAGESVEIASTVQKHVISIGDERQDCTVFVSPPRSLLVNIPLAT 420 (664) T ss_pred CccccccccCchhHHHHHHhhhcccccccceEEecCCCCCcHHHHHHHHHHHHHHHHhcCCeEEEEccccceeccCCccc Confidence 000 01111222222211 122233333321 1 1233455556666556655431 1 Q ss_pred ChHHHHhh-----------------cccceEEEEEeCC-------Cch----hHHHHHHHHHhcCCccceeee---eeec Q lcl|Aclame:pro 129 AVADITPL-----------------AKNTRTIAIVHSK-------TGE----KLDAALIGNVASLPVGSATWK---GRHG 177 (331) Q Consensus 129 ~~~~~~~~-----------------~~~~~t~~~~~~~-------~~~----~~~aa~~g~~~~~~~G~~t~~---~k~~ 177 (331) ...++... .+..+ ..+|++. .+. -+.+.++|.++..+.-+-.|+ .| + T Consensus 421 ~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~-~~l~~p~~~~~d~~~~~~~~~p~sg~~AGl~A~~D~~~g~~~span~-~ 498 (664) T protein:vir:98 421 AVDNIVEWRTGYKISGGTPVDNNLNVSSSY-GFLDGNYKYQYDKYNDVNRWVPLAGDIAGLCVYTDSVANPWMSPAGY-N 498 (664) T ss_pred cHHHHHHHhhhccccccchhhhhcCCccce-EEEEcCeEEEecccCCceEEechHHHHHHHHHHhhhcCCcEECcCCc-e Confidence 11111110 01122 2333321 111 245666666654443111222 22 1 Q ss_pred cCCcC-----CCCCCHHHHHHHHhCCCeEEEEEcC-ee-EEecCEEeCC---c--eehhhHHHHHHHHHHHHHHHHHHhc Q lcl|Aclame:pro 178 LAGIT-----SEELKVSEIDAIQKAGGMCYIEKAG-IA-QTSEGKTVSG---E--FIDSIHGDDWIKATIETRLQKLLTE 245 (331) Q Consensus 178 l~gv~-----~~~~t~t~~~~l~~~~~n~y~~~~g-~~-~~~~G~~~~G---~--~iD~~~~~dwl~~~iq~~l~~l~~~ 245 (331) +.||. ...+++.|.+.|..+|+|+...+-| .. .+...+++++ + ||-+.+-.+|+...|+..+...+-. T Consensus 499 ~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~~G~~~wG~rT~~~~~s~~~~i~vrR~~~~i~~si~~~~~~~v~e 578 (664) T protein:vir:98 499 RGQIRNCIKLAIEPRTAHRDAMYQVQINPVTGFAGGSGFVLYGDKTLTSVPSPFDRINVRRLFNMIKKDIGDNAKYKLFE 578 (664) T ss_pred eeeeeccccceeecChhhHHHHHhCCCeEEEEeeCCCcEEEEcccccCCCCcccceEeehhHHHHHHHHHHHHHHHhhcC Confidence 22322 2347889999999999999988755 34 4667777665 2 6889999999999999888765433 Q ss_pred CCCCCcCHHHHHHHHHHHHHHHHHHHhcCcccccccCCCcceEEEccchhcCCHHHHHhcccCCeEEEEEEcceEEEEEE Q lcl|Aclame:pro 246 TDKLTFDARGIALLQSELTTVLNEGFANGIIDSNDETGEPNFSITALQRSDLNDDDIAKRNYKGLSFRYKRSGAIHSVDV 325 (331) Q Consensus 246 ~~kipyt~~G~~~i~~~v~~vl~~~~~~G~I~~g~~~~~~~~~v~~~~~~~~~~~dr~~R~~~~i~~~~~~aGaIh~v~i 325 (331) |.++.=...|+..|+.-|+..+++|.|. +|.|++. .+..|++|+.+.++. +.+.+...-.+++|.+ T Consensus 579 ----pn~~~l~~~i~~~i~~~L~~l~~~gal~--------g~~V~~d-~~~nt~~~i~~G~~~-~~i~~~p~~pae~I~~ 644 (664) T protein:vir:98 579 ----NNDDFTRASFRMDTGQYMTNIRALGGCY--------DYRVICD-TTNNTPDVIDRNEFV-ATVYVKPPRSINYITL 644 (664) T ss_pred ----CCCHHHHHHHHHHHHHHHHHHHhcCcee--------eeEEEEc-CCCCCHHHhhCCeEE-EEEEEEecCCcceEEE Confidence 5678878999999999999999999996 3899998 577899999999885 8999999999999998 Q ss_pred EEEEeC Q lcl|Aclame:pro 326 YGEVEV 331 (331) Q Consensus 326 ~~~v~~ 331 (331) +..-+- T Consensus 645 ~~~q~~ 650 (664) T protein:vir:98 645 NFVATS 650 (664) T ss_pred EEEEee Confidence 866654 No 58 >protein:vir:4517 Length: 498 # NCBI annotation: tail sheath protein # Family: family:all:369 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599043;genbank:gi:19549001;genbank:GeneID:935236 Probab=97.04 E-value=0.0002 Score=40.88 Aligned_cols=313 Identities=13% Similarity=0.111 Sum_probs=153.8 Q ss_pred CCCceeeEEEEEeecccccccccceeEEEEccCCcc-eEEEech----hhhccCCCCChHHHHHHHHHHccCCCcc---- Q lcl|Aclame:pro 1 MVETITDVRVHISVLYPSPRIGLGRPAIFVKGTAMG-YKEYTTL----EELKDTFADNTEVYAKAKAVFLQKDRPD---- 71 (331) Q Consensus 1 ~v~~i~dV~v~i~~~~~~~~~~fg~~li~~~~~~~~-~~~yts~----~~v~~~f~~~s~~ykaA~~~fsQ~~~p~---- 71 (331) =.+...++.+.- +.-+.+...-|+.-+-++.+... ...|=.- -.|..+-+...-.-+.++++=++..-|. T Consensus 80 ~~n~~~~l~~i~-~~d~aG~aA~g~it~tg~at~~G~l~l~Igg~~v~v~V~~gdTaa~vA~al~aaina~~~lPVTA~~ 158 (498) T protein:vir:45 80 QTDPFGELYVIA-VPEATGAAATVTLTVTGEATESGTVNVYVGRTRVQAPVTNGDNVTTIASSIQDAINAVPTLPFTASS 158 (498) T ss_pred HhCCcceEEEEe-eCCcccceeEEEEEeecccCCCcEEEEEECCEEEEEEecCCCCHHHHHHHHHHHHhCCCCCceEEEe Confidence 223333443311 11122233344433333322211 0000000 0001110111111111111111111111 Q ss_pred --------------------------------------eEEEEec----cchhHHHHHHHhhcCceeEEEEecCCHHHHH Q lcl|Aclame:pro 72 --------------------------------------TVAVITY----EDTKLLEAAEAYFLKSWHFALLAEFKAADAL 109 (331) Q Consensus 72 --------------------------------------~v~v~~~----~~~~~~~al~~~~~~~~~f~~~~~~~~~~i~ 109 (331) ++.+... ...+...+|.+..+.+|.|+++.-.|.+... T Consensus 159 ~~~~VtlTAr~kG~~GN~I~l~~~~~~~~~ge~~p~Glt~~itamagGag~PD~a~alaal~~~~~~~I~~p~~D~asL~ 238 (498) T protein:vir:45 159 SAGVVTLTARHKGLCGNEIPVSLNYYGFGGGEVLPAGVQIAVATGTAGTGAPVLTGAVAAMADEPFDYIGLPFNDTASVN 238 (498) T ss_pred cCceEEEEeeccCccccceeEEEeeccccccccccceeeEEEEccCCCccCchhHHHHHHhccCCccEEEEeeCCHHHHH Confidence 1111111 1125667777777788889998888888888 Q ss_pred HHHHHHHhcC-------cEE--EEEEe-CChHHHH---hhcccceEEEEEeCCCch----hHHHHHHHHHh---cCCccc Q lcl|Aclame:pro 110 ALSNLIEEQK-------FKF--AVFQV-TAVADIT---PLAKNTRTIAIVHSKTGE----KLDAALIGNVA---SLPVGS 169 (331) Q Consensus 110 alA~w~ea~~-------~~~--~~~~~-t~~~~~~---~~~~~~~t~~~~~~~~~~----~~~aa~~g~~~---~~~~G~ 169 (331) ++.++++.-. .++ .+... ...+.+. ...+..|..++++....+ -.+|+++++++ ..+|.+ T Consensus 239 al~~~L~~~sgRw~~~~q~~g~~~~a~~gT~~~l~t~g~~~N~~~it~~~~~~~~~sp~~~~AAa~aa~~A~~l~~DPAr 318 (498) T protein:vir:45 239 TLVTEMNDTSGRWSYARQLYGHVYTAKTGTLSELVNAGDQFNQQHITLAGYEKETQTPADELAASRTARAAVFIRNDPAR 318 (498) T ss_pred HHHHHHhhhhhhhhHHhhcCeEEEEeccCCHHHHHHhhhccCCceEEEEecCCCCCChHHHHHHHHHHHHHHHhhccccc Confidence 8888886422 222 22322 2333333 334566777776532211 33566666664 556643 Q ss_pred eeeeeeeccCCcCCC----CCCHHHHHHHHhCCCeEEEEEcCeeEEecCEEe-----CCc----eeh--hhHHHHHHHHH Q lcl|Aclame:pro 170 ATWKGRHGLAGITSE----ELKVSEIDAIQKAGGMCYIEKAGIAQTSEGKTV-----SGE----FID--SIHGDDWIKAT 234 (331) Q Consensus 170 ~t~~~k~~l~gv~~~----~~t~t~~~~l~~~~~n~y~~~~g~~~~~~G~~~-----~G~----~iD--~~~~~dwl~~~ 234 (331) .+.-. .|+|+.|. .++.+|.+.|..+|+..+.--.|...+.+..+. .|. |.| +++-.++++.. T Consensus 319 -PL~tl-~L~Gi~~p~~~~r~~~~ern~LL~~Gist~~V~~G~V~I~R~ITTY~~n~~G~~D~syLDi~T~~tl~yvr~~ 396 (498) T protein:vir:45 319 -PTQTG-ELVGMLPAPKGKRFTMTEQQTLLSHGVATAYVESGVLRIQRDVTTYRKNAYGVADNSYLDSETLHTSAYVLRK 396 (498) T ss_pred -ccCce-eecceecCCchhcCChHHHHHHHhCCcceEEEcCCeEEEEeeeeeeeecCCCCcchhhhhhhhHHHHHHHHHH Confidence 22222 36787754 378999999999999999776787666666653 452 665 89999999999 Q ss_pred HHHHHHHHHhcCCCCCcCHH---------HHHHHHHHHHHHHHHHHhcCcccccccC---------CCcceEE--Eccch Q lcl|Aclame:pro 235 IETRLQKLLTETDKLTFDAR---------GIALLQSELTTVLNEGFANGIIDSNDET---------GEPNFSI--TALQR 294 (331) Q Consensus 235 iq~~l~~l~~~~~kipyt~~---------G~~~i~~~v~~vl~~~~~~G~I~~g~~~---------~~~~~~v--~~~~~ 294 (331) +++.+..-|-. .|+.=+.. --..|++.+-.++++....|++..-+.. ++.-.++ ..|+ T Consensus 397 ~r~~i~~kfpR-~KLa~dg~~~~~gq~IvTp~~ir~ell~~y~~le~~givEn~~~~~~~LiVerd~~dpnRln~~~p~- 474 (498) T protein:vir:45 397 LKSVITSKYGR-HKLASDGTRFGPGQAIVTPAVIKGELLATYRQLERAGIVENYELFKQYLVVERDASVPNRLNTLFPP- 474 (498) T ss_pred HHHHhhhhcCC-eeecccCcccCCCCcccchHHHHHHHHHHHHhhhhhccccChhhhcceeEEEECCCCCcEEEEEecc- Confidence 99999876522 22221111 1268999999999999999999642211 1111111 1111 Q ss_pred hcCCHHHHHhcccCCeEEEEEEcce Q lcl|Aclame:pro 295 SDLNDDDIAKRNYKGLSFRYKRSGA 319 (331) Q Consensus 295 ~~~~~~dr~~R~~~~i~~~~~~aGa 319 (331) +.-..=|---..-.+.+.|.-++| T Consensus 475 -d~vn~L~V~A~~~~f~lq~~~~~~ 498 (498) T protein:vir:45 475 -DYVNQLRVFAVVNQFRLQYSEESA 498 (498) T ss_pred -cccCchhhhhhhhhhheehhhcCC Confidence 111111111111123333444444 No 59 >protein:vir:100539 Length: 663 # NCBI annotation: gp18 tail sheath monomer # Family: family:all:661 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656380;genbank:gi:109290131;genbank:GeneID:4156517 Probab=96.89 E-value=0.00028 Score=40.08 Aligned_cols=303 Identities=13% Similarity=0.040 Sum_probs=151.0 Q ss_pred CCCceeeEEEEEeecccccccccceeEEEEccCCcceE--EEechhhhccCCCCChHHHHHHHHHHccCCCcc--eEEEE Q lcl|Aclame:pro 1 MVETITDVRVHISVLYPSPRIGLGRPAIFVKGTAMGYK--EYTTLEELKDTFADNTEVYAKAKAVFLQKDRPD--TVAVI 76 (331) Q Consensus 1 ~v~~i~dV~v~i~~~~~~~~~~fg~~li~~~~~~~~~~--~yts~~~v~~~f~~~s~~ykaA~~~fsQ~~~p~--~v~v~ 76 (331) +..+...+.+... .. ....+ ++......... ....+.+. +......|..+.. .. .|.+. .+.+. T Consensus 272 ~~~~~~~~~~~~~--g~-~~e~~----~ls~~~~~~~~~~~~~~~~~~---~~~~~s~~v~~~~-~~-~~~~~~~~~~l~ 339 (663) T protein:vir:10 272 MTDDQFAIIVRRD--GI-VVEST----VLSTRRGDRDVYGNNIFMDDY---FRNGSSNFIYASS-VN-WPAGFTGIIQLG 339 (663) T ss_pred ccchhhcccccCC--Cc-cccee----eeeccccccccchhhhhhhhh---hcCcccceeEeec-cc-cCcccceeEEec Confidence 1112222211110 00 00011 11111111000 00000000 0000000000000 00 01110 01110 Q ss_pred e-cc------chhHHHHHHHh---hcCceeEEEEecC---C----HHHHHHHHHHHHhcCcEEEEEEeCC---------- Q lcl|Aclame:pro 77 T-YE------DTKLLEAAEAY---FLKSWHFALLAEF---K----AADALALSNLIEEQKFKFAVFQVTA---------- 129 (331) Q Consensus 77 ~-~~------~~~~~~al~~~---~~~~~~f~~~~~~---~----~~~i~alA~w~ea~~~~~~~~~~t~---------- 129 (331) . .+ ..+...++... ..-...++.+... . ..-+.++...+|....+|.+++... T Consensus 340 gg~~~~~~~~~~d~~~~~~~l~~~~~~~~~~~i~~~~~~~~~~~~~~v~~~l~~~~~~~~~~~ai~d~p~~~~~~~~~~~ 419 (663) T protein:vir:10 340 GGASANNAVGSDELIAGWDLFADREALHVNLMIAGACKSDGVAVASTVQKHVVALADDRQDCVAFVNPPSELLVGVPTTQ 419 (663) T ss_pred ccccCcccchhhhhhhHHhhhccccccCceEEEeecCCCCchhhHHHHHHHHHHHHHhhCCEEEEEecCcccccccchhh Confidence 0 00 11111222111 1123333333221 1 1123445555666555666654321 Q ss_pred -hHHHHh----------------hcccceEEEEEeCC-------Cc---h-hHHHHHHHHHhcCCccceee---eeeecc Q lcl|Aclame:pro 130 -VADITP----------------LAKNTRTIAIVHSK-------TG---E-KLDAALIGNVASLPVGSATW---KGRHGL 178 (331) Q Consensus 130 -~~~~~~----------------~~~~~~t~~~~~~~-------~~---~-~~~aa~~g~~~~~~~G~~t~---~~k~~l 178 (331) ...+.. ..+..+. .++++. .+ . .|.+.++|.++-.+.-+--| ..+. + T Consensus 420 ~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~-~l~~p~~~~~d~~~~~~~~~p~s~~vAGl~Ar~D~~~g~~~span~~-~ 497 (663) T protein:vir:10 420 AVKNIVEWRNGVTTGGEVVDNNMNISSTYA-FISGNYKYQYDKYNDINRWVPLSADIAGLCAYTDQVGHPWMSPAGYR-R 497 (663) T ss_pred hHHHHHHHhhhccccchhhhhhcccCcceE-EEEecceeEecccCCceEEechHHHHHHHHHHhhccCCcEEccCCee-e Confidence 001000 0112233 333321 11 1 24566666655443211122 2221 2 Q ss_pred CCcCC-----CCCCHHHHHHHHhCCCeEEEEEcC-ee-EEecCEEeCCc-----eehhhHHHHHHHHHHHHHHHHHHhcC Q lcl|Aclame:pro 179 AGITS-----EELKVSEIDAIQKAGGMCYIEKAG-IA-QTSEGKTVSGE-----FIDSIHGDDWIKATIETRLQKLLTET 246 (331) Q Consensus 179 ~gv~~-----~~~t~t~~~~l~~~~~n~y~~~~g-~~-~~~~G~~~~G~-----~iD~~~~~dwl~~~iq~~l~~l~~~~ 246 (331) .||.. ..+++.|.+.|..+|+|.+..+-+ .. .+...++++++ ||-+.+-.+|+...|+..+...+-. T Consensus 498 ~~i~g~~~~~~~~~~~~~~~Ln~~gin~i~~~~~~~G~~~wG~rT~s~~~s~~~~i~vrR~~~~i~~si~~~~~~~v~e- 576 (663) T protein:vir:10 498 GQLRNTIKLAIEPKQSLRDTMYQVSINPVTGFAGGDGFVLFGDKMATQVPSPFDRINVRRLFNMLKKNIGDTSKYELFE- 576 (663) T ss_pred cceeccccceeecCchhHHHHHhCCCcEEEEeeCCCcEEEEcccccCCCCcccceEehhhHHHHHHHHHHHHHHHhccC- Confidence 23321 357899999999999999988754 44 46667776652 7889999999999999888764432 Q ss_pred CCCCcCHHHHHHHHHHHHHHHHHHHhcCcccccccCCCcceEEEccchhcCCHHHHHhcccCCeEEEEEEcceEEEEEEE Q lcl|Aclame:pro 247 DKLTFDARGIALLQSELTTVLNEGFANGIIDSNDETGEPNFSITALQRSDLNDDDIAKRNYKGLSFRYKRSGAIHSVDVY 326 (331) Q Consensus 247 ~kipyt~~G~~~i~~~v~~vl~~~~~~G~I~~g~~~~~~~~~v~~~~~~~~~~~dr~~R~~~~i~~~~~~aGaIh~v~i~ 326 (331) |.++.-...|+..++.-|++.+++|.|. +|.|++. .+..|++++.+.++. +.+.+.....+++|.++ T Consensus 577 ---pn~~~l~~~i~~~i~~~L~~l~~~gal~--------gf~V~~d-~~~nt~~~i~~G~~~-~~i~~~p~~pae~I~~~ 643 (663) T protein:vir:10 577 ---NNDAFTRQSFRMEVSQYLDNIRSLGGVY--------DFRVVCD-TTNNTPQVIDSNEFV-ATIYIKAPRSINYITLN 643 (663) T ss_pred ---CCCHHHHHHHHHHHHHHHHHHHhCCcee--------eeEEEEc-CCCCCHHHhhCCeEE-EEEEEEecCCcceEEEE Confidence 6688888999999999999999999996 3889987 567899999988885 89999999999999988 Q ss_pred EEEeC Q lcl|Aclame:pro 327 GEVEV 331 (331) Q Consensus 327 ~~v~~ 331 (331) ...+= T Consensus 644 ~~~~~ 648 (663) T protein:vir:10 644 FVATS 648 (663) T ss_pred EEEEe Confidence 66554 No 60 >protein:vir:101187 Length: 663 # NCBI annotation: tail sheath protein # Family: family:all:661 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932509;genbank:gi:37651635;genbank:GeneID:2610680 Probab=96.86 E-value=0.00029 Score=39.94 Aligned_cols=310 Identities=15% Similarity=0.059 Sum_probs=153.1 Q ss_pred CCCceeeEEEEEeecccccccccceeEEEEc-----------------cCC----------cceEEEechhhhccCCCCC Q lcl|Aclame:pro 1 MVETITDVRVHISVLYPSPRIGLGRPAIFVK-----------------GTA----------MGYKEYTTLEELKDTFADN 53 (331) Q Consensus 1 ~v~~i~dV~v~i~~~~~~~~~~fg~~li~~~-----------------~~~----------~~~~~yts~~~v~~~f~~~ 53 (331) -..+ ++.+.+. .+.. ...+..+.+.. ... ....++..+. +..+.+.. T Consensus 229 ~~Gn--~i~v~i~--~~~~-~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s-~~~~~~~~ 302 (663) T protein:vir:10 229 EIGS--TVEVEIV--SKTA-FNSGAQQTIYPFGGTRTSNARGVIQYGPMTDDQFAIIVRRDGIVVESTVLS-TRKGDRDV 302 (663) T ss_pred cccc--ceeEEec--cccc-ccccccccccccccccccccceeeeeccccccceeEEEecCCcceeeeeee-eccccccc Confidence 0000 1111111 1000 00000000000 000 0000110000 00000000 Q ss_pred hHHHHHHHHHHccCCCcce--------------EEEEe-cc------chhHHHHHHHhhcC---ceeEEEEecC---CH- Q lcl|Aclame:pro 54 TEVYAKAKAVFLQKDRPDT--------------VAVIT-YE------DTKLLEAAEAYFLK---SWHFALLAEF---KA- 105 (331) Q Consensus 54 s~~ykaA~~~fsQ~~~p~~--------------v~v~~-~~------~~~~~~al~~~~~~---~~~f~~~~~~---~~- 105 (331) ...-......|.++..+-. +.+.+ .+ ..+...++..+... ..-.+.+... +. T Consensus 303 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~gg~d~~~~~~~~~~~~~~~~l~~~~~~~~~~~i~~~~~~~~~~ 382 (663) T protein:vir:10 303 YGSNIFMDDYFRNGGSNFIFASSEGWPAGFTGIIQLGGGTSANADVGADELIKGWDLFSDREALHVNLMIAGACGSDGAE 382 (663) T ss_pred chhhhhhhhhhccCcceEEEEeecccCccccceeEcccccCCCccccchhhHHHHHhhhcccccceeEEEeccCCCCchh Confidence 0000011122222211100 11110 00 01122222222211 2223333221 11 Q ss_pred ---HHHHHHHHHHHhcCcEEEEEEeC-----------ChHHHHhh----------------cccceEEEEEeCC------ Q lcl|Aclame:pro 106 ---ADALALSNLIEEQKFKFAVFQVT-----------AVADITPL----------------AKNTRTIAIVHSK------ 149 (331) Q Consensus 106 ---~~i~alA~w~ea~~~~~~~~~~t-----------~~~~~~~~----------------~~~~~t~~~~~~~------ 149 (331) .-..++...++....+|.+++.. ....+... ....+ ..+|++- T Consensus 383 ~~~~v~~al~~~a~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~-~~l~~P~~~~~d~ 461 (663) T protein:vir:10 383 IASTVQKYVVSLADDRQDCVAIVNPPAELMVGIPTSTAVKNIVEWRNGMTGSGEVVDNNMNISSTY-AFIIGNYKYQYDK 461 (663) T ss_pred hHHHHHHHHHHHHHhhCCEEEEEecCcccccccccccchHHHHHHHHhccccccchhhccccCcce-EEEEccceEEecc Confidence 12345556666665566665431 11111110 01122 3344431 Q ss_pred -Cc----hhHHHHHHHHHhcCCccceeee---eee--ccCCcC--CCCCCHHHHHHHHhCCCeEEEEEcC-e-eEEecCE Q lcl|Aclame:pro 150 -TG----EKLDAALIGNVASLPVGSATWK---GRH--GLAGIT--SEELKVSEIDAIQKAGGMCYIEKAG-I-AQTSEGK 215 (331) Q Consensus 150 -~~----~~~~aa~~g~~~~~~~G~~t~~---~k~--~l~gv~--~~~~t~t~~~~l~~~~~n~y~~~~g-~-~~~~~G~ 215 (331) .+ ..+++.++|.++-.+.-.--|+ .|. .+.|+. ...+++.|.+.|..+|+|+...+-| . ..+...+ T Consensus 462 ~~~~~~~~p~s~~vAGl~Ar~D~~~g~~~sPan~~~~~i~g~~~~~~~~~~~~~~~Ln~~gin~i~~~~~~~G~~~wG~r 541 (663) T protein:vir:10 462 YNDINRWVPLAADIAGLCAYTDQVSHPWMSPAGYRRGQIRNCIKLAIEPKQSMRDTMYQVAINPVTGFAGGDGFVLFGDK 541 (663) T ss_pred cCCceEEechhHHHHHHHHHhhccCCceEccCCceeccccccccceeccChhHHHHHhhCCceEEEEEeCCCcEEEEccc Confidence 11 1345666666654443211222 221 122321 2358999999999999999988754 3 3456667 Q ss_pred EeCC-----ceehhhHHHHHHHHHHHHHHHHHHhcCCCCCcCHHHHHHHHHHHHHHHHHHHhcCcccccccCCCcceEEE Q lcl|Aclame:pro 216 TVSG-----EFIDSIHGDDWIKATIETRLQKLLTETDKLTFDARGIALLQSELTTVLNEGFANGIIDSNDETGEPNFSIT 290 (331) Q Consensus 216 ~~~G-----~~iD~~~~~dwl~~~iq~~l~~l~~~~~kipyt~~G~~~i~~~v~~vl~~~~~~G~I~~g~~~~~~~~~v~ 290 (331) ++++ .||-+.+-.+|+...|+..+...+-. |.++.-...|+..++.-|++.+++|.|. +|.|+ T Consensus 542 T~s~~~s~~~~i~vrR~~~~i~~si~~~~~~~v~e----~n~~~l~~~i~~~i~~~L~~l~~~gal~--------g~~v~ 609 (663) T protein:vir:10 542 MATQVPSPFDRINVRRLFNMLKKNIGDTSKYELFE----NNDAFTRQSFRMETSQYLDGIRSLGGCY--------DFRVV 609 (663) T ss_pred ccCCCCcccceEehhhHHHHHHHHHHHHHHHhccC----CCCHHHHHHHHHHHHHHHHHHHhcCcee--------eeEEE Confidence 7665 26889999999999999888764432 5688888999999999999999999996 38999 Q ss_pred ccchhcCCHHHHHhcccCCeEEEEEEcceEEEEEEEEEEeC Q lcl|Aclame:pro 291 ALQRSDLNDDDIAKRNYKGLSFRYKRSGAIHSVDVYGEVEV 331 (331) Q Consensus 291 ~~~~~~~~~~dr~~R~~~~i~~~~~~aGaIh~v~i~~~v~~ 331 (331) +. .+..|++++.+.++. +.+.+.....+++|.++...+- T Consensus 610 ~d-~~~nt~~~i~~G~~~-~~i~~~p~~pae~i~~~~~~~~ 648 (663) T protein:vir:10 610 CD-TTNNTPNVIDRNEFV-GTIYVKPPRSINYITLNMVATS 648 (663) T ss_pred Ec-CCCCCHHHhhCCeEE-EEEEEEecCCcceEEEEEEEee Confidence 87 567899999999886 8999999999999988765444 No 61 >protein:vir:78782 Length: 370 # NCBI annotation: putative tail sheath protein # Family: family:all:669 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285652;genbank:gi:148727158;genbank:GeneID:5220132 Probab=96.69 E-value=0.00041 Score=39.17 Aligned_cols=319 Identities=13% Similarity=0.062 Sum_probs=164.4 Q ss_pred CCCceeeEEEEEeecccccccccceeEEEEccCCcceEE--EechhhhccCCCCChHHHHH-HHHH-HccCCC-cceEEE Q lcl|Aclame:pro 1 MVETITDVRVHISVLYPSPRIGLGRPAIFVKGTAMGYKE--YTTLEELKDTFADNTEVYAK-AKAV-FLQKDR-PDTVAV 75 (331) Q Consensus 1 ~v~~i~dV~v~i~~~~~~~~~~fg~~li~~~~~~~~~~~--yts~~~v~~~f~~~s~~yka-A~~~-fsQ~~~-p~~v~v 75 (331) |...+.==+.|+. +.+.. .---..|+++.++..-.+. ....+++.+-++..+...|. ..+. ...+.. -..++. T Consensus 1 ~~~~v~vn~~n~~-~g~~~-~~er~~lfig~~~~~~g~~~~~~~~sdld~~l~~~ds~lk~~v~aa~~naG~~~~~~~~p 78 (370) T protein:vir:78 1 MWPYVQIYNLNQM-QGPVT-EVERHLLFIGSAASNTGKLLSLNAQSDFDQLLGAADSELKANLLAARDNAGQNWSAAAYV 78 (370) T ss_pred CCceEEEeecccc-CCCcC-ccceeEEEEecccccccceEeecCccCHHHhcCCcChhHHHHHHHHHhCCCCceEEEEEE Confidence 8875432222222 22222 2344667777665432222 22333333334333333333 2222 222211 111211 Q ss_pred EeccchhHHHHHHHhh-cCceeEEEEecC--CHHHHHHHHHHHHh---c--CcEEEEEEeCC------hHH----HHhhc Q lcl|Aclame:pro 76 ITYEDTKLLEAAEAYF-LKSWHFALLAEF--KAADALALSNLIEE---Q--KFKFAVFQVTA------VAD----ITPLA 137 (331) Q Consensus 76 ~~~~~~~~~~al~~~~-~~~~~f~~~~~~--~~~~i~alA~w~ea---~--~~~~~~~~~t~------~~~----~~~~~ 137 (331) .....+..+|++... ...+.|+.+... +.+++.++....+. . ...|+.+.... .++ ..... T Consensus 79 -~~~~~d~~~Av~~a~~~~s~E~V~v~~~~s~~a~~~a~~~~a~el~n~~~Rpv~file~~~~~~~e~w~~y~~~l~al~ 157 (370) T protein:vir:78 79 -LPTDKPWLDAARDAQQTQSFEGVVVLGQEWHQAAINAAHALNQELIAKWGRWQFMLLAVPAIADEQDWATYEAELATLQ 157 (370) T ss_pred -ecCchhHHHHHHHHHhhCCccEEEEecCcchHHHHHHHHHHHHHHHHhcCCeEEEEEeecCCCCcCCHHHHHHHHHHhh Confidence 223457778876554 346678777775 34666665444432 2 23444443211 111 11111 Q ss_pred ---ccce--EEEEEeCCCchhHHHHHHHHHhc------CCcccee-eeeee--ccC-CcCCCCCCHHHHHHHHhCCCeEE Q lcl|Aclame:pro 138 ---KNTR--TIAIVHSKTGEKLDAALIGNVAS------LPVGSAT-WKGRH--GLA-GITSEELKVSEIDAIQKAGGMCY 202 (331) Q Consensus 138 ---~~~~--t~~~~~~~~~~~~~aa~~g~~~~------~~~G~~t-~~~k~--~l~-gv~~~~~t~t~~~~l~~~~~n~y 202 (331) ...+ ..+.+|. .....++||+.. ..|+++. -.-+. .+| .-....++.+.+++|+++|+.+. T Consensus 158 ~gia~~~V~vvp~~~g----~~~G~~aGRL~naavsVadsP~Rv~tG~l~gl~~~p~d~~~~~l~~a~l~aLd~agy~vp 233 (370) T protein:vir:78 158 DGIAASSVSLIPQLWP----TLAGAYAGRLCNRAVSIADSPCRVKTGALVGLGNKPVGKDGIPLPLATLQTLEANRYSVP 233 (370) T ss_pred hccccccceEEeeecc----ccHHHHHHHHhcCeeeecccceeeeccccccccccccccCCcccCHHHHHHHHhCCCeEE Confidence 1222 2333343 236677887532 1233221 11111 011 01234588899999999999999 Q ss_pred EEEcCe--eEEecCEEeC---CceehhhHHHHHHHHHHHHHHHHHH-hcCCCCCcCHHHHHHHHHHHHHHHHHHHhcCcc Q lcl|Aclame:pro 203 IEKAGI--AQTSEGKTVS---GEFIDSIHGDDWIKATIETRLQKLL-TETDKLTFDARGIALLQSELTTVLNEGFANGII 276 (331) Q Consensus 203 ~~~~g~--~~~~~G~~~~---G~~iD~~~~~dwl~~~iq~~l~~l~-~~~~kipyt~~G~~~i~~~v~~vl~~~~~~G~I 276 (331) ..|.|. .++.+|.++. |+|=-.-+.+.+-|..-+.++..+. +....+-=++..++..+......|++..+.+-| T Consensus 234 ~~Y~gy~G~Y~~d~~tl~~~gsDYq~ie~~RVvdKa~R~vR~~ai~~i~D~~lnst~gsia~~~~~~~~~L~ema~s~~i 313 (370) T protein:vir:78 234 MWYPDYDGIYWADGRTLDAEGGDYQVIENLRIAYKVARRMRLRAIARIGDRSFNSTPGSTAAAITYFGKDLREMAKSTTI 313 (370) T ss_pred EeeCCCCceEEeCceEeccCCCChhhhhhhhHHHHHHHHHHHHHHHHhCCcccCCCCcchhHHHHHHHhhHHHHHhhhhh Confidence 999985 4778899875 4554444445555554444443332 222234446677888888899999999999988 Q ss_pred cccccCCCcceEEEccchhcCCHHHHHhcccCCeEEEEEEcceEEEEEEEEEEeC Q lcl|Aclame:pro 277 DSNDETGEPNFSITALQRSDLNDDDIAKRNYKGLSFRYKRSGAIHSVDVYGEVEV 331 (331) Q Consensus 277 ~~g~~~~~~~~~v~~~~~~~~~~~dr~~R~~~~i~~~~~~aGaIh~v~i~~~v~~ 331 (331) ..-+ -|+ +|..|.=.++..+-...++. .|.+..+.=|.=..|+++--|++ T Consensus 314 ~~~~---fpg-eI~~p~d~Di~i~w~s~~~v-~I~~~v~P~~~pk~Itv~I~LDl 363 (370) T protein:vir:78 314 NGQP---FPG-DIASPQDGDIRIQWVAKNLV-SVFVVVRTVDCPKGITVNIMLDL 363 (370) T ss_pred cccc---cce-eEeccCCCcceEEeeccceE-EEEEEEEeccCCceEEEEEEEee Confidence 6421 222 56666533445444444444 47777777777777776666666 No 62 >protein:vir:106427 Length: 679 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944106;genbank:gi:38640150;genbank:GeneID:2658188 Probab=96.47 E-value=0.0006 Score=38.26 Aligned_cols=308 Identities=14% Similarity=0.084 Sum_probs=152.6 Q ss_pred CCCceeeEEE-----------------------EEe------------e--c-ccccccccceeEEEEccCC--cceEEE Q lcl|Aclame:pro 1 MVETITDVRV-----------------------HIS------------V--L-YPSPRIGLGRPAIFVKGTA--MGYKEY 40 (331) Q Consensus 1 ~v~~i~dV~v-----------------------~i~------------~--~-~~~~~~~fg~~li~~~~~~--~~~~~y 40 (331) ...+-+.+.+ ... . . .+.....|. .++..... +.+... T Consensus 235 ~~gn~i~v~~va~~~~~~~~~~~a~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--vvv~~~g~~~~~~~~~ 312 (679) T protein:vir:10 235 TYGDNIKVLMIAYKDYYKFNEAGKIVSVNTINPKVFPTGLDYGNVTPSSYLEFGPQNESQFA--FIVFNNGVAVESKILS 312 (679) T ss_pred ccCCcceEEEEeecccccccccccccccccccccccccccccccceeeeeccccccccccee--eEEecccccccceeee Confidence 0000000000 000 0 0 000000000 00000000 000000 Q ss_pred echhhhccCCCCChHHHHHHHHHHccCC----------Ccc----eEEEEec-cc------hhHHHHHHHhh--cC-cee Q lcl|Aclame:pro 41 TTLEELKDTFADNTEVYAKAKAVFLQKD----------RPD----TVAVITY-ED------TKLLEAAEAYF--LK-SWH 96 (331) Q Consensus 41 ts~~~v~~~f~~~s~~ykaA~~~fsQ~~----------~p~----~v~v~~~-~~------~~~~~al~~~~--~~-~~~ 96 (331) +..++ .+......| ...++..+. .|. .+.+.+. +. .+...+...+. .. ..- T Consensus 313 ~~~~~---~~~~~~~~~--~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 387 (679) T protein:vir:10 313 TKPGD---RDIYGTSIY--INEYFGNGYSSFVQGVAESWPVGYTGVLAFGGGQSSNTDISAAEFMKGWDMFADREHTDVN 387 (679) T ss_pred ccccc---ccccchhhh--hhhhhcCcccceeeeccccccccccceeeccCCccCCCccchhhhhhhhhhhhcccccccc Confidence 00000 000010101 111111110 010 0111000 00 01111111111 11 122 Q ss_pred EEEEecCC-------HHHHHHHHHHHHhcCcEEEEEEeCC-----------hHHHHhhc----------------ccceE Q lcl|Aclame:pro 97 FALLAEFK-------AADALALSNLIEEQKFKFAVFQVTA-----------VADITPLA----------------KNTRT 142 (331) Q Consensus 97 f~~~~~~~-------~~~i~alA~w~ea~~~~~~~~~~t~-----------~~~~~~~~----------------~~~~t 142 (331) ++++.... .+-..++...+|....+|.+++.-. ...+.... +..+. T Consensus 388 ~l~~p~~~~~~~~~~~~v~~~l~~~~~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~s~~~ 467 (679) T protein:vir:10 388 LFIAGAVAGEGAQIASTVQKAVVAIADERRDCLVLISPPREYMINQPAASVVRKLVDWRRGVNQAGISLDDNMNIGTTYA 467 (679) T ss_pred eEEecCCCCCchhhhHHHHHHHHHHHHhhCCeEEEEeccccccccccccchHHHHHHHHhhcccccchhhhhhccCcceE Confidence 33333322 2234566677777777777764211 11111100 11222 Q ss_pred EEEEeCC-------Cch----hHHHHHHHHHhcCCccceee---eeeeccCCc---CC--CCCCHHHHHHHHhCCCeEEE Q lcl|Aclame:pro 143 IAIVHSK-------TGE----KLDAALIGNVASLPVGSATW---KGRHGLAGI---TS--EELKVSEIDAIQKAGGMCYI 203 (331) Q Consensus 143 ~~~~~~~-------~~~----~~~aa~~g~~~~~~~G~~t~---~~k~~l~gv---~~--~~~t~t~~~~l~~~~~n~y~ 203 (331) .+|++. .+. -|.+.++|.++-.+.-+-.| ..+. +.|| .. -.+++.|.+.|..+|+|+.. T Consensus 468 -~~~~p~~~~~d~~~~~~~~~p~sg~vAGl~Ar~D~~~g~~~sPan~~-~~~i~g~~~~~~~~~~~~~~~Ln~~gin~i~ 545 (679) T protein:vir:10 468 -SVDGNYKYQYDKYNDVNRWIPLAADIAGLCARTDTVGQPWQSPAGFN-RGQIVNVIKLAVDTRQAHRDEMYTNGINPIV 545 (679) T ss_pred -EEEccceeeecccCCceEEechHHHHHHHHHHhhccCCcEECcCCee-eccccccccceeecChhhHHhhhhCCceEEE Confidence 333331 111 23566666665444321122 2231 2333 11 24789999999999999999 Q ss_pred EEcCee-EEecCEEeCC---c--eehhhHHHHHHHHHHHHHHHHHHhcCCCCCcCHHHHHHHHHHHHHHHHHHHhcCccc Q lcl|Aclame:pro 204 EKAGIA-QTSEGKTVSG---E--FIDSIHGDDWIKATIETRLQKLLTETDKLTFDARGIALLQSELTTVLNEGFANGIID 277 (331) Q Consensus 204 ~~~g~~-~~~~G~~~~G---~--~iD~~~~~dwl~~~iq~~l~~l~~~~~kipyt~~G~~~i~~~v~~vl~~~~~~G~I~ 277 (331) .+.|.. .+...+++++ + ||-+.|-.+|++..|+......+-. |.++.=...|+..|..-|.+.+++|.|. T Consensus 546 ~~~g~G~~~wG~rT~~~~~s~~~~i~vrR~~~~i~~si~~~~~~~v~e----pn~~~~~~~i~~~i~~fL~~l~~~gal~ 621 (679) T protein:vir:10 546 GFAGQGYILYGDKTASQAPTPFDRINVRRLFNLLKKSISESAKYKLFE----LNDAFTRSSFRSEVGSYLDTIRSLGGIY 621 (679) T ss_pred EecCCeEEEEcccccCCCCcccceEehhhHHHHHHHHHHHHHHHhccC----CCCHHHHHHHHHHHHHHHHHHHhCCcee Confidence 987654 5567777765 2 6888999999999999888765432 4577778999999999999999999997 Q ss_pred ccccCCCcceEEEccchhcCCHHHHHhcccCCeEEEEEEcceEEEEEEEEEEeC Q lcl|Aclame:pro 278 SNDETGEPNFSITALQRSDLNDDDIAKRNYKGLSFRYKRSGAIHSVDVYGEVEV 331 (331) Q Consensus 278 ~g~~~~~~~~~v~~~~~~~~~~~dr~~R~~~~i~~~~~~aGaIh~v~i~~~v~~ 331 (331) +|.|++. .++.+++|+.+.++. +.+.+...-.+++|.++..-+- T Consensus 622 --------gf~v~~d-~~~nt~~~i~~G~~~-~~i~~~p~~pae~i~~~~~~~~ 665 (679) T protein:vir:10 622 --------DFRVVCD-ESNNTPAVIDRNEFV-ATILIKPARSINYITLSFVATS 665 (679) T ss_pred --------eeEEEEc-CCCCCHHHhhCCeEE-EEEEEEecCCccEEEEEEEEee Confidence 3899988 578899999999885 8999999999999998866554 No 63 >protein:vir:4463 Length: 498 # NCBI annotation: Tail Sheath protein # Family: family:all:369 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700386;genbank:gi:23505458;genbank:GeneID:955665 Probab=96.35 E-value=0.00071 Score=37.84 Aligned_cols=313 Identities=12% Similarity=0.124 Sum_probs=152.3 Q ss_pred CCCceeeEEEEEeecccccccccceeEEEEccCCcc-eEEEechh----hhccCCCCChHHHHHHHHHHccCCCc----- Q lcl|Aclame:pro 1 MVETITDVRVHISVLYPSPRIGLGRPAIFVKGTAMG-YKEYTTLE----ELKDTFADNTEVYAKAKAVFLQKDRP----- 70 (331) Q Consensus 1 ~v~~i~dV~v~i~~~~~~~~~~fg~~li~~~~~~~~-~~~yts~~----~v~~~f~~~s~~ykaA~~~fsQ~~~p----- 70 (331) =.+...++.+.--.. +.+...-|++-+-++.+... ...|=.-. .|..+-+...-.-+.++++=++..-| T Consensus 80 ~~n~~~~l~~i~~~D-~aG~aAtg~it~tg~at~~G~l~l~Igg~~v~v~V~~gdTaa~vA~al~aaina~~~lPVTA~~ 158 (498) T protein:vir:44 80 KTDPFGELYVIAVPE-STGAAATVALTVTGEATETGTVNVYTGRTRVQAPVTSGDDAAAVAVSIKDAVNANPDLPFTATS 158 (498) T ss_pred HhCCCceeEEEecCC-cccceeEEEEEeecccCCCcEEEEEECCEEEEEEecCCCCHHHHHHHHHHHHhCCCCCceEEee Confidence 122333333321111 22223333333333322211 00000000 00011011111111111111111111 Q ss_pred -------------------------------------ceEEEEec----cchhHHHHHHHhhcCceeEEEEecCCHHHHH Q lcl|Aclame:pro 71 -------------------------------------DTVAVITY----EDTKLLEAAEAYFLKSWHFALLAEFKAADAL 109 (331) Q Consensus 71 -------------------------------------~~v~v~~~----~~~~~~~al~~~~~~~~~f~~~~~~~~~~i~ 109 (331) -++.+... ...+...+|.+....+|.|+++.-.|.+... T Consensus 159 ~~~~vtlTAr~kG~~GN~I~l~~~~~~~~~ge~~p~Glt~titamsgGag~PDia~alaal~~~~~~~i~~p~~D~asl~ 238 (498) T protein:vir:44 159 EAGVVTLTARHKGLYGNEIPVTLNYYGFGGGEVLPAGVNITVASGVKGAGAPALNDAVAAMGDEPFDYIGLPFNDTASVN 238 (498) T ss_pred ccceEEEEEeccCcccCcceEEEeeccCccccccccceeEEEEcccCCccCchhHHHHHhhccCCccEEEEeecCHHHHH Confidence 11112111 1235667777777788889988888888888 Q ss_pred HHHHHHHhcC-------cEE--EEEE-eCChHHHH---hhcccceEEEEEeCCCc----hhHHHHHHHHHh---cCCccc Q lcl|Aclame:pro 110 ALSNLIEEQK-------FKF--AVFQ-VTAVADIT---PLAKNTRTIAIVHSKTG----EKLDAALIGNVA---SLPVGS 169 (331) Q Consensus 110 alA~w~ea~~-------~~~--~~~~-~t~~~~~~---~~~~~~~t~~~~~~~~~----~~~~aa~~g~~~---~~~~G~ 169 (331) ++.++++.-. .++ .+.. ....+.+. ...+..|..++++.... .-.+|+++++++ ..+|.+ T Consensus 239 al~~~L~~~sgRw~~~~q~~g~~~~a~~gT~a~l~t~g~~~N~~~it~~~~~~~~~sp~~~~AAa~a~~aA~~l~~DPAr 318 (498) T protein:vir:44 239 SMATEMNDSSGRWSYVRQLYGHVYTAKTGTLSELVAAGDQFNLQHITLAGYEKDTQTPADELAASRTARAAVFIRNDPAR 318 (498) T ss_pred HHHHHHhhhhcchHHHhhcCeEEEEeccCCHHHHHHhhhccCCceEEEEecCCCCCCHHHHHHHHHHHHHHHHhhccccc Confidence 8888886422 122 2222 22233333 23355667677664322 233556666665 556753 Q ss_pred eeeeeeeccCCcCCC----CCCHHHHHHHHhCCCeEEEEEcCeeEEecCEEe-----CCc----eeh--hhHHHHHHHHH Q lcl|Aclame:pro 170 ATWKGRHGLAGITSE----ELKVSEIDAIQKAGGMCYIEKAGIAQTSEGKTV-----SGE----FID--SIHGDDWIKAT 234 (331) Q Consensus 170 ~t~~~k~~l~gv~~~----~~t~t~~~~l~~~~~n~y~~~~g~~~~~~G~~~-----~G~----~iD--~~~~~dwl~~~ 234 (331) .+.-. .|+|+.|. .++.+|.+.|..+|+..+.--.|...+.+..+. .|. |.| +++-.++++.. T Consensus 319 -PL~tl-~L~Gi~~p~~~~r~~~~ern~LL~~Gist~~V~~G~V~I~R~ITTY~~n~~G~~D~syLDi~T~~tl~yvr~~ 396 (498) T protein:vir:44 319 -PTQTG-ELVDMLPAPKGKRFTTTEQQTLLSHGVATAYVESGVLRIQRDITTYRKNAYGVADNSYLDSETLHTSAYVLRR 396 (498) T ss_pred -ccCce-eecccccCCchhcCChHHHHHHHhcCcceEEEcCCeEEEEeeeeeeeecCCCCcchhhhhhhhHHHHHHHHHH Confidence 22222 37788765 378999999999999999776787666666653 452 665 89999999999 Q ss_pred HHHHHHHHHhc----CCCCCcCHHHH-----HHHHHHHHHHHHHHHhcCcccccccC---------CCcceE--EEccch Q lcl|Aclame:pro 235 IETRLQKLLTE----TDKLTFDARGI-----ALLQSELTTVLNEGFANGIIDSNDET---------GEPNFS--ITALQR 294 (331) Q Consensus 235 iq~~l~~l~~~----~~kipyt~~G~-----~~i~~~v~~vl~~~~~~G~I~~g~~~---------~~~~~~--v~~~~~ 294 (331) +++.+..-|-. .+..++. .|. ..|++.+-.++++....|++..-+.. ++.-.+ +..|+ T Consensus 397 ~r~~i~~kfpR~KLa~d~~~~~-~gq~IvTp~~ir~eli~~y~~le~~givEn~~~~~~~LiVerd~~dpnRln~~~p~- 474 (498) T protein:vir:44 397 LKSVITSKYGRHKLANDGTRFG-SGQAIVTPAVIRGELGSTYRQMEREGIVENFDLFQQHLIVERNANDSNRLDVLFPP- 474 (498) T ss_pred HHHHhhhhcCCcccccCCcccC-CCcccccHHHHHHHHHHHHHhhhhhccccChhhhcceeEEEECCCCCcEEEEEecc- Confidence 99999765522 1222222 222 58999999999999999999642211 010111 11111 Q ss_pred hcCCHHHHHhcccCCeEEEEEEcce Q lcl|Aclame:pro 295 SDLNDDDIAKRNYKGLSFRYKRSGA 319 (331) Q Consensus 295 ~~~~~~dr~~R~~~~i~~~~~~aGa 319 (331) +.-..=|---..-.+.+.|.-++| T Consensus 475 -d~vn~L~V~A~~~~f~lq~~~~~~ 498 (498) T protein:vir:44 475 -DYVNQLRVFAVLNQFRLQYSEEAA 498 (498) T ss_pred -cccCchhhhhhhhhhhhhhhhhcC Confidence 111111111111112223333333 No 64 >protein:vir:101804 Length: 663 # NCBI annotation: gp18 # Family: family:all:661 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238881;genbank:gi:66391956;genbank:GeneID:3416631 Probab=96.33 E-value=0.00073 Score=37.77 Aligned_cols=306 Identities=14% Similarity=0.057 Sum_probs=145.4 Q ss_pred CCCceeeEEEEEeecccccccccceeEEEEccCCcceEEEechhhhccCCCCChHHHHHHHHHHccCCCc--ceEEEEe- Q lcl|Aclame:pro 1 MVETITDVRVHISVLYPSPRIGLGRPAIFVKGTAMGYKEYTTLEELKDTFADNTEVYAKAKAVFLQKDRP--DTVAVIT- 77 (331) Q Consensus 1 ~v~~i~dV~v~i~~~~~~~~~~fg~~li~~~~~~~~~~~yts~~~v~~~f~~~s~~ykaA~~~fsQ~~~p--~~v~v~~- 77 (331) +.++...+.+.... .....+. +-...+.. ..+.+...+..-+..+...+..+.. ...|.+ ..+.+.. T Consensus 272 ~~~~~~~~~~~~~~---~~~~~~~--~s~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~l~gg 341 (663) T protein:vir:10 272 MTDDQFAIIVRRDG---IVVESTV--LSTRKGDR---DVYGSNIFMDDYFRNGGSNFIFASS--EGWPAGFTGIIQLGGG 341 (663) T ss_pred ccccceeeEeecCC---cceeeec--cccccccc---ccccchhhhhhhhcCCcceEEEEee--cccCccccceeEeccc Confidence 22222222111110 0000110 00000000 0000000000000000000000000 000000 0000000 Q ss_pred c------cchhHHHHHHHhhcC---ceeEEEEecC--C-----HHHHHHHHHHHHhcCcEEEEEEeC-----------Ch Q lcl|Aclame:pro 78 Y------EDTKLLEAAEAYFLK---SWHFALLAEF--K-----AADALALSNLIEEQKFKFAVFQVT-----------AV 130 (331) Q Consensus 78 ~------~~~~~~~al~~~~~~---~~~f~~~~~~--~-----~~~i~alA~w~ea~~~~~~~~~~t-----------~~ 130 (331) . ...+...++...... .-.++++... + ..-..++...++....+|.+++.. .. T Consensus 342 ~d~~~~~~~~~~~~~~~~l~~~~~~~~~~~i~~~~~~~~~~~~~~v~~~l~~~a~~~~~~~ai~d~p~~~~~~~~~~~~~ 421 (663) T protein:vir:10 342 TSANADVGADELIKGWDLFSDREALHVNLMIAGACGSDGAEIASTVQKYVVSLADDRQDCVAIVNPPAELMVGIPTSTAV 421 (663) T ss_pred cCCccccchhhhHHHHHhhhcccccceeEEEeccCCCCchhhHHHHHHHHHHHHHhhCCEEEEEecCcccccccccccch Confidence 0 001111222111111 1112222111 0 111233445555544455444321 11 Q ss_pred HHHHhh----------------cccceEEEEEeCC-------Cch----hHHHHHHHHHhcCCccceee---eeee--cc Q lcl|Aclame:pro 131 ADITPL----------------AKNTRTIAIVHSK-------TGE----KLDAALIGNVASLPVGSATW---KGRH--GL 178 (331) Q Consensus 131 ~~~~~~----------------~~~~~t~~~~~~~-------~~~----~~~aa~~g~~~~~~~G~~t~---~~k~--~l 178 (331) ..+... .+..+. .++++. .+. -+++.++|.++..+.-.--| ..|. .+ T Consensus 422 ~~~~~~~~~~~~~~~~~~~~~~~~s~~~-~~~~p~~~~~d~~~~~~~~~p~sg~vAGl~Ar~D~~~g~~~sPan~~~~~i 500 (663) T protein:vir:10 422 KNIVEWRNGMTGSGEVVDNNMNISSTYA-FIIGNYKYQYDKYNDINRWVPLAADIAGLCAYTDQVSHPWMSPAGYRRGQI 500 (663) T ss_pred HHHHHHHHhccccccchhhhcccCccce-EEEcCceEEecccCCceEEechhHHHHHHHHHhhccCCceEccCCceeccc Confidence 111000 011222 233221 111 24566666665444321122 2231 12 Q ss_pred CCcC--CCCCCHHHHHHHHhCCCeEEEEEcC-ee-EEecCEEeCC---c--eehhhHHHHHHHHHHHHHHHHHHhcCCCC Q lcl|Aclame:pro 179 AGIT--SEELKVSEIDAIQKAGGMCYIEKAG-IA-QTSEGKTVSG---E--FIDSIHGDDWIKATIETRLQKLLTETDKL 249 (331) Q Consensus 179 ~gv~--~~~~t~t~~~~l~~~~~n~y~~~~g-~~-~~~~G~~~~G---~--~iD~~~~~dwl~~~iq~~l~~l~~~~~ki 249 (331) .|+. ...+++.|.+.|..+|+|+...+-| .. .+...+++++ + ||-+.+-.+||...|++.+...+-. T Consensus 501 ~g~~~~~~~~~~~e~~~Ln~~gin~i~~~~~~~G~~~wG~rT~~~~~s~~~~i~vrR~~~~i~~si~~~~~~~v~e---- 576 (663) T protein:vir:10 501 RNCIKLAIEPKQSMRDTMYQVAINPVTGFAGGDGFVLFGDKMATQVPSPFDRINVRRLFNMLKKNIGDTSKYELFE---- 576 (663) T ss_pred cccccceeecChhHHHHHhhCCceEEEEEeCCCcEEEEcccccCCCCcccceEehhhHHHHHHHHHHHHHHHhccC---- Confidence 2332 2357999999999999999888654 33 4566667654 2 6888999999999999888764432 Q ss_pred CcCHHHHHHHHHHHHHHHHHHHhcCcccccccCCCcceEEEccchhcCCHHHHHhcccCCeEEEEEEcceEEEEEEEEEE Q lcl|Aclame:pro 250 TFDARGIALLQSELTTVLNEGFANGIIDSNDETGEPNFSITALQRSDLNDDDIAKRNYKGLSFRYKRSGAIHSVDVYGEV 329 (331) Q Consensus 250 pyt~~G~~~i~~~v~~vl~~~~~~G~I~~g~~~~~~~~~v~~~~~~~~~~~dr~~R~~~~i~~~~~~aGaIh~v~i~~~v 329 (331) |.++.=...|+..+..-|++.+++|.|. +|.|++. .+..|++++.+.++. +.+.+.....+++|.++..- T Consensus 577 pn~~~l~~~i~~~i~~~L~~l~~~gal~--------g~~v~~d-~~~nt~~~i~~G~~~-~~i~~~p~~pae~i~~~~~~ 646 (663) T protein:vir:10 577 NNDAFTRQSFRMETSQYLDGIRSLGGCY--------DFRVVCD-TTNNTPNVIDRNEFV-GTIYVKPPRSINYITLNMVA 646 (663) T ss_pred CCCHHHHHHHHHHHHHHHHHHHhCCcee--------eeEEEEc-CCCCCHHHhhCCeEE-EEEEEEecCCcceEEEEEEE Confidence 5688888999999999999999999996 3899988 577899999999885 89999999999999887665 Q ss_pred eC Q lcl|Aclame:pro 330 EV 331 (331) Q Consensus 330 ~~ 331 (331) += T Consensus 647 ~~ 648 (663) T protein:vir:10 647 TS 648 (663) T ss_pred ee Confidence 44 No 65 >protein:vir:276 Length: 369 # NCBI annotation: putative tail sheath protein # Family: family:all:669 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536656;genbank:gi:17975134;genbank:GeneID:929090 Probab=95.94 E-value=0.0012 Score=36.56 Aligned_cols=317 Identities=15% Similarity=0.072 Sum_probs=159.9 Q ss_pred CCCceeeEEEEEeecccccccccceeEEEEccCC--c--ceEEEechhhhccCCCCC-hHHHHHHHHHHccCCCcceEEE Q lcl|Aclame:pro 1 MVETITDVRVHISVLYPSPRIGLGRPAIFVKGTA--M--GYKEYTTLEELKDTFADN-TEVYAKAKAVFLQKDRPDTVAV 75 (331) Q Consensus 1 ~v~~i~dV~v~i~~~~~~~~~~fg~~li~~~~~~--~--~~~~yts~~~v~~~f~~~-s~~ykaA~~~fsQ~~~p~~v~v 75 (331) |.=--+.| .+++.-+.....--...|+++.++. . ..-....-+++.+-+++. +++=+...++...+.+--..++ T Consensus 1 m~~~~V~i-n~~n~~qg~~~~ver~~lfig~g~~~~~~g~~~~~~~~sdld~~lg~~ds~lk~~v~aa~~naG~~w~a~~ 79 (369) T protein:vir:27 1 MAWPTVII-KILNLMNGPIADIECHFLFVIRGTVSGEVRNLIMVDSTSDLDDVLAEASAEGLAIVKAAQLNGKQAWTAGV 79 (369) T ss_pred CCCCceEE-ecccccCCCcccccceEEEEEeccccccccceEEecCccchHhhcCCcChhHHHHHHHHHhCCCCceEEEE Confidence 66544444 2234333222223556778866542 1 112222333333333333 3332223333222221111111 Q ss_pred -EeccchhHHHHHHHhhc-CceeEEEEecC--CHHHHHHHHHHH---Hhc--CcEEEEEEeC----------ChHHH-H- Q lcl|Aclame:pro 76 -ITYEDTKLLEAAEAYFL-KSWHFALLAEF--KAADALALSNLI---EEQ--KFKFAVFQVT----------AVADI-T- 134 (331) Q Consensus 76 -~~~~~~~~~~al~~~~~-~~~~f~~~~~~--~~~~i~alA~w~---ea~--~~~~~~~~~t----------~~~~~-~- 134 (331) ......+..+|+..... -.+.|+.+... +.+++.++.... .++ ...|+.+..- +..+. . T Consensus 80 ~p~~~~~~~~~Av~~a~~~~s~E~V~v~~p~t~~a~i~aaq~~a~el~~~~~R~vffi~e~~~~~~~~~~~e~w~dy~a~ 159 (369) T protein:vir:27 80 MILSEEDNWQDAVKKANEVSSFEFVVLGFDAETKAMIEDAITLRTELKNSLGREVGVLCQLPAINNDPTNGQTWSEWLAD 159 (369) T ss_pred EEeCCchhHHHHHHhhhhhCCccEEEEecCcccHHHHHHHHHHHHHHHHhcCCeEEEEEeccccCCCccccCCHHHHHHH Confidence 12234567777765543 36677777764 346665544333 232 2344444311 11111 1 Q ss_pred -----hhcccceEE--EEEeCCCchhHHHHHHHHHhc--C----Cccceee-eee--eccCCc-CCCCCCHHHHHHHHhC Q lcl|Aclame:pro 135 -----PLAKNTRTI--AIVHSKTGEKLDAALIGNVAS--L----PVGSATW-KGR--HGLAGI-TSEELKVSEIDAIQKA 197 (331) Q Consensus 135 -----~~~~~~~t~--~~~~~~~~~~~~aa~~g~~~~--~----~~G~~t~-~~k--~~l~gv-~~~~~t~t~~~~l~~~ 197 (331) .-....+.. +.+|...+ ....++||... . .||++-- .-. ..++.- ....++.+.+.+|+++ T Consensus 160 l~al~~g~a~~~V~vv~~~~~~gn--~~G~~aGRl~n~aVsIadsp~RVktG~l~g~~~~p~d~~g~~l~~a~l~aLd~a 237 (369) T protein:vir:27 160 TVDIPKDVASEYISVVPNVHAAGD--TLGKYAGRLANKEVSIADSPARVQTGSVLGNTELMKDKAGKALDLATLKALESN 237 (369) T ss_pred HHHHhhccCcccceeeeeeccccc--hHHHHHHHHHhcccchhcCcceeeecccccccccccCCCCcccCHHHHHHHHhC Confidence 111223333 33343222 35566777532 1 3444321 100 011211 1124788999999999 Q ss_pred CCeEEEEEcCe--eEEecCEEeC---CceehhhHHHHHHHHHHHHHHHHH-HhcCCCCCcCHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 198 GGMCYIEKAGI--AQTSEGKTVS---GEFIDSIHGDDWIKATIETRLQKL-LTETDKLTFDARGIALLQSELTTVLNEGF 271 (331) Q Consensus 198 ~~n~y~~~~g~--~~~~~G~~~~---G~~iD~~~~~dwl~~~iq~~l~~l-~~~~~kipyt~~G~~~i~~~v~~vl~~~~ 271 (331) |+.+...|.|. .++.+|.++. |+|=-.-..+.+-|..=+.++..+ .+....+.-++.+++..+..+..+|++.. T Consensus 238 gysvp~~Y~gy~G~Yw~d~~tl~~~gsDYq~iE~~RVvdKa~R~vR~~Ai~~i~Dr~lnstp~sia~~~~~~~~pLr~M~ 317 (369) T protein:vir:27 238 RIAVPMWYPDYPGQYWTTGRTLDVPGGDYQDIRHIRVAMKAARKVRIRAIARIADRTLNSTPQSIAAAKLYFTQDLRTMA 317 (369) T ss_pred CCeEEEeeCCCCceEEeCceEeccCCCCeehhhhhhHHHHHHHHHHHHHHHHhcCcccccChhHHHHHHHHHhhHHHHHH Confidence 99999999984 4778999875 566555555555555555555444 23455588899999999999999999997 Q ss_pred hcCcccccccCCCcceEEEccchhcCCHHHHHhcccCCeEEEEEEcceEEEEEEEEEEeC Q lcl|Aclame:pro 272 ANGIIDSNDETGEPNFSITALQRSDLNDDDIAKRNYKGLSFRYKRSGAIHSVDVYGEVEV 331 (331) Q Consensus 272 ~~G~I~~g~~~~~~~~~v~~~~~~~~~~~dr~~R~~~~i~~~~~~aGaIh~v~i~~~v~~ 331 (331) +-+ -|| +|..|.-.+++-.- ..+.--.|.+..+.=+.=..++++--|++ T Consensus 318 ks~--fpg--------ei~~P~d~dI~i~w-~~k~~V~I~~~vrP~~~pk~it~~I~ldl 366 (369) T protein:vir:27 318 LTG--VPG--------EIYPPEDEDIQIKW-VNSTDVEIYMSVQPYECPVKITIAISVKQ 366 (369) T ss_pred hhc--CCe--------EEecCCCCceEEEe-eccceEEEEEEEeeccCCceEEEEEEEec Confidence 664 233 36666544443332 12222235555555555556666666666 No 66 >protein:vir:1996 Length: 495 # NCBI annotation: major tail subunit # Family: family:all:369 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050643;genbank:gi:9633530;genbank:GeneID:2636269 Probab=92.97 E-value=0.0093 Score=31.72 Aligned_cols=318 Identities=14% Similarity=0.111 Sum_probs=153.7 Q ss_pred CCCceeeEEEEEeecccccccccceeEEEEccCCcc-eE------EEe-------chhhhcc------CCCCChHHHH-- Q lcl|Aclame:pro 1 MVETITDVRVHISVLYPSPRIGLGRPAIFVKGTAMG-YK------EYT-------TLEELKD------TFADNTEVYA-- 58 (331) Q Consensus 1 ~v~~i~dV~v~i~~~~~~~~~~fg~~li~~~~~~~~-~~------~yt-------s~~~v~~------~f~~~s~~yk-- 58 (331) =.+...++.+.- +.-+.+...-|+.-+-++.+... .. .+. +++.+++ .-..+.|+.. T Consensus 83 ~~n~~~~l~~i~-~~D~aG~aA~g~it~tg~at~~G~l~l~I~g~~v~v~V~~gdTaa~vA~al~aaina~~~lPvTA~~ 161 (495) T protein:vir:19 83 NANRVAELWCIP-QGNGTGNAAVGEISLSGTAGENGSLVTYIAGQRLAVSVAAGATGAALADLLVARIKGQPDLPVTAEV 161 (495) T ss_pred HhCCcceEEEEe-eCChhhceeEEEEEEeecCCCCcEEEEEECCEEEEEEecCCCCHHHHHHHHHHHhcCCccCceEEEe Confidence 112222322211 11011122223333322222110 00 000 0000000 0000000000 Q ss_pred --------------HHHHHHcc-------------CCCc--ceEEEEec----cchhHHHHHHHhhcCceeEEEEecCCH Q lcl|Aclame:pro 59 --------------KAKAVFLQ-------------KDRP--DTVAVITY----EDTKLLEAAEAYFLKSWHFALLAEFKA 105 (331) Q Consensus 59 --------------aA~~~fsQ-------------~~~p--~~v~v~~~----~~~~~~~al~~~~~~~~~f~~~~~~~~ 105 (331) ..++..+. ...| -++.+... ...+...+|.+..+.||.|+++.-.|. T Consensus 162 ~~~~~~~~a~~~VtlTAr~kG~~n~idi~~~~~~ge~~p~Glt~titamsgGag~PDia~alaal~~~~~~~I~~P~tD~ 241 (495) T protein:vir:19 162 RADSGDDDTHADVVLSAKFTGALSAVDVRWNYYAGETTPYGIITAFKAASGKNGNPDISASIAGMGDLQYKYIVMPYTDE 241 (495) T ss_pred eccCCCCcCceeEEEEEeeccccccceeEEEeecccccccceeEEEEecCCCCCCcchHHHHHHhccCCCcEEEEecCcH Confidence 00011111 1111 12222211 123566777777777888888887788 Q ss_pred HHHHHHHHHHHhcC------cEEEEEE-eCChHHHH---hhcccceEEEEEeCCCch--h-HHHHHHHHHh---cCCccc Q lcl|Aclame:pro 106 ADALALSNLIEEQK------FKFAVFQ-VTAVADIT---PLAKNTRTIAIVHSKTGE--K-LDAALIGNVA---SLPVGS 169 (331) Q Consensus 106 ~~i~alA~w~ea~~------~~~~~~~-~t~~~~~~---~~~~~~~t~~~~~~~~~~--~-~~aa~~g~~~---~~~~G~ 169 (331) +...+|.++++..- .-+.+.. ....+.+. ...+..+..++++..... + .+|++++.++ ..+|.+ T Consensus 242 asL~al~~~l~~rw~~~~q~~g~~~~a~~gT~~~l~t~g~~~N~~~it~~~~~gsp~~~~~~AAA~aa~~A~~l~~DPAr 321 (495) T protein:vir:19 242 PNLNLLRTELQERWGPVNQADGFAVTVLSGTYGDISTFGVSRNDHLISCMGIAGAPEPSYLYAATLCAVASQALSIDPAR 321 (495) T ss_pred HHHHHHHHHHHHhhhHHHhcCeEEEEeecCCHHHHHHhhhccCCceEEEEecCCCCCcHHHHHHHHHHHHHHHhhccccc Confidence 88888888887521 1222332 23333333 334566777777643222 1 2455555543 456643 Q ss_pred eeeeeeeccCCcCCCC----CCHHHHHHHHhCCCeEEEE-EcCeeEEecCEEe-----CC----ceeh--hhHHHHHHHH Q lcl|Aclame:pro 170 ATWKGRHGLAGITSEE----LKVSEIDAIQKAGGMCYIE-KAGIAQTSEGKTV-----SG----EFID--SIHGDDWIKA 233 (331) Q Consensus 170 ~t~~~k~~l~gv~~~~----~t~t~~~~l~~~~~n~y~~-~~g~~~~~~G~~~-----~G----~~iD--~~~~~dwl~~ 233 (331) .+.-. .|+|+.|.. ++.+|.+.|..+|+..+.- .+|...+.+..+. .| .|.| +++.+++++. T Consensus 322 -PL~tl-~L~Gi~~p~~~~r~~~~ern~LL~~Gist~~V~~~G~V~I~R~ITTY~~n~~G~~D~syLDi~T~~tl~yvr~ 399 (495) T protein:vir:19 322 -PLQTL-TLPGRMPPAVGDRFTWSERNALLFDGISTFNVNDGGEMQIERMITMYRTNKYGDSDPSYLNVNTIATLSYLRY 399 (495) T ss_pred -ccCce-eecceecCCccccCChHHHHHHHhCCcceEEECCCCeEEEEeeeeeeeecCCCCcchhhhhhHHHHHHHHHHH Confidence 22223 377877544 7899999999999998874 5777766665544 35 2766 8999999999 Q ss_pred HHHHHHHHHHhcC----CCCCcCHH----HHHHHHHHHHHHHHHHHhcCcccccccCCCcceEEEccchhcCCHHHHHhc Q lcl|Aclame:pro 234 TIETRLQKLLTET----DKLTFDAR----GIALLQSELTTVLNEGFANGIIDSNDETGEPNFSITALQRSDLNDDDIAKR 305 (331) Q Consensus 234 ~iq~~l~~l~~~~----~kipyt~~----G~~~i~~~v~~vl~~~~~~G~I~~g~~~~~~~~~v~~~~~~~~~~~dr~~R 305 (331) .+++.+..-|-.. +..++... --..|++.+-+++++....|++..-+. -... +.+ . ...+| .+| T Consensus 400 ~~r~~i~~kfpR~KLa~d~~~~~~gq~IvTp~~ir~ell~~~~~le~~given~~~-~~~~--LiV-e---rd~~d-pnR 471 (495) T protein:vir:19 400 SLRTRITQKFPNYKLASDGTRFATGQAVVTPSVIKTELLALFEEWENAGLVEDFDT-FKEE--LYV-A---RNKDD-KDR 471 (495) T ss_pred HHHHHHhhhcCCcccccCCCCCCCcccccChHHHHHHHHHHHHhhhhhccccChhh-hcce--eEE-E---ECCCC-CcE Confidence 9999998766332 21222111 125799999999999999999963221 1111 111 1 11111 122 Q ss_pred ccCCeEEEEEEcceEEEEEEEEEEeC Q lcl|Aclame:pro 306 NYKGLSFRYKRSGAIHSVDVYGEVEV 331 (331) Q Consensus 306 ~~~~i~~~~~~aGaIh~v~i~~~v~~ 331 (331) . ++.+-..+...-|.+-....+-| T Consensus 472 l--n~~~p~d~vn~L~V~A~~i~f~L 495 (495) T protein:vir:19 472 L--DVLCGPNLINQFRIFAAQVQFIL 495 (495) T ss_pred E--EEEecceeeCceeeeeeeeeeeC Confidence 1 23333444444444444433334 No 67 >protein:vir:3751 Length: 376 # NCBI annotation: orf23 # Family: family:all:669 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043492;genbank:gi:9628627;genbank:GeneID:1261129 Probab=88.61 E-value=0.031 Score=28.83 Aligned_cols=318 Identities=10% Similarity=0.095 Sum_probs=161.8 Q ss_pred CCCceeeEEEEEeecccccccccceeEEEEccCCcceEE--EechhhhccCCCCChHHHHH-HHHHHccCCCc--ceEEE Q lcl|Aclame:pro 1 MVETITDVRVHISVLYPSPRIGLGRPAIFVKGTAMGYKE--YTTLEELKDTFADNTEVYAK-AKAVFLQKDRP--DTVAV 75 (331) Q Consensus 1 ~v~~i~dV~v~i~~~~~~~~~~fg~~li~~~~~~~~~~~--yts~~~v~~~f~~~s~~yka-A~~~fsQ~~~p--~~v~v 75 (331) |..++.=-++|+. +.+.. .---..|+++.++....+. ...-+++-.-|++.+...|. ..+......+- ..+.+ T Consensus 1 ~~~~v~vn~ln~~-qg~~~-~ver~~lfig~~~~~~~~~~~~~~~sdld~~lg~~ds~lk~~v~aa~~naG~~w~a~~~~ 78 (376) T protein:vir:37 1 MFPSVQINALNQL-SGETK-EIERHALFVGVGTTNQGKLLALTPDSDFDKVFGETDTDLKKQVRAAMLNAGQNWFAHVYI 78 (376) T ss_pred CCCeEEEeeeecc-CCCcc-cccceEEEeeccccccCceEEecCCCChHHhhCCCchhHHHHHHHHHhCCCCceEEEEEe Confidence 7775432222222 22332 2345678888765432222 22223333334443333332 22222221111 11222 Q ss_pred EeccchhHHHHHHHhhc-CceeEEEEecC---CHHHHHHHHHH---HHhc--CcEEEEEEeC----------ChHHH-Hh Q lcl|Aclame:pro 76 ITYEDTKLLEAAEAYFL-KSWHFALLAEF---KAADALALSNL---IEEQ--KFKFAVFQVT----------AVADI-TP 135 (331) Q Consensus 76 ~~~~~~~~~~al~~~~~-~~~~f~~~~~~---~~~~i~alA~w---~ea~--~~~~~~~~~t----------~~~~~-~~ 135 (331) -..+..+..+|++.... -.+.|+.+... +.+++.++... ..++ ...|+.+..- +.++. .. T Consensus 79 p~~~~~~~~~Av~~a~~~~s~E~V~v~~p~~t~~a~i~a~qa~a~el~~~~~R~vffile~~g~d~~~~~ge~w~~y~~~ 158 (376) T protein:vir:37 79 AQEDGYDFVECVKKANQTASFEYCVNTRYLGVDKASIGKLQECYAELLAKFGRRTFFIQAVQGINHDQSDGETWDQYVQK 158 (376) T ss_pred cCCChhhHHHHHHHHHhhCCeeEEEEecCcchhHHHHHHHHHHHHHHHHhcCCeEEEEEeccCCCCcccccCCHHHHHHH Confidence 22234677788776643 45677777764 35666655333 3333 2344444432 11111 11 Q ss_pred h------cccceEE--EEEeCCCchhHHHHHHHHHh--c----CCcccee-eeeeeccCCcC------CCCCCHHHHHHH Q lcl|Aclame:pro 136 L------AKNTRTI--AIVHSKTGEKLDAALIGNVA--S----LPVGSAT-WKGRHGLAGIT------SEELKVSEIDAI 194 (331) Q Consensus 136 ~------~~~~~t~--~~~~~~~~~~~~aa~~g~~~--~----~~~G~~t-~~~k~~l~gv~------~~~~t~t~~~~l 194 (331) . ....+.. +.+|. .....++||.. + ..||++- -..+ .+.-+. ...++.+-+.+| T Consensus 159 l~a~~~gia~~~V~vV~~~~g----n~~G~~aGRl~naaVsVadspgRV~tGai~-gl~~~~~p~d~~g~el~~a~l~aL 233 (376) T protein:vir:37 159 LTTLQQTIVADHVCLVPLLFG----NETGVLAGRLANRAVTVADSPARVQTGALV-SLGSANKPLDKDGNELTLAHLKSL 233 (376) T ss_pred HHHHhccccccceeeeeeecc----chHHHHHHHHHhCCcchhcCccceeecccc-cccccccccccCCcccchHHHHHH Confidence 1 1222332 22232 24677788863 2 2456542 2211 121111 124788999999 Q ss_pred HhCCCeEEEEEcCe--eEEecCEEeC---Cc--eehhhHHHHHHHHHHHHHHHHHHhcCCCCCcCHHHHHHHHHHHHHHH Q lcl|Aclame:pro 195 QKAGGMCYIEKAGI--AQTSEGKTVS---GE--FIDSIHGDDWIKATIETRLQKLLTETDKLTFDARGIALLQSELTTVL 267 (331) Q Consensus 195 ~~~~~n~y~~~~g~--~~~~~G~~~~---G~--~iD~~~~~dwl~~~iq~~l~~l~~~~~kipyt~~G~~~i~~~v~~vl 267 (331) +++|+.+.-.|.|. .++.+|.++. |+ +|-.++=.|=...+++.....-+ ....+.-++.+++..+..+..+| T Consensus 234 d~arysvpr~Y~gydG~Yw~dg~tl~~~gsDYq~ie~~RVvdKa~R~vR~~Ai~~i-~Dr~lnstp~sia~~~~~~~~pL 312 (376) T protein:vir:37 234 ETARYSVPMWYPDYDGYYRADGRTLDVEGGDYQVIENLRVVDKVARKVRLLAIGKI-ADRSFNSTTSSTEYHKNYFAKPL 312 (376) T ss_pred HhCCCeEEEeeCCCCceEEeCCeEeccCCCCeeeehhchHHHHHHHHHHHHHHHHh-cCccccCChhHHHHHHHHHhHHH Confidence 99999999999984 4778899875 44 56666666666666655444333 34457888999999999999999 Q ss_pred HHHHhcCcccccccCCCcceEEEccchhcCCHHHHHhcccCCeEEEEEEcceEEEEEEEEEEeC Q lcl|Aclame:pro 268 NEGFANGIIDSNDETGEPNFSITALQRSDLNDDDIAKRNYKGLSFRYKRSGAIHSVDVYGEVEV 331 (331) Q Consensus 268 ~~~~~~G~I~~g~~~~~~~~~v~~~~~~~~~~~dr~~R~~~~i~~~~~~aGaIh~v~i~~~v~~ 331 (331) ++..+.+=|..-.--| .|..|+=.++.-.- ..|..--|.+..+.=+.=..++++--|++ T Consensus 313 r~M~ks~ei~g~~fpg----ei~~P~d~dI~i~w-~sk~~V~I~~~vrPy~cpk~i~~~I~LDl 371 (376) T protein:vir:37 313 RDMSKSATINGKDFPG----ECMPPKDDAITIVW-QSKTKVTIYIKVRPYDCPKEITANIFLDL 371 (376) T ss_pred HHHHhhhhhccccccc----eeecCCCCceEEEe-ccCceEEEEEEEeeecCcceeEEEEEEec Confidence 9998876664210000 24444322222111 12222234454455555555666656666 Done!