Query         lcl|Aclame:protein:vir:3165|NCBI_annot:capsid protein CP67|genbank:acc:NP_665936;genbank:gi:22091122;genbank:GeneID:951267
Match_columns 426
No_of_seqs    100 out of 112
Neff          6.6
Searched_HMMs 1612
Date          Sat Nov 30 07:01:37 2013
Command       /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_19 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_19_vs_rec_db.hhr

 No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
  1 protein:vir:3165 Length: 426 # 100.0 1E-140 9E-144 787.2 32.3 426 1-426 1-426 (426)
  2 protein:vir:95263 Length: 450 100.0 2.7E-86 1.7E-89 489.7 33.2 399 1-426 1-449 (450)
  3 protein:vir:5260 Length: 502 # 100.0 2E-80 1.2E-83 457.6 33.0 406 1-426 1-502 (502)
  4 protein:vir:106730 Length: 501 100.0 2.9E-75 1.8E-78 429.2 32.4 401 1-426 1-500 (501)
  5 protein:vir:101576 Length: 501 100.0 3.2E-75 2E-78 429.0 31.8 401 1-426 1-500 (501)
  6 protein:vir:3636 Length: 501 # 100.0 5.6E-75 3.5E-78 427.6 31.6 401 1-426 1-500 (501)
  7 protein:vir:78611 Length: 501 100.0 4.7E-74 2.9E-77 422.6 32.3 401 1-426 1-500 (501)
  8 protein:vir:94073 Length: 494 100.0 2.3E-73 1.4E-76 418.8 30.5 400 1-426 1-494 (494)
  9 protein:vir:96104 Length: 504 100.0 3.1E-72 1.9E-75 412.6 31.0 405 1-425 2-504 (504)
 10 protein:vir:99586 Length: 507 100.0 4.7E-71 2.9E-74 406.1 29.7 406 1-425 2-507 (507)
 11 protein:vir:80052 Length: 331 100.0 9.5E-71 5.9E-74 404.5 29.1 320 1-426 1-331 (331)
 12 protein:vir:107720 Length: 515 100.0 3.8E-64 2.4E-67 368.3 28.0 407 1-425 1-515 (515)
 13 protein:vir:6079 Length: 396 # 99.1 2.1E-09 1.3E-12 68.2 29.6 358 1-426 1-383 (396)
 14 protein:vir:2035 Length: 396 # 99.0 7.6E-09 4.7E-12 65.1 28.6 358 1-426 1-383 (396)
 15 protein:vir:107865 Length: 477 98.9 1.5E-08 9.1E-12 63.5 29.9 404 1-426 1-467 (477)
 16 protein:vir:10336 Length: 386 98.9 1.8E-08 1.1E-11 63.1 24.9 353 1-426 1-379 (386)
 17 protein:vir:79092 Length: 477 98.8 2.9E-08 1.8E-11 61.9 31.2 409 1-426 1-467 (477)
 18 protein:vir:103993 Length: 390 98.8 3.4E-08 2.1E-11 61.5 27.0 353 1-426 1-378 (390)
 19 protein:vir:78206 Length: 390 98.8 3.4E-08 2.1E-11 61.5 27.0 353 1-426 1-378 (390)
 20 protein:vir:5711 Length: 396 # 98.8 4.7E-08 2.9E-11 60.8 29.4 359 1-426 1-383 (396)
 21 protein:vir:1172 Length: 391 # 98.7 4E-08 2.5E-11 61.1 23.5 358 1-426 1-379 (391)
 22 protein:vir:98553 Length: 395 98.7 9E-08 5.6E-11 59.2 29.3 364 1-426 1-383 (395)
 23 protein:vir:79141 Length: 391 98.7 5.1E-08 3.1E-11 60.6 22.4 360 1-426 1-378 (391)
 24 protein:vir:1845 Length: 392 # 98.6 3E-07 1.9E-10 56.3 27.9 359 1-426 1-380 (392)
 25 protein:vir:96740 Length: 388 98.5 3.7E-07 2.3E-10 55.8 28.9 353 1-426 1-377 (388)
 26 protein:vir:79181 Length: 390 98.4 6.2E-07 3.8E-10 54.6 28.3 358 1-426 1-378 (390)
 27 protein:vir:102957 Length: 437 98.0 6.9E-06 4.3E-09 48.9 26.8 384 1-425 13-437 (437)
 28 protein:vir:100323 Length: 393 98.0 8.4E-06 5.2E-09 48.4 29.7 353 1-426 1-380 (393)
 29 protein:vir:107310 Length: 581 97.6 3.8E-05 2.4E-08 44.8 21.5 342 1-426 184-566 (581)
 30 protein:vir:79798 Length: 717 97.1 0.00017 1.1E-07 41.3 19.9 376 1-426 314-717 (717)
 31 protein:vir:7653 Length: 581 # 97.1 0.00019 1.2E-07 41.0 21.5 330 1-426 205-566 (581)
 32 protein:vir:5663 Length: 671 # 96.9 0.00027 1.7E-07 40.1 23.2 377 1-426 232-661 (671)
 33 protein:vir:104858 Length: 729 96.9 0.00028 1.7E-07 40.0 24.2 399 1-426 279-717 (729)
 34 protein:vir:104477 Length: 749 96.2 0.00089 5.5E-07 37.3 26.8 396 1-426 301-739 (749)
 35 protein:vir:4463 Length: 498 # 96.2 0.00092 5.7E-07 37.2 20.0 397 1-426 14-491 (498)
 36 protein:vir:5833 Length: 742 # 96.1 0.00095 5.9E-07 37.1 24.7 358 1-426 350-736 (742)
 37 protein:vir:4517 Length: 498 # 96.0 0.0011 6.8E-07 36.8 20.5 397 1-426 14-491 (498)
 38 protein:vir:489 Length: 498 # 95.8 0.0014 8.6E-07 36.2 22.3 397 1-426 14-491 (498)
 39 protein:vir:100539 Length: 663 95.6 0.0018 1.1E-06 35.7 21.9 412 1-426 163-648 (663)
 40 protein:vir:1996 Length: 495 # 95.3 0.0024 1.5E-06 34.9 23.8 398 1-426 15-495 (495)
 41 protein:vir:101187 Length: 663 95.3 0.0024 1.5E-06 34.9 22.4 413 1-426 145-648 (663)
 42 protein:vir:80984 Length: 666 94.8 0.0034 2.1E-06 34.1 23.7 412 1-426 156-651 (666)
 43 protein:vir:78986 Length: 436 94.6 0.0039 2.4E-06 33.8 23.6 373 1-425 15-436 (436)
 44 protein:vir:101804 Length: 663 94.4 0.0045 2.8E-06 33.4 23.6 409 1-426 136-648 (663)
 45 protein:vir:6594 Length: 666 # 94.3 0.0048 3E-06 33.3 26.1 361 1-426 226-651 (666)
 46 protein:vir:6894 Length: 660 # 93.7 0.0068 4.2E-06 32.5 22.5 400 1-426 175-646 (660)
 47 protein:vir:108052 Length: 660 93.0 0.009 5.6E-06 31.8 25.3 388 1-426 205-647 (660)
 48 protein:vir:7206 Length: 659 # 92.1 0.013 7.8E-06 31.0 25.3 366 1-426 226-646 (659)
 49 protein:vir:105470 Length: 451 91.3 0.017 1E-05 30.3 24.6 386 1-425 9-451 (451)
 50 protein:vir:103456 Length: 659 89.3 0.027 1.7E-05 29.2 26.9 387 1-426 210-646 (659)
 51 protein:vir:106984 Length: 743 88.4 0.033 2E-05 28.7 25.9 381 1-426 313-732 (743)
 52 protein:vir:99306 Length: 587 88.3 0.033 2E-05 28.7 26.3 395 1-426 6-582 (587)
 53 protein:vir:98824 Length: 774 87.8 0.037 2.3E-05 28.5 24.6 377 1-426 371-767 (774)
 54 protein:vir:63742 Length: 562 86.2 0.048 3E-05 27.8 25.9 397 1-426 6-557 (562)
 55 protein:vir:102819 Length: 648 83.8 0.066 4.1E-05 27.1 20.4 395 1-426 153-645 (648)
 56 protein:vir:106427 Length: 679 83.4 0.069 4.3E-05 26.9 26.7 364 1-426 235-665 (679)
 57 protein:vir:100829 Length: 607 82.0 0.08 5E-05 26.6 25.1 398 1-426 15-596 (607)
 58 protein:vir:98263 Length: 664 79.9 0.1 6.2E-05 26.1 24.6 411 1-426 163-650 (664)
 59 protein:vir:96586 Length: 587 78.0 0.12 7.4E-05 25.7 22.7 388 1-426 114-582 (587)
 60 protein:vir:3788 Length: 376 # 75.6 0.15 9E-05 25.2 25.0 342 1-426 1-371 (376)
 61 protein:vir:78782 Length: 370 74.2 0.16 0.0001 24.9 23.0 341 1-426 1-363 (370)
 62 protein:vir:95741 Length: 587 71.2 0.2 0.00012 24.4 26.6 398 1-426 6-582 (587)
 63 protein:vir:80488 Length: 562 61.6 0.35 0.00022 23.1 26.1 395 1-426 1-557 (562)
 64 protein:vir:80779 Length: 569
46.1 0.75 0.00046 21.3 29.7 395 1-426 6-564 (569) 65 protein:vir:3751 Length: 376 # 23.3 2.3 0.0014 18.6 24.1 345 1-426 1-371 (376) No 1 >protein:vir:3165 Length: 426 # NCBI annotation: capsid protein CP67 # Family: family:all:28419 # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665936;genbank:gi:22091122;genbank:GeneID:951267 Probab=100.00 E-value=1.5e-140 Score=787.20 Aligned_cols=426 Identities=100% Similarity=1.438 Sum_probs=418.5 Q ss_pred CCCceEEEEEeeccccccccCccceEEEecccccccccchhhhheeecHHHHHhccCCCCHHHHHHHHHHhcCCceeeee Q lcl|Aclame:pro 1 MPKQIVEIELTAEIADRPQETFTDAAIVGTAEEEPPDAEFGEVNQYSTSTSVGDDYGEDSDVYTASEAIEEMGAEQWRVM 80 (426) Q Consensus 1 mp~~iVnV~isl~t~a~~~~~Fg~~Lilg~~~~~~~~~~~~~~~~Yts~~~V~~Dfg~~sp~YkAA~~~f~Q~~~~~~~~ 80 (426) ||||||||+|+|+|+|+++|+||.|||||+|+++||+++|+||+||+|+++|++|||++||+||||.++|+|+++++|++ T Consensus 1 m~~~iVnV~Is~~t~A~~~~~Fg~~liigs~~~~~p~~~f~~~~~Yss~~~V~~Dfg~~s~~Y~AA~~~f~Q~~~~~r~~ 80 (426) T protein:vir:31 1 MPKQIVEIELTAEIADRPQETFTDAAIVGTAEEEPPDAEFGEVNQYSTSTSVGDDYGEDSDVYTASEAIEEMGAEQWRVM 80 (426) T ss_pred CCcceEEEEeecccccccccccceeeeeeeccccccccccchhhhhhhHHHHHhcCCCChHHHHHHHHHHhCCceeEEee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eccccccccccccccceeccceeecccccccchhhhhhhcccccccccceeeeeeccccceeeechhheeeecccccchh Q lcl|Aclame:pro 81 VLEATEVTEEELSDGDTIDKVPILGNHEVESPDGDIEFTTDDDPDVEDFDAEIVINSATGDVATSEDSIELTYFHADWSQ 160 (426) Q Consensus 81 v~~~t~v~~~~~~~~~tv~~~~~s~~~~~~~ta~~i~~~~~~~~~~t~~~~~~~~~~~~g~~t~~~~~~~~~~~~~d~~~ 160 (426) ++++++++.++++|+.+|+++|++++++++++++.+++++.++++.+++++++++++.+|+++++..+.+.++++.||++ T Consensus 81 v~~at~~~~~~~t~~~tv~g~~~s~~a~~~~~a~~i~~~~~~~~~~~~~~~~~~~~t~~g~~t~~~~~~~~~~s~~dw~~ 160 (426) T protein:vir:31 81 VLEATEVTEEELSDGDTIDKVPILGNHEVESPDGDIEFTTDDDPDVEDFDAEIVINSATGDVATSEDSIELTYFHADWSQ 160 (426) T ss_pred ccccceeeeccCCcceeecceeeeecccCcchHHHHHHhhccccccccceeeeEeccccceeeccccceeeeeccCcchh Confidence 
99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhhccccceeecccccchhhhHhHhhhhhhhhhcceEEEEEecccccccchhhHHHHHHHhhccCcceEEEEecCCCc Q lcl|Aclame:pro 161 LDEFPSDVNNFAVADRRFDLKGVGVLDETHSWASDEDMGMIANGVNVDDYDSVDEAMDVAHEVAGYVPSGDLMMIVDASD 240 (426) Q Consensus 161 ~~~~~s~~~~~~la~~~~~~~~~~~~~~~~~wa~~~~kl~~~~~~d~~~~~~~~~~~~~a~~~a~~~~rt~~~~~~~~~~ 240 (426) +.+++++++.+.++++++|+++++++++|++|++++++++++++.++..+++.+++++.+|.++++++.+.|++.....+ T Consensus 161 ~~~~~s~~~~~~ia~~~~~~~~~~~~~~~~~wa~~~~i~~va~~~e~~~~~~~~~~~a~~~~~~~y~p~~~~~~~~~~~~ 240 (426) T protein:vir:31 161 LDEFPSDVNNFAVADRRFDLKGVGVLDETHSWASDEDMGMIANGVNVDDYDSVDEAMDVAHEVAGYVPSGDLMMIVDASD 240 (426) T ss_pred hhcccccchhhhhhccccchhhhhhhHhhhhhhhhcceeeeeeccchhhhcchhhhhhhhhcccccccchhheeehhccc Confidence 99999999999999999999999999999999999999999999999999999999999999998888888888888889 Q ss_pred cchhHHHHHHHhhhcccccceeecccccceeeccccccccccccchhHHHHhhcCccEEEEEcCCEEEeeceeecCcccC Q lcl|Aclame:pro 241 DDLAAYQLGKFAVSEPWYNPLWNELPAGETVSKNVGDPEEQGTFEGGDEAEGEGPVNVLIDVSDANRVSNAVTTAGADSD 320 (426) Q Consensus 241 ~~~~aa~~g~~~~~~p~~~~~~~~~~~~~~~~~~k~~~gv~~~~~~~~~~~~~~~~N~~~~~~g~~~~~~~~t~~G~~~s 320 (426) ++..++++++|++++|||+++|+++++++..++++++|||.+++++++++++++++|+|+.|+|+++++++++++|++++ T Consensus 241 ~~~~~~~~~~~aa~~~~~~~~~~~~~~~~~~~~~~~~~gv~~t~~~~~~A~~~~~~n~~~~~~~~~~i~~~~~~~G~~~~ 320 (426) T protein:vir:31 241 DDLAAYQLGKFAVSEPWYNPLWNELPAGETVSKNVGDPEEQGTFEGGDEAEGEGPVNVLIDVSDANRVSNAVTTAGADSD 320 (426) T ss_pred cchhhHHhhhhhhhccccchhhhhccccccceeeccccccccccchhhhhhhcCCceEEEEecCceeeecceeecccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cceeehhhhHHHHHHHHHHHHHHHHhcCCCCcccHHHHHHHHHHHHHHHHHhhcCCCccccceeEecCcccCcHHHHHhh Q lcl|Aclame:pro 321 TSFFDIRRTKVYTAEMLELDLESLQVSDDDVPFTEDGQAMIEDAIKGTMSGLTGSVGQPLAEYEVDVPEWDDDDVDRVNR 400 (426) Q Consensus 321 g~~iD~i~g~dwl~~~iq~~l~~ll~~~~KIp~td~Gi~~i~~~v~~~l~~~v~~~g~~~~~y~~~~p~~~~~~~dra~R 400 (426) 
|+|||++||+|||+++||++|++||+|++||||||.||+||++.|+++|+++|+++|+++++|+|++|+++++++||++| T Consensus 321 G~~iD~~~g~dwl~~~iq~~l~~ll~~~~KIpyt~~Gi~~I~~~i~~~L~~~v~~~g~~~~~y~v~~P~~~~~~~dra~R 400 (426) T protein:vir:31 321 TSFFDIRRTKVYTAEMLELDLESLQVSDDDVPFTEDGQAMIEDAIKGTMSGLTGSVGQPLAEYEVDVPEWDDDDVDRVNR 400 (426) T ss_pred hhhhhhHHHHHHHHHHHHHHHHHHhhcCCCCccchhHHHHHHHHHHHHHHHHhcCCCccccceeecCCCccccchhhhhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cCCCeEEEEEEcccEEEEEEEEEEeC Q lcl|Aclame:pro 401 NWGGIDLDARLAQRAHTFSLGLNVSV 426 (426) Q Consensus 401 ~~~~i~~~~~laGAIh~v~I~g~v~v 426 (426) +|+||+|+|+|+||||.++|+|+|+| T Consensus 401 ~~~~i~~~~~laGAIh~v~I~g~v~v 426 (426) T protein:vir:31 401 NWGGIDLDARLAQRAHTFSLGLNVSV 426 (426) T ss_pred ccCCceEEEEEeCcEEEEEEEEEEeC Confidence 99999999999999999999999999 No 2 >protein:vir:95263 Length: 450 # NCBI annotation: Phage conserved protein # Family: family:all:396 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944896;genbank:gi:38707836;genbank:GeneID:2744049 Probab=100.00 E-value=2.7e-86 Score=489.69 Aligned_cols=399 Identities=14% Similarity=0.100 Sum_probs=283.8 Q ss_pred CCCceEEEEEeeccccccccCccceEEEecccccccccchhhhheeecHHHHHhccCCCCHHHHHHHHHHhcCCceeeee Q lcl|Aclame:pro 1 MPKQIVEIELTAEIADRPQETFTDAAIVGTAEEEPPDAEFGEVNQYSTSTSVGDDYGEDSDVYTASEAIEEMGAEQWRVM 80 (426) Q Consensus 1 mp~~iVnV~isl~t~a~~~~~Fg~~Lilg~~~~~~~~~~~~~~~~Yts~~~V~~Dfg~~sp~YkAA~~~f~Q~~~~~~~~ 80 (426) |+.+||||+|+|+++|+++|+||.+||+|.|.+.+ +|++.|+++++|++|||.+|||||||++||+|+|++.+.+ T Consensus 1 ~~s~iVnV~i~~~~~a~~~~~f~~~l~~~~~~~~~-----~r~~~yss~~~V~~~FG~~S~ey~aA~~yF~q~p~p~~l~ 75 (450) T protein:vir:95 1 MWNPIVNVDITLNTAGTTREGFGLPLFLASTDNFE-----ERVRGYTSLTEVAEDFDENTAAYKAAKQLWSQTPKVTQLY 75 (450) T ss_pred CCCceEEEeecccccccccccceeEEEEcCCCCCc-----cceeeecCHHHHHHhcCCCcHHHHHHHHHHhCCCcccEEE Confidence 99999999999999999999999999999998632 4666699999999999999999999999999999988776 Q ss_pred 
eccccc----ccc----cccccc--ceeccceee---cccccccchhhhhhhcccccccccceeeeeeccccceeeechh Q lcl|Aclame:pro 81 VLEATE----VTE----EELSDG--DTIDKVPIL---GNHEVESPDGDIEFTTDDDPDVEDFDAEIVINSATGDVATSED 147 (426) Q Consensus 81 v~~~t~----v~~----~~~~~~--~tv~~~~~s---~~~~~~~ta~~i~~~~~~~~~~t~~~~~~~~~~~~g~~t~~~~ 147 (426) +.+... ... +..+.. .+|+++..+ .......+...+...+.+.+...++....+..+..|... T Consensus 76 igr~~~~~t~~~~~~~~~~~~g~lt~tv~G~~~~~~~i~~s~a~s~~~va~~~~tai~~~~~~~~~~~~~s~g~~~---- 151 (450) T protein:vir:95 76 IGRRAMQYTVSIPDAVTESTDYSITVAAGGGISQPYQYTAQSSDTAENVLQQFKTQIEADPTIKDKVSVNVTGSNG---- 151 (450) T ss_pred EEeeccchhhhhhhhhccccceeEEEEecceeeeeeEEEEEecCChhhHHHHhhhhhcccceeeeeeeeeeecccc---- Confidence 644321 111 111112 233332211 111111112222222222222221111111111111100 Q ss_pred heeeecccccchhhhhhhccccc--------------e-eec-ccccch------hhhHhHhhhhhhhhhcceEEEEEec Q lcl|Aclame:pro 148 SIELTYFHADWSQLDEFPSDVNN--------------F-AVA-DRRFDL------KGVGVLDETHSWASDEDMGMIANGV 205 (426) Q Consensus 148 ~~~~~~~~~d~~~~~~~~s~~~~--------------~-~la-~~~~~~------~~~~~~~~~~~wa~~~~kl~~~~~~ 205 (426) .....+....+..+.++...... . .+. ....|| ....+...++.|.+++.++|+.+.. T Consensus 152 ~~t~~~~~~~~~~~~~l~~~~~~~~~~g~~aet~~~a~~a~~~~~~~w~~~~~~~~~~~~i~a~a~w~~a~~~~f~~~~~ 231 (450) T protein:vir:95 152 SATMIIAKAGDNDFVKVTTTAQTVYIASTTADTASTALAAIEAYSTDWYFIAAEDRTQQFVLAMASEIQARKKIFFTANS 231 (450) T ss_pred eeeeeeeccccchhhccccccceeEecccccccHHHHHHHHHHhhCCeEEEEecCCCHHHHHHHHHHHhhcCcEEEEEcC Confidence 00111111111111111100000 0 011 112243 2223355678899999999999998 Q ss_pred ccccccch---hhHHHHHHHhhccCcceEEEEecCCCccchhHHHHHHHhhhcccccceeecccccceeecccccccccc Q lcl|Aclame:pro 206 NVDDYDSV---DEAMDVAHEVAGYVPSGDLMMIVDASDDDLAAYQLGKFAVSEPWYNPLWNELPAGETVSKNVGDPEEQG 282 (426) Q Consensus 206 d~~~~~~~---~~~~~~a~~~a~~~~rt~~~~~~~~~~~~~~aa~~g~~~~~~p~~~~~~~~~~~~~~~~~~k~~~gv~~ 282 (426) +...+... ..+...+++++.+|.|++++||+....++++++++|++++.+| ++.+|+||++|||.. 
T Consensus 232 ~~~~~~~~~~~~~~~i~~~l~~~~~~~t~~~y~~~~~~~~~~aa~~g~~~~~~~-----------g~~T~~fk~l~Gv~~ 300 (450) T protein:vir:95 232 DVTALQGTELASANDVPAQLAKNMYTRTVCLWHHAAAEDYPEMAYIAYGAPYDA-----------GSIAWGNAQLTGVAA 300 (450) T ss_pred CchhhhhhhhhcccchHHHHHhccCCeeEEEeeCCCchhHHHHHHHHHhhhccc-----------ceeeeccccccceee Confidence 88776543 4556778899999999999999998899999999999998877 889999999999973 Q ss_pred --------ccchhHHHHhhcCc-cEEEEEcCCEEEeeceeecCcccCcceeehhhhHHHHHHHHHHHHHHHHhcC--CCC Q lcl|Aclame:pro 283 --------TFEGGDEAEGEGPV-NVLIDVSDANRVSNAVTTAGADSDTSFFDIRRTKVYTAEMLELDLESLQVSD--DDV 351 (426) Q Consensus 283 --------~~~~~~~~~~~~~~-N~~~~~~g~~~~~~~~t~~G~~~sg~~iD~i~g~dwl~~~iq~~l~~ll~~~--~KI 351 (426) .|+.+|..+++.++ |+|.++++...+. +|++++|+|||++||+|||+++||++|++||.++ .|| T Consensus 301 ~v~~~~~~~lt~~~~~al~~~~~n~y~~~~~~~~~~-----~G~~~~G~~iD~~~~~~wl~~~iq~~l~~ll~~~~~~Ki 375 (450) T protein:vir:95 301 SLQPSNQRPLTSIQKSALDVRHCNFIDLDGGVPVVR-----RGITSGGEWIDIIRGVDWLESDLKTSLRDLLINQKGGKI 375 (450) T ss_pred eccCccccccchHHHHHHHhCCcEEEEEecCceeee-----CCeeeCcchhHHHHHHHHHHHHHHHHHHHHHHhcCCCCC Confidence 47888999998776 7777766654333 5555566899999999999999999999999765 499 Q ss_pred cccHHHHHHHHHHHHHHHHHhhcCCCccccceeEecCcccC-cHHHHHhhcCCCeEEEEEEcccEEEEEEEEEEeC Q lcl|Aclame:pro 352 PFTEDGQAMIEDAIKGTMSGLTGSVGQPLAEYEVDVPEWDD-DDVDRVNRNWGGIDLDARLAQRAHTFSLGLNVSV 426 (426) Q Consensus 352 p~td~Gi~~i~~~v~~~l~~~v~~~g~~~~~y~~~~p~~~~-~~~dra~R~~~~i~~~~~laGAIh~v~I~g~v~v 426 (426) |||+.|++||++.|+.+|++++++| .+.+|+|++|.+++ +++||++|+|++|+|+|+|+||||.++|+|||+- T Consensus 376 Py~~~G~~~i~a~i~~~l~~a~~~G--~Ia~~~V~~~~~~~~~~~dr~~R~~~~i~~~~~laGAIh~~~i~~~v~~ 449 (450) T protein:vir:95 376 TYDDTGITRIRQVIETSLQRAVNRN--FLSSYTVNVPKASQVALADKKARILKDVTFAGILAGAILDVDLKGTVAY 449 (450) T ss_pred ccChhhHHHHHHHHHHHHHHHHhcC--cccceeEecCChHhcCHHHHhccCCCCeeEEEEEccceEEEEEEEEEEe Confidence 9999999999999999999999866 57899999887655 5789999999999999999999999999999999 No 3 >protein:vir:5260 Length: 502 # NCBI annotation: hypothetical protein # 
Family: family:all:396 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852765;genbank:gi:31544040;uniprot:Q7Y5T5;genbank:GeneID:2753559 Probab=100.00 E-value=2e-80 Score=457.56 Aligned_cols=406 Identities=13% Similarity=0.055 Sum_probs=284.3 Q ss_pred CC---CceEEEEEeeccccccccCccceEEEeccc---ccccccchhhhheeecHHHHHhccCCCCHHHHHHHHHHhcCC Q lcl|Aclame:pro 1 MP---KQIVEIELTAEIADRPQETFTDAAIVGTAE---EEPPDAEFGEVNQYSTSTSVGDDYGEDSDVYTASEAIEEMGA 74 (426) Q Consensus 1 mp---~~iVnV~isl~t~a~~~~~Fg~~Lilg~~~---~~~~~~~~~~~~~Yts~~~V~~Dfg~~sp~YkAA~~~f~Q~~ 74 (426) |+ ++||||+|++++.++.+++||.+||||.+. +++|.+ |++.|+|+++|++|||.+|||||||++||+|.| T Consensus 1 msip~s~ivnV~i~~~~~a~~~~~f~~~l~l~~~~~~~~~~~~~---r~~~~~s~~~V~~~FG~~s~ey~aA~~yF~q~p 77 (502) T protein:vir:52 1 MALSISHIVNVQLNTVPKSAARKSFGIVALFTPEAGQAFADEKT---RYVYVENQRDVEQLFGTNSETAKAAQPFFAQSP 77 (502) T ss_pred CCCCccceeEEeeccccccccccccCceEEEeeccCccccCCcc---ceEEecCHHHHHHhcCCChHHHHHHHHHhcCCC Confidence 76 899999999999999999999999999764 555533 666799999999999999999999999999999 Q ss_pred ceeeeeeccccccccc------------------------cccccceecc-------ceeecccccccchhhhhhhcccc Q lcl|Aclame:pro 75 EQWRVMVLEATEVTEE------------------------ELSDGDTIDK-------VPILGNHEVESPDGDIEFTTDDD 123 (426) Q Consensus 75 ~~~~~~v~~~t~v~~~------------------------~~~~~~tv~~-------~~~s~~~~~~~ta~~i~~~~~~~ 123 (426) +|.+.++.+......+ ....+.+|++ .+.+......+.+..+...+.+. 
T Consensus 78 ~P~~l~igR~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~l~i~i~g~~~t~~~i~lS~~ts~~~vA~~i~~~l~~~ 157 (502) T protein:vir:52 78 RAKQLIVARWQKSASTIEATKNTLSGATLSDDLERFKSVVNGRFSLTIGGDVKKVDGLSFARLADFNAVATKIQEKLTTL 157 (502) T ss_pred ccceEEEEeccccccceeechhhhhhhhhHHhHHHhhhhcCceeEEEecceeeeeeccccccccchhHHHHHHHhhhccc Confidence 9876555443211100 0012223333 33344444445555555444331 Q ss_pred -------cccccceeeeeec-cccceeeec-hhheee-ecccccchhhhhh-----------------hccc-cce-eec Q lcl|Aclame:pro 124 -------PDVEDFDAEIVIN-SATGDVATS-EDSIEL-TYFHADWSQLDEF-----------------PSDV-NNF-AVA 174 (426) Q Consensus 124 -------~~~t~~~~~~~~~-~~~g~~t~~-~~~~~~-~~~~~d~~~~~~~-----------------~s~~-~~~-~la 174 (426) ++.+ ...+.+. ..+|+.+.. ...... .....+.+....+ .+++ ++. .+. T Consensus 158 ~~~~tv~~d~~--~~~F~i~s~ttg~~~~~~~~~a~~~~~~gt~~a~~l~l~~~~~av~v~~~~~g~~aet~~~al~a~~ 235 (502) T protein:vir:52 158 SVAVSIAYDET--GNRFIVSANVAGEDKKTEIDYAIDEGGEGEYIGALLKLENGQASRKVGKNSVSLKKETLGEALFNVA 235 (502) T ss_pred ccceEEEEecC--CceEEEEeccCCCcceeEEEEeecCCcchhHHHHHhccccccceeeeeeecccccccCHHHHHHHHH Confidence 1111 1111111 111111100 000000 0000000000000 0010 000 111 Q ss_pred -ccccch-------hhhHhHhhhhhhhhhcceEEEEEecccccccchhhHHHHHHHhhccCcceEEEEecCCCccchhHH Q lcl|Aclame:pro 175 -DRRFDL-------KGVGVLDETHSWASDEDMGMIANGVNVDDYDSVDEAMDVAHEVAGYVPSGDLMMIVDASDDDLAAY 246 (426) Q Consensus 175 -~~~~~~-------~~~~~~~~~~~wa~~~~kl~~~~~~d~~~~~~~~~~~~~a~~~a~~~~rt~~~~~~~~~~~~~~aa 246 (426) ....|| ....+...++.|.+++.|+|+.+..+....+... ++..+++++.+|.|++++||+. 
.++++++ T Consensus 236 ~~~~~w~~~~~a~~~~~~~~la~a~~iea~~~~f~~~~~d~~~~~~~~-~~i~~~l~a~~~~~t~~~y~~~--~~~~~aa 312 (502) T protein:vir:52 236 EVNNTWYGFTVAAQLTDSEVEAAAKYAQANTKLFGANVIRAEQIEWSA-DNIYKKLYDAGLDHTLAMFDKN--DMYPVSS 312 (502) T ss_pred hccCceEEEEEeecCChhHHHHHHHHHhhcCcEEEEEecCcceecccc-chHHHHHHhccCceeEEEecCC--cchhHHH Confidence 112343 1223355678899999999999999988887764 5566677888999999999975 5789999 Q ss_pred HHHHHhhhcccccceeecccccceeeccccccccc-cccchhHHHHhhcCc-cEEEEEcCCEEEeeceeecCcccCccee Q lcl|Aclame:pro 247 QLGKFAVSEPWYNPLWNELPAGETVSKNVGDPEEQ-GTFEGGDEAEGEGPV-NVLIDVSDANRVSNAVTTAGADSDTSFF 324 (426) Q Consensus 247 ~~g~~~~~~p~~~~~~~~~~~~~~~~~~k~~~gv~-~~~~~~~~~~~~~~~-N~~~~~~g~~~~~~~~t~~G~~~sg~~i 324 (426) ++|++++.++. ..++..+|+||+++||. +.++.+|+.+++.++ |+|..++|... ..+|++++|+|| T Consensus 313 ~~g~~as~~f~-------~~~g~iT~~fk~l~GV~~~~lt~t~~~al~~~~~N~y~~~~~~~~-----~~~G~~~~G~~i 380 (502) T protein:vir:52 313 ALARLLSTNFA-------ANNSTLTLKFKQQPTITADEITATEFAKAKRLGINVYTYFDDVAM-----IAEGTVIGGKFA 380 (502) T ss_pred HHHHHHhcCCC-------cCcceeeecccccCCcccCcCCHHHHHHHHhcCceEEEEecCeeE-----EecCeeeCCchh Confidence 99999999863 34588999999999997 779999999998777 88888866443 345556667899 Q ss_pred ehhhhHHHHHHHHHHHHHHHHhc-CCCCcccHHHHHHHHHHHHHHHHHhhcCCC--------ccc----------cceeE Q lcl|Aclame:pro 325 DIRRTKVYTAEMLELDLESLQVS-DDDVPFTEDGQAMIEDAIKGTMSGLTGSVG--------QPL----------AEYEV 385 (426) Q Consensus 325 D~i~g~dwl~~~iq~~l~~ll~~-~~KIp~td~Gi~~i~~~v~~~l~~~v~~~g--------~~~----------~~y~~ 385 (426) |++||+|||+++||++|+++|.+ ++|||||+.|++||++.|+.+|++++++|. +.. 
++|.+ T Consensus 381 D~~~~~~Wl~~~lq~~l~~~L~~s~~kIPy~~~G~~~l~a~i~~~l~~a~~~G~I~~G~~~~~~~g~~~~~d~~~~gy~v 460 (502) T protein:vir:52 381 DEIVILDWFVDAVQKEVFARLYKSPTKIPLTDKGQAILIAAVEKVCLEGINNGAFAPGKWTGAGFGNLSTGDYLDKGFYV 460 (502) T ss_pred hHHHHHHHHHHHHHHHHHHHHHhcCCCcccChhHHHHHHHHHHHHHHHHHhcCccccccccCcccceeeecccccCceEE Confidence 99999999999999999998865 579999999999999999999999998762 111 46888 Q ss_pred ecCcc-cCcHHHHHhhcCCCeEEEEEEcccEEEEEEEEEEeC Q lcl|Aclame:pro 386 DVPEW-DDDDVDRVNRNWGGIDLDARLAQRAHTFSLGLNVSV 426 (426) Q Consensus 386 ~~p~~-~~~~~dra~R~~~~i~~~~~laGAIh~v~I~g~v~v 426 (426) .+|.+ +++++||++|++|+++|+|+|+||||.|+|+|||-= T Consensus 461 ~~~~~~~~s~~dr~~R~~~~~~~~~~~aGaIh~v~i~~nv~~ 502 (502) T protein:vir:52 461 WAAPMDTLSDSDRQARRATPIQTAVKLAGAIHSSDVIVNYNR 502 (502) T ss_pred EeCchhhCCHHHHHcccCCCeEEEEEECceEEEEEEEEEEeC Confidence 87765 556899999999999999999999999999888777 No 4 >protein:vir:106730 Length: 501 # NCBI annotation: gp04 # Family: family:all:396 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944312;genbank:gi:38638611;genbank:GeneID:2657359 Probab=100.00 E-value=2.9e-75 Score=429.20 Aligned_cols=401 Identities=14% Similarity=0.050 Sum_probs=274.1 Q ss_pred CC------CceEEEEEeeccccccccCccceEEEecccccccccchhhhheeecHHHHHhccCCCCHHHHHHHHHHh--- Q lcl|Aclame:pro 1 MP------KQIVEIELTAEIADRPQETFTDAAIVGTAEEEPPDAEFGEVNQYSTSTSVGDDYGEDSDVYTASEAIEE--- 71 (426) Q Consensus 1 mp------~~iVnV~isl~t~a~~~~~Fg~~Lilg~~~~~~~~~~~~~~~~Yts~~~V~~Dfg~~sp~YkAA~~~f~--- 71 (426) || .+||||+++|.++++..++||.|| |+.+.++|+ +|+++|+|+++|++|||.+|||||||++||+ T Consensus 1 m~~~~ip~s~iV~V~~~v~~~~~~~~~f~~ll-l~~~~~~~~----~r~~~y~s~~~V~~~FG~~S~ey~aA~~yFsg~~ 75 (501) T protein:vir:10 1 MPTTTIPIDQIVQMLPGVIGAGGAPGRLTGLV-LTQDTSVQP----GQLADFFQKTDVENWFGALSNEAKIADAYFPGIV 75 (501) T ss_pred CCcCccccceEEEEeeecccCCCcccccceEE-EecccCCCc----cceeeecCHHHHHHhcCCChHHHHHHHHHhhhhc Confidence 88 599999999999999999999665 454444553 3666699999999999999999999999998 Q ss_pred 
-cCCceeeeeeccccccccc-----------------cccccc--eeccce------eecccccccchhhhhhhcccccc Q lcl|Aclame:pro 72 -MGAEQWRVMVLEATEVTEE-----------------ELSDGD--TIDKVP------ILGNHEVESPDGDIEFTTDDDPD 125 (426) Q Consensus 72 -Q~~~~~~~~v~~~t~v~~~-----------------~~~~~~--tv~~~~------~s~~~~~~~ta~~i~~~~~~~~~ 125 (426) |.|+|.+.++-+..+.... ..+.+. ++++.. .+......+.+..|...+.+... T Consensus 76 ~q~p~P~~l~igR~~~~~~~~~l~g~~l~~~~la~~~~~~g~l~i~i~g~~~~~~i~~s~ats~~~vA~~i~~al~~~~~ 155 (501) T protein:vir:10 76 NGGQLPYDLKFARYVAADAPASVYGIPLTGITLAQLQGYSGTLTVTTAAQHVSANISLAAATSFANAATLIEAAFTSPDF 155 (501) T ss_pred CCCccccEEEEEeecccCccceeeeceehhhhhhhhhheeeEEEEeeccceeeeccccccccCHHHHHHHHHHhhcCCce Confidence 9988766554443211110 001112 222222 22222223445555544443211 Q ss_pred c--cc-ceeeee-eccccceeeechhheeeecccccchhhhhhh------------------ccccceeecccccch--- Q lcl|Aclame:pro 126 V--ED-FDAEIV-INSATGDVATSEDSIELTYFHADWSQLDEFP------------------SDVNNFAVADRRFDL--- 180 (426) Q Consensus 126 ~--t~-~~~~~~-~~~~~g~~t~~~~~~~~~~~~~d~~~~~~~~------------------s~~~~~~la~~~~~~--- 180 (426) . .. ....+. ....+|..++..... ...|.....+++ ..++. .......|| T Consensus 156 tv~~d~~~~~f~i~~~t~G~~~~i~~~t----~~~d~a~~l~Lt~~~~a~v~~~g~~aet~~~Al~a-~~~~~~~Wy~f~ 230 (501) T protein:vir:10 156 VVAYDALRNRFTVVTNTTGTAAAISAVT----GTNNLADELGLSAAAGATLQAAGVAADTPASAMNR-AVGLSRNWATFT 230 (501) T ss_pred EEEEecccceEEEEecccCcceeEEEee----ccccchhhhcccccCceeEEecCcccccHHHHHHH-HHhcccceEEEE Confidence 1 11 111111 111222111100000 000111111111 01111 111223454 Q ss_pred ----hhhHhHhhhhhhhhhcceEE--EEEecccccccchhhHHHHHHHhhccCcceEEEEecCCCccchhHHHHHHHhhh Q lcl|Aclame:pro 181 ----KGVGVLDETHSWASDEDMGM--IANGVNVDDYDSVDEAMDVAHEVAGYVPSGDLMMIVDASDDDLAAYQLGKFAVS 254 (426) Q Consensus 181 ----~~~~~~~~~~~wa~~~~kl~--~~~~~d~~~~~~~~~~~~~a~~~a~~~~rt~~~~~~~~~~~~~~aa~~g~~~~~ 254 (426) ....+...++.|++++.+.| +....+...++....+...+++++.+|.|++++||+. .++++++|++++. 
T Consensus 231 ~a~~~~~~~~la~A~wi~a~~~~f~~~~~~~~~~~~~~~~~~~i~~~l~~~~y~~t~~~y~~~----~~~aa~~g~~as~ 306 (501) T protein:vir:10 231 TAWTAVIADRLAFAAWNSGQAYKYMYVAPDLEAASIVTNNAASFGAQVFAAPYQGTLPLYGDQ----ATAGAVMGYAASI 306 (501) T ss_pred EEecCChHHHHHHHHHHHhcCceEEEEEecCcceeeecccchhHHHHHHhcCCCceEEECCCC----CHHHHHHHHHHhc Confidence 12233556788999998865 4555555566677777888889999999999999853 4788999999988 Q ss_pred cccccceeecccccceeeccccc-cccc-cccchhHHHHhhcCc-cEEEEEc---CCEEEeeceeecCcccCcceeehhh Q lcl|Aclame:pro 255 EPWYNPLWNELPAGETVSKNVGD-PEEQ-GTFEGGDEAEGEGPV-NVLIDVS---DANRVSNAVTTAGADSDTSFFDIRR 328 (426) Q Consensus 255 ~p~~~~~~~~~~~~~~~~~~k~~-~gv~-~~~~~~~~~~~~~~~-N~~~~~~---g~~~~~~~~t~~G~~~sg~~iD~i~ 328 (426) ++ +..+++.+|+||++ +||. +.+..+|..++++++ |.|..|. ....++++|+++| +++|||++| T Consensus 307 nf-------~~~~g~~T~~fkql~~Gv~a~~l~~t~a~al~~~~~N~y~~~~~~~~~~~~~~~G~~sG---~~~wiD~~~ 376 (501) T protein:vir:10 307 NF-------QLRNGRTVLAFRQFNAGVPATAHDLPTANALRSNNYTYIGAYANAANNYTIAYDGKLSG---KFLWVDTYL 376 (501) T ss_pred Cc-------ccCcceeeeeecccCCCcCcccCCHHHHHHHHhcCCeEEEEEecccceeeEEEcceeec---cceehhhHh Confidence 74 45669999999996 8997 669999999998777 7776664 4457788888877 357999999 Q ss_pred hHHHHHHHHHHHHHHHHhcCCCCcccHHHHHHHHHHHHHHHHHhhcCCC--------------------------cc-cc Q lcl|Aclame:pro 329 TKVYTAEMLELDLESLQVSDDDVPFTEDGQAMIEDAIKGTMSGLTGSVG--------------------------QP-LA 381 (426) Q Consensus 329 g~dwl~~~iq~~l~~ll~~~~KIp~td~Gi~~i~~~v~~~l~~~v~~~g--------------------------~~-~~ 381 (426) |+|||+++||++|++||.+++|||||+.|++||++.|+.+|+++++||+ +. -. 
T Consensus 377 g~dWl~~~iq~~l~~ll~~~~kIPyt~~G~~~l~a~i~~~l~~av~nG~Ia~Gv~l~~~q~~~i~~~~g~~~~~~~v~~~ 456 (501) T protein:vir:10 377 DQIYLNAELQRAEFEAMLAYNSLPYNEDGYTANYRAGVDVIDAAVTSGIIRAGVTLTNSQLQQIDAVAGVAGAGALVQTR 456 (501) T ss_pred hHHHHHHHHHHHHHHHHhcCCCcccCHHHHHHHHHHHHHHHHHHHhCcceecCcccCcccceeecccccccccccceecc Confidence 9999999999999999999999999999999999999999999998751 11 12 Q ss_pred ceeEecCcccCcHHHHHhhcCCCeEEEEEEcccEEEEEEEEEEeC Q lcl|Aclame:pro 382 EYEVDVPEWDDDDVDRVNRNWGGIDLDARLAQRAHTFSLGLNVSV 426 (426) Q Consensus 382 ~y~~~~p~~~~~~~dra~R~~~~i~~~~~laGAIh~v~I~g~v~v 426 (426) +|.+..+.++.++++|++|+.|+++|.|+++||||+|+| +.+.| T Consensus 457 Gyy~~~~~~~~~~~~R~~R~~p~~~~~y~~~gaIh~v~i-~s~~v 500 (501) T protein:vir:10 457 GWYFLIGNPANPGQARQNRTSPACTLWYSDGGSIQELTI-GSNAV 500 (501) T ss_pred ceEEeeCcccCChhhhhhcccCceEEEEEeCCceeEEEe-eeeec Confidence 477776666555568999999999999999999999999 88888 No 5 >protein:vir:101576 Length: 501 # NCBI annotation: gp04 # Family: family:all:396 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958108;genbank:gi:41057654;genbank:GeneID:2716834 Probab=100.00 E-value=3.2e-75 Score=428.96 Aligned_cols=401 Identities=14% Similarity=0.058 Sum_probs=276.0 Q ss_pred CC------CceEEEEEeeccccccccCccceEEEecccccccccchhhhheeecHHHHHhccCCCCHHHHHHHHHHh--- Q lcl|Aclame:pro 1 MP------KQIVEIELTAEIADRPQETFTDAAIVGTAEEEPPDAEFGEVNQYSTSTSVGDDYGEDSDVYTASEAIEE--- 71 (426) Q Consensus 1 mp------~~iVnV~isl~t~a~~~~~Fg~~Lilg~~~~~~~~~~~~~~~~Yts~~~V~~Dfg~~sp~YkAA~~~f~--- 71 (426) || .+||||++++.++++..++|+. 
|||+.+.++|+ +||++|+|.++|++|||.+|||||||++||+ T Consensus 1 m~~~~ip~s~iV~V~~~v~~~~~~~~~~~~-l~l~~~~~~~~----~~~~~~~s~~~V~~~FG~~S~ey~aA~~yFsg~~ 75 (501) T protein:vir:10 1 MPTTTIPIDQIVQMLPGVIGAGGAPGRLTG-LVLTQDTSVQP----GQLADFFQETDVENWFGALSNEAKIADAYFPGIV 75 (501) T ss_pred CCCCCcccceEEEEeeecccCCCcccccee-EEEeccCCCCc----cceEEecCHHHHHHhcCCChHHHHHHHHHhhhhc Confidence 88 5999999999999999999995 57777777765 4677799999999999999999999999999 Q ss_pred -cCCceeeeeeccccccccc---------cc--------c--ccceeccceee------cccccccchhhhhhhcccccc Q lcl|Aclame:pro 72 -MGAEQWRVMVLEATEVTEE---------EL--------S--DGDTIDKVPIL------GNHEVESPDGDIEFTTDDDPD 125 (426) Q Consensus 72 -Q~~~~~~~~v~~~t~v~~~---------~~--------~--~~~tv~~~~~s------~~~~~~~ta~~i~~~~~~~~~ 125 (426) |.|+|.+.++-+....... +. + -..++.+...+ ......+.+..|...+..... T Consensus 76 ~q~p~P~~l~igR~~~~~~~~~l~g~~l~~~~la~~~~~sg~l~vti~g~~~~~~i~ls~ats~~~vAs~i~~al~~~~~ 155 (501) T protein:vir:10 76 NGGQLPYDLKFARYVAADAPASVYGIPLTGVTLAQLQGYSGTLTVTTAAQHVSANISLAAATSFANAATLIEAAFTSPDF 155 (501) T ss_pred CCCccccEEEEEeecCCCccceEeccchhhhhhhhcceeeeEEEEeeccceeecccccccccCHHHHHHHHhhhccCCce Confidence 9988766554443221110 00 1 01122222211 111122333333333332210 Q ss_pred c--cc-ceeeeee-ccccceeeechhheeeecccccchhhhhhhc------------------cccceeecccccch--- Q lcl|Aclame:pro 126 V--ED-FDAEIVI-NSATGDVATSEDSIELTYFHADWSQLDEFPS------------------DVNNFAVADRRFDL--- 180 (426) Q Consensus 126 ~--t~-~~~~~~~-~~~~g~~t~~~~~~~~~~~~~d~~~~~~~~s------------------~~~~~~la~~~~~~--- 180 (426) . .. ....+.+ ...+|..++.... ....|.....++++ .++. 
.......|| T Consensus 156 tv~~d~~~~~f~its~ttG~~~~i~~~----~~~~~la~~l~Lt~~~~a~v~~~g~~aet~~~a~~a-~~~~~~~Wy~f~ 230 (501) T protein:vir:10 156 VVAYDALRNRFTVVTNATGTAAAISAV----TGTNNLADELGLSAAAGATLQAAGVAADTPASAMNR-AVGLSRNWATFT 230 (501) T ss_pred EEEEcccCceEEEEeeccCCceeEEEe----eCchhhhhhcCccccccceEEecCcccccHHHHHHH-HHhccCceEEEE Confidence 0 00 0011111 1111111110000 00001111111110 0111 111223454 Q ss_pred ----hhhHhHhhhhhhhhhcceEE--EEEecccccccchhhHHHHHHHhhccCcceEEEEecCCCccchhHHHHHHHhhh Q lcl|Aclame:pro 181 ----KGVGVLDETHSWASDEDMGM--IANGVNVDDYDSVDEAMDVAHEVAGYVPSGDLMMIVDASDDDLAAYQLGKFAVS 254 (426) Q Consensus 181 ----~~~~~~~~~~~wa~~~~kl~--~~~~~d~~~~~~~~~~~~~a~~~a~~~~rt~~~~~~~~~~~~~~aa~~g~~~~~ 254 (426) ....+...++.|++++.+.| +....+...+.....+...+++++.+|.|++++||+ .+++++++|++++. T Consensus 231 ~a~~~~~~~~la~A~wiea~~~~f~~~~~~~~~~~~~~~~~~~i~~~l~~~~y~~t~~~y~~----~~~~aa~~g~~as~ 306 (501) T protein:vir:10 231 TAWTAVIADRLAFAAWNSGQAYKYMYVAPDLEAASIVTNNAASFGAQVFAAPYQGTLPLYGD----QATAGAVMGYAASI 306 (501) T ss_pred EecCCChHHHHHHHHHHHhcCceEEEEEecCchhhhhhhhhhhHHHHHHhcCCCceEEECCC----CcHHHHHHHHHHhh Confidence 12234556788999988865 455555566677777778888999999999999974 45788999999998 Q ss_pred cccccceeecccccceeecccccc-ccc-cccchhHHHHhhcCc-cEEEEEcCC---EEEeeceeecCcccCcceeehhh Q lcl|Aclame:pro 255 EPWYNPLWNELPAGETVSKNVGDP-EEQ-GTFEGGDEAEGEGPV-NVLIDVSDA---NRVSNAVTTAGADSDTSFFDIRR 328 (426) Q Consensus 255 ~p~~~~~~~~~~~~~~~~~~k~~~-gv~-~~~~~~~~~~~~~~~-N~~~~~~g~---~~~~~~~t~~G~~~sg~~iD~i~ 328 (426) ++ +..++..+|+||++| ||. +.+..+|..+++.++ |+|..|.+. ..++++|+++|. 
++|||+++ T Consensus 307 nf-------~~~~g~~T~~fkq~~~Gi~a~~lt~t~a~al~~~~~N~y~~~~~~~~~~~~~~~G~~sG~---~~wiD~~~ 376 (501) T protein:vir:10 307 NF-------QLRNGRTVLAFRQFNAGVPATAHDLPTANALRSNNYTYIGAYANAANNYTIAYDGKLSGK---FLWVDTYL 376 (501) T ss_pred Cc-------ccCccceeeeccccCCCcCcccCCHHHHHHHHhcCCeEEEEeccccceeeEEecCeeecc---ceeehhhh Confidence 85 455699999999997 797 669999999998776 999999543 478888888773 47999999 Q ss_pred hHHHHHHHHHHHHHHHHhcCCCCcccHHHHHHHHHHHHHHHHHhhcCCC--------------------------cc-cc Q lcl|Aclame:pro 329 TKVYTAEMLELDLESLQVSDDDVPFTEDGQAMIEDAIKGTMSGLTGSVG--------------------------QP-LA 381 (426) Q Consensus 329 g~dwl~~~iq~~l~~ll~~~~KIp~td~Gi~~i~~~v~~~l~~~v~~~g--------------------------~~-~~ 381 (426) |+|||+++||++|++||.+++|||||+.|++||++.|+.+|+++++||+ +. -. T Consensus 377 ~~~Wl~~~iq~~l~~ll~~~~kIPyt~~G~~~l~a~v~~~l~~av~nG~I~~Gv~l~~~q~~~i~~~~g~~~~~~~v~~~ 456 (501) T protein:vir:10 377 DQIYLNAELQRAEFEAMLAYNSLPYNEDGYTALYRAGVDVIDAAVTSGIIRAGVTLTNSQLQQIDAAAGVAGAGQLVQTR 456 (501) T ss_pred hHHHHHHHHHHHHHHHHHhcCCcccCHHHHHHHHHHHHHHHHHHHhCceeecCCCCCcccceeeccccCccccccceecc Confidence 9999999999999999999999999999999999999999999998751 11 12 Q ss_pred ceeEecCcccCcHHHHHhhcCCCeEEEEEEcccEEEEEEEEEEeC Q lcl|Aclame:pro 382 EYEVDVPEWDDDDVDRVNRNWGGIDLDARLAQRAHTFSLGLNVSV 426 (426) Q Consensus 382 ~y~~~~p~~~~~~~dra~R~~~~i~~~~~laGAIh~v~I~g~v~v 426 (426) +|.+..+.++.+.+||++|+.|+++|.|+++||||+|+| +.+.| T Consensus 457 Gyy~~~~~~~~~~~~R~~R~~p~~~~~y~~~gaIh~v~i-~s~~v 500 (501) T protein:vir:10 457 GWYFLIGDPANPGQARQNRTTPACTLWYSDGGSIQQLTI-GSNAV 500 (501) T ss_pred ceeEeeccccCChhhhhhccccceEEEEEeCCceeEEEe-eeeec Confidence 477776666666679999999999999999999999999 88888 No 6 >protein:vir:3636 Length: 501 # NCBI annotation: gp04 # Family: family:all:396 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705631;genbank:gi:23752316;genbank:GeneID:955753 Probab=100.00 E-value=5.6e-75 Score=427.65 Aligned_cols=401 Identities=15% Similarity=0.065 Sum_probs=274.5 Q ss_pred 
CC------CceEEEEEeeccccccccCccceEEEecccccccccchhhhheeecHHHHHhccCCCCHHHHHHHHHHh--- Q lcl|Aclame:pro 1 MP------KQIVEIELTAEIADRPQETFTDAAIVGTAEEEPPDAEFGEVNQYSTSTSVGDDYGEDSDVYTASEAIEE--- 71 (426) Q Consensus 1 mp------~~iVnV~isl~t~a~~~~~Fg~~Lilg~~~~~~~~~~~~~~~~Yts~~~V~~Dfg~~sp~YkAA~~~f~--- 71 (426) || .+||||++++.++++..++|+ .|||+.+..+|+ +|+++|+++++|++|||.+|||||||++||+ T Consensus 1 m~~~~ip~s~iV~V~~~v~~~~~~~~~~~-~lllt~~~~~~~----~r~~~y~s~~~V~~~FG~~S~ey~aA~~yFs~~~ 75 (501) T protein:vir:36 1 MPTTTIPIDQIVQMLPGVIGAGGAPGRLT-GLVLTQDTSVQP----GQLADFFQETDVENWFGALSNEAKIADAYFPGIV 75 (501) T ss_pred CCcCCcccceEEEEeeeeccCCCcceeee-eEEEeccCCCCC----cceeeecCHHHHHHhcCCChHHHHHHHHHhhccc Confidence 88 599999999999999999999 567776666654 3666699999999999999999999999998 Q ss_pred -cCCceeeeeeccccccccc-----------------ccc--ccceeccce------eecccccccchhhhhhhcccccc Q lcl|Aclame:pro 72 -MGAEQWRVMVLEATEVTEE-----------------ELS--DGDTIDKVP------ILGNHEVESPDGDIEFTTDDDPD 125 (426) Q Consensus 72 -Q~~~~~~~~v~~~t~v~~~-----------------~~~--~~~tv~~~~------~s~~~~~~~ta~~i~~~~~~~~~ 125 (426) |.|+|.+.++.+....... ..+ -..++.+.. .+......+.+..|...+..... T Consensus 76 ~q~~~P~~l~igR~~~~a~~~~l~g~~l~~~~~a~~~~~sg~l~vti~g~~~~~~i~lS~~ts~~~vA~~i~~al~~~~~ 155 (501) T protein:vir:36 76 NGGQLPYDLKFARYVAADAPASVYGIPLTGVTLAQLQGYSGTLTVTTAAQHVSANISLAAATSFANAATLIEAAFTSPDF 155 (501) T ss_pred CCCccccEEEEEeecCcCcceeEeccchhhhhhhhccceeEEEEEEecceeeeeecccccccCHHHHHHHHhhhhcCcce Confidence 9988765544333211100 001 112233322 22222222344444444433211 Q ss_pred --ccc-ceeee-eeccccceeeechhheeeecccccchhhhhhhc------------------cccceeecccccch--- Q lcl|Aclame:pro 126 --VED-FDAEI-VINSATGDVATSEDSIELTYFHADWSQLDEFPS------------------DVNNFAVADRRFDL--- 180 (426) Q Consensus 126 --~t~-~~~~~-~~~~~~g~~t~~~~~~~~~~~~~d~~~~~~~~s------------------~~~~~~la~~~~~~--- 180 (426) ... ....+ .....+|..++...... ..|.....++.+ .++. 
.......|| T Consensus 156 tv~~d~~~~~f~i~s~t~G~~~~i~~~t~----~~~ia~~l~Lt~~~~a~v~~~g~~~et~~~al~a-~~~~s~~Wy~f~ 230 (501) T protein:vir:36 156 VVAYDALRNRFTVVTNATGTAAAISAVTG----TNNFADEIGLSAAAGATLQAAGVAADTPASAMNR-AVGLSRNWATFT 230 (501) T ss_pred EEEEcCcceeEEEEeccCCcceeeEeeec----ccchhhhhcccccCcceEEecccccccHHHHHHH-HHhccCceEEEE Confidence 111 11111 11122222111110000 001111111110 1111 111223454 Q ss_pred ----hhhHhHhhhhhhhhhcceEE--EEEecccccccchhhHHHHHHHhhccCcceEEEEecCCCccchhHHHHHHHhhh Q lcl|Aclame:pro 181 ----KGVGVLDETHSWASDEDMGM--IANGVNVDDYDSVDEAMDVAHEVAGYVPSGDLMMIVDASDDDLAAYQLGKFAVS 254 (426) Q Consensus 181 ----~~~~~~~~~~~wa~~~~kl~--~~~~~d~~~~~~~~~~~~~a~~~a~~~~rt~~~~~~~~~~~~~~aa~~g~~~~~ 254 (426) ....+...++.|.+++.+.| +....+...+.....++..+++++.+|.|++++||+. .++++++|++++. T Consensus 231 ~a~~~~~~~~la~A~wiea~~~~f~~~~~~~~~~~~~~~~~~~i~~~l~~~~y~~t~~~y~~~----~~~aa~~g~~as~ 306 (501) T protein:vir:36 231 TAWTAVIADRLAFASWNSGQAYKYMYVAPDLEAASIVSNNAASFGAQVFAAPYQGTLPLYGDQ----ATAGAVMGYAASI 306 (501) T ss_pred EecCCChHHHHHHHHHHhhcCceEEEEEecCchhhhhccchhhHHHHHHhcCCCcEEEEcCCC----CHHHHHHHHHHhc Confidence 11233456788999999865 4455555566777778888899999999999999853 4678899999988 Q ss_pred cccccceeecccccceeeccccc-cccc-cccchhHHHHhhcCc-cEEEEE---cCCEEEeeceeecCcccCcceeehhh Q lcl|Aclame:pro 255 EPWYNPLWNELPAGETVSKNVGD-PEEQ-GTFEGGDEAEGEGPV-NVLIDV---SDANRVSNAVTTAGADSDTSFFDIRR 328 (426) Q Consensus 255 ~p~~~~~~~~~~~~~~~~~~k~~-~gv~-~~~~~~~~~~~~~~~-N~~~~~---~g~~~~~~~~t~~G~~~sg~~iD~i~ 328 (426) ++ +..++..+|+||++ |||. 
+.+..+|..++++++ |.|..| ++...++++|+++| +++|||++| T Consensus 307 nf-------~~~~g~~T~~fkq~~~Gi~a~~l~~t~a~al~~~~~N~y~~~~~~~~~~~~~~~G~~sG---~~~wiD~~~ 376 (501) T protein:vir:36 307 NF-------QLRNGRTVLAFRQFNAGVPATVHDLPTANALRSNNYTYIGAYANAANNYTIAYDGKLSG---KFLWVDTYL 376 (501) T ss_pred Cc-------ccCcceeeeeccccCCCcCcCcCCHHHHHHHHhcCCcEEEEEecccceeeEEEcCeeec---cchhhhHHH Confidence 74 45669999999996 8997 668889998998776 766555 45567788888877 357999999 Q ss_pred hHHHHHHHHHHHHHHHHhcCCCCcccHHHHHHHHHHHHHHHHHhhcCCC--------------------------ccc-c Q lcl|Aclame:pro 329 TKVYTAEMLELDLESLQVSDDDVPFTEDGQAMIEDAIKGTMSGLTGSVG--------------------------QPL-A 381 (426) Q Consensus 329 g~dwl~~~iq~~l~~ll~~~~KIp~td~Gi~~i~~~v~~~l~~~v~~~g--------------------------~~~-~ 381 (426) |+||||++||++|++||.+++||||||.|++||++.|+.+|+++++||+ +.. . T Consensus 377 g~dWL~~~iq~~l~~ll~~~~KIPytd~G~~~l~a~i~~~l~~av~nG~I~~Gv~~~~~q~~~I~~~~g~~~~~~~v~~~ 456 (501) T protein:vir:36 377 DQIYLNAELQRAEFEAMLAYNSLPYNEDGYTGLYRAGVDVIDAAVTSGIIRAGVTLTNSQLQQIDKAARVAGAGQLVQTR 456 (501) T ss_pred hHHHHHHHHHHHHHHHHhcCCCCccChhhHHHHHHHHHHHHHHHHhCceeecCCCCCcccceeecccccccccccceecc Confidence 9999999999999999999999999999999999999999999998751 111 2 Q ss_pred ceeEecCcccCcHHHHHhhcCCCeEEEEEEcccEEEEEEEEEEeC Q lcl|Aclame:pro 382 EYEVDVPEWDDDDVDRVNRNWGGIDLDARLAQRAHTFSLGLNVSV 426 (426) Q Consensus 382 ~y~~~~p~~~~~~~dra~R~~~~i~~~~~laGAIh~v~I~g~v~v 426 (426) +|.+..|.++.+++||++|+.|+++|.|+++||||+|+| +.+.| T Consensus 457 Gyy~~~~~~~~~~~~R~~R~~p~~~~~y~~~gaIh~v~i-~s~~v 500 (501) T protein:vir:36 457 GWYFLIGDPANPGQARQNRTTPACTLWYSDGGSIQSLTI-GSNAV 500 (501) T ss_pred ceEEeeCcccCChhhhhhcccCcEEEEEEeCCceeEEEe-eeeee Confidence 477777777666679999999999999999999999999 88888 No 7 >protein:vir:78611 Length: 501 # NCBI annotation: BcepNY3gp03 # Family: family:all:396 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294840;genbank:gi:149882903;genbank:GeneID:5291082 Probab=100.00 E-value=4.7e-74 Score=422.61 Aligned_cols=401 Identities=15% Similarity=0.071 
Sum_probs=273.6 Q ss_pred CC------CceEEEEEeeccccccccCccceEEEecccccccccchhhhheeecHHHHHhccCCCCHHHHHHHHHHh--- Q lcl|Aclame:pro 1 MP------KQIVEIELTAEIADRPQETFTDAAIVGTAEEEPPDAEFGEVNQYSTSTSVGDDYGEDSDVYTASEAIEE--- 71 (426) Q Consensus 1 mp------~~iVnV~isl~t~a~~~~~Fg~~Lilg~~~~~~~~~~~~~~~~Yts~~~V~~Dfg~~sp~YkAA~~~f~--- 71 (426) || .+||||+++|.++++..++|+.| +|+.+..+|+ +|+++|+|+++|++|||.+|||||||++||+ T Consensus 1 m~~~~ip~s~iV~V~~~v~~~~~~~~~~~~l-ll~~~~~~~~----~r~~~y~s~~~V~~~FG~~S~ey~aA~~yFs~~~ 75 (501) T protein:vir:78 1 MPTTTIPIDQIVQMLPGVIGAGGAPGRLTGL-VLTQDTSIQP----GQLADFFQKTDVENWFGGLSNEAVIADAYFPGIV 75 (501) T ss_pred CCcCccccceEEEEeeecccCCCcceeeeeE-EEecCCCCCc----cceeeecCHHHHHHhcCCChHHHHHHHHHhhcCC Confidence 88 59999999999999999999965 5555444543 3666699999999999999999999999999 Q ss_pred -cCCceeeeeeccccccccc-----------------ccccc--ceeccc------eeecccccccchhhhhhhcccccc Q lcl|Aclame:pro 72 -MGAEQWRVMVLEATEVTEE-----------------ELSDG--DTIDKV------PILGNHEVESPDGDIEFTTDDDPD 125 (426) Q Consensus 72 -Q~~~~~~~~v~~~t~v~~~-----------------~~~~~--~tv~~~------~~s~~~~~~~ta~~i~~~~~~~~~ 125 (426) |.|+|.+.++.+....... ..+.+ .++.++ +.+......+.+..|...+.+... T Consensus 76 ~q~~~P~~l~igR~~~~a~~~~l~g~~l~~~~la~~~~~~G~l~iti~g~~~~~~i~~S~~ts~~~vA~~i~~al~a~~~ 155 (501) T protein:vir:78 76 NGGQLPYDLKFARYVAADAPASVYGIPLTGVTLTQLQGYSGTLTVTTAAQHVSSNISLAAATSFANAATLIEAAFTSPDF 155 (501) T ss_pred CCCcccceEEEEeecccCcceeEeccceeccchhhhceeeeEEEEEeccceeeeccccccccCHHHHHHHHHhhhcCcce Confidence 9988765544332211110 00111 122332 112222222444445544443211 Q ss_pred --ccc-ceeeeeec-cccceeeechhheeeecccccchhhhhhhc------------------cccceeecccccch--- Q lcl|Aclame:pro 126 --VED-FDAEIVIN-SATGDVATSEDSIELTYFHADWSQLDEFPS------------------DVNNFAVADRRFDL--- 180 (426) Q Consensus 126 --~t~-~~~~~~~~-~~~g~~t~~~~~~~~~~~~~d~~~~~~~~s------------------~~~~~~la~~~~~~--- 180 (426) ..+ ....+.+. ..+|...+..... ...+.....++++ .++. 
.......|| T Consensus 156 tv~~ds~~~~f~its~t~G~~~~i~~~t----~~~~~a~~l~Lt~~~~a~v~~~g~~aet~~~a~~a-~~~~~~~Wy~f~ 230 (501) T protein:vir:78 156 VVSYDALRNRFVVNTNATGTAAAISAVT----GTNNLADELGLSAAAGASLQAAGVAADTPASAMNR-AVGLSRNWATFT 230 (501) T ss_pred EEEEccccceEEEEeeecCCceeEEEEe----cccchhhhhcccccCceeeEeccccccCHHHHHHH-HHhccCceEEEE Confidence 010 11111111 1111111100000 0000011111110 0111 111223454 Q ss_pred ----hhhHhHhhhhhhhhhcceEE--EEEecccccccchhhHHHHHHHhhccCcceEEEEecCCCccchhHHHHHHHhhh Q lcl|Aclame:pro 181 ----KGVGVLDETHSWASDEDMGM--IANGVNVDDYDSVDEAMDVAHEVAGYVPSGDLMMIVDASDDDLAAYQLGKFAVS 254 (426) Q Consensus 181 ----~~~~~~~~~~~wa~~~~kl~--~~~~~d~~~~~~~~~~~~~a~~~a~~~~rt~~~~~~~~~~~~~~aa~~g~~~~~ 254 (426) ....+...++.|++++.+.| +....+...++...+++..+++++.+|.|++++||+ ++++++++|++++. T Consensus 231 ~a~~~~~~~~lalA~wiea~~~~f~~~~~~~~~~~~~~~~~~~i~~~l~a~~y~~t~~~y~~----~~~~aa~~g~~as~ 306 (501) T protein:vir:78 231 TAWTAVIADRLALASWNSGQAYKYMYVAPDLEPASIVTNNSASFGAQVFAAPYQGTLPLYGD----QATAGAVMGYAASI 306 (501) T ss_pred EecCCCHHHHHHHHHHHHhcCceEEEEEecCCcceeecccchhHHHHHhhcCCCceEEEcCC----cchHHHHHHHHHhc Confidence 11233556788999998865 455555566777777888888999999999999973 46788999999998 Q ss_pred cccccceeecccccceeeccccc-cccc-cccchhHHHHhhcCc-cEEEEEc---CCEEEeeceeecCcccCcceeehhh Q lcl|Aclame:pro 255 EPWYNPLWNELPAGETVSKNVGD-PEEQ-GTFEGGDEAEGEGPV-NVLIDVS---DANRVSNAVTTAGADSDTSFFDIRR 328 (426) Q Consensus 255 ~p~~~~~~~~~~~~~~~~~~k~~-~gv~-~~~~~~~~~~~~~~~-N~~~~~~---g~~~~~~~~t~~G~~~sg~~iD~i~ 328 (426) ++ +..++..+|+||++ +||. +.+..+|..+++.++ |.|..|. ....++++|+++|. 
.+|||+++ T Consensus 307 nf-------~~~~g~~T~~fkq~~~Gv~a~~l~~t~a~al~~~~~N~y~~~~~~~~~~~~~~~G~~sG~---~~wiD~~~ 376 (501) T protein:vir:78 307 NF-------QLRNGRTVLAFRQFNAGVPATAHDLGTANALRSNNYTYIGAYANAANNYTIAYDGKLSGK---FLWVDTYL 376 (501) T ss_pred Cc-------ccCcceeeeeccccCCCcCcccCCHHHHHHHHhcCCeEEEEEecccceeeEEEcCeeecc---ceeehhhh Confidence 75 45669999999995 8997 668999999998776 7776664 45577888888773 47899999 Q ss_pred hHHHHHHHHHHHHHHHHhcCCCCcccHHHHHHHHHHHHHHHHHhhcCCC--------------------------cc-cc Q lcl|Aclame:pro 329 TKVYTAEMLELDLESLQVSDDDVPFTEDGQAMIEDAIKGTMSGLTGSVG--------------------------QP-LA 381 (426) Q Consensus 329 g~dwl~~~iq~~l~~ll~~~~KIp~td~Gi~~i~~~v~~~l~~~v~~~g--------------------------~~-~~ 381 (426) |+|||+++||++|++||.+++|||||+.|++||++.|+.+|+++++||+ +. -. T Consensus 377 ~~~Wl~~~iq~~l~~ll~~~~kIPyt~~G~~~l~a~v~~~l~~av~nG~I~~Gv~l~~~q~~~I~~~~g~~~~~~~~~~~ 456 (501) T protein:vir:78 377 DQIYLNAELQRAEFEAMLAYNSLPYNEDGYTALYRAGVDVIDAAVTSGIIRAGVTLTNSQLQQIDAAAGVAGAGQLVQMR 456 (501) T ss_pred hHHHHHHHHHHHHHHHHHhCCCcccCHHHHHHHHHHHHHHHHHHHhCceeecCCCCCCccceeeccccCccccccceecc Confidence 9999999999999999999999999999999999999999999998751 11 11 Q ss_pred ceeEecCcccCcHHHHHhhcCCCeEEEEEEcccEEEEEEEEEEeC Q lcl|Aclame:pro 382 EYEVDVPEWDDDDVDRVNRNWGGIDLDARLAQRAHTFSLGLNVSV 426 (426) Q Consensus 382 ~y~~~~p~~~~~~~dra~R~~~~i~~~~~laGAIh~v~I~g~v~v 426 (426) +|.+..+.++.++++|++|+.|+++|.|+++||||+|+| +.+.| T Consensus 457 Gyy~~~~~~~~~~~~R~~R~~p~~~~~y~~~gaIh~v~i-~s~~v 500 (501) T protein:vir:78 457 GWYFLIGDPANPGQARQNRTTPTCTLWYSDGGSIQELTI-GSNAV 500 (501) T ss_pred ceEEeeccccCChhhhhhcccCcEEEEEEeCCceeEEEe-eeeec Confidence 477776665555568999999999999999999999999 88888 No 8 >protein:vir:94073 Length: 494 # NCBI annotation: hypothetical protein # Family: family:all:396 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453619;genbank:gi:84662655;genbank:GeneID:5142583 Probab=100.00 E-value=2.3e-73 Score=418.85 Aligned_cols=400 Identities=13% Similarity=0.040 Sum_probs=272.0 Q ss_pred 
CC----CceEEEEEeeccccccccCccceEEEecccccccccchhhhheeecHHHHHhccCCCCHHHHHHHHHHh----c Q lcl|Aclame:pro 1 MP----KQIVEIELTAEIADRPQETFTDAAIVGTAEEEPPDAEFGEVNQYSTSTSVGDDYGEDSDVYTASEAIEE----M 72 (426) Q Consensus 1 mp----~~iVnV~isl~t~a~~~~~Fg~~Lilg~~~~~~~~~~~~~~~~Yts~~~V~~Dfg~~sp~YkAA~~~f~----Q 72 (426) || ++||||+++|.++++++|+||.|||++.|.+ |. +|+++|+|+++|++|||.+|||||||++||+ | T Consensus 1 m~~ip~s~iV~V~~~v~~~~~~~~~f~~~l~~~~~~~--~~---~r~~~y~s~~~V~~~FG~~S~ey~aA~~yFs~~~~q 75 (494) T protein:vir:94 1 MPNIPISQIVSINPQVVSAGGTQGTLDGLLLTQATGF--PV---TQPQVYFSAADVGTAFGLTSDEYNAALVYFAGILGG 75 (494) T ss_pred CCCCCcccEEEeeeeccccCCcccccceeEeecCccC--Cc---cceeeecCHHHHHHhcCCChHHHHHHHHHhhhccCC Confidence 88 7999999999999999999999999998865 22 3566699999999999999999999999999 9 Q ss_pred CCceeeeeecccccc----ccccc--------------cccceecc------ceeecccccccchhhhhhhccccc-cc- Q lcl|Aclame:pro 73 GAEQWRVMVLEATEV----TEEEL--------------SDGDTIDK------VPILGNHEVESPDGDIEFTTDDDP-DV- 126 (426) Q Consensus 73 ~~~~~~~~v~~~t~v----~~~~~--------------~~~~tv~~------~~~s~~~~~~~ta~~i~~~~~~~~-~~- 126 (426) .|+|.+.++.+..+. ..-+. +...++++ .+.+......+.+..+...+.+.- .+ T Consensus 76 ~p~P~~l~igR~~~~a~~~~l~g~~~~~tl~~~~~~~g~l~iti~g~~~~~~i~lS~~ts~~~vA~~i~~ai~~a~~~v~ 155 (494) T protein:vir:94 76 GQQPASLTIGRYASAATSAAVFGAPLTLSLAQLQTLSGTLIVTTDTQRTSAAINLSGATSFANAASLMTSGFTTPNFAIT 155 (494) T ss_pred CccccEEEEEeecCccccceeeccchhhhHHhhhhcceEEEEEEcceEEEeeecccccCChhhHHHHHhhhhccccceEE Confidence 988766554432211 01011 11223333 222333333455555544443211 00 Q ss_pred c-cceeee-eeccccceeeechhheeeecccccchhhhhhhc------------------cccceeecccccch------ Q lcl|Aclame:pro 127 E-DFDAEI-VINSATGDVATSEDSIELTYFHADWSQLDEFPS------------------DVNNFAVADRRFDL------ 180 (426) Q Consensus 127 t-~~~~~~-~~~~~~g~~t~~~~~~~~~~~~~d~~~~~~~~s------------------~~~~~~la~~~~~~------ 180 (426) . +....+ +..+.+|..+ ...+.+.+.....++.+ .++. 
.......|| T Consensus 156 ~d~~~~~f~v~s~ttG~~s------~is~~t~~~a~~l~lt~~~~a~v~~~g~~aet~~~a~~a-~~~~~~~Wy~f~~~~ 228 (494) T protein:vir:94 156 YDAQRRRFVLSTTATGTTA------SVSAVTGTLADGVGLSTASGAYVEGSGLAADTAASALDR-LAASSSTWAIFTTAW 228 (494) T ss_pred EcccCcEEEEEEccCCcee------EEEEeccchhhhhhhhccccceEeecCcccccHHHHHHH-HHhccCceEEEEEec Confidence 0 111111 1122222211 11111222111111110 0111 111223454 Q ss_pred -hhhHhHhhhhhhhhhcce--EEEEEecccccccchhhHHHHHHHhhccCcceEEEEecCCCccchhHHHHHHHhhhccc Q lcl|Aclame:pro 181 -KGVGVLDETHSWASDEDM--GMIANGVNVDDYDSVDEAMDVAHEVAGYVPSGDLMMIVDASDDDLAAYQLGKFAVSEPW 257 (426) Q Consensus 181 -~~~~~~~~~~~wa~~~~k--l~~~~~~d~~~~~~~~~~~~~a~~~a~~~~rt~~~~~~~~~~~~~~aa~~g~~~~~~p~ 257 (426) ....+...++.|++++.| +|+....++..++...++...+++++.+|.|++++||+.. ++++++|.++..++ T Consensus 229 ~~~~~~ilalA~wiea~~~~~~~~~~~~d~~~~~~~~~~~i~~~l~~~~y~~t~~~y~~~~----~~aa~~g~~aa~~~- 303 (494) T protein:vir:94 229 AASLSDRTALAQWTSDQVFRRIYAAWDQDAAGLSVNNVSSFGNIVKTTPFSNTIPVYGLLA----NAMIVLAWGASTNL- 303 (494) T ss_pred CCCHHHHHHHHHHHhhcCccEEEEEecCCcceeecccchhHHHHHHhhcCCceEEEcCCCC----hHHHHHHHHHhccc- Confidence 112345568889999988 6666777777777777777788899999999999998654 67788888877663 Q ss_pred ccceeecccccceeeccc-cccccc-cccchhHHHHhhcCc-cEEEEEcCC---EEEeeceeecCcccCcceeehhhhHH Q lcl|Aclame:pro 258 YNPLWNELPAGETVSKNV-GDPEEQ-GTFEGGDEAEGEGPV-NVLIDVSDA---NRVSNAVTTAGADSDTSFFDIRRTKV 331 (426) Q Consensus 258 ~~~~~~~~~~~~~~~~~k-~~~gv~-~~~~~~~~~~~~~~~-N~~~~~~g~---~~~~~~~t~~G~~~sg~~iD~i~g~d 331 (426) +..+|..+|+|| ++||+. +.+..+|..+++.++ |+|..|.+. 
..++++++++| .+.|||.+++++ T Consensus 304 ------~~~~g~~T~~~k~q~~gi~~~~l~~t~a~al~~~~~N~y~~~~~~~~~~~~~~gg~~sG---~~~~id~~~~~~ 374 (494) T protein:vir:94 304 ------QIAEGRTTLALRSPVSSAGVRVDNLANANALLSNGYTYLGKYASATNTYTVTYNGAIGG---QFLWADTALGWI 374 (494) T ss_pred ------cccCcceeEEeeccCCCCCCccCCHHHHHHHHhcCCeEEEEecccCceEEEecCceecc---ccceeeeeccHH Confidence 445588999999 689997 568889998988766 999998653 34455666654 234677777888 Q ss_pred HHHHHHHHHHHHHHhcCCCCcccHHHHHHHHHHHHHHHHHhhcCCC--------------------ccccceeE----ec Q lcl|Aclame:pro 332 YTAEMLELDLESLQVSDDDVPFTEDGQAMIEDAIKGTMSGLTGSVG--------------------QPLAEYEV----DV 387 (426) Q Consensus 332 wl~~~iq~~l~~ll~~~~KIp~td~Gi~~i~~~v~~~l~~~v~~~g--------------------~~~~~y~~----~~ 387 (426) |||++||++|++||.+++||||||.|++||++.|+.+|+++++||+ ....+... .+ T Consensus 375 WL~~~iq~~l~~ll~~~~KIPytd~G~~~l~a~i~~~l~~av~nG~I~~Gv~~~~~q~~~i~~~~G~~~~~~~~~kGyy~ 454 (494) T protein:vir:94 375 ALRRNLQQALFETLLAYRSLPYNADGYNALYQGAQDVVSQFVAAGVIRAGVALSASQRAQIDQAAGVPISGDVVDKGWYL 454 (494) T ss_pred HHHHHHHHHHHHHHHhCCCcccChhhHHHHHHHHHHHHHHHHhCceeecccccCcchhhhhhhhhcCccccceeccceee Confidence 9999999999999999999999999999999999999999998752 11222111 11 Q ss_pred Cc-ccCcHHHHHhhcCCCeEEEEEEcccEEEEEEEEEEeC Q lcl|Aclame:pro 388 PE-WDDDDVDRVNRNWGGIDLDARLAQRAHTFSLGLNVSV 426 (426) Q Consensus 388 p~-~~~~~~dra~R~~~~i~~~~~laGAIh~v~I~g~v~v 426 (426) +. 
...++++|++|.+|+++|.|+++||||.|+|++++-+ T Consensus 455 ~~~~~~s~~~ra~R~~~~~~~~y~~~GAIh~v~i~~~~v~ 494 (494) T protein:vir:94 455 QVIDPITTTVRTDRGSPTVNFWYCDGGSIQRVVVSATTVI 494 (494) T ss_pred eccCCCChhhhhccccCCceEEEEecCcEEEEEEeeEEeC Confidence 22 2344679999999999999999999999999999988 No 9 >protein:vir:96104 Length: 504 # NCBI annotation: hypothetical protein ORF029 # Family: family:all:396 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294446;genbank:gi:149408343;genbank:GeneID:5237223 Probab=100.00 E-value=3.1e-72 Score=412.64 Aligned_cols=405 Identities=11% Similarity=0.049 Sum_probs=273.4 Q ss_pred CC-CceEEEEEeeccccccccCccceEEEecccccccccchhhhheeecHHHHHhccCCCCHHHHHHHHHHhcCC----c Q lcl|Aclame:pro 1 MP-KQIVEIELTAEIADRPQETFTDAAIVGTAEEEPPDAEFGEVNQYSTSTSVGDDYGEDSDVYTASEAIEEMGA----E 75 (426) Q Consensus 1 mp-~~iVnV~isl~t~a~~~~~Fg~~Lilg~~~~~~~~~~~~~~~~Yts~~~V~~Dfg~~sp~YkAA~~~f~Q~~----~ 75 (426) .| .+||||+++|.++++..++|+.+|||+.|.++|+ +|++.|+|+++|++|||++|||||||++||+|.+ + T Consensus 2 ip~s~iV~V~~~v~~~~~~~~~~~~~l~l~~~~~~~~----~r~~~y~s~~~V~~~FG~~S~ey~aA~~yF~~~~~~~~~ 77 (504) T protein:vir:96 2 ISQSRYIRIISGVGAGAPVAGRKLILRVMTTNNVIPP----GIVIEFDNANAVLSYFGAQSEEYQRAAAYFKFISKSVNS 77 (504) T ss_pred CCccceeEeeecccccccccccccceeEeecccCCCc----cceEEecCHHHHHHhcCCChHHHHHHHHHhhcCCCCCcc Confidence 56 8999999999999999999999999999999874 3666699999999999999999999999999965 5 Q ss_pred eeeeeecc----cccccccc------------c---cccceecc-------ceeecccccccchhhhhhhcccccccccc Q lcl|Aclame:pro 76 QWRVMVLE----ATEVTEEE------------L---SDGDTIDK-------VPILGNHEVESPDGDIEFTTDDDPDVEDF 129 (426) Q Consensus 76 ~~~~~v~~----~t~v~~~~------------~---~~~~tv~~-------~~~s~~~~~~~ta~~i~~~~~~~~~~t~~ 129 (426) |.+.++.+ ++....-+ . 
+-..+|.+ .+.|......+.+..+...+.+..+...+ T Consensus 78 P~~l~igR~~~~a~~~~l~g~~~~~~~~~~~~i~~G~lsitv~G~~~~~~~i~~S~~ts~~~vA~~i~~al~~~~~~~~~ 157 (504) T protein:vir:96 78 PSSISFARWVNTAIAPMVVGDNLPKTIADFAGFSAGVLTIMVGAAEKNITAIDTSAATSMDNVASIIQTEIRKNTDPQLA 157 (504) T ss_pred ccEEEEEeecCcCccceEEechhHHHHHHHhhhhceEEEEEEcceeeeecccccccccchHHHHHHHHhhhhcccccccc Confidence 54433322 22111111 0 11123333 33333333345566665555554332211 Q ss_pred eee---------e-eeccccceeeechhheeeecccccchhhhhhh----------------ccccceeecccccch--- Q lcl|Aclame:pro 130 DAE---------I-VINSATGDVATSEDSIELTYFHADWSQLDEFP----------------SDVNNFAVADRRFDL--- 180 (426) Q Consensus 130 ~~~---------~-~~~~~~g~~t~~~~~~~~~~~~~d~~~~~~~~----------------s~~~~~~la~~~~~~--- 180 (426) ... + ...+.+|..+........ ..+......++ ..++. .......|| T Consensus 158 ~~tv~~d~~~~~f~its~~tg~~~~~~~~~a~---~~~~~~~lgl~~~~~~~v~g~~aet~~~al~a-l~~~~~~Wy~f~ 233 (504) T protein:vir:96 158 QATVTWNPNTNQFTLVGATIGTGVLAVAKSAD---PQDMSTALGWSTSNVVNVAGQAADLPDAAVAK-STNVSNNFGSFL 233 (504) T ss_pred cceEEEeccCCeEEEEeeccccceeEEEeecc---ccchhhhhhcccccceEEeecccccHHHHHHH-HHhhcCCeEEEE Confidence 111 1 111122221111100000 00111110000 00111 011112344 Q ss_pred -hh----hHhHhhhhhhhhhcceEEEEEecccccccchhhHHHHHHHhhccCcceEEEEecCCCccchhHHHHHHHhhhc Q lcl|Aclame:pro 181 -KG----VGVLDETHSWASDEDMGMIANGVNVDDYDSVDEAMDVAHEVAGYVPSGDLMMIVDASDDDLAAYQLGKFAVSE 255 (426) Q Consensus 181 -~~----~~~~~~~~~wa~~~~kl~~~~~~d~~~~~~~~~~~~~a~~~a~~~~rt~~~~~~~~~~~~~~aa~~g~~~~~~ 255 (426) +. ..++..++.|++++++.|+....+... 
+.........+.+.+++++++.....+++++..++.+++.+ T Consensus 234 ~a~~~~~dd~ilalA~w~ea~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~as~~ 308 (504) T protein:vir:96 234 FAGATLDNDQIKAVSAWNAAQNNQFIYTVATSLA-----NLGALFDLVKGNSGTALNVLSATASNDFVEQCPSEILAATN 308 (504) T ss_pred EEeccCCHHHHHHHHHHHhhcCceEEEEEeeccc-----chhhHHHhhhhcceeEEEEeecCccchhHHHHHHHHHHhcC Confidence 11 122446788999999988765553221 12222334445556889999988888899999999999887 Q ss_pred ccccceeecccccceeeccccccccc-cccchhHHHHhhcCc-cEEEEEcCC---EEEeeceeecCcccCcceeehhhhH Q lcl|Aclame:pro 256 PWYNPLWNELPAGETVSKNVGDPEEQ-GTFEGGDEAEGEGPV-NVLIDVSDA---NRVSNAVTTAGADSDTSFFDIRRTK 330 (426) Q Consensus 256 p~~~~~~~~~~~~~~~~~~k~~~gv~-~~~~~~~~~~~~~~~-N~~~~~~g~---~~~~~~~t~~G~~~sg~~iD~i~g~ 330 (426) | +..++..+|+||++|||. +.+..+|..+|+.++ |.|..+.+. ..+++.|++.||+-+--|||++|++ T Consensus 309 f-------~~~ng~~T~~fk~l~GVta~~lt~t~~~aL~~~~~N~y~~~a~~~~~~~~~~~G~~~gG~~~~~wiDv~~~~ 381 (504) T protein:vir:96 309 Y-------DEPGASQNYMYYQFPGRNITVSDDTAANTVDKSRGNYIGVTQANGQQLAFYQRGILCGGPTDAVDMNVYANE 381 (504) T ss_pred c-------CcccccccccccccCCcCcccCCHHHHHHHHhcCCeEEEEeecccceeeEEecCeeeCCccccchhhhhhhH Confidence 5 445699999999999997 779999999998777 877776543 4555555554433222279999999 Q ss_pred HHHHHHHHHHHHHHHhcCCCCcccHHHHHHHHHHHHHHHHHhhcCCC--------------------ccc-------cce Q lcl|Aclame:pro 331 VYTAEMLELDLESLQVSDDDVPFTEDGQAMIEDAIKGTMSGLTGSVG--------------------QPL-------AEY 383 (426) Q Consensus 331 dwl~~~iq~~l~~ll~~~~KIp~td~Gi~~i~~~v~~~l~~~v~~~g--------------------~~~-------~~y 383 (426) +|||++||++|++||.+++|||||+.|++||++.|+.+|++++++|+ +.+ .+| T Consensus 382 ~WL~~~lq~~l~~l~~~~~kIPyt~~Gi~~l~a~i~~vl~~av~~G~I~~Gv~~~~~q~~~I~~~~~~d~~~~~~~~~GY 461 (504) T protein:vir:96 382 IWLKSAIAQALLDLFLNVNAVPASMVGEAMTLAVLQPVLDKATSNGTFTYGKDISAVQQQYITQITGDRRAWRQVQTLGY 461 (504) T ss_pred HHHHHHHHHHHHHHHhcCCCcccCHhhHHHHHHHHHHHHHHHHhcceeccCccCCccchheecccccccccccceeccce Confidence 99999999999999999999999999999999999999999998751 111 248 Q ss_pred 
eEecCc-ccCcHHHHHhhcCCCeEEEEEEcccEEEEEEEEEEe Q lcl|Aclame:pro 384 EVDVPE-WDDDDVDRVNRNWGGIDLDARLAQRAHTFSLGLNVS 425 (426) Q Consensus 384 ~~~~p~-~~~~~~dra~R~~~~i~~~~~laGAIh~v~I~g~v~ 425 (426) .+.+|. ++++++||++|+.|+++|.|+++||||+|+|..++- T Consensus 462 yv~~~~~s~~s~~~r~~R~~~~~~~~y~~~gaI~~v~~~~~~v 504 (504) T protein:vir:96 462 WINITFSSYTNSNTGLTEWKANYTLIYSKGDAIRFVEGSDVMI 504 (504) T ss_pred EEEecChhccChhHhhhccccceEEEEEECCeEEEEEeccccC Confidence 888765 556789999999999999999999999999999887 No 10 >protein:vir:99586 Length: 507 # NCBI annotation: hypothetical protein # Family: family:all:396 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039795;genbank:gi:126011045;genbank:GeneID:4818281 Probab=100.00 E-value=4.7e-71 Score=406.14 Aligned_cols=406 Identities=12% Similarity=0.040 Sum_probs=270.0 Q ss_pred CC-CceEEEEEeeccccccccCccceEEEecccccccccchhhhheeecHHHHHhccCCCCHHHHHHHHHHhcCCc---- Q lcl|Aclame:pro 1 MP-KQIVEIELTAEIADRPQETFTDAAIVGTAEEEPPDAEFGEVNQYSTSTSVGDDYGEDSDVYTASEAIEEMGAE---- 75 (426) Q Consensus 1 mp-~~iVnV~isl~t~a~~~~~Fg~~Lilg~~~~~~~~~~~~~~~~Yts~~~V~~Dfg~~sp~YkAA~~~f~Q~~~---- 75 (426) .| .+||||+++|.++++..++|+.+|||+.|.++|+ +|+++|+|+++|++|||.+|||||||++||+|.|+ T Consensus 2 ip~s~iVnV~~~v~~~a~~~~~~~~~lilt~~~~~~~----~r~~~y~s~~~V~~~FG~~S~ey~aA~~yFsq~p~~~~~ 77 (507) T protein:vir:99 2 ISQSRYVRIVSGVGAGAPVAQRRLIMRVMTTNAVLPP----GVVFESSSADAVGAYFGMASEEYKRAKAYMSFISKSINS 77 (507) T ss_pred CCccceeEEeeeccccCcccccccceeeeccccCCCc----cceEeecCHHHHHHhcCCChHHHHHHHHHhccCCCCCcc Confidence 56 8999999999999999999999999999988764 36677999999999999999999999999999873 Q ss_pred eeeeeecccc----ccccc-------------ccccc--ceecc-------ceeecccccccchhhhhhhcccccccccc Q lcl|Aclame:pro 76 QWRVMVLEAT----EVTEE-------------ELSDG--DTIDK-------VPILGNHEVESPDGDIEFTTDDDPDVEDF 129 (426) Q Consensus 76 ~~~~~v~~~t----~v~~~-------------~~~~~--~tv~~-------~~~s~~~~~~~ta~~i~~~~~~~~~~t~~ 129 (426) |.+.++.+.. ....- ..+.. .+|.+ .+++......+.+..|...+.+..+.... 
T Consensus 78 P~~L~igR~~~~~~~a~l~g~~~~~~l~~~~~~~~G~lti~v~G~~~t~~~i~lS~~ts~~~vAs~i~~~l~a~~~~~~~ 157 (507) T protein:vir:99 78 PSYISFARWVNAAIASMIVGDSLVKNLPALKAVATPTLSLSIGGTVVPIAGIDLTAALTLTDVAATLQTKIRASANAELA 157 (507) T ss_pred cceEEEEeecCccccceeecchhhhhHHHHhhhcceeEEEEEcCceeEeccccccccCCHHHHHHHHHHhhhcccccccc Confidence 4443332221 11000 01111 23333 33334444456666666666654333221 Q ss_pred eeeeeec----------cccceeeechhheeeecccccchhhhhhhc------------cc-cc--eeecccccchh--- Q lcl|Aclame:pro 130 DAEIVIN----------SATGDVATSEDSIELTYFHADWSQLDEFPS------------DV-NN--FAVADRRFDLK--- 181 (426) Q Consensus 130 ~~~~~~~----------~~~g~~t~~~~~~~~~~~~~d~~~~~~~~s------------~~-~~--~~la~~~~~~~--- 181 (426) ......+ +.+|..++.. .........+.+.+...+. ++ ++ ........||. T Consensus 158 ~~tv~~d~~~~~F~v~s~~tG~~s~i~-~at~~~~gt~~s~l~~~~~~~a~~~~g~~aet~~~a~~a~~~~~~nW~~~~~ 236 (507) T protein:vir:99 158 TATVTFNTTTNQFVLNGTTTGALAPTI-TAVRTDPATDISSLLGWTNTGTVFVKGQAAETPDTSISKSAAISTNFGSFIY 236 (507) T ss_pred ceEEEEecCCceEEEEeeeccccceeE-EEEcCCchhhHHHHhccccccceEeecccccCHHHHHHHHHhhcCCeEEEEE Confidence 1111111 1111110000 0000000111111111110 00 00 01112334541 Q ss_pred ------hhHhHhhhhhhhhhcceEEEEEecccccccchhhHHHHHHHhhccCcceEEEEecCCCccchhHHHHHHHhhhc Q lcl|Aclame:pro 182 ------GVGVLDETHSWASDEDMGMIANGVNVDDYDSVDEAMDVAHEVAGYVPSGDLMMIVDASDDDLAAYQLGKFAVSE 255 (426) Q Consensus 182 ------~~~~~~~~~~wa~~~~kl~~~~~~d~~~~~~~~~~~~~a~~~a~~~~rt~~~~~~~~~~~~~~aa~~g~~~~~~ 255 (426) ...++.+++.|.+++.+.|+....+.+ ..........+..+.+.++.. 
......++++++++|+|++.+ T Consensus 237 a~~~~~td~~~lalA~wiea~~~~f~~~~~~~~---a~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~aa~~g~~as~n 311 (507) T protein:vir:99 237 TSTPALTNDQITAVASWNASQNNMYMYSVPTTI---ANIGTLYAAVKGFSGCALNIT--SDSLPVDYIEQSPCEILAATD 311 (507) T ss_pred EeccccChHHHHHHHHHHhhcCcEEEEEEecCc---hhhhhhhhhhhhcceeEEEee--cccccchhHHHHHHHHHHhhc Confidence 112345688999999999886655432 222333444444444434332 234567889999999999987 Q ss_pred ccccceeecccccceeeccccccccc-cccchhHHHHhhcCc-cEEEEEcCC---EEEeeceeecCcccCcceeeh--hh Q lcl|Aclame:pro 256 PWYNPLWNELPAGETVSKNVGDPEEQ-GTFEGGDEAEGEGPV-NVLIDVSDA---NRVSNAVTTAGADSDTSFFDI--RR 328 (426) Q Consensus 256 p~~~~~~~~~~~~~~~~~~k~~~gv~-~~~~~~~~~~~~~~~-N~~~~~~g~---~~~~~~~t~~G~~~sg~~iD~--i~ 328 (426) + +..++..+|+||++|||. +.+..+|..+++.+| |+|++|.+. ..+++.|++.||.. +|||+ ++ T Consensus 312 f-------~~~ng~~T~~fk~l~GV~a~~lt~t~a~al~~~n~N~y~~~a~~~~~~~~~~~G~~~gG~~--~fid~d~~~ 382 (507) T protein:vir:99 312 Y-------TRVNATQNYMYYQFPSRNITVSDDTTANLVDANRGNYIGQTQSAGQSLAFYQRGILCGGPN--DAVDMNIYA 382 (507) T ss_pred c-------CcCccceeecccccCCcccccCCHHHHHHHHhcCCeEEEEeccccceeeEEecCeeeCCcc--cceeeeeec Confidence 4 456799999999999997 779999999998776 999888663 45666555555321 35555 55 Q ss_pred hHHHHHHHHHHHHHHHHhcCCCCcccHHHHHHHHHHHHHHHHHhhcCC----C-----------------ccc------c Q lcl|Aclame:pro 329 TKVYTAEMLELDLESLQVSDDDVPFTEDGQAMIEDAIKGTMSGLTGSV----G-----------------QPL------A 381 (426) Q Consensus 329 g~dwl~~~iq~~l~~ll~~~~KIp~td~Gi~~i~~~v~~~l~~~v~~~----g-----------------~~~------~ 381 (426) ++||||++||++|++||.+++|||||+.|++||++.|+.+|++++++| | +.. . 
T Consensus 383 ~~~WL~~~iq~~l~~l~~~~~kIPyt~~G~~~l~a~i~~~l~~av~nG~I~~Gvtl~~~q~~~in~~~~~~~~~~~~~~~ 462 (507) T protein:vir:99 383 NEIWLKSAISAQILSLFLNVPRVPANETGESMLLSVIQSVVNTAKNNGTISAGKNLNVIQQQYITQISGDANAWRQVANI 462 (507) T ss_pred chHHHHHHHHHHHHHHHhcCCCCccChhhHHHHHHHHHHHHHHHHhccccccCCcccccchheecccccccccccceecc Confidence 777999999999999999999999999999999999999999999875 1 111 1 Q ss_pred ceeEecCc-ccCcHHHHHhhcCCCeEEEEEEcccEEEEEEEEEEe Q lcl|Aclame:pro 382 EYEVDVPE-WDDDDVDRVNRNWGGIDLDARLAQRAHTFSLGLNVS 425 (426) Q Consensus 382 ~y~~~~p~-~~~~~~dra~R~~~~i~~~~~laGAIh~v~I~g~v~ 425 (426) +|.+..|. +.++++||++|+.|+++|.|+++|+||+|+|..++- T Consensus 463 Gyy~~~~~~s~~~~~~r~~r~~~~~~~~y~~~gaI~~v~~~~~~v 507 (507) T protein:vir:99 463 GYWLNITFSSYTNPNTQLTEWKASYQLIYSKDDAIRFVEGTDTLI 507 (507) T ss_pred ceEEEeCChHhcChhhhhccccceEEEEEEeCCeEEEEEeeeecC Confidence 37777654 567889999999999999999999999999999887 No 11 >protein:vir:80052 Length: 331 # NCBI annotation: gp14 # Family: family:all:396 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468718;genbank:gi:157325298;genbank:GeneID:5601743 Probab=100.00 E-value=9.5e-71 Score=404.48 Aligned_cols=320 Identities=13% Similarity=0.163 Sum_probs=242.3 Q ss_pred CCCceEEEEEeecccc-ccccCccceEEEecccccccccchhhhheeecHHHHHhccCCCCHHHHHHHHHHhcCCceeee Q lcl|Aclame:pro 1 MPKQIVEIELTAEIAD-RPQETFTDAAIVGTAEEEPPDAEFGEVNQYSTSTSVGDDYGEDSDVYTASEAIEEMGAEQWRV 79 (426) Q Consensus 1 mp~~iVnV~isl~t~a-~~~~~Fg~~Lilg~~~~~~~~~~~~~~~~Yts~~~V~~Dfg~~sp~YkAA~~~f~Q~~~~~~~ 79 (426) |=.+||||+|+|...+ ..+.+||.++|+..+++. +.+.|+++++|+.|||+++|+||+|.++|+|++++... 
T Consensus 1 ~~~~iv~V~v~~~~~~~~~~~~~~~~~~~~~~t~~-------~~~~y~s~~~v~~d~~~~~~~Ykaa~~~f~Q~~~~~~i 73 (331) T protein:vir:80 1 MVETITDVRVHISVLYPSPRIGLGRPAIFVKGTAM-------GYKEYTTLEELKDTFADNTEVYAKAKAVFLQKDRPDTV 73 (331) T ss_pred CccceecceeeecccccccccccCcceeEEecccc-------ceEEEechhhhccCCCCCcHHHHHHHHHHhccCccceE Confidence 9999999999998543 345555555555544432 44559999999999999999999999999999887665 Q ss_pred eeccccccccccccccceeccceeecccccccchhhhhhhcccccccccceeeeeeccccceeeechhheeeecccccch Q lcl|Aclame:pro 80 MVLEATEVTEEELSDGDTIDKVPILGNHEVESPDGDIEFTTDDDPDVEDFDAEIVINSATGDVATSEDSIELTYFHADWS 159 (426) Q Consensus 80 ~v~~~t~v~~~~~~~~~tv~~~~~s~~~~~~~ta~~i~~~~~~~~~~t~~~~~~~~~~~~g~~t~~~~~~~~~~~~~d~~ 159 (426) .+....... +... .......+|+ T Consensus 74 ~v~~~~~~~---------------------------~~~a------------------------------~~a~~~~~w~ 96 (331) T protein:vir:80 74 AVITYEDTK---------------------------LLEA------------------------------AEAYFLKSWH 96 (331) T ss_pred EEeccchHH---------------------------HHHH------------------------------HHHhccCcee Confidence 442221100 0000 0011122354 Q ss_pred hhhhhhccccceeecccccchhhhHhHhhhhhhhhhcceEEEEEecccccccchhhHHHHHHHhhccCcceEEEEecCCC Q lcl|Aclame:pro 160 QLDEFPSDVNNFAVADRRFDLKGVGVLDETHSWASDEDMGMIANGVNVDDYDSVDEAMDVAHEVAGYVPSGDLMMIVDAS 239 (426) Q Consensus 160 ~~~~~~s~~~~~~la~~~~~~~~~~~~~~~~~wa~~~~kl~~~~~~d~~~~~~~~~~~~~a~~~a~~~~rt~~~~~~~~~ 239 (426) ++. ++++ ...++-.++.|++++.+.|+....+. . ....+...+.++++++|... 
T Consensus 97 ~~~----------~~~~-----~~~~~~a~a~~~~a~~~~f~~~~~~~-------~---~~~~~~~~~~~t~~~~~~~~- 150 (331) T protein:vir:80 97 FAL----------LAEF-----KAADALALSNLIEEQKFKFAVFQVTA-------V---ADITPLAKNTRTIAIVHSKT- 150 (331) T ss_pred EEE----------eecC-----CHHHHHHHHHHHhhCCcEEEEEecCc-------h---HHHHHhhccccEEEEEcCCc- Confidence 432 1111 12234457789999999987654321 1 12222334568999998755 Q ss_pred ccchhHHHHHHHhhhcccccceeecccccceeecccc-ccccc-cccchhHHHHhhcCc-cEEEEEcCCEEEeeceeecC Q lcl|Aclame:pro 240 DDDLAAYQLGKFAVSEPWYNPLWNELPAGETVSKNVG-DPEEQ-GTFEGGDEAEGEGPV-NVLIDVSDANRVSNAVTTAG 316 (426) Q Consensus 240 ~~~~~aa~~g~~~~~~p~~~~~~~~~~~~~~~~~~k~-~~gv~-~~~~~~~~~~~~~~~-N~~~~~~g~~~~~~~~t~~G 316 (426) .++++++++|++++.+| |..+|+||+ ++||. +.++.+|+.+++.++ |+|.++.|...+ .+| T Consensus 151 ~~~~~aa~~g~~~~~~~-----------g~~t~~fk~~l~GV~~~~lt~t~~~al~~~~~N~y~~~~~~~~~-----~~G 214 (331) T protein:vir:80 151 GEKLDAALIGNVASLPV-----------GSATWKGRHGLAGITSEELKVSEIDAIQKAGGMCYIEKAGIAQT-----SEG 214 (331) T ss_pred cchhHHHHHHHHHhcCc-----------cceeeeeecccCCCCCCCCCHHHHHHHHhcCceEEEEecCeeEE-----ecc Confidence 57899999999999988 667899997 69997 779999999998776 888888775433 344 Q ss_pred cccCcceeehhhhHHHHHHHHHHHHHHHHhcCCCCcccHHHHHHHHHHHHHHHHHhhcCCC------ccccceeEecCcc Q lcl|Aclame:pro 317 ADSDTSFFDIRRTKVYTAEMLELDLESLQVSDDDVPFTEDGQAMIEDAIKGTMSGLTGSVG------QPLAEYEVDVPEW 390 (426) Q Consensus 317 ~~~sg~~iD~i~g~dwl~~~iq~~l~~ll~~~~KIp~td~Gi~~i~~~v~~~l~~~v~~~g------~~~~~y~~~~p~~ 390 (426) ++++|+|||++||+|||+++||++|++||.+++|||||+.|++||++.|+.+|++++++|. 
+..++|.|..|.+ T Consensus 215 ~~~~G~~iD~~~~~dWl~~~lq~~l~~ll~~~~kiPy~~~G~~~l~a~i~~~~~~av~~G~I~~g~~~~~~~~~v~~~~~ 294 (331) T protein:vir:80 215 KTVSGEFIDSIHGDDWIKATIETRLQKLLTETDKLTFDARGIALLQSELTTVLNEGFANGIIDSNDETGEPNFSITALQR 294 (331) T ss_pred eEeCchhHHHHHHHHHHHHHHHHHHHHHHHhCCCCccChhhHHHHHHHHHHHHHHHHhCCceecCccCCCcceEEEeCch Confidence 4555689999999999999999999999999999999999999999999999999999862 3346899987765 Q ss_pred c-CcHHHHHhhcCCCeEEEEEEcccEEEEEEEEEEeC Q lcl|Aclame:pro 391 D-DDDVDRVNRNWGGIDLDARLAQRAHTFSLGLNVSV 426 (426) Q Consensus 391 ~-~~~~dra~R~~~~i~~~~~laGAIh~v~I~g~v~v 426 (426) + ++++||++|++||++|+|+|+||||.++|+|||+| T Consensus 295 ~~~s~~dr~~R~~~~~~~~~~~~gaI~~v~i~~~v~~ 331 (331) T protein:vir:80 295 SDLNDDDIAKRNYKGLSFRYKRSGAIHSVDVYGEVEV 331 (331) T ss_pred hcCCHHHHhccCCCCeEEEEEEcceEEEEEEEEEEeC Confidence 5 56899999999999999999999999999999999 No 12 >protein:vir:107720 Length: 515 # NCBI annotation: gp17 # Family: family:all:396 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024865;genbank:gi:48697507;genbank:GeneID:2948336 Probab=100.00 E-value=3.8e-64 Score=368.25 Aligned_cols=407 Identities=10% Similarity=-0.018 Sum_probs=263.9 Q ss_pred CC-CceEEEEEeecccc---ccccCccceEEEecccccccccchhhhheeecHHHHHhccCCCCHHHHHHHHHHh----c Q lcl|Aclame:pro 1 MP-KQIVEIELTAEIAD---RPQETFTDAAIVGTAEEEPPDAEFGEVNQYSTSTSVGDDYGEDSDVYTASEAIEE----M 72 (426) Q Consensus 1 mp-~~iVnV~isl~t~a---~~~~~Fg~~Lilg~~~~~~~~~~~~~~~~Yts~~~V~~Dfg~~sp~YkAA~~~f~----Q 72 (426) || ..+|+|+|++++.+ ++.|+||. 
||++.+..+|+ +|++.|+|+++|++|||.+|||||||++||+ | T Consensus 1 m~I~~~~~V~i~~~v~aa~~~~~~~f~~-li~t~~~~~p~----~r~~~y~s~~~V~~~FG~~S~ey~aA~~yFsg~~~q 75 (515) T protein:vir:10 1 MPISFDKYVAITSGVAAQQQIAARSFAI-RVYTPNPMVSV----DRLITATSAADVGAYFGTASEEYKRAVKNFGFISKK 75 (515) T ss_pred CCCCceeEEEeecccccCCcccccccee-eeeecccCCCc----cceeeecCHHHHHHhcCCChHHHHHHHHHhhhccCC Confidence 99 99999999987744 45689995 66666666653 4677799999999999999999999999999 9 Q ss_pred CCceeeeeecccc----cccc-----cc-----------ccccceecccee--------ecccccccchhhhhhhccccc Q lcl|Aclame:pro 73 GAEQWRVMVLEAT----EVTE-----EE-----------LSDGDTIDKVPI--------LGNHEVESPDGDIEFTTDDDP 124 (426) Q Consensus 73 ~~~~~~~~v~~~t----~v~~-----~~-----------~~~~~tv~~~~~--------s~~~~~~~ta~~i~~~~~~~~ 124 (426) .|+|.+.++-+.. .... .+ .+...+|+++.. |......+.+..|...+.+.. T Consensus 76 ~p~P~~L~igR~~~~a~~~~l~g~~~~~~~l~~~~~is~G~ltitidG~~~~t~s~i~~S~ats~~~vAs~i~tal~~~~ 155 (515) T protein:vir:10 76 TRRPTSIQFARWQREAGPVAIYGGAKKAAALATLQAVTAGAISFLFGGATTVTVSGISFSAATSLADVASELQTALRANA 155 (515) T ss_pred cccccEEEEEeccCcccceEEEeccchhhhHHhhhcccceeEEEEEcceEEEEeeccccccccCHHHHHHHHHhhhcccc Confidence 9887665543321 1110 00 112234444332 333334567777777777655 Q ss_pred ccccceeeeeeccccceeeec-----hh-heeeeccc-----ccchhhhhhhc-------------cc-cc--eeecccc Q lcl|Aclame:pro 125 DVEDFDAEIVINSATGDVATS-----ED-SIELTYFH-----ADWSQLDEFPS-------------DV-NN--FAVADRR 177 (426) Q Consensus 125 ~~t~~~~~~~~~~~~g~~t~~-----~~-~~~~~~~~-----~d~~~~~~~~s-------------~~-~~--~~la~~~ 177 (426) +.......+.-++..+.+... .. +....+.. .+...+.++.+ ++ ++ ....... 
T Consensus 156 ~~~~~~~tv~~d~~~~~F~v~s~~tG~~~~is~~~~t~~~~~t~~a~~lglt~~~~av~~~g~aaet~~~a~~a~~~~s~ 235 (515) T protein:vir:10 156 DANLATCTVSYDPVGARFNFAGSPSDDTVQESISIVPQSNPAIDVAQLLGWNSAQGASYIAASPVVSPVDTLIASVAGNN 235 (515) T ss_pred ccccceeEEEEecCCCeEEEEEeecCCceeEEEEEecCCCchhhHHHHhccccccceEEecccccccHHHHHHHHHhccC Confidence 544433333332222211100 00 00111111 01111111110 00 00 0111223 Q ss_pred cchh-----------hhHhHhhhhhhhhhcceEEEEEecccccccchhhHHHHHHHhhccCcceEEEEecCCCccchhHH Q lcl|Aclame:pro 178 FDLK-----------GVGVLDETHSWASDEDMGMIANGVNVDDYDSVDEAMDVAHEVAGYVPSGDLMMIVDASDDDLAAY 246 (426) Q Consensus 178 ~~~~-----------~~~~~~~~~~wa~~~~kl~~~~~~d~~~~~~~~~~~~~a~~~a~~~~rt~~~~~~~~~~~~~~aa 246 (426) .||. ...+...++.|.+++.+.|+................ ......+.++..++.. ..++++++ T Consensus 236 nWy~f~~a~~~~~~~~~a~~~a~a~~~e~~~~~~~~~~~~~~~~~~~~~a~---~~~~~~~~~~~~~~~~--~~~~~~a~ 310 (515) T protein:vir:10 236 NFGSILFTKNGGTGITLSDAEAIALQNQSYNVAYKFQVGVDDTTYSSWQAA---LAAIGGVNMIYSPVAL--AAEYHDMQ 310 (515) T ss_pred CeEEEEEeecCccccchhHHHHHHHHHhhcCceEEEEeccCccceechhhh---hhhhhhcCceEEEEec--cCcchHHH Confidence 4541 112334467899999988887665443322222221 2222334455555543 34678999 Q ss_pred HHHHHhhhcccccceeecccccceeeccccccccc-cccchhHHHHhhcCc-cEEEEEcC---CEEEeeceeecCcccCc Q lcl|Aclame:pro 247 QLGKFAVSEPWYNPLWNELPAGETVSKNVGDPEEQ-GTFEGGDEAEGEGPV-NVLIDVSD---ANRVSNAVTTAGADSDT 321 (426) Q Consensus 247 ~~g~~~~~~p~~~~~~~~~~~~~~~~~~k~~~gv~-~~~~~~~~~~~~~~~-N~~~~~~g---~~~~~~~~t~~G~~~sg 321 (426) .+|++++.++ +..++..+|+||++|||+ +.+...|..+++.++ |+|.+|.+ +..+++.|++.||+.++ T Consensus 311 ~~g~~asvnf-------~~~ng~iT~kfKq~~Gita~~lt~t~a~al~~~~~N~Y~~~~~~~~~~~~~~~G~~~gG~~~~ 383 (515) T protein:vir:10 311 DGIIEAATDF-------TQQGGATGYMYVQFNNQTPAVNDDTLSGILDDLNINYYGQTQVNGTNLSFYQDGVMMGGPTDP 383 (515) T ss_pred HHHHHHhcCC-------CccchhheeccccCCCCccccCCHHHHHHHHhcCCeEEEEEeccCceEEEEeCCeeeCCccch Confidence 9999998875 445688999999999997 779999999998666 99988854 57788888888888788 Q ss_pred 
ceeehhhhHHHHHHHHHHHHHHHHhcCCCCcccHHHHHHHHHHHH-HHHHHhhcCC--------------------Ccc- Q lcl|Aclame:pro 322 SFFDIRRTKVYTAEMLELDLESLQVSDDDVPFTEDGQAMIEDAIK-GTMSGLTGSV--------------------GQP- 379 (426) Q Consensus 322 ~~iD~i~g~dwl~~~iq~~l~~ll~~~~KIp~td~Gi~~i~~~v~-~~l~~~v~~~--------------------g~~- 379 (426) +|||++||+|||+++||++|++||.+++||||||.|++||++.|+ .+|+++++|| |+. T Consensus 384 ~WiD~~~g~~WL~~~iq~~l~~L~~s~~KIPytd~G~a~i~a~v~q~vl~~av~nG~I~~Gv~ls~~Q~~~i~~~~g~d~ 463 (515) T protein:vir:10 384 RDSNVYANEQWLKSYAGASFMSLQLAQGKIPANIEGRGLLLGKMTKDIIPAAKLNGTFSIGKTLTVDQQLFVTELTGDDT 463 (515) T ss_pred hHHHHHhhHHHHHHHHHHHHHHHHhcCCCCccChhhHHHHHHHHHHHHHHHHHhCCeeecCcccchhHHHHHHhhhcCcc Confidence 899999999999999999999999999999999999999999886 5999999875 111 Q ss_pred c------cceeEecCc-ccCcHHHHHhhcCCCeEEEEEEcccEEEEEEEEEEe Q lcl|Aclame:pro 380 L------AEYEVDVPE-WDDDDVDRVNRNWGGIDLDARLAQRAHTFSLGLNVS 425 (426) Q Consensus 380 ~------~~y~~~~p~-~~~~~~dra~R~~~~i~~~~~laGAIh~v~I~g~v~ 425 (426) . -+|-+.+|. +.+...+|..+..+-+-|.++ .|.||++++.-++- T Consensus 464 ~~~~~~~~Gyy~~~~~~~~~~~~~r~~~~~~~~~~y~~-g~~i~~i~~~~~~v 515 (515) T protein:vir:10 464 AWQKVQNLGYWYDVQISSFVDTGGTTKYQAVYSLVYSK-DDLIRKVVGTHTLI 515 (515) T ss_pred cccchhhcceeEecCcCCCCCcccccccCceeEEEEEc-CceEEEEEeeeecC Confidence 1 136666444 444444555555554444444 99999999888776 No 13 >protein:vir:6079 Length: 396 # NCBI annotation: gpFI # Family: family:all:115 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878220;genbank:gi:33438919;genbank:GeneID:1457754 Probab=99.10 E-value=2.1e-09 Score=68.16 Aligned_cols=358 Identities=15% Similarity=0.023 Sum_probs=190.8 Q ss_pred CCC--ceEE-EEEeeccccccccCccceEEEeccccccccc-chhhhheeecHHHHHhccCCCCHHHHHHHHHHhcCCce Q lcl|Aclame:pro 1 MPK--QIVE-IELTAEIADRPQETFTDAAIVGTAEEEPPDA-EFGEVNQYSTSTSVGDDYGEDSDVYTASEAIEEMGAEQ 76 (426) Q Consensus 1 mp~--~iVn-V~isl~t~a~~~~~Fg~~Lilg~~~~~~~~~-~~~~~~~Yts~~~V~~Dfg~~sp~YkAA~~~f~Q~~~~ 76 (426) |+. .=|. ..++-.+.++.....+++.|+|.....+... .++...+.++..+-...||.++..+.+...+|.++-.. 
T Consensus 1 m~~~~~Gv~v~e~~~~~~~v~~~~tav~~fvGta~~~~~~~~p~~~p~~v~s~~~~~~~~g~~~tl~~a~~~~~~~gg~~ 80 (396) T protein:vir:60 1 MSDYHHGVQVLEINEGTRVISTVSTAIVGMVCTASDADAEIFPLNKPVLITNVQSAIAKAGKKGTLAASLQAIADQSKPV 80 (396) T ss_pred CCCCCCCeEEEEcCCCcccccccCceeEEEEecccccccccccCccCeEeechHHHHHhhcCcchhHHHHHHHhhccCce Confidence 984 2222 2334577888899999999999774332211 13566778999999999999999999999999997544 Q ss_pred eeeeeccccccccccccccceeccceeecccccccch-hhhhhhcccccccccceeeeeeccccceeeechhheeeeccc Q lcl|Aclame:pro 77 WRVMVLEATEVTEEELSDGDTIDKVPILGNHEVESPD-GDIEFTTDDDPDVEDFDAEIVINSATGDVATSEDSIELTYFH 155 (426) Q Consensus 77 ~~~~v~~~t~v~~~~~~~~~tv~~~~~s~~~~~~~ta-~~i~~~~~~~~~~t~~~~~~~~~~~~g~~t~~~~~~~~~~~~ 155 (426) ....................+.. .+ ....+.+. ......+........... T Consensus 81 ~~vv~~~~~~~~~~~~~~~~~~~--~~--~~~~d~~~~~tg~~al~~~~~~~~~~~------------------------ 132 (396) T protein:vir:60 81 TVVVRVEDGTGEDEETKLAQTVS--NI--IGTTDENGQYTGLKALLAAESVTGVKP------------------------ 132 (396) T ss_pred EEEEecccccccccccccccccc--cc--cccccccccccchhhhhhcccceeeee------------------------ Confidence 33322111111001000000000 00 00000000 000000000000000000 Q ss_pred ccchhhhhhhccccceeecccccchhhhHhHhhhhhhhhhcceEEEEEecccccccchhhHHHHHHHhhccCcceEEEEe Q lcl|Aclame:pro 156 ADWSQLDEFPSDVNNFAVADRRFDLKGVGVLDETHSWASDEDMGMIANGVNVDDYDSVDEAMDVAHEVAGYVPSGDLMMI 235 (426) Q Consensus 156 ~d~~~~~~~~s~~~~~~la~~~~~~~~~~~~~~~~~wa~~~~kl~~~~~~d~~~~~~~~~~~~~a~~~a~~~~rt~~~~~ 235 (426) . ..++ .++........+...++..+.+++....... ........ 
...+-.....+++ T Consensus 133 -------------~-il~a---p~~~~~~v~~al~~~~~~~~~~~i~d~p~~~---~~~~a~~~---~~~~~s~~~~~~~ 189 (396) T protein:vir:60 133 -------------R-ILGV---PGLDTKEVAVALASVCQKLRAFGYISAWGCK---TISEVKAY---RQNFSQRELMVIW 189 (396) T ss_pred -------------e-eccc---cccccHHHHHHHHHHhccCCeEEEEeCCCCC---CHHHHHHH---HhhcCCceEEEEe Confidence 0 0000 0111111112222333433434333221111 11111111 1111111222222 Q ss_pred cCC----------CccchhHHHHHHHhhhcccccceeecccccceeeccccccccc---------cccchhHHHHhhcCc Q lcl|Aclame:pro 236 VDA----------SDDDLAAYQLGKFAVSEPWYNPLWNELPAGETVSKNVGDPEEQ---------GTFEGGDEAEGEGPV 296 (426) Q Consensus 236 ~~~----------~~~~~~aa~~g~~~~~~p~~~~~~~~~~~~~~~~~~k~~~gv~---------~~~~~~~~~~~~~~~ 296 (426) .-- ....|.++++|.++..+.-+.+ |+. + .++.+.||. ......|...|+.+. T Consensus 190 p~~~~~d~~~~~~~~~p~s~~~AG~~a~~d~~~g~-~~s-p------aN~~l~gi~~~~~~~~~~~~~~~~~~~~Ln~~g 261 (396) T protein:vir:60 190 PDFLAWDTVASTTATAYATARALGLRAKIDQEQGW-HKT-L------SNVGVNGVTGISASVFWDLQESGTDADLLNESG 261 (396) T ss_pred CceeeecccCCceeEEchhHHHHHHHHHhhhccCc-EeC-c------CCceecceeeceeecccccCCCcchhhhhhhcC Confidence 211 0123568888988887743321 111 1 122222321 123345566676554 Q ss_pred -cEEEEEcCCEEEeeceeecCcccCcceeehhhhHHHHHHHHHHHHHHHHhcCCCCcccHHHHHHHHHHHHHHHHHhhcC Q lcl|Aclame:pro 297 -NVLIDVSDANRVSNAVTTAGADSDTSFFDIRRTKVYTAEMLELDLESLQVSDDDVPFTEDGQAMIEDAIKGTMSGLTGS 375 (426) Q Consensus 297 -N~~~~~~g~~~~~~~~t~~G~~~sg~~iD~i~g~dwl~~~iq~~l~~ll~~~~KIp~td~Gi~~i~~~v~~~l~~~v~~ 375 (426) |+++. +++..+|-+.|+++ +..=.||=++|..||++..|+..++..+-. 
|.+..-...|+..|+.-|..-+++ T Consensus 262 I~~~~~-~~G~~~wG~rT~~~-d~~~~~i~~rR~~~~i~~~i~~~~~~~v~e----~n~~~~~~~i~~~i~~~l~~l~~~ 335 (396) T protein:vir:60 262 VTTLIR-RDGFRFWGNRTCSD-DPLFLFENYTRTAQVLADTMAEAHMWAVDK----PITATLIRDIVDGINAKFRELKTN 335 (396) T ss_pred cEEEEc-CCCEEEEcccccCC-CcccceeehhhHHHHHHHHHHHHHHHhccC----CCCHHHHHHHHHHHHHHHHHHHhC Confidence 88765 55677786667655 223469999999999999999998877643 778888999999999999998876 Q ss_pred CCccccceeEecCcccCcHHHHHhhcCCCeEEEEEEcccEEEEEEEEEEeC Q lcl|Aclame:pro 376 VGQPLAEYEVDVPEWDDDDVDRVNRNWGGIDLDARLAQRAHTFSLGLNVSV 426 (426) Q Consensus 376 ~g~~~~~y~~~~p~~~~~~~dra~R~~~~i~~~~~laGAIh~v~I~g~v~v 426 (426) |. + .+|++..-....+++|+.+-++. +++.+.....++++.++....+ T Consensus 336 ga-l-~g~~~~~d~~~nt~~~i~~G~~~-~~i~~~p~~pae~I~~~~~~~~ 383 (396) T protein:vir:60 336 GY-I-VDATCWFSEESNDAETLKAGKLY-IDYDYTPVPPLENLTLRQRITD 383 (396) T ss_pred Cc-e-eceEEEEecCCCCHHHhhCCEEE-EEEEEEecCCcceEEEEEEEch Confidence 53 3 35777665555666787776665 7888888999999999999998 No 14 >protein:vir:2035 Length: 396 # NCBI annotation: gpFI # Family: family:all:115 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046778;genbank:gi:9630349;genbank:GeneID:1261516 Probab=98.98 E-value=7.6e-09 Score=65.09 Aligned_cols=358 Identities=15% Similarity=0.038 Sum_probs=189.3 Q ss_pred CCC---ceEEEEEeeccccccccCccceEEEeccccccccc-chhhhheeecHHHHHhccCCCCHHHHHHHHHHhcCCce Q lcl|Aclame:pro 1 MPK---QIVEIELTAEIADRPQETFTDAAIVGTAEEEPPDA-EFGEVNQYSTSTSVGDDYGEDSDVYTASEAIEEMGAEQ 76 (426) Q Consensus 1 mp~---~iVnV~isl~t~a~~~~~Fg~~Lilg~~~~~~~~~-~~~~~~~Yts~~~V~~Dfg~~sp~YkAA~~~f~Q~~~~ 76 (426) |+. -|-=+.+.-.+.++..-...++-|+|.+...+... ..++-.+.++.++....||.+...+.+..++|.++-.. 
T Consensus 1 m~~~~~GV~v~e~~~g~~~i~~v~tav~~~vg~a~~a~~~~~~l~~pvlvts~~~~~~~~g~~~tL~~al~~~~~ngg~~ 80 (396) T protein:vir:20 1 MSDYHHGVQVLEINEGTRVISTVSTAIVGMVCTASDADAETFPLNKPVLITNVQSAISKAGKKGTLAASLQAIADQSKPV 80 (396) T ss_pred CCCCCCCeEEEEcCCCcceeeecCCceeEEEeeeccCCCccccCccCEEeechHHHHhhcccccchhhhhhhhhccCcee Confidence 983 33334445567777777889999999763322211 13456679999999999999999999999999987443 Q ss_pred eeeeeccccccccccccccceeccceeecccccccch-hhhhhhcccccccccceeeeeeccccceeeechhheeeeccc Q lcl|Aclame:pro 77 WRVMVLEATEVTEEELSDGDTIDKVPILGNHEVESPD-GDIEFTTDDDPDVEDFDAEIVINSATGDVATSEDSIELTYFH 155 (426) Q Consensus 77 ~~~~v~~~t~v~~~~~~~~~tv~~~~~s~~~~~~~ta-~~i~~~~~~~~~~t~~~~~~~~~~~~g~~t~~~~~~~~~~~~ 155 (426) ....................+... +.+. ..... ......+....... T Consensus 81 ~~v~~~~~~~~~~~~~~~a~t~~~--~~~~--~~~~~~~tg~~al~~~~~~~---------------------------- 128 (396) T protein:vir:20 81 TVVMRVEDGTGDDEETKLAQTVSN--IIGT--TDENGQYTGLKAMLAAESVT---------------------------- 128 (396) T ss_pred EEEEeccccccccccccccccccc--cccc--cccccccchhhhhhhhcccc---------------------------- Confidence 333221111110000000000000 0000 00000 00000000000000 Q ss_pred ccchhhhhhhccccceeecccccchhhhHhHhhhhhhhhhcceEEEEEecccccccchhhHHHHHHHhhccCcceEEEEe Q lcl|Aclame:pro 156 ADWSQLDEFPSDVNNFAVADRRFDLKGVGVLDETHSWASDEDMGMIANGVNVDDYDSVDEAMDVAHEVAGYVPSGDLMMI 235 (426) Q Consensus 156 ~d~~~~~~~~s~~~~~~la~~~~~~~~~~~~~~~~~wa~~~~kl~~~~~~d~~~~~~~~~~~~~a~~~a~~~~rt~~~~~ 235 (426) ...-++. ++ . ++........+...++..+.+++...... 
...+ ...+ .+.++-.+...+|+ T Consensus 129 -------~~~p~i~---~a-p--~~~~~~v~~al~~~~~~~~~~~~iD~p~~---~~~~--~a~~-~r~~~~s~~~~~~~ 189 (396) T protein:vir:20 129 -------GVKPRIL---GV-P--GLDTKEVAVALASVCQKLRAFGYISAWGC---KTIS--EVKA-YRQNFSQRELMVIW 189 (396) T ss_pred -------ccchhhh---hh-h--hhccHHHHHHHHHHHhcCCcEEEEecCCC---CCHH--HHHH-HhhCCCCceEEEEc Confidence 0000000 00 0 00001111222223333333322211111 1111 1111 11112112222333 Q ss_pred cCC----------CccchhHHHHHHHhhhcccccceeecccccceeeccccccccc---------cccchhHHHHhhcCc Q lcl|Aclame:pro 236 VDA----------SDDDLAAYQLGKFAVSEPWYNPLWNELPAGETVSKNVGDPEEQ---------GTFEGGDEAEGEGPV 296 (426) Q Consensus 236 ~~~----------~~~~~~aa~~g~~~~~~p~~~~~~~~~~~~~~~~~~k~~~gv~---------~~~~~~~~~~~~~~~ 296 (426) +-. ....|.++++|.++..+.-+.+. + .+ .++.+.||. .....+|...|+.+. T Consensus 190 P~~~~~d~~~~~~~~~p~s~~~Ag~~a~~d~~~g~~-~-sp------aN~~l~gi~~~~~~~~~~~~~~~~~~~~Ln~~g 261 (396) T protein:vir:20 190 PDFLAWDTVTSTTATAYATARALGLRAKIDQEQGWH-K-TL------SNVGVNGVTGISASVFWDLQESGTDADLLNESG 261 (396) T ss_pred CccccccCcCCcceeechhHHHHHHHHHhhhhcCcE-e-cc------CCceeccceecceecccccCCCcchhhhhhhcC Confidence 211 11346788888888777322211 1 11 122223321 123345666676544 Q ss_pred -cEEEEEcCCEEEeeceeecCcccCcceeehhhhHHHHHHHHHHHHHHHHhcCCCCcccHHHHHHHHHHHHHHHHHhhcC Q lcl|Aclame:pro 297 -NVLIDVSDANRVSNAVTTAGADSDTSFFDIRRTKVYTAEMLELDLESLQVSDDDVPFTEDGQAMIEDAIKGTMSGLTGS 375 (426) Q Consensus 297 -N~~~~~~g~~~~~~~~t~~G~~~sg~~iD~i~g~dwl~~~iq~~l~~ll~~~~KIp~td~Gi~~i~~~v~~~l~~~v~~ 375 (426) |.++. +++..+|-+.|+++ +..=.||=++|..||+...++..++..+-. 
|.+..=+..|+..|+.-|++-+++ T Consensus 262 i~~~~~-~~G~~~wG~rT~s~-d~~~~~i~~rR~~~~i~~~~~~~~~~~v~e----~~~~~~~~~i~~~i~~~L~~l~~~ 335 (396) T protein:vir:20 262 VTTLIR-RDGFRFWGNRTCSD-DPLFLFENYTRTAQVVADTMAEAHMWAVDK----PITATLIRDIVDGINAKFRELKTN 335 (396) T ss_pred cEEEEc-CCCEEEEcccccCC-CcccceeehhhHHHHHHHHHHHHHHHhccC----CCCHHHHHHHHHHHHHHHHHHHhC Confidence 88765 55677786666654 333469999999999999999998876643 778888889999999999998876 Q ss_pred CCccccceeEecCcccCcHHHHHhhcCCCeEEEEEEcccEEEEEEEEEEeC Q lcl|Aclame:pro 376 VGQPLAEYEVDVPEWDDDDVDRVNRNWGGIDLDARLAQRAHTFSLGLNVSV 426 (426) Q Consensus 376 ~g~~~~~y~~~~p~~~~~~~dra~R~~~~i~~~~~laGAIh~v~I~g~v~v 426 (426) |. +-+|++...+...+++|+.+.++. +++.+.....++++.++....+ T Consensus 336 G~--l~g~~v~~d~~~nt~~~i~~G~~~-~~i~~~p~~p~e~i~~~~~~~~ 383 (396) T protein:vir:20 336 GY--IVDATCWFSEESNDAETLKAGKLY-IDYDYTPVPPLENLTLRQRITD 383 (396) T ss_pred cc--eeceEEEEecCCCCHHHhhCCEEE-EEEEEEecCCcceEEEEEEEch Confidence 53 345777776666677888887776 8888889999999999999998 No 15 >protein:vir:107865 Length: 477 # NCBI annotation: gp39 # Family: family:all:115 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024712;genbank:gi:48696949;genbank:GeneID:2845953 Probab=98.92 E-value=1.5e-08 Score=63.53 Aligned_cols=404 Identities=10% Similarity=-0.016 Sum_probs=187.4 Q ss_pred CCC---ceEEE-EEeeccccccccCccceEEEecccccccccchhhhheeecHHHHHh--ccCCCCHHHHHHHHHHhcCC Q lcl|Aclame:pro 1 MPK---QIVEI-ELTAEIADRPQETFTDAAIVGTAEEEPPDAEFGEVNQYSTSTSVGD--DYGEDSDVYTASEAIEEMGA 74 (426) Q Consensus 1 mp~---~iVnV-~isl~t~a~~~~~Fg~~Lilg~~~~~~~~~~~~~~~~Yts~~~V~~--Dfg~~sp~YkAA~~~f~Q~~ 74 (426) ||. .=|-| .+.-.+.++..-.-+++.|||.....|. +...+-+|..+... ++..+...+.|..++|.|+. 
T Consensus 1 M~~~~~pGVyv~E~~~~~~~i~~v~T~v~~~VG~a~~gp~----n~pv~its~~d~~~~g~~~~~~tL~~Av~~~f~nGg 76 (477) T protein:vir:10 1 MAANYLHGVETIEKETGSRPVKVVKSAVIGLIGTAPIGPV----NTPVQSLSDVDAAQFGPQLAGFTIPQALDAVYDYGS 76 (477) T ss_pred CcccCCCCeEEEEccCCcccccccCCceeEEEecccCCCC----CcCEEEccHHHHHHhccCCCCCcHHHHHHHHHhccc Confidence 992 32333 3344567788888999999997643332 34455666666543 23356788899999999985 Q ss_pred ceeeeeecc-cc----ccccccc---cccce-------eccceeecccc---cccchhhhhhhccccc-------ccccc Q lcl|Aclame:pro 75 EQWRVMVLE-AT----EVTEEEL---SDGDT-------IDKVPILGNHE---VESPDGDIEFTTDDDP-------DVEDF 129 (426) Q Consensus 75 ~~~~~~v~~-~t----~v~~~~~---~~~~t-------v~~~~~s~~~~---~~~ta~~i~~~~~~~~-------~~t~~ 129 (426) .+..+.... .. .+..... ..... .....+...+. .........+...... ..... T Consensus 77 ~~~~vVrV~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (477) T protein:vir:10 77 GTVIVINVLDPAVHKSNAANEPVTFDAATGRAKLAHPAAANLVLKNDSGGTTYAEGTDYAVDLINGVITRIKTGTIPPGA 156 (477) T ss_pred eEEEEEecCccccccccccccccccccccceecccccccccccccccccccccccchhhhhhhccccceecccccccccc Confidence 543322111 11 0000000 00000 00000000000 0000000000000000 00000 Q ss_pred eeeeeec-ccc-ceeeec--hh--heeeecccccchhhhhhhcc--ccceeecccccchhhhHhHhhhhhhhhhcceEEE Q lcl|Aclame:pro 130 DAEIVIN-SAT-GDVATS--ED--SIELTYFHADWSQLDEFPSD--VNNFAVADRRFDLKGVGVLDETHSWASDEDMGMI 201 (426) Q Consensus 130 ~~~~~~~-~~~-g~~t~~--~~--~~~~~~~~~d~~~~~~~~s~--~~~~~la~~~~~~~~~~~~~~~~~wa~~~~kl~~ 201 (426) ....... ... ...... .. ........ ...+...... +....+... 
++-......+.+..-++.-+.+.+ T Consensus 157 ~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~tG--l~al~~~~~~~~~~~~~l~ap-g~~~~~~v~~~l~~~~~~~~~~~~ 233 (477) T protein:vir:10 157 TAAKATYDYADPTKVTAADIIGAVNAAGMRTG--MKALKDTYNLYGYFSKILIAP-AYCTQNSVSVELEAMAVQLGAIAY 233 (477) T ss_pred eeeeeccccccccccccccccccccccchhhh--hhhhhhhhhhcchhccccccc-ccccchhhHHHHHHHHhhCCEEEE Confidence 0000000 000 000000 00 00000000 0011110000 000000000 111111112222222333333322 Q ss_pred EEecccccccchhhHHHHHH----HhhccCcceEEEEecCC----------CccchhHHHHHHHhhhc----ccccceee Q lcl|Aclame:pro 202 ANGVNVDDYDSVDEAMDVAH----EVAGYVPSGDLMMIVDA----------SDDDLAAYQLGKFAVSE----PWYNPLWN 263 (426) Q Consensus 202 ~~~~d~~~~~~~~~~~~~a~----~~a~~~~rt~~~~~~~~----------~~~~~~aa~~g~~~~~~----p~~~~~~~ 263 (426) .|............... ...++..+...+++.-. ..-.|.+.++|.++..+ ||.++..+ T Consensus 234 ---~d~p~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~ag~~a~~d~~~g~~~span~ 310 (477) T protein:vir:10 234 ---IDAPIGTTLAQALAGRGPAGTINFNTSSDRVRLCYPHVKVYDTATNAERLEPLSSRAAGLRARVDLDKGYWWSSSNQ 310 (477) T ss_pred ---EecCCCCCHHHHHhhhhhccccccccccceEEEEcCeEEEecccCCceeEEchHHHHHHHHHHhhhcCCceeccCCc Confidence 22211111111111111 01111122222322211 11235688888887765 33333222 Q ss_pred cccccceeeccccccccc--cccchhHHHHhhcCc-cEEEEEc-CCEEEeeceeecCcccC--cceeehhhhHHHHHHHH Q lcl|Aclame:pro 264 ELPAGETVSKNVGDPEEQ--GTFEGGDEAEGEGPV-NVLIDVS-DANRVSNAVTTAGADSD--TSFFDIRRTKVYTAEML 337 (426) Q Consensus 264 ~~~~~~~~~~~k~~~gv~--~~~~~~~~~~~~~~~-N~~~~~~-g~~~~~~~~t~~G~~~s--g~~iD~i~g~dwl~~~i 337 (426) ...+..- . ...+. ......|...|+.+. |.++.+. 
++..+|-+.|+.+...+ -.||=++|..||+...+ T Consensus 311 ~~~gi~~---~--~~~~~~~~~~~~~~~~~L~~~gi~~i~~~~~~G~~~wG~rT~~~~~~~~~~~~~~vrR~~~~i~~~~ 385 (477) T protein:vir:10 311 QLVGVTG---V--ERPLSAMIDDPQSDVNMLNEQGITTVFSSYGSGLRLWGNRTAAWPTVTHMRNFENVRRTGDVINESL 385 (477) T ss_pred eeccccc---c--ccccccccCCChhhHHHHhhCCceEEEEecCCcEEEEcccccCCCCCCcccceeehhhHHHHHHHHH Confidence 1111000 0 00011 112345566676544 8888775 56888888887664333 35889999999999999 Q ss_pred HHHHHHHHhcCCCCcccHHHHHHHHHHHHHHHHHhhcCCCccccceeEecCcccCcHHHHHhhcCCCeEEEEEEcccEEE Q lcl|Aclame:pro 338 ELDLESLQVSDDDVPFTEDGQAMIEDAIKGTMSGLTGSVGQPLAEYEVDVPEWDDDDVDRVNRNWGGIDLDARLAQRAHT 417 (426) Q Consensus 338 q~~l~~ll~~~~KIp~td~Gi~~i~~~v~~~l~~~v~~~g~~~~~y~~~~p~~~~~~~dra~R~~~~i~~~~~laGAIh~ 417 (426) +..++..+- + |.+..-...|+..|+.-|++-+++|. + .+|.+.+.+...+++|+.+.++. +.+.+.....+++ T Consensus 386 ~~~~~~~v~---~-~~~~~~~~~i~~~i~~~l~~l~~~g~-l-~g~~v~~~~~~nt~~~i~~G~~~-~~i~~~p~~p~e~ 458 (477) T protein:vir:10 386 RYFSQQFVD---A-PIDQGLIDSLVESVNGFGRKLIGDGA-L-LGFKAWFDPARNPKEELAAGHLL-INYKYTVPPPLER 458 (477) T ss_pred HHHHHHhcc---C-CCCHHHHHHHHHHHHHHHHHHHhCCc-e-eeeEEEEecCCCCHHHhhCCeEE-EEEEEEecCCcce Confidence 999887654 2 66777788999999999999987543 3 35888887766778899998887 9999999999999 Q ss_pred EEEEEEEeC Q lcl|Aclame:pro 418 FSLGLNVSV 426 (426) Q Consensus 418 v~I~g~v~v 426 (426) +.++....+ T Consensus 459 i~~~~~~~~ 467 (477) T protein:vir:10 459 LTYETEITS 467 (477) T ss_pred EEEEEEEcc Confidence 999999888 No 16 >protein:vir:10336 Length: 386 # NCBI annotation: ORF39 # Family: family:all:115 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758931;genbank:gi:27311206;genbank:GeneID:956116 Probab=98.86 E-value=1.8e-08 Score=63.09 Aligned_cols=353 Identities=14% Similarity=0.025 Sum_probs=183.9 Q ss_pred CCC----ceEEEEEeeccccccccCccceEEEeccccccccc-chhhhheeecHHHHHhccCCCCHHHHHHHHHHhcCCc Q lcl|Aclame:pro 1 MPK----QIVEIELTAEIADRPQETFTDAAIVGTAEEEPPDA-EFGEVNQYSTSTSVGDDYGEDSDVYTASEAIEEMGAE 75 (426) Q Consensus 1 
mp~----~iVnV~isl~t~a~~~~~Fg~~Lilg~~~~~~~~~-~~~~~~~Yts~~~V~~Dfg~~sp~YkAA~~~f~Q~~~ 75 (426) ||. -|-=+.++-.+.++....-+++.|+|.....++.. ..++....++..+...-||.+.+.+++...+|.|+.. T Consensus 1 M~~~~~~Gv~v~ev~~~~~~i~~v~tav~~~vg~a~~a~~~~~~~~~pv~i~s~~~~~~~~g~~~tl~~a~~~~~~~gg~ 80 (386) T protein:vir:10 1 MAEQYLHGAEVVEIDNGARPIRTAQSGVIGLVGTAPDADATAFPLNTPVLIAGSRREAAKLGAGGTLPQAIDGIFDQTGA 80 (386) T ss_pred CccccCCCeEEEEcCCCcccccccCcceeEEEEecCCCCCcccccccceEecchHHHHhhcCCCcchhHHHHHHhccCce Confidence 993 33334455566677777889999999764322211 1345556777777777899999999999999999743 Q ss_pred eeeeeeccccccccccccccceeccceeecccccccchhhhhhhcccccccccceeeeeeccccceeeechhheeeeccc Q lcl|Aclame:pro 76 QWRVMVLEATEVTEEELSDGDTIDKVPILGNHEVESPDGDIEFTTDDDPDVEDFDAEIVINSATGDVATSEDSIELTYFH 155 (426) Q Consensus 76 ~~~~~v~~~t~v~~~~~~~~~tv~~~~~s~~~~~~~ta~~i~~~~~~~~~~t~~~~~~~~~~~~g~~t~~~~~~~~~~~~ 155 (426) ....... ..+.....+.. .+.+.... ......++......... ..... T Consensus 81 ~~~vv~~------~~~~~~~~t~~--~~ig~~~~---~t~~~tgl~~l~~~~~~---~~~~p------------------ 128 (386) T protein:vir:10 81 VVVVIRV------DEGVDSAATQS--NVIGKVDA---DTEQYTGILALLSAENT---VKVQP------------------ 128 (386) T ss_pred eEEEeec------cccccccccch--hhhccccc---ccchhhhhHHhhhhccc---ccccc------------------ Confidence 3221110 00000000000 00000000 00000000000000000 00000 Q ss_pred ccchhhhhhhccccceeecccccchhhhHhHhhhhhhhhhcceEEEEEecccccccchhhHHHHH----HHhhccCcceE Q lcl|Aclame:pro 156 ADWSQLDEFPSDVNNFAVADRRFDLKGVGVLDETHSWASDEDMGMIANGVNVDDYDSVDEAMDVA----HEVAGYVPSGD 231 (426) Q Consensus 156 ~d~~~~~~~~s~~~~~~la~~~~~~~~~~~~~~~~~wa~~~~kl~~~~~~d~~~~~~~~~~~~~a----~~~a~~~~rt~ 231 (426) .+ ..+........+ .+.+ ..-.+++.+....+....+..+...... ...+-+++. 
+ T Consensus 129 -----------~i---~~ap~~~~~~~v--~~~l---~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~s~~~~~~~p~-~ 188 (386) T protein:vir:10 129 -----------RI---LIAPGFSNQKAV--ADQL---VSVADTAAWLCHSGWSNTTDAAAITYRELFGSRRCEVVDPW-Y 188 (386) T ss_pred -----------cc---cccccccchhHH--HHHH---HHhhcceEEEEEeCCCCCchHHHHHhhhcccccceEEecCc-e Confidence 00 000000000000 1111 1111122222222211111100000000 000111111 1 Q ss_pred EEEec--CC-CccchhHHHHHHHhhhc----ccccceeecccccceeeccccccccc---------cccchhHHHHhhcC Q lcl|Aclame:pro 232 LMMIV--DA-SDDDLAAYQLGKFAVSE----PWYNPLWNELPAGETVSKNVGDPEEQ---------GTFEGGDEAEGEGP 295 (426) Q Consensus 232 ~~~~~--~~-~~~~~~aa~~g~~~~~~----p~~~~~~~~~~~~~~~~~~k~~~gv~---------~~~~~~~~~~~~~~ 295 (426) .++.. .. ..-.|.++++|.++..+ ||.++.. +.+.||. ......|...|+.+ T Consensus 189 ~v~~~~~~~~~~~p~s~~~ag~~a~~D~~~G~~~spaN------------~~l~gv~~~~~~~~~~~~~~~~~~~~l~~~ 256 (386) T protein:vir:10 189 KVWDVETSAHIIQPPSARHAGVMAKVHNTLGFWWSNSN------------QEILGIDGLCRPVDFKLDDPTCRANLLNAK 256 (386) T ss_pred eeeccccccceeechHHHHHHHHHHhhhcCCcEEccCC------------ceeecccccceecccccccCcchhhhhhhc Confidence 11111 11 12235778888888776 3333322 2222221 12334556666655 Q ss_pred c-cEEEEEcCCEEEeeceeecCcccCcceeehhhhHHHHHHHHHHHHHHHHhcCCCCcccHHHHHHHHHHHHHHHHHhhc Q lcl|Aclame:pro 296 V-NVLIDVSDANRVSNAVTTAGADSDTSFFDIRRTKVYTAEMLELDLESLQVSDDDVPFTEDGQAMIEDAIKGTMSGLTG 374 (426) Q Consensus 296 ~-N~~~~~~g~~~~~~~~t~~G~~~sg~~iD~i~g~dwl~~~iq~~l~~ll~~~~KIp~td~Gi~~i~~~v~~~l~~~v~ 374 (426) + |.++. +++..+|-+.|.++ +..=.||=++|-.+|++..|+..++..+-. 
|.+..=...|+..|+.-|..-++ T Consensus 257 gi~~~~~-~~G~~~wG~rT~~~-d~~~~~i~vrR~~~~i~~~~~~~~~~~v~e----~~~~~~~~~i~~~i~~~L~~l~~ 330 (386) T protein:vir:10 257 EVTTTIQ-QNGFRVWGDRTCSA-DSKWAFKNVVITNDMIADSLVRNHLWAVDR----NITKTYVEDVTEGVNNYLRHLKN 330 (386) T ss_pred CcEEEEc-CCCEEEEcccccCC-CcccceeehhhHHHHHHHHHHHHHHHhccC----CCCHHHHHHHHHHHHHHHHHHHh Confidence 5 66654 55577776666544 334459999999999999999998876642 77888899999999999999987 Q ss_pred CCCccccceeEecCcccCcHHHHHhhcCCCeEEEEEEcccEEEEEEEEEEeC Q lcl|Aclame:pro 375 SVGQPLAEYEVDVPEWDDDDVDRVNRNWGGIDLDARLAQRAHTFSLGLNVSV 426 (426) Q Consensus 375 ~~g~~~~~y~~~~p~~~~~~~dra~R~~~~i~~~~~laGAIh~v~I~g~v~v 426 (426) +|. +-+|.+.......+++|+.+.++. +++.+.....++++.++...+. T Consensus 331 ~g~--l~g~~v~~d~~~nt~~~~~~G~~~-~~i~~~p~~p~e~i~~~~~~~~ 379 (386) T protein:vir:10 331 IGA--IAGGECWVDPELNSPDQIQQGKVY-FDYDFSAYAPAEHITFRSHMVN 379 (386) T ss_pred CCc--eeeeEEEEcccCCCHHHhhCCeEE-EEEEEEecCCceeEEEEEEEeh Confidence 653 446888887766778888887777 8899999999999999999998 No 17 >protein:vir:79092 Length: 477 # NCBI annotation: gp13, phage tail sheath protein # Family: family:all:115 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111213;genbank:gi:134288804;genbank:GeneID:4960766 Probab=98.85 E-value=2.9e-08 Score=61.93 Aligned_cols=409 Identities=11% Similarity=0.003 Sum_probs=189.0 Q ss_pred CCC---ceEEE-EEeeccccccccCccceEEEecccccccccchhhhheeecHHHHHh--ccCCCCHHHHHHHHHHhcCC Q lcl|Aclame:pro 1 MPK---QIVEI-ELTAEIADRPQETFTDAAIVGTAEEEPPDAEFGEVNQYSTSTSVGD--DYGEDSDVYTASEAIEEMGA 74 (426) Q Consensus 1 mp~---~iVnV-~isl~t~a~~~~~Fg~~Lilg~~~~~~~~~~~~~~~~Yts~~~V~~--Dfg~~sp~YkAA~~~f~Q~~ 74 (426) ||. .=|-| .|+-.++++....-+++.|||.....|. ++..+-+|..+-.. 
+.+.+...+.|..++|.|+- T Consensus 1 M~~~~~pGVyv~E~~~g~~~I~~v~Tsv~~~VG~a~~~p~----n~pv~its~~d~~~~g~~~~~~tL~~Av~~~f~ngg 76 (477) T protein:vir:79 1 MAANYLHGVETIEKETGSRPVKVVKSAVIGLIGTAPIGPV----NTPVQSLSDVDAAQFGPQLAGFTIPQALDAVYDYGS 76 (477) T ss_pred CcCCCCCCeEEEEecCCcccccccCCceEEEEeecccCCC----cccEEEccHHHHHHhcCCCCCCcHHHHHHHHhhcCC Confidence 992 32333 3455677888888899999997654443 44556667655554 23366778899999999974 Q ss_pred ceeeeee-ccccccccccc---------cccceec-----cceeeccc-------ccccchhhhhhhcc-----cc-ccc Q lcl|Aclame:pro 75 EQWRVMV-LEATEVTEEEL---------SDGDTID-----KVPILGNH-------EVESPDGDIEFTTD-----DD-PDV 126 (426) Q Consensus 75 ~~~~~~v-~~~t~v~~~~~---------~~~~tv~-----~~~~s~~~-------~~~~ta~~i~~~~~-----~~-~~~ 126 (426) .+..+.. .+......... ....... ...+.... ....+...+..... .. ... T Consensus 77 ~~~~vvrV~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (477) T protein:vir:79 77 GTVIVINVLDPAVHKSNAASESVTFDAATGRAKLAHPAAANLVLKNDSGGTTYTEGTDYAVDLINGVITRIKTGTIPAAA 156 (477) T ss_pred ceEEEEeccCCccccccccccccccccccccccccccccceeEEeecccccccccCccccccccchhhhhhhcccccccc Confidence 4333221 11111000000 0000000 00000000 00000000000000 00 000 Q ss_pred ccceeeeeeccccceeeechhheeeeccccc---chhhhhhhcc--ccceeecccccchhhhHhHhhhhhhhhhcceEEE Q lcl|Aclame:pro 127 EDFDAEIVINSATGDVATSEDSIELTYFHAD---WSQLDEFPSD--VNNFAVADRRFDLKGVGVLDETHSWASDEDMGMI 201 (426) Q Consensus 127 t~~~~~~~~~~~~g~~t~~~~~~~~~~~~~d---~~~~~~~~s~--~~~~~la~~~~~~~~~~~~~~~~~wa~~~~kl~~ 201 (426) +...... ...................... ...+....+- +....+... 
++-......+.+..-++..+.+.+ T Consensus 157 ~~~~~~~--~~~~~~~~~~~~~~g~~~a~~~~tg~~al~~~~~~~~~~~~iv~ap-g~~~~~~v~~~l~~~~~~~~~~a~ 233 (477) T protein:vir:79 157 TAAKATY--DYADPTKVTAADIIGAVNAAGMRTGMKALKDTYNLYGYFSKILIAP-AYCTQNSVSVELEAMAVQLGAIAY 233 (477) T ss_pred ceeecee--ccCCcccceeeeecccccccccchhhhhhhhhhhhcccccceeecc-ccccchhHHHHHHHHHhhcCeEEE Confidence 0000000 0000000000000000000000 0011111110 000001000 000111112223333343333333 Q ss_pred EEecccccccchhhHHHHH-HHhhccCcceEEEEecC-------C---CccchhHHHHHHHhhhc----ccccceeeccc Q lcl|Aclame:pro 202 ANGVNVDDYDSVDEAMDVA-HEVAGYVPSGDLMMIVD-------A---SDDDLAAYQLGKFAVSE----PWYNPLWNELP 266 (426) Q Consensus 202 ~~~~d~~~~~~~~~~~~~a-~~~a~~~~rt~~~~~~~-------~---~~~~~~aa~~g~~~~~~----p~~~~~~~~~~ 266 (426) ...........+....... ....+...+...+++.- . ..-.|.++++|.++..+ ||.++..+... T Consensus 234 ~d~p~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~s~~~ag~~a~~d~~~g~~~span~~~~ 313 (477) T protein:vir:79 234 IDAPIGTTLAQALAGRGPAGTINFNTSSDRVRLCYPHVKVYDIATNAERLEPLSSRAAGLRARVDLDKGYWWSSSNQQLV 313 (477) T ss_pred EecCCCCChHHHhhhhhhccccccccccceEEEEcCeeEEecccCCceeeechHHHHHHHHHHhhccCCceEccCCceee Confidence 3222111111111111000 00011111222222211 1 12236788888888765 33333222211 Q ss_pred ccceeeccccccccccccchhHHHHhhcC-ccEEEEEc-CCEEEeeceeecCcccC--cceeehhhhHHHHHHHHHHHHH Q lcl|Aclame:pro 267 AGETVSKNVGDPEEQGTFEGGDEAEGEGP-VNVLIDVS-DANRVSNAVTTAGADSD--TSFFDIRRTKVYTAEMLELDLE 342 (426) Q Consensus 267 ~~~~~~~~k~~~gv~~~~~~~~~~~~~~~-~N~~~~~~-g~~~~~~~~t~~G~~~s--g~~iD~i~g~dwl~~~iq~~l~ 342 (426) +.. ++.............|...|+.+ .|.++.+. 
++..+|-+.|+.+...+ -.||=++|..+|++..++..++ T Consensus 314 gv~---~~~~~~~~~~~~~~~~~~~L~~~~i~~i~~~~~~G~~~wG~rT~~~~~~~~~~~~i~vrR~~~~i~~~~~~~~~ 390 (477) T protein:vir:79 314 GVT---GVERPLSAMIDDPQSDVNMLNEQGITTVFSSYGSGLRLWGNRTAAWPTVTHMRNFENVRRTGDVINESLRYFSQ 390 (477) T ss_pred cce---ecccccccccCCChhhHHHHhhCCceEEEEecCCcEEEEcccccCCCCCCccceeeehhhHHHHHHHHHHHHHH Confidence 110 11111000112234566667654 48888775 46788888887543322 3588999999999999999988 Q ss_pred HHHhcCCCCcccHHHHHHHHHHHHHHHHHhhcCCCccccceeEecCcccCcHHHHHhhcCCCeEEEEEEcccEEEEEEEE Q lcl|Aclame:pro 343 SLQVSDDDVPFTEDGQAMIEDAIKGTMSGLTGSVGQPLAEYEVDVPEWDDDDVDRVNRNWGGIDLDARLAQRAHTFSLGL 422 (426) Q Consensus 343 ~ll~~~~KIp~td~Gi~~i~~~v~~~l~~~v~~~g~~~~~y~~~~p~~~~~~~dra~R~~~~i~~~~~laGAIh~v~I~g 422 (426) .++-. |.+..-...|+..|+.-|++-++.|. +.+|.+...+...+++|+.+.++. +++.+.....++++.++. T Consensus 391 ~~v~e----~~~~~~~~~i~~~i~~~l~~l~~~g~--l~g~~v~~~~~~nt~~~i~~G~~~-~~i~~~p~~p~e~i~~~~ 463 (477) T protein:vir:79 391 QFVDA----PIDQGLIDSLVESVNGFGRKLIGDGA--LLGFKAWFDPARNPKEELAAGHLL-INYKYTVPPPLERLTYET 463 (477) T ss_pred HhccC----CCCHHHHHHHHHHHHHHHHHHHhCCc--eeeeEEEEecCCCCHHHhhCCeEE-EEEEEEecCCceeEEEEE Confidence 76642 55777788999999999999987653 345888887777778898887776 899999999999999999 Q ss_pred EEeC Q lcl|Aclame:pro 423 NVSV 426 (426) Q Consensus 423 ~v~v 426 (426) .... 
T Consensus 464 ~~~~ 467 (477) T protein:vir:79 464 EITS 467 (477) T ss_pred EEec Confidence 8888 No 18 >protein:vir:103993 Length: 390 # NCBI annotation: phage tail sheath protein # Family: family:all:115 # MgeID: mge:1665 # MgeName: phi52237 # Cross-refs: genbank:acc:YP_293729;genbank:gi:72537699;genbank:GeneID:3608123 Probab=98.83 E-value=3.4e-08 Score=61.51 Aligned_cols=353 Identities=13% Similarity=0.006 Sum_probs=184.3 Q ss_pred CCCce---EE-EEEeeccccccccCccceEEEeccccccccc-chhhhheeecHHHHHhccCCCCHHHHHHHHHHhcCCc Q lcl|Aclame:pro 1 MPKQI---VE-IELTAEIADRPQETFTDAAIVGTAEEEPPDA-EFGEVNQYSTSTSVGDDYGEDSDVYTASEAIEEMGAE 75 (426) Q Consensus 1 mp~~i---Vn-V~isl~t~a~~~~~Fg~~Lilg~~~~~~~~~-~~~~~~~Yts~~~V~~Dfg~~sp~YkAA~~~f~Q~~~ 75 (426) ||.+. |. +.++-.++++..-....+.|+|.....++.. -+++..+.++..+....||.+...+.+...+|.|+-. T Consensus 1 M~~~~~~Gv~v~e~~~g~~~i~~~~tav~g~vg~a~~ad~~~~pln~pv~i~s~~~~~~~~g~~gtL~~al~~~~~~gg~ 80 (390) T protein:vir:10 1 MPQDYHHGVRVIEINEGGRPIRSVSTAVLGVVCTAADADASAFPLNTPVLLTNVVAALGKAGKKGTLRRTLDAIGKQTKP 80 (390) T ss_pred CcccccCCeEEEEcCCCcccccccCcceeEEEEcccCcCccccccccceEeccHHHHHhhcCCCceehhhhhhhccccCc Confidence 99533 22 3455678888888999999999764333322 1355566788888888999999999999999999754 Q ss_pred eeeeeeccccccccccccccceeccceeecccccccchhhhhhhcccccccccceeeeeeccccceeeechhheeeeccc Q lcl|Aclame:pro 76 QWRVMVLEATEVTEEELSDGDTIDKVPILGNHEVESPDGDIEFTTDDDPDVEDFDAEIVINSATGDVATSEDSIELTYFH 155 (426) Q Consensus 76 ~~~~~v~~~t~v~~~~~~~~~tv~~~~~s~~~~~~~ta~~i~~~~~~~~~~t~~~~~~~~~~~~g~~t~~~~~~~~~~~~ 155 (426) ........ . +.....+.. .+.+..... ........+......... 
T Consensus 81 ~~~vv~v~--~----~~~~~~~~~--~~ig~~~~~-~~~tg~~al~~~~~~~~~-------------------------- 125 (390) T protein:vir:10 81 LTVVVRVA--E----GKDADETTS--NVIGTVTPD-GKYTGIKALLAAQGALGV-------------------------- 125 (390) T ss_pred eEEEEEec--c----ccccccccc--ccccccccc-cccchhhhhhhhhhhhcc-------------------------- Confidence 33221110 0 000000000 000000000 000000000000000000 Q ss_pred ccchhhhhhhccccceeecccccchhhhHhHhhhhhhhhhcceEEEEEecccccccchhhHHHHHHHhhccCcceEEEEe Q lcl|Aclame:pro 156 ADWSQLDEFPSDVNNFAVADRRFDLKGVGVLDETHSWASDEDMGMIANGVNVDDYDSVDEAMDVAHEVAGYVPSGDLMMI 235 (426) Q Consensus 156 ~d~~~~~~~~s~~~~~~la~~~~~~~~~~~~~~~~~wa~~~~kl~~~~~~d~~~~~~~~~~~~~a~~~a~~~~rt~~~~~ 235 (426) ....+... ++........+..-++.-+ .+ ...|...-... ..+. ..+.++......+++ T Consensus 126 -------------~p~il~ap--~~~~~~v~~~l~~~a~~~~-~~--aivD~p~~~t~--~~a~-~~~~~~~s~~~~~~~ 184 (390) T protein:vir:10 126 -------------KPRILAAP--GLDTQPVAAALAATAQSLR-AM--AYVSASGCKTK--EEAA-AYRKQFGQREIMVIW 184 (390) T ss_pred -------------eehhhccc--ccchHHHHHHHHHhhcccc-eE--EEEecCCCCCH--HHHH-HHhhccCCceEEEEc Confidence 00000000 0000000111111111111 12 12221110111 1111 111111111122222 Q ss_pred cCC----------CccchhHHHHHHHhhhcccccceeecccccceeeccccccccc---------cccchhHHHHhhcCc Q lcl|Aclame:pro 236 VDA----------SDDDLAAYQLGKFAVSEPWYNPLWNELPAGETVSKNVGDPEEQ---------GTFEGGDEAEGEGPV 296 (426) Q Consensus 236 ~~~----------~~~~~~aa~~g~~~~~~p~~~~~~~~~~~~~~~~~~k~~~gv~---------~~~~~~~~~~~~~~~ 296 (426) +-. ..--|.++++|.++..+.-+.+ |+. +.++.+.||. 
......|...|+.++ T Consensus 185 p~~~~~d~~~~~~~~~p~s~~~Agl~a~~D~~~g~-~~s-------paN~~l~gi~~~~~~~~~~~~~~~~~~~~ln~~g 256 (390) T protein:vir:10 185 PDWLGWDDTTNSTAVIPAPAIAAGLRAKIDNDIGW-HKT-------ISNVVVNGVSGISADVSWDLQDPATDAGYLNEHE 256 (390) T ss_pred CceEeecccCCcccccchHHHHHHHHHHhhcCCCc-EEC-------cCCceeeceeecceecccccccccchhhhhhhcC Confidence 111 1224668888888877732211 111 1122222221 112223444565554 Q ss_pred -cEEEEEcCCEEEeeceeecCcccCcceeehhhhHHHHHHHHHHHHHHHHhcCCCCcccHHHHHHHHHHHHHHHHHhhcC Q lcl|Aclame:pro 297 -NVLIDVSDANRVSNAVTTAGADSDTSFFDIRRTKVYTAEMLELDLESLQVSDDDVPFTEDGQAMIEDAIKGTMSGLTGS 375 (426) Q Consensus 297 -N~~~~~~g~~~~~~~~t~~G~~~sg~~iD~i~g~dwl~~~iq~~l~~ll~~~~KIp~td~Gi~~i~~~v~~~l~~~v~~ 375 (426) |.++. +++..+|-+.|.++ +..=.||=++|..||+++.++..++..+- | |.+..-...|+..|+.-|..-+++ T Consensus 257 i~t~~~-~~G~~~wG~rT~s~-d~~~~~i~~rR~~~~i~~~i~~~~~~~v~---e-~n~~~~~~~i~~~i~~~L~~l~~~ 330 (390) T protein:vir:10 257 VTTLVN-RNGFRFWGERTCSD-DPKFAFENYTRTAQVAGDSIAEAQMPVVD---G-PLNPSLARDIVESINGWFRQQVAN 330 (390) T ss_pred cEEEEc-CCCEEEEcccccCC-CcccceeehhhHHHHHHHHHHHHHHHhcc---C-CCCHHHHHHHHHHHHHHHHHHHhC Confidence 77765 45577776666654 22336999999999999999999887653 3 889999999999999999999876 Q ss_pred CCccccceeEecCcccCcHHHHHhhcCCCeEEEEEEcccEEEEEEEEEEeC Q lcl|Aclame:pro 376 VGQPLAEYEVDVPEWDDDDVDRVNRNWGGIDLDARLAQRAHTFSLGLNVSV 426 (426) Q Consensus 376 ~g~~~~~y~~~~p~~~~~~~dra~R~~~~i~~~~~laGAIh~v~I~g~v~v 426 (426) |. +.+|.+.......+++|+.+-++. 
+++.+...-.++++.++....+ T Consensus 331 g~--l~g~~v~~d~~~nt~~~i~~G~~~-~~v~~~p~~pae~I~~~~~~~~ 378 (390) T protein:vir:10 331 GY--LIGGSAWIDPEPNTADILASGKAY-IDYDYTPVPPLENLVLRQRITD 378 (390) T ss_pred Cc--eeeeEEEEccCCCCHHHhhCCeEE-EEEEEEecCCcceEEEEEEEch Confidence 53 446888777666777787776655 7788888899999999999998 No 19 >protein:vir:78206 Length: 390 # NCBI annotation: gp33, phage tail sheath protein # Family: family:all:115 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111183;genbank:gi:134288694;genbank:GeneID:4960666 Probab=98.83 E-value=3.4e-08 Score=61.51 Aligned_cols=353 Identities=13% Similarity=0.006 Sum_probs=184.3 Q ss_pred CCCce---EE-EEEeeccccccccCccceEEEeccccccccc-chhhhheeecHHHHHhccCCCCHHHHHHHHHHhcCCc Q lcl|Aclame:pro 1 MPKQI---VE-IELTAEIADRPQETFTDAAIVGTAEEEPPDA-EFGEVNQYSTSTSVGDDYGEDSDVYTASEAIEEMGAE 75 (426) Q Consensus 1 mp~~i---Vn-V~isl~t~a~~~~~Fg~~Lilg~~~~~~~~~-~~~~~~~Yts~~~V~~Dfg~~sp~YkAA~~~f~Q~~~ 75 (426) ||.+. |. +.++-.++++..-....+.|+|.....++.. -+++..+.++..+....||.+...+.+...+|.|+-. T Consensus 1 M~~~~~~Gv~v~e~~~g~~~i~~~~tav~g~vg~a~~ad~~~~pln~pv~i~s~~~~~~~~g~~gtL~~al~~~~~~gg~ 80 (390) T protein:vir:78 1 MPQDYHHGVRVIEINEGGRPIRSVSTAVLGVVCTAADADASAFPLNTPVLLTNVVAALGKAGKKGTLRRTLDAIGKQTKP 80 (390) T ss_pred CcccccCCeEEEEcCCCcccccccCcceeEEEEcccCcCccccccccceEeccHHHHHhhcCCCceehhhhhhhccccCc Confidence 99533 22 3455678888888999999999764333322 1355566788888888999999999999999999754 Q ss_pred eeeeeeccccccccccccccceeccceeecccccccchhhhhhhcccccccccceeeeeeccccceeeechhheeeeccc Q lcl|Aclame:pro 76 QWRVMVLEATEVTEEELSDGDTIDKVPILGNHEVESPDGDIEFTTDDDPDVEDFDAEIVINSATGDVATSEDSIELTYFH 155 (426) Q Consensus 76 ~~~~~v~~~t~v~~~~~~~~~tv~~~~~s~~~~~~~ta~~i~~~~~~~~~~t~~~~~~~~~~~~g~~t~~~~~~~~~~~~ 155 (426) ........ . +.....+.. .+.+..... ........+......... 
T Consensus 81 ~~~vv~v~--~----~~~~~~~~~--~~ig~~~~~-~~~tg~~al~~~~~~~~~-------------------------- 125 (390) T protein:vir:78 81 LTVVVRVA--E----GKDADETTS--NVIGTVTPD-GKYTGIKALLAAQGALGV-------------------------- 125 (390) T ss_pred eEEEEEec--c----ccccccccc--ccccccccc-cccchhhhhhhhhhhhcc-------------------------- Confidence 33221110 0 000000000 000000000 000000000000000000 Q ss_pred ccchhhhhhhccccceeecccccchhhhHhHhhhhhhhhhcceEEEEEecccccccchhhHHHHHHHhhccCcceEEEEe Q lcl|Aclame:pro 156 ADWSQLDEFPSDVNNFAVADRRFDLKGVGVLDETHSWASDEDMGMIANGVNVDDYDSVDEAMDVAHEVAGYVPSGDLMMI 235 (426) Q Consensus 156 ~d~~~~~~~~s~~~~~~la~~~~~~~~~~~~~~~~~wa~~~~kl~~~~~~d~~~~~~~~~~~~~a~~~a~~~~rt~~~~~ 235 (426) ....+... ++........+..-++.-+ .+ ...|...-... ..+. ..+.++......+++ T Consensus 126 -------------~p~il~ap--~~~~~~v~~~l~~~a~~~~-~~--aivD~p~~~t~--~~a~-~~~~~~~s~~~~~~~ 184 (390) T protein:vir:78 126 -------------KPRILAAP--GLDTQPVAAALAATAQSLR-AM--AYVSASGCKTK--EEAA-AYRKQFGQREIMVIW 184 (390) T ss_pred -------------eehhhccc--ccchHHHHHHHHHhhcccc-eE--EEEecCCCCCH--HHHH-HHhhccCCceEEEEc Confidence 00000000 0000000111111111111 12 12221110111 1111 111111111122222 Q ss_pred cCC----------CccchhHHHHHHHhhhcccccceeecccccceeeccccccccc---------cccchhHHHHhhcCc Q lcl|Aclame:pro 236 VDA----------SDDDLAAYQLGKFAVSEPWYNPLWNELPAGETVSKNVGDPEEQ---------GTFEGGDEAEGEGPV 296 (426) Q Consensus 236 ~~~----------~~~~~~aa~~g~~~~~~p~~~~~~~~~~~~~~~~~~k~~~gv~---------~~~~~~~~~~~~~~~ 296 (426) +-. ..--|.++++|.++..+.-+.+ |+. +.++.+.||. 
......|...|+.++ T Consensus 185 p~~~~~d~~~~~~~~~p~s~~~Agl~a~~D~~~g~-~~s-------paN~~l~gi~~~~~~~~~~~~~~~~~~~~ln~~g 256 (390) T protein:vir:78 185 PDWLGWDDTTNSTAVIPAPAIAAGLRAKIDNDIGW-HKT-------ISNVVVNGVSGISADVSWDLQDPATDAGYLNEHE 256 (390) T ss_pred CceEeecccCCcccccchHHHHHHHHHHhhcCCCc-EEC-------cCCceeeceeecceecccccccccchhhhhhhcC Confidence 111 1224668888888877732211 111 1122222221 112223444565554 Q ss_pred -cEEEEEcCCEEEeeceeecCcccCcceeehhhhHHHHHHHHHHHHHHHHhcCCCCcccHHHHHHHHHHHHHHHHHhhcC Q lcl|Aclame:pro 297 -NVLIDVSDANRVSNAVTTAGADSDTSFFDIRRTKVYTAEMLELDLESLQVSDDDVPFTEDGQAMIEDAIKGTMSGLTGS 375 (426) Q Consensus 297 -N~~~~~~g~~~~~~~~t~~G~~~sg~~iD~i~g~dwl~~~iq~~l~~ll~~~~KIp~td~Gi~~i~~~v~~~l~~~v~~ 375 (426) |.++. +++..+|-+.|.++ +..=.||=++|..||+++.++..++..+- | |.+..-...|+..|+.-|..-+++ T Consensus 257 i~t~~~-~~G~~~wG~rT~s~-d~~~~~i~~rR~~~~i~~~i~~~~~~~v~---e-~n~~~~~~~i~~~i~~~L~~l~~~ 330 (390) T protein:vir:78 257 VTTLVN-RNGFRFWGERTCSD-DPKFAFENYTRTAQVAGDSIAEAQMPVVD---G-PLNPSLARDIVESINGWFRQQVAN 330 (390) T ss_pred cEEEEc-CCCEEEEcccccCC-CcccceeehhhHHHHHHHHHHHHHHHhcc---C-CCCHHHHHHHHHHHHHHHHHHHhC Confidence 77765 45577776666654 22336999999999999999999887653 3 889999999999999999999876 Q ss_pred CCccccceeEecCcccCcHHHHHhhcCCCeEEEEEEcccEEEEEEEEEEeC Q lcl|Aclame:pro 376 VGQPLAEYEVDVPEWDDDDVDRVNRNWGGIDLDARLAQRAHTFSLGLNVSV 426 (426) Q Consensus 376 ~g~~~~~y~~~~p~~~~~~~dra~R~~~~i~~~~~laGAIh~v~I~g~v~v 426 (426) |. +.+|.+.......+++|+.+-++. 
+++.+...-.++++.++....+ T Consensus 331 g~--l~g~~v~~d~~~nt~~~i~~G~~~-~~v~~~p~~pae~I~~~~~~~~ 378 (390) T protein:vir:78 331 GY--LIGGSAWIDPEPNTADILASGKAY-IDYDYTPVPPLENLVLRQRITD 378 (390) T ss_pred Cc--eeeeEEEEccCCCCHHHhhCCeEE-EEEEEEecCCcceEEEEEEEch Confidence 53 446888777666777787776655 7788888899999999999998 No 20 >protein:vir:5711 Length: 396 # NCBI annotation: gpFI # Family: family:all:115 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839870;genbank:gi:30065725;genbank:GeneID:1260618 Probab=98.79 E-value=4.7e-08 Score=60.77 Aligned_cols=359 Identities=15% Similarity=0.048 Sum_probs=187.3 Q ss_pred CCC---ceEEEEEeeccccccccCccceEEEeccccccccc-chhhhheeecHHHHHhccCCCCHHHHHHHHHHhcCCce Q lcl|Aclame:pro 1 MPK---QIVEIELTAEIADRPQETFTDAAIVGTAEEEPPDA-EFGEVNQYSTSTSVGDDYGEDSDVYTASEAIEEMGAEQ 76 (426) Q Consensus 1 mp~---~iVnV~isl~t~a~~~~~Fg~~Lilg~~~~~~~~~-~~~~~~~Yts~~~V~~Dfg~~sp~YkAA~~~f~Q~~~~ 76 (426) |+. -|-=+.+.-.+.++..-..+++.|+|.....++.. .+++..+.++..+....||.+.-.+.+-..+|.++-.. T Consensus 1 m~~~~~GV~v~e~~~g~~~i~~v~tav~~~vg~a~~~d~~~~~~~~pv~i~s~~~~~~~~g~~~tl~~al~~~~~~~~~~ 80 (396) T protein:vir:57 1 MSDYHHGVQVLEINDGTRVISTVSTAIVGMVCTASDADAETFPLNKPVLITNVQSAIAKAGKKGTLAASLQAIADQSKPV 80 (396) T ss_pred CCCCCCceEEEEcCCCcccccccCCceEEEEEeccCCCcccccCccCeEeecchhhhhhcccccchHHHHHHhhhcCCce Confidence 884 34444556677888899999999999764333211 14566678888888899999888888888999887444 Q ss_pred eeeeeccccccccccccccceeccceeecccccccchhhhhhhcccccccccceeeeeeccccceeeechhheeeecccc Q lcl|Aclame:pro 77 WRVMVLEATEVTEEELSDGDTIDKVPILGNHEVESPDGDIEFTTDDDPDVEDFDAEIVINSATGDVATSEDSIELTYFHA 156 (426) Q Consensus 77 ~~~~v~~~t~v~~~~~~~~~tv~~~~~s~~~~~~~ta~~i~~~~~~~~~~t~~~~~~~~~~~~g~~t~~~~~~~~~~~~~ 156 (426) ....................+. ..+.+... ..........+............. 
T Consensus 81 ~~vv~~~~~~~~~~~~~~a~t~--~~iiG~~~-~~~~~tgl~al~~~~~~~~~~p~i----------------------- 134 (396) T protein:vir:57 81 TVVVRVEDGTGDDEETKLAQTV--SNIIGTTD-ENGQYTGLKALMGAESVTGVKPRI----------------------- 134 (396) T ss_pred eEeeeccccccccccccccccc--eeeeeecc-ccccchhhhhhhhcccceeEEecc----------------------- Confidence 3332111111100000000000 00000000 000000000110000000000000 Q ss_pred cchhhhhhhccccceeecccccchhhhHhHhhhhhhhhhcceEEEEEecccccccchhhHHHHHHHhhccCcceEEEEec Q lcl|Aclame:pro 157 DWSQLDEFPSDVNNFAVADRRFDLKGVGVLDETHSWASDEDMGMIANGVNVDDYDSVDEAMDVAHEVAGYVPSGDLMMIV 236 (426) Q Consensus 157 d~~~~~~~~s~~~~~~la~~~~~~~~~~~~~~~~~wa~~~~kl~~~~~~d~~~~~~~~~~~~~a~~~a~~~~rt~~~~~~ 236 (426) .++ . ++........+..-++..+.+++ .+...-...+.....+ ..+-.....+++. T Consensus 135 ---------------~~a-p--~~~~~~v~~al~~~~~~~~~~~~---~d~p~~~~~~~~~~~~---~~~~s~~~~~~~p 190 (396) T protein:vir:57 135 ---------------LGV-P--GLDTKEVAVALASVCQELNAFGY---ISAWGCKTISEVKAYR---QNFSQRELMVIWP 190 (396) T ss_pred ---------------ccC-c--ccchhHHHHHHHHHhhhCceEEE---EcCCCCCCHHHHHHHH---hccCCceEEEEcc Confidence 000 0 00000001111111221121211 1111101111111111 1111111112211 Q ss_pred CC----------CccchhHHHHHHHhhhcccccceeecccccceeeccccccccc---------cccchhHHHHhhcCc- Q lcl|Aclame:pro 237 DA----------SDDDLAAYQLGKFAVSEPWYNPLWNELPAGETVSKNVGDPEEQ---------GTFEGGDEAEGEGPV- 296 (426) Q Consensus 237 ~~----------~~~~~~aa~~g~~~~~~p~~~~~~~~~~~~~~~~~~k~~~gv~---------~~~~~~~~~~~~~~~- 296 (426) -. ....|.++++|.++..+.-+.+ |+. + .++.+.||. ......|...|+.+. 
T Consensus 191 ~~~~~d~~~~~~~~~p~s~~~Ag~~a~~d~~~g~-~~s-p------aN~~l~gi~~~~~~~~~~~~~~~~~~~~Ln~~gi 262 (396) T protein:vir:57 191 DFLAWDTVTSTTATAYATARALGLRAKIDQEQGW-HKT-L------SNVGVNGVTGISASVFWDLQKPGTDADLLNEAGV 262 (396) T ss_pred eeeeecccCCceeEEehhHHHHHHHHHhhhccCc-Eec-c------CCceeccccccceecccccCCcchhhhhhhhcCc Confidence 10 1124678888888877743221 111 1 122222221 112344556666554 Q ss_pred cEEEEEcCCEEEeeceeecCcccCcceeehhhhHHHHHHHHHHHHHHHHhcCCCCcccHHHHHHHHHHHHHHHHHhhcCC Q lcl|Aclame:pro 297 NVLIDVSDANRVSNAVTTAGADSDTSFFDIRRTKVYTAEMLELDLESLQVSDDDVPFTEDGQAMIEDAIKGTMSGLTGSV 376 (426) Q Consensus 297 N~~~~~~g~~~~~~~~t~~G~~~sg~~iD~i~g~dwl~~~iq~~l~~ll~~~~KIp~td~Gi~~i~~~v~~~l~~~v~~~ 376 (426) |.++. +++..+|-+.|.++ +..=.||=++|..||++..|+..++..+-. |.+..=...|+..|+.-|..-+++| T Consensus 263 ~t~~~-~~G~~~wG~rT~~~-d~~~~~i~vrR~~~~i~~~i~~~~~~~v~e----~n~~~~~~~i~~~i~~~l~~l~~~g 336 (396) T protein:vir:57 263 TTLVR-RDGFRFWGNRTCSD-DPLFLFESYTRTAQVLADTMAEAHMWAIDK----PITATLIRDIIDGINAKFRELKNNG 336 (396) T ss_pred EEEEc-CCCEEEEcccccCC-CcccceeehhhHHHHHHHHHHHHHHHhccC----CCCHHHHHHHHHHHHHHHHHHHhCC Confidence 77765 45677776666654 223469999999999999999998876643 7788888999999999999988754 Q ss_pred CccccceeEecCcccCcHHHHHhhcCCCeEEEEEEcccEEEEEEEEEEeC Q lcl|Aclame:pro 377 GQPLAEYEVDVPEWDDDDVDRVNRNWGGIDLDARLAQRAHTFSLGLNVSV 426 (426) Q Consensus 377 g~~~~~y~~~~p~~~~~~~dra~R~~~~i~~~~~laGAIh~v~I~g~v~v 426 (426) . + -+|++.......+++|+.+.++. 
+++.+.....++++.++....+ T Consensus 337 a-l-~g~~v~~d~~~n~~~~i~~G~~~-~~v~~~p~~p~e~I~~~~~~~~ 383 (396) T protein:vir:57 337 Y-I-VDGTCWFSEESNDAETLKAGKLY-IDYDYTPVPPLENLTLRQRITS 383 (396) T ss_pred c-e-eceEEEEecCCCCHHHhhCCeEE-EEEEEEecCCcceEEEEEEEch Confidence 3 3 35777776666677788887776 8888899999999999999998 No 21 >protein:vir:1172 Length: 391 # NCBI annotation: hypothetical protein # Family: family:all:115 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490621;genbank:gi:17313241;genbank:GeneID:927300 Probab=98.75 E-value=4e-08 Score=61.14 Aligned_cols=358 Identities=12% Similarity=0.010 Sum_probs=182.6 Q ss_pred CC--C---ceEEEEEeeccccccccCccceEEEeccccccccc-chhhhheeecHHHHHhccCCCCHHHHHHHHHHhcCC Q lcl|Aclame:pro 1 MP--K---QIVEIELTAEIADRPQETFTDAAIVGTAEEEPPDA-EFGEVNQYSTSTSVGDDYGEDSDVYTASEAIEEMGA 74 (426) Q Consensus 1 mp--~---~iVnV~isl~t~a~~~~~Fg~~Lilg~~~~~~~~~-~~~~~~~Yts~~~V~~Dfg~~sp~YkAA~~~f~Q~~ 74 (426) |+ . -|-=+.+.-.++++.....+.+.|+|.....++.. ..++..+.++..+...-||.+...|.+...+|.|+. T Consensus 1 M~~~~~~~GV~v~e~~~~~~~i~~v~tavig~vg~a~~a~~~~~~~~~p~~v~s~~~~~~~~g~~~tl~~al~~~~~~~g 80 (391) T protein:vir:11 1 MAADQYHHGVRVQEINDGTRPIRTIATAIIGLVATAEDADATAFPLDTPVLITNVQAAIGKAGTSGTLPASLQAIADQAN 80 (391) T ss_pred CCCCcCCCcEEEEECCCCcceecccCCceeEEEEecCCCCCccccccccEEEecchhhheecCCCccchhhhhhhhcccc Confidence 44 1 23224566677778888899999999774333222 134566788888888889999999999999999974 Q ss_pred ceeeeeeccccccccccccccceeccceeecccccccchhhhhhhcccccccccceeeeeeccccceeeechhheeeecc Q lcl|Aclame:pro 75 EQWRVMVLEATEVTEEELSDGDTIDKVPILGNHEVESPDGDIEFTTDDDPDVEDFDAEIVINSATGDVATSEDSIELTYF 154 (426) Q Consensus 75 ~~~~~~v~~~t~v~~~~~~~~~tv~~~~~s~~~~~~~ta~~i~~~~~~~~~~t~~~~~~~~~~~~g~~t~~~~~~~~~~~ 154 (426) ......... . +.....+.. .+.+.......... ...+.............. 
T Consensus 81 ~~~~vv~~~--~----~~~~~~t~~--d~~g~~~a~~~~~g-~~a~~~~~~~~~~~p~~~-------------------- 131 (391) T protein:vir:11 81 AATVVVRVK--P----GEDEAATNS--AVIGGVSADGKYTG-MKALLAAKARLGVVPRIL-------------------- 131 (391) T ss_pred ceeEEeeec--c----cccccccch--hhhcccccccchhh-hhhhhhhhhhheeccccc-------------------- Confidence 433221110 0 000000000 00000000000000 000000000000000000 Q ss_pred cccchhhhhhhccccceeecccccchhhhHhHhhhhhhhhhcceEEEEEecccccccchhhHHHHHHHhhccCcceEEEE Q lcl|Aclame:pro 155 HADWSQLDEFPSDVNNFAVADRRFDLKGVGVLDETHSWASDEDMGMIANGVNVDDYDSVDEAMDVAHEVAGYVPSGDLMM 234 (426) Q Consensus 155 ~~d~~~~~~~~s~~~~~~la~~~~~~~~~~~~~~~~~wa~~~~kl~~~~~~d~~~~~~~~~~~~~a~~~a~~~~rt~~~~ 234 (426) .+ . ++........+..-++.-+ .++. .|...-......... +.++-.....++ T Consensus 132 ------------------~a-p--~~~~~~v~~al~~~~~~~~-~~~i--~D~p~~~t~~~a~~~---r~~~~s~~~~~~ 184 (391) T protein:vir:11 132 ------------------GV-P--GLDTQPVATALIAIAQQLR-AFAY--VSASGCKTKEEATAY---RENFAAREAMVI 184 (391) T ss_pred ------------------cc-c--ccccHHHHHHHHHhhcccc-eEEE--EEcCCCCCHHHHHHH---hhhcCCceEEEE Confidence 00 0 0000000001111111111 1111 111000001111111 111111111222 Q ss_pred ecCC----------CccchhHHHHHHHhhhc----ccccceeecccccceeeccccccccccccchhHHHHhhcCc-cEE Q lcl|Aclame:pro 235 IVDA----------SDDDLAAYQLGKFAVSE----PWYNPLWNELPAGETVSKNVGDPEEQGTFEGGDEAEGEGPV-NVL 299 (426) Q Consensus 235 ~~~~----------~~~~~~aa~~g~~~~~~----p~~~~~~~~~~~~~~~~~~k~~~gv~~~~~~~~~~~~~~~~-N~~ 299 (426) +.-. ...-|.++++|.++-.+ ||.++..+...+...... 
...-.......|...|+..+ |.+ T Consensus 185 ~p~~~~~~~~~~~~~~~p~s~~~ag~~a~~d~~~g~~~span~~l~gi~~~~~---~~~~~~~~~~~~~~~Ln~~gi~~~ 261 (391) T protein:vir:11 185 WPDFLTWSTVVNQTVPAPAVAQALGLRARIDQEVGWHKTLSNVAVNGVTGISA---DVFWDLQSPSTDANYLNENEVTTL 261 (391) T ss_pred cCcceecccccCceEEechHHHHHHHHHHhhccCCcEEccCCceeeceeeccc---ccccccCCCcchhhhhhhcCcEEE Confidence 2110 11236788888877666 444433232211111100 00001122345556666554 776 Q ss_pred EEEcCCEEEeeceeecCcccCcceeehhhhHHHHHHHHHHHHHHHHhcCCCCcccHHHHHHHHHHHHHHHHHhhcCCCcc Q lcl|Aclame:pro 300 IDVSDANRVSNAVTTAGADSDTSFFDIRRTKVYTAEMLELDLESLQVSDDDVPFTEDGQAMIEDAIKGTMSGLTGSVGQP 379 (426) Q Consensus 300 ~~~~g~~~~~~~~t~~G~~~sg~~iD~i~g~dwl~~~iq~~l~~ll~~~~KIp~td~Gi~~i~~~v~~~l~~~v~~~g~~ 379 (426) +. +++..+|-+.|.++ +..=.||=++|..||++..++..++..+-. |.+..=...|+..|+.-|++-+++|. + T Consensus 262 ~~-~~G~~~wG~rT~~~-d~~~~~i~vrR~~~~i~~~~~~~~~~~v~e----~n~~~~~~~i~~~i~~~l~~l~~~g~-l 334 (391) T protein:vir:11 262 VQ-EGGFRFWGSRTCSD-DPLFAFENYTRTAQVLADTIAEAHMWAVDK----PMHPSLVRDILEGVNAKFRELKGLGL-I 334 (391) T ss_pred Ec-CCCEEEEcccccCC-CcccceeehhhHHHHHHHHHHHHHHHhccC----CCCHHHHHHHHHHHHHHHHHHHhccc-e Confidence 54 55677777677654 233469999999999999999988866643 77888888999999999999987653 3 Q ss_pred ccceeEecCcccCcHHHHHhhcCCCeEEEEEEcccEEEEEEEEEEeC Q lcl|Aclame:pro 380 LAEYEVDVPEWDDDDVDRVNRNWGGIDLDARLAQRAHTFSLGLNVSV 426 (426) Q Consensus 380 ~~~y~~~~p~~~~~~~dra~R~~~~i~~~~~laGAIh~v~I~g~v~v 426 (426) -+|++...+...+++|+.+-++. +++.+.....++++.++..... 
T Consensus 335 -~g~~~~~~~~~n~~~~i~~G~~~-~~i~~~p~~p~e~i~~~~~~~~ 379 (391) T protein:vir:11 335 -IDAQAWYDPNVNDKDTLKAGKLR-ITYDYTPVPPLEDLTFFQKITD 379 (391) T ss_pred -eceEEEEecCCCCHHHhhCCeEE-EEEEEEecCCcceEEEEEEEch Confidence 35777766666677788876666 8888899999999999999988 No 22 >protein:vir:98553 Length: 395 # NCBI annotation: gp21 # Family: family:all:115 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958076;genbank:gi:41057373;genbank:GeneID:2744224 Probab=98.71 E-value=9e-08 Score=59.21 Aligned_cols=364 Identities=15% Similarity=0.020 Sum_probs=185.7 Q ss_pred CCCceEEE---EEeeccccccccCccceEEEeccccccccc-chhhhheeecHHHHHhccCCCCHHHHHHHHHHhcCCce Q lcl|Aclame:pro 1 MPKQIVEI---ELTAEIADRPQETFTDAAIVGTAEEEPPDA-EFGEVNQYSTSTSVGDDYGEDSDVYTASEAIEEMGAEQ 76 (426) Q Consensus 1 mp~~iVnV---~isl~t~a~~~~~Fg~~Lilg~~~~~~~~~-~~~~~~~Yts~~~V~~Dfg~~sp~YkAA~~~f~Q~~~~ 76 (426) |+.-.=.| .+.-.++++.....+++.|+|.....++.. .+++..+.++..+....||.+.-.+.+...+|.|+-.. T Consensus 1 m~~~~~GV~v~e~~~g~~~v~~v~tav~~~vgta~~~~~~~~p~~~pv~v~s~~~~~~~~g~~~tl~~al~~~~~~~~~~ 80 (395) T protein:vir:98 1 MSDFHHGTQVIEINDGTRVISTVATAVVGMVCTASDADATLFPLNEPVLITNVQSAIAKAGKKGTLAASLQAIADQSKPV 80 (395) T ss_pred CCCCCCCeEEEEcCCCcccccccCcceEEEEeeccCCCccccccccceEeechHHhHhhcccccchhhHHHHHhhccCce Confidence 99522233 345567777888899999999763222211 14455668888999999999988888999999997444 Q ss_pred eeeeeccccccccccccccceeccceeecccccccchhhhhhhcccccccccceeeeeeccccceeeechhheeeecccc Q lcl|Aclame:pro 77 WRVMVLEATEVTEEELSDGDTIDKVPILGNHEVESPDGDIEFTTDDDPDVEDFDAEIVINSATGDVATSEDSIELTYFHA 156 (426) Q Consensus 77 ~~~~v~~~t~v~~~~~~~~~tv~~~~~s~~~~~~~ta~~i~~~~~~~~~~t~~~~~~~~~~~~g~~t~~~~~~~~~~~~~ 156 (426) ....................+.. .+.+... ..........+......... 
T Consensus 81 ~~vv~~~~~~~~~~~~~~a~~~~--~i~g~~~-~~~~~Tgl~al~~~~~~~~~--------------------------- 130 (395) T protein:vir:98 81 TVVVRVEDGTGDDEEAALAQTVS--NIIGGTD-ENGKYTGIKALLTAQAVTGV--------------------------- 130 (395) T ss_pred EEEeecccccccccccccccccc--ccccccc-cccchhHHHHHhhhhhhhcc--------------------------- Confidence 33222111111111100000000 0000000 00000000001000000000 Q ss_pred cchhhhhhhccccceeecccccchhhhHhHhhhhhhhhhcceEEEEEecccccccchhhHHHHHHHhhccCcceEEEEec Q lcl|Aclame:pro 157 DWSQLDEFPSDVNNFAVADRRFDLKGVGVLDETHSWASDEDMGMIANGVNVDDYDSVDEAMDVAHEVAGYVPSGDLMMIV 236 (426) Q Consensus 157 d~~~~~~~~s~~~~~~la~~~~~~~~~~~~~~~~~wa~~~~kl~~~~~~d~~~~~~~~~~~~~a~~~a~~~~rt~~~~~~ 236 (426) ....+.........+ .+.+..-++.-+...+ .|...-...+ .+.+ -+.++-.....++++ T Consensus 131 ------------~p~il~ap~~~~~~v--~~al~~~~~~~~~~~~---~d~p~~~t~~--~a~~-~~~~~~s~~~~~~~p 190 (395) T protein:vir:98 131 ------------KPRILGVPGLDTKEV--AVALASAAIKLRAFAY---VSAWGCKTIS--EAME-YRKNFSQRELMVIWP 190 (395) T ss_pred ------------chhhcccccccccHH--HHHHHHHhhhcCcEEE---EEcCCCCCHH--HHHH-HHhccCCceEEEEec Confidence 000000000000000 1111111222111111 1111000001 1111 111111111222222 Q ss_pred CC----------CccchhHHHHHHHhhhcc----cccceeecccccceeeccccccccccccchhHHHHhhcCc-cEEEE Q lcl|Aclame:pro 237 DA----------SDDDLAAYQLGKFAVSEP----WYNPLWNELPAGETVSKNVGDPEEQGTFEGGDEAEGEGPV-NVLID 301 (426) Q Consensus 237 ~~----------~~~~~~aa~~g~~~~~~p----~~~~~~~~~~~~~~~~~~k~~~gv~~~~~~~~~~~~~~~~-N~~~~ 301 (426) -. ...-|.++++|.++..+. |.++..+...+.. .......-......+|...|+.+. |.++. 
T Consensus 191 ~~~~~d~~~~~~~~~p~s~~~AG~~a~~d~~~g~~~spaN~~i~gi~---~~~~~~~~~~~~~~~~~~~Ln~~gI~~~~~ 267 (395) T protein:vir:98 191 DFLAWDTVKNTTATAYATARALGLRAYIDQTVGWHKTLSNVGVQGVT---GISASVFWDLQASGTDADLLNEAGVTTLVR 267 (395) T ss_pred ceeEecccCCceeeechHHHHHHHHHHhhcccCcEeccCCceeeccc---ccceecccccCCCcchHHhhhhcCcEEEEc Confidence 11 112367888888887763 3322211111000 000000001123355666676544 77765 Q ss_pred EcCCEEEeeceeecCcccCcceeehhhhHHHHHHHHHHHHHHHHhcCCCCcccHHHHHHHHHHHHHHHHHhhcCCCcccc Q lcl|Aclame:pro 302 VSDANRVSNAVTTAGADSDTSFFDIRRTKVYTAEMLELDLESLQVSDDDVPFTEDGQAMIEDAIKGTMSGLTGSVGQPLA 381 (426) Q Consensus 302 ~~g~~~~~~~~t~~G~~~sg~~iD~i~g~dwl~~~iq~~l~~ll~~~~KIp~td~Gi~~i~~~v~~~l~~~v~~~g~~~~ 381 (426) +++..+|-+.|.++ +..=.||=++|-.||++..|+..++..+-. |.+..=+..|+..|+.-|.+-+++|. +- T Consensus 268 -~~G~~~wG~rT~s~-d~~~~~i~~rR~~~~i~~~i~~~~~~~v~e----~~~~~~~~~i~~~i~~~L~~l~~~g~--l~ 339 (395) T protein:vir:98 268 -KDGFRFWGNRTCSD-DPLFLFENYTRTAQVLADTMAEAHMWAVDK----PITATLIRDIVDGINAKFRELKSNGY--IV 339 (395) T ss_pred -CCCEEEEcccccCC-CcccceeehhhHHHHHHHHHHHHHHHhccC----CCCHHHHHHHHHHHHHHHHHHHhCCc--ee Confidence 45577776666554 233469999999999999999998876643 77888888999999999999887553 34 Q ss_pred ceeEecCcccCcHHHHHhhcCCCeEEEEEEcccEEEEEEEEEEeC Q lcl|Aclame:pro 382 EYEVDVPEWDDDDVDRVNRNWGGIDLDARLAQRAHTFSLGLNVSV 426 (426) Q Consensus 382 ~y~~~~p~~~~~~~dra~R~~~~i~~~~~laGAIh~v~I~g~v~v 426 (426) +|.+...+...+.+|+.+.++. 
+.+.+.....++++.++...++ T Consensus 340 g~~v~~d~~~nt~~~i~~G~~~-~~i~~~p~~p~e~I~~~~~~~~ 383 (395) T protein:vir:98 340 EGKCWFDEESNDKETLKAGKLY-IDYDYTPVPPLESLTLRQRITD 383 (395) T ss_pred ceEEEEecCCCCHHHhhCCeEE-EEEEEEecCCcceEEEEEEEch Confidence 5777776666677888887777 8889999999999999999999 No 23 >protein:vir:79141 Length: 391 # NCBI annotation: major tail seath protein gpFI # Family: family:all:115 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165276;genbank:gi:145708101;genbank:GeneID:5247152 Probab=98.69 E-value=5.1e-08 Score=60.58 Aligned_cols=360 Identities=13% Similarity=0.001 Sum_probs=182.0 Q ss_pred CCCc----eEEEEEeeccccccccCccceEEEeccccccccc-chhhhheeecHHHHHhccCCCCHHHHHHHHHHhcCCc Q lcl|Aclame:pro 1 MPKQ----IVEIELTAEIADRPQETFTDAAIVGTAEEEPPDA-EFGEVNQYSTSTSVGDDYGEDSDVYTASEAIEEMGAE 75 (426) Q Consensus 1 mp~~----iVnV~isl~t~a~~~~~Fg~~Lilg~~~~~~~~~-~~~~~~~Yts~~~V~~Dfg~~sp~YkAA~~~f~Q~~~ 75 (426) ||.+ |==+.++-.++++.......+-|+|.....++.. .+++..+.+|.++-..-||.+.-.+.+-..+|.|+-. T Consensus 1 M~~~~~pGv~v~e~~~~~~~i~~~~tav~~~vg~a~~a~~~~~p~n~pv~iss~~~~~~~~g~~gtl~~al~~~~~~gg~ 80 (391) T protein:vir:79 1 MPTDYHHGVRVVELNDGTRPIRTIETAVAGIVCTADDADAATFPLDTPVLLTNPQAYIGKAGDKGTLAHTLDAITDQTNP 80 (391) T ss_pred CCCCCCCCeEEEECCCCcccccccCCceEEEEeecccccccccccccCEEeccHHHHHHhcCCccccchhhhhhhccccc Confidence 9942 2223455677788888999999999874322221 1456678999999888899888888888999998733 Q ss_pred eeeeeeccccccccccccccceeccceeecccccccchhhhhhhcccccccccceeeeeeccccceeeechhheeeeccc Q lcl|Aclame:pro 76 QWRVMVLEATEVTEEELSDGDTIDKVPILGNHEVESPDGDIEFTTDDDPDVEDFDAEIVINSATGDVATSEDSIELTYFH 155 (426) Q Consensus 76 ~~~~~v~~~t~v~~~~~~~~~tv~~~~~s~~~~~~~ta~~i~~~~~~~~~~t~~~~~~~~~~~~g~~t~~~~~~~~~~~~ 155 (426) ........... ... .....+.+..... ........+.............. 
T Consensus 81 ~~~vv~~~~~~------~~~--~~~~~~~g~~~~~-~~~tGl~~l~~~~~~~~~~p~~l--------------------- 130 (391) T protein:vir:79 81 LTVVVRVAGGA------SEA--ETTSNLIGTTNAA-GRYTGMKALLTARNRFGVAPRIL--------------------- 130 (391) T ss_pred ceeeecccccc------ccc--cccccccccccch-hhhHHHhhhhhhhhhhcccchhh--------------------- Confidence 32221111000 000 0000000000000 00000000100000000000000 Q ss_pred ccchhhhhhhccccceeecccccchhhhHhHhhhhhhhhhcceEEEEEecccccccchhhHHHHHH-----HhhccCcce Q lcl|Aclame:pro 156 ADWSQLDEFPSDVNNFAVADRRFDLKGVGVLDETHSWASDEDMGMIANGVNVDDYDSVDEAMDVAH-----EVAGYVPSG 230 (426) Q Consensus 156 ~d~~~~~~~~s~~~~~~la~~~~~~~~~~~~~~~~~wa~~~~kl~~~~~~d~~~~~~~~~~~~~a~-----~~a~~~~rt 230 (426) ++ .++........+...++.-+..++....... ..+....... ..+.++|. T Consensus 131 -----------------~~---p~~~~~~v~~al~~~~~~~~~~ai~d~p~~~---t~~~a~~~~~~~~s~~~a~~~P~- 186 (391) T protein:vir:79 131 -----------------AV---PGLDSLPVGTELVTIAQKLRAFAYLSAYGCQ---TKEEAVAYRSNFGQREAMVMWPD- 186 (391) T ss_pred -----------------cC---CccchhHHHHHHHHHHhhcCcEEEEECCCCC---CHHHHHHHHhccCCceeEEecce- Confidence 00 0000111112222223222222222111111 1111111000 00111211 Q ss_pred EEEEecCC---CccchhHHHHHHHhhhc----ccccceeecccccceeeccccccccccccchhHHHHhhcCc-cEEEEE Q lcl|Aclame:pro 231 DLMMIVDA---SDDDLAAYQLGKFAVSE----PWYNPLWNELPAGETVSKNVGDPEEQGTFEGGDEAEGEGPV-NVLIDV 302 (426) Q Consensus 231 ~~~~~~~~---~~~~~~aa~~g~~~~~~----p~~~~~~~~~~~~~~~~~~k~~~gv~~~~~~~~~~~~~~~~-N~~~~~ 302 (426) ...+.+.. ....|.++++|.++..+ ||.++..+...+.. .......-.......|...|+.+. |.++. 
T Consensus 187 ~~~~d~~~~~~~~~p~s~~~AG~~a~~D~~~g~~~spaN~~l~gi~---~~~~~~~~~~~~~~~~~~~Ln~~~I~t~~~- 262 (391) T protein:vir:79 187 FVGWDTAANAETTLWATARAVGLRAKIDNDTGWHKTLSNVAVGGVT---GLSRDVFWDLQDPATDAGYLNANEVTTLVH- 262 (391) T ss_pred eeeecCcCCceeeechHHHHHHHHHHhhhcccceeccCCceehhhh---ccccccccccccccchhhhhhhcCceEEEC- Confidence 11121111 12346788888888777 33332222111100 000000000111222334454333 66654 Q ss_pred cCCEEEeeceeecCcccCcceeehhhhHHHHHHHHHHHHHHHHhcCCCCcccHHHHHHHHHHHHHHHHHhhcCCCccccc Q lcl|Aclame:pro 303 SDANRVSNAVTTAGADSDTSFFDIRRTKVYTAEMLELDLESLQVSDDDVPFTEDGQAMIEDAIKGTMSGLTGSVGQPLAE 382 (426) Q Consensus 303 ~g~~~~~~~~t~~G~~~sg~~iD~i~g~dwl~~~iq~~l~~ll~~~~KIp~td~Gi~~i~~~v~~~l~~~v~~~g~~~~~ 382 (426) +++..+|-+.|+++ +..=.||=++|..||++..|+..++..+-. |.+..-...|+..|+.-|.+-+++|. +.+ T Consensus 263 ~~G~~~wG~rT~~~-d~~~~~i~~rR~~~~i~~~i~~~~~~~v~e----pn~~~~~~~i~~~i~~~l~~l~~~g~--l~g 335 (391) T protein:vir:79 263 RDGYRFWGSRTCSA-DPLFAFENYTRTAQVLADTMAEAHMWANDL----PMTPTLVRDLLEGINAKLRMLTRNGY--LLG 335 (391) T ss_pred CCcEEEEcccccCC-CcccceeehhhHHHHHHHHHHHHHHHhccC----CCCHHHHHHHHHHHHHHHHHHHhCCc--eec Confidence 45577776666654 223359999999999999999998876642 88999999999999999999987664 346 Q ss_pred eeEecCcccCcHHHHHhhcCCCeEEEEEEcccEEEEEEEEEEeC Q lcl|Aclame:pro 383 YEVDVPEWDDDDVDRVNRNWGGIDLDARLAQRAHTFSLGLNVSV 426 (426) Q Consensus 383 y~~~~p~~~~~~~dra~R~~~~i~~~~~laGAIh~v~I~g~v~v 426 (426) |++.......+++|+.+-++. +++.+...-.+.++.++-.... 
T Consensus 336 ~~v~~~~~~nt~~~i~~G~~~-~~i~~~p~~p~e~i~~~~~~~~ 378 (391) T protein:vir:79 336 GAAWFDADANSKDTLKAGQLA-IDYDYTPVPPLENLTFRQRITD 378 (391) T ss_pred eEEEEecCCCCHHHhhCCEEE-EEEEEEecCCcceEEEEEEEch Confidence 777765555666777765555 7788888889999999998888 No 24 >protein:vir:1845 Length: 392 # NCBI annotation: tail sheath protein FI # Family: family:all:115 # MgeID: mge:324 # MgeName: 186 # Cross-refs: genbank:acc:NP_052269;genbank:gi:9634076;genbank:GeneID:1262448 Probab=98.55 E-value=3e-07 Score=56.33 Aligned_cols=359 Identities=15% Similarity=0.026 Sum_probs=184.8 Q ss_pred CCC---ceEEEEEeeccccccccCccceEEEeccccccccc-chhhhheeecHHHHHhccCCCCHHHHHHHHHHhcCCce Q lcl|Aclame:pro 1 MPK---QIVEIELTAEIADRPQETFTDAAIVGTAEEEPPDA-EFGEVNQYSTSTSVGDDYGEDSDVYTASEAIEEMGAEQ 76 (426) Q Consensus 1 mp~---~iVnV~isl~t~a~~~~~Fg~~Lilg~~~~~~~~~-~~~~~~~Yts~~~V~~Dfg~~sp~YkAA~~~f~Q~~~~ 76 (426) |+. -|-=+.+.-.+.++..-+-+.+-|+|.....++.. .+++..+.++.++-..-||.+.-...+...+|.|+-.. T Consensus 1 m~~~~~Gv~v~e~~~g~~~i~~~~tav~g~vgta~~~~~~~~~~~~p~~its~~~~~~~~g~~gtl~~al~~~~~ngg~~ 80 (392) T protein:vir:18 1 MSDFHHGTKVIEINDGTRVISTVATAIVGMVWTASDADAETFPLNEPVLITNVQSAIAKAGKKGTLSASLQAIADQSKPV 80 (392) T ss_pred CCCCCCCeEEEEcCCCceeeeccCcceeEEEEeccCCCCcccccccceEeechHHHHhhcCCCcchHHHHHHhhcccCce Confidence 884 33334455567888888889999999764332221 24566678888888888999888888999999987433 Q ss_pred eeeeeccccccccccccccceeccceeecccccccchhhhhhhcccccccccceeeeeeccccceeeechhheeeecccc Q lcl|Aclame:pro 77 WRVMVLEATEVTEEELSDGDTIDKVPILGNHEVESPDGDIEFTTDDDPDVEDFDAEIVINSATGDVATSEDSIELTYFHA 156 (426) Q Consensus 77 ~~~~v~~~t~v~~~~~~~~~tv~~~~~s~~~~~~~ta~~i~~~~~~~~~~t~~~~~~~~~~~~g~~t~~~~~~~~~~~~~ 156 (426) ........-. ..+...+....+.+..... ........+............ 
T Consensus 81 ~~vv~v~~~~-----~~~~~~~t~~dliG~~~~~-~~~tg~~al~~~~~~~~~~p~------------------------ 130 (392) T protein:vir:18 81 TVVVRVAEGT-----GDDAEAQTTSNIIGGTDEN-GKYTGIKALLTAEAVTGVKPR------------------------ 130 (392) T ss_pred EEEecccccc-----cccccccchhhheeccccc-chhhhHHHHHhhhhhhceeeh------------------------ Confidence 2221100000 0000000000000000000 000000011110000000000 Q ss_pred cchhhhhhhccccceeecccccchhhhHhHhhhhhhhhhcceEEEEEecccccccchhhHHHHHHHhhccCcceEEEEec Q lcl|Aclame:pro 157 DWSQLDEFPSDVNNFAVADRRFDLKGVGVLDETHSWASDEDMGMIANGVNVDDYDSVDEAMDVAHEVAGYVPSGDLMMIV 236 (426) Q Consensus 157 d~~~~~~~~s~~~~~~la~~~~~~~~~~~~~~~~~wa~~~~kl~~~~~~d~~~~~~~~~~~~~a~~~a~~~~rt~~~~~~ 236 (426) .+... ++........+..-++.-..+.+ .+...-...+..... +..+-.+...+++. T Consensus 131 ---------------il~ap--~~~~~~v~~~l~~~~~~~~~~~~---~d~~~~~~~~~a~~~---~~~~~s~~~~~~~p 187 (392) T protein:vir:18 131 ---------------ILGVP--GLDTQEVATALASVCISLRAFGY---VSAWGCKTISEAMAY---RENFSQRELMVIWP 187 (392) T ss_pred ---------------hcccC--ccchHHHHHHHHHHHhhcCcEEE---EecCCCCCHHHHHHH---HhhccCceEEEEeC Confidence 00000 00000001111111221111211 111110111111111 11111111112211 Q ss_pred C-------C---CccchhHHHHHHHhhhc----ccccceeecccccceeeccccccccc--cccchhHHHHhhcCc-cEE Q lcl|Aclame:pro 237 D-------A---SDDDLAAYQLGKFAVSE----PWYNPLWNELPAGETVSKNVGDPEEQ--GTFEGGDEAEGEGPV-NVL 299 (426) Q Consensus 237 ~-------~---~~~~~~aa~~g~~~~~~----p~~~~~~~~~~~~~~~~~~k~~~gv~--~~~~~~~~~~~~~~~-N~~ 299 (426) - . ...-|.++++|.++..+ ||.++..+...+...... .+. ......|...|+.+. 
|.+ T Consensus 188 ~~~~~d~~~~~~~~~p~s~~~AG~~a~~d~~~g~~~spaN~~l~gi~~~~~-----~~~~~~~~~~~~~~~Ln~~gI~t~ 262 (392) T protein:vir:18 188 DFLAWDTTANATATAYATARALGLRAYIDQTIGWHKTLSNVGVQGVTGISA-----SVFWDLQASGTDADLLNEAGVTTL 262 (392) T ss_pred ceeeecccCCceEEechHHHHHHHHHhhhccCCceEccCCceeeceeecce-----ecccccCCCcchhhhhhhcCceEE Confidence 1 0 11236788888887766 333332222111111000 011 123345566676554 887 Q ss_pred EEEcCCEEEeeceeecCcccCcceeehhhhHHHHHHHHHHHHHHHHhcCCCCcccHHHHHHHHHHHHHHHHHhhcCCCcc Q lcl|Aclame:pro 300 IDVSDANRVSNAVTTAGADSDTSFFDIRRTKVYTAEMLELDLESLQVSDDDVPFTEDGQAMIEDAIKGTMSGLTGSVGQP 379 (426) Q Consensus 300 ~~~~g~~~~~~~~t~~G~~~sg~~iD~i~g~dwl~~~iq~~l~~ll~~~~KIp~td~Gi~~i~~~v~~~l~~~v~~~g~~ 379 (426) +. +++..+|-+.|.++- ..=.||=++|..||++..|+..++..+- | |.+..-...|+..|+.-|.+-+++|. + T Consensus 263 ~~-~~G~~~wG~rT~~~d-~~~~~i~~rR~~~~i~~~i~~~~~~~v~---e-~n~~~~~~~i~~~i~~~L~~l~~~ga-l 335 (392) T protein:vir:18 263 VR-KDGFRFWGNRTCSDD-PLFLFENYTRTAQVLADTMAEAHMWAVD---K-PITASLIRDIVDGINAKFRELKSNGY-I 335 (392) T ss_pred Ec-CCCEEEEcccccCCC-cccceeehhhHHHHHHHHHHHHHHHhcc---C-CCCHHHHHHHHHHHHHHHHHHHhcCc-c Confidence 65 556788877776542 2335999999999999999999887664 2 88999999999999999999987653 3 Q ss_pred ccceeEecCcccCcHHHHHhhcCCCeEEEEEEcccEEEEEEEEEEeC Q lcl|Aclame:pro 380 LAEYEVDVPEWDDDDVDRVNRNWGGIDLDARLAQRAHTFSLGLNVSV 426 (426) Q Consensus 380 ~~~y~~~~p~~~~~~~dra~R~~~~i~~~~~laGAIh~v~I~g~v~v 426 (426) -+|++.......+++|+.+.++. +++.+.....++++.++..... 
T Consensus 336 -~g~~v~~d~~~nt~~~i~~G~~~-~~v~~~p~~p~e~I~~~~~~~~ 380 (392) T protein:vir:18 336 -VDGECWFDEESNDKETLKAGKLY-IDYDYTPVPPLESLTLRQRITD 380 (392) T ss_pred -cceEEEEecCCCCHHHhhCCeEE-EEEEEEecCCcceEEEEEEEch Confidence 35777765555667788887776 8888889999999999999888 No 25 >protein:vir:96740 Length: 388 # NCBI annotation: phage tail sheath protein FI-like # Family: family:all:115 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039839;genbank:gi:126010913;genbank:GeneID:5076250 Probab=98.52 E-value=3.7e-07 Score=55.83 Aligned_cols=353 Identities=14% Similarity=0.041 Sum_probs=181.9 Q ss_pred CCC------ceEEEEEeeccccccccCccceEEEecccccccccchhhhheeecHHHHHh---ccCCCCHHHHHHHHHHh Q lcl|Aclame:pro 1 MPK------QIVEIELTAEIADRPQETFTDAAIVGTAEEEPPDAEFGEVNQYSTSTSVGD---DYGEDSDVYTASEAIEE 71 (426) Q Consensus 1 mp~------~iVnV~isl~t~a~~~~~Fg~~Lilg~~~~~~~~~~~~~~~~Yts~~~V~~---Dfg~~sp~YkAA~~~f~ 71 (426) ||. -|-=+.++-.++++.......+.|+|.....++...+++-..-.+..+.+. ..+.....+.+...+|. T Consensus 1 m~~~~~~~hGv~v~ev~~g~~~i~~~~tavi~~Vgta~~ad~~~p~~~~~~i~~~~d~~~~~~~~~~~gtl~~al~~~~~ 80 (388) T protein:vir:96 1 MPVIDQFEHNGISIETHEPPPPMGPPGDNVVAWVVTAPDKHADVAFSVPFRVANTADAQYLDSTGNELGTGWHAASETLK 80 (388) T ss_pred CCCCCCCCCceEEEEcCCCcccccccCcceeEEEEecCCCccccccccceeeecchhhhhhhccccccccchhhhHhhhc Confidence 882 334456777888899999999999998744443333444444334444333 33445667788888998 Q ss_pred cCCceeeeeeccccccccccccccceeccceeecccccccchhhhhhhcccccccccceeeeeeccccceeeechhheee Q lcl|Aclame:pro 72 MGAEQWRVMVLEATEVTEEELSDGDTIDKVPILGNHEVESPDGDIEFTTDDDPDVEDFDAEIVINSATGDVATSEDSIEL 151 (426) Q Consensus 72 Q~~~~~~~~v~~~t~v~~~~~~~~~tv~~~~~s~~~~~~~ta~~i~~~~~~~~~~t~~~~~~~~~~~~g~~t~~~~~~~~ 151 (426) |+.......... .+.....+... +.+..... + ....++. 
T Consensus 81 ~~~~~~~vv~v~------~g~~~~at~a~--iig~~~~~-t--g~~~gl~------------------------------ 119 (388) T protein:vir:96 81 KTSVPQYFIVVP------EGADDAATMAN--IIGGIDPT-T--GRRTGIA------------------------------ 119 (388) T ss_pred cCCceEEEEEec------cccccccccce--eeeecccc-c--chhhHHH------------------------------ Confidence 874333221110 01111100000 00000000 0 0000000 Q ss_pred ecccccchhhhhhhccccceeecccccch-hhhHhHhhhhhhhhhcceEEEEEecccccccchhhHHHHHHH-hhccC-c Q lcl|Aclame:pro 152 TYFHADWSQLDEFPSDVNNFAVADRRFDL-KGVGVLDETHSWASDEDMGMIANGVNVDDYDSVDEAMDVAHE-VAGYV-P 228 (426) Q Consensus 152 ~~~~~d~~~~~~~~s~~~~~~la~~~~~~-~~~~~~~~~~~wa~~~~kl~~~~~~d~~~~~~~~~~~~~a~~-~a~~~-~ 228 (426) .+.+.....+....+ ++ +.......+..-++.-+.+.+ .|... +..+........ ...++ . T Consensus 120 --------al~~~~~~p~il~aP----g~s~~~~v~~al~~~~~~~~~~~i---~D~p~-~~~~~~~~~~~~~~~~~~~s 183 (388) T protein:vir:96 120 --------ALTECTERPTLIGAP----GFSQNKAVIDALASMAKRLKCRAV---IDGPS-GSTQDAIDLSGLLGGEGTGH 183 (388) T ss_pred --------HhhhcccceeEEEee----ccccchHHHHHHHHHHhhcCcEEE---EeccC-CchhHHHHHHhhhhccCcCc Confidence 000000000000001 11 000111122222222222222 22111 111111111111 11111 1 Q ss_pred ceEEEEecC-------C---CccchhHHHHHHHhhhcccccceeecccccceeeccccccccccccchhHHHHhhcCc-c Q lcl|Aclame:pro 229 SGDLMMIVD-------A---SDDDLAAYQLGKFAVSEPWYNPLWNELPAGETVSKNVGDPEEQGTFEGGDEAEGEGPV-N 297 (426) Q Consensus 229 rt~~~~~~~-------~---~~~~~~aa~~g~~~~~~p~~~~~~~~~~~~~~~~~~k~~~gv~~~~~~~~~~~~~~~~-N 297 (426) ....+++.- . ....|.++++|.++..+||..+..+... . .......+-.-....+|...|+.+. 
| T Consensus 184 ~~~~~~~P~~~~~d~~~~~~~~~p~s~~~AG~~a~~D~~~spaN~~i~-i---~g~~~~~~~~~~~~~~~~~~Ln~~gI~ 259 (388) T protein:vir:96 184 DRVYMVDPMPAIYSRKAQGNIYVPPSTIAMGAVAAVKPWESPGNQGVL-I---QDVARVIDYNILDKSTEGDLLNRNGVS 259 (388) T ss_pred ceEEEEeCceeeecccCCceeeechHHHHHHHHHhhcCcccccCeeEE-e---eeecccccccccCChhhHHhhhhcCce Confidence 122222211 1 1234778899999999997665433221 0 0111110101123445556666554 8 Q ss_pred EEEEE-cCCEEEeeceeecCcccCcceeehhhhHHHHHHHHHHHHHHHHhcCCCCcccHHHHHHHHHHHHHHHHHhhcCC Q lcl|Aclame:pro 298 VLIDV-SDANRVSNAVTTAGADSDTSFFDIRRTKVYTAEMLELDLESLQVSDDDVPFTEDGQAMIEDAIKGTMSGLTGSV 376 (426) Q Consensus 298 ~~~~~-~g~~~~~~~~t~~G~~~sg~~iD~i~g~dwl~~~iq~~l~~ll~~~~KIp~td~Gi~~i~~~v~~~l~~~v~~~ 376 (426) .++.. +++..+|-+.|++ ..||=++|..+|++..|+..++..+- + |.+..=...|+..|+.-|.+-+++| T Consensus 260 ~i~~~~~~G~~~wG~rT~~-----~~~i~vrR~~~~i~~si~~~~~~~v~---e-pn~~~~~~~i~~~i~~fL~~l~~~G 330 (388) T protein:vir:96 260 YFARTSMGGFSLIGNRTVT-----GKFISFVGLEDAIARKLEAASQRAMS---K-QLTKSFMEQEIKKINLFMQDLVAAE 330 (388) T ss_pred EEEEecCCcEEEEcccccC-----CcceeehhhHHHHHHHHHHHHHHhcc---C-CCCHHHHHHHHHHHHHHHHHHHhCC Confidence 88876 4577788776664 47999999999999999988876553 3 7788888899999999999888755 Q ss_pred CccccceeEecCcccCcHHHHHhhcCCCeEEEEEEcccEEEEEEEEEEeC Q lcl|Aclame:pro 377 GQPLAEYEVDVPEWDDDDVDRVNRNWGGIDLDARLAQRAHTFSLGLNVSV 426 (426) Q Consensus 377 g~~~~~y~~~~p~~~~~~~dra~R~~~~i~~~~~laGAIh~v~I~g~v~v 426 (426) .+. +|++..-....+++|+.+-++. +.+.+...-.++++.++...+. 
T Consensus 331 -al~-g~~~~~d~~~nt~~~i~~G~~~-~~i~~~p~~pae~I~~~~~~~~ 377 (388) T protein:vir:96 331 -IIP-GGEVYLHPTLNTVERYKNGSWY-IVIDYGRYSPNEHMIFHLNAVD 377 (388) T ss_pred -cee-eeEEEEecCCCCHHHhhCCEEE-EEEEEEecCCcceEEEEEEEch Confidence 333 5777665555667777775555 7777888899999999999988 No 26 >protein:vir:79181 Length: 390 # NCBI annotation: gp30, phage tail sheath protein # Family: family:all:115 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111061;genbank:gi:134288772;genbank:GeneID:4960700 Probab=98.45 E-value=6.2e-07 Score=54.63 Aligned_cols=358 Identities=11% Similarity=-0.013 Sum_probs=183.0 Q ss_pred CCCce---EEE-EEeeccccccccCccceEEEeccccccccc-chhhhheeecHHHHHhccCCCCHHHHHHHHHHhcCCc Q lcl|Aclame:pro 1 MPKQI---VEI-ELTAEIADRPQETFTDAAIVGTAEEEPPDA-EFGEVNQYSTSTSVGDDYGEDSDVYTASEAIEEMGAE 75 (426) Q Consensus 1 mp~~i---VnV-~isl~t~a~~~~~Fg~~Lilg~~~~~~~~~-~~~~~~~Yts~~~V~~Dfg~~sp~YkAA~~~f~Q~~~ 75 (426) ||.+. |.| .+.-.+.++.......+.|+|.....++.. ..++..+-+|..+...-||.+.-.+.+...+|.|+-. T Consensus 1 M~~~~~~Gv~v~e~~~~~~~i~~~~tav~~~vg~a~dad~~~~p~n~pv~its~~~~~~~~g~~~tL~~al~~~~~~~~~ 80 (390) T protein:vir:79 1 MPQDYHHGVRVIEINEGGRPIRSVSTAVLGVVCTAADADASAFPLNTPVLLTNVVAALGKAGKKGTLRRTLDAIGKQTKP 80 (390) T ss_pred CccccCCCeEEEEcCCCcccccccCCceeEEEEecCCCCccccccccceEeecHHHHHHhcCCCccchhhhhhhcccccc Confidence 99422 333 556778888888889999999764332211 1345566778877777799988888999999999754 Q ss_pred eeeeeeccccccccccccccceeccceeecccccccchhhhhhhcccccccccceeeeeeccccceeeechhheeeeccc Q lcl|Aclame:pro 76 QWRVMVLEATEVTEEELSDGDTIDKVPILGNHEVESPDGDIEFTTDDDPDVEDFDAEIVINSATGDVATSEDSIELTYFH 155 (426) Q Consensus 76 ~~~~~v~~~t~v~~~~~~~~~tv~~~~~s~~~~~~~ta~~i~~~~~~~~~~t~~~~~~~~~~~~g~~t~~~~~~~~~~~~ 155 (426) ......... +.+...+.. ...+..... ........+......... 
T Consensus 81 ~~~vv~v~~------~~~~~~~~~--~~ig~~~~~-~~~tgl~al~~~~~~~~~-------------------------- 125 (390) T protein:vir:79 81 LTVVVRVAE------GKDADETTS--NVIGTVTPD-GKYTGIKALLAAQGALGV-------------------------- 125 (390) T ss_pred eEEEEeecc------ccccccccc--eeeeccccc-ccchhhhhhhhhhhhhcc-------------------------- Confidence 333221110 000000000 000000000 000000000000000000 Q ss_pred ccchhhhhhhccccceeecccccchhhhHhHhhhhhhhhhcceEEEEEecccccccchhhHHHHHHHhhccCcceEEEEe Q lcl|Aclame:pro 156 ADWSQLDEFPSDVNNFAVADRRFDLKGVGVLDETHSWASDEDMGMIANGVNVDDYDSVDEAMDVAHEVAGYVPSGDLMMI 235 (426) Q Consensus 156 ~d~~~~~~~~s~~~~~~la~~~~~~~~~~~~~~~~~wa~~~~kl~~~~~~d~~~~~~~~~~~~~a~~~a~~~~rt~~~~~ 235 (426) ....+....... ......+..-++..+.+.+ .|.. ......... ..+.++-.....+++ T Consensus 126 -------------~p~il~ap~~~~--~~v~~~l~~~a~~~~~~ai---~D~p--~~~t~~~a~-~~~~~~~s~~~~~~~ 184 (390) T protein:vir:79 126 -------------KPRILAAPGLDT--QPVAAALAATAQSLRAMAY---VSAS--GCKTKEEAA-AYRRQFGQREIMVIW 184 (390) T ss_pred -------------ccccccCCcccc--hHHHHHHHHhhhhcceEEE---EEcc--CCCCHHHHH-HHhcCCCCceEEEEc Confidence 000000000000 0011111112222222222 1211 000011111 111111111222222 Q ss_pred cCC----------CccchhHHHHHHHhhhc----ccccceeecccccceeeccccccccccccchhHHHHhhcCc-cEEE Q lcl|Aclame:pro 236 VDA----------SDDDLAAYQLGKFAVSE----PWYNPLWNELPAGETVSKNVGDPEEQGTFEGGDEAEGEGPV-NVLI 300 (426) Q Consensus 236 ~~~----------~~~~~~aa~~g~~~~~~----p~~~~~~~~~~~~~~~~~~k~~~gv~~~~~~~~~~~~~~~~-N~~~ 300 (426) .-. ....|.++++|.++..+ ||.++..+...+.... ...-... 
......|...|+.++ |.++ T Consensus 185 p~~~~~d~~~~~~~~~p~s~~~Ag~~a~~D~~~g~~~spsN~~i~gi~~~-~~~~~~~--~~~~~~~a~~Ln~~gi~t~~ 261 (390) T protein:vir:79 185 PDWLGWDDTTNSTAVIPAPAIAAGLRAKIDNDIGWHKTISNVVVNGVSGI-SADVSWD--LQDPATDAGYLNEHEVTTLV 261 (390) T ss_pred CceeecccccCceeEeehHHHHHHHHHhhhccCCcEEccCCceeecccee-eeecccc--ccccchhhhhhhhcCcEEEE Confidence 111 11237788889888777 3333322211111000 0000000 111222344555544 6765 Q ss_pred EEcCCEEEeeceeecCcccCcceeehhhhHHHHHHHHHHHHHHHHhcCCCCcccHHHHHHHHHHHHHHHHHhhcCCCccc Q lcl|Aclame:pro 301 DVSDANRVSNAVTTAGADSDTSFFDIRRTKVYTAEMLELDLESLQVSDDDVPFTEDGQAMIEDAIKGTMSGLTGSVGQPL 380 (426) Q Consensus 301 ~~~g~~~~~~~~t~~G~~~sg~~iD~i~g~dwl~~~iq~~l~~ll~~~~KIp~td~Gi~~i~~~v~~~l~~~v~~~g~~~ 380 (426) . +++..+|-+.|.++ +..=.||=++|..||++..|+..++..+-. |.+..=...|+..|+.-|..-+++|. + T Consensus 262 ~-~~G~~~wG~rT~~~-d~~~~~i~vrR~~~~i~~~i~~~~~~~v~e----~~~~~~~~~i~~~i~~~L~~l~~~ga--l 333 (390) T protein:vir:79 262 N-RNGFRFWGERTCSD-DPKFAFENYTRTAQVAADSIAEAQMPVVDG----PLNPSLARDIVESINGWFRQQVANGY--L 333 (390) T ss_pred c-CCCEEEEeccccCC-CcccceeeehhhHHHHHHHHHHHHHHhccC----CCCHHHHHHHHHHHHHHHHHHHhCCc--e Confidence 4 55677776666654 223469999999999999999998876642 77888889999999999999987653 3 Q ss_pred cceeEecCcccCcHHHHHhhcCCCeEEEEEEcccEEEEEEEEEEeC Q lcl|Aclame:pro 381 AEYEVDVPEWDDDDVDRVNRNWGGIDLDARLAQRAHTFSLGLNVSV 426 (426) Q Consensus 381 ~~y~~~~p~~~~~~~dra~R~~~~i~~~~~laGAIh~v~I~g~v~v 426 (426) -+|.+.......+++|+.+-++. 
+++.+.....++++.++-...+ T Consensus 334 ~g~~v~~d~~~nt~~~i~~G~~~-~~i~~~p~~p~e~i~~~~~~~~ 378 (390) T protein:vir:79 334 IGGSAWIDPEPNTADILASGKAY-IDYDYTPVPPLENLVLRQRITD 378 (390) T ss_pred eeeEEEEecCCCCHHHhhCCEEE-EEEEEEecCCcceEEEEEEEch Confidence 45888776666777788776665 7888888999999999999988 No 27 >protein:vir:102957 Length: 437 # NCBI annotation: sheath tail protein # Family: family:all:632 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945292;genbank:gi:39653727;uniprot:Q708M0;genbank:GeneID:2672871 Probab=98.02 E-value=6.9e-06 Score=48.89 Aligned_cols=384 Identities=12% Similarity=0.095 Sum_probs=171.6 Q ss_pred CCCceEEEEEeeccccccccCccceEEEecccccccccchhhhheeecHHHHHhccCC--CCHHHHHHHHHHhcCCceee Q lcl|Aclame:pro 1 MPKQIVEIELTAEIADRPQETFTDAAIVGTAEEEPPDAEFGEVNQYSTSTSVGDDYGE--DSDVYTASEAIEEMGAEQWR 78 (426) Q Consensus 1 mp~~iVnV~isl~t~a~~~~~Fg~~Lilg~~~~~~~~~~~~~~~~Yts~~~V~~Dfg~--~sp~YkAA~~~f~Q~~~~~~ 78 (426) -|--.||+ ++-..+.+..-.=|...|+|...--|+ ++..+-+|.++...=||. +.+.|++...+| ++++..+ T Consensus 13 ~PGvYi~~-~~~~~~~i~~~~~~~~a~~~~~~~Gp~----~~~~~i~s~~d~~~~fG~~~~~~~~~~~~~~~-~g~~~~~ 86 (437) T protein:vir:10 13 RPGAYINV-KSKDIAMTRLGGDGVVTVPLALSFGQS----KKLMKIRRGEDLFKKLGYEQESPQLLLLNEAF-KRVSEVL 86 (437) T ss_pred cCceeEEE-ecCCcceeeccCCcEEEEEEEecCCCC----ceeEEEecHHHHHHHcCCccchhHHHHHHHHh-cCCCEEE Confidence 45444553 334444444445567888886654443 345567788899998994 456777777777 5666666 Q ss_pred eeecccccccccccccccee---------ccceeecccccccc-hhhhhhhcc----cccccccce----eeeeeccccc Q lcl|Aclame:pro 79 VMVLEATEVTEEELSDGDTI---------DKVPILGNHEVESP-DGDIEFTTD----DDPDVEDFD----AEIVINSATG 140 (426) Q Consensus 79 ~~v~~~t~v~~~~~~~~~tv---------~~~~~s~~~~~~~t-a~~i~~~~~----~~~~~t~~~----~~~~~~~~~g 140 (426) .+++..-........++.++ |...++.....++. ..++..-.. ......... 
.....-..++ T Consensus 87 ~~R~~~g~~a~~tl~~~~~~~A~~~G~~gn~i~v~v~~~~~d~~~~~v~~~~~~~~~d~~~v~~~~~~~~n~~v~~~~~~ 166 (437) T protein:vir:10 87 LYRLNTGEKANVSLSDNVTAQAKYSGVRGNDITVTVKTNVDDPSSFDVVTFLDTVVMDLQTVKVLADLKNNALVEFSGTG 166 (437) T ss_pred EEECCCCceeeEeeccceEEEeccCCcccceeEEEEeeccCCccceEEEEecCcceeeeeehhhhhhhhhhccccccccc Confidence 55543110000000111111 11111111111111 111100000 000000000 0000000000 Q ss_pred eeeech-----hheeeecccccch-hhhhhhc-cccceeecccccchhhhHhHhhhhhhhhh---c-ceEEEEEeccccc Q lcl|Aclame:pro 141 DVATSE-----DSIELTYFHADWS-QLDEFPS-DVNNFAVADRRFDLKGVGVLDETHSWASD---E-DMGMIANGVNVDD 209 (426) Q Consensus 141 ~~t~~~-----~~~~~~~~~~d~~-~~~~~~s-~~~~~~la~~~~~~~~~~~~~~~~~wa~~---~-~kl~~~~~~d~~~ 209 (426) ..+... +.........||. ++..+.. ..+...++... ......+..|+.. + .+-+.+..... . T Consensus 167 ~l~~~a~~~LtGG~dg~~t~~dy~~al~~le~~~~n~l~~~~~d-----~~~~t~~~~~ik~~r~~~g~~~~~V~~~~-~ 240 (437) T protein:vir:10 167 ELQPVAGAKLTGGTDGAISTQDYLEYFKALETVEFNYMALPVED-----ASIKKAAINFIKRMREDEGLGAQLVVADS-D 240 (437) T ss_pred ccccccceeeeccccCCCChhHHHHHHHHhccCcceEEEecCCC-----hhHHHHHHHHHHHHHhccCceEEEEeCCC-C Confidence 000000 0000011111222 1222211 11222222211 1122333445321 1 11111111110 0 Q ss_pred ccchhhHHHHHHHhhccCcceEEEEecCC-CccchhHHHHHHHhhhcccccceeecccccceeeccccccc---cccccc Q lcl|Aclame:pro 210 YDSVDEAMDVAHEVAGYVPSGDLMMIVDA-SDDDLAAYQLGKFAVSEPWYNPLWNELPAGETVSKNVGDPE---EQGTFE 285 (426) Q Consensus 210 ~~~~~~~~~~a~~~a~~~~rt~~~~~~~~-~~~~~~aa~~g~~~~~~p~~~~~~~~~~~~~~~~~~k~~~g---v~~~~~ 285 (426) .+ ...+. +............ 
......+++.|.++...+..++++ +..++ +...++ T Consensus 241 ~d--~e~Ii-------n~~n~~~~~~~~~~~~~~~~a~vAG~~Ag~~~~~S~t~------------~~~~~~~~v~~~~t 299 (437) T protein:vir:10 241 AD--SEAVI-------NVKNGVILSDKTVIDKTKATVWVAAASANAGVEKSLTY------------EKYEDSVDVVGRLS 299 (437) T ss_pred CC--CceEE-------EeecceeecCcceechhhHHHHHHHHhccCccccCccc------------cccCCcccccccCC Confidence 00 00000 0000011110000 111234666666666554444433 33443 334567 Q ss_pred hhHHHHhhcC-ccEEEEEcCCEEEeeceee-----cCcccCcceeehhhhHHHHHHHHHHHHHHHHhcCCCCcccHHHHH Q lcl|Aclame:pro 286 GGDEAEGEGP-VNVLIDVSDANRVSNAVTT-----AGADSDTSFFDIRRTKVYTAEMLELDLESLQVSDDDVPFTEDGQA 359 (426) Q Consensus 286 ~~~~~~~~~~-~N~~~~~~g~~~~~~~~t~-----~G~~~sg~~iD~i~g~dwl~~~iq~~l~~ll~~~~KIp~td~Gi~ 359 (426) .+|+..+-.. ..+++..++...+.++..+ +.+..+-..|=++|..|.+...|+..+.+.++ .|+|=+..|-. T Consensus 300 ~~e~~~~i~~G~~vl~~~~~~v~i~~gInTltt~~~~~~~~~~ki~vir~~D~i~~di~~~~~~~yi--Gk~~N~~~~r~ 377 (437) T protein:vir:10 300 HTETEDALLKGQFVFTARRGRAVVEQDINSHVSFTIEKNQDFRKNRILRTLDDIVNDTRYAFSEYFL--GKVSNNEDGRQ 377 (437) T ss_pred HHHHHHHHhCCcEEEEEeCCeEEEEEccccccccCCCCCchhhhhhHHHHHHHHHHHHHHHHHhccc--cccCCCHHHHH Confidence 7776655433 4677766776777777543 22222334677889999999998887666555 48898999999 Q ss_pred HHHHHHHHHHHHhhcCCCccccceeEecCcccCcHHHHHhhcCCCeEEEEEEcccEEEEEEEEEEe Q lcl|Aclame:pro 360 MIEDAIKGTMSGLTGSVGQPLAEYEVDVPEWDDDDVDRVNRNWGGIDLDARLAQRAHTFSLGLNVS 425 (426) Q Consensus 360 ~i~~~v~~~l~~~v~~~g~~~~~y~~~~p~~~~~~~dra~R~~~~i~~~~~laGAIh~v~I~g~v~ 425 (426) .+.+.|.+-|++-.+.+ .+..|+.. ..+..+.+-....+ +++.++..-++.++.+.++|. 
T Consensus 378 ~~~~~i~~yl~~l~~~g--~I~~~~~~--d~~v~~~~~~~~v~--v~~~v~~~dame~iy~ti~v~ 437 (437) T protein:vir:10 378 AFKANRIRYFKDLEARG--AIEDFKVE--DIEVLRGELKESVV--VNVKVKPVDSMEKLYMTVTVE 437 (437) T ss_pred HHHHHHHHHHHHHHhCC--CccCCCce--eEEeecCCCCCEEE--EEEEEEEeeeeeeEEEEEEec Confidence 99999999999887654 23334321 00001111122333 899999999999999999999 No 28 >protein:vir:100323 Length: 393 # NCBI annotation: tail sheath protein FI # Family: family:all:115 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655492;genbank:gi:109289960;genbank:GeneID:4157366 Probab=97.98 E-value=8.4e-06 Score=48.41 Aligned_cols=353 Identities=12% Similarity=0.036 Sum_probs=179.3 Q ss_pred CC--Cc----eEEEEEeeccccccccCccceEEEeccccccccc-chhhhheeecHHHHHhccCCCCHHHHHHHHHHhcC Q lcl|Aclame:pro 1 MP--KQ----IVEIELTAEIADRPQETFTDAAIVGTAEEEPPDA-EFGEVNQYSTSTSVGDDYGEDSDVYTASEAIEEMG 73 (426) Q Consensus 1 mp--~~----iVnV~isl~t~a~~~~~Fg~~Lilg~~~~~~~~~-~~~~~~~Yts~~~V~~Dfg~~sp~YkAA~~~f~Q~ 73 (426) || .+ |-=+.+.-.+.++..-....+.|+|.+...++.. .++...+.++..+....||.....+.+...+|.|+ T Consensus 1 m~m~~~~~~GV~v~e~~~g~~~i~~~~tav~~~vgta~~~~~~~~pln~pv~i~s~~~~~~~~g~~g~L~~al~~~~~~~ 80 (393) T protein:vir:10 1 MSILDTYLHGVEVVEVNAGGVTISTAATSVIGVVCTGDQADAETFPLNTPVLITNPLNYLEKAGSTGTLRRTLNSIGSIV 80 (393) T ss_pred CCCCCccCCCeEEEEcCCCcceecccCcceeEEEeeccCcCcccccCccceEecchHHHHHhhCCccchhhhhhhhhccc Confidence 55 43 3233444567777778889999999875443221 14556667888888899999999999999999996 Q ss_pred Cceeeeeeccccccccccccccceeccceeecccccccchhhhhhhcccccccccceeeeeeccccceeeechhheeeec Q lcl|Aclame:pro 74 AEQWRVMVLEATEVTEEELSDGDTIDKVPILGNHEVESPDGDIEFTTDDDPDVEDFDAEIVINSATGDVATSEDSIELTY 153 (426) Q Consensus 74 ~~~~~~~v~~~t~v~~~~~~~~~tv~~~~~s~~~~~~~ta~~i~~~~~~~~~~t~~~~~~~~~~~~g~~t~~~~~~~~~~ 153 (426) -............ ... .....+.+.. ..........+............... 
T Consensus 81 ~~~~~vv~v~~~~--~~~------~t~~~iig~~--~~~~~tgl~al~~~~~~~~~~p~li~------------------ 132 (393) T protein:vir:10 81 KTPTVIVRVAESD--DSD------TLTANIVGTQ--ENGKFTGIKALLTAQSTVFVKPKLLC------------------ 132 (393) T ss_pred CceEEEeecccCc--ccc------cccccccccc--ccchhhHHHHHHhhhhhcceeeeeee------------------ Confidence 4333221111000 000 0000000000 00000001111100000000000000 Q ss_pred ccccchhhhhhhccccceeecccccchhhhHhHhhhhhhhhhcceEEEEEecccccccchhhHHHHHHHhhccCcceEEE Q lcl|Aclame:pro 154 FHADWSQLDEFPSDVNNFAVADRRFDLKGVGVLDETHSWASDEDMGMIANGVNVDDYDSVDEAMDVAHEVAGYVPSGDLM 233 (426) Q Consensus 154 ~~~d~~~~~~~~s~~~~~~la~~~~~~~~~~~~~~~~~wa~~~~kl~~~~~~d~~~~~~~~~~~~~a~~~a~~~~rt~~~ 233 (426) .+ ++........+..-++.-.-.++.. ..+. +..+...... .........+ T Consensus 133 -------------------ap----g~~~~~~~~al~~~~~~~~~~~~v~-d~~~--~t~~~ai~~~---~~~~s~~~~~ 183 (393) T protein:vir:10 133 -------------------VP----QHDNQAVATELLSVAKKLNAFAFIS-DNGA--TTKEQAYTYR---QNFSQREGMM 183 (393) T ss_pred -------------------ec----cccchHHHHHHHHHhhccCcEEEEE-cCCC--CCHHHHHHHh---hhcCCceEEE Confidence 00 0000000111111111111111111 0000 0011111100 0010011111 Q ss_pred EecCC----------CccchhHHHHHHHhhhcccccceeecccccceeeccccccccc---------cccchhHHHHhhc Q lcl|Aclame:pro 234 MIVDA----------SDDDLAAYQLGKFAVSEPWYNPLWNELPAGETVSKNVGDPEEQ---------GTFEGGDEAEGEG 294 (426) Q Consensus 234 ~~~~~----------~~~~~~aa~~g~~~~~~p~~~~~~~~~~~~~~~~~~k~~~gv~---------~~~~~~~~~~~~~ 294 (426) |+..- ....|.++++|.++..+.-+.+ |+ .+ .++.+.||. ..+...|...|+. 
T Consensus 184 ~~P~~~~~d~~~~~~~~~p~s~~~Ag~~a~~d~~~G~-~~-sp------aN~~l~gi~~~~~~~~~~~~~~~~~~~~Ln~ 255 (393) T protein:vir:10 184 IFGDWKSYNTDKKAYDTDYAVARACALQAYIDKTVGW-HK-NI------SNVELDGVTGITKAVEFDINESSTEANYLNE 255 (393) T ss_pred EecccccccccCCceeEeehhHHHHHHHHHhhcCCCc-EE-cc------CCceeeceeecceecccccCCCcchhHhHhh Confidence 21111 1134668888888877643221 11 11 122223332 1233456666765 Q ss_pred Cc-cEEEEEcCCEEEeeceeecCcccCcceeehhhhHHHHHHHHHHHHHHHHhcCCCCcccHHHHHHHHHHHHHHHHHhh Q lcl|Aclame:pro 295 PV-NVLIDVSDANRVSNAVTTAGADSDTSFFDIRRTKVYTAEMLELDLESLQVSDDDVPFTEDGQAMIEDAIKGTMSGLT 373 (426) Q Consensus 295 ~~-N~~~~~~g~~~~~~~~t~~G~~~sg~~iD~i~g~dwl~~~iq~~l~~ll~~~~KIp~td~Gi~~i~~~v~~~l~~~v 373 (426) +. |+++. +++..+|-+.|.++ ...=.||=++|-.+|++..|+..++..+- | |.+..=+..|+..|+.-|++-+ T Consensus 256 ~gI~t~~~-~~G~~~wG~rT~s~-d~~~~~i~vrR~~~~i~~~i~~~~~~~v~---e-~~~~~~~~~i~~~i~~~L~~l~ 329 (393) T protein:vir:10 256 KGITICLN-HNGFRYWGSRTLAT-DTRWAFQQSVRTAQIIKETIGAGLAWAVD---M-PLTPLRVKTMLEAINNKLRSWA 329 (393) T ss_pred cCceEEEc-CCCEEEEcccccCC-CcccceeehhhHHHHHHHHHHHHHHHhcc---C-CCCHHHHHHHHHHHHHHHHHHH Confidence 54 77754 45677776666544 22336999999999999999999887653 3 7788888899999999998877 Q ss_pred cCCCccccceeEecCcccCcHHHHHhhcCCCeEEEEEEcccEEEEEEEEEEeC Q lcl|Aclame:pro 374 GSVGQPLAEYEVDVPEWDDDDVDRVNRNWGGIDLDARLAQRAHTFSLGLNVSV 426 (426) Q Consensus 374 ~~~g~~~~~y~~~~p~~~~~~~dra~R~~~~i~~~~~laGAIh~v~I~g~v~v 426 (426) ..|...+.+|.+.... +.+++|..+-++. +++.+...-.++++.++...+. 
T Consensus 330 ~~g~~al~g~~v~~~~-~nt~~~i~~G~~~-~~i~~~p~~p~e~I~~~~~~~~ 380 (393) T protein:vir:10 330 SGDDPRILGARVWVAE-EITADIIKSGKFV-IKYDYHWIPSLESLGLEQRVND 380 (393) T ss_pred hccccccccceEEecC-CCCHHHhhCCEEE-EEEEEEecCCcceEEEEEEEch Confidence 6443345567776543 3555666664444 7888888999999999999988 No 29 >protein:vir:107310 Length: 581 # NCBI annotation: gp123 # Family: family:all:1196 # MgeID: mge:1550 # MgeName: Catera # Cross-refs: genbank:acc:YP_656131;genbank:gi:109393333;genbank:GeneID:4156791 Probab=97.61 E-value=3.8e-05 Score=44.80 Aligned_cols=342 Identities=13% Similarity=0.113 Sum_probs=145.6 Q ss_pred CC--CceEEEEEeecccc---ccccCccceEEEecc--c----------ccccccchhhhheeecHHHHHhccCCCCHHH Q lcl|Aclame:pro 1 MP--KQIVEIELTAEIAD---RPQETFTDAAIVGTA--E----------EEPPDAEFGEVNQYSTSTSVGDDYGEDSDVY 63 (426) Q Consensus 1 mp--~~iVnV~isl~t~a---~~~~~Fg~~Lilg~~--~----------~~~~~~~~~~~~~Yts~~~V~~Dfg~~sp~Y 63 (426) -+ .++.=-..+....+ ++..-+-...+++.. + ..||.. .|+.+|..-+.+.+-+ T Consensus 184 ~~~gsd~~~~~~~~~~~~~~~~~~D~~t~~~~~~g~~~~~~~v~~~~~~~~d~~~--~~~v~~~~~~~~~~~~------- 254 (581) T protein:vir:10 184 YVLGTDYVVTRVNAGEDGEANTRDDLYTIQRVVDGGHIDPGDIVQLSYRYTDPNY--HEVIRFTDPDDIQDFY------- 254 (581) T ss_pred eeccccceeeecccCccccccccccceeeeeeecccccccceEEEEEEEeecCCc--ceeEEeecCcchhhhh------- Confidence 11 11111111111110 000001122222221 0 012211 1334444444432222 Q ss_pred HHHHHHHhcCCceeeeeeccccccccccccccceeccceeecccccccchhhhhhhcccccccccceeeeeeccccceee Q lcl|Aclame:pro 64 TASEAIEEMGAEQWRVMVLEATEVTEEELSDGDTIDKVPILGNHEVESPDGDIEFTTDDDPDVEDFDAEIVINSATGDVA 143 (426) Q Consensus 64 kAA~~~f~Q~~~~~~~~v~~~t~v~~~~~~~~~tv~~~~~s~~~~~~~ta~~i~~~~~~~~~~t~~~~~~~~~~~~g~~t 143 (426) ..+|.-. +.. ..++... .... 
.++.....++ T Consensus 255 ---~~~~~~~-----------------g~~-------------------~~~~t~~--~~~~--------~tn~~~~~l~ 285 (581) T protein:vir:10 255 ---GPAFDEA-----------------GNV-------------------QSEITLC--AQLA--------ITNGASTILA 285 (581) T ss_pred ---hhhhhcc-----------------Ccc-------------------ccchhhh--heee--------eecccceeEE Confidence 2222110 000 0000000 0000 0000000011 Q ss_pred echhheeeecccccch-hhhhhhcc-ccceeecccccchhhh-HhHhhhhhhhhhcce---EEEEEecccccccchhhHH Q lcl|Aclame:pro 144 TSEDSIELTYFHADWS-QLDEFPSD-VNNFAVADRRFDLKGV-GVLDETHSWASDEDM---GMIANGVNVDDYDSVDEAM 217 (426) Q Consensus 144 ~~~~~~~~~~~~~d~~-~~~~~~s~-~~~~~la~~~~~~~~~-~~~~~~~~wa~~~~k---l~~~~~~d~~~~~~~~~~~ 217 (426) .+......+....||. +|..+... .+.+.++.+.. .++ ..+..+.+.++.+.+ ..+....... ...... T Consensus 286 ~gvd~~g~tvt~~dy~~Al~ale~~~~~~ivv~~t~~--~~v~a~l~ahv~~~s~~~~~~ravigV~g~~~---~~~~~~ 360 (581) T protein:vir:10 286 CAVDPEGDTVTMGDYQNALNKFRDEDEIAIIVAGTGA--QPIQALVQQHVSAQSNNKYERRAILGMDGSVT---PVPSAT 360 (581) T ss_pred eeccCCCCccchHHHHHHHHHHhcCCceEEEEeCCCC--HHHHHHHHHHHHHHHhccCCcEEEEEecCCCC---CccHHH Confidence 1111111122233443 33333331 12222332211 111 223333333333321 1122211110 001111 Q ss_pred HHHHHhhccCcceEEEEecCC--------------CccchhHHHHHHHhhhcccccceeecccccceeeccccccccccc Q lcl|Aclame:pro 218 DVAHEVAGYVPSGDLMMIVDA--------------SDDDLAAYQLGKFAVSEPWYNPLWNELPAGETVSKNVGDPEEQGT 283 (426) Q Consensus 218 ~~a~~~a~~~~rt~~~~~~~~--------------~~~~~~aa~~g~~~~~~p~~~~~~~~~~~~~~~~~~k~~~gv~~~ 283 (426) ..++...-+. +++.+++... .-.+..+++.|..+..+|..+++++...+ ..++... 
T Consensus 361 ~~~~a~~~n~-~Rvvlv~p~~~~~~g~~~~~~v~lp~y~~AA~vAGl~a~~~~~~slT~~~i~g---------i~~l~~~ 430 (581) T protein:vir:10 361 RIANAQSIKD-QRVALISPSSFVYYAPELNREVVLGGQFMAAAVAGKSVSAIAAMPLTRKVIRG---------FSGPAEV 430 (581) T ss_pred HHHhhccCCC-ceEEEEecCceeecCcccCceeccchhhHHHHHHHHhhccccccCcccccccc---------ccccccc Confidence 1122222222 3333332211 11123567777777777755554443322 1222344 Q ss_pred cchhHHHHhh-cCccEEEEE-cCCEEEeeceeecCcccCcceeehhhhHHHHHHHHHHHHHH-HHhcCCCCcccHHHHHH Q lcl|Aclame:pro 284 FEGGDEAEGE-GPVNVLIDV-SDANRVSNAVTTAGADSDTSFFDIRRTKVYTAEMLELDLES-LQVSDDDVPFTEDGQAM 360 (426) Q Consensus 284 ~~~~~~~~~~-~~~N~~~~~-~g~~~~~~~~t~~G~~~sg~~iD~i~g~dwl~~~iq~~l~~-ll~~~~KIp~td~Gi~~ 360 (426) ++.+|+..+. +..+.++.. ++...+.++.++-....+...|-++|-.|++..++++.++. .++. | |=.+.|... T Consensus 431 ~s~~e~e~ll~~Gv~~l~~~~~~~v~Iv~gItT~~s~~~~~~i~~iR~~D~v~~~ir~~~~~~~fIG--~-~n~~~~r~~ 507 (581) T protein:vir:10 431 QRDGEKSRESSEGLMVIEKTPRNLVHVRHGVTTDPTSLHTREWNIIGQQDVMVYRIRDYLDADGLIG--M-PIYDTTIVQ 507 (581) T ss_pred CCHHHHHHHHhCCeEEEEEecCCeEEEEeeeecCCCCCcceeeeeehhhhHHHHHHHHHhhhhcCCC--c-ccCHHHHHH Confidence 6666665554 445777765 44467789988866666667899999999999999999974 4553 3 556789999 Q ss_pred HHHHHHHHHHHhhcCCCccccceeEecCcccCcHHHHH-hhcCCCeEEEEEEcccEEEEEEEEEEeC Q lcl|Aclame:pro 361 IEDAIKGTMSGLTGSVGQPLAEYEVDVPEWDDDDVDRV-NRNWGGIDLDARLAQRAHTFSLGLNVSV 426 (426) Q Consensus 361 i~~~v~~~l~~~v~~~g~~~~~y~~~~p~~~~~~~dra-~R~~~~i~~~~~laGAIh~v~I~g~v~v 426 (426) |++.+.+.|++-.+++ .+-+|+. .+. ++.++. -|. 
-+.|.++..-+|+++.++..+.= T Consensus 508 ik~~i~~~L~~l~~~g--~I~~~~~--~~~--~~~~~~~d~v--~V~i~v~Pv~~i~~I~vti~~~p 566 (581) T protein:vir:10 508 VKASAEAALVWLVDNN--IIRGYRN--LKA--RQIERQPDVI--EVRYEWRPAYPLNYIVVRYSIAP 566 (581) T ss_pred HHHHHHHHHHHHHhcC--cccCCcc--cee--eeeecCCCEE--EEEEEEEecccceEEEEEEEEec Confidence 9999999999988754 3445541 111 111111 121 27888889999999888776654 No 30 >protein:vir:79798 Length: 717 # NCBI annotation: tail sheath subunit # Family: family:all:12069 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429630;genbank:gi:156564120;genbank:GeneID:5525563 Probab=97.10 E-value=0.00017 Score=41.25 Aligned_cols=376 Identities=10% Similarity=0.035 Sum_probs=145.0 Q ss_pred CC-----CceEEEEEeeccccccccCccceEEEecccccccccchhhhheeec-HHHHHhccCCCCHHHHH-HHHHHhcC Q lcl|Aclame:pro 1 MP-----KQIVEIELTAEIADRPQETFTDAAIVGTAEEEPPDAEFGEVNQYST-STSVGDDYGEDSDVYTA-SEAIEEMG 73 (426) Q Consensus 1 mp-----~~iVnV~isl~t~a~~~~~Fg~~Lilg~~~~~~~~~~~~~~~~Yts-~~~V~~Dfg~~sp~YkA-A~~~f~Q~ 73 (426) |+ ++-+..++++-+....-. ...||.+.+-.. .+ |-. +++|..| |..|- +.-=+++- T Consensus 314 ~~~v~~~D~~~~~~~t~~~~~~g~~-~~~pl~~ts~dy-------~~---~~~~vdgI~~~-----~~~~V~~~g~~s~a 377 (717) T protein:vir:79 314 MRKVESKDGAVTVTITKPESKRGMI-SEDPLVFKSGDY-------TN---FKMLVDAINNH-----PFNNVVRARTKPEF 377 (717) T ss_pred eeEEecCCceEEEEEecccccCcce-eccccccccCce-------ee---eeeeecccccC-----chhheeeeeccccc Confidence 44 112334444443332111 223555443210 00 111 1233221 22110 00000000 Q ss_pred Cceeeeeeccccccccccccccceeccceeeccccc-ccchhhhh-hhcccccccccceeeeeeccccceeeec-----h Q lcl|Aclame:pro 74 AEQWRVMVLEATEVTEEELSDGDTIDKVPILGNHEV-ESPDGDIE-FTTDDDPDVEDFDAEIVINSATGDVATS-----E 146 (426) Q Consensus 74 ~~~~~~~v~~~t~v~~~~~~~~~tv~~~~~s~~~~~-~~ta~~i~-~~~~~~~~~t~~~~~~~~~~~~g~~t~~-----~ 146 (426) ....-..-+........+..++.++..-.+-...+. ......+. .+.-+.++...++...... ..+..+.. . 
T Consensus 378 ~a~~~~g~~s~d~a~f~Gg~dgl~~~~ee~Y~~lGgk~~d~g~lt~~aays~LE~~dVDlVil~g-a~adtt~ga~~d~v 456 (717) T protein:vir:79 378 EATFTSTLQAAADAKFSGGKDELSLDKEEMYKRLGGEKNEEGFVTKQGAYQYLENYEVDYVIPLG-VHADTKLIGKYDDF 456 (717) T ss_pred ceeeeecccCchhhccCCCccccccchhhhhccccccccccccccchhhhhhcCcceeEEEEecC-ccccccccchhhhH Confidence 000000000000001111122211111100000000 00000000 0000111111111111110 00000000 0 Q ss_pred hheeeecccccchhhhhhhccccceeeccc-ccchhhhHh-Hhh-------hhhhhhhcceEEEEEecccccccchhhHH Q lcl|Aclame:pro 147 DSIELTYFHADWSQLDEFPSDVNNFAVADR-RFDLKGVGV-LDE-------THSWASDEDMGMIANGVNVDDYDSVDEAM 217 (426) Q Consensus 147 ~~~~~~~~~~d~~~~~~~~s~~~~~~la~~-~~~~~~~~~-~~~-------~~~wa~~~~kl~~~~~~d~~~~~~~~~~~ 217 (426) ......++.. -+.+ ....+..+.+... ....+.+.+ .+. .+.|.......+ ....+.. . T Consensus 457 a~alad~caa-lSal--~r~ai~VI~l~sp~D~~~AtVe~~~~kLs~~Aaa~~~~d~~~a~a~-~~~~~~i--------d 524 (717) T protein:vir:79 457 AYQLALACAV-MSHY--NSVTIGIIPTTTPSDISLAGVEEHVKKLENYANEFYMRDRFGNIIF-DADRNKI--------D 524 (717) T ss_pred HHHHHHHHHH-hhhc--cccceeeeccccccccchhhHHHHHHHHHhhhhhhhhhcchhcccc-ccccccc--------c Confidence 0000000000 0000 0000000000000 000000000 000 011111000000 0000000 0 Q ss_pred HHHHHhhccCcceEEEEecCC--CccchhHHHHHHHhhhcccccceeecccccceeeccccccccccccchhHHHHhhcC Q lcl|Aclame:pro 218 DVAHEVAGYVPSGDLMMIVDA--SDDDLAAYQLGKFAVSEPWYNPLWNELPAGETVSKNVGDPEEQGTFEGGDEAEGEGP 295 (426) Q Consensus 218 ~~a~~~a~~~~rt~~~~~~~~--~~~~~~aa~~g~~~~~~p~~~~~~~~~~~~~~~~~~k~~~gv~~~~~~~~~~~~~~~ 295 (426) ... .......+...+..... ....+.++++|..+...||.+++.+... 
...++...++.+|+..|..+ T Consensus 525 is~-y~~vv~~~~~iv~~~~~~~~~~p~AG~vAGldA~rGVwkSPANk~I~---------GVvgLa~~lT~sE~d~Ln~a 594 (717) T protein:vir:79 525 LGQ-FIEVVAGPDFIVRNTRLGQMASTPDASYIGMVSQLKTQSAPTNKPLP---------SVTALRYTYSANQLNRLTKA 594 (717) T ss_pred ccc-eeeeeecceeEEEcCCCceeecCHHHHHHHHHhcCCcccccccceec---------ccccCcccCCHHHHHHHhhC Confidence 000 00000011111111111 1223467777777777777665433211 12223345777888777755 Q ss_pred c-cEEEEE-cCCEEEeeceeecCcccCcceeehhhhHHHHHHHHHHHHHHHHhcCCCCcccHHHHHHHHHHHHHHHHHhh Q lcl|Aclame:pro 296 V-NVLIDV-SDANRVSNAVTTAGADSDTSFFDIRRTKVYTAEMLELDLESLQVSDDDVPFTEDGQAMIEDAIKGTMSGLT 373 (426) Q Consensus 296 ~-N~~~~~-~g~~~~~~~~t~~G~~~sg~~iD~i~g~dwl~~~iq~~l~~ll~~~~KIp~td~Gi~~i~~~v~~~l~~~v 373 (426) . |.++.. +.+..+|.+.|+++...+-.||=++|-.|++...|+..+...+ .+ |-+..+...|++.|++-|.+-. T Consensus 595 GIntIr~~~GrGirVWGaRTtasd~sdWryInVRRl~D~Ie~sIr~al~~yV---gE-PNd~~tr~~Ik~sI~afL~~L~ 670 (717) T protein:vir:79 595 RFATFKYKQDGSIGVVDAPTSAHAGSDYTRLSTARIVKEAVNAVREVADPFI---GE-PNDTGNRNALTAAVDKRLSKMI 670 (717) T ss_pred CeEEEEEeCCceEEEEeeeecCCCCcccceeehhhhHHHHHHHHHHHHHHhc---cc-cCCHHHHHHHHHHHHHHHHHHH Confidence 4 888876 4478889888887766566899999999999999999888654 23 6788899999999999999988 Q ss_pred cCCCccccceeEecCcccCcHHHHHh-hcCCCeEEEEEEcccEEEEEEEEEEeC Q lcl|Aclame:pro 374 GSVGQPLAEYEVDVPEWDDDDVDRVN-RNWGGIDLDARLAQRAHTFSLGLNVSV 426 (426) Q Consensus 374 ~~~g~~~~~y~~~~p~~~~~~~dra~-R~~~~i~~~~~laGAIh~v~I~g~v~v 426 (426) +.|. 
+-+|.+.+ ..+++|..+ |.+ +.+.+.....++++.|+.+|+= T Consensus 671 r~GA--I~Gykvdv---tnT~~di~~G~l~--V~I~vaPv~PaEfI~ititITA 717 (717) T protein:vir:79 671 ENKA--LLGFDFRL---VVTPQQELLGEGS--IELSLEAPNELRRLTTIVSLSA 717 (717) T ss_pred hcCc--eecceeeE---ecChhHhhCCEEE--EEEEEEecCcccEEEEEEEEeC Confidence 7653 34565432 223334332 332 6788889999999999988888 No 31 >protein:vir:7653 Length: 581 # NCBI annotation: gp124 # Family: family:all:1196 # MgeID: mge:148 # MgeName: Bxz1 # Cross-refs: genbank:acc:NP_818197;genbank:gi:29566631;genbank:GeneID:1259809 Probab=97.06 E-value=0.00019 Score=41.01 Aligned_cols=330 Identities=14% Similarity=0.102 Sum_probs=141.7 Q ss_pred CCCceEEEEEeeccccccccCccceEEEecc-cccccccchhhhheeecHHHHHhccCC--------CCHHHHHHHHHHh Q lcl|Aclame:pro 1 MPKQIVEIELTAEIADRPQETFTDAAIVGTA-EEEPPDAEFGEVNQYSTSTSVGDDYGE--------DSDVYTASEAIEE 71 (426) Q Consensus 1 mp~~iVnV~isl~t~a~~~~~Fg~~Lilg~~-~~~~~~~~~~~~~~Yts~~~V~~Dfg~--------~sp~YkAA~~~f~ 71 (426) ...+++.+.- ++....=....|+.-+ ...||.. .++.+|.+-+.+.+-||. ++..=++++..|. T Consensus 205 ~~~~~~t~~~-----~~~g~~~~~~~i~~~~~~~~D~~~--~~~v~~~~~~~~~~~~~~~~~~~g~~~~e~~~~~~~~~t 277 (581) T protein:vir:76 205 TRDDLYTIQR-----VVDGGHIDPGDIVQLSYRYTDPNY--HEVIRFTDPDDIQDFYGPAFDEAGNVQSEITLCAQLAIT 277 (581) T ss_pred eeeeeeeeEe-----ecccccccceeEEEEEEEeecCCc--cceEEEecccccccceeeehhhcCccccchhhhhheeec Confidence 1111111111 1111111112222211 1123322 255556665555444431 1111111111111 Q ss_pred cCCceeeeeeccccccccccccccceeccceeecccccccchhhhhhhcccccccccceeeeeeccccceeeechhheee Q lcl|Aclame:pro 72 MGAEQWRVMVLEATEVTEEELSDGDTIDKVPILGNHEVESPDGDIEFTTDDDPDVEDFDAEIVINSATGDVATSEDSIEL 151 (426) Q Consensus 72 Q~~~~~~~~v~~~t~v~~~~~~~~~tv~~~~~s~~~~~~~ta~~i~~~~~~~~~~t~~~~~~~~~~~~g~~t~~~~~~~~ 151 (426) .++. ..++.+...... 
T Consensus 278 ~~~~----------------------------------------------------------------~~l~~gvd~~g~ 293 (581) T protein:vir:76 278 NGAS----------------------------------------------------------------TILACAVDPEGD 293 (581) T ss_pred cccc----------------------------------------------------------------eEEEeeecCCCC Confidence 1100 000111101011 Q ss_pred ecccccch-hhhhhhcc-ccceeecccccchhhh-HhHhhhhhhhhhcceE---EEEEecccccccchhhHHHHHHHhhc Q lcl|Aclame:pro 152 TYFHADWS-QLDEFPSD-VNNFAVADRRFDLKGV-GVLDETHSWASDEDMG---MIANGVNVDDYDSVDEAMDVAHEVAG 225 (426) Q Consensus 152 ~~~~~d~~-~~~~~~s~-~~~~~la~~~~~~~~~-~~~~~~~~wa~~~~kl---~~~~~~d~~~~~~~~~~~~~a~~~a~ 225 (426) +....||. ++.++... .+...++.... .++ ..+..+.+.++.+.+- .+....... ...+ ....++...- T Consensus 294 tvt~~dy~~aL~ale~~~~~~ivvp~t~~--~~i~a~l~ahv~~~s~~~~~~ra~igv~g~~~--~~~~-~~~~~~a~~~ 368 (581) T protein:vir:76 294 TVTMGDYQNALNKFRDEDEIAIIVAGTGA--QPIQALVQQHVSAQSNNKYERRAILGMDGSVT--PVPS-ATRIANAQSI 368 (581) T ss_pred ccchHHHHHHHHHHhcCCeEEEEEecCCC--hHHHHHHHHHHHHHHhccCCceEEEEeeCCCC--CchH-HHHHHhhccc Confidence 11222332 33333221 11122322111 111 1123333333333221 112111111 0111 1111122222 Q ss_pred cCcceEEEEe------cCC-------CccchhHHHHHHHhhhcccccceeecccccceeeccccccccccccchhHHHHh Q lcl|Aclame:pro 226 YVPSGDLMMI------VDA-------SDDDLAAYQLGKFAVSEPWYNPLWNELPAGETVSKNVGDPEEQGTFEGGDEAEG 292 (426) Q Consensus 226 ~~~rt~~~~~------~~~-------~~~~~~aa~~g~~~~~~p~~~~~~~~~~~~~~~~~~k~~~gv~~~~~~~~~~~~ 292 (426) +..|-.+++. ... 
.-.+..+++.|..+..+|..+++++...+ ..++...++.+|+..+ T Consensus 369 ns~Rvvlv~p~~~~~~g~~~~~~~~lp~~~~AA~vAG~~a~~~~~~slT~~~i~g---------~~~~~~~~s~~e~e~l 439 (581) T protein:vir:76 369 KDQRVALISPSSFVYYAPELNREVVLGGQFMAAAVAGKSVSAIAAMPLTRKVIRG---------FSGPAEVQRDGEKSRE 439 (581) T ss_pred CCCcEEEEEcCceEeccccCCcceecchhhhhhhHHhhhhccccccCcccccccc---------cccccccCCHHHHHHH Confidence 3334333331 110 01224466666666666655554443321 2233345666666554 Q ss_pred -hcCccEEEEEc-CCEEEeeceeecCcccCcceeehhhhHHHHHHHHHHHHHHH-HhcCCCCcccHHHHHHHHHHHHHHH Q lcl|Aclame:pro 293 -EGPVNVLIDVS-DANRVSNAVTTAGADSDTSFFDIRRTKVYTAEMLELDLESL-QVSDDDVPFTEDGQAMIEDAIKGTM 369 (426) Q Consensus 293 -~~~~N~~~~~~-g~~~~~~~~t~~G~~~sg~~iD~i~g~dwl~~~iq~~l~~l-l~~~~KIp~td~Gi~~i~~~v~~~l 369 (426) ++..+.+.... +...+.++.++-........|-++|-.|++..++++.+..+ |.. | |=.+.|...|++.|.+.| T Consensus 440 l~~Gv~~l~~~~~~~v~Iv~gItT~~s~~~~k~i~viR~~D~v~~~vr~~~~~~~fiG--~-~n~~~~r~~ik~~i~~~L 516 (581) T protein:vir:76 440 SSEGLMVIEKTPRNLVHVRHGVTTDPTSLHTREWNIIGQQDVMVYRIRDYLDADGLIG--M-PIYDTTIVQVKASAEAAL 516 (581) T ss_pred HhCCeEEEEEecCCeEEEEEeeecCCCCCccceeeehhhhHHHHHHHHHHHhhhcCCC--c-ccChHHHHHHHHHHHHHH Confidence 45557787644 45667899888665556678999999999999999999744 553 2 556789999999999999 Q ss_pred HHhhcCCCccccceeEecCcccCcHHHHH-hhcCCCeEEEEEEcccEEEEEEEEEEeC Q lcl|Aclame:pro 370 SGLTGSVGQPLAEYEVDVPEWDDDDVDRV-NRNWGGIDLDARLAQRAHTFSLGLNVSV 426 (426) Q Consensus 370 ~~~v~~~g~~~~~y~~~~p~~~~~~~dra-~R~~~~i~~~~~laGAIh~v~I~g~v~v 426 (426) ++-.+++ .+-+|+. .+..+. +++ .|. 
-+.++++..=+|.++.++..+.= T Consensus 517 ~~l~~~g--~I~g~~~--~~~~~~--~~~~d~v--~V~i~v~Pv~~ie~I~vt~~~~p 566 (581) T protein:vir:76 517 VWLVDNN--IIRGYRN--LKARQI--ERQPDVI--EVRYEWRPAYPLNYIVVRYSIAP 566 (581) T ss_pred HHHHhcC--cccCccc--ceeeEE--ecCCCEE--EEEEEEEecccceEEEEEEEEee Confidence 9988754 2334541 111111 111 122 25666777777777766655543 No 32 >protein:vir:5663 Length: 671 # NCBI annotation: tail sheath protein # Family: family:all:661 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899602;genbank:gi:34419589;genbank:GeneID:2545776 Probab=96.90 E-value=0.00027 Score=40.14 Aligned_cols=377 Identities=11% Similarity=0.047 Sum_probs=146.3 Q ss_pred CC-C-------------------------ceEEEEEeeccccc-------------cccCccceEEEecccccccccchh Q lcl|Aclame:pro 1 MP-K-------------------------QIVEIELTAEIADR-------------PQETFTDAAIVGTAEEEPPDAEFG 41 (426) Q Consensus 1 mp-~-------------------------~iVnV~isl~t~a~-------------~~~~Fg~~Lilg~~~~~~~~~~~~ 41 (426) .. . +++...+....... ....|..++... .. T Consensus 232 g~~g~~~~v~v~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~--g~-------- 301 (671) T protein:vir:56 232 GDFGDAISVEIINYADYQTAFAFAAGHTLGDIELPIYPDGGTRSINLSSYFTFGPSNSNQYAVIVRVS--GE-------- 301 (671) T ss_pred cccCcceEEEEecccccccccccccceeeeeccccccccccccccccceeecccccccccceeEEeec--Cc-------- Confidence 10 1 11111111110000 000011000000 00 Q ss_pred hhheeecHHHHHhccCCCCHHHHHHHHHHhcCCceeeeeeccccccccccccccceeccceeecccccccchhhhhhhcc Q lcl|Aclame:pro 42 EVNQYSTSTSVGDDYGEDSDVYTASEAIEEMGAEQWRVMVLEATEVTEEELSDGDTIDKVPILGNHEVESPDGDIEFTTD 121 (426) Q Consensus 42 ~~~~Yts~~~V~~Dfg~~sp~YkAA~~~f~Q~~~~~~~~v~~~t~v~~~~~~~~~tv~~~~~s~~~~~~~ta~~i~~~~~ 121 (426) ...+|.-...-.+......-.| ...++..+.....+........ ......+.+..+......++..++. 
T Consensus 302 ~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~gg~d~~~~~~~~~~~~~ 370 (671) T protein:vir:56 302 VEEAFIVSTNPGDKDVNGQSIF--IDEYFENSGSAYITAIAEGWKT---------ESGAYNFGGGSDANAGADDWMFGLD 370 (671) T ss_pred cceeEEEeecccccccchhhhh--hhhhhcccCceEEEecCcccCC---------ccccccccCccccccchhHHHHHHH Confidence 0000100000000011101111 1112222211111111100000 0000111111111111122222222 Q ss_pred cccccccceeeeeeccccceeeechhheeeecccccchhhhhhh-ccccceeeccc-ccch---hhhHhHhhhhhhhhhc Q lcl|Aclame:pro 122 DDPDVEDFDAEIVINSATGDVATSEDSIELTYFHADWSQLDEFP-SDVNNFAVADR-RFDL---KGVGVLDETHSWASDE 196 (426) Q Consensus 122 ~~~~~t~~~~~~~~~~~~g~~t~~~~~~~~~~~~~d~~~~~~~~-s~~~~~~la~~-~~~~---~~~~~~~~~~~wa~~~ 196 (426) .-.+.+...........-........ ....+..+.... ...+.+.+.+. ..+. ......++..+|.+.. T Consensus 371 ~~~~~~~~~~~~~~a~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~ 444 (671) T protein:vir:56 371 MLSDPEVLYTNLVIAGNAAAEEVSIA------STVQKYAIDSVGNVRQDCVVFVSPPQSLIVNKQAGTAVANIQGWRTGI 444 (671) T ss_pred hhhhccccceeEEEcCCCCCccchhH------HHHHHHHHHHHHhhcCCEEEEEeccccccccccccccHHHHHHHhhhc Confidence 21111111111111000000000000 000000000000 00001111110 0000 0000011111121111 Q ss_pred ceEEEEEecccccccchhhHHHHHHHhhccCcceEEEEecCCC---ccchhHHHHHHHhhhc----ccccceeecccccc Q lcl|Aclame:pro 197 DMGMIANGVNVDDYDSVDEAMDVAHEVAGYVPSGDLMMIVDAS---DDDLAAYQLGKFAVSE----PWYNPLWNELPAGE 269 (426) Q Consensus 197 ~kl~~~~~~d~~~~~~~~~~~~~a~~~a~~~~rt~~~~~~~~~---~~~~~aa~~g~~~~~~----p~~~~~~~~~~~~~ 269 (426) ...-.....+...+ .+.....+++. ..++..... .--|.++++|.|+-.+ ||.++..+... 
T Consensus 445 ~~~~~~~~~~~~~~--------~s~~~~~~~p~-~~~~d~~~~~~~~~p~s~~~AGl~Ar~D~~~g~~~span~~~~--- 512 (671) T protein:vir:56 445 DPTNGQAVVDNLNV--------STTYAVIDGNY-KYQYDKYNDRNRWVPLAGDIAGLCAYTDQVSQPWMSPAGFNRG--- 512 (671) T ss_pred cccchhhhhhhccC--------CcceEEEecCc-eEEecccCCceeEechHHHHHHHHHHhhccCCcEECcCCceec--- Confidence 00000000000000 00001111211 111111111 1126788889888776 44443322211 Q ss_pred eeeccccccccccccchhHHHHhhcCc-cEEEEEcC-CEEEeeceeecCcccCcceeehhhhHHHHHHHHHHHHHHHHhc Q lcl|Aclame:pro 270 TVSKNVGDPEEQGTFEGGDEAEGEGPV-NVLIDVSD-ANRVSNAVTTAGADSDTSFFDIRRTKVYTAEMLELDLESLQVS 347 (426) Q Consensus 270 ~~~~~k~~~gv~~~~~~~~~~~~~~~~-N~~~~~~g-~~~~~~~~t~~G~~~sg~~iD~i~g~dwl~~~iq~~l~~ll~~ 347 (426) ......++.-.+...|...|+.+. |.++..-| +..+|-+.|.++....-.||=++|..+||++.|+..++..+-. T Consensus 513 ---~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~~~~~~~~~i~vrR~~~~i~~si~~~~~~~v~e 589 (671) T protein:vir:56 513 ---QIKGVNRLAVDLRRAHRDALYQIGINPVVGFAGQGFVLYGDKTATQQASAFDRINVRRLFNLLKKAISDAAKYRLFE 589 (671) T ss_pred ---cccccccceeecChhHHHHHhhCCceEEEEecCCeEEEEcceecCCCCcccceEehhhHHHHHHHHHHHHHHHhcCC Confidence 111111122235566777777544 88888754 6778877777765556789999999999999999998876532 Q ss_pred CCCCcccHHHHHHHHHHHHHHHHHhhcCCCccccceeEecCcccCcHHHHHhhcCCCeEEEEEEcccEEEEEEEEEEeC Q lcl|Aclame:pro 348 DDDVPFTEDGQAMIEDAIKGTMSGLTGSVGQPLAEYEVDVPEWDDDDVDRVNRNWGGIDLDARLAQRAHTFSLGLNVSV 426 (426) Q Consensus 348 ~~KIp~td~Gi~~i~~~v~~~l~~~v~~~g~~~~~y~~~~p~~~~~~~dra~R~~~~i~~~~~laGAIh~v~I~g~v~v 426 (426) |.+..=...|+..|+.-|.+-++++. +-+|.+...+...+++|+.+-++. 
+++.+...-.++++.++..-+- T Consensus 590 ----pn~~~~~~~i~~~i~~fL~~l~~~ga--l~g~~v~~d~~~nt~~~i~~G~~~-~~i~~~p~~Pae~I~~~~~~~~ 661 (671) T protein:vir:56 590 ----LNDEFTRSSFKSEIDAYLTNIQDLGG--VYDFRVVCDETNNPGSVIDRNEFV-ASIYVKPAKSINFITLNFVATS 661 (671) T ss_pred ----CCCHHHHHHHHHHHHHHHHHHHhCCc--eeeeEEEEcCCCCCHHHhhCCeEE-EEEEEEecCCcceEEEEEEEee Confidence 66777778889999999988887553 445888888777788888887775 8899999999999999887666 No 33 >protein:vir:104858 Length: 729 # NCBI annotation: T4-like tail sheath protein # Family: family:all:661 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214361;genbank:gi:61806001;genbank:GeneID:3294378 Probab=96.88 E-value=0.00028 Score=40.05 Aligned_cols=399 Identities=11% Similarity=0.029 Sum_probs=145.7 Q ss_pred CCCce--EEEEEeeccccccccCccceEEEecccccccccchhhhheeecHHHHHhc-cCCCCHHHHHHHHHHhcCCcee Q lcl|Aclame:pro 1 MPKQI--VEIELTAEIADRPQETFTDAAIVGTAEEEPPDAEFGEVNQYSTSTSVGDD-YGEDSDVYTASEAIEEMGAEQW 77 (426) Q Consensus 1 mp~~i--VnV~isl~t~a~~~~~Fg~~Lilg~~~~~~~~~~~~~~~~Yts~~~V~~D-fg~~sp~YkAA~~~f~Q~~~~~ 77 (426) .+.++ ...+....+...........++......+.... .+-+..|..+....+. -....+.|..-. +.+..... T Consensus 279 ~~~~~~~~~~t~~~~~~~~~~~d~~~~~~~d~~~~~~~~~-g~vve~~~~~s~~~~~~~~~~~~~~~~~v--i~~~s~~~ 355 (729) T protein:vir:10 279 EWDSIADAPGTSTYVSTRGGKNDEIHVLVIDDKGTITGNS-GTILEKHLSLSKAKDAEYSVGSSSYWRDF--LATNSKYI 355 (729) T ss_pred cccccccccccccccccccccccccceeeeccccccccCc-ccceeeeeeeeecccccccccccccccee--ecccccee Confidence 11111 000111111100000000011111000000000 0001112221111110 111122221110 11100000 Q ss_pred eeeeccccccccccc----cccceeccceeecccccccch--hhhhhhcccccccccceeeeeeccccceeeechhheee Q lcl|Aclame:pro 78 RVMVLEATEVTEEEL----SDGDTIDKVPILGNHEVESPD--GDIEFTTDDDPDVEDFDAEIVINSATGDVATSEDSIEL 151 (426) Q Consensus 78 ~~~v~~~t~v~~~~~----~~~~tv~~~~~s~~~~~~~ta--~~i~~~~~~~~~~t~~~~~~~~~~~~g~~t~~~~~~~~ 151 (426) .... ..+....... ......... ....+...... ......+....+... .......+....+.. 
T Consensus 356 ~~~~-~~~~~~~~~~~~~~~~~~~~~~~-~~~~a~~~~~~~~~~~~~~~~~g~~~~~----~~~~~~~~~~~~~~~---- 425 (729) T protein:vir:10 356 FGGG-ATSGITTTGYSVSSTNTLDTDSG-WDQNAEGVNFGASGVATLTLAGGTNYGD----KTDLTTSGALSSGVD---- 425 (729) T ss_pred eecc-cccccccccccccccceeccccc-cccccccccccccceeEEEeeccccccc----ccccccccccccchh---- Confidence 0000 0000000000 000000000 00000000000 000000000000000 000000000000000 Q ss_pred ecccccchhhhhhhc-cccceee-cccccchhhhHhHhhhhhhhhhcc-eEEEEEe------cccc----cccchhhH-H Q lcl|Aclame:pro 152 TYFHADWSQLDEFPS-DVNNFAV-ADRRFDLKGVGVLDETHSWASDED-MGMIANG------VNVD----DYDSVDEA-M 217 (426) Q Consensus 152 ~~~~~d~~~~~~~~s-~~~~~~l-a~~~~~~~~~~~~~~~~~wa~~~~-kl~~~~~------~d~~----~~~~~~~~-~ 217 (426) .... -+..+..... .++...+ +....|.........+...++... .+.+... .+.. ..+..... . T Consensus 426 ~~~~-g~~~l~~~~~~~~~~~~~~~~~~~~~~~~~v~~a~~~~~~~~~~~~a~~~~~~~~~i~~~~~~~~~~~~~~~~~~ 504 (729) T protein:vir:10 426 DIIS-GYTLFENTEEIEVDFILMGAAHHPKEQSQAVAEKVTAVAEARKDAVAFISPYRQAFLNDTVSGTVTVSNIDQTTE 504 (729) T ss_pred HHHH-HHHHhhcccccccceeeecCCCCCccchHHHHHHHHHHHHhcCCeEEEecccccccccccccccccccccchhhH Confidence 0000 0000100000 0000000 111111111111122222222222 1111110 0000 00000000 0 Q ss_pred HHHHHhhccCcce--------EEEEecCCC---ccchhHHHHHHHhhhcc----cccceeecccccceeecccccccccc Q lcl|Aclame:pro 218 DVAHEVAGYVPSG--------DLMMIVDAS---DDDLAAYQLGKFAVSEP----WYNPLWNELPAGETVSKNVGDPEEQG 282 (426) Q Consensus 218 ~~a~~~a~~~~rt--------~~~~~~~~~---~~~~~aa~~g~~~~~~p----~~~~~~~~~~~~~~~~~~k~~~gv~~ 282 (426) ........+..++ ..++..... .--|.++++|.++-.+. |.++..+... 
......+..- T Consensus 505 ~~~~~~~~~~~~~~~~~~~p~~~~~d~~~~~~~~~p~s~~~aGl~a~~d~~~g~~~span~~~~------~i~g~~~~~~ 578 (729) T protein:vir:10 505 NVVGFYAPLSSSTYSVFDSGYKYMFDRFNNTFRYVPLNGDIAGTCARTDIEQFPWFSPAGTARG------PILNSVKLVY 578 (729) T ss_pred HHHHHHhhccCCceEEEEcCeeEEecccCCceEEechhHHHHHHHHHhhccCCcEEccCCcccc------ceecccceee Confidence 0000011111111 111211111 22356788888877764 3333222211 1111111112 Q ss_pred ccchhHHHHhhcCc-cEEEEEc-CCEEEeeceeecCcccCcceeehhhhHHHHHHHHHHHHHHHHhcCCCCcccHHHHHH Q lcl|Aclame:pro 283 TFEGGDEAEGEGPV-NVLIDVS-DANRVSNAVTTAGADSDTSFFDIRRTKVYTAEMLELDLESLQVSDDDVPFTEDGQAM 360 (426) Q Consensus 283 ~~~~~~~~~~~~~~-N~~~~~~-g~~~~~~~~t~~G~~~sg~~iD~i~g~dwl~~~iq~~l~~ll~~~~KIp~td~Gi~~ 360 (426) .+..+|...|+.+. |.++.+- ++..+|-+.|..+.+..-.||=++|..+||+..|+..++..+-. |.|..=... T Consensus 579 ~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~~~d~~~~~i~vrR~~~~i~~si~~~~~~~v~e----pn~~~~~~~ 654 (729) T protein:vir:10 579 NPGKKQRDILYSNRINPVILSPGAGIILFGDKTGFGKSSAFDRINVRRLFIYLEDAISAAAKDQLFE----FNDELTRTN 654 (729) T ss_pred ecChhhHhhhhhCCceEEEEecCCeEEEEcceecCCCCcccceeehhhhHHHHHHHHHHHHHHhhcC----CCCHHHHHH Confidence 34566666777555 8888874 56888887777676666689999999999999999998876642 778888889 Q ss_pred HHHHHHHHHHHhhcCCCccccceeEecCcccCcHHHHHhhcCCCeEEEEEEcccEEEEEEEEEEeC Q lcl|Aclame:pro 361 IEDAIKGTMSGLTGSVGQPLAEYEVDVPEWDDDDVDRVNRNWGGIDLDARLAQRAHTFSLGLNVSV 426 (426) Q Consensus 361 i~~~v~~~l~~~v~~~g~~~~~y~~~~p~~~~~~~dra~R~~~~i~~~~~laGAIh~v~I~g~v~v 426 (426) |+..|+.-|..-+++|. +.+|.+.......+++|+.+-++. 
+.+.+.....++++.++..-+- T Consensus 655 i~~~i~~~L~~l~~~g~--l~g~~v~~d~~~nt~~~i~~G~~~-~~v~~~p~~p~e~i~~~~~~~~ 717 (729) T protein:vir:10 655 FVNIVEPFLRDVQAKRG--IFDFVVICDETNNTAAVIDSNEFV-ADIFIKPARSINFIGLTFVATR 717 (729) T ss_pred HHHHHHHHHHHHHhccc--eeeeEEEEcCCCCCHHHhhCCeEE-EEEEEEecCCccEEEEEEEEee Confidence 99999999999887654 446888887666778888887776 8889999999999998866555 No 34 >protein:vir:104477 Length: 749 # NCBI annotation: gp18 # Family: family:all:661 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214663;genbank:gi:61806304;genbank:GeneID:3294532 Probab=96.20 E-value=0.00089 Score=37.32 Aligned_cols=396 Identities=9% Similarity=0.020 Sum_probs=161.5 Q ss_pred CC----CceEEEEEeeccccccccC------ccceEEEecccccccccchhhhheeecHHHHH-hccCCCCHHHHHHHHH Q lcl|Aclame:pro 1 MP----KQIVEIELTAEIADRPQET------FTDAAIVGTAEEEPPDAEFGEVNQYSTSTSVG-DDYGEDSDVYTASEAI 69 (426) Q Consensus 1 mp----~~iVnV~isl~t~a~~~~~------Fg~~Lilg~~~~~~~~~~~~~~~~Yts~~~V~-~Dfg~~sp~YkAA~~~ 69 (426) |+ +.....-++ ........ .....|.|.... -+.+|..++.-. ..+....+.|- ... T Consensus 301 ~~~~a~~~gt~~~~~--~~~g~~D~~~v~v~~~~g~~~~~~g~--------v~e~~~~~~~~~~~~~~~~~~~~~--~~~ 368 (749) T protein:vir:10 301 WINVAPRPGTSLYAN--GVGGHRDEMHVILVDIDGGVTGTVGA--------LLERYIDVSKASDAKTSVGETNYY--AEV 368 (749) T ss_pred eccccccccceeeee--cccCCCCceEEEEecCCCeeeecccc--------eeeeeeeccccccccccccccchh--hhh Confidence 22 111111110 00000000 001122222211 111233332111 12334455553 222 Q ss_pred HhcCCceeeeeecc-ccc---cccccccccceeccceeecccccccchhhhhhhcccccccccceeee-eeccccceeee Q lcl|Aclame:pro 70 EEMGAEQWRVMVLE-ATE---VTEEELSDGDTIDKVPILGNHEVESPDGDIEFTTDDDPDVEDFDAEI-VINSATGDVAT 144 (426) Q Consensus 70 f~Q~~~~~~~~v~~-~t~---v~~~~~~~~~tv~~~~~s~~~~~~~ta~~i~~~~~~~~~~t~~~~~~-~~~~~~g~~t~ 144 (426) +.+.....+..... ... .+......+.........-.. ......................... .....++..+. 
T Consensus 369 ~~~~s~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~d~~~~~ 447 (749) T protein:vir:10 369 IKQKSEFIYWAEHESTLYAATSSASDGLFGQTAANRQFNLFR-SAAGSVDYPAGVTTLGSKNNATYYYRLSGGVNYTVSA 447 (749) T ss_pred hccCCCEEEEEecccccccccccccccccccccccceeeccc-cccccceeccccccccccCCcEEEEEccCCccccccc Confidence 22322222211100 000 000000000000000000000 0000000000110000000000000 00011111111 Q ss_pred chhheeeecccccch----hhhhhhc-cccceeecccc-cchhhhHhHhhhhhhhhhcceEEEEEeccc-ccc---cchh Q lcl|Aclame:pro 145 SEDSIELTYFHADWS----QLDEFPS-DVNNFAVADRR-FDLKGVGVLDETHSWASDEDMGMIANGVNV-DDY---DSVD 214 (426) Q Consensus 145 ~~~~~~~~~~~~d~~----~~~~~~s-~~~~~~la~~~-~~~~~~~~~~~~~~wa~~~~kl~~~~~~d~-~~~---~~~~ 214 (426) +. ......++. .+..... .++...++... .+.........+...++.....++...... ... .... T Consensus 448 ~~----~~~~~~~~~~~~~~l~~~~~~~~~~li~~~~~~~~~~~~~v~~al~~~~~~~~~~~~~~d~p~~~~~~~~~~~~ 523 (749) T protein:vir:10 448 GQ----YTITNTDIGSAYELIGDPESQIVDFIISGPSGTSDANALAKITSLVNIAEERRDCMVFVSPRRGNVIGISNTTT 523 (749) T ss_pred cc----ccccchhHHHHHHHhhhhhhcccceEEEecCCCCcchhHHHHHHHHHHHhhcCCEEEEEcCCCCcccccccchh Confidence 00 000111111 1110110 11111111110 111112222333333443333222211111 111 1111 Q ss_pred hHHHHHHHhhccCcce-EEEEe-------cCCC---ccchhHHHHHHHhhhc----ccccceeecccccceeeccccccc Q lcl|Aclame:pro 215 EAMDVAHEVAGYVPSG-DLMMI-------VDAS---DDDLAAYQLGKFAVSE----PWYNPLWNELPAGETVSKNVGDPE 279 (426) Q Consensus 215 ~~~~~a~~~a~~~~rt-~~~~~-------~~~~---~~~~~aa~~g~~~~~~----p~~~~~~~~~~~~~~~~~~k~~~g 279 (426) ..........++..++ ..+|+ .... ..-|.+.++|.|+-.+ ||..+..++. 
.......+ T Consensus 524 ~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~vAGl~Ar~D~~~g~~~SPan~~~------~~i~g~~~ 597 (749) T protein:vir:10 524 ITTNIVDFFKKLPSSSYMVFDSGYKYIYDKYNDVYRYIPCNGDTAGLCLQTNEISEPWFSPAGFQR------GVLRNAIK 597 (749) T ss_pred hhhHHHHHHhhccCceeEEEEccceeeeccccCceEEechHHHHHHHHHHhhccCCcEECcCCcee------eeeecccc Confidence 1111111111111111 22222 1111 1236788888888876 4444333221 11111112 Q ss_pred cccccchhHHHHhhcCc-cEEEEEcC-CEEEeeceeecCcccCcceeehhhhHHHHHHHHHHHHHHHHhcCCCCcccHHH Q lcl|Aclame:pro 280 EQGTFEGGDEAEGEGPV-NVLIDVSD-ANRVSNAVTTAGADSDTSFFDIRRTKVYTAEMLELDLESLQVSDDDVPFTEDG 357 (426) Q Consensus 280 v~~~~~~~~~~~~~~~~-N~~~~~~g-~~~~~~~~t~~G~~~sg~~iD~i~g~dwl~~~iq~~l~~ll~~~~KIp~td~G 357 (426) +.-.+...|...|+.+. |.++...| +..+|-+.|..+.+..-.||=++|..+||+..|+..++..+-. |.++.= T Consensus 598 ~~~~~~~~e~~~Ln~~gIn~i~~~~g~G~~~wG~rT~~s~d~~~~~i~vRRl~~~ie~si~~~~~~~v~e----pn~~~l 673 (749) T protein:vir:10 598 LAYTPNKAQRDQLYANRVNPIVSFPGQGVVLYGDKTALGFASAFDRINIRRLFLTVERVISTAAKAQLFE----QNDEAQ 673 (749) T ss_pred ceeecChhHHHhhhhCCceEEEEecCCeEEEEcceecCCCCcccceeehhhhHHHHHHHHHHHHHHhhcC----CCCHHH Confidence 22234566666777555 88888754 5777877777666655679999999999999999988876643 678888 Q ss_pred HHHHHHHHHHHHHHhhcCCCccccceeEecCcccCcHHHHHhhcCCCeEEEEEEcccEEEEEEEEEEeC Q lcl|Aclame:pro 358 QAMIEDAIKGTMSGLTGSVGQPLAEYEVDVPEWDDDDVDRVNRNWGGIDLDARLAQRAHTFSLGLNVSV 426 (426) Q Consensus 358 i~~i~~~v~~~l~~~v~~~g~~~~~y~~~~p~~~~~~~dra~R~~~~i~~~~~laGAIh~v~I~g~v~v 426 (426) ...|+..|+.-|.+-++++. +..|.+.......+++|+.+.++. 
+++.+...-.++++.++..-+- T Consensus 674 ~~~i~~~i~~fL~~l~~~G~--i~~f~V~~d~~~Nt~~~i~~G~~~-~~i~~~P~~pae~I~~~~~~~~ 739 (749) T protein:vir:10 674 RSLFINIVEPYLRDVQGRRG--VVDFLVKCDSTNNTPEAVDRGEFY-AEVFLKPTRTINYVQLTFVATR 739 (749) T ss_pred HHHHHHHHHHHHHHHHhcCC--eeeeEEEEcCCCCCHHHhhCCEEE-EEEEEEecCCccEEEEEEEEee Confidence 88999999999998887664 567888888666778888887775 8899999999999998877554 No 35 >protein:vir:4463 Length: 498 # NCBI annotation: Tail Sheath protein # Family: family:all:369 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700386;genbank:gi:23505458;genbank:GeneID:955665 Probab=96.17 E-value=0.00092 Score=37.24 Aligned_cols=397 Identities=14% Similarity=0.110 Sum_probs=174.2 Q ss_pred CCCceEEEEEeeccccccccCccceEEEecccccccccchhhhheeecHHHHHhccCCCCHHHHHHHHHHhcCCc--eee Q lcl|Aclame:pro 1 MPKQIVEIELTAEIADRPQETFTDAAIVGTAEEEPPDAEFGEVNQYSTSTSVGDDYGEDSDVYTASEAIEEMGAE--QWR 78 (426) Q Consensus 1 mp~~iVnV~isl~t~a~~~~~Fg~~Lilg~~~~~~~~~~~~~~~~Yts~~~V~~Dfg~~sp~YkAA~~~f~Q~~~--~~~ 78 (426) .|--.++++=++.+.+... .-.||||-... .-........|-+|.++...=||..|-.+.++++|....+- .+- T Consensus 14 vP~~y~E~dns~A~~~~~~---q~vLiiGq~la-~gs~~~~~~v~v~s~~~a~~~fG~GSml~~M~~a~~~~n~~~~l~~ 89 (498) T protein:vir:44 14 VPLFYAEMDNSAANTARDS---GASLLIGHASN-DASIAVNSLVLVSSVDYARQICGAGSQLARMVGAYRKTDPFGELYV 89 (498) T ss_pred cCeEEEEEeCCCCCCCcCC---cceEEEEecCc-ccccccceeEeecCHHHHHHhcCcccHHHHHHHHHHHhCCCceeEE Confidence 3334455544554444322 35899995422 11122345667788899999999999999999999976422 222 Q ss_pred eeecccccc------ccc-----cccccceeccceeecccccccchhhhhhhcccccccccceeeeeecc---------c Q lcl|Aclame:pro 79 VMVLEATEV------TEE-----ELSDGDTIDKVPILGNHEVESPDGDIEFTTDDDPDVEDFDAEIVINS---------A 138 (426) Q Consensus 79 ~~v~~~t~v------~~~-----~~~~~~tv~~~~~s~~~~~~~ta~~i~~~~~~~~~~t~~~~~~~~~~---------~ 138 (426) ..+.+++.+ +.. ..+-...|.+..+...-...+++..+...+.+.....+.-.+++... 
- T Consensus 90 i~~~D~aG~aAtg~it~tg~at~~G~l~l~Igg~~v~v~V~~gdTaa~vA~al~aaina~~~lPVTA~~~~~~vtlTAr~ 169 (498) T protein:vir:44 90 IAVPESTGAAATVALTVTGEATETGTVNVYTGRTRVQAPVTSGDDAAAVAVSIKDAVNANPDLPFTATSEAGVVTLTARH 169 (498) T ss_pred EecCCcccceeEEEEEeecccCCCcEEEEEECCEEEEEEecCCCCHHHHHHHHHHHHhCCCCCceEEeeccceEEEEEec Confidence 222222211 111 11222345566666555556777777777666555543222222211 1 Q ss_pred cceeeechhheeeeccc-ccchhhhhhhccccce--------------eecccccc-----hhhhHhHhhhh-------- Q lcl|Aclame:pro 139 TGDVATSEDSIELTYFH-ADWSQLDEFPSDVNNF--------------AVADRRFD-----LKGVGVLDETH-------- 190 (426) Q Consensus 139 ~g~~t~~~~~~~~~~~~-~d~~~~~~~~s~~~~~--------------~la~~~~~-----~~~~~~~~~~~-------- 190 (426) .|...+.......-+.. ++......+..++.+. .+.+.+.. |.+...+..+. T Consensus 170 kG~~GN~I~l~~~~~~~~~ge~~p~Glt~titamsgGag~PDia~alaal~~~~~~~i~~p~~D~asl~al~~~L~~~sg 249 (498) T protein:vir:44 170 KGLYGNEIPVTLNYYGFGGGEVLPAGVNITVASGVKGAGAPALNDAVAAMGDEPFDYIGLPFNDTASVNSMATEMNDSSG 249 (498) T ss_pred cCcccCcceEEEeeccCccccccccceeEEEEcccCCccCchhHHHHHhhccCCccEEEEeecCHHHHHHHHHHHhhhhc Confidence 12111111111111100 0000000000000000 01111110 01111111111 Q ss_pred hh--hhhcceEEEEEecccccccchhhHHHHHHHhhccCcceEEEEecCCC-cc--chhHHHHHHHh---hhccccccee Q lcl|Aclame:pro 191 SW--ASDEDMGMIANGVNVDDYDSVDEAMDVAHEVAGYVPSGDLMMIVDAS-DD--DLAAYQLGKFA---VSEPWYNPLW 262 (426) Q Consensus 191 ~w--a~~~~kl~~~~~~d~~~~~~~~~~~~~a~~~a~~~~rt~~~~~~~~~-~~--~~~aa~~g~~~---~~~p~~~~~~ 262 (426) -| .+.-+-+.+..... ........-...+.++..|+++.... 
++ ...+++.++++ .+||-+++.- T Consensus 250 Rw~~~~q~~g~~~~a~~g-------T~a~l~t~g~~~N~~~it~~~~~~~~~sp~~~~AAa~a~~aA~~l~~DPArPL~t 322 (498) T protein:vir:44 250 RWSYVRQLYGHVYTAKTG-------TLSELVAAGDQFNLQHITLAGYEKDTQTPADELAASRTARAAVFIRNDPARPTQT 322 (498) T ss_pred chHHHhhcCeEEEEeccC-------CHHHHHHhhhccCCceEEEEecCCCCCCHHHHHHHHHHHHHHHHhhcccccccCc Confidence 12 22222222222221 12223333444555677778775432 22 33445555554 6888776522 Q ss_pred ecccccceeeccccccccccccchhHHHHh--hcCccEEEEEcCCEEEeeceee----cCcccCcceeeh--hhhHHHHH Q lcl|Aclame:pro 263 NELPAGETVSKNVGDPEEQGTFEGGDEAEG--EGPVNVLIDVSDANRVSNAVTT----AGADSDTSFFDI--RRTKVYTA 334 (426) Q Consensus 263 ~~~~~~~~~~~~k~~~gv~~~~~~~~~~~~--~~~~N~~~~~~g~~~~~~~~t~----~G~~~sg~~iD~--i~g~dwl~ 334 (426) ..+++.. -|...+.++-.|...+ ++-.-.++. +|...|-+.+|+ .-|..|-.|.|+ +|-.+|+. T Consensus 323 l~L~Gi~-------~p~~~~r~~~~ern~LL~~Gist~~V~-~G~V~I~R~ITTY~~n~~G~~D~syLDi~T~~tl~yvr 394 (498) T protein:vir:44 323 GELVDML-------PAPKGKRFTTTEQQTLLSHGVATAYVE-SGVLRIQRDITTYRKNAYGVADNSYLDSETLHTSAYVL 394 (498) T ss_pred eeecccc-------cCCchhcCChHHHHHHHhcCcceEEEc-CCeEEEEeeeeeeeecCCCCcchhhhhhhhHHHHHHHH Confidence 2222111 1122233344443333 333455554 788888887776 455677888884 89999999 Q ss_pred HHHHHHHHHHHhcCCCCcccH----HHH-----HHHHHHHHHHHHHhhcCCCcccccee-----Eec-CcccCcHHHHHh Q lcl|Aclame:pro 335 EMLELDLESLQVSDDDVPFTE----DGQ-----AMIEDAIKGTMSGLTGSVGQPLAEYE-----VDV-PEWDDDDVDRVN 399 (426) Q Consensus 335 ~~iq~~l~~ll~~~~KIp~td----~Gi-----~~i~~~v~~~l~~~v~~~g~~~~~y~-----~~~-p~~~~~~~dra~ 399 (426) ..+++.+...+ -..|+-=++ .|. .+|++.+...+++....+ .+..+. 
+.+ ....++ +|-+ T Consensus 395 ~~~r~~i~~kf-pR~KLa~d~~~~~~gq~IvTp~~ir~eli~~y~~le~~g--ivEn~~~~~~~LiVerd~~dp--nRln 469 (498) T protein:vir:44 395 RRLKSVITSKY-GRHKLANDGTRFGSGQAIVTPAVIRGELGSTYRQMEREG--IVENFDLFQQHLIVERNANDS--NRLD 469 (498) T ss_pred HHHHHHhhhhc-CCcccccCCcccCCCcccccHHHHHHHHHHHHHhhhhhc--cccChhhhcceeEEEECCCCC--cEEE Confidence 99999996554 333433221 122 357777777766543221 111111 111 111111 1111 Q ss_pred hcCCCeEEEEEEcccEEEEEEEEEEeC Q lcl|Aclame:pro 400 RNWGGIDLDARLAQRAHTFSLGLNVSV 426 (426) Q Consensus 400 R~~~~i~~~~~laGAIh~v~I~g~v~v 426 (426) +.+-..+.+..|-+-.+..+-+ T Consensus 470 -----~~~p~d~vn~L~V~A~~~~f~l 491 (498) T protein:vir:44 470 -----VLFPPDYVNQLRVFAVLNQFRL 491 (498) T ss_pred -----EEecccccCchhhhhhhhhhhh Confidence 1122222222222222211111 No 36 >protein:vir:5833 Length: 742 # NCBI annotation: similar to tail sheath protein # Family: family:all:661 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835619;genbank:gi:30044022 Probab=96.14 E-value=0.00095 Score=37.14 Aligned_cols=358 Identities=9% Similarity=0.017 Sum_probs=142.2 Q ss_pred CCCceEEEEEeecc---------------ccccccCccceEEEecccccccccchhhhheeecHHHHHhccCCCCHHHHH Q lcl|Aclame:pro 1 MPKQIVEIELTAEI---------------ADRPQETFTDAAIVGTAEEEPPDAEFGEVNQYSTSTSVGDDYGEDSDVYTA 65 (426) Q Consensus 1 mp~~iVnV~isl~t---------------~a~~~~~Fg~~Lilg~~~~~~~~~~~~~~~~Yts~~~V~~Dfg~~sp~YkA 65 (426) .|+..-+|.|-+.- .+-...+|- +++.+. ..+...- -+..|. ...+. .|.+.. 
T Consensus 350 ~~~~~~~v~~~~t~~~~~pp~~~~~~e~v~~ngG~~f~---v~s~~~-~g~~i~~-~~as~~-----~s~ln--~~~~V~ 417 (742) T protein:vir:58 350 IVNELTNVSIPVTDSAIIPPMRFTRIEQITLSGGASFS---VISNQP-YGFNIQD-SRHSYW-----LSPFK--DDELII 417 (742) T ss_pred eccccccceeeccccccCCcccccccceeecccCcceE---EEEecc-cCcceec-cCcceE-----EeccC--CceEEE Confidence 33322222222211 001111121 111111 0000000 000011 00111 111100 Q ss_pred HH---HHHhcCCceeeeeeccccccccccccccceeccceeecccccccchhhhhhhcccccccccceeeeeecccccee Q lcl|Aclame:pro 66 SE---AIEEMGAEQWRVMVLEATEVTEEELSDGDTIDKVPILGNHEVESPDGDIEFTTDDDPDVEDFDAEIVINSATGDV 142 (426) Q Consensus 66 A~---~~f~Q~~~~~~~~v~~~t~v~~~~~~~~~tv~~~~~s~~~~~~~ta~~i~~~~~~~~~~t~~~~~~~~~~~~g~~ 142 (426) .. +-+.+. .....+.. ................+.+... .......+.. +.. T Consensus 418 Gt~aa~~~~d~--------~t~~~v~s--~~~alp~~a~sv~laGG~dg~v----~v~~~~~D~i------------G~~ 471 (742) T protein:vir:58 418 GTELVLPALDV--------STEFGVSS--WEEALPEFSFLMPFQGGSDGYI----RVDENEPDTI------------GRV 471 (742) T ss_pred eehhhcccccc--------chheeccc--cccccceeeEEEeecCCccccc----cccCCCcccc------------ccc Confidence 00 000000 00000000 0000000000000000000000 0000000000 000 Q ss_pred eechhheeeecccccchhhhhhh--ccccceeecccccchhhhHhHhhhhhhhhhcceEEEEEecccccccchhhHHHHH Q lcl|Aclame:pro 143 ATSEDSIELTYFHADWSQLDEFP--SDVNNFAVADRRFDLKGVGVLDETHSWASDEDMGMIANGVNVDDYDSVDEAMDVA 220 (426) Q Consensus 143 t~~~~~~~~~~~~~d~~~~~~~~--s~~~~~~la~~~~~~~~~~~~~~~~~wa~~~~kl~~~~~~d~~~~~~~~~~~~~a 220 (426) ..... ...+..++..+. ..++...++... .-.....+....+- .+..+++..... ..+... 
....+ T Consensus 472 -~~~d~-----~~adrTGL~ALlev~eVtILiAPG~t-~~~v~aav~A~la~--a~~Rl~vL~D~P--~~~tt~-~~A~a 539 (742) T protein:vir:58 472 -KITPA-----LLANYERLLPLLTEDQFDLVLTPYLT-FADHAGTVNAFINR--AENRFLYLFDIA--GDDDTE-NLAIS 539 (742) T ss_pred -ccccc-----cccchhHHHHhhhcCCCcEEEEcCCC-chHHHHHHHHHHHh--hcCCeEEEEecC--CCCchH-HHHHH Confidence 00000 000111111111 011111122110 00111111121111 122333322111 111111 11111 Q ss_pred HHhhccCcceEEEEecC---C-----CccchhHHHHHHHhhhcccccceeecccccceeeccccccccccccchhHHHHh Q lcl|Aclame:pro 221 HEVAGYVPSGDLMMIVD---A-----SDDDLAAYQLGKFAVSEPWYNPLWNELPAGETVSKNVGDPEEQGTFEGGDEAEG 292 (426) Q Consensus 221 ~~~a~~~~rt~~~~~~~---~-----~~~~~~aa~~g~~~~~~p~~~~~~~~~~~~~~~~~~k~~~gv~~~~~~~~~~~~ 292 (426) ....-...+..+ ++.- . ...-|.++++|.++..+.-+.. |+.. .++...+. .....+|...| T Consensus 540 ~r~~~nSsraal-y~PwVkv~d~~~~r~vPpSgaIAGL~ARtD~erGv-w~SP-------ANrgii~~-~~~s~se~d~L 609 (742) T protein:vir:58 540 LAGYINSSFATT-FFPWVRRLTNKGMRTVPASLAAYRSIRTTDPETGL-APVG-------ARRGVVTG-EPVRQVDWEDL 609 (742) T ss_pred HHhccCCceEEE-EeceeeeccCCcceeechHHHHHHHHHHhccCCce-EecC-------Ccceeeec-cccchhhHHHH Confidence 111111122222 2211 0 1123567888888887753221 2211 12222111 12345666777 Q ss_pred hcCc-cEEEEEcCCEEEeeceeecCcccCcceeehhhhHHHHHHHHHHHHHHHHhcCCCCcccHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 293 EGPV-NVLIDVSDANRVSNAVTTAGADSDTSFFDIRRTKVYTAEMLELDLESLQVSDDDVPFTEDGQAMIEDAIKGTMSG 371 (426) Q Consensus 293 ~~~~-N~~~~~~g~~~~~~~~t~~G~~~sg~~iD~i~g~dwl~~~iq~~l~~ll~~~~KIp~td~Gi~~i~~~v~~~l~~ 371 (426) +.+. |.++..+++..+|-+.|..+.+..-.||-++|..|||+..|+..++..+-. |.+..-...|+..|++-|+. 
T Consensus 610 N~~GINtIrsfG~G~rlWGnRTlassDs~wryInVRRlfd~Ie~SI~~a~q~~VfE----PNd~~L~~sIk~sInafL~~ 685 (742) T protein:vir:58 610 YNNRINPIVRVGNDVLLFGQKTMLNVNSALNRINVRRLLIVMRNRISQILSSYLFE----NNTSENRLRAEALVRQYLES 685 (742) T ss_pred hhCCceEEEECCCcEEEEcceecCCCCcccceEeehhhHHHHHHHHHHHHHHhccC----CCCHHHHHHHHHHHHHHHHH Confidence 6554 898888778888877777676655679999999999999999988776532 77888889999999999998 Q ss_pred hhcCCCccccceeEecCcccCcHHHHHhhcCCCeEEEEEEcccEEEEEEEEEEeC Q lcl|Aclame:pro 372 LTGSVGQPLAEYEVDVPEWDDDDVDRVNRNWGGIDLDARLAQRAHTFSLGLNVSV 426 (426) Q Consensus 372 ~v~~~g~~~~~y~~~~p~~~~~~~dra~R~~~~i~~~~~laGAIh~v~I~g~v~v 426 (426) -+++|. + -+|.+... ....+.|+.+-++. +++.+...-.++++.++..++- T Consensus 686 L~aqGA-L-lGfrV~lD-etNTpeDI~~Gklv-v~I~vAP~~PAEfI~lrf~it~ 736 (742) T protein:vir:58 686 LRLRGA-V-TDYEVAID-SVTTPTDIDNNTLR-ARVTVQPARSIEYIDITFVITP 736 (742) T ss_pred HHhCCc-e-eeeEEEEc-CCCCHHHhhCCEEE-EEEEEEccCCcceEEEEEEEEe Confidence 887553 3 35887664 34555677664444 7777788889999998887765 No 37 >protein:vir:4517 Length: 498 # NCBI annotation: tail sheath protein # Family: family:all:369 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599043;genbank:gi:19549001;genbank:GeneID:935236 Probab=96.04 E-value=0.0011 Score=36.83 Aligned_cols=397 Identities=15% Similarity=0.068 Sum_probs=175.4 Q ss_pred CCCceEEEEEeeccccccccCccceEEEecccccccccchhhhheeecHHHHHhccCCCCHHHHHHHHHHhcCCc--eee Q lcl|Aclame:pro 1 MPKQIVEIELTAEIADRPQETFTDAAIVGTAEEEPPDAEFGEVNQYSTSTSVGDDYGEDSDVYTASEAIEEMGAE--QWR 78 (426) Q Consensus 1 mp~~iVnV~isl~t~a~~~~~Fg~~Lilg~~~~~~~~~~~~~~~~Yts~~~V~~Dfg~~sp~YkAA~~~f~Q~~~--~~~ 78 (426) .|--.++++=++.+.+... .-.||||-... 
.-........|-+|-++...=||..|-.+.++++|....+- .+- T Consensus 14 vP~~y~E~dns~A~~~~~~---q~vLiiGq~la-~gs~~~~~~v~v~s~~~a~~lfG~GSml~~M~~a~~~~n~~~~l~~ 89 (498) T protein:vir:45 14 VPLFYAEMDNQAANTAQDS---GASLLIGHANN-GAEIVANSLVLMPSADYARQICGAGSQLARMVEAYRQTDPFGELYV 89 (498) T ss_pred cCeEEEEEeCCCCCCCCCC---cceEEEEecCC-ccccccceeEEecCHHHHHHhcCcCcHHHHHHHHHHHhCCcceEEE Confidence 3334555555665554433 35899985422 11122345667788899999999999999999999876422 111 Q ss_pred eeeccccc------ccccc-----ccccceeccceeecccccccchhhhhhhcccccccccceeeeeec---------cc Q lcl|Aclame:pro 79 VMVLEATE------VTEEE-----LSDGDTIDKVPILGNHEVESPDGDIEFTTDDDPDVEDFDAEIVIN---------SA 138 (426) Q Consensus 79 ~~v~~~t~------v~~~~-----~~~~~tv~~~~~s~~~~~~~ta~~i~~~~~~~~~~t~~~~~~~~~---------~~ 138 (426) ..+.+++. ++..+ .+-...|.+..+...-...+++..+...+.+.....+.-.+++.. .- T Consensus 90 i~~~d~aG~aA~g~it~tg~at~~G~l~l~Igg~~v~v~V~~gdTaa~vA~al~aaina~~~lPVTA~~~~~~VtlTAr~ 169 (498) T protein:vir:45 90 IAVPEATGAAATVTLTVTGEATESGTVNVYVGRTRVQAPVTNGDNVTTIASSIQDAINAVPTLPFTASSSAGVVTLTARH 169 (498) T ss_pred EeeCCcccceeEEEEEeecccCCCcEEEEEECCEEEEEEecCCCCHHHHHHHHHHHHhCCCCCceEEEecCceEEEEeec Confidence 12222221 11111 122234566666655555677777777666655543222222211 11 Q ss_pred cceeeechhheeeeccc-ccchhhhhhhccc----------cce----eecccccc-----hhhhHhHhhhh-------- Q lcl|Aclame:pro 139 TGDVATSEDSIELTYFH-ADWSQLDEFPSDV----------NNF----AVADRRFD-----LKGVGVLDETH-------- 190 (426) Q Consensus 139 ~g~~t~~~~~~~~~~~~-~d~~~~~~~~s~~----------~~~----~la~~~~~-----~~~~~~~~~~~-------- 190 (426) .|...+.......-+.. .+......+.-++ |.. .+.+.+.. |.+...+..+. 
T Consensus 170 kG~~GN~I~l~~~~~~~~~ge~~p~Glt~~itamagGag~PD~a~alaal~~~~~~~I~~p~~D~asL~al~~~L~~~sg 249 (498) T protein:vir:45 170 KGLCGNEIPVSLNYYGFGGGEVLPAGVQIAVATGTAGTGAPVLTGAVAAMADEPFDYIGLPFNDTASVNTLVTEMNDTSG 249 (498) T ss_pred cCccccceeEEEeeccccccccccceeeEEEEccCCCccCchhHHHHHHhccCCccEEEEeeCCHHHHHHHHHHHhhhhh Confidence 12222111111111100 0000000000000 000 01111110 01111111111 Q ss_pred h--hhhhcceEEEEEecccccccchhhHHHHHHHhhccCcceEEEEecCC-Ccc--chhHHHHHHHh---hhccccccee Q lcl|Aclame:pro 191 S--WASDEDMGMIANGVNVDDYDSVDEAMDVAHEVAGYVPSGDLMMIVDA-SDD--DLAAYQLGKFA---VSEPWYNPLW 262 (426) Q Consensus 191 ~--wa~~~~kl~~~~~~d~~~~~~~~~~~~~a~~~a~~~~rt~~~~~~~~-~~~--~~~aa~~g~~~---~~~p~~~~~~ 262 (426) - |.+.-+-+.+..... .-......-...+.++..|+++... .++ ...+++.++++ .+||-+++.- T Consensus 250 Rw~~~~q~~g~~~~a~~g-------T~~~l~t~g~~~N~~~it~~~~~~~~~sp~~~~AAa~aa~~A~~l~~DPArPL~t 322 (498) T protein:vir:45 250 RWSYARQLYGHVYTAKTG-------TLSELVNAGDQFNQQHITLAGYEKETQTPADELAASRTARAAVFIRNDPARPTQT 322 (498) T ss_pred hhhHHhhcCeEEEEeccC-------CHHHHHHhhhccCCceEEEEecCCCCCChHHHHHHHHHHHHHHHhhcccccccCc Confidence 1 233333333333222 1222333344455667777776443 222 34455555555 6888776532 Q ss_pred ecccccceeeccccccccccccchhHHHHh--hcCccEEEEEcCCEEEeeceee----cCcccCcceeeh--hhhHHHHH Q lcl|Aclame:pro 263 NELPAGETVSKNVGDPEEQGTFEGGDEAEG--EGPVNVLIDVSDANRVSNAVTT----AGADSDTSFFDI--RRTKVYTA 334 (426) Q Consensus 263 ~~~~~~~~~~~~k~~~gv~~~~~~~~~~~~--~~~~N~~~~~~g~~~~~~~~t~----~G~~~sg~~iD~--i~g~dwl~ 334 (426) ..+++.. -|...+.+.-.|...+ ++-.-.++. +|...|-+.+|+ +-|..|-.|.|+ +|-.+|+. 
T Consensus 323 l~L~Gi~-------~p~~~~r~~~~ern~LL~~Gist~~V~-~G~V~I~R~ITTY~~n~~G~~D~syLDi~T~~tl~yvr 394 (498) T protein:vir:45 323 GELVGML-------PAPKGKRFTMTEQQTLLSHGVATAYVE-SGVLRIQRDVTTYRKNAYGVADNSYLDSETLHTSAYVL 394 (498) T ss_pred eeeccee-------cCCchhcCChHHHHHHHhCCcceEEEc-CCeEEEEeeeeeeeecCCCCcchhhhhhhhHHHHHHHH Confidence 2222211 1122233444443333 333455554 788888887776 455677888884 89999999 Q ss_pred HHHHHHHHHHHhcCCCCcccHHH---------HHHHHHHHHHHHHHhhcCCCccccce------eEecCcccCcHHHHHh Q lcl|Aclame:pro 335 EMLELDLESLQVSDDDVPFTEDG---------QAMIEDAIKGTMSGLTGSVGQPLAEY------EVDVPEWDDDDVDRVN 399 (426) Q Consensus 335 ~~iq~~l~~ll~~~~KIp~td~G---------i~~i~~~v~~~l~~~v~~~g~~~~~y------~~~~p~~~~~~~dra~ 399 (426) ..+++.+...+- ..|+.=++.. -.+|++.+...+++....+ .+..+ -+.-...+++ +|-+ T Consensus 395 ~~~r~~i~~kfp-R~KLa~dg~~~~~gq~IvTp~~ir~ell~~y~~le~~g--ivEn~~~~~~~LiVerd~~dp--nRln 469 (498) T protein:vir:45 395 RKLKSVITSKYG-RHKLASDGTRFGPGQAIVTPAVIKGELLATYRQLERAG--IVENYELFKQYLVVERDASVP--NRLN 469 (498) T ss_pred HHHHHHhhhhcC-CeeecccCcccCCCCcccchHHHHHHHHHHHHhhhhhc--cccChhhhcceeEEEECCCCC--cEEE Confidence 999999887662 3343322221 1356777777666543321 11111 1111111111 1111 Q ss_pred hcCCCeEEEEEEcccEEEEEEEEEEeC Q lcl|Aclame:pro 400 RNWGGIDLDARLAQRAHTFSLGLNVSV 426 (426) Q Consensus 400 R~~~~i~~~~~laGAIh~v~I~g~v~v 426 (426) +.+-..+.+..|-+-.+..+-+ T Consensus 470 -----~~~p~d~vn~L~V~A~~~~f~l 491 (498) T protein:vir:45 470 -----TLFPPDYVNQLRVFAVVNQFRL 491 (498) T ss_pred -----EEecccccCchhhhhhhhhhhe Confidence 1111222222222211111111 No 38 >protein:vir:489 Length: 498 # NCBI annotation: putative sheath protein # Family: family:all:369 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543096;swissprot:trembl:q8w623;genbank:gi:18249908;uniprot:Q8W623;genbank:GeneID:929698 Probab=95.83 E-value=0.0014 Score=36.24 Aligned_cols=397 Identities=13% Similarity=0.070 Sum_probs=177.1 Q ss_pred CCCceEEEEEeeccccccccCccceEEEecccccccccchhhhheeecHHHHHhccCCCCHHHHHHHHHHhcCCc--eee Q lcl|Aclame:pro 1 
MPKQIVEIELTAEIADRPQETFTDAAIVGTAEEEPPDAEFGEVNQYSTSTSVGDDYGEDSDVYTASEAIEEMGAE--QWR 78 (426) Q Consensus 1 mp~~iVnV~isl~t~a~~~~~Fg~~Lilg~~~~~~~~~~~~~~~~Yts~~~V~~Dfg~~sp~YkAA~~~f~Q~~~--~~~ 78 (426) .|--.++++-++...+.+. .-.||||-.... -........|-+|-++...=||..|-.+.+++++....+- .+- T Consensus 14 vP~~y~E~dns~A~~~~~~---qrvLiiGq~la~-gt~~~~~~v~v~s~~~a~~~fG~GS~l~~M~~a~~~~n~~~~l~~ 89 (498) T protein:vir:48 14 VPLFYAEMDNSAANTAVTS---APALLIGHASND-AAIEVNSLVLMPSADYARQICGAGSQLARMVDVYRQTDPFGELYV 89 (498) T ss_pred cceEEEEEecCCCccccCC---cceEEEeecCcc-ccccccceEEecCHHHHHHhcCcccHHHHHHHHHHHhCCCceeEE Confidence 4445567766776666554 248999954321 1112345667778899999999999999999999876422 222 Q ss_pred eeecccccc------ccc-----cccccceeccceeecccccccchhhhhhhcccccccccceeeeeec---------cc Q lcl|Aclame:pro 79 VMVLEATEV------TEE-----ELSDGDTIDKVPILGNHEVESPDGDIEFTTDDDPDVEDFDAEIVIN---------SA 138 (426) Q Consensus 79 ~~v~~~t~v------~~~-----~~~~~~tv~~~~~s~~~~~~~ta~~i~~~~~~~~~~t~~~~~~~~~---------~~ 138 (426) ..+.+++.+ +.. ..+-...|.+..+...-...+++..+...+.+.....+.-.+++.. .- T Consensus 90 i~~~D~ag~aA~g~it~tg~at~~G~l~l~Igg~~v~v~V~~gdTaa~vA~al~aai~a~~~lPVTA~~~~~~VtlTAr~ 169 (498) T protein:vir:48 90 IAVPEARGAAATVRVTVTGEAEESGTLSLYVGRSSVQVPVVNGDDATAVATAIKEAVNGVITLPFAASSDAGVVTLTARH 169 (498) T ss_pred EeeCCcccceeEEEEEecccccCCceEEEEECCEEEEEeecCCCCHHHHHHHHHHHHhCCCCcceEEEecCcEEEEEeee Confidence 222222211 111 1122244566666655555677777777766655543322222211 11 Q ss_pred cceeeechhheeeeccc--c--cchhhhh-------hhccccce-e---ecccccc-----hhhhHhHhhhh-------- Q lcl|Aclame:pro 139 TGDVATSEDSIELTYFH--A--DWSQLDE-------FPSDVNNF-A---VADRRFD-----LKGVGVLDETH-------- 190 (426) Q Consensus 139 ~g~~t~~~~~~~~~~~~--~--d~~~~~~-------~~s~~~~~-~---la~~~~~-----~~~~~~~~~~~-------- 190 (426) .|...+.......-+.. + ...++.. ...+-|.. + +.+.+.. |.+...+..+. 
T Consensus 170 kG~~GN~I~l~~~~~~~~~ge~~p~Glt~~itamsgGag~PDia~aLaal~~~~~~~I~~p~~D~asl~al~~~L~~~sg 249 (498) T protein:vir:48 170 KGLYGNELPVCLNYYGSGGGEILPAGLQVVTEAGTAGSGAPDLTAAVAAMGDEAFDFIGLPFNDAASINMMMTEMNDSSG 249 (498) T ss_pred cccccccceeeeeeccCcccccccceeeEEEEcccCCccCcchHHHHHhhccCCccEEEEeecCHHHHHHHHHHHhhhhh Confidence 12222111111111100 0 0001100 00000000 0 1111111 01111111111 Q ss_pred h--hhhhcceEEEEEecccccccchhhHHHHHHHhhccCcceEEEEecCCCc-c--chhHHHHHHHh---hhccccccee Q lcl|Aclame:pro 191 S--WASDEDMGMIANGVNVDDYDSVDEAMDVAHEVAGYVPSGDLMMIVDASD-D--DLAAYQLGKFA---VSEPWYNPLW 262 (426) Q Consensus 191 ~--wa~~~~kl~~~~~~d~~~~~~~~~~~~~a~~~a~~~~rt~~~~~~~~~~-~--~~~aa~~g~~~---~~~p~~~~~~ 262 (426) - |.+.-+-+.+..... .-......-...+.++..|+++....+ + ...+++.++++ .+||-+++.- T Consensus 250 Rw~~~~q~~g~~~~a~~g-------T~~~l~t~g~~~N~~~it~~~~~~~~~~p~~~~AAa~a~~aA~~l~~DPArPLqt 322 (498) T protein:vir:48 250 RWSYARQLYGHVYTAKLG-------TLSELVNAGDMHNQQHITLAGYEKETQSPVDELVASRLAREAVFIRNDPARPTQT 322 (498) T ss_pred hhhHHhhcCeEEEEeccC-------CHHHHHHhhhccCCceEEEEecCCCCCChHHHHHHHHHHHHHHhhhccccccccc Confidence 1 233333333333322 122233334445566777777665543 2 12344445544 6788766422 Q ss_pred ecccccceeeccccccccccccchhHHHH-h-hcCccEEEEEcCCEEEeeceee----cCcccCcceeeh--hhhHHHHH Q lcl|Aclame:pro 263 NELPAGETVSKNVGDPEEQGTFEGGDEAE-G-EGPVNVLIDVSDANRVSNAVTT----AGADSDTSFFDI--RRTKVYTA 334 (426) Q Consensus 263 ~~~~~~~~~~~~k~~~gv~~~~~~~~~~~-~-~~~~N~~~~~~g~~~~~~~~t~----~G~~~sg~~iD~--i~g~dwl~ 334 (426) ..+++.. -|...+.+.-.|... | ++-.-.++ -+|...|-+.+|+ +-|..|-.|.|+ +|-.+|+. 
T Consensus 323 l~L~Gi~-------~p~~~~r~~~~ern~LL~~Gist~~V-~~G~V~I~R~ITTY~~n~~G~~D~syLDi~T~~tl~yvr 394 (498) T protein:vir:48 323 GELVGML-------PAPKGKRFIMTEQQTLLSHGVATAYV-EGGTLRIQRSVTTYKKNAYGVADNSYLDSETLHTSAYVL 394 (498) T ss_pred eeeeccc-------cCCchhcCChHHHHHHHhcCcceEEE-cCCeEEEEeeeeeeeecCCCCcchhhhhhhhHHHHHHHH Confidence 2221111 112223333344333 3 33345555 5788888887776 455677788884 89999999 Q ss_pred HHHHHHHHHHHhcCCCCcccHHH---------HHHHHHHHHHHHHHhhcCCCcccccee------EecCcccCcHHHHHh Q lcl|Aclame:pro 335 EMLELDLESLQVSDDDVPFTEDG---------QAMIEDAIKGTMSGLTGSVGQPLAEYE------VDVPEWDDDDVDRVN 399 (426) Q Consensus 335 ~~iq~~l~~ll~~~~KIp~td~G---------i~~i~~~v~~~l~~~v~~~g~~~~~y~------~~~p~~~~~~~dra~ 399 (426) ..+++.+...+- ..|+.=++.+ -.+|++.+...+++....+ .+..+. +.-....++ +|-+ T Consensus 395 ~~~r~~i~~kfp-R~KLa~dg~~~~~gq~IvTp~~ir~eli~~y~~le~~g--iven~~~~~~~LiVerd~~dp--nRln 469 (498) T protein:vir:48 395 RKLKSVITSKYG-RHKLANDGTRFGPGQAIVTPAVIKGELLATYRQMERAG--IVENYDLFKQYLIVERDADNP--NRLN 469 (498) T ss_pred HHHHHHhhhhcC-CceecccCcccCCCCcccchHHHHHHHHHHHHhhhhhc--cccChhhhcceeEEEECCCCC--cEEE Confidence 999999887663 3343322221 1357777777766543321 111111 110111111 1111 Q ss_pred hcCCCeEEEEEEcccEEEEEEEEEEeC Q lcl|Aclame:pro 400 RNWGGIDLDARLAQRAHTFSLGLNVSV 426 (426) Q Consensus 400 R~~~~i~~~~~laGAIh~v~I~g~v~v 426 (426) +.+-..+.+..|-+-.+..+-+ T Consensus 470 -----~~~p~d~vn~L~V~A~~~~f~l 491 (498) T protein:vir:48 470 -----TLFPPDYVNQLRVFAVVNQFRL 491 (498) T ss_pred -----EEecccccCchhhhhhhhhhhh Confidence 1111222222222211111111 No 39 >protein:vir:100539 Length: 663 # NCBI annotation: gp18 tail sheath monomer # Family: family:all:661 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656380;genbank:gi:109290131;genbank:GeneID:4156517 Probab=95.61 E-value=0.0018 Score=35.69 Aligned_cols=412 Identities=9% Similarity=-0.006 Sum_probs=149.7 Q ss_pred CCCceEEEEEeec----cccccccCc---cceEEEecccccccccchhhh---heeec--H-HHHHhccCCCCHHHHHHH Q lcl|Aclame:pro 1 
MPKQIVEIELTAE----IADRPQETF---TDAAIVGTAEEEPPDAEFGEV---NQYST--S-TSVGDDYGEDSDVYTASE 67 (426) Q Consensus 1 mp~~iVnV~isl~----t~a~~~~~F---g~~Lilg~~~~~~~~~~~~~~---~~Yts--~-~~V~~Dfg~~sp~YkAA~ 67 (426) +..... ..+... ..+..-.++ +...+................ ..+.- + .....++|..-+...... T Consensus 163 ~~~a~~-~~v~~~~~~~~~a~av~~i~~dg~vt~~~~~~a~~~~~~~~~~~~~~~~~~~~~~a~~~g~~G~~i~v~~~~~ 241 (663) T protein:vir:10 163 LGDNWR-AEVSGASGGSAATLTLGGIVVDSGVTFGNSEEAPDVMTSTKVLANFAKYGMPLISAVYPGEIGSTVEVEVISK 241 (663) T ss_pred ccccee-eEEeeccccccccceeEeeecCCceeEEeeeccccccccceeeeeccccccceeeeecccccCcceeEeeccc Confidence 221111 111111 111000000 000000000000000000000 00000 0 000011222222221111 Q ss_pred HHHhcCCc-eeeee----ecccccccccc-c----------cccceeccceeeccccccc---chhhhhhhcccccccc- Q lcl|Aclame:pro 68 AIEEMGAE-QWRVM----VLEATEVTEEE-L----------SDGDTIDKVPILGNHEVES---PDGDIEFTTDDDPDVE- 127 (426) Q Consensus 68 ~~f~Q~~~-~~~~~----v~~~t~v~~~~-~----------~~~~tv~~~~~s~~~~~~~---ta~~i~~~~~~~~~~t- 127 (426) ..+.++.. ..... ......+...+ . ..+........+....... ....+.+.+....... T Consensus 242 ~~~~~~~~~~v~~~~g~~~~~~~~~~~~g~~~~~~~~~~~~~~g~~~e~~~ls~~~~~~~~~~~~~~~~~~~~~~~s~~v 321 (663) T protein:vir:10 242 TAFQSGAAQPIYPFGGTRASNARSVIQYGPMTDDQFAIIVRRDGIVVESTVLSTRRGDRDVYGNNIFMDDYFRNGSSNFI 321 (663) T ss_pred ccccccceeeecccCcccccccccccccccccchhhcccccCCCcccceeeeeccccccccchhhhhhhhhhcCccccee Confidence 11211100 00000 00000000000 0 0000000000010000000 0000111110000000 Q ss_pred cceeeeeeccccceeeechhheeee-cccccch-hhhhhh--ccccce-eecc--cccch-hhhHhHhhhhhhhhhc-ce Q lcl|Aclame:pro 128 DFDAEIVINSATGDVATSEDSIELT-YFHADWS-QLDEFP--SDVNNF-AVAD--RRFDL-KGVGVLDETHSWASDE-DM 198 (426) Q Consensus 128 ~~~~~~~~~~~~g~~t~~~~~~~~~-~~~~d~~-~~~~~~--s~~~~~-~la~--~~~~~-~~~~~~~~~~~wa~~~-~k 198 (426) .+..........+.++.+.+..... ....|.. +...+. ..++.. .++. ...+. ........+...++.. +. 
T Consensus 322 ~~~~~~~~~~~~~~~~l~gg~~~~~~~~~~d~~~~~~~l~~~~~~~~~~~i~~~~~~~~~~~~~~v~~~l~~~~~~~~~~ 401 (663) T protein:vir:10 322 YASSVNWPAGFTGIIQLGGGASANNAVGSDELIAGWDLFADREALHVNLMIAGACKSDGVAVASTVQKHVVALADDRQDC 401 (663) T ss_pred EeeccccCcccceeEEecccccCcccchhhhhhhHHhhhccccccCceEEEeecCCCCchhhHHHHHHHHHHHHHhhCCE Confidence 0000000000011111000000000 0000110 001110 001111 1110 01111 1111122222223332 23 Q ss_pred EEEEEecccccc--cchhhHHHHHH-------------HhhccCcceEEEEec-------CC---CccchhHHHHHHHhh Q lcl|Aclame:pro 199 GMIANGVNVDDY--DSVDEAMDVAH-------------EVAGYVPSGDLMMIV-------DA---SDDDLAAYQLGKFAV 253 (426) Q Consensus 199 l~~~~~~d~~~~--~~~~~~~~~a~-------------~~a~~~~rt~~~~~~-------~~---~~~~~~aa~~g~~~~ 253 (426) +.+......... ........... ...++..+...+++. .. ..--|.++++|.|+- T Consensus 402 ~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~l~~p~~~~~d~~~~~~~~~p~s~~vAGl~Ar 481 (663) T protein:vir:10 402 VAFVNPPSELLVGVPTTQAVKNIVEWRNGVTTGGEVVDNNMNISSTYAFISGNYKYQYDKYNDINRWVPLSADIAGLCAY 481 (663) T ss_pred EEEEecCcccccccchhhhHHHHHHHhhhccccchhhhhhcccCcceEEEEecceeEecccCCceEEechHHHHHHHHHH Confidence 333321111000 00010000000 001111112222222 11 112467888888887 Q ss_pred hc----ccccceeecccccceeeccccccccccccchhHHHHhhcCc-cEEEEEcC--CEEEeeceeecCcccCcceeeh Q lcl|Aclame:pro 254 SE----PWYNPLWNELPAGETVSKNVGDPEEQGTFEGGDEAEGEGPV-NVLIDVSD--ANRVSNAVTTAGADSDTSFFDI 326 (426) Q Consensus 254 ~~----p~~~~~~~~~~~~~~~~~~k~~~gv~~~~~~~~~~~~~~~~-N~~~~~~g--~~~~~~~~t~~G~~~sg~~iD~ 326 (426) .+ ||.++..+.. ...+...+....+...|...|+.+. 
|.++.+-| +..+|-..|+++....-.||=+ T Consensus 482 ~D~~~g~~~span~~~------~~i~g~~~~~~~~~~~~~~~Ln~~gin~i~~~~~~~G~~~wG~rT~s~~~s~~~~i~v 555 (663) T protein:vir:10 482 TDQVGHPWMSPAGYRR------GQLRNTIKLAIEPKQSLRDTMYQVSINPVTGFAGGDGFVLFGDKMATQVPSPFDRINV 555 (663) T ss_pred hhccCCcEEccCCeee------cceeccccceeecCchhHHHHHhCCCcEEEEeeCCCcEEEEcccccCCCCcccceEeh Confidence 66 4433322221 1111112222235556666666444 88877643 5677777777664434469999 Q ss_pred hhhHHHHHHHHHHHHHHHHhcCCCCcccHHHHHHHHHHHHHHHHHhhcCCCccccceeEecCcccCcHHHHHhhcCCCeE Q lcl|Aclame:pro 327 RRTKVYTAEMLELDLESLQVSDDDVPFTEDGQAMIEDAIKGTMSGLTGSVGQPLAEYEVDVPEWDDDDVDRVNRNWGGID 406 (426) Q Consensus 327 i~g~dwl~~~iq~~l~~ll~~~~KIp~td~Gi~~i~~~v~~~l~~~v~~~g~~~~~y~~~~p~~~~~~~dra~R~~~~i~ 406 (426) +|..+||+..|+..++..+-. |.+..-...|+..|+.-|.+-++++. +-+|.+.......+++|+.+-++. +. T Consensus 556 rR~~~~i~~si~~~~~~~v~e----pn~~~l~~~i~~~i~~~L~~l~~~ga--l~gf~V~~d~~~nt~~~i~~G~~~-~~ 628 (663) T protein:vir:10 556 RRLFNMLKKNIGDTSKYELFE----NNDAFTRQSFRMEVSQYLDNIRSLGG--VYDFRVVCDTTNNTPQVIDSNEFV-AT 628 (663) T ss_pred hhHHHHHHHHHHHHHHHhccC----CCCHHHHHHHHHHHHHHHHHHHhCCc--eeeeEEEEcCCCCCHHHhhCCeEE-EE Confidence 999999999999998875532 77888889999999999999887653 445888888777788888887775 89 Q ss_pred EEEEEcccEEEEEEEEEEeC Q lcl|Aclame:pro 407 LDARLAQRAHTFSLGLNVSV 426 (426) Q Consensus 407 ~~~~laGAIh~v~I~g~v~v 426 (426) +.++..-.++++.++...+- T Consensus 629 i~~~p~~pae~I~~~~~~~~ 648 (663) T protein:vir:10 629 IYIKAPRSINYITLNFVATS 648 (663) T ss_pred EEEEecCCcceEEEEEEEEe Confidence 99999999999999877765 No 40 >protein:vir:1996 Length: 495 # NCBI annotation: major tail subunit # Family: family:all:369 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050643;genbank:gi:9633530;genbank:GeneID:2636269 Probab=95.27 E-value=0.0024 Score=34.95 Aligned_cols=398 Identities=12% Similarity=0.068 Sum_probs=188.8 Q ss_pred CCCceEEEEEeeccccccccCccceEEEecccccccccchhhhheeecHHHHHhccCCCCHHHHHHHHHHhcCCc--eee Q lcl|Aclame:pro 1 
MPKQIVEIELTAEIADRPQETFTDAAIVGTAEEEPPDAEFGEVNQYSTSTSVGDDYGEDSDVYTASEAIEEMGAE--QWR 78 (426) Q Consensus 1 mp~~iVnV~isl~t~a~~~~~Fg~~Lilg~~~~~~~~~~~~~~~~Yts~~~V~~Dfg~~sp~YkAA~~~f~Q~~~--~~~ 78 (426) .|--.++++-++.....+. +=.-.||||-.... -...-....|-+|.++...=||..|-.+.+++++....+- .+- T Consensus 15 vP~~y~E~dns~A~~g~~~-~~q~vLiiGq~la~-gs~~~~~pv~v~s~~~a~~~fG~GS~la~M~~a~~~~n~~~~l~~ 92 (495) T protein:vir:19 15 VPLTYIEFDNSNAVSGTPA-PRQRVLMFGQSGSK-ASAAPNVPVRIRSGSQASAAFGQGSMLALMADAFLNANRVAELWC 92 (495) T ss_pred cCeEEEEEccCCCCcCCcC-CCceEEEEEecCcc-cccccceeEEecCHHHHHHhcCcCcHHHHHHHHHHHhCCcceEEE Confidence 3345566666665332222 22348999964221 1111345667778899999999999999999999876422 222 Q ss_pred eeecccccc------ccc-----cccccceeccceeecccccccchhhhhhhcccccccccceeeeeec-------cccc Q lcl|Aclame:pro 79 VMVLEATEV------TEE-----ELSDGDTIDKVPILGNHEVESPDGDIEFTTDDDPDVEDFDAEIVIN-------SATG 140 (426) Q Consensus 79 ~~v~~~t~v------~~~-----~~~~~~tv~~~~~s~~~~~~~ta~~i~~~~~~~~~~t~~~~~~~~~-------~~~g 140 (426) ..+.+++.+ +.. ..+-...|.+..+...-...+|+..+...+.+.....+.-.+++.. +..+ T Consensus 93 i~~~D~aG~aA~g~it~tg~at~~G~l~l~I~g~~v~v~V~~gdTaa~vA~al~aaina~~~lPvTA~~~~~~~~~~a~~ 172 (495) T protein:vir:19 93 IPQGNGTGNAAVGEISLSGTAGENGSLVTYIAGQRLAVSVAAGATGAALADLLVARIKGQPDLPVTAEVRADSGDDDTHA 172 (495) T ss_pred EeeCChhhceeEEEEEEeecCCCCcEEEEEECCEEEEEEecCCCCHHHHHHHHHHHhcCCccCceEEEeeccCCCCcCce Confidence 222222211 111 1122234566666655556677877777666655543222222111 1112 Q ss_pred eee-echhheeeecccccch----hhhhhhcccc--cee----------------ecccccch-----hhhHh------- Q lcl|Aclame:pro 141 DVA-TSEDSIELTYFHADWS----QLDEFPSDVN--NFA----------------VADRRFDL-----KGVGV------- 185 (426) Q Consensus 141 ~~t-~~~~~~~~~~~~~d~~----~~~~~~s~~~--~~~----------------la~~~~~~-----~~~~~------- 185 (426) .++ ++..+.+ ....|.. .-+.....+. ... +.+.++.+ .+... 
T Consensus 173 ~VtlTAr~kG~--~n~idi~~~~~~ge~~p~Glt~titamsgGag~PDia~alaal~~~~~~~I~~P~tD~asL~al~~~ 250 (495) T protein:vir:19 173 DVVLSAKFTGA--LSAVDVRWNYYAGETTPYGIITAFKAASGKNGNPDISASIAGMGDLQYKYIVMPYTDEPNLNLLRTE 250 (495) T ss_pred eEEEEEeeccc--cccceeEEEeecccccccceeEEEEecCCCCCCcchHHHHHHhccCCCcEEEEecCcHHHHHHHHHH Confidence 222 1111111 0011110 0000111110 001 11111110 01111 Q ss_pred HhhhhhhhhhcceEEEEEecccccccchhhHHHHHHHhhccCcceEEEEecCCCccc--hhHHHHHHH---hhhcccccc Q lcl|Aclame:pro 186 LDETHSWASDEDMGMIANGVNVDDYDSVDEAMDVAHEVAGYVPSGDLMMIVDASDDD--LAAYQLGKF---AVSEPWYNP 260 (426) Q Consensus 186 ~~~~~~wa~~~~kl~~~~~~d~~~~~~~~~~~~~a~~~a~~~~rt~~~~~~~~~~~~--~~aa~~g~~---~~~~p~~~~ 260 (426) ++.-..|.+.-+-+.+..... ........-...+.++..|+++..+.++- ..+++.+++ ..+||-+++ T Consensus 251 l~~rw~~~~q~~g~~~~a~~g-------T~~~l~t~g~~~N~~~it~~~~~gsp~~~~~~AAA~aa~~A~~l~~DPArPL 323 (495) T protein:vir:19 251 LQERWGPVNQADGFAVTVLSG-------TYGDISTFGVSRNDHLISCMGIAGAPEPSYLYAATLCAVASQALSIDPARPL 323 (495) T ss_pred HHHhhhHHHhcCeEEEEeecC-------CHHHHHHhhhccCCceEEEEecCCCCCcHHHHHHHHHHHHHHHhhccccccc Confidence 112222334444444443332 11233334445566677777776554432 223444443 357887765 Q ss_pred eeecccccceeeccccccccccccchhHHHHh--hcCccEEEEEcCCEEEeeceee----cCcccCcceeeh--hhhHHH Q lcl|Aclame:pro 261 LWNELPAGETVSKNVGDPEEQGTFEGGDEAEG--EGPVNVLIDVSDANRVSNAVTT----AGADSDTSFFDI--RRTKVY 332 (426) Q Consensus 261 ~~~~~~~~~~~~~~k~~~gv~~~~~~~~~~~~--~~~~N~~~~~~g~~~~~~~~t~----~G~~~sg~~iD~--i~g~dw 332 (426) .-..+++.. 
-|...+-++-.|...+ ++-.-..+.-+|...|-+.+|+ +-|..|-.|.|+ ++-.+| T Consensus 324 ~tl~L~Gi~-------~p~~~~r~~~~ern~LL~~Gist~~V~~~G~V~I~R~ITTY~~n~~G~~D~syLDi~T~~tl~y 396 (495) T protein:vir:19 324 QTLTLPGRM-------PPAVGDRFTWSERNALLFDGISTFNVNDGGEMQIERMITMYRTNKYGDSDPSYLNVNTIATLSY 396 (495) T ss_pred Cceeeccee-------cCCccccCChHHHHHHHhCCcceEEECCCCeEEEEeeeeeeeecCCCCcchhhhhhHHHHHHHH Confidence 322222111 1122233444444333 2333556667788888888776 455678889885 899999 Q ss_pred HHHHHHHHHHHHHhcCCCCcccHHH---------HHHHHHHHHHHHHHhhcCCCccccceeE-----ec-CcccCcHHHH Q lcl|Aclame:pro 333 TAEMLELDLESLQVSDDDVPFTEDG---------QAMIEDAIKGTMSGLTGSVGQPLAEYEV-----DV-PEWDDDDVDR 397 (426) Q Consensus 333 l~~~iq~~l~~ll~~~~KIp~td~G---------i~~i~~~v~~~l~~~v~~~g~~~~~y~~-----~~-p~~~~~~~dr 397 (426) +...+++.+...+-. .|+.=++.+ -.+|++.+...+++....+ .+..+.. .+ ...+++ +| T Consensus 397 vr~~~r~~i~~kfpR-~KLa~d~~~~~~gq~IvTp~~ir~ell~~~~~le~~g--iven~~~~~~~LiVerd~~dp--nR 471 (495) T protein:vir:19 397 LRYSLRTRITQKFPN-YKLASDGTRFATGQAVVTPSVIKTELLALFEEWENAG--LVEDFDTFKEELYVARNKDDK--DR 471 (495) T ss_pred HHHHHHHHHhhhcCC-cccccCCCCCCCcccccChHHHHHHHHHHHHhhhhhc--cccChhhhcceeEEEECCCCC--cE Confidence 999999999876643 333322221 1357777777776654322 2222211 11 111111 22 Q ss_pred HhhcCCCeEEEEEEcccEEEEEEEEEEeC Q lcl|Aclame:pro 398 VNRNWGGIDLDARLAQRAHTFSLGLNVSV 426 (426) Q Consensus 398 a~R~~~~i~~~~~laGAIh~v~I~g~v~v 426 (426) - ++.+-..+.+..|-+-.+..+-+ T Consensus 472 l-----n~~~p~d~vn~L~V~A~~i~f~L 495 (495) T protein:vir:19 472 L-----DVLCGPNLINQFRIFAAQVQFIL 495 (495) T ss_pred E-----EEEecceeeCceeeeeeeeeeeC Confidence 2 34455566666666665555555 No 41 >protein:vir:101187 Length: 663 # NCBI annotation: tail sheath protein # Family: family:all:661 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932509;genbank:gi:37651635;genbank:GeneID:2610680 Probab=95.27 E-value=0.0024 Score=34.95 Aligned_cols=413 Identities=11% Similarity=-0.011 Sum_probs=147.4 Q ss_pred 
CC-------CceEEE----------EEee----ccccccccCcc---ceEEEec---ccccccccchhhhh-eeecHHHH Q lcl|Aclame:pro 1 MP-------KQIVEI----------ELTA----EIADRPQETFT---DAAIVGT---AEEEPPDAEFGEVN-QYSTSTSV 52 (426) Q Consensus 1 mp-------~~iVnV----------~isl----~t~a~~~~~Fg---~~Lilg~---~~~~~~~~~~~~~~-~Yts~~~V 52 (426) .| ...+.+ .++- ...+.....|- ..++... +....... ..... .+...... T Consensus 145 ~~~a~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~v~~vv~~~~~~~~~~~~a~~~~~~~~-~~~~~~~~~~~~~~ 223 (663) T protein:vir:10 145 VPTAEIIAKTRQLGTYPTLGDNWRIDVSGASGGSAAALALGNIVVDSGVTFGNSEDAPAVMTSPA-VMEKYAKFGMPLVS 223 (663) T ss_pred eccccccccccccceeeeccccceeEeeeccccccccccccceecccceeeEeeccccccccccc-hhhhcccccceeee Confidence 11 011111 0000 00000000000 0000000 00000000 00000 00000000 Q ss_pred HhccCCCCHHHH---HHHHHHhcCCceeeeeeccccc-------cc-cc--ccc---------ccceeccceeecccccc Q lcl|Aclame:pro 53 GDDYGEDSDVYT---ASEAIEEMGAEQWRVMVLEATE-------VT-EE--ELS---------DGDTIDKVPILGNHEVE 110 (426) Q Consensus 53 ~~Dfg~~sp~Yk---AA~~~f~Q~~~~~~~~v~~~t~-------v~-~~--~~~---------~~~tv~~~~~s~~~~~~ 110 (426) +...|....... .+...+.++. ...+..... .+ .. ... ++......+.+...+.. T Consensus 224 a~~~G~~Gn~i~v~i~~~~~~~~~~---~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~ 300 (663) T protein:vir:10 224 AVYPGEIGSTVEVEIVSKTAFNSGA---QQTIYPFGGTRTSNARGVIQYGPMTDDQFAIIVRRDGIVVESTVLSTRKGDR 300 (663) T ss_pred eecccccccceeEEecccccccccc---cccccccccccccccceeeeeccccccceeEEEecCCcceeeeeeeeccccc Confidence 000000000000 0000000000 000000000 00 00 000 00000000111111000 Q ss_pred c---chhhhhhhccccccc-ccceeeeeeccccceeeechhhe-eeecccccch----hhhhhhc-cccceeecccc-cc Q lcl|Aclame:pro 111 S---PDGDIEFTTDDDPDV-EDFDAEIVINSATGDVATSEDSI-ELTYFHADWS----QLDEFPS-DVNNFAVADRR-FD 179 (426) Q Consensus 111 ~---ta~~i~~~~~~~~~~-t~~~~~~~~~~~~g~~t~~~~~~-~~~~~~~d~~----~~~~~~s-~~~~~~la~~~-~~ 179 (426) . +...+...+...... ..+.........++.+....+.. .......+.. .+..... .++...+.... .. 
T Consensus 301 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~gg~d~~~~~~~~~~~~~~~~l~~~~~~~~~~~i~~~~~~~~ 380 (663) T protein:vir:10 301 DVYGSNIFMDDYFRNGGSNFIFASSEGWPAGFTGIIQLGGGTSANADVGADELIKGWDLFSDREALHVNLMIAGACGSDG 380 (663) T ss_pred ccchhhhhhhhhhccCcceEEEEeecccCccccceeEcccccCCCccccchhhHHHHHhhhcccccceeEEEeccCCCCc Confidence 0 000000000000000 00000000000011010000000 0000111111 1110000 11111111000 00 Q ss_pred h-hhhHhHhhhhhhhhhc-ceEEEEEecccccc---cc--hhhHHHHHHHhh---------ccCc-ceEEEEec------ Q lcl|Aclame:pro 180 L-KGVGVLDETHSWASDE-DMGMIANGVNVDDY---DS--VDEAMDVAHEVA---------GYVP-SGDLMMIV------ 236 (426) Q Consensus 180 ~-~~~~~~~~~~~wa~~~-~kl~~~~~~d~~~~---~~--~~~~~~~a~~~a---------~~~~-rt~~~~~~------ 236 (426) . ........+...++.. +++.+......... .. ............ .++. +-..+|++ T Consensus 381 ~~~~~~v~~al~~~a~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~l~~P~~~~~d 460 (663) T protein:vir:10 381 AEIASTVQKYVVSLADDRQDCVAIVNPPAELMVGIPTSTAVKNIVEWRNGMTGSGEVVDNNMNISSTYAFIIGNYKYQYD 460 (663) T ss_pred hhhHHHHHHHHHHHHHhhCCEEEEEecCcccccccccccchHHHHHHHHhccccccchhhccccCcceEEEEccceEEec Confidence 0 0011112222223332 24443322211100 00 111111111000 0011 11122221 Q ss_pred -CC---CccchhHHHHHHHhhhcccccceeecccccceeeccccccccccccchhHHHHhhcCc-cEEEEEc--CCEEEe Q lcl|Aclame:pro 237 -DA---SDDDLAAYQLGKFAVSEPWYNPLWNELPAGETVSKNVGDPEEQGTFEGGDEAEGEGPV-NVLIDVS--DANRVS 309 (426) Q Consensus 237 -~~---~~~~~~aa~~g~~~~~~p~~~~~~~~~~~~~~~~~~k~~~gv~~~~~~~~~~~~~~~~-N~~~~~~--g~~~~~ 309 (426) .. ...-|.++++|.|+-.+.-+.+ |+ -+.|..........++...+...|...|+.+. 
|.++.+- ++..+| T Consensus 461 ~~~~~~~~~p~s~~vAGl~Ar~D~~~g~-~~-sPan~~~~~i~g~~~~~~~~~~~~~~~Ln~~gin~i~~~~~~~G~~~w 538 (663) T protein:vir:10 461 KYNDINRWVPLAADIAGLCAYTDQVSHP-WM-SPAGYRRGQIRNCIKLAIEPKQSMRDTMYQVAINPVTGFAGGDGFVLF 538 (663) T ss_pred ccCCceEEechhHHHHHHHHHhhccCCc-eE-ccCCceeccccccccceeccChhHHHHHhhCCceEEEEEeCCCcEEEE Confidence 11 1224678888888877643321 11 11221111111222233346667777776544 8887764 467777 Q ss_pred eceeecCcccCcceeehhhhHHHHHHHHHHHHHHHHhcCCCCcccHHHHHHHHHHHHHHHHHhhcCCCccccceeEecCc Q lcl|Aclame:pro 310 NAVTTAGADSDTSFFDIRRTKVYTAEMLELDLESLQVSDDDVPFTEDGQAMIEDAIKGTMSGLTGSVGQPLAEYEVDVPE 389 (426) Q Consensus 310 ~~~t~~G~~~sg~~iD~i~g~dwl~~~iq~~l~~ll~~~~KIp~td~Gi~~i~~~v~~~l~~~v~~~g~~~~~y~~~~p~ 389 (426) -..|+++....-.||=++|..+||.+.|+..++..+-. |.+..-...|+..|+.-|.+-++++. +-+|.+.... T Consensus 539 G~rT~s~~~s~~~~i~vrR~~~~i~~si~~~~~~~v~e----~n~~~l~~~i~~~i~~~L~~l~~~ga--l~g~~v~~d~ 612 (663) T protein:vir:10 539 GDKMATQVPSPFDRINVRRLFNMLKKNIGDTSKYELFE----NNDAFTRQSFRMETSQYLDGIRSLGG--CYDFRVVCDT 612 (663) T ss_pred cccccCCCCcccceEehhhHHHHHHHHHHHHHHHhccC----CCCHHHHHHHHHHHHHHHHHHHhcCc--eeeeEEEEcC Confidence 77777765444568999999999999999998876532 77888888999999999999887553 4459888887 Q ss_pred ccCcHHHHHhhcCCCeEEEEEEcccEEEEEEEEEEeC Q lcl|Aclame:pro 390 WDDDDVDRVNRNWGGIDLDARLAQRAHTFSLGLNVSV 426 (426) Q Consensus 390 ~~~~~~dra~R~~~~i~~~~~laGAIh~v~I~g~v~v 426 (426) ...+++|+.+-++. 
+.+.+.....++++.++...+- T Consensus 613 ~~nt~~~i~~G~~~-~~i~~~p~~pae~i~~~~~~~~ 648 (663) T protein:vir:10 613 TNNTPNVIDRNEFV-GTIYVKPPRSINYITLNMVATS 648 (663) T ss_pred CCCCHHHhhCCeEE-EEEEEEecCCcceEEEEEEEee Confidence 77788898887776 8889999999999998866554 No 42 >protein:vir:80984 Length: 666 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469499;genbank:gi:157311456;genbank:GeneID:5602117 Probab=94.84 E-value=0.0034 Score=34.14 Aligned_cols=412 Identities=9% Similarity=-0.024 Sum_probs=145.4 Q ss_pred CCC--ceE--E--EEEeecccccc--------ccCccceEEEecc-cccccccchhhhh---eeecHHHHHh-ccCCC-C Q lcl|Aclame:pro 1 MPK--QIV--E--IELTAEIADRP--------QETFTDAAIVGTA-EEEPPDAEFGEVN---QYSTSTSVGD-DYGED-S 60 (426) Q Consensus 1 mp~--~iV--n--V~isl~t~a~~--------~~~Fg~~Lilg~~-~~~~~~~~~~~~~---~Yts~~~V~~-Dfg~~-s 60 (426) .+. ..+ + ..+........ -.+.+........ ........+.... ......++.. ..|.. + T Consensus 156 ~~~~~~~v~~~~~~~~~~~~~~~~~a~~V~~~~~~~~~~~~~~~~a~~~~t~~~~~~~~~~~~~~a~~a~~~g~~g~~l~ 235 (666) T protein:vir:80 156 AIGVYPELDGDWTAEFTSSSGNGSAALSVTKIVTDSGLLLTDLETSRANITNQTFLTKLQKYDMPAVSAIYAGEIGNSLE 235 (666) T ss_pred cccccceeeccceeeeccccccceeeeeeeeeecCCccceeeeccccccccccccccccccccchhhhhhccccccccee Confidence 000 000 0 00100000000 0001110000000 0000000000000 0000000000 01100 0 Q ss_pred HHHHHHHHHHhcCCceeeeeeccccc-----cccc-ccc---ccce-------eccceeecccccccch-hhh-hh---- Q lcl|Aclame:pro 61 DVYTASEAIEEMGAEQWRVMVLEATE-----VTEE-ELS---DGDT-------IDKVPILGNHEVESPD-GDI-EF---- 118 (426) Q Consensus 61 p~YkAA~~~f~Q~~~~~~~~v~~~t~-----v~~~-~~~---~~~t-------v~~~~~s~~~~~~~ta-~~i-~~---- 118 (426) ........++..++.+ ......... +... ... .... +....++......... ... .. 
T Consensus 236 v~i~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~e~~~~~~~~~~~~~~~~~~~~~~~~~ 314 (666) T protein:vir:80 236 VEILARSAFKNTAPDL-TMYPYGGERTAARNLIPYAPQNDNQYAFIVRRDGVVVESYVLSTLKGDKDVYGNSIYMDDFFG 314 (666) T ss_pred eeeccccccccccccc-eeeeccccccccceeeeeccccccceeeEeccCCccceeeecccccccccccchhhhhhhhhc Confidence 0000000011111000 000000000 0000 000 0000 0000000000000000 000 00 Q ss_pred -hc----ccccccc--cceeeeeeccccceeeechhheeeeccccc---chhhhhhhc--cccceeecccccch-hhhHh Q lcl|Aclame:pro 119 -TT----DDDPDVE--DFDAEIVINSATGDVATSEDSIELTYFHAD---WSQLDEFPS--DVNNFAVADRRFDL-KGVGV 185 (426) Q Consensus 119 -~~----~~~~~~t--~~~~~~~~~~~~g~~t~~~~~~~~~~~~~d---~~~~~~~~s--~~~~~~la~~~~~~-~~~~~ 185 (426) +. ....... .......+.....................+ -.++..... .++....+...... ..... T Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~l~~p~~~~~~~~~~~v 394 (666) T protein:vir:80 315 RGSSQYIYATAQGWVDGFSGIISLAGGVSANEATTGGVGADPFIGAMMQGWGLFAERESIHVNLLIAGACAGEGDAFSTV 394 (666) T ss_pred cccceeeeecccccccccceEEEecCCCCcccccccccccccccccchhhhhhhhhhcccccceEeecCcCCcccchHHH Confidence 00 0000000 000000000000000000000000000000 000000000 01111111110000 00111 Q ss_pred Hhhhhhhhhhc-ceEEEEEeccc---ccccchhhHHHHH-HHhhc-------cCc-ceEEEEec-------CCC---ccc Q lcl|Aclame:pro 186 LDETHSWASDE-DMGMIANGVNV---DDYDSVDEAMDVA-HEVAG-------YVP-SGDLMMIV-------DAS---DDD 242 (426) Q Consensus 186 ~~~~~~wa~~~-~kl~~~~~~d~---~~~~~~~~~~~~a-~~~a~-------~~~-rt~~~~~~-------~~~---~~~ 242 (426) ...+...++.. +++.+....-. +............ +...+ ++. +-..+++. ... 
.-- T Consensus 395 ~~~~~~~~~~~~~~~~~~~~~~~~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~l~~p~~~~~d~~~~~~~~~p 474 (666) T protein:vir:80 395 QKHAVSIGDERQDCLVMVSPPRSTVVNIPVTTAIDNLIAWREGSGNYNENNMNINTTYAVIDGNYKYQYDKYNDVNRWVP 474 (666) T ss_pred HHHHHHHHHhhcceEEEeecceeEEeecCCCCCHHHHHHHHHhcccchhhhcccCcceEEEEcCceEEecccCCceeEec Confidence 11122222222 22222211000 0011111111111 11110 010 11112211 110 123 Q ss_pred hhHHHHHHHhhhc----ccccceeecccccceeeccccccccccccchhHHHHhhcCc-cEEEEEcC-CEEEeeceeecC Q lcl|Aclame:pro 243 LAAYQLGKFAVSE----PWYNPLWNELPAGETVSKNVGDPEEQGTFEGGDEAEGEGPV-NVLIDVSD-ANRVSNAVTTAG 316 (426) Q Consensus 243 ~~aa~~g~~~~~~----p~~~~~~~~~~~~~~~~~~k~~~gv~~~~~~~~~~~~~~~~-N~~~~~~g-~~~~~~~~t~~G 316 (426) |.++++|.|+..+ ||..+..++.. ..+...+..-.+...|...|+.+. |.++.+.| +..+|-+.|+++ T Consensus 475 ~sg~~AGl~Ar~D~~~g~~~sPan~~~~------~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~g~G~~~wG~rT~~~ 548 (666) T protein:vir:80 475 LAADIAGLCARTDAVSQPWMSPAGYNRG------QIMNVVKLAIEPRKAHRDRLYQAAINPVIGAGGEGFILMGDKTATT 548 (666) T ss_pred hHHHHHHHHHHHhhcCCceEccCCeecc------eeeccccceeecChhHHHhhhhCCeeEEEEeCCCeEEEEccccCCC Confidence 6788888888665 44333333211 111111111234556666777554 88988776 688888888776 Q ss_pred cccCcceeehhhhHHHHHHHHHHHHHHHHhcCCCCcccHHHHHHHHHHHHHHHHHhhcCCCccccceeEecCcccCcHHH Q lcl|Aclame:pro 317 ADSDTSFFDIRRTKVYTAEMLELDLESLQVSDDDVPFTEDGQAMIEDAIKGTMSGLTGSVGQPLAEYEVDVPEWDDDDVD 396 (426) Q Consensus 317 ~~~sg~~iD~i~g~dwl~~~iq~~l~~ll~~~~KIp~td~Gi~~i~~~v~~~l~~~v~~~g~~~~~y~~~~p~~~~~~~d 396 (426) ....-.||=++|-.+||++.|+..++..+-. |.+..=...|+..|+.-|.+-++++. 
+.+|.+...+...+++| T Consensus 549 ~~s~~~~i~vRRl~~~i~~si~~~~~~~v~e----pn~~~l~~~i~~~i~~~L~~l~~~ga--l~g~~V~~d~~~nt~~d 622 (666) T protein:vir:80 549 VPSPFDRINVRRLFNMLKKNIGDSSKYKLFE----NNDNFTRASFRMEVSQYLSTIRSLGG--IYDFRVQCDTTNNTPDV 622 (666) T ss_pred CCcccceeehhhHHHHHHHHHHHHHHHhccC----CCCHHHHHHHHHHHHHHHHHHHhcCc--eeeeEEEEcCCCCCHHH Confidence 6555678999999999999999998876643 66777778889999999998887553 44599988877778889 Q ss_pred HHhhcCCCeEEEEEEcccEEEEEEEEEEeC Q lcl|Aclame:pro 397 RVNRNWGGIDLDARLAQRAHTFSLGLNVSV 426 (426) Q Consensus 397 ra~R~~~~i~~~~~laGAIh~v~I~g~v~v 426 (426) +.+.++. +.+.+...-.++++.++..-+= T Consensus 623 i~~G~~~-~~i~~~P~~Pae~I~~~~~~~~ 651 (666) T protein:vir:80 623 IDRNEFV-ASMFIKPAKSINYIMLNFTAVA 651 (666) T ss_pred hhCCeEE-EEEEEEecCCcceEEEEEEEee Confidence 9887776 8999999999999999876443 No 43 >protein:vir:78986 Length: 436 # NCBI annotation: putative sheath tail protein # Family: family:all:632 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110731;genbank:gi:134287348;genbank:GeneID:4955160 Probab=94.62 E-value=0.0039 Score=33.78 Aligned_cols=373 Identities=11% Similarity=0.071 Sum_probs=156.8 Q ss_pred CCCceEEEE-EeeccccccccCccceEEEecccccccccchhhhheeecH---HHHHhccCCC--CHHHHHHHHHHhcCC Q lcl|Aclame:pro 1 MPKQIVEIE-LTAEIADRPQETFTDAAIVGTAEEEPPDAEFGEVNQYSTS---TSVGDDYGED--SDVYTASEAIEEMGA 74 (426) Q Consensus 1 mp~~iVnV~-isl~t~a~~~~~Fg~~Lilg~~~~~~~~~~~~~~~~Yts~---~~V~~Dfg~~--sp~YkAA~~~f~Q~~ 74 (426) -|---+|+. ......+...||. ..+....+--|+. ++..-++. 
.++..=||.+ .|..+..+.+| +++ T Consensus 15 ~PG~Y~n~~~~~~~~~~~~~rGi--~a~p~~~~wGp~~----~v~~i~~~~~~~~~~~~~G~~~~~~~~~~l~~~~-~~~ 87 (436) T protein:vir:78 15 LPGSYINFVSATRATSSLSDRGI--VAMPLELDWGIDE----EVFQVTSDDFEKYSTKYFGYDYTHEKLKGLRDLF-KNI 87 (436) T ss_pred cCceEEEEEecCcceeeccCCeE--EEEEEEecCCCCc----eeEEeecccchHHHHHHhcCccchHHHHHHHHHh-cCC Confidence 442333332 1112222334443 3333322322322 23333332 3555557754 44445666677 456 Q ss_pred ceeeeeeccc-c----ccccccccccceeccceeecccccccch--------------hhhhhhcccccccccceeeeee Q lcl|Aclame:pro 75 EQWRVMVLEA-T----EVTEEELSDGDTIDKVPILGNHEVESPD--------------GDIEFTTDDDPDVEDFDAEIVI 135 (426) Q Consensus 75 ~~~~~~v~~~-t----~v~~~~~~~~~tv~~~~~s~~~~~~~ta--------------~~i~~~~~~~~~~t~~~~~~~~ 135 (426) +....+++.. + .++.+ +..+..=|...++.-...+++. .+.+..... .. .-+.+.+ T Consensus 88 ~tv~~yrl~~G~~a~~~v~~A-ky~g~~gn~i~v~v~~~~~d~~~~dv~~~~g~~~~d~~~~~~~~~-l~--~n~~V~~- 162 (436) T protein:vir:78 88 RLGYFYKLNKGVKASCSIATA-RCSGIRGNDLKVIVTTNIDDNAKFDVVTLLDNKKVDTQIAKVITE-LQ--DNDYVTW- 162 (436) T ss_pred CEEEEEECCCcceeeeeeeee-ecCCCCCcEEEEEecccccccCceEEEEEecchhhhhhhHHHHhh-cc--CCceEEE- Confidence 7777666541 1 11111 1111110111111111111100 011111100 00 0000000 Q ss_pred cccccee--------eechhheeeecccccch-hhhhhhc-cccceeecccccchhhhHhHhhhhhhhh----hcceEEE Q lcl|Aclame:pro 136 NSATGDV--------ATSEDSIELTYFHADWS-QLDEFPS-DVNNFAVADRRFDLKGVGVLDETHSWAS----DEDMGMI 201 (426) Q Consensus 136 ~~~~g~~--------t~~~~~~~~~~~~~d~~-~~~~~~s-~~~~~~la~~~~~~~~~~~~~~~~~wa~----~~~kl~~ 201 (426) ..++++ +.+.... .....||. .+.++.. ..+...+++... .....+.+|+. 
.+.+-+- T Consensus 163 -~~~g~la~~a~~~LtGG~dG~--~~T~~dy~~al~~le~~~fn~l~~~~~d~-----~~~~~~~a~ikr~re~~g~~~~ 234 (436) T protein:vir:78 163 -KKEATLEATAGLTFTNGTNGE--AVTGTEYQAFLDKIESYSFNALGCLATTA-----EIKSLFVEFTKRMRDKVGAKFQ 234 (436) T ss_pred -Eecccccccceeeeecccccc--ccchHHHHHHHHHHcccceeEEEecCCCh-----HHHHHHHHHHHHHHhhcCCeEE Confidence 011111 1111111 11222333 2222222 122233333211 12233445543 2222221 Q ss_pred EEecccccccchhhHHHHHHHhhccCcceEEEEecCCCccchhHHHHHHHhhhcccccceeecccccceeeccccccc-- Q lcl|Aclame:pro 202 ANGVNVDDYDSVDEAMDVAHEVAGYVPSGDLMMIVDASDDDLAAYQLGKFAVSEPWYNPLWNELPAGETVSKNVGDPE-- 279 (426) Q Consensus 202 ~~~~d~~~~~~~~~~~~~a~~~a~~~~rt~~~~~~~~~~~~~~aa~~g~~~~~~p~~~~~~~~~~~~~~~~~~k~~~g-- 279 (426) +........+. ..+.. ...+...++ + ......+++.|.++......++| ++..++ T Consensus 235 aV~~~~~~~d~--EgIIn--v~n~v~g~~---~----~~~~~~a~vAG~~Ag~~~~~S~T------------~~~~~~~~ 291 (436) T protein:vir:78 235 TVLYKKNDADY--EGVVS--VENKIKDTG---L----LESSLIYWTTGAIAGCDINKSNT------------NKRYDGEF 291 (436) T ss_pred EEecCCCCCCC--ceEEE--eecccCCce---e----chhHHHHHHHHHHhcCccccCcc------------ceecCccc Confidence 21111111110 00000 000000010 0 11124466666655544333333 333333 Q ss_pred -cccccchhHHHHhhcCc-cEEEEEcCCEEEeeceee-----cCcccCcceeehhhhHHHHHHHHHHHHHHHHhcCCCCc Q lcl|Aclame:pro 280 -EQGTFEGGDEAEGEGPV-NVLIDVSDANRVSNAVTT-----AGADSDTSFFDIRRTKVYTAEMLELDLESLQVSDDDVP 352 (426) Q Consensus 280 -v~~~~~~~~~~~~~~~~-N~~~~~~g~~~~~~~~t~-----~G~~~sg~~iD~i~g~dwl~~~iq~~l~~ll~~~~KIp 352 (426) |...++.+|+..+-.+. 
-++..-++...+.++..+ +.+..+=..|=++|..|.+.++++....+.++ .|+| T Consensus 292 ~v~~~~t~~e~~~ai~~G~lvl~~d~~~v~I~~~VNTltt~~~~k~~~~~kI~vir~~D~i~~di~~~~~~~yi--GKv~ 369 (436) T protein:vir:78 292 DVDVNYTQIHLEEALKTGKFIFHKVGDEVHVLEDINTFVSFTDEKNDDFSSNQSVRVLDQIANDIATLFNTKYL--GEVP 369 (436) T ss_pred cccccCCHHHHHHHHhCCeEEEEEeCCeEEEEEccccceecCCCCCcchhhhhHHHHHHHHHHHHHHHhhhccc--cccC Confidence 33456777765544333 555555566667776543 22222333577888889988888876655444 4999 Q ss_pred ccHHHHHHHHHHHHHHHHHhhcCCCccccceeEecCcccCcH-HHHHhhcCCCeEEEEEEcccEEEEEEEEEEe Q lcl|Aclame:pro 353 FTEDGQAMIEDAIKGTMSGLTGSVGQPLAEYEVDVPEWDDDD-VDRVNRNWGGIDLDARLAQRAHTFSLGLNVS 425 (426) Q Consensus 353 ~td~Gi~~i~~~v~~~l~~~v~~~g~~~~~y~~~~p~~~~~~-~dra~R~~~~i~~~~~laGAIh~v~I~g~v~ 425 (426) =+..|-.++.+.|.+-|++-.+.+ .+..|+.. -.... .+-+...+ +++.++.--|+..+.+.++|. T Consensus 370 N~~dgr~~l~~~i~~yl~~L~~~g--~I~~f~~~---Dv~v~~~~~~~~v~--v~~~v~pvdamekiy~ti~v~ 436 (436) T protein:vir:78 370 NDKSGRISFWNDVVKHHEQLQNMR--AIEDFKAD---DVSVEPGSDKKTVV--VSDAVKVISAMSKLYMTVSVS 436 (436) T ss_pred CCHHHHHHHHHHHHHHHHHHHhCC--cccCCCCc---ceEEeecCCCCEEE--EEEEEEEEEeeeeEEEEEEEC Confidence 999999999999999998876644 23344321 00111 11111222 888899999999999999999 No 44 >protein:vir:101804 Length: 663 # NCBI annotation: gp18 # Family: family:all:661 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238881;genbank:gi:66391956;genbank:GeneID:3416631 Probab=94.40 E-value=0.0045 Score=33.44 Aligned_cols=409 Identities=11% Similarity=-0.027 Sum_probs=147.6 Q ss_pred CCCceEEEEEeeccccccc--cCc--------cceEEEecc-----cccccccchhhhhe-eecHHHHHhccCCCCHHHH Q lcl|Aclame:pro 1 MPKQIVEIELTAEIADRPQ--ETF--------TDAAIVGTA-----EEEPPDAEFGEVNQ-YSTSTSVGDDYGEDSDVYT 64 (426) Q Consensus 1 mp~~iVnV~isl~t~a~~~--~~F--------g~~Lilg~~-----~~~~~~~~~~~~~~-Yts~~~V~~Dfg~~sp~Yk 64 (426) .-...+.|.+ .+..... +.- ...+-++.. ........++.-.. ....+.... +...+.+. 
T Consensus 136 ~~~~~~~v~~--~ta~~~~~~~~v~~~~~~~~~~~~~~s~~s~~~~~a~~v~~v~~d~~~~v~~~~~a~~--~~t~~~~~ 211 (663) T protein:vir:10 136 SDGKIKSLFV--PTAEIIAKTRQLGTYPTLGDNWRIDVSGASGGSAAALALGNIVVDSGVTFGNSEDAPA--VMTSPAVM 211 (663) T ss_pred cccceEEEee--ccccccccccccccceeeccceeeEeeeccCccccccccceeccccceEEeecccccc--cccccccc Confidence 0011111111 1110000 000 000001000 00000000000000 000000000 00011111 Q ss_pred HHHHHHh--------cC--Cceeeeeecccccc----------------------ccccc--ccc---------ceeccc Q lcl|Aclame:pro 65 ASEAIEE--------MG--AEQWRVMVLEATEV----------------------TEEEL--SDG---------DTIDKV 101 (426) Q Consensus 65 AA~~~f~--------Q~--~~~~~~~v~~~t~v----------------------~~~~~--~~~---------~tv~~~ 101 (426) .....+. .+ ...+.+.+...+.. ...+. .+. ...... T Consensus 212 ~~~~~~~~~~i~A~~~G~~Gn~i~V~i~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~ 291 (663) T protein:vir:10 212 EKYAKFGMPLISAVYPGEIGSTVEVEIVSKTAFNSGAQQTIYPFGGTRTSNARGVIQYGPMTDDQFAIIVRRDGIVVEST 291 (663) T ss_pred ccccccccceEEeccCCcccceeeeeeccccccccccccceecccccccccccceeecccccccceeeEeecCCcceeee Confidence 0000000 00 00000000000000 00000 000 000000 Q ss_pred eeeccccccc---chhhhhhhcccccccccceeeeee-----ccccceeeechhhe-eeecccccch-hhhhhh--cccc Q lcl|Aclame:pro 102 PILGNHEVES---PDGDIEFTTDDDPDVEDFDAEIVI-----NSATGDVATSEDSI-ELTYFHADWS-QLDEFP--SDVN 169 (426) Q Consensus 102 ~~s~~~~~~~---ta~~i~~~~~~~~~~t~~~~~~~~-----~~~~g~~t~~~~~~-~~~~~~~d~~-~~~~~~--s~~~ 169 (426) +.+....... +...+...+ .........+. ...++.++...+.. .......+.. ++..+. 
..++ T Consensus 292 ~~s~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~l~gg~d~~~~~~~~~~~~~~~~l~~~~~~~ 367 (663) T protein:vir:10 292 VLSTRKGDRDVYGSNIFMDDYF----RNGGSNFIFASSEGWPAGFTGIIQLGGGTSANADVGADELIKGWDLFSDREALH 367 (663) T ss_pred cccccccccccccchhhhhhhh----cCCcceEEEEeecccCccccceeEeccccCCccccchhhhHHHHHhhhcccccc Confidence 0000000000 000000000 00000000000 00000000000000 0000001111 100000 0011 Q ss_pred c-eeecccccc--h-hhhHhHhhhhhhhhhc-ceEEEEEecccccc---cchhhHHHH--HHHhh---------ccCcce Q lcl|Aclame:pro 170 N-FAVADRRFD--L-KGVGVLDETHSWASDE-DMGMIANGVNVDDY---DSVDEAMDV--AHEVA---------GYVPSG 230 (426) Q Consensus 170 ~-~~la~~~~~--~-~~~~~~~~~~~wa~~~-~kl~~~~~~d~~~~---~~~~~~~~~--a~~~a---------~~~~rt 230 (426) . ..++..... . ........+...++.. +++.+......... ......... ..... .++... T Consensus 368 ~~~~i~~~~~~~~~~~~~~v~~~l~~~a~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~ 447 (663) T protein:vir:10 368 VNLMIAGACGSDGAEIASTVQKYVVSLADDRQDCVAIVNPPAELMVGIPTSTAVKNIVEWRNGMTGSGEVVDNNMNISST 447 (663) T ss_pred eeEEEeccCCCCchhhHHHHHHHHHHHHHhhCCEEEEEecCcccccccccccchHHHHHHHHhccccccchhhhcccCcc Confidence 0 111110000 0 0001111222223332 24444332211110 011111111 10000 011111 Q ss_pred -EEEEec-------CCC---ccchhHHHHHHHhhhcccccceeecccccceeeccccccccccccchhHHHHhhcCc-cE Q lcl|Aclame:pro 231 -DLMMIV-------DAS---DDDLAAYQLGKFAVSEPWYNPLWNELPAGETVSKNVGDPEEQGTFEGGDEAEGEGPV-NV 298 (426) Q Consensus 231 -~~~~~~-------~~~---~~~~~aa~~g~~~~~~p~~~~~~~~~~~~~~~~~~k~~~gv~~~~~~~~~~~~~~~~-N~ 298 (426) ..+++. ... ..-|.++++|.|+-.+.-+.+ |+. +.|..........++...+...|...|+.+. |. 
T Consensus 448 ~~~~~~p~~~~~d~~~~~~~~~p~sg~vAGl~Ar~D~~~g~-~~s-Pan~~~~~i~g~~~~~~~~~~~e~~~Ln~~gin~ 525 (663) T protein:vir:10 448 YAFIIGNYKYQYDKYNDINRWVPLAADIAGLCAYTDQVSHP-WMS-PAGYRRGQIRNCIKLAIEPKQSMRDTMYQVAINP 525 (663) T ss_pred ceEEEcCceEEecccCCceEEechhHHHHHHHHHhhccCCc-eEc-cCCceeccccccccceeecChhHHHHHhhCCceE Confidence 112211 111 124668888888877743321 111 1121111111122223345667777777554 88 Q ss_pred EEEEc--CCEEEeeceeecCcccCcceeehhhhHHHHHHHHHHHHHHHHhcCCCCcccHHHHHHHHHHHHHHHHHhhcCC Q lcl|Aclame:pro 299 LIDVS--DANRVSNAVTTAGADSDTSFFDIRRTKVYTAEMLELDLESLQVSDDDVPFTEDGQAMIEDAIKGTMSGLTGSV 376 (426) Q Consensus 299 ~~~~~--g~~~~~~~~t~~G~~~sg~~iD~i~g~dwl~~~iq~~l~~ll~~~~KIp~td~Gi~~i~~~v~~~l~~~v~~~ 376 (426) ++.+- ++..+|-..|+++....-.||=++|..+||++.|+..++..+-. |.+..=...|+..|+.-|.+-++++ T Consensus 526 i~~~~~~~G~~~wG~rT~~~~~s~~~~i~vrR~~~~i~~si~~~~~~~v~e----pn~~~l~~~i~~~i~~~L~~l~~~g 601 (663) T protein:vir:10 526 VTGFAGGDGFVLFGDKMATQVPSPFDRINVRRLFNMLKKNIGDTSKYELFE----NNDAFTRQSFRMETSQYLDGIRSLG 601 (663) T ss_pred EEEEeCCCcEEEEcccccCCCCcccceEehhhHHHHHHHHHHHHHHHhccC----CCCHHHHHHHHHHHHHHHHHHHhCC Confidence 87764 36777777777665445578999999999999999998876542 7788888899999999999988755 Q ss_pred CccccceeEecCcccCcHHHHHhhcCCCeEEEEEEcccEEEEEEEEEEeC Q lcl|Aclame:pro 377 GQPLAEYEVDVPEWDDDDVDRVNRNWGGIDLDARLAQRAHTFSLGLNVSV 426 (426) Q Consensus 377 g~~~~~y~~~~p~~~~~~~dra~R~~~~i~~~~~laGAIh~v~I~g~v~v 426 (426) . +-+|.+.......+++|+.+-++. 
+.+.+...-.++++.++..-+- T Consensus 602 a--l~g~~v~~d~~~nt~~~i~~G~~~-~~i~~~p~~pae~i~~~~~~~~ 648 (663) T protein:vir:10 602 G--CYDFRVVCDTTNNTPNVIDRNEFV-GTIYVKPPRSINYITLNMVATS 648 (663) T ss_pred c--eeeeEEEEcCCCCCHHHhhCCeEE-EEEEEEecCCcceEEEEEEEee Confidence 3 446999888777788888887775 8899999999999998877655 No 45 >protein:vir:6594 Length: 666 # NCBI annotation: tail sheath protein # Family: family:all:661 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891725;genbank:gi:33620670;genbank:GeneID:1725344 Probab=94.28 E-value=0.0048 Score=33.27 Aligned_cols=361 Identities=13% Similarity=0.058 Sum_probs=148.9 Q ss_pred CC---CceEEEEEeeccccccc-----------------------------cCccceEEEecccccccccchhhhheeec Q lcl|Aclame:pro 1 MP---KQIVEIELTAEIADRPQ-----------------------------ETFTDAAIVGTAEEEPPDAEFGEVNQYST 48 (426) Q Consensus 1 mp---~~iVnV~isl~t~a~~~-----------------------------~~Fg~~Lilg~~~~~~~~~~~~~~~~Yts 48 (426) .| .+-+.|.+ ....... ..|+ ++++... ..+.+|.- T Consensus 226 ~~g~~g~~i~v~i--~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~v~~~g--------~~~e~~~~ 293 (666) T protein:vir:65 226 YAGEIGNSLEVEI--LARSAFKNTAPDLTMYPYGGERTAARNLIPYAPQNDNQYA--FIVRRDG--------VVVESYVL 293 (666) T ss_pred eccccccceeEEe--ecccccccccccccccccccccccceeeecccccccccce--eeeecCC--------cccceeec Confidence 11 11112222 1111100 0111 1111100 00111110 Q ss_pred HHHHHhccCCCCHHHHHHHHHHhcCC-ceeeeeeccccccccccccccceeccceeecccccccchhhhhhhcccccccc Q lcl|Aclame:pro 49 STSVGDDYGEDSDVYTASEAIEEMGA-EQWRVMVLEATEVTEEELSDGDTIDKVPILGNHEVESPDGDIEFTTDDDPDVE 127 (426) Q Consensus 49 ~~~V~~Dfg~~sp~YkAA~~~f~Q~~-~~~~~~v~~~t~v~~~~~~~~~tv~~~~~s~~~~~~~ta~~i~~~~~~~~~~t 127 (426) ......-+......|... ++.++. ...+....... .+.....+ ..+......+...+. 
T Consensus 294 ~~~~~~~~~~~~~~~~~~--~~~~~~~~~v~~~~~~~~----~~~~~~~~-----~~~g~~~~~~~~~~~---------- 352 (666) T protein:vir:65 294 STLKGDKDVYGNSIYMDD--FFARGSSQYIYATAQGWV----DGFSGIIS-----LAGGVSANEATTGGV---------- 352 (666) T ss_pred ccCcccccccchhhhhhh--hhcccccceeeeeccccc----ccccceEE-----ccCCCCcCccccccc---------- Confidence 000000011112222111 111110 11111000000 00000000 000000000000000 Q ss_pred cceeeeeeccccceeeechhheeeecccccchhhhhhhc-cccceeecccc-cchhhhHhHhhhhhhhhhcceEEEE-Ee Q lcl|Aclame:pro 128 DFDAEIVINSATGDVATSEDSIELTYFHADWSQLDEFPS-DVNNFAVADRR-FDLKGVGVLDETHSWASDEDMGMIA-NG 204 (426) Q Consensus 128 ~~~~~~~~~~~~g~~t~~~~~~~~~~~~~d~~~~~~~~s-~~~~~~la~~~-~~~~~~~~~~~~~~wa~~~~kl~~~-~~ 204 (426) ......+....+ +..+..... .++....+... .+........++...++..+-.+.. .. T Consensus 353 ------g~~~~~~~~~~~------------~~~~~~~~~~~~~~l~~p~~~~~~~~~~~v~~~l~~~~~~~~~~~a~~d~ 414 (666) T protein:vir:65 353 ------GADPFIGAMMQG------------WDLFAERESIHVNLLIAGACAGEGDAFSTVQKHAVSIGDERQDCLVMVSP 414 (666) T ss_pred ------ccccccccHHHH------------HHHHhhhhhccCCceeecCcCCccchhHHHHHHHHHHHhhccceEEEecc Confidence 000000000000 000000000 01100001000 0001111122222333333222211 10 Q ss_pred -----cccccccchhhHHHHHHHhhc--------cCcceEEEEe-------cCCC---ccchhHHHHHHHhhhc----cc Q lcl|Aclame:pro 205 -----VNVDDYDSVDEAMDVAHEVAG--------YVPSGDLMMI-------VDAS---DDDLAAYQLGKFAVSE----PW 257 (426) Q Consensus 205 -----~d~~~~~~~~~~~~~a~~~a~--------~~~rt~~~~~-------~~~~---~~~~~aa~~g~~~~~~----p~ 257 (426) .+...-+....... .+...+ +-.....+|+ .... 
.--|.++++|.|+-.+ || T Consensus 415 ~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~sg~vAGl~Ar~D~~~g~~ 493 (666) T protein:vir:65 415 PRSTVVNIPVTTAIDNLIA-WREGSGNYNENNMNINTTYAVIDGNYKYQYDKYNDVNRWVPLAADIAGLCARTDAVSQPW 493 (666) T ss_pred ccceeeecCCCCCHHHHHH-HHHhcccccccccccCcceEEEEcCceEEecccCCceeEechHHHHHHHHHHHhccCCcE Confidence 01000011111111 111111 0001111222 1111 2236788888888765 44 Q ss_pred ccceeecccccceeeccccccccccccchhHHHHhhcCc-cEEEEEcC-CEEEeeceeecCcccCcceeehhhhHHHHHH Q lcl|Aclame:pro 258 YNPLWNELPAGETVSKNVGDPEEQGTFEGGDEAEGEGPV-NVLIDVSD-ANRVSNAVTTAGADSDTSFFDIRRTKVYTAE 335 (426) Q Consensus 258 ~~~~~~~~~~~~~~~~~k~~~gv~~~~~~~~~~~~~~~~-N~~~~~~g-~~~~~~~~t~~G~~~sg~~iD~i~g~dwl~~ 335 (426) ..+..++... .+...+..-.+...|...|+.+. |.++...| +..+|-+.|.++....-.||=++|..+||++ T Consensus 494 ~span~~~~~------i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~~~~s~~~~i~vrR~~~~i~~ 567 (666) T protein:vir:65 494 MSPAGYNRGQ------IMNVVKLAIEPRKAHRDRLYQAAINPVIGAGGEGFILMGDKTATTVPSPFDRINVRRLFNMLKK 567 (666) T ss_pred EccCCeecce------eeccccceeecChhHHHhhhhCCceEEEEeCCCeEEEEecccCCCCCcccceEehhhHHHHHHH Confidence 4443332211 11111111234556666776544 98888765 6888888887776555679999999999999 Q ss_pred HHHHHHHHHHhcCCCCcccHHHHHHHHHHHHHHHHHhhcCCCccccceeEecCcccCcHHHHHhhcCCCeEEEEEEcccE Q lcl|Aclame:pro 336 MLELDLESLQVSDDDVPFTEDGQAMIEDAIKGTMSGLTGSVGQPLAEYEVDVPEWDDDDVDRVNRNWGGIDLDARLAQRA 415 (426) Q Consensus 336 ~iq~~l~~ll~~~~KIp~td~Gi~~i~~~v~~~l~~~v~~~g~~~~~y~~~~p~~~~~~~dra~R~~~~i~~~~~laGAI 415 (426) .|+..++..+-. |.+..=...|+..|+.-|.+-++++. +-+|.+.......+++|+.+.++. 
+.+.+.....+ T Consensus 568 si~~~~~~~v~e----pn~~~l~~~i~~~i~~~L~~l~~~ga--l~g~~V~~d~~~nt~~~i~~G~~~-~~i~~~p~~pa 640 (666) T protein:vir:65 568 NIGDSSKYKLFE----NNDNFTRASFRMEVSQYLSTIRSLGG--IYDFRVQCDTTNNTPDVIDRNEFV-ASMFIKPAKSI 640 (666) T ss_pred HHHHHHHHhccC----CCCHHHHHHHHHHHHHHHHHHHhCCc--eeeeEEEEcCCCCCHHHhhCCeEE-EEEEEEecCCc Confidence 999998876643 67778888999999999999887553 445989888777788898887775 89999999999 Q ss_pred EEEEEEEEEeC Q lcl|Aclame:pro 416 HTFSLGLNVSV 426 (426) Q Consensus 416 h~v~I~g~v~v 426 (426) +++.++..-+= T Consensus 641 e~i~~~~~~~~ 651 (666) T protein:vir:65 641 NYIMLNFTAVA 651 (666) T ss_pred ceEEEEEEEee Confidence 99998877654 No 46 >protein:vir:6894 Length: 660 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861870;genbank:gi:32453661;genbank:GeneID:1494296 Probab=93.66 E-value=0.0068 Score=32.47 Aligned_cols=400 Identities=12% Similarity=0.050 Sum_probs=145.1 Q ss_pred CC--CceEEEE-------EeeccccccccCccceEEEecccccccccchhhhheeecHHHHHhccC----CCCHHH---- Q lcl|Aclame:pro 1 MP--KQIVEIE-------LTAEIADRPQETFTDAAIVGTAEEEPPDAEFGEVNQYSTSTSVGDDYG----EDSDVY---- 63 (426) Q Consensus 1 mp--~~iVnV~-------isl~t~a~~~~~Fg~~Lilg~~~~~~~~~~~~~~~~Yts~~~V~~Dfg----~~sp~Y---- 63 (426) .+ ...+.+. +.+............+.++-.- +......-...+... 
...++- ...+.+ T Consensus 175 ~~~~~~~~~v~~~~~d~~~~~~~~~ta~~~~~~~~~~~~~---~~~~~~~~~A~~~g~--~G~~i~v~~~~~a~~~~~~~ 249 (660) T protein:vir:68 175 SSGLSAVITIDSVVMDSGILLTEVETSEEAITSLTFQESI---KKYGVPGVVALYPGE--LGDQLEIEIVSKADYDKGAS 249 (660) T ss_pred cccceeeeeeccccccccceeeeeccccccccccceeeee---cccCccccccccccc--cccceEEEEecccccccccc Confidence 11 1111000 0000000000000000000000 000000000000000 000000 000000 Q ss_pred -------------HHHHHHHhcCCceeeeeeccccccccccccccceeccceeecccccccchhh---hhhhccccccc- Q lcl|Aclame:pro 64 -------------TASEAIEEMGAEQWRVMVLEATEVTEEELSDGDTIDKVPILGNHEVESPDGD---IEFTTDDDPDV- 126 (426) Q Consensus 64 -------------kAA~~~f~Q~~~~~~~~v~~~t~v~~~~~~~~~tv~~~~~s~~~~~~~ta~~---i~~~~~~~~~~- 126 (426) ..+..+|.+.+...... +..+.. .+..+.....+-.......... +.......... T Consensus 250 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~v~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 322 (660) T protein:vir:68 250 AQLKIYPDGGTRYSTAKAIFGYGPQTDDQY----AIIVRR---NDSVVQSVVLSTKRGERDIYGSNIFIDDFFAKGASNY 322 (660) T ss_pred ccceeeecccccccceeeEeecccccccce----eeeeec---CCcceeeeeeecccccccccccceeeehhhccCcccE Confidence 00111111110000000 000000 0000000000000000000000 00000000000 Q ss_pred ccceeeeeeccccceeeechhhe-eeecccccc----hhhhhhhccccce-eeccc--ccchh-hhHhHhhhhhhhhhcc Q lcl|Aclame:pro 127 EDFDAEIVINSATGDVATSEDSI-ELTYFHADW----SQLDEFPSDVNNF-AVADR--RFDLK-GVGVLDETHSWASDED 197 (426) Q Consensus 127 t~~~~~~~~~~~~g~~t~~~~~~-~~~~~~~d~----~~~~~~~s~~~~~-~la~~--~~~~~-~~~~~~~~~~wa~~~~ 197 (426) ..+..........+......+.. .......+. ..+.... .+... .++.. ..... 
.......+..-++..+ T Consensus 323 v~~~~~~~~~~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~l~~~~~~~~~~~~~~~v~~~l~~~~~~~~ 401 (660) T protein:vir:68 323 IFATAQGWPKGFSGVIKLNGGLSSNETVEAGDLMEAWDLFADRE-SVNAQLFIAGSCAGESLEVASTVQKHVVAIGDSRQ 401 (660) T ss_pred EEEeecCCCccccceeeeccccccccccccchhhhHHHHhhhhh-ccccceeeccccCCCchHHHHHHHHHHHHHHHhhC Confidence 00000000000000000000000 000000000 0111111 01111 11100 00000 0111122222233322 Q ss_pred -eEEEEEe-----cccccccchhhHHHHHHHhh-------ccCcceEEEEec-------CCC---ccchhHHHHHHHhhh Q lcl|Aclame:pro 198 -MGMIANG-----VNVDDYDSVDEAMDVAHEVA-------GYVPSGDLMMIV-------DAS---DDDLAAYQLGKFAVS 254 (426) Q Consensus 198 -kl~~~~~-----~d~~~~~~~~~~~~~a~~~a-------~~~~rt~~~~~~-------~~~---~~~~~aa~~g~~~~~ 254 (426) ++++... .+...-...+.....+.... ++-.+-..+|+. ... ..-|.++++|.++-. T Consensus 402 ~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~sg~~AGl~Ar~ 481 (660) T protein:vir:68 402 DCLVLCSPPRAAVVGIPVNRAVDNLVDWRTASGTYTDNNFNISSTYAAIDGNYKYQYDKYNDVNRWVPLAADIAGLCART 481 (660) T ss_pred CeEEEEcccceeEecCCCCCCHHHHHHHHhhcccccccccccCcceEEEEcCceEEecccCCceEEechhHHHHHHHHHH Confidence 2222211 11000011111111111100 010111222222 111 123678888988877 Q ss_pred c----ccccceeecccccceeeccccccccccccchhHHHHhhcCc-cEEEEEcC-CEEEeeceeecCcccCcceeehhh Q lcl|Aclame:pro 255 E----PWYNPLWNELPAGETVSKNVGDPEEQGTFEGGDEAEGEGPV-NVLIDVSD-ANRVSNAVTTAGADSDTSFFDIRR 328 (426) Q Consensus 255 ~----p~~~~~~~~~~~~~~~~~~k~~~gv~~~~~~~~~~~~~~~~-N~~~~~~g-~~~~~~~~t~~G~~~sg~~iD~i~ 328 (426) + ||..+..+..... ....+..-.+..+|...|+.+. 
|.++.+.| +..+|-+.|.++....-.||=++| T Consensus 482 d~~~g~~~span~~~~~i------~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~~~~s~~~~i~vrR 555 (660) T protein:vir:68 482 DNISQPWMSPAGYNRGQI------LNVIKLAIETRQAQRDRLYQEAINPVTGTGGDGYVLYGDKTATSVPSPFDRINVRR 555 (660) T ss_pred hccCCcEEccCCeeecee------eccceeeecCChhHHHHHhhCCceEEEEecCCeEEEEcceecCCCCcccceEehhh Confidence 6 5544433322111 1111111235566777776544 88888765 578888888777555567999999 Q ss_pred hHHHHHHHHHHHHHHHHhcCCCCcccHHHHHHHHHHHHHHHHHhhcCCCccccceeEecCcccCcHHHHHhhcCCCeEEE Q lcl|Aclame:pro 329 TKVYTAEMLELDLESLQVSDDDVPFTEDGQAMIEDAIKGTMSGLTGSVGQPLAEYEVDVPEWDDDDVDRVNRNWGGIDLD 408 (426) Q Consensus 329 g~dwl~~~iq~~l~~ll~~~~KIp~td~Gi~~i~~~v~~~l~~~v~~~g~~~~~y~~~~p~~~~~~~dra~R~~~~i~~~ 408 (426) ..+||+..++..++..+-. |.+..=...|+..|+.-|.+-++++. +-+|.+...+...+++|+.+.++. +.+. T Consensus 556 ~~~~i~~si~~~~~~~v~e----pn~~~~~~~i~~~i~~~L~~l~~~ga--l~gf~V~~d~~~nt~~~i~~G~~~-~~i~ 628 (660) T protein:vir:68 556 LFNMVKTNIGSASKYRLFE----LNNAFTRSSFRTETSQYLQGIKALGG--VYNFKVVCDTTNNTPAVIDRNEFV-ATFY 628 (660) T ss_pred HHHHHHHHHHHHHHHhccC----CCCHHHHHHHHHHHHHHHHHHHhcCc--eeeeEEEEecCCCCHHHhhCCeEE-EEEE Confidence 9999999999998876642 55677778889999999998887553 345888887777788898887777 8999 Q ss_pred EEEcccEEEEEEEEEEeC Q lcl|Aclame:pro 409 ARLAQRAHTFSLGLNVSV 426 (426) Q Consensus 409 ~~laGAIh~v~I~g~v~v 426 (426) +...-.++++.++..-+- T Consensus 629 ~~p~~pae~i~l~~~~~~ 646 (660) T protein:vir:68 629 LQPARSINYITLNFVATA 646 (660) T ss_pred EEecCCcceEEEEEEEee Confidence 999999999998877665 No 47 >protein:vir:108052 Length: 660 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595294;genbank:gi:161622600;genbank:GeneID:5783771 Probab=93.04 E-value=0.009 Score=31.79 Aligned_cols=388 Identities=10% Similarity=-0.046 Sum_probs=147.9 Q ss_pred CC-CceEEEEE-----eeccccccccCccceEEEeccccc--ccccchhh---hheeec-HHHHHhccCCCCHHHHHHHH Q lcl|Aclame:pro 1 
MP-KQIVEIEL-----TAEIADRPQETFTDAAIVGTAEEE--PPDAEFGE---VNQYST-STSVGDDYGEDSDVYTASEA 68 (426) Q Consensus 1 mp-~~iVnV~i-----sl~t~a~~~~~Fg~~Lilg~~~~~--~~~~~~~~---~~~Yts-~~~V~~Dfg~~sp~YkAA~~ 68 (426) .. ........ .+........+..+...+...... .|....+- -..+.. .+.+...-+..+..|..... T Consensus 205 ~~~~~~~~~~~~~~~~~~~a~~~g~~G~~i~v~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~ 284 (660) T protein:vir:10 205 ITSLEFQAALKKFAMPGVVALYPGEIGSTLEVEIVSKAAYEAGSSKMLDVYPGGGTRASIAKAVFNYGPQTDDQYAIIVR 284 (660) T ss_pred cccccceeeccccccceeeeecccccCcceeEEEeeccccCCcceeEEeeeeccceeeEEeeeecccccccccccccccc Confidence 10 00000000 000000011111111111110000 00000000 000000 00000000000000100000 Q ss_pred HHhcCCceeeeeeccccccccccccccceeccceeeccccc---ccchhhhhhhccccccc-ccceeeeeeccccce--e Q lcl|Aclame:pro 69 IEEMGAEQWRVMVLEATEVTEEELSDGDTIDKVPILGNHEV---ESPDGDIEFTTDDDPDV-EDFDAEIVINSATGD--V 142 (426) Q Consensus 69 ~f~Q~~~~~~~~v~~~t~v~~~~~~~~~tv~~~~~s~~~~~---~~ta~~i~~~~~~~~~~-t~~~~~~~~~~~~g~--~ 142 (426) .++.......++..... ..+...+.+.+...... .-+..........+. + T Consensus 285 ------------------------~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~l 340 (660) T protein:vir:10 285 ------------------------RDGAIVESVVLSTKEGEKDVYGNNIYLDDYFAKGTSNYIYATSLNWPKGFSGIINL 340 (660) T ss_pred ------------------------cCCcccceeeeeccccccccccceeeeehhhcCCCccEEEEEeccCCCCcccceee Confidence 00000000000000000 00000000000000000 000000000000000 0 Q ss_pred eechhheeeecccccc-hhhhhhh---c-cccceeeccc--ccchhhhHhHhhhhhhhhhcc-eEEEEEecccc---ccc Q lcl|Aclame:pro 143 ATSEDSIELTYFHADW-SQLDEFP---S-DVNNFAVADR--RFDLKGVGVLDETHSWASDED-MGMIANGVNVD---DYD 211 (426) Q Consensus 143 t~~~~~~~~~~~~~d~-~~~~~~~---s-~~~~~~la~~--~~~~~~~~~~~~~~~wa~~~~-kl~~~~~~d~~---~~~ 211 (426) ..+.+.. ......+. .++..+. . .++....+.. ..+.........+...++... .+.+....... ... 
T Consensus 341 ~gg~~~~-~~~t~~~~~~~~~~~~~~~~~~~~~l~~p~~~~~~~~~~~~v~~al~~~~~~~~~~~aiid~p~~~~~~~~~ 419 (660) T protein:vir:10 341 SGGISAN-DKVTAGDLMQGWDLFADREALHINLLIAGAVAGEGDEVASTVQKHVVSIADERQDCLAFISPPKGLLVNVPL 419 (660) T ss_pred eccccCc-cccccchhhhhhhhhhhhhhcccceEEEcCcCCCchhhhHHHHHHHHHHHHhhCCEEEEEecCccccccccc Confidence 0000000 00000011 0111111 0 0111111110 000001111222222333322 33322211000 000 Q ss_pred -chhhHHHHHHHhhc-------cCcc-eEEEEecCC----------CccchhHHHHHHHhhhc----ccccceeeccccc Q lcl|Aclame:pro 212 -SVDEAMDVAHEVAG-------YVPS-GDLMMIVDA----------SDDDLAAYQLGKFAVSE----PWYNPLWNELPAG 268 (426) Q Consensus 212 -~~~~~~~~a~~~a~-------~~~r-t~~~~~~~~----------~~~~~~aa~~g~~~~~~----p~~~~~~~~~~~~ 268 (426) .........+...+ ++.. ...+|+.-. ....|.++++|.++-.+ ||..+..+.... T Consensus 420 ~~~~~~~~~~r~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~sg~~AGl~Ar~D~~~g~~~sPan~~~~~- 498 (660) T protein:vir:10 420 TRAVDNLIDWRTGAGTFDANNMNISTTYAAIDGNYKYQYDKYNDVNRWVPLAADLAGLCARTDDVSQPWMSPAGYNRGQ- 498 (660) T ss_pred ccCHHHHHHHHhhcccccccccccCcceEEEEcCceEEecccCCceeEechhHHHHHHHHHhhccCCcEEccCCeeece- Confidence 01111111111111 0111 112222211 11246788889888776 444443332211 Q ss_pred ceeeccccccccccccchhHHHHhhcCc-cEEEEEc--CCEEEeeceeecCcccCcceeehhhhHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 269 ETVSKNVGDPEEQGTFEGGDEAEGEGPV-NVLIDVS--DANRVSNAVTTAGADSDTSFFDIRRTKVYTAEMLELDLESLQ 345 (426) Q Consensus 269 ~~~~~~k~~~gv~~~~~~~~~~~~~~~~-N~~~~~~--g~~~~~~~~t~~G~~~sg~~iD~i~g~dwl~~~iq~~l~~ll 345 (426) .....+..-.+...|...|+.+. 
|.++..- ++..+|-..|+++-..+-.||=++|..+||++.|+..++..+ T Consensus 499 -----i~g~~~~~~~~~~~e~~~Ln~~gIn~i~~~~~~~G~~~wG~rT~~~~~s~~~~i~vrR~~~~i~~si~~~~~~~v 573 (660) T protein:vir:10 499 -----ILNVLKLAIEPRQAQRDRMYQEAINPVVGFAGGDGFVLFGDKTATKVPSPMDHINVRRLFNMLKKNIGDASKYKL 573 (660) T ss_pred -----eeccceeeecCChhhHHhHhhCCceEEEEeeCCCcEEEEcccccCCCCcccceEehhhHHHHHHHHHHHHHHHhc Confidence 11111111235566777777554 8888764 367777777766654455789999999999999999988876 Q ss_pred hcCCCCcccHHHHHHHHHHHHHHHHHhhcCCCccccceeEecCcccCcHHHHHhhcCCCeEEEEEEcccEEEEEEEEEEe Q lcl|Aclame:pro 346 VSDDDVPFTEDGQAMIEDAIKGTMSGLTGSVGQPLAEYEVDVPEWDDDDVDRVNRNWGGIDLDARLAQRAHTFSLGLNVS 425 (426) Q Consensus 346 ~~~~KIp~td~Gi~~i~~~v~~~l~~~v~~~g~~~~~y~~~~p~~~~~~~dra~R~~~~i~~~~~laGAIh~v~I~g~v~ 425 (426) -. |.++.-...|+..|+.-|..-++++. +.+|.+.......+++|+.+.++. +.+.+...-.++++.++..-+ T Consensus 574 ~e----pn~~~l~~~i~~~i~~fL~~l~~~ga--l~g~~V~~d~~~nt~~di~~G~~~-~~i~~~P~~pae~I~~~~~~~ 646 (660) T protein:vir:10 574 FE----LNDNFTRSSFRMEVSQYLDGIKALGG--IYEGRVVCDTTVNTPAVIDRNEFI-ANIYVKPARSINYITLNFVAT 646 (660) T ss_pred cC----CCCHHHHHHHHHHHHHHHHHHHhCCc--eeeeEEEEcCCCCCHHHhhCCeEE-EEEEEEecCCccEEEEEEEEe Confidence 43 66888888999999999999887553 445888888777788899888877 899999999999999998777 Q ss_pred C Q lcl|Aclame:pro 426 V 426 (426) Q Consensus 426 v 426 (426) - T Consensus 647 ~ 647 (660) T protein:vir:10 647 S 647 (660) T ss_pred e Confidence 5 No 48 >protein:vir:7206 Length: 659 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049780;genbank:gi:9632592;genbank:GeneID:1258597 Probab=92.15 E-value=0.013 Score=30.99 Aligned_cols=366 Identities=12% Similarity=0.019 Sum_probs=145.9 Q ss_pred CC-CceEEEEEeeccccccccCccceEE---------------E--ecccccccccchhhhheeecH------HHHHhcc Q lcl|Aclame:pro 1 MP-KQIVEIELTAEIADRPQETFTDAAI---------------V--GTAEEEPPDAEFGEVNQYSTS------TSVGDDY 56 (426) Q Consensus 1 
mp-~~iVnV~isl~t~a~~~~~Fg~~Li---------------l--g~~~~~~~~~~~~~~~~Yts~------~~V~~Df 56 (426) .| .-.-.+++.+...+.........+- + +.+.. ++ +..+.+.... -....+. T Consensus 226 ~~gt~g~~~tv~i~~~~~~~~~~~~~v~~~~~~~~~a~~~~~~~~~~~~~~-~~---~~~~~~~~~~~~~~~~~~~~~~~ 301 (659) T protein:vir:72 226 YPGELGDKIEIEIVSKADYAKGASALLPIYPGGGTRASTAKAVFGYGPQTD-SQ---YAIIVRRNDAIVQSVVLSTKRGE 301 (659) T ss_pred cccccccceeEEEccccccccceeeeeecccccccccccceeeeeeecccc-cc---cceeeecccceeeeeeeeecccc Confidence 11 0000111111111000000000000 0 00000 00 0000000000 0000111 Q ss_pred CCCCHHHHHHHHHHhcCC-ceeeeeeccccccccccccccceeccceeecccc--cccchhhhhhhcccccccccceeee Q lcl|Aclame:pro 57 GEDSDVYTASEAIEEMGA-EQWRVMVLEATEVTEEELSDGDTIDKVPILGNHE--VESPDGDIEFTTDDDPDVEDFDAEI 133 (426) Q Consensus 57 g~~sp~YkAA~~~f~Q~~-~~~~~~v~~~t~v~~~~~~~~~tv~~~~~s~~~~--~~~ta~~i~~~~~~~~~~t~~~~~~ 133 (426) +...+.-.....+|..+. ...+....... . ....+ .++.+... ...+..++..++ T Consensus 302 ~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~-~-----~~~~~---~~l~gg~~~~~~~~~~~~~~~~------------- 359 (659) T protein:vir:72 302 KDIYDSNIYIDDFFAKGGSEYIFATAQNWP-E-----GFSGI---LTLSGGLSSNAEVTAGDLMEAW------------- 359 (659) T ss_pred ccccchhhhhhhhhhcCCceEEEEEecccC-C-----ccccc---ccccccccccccccchhHHHHH------------- Confidence 111111122222222110 00000000000 0 00000 00000000 000000000000 Q ss_pred eeccccceeeechhheeeecccccchhhhhhhc-cccceeecccc-cchhh-hHhHhhhhhhhhhc-ceEEEEEeccccc Q lcl|Aclame:pro 134 VINSATGDVATSEDSIELTYFHADWSQLDEFPS-DVNNFAVADRR-FDLKG-VGVLDETHSWASDE-DMGMIANGVNVDD 209 (426) Q Consensus 134 ~~~~~~g~~t~~~~~~~~~~~~~d~~~~~~~~s-~~~~~~la~~~-~~~~~-~~~~~~~~~wa~~~-~kl~~~~~~d~~~ 209 (426) ..+..... ..+........ ...+. ......+..-++.. +++++........ 
T Consensus 360 -------------------------~~~~~~~~~~~~~l~~p~~~~~~~~~~~~v~~~l~~~~~~~~~~~~~~d~~~~~~ 414 (659) T protein:vir:72 360 -------------------------DFFADRESVDVQLFIAGSCAGESLETASTVQKHVVSIGDARQDCLVLCSPPRETV 414 (659) T ss_pred -------------------------HHhhhccccceeEEEecCCCCcchhhhHHHHHHHHHHHhhhCCEEEEEcCccccc Confidence 00000000 01100011000 00000 00111122223332 2333332111000 Q ss_pred -----ccchhhHHHHHHHhh------ccC-cceEEEEec-------CCC---ccchhHHHHHHHhhhcccccceeecccc Q lcl|Aclame:pro 210 -----YDSVDEAMDVAHEVA------GYV-PSGDLMMIV-------DAS---DDDLAAYQLGKFAVSEPWYNPLWNELPA 267 (426) Q Consensus 210 -----~~~~~~~~~~a~~~a------~~~-~rt~~~~~~-------~~~---~~~~~aa~~g~~~~~~p~~~~~~~~~~~ 267 (426) ....+.....+.... .+. .+...+|+. ... ..-|.++++|.++-.+.-+.+ |+. +. T Consensus 415 ~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~sg~vAGl~Ar~D~~~G~-~~s-pa 492 (659) T protein:vir:72 415 VGIPVTRAVDNLVNWRTAAGSYTDNNFNISSTYAAIDGNHKYQYDKYNDVNRWVPLAADIAGLCARTDNVSQT-WMS-PA 492 (659) T ss_pred cCCCcccCHHHHHHHHhhccccccccccccceeEEEEcCceeeccccCCceEEechHHHHHHHHHHhhccCCc-EEc-cC Confidence 111111111111100 001 111122222 111 113668888888876642221 111 11 Q ss_pred cceeeccccccccccccchhHHHHhhcCc-cEEEEEcC-CEEEeeceeecCcccCcceeehhhhHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 268 GETVSKNVGDPEEQGTFEGGDEAEGEGPV-NVLIDVSD-ANRVSNAVTTAGADSDTSFFDIRRTKVYTAEMLELDLESLQ 345 (426) Q Consensus 268 ~~~~~~~k~~~gv~~~~~~~~~~~~~~~~-N~~~~~~g-~~~~~~~~t~~G~~~sg~~iD~i~g~dwl~~~iq~~l~~ll 345 (426) |......+...+..-.+..+|...|+.+. 
|.++.+.| +..+|-+.|+++....-.||-++|..+||...++..++..+ T Consensus 493 n~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~g~G~~~wG~rT~~~~~s~~~~i~vrR~~~~i~~si~~~~~~~v 572 (659) T protein:vir:72 493 GYNRGQILNVIKLAIETRQAQRDRLYQEAINPVTGTGGDGYVLYGDKTATSVPSPFDRINVRRLFNMLKTNIGRSSKYRL 572 (659) T ss_pred CeeeceeeccccccccCChhHHHHHhhCCceEEEEecCCeEEEEcccccCCCCcccceEeehhHHHHHHHHHHHHHHHhh Confidence 11111111111111234566777777554 88888754 67788777777665556899999999999999999988765 Q ss_pred hcCCCCcccHHHHHHHHHHHHHHHHHhhcCCCccccceeEecCcccCcHHHHHhhcCCCeEEEEEEcccEEEEEEEEEEe Q lcl|Aclame:pro 346 VSDDDVPFTEDGQAMIEDAIKGTMSGLTGSVGQPLAEYEVDVPEWDDDDVDRVNRNWGGIDLDARLAQRAHTFSLGLNVS 425 (426) Q Consensus 346 ~~~~KIp~td~Gi~~i~~~v~~~l~~~v~~~g~~~~~y~~~~p~~~~~~~dra~R~~~~i~~~~~laGAIh~v~I~g~v~ 425 (426) -. |.++.=...|+..|+.-|.+-++++. +-.|.+...+...+++|+.+-++. +.+.+...-.++++.++..-+ T Consensus 573 ~e----~n~~~l~~~i~~~i~~fL~~l~~~ga--l~~~~V~~d~~~nt~~~i~~G~~~-~~i~~~p~~pae~I~~~~~~~ 645 (659) T protein:vir:72 573 FE----LNNAFTRSSFRTETAQYLQGNKALGG--IYEYRVVCDTTNNTPSVIDRNEFV-ATFYIQPARSINYITLNFVAT 645 (659) T ss_pred cC----CCCHHHHHHHHHHHHHHHHHHHhcCc--eeeEEEEEcCCCCCHHHhhCCeEE-EEEEEEecCCccEEEEEEEEe Confidence 32 67788888899999999999887664 457888888777778888887776 889999999999999987765 Q ss_pred C Q lcl|Aclame:pro 426 V 426 (426) Q Consensus 426 v 426 (426) - T Consensus 646 ~ 646 (659) T protein:vir:72 646 A 646 (659) T ss_pred e Confidence 4 No 49 >protein:vir:105470 Length: 451 # NCBI annotation: putative phage sheath tail protein # Family: family:all:632 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529880;genbank:gi:90592620;genbank:GeneID:3974534 Probab=91.28 E-value=0.017 Score=30.34 Aligned_cols=386 Identities=11% Similarity=-0.015 Sum_probs=157.9 Q ss_pred CCCceEEEEEeeccccc-cccCc--cceEEEecccccccccchhhhheeecHHHHHhccC--CCCHHHHHHHHHHhcCCc Q lcl|Aclame:pro 1 MPKQIVEIELTAEIADR-PQETF--TDAAIVGTAEEEPPDAEFGEVNQYSTSTSVGDDYG--EDSDVYTASEAIEEMGAE 75 (426) Q Consensus 1 
mp~~iVnV~isl~t~a~-~~~~F--g~~Lilg~~~~~~~~~~~~~~~~Yts~~~V~~Dfg--~~sp~YkAA~~~f~Q~~~ 75 (426) +.|-.=-|=|+..+.+. +..+- ...++||-...-+|.. + ..-.+-++...-|| .+.+.|++-+++| ++++ T Consensus 9 ~~K~~PGvYi~~~~~~~~~~~~~~~~~~~~i~~~~~~g~~~---~-v~i~~~~d~~~~fG~~~~~~~~~~~~~~~-~g~~ 83 (451) T protein:vir:10 9 QDKRRPGTYINVVGNGQREAASSLGRVLLIRDKGLGWGKNG---V-IEVEANSDFTKKLGTTLDDPSLTALKETL-KGAS 83 (451) T ss_pred ceeecCceEEEEeccCcceeeccCCcEEEEEeeecCCCCcc---c-EEeecHHHHHHHcCCcccchhHHHHHHHh-cCCc Confidence 22211122233222221 11122 2466776443333433 2 23455577778888 6677888777777 5777 Q ss_pred eeeeeeccc-ccccccccccccee---------ccceeecccccccchhhhhhhcccccccccceeee------------ Q lcl|Aclame:pro 76 QWRVMVLEA-TEVTEEELSDGDTI---------DKVPILGNHEVESPDGDIEFTTDDDPDVEDFDAEI------------ 133 (426) Q Consensus 76 ~~~~~v~~~-t~v~~~~~~~~~tv---------~~~~~s~~~~~~~ta~~i~~~~~~~~~~t~~~~~~------------ 133 (426) ....+++.. +...........++ |...++.....++.....+..... .+.++... T Consensus 84 ~v~~yrl~~g~~a~~t~~~~~~~~~Aky~G~~Gn~i~v~v~~~~~d~~~~~v~t~~g---~~~vd~qtv~~~~~~el~~n 160 (451) T protein:vir:10 84 KVLVLNPNEGTAATLTKEGLPWTVTANYPGEKGNQITVSVEVSPADQNAATVSTIFG---TKLVDEQSIKFNELDKFKGN 160 (451) T ss_pred EEEEEEcCCCceEEEEeecCceEEEEeeCCcCCceEEEEEecccCCcCceEEEEEEC---CeEEEEEEeeccchhhccCC Confidence 766665531 11100000011111 111111111111111111110000 01111100 Q ss_pred ----eecccccee--eechhh------eeeecccccch-hhhhhhc-cccceeecccccchhhhHhHhhhhhhhhh---- Q lcl|Aclame:pro 134 ----VINSATGDV--ATSEDS------IELTYFHADWS-QLDEFPS-DVNNFAVADRRFDLKGVGVLDETHSWASD---- 195 (426) Q Consensus 134 ----~~~~~~g~~--t~~~~~------~~~~~~~~d~~-~~~~~~s-~~~~~~la~~~~~~~~~~~~~~~~~wa~~---- 195 (426) +.....+.. ...... ........++. .+..+.. ..+...++.... .......+.+|+.. 
T Consensus 161 d~V~a~~~~~g~~~~~~~~~l~~~~~gg~~~~~~~~~~~~l~~~e~~~~n~l~~~~~~~---~~~i~~~~~a~ik~~r~~ 237 (451) T protein:vir:10 161 DYITAKVVEEGSSKPVAFTNVSGTLTGGTTTESNKVESLLNDALENEEYAVVTTAGFEP---SSNMNKLVVEAVKRLREN 237 (451) T ss_pred ceEEEEecccccccceeeeecccccccccccCCccchHHHHHHhccceeeEEEEccCCC---chHHHHHHHHHHHHHHHh Confidence 000000000 000000 00000011111 1111111 011111211100 00111223445432 Q ss_pred cceEEEEEecccccccchhhHHHHHHHhhccCcceEEEEecC-CCccchhHHHHHHHhhhcccccceeecccccceeecc Q lcl|Aclame:pro 196 EDMGMIANGVNVDDYDSVDEAMDVAHEVAGYVPSGDLMMIVD-ASDDDLAAYQLGKFAVSEPWYNPLWNELPAGETVSKN 274 (426) Q Consensus 196 ~~kl~~~~~~d~~~~~~~~~~~~~a~~~a~~~~rt~~~~~~~-~~~~~~~aa~~g~~~~~~p~~~~~~~~~~~~~~~~~~ 274 (426) +.+-+.+...+..........+. +.-....+.... -......+|+.|.++......+++ + T Consensus 238 ~g~~~~aVl~~~~~~~~d~egii-------nv~n~~~~~dg~~~~~~~~~~~vAG~~Ag~~~~~S~T------------~ 298 (451) T protein:vir:10 238 EGRKVRGVIPTDADTTYNYEGIS-------TVVNGYTLSDGTNVDVKDATGYFAGISASADVATSLT------------Y 298 (451) T ss_pred cCCeEEEEecCccCCCCCCcceE-------EeecceEecCceeechhhhHHHHHHHHcccccccCcc------------c Confidence 22222221111100000000000 000011111000 011223466666666543222333 3 Q ss_pred cccccc---ccccchhHHHHhhcCcc-EEEEEcCC-EEEeeceee-----cCcccCcceeehhhhHHHHHHHHHHHHHHH Q lcl|Aclame:pro 275 VGDPEE---QGTFEGGDEAEGEGPVN-VLIDVSDA-NRVSNAVTT-----AGADSDTSFFDIRRTKVYTAEMLELDLESL 344 (426) Q Consensus 275 k~~~gv---~~~~~~~~~~~~~~~~N-~~~~~~g~-~~~~~~~t~-----~G~~~sg~~iD~i~g~dwl~~~iq~~l~~l 344 (426) +..+++ ...++.+|+.++-.... ++....|. ..+.++..+ +.+.-+-..|=++|..|.+.++|+...-+. 
T Consensus 299 ~~~~~~~~v~~~~t~~e~~~~i~~G~lvl~~~~g~~v~i~~~INTltt~~~~k~~~~~ki~vir~~D~i~~di~~~~~~~ 378 (451) T protein:vir:10 299 FEVEDAVSAYPKFDNEKTIKALDAGQIVFTTRPGQRVVIEQDINSLHKFTAEKPQAFSKNRVIRTLDEIATNTENTFERT 378 (451) T ss_pred eecCCceeeeeeCCHHHHHHHHhCCeEEEEEEcCCeEEEEEccccceecCCCCCcchhhhhHHHHHHHHHHHHHHHhhhc Confidence 334433 34577788765554443 33333453 556676543 333333446888999999999888766554 Q ss_pred HhcCCCCcccHHHHHHHHHHHHHHHHHhhcCCCccccceeEecCcccCcH-HHHHhhcCCCeEEEEEEcccEEEEEEEEE Q lcl|Aclame:pro 345 QVSDDDVPFTEDGQAMIEDAIKGTMSGLTGSVGQPLAEYEVDVPEWDDDD-VDRVNRNWGGIDLDARLAQRAHTFSLGLN 423 (426) Q Consensus 345 l~~~~KIp~td~Gi~~i~~~v~~~l~~~v~~~g~~~~~y~~~~p~~~~~~-~dra~R~~~~i~~~~~laGAIh~v~I~g~ 423 (426) ++ .|+|=+..|-.++.+.|.+-|++-.+.+ .+..|... ..+.. .+-.... -+++.++.--++.++.+.+. T Consensus 379 yi--Gk~~N~~~gr~~~~~~i~~yl~~l~~~g--~i~~~~~~---d~~v~~~~~~~~v--~v~~~v~pvdame~iy~t~~ 449 (451) T protein:vir:10 379 YL--GNVGNNAAGRDLFKADRIAYLTSLQNRN--MIQSFANT---DITVEAGNDMDSI--VVNLAVTPVDAMEKLYMTMV 449 (451) T ss_pred cc--eecCCCHHHHHHHHHHHHHHHHHHHhCC--CccCCCcc---ceEEeecCCCCEE--EEEEEEEEEeeeeeEEEEEE Confidence 44 4899999999999999999998876644 22333211 00110 0111122 28889999999999999888 Q ss_pred Ee Q lcl|Aclame:pro 424 VS 425 (426) Q Consensus 424 v~ 425 (426) +. T Consensus 450 v~ 451 (451) T protein:vir:10 450 VR 451 (451) T ss_pred Ec Confidence 88 No 50 >protein:vir:103456 Length: 659 # NCBI annotation: tail sheath monomer # Family: family:all:661 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803108;genbank:gi:116326388;genbank:GeneID:4405485 Probab=89.33 E-value=0.027 Score=29.19 Aligned_cols=387 Identities=12% Similarity=0.030 Sum_probs=144.9 Q ss_pred CCCceEE-----------------EEEeeccccc-cccCccceEEEe--cccccccccchhhhheeecHHHHHhccCCCC Q lcl|Aclame:pro 1 MPKQIVE-----------------IELTAEIADR-PQETFTDAAIVG--TAEEEPPDAEFGEVNQYSTSTSVGDDYGEDS 60 (426) Q Consensus 1 mp~~iVn-----------------V~isl~t~a~-~~~~Fg~~Lilg--~~~~~~~~~~~~~~~~Yts~~~V~~Dfg~~s 60 (426) .+..+.. +++.....+. 
...--....+.. .+...+. .....+. +... T Consensus 210 ~~~~~~~~~~~~v~a~~~G~~g~~~tv~~~~~a~~~~~~~v~v~~~~~~~~~a~~~----t~~~~~~---------~~~~ 276 (659) T protein:vir:10 210 FQANLKKYGIPGVVALYPGELGDKIEIEIVSKADYAKGASALLPIYPGGGTRASTA----KAVFGYG---------PQTD 276 (659) T ss_pred cccceeecccccccccccceecccceEEEechhhccccceeeeeeeeecccccccc----eeeeeec---------cccc Confidence 0011110 1111100000 000000000000 0000000 0000000 0000 Q ss_pred HHHHHHHHHHhcCCceeeeeeccccccccccccccceeccceeecccccccchhhhhhhcccccccc--cceeeeeeccc Q lcl|Aclame:pro 61 DVYTASEAIEEMGAEQWRVMVLEATEVTEEELSDGDTIDKVPILGNHEVESPDGDIEFTTDDDPDVE--DFDAEIVINSA 138 (426) Q Consensus 61 p~YkAA~~~f~Q~~~~~~~~v~~~t~v~~~~~~~~~tv~~~~~s~~~~~~~ta~~i~~~~~~~~~~t--~~~~~~~~~~~ 138 (426) ..+.-+ .-..+......-+. ...+................ +....-+.......+... ........+. T Consensus 277 ~~~~~~--v~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~--~~~~~~v~~~~~~~~~~~~~~~~l~gg~~~- 346 (659) T protein:vir:10 277 SQYAII--VRRNDAIVQSVVLS-----TKRGEKDIYDSNIYIDDFFA--KGGSEYIFATAQNWPEGFSGILTLSGGLSS- 346 (659) T ss_pred cchhhc--cccccceeeeeeee-----ccccccccccchhhhhhhhc--cCcccEEEEeecccCCCccceeeecccccc- Confidence 000000 00000000000000 00000000000000000000 000000000000000000 0000000000 Q ss_pred cceeeechhheeeecccccchhhhhhhc-cccceeeccccc-ch-hhhHhHhhhhhhhhhcc-eEEEEEecccccc---- Q lcl|Aclame:pro 139 TGDVATSEDSIELTYFHADWSQLDEFPS-DVNNFAVADRRF-DL-KGVGVLDETHSWASDED-MGMIANGVNVDDY---- 210 (426) Q Consensus 139 ~g~~t~~~~~~~~~~~~~d~~~~~~~~s-~~~~~~la~~~~-~~-~~~~~~~~~~~wa~~~~-kl~~~~~~d~~~~---- 210 (426) .+..+... .. .-+..+..... ..+...++.... .. ........+..-++... .+++......... 
T Consensus 347 ~~~~~~~~-~~------~~~~~~~~~~~~~~~il~~p~~~~~~~~~~~~v~~al~~~~~~~~~~~~~~d~p~~~~~~~~~ 419 (659) T protein:vir:10 347 NAEVTAGD-LM------EAWDFFADRESVDVQLFIAGSCAGESLETASTVQKHVVSIGDARQDCLVLCSPPRETVVGIPV 419 (659) T ss_pred cccccchh-HH------HHHHHhhhccccceeEEEecCCCCcchhhhHHHHHHHHHHHHhhCCeEEEEcCccccccCCCc Confidence 00000000 00 00000000000 011111111000 00 00111112222233332 2322221111110 Q ss_pred -cchhhHHHHHHHhhc------cCcc-eEEEEe-------cCCC---ccchhHHHHHHHhhhcccccceeecccccceee Q lcl|Aclame:pro 211 -DSVDEAMDVAHEVAG------YVPS-GDLMMI-------VDAS---DDDLAAYQLGKFAVSEPWYNPLWNELPAGETVS 272 (426) Q Consensus 211 -~~~~~~~~~a~~~a~------~~~r-t~~~~~-------~~~~---~~~~~aa~~g~~~~~~p~~~~~~~~~~~~~~~~ 272 (426) ...+.....+..... ++.. ...+|+ +... ..-|.++++|.++-.+.-+.+ |+. +.|.... T Consensus 420 ~~~~~~~~~~r~~~~~~~~~~~~~~s~~~~l~~p~~~~~d~~~~~~~~~p~sg~~AGl~Ar~D~~~g~-~~s-pan~~~~ 497 (659) T protein:vir:10 420 TRAVDNLVNWRTAAGSYTDNNFNISSTYAAIDGNYKYQYDKYNDVNRWVPLAADIAGLCARTDNVSQT-WMS-PAGYNRG 497 (659) T ss_pred ccCHHHHHHHHHhcccccccccccCcceEEEEeCcEEEecccCCceEEechHHHHHHHHHHHhccCCc-eEc-cCCceee Confidence 111111111111000 0111 112221 1111 123668888888876543221 111 1111111 Q ss_pred ccccccccccccchhHHHHhhcCc-cEEEEEcC-CEEEeeceeecCcccCcceeehhhhHHHHHHHHHHHHHHHHhcCCC Q lcl|Aclame:pro 273 KNVGDPEEQGTFEGGDEAEGEGPV-NVLIDVSD-ANRVSNAVTTAGADSDTSFFDIRRTKVYTAEMLELDLESLQVSDDD 350 (426) Q Consensus 273 ~~k~~~gv~~~~~~~~~~~~~~~~-N~~~~~~g-~~~~~~~~t~~G~~~sg~~iD~i~g~dwl~~~iq~~l~~ll~~~~K 350 (426) ..+...+..-.+..+|...|+.+. |.++.+.| +..+|-+.|+++....-.||-++|..+||...|+..++..+-. 
T Consensus 498 ~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~~~~s~~~~i~vrR~~~~i~~si~~~~~~~v~e--- 574 (659) T protein:vir:10 498 QILNVIKLAIETRQAQRDRLYQEAINPVTGTGGDGYVLYGDKTATSVPSPFDRINVRRLFNMLKTNIGRSSKYRLFE--- 574 (659) T ss_pred eeeccccceecCCHhHHHHHhhCCeeEEEEeCCCeEEEEcccccCCCCcccceEehhhHHHHHHHHHHHHHHHhccC--- Confidence 111111111235667777777554 88888765 6888888887766555679999999999999999998876532 Q ss_pred CcccHHHHHHHHHHHHHHHHHhhcCCCccccceeEecCcccCcHHHHHhhcCCCeEEEEEEcccEEEEEEEEEEeC Q lcl|Aclame:pro 351 VPFTEDGQAMIEDAIKGTMSGLTGSVGQPLAEYEVDVPEWDDDDVDRVNRNWGGIDLDARLAQRAHTFSLGLNVSV 426 (426) Q Consensus 351 Ip~td~Gi~~i~~~v~~~l~~~v~~~g~~~~~y~~~~p~~~~~~~dra~R~~~~i~~~~~laGAIh~v~I~g~v~v 426 (426) |.+..=...|+..|+.-|++-++++. +-.|.+...+...+++|+.+-++. +.+.+...-.++++.++...+- T Consensus 575 -~n~~~l~~~i~~~i~~fL~~l~~~ga--l~~~~V~~d~~~nt~~~i~~G~~~-~~i~~~p~~pae~i~~~~~~~~ 646 (659) T protein:vir:10 575 -LNNAFTRSSFRTETAQYLQGIKALGG--IYEYRVVCDTTNNTPSVIDRNEFV-ATFYIQPARSINYITLNFVATA 646 (659) T ss_pred -CCCHHHHHHHHHHHHHHHHHHHhcCc--eeeEEEEEcCCCCCHHHhhCCeEE-EEEEEEecCCcceEEEEEEEEe Confidence 67777788899999999999887653 447888888777788888887776 8899999999999999887775 No 51 >protein:vir:106984 Length: 743 # NCBI annotation: contractile sheath protein gp18 # Family: family:all:661 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195136;genbank:gi:58532913;uniprot:Q5GQN6;genbank:GeneID:3260481 Probab=88.38 E-value=0.033 Score=28.73 Aligned_cols=381 Identities=10% Similarity=0.008 Sum_probs=148.5 Q ss_pred CCCceEE--EEEeeccc-cccccCccce------EEEecccccccccchhhhheeecHHHHHh-ccCCCCHHHHHHHHHH Q lcl|Aclame:pro 1 MPKQIVE--IELTAEIA-DRPQETFTDA------AIVGTAEEEPPDAEFGEVNQYSTSTSVGD-DYGEDSDVYTASEAIE 70 (426) Q Consensus 1 mp~~iVn--V~isl~t~-a~~~~~Fg~~------Lilg~~~~~~~~~~~~~~~~Yts~~~V~~-Dfg~~sp~YkAA~~~f 70 (426) ...++-. ++....+. ......|... .+.+.+.. -+.+|..+..-.+ -....++.|..-. . 
T Consensus 313 ~~~~~~~~~~t~~~~~~~~~~~d~~~v~v~~~~~~~~~~~~~--------v~~~~~~~s~~~~~~~~~~~~~~~~~~--~ 382 (743) T protein:vir:10 313 KLGDIGPRPGTSQFATDNGITDDQVHFAVIDTTGELTGTANT--------IVERLTYLSKLSDARSEENANIYYKNV--I 382 (743) T ss_pred ccccccccceeeeccccccccccceEEEEecCcceeeeccCc--------eeEEEeeeecccccccccCcceeecce--e Confidence 1111100 00000000 0000011100 01111110 0111222211110 0112233332110 0 Q ss_pred hcCCceeeeeeccccccccccccccceeccceeecccccccchhhhhhhcccccccccceeeeeeccccceeeechhhee Q lcl|Aclame:pro 71 EMGAEQWRVMVLEATEVTEEELSDGDTIDKVPILGNHEVESPDGDIEFTTDDDPDVEDFDAEIVINSATGDVATSEDSIE 150 (426) Q Consensus 71 ~Q~~~~~~~~v~~~t~v~~~~~~~~~tv~~~~~s~~~~~~~ta~~i~~~~~~~~~~t~~~~~~~~~~~~g~~t~~~~~~~ 150 (426) .+.-..........+...............+.......... ............++........ T Consensus 383 ~~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------------~~~~~~~~~gG~d~~~~~~~~~-- 445 (743) T protein:vir:10 383 NEQSAYLYHGNDAAVQIAASGEAWGQSSDQVLADAGTAFSR---------------TTGYWVNLAGGNDDFAYDAGEF-- 445 (743) T ss_pred ccccceeeccCcccceeeeccccCccccceeeeeccccccc---------------ccceEEEeecCccccccchhHH-- Confidence 01000000000000000000000000000000000000000 0000000000000000000000 Q ss_pred eecccccchhhhhhhc-cccceeecccccc-hhhhHhHhhhhhhhhhcc-eEEEEEeccccc--------ccchhhH-HH Q lcl|Aclame:pro 151 LTYFHADWSQLDEFPS-DVNNFAVADRRFD-LKGVGVLDETHSWASDED-MGMIANGVNVDD--------YDSVDEA-MD 218 (426) Q Consensus 151 ~~~~~~d~~~~~~~~s-~~~~~~la~~~~~-~~~~~~~~~~~~wa~~~~-kl~~~~~~d~~~--------~~~~~~~-~~ 218 (426) . .-+..+..... .++...++..... .........+..-++... ++.+........ .....+. .. 
T Consensus 446 ---~-~~~~~~~~~~~~~~~ll~~p~~~~~~~~~~~v~~a~~~~~~~~~~~~a~~d~p~~~~~~~~~~~~~~~~~~~~~~ 521 (743) T protein:vir:10 446 ---G-AAMDLFLDTEETEIDFVLMGGSMADEADTKSKATKVIAIAASRKDALAFVSPHKGNQIASTGNVALSSAQQKENT 521 (743) T ss_pred ---H-HHHHHhhhccccCcceEEecCcccCccchHHHHHHHHHHHHhhCCeEEEEecCCCccccccccccccccccchHH Confidence 0 00001110000 0111111111000 011111222333333332 333332111110 0011111 11 Q ss_pred HHHHhhccCcce-EEEE-------ecCCC---ccchhHHHHHHHhhhc----ccccceeecccccceeeccccccccccc Q lcl|Aclame:pro 219 VAHEVAGYVPSG-DLMM-------IVDAS---DDDLAAYQLGKFAVSE----PWYNPLWNELPAGETVSKNVGDPEEQGT 283 (426) Q Consensus 219 ~a~~~a~~~~rt-~~~~-------~~~~~---~~~~~aa~~g~~~~~~----p~~~~~~~~~~~~~~~~~~k~~~gv~~~ 283 (426) .... ..+..+. ..+| +.... ..-|.++++|.++-.+ ||..+..+...+.. ...+..-. T Consensus 522 ~~~~-~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~AGl~a~~D~~~g~~~span~~~~gi~------g~~~~~~~ 594 (743) T protein:vir:10 522 IAFF-SDLTSTSYAVFDSGYKYVYDRFTDKYRYIPCNGDVAGLCVQTSNQLDDWYSPAGLNRGGIL------NAVKLAYN 594 (743) T ss_pred HHHH-HhccCCeeEEEEccceeeeccccCceeEechhHHHHHHHHHhhccCCcEEccCCeeeeeee------ccccceec Confidence 1111 1111111 1111 11111 1235688888888775 44444333221111 11111123 Q ss_pred cchhHHHHhhcCc-cEEEEEc-CCEEEeeceeecCcccCcceeehhhhHHHHHHHHHHHHHHHHhcCCCCcccHHHHHHH Q lcl|Aclame:pro 284 FEGGDEAEGEGPV-NVLIDVS-DANRVSNAVTTAGADSDTSFFDIRRTKVYTAEMLELDLESLQVSDDDVPFTEDGQAMI 361 (426) Q Consensus 284 ~~~~~~~~~~~~~-N~~~~~~-g~~~~~~~~t~~G~~~sg~~iD~i~g~dwl~~~iq~~l~~ll~~~~KIp~td~Gi~~i 361 (426) +...|...|+.+. |.++.+. ++..+|-+.|..+.+..=.||=++|-.+||+..|+..++..+-. 
|.+..=...| T Consensus 595 ~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~s~d~~~~~i~vrR~~~~i~~si~~~~~~~v~e----~n~~~~~~~i 670 (743) T protein:vir:10 595 PNKADRDELYQNRINPVVSLRGQGITLFGDKTALAAPSAFDRINVRRLFLNLEKRARRLAEGVLFE----QNDATTRAGF 670 (743) T ss_pred CChhHHHhHhhCCceEEEEecCCeEEEEcccccCCCCcccceEeehhhHHHHHHHHHHHHHHhccC----CCCHHHHHHH Confidence 4566777776544 8888764 46777877776665555679999999999999999999876643 6688888899 Q ss_pred HHHHHHHHHHhhcCCCccccceeEecCcccCcHHHHHhhcCCCeEEEEEEcccEEEEEEEEEEeC Q lcl|Aclame:pro 362 EDAIKGTMSGLTGSVGQPLAEYEVDVPEWDDDDVDRVNRNWGGIDLDARLAQRAHTFSLGLNVSV 426 (426) Q Consensus 362 ~~~v~~~l~~~v~~~g~~~~~y~~~~p~~~~~~~dra~R~~~~i~~~~~laGAIh~v~I~g~v~v 426 (426) +..|+.-|.+-+++++ +..|.+.......+++|+.+-++. +.+.+...-.++++.++..-+- T Consensus 671 ~~~i~~fL~~l~~~ga--l~~~~V~~d~~~nt~~~i~~G~~~-~~i~~~p~~pae~I~~~~~~~~ 732 (743) T protein:vir:10 671 SSALNSYLSEVQARRG--VTDYLVICDESNNTPDIIDRNEFV-AEVYVKPTRSINFITITFTATK 732 (743) T ss_pred HHHHHHHHHHHHhcCc--eeeeEEEEcCCCCCHHHhhCCeEE-EEEEEEecCCcceEEEEEEEee Confidence 9999999999887664 567888887666677888887776 8888999999999998877554 No 52 >protein:vir:99306 Length: 587 # NCBI annotation: putative major tail sheath protein # Family: family:all:2449 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024479;genbank:gi:48696438;genbank:GeneID:2948034 Probab=88.34 E-value=0.033 Score=28.71 Aligned_cols=395 Identities=13% Similarity=0.090 Sum_probs=175.6 Q ss_pred CC-----CceEEEEEeec-cccccccCccceEEEecccccccccchhhhheeecHHHHHhccCCCCHHHHHHHHHHhc-- Q lcl|Aclame:pro 1 MP-----KQIVEIELTAE-IADRPQETFTDAAIVGTAEEEPPDAEFGEVNQYSTSTSVGDDYGEDSDVYTASEAIEEM-- 72 (426) Q Consensus 1 mp-----~~iVnV~isl~-t~a~~~~~Fg~~Lilg~~~~~~~~~~~~~~~~Yts~~~V~~Dfg~~sp~YkAA~~~f~Q-- 72 (426) -| ++=|-|.+.-+ ..+...-+.+.+.|||....-|| .+.-++++.++...=||... 
.-.+.+++|.| T Consensus 6 ~~~~~~~~pgv~~~~~~~~~~~~~~~~~~~~~~~g~a~~G~~----~~~~~~~~~~~~~~~~~~g~-l~~~~~~a~~~~~ 80 (587) T protein:vir:99 6 FPRRPITRPHASIEVDTSGIGGSAGSSEKVFCLIGQAEGGEP----NTVYELRNYSQAKRLFRSGE-LLDAIELAWGSNP 80 (587) T ss_pred cCCcccccCceEEEEecCCccccCCCCCceEEEEEEecCCcc----ceeEEeccHHHHHHHhcCcc-hHHHHHHHhcccc Confidence 23 34455666553 56677788999999999887776 34667899999999997644 55678889976 Q ss_pred --CCceeeeeeccccc---c-----ccccccccceeccc-------eeeccccc-----ccchhhhhh------------ Q lcl|Aclame:pro 73 --GAEQWRVMVLEATE---V-----TEEELSDGDTIDKV-------PILGNHEV-----ESPDGDIEF------------ 118 (426) Q Consensus 73 --~~~~~~~~v~~~t~---v-----~~~~~~~~~tv~~~-------~~s~~~~~-----~~ta~~i~~------------ 118 (426) +....++++.+... . +...+.++..-|.. ++++.... ++-..++.+ T Consensus 81 ~~g~~~~~~~rv~~~~~a~~~~~~l~~~a~~~G~~gN~i~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~v~~i~y~g 160 (587) T protein:vir:99 81 NYTAGRILAMRIEDAKPASAEIGGLKITSKIYGNVANNIQVGLEKNTLSDSLRLRVIFQDDRFNEVYDNIGNIFTIKYKG 160 (587) T ss_pred CCCceEEEEEEcCCCceeEEEecCeEEEEeeccccccceEEEEccCCCCcceeEEEEEecccceeeeeeccceeeEEeec Confidence 34445544433211 1 11112222221111 11110000 000000000 Q ss_pred ---------------------hcccccccccceeeeeecccc----------cee---e-----echhheeeecc----- Q lcl|Aclame:pro 119 ---------------------TTDDDPDVEDFDAEIVINSAT----------GDV---A-----TSEDSIELTYF----- 154 (426) Q Consensus 119 ---------------------~~~~~~~~t~~~~~~~~~~~~----------g~~---t-----~~~~~~~~~~~----- 154 (426) .+..... + +..-.+.+... ..+ + .........+. 
T Consensus 161 ~~~~a~~~v~~~~~t~~a~~~~l~~g~~-~-v~~yrL~~g~~~~~~~~~~~i~~~~~~tAky~~~~~~~i~~~~~~~~~~ 238 (587) T protein:vir:99 161 EEANATFSVEHDEETQKASRLVLKVGDQ-E-VKSYDLTGGAYDYTNAIITDINQLPDFEAKLSPFGDKNLESSKLDKIEN 238 (587) T ss_pred ccccceeeEeecCcceeeeeeeeecCCc-e-eEEEEecCCchHHHHHHHhhhccccceeEEeeccCCceeEeeccccccc Confidence 0000000 0 00000000000 000 0 00000000000 Q ss_pred -------------cccc------hh------hh---hhhc-----------------------cccceeecccccchhhh Q lcl|Aclame:pro 155 -------------HADW------SQ------LD---EFPS-----------------------DVNNFAVADRRFDLKGV 183 (426) Q Consensus 155 -------------~~d~------~~------~~---~~~s-----------------------~~~~~~la~~~~~~~~~ 183 (426) ..|. .. +. +... +.....|.....+-.. T Consensus 239 ~~v~~~~~~v~a~~~D~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~t~LtGG~dG~~~- 317 (587) T protein:vir:99 239 ANIKDKAVYVKAVFGDLEKQTAYNGIVSFEQLNAEGEVPSNVEVEAGEESATVTATSPIKTIEPFELTKLKGGTNGEPP- 317 (587) T ss_pred ceeeeeeeeeehhccceeeecccceeeeeeecccccchhhhhhhhhccccceeeeeccccceecccceeeecCCCCCcc- Confidence 0000 00 00 0000 0000001100000000 Q ss_pred HhHhhhhhhhhhcceEEEEE-eccccccc------------------------chhhHHHHHHHhhccCcceEEEEecC- Q lcl|Aclame:pro 184 GVLDETHSWASDEDMGMIAN-GVNVDDYD------------------------SVDEAMDVAHEVAGYVPSGDLMMIVD- 237 (426) Q Consensus 184 ~~~~~~~~wa~~~~kl~~~~-~~d~~~~~------------------------~~~~~~~~a~~~a~~~~rt~~~~~~~- 237 (426) .+.+..-+-.+..+..++.. +.+..... ...-.....+.++-++.| +...+.. 
T Consensus 318 ~sy~~al~ale~~~~~~i~~~t~d~~i~a~l~a~vk~~r~~g~~~~aVlg~~~~~~~~~~~~~a~~~n~e~-vi~v~~~~ 396 (587) T protein:vir:99 318 ATWADKLDKFAHEGGYYIVPLSSKQSVHAEVASFVKERSDAGEPMRAIVGGGFNESKEQLFGRQASLSNPR-VSLVANSG 396 (587) T ss_pred ccHHHHHHHHhhCCcEEEEecCCCHHHHHHHHHHHHHHHhCCCcEEEEecCCCCCCHHHHHHHhhhcCCCc-EEEEeccc Confidence 00000000112222222211 11110000 000001111222223323 2222221 Q ss_pred -------CCc----cchhHHHHHHHhhhcccccceeecccccceeeccccccccccccchhHHHHhhcC-ccEEEEEcCC Q lcl|Aclame:pro 238 -------ASD----DDLAAYQLGKFAVSEPWYNPLWNELPAGETVSKNVGDPEEQGTFEGGDEAEGEGP-VNVLIDVSDA 305 (426) Q Consensus 238 -------~~~----~~~~aa~~g~~~~~~p~~~~~~~~~~~~~~~~~~k~~~gv~~~~~~~~~~~~~~~-~N~~~~~~g~ 305 (426) ... .+..+++.|..+..+|..+++++... ..++...++.+|+..+..+ .+.++...+. T Consensus 397 ~~~~~dg~~~~~~~~~~aa~vAGl~Ag~~~~~SlT~~~i~----------~~~v~~~~t~~e~e~li~~Gvl~l~~~~~~ 466 (587) T protein:vir:99 397 TFVMDDGRKNHVPAYMVAVALGGLASGLEIGESITFKPLR----------VSSLDQIYESIDLDELNENGIISIEFVRNR 466 (587) T ss_pred eEecCCCceeeechHHHHHHHHHHHhcCchhcCccceeee----------cccccccCCHHHHHHHHhCCeEEEEEecCC Confidence 001 12346777777777766555544321 2234455677776655433 3555543332 Q ss_pred ----EEEeeceeecCcccCcce--eehhhhHHHHHHHHHHHHHHHHhcCCCCcccHHHHHHHHHHHHHHHHHhhcCCCcc Q lcl|Aclame:pro 306 ----NRVSNAVTTAGADSDTSF--FDIRRTKVYTAEMLELDLESLQVSDDDVPFTEDGQAMIEDAIKGTMSGLTGSVGQP 379 (426) Q Consensus 306 ----~~~~~~~t~~G~~~sg~~--iD~i~g~dwl~~~iq~~l~~ll~~~~KIp~td~Gi~~i~~~v~~~l~~~v~~~g~~ 379 (426) ..+.++.++-....+-.| |=+||..|.+...|+..+.+.+.-. 
|=.+.|-..|++.|...|++-.+.+- T Consensus 467 ~~~~vriv~~ItT~t~~~~~~~~~i~viRv~D~i~~di~~~~~~~yiGk---~Nn~~~r~~i~~~i~~~L~~l~~~ga-- 541 (587) T protein:vir:99 467 TNTFFRIVDDVTTFNDKSDPVKAEMAVGEANDFLVSELKVQLEDQFIGT---RTINTSASIIKDFIQSYLGRKKRDNE-- 541 (587) T ss_pred cceEEEEeeceeeccCCCCchhhhhhhhhhHHHHHHHHHHHHHhhCCcc---ccchHHHHHHHHHHHHHHHHHHhCCc-- Confidence 355677777443334445 6699999999999999988777653 33578999999999999999876542 Q ss_pred ccceeEecCcccCcHHHHHh-hcCCCeEEEEEEcccEEEEEEEEEEeC Q lcl|Aclame:pro 380 LAEYEVDVPEWDDDDVDRVN-RNWGGIDLDARLAQRAHTFSLGLNVSV 426 (426) Q Consensus 380 ~~~y~~~~p~~~~~~~dra~-R~~~~i~~~~~laGAIh~v~I~g~v~v 426 (426) +-+|+. + +.+.++.+ |. -+++.++..-+++++.+++.+.- T Consensus 542 I~~~~~--~---dv~v~~~~d~~--~v~~~v~Pv~~mekIy~tv~~~~ 582 (587) T protein:vir:99 542 IQDFPA--E---DVQVIVEGNEA--RISMTVYPIRSFKKISVSLVYKQ 582 (587) T ss_pred ccCCCc--c---ceEEEecCCEE--EEEEEEEEcccceEEEEEEEEEe Confidence 334432 1 11111111 22 27889999999999998888866 No 53 >protein:vir:98824 Length: 774 # NCBI annotation: putative phage tail sheath protein # Family: family:all:661 # MgeID: mge:1530 # MgeName: Ma-LMM01 # Cross-refs: genbank:acc:YP_851105;genbank:gi:117530262;genbank:GeneID:4484488 Probab=87.78 E-value=0.037 Score=28.47 Aligned_cols=377 Identities=13% Similarity=0.018 Sum_probs=140.8 Q ss_pred CCCceEEEEEeeccccccccCccceEEEecccccccccchhhhheeecHHHHHhccCCCCHHHHHHHHHHhcCCceeeee Q lcl|Aclame:pro 1 MPKQIVEIELTAEIADRPQETFTDAAIVGTAEEEPPDAEFGEVNQYSTSTSVGDDYGEDSDVYTASEAIEEMGAEQWRVM 80 (426) Q Consensus 1 mp~~iVnV~isl~t~a~~~~~Fg~~Lilg~~~~~~~~~~~~~~~~Yts~~~V~~Dfg~~sp~YkAA~~~f~Q~~~~~~~~ 80 (426) .+-|-+.|.|.-.+.+ .|...+.+....- |.-. ..+++..-...+...|-.....|........ 
T Consensus 371 awGN~ItV~I~~~t~~----~~~l~v~~~~~s~------f~~~----~a~e~~tv~~~~~~~~~~v~e~~dn~~i~~~-- 434 (774) T protein:vir:98 371 NWGNQVTVSIYPVNNS----EFRLNVQDLNGSA------FNPP----LADEVYTVKLGDTNESGELNALLDSKFIRGF-- 434 (774) T ss_pred cCCCceEEEEEecCCc----eeEEEEEecCCcc------cccc----ccceeEEEecccccccceeeeeeceeeEeec-- Confidence 2223333333221111 1221111110000 0000 0000000000000111111111111000000 Q ss_pred eccccccccccccccceeccceeecccccccchhhhhhhccc-ccccccceee-eeeccccceeeechhheeeecccccc Q lcl|Aclame:pro 81 VLEATEVTEEELSDGDTIDKVPILGNHEVESPDGDIEFTTDD-DPDVEDFDAE-IVINSATGDVATSEDSIELTYFHADW 158 (426) Q Consensus 81 v~~~t~v~~~~~~~~~tv~~~~~s~~~~~~~ta~~i~~~~~~-~~~~t~~~~~-~~~~~~~g~~t~~~~~~~~~~~~~d~ 158 (426) ..+............+...+..... .+........+.. .......... .+....++..++ ...+... T Consensus 435 --~~~~~~~~in~vs~lv~~~~~~~a~--~d~~~~~~~~~~~~~~~~~~~~v~v~lagG~Dg~~tt-----~~~igg~-- 503 (774) T protein:vir:98 435 --FLPKSIDSINYDAALVRQSPLRLAP--PDESETDVENPAHVDFYGPNVLVDVTLENGYDGPPVT-----NDDYVSI-- 503 (774) T ss_pred --ccccccccccccccccccchhcccc--cccccccccccccccccCCcceEEEeecCCCCccccc-----chheecc-- Confidence 0000000000000000000000000 0000000000000 0000000000 000000110000 0000000 Q ss_pred hhhhhhh-ccccceeecccccchhhhHhHhhhhhhhh---hcceEEEEEecccccccchhhHHHHHHHhhccCcceEEEE Q lcl|Aclame:pro 159 SQLDEFP-SDVNNFAVADRRFDLKGVGVLDETHSWAS---DEDMGMIANGVNVDDYDSVDEAMDVAHEVAGYVPSGDLMM 234 (426) Q Consensus 159 ~~~~~~~-s~~~~~~la~~~~~~~~~~~~~~~~~wa~---~~~kl~~~~~~d~~~~~~~~~~~~~a~~~a~~~~rt~~~~ 234 (426) +.... ..++....+....+ ....+...++ ...++.++-...+... 
+...+.....++-.+...++ T Consensus 504 --~~~~~~tgi~aLl~a~~~~~-----V~~aii~~~e~~~~~~~~r~avid~p~g~----t~~~Ai~~r~~f~S~~aal~ 572 (774) T protein:vir:98 504 --IRTLENQPVHILLVGTTNVG-----VQQALITEAERASDSDGLRIAVLAAPPRT----TPTLAASVTRGFNSTRAVMV 572 (774) T ss_pred --cccccccceeEEEcCccchh-----hHHHHHHHHHHhhhcccceEEEEECCCCC----CHHHHHHHHhccCCceEEEE Confidence 00000 11111111111111 1111111121 1222222222221111 11111112222222223333 Q ss_pred ecCC----------CccchhHHHHHHHhhhcccccceeecccccceeeccccccccccccchhHHHHhhc-CccEEE--E Q lcl|Aclame:pro 235 IVDA----------SDDDLAAYQLGKFAVSEPWYNPLWNELPAGETVSKNVGDPEEQGTFEGGDEAEGEG-PVNVLI--D 301 (426) Q Consensus 235 ~~~~----------~~~~~~aa~~g~~~~~~p~~~~~~~~~~~~~~~~~~k~~~gv~~~~~~~~~~~~~~-~~N~~~--~ 301 (426) +.-. ....|.++++|.++..+||.++..+...+..-.. .........+..++..+.. ..|.+. . T Consensus 573 ~Pwvkv~D~~~g~~~~vPpSg~vAGl~ArtDv~kSPANk~I~Givg~a---i~~~l~~~~t~ae~d~Ln~~gIN~i~itt 649 (774) T protein:vir:98 573 AGWFTYAGQPNSSRYGVPGAAVYAGKLAAIDFFVSPAARSLVGPLFNI---IESDTDNYTSRSNQDIYSAARLEVLSLDT 649 (774) T ss_pred eCcEEEeccCCCceeecChhHHHHHHHHhcCcccccCCceeecceecc---ccccccccccchhhhhhcccccceeEEEE Confidence 3211 1223678899999988987765544322111000 0001112233344444432 225543 3 Q ss_pred EcCCEEEeeceeecCcccCcceeehhhhHHHHHHHHHHHHHHHHhcCCCCcccHHHHHHHHHHHHHHHHHhhcCCCcccc Q lcl|Aclame:pro 302 VSDANRVSNAVTTAGADSDTSFFDIRRTKVYTAEMLELDLESLQVSDDDVPFTEDGQAMIEDAIKGTMSGLTGSVGQPLA 381 (426) Q Consensus 302 ~~g~~~~~~~~t~~G~~~sg~~iD~i~g~dwl~~~iq~~l~~ll~~~~KIp~td~Gi~~i~~~v~~~l~~~v~~~g~~~~ 381 (426) ..++..+|-+.|+++ +..=.||=++|..|||+..|+..++.++ .| |.++.....|+..|+.-|..-++.| .+. 
T Consensus 650 ~g~G~rvWG~RTlss-Dp~wr~InVRRlfd~Ie~SI~~~~~~~V---fE-PNd~~l~~~I~~sI~~fL~~L~~~G-aL~- 722 (774) T protein:vir:98 650 VDRTYRFASGVTLST-DPAWERIYLRRVHDVVRQGAHAILRNYV---AM-PNSRLVRNQIAAALNAFMGELKRNG-NIV- 722 (774) T ss_pred cCCcEEEEcccccCC-CcccceEeehhhHHHHHHHHHHHHHHhc---cC-CCCHHHHHHHHHHHHHHHHHHHhCC-cee- Confidence 356667776666544 2334689999999999999999988765 34 8899999999999999999998755 333 Q ss_pred cee-EecCcccCcHHHHHhhcCCCeEEEEEEcccEEEEEEEEEEeC Q lcl|Aclame:pro 382 EYE-VDVPEWDDDDVDRVNRNWGGIDLDARLAQRAHTFSLGLNVSV 426 (426) Q Consensus 382 ~y~-~~~p~~~~~~~dra~R~~~~i~~~~~laGAIh~v~I~g~v~v 426 (426) +|+ +.......+++|+.+-++. +.+.+...-.++++.++..-.- T Consensus 723 G~~~V~~D~etNt~~dI~~G~l~-i~I~vaP~~PAEfIilri~q~t 767 (774) T protein:vir:98 723 SFRPAIIDGSNNSTAAYFSRELY-VSLQFQPLYSADYIYVTISRDT 767 (774) T ss_pred cceEEEEcCCCCCHHHhhCCEEE-EEEEEEecCCcceEEEEEEEee Confidence 454 4444455677788776665 7888889999999998877666 No 54 >protein:vir:63742 Length: 562 # NCBI annotation: Tsh # Family: family:all:2449 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547629;genbank:GeneID:3783475 Probab=86.15 E-value=0.048 Score=27.83 Aligned_cols=397 Identities=14% Similarity=0.095 Sum_probs=171.5 Q ss_pred CC-----CceEEEEEee-ccccccccCccceEEEecccccccccchhhhheeecHHHHHhccCCCCHHHHHHHHHHh--- Q lcl|Aclame:pro 1 MP-----KQIVEIELTA-EIADRPQETFTDAAIVGTAEEEPPDAEFGEVNQYSTSTSVGDDYGEDSDVYTASEAIEE--- 71 (426) Q Consensus 1 mp-----~~iVnV~isl-~t~a~~~~~Fg~~Lilg~~~~~~~~~~~~~~~~Yts~~~V~~Dfg~~sp~YkAA~~~f~--- 71 (426) -| ++=|-|.+.- +..+...-+.+.+.|||....-|| ++.-++++.++-..=||... .-.|.+++|. 
T Consensus 6 ~~~~~~~~Pgv~~~~~~s~~~~~~~~~~~~~~~iG~a~~G~~----~~~~~~~~~~~~~~~fg~g~-l~~~i~~a~~~~~ 80 (562) T protein:vir:63 6 YPRKPVSRPHTEISVDTSGIGGSSSGSEKILCLVGSATGGKP----NAVYKVRNYSQAKSVFRSGE-LLDAIERAWNPGE 80 (562) T ss_pred eCCCcccCCceEEEEecCCCcccCCCCCceEEEEEEeCCCCC----ceeEEEccHHHHHHHhcCCc-hHHHHHHhccccc Confidence 12 3445555554 566778889999999999877765 45778999999999997533 3356777774 Q ss_pred -cCCceeeeeecccccccc--------ccccccce-------------------------------------eccceeec Q lcl|Aclame:pro 72 -MGAEQWRVMVLEATEVTE--------EELSDGDT-------------------------------------IDKVPILG 105 (426) Q Consensus 72 -Q~~~~~~~~v~~~t~v~~--------~~~~~~~t-------------------------------------v~~~~~s~ 105 (426) .|....++++.+.+.... +...++.. |-....++ T Consensus 81 ~~g~~~~~~~rv~~a~~a~~~~~~~~~~a~~~g~~~n~i~v~~~~~~~~~~~~~~v~~~~~~~~ev~~~~g~V~~i~y~g 160 (562) T protein:vir:63 81 GTGAGDILAMRVEEAKEATFEAEGVKVSSTIYGADANDIQVALEDNTITGTKRLSIVFAKERVNQVYDNLGSIFSIKYKG 160 (562) T ss_pred cCCceEEEEEEcCCCccceeEecceeEEEeecccCCCeEEEEEecCCCCCCcceEEEecCCCcchhhhhccceeeeeeec Confidence 443333333322221100 00000000 00000000 Q ss_pred ccc-----------------------------------cccchhhhhhhcccccccccc--------------eeee--e Q lcl|Aclame:pro 106 NHE-----------------------------------VESPDGDIEFTTDDDPDVEDF--------------DAEI--V 134 (426) Q Consensus 106 ~~~-----------------------------------~~~ta~~i~~~~~~~~~~t~~--------------~~~~--~ 134 (426) .+. ...+...+.+.+.+.+..+.. +... . 
T Consensus 161 ~~~~~~~~v~~~~~~~~a~~l~~~~g~~~v~~~~L~~g~~~~~~~l~~~in~~~~~~aky~~~~gn~i~~~~~d~~~~~~ 240 (562) T protein:vir:63 161 TEASATFTVAVDPVTFKATKLTLKAGDKTVKEYDLGSGAYAETNVLISDINNLPDFEAKFFPIGDKNLTTDNFDAQIDVD 240 (562) T ss_pred ccccceEEEEecCcceeEEEEEeecCCcceeEEEecCCccchhHHHHHhhccccceEEEeeccCCceeeeeccccccccc Confidence 000 000000000000000000000 0000 0 Q ss_pred eccccceeeechhheeeecccccchhhhhhhc-cc---cceeecccccc-----hh-hh------------------HhH Q lcl|Aclame:pro 135 INSATGDVATSEDSIELTYFHADWSQLDEFPS-DV---NNFAVADRRFD-----LK-GV------------------GVL 186 (426) Q Consensus 135 ~~~~~g~~t~~~~~~~~~~~~~d~~~~~~~~s-~~---~~~~la~~~~~-----~~-~~------------------~~~ 186 (426) +.+....+.+............+|.......+ .+ ....|.....+ |. .. ... T Consensus 241 vkt~~~~v~t~~~d~~~~~~~~~~v~~~~~~~~~la~~~~~~LtGG~dGt~~~~~~~al~ale~~~~~~i~~~t~d~av~ 320 (562) T protein:vir:63 241 IKTKEAYVKAVGGDIEKQTAYNGYVDFEFDRSKEIANFPLTKLTGGDNGTIPESWADKFSYFANEGGYYLVPLTSKQAVH 320 (562) T ss_pred hhhhhhhhhhhhhhhhhcccccceeeeeeccccceecccceeeecCCCCCchhhHHHHHHHHHhCCcEEEEecCCCHHHH Confidence 00000000000000000000001110000000 00 00011000000 00 00 000 Q ss_pred hhhhhhh---hhcceEEEEEecccccccchhhHHHHHHHhhccCcceEEEEecC-------CCcc----chhHHHHHHHh Q lcl|Aclame:pro 187 DETHSWA---SDEDMGMIANGVNVDDYDSVDEAMDVAHEVAGYVPSGDLMMIVD-------ASDD----DLAAYQLGKFA 252 (426) Q Consensus 187 ~~~~~wa---~~~~kl~~~~~~d~~~~~~~~~~~~~a~~~a~~~~rt~~~~~~~-------~~~~----~~~aa~~g~~~ 252 (426) ..+.+|+ .++.+-..+...... ..+......+.+.-++.|-.++.... .... 
+..++++|..+ T Consensus 321 ~~l~a~vkr~~~~g~~~~aVlg~~~---~~~~~~~~~~a~~~n~ervv~v~~~~~~~~~~~~~~~~~~~~~aa~vAGl~A 397 (562) T protein:vir:63 321 AEALQFVRDCSYNGNPMRVFVGGGI---GESMEQLFTRAIGLQNERAGLIGFSGTVKMDDGRSLKMPGYMFAAQVAGLTC 397 (562) T ss_pred HHHHHHHHHHHhCCCcEEEEecCCC---CCCHHHHHHHhhhcCCCcEEEEecCeeEECCCCceeeechhHHHHHHHHHhh Confidence 1122222 111111111111000 00111111222222333333322110 0011 23567777777 Q ss_pred hhcccccceeecccccceeeccccccccccccchhHHHHhhcCc-cEEEEEcCC-E---EEeeceeecCcccCcce--ee Q lcl|Aclame:pro 253 VSEPWYNPLWNELPAGETVSKNVGDPEEQGTFEGGDEAEGEGPV-NVLIDVSDA-N---RVSNAVTTAGADSDTSF--FD 325 (426) Q Consensus 253 ~~~p~~~~~~~~~~~~~~~~~~k~~~gv~~~~~~~~~~~~~~~~-N~~~~~~g~-~---~~~~~~t~~G~~~sg~~--iD 325 (426) ..+|..+++++... ..++...++.+|+..+..+. +++....+. . .+.++.++-+...+-.| |= T Consensus 398 ~~~~~~SlT~~~i~----------~~~v~~~~t~~e~~~li~~Gv~~l~~~~~~~v~~~~iv~~itT~t~~~~~~~~ki~ 467 (562) T protein:vir:63 398 GLEIGEAITFKNIA----------IETLDTIYEGSQLDQLNESGIITAEFVRNRAVTNFRIVDDVTTFNDKTDPVKSEIG 467 (562) T ss_pred cCchhcCccceeec----------cccccccCCHHHHHHHHhCCeEEEEEecCCcEEEEEeeccceecCCCCCchhhhhh Confidence 77766555554321 23344556777776655443 666554433 2 23466666443333334 67 Q ss_pred hhhhHHHHHHHHHHHHHHHHhcCCCCcccHHHHHHHHHHHHHHHHHhhcCCCccccceeEecCcccCcHHHHHhhcCCCe Q lcl|Aclame:pro 326 IRRTKVYTAEMLELDLESLQVSDDDVPFTEDGQAMIEDAIKGTMSGLTGSVGQPLAEYEVDVPEWDDDDVDRVNRNWGGI 405 (426) Q Consensus 326 ~i~g~dwl~~~iq~~l~~ll~~~~KIp~td~Gi~~i~~~v~~~l~~~v~~~g~~~~~y~~~~p~~~~~~~dra~R~~~~i 405 (426) +||..|.+...|+..+.+.+.- | |=.+.|-..|.+.|.+.|++-.+.+- +-+|+ | ++.+.++.+.+. 
-+ T Consensus 468 viRv~D~i~~dir~~~~~~yiG--k-~Nn~~~r~~v~~~i~~~L~~l~~~ga--I~~~~---~--~dv~v~~~~d~~-~v 536 (562) T protein:vir:63 468 VGEANDFLVSELKISLDNEYIG--T-KIIDTSASLVKNFVQSFLDRKKLAKE--IQDYS---P--EEVQVVIEGDVA-RI 536 (562) T ss_pred hhHHHHHHHHHHHHHHHhcCCc--c-ccChHHHHHHHHHHHHHHHHHHhCCc--ccCCC---c--cceEEEecCCEE-EE Confidence 9999999999999888776654 3 45678999999999999999876542 23342 1 111111111112 36 Q ss_pred EEEEEEcccEEEEEEEEEEeC Q lcl|Aclame:pro 406 DLDARLAQRAHTFSLGLNVSV 426 (426) Q Consensus 406 ~~~~~laGAIh~v~I~g~v~v 426 (426) +|.++..-++|++.+++++.- T Consensus 537 ~~~v~pv~~mekIy~ti~~~~ 557 (562) T protein:vir:63 537 SLTVFPIRSMKKIEVSLVYRQ 557 (562) T ss_pred EEEEEEcccceEEEEEEEEee Confidence 788999999999999888877 No 55 >protein:vir:102819 Length: 648 # NCBI annotation: tail sheath protein # Family: family:all:2449 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874082;genbank:gi:118197689;genbank:GeneID:4496011 Probab=83.79 E-value=0.066 Score=27.07 Aligned_cols=395 Identities=11% Similarity=0.011 Sum_probs=144.8 Q ss_pred CC-C------ceEEEEEeeccccccccCccceEEEecccccccccchhhhheeecHHHHHhccCCCCHHHHHHHHHHhcC Q lcl|Aclame:pro 1 MP-K------QIVEIELTAEIADRPQETFTDAAIVGTAEEEPPDAEFGEVNQYSTSTSVGDDYGEDSDVYTASEAIEEMG 73 (426) Q Consensus 1 mp-~------~iVnV~isl~t~a~~~~~Fg~~Lilg~~~~~~~~~~~~~~~~Yts~~~V~~Dfg~~sp~YkAA~~~f~Q~ 73 (426) ++ . .--..++...+-.. ...|+..++...+....... ...+....++-.++. 
....|....+.|.-+ T Consensus 153 ~~~~~y~gt~~~~t~~v~~~~~~~-~~~~~~~~~~~~~~~v~~~~----~~~~~~~~~~v~~~~-~~~~~~~~~~~~~~s 226 (648) T protein:vir:10 153 QKHPDFSVTRETFTFPRKFTTPTV-LVKRGSTLFFVDRSIVNAAL----AAGPAFQTALINLLK-EQLQPTDVVQIFDAS 226 (648) T ss_pred cCCCcccccceecccccccccccc-ccccccceeecCccchhhhh----ccCccchhhhhhchh-hhhhhhhhheecccc Confidence 11 0 11111111111111 12344333322211100000 000000111111100 000011111111100 Q ss_pred Cc-----eeeeeecccc--------cc----ccccc-------cccceecccee-eccc-----c----cc------cch Q lcl|Aclame:pro 74 AE-----QWRVMVLEAT--------EV----TEEEL-------SDGDTIDKVPI-LGNH-----E----VE------SPD 113 (426) Q Consensus 74 ~~-----~~~~~v~~~t--------~v----~~~~~-------~~~~tv~~~~~-s~~~-----~----~~------~ta 113 (426) .. .....+.++. .. +..+. .+...+..+|. .... . .+ .+. T Consensus 227 ~~~~~d~~~~~~~~~a~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~~tp~~~~~~~~~~~~~~~~~~~~~v~~~~~ 306 (648) T protein:vir:10 227 DTNPVDIPLGLFVYEVLYGGLFGFTKSRLVKTSFGTVDDLLSNPLLFNLSATPFFDGSDYQDYTSLSDPANWFAKDAYTI 306 (648) T ss_pred cccccccccccccccccchhhhcCCcchhhhhhhccccccccccceecccccccccccceeeeeccccccceeeeeccch Confidence 00 0000000000 00 00000 00000000000 0000 0 00 000 Q ss_pred hhhhhhcccccccccceeeeeeccccceeeechhheeeecccccch-hhhhhhccccceeeccc-------ccc----hh Q lcl|Aclame:pro 114 GDIEFTTDDDPDVEDFDAEIVINSATGDVATSEDSIELTYFHADWS-QLDEFPSDVNNFAVADR-------RFD----LK 181 (426) Q Consensus 114 ~~i~~~~~~~~~~t~~~~~~~~~~~~g~~t~~~~~~~~~~~~~d~~-~~~~~~s~~~~~~la~~-------~~~----~~ 181 (426) ..+.. ....+...+...-...-..+|...++-..-..+.+..||+ .+..+...-....++.. .+| -. 
T Consensus 307 ~~l~~-~~~~p~~~~~~~t~L~GGtdG~~p~s~~~~~~~~~~~d~~d~l~~~~~~~~~~ivp~~~~~~~~~~~~~lt~~q 385 (648) T protein:vir:10 307 NHLVD-TTINPHILATRIFSLSGGTNGDDGTGYYQTAVSNYINIWSQGLATLEEEEVNFVIPAYKFTNVTQLNDRLTIFK 385 (648) T ss_pred hhccc-ccccCcccccccceecccccCCCcccccccccccchhhHHHHhhhccCCCceEEEeecccccccccccccCCcc Confidence 00000 0000010000000011122233322222223333556774 44433321111112100 000 01 Q ss_pred hhHhHhhhhhhhhhc---------ceEEEEEecccc-cccchhhHHHHHHHh-------------hccCcceEEEEecC- Q lcl|Aclame:pro 182 GVGVLDETHSWASDE---------DMGMIANGVNVD-DYDSVDEAMDVAHEV-------------AGYVPSGDLMMIVD- 237 (426) Q Consensus 182 ~~~~~~~~~~wa~~~---------~kl~~~~~~d~~-~~~~~~~~~~~a~~~-------------a~~~~rt~~~~~~~- 237 (426) .+ .+++-.|+.+- .-+++....-+. .....+.......+. +...+-....++.. T Consensus 386 ~i--~a~a~shv~~~s~~~~~~~r~~~~~~vg~~~~es~~~se~~~~~~~~~~~~a~~~~~d~~~~~~~~~~~~~~~~~G 463 (648) T protein:vir:10 386 GI--ASTFLSHVQTMSQVNRRKARVGVFGLPAPSPNESVTASEYLYNRNILNTISAMFGGTDRAQAVVFPFYSNVFNDEG 463 (648) T ss_pred ch--HHHHHHHHHHhhhccccccccCeEEEeCCCCchhHHHHHHHhhhhcccccceeeeecCCceEEeecccceeECCCC Confidence 11 12222233211 123432111100 000000000000000 00000001111111 Q ss_pred ----CCccchhHHHHHHHhhhcccccceeecccccceeeccccccccc--cccchhHHHHhhcC-ccEEEEEcC-----C Q lcl|Aclame:pro 238 ----ASDDDLAAYQLGKFAVSEPWYNPLWNELPAGETVSKNVGDPEEQ--GTFEGGDEAEGEGP-VNVLIDVSD-----A 305 (426) Q Consensus 238 ----~~~~~~~aa~~g~~~~~~p~~~~~~~~~~~~~~~~~~k~~~gv~--~~~~~~~~~~~~~~-~N~~~~~~g-----~ 305 (426) -...+.++++.|..+...|..++|+|...+. ++. ..++.+|+..|... .+.++...+ . 
T Consensus 464 ~~~~~p~~~~Aa~VAGl~a~l~~~~s~T~k~i~~~----------~id~~~~~t~~qld~L~~~Gv~~ie~~~~~~~~~~ 533 (648) T protein:vir:10 464 KVELLGGEFFASYVAGMHANREPQDSITFLPISGI----------GAEPLYNWTYTQKDDLISNRVLFVEKVKTSFGGIV 533 (648) T ss_pred cEEecchhhHHHHHHhhhhccccccCcccceeecc----------ccccccCCCHHHHHHHhcCCcEEEEEecCCcceee Confidence 1223447888899888888777666544322 221 14667777776544 355554433 3 Q ss_pred EEEeeceeecCcccCc--ceeehhhhHHHHHHHHHHHHHHHHhcCCCCcccHHHHHHHHHHHHHHHHHhhcCCCccccce Q lcl|Aclame:pro 306 NRVSNAVTTAGADSDT--SFFDIRRTKVYTAEMLELDLESLQVSDDDVPFTEDGQAMIEDAIKGTMSGLTGSVGQPLAEY 383 (426) Q Consensus 306 ~~~~~~~t~~G~~~sg--~~iD~i~g~dwl~~~iq~~l~~ll~~~~KIp~td~Gi~~i~~~v~~~l~~~v~~~g~~~~~y 383 (426) ..+.+|.|+.+...+- +-|=++|-.|++...++..+.+.++-. |=++.....|++.|..-|.+-.+ ++.++ .| T Consensus 534 ~rvv~gITT~~~~~~~~~~eisv~ri~D~l~~~vr~~l~~~fIG~---~n~~~~~~~ik~~i~~~L~~~~~-~~~I~-~y 608 (648) T protein:vir:10 534 YRIHHNPTTWLGPVTQGFQEFVLRRIDDFLQSYVYKNLQEQFIGR---KSYGRKTENDIKVYTEALLSNLV-GKQIV-AY 608 (648) T ss_pred EEEeccceeecCCCCcceeeeeeeehhhHHHHHHHHHHhhhcCcc---cccHHHHHHHHHHHHHHHhhHhh-cCccc-Cc Confidence 4577899998864322 367889999999999999999999864 44667888888888887655444 23344 34 Q ss_pred e-EecCcccCcHHHHHhhcCCCeEEEEEEcccEEEEEEEEEEeC Q lcl|Aclame:pro 384 E-VDVPEWDDDDVDRVNRNWGGIDLDARLAQRAHTFSLGLNVSV 426 (426) Q Consensus 384 ~-~~~p~~~~~~~dra~R~~~~i~~~~~laGAIh~v~I~g~v~v 426 (426) + .++ ....+ ..|. 
-|.|+++..-+||++.|+..|+- T Consensus 609 ~~~~v----~~~~~-~~vv--~V~~~v~Pv~~i~~I~vti~it~ 645 (648) T protein:vir:10 609 KDVKV----TSNED-KTVY--YVEFFYQPVTEIKFILVTMKVTF 645 (648) T ss_pred ccceE----EEEec-CCEE--EEEEEEEecceeeEEEEEEEEEe Confidence 3 111 00111 1232 59999999999999999999998 No 56 >protein:vir:106427 Length: 679 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944106;genbank:gi:38640150;genbank:GeneID:2658188 Probab=83.37 E-value=0.069 Score=26.95 Aligned_cols=364 Identities=12% Similarity=-0.020 Sum_probs=142.6 Q ss_pred CCCceEEEEEeecccccc----------------ccCccceEEEeccccc-----cccc----------chhhhheeecH Q lcl|Aclame:pro 1 MPKQIVEIELTAEIADRP----------------QETFTDAAIVGTAEEE-----PPDA----------EFGEVNQYSTS 49 (426) Q Consensus 1 mp~~iVnV~isl~t~a~~----------------~~~Fg~~Lilg~~~~~-----~~~~----------~~~~~~~Yts~ 49 (426) ..-+-+.+.+...+.... ....+.++........ .+.. ...|....+.. T Consensus 235 ~~gn~i~v~~va~~~~~~~~~~~a~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vvv~~~g~~~~~~~~~~~ 314 (679) T protein:vir:10 235 TYGDNIKVLMIAYKDYYKFNEAGKIVSVNTINPKVFPTGLDYGNVTPSSYLEFGPQNESQFAFIVFNNGVAVESKILSTK 314 (679) T ss_pred ccCCcceEEEEeecccccccccccccccccccccccccccccccceeeeecccccccccceeeEEecccccccceeeecc Confidence 111111111111111100 0000000000000000 0000 00000000000 Q ss_pred HHHHhccCCCCHHHHHHHHHHhcCCceeeeeeccccccccccccccceeccceeeccccc--ccchhhhhhhcccccccc Q lcl|Aclame:pro 50 TSVGDDYGEDSDVYTASEAIEEMGAEQWRVMVLEATEVTEEELSDGDTIDKVPILGNHEV--ESPDGDIEFTTDDDPDVE 127 (426) Q Consensus 50 ~~V~~Dfg~~sp~YkAA~~~f~Q~~~~~~~~v~~~t~v~~~~~~~~~tv~~~~~s~~~~~--~~ta~~i~~~~~~~~~~t 127 (426) . .| +.......-...++..+....-........+..++ . ++..+.... ..+..+.. 
T Consensus 315 ~---~~-~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~-----~---~~~~gg~~~~~~~~~~~~~---------- 372 (679) T protein:vir:10 315 P---GD-RDIYGTSIYINEYFGNGYSSFVQGVAESWPVGYTG-----V---LAFGGGQSSNTDISAAEFM---------- 372 (679) T ss_pred c---cc-ccccchhhhhhhhhcCcccceeeeccccccccccc-----e---eeccCCccCCCccchhhhh---------- Confidence 0 00 00000000001111111000000000000000000 0 000000000 00000000 Q ss_pred cceeeeeeccccceeeechhheeeecccccchhhhhhhccccceeecccccch--hhhHhHhhhhhhhhhcc-eEEEEEe Q lcl|Aclame:pro 128 DFDAEIVINSATGDVATSEDSIELTYFHADWSQLDEFPSDVNNFAVADRRFDL--KGVGVLDETHSWASDED-MGMIANG 204 (426) Q Consensus 128 ~~~~~~~~~~~~g~~t~~~~~~~~~~~~~d~~~~~~~~s~~~~~~la~~~~~~--~~~~~~~~~~~wa~~~~-kl~~~~~ 204 (426) .. .+...+.+. ..++...++...... ........+..-++..+ ++.+... T Consensus 373 ----------------~~----------~~~~~~~~~-~~~~~l~~p~~~~~~~~~~~~v~~~l~~~~~~~~~~~ai~d~ 425 (679) T protein:vir:10 373 ----------------KG----------WDMFADREH-TDVNLFIAGAVAGEGAQIASTVQKAVVAIADERRDCLVLISP 425 (679) T ss_pred ----------------hh----------hhhhhcccc-cccceEEecCCCCCchhhhHHHHHHHHHHHHhhCCeEEEEec Confidence 00 000000000 001111111100000 00011112222233322 3333321 Q ss_pred ccccccc---c--hhhHHHHHHHh---------hccCcc-eEEEEec-------CCC---ccchhHHHHHHHhhhc---- Q lcl|Aclame:pro 205 VNVDDYD---S--VDEAMDVAHEV---------AGYVPS-GDLMMIV-------DAS---DDDLAAYQLGKFAVSE---- 255 (426) Q Consensus 205 ~d~~~~~---~--~~~~~~~a~~~---------a~~~~r-t~~~~~~-------~~~---~~~~~aa~~g~~~~~~---- 255 (426) ......+ . .+.....+... ..+..+ -..+|+. ... 
..-|.++++|.|+-.+ T Consensus 426 p~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~sg~vAGl~Ar~D~~~g 505 (679) T protein:vir:10 426 PREYMINQPAASVVRKLVDWRRGVNQAGISLDDNMNIGTTYASVDGNYKYQYDKYNDVNRWIPLAADIAGLCARTDTVGQ 505 (679) T ss_pred cccccccccccchHHHHHHHHhhcccccchhhhhhccCcceEEEEccceeeecccCCceEEechHHHHHHHHHHhhccCC Confidence 1111100 0 01010101000 000001 1112221 111 1235688888888776 Q ss_pred ccccceeecccccceeeccccccccccccchhHHHHhhcCc-cEEEEEcC-CEEEeeceeecCcccCcceeehhhhHHHH Q lcl|Aclame:pro 256 PWYNPLWNELPAGETVSKNVGDPEEQGTFEGGDEAEGEGPV-NVLIDVSD-ANRVSNAVTTAGADSDTSFFDIRRTKVYT 333 (426) Q Consensus 256 p~~~~~~~~~~~~~~~~~~k~~~gv~~~~~~~~~~~~~~~~-N~~~~~~g-~~~~~~~~t~~G~~~sg~~iD~i~g~dwl 333 (426) ||..+..+........ .+..-.+...|...|+.+. |.++.+-| +..+|-..|.++....-.||=++|-.+|| T Consensus 506 ~~~sPan~~~~~i~g~------~~~~~~~~~~~~~~Ln~~gin~i~~~~g~G~~~wG~rT~~~~~s~~~~i~vrR~~~~i 579 (679) T protein:vir:10 506 PWQSPAGFNRGQIVNV------IKLAVDTRQAHRDEMYTNGINPIVGFAGQGYILYGDKTASQAPTPFDRINVRRLFNLL 579 (679) T ss_pred cEECcCCeeecccccc------ccceeecChhhHHhhhhCCceEEEEecCCeEEEEcccccCCCCcccceEehhhHHHHH Confidence 4444433322111101 1111124556777776544 88888754 67888888877765556799999999999 Q ss_pred HHHHHHHHHHHHhcCCCCcccHHHHHHHHHHHHHHHHHhhcCCCccccceeEecCcccCcHHHHHhhcCCCeEEEEEEcc Q lcl|Aclame:pro 334 AEMLELDLESLQVSDDDVPFTEDGQAMIEDAIKGTMSGLTGSVGQPLAEYEVDVPEWDDDDVDRVNRNWGGIDLDARLAQ 413 (426) Q Consensus 334 ~~~iq~~l~~ll~~~~KIp~td~Gi~~i~~~v~~~l~~~v~~~g~~~~~y~~~~p~~~~~~~dra~R~~~~i~~~~~laG 413 (426) +..|+..++..+-. |.+..=...|+..|+.-|.+-+++|. +-+|.+.......+++|+.+-++. 
+.+.+...- T Consensus 580 ~~si~~~~~~~v~e----pn~~~~~~~i~~~i~~fL~~l~~~ga--l~gf~v~~d~~~nt~~~i~~G~~~-~~i~~~p~~ 652 (679) T protein:vir:10 580 KKSISESAKYKLFE----LNDAFTRSSFRSEVGSYLDTIRSLGG--IYDFRVVCDESNNTPAVIDRNEFV-ATILIKPAR 652 (679) T ss_pred HHHHHHHHHHhccC----CCCHHHHHHHHHHHHHHHHHHHhCCc--eeeeEEEEcCCCCCHHHhhCCeEE-EEEEEEecC Confidence 99999998876642 66777778889999999988887653 335888887666678888887775 889999999 Q ss_pred cEEEEEEEEEEeC Q lcl|Aclame:pro 414 RAHTFSLGLNVSV 426 (426) Q Consensus 414 AIh~v~I~g~v~v 426 (426) .+++|.++..-+- T Consensus 653 pae~i~~~~~~~~ 665 (679) T protein:vir:10 653 SINYITLSFVATS 665 (679) T ss_pred CccEEEEEEEEee Confidence 9999999877765 No 57 >protein:vir:100829 Length: 607 # NCBI annotation: hypothetical protein # Family: family:all:2449 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164738;genbank:gi:56693151;genbank:GeneID:3197462 Probab=82.04 E-value=0.08 Score=26.59 Aligned_cols=398 Identities=12% Similarity=0.059 Sum_probs=166.8 Q ss_pred CC-----CceEEEEEee-ccccccccCccceEEEecccccccccchhhhheeecHHHHHhccCCCCHHHHHHHHHH---- Q lcl|Aclame:pro 1 MP-----KQIVEIELTA-EIADRPQETFTDAAIVGTAEEEPPDAEFGEVNQYSTSTSVGDDYGEDSDVYTASEAIE---- 70 (426) Q Consensus 1 mp-----~~iVnV~isl-~t~a~~~~~Fg~~Lilg~~~~~~~~~~~~~~~~Yts~~~V~~Dfg~~sp~YkAA~~~f---- 70 (426) .| ++=|-|.+.- ...+.+.-+.+.+.|||....-|| +++-++++.++...=||. 
-|.-.|.+++| T Consensus 15 ~~~~~~~~pgv~~~~~~~~~~~~~~~~~~~~~~iG~a~~G~~----~~~~~~~~~~~a~~~f~~-g~l~~a~~~a~~~~~ 89 (607) T protein:vir:10 15 YPLFYDSRPHVETNFDDSRLSNTASDSAKNIFMLGSATNGDP----TKVYEIRTSQQATKIFGS-GDLVDGIKLAFDPTG 89 (607) T ss_pred hCCCCccCCceEEEEecCcCcCCCCCCcceEEEEEEeCCCCC----ceEEEEcchhHHHHhhcC-cchHHHHHHhhcccc Confidence 33 4445566655 445567789999999999988776 467789999999999974 33556788888 Q ss_pred --hcCCceeeeeecccc---cccccccc-----ccc-------------------e------------------------ Q lcl|Aclame:pro 71 --EMGAEQWRVMVLEAT---EVTEEELS-----DGD-------------------T------------------------ 97 (426) Q Consensus 71 --~Q~~~~~~~~v~~~t---~v~~~~~~-----~~~-------------------t------------------------ 97 (426) .|++...+.++.+.. ..+.++.. ++. + T Consensus 90 ~~~~g~~~~~~~rv~~~~~a~~~~~~~~~~~~~~~~~~~~i~~~l~~~~~~~~~~~~~~~~d~~~~~~~n~g~~~~i~y~ 169 (607) T protein:vir:10 90 NSVTNGGTVYALRVDNAKQASLVKDGLTFTSSIFGTNANQVSVALDNDVFGVPRITVNYSPDNYERTYTNIGQMFSITYS 169 (607) T ss_pred CCccCCceEEEEeCCCccccceecccccccccccccCCCceEEEEEecCCCccceeEEeecccceeeeeeccceeecccC Confidence 466665555443221 11111110 000 0 Q ss_pred ---------ec----cceeeccccc-------------------ccchhhhhhhcccccccccceeeeeec--------- Q lcl|Aclame:pro 98 ---------ID----KVPILGNHEV-------------------ESPDGDIEFTTDDDPDVEDFDAEIVIN--------- 136 (426) Q Consensus 98 ---------v~----~~~~s~~~~~-------------------~~ta~~i~~~~~~~~~~t~~~~~~~~~--------- 136 (426) |. 
+.+......+ -.+..++++-+..-++ ..+..+.+ T Consensus 170 g~~~~a~~~v~~~~~g~~~~lt~~~~~~~~~~~~V~~~~l~~~~~~t~~~l~~din~~~~---~~A~~~g~~~i~tky~d 246 (607) T protein:vir:10 170 GKSASAGYTVSHDTDGKAILLTLGSGDSIDKLTNVATFDLTMSKYDTIAKLMQAISATPN---FSASVVGSPSVNTSYLD 246 (607) T ss_pred cccccccceeeecCCCceeEEEecCCCccceeeeeecccccccccchHHHHHHHhhcCCc---eEEEEecccceeeeccc Confidence 00 0000000000 0011111111000000 00000000 Q ss_pred cccceeeechh-----h---eeeecccccch----hhhh---hh----------------------ccccceeecccccc Q lcl|Aclame:pro 137 SATGDVATSED-----S---IELTYFHADWS----QLDE---FP----------------------SDVNNFAVADRRFD 179 (426) Q Consensus 137 ~~~g~~t~~~~-----~---~~~~~~~~d~~----~~~~---~~----------------------s~~~~~~la~~~~~ 179 (426) .....++.... . -...+.....+ .... .. .+.....|.....+ T Consensus 247 ~~~~~i~V~~~~~iv~a~~~D~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~a~~~LtGGtdG 326 (607) T protein:vir:10 247 EVTSPVDVKTAPAVVTAKIGDAISKLGYDPYVVVTQTSNNKPIVNGVSAGTGSATASVTTAPESFPANFDTAFLTGGSTG 326 (607) T ss_pred cccceeEEEEeeeeechhhhhhhhcccccceEEeeecccchhhhhhhhccccceeeeeeccccccccccceeeeeCCCCC Confidence 00000000000 0 00000000000 0000 00 00000001100000 Q ss_pred hhhhHhHhhhhhhhhhcceEEE-EEeccccccc------------------------chhhHHHHHHHhhccCcceEEEE Q lcl|Aclame:pro 180 LKGVGVLDETHSWASDEDMGMI-ANGVNVDDYD------------------------SVDEAMDVAHEVAGYVPSGDLMM 234 (426) Q Consensus 180 ~~~~~~~~~~~~wa~~~~kl~~-~~~~d~~~~~------------------------~~~~~~~~a~~~a~~~~rt~~~~ 234 (426) -.. ...+..-+-.+..+..++ ..+.+..... ...-.....+.+.-++.|-.++. 
T Consensus 327 ~~~-~ty~dal~aLe~~e~~~i~~~t~d~ai~~~l~a~vkr~~~~g~~~~aVlg~~~~~t~~~~~t~a~~~N~ervv~V~ 405 (607) T protein:vir:10 327 DVP-VSWADKFNGAIGNNVYYIIPLTSEENIHAELQAFIDEQHVLGYNYHAFVGGGFAEPLEQILSRQVNINDSRFGLVG 405 (607) T ss_pred Cch-hhHHHHHHHHhhcCceEEEecCCCHHHHHHHHHHHHHHHhCCCcEEEEecCCCCCCHHHHHHHHHhhCCCcEEEEe Confidence 000 000110011122222222 1111111000 00001111122222232322221 Q ss_pred ec----CC--Ccc----chhHHHHHHHhhhcccccceeecccccceeeccccccccccccchhHHHHhhcCc-cEEEEEc Q lcl|Aclame:pro 235 IV----DA--SDD----DLAAYQLGKFAVSEPWYNPLWNELPAGETVSKNVGDPEEQGTFEGGDEAEGEGPV-NVLIDVS 303 (426) Q Consensus 235 ~~----~~--~~~----~~~aa~~g~~~~~~p~~~~~~~~~~~~~~~~~~k~~~gv~~~~~~~~~~~~~~~~-N~~~~~~ 303 (426) .. .. ... ...+++.|..+..+|-.+++++... ..++...+..+|+..+..+. .++.... T Consensus 406 ~~~~~~~~G~~~~~~~~~~Aa~vAGl~Ag~~~~~SlT~k~i~----------~~~v~~~lt~~e~e~ai~~Gv~~l~~~~ 475 (607) T protein:vir:10 406 QSGHVQEGGESVHVPAYLMAAYVGGLSSSLGVAVPITNKKLA----------LVDLDQNFSGDDLNTLNQNGVIGIEHLV 475 (607) T ss_pred cCeeEeeCCcceeccHHHHHHHHHHHHhcCccccCcccceec----------cccccccCCHHHHHHHHhCCeEEEEEcc Confidence 10 00 011 2345666666666655454443221 22344557777776655443 5554322 Q ss_pred -----CCEEEeeceeecCcccCcce--eehhhhHHHHHHHHHHHHHHHHhcCCCCcccHHHHHHHHHHHHHHHHHhhcCC Q lcl|Aclame:pro 304 -----DANRVSNAVTTAGADSDTSF--FDIRRTKVYTAEMLELDLESLQVSDDDVPFTEDGQAMIEDAIKGTMSGLTGSV 376 (426) Q Consensus 304 -----g~~~~~~~~t~~G~~~sg~~--iD~i~g~dwl~~~iq~~l~~ll~~~~KIp~td~Gi~~i~~~v~~~l~~~v~~~ 376 (426) +...+.++.++-+...+-.| |=+||..|.+.+.|+..+.+.++- |++.+ .....++..|...|..-..+. 
T Consensus 476 ~~~~~~~vrIv~~ItT~t~~~~~~~~~i~viRv~D~i~~dir~~~~~~yIG--k~nnd-~~~~~vk~~i~~~L~~~~l~~ 552 (607) T protein:vir:10 476 NRNATGGYYIVQDVSTNTVSSSHVDGSLYLGELTDFLFDNLRFVLRDTYIG--SNIRS-TSADDIKSTVASYLYSEMNND 552 (607) T ss_pred CccccceEEEeeeeeeccCCCCcchheeehhhhHHHHHHHHHHHHhhcCCc--ccCCc-chHHHHHHHHHHHHHHHHHHh Confidence 23567788888554333334 889999999999999888776652 33443 455678888887774322222 Q ss_pred CccccceeEecCcccCcHHHHHhhcCCCeEEEEEEcccEEEEEEEEEEeC Q lcl|Aclame:pro 377 GQPLAEYEVDVPEWDDDDVDRVNRNWGGIDLDARLAQRAHTFSLGLNVSV 426 (426) Q Consensus 377 g~~~~~y~~~~p~~~~~~~dra~R~~~~i~~~~~laGAIh~v~I~g~v~v 426 (426) ...+-.|. | ++.+.++.+.++ -++|.++..-+||++.+++.+.= T Consensus 553 ~gaI~df~---~--edv~v~~~~D~v-~v~~~v~Pv~~iekIyvtv~v~~ 596 (607) T protein:vir:10 553 DGLIVDFS---E--SDIVVTISGTVV-YIQFAVAPTQEIKNIVVSGTYSN 596 (607) T ss_pred cCceeCCC---c--cccEEeeCCCEE-EEEEEEEEcccceEEEEEEEEEE Confidence 11233442 1 111111111122 27899999999998887776655 No 58 >protein:vir:98263 Length: 664 # NCBI annotation: gp18 tail sheath monomer # Family: family:all:661 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239196;genbank:gi:66391671;genbank:GeneID:3416365 Probab=79.85 E-value=0.1 Score=26.06 Aligned_cols=411 Identities=11% Similarity=-0.008 Sum_probs=146.6 Q ss_pred CC-----CceEEEEEeeccccccccCc---cceE----E-EecccccccccchhhhheeecHHHHHh-ccCCCCHHHHHH Q lcl|Aclame:pro 1 MP-----KQIVEIELTAEIADRPQETF---TDAA----I-VGTAEEEPPDAEFGEVNQYSTSTSVGD-DYGEDSDVYTAS 66 (426) Q Consensus 1 mp-----~~iVnV~isl~t~a~~~~~F---g~~L----i-lg~~~~~~~~~~~~~~~~Yts~~~V~~-Dfg~~sp~YkAA 66 (426) -+ ..++.+.............+ +.++ . ++......+.. +..-.+...+..... 
.+|.+=++...+ T Consensus 163 ~~~~~~~~s~~~~s~g~a~a~~v~~v~~d~~~~~~~~~~a~~~i~~~~~~~-~~~~~~~~~~~a~~~G~~Gn~isv~i~s 241 (664) T protein:vir:98 163 IFLLVGTTEIVSQSSGVSASITIDGIESDSGITLLNLDIAKETIQGTSFQT-LTQKYQIPSVVALYPGELGSTVQVEIIS 241 (664) T ss_pred cceecccceeeeeecccceeeecccccccceeeccccceeeecccccccee-eeeccccceeeeeecccccceeeeeecc Confidence 00 01111111000000000000 0000 0 00000000000 000000000000000 122211112122 Q ss_pred HHHHhcCCce-ee---eeeccc-ccccccc----cccccee--cc-----ceeecccccccchhhhhhhcccccccccce Q lcl|Aclame:pro 67 EAIEEMGAEQ-WR---VMVLEA-TEVTEEE----LSDGDTI--DK-----VPILGNHEVESPDGDIEFTTDDDPDVEDFD 130 (426) Q Consensus 67 ~~~f~Q~~~~-~~---~~v~~~-t~v~~~~----~~~~~tv--~~-----~~~s~~~~~~~ta~~i~~~~~~~~~~t~~~ 130 (426) ...+.++... .. ...... ..+.... .....++ .+ ..++......+........ .......... T Consensus 242 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~ 320 (664) T protein:vir:98 242 KAAYDTGAMISGYPSGISVKNSGRSVMTYGPQTDNQYAFVVRRGGIVQESFIVSTDKTDKDIYGVNIYM-DDFFANGGSQ 320 (664) T ss_pred cccccCcceEeeccCceecccceeeeeeccccCccceeEEEecCCceeeeEEeecccCcccceeeeeec-hhheecccce Confidence 2222221100 00 000000 0000000 0000000 00 0000000000000000000 0000000000 Q ss_pred eeeeec-----cccceeeechhheeeeccc--ccchhhhhhhc----cccceeeccccc-chh-hhHhHhhhhhhhhhc- Q lcl|Aclame:pro 131 AEIVIN-----SATGDVATSEDSIELTYFH--ADWSQLDEFPS----DVNNFAVADRRF-DLK-GVGVLDETHSWASDE- 196 (426) Q Consensus 131 ~~~~~~-----~~~g~~t~~~~~~~~~~~~--~d~~~~~~~~s----~~~~~~la~~~~-~~~-~~~~~~~~~~wa~~~- 196 (426) ...... ..++.+..........+.. ..-.++..+.. .++....+.... ... .......+..-++.. 
T Consensus 321 ~~~~~~~~~~~~~~~~~~~~gg~~~~~~~g~~~~~tgl~~l~~~~~~~~~ll~~p~~~~~~~~~~~~v~~al~~~a~~~~ 400 (664) T protein:vir:98 321 YVFGTSMNWPKGFSGILEFGGGLSSNDTVGADELMTGWDMFADREALHVPLLIAGGCAGESVEIASTVQKHVISIGDERQ 400 (664) T ss_pred eeeeecccCCcccceeEeccCccccccccCchhHHHHHHhhhcccccccceEEecCCCCCcHHHHHHHHHHHHHHHHhcC Confidence 000000 0000000000000000000 00011111110 011111111000 000 001111222222322 Q ss_pred ceEEEEEe-----cccccccchhhHHHHHH----------HhhccCcce-EEE-------EecCCC---ccchhHHHHHH Q lcl|Aclame:pro 197 DMGMIANG-----VNVDDYDSVDEAMDVAH----------EVAGYVPSG-DLM-------MIVDAS---DDDLAAYQLGK 250 (426) Q Consensus 197 ~kl~~~~~-----~d~~~~~~~~~~~~~a~----------~~a~~~~rt-~~~-------~~~~~~---~~~~~aa~~g~ 250 (426) +++.+... .+...-...+....-.. ....++... ..+ ++.... ..-|.+.++|. T Consensus 401 ~~~a~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~l~~p~~~~~d~~~~~~~~~p~sg~~AGl 480 (664) T protein:vir:98 401 DCTVFVSPPRSLLVNIPLATAVDNIVEWRTGYKISGGTPVDNNLNVSSSYGFLDGNYKYQYDKYNDVNRWVPLAGDIAGL 480 (664) T ss_pred CeEEEEccccceeccCCccccHHHHHHHhhhccccccchhhhhcCCccceEEEEcCeEEEecccCCceEEechHHHHHHH Confidence 23322211 11000001111100000 000011111 112 211111 12367888888 Q ss_pred Hhhhc----ccccceeecccccceeeccccccccccccchhHHHHhhcCc-cEEEEEcC--CEEEeeceeecCcccCcce Q lcl|Aclame:pro 251 FAVSE----PWYNPLWNELPAGETVSKNVGDPEEQGTFEGGDEAEGEGPV-NVLIDVSD--ANRVSNAVTTAGADSDTSF 323 (426) Q Consensus 251 ~~~~~----p~~~~~~~~~~~~~~~~~~k~~~gv~~~~~~~~~~~~~~~~-N~~~~~~g--~~~~~~~~t~~G~~~sg~~ 323 (426) ++-.+ ||..+..+...+ .+...++...+...|...|+.+. 
|.++..-| +..+|-+.|.++....-.| T Consensus 481 ~A~~D~~~g~~~span~~~~~------i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~~G~~~wG~rT~~~~~s~~~~ 554 (664) T protein:vir:98 481 CVYTDSVANPWMSPAGYNRGQ------IRNCIKLAIEPRTAHRDAMYQVQINPVTGFAGGSGFVLYGDKTLTSVPSPFDR 554 (664) T ss_pred HHHhhhcCCcEECcCCceeee------eeccccceeecChhhHHHHHhCCCeEEEEeeCCCcEEEEcccccCCCCcccce Confidence 88776 443333222111 11111222334556666666544 87777644 5677777777665444568 Q ss_pred eehhhhHHHHHHHHHHHHHHHHhcCCCCcccHHHHHHHHHHHHHHHHHhhcCCCccccceeEecCcccCcHHHHHhhcCC Q lcl|Aclame:pro 324 FDIRRTKVYTAEMLELDLESLQVSDDDVPFTEDGQAMIEDAIKGTMSGLTGSVGQPLAEYEVDVPEWDDDDVDRVNRNWG 403 (426) Q Consensus 324 iD~i~g~dwl~~~iq~~l~~ll~~~~KIp~td~Gi~~i~~~v~~~l~~~v~~~g~~~~~y~~~~p~~~~~~~dra~R~~~ 403 (426) |=++|-.+||+..|+..++..+-. |.+..=...|+..|+.-|..-+++|. +-+|.+.......+++|+.+-++. T Consensus 555 i~vrR~~~~i~~si~~~~~~~v~e----pn~~~l~~~i~~~i~~~L~~l~~~ga--l~g~~V~~d~~~nt~~~i~~G~~~ 628 (664) T protein:vir:98 555 INVRRLFNMIKKDIGDNAKYKLFE----NNDDFTRASFRMDTGQYMTNIRALGG--CYDYRVICDTTNNTPDVIDRNEFV 628 (664) T ss_pred EeehhHHHHHHHHHHHHHHHhhcC----CCCHHHHHHHHHHHHHHHHHHHhcCc--eeeeEEEEcCCCCCHHHhhCCeEE Confidence 999999999999999998876643 67778888899999999999887553 445888888777788888887775 Q ss_pred CeEEEEEEcccEEEEEEEEEEeC Q lcl|Aclame:pro 404 GIDLDARLAQRAHTFSLGLNVSV 426 (426) Q Consensus 404 ~i~~~~~laGAIh~v~I~g~v~v 426 (426) +.+.+...-.++++.++..-+- T Consensus 629 -~~i~~~p~~pae~I~~~~~q~~ 650 (664) T protein:vir:98 629 -ATVYVKPPRSINYITLNFVATS 650 (664) T ss_pred -EEEEEEecCCcceEEEEEEEee Confidence 8899999999999999877765 No 59 >protein:vir:96586 Length: 587 # NCBI annotation: ORF011 # Family: family:all:2449 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238553;genbank:gi:66391266;genbank:GeneID:5130368 Probab=78.02 E-value=0.12 Score=25.66 Aligned_cols=388 Identities=14% Similarity=0.083 Sum_probs=148.1 Q ss_pred CCCceEEEEEeeccccccccCccceEEEec-ccccccccchhhhhe--eecHHH--HHhccCCCCHHHHHHHHHHhcCCc Q lcl|Aclame:pro 1 
MPKQIVEIELTAEIADRPQETFTDAAIVGT-AEEEPPDAEFGEVNQ--YSTSTS--VGDDYGEDSDVYTASEAIEEMGAE 75 (426) Q Consensus 1 mp~~iVnV~isl~t~a~~~~~Fg~~Lilg~-~~~~~~~~~~~~~~~--Yts~~~--V~~Dfg~~sp~YkAA~~~f~Q~~~ 75 (426) -+-|-|.|.+.-++..-.. .|-...-.+. +.+.+. +..+.. |+.-++ +-.=|+ ++..|.|..+.+..+++ T Consensus 114 ~~~n~i~v~~~~~~~~~~~-~~~~~~~~~~~~~~~~n---~G~v~~i~y~g~~~~a~~~~~~-~~~~~~A~~l~l~gg~~ 188 (587) T protein:vir:96 114 SVSNDIQVALEKNTITDSL-RLRVVFQKDNYQEVFDN---LGNIFSINYKGEGEKATFSVEK-DKETQEAKRLVLKVDEK 188 (587) T ss_pred CCCceEEEEEEeccCCCcc-ceEEEEecCCceeeccc---cCceEEEEecccccceeEeecc-CcccceeeeeEEEecCc Confidence 3345555666433322221 1211000000 000000 001110 221111 111121 33334444444444444 Q ss_pred eeeeeecccccc--ccc-----------ccccc-ceeccceeecccccccch------------hhhh-----------h Q lcl|Aclame:pro 76 QWRVMVLEATEV--TEE-----------ELSDG-DTIDKVPILGNHEVESPD------------GDIE-----------F 118 (426) Q Consensus 76 ~~~~~v~~~t~v--~~~-----------~~~~~-~tv~~~~~s~~~~~~~ta------------~~i~-----------~ 118 (426) ....+.+..... +.. .+.+. ..-+.+.++......+.. ..+. . T Consensus 189 ~v~~yrl~~g~~~~~~~~~~~~~~~~~~tAky~g~~~n~~~v~v~d~~~~~~~k~~~~y~~t~~~di~~~~~~~~~~~~~ 268 (587) T protein:vir:96 189 EVKAYELNGGAYSFTNEIITDINELPDFEAKLSPFGDKNLESRKLDEATDVDIKGKAVYVKAVFGDIENQTQYNQYVKFE 268 (587) T ss_pred eEEEEEeCCCchhhhhhhhhhhccccceEEEeecccCceeEEEeeccccccccceEEEeehhhhhhhhhhhccccceeec Confidence 433333211100 000 00000 000000000000000000 0000 0 Q ss_pred hcccccc------cc--cceeeeeeccccce--------eeechhheeeecccccch-hhhhhhc-cccceeecccccch Q lcl|Aclame:pro 119 TTDDDPD------VE--DFDAEIVINSATGD--------VATSEDSIELTYFHADWS-QLDEFPS-DVNNFAVADRRFDL 180 (426) Q Consensus 119 ~~~~~~~------~t--~~~~~~~~~~~~g~--------~t~~~~~~~~~~~~~d~~-~~~~~~s-~~~~~~la~~~~~~ 180 (426) ++..... .+ ............+. ++.+.+... ..+|. .+..+.. ..+.....+.. 
T Consensus 269 ~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~aLtGG~dG~~----~~~y~~~l~ale~~~~~~i~~~t~d--- 341 (587) T protein:vir:96 269 QLPEQASEPSDVEVHAETESATVTATSKPKAIEPFELTKLSGGTNGEP----PTSWSAKLEKFKNEGGYYIVPLTDR--- 341 (587) T ss_pred cccchhhhhhcccccccccceeeeecccccccccccceeeecCCCCCC----cccHHHHHHHHhhCCcEEEEecCCC--- Confidence 0000000 00 00000000000000 110000000 01121 2222221 11111111111 Q ss_pred hhhHhHhhhhhhh---hhcceEEEEEecccccccchhhHHHHHHHhhccCcceEEEEecCC-------C----ccchhHH Q lcl|Aclame:pro 181 KGVGVLDETHSWA---SDEDMGMIANGVNVDDYDSVDEAMDVAHEVAGYVPSGDLMMIVDA-------S----DDDLAAY 246 (426) Q Consensus 181 ~~~~~~~~~~~wa---~~~~kl~~~~~~d~~~~~~~~~~~~~a~~~a~~~~rt~~~~~~~~-------~----~~~~~aa 246 (426) ......+.+|+ ..+.+-..+...... .........+...-++.|-.++.+... . ..+..++ T Consensus 342 --~ai~~~l~a~vk~~r~~gk~~~aVlg~~~---~~~~~~~~~~a~~~n~e~vi~v~~~~~~~~~~~~~~~~~~~~~aa~ 416 (587) T protein:vir:96 342 --QSVHSEVATFVKNRSDAGEPMRAIVGGGT---SETKEKLFGRQAILNNPRVALVANSGKFVMGNGRILQAPAYMVASA 416 (587) T ss_pred --HHHHHHHHHHHHHHHhCCCeEEEEecCCC---CCCHHHHHHHHhhcCCCcEEEEecceEEecCCCceeeechhhHHHH Confidence 11123344454 233333333222211 111111222333334434333322110 0 1124567 Q ss_pred HHHHHhhhcccccceeecccccceeeccccccccccccchhHHHHhhcC-ccEEEEEcCCE----EEeeceeecCcccCc Q lcl|Aclame:pro 247 QLGKFAVSEPWYNPLWNELPAGETVSKNVGDPEEQGTFEGGDEAEGEGP-VNVLIDVSDAN----RVSNAVTTAGADSDT 321 (426) Q Consensus 247 ~~g~~~~~~p~~~~~~~~~~~~~~~~~~k~~~gv~~~~~~~~~~~~~~~-~N~~~~~~g~~----~~~~~~t~~G~~~sg 321 (426) ++|..+..++..+++++... ..++...++.+|+..+..+ .++++...+.. 
.+.++.++-....+- T Consensus 417 vAG~~Ag~~~~~S~T~~~~~----------~~~v~~~~t~~e~~~~i~~G~~~l~~~~~~~~~v~~~vnsitT~t~~~~~ 486 (587) T protein:vir:96 417 VAGLVSGLDIGESITFKPLF----------VNSLDKVYESEELDELNENGIITIEFVRNRMTTMFRIVDDVTTFPDKNDP 486 (587) T ss_pred HHHHHhcCccccCccceeee----------cccccccCCHHHHHHHHhCCeEEEEEecCCcEEEEEeeccceecCCCCCc Confidence 77777777665555544221 2234455677776655433 36665544432 233566664433222 Q ss_pred c--eeehhhhHHHHHHHHHHHHHHHHhcCCCCcccHHHHHHHHHHHHHHHHHhhcCCCccccceeEecCcccCcHHHHHh Q lcl|Aclame:pro 322 S--FFDIRRTKVYTAEMLELDLESLQVSDDDVPFTEDGQAMIEDAIKGTMSGLTGSVGQPLAEYEVDVPEWDDDDVDRVN 399 (426) Q Consensus 322 ~--~iD~i~g~dwl~~~iq~~l~~ll~~~~KIp~td~Gi~~i~~~v~~~l~~~v~~~g~~~~~y~~~~p~~~~~~~dra~ 399 (426) . .|=+||..|.+...|+..+.+.+.- | |=.+.|-..|.+.|.+.|++..+.+- +.+|+. + +.+.++.+ T Consensus 487 ~~~~i~virv~D~i~~di~~~~~~~yiG--k-~nn~~~r~~v~~~i~~~L~~l~~~g~--I~~~~~--~---dv~v~~~~ 556 (587) T protein:vir:96 487 VKSEMALGEANDFLVSELKILLEEQYIG--T-RTINTSASQIKDFVQSYLGRKKRDNE--IQDFPP--E---DVQVIIEG 556 (587) T ss_pred hhhhhhhHHHHHHHHHHHHHHHHhcCCc--c-ccCHHHHHHHHHHHHHHHHHHHhCCc--ccCCCc--c---ceEEEecC Confidence 2 4779999999999999888776654 3 34667999999999999999876442 334432 1 11111111 Q ss_pred hcCCCeEEEEEEcccEEEEEEEEEEeC Q lcl|Aclame:pro 400 RNWGGIDLDARLAQRAHTFSLGLNVSV 426 (426) Q Consensus 400 R~~~~i~~~~~laGAIh~v~I~g~v~v 426 (426) .+. 
-++|.++..-+++++.+++++.= T Consensus 557 D~~-~v~~~v~Pv~~mekIy~tv~~~~ 582 (587) T protein:vir:96 557 NEA-RISLTIFPIRALKKISVSLVYRQ 582 (587) T ss_pred CEE-EEEEEEEEcccceEEEEEEEEEe Confidence 111 37889999999999999888866 No 60 >protein:vir:3788 Length: 376 # NCBI annotation: tail sheath # Family: family:all:669 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536828;genbank:gi:17981837;genbank:GeneID:929216 Probab=75.55 E-value=0.15 Score=25.17 Aligned_cols=342 Identities=12% Similarity=0.026 Sum_probs=155.0 Q ss_pred CCC--ceEEEEEeeccccccccCccceEEEecc-cccccccchhhhheeecHHHHHhccCCCCHHHHHHHHHHhcCCcee Q lcl|Aclame:pro 1 MPK--QIVEIELTAEIADRPQETFTDAAIVGTA-EEEPPDAEFGEVNQYSTSTSVGDDYGEDSDVYTASEAIEEMGAEQW 77 (426) Q Consensus 1 mp~--~iVnV~isl~t~a~~~~~Fg~~Lilg~~-~~~~~~~~~~~~~~Yts~~~V~~Dfg~~sp~YkAA~~~f~Q~~~~~ 77 (426) |+- .|=+++.-..+...-+|- .||||.+ .++.... .+..-|+++.+ ||..+.+.|.=.++|.-.+ T Consensus 1 ~~~~v~vn~~n~~~g~~~~~er~---~Lfig~~~~~~~~~~---~~~~~sdld~~---lg~~~~~lk~~v~aa~~na--- 68 (376) T protein:vir:37 1 MFPSVQINALNQLSGETKEIERH---ALFVGVGTTNQGKLL---ALTPDSDFDKV---FGETDTDLKKQVRAAMLNA--- 68 (376) T ss_pred CCCeEEEecccccCCCcccccce---EEeecccccccccee---eecCccchHhh---hCCCchHHHHHHHHHHhCC--- Confidence 883 444455545555555664 6999865 3332211 22333555444 5777777776555553221 Q ss_pred eeeeccccccccccccccceeccceeecccccccchhhhhhhcccccccccceeeeeeccccceeeechhheeeeccccc Q lcl|Aclame:pro 78 RVMVLEATEVTEEELSDGDTIDKVPILGNHEVESPDGDIEFTTDDDPDVEDFDAEIVINSATGDVATSEDSIELTYFHAD 157 (426) Q Consensus 78 ~~~v~~~t~v~~~~~~~~~tv~~~~~s~~~~~~~ta~~i~~~~~~~~~~t~~~~~~~~~~~~g~~t~~~~~~~~~~~~~d 157 (426) |+.+...+-..+.++ ....+.+......-.+++..++... 
++ ...+ T Consensus 69 -------------G~~~~~~~~~~~~~~--------~~~~~Av~~a~~~~s~E~V~v~~pv----~t---------~~a~ 114 (376) T protein:vir:37 69 -------------GQNWFAHVYIAQEDG--------YDFVECVKKANQTASFEYCVNTRYL----GV---------DKAS 114 (376) T ss_pred -------------CCcEEEEEEeecCCc--------hHHHHHHHHhhhhcCceEEEEeccc----cc---------cHHH Confidence 111111111000000 0011111111111111111111000 00 0000 Q ss_pred chhhhhhhccccceeecccccchhhhHhHhhhhhhhhhc--ceEE-EEEecccc--cccc---hhhHHHHHHHhhccC-c Q lcl|Aclame:pro 158 WSQLDEFPSDVNNFAVADRRFDLKGVGVLDETHSWASDE--DMGM-IANGVNVD--DYDS---VDEAMDVAHEVAGYV-P 228 (426) Q Consensus 158 ~~~~~~~~s~~~~~~la~~~~~~~~~~~~~~~~~wa~~~--~kl~-~~~~~d~~--~~~~---~~~~~~~a~~~a~~~-~ 228 (426) | -+. ...+.....+ +-+| +......+ .... .+.......+..+.. + T Consensus 115 i---------------------~aa----~~~a~el~~~~~Rpv~file~r~~~~~~~~~e~w~~y~~~~~al~~gia~~ 169 (376) T protein:vir:37 115 I---------------------GKL----QECYAELLAKFGRRTFFIQAVQGINHDQSDGETWDQYVQKLTTLQQTIVAD 169 (376) T ss_pred H---------------------HHH----HHHHHHHHHhcCCeEEEEEeccCcCcccccccCHHHHHHHHHHhhcccccc Confidence 1 011 1111111111 2222 22222111 1011 111112222222221 2 Q ss_pred ce--EEEEecCCCccchhHHHHHHHhhh------cccccceeecccccceeecccccc--ccccccchhHHHHhhcCc-c Q lcl|Aclame:pro 229 SG--DLMMIVDASDDDLAAYQLGKFAVS------EPWYNPLWNELPAGETVSKNVGDP--EEQGTFEGGDEAEGEGPV-N 297 (426) Q Consensus 229 rt--~~~~~~~~~~~~~~aa~~g~~~~~------~p~~~~~~~~~~~~~~~~~~k~~~--gv~~~~~~~~~~~~~~~~-N 297 (426) +. .+..|. ...+.++||++.. +|.+- +.++... .+ ....| +....+....+.+|+.+. 
- T Consensus 170 ~V~~V~~~~g-----n~~G~~aGRl~~aaVsVadspgRV---~tG~l~g-l~-~~~lp~d~~~~~l~~a~l~aLd~agy~ 239 (376) T protein:vir:37 170 HVCLVPLLFG-----NETGVLAGRLANRAVTVADSPARV---QTGALVS-LG-SANKPLDKDRNELTLAHLKSLETARYS 239 (376) T ss_pred cceeeeeehh-----hhHHHHHHHHhhcccchhhCccce---ecccccc-cc-ccccccCcCcccCCHHHHHHHHhCCCe Confidence 22 222222 2356778886422 34332 1111100 00 00111 111346677788887554 8 Q ss_pred EEEEEcC--CEEEeeceeecCcccCcceeehhhhHHHHHHHHHHHHHHHHhcCCCCcccHHHHHHHHHHHHHHHHHhhcC Q lcl|Aclame:pro 298 VLIDVSD--ANRVSNAVTTAGADSDTSFFDIRRTKVYTAEMLELDLESLQVSDDDVPFTEDGQAMIEDAIKGTMSGLTGS 375 (426) Q Consensus 298 ~~~~~~g--~~~~~~~~t~~G~~~sg~~iD~i~g~dwl~~~iq~~l~~ll~~~~KIp~td~Gi~~i~~~v~~~l~~~v~~ 375 (426) ++..|.| +.|..++.++.--.+|.+||-.+|=.|=..-+++......+. ....-=+..+|+..++-+...|++..+. T Consensus 240 vp~~Y~gy~G~Y~~d~~tl~~~gsDY~~ie~~RVvdKa~R~vR~~ai~~i~-D~~lnst~~sia~~~~yi~~pLr~M~~s 318 (376) T protein:vir:37 240 VPMWYPDYDGYYWADGRTLDVEGGDYQVIENLRVVDKVARKVRLLAIGKIA-DRSFNSTTSSTEYHKNYFAKPLRDMSKS 318 (376) T ss_pred EEEeeCCCCceEEeCceEeccCCCChhhhhhhhHHHHHHHHHHHHHHHHhC-CcccCcchhhHHHHHHHHHHHHHHHHhc Confidence 8888877 899999999988888999999999988888777766544443 2334447788888898899999887664 Q ss_pred ---CCccccceeEecCcccCcHH-HHHhhcCCCeEEEEEEcccEEEEEEEEEEeC Q lcl|Aclame:pro 376 ---VGQPLAEYEVDVPEWDDDDV-DRVNRNWGGIDLDARLAQRAHTFSLGLNVSV 426 (426) Q Consensus 376 ---~g~~~~~y~~~~p~~~~~~~-dra~R~~~~i~~~~~laGAIh~v~I~g~v~v 426 (426) +|..+++ .|..|+-.++.. -....+. 
.|.+..+.=|--..+.++.-|-+ T Consensus 319 ~~i~g~~fpG-eI~~p~d~Di~i~w~s~~~V-~I~~~v~P~~~pk~Itv~I~Ldl 371 (376) T protein:vir:37 319 ATINGKDFPG-ECMPPKDDAITIVWQSKTKV-TIYIKVRPYDCPKEITANIFLDL 371 (376) T ss_pred chhccccccc-eeecCCCCCceEEeeccceE-EEEEEEEeccCCceEEEEEEeec Confidence 3444455 578888766632 1222222 14443333333334444444444 No 61 >protein:vir:78782 Length: 370 # NCBI annotation: putative tail sheath protein # Family: family:all:669 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285652;genbank:gi:148727158;genbank:GeneID:5220132 Probab=74.20 E-value=0.16 Score=24.93 Aligned_cols=341 Identities=13% Similarity=0.043 Sum_probs=154.3 Q ss_pred CCC--ceEEEEEeeccccccccCccceEEEeccc-ccccccchhhhheeecHHHHHhccCCCCHHHHHHHHHHhcCCcee Q lcl|Aclame:pro 1 MPK--QIVEIELTAEIADRPQETFTDAAIVGTAE-EEPPDAEFGEVNQYSTSTSVGDDYGEDSDVYTASEAIEEMGAEQW 77 (426) Q Consensus 1 mp~--~iVnV~isl~t~a~~~~~Fg~~Lilg~~~-~~~~~~~~~~~~~Yts~~~V~~Dfg~~sp~YkAA~~~f~Q~~~~~ 77 (426) ||- .|=+++.-..+...-+|- .||||... ++.. .-.+..-|++++ =||..+.+.|+=.++|.-++. 
T Consensus 1 ~~~~v~vn~~n~~~g~~~~~er~---~lfig~~~~~~g~---~~~~~~~sdld~---~l~~~ds~lk~~v~aa~~naG-- 69 (370) T protein:vir:78 1 MWPYVQIYNLNQMQGPVTEVERH---LLFIGSAASNTGK---LLSLNAQSDFDQ---LLGAADSELKANLLAARDNAG-- 69 (370) T ss_pred CCceEEEeeccccCCCcCcccee---EEEEecccccccc---eEeecCccCHHH---hcCCcChhHHHHHHHHHhCCC-- Confidence 984 444455545555555663 69998653 3222 112333444444 467777777766666643322 Q ss_pred eeeeccccccccccccccceeccceeecccccccchhhhhhhcccccccccceeeeeeccccceeeechhheeeeccccc Q lcl|Aclame:pro 78 RVMVLEATEVTEEELSDGDTIDKVPILGNHEVESPDGDIEFTTDDDPDVEDFDAEIVINSATGDVATSEDSIELTYFHAD 157 (426) Q Consensus 78 ~~~v~~~t~v~~~~~~~~~tv~~~~~s~~~~~~~ta~~i~~~~~~~~~~t~~~~~~~~~~~~g~~t~~~~~~~~~~~~~d 157 (426) +....++...+ +.....+.+......-.++++..+...+ ...+ T Consensus 70 --------------~~~~~~~~p~~---------~~~d~~~Av~~a~~~~s~E~V~v~~~~s--------------~~a~ 112 (370) T protein:vir:78 70 --------------QNWSAAAYVLP---------TDKPWLDAARDAQQTQSFEGVVVLGQEW--------------HQAA 112 (370) T ss_pred --------------CceEEEEEEec---------CchhHHHHHHHHHhhCCccEEEEecCcc--------------hHHH Confidence 11111111110 0001111111111111111111110000 0000 Q ss_pred chhhhhhhccccceeecccccchhhhHhHhhhhhhhhhcc-eEEEEEecccccccc-hhhHHHHHHHhhccCc-ce-EEE Q lcl|Aclame:pro 158 WSQLDEFPSDVNNFAVADRRFDLKGVGVLDETHSWASDED-MGMIANGVNVDDYDS-VDEAMDVAHEVAGYVP-SG-DLM 233 (426) Q Consensus 158 ~~~~~~~~s~~~~~~la~~~~~~~~~~~~~~~~~wa~~~~-kl~~~~~~d~~~~~~-~~~~~~~a~~~a~~~~-rt-~~~ 233 (426) |. +......++ |..=.+ ..|+........-.. .+.......+..+... +. .|. 
T Consensus 113 ~~---------------------a~~~~a~el--~n~~~Rpv~file~~~~~~~e~w~~y~~~l~al~~gia~~~V~vvp 169 (370) T protein:vir:78 113 IN---------------------AAHALNQEL--IAKWGRWQFMLLAVPAIADEQDWATYEAELATLQDGIAASSVSLIP 169 (370) T ss_pred HH---------------------HHHHHHHHH--HHhcCCeEEEEEeecCCCCcCCHHHHHHHHHHhhhccccccceEEe Confidence 10 111111111 111112 233333332221100 0111222223333211 11 111 Q ss_pred EecCCCccchhHHHHHHHhh------hcccccceeecccccceeecccccccc--ccccchhHHHHhhcCc-cEEEEEcC Q lcl|Aclame:pro 234 MIVDASDDDLAAYQLGKFAV------SEPWYNPLWNELPAGETVSKNVGDPEE--QGTFEGGDEAEGEGPV-NVLIDVSD 304 (426) Q Consensus 234 ~~~~~~~~~~~aa~~g~~~~------~~p~~~~~~~~~~~~~~~~~~k~~~gv--~~~~~~~~~~~~~~~~-N~~~~~~g 304 (426) ... ....+.++||++- ..|.+- +.+.... +...|-- ...+..+.+.+|+.+. .++..|.| T Consensus 170 ~~~----g~~~G~~aGRL~naavsVadsP~Rv---~tG~l~g----l~~~p~d~~~~~l~~a~l~aLd~agy~vp~~Y~g 238 (370) T protein:vir:78 170 QLW----PTLAGAYAGRLCNRAVSIADSPCRV---KTGALVG----LGNKPVGKDGIPLPLATLQTLEANRYSVPMWYPD 238 (370) T ss_pred eec----cccHHHHHHHHhcCeeeecccceee---ecccccc----ccccccccCCcccCHHHHHHHHhCCCeEEEeeCC Confidence 111 1234677888642 122211 1111000 1111211 1235566778887655 88777777 Q ss_pred --CEEEeeceeecCcccCcceeehhhhHHHHHHHHHHHHHHHHhcCCCCcccHHHHHHHHHHHHHHHHHhhcC---CCcc Q lcl|Aclame:pro 305 --ANRVSNAVTTAGADSDTSFFDIRRTKVYTAEMLELDLESLQVSDDDVPFTEDGQAMIEDAIKGTMSGLTGS---VGQP 379 (426) Q Consensus 305 --~~~~~~~~t~~G~~~sg~~iD~i~g~dwl~~~iq~~l~~ll~~~~KIp~td~Gi~~i~~~v~~~l~~~v~~---~g~~ 379 (426) +.|...+.++.--.+|.+||-.+|=.|=..-+++...-..+... ++-=|...|+.......+.|++.... ++.. 
T Consensus 239 y~G~Y~~d~~tl~~~gsDYq~ie~~RVvdKa~R~vR~~ai~~i~D~-~lnst~gsia~~~~~~~~~L~ema~s~~i~~~~ 317 (370) T protein:vir:78 239 YDGIYWADGRTLDAEGGDYQVIENLRIAYKVARRMRLRAIARIGDR-SFNSTPGSTAAAITYFGKDLREMAKSTTINGQP 317 (370) T ss_pred CCceEEeCceEeccCCCChhhhhhhhHHHHHHHHHHHHHHHHhCCc-ccCCCCcchhHHHHHHHhhHHHHHhhhhhcccc Confidence 89999999998777899999999998888888886554444333 33336677777777777888765543 3555 Q ss_pred ccceeEecCcccCcH-HHHHhhcCCCeEEEEEEcccEEEEEEEEEEeC Q lcl|Aclame:pro 380 LAEYEVDVPEWDDDD-VDRVNRNWGGIDLDARLAQRAHTFSLGLNVSV 426 (426) Q Consensus 380 ~~~y~~~~p~~~~~~-~dra~R~~~~i~~~~~laGAIh~v~I~g~v~v 426 (426) +++ .|..|.-.++. +-.+.++.. |.+..+.=|--..+.++.-|-+ T Consensus 318 fpg-eI~~p~d~Di~i~w~s~~~v~-I~~~v~P~~~pk~Itv~I~LDl 363 (370) T protein:vir:78 318 FPG-DIASPQDGDIRIQWVAKNLVS-VFVVVRTVDCPKGITVNIMLDL 363 (370) T ss_pred cce-eEeccCCCcceEEeeccceEE-EEEEEEeccCCceEEEEEEEee Confidence 656 67778866653 222333222 4454444444444444443333 No 62 >protein:vir:95741 Length: 587 # NCBI annotation: ORF011 # Family: family:all:2449 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240910;genbank:gi:66394960;genbank:GeneID:5132682 Probab=71.21 E-value=0.2 Score=24.43 Aligned_cols=398 Identities=13% Similarity=0.087 Sum_probs=174.5 Q ss_pred CC-----CceEEEEEeec-cccccccCccceEEEecccccccccchhhhheeecHHHHHhccCCCCHHHHHHHHHHhc-- Q lcl|Aclame:pro 1 MP-----KQIVEIELTAE-IADRPQETFTDAAIVGTAEEEPPDAEFGEVNQYSTSTSVGDDYGEDSDVYTASEAIEEM-- 72 (426) Q Consensus 1 mp-----~~iVnV~isl~-t~a~~~~~Fg~~Lilg~~~~~~~~~~~~~~~~Yts~~~V~~Dfg~~sp~YkAA~~~f~Q-- 72 (426) -| ++=|-|.+.-+ ..+...-+.+.+.|||....-|| ++.-++++.++...=||... 
.-.+.+++|.| T Consensus 6 ~~~~~~~~pgv~~~~~~~~~~~~~~~~~~~~~~~g~a~~G~~----~~~~~~~~~~~~~~~~~~g~-l~~~~~~a~~~~~ 80 (587) T protein:vir:95 6 FPRRPITRPHASIEVDTSGIGGSAGSSEKVFCLIGQAEGGEP----NTVYELRNYSQAKRLFRSGE-LLDAIELAWGSNP 80 (587) T ss_pred cCCcccccCceEEEEecCCccccCCCCCceEEEEEEeCCCCC----ceeEEeccHHHHHHHhcCcc-hHHHHHHHhcccc Confidence 23 33455555553 56677788999999999887776 34667889999999998543 44677888876 Q ss_pred --CCceeeeeeccccc--------cccccccccceeccce-------eecccc-----cccchhhhhh------hcc--- Q lcl|Aclame:pro 73 --GAEQWRVMVLEATE--------VTEEELSDGDTIDKVP-------ILGNHE-----VESPDGDIEF------TTD--- 121 (426) Q Consensus 73 --~~~~~~~~v~~~t~--------v~~~~~~~~~tv~~~~-------~s~~~~-----~~~ta~~i~~------~~~--- 121 (426) +....++++.+... .+...+.+|..-|... +++... .++-..++.+ .+. T Consensus 81 ~~g~~~~~~~rv~~~~~a~~~~~~l~~~a~~~G~~gN~i~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~v~si~y~g 160 (587) T protein:vir:95 81 NYTAGRILAMRIEDAKPASAEIGGLKITSKIYGNVANNIQVGLEKNTLSDSLRLRVIFQDDRFNEVYDNIGNIFTIKYKG 160 (587) T ss_pred CCCceEEEEEEcCCCceeEEEecCeEEEEecccccccceEEEEecCCCCCceeEEEEEecccceeeeeeccceeeeeeec Confidence 34444444333211 1111122222222111 111100 0000001000 000 Q ss_pred --c---------c--ccc---------ccceeeeeeccc-----------------c-ceeeechhheeeec-------- Q lcl|Aclame:pro 122 --D---------D--PDV---------EDFDAEIVINSA-----------------T-GDVATSEDSIELTY-------- 153 (426) Q Consensus 122 --~---------~--~~~---------t~~~~~~~~~~~-----------------~-g~~t~~~~~~~~~~-------- 153 (426) . + ..+ ..+..-.+.+.. 
+ .+...........+ T Consensus 161 ~~~~~~~~v~~~~~t~~a~~~~l~~g~~~v~~yrL~~g~~~~~~~~~~~in~~~~~tAky~g~~~~~i~~~~~~~~~~~~ 240 (587) T protein:vir:95 161 EEANATFSVEHDEETQKASRLVLKVGDQEVKSYDLTGGAYDYTNAIITDINQLPDFEAKLSPFGDKNLESSKLDKIENAN 240 (587) T ss_pred cccccceeeeecccceeeeeeeeecCCceEEEEEecCCchHHHHHHHHhhccccceEEEEecccCceeEEeecCcccccc Confidence 0 0 000 000000000000 0 00000000000000 Q ss_pred -c---------cccch------------hhh---hhhc-----------------------cccceeecccccchhhhHh Q lcl|Aclame:pro 154 -F---------HADWS------------QLD---EFPS-----------------------DVNNFAVADRRFDLKGVGV 185 (426) Q Consensus 154 -~---------~~d~~------------~~~---~~~s-----------------------~~~~~~la~~~~~~~~~~~ 185 (426) . ..|.. .+. +... +.....|.....+-... + T Consensus 241 v~~~~~~v~a~~~d~~~~~~~~~~v~~~~~~g~~~~~~~~~~~~~~~~a~~~~~~~~~~~a~~~~t~LtGG~dG~~~~-~ 319 (587) T protein:vir:95 241 IKDKAVYVKAVFGDLEKQTAYNGIVSFEQLNAEGEVPSNVEVEAGEESATVTATSPIKTIEPFELTKLKGGTNGEPPA-T 319 (587) T ss_pred eehhhhhhhhhhcceeeeeeceeeeeeecccccceeccchhhhhcccchheeccccccceeccceeeeecCCCCCCcc-c Confidence 0 00000 000 0000 00000011000000000 0 Q ss_pred HhhhhhhhhhcceEEEEE-ecccccc------------------------cchhhHHHHHHHhhccCcceEEEEecCC-- Q lcl|Aclame:pro 186 LDETHSWASDEDMGMIAN-GVNVDDY------------------------DSVDEAMDVAHEVAGYVPSGDLMMIVDA-- 238 (426) Q Consensus 186 ~~~~~~wa~~~~kl~~~~-~~d~~~~------------------------~~~~~~~~~a~~~a~~~~rt~~~~~~~~-- 238 (426) .+..-+-.+..+..++.. +.+.... ..........+.+.-++.|-.++ +... 
T Consensus 320 y~~~l~ale~~~~~~i~~~t~d~~v~a~l~a~vk~~~~~g~~~~aVvg~~~~~~~~~~~~~a~~~n~ervi~v-~~~~~~ 398 (587) T protein:vir:95 320 WADKLDKFAHEGGYYIVPLSSKQSVHAEVASFVKERSDAGEPMRAIVGGGFNESKEQLFGRQESLSNPRVSLV-ANSGTF 398 (587) T ss_pred HHHHHHHHHhCCcEEEEecCCCHHHHHHHHHHHHHHHhCCCcEEEEEcCCCCCCHHHHHHHHhhcCCCcEEEe-cccceE Confidence 000000011222112111 1111000 00000111122222233332222 2110 Q ss_pred ------Cc----cchhHHHHHHHhhhcccccceeecccccceeeccccccccccccchhHHHHhhcC-ccEEEEEcCC-- Q lcl|Aclame:pro 239 ------SD----DDLAAYQLGKFAVSEPWYNPLWNELPAGETVSKNVGDPEEQGTFEGGDEAEGEGP-VNVLIDVSDA-- 305 (426) Q Consensus 239 ------~~----~~~~aa~~g~~~~~~p~~~~~~~~~~~~~~~~~~k~~~gv~~~~~~~~~~~~~~~-~N~~~~~~g~-- 305 (426) .. ....+++.|..+..+|..+++++... ..++.+.++.+|+..+..+ .++++...+. T Consensus 399 ~~~dg~~~~~~~~~~aa~vAGl~Ag~~~~~SlT~~~i~----------~~~v~~~~t~~e~e~ai~~Gvl~l~~~~~~~~ 468 (587) T protein:vir:95 399 VMDDGRKNHVPAYMVAVALGGLASGLEIGESITFKPLR----------VSSLDQIYESIDLDELNENGIISIEFVRNRTN 468 (587) T ss_pred ecCCCceeeechHHHHHHHHHHHhcCchhcCccceeee----------cccccccCCHHHHHHHHhCCeEEEEEecCCcc Confidence 01 12246777777777776555544321 2344455677776655433 3555543332 Q ss_pred --EEEeeceeecCcccCcce--eehhhhHHHHHHHHHHHHHHHHhcCCCCcccHHHHHHHHHHHHHHHHHhhcCCCcccc Q lcl|Aclame:pro 306 --NRVSNAVTTAGADSDTSF--FDIRRTKVYTAEMLELDLESLQVSDDDVPFTEDGQAMIEDAIKGTMSGLTGSVGQPLA 381 (426) Q Consensus 306 --~~~~~~~t~~G~~~sg~~--iD~i~g~dwl~~~iq~~l~~ll~~~~KIp~td~Gi~~i~~~v~~~l~~~v~~~g~~~~ 381 (426) ..+.++.++-....+-.| |=+||..|.+...|+..+.+.+.- | |=.+.|-..|++.|...|++-.+.+- +- T Consensus 469 ~~vriv~~itT~t~~~d~~~~~i~viRv~D~i~~dir~~~~~~~iG--k-~nn~~~r~~v~~~i~~~L~~l~~~ga--I~ 543 (587) T protein:vir:95 469 TFFRIVDDVTTFNDKSDPVKAEMAVGEANDFLVSELKVQLEDQFIG--T-RTINTSASIIKDFIQSYLGRKKRDNE--IQ 543 (587) T ss_pred eEEEEeecceeccCCCCcchhhhhhhhhHHHHHHHHHHHHHhhCCc--c-ccchHHHHHHHHHHHHHHHHHHhCCc--cc Confidence 345677777443334445 669999999999999998877764 3 34678999999999999999876442 33 Q ss_pred ceeEecCcccCcHHHHHhhcCCCeEEEEEEcccEEEEEEEEEEeC Q lcl|Aclame:pro 382 
EYEVDVPEWDDDDVDRVNRNWGGIDLDARLAQRAHTFSLGLNVSV 426 (426) Q Consensus 382 ~y~~~~p~~~~~~~dra~R~~~~i~~~~~laGAIh~v~I~g~v~v 426 (426) +|+. ++.+.++.+.+. -++|.++..-+++++.+++.+.= T Consensus 544 ~~~~-----~dv~v~~~~d~~-~v~~~v~Pv~~mekI~vt~~~~~ 582 (587) T protein:vir:95 544 DFPA-----EDVQVIVEGNEA-RISMTVYPIRSFKKISVSLVYKQ 582 (587) T ss_pred CCCc-----cceEEEecCCEE-EEEEEEEEcccceEEEEEEEEee Confidence 4432 111111111111 37888999999999999888766 No 63 >protein:vir:80488 Length: 562 # NCBI annotation: Tsh # Family: family:all:2449 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468473;genbank:gi:157325048;genbank:GeneID:5601446 Probab=61.61 E-value=0.35 Score=23.08 Aligned_cols=395 Identities=13% Similarity=0.082 Sum_probs=169.7 Q ss_pred CC----------CceEEEEEee-ccccccccCccceEEEecccccccccchhhhheeecHHHHHhccCCCCHHHHHHHHH Q lcl|Aclame:pro 1 MP----------KQIVEIELTA-EIADRPQETFTDAAIVGTAEEEPPDAEFGEVNQYSTSTSVGDDYGEDSDVYTASEAI 69 (426) Q Consensus 1 mp----------~~iVnV~isl-~t~a~~~~~Fg~~Lilg~~~~~~~~~~~~~~~~Yts~~~V~~Dfg~~sp~YkAA~~~ 69 (426) |- ++=|-|.+.- ...+...-+.+.+.|||....-|| ++.-++++.++...=||... .-.|.+++ T Consensus 1 ~~~~~~~~~~~~~pgv~~~~~~s~~~~~~~~~~~~~~~ig~a~~G~~----~~~~~~~~~~~~~~~f~~g~-l~~~i~~a 75 (562) T protein:vir:80 1 MAIEIYPRKPVSRPHTEISVDTSGIGGSSSGSEKILCLVGSATGGKP----NAVYKVRNYSQAKSVFRSGE-LLDAIERA 75 (562) T ss_pred CeeeeeCCCcccCCceEEEEecCCcccCCCCCCceEEEEEEeCCCCc----ceeEEEccHHHHHHHhcCCC-hHHHHHHh Confidence 42 2335555554 455667788999999999887775 45778999999999997433 33466777 Q ss_pred Hh----cCCceeeeeecccccccc--------ccccccc-------------------------------------eecc Q lcl|Aclame:pro 70 EE----MGAEQWRVMVLEATEVTE--------EELSDGD-------------------------------------TIDK 100 (426) Q Consensus 70 f~----Q~~~~~~~~v~~~t~v~~--------~~~~~~~-------------------------------------tv~~ 100 (426) |. .+....++++.+.+.... ....++. .|-. 
T Consensus 76 ~~~~~~~g~~~~~~~rv~~a~~a~~~~~~~~~~~~~~g~~~n~i~v~~~~~~~~~~~~~~v~~~~~~~~ev~~~~g~v~~ 155 (562) T protein:vir:80 76 WNPGEGTGAGDILAMRVEEAKEATFEAEGVKVSSTIYGADANDIQVALEDNTITGTKRLSIVFAKERVNQVYDNLGSIFS 155 (562) T ss_pred cccccccCceEEEEEEcCCCCcceEEecceEEEEeecccCCCceEEEEecCCCCCCcceEEEecCCcceEEeeccCceee Confidence 75 343333333322211100 0000000 0000 Q ss_pred ceeecccc-----------------------------------cccchhhhhhhcccccccccce------------eee Q lcl|Aclame:pro 101 VPILGNHE-----------------------------------VESPDGDIEFTTDDDPDVEDFD------------AEI 133 (426) Q Consensus 101 ~~~s~~~~-----------------------------------~~~ta~~i~~~~~~~~~~t~~~------------~~~ 133 (426) .+.++.+. ...+...+.+.+...+..++.. .+. T Consensus 156 i~y~g~~~~a~~~i~~~~~~~~a~~l~~~~g~~~v~~~~l~~g~~~~~~~l~~~i~~~~~~tAky~g~~~n~i~~~~~d~ 235 (562) T protein:vir:80 156 IKYKGTEASATFTVAVDPVTFKATKLTLKAGDKTVKEYDLGSGAYAETNVLISDINNLPDFEAKFFPIGDKNLTTDNFDA 235 (562) T ss_pred eeeccccccceeEEEecCccceEEEEEEecCCcceeEEEeCCCccchhhhhhhhhccccceEEEecccCCceeeeccccc Confidence 00000000 0000000110000000000000 000 Q ss_pred ee----ccccceeeechhheeeecccccchhhhhhhc-cc---cceeeccccc-----chhh-h---------------- Q lcl|Aclame:pro 134 VI----NSATGDVATSEDSIELTYFHADWSQLDEFPS-DV---NNFAVADRRF-----DLKG-V---------------- 183 (426) Q Consensus 134 ~~----~~~~g~~t~~~~~~~~~~~~~d~~~~~~~~s-~~---~~~~la~~~~-----~~~~-~---------------- 183 (426) .. .+...+++.............+|.......+ .+ ....|..... .|.. . 
T Consensus 236 ~~~~~~kt~~~~v~~~~~d~~~~~~~n~~v~~~~~~~~~la~~~~~~LtGG~dG~~~~~~~dal~~Le~~~~~~i~~~t~ 315 (562) T protein:vir:80 236 QIDVDIKTKEAYVKAVGGDIEKQTAYNGYVEFEFDRSKEIANFPLTKLTGGDNGTIPESWADKFSYFANEGGYYLVPLTS 315 (562) T ss_pred chhhhcccceeeeeehhhhhhhcccccceEEEEeccCccccccceeeeeCCCCCCccccHHHHHHHHHhCCcEEEEecCC Confidence 00 0000000000000000000001100000000 00 0001110000 0000 0 Q ss_pred --HhHhhhhhhh---hhcceEEEEEecccccccchhhHHHHHHHhhccCcceEEEEecC-------CCc----cchhHHH Q lcl|Aclame:pro 184 --GVLDETHSWA---SDEDMGMIANGVNVDDYDSVDEAMDVAHEVAGYVPSGDLMMIVD-------ASD----DDLAAYQ 247 (426) Q Consensus 184 --~~~~~~~~wa---~~~~kl~~~~~~d~~~~~~~~~~~~~a~~~a~~~~rt~~~~~~~-------~~~----~~~~aa~ 247 (426) .....+.+|+ .++.+...+...... .........+.++-++.|-.++.... ... ....+++ T Consensus 316 d~ai~~~~~a~vkr~r~~g~~~~aVvg~~~---~~~~~~~~~~a~~~n~e~vv~v~~~~~~~~~~~~~~~~~~~~~aa~v 392 (562) T protein:vir:80 316 KQAVHAEALQFVRDCSYNGNPMRVFVGGGI---GESMEQLFTRAIGLQNERAGLIGFSGTVKMDDGRSLKMPGYMFAAQV 392 (562) T ss_pred ChHHHHHHHHHHHHHHhCCCeEEEEecCCC---CCCHHHHHHHhhhcCCCeEEEEecCeeEECCCCceeeechhHHHHHH Confidence 0011122232 111121111111100 00111112222233333333322110 001 1235677 Q ss_pred HHHHhhhcccccceeecccccceeeccccccccccccchhHHHHhhcCc-cEEEEEcCC-E---EEeeceeecCcccCcc Q lcl|Aclame:pro 248 LGKFAVSEPWYNPLWNELPAGETVSKNVGDPEEQGTFEGGDEAEGEGPV-NVLIDVSDA-N---RVSNAVTTAGADSDTS 322 (426) Q Consensus 248 ~g~~~~~~p~~~~~~~~~~~~~~~~~~k~~~gv~~~~~~~~~~~~~~~~-N~~~~~~g~-~---~~~~~~t~~G~~~sg~ 322 (426) +|..+..++..+++++... ..++...++.+|+..+.... +.+....+. . .+.++.++-....+-. 
T Consensus 393 AGl~Ag~~~~~S~T~~~i~----------~~~v~~~lt~~e~~~li~~G~l~l~~~~~~~v~~~riv~~itT~t~~~~~~ 462 (562) T protein:vir:80 393 AGLTCGLEIGEAITFKNIA----------IETLDTIYEGSQLDQLNESGIITAEFVRNRAVTNFRIVDDVTTFNDKTDPV 462 (562) T ss_pred HHHHhcCccccCccceeec----------cccccccCCHHHHHHHHhCCeEEEEEecCCcEEEEEeeccceeccCCCCch Confidence 7777776665555444332 12334456777776665443 555544333 2 2345666644332333 Q ss_pred --eeehhhhHHHHHHHHHHHHHHHHhcCCCCcccHHHHHHHHHHHHHHHHHhhcCCCccccceeEecCcccC--cHHHHH Q lcl|Aclame:pro 323 --FFDIRRTKVYTAEMLELDLESLQVSDDDVPFTEDGQAMIEDAIKGTMSGLTGSVGQPLAEYEVDVPEWDD--DDVDRV 398 (426) Q Consensus 323 --~iD~i~g~dwl~~~iq~~l~~ll~~~~KIp~td~Gi~~i~~~v~~~l~~~v~~~g~~~~~y~~~~p~~~~--~~~dra 398 (426) +|=+||..|.+.+.|+..+.+.+.- | |=.+.|-..|.+.|.+.|++-.+.+ . +-+|+ |...+ ..+|+ T Consensus 463 ~~ki~viRv~D~i~~dir~~~~~~yIG--k-~Nn~~~r~~v~~~i~~~L~~l~~~g-a-I~~~~---~~dv~v~~~~d~- 533 (562) T protein:vir:80 463 KSEIGVGEANDFLVSELKISLDNEYIG--T-KIIDTSASLVKNFVQSFLDRKKLAK-E-IQDYS---PEEVQVVIEGDI- 533 (562) T ss_pred hhhhhhhHHHHHHHHHHHHHHHhcCCc--c-ccChHHHHHHHHHHHHHHHHHHhCC-c-ccCCC---ccceEEEecCCE- Confidence 4679999999999999888777654 3 3467899999999999999887644 2 23343 11111 11222 Q ss_pred hhcCCCeEEEEEEcccEEEEEEEEEEeC Q lcl|Aclame:pro 399 NRNWGGIDLDARLAQRAHTFSLGLNVSV 426 (426) Q Consensus 399 ~R~~~~i~~~~~laGAIh~v~I~g~v~v 426 (426) . 
-++|.++..-+++++.+++++.- T Consensus 534 --~--~v~~~v~Pv~~mekIy~ti~~~~ 557 (562) T protein:vir:80 534 --A--RISLTVFPIRSMKKIEVSLVYRQ 557 (562) T ss_pred --E--EEEEEEEEcccceEEEEEEEEEe Confidence 2 27888999999999999888777 No 64 >protein:vir:80779 Length: 569 # NCBI annotation: putative tail sheath protein # Family: family:all:2449 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504132;genbank:gi:158079319;genbank:GeneID:5666428 Probab=46.07 E-value=0.75 Score=21.28 Aligned_cols=395 Identities=12% Similarity=0.100 Sum_probs=172.2 Q ss_pred CC-----CceEEEEEee-ccccccccCccceEEEecccccccccchhhhheeecHHHHHhccCCCCHHHHHHHHHHh--- Q lcl|Aclame:pro 1 MP-----KQIVEIELTA-EIADRPQETFTDAAIVGTAEEEPPDAEFGEVNQYSTSTSVGDDYGEDSDVYTASEAIEE--- 71 (426) Q Consensus 1 mp-----~~iVnV~isl-~t~a~~~~~Fg~~Lilg~~~~~~~~~~~~~~~~Yts~~~V~~Dfg~~sp~YkAA~~~f~--- 71 (426) .| ++=|-|.+.- ...+...-+.+.+.|||....-|| ++.-++++.++..+=||... .-.|..++|. T Consensus 6 ~~~~~~~~Pgv~~~~~~~~~~~~~~~~~~~~~~ig~a~~G~~----~~~~~~~~~~~~~~~f~~g~-l~~a~~~a~~~~~ 80 (569) T protein:vir:80 6 FPRKKVSRPHTEITVDTSGIGGSSSSSDKTLMLVGSAKGGKP----DTVYRFRNYQQAKQVLRSGD-LLDAIELAWNASD 80 (569) T ss_pred ecCCccccCceEEEEecCCCcCCCCCCceeEEEEEEeCCCCC----ceeEEecCHHHHHHHhcCCc-hhHHHHhhccCcc Confidence 23 3445555554 566678889999999999877775 45777899999999997533 4456677773 Q ss_pred ---cCCceeeeeeccccc---cccc-----cccccc-----eec--c------------------------------cee Q lcl|Aclame:pro 72 ---MGAEQWRVMVLEATE---VTEE-----ELSDGD-----TID--K------------------------------VPI 103 (426) Q Consensus 72 ---Q~~~~~~~~v~~~t~---v~~~-----~~~~~~-----tv~--~------------------------------~~~ 103 (426) +++...++++.+... .+.. ...++. .++ . ... 
T Consensus 81 ~~~~~~~~~~~~rv~~a~~a~~~~~~~~~~a~~~g~~~n~i~v~l~~~~~~~~~~~~v~~~~~~~~~~~~~ig~v~si~y 160 (569) T protein:vir:80 81 VNTASAGDILAVRVEDAKNATLTKGGLTFASTIYGVDANEIQVALEDNNLTHTKRLTVAFSKDGYKKVFDNLGKIFSIQY 160 (569) T ss_pred ccccCceEEEEEEcCCCeeeeeeccceeeeeeeccCCCceEEEEEecCcCCcceeeEEeeecCCCccccccccceeeEEE Confidence 444444444432210 0100 000000 000 0 000 Q ss_pred eccc----------ccccchhhhhh--------------------------hcccccccc-cceeeeeec---------- Q lcl|Aclame:pro 104 LGNH----------EVESPDGDIEF--------------------------TTDDDPDVE-DFDAEIVIN---------- 136 (426) Q Consensus 104 s~~~----------~~~~ta~~i~~--------------------------~~~~~~~~t-~~~~~~~~~---------- 136 (426) ++.+ ....++..+.. .+...+..+ ...+..... T Consensus 161 tg~~~~a~~~~~~~~~~~~a~~l~~~~g~~~~~~~~v~~~~~~~~~~~~~~~lv~~~~~~~~f~a~~~~~~~~~~~~~~~ 240 (569) T protein:vir:80 161 KGSEAQANFTIAQDSISKKATTLTLNVGSEPESTTEVMKYELGQGVYSETNVLVSAINSLPDWEAKFFPIGDKNLPTDAL 240 (569) T ss_pred eeccccceEEeecCcCcceeEEEEEEecCCcceeEEEEeeccCCccchhhhhhhhhcCCccCceEEEEecCCCcceehhc Confidence 0000 00000000000 000000000 000000000 Q ss_pred ---------cccceeeechhheeeecccccchhhhhhhc-cccc---eeeccccc-----chhhh--------------- Q lcl|Aclame:pro 137 ---------SATGDVATSEDSIELTYFHADWSQLDEFPS-DVNN---FAVADRRF-----DLKGV--------------- 183 (426) Q Consensus 137 ---------~~~g~~t~~~~~~~~~~~~~d~~~~~~~~s-~~~~---~~la~~~~-----~~~~~--------------- 183 (426) +....++.....+.......+|..+....+ .+.. ..|..... 
.|..- T Consensus 241 d~~~~~~~~t~~~~~~~~~~di~~~~~~~~~v~~~~~~~~~l~~~~~~~LtGG~dG~~~~~~~~~l~~le~~~~~~i~~~ 320 (569) T protein:vir:80 241 EAVTKVDVKTEAVFVGALAGDIAKQLEYNDYVTVAVDATKPVEDFELTNLTGGSDGTAPESWANKFPLLANEGGYYLVPL 320 (569) T ss_pred cchhheeccccceeeehhHHHHHHhhcCCceEEEEecCCcceeeecceeecCCCCCCccchHHHHHHHHhhCCcEEEEec Confidence 000000000000000000000100000000 0000 01111000 01000 Q ss_pred ----HhHhhhhhhhh---hcceEEEEEecccccccchhhHHHHHHHhhccCcceEEEEecCC------------Cccchh Q lcl|Aclame:pro 184 ----GVLDETHSWAS---DEDMGMIANGVNVDDYDSVDEAMDVAHEVAGYVPSGDLMMIVDA------------SDDDLA 244 (426) Q Consensus 184 ----~~~~~~~~wa~---~~~kl~~~~~~d~~~~~~~~~~~~~a~~~a~~~~rt~~~~~~~~------------~~~~~~ 244 (426) .....+.+|++ ++.+...+...... ..+......+.+.-++ +.+.+++... ...+.. T Consensus 321 t~d~av~~~l~a~vkr~r~~g~~~~aVvg~~~---~~~~~~~~~~a~~~n~-e~vv~v~~~~~~~~~~g~~~~~~~~~~a 396 (569) T protein:vir:80 321 TDKQAVHSEALAFVKDRTDNGDPMRIIVGGGT---NETVEESITRATNLRD-PRASLVGFSGTRKMDDGRLLKLPGYMMA 396 (569) T ss_pred CCChHHHHHHHHHHHHHHhCCCcEEEEecCCC---CCCHHHHHHHHhhcCC-CeEEEEecCceeecCCCcceeechhhHH Confidence 00112223321 11121111111100 0000111112222222 3333332210 011235 Q ss_pred HHHHHHHhhhcccccceeecccccceeeccccccccccccchhHHHHhhcCc-cEEEEEcCCE----EEeeceeecCccc Q lcl|Aclame:pro 245 AYQLGKFAVSEPWYNPLWNELPAGETVSKNVGDPEEQGTFEGGDEAEGEGPV-NVLIDVSDAN----RVSNAVTTAGADS 319 (426) Q Consensus 245 aa~~g~~~~~~p~~~~~~~~~~~~~~~~~~k~~~gv~~~~~~~~~~~~~~~~-N~~~~~~g~~----~~~~~~t~~G~~~ 319 (426) ++++|..+..+|..+++++... ..++...++.+|+..+.... +++....+.. .+.++.++-+... 
T Consensus 397 a~vAG~~A~~~~~~S~T~k~i~----------~~~i~~~lt~~e~~~li~~G~~~l~~~~~~~~~v~~~vn~itT~t~~~ 466 (569) T protein:vir:80 397 SQIAGIASGLEVGEAITFKHFN----------VTSVDRVFESSQLDMLNESGVISIEFVRNRTLTAFRVVQDVTTYNDKS 466 (569) T ss_pred HHHHHHHhcCccccCccceeec----------cccccccCCHHHHHHHHhCCeEEEEEecCceEEEEEEeccceecCCCC Confidence 6777777777776655554332 12334456777776654443 6665544332 2335666644333 Q ss_pred Ccce--eehhhhHHHHHHHHHHHHHHHHhcCCCCcccHHHHHHHHHHHHHHHHHhhcCCCccccceeEecCcccCcHHHH Q lcl|Aclame:pro 320 DTSF--FDIRRTKVYTAEMLELDLESLQVSDDDVPFTEDGQAMIEDAIKGTMSGLTGSVGQPLAEYEVDVPEWDDDDVDR 397 (426) Q Consensus 320 sg~~--iD~i~g~dwl~~~iq~~l~~ll~~~~KIp~td~Gi~~i~~~v~~~l~~~v~~~g~~~~~y~~~~p~~~~~~~dr 397 (426) +-.| |=+||..|.+...|+..+.+.+.- | |-.+.|-..|++.|.+.|++-.+.+ .+ -+|+ | ++.+.++ T Consensus 467 ~~~~~~i~viRv~D~i~~dir~~~~~~yiG--k-~nn~~~r~~v~~~i~~~L~~l~~~g-aI-~~~~---~--~dv~v~~ 536 (569) T protein:vir:80 467 DPVKNEMSVGEANDFLVSELKIELDNNFIG--T-KVIDTSASLIKNFIQSFLDNKKRAR-EI-QDYT---P--EEVQVVL 536 (569) T ss_pred CchhhhhhhhHHHHHHHHHHHHHHHhhcCc--c-cCChhHHHHHHHHHHHHHHHHHhCC-cc-cCCC---c--cceEEEe Confidence 3344 889999999999999888776654 3 4577889999999999999887644 22 2342 1 1121112 Q ss_pred Hh-hcCCCeEEEEEEcccEEEEEEEEEEeC Q lcl|Aclame:pro 398 VN-RNWGGIDLDARLAQRAHTFSLGLNVSV 426 (426) Q Consensus 398 a~-R~~~~i~~~~~laGAIh~v~I~g~v~v 426 (426) .+ |. 
-++|.++..-+++++.++.++.- T Consensus 537 ~~d~~--~v~~~v~Pv~~~ekI~~ti~~~~ 564 (569) T protein:vir:80 537 EGDVA--SISMTVMPIRSLNKITVQLVYKQ 564 (569) T ss_pred cCCEE--EEEEEEEEcccccEEEEEEEEee Confidence 11 22 27888999999999999988877 No 65 >protein:vir:3751 Length: 376 # NCBI annotation: orf23 # Family: family:all:669 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043492;genbank:gi:9628627;genbank:GeneID:1261129 Probab=23.26 E-value=2.3 Score=18.57 Aligned_cols=345 Identities=12% Similarity=0.013 Sum_probs=152.7 Q ss_pred CCC--ceEEEEEeeccccccccCccceEEEeccc-ccccccchhhhheeecHHHHHhccCCCCHHHHHHHHHHhcCCcee Q lcl|Aclame:pro 1 MPK--QIVEIELTAEIADRPQETFTDAAIVGTAE-EEPPDAEFGEVNQYSTSTSVGDDYGEDSDVYTASEAIEEMGAEQW 77 (426) Q Consensus 1 mp~--~iVnV~isl~t~a~~~~~Fg~~Lilg~~~-~~~~~~~~~~~~~Yts~~~V~~Dfg~~sp~YkAA~~~f~Q~~~~~ 77 (426) |+- .|-+++.--.+...-+|. .||||.+. +... .-.+..=|+++++ ||..+.+.|+=.+++.-.. T Consensus 1 ~~~~v~vn~ln~~qg~~~~ver~---~lfig~~~~~~~~---~~~~~~~sdld~~---lg~~ds~lk~~v~aa~~na--- 68 (376) T protein:vir:37 1 MFPSVQINALNQLSGETKEIERH---ALFVGVGTTNQGK---LLALTPDSDFDKV---FGETDTDLKKQVRAAMLNA--- 68 (376) T ss_pred CCCeEEEeeeeccCCCcccccce---EEEeeccccccCc---eEEecCCCChHHh---hCCCchhHHHHHHHHHhCC--- Confidence 883 444555555566555664 59999763 3221 1123334444444 6777778875555543321 Q ss_pred eeeeccccccccccccccceeccceeecccccccchhhhhhhcccccccccceeeeeeccccceeeechhheeeeccccc Q lcl|Aclame:pro 78 RVMVLEATEVTEEELSDGDTIDKVPILGNHEVESPDGDIEFTTDDDPDVEDFDAEIVINSATGDVATSEDSIELTYFHAD 157 (426) Q Consensus 78 ~~~v~~~t~v~~~~~~~~~tv~~~~~s~~~~~~~ta~~i~~~~~~~~~~t~~~~~~~~~~~~g~~t~~~~~~~~~~~~~d 157 (426) |+.+...+-....+ .....+.+......-.++++..+.... + .... 
T Consensus 69 -------------G~~w~a~~~~p~~~--------~~~~~~Av~~a~~~~s~E~V~v~~p~~----t---------~~a~ 114 (376) T protein:vir:37 69 -------------GQNWFAHVYIAQED--------GYDFVECVKKANQTASFEYCVNTRYLG----V---------DKAS 114 (376) T ss_pred -------------CCceEEEEEecCCC--------hhhHHHHHHHHHhhCCeeEEEEecCcc----h---------hHHH Confidence 22222111100000 001111111111111112211111000 0 0000 Q ss_pred chhhhhhhccccceeecccccchhhhHhHhhhhhhhhhcc-eEEEEEecccc--cccc---hhhHHHHHHHhhccCcceE Q lcl|Aclame:pro 158 WSQLDEFPSDVNNFAVADRRFDLKGVGVLDETHSWASDED-MGMIANGVNVD--DYDS---VDEAMDVAHEVAGYVPSGD 231 (426) Q Consensus 158 ~~~~~~~~s~~~~~~la~~~~~~~~~~~~~~~~~wa~~~~-kl~~~~~~d~~--~~~~---~~~~~~~a~~~a~~~~rt~ 231 (426) |. +......++ |..=.+ ..|+......+ ..+. .+.......+..|...+.. T Consensus 115 i~---------------------a~qa~a~el--~~~~~R~vffile~~g~d~~~~~ge~w~~y~~~l~a~~~gia~~~V 171 (376) T protein:vir:37 115 IG---------------------KLQECYAEL--LAKFGRRTFFIQAVQGINHDQSDGETWDQYVQKLTTLQQTIVADHV 171 (376) T ss_pred HH---------------------HHHHHHHHH--HHhcCCeEEEEEeccCCCCcccccCCHHHHHHHHHHHhccccccce Confidence 10 011111111 111122 23333332111 1111 1112222233333322221 Q ss_pred E---EEecCCCccchhHHHHHHHhhh------cccccceeecccccce--eeccccccccccccchhHHHHhhcC-ccEE Q lcl|Aclame:pro 232 L---MMIVDASDDDLAAYQLGKFAVS------EPWYNPLWNELPAGET--VSKNVGDPEEQGTFEGGDEAEGEGP-VNVL 299 (426) Q Consensus 232 ~---~~~~~~~~~~~~aa~~g~~~~~------~p~~~~~~~~~~~~~~--~~~~k~~~gv~~~~~~~~~~~~~~~-~N~~ 299 (426) . .++. ...+.++||++-. .|.+- +.++.... ......--| ..+..+-+.+|+.. 
--++ T Consensus 172 ~vV~~~~g-----n~~G~~aGRl~naaVsVadspgRV---~tGai~gl~~~~~p~d~~g--~el~~a~l~aLd~arysvp 241 (376) T protein:vir:37 172 CLVPLLFG-----NETGVLAGRLANRAVTVADSPARV---QTGALVSLGSANKPLDKDG--NELTLAHLKSLETARYSVP 241 (376) T ss_pred eeeeeecc-----chHHHHHHHHHhCCcchhcCccce---eecccccccccccccccCC--cccchHHHHHHHhCCCeEE Confidence 1 1122 2456778887521 34332 11111000 000000001 12444566677644 4777 Q ss_pred EEEcC--CEEEeeceeecCcccCcceeehhhhHHHHHHHHHHHHHHHHhcCCCCcccHHHHHHHHHHHHHHHHHhhcC-- Q lcl|Aclame:pro 300 IDVSD--ANRVSNAVTTAGADSDTSFFDIRRTKVYTAEMLELDLESLQVSDDDVPFTEDGQAMIEDAIKGTMSGLTGS-- 375 (426) Q Consensus 300 ~~~~g--~~~~~~~~t~~G~~~sg~~iD~i~g~dwl~~~iq~~l~~ll~~~~KIp~td~Gi~~i~~~v~~~l~~~v~~-- 375 (426) ..|.| +.|...+.++..-.+|.++|-.+|=.|=..-+++......+ .+..+.-|..+|+..+.-+...|++-.+. T Consensus 242 r~Y~gydG~Yw~dg~tl~~~gsDYq~ie~~RVvdKa~R~vR~~Ai~~i-~Dr~lnstp~sia~~~~~~~~pLr~M~ks~e 320 (376) T protein:vir:37 242 MWYPDYDGYYRADGRTLDVEGGDYQVIENLRVVDKVARKVRLLAIGKI-ADRSFNSTTSSTEYHKNYFAKPLRDMSKSAT 320 (376) T ss_pred EeeCCCCceEEeCCeEeccCCCCeeeehhchHHHHHHHHHHHHHHHHh-cCccccCChhHHHHHHHHHhHHHHHHHhhhh Confidence 77777 88999999998888899999999999988888886544433 34557889999999999999999987654 Q ss_pred -CCccccceeEecCcccCcHHHHHhhcCCCeEEEEEEcccEEEEEEEEEEeC Q lcl|Aclame:pro 376 -VGQPLAEYEVDVPEWDDDDVDRVNRNWGGIDLDARLAQRAHTFSLGLNVSV 426 (426) Q Consensus 376 -~g~~~~~y~~~~p~~~~~~~dra~R~~~~i~~~~~laGAIh~v~I~g~v~v 426 (426) +|..+|+ .|..|.-.++.---..|.--.|.+..+.=+-=..+.++.-|-+ T Consensus 321 i~g~~fpg-ei~~P~d~dI~i~w~sk~~V~I~~~vrPy~cpk~i~~~I~LDl 371 (376) T protein:vir:37 321 INGKDFPG-ECMPPKDDAITIVWQSKTKVTIYIKVRPYDCPKEITANIFLDL 371 (376) T ss_pred hccccccc-eeecCCCCceEEEeccCceEEEEEEEeeecCcceeEEEEEEec Confidence 2333332 2667776665311011111112222222222223333333333 Done!