Query lcl|Aclame:protein:vir:96740|NCBI_annot:phage tail sheath protein FI-like|genbank:acc:YP_001039839;genbank:gi:126010913;genbank:GeneID:5076250 Match_columns 388 No_of_seqs 155 out of 795 Neff 8.8 Searched_HMMs 1612 Date Mon Dec 2 14:09:03 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_47 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_47_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:96740 Length: 388 100.0 1E-120 9E-124 677.6 36.7 388 1-388 1-388 (388) 2 protein:vir:103993 Length: 390 100.0 9E-101 5E-104 569.1 34.2 374 1-388 1-389 (390) 3 protein:vir:78206 Length: 390 100.0 9E-101 5E-104 569.1 34.2 374 1-388 1-389 (390) 4 protein:vir:79181 Length: 390 100.0 3E-100 2E-103 566.0 34.7 374 1-388 1-389 (390) 5 protein:vir:1845 Length: 392 # 100.0 6E-100 3E-103 564.7 35.0 373 4-388 1-391 (392) 6 protein:vir:98553 Length: 395 100.0 7E-100 4E-103 564.3 35.2 373 4-388 1-394 (395) 7 protein:vir:79141 Length: 391 100.0 2.3E-99 1E-102 561.4 34.4 374 1-388 1-389 (391) 8 protein:vir:1172 Length: 391 # 100.0 2.3E-99 1E-102 561.3 34.1 375 1-388 1-390 (391) 9 protein:vir:100323 Length: 393 100.0 3.6E-99 2E-102 560.3 34.2 374 1-388 1-391 (393) 10 protein:vir:5711 Length: 396 # 100.0 7.5E-99 5E-102 558.5 34.4 373 4-388 1-394 (396) 11 protein:vir:6079 Length: 396 # 100.0 1.5E-98 1E-101 556.8 34.0 373 4-388 1-394 (396) 12 protein:vir:2035 Length: 396 # 100.0 2.4E-98 1E-101 555.8 33.9 373 4-388 1-394 (396) 13 protein:vir:107865 Length: 477 100.0 2.9E-98 2E-101 555.4 34.2 379 1-387 1-477 (477) 14 protein:vir:79092 Length: 477 100.0 8.5E-98 5E-101 552.8 33.9 379 1-387 1-477 (477) 15 protein:vir:10336 Length: 386 100.0 7.9E-98 5E-101 552.9 32.5 372 1-384 1-386 (386) 16 protein:vir:98263 Length: 664 100.0 5.5E-87 3.4E-90 493.5 33.3 378 1-388 1-659 (664) 17 protein:vir:6594 Length: 666 # 100.0 2.2E-86 1.4E-89 490.2 33.1 378 4-388 1-660 (666) 18 protein:vir:108052 Length: 660 100.0 3.8E-86 2.4E-89 488.9 33.4 380 4-388 1-660 (660) 19 protein:vir:80984 Length: 666 100.0 5.7E-86 3.6E-89 487.9 32.8 379 4-388 1-664 (666) 20 protein:vir:7206 Length: 659 # 100.0 8.7E-86 5.4E-89 486.9 33.8 378 4-388 1-655 (659) 21 protein:vir:103456 Length: 659 100.0 9.7E-86 6E-89 486.7 33.5 378 4-388 1-655 (659) 22 protein:vir:106427 Length: 679 100.0 8.4E-86 5.2E-89 487.0 32.9 380 4-388 1-678 (679) 23 protein:vir:101187 Length: 663 100.0 5.2E-85 3.2E-88 482.7 32.3 378 4-388 1-657 (663) 24 protein:vir:6894 Length: 660 # 100.0 3.9E-84 2.4E-87 477.8 34.9 380 4-388 1-659 (660) 25 protein:vir:5663 Length: 671 # 100.0 2E-84 1.2E-87 479.5 33.0 378 4-388 1-670 (671) 26 protein:vir:101804 Length: 663 100.0 3.9E-84 2.4E-87 477.9 32.0 378 4-388 1-657 (663) 27 protein:vir:100539 Length: 663 100.0 1E-83 6.5E-87 475.5 32.0 378 4-388 1-657 (663) 28 protein:vir:104858 Length: 729 100.0 1.3E-82 8.1E-86 469.5 33.8 380 1-387 1-729 (729) 29 protein:vir:106984 Length: 743 100.0 8.4E-81 5.2E-84 459.6 32.1 385 1-386 299-743 (743) 30 protein:vir:98824 Length: 774 100.0 2.4E-78 1.5E-81 446.2 28.5 369 1-386 279-774 (774) 31 protein:vir:104477 Length: 749 100.0 1.6E-77 9.8E-81 441.6 28.1 374 1-385 316-749 (749) 32 protein:vir:5833 Length: 742 # 100.0 1.1E-72 6.9E-76 415.1 29.2 362 1-385 320-742 (742) 33 protein:vir:79798 Length: 717 100.0 8.5E-51 5.3E-54 295.1 23.6 334 1-377 330-717 (717) 34 protein:vir:63742 Length: 562 100.0 1.9E-39 1.2E-42 232.9 27.0 357 1-382 1-562 (562) 35 protein:vir:80488 Length: 562 100.0 1.6E-38 1E-41 227.8 25.7 357 1-382 1-562 (562) 36 protein:vir:80779 Length: 569 100.0 2.8E-38 1.8E-41 226.4 26.9 354 1-382 1-569 (569) 37 protein:vir:103168 Length: 641 100.0 3.3E-38 2.1E-41 226.1 20.1 274 1-287 1-641 (641) 38 protein:vir:102819 Length: 648 100.0 1.6E-34 1E-37 205.8 28.6 355 1-380 1-648 (648) 39 protein:vir:95741 Length: 587 100.0 1.8E-34 1.1E-37 205.6 25.7 354 1-382 1-587 (587) 40 protein:vir:96586 Length: 587 100.0 1E-33 6.4E-37 201.4 25.4 354 1-382 1-587 (587) 41 protein:vir:107310 Length: 581 100.0 5.6E-34 3.5E-37 202.9 21.6 363 1-388 177-575 (581) 42 protein:vir:99306 Length: 587 100.0 3.1E-33 1.9E-36 198.8 25.5 354 1-382 1-587 (587) 43 protein:vir:7653 Length: 581 # 100.0 1.7E-33 1E-36 200.3 21.5 370 1-388 156-575 (581) 44 protein:vir:102957 Length: 437 99.9 1.8E-29 1.1E-32 178.2 23.3 356 1-376 1-437 (437) 45 protein:vir:100829 Length: 607 99.9 1.3E-27 7.8E-31 168.1 25.7 359 1-388 15-607 (607) 46 protein:vir:105470 Length: 451 99.9 3.7E-24 2.3E-27 149.1 25.9 357 1-376 1-451 (451) 47 protein:vir:101326 Length: 529 99.8 1.5E-21 9.3E-25 134.8 19.9 363 1-377 112-529 (529) 48 protein:vir:78986 Length: 436 99.6 2.6E-16 1.6E-19 106.0 23.8 356 1-376 3-436 (436) 49 protein:vir:102359 Length: 356 99.2 3.3E-12 2E-15 83.5 19.9 320 1-375 1-356 (356) 50 protein:vir:95263 Length: 450 98.7 1.2E-07 7.5E-11 58.5 26.3 355 6-378 1-450 (450) 51 protein:vir:3165 Length: 426 # 98.6 8.9E-08 5.5E-11 59.2 22.0 357 1-377 1-426 (426) 52 protein:vir:80052 Length: 331 98.6 1.8E-07 1.1E-10 57.5 25.2 313 1-377 1-331 (331) 53 protein:vir:5260 Length: 502 # 98.6 1.3E-07 8E-11 58.4 22.3 345 1-377 73-502 (502) 54 protein:vir:3788 Length: 376 # 98.0 6.5E-06 4.1E-09 49.0 23.6 338 7-382 1-376 (376) 55 protein:vir:78782 Length: 370 97.9 1.1E-05 6.5E-09 47.9 23.9 337 7-384 1-370 (370) 56 protein:vir:3751 Length: 376 # 97.8 2.2E-05 1.4E-08 46.1 24.5 340 7-382 1-376 (376) 57 protein:vir:106984 Length: 743 97.7 4.9E-08 3E-11 60.7 2.6 341 1-388 1-390 (743) 58 protein:vir:276 Length: 369 # 97.6 3.5E-05 2.2E-08 45.0 27.3 332 1-380 1-369 (369) 59 protein:vir:104477 Length: 749 97.3 3.6E-06 2.2E-09 50.4 8.3 191 1-388 1-192 (749) 60 protein:vir:4463 Length: 498 # 97.1 0.00019 1.2E-07 41.0 19.2 349 1-380 1-498 (498) 61 protein:vir:489 Length: 498 # 96.9 0.00028 1.8E-07 40.0 19.1 350 1-380 1-498 (498) 62 protein:vir:1996 Length: 495 # 96.5 0.00054 3.4E-07 38.5 22.6 347 1-377 1-495 (495) 63 protein:vir:4517 Length: 498 # 96.3 0.00076 4.7E-07 37.7 21.4 349 1-380 1-498 (498) 64 protein:vir:101576 Length: 501 96.0 0.0011 7.1E-07 36.7 24.6 363 1-377 1-501 (501) 65 protein:vir:3636 Length: 501 # 95.8 0.0014 8.7E-07 36.2 24.9 363 1-377 1-501 (501) 66 protein:vir:106730 Length: 501 95.2 0.0026 1.6E-06 34.7 24.3 363 1-377 1-501 (501) 67 protein:vir:78611 Length: 501 94.9 0.0032 2E-06 34.3 26.1 363 1-377 1-501 (501) 68 protein:vir:94073 Length: 494 93.8 0.0063 3.9E-06 32.7 26.4 359 1-377 1-494 (494) 69 protein:vir:96104 Length: 504 81.5 0.085 5.3E-05 26.4 21.9 351 1-376 68-504 (504) 70 protein:vir:99586 Length: 507 78.4 0.11 7.1E-05 25.7 21.8 349 1-376 71-507 (507) No 1 >protein:vir:96740 Length: 388 # NCBI annotation: phage tail sheath protein FI-like # Family: family:all:115 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039839;genbank:gi:126010913;genbank:GeneID:5076250 Probab=100.00 E-value=1.5e-120 Score=677.58 Aligned_cols=388 Identities=100% Similarity=1.475 Sum_probs=379.8 Q ss_pred CCCCCCcCCCeEEEEcCCCcccccccCcceeEEEeeccccccccccCcceeeccchhhhhhcccccccccchhhhhhhhc Q lcl|Aclame:pro 1 MPVIDQFEHNGISIETHEPPPPMGPPGDNVVAWVVTAPDKHADVAFSVPFRVANTADAQYLDSTGNELGTGWHAASETLK 80 (388) Q Consensus 1 M~~~t~~~hGV~~~e~~~~~~~i~~v~tav~g~vgta~~~~~~~~~~~~v~v~s~~~~~~~~~~~~~~gtl~~a~~~~~~ 80 (388) ||+||+|+||||++|++++++||+++++++++|||+++++++..++++++++.+..+.+..+..+...+|+..++..+++ T Consensus 1 m~~~~~~~hGv~v~ev~~g~~~i~~~~tavi~~Vgta~~ad~~~p~~~~~~i~~~~d~~~~~~~~~~~gtl~~al~~~~~ 80 (388) T protein:vir:96 1 MPVIDQFEHNGISIETHEPPPPMGPPGDNVVAWVVTAPDKHADVAFSVPFRVANTADAQYLDSTGNELGTGWHAASETLK 80 (388) T ss_pred CCCCCCCCCceEEEEcCCCcccccccCcceeEEEEecCCCccccccccceeeecchhhhhhhccccccccchhhhHhhhc Confidence 99999999999999999999999999999999999999999999999999999999999888888889999999999999 Q ss_pred cccceEEEEecccccccccccccccccccchhhhhhhHhhhhhhhhhhhheecccccchhHHHHHHHHHhhhCceEEEEe Q lcl|Aclame:pro 81 KTSVPQYFIVVPEGADDAATMANIIGGIDPTTGRRTGIAALTECTERPTLIGAPGFSQNKAVIDALASMAKRLKCRAVID 160 (388) Q Consensus 81 ~~~~~~~vv~~~~~~~~~~~~~~~~~~~~~~tg~~tgl~a~~~~~~~p~ll~ap~~~~~~~v~~~l~~~~~~~~~~~i~d 160 (388) +++..++++++.++++.+++.++++++.++.+|.++|++++.+++.+|+||++||+++.++|+++|.++|+++++++++| T Consensus 81 ~~~~~~~vv~v~~g~~~~at~a~iig~~~~~tg~~~gl~al~~~~~~p~il~aPg~s~~~~v~~al~~~~~~~~~~~i~D 160 (388) T protein:vir:96 81 KTSVPQYFIVVPEGADDAATMANIIGGIDPTTGRRTGIAALTECTERPTLIGAPGFSQNKAVIDALASMAKRLKCRAVID 160 (388) T ss_pred cCCceEEEEEeccccccccccceeeeecccccchhhHHHHhhhcccceeEEEeeccccchHHHHHHHHHHhhcCcEEEEe Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cCCCcchhHHHHHHHhhhcccccceEEEEecceecccccccceeehhhHHHHHHHHhccccccccccccccceeeccccc Q lcl|Aclame:pro 161 GPSGSTQDAIDLSGLLGGEGTGHDRVYMVDPMPAIYSRKAQGNIYVPPSTIAMGAVAAVKPWESPGNQGVLIQDVARVID 240 (388) Q Consensus 161 ~p~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~s~~~aG~~a~~d~~~s~~n~p~~~~g~~~~~~ 240 (388) +|.+..+.+.+++.+....+++|+|+++||||++++|+.++..+++|||+++||++|++|+|+||+|+++++.|+++++. T Consensus 161 ~p~~~~~~~~~~~~~~~~~~~~s~~~~~~~P~~~~~d~~~~~~~~~p~s~~~AG~~a~~D~~~spaN~~i~i~g~~~~~~ 240 (388) T protein:vir:96 161 GPSGSTQDAIDLSGLLGGEGTGHDRVYMVDPMPAIYSRKAQGNIYVPPSTIAMGAVAAVKPWESPGNQGVLIQDVARVID 240 (388) T ss_pred ccCCchhHHHHHHhhhhccCcCcceEEEEeCceeeecccCCceeeechHHHHHHHHHhhcCcccccCeeEEeeeeccccc Confidence 99998888889999898999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccCchhhhhhccccceEEEEEeCCCcEEEEccccCCCceeeehhhHHHHHHHHHHHHHHHhcccCCHHHHHHHHHHHH Q lcl|Aclame:pro 241 YNILDKSTEGDLLNRNGVSYFARTSMGGFSLIGNRTVTGKFISFVGLEDAIARKLEAASQRAMSKQLTKSFMEQEIKKIN 320 (388) Q Consensus 241 ~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~~~~i~vrR~~~~i~~~i~~~~~~~vfepn~~~~~~~i~~~i~ 320 (388) +.+.++.+|+++||++|||+|++|+++|+++||+||++|+||++|||++||+++|++.++|+|||||++.+|++|+++++ T Consensus 241 ~~~~~~~~~~~~Ln~~gI~~i~~~~~~G~~~wG~rT~~~~~i~vrR~~~~i~~si~~~~~~~v~epn~~~~~~~i~~~i~ 320 (388) T protein:vir:96 241 YNILDKSTEGDLLNRNGVSYFARTSMGGFSLIGNRTVTGKFISFVGLEDAIARKLEAASQRAMSKQLTKSFMEQEIKKIN 320 (388) T ss_pred ccccCChhhHHhhhhcCceEEEEecCCcEEEEcccccCCcceeehhhHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHhcCCeeeeeEEEeccCCCHHHhhCCeEEEEEEEEecCcceeEEEEEEEcchHHHHHHHhcC Q lcl|Aclame:pro 321 LFMQDLVAAEIIPGGEVYLHPTLNTVERYKNGSWYIVIDYGRYSPNEHMIFHLNAVDRIVEEFIEEVL 388 (388) Q Consensus 321 ~~L~~l~~~Gal~g~~v~~d~~~Nt~~~i~~G~~~~~v~~~p~~pae~I~~~~~~~~~~~~~l~~~~~ 388 (388) .||++||++|+|+||+++||+++||+++|++|+|+++|+++|++|+|||+|+++++++||++|+++|| T Consensus 321 ~fL~~l~~~Gal~g~~~~~d~~~nt~~~i~~G~~~~~i~~~p~~pae~I~~~~~~~~~~~~~~~~~~~ 388 (388) T protein:vir:96 321 LFMQDLVAAEIIPGGEVYLHPTLNTVERYKNGSWYIVIDYGRYSPNEHMIFHLNAVDRIVEEFIEEVL 388 (388) T ss_pred HHHHHHHhCCceeeeEEEEecCCCCHHHhhCCEEEEEEEEEecCCcceEEEEEEEchHHHHHHHHHhC Confidence 99999999999999999999999999999999999999999999999999999999999999999999 No 2 >protein:vir:103993 Length: 390 # NCBI annotation: phage tail sheath protein # Family: family:all:115 # MgeID: mge:1665 # MgeName: phi52237 # Cross-refs: genbank:acc:YP_293729;genbank:gi:72537699;genbank:GeneID:3608123 Probab=100.00 E-value=8.7e-101 Score=569.15 Aligned_cols=374 Identities=24% Similarity=0.381 Sum_probs=339.1 Q ss_pred CCCCCCcCCCeEEEEcCCCcccccccCcceeEEEeecccccccc-ccCcceeeccchhhhhhcccccccccchhhhhhhh Q lcl|Aclame:pro 1 MPVIDQFEHNGISIETHEPPPPMGPPGDNVVAWVVTAPDKHADV-AFSVPFRVANTADAQYLDSTGNELGTGWHAASETL 79 (388) Q Consensus 1 M~~~t~~~hGV~~~e~~~~~~~i~~v~tav~g~vgta~~~~~~~-~~~~~v~v~s~~~~~~~~~~~~~~gtl~~a~~~~~ 79 (388) ||+ +|+||||++|++++++|+.++++++++|+|+++++++.. ++++|+++++..+.... .+..++|..++..++ T Consensus 1 M~~--~~~~Gv~v~e~~~g~~~i~~~~tav~g~vg~a~~ad~~~~pln~pv~i~s~~~~~~~---~g~~gtL~~al~~~~ 75 (390) T protein:vir:10 1 MPQ--DYHHGVRVIEINEGGRPIRSVSTAVLGVVCTAADADASAFPLNTPVLLTNVVAALGK---AGKKGTLRRTLDAIG 75 (390) T ss_pred Ccc--cccCCeEEEEcCCCcccccccCcceeEEEEcccCcCccccccccceEeccHHHHHhh---cCCCceehhhhhhhc Confidence 996 789999999999999999999999999999999987764 68999999987766554 445789999999999 Q ss_pred ccccceEEEEecccccccccccccccccccchhhhhhhHhhhhhhh----hhhhheecccccchhHHHHHHHHHhhhCce Q lcl|Aclame:pro 80 KKTSVPQYFIVVPEGADDAATMANIIGGIDPTTGRRTGIAALTECT----ERPTLIGAPGFSQNKAVIDALASMAKRLKC 155 (388) Q Consensus 80 ~~~~~~~~vv~~~~~~~~~~~~~~~~~~~~~~tg~~tgl~a~~~~~----~~p~ll~ap~~~~~~~v~~~l~~~~~~~~~ 155 (388) ++++..+++|++.++.+.+.+..+++++.+. ++..+|++++.... ..|.++++|++++ .+|+++|..+|+++++ T Consensus 76 ~~gg~~~~vv~v~~~~~~~~~~~~~ig~~~~-~~~~tg~~al~~~~~~~~~~p~il~ap~~~~-~~v~~~l~~~a~~~~~ 153 (390) T protein:vir:10 76 KQTKPLTVVVRVAEGKDADETTSNVIGTVTP-DGKYTGIKALLAAQGALGVKPRILAAPGLDT-QPVAAALAATAQSLRA 153 (390) T ss_pred cccCceEEEEEeccccccccccccccccccc-ccccchhhhhhhhhhhhcceehhhcccccch-HHHHHHHHHhhcccce Confidence 9999999999999999999999999999886 46778888776654 4599999999975 5699999999999999 Q ss_pred EEEEecCCCcch-hHHHHHHHhhhcccccceEEEEecceecccccccceeehhhHHHHHHHHhcccc----ccccccccc Q lcl|Aclame:pro 156 RAVIDGPSGSTQ-DAIDLSGLLGGEGTGHDRVYMVDPMPAIYSRKAQGNIYVPPSTIAMGAVAAVKP----WESPGNQGV 230 (388) Q Consensus 156 ~~i~d~p~~~~~-~~~~~~~~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~s~~~aG~~a~~d~----~~s~~n~p~ 230 (388) ++++|.|.+.+. .+..+ ..+++|++.++||||++.+|+..+..+++|||+++||++|++|. |+||+|+++ T Consensus 154 ~aivD~p~~~t~~~a~~~-----~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~Agl~a~~D~~~g~~~spaN~~l 228 (390) T protein:vir:10 154 MAYVSASGCKTKEEAAAY-----RKQFGQREIMVIWPDWLGWDDTTNSTAVIPAPAIAAGLRAKIDNDIGWHKTISNVVV 228 (390) T ss_pred EEEEecCCCCCHHHHHHH-----hhccCCceEEEEcCceEeecccCCcccccchHHHHHHHHHHhhcCCCcEECcCCcee Confidence 999999976553 33333 34788999999999999999999999999999999999999995 566667766 Q ss_pred -cceeecccccccccCchhhhhhccccceEEEEEeCCCcEEEEccccCC----CceeeehhhHHHHHHHHHHHHHHHhcc Q lcl|Aclame:pro 231 -LIQDVARVIDYNILDKSTEGDLLNRNGVSYFARTSMGGFSLIGNRTVT----GKFISFVGLEDAIARKLEAASQRAMSK 305 (388) Q Consensus 231 -~~~g~~~~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~----~~~i~vrR~~~~i~~~i~~~~~~~vfe 305 (388) ++.+++..+.+...+..+|+++||++||+++++ ++||++||+||++ |+||++|||++||+++|++.++|+||| T Consensus 229 ~gi~~~~~~~~~~~~~~~~~~~~ln~~gi~t~~~--~~G~~~wG~rT~s~d~~~~~i~~rR~~~~i~~~i~~~~~~~v~e 306 (390) T protein:vir:10 229 NGVSGISADVSWDLQDPATDAGYLNEHEVTTLVN--RNGFRFWGERTCSDDPKFAFENYTRTAQVAGDSIAEAQMPVVDG 306 (390) T ss_pred eceeecceecccccccccchhhhhhhcCcEEEEc--CCCEEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhccC Confidence 477888889999999999999999999999976 6799999999985 999999999999999999999999999 Q ss_pred cCCHHHHHHHHHHHHHHHHHHHhcCCeeeeeEEEeccCCCHHHhhCCeEEEEEEEEecCcceeEEEEEEEcchHHHHHHH Q lcl|Aclame:pro 306 QLTKSFMEQEIKKINLFMQDLVAAEIIPGGEVYLHPTLNTVERYKNGSWYIVIDYGRYSPNEHMIFHLNAVDRIVEEFIE 385 (388) Q Consensus 306 pn~~~~~~~i~~~i~~~L~~l~~~Gal~g~~v~~d~~~Nt~~~i~~G~~~~~v~~~p~~pae~I~~~~~~~~~~~~~l~~ 385 (388) ||++.+|++|++++++||++||++|+|+||+|+||+++||+++|++|+|+++|+++|++|+|||+|+++++++|+++|++ T Consensus 307 ~n~~~~~~~i~~~i~~~L~~l~~~g~l~g~~v~~d~~~nt~~~i~~G~~~~~v~~~p~~pae~I~~~~~~~~~~~~~~~~ 386 (390) T protein:vir:10 307 PLNPSLARDIVESINGWFRQQVANGYLIGGSAWIDPEPNTADILASGKAYIDYDYTPVPPLENLVLRQRITDRFLADFPA 386 (390) T ss_pred CCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEccCCCCHHHhhCCeEEEEEEEEecCCcceEEEEEEEchHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hcC Q lcl|Aclame:pro 386 EVL 388 (388) Q Consensus 386 ~~~ 388 (388) +|. T Consensus 387 ~~~ 389 (390) T protein:vir:10 387 RVA 389 (390) T ss_pred Hhc Confidence 999 No 3 >protein:vir:78206 Length: 390 # NCBI annotation: gp33, phage tail sheath protein # Family: family:all:115 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111183;genbank:gi:134288694;genbank:GeneID:4960666 Probab=100.00 E-value=8.7e-101 Score=569.15 Aligned_cols=374 Identities=24% Similarity=0.381 Sum_probs=339.1 Q ss_pred CCCCCCcCCCeEEEEcCCCcccccccCcceeEEEeecccccccc-ccCcceeeccchhhhhhcccccccccchhhhhhhh Q lcl|Aclame:pro 1 MPVIDQFEHNGISIETHEPPPPMGPPGDNVVAWVVTAPDKHADV-AFSVPFRVANTADAQYLDSTGNELGTGWHAASETL 79 (388) Q Consensus 1 M~~~t~~~hGV~~~e~~~~~~~i~~v~tav~g~vgta~~~~~~~-~~~~~v~v~s~~~~~~~~~~~~~~gtl~~a~~~~~ 79 (388) ||+ +|+||||++|++++++|+.++++++++|+|+++++++.. ++++|+++++..+.... .+..++|..++..++ T Consensus 1 M~~--~~~~Gv~v~e~~~g~~~i~~~~tav~g~vg~a~~ad~~~~pln~pv~i~s~~~~~~~---~g~~gtL~~al~~~~ 75 (390) T protein:vir:78 1 MPQ--DYHHGVRVIEINEGGRPIRSVSTAVLGVVCTAADADASAFPLNTPVLLTNVVAALGK---AGKKGTLRRTLDAIG 75 (390) T ss_pred Ccc--cccCCeEEEEcCCCcccccccCcceeEEEEcccCcCccccccccceEeccHHHHHhh---cCCCceehhhhhhhc Confidence 996 789999999999999999999999999999999987764 68999999987766554 445789999999999 Q ss_pred ccccceEEEEecccccccccccccccccccchhhhhhhHhhhhhhh----hhhhheecccccchhHHHHHHHHHhhhCce Q lcl|Aclame:pro 80 KKTSVPQYFIVVPEGADDAATMANIIGGIDPTTGRRTGIAALTECT----ERPTLIGAPGFSQNKAVIDALASMAKRLKC 155 (388) Q Consensus 80 ~~~~~~~~vv~~~~~~~~~~~~~~~~~~~~~~tg~~tgl~a~~~~~----~~p~ll~ap~~~~~~~v~~~l~~~~~~~~~ 155 (388) ++++..+++|++.++.+.+.+..+++++.+. ++..+|++++.... ..|.++++|++++ .+|+++|..+|+++++ T Consensus 76 ~~gg~~~~vv~v~~~~~~~~~~~~~ig~~~~-~~~~tg~~al~~~~~~~~~~p~il~ap~~~~-~~v~~~l~~~a~~~~~ 153 (390) T protein:vir:78 76 KQTKPLTVVVRVAEGKDADETTSNVIGTVTP-DGKYTGIKALLAAQGALGVKPRILAAPGLDT-QPVAAALAATAQSLRA 153 (390) T ss_pred cccCceEEEEEeccccccccccccccccccc-ccccchhhhhhhhhhhhcceehhhcccccch-HHHHHHHHHhhcccce Confidence 9999999999999999999999999999886 46778888776654 4599999999975 5699999999999999 Q ss_pred EEEEecCCCcch-hHHHHHHHhhhcccccceEEEEecceecccccccceeehhhHHHHHHHHhcccc----ccccccccc Q lcl|Aclame:pro 156 RAVIDGPSGSTQ-DAIDLSGLLGGEGTGHDRVYMVDPMPAIYSRKAQGNIYVPPSTIAMGAVAAVKP----WESPGNQGV 230 (388) Q Consensus 156 ~~i~d~p~~~~~-~~~~~~~~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~s~~~aG~~a~~d~----~~s~~n~p~ 230 (388) ++++|.|.+.+. .+..+ ..+++|++.++||||++.+|+..+..+++|||+++||++|++|. |+||+|+++ T Consensus 154 ~aivD~p~~~t~~~a~~~-----~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~Agl~a~~D~~~g~~~spaN~~l 228 (390) T protein:vir:78 154 MAYVSASGCKTKEEAAAY-----RKQFGQREIMVIWPDWLGWDDTTNSTAVIPAPAIAAGLRAKIDNDIGWHKTISNVVV 228 (390) T ss_pred EEEEecCCCCCHHHHHHH-----hhccCCceEEEEcCceEeecccCCcccccchHHHHHHHHHHhhcCCCcEECcCCcee Confidence 999999976553 33333 34788999999999999999999999999999999999999995 566667766 Q ss_pred -cceeecccccccccCchhhhhhccccceEEEEEeCCCcEEEEccccCC----CceeeehhhHHHHHHHHHHHHHHHhcc Q lcl|Aclame:pro 231 -LIQDVARVIDYNILDKSTEGDLLNRNGVSYFARTSMGGFSLIGNRTVT----GKFISFVGLEDAIARKLEAASQRAMSK 305 (388) Q Consensus 231 -~~~g~~~~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~----~~~i~vrR~~~~i~~~i~~~~~~~vfe 305 (388) ++.+++..+.+...+..+|+++||++||+++++ ++||++||+||++ |+||++|||++||+++|++.++|+||| T Consensus 229 ~gi~~~~~~~~~~~~~~~~~~~~ln~~gi~t~~~--~~G~~~wG~rT~s~d~~~~~i~~rR~~~~i~~~i~~~~~~~v~e 306 (390) T protein:vir:78 229 NGVSGISADVSWDLQDPATDAGYLNEHEVTTLVN--RNGFRFWGERTCSDDPKFAFENYTRTAQVAGDSIAEAQMPVVDG 306 (390) T ss_pred eceeecceecccccccccchhhhhhhcCcEEEEc--CCCEEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhccC Confidence 477888889999999999999999999999976 6799999999985 999999999999999999999999999 Q ss_pred cCCHHHHHHHHHHHHHHHHHHHhcCCeeeeeEEEeccCCCHHHhhCCeEEEEEEEEecCcceeEEEEEEEcchHHHHHHH Q lcl|Aclame:pro 306 QLTKSFMEQEIKKINLFMQDLVAAEIIPGGEVYLHPTLNTVERYKNGSWYIVIDYGRYSPNEHMIFHLNAVDRIVEEFIE 385 (388) Q Consensus 306 pn~~~~~~~i~~~i~~~L~~l~~~Gal~g~~v~~d~~~Nt~~~i~~G~~~~~v~~~p~~pae~I~~~~~~~~~~~~~l~~ 385 (388) ||++.+|++|++++++||++||++|+|+||+|+||+++||+++|++|+|+++|+++|++|+|||+|+++++++|+++|++ T Consensus 307 ~n~~~~~~~i~~~i~~~L~~l~~~g~l~g~~v~~d~~~nt~~~i~~G~~~~~v~~~p~~pae~I~~~~~~~~~~~~~~~~ 386 (390) T protein:vir:78 307 PLNPSLARDIVESINGWFRQQVANGYLIGGSAWIDPEPNTADILASGKAYIDYDYTPVPPLENLVLRQRITDRFLADFPA 386 (390) T ss_pred CCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEccCCCCHHHhhCCeEEEEEEEEecCCcceEEEEEEEchHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hcC Q lcl|Aclame:pro 386 EVL 388 (388) Q Consensus 386 ~~~ 388 (388) +|. T Consensus 387 ~~~ 389 (390) T protein:vir:78 387 RVA 389 (390) T ss_pred Hhc Confidence 999 No 4 >protein:vir:79181 Length: 390 # NCBI annotation: gp30, phage tail sheath protein # Family: family:all:115 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111061;genbank:gi:134288772;genbank:GeneID:4960700 Probab=100.00 E-value=3.3e-100 Score=565.98 Aligned_cols=374 Identities=25% Similarity=0.378 Sum_probs=340.2 Q ss_pred CCCCCCcCCCeEEEEcCCCcccccccCcceeEEEeeccccccc-cccCcceeeccchhhhhhcccccccccchhhhhhhh Q lcl|Aclame:pro 1 MPVIDQFEHNGISIETHEPPPPMGPPGDNVVAWVVTAPDKHAD-VAFSVPFRVANTADAQYLDSTGNELGTGWHAASETL 79 (388) Q Consensus 1 M~~~t~~~hGV~~~e~~~~~~~i~~v~tav~g~vgta~~~~~~-~~~~~~v~v~s~~~~~~~~~~~~~~gtl~~a~~~~~ 79 (388) ||+ +|+||||++|++++++||.++++++++|+|+++++++. +++++++++++..+....+ +..++|..++..++ T Consensus 1 M~~--~~~~Gv~v~e~~~~~~~i~~~~tav~~~vg~a~dad~~~~p~n~pv~its~~~~~~~~---g~~~tL~~al~~~~ 75 (390) T protein:vir:79 1 MPQ--DYHHGVRVIEINEGGRPIRSVSTAVLGVVCTAADADASAFPLNTPVLLTNVVAALGKA---GKKGTLRRTLDAIG 75 (390) T ss_pred Ccc--ccCCCeEEEEcCCCcccccccCCceeEEEEecCCCCccccccccceEeecHHHHHHhc---CCCccchhhhhhhc Confidence 996 78999999999999999999999999999999998876 4689999999987766654 55789999999999 Q ss_pred ccccceEEEEecccccccccccccccccccchhhhhhhHhhhhhhhh----hhhheecccccchhHHHHHHHHHhhhCce Q lcl|Aclame:pro 80 KKTSVPQYFIVVPEGADDAATMANIIGGIDPTTGRRTGIAALTECTE----RPTLIGAPGFSQNKAVIDALASMAKRLKC 155 (388) Q Consensus 80 ~~~~~~~~vv~~~~~~~~~~~~~~~~~~~~~~tg~~tgl~a~~~~~~----~p~ll~ap~~~~~~~v~~~l~~~~~~~~~ 155 (388) .+++..++++++.++.+...+..+++++.+. ++..+|++++.+... .|.++++|++++ ++|+++|..+|+++++ T Consensus 76 ~~~~~~~~vv~v~~~~~~~~~~~~~ig~~~~-~~~~tgl~al~~~~~~~~~~p~il~ap~~~~-~~v~~~l~~~a~~~~~ 153 (390) T protein:vir:79 76 KQTKPLTVVVRVAEGKDADETTSNVIGTVTP-DGKYTGIKALLAAQGALGVKPRILAAPGLDT-QPVAAALAATAQSLRA 153 (390) T ss_pred ccccceEEEEeeccccccccccceeeecccc-cccchhhhhhhhhhhhhccccccccCCcccc-hHHHHHHHHhhhhcce Confidence 9999999999999998888888888888776 567889888877654 489999999875 5689999999999999 Q ss_pred EEEEecCCCcch-hHHHHHHHhhhcccccceEEEEecceecccccccceeehhhHHHHHHHHhccc----cccccccccc Q lcl|Aclame:pro 156 RAVIDGPSGSTQ-DAIDLSGLLGGEGTGHDRVYMVDPMPAIYSRKAQGNIYVPPSTIAMGAVAAVK----PWESPGNQGV 230 (388) Q Consensus 156 ~~i~d~p~~~~~-~~~~~~~~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~s~~~aG~~a~~d----~~~s~~n~p~ 230 (388) ++++|+|.+.+. .+.++ ..+++|+|+++||||++.+|+..+..+++|||+++||++|++| +|+||+|+++ T Consensus 154 ~ai~D~p~~~t~~~a~~~-----~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~Ag~~a~~D~~~g~~~spsN~~i 228 (390) T protein:vir:79 154 MAYVSASGCKTKEEAAAY-----RRQFGQREIMVIWPDWLGWDDTTNSTAVIPAPAIAAGLRAKIDNDIGWHKTISNVVV 228 (390) T ss_pred EEEEEccCCCCHHHHHHH-----hcCCCCceEEEEcCceeecccccCceeEeehHHHHHHHHHhhhccCCcEEccCCcee Confidence 999999977553 33333 3478899999999999999999999999999999999999999 4677777776 Q ss_pred -cceeecccccccccCchhhhhhccccceEEEEEeCCCcEEEEccccCC----CceeeehhhHHHHHHHHHHHHHHHhcc Q lcl|Aclame:pro 231 -LIQDVARVIDYNILDKSTEGDLLNRNGVSYFARTSMGGFSLIGNRTVT----GKFISFVGLEDAIARKLEAASQRAMSK 305 (388) Q Consensus 231 -~~~g~~~~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~----~~~i~vrR~~~~i~~~i~~~~~~~vfe 305 (388) ++.+++..+.+...+..+|+++||++||+++++ ++||++||+||++ |+||++|||++||+++|++.++|+||| T Consensus 229 ~gi~~~~~~~~~~~~~~~~~a~~Ln~~gi~t~~~--~~G~~~wG~rT~~~d~~~~~i~vrR~~~~i~~~i~~~~~~~v~e 306 (390) T protein:vir:79 229 NGVSGISADVSWDLQDPATDAGYLNEHEVTTLVN--RNGFRFWGERTCSDDPKFAFENYTRTAQVAADSIAEAQMPVVDG 306 (390) T ss_pred eccceeeeeccccccccchhhhhhhhcCcEEEEc--CCCEEEEeccccCCCcccceeeehhhHHHHHHHHHHHHHHhccC Confidence 577888888998999999999999999999965 6899999999985 999999999999999999999999999 Q ss_pred cCCHHHHHHHHHHHHHHHHHHHhcCCeeeeeEEEeccCCCHHHhhCCeEEEEEEEEecCcceeEEEEEEEcchHHHHHHH Q lcl|Aclame:pro 306 QLTKSFMEQEIKKINLFMQDLVAAEIIPGGEVYLHPTLNTVERYKNGSWYIVIDYGRYSPNEHMIFHLNAVDRIVEEFIE 385 (388) Q Consensus 306 pn~~~~~~~i~~~i~~~L~~l~~~Gal~g~~v~~d~~~Nt~~~i~~G~~~~~v~~~p~~pae~I~~~~~~~~~~~~~l~~ 385 (388) ||++.+|++|+++++.||++||++|+|+||+|+||+++||+++|++|+|+++|+++|++|+|||+|+++++++|+++|++ T Consensus 307 ~~~~~~~~~i~~~i~~~L~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~~~ 386 (390) T protein:vir:79 307 PLNPSLARDIVESINGWFRQQVANGYLIGGSAWIDPEPNTADILASGKAYIDYDYTPVPPLENLVLRQRITDRFLADFPA 386 (390) T ss_pred CCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHhhCCEEEEEEEEEecCCcceEEEEEEEchHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hcC Q lcl|Aclame:pro 386 EVL 388 (388) Q Consensus 386 ~~~ 388 (388) +|. T Consensus 387 ~v~ 389 (390) T protein:vir:79 387 RVA 389 (390) T ss_pred Hhc Confidence 999 No 5 >protein:vir:1845 Length: 392 # NCBI annotation: tail sheath protein FI # Family: family:all:115 # MgeID: mge:324 # MgeName: 186 # Cross-refs: genbank:acc:NP_052269;genbank:gi:9634076;genbank:GeneID:1262448 Probab=100.00 E-value=5.6e-100 Score=564.71 Aligned_cols=373 Identities=25% Similarity=0.360 Sum_probs=336.3 Q ss_pred CCCcCCCeEEEEcCCCcccccccCcceeEEEeecccccccc-ccCcceeeccchhhhhhcccccccccchhhhhhhhccc Q lcl|Aclame:pro 4 IDQFEHNGISIETHEPPPPMGPPGDNVVAWVVTAPDKHADV-AFSVPFRVANTADAQYLDSTGNELGTGWHAASETLKKT 82 (388) Q Consensus 4 ~t~~~hGV~~~e~~~~~~~i~~v~tav~g~vgta~~~~~~~-~~~~~v~v~s~~~~~~~~~~~~~~gtl~~a~~~~~~~~ 82 (388) |++|+||||++|+++|++|+.++++++++|+|+++++++.. ++++|++++++.++...+ +..+++..++..+++++ T Consensus 1 m~~~~~Gv~v~e~~~g~~~i~~~~tav~g~vgta~~~~~~~~~~~~p~~its~~~~~~~~---g~~gtl~~al~~~~~ng 77 (392) T protein:vir:18 1 MSDFHHGTKVIEINDGTRVISTVATAIVGMVWTASDADAETFPLNEPVLITNVQSAIAKA---GKKGTLSASLQAIADQS 77 (392) T ss_pred CCCCCCCeEEEEcCCCceeeeccCcceeEEEEeccCCCCcccccccceEeechHHHHhhc---CCCcchHHHHHHhhccc Confidence 78899999999999999999999999999999999988764 689999999988877654 44678999999999999 Q ss_pred cceEEEEeccc---ccccccccccccccccchhhhhhhHhhhhhhh----hhhhheecccccchhHHHHHHHHHhhhCce Q lcl|Aclame:pro 83 SVPQYFIVVPE---GADDAATMANIIGGIDPTTGRRTGIAALTECT----ERPTLIGAPGFSQNKAVIDALASMAKRLKC 155 (388) Q Consensus 83 ~~~~~vv~~~~---~~~~~~~~~~~~~~~~~~tg~~tgl~a~~~~~----~~p~ll~ap~~~~~~~v~~~l~~~~~~~~~ 155 (388) +..+++++... .+..+.+..+++|+.+. ++..++++++.+.. ..|.++++||+++ ++|+++|.++|+++++ T Consensus 78 g~~~~vv~v~~~~~~~~~~~t~~dliG~~~~-~~~~tg~~al~~~~~~~~~~p~il~ap~~~~-~~v~~~l~~~~~~~~~ 155 (392) T protein:vir:18 78 KPVTVVVRVAEGTGDDAEAQTTSNIIGGTDE-NGKYTGIKALLTAEAVTGVKPRILGVPGLDT-QEVATALASVCISLRA 155 (392) T ss_pred CceEEEecccccccccccccchhhheecccc-cchhhhHHHHHhhhhhhceeehhcccCccch-HHHHHHHHHHHhhcCc Confidence 99999887543 35667777888888764 67888888877665 3589999999975 6799999999999999 Q ss_pred EEEEecCCCcc-hhHHHHHHHhhhcccccceEEEEecceecccccccceeehhhHHHHHHHHhccc----cccccccccc Q lcl|Aclame:pro 156 RAVIDGPSGST-QDAIDLSGLLGGEGTGHDRVYMVDPMPAIYSRKAQGNIYVPPSTIAMGAVAAVK----PWESPGNQGV 230 (388) Q Consensus 156 ~~i~d~p~~~~-~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~s~~~aG~~a~~d----~~~s~~n~p~ 230 (388) ++++|+|.+.+ +.+..++ .+++|+++++||||++.+|+.++..+++|||+++||+++++| +|+||+|+++ T Consensus 156 ~~~~d~~~~~~~~~a~~~~-----~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~AG~~a~~d~~~g~~~spaN~~l 230 (392) T protein:vir:18 156 FGYVSAWGCKTISEAMAYR-----ENFSQRELMVIWPDFLAWDTTANATATAYATARALGLRAYIDQTIGWHKTLSNVGV 230 (392) T ss_pred EEEEecCCCCCHHHHHHHH-----hhccCceEEEEeCceeeecccCCceEEechHHHHHHHHHhhhccCCceEccCCcee Confidence 99999987755 3333333 367899999999999999999999999999999999999999 5677777776 Q ss_pred -cceeecccccccccCchhhhhhccccceEEEEEeCCCcEEEEccccCC----CceeeehhhHHHHHHHHHHHHHHHhcc Q lcl|Aclame:pro 231 -LIQDVARVIDYNILDKSTEGDLLNRNGVSYFARTSMGGFSLIGNRTVT----GKFISFVGLEDAIARKLEAASQRAMSK 305 (388) Q Consensus 231 -~~~g~~~~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~----~~~i~vrR~~~~i~~~i~~~~~~~vfe 305 (388) ++.+++..+.+...++.+|++.||++|||+|++ ++||++||+||++ |+||++||++++|+++|++.++|+||| T Consensus 231 ~gi~~~~~~~~~~~~~~~~~~~~Ln~~gI~t~~~--~~G~~~wG~rT~~~d~~~~~i~~rR~~~~i~~~i~~~~~~~v~e 308 (392) T protein:vir:18 231 QGVTGISASVFWDLQASGTDADLLNEAGVTTLVR--KDGFRFWGNRTCSDDPLFLFENYTRTAQVLADTMAEAHMWAVDK 308 (392) T ss_pred eceeecceecccccCCCcchhhhhhhcCceEEEc--CCCEEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhccC Confidence 578899999999999999999999999999964 6899999999986 999999999999999999999999999 Q ss_pred cCCHHHHHHHHHHHHHHHHHHHhcCCeeeeeEEEeccCCCHHHhhCCeEEEEEEEEecCcceeEEEEEEEcchHHHHHHH Q lcl|Aclame:pro 306 QLTKSFMEQEIKKINLFMQDLVAAEIIPGGEVYLHPTLNTVERYKNGSWYIVIDYGRYSPNEHMIFHLNAVDRIVEEFIE 385 (388) Q Consensus 306 pn~~~~~~~i~~~i~~~L~~l~~~Gal~g~~v~~d~~~Nt~~~i~~G~~~~~v~~~p~~pae~I~~~~~~~~~~~~~l~~ 385 (388) ||++.+|++|++++++||++||++|+|+||+|+||+++||+++|++|+|+++|+++|++|+|||+|+++++++|+++|++ T Consensus 309 ~n~~~~~~~i~~~i~~~L~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~~v~~~p~~p~e~I~~~~~~~~~~~~~~~~ 388 (392) T protein:vir:18 309 PITASLIRDIVDGINAKFRELKSNGYIVDGECWFDEESNDKETLKAGKLYIDYDYTPVPPLESLTLRQRITDKYLVNLAE 388 (392) T ss_pred CCCHHHHHHHHHHHHHHHHHHHhcCcccceEEEEecCCCCHHHhhCCeEEEEEEEEecCCcceEEEEEEEchHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hcC Q lcl|Aclame:pro 386 EVL 388 (388) Q Consensus 386 ~~~ 388 (388) +|. T Consensus 389 ~~~ 391 (392) T protein:vir:18 389 SVN 391 (392) T ss_pred Hhc Confidence 999 No 6 >protein:vir:98553 Length: 395 # NCBI annotation: gp21 # Family: family:all:115 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958076;genbank:gi:41057373;genbank:GeneID:2744224 Probab=100.00 E-value=6.5e-100 Score=564.35 Aligned_cols=373 Identities=25% Similarity=0.365 Sum_probs=331.2 Q ss_pred CCCcCCCeEEEEcCCCcccccccCcceeEEEeecccccccc-ccCcceeeccchhhhhhcccccccccchhhhhhhhccc Q lcl|Aclame:pro 4 IDQFEHNGISIETHEPPPPMGPPGDNVVAWVVTAPDKHADV-AFSVPFRVANTADAQYLDSTGNELGTGWHAASETLKKT 82 (388) Q Consensus 4 ~t~~~hGV~~~e~~~~~~~i~~v~tav~g~vgta~~~~~~~-~~~~~v~v~s~~~~~~~~~~~~~~gtl~~a~~~~~~~~ 82 (388) |++|+||||++|++++++++.+++|++++|+|+++++++.. ++++|+++++..+....+ +..+++..++..+++++ T Consensus 1 m~~~~~GV~v~e~~~g~~~v~~v~tav~~~vgta~~~~~~~~p~~~pv~v~s~~~~~~~~---g~~~tl~~al~~~~~~~ 77 (395) T protein:vir:98 1 MSDFHHGTQVIEINDGTRVISTVATAVVGMVCTASDADATLFPLNEPVLITNVQSAIAKA---GKKGTLAASLQAIADQS 77 (395) T ss_pred CCCCCCCeEEEEcCCCcccccccCcceEEEEeeccCCCccccccccceEeechHHhHhhc---ccccchhhHHHHHhhcc Confidence 77899999999999999999999999999999999987664 689999999988776654 45689999999999999 Q ss_pred cceEEEEeccccccc------ccccccccccccchhhhhhhHhhhhhhh----hhhhheecccccchhHHHHHHHHHhhh Q lcl|Aclame:pro 83 SVPQYFIVVPEGADD------AATMANIIGGIDPTTGRRTGIAALTECT----ERPTLIGAPGFSQNKAVIDALASMAKR 152 (388) Q Consensus 83 ~~~~~vv~~~~~~~~------~~~~~~~~~~~~~~tg~~tgl~a~~~~~----~~p~ll~ap~~~~~~~v~~~l~~~~~~ 152 (388) +..+++++...+... +.+...+.++.+ .++.++|++++.+.. ..|.++++||+++ ++++++|.++|++ T Consensus 78 ~~~~~vv~~~~~~~~~~~~~~a~~~~~i~g~~~-~~~~~Tgl~al~~~~~~~~~~p~il~ap~~~~-~~v~~al~~~~~~ 155 (395) T protein:vir:98 78 KPVTVVVRVEDGTGDDEEAALAQTVSNIIGGTD-ENGKYTGIKALLTAQAVTGVKPRILGVPGLDT-KEVAVALASAAIK 155 (395) T ss_pred CceEEEeeccccccccccccccccccccccccc-cccchhHHHHHhhhhhhhccchhhcccccccc-cHHHHHHHHHhhh Confidence 999999887544322 233344555543 467889998887765 4699999999965 5689999999999 Q ss_pred CceEEEEecCCCcc-hhHHHHHHHhhhcccccceEEEEecceecccccccceeehhhHHHHHHHHhccc----ccccccc Q lcl|Aclame:pro 153 LKCRAVIDGPSGST-QDAIDLSGLLGGEGTGHDRVYMVDPMPAIYSRKAQGNIYVPPSTIAMGAVAAVK----PWESPGN 227 (388) Q Consensus 153 ~~~~~i~d~p~~~~-~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~s~~~aG~~a~~d----~~~s~~n 227 (388) +++++++|+|.+.+ +.+..++ .+++|+|+++||||++++|+.++..+++|||+++||+++++| +|+||+| T Consensus 156 ~~~~~~~d~p~~~t~~~a~~~~-----~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~AG~~a~~d~~~g~~~spaN 230 (395) T protein:vir:98 156 LRAFAYVSAWGCKTISEAMEYR-----KNFSQRELMVIWPDFLAWDTVKNTTATAYATARALGLRAYIDQTVGWHKTLSN 230 (395) T ss_pred cCcEEEEEcCCCCCHHHHHHHH-----hccCCceEEEEecceeEecccCCceeeechHHHHHHHHHHhhcccCcEeccCC Confidence 99999999998755 3333333 367899999999999999999999999999999999999999 4666677 Q ss_pred ccc-cceeecccccccccCchhhhhhccccceEEEEEeCCCcEEEEccccCC----CceeeehhhHHHHHHHHHHHHHHH Q lcl|Aclame:pro 228 QGV-LIQDVARVIDYNILDKSTEGDLLNRNGVSYFARTSMGGFSLIGNRTVT----GKFISFVGLEDAIARKLEAASQRA 302 (388) Q Consensus 228 ~p~-~~~g~~~~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~----~~~i~vrR~~~~i~~~i~~~~~~~ 302 (388) +++ ++.|++..+.+...++.+|++.||++|||++++ ++||++||+||++ |+||++||++++|+++|++.++|+ T Consensus 231 ~~i~gi~~~~~~~~~~~~~~~~~~~~Ln~~gI~~~~~--~~G~~~wG~rT~s~d~~~~~i~~rR~~~~i~~~i~~~~~~~ 308 (395) T protein:vir:98 231 VGVQGVTGISASVFWDLQASGTDADLLNEAGVTTLVR--KDGFRFWGNRTCSDDPLFLFENYTRTAQVLADTMAEAHMWA 308 (395) T ss_pred ceeecccccceecccccCCCcchHHhhhhcCcEEEEc--CCCEEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHh Confidence 765 577888889999999999999999999999965 6899999999986 999999999999999999999999 Q ss_pred hcccCCHHHHHHHHHHHHHHHHHHHhcCCeeeeeEEEeccCCCHHHhhCCeEEEEEEEEecCcceeEEEEEEEcchHHHH Q lcl|Aclame:pro 303 MSKQLTKSFMEQEIKKINLFMQDLVAAEIIPGGEVYLHPTLNTVERYKNGSWYIVIDYGRYSPNEHMIFHLNAVDRIVEE 382 (388) Q Consensus 303 vfepn~~~~~~~i~~~i~~~L~~l~~~Gal~g~~v~~d~~~Nt~~~i~~G~~~~~v~~~p~~pae~I~~~~~~~~~~~~~ 382 (388) +||||++.+|++|+++++.||++||++|+|+||+|+||+++||+++|++|+|+++|+++|++|+|||+|+++++++|+++ T Consensus 309 v~e~~~~~~~~~i~~~i~~~L~~l~~~g~l~g~~v~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~I~~~~~~~~~~~~~ 388 (395) T protein:vir:98 309 VDKPITATLIRDIVDGINAKFRELKSNGYIVEGKCWFDEESNDKETLKAGKLYIDYDYTPVPPLESLTLRQRITDKYLVN 388 (395) T ss_pred ccCCCCHHHHHHHHHHHHHHHHHHHhCCceeceEEEEecCCCCHHHhhCCeEEEEEEEEecCCcceEEEEEEEchHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHhcC Q lcl|Aclame:pro 383 FIEEVL 388 (388) Q Consensus 383 l~~~~~ 388 (388) |+++|. T Consensus 389 ~~~~~~ 394 (395) T protein:vir:98 389 LAESVN 394 (395) T ss_pred HHHHhc Confidence 999999 No 7 >protein:vir:79141 Length: 391 # NCBI annotation: major tail seath protein gpFI # Family: family:all:115 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165276;genbank:gi:145708101;genbank:GeneID:5247152 Probab=100.00 E-value=2.3e-99 Score=561.36 Aligned_cols=374 Identities=22% Similarity=0.331 Sum_probs=339.6 Q ss_pred CCCCCCcCCCeEEEEcCCCcccccccCcceeEEEeeccccccc-cccCcceeeccchhhhhhcccccccccchhhhhhhh Q lcl|Aclame:pro 1 MPVIDQFEHNGISIETHEPPPPMGPPGDNVVAWVVTAPDKHAD-VAFSVPFRVANTADAQYLDSTGNELGTGWHAASETL 79 (388) Q Consensus 1 M~~~t~~~hGV~~~e~~~~~~~i~~v~tav~g~vgta~~~~~~-~~~~~~v~v~s~~~~~~~~~~~~~~gtl~~a~~~~~ 79 (388) ||+ +|+||||++|++++++|+.++++++++|+|+++++++. .++++|+++++..++...+ +..+++..++..++ T Consensus 1 M~~--~~~pGv~v~e~~~~~~~i~~~~tav~~~vg~a~~a~~~~~p~n~pv~iss~~~~~~~~---g~~gtl~~al~~~~ 75 (391) T protein:vir:79 1 MPT--DYHHGVRVVELNDGTRPIRTIETAVAGIVCTADDADAATFPLDTPVLLTNPQAYIGKA---GDKGTLAHTLDAIT 75 (391) T ss_pred CCC--CCCCCeEEEECCCCcccccccCCceEEEEeecccccccccccccCEEeccHHHHHHhc---CCccccchhhhhhh Confidence 995 78999999999999999999999999999999998876 4579999999988776544 45689999999999 Q ss_pred ccccceEEEEecccccccccccccccccccchhhhhhhHhhhhhhhh----hhhheecccccchhHHHHHHHHHhhhCce Q lcl|Aclame:pro 80 KKTSVPQYFIVVPEGADDAATMANIIGGIDPTTGRRTGIAALTECTE----RPTLIGAPGFSQNKAVIDALASMAKRLKC 155 (388) Q Consensus 80 ~~~~~~~~vv~~~~~~~~~~~~~~~~~~~~~~tg~~tgl~a~~~~~~----~p~ll~ap~~~~~~~v~~~l~~~~~~~~~ 155 (388) ++++..++++++........+..++.++.+. ++..+|++++.+... .|.++++|+++ ..+++++|.++|+++++ T Consensus 76 ~~gg~~~~vv~~~~~~~~~~~~~~~~g~~~~-~~~~tGl~~l~~~~~~~~~~p~~l~~p~~~-~~~v~~al~~~~~~~~~ 153 (391) T protein:vir:79 76 DQTNPLTVVVRVAGGASEAETTSNLIGTTNA-AGRYTGMKALLTARNRFGVAPRILAVPGLD-SLPVGTELVTIAQKLRA 153 (391) T ss_pred cccccceeeeccccccccccccccccccccc-hhhhHHHhhhhhhhhhhcccchhhcCCccc-hhHHHHHHHHHHhhcCc Confidence 9999999999999998888888888888886 688899888877654 48889999986 56699999999999999 Q ss_pred EEEEecCCCcc-hhHHHHHHHhhhcccccceEEEEecceecccccccceeehhhHHHHHHHHhccc----cccccccccc Q lcl|Aclame:pro 156 RAVIDGPSGST-QDAIDLSGLLGGEGTGHDRVYMVDPMPAIYSRKAQGNIYVPPSTIAMGAVAAVK----PWESPGNQGV 230 (388) Q Consensus 156 ~~i~d~p~~~~-~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~s~~~aG~~a~~d----~~~s~~n~p~ 230 (388) ++++|+|.+.+ ..+..+ ..+++|+|+++||||++.+|+.++..+++|||+++||++|++| +|+||+|+++ T Consensus 154 ~ai~d~p~~~t~~~a~~~-----~~~~~s~~~a~~~P~~~~~d~~~~~~~~~p~s~~~AG~~a~~D~~~g~~~spaN~~l 228 (391) T protein:vir:79 154 FAYLSAYGCQTKEEAVAY-----RSNFGQREAMVMWPDFVGWDTAANAETTLWATARAVGLRAKIDNDTGWHKTLSNVAV 228 (391) T ss_pred EEEEECCCCCCHHHHHHH-----HhccCCceeEEecceeeeecCcCCceeeechHHHHHHHHHHhhhcccceeccCCcee Confidence 99999997654 333333 3467899999999999999999999999999999999999999 4666667765 Q ss_pred -cceeecccccccccCchhhhhhccccceEEEEEeCCCcEEEEccccCC----CceeeehhhHHHHHHHHHHHHHHHhcc Q lcl|Aclame:pro 231 -LIQDVARVIDYNILDKSTEGDLLNRNGVSYFARTSMGGFSLIGNRTVT----GKFISFVGLEDAIARKLEAASQRAMSK 305 (388) Q Consensus 231 -~~~g~~~~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~----~~~i~vrR~~~~i~~~i~~~~~~~vfe 305 (388) ++.|+++.+.+...+..+|++.||++||+++++ ++||++||+||++ |+||++|||+++|+++|+++++|+||| T Consensus 229 ~gi~~~~~~~~~~~~~~~~~~~~Ln~~~I~t~~~--~~G~~~wG~rT~~~d~~~~~i~~rR~~~~i~~~i~~~~~~~v~e 306 (391) T protein:vir:79 229 GGVTGLSRDVFWDLQDPATDAGYLNANEVTTLVH--RDGYRFWGSRTCSADPLFAFENYTRTAQVLADTMAEAHMWANDL 306 (391) T ss_pred hhhhccccccccccccccchhhhhhhcCceEEEC--CCcEEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhccC Confidence 577888889999999999999999999999965 6899999999985 999999999999999999999999999 Q ss_pred cCCHHHHHHHHHHHHHHHHHHHhcCCeeeeeEEEeccCCCHHHhhCCeEEEEEEEEecCcceeEEEEEEEcchHHHHHHH Q lcl|Aclame:pro 306 QLTKSFMEQEIKKINLFMQDLVAAEIIPGGEVYLHPTLNTVERYKNGSWYIVIDYGRYSPNEHMIFHLNAVDRIVEEFIE 385 (388) Q Consensus 306 pn~~~~~~~i~~~i~~~L~~l~~~Gal~g~~v~~d~~~Nt~~~i~~G~~~~~v~~~p~~pae~I~~~~~~~~~~~~~l~~ 385 (388) ||++.+|++|++++++||++||++|+|+||+|+||+++||+++|++|+|+++|+++|++|+|||+|+++++++|+++|++ T Consensus 307 pn~~~~~~~i~~~i~~~l~~l~~~g~l~g~~v~~~~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~~~ 386 (391) T protein:vir:79 307 PMTPTLVRDLLEGINAKLRMLTRNGYLLGGAAWFDADANSKDTLKAGQLAIDYDYTPVPPLENLTFRQRITDRYLMQFAE 386 (391) T ss_pred CCCHHHHHHHHHHHHHHHHHHHhCCceeceEEEEecCCCCHHHhhCCEEEEEEEEEecCCcceEEEEEEEchHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hcC Q lcl|Aclame:pro 386 EVL 388 (388) Q Consensus 386 ~~~ 388 (388) +|. T Consensus 387 ~v~ 389 (391) T protein:vir:79 387 AVK 389 (391) T ss_pred Hhh Confidence 999 No 8 >protein:vir:1172 Length: 391 # NCBI annotation: hypothetical protein # Family: family:all:115 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490621;genbank:gi:17313241;genbank:GeneID:927300 Probab=100.00 E-value=2.3e-99 Score=561.32 Aligned_cols=375 Identities=23% Similarity=0.358 Sum_probs=338.9 Q ss_pred CCCCCCcCCCeEEEEcCCCcccccccCcceeEEEeecccccccc-ccCcceeeccchhhhhhcccccccccchhhhhhhh Q lcl|Aclame:pro 1 MPVIDQFEHNGISIETHEPPPPMGPPGDNVVAWVVTAPDKHADV-AFSVPFRVANTADAQYLDSTGNELGTGWHAASETL 79 (388) Q Consensus 1 M~~~t~~~hGV~~~e~~~~~~~i~~v~tav~g~vgta~~~~~~~-~~~~~v~v~s~~~~~~~~~~~~~~gtl~~a~~~~~ 79 (388) |++ ++|+||||++|++++++++..+++++++|+|+++++++.. ++++++++++..++... .+..+++..++..++ T Consensus 1 M~~-~~~~~GV~v~e~~~~~~~i~~v~tavig~vg~a~~a~~~~~~~~~p~~v~s~~~~~~~---~g~~~tl~~al~~~~ 76 (391) T protein:vir:11 1 MAA-DQYHHGVRVQEINDGTRPIRTIATAIIGLVATAEDADATAFPLDTPVLITNVQAAIGK---AGTSGTLPASLQAIA 76 (391) T ss_pred CCC-CcCCCcEEEEECCCCcceecccCCceeEEEEecCCCCCccccccccEEEecchhhhee---cCCCccchhhhhhhh Confidence 776 4789999999999999999999999999999999987764 68899999988776554 456789999999999 Q ss_pred ccccceEEEEecccccccccccccccccccchhhhhhhHhhhhhhh----hhhhheecccccchhHHHHHHHHHhhhCce Q lcl|Aclame:pro 80 KKTSVPQYFIVVPEGADDAATMANIIGGIDPTTGRRTGIAALTECT----ERPTLIGAPGFSQNKAVIDALASMAKRLKC 155 (388) Q Consensus 80 ~~~~~~~~vv~~~~~~~~~~~~~~~~~~~~~~tg~~tgl~a~~~~~----~~p~ll~ap~~~~~~~v~~~l~~~~~~~~~ 155 (388) ++++..++++++.++++.+.+..+++++.+. ++..++++++.+.. ..|.++.+|++++ ++++++|.++|+++++ T Consensus 77 ~~~g~~~~vv~~~~~~~~~~t~~d~~g~~~a-~~~~~g~~a~~~~~~~~~~~p~~~~ap~~~~-~~v~~al~~~~~~~~~ 154 (391) T protein:vir:11 77 DQANAATVVVRVKPGEDEAATNSAVIGGVSA-DGKYTGMKALLAAKARLGVVPRILGVPGLDT-QPVATALIAIAQQLRA 154 (391) T ss_pred ccccceeEEeeecccccccccchhhhccccc-ccchhhhhhhhhhhhhheecccccccccccc-HHHHHHHHHhhcccce Confidence 9999999999999999999999999998885 55667777666544 4689999999865 5699999999999999 Q ss_pred EEEEecCCCcc-hhHHHHHHHhhhcccccceEEEEecceecccccccceeehhhHHHHHHHHhccc----cccccccccc Q lcl|Aclame:pro 156 RAVIDGPSGST-QDAIDLSGLLGGEGTGHDRVYMVDPMPAIYSRKAQGNIYVPPSTIAMGAVAAVK----PWESPGNQGV 230 (388) Q Consensus 156 ~~i~d~p~~~~-~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~s~~~aG~~a~~d----~~~s~~n~p~ 230 (388) ++++|.|.+.+ ..+..+ +.+++|+|+++||||++.+|+.++..+++|||+++||+++|+| +|+||+|+++ T Consensus 155 ~~i~D~p~~~t~~~a~~~-----r~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~s~~~ag~~a~~d~~~g~~~span~~l 229 (391) T protein:vir:11 155 FAYVSASGCKTKEEATAY-----RENFAAREAMVIWPDFLTWSTVVNQTVPAPAVAQALGLRARIDQEVGWHKTLSNVAV 229 (391) T ss_pred EEEEEcCCCCCHHHHHHH-----hhhcCCceEEEEcCcceecccccCceEEechHHHHHHHHHHhhccCCcEEccCCcee Confidence 99999998754 333333 2478999999999999999999999999999999999999999 5677777776 Q ss_pred -cceeecccccccccCchhhhhhccccceEEEEEeCCCcEEEEccccCC----CceeeehhhHHHHHHHHHHHHHHHhcc Q lcl|Aclame:pro 231 -LIQDVARVIDYNILDKSTEGDLLNRNGVSYFARTSMGGFSLIGNRTVT----GKFISFVGLEDAIARKLEAASQRAMSK 305 (388) Q Consensus 231 -~~~g~~~~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~----~~~i~vrR~~~~i~~~i~~~~~~~vfe 305 (388) ++.+++..+.++..++++|++.||++||+++++ ++||++||+||++ |+||++||+|+||+++|++.++|+||| T Consensus 230 ~gi~~~~~~~~~~~~~~~~~~~~Ln~~gi~~~~~--~~G~~~wG~rT~~~d~~~~~i~vrR~~~~i~~~~~~~~~~~v~e 307 (391) T protein:vir:11 230 NGVTGISADVFWDLQSPSTDANYLNENEVTTLVQ--EGGFRFWGSRTCSDDPLFAFENYTRTAQVLADTIAEAHMWAVDK 307 (391) T ss_pred eceeecccccccccCCCcchhhhhhhcCcEEEEc--CCCEEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhccC Confidence 577888889999999999999999999999854 7899999999986 999999999999999999999999999 Q ss_pred cCCHHHHHHHHHHHHHHHHHHHhcCCeeeeeEEEeccCCCHHHhhCCeEEEEEEEEecCcceeEEEEEEEcchHHHHHHH Q lcl|Aclame:pro 306 QLTKSFMEQEIKKINLFMQDLVAAEIIPGGEVYLHPTLNTVERYKNGSWYIVIDYGRYSPNEHMIFHLNAVDRIVEEFIE 385 (388) Q Consensus 306 pn~~~~~~~i~~~i~~~L~~l~~~Gal~g~~v~~d~~~Nt~~~i~~G~~~~~v~~~p~~pae~I~~~~~~~~~~~~~l~~ 385 (388) ||++.+|++|+++++.||++||++|+|+||+|+||+++||+++|++|+|+++|+++|++|+|||+++++++++|+++|++ T Consensus 308 ~n~~~~~~~i~~~i~~~l~~l~~~g~l~g~~~~~~~~~n~~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~~~ 387 (391) T protein:vir:11 308 PMHPSLVRDILEGVNAKFRELKGLGLIIDAQAWYDPNVNDKDTLKAGKLRITYDYTPVPPLEDLTFFQKITDSYLVDFAS 387 (391) T ss_pred CCCHHHHHHHHHHHHHHHHHHHhccceeceEEEEecCCCCHHHhhCCeEEEEEEEEecCCcceEEEEEEEchHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hcC Q lcl|Aclame:pro 386 EVL 388 (388) Q Consensus 386 ~~~ 388 (388) +|+ T Consensus 388 ~~~ 390 (391) T protein:vir:11 388 RVN 390 (391) T ss_pred Hhc Confidence 999 No 9 >protein:vir:100323 Length: 393 # NCBI annotation: tail sheath protein FI # Family: family:all:115 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655492;genbank:gi:109289960;genbank:GeneID:4157366 Probab=100.00 E-value=3.6e-99 Score=560.32 Aligned_cols=374 Identities=21% Similarity=0.278 Sum_probs=334.8 Q ss_pred CCCCCCcCCCeEEEEcCCCcccccccCcceeEEEeecccccccc-ccCcceeeccchhhhhhcccccccccchhhhhhhh Q lcl|Aclame:pro 1 MPVIDQFEHNGISIETHEPPPPMGPPGDNVVAWVVTAPDKHADV-AFSVPFRVANTADAQYLDSTGNELGTGWHAASETL 79 (388) Q Consensus 1 M~~~t~~~hGV~~~e~~~~~~~i~~v~tav~g~vgta~~~~~~~-~~~~~v~v~s~~~~~~~~~~~~~~gtl~~a~~~~~ 79 (388) ||.+++|+||||++|++++++++.+++|++++|+|+++++++.. ++++|++++++.++...+ +..+++..++..++ T Consensus 1 m~m~~~~~~GV~v~e~~~g~~~i~~~~tav~~~vgta~~~~~~~~pln~pv~i~s~~~~~~~~---g~~g~L~~al~~~~ 77 (393) T protein:vir:10 1 MSILDTYLHGVEVVEVNAGGVTISTAATSVIGVVCTGDQADAETFPLNTPVLITNPLNYLEKA---GSTGTLRRTLNSIG 77 (393) T ss_pred CCCCCccCCCeEEEEcCCCcceecccCcceeEEEeeccCcCcccccCccceEecchHHHHHhh---CCccchhhhhhhhh Confidence 99999999999999999999999999999999999999988764 689999999988877654 45689999999999 Q ss_pred ccccceEEEEecccccccccccccccccccchhhhhhhHhhhhhhh----hhhhheecccccchhHHHHHHHHHhhhCce Q lcl|Aclame:pro 80 KKTSVPQYFIVVPEGADDAATMANIIGGIDPTTGRRTGIAALTECT----ERPTLIGAPGFSQNKAVIDALASMAKRLKC 155 (388) Q Consensus 80 ~~~~~~~~vv~~~~~~~~~~~~~~~~~~~~~~tg~~tgl~a~~~~~----~~p~ll~ap~~~~~~~v~~~l~~~~~~~~~ 155 (388) ++.+..++++++.+++..+.+..+++++.+ ++.++|++++.+.. ..|.++++||+++ ++++++|.++|+++++ T Consensus 78 ~~~~~~~~vv~v~~~~~~~~t~~~iig~~~--~~~~tgl~al~~~~~~~~~~p~li~apg~~~-~~~~~al~~~~~~~~~ 154 (393) T protein:vir:10 78 SIVKTPTVIVRVAESDDSDTLTANIVGTQE--NGKFTGIKALLTAQSTVFVKPKLLCVPQHDN-QAVATELLSVAKKLNA 154 (393) T ss_pred cccCceEEEeecccCccccccccccccccc--cchhhHHHHHHhhhhhcceeeeeeeeccccc-hHHHHHHHHHhhccCc Confidence 999999999999999999999999988665 35678888877654 4689999999975 4577889999999988 Q ss_pred EEEEecCCCcc-hhHHHHHHHhhhcccccceEEEEecceecccccccceeehhhHHHHHHHHhcccc----ccccccccc Q lcl|Aclame:pro 156 RAVIDGPSGST-QDAIDLSGLLGGEGTGHDRVYMVDPMPAIYSRKAQGNIYVPPSTIAMGAVAAVKP----WESPGNQGV 230 (388) Q Consensus 156 ~~i~d~p~~~~-~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~s~~~aG~~a~~d~----~~s~~n~p~ 230 (388) +++++.|.+.+ +++..++ .+++|.+.++||||++.+++..+..+++|||+++||++|++|. |+||+|+++ T Consensus 155 ~~~v~d~~~~t~~~ai~~~-----~~~~s~~~~~~~P~~~~~d~~~~~~~~~p~s~~~Ag~~a~~d~~~G~~~spaN~~l 229 (393) T protein:vir:10 155 FAFISDNGATTKEQAYTYR-----QNFSQREGMMIFGDWKSYNTDKKAYDTDYAVARACALQAYIDKTVGWHKNISNVEL 229 (393) T ss_pred EEEEEcCCCCCHHHHHHHh-----hhcCCceEEEEecccccccccCCceeEeehhHHHHHHHHHhhcCCCcEEccCCcee Confidence 87776554443 4444333 3678999999999999999999999999999999999999995 566667765 Q ss_pred -cceeecccccccccCchhhhhhccccceEEEEEeCCCcEEEEccccCC----CceeeehhhHHHHHHHHHHHHHHHhcc Q lcl|Aclame:pro 231 -LIQDVARVIDYNILDKSTEGDLLNRNGVSYFARTSMGGFSLIGNRTVT----GKFISFVGLEDAIARKLEAASQRAMSK 305 (388) Q Consensus 231 -~~~g~~~~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~----~~~i~vrR~~~~i~~~i~~~~~~~vfe 305 (388) ++.|++..+.+.+.++++|+++||++|||+|++ ++||++||+||++ |+||++|||+++|+++|++.++|+||| T Consensus 230 ~gi~~~~~~~~~~~~~~~~~~~~Ln~~gI~t~~~--~~G~~~wG~rT~s~d~~~~~i~vrR~~~~i~~~i~~~~~~~v~e 307 (393) T protein:vir:10 230 DGVTGITKAVEFDINESSTEANYLNEKGITICLN--HNGFRYWGSRTLATDTRWAFQQSVRTAQIIKETIGAGLAWAVDM 307 (393) T ss_pred eceeecceecccccCCCcchhHhHhhcCceEEEc--CCCEEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhccC Confidence 578899999999999999999999999999965 6899999999985 999999999999999999999999999 Q ss_pred cCCHHHHHHHHHHHHHHHHHHHhcC--CeeeeeEEEeccCCCHHHhhCCeEEEEEEEEecCcceeEEEEEEEcchHHHHH Q lcl|Aclame:pro 306 QLTKSFMEQEIKKINLFMQDLVAAE--IIPGGEVYLHPTLNTVERYKNGSWYIVIDYGRYSPNEHMIFHLNAVDRIVEEF 383 (388) Q Consensus 306 pn~~~~~~~i~~~i~~~L~~l~~~G--al~g~~v~~d~~~Nt~~~i~~G~~~~~v~~~p~~pae~I~~~~~~~~~~~~~l 383 (388) ||++.+|++|+++++.||++||++| +|.|++|+||++ ||++||++|+|+++|+++|++|+|||+|+++++++|+++| T Consensus 308 ~~~~~~~~~i~~~i~~~L~~l~~~g~~al~g~~v~~~~~-nt~~~i~~G~~~~~i~~~p~~p~e~I~~~~~~~~~~~~~l 386 (393) T protein:vir:10 308 PLTPLRVKTMLEAINNKLRSWASGDDPRILGARVWVAEE-ITADIIKSGKFVIKYDYHWIPSLESLGLEQRVNDEYVVDL 386 (393) T ss_pred CCCHHHHHHHHHHHHHHHHHHHhccccccccceEEecCC-CCHHHhhCCEEEEEEEEEecCCcceEEEEEEEchHHHHHH Confidence 9999999999999999999999865 899999999976 8889999999999999999999999999999999999999 Q ss_pred HHhcC Q lcl|Aclame:pro 384 IEEVL 388 (388) Q Consensus 384 ~~~~~ 388 (388) +++|+ T Consensus 387 ~~~v~ 391 (393) T protein:vir:10 387 VNTLK 391 (393) T ss_pred HHHHh Confidence 99999 No 10 >protein:vir:5711 Length: 396 # NCBI annotation: gpFI # Family: family:all:115 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839870;genbank:gi:30065725;genbank:GeneID:1260618 Probab=100.00 E-value=7.5e-99 Score=558.53 Aligned_cols=373 Identities=23% Similarity=0.342 Sum_probs=332.7 Q ss_pred CCCcCCCeEEEEcCCCcccccccCcceeEEEeeccccccc-cccCcceeeccchhhhhhcccccccccchhhhhhhhccc Q lcl|Aclame:pro 4 IDQFEHNGISIETHEPPPPMGPPGDNVVAWVVTAPDKHAD-VAFSVPFRVANTADAQYLDSTGNELGTGWHAASETLKKT 82 (388) Q Consensus 4 ~t~~~hGV~~~e~~~~~~~i~~v~tav~g~vgta~~~~~~-~~~~~~v~v~s~~~~~~~~~~~~~~gtl~~a~~~~~~~~ 82 (388) |++|+||||++|++++++++.++++++++|+|+++++++. +++++++++++..++...+ +..+++..++..+++++ T Consensus 1 m~~~~~GV~v~e~~~g~~~i~~v~tav~~~vg~a~~~d~~~~~~~~pv~i~s~~~~~~~~---g~~~tl~~al~~~~~~~ 77 (396) T protein:vir:57 1 MSDYHHGVQVLEINDGTRVISTVSTAIVGMVCTASDADAETFPLNKPVLITNVQSAIAKA---GKKGTLAASLQAIADQS 77 (396) T ss_pred CCCCCCceEEEEcCCCcccccccCCceEEEEEeccCCCcccccCccCeEeecchhhhhhc---ccccchHHHHHHhhhcC Confidence 7889999999999999999999999999999999998875 4688999999887766543 55689999999999999 Q ss_pred cceEEEEeccccc------ccccccccccccccchhhhhhhHhhhhhhhh----hhhheecccccchhHHHHHHHHHhhh Q lcl|Aclame:pro 83 SVPQYFIVVPEGA------DDAATMANIIGGIDPTTGRRTGIAALTECTE----RPTLIGAPGFSQNKAVIDALASMAKR 152 (388) Q Consensus 83 ~~~~~vv~~~~~~------~~~~~~~~~~~~~~~~tg~~tgl~a~~~~~~----~p~ll~ap~~~~~~~v~~~l~~~~~~ 152 (388) +..+++++...+. ..+.+.++++|+.+ .++..+|++++.++.. .|.++++|++++ +.|+++|.++|++ T Consensus 78 ~~~~~vv~~~~~~~~~~~~~~a~t~~~iiG~~~-~~~~~tgl~al~~~~~~~~~~p~i~~ap~~~~-~~v~~al~~~~~~ 155 (396) T protein:vir:57 78 KPVTVVVRVEDGTGDDEETKLAQTVSNIIGTTD-ENGQYTGLKALMGAESVTGVKPRILGVPGLDT-KEVAVALASVCQE 155 (396) T ss_pred CceeEeeeccccccccccccccccceeeeeecc-ccccchhhhhhhhcccceeEEeccccCcccch-hHHHHHHHHHhhh Confidence 9999988765432 33445566777655 4678889988887664 489999999865 6799999999999 Q ss_pred CceEEEEecCCCcc-hhHHHHHHHhhhcccccceEEEEecceecccccccceeehhhHHHHHHHHhccc----ccccccc Q lcl|Aclame:pro 153 LKCRAVIDGPSGST-QDAIDLSGLLGGEGTGHDRVYMVDPMPAIYSRKAQGNIYVPPSTIAMGAVAAVK----PWESPGN 227 (388) Q Consensus 153 ~~~~~i~d~p~~~~-~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~s~~~aG~~a~~d----~~~s~~n 227 (388) +++++++|+|.+.+ +.+.+++ .+++|.|+++||||++.+|+.++..+++|||+++||++||+| +|+||+| T Consensus 156 ~~~~~~~d~p~~~~~~~~~~~~-----~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~Ag~~a~~d~~~g~~~spaN 230 (396) T protein:vir:57 156 LNAFGYISAWGCKTISEVKAYR-----QNFSQRELMVIWPDFLAWDTVTSTTATAYATARALGLRAKIDQEQGWHKTLSN 230 (396) T ss_pred CceEEEEcCCCCCCHHHHHHHH-----hccCCceEEEEcceeeeecccCCceeEEehhHHHHHHHHHhhhccCcEeccCC Confidence 99999999998755 3333333 367899999999999999999999999999999999999999 4667777 Q ss_pred ccc-cceeecccccccccCchhhhhhccccceEEEEEeCCCcEEEEccccCC----CceeeehhhHHHHHHHHHHHHHHH Q lcl|Aclame:pro 228 QGV-LIQDVARVIDYNILDKSTEGDLLNRNGVSYFARTSMGGFSLIGNRTVT----GKFISFVGLEDAIARKLEAASQRA 302 (388) Q Consensus 228 ~p~-~~~g~~~~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~----~~~i~vrR~~~~i~~~i~~~~~~~ 302 (388) +++ ++.++++.+.+...++.+|+++||++|||++++ ++||++||+||++ |+||++||+++||+++|++.++|+ T Consensus 231 ~~l~gi~~~~~~~~~~~~~~~~~~~~Ln~~gi~t~~~--~~G~~~wG~rT~~~d~~~~~i~vrR~~~~i~~~i~~~~~~~ 308 (396) T protein:vir:57 231 VGVNGVTGISASVFWDLQKPGTDADLLNEAGVTTLVR--RDGFRFWGNRTCSDDPLFLFESYTRTAQVLADTMAEAHMWA 308 (396) T ss_pred ceeccccccceecccccCCcchhhhhhhhcCcEEEEc--CCCEEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHh Confidence 775 477889999999999999999999999999965 6799999999986 999999999999999999999999 Q ss_pred hcccCCHHHHHHHHHHHHHHHHHHHhcCCeeeeeEEEeccCCCHHHhhCCeEEEEEEEEecCcceeEEEEEEEcchHHHH Q lcl|Aclame:pro 303 MSKQLTKSFMEQEIKKINLFMQDLVAAEIIPGGEVYLHPTLNTVERYKNGSWYIVIDYGRYSPNEHMIFHLNAVDRIVEE 382 (388) Q Consensus 303 vfepn~~~~~~~i~~~i~~~L~~l~~~Gal~g~~v~~d~~~Nt~~~i~~G~~~~~v~~~p~~pae~I~~~~~~~~~~~~~ 382 (388) |||||++.+|++|++++++||++||++|+|+||+|+||+++||+++|++|+|+++|+++|++|+|||+|+++++.+|+++ T Consensus 309 v~e~n~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~n~~~~i~~G~~~~~v~~~p~~p~e~I~~~~~~~~~~~~~ 388 (396) T protein:vir:57 309 IDKPITATLIRDIIDGINAKFRELKNNGYIVDGTCWFSEESNDAETLKAGKLYIDYDYTPVPPLENLTLRQRITSRYLAS 388 (396) T ss_pred ccCCCCHHHHHHHHHHHHHHHHHHHhCCceeceEEEEecCCCCHHHhhCCeEEEEEEEEecCCcceEEEEEEEchHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHhcC Q lcl|Aclame:pro 383 FIEEVL 388 (388) Q Consensus 383 l~~~~~ 388 (388) |+++|. T Consensus 389 ~~~~~~ 394 (396) T protein:vir:57 389 LVTSVN 394 (396) T ss_pred HHHHhh Confidence 999999 No 11 >protein:vir:6079 Length: 396 # NCBI annotation: gpFI # Family: family:all:115 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878220;genbank:gi:33438919;genbank:GeneID:1457754 Probab=100.00 E-value=1.5e-98 Score=556.81 Aligned_cols=373 Identities=23% Similarity=0.354 Sum_probs=330.8 Q ss_pred CCCcCCCeEEEEcCCCcccccccCcceeEEEeeccccccc-cccCcceeeccchhhhhhcccccccccchhhhhhhhccc Q lcl|Aclame:pro 4 IDQFEHNGISIETHEPPPPMGPPGDNVVAWVVTAPDKHAD-VAFSVPFRVANTADAQYLDSTGNELGTGWHAASETLKKT 82 (388) Q Consensus 4 ~t~~~hGV~~~e~~~~~~~i~~v~tav~g~vgta~~~~~~-~~~~~~v~v~s~~~~~~~~~~~~~~gtl~~a~~~~~~~~ 82 (388) |++|+||||++|++++++|+.+++|++++|+|+++++++. ++.+++++++++.++...+ +..+++.+++..+++++ T Consensus 1 m~~~~~Gv~v~e~~~~~~~v~~~~tav~~fvGta~~~~~~~~p~~~p~~v~s~~~~~~~~---g~~~tl~~a~~~~~~~g 77 (396) T protein:vir:60 1 MSDYHHGVQVLEINEGTRVISTVSTAIVGMVCTASDADAEIFPLNKPVLITNVQSAIAKA---GKKGTLAASLQAIADQS 77 (396) T ss_pred CCCCCCCeEEEEcCCCcccccccCceeEEEEecccccccccccCccCeEeechHHHHHhh---cCcchhHHHHHHHhhcc Confidence 7789999999999999999999999999999999998766 4678999999988776654 45689999999999999 Q ss_pred cceEEEEeccccccc------ccccccccccccchhhhhhhHhhhhhhh----hhhhheecccccchhHHHHHHHHHhhh Q lcl|Aclame:pro 83 SVPQYFIVVPEGADD------AATMANIIGGIDPTTGRRTGIAALTECT----ERPTLIGAPGFSQNKAVIDALASMAKR 152 (388) Q Consensus 83 ~~~~~vv~~~~~~~~------~~~~~~~~~~~~~~tg~~tgl~a~~~~~----~~p~ll~ap~~~~~~~v~~~l~~~~~~ 152 (388) +..+++++...+... +.+...+.++.+. ++..+|++++.+.. ..|.++++||++ .+.|+++|.++|++ T Consensus 78 g~~~~vv~~~~~~~~~~~~~~~~~~~~~~~~~d~-~~~~tg~~al~~~~~~~~~~~~il~ap~~~-~~~v~~al~~~~~~ 155 (396) T protein:vir:60 78 KPVTVVVRVEDGTGEDEETKLAQTVSNIIGTTDE-NGQYTGLKALLAAESVTGVKPRILGVPGLD-TKEVAVALASVCQK 155 (396) T ss_pred CceEEEEecccccccccccccccccccccccccc-cccccchhhhhhcccceeeeeeeccccccc-cHHHHHHHHHHhcc Confidence 999999987544333 3344556666665 67788888887665 358999999985 57799999999999 Q ss_pred CceEEEEecCCCcc-hhHHHHHHHhhhcccccceEEEEecceecccccccceeehhhHHHHHHHHhcccc----cccccc Q lcl|Aclame:pro 153 LKCRAVIDGPSGST-QDAIDLSGLLGGEGTGHDRVYMVDPMPAIYSRKAQGNIYVPPSTIAMGAVAAVKP----WESPGN 227 (388) Q Consensus 153 ~~~~~i~d~p~~~~-~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~s~~~aG~~a~~d~----~~s~~n 227 (388) +++++++|.|.+.+ +++..++ .+++|.|+++||||++.+|+.++..+++|||+++||++|++|. |+||+| T Consensus 156 ~~~~~i~d~p~~~~~~~a~~~~-----~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~AG~~a~~d~~~g~~~spaN 230 (396) T protein:vir:60 156 LRAFGYISAWGCKTISEVKAYR-----QNFSQRELMVIWPDFLAWDTVASTTATAYATARALGLRAKIDQEQGWHKTLSN 230 (396) T ss_pred CCeEEEEeCCCCCCHHHHHHHH-----hhcCCceEEEEeCceeeecccCCceeEEchhHHHHHHHHHhhhccCcEeCcCC Confidence 99999999997755 3333333 3678999999999999999999999999999999999999995 556666 Q ss_pred ccc-cceeecccccccccCchhhhhhccccceEEEEEeCCCcEEEEccccCC----CceeeehhhHHHHHHHHHHHHHHH Q lcl|Aclame:pro 228 QGV-LIQDVARVIDYNILDKSTEGDLLNRNGVSYFARTSMGGFSLIGNRTVT----GKFISFVGLEDAIARKLEAASQRA 302 (388) Q Consensus 228 ~p~-~~~g~~~~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~----~~~i~vrR~~~~i~~~i~~~~~~~ 302 (388) +++ ++.+++..+.+...++.+|+++||++|||++++ ++|+++||+||++ |+||++||+++||+++|++.++|+ T Consensus 231 ~~l~gi~~~~~~~~~~~~~~~~~~~~Ln~~gI~~~~~--~~G~~~wG~rT~~~d~~~~~i~~rR~~~~i~~~i~~~~~~~ 308 (396) T protein:vir:60 231 VGVNGVTGISASVFWDLQESGTDADLLNESGVTTLIR--RDGFRFWGNRTCSDDPLFLFENYTRTAQVLADTMAEAHMWA 308 (396) T ss_pred ceecceeeceeecccccCCCcchhhhhhhcCcEEEEc--CCCEEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHh Confidence 665 477888888999999999999999999999955 7899999999986 999999999999999999999999 Q ss_pred hcccCCHHHHHHHHHHHHHHHHHHHhcCCeeeeeEEEeccCCCHHHhhCCeEEEEEEEEecCcceeEEEEEEEcchHHHH Q lcl|Aclame:pro 303 MSKQLTKSFMEQEIKKINLFMQDLVAAEIIPGGEVYLHPTLNTVERYKNGSWYIVIDYGRYSPNEHMIFHLNAVDRIVEE 382 (388) Q Consensus 303 vfepn~~~~~~~i~~~i~~~L~~l~~~Gal~g~~v~~d~~~Nt~~~i~~G~~~~~v~~~p~~pae~I~~~~~~~~~~~~~ 382 (388) |||||++.+|++|++++++||++||++|+|+||+|+||+++||+++|++|+|+++|+++|++|+|||+|+++++.+|+++ T Consensus 309 v~e~n~~~~~~~i~~~i~~~l~~l~~~gal~g~~~~~d~~~nt~~~i~~G~~~~~i~~~p~~pae~I~~~~~~~~~~~~~ 388 (396) T protein:vir:60 309 VDKPITATLIRDIVDGINAKFRELKTNGYIVDATCWFSEESNDAETLKAGKLYIDYDYTPVPPLENLTLRQRITDKYLAN 388 (396) T ss_pred ccCCCCHHHHHHHHHHHHHHHHHHHhCCceeceEEEEecCCCCHHHhhCCEEEEEEEEEecCCcceEEEEEEEchHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHhcC Q lcl|Aclame:pro 383 FIEEVL 388 (388) Q Consensus 383 l~~~~~ 388 (388) |+++|. T Consensus 389 ~~~~~~ 394 (396) T protein:vir:60 389 LVTSVN 394 (396) T ss_pred HHHHhh Confidence 999999 No 12 >protein:vir:2035 Length: 396 # NCBI annotation: gpFI # Family: family:all:115 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046778;genbank:gi:9630349;genbank:GeneID:1261516 Probab=100.00 E-value=2.4e-98 Score=555.78 Aligned_cols=373 Identities=22% Similarity=0.341 Sum_probs=330.6 Q ss_pred CCCcCCCeEEEEcCCCcccccccCcceeEEEeeccccccc-cccCcceeeccchhhhhhcccccccccchhhhhhhhccc Q lcl|Aclame:pro 4 IDQFEHNGISIETHEPPPPMGPPGDNVVAWVVTAPDKHAD-VAFSVPFRVANTADAQYLDSTGNELGTGWHAASETLKKT 82 (388) Q Consensus 4 ~t~~~hGV~~~e~~~~~~~i~~v~tav~g~vgta~~~~~~-~~~~~~v~v~s~~~~~~~~~~~~~~gtl~~a~~~~~~~~ 82 (388) |++|+||||++|++++++++..++|++++|+|+++++++. +++++|+++++..+....+ +..++|..++..+++++ T Consensus 1 m~~~~~GV~v~e~~~g~~~i~~v~tav~~~vg~a~~a~~~~~~l~~pvlvts~~~~~~~~---g~~~tL~~al~~~~~ng 77 (396) T protein:vir:20 1 MSDYHHGVQVLEINEGTRVISTVSTAIVGMVCTASDADAETFPLNKPVLITNVQSAISKA---GKKGTLAASLQAIADQS 77 (396) T ss_pred CCCCCCCeEEEEcCCCcceeeecCCceeEEEeeeccCCCccccCccCEEeechHHHHhhc---ccccchhhhhhhhhccC Confidence 7789999999999999999999999999999999998865 5678999999988877664 45688999999999999 Q ss_pred cceEEEEeccccc------ccccccccccccccchhhhhhhHhhhhhhh----hhhhheecccccchhHHHHHHHHHhhh Q lcl|Aclame:pro 83 SVPQYFIVVPEGA------DDAATMANIIGGIDPTTGRRTGIAALTECT----ERPTLIGAPGFSQNKAVIDALASMAKR 152 (388) Q Consensus 83 ~~~~~vv~~~~~~------~~~~~~~~~~~~~~~~tg~~tgl~a~~~~~----~~p~ll~ap~~~~~~~v~~~l~~~~~~ 152 (388) +..+++++...+. ..+.+...+.++.+. ++..+|++++.+.. ..|.++++|++++ +.|+++|.++|++ T Consensus 78 g~~~~v~~~~~~~~~~~~~~~a~t~~~~~~~~~~-~~~~tg~~al~~~~~~~~~~p~i~~ap~~~~-~~v~~al~~~~~~ 155 (396) T protein:vir:20 78 KPVTVVMRVEDGTGDDEETKLAQTVSNIIGTTDE-NGQYTGLKAMLAAESVTGVKPRILGVPGLDT-KEVAVALASVCQK 155 (396) T ss_pred ceeEEEEecccccccccccccccccccccccccc-ccccchhhhhhhhccccccchhhhhhhhhcc-HHHHHHHHHHHhc Confidence 9999988765433 233444555555544 56778888777654 4689999999865 6689999999999 Q ss_pred CceEEEEecCCCcc-hhHHHHHHHhhhcccccceEEEEecceecccccccceeehhhHHHHHHHHhccc----ccccccc Q lcl|Aclame:pro 153 LKCRAVIDGPSGST-QDAIDLSGLLGGEGTGHDRVYMVDPMPAIYSRKAQGNIYVPPSTIAMGAVAAVK----PWESPGN 227 (388) Q Consensus 153 ~~~~~i~d~p~~~~-~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~s~~~aG~~a~~d----~~~s~~n 227 (388) +++++++|.|.+.+ +++.+++ .+++|.|+++||||++.+|+..+..+++|||+++||++|++| +|+||+| T Consensus 156 ~~~~~~iD~p~~~~~~~a~~~r-----~~~~s~~~~~~~P~~~~~d~~~~~~~~~p~s~~~Ag~~a~~d~~~g~~~spaN 230 (396) T protein:vir:20 156 LRAFGYISAWGCKTISEVKAYR-----QNFSQRELMVIWPDFLAWDTVTSTTATAYATARALGLRAKIDQEQGWHKTLSN 230 (396) T ss_pred CCcEEEEecCCCCCHHHHHHHh-----hCCCCceEEEEcCccccccCcCCcceeechhHHHHHHHHHhhhhcCcEeccCC Confidence 99999999998755 3333332 467899999999999999999999999999999999999999 4667777 Q ss_pred ccc-cceeecccccccccCchhhhhhccccceEEEEEeCCCcEEEEccccCC----CceeeehhhHHHHHHHHHHHHHHH Q lcl|Aclame:pro 228 QGV-LIQDVARVIDYNILDKSTEGDLLNRNGVSYFARTSMGGFSLIGNRTVT----GKFISFVGLEDAIARKLEAASQRA 302 (388) Q Consensus 228 ~p~-~~~g~~~~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~----~~~i~vrR~~~~i~~~i~~~~~~~ 302 (388) +++ ++.|+++++.+...++.+|++.||++|||++++ ++||++||+||++ |+||++||+++||+++|++.++|+ T Consensus 231 ~~l~gi~~~~~~~~~~~~~~~~~~~~Ln~~gi~~~~~--~~G~~~wG~rT~s~d~~~~~i~~rR~~~~i~~~~~~~~~~~ 308 (396) T protein:vir:20 231 VGVNGVTGISASVFWDLQESGTDADLLNESGVTTLIR--RDGFRFWGNRTCSDDPLFLFENYTRTAQVVADTMAEAHMWA 308 (396) T ss_pred ceeccceecceecccccCCCcchhhhhhhcCcEEEEc--CCCEEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHh Confidence 776 578889999999999999999999999999965 6899999999986 999999999999999999999999 Q ss_pred hcccCCHHHHHHHHHHHHHHHHHHHhcCCeeeeeEEEeccCCCHHHhhCCeEEEEEEEEecCcceeEEEEEEEcchHHHH Q lcl|Aclame:pro 303 MSKQLTKSFMEQEIKKINLFMQDLVAAEIIPGGEVYLHPTLNTVERYKNGSWYIVIDYGRYSPNEHMIFHLNAVDRIVEE 382 (388) Q Consensus 303 vfepn~~~~~~~i~~~i~~~L~~l~~~Gal~g~~v~~d~~~Nt~~~i~~G~~~~~v~~~p~~pae~I~~~~~~~~~~~~~ 382 (388) |||||++.+|++|+++++.||++||++|+|+||+|+||+++||+++|++|+|+++|+++|++|+|||+|+++++++|+++ T Consensus 309 v~e~~~~~~~~~i~~~i~~~L~~l~~~G~l~g~~v~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~ 388 (396) T protein:vir:20 309 VDKPITATLIRDIVDGINAKFRELKTNGYIVDATCWFSEESNDAETLKAGKLYIDYDYTPVPPLENLTLRQRITDKYLAN 388 (396) T ss_pred ccCCCCHHHHHHHHHHHHHHHHHHHhCcceeceEEEEecCCCCHHHhhCCEEEEEEEEEecCCcceEEEEEEEchHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHhcC Q lcl|Aclame:pro 383 FIEEVL 388 (388) Q Consensus 383 l~~~~~ 388 (388) |+++|. T Consensus 389 ~~~~~~ 394 (396) T protein:vir:20 389 LVTSVN 394 (396) T ss_pred HHHHhh Confidence 999999 No 13 >protein:vir:107865 Length: 477 # NCBI annotation: gp39 # Family: family:all:115 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024712;genbank:gi:48696949;genbank:GeneID:2845953 Probab=100.00 E-value=2.9e-98 Score=555.36 Aligned_cols=379 Identities=24% Similarity=0.285 Sum_probs=330.0 Q ss_pred CCCCCCcCCCeEEEEcCCCcccccccCcceeEEEeeccccccccccCcceeeccchhhhhhcccccccccchhhhhhhhc Q lcl|Aclame:pro 1 MPVIDQFEHNGISIETHEPPPPMGPPGDNVVAWVVTAPDKHADVAFSVPFRVANTADAQYLDSTGNELGTGWHAASETLK 80 (388) Q Consensus 1 M~~~t~~~hGV~~~e~~~~~~~i~~v~tav~g~vgta~~~~~~~~~~~~v~v~s~~~~~~~~~~~~~~gtl~~a~~~~~~ 80 (388) ||+ +|+||||++|+++++++|..++|++++|||+++.+ +.|+|++++|+.++.. ++.....++|..++..+|+ T Consensus 1 M~~--~~~pGVyv~E~~~~~~~i~~v~T~v~~~VG~a~~g----p~n~pv~its~~d~~~-~g~~~~~~tL~~Av~~~f~ 73 (477) T protein:vir:10 1 MAA--NYLHGVETIEKETGSRPVKVVKSAVIGLIGTAPIG----PVNTPVQSLSDVDAAQ-FGPQLAGFTIPQALDAVYD 73 (477) T ss_pred Ccc--cCCCCeEEEEccCCcccccccCCceeEEEecccCC----CCCcCEEEccHHHHHH-hccCCCCCcHHHHHHHHHh Confidence 996 58999999999999999999999999999999876 6789999999999865 4455567899999999999 Q ss_pred cccceEEEEecccccccccccc---------------------------------------------------------- Q lcl|Aclame:pro 81 KTSVPQYFIVVPEGADDAATMA---------------------------------------------------------- 102 (388) Q Consensus 81 ~~~~~~~vv~~~~~~~~~~~~~---------------------------------------------------------- 102 (388) +++..++++++.+....+.+.. T Consensus 74 nGg~~~~vVrV~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 153 (477) T protein:vir:10 74 YGSGTVIVINVLDPAVHKSNAANEPVTFDAATGRAKLAHPAAANLVLKNDSGGTTYAEGTDYAVDLINGVITRIKTGTIP 153 (477) T ss_pred ccceEEEEEecCccccccccccccccccccccceecccccccccccccccccccccccchhhhhhhccccceeccccccc Confidence 9999999988754322211110 Q ss_pred ---------------------cccccccchhhhhhhHhhhhhhh----hhhhheecccccchhHHHHHHHHHhhhCceEE Q lcl|Aclame:pro 103 ---------------------NIIGGIDPTTGRRTGIAALTECT----ERPTLIGAPGFSQNKAVIDALASMAKRLKCRA 157 (388) Q Consensus 103 ---------------------~~~~~~~~~tg~~tgl~a~~~~~----~~p~ll~ap~~~~~~~v~~~l~~~~~~~~~~~ 157 (388) ++++.. ..++.++|++++.+.. ..|.++++||+++.++|.++|.++|+++++++ T Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~g~~-~~~~~~tGl~al~~~~~~~~~~~~~l~apg~~~~~~v~~~l~~~~~~~~~~~ 232 (477) T protein:vir:10 154 PGATAAKATYDYADPTKVTAADIIGAV-NAAGMRTGMKALKDTYNLYGYFSKILIAPAYCTQNSVSVELEAMAVQLGAIA 232 (477) T ss_pred ccceeeeeccccccccccccccccccc-cccchhhhhhhhhhhhhhcchhcccccccccccchhhHHHHHHHHhhCCEEE Confidence 011111 1245567888876654 35789999999999999999999999999999 Q ss_pred EEecCCCcch-hHHHHHHH--hhhcccccceEEEEecceecccccccceeehhhHHHHHHHHhcccc----ccccccccc Q lcl|Aclame:pro 158 VIDGPSGSTQ-DAIDLSGL--LGGEGTGHDRVYMVDPMPAIYSRKAQGNIYVPPSTIAMGAVAAVKP----WESPGNQGV 230 (388) Q Consensus 158 i~d~p~~~~~-~~~~~~~~--~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~s~~~aG~~a~~d~----~~s~~n~p~ 230 (388) ++|.|.+.+. .+..++.. ....+++|+|++++|||++++|+.++..+++|||+++||++|++|. |+||+|+++ T Consensus 233 ~~d~p~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~ag~~a~~d~~~g~~~span~~~ 312 (477) T protein:vir:10 233 YIDAPIGTTLAQALAGRGPAGTINFNTSSDRVRLCYPHVKVYDTATNAERLEPLSSRAAGLRARVDLDKGYWWSSSNQQL 312 (477) T ss_pred EEecCCCCCHHHHHhhhhhccccccccccceEEEEcCeEEEecccCCceeEEchHHHHHHHHHHhhhcCCceeccCCcee Confidence 9999977653 33444332 2345678999999999999999999999999999999999999994 666667776 Q ss_pred -cceeecccccccccCchhhhhhccccceEEEEEeCCCcEEEEccccCC-------CceeeehhhHHHHHHHHHHHHHHH Q lcl|Aclame:pro 231 -LIQDVARVIDYNILDKSTEGDLLNRNGVSYFARTSMGGFSLIGNRTVT-------GKFISFVGLEDAIARKLEAASQRA 302 (388) Q Consensus 231 -~~~g~~~~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~-------~~~i~vrR~~~~i~~~i~~~~~~~ 302 (388) ++.++++.+.+.+.++++|++.||++|||+|++|+++|+++||+||++ |+|+++||+++||+++|++.++|+ T Consensus 313 ~gi~~~~~~~~~~~~~~~~~~~~L~~~gi~~i~~~~~~G~~~wG~rT~~~~~~~~~~~~~~vrR~~~~i~~~~~~~~~~~ 392 (477) T protein:vir:10 313 VGVTGVERPLSAMIDDPQSDVNMLNEQGITTVFSSYGSGLRLWGNRTAAWPTVTHMRNFENVRRTGDVINESLRYFSQQF 392 (477) T ss_pred ccccccccccccccCCChhhHHHHhhCCceEEEEecCCcEEEEcccccCCCCCCcccceeehhhHHHHHHHHHHHHHHHh Confidence 477888889999999999999999999999999999999999999982 999999999999999999999999 Q ss_pred hcccCCHHHHHHHHHHHHHHHHHHHhcCCeeeeeEEEeccCCCHHHhhCCeEEEEEEEEecCcceeEEEEEEEcchHHHH Q lcl|Aclame:pro 303 MSKQLTKSFMEQEIKKINLFMQDLVAAEIIPGGEVYLHPTLNTVERYKNGSWYIVIDYGRYSPNEHMIFHLNAVDRIVEE 382 (388) Q Consensus 303 vfepn~~~~~~~i~~~i~~~L~~l~~~Gal~g~~v~~d~~~Nt~~~i~~G~~~~~v~~~p~~pae~I~~~~~~~~~~~~~ 382 (388) |||||++.+|++|++++++||++||++|+|+||+|+||+++||++||++|+|+++|+++|++|+|||+|+++++++||++ T Consensus 393 v~~~~~~~~~~~i~~~i~~~l~~l~~~g~l~g~~v~~~~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~ 472 (477) T protein:vir:10 393 VDAPIDQGLIDSLVESVNGFGRKLIGDGALLGFKAWFDPARNPKEELAAGHLLINYKYTVPPPLERLTYETEITSEYLLT 472 (477) T ss_pred ccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHhhCCeEEEEEEEEecCCcceEEEEEEEcchHHhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHhc Q lcl|Aclame:pro 383 FIEEV 387 (388) Q Consensus 383 l~~~~ 387 (388) |+..= T Consensus 473 ~~~g~ 477 (477) T protein:vir:10 473 LKGGN 477 (477) T ss_pred hhcCC Confidence 99888 No 14 >protein:vir:79092 Length: 477 # NCBI annotation: gp13, phage tail sheath protein # Family: family:all:115 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111213;genbank:gi:134288804;genbank:GeneID:4960766 Probab=100.00 E-value=8.5e-98 Score=552.77 Aligned_cols=379 Identities=23% Similarity=0.285 Sum_probs=326.8 Q ss_pred CCCCCCcCCCeEEEEcCCCcccccccCcceeEEEeeccccccccccCcceeeccchhhhhhcccccccccchhhhhhhhc Q lcl|Aclame:pro 1 MPVIDQFEHNGISIETHEPPPPMGPPGDNVVAWVVTAPDKHADVAFSVPFRVANTADAQYLDSTGNELGTGWHAASETLK 80 (388) Q Consensus 1 M~~~t~~~hGV~~~e~~~~~~~i~~v~tav~g~vgta~~~~~~~~~~~~v~v~s~~~~~~~~~~~~~~gtl~~a~~~~~~ 80 (388) ||+ +|+||||++|+++++++|.+++|++++|||+++.+ +.|+|++++|+.+++.+ +.....++|..++..+|+ T Consensus 1 M~~--~~~pGVyv~E~~~g~~~I~~v~Tsv~~~VG~a~~~----p~n~pv~its~~d~~~~-g~~~~~~tL~~Av~~~f~ 73 (477) T protein:vir:79 1 MAA--NYLHGVETIEKETGSRPVKVVKSAVIGLIGTAPIG----PVNTPVQSLSDVDAAQF-GPQLAGFTIPQALDAVYD 73 (477) T ss_pred CcC--CCCCCeEEEEecCCcccccccCCceEEEEeecccC----CCcccEEEccHHHHHHh-cCCCCCCcHHHHHHHHhh Confidence 995 79999999999999999999999999999999887 67899999999999875 444567999999999999 Q ss_pred cccceEEEEecccccccccccccc-------------------------------------------------------- Q lcl|Aclame:pro 81 KTSVPQYFIVVPEGADDAATMANI-------------------------------------------------------- 104 (388) Q Consensus 81 ~~~~~~~vv~~~~~~~~~~~~~~~-------------------------------------------------------- 104 (388) +++..|+++++.+....+...... T Consensus 74 ngg~~~~vvrV~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 153 (477) T protein:vir:79 74 YGSGTVIVINVLDPAVHKSNAASESVTFDAATGRAKLAHPAAANLVLKNDSGGTTYTEGTDYAVDLINGVITRIKTGTIP 153 (477) T ss_pred cCCceEEEEeccCCccccccccccccccccccccccccccccceeEEeecccccccccCccccccccchhhhhhhccccc Confidence 999999998875543222211000 Q ss_pred -----------------------cccccchhhhhhhHhhhhhhh----hhhhheecccccchhHHHHHHHHHhhhCceEE Q lcl|Aclame:pro 105 -----------------------IGGIDPTTGRRTGIAALTECT----ERPTLIGAPGFSQNKAVIDALASMAKRLKCRA 157 (388) Q Consensus 105 -----------------------~~~~~~~tg~~tgl~a~~~~~----~~p~ll~ap~~~~~~~v~~~l~~~~~~~~~~~ 157 (388) ++..+ ..+..+|++++.... ..|.++.+||+++.++|.++|.++|+++++++ T Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~g~~~-a~~~~tg~~al~~~~~~~~~~~~iv~apg~~~~~~v~~~l~~~~~~~~~~a 232 (477) T protein:vir:79 154 AAATAAKATYDYADPTKVTAADIIGAVN-AAGMRTGMKALKDTYNLYGYFSKILIAPAYCTQNSVSVELEAMAVQLGAIA 232 (477) T ss_pred cccceeeceeccCCcccceeeeeccccc-ccccchhhhhhhhhhhhcccccceeeccccccchhHHHHHHHHHhhcCeEE Confidence 00000 123345666665544 35789999999999999999999999999999 Q ss_pred EEecCCCcch-hHHHHHHH--hhhcccccceEEEEecceecccccccceeehhhHHHHHHHHhcccc----ccccccccc Q lcl|Aclame:pro 158 VIDGPSGSTQ-DAIDLSGL--LGGEGTGHDRVYMVDPMPAIYSRKAQGNIYVPPSTIAMGAVAAVKP----WESPGNQGV 230 (388) Q Consensus 158 i~d~p~~~~~-~~~~~~~~--~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~s~~~aG~~a~~d~----~~s~~n~p~ 230 (388) ++|.|.+.+. ....++.. ....+++|.|++++|||++++|+.++..+++|||+++||++||+|. |+||+|+++ T Consensus 233 ~~d~p~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~s~~~ag~~a~~d~~~g~~~span~~~ 312 (477) T protein:vir:79 233 YIDAPIGTTLAQALAGRGPAGTINFNTSSDRVRLCYPHVKVYDIATNAERLEPLSSRAAGLRARVDLDKGYWWSSSNQQL 312 (477) T ss_pred EEecCCCCChHHHhhhhhhccccccccccceEEEEcCeeEEecccCCceeeechHHHHHHHHHHhhccCCceEccCCcee Confidence 9999977653 33333322 2245678999999999999999999999999999999999999995 555666665 Q ss_pred -cceeecccccccccCchhhhhhccccceEEEEEeCCCcEEEEccccCC-------CceeeehhhHHHHHHHHHHHHHHH Q lcl|Aclame:pro 231 -LIQDVARVIDYNILDKSTEGDLLNRNGVSYFARTSMGGFSLIGNRTVT-------GKFISFVGLEDAIARKLEAASQRA 302 (388) Q Consensus 231 -~~~g~~~~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~-------~~~i~vrR~~~~i~~~i~~~~~~~ 302 (388) ++.++.+.+.+...++++|++.||++|||+|++|+++|+++||+||++ |+|+++||++++|+++|++.++|+ T Consensus 313 ~gv~~~~~~~~~~~~~~~~~~~~L~~~~i~~i~~~~~~G~~~wG~rT~~~~~~~~~~~~i~vrR~~~~i~~~~~~~~~~~ 392 (477) T protein:vir:79 313 VGVTGVERPLSAMIDDPQSDVNMLNEQGITTVFSSYGSGLRLWGNRTAAWPTVTHMRNFENVRRTGDVINESLRYFSQQF 392 (477) T ss_pred ecceecccccccccCCChhhHHHHhhCCceEEEEecCCcEEEEcccccCCCCCCccceeeehhhHHHHHHHHHHHHHHHh Confidence 477888888999999999999999999999999999999999999983 999999999999999999999999 Q ss_pred hcccCCHHHHHHHHHHHHHHHHHHHhcCCeeeeeEEEeccCCCHHHhhCCeEEEEEEEEecCcceeEEEEEEEcchHHHH Q lcl|Aclame:pro 303 MSKQLTKSFMEQEIKKINLFMQDLVAAEIIPGGEVYLHPTLNTVERYKNGSWYIVIDYGRYSPNEHMIFHLNAVDRIVEE 382 (388) Q Consensus 303 vfepn~~~~~~~i~~~i~~~L~~l~~~Gal~g~~v~~d~~~Nt~~~i~~G~~~~~v~~~p~~pae~I~~~~~~~~~~~~~ 382 (388) |||||++.+|++|++++++||++||++|+|+||+|+||+++||++||++|+|+++|+++|++|+|||+|+++++++||++ T Consensus 393 v~e~~~~~~~~~i~~~i~~~l~~l~~~g~l~g~~v~~~~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~ 472 (477) T protein:vir:79 393 VDAPIDQGLIDSLVESVNGFGRKLIGDGALLGFKAWFDPARNPKEELAAGHLLINYKYTVPPPLERLTYETEITSEYLLT 472 (477) T ss_pred ccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHhhCCeEEEEEEEEecCCceeEEEEEEEechHHhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999998 Q ss_pred HHHhc Q lcl|Aclame:pro 383 FIEEV 387 (388) Q Consensus 383 l~~~~ 387 (388) |+..= T Consensus 473 ~~~~~ 477 (477) T protein:vir:79 473 LKGGN 477 (477) T ss_pred hccCC Confidence 88887 No 15 >protein:vir:10336 Length: 386 # NCBI annotation: ORF39 # Family: family:all:115 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758931;genbank:gi:27311206;genbank:GeneID:956116 Probab=100.00 E-value=7.9e-98 Score=552.94 Aligned_cols=372 Identities=30% Similarity=0.438 Sum_probs=342.7 Q ss_pred CCCCCCcCCCeEEEEcCCCcccccccCcceeEEEeecccccccc-ccCcceeeccchhhhhhcccccccccchhhhhhhh Q lcl|Aclame:pro 1 MPVIDQFEHNGISIETHEPPPPMGPPGDNVVAWVVTAPDKHADV-AFSVPFRVANTADAQYLDSTGNELGTGWHAASETL 79 (388) Q Consensus 1 M~~~t~~~hGV~~~e~~~~~~~i~~v~tav~g~vgta~~~~~~~-~~~~~v~v~s~~~~~~~~~~~~~~gtl~~a~~~~~ 79 (388) ||+ +|+||||++|++++++|+.+++|++++|||+++++++.. +.++++++++..+....+ +..+++..++..++ T Consensus 1 M~~--~~~~Gv~v~ev~~~~~~i~~v~tav~~~vg~a~~a~~~~~~~~~pv~i~s~~~~~~~~---g~~~tl~~a~~~~~ 75 (386) T protein:vir:10 1 MAE--QYLHGAEVVEIDNGARPIRTAQSGVIGLVGTAPDADATAFPLNTPVLIAGSRREAAKL---GAGGTLPQAIDGIF 75 (386) T ss_pred Ccc--ccCCCeEEEEcCCCcccccccCcceeEEEEecCCCCCcccccccceEecchHHHHhhc---CCCcchhHHHHHHh Confidence 995 689999999999999999999999999999999887664 689999999988876654 45689999999999 Q ss_pred ccccceEEEEecccccccccccccccccccchhhhhhhHhhhhhhh----hhhhheecccccchhHHHHHHHHHhhhCce Q lcl|Aclame:pro 80 KKTSVPQYFIVVPEGADDAATMANIIGGIDPTTGRRTGIAALTECT----ERPTLIGAPGFSQNKAVIDALASMAKRLKC 155 (388) Q Consensus 80 ~~~~~~~~vv~~~~~~~~~~~~~~~~~~~~~~tg~~tgl~a~~~~~----~~p~ll~ap~~~~~~~v~~~l~~~~~~~~~ 155 (388) ++++..|+++++.++.+...+..+++++.+..++..+|++++.+.. ..|.++.+|++++..+|.++|.++++++++ T Consensus 76 ~~gg~~~~vv~~~~~~~~~~t~~~~ig~~~~~t~~~tgl~~l~~~~~~~~~~p~i~~ap~~~~~~~v~~~l~~~~~~~~~ 155 (386) T protein:vir:10 76 DQTGAVVVVIRVDEGVDSAATQSNVIGKVDADTEQYTGILALLSAENTVKVQPRILIAPGFSNQKAVADQLVSVADTAAW 155 (386) T ss_pred ccCceeEEEeeccccccccccchhhhcccccccchhhhhHHhhhhcccccccccccccccccchhHHHHHHHHhhcceEE Confidence 9999999999999999999999999999999999999998887665 358999999999999999999999999999 Q ss_pred EEEEecCCCcchhHHHHHHHhhhcccccceEEEEecceecccccccceeehhhHHHHHHHHhccc----cccccccccc- Q lcl|Aclame:pro 156 RAVIDGPSGSTQDAIDLSGLLGGEGTGHDRVYMVDPMPAIYSRKAQGNIYVPPSTIAMGAVAAVK----PWESPGNQGV- 230 (388) Q Consensus 156 ~~i~d~p~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~s~~~aG~~a~~d----~~~s~~n~p~- 230 (388) +.+.|.+....+.+..++ ..++|.++++||||++++|+.++..+++|||+++||++|++| +|+||+|+++ T Consensus 156 ~~~~~~~~~~~~~a~~~~-----~~~~s~~~~~~~p~~~v~~~~~~~~~~~p~s~~~ag~~a~~D~~~G~~~spaN~~l~ 230 (386) T protein:vir:10 156 LCHSGWSNTTDAAAITYR-----ELFGSRRCEVVDPWYKVWDVETSAHIIQPPSARHAGVMAKVHNTLGFWWSNSNQEIL 230 (386) T ss_pred EEEeCCCCCchHHHHHhh-----hcccccceEEecCceeeeccccccceeechHHHHHHHHHHhhhcCCcEEccCCceee Confidence 999998877666554443 477899999999999999999999999999999999999999 5667777775 Q ss_pred cceeecccccccccCchhhhhhccccceEEEEEeCCCcEEEEccccCC----CceeeehhhHHHHHHHHHHHHHHHhccc Q lcl|Aclame:pro 231 LIQDVARVIDYNILDKSTEGDLLNRNGVSYFARTSMGGFSLIGNRTVT----GKFISFVGLEDAIARKLEAASQRAMSKQ 306 (388) Q Consensus 231 ~~~g~~~~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~----~~~i~vrR~~~~i~~~i~~~~~~~vfep 306 (388) ++.|+++.+.+...++++|+++||++||++++ +++|+++||+||++ |+||++|||+++|+++|+++++|+|||| T Consensus 231 gv~~~~~~~~~~~~~~~~~~~~l~~~gi~~~~--~~~G~~~wG~rT~~~d~~~~~i~vrR~~~~i~~~~~~~~~~~v~e~ 308 (386) T protein:vir:10 231 GIDGLCRPVDFKLDDPTCRANLLNAKEVTTTI--QQNGFRVWGDRTCSADSKWAFKNVVITNDMIADSLVRNHLWAVDRN 308 (386) T ss_pred cccccceecccccccCcchhhhhhhcCcEEEE--cCCCEEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhccCC Confidence 57889999999999999999999999999885 57899999999986 9999999999999999999999999999 Q ss_pred CCHHHHHHHHHHHHHHHHHHHhcCCeeeeeEEEeccCCCHHHhhCCeEEEEEEEEecCcceeEEEEEEEcchHHHHHH Q lcl|Aclame:pro 307 LTKSFMEQEIKKINLFMQDLVAAEIIPGGEVYLHPTLNTVERYKNGSWYIVIDYGRYSPNEHMIFHLNAVDRIVEEFI 384 (388) Q Consensus 307 n~~~~~~~i~~~i~~~L~~l~~~Gal~g~~v~~d~~~Nt~~~i~~G~~~~~v~~~p~~pae~I~~~~~~~~~~~~~l~ 384 (388) |++.+|++|++++++||++||++|+|.||+|+||+++||+++|++|+|+++|+++|++|+|||+|+++++.+|+++|+ T Consensus 309 ~~~~~~~~i~~~i~~~L~~l~~~g~l~g~~v~~d~~~nt~~~~~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~~ 386 (386) T protein:vir:10 309 ITKTYVEDVTEGVNNYLRHLKNIGAIAGGECWVDPELNSPDQIQQGKVYFDYDFSAYAPAEHITFRSHMVNGYLTEVV 386 (386) T ss_pred CCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEcccCCCHHHhhCCeEEEEEEEEecCCceeEEEEEEEehhHHHhhC Confidence 999999999999999999999999999999999999999999999999999999999999999999999999999999 No 16 >protein:vir:98263 Length: 664 # NCBI annotation: gp18 tail sheath monomer # Family: family:all:661 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239196;genbank:gi:66391671;genbank:GeneID:3416365 Probab=100.00 E-value=5.5e-87 Score=493.48 Aligned_cols=378 Identities=12% Similarity=0.093 Sum_probs=310.3 Q ss_pred CCCCCCcCCCeEEEEcCCCcccccccCcceeEEEeeccccccccccCcceeeccchhhhhhcccccccccchhhhhhhhc Q lcl|Aclame:pro 1 MPVIDQFEHNGISIETHEPPPPMGPPGDNVVAWVVTAPDKHADVAFSVPFRVANTADAQYLDSTGNELGTGWHAASETLK 80 (388) Q Consensus 1 M~~~t~~~hGV~~~e~~~~~~~i~~v~tav~g~vgta~~~~~~~~~~~~v~v~s~~~~~~~~~~~~~~gtl~~a~~~~~~ 80 (388) |+ .-.||||++|+ +++++|..+.|++.+|+|.++.+ |.++|++++|+.++.+.|+.......+..++..+|. T Consensus 1 ma---~~~PgVyv~E~-~~~~~i~~~~ts~~~~vG~~~~G----p~~~p~~i~s~~d~~~~fG~~~~~~~~~~~v~~~f~ 72 (664) T protein:vir:98 1 MA---LQSPGIETKET-SVQSTVVRNSTGRAAIVGKFSWG----PAYQIRQISNEVELVNYFGAPDNLTADYFMSAVNFL 72 (664) T ss_pred Cc---eecCceEEEec-CCCcccccccccceEEEeeccCC----CCCccEEecCHHHHHHhcCCccccchhHHHHHHHHH Confidence 66 33579999999 58999999999999999999877 568999999999999999987777778888888888 Q ss_pred cccceEEEEecccccccc----------------------------------------cc----------------cccc Q lcl|Aclame:pro 81 KTSVPQYFIVVPEGADDA----------------------------------------AT----------------MANI 104 (388) Q Consensus 81 ~~~~~~~vv~~~~~~~~~----------------------------------------~~----------------~~~~ 104 (388) +++..|+++|+......+ +. ..+. T Consensus 73 ngg~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~gn~~~v~i~~~~~~~ 152 (664) T protein:vir:98 73 QYGNDLRLVRVVDKDAAKNASAIFNQIKTTIASQGSNYNVGDVIKVKYSNTVVEENGKVTQVDSNGKILAVTIPKRKKSL 152 (664) T ss_pred hcCCeEEEEEecCccccccccccccccceeeccCCcccccccceeEeecCcccccceeeccCCCCCceeeEeeccCccce Confidence 888888887753211000 00 0000 Q ss_pred -------------------------------------------------------------------------------- Q lcl|Aclame:pro 105 -------------------------------------------------------------------------------- 104 (388) Q Consensus 105 -------------------------------------------------------------------------------- 104 (388) T Consensus 153 ~~~~~~~~~~~~~~~~~~s~~~~s~g~a~a~~v~~v~~d~~~~~~~~~~a~~~i~~~~~~~~~~~~~~~~~~a~~~G~~G 232 (664) T protein:vir:98 153 LVLNRSVLTQIFLLVGTTEIVSQSSGVSASITIDGIESDSGITLLNLDIAKETIQGTSFQTLTQKYQIPSVVALYPGELG 232 (664) T ss_pred eecccccccccceecccceeeeeecccceeeecccccccceeeccccceeeeccccccceeeeeccccceeeeeeccccc Confidence Q ss_pred -------------------------------------------------------------------------------- Q lcl|Aclame:pro 105 -------------------------------------------------------------------------------- 104 (388) Q Consensus 105 -------------------------------------------------------------------------------- 104 (388) T Consensus 233 n~isv~i~s~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~ 312 (664) T protein:vir:98 233 STVQVEIISKAAYDTGAMISGYPSGISVKNSGRSVMTYGPQTDNQYAFVVRRGGIVQESFIVSTDKTDKDIYGVNIYMDD 312 (664) T ss_pred ceeeeeecccccccCcceEeeccCceecccceeeeeeccccCccceeEEEecCCceeeeEEeecccCcccceeeeeechh Confidence Q ss_pred ---------------------------ccccc-----chhhhhhhHhhhhhhh-hhhhheecccccch-----hHHHHHH Q lcl|Aclame:pro 105 ---------------------------IGGID-----PTTGRRTGIAALTECT-ERPTLIGAPGFSQN-----KAVIDAL 146 (388) Q Consensus 105 ---------------------------~~~~~-----~~tg~~tgl~a~~~~~-~~p~ll~ap~~~~~-----~~v~~~l 146 (388) .++.+ ...+.++|++++.+.. ..|+++++||+++. ++|..+| T Consensus 313 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~~~~~~g~~~~~tgl~~l~~~~~~~~~ll~~p~~~~~~~~~~~~v~~al 392 (664) T protein:vir:98 313 FFANGGSQYVFGTSMNWPKGFSGILEFGGGLSSNDTVGADELMTGWDMFADREALHVPLLIAGGCAGESVEIASTVQKHV 392 (664) T ss_pred heecccceeeeeecccCCcccceeEeccCccccccccCchhHHHHHHhhhcccccccceEEecCCCCCcHHHHHHHHHHH Confidence 00000 0011224455554433 24789999998764 4689999 Q ss_pred HHHhhhCc-eEEEEecCCC---------cchhHHHHHH---------HhhhcccccceEEEEecceecccccccceeehh Q lcl|Aclame:pro 147 ASMAKRLK-CRAVIDGPSG---------STQDAIDLSG---------LLGGEGTGHDRVYMVDPMPAIYSRKAQGNIYVP 207 (388) Q Consensus 147 ~~~~~~~~-~~~i~d~p~~---------~~~~~~~~~~---------~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p 207 (388) .++|++++ +++++|.|.. ..+.+.+++. .....+++|+|+++||||++++|+.++..+++| T Consensus 393 ~~~a~~~~~~~a~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~l~~p~~~~~d~~~~~~~~~p 472 (664) T protein:vir:98 393 ISIGDERQDCTVFVSPPRSLLVNIPLATAVDNIVEWRTGYKISGGTPVDNNLNVSSSYGFLDGNYKYQYDKYNDVNRWVP 472 (664) T ss_pred HHHHHhcCCeEEEEccccceeccCCccccHHHHHHHhhhccccccchhhhhcCCccceEEEEcCeEEEecccCCceEEec Confidence 99999874 8999998842 1222333332 122457899999999999999999999999999 Q ss_pred hHHHHHHHHhccccccccccccccc--eeecccccccccCchhhhhhccccceEEEEEeCC-CcEEEEccccCC-----C Q lcl|Aclame:pro 208 PSTIAMGAVAAVKPWESPGNQGVLI--QDVARVIDYNILDKSTEGDLLNRNGVSYFARTSM-GGFSLIGNRTVT-----G 279 (388) Q Consensus 208 ~s~~~aG~~a~~d~~~s~~n~p~~~--~g~~~~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~-~G~~~wG~rT~~-----~ 279 (388) ||+++||++||+|..+|||+.|+|. .++...+.+...+++.|++.||++|||+|+.|++ +|+++||+||++ | T Consensus 473 ~sg~~AGl~A~~D~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~~G~~~wG~rT~~~~~s~~ 552 (664) T protein:vir:98 473 LAGDIAGLCVYTDSVANPWMSPAGYNRGQIRNCIKLAIEPRTAHRDAMYQVQINPVTGFAGGSGFVLYGDKTLTSVPSPF 552 (664) T ss_pred hHHHHHHHHHHhhhcCCcEECcCCceeeeeeccccceeecChhhHHHHHhCCCeEEEEeeCCCcEEEEcccccCCCCccc Confidence 9999999999999888888877764 3566667778888999999999999999999987 699999999974 9 Q ss_pred ceeeehhhHHHHHHHHHHHHHHHhcccCCHHHHHHHHHHHHHHHHHHHhcCCeeeeeEEEeccCCCHHHhhCCeEEEEEE Q lcl|Aclame:pro 280 KFISFVGLEDAIARKLEAASQRAMSKQLTKSFMEQEIKKINLFMQDLVAAEIIPGGEVYLHPTLNTVERYKNGSWYIVID 359 (388) Q Consensus 280 ~~i~vrR~~~~i~~~i~~~~~~~vfepn~~~~~~~i~~~i~~~L~~l~~~Gal~g~~v~~d~~~Nt~~~i~~G~~~~~v~ 359 (388) +|||+||||+||+++|++.++|+|||||++.||++|+++++.||++||++|+|+||+|+||+++||+++|++|+|+++|+ T Consensus 553 ~~i~vrR~~~~i~~si~~~~~~~v~epn~~~l~~~i~~~i~~~L~~l~~~gal~g~~V~~d~~~nt~~~i~~G~~~~~i~ 632 (664) T protein:vir:98 553 DRINVRRLFNMIKKDIGDNAKYKLFENNDDFTRASFRMDTGQYMTNIRALGGCYDYRVICDTTNNTPDVIDRNEFVATVY 632 (664) T ss_pred ceEeehhHHHHHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHHHHHHHhcCceeeeEEEEcCCCCCHHHhhCCeEEEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEecCcceeEEEEEEEcchHHHHHHHhcC Q lcl|Aclame:pro 360 YGRYSPNEHMIFHLNAVDRIVEEFIEEVL 388 (388) Q Consensus 360 ~~p~~pae~I~~~~~~~~~~~~~l~~~~~ 388 (388) ++|++|+|||+|+|++.....+ |+|++ T Consensus 633 ~~p~~pae~I~~~~~q~~~~~~--~~e~~ 659 (664) T protein:vir:98 633 VKPPRSINYITLNFVATSTGAD--FDELV 659 (664) T ss_pred EEecCCcceEEEEEEEeecCcc--hhHhc Confidence 9999999999999999999877 99999 No 17 >protein:vir:6594 Length: 666 # NCBI annotation: tail sheath protein # Family: family:all:661 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891725;genbank:gi:33620670;genbank:GeneID:1725344 Probab=100.00 E-value=2.2e-86 Score=490.16 Aligned_cols=378 Identities=12% Similarity=0.072 Sum_probs=296.8 Q ss_pred CCCcCCCeEEEEcCCCcccccccCcceeEEEeeccccccccccCcceeeccchhhhhhcccccccccchhhhhhhhcccc Q lcl|Aclame:pro 4 IDQFEHNGISIETHEPPPPMGPPGDNVVAWVVTAPDKHADVAFSVPFRVANTADAQYLDSTGNELGTGWHAASETLKKTS 83 (388) Q Consensus 4 ~t~~~hGV~~~e~~~~~~~i~~v~tav~g~vgta~~~~~~~~~~~~v~v~s~~~~~~~~~~~~~~gtl~~a~~~~~~~~~ 83 (388) |+.-.+|||++|+ +++++|..+.|++.+|||.++.+ +.++|++|+|+.++.+.|+.......+..++..+|.+++ T Consensus 1 ~~~~~PgVyv~e~-~~~~~i~~v~ts~~~fvG~~~~G----p~~~p~~v~s~~~~~~~fG~~~~~~~~~~~~~~~f~ngg 75 (666) T protein:vir:65 1 MTLLSPGFETKET-TLSTTIVQSETGRAALVGKFQWG----PAFQIIQVTNEVELVNKFGQPDNNTADYFMSGANFLQYG 75 (666) T ss_pred CceecCceEEEEe-cCcccccccCcccceEEecccCC----CCccCEEecCHHHHHHHcCCccccchhHHHHHHHHHhcC Confidence 7676779999999 68999999999999999999877 678899999999999888865544445555555555555 Q ss_pred ceEEEEecccccccc--------------------------------------c--cc---c------------------ Q lcl|Aclame:pro 84 VPQYFIVVPEGADDA--------------------------------------A--TM---A------------------ 102 (388) Q Consensus 84 ~~~~vv~~~~~~~~~--------------------------------------~--~~---~------------------ 102 (388) ..|+++|+......+ . .. . T Consensus 76 ~~~~vvrv~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~~~~V~~~~~~~~~~~~~~~~~~~~~~~g~~~~t~~~~~~~~ 155 (666) T protein:vir:65 76 NDLRVVRVLNKEKAKNATALAGNVEFEITNEGSNYEVGDTIKIKHNRQDIETAGKVTKVDGDGKVKGVFIPTGKIIAHAK 155 (666) T ss_pred ceEEEEEccCcccccccccccCceeeeEeeccccccccceEEEEecccccccccccccccccccccccccccceeecccc Confidence 555554431100000 0 00 0 Q ss_pred -------------------------------------------------------------------------------- Q lcl|Aclame:pro 103 -------------------------------------------------------------------------------- 102 (388) Q Consensus 103 -------------------------------------------------------------------------------- 102 (388) T Consensus 156 ~~g~~~~l~~~~~~~~~~~~~~~~~a~sv~~~~~~g~~~~~~~~~a~~~~~~~~~~~~~~~~~~~a~~A~~~g~~g~~i~ 235 (666) T protein:vir:65 156 AIGVYPELDGGWTAEFTSSSGNGSAALSVTKIVTDSGLLLTDLETSRANITNQTFLTKLKKYDMPAVSAIYAGEIGNSLE 235 (666) T ss_pred ccCcceeEeeccceeecccCcccccceeeeecccccceeeeeecccccccccccccccccccccceeeeeecccccccee Confidence Q ss_pred -------------------------------------------------------------------------------- Q lcl|Aclame:pro 103 -------------------------------------------------------------------------------- 102 (388) Q Consensus 103 -------------------------------------------------------------------------------- 102 (388) T Consensus 236 v~i~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~ 315 (666) T protein:vir:65 236 VEILARSAFKNTAPDLTMYPYGGERTAARNLIPYAPQNDNQYAFIVRRDGVVVESYVLSTLKGDKDVYGNSIYMDDFFAR 315 (666) T ss_pred EEeecccccccccccccccccccccccceeeecccccccccceeeeecCCcccceeecccCcccccccchhhhhhhhhcc Confidence Q ss_pred ----------------------------------cccccccchhhhhhhHhhhhhhh-hhhhheecccccc----hhHHH Q lcl|Aclame:pro 103 ----------------------------------NIIGGIDPTTGRRTGIAALTECT-ERPTLIGAPGFSQ----NKAVI 143 (388) Q Consensus 103 ----------------------------------~~~~~~~~~tg~~tgl~a~~~~~-~~p~ll~ap~~~~----~~~v~ 143 (388) ++++..+......++++++.+.. ..+.++++|++++ .+.|+ T Consensus 316 ~~~~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~l~~p~~~~~~~~~~~v~ 395 (666) T protein:vir:65 316 GSSQYIYATAQGWVDGFSGIISLAGGVSANEATTGGVGADPFIGAMMQGWDLFAERESIHVNLLIAGACAGEGDAFSTVQ 395 (666) T ss_pred cccceeeeecccccccccceEEccCCCCcCcccccccccccccccHHHHHHHHhhhhhccCCceeecCcCCccchhHHHH Confidence 00000000000112222222221 2367899999865 47899 Q ss_pred HHHHHHhhhCc-eEEEEecCCC---------cchhHHHHHHHh-----hhcccccceEEEEecceecccccccceeehhh Q lcl|Aclame:pro 144 DALASMAKRLK-CRAVIDGPSG---------STQDAIDLSGLL-----GGEGTGHDRVYMVDPMPAIYSRKAQGNIYVPP 208 (388) Q Consensus 144 ~~l~~~~~~~~-~~~i~d~p~~---------~~~~~~~~~~~~-----~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~ 208 (388) .+|.++|++++ +++++|.|.. ..+.+..++... ...+++|.|+++||||++++|+.++..+++|| T Consensus 396 ~~l~~~~~~~~~~~a~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~ 475 (666) T protein:vir:65 396 KHAVSIGDERQDCLVMVSPPRSTVVNIPVTTAIDNLIAWREGSGNYNENNMNINTTYAVIDGNYKYQYDKYNDVNRWVPL 475 (666) T ss_pred HHHHHHHhhccceEEEeccccceeeecCCCCCHHHHHHHHHhcccccccccccCcceEEEEcCceEEecccCCceeEech Confidence 99999999875 7888887642 223333333221 12357899999999999999999999999999 Q ss_pred HHHHHHHHhccccccccccccccc--eeecccccccccCchhhhhhccccceEEEEEeCCCcEEEEccccCC-----Cce Q lcl|Aclame:pro 209 STIAMGAVAAVKPWESPGNQGVLI--QDVARVIDYNILDKSTEGDLLNRNGVSYFARTSMGGFSLIGNRTVT-----GKF 281 (388) Q Consensus 209 s~~~aG~~a~~d~~~s~~n~p~~~--~g~~~~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~-----~~~ 281 (388) |+++||++||+|..+|||+.|+|. .++...+.+...++++|++.||++|||||++|+++|+++||+||++ |+| T Consensus 476 sg~vAGl~Ar~D~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~~~~s~~~~ 555 (666) T protein:vir:65 476 AADIAGLCARTDAVSQPWMSPAGYNRGQIMNVVKLAIEPRKAHRDRLYQAAINPVIGAGGEGFILMGDKTATTVPSPFDR 555 (666) T ss_pred HHHHHHHHHHHhccCCcEEccCCeecceeeccccceeecChhHHHhhhhCCceEEEEeCCCeEEEEecccCCCCCcccce Confidence 999999999999888888888764 2566667778888999999999999999999999999999999984 999 Q ss_pred eeehhhHHHHHHHHHHHHHHHhcccCCHHHHHHHHHHHHHHHHHHHhcCCeeeeeEEEeccCCCHHHhhCCeEEEEEEEE Q lcl|Aclame:pro 282 ISFVGLEDAIARKLEAASQRAMSKQLTKSFMEQEIKKINLFMQDLVAAEIIPGGEVYLHPTLNTVERYKNGSWYIVIDYG 361 (388) Q Consensus 282 i~vrR~~~~i~~~i~~~~~~~vfepn~~~~~~~i~~~i~~~L~~l~~~Gal~g~~v~~d~~~Nt~~~i~~G~~~~~v~~~ 361 (388) |+|||||+||+++|++.++|+|||||++.||++|+++++.||++||++|+|+||+|+||+++||+++|++|+|+++|+++ T Consensus 556 i~vrR~~~~i~~si~~~~~~~v~epn~~~l~~~i~~~i~~~L~~l~~~gal~g~~V~~d~~~nt~~~i~~G~~~~~i~~~ 635 (666) T protein:vir:65 556 INVRRLFNMLKKNIGDSSKYKLFENNDNFTRASFRMEVSQYLSTIRSLGGIYDFRVQCDTTNNTPDVIDRNEFVASMFIK 635 (666) T ss_pred EehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEcCCCCCHHHhhCCeEEEEEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ecCcceeEEEEEEEcchHHHHHHHhcC Q lcl|Aclame:pro 362 RYSPNEHMIFHLNAVDRIVEEFIEEVL 388 (388) Q Consensus 362 p~~pae~I~~~~~~~~~~~~~l~~~~~ 388 (388) |++|+|||+|++++.....+ |+||+ T Consensus 636 p~~pae~i~~~~~~~~~~~~--~~e~~ 660 (666) T protein:vir:65 636 PAKSINYIMLNFTAVATGSD--FDEII 660 (666) T ss_pred ecCCcceEEEEEEEeecCcc--HHHHH Confidence 99999999999999988544 55666 No 18 >protein:vir:108052 Length: 660 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595294;genbank:gi:161622600;genbank:GeneID:5783771 Probab=100.00 E-value=3.8e-86 Score=488.86 Aligned_cols=380 Identities=14% Similarity=0.085 Sum_probs=300.3 Q ss_pred CCCcCCCeEEEEcCCCcccccccCcceeEEEeeccccccccccCcceeeccchhhhhhcccccccccchhhhhhhhcccc Q lcl|Aclame:pro 4 IDQFEHNGISIETHEPPPPMGPPGDNVVAWVVTAPDKHADVAFSVPFRVANTADAQYLDSTGNELGTGWHAASETLKKTS 83 (388) Q Consensus 4 ~t~~~hGV~~~e~~~~~~~i~~v~tav~g~vgta~~~~~~~~~~~~v~v~s~~~~~~~~~~~~~~gtl~~a~~~~~~~~~ 83 (388) |+-..+|||++|+ +++++|..+.|++.+|+|.++.+ +.++|++|+|+.++...|+.......+..++..+|.+++ T Consensus 1 ~~~~~Pgvyv~e~-~~~~~i~~~~t~~~~~vg~~~~g----p~~~p~~v~s~~~~~~~fg~~~~~~~~~~~~~~~f~~~g 75 (660) T protein:vir:10 1 MALLSPGIELKET-SVQSTVVRNATGRAALVGKFQWG----PAFQVTQITNEVELVDLFGGPNNEVADYFMSGMNFLQYG 75 (660) T ss_pred CceecCceEEEee-cCCccccCCCcccceEEeecCCC----CCccCeEcCCHHHHHHHcCCcCCCchhHHHHHHHHHhCC Confidence 6566679999999 58999999999999999999877 678899999999999999877666677777777788877 Q ss_pred ceEEEEeccccccccccc------------------------------------------c--c---------------- Q lcl|Aclame:pro 84 VPQYFIVVPEGADDAATM------------------------------------------A--N---------------- 103 (388) Q Consensus 84 ~~~~vv~~~~~~~~~~~~------------------------------------------~--~---------------- 103 (388) ..|++||+......+... + . T Consensus 76 ~~~~vvrv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~v~~~~~~~~~~~~v~~~~a~g~~~~~~~~ta~~~~~a~ 155 (660) T protein:vir:10 76 NDLRTVRVVSREFAKNASPIAGNIETTITTAGSNYAVGDKINIKYNQTVVESEGRVTSVDTDGKILSVFIPSAKIIAYAR 155 (660) T ss_pred ceEEEEEecccccccccccccccceeEEeeccccccccceeeEeeccccccccccceeeccccceeeecccccccccccc Confidence 777776653221100000 0 0 Q ss_pred ---------------cc------------------cc------------------------------------cc----- Q lcl|Aclame:pro 104 ---------------II------------------GG------------------------------------ID----- 109 (388) Q Consensus 104 ---------------~~------------------~~------------------------------------~~----- 109 (388) +. ++ .. T Consensus 156 ~v~~~~~~~~~~~~~~~~~~~~~~~a~sv~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~G~~i~ 235 (660) T protein:vir:10 156 SLNQYPTLGPAWTAEVTSASSGVSGTITVGKIVTDSGILLTEAENSEEAITSLEFQAALKKFAMPGVVALYPGEIGSTLE 235 (660) T ss_pred ccccccccccceeEEEecccCccccceeeeeeeccCcceEEeeeccccccccccceeeccccccceeeeecccccCccee Confidence 00 00 00 Q ss_pred ------------------c--hh--------------------------------------------------------- Q lcl|Aclame:pro 110 ------------------P--TT--------------------------------------------------------- 112 (388) Q Consensus 110 ------------------~--~t--------------------------------------------------------- 112 (388) . .+ T Consensus 236 v~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 315 (660) T protein:vir:10 236 VEIVSKAAYEAGSSKMLDVYPGGGTRASIAKAVFNYGPQTDDQYAIIVRRDGAIVESVVLSTKEGEKDVYGNNIYLDDYF 315 (660) T ss_pred EEEeeccccCCcceeEEeeeeccceeeEEeeeecccccccccccccccccCCcccceeeeeccccccccccceeeeehhh Confidence 0 00 Q ss_pred --------------------------------------hhhhhHhhhhhhh-hhhhheecccccc-----hhHHHHHHHH Q lcl|Aclame:pro 113 --------------------------------------GRRTGIAALTECT-ERPTLIGAPGFSQ-----NKAVIDALAS 148 (388) Q Consensus 113 --------------------------------------g~~tgl~a~~~~~-~~p~ll~ap~~~~-----~~~v~~~l~~ 148 (388) ...+++.++.... ..+.++++|++.+ .++|+++|.+ T Consensus 316 ~~~~~~~v~~~~~~~~~~~~~~~~l~gg~~~~~~~t~~~~~~~~~~~~~~~~~~~~~l~~p~~~~~~~~~~~~v~~al~~ 395 (660) T protein:vir:10 316 AKGTSNYIYATSLNWPKGFSGIINLSGGISANDKVTAGDLMQGWDLFADREALHINLLIAGAVAGEGDEVASTVQKHVVS 395 (660) T ss_pred cCCCccEEEEEeccCCCCcccceeeeccccCccccccchhhhhhhhhhhhhhcccceEEEcCcCCCchhhhHHHHHHHHH Confidence 0000111111110 1256677787653 4579999999 Q ss_pred HhhhCc-eEEEEecCCCc------chhHHHHHHHhh--------hcccccceEEEEecceecccccccceeehhhHHHHH Q lcl|Aclame:pro 149 MAKRLK-CRAVIDGPSGS------TQDAIDLSGLLG--------GEGTGHDRVYMVDPMPAIYSRKAQGNIYVPPSTIAM 213 (388) Q Consensus 149 ~~~~~~-~~~i~d~p~~~------~~~~~~~~~~~~--------~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~s~~~a 213 (388) +|++++ ||+++|+|.+. .....+...++. ..+++|.|+++||||++++|+.++..+++|||+++| T Consensus 396 ~~~~~~~~~aiid~p~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~sg~~A 475 (660) T protein:vir:10 396 IADERQDCLAFISPPKGLLVNVPLTRAVDNLIDWRTGAGTFDANNMNISTTYAAIDGNYKYQYDKYNDVNRWVPLAADLA 475 (660) T ss_pred HHHhhCCEEEEEecCcccccccccccCHHHHHHHHhhcccccccccccCcceEEEEcCceEEecccCCceeEechhHHHH Confidence 999875 99999999642 112223333333 346889999999999999999999999999999999 Q ss_pred HHHhccccccccccccccce--eecccccccccCchhhhhhccccceEEEEEeCC-CcEEEEccccCC-----Cceeeeh Q lcl|Aclame:pro 214 GAVAAVKPWESPGNQGVLIQ--DVARVIDYNILDKSTEGDLLNRNGVSYFARTSM-GGFSLIGNRTVT-----GKFISFV 285 (388) Q Consensus 214 G~~a~~d~~~s~~n~p~~~~--g~~~~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~-~G~~~wG~rT~~-----~~~i~vr 285 (388) |++||+|..+|||+.|+|.. ++...+.+...+++.|++.||++|||+|++|++ +||++||+||++ |+||||| T Consensus 476 Gl~Ar~D~~~g~~~sPan~~~~~i~g~~~~~~~~~~~e~~~Ln~~gIn~i~~~~~~~G~~~wG~rT~~~~~s~~~~i~vr 555 (660) T protein:vir:10 476 GLCARTDDVSQPWMSPAGYNRGQILNVLKLAIEPRQAQRDRMYQEAINPVVGFAGGDGFVLFGDKTATKVPSPMDHINVR 555 (660) T ss_pred HHHHHhhccCCcEEccCCeeeceeeccceeeecCChhhHHhHhhCCceEEEEeeCCCcEEEEcccccCCCCcccceEehh Confidence 99999998777777777642 455667778889999999999999999999986 799999999974 9999999 Q ss_pred hhHHHHHHHHHHHHHHHhcccCCHHHHHHHHHHHHHHHHHHHhcCCeeeeeEEEeccCCCHHHhhCCeEEEEEEEEecCc Q lcl|Aclame:pro 286 GLEDAIARKLEAASQRAMSKQLTKSFMEQEIKKINLFMQDLVAAEIIPGGEVYLHPTLNTVERYKNGSWYIVIDYGRYSP 365 (388) Q Consensus 286 R~~~~i~~~i~~~~~~~vfepn~~~~~~~i~~~i~~~L~~l~~~Gal~g~~v~~d~~~Nt~~~i~~G~~~~~v~~~p~~p 365 (388) |||+||+++|++.++|+|||||++.||++|+++++.||++||++|+|.||+|+||+++||++||++|+|+++|+++|++| T Consensus 556 R~~~~i~~si~~~~~~~v~epn~~~l~~~i~~~i~~fL~~l~~~gal~g~~V~~d~~~nt~~di~~G~~~~~i~~~P~~p 635 (660) T protein:vir:10 556 RLFNMLKKNIGDASKYKLFELNDNFTRSSFRMEVSQYLDGIKALGGIYEGRVVCDTTVNTPAVIDRNEFIANIYVKPARS 635 (660) T ss_pred hHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEcCCCCCHHHhhCCeEEEEEEEEecCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceeEEEEEEEcchH--HHHHHHhcC Q lcl|Aclame:pro 366 NEHMIFHLNAVDRI--VEEFIEEVL 388 (388) Q Consensus 366 ae~I~~~~~~~~~~--~~~l~~~~~ 388 (388) +|||+|++++.... +++++.+++ T Consensus 636 ae~I~~~~~~~~~~~~~~e~~~~~~ 660 (660) T protein:vir:10 636 INYITLNFVATSTGADFDELIGPLV 660 (660) T ss_pred ccEEEEEEEEeecCccHHHHhhhcC Confidence 99999998888776 455555555 No 19 >protein:vir:80984 Length: 666 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469499;genbank:gi:157311456;genbank:GeneID:5602117 Probab=100.00 E-value=5.7e-86 Score=487.90 Aligned_cols=379 Identities=13% Similarity=0.097 Sum_probs=296.9 Q ss_pred CCCcCCCeEEEEcCCCcccccccCcceeEEEeeccccccccccCcceeeccchhhhhhcccccccccchhhhhhhhcccc Q lcl|Aclame:pro 4 IDQFEHNGISIETHEPPPPMGPPGDNVVAWVVTAPDKHADVAFSVPFRVANTADAQYLDSTGNELGTGWHAASETLKKTS 83 (388) Q Consensus 4 ~t~~~hGV~~~e~~~~~~~i~~v~tav~g~vgta~~~~~~~~~~~~v~v~s~~~~~~~~~~~~~~gtl~~a~~~~~~~~~ 83 (388) |+...+|||++|+ +++++|..+.|++.+|||.+..+ |.++|++|+|+.++.+.|+.......+..++..+|.+++ T Consensus 1 ~~~~~Pgvyv~e~-~~~~~~~~~~t~~~~~vg~~~~g----p~~~p~~i~~~~~~~~~fg~~~~~~~~~~~~~~~f~~~g 75 (666) T protein:vir:80 1 MTLLSPGFETKET-TLSTTIVQSATGRAALVGKFQWG----PAFQIIQVTNEVELVNKFGQPDNNTADYFMSGANFLQYG 75 (666) T ss_pred CceecCceEEEEe-cCCccccccCcccceEEeccccC----CCccceEecCHHHHHHhcCCccCccchHHHHHHHHhcCC Confidence 7677789999999 68999999999999999999877 578999999999999999876666666667777777777 Q ss_pred ceEEEEeccccccccccc--------------------------------------------c----------------- Q lcl|Aclame:pro 84 VPQYFIVVPEGADDAATM--------------------------------------------A----------------- 102 (388) Q Consensus 84 ~~~~vv~~~~~~~~~~~~--------------------------------------------~----------------- 102 (388) ..|+++|+......+... . T Consensus 76 ~~~~v~R~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~i~~~~~~~~~~~~ta~~~~~a~ 155 (666) T protein:vir:80 76 NDLRVVRVLNKEKAKNATALAGNIEFEITNEGSNYEVGDTIKIKHNRQDIETAGKVTKVDGDGKVKGVFIPTGKIIAHAK 155 (666) T ss_pred CeEEEEEecCccccccccccccceeEEEeeccccccccccccccccCcccccCcceEEEeecceeeeeecchhhhccccc Confidence 777766642110000000 0 Q ss_pred --------------cc------------c-c-----c------------------------------------------- Q lcl|Aclame:pro 103 --------------NI------------I-G-----G------------------------------------------- 107 (388) Q Consensus 103 --------------~~------------~-~-----~------------------------------------------- 107 (388) .+ . + . T Consensus 156 ~~~~~~~v~~~~~~~~~~~~~~~~~a~~V~~~~~~~~~~~~~~~~a~~~~t~~~~~~~~~~~~~~a~~a~~~g~~g~~l~ 235 (666) T protein:vir:80 156 AIGVYPELDGDWTAEFTSSSGNGSAALSVTKIVTDSGLLLTDLETSRANITNQTFLTKLQKYDMPAVSAIYAGEIGNSLE 235 (666) T ss_pred cccccceeeccceeeeccccccceeeeeeeeeecCCccceeeeccccccccccccccccccccchhhhhhccccccccee Confidence 00 0 0 0 Q ss_pred ------------------------------------ccch--------hh------------------------------ Q lcl|Aclame:pro 108 ------------------------------------IDPT--------TG------------------------------ 113 (388) Q Consensus 108 ------------------------------------~~~~--------tg------------------------------ 113 (388) .+.. .+ T Consensus 236 v~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~ 315 (666) T protein:vir:80 236 VEILARSAFKNTAPDLTMYPYGGERTAARNLIPYAPQNDNQYAFIVRRDGVVVESYVLSTLKGDKDVYGNSIYMDDFFGR 315 (666) T ss_pred eeeccccccccccccceeeeccccccccceeeeeccccccceeeEeccCCccceeeecccccccccccchhhhhhhhhcc Confidence 0000 00 Q ss_pred -----------------------------------------------hhhhHhhhhhhhhhhhheecccccc----hhHH Q lcl|Aclame:pro 114 -----------------------------------------------RRTGIAALTECTERPTLIGAPGFSQ----NKAV 142 (388) Q Consensus 114 -----------------------------------------------~~tgl~a~~~~~~~p~ll~ap~~~~----~~~v 142 (388) ..+++.++.+.. .+.++++|++++ .+++ T Consensus 316 ~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~-~~~~l~~p~~~~~~~~~~~v 394 (666) T protein:vir:80 316 GSSQYIYATAQGWVDGFSGIISLAGGVSANEATTGGVGADPFIGAMMQGWGLFAERESI-HVNLLIAGACAGEGDAFSTV 394 (666) T ss_pred ccceeeeecccccccccceEEEecCCCCcccccccccccccccccchhhhhhhhhhccc-ccceEeecCcCCcccchHHH Confidence 000111111111 134677777753 4678 Q ss_pred HHHHHHHhhhCc-eEEEEecCC--------Ccc-hhHHHHHHHhh-----hcccccceEEEEecceecccccccceeehh Q lcl|Aclame:pro 143 IDALASMAKRLK-CRAVIDGPS--------GST-QDAIDLSGLLG-----GEGTGHDRVYMVDPMPAIYSRKAQGNIYVP 207 (388) Q Consensus 143 ~~~l~~~~~~~~-~~~i~d~p~--------~~~-~~~~~~~~~~~-----~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p 207 (388) +.+|.++|++++ |++++|.|. +.+ ++..+++.... ..+++|.|+++||||++++|+.++..+++| T Consensus 395 ~~~~~~~~~~~~~~~~~~~~~~~~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~l~~p~~~~~d~~~~~~~~~p 474 (666) T protein:vir:80 395 QKHAVSIGDERQDCLVMVSPPRSTVVNIPVTTAIDNLIAWREGSGNYNENNMNINTTYAVIDGNYKYQYDKYNDVNRWVP 474 (666) T ss_pred HHHHHHHHHhhcceEEEeecceeEEeecCCCCCHHHHHHHHHhcccchhhhcccCcceEEEEcCceEEecccCCceeEec Confidence 899999999774 777776653 222 33333332211 246889999999999999999999999999 Q ss_pred hHHHHHHHHhccccccccccccccce--eecccccccccCchhhhhhccccceEEEEEeCCCcEEEEccccCC-----Cc Q lcl|Aclame:pro 208 PSTIAMGAVAAVKPWESPGNQGVLIQ--DVARVIDYNILDKSTEGDLLNRNGVSYFARTSMGGFSLIGNRTVT-----GK 280 (388) Q Consensus 208 ~s~~~aG~~a~~d~~~s~~n~p~~~~--g~~~~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~-----~~ 280 (388) ||+++||++||+|..+|||+.|+|.. ++...+.+.+.+++.|++.||++|||||++|+++|+++||+||++ |+ T Consensus 475 ~sg~~AGl~Ar~D~~~g~~~sPan~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~g~G~~~wG~rT~~~~~s~~~ 554 (666) T protein:vir:80 475 LAADIAGLCARTDAVSQPWMSPAGYNRGQIMNVVKLAIEPRKAHRDRLYQAAINPVIGAGGEGFILMGDKTATTVPSPFD 554 (666) T ss_pred hHHHHHHHHHHHhhcCCceEccCCeecceeeccccceeecChhHHHhhhhCCeeEEEEeCCCeEEEEccccCCCCCcccc Confidence 99999999999998888888887743 677777888899999999999999999999999999999999974 99 Q ss_pred eeeehhhHHHHHHHHHHHHHHHhcccCCHHHHHHHHHHHHHHHHHHHhcCCeeeeeEEEeccCCCHHHhhCCeEEEEEEE Q lcl|Aclame:pro 281 FISFVGLEDAIARKLEAASQRAMSKQLTKSFMEQEIKKINLFMQDLVAAEIIPGGEVYLHPTLNTVERYKNGSWYIVIDY 360 (388) Q Consensus 281 ~i~vrR~~~~i~~~i~~~~~~~vfepn~~~~~~~i~~~i~~~L~~l~~~Gal~g~~v~~d~~~Nt~~~i~~G~~~~~v~~ 360 (388) ||||||||+||+++|++.++|+||||||+.||++|+++++.||++||++|+|+||+|+||+++||+++|++|+|+++|++ T Consensus 555 ~i~vRRl~~~i~~si~~~~~~~v~epn~~~l~~~i~~~i~~~L~~l~~~gal~g~~V~~d~~~nt~~di~~G~~~~~i~~ 634 (666) T protein:vir:80 555 RINVRRLFNMLKKNIGDSSKYKLFENNDNFTRASFRMEVSQYLSTIRSLGGIYDFRVQCDTTNNTPDVIDRNEFVASMFI 634 (666) T ss_pred eeehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhcCceeeeEEEEcCCCCCHHHhhCCeEEEEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EecCcceeEEEEEEEcchHHH--HHHHhcC Q lcl|Aclame:pro 361 GRYSPNEHMIFHLNAVDRIVE--EFIEEVL 388 (388) Q Consensus 361 ~p~~pae~I~~~~~~~~~~~~--~l~~~~~ 388 (388) +|++|+|||+|++++.....+ +++..|= T Consensus 635 ~P~~Pae~I~~~~~~~~~~~~~~e~~~~~~ 664 (666) T protein:vir:80 635 KPAKSINYIMLNFTAVATGSDFDEIIGPVN 664 (666) T ss_pred EecCCcceEEEEEEEeecCccHHHHHHHHh Confidence 999999999999998887544 4444444 No 20 >protein:vir:7206 Length: 659 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049780;genbank:gi:9632592;genbank:GeneID:1258597 Probab=100.00 E-value=8.7e-86 Score=486.92 Aligned_cols=378 Identities=12% Similarity=0.043 Sum_probs=305.9 Q ss_pred CCCcCCCeEEEEcCCCcccccccCcceeEEEeeccccccccccCcceeeccchhhhhhcccccccccchhhhhhhhcccc Q lcl|Aclame:pro 4 IDQFEHNGISIETHEPPPPMGPPGDNVVAWVVTAPDKHADVAFSVPFRVANTADAQYLDSTGNELGTGWHAASETLKKTS 83 (388) Q Consensus 4 ~t~~~hGV~~~e~~~~~~~i~~v~tav~g~vgta~~~~~~~~~~~~v~v~s~~~~~~~~~~~~~~gtl~~a~~~~~~~~~ 83 (388) |+.-.+|||++|+..+++++.. .|++.+|+|.++.+ +.++|++++|+.++.+.|+.......+..++..+|.+++ T Consensus 1 ~~~~~PgVyvee~~~~~~~~~~-~ts~~~fvG~~~~G----p~~~p~~i~s~~~~~~~fG~~~~~~~~~~~~~~~f~ngg 75 (659) T protein:vir:72 1 MTLLSPGIELKETTVQSTVVNN-STGTAALAGKFQWG----PAFQIKQVTNEVDLVNTFGQPTAETADYFMSAMNFLQYG 75 (659) T ss_pred CceecCceEEEEecCCcccccC-CCcceEEEeecCCC----CCcccEEecCHHHHHHHcCCcCCCCchhHHHHHHHHhCC Confidence 7666779999999999977655 89999999999877 578899999999999999987777778888888888888 Q ss_pred ceEEEEeccccccccccc---------------------------cc----------------------c---------- Q lcl|Aclame:pro 84 VPQYFIVVPEGADDAATM---------------------------AN----------------------I---------- 104 (388) Q Consensus 84 ~~~~vv~~~~~~~~~~~~---------------------------~~----------------------~---------- 104 (388) ..|++||+......+... +. + T Consensus 76 ~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~g~v~~~~~~~~~~~~~v~t~~~~a~~~ 155 (659) T protein:vir:72 76 NDLRVVRAVDRDTAKNSSPIAGNIDYTISTPGSNYAVGDKITVKYVSDDIETEGKITEVDADGKIKKINIPTGKNYAKAK 155 (659) T ss_pred ceEEEEEccCCcccccccccccccceeecccccccccceeeeeeeccccccccceEEEeeccccceeeeecccccccccc Confidence 888888752110000000 00 0 Q ss_pred -------------------cc------------------------------c---------------------c--cc-- Q lcl|Aclame:pro 105 -------------------IG------------------------------G---------------------I--DP-- 110 (388) Q Consensus 105 -------------------~~------------------------------~---------------------~--~~-- 110 (388) .. . . +. T Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~a~~~~~v~v~~~~~~~~~~v~~~~~a~~~~~~~~~v~~~~~~~~~a~~~gt~g~~~t 235 (659) T protein:vir:72 156 EVGEYPTLGSNWTAEISSSSSGLAAVITLGKIITDSGILLAEIENAEAAMTAVDFQANLKKYGIPGVVALYPGELGDKIE 235 (659) T ss_pred ccccccccccceeeEEeeccccccceEEEEEeecCcceeeeeccccchhhhcccccccccccccceeeecccccccccee Confidence 00 0 0 00 Q ss_pred ----------------------------------------------------------------h--------------- Q lcl|Aclame:pro 111 ----------------------------------------------------------------T--------------- 111 (388) Q Consensus 111 ----------------------------------------------------------------~--------------- 111 (388) . T Consensus 236 v~i~~~~~~~~~~~~~v~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 315 (659) T protein:vir:72 236 IEIVSKADYAKGASALLPIYPGGGTRASTAKAVFGYGPQTDSQYAIIVRRNDAIVQSVVLSTKRGEKDIYDSNIYIDDFF 315 (659) T ss_pred EEEccccccccceeeeeecccccccccccceeeeeeecccccccceeeecccceeeeeeeeeccccccccchhhhhhhhh Confidence 0 Q ss_pred -hh------------------------------------hhhhHhhhhhh-hhhhhheecccccc-----hhHHHHHHHH Q lcl|Aclame:pro 112 -TG------------------------------------RRTGIAALTEC-TERPTLIGAPGFSQ-----NKAVIDALAS 148 (388) Q Consensus 112 -tg------------------------------------~~tgl~a~~~~-~~~p~ll~ap~~~~-----~~~v~~~l~~ 148 (388) .+ ..+++.++... ...++++++||+++ .+.|.++|.+ T Consensus 316 ~~~~~~~v~~~~~~~~~~~~~~~~l~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~p~~~~~~~~~~~~v~~~l~~ 395 (659) T protein:vir:72 316 AKGGSEYIFATAQNWPEGFSGILTLSGGLSSNAEVTAGDLMEAWDFFADRESVDVQLFIAGSCAGESLETASTVQKHVVS 395 (659) T ss_pred hcCCceEEEEEecccCCcccccccccccccccccccchhHHHHHHHhhhccccceeEEEecCCCCcchhhhHHHHHHHHH Confidence 00 00000000000 01367888898764 3578999999 Q ss_pred HhhhCc-eEEEEecCCCc---------chhHHHHHHHh-----hhcccccceEEEEecceecccccccceeehhhHHHHH Q lcl|Aclame:pro 149 MAKRLK-CRAVIDGPSGS---------TQDAIDLSGLL-----GGEGTGHDRVYMVDPMPAIYSRKAQGNIYVPPSTIAM 213 (388) Q Consensus 149 ~~~~~~-~~~i~d~p~~~---------~~~~~~~~~~~-----~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~s~~~a 213 (388) +|++++ +++++|.|... .+....++... ...+++|+|+++||||++++|+.++..+++|||+++| T Consensus 396 ~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~sg~vA 475 (659) T protein:vir:72 396 IGDARQDCLVLCSPPRETVVGIPVTRAVDNLVNWRTAAGSYTDNNFNISSTYAAIDGNHKYQYDKYNDVNRWVPLAADIA 475 (659) T ss_pred HHhhhCCEEEEEcCccccccCCCcccCHHHHHHHHhhccccccccccccceeEEEEcCceeeccccCCceEEechHHHHH Confidence 999875 89999988532 22333333221 2246789999999999999999999999999999999 Q ss_pred HHHhccccccccccccccce--eecccccccccCchhhhhhccccceEEEEEeCCCcEEEEccccCC-----Cceeeehh Q lcl|Aclame:pro 214 GAVAAVKPWESPGNQGVLIQ--DVARVIDYNILDKSTEGDLLNRNGVSYFARTSMGGFSLIGNRTVT-----GKFISFVG 286 (388) Q Consensus 214 G~~a~~d~~~s~~n~p~~~~--g~~~~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~-----~~~i~vrR 286 (388) |++||+|..+|||+.|+|.. ++...+.+.+.++++|++.||++|||||++|+++|+++||+||++ |+||+||| T Consensus 476 Gl~Ar~D~~~G~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~g~G~~~wG~rT~~~~~s~~~~i~vrR 555 (659) T protein:vir:72 476 GLCARTDNVSQTWMSPAGYNRGQILNVIKLAIETRQAQRDRLYQEAINPVTGTGGDGYVLYGDKTATSVPSPFDRINVRR 555 (659) T ss_pred HHHHHhhccCCcEEccCCeeeceeeccccccccCChhHHHHHhhCCceEEEEecCCeEEEEcccccCCCCcccceEeehh Confidence 99999999999888888743 566677788889999999999999999999999999999999974 99999999 Q ss_pred hHHHHHHHHHHHHHHHhcccCCHHHHHHHHHHHHHHHHHHHhcCCeeeeeEEEeccCCCHHHhhCCeEEEEEEEEecCcc Q lcl|Aclame:pro 287 LEDAIARKLEAASQRAMSKQLTKSFMEQEIKKINLFMQDLVAAEIIPGGEVYLHPTLNTVERYKNGSWYIVIDYGRYSPN 366 (388) Q Consensus 287 ~~~~i~~~i~~~~~~~vfepn~~~~~~~i~~~i~~~L~~l~~~Gal~g~~v~~d~~~Nt~~~i~~G~~~~~v~~~p~~pa 366 (388) ||+||+++|++.++|+|||||++.||++|+++|++||++||++|+|.||+|+||+++||++||++|+|+++|+++|++|+ T Consensus 556 ~~~~i~~si~~~~~~~v~e~n~~~l~~~i~~~i~~fL~~l~~~gal~~~~V~~d~~~nt~~~i~~G~~~~~i~~~p~~pa 635 (659) T protein:vir:72 556 LFNMLKTNIGRSSKYRLFELNNAFTRSSFRTETAQYLQGNKALGGIYEYRVVCDTTNNTPSVIDRNEFVATFYIQPARSI 635 (659) T ss_pred HHHHHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHHHHHHHhcCceeeEEEEEcCCCCCHHHhhCCeEEEEEEEEecCCc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeEEEEEEEcchHHHHHHHhcC Q lcl|Aclame:pro 367 EHMIFHLNAVDRIVEEFIEEVL 388 (388) Q Consensus 367 e~I~~~~~~~~~~~~~l~~~~~ 388 (388) |||+|+|++.....+ |+||. T Consensus 636 e~I~~~~~~~~~~~~--~~e~~ 655 (659) T protein:vir:72 636 NYITLNFVATATGAD--FDELT 655 (659) T ss_pred cEEEEEEEEeecCcc--hHHhc Confidence 999999999888877 89999 No 21 >protein:vir:103456 Length: 659 # NCBI annotation: tail sheath monomer # Family: family:all:661 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803108;genbank:gi:116326388;genbank:GeneID:4405485 Probab=100.00 E-value=9.7e-86 Score=486.66 Aligned_cols=378 Identities=12% Similarity=0.062 Sum_probs=306.9 Q ss_pred CCCcCCCeEEEEcCCCcccccccCcceeEEEeeccccccccccCcceeeccchhhhhhcccccccccchhhhhhhhcccc Q lcl|Aclame:pro 4 IDQFEHNGISIETHEPPPPMGPPGDNVVAWVVTAPDKHADVAFSVPFRVANTADAQYLDSTGNELGTGWHAASETLKKTS 83 (388) Q Consensus 4 ~t~~~hGV~~~e~~~~~~~i~~v~tav~g~vgta~~~~~~~~~~~~v~v~s~~~~~~~~~~~~~~gtl~~a~~~~~~~~~ 83 (388) |+.-.+|||++|+..+++++.. .|++.+|+|+++.+ +.++|++++|+.++.+.|+.......+..++..+|.|++ T Consensus 1 ~~~~~PgVyv~e~~~~~~~~~~-~ts~~~fvG~~~~G----p~~~p~~i~s~~d~~~~fG~~~~~~~~~~~v~~~f~ngg 75 (659) T protein:vir:10 1 MTLLSPGIELKETTVQSTVVNN-STGTAALAGKFQWG----PAFQIKQVTNEVDLVNTFGQPTAETADYFMSAMNFLQYG 75 (659) T ss_pred CceecCceEEEEecCCceeccc-CccceEEEecccCC----CCCccEEecCHHHHHHHcCCcCCCcchhHHHHHHHhhCC Confidence 6666779999999999988866 79999999999877 568899999999999999988888888889999999999 Q ss_pred ceEEEEeccccccccc-------------------------------c----cccc------------------------ Q lcl|Aclame:pro 84 VPQYFIVVPEGADDAA-------------------------------T----MANI------------------------ 104 (388) Q Consensus 84 ~~~~vv~~~~~~~~~~-------------------------------~----~~~~------------------------ 104 (388) ..|++||+......+. . ...+ T Consensus 76 ~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~v~~vd~~~~~~~~~i~~~~~~~~~~ 155 (659) T protein:vir:10 76 NDLRVVRAVDRDTAKNSSPIAGNIEYTISTPGSNYAVGDKITVKYVSDAIETEGKITEVDTDGKIKKINIPTAKIIAKAK 155 (659) T ss_pred CeEEEEEccCcccccccccccccceeeEeecccccccccceeeeecCCCccccceeeEEecccccceeeecccccccccc Confidence 9888887521110000 0 0000 Q ss_pred -cc---------------------------cc-----------cc---------------------hh------------ Q lcl|Aclame:pro 105 -IG---------------------------GI-----------DP---------------------TT------------ 112 (388) Q Consensus 105 -~~---------------------------~~-----------~~---------------------~t------------ 112 (388) .+ .. .. .+ T Consensus 156 ~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~a~t~~~~~~~~~~~~~~~v~a~~~G~~g~~~t 235 (659) T protein:vir:10 156 EVGEYPTLGSNWTAEISSSSSGLAAVITLGKIITDSGILLAEIENAEAAMTAVDFQANLKKYGIPGVVALYPGELGDKIE 235 (659) T ss_pred cccccceeeeeeeeeeeeeccccceeeEEeeeecCCceeEEeeccccccccccccccceeecccccccccccceecccce Confidence 00 00 00 00 Q ss_pred ----------------------------------------------------------------hh-------------- Q lcl|Aclame:pro 113 ----------------------------------------------------------------GR-------------- 114 (388) Q Consensus 113 ----------------------------------------------------------------g~-------------- 114 (388) +. T Consensus 236 v~~~~~a~~~~~~~v~v~~~~~~~~~a~~~t~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 315 (659) T protein:vir:10 236 IEIVSKADYAKGASALLPIYPGGGTRASTAKAVFGYGPQTDSQYAIIVRRNDAIVQSVVLSTKRGEKDIYDSNIYIDDFF 315 (659) T ss_pred EEEechhhccccceeeeeeeeecccccccceeeeeeccccccchhhccccccceeeeeeeeccccccccccchhhhhhhh Confidence 00 Q ss_pred ----------------------------------------hhhHhhhhhh-hhhhhheecccccc-----hhHHHHHHHH Q lcl|Aclame:pro 115 ----------------------------------------RTGIAALTEC-TERPTLIGAPGFSQ-----NKAVIDALAS 148 (388) Q Consensus 115 ----------------------------------------~tgl~a~~~~-~~~p~ll~ap~~~~-----~~~v~~~l~~ 148 (388) .+++.++... ...++++++|+++. .++|..+|.+ T Consensus 316 ~~~~~~~v~~~~~~~~~~~~~~~~l~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~il~~p~~~~~~~~~~~~v~~al~~ 395 (659) T protein:vir:10 316 AKGGSEYIFATAQNWPEGFSGILTLSGGLSSNAEVTAGDLMEAWDFFADRESVDVQLFIAGSCAGESLETASTVQKHVVS 395 (659) T ss_pred ccCcccEEEEeecccCCCccceeeecccccccccccchhHHHHHHHhhhccccceeEEEecCCCCcchhhhHHHHHHHHH Confidence 0000000000 01367888888754 3678999999 Q ss_pred HhhhC-ceEEEEecCCCc---------chhHHHHHHHh-----hhcccccceEEEEecceecccccccceeehhhHHHHH Q lcl|Aclame:pro 149 MAKRL-KCRAVIDGPSGS---------TQDAIDLSGLL-----GGEGTGHDRVYMVDPMPAIYSRKAQGNIYVPPSTIAM 213 (388) Q Consensus 149 ~~~~~-~~~~i~d~p~~~---------~~~~~~~~~~~-----~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~s~~~a 213 (388) +|+++ ++++++|.|... .+...+++... ...+++|+|+++||||++++|+.++..+++|||+++| T Consensus 396 ~~~~~~~~~~~~d~p~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~s~~~~l~~p~~~~~d~~~~~~~~~p~sg~~A 475 (659) T protein:vir:10 396 IGDARQDCLVLCSPPRETVVGIPVTRAVDNLVNWRTAAGSYTDNNFNISSTYAAIDGNYKYQYDKYNDVNRWVPLAADIA 475 (659) T ss_pred HHHhhCCeEEEEcCccccccCCCcccCHHHHHHHHHhcccccccccccCcceEEEEeCcEEEecccCCceEEechHHHHH Confidence 99987 589999988532 12333333221 1235789999999999999999999999999999999 Q ss_pred HHHhccccccccccccccce--eecccccccccCchhhhhhccccceEEEEEeCCCcEEEEccccCC-----Cceeeehh Q lcl|Aclame:pro 214 GAVAAVKPWESPGNQGVLIQ--DVARVIDYNILDKSTEGDLLNRNGVSYFARTSMGGFSLIGNRTVT-----GKFISFVG 286 (388) Q Consensus 214 G~~a~~d~~~s~~n~p~~~~--g~~~~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~-----~~~i~vrR 286 (388) |++||+|..+|||+.|+|.. ++...+.+...++++|++.||++|||||++|+++|+++||+||++ |+|||||| T Consensus 476 Gl~Ar~D~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~~~~s~~~~i~vrR 555 (659) T protein:vir:10 476 GLCARTDNVSQTWMSPAGYNRGQILNVIKLAIETRQAQRDRLYQEAINPVTGTGGDGYVLYGDKTATSVPSPFDRINVRR 555 (659) T ss_pred HHHHHHhccCCceEccCCceeeeeeccccceecCCHhHHHHHhhCCeeEEEEeCCCeEEEEcccccCCCCcccceEehhh Confidence 99999999998888888743 566666778888999999999999999999999999999999984 99999999 Q ss_pred hHHHHHHHHHHHHHHHhcccCCHHHHHHHHHHHHHHHHHHHhcCCeeeeeEEEeccCCCHHHhhCCeEEEEEEEEecCcc Q lcl|Aclame:pro 287 LEDAIARKLEAASQRAMSKQLTKSFMEQEIKKINLFMQDLVAAEIIPGGEVYLHPTLNTVERYKNGSWYIVIDYGRYSPN 366 (388) Q Consensus 287 ~~~~i~~~i~~~~~~~vfepn~~~~~~~i~~~i~~~L~~l~~~Gal~g~~v~~d~~~Nt~~~i~~G~~~~~v~~~p~~pa 366 (388) ||+||+++|++.++|+|||||++.||++|+++|+.||++||++|+|+||+|+||+++||+++|++|+|+++|+++|++|+ T Consensus 556 ~~~~i~~si~~~~~~~v~e~n~~~l~~~i~~~i~~fL~~l~~~gal~~~~V~~d~~~nt~~~i~~G~~~~~i~~~p~~pa 635 (659) T protein:vir:10 556 LFNMLKTNIGRSSKYRLFELNNAFTRSSFRTETAQYLQGIKALGGIYEYRVVCDTTNNTPSVIDRNEFVATFYIQPARSI 635 (659) T ss_pred HHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhcCceeeEEEEEcCCCCCHHHhhCCeEEEEEEEEecCCc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeEEEEEEEcchHHHHHHHhcC Q lcl|Aclame:pro 367 EHMIFHLNAVDRIVEEFIEEVL 388 (388) Q Consensus 367 e~I~~~~~~~~~~~~~l~~~~~ 388 (388) |||+|++++.....+ |+||+ T Consensus 636 e~i~~~~~~~~~~~~--~~e~~ 655 (659) T protein:vir:10 636 NYITLNFVATATGAD--FDELT 655 (659) T ss_pred ceEEEEEEEEecCcc--hHHhh Confidence 999999999988888 99999 No 22 >protein:vir:106427 Length: 679 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944106;genbank:gi:38640150;genbank:GeneID:2658188 Probab=100.00 E-value=8.4e-86 Score=487.01 Aligned_cols=380 Identities=12% Similarity=0.087 Sum_probs=302.6 Q ss_pred CCCcCCCeEEEEcCCCcccccccCcceeEEEeeccccccccccCcceeeccchhhhhhcccccccccchhhhhhhhcccc Q lcl|Aclame:pro 4 IDQFEHNGISIETHEPPPPMGPPGDNVVAWVVTAPDKHADVAFSVPFRVANTADAQYLDSTGNELGTGWHAASETLKKTS 83 (388) Q Consensus 4 ~t~~~hGV~~~e~~~~~~~i~~v~tav~g~vgta~~~~~~~~~~~~v~v~s~~~~~~~~~~~~~~gtl~~a~~~~~~~~~ 83 (388) |+-..+|||++|+ +++++|..+.|++.+|||.++.+ |.++|++|+|+.++...|+.......+..++..+|.+++ T Consensus 1 ~~~~~Pgvyv~e~-~~~~~i~~~~t~~~~~vg~~~~g----p~~~p~~i~s~~~~~~~fg~~~~~~~~~~~~~~~f~~gg 75 (679) T protein:vir:10 1 MTLLSPGVETKEI-NLQTTIARSSTGRAALVGKFNWG----PAYQISQVVSEVDLVDKFGRPDDQTADSFFSGVNFLNYG 75 (679) T ss_pred CceecCceEEEee-cCCcccccCccccceeeecccCC----CCccCEEecCHHHHHHHcCCcccccchHHHHHHHHHhCC Confidence 5555669999999 59999999999999999999877 678999999999999999987777788888888888888 Q ss_pred ceEEEEecccccccccc---------------------------------------c--------------c------cc Q lcl|Aclame:pro 84 VPQYFIVVPEGADDAAT---------------------------------------M--------------A------NI 104 (388) Q Consensus 84 ~~~~vv~~~~~~~~~~~---------------------------------------~--------------~------~~ 104 (388) ..|+++|+......+.. . . .. T Consensus 76 ~~~~vvrv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~s~~~~~~~~~~~~~~~~~~~~v~~~~~~~~a~~ 155 (679) T protein:vir:10 76 NDLRLVRVLNETKSRNSSALYQSLSYTITSPGVDYKVGDVVNVLQGGNVIATGKVTVVNASGGIVAFYVPTAAIIDKAKS 155 (679) T ss_pred CeEEEEEccCcccccccccccccccccccccccccccccceeeeeCCCcccceeEEEeeccCceeeeeeccccccccccc Confidence 88888875321110000 0 0 00 Q ss_pred -----------------------------cccccc-----------------------------------------hhhh Q lcl|Aclame:pro 105 -----------------------------IGGIDP-----------------------------------------TTGR 114 (388) Q Consensus 105 -----------------------------~~~~~~-----------------------------------------~tg~ 114 (388) +..... ..+. T Consensus 156 ~~~~~~l~~a~~~~~~~~t~~~g~~~~~~v~~v~~~~~~~~~~~~~a~~~i~~~~~~~~t~~~~~~~~~~~~~~A~~~g~ 235 (679) T protein:vir:10 156 LNDYPALDNAWQIQFAAGGPGAGQAATATVVGINLDSTIFVPNDEYAMSAISERSETKRTFIDICEEMKVPAIVARYAGT 235 (679) T ss_pred ccccceecccceeeeeeccccccceeeeeeeeeccCCceeeccccccccccccccccchhhhhhhhccccceeeeecccc Confidence 000000 0000 Q ss_pred -----------------------------------------------------------------hhh------------ Q lcl|Aclame:pro 115 -----------------------------------------------------------------RTG------------ 117 (388) Q Consensus 115 -----------------------------------------------------------------~tg------------ 117 (388) ..+ T Consensus 236 ~gn~i~v~~va~~~~~~~~~~~a~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vvv~~~g~~~~~~~~~~~~ 315 (679) T protein:vir:10 236 YGDNIKVLMIAYKDYYKFNEAGKIVSVNTINPKVFPTGLDYGNVTPSSYLEFGPQNESQFAFIVFNNGVAVESKILSTKP 315 (679) T ss_pred cCCcceEEEEeecccccccccccccccccccccccccccccccceeeeecccccccccceeeEEecccccccceeeeccc Confidence 000 Q ss_pred -----------H---------------------------------------hhh--------hhh--hhhhhheeccccc Q lcl|Aclame:pro 118 -----------I---------------------------------------AAL--------TEC--TERPTLIGAPGFS 137 (388) Q Consensus 118 -----------l---------------------------------------~a~--------~~~--~~~p~ll~ap~~~ 137 (388) + .+. ... ...+.++++|++. T Consensus 316 ~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~p~~~ 395 (679) T protein:vir:10 316 GDRDIYGTSIYINEYFGNGYSSFVQGVAESWPVGYTGVLAFGGGQSSNTDISAAEFMKGWDMFADREHTDVNLFIAGAVA 395 (679) T ss_pred ccccccchhhhhhhhhcCcccceeeeccccccccccceeeccCCccCCCccchhhhhhhhhhhhcccccccceEEecCCC Confidence 0 000 000 0124567888775 Q ss_pred c-----hhHHHHHHHHHhhhCc-eEEEEecCCCcc---------hhHHHHHHH--------hhhcccccceEEEEeccee Q lcl|Aclame:pro 138 Q-----NKAVIDALASMAKRLK-CRAVIDGPSGST---------QDAIDLSGL--------LGGEGTGHDRVYMVDPMPA 194 (388) Q Consensus 138 ~-----~~~v~~~l~~~~~~~~-~~~i~d~p~~~~---------~~~~~~~~~--------~~~~~~~s~~~~~~~p~~~ 194 (388) . .++|+.+|..+|++++ |++|+|+|.... ++..+++.. ....+++|.|+++||||++ T Consensus 396 ~~~~~~~~~v~~~l~~~~~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~s~~~~~~~p~~~ 475 (679) T protein:vir:10 396 GEGAQIASTVQKAVVAIADERRDCLVLISPPREYMINQPAASVVRKLVDWRRGVNQAGISLDDNMNIGTTYASVDGNYKY 475 (679) T ss_pred CCchhhhHHHHHHHHHHHHhhCCeEEEEeccccccccccccchHHHHHHHHhhcccccchhhhhhccCcceEEEEcccee Confidence 3 3678999999999875 999999986432 223333321 2234678999999999999 Q ss_pred cccccccceeehhhHHHHHHHHhccccccccccccccce--eecccccccccCchhhhhhccccceEEEEEeCCCcEEEE Q lcl|Aclame:pro 195 IYSRKAQGNIYVPPSTIAMGAVAAVKPWESPGNQGVLIQ--DVARVIDYNILDKSTEGDLLNRNGVSYFARTSMGGFSLI 272 (388) Q Consensus 195 ~~~~~~~~~~~~p~s~~~aG~~a~~d~~~s~~n~p~~~~--g~~~~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~w 272 (388) ++|+.++..+++|||+++||++||+|..+|||+.|+|.. ++...+.+.+.++++|++.||++|||+|++|+++|+++| T Consensus 476 ~~d~~~~~~~~~p~sg~vAGl~Ar~D~~~g~~~sPan~~~~~i~g~~~~~~~~~~~~~~~Ln~~gin~i~~~~g~G~~~w 555 (679) T protein:vir:10 476 QYDKYNDVNRWIPLAADIAGLCARTDTVGQPWQSPAGFNRGQIVNVIKLAVDTRQAHRDEMYTNGINPIVGFAGQGYILY 555 (679) T ss_pred eecccCCceEEechHHHHHHHHHHhhccCCcEECcCCeeeccccccccceeecChhhHHhhhhCCceEEEEecCCeEEEE Confidence 999999999999999999999999998888888888743 455566778888999999999999999999999999999 Q ss_pred ccccCC-----CceeeehhhHHHHHHHHHHHHHHHhcccCCHHHHHHHHHHHHHHHHHHHhcCCeeeeeEEEeccCCCHH Q lcl|Aclame:pro 273 GNRTVT-----GKFISFVGLEDAIARKLEAASQRAMSKQLTKSFMEQEIKKINLFMQDLVAAEIIPGGEVYLHPTLNTVE 347 (388) Q Consensus 273 G~rT~~-----~~~i~vrR~~~~i~~~i~~~~~~~vfepn~~~~~~~i~~~i~~~L~~l~~~Gal~g~~v~~d~~~Nt~~ 347 (388) |+||++ |+||||||||+||+++|++.++|+|||||++.||++|+++|++||++||++|+|+||+|+||+++||++ T Consensus 556 G~rT~~~~~s~~~~i~vrR~~~~i~~si~~~~~~~v~epn~~~~~~~i~~~i~~fL~~l~~~gal~gf~v~~d~~~nt~~ 635 (679) T protein:vir:10 556 GDKTASQAPTPFDRINVRRLFNLLKKSISESAKYKLFELNDAFTRSSFRSEVGSYLDTIRSLGGIYDFRVVCDESNNTPA 635 (679) T ss_pred cccccCCCCcccceEehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEcCCCCCHH Confidence 999984 999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HhhCCeEEEEEEEEecCcceeEEEEEEEcchH--HHHHHHhcC Q lcl|Aclame:pro 348 RYKNGSWYIVIDYGRYSPNEHMIFHLNAVDRI--VEEFIEEVL 388 (388) Q Consensus 348 ~i~~G~~~~~v~~~p~~pae~I~~~~~~~~~~--~~~l~~~~~ 388 (388) +|++|+|+++|+++|++|+|||+|+|++.... +++++.++= T Consensus 636 ~i~~G~~~~~i~~~p~~pae~i~~~~~~~~~~~~~~e~~~~~~ 678 (679) T protein:vir:10 636 VIDRNEFVATILIKPARSINYITLSFVATSTGADFDELVGSFQ 678 (679) T ss_pred HhhCCeEEEEEEEEecCCccEEEEEEEEeecCccHHHHHHHhc Confidence 99999999999999999999999998887664 555444444 No 23 >protein:vir:101187 Length: 663 # NCBI annotation: tail sheath protein # Family: family:all:661 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932509;genbank:gi:37651635;genbank:GeneID:2610680 Probab=100.00 E-value=5.2e-85 Score=482.67 Aligned_cols=378 Identities=13% Similarity=0.099 Sum_probs=299.1 Q ss_pred CCCcCCCeEEEEcCCCcccccccCcceeEEEeeccccccccccCcceeeccchhhhhhcccccccccchhhhhhhhcccc Q lcl|Aclame:pro 4 IDQFEHNGISIETHEPPPPMGPPGDNVVAWVVTAPDKHADVAFSVPFRVANTADAQYLDSTGNELGTGWHAASETLKKTS 83 (388) Q Consensus 4 ~t~~~hGV~~~e~~~~~~~i~~v~tav~g~vgta~~~~~~~~~~~~v~v~s~~~~~~~~~~~~~~gtl~~a~~~~~~~~~ 83 (388) |+-..+|||++|+ +++++|..+.|++.+|+|.++-+ |.++|++|+|+.++.+.|+.......+..++..+|.|++ T Consensus 1 ~~~~~PgVyv~e~-~~~~~i~~v~ts~~~fvG~~~~G----p~~~p~~i~s~~d~~~~fG~~~~~~~~~~~v~~~f~ngg 75 (663) T protein:vir:10 1 MALLSPGIEMKET-SINSTVVRSATGRAAIVGKFAWG----PAYEVRQVTNEVELVDMFGSPDNVTAPYFMSAMNFLQYG 75 (663) T ss_pred CceecCceEEEEe-cCcccccccCccceeEEeeeccC----CCCccEEecCHHHHHHHhCCcCccchhHHHHHHHHHhCC Confidence 6556679999999 69999999999999999999877 568999999999999999987777788889999999999 Q ss_pred ceEEEEeccccccccccc------------------------------------------------cccc---------- Q lcl|Aclame:pro 84 VPQYFIVVPEGADDAATM------------------------------------------------ANII---------- 105 (388) Q Consensus 84 ~~~~vv~~~~~~~~~~~~------------------------------------------------~~~~---------- 105 (388) ..++++|+..+...+... +... T Consensus 76 ~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~v~~~~~~~~~~~~~~~~~~n~~~~~v~~~~a~~~~~~~ 155 (663) T protein:vir:10 76 NDLRLVRVIDMEKAKNASPLVNQVSVTITTEGQGYTVGDAITVKYNNATITEAGKVTAVDSDGKIKSLFVPTAEIIAKTR 155 (663) T ss_pred CeEEEEEccCCcccccccccCCcceeeeeccccCccccccccccccccccccccceeeeccCCceEEEEecccccccccc Confidence 999998864321110000 0000 Q ss_pred --------------------cc----------c-cc-------------hh---------------------hhh----- Q lcl|Aclame:pro 106 --------------------GG----------I-DP-------------TT---------------------GRR----- 115 (388) Q Consensus 106 --------------------~~----------~-~~-------------~t---------------------g~~----- 115 (388) +. . +. .+ |.. T Consensus 156 ~v~~~~~~~~~~~~~~~~~~~~~~~~~~v~~vv~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~a~~~G~~Gn~i~ 235 (663) T protein:vir:10 156 QLGTYPTLGDNWRIDVSGASGGSAAALALGNIVVDSGVTFGNSEDAPAVMTSPAVMEKYAKFGMPLVSAVYPGEIGSTVE 235 (663) T ss_pred ccceeeeccccceeEeeeccccccccccccceecccceeeEeeccccccccccchhhhcccccceeeeeeccccccccee Confidence 00 0 00 00 000 Q ss_pred -------------------------------------------------------------------------------- Q lcl|Aclame:pro 116 -------------------------------------------------------------------------------- 115 (388) Q Consensus 116 -------------------------------------------------------------------------------- 115 (388) T Consensus 236 v~i~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~~~~~~~~~~~~~~~ 315 (663) T protein:vir:10 236 VEIVSKTAFNSGAQQTIYPFGGTRTSNARGVIQYGPMTDDQFAIIVRRDGIVVESTVLSTRKGDRDVYGSNIFMDDYFRN 315 (663) T ss_pred EEecccccccccccccccccccccccccceeeeeccccccceeEEEecCCcceeeeeeeecccccccchhhhhhhhhhcc Confidence Q ss_pred ---------------------------------------hhHhhhhhhh-hhhhheecc--ccc---chhHHHHHHHHHh Q lcl|Aclame:pro 116 ---------------------------------------TGIAALTECT-ERPTLIGAP--GFS---QNKAVIDALASMA 150 (388) Q Consensus 116 ---------------------------------------tgl~a~~~~~-~~p~ll~ap--~~~---~~~~v~~~l~~~~ 150 (388) .+++++.... ..+.++++| +.. ..++|..+|.++| T Consensus 316 ~~~~~~~~~~~~~~~~~~~~~~l~gg~d~~~~~~~~~~~~~~~~l~~~~~~~~~~~i~~~~~~~~~~~~~~v~~al~~~a 395 (663) T protein:vir:10 316 GGSNFIFASSEGWPAGFTGIIQLGGGTSANADVGADELIKGWDLFSDREALHVNLMIAGACGSDGAEIASTVQKYVVSLA 395 (663) T ss_pred CcceEEEEeecccCccccceeEcccccCCCccccchhhHHHHHhhhcccccceeEEEeccCCCCchhhHHHHHHHHHHHH Confidence 0000000000 001122222 111 1256888999999 Q ss_pred hhCc-eEEEEecCCCcc---------hhHHHHHHH--------hhhcccccceEEEEecceecccccccceeehhhHHHH Q lcl|Aclame:pro 151 KRLK-CRAVIDGPSGST---------QDAIDLSGL--------LGGEGTGHDRVYMVDPMPAIYSRKAQGNIYVPPSTIA 212 (388) Q Consensus 151 ~~~~-~~~i~d~p~~~~---------~~~~~~~~~--------~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~s~~~ 212 (388) ++++ +++|+|.|.+.. ..+.+++.. ....+++|+|+++||||++++|+.++..+++|||+++ T Consensus 396 ~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~l~~P~~~~~d~~~~~~~~~p~s~~v 475 (663) T protein:vir:10 396 DDRQDCVAIVNPPAELMVGIPTSTAVKNIVEWRNGMTGSGEVVDNNMNISSTYAFIIGNYKYQYDKYNDINRWVPLAADI 475 (663) T ss_pred HhhCCEEEEEecCcccccccccccchHHHHHHHHhccccccchhhccccCcceEEEEccceEEecccCCceEEechhHHH Confidence 9875 999999996531 222333321 2245788999999999999999999999999999999 Q ss_pred HHHHhccccccccccccccce--eecccccccccCchhhhhhccccceEEEEEeCC-CcEEEEccccCC-----Cceeee Q lcl|Aclame:pro 213 MGAVAAVKPWESPGNQGVLIQ--DVARVIDYNILDKSTEGDLLNRNGVSYFARTSM-GGFSLIGNRTVT-----GKFISF 284 (388) Q Consensus 213 aG~~a~~d~~~s~~n~p~~~~--g~~~~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~-~G~~~wG~rT~~-----~~~i~v 284 (388) ||++||+|..+|||+.|+|.. ++...+.+...+++.|++.||++|||||++|++ +|+++||+||++ |+|||+ T Consensus 476 AGl~Ar~D~~~g~~~sPan~~~~~i~g~~~~~~~~~~~~~~~Ln~~gin~i~~~~~~~G~~~wG~rT~s~~~s~~~~i~v 555 (663) T protein:vir:10 476 AGLCAYTDQVSHPWMSPAGYRRGQIRNCIKLAIEPKQSMRDTMYQVAINPVTGFAGGDGFVLFGDKMATQVPSPFDRINV 555 (663) T ss_pred HHHHHHhhccCCceEccCCceeccccccccceeccChhHHHHHhhCCceEEEEEeCCCcEEEEcccccCCCCcccceEeh Confidence 999999999888888888743 455566778889999999999999999999987 799999999974 999999 Q ss_pred hhhHHHHHHHHHHHHHHHhcccCCHHHHHHHHHHHHHHHHHHHhcCCeeeeeEEEeccCCCHHHhhCCeEEEEEEEEecC Q lcl|Aclame:pro 285 VGLEDAIARKLEAASQRAMSKQLTKSFMEQEIKKINLFMQDLVAAEIIPGGEVYLHPTLNTVERYKNGSWYIVIDYGRYS 364 (388) Q Consensus 285 rR~~~~i~~~i~~~~~~~vfepn~~~~~~~i~~~i~~~L~~l~~~Gal~g~~v~~d~~~Nt~~~i~~G~~~~~v~~~p~~ 364 (388) ||||+||+++|++.++|+|||||++.+|++|+++|+.||++||++|+|+||+|+||+++||+++|++|+|+++|+++|++ T Consensus 556 rR~~~~i~~si~~~~~~~v~e~n~~~l~~~i~~~i~~~L~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~~i~~~p~~ 635 (663) T protein:vir:10 556 RRLFNMLKKNIGDTSKYELFENNDAFTRQSFRMETSQYLDGIRSLGGCYDFRVVCDTTNNTPNVIDRNEFVGTIYVKPPR 635 (663) T ss_pred hhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhcCceeeeEEEEcCCCCCHHHhhCCeEEEEEEEEecC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cceeEEEEEEEcchHHHHHHHhcC Q lcl|Aclame:pro 365 PNEHMIFHLNAVDRIVEEFIEEVL 388 (388) Q Consensus 365 pae~I~~~~~~~~~~~~~l~~~~~ 388 (388) |+|||+|+++++....+ |+|++ T Consensus 636 pae~i~~~~~~~~~~~~--~~e~~ 657 (663) T protein:vir:10 636 SINYITLNMVATSTGAN--FDELI 657 (663) T ss_pred CcceEEEEEEEeecCcc--HHHHH Confidence 99999999999887766 67777 No 24 >protein:vir:6894 Length: 660 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861870;genbank:gi:32453661;genbank:GeneID:1494296 Probab=100.00 E-value=3.9e-84 Score=477.85 Aligned_cols=380 Identities=13% Similarity=0.071 Sum_probs=297.1 Q ss_pred CCCcCCCeEEEEcCCCcccccccCcceeEEEeeccccccccccCcceeeccchhhhhhcccccccccchhhhhhhhcccc Q lcl|Aclame:pro 4 IDQFEHNGISIETHEPPPPMGPPGDNVVAWVVTAPDKHADVAFSVPFRVANTADAQYLDSTGNELGTGWHAASETLKKTS 83 (388) Q Consensus 4 ~t~~~hGV~~~e~~~~~~~i~~v~tav~g~vgta~~~~~~~~~~~~v~v~s~~~~~~~~~~~~~~gtl~~a~~~~~~~~~ 83 (388) |+--.+|||++|+ +++++|..+.|++.+|+|.++.+ |.++|++|+|+.++.+.|+.......+..++..+|.+++ T Consensus 1 ~~~~~PgVyv~e~-~~~~~i~~v~ts~~~fvG~~~~G----p~~~p~~i~s~~~~~~~fG~~~~~~~~~~~~~~~f~~~g 75 (660) T protein:vir:68 1 MALLSPGVELKET-TVQSTVVNNSTGTAALAGKFQWG----PAFQIKQITDEVALVDMFGTPNTDTADYFMSAMNFLQYG 75 (660) T ss_pred CccccCceEEEEe-cCCcccccCCCcceeEEecccCC----CCccCEEecCHHHHHHhcCCccCccchhHHHHHHHHhCC Confidence 6555679999999 69999999999999999999877 678999999999999999987777777788888888888 Q ss_pred ceEEEEeccccccccccc-----------------------------------cc-------------------c----- Q lcl|Aclame:pro 84 VPQYFIVVPEGADDAATM-----------------------------------AN-------------------I----- 104 (388) Q Consensus 84 ~~~~vv~~~~~~~~~~~~-----------------------------------~~-------------------~----- 104 (388) ..++++|+..+...+... .. + T Consensus 76 ~~~~vvRv~~~~~~~~~~~~~~~~~~t~~~~g~~~~~g~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~ta~~~~~a~ 155 (660) T protein:vir:68 76 NDLRVVRAVDRDTAKNSSPVAGNINFTISSAGTNYRVGDKVVVKYSTDIIEPDGEVTSVDSDGKILNIFIPSGKIIAKAK 155 (660) T ss_pred CeEEEEEecccccccccccccccceeeeeccCcceeeeeeeeeecccccccccccceeeeecCceeeeeeccccccccce Confidence 888888753211110000 00 0 Q ss_pred ----------------c-------------cccc-c-------hh-hh----hh-----------hHhh----------- Q lcl|Aclame:pro 105 ----------------I-------------GGID-P-------TT-GR----RT-----------GIAA----------- 120 (388) Q Consensus 105 ----------------~-------------~~~~-~-------~t-g~----~t-----------gl~a----------- 120 (388) . +... . .+ +. .+ ++.+ T Consensus 156 ~~~~~~~~~~~~~~~v~~~~~~~~~~~~v~~~~~d~~~~~~~~~ta~~~~~~~~~~~~~~~~~~~~~~A~~~g~~G~~i~ 235 (660) T protein:vir:68 156 EIGEYPELGSNWTAEMSGSSSGLSAVITIDSVVMDSGILLTEVETSEEAITSLTFQESIKKYGVPGVVALYPGELGDQLE 235 (660) T ss_pred eeccccccccceeEEeecccccceeeeeeccccccccceeeeeccccccccccceeeeecccCccccccccccccccceE Confidence 0 0000 0 00 00 00 0000 Q ss_pred -------------------------------------------------------------------------------- Q lcl|Aclame:pro 121 -------------------------------------------------------------------------------- 120 (388) Q Consensus 121 -------------------------------------------------------------------------------- 120 (388) T Consensus 236 v~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 315 (660) T protein:vir:68 236 IEIVSKADYDKGASAQLKIYPDGGTRYSTAKAIFGYGPQTDDQYAIIVRRNDSVVQSVVLSTKRGERDIYGSNIFIDDFF 315 (660) T ss_pred EEEeccccccccccccceeeecccccccceeeEeecccccccceeeeeecCCcceeeeeeecccccccccccceeeehhh Confidence Q ss_pred ----------------------------------------------hhhhhh-hhhheeccccc-----chhHHHHHHHH Q lcl|Aclame:pro 121 ----------------------------------------------LTECTE-RPTLIGAPGFS-----QNKAVIDALAS 148 (388) Q Consensus 121 ----------------------------------------------~~~~~~-~p~ll~ap~~~-----~~~~v~~~l~~ 148 (388) +..... .+.++++++.. +.++|+.+|.+ T Consensus 316 ~~~~~~~v~~~~~~~~~~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~v~~~l~~ 395 (660) T protein:vir:68 316 AKGASNYIFATAQGWPKGFSGVIKLNGGLSSNETVEAGDLMEAWDLFADRESVNAQLFIAGSCAGESLEVASTVQKHVVA 395 (660) T ss_pred ccCcccEEEEeecCCCccccceeeeccccccccccccchhhhHHHHhhhhhccccceeeccccCCCchHHHHHHHHHHHH Confidence 000000 00011111111 12467888999 Q ss_pred HhhhC-ceEEEEecCCC--------c-chhHHHHHHHh-----hhcccccceEEEEecceecccccccceeehhhHHHHH Q lcl|Aclame:pro 149 MAKRL-KCRAVIDGPSG--------S-TQDAIDLSGLL-----GGEGTGHDRVYMVDPMPAIYSRKAQGNIYVPPSTIAM 213 (388) Q Consensus 149 ~~~~~-~~~~i~d~p~~--------~-~~~~~~~~~~~-----~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~s~~~a 213 (388) +|+++ +|++++|.|.. . .+...+++... ...+++|.|+++||||++++|+.++..+++|||+++| T Consensus 396 ~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~sg~~A 475 (660) T protein:vir:68 396 IGDSRQDCLVLCSPPRAAVVGIPVNRAVDNLVDWRTASGTYTDNNFNISSTYAAIDGNYKYQYDKYNDVNRWVPLAADIA 475 (660) T ss_pred HHHhhCCeEEEEcccceeEecCCCCCCHHHHHHHHhhcccccccccccCcceEEEEcCceEEecccCCceEEechhHHHH Confidence 99877 48888887642 1 23333333321 1235789999999999999999999999999999999 Q ss_pred HHHhccccccccccccccc--eeecccccccccCchhhhhhccccceEEEEEeCCCcEEEEccccCC-----Cceeeehh Q lcl|Aclame:pro 214 GAVAAVKPWESPGNQGVLI--QDVARVIDYNILDKSTEGDLLNRNGVSYFARTSMGGFSLIGNRTVT-----GKFISFVG 286 (388) Q Consensus 214 G~~a~~d~~~s~~n~p~~~--~g~~~~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~-----~~~i~vrR 286 (388) |++||+|..+|||+.|+|. .++...+.+.+.++++|++.||++|||+|++|+++|+++||+||++ |+|||||| T Consensus 476 Gl~Ar~d~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~~~~s~~~~i~vrR 555 (660) T protein:vir:68 476 GLCARTDNISQPWMSPAGYNRGQILNVIKLAIETRQAQRDRLYQEAINPVTGTGGDGYVLYGDKTATSVPSPFDRINVRR 555 (660) T ss_pred HHHHHHhccCCcEEccCCeeeceeeccceeeecCChhHHHHHhhCCceEEEEecCCeEEEEcceecCCCCcccceEehhh Confidence 9999999888888888774 2566677888889999999999999999999999999999999984 99999999 Q ss_pred hHHHHHHHHHHHHHHHhcccCCHHHHHHHHHHHHHHHHHHHhcCCeeeeeEEEeccCCCHHHhhCCeEEEEEEEEecCcc Q lcl|Aclame:pro 287 LEDAIARKLEAASQRAMSKQLTKSFMEQEIKKINLFMQDLVAAEIIPGGEVYLHPTLNTVERYKNGSWYIVIDYGRYSPN 366 (388) Q Consensus 287 ~~~~i~~~i~~~~~~~vfepn~~~~~~~i~~~i~~~L~~l~~~Gal~g~~v~~d~~~Nt~~~i~~G~~~~~v~~~p~~pa 366 (388) ||+||+++|+++++|+|||||++.||++|++++++||++||++|+|+||+|+||+++||+++|++|+|+++|+++|++|+ T Consensus 556 ~~~~i~~si~~~~~~~v~epn~~~~~~~i~~~i~~~L~~l~~~gal~gf~V~~d~~~nt~~~i~~G~~~~~i~~~p~~pa 635 (660) T protein:vir:68 556 LFNMVKTNIGSASKYRLFELNNAFTRSSFRTETSQYLQGIKALGGVYNFKVVCDTTNNTPAVIDRNEFVATFYLQPARSI 635 (660) T ss_pred HHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhcCceeeeEEEEecCCCCHHHhhCCeEEEEEEEEecCCc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeEEEEEEEcch--HHHHHHHhcC Q lcl|Aclame:pro 367 EHMIFHLNAVDR--IVEEFIEEVL 388 (388) Q Consensus 367 e~I~~~~~~~~~--~~~~l~~~~~ 388 (388) |||+|+|++... +++++++++= T Consensus 636 e~i~l~~~~~~~~~~~~e~~~~v~ 659 (660) T protein:vir:68 636 NYITLNFVATATGADFDELIGAVG 659 (660) T ss_pred ceEEEEEEEeecCccHHHHHHhhc Confidence 999999988855 8888888888 No 25 >protein:vir:5663 Length: 671 # NCBI annotation: tail sheath protein # Family: family:all:661 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899602;genbank:gi:34419589;genbank:GeneID:2545776 Probab=100.00 E-value=2e-84 Score=479.45 Aligned_cols=378 Identities=13% Similarity=0.097 Sum_probs=296.0 Q ss_pred CCCcCCCeEEEEcCCCcccccccCcceeEEEeeccccccccccCcceeeccchhhhhhcccccccccchhhhhhhhcccc Q lcl|Aclame:pro 4 IDQFEHNGISIETHEPPPPMGPPGDNVVAWVVTAPDKHADVAFSVPFRVANTADAQYLDSTGNELGTGWHAASETLKKTS 83 (388) Q Consensus 4 ~t~~~hGV~~~e~~~~~~~i~~v~tav~g~vgta~~~~~~~~~~~~v~v~s~~~~~~~~~~~~~~gtl~~a~~~~~~~~~ 83 (388) |+.-.+|||++|+ +++++|..+.|++.+|||.++.+ |.++|++|+|+.++.+.|+.......+..++..+|.|++ T Consensus 1 ~~~~~Pgvyv~e~-~~~~~i~~v~t~~~~fvG~~~~G----p~~~p~~v~s~~~~~~~fG~~~~~~~~~~~v~~~f~ngg 75 (671) T protein:vir:56 1 MTLLSPGIENKEI-NLASAIGRAATGRAAMVGKFEWG----PAYSITQVTSESDLVTIFGRPNDYTAASFMTANNFLKYG 75 (671) T ss_pred CceecCceEEEee-cCcccccccCcccceEEecccCC----CCccCEEcCCHHHHHHHcCCcCCCcchhHHHHHHHHhcC Confidence 6666779999999 59999999999999999999877 578999999999999999987777778889999999999 Q ss_pred ceEEEEeccccccccccc---------------------------------c-------c-------------------- Q lcl|Aclame:pro 84 VPQYFIVVPEGADDAATM---------------------------------A-------N-------------------- 103 (388) Q Consensus 84 ~~~~vv~~~~~~~~~~~~---------------------------------~-------~-------------------- 103 (388) ..|++||+......+... . + T Consensus 76 ~~~~vvrv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~v~~ 155 (671) T protein:vir:56 76 NDLRLVRICDATTAQNATPLYNAVEYTIGASNGCVVGDDITITYSGVGALTAKGKVLEVDAGNNNAASKIFLPSAEIVAA 155 (671) T ss_pred CeEEEEEecCccccccchhhccccccccccCcceeeceeeeeecCcccccccCcceeEEeeeccceeeeeeccceeEEEe Confidence 999988864321100000 0 0 Q ss_pred --------ccc-----------------c--------------------------------------------------- Q lcl|Aclame:pro 104 --------IIG-----------------G--------------------------------------------------- 107 (388) Q Consensus 104 --------~~~-----------------~--------------------------------------------------- 107 (388) ..+ . T Consensus 156 ~~~~~~~~~~~~~t~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~g 235 (671) T protein:vir:56 156 AKSDGNYPSVGTITLQPTQGDIALTNIEIIDTGSVYFPNIELAFDALTAIETEGGALKYADLIEKQGFPRLSARYVGDFG 235 (671) T ss_pred eeccccccccccccccccccceeeeeecccccceEEEeccccccccccccccccccccchhhhhcccccccccccccccC Confidence 000 0 Q ss_pred -------------------------------------------------------------------------------- Q lcl|Aclame:pro 108 -------------------------------------------------------------------------------- 107 (388) Q Consensus 108 -------------------------------------------------------------------------------- 107 (388) T Consensus 236 ~~~~v~v~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~~~~~~~~~~~ 315 (671) T protein:vir:56 236 DAISVEIINYADYQTAFAFAAGHTLGDIELPIYPDGGTRSINLSSYFTFGPSNSNQYAVIVRVSGEVEEAFIVSTNPGDK 315 (671) T ss_pred cceEEEEecccccccccccccceeeeeccccccccccccccccceeecccccccccceeEEeecCccceeEEEeeccccc Confidence Q ss_pred ----------------------------------------ccchhhh---hhhHhhhhhhh-hhhhheecccccchh--- Q lcl|Aclame:pro 108 ----------------------------------------IDPTTGR---RTGIAALTECT-ERPTLIGAPGFSQNK--- 140 (388) Q Consensus 108 ----------------------------------------~~~~tg~---~tgl~a~~~~~-~~p~ll~ap~~~~~~--- 140 (388) .+...+. .++++++.... ..|.++++|++++.. T Consensus 316 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~d~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~ 395 (671) T protein:vir:56 316 DVNGQSIFIDEYFENSGSAYITAIAEGWKTESGAYNFGGGSDANAGADDWMFGLDMLSDPEVLYTNLVIAGNAAAEEVSI 395 (671) T ss_pred ccchhhhhhhhhhcccCceEEEecCcccCCccccccccCccccccchhHHHHHHHhhhhccccceeEEEcCCCCCccchh Confidence 0000000 00000000000 013344444443321 Q ss_pred ---HHHHHHHHHhh-hCceEEEEecCCCc---------chhHHHHHH---------HhhhcccccceEEEEecceecccc Q lcl|Aclame:pro 141 ---AVIDALASMAK-RLKCRAVIDGPSGS---------TQDAIDLSG---------LLGGEGTGHDRVYMVDPMPAIYSR 198 (388) Q Consensus 141 ---~v~~~l~~~~~-~~~~~~i~d~p~~~---------~~~~~~~~~---------~~~~~~~~s~~~~~~~p~~~~~~~ 198 (388) ...+++.++++ +.++++++|.|... .....+++. .....+++|.|+++||||++++|+ T Consensus 396 ~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~ 475 (671) T protein:vir:56 396 ASTVQKYAIDSVGNVRQDCVVFVSPPQSLIVNKQAGTAVANIQGWRTGIDPTNGQAVVDNLNVSTTYAVIDGNYKYQYDK 475 (671) T ss_pred HHHHHHHHHHHHHhhcCCEEEEEeccccccccccccccHHHHHHHhhhccccchhhhhhhccCCcceEEEecCceEEecc Confidence 12334556654 56799999988542 122222221 123456789999999999999999 Q ss_pred cccceeehhhHHHHHHHHhccccccccccccccce--eecccccccccCchhhhhhccccceEEEEEeCCCcEEEEcccc Q lcl|Aclame:pro 199 KAQGNIYVPPSTIAMGAVAAVKPWESPGNQGVLIQ--DVARVIDYNILDKSTEGDLLNRNGVSYFARTSMGGFSLIGNRT 276 (388) Q Consensus 199 ~~~~~~~~p~s~~~aG~~a~~d~~~s~~n~p~~~~--g~~~~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT 276 (388) .++..+++|||+++||++||+|..+|||+.|+|.. ++...+.....+++.|++.||++|||+|++|+++|+++||+|| T Consensus 476 ~~~~~~~~p~s~~~AGl~Ar~D~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT 555 (671) T protein:vir:56 476 YNDRNRWVPLAGDIAGLCAYTDQVSQPWMSPAGFNRGQIKGVNRLAVDLRRAHRDALYQIGINPVVGFAGQGFVLYGDKT 555 (671) T ss_pred cCCceeEechHHHHHHHHHHhhccCCcEECcCCceeccccccccceeecChhHHHHHhhCCceEEEEecCCeEEEEccee Confidence 99999999999999999999998777777777632 3444556667778899999999999999999999999999999 Q ss_pred CC-----CceeeehhhHHHHHHHHHHHHHHHhcccCCHHHHHHHHHHHHHHHHHHHhcCCeeeeeEEEeccCCCHHHhhC Q lcl|Aclame:pro 277 VT-----GKFISFVGLEDAIARKLEAASQRAMSKQLTKSFMEQEIKKINLFMQDLVAAEIIPGGEVYLHPTLNTVERYKN 351 (388) Q Consensus 277 ~~-----~~~i~vrR~~~~i~~~i~~~~~~~vfepn~~~~~~~i~~~i~~~L~~l~~~Gal~g~~v~~d~~~Nt~~~i~~ 351 (388) ++ |+||++||||+||+++|++.++|+|||||++.||++|+++|++||++||++|+|+||+|+||+++||+++|++ T Consensus 556 ~~~~~~~~~~i~vrR~~~~i~~si~~~~~~~v~epn~~~~~~~i~~~i~~fL~~l~~~gal~g~~v~~d~~~nt~~~i~~ 635 (671) T protein:vir:56 556 ATQQASAFDRINVRRLFNLLKKAISDAAKYRLFELNDEFTRSSFKSEIDAYLTNIQDLGGVYDFRVVCDETNNPGSVIDR 635 (671) T ss_pred cCCCCcccceEehhhHHHHHHHHHHHHHHHhcCCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEcCCCCCHHHhhC Confidence 74 9999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CeEEEEEEEEecCcceeEEEEEEEcchHHHHHHHhcC Q lcl|Aclame:pro 352 GSWYIVIDYGRYSPNEHMIFHLNAVDRIVEEFIEEVL 388 (388) Q Consensus 352 G~~~~~v~~~p~~pae~I~~~~~~~~~~~~~l~~~~~ 388 (388) |+|+++|+++|++|+|||+|+|++.....+ |+||+ T Consensus 636 G~~~~~i~~~p~~Pae~I~~~~~~~~~~~~--f~e~~ 670 (671) T protein:vir:56 636 NEFVASIYVKPAKSINFITLNFVATSTDAD--FAEII 670 (671) T ss_pred CeEEEEEEEEecCCcceEEEEEEEeecCcc--hhhhc Confidence 999999999999999999999999999888 99999 No 26 >protein:vir:101804 Length: 663 # NCBI annotation: gp18 # Family: family:all:661 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238881;genbank:gi:66391956;genbank:GeneID:3416631 Probab=100.00 E-value=3.9e-84 Score=477.86 Aligned_cols=378 Identities=12% Similarity=0.094 Sum_probs=296.7 Q ss_pred CCCcCCCeEEEEcCCCcccccccCcceeEEEeeccccccccccCcceeeccchhhhhhcccccccccchhhhhhhhcccc Q lcl|Aclame:pro 4 IDQFEHNGISIETHEPPPPMGPPGDNVVAWVVTAPDKHADVAFSVPFRVANTADAQYLDSTGNELGTGWHAASETLKKTS 83 (388) Q Consensus 4 ~t~~~hGV~~~e~~~~~~~i~~v~tav~g~vgta~~~~~~~~~~~~v~v~s~~~~~~~~~~~~~~gtl~~a~~~~~~~~~ 83 (388) |+-..+|||++|+ +++++|..+.|++.+|||.+..+ |.++|++|+|+.++.+.|+.......+..++..+|.+++ T Consensus 1 ~~~~~Pgvyv~e~-~~~~~i~~~~t~~~~~vG~~~~G----p~~~p~~v~~~~~~~~~fg~~~~~~~~~~~~~~~f~ngg 75 (663) T protein:vir:10 1 MALLSPGIEMKET-SINSTVVRSATGRAAIVGKFAWG----PAYEVRQVTNEVELVDMFGSPDNVTAPYFMSAMNFLQYG 75 (663) T ss_pred CceecCceEEEEe-cCCccccccCcccceeEeecccC----CCCccEEecCHHHHHHhcCCcCCcchhHHHHHHHHHhCC Confidence 6566679999999 59999999999999999999877 568999999999999999988777778888899999999 Q ss_pred ceEEEEecccccccccc---------------------------c--cc------------------c------------ Q lcl|Aclame:pro 84 VPQYFIVVPEGADDAAT---------------------------M--AN------------------I------------ 104 (388) Q Consensus 84 ~~~~vv~~~~~~~~~~~---------------------------~--~~------------------~------------ 104 (388) ..|+++|+..+...+.. . .. + T Consensus 76 ~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~i~v~~~~~~~~~~~~~~~~~~~~~~~~v~~~ta~~~~~~~ 155 (663) T protein:vir:10 76 NDLRLVRVIDMEKAKNASPLVNQVSVTITTEGQGYTVGDAITVKYNNATITEAGKVTAVDSDGKIKSLFVPTAEIIAKTR 155 (663) T ss_pred CeEEEEEccCCccccccccccccceeEEeecccccccccccccccccccccccccceeeecccceEEEeecccccccccc Confidence 99988876321110000 0 00 0 Q ss_pred -------------------ccc---------c--cch-------hhh-----------------------hh-------- Q lcl|Aclame:pro 105 -------------------IGG---------I--DPT-------TGR-----------------------RT-------- 116 (388) Q Consensus 105 -------------------~~~---------~--~~~-------tg~-----------------------~t-------- 116 (388) .++ . +.. +.. .. T Consensus 156 ~v~~~~~~~~~~~~~~s~~s~~~~~a~~v~~v~~d~~~~v~~~~~a~~~~t~~~~~~~~~~~~~~~i~A~~~G~~Gn~i~ 235 (663) T protein:vir:10 156 QLGTYPTLGDNWRIDVSGASGGSAAALALGNIVVDSGVTFGNSEDAPAVMTSPAVMEKYAKFGMPLISAVYPGEIGSTVE 235 (663) T ss_pred ccccceeeccceeeEeeeccCccccccccceeccccceEEeeccccccccccccccccccccccceEEeccCCcccceee Confidence 000 0 000 000 00 Q ss_pred -------------------------------------------------------------------------------- Q lcl|Aclame:pro 117 -------------------------------------------------------------------------------- 116 (388) Q Consensus 117 -------------------------------------------------------------------------------- 116 (388) T Consensus 236 V~i~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~~~~~~~~~~~~~~~ 315 (663) T protein:vir:10 236 VEIVSKTAFNSGAQQTIYPFGGTRTSNARGVIQYGPMTDDQFAIIVRRDGIVVESTVLSTRKGDRDVYGSNIFMDDYFRN 315 (663) T ss_pred eeeccccccccccccceecccccccccccceeecccccccceeeEeecCCcceeeecccccccccccccchhhhhhhhcC Confidence Q ss_pred ----------------------------------------hHhhhhhhh-hhhhheeccc--cc---chhHHHHHHHHHh Q lcl|Aclame:pro 117 ----------------------------------------GIAALTECT-ERPTLIGAPG--FS---QNKAVIDALASMA 150 (388) Q Consensus 117 ----------------------------------------gl~a~~~~~-~~p~ll~ap~--~~---~~~~v~~~l~~~~ 150 (388) ++..+.... ..+.++++|. .. ..++|+.+|.++| T Consensus 316 ~~~~~~~~~~~~~~~~~~~~~~l~gg~d~~~~~~~~~~~~~~~~l~~~~~~~~~~~i~~~~~~~~~~~~~~v~~~l~~~a 395 (663) T protein:vir:10 316 GGSNFIFASSEGWPAGFTGIIQLGGGTSANADVGADELIKGWDLFSDREALHVNLMIAGACGSDGAEIASTVQKYVVSLA 395 (663) T ss_pred CcceEEEEeecccCccccceeEeccccCCccccchhhhHHHHHhhhcccccceeEEEeccCCCCchhhHHHHHHHHHHHH Confidence 000000000 0011222221 11 1256888999999 Q ss_pred hhCc-eEEEEecCCCcc---------hhHHHHHH--------HhhhcccccceEEEEecceecccccccceeehhhHHHH Q lcl|Aclame:pro 151 KRLK-CRAVIDGPSGST---------QDAIDLSG--------LLGGEGTGHDRVYMVDPMPAIYSRKAQGNIYVPPSTIA 212 (388) Q Consensus 151 ~~~~-~~~i~d~p~~~~---------~~~~~~~~--------~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~s~~~ 212 (388) ++++ +++++|.|.+.. .....++. .....+++|.|+++||||++++|+.++..+++|||+++ T Consensus 396 ~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~sg~v 475 (663) T protein:vir:10 396 DDRQDCVAIVNPPAELMVGIPTSTAVKNIVEWRNGMTGSGEVVDNNMNISSTYAFIIGNYKYQYDKYNDINRWVPLAADI 475 (663) T ss_pred HhhCCEEEEEecCcccccccccccchHHHHHHHHhccccccchhhhcccCccceEEEcCceEEecccCCceEEechhHHH Confidence 9875 899999997532 12222221 12235688999999999999999999999999999999 Q ss_pred HHHHhccccccccccccccce--eecccccccccCchhhhhhccccceEEEEEeCC-CcEEEEccccCC-----Cceeee Q lcl|Aclame:pro 213 MGAVAAVKPWESPGNQGVLIQ--DVARVIDYNILDKSTEGDLLNRNGVSYFARTSM-GGFSLIGNRTVT-----GKFISF 284 (388) Q Consensus 213 aG~~a~~d~~~s~~n~p~~~~--g~~~~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~-~G~~~wG~rT~~-----~~~i~v 284 (388) ||++||+|..+|||+.|+|.. ++...++++..+++.|++.||++|||+|++|++ +||++||+||++ |+|||| T Consensus 476 AGl~Ar~D~~~g~~~sPan~~~~~i~g~~~~~~~~~~~e~~~Ln~~gin~i~~~~~~~G~~~wG~rT~~~~~s~~~~i~v 555 (663) T protein:vir:10 476 AGLCAYTDQVSHPWMSPAGYRRGQIRNCIKLAIEPKQSMRDTMYQVAINPVTGFAGGDGFVLFGDKMATQVPSPFDRINV 555 (663) T ss_pred HHHHHHhhccCCceEccCCceeccccccccceeecChhHHHHHhhCCceEEEEEeCCCcEEEEcccccCCCCcccceEeh Confidence 999999999888888888742 455667788888999999999999999999987 799999999974 999999 Q ss_pred hhhHHHHHHHHHHHHHHHhcccCCHHHHHHHHHHHHHHHHHHHhcCCeeeeeEEEeccCCCHHHhhCCeEEEEEEEEecC Q lcl|Aclame:pro 285 VGLEDAIARKLEAASQRAMSKQLTKSFMEQEIKKINLFMQDLVAAEIIPGGEVYLHPTLNTVERYKNGSWYIVIDYGRYS 364 (388) Q Consensus 285 rR~~~~i~~~i~~~~~~~vfepn~~~~~~~i~~~i~~~L~~l~~~Gal~g~~v~~d~~~Nt~~~i~~G~~~~~v~~~p~~ 364 (388) ||||+||+++|++.++|+|||||++.+|++|+++|+.||++||++|+|.||+|+||+++||+++|++|+|+++|+++|++ T Consensus 556 rR~~~~i~~si~~~~~~~v~epn~~~l~~~i~~~i~~~L~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~~i~~~p~~ 635 (663) T protein:vir:10 556 RRLFNMLKKNIGDTSKYELFENNDAFTRQSFRMETSQYLDGIRSLGGCYDFRVVCDTTNNTPNVIDRNEFVGTIYVKPPR 635 (663) T ss_pred hhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEcCCCCCHHHhhCCeEEEEEEEEecC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cceeEEEEEEEcchHHHHHHHhcC Q lcl|Aclame:pro 365 PNEHMIFHLNAVDRIVEEFIEEVL 388 (388) Q Consensus 365 pae~I~~~~~~~~~~~~~l~~~~~ 388 (388) |+|||+|+++++....+ |+|++ T Consensus 636 pae~i~~~~~~~~~~~~--~~e~~ 657 (663) T protein:vir:10 636 SINYITLNMVATSTGAN--FDELI 657 (663) T ss_pred CcceEEEEEEEeecCcc--HHHHH Confidence 99999999999877655 66666 No 27 >protein:vir:100539 Length: 663 # NCBI annotation: gp18 tail sheath monomer # Family: family:all:661 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656380;genbank:gi:109290131;genbank:GeneID:4156517 Probab=100.00 E-value=1e-83 Score=475.52 Aligned_cols=378 Identities=13% Similarity=0.110 Sum_probs=300.1 Q ss_pred CCCcCCCeEEEEcCCCcccccccCcceeEEEeeccccccccccCcceeeccchhhhhhcccccccccchhhhhhhhcccc Q lcl|Aclame:pro 4 IDQFEHNGISIETHEPPPPMGPPGDNVVAWVVTAPDKHADVAFSVPFRVANTADAQYLDSTGNELGTGWHAASETLKKTS 83 (388) Q Consensus 4 ~t~~~hGV~~~e~~~~~~~i~~v~tav~g~vgta~~~~~~~~~~~~v~v~s~~~~~~~~~~~~~~gtl~~a~~~~~~~~~ 83 (388) |+-..+|||++|+ +++++|..+.|++.+|||.+..+ |.++|++|+|+.++.+.|+.......+..++..+|.+++ T Consensus 1 ~~~~~Pgvyv~e~-~~~~~~~~v~t~~~~fvG~~~~g----p~~~p~~i~s~~~~~~~fG~~~~~~~~~~~v~~~f~ngg 75 (663) T protein:vir:10 1 MALLSPGIEMKET-SINSTVVRSATGRAALVGKFAWG----PAYEIRQVTNEVELVDMFGSPDNVTAPYFMSAMNFLQYG 75 (663) T ss_pred CccccCceEEEEe-cCcccccccccccceeeeccccC----CCCcCEEecCHHHHHHHcCCcccccchHHHHHHHHHhCC Confidence 6555679999999 58999999999999999999877 678999999999999999988777788889999999999 Q ss_pred ceEEEEeccccccccccc------------------------------------------------c------------- Q lcl|Aclame:pro 84 VPQYFIVVPEGADDAATM------------------------------------------------A------------- 102 (388) Q Consensus 84 ~~~~vv~~~~~~~~~~~~------------------------------------------------~------------- 102 (388) ..|++||+......+... . T Consensus 76 ~~~~vvRv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~a~~~~~a~ 155 (663) T protein:vir:10 76 NDLRLVRVIDMEQAKNASPLFNQIEVTITTEGQGYTVGDTVSIKHNTTTVTEEGKVTKVDADGKIKALFVPSSAVIAKAK 155 (663) T ss_pred CeEEEEecCCcccccccccccccceeeEeecccCccccceeeecccccccccCcceeeeccCCceeEEEecccccccccc Confidence 999988875321110000 0 Q ss_pred ---c--------------cc---------ccc---------------cchh------------------------h---- Q lcl|Aclame:pro 103 ---N--------------II---------GGI---------------DPTT------------------------G---- 113 (388) Q Consensus 103 ---~--------------~~---------~~~---------------~~~t------------------------g---- 113 (388) . .. .+. +..+ | T Consensus 156 ~~~~~~~~~~a~~~~v~~~~~~~~~a~av~~i~~dg~vt~~~~~~a~~~~~~~~~~~~~~~~~~~~~~a~~~g~~G~~i~ 235 (663) T protein:vir:10 156 QLGTYPVLGDNWRAEVSGASGGSAATLTLGGIVVDSGVTFGNSEEAPDVMTSTKVLANFAKYGMPLISAVYPGEIGSTVE 235 (663) T ss_pred ccccccccccceeeEEeeccccccccceeEeeecCCceeEEeeeccccccccceeeeeccccccceeeeecccccCccee Confidence 0 00 000 0000 0 Q ss_pred ----hhh---------------------------h--------------------------------------H-hhhhh Q lcl|Aclame:pro 114 ----RRT---------------------------G--------------------------------------I-AALTE 123 (388) Q Consensus 114 ----~~t---------------------------g--------------------------------------l-~a~~~ 123 (388) ..+ + + ..+.. T Consensus 236 v~~~~~~~~~~~~~~~v~~~~g~~~~~~~~~~~~g~~~~~~~~~~~~~~g~~~e~~~ls~~~~~~~~~~~~~~~~~~~~~ 315 (663) T protein:vir:10 236 VEVISKTAFQSGAAQPIYPFGGTRASNARSVIQYGPMTDDQFAIIVRRDGIVVESTVLSTRRGDRDVYGNNIFMDDYFRN 315 (663) T ss_pred EeecccccccccceeeecccCcccccccccccccccccchhhcccccCCCcccceeeeeccccccccchhhhhhhhhhcC Confidence 000 0 0 00000 Q ss_pred -----------------------------------------------h--hhhhhheec----ccccchhHHHHHHHHHh Q lcl|Aclame:pro 124 -----------------------------------------------C--TERPTLIGA----PGFSQNKAVIDALASMA 150 (388) Q Consensus 124 -----------------------------------------------~--~~~p~ll~a----p~~~~~~~v~~~l~~~~ 150 (388) . .....+++. |++++.+.|+++|.++| T Consensus 316 ~~s~~v~~~~~~~~~~~~~~~~l~gg~~~~~~~~~~d~~~~~~~l~~~~~~~~~~~i~~~~~~~~~~~~~~v~~~l~~~~ 395 (663) T protein:vir:10 316 GSSNFIYASSVNWPAGFTGIIQLGGGASANNAVGSDELIAGWDLFADREALHVNLMIAGACKSDGVAVASTVQKHVVALA 395 (663) T ss_pred cccceeEeeccccCcccceeEEecccccCcccchhhhhhhHHhhhccccccCceEEEeecCCCCchhhHHHHHHHHHHHH Confidence 0 000011111 22223467889999999 Q ss_pred hhCc-eEEEEecCCCcch------hHHHHHHH-----------hhhcccccceEEEEecceecccccccceeehhhHHHH Q lcl|Aclame:pro 151 KRLK-CRAVIDGPSGSTQ------DAIDLSGL-----------LGGEGTGHDRVYMVDPMPAIYSRKAQGNIYVPPSTIA 212 (388) Q Consensus 151 ~~~~-~~~i~d~p~~~~~------~~~~~~~~-----------~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~s~~~ 212 (388) ++++ |++|+|+|.+... .......+ ....+++|+|+++||||++++|+.++..+++|||+++ T Consensus 396 ~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~l~~p~~~~~d~~~~~~~~~p~s~~v 475 (663) T protein:vir:10 396 DDRQDCVAFVNPPSELLVGVPTTQAVKNIVEWRNGVTTGGEVVDNNMNISSTYAFISGNYKYQYDKYNDINRWVPLSADI 475 (663) T ss_pred HhhCCEEEEEecCcccccccchhhhHHHHHHHhhhccccchhhhhhcccCcceEEEEecceeEecccCCceEEechHHHH Confidence 9875 9999999976431 11122222 2245789999999999999999999999999999999 Q ss_pred HHHHhccccccccccccccce--eecccccccccCchhhhhhccccceEEEEEeCC-CcEEEEccccCC-----Cceeee Q lcl|Aclame:pro 213 MGAVAAVKPWESPGNQGVLIQ--DVARVIDYNILDKSTEGDLLNRNGVSYFARTSM-GGFSLIGNRTVT-----GKFISF 284 (388) Q Consensus 213 aG~~a~~d~~~s~~n~p~~~~--g~~~~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~-~G~~~wG~rT~~-----~~~i~v 284 (388) ||++||+|..+|||+.|+|.. ++...+.+...+++.|++.||++|||+|+.|++ +||++||+||++ |+||++ T Consensus 476 AGl~Ar~D~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gin~i~~~~~~~G~~~wG~rT~s~~~s~~~~i~v 555 (663) T protein:vir:10 476 AGLCAYTDQVGHPWMSPAGYRRGQLRNTIKLAIEPKQSLRDTMYQVSINPVTGFAGGDGFVLFGDKMATQVPSPFDRINV 555 (663) T ss_pred HHHHHHhhccCCcEEccCCeeecceeccccceeecCchhHHHHHhCCCcEEEEeeCCCcEEEEcccccCCCCcccceEeh Confidence 999999998888888887743 566667777888999999999999999999987 799999999984 999999 Q ss_pred hhhHHHHHHHHHHHHHHHhcccCCHHHHHHHHHHHHHHHHHHHhcCCeeeeeEEEeccCCCHHHhhCCeEEEEEEEEecC Q lcl|Aclame:pro 285 VGLEDAIARKLEAASQRAMSKQLTKSFMEQEIKKINLFMQDLVAAEIIPGGEVYLHPTLNTVERYKNGSWYIVIDYGRYS 364 (388) Q Consensus 285 rR~~~~i~~~i~~~~~~~vfepn~~~~~~~i~~~i~~~L~~l~~~Gal~g~~v~~d~~~Nt~~~i~~G~~~~~v~~~p~~ 364 (388) ||||+||+++|++.++|+|||||++.||++|++++++||++||++|+|+||+|+||+++||+++|++|+|+++|+++|++ T Consensus 556 rR~~~~i~~si~~~~~~~v~epn~~~l~~~i~~~i~~~L~~l~~~gal~gf~V~~d~~~nt~~~i~~G~~~~~i~~~p~~ 635 (663) T protein:vir:10 556 RRLFNMLKKNIGDTSKYELFENNDAFTRQSFRMEVSQYLDNIRSLGGVYDFRVVCDTTNNTPQVIDSNEFVATIYIKAPR 635 (663) T ss_pred hhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEcCCCCCHHHhhCCeEEEEEEEEecC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cceeEEEEEEEcchHHHHHHHhcC Q lcl|Aclame:pro 365 PNEHMIFHLNAVDRIVEEFIEEVL 388 (388) Q Consensus 365 pae~I~~~~~~~~~~~~~l~~~~~ 388 (388) |+|||+|++++...+.+ |+||+ T Consensus 636 pae~I~~~~~~~~~~~~--f~e~~ 657 (663) T protein:vir:10 636 SINYITLNFVATSTGAN--FDELI 657 (663) T ss_pred CcceEEEEEEEEecCcc--HHHHH Confidence 99999999999988866 88888 No 28 >protein:vir:104858 Length: 729 # NCBI annotation: T4-like tail sheath protein # Family: family:all:661 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214361;genbank:gi:61806001;genbank:GeneID:3294378 Probab=100.00 E-value=1.3e-82 Score=469.50 Aligned_cols=380 Identities=14% Similarity=0.083 Sum_probs=291.7 Q ss_pred CCCCCCcCC-CeEEEEcCCCcccccccCcceeEEEeeccccccccccCcceeeccchhhhhhccccc--ccccchhhhhh Q lcl|Aclame:pro 1 MPVIDQFEH-NGISIETHEPPPPMGPPGDNVVAWVVTAPDKHADVAFSVPFRVANTADAQYLDSTGN--ELGTGWHAASE 77 (388) Q Consensus 1 M~~~t~~~h-GV~~~e~~~~~~~i~~v~tav~g~vgta~~~~~~~~~~~~v~v~s~~~~~~~~~~~~--~~gtl~~a~~~ 77 (388) ||+. |+| |||++|++.++++|..+.|++.+|||.++.+ +.++|++|+|+.++.+.|+... ....+..++.. T Consensus 1 m~~~--~~~PgVyv~e~~~~~~~i~~v~ts~~~fvG~~~~G----p~~~p~~i~s~~~~~~~fG~~~~~~~~~~~~~~~~ 74 (729) T protein:vir:10 1 MPLN--LASPGIVVREVDLTIGRVDPTSGSIGALVAPFAKG----PVNDPQLIESEEDLLQTFGQPYSTDKHYEYWMVAS 74 (729) T ss_pred CCcc--ccCCceEEEEecCCCcccccccccceeEEeccccC----CCccCeEcCCHHHHHHHcCccccCCcchhHHHHHH Confidence 8863 555 9999999999999999999999999999877 5789999999999999988743 23345678888 Q ss_pred hhccccceEEEEecccccccccc--------------------------------------c------------------ Q lcl|Aclame:pro 78 TLKKTSVPQYFIVVPEGADDAAT--------------------------------------M------------------ 101 (388) Q Consensus 78 ~~~~~~~~~~vv~~~~~~~~~~~--------------------------------------~------------------ 101 (388) +|.|++..|+++|+......+.+ . T Consensus 75 ~f~ngg~~~~vvRv~~~~~~~a~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~G~~gn~~~v~v 154 (729) T protein:vir:10 75 SYLAYGGTMQVVRADDYNTQTGVGLKNAFVTGGVGVGATMLRITSNTHYNQLGYDENTISGVSVAAKNPGTWANGIKVAI 154 (729) T ss_pred HHHhCCceEEEEecCcccccccccccccccccccccccccccccccccccccccccCCCcceEEEEeccCccccceeeEE Confidence 99999999999886431100000 0 Q ss_pred c-------------------------------c-------------ccccccch-------------------------- Q lcl|Aclame:pro 102 A-------------------------------N-------------IIGGIDPT-------------------------- 111 (388) Q Consensus 102 ~-------------------------------~-------------~~~~~~~~-------------------------- 111 (388) . . .....+.. T Consensus 155 ~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~s~~~~~~~~~~~~~~~~~~ 234 (729) T protein:vir:10 155 IDGKADQILTVASGNTTAVGSAVTQSISKTIGTATGTTTIDGVLKGIVTGSTDTTLEVKVISHISAAGVETAVEYQQNGT 234 (729) T ss_pred ecccCcceeeeeccccccceeeeeeeccccccccccceeeeeeecccccccccccccceecccccccccceeccccccce Confidence 0 0 00000000 Q ss_pred -----------------------------------------h-------------------------------------- Q lcl|Aclame:pro 112 -----------------------------------------T-------------------------------------- 112 (388) Q Consensus 112 -----------------------------------------t-------------------------------------- 112 (388) + T Consensus 235 ~~~~~~~s~~~~a~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~d~~~~~~~d~~~~~ 314 (729) T protein:vir:10 235 YTFDNSGSVNVIAAGSSGSGSAKSYTAQTDWFESQNIVLSNSTLEWDSIADAPGTSTYVSTRGGKNDEIHVLVIDDKGTI 314 (729) T ss_pred eeecccCccceeeeccccccccccceeeeccccccccccccccccccccccccccccccccccccccccceeeecccccc Confidence 0 Q ss_pred -h------------------------------------------------------------------------------ Q lcl|Aclame:pro 113 -G------------------------------------------------------------------------------ 113 (388) Q Consensus 113 -g------------------------------------------------------------------------------ 113 (388) + T Consensus 315 ~~~~g~vve~~~~~s~~~~~~~~~~~~~~~~~vi~~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~ 394 (729) T protein:vir:10 315 TGNSGTILEKHLSLSKAKDAEYSVGSSSYWRDFLATNSKYIFGGGATSGITTTGYSVSSTNTLDTDSGWDQNAEGVNFGA 394 (729) T ss_pred ccCcccceeeeeeeeeccccccccccccccceeeccccceeeecccccccccccccccccceeccccccccccccccccc Confidence 0 Q ss_pred --------------------------------hhhhHhhhhhhhh-hhhheeccc----ccchhHHHHHHHHHhhhC-ce Q lcl|Aclame:pro 114 --------------------------------RRTGIAALTECTE-RPTLIGAPG----FSQNKAVIDALASMAKRL-KC 155 (388) Q Consensus 114 --------------------------------~~tgl~a~~~~~~-~p~ll~ap~----~~~~~~v~~~l~~~~~~~-~~ 155 (388) ..+|++++..... ....+++++ ..+.+.+..+|.++|+++ ++ T Consensus 395 ~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~g~~~l~~~~~~~~~~~~~~~~~~~~~~~~~v~~a~~~~~~~~~~~ 474 (729) T protein:vir:10 395 SGVATLTLAGGTNYGDKTDLTTSGALSSGVDDIISGYTLFENTEEIEVDFILMGAAHHPKEQSQAVAEKVTAVAEARKDA 474 (729) T ss_pred cceeEEEeecccccccccccccccccccchhHHHHHHHHhhcccccccceeeecCCCCCccchHHHHHHHHHHHHhcCCe Confidence 0000000000000 000011110 112355677888888876 48 Q ss_pred EEEEecCCCcch---------------hHHHHHHHhhhcccccceEEEEecceecccccccceeehhhHHHHHHHHhccc Q lcl|Aclame:pro 156 RAVIDGPSGSTQ---------------DAIDLSGLLGGEGTGHDRVYMVDPMPAIYSRKAQGNIYVPPSTIAMGAVAAVK 220 (388) Q Consensus 156 ~~i~d~p~~~~~---------------~~~~~~~~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~s~~~aG~~a~~d 220 (388) ++++|.|..... ...+...+... ..+++++++||||++++|+.++..+++|||+++||++||+| T Consensus 475 ~a~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~p~~~~~d~~~~~~~~~p~s~~~aGl~a~~d 553 (729) T protein:vir:10 475 VAFISPYRQAFLNDTVSGTVTVSNIDQTTENVVGFYAP-LSSSTYSVFDSGYKYMFDRFNNTFRYVPLNGDIAGTCARTD 553 (729) T ss_pred EEEecccccccccccccccccccccchhhHHHHHHHhh-ccCCceEEEEcCeeEEecccCCceEEechhHHHHHHHHHhh Confidence 899988753210 11111111111 23578999999999999999999999999999999999999 Q ss_pred cccccccccccc--eeecccccccccCchhhhhhccccceEEEEEeCCCcEEEEccccCC-----CceeeehhhHHHHHH Q lcl|Aclame:pro 221 PWESPGNQGVLI--QDVARVIDYNILDKSTEGDLLNRNGVSYFARTSMGGFSLIGNRTVT-----GKFISFVGLEDAIAR 293 (388) Q Consensus 221 ~~~s~~n~p~~~--~g~~~~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~-----~~~i~vrR~~~~i~~ 293 (388) ..+|||+.|+|. .++...+.+.+.++++|++.||++|||||++|+++|+++||+||++ |+||++|||++||++ T Consensus 554 ~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~~~d~~~~~i~vrR~~~~i~~ 633 (729) T protein:vir:10 554 IEQFPWFSPAGTARGPILNSVKLVYNPGKKQRDILYSNRINPVILSPGAGIILFGDKTGFGKSSAFDRINVRRLFIYLED 633 (729) T ss_pred ccCCcEEccCCccccceecccceeeecChhhHhhhhhCCceEEEEecCCeEEEEcceecCCCCcccceeehhhhHHHHHH Confidence 888888777763 3566677788889999999999999999999999999999999973 999999999999999 Q ss_pred HHHHHHHHHhcccCCHHHHHHHHHHHHHHHHHHHhcCCeeeeeEEEeccCCCHHHhhCCeEEEEEEEEecCcceeEEEEE Q lcl|Aclame:pro 294 KLEAASQRAMSKQLTKSFMEQEIKKINLFMQDLVAAEIIPGGEVYLHPTLNTVERYKNGSWYIVIDYGRYSPNEHMIFHL 373 (388) Q Consensus 294 ~i~~~~~~~vfepn~~~~~~~i~~~i~~~L~~l~~~Gal~g~~v~~d~~~Nt~~~i~~G~~~~~v~~~p~~pae~I~~~~ 373 (388) +|++.++|+|||||++.+|++|++++++||++||++|+|+||+|+||+++||++||++|+|+++|+++|++|+|||+|++ T Consensus 634 si~~~~~~~v~epn~~~~~~~i~~~i~~~L~~l~~~g~l~g~~v~~d~~~nt~~~i~~G~~~~~v~~~p~~p~e~i~~~~ 713 (729) T protein:vir:10 634 AISAAAKDQLFEFNDELTRTNFVNIVEPFLRDVQAKRGIFDFVVICDETNNTAAVIDSNEFVADIFIKPARSINFIGLTF 713 (729) T ss_pred HHHHHHHHhhcCCCCHHHHHHHHHHHHHHHHHHHhccceeeeEEEEcCCCCCHHHhhCCeEEEEEEEEecCCccEEEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEcch--HHHHHHHhc Q lcl|Aclame:pro 374 NAVDR--IVEEFIEEV 387 (388) Q Consensus 374 ~~~~~--~~~~l~~~~ 387 (388) +++.. ++++++++| T Consensus 714 ~~~~~~~~~~e~~~~~ 729 (729) T protein:vir:10 714 VATRTGVAFEEVIGSV 729 (729) T ss_pred EEeecCccHHHHHhcC Confidence 88876 678888888 No 29 >protein:vir:106984 Length: 743 # NCBI annotation: contractile sheath protein gp18 # Family: family:all:661 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195136;genbank:gi:58532913;uniprot:Q5GQN6;genbank:GeneID:3260481 Probab=100.00 E-value=8.4e-81 Score=459.59 Aligned_cols=385 Identities=11% Similarity=0.055 Sum_probs=266.0 Q ss_pred CCCC--CCc-CCC-------------------------eEEEEcCCCcccccccCcceeEEEeeccccccccccCcceee Q lcl|Aclame:pro 1 MPVI--DQF-EHN-------------------------GISIETHEPPPPMGPPGDNVVAWVVTAPDKHADVAFSVPFRV 52 (388) Q Consensus 1 M~~~--t~~-~hG-------------------------V~~~e~~~~~~~i~~v~tav~g~vgta~~~~~~~~~~~~v~v 52 (388) +... ..+ +.| +.+++.+..........+.+..+.......+.....+.+..+ T Consensus 299 ~~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~d~~~v~v~~~~~~~~~~~~~v~~~~~~~s~~~~~~~~~~~~~~~ 378 (743) T protein:vir:10 299 KDWYLNTEIGSTGIKLGDIGPRPGTSQFATDNGITDDQVHFAVIDTTGELTGTANTIVERLTYLSKLSDARSEENANIYY 378 (743) T ss_pred ccccccchhhccccccccccccceeeeccccccccccceEEEEecCcceeeeccCceeEEEeeeecccccccccCcceee Confidence 0000 000 111 111111111111111111111122111111111111111111 Q ss_pred ccc-hhhhhhcccccccccchhhhhhhhccccceEEEEecccccccccccccccccccchh----hhhhhHhhhhhhhh- Q lcl|Aclame:pro 53 ANT-ADAQYLDSTGNELGTGWHAASETLKKTSVPQYFIVVPEGADDAATMANIIGGIDPTT----GRRTGIAALTECTE- 126 (388) Q Consensus 53 ~s~-~~~~~~~~~~~~~gtl~~a~~~~~~~~~~~~~vv~~~~~~~~~~~~~~~~~~~~~~t----g~~tgl~a~~~~~~- 126 (388) ... .+...+.......++...+....+.......+..........+.+..+++|+.|..+ +..++++++..... T Consensus 379 ~~~~~~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gG~d~~~~~~~~~~~~~~~~~~~~~~ 458 (743) T protein:vir:10 379 KNVINEQSAYLYHGNDAAVQIAASGEAWGQSSDQVLADAGTAFSRTTGYWVNLAGGNDDFAYDAGEFGAAMDLFLDTEET 458 (743) T ss_pred cceeccccceeeccCcccceeeeccccCccccceeeeecccccccccceEEEeecCccccccchhHHHHHHHHhhhcccc Confidence 111 111111112222334444444444544444444444444555566667888877543 23344444443332 Q ss_pred hhhheecccccc----hhHHHHHHHHHhhhCc-eEEEEecCCCcchh------------HHHHHHHhhhcccccceEEEE Q lcl|Aclame:pro 127 RPTLIGAPGFSQ----NKAVIDALASMAKRLK-CRAVIDGPSGSTQD------------AIDLSGLLGGEGTGHDRVYMV 189 (388) Q Consensus 127 ~p~ll~ap~~~~----~~~v~~~l~~~~~~~~-~~~i~d~p~~~~~~------------~~~~~~~~~~~~~~s~~~~~~ 189 (388) .++++++||+.+ ..+|..++.++|++++ |++++|+|.+.... ......+. ...++|+|+++| T Consensus 459 ~~~ll~~p~~~~~~~~~~~v~~a~~~~~~~~~~~~a~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~s~~~~~~ 537 (743) T protein:vir:10 459 EIDFVLMGGSMADEADTKSKATKVIAIAASRKDALAFVSPHKGNQIASTGNVALSSAQQKENTIAFF-SDLTSTSYAVFD 537 (743) T ss_pred CcceEEecCcccCccchHHHHHHHHHHHHhhCCeEEEEecCCCccccccccccccccccchHHHHHH-HhccCCeeEEEE Confidence 358999999754 4789999999998765 99999999754211 11111222 335689999999 Q ss_pred ecceecccccccceeehhhHHHHHHHHhccccccccccccccc--eeecccccccccCchhhhhhccccceEEEEEeCCC Q lcl|Aclame:pro 190 DPMPAIYSRKAQGNIYVPPSTIAMGAVAAVKPWESPGNQGVLI--QDVARVIDYNILDKSTEGDLLNRNGVSYFARTSMG 267 (388) Q Consensus 190 ~p~~~~~~~~~~~~~~~p~s~~~aG~~a~~d~~~s~~n~p~~~--~g~~~~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~ 267 (388) |||++++|+.++..+++|||+++||++||+|..+|||+.|+|. .|+...+.+.+.++++|++.||++|||||++|+++ T Consensus 538 ~p~~~~~d~~~~~~~~~p~s~~~AGl~a~~D~~~g~~~span~~~~gi~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~ 617 (743) T protein:vir:10 538 SGYKYVYDRFTDKYRYIPCNGDVAGLCVQTSNQLDDWYSPAGLNRGGILNAVKLAYNPNKADRDELYQNRINPVVSLRGQ 617 (743) T ss_pred ccceeeeccccCceeEechhHHHHHHHHHhhccCCcEEccCCeeeeeeeccccceecCChhHHHhHhhCCceEEEEecCC Confidence 9999999999999999999999999999999887777777763 35667777888899999999999999999999999 Q ss_pred cEEEEccccCC-----CceeeehhhHHHHHHHHHHHHHHHhcccCCHHHHHHHHHHHHHHHHHHHhcCCeeeeeEEEecc Q lcl|Aclame:pro 268 GFSLIGNRTVT-----GKFISFVGLEDAIARKLEAASQRAMSKQLTKSFMEQEIKKINLFMQDLVAAEIIPGGEVYLHPT 342 (388) Q Consensus 268 G~~~wG~rT~~-----~~~i~vrR~~~~i~~~i~~~~~~~vfepn~~~~~~~i~~~i~~~L~~l~~~Gal~g~~v~~d~~ 342 (388) |+++||+||++ |+||++||||+||+++|++.++|+|||||++.+|++|++++++||++||++|+|.||+|+||++ T Consensus 618 G~~~wG~rT~~s~d~~~~~i~vrR~~~~i~~si~~~~~~~v~e~n~~~~~~~i~~~i~~fL~~l~~~gal~~~~V~~d~~ 697 (743) T protein:vir:10 618 GITLFGDKTALAAPSAFDRINVRRLFLNLEKRARRLAEGVLFEQNDATTRAGFSSALNSYLSEVQARRGVTDYLVICDES 697 (743) T ss_pred eEEEEcccccCCCCcccceEeehhhHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhcCceeeeEEEEcCC Confidence 99999999973 9999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCHHHhhCCeEEEEEEEEecCcceeEEEEEEEcch--HHHHHHHh Q lcl|Aclame:pro 343 LNTVERYKNGSWYIVIDYGRYSPNEHMIFHLNAVDR--IVEEFIEE 386 (388) Q Consensus 343 ~Nt~~~i~~G~~~~~v~~~p~~pae~I~~~~~~~~~--~~~~l~~~ 386 (388) +||+++|++|+|+++|+++|++|+|||+|+|++... +++++++. T Consensus 698 ~nt~~~i~~G~~~~~i~~~p~~pae~I~~~~~~~~~~~~~~e~~~~ 743 (743) T protein:vir:10 698 NNTPDIIDRNEFVAEVYVKPTRSINFITITFTATKTGVTFSEVVGR 743 (743) T ss_pred CCCHHHhhCCeEEEEEEEEecCCcceEEEEEEEeecCcchHhhhcC Confidence 999999999999999999999999999999885544 55555555 No 30 >protein:vir:98824 Length: 774 # NCBI annotation: putative phage tail sheath protein # Family: family:all:661 # MgeID: mge:1530 # MgeName: Ma-LMM01 # Cross-refs: genbank:acc:YP_851105;genbank:gi:117530262;genbank:GeneID:4484488 Probab=100.00 E-value=2.4e-78 Score=446.16 Aligned_cols=369 Identities=14% Similarity=0.081 Sum_probs=270.0 Q ss_pred CCCCCCcCCCeEEEEcCCCcccccc-cCcceeEEEeeccccccccccCcceeeccchhhhhhcccc-----ccc------ Q lcl|Aclame:pro 1 MPVIDQFEHNGISIETHEPPPPMGP-PGDNVVAWVVTAPDKHADVAFSVPFRVANTADAQYLDSTG-----NEL------ 68 (388) Q Consensus 1 M~~~t~~~hGV~~~e~~~~~~~i~~-v~tav~g~vgta~~~~~~~~~~~~v~v~s~~~~~~~~~~~-----~~~------ 68 (388) |..-=++ ||||++|+.+++++|.. +.|++.+|+|.++.+ +.++|++++|+.|+...|... +.+ T Consensus 279 ~~~~v~~-~GVYVEEVpSGvrtIeGGV~TSVAAFVG~A~rG----Pvn~PvlITS~aD~~~~Fg~~~GGl~GassA~r~~ 353 (774) T protein:vir:98 279 ITRNVED-NGVVIQLEPALTGSISNRFSFYVTANDNTANRG----FTTSPALVTTIPDPAIHFTSFQGGLDGPRSAFRDF 353 (774) T ss_pred eEEEEec-CceEEEEeCCCCccccccccceeeeecccccCC----CCCcCEEEeehhHhhhhhccccCCccccceeeeee Confidence 5554454 79999999999999976 999999999998876 578999999999955443210 000 Q ss_pred ----ccc---hhhhhhhhccccceEEE-------------------------------Eeccc----------------- Q lcl|Aclame:pro 69 ----GTG---WHAASETLKKTSVPQYF-------------------------------IVVPE----------------- 93 (388) Q Consensus 69 ----gtl---~~a~~~~~~~~~~~~~v-------------------------------v~~~~----------------- 93 (388) +.. ..+.... .-++.-.+. +.... T Consensus 354 ~~~sG~~~L~i~A~~pG-awGN~ItV~I~~~t~~~~~l~v~~~~~s~f~~~~a~e~~tv~~~~~~~~~~v~e~~dn~~i~ 432 (774) T protein:vir:98 354 YTFNGTPLLRLQAVSEG-NWGNQVTVSIYPVNNSEFRLNVQDLNGSAFNPPLADEVYTVKLGDTNESGELNALLDSKFIR 432 (774) T ss_pred eeecccceEEEEEeecC-cCCCceEEEEEecCCceeEEEEEecCCccccccccceeEEEecccccccceeeeeeceeeEe Confidence 000 0000000 000000000 00000 Q ss_pred --ccccccc----------------------c-------------------ccccccccchhhhhhhHhhhhh----hhh Q lcl|Aclame:pro 94 --GADDAAT----------------------M-------------------ANIIGGIDPTTGRRTGIAALTE----CTE 126 (388) Q Consensus 94 --~~~~~~~----------------------~-------------------~~~~~~~~~~tg~~tgl~a~~~----~~~ 126 (388) ..+.... . ..+.++.++. .+....... ... T Consensus 433 ~~~~~~~~~~in~vs~lv~~~~~~~a~~d~~~~~~~~~~~~~~~~~~~~v~v~lagG~Dg~---~tt~~~igg~~~~~~~ 509 (774) T protein:vir:98 433 GFFLPKSIDSINYDAALVRQSPLRLAPPDESETDVENPAHVDFYGPNVLVDVTLENGYDGP---PVTNDDYVSIIRTLEN 509 (774) T ss_pred ecccccccccccccccccccchhcccccccccccccccccccccCCcceEEEeecCCCCcc---cccchheecccccccc Confidence 0000000 0 0000111111 111110000 001 Q ss_pred h-hhheecccccchhHHHHHHHHHhhh-----CceEEEEecCCCcchhHHHHHHHhhhcccccceEEEEecceecccccc Q lcl|Aclame:pro 127 R-PTLIGAPGFSQNKAVIDALASMAKR-----LKCRAVIDGPSGSTQDAIDLSGLLGGEGTGHDRVYMVDPMPAIYSRKA 200 (388) Q Consensus 127 ~-p~ll~ap~~~~~~~v~~~l~~~~~~-----~~~~~i~d~p~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~~~~~ 200 (388) . ..++++ +. ...++..+|..+|++ ..+++++|.|.+.+.+. ....+.+++|+|+++||||++++|+.. T Consensus 510 tgi~aLl~-a~-~~~~V~~aii~~~e~~~~~~~~r~avid~p~g~t~~~----Ai~~r~~f~S~~aal~~Pwvkv~D~~~ 583 (774) T protein:vir:98 510 QPVHILLV-GT-TNVGVQQALITEAERASDSDGLRIAVLAAPPRTTPTL----AASVTRGFNSTRAVMVAGWFTYAGQPN 583 (774) T ss_pred cceeEEEc-Cc-cchhhHHHHHHHHHHhhhcccceEEEEECCCCCCHHH----HHHHHhccCCceEEEEeCcEEEeccCC Confidence 1 122332 22 345667777776664 35788999887765332 122234788999999999999999999 Q ss_pred cceeehhhHHHHHHHHhccccccccccccc-cceeecccccccccCchhhhhhccccceEEEE-EeCCCcEEEEccccCC Q lcl|Aclame:pro 201 QGNIYVPPSTIAMGAVAAVKPWESPGNQGV-LIQDVARVIDYNILDKSTEGDLLNRNGVSYFA-RTSMGGFSLIGNRTVT 278 (388) Q Consensus 201 ~~~~~~p~s~~~aG~~a~~d~~~s~~n~p~-~~~g~~~~~~~~~~~~~~~~~~Ln~~gIn~i~-~~~~~G~~~wG~rT~~ 278 (388) +..+++|||+++||++|++|+|+||+|+++ ++.|+..........++.|++.||.++||+++ .++++|+++||+||++ T Consensus 584 g~~~~vPpSg~vAGl~ArtDv~kSPANk~I~Givg~ai~~~l~~~~t~ae~d~Ln~~gIN~i~itt~g~G~rvWG~RTls 663 (774) T protein:vir:98 584 SSRYGVPGAAVYAGKLAAIDFFVSPAARSLVGPLFNIIESDTDNYTSRSNQDIYSAARLEVLSLDTVDRTYRFASGVTLS 663 (774) T ss_pred CceeecChhHHHHHHHHhcCcccccCCceeecceeccccccccccccchhhhhhcccccceeEEEEcCCcEEEEcccccC Confidence 999999999999999999999999999997 67777666666777788899999999999997 6889999999999985 Q ss_pred ----CceeeehhhHHHHHHHHHHHHHHHhcccCCHHHHHHHHHHHHHHHHHHHhcCCeeeee-EEEeccCCCHHHhhCCe Q lcl|Aclame:pro 279 ----GKFISFVGLEDAIARKLEAASQRAMSKQLTKSFMEQEIKKINLFMQDLVAAEIIPGGE-VYLHPTLNTVERYKNGS 353 (388) Q Consensus 279 ----~~~i~vrR~~~~i~~~i~~~~~~~vfepn~~~~~~~i~~~i~~~L~~l~~~Gal~g~~-v~~d~~~Nt~~~i~~G~ 353 (388) |+||++|||++||+++|++.++|+|||||++.+|++|+++++.||++||++|+|+|++ |+||+++||+++|++|+ T Consensus 664 sDp~wr~InVRRlfd~Ie~SI~~~~~~~VfEPNd~~l~~~I~~sI~~fL~~L~~~GaL~G~~~V~~D~etNt~~dI~~G~ 743 (774) T protein:vir:98 664 TDPAWERIYLRRVHDVVRQGAHAILRNYVAMPNSRLVRNQIAAALNAFMGELKRNGNIVSFRPAIIDGSNNSTAAYFSRE 743 (774) T ss_pred CCcccceEeehhhHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceecceEEEEcCCCCCHHHhhCCE Confidence 9999999999999999999999999999999999999999999999999999999997 89999999999999999 Q ss_pred EEEEEEEEecCcceeEEEEEEEcchHHHHHHHh Q lcl|Aclame:pro 354 WYIVIDYGRYSPNEHMIFHLNAVDRIVEEFIEE 386 (388) Q Consensus 354 ~~~~v~~~p~~pae~I~~~~~~~~~~~~~l~~~ 386 (388) |+++|+++|++|+|||+|+++++.++.+ |+| T Consensus 744 l~i~I~vaP~~PAEfIilri~q~t~~~~--l~E 774 (774) T protein:vir:98 744 LYVSLQFQPLYSADYIYVTISRDTETSP--LGE 774 (774) T ss_pred EEEEEEEEecCCcceEEEEEEEeeccee--ccC Confidence 9999999999999999999999999866 666 No 31 >protein:vir:104477 Length: 749 # NCBI annotation: gp18 # Family: family:all:661 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214663;genbank:gi:61806304;genbank:GeneID:3294532 Probab=100.00 E-value=1.6e-77 Score=441.65 Aligned_cols=374 Identities=13% Similarity=0.067 Sum_probs=246.2 Q ss_pred CCC-CCCcCCCeEEEEcCCCcccccccCcceeEEEe-------------------------------ecc---------c Q lcl|Aclame:pro 1 MPV-IDQFEHNGISIETHEPPPPMGPPGDNVVAWVV-------------------------------TAP---------D 39 (388) Q Consensus 1 M~~-~t~~~hGV~~~e~~~~~~~i~~v~tav~g~vg-------------------------------ta~---------~ 39 (388) +.- .++-+| +........-+....+.+-.++. ..+ . T Consensus 316 ~~~g~~D~~~---v~v~~~~g~~~~~~g~v~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~v~~~~~~~~~~~~~~~~ 392 (749) T protein:vir:10 316 GVGGHRDEMH---VILVDIDGGVTGTVGALLERYIDVSKASDAKTSVGETNYYAEVIKQKSEFIYWAEHESTLYAATSSA 392 (749) T ss_pred cccCCCCceE---EEEecCCCeeeecccceeeeeeeccccccccccccccchhhhhhccCCCEEEEEecccccccccccc Confidence 000 001111 11111111111111111111111 000 0 Q ss_pred cccccccCcceeeccchhhhhhcccccccccchhhhhhhhccccceEEEEecccccccccccccccccccchhhhhhhHh Q lcl|Aclame:pro 40 KHADVAFSVPFRVANTADAQYLDSTGNELGTGWHAASETLKKTSVPQYFIVVPEGADDAATMANIIGGIDPTTGRRTGIA 119 (388) Q Consensus 40 ~~~~~~~~~~v~v~s~~~~~~~~~~~~~~gtl~~a~~~~~~~~~~~~~vv~~~~~~~~~~~~~~~~~~~~~~tg~~tgl~ 119 (388) .+.....+....... .. .......+.......+.. .....++++...+.+...+............+....+. T Consensus 393 ~~~~~~~~~~~~~~~-----~~-~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~gg~d~~~~~~~~~~~~~~~~~~~~~l~ 465 (749) T protein:vir:10 393 SDGLFGQTAANRQFN-----LF-RSAAGSVDYPAGVTTLGS-KNNATYYYRLSGGVNYTVSAGQYTITNTDIGSAYELIG 465 (749) T ss_pred cccccccccccceee-----cc-ccccccceeccccccccc-cCCcEEEEEccCCcccccccccccccchhHHHHHHHhh Confidence 000000000000000 00 000000111111122222 22334555555554443333333332333334444444 Q ss_pred hhhhhhhhhhheeccccc--chhHHHHHHHHHhhhCc-eEEEEecCCCcchhH-------HHHHHHhhhcccccceEEEE Q lcl|Aclame:pro 120 ALTECTERPTLIGAPGFS--QNKAVIDALASMAKRLK-CRAVIDGPSGSTQDA-------IDLSGLLGGEGTGHDRVYMV 189 (388) Q Consensus 120 a~~~~~~~p~ll~ap~~~--~~~~v~~~l~~~~~~~~-~~~i~d~p~~~~~~~-------~~~~~~~~~~~~~s~~~~~~ 189 (388) ........+.++..|++. +..+|+.+|.++|++++ +++++|+|.+..... .+...+ .....+|.|+++| T Consensus 466 ~~~~~~~~~li~~~~~~~~~~~~~v~~al~~~~~~~~~~~~~~d~p~~~~~~~~~~~~~~~~~~~~-~~~~~~s~~~~~~ 544 (749) T protein:vir:10 466 DPESQIVDFIISGPSGTSDANALAKITSLVNIAEERRDCMVFVSPRRGNVIGISNTTTITTNIVDF-FKKLPSSSYMVFD 544 (749) T ss_pred hhhhcccceEEEecCCCCcchhHHHHHHHHHHHhhcCCEEEEEcCCCCcccccccchhhhhHHHHH-HhhccCceeEEEE Confidence 333334444455555654 34678999999999775 567778776532111 111111 1234578899999 Q ss_pred ecceecccccccceeehhhHHHHHHHHhccccccccccccccc--eeecccccccccCchhhhhhccccceEEEEEeCCC Q lcl|Aclame:pro 190 DPMPAIYSRKAQGNIYVPPSTIAMGAVAAVKPWESPGNQGVLI--QDVARVIDYNILDKSTEGDLLNRNGVSYFARTSMG 267 (388) Q Consensus 190 ~p~~~~~~~~~~~~~~~p~s~~~aG~~a~~d~~~s~~n~p~~~--~g~~~~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~ 267 (388) |||++++|+.++..+++|||+++||++||+|..+|||+.|+|. .++...+.+...+++.|++.||++|||||++|+++ T Consensus 545 ~p~~~~~d~~~~~~~~~p~s~~vAGl~Ar~D~~~g~~~SPan~~~~~i~g~~~~~~~~~~~e~~~Ln~~gIn~i~~~~g~ 624 (749) T protein:vir:10 545 SGYKYIYDKYNDVYRYIPCNGDTAGLCLQTNEISEPWFSPAGFQRGVLRNAIKLAYTPNKAQRDQLYANRVNPIVSFPGQ 624 (749) T ss_pred ccceeeeccccCceEEechHHHHHHHHHHhhccCCcEECcCCceeeeeeccccceeecChhHHHhhhhCCceEEEEecCC Confidence 9999999999999999999999999999999888888777763 24666667788889999999999999999999999 Q ss_pred cEEEEccccCC-----CceeeehhhHHHHHHHHHHHHHHHhcccCCHHHHHHHHHHHHHHHHHHHhcCCeeeeeEEEecc Q lcl|Aclame:pro 268 GFSLIGNRTVT-----GKFISFVGLEDAIARKLEAASQRAMSKQLTKSFMEQEIKKINLFMQDLVAAEIIPGGEVYLHPT 342 (388) Q Consensus 268 G~~~wG~rT~~-----~~~i~vrR~~~~i~~~i~~~~~~~vfepn~~~~~~~i~~~i~~~L~~l~~~Gal~g~~v~~d~~ 342 (388) |+++||+||+. |+||||||||+||+++|++.++|+|||||++.||++|++++++||++||++|+|.||+|+||++ T Consensus 625 G~~~wG~rT~~s~d~~~~~i~vRRl~~~ie~si~~~~~~~v~epn~~~l~~~i~~~i~~fL~~l~~~G~i~~f~V~~d~~ 704 (749) T protein:vir:10 625 GVVLYGDKTALGFASAFDRINIRRLFLTVERVISTAAKAQLFEQNDEAQRSLFINIVEPYLRDVQGRRGVVDFLVKCDST 704 (749) T ss_pred eEEEEcceecCCCCcccceeehhhhHHHHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHHHHHHHhcCCeeeeEEEEcCC Confidence 99999999973 9999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCHHHhhCCeEEEEEEEEecCcceeEEEEEEEcch--HHHHHHH Q lcl|Aclame:pro 343 LNTVERYKNGSWYIVIDYGRYSPNEHMIFHLNAVDR--IVEEFIE 385 (388) Q Consensus 343 ~Nt~~~i~~G~~~~~v~~~p~~pae~I~~~~~~~~~--~~~~l~~ 385 (388) +||+++|++|+|+++|+++|++|+|||+|+|+++.. +++++.+ T Consensus 705 ~Nt~~~i~~G~~~~~i~~~P~~pae~I~~~~~~~~~~~~~~e~~s 749 (749) T protein:vir:10 705 NNTPEAVDRGEFYAEVFLKPTRTINYVQLTFVATRTGVSFAEVAS 749 (749) T ss_pred CCCHHHhhCCEEEEEEEEEecCCccEEEEEEEEeecCcchHHHhC Confidence 999999999999999999999999999999988775 4555555 No 32 >protein:vir:5833 Length: 742 # NCBI annotation: similar to tail sheath protein # Family: family:all:661 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835619;genbank:gi:30044022 Probab=100.00 E-value=1.1e-72 Score=415.07 Aligned_cols=362 Identities=13% Similarity=0.009 Sum_probs=238.2 Q ss_pred CCCCCCcCCCeEEEEc--CCCcccccccC------cce---------------------e-------EEEeecc------ Q lcl|Aclame:pro 1 MPVIDQFEHNGISIET--HEPPPPMGPPG------DNV---------------------V-------AWVVTAP------ 38 (388) Q Consensus 1 M~~~t~~~hGV~~~e~--~~~~~~i~~v~------tav---------------------~-------g~vgta~------ 38 (388) |..+++..-||..... .++..++.... +++ + .++...+ T Consensus 320 ~~~~~~~~~~~s~~~~~~~~~~~~v~d~~~~~~~~~~v~~~~t~~~~~pp~~~~~~e~v~~ngG~~f~v~s~~~~g~~i~ 399 (742) T protein:vir:58 320 SQDIKQNVAGVEKWVPVGFEGIYSVGDFTVIVNELTNVSIPVTDSAIIPPMRFTRIEQITLSGGASFSVISNQPYGFNIQ 399 (742) T ss_pred ccccCcCccceeEEEeccccccccccceeeeccccccceeeccccccCCcccccccceeecccCcceEEEEecccCccee Confidence 6666665555444332 11111111100 000 0 0000000 Q ss_pred cc---ccccccCcceeeccchhhhhhcc------cccccccchhhhhhhhccccceEEEEecccccccccccccccc--- Q lcl|Aclame:pro 39 DK---HADVAFSVPFRVANTADAQYLDS------TGNELGTGWHAASETLKKTSVPQYFIVVPEGADDAATMANIIG--- 106 (388) Q Consensus 39 ~~---~~~~~~~~~v~v~s~~~~~~~~~------~~~~~gtl~~a~~~~~~~~~~~~~vv~~~~~~~~~~~~~~~~~--- 106 (388) .+ ....+++.+..+........... .....+.+..........++..+.+. . .....+.++ T Consensus 400 ~~~as~~~s~ln~~~~V~Gt~aa~~~~d~~t~~~v~s~~~alp~~a~sv~laGG~dg~v~-v------~~~~~D~iG~~~ 472 (742) T protein:vir:58 400 DSRHSYWLSPFKDDELIIGTELVLPALDVSTEFGVSSWEEALPEFSFLMPFQGGSDGYIR-V------DENEPDTIGRVK 472 (742) T ss_pred ccCcceEEeccCCceEEEeehhhccccccchheeccccccccceeeEEEeecCCcccccc-c------cCCCcccccccc Confidence 00 00012222222221111000000 00000011110000111111111111 1 111122222 Q ss_pred cccchhhhhhhHhhhhhhhhhhhheecccccchhHHHHHHHHHhhhC--ceEEEEecCCCcchhHHHHHHHhhhcccccc Q lcl|Aclame:pro 107 GIDPTTGRRTGIAALTECTERPTLIGAPGFSQNKAVIDALASMAKRL--KCRAVIDGPSGSTQDAIDLSGLLGGEGTGHD 184 (388) Q Consensus 107 ~~~~~tg~~tgl~a~~~~~~~p~ll~ap~~~~~~~v~~~l~~~~~~~--~~~~i~d~p~~~~~~~~~~~~~~~~~~~~s~ 184 (388) ..+...+.++||+++.+.. .++|+++||+++. .+++++.++|+.. +.+++.|+|.+.+... .+.+ ...+++|. T Consensus 473 ~~d~~~adrTGL~ALlev~-eVtILiAPG~t~~-~v~aav~A~la~a~~Rl~vL~D~P~~~tt~~-~A~a--~r~~~nSs 547 (742) T protein:vir:58 473 ITPALLANYERLLPLLTED-QFDLVLTPYLTFA-DHAGTVNAFINRAENRFLYLFDIAGDDDTEN-LAIS--LAGYINSS 547 (742) T ss_pred cccccccchhHHHHhhhcC-CCcEEEEcCCCch-HHHHHHHHHHHhhcCCeEEEEecCCCCchHH-HHHH--HHhccCCc Confidence 2333346688999988775 5799999999754 3455555555432 4567788887654332 2222 23467899 Q ss_pred eEEEEecceecccccccceeehhhHHHHHHHHhccccccccccccccceeecccccccccCchhhhhhccccceEEEEEe Q lcl|Aclame:pro 185 RVYMVDPMPAIYSRKAQGNIYVPPSTIAMGAVAAVKPWESPGNQGVLIQDVARVIDYNILDKSTEGDLLNRNGVSYFART 264 (388) Q Consensus 185 ~~~~~~p~~~~~~~~~~~~~~~p~s~~~aG~~a~~d~~~s~~n~p~~~~g~~~~~~~~~~~~~~~~~~Ln~~gIn~i~~~ 264 (388) |+++||||++..+. +..+++|||+++||++||+|..+|+|++|+|. ++... ....++|++.||++|||+|++| T Consensus 548 raaly~PwVkv~d~--~~~r~vPpSgaIAGL~ARtD~erGvw~SPANr-gii~~----~~~s~se~d~LN~~GINtIrsf 620 (742) T protein:vir:58 548 FATTFFPWVRRLTN--KGMRTVPASLAAYRSIRTTDPETGLAPVGARR-GVVTG----EPVRQVDWEDLYNNRINPIVRV 620 (742) T ss_pred eEEEEeceeeeccC--CcceeechHHHHHHHHHHhccCCceEecCCcc-eeeec----cccchhhHHHHhhCCceEEEEC Confidence 99999999998764 56788999999999999999999999999884 33322 3456789999999999999987 Q ss_pred CCCcEEEEccccCC-----CceeeehhhHHHHHHHHHHHHHHHhcccCCHHHHHHHHHHHHHHHHHHHhcCCeeeeeEEE Q lcl|Aclame:pro 265 SMGGFSLIGNRTVT-----GKFISFVGLEDAIARKLEAASQRAMSKQLTKSFMEQEIKKINLFMQDLVAAEIIPGGEVYL 339 (388) Q Consensus 265 ~~~G~~~wG~rT~~-----~~~i~vrR~~~~i~~~i~~~~~~~vfepn~~~~~~~i~~~i~~~L~~l~~~Gal~g~~v~~ 339 (388) ++||++||+||++ |+||+|||||+||+++|+++++|+|||||++.||++|++++++||++||++|+|+||+|+| T Consensus 621 -G~G~rlWGnRTlassDs~wryInVRRlfd~Ie~SI~~a~q~~VfEPNd~~L~~sIk~sInafL~~L~aqGALlGfrV~l 699 (742) T protein:vir:58 621 -GNDVLLFGQKTMLNVNSALNRINVRRLLIVMRNRISQILSSYLFENNTSENRLRAEALVRQYLESLRLRGAVTDYEVAI 699 (742) T ss_pred -CCcEEEEcceecCCCCcccceEeehhhHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEE Confidence 7899999999983 9999999999999999999999999999999999999999999999999999999999999 Q ss_pred eccCCCHHHhhCCeEEEEEEEEecCcceeEEEEEEEcchHHHHHHH Q lcl|Aclame:pro 340 HPTLNTVERYKNGSWYIVIDYGRYSPNEHMIFHLNAVDRIVEEFIE 385 (388) Q Consensus 340 d~~~Nt~~~i~~G~~~~~v~~~p~~pae~I~~~~~~~~~~~~~l~~ 385 (388) |+ +||+++|++|+|+++|+++|++|+|||+|++.++....+ |+ T Consensus 700 De-tNTpeDI~~Gklvv~I~vAP~~PAEfI~lrf~it~tga~--Fs 742 (742) T protein:vir:58 700 DS-VTTPTDIDNNTLRARVTVQPARSIEYIDITFVITPTGVE--IT 742 (742) T ss_pred cC-CCCHHHhhCCEEEEEEEEEccCCcceEEEEEEEEecccc--cC Confidence 95 588999999999999999999999999999998888777 44 No 33 >protein:vir:79798 Length: 717 # NCBI annotation: tail sheath subunit # Family: family:all:12069 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429630;genbank:gi:156564120;genbank:GeneID:5525563 Probab=100.00 E-value=8.5e-51 Score=295.10 Aligned_cols=334 Identities=10% Similarity=0.062 Sum_probs=198.1 Q ss_pred CCCCCCcCCCeEEEEcCCCcccccccCcceeEEEeeccccccccccCcceeeccchhhhhhcccc-----------cccc Q lcl|Aclame:pro 1 MPVIDQFEHNGISIETHEPPPPMGPPGDNVVAWVVTAPDKHADVAFSVPFRVANTADAQYLDSTG-----------NELG 69 (388) Q Consensus 1 M~~~t~~~hGV~~~e~~~~~~~i~~v~tav~g~vgta~~~~~~~~~~~~v~v~s~~~~~~~~~~~-----------~~~g 69 (388) ||.. -||+..+. |........+.++.--+.- ...+.++.+...+.......+... +..+ T Consensus 330 ~~~~---~~g~~~~~------pl~~ts~dy~~~~~~vdgI-~~~~~~~V~~~g~~s~a~a~~~~g~~s~d~a~f~Gg~dg 399 (717) T protein:vir:79 330 KPES---KRGMISED------PLVFKSGDYTNFKMLVDAI-NNHPFNNVVRARTKPEFEATFTSTLQAAADAKFSGGKDE 399 (717) T ss_pred cccc---cCcceecc------ccccccCceeeeeeeeccc-ccCchhheeeeecccccceeeeecccCchhhccCCCccc Confidence 5542 45655443 1111111121111111100 111222222222222111111100 0001 Q ss_pred cchhhhhhhhccccceEEEEecccccccccccccccccccchhhhhhhHhhhhhhhhhhhheecccccc-------hhHH Q lcl|Aclame:pro 70 TGWHAASETLKKTSVPQYFIVVPEGADDAATMANIIGGIDPTTGRRTGIAALTECTERPTLIGAPGFSQ-------NKAV 142 (388) Q Consensus 70 tl~~a~~~~~~~~~~~~~vv~~~~~~~~~~~~~~~~~~~~~~tg~~tgl~a~~~~~~~p~ll~ap~~~~-------~~~v 142 (388) +.....+.+...+.. ....... ...+.++.|+. ....+++.|+... ...+ T Consensus 400 -l~~~~ee~Y~~lGgk------------~~d~g~l-----t~~aays~LE~-----~dVDlVil~ga~adtt~ga~~d~v 456 (717) T protein:vir:79 400 -LSLDKEEMYKRLGGE------------KNEEGFV-----TKQGAYQYLEN-----YEVDYVIPLGVHADTKLIGKYDDF 456 (717) T ss_pred -cccchhhhhcccccc------------ccccccc-----cchhhhhhcCc-----ceeEEEEecCccccccccchhhhH Confidence 000111111110000 0000000 00112222221 1234555554321 1234 Q ss_pred HHHHHHHhhhC-----ceEEEEec--CCCcchhH-HHHHHHhh----------------------hcccccceEEEEecc Q lcl|Aclame:pro 143 IDALASMAKRL-----KCRAVIDG--PSGSTQDA-IDLSGLLG----------------------GEGTGHDRVYMVDPM 192 (388) Q Consensus 143 ~~~l~~~~~~~-----~~~~i~d~--p~~~~~~~-~~~~~~~~----------------------~~~~~s~~~~~~~p~ 192 (388) +.++..+|... .++.+++. |.+..... .+++.... ...+ +.+...++++ T Consensus 457 a~alad~caalSal~r~ai~VI~l~sp~D~~~AtVe~~~~kLs~~Aaa~~~~d~~~a~a~~~~~~~idi-s~y~~vv~~~ 535 (717) T protein:vir:79 457 AYQLALACAVMSHYNSVTIGIIPTTTPSDISLAGVEEHVKKLENYANEFYMRDRFGNIIFDADRNKIDL-GQFIEVVAGP 535 (717) T ss_pred HHHHHHHHHHhhhccccceeeeccccccccchhhHHHHHHHHHhhhhhhhhhcchhccccccccccccc-cceeeeeecc Confidence 45555555422 12233332 21111110 00000000 0011 2234444444 Q ss_pred eecccccccceeehhhHHHHHHHHhccccccccccccc-cceeecccccccccCchhhhhhccccceEEEEEeCCCcEEE Q lcl|Aclame:pro 193 PAIYSRKAQGNIYVPPSTIAMGAVAAVKPWESPGNQGV-LIQDVARVIDYNILDKSTEGDLLNRNGVSYFARTSMGGFSL 271 (388) Q Consensus 193 ~~~~~~~~~~~~~~p~s~~~aG~~a~~d~~~s~~n~p~-~~~g~~~~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~ 271 (388) .....+........||+|++||+.++..+|+||+|+++ ++.++. ..++..|++.||++|||||+.++++|+++ T Consensus 536 ~~iv~~~~~~~~~~p~AG~vAGldA~rGVwkSPANk~I~GVvgLa------~~lT~sE~d~Ln~aGIntIr~~~GrGirV 609 (717) T protein:vir:79 536 DFIVRNTRLGQMASTPDASYIGMVSQLKTQSAPTNKPLPSVTALR------YTYSANQLNRLTKARFATFKYKQDGSIGV 609 (717) T ss_pred eeEEEcCCCceeecCHHHHHHHHHhcCCcccccccceecccccCc------ccCCHHHHHHHhhCCeEEEEEeCCceEEE Confidence 44444555667889999999999999999999999987 454443 45678899999999999999999999999 Q ss_pred EccccCC-----CceeeehhhHHHHHHHHHHHHHHHhcccCCHHHHHHHHHHHHHHHHHHHhcCCeeeeeEEEeccCCCH Q lcl|Aclame:pro 272 IGNRTVT-----GKFISFVGLEDAIARKLEAASQRAMSKQLTKSFMEQEIKKINLFMQDLVAAEIIPGGEVYLHPTLNTV 346 (388) Q Consensus 272 wG~rT~~-----~~~i~vrR~~~~i~~~i~~~~~~~vfepn~~~~~~~i~~~i~~~L~~l~~~Gal~g~~v~~d~~~Nt~ 346 (388) ||+||++ |+||++||++++|+++|++.++|+|||||++.+|.+|+++|++||++||++|+|.||++.+ +||+ T Consensus 610 WGaRTtasd~sdWryInVRRl~D~Ie~sIr~al~~yVgEPNd~~tr~~Ik~sI~afL~~L~r~GAI~Gykvdv---tnT~ 686 (717) T protein:vir:79 610 VDAPTSAHAGSDYTRLSTARIVKEAVNAVREVADPFIGEPNDTGNRNALTAAVDKRLSKMIENKALLGFDFRL---VVTP 686 (717) T ss_pred EeeeecCCCCcccceeehhhhHHHHHHHHHHHHHHhccccCCHHHHHHHHHHHHHHHHHHHhcCceecceeeE---ecCh Confidence 9999974 9999999999999999999999999999999999999999999999999999999999765 7999 Q ss_pred HHhhCCeEEEEEEEEecCcceeEEEEEEEcc Q lcl|Aclame:pro 347 ERYKNGSWYIVIDYGRYSPNEHMIFHLNAVD 377 (388) Q Consensus 347 ~~i~~G~~~~~v~~~p~~pae~I~~~~~~~~ 377 (388) +++++|+++++|+++|++|+|||+|+++.+. T Consensus 687 ~di~~G~l~V~I~vaPv~PaEfI~ititITA 717 (717) T protein:vir:79 687 QQELLGEGSIELSLEAPNELRRLTTIVSLSA 717 (717) T ss_pred hHhhCCEEEEEEEEEecCcccEEEEEEEEeC Confidence 9999999999999999999999999999998 No 34 >protein:vir:63742 Length: 562 # NCBI annotation: Tsh # Family: family:all:2449 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547629;genbank:GeneID:3783475 Probab=100.00 E-value=1.9e-39 Score=232.91 Aligned_cols=357 Identities=10% Similarity=0.072 Sum_probs=249.6 Q ss_pred CCC----CCCc-CCCeEEEEcCCCcccccccCcceeEEEeeccccccccccCcceeeccchhhhhhcccccccccchhhh Q lcl|Aclame:pro 1 MPV----IDQF-EHNGISIETHEPPPPMGPPGDNVVAWVVTAPDKHADVAFSVPFRVANTADAQYLDSTGNELGTGWHAA 75 (388) Q Consensus 1 M~~----~t~~-~hGV~~~e~~~~~~~i~~v~tav~g~vgta~~~~~~~~~~~~v~v~s~~~~~~~~~~~~~~gtl~~a~ 75 (388) |+. |-.| ++|||++|..++++++..+++++.+|||.++.+ +.+++++++++.++...|+. |.|..++ T Consensus 1 ~~~~~~~~~~~~~Pgv~~~~~~s~~~~~~~~~~~~~~~iG~a~~G----~~~~~~~~~~~~~~~~~fg~----g~l~~~i 72 (562) T protein:vir:63 1 MAIEIYPRKPVSRPHTEISVDTSGIGGSSSGSEKILCLVGSATGG----KPNAVYKVRNYSQAKSVFRS----GELLDAI 72 (562) T ss_pred CeeeeeCCCcccCCceEEEEecCCCcccCCCCCceEEEEEEeCCC----CCceeEEEccHHHHHHHhcC----CchHHHH Confidence 543 2223 459999999999999999999999999999888 55789999999999988765 3455554 Q ss_pred hh----hhccccceEEEEecccccccccccccc----------------------------------------------- Q lcl|Aclame:pro 76 SE----TLKKTSVPQYFIVVPEGADDAATMANI----------------------------------------------- 104 (388) Q Consensus 76 ~~----~~~~~~~~~~vv~~~~~~~~~~~~~~~----------------------------------------------- 104 (388) .. .+.++++.+|.+++........+...+ T Consensus 73 ~~a~~~~~~~g~~~~~~~rv~~a~~a~~~~~~~~~~a~~~g~~~n~i~v~~~~~~~~~~~~~~v~~~~~~~~ev~~~~g~ 152 (562) T protein:vir:63 73 ERAWNPGEGTGAGDILAMRVEEAKEATFEAEGVKVSSTIYGADANDIQVALEDNTITGTKRLSIVFAKERVNQVYDNLGS 152 (562) T ss_pred HHhccccccCCceEEEEEEcCCCccceeEecceeEEEeecccCCCeEEEEEecCCCCCCcceEEEecCCCcchhhhhccc Confidence 43 335677666666542211110000000 Q ss_pred -------------------------------cccccc------hhhh-hhhHhhhhh----------------------- Q lcl|Aclame:pro 105 -------------------------------IGGIDP------TTGR-RTGIAALTE----------------------- 123 (388) Q Consensus 105 -------------------------------~~~~~~------~tg~-~tgl~a~~~----------------------- 123 (388) .++... ..|. .+...+... T Consensus 153 V~~i~y~g~~~~~~~~v~~~~~~~~a~~l~~~~g~~~v~~~~L~~g~~~~~~~l~~~in~~~~~~aky~~~~gn~i~~~~ 232 (562) T protein:vir:63 153 IFSIKYKGTEASATFTVAVDPVTFKATKLTLKAGDKTVKEYDLGSGAYAETNVLISDINNLPDFEAKFFPIGDKNLTTDN 232 (562) T ss_pred eeeeeeecccccceEEEEecCcceeEEEEEeecCCcceeEEEecCCccchhHHHHHhhccccceEEEeeccCCceeeeec Confidence 000000 0000 000000000 Q ss_pred ------hh---------------------------------------------------------------hhhhheecc Q lcl|Aclame:pro 124 ------CT---------------------------------------------------------------ERPTLIGAP 134 (388) Q Consensus 124 ------~~---------------------------------------------------------------~~p~ll~ap 134 (388) .. ......++| T Consensus 233 ~d~~~~~~vkt~~~~v~t~~~d~~~~~~~~~~v~~~~~~~~~la~~~~~~LtGG~dGt~~~~~~~al~ale~~~~~~i~~ 312 (562) T protein:vir:63 233 FDAQIDVDIKTKEAYVKAVGGDIEKQTAYNGYVDFEFDRSKEIANFPLTKLTGGDNGTIPESWADKFSYFANEGGYYLVP 312 (562) T ss_pred cccccccchhhhhhhhhhhhhhhhhcccccceeeeeeccccceecccceeeecCCCCCchhhHHHHHHHHHhCCcEEEEe Confidence 00 000001111 Q ss_pred cccchhHHHHHHHHHhhhC-----ceEEEEecCCCcchhHHHHHHHhhhcccccceEEEEecceecccccccceeehhh- Q lcl|Aclame:pro 135 GFSQNKAVIDALASMAKRL-----KCRAVIDGPSGSTQDAIDLSGLLGGEGTGHDRVYMVDPMPAIYSRKAQGNIYVPP- 208 (388) Q Consensus 135 ~~~~~~~v~~~l~~~~~~~-----~~~~i~d~p~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~- 208 (388) +++++++.+++.++++++ .++++++.+.+.+......+ ...+++++++.++|+....+. .+....+|+ T Consensus 313 -~t~d~av~~~l~a~vkr~~~~g~~~~aVlg~~~~~~~~~~~~~----a~~~n~ervv~v~~~~~~~~~-~~~~~~~~~~ 386 (562) T protein:vir:63 313 -LTSKQAVHAEALQFVRDCSYNGNPMRVFVGGGIGESMEQLFTR----AIGLQNERAGLIGFSGTVKMD-DGRSLKMPGY 386 (562) T ss_pred -cCCCHHHHHHHHHHHHHHHhCCCcEEEEecCCCCCCHHHHHHH----hhhcCCCcEEEEecCeeEECC-CCceeeechh Confidence 123456667777776544 35788887766554433322 346789999999999765543 345566676 Q ss_pred --HHHHHHHHhccccccccccccccceeecccccccccCchhhhhhccccceEEEEEeCCCcEEEEcc-cc--------- Q lcl|Aclame:pro 209 --STIAMGAVAAVKPWESPGNQGVLIQDVARVIDYNILDKSTEGDLLNRNGVSYFARTSMGGFSLIGN-RT--------- 276 (388) Q Consensus 209 --s~~~aG~~a~~d~~~s~~n~p~~~~g~~~~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~-rT--------- 276 (388) ++++||+.++.++++|+.|+++...++ ...+++.|++.|+++|+.+++...+++.++|.. ++ T Consensus 387 ~~aa~vAGl~A~~~~~~SlT~~~i~~~~v------~~~~t~~e~~~li~~Gv~~l~~~~~~~v~~~~iv~~itT~t~~~~ 460 (562) T protein:vir:63 387 MFAAQVAGLTCGLEIGEAITFKNIAIETL------DTIYEGSQLDQLNESGIITAEFVRNRAVTNFRIVDDVTTFNDKTD 460 (562) T ss_pred HHHHHHHHHhhcCchhcCccceeeccccc------cccCCHHHHHHHHhCCeEEEEEecCCcEEEEEeeccceecCCCCC Confidence 889999999999999999998854333 235689999999999999999887777777743 22 Q ss_pred CCCceeeehhhHHHHHHHHHHHHH-HHhcccCCHHHHHHHHHHHHHHHHHHHhcCCeeeeeEEEeccCCCHHHhhCCeEE Q lcl|Aclame:pro 277 VTGKFISFVGLEDAIARKLEAASQ-RAMSKQLTKSFMEQEIKKINLFMQDLVAAEIIPGGEVYLHPTLNTVERYKNGSWY 355 (388) Q Consensus 277 ~~~~~i~vrR~~~~i~~~i~~~~~-~~vfepn~~~~~~~i~~~i~~~L~~l~~~Gal~g~~v~~d~~~Nt~~~i~~G~~~ 355 (388) ..|++|+++|++|+|.+.++..+. ||+++||+...|.+|+..+..||.+||+.|+|.||... +-+.++.+++++ T Consensus 461 ~~~~ki~viRv~D~i~~dir~~~~~~yiGk~Nn~~~r~~v~~~i~~~L~~l~~~gaI~~~~~~-----dv~v~~~~d~~~ 535 (562) T protein:vir:63 461 PVKSEIGVGEANDFLVSELKISLDNEYIGTKIIDTSASLVKNFVQSFLDRKKLAKEIQDYSPE-----EVQVVIEGDVAR 535 (562) T ss_pred chhhhhhhhHHHHHHHHHHHHHHHhcCCccccChHHHHHHHHHHHHHHHHHHhCCcccCCCcc-----ceEEEecCCEEE Confidence 239999999999999999987765 89999999999999999999999999999999998532 112234567899 Q ss_pred EEEEEEecCcceeEEEEEEEcchHHHH Q lcl|Aclame:pro 356 IVIDYGRYSPNEHMIFHLNAVDRIVEE 382 (388) Q Consensus 356 ~~v~~~p~~pae~I~~~~~~~~~~~~~ 382 (388) +++.+.|+.|+|+|.+++.+..+-+++ T Consensus 536 v~~~v~pv~~mekIy~ti~~~~~~~~~ 562 (562) T protein:vir:63 536 ISLTVFPIRSMKKIEVSLVYRQQILTA 562 (562) T ss_pred EEEEEEEcccceEEEEEEEEeeeeecC Confidence 999999999999999999999999998 No 35 >protein:vir:80488 Length: 562 # NCBI annotation: Tsh # Family: family:all:2449 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468473;genbank:gi:157325048;genbank:GeneID:5601446 Probab=100.00 E-value=1.6e-38 Score=227.79 Aligned_cols=357 Identities=10% Similarity=0.065 Sum_probs=246.4 Q ss_pred CCCCC-----CcCCCeEEEEcCCCcccccccCcceeEEEeeccccccccccCcceeeccchhhhhhcccccccccchhhh Q lcl|Aclame:pro 1 MPVID-----QFEHNGISIETHEPPPPMGPPGDNVVAWVVTAPDKHADVAFSVPFRVANTADAQYLDSTGNELGTGWHAA 75 (388) Q Consensus 1 M~~~t-----~~~hGV~~~e~~~~~~~i~~v~tav~g~vgta~~~~~~~~~~~~v~v~s~~~~~~~~~~~~~~gtl~~a~ 75 (388) |+..- .-++|||+++..++.+++..+++++.+|+|.++.+ +.+++++++++.++...|.. |.|..++ T Consensus 1 ~~~~~~~~~~~~~pgv~~~~~~s~~~~~~~~~~~~~~~ig~a~~G----~~~~~~~~~~~~~~~~~f~~----g~l~~~i 72 (562) T protein:vir:80 1 MAIEIYPRKPVSRPHTEISVDTSGIGGSSSGSEKILCLVGSATGG----KPNAVYKVRNYSQAKSVFRS----GELLDAI 72 (562) T ss_pred CeeeeeCCCcccCCceEEEEecCCcccCCCCCCceEEEEEEeCCC----CcceeEEEccHHHHHHHhcC----CChHHHH Confidence 65421 23469999999999999999999999999999888 45788999999999888765 3344443 Q ss_pred hhh----hccccceEEEEeccccccccccccc---------------------------------------------c-- Q lcl|Aclame:pro 76 SET----LKKTSVPQYFIVVPEGADDAATMAN---------------------------------------------I-- 104 (388) Q Consensus 76 ~~~----~~~~~~~~~vv~~~~~~~~~~~~~~---------------------------------------------~-- 104 (388) ... +.++++.+|.+++........+... + T Consensus 73 ~~a~~~~~~~g~~~~~~~rv~~a~~a~~~~~~~~~~~~~~g~~~n~i~v~~~~~~~~~~~~~~v~~~~~~~~ev~~~~g~ 152 (562) T protein:vir:80 73 ERAWNPGEGTGAGDILAMRVEEAKEATFEAEGVKVSSTIYGADANDIQVALEDNTITGTKRLSIVFAKERVNQVYDNLGS 152 (562) T ss_pred HHhcccccccCceEEEEEEcCCCCcceEEecceEEEEeecccCCCceEEEEecCCCCCCcceEEEecCCcceEEeeccCc Confidence 333 3466666666554221110000000 0 Q ss_pred -------------------------------cccccc------hhhh-hhhHhhhhh----------------------- Q lcl|Aclame:pro 105 -------------------------------IGGIDP------TTGR-RTGIAALTE----------------------- 123 (388) Q Consensus 105 -------------------------------~~~~~~------~tg~-~tgl~a~~~----------------------- 123 (388) .++... ..|. .+...+... T Consensus 153 v~~i~y~g~~~~a~~~i~~~~~~~~a~~l~~~~g~~~v~~~~l~~g~~~~~~~l~~~i~~~~~~tAky~g~~~n~i~~~~ 232 (562) T protein:vir:80 153 IFSIKYKGTEASATFTVAVDPVTFKATKLTLKAGDKTVKEYDLGSGAYAETNVLISDINNLPDFEAKFFPIGDKNLTTDN 232 (562) T ss_pred eeeeeeccccccceeEEEecCccceEEEEEEecCCcceeEEEeCCCccchhhhhhhhhccccceEEEecccCCceeeecc Confidence 000000 0000 000000000 Q ss_pred ---------------------------------------------------------------------hhhhhhheecc Q lcl|Aclame:pro 124 ---------------------------------------------------------------------CTERPTLIGAP 134 (388) Q Consensus 124 ---------------------------------------------------------------------~~~~p~ll~ap 134 (388) ........+.+ T Consensus 233 ~d~~~~~~~kt~~~~v~~~~~d~~~~~~~n~~v~~~~~~~~~la~~~~~~LtGG~dG~~~~~~~dal~~Le~~~~~~i~~ 312 (562) T protein:vir:80 233 FDAQIDVDIKTKEAYVKAVGGDIEKQTAYNGYVEFEFDRSKEIANFPLTKLTGGDNGTIPESWADKFSYFANEGGYYLVP 312 (562) T ss_pred cccchhhhcccceeeeeehhhhhhhcccccceEEEEeccCccccccceeeeeCCCCCCccccHHHHHHHHHhCCcEEEEe Confidence 00000000000 Q ss_pred cccchhHHHHHHHHHhhhC-----ceEEEEecCCCcchhHHHHHHHhhhcccccceEEEEecceecccccccceeehhh- Q lcl|Aclame:pro 135 GFSQNKAVIDALASMAKRL-----KCRAVIDGPSGSTQDAIDLSGLLGGEGTGHDRVYMVDPMPAIYSRKAQGNIYVPP- 208 (388) Q Consensus 135 ~~~~~~~v~~~l~~~~~~~-----~~~~i~d~p~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~- 208 (388) .+.++++.+++.++++++ ++++++..+.+.+.+....+ ...+++++++.++|+....+. .+.....|+ T Consensus 313 -~t~d~ai~~~~~a~vkr~r~~g~~~~aVvg~~~~~~~~~~~~~----a~~~n~e~vv~v~~~~~~~~~-~~~~~~~~~~ 386 (562) T protein:vir:80 313 -LTSKQAVHAEALQFVRDCSYNGNPMRVFVGGGIGESMEQLFTR----AIGLQNERAGLIGFSGTVKMD-DGRSLKMPGY 386 (562) T ss_pred -cCCChHHHHHHHHHHHHHHhCCCeEEEEecCCCCCCHHHHHHH----hhhcCCCeEEEEecCeeEECC-CCceeeechh Confidence 123456677777776654 36778877766554333222 346789999999998765543 344555555 Q ss_pred --HHHHHHHHhccccccccccccccceeecccccccccCchhhhhhccccceEEEEEeCCCcEEEEc-ccc--------- Q lcl|Aclame:pro 209 --STIAMGAVAAVKPWESPGNQGVLIQDVARVIDYNILDKSTEGDLLNRNGVSYFARTSMGGFSLIG-NRT--------- 276 (388) Q Consensus 209 --s~~~aG~~a~~d~~~s~~n~p~~~~g~~~~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG-~rT--------- 276 (388) ++++||++|+.++++|+.|+++...++ ...+++.|++.|+++|+.+++...+++.+.|. -++ T Consensus 387 ~~aa~vAGl~Ag~~~~~S~T~~~i~~~~v------~~~lt~~e~~~li~~G~l~l~~~~~~~v~~~riv~~itT~t~~~~ 460 (562) T protein:vir:80 387 MFAAQVAGLTCGLEIGEAITFKNIAIETL------DTIYEGSQLDQLNESGIITAEFVRNRAVTNFRIVDDVTTFNDKTD 460 (562) T ss_pred HHHHHHHHHHhcCccccCccceeeccccc------cccCCHHHHHHHHhCCeEEEEEecCCcEEEEEeeccceeccCCCC Confidence 889999999999999999999853322 23468899999999999999988777777772 222 Q ss_pred CCCceeeehhhHHHHHHHHHHHH-HHHhcccCCHHHHHHHHHHHHHHHHHHHhcCCeeeeeEEEeccCCCHHHhhCCeEE Q lcl|Aclame:pro 277 VTGKFISFVGLEDAIARKLEAAS-QRAMSKQLTKSFMEQEIKKINLFMQDLVAAEIIPGGEVYLHPTLNTVERYKNGSWY 355 (388) Q Consensus 277 ~~~~~i~vrR~~~~i~~~i~~~~-~~~vfepn~~~~~~~i~~~i~~~L~~l~~~Gal~g~~v~~d~~~Nt~~~i~~G~~~ 355 (388) ..|++|+++|++|+|.+.+++.+ .||++|||+...|..++..+..||.+||+.|+|.||... +-+.++.+++++ T Consensus 461 ~~~~ki~viRv~D~i~~dir~~~~~~yIGk~Nn~~~r~~v~~~i~~~L~~l~~~gaI~~~~~~-----dv~v~~~~d~~~ 535 (562) T protein:vir:80 461 PVKSEIGVGEANDFLVSELKISLDNEYIGTKIIDTSASLVKNFVQSFLDRKKLAKEIQDYSPE-----EVQVVIEGDIAR 535 (562) T ss_pred chhhhhhhhHHHHHHHHHHHHHHHhcCCccccChHHHHHHHHHHHHHHHHHHhCCcccCCCcc-----ceEEEecCCEEE Confidence 23999999999999999998887 689999999999999999999999999999999998632 112234668899 Q ss_pred EEEEEEecCcceeEEEEEEEcchHHHH Q lcl|Aclame:pro 356 IVIDYGRYSPNEHMIFHLNAVDRIVEE 382 (388) Q Consensus 356 ~~v~~~p~~pae~I~~~~~~~~~~~~~ 382 (388) +++.+.|+.|+|+|.+++.+..+-+++ T Consensus 536 v~~~v~Pv~~mekIy~ti~~~~~~~~~ 562 (562) T protein:vir:80 536 ISLTVFPIRSMKKIEVSLVYRQQILTA 562 (562) T ss_pred EEEEEEEcccceEEEEEEEEEeeeecC Confidence 999999999999999999999999998 No 36 >protein:vir:80779 Length: 569 # NCBI annotation: putative tail sheath protein # Family: family:all:2449 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504132;genbank:gi:158079319;genbank:GeneID:5666428 Probab=100.00 E-value=2.8e-38 Score=226.43 Aligned_cols=354 Identities=14% Similarity=0.070 Sum_probs=244.7 Q ss_pred CCCC-----CCcCCCeEEEEcCCCcccccccCcceeEEEeeccccccccccCcceeeccchhhhhhcccccccccchhhh Q lcl|Aclame:pro 1 MPVI-----DQFEHNGISIETHEPPPPMGPPGDNVVAWVVTAPDKHADVAFSVPFRVANTADAQYLDSTGNELGTGWHAA 75 (388) Q Consensus 1 M~~~-----t~~~hGV~~~e~~~~~~~i~~v~tav~g~vgta~~~~~~~~~~~~v~v~s~~~~~~~~~~~~~~gtl~~a~ 75 (388) |+.. ..-++|||+++..+++.++..+++.+.+|+|.++.+ +.+++++++++.++...|..+ .|..+. T Consensus 1 ~~~~~~~~~~~~~Pgv~~~~~~~~~~~~~~~~~~~~~~ig~a~~G----~~~~~~~~~~~~~~~~~f~~g----~l~~a~ 72 (569) T protein:vir:80 1 MAVEQFPRKKVSRPHTEITVDTSGIGGSSSSSDKTLMLVGSAKGG----KPDTVYRFRNYQQAKQVLRSG----DLLDAI 72 (569) T ss_pred CeeeeecCCccccCceEEEEecCCCcCCCCCCceeEEEEEEeCCC----CCceeEEecCHHHHHHHhcCC----chhHHH Confidence 6542 122469999999999999999999999999999988 457889999999988776542 233332 Q ss_pred hhhh------ccccceEEEEecccccccccc------------------------------------------------- Q lcl|Aclame:pro 76 SETL------KKTSVPQYFIVVPEGADDAAT------------------------------------------------- 100 (388) Q Consensus 76 ~~~~------~~~~~~~~vv~~~~~~~~~~~------------------------------------------------- 100 (388) ...+ .++++.|+++++........+ T Consensus 73 ~~a~~~~~~~~~~~~~~~~~rv~~a~~a~~~~~~~~~~a~~~g~~~n~i~v~l~~~~~~~~~~~~v~~~~~~~~~~~~~i 152 (569) T protein:vir:80 73 ELAWNASDVNTASAGDILAVRVEDAKNATLTKGGLTFASTIYGVDANEIQVALEDNNLTHTKRLTVAFSKDGYKKVFDNL 152 (569) T ss_pred HhhccCccccccCceEEEEEEcCCCeeeeeeccceeeeeeeccCCCceEEEEEecCcCCcceeeEEeeecCCCccccccc Confidence 2221 122222332222100000000 Q ss_pred -----------------------------------------------------------------------cc------- Q lcl|Aclame:pro 101 -----------------------------------------------------------------------MA------- 102 (388) Q Consensus 101 -----------------------------------------------------------------------~~------- 102 (388) .. T Consensus 153 g~v~si~ytg~~~~a~~~~~~~~~~~~a~~l~~~~g~~~~~~~~v~~~~~~~~~~~~~~~lv~~~~~~~~f~a~~~~~~~ 232 (569) T protein:vir:80 153 GKIFSIQYKGSEAQANFTIAQDSISKKATTLTLNVGSEPESTTEVMKYELGQGVYSETNVLVSAINSLPDWEAKFFPIGD 232 (569) T ss_pred cceeeEEEeeccccceEEeecCcCcceeEEEEEEecCCcceeEEEEeeccCCccchhhhhhhhhcCCccCceEEEEecCC Confidence 00 Q ss_pred --------------------------------------------------------cccccccchhh--hhhhHhhhhhh Q lcl|Aclame:pro 103 --------------------------------------------------------NIIGGIDPTTG--RRTGIAALTEC 124 (388) Q Consensus 103 --------------------------------------------------------~~~~~~~~~tg--~~tgl~a~~~~ 124 (388) .+.|+.++... ....|+++.. T Consensus 233 ~~~~~~~~d~~~~~~~~t~~~~~~~~~~di~~~~~~~~~v~~~~~~~~~l~~~~~~~LtGG~dG~~~~~~~~~l~~le~- 311 (569) T protein:vir:80 233 KNLPTDALEAVTKVDVKTEAVFVGALAGDIAKQLEYNDYVTVAVDATKPVEDFELTNLTGGSDGTAPESWANKFPLLAN- 311 (569) T ss_pred CcceehhccchhheeccccceeeehhHHHHHHhhcCCceEEEEecCCcceeeecceeecCCCCCCccchHHHHHHHHhh- Confidence 00000000000 0001111110 Q ss_pred hhhhhheecccccchhHHHHHHHHHhhhC-----ceEEEEecCCCcchhHHHHHHHhhhcccccceEEEEecceeccccc Q lcl|Aclame:pro 125 TERPTLIGAPGFSQNKAVIDALASMAKRL-----KCRAVIDGPSGSTQDAIDLSGLLGGEGTGHDRVYMVDPMPAIYSRK 199 (388) Q Consensus 125 ~~~p~ll~ap~~~~~~~v~~~l~~~~~~~-----~~~~i~d~p~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~~~~ 199 (388) .....++|. +.++++.+++.++++++ .++++++.+.+.+.+....+ ..+++++++++++||....+. T Consensus 312 --~~~~~i~~~-t~d~av~~~l~a~vkr~r~~g~~~~aVvg~~~~~~~~~~~~~----a~~~n~e~vv~v~~~~~~~~~- 383 (569) T protein:vir:80 312 --EGGYYLVPL-TDKQAVHSEALAFVKDRTDNGDPMRIIVGGGTNETVEESITR----ATNLRDPRASLVGFSGTRKMD- 383 (569) T ss_pred --CCcEEEEec-CCChHHHHHHHHHHHHHHhCCCcEEEEecCCCCCCHHHHHHH----HhhcCCCeEEEEecCceeecC- Confidence 011111111 34567888888888765 36888888876654433222 347889999999999876653 Q ss_pred ccceeehhh---HHHHHHHHhccccccccccccccceeecccccccccCchhhhhhccccceEEEEEeCCCcEEEEcc-c Q lcl|Aclame:pro 200 AQGNIYVPP---STIAMGAVAAVKPWESPGNQGVLIQDVARVIDYNILDKSTEGDLLNRNGVSYFARTSMGGFSLIGN-R 275 (388) Q Consensus 200 ~~~~~~~p~---s~~~aG~~a~~d~~~s~~n~p~~~~g~~~~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~-r 275 (388) .+....+|+ ++++||+.|..++++|+.|+++++.++. ..++..|++.|+++|+.+++..++++.++|.. + T Consensus 384 ~g~~~~~~~~~~aa~vAG~~A~~~~~~S~T~k~i~~~~i~------~~lt~~e~~~li~~G~~~l~~~~~~~~~v~~~vn 457 (569) T protein:vir:80 384 DGRLLKLPGYMMASQIAGIASGLEVGEAITFKHFNVTSVD------RVFESSQLDMLNESGVISIEFVRNRTLTAFRVVQ 457 (569) T ss_pred CCcceeechhhHHHHHHHHHhcCccccCccceeecccccc------ccCCHHHHHHHHhCCeEEEEEecCceEEEEEEec Confidence 344445554 7899999999999999999998654443 24678999999999999999888776667743 2 Q ss_pred c---------CCCceeeehhhHHHHHHHHHHHH-HHHhcccCCHHHHHHHHHHHHHHHHHHHhcCCeeeeeEEEeccCCC Q lcl|Aclame:pro 276 T---------VTGKFISFVGLEDAIARKLEAAS-QRAMSKQLTKSFMEQEIKKINLFMQDLVAAEIIPGGEVYLHPTLNT 345 (388) Q Consensus 276 T---------~~~~~i~vrR~~~~i~~~i~~~~-~~~vfepn~~~~~~~i~~~i~~~L~~l~~~Gal~g~~v~~d~~~Nt 345 (388) + ..|++++++|++|+|.+.|+..+ .+|+++||+...|..++..++.||.+||++|+|.||... +- T Consensus 458 ~itT~t~~~~~~~~~i~viRv~D~i~~dir~~~~~~yiGk~nn~~~r~~v~~~i~~~L~~l~~~gaI~~~~~~-----dv 532 (569) T protein:vir:80 458 DVTTYNDKSDPVKNEMSVGEANDFLVSELKIELDNNFIGTKVIDTSASLIKNFIQSFLDNKKRAREIQDYTPE-----EV 532 (569) T ss_pred cceecCCCCCchhhhhhhhHHHHHHHHHHHHHHHhhcCcccCChhHHHHHHHHHHHHHHHHHhCCcccCCCcc-----ce Confidence 2 23999999999999999998876 689999999999999999999999999999999998532 11 Q ss_pred HHHhhCCeEEEEEEEEecCcceeEEEEEEEcchHHHH Q lcl|Aclame:pro 346 VERYKNGSWYIVIDYGRYSPNEHMIFHLNAVDRIVEE 382 (388) Q Consensus 346 ~~~i~~G~~~~~v~~~p~~pae~I~~~~~~~~~~~~~ 382 (388) +.++..+++++++.+.|+.|+|+|.+++++..+-+++ T Consensus 533 ~v~~~~d~~~v~~~v~Pv~~~ekI~~ti~~~~~~~~~ 569 (569) T protein:vir:80 533 QVVLEGDVASISMTVMPIRSLNKITVQLVYKQQILTA 569 (569) T ss_pred EEEecCCEEEEEEEEEEcccccEEEEEEEEeeeeecC Confidence 2235568999999999999999999999999999998 No 37 >protein:vir:103168 Length: 641 # NCBI annotation: gp122 # Family: family:all:661 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717789;genbank:gi:113200626;genbank:GeneID:4239177 Probab=100.00 E-value=3.3e-38 Score=226.05 Aligned_cols=274 Identities=14% Similarity=0.106 Sum_probs=189.8 Q ss_pred CCCCCCcCCCeEEEEcCCCcccccccCcceeEEEeeccccccccccCcceeeccchhhhhhcccccccccchhhhhhhhc Q lcl|Aclame:pro 1 MPVIDQFEHNGISIETHEPPPPMGPPGDNVVAWVVTAPDKHADVAFSVPFRVANTADAQYLDSTGNELGTGWHAASETLK 80 (388) Q Consensus 1 M~~~t~~~hGV~~~e~~~~~~~i~~v~tav~g~vgta~~~~~~~~~~~~v~v~s~~~~~~~~~~~~~~gtl~~a~~~~~~ 80 (388) ||--+...+|||++|+..+ ++|..+.|++.+|||.++.+ +.++|++|+|+.++...|+.......+..++..+|. T Consensus 1 ~~m~~~~sPGVyv~E~~~~-~~i~~v~tsvaafvG~~~~G----P~~~p~~v~s~~d~~~~FG~~~~~~~l~~av~~fF~ 75 (641) T protein:vir:10 1 MSVSNQLSPGVVIQERDLT-AVTTPIGLNVGVLAAPFTKG----PVEEIFEVSTERDLASVFGEPNDYNYEYWFTASQFL 75 (641) T ss_pred CCCccccCCceEEEEecCC-CcccccCCccceEEecccCC----CCCccEEecCHHHHHHHcCCcCCCcchHHHHHHHHH Confidence 5533334459999999876 68999999999999999876 678999999999999999988888889999999999 Q ss_pred cccceEEEEecccccccccc-----------------------------------ccc-c-------------------- Q lcl|Aclame:pro 81 KTSVPQYFIVVPEGADDAAT-----------------------------------MAN-I-------------------- 104 (388) Q Consensus 81 ~~~~~~~vv~~~~~~~~~~~-----------------------------------~~~-~-------------------- 104 (388) |++..|++||+......+.+ ..+ + T Consensus 76 ngG~~~~vvRv~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~A~~pG~~gn~l~v~i~~~~~~~~~~~~~~~~ 155 (641) T protein:vir:10 76 SYGGVLKAIRLNAASLKNSVDSGTAPLIKNLQEYETTYESSNSNTFKFASRDAGALGNSVGIFITDAGPDQIAVLPAPGT 155 (641) T ss_pred hcCCEEEEEEecCccccccccccchhhccccccccccccCcCccccEEEeccCCCcCCceEEEEEcCCCcceeeeecccc Confidence 99999999886321100000 000 0 Q ss_pred ------------------------------------------------------------------------ccccc--- Q lcl|Aclame:pro 105 ------------------------------------------------------------------------IGGID--- 109 (388) Q Consensus 105 ------------------------------------------------------------------------~~~~~--- 109 (388) .++.. T Consensus 156 ~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~~~~~~~~~~a~~~~~~i~~~~~g~~g~~ 235 (641) T protein:vir:10 156 GNEWEFVADEAVTAASGASAKVYRYSIRLTLTNVVGTFVPGGATTISISGSDESVDVLAWDAGNKYLEIALPAGGVTGIF 235 (641) T ss_pred cccceeccceeeeeccCcccccccccccccccceeeecccCCcceeEecccccccccccccCCcceeeeeecCCcceeee Confidence 00000 Q ss_pred --c--h-----------hhhh----------------------------------------------------------- Q lcl|Aclame:pro 110 --P--T-----------TGRR----------------------------------------------------------- 115 (388) Q Consensus 110 --~--~-----------tg~~----------------------------------------------------------- 115 (388) . . .+.. T Consensus 236 ~~~~~~t~gt~~~t~a~~g~~~~~~~~~~~~~ia~aat~ag~~g~~~~v~~~~v~d~~~a~~~~~g~~~~~va~~~gts~ 315 (641) T protein:vir:10 236 ADAQVVTQGTNTAAIASSGIERRLYIGKDSGSINFAATDAVVDTNATSATISSVRNEYAEREYLPGSKWVNVAARPGTSL 315 (641) T ss_pred eeeeeccCCccceeeecccchhhhhhccccccceeeeecccccccceeeEeeeeeeeecccccccccccccccccchhhh Confidence 0 0 0000 Q ss_pred -------------------------------------------------------------------------------- Q lcl|Aclame:pro 116 -------------------------------------------------------------------------------- 115 (388) Q Consensus 116 -------------------------------------------------------------------------------- 115 (388) T Consensus 316 ~a~~~g~~~D~~~~lv~d~~~~~~g~~g~v~e~~~~~s~~~~~~~~~~~~~~~~~~~~~~s~~v~~~~~~~~~~~~~~~~ 395 (641) T protein:vir:10 316 YANSVGGVNDELHVLVIDVDGKITGNPGSVLERFIGVSKASDAKTSIGEVNYYKEVIKQQSAYVYWGSHETAPFLGTAAN 395 (641) T ss_pred hhhhcCCcccceEEEEEeecceeeccccceeeeeecccccCCcccccccceeeeeeeccccceEEEeccccccccccccc Confidence Q ss_pred -------------------------------------------------------------------hhHhhhhhhhh-h Q lcl|Aclame:pro 116 -------------------------------------------------------------------TGIAALTECTE-R 127 (388) Q Consensus 116 -------------------------------------------------------------------tgl~a~~~~~~-~ 127 (388) +|++++.+... . T Consensus 396 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~gG~d~~~~~~~~~~~~~~~~tg~~~~~~~e~~~ 475 (641) T protein:vir:10 396 AAAGDWGASALNRRYNLLRSTAGTTSFPAGAVTVGSKNNATHYYRLANGADYSASGALYNLSNVDIATAYELIEDPESQV 475 (641) T ss_pred ccccccccccccccccccccccccccccccccccCCCCcceeEEEeecCcccccccccccccchhHHHHHHHhhhhhhhc Confidence 00000000000 0 Q ss_pred hhheecccc----cchhHHHHHHHHHhhhCc-eEEEEecCCCcch-------hHHHHHHHhhhcccccceEEEEecceec Q lcl|Aclame:pro 128 PTLIGAPGF----SQNKAVIDALASMAKRLK-CRAVIDGPSGSTQ-------DAIDLSGLLGGEGTGHDRVYMVDPMPAI 195 (388) Q Consensus 128 p~ll~ap~~----~~~~~v~~~l~~~~~~~~-~~~i~d~p~~~~~-------~~~~~~~~~~~~~~~s~~~~~~~p~~~~ 195 (388) ..++++|+. ...++++.++.++|++++ ||+|+|+|.+... .......+.. .+.+|.|+++||||+++ T Consensus 476 i~~l~~~~~~~~~~~~~~v~~~~i~~ce~~~d~~ailD~p~~~~~~~~~~~~~~~~~~~~~~-~~~~s~yaa~y~P~~~v 554 (641) T protein:vir:10 476 IDYVLSGPAGADEAAAIAKATTITTIVESRKDCMAFLSPLRSDVIGVSNTTTVTENLVNYFN-QLPSSNYVVFDSGYKYI 554 (641) T ss_pred cceeeecCCCCCcchhHHHHHHHHHHHHhcCCEEEEEcCCcccccCCCchhhHHHHHHHHHh-hcCCCceEEEEeceeEe Confidence 001111111 122457778889998776 9999999975321 1122222222 25689999999999999 Q ss_pred ccccccceeehhhHHHHHHHHhccccccccccccccce--eecccccccccCchhhhhhccccceEEEEEeCCCcEEEEc Q lcl|Aclame:pro 196 YSRKAQGNIYVPPSTIAMGAVAAVKPWESPGNQGVLIQ--DVARVIDYNILDKSTEGDLLNRNGVSYFARTSMGGFSLIG 273 (388) Q Consensus 196 ~~~~~~~~~~~p~s~~~aG~~a~~d~~~s~~n~p~~~~--g~~~~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG 273 (388) +|+.+++.+++||||++||+|||+|..+|+|++|+|.. .+...+.+.+..++.|++.||++||||||.|||+|++- T Consensus 555 ~dp~~~~~~~vPpsG~iAGv~ArtD~~rGvwkAPAn~~~~~i~gv~~l~~~~~~~e~~~Lnp~gIN~ir~fpg~G~v~-- 632 (641) T protein:vir:10 555 YDKYNDVYRYVPCNGDIAGLILETGLEEEPWFSPAGFQRGVLRNAVKLAYSPNKTQRDRLYANRINPVVSFPGHAMIN-- 632 (641) T ss_pred ecccCCceeEecCCHHHHHHHHhhhccCCceECcCCcccceeeeeeeeeEecChhHHhhhhhcccceEEecCCceeec-- Confidence 99999999999999999999999999999999999974 46778888999999999999999999999999999862 Q ss_pred cccCCCceeeehhh Q lcl|Aclame:pro 274 NRTVTGKFISFVGL 287 (388) Q Consensus 274 ~rT~~~~~i~vrR~ 287 (388) | ...+. ..+ T Consensus 633 ~-~~~~~----~~~ 641 (641) T protein:vir:10 633 N-NIAFH----TKL 641 (641) T ss_pred c-eeeee----ecC Confidence 2 11111 000 No 38 >protein:vir:102819 Length: 648 # NCBI annotation: tail sheath protein # Family: family:all:2449 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874082;genbank:gi:118197689;genbank:GeneID:4496011 Probab=100.00 E-value=1.6e-34 Score=205.80 Aligned_cols=355 Identities=11% Similarity=0.043 Sum_probs=237.0 Q ss_pred CCCCCCcC------CCeEEEEcCCCcccccccCcceeEEEeeccccccccccCcceeeccchhhhhhcccccccccchhh Q lcl|Aclame:pro 1 MPVIDQFE------HNGISIETHEPPPPMGPPGDNVVAWVVTAPDKHADVAFSVPFRVANTADAQYLDSTGNELGTGWHA 74 (388) Q Consensus 1 M~~~t~~~------hGV~~~e~~~~~~~i~~v~tav~g~vgta~~~~~~~~~~~~v~v~s~~~~~~~~~~~~~~gtl~~a 74 (388) |+...-|- +|||++|+.+++++|..+.|++.+|||.+..+ +.++|++++|+.++...|+. +.|.++ T Consensus 1 ma~~~yf~~~~~~~PGVyvee~~sg~~~i~gv~tsva~fvG~a~~G----p~~~p~~v~s~~~~~~~fgg----g~l~~a 72 (648) T protein:vir:10 1 MAISVYFDGKLIKQLGAYVKTDLSAVKQINGVGTGIVALLGLAEGG----ETYKPYRLTSFAEAVSIFKG----GPLLEH 72 (648) T ss_pred CeeeeeeCCCCccCCceEEEEeccccccccCCCCceEEEEEeeCCC----CCceeEEecCHHHHHHHhcC----ccHHHH Confidence 87643233 79999999999999999999999999999877 57899999999999988763 568999 Q ss_pred hhhhhccccceEEEEecccccccccccc---------------------------------------------cccccc- Q lcl|Aclame:pro 75 ASETLKKTSVPQYFIVVPEGADDAATMA---------------------------------------------NIIGGI- 108 (388) Q Consensus 75 ~~~~~~~~~~~~~vv~~~~~~~~~~~~~---------------------------------------------~~~~~~- 108 (388) +..+|.+++..|+++++........+.. +++-.. T Consensus 73 v~~~F~nGg~~~~~vRv~~~~~a~~~~~~~~~~a~~~g~~gn~i~~~v~~~~~~~~~~~~l~v~~~~~~~~~d~~v~~i~ 152 (648) T protein:vir:10 73 IKAAFIGGAGEVVAVRIGNPTTASVSIPVAQNTSDTSPANLNFVSYEASTRSNQIYVSFDLDENFTSANEADDTIIFTIY 152 (648) T ss_pred HHHHHhCCCcEEEEEEcCCCcccceecceeEEeecccCCCCCceEEEEEEcCCCcCceeEEEEEecCCCcccceeEEEec Confidence 9999999999999988643221111100 000000 Q ss_pred ------cch----hh-----------hh-----------------------hh--------------------------- Q lcl|Aclame:pro 109 ------DPT----TG-----------RR-----------------------TG--------------------------- 117 (388) Q Consensus 109 ------~~~----tg-----------~~-----------------------tg--------------------------- 117 (388) +.. ++ .. +. T Consensus 153 ~~~~~y~gt~~~~t~~v~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~s~~~~~d 232 (648) T protein:vir:10 153 QKHPDFSVTRETFTFPRKFTTPTVLVKRGSTLFFVDRSIVNAALAAGPAFQTALINLLKEQLQPTDVVQIFDASDTNPVD 232 (648) T ss_pred cCCCcccccceeccccccccccccccccccceeecCccchhhhhccCccchhhhhhchhhhhhhhhhheecccccccccc Confidence 000 00 00 00 Q ss_pred -------Hhhhh-----------------hhh------------------------------------------------ Q lcl|Aclame:pro 118 -------IAALT-----------------ECT------------------------------------------------ 125 (388) Q Consensus 118 -------l~a~~-----------------~~~------------------------------------------------ 125 (388) +.++. +.. T Consensus 233 ~~~~~~~~~a~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~~tp~~~~~~~~~~~~~~~~~~~~~v~~~~~~~l~~~ 312 (648) T protein:vir:10 233 IPLGLFVYEVLYGGLFGFTKSRLVKTSFGTVDDLLSNPLLFNLSATPFFDGSDYQDYTSLSDPANWFAKDAYTINHLVDT 312 (648) T ss_pred cccccccccccchhhhcCCcchhhhhhhccccccccccceecccccccccccceeeeeccccccceeeeeccchhhcccc Confidence 00000 000 Q ss_pred -hhhhhee-------------cc-------------cc-------------------------------cchhHHHHHHH Q lcl|Aclame:pro 126 -ERPTLIG-------------AP-------------GF-------------------------------SQNKAVIDALA 147 (388) Q Consensus 126 -~~p~ll~-------------ap-------------~~-------------------------------~~~~~v~~~l~ 147 (388) ..|.+.. .| +| +..+++++.+. T Consensus 313 ~~~p~~~~~~~t~L~GGtdG~~p~s~~~~~~~~~~~d~~d~l~~~~~~~~~~ivp~~~~~~~~~~~~~lt~~q~i~a~a~ 392 (648) T protein:vir:10 313 TINPHILATRIFSLSGGTNGDDGTGYYQTAVSNYINIWSQGLATLEEEEVNFVIPAYKFTNVTQLNDRLTIFKGIASTFL 392 (648) T ss_pred cccCcccccccceecccccCCCcccccccccccchhhHHHHhhhccCCCceEEEeecccccccccccccCCccchHHHHH Confidence 0000000 00 00 11245666666 Q ss_pred HHhhhC----------ceEEEEecCCCcchhHHHHHHHhhhcccccce----------EEE--Eecceecccccccceee Q lcl|Aclame:pro 148 SMAKRL----------KCRAVIDGPSGSTQDAIDLSGLLGGEGTGHDR----------VYM--VDPMPAIYSRKAQGNIY 205 (388) Q Consensus 148 ~~~~~~----------~~~~i~d~p~~~~~~~~~~~~~~~~~~~~s~~----------~~~--~~p~~~~~~~~~~~~~~ 205 (388) ++++.+ ..++++-++.+.+....+.... ...++..+ ++. |+.+.. ...+...+ T Consensus 393 shv~~~s~~~~~~~r~~~~~~vg~~~~es~~~se~~~~--~~~~~~~~a~~~~~d~~~~~~~~~~~~~~---~~~G~~~~ 467 (648) T protein:vir:10 393 SHVQTMSQVNRRKARVGVFGLPAPSPNESVTASEYLYN--RNILNTISAMFGGTDRAQAVVFPFYSNVF---NDEGKVEL 467 (648) T ss_pred HHHHHhhhccccccccCeEEEeCCCCchhHHHHHHHhh--hhcccccceeeeecCCceEEeecccceeE---CCCCcEEe Confidence 665532 1244443333333211111111 11111111 111 122222 22355666 Q ss_pred hhh---HHHHHHHHhccccccccccccccceeecccccccccCchhhhhhccccceEEEEEeCCC----------cEEEE Q lcl|Aclame:pro 206 VPP---STIAMGAVAAVKPWESPGNQGVLIQDVARVIDYNILDKSTEGDLLNRNGVSYFARTSMG----------GFSLI 272 (388) Q Consensus 206 ~p~---s~~~aG~~a~~d~~~s~~n~p~~~~g~~~~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~----------G~~~w 272 (388) +|| .+++||++++..++.|+-+||+.+.++.+. ...++.|.+.|+++||+||....++ |+..| T Consensus 468 ~p~~~~Aa~VAGl~a~l~~~~s~T~k~i~~~~id~~----~~~t~~qld~L~~~Gv~~ie~~~~~~~~~~~rvv~gITT~ 543 (648) T protein:vir:10 468 LGGEFFASYVAGMHANREPQDSITFLPISGIGAEPL----YNWTYTQKDDLISNRVLFVEKVKTSFGGIVYRIHHNPTTW 543 (648) T ss_pred cchhhHHHHHHhhhhccccccCcccceeeccccccc----cCCCHHHHHHHhcCCcEEEEEecCCcceeeEEEeccceee Confidence 888 788999999999999999999976554432 3467899999999999999876553 56668 Q ss_pred cccc-CCCceeeehhhHHHHHHHHHH-HHHHHhcccCCHHHHHHHHHHHHHHHHHHHhcCCeeeee---EEEeccCCCHH Q lcl|Aclame:pro 273 GNRT-VTGKFISFVGLEDAIARKLEA-ASQRAMSKQLTKSFMEQEIKKINLFMQDLVAAEIIPGGE---VYLHPTLNTVE 347 (388) Q Consensus 273 G~rT-~~~~~i~vrR~~~~i~~~i~~-~~~~~vfepn~~~~~~~i~~~i~~~L~~l~~~Gal~g~~---v~~d~~~Nt~~ 347 (388) +... ..|+.|+++|++|++.+.+++ ...+|+++||+...|.++++.+.+||.++++.++|.+|. |.++. T Consensus 544 ~~~~~~~~~eisv~ri~D~l~~~vr~~l~~~fIG~~n~~~~~~~ik~~i~~~L~~~~~~~~I~~y~~~~v~~~~------ 617 (648) T protein:vir:10 544 LGPVTQGFQEFVLRRIDDFLQSYVYKNLQEQFIGRKSYGRKTENDIKVYTEALLSNLVGKQIVAYKDVKVTSNE------ 617 (648) T ss_pred cCCCCcceeeeeeeehhhHHHHHHHHHHhhhcCcccccHHHHHHHHHHHHHHHhhHhhcCcccCcccceEEEEe------ Confidence 8743 459999999999999999987 556999999999999999999999999999999999974 66653 Q ss_pred HhhCCeEEEEEEEEecCcceeEEEEEEEcchHH Q lcl|Aclame:pro 348 RYKNGSWYIVIDYGRYSPNEHMIFHLNAVDRIV 380 (388) Q Consensus 348 ~i~~G~~~~~v~~~p~~pae~I~~~~~~~~~~~ 380 (388) +++++++++.+.|++|++||.++++++.+.- T Consensus 618 --~~~vv~V~~~v~Pv~~i~~I~vti~it~~~~ 648 (648) T protein:vir:10 618 --DKTVYYVEFFYQPVTEIKFILVTMKVTFDLE 648 (648) T ss_pred --cCCEEEEEEEEEecceeeEEEEEEEEEeccC Confidence 4589999999999999999998888777643 No 39 >protein:vir:95741 Length: 587 # NCBI annotation: ORF011 # Family: family:all:2449 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240910;genbank:gi:66394960;genbank:GeneID:5132682 Probab=100.00 E-value=1.8e-34 Score=205.63 Aligned_cols=354 Identities=12% Similarity=0.057 Sum_probs=241.9 Q ss_pred CCCCC-----CcCCCeEEEEcCCCcccccccCcceeEEEeeccccccccccCcceeeccchhhhhhcccccccccchhhh Q lcl|Aclame:pro 1 MPVID-----QFEHNGISIETHEPPPPMGPPGDNVVAWVVTAPDKHADVAFSVPFRVANTADAQYLDSTGNELGTGWHAA 75 (388) Q Consensus 1 M~~~t-----~~~hGV~~~e~~~~~~~i~~v~tav~g~vgta~~~~~~~~~~~~v~v~s~~~~~~~~~~~~~~gtl~~a~ 75 (388) |+.-. .-++|||+++..++..++..+++++.+|+|.++.+ +.++++.++++.++...++.+ .|.+++ T Consensus 1 ~a~~~~~~~~~~~pgv~~~~~~~~~~~~~~~~~~~~~~~g~a~~G----~~~~~~~~~~~~~~~~~~~~g----~l~~~~ 72 (587) T protein:vir:95 1 MAVEPFPRRPITRPHASIEVDTSGIGGSAGSSEKVFCLIGQAEGG----EPNTVYELRNYSQAKRLFRSG----ELLDAI 72 (587) T ss_pred CcccccCCcccccCceEEEEecCCccccCCCCCceEEEEEEeCCC----CCceeEEeccHHHHHHHhcCc----chHHHH Confidence 76533 23569999999999999999999999999999988 457889999999998887653 344444 Q ss_pred hhhh----ccccceEEEEeccccccccccccc------------------------------------------------ Q lcl|Aclame:pro 76 SETL----KKTSVPQYFIVVPEGADDAATMAN------------------------------------------------ 103 (388) Q Consensus 76 ~~~~----~~~~~~~~vv~~~~~~~~~~~~~~------------------------------------------------ 103 (388) ...+ .+++..++.+++........+..+ T Consensus 73 ~~a~~~~~~~g~~~~~~~rv~~~~~a~~~~~~l~~~a~~~G~~gN~i~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 152 (587) T protein:vir:95 73 ELAWGSNPNYTAGRILAMRIEDAKPASAEIGGLKITSKIYGNVANNIQVGLEKNTLSDSLRLRVIFQDDRFNEVYDNIGN 152 (587) T ss_pred HHHhccccCCCceEEEEEEcCCCceeEEEecCeEEEEecccccccceEEEEecCCCCCceeEEEEEecccceeeeeeccc Confidence 3333 344444444332111000000000 Q ss_pred -------------------------------------------------------------------------------- Q lcl|Aclame:pro 104 -------------------------------------------------------------------------------- 103 (388) Q Consensus 104 -------------------------------------------------------------------------------- 103 (388) T Consensus 153 v~si~y~g~~~~~~~~v~~~~~t~~a~~~~l~~g~~~v~~yrL~~g~~~~~~~~~~~in~~~~~tAky~g~~~~~i~~~~ 232 (587) T protein:vir:95 153 IFTIKYKGEEANATFSVEHDEETQKASRLVLKVGDQEVKSYDLTGGAYDYTNAIITDINQLPDFEAKLSPFGDKNLESSK 232 (587) T ss_pred eeeeeeeccccccceeeeecccceeeeeeeeecCCceEEEEEecCCchHHHHHHHHhhccccceEEEEecccCceeEEee Confidence Q ss_pred ---------------------------------------------------------------------------ccccc Q lcl|Aclame:pro 104 ---------------------------------------------------------------------------IIGGI 108 (388) Q Consensus 104 ---------------------------------------------------------------------------~~~~~ 108 (388) +.|+. T Consensus 233 ~~~~~~~~v~~~~~~v~a~~~d~~~~~~~~~~v~~~~~~g~~~~~~~~~~~~~~~~a~~~~~~~~~~~a~~~~t~LtGG~ 312 (587) T protein:vir:95 233 LDKIENANIKDKAVYVKAVFGDLEKQTAYNGIVSFEQLNAEGEVPSNVEVEAGEESATVTATSPIKTIEPFELTKLKGGT 312 (587) T ss_pred cCcccccceehhhhhhhhhhcceeeeeeceeeeeeecccccceeccchhhhhcccchheeccccccceeccceeeeecCC Confidence 00000 Q ss_pred cchhh--hhhhHhhhhhhhhhhhheecccccchhHHHHHHHHHhhhC-----ceEEEEecCCCcchhHHHHHHHhhhccc Q lcl|Aclame:pro 109 DPTTG--RRTGIAALTECTERPTLIGAPGFSQNKAVIDALASMAKRL-----KCRAVIDGPSGSTQDAIDLSGLLGGEGT 181 (388) Q Consensus 109 ~~~tg--~~tgl~a~~~~~~~p~ll~ap~~~~~~~v~~~l~~~~~~~-----~~~~i~d~p~~~~~~~~~~~~~~~~~~~ 181 (388) ++... -...|.++.... ...++|. ++++++.+++.++++++ ..++++..+.+.+......+ ...+ T Consensus 313 dG~~~~~y~~~l~ale~~~---~~~i~~~-t~d~~v~a~l~a~vk~~~~~g~~~~aVvg~~~~~~~~~~~~~----a~~~ 384 (587) T protein:vir:95 313 NGEPPATWADKLDKFAHEG---GYYIVPL-SSKQSVHAEVASFVKERSDAGEPMRAIVGGGFNESKEQLFGR----QESL 384 (587) T ss_pred CCCCcccHHHHHHHHHhCC---cEEEEec-CCCHHHHHHHHHHHHHHHhCCCcEEEEEcCCCCCCHHHHHHH----Hhhc Confidence 00000 000000000000 0011111 23466777777777654 36788877765554333322 3467 Q ss_pred ccceEEEEecceecccccccceeehhh---HHHHHHHHhccccccccccccccceeecccccccccCchhhhhhccccce Q lcl|Aclame:pro 182 GHDRVYMVDPMPAIYSRKAQGNIYVPP---STIAMGAVAAVKPWESPGNQGVLIQDVARVIDYNILDKSTEGDLLNRNGV 258 (388) Q Consensus 182 ~s~~~~~~~p~~~~~~~~~~~~~~~p~---s~~~aG~~a~~d~~~s~~n~p~~~~g~~~~~~~~~~~~~~~~~~Ln~~gI 258 (388) ++.+++.++|+..... ..+....+|| ++++||++|..|+.+|+.|+++...++. ..++..|++.|+++|+ T Consensus 385 n~ervi~v~~~~~~~~-~dg~~~~~~~~~~aa~vAGl~Ag~~~~~SlT~~~i~~~~v~------~~~t~~e~e~ai~~Gv 457 (587) T protein:vir:95 385 SNPRVSLVANSGTFVM-DDGRKNHVPAYMVAVALGGLASGLEIGESITFKPLRVSSLD------QIYESIDLDELNENGI 457 (587) T ss_pred CCCcEEEecccceEec-CCCceeeechHHHHHHHHHHHhcCchhcCccceeeeccccc------ccCCHHHHHHHHhCCe Confidence 8999999998865332 2344556666 7899999999999999999998643332 3568899999999999 Q ss_pred EEEEEeCCCcEEE----EccccC------CCceeeehhhHHHHHHHHHHHH-HHHhcccCCHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 259 SYFARTSMGGFSL----IGNRTV------TGKFISFVGLEDAIARKLEAAS-QRAMSKQLTKSFMEQEIKKINLFMQDLV 327 (388) Q Consensus 259 n~i~~~~~~G~~~----wG~rT~------~~~~i~vrR~~~~i~~~i~~~~-~~~vfepn~~~~~~~i~~~i~~~L~~l~ 327 (388) .++...++++-.. .|-.|. .|++++++|++|+|.+.+++.+ .+|++|||+...|..++..+..||.+|| T Consensus 458 l~l~~~~~~~~~~vriv~~itT~t~~~d~~~~~i~viRv~D~i~~dir~~~~~~~iGk~nn~~~r~~v~~~i~~~L~~l~ 537 (587) T protein:vir:95 458 ISIEFVRNRTNTFFRIVDDVTTFNDKSDPVKAEMAVGEANDFLVSELKVQLEDQFIGTRTINTSASIIKDFIQSYLGRKK 537 (587) T ss_pred EEEEEecCCcceEEEEeecceeccCCCCcchhhhhhhhhHHHHHHHHHHHHHhhCCccccchHHHHHHHHHHHHHHHHHH Confidence 9998776654222 343443 3999999999999999999886 6999999999999999999999999999 Q ss_pred hcCCeeeeeEEEeccCCCHHHhhCCeEEEEEEEEecCcceeEEEEEEEcchHHHH Q lcl|Aclame:pro 328 AAEIIPGGEVYLHPTLNTVERYKNGSWYIVIDYGRYSPNEHMIFHLNAVDRIVEE 382 (388) Q Consensus 328 ~~Gal~g~~v~~d~~~Nt~~~i~~G~~~~~v~~~p~~pae~I~~~~~~~~~~~~~ 382 (388) +.|+|.+|... +.+-++...++++++.+.|+.|+|+|.+++++..+-+++ T Consensus 538 ~~gaI~~~~~~-----dv~v~~~~d~~~v~~~v~Pv~~mekI~vt~~~~~~~~~~ 587 (587) T protein:vir:95 538 RDNEIQDFPAE-----DVQVIVEGNEARISMTVYPIRSFKKISVSLVYKQQTLQA 587 (587) T ss_pred hCCcccCCCcc-----ceEEEecCCEEEEEEEEEEcccceEEEEEEEEeeeeecC Confidence 99999998542 112223456899999999999999999999999999997 No 40 >protein:vir:96586 Length: 587 # NCBI annotation: ORF011 # Family: family:all:2449 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238553;genbank:gi:66391266;genbank:GeneID:5130368 Probab=100.00 E-value=1e-33 Score=201.43 Aligned_cols=354 Identities=12% Similarity=0.068 Sum_probs=239.1 Q ss_pred CCCCC----Cc-CCCeEEEEcCCCcccccccCcceeEEEeeccccccccccCcceeeccchhhhhhcccccccccchhhh Q lcl|Aclame:pro 1 MPVID----QF-EHNGISIETHEPPPPMGPPGDNVVAWVVTAPDKHADVAFSVPFRVANTADAQYLDSTGNELGTGWHAA 75 (388) Q Consensus 1 M~~~t----~~-~hGV~~~e~~~~~~~i~~v~tav~g~vgta~~~~~~~~~~~~v~v~s~~~~~~~~~~~~~~gtl~~a~ 75 (388) |+..- .| ++|||+++..++..++...++++.+|+|++..+ +.+++++++++.++.+.++.+ .|.+++ T Consensus 1 ~~~~~~~~~~~~~Pgv~~~~~~~~~~~~~~~~~~~~~~iG~a~~g----~~~~~~~~~~~~~~~~~~g~G----~l~~ai 72 (587) T protein:vir:96 1 MAKDIFPRRPIQRPHASIEVDSSGIGGSASNSEKILCLIGKAEGG----EPNTVYQVRNYAQAKSVFRSG----ELLDAI 72 (587) T ss_pred CeeeeeCCCcccCCceEEEEecCCccCCCCCCCceEEEEEEecCC----CCceeEEEcChHHHHHhhcCC----cHHHHH Confidence 66532 22 459999999999999999999999999999888 447889999999988776542 233433 Q ss_pred hhhh----ccccceEEEEeccccccccccc-------------------------------------------------- Q lcl|Aclame:pro 76 SETL----KKTSVPQYFIVVPEGADDAATM-------------------------------------------------- 101 (388) Q Consensus 76 ~~~~----~~~~~~~~vv~~~~~~~~~~~~-------------------------------------------------- 101 (388) ...+ .+++..++.+++.+......+. T Consensus 73 ~~a~~~~~~~g~~~~~a~rv~~~~~a~~~~~~~~~~~~~~g~~~n~i~v~~~~~~~~~~~~~~~~~~~~~~~~~~~n~G~ 152 (587) T protein:vir:96 73 ELAWGSNPQYTAGKILAMRVEDAKASQLEKGGLRVTSKIFGSVSNDIQVALEKNTITDSLRLRVVFQKDNYQEVFDNLGN 152 (587) T ss_pred HHHhccCcCCCceEEEEEecCCCccceeecccccccccccCCCCceEEEEEEeccCCCccceEEEEecCCceeeccccCc Confidence 2222 3344444433321100000000 Q ss_pred -----------------------------------------------------------------cccc----------- Q lcl|Aclame:pro 102 -----------------------------------------------------------------ANII----------- 105 (388) Q Consensus 102 -----------------------------------------------------------------~~~~----------- 105 (388) +.+. T Consensus 153 v~~i~y~g~~~~a~~~~~~~~~~~~A~~l~l~gg~~~v~~yrl~~g~~~~~~~~~~~~~~~~~~tAky~g~~~n~~~v~v 232 (587) T protein:vir:96 153 IFSINYKGEGEKATFSVEKDKETQEAKRLVLKVDEKEVKAYELNGGAYSFTNEIITDINELPDFEAKLSPFGDKNLESRK 232 (587) T ss_pred eEEEEecccccceeEeeccCcccceeeeeEEEecCceEEEEEeCCCchhhhhhhhhhhccccceEEEeecccCceeEEEe Confidence 0000 Q ss_pred -----------------------------------------------------------------------------ccc Q lcl|Aclame:pro 106 -----------------------------------------------------------------------------GGI 108 (388) Q Consensus 106 -----------------------------------------------------------------------------~~~ 108 (388) |+. T Consensus 233 ~d~~~~~~~k~~~~y~~t~~~di~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~aLtGG~ 312 (587) T protein:vir:96 233 LDEATDVDIKGKAVYVKAVFGDIENQTQYNQYVKFEQLPEQASEPSDVEVHAETESATVTATSKPKAIEPFELTKLSGGT 312 (587) T ss_pred eccccccccceEEEeehhhhhhhhhhhccccceeeccccchhhhhhcccccccccceeeeecccccccccccceeeecCC Confidence 000 Q ss_pred cchhh--hhhhHhhhhhhhhhhhheecccccchhHHHHHHHHHhhhC-----ceEEEEecCCCcchhHHHHHHHhhhccc Q lcl|Aclame:pro 109 DPTTG--RRTGIAALTECTERPTLIGAPGFSQNKAVIDALASMAKRL-----KCRAVIDGPSGSTQDAIDLSGLLGGEGT 181 (388) Q Consensus 109 ~~~tg--~~tgl~a~~~~~~~p~ll~ap~~~~~~~v~~~l~~~~~~~-----~~~~i~d~p~~~~~~~~~~~~~~~~~~~ 181 (388) ++... -...|.++.... -..++ +. +.++++.+++.++++++ .+++++..+.+.+......+ ...+ T Consensus 313 dG~~~~~y~~~l~ale~~~--~~~i~-~~-t~d~ai~~~l~a~vk~~r~~gk~~~aVlg~~~~~~~~~~~~~----a~~~ 384 (587) T protein:vir:96 313 NGEPPTSWSAKLEKFKNEG--GYYIV-PL-TDRQSVHSEVATFVKNRSDAGEPMRAIVGGGTSETKEKLFGR----QAIL 384 (587) T ss_pred CCCCcccHHHHHHHHhhCC--cEEEE-ec-CCCHHHHHHHHHHHHHHHhCCCeEEEEecCCCCCCHHHHHHH----Hhhc Confidence 00000 000000000000 00111 11 23456777777777654 36777877765543333222 3467 Q ss_pred ccceEEEEecceecccccccceeehh---hHHHHHHHHhccccccccccccccceeecccccccccCchhhhhhccccce Q lcl|Aclame:pro 182 GHDRVYMVDPMPAIYSRKAQGNIYVP---PSTIAMGAVAAVKPWESPGNQGVLIQDVARVIDYNILDKSTEGDLLNRNGV 258 (388) Q Consensus 182 ~s~~~~~~~p~~~~~~~~~~~~~~~p---~s~~~aG~~a~~d~~~s~~n~p~~~~g~~~~~~~~~~~~~~~~~~Ln~~gI 258 (388) ++++++.++++....+.. +....+| .++++||++|..++.+|+-|+++...++. ..++..|++.|.++|+ T Consensus 385 n~e~vi~v~~~~~~~~~~-~~~~~~~~~~~aa~vAG~~Ag~~~~~S~T~~~~~~~~v~------~~~t~~e~~~~i~~G~ 457 (587) T protein:vir:96 385 NNPRVALVANSGKFVMGN-GRILQAPAYMVASAVAGLVSGLDIGESITFKPLFVNSLD------KVYESEELDELNENGI 457 (587) T ss_pred CCCcEEEEecceEEecCC-CceeeechhhHHHHHHHHHhcCccccCccceeeeccccc------ccCCHHHHHHHHhCCe Confidence 899999999987765543 3333333 37899999999999999999988543332 3468889999999999 Q ss_pred EEEEEeCCCcEEEEcc-ccC---------CCceeeehhhHHHHHHHHHHHH-HHHhcccCCHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 259 SYFARTSMGGFSLIGN-RTV---------TGKFISFVGLEDAIARKLEAAS-QRAMSKQLTKSFMEQEIKKINLFMQDLV 327 (388) Q Consensus 259 n~i~~~~~~G~~~wG~-rT~---------~~~~i~vrR~~~~i~~~i~~~~-~~~vfepn~~~~~~~i~~~i~~~L~~l~ 327 (388) .+++...+++..+|.. +++ .|++++++|++|+|.+.|++.+ .+|++|||+...|..++..+..||.+|| T Consensus 458 ~~l~~~~~~~~~v~~~vnsitT~t~~~~~~~~~i~virv~D~i~~di~~~~~~~yiGk~nn~~~r~~v~~~i~~~L~~l~ 537 (587) T protein:vir:96 458 ITIEFVRNRMTTMFRIVDDVTTFPDKNDPVKSEMALGEANDFLVSELKILLEEQYIGTRTINTSASQIKDFVQSYLGRKK 537 (587) T ss_pred EEEEEecCCcEEEEEeeccceecCCCCCchhhhhhhHHHHHHHHHHHHHHHHhcCCccccCHHHHHHHHHHHHHHHHHHH Confidence 9999877777777733 332 3999999999999999999887 5899999999999999999999999999 Q ss_pred hcCCeeeeeEEEeccCCCHHHhhCCeEEEEEEEEecCcceeEEEEEEEcchHHHH Q lcl|Aclame:pro 328 AAEIIPGGEVYLHPTLNTVERYKNGSWYIVIDYGRYSPNEHMIFHLNAVDRIVEE 382 (388) Q Consensus 328 ~~Gal~g~~v~~d~~~Nt~~~i~~G~~~~~v~~~p~~pae~I~~~~~~~~~~~~~ 382 (388) +.|+|.+|... +..-++...++++++.+.|+.|+|+|.+++.+..+-+++ T Consensus 538 ~~g~I~~~~~~-----dv~v~~~~D~~~v~~~v~Pv~~mekIy~tv~~~~~~~~~ 587 (587) T protein:vir:96 538 RDNEIQDFPPE-----DVQVIIEGNEARISLTIFPIRALKKISVSLVYRQQTLQA 587 (587) T ss_pred hCCcccCCCcc-----ceEEEecCCEEEEEEEEEEcccceEEEEEEEEEeeeecC Confidence 99999998541 111123345799999999999999999999999999987 No 41 >protein:vir:107310 Length: 581 # NCBI annotation: gp123 # Family: family:all:1196 # MgeID: mge:1550 # MgeName: Catera # Cross-refs: genbank:acc:YP_656131;genbank:gi:109393333;genbank:GeneID:4156791 Probab=100.00 E-value=5.6e-34 Score=202.87 Aligned_cols=363 Identities=10% Similarity=0.042 Sum_probs=204.7 Q ss_pred CCCCCCcC-----CCeEEEEcCCCccccccc--CcceeEEEeeccc-cccccccCcceeeccchhhhhh-cccccccccc Q lcl|Aclame:pro 1 MPVIDQFE-----HNGISIETHEPPPPMGPP--GDNVVAWVVTAPD-KHADVAFSVPFRVANTADAQYL-DSTGNELGTG 71 (388) Q Consensus 1 M~~~t~~~-----hGV~~~e~~~~~~~i~~v--~tav~g~vgta~~-~~~~~~~~~~v~v~s~~~~~~~-~~~~~~~gtl 71 (388) |+...++. -++....-......--.. .+-+..+-|..-+ .+. ..+..++.|.... .......-.. T Consensus 177 ~~~~~~~~~~gsd~~~~~~~~~~~~~~~~~~D~~t~~~~~~g~~~~~~~v------~~~~~~~~d~~~~~~v~~~~~~~~ 250 (581) T protein:vir:10 177 NPNSGQVYVLGTDYVVTRVNAGEDGEANTRDDLYTIQRVVDGGHIDPGDI------VQLSYRYTDPNYHEVIRFTDPDDI 250 (581) T ss_pred ccccCcceeccccceeeecccCccccccccccceeeeeeecccccccceE------EEEEEEeecCCcceeEEeecCcch Confidence 11110000 011111100000000000 0000000000000 000 0000001111000 0000000000 Q ss_pred hhhhhhhhccccceEE-EEecccccccccccccccccccch----h--hhhhhHhhhhhhhhhhhheecccccchhHHHH Q lcl|Aclame:pro 72 WHAASETLKKTSVPQY-FIVVPEGADDAATMANIIGGIDPT----T--GRRTGIAALTECTERPTLIGAPGFSQNKAVID 144 (388) Q Consensus 72 ~~a~~~~~~~~~~~~~-vv~~~~~~~~~~~~~~~~~~~~~~----t--g~~tgl~a~~~~~~~p~ll~ap~~~~~~~v~~ 144 (388) .+.+...++..+...- +...............+.++.++. + -....|.+++.... ..+++|+ +..+++++ T Consensus 251 ~~~~~~~~~~~g~~~~~~t~~~~~~~tn~~~~~l~~gvd~~g~tvt~~dy~~Al~ale~~~~--~~ivv~~-t~~~~v~a 327 (581) T protein:vir:10 251 QDFYGPAFDEAGNVQSEITLCAQLAITNGASTILACAVDPEGDTVTMGDYQNALNKFRDEDE--IAIIVAG-TGAQPIQA 327 (581) T ss_pred hhhhhhhhhccCccccchhhhheeeeecccceeEEeeccCCCCccchHHHHHHHHHHhcCCc--eEEEEeC-CCCHHHHH Confidence 1111111111111000 000000000011111122222221 1 11233444333221 2345666 56678888 Q ss_pred HHHHHhhhC-----ceEEEEecCCC---cchhHHHHHHHhhhcccccceEEEEecceecccccc-cceeehhh---HHHH Q lcl|Aclame:pro 145 ALASMAKRL-----KCRAVIDGPSG---STQDAIDLSGLLGGEGTGHDRVYMVDPMPAIYSRKA-QGNIYVPP---STIA 212 (388) Q Consensus 145 ~l~~~~~~~-----~~~~i~d~p~~---~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~~~~~-~~~~~~p~---s~~~ 212 (388) ++.++++++ .+.+++..+.. .+..... ....+++++|+.++||+....+... +....+|+ .+++ T Consensus 328 ~l~ahv~~~s~~~~~~ravigV~g~~~~~~~~~~~----~~a~~~n~~Rvvlv~p~~~~~~g~~~~~~v~lp~y~~AA~v 403 (581) T protein:vir:10 328 LVQQHVSAQSNNKYERRAILGMDGSVTPVPSATRI----ANAQSIKDQRVALISPSSFVYYAPELNREVVLGGQFMAAAV 403 (581) T ss_pred HHHHHHHHHHhccCCcEEEEEecCCCCCccHHHHH----HhhccCCCceEEEEecCceeecCcccCceeccchhhHHHHH Confidence 888887654 23455554432 2222211 1234778999999999988766554 34444666 6788 Q ss_pred HHHHhcccccccccccccc-ceeecccccccccCchhhhhhccccceEEEEEeCCCcEEE-EccccC----CCceeeehh Q lcl|Aclame:pro 213 MGAVAAVKPWESPGNQGVL-IQDVARVIDYNILDKSTEGDLLNRNGVSYFARTSMGGFSL-IGNRTV----TGKFISFVG 286 (388) Q Consensus 213 aG~~a~~d~~~s~~n~p~~-~~g~~~~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~-wG~rT~----~~~~i~vrR 286 (388) ||+++..|+.+|+-++++. +.++ ...++..|++.|+++|++++..++++|+++ ||-+|+ .|++|++|| T Consensus 404 AGl~a~~~~~~slT~~~i~gi~~l------~~~~s~~e~e~ll~~Gv~~l~~~~~~~v~Iv~gItT~~s~~~~~~i~~iR 477 (581) T protein:vir:10 404 AGKSVSAIAAMPLTRKVIRGFSGP------AEVQRDGEKSRESSEGLMVIEKTPRNLVHVRHGVTTDPTSLHTREWNIIG 477 (581) T ss_pred HHHhhccccccCcccccccccccc------cccCCHHHHHHHHhCCeEEEEEecCCeEEEEeeeecCCCCCcceeeeeeh Confidence 9999999999999999984 4332 345578899999999999999988899886 777785 499999999 Q ss_pred hHHHHHHHHHHHHH--HHhcccCCHHHHHHHHHHHHHHHHHHHhcCCeeeeeEEEeccCCCHHHhhCCeEEEEEEEEecC Q lcl|Aclame:pro 287 LEDAIARKLEAASQ--RAMSKQLTKSFMEQEIKKINLFMQDLVAAEIIPGGEVYLHPTLNTVERYKNGSWYIVIDYGRYS 364 (388) Q Consensus 287 ~~~~i~~~i~~~~~--~~vfepn~~~~~~~i~~~i~~~L~~l~~~Gal~g~~v~~d~~~Nt~~~i~~G~~~~~v~~~p~~ 364 (388) ++|++.+.+++.++ .|++|||+..+|.+|+..+..||..||+.|+|.||+..- .++.+.+.+.+++++.++|++ T Consensus 478 ~~D~v~~~ir~~~~~~~fIG~~n~~~~r~~ik~~i~~~L~~l~~~g~I~~~~~~~----~~~~~~~~d~v~V~i~v~Pv~ 553 (581) T protein:vir:10 478 QQDVMVYRIRDYLDADGLIGMPIYDTTIVQVKASAEAALVWLVDNNIIRGYRNLK----ARQIERQPDVIEVRYEWRPAY 553 (581) T ss_pred hhhHHHHHHHHHhhhhcCCCcccCHHHHHHHHHHHHHHHHHHHhcCcccCCccce----eeeeecCCCEEEEEEEEEecc Confidence 99999999999985 578899999999999999999999999999999987432 244467889999999999999 Q ss_pred cceeEEEEEEEcchHHHHHHHhcC Q lcl|Aclame:pro 365 PNEHMIFHLNAVDRIVEEFIEEVL 388 (388) Q Consensus 365 pae~I~~~~~~~~~~~~~l~~~~~ 388 (388) |+|||.+++++.++.=+ +.+-| T Consensus 554 ~i~~I~vti~~~p~~~~--~~~~~ 575 (581) T protein:vir:10 554 PLNYIVVRYSIAPETGD--ITSTI 575 (581) T ss_pred cceEEEEEEEEecCCCc--eEEEE Confidence 99999999999998755 44444 No 42 >protein:vir:99306 Length: 587 # NCBI annotation: putative major tail sheath protein # Family: family:all:2449 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024479;genbank:gi:48696438;genbank:GeneID:2948034 Probab=100.00 E-value=3.1e-33 Score=198.84 Aligned_cols=354 Identities=12% Similarity=0.065 Sum_probs=240.1 Q ss_pred CCCCC-----CcCCCeEEEEcCCCcccccccCcceeEEEeeccccccccccCcceeeccchhhhhhcccccccccchhhh Q lcl|Aclame:pro 1 MPVID-----QFEHNGISIETHEPPPPMGPPGDNVVAWVVTAPDKHADVAFSVPFRVANTADAQYLDSTGNELGTGWHAA 75 (388) Q Consensus 1 M~~~t-----~~~hGV~~~e~~~~~~~i~~v~tav~g~vgta~~~~~~~~~~~~v~v~s~~~~~~~~~~~~~~gtl~~a~ 75 (388) |+.-. .-++|||+++..++..++..+++++.+|+|.+..+ +.+++++++++.++...+.. |.|.+++ T Consensus 1 ~a~~~~~~~~~~~pgv~~~~~~~~~~~~~~~~~~~~~~~g~a~~G----~~~~~~~~~~~~~~~~~~~~----g~l~~~~ 72 (587) T protein:vir:99 1 MAVEPFPRRPITRPHASIEVDTSGIGGSAGSSEKVFCLIGQAEGG----EPNTVYELRNYSQAKRLFRS----GELLDAI 72 (587) T ss_pred CcccccCCcccccCceEEEEecCCccccCCCCCceEEEEEEecCC----ccceeEEeccHHHHHHHhcC----cchHHHH Confidence 76533 23569999999999999999999999999999888 45678999999998887755 3344444 Q ss_pred hhhh----ccccceEEEEeccccccccccccc------------------------------------------------ Q lcl|Aclame:pro 76 SETL----KKTSVPQYFIVVPEGADDAATMAN------------------------------------------------ 103 (388) Q Consensus 76 ~~~~----~~~~~~~~vv~~~~~~~~~~~~~~------------------------------------------------ 103 (388) ...| .+++..++.+++........+..+ T Consensus 73 ~~a~~~~~~~g~~~~~~~rv~~~~~a~~~~~~l~~~a~~~G~~gN~i~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 152 (587) T protein:vir:99 73 ELAWGSNPNYTAGRILAMRIEDAKPASAEIGGLKITSKIYGNVANNIQVGLEKNTLSDSLRLRVIFQDDRFNEVYDNIGN 152 (587) T ss_pred HHHhccccCCCceEEEEEEcCCCceeEEEecCeEEEEeeccccccceEEEEccCCCCcceeEEEEEecccceeeeeeccc Confidence 3333 344444444332111000000000 Q ss_pred -------------------------------------------------------------------------------- Q lcl|Aclame:pro 104 -------------------------------------------------------------------------------- 103 (388) Q Consensus 104 -------------------------------------------------------------------------------- 103 (388) T Consensus 153 v~~i~y~g~~~~a~~~v~~~~~t~~a~~~~l~~g~~~v~~yrL~~g~~~~~~~~~~~i~~~~~~tAky~~~~~~~i~~~~ 232 (587) T protein:vir:99 153 IFTIKYKGEEANATFSVEHDEETQKASRLVLKVGDQEVKSYDLTGGAYDYTNAIITDINQLPDFEAKLSPFGDKNLESSK 232 (587) T ss_pred eeeEEeecccccceeeEeecCcceeeeeeeeecCCceeEEEEecCCchHHHHHHHhhhccccceeEEeeccCCceeEeec Confidence Q ss_pred ---------------------------------------------------------------------------ccccc Q lcl|Aclame:pro 104 ---------------------------------------------------------------------------IIGGI 108 (388) Q Consensus 104 ---------------------------------------------------------------------------~~~~~ 108 (388) +.|+. T Consensus 233 ~~~~~~~~v~~~~~~v~a~~~D~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~t~LtGG~ 312 (587) T protein:vir:99 233 LDKIENANIKDKAVYVKAVFGDLEKQTAYNGIVSFEQLNAEGEVPSNVEVEAGEESATVTATSPIKTIEPFELTKLKGGT 312 (587) T ss_pred ccccccceeeeeeeeeehhccceeeecccceeeeeeecccccchhhhhhhhhccccceeeeeccccceecccceeeecCC Confidence 00000 Q ss_pred cchhh--hhhhHhhhhhhhhhhhheecccccchhHHHHHHHHHhhhC-----ceEEEEecCCCcchhHHHHHHHhhhccc Q lcl|Aclame:pro 109 DPTTG--RRTGIAALTECTERPTLIGAPGFSQNKAVIDALASMAKRL-----KCRAVIDGPSGSTQDAIDLSGLLGGEGT 181 (388) Q Consensus 109 ~~~tg--~~tgl~a~~~~~~~p~ll~ap~~~~~~~v~~~l~~~~~~~-----~~~~i~d~p~~~~~~~~~~~~~~~~~~~ 181 (388) ++... -...|.++.... - ..++|. +.++.+.+++.++++++ .+++++..+.+.+......+ ...+ T Consensus 313 dG~~~~sy~~al~ale~~~--~-~~i~~~-t~d~~i~a~l~a~vk~~r~~g~~~~aVlg~~~~~~~~~~~~~----a~~~ 384 (587) T protein:vir:99 313 NGEPPATWADKLDKFAHEG--G-YYIVPL-SSKQSVHAEVASFVKERSDAGEPMRAIVGGGFNESKEQLFGR----QASL 384 (587) T ss_pred CCCccccHHHHHHHHhhCC--c-EEEEec-CCCHHHHHHHHHHHHHHHhCCCcEEEEecCCCCCCHHHHHHH----hhhc Confidence 00000 000111111100 0 011111 23456777777777654 36788877765553332222 3467 Q ss_pred ccceEEEEecceecccccccceeehhh---HHHHHHHHhccccccccccccccceeecccccccccCchhhhhhccccce Q lcl|Aclame:pro 182 GHDRVYMVDPMPAIYSRKAQGNIYVPP---STIAMGAVAAVKPWESPGNQGVLIQDVARVIDYNILDKSTEGDLLNRNGV 258 (388) Q Consensus 182 ~s~~~~~~~p~~~~~~~~~~~~~~~p~---s~~~aG~~a~~d~~~s~~n~p~~~~g~~~~~~~~~~~~~~~~~~Ln~~gI 258 (388) ++++++.++|+..... .++....+|+ ++++||++|..++.+|+-|+++...++ ...++..|++.|+++|+ T Consensus 385 n~e~vi~v~~~~~~~~-~dg~~~~~~~~~~aa~vAGl~Ag~~~~~SlT~~~i~~~~v------~~~~t~~e~e~li~~Gv 457 (587) T protein:vir:99 385 SNPRVSLVANSGTFVM-DDGRKNHVPAYMVAVALGGLASGLEIGESITFKPLRVSSL------DQIYESIDLDELNENGI 457 (587) T ss_pred CCCcEEEEeccceEec-CCCceeeechHHHHHHHHHHHhcCchhcCccceeeecccc------cccCCHHHHHHHHhCCe Confidence 8899999998754332 2244455666 789999999999999999998754333 23568899999999999 Q ss_pred EEEEEeCCCc---EEE-EccccC------CCceeeehhhHHHHHHHHHHHH-HHHhcccCCHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 259 SYFARTSMGG---FSL-IGNRTV------TGKFISFVGLEDAIARKLEAAS-QRAMSKQLTKSFMEQEIKKINLFMQDLV 327 (388) Q Consensus 259 n~i~~~~~~G---~~~-wG~rT~------~~~~i~vrR~~~~i~~~i~~~~-~~~vfepn~~~~~~~i~~~i~~~L~~l~ 327 (388) .++...++++ +++ .|-.|. .|++++++|++|+|.+.+++.+ .+|+++||+...|..|+..+..||.+|| T Consensus 458 l~l~~~~~~~~~~vriv~~ItT~t~~~~~~~~~i~viRv~D~i~~di~~~~~~~yiGk~Nn~~~r~~i~~~i~~~L~~l~ 537 (587) T protein:vir:99 458 ISIEFVRNRTNTFFRIVDDVTTFNDKSDPVKAEMAVGEANDFLVSELKVQLEDQFIGTRTINTSASIIKDFIQSYLGRKK 537 (587) T ss_pred EEEEEecCCcceEEEEeeceeeccCCCCchhhhhhhhhhHHHHHHHHHHHHHhhCCccccchHHHHHHHHHHHHHHHHHH Confidence 9998776654 332 344443 3999999999999999999886 6899999999999999999999999999 Q ss_pred hcCCeeeeeEEEeccCCCHHHhhCCeEEEEEEEEecCcceeEEEEEEEcchHHHH Q lcl|Aclame:pro 328 AAEIIPGGEVYLHPTLNTVERYKNGSWYIVIDYGRYSPNEHMIFHLNAVDRIVEE 382 (388) Q Consensus 328 ~~Gal~g~~v~~d~~~Nt~~~i~~G~~~~~v~~~p~~pae~I~~~~~~~~~~~~~ 382 (388) +.|+|.+|... +.+-++...++++++.+.|+.|+|+|.+++++..+-+++ T Consensus 538 ~~gaI~~~~~~-----dv~v~~~~d~~~v~~~v~Pv~~mekIy~tv~~~~~~~~~ 587 (587) T protein:vir:99 538 RDNEIQDFPAE-----DVQVIVEGNEARISMTVYPIRSFKKISVSLVYKQQTLQA 587 (587) T ss_pred hCCcccCCCcc-----ceEEEecCCEEEEEEEEEEcccceEEEEEEEEEeeeecC Confidence 99999998642 111122345799999999999999999999999999998 No 43 >protein:vir:7653 Length: 581 # NCBI annotation: gp124 # Family: family:all:1196 # MgeID: mge:148 # MgeName: Bxz1 # Cross-refs: genbank:acc:NP_818197;genbank:gi:29566631;genbank:GeneID:1259809 Probab=100.00 E-value=1.7e-33 Score=200.27 Aligned_cols=370 Identities=9% Similarity=0.068 Sum_probs=202.1 Q ss_pred CCCCCCcCC--CeEE---EEcCCCcccccccCcc-----------------e--eEEEeecccc--ccccccCcceeecc Q lcl|Aclame:pro 1 MPVIDQFEH--NGIS---IETHEPPPPMGPPGDN-----------------V--VAWVVTAPDK--HADVAFSVPFRVAN 54 (388) Q Consensus 1 M~~~t~~~h--GV~~---~e~~~~~~~i~~v~ta-----------------v--~g~vgta~~~--~~~~~~~~~v~v~s 54 (388) =|+++--++ |++. .++.+.--..-+..+. . ...+.....+ +.-.......+.++ T Consensus 156 ~~~~~~~l~~~g~~~~~~~~~~s~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~g~~~~~~~i~~~~~~~~D 235 (581) T protein:vir:76 156 VPAMNRALAKKGIKTDTIRVVNPNSGQVYVLGTDYVVTRVNAGEDGEANTRDDLYTIQRVVDGGHIDPGDIVQLSYRYTD 235 (581) T ss_pred cCCcCceeeeccccccccceeecCCcceeeecccccceeeccCcccceeeeeeeeeeEeecccccccceeEEEEEEEeec Confidence 111110001 2220 0111000000000000 0 0000000000 00000000000000 Q ss_pred chhhhhhcccccccccchhhhhhhhccccceE-EEEecccccccccccccccccccch----h--hhhhhHhhhhhhhhh Q lcl|Aclame:pro 55 TADAQYLDSTGNELGTGWHAASETLKKTSVPQ-YFIVVPEGADDAATMANIIGGIDPT----T--GRRTGIAALTECTER 127 (388) Q Consensus 55 ~~~~~~~~~~~~~~gtl~~a~~~~~~~~~~~~-~vv~~~~~~~~~~~~~~~~~~~~~~----t--g~~tgl~a~~~~~~~ 127 (388) ........... .-.....+...++..+... -+...............+.++.++. + .....|.+++.... T Consensus 236 ~~~~~~v~~~~--~~~~~~~~~~~~~~~g~~~~e~~~~~~~~~t~~~~~~l~~gvd~~g~tvt~~dy~~aL~ale~~~~- 312 (581) T protein:vir:76 236 PNYHEVIRFTD--PDDIQDFYGPAFDEAGNVQSEITLCAQLAITNGASTILACAVDPEGDTVTMGDYQNALNKFRDEDE- 312 (581) T ss_pred CCccceEEEec--ccccccceeeehhhcCccccchhhhhheeeccccceEEEeeecCCCCccchHHHHHHHHHHhcCCe- Confidence 00000000000 0000001101111111000 0000000000011111222222221 1 11233443333221 Q ss_pred hhheecccccchhHHHHHHHHHhhhC-----ceEEEEecCCCcchhHHHHHHHhhhcccccceEEEEecceeccccccc- Q lcl|Aclame:pro 128 PTLIGAPGFSQNKAVIDALASMAKRL-----KCRAVIDGPSGSTQDAIDLSGLLGGEGTGHDRVYMVDPMPAIYSRKAQ- 201 (388) Q Consensus 128 p~ll~ap~~~~~~~v~~~l~~~~~~~-----~~~~i~d~p~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~~~~~~- 201 (388) ..+++|+ +.++++++++.++++++ ...+++..+.......... ......+++++|+.++||+....+...+ T Consensus 313 -~~ivvp~-t~~~~i~a~l~ahv~~~s~~~~~~ra~igv~g~~~~~~~~~-~~~~a~~~ns~Rvvlv~p~~~~~~g~~~~ 389 (581) T protein:vir:76 313 -IAIIVAG-TGAQPIQALVQQHVSAQSNNKYERRAILGMDGSVTPVPSAT-RIANAQSIKDQRVALISPSSFVYYAPELN 389 (581) T ss_pred -EEEEEec-CCChHHHHHHHHHHHHHHhccCCceEEEEeeCCCCCchHHH-HHHhhcccCCCcEEEEEcCceEeccccCC Confidence 2345665 56677888888877644 2344444333222111111 1112346789999999999887765433 Q ss_pred ceeehhh---HHHHHHHHhcccccccccccccc-ceeecccccccccCchhhhhhccccceEEEEEeCCCcEE-EEcccc Q lcl|Aclame:pro 202 GNIYVPP---STIAMGAVAAVKPWESPGNQGVL-IQDVARVIDYNILDKSTEGDLLNRNGVSYFARTSMGGFS-LIGNRT 276 (388) Q Consensus 202 ~~~~~p~---s~~~aG~~a~~d~~~s~~n~p~~-~~g~~~~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~-~wG~rT 276 (388) ....+|+ .+++||+.+..++.+|+-++++. +. .....++..|++.|+++|++++..+++++++ +||-+| T Consensus 390 ~~~~lp~~~~AA~vAG~~a~~~~~~slT~~~i~g~~------~~~~~~s~~e~e~ll~~Gv~~l~~~~~~~v~Iv~gItT 463 (581) T protein:vir:76 390 REVVLGGQFMAAAVAGKSVSAIAAMPLTRKVIRGFS------GPAEVQRDGEKSRESSEGLMVIEKTPRNLVHVRHGVTT 463 (581) T ss_pred cceecchhhhhhhHHhhhhccccccCcccccccccc------cccccCCHHHHHHHHhCCeEEEEEecCCeEEEEEeeec Confidence 3334454 56778899999999999999985 32 2334457789999999999999988888987 488888 Q ss_pred C----CCceeeehhhHHHHHHHHHHHHH--HHhcccCCHHHHHHHHHHHHHHHHHHHhcCCeeeeeEEEeccCCCHHHhh Q lcl|Aclame:pro 277 V----TGKFISFVGLEDAIARKLEAASQ--RAMSKQLTKSFMEQEIKKINLFMQDLVAAEIIPGGEVYLHPTLNTVERYK 350 (388) Q Consensus 277 ~----~~~~i~vrR~~~~i~~~i~~~~~--~~vfepn~~~~~~~i~~~i~~~L~~l~~~Gal~g~~v~~d~~~Nt~~~i~ 350 (388) + +|+++++||++|++.+.+++.++ .|++|||+..+|.+|+..+..||..||+.|+|.||+.. ..++.+++ T Consensus 464 ~~s~~~~k~i~viR~~D~v~~~vr~~~~~~~fiG~~n~~~~r~~ik~~i~~~L~~l~~~g~I~g~~~~----~~~~~~~~ 539 (581) T protein:vir:76 464 DPTSLHTREWNIIGQQDVMVYRIRDYLDADGLIGMPIYDTTIVQVKASAEAALVWLVDNNIIRGYRNL----KARQIERQ 539 (581) T ss_pred CCCCCccceeeehhhhHHHHHHHHHHHhhhcCCCcccChHHHHHHHHHHHHHHHHHHhcCcccCcccc----eeeEEecC Confidence 5 49999999999999999999986 57779999999999999999999999999999998632 23555667 Q ss_pred CCeEEEEEEEEecCcceeEEEEEEEcchHHHHHHHhcC Q lcl|Aclame:pro 351 NGSWYIVIDYGRYSPNEHMIFHLNAVDRIVEEFIEEVL 388 (388) Q Consensus 351 ~G~~~~~v~~~p~~pae~I~~~~~~~~~~~~~l~~~~~ 388 (388) .+++++++.++|++|+|||.+++++.++.=+ +.+-| T Consensus 540 ~d~v~V~i~v~Pv~~ie~I~vt~~~~p~~~~--~~~~~ 575 (581) T protein:vir:76 540 PDVIEVRYEWRPAYPLNYIVVRYSIAPETGD--ITSTI 575 (581) T ss_pred CCEEEEEEEEEecccceEEEEEEEEeeCCCc--eEEEE Confidence 8999999999999999999999999888654 44444 No 44 >protein:vir:102957 Length: 437 # NCBI annotation: sheath tail protein # Family: family:all:632 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945292;genbank:gi:39653727;uniprot:Q708M0;genbank:GeneID:2672871 Probab=99.95 E-value=1.8e-29 Score=178.23 Aligned_cols=356 Identities=10% Similarity=-0.026 Sum_probs=221.1 Q ss_pred CCC-----CCCcCCCeEEEEcCCCcccccccCcceeEEEeeccccccccccCcceeeccchhhhhhcccccccccchhhh Q lcl|Aclame:pro 1 MPV-----IDQFEHNGISIETHEPPPPMGPPGDNVVAWVVTAPDKHADVAFSVPFRVANTADAQYLDSTGNELGTGWHAA 75 (388) Q Consensus 1 M~~-----~t~~~hGV~~~e~~~~~~~i~~v~tav~g~vgta~~~~~~~~~~~~v~v~s~~~~~~~~~~~~~~gtl~~a~ 75 (388) |+- ...-.+|||++++..+.+++..+++++.+|++.++-+ |.++++.|+|..++...|+...... ....+ T Consensus 1 m~gg~~~~~~k~~PGvYi~~~~~~~~~i~~~~~~~~a~~~~~~~G----p~~~~~~i~s~~d~~~~fG~~~~~~-~~~~~ 75 (437) T protein:vir:10 1 MAGGIWKRQNKVRPGAYINVKSKDIAMTRLGGDGVVTVPLALSFG----QSKKLMKIRRGEDLFKKLGYEQESP-QLLLL 75 (437) T ss_pred CCcceecccceecCceeEEEecCCcceeeccCCcEEEEEEEecCC----CCceeEEEecHHHHHHHcCCccchh-HHHHH Confidence 443 2334789999999999999999999999999998766 6788999999999998887533211 12233 Q ss_pred hhhhccccceEEEEeccccccccccccccc------ccccc---------h---hhhhh-------------hHhhhhhh Q lcl|Aclame:pro 76 SETLKKTSVPQYFIVVPEGADDAATMANII------GGIDP---------T---TGRRT-------------GIAALTEC 124 (388) Q Consensus 76 ~~~~~~~~~~~~vv~~~~~~~~~~~~~~~~------~~~~~---------~---tg~~t-------------gl~a~~~~ 124 (388) ...+ +++..++++++.++.....+.++.. .+.-+ . .+... -+....+. T Consensus 76 ~~~~-~g~~~~~~~R~~~g~~a~~tl~~~~~~~A~~~G~~gn~i~v~v~~~~~d~~~~~v~~~~~~~~~d~~~v~~~~~~ 154 (437) T protein:vir:10 76 NEAF-KRVSEVLLYRLNTGEKANVSLSDNVTAQAKYSGVRGNDITVTVKTNVDDPSSFDVVTFLDTVVMDLQTVKVLADL 154 (437) T ss_pred HHHh-cCCCEEEEEECCCCceeeEeeccceEEEeccCCcccceeEEEEeeccCCccceEEEEecCcceeeeeehhhhhhh Confidence 3333 5666788888766543322222210 00000 0 00000 00000000 Q ss_pred hh--------------hhhheecccc---cchhHHHHHHHHHhh-hCceEEEEecCCCcchhHHHHHHHhhh-cccccce Q lcl|Aclame:pro 125 TE--------------RPTLIGAPGF---SQNKAVIDALASMAK-RLKCRAVIDGPSGSTQDAIDLSGLLGG-EGTGHDR 185 (388) Q Consensus 125 ~~--------------~p~ll~ap~~---~~~~~v~~~l~~~~~-~~~~~~i~d~p~~~~~~~~~~~~~~~~-~~~~s~~ 185 (388) .. ....-+..|- ........+|.++.. +.+.+++ |............|..+ ..-...+ T Consensus 155 ~~n~~v~~~~~~~l~~~a~~~LtGG~dg~~t~~dy~~al~~le~~~~n~l~~---~~~d~~~~t~~~~~ik~~r~~~g~~ 231 (437) T protein:vir:10 155 KNNALVEFSGTGELQPVAGAKLTGGTDGAISTQDYLEYFKALETVEFNYMAL---PVEDASIKKAAINFIKRMREDEGLG 231 (437) T ss_pred hhhcccccccccccccccceeeeccccCCCChhHHHHHHHHhccCcceEEEe---cCCChhHHHHHHHHHHHHHhccCce Confidence 00 0000111110 011234555655532 2333332 22222112222222221 1111222 Q ss_pred EEEE--------------ecceecccccccceeehhhHHHHHHHHhcccccccccccccc-ceeecccccccccCchhhh Q lcl|Aclame:pro 186 VYMV--------------DPMPAIYSRKAQGNIYVPPSTIAMGAVAAVKPWESPGNQGVL-IQDVARVIDYNILDKSTEG 250 (388) Q Consensus 186 ~~~~--------------~p~~~~~~~~~~~~~~~p~s~~~aG~~a~~d~~~s~~n~p~~-~~g~~~~~~~~~~~~~~~~ 250 (388) ..++ .+.....+. ......-.++.+||++|..+..+|+.|+++. +.. ....++.+|. T Consensus 232 ~~~V~~~~~~d~e~Iin~~n~~~~~~~--~~~~~~~~~a~vAG~~Ag~~~~~S~t~~~~~~~~~------v~~~~t~~e~ 303 (437) T protein:vir:10 232 AQLVVADSDADSEAVINVKNGVILSDK--TVIDKTKATVWVAAASANAGVEKSLTYEKYEDSVD------VVGRLSHTET 303 (437) T ss_pred EEEEeCCCCCCCceEEEeecceeecCc--ceechhhHHHHHHHHhccCccccCccccccCCccc------ccccCCHHHH Confidence 2222 111111110 0111234468899999999999999999874 222 2235688999 Q ss_pred hhccccceEEEEEeCCCcEEEEccccC---------CCceeeehhhHHHHHHHHHHHHH-HHhcc-cCCHHHHHHHHHHH Q lcl|Aclame:pro 251 DLLNRNGVSYFARTSMGGFSLIGNRTV---------TGKFISFVGLEDAIARKLEAASQ-RAMSK-QLTKSFMEQEIKKI 319 (388) Q Consensus 251 ~~Ln~~gIn~i~~~~~~G~~~wG~rT~---------~~~~i~vrR~~~~i~~~i~~~~~-~~vfe-pn~~~~~~~i~~~i 319 (388) +.|.++|+.++.+..++-+.++|-.|+ .|++|.++|++|+|.+.+++.+. .|+++ ||+...|..++..+ T Consensus 304 ~~~i~~G~~vl~~~~~~v~i~~gInTltt~~~~~~~~~~ki~vir~~D~i~~di~~~~~~~yiGk~~N~~~~r~~~~~~i 383 (437) T protein:vir:10 304 EDALLKGQFVFTARRGRAVVEQDINSHVSFTIEKNQDFRKNRILRTLDDIVNDTRYAFSEYFLGKVSNNEDGRQAFKANR 383 (437) T ss_pred HHHHhCCcEEEEEeCCeEEEEEccccccccCCCCCchhhhhhHHHHHHHHHHHHHHHHHhccccccCCCHHHHHHHHHHH Confidence 999999999987653333444786664 38999999999999999999877 49997 79999999999999 Q ss_pred HHHHHHHHhcCCeeeeeEEEeccCCCHHHhhCCeEEEEEEEEecCcceeEEEEEEEc Q lcl|Aclame:pro 320 NLFMQDLVAAEIIPGGEVYLHPTLNTVERYKNGSWYIVIDYGRYSPNEHMIFHLNAV 376 (388) Q Consensus 320 ~~~L~~l~~~Gal~g~~v~~d~~~Nt~~~i~~G~~~~~v~~~p~~pae~I~~~~~~~ 376 (388) +.||.+|+++|+|.+|.+..++..+.. ....+++++.+.|+.++|+|.+.+... T Consensus 384 ~~yl~~l~~~g~I~~~~~~d~~v~~~~---~~~~v~v~~~v~~~dame~iy~ti~v~ 437 (437) T protein:vir:10 384 IRYFKDLEARGAIEDFKVEDIEVLRGE---LKESVVVNVKVKPVDSMEKLYMTVTVE 437 (437) T ss_pred HHHHHHHHhCCCccCCCceeEEeecCC---CCCEEEEEEEEEEeeeeeeEEEEEEec Confidence 999999999999999988776544322 347899999999999999999999888 No 45 >protein:vir:100829 Length: 607 # NCBI annotation: hypothetical protein # Family: family:all:2449 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164738;genbank:gi:56693151;genbank:GeneID:3197462 Probab=99.94 E-value=1.3e-27 Score=168.06 Aligned_cols=359 Identities=10% Similarity=0.076 Sum_probs=232.5 Q ss_pred CCCCCCcCCCeEEEEcCCCcccccccCcceeEEEeeccccccccccCcceeeccchhhhhhcccccccccchhhhhhhh- Q lcl|Aclame:pro 1 MPVIDQFEHNGISIETHEPPPPMGPPGDNVVAWVVTAPDKHADVAFSVPFRVANTADAQYLDSTGNELGTGWHAASETL- 79 (388) Q Consensus 1 M~~~t~~~hGV~~~e~~~~~~~i~~v~tav~g~vgta~~~~~~~~~~~~v~v~s~~~~~~~~~~~~~~gtl~~a~~~~~- 79 (388) -|--..-++|||+++..++..++...++.+.+|||.+..+ +.+++++++++.++...|.. |.|.++....| T Consensus 15 ~~~~~~~~pgv~~~~~~~~~~~~~~~~~~~~~~iG~a~~G----~~~~~~~~~~~~~a~~~f~~----g~l~~a~~~a~~ 86 (607) T protein:vir:10 15 YPLFYDSRPHVETNFDDSRLSNTASDSAKNIFMLGSATNG----DPTKVYEIRTSQQATKIFGS----GDLVDGIKLAFD 86 (607) T ss_pred hCCCCccCCceEEEEecCcCcCCCCCCcceEEEEEEeCCC----CCceEEEEcchhHHHHhhcC----cchHHHHHHhhc Confidence 2221223569999999999999999999999999999988 45788999999988877654 23334333333 Q ss_pred -----ccccceEEEEecccccccccccc--------------c------------------------------------- Q lcl|Aclame:pro 80 -----KKTSVPQYFIVVPEGADDAATMA--------------N------------------------------------- 103 (388) Q Consensus 80 -----~~~~~~~~vv~~~~~~~~~~~~~--------------~------------------------------------- 103 (388) .++++.|+.+++........+.+ + T Consensus 87 ~~~~~~~g~~~~~~~rv~~~~~a~~~~~~~~~~~~~~~~~~~~i~~~l~~~~~~~~~~~~~~~~d~~~~~~~n~g~~~~i 166 (607) T protein:vir:10 87 PTGNSVTNGGTVYALRVDNAKQASLVKDGLTFTSSIFGTNANQVSVALDNDVFGVPRITVNYSPDNYERTYTNIGQMFSI 166 (607) T ss_pred cccCCccCCceEEEEeCCCccccceecccccccccccccCCCceEEEEEecCCCccceeEEeecccceeeeeeccceeec Confidence 34545555544311000000000 0 Q ss_pred ------------cc----c---------cccchh-----------hhh-h------------------------------ Q lcl|Aclame:pro 104 ------------II----G---------GIDPTT-----------GRR-T------------------------------ 116 (388) Q Consensus 104 ------------~~----~---------~~~~~t-----------g~~-t------------------------------ 116 (388) +. | +.+... +.+ + T Consensus 167 ~y~g~~~~a~~~v~~~~~g~~~~lt~~~~~~~~~~~~V~~~~l~~~~~~t~~~l~~din~~~~~~A~~~g~~~i~tky~d 246 (607) T protein:vir:10 167 TYSGKSASAGYTVSHDTDGKAILLTLGSGDSIDKLTNVATFDLTMSKYDTIAKLMQAISATPNFSASVVGSPSVNTSYLD 246 (607) T ss_pred ccCcccccccceeeecCCCceeEEEecCCCccceeeeeecccccccccchHHHHHHHhhcCCceEEEEecccceeeeccc Confidence 00 0 000000 000 0 Q ss_pred -------------------------------------------------------------------------------- Q lcl|Aclame:pro 117 -------------------------------------------------------------------------------- 116 (388) Q Consensus 117 -------------------------------------------------------------------------------- 116 (388) T Consensus 247 ~~~~~i~V~~~~~iv~a~~~D~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~a~~~LtGGtdG 326 (607) T protein:vir:10 247 EVTSPVDVKTAPAVVTAKIGDAISKLGYDPYVVVTQTSNNKPIVNGVSAGTGSATASVTTAPESFPANFDTAFLTGGSTG 326 (607) T ss_pred cccceeEEEEeeeeechhhhhhhhcccccceEEeeecccchhhhhhhhccccceeeeeeccccccccccceeeeeCCCCC Confidence Q ss_pred --------hHhhhhhhhhhhhheecccccchhHHHHHHHHHhhhC-----ceEEEEecCCCcchhHHHHHHHhhhccccc Q lcl|Aclame:pro 117 --------GIAALTECTERPTLIGAPGFSQNKAVIDALASMAKRL-----KCRAVIDGPSGSTQDAIDLSGLLGGEGTGH 183 (388) Q Consensus 117 --------gl~a~~~~~~~p~ll~ap~~~~~~~v~~~l~~~~~~~-----~~~~i~d~p~~~~~~~~~~~~~~~~~~~~s 183 (388) .+.++... ....++ + .+.++++.+++.++++++ ++.+++..+.+.+......+ ...+++ T Consensus 327 ~~~~ty~dal~aLe~~--e~~~i~-~-~t~d~ai~~~l~a~vkr~~~~g~~~~aVlg~~~~~t~~~~~t~----a~~~N~ 398 (607) T protein:vir:10 327 DVPVSWADKFNGAIGN--NVYYII-P-LTSEENIHAELQAFIDEQHVLGYNYHAFVGGGFAEPLEQILSR----QVNIND 398 (607) T ss_pred CchhhHHHHHHHHhhc--CceEEE-e-cCCCHHHHHHHHHHHHHHHhCCCcEEEEecCCCCCCHHHHHHH----HHhhCC Confidence 00000000 000011 0 012355666677766543 35677766655543332222 235788 Q ss_pred ceEEEEecceecccccccceeehh---hHHHHHHHHhccccccccccccccceeecccccccccCchhhhhhccccceEE Q lcl|Aclame:pro 184 DRVYMVDPMPAIYSRKAQGNIYVP---PSTIAMGAVAAVKPWESPGNQGVLIQDVARVIDYNILDKSTEGDLLNRNGVSY 260 (388) Q Consensus 184 ~~~~~~~p~~~~~~~~~~~~~~~p---~s~~~aG~~a~~d~~~s~~n~p~~~~g~~~~~~~~~~~~~~~~~~Ln~~gIn~ 260 (388) ++++.+.|+....+. +.....| .++++||+.|..++.+|+-|++++..++. ..++..|++.|.++|+.+ T Consensus 399 ervv~V~~~~~~~~~--G~~~~~~~~~~Aa~vAGl~Ag~~~~~SlT~k~i~~~~v~------~~lt~~e~e~ai~~Gv~~ 470 (607) T protein:vir:10 399 SRFGLVGQSGHVQEG--GESVHVPAYLMAAYVGGLSSSLGVAVPITNKKLALVDLD------QNFSGDDLNTLNQNGVIG 470 (607) T ss_pred CcEEEEecCeeEeeC--CcceeccHHHHHHHHHHHHhcCccccCcccceecccccc------ccCCHHHHHHHHhCCeEE Confidence 999999998765443 3344444 47899999999999999999998643332 356788999999999999 Q ss_pred EEEeC----CCcEEEEcc-ccC------CCceeeehhhHHHHHHHHHHHH-HHHhcccCCHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 261 FARTS----MGGFSLIGN-RTV------TGKFISFVGLEDAIARKLEAAS-QRAMSKQLTKSFMEQEIKKINLFMQDLVA 328 (388) Q Consensus 261 i~~~~----~~G~~~wG~-rT~------~~~~i~vrR~~~~i~~~i~~~~-~~~vfepn~~~~~~~i~~~i~~~L~~l~~ 328 (388) +...+ +++++++-. .|. .|++++++|++|+|.+.+++.+ .+|++++|+...|.+++..+..+|..+|. T Consensus 471 l~~~~~~~~~~~vrIv~~ItT~t~~~~~~~~~i~viRv~D~i~~dir~~~~~~yIGk~nnd~~~~~vk~~i~~~L~~~~l 550 (607) T protein:vir:10 471 IEHLVNRNATGGYYIVQDVSTNTVSSSHVDGSLYLGELTDFLFDNLRFVLRDTYIGSNIRSTSADDIKSTVASYLYSEMN 550 (607) T ss_pred EEEccCccccceEEEeeeeeeccCCCCcchheeehhhhHHHHHHHHHHHHhhcCCcccCCcchHHHHHHHHHHHHHHHHH Confidence 86543 335666443 332 3999999999999999999886 58999999999999999999999987665 Q ss_pred --cCCeeeeeEEEeccCCCHHHhhCCeEEEEEEEEecCcceeEEEEEEEcchHHHHHHHhcC Q lcl|Aclame:pro 329 --AEIIPGGEVYLHPTLNTVERYKNGSWYIVIDYGRYSPNEHMIFHLNAVDRIVEEFIEEVL 388 (388) Q Consensus 329 --~Gal~g~~v~~d~~~Nt~~~i~~G~~~~~v~~~p~~pae~I~~~~~~~~~~~~~l~~~~~ 388 (388) .|+|.+|... +-+-...+.++++++.+.|+.++|+|.+++++.++-|++-=+.-- T Consensus 551 ~~~gaI~df~~e-----dv~v~~~~D~v~v~~~v~Pv~~iekIyvtv~v~~~~~~~~~~~~~ 607 (607) T protein:vir:10 551 NDDGLIVDFSES-----DIVVTISGTVVYIQFAVAPTQEIKNIVVSGTYSNYSATSEDNTTK 607 (607) T ss_pred HhcCceeCCCcc-----ccEEeeCCCEEEEEEEEEEcccceEEEEEEEEEEEEEeeccCCCC Confidence 5899987421 111123456899999999999999999999999988774333222 No 46 >protein:vir:105470 Length: 451 # NCBI annotation: putative phage sheath tail protein # Family: family:all:632 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529880;genbank:gi:90592620;genbank:GeneID:3974534 Probab=99.89 E-value=3.7e-24 Score=149.07 Aligned_cols=357 Identities=10% Similarity=-0.062 Sum_probs=207.9 Q ss_pred CCC-----CCCcCCCeEEEEcCCCcccccccCcceeEEEeeccccccccccCcceeeccchhhhhhcccccccccchhhh Q lcl|Aclame:pro 1 MPV-----IDQFEHNGISIETHEPPPPMGPPGDNVVAWVVTAPDKHADVAFSVPFRVANTADAQYLDSTGNELGTGWHAA 75 (388) Q Consensus 1 M~~-----~t~~~hGV~~~e~~~~~~~i~~v~tav~g~vgta~~~~~~~~~~~~v~v~s~~~~~~~~~~~~~~gtl~~a~ 75 (388) |+- ...-.+|||+++...+.+++..+.+.++.+++.+.+.. .++++.+.+..++...++.... .....++ T Consensus 1 magg~~~~~~K~~PGvYi~~~~~~~~~~~~~~~~~~~~i~~~~~~g----~~~~v~i~~~~d~~~~fG~~~~-~~~~~~~ 75 (451) T protein:vir:10 1 MAGGTWKAQDKRRPGTYINVVGNGQREAASSLGRVLLIRDKGLGWG----KNGVIEVEANSDFTKKLGTTLD-DPSLTAL 75 (451) T ss_pred CCceeeccceeecCceEEEEeccCcceeeccCCcEEEEEeeecCCC----CcccEEeecHHHHHHHcCCccc-chhHHHH Confidence 553 23346799999999999999999999999998654432 3567889999998887765332 2223345 Q ss_pred hhhhccccceEEEEeccccccccccccc-------ccccccch------------hhhh--------------h----hH Q lcl|Aclame:pro 76 SETLKKTSVPQYFIVVPEGADDAATMAN-------IIGGIDPT------------TGRR--------------T----GI 118 (388) Q Consensus 76 ~~~~~~~~~~~~vv~~~~~~~~~~~~~~-------~~~~~~~~------------tg~~--------------t----gl 118 (388) ..++. ++..++++++..+.....+... -..+.-+. .... + -. T Consensus 76 ~~~~~-g~~~v~~yrl~~g~~a~~t~~~~~~~~~Aky~G~~Gn~i~v~v~~~~~d~~~~~v~t~~g~~~vd~qtv~~~~~ 154 (451) T protein:vir:10 76 KETLK-GASKVLVLNPNEGTAATLTKEGLPWTVTANYPGEKGNQITVSVEVSPADQNAATVSTIFGTKLVDEQSIKFNEL 154 (451) T ss_pred HHHhc-CCcEEEEEEcCCCceEEEEeecCceEEEEeeCCcCCceEEEEEecccCCcCceEEEEEECCeEEEEEEeeccch Confidence 54443 4556677776554332222110 00000000 0000 0 00 Q ss_pred hhhhhhh---------hhh--hheecc--ccc------chhHHHHHHHHHhh-hCceEEEEecCCCcchhHHHHHHHhhh Q lcl|Aclame:pro 119 AALTECT---------ERP--TLIGAP--GFS------QNKAVIDALASMAK-RLKCRAVIDGPSGSTQDAIDLSGLLGG 178 (388) Q Consensus 119 ~a~~~~~---------~~p--~ll~ap--~~~------~~~~v~~~l~~~~~-~~~~~~i~d~p~~~~~~~~~~~~~~~~ 178 (388) ..+.... ..+ ...... +.. .......+|..+-. ..+.+++.-.. +..........|..+ T Consensus 155 ~el~~nd~V~a~~~~~g~~~~~~~~~l~~~~~gg~~~~~~~~~~~~l~~~e~~~~n~l~~~~~~-~~~~i~~~~~a~ik~ 233 (451) T protein:vir:10 155 DKFKGNDYITAKVVEEGSSKPVAFTNVSGTLTGGTTTESNKVESLLNDALENEEYAVVTTAGFE-PSSNMNKLVVEAVKR 233 (451) T ss_pred hhccCCceEEEEecccccccceeeeecccccccccccCCccchHHHHHHhccceeeEEEEccCC-CchHHHHHHHHHHHH Confidence 0000000 000 000000 000 00111223322211 22222221111 111111111111111 Q ss_pred --ccc----------------ccceEEEEecceecccccccceeehhhHHHHHHHHhcccccccccccccc-ceeecccc Q lcl|Aclame:pro 179 --EGT----------------GHDRVYMVDPMPAIYSRKAQGNIYVPPSTIAMGAVAAVKPWESPGNQGVL-IQDVARVI 239 (388) Q Consensus 179 --~~~----------------~s~~~~~~~p~~~~~~~~~~~~~~~p~s~~~aG~~a~~d~~~s~~n~p~~-~~g~~~~~ 239 (388) .+- ++..++.+-+.+...|.. .....-..+.+||++|.....+|+-|+++. +..+ T Consensus 234 ~r~~~g~~~~aVl~~~~~~~~d~egiinv~n~~~~~dg~--~~~~~~~~~~vAG~~Ag~~~~~S~T~~~~~~~~~v---- 307 (451) T protein:vir:10 234 LRENEGRKVRGVIPTDADTTYNYEGISTVVNGYTLSDGT--NVDVKDATGYFAGISASADVATSLTYFEVEDAVSA---- 307 (451) T ss_pred HHHhcCCeEEEEecCccCCCCCCcceEEeecceEecCce--eechhhhHHHHHHHHcccccccCccceecCCceee---- Confidence 111 122222222222211110 011123358899999999999999998873 3322 Q ss_pred cccccCchhhhhhccccceEEEEEeCCCcEE-EEccccC---------CCceeeehhhHHHHHHHHHHHHHH-Hhcc-cC Q lcl|Aclame:pro 240 DYNILDKSTEGDLLNRNGVSYFARTSMGGFS-LIGNRTV---------TGKFISFVGLEDAIARKLEAASQR-AMSK-QL 307 (388) Q Consensus 240 ~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~-~wG~rT~---------~~~~i~vrR~~~~i~~~i~~~~~~-~vfe-pn 307 (388) ...++.+|.+.+.++|..++....+++++ .+|-.|+ .|+.|.++|++|+|.+.+++.+.. |+++ || T Consensus 308 --~~~~t~~e~~~~i~~G~lvl~~~~g~~v~i~~~INTltt~~~~k~~~~~ki~vir~~D~i~~di~~~~~~~yiGk~~N 385 (451) T protein:vir:10 308 --YPKFDNEKTIKALDAGQIVFTTRPGQRVVIEQDINSLHKFTAEKPQAFSKNRVIRTLDEIATNTENTFERTYLGNVGN 385 (451) T ss_pred --eeeCCHHHHHHHHhCCeEEEEEEcCCeEEEEEccccceecCCCCCcchhhhhHHHHHHHHHHHHHHHhhhccceecCC Confidence 23567899999999999887755676665 4887776 299999999999999999999875 8886 79 Q ss_pred CHHHHHHHHHHHHHHHHHHHhcCCeeeeeEE-EeccCCCHHHhhCCeEEEEEEEEecCcceeEEEEEEEc Q lcl|Aclame:pro 308 TKSFMEQEIKKINLFMQDLVAAEIIPGGEVY-LHPTLNTVERYKNGSWYIVIDYGRYSPNEHMIFHLNAV 376 (388) Q Consensus 308 ~~~~~~~i~~~i~~~L~~l~~~Gal~g~~v~-~d~~~Nt~~~i~~G~~~~~v~~~p~~pae~I~~~~~~~ 376 (388) +..-|..++..++.||.+|+++|+|.+|... .+-+.+ -....+++++.+.|+..+|+|.+.+++. T Consensus 386 ~~~gr~~~~~~i~~yl~~l~~~g~i~~~~~~d~~v~~~----~~~~~v~v~~~v~pvdame~iy~t~~v~ 451 (451) T protein:vir:10 386 NAAGRDLFKADRIAYLTSLQNRNMIQSFANTDITVEAG----NDMDSIVVNLAVTPVDAMEKLYMTMVVR 451 (451) T ss_pred CHHHHHHHHHHHHHHHHHHHhCCCccCCCccceEEeec----CCCCEEEEEEEEEEEeeeeeEEEEEEEc Confidence 9999999999999999999999999987622 111111 1357899999999999999999999888 No 47 >protein:vir:101326 Length: 529 # NCBI annotation: gp22 # Family: family:all:9453 # MgeID: mge:1512 # MgeName: P1 # Cross-refs: genbank:acc:YP_006525;genbank:gi:46401679;genbank:GeneID:2777386 Probab=99.82 E-value=1.5e-21 Score=134.76 Aligned_cols=363 Identities=11% Similarity=-0.020 Sum_probs=190.1 Q ss_pred CCC-CCCcCCCeEEEEcCCCcccccccCccee----EEEeecc--ccc---cccccCcceeeccchhh---hhhcc---- Q lcl|Aclame:pro 1 MPV-IDQFEHNGISIETHEPPPPMGPPGDNVV----AWVVTAP--DKH---ADVAFSVPFRVANTADA---QYLDS---- 63 (388) Q Consensus 1 M~~-~t~~~hGV~~~e~~~~~~~i~~v~tav~----g~vgta~--~~~---~~~~~~~~v~v~s~~~~---~~~~~---- 63 (388) |.. -+.|..|..++ .+.|..-.--++.+-+ +.--+.. ++| .+.+...-....+.-.. +.+.. T Consensus 112 ~~~~~s~~~~s~~~~-l~~G~~~~iy~~Dgd~~~s~~~~l~i~~~~ads~g~e~~~l~~~~~~~~g~~~~let~~~sl~~ 190 (529) T protein:vir:10 112 GEPAYSALPYGSEIE-LDSGEAFAIYVDDGDPCISPTRELTIETATADSAGNERFLLKLTQTTSLGVVTTLETHTVSLAE 190 (529) T ss_pred ccchhhccccccccc-ccccceEEEEEecCcCccCCceEEEEEeeccccCCCccceeeEEEEeecCCceEEEEEEeeeee Confidence 111 11111122222 1111111100111110 1111111 011 11110000000000000 00000 Q ss_pred ----cccccccchhhhhhhhccccceEEEEecccccc-cccccccccccccchh------hhhhhHhhhhhhhhhhhhee Q lcl|Aclame:pro 64 ----TGNELGTGWHAASETLKKTSVPQYFIVVPEGAD-DAATMANIIGGIDPTT------GRRTGIAALTECTERPTLIG 132 (388) Q Consensus 64 ----~~~~~gtl~~a~~~~~~~~~~~~~vv~~~~~~~-~~~~~~~~~~~~~~~t------g~~tgl~a~~~~~~~p~ll~ 132 (388) ..+....+..++......- ...+.+......+ .......+.+++|... ....++.++......-..++ T Consensus 191 ~a~dd~G~~~yl~svle~~s~~l-~ai~~~e~~~t~~~~t~~d~~f~~GtdG~~~~i~s~~y~~A~~~L~n~p~d~~~il 269 (529) T protein:vir:10 191 EAKDDMGRLCYLPTALEARSKYL-RAVVNEELISTAKVTNKKSLAFTGGTNGDQSKISTAAYLRAVKVLNNAPYMYTAVL 269 (529) T ss_pred chhhhcCCccchhHHHhhccCce-eeeeeeccccccchhhhhhhhccCCccccccccchHHHHHHHHHhcCCcceeeeee Confidence 0111112222211100000 0011111111100 0011123445555432 12334444443333333444 Q ss_pred cccccchhHHHHHHHHHhhhCceEEEEecCCCcchhHHHHHHHhhhccc-cc---ceEEEEecceecccccccceeehhh Q lcl|Aclame:pro 133 APGFSQNKAVIDALASMAKRLKCRAVIDGPSGSTQDAIDLSGLLGGEGT-GH---DRVYMVDPMPAIYSRKAQGNIYVPP 208 (388) Q Consensus 133 ap~~~~~~~v~~~l~~~~~~~~~~~i~d~p~~~~~~~~~~~~~~~~~~~-~s---~~~~~~~p~~~~~~~~~~~~~~~p~ 208 (388) .-| +.++++..+|..+|++.+..++.|.|...+..+ +..+....++ ++ .....+|||. ..|+.++.....++ T Consensus 270 ~~g-~y~~a~I~~L~~ic~~~~~d~f~DV~~~LT~~a--A~~~~e~~gl~~~~~~~~s~y~~P~~-~~D~~tg~k~~~Gl 345 (529) T protein:vir:10 270 GLG-CYDNAAITALGKICADRLIDGFFDVKPTLTYAE--ALPAVEDTGLLGTDYVSCSVYHYPFS-CKDKWTQSRVVFGL 345 (529) T ss_pred ccC-CccHHHHHHHHHHHhhhhhcEEEcCCCCcCHHH--HHHHHHhcCccccCceeeEEEEccee-eccccccCceeeCC Confidence 433 568889999999999888888889998877543 3345555555 22 2356778887 77888888889999 Q ss_pred HHH--HHHHHh--ccccccccccccccce-e-ecc-cccccccCchhhhhhccccceEEEEEeCCC----cEEEEccccC Q lcl|Aclame:pro 209 STI--AMGAVA--AVKPWESPGNQGVLIQ-D-VAR-VIDYNILDKSTEGDLLNRNGVSYFARTSMG----GFSLIGNRTV 277 (388) Q Consensus 209 s~~--~aG~~a--~~d~~~s~~n~p~~~~-g-~~~-~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~----G~~~wG~rT~ 277 (388) ||. +|...+ +.-+-.|.+..|+|.. + +.+ .+..-...++.|...|-.+.||++..--++ +-.+||+|+- T Consensus 346 sG~A~~akargv~~na~v~g~hY~pAGe~r~~inr~~I~~ly~~d~~e~~~lv~~riNPV~~~~~g~~~idDsLt~~~kn 425 (529) T protein:vir:10 346 SGVAYAAKARGVKKNSDVGGWHYSPAGEERAVIARASIQPLYPEDTPDEEAMVKGRLNKVSVGTSGQMIIDDALTCCTQD 425 (529) T ss_pred CcceeeccccceeecccccccccccCCCccceeecccceeccCCCccCHHHHHhhccCeeeeeccCcceeeeeeceeeeC Confidence 995 332222 1122334455666642 2 111 122223334444444555666665543333 4457788875 Q ss_pred C-CceeeehhhHHHHHHHHHHHHHHHhcccCCHHHHHHHHHHHHHHHHHHHhcCCeee-----------eeEEEeccCCC Q lcl|Aclame:pro 278 T-GKFISFVGLEDAIARKLEAASQRAMSKQLTKSFMEQEIKKINLFMQDLVAAEIIPG-----------GEVYLHPTLNT 345 (388) Q Consensus 278 ~-~~~i~vrR~~~~i~~~i~~~~~~~vfepn~~~~~~~i~~~i~~~L~~l~~~Gal~g-----------~~v~~d~~~Nt 345 (388) . |||+|+++|+++|.+.+.+..++.+|||++..+|. +++-+..+|..+|+.|+|.+ |.+.+ + T Consensus 426 ny~R~~hv~~lmn~I~~~~~k~a~~~~~~Pd~it~~g-l~~~l~~~L~r~~asgalv~prdp~~~G~epy~~~V-----~ 499 (529) T protein:vir:10 426 NYLHFQHVPSLMNAISRFFVQLARQMKHSPDGITAAG-LTKGMTKLLDRFVASGALVAPRDPDADGTEPYVLKV-----T 499 (529) T ss_pred CchhhhhHHHHHHHHHHHHHHHHHHHhhCCChHHHHH-HHHhHHHHHHHHHhcCceecccCccCCCCCceEEEE-----e Confidence 5 89999999999999999999999999999999987 99999999999999999986 33333 2 Q ss_pred HHHhhCCeEEEEEEEEecCcceeEEEEEEEcc Q lcl|Aclame:pro 346 VERYKNGSWYIVIDYGRYSPNEHMIFHLNAVD 377 (388) Q Consensus 346 ~~~i~~G~~~~~v~~~p~~pae~I~~~~~~~~ 377 (388) + .+.+++.+++.++|.-.+..|...-..-. T Consensus 500 q--~d~D~~~v~~~~~ptGv~Rri~~~p~l~~ 529 (529) T protein:vir:10 500 Q--AEFDKWEVVWACCPTGVARRIQGVPLLIK 529 (529) T ss_pred e--cccCeEEEEEEeecCCceeeEEeeeeecC Confidence 3 34599999999999999999887665555 No 48 >protein:vir:78986 Length: 436 # NCBI annotation: putative sheath tail protein # Family: family:all:632 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110731;genbank:gi:134287348;genbank:GeneID:4955160 Probab=99.64 E-value=2.6e-16 Score=106.03 Aligned_cols=356 Identities=11% Similarity=0.018 Sum_probs=193.2 Q ss_pred CCC-----CCCcCCCeEEEEcCCCcccccccCcceeEEEeeccccccccccCcceeeccchhhhhhcccccccccchh-- Q lcl|Aclame:pro 1 MPV-----IDQFEHNGISIETHEPPPPMGPPGDNVVAWVVTAPDKHADVAFSVPFRVANTADAQYLDSTGNELGTGWH-- 73 (388) Q Consensus 1 M~~-----~t~~~hGV~~~e~~~~~~~i~~v~tav~g~vgta~~~~~~~~~~~~v~v~s~~~~~~~~~~~~~~gtl~~-- 73 (388) |+- ...-.+|+|+...+.+...+.....++..+...++=+ +.++.+.|++..........++...+... T Consensus 3 magg~~~~~~K~~PG~Y~n~~~~~~~~~~~~~rGi~a~p~~~~wG----p~~~v~~i~~~~~~~~~~~~~G~~~~~~~~~ 78 (436) T protein:vir:78 3 LGGGTFVTQNKVLPGSYINFVSATRATSSLSDRGIVAMPLELDWG----IDEEVFQVTSDDFEKYSTKYFGYDYTHEKLK 78 (436) T ss_pred ccceeeccceeecCceEEEEEecCcceeeccCCeEEEEEEEecCC----CCceeEEeecccchHHHHHHhcCccchHHHH Confidence 322 2344679999999888888888888888888777555 55666777664322222222222233221 Q ss_pred hhhhhhccccceEEEEeccccccccccc--ccccc------------cccchhh-----------h--hh--hHhhhhhh Q lcl|Aclame:pro 74 AASETLKKTSVPQYFIVVPEGADDAATM--ANIIG------------GIDPTTG-----------R--RT--GIAALTEC 124 (388) Q Consensus 74 a~~~~~~~~~~~~~vv~~~~~~~~~~~~--~~~~~------------~~~~~tg-----------~--~t--gl~a~~~~ 124 (388) .++..+. +....+.+++.++.....+. +.+.| ..+..+. . .+ -+..+... T Consensus 79 ~l~~~~~-~~~tv~~yrl~~G~~a~~~v~~Aky~g~~gn~i~v~v~~~~~d~~~~dv~~~~g~~~~d~~~~~~~~~l~~n 157 (436) T protein:vir:78 79 GLRDLFK-NIRLGYFYKLNKGVKASCSIATARCSGIRGNDLKVIVTTNIDDNAKFDVVTLLDNKKVDTQIAKVITELQDN 157 (436) T ss_pred HHHHHhc-CCCEEEEEECCCcceeeeeeeeeecCCCCCcEEEEEecccccccCceEEEEEecchhhhhhhHHHHhhccCC Confidence 1222222 22233344443322111111 00000 0000000 0 00 00000000 Q ss_pred h-----------hhhhheeccccc----chhHHHHHHHHHhh-hCceEEEEecCCCcchhHHHHHHHhh-hc-ccccceE Q lcl|Aclame:pro 125 T-----------ERPTLIGAPGFS----QNKAVIDALASMAK-RLKCRAVIDGPSGSTQDAIDLSGLLG-GE-GTGHDRV 186 (388) Q Consensus 125 ~-----------~~p~ll~ap~~~----~~~~v~~~l~~~~~-~~~~~~i~d~p~~~~~~~~~~~~~~~-~~-~~~s~~~ 186 (388) . .....-+..|-. .......+|..+.. +++.+++... ..........|.. .. .-+-+.. T Consensus 158 ~~V~~~~~g~la~~a~~~LtGG~dG~~~T~~dy~~al~~le~~~fn~l~~~~~---d~~~~~~~~a~ikr~re~~g~~~~ 234 (436) T protein:vir:78 158 DYVTWKKEATLEATAGLTFTNGTNGEAVTGTEYQAFLDKIESYSFNALGCLAT---TAEIKSLFVEFTKRMRDKVGAKFQ 234 (436) T ss_pred ceEEEEecccccccceeeeeccccccccchHHHHHHHHHHcccceeEEEecCC---ChHHHHHHHHHHHHHHhhcCCeEE Confidence 0 000001111110 11233444444322 2232222221 1111111111111 11 1111122 Q ss_pred EEEecce--------ecccccccc-eeehhhHHHHHHHHhccccccccccccccceeecccccccccCchhhhhhccccc Q lcl|Aclame:pro 187 YMVDPMP--------AIYSRKAQG-NIYVPPSTIAMGAVAAVKPWESPGNQGVLIQDVARVIDYNILDKSTEGDLLNRNG 257 (388) Q Consensus 187 ~~~~p~~--------~~~~~~~~~-~~~~p~s~~~aG~~a~~d~~~s~~n~p~~~~g~~~~~~~~~~~~~~~~~~Ln~~g 257 (388) ++.++.- .+.+...+. ....-.++.+||+.|..+..+|+-++++.. + .+....++.+|.+.+.++| T Consensus 235 aV~~~~~~~d~EgIInv~n~v~g~~~~~~~~~a~vAG~~Ag~~~~~S~T~~~~~~--~---~~v~~~~t~~e~~~ai~~G 309 (436) T protein:vir:78 235 TVLYKKNDADYEGVVSVENKIKDTGLLESSLIYWTTGAIAGCDINKSNTNKRYDG--E---FDVDVNYTQIHLEEALKTG 309 (436) T ss_pred EEecCCCCCCCceEEEeecccCCceechhHHHHHHHHHHhcCccccCccceecCc--c---ccccccCCHHHHHHHHhCC Confidence 2222110 011111111 112346788999999999999999988742 1 1222346788999999999 Q ss_pred eEEEEEeCCCcEEEE-ccccC---------CCceeeehhhHHHHHHHHHHHHH-HHhcc-cCCHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 258 VSYFARTSMGGFSLI-GNRTV---------TGKFISFVGLEDAIARKLEAASQ-RAMSK-QLTKSFMEQEIKKINLFMQD 325 (388) Q Consensus 258 In~i~~~~~~G~~~w-G~rT~---------~~~~i~vrR~~~~i~~~i~~~~~-~~vfe-pn~~~~~~~i~~~i~~~L~~ 325 (388) .-++.+. +++.++- |-.|+ .|+.|.++|++|+|.+.+++.+. .|+++ ||+..-|..++..++.||.+ T Consensus 310 ~lvl~~d-~~~v~I~~~VNTltt~~~~k~~~~~kI~vir~~D~i~~di~~~~~~~yiGKv~N~~dgr~~l~~~i~~yl~~ 388 (436) T protein:vir:78 310 KFIFHKV-GDEVHVLEDINTFVSFTDEKNDDFSSNQSVRVLDQIANDIATLFNTKYLGEVPNDKSGRISFWNDVVKHHEQ 388 (436) T ss_pred eEEEEEe-CCeEEEEEccccceecCCCCCcchhhhhHHHHHHHHHHHHHHHhhhccccccCCCHHHHHHHHHHHHHHHHH Confidence 8887654 5554442 33343 38999999999999999998875 59996 79999999999999999999 Q ss_pred HHhcCCeeeee---EEEeccCCCHHHhhCCeEEEEEEEEecCcceeEEEEEEEc Q lcl|Aclame:pro 326 LVAAEIIPGGE---VYLHPTLNTVERYKNGSWYIVIDYGRYSPNEHMIFHLNAV 376 (388) Q Consensus 326 l~~~Gal~g~~---v~~d~~~Nt~~~i~~G~~~~~v~~~p~~pae~I~~~~~~~ 376 (388) |.++|+|..|. +..++. + ....+++++.+.|+..+|.|.+.++.. T Consensus 389 L~~~g~I~~f~~~Dv~v~~~-~-----~~~~v~v~~~v~pvdamekiy~ti~v~ 436 (436) T protein:vir:78 389 LQNMRAIEDFKADDVSVEPG-S-----DKKTVVVSDAVKVISAMSKLYMTVSVS 436 (436) T ss_pred HHhCCcccCCCCcceEEeec-C-----CCCEEEEEEEEEEEEeeeeEEEEEEEC Confidence 99999999876 444322 2 357789999999999999999999988 No 49 >protein:vir:102359 Length: 356 # NCBI annotation: XkdK protein # Family: family:all:632 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529565;genbank:gi:90592650;genbank:GeneID:3974491 Probab=99.25 E-value=3.3e-12 Score=83.54 Aligned_cols=320 Identities=11% Similarity=0.045 Sum_probs=187.2 Q ss_pred CCCCCCcCCCeEEEEcCCCcccccccCcceeEEEeeccccccccccCcceeeccchhhhhhcccccccccchhhhhhhhc Q lcl|Aclame:pro 1 MPVIDQFEHNGISIETHEPPPPMGPPGDNVVAWVVTAPDKHADVAFSVPFRVANTADAQYLDSTGNELGTGWHAASETLK 80 (388) Q Consensus 1 M~~~t~~~hGV~~~e~~~~~~~i~~v~tav~g~vgta~~~~~~~~~~~~v~v~s~~~~~~~~~~~~~~gtl~~a~~~~~~ 80 (388) |+=| |+++++-..-++..+....-++..++-..... . ...+++..+... .. .......+...+. T Consensus 1 ~~gl----p~i~i~f~~~a~ta~~~g~rGiv~~il~d~~~----~---~~~~~~~~~v~~---~~--~~~n~~~i~~~~~ 64 (356) T protein:vir:10 1 MAGL----VNINIEFKELATSFIQRSKAGIVAIILKDTTK----M---YKELTSEDDIPI---SL--SADNKKYIKYGFV 64 (356) T ss_pred CCCC----CceeEEEeecceeeccCCccceEEEEEecCCc----c---eeEEeccccchh---HH--HHHHHHHHHHHhh Confidence 8874 78888888888887777766666555432111 0 111111111100 00 0111122222222 Q ss_pred cccceEEEEecccccccccccccccccccchhhhhhhHhhhhhhhhhhhheecccccchhHHHHHHHHHhhhCc-----e Q lcl|Aclame:pro 81 KTSVPQYFIVVPEGADDAATMANIIGGIDPTTGRRTGIAALTECTERPTLIGAPGFSQNKAVIDALASMAKRLK-----C 155 (388) Q Consensus 81 ~~~~~~~vv~~~~~~~~~~~~~~~~~~~~~~tg~~tgl~a~~~~~~~p~ll~ap~~~~~~~v~~~l~~~~~~~~-----~ 155 (388) .+........... .+.+...........|.+++. .....+++|+ .++++.+.+.+...|+| . T Consensus 65 g~~~~~~~~~p~~---------~~~~~~~t~~~y~~aL~~le~--~~fn~l~~~~--~d~~~~~~~~a~ikr~r~~~~~~ 131 (356) T protein:vir:10 65 GATDNEKVLRPSK---------VIISTFTEDGKVEDILEELES--VEFNYLCMPE--AIEAEKTKIVTWIKKIREEESTE 131 (356) T ss_pred cccccccccccee---------eeeecccCchhHHHHHHHhcC--ccceEEEecC--CChHHHHHHHHHHHHHHhcCCcE Confidence 2211111110000 000001111112234444432 2455688886 35667777777776553 1 Q ss_pred EEEEecCCCcchhHHHHHHHhhhcccccceEEEEecceecccccccceeehhhHHHHHHHHhccccccccccccccceee Q lcl|Aclame:pro 156 RAVIDGPSGSTQDAIDLSGLLGGEGTGHDRVYMVDPMPAIYSRKAQGNIYVPPSTIAMGAVAAVKPWESPGNQGVLIQDV 235 (388) Q Consensus 156 ~~i~d~p~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~s~~~aG~~a~~d~~~s~~n~p~~~~g~ 235 (388) +..+-... ..+++.++-+-..+.. +.. .......++.+||+.|.....+|+-+++..- + T Consensus 132 ~~~V~~~~----------------~aD~EgIInv~n~~~~-~g~--~~t~~~~~~~vAG~~Ag~~~n~S~T~~~~~~--~ 190 (356) T protein:vir:10 132 AKAVLANI----------------KADNEAIINFTENVVV-DGE--EITAEKYTTRVASLIASTPNTQSITYAPLDE--V 190 (356) T ss_pred EEEEecCC----------------CCCCceeEEeecCeEe-cce--eechhHHHHHHHHHHhccchhccccceecCC--c Confidence 22221111 1123333333222221 111 1122455779999999999999999998742 2 Q ss_pred cccccccccCchhhhhhccccceEEEEEeCCCcEEE-EccccC---------CCceeeehhhHHHHHHHHHHHHH-HHhc Q lcl|Aclame:pro 236 ARVIDYNILDKSTEGDLLNRNGVSYFARTSMGGFSL-IGNRTV---------TGKFISFVGLEDAIARKLEAASQ-RAMS 304 (388) Q Consensus 236 ~~~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~-wG~rT~---------~~~~i~vrR~~~~i~~~i~~~~~-~~vf 304 (388) .. + ..++.+|.+..-.+|--++.+. ++..++ -|-.|+ .|+.|.+.|++|.|.+.+++.+. .|++ T Consensus 191 ~~-~---~~~t~~e~~~ai~~G~lvl~~d-~~~V~I~~~VNSltt~t~~k~~~f~Kirvvr~~D~i~~Di~~~f~~~yiG 265 (356) T protein:vir:10 191 ES-I---VKIDKASADAKVQAGELILRRL-SGKIRIARGINSLTTLTAEKGEIFQKIKLVDTKDLISKDIKNIYVEKYLR 265 (356) T ss_pred cc-c---ccCCHHHHHHHHhCCeEEEEEE-cCeEEEEecCccceecCCCCCcchhhhHHHHHHHHHHHHHHHHHhhcccc Confidence 11 1 1346778888888888877664 333332 244443 38999999999999999999886 6999 Q ss_pred c-cCCHHHHHHHHHHHHHHHHHHHhcCCee-eeeEEEeccC--------------CCHHHhh----CCeEEEEEEEEecC Q lcl|Aclame:pro 305 K-QLTKSFMEQEIKKINLFMQDLVAAEIIP-GGEVYLHPTL--------------NTVERYK----NGSWYIVIDYGRYS 364 (388) Q Consensus 305 e-pn~~~~~~~i~~~i~~~L~~l~~~Gal~-g~~v~~d~~~--------------Nt~~~i~----~G~~~~~v~~~p~~ 364 (388) + ||+..-+..+...++.||.+|.++|+|. ++.+..|.+. ++...+. .-.+.++..+.|+- T Consensus 266 Kv~N~~dgr~~l~~ai~~y~~~L~~~~~I~~~~~~eid~e~q~~~~~~~g~d~~~~~d~~v~~~~~~~~v~~~~~v~~vd 345 (356) T protein:vir:10 266 KCPNTYDNKCLFIVAVQSYLTELAKQELIDSNFTVEIDLEKQKEYLEGKKIAVSKMKENEIKEANTGSNGFYLINLKLVD 345 (356) T ss_pred ccCCCHHHHHHHHHHHHHHHHHHHhCCccccCceeEecccchHHHhhhccccccccccceeecccCCcEEEEEEEEEEEe Confidence 8 6999999999999999999999999996 6777777643 2222222 24578999999999 Q ss_pred cceeEEEEEEE Q lcl|Aclame:pro 365 PNEHMIFHLNA 375 (388) Q Consensus 365 pae~I~~~~~~ 375 (388) .+|.|.+.+.. T Consensus 346 amE~iy~ti~v 356 (356) T protein:vir:10 346 AMEDINIRVQM 356 (356) T ss_pred eeeeEEeEEeC Confidence 99999999988 No 50 >protein:vir:95263 Length: 450 # NCBI annotation: Phage conserved protein # Family: family:all:396 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944896;genbank:gi:38707836;genbank:GeneID:2744049 Probab=98.67 E-value=1.2e-07 Score=58.51 Aligned_cols=355 Identities=10% Similarity=0.016 Sum_probs=189.4 Q ss_pred CcCCCeEEEEcCCCcccccccCcceeEEEeeccccccccccCccee-eccchhhhhhcccccccccchhhhhhhhccccc Q lcl|Aclame:pro 6 QFEHNGISIETHEPPPPMGPPGDNVVAWVVTAPDKHADVAFSVPFR-VANTADAQYLDSTGNELGTGWHAASETLKKTSV 84 (388) Q Consensus 6 ~~~hGV~~~e~~~~~~~i~~v~tav~g~vgta~~~~~~~~~~~~v~-v~s~~~~~~~~~~~~~~gtl~~a~~~~~~~~~~ 84 (388) .|- -+.=+.++-.+.++...+-..+.|++.....+ +.++ .++..+....|+ .....+++...+|.+.-. T Consensus 1 ~~s-~iVnV~i~~~~~a~~~~~f~~~l~~~~~~~~~------~r~~~yss~~~V~~~FG---~~S~ey~aA~~yF~q~p~ 70 (450) T protein:vir:95 1 MWN-PIVNVDITLNTAGTTREGFGLPLFLASTDNFE------ERVRGYTSLTEVAEDFD---ENTAAYKAAKQLWSQTPK 70 (450) T ss_pred CCC-ceEEEeecccccccccccceeEEEEcCCCCCc------cceeeecCHHHHHHhcC---CCcHHHHHHHHHHhCCCc Confidence 122 23344444566666667767777777543221 2232 334444444433 334444555555544222 Q ss_pred eEEE-E-ec------------------------cccc--ccccccccccccccchhhhhhhHhhh--------------- Q lcl|Aclame:pro 85 PQYF-I-VV------------------------PEGA--DDAATMANIIGGIDPTTGRRTGIAAL--------------- 121 (388) Q Consensus 85 ~~~v-v-~~------------------------~~~~--~~~~~~~~~~~~~~~~tg~~tgl~a~--------------- 121 (388) +..+ + |. .... ....+........+..+...+.+... T Consensus 71 p~~l~igr~~~~~t~~~~~~~~~~~~g~lt~tv~G~~~~~~~i~~s~a~s~~~va~~~~tai~~~~~~~~~~~~~s~g~~ 150 (450) T protein:vir:95 71 VTQLYIGRRAMQYTVSIPDAVTESTDYSITVAAGGGISQPYQYTAQSSDTAENVLQQFKTQIEADPTIKDKVSVNVTGSN 150 (450) T ss_pred ccEEEEEeeccchhhhhhhhhccccceeEEEEecceeeeeeEEEEEecCChhhHHHHhhhhhcccceeeeeeeeeeeccc Confidence 2111 1 00 0000 00000000000000000011111100 Q ss_pred ---------------hhhhhhhhheecccccchhHHHHHHHHHhhhCceEEEEecCCCcchhHHHHHHHhhhcccccceE Q lcl|Aclame:pro 122 ---------------TECTERPTLIGAPGFSQNKAVIDALASMAKRLKCRAVIDGPSGSTQDAIDLSGLLGGEGTGHDRV 186 (388) Q Consensus 122 ---------------~~~~~~p~ll~ap~~~~~~~v~~~l~~~~~~~~~~~i~d~p~~~~~~~~~~~~~~~~~~~~s~~~ 186 (388) ......+..+...|. ....+..+|.++.+.-.-...+-.+..++++......|.... .+. T Consensus 151 ~~~t~~~~~~~~~~~~~l~~~~~~~~~~g~-~aet~~~a~~a~~~~~~~w~~~~~~~~~~~~i~a~a~w~~a~----~~~ 225 (450) T protein:vir:95 151 GSATMIIAKAGDNDFVKVTTTAQTVYIAST-TADTASTALAAIEAYSTDWYFIAAEDRTQQFVLAMASEIQAR----KKI 225 (450) T ss_pred ceeeeeeeccccchhhccccccceeEeccc-ccccHHHHHHHHHHhhCCeEEEEecCCCHHHHHHHHHHHhhc----CcE Confidence 000011122222222 123355666666543322222223333444555555555432 223 Q ss_pred EEEecce-ecccc-----c---------c--cce--------eehhhHHHHHHHHhccccccccc-cccccceeecccc- Q lcl|Aclame:pro 187 YMVDPMP-AIYSR-----K---------A--QGN--------IYVPPSTIAMGAVAAVKPWESPG-NQGVLIQDVARVI- 239 (388) Q Consensus 187 ~~~~p~~-~~~~~-----~---------~--~~~--------~~~p~s~~~aG~~a~~d~~~s~~-n~p~~~~g~~~~~- 239 (388) ..+.+|- ...+. . . ... ....+.++++|.....++-+=.| +|.. .|+...+ T Consensus 226 f~~~~~~~~~~~~~~~~~~~~i~~~l~~~~~~~t~~~y~~~~~~~~~~aa~~g~~~~~~~g~~T~~fk~l--~Gv~~~v~ 303 (450) T protein:vir:95 226 FFTANSDVTALQGTELASANDVPAQLAKNMYTRTVCLWHHAAAEDYPEMAYIAYGAPYDAGSIAWGNAQL--TGVAASLQ 303 (450) T ss_pred EEEEcCCchhhhhhhhhcccchHHHHHhccCCeeEEEeeCCCchhHHHHHHHHHhhhcccceeeeccccc--cceeeecc Confidence 3332221 11000 0 0 000 11235566666655544322112 3333 2333221 Q ss_pred -cccccCchhhhhhccccceEEEEEeCCCcEEEEccccCCCceeeehhhHHHHHHHHHHHHHHHhc-----c-cCCHHHH Q lcl|Aclame:pro 240 -DYNILDKSTEGDLLNRNGVSYFARTSMGGFSLIGNRTVTGKFISFVGLEDAIARKLEAASQRAMS-----K-QLTKSFM 312 (388) Q Consensus 240 -~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~~~~i~vrR~~~~i~~~i~~~~~~~vf-----e-pn~~~~~ 312 (388) .....++.+|++.|..+++|++.++.+.++ ++...|+...||-++|-.+|++..|+..+...+- + |-+..-. T Consensus 304 ~~~~~~lt~~~~~al~~~~~n~y~~~~~~~~-~~~G~~~~G~~iD~~~~~~wl~~~iq~~l~~ll~~~~~~KiPy~~~G~ 382 (450) T protein:vir:95 304 PSNQRPLTSIQKSALDVRHCNFIDLDGGVPV-VRRGITSGGEWIDIIRGVDWLESDLKTSLRDLLINQKGGKITYDDTGI 382 (450) T ss_pred CccccccchHHHHHHHhCCcEEEEEecCcee-eeCCeeeCcchhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCccChhhH Confidence 122456889999999999999999877664 7788899888999999999999999999988762 2 7777888 Q ss_pred HHHHHHHHHHHHHHHhcCCeeeeeEEEec-cCCCHHHhhCCeEE-EEEEEEecCcceeEEEEEEEcch Q lcl|Aclame:pro 313 EQEIKKINLFMQDLVAAEIIPGGEVYLHP-TLNTVERYKNGSWY-IVIDYGRYSPNEHMIFHLNAVDR 378 (388) Q Consensus 313 ~~i~~~i~~~L~~l~~~Gal~g~~v~~d~-~~Nt~~~i~~G~~~-~~v~~~p~~pae~I~~~~~~~~~ 378 (388) ..|+..|+.-|++..++|.|.||+|...+ +..+++|+.+.++. +++.+.....++++.++....=| T Consensus 383 ~~i~a~i~~~l~~a~~~G~Ia~~~V~~~~~~~~~~~dr~~R~~~~i~~~~~laGAIh~~~i~~~v~~~ 450 (450) T protein:vir:95 383 TRIRQVIETSLQRAVNRNFLSSYTVNVPKASQVALADKKARILKDVTFAGILAGAILDVDLKGTVAYE 450 (450) T ss_pred HHHHHHHHHHHHHHHhcCcccceeEecCChHhcCHHHHhccCCCCeeEEEEEccceEEEEEEEEEEeC Confidence 88999999999999999999999999865 77889999988865 78888888999988877766666 No 51 >protein:vir:3165 Length: 426 # NCBI annotation: capsid protein CP67 # Family: family:all:28419 # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665936;genbank:gi:22091122;genbank:GeneID:951267 Probab=98.62 E-value=8.9e-08 Score=59.24 Aligned_cols=357 Identities=15% Similarity=0.040 Sum_probs=178.8 Q ss_pred CCCCCCcCCCeEEEEcCCCcccccccCcceeEEEeeccccccccccCcceeeccchhhhhhcccccccccchhhhhhhhc Q lcl|Aclame:pro 1 MPVIDQFEHNGISIETHEPPPPMGPPGDNVVAWVVTAPDKHADVAFSVPFRVANTADAQYLDSTGNELGTGWHAASETLK 80 (388) Q Consensus 1 M~~~t~~~hGV~~~e~~~~~~~i~~v~tav~g~vgta~~~~~~~~~~~~v~v~s~~~~~~~~~~~~~~gtl~~a~~~~~~ 80 (388) ||. -+.=+.++-.+.++..-.-+.+.|+|+....++...++...+.++...... .++..+..+++...+|. T Consensus 1 m~~------~iVnV~Is~~t~A~~~~~Fg~~liigs~~~~~p~~~f~~~~~Yss~~~V~~---Dfg~~s~~Y~AA~~~f~ 71 (426) T protein:vir:31 1 MPK------QIVEIELTAEIADRPQETFTDAAIVGTAEEEPPDAEFGEVNQYSTSTSVGD---DYGEDSDVYTASEAIEE 71 (426) T ss_pred CCc------ceEEEEeecccccccccccceeeeeeeccccccccccchhhhhhhHHHHHh---cCCCChHHHHHHHHHHh Confidence 883 355556667778888888899999998876665444555555555544443 44566777788888887 Q ss_pred cccceEEEEec------cccccccccccc--cc---ccccchhhhhhhHhh----------------------------- Q lcl|Aclame:pro 81 KTSVPQYFIVV------PEGADDAATMAN--II---GGIDPTTGRRTGIAA----------------------------- 120 (388) Q Consensus 81 ~~~~~~~vv~~------~~~~~~~~~~~~--~~---~~~~~~tg~~tgl~a----------------------------- 120 (388) ++-..-..... .+......+... +. +..+...-...++.+ T Consensus 72 Q~~~~~r~~v~~at~~~~~~~t~~~tv~g~~~s~~a~~~~~a~~i~~~~~~~~~~~~~~~~~~~~t~~g~~t~~~~~~~~ 151 (426) T protein:vir:31 72 MGAEQWRVMVLEATEVTEEELSDGDTIDKVPILGNHEVESPDGDIEFTTDDDPDVEDFDAEIVINSATGDVATSEDSIEL 151 (426) T ss_pred CCceeEEeeccccceeeeccCCcceeecceeeeecccCcchHHHHHHhhccccccccceeeeEeccccceeeccccceee Confidence 75211110000 000000000000 00 000000000011100 Q ss_pred ---------hhhhhhhhhheecc---cccchhHHHHHHHHHhhhCceEEEEecCCCcchhHHHHHHHhhhcccccc-eEE Q lcl|Aclame:pro 121 ---------LTECTERPTLIGAP---GFSQNKAVIDALASMAKRLKCRAVIDGPSGSTQDAIDLSGLLGGEGTGHD-RVY 187 (388) Q Consensus 121 ---------~~~~~~~p~ll~ap---~~~~~~~v~~~l~~~~~~~~~~~i~d~p~~~~~~~~~~~~~~~~~~~~s~-~~~ 187 (388) +.++.........+ ++.+...+.+.+...++.-+-+.+...-...+.... ...+.+. ... T Consensus 152 ~~s~~dw~~~~~~~s~~~~~~ia~~~~~~~~~~~~~~~~~wa~~~~i~~va~~~e~~~~~~~-------~~~~a~~~~~~ 224 (426) T protein:vir:31 152 TYFHADWSQLDEFPSDVNNFAVADRRFDLKGVGVLDETHSWASDEDMGMIANGVNVDDYDSV-------DEAMDVAHEVA 224 (426) T ss_pred eeccCcchhhhcccccchhhhhhccccchhhhhhhHhhhhhhhhcceeeeeeccchhhhcch-------hhhhhhhhccc Confidence 00000000000000 011111111112111111111111111100000000 0011111 111 Q ss_pred EEecceecccccccceeehhhHHHHHHHHhccccccccccccc-cce---eecccccccccCchhhhhhccccceEEEEE Q lcl|Aclame:pro 188 MVDPMPAIYSRKAQGNIYVPPSTIAMGAVAAVKPWESPGNQGV-LIQ---DVARVIDYNILDKSTEGDLLNRNGVSYFAR 263 (388) Q Consensus 188 ~~~p~~~~~~~~~~~~~~~p~s~~~aG~~a~~d~~~s~~n~p~-~~~---g~~~~~~~~~~~~~~~~~~Ln~~gIn~i~~ 263 (388) -|.|-........ ...--..+.+++.++..+||..|+..-. +.. ...+..+.......+++..++ +..|.+.. T Consensus 225 ~y~p~~~~~~~~~--~~~~~~~~~~~~~~aa~~~~~~~~~~~~~~~~~~~~~~~~~gv~~t~~~~~~A~~~-~~~n~~~~ 301 (426) T protein:vir:31 225 GYVPSGDLMMIVD--ASDDDLAAYQLGKFAVSEPWYNPLWNELPAGETVSKNVGDPEEQGTFEGGDEAEGE-GPVNVLID 301 (426) T ss_pred ccccchhheeehh--ccccchhhHHhhhhhhhccccchhhhhccccccceeeccccccccccchhhhhhhc-CCceEEEE Confidence 1222221110000 0001236688899999999887653222 111 111122222222223333454 66799888 Q ss_pred eCCCcEEEEc-----cccCCCceeeehhhHHHHHHHHHHHHHHHhc---c-cCCHHHHHHHHHHHHHHHHHHHhcCC--e Q lcl|Aclame:pro 264 TSMGGFSLIG-----NRTVTGKFISFVGLEDAIARKLEAASQRAMS---K-QLTKSFMEQEIKKINLFMQDLVAAEI--I 332 (388) Q Consensus 264 ~~~~G~~~wG-----~rT~~~~~i~vrR~~~~i~~~i~~~~~~~vf---e-pn~~~~~~~i~~~i~~~L~~l~~~Ga--l 332 (388) +.+ +..+|= ..+++-.||-++|..+|+++.++..++..+= + |-+..-...|+..|+.-|++.++.|. + T Consensus 302 ~~~-~~~i~~~~~~~G~~~~G~~iD~~~g~dwl~~~iq~~l~~ll~~~~KIpyt~~Gi~~I~~~i~~~L~~~v~~~g~~~ 380 (426) T protein:vir:31 302 VSD-ANRVSNAVTTAGADSDTSFFDIRRTKVYTAEMLELDLESLQVSDDDVPFTEDGQAMIEDAIKGTMSGLTGSVGQPL 380 (426) T ss_pred ecC-ceeeecceeecccccchhhhhhHHHHHHHHHHHHHHHHHHhhcCCCCccchhHHHHHHHHHHHHHHHHhcCCCccc Confidence 764 444543 2334567999999999999999999887763 3 77888888999999999999998644 4 Q ss_pred eeeeEEEeccCCCHHHhhCCeEE-EEEEEEecCcceeEEEEEEEcc Q lcl|Aclame:pro 333 PGGEVYLHPTLNTVERYKNGSWY-IVIDYGRYSPNEHMIFHLNAVD 377 (388) Q Consensus 333 ~g~~v~~d~~~Nt~~~i~~G~~~-~~v~~~p~~pae~I~~~~~~~~ 377 (388) .+|.|...+...++.|..+-++. +++.....-.+.++.++...+. T Consensus 381 ~~y~v~~P~~~~~~~dra~R~~~~i~~~~~laGAIh~v~I~g~v~v 426 (426) T protein:vir:31 381 AEYEVDVPEWDDDDVDRVNRNWGGIDLDARLAQRAHTFSLGLNVSV 426 (426) T ss_pred cceeecCCCccccchhhhhhccCCceEEEEEeCcEEEEEEEEEEeC Confidence 46887765544555677776665 7788888899999999999888 No 52 >protein:vir:80052 Length: 331 # NCBI annotation: gp14 # Family: family:all:396 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468718;genbank:gi:157325298;genbank:GeneID:5601743 Probab=98.62 E-value=1.8e-07 Score=57.50 Aligned_cols=313 Identities=13% Similarity=0.048 Sum_probs=174.8 Q ss_pred CCCCCCcCCCeEEEEcCCCcccccccCcceeEEEeeccccccccccCcceeeccchhhhhhcccccccccchhhhhhhhc Q lcl|Aclame:pro 1 MPVIDQFEHNGISIETHEPPPPMGPPGDNVVAWVVTAPDKHADVAFSVPFRVANTADAQYLDSTGNELGTGWHAASETLK 80 (388) Q Consensus 1 M~~~t~~~hGV~~~e~~~~~~~i~~v~tav~g~vgta~~~~~~~~~~~~v~v~s~~~~~~~~~~~~~~gtl~~a~~~~~~ 80 (388) |- +. .-+|.+.-....+.+ ...-..+.+..+.... ..+..++. .......+.....+.+...++. T Consensus 1 ~~--~~-iv~V~v~~~~~~~~~--~~~~~~~~~~~~~t~~-------~~~~y~s~---~~v~~d~~~~~~~Ykaa~~~f~ 65 (331) T protein:vir:80 1 MV--ET-ITDVRVHISVLYPSP--RIGLGRPAIFVKGTAM-------GYKEYTTL---EELKDTFADNTEVYAKAKAVFL 65 (331) T ss_pred Cc--cc-eecceeeeccccccc--ccccCcceeEEecccc-------ceEEEech---hhhccCCCCCcHHHHHHHHHHh Confidence 21 12 223333322111111 2222333333222111 01223333 2232334455667778888888 Q ss_pred cccceEEEEecccccccccccccccccccchhhhhhhHhhhhhhhh-hhhheecccccchhHHHHHHHHHhhh-CceEEE Q lcl|Aclame:pro 81 KTSVPQYFIVVPEGADDAATMANIIGGIDPTTGRRTGIAALTECTE-RPTLIGAPGFSQNKAVIDALASMAKR-LKCRAV 158 (388) Q Consensus 81 ~~~~~~~vv~~~~~~~~~~~~~~~~~~~~~~tg~~tgl~a~~~~~~-~p~ll~ap~~~~~~~v~~~l~~~~~~-~~~~~i 158 (388) ++..+.-+....... . +.+.++..... .--.+.+.+.. ...+ .++....+. -..|.+ T Consensus 66 Q~~~~~~i~v~~~~~---------------~----~~~~a~~a~~~~~w~~~~~~~~~-~~~~-~a~a~~~~a~~~~f~~ 124 (331) T protein:vir:80 66 QKDRPDTVAVITYED---------------T----KLLEAAEAYFLKSWHFALLAEFK-AADA-LALSNLIEEQKFKFAV 124 (331) T ss_pred ccCccceEEEeccch---------------H----HHHHHHHHhccCceeEEEeecCC-HHHH-HHHHHHHhhCCcEEEE Confidence 876543222111100 0 11111111110 00112222222 2222 233343332 233444 Q ss_pred EecCCCcchhHHHHHHHhhhcccccceEEEEecceecccccccceeehhhHHHHHHHHhcccccccccc-c-cccceeec Q lcl|Aclame:pro 159 IDGPSGSTQDAIDLSGLLGGEGTGHDRVYMVDPMPAIYSRKAQGNIYVPPSTIAMGAVAAVKPWESPGN-Q-GVLIQDVA 236 (388) Q Consensus 159 ~d~p~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~s~~~aG~~a~~d~~~s~~n-~-p~~~~g~~ 236 (388) ++.... .. .... . + .+....++++.- . --+.++++|..+..++-.-.|+ + ++ .|+. T Consensus 125 ~~~~~~--~~--~~~~---~-~-~~~t~~~~~~~~-------~----~~~~aa~~g~~~~~~~g~~t~~fk~~l--~GV~ 182 (331) T protein:vir:80 125 FQVTAV--AD--ITPL---A-K-NTRTIAIVHSKT-------G----EKLDAALIGNVASLPVGSATWKGRHGL--AGIT 182 (331) T ss_pred EecCch--HH--HHHh---h-c-cccEEEEEcCCc-------c----chhHHHHHHHHHhcCccceeeeeeccc--CCCC Confidence 443221 11 1111 1 1 233444443321 1 1245667788888887543332 2 32 2332 Q ss_pred ccccccccCchhhhhhccccceEEEEEeCCCcEEEEccccCCCceeeehhhHHHHHHHHHHHHHHHhcc----cCCHHHH Q lcl|Aclame:pro 237 RVIDYNILDKSTEGDLLNRNGVSYFARTSMGGFSLIGNRTVTGKFISFVGLEDAIARKLEAASQRAMSK----QLTKSFM 312 (388) Q Consensus 237 ~~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~~~~i~vrR~~~~i~~~i~~~~~~~vfe----pn~~~~~ 312 (388) . ..++.+|++.|..+++|++.++.+..+ ++...|++-.||-+.+-.+|++..|+..+...+-. |-++.=. T Consensus 183 ~-----~~lt~t~~~al~~~~~N~y~~~~~~~~-~~~G~~~~G~~iD~~~~~dWl~~~lq~~l~~ll~~~~kiPy~~~G~ 256 (331) T protein:vir:80 183 S-----EELKVSEIDAIQKAGGMCYIEKAGIAQ-TSEGKTVSGEFIDSIHGDDWIKATIETRLQKLLTETDKLTFDARGI 256 (331) T ss_pred C-----CCCCHHHHHHHHhcCceEEEEecCeeE-EecceEeCchhHHHHHHHHHHHHHHHHHHHHHHHhCCCCccChhhH Confidence 2 235788999999999999999866544 56666777789999999999999999998887643 5566777 Q ss_pred HHHHHHHHHHHHHHHhcCCee--------eeeEEEec-cCCCHHHhhCCeEE-EEEEEEecCcceeEEEEEEEcc Q lcl|Aclame:pro 313 EQEIKKINLFMQDLVAAEIIP--------GGEVYLHP-TLNTVERYKNGSWY-IVIDYGRYSPNEHMIFHLNAVD 377 (388) Q Consensus 313 ~~i~~~i~~~L~~l~~~Gal~--------g~~v~~d~-~~Nt~~~i~~G~~~-~~v~~~p~~pae~I~~~~~~~~ 377 (388) ..|+..++.-|++-++.|.|. ||.|...+ ++.+++|+.+++.. +.+.+.+...+++|++....+. T Consensus 257 ~~l~a~i~~~~~~av~~G~I~~g~~~~~~~~~v~~~~~~~~s~~dr~~R~~~~~~~~~~~~gaI~~v~i~~~v~~ 331 (331) T protein:vir:80 257 ALLQSELTTVLNEGFANGIIDSNDETGEPNFSITALQRSDLNDDDIAKRNYKGLSFRYKRSGAIHSVDVYGEVEV 331 (331) T ss_pred HHHHHHHHHHHHHHHhCCceecCccCCCcceEEEeCchhcCCHHHHhccCCCCeEEEEEEcceEEEEEEEEEEeC Confidence 889999999999999999996 57888754 67899999998886 8888999999999999999988 No 53 >protein:vir:5260 Length: 502 # NCBI annotation: hypothetical protein # Family: family:all:396 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852765;genbank:gi:31544040;uniprot:Q7Y5T5;genbank:GeneID:2753559 Probab=98.60 E-value=1.3e-07 Score=58.36 Aligned_cols=345 Identities=12% Similarity=0.046 Sum_probs=162.2 Q ss_pred CCCCCCcCCC-eEEEEcCCCcccccccCcceeEEEeec-----------------ccccccc-ccCcceeeccchhhhhh Q lcl|Aclame:pro 1 MPVIDQFEHN-GISIETHEPPPPMGPPGDNVVAWVVTA-----------------PDKHADV-AFSVPFRVANTADAQYL 61 (388) Q Consensus 1 M~~~t~~~hG-V~~~e~~~~~~~i~~v~tav~g~vgta-----------------~~~~~~~-~~~~~v~v~s~~~~~~~ 61 (388) +.-. ..++ +++-+-............++.+...+. ....... .++. -..++..+.+.. T Consensus 73 F~q~--p~P~~l~igR~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~l~i~i~g~~~t~~~i~l-S~~ts~~~vA~~ 149 (502) T protein:vir:52 73 FAQS--PRAKQLIVARWQKSASTIEATKNTLSGATLSDDLERFKSVVNGRFSLTIGGDVKKVDGLSF-ARLADFNAVATK 149 (502) T ss_pred hcCC--CccceEEEEeccccccceeechhhhhhhhhHHhHHHhhhhcCceeEEEecceeeeeecccc-ccccchhHHHHH Confidence 1110 1111 333333322222221111111111100 0000000 0000 001111111111 Q ss_pred ccc-ccccccchhhhhhhhccccceEEEEecc-ccccccccccccccccc-chhhhhhhHhhhhhhhhhhhhe----ecc Q lcl|Aclame:pro 62 DST-GNELGTGWHAASETLKKTSVPQYFIVVP-EGADDAATMANIIGGID-PTTGRRTGIAALTECTERPTLI----GAP 134 (388) Q Consensus 62 ~~~-~~~~gtl~~a~~~~~~~~~~~~~vv~~~-~~~~~~~~~~~~~~~~~-~~tg~~tgl~a~~~~~~~p~ll----~ap 134 (388) ... ....+.. +..-++..+ ..+++... .+.....+ +.-+.+ ..+| +.+.++......+..+ ... T Consensus 150 i~~~l~~~~~~---~tv~~d~~~-~~F~i~s~ttg~~~~~~---~~~a~~~~~~g--t~~a~~l~l~~~~~av~v~~~~~ 220 (502) T protein:vir:52 150 IQEKLTTLSVA---VSIAYDETG-NRFIVSANVAGEDKKTE---IDYAIDEGGEG--EYIGALLKLENGQASRKVGKNSV 220 (502) T ss_pred HHhhhcccccc---eEEEEecCC-ceEEEEeccCCCcceeE---EEEeecCCcch--hHHHHHhccccccceeeeeeecc Confidence 000 0000000 000011111 11112111 11111110 000000 0011 1111111111111111 111 Q ss_pred cccchhHHHHHHHHHhh---hCceEEEEecCCCcchhHHHHHHHhhhcc--------------------------cccce Q lcl|Aclame:pro 135 GFSQNKAVIDALASMAK---RLKCRAVIDGPSGSTQDAIDLSGLLGGEG--------------------------TGHDR 185 (388) Q Consensus 135 ~~~~~~~v~~~l~~~~~---~~~~~~i~d~p~~~~~~~~~~~~~~~~~~--------------------------~~s~~ 185 (388) |. ....+.++|.++.+ .+-.+.+.+.+. +++......|.+..+ .+..+ T Consensus 221 g~-~aet~~~al~a~~~~~~~w~~~~~a~~~~--~~~~la~a~~iea~~~~f~~~~~d~~~~~~~~~~i~~~l~a~~~~~ 297 (502) T protein:vir:52 221 SL-KKETLGEALFNVAEVNNTWYGFTVAAQLT--DSEVEAAAKYAQANTKLFGANVIRAEQIEWSADNIYKKLYDAGLDH 297 (502) T ss_pred cc-cccCHHHHHHHHHhccCceEEEEEeecCC--hhHHHHHHHHHhhcCcEEEEEecCcceeccccchHHHHHHhccCce Confidence 11 12223333333332 222233333221 222233333333211 01112 Q ss_pred EEEEecceecccccccceeehhhHHHHHHHHhccccccccccccc---cceeecccccccccCchhhhhhccccceEEEE Q lcl|Aclame:pro 186 VYMVDPMPAIYSRKAQGNIYVPPSTIAMGAVAAVKPWESPGNQGV---LIQDVARVIDYNILDKSTEGDLLNRNGVSYFA 262 (388) Q Consensus 186 ~~~~~p~~~~~~~~~~~~~~~p~s~~~aG~~a~~d~~~s~~n~p~---~~~g~~~~~~~~~~~~~~~~~~Ln~~gIn~i~ 262 (388) ..+.|-. . -..+.++++|..+..|+..-+.-... ...|+.. ..++.+|++.|..+++|++. T Consensus 298 t~~~y~~----------~-~~~~~aa~~g~~as~~f~~~~g~iT~~fk~l~GV~~-----~~lt~t~~~al~~~~~N~y~ 361 (502) T protein:vir:52 298 TLAMFDK----------N-DMYPVSSALARLLSTNFAANNSTLTLKFKQQPTITA-----DEITATEFAKAKRLGINVYT 361 (502) T ss_pred eEEEecC----------C-cchhHHHHHHHHHhcCCCcCcceeeecccccCCccc-----CcCCHHHHHHHHhcCceEEE Confidence 2222210 0 12356677888898887544332221 1223332 23688899999999999999 Q ss_pred EeCCCcEEEEccccCCCceeeehhhHHHHHHHHHHHHHHHhcc-----cCCHHHHHHHHHHHHHHHHHHHhcCCee---- Q lcl|Aclame:pro 263 RTSMGGFSLIGNRTVTGKFISFVGLEDAIARKLEAASQRAMSK-----QLTKSFMEQEIKKINLFMQDLVAAEIIP---- 333 (388) Q Consensus 263 ~~~~~G~~~wG~rT~~~~~i~vrR~~~~i~~~i~~~~~~~vfe-----pn~~~~~~~i~~~i~~~L~~l~~~Gal~---- 333 (388) ++.+.++ +...+++.-+||-+.+-.+|++..|+..+...++. |-|+.=...|+..++.-|++-+++|.|. T Consensus 362 ~~~~~~~-~~~G~~~~G~~iD~~~~~~Wl~~~lq~~l~~~L~~s~~kIPy~~~G~~~l~a~i~~~l~~a~~~G~I~~G~~ 440 (502) T protein:vir:52 362 YFDDVAM-IAEGTVIGGKFADEIVILDWFVDAVQKEVFARLYKSPTKIPLTDKGQAILIAAVEKVCLEGINNGAFAPGKW 440 (502) T ss_pred EecCeeE-EecCeeeCCchhhHHHHHHHHHHHHHHHHHHHHHhcCCCcccChhHHHHHHHHHHHHHHHHHhcCccccccc Confidence 9876654 66777888789999999999999999998876652 6677778899999999999999999984 Q ss_pred ----------------eeeEEEe-ccCCCHHHhhCCeE-EEEEEEEecCcceeEEEEEEEcc Q lcl|Aclame:pro 334 ----------------GGEVYLH-PTLNTVERYKNGSW-YIVIDYGRYSPNEHMIFHLNAVD 377 (388) Q Consensus 334 ----------------g~~v~~d-~~~Nt~~~i~~G~~-~~~v~~~p~~pae~I~~~~~~~~ 377 (388) ||.+... .++.+++|+.+++. -+.+.+.+...+++|++.+..+. T Consensus 441 ~~~~~g~~~~~d~~~~gy~v~~~~~~~~s~~dr~~R~~~~~~~~~~~aGaIh~v~i~~nv~~ 502 (502) T protein:vir:52 441 TGAGFGNLSTGDYLDKGFYVWAAPMDTLSDSDRQARRATPIQTAVKLAGAIHSSDVIVNYNR 502 (502) T ss_pred cCcccceeeecccccCceEEEeCchhhCCHHHHHcccCCCeEEEEEECceEEEEEEEEEEeC Confidence 5778876 46789999999988 89999999999999999888888 No 54 >protein:vir:3788 Length: 376 # NCBI annotation: tail sheath # Family: family:all:669 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536828;genbank:gi:17981837;genbank:GeneID:929216 Probab=98.03 E-value=6.5e-06 Score=49.00 Aligned_cols=338 Identities=12% Similarity=0.043 Sum_probs=183.5 Q ss_pred cCCCeEEEEcCCCcccccccCcceeEEEeeccccccccccCcceeeccchhhhhhcccccccccchhhhhhhhccccceE Q lcl|Aclame:pro 7 FEHNGISIETHEPPPPMGPPGDNVVAWVVTAPDKHADVAFSVPFRVANTADAQYLDSTGNELGTGWHAASETLKKTSVPQ 86 (388) Q Consensus 7 ~~hGV~~~e~~~~~~~i~~v~tav~g~vgta~~~~~~~~~~~~v~v~s~~~~~~~~~~~~~~gtl~~a~~~~~~~~~~~~ 86 (388) -..-|.+...+-+..++..+.--+. |||.+..... +...+...+|...+.+. .++.|...+.+...|.+.. T Consensus 1 ~~~~v~vn~~n~~~g~~~~~er~~L-fig~~~~~~~-----~~~~~~~~sdld~~lg~--~~~~lk~~v~aa~~naG~~- 71 (376) T protein:vir:37 1 MFPSVQINALNQLSGETKEIERHAL-FVGVGTTNQG-----KLLALTPDSDFDKVFGE--TDTDLKKQVRAAMLNAGQN- 71 (376) T ss_pred CCCeEEEecccccCCCcccccceEE-eecccccccc-----ceeeecCccchHhhhCC--CchHHHHHHHHHHhCCCCc- Confidence 2455888888888888877776555 8887654322 22334444444444322 3355655565555554432 Q ss_pred EEEecccccccccccccccccccchhhhhhhHhh-hhhhhhhhhheeccccc-chhHHHHHHHHHhh----h--CceEEE Q lcl|Aclame:pro 87 YFIVVPEGADDAATMANIIGGIDPTTGRRTGIAA-LTECTERPTLIGAPGFS-QNKAVIDALASMAK----R--LKCRAV 158 (388) Q Consensus 87 ~vv~~~~~~~~~~~~~~~~~~~~~~tg~~tgl~a-~~~~~~~p~ll~ap~~~-~~~~v~~~l~~~~~----~--~~~~~i 158 (388) +...+..... . +.+ ..+.++. ......+ .+.+-+-+ .+++-..++.++++ + +..++| T Consensus 72 ~~~~~~~~~~---~------~~~----~~~Av~~a~~~~s~E--~V~v~~pv~t~~a~i~aa~~~a~el~~~~~Rpv~fi 136 (376) T protein:vir:37 72 WFAHVYIAQE---D------GYD----FVECVKKANQTASFE--YCVNTRYLGVDKASIGKLQECYAELLAKFGRRTFFI 136 (376) T ss_pred EEEEEEeecC---C------chH----HHHHHHHhhhhcCce--EEEEeccccccHHHHHHHHHHHHHHHHhcCCeEEEE Confidence 2211110000 0 000 1111111 1111111 22222222 23333444455443 2 235677 Q ss_pred EecCCCc-------c-hhHHHHHHHhhhcccccceEEEEecceecccccccceeehhhHHHHHHHHhccc--cccccccc Q lcl|Aclame:pro 159 IDGPSGS-------T-QDAIDLSGLLGGEGTGHDRVYMVDPMPAIYSRKAQGNIYVPPSTIAMGAVAAVK--PWESPGNQ 228 (388) Q Consensus 159 ~d~p~~~-------~-~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~s~~~aG~~a~~d--~~~s~~n~ 228 (388) +..+.-. + .++.. .......++.+.++.++ |.+ + --..|.+||.+|+.- -.++|..- T Consensus 137 le~r~~~~~~~~~e~w~~y~~-~~~al~~gia~~~V~~V-~~~--~---------gn~~G~~aGRl~~aaVsVadspgRV 203 (376) T protein:vir:37 137 QAVQGINHDQSDGETWDQYVQ-KLTTLQQTIVADHVCLV-PLL--F---------GNETGVLAGRLANRAVTVADSPARV 203 (376) T ss_pred EeccCcCcccccccCHHHHHH-HHHHhhcccccccceee-eee--h---------hhhHHHHHHHHhhcccchhhCccce Confidence 7776311 1 11111 11122344555655433 111 1 123788888876442 25566543 Q ss_pred ccc-ceee---ccccc-ccccCchhhhhhccccceEEEEEeCCC-cEEEEc-cccCC-----CceeeehhhHHHHHHHHH Q lcl|Aclame:pro 229 GVL-IQDV---ARVID-YNILDKSTEGDLLNRNGVSYFARTSMG-GFSLIG-NRTVT-----GKFISFVGLEDAIARKLE 296 (388) Q Consensus 229 p~~-~~g~---~~~~~-~~~~~~~~~~~~Ln~~gIn~i~~~~~~-G~~~wG-~rT~~-----~~~i~vrR~~~~i~~~i~ 296 (388) ..+ +.|+ ..+.+ ....++....+.|..+|-.+.+.++|. |+ +|+ .||+. ++||..+|..+-+.|.++ T Consensus 204 ~tG~l~gl~~~~lp~d~~~~~l~~a~l~aLd~agy~vp~~Y~gy~G~-Y~~d~~tl~~~gsDY~~ie~~RVvdKa~R~vR 282 (376) T protein:vir:37 204 QTGALVSLGSANKPLDKDRNELTLAHLKSLETARYSVPMWYPDYDGY-YWADGRTLDVEGGDYQVIENLRVVDKVARKVR 282 (376) T ss_pred eccccccccccccccCcCcccCCHHHHHHHHhCCCeEEEeeCCCCce-EEeCceEeccCCCChhhhhhhhHHHHHHHHHH Confidence 332 2232 22222 224567778889999999999999875 76 665 47774 789999999999998888 Q ss_pred HHHHHHhcccC---CHHHHHHHHHHHHHHHHHHHhcCCeee----eeEEEeccCCC-HHHhhCCeEEEEEEEEecCccee Q lcl|Aclame:pro 297 AASQRAMSKQL---TKSFMEQEIKKINLFMQDLVAAEIIPG----GEVYLHPTLNT-VERYKNGSWYIVIDYGRYSPNEH 368 (388) Q Consensus 297 ~~~~~~vfepn---~~~~~~~i~~~i~~~L~~l~~~Gal~g----~~v~~d~~~Nt-~~~i~~G~~~~~v~~~p~~pae~ 368 (388) ...-..+...- ++.-.+..+.-+..=|+++.+..-+.| ++|...++.+. ..-+...++.+.+.+.|.--.++ T Consensus 283 ~~ai~~i~D~~lnst~~sia~~~~yi~~pLr~M~~s~~i~g~~fpGeI~~p~d~Di~i~w~s~~~V~I~~~v~P~~~pk~ 362 (376) T protein:vir:37 283 LLAIGKIADRSFNSTTSSTEYHKNYFAKPLRDMSKSATINGKDFPGECMPPKDDAITIVWQSKTKVTIYIKVRPYDCPKE 362 (376) T ss_pred HHHHHHhCCcccCcchhhHHHHHHHHHHHHHHHHhcchhccccccceeecCCCCCceEEeeccceEEEEEEEEeccCCce Confidence 77766654322 344455666667777888888877777 24554443221 12236788999999999999999 Q ss_pred EEEEEEEcchHHHH Q lcl|Aclame:pro 369 MIFHLNAVDRIVEE 382 (388) Q Consensus 369 I~~~~~~~~~~~~~ 382 (388) |+..+..|-.-.-+ T Consensus 363 Itv~I~Ldlsn~~~ 376 (376) T protein:vir:37 363 ITANIFLDLDSLGE 376 (376) T ss_pred EEEEEEeecCCCCC Confidence 99777666442222 No 55 >protein:vir:78782 Length: 370 # NCBI annotation: putative tail sheath protein # Family: family:all:669 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285652;genbank:gi:148727158;genbank:GeneID:5220132 Probab=97.93 E-value=1.1e-05 Score=47.87 Aligned_cols=337 Identities=9% Similarity=0.017 Sum_probs=180.7 Q ss_pred cCCCeEEEEcCCCcccccccCcceeEEEeeccccccccccCcceeeccchhhhhhcccccccccchhhhhhhhccccceE Q lcl|Aclame:pro 7 FEHNGISIETHEPPPPMGPPGDNVVAWVVTAPDKHADVAFSVPFRVANTADAQYLDSTGNELGTGWHAASETLKKTSVPQ 86 (388) Q Consensus 7 ~~hGV~~~e~~~~~~~i~~v~tav~g~vgta~~~~~~~~~~~~v~v~s~~~~~~~~~~~~~~gtl~~a~~~~~~~~~~~~ 86 (388) -.+-|.+...+-+..++..+.--+. |||++..... +...+...+|...+.+ .....|..-+.+...|++..- T Consensus 1 ~~~~v~vn~~n~~~g~~~~~er~~l-fig~~~~~~g-----~~~~~~~~sdld~~l~--~~ds~lk~~v~aa~~naG~~~ 72 (370) T protein:vir:78 1 MWPYVQIYNLNQMQGPVTEVERHLL-FIGSAASNTG-----KLLSLNAQSDFDQLLG--AADSELKANLLAARDNAGQNW 72 (370) T ss_pred CCceEEEeeccccCCCcCccceeEE-EEeccccccc-----ceEeecCccCHHHhcC--CcChhHHHHHHHHHhCCCCce Confidence 1455888888888888877776554 8887763322 1233444445444432 233455555555555554322 Q ss_pred E--EEecccccccccccccccccccchhhhhhhHhhhhhhh--hhhhheecccccchhHHHHHHHHHhhh----C--ceE Q lcl|Aclame:pro 87 Y--FIVVPEGADDAATMANIIGGIDPTTGRRTGIAALTECT--ERPTLIGAPGFSQNKAVIDALASMAKR----L--KCR 156 (388) Q Consensus 87 ~--vv~~~~~~~~~~~~~~~~~~~~~~tg~~tgl~a~~~~~--~~p~ll~ap~~~~~~~v~~~l~~~~~~----~--~~~ 156 (388) . +...... . ..+.|+.... ..+..+.+-|-.+.++-++++.++++. + ..+ T Consensus 73 ~~~~~p~~~~-------~-------------d~~~Av~~a~~~~s~E~V~v~~~~s~~a~~~a~~~~a~el~n~~~Rpv~ 132 (370) T protein:vir:78 73 SAAAYVLPTD-------K-------------PWLDAARDAQQTQSFEGVVVLGQEWHQAAINAAHALNQELIAKWGRWQF 132 (370) T ss_pred EEEEEEecCc-------h-------------hHHHHHHHHHhhCCccEEEEecCcchHHHHHHHHHHHHHHHHhcCCeEE Confidence 1 1111100 0 1122222221 112233443444455556666666553 2 456 Q ss_pred EEEecCCCcchh---HHHHHHHhhhcccccceEEEEecceecccccccceeehhhHHHHHHHHhcc--cccccccccccc Q lcl|Aclame:pro 157 AVIDGPSGSTQD---AIDLSGLLGGEGTGHDRVYMVDPMPAIYSRKAQGNIYVPPSTIAMGAVAAV--KPWESPGNQGVL 231 (388) Q Consensus 157 ~i~d~p~~~~~~---~~~~~~~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~s~~~aG~~a~~--d~~~s~~n~p~~ 231 (388) +++..+.....+ ....+-.....++.+.++.++--|.. -..|.+||..+.. -...+|.....+ T Consensus 133 file~~~~~~~e~w~~y~~~l~al~~gia~~~V~vvp~~~g------------~~~G~~aGRL~naavsVadsP~Rv~tG 200 (370) T protein:vir:78 133 MLLAVPAIADEQDWATYEAELATLQDGIAASSVSLIPQLWP------------TLAGAYAGRLCNRAVSIADSPCRVKTG 200 (370) T ss_pred EEEeecCCCCcCCHHHHHHHHHHhhhccccccceEEeeecc------------ccHHHHHHHHhcCeeeecccceeeecc Confidence 677665532211 11111222234566777777633321 1146778865432 123344332222 Q ss_pred -ceeec-ccc-cccccCchhhhhhccccceEEEEEeCCC-cEEEEc-cccCC-----CceeeehhhHHHHHHHHH-HHHH Q lcl|Aclame:pro 232 -IQDVA-RVI-DYNILDKSTEGDLLNRNGVSYFARTSMG-GFSLIG-NRTVT-----GKFISFVGLEDAIARKLE-AASQ 300 (388) Q Consensus 232 -~~g~~-~~~-~~~~~~~~~~~~~Ln~~gIn~i~~~~~~-G~~~wG-~rT~~-----~~~i~vrR~~~~i~~~i~-~~~~ 300 (388) +.|+. .++ .....++.+..+.|..+|-.+.+.++|. |+ +|+ .||+. ++||..+|..+-+.+-++ .++. T Consensus 201 ~l~gl~~~p~d~~~~~l~~a~l~aLd~agy~vp~~Y~gy~G~-Y~~d~~tl~~~gsDYq~ie~~RVvdKa~R~vR~~ai~ 279 (370) T protein:vir:78 201 ALVGLGNKPVGKDGIPLPLATLQTLEANRYSVPMWYPDYDGI-YWADGRTLDAEGGDYQVIENLRIAYKVARRMRLRAIA 279 (370) T ss_pred ccccccccccccCCcccCHHHHHHHHhCCCeEEEeeCCCCce-EEeCceEeccCCCChhhhhhhhHHHHHHHHHHHHHHH Confidence 12211 121 2334566778889999999999999874 76 665 47774 789999999999999998 4445 Q ss_pred HHhcccCCH--HHHHHHHHHHHHHHHHHHhcCCeee--eeEEEeccCC---CHHHhhCCeEEEEEEEEecCcceeEEEEE Q lcl|Aclame:pro 301 RAMSKQLTK--SFMEQEIKKINLFMQDLVAAEIIPG--GEVYLHPTLN---TVERYKNGSWYIVIDYGRYSPNEHMIFHL 373 (388) Q Consensus 301 ~~vfepn~~--~~~~~i~~~i~~~L~~l~~~Gal~g--~~v~~d~~~N---t~~~i~~G~~~~~v~~~p~~pae~I~~~~ 373 (388) ...+|-.++ ......+..+..=|+++...+.+.| |.-++....+ +..-...+++.+.+.+.|.--.+.|+..+ T Consensus 280 ~i~D~~lnst~gsia~~~~~~~~~L~ema~s~~i~~~~fpgeI~~p~d~Di~i~w~s~~~v~I~~~v~P~~~pk~Itv~I 359 (370) T protein:vir:78 280 RIGDRSFNSTPGSTAAAITYFGKDLREMAKSTTINGQPFPGDIASPQDGDIRIQWVAKNLVSVFVVVRTVDCPKGITVNI 359 (370) T ss_pred HhCCcccCCCCcchhHHHHHHHhhHHHHHhhhhhcccccceeEeccCCCcceEEeeccceEEEEEEEEeccCCceEEEEE Confidence 444543332 2223344444444666666777766 4344432211 11123568889999999998899999988 Q ss_pred EEcchHHHHHH Q lcl|Aclame:pro 374 NAVDRIVEEFI 384 (388) Q Consensus 374 ~~~~~~~~~l~ 384 (388) ..|-..=++-= T Consensus 360 ~LDls~e~~~~ 370 (370) T protein:vir:78 360 MLDLSLNNGEG 370 (370) T ss_pred EEeeccccCCC Confidence 77643222111 No 56 >protein:vir:3751 Length: 376 # NCBI annotation: orf23 # Family: family:all:669 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043492;genbank:gi:9628627;genbank:GeneID:1261129 Probab=97.75 E-value=2.2e-05 Score=46.12 Aligned_cols=340 Identities=12% Similarity=0.003 Sum_probs=190.1 Q ss_pred cCCCeEEEEcCCCcccccccCcceeEEEeeccccccccccCcceeeccchhhhhhcccccccccchhhhhhhhccccceE Q lcl|Aclame:pro 7 FEHNGISIETHEPPPPMGPPGDNVVAWVVTAPDKHADVAFSVPFRVANTADAQYLDSTGNELGTGWHAASETLKKTSVPQ 86 (388) Q Consensus 7 ~~hGV~~~e~~~~~~~i~~v~tav~g~vgta~~~~~~~~~~~~v~v~s~~~~~~~~~~~~~~gtl~~a~~~~~~~~~~~~ 86 (388) -..-|.+...+-+.-++..+.--.. |||.+..... +...+...+|...+.+. ....|..-+.+...|.+.. T Consensus 1 ~~~~v~vn~ln~~qg~~~~ver~~l-fig~~~~~~~-----~~~~~~~~sdld~~lg~--~ds~lk~~v~aa~~naG~~- 71 (376) T protein:vir:37 1 MFPSVQINALNQLSGETKEIERHAL-FVGVGTTNQG-----KLLALTPDSDFDKVFGE--TDTDLKKQVRAAMLNAGQN- 71 (376) T ss_pred CCCeEEEeeeeccCCCcccccceEE-EeeccccccC-----ceEEecCCCChHHhhCC--CchhHHHHHHHHHhCCCCc- Confidence 2445888888888877777776554 8888764322 22344444454444432 3345555566555554332 Q ss_pred EEEecccccccccccccccccccchhhhhhhHhhhhhhhhhhhheeccccc-chhHHHHHHHHHhh----h--CceEEEE Q lcl|Aclame:pro 87 YFIVVPEGADDAATMANIIGGIDPTTGRRTGIAALTECTERPTLIGAPGFS-QNKAVIDALASMAK----R--LKCRAVI 159 (388) Q Consensus 87 ~vv~~~~~~~~~~~~~~~~~~~~~~tg~~tgl~a~~~~~~~p~ll~ap~~~-~~~~v~~~l~~~~~----~--~~~~~i~ 159 (388) +...+.... .+. ......++.+.+. ..+..+.+-|-+ .+++.+.++.++++ + ...++++ T Consensus 72 w~a~~~~p~------------~~~-~~~~~Av~~a~~~-~s~E~V~v~~p~~t~~a~i~a~qa~a~el~~~~~R~vffil 137 (376) T protein:vir:37 72 WFAHVYIAQ------------EDG-YDFVECVKKANQT-ASFEYCVNTRYLGVDKASIGKLQECYAELLAKFGRRTFFIQ 137 (376) T ss_pred eEEEEEecC------------CCh-hhHHHHHHHHHhh-CCeeEEEEecCcchhHHHHHHHHHHHHHHHHhcCCeEEEEE Confidence 221111000 000 0011112211111 112233333322 23444555555443 2 2467777 Q ss_pred ecCCCc-------chhHHHHHHHhhhcccccceEEEEecceecccccccceeehhhHHHHHHHHhccc--cccccccccc Q lcl|Aclame:pro 160 DGPSGS-------TQDAIDLSGLLGGEGTGHDRVYMVDPMPAIYSRKAQGNIYVPPSTIAMGAVAAVK--PWESPGNQGV 230 (388) Q Consensus 160 d~p~~~-------~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~s~~~aG~~a~~d--~~~s~~n~p~ 230 (388) ..+.-. +-......-.....++.+.++.++-. +. . -..|.+||.+|+.- -.++|..... T Consensus 138 e~~g~d~~~~~ge~w~~y~~~l~a~~~gia~~~V~vV~~-~~--g---------n~~G~~aGRl~naaVsVadspgRV~t 205 (376) T protein:vir:37 138 AVQGINHDQSDGETWDQYVQKLTTLQQTIVADHVCLVPL-LF--G---------NETGVLAGRLANRAVTVADSPARVQT 205 (376) T ss_pred eccCCCCcccccCCHHHHHHHHHHHhccccccceeeeee-ec--c---------chHHHHHHHHHhCCcchhcCccceee Confidence 766311 10111111222345677778777632 11 1 24778899887542 2556665443 Q ss_pred c-ceeec---cccc-ccccCchhhhhhccccceEEEEEeCCC-cEEEEc-cccCC-----CceeeehhhHHHHHHHHHHH Q lcl|Aclame:pro 231 L-IQDVA---RVID-YNILDKSTEGDLLNRNGVSYFARTSMG-GFSLIG-NRTVT-----GKFISFVGLEDAIARKLEAA 298 (388) Q Consensus 231 ~-~~g~~---~~~~-~~~~~~~~~~~~Ln~~gIn~i~~~~~~-G~~~wG-~rT~~-----~~~i~vrR~~~~i~~~i~~~ 298 (388) + +.|+. .+.+ ....++......|..+|-.+.+.++|. |+ +|+ .||+. ++||..+|..+-+.|.++.. T Consensus 206 Gai~gl~~~~~p~d~~g~el~~a~l~aLd~arysvpr~Y~gydG~-Yw~dg~tl~~~gsDYq~ie~~RVvdKa~R~vR~~ 284 (376) T protein:vir:37 206 GALVSLGSANKPLDKDGNELTLAHLKSLETARYSVPMWYPDYDGY-YRADGRTLDVEGGDYQVIENLRVVDKVARKVRLL 284 (376) T ss_pred cccccccccccccccCCcccchHHHHHHHhCCCeEEEeeCCCCce-EEeCCeEeccCCCCeeeehhchHHHHHHHHHHHH Confidence 3 22332 2222 123456667888999999999999874 65 665 47764 78999999999988877765 Q ss_pred HHHHhc-c--cCCHHHHHHHHHHHHHHHHHHHhcCCeeee----eEEEeccCC-CHHHhhCCeEEEEEEEEecCcceeEE Q lcl|Aclame:pro 299 SQRAMS-K--QLTKSFMEQEIKKINLFMQDLVAAEIIPGG----EVYLHPTLN-TVERYKNGSWYIVIDYGRYSPNEHMI 370 (388) Q Consensus 299 ~~~~vf-e--pn~~~~~~~i~~~i~~~L~~l~~~Gal~g~----~v~~d~~~N-t~~~i~~G~~~~~v~~~p~~pae~I~ 370 (388) .-..+. + +.++.-.+..+..+..=|++|.+.+-|.|. +|...++.. +..-....++.+.+.+.|.--.+.|+ T Consensus 285 Ai~~i~Dr~lnstp~sia~~~~~~~~pLr~M~ks~ei~g~~fpgei~~P~d~dI~i~w~sk~~V~I~~~vrPy~cpk~i~ 364 (376) T protein:vir:37 285 AIGKIADRSFNSTTSSTEYHKNYFAKPLRDMSKSATINGKDFPGECMPPKDDAITIVWQSKTKVTIYIKVRPYDCPKEIT 364 (376) T ss_pred HHHHhcCccccCChhHHHHHHHHHhHHHHHHHhhhhhccccccceeecCCCCceEEEeccCceEEEEEEEeeecCcceeE Confidence 554443 3 456677888888899999999999999993 344432210 00001246677777788887788999 Q ss_pred EEEEEcchHHHH Q lcl|Aclame:pro 371 FHLNAVDRIVEE 382 (388) Q Consensus 371 ~~~~~~~~~~~~ 382 (388) ..+..|-.-+-+ T Consensus 365 ~~I~LDls~~~~ 376 (376) T protein:vir:37 365 ANIFLDLDSLGE 376 (376) T ss_pred EEEEEecCCCCC Confidence 888888663333 No 57 >protein:vir:106984 Length: 743 # NCBI annotation: contractile sheath protein gp18 # Family: family:all:661 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195136;genbank:gi:58532913;uniprot:Q5GQN6;genbank:GeneID:3260481 Probab=97.68 E-value=4.9e-08 Score=60.66 Aligned_cols=341 Identities=10% Similarity=-0.039 Sum_probs=99.8 Q ss_pred CCCCCCcCC-CeEEEEcCCCcccccccCcceeEEEeeccccccccccCcceeeccchhhhhhcccccccccchhhhhhhh Q lcl|Aclame:pro 1 MPVIDQFEH-NGISIETHEPPPPMGPPGDNVVAWVVTAPDKHADVAFSVPFRVANTADAQYLDSTGNELGTGWHAASETL 79 (388) Q Consensus 1 M~~~t~~~h-GV~~~e~~~~~~~i~~v~tav~g~vgta~~~~~~~~~~~~v~v~s~~~~~~~~~~~~~~gtl~~a~~~~~ 79 (388) ||. |+| |||++|++.++++|..+.|++.+|||.++.+ +.++|++|+|+.++...|+.......+..++..+| T Consensus 1 m~~---~~~Pgvyv~e~~~~~~~i~~~~t~~~~~vg~~~~G----p~~~p~~i~s~~~~~~~fG~~~~~~~~~~~v~~~f 73 (743) T protein:vir:10 1 MAS---QVSPGILIKERDLTNAVVTGALQIRAAHASTFAKG----PIGDIVNINTQKELVSVFGEPKEDNAEDWMVASEF 73 (743) T ss_pred Ccc---ccCCceEEEEecCCCceeccCCcceeEEEEeccCC----CCCcCEEecCHHHHHHHcCCccCCcchHHHHHHHH Confidence 775 455 9999999999999999999999999999877 57899999999999999998888889999999999 Q ss_pred ccccceEEEEeccccccccccccccc---ccccc-hhhhhhhHhhhhhhhhhhhheecccccchhHHHHHHHHHhhhCce Q lcl|Aclame:pro 80 KKTSVPQYFIVVPEGADDAATMANII---GGIDP-TTGRRTGIAALTECTERPTLIGAPGFSQNKAVIDALASMAKRLKC 155 (388) Q Consensus 80 ~~~~~~~~vv~~~~~~~~~~~~~~~~---~~~~~-~tg~~tgl~a~~~~~~~p~ll~ap~~~~~~~v~~~l~~~~~~~~~ 155 (388) .|++..|+++|+........+..... ..... ..+....+.. .-.-||-. .+.+ . T Consensus 74 ~ngg~~~~vvrv~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~---------~a~~~G~~------------gN~i-~ 131 (743) T protein:vir:10 74 LNYGGRLAVVRAETTGVLNATTGSAGVLVKNRESWDAGSGNGEVF---------VARTAGSW------------GNSL-M 131 (743) T ss_pred HhCCceEEEEEccCccccccccccccccccccccccccccceeEE---------EEeecccc------------ccce-E Confidence 99999999999976543333221100 00000 0000000000 00001100 0000 0 Q ss_pred EEEEecCCCcchhHH-------HHHHHhhhcccccceEEEEecceeccccccc-ceeeh--------hhHHHHHHHHhcc Q lcl|Aclame:pro 156 RAVIDGPSGSTQDAI-------DLSGLLGGEGTGHDRVYMVDPMPAIYSRKAQ-GNIYV--------PPSTIAMGAVAAV 219 (388) Q Consensus 156 ~~i~d~p~~~~~~~~-------~~~~~~~~~~~~s~~~~~~~p~~~~~~~~~~-~~~~~--------p~s~~~aG~~a~~ 219 (388) ..+.+...+...... ............... ..+...+...+ ..... ..+...++..+.. T Consensus 132 V~v~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~ 206 (743) T protein:vir:10 132 GVLVDRGADYIVTFAATPTDTAVGTQLLFSYSGTLVT-----GEILSYDSATNTATITASGTLTSQYLLDTPEQGLIGSF 206 (743) T ss_pred EEEecCCCcceeeeeccccccccceeeeecccccccc-----cceeeeeecCcceeeeeccccceeeecccccccccccc Confidence 011111000000000 000000000000000 00000000000 00000 0000000110000 Q ss_pred -ccc-cccccccc---cc--eeecc-cccccccC-------chhhhhhccccceEEEEEeCC---CcEEEEcccc----- Q lcl|Aclame:pro 220 -KPW-ESPGNQGV---LI--QDVAR-VIDYNILD-------KSTEGDLLNRNGVSYFARTSM---GGFSLIGNRT----- 276 (388) Q Consensus 220 -d~~-~s~~n~p~---~~--~g~~~-~~~~~~~~-------~~~~~~~Ln~~gIn~i~~~~~---~G~~~wG~rT----- 276 (388) +.. ......+. .+ .+... ........ ...-...+...+-..-..... .+....+..+ T Consensus 207 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tv~v~~~~~~vg~~v~~~~~~~~~~~~~~~~~~v~~~~~~~~t~~~~~~~ 286 (743) T protein:vir:10 207 TDNSTTEVGRTPGTYSNVPASGGTGTGATFNVVVADAGGGVGGSVVVTLANPGTGYNQGETLTIASAATGDGTDILVTVA 286 (743) T ss_pred cccccccccccccceeeEEecccccccccccccccccccccccccccccccccceeeeccccccccccccccccchhhee Confidence 000 00000000 00 00000 00000000 000000000000000000000 0000000000 Q ss_pred -CCCceeeehhhHHHHHHHHHHHHHHHhcccCCHHHHHHHHH--HHHHHHHHHHhcCCeeeeeEE-EeccCCCHHHhhCC Q lcl|Aclame:pro 277 -VTGKFISFVGLEDAIARKLEAASQRAMSKQLTKSFMEQEIK--KINLFMQDLVAAEIIPGGEVY-LHPTLNTVERYKNG 352 (388) Q Consensus 277 -~~~~~i~vrR~~~~i~~~i~~~~~~~vfepn~~~~~~~i~~--~i~~~L~~l~~~Gal~g~~v~-~d~~~Nt~~~i~~G 352 (388) ..|.-+..--..+.-.+..... +..+|..+.. ....+-.. ..+.-..+.+. +|.. ... ....| T Consensus 287 ~~~~g~~~~~a~~~~~~~~~~~~---------~~~~~~~~~~~~~t~~~~~~--~~~~~d~~~v~v~~~~-~~~-~~~~~ 353 (743) T protein:vir:10 287 TLSDGTIAITELKDWYLNTEIGS---------TGIKLGDIGPRPGTSQFATD--NGITDDQVHFAVIDTT-GEL-TGTAN 353 (743) T ss_pred cccccceeeeecccccccchhhc---------cccccccccccceeeecccc--ccccccceEEEEecCc-cee-eeccC Confidence 0011111000000000000000 0000000000 00000000 00000011111 1111 000 00111 Q ss_pred eEEEEEEEEecCcc-eeEEEEEEEcchHHHHHHHhcC Q lcl|Aclame:pro 353 SWYIVIDYGRYSPN-EHMIFHLNAVDRIVEEFIEEVL 388 (388) Q Consensus 353 ~~~~~v~~~p~~pa-e~I~~~~~~~~~~~~~l~~~~~ 388 (388) .+...+.+....+. +...-...+-.+++......+. T Consensus 354 ~v~~~~~~~s~~~~~~~~~~~~~~~~~~~~~~s~~~~ 390 (743) T protein:vir:10 354 TIVERLTYLSKLSDARSEENANIYYKNVINEQSAYLY 390 (743) T ss_pred ceeEEEeeeecccccccccCcceeecceeccccceee Confidence 11111111100000 0000000000000000000000 No 58 >protein:vir:276 Length: 369 # NCBI annotation: putative tail sheath protein # Family: family:all:669 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536656;genbank:gi:17975134;genbank:GeneID:929090 Probab=97.63 E-value=3.5e-05 Score=45.01 Aligned_cols=332 Identities=11% Similarity=0.009 Sum_probs=181.9 Q ss_pred CCCCCCcCCCeEEEEcCCCcccccccCcceeEEEeeccccccccccCcceeeccchhhhhhcccccccccchhhhhhhhc Q lcl|Aclame:pro 1 MPVIDQFEHNGISIETHEPPPPMGPPGDNVVAWVVTAPDKHADVAFSVPFRVANTADAQYLDSTGNELGTGWHAASETLK 80 (388) Q Consensus 1 M~~~t~~~hGV~~~e~~~~~~~i~~v~tav~g~vgta~~~~~~~~~~~~v~v~s~~~~~~~~~~~~~~gtl~~a~~~~~~ 80 (388) |+ .+-|.+...|-+..++..++- ...|||+...... ..+...+...+|...+.+. ....|..-+.+... T Consensus 1 m~-----~~~V~in~~n~~qg~~~~ver-~~lfig~g~~~~~---~g~~~~~~~~sdld~~lg~--~ds~lk~~v~aa~~ 69 (369) T protein:vir:27 1 MA-----WPTVIIKILNLMNGPIADIEC-HFLFVIRGTVSGE---VRNLIMVDSTSDLDDVLAE--ASAEGLAIVKAAQL 69 (369) T ss_pred CC-----CCceEEecccccCCCcccccc-eEEEEEecccccc---ccceEEecCccchHhhcCC--cChhHHHHHHHHHh Confidence 55 355888888888877777664 4558865543221 1222344455555554433 23446666666666 Q ss_pred cccceE--EEEecccccccccccccccccccchhhhhhhHhhhhhhh--hhhhheecccccchhHHHHHHHHHhh----h Q lcl|Aclame:pro 81 KTSVPQ--YFIVVPEGADDAATMANIIGGIDPTTGRRTGIAALTECT--ERPTLIGAPGFSQNKAVIDALASMAK----R 152 (388) Q Consensus 81 ~~~~~~--~vv~~~~~~~~~~~~~~~~~~~~~~tg~~tgl~a~~~~~--~~p~ll~ap~~~~~~~v~~~l~~~~~----~ 152 (388) +.+..- .+....+.+ + .+.|+.... ..+..+.+-+-++.++.+.++.++++ + T Consensus 70 naG~~w~a~~~p~~~~~-------~-------------~~~Av~~a~~~~s~E~V~v~~p~t~~a~i~aaq~~a~el~~~ 129 (369) T protein:vir:27 70 NGKQAWTAGVMILSEED-------N-------------WQDAVKKANEVSSFEFVVLGFDAETKAMIEDAITLRTELKNS 129 (369) T ss_pred CCCCceEEEEEEeCCch-------h-------------HHHHHHhhhhhCCccEEEEecCcccHHHHHHHHHHHHHHHHh Confidence 655321 111111110 0 111111111 11222333333344444444444443 2 Q ss_pred C--ceEEEEecCCC-----cchh--HHHHHHHhhhcccccceEEEEecceecccccccceeehhhHHHHHHHHhcc--cc Q lcl|Aclame:pro 153 L--KCRAVIDGPSG-----STQD--AIDLSGLLGGEGTGHDRVYMVDPMPAIYSRKAQGNIYVPPSTIAMGAVAAV--KP 221 (388) Q Consensus 153 ~--~~~~i~d~p~~-----~~~~--~~~~~~~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~s~~~aG~~a~~--d~ 221 (388) + ..++++..+.. ..+. .....-.....++.+.++.++--+.... .-.|.++|.+|.. .- T Consensus 130 ~~R~vffi~e~~~~~~~~~~~e~w~dy~a~l~al~~g~a~~~V~vv~~~~~~g----------n~~G~~aGRl~n~aVsI 199 (369) T protein:vir:27 130 LGREVGVLCQLPAINNDPTNGQTWSEWLADTVDIPKDVASEYISVVPNVHAAG----------DTLGKYAGRLANKEVSI 199 (369) T ss_pred cCCeEEEEEeccccCCCccccCCHHHHHHHHHHHhhccCcccceeeeeecccc----------chHHHHHHHHHhcccch Confidence 2 45667664421 1111 1111122234567788888873222211 2467788888653 33 Q ss_pred cccccccccc-ceeec-ccc-cccccCchhhhhhccccceEEEEEeCCC-cEEEEcc-ccCC-----CceeeehhhHHHH Q lcl|Aclame:pro 222 WESPGNQGVL-IQDVA-RVI-DYNILDKSTEGDLLNRNGVSYFARTSMG-GFSLIGN-RTVT-----GKFISFVGLEDAI 291 (388) Q Consensus 222 ~~s~~n~p~~-~~g~~-~~~-~~~~~~~~~~~~~Ln~~gIn~i~~~~~~-G~~~wG~-rT~~-----~~~i~vrR~~~~i 291 (388) ..+|....-+ +.|+. .+. .....++.+....|..+|-.+.+.++|. |+ +|++ ||+. ++||..+|..+-+ T Consensus 200 adsp~RVktG~l~g~~~~p~d~~g~~l~~a~l~aLd~agysvp~~Y~gy~G~-Yw~d~~tl~~~gsDYq~iE~~RVvdKa 278 (369) T protein:vir:27 200 ADSPARVQTGSVLGNTELMKDKAGKALDLATLKALESNRIAVPMWYPDYPGQ-YWTTGRTLDVPGGDYQDIRHIRVAMKA 278 (369) T ss_pred hcCcceeeecccccccccccCCCCcccCHHHHHHHHhCCCeEEEeeCCCCce-EEeCceEeccCCCCeehhhhhhHHHHH Confidence 5555543332 22322 121 1223356677888999999999999874 66 6664 7764 7899999999999 Q ss_pred HHHHHHHHHHHhcc---cCCHHHHHHHHHHHHHHHHHHHhcCCeeeeeEEEeccCCCHHHh-----hCCeEEEEEEEEec Q lcl|Aclame:pro 292 ARKLEAASQRAMSK---QLTKSFMEQEIKKINLFMQDLVAAEIIPGGEVYLHPTLNTVERY-----KNGSWYIVIDYGRY 363 (388) Q Consensus 292 ~~~i~~~~~~~vfe---pn~~~~~~~i~~~i~~~L~~l~~~Gal~g~~v~~d~~~Nt~~~i-----~~G~~~~~v~~~p~ 363 (388) .|.++...-..+.. +.++.-.+..+..+..=|++|.+.+ ..++|.-.++. || ...++.+.+.+.|. T Consensus 279 ~R~vR~~Ai~~i~Dr~lnstp~sia~~~~~~~~pLr~M~ks~--fpgei~~P~d~----dI~i~w~~k~~V~I~~~vrP~ 352 (369) T protein:vir:27 279 ARKVRIRAIARIADRTLNSTPQSIAAAKLYFTQDLRTMALTG--VPGEIYPPEDE----DIQIKWVNSTDVEIYMSVQPY 352 (369) T ss_pred HHHHHHHHHHHhcCcccccChhHHHHHHHHHhhHHHHHHhhc--CCeEEecCCCC----ceEEEeeccceEEEEEEEeec Confidence 98887666555443 3445566667777777788886553 33333333221 23 34577888888888 Q ss_pred CcceeEEEEEEEcchHH Q lcl|Aclame:pro 364 SPNEHMIFHLNAVDRIV 380 (388) Q Consensus 364 ~pae~I~~~~~~~~~~~ 380 (388) --.+.|+..+..|-.-+ T Consensus 353 ~~pk~it~~I~ldl~~~ 369 (369) T protein:vir:27 353 ECPVKITIAISVKQGDY 369 (369) T ss_pred cCCceEEEEEEEeccCC Confidence 88899999998885433 No 59 >protein:vir:104477 Length: 749 # NCBI annotation: gp18 # Family: family:all:661 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214663;genbank:gi:61806304;genbank:GeneID:3294532 Probab=97.28 E-value=3.6e-06 Score=50.43 Aligned_cols=191 Identities=12% Similarity=0.018 Sum_probs=86.4 Q ss_pred CCCCCCcC-CCeEEEEcCCCcccccccCcceeEEEeeccccccccccCcceeeccchhhhhhcccccccccchhhhhhhh Q lcl|Aclame:pro 1 MPVIDQFE-HNGISIETHEPPPPMGPPGDNVVAWVVTAPDKHADVAFSVPFRVANTADAQYLDSTGNELGTGWHAASETL 79 (388) Q Consensus 1 M~~~t~~~-hGV~~~e~~~~~~~i~~v~tav~g~vgta~~~~~~~~~~~~v~v~s~~~~~~~~~~~~~~gtl~~a~~~~~ 79 (388) ||. +|+ +|||++|+..+ ++|..+.|++.+|||.++-+ |.++|++|+|+.++...|+.......+..++..+| T Consensus 1 M~~--~~~~PgVyv~e~~~~-~~~~~~~t~~~~fvG~~~~G----p~~~p~~v~s~~~~~~~fG~~~~~~~~~~~v~~~F 73 (749) T protein:vir:10 1 MAT--NQSSPGVVIQERDLT-TVSTIPTANVGVIAAPFTKG----PVEEVIEITSERQLAEKFGEPNESNYEYWFSAAQF 73 (749) T ss_pred CCc--cccCCeeEEEEecCC-cccccccCceeEEEeccCCC----CCccCEEcCCHHHHHHHcCCccCCcccHHHHHHHH Confidence 886 344 79999999776 56788999999999999877 66899999999999999998888788999999999 Q ss_pred ccccceEEEEecccccccccccccccccccchhhhhhhHhhhhhhhhhhhheecccccchhHHHHHHHHHhhhCceEEEE Q lcl|Aclame:pro 80 KKTSVPQYFIVVPEGADDAATMANIIGGIDPTTGRRTGIAALTECTERPTLIGAPGFSQNKAVIDALASMAKRLKCRAVI 159 (388) Q Consensus 80 ~~~~~~~~vv~~~~~~~~~~~~~~~~~~~~~~tg~~tgl~a~~~~~~~p~ll~ap~~~~~~~v~~~l~~~~~~~~~~~i~ 159 (388) .|++..|++||+........+. ..++ +.+... ... T Consensus 74 ~ngg~~~~vvRv~~~~~~~a~~--------~~~~-----------------~~~~~~---~~~----------------- 108 (749) T protein:vir:10 74 LSYGGLLKTIRVNSSSLKNAVD--------TGTA-----------------PLVKNL---QDY----------------- 108 (749) T ss_pred hhcCCeEEEEEccCcccccccc--------cccc-----------------cccccc---ccc----------------- Confidence 9999999999985433211110 0000 000000 000 Q ss_pred ecCCCcchhHHHHHHHhhhcccccceEEEEecceecccccccceeehhhHHHHHHHHhccccccccccccccceeecccc Q lcl|Aclame:pro 160 DGPSGSTQDAIDLSGLLGGEGTGHDRVYMVDPMPAIYSRKAQGNIYVPPSTIAMGAVAAVKPWESPGNQGVLIQDVARVI 239 (388) Q Consensus 160 d~p~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~s~~~aG~~a~~d~~~s~~n~p~~~~g~~~~~ 239 (388) ..... ... ..-+...-+| |.|...+-+. +. T Consensus 109 ---~~~~~----------~~~-~~~~~~a~~p--------------------------------G~~gn~l~v~-v~--- 138 (749) T protein:vir:10 109 ---ETSIE----------DAS-NNFSWVARTP--------------------------------GDTGNSIGIF-VT--- 138 (749) T ss_pred ---ccccc----------ccc-cceEEEeccC--------------------------------CCcCCceEEE-EE--- Confidence 00000 000 0000111111 1111100000 00 Q ss_pred cccccCchhhhhhccccceEEEEEeCCCcEEEEccccCCCceeeehhhHHHHHHHHHHHHHHHhcccCCHHHHHHHHHHH Q lcl|Aclame:pro 240 DYNILDKSTEGDLLNRNGVSYFARTSMGGFSLIGNRTVTGKFISFVGLEDAIARKLEAASQRAMSKQLTKSFMEQEIKKI 319 (388) Q Consensus 240 ~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~~~~i~vrR~~~~i~~~i~~~~~~~vfepn~~~~~~~i~~~i 319 (388) +.. .. .+..+-. ++.+ |+ T Consensus 139 -----~~~--~~-----~~~~~~~-~~~~---~~---------------------------------------------- 156 (749) T protein:vir:10 139 -----DAG--AD-----QVVVVPA-PGSG---NE---------------------------------------------- 156 (749) T ss_pred -----cCC--Cc-----eeeeeec-CCcc---ce---------------------------------------------- Confidence 000 00 0000000 0000 00 Q ss_pred HHHHHHHHhcCCeeeeeEEEeccCCCHHHhhCCeEEEEEEEEecCcceeEEEEEEEcchHHHHHHHhcC Q lcl|Aclame:pro 320 NLFMQDLVAAEIIPGGEVYLHPTLNTVERYKNGSWYIVIDYGRYSPNEHMIFHLNAVDRIVEEFIEEVL 388 (388) Q Consensus 320 ~~~L~~l~~~Gal~g~~v~~d~~~Nt~~~i~~G~~~~~v~~~p~~pae~I~~~~~~~~~~~~~l~~~~~ 388 (388) ..+..+. ..+. ..| -......-.+.+.+... ...++ T Consensus 157 ---------------~~~~~~~-~~~~---~~~-------~~~~~~~~~~~~~~~~~-------~~~~~ 192 (749) T protein:vir:10 157 ---------------HEFVADA-AVSA---ASG-------AAGKVFKYSIILTIDDV-------VGTFA 192 (749) T ss_pred ---------------eeEEeee-cccc---ccc-------ccccccccceeeeeccc-------cceee Confidence 0000000 0000 001 01111111122221111 11112 No 60 >protein:vir:4463 Length: 498 # NCBI annotation: Tail Sheath protein # Family: family:all:369 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700386;genbank:gi:23505458;genbank:GeneID:955665 Probab=97.05 E-value=0.00019 Score=40.97 Aligned_cols=349 Identities=13% Similarity=0.072 Sum_probs=160.1 Q ss_pred CCCC------CCcCCCeEEEEcCCCcccccccCcceeEEEeeccccccccccCcceeeccchhhhhhcccccccccchhh Q lcl|Aclame:pro 1 MPVI------DQFEHNGISIETHEPPPPMGPPGDNVVAWVVTAPDKHADVAFSVPFRVANTADAQYLDSTGNELGTGWHA 74 (388) Q Consensus 1 M~~~------t~~~hGV~~~e~~~~~~~i~~v~tav~g~vgta~~~~~~~~~~~~v~v~s~~~~~~~~~~~~~~gtl~~a 74 (388) |... .--.+|+|+|..++.... .....-.-++|..-. ++....+++++++|..+...+++.+. .+..- T Consensus 1 M~IsF~~IP~~iRvP~~y~E~dns~A~~--~~~~q~vLiiGq~la-~gs~~~~~~v~v~s~~~a~~~fG~GS---ml~~M 74 (498) T protein:vir:44 1 MAISFNSIPSDTRVPLFYAEMDNSAANT--ARDSGASLLIGHASN-DASIAVNSLVLVSSVDYARQICGAGS---QLARM 74 (498) T ss_pred CCCchhhcCcccccCeEEEEEeCCCCCC--CcCCcceEEEEecCc-ccccccceeEeecCHHHHHHhcCccc---HHHHH Confidence 5542 112458999987765532 333344556775543 34556788999998887777766531 11110 Q ss_pred hhhh--------------hcc-----------------cc-------ceEEEEeccccccccccc--------------- Q lcl|Aclame:pro 75 ASET--------------LKK-----------------TS-------VPQYFIVVPEGADDAATM--------------- 101 (388) Q Consensus 75 ~~~~--------------~~~-----------------~~-------~~~~vv~~~~~~~~~~~~--------------- 101 (388) +..+ .+. .+ +..+.+.+..++..+.-. T Consensus 75 ~~a~~~~n~~~~l~~i~~~D~aG~aAtg~it~tg~at~~G~l~l~Igg~~v~v~V~~gdTaa~vA~al~aaina~~~lPV 154 (498) T protein:vir:44 75 VGAYRKTDPFGELYVIAVPESTGAAATVALTVTGEATETGTVNVYTGRTRVQAPVTSGDDAAAVAVSIKDAVNANPDLPF 154 (498) T ss_pred HHHHHHhCCCceeEEEecCCcccceeEEEEEeecccCCCcEEEEEECCEEEEEEecCCCCHHHHHHHHHHHHhCCCCCce Confidence 0000 000 00 000111111111110000 Q ss_pred -ccccccccchhhhhhhHhhhh---hh--------hhhhh-----heecccccchhHHHHHHHHHhhhC----------- Q lcl|Aclame:pro 102 -ANIIGGIDPTTGRRTGIAALT---EC--------TERPT-----LIGAPGFSQNKAVIDALASMAKRL----------- 153 (388) Q Consensus 102 -~~~~~~~~~~tg~~tgl~a~~---~~--------~~~p~-----ll~ap~~~~~~~v~~~l~~~~~~~----------- 153 (388) +...++.-..|.++.|...-. .+ ...|. +...-|-..++++..+|.++.+.. T Consensus 155 TA~~~~~~vtlTAr~kG~~GN~I~l~~~~~~~~~ge~~p~Glt~titamsgGag~PDia~alaal~~~~~~~i~~p~~D~ 234 (498) T protein:vir:44 155 TATSEAGVVTLTARHKGLYGNEIPVTLNYYGFGGGEVLPAGVNITVASGVKGAGAPALNDAVAAMGDEPFDYIGLPFNDT 234 (498) T ss_pred EEeeccceEEEEEeccCcccCcceEEEeeccCccccccccceeEEEEcccCCccCchhHHHHHhhccCCccEEEEeecCH Confidence 001122222233333221000 00 00010 111112223334444444443332 Q ss_pred -----------------------ceEEEEecCCCcchhHHHHHHHhhhcccccceEEEEecceecccccccceeehhh-- Q lcl|Aclame:pro 154 -----------------------KCRAVIDGPSGSTQDAIDLSGLLGGEGTGHDRVYMVDPMPAIYSRKAQGNIYVPP-- 208 (388) Q Consensus 154 -----------------------~~~~i~d~p~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~-- 208 (388) .++++.-...+.. ....+ ....++.|+-+.+..- ...-|+ T Consensus 235 asl~al~~~L~~~sgRw~~~~q~~g~~~~a~~gT~a-~l~t~-----g~~~N~~~it~~~~~~---------~~~sp~~~ 299 (498) T protein:vir:44 235 ASVNSMATEMNDSSGRWSYVRQLYGHVYTAKTGTLS-ELVAA-----GDQFNLQHITLAGYEK---------DTQTPADE 299 (498) T ss_pred HHHHHHHHHHhhhhcchHHHhhcCeEEEEeccCCHH-HHHHh-----hhccCCceEEEEecCC---------CCCCHHHH Confidence 2333332222111 11111 1133445554432110 001122 Q ss_pred -HHHHHHHHh---ccccccccccccccceeecccccccccCchhhhhhccccceEEEEEeCCCcEE-E-----------E Q lcl|Aclame:pro 209 -STIAMGAVA---AVKPWESPGNQGVLIQDVARVIDYNILDKSTEGDLLNRNGVSYFARTSMGGFS-L-----------I 272 (388) Q Consensus 209 -s~~~aG~~a---~~d~~~s~~n~p~~~~g~~~~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~-~-----------w 272 (388) ++++|++.+ +.||.+....-.+ .|+..+ ......+..|+|.|..+||.++.-. .|-+ + + T Consensus 300 ~AAa~a~~aA~~l~~DPArPL~tl~L--~Gi~~p-~~~~r~~~~ern~LL~~Gist~~V~--~G~V~I~R~ITTY~~n~~ 374 (498) T protein:vir:44 300 LAASRTARAAVFIRNDPARPTQTGEL--VDMLPA-PKGKRFTTTEQQTLLSHGVATAYVE--SGVLRIQRDITTYRKNAY 374 (498) T ss_pred HHHHHHHHHHHHhhcccccccCceee--cccccC-CchhcCChHHHHHHHhcCcceEEEc--CCeEEEEeeeeeeeecCC Confidence 224444444 6777653332223 233322 3345568889999999999998764 3422 1 1 Q ss_pred ccccCCCceeeehhhHHHHHHHHHHHHHHHhc-ccCCH-----------HHHHHHHHHHHHHHHHHHhcCCeeeee---- Q lcl|Aclame:pro 273 GNRTVTGKFISFVGLEDAIARKLEAASQRAMS-KQLTK-----------SFMEQEIKKINLFMQDLVAAEIIPGGE---- 336 (388) Q Consensus 273 G~rT~~~~~i~vrR~~~~i~~~i~~~~~~~vf-epn~~-----------~~~~~i~~~i~~~L~~l~~~Gal~g~~---- 336 (388) |.-..+|..|.+.|+.+|+.+.++..+...-. +.... .|-..|+..+-.-+++|-.+|-+..++ T Consensus 375 G~~D~syLDi~T~~tl~yvr~~~r~~i~~kfpR~KLa~d~~~~~~gq~IvTp~~ir~eli~~y~~le~~givEn~~~~~~ 454 (498) T protein:vir:44 375 GVADNSYLDSETLHTSAYVLRRLKSVITSKYGRHKLANDGTRFGSGQAIVTPAVIRGELGSTYRQMEREGIVENFDLFQQ 454 (498) T ss_pred CCcchhhhhhhhHHHHHHHHHHHHHHhhhhcCCcccccCCcccCCCcccccHHHHHHHHHHHHHhhhhhccccChhhhcc Confidence 33333588999999999999999987753322 12111 366789999999999999999888742 Q ss_pred -EEEeccCCCHHHhhCCeEEEEEEEEecCcce----eEEEEEEEcchHH Q lcl|Aclame:pro 337 -VYLHPTLNTVERYKNGSWYIVIDYGRYSPNE----HMIFHLNAVDRIV 380 (388) Q Consensus 337 -v~~d~~~Nt~~~i~~G~~~~~v~~~p~~pae----~I~~~~~~~~~~~ 380 (388) +.+.++.+ +.+|+.+.+-.-.+-++. .|.|+++++.+.- T Consensus 455 ~LiVerd~~-----dpnRln~~~p~d~vn~L~V~A~~~~f~lq~~~~~~ 498 (498) T protein:vir:44 455 HLIVERNAN-----DSNRLDVLFPPDYVNQLRVFAVLNQFRLQYSEEAA 498 (498) T ss_pred eeEEEECCC-----CCcEEEEEecccccCchhhhhhhhhhhhhhhhhcC Confidence 33433322 125555554444444433 3334444443322 No 61 >protein:vir:489 Length: 498 # NCBI annotation: putative sheath protein # Family: family:all:369 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543096;swissprot:trembl:q8w623;genbank:gi:18249908;uniprot:Q8W623;genbank:GeneID:929698 Probab=96.87 E-value=0.00028 Score=40.02 Aligned_cols=350 Identities=11% Similarity=0.039 Sum_probs=160.6 Q ss_pred CCCC------CCcCCCeEEEEcCCCcccccccCcceeEEEeeccccccccccCcceeeccchhhhhhcccccccccchhh Q lcl|Aclame:pro 1 MPVI------DQFEHNGISIETHEPPPPMGPPGDNVVAWVVTAPDKHADVAFSVPFRVANTADAQYLDSTGNELGTGWHA 74 (388) Q Consensus 1 M~~~------t~~~hGV~~~e~~~~~~~i~~v~tav~g~vgta~~~~~~~~~~~~v~v~s~~~~~~~~~~~~~~gtl~~a 74 (388) |... .--.+|+|++..++...+-.+ +...-++|..-. ++....+++++++|..+...+++.+. .+..- T Consensus 1 M~IsF~~IP~~iRvP~~y~E~dns~A~~~~~--~qrvLiiGq~la-~gt~~~~~~v~v~s~~~a~~~fG~GS---~l~~M 74 (498) T protein:vir:48 1 MTISFSAVPSDTLVPLFYAEMDNSAANTAVT--SAPALLIGHASN-DAAIEVNSLVLMPSADYARQICGAGS---QLARM 74 (498) T ss_pred CCccccccCcccccceEEEEEecCCCccccC--CcceEEEeecCc-cccccccceEEecCHHHHHHhcCccc---HHHHH Confidence 5542 112458999988876654333 244556665433 34556788999998887777765531 11100 Q ss_pred hhhh--------------hc-----------------cccc-------eEEEEeccccccccccc--------------- Q lcl|Aclame:pro 75 ASET--------------LK-----------------KTSV-------PQYFIVVPEGADDAATM--------------- 101 (388) Q Consensus 75 ~~~~--------------~~-----------------~~~~-------~~~vv~~~~~~~~~~~~--------------- 101 (388) +..+ .+ ..+. ..+-+.+..++..+.-. T Consensus 75 ~~a~~~~n~~~~l~~i~~~D~ag~aA~g~it~tg~at~~G~l~l~Igg~~v~v~V~~gdTaa~vA~al~aai~a~~~lPV 154 (498) T protein:vir:48 75 VDVYRQTDPFGELYVIAVPEARGAAATVRVTVTGEAEESGTLSLYVGRSSVQVPVVNGDDATAVATAIKEAVNGVITLPF 154 (498) T ss_pred HHHHHHhCCCceeEEEeeCCcccceeEEEEEecccccCCceEEEEECCEEEEEeecCCCCHHHHHHHHHHHHhCCCCcce Confidence 0000 00 0000 00111111111110000 Q ss_pred -ccccccccchhhhhhhHhhhh---hh--------hhhhh-----heecccccchhHHHHHHHHHhhhC----------- Q lcl|Aclame:pro 102 -ANIIGGIDPTTGRRTGIAALT---EC--------TERPT-----LIGAPGFSQNKAVIDALASMAKRL----------- 153 (388) Q Consensus 102 -~~~~~~~~~~tg~~tgl~a~~---~~--------~~~p~-----ll~ap~~~~~~~v~~~l~~~~~~~----------- 153 (388) +...++.-..|.++.|...-. .+ ...|. +-..-|-...+++..+|.++.+.. T Consensus 155 TA~~~~~~VtlTAr~kG~~GN~I~l~~~~~~~~~ge~~p~Glt~~itamsgGag~PDia~aLaal~~~~~~~I~~p~~D~ 234 (498) T protein:vir:48 155 AASSDAGVVTLTARHKGLYGNELPVCLNYYGSGGGEILPAGLQVVTEAGTAGSGAPDLTAAVAAMGDEAFDFIGLPFNDA 234 (498) T ss_pred EEEecCcEEEEEeeecccccccceeeeeeccCcccccccceeeEEEEcccCCccCcchHHHHHhhccCCccEEEEeecCH Confidence 011122222222222221000 00 00000 000111223334444444433322 Q ss_pred -----------------------ceEEEEecCCCcchhHHHHHHHhhhcccccceEEEEecceecccccccceeehhh-- Q lcl|Aclame:pro 154 -----------------------KCRAVIDGPSGSTQDAIDLSGLLGGEGTGHDRVYMVDPMPAIYSRKAQGNIYVPP-- 208 (388) Q Consensus 154 -----------------------~~~~i~d~p~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~-- 208 (388) .++++.-...+ -...... ....++.|+-+.+.. ....-|+ T Consensus 235 asl~al~~~L~~~sgRw~~~~q~~g~~~~a~~gT-~~~l~t~-----g~~~N~~~it~~~~~---------~~~~~p~~~ 299 (498) T protein:vir:48 235 ASINMMMTEMNDSSGRWSYARQLYGHVYTAKLGT-LSELVNA-----GDMHNQQHITLAGYE---------KETQSPVDE 299 (498) T ss_pred HHHHHHHHHHhhhhhhhhHHhhcCeEEEEeccCC-HHHHHHh-----hhccCCceEEEEecC---------CCCCChHHH Confidence 23333322221 1111111 123345555444311 1111233 Q ss_pred -HHHHHHHHh---ccccccccccccccceeecccccccccCchhhhhhccccceEEEEEeCCCcEEEE-----------c Q lcl|Aclame:pro 209 -STIAMGAVA---AVKPWESPGNQGVLIQDVARVIDYNILDKSTEGDLLNRNGVSYFARTSMGGFSLI-----------G 273 (388) Q Consensus 209 -s~~~aG~~a---~~d~~~s~~n~p~~~~g~~~~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~w-----------G 273 (388) .++.|++.+ +.||.+....-.+ .|+..+ ......+..|+|.|..+||.++.- .++-..+- | T Consensus 300 ~AAa~a~~aA~~l~~DPArPLqtl~L--~Gi~~p-~~~~r~~~~ern~LL~~Gist~~V-~~G~V~I~R~ITTY~~n~~G 375 (498) T protein:vir:48 300 LVASRLAREAVFIRNDPARPTQTGEL--VGMLPA-PKGKRFIMTEQQTLLSHGVATAYV-EGGTLRIQRSVTTYKKNAYG 375 (498) T ss_pred HHHHHHHHHHHhhhccccccccceee--eccccC-CchhcCChHHHHHHHhcCcceEEE-cCCeEEEEeeeeeeeecCCC Confidence 234444444 6788663333222 344333 344556889999999999999876 44333221 3 Q ss_pred cccCCCceeeehhhHHHHHHHHHHHHHHHhc-ccCCHH-----------HHHHHHHHHHHHHHHHHhcCCeeee---e-- Q lcl|Aclame:pro 274 NRTVTGKFISFVGLEDAIARKLEAASQRAMS-KQLTKS-----------FMEQEIKKINLFMQDLVAAEIIPGG---E-- 336 (388) Q Consensus 274 ~rT~~~~~i~vrR~~~~i~~~i~~~~~~~vf-epn~~~-----------~~~~i~~~i~~~L~~l~~~Gal~g~---~-- 336 (388) .-..+|..|.+.|+.+|+.+.++..+...-- +..... |-..|+..+-.-+++|-.+|-+..+ + T Consensus 376 ~~D~syLDi~T~~tl~yvr~~~r~~i~~kfpR~KLa~dg~~~~~gq~IvTp~~ir~eli~~y~~le~~given~~~~~~~ 455 (498) T protein:vir:48 376 VADNSYLDSETLHTSAYVLRKLKSVITSKYGRHKLANDGTRFGPGQAIVTPAVIKGELLATYRQMERAGIVENYDLFKQY 455 (498) T ss_pred CcchhhhhhhhHHHHHHHHHHHHHHhhhhcCCceecccCcccCCCCcccchHHHHHHHHHHHHhhhhhccccChhhhcce Confidence 3333588999999999999999987754322 122222 6678999998999999999988874 2 Q ss_pred EEEeccCCCHHHhhCCeEEEEEEEEecCcce----eEEEEEEEcchHH Q lcl|Aclame:pro 337 VYLHPTLNTVERYKNGSWYIVIDYGRYSPNE----HMIFHLNAVDRIV 380 (388) Q Consensus 337 v~~d~~~Nt~~~i~~G~~~~~v~~~p~~pae----~I~~~~~~~~~~~ 380 (388) +.+.++.+ +.+|+.+.+-.-.+-+.. .|.|+++++..-- T Consensus 456 LiVerd~~-----dpnRln~~~p~d~vn~L~V~A~~~~f~lq~~~~~~ 498 (498) T protein:vir:48 456 LIVERDAD-----NPNRLNTLFPPDYVNQLRVFAVVNQFRLQYSEESA 498 (498) T ss_pred eEEEECCC-----CCcEEEEEecccccCchhhhhhhhhhhhhhhhcCC Confidence 33433322 124555444433333332 3334444433322 No 62 >protein:vir:1996 Length: 495 # NCBI annotation: major tail subunit # Family: family:all:369 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050643;genbank:gi:9633530;genbank:GeneID:2636269 Probab=96.52 E-value=0.00054 Score=38.48 Aligned_cols=347 Identities=12% Similarity=0.045 Sum_probs=158.9 Q ss_pred CCCCC-------CcCCCeEEEEcCCCcccccccCcceeEEEeeccccccccccCcceeeccchhhhhhcccccccccchh Q lcl|Aclame:pro 1 MPVID-------QFEHNGISIETHEPPPPMGPPGDNVVAWVVTAPDKHADVAFSVPFRVANTADAQYLDSTGNELGTGWH 73 (388) Q Consensus 1 M~~~t-------~~~hGV~~~e~~~~~~~i~~v~tav~g~vgta~~~~~~~~~~~~v~v~s~~~~~~~~~~~~~~gtl~~ 73 (388) |+.++ --.+|+|+|-.++...+-.+....-.-++|..-. ++....+++++++|..+...+++.+. .+.. T Consensus 1 m~~i~F~~IP~~iRvP~~y~E~dns~A~~g~~~~~q~vLiiGq~la-~gs~~~~~pv~v~s~~~a~~~fG~GS---~la~ 76 (495) T protein:vir:19 1 MSDISFNAIPSDVRVPLTYIEFDNSNAVSGTPAPRQRVLMFGQSGS-KASAAPNVPVRIRSGSQASAAFGQGS---MLAL 76 (495) T ss_pred CCCCchhhCCcccccCeEEEEEccCCCCcCCcCCCceEEEEEecCc-ccccccceeEEecCHHHHHHhcCcCc---HHHH Confidence 66552 1245899998877554433444444556775433 24456688899988877776666531 0000 Q ss_pred hhhhh--------------h-----c------------ccc-------ceEEEEecccccccc----------------- Q lcl|Aclame:pro 74 AASET--------------L-----K------------KTS-------VPQYFIVVPEGADDA----------------- 98 (388) Q Consensus 74 a~~~~--------------~-----~------------~~~-------~~~~vv~~~~~~~~~----------------- 98 (388) -+..+ . . ..+ +..+.+.+..++..+ T Consensus 77 M~~a~~~~n~~~~l~~i~~~D~aG~aA~g~it~tg~at~~G~l~l~I~g~~v~v~V~~gdTaa~vA~al~aaina~~~lP 156 (495) T protein:vir:19 77 MADAFLNANRVAELWCIPQGNGTGNAAVGEISLSGTAGENGSLVTYIAGQRLAVSVAAGATGAALADLLVARIKGQPDLP 156 (495) T ss_pred HHHHHHHhCCcceEEEEeeCChhhceeEEEEEEeecCCCCcEEEEEECCEEEEEEecCCCCHHHHHHHHHHHhcCCccCc Confidence 00000 0 0 000 000111111111000 Q ss_pred cccc-------cccccccchhhhhhhHhhhhhhhh---------h-----hhheecccccchhHHHHHHHHHhhhC---- Q lcl|Aclame:pro 99 ATMA-------NIIGGIDPTTGRRTGIAALTECTE---------R-----PTLIGAPGFSQNKAVIDALASMAKRL---- 153 (388) Q Consensus 99 ~~~~-------~~~~~~~~~tg~~tgl~a~~~~~~---------~-----p~ll~ap~~~~~~~v~~~l~~~~~~~---- 153 (388) ++.+ .-..+.-..|.+++|- . ..+.. . ..+-...|-..++++..+|.++.+.. T Consensus 157 vTA~~~~~~~~~~a~~~VtlTAr~kG~-~-n~idi~~~~~~ge~~p~Glt~titamsgGag~PDia~alaal~~~~~~~I 234 (495) T protein:vir:19 157 VTAEVRADSGDDDTHADVVLSAKFTGA-L-SAVDVRWNYYAGETTPYGIITAFKAASGKNGNPDISASIAGMGDLQYKYI 234 (495) T ss_pred eEEEeeccCCCCcCceeEEEEEeeccc-c-ccceeEEEeecccccccceeEEEEecCCCCCCcchHHHHHHhccCCCcEE Confidence 0000 0011111222222221 1 00000 0 01111112223334444444443332 Q ss_pred ---------------------------ceEEEEecCCCcchhHHHHHHHhhhcccccceEEEEecceecccccccceeeh Q lcl|Aclame:pro 154 ---------------------------KCRAVIDGPSGSTQDAIDLSGLLGGEGTGHDRVYMVDPMPAIYSRKAQGNIYV 206 (388) Q Consensus 154 ---------------------------~~~~i~d~p~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~ 206 (388) .++++.-..++. .+...+ ....++.|+-+++ +. + ..- T Consensus 235 ~~P~tD~asL~al~~~l~~rw~~~~q~~g~~~~a~~gT~-~~l~t~-----g~~~N~~~it~~~--~~------g--sp~ 298 (495) T protein:vir:19 235 VMPYTDEPNLNLLRTELQERWGPVNQADGFAVTVLSGTY-GDISTF-----GVSRNDHLISCMG--IA------G--APE 298 (495) T ss_pred EEecCcHHHHHHHHHHHHHhhhHHHhcCeEEEEeecCCH-HHHHHh-----hhccCCceEEEEe--cC------C--CCC Confidence 233333222211 111111 1233455554442 10 0 011 Q ss_pred hhH---HHHHHHH---hccccccccccccccceeecccccccccCchhhhhhccccceEEEEEeCCCcEEE--------- Q lcl|Aclame:pro 207 PPS---TIAMGAV---AAVKPWESPGNQGVLIQDVARVIDYNILDKSTEGDLLNRNGVSYFARTSMGGFSL--------- 271 (388) Q Consensus 207 p~s---~~~aG~~---a~~d~~~s~~n~p~~~~g~~~~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~--------- 271 (388) ||. ++++++. .+.||.+....-.+ .|+.-+ ......+..|+|.|..+||.++.-..++-..+ T Consensus 299 ~~~~~AAA~aa~~A~~l~~DPArPL~tl~L--~Gi~~p-~~~~r~~~~ern~LL~~Gist~~V~~~G~V~I~R~ITTY~~ 375 (495) T protein:vir:19 299 PSYLYAATLCAVASQALSIDPARPLQTLTL--PGRMPP-AVGDRFTWSERNALLFDGISTFNVNDGGEMQIERMITMYRT 375 (495) T ss_pred cHHHHHHHHHHHHHHHhhcccccccCceee--cceecC-CccccCChHHHHHHHhCCcceEEECCCCeEEEEeeeeeeee Confidence 222 2333333 35777654333333 233322 33455688999999999999987654332222 Q ss_pred --EccccCCCceeeehhhHHHHHHHHHHHHHHHhcc-cCCHH-----------HHHHHHHHHHHHHHHHHhcCCeeee-- Q lcl|Aclame:pro 272 --IGNRTVTGKFISFVGLEDAIARKLEAASQRAMSK-QLTKS-----------FMEQEIKKINLFMQDLVAAEIIPGG-- 335 (388) Q Consensus 272 --wG~rT~~~~~i~vrR~~~~i~~~i~~~~~~~vfe-pn~~~-----------~~~~i~~~i~~~L~~l~~~Gal~g~-- 335 (388) +|.-..+|..|++-|+.+|+.+.++......-.+ ..... |=..|+..+-.-+++|-.+|-+..+ T Consensus 376 n~~G~~D~syLDi~T~~tl~yvr~~~r~~i~~kfpR~KLa~d~~~~~~gq~IvTp~~ir~ell~~~~~le~~given~~~ 455 (495) T protein:vir:19 376 NKYGDSDPSYLNVNTIATLSYLRYSLRTRITQKFPNYKLASDGTRFATGQAVVTPSVIKTELLALFEEWENAGLVEDFDT 455 (495) T ss_pred cCCCCcchhhhhhHHHHHHHHHHHHHHHHHhhhcCCcccccCCCCCCCcccccChHHHHHHHHHHHHhhhhhccccChhh Confidence 2333345888999999999999998877644332 22222 5567999988899999999888874 Q ss_pred -e--EEEeccCCCHHHhhCCeEEEEEEEEecCcceeEEEEEEEcc Q lcl|Aclame:pro 336 -E--VYLHPTLNTVERYKNGSWYIVIDYGRYSPNEHMIFHLNAVD 377 (388) Q Consensus 336 -~--v~~d~~~Nt~~~i~~G~~~~~v~~~p~~pae~I~~~~~~~~ 377 (388) + +.+.++-+ +.+|+.+.+-...+-+..-+-.++++-- T Consensus 456 ~~~~LiVerd~~-----dpnRln~~~p~d~vn~L~V~A~~i~f~L 495 (495) T protein:vir:19 456 FKEELYVARNKD-----DKDRLDVLCGPNLINQFRIFAAQVQFIL 495 (495) T ss_pred hcceeEEEECCC-----CCcEEEEEecceeeCceeeeeeeeeeeC Confidence 2 33333322 1256666655555554443322222211 No 63 >protein:vir:4517 Length: 498 # NCBI annotation: tail sheath protein # Family: family:all:369 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599043;genbank:gi:19549001;genbank:GeneID:935236 Probab=96.31 E-value=0.00076 Score=37.69 Aligned_cols=349 Identities=11% Similarity=0.063 Sum_probs=159.2 Q ss_pred CCCC------CCcCCCeEEEEcCCCcccccccCcceeEEEeeccccccccccCcceeeccchhhhhhcccccccccchhh Q lcl|Aclame:pro 1 MPVI------DQFEHNGISIETHEPPPPMGPPGDNVVAWVVTAPDKHADVAFSVPFRVANTADAQYLDSTGNELGTGWHA 74 (388) Q Consensus 1 M~~~------t~~~hGV~~~e~~~~~~~i~~v~tav~g~vgta~~~~~~~~~~~~v~v~s~~~~~~~~~~~~~~gtl~~a 74 (388) |... .--.+|+|+|..++.... .....-.-++|..-. ++....+++++++|..+...+++.+. .+..- T Consensus 1 M~IsF~~IP~~iRvP~~y~E~dns~A~~--~~~~q~vLiiGq~la-~gs~~~~~~v~v~s~~~a~~lfG~GS---ml~~M 74 (498) T protein:vir:45 1 MTISFNTIPSNTLVPLFYAEMDNQAANT--AQDSGASLLIGHANN-GAEIVANSLVLMPSADYARQICGAGS---QLARM 74 (498) T ss_pred CCCchhhcCcccccCeEEEEEeCCCCCC--CCCCcceEEEEecCC-ccccccceeEEecCHHHHHHhcCcCc---HHHHH Confidence 5542 112459999987766633 333445556775543 34556788999998887777666531 11100 Q ss_pred hhhh--------------hc-----------------ccc-------ceEEEEeccccccccccc--------------- Q lcl|Aclame:pro 75 ASET--------------LK-----------------KTS-------VPQYFIVVPEGADDAATM--------------- 101 (388) Q Consensus 75 ~~~~--------------~~-----------------~~~-------~~~~vv~~~~~~~~~~~~--------------- 101 (388) +..+ .+ ..+ +..+.+.+..++..+.-. T Consensus 75 ~~a~~~~n~~~~l~~i~~~d~aG~aA~g~it~tg~at~~G~l~l~Igg~~v~v~V~~gdTaa~vA~al~aaina~~~lPV 154 (498) T protein:vir:45 75 VEAYRQTDPFGELYVIAVPEATGAAATVTLTVTGEATESGTVNVYVGRTRVQAPVTNGDNVTTIASSIQDAINAVPTLPF 154 (498) T ss_pred HHHHHHhCCcceEEEEeeCCcccceeEEEEEeecccCCCcEEEEEECCEEEEEEecCCCCHHHHHHHHHHHHhCCCCCce Confidence 0000 00 000 000111111111110000 Q ss_pred -ccccccccchhhhhhhHhh----hhhh-------hhhhh-----heecccccchhHHHHHHHHHhh------------- Q lcl|Aclame:pro 102 -ANIIGGIDPTTGRRTGIAA----LTEC-------TERPT-----LIGAPGFSQNKAVIDALASMAK------------- 151 (388) Q Consensus 102 -~~~~~~~~~~tg~~tgl~a----~~~~-------~~~p~-----ll~ap~~~~~~~v~~~l~~~~~------------- 151 (388) +...++.-..|.++.|... +... ...|. +...-|-..++++..+|.++.+ T Consensus 155 TA~~~~~~VtlTAr~kG~~GN~I~l~~~~~~~~~ge~~p~Glt~~itamagGag~PD~a~alaal~~~~~~~I~~p~~D~ 234 (498) T protein:vir:45 155 TASSSAGVVTLTARHKGLCGNEIPVSLNYYGFGGGEVLPAGVQIAVATGTAGTGAPVLTGAVAAMADEPFDYIGLPFNDT 234 (498) T ss_pred EEEecCceEEEEeeccCccccceeEEEeeccccccccccceeeEEEEccCCCccCchhHHHHHHhccCCccEEEEeeCCH Confidence 0011122222222222110 0000 00000 0000111222333333333332 Q ss_pred ---------------------hCceEEEEecCCCcchhHHHHHHHhhhcccccceEEEEecceecccccccceeehhh-- Q lcl|Aclame:pro 152 ---------------------RLKCRAVIDGPSGSTQDAIDLSGLLGGEGTGHDRVYMVDPMPAIYSRKAQGNIYVPP-- 208 (388) Q Consensus 152 ---------------------~~~~~~i~d~p~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~-- 208 (388) ++.++++.-...+ -...... ....++.|+-+.+..- ...-|| T Consensus 235 asL~al~~~L~~~sgRw~~~~q~~g~~~~a~~gT-~~~l~t~-----g~~~N~~~it~~~~~~---------~~~sp~~~ 299 (498) T protein:vir:45 235 ASVNTLVTEMNDTSGRWSYARQLYGHVYTAKTGT-LSELVNA-----GDQFNQQHITLAGYEK---------ETQTPADE 299 (498) T ss_pred HHHHHHHHHHhhhhhhhhHHhhcCeEEEEeccCC-HHHHHHh-----hhccCCceEEEEecCC---------CCCChHHH Confidence 2233333332221 1111111 1233455555443111 111122 Q ss_pred -HHHHHHHHh---ccccccccccccccceeecccccccccCchhhhhhccccceEEEEEeCCCcEE-E-----------E Q lcl|Aclame:pro 209 -STIAMGAVA---AVKPWESPGNQGVLIQDVARVIDYNILDKSTEGDLLNRNGVSYFARTSMGGFS-L-----------I 272 (388) Q Consensus 209 -s~~~aG~~a---~~d~~~s~~n~p~~~~g~~~~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~-~-----------w 272 (388) .+++||+.+ +.||.+....-.+ .|+..+ ......+..|+|.|..+||.++.-. .|-+ + + T Consensus 300 ~AAa~aa~~A~~l~~DPArPL~tl~L--~Gi~~p-~~~~r~~~~ern~LL~~Gist~~V~--~G~V~I~R~ITTY~~n~~ 374 (498) T protein:vir:45 300 LAASRTARAAVFIRNDPARPTQTGEL--VGMLPA-PKGKRFTMTEQQTLLSHGVATAYVE--SGVLRIQRDVTTYRKNAY 374 (498) T ss_pred HHHHHHHHHHHHhhcccccccCceee--cceecC-CchhcCChHHHHHHHhCCcceEEEc--CCeEEEEeeeeeeeecCC Confidence 334444444 6787653333333 233332 3345568889999999999998763 3432 1 1 Q ss_pred ccccCCCceeeehhhHHHHHHHHHHHHHHHhc-ccCCHH-----------HHHHHHHHHHHHHHHHHhcCCeeeee---- Q lcl|Aclame:pro 273 GNRTVTGKFISFVGLEDAIARKLEAASQRAMS-KQLTKS-----------FMEQEIKKINLFMQDLVAAEIIPGGE---- 336 (388) Q Consensus 273 G~rT~~~~~i~vrR~~~~i~~~i~~~~~~~vf-epn~~~-----------~~~~i~~~i~~~L~~l~~~Gal~g~~---- 336 (388) |.-..+|..|.+.|+.+|+.+.++..+...-- +.+... |-..|+..+-.-+++|-.+|-+..++ T Consensus 375 G~~D~syLDi~T~~tl~yvr~~~r~~i~~kfpR~KLa~dg~~~~~gq~IvTp~~ir~ell~~y~~le~~givEn~~~~~~ 454 (498) T protein:vir:45 375 GVADNSYLDSETLHTSAYVLRKLKSVITSKYGRHKLASDGTRFGPGQAIVTPAVIKGELLATYRQLERAGIVENYELFKQ 454 (498) T ss_pred CCcchhhhhhhhHHHHHHHHHHHHHHhhhhcCCeeecccCcccCCCCcccchHHHHHHHHHHHHhhhhhccccChhhhcc Confidence 33333588999999999999999987764422 111111 56788999989999999999888742 Q ss_pred -EEEeccCCCHHHhhCCeEEEEEEEEecCcc----eeEEEEEEEcchHH Q lcl|Aclame:pro 337 -VYLHPTLNTVERYKNGSWYIVIDYGRYSPN----EHMIFHLNAVDRIV 380 (388) Q Consensus 337 -v~~d~~~Nt~~~i~~G~~~~~v~~~p~~pa----e~I~~~~~~~~~~~ 380 (388) +.+.++.+ +.+|+.+.+-.-.+-+. -.|.|+++++.+-- T Consensus 455 ~LiVerd~~-----dpnRln~~~p~d~vn~L~V~A~~~~f~lq~~~~~~ 498 (498) T protein:vir:45 455 YLVVERDAS-----VPNRLNTLFPPDYVNQLRVFAVVNQFRLQYSEESA 498 (498) T ss_pred eeEEEECCC-----CCcEEEEEecccccCchhhhhhhhhhheehhhcCC Confidence 33333322 12455444433333332 33445555544433 No 64 >protein:vir:101576 Length: 501 # NCBI annotation: gp04 # Family: family:all:396 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958108;genbank:gi:41057654;genbank:GeneID:2716834 Probab=96.00 E-value=0.0011 Score=36.72 Aligned_cols=363 Identities=12% Similarity=0.046 Sum_probs=159.7 Q ss_pred CCCCCCcCCCeEEEEcCCCcccccccCcceeEEEeeccccccccccCcceeeccchhhhhhcccccccccchhhhhhhhc Q lcl|Aclame:pro 1 MPVIDQFEHNGISIETHEPPPPMGPPGDNVVAWVVTAPDKHADVAFSVPFRVANTADAQYLDSTGNELGTGWHAASETLK 80 (388) Q Consensus 1 M~~~t~~~hGV~~~e~~~~~~~i~~v~tav~g~vgta~~~~~~~~~~~~v~v~s~~~~~~~~~~~~~~gtl~~a~~~~~~ 80 (388) ||-.+-.+ -.++.+..+..+.....-...+++-+.... .+..+.....+..+....|+..... +++...+|. T Consensus 1 m~~~~ip~--s~iV~V~~~v~~~~~~~~~~~~l~l~~~~~---~~~~~~~~~~s~~~V~~~FG~~S~e---y~aA~~yFs 72 (501) T protein:vir:10 1 MPTTTIPI--DQIVQMLPGVIGAGGAPGRLTGLVLTQDTS---VQPGQLADFFQETDVENWFGALSNE---AKIADAYFP 72 (501) T ss_pred CCCCCccc--ceEEEEeeecccCCCccccceeEEEeccCC---CCccceEEecCHHHHHHhcCCChHH---HHHHHHHhh Confidence 99511112 223333333322222222233333333221 2333334455666666555443322 333333332 Q ss_pred ---cccc-e--EEEEeccccc---------ccccccccc---c-------ccccc-----------hhhhhhhHhhhhhh Q lcl|Aclame:pro 81 ---KTSV-P--QYFIVVPEGA---------DDAATMANI---I-------GGIDP-----------TTGRRTGIAALTEC 124 (388) Q Consensus 81 ---~~~~-~--~~vv~~~~~~---------~~~~~~~~~---~-------~~~~~-----------~tg~~tgl~a~~~~ 124 (388) +... + .++-|..... ..+.+.+++ . ++... -++.-+.+++.... T Consensus 73 g~~~q~p~P~~l~igR~~~~~~~~~l~g~~l~~~~la~~~~~sg~l~vti~g~~~~~~i~ls~ats~~~vAs~i~~al~~ 152 (501) T protein:vir:10 73 GIVNGGQLPYDLKFARYVAADAPASVYGIPLTGVTLAQLQGYSGTLTVTTAAQHVSANISLAAATSFANAATLIEAAFTS 152 (501) T ss_pred hhcCCCccccEEEEEeecCCCccceEeccchhhhhhhhcceeeeEEEEeeccceeecccccccccCHHHHHHHHhhhccC Confidence 2111 1 1111111000 000001110 0 00000 00011111111000 Q ss_pred h---------------------hhhh----------------------heecccccchhHHHHHHHHHhh---hCceEEE Q lcl|Aclame:pro 125 T---------------------ERPT----------------------LIGAPGFSQNKAVIDALASMAK---RLKCRAV 158 (388) Q Consensus 125 ~---------------------~~p~----------------------ll~ap~~~~~~~v~~~l~~~~~---~~~~~~i 158 (388) . .... .+...|.. ...+.++|.++.+ .+-.+.+ T Consensus 153 ~~~tv~~d~~~~~f~its~ttG~~~~i~~~~~~~~la~~l~Lt~~~~a~v~~~g~~-aet~~~a~~a~~~~~~~Wy~f~~ 231 (501) T protein:vir:10 153 PDFVVAYDALRNRFTVVTNATGTAAAISAVTGTNNLADELGLSAAAGATLQAAGVA-ADTPASAMNRAVGLSRNWATFTT 231 (501) T ss_pred CceEEEEcccCceEEEEeeccCCceeEEEeeCchhhhhhcCccccccceEEecCcc-cccHHHHHHHHHhccCceEEEEE Confidence 0 0001 11111111 1123344444433 3434555 Q ss_pred EecCCCcchhHHHHHHHhhhcccccceEEEEe---cceecccc---------cccceeeh------hhHHHHHHHHhccc Q lcl|Aclame:pro 159 IDGPSGSTQDAIDLSGLLGGEGTGHDRVYMVD---PMPAIYSR---------KAQGNIYV------PPSTIAMGAVAAVK 220 (388) Q Consensus 159 ~d~p~~~~~~~~~~~~~~~~~~~~s~~~~~~~---p~~~~~~~---------~~~~~~~~------p~s~~~aG~~a~~d 220 (388) ++.+. +++..+...|....+ ..+....+ +....... ..+-.+.+ .+.+++.|..+..| T Consensus 232 a~~~~--~~~~la~A~wiea~~--~~f~~~~~~~~~~~~~~~~~~~i~~~l~~~~y~~t~~~y~~~~~~aa~~g~~as~n 307 (501) T protein:vir:10 232 AWTAV--IADRLAFAAWNSGQA--YKYMYVAPDLEAASIVTNNAASFGAQVFAAPYQGTLPLYGDQATAGAVMGYAASIN 307 (501) T ss_pred ecCCC--hHHHHHHHHHHHhcC--ceEEEEEecCchhhhhhhhhhhHHHHHHhcCCCceEEECCCCcHHHHHHHHHHhhC Confidence 66543 334444555554322 12222221 11111000 00111111 25667788888888 Q ss_pred cccccccccccceeecccccccccCchhhhhhccccceEEEEEeCCCc--EEEEccccCC--CceeeehhhHHHHHHHHH Q lcl|Aclame:pro 221 PWESPGNQGVLIQDVARVIDYNILDKSTEGDLLNRNGVSYFARTSMGG--FSLIGNRTVT--GKFISFVGLEDAIARKLE 296 (388) Q Consensus 221 ~~~s~~n~p~~~~g~~~~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G--~~~wG~rT~~--~~~i~vrR~~~~i~~~i~ 296 (388) +.+-+.-.......+...+. ...++.+|++.|..+|+|+...+.+.| +.+|-.-+++ |.++.+-+=.+|+++.++ T Consensus 308 f~~~~g~~T~~fkq~~~Gi~-a~~lt~t~a~al~~~~~N~y~~~~~~~~~~~~~~~G~~sG~~~wiD~~~~~~Wl~~~iq 386 (501) T protein:vir:10 308 FQLRNGRTVLAFRQFNAGVP-ATAHDLPTANALRSNNYTYIGAYANAANNYTIAYDGKLSGKFLWVDTYLDQIYLNAELQ 386 (501) T ss_pred cccCccceeeeccccCCCcC-cccCCHHHHHHHHhcCCeEEEEeccccceeeEEecCeeeccceeehhhhhHHHHHHHHH Confidence 75443321111000000111 133678899999999999998885443 6677333333 555777777788888887 Q ss_pred HHHHHHhc---c-cCCHHHHHHHHHHHHHHHHHHHhcCCee-----------------------------eeeEEEeccC Q lcl|Aclame:pro 297 AASQRAMS---K-QLTKSFMEQEIKKINLFMQDLVAAEIIP-----------------------------GGEVYLHPTL 343 (388) Q Consensus 297 ~~~~~~vf---e-pn~~~~~~~i~~~i~~~L~~l~~~Gal~-----------------------------g~~v~~d~~~ 343 (388) ..+...+- + |-+..=...|+..++.-|++-+++|.|. ||.+..+... T Consensus 387 ~~l~~ll~~~~kIPyt~~G~~~l~a~v~~~l~~av~nG~I~~Gv~l~~~q~~~i~~~~g~~~~~~~v~~~Gyy~~~~~~~ 466 (501) T protein:vir:10 387 RAEFEAMLAYNSLPYNEDGYTALYRAGVDVIDAAVTSGIIRAGVTLTNSQLQQIDAAAGVAGAGQLVQTRGWYFLIGDPA 466 (501) T ss_pred HHHHHHHHhcCCcccCHHHHHHHHHHHHHHHHHHHhCceeecCCCCCcccceeeccccCccccccceeccceeEeecccc Confidence 77765433 2 6777888889999999999999998884 2445554433 Q ss_pred CCHHHhh-CCeEEEEEEEEecCcceeEEEEEEEcc Q lcl|Aclame:pro 344 NTVERYK-NGSWYIVIDYGRYSPNEHMIFHLNAVD 377 (388) Q Consensus 344 Nt~~~i~-~G~~~~~v~~~p~~pae~I~~~~~~~~ 377 (388) ++++++. .+...+.+.+.--..+++|++-..--. T Consensus 467 ~~~~~R~~R~~p~~~~~y~~~gaIh~v~i~s~~v~ 501 (501) T protein:vir:10 467 NPGQARQNRTTPACTLWYSDGGSIQQLTIGSNAVI 501 (501) T ss_pred CChhhhhhccccceEEEEEeCCceeEEEeeeeecC Confidence 3333333 344566666666667777765443333 No 65 >protein:vir:3636 Length: 501 # NCBI annotation: gp04 # Family: family:all:396 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705631;genbank:gi:23752316;genbank:GeneID:955753 Probab=95.82 E-value=0.0014 Score=36.23 Aligned_cols=363 Identities=11% Similarity=0.042 Sum_probs=162.5 Q ss_pred CCCCCCcCCCeEEEEcCCCcccccccCcceeEEEeeccccccccccCcceeeccchhhhhhcccccccccchhhhhhhhc Q lcl|Aclame:pro 1 MPVIDQFEHNGISIETHEPPPPMGPPGDNVVAWVVTAPDKHADVAFSVPFRVANTADAQYLDSTGNELGTGWHAASETLK 80 (388) Q Consensus 1 M~~~t~~~hGV~~~e~~~~~~~i~~v~tav~g~vgta~~~~~~~~~~~~v~v~s~~~~~~~~~~~~~~gtl~~a~~~~~~ 80 (388) ||-.+-.+ -.++.+..+..+.....-...+++-+.... .+........+..+....|+... ..+++...+|. T Consensus 1 m~~~~ip~--s~iV~V~~~v~~~~~~~~~~~~lllt~~~~---~~~~r~~~y~s~~~V~~~FG~~S---~ey~aA~~yFs 72 (501) T protein:vir:36 1 MPTTTIPI--DQIVQMLPGVIGAGGAPGRLTGLVLTQDTS---VQPGQLADFFQETDVENWFGALS---NEAKIADAYFP 72 (501) T ss_pred CCcCCccc--ceEEEEeeeeccCCCcceeeeeEEEeccCC---CCCcceeeecCHHHHHHhcCCCh---HHHHHHHHHhh Confidence 98521112 233333333333223232333444433322 12222233345555555544432 22333333332 Q ss_pred ---cccce--E-EEEecccccc---------cccccc----------------------cccccc---cchhhhhhhHhh Q lcl|Aclame:pro 81 ---KTSVP--Q-YFIVVPEGAD---------DAATMA----------------------NIIGGI---DPTTGRRTGIAA 120 (388) Q Consensus 81 ---~~~~~--~-~vv~~~~~~~---------~~~~~~----------------------~~~~~~---~~~tg~~tgl~a 120 (388) +.... . ++-|...... .+.+.+ ++.... +..+...+.|.+ T Consensus 73 ~~~~q~~~P~~l~igR~~~~a~~~~l~g~~l~~~~~a~~~~~sg~l~vti~g~~~~~~i~lS~~ts~~~vA~~i~~al~~ 152 (501) T protein:vir:36 73 GIVNGGQLPYDLKFARYVAADAPASVYGIPLTGVTLAQLQGYSGTLTVTTAAQHVSANISLAAATSFANAATLIEAAFTS 152 (501) T ss_pred cccCCCccccEEEEEeecCcCcceeEeccchhhhhhhhccceeEEEEEEecceeeeeecccccccCHHHHHHHHhhhhcC Confidence 11111 0 1111100000 000000 000000 000011111110 Q ss_pred hh--------------------------------------h-hhhhhhheecccccchhHHHHHHHHHhh---hCceEEE Q lcl|Aclame:pro 121 LT--------------------------------------E-CTERPTLIGAPGFSQNKAVIDALASMAK---RLKCRAV 158 (388) Q Consensus 121 ~~--------------------------------------~-~~~~p~ll~ap~~~~~~~v~~~l~~~~~---~~~~~~i 158 (388) .. . ....+..+...|.. .....++|.++.+ .+-.|.+ T Consensus 153 ~~~tv~~d~~~~~f~i~s~t~G~~~~i~~~t~~~~ia~~l~Lt~~~~a~v~~~g~~-~et~~~al~a~~~~s~~Wy~f~~ 231 (501) T protein:vir:36 153 PDFVVAYDALRNRFTVVTNATGTAAAISAVTGTNNFADEIGLSAAAGATLQAAGVA-ADTPASAMNRAVGLSRNWATFTT 231 (501) T ss_pred cceEEEEcCcceeEEEEeccCCcceeeEeeecccchhhhhcccccCcceEEecccc-cccHHHHHHHHHhccCceEEEEE Confidence 00 0 00001111112211 1223344544433 3334555 Q ss_pred EecCCCcchhHHHHHHHhhhcccccceEEEEe---cceecccc---------ccccee------ehhhHHHHHHHHhccc Q lcl|Aclame:pro 159 IDGPSGSTQDAIDLSGLLGGEGTGHDRVYMVD---PMPAIYSR---------KAQGNI------YVPPSTIAMGAVAAVK 220 (388) Q Consensus 159 ~d~p~~~~~~~~~~~~~~~~~~~~s~~~~~~~---p~~~~~~~---------~~~~~~------~~p~s~~~aG~~a~~d 220 (388) ++.+ ++++..+...|.+..+ ..+....+ +....... ..+-.+ ...+.+++.|..+..| T Consensus 232 a~~~--~~~~~la~A~wiea~~--~~f~~~~~~~~~~~~~~~~~~~i~~~l~~~~y~~t~~~y~~~~~~aa~~g~~as~n 307 (501) T protein:vir:36 232 AWTA--VIADRLAFASWNSGQA--YKYMYVAPDLEAASIVSNNAASFGAQVFAAPYQGTLPLYGDQATAGAVMGYAASIN 307 (501) T ss_pred ecCC--ChHHHHHHHHHHhhcC--ceEEEEEecCchhhhhccchhhHHHHHHhcCCCcEEEEcCCCCHHHHHHHHHHhcC Confidence 5544 3344445555555332 12222221 11110000 000001 1245567788888888 Q ss_pred cccccccccccceeecccccccccCchhhhhhccccceEEEEEeCC--CcEEEEccccC--CCceeeehhhHHHHHHHHH Q lcl|Aclame:pro 221 PWESPGNQGVLIQDVARVIDYNILDKSTEGDLLNRNGVSYFARTSM--GGFSLIGNRTV--TGKFISFVGLEDAIARKLE 296 (388) Q Consensus 221 ~~~s~~n~p~~~~g~~~~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~--~G~~~wG~rT~--~~~~i~vrR~~~~i~~~i~ 296 (388) +.+-+.-....-..+...+. ...++.+|++.|..+|.|++..|.+ ..+.+|-.-++ .|.+|.+.+-.+|+++.|+ T Consensus 308 f~~~~g~~T~~fkq~~~Gi~-a~~l~~t~a~al~~~~~N~y~~~~~~~~~~~~~~~G~~sG~~~wiD~~~g~dWL~~~iq 386 (501) T protein:vir:36 308 FQLRNGRTVLAFRQFNAGVP-ATVHDLPTANALRSNNYTYIGAYANAANNYTIAYDGKLSGKFLWVDTYLDQIYLNAELQ 386 (501) T ss_pred cccCcceeeeeccccCCCcC-cCcCCHHHHHHHHhcCCcEEEEEecccceeeEEEcCeeeccchhhhHHHhHHHHHHHHH Confidence 75543322211111111111 1345778999999999999877754 34667633333 3567999999999999999 Q ss_pred HHHHHHhcc----cCCHHHHHHHHHHHHHHHHHHHhcCCee-----------------------------eeeEEEeccC Q lcl|Aclame:pro 297 AASQRAMSK----QLTKSFMEQEIKKINLFMQDLVAAEIIP-----------------------------GGEVYLHPTL 343 (388) Q Consensus 297 ~~~~~~vfe----pn~~~~~~~i~~~i~~~L~~l~~~Gal~-----------------------------g~~v~~d~~~ 343 (388) ..+...+-. |-|..=...|+..++.-|++-+++|.|. ||.++.+... T Consensus 387 ~~l~~ll~~~~KIPytd~G~~~l~a~i~~~l~~av~nG~I~~Gv~~~~~q~~~I~~~~g~~~~~~~v~~~Gyy~~~~~~~ 466 (501) T protein:vir:36 387 RAEFEAMLAYNSLPYNEDGYTGLYRAGVDVIDAAVTSGIIRAGVTLTNSQLQQIDKAARVAGAGQLVQTRGWYFLIGDPA 466 (501) T ss_pred HHHHHHHhcCCCCccChhhHHHHHHHHHHHHHHHHhCceeecCCCCCcccceeecccccccccccceeccceEEeeCccc Confidence 988875432 6667778888999999999999988883 2445555443 Q ss_pred CCHHHhh-CCeEEEEEEEEecCcceeEEEEEEEcc Q lcl|Aclame:pro 344 NTVERYK-NGSWYIVIDYGRYSPNEHMIFHLNAVD 377 (388) Q Consensus 344 Nt~~~i~-~G~~~~~v~~~p~~pae~I~~~~~~~~ 377 (388) ++++++. .+...+.+.+.--..+++|++-..--. T Consensus 467 ~~~~~R~~R~~p~~~~~y~~~gaIh~v~i~s~~v~ 501 (501) T protein:vir:36 467 NPGQARQNRTTPACTLWYSDGGSIQSLTIGSNAVI 501 (501) T ss_pred CChhhhhhcccCcEEEEEEeCCceeEEEeeeeeeC Confidence 4444433 344566666667777777765443333 No 66 >protein:vir:106730 Length: 501 # NCBI annotation: gp04 # Family: family:all:396 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944312;genbank:gi:38638611;genbank:GeneID:2657359 Probab=95.15 E-value=0.0026 Score=34.71 Aligned_cols=363 Identities=11% Similarity=0.039 Sum_probs=162.2 Q ss_pred CCCCCCcCCCeEEEEcCCCcccccccCcceeEEEeeccccccccccCcceeeccchhhhhhcccccccccchhhhhhhhc Q lcl|Aclame:pro 1 MPVIDQFEHNGISIETHEPPPPMGPPGDNVVAWVVTAPDKHADVAFSVPFRVANTADAQYLDSTGNELGTGWHAASETLK 80 (388) Q Consensus 1 M~~~t~~~hGV~~~e~~~~~~~i~~v~tav~g~vgta~~~~~~~~~~~~v~v~s~~~~~~~~~~~~~~gtl~~a~~~~~~ 80 (388) ||-.+-.+ -.++.+..+..+-....-...+++-+.... .+........+..+....|+... ..+++...+|. T Consensus 1 m~~~~ip~--s~iV~V~~~v~~~~~~~~~f~~lll~~~~~---~~~~r~~~y~s~~~V~~~FG~~S---~ey~aA~~yFs 72 (501) T protein:vir:10 1 MPTTTIPI--DQIVQMLPGVIGAGGAPGRLTGLVLTQDTS---VQPGQLADFFQKTDVENWFGALS---NEAKIADAYFP 72 (501) T ss_pred CCcCcccc--ceEEEEeeecccCCCcccccceEEEecccC---CCccceeeecCHHHHHHhcCCCh---HHHHHHHHHhh Confidence 99522222 223333333322222222222333322222 12222233445555555554432 22333333332 Q ss_pred ---cccce---EEEEecccccc---------cccccc----------------------cccccc---cchhhhhhhHhh Q lcl|Aclame:pro 81 ---KTSVP---QYFIVVPEGAD---------DAATMA----------------------NIIGGI---DPTTGRRTGIAA 120 (388) Q Consensus 81 ---~~~~~---~~vv~~~~~~~---------~~~~~~----------------------~~~~~~---~~~tg~~tgl~a 120 (388) +.... .++-|...... .+.+.+ ++.... +..+...+.|.+ T Consensus 73 g~~~q~p~P~~l~igR~~~~~~~~~l~g~~l~~~~la~~~~~~g~l~i~i~g~~~~~~i~~s~ats~~~vA~~i~~al~~ 152 (501) T protein:vir:10 73 GIVNGGQLPYDLKFARYVAADAPASVYGIPLTGITLAQLQGYSGTLTVTTAAQHVSANISLAAATSFANAATLIEAAFTS 152 (501) T ss_pred hhcCCCccccEEEEEeecccCccceeeeceehhhhhhhhhheeeEEEEeeccceeeeccccccccCHHHHHHHHHHhhcC Confidence 21111 11111110000 000000 000000 000111111111 Q ss_pred hh-hh----------------h----------------------hhhhheecccccchhHHHHHHHHHhh---hCceEEE Q lcl|Aclame:pro 121 LT-EC----------------T----------------------ERPTLIGAPGFSQNKAVIDALASMAK---RLKCRAV 158 (388) Q Consensus 121 ~~-~~----------------~----------------------~~p~ll~ap~~~~~~~v~~~l~~~~~---~~~~~~i 158 (388) .. .+ + ..+..+.+.|.. ...+..+|.++.+ .+-.+.+ T Consensus 153 ~~~tv~~d~~~~~f~i~~~t~G~~~~i~~~t~~~d~a~~l~Lt~~~~a~v~~~g~~-aet~~~Al~a~~~~~~~Wy~f~~ 231 (501) T protein:vir:10 153 PDFVVAYDALRNRFTVVTNTTGTAAAISAVTGTNNLADELGLSAAAGATLQAAGVA-ADTPASAMNRAVGLSRNWATFTT 231 (501) T ss_pred CceEEEEecccceEEEEecccCcceeEEEeeccccchhhhcccccCceeEEecCcc-cccHHHHHHHHHhcccceEEEEE Confidence 00 00 0 000111112211 1123445554443 3334555 Q ss_pred EecCCCcchhHHHHHHHhhhcccccceEEEEe---cceecccc---------ccccee------ehhhHHHHHHHHhccc Q lcl|Aclame:pro 159 IDGPSGSTQDAIDLSGLLGGEGTGHDRVYMVD---PMPAIYSR---------KAQGNI------YVPPSTIAMGAVAAVK 220 (388) Q Consensus 159 ~d~p~~~~~~~~~~~~~~~~~~~~s~~~~~~~---p~~~~~~~---------~~~~~~------~~p~s~~~aG~~a~~d 220 (388) ++.+. +++..+...|....+ ..+....+ +....... ..+-.+ ...|.+++.|..+..| T Consensus 232 a~~~~--~~~~la~A~wi~a~~--~~f~~~~~~~~~~~~~~~~~~~i~~~l~~~~y~~t~~~y~~~~~~aa~~g~~as~n 307 (501) T protein:vir:10 232 AWTAV--IADRLAFAAWNSGQA--YKYMYVAPDLEAASIVTNNAASFGAQVFAAPYQGTLPLYGDQATAGAVMGYAASIN 307 (501) T ss_pred EecCC--hHHHHHHHHHHHhcC--ceEEEEEecCcceeeecccchhHHHHHHhcCCCceEEECCCCCHHHHHHHHHHhcC Confidence 65443 344445555555432 12222222 11110000 000011 1356778888888888 Q ss_pred cccccccccccceeecccccccccCchhhhhhccccceEEEEEeCCCc--EEEEccccC--CCceeeehhhHHHHHHHHH Q lcl|Aclame:pro 221 PWESPGNQGVLIQDVARVIDYNILDKSTEGDLLNRNGVSYFARTSMGG--FSLIGNRTV--TGKFISFVGLEDAIARKLE 296 (388) Q Consensus 221 ~~~s~~n~p~~~~g~~~~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G--~~~wG~rT~--~~~~i~vrR~~~~i~~~i~ 296 (388) +.+-+.-....-..+...+. ...++.+|++.|..+|.|++..+.+.| +.+|-.-++ .|.+|.+.+=.+|+++.|+ T Consensus 308 f~~~~g~~T~~fkql~~Gv~-a~~l~~t~a~al~~~~~N~y~~~~~~~~~~~~~~~G~~sG~~~wiD~~~g~dWl~~~iq 386 (501) T protein:vir:10 308 FQLRNGRTVLAFRQFNAGVP-ATAHDLPTANALRSNNYTYIGAYANAANNYTIAYDGKLSGKFLWVDTYLDQIYLNAELQ 386 (501) T ss_pred cccCcceeeeeecccCCCcC-cccCCHHHHHHHHhcCCeEEEEEecccceeeEEEcceeeccceehhhHhhHHHHHHHHH Confidence 75543322111101111111 134678899999999999998876544 777733233 3567888888899999999 Q ss_pred HHHHHHhcc----cCCHHHHHHHHHHHHHHHHHHHhcCCee-----------------------------eeeEEEeccC Q lcl|Aclame:pro 297 AASQRAMSK----QLTKSFMEQEIKKINLFMQDLVAAEIIP-----------------------------GGEVYLHPTL 343 (388) Q Consensus 297 ~~~~~~vfe----pn~~~~~~~i~~~i~~~L~~l~~~Gal~-----------------------------g~~v~~d~~~ 343 (388) ..+...+-. |-|..=...|...++.-|++-+++|.|. ||.+..+... T Consensus 387 ~~l~~ll~~~~kIPyt~~G~~~l~a~i~~~l~~av~nG~Ia~Gv~l~~~q~~~i~~~~g~~~~~~~v~~~Gyy~~~~~~~ 466 (501) T protein:vir:10 387 RAEFEAMLAYNSLPYNEDGYTANYRAGVDVIDAAVTSGIIRAGVTLTNSQLQQIDAVAGVAGAGALVQTRGWYFLIGNPA 466 (501) T ss_pred HHHHHHHhcCCCcccCHHHHHHHHHHHHHHHHHHHhCcceecCcccCcccceeecccccccccccceeccceEEeeCccc Confidence 888765432 5566777888888999999888888874 2445554433 Q ss_pred CCHHHh-hCCeEEEEEEEEecCcceeEEEEEEEcc Q lcl|Aclame:pro 344 NTVERY-KNGSWYIVIDYGRYSPNEHMIFHLNAVD 377 (388) Q Consensus 344 Nt~~~i-~~G~~~~~v~~~p~~pae~I~~~~~~~~ 377 (388) ++++++ ..+...+.+.+.--..+++|++-..--. T Consensus 467 ~~~~~R~~R~~p~~~~~y~~~gaIh~v~i~s~~v~ 501 (501) T protein:vir:10 467 NPGQARQNRTSPACTLWYSDGGSIQELTIGSNAVI 501 (501) T ss_pred CChhhhhhcccCceEEEEEeCCceeEEEeeeeecC Confidence 333332 3344566666666677777765443333 No 67 >protein:vir:78611 Length: 501 # NCBI annotation: BcepNY3gp03 # Family: family:all:396 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294840;genbank:gi:149882903;genbank:GeneID:5291082 Probab=94.92 E-value=0.0032 Score=34.28 Aligned_cols=363 Identities=11% Similarity=0.041 Sum_probs=158.2 Q ss_pred CCCCCCcCCCeEEEEcCCCcccccccCcceeEEEeeccccccccccCcceeeccchhhhhhcccccccccchhhhhhhhc Q lcl|Aclame:pro 1 MPVIDQFEHNGISIETHEPPPPMGPPGDNVVAWVVTAPDKHADVAFSVPFRVANTADAQYLDSTGNELGTGWHAASETLK 80 (388) Q Consensus 1 M~~~t~~~hGV~~~e~~~~~~~i~~v~tav~g~vgta~~~~~~~~~~~~v~v~s~~~~~~~~~~~~~~gtl~~a~~~~~~ 80 (388) ||-.+-.+ -.++.+..+..+.....-...+++-+..... +........+..+....|+... ..+++...+|. T Consensus 1 m~~~~ip~--s~iV~V~~~v~~~~~~~~~~~~lll~~~~~~---~~~r~~~y~s~~~V~~~FG~~S---~ey~aA~~yFs 72 (501) T protein:vir:78 1 MPTTTIPI--DQIVQMLPGVIGAGGAPGRLTGLVLTQDTSI---QPGQLADFFQKTDVENWFGGLS---NEAVIADAYFP 72 (501) T ss_pred CCcCcccc--ceEEEEeeecccCCCcceeeeeEEEecCCCC---CccceeeecCHHHHHHhcCCCh---HHHHHHHHHhh Confidence 98522122 2233333333322222222333333332221 2222233345555555544432 22333333332 Q ss_pred ---cccce--E-EEEecccccc---------ccccccc----------------------ccccc---cchhhhhhhHhh Q lcl|Aclame:pro 81 ---KTSVP--Q-YFIVVPEGAD---------DAATMAN----------------------IIGGI---DPTTGRRTGIAA 120 (388) Q Consensus 81 ---~~~~~--~-~vv~~~~~~~---------~~~~~~~----------------------~~~~~---~~~tg~~tgl~a 120 (388) +.... . ++-|...... .+.+.+. +.... +..+...+.|.+ T Consensus 73 ~~~~q~~~P~~l~igR~~~~a~~~~l~g~~l~~~~la~~~~~~G~l~iti~g~~~~~~i~~S~~ts~~~vA~~i~~al~a 152 (501) T protein:vir:78 73 GIVNGGQLPYDLKFARYVAADAPASVYGIPLTGVTLTQLQGYSGTLTVTTAAQHVSSNISLAAATSFANAATLIEAAFTS 152 (501) T ss_pred cCCCCCcccceEEEEeecccCcceeEeccceeccchhhhceeeeEEEEEeccceeeeccccccccCHHHHHHHHHhhhcC Confidence 11111 0 1111100000 0000000 00000 000111111111 Q ss_pred h--------------------------------------hhhh-hhhhheecccccchhHHHHHHHHHhh---hCceEEE Q lcl|Aclame:pro 121 L--------------------------------------TECT-ERPTLIGAPGFSQNKAVIDALASMAK---RLKCRAV 158 (388) Q Consensus 121 ~--------------------------------------~~~~-~~p~ll~ap~~~~~~~v~~~l~~~~~---~~~~~~i 158 (388) . .... ..+..+.+.|.. ...+.++|.++.+ .+-.+.+ T Consensus 153 ~~~tv~~ds~~~~f~its~t~G~~~~i~~~t~~~~~a~~l~Lt~~~~a~v~~~g~~-aet~~~a~~a~~~~~~~Wy~f~~ 231 (501) T protein:vir:78 153 PDFVVSYDALRNRFVVNTNATGTAAAISAVTGTNNLADELGLSAAAGASLQAAGVA-ADTPASAMNRAVGLSRNWATFTT 231 (501) T ss_pred cceEEEEccccceEEEEeeecCCceeEEEEecccchhhhhcccccCceeeEecccc-ccCHHHHHHHHHhccCceEEEEE Confidence 0 0000 001112122211 1123344444433 3444555 Q ss_pred EecCCCcchhHHHHHHHhhhcccccceEEEEe---cceecccc---------cccceeeh------hhHHHHHHHHhccc Q lcl|Aclame:pro 159 IDGPSGSTQDAIDLSGLLGGEGTGHDRVYMVD---PMPAIYSR---------KAQGNIYV------PPSTIAMGAVAAVK 220 (388) Q Consensus 159 ~d~p~~~~~~~~~~~~~~~~~~~~s~~~~~~~---p~~~~~~~---------~~~~~~~~------p~s~~~aG~~a~~d 220 (388) ++.+ .+++..+...|....+ ..+....+ +....... ..+-.+.+ .+.+++.|..+..| T Consensus 232 a~~~--~~~~~lalA~wiea~~--~~f~~~~~~~~~~~~~~~~~~~i~~~l~a~~y~~t~~~y~~~~~~aa~~g~~as~n 307 (501) T protein:vir:78 232 AWTA--VIADRLALASWNSGQA--YKYMYVAPDLEPASIVTNNSASFGAQVFAAPYQGTLPLYGDQATAGAVMGYAASIN 307 (501) T ss_pred ecCC--CHHHHHHHHHHHHhcC--ceEEEEEecCCcceeecccchhHHHHHhhcCCCceEEEcCCcchHHHHHHHHHhcC Confidence 6654 3344445555555432 12222211 11111000 00111112 24667788888888 Q ss_pred cccccccccccceeecccccccccCchhhhhhccccceEEEEEeCCCc--EEEEccccCC--CceeeehhhHHHHHHHHH Q lcl|Aclame:pro 221 PWESPGNQGVLIQDVARVIDYNILDKSTEGDLLNRNGVSYFARTSMGG--FSLIGNRTVT--GKFISFVGLEDAIARKLE 296 (388) Q Consensus 221 ~~~s~~n~p~~~~g~~~~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G--~~~wG~rT~~--~~~i~vrR~~~~i~~~i~ 296 (388) +.+-+.-.......+...+. ...++.+|++.|..+|.|++..+.+.| +.+|-.-+++ |.++.+-+=.+|+++.++ T Consensus 308 f~~~~g~~T~~fkq~~~Gv~-a~~l~~t~a~al~~~~~N~y~~~~~~~~~~~~~~~G~~sG~~~wiD~~~~~~Wl~~~iq 386 (501) T protein:vir:78 308 FQLRNGRTVLAFRQFNAGVP-ATAHDLGTANALRSNNYTYIGAYANAANNYTIAYDGKLSGKFLWVDTYLDQIYLNAELQ 386 (501) T ss_pred cccCcceeeeeccccCCCcC-cccCCHHHHHHHHhcCCeEEEEEecccceeeEEEcCeeeccceeehhhhhHHHHHHHHH Confidence 75543322111100111111 134578899999999999998876544 7777333333 455777777788888888 Q ss_pred HHHHHHhc---c-cCCHHHHHHHHHHHHHHHHHHHhcCCee-----------------------------eeeEEEeccC Q lcl|Aclame:pro 297 AASQRAMS---K-QLTKSFMEQEIKKINLFMQDLVAAEIIP-----------------------------GGEVYLHPTL 343 (388) Q Consensus 297 ~~~~~~vf---e-pn~~~~~~~i~~~i~~~L~~l~~~Gal~-----------------------------g~~v~~d~~~ 343 (388) ..+...+- + |-+..=...|+..++.-|++-+++|.|. ||.+..+... T Consensus 387 ~~l~~ll~~~~kIPyt~~G~~~l~a~v~~~l~~av~nG~I~~Gv~l~~~q~~~I~~~~g~~~~~~~~~~~Gyy~~~~~~~ 466 (501) T protein:vir:78 387 RAEFEAMLAYNSLPYNEDGYTALYRAGVDVIDAAVTSGIIRAGVTLTNSQLQQIDAAAGVAGAGQLVQMRGWYFLIGDPA 466 (501) T ss_pred HHHHHHHHhCCCcccCHHHHHHHHHHHHHHHHHHHhCceeecCCCCCCccceeeccccCccccccceeccceEEeecccc Confidence 77765432 2 6677778889999999999999998884 2445554433 Q ss_pred CCHHH-hhCCeEEEEEEEEecCcceeEEEEEEEcc Q lcl|Aclame:pro 344 NTVER-YKNGSWYIVIDYGRYSPNEHMIFHLNAVD 377 (388) Q Consensus 344 Nt~~~-i~~G~~~~~v~~~p~~pae~I~~~~~~~~ 377 (388) +++++ -..+...+.+.+.--..+++|++-..--. T Consensus 467 ~~~~~R~~R~~p~~~~~y~~~gaIh~v~i~s~~v~ 501 (501) T protein:vir:78 467 NPGQARQNRTTPTCTLWYSDGGSIQELTIGSNAVI 501 (501) T ss_pred CChhhhhhcccCcEEEEEEeCCceeEEEeeeeecC Confidence 33333 23344566666666667777765443333 No 68 >protein:vir:94073 Length: 494 # NCBI annotation: hypothetical protein # Family: family:all:396 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453619;genbank:gi:84662655;genbank:GeneID:5142583 Probab=93.81 E-value=0.0063 Score=32.65 Aligned_cols=359 Identities=13% Similarity=0.079 Sum_probs=159.9 Q ss_pred CCCCCCcCCCeEEEEcCCCcccccccCcceeEEEeeccccccccccCcceeeccchhhhhhcccccccccchhhhhhhhc Q lcl|Aclame:pro 1 MPVIDQFEHNGISIETHEPPPPMGPPGDNVVAWVVTAPDKHADVAFSVPFRVANTADAQYLDSTGNELGTGWHAASETLK 80 (388) Q Consensus 1 M~~~t~~~hGV~~~e~~~~~~~i~~v~tav~g~vgta~~~~~~~~~~~~v~v~s~~~~~~~~~~~~~~gtl~~a~~~~~~ 80 (388) ||.++--. +.-+.++-.+.++...+-..+.|++.. ..+........+..+....|+... ..+++...+|. T Consensus 1 m~~ip~s~--iV~V~~~v~~~~~~~~~f~~~l~~~~~-----~~~~~r~~~y~s~~~V~~~FG~~S---~ey~aA~~yFs 70 (494) T protein:vir:94 1 MPNIPISQ--IVSINPQVVSAGGTQGTLDGLLLTQAT-----GFPVTQPQVYFSAADVGTAFGLTS---DEYNAALVYFA 70 (494) T ss_pred CCCCCccc--EEEeeeeccccCCcccccceeEeecCc-----cCCccceeeecCHHHHHHhcCCCh---HHHHHHHHHhh Confidence 98663221 222222223334444454444333311 111111222334444444444332 22333333332 Q ss_pred ---cccc-e--EEEEeccc--------ccccccccc----------------------cccccc---cchhhhhhhHhhh Q lcl|Aclame:pro 81 ---KTSV-P--QYFIVVPE--------GADDAATMA----------------------NIIGGI---DPTTGRRTGIAAL 121 (388) Q Consensus 81 ---~~~~-~--~~vv~~~~--------~~~~~~~~~----------------------~~~~~~---~~~tg~~tgl~a~ 121 (388) +... + .++-|... +.....+.+ ++.... +..+...+.+.+. T Consensus 71 ~~~~q~p~P~~l~igR~~~~a~~~~l~g~~~~~tl~~~~~~~g~l~iti~g~~~~~~i~lS~~ts~~~vA~~i~~ai~~a 150 (494) T protein:vir:94 71 GILGGGQQPASLTIGRYASAATSAAVFGAPLTLSLAQLQTLSGTLIVTTDTQRTSAAINLSGATSFANAASLMTSGFTTP 150 (494) T ss_pred hccCCCccccEEEEEeecCccccceeeccchhhhHHhhhhcceEEEEEEcceEEEeeecccccCChhhHHHHHhhhhccc Confidence 1111 1 11111100 000000000 000000 0000111111100 Q ss_pred h-----------------hhhhh--------------------hhheecccccchhHHHHHHHHHhh---hCceEEEEec Q lcl|Aclame:pro 122 T-----------------ECTER--------------------PTLIGAPGFSQNKAVIDALASMAK---RLKCRAVIDG 161 (388) Q Consensus 122 ~-----------------~~~~~--------------------p~ll~ap~~~~~~~v~~~l~~~~~---~~~~~~i~d~ 161 (388) . ..+.. ...+...|. .......+|.++.+ .+-.+.+.+. T Consensus 151 ~~~v~~d~~~~~f~v~s~ttG~~s~is~~t~~~a~~l~lt~~~~a~v~~~g~-~aet~~~a~~a~~~~~~~Wy~f~~~~~ 229 (494) T protein:vir:94 151 NFAITYDAQRRRFVLSTTATGTTASVSAVTGTLADGVGLSTASGAYVEGSGL-AADTAASALDRLAASSSTWAIFTTAWA 229 (494) T ss_pred cceEEEcccCcEEEEEEccCCceeEEEEeccchhhhhhhhccccceEeecCc-ccccHHHHHHHHHhccCceEEEEEecC Confidence 0 00000 000111121 11123344444433 3444555554 Q ss_pred CCCcchhHHHHHHHhhhcccccceEEEEe---cceecccccc---------ccee------ehhhHHHHHHHHhcccccc Q lcl|Aclame:pro 162 PSGSTQDAIDLSGLLGGEGTGHDRVYMVD---PMPAIYSRKA---------QGNI------YVPPSTIAMGAVAAVKPWE 223 (388) Q Consensus 162 p~~~~~~~~~~~~~~~~~~~~s~~~~~~~---p~~~~~~~~~---------~~~~------~~p~s~~~aG~~a~~d~~~ 223 (388) + +.++..+...|....+ ..+....+ +........+ +-.+ ...|.+++.|..+..|... T Consensus 230 ~--~~~~ilalA~wiea~~--~~~~~~~~~~d~~~~~~~~~~~i~~~l~~~~y~~t~~~y~~~~~~aa~~g~~aa~~~~~ 305 (494) T protein:vir:94 230 A--SLSDRTALAQWTSDQV--FRRIYAAWDQDAAGLSVNNVSSFGNIVKTTPFSNTIPVYGLLANAMIVLAWGASTNLQI 305 (494) T ss_pred C--CHHHHHHHHHHHhhcC--ccEEEEEecCCcceeecccchhHHHHHHhhcCCceEEEcCCCChHHHHHHHHHhccccc Confidence 4 2344444555554422 12222221 1111110000 0001 1235567778888888755 Q ss_pred ccccccccceeecccccccccCchhhhhhccccceEEEEEeCCC--cEEEEccccCCCceeeehhhH--HHHHHHHHHHH Q lcl|Aclame:pro 224 SPGNQGVLIQDVARVIDYNILDKSTEGDLLNRNGVSYFARTSMG--GFSLIGNRTVTGKFISFVGLE--DAIARKLEAAS 299 (388) Q Consensus 224 s~~n~p~~~~g~~~~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~--G~~~wG~rT~~~~~i~vrR~~--~~i~~~i~~~~ 299 (388) .+.+....... ..+.-..-.++.+|++.|..+|+|++..+.+. =+.+|..-+++-.|+-.+... +|+++.++..+ T Consensus 306 ~~g~~T~~~k~-q~~gi~~~~l~~t~a~al~~~~~N~y~~~~~~~~~~~~~~gg~~sG~~~~id~~~~~~WL~~~iq~~l 384 (494) T protein:vir:94 306 AEGRTTLALRS-PVSSAGVRVDNLANANALLSNGYTYLGKYASATNTYTVTYNGAIGGQFLWADTALGWIALRRNLQQAL 384 (494) T ss_pred cCcceeEEeec-cCCCCCCccCCHHHHHHHHhcCCeEEEEecccCceEEEecCceeccccceeeeeccHHHHHHHHHHHH Confidence 44443332111 11111223357788999999999999887532 356776656664444444444 37777777777 Q ss_pred HHHhc---c-cCCHHHHHHHHHHHHHHHHHHHhcCCee----------------------------eeeEEE-e-ccCCC Q lcl|Aclame:pro 300 QRAMS---K-QLTKSFMEQEIKKINLFMQDLVAAEIIP----------------------------GGEVYL-H-PTLNT 345 (388) Q Consensus 300 ~~~vf---e-pn~~~~~~~i~~~i~~~L~~l~~~Gal~----------------------------g~~v~~-d-~~~Nt 345 (388) ...+- + |-|..=...|+..++.-|++-+++|.|. ||.+.. + .+.|+ T Consensus 385 ~~ll~~~~KIPytd~G~~~l~a~i~~~l~~av~nG~I~~Gv~~~~~q~~~i~~~~G~~~~~~~~~kGyy~~~~~~~s~~~ 464 (494) T protein:vir:94 385 FETLLAYRSLPYNADGYNALYQGAQDVVSQFVAAGVIRAGVALSASQRAQIDQAAGVPISGDVVDKGWYLQVIDPITTTV 464 (494) T ss_pred HHHHHhCCCcccChhhHHHHHHHHHHHHHHHHhCceeecccccCcchhhhhhhhhcCccccceeccceeeeccCCCChhh Confidence 65432 2 7777788889999999999999999884 234443 2 35566 Q ss_pred HHHhhCCeEEEEEEEEecCcceeEEEEEEEcc Q lcl|Aclame:pro 346 VERYKNGSWYIVIDYGRYSPNEHMIFHLNAVD 377 (388) Q Consensus 346 ~~~i~~G~~~~~v~~~p~~pae~I~~~~~~~~ 377 (388) +.++..-++.+.+.. ...+++|++....-. T Consensus 465 ra~R~~~~~~~~y~~--~GAIh~v~i~~~~v~ 494 (494) T protein:vir:94 465 RTDRGSPTVNFWYCD--GGSIQRVVVSATTVI 494 (494) T ss_pred hhccccCCceEEEEe--cCcEEEEEEeeEEeC Confidence 666666666665554 677788777765555 No 69 >protein:vir:96104 Length: 504 # NCBI annotation: hypothetical protein ORF029 # Family: family:all:396 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294446;genbank:gi:149408343;genbank:GeneID:5237223 Probab=81.49 E-value=0.085 Score=26.45 Aligned_cols=351 Identities=8% Similarity=-0.005 Sum_probs=137.1 Q ss_pred CCCC---CCcCCC-eEEEEcCCCccccc--c------------cCcceeEEEeecccccccc-ccCcceeeccchhhhhh Q lcl|Aclame:pro 1 MPVI---DQFEHN-GISIETHEPPPPMG--P------------PGDNVVAWVVTAPDKHADV-AFSVPFRVANTADAQYL 61 (388) Q Consensus 1 M~~~---t~~~hG-V~~~e~~~~~~~i~--~------------v~tav~g~vgta~~~~~~~-~~~~~v~v~s~~~~~~~ 61 (388) +.-. +. .++ .++-+-.....+.. . +.++..-+.. ....... .++- -..+|+.+.+.. T Consensus 68 F~~~~~~~~-~P~~l~igR~~~~a~~~~l~g~~~~~~~~~~~~i~~G~lsitv--~G~~~~~~~i~~-S~~ts~~~vA~~ 143 (504) T protein:vir:96 68 FKFISKSVN-SPSSISFARWVNTAIAPMVVGDNLPKTIADFAGFSAGVLTIMV--GAAEKNITAIDT-SAATSMDNVASI 143 (504) T ss_pred hhcCCCCCc-cccEEEEEeecCcCccceEEechhHHHHHHHhhhhceEEEEEE--cceeeeeccccc-ccccchHHHHHH Confidence 1110 00 111 34444332222211 0 0111110000 0000000 0000 001111121111 Q ss_pred cccccccccchh--hhhhhhccccceEEEEecccccccccccccccccccchhhhhhhHhhhhhhhhhhhheecccccch Q lcl|Aclame:pro 62 DSTGNELGTGWH--AASETLKKTSVPQYFIVVPEGADDAATMANIIGGIDPTTGRRTGIAALTECTERPTLIGAPGFSQN 139 (388) Q Consensus 62 ~~~~~~~gtl~~--a~~~~~~~~~~~~~vv~~~~~~~~~~~~~~~~~~~~~~tg~~tgl~a~~~~~~~p~ll~ap~~~~~ 139 (388) ............ .....++..+ ..+++....... ..-..+.....+ .+.++.... .+......|.. . T Consensus 144 i~~al~~~~~~~~~~~tv~~d~~~-~~f~its~~tg~----~~~~~~~~a~~~----~~~~~lgl~-~~~~~~v~g~~-a 212 (504) T protein:vir:96 144 IQTEIRKNTDPQLAQATVTWNPNT-NQFTLVGATIGT----GVLAVAKSADPQ----DMSTALGWS-TSNVVNVAGQA-A 212 (504) T ss_pred HHhhhhcccccccccceEEEeccC-CeEEEEeecccc----ceeEEEeecccc----chhhhhhcc-cccceEEeecc-c Confidence 100000000000 0000111111 111121111000 000000000000 000000000 01111122221 1 Q ss_pred hHHHHHHHHH---hhhCceEEEEecCCCcchhHHHHHHHhhhcccccceEEEEecceecccc----------ccc----- Q lcl|Aclame:pro 140 KAVIDALASM---AKRLKCRAVIDGPSGSTQDAIDLSGLLGGEGTGHDRVYMVDPMPAIYSR----------KAQ----- 201 (388) Q Consensus 140 ~~v~~~l~~~---~~~~~~~~i~d~p~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~~~----------~~~----- 201 (388) ....++|.++ ...+-.+.+.+.+.. +++......|.+..+ ..+.... |....+. ... T Consensus 213 et~~~al~al~~~~~~Wy~f~~a~~~~~-dd~ilalA~w~ea~~--~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~ 287 (504) T protein:vir:96 213 DLPDAAVAKSTNVSNNFGSFLFAGATLD-NDQIKAVSAWNAAQN--NQFIYTV--ATSLANLGALFDLVKGNSGTALNVL 287 (504) T ss_pred ccHHHHHHHHHhhcCCeEEEEEEeccCC-HHHHHHHHHHHhhcC--ceEEEEE--eecccchhhHHHhhhhcceeEEEEe Confidence 1122222222 223333444443322 122223333333211 1111111 1100000 000 Q ss_pred --ceeehhhHHHHHHHHhcccccccccccccc---ceeecccccccccCchhhhhhccccceEEEEEeCCCc--EEEE-c Q lcl|Aclame:pro 202 --GNIYVPPSTIAMGAVAAVKPWESPGNQGVL---IQDVARVIDYNILDKSTEGDLLNRNGVSYFARTSMGG--FSLI-G 273 (388) Q Consensus 202 --~~~~~p~s~~~aG~~a~~d~~~s~~n~p~~---~~g~~~~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G--~~~w-G 273 (388) ....--+..+.++..+..|+.+-+.-.+.. ..|+.. ..++.+|++.|..+|+|++..+.+.| +.+| . T Consensus 288 ~~~~~~~~~~~~~~~~~as~~f~~~ng~~T~~fk~l~GVta-----~~lt~t~~~aL~~~~~N~y~~~a~~~~~~~~~~~ 362 (504) T protein:vir:96 288 SATASNDFVEQCPSEILAATNYDEPGASQNYMYYQFPGRNI-----TVSDDTAANTVDKSRGNYIGVTQANGQQLAFYQR 362 (504) T ss_pred ecCccchhHHHHHHHHHHhcCcCcccccccccccccCCcCc-----ccCCHHHHHHHHhcCCeEEEEeecccceeeEEec Confidence 000112345556777777764432222211 123321 24688899999999999998876544 3444 2 Q ss_pred cccCC----CceeeehhhHHHHHHHHHHHHHHHhcc----cCCHHHHHHHHHHHHHHHHHHHhcCCee------------ Q lcl|Aclame:pro 274 NRTVT----GKFISFVGLEDAIARKLEAASQRAMSK----QLTKSFMEQEIKKINLFMQDLVAAEIIP------------ 333 (388) Q Consensus 274 ~rT~~----~~~i~vrR~~~~i~~~i~~~~~~~vfe----pn~~~~~~~i~~~i~~~L~~l~~~Gal~------------ 333 (388) ..++. |.+|.+-+-.+|+++.|+..+....-. |-|+.=...|+..++.-|++-++.|.|. T Consensus 363 G~~~gG~~~~~wiDv~~~~~WL~~~lq~~l~~l~~~~~kIPyt~~Gi~~l~a~i~~vl~~av~~G~I~~Gv~~~~~q~~~ 442 (504) T protein:vir:96 363 GILCGGPTDAVDMNVYANEIWLKSAIAQALLDLFLNVNAVPASMVGEAMTLAVLQPVLDKATSNGTFTYGKDISAVQQQY 442 (504) T ss_pred CeeeCCccccchhhhhhhHHHHHHHHHHHHHHHHhcCCCcccCHhhHHHHHHHHHHHHHHHHhcceeccCccCCccchhe Confidence 33443 445888888889999998888764322 5667788889999999999988888762 Q ss_pred -----------------eeeEEEec-cCCCHHHhh-CCeEEEEEEEEecCcceeEEEEEEEc Q lcl|Aclame:pro 334 -----------------GGEVYLHP-TLNTVERYK-NGSWYIVIDYGRYSPNEHMIFHLNAV 376 (388) Q Consensus 334 -----------------g~~v~~d~-~~Nt~~~i~-~G~~~~~v~~~p~~pae~I~~~~~~~ 376 (388) ||.+..+. ++-+++++. .+...+.+...-.-.+++|++....- T Consensus 443 I~~~~~~d~~~~~~~~~GYyv~~~~~s~~s~~~r~~R~~~~~~~~y~~~gaI~~v~~~~~~v 504 (504) T protein:vir:96 443 ITQITGDRRAWRQVQTLGYWINITFSSYTNSNTGLTEWKANYTLIYSKGDAIRFVEGSDVMI 504 (504) T ss_pred ecccccccccccceeccceEEEecChhccChhHhhhccccceEEEEEECCeEEEEEeccccC Confidence 36677643 344444443 35556666667777788877665444 No 70 >protein:vir:99586 Length: 507 # NCBI annotation: hypothetical protein # Family: family:all:396 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039795;genbank:gi:126011045;genbank:GeneID:4818281 Probab=78.41 E-value=0.11 Score=25.74 Aligned_cols=349 Identities=9% Similarity=-0.010 Sum_probs=145.9 Q ss_pred CCCCCCcCCC-eEEEEcCCCcccc--ccc------------CcceeEEEeecccccccc-ccCcceeeccchhhhhhccc Q lcl|Aclame:pro 1 MPVIDQFEHN-GISIETHEPPPPM--GPP------------GDNVVAWVVTAPDKHADV-AFSVPFRVANTADAQYLDST 64 (388) Q Consensus 1 M~~~t~~~hG-V~~~e~~~~~~~i--~~v------------~tav~g~vgta~~~~~~~-~~~~~v~v~s~~~~~~~~~~ 64 (388) -|--+. .++ .++-+-....++. +.. .++.+-+. ........ .++ .-..+|..+.+.. T Consensus 71 ~p~~~~-~P~~L~igR~~~~~~~a~l~g~~~~~~l~~~~~~~~G~lti~--v~G~~~t~~~i~-lS~~ts~~~vAs~--- 143 (507) T protein:vir:99 71 ISKSIN-SPSYISFARWVNAAIASMIVGDSLVKNLPALKAVATPTLSLS--IGGTVVPIAGID-LTAALTLTDVAAT--- 143 (507) T ss_pred CCCCCc-ccceEEEEeecCccccceeecchhhhhHHHHhhhcceeEEEE--EcCceeEecccc-ccccCCHHHHHHH--- Confidence 111000 011 3444433222211 110 01110000 00000000 000 0001111211111 Q ss_pred ccccccchhhhhhh-----------hccccceEEEEecc-cccccccccccccccccchhhhhhhHhhhhhhhhhhhhee Q lcl|Aclame:pro 65 GNELGTGWHAASET-----------LKKTSVPQYFIVVP-EGADDAATMANIIGGIDPTTGRRTGIAALTECTERPTLIG 132 (388) Q Consensus 65 ~~~~gtl~~a~~~~-----------~~~~~~~~~vv~~~-~~~~~~~~~~~~~~~~~~~tg~~tgl~a~~~~~~~p~ll~ 132 (388) +..++... ++..+ ..+++... .+..... .+....+. .+.+..+..... ..-+. T Consensus 144 ------i~~~l~a~~~~~~~~~tv~~d~~~-~~F~v~s~~tG~~s~i---~~at~~~~----gt~~s~l~~~~~-~~a~~ 208 (507) T protein:vir:99 144 ------LQTKIRASANAELATATVTFNTTT-NQFVLNGTTTGALAPT---ITAVRTDP----ATDISSLLGWTN-TGTVF 208 (507) T ss_pred ------HHHhhhccccccccceEEEEecCC-ceEEEEeeecccccee---EEEEcCCc----hhhHHHHhcccc-ccceE Confidence 01111111 11111 11111111 1111000 11111111 111111111110 11222 Q ss_pred cccccchhHHHHHHHHHh---hhCceEEEEecCCCcchhHHHHHHHhhhcccccceEEEEecceeccc------------ Q lcl|Aclame:pro 133 APGFSQNKAVIDALASMA---KRLKCRAVIDGPSGSTQDAIDLSGLLGGEGTGHDRVYMVDPMPAIYS------------ 197 (388) Q Consensus 133 ap~~~~~~~v~~~l~~~~---~~~~~~~i~d~p~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~~------------ 197 (388) ..|.. ...+..+|..+. ..+-.+.+.+.+.-++++..+...|....+ ..+....+-.-.... T Consensus 209 ~~g~~-aet~~~a~~a~~~~~~nW~~~~~a~~~~~td~~~lalA~wiea~~--~~f~~~~~~~~a~~~~~~~~~~~~~~~ 285 (507) T protein:vir:99 209 VKGQA-AETPDTSISKSAAISTNFGSFIYTSTPALTNDQITAVASWNASQN--NMYMYSVPTTIANIGTLYAAVKGFSGC 285 (507) T ss_pred eeccc-ccCHHHHHHHHHhhcCCeEEEEEEeccccChHHHHHHHHHHhhcC--cEEEEEEecCchhhhhhhhhhhhccee Confidence 22322 222333444433 344445556665444444455555554432 222222111000000 Q ss_pred ---ccccceeehhhHHHHHHHHhccccccccccccccceeecccccccccCchhhhhhccccceEEEEEeCCCc--EEEE Q lcl|Aclame:pro 198 ---RKAQGNIYVPPSTIAMGAVAAVKPWESPGNQGVLIQDVARVIDYNILDKSTEGDLLNRNGVSYFARTSMGG--FSLI 272 (388) Q Consensus 198 ---~~~~~~~~~p~s~~~aG~~a~~d~~~s~~n~p~~~~g~~~~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G--~~~w 272 (388) ...+......+.+.+.|..+..|+.+-+.-.......+. .+. ...++.+|++.|..+|+|+...+.+.| +.+| T Consensus 286 ~~~~~~~~~~~~~~~aa~~g~~as~nf~~~ng~~T~~fk~l~-GV~-a~~lt~t~a~al~~~n~N~y~~~a~~~~~~~~~ 363 (507) T protein:vir:99 286 ALNITSDSLPVDYIEQSPCEILAATDYTRVNATQNYMYYQFP-SRN-ITVSDDTTANLVDANRGNYIGQTQSAGQSLAFY 363 (507) T ss_pred EEEeecccccchhHHHHHHHHHHhhccCcCccceeecccccC-Ccc-cccCCHHHHHHHHhcCCeEEEEeccccceeeEE Confidence 000011112356777788888886543332222111111 111 124688899999999999998886533 5555 Q ss_pred cc-ccCC----CceeeehhhHHHHHHHHHHHHHHHhc---c-cCCHHHHHHHHHHHHHHHHHHHhcCCeee--------- Q lcl|Aclame:pro 273 GN-RTVT----GKFISFVGLEDAIARKLEAASQRAMS---K-QLTKSFMEQEIKKINLFMQDLVAAEIIPG--------- 334 (388) Q Consensus 273 G~-rT~~----~~~i~vrR~~~~i~~~i~~~~~~~vf---e-pn~~~~~~~i~~~i~~~L~~l~~~Gal~g--------- 334 (388) -. .++. |-.+.+-+=.+|++..++..+....- + |-+..=...|+..++.-|++-+++|.|.. T Consensus 364 ~~G~~~gG~~~fid~d~~~~~~WL~~~iq~~l~~l~~~~~kIPyt~~G~~~l~a~i~~~l~~av~nG~I~~Gvtl~~~q~ 443 (507) T protein:vir:99 364 QRGILCGGPNDAVDMNIYANEIWLKSAISAQILSLFLNVPRVPANETGESMLLSVIQSVVNTAKNNGTISAGKNLNVIQQ 443 (507) T ss_pred ecCeeeCCcccceeeeeecchHHHHHHHHHHHHHHHhcCCCCccChhhHHHHHHHHHHHHHHHHhccccccCCcccccch Confidence 33 3333 33344445555888888777776332 2 66777788888888888888888887743 Q ss_pred --------------------eeEEEec-cCCCHHHhh-CCeEEEEEEEEecCcceeEEEEEEEc Q lcl|Aclame:pro 335 --------------------GEVYLHP-TLNTVERYK-NGSWYIVIDYGRYSPNEHMIFHLNAV 376 (388) Q Consensus 335 --------------------~~v~~d~-~~Nt~~~i~-~G~~~~~v~~~p~~pae~I~~~~~~~ 376 (388) |.+..+. ++.+++++. .+...+.+-+.---.+++|++....- T Consensus 444 ~~in~~~~~~~~~~~~~~~Gyy~~~~~~s~~~~~~r~~r~~~~~~~~y~~~gaI~~v~~~~~~v 507 (507) T protein:vir:99 444 QYITQISGDANAWRQVANIGYWLNITFSSYTNPNTQLTEWKASYQLIYSKDDAIRFVEGTDTLI 507 (507) T ss_pred heecccccccccccceeccceEEEeCChHhcChhhhhccccceEEEEEEeCCeEEEEEeeeecC Confidence 4455543 344444443 56777777777788888888766555 Done!