Query lcl|Aclame:protein:vir:100323|NCBI_annot:tail sheath protein FI|genbank:acc:YP_655492;genbank:gi:109289960;genbank:GeneID:4157366 Match_columns 393 No_of_seqs 138 out of 822 Neff 9.2 Searched_HMMs 1612 Date Sun Dec 1 19:38:10 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_16 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_16_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:100323 Length: 393 100.0 2E-119 1E-122 670.9 37.0 393 1-393 1-393 (393) 2 protein:vir:78206 Length: 390 100.0 1E-112 7E-116 634.3 36.4 388 3-392 1-390 (390) 3 protein:vir:103993 Length: 390 100.0 1E-112 7E-116 634.3 36.4 388 3-392 1-390 (390) 4 protein:vir:79181 Length: 390 100.0 3E-111 2E-114 626.9 37.0 388 3-392 1-390 (390) 5 protein:vir:79141 Length: 391 100.0 2E-110 1E-113 622.0 35.5 389 3-393 1-391 (391) 6 protein:vir:1172 Length: 391 # 100.0 1E-109 7E-113 617.8 36.4 389 1-392 1-391 (391) 7 protein:vir:98553 Length: 395 100.0 2E-109 1E-112 616.9 36.9 387 4-392 1-395 (395) 8 protein:vir:1845 Length: 392 # 100.0 2E-109 1E-112 616.5 36.9 387 4-392 1-392 (392) 9 protein:vir:6079 Length: 396 # 100.0 6E-108 4E-111 608.3 37.3 388 4-393 1-396 (396) 10 protein:vir:2035 Length: 396 # 100.0 6E-108 4E-111 608.5 36.3 388 4-393 1-396 (396) 11 protein:vir:5711 Length: 396 # 100.0 1E-107 6E-111 607.3 36.8 388 4-393 1-396 (396) 12 protein:vir:10336 Length: 386 100.0 7E-105 4E-108 591.7 34.6 382 3-387 1-386 (386) 13 protein:vir:96740 Length: 388 100.0 5E-100 3E-103 564.9 35.4 374 1-391 1-388 (388) 14 protein:vir:107865 Length: 477 100.0 2.4E-99 1E-102 561.3 34.3 381 3-390 1-477 (477) 15 protein:vir:79092 Length: 477 100.0 3.4E-99 2E-102 560.5 34.7 381 3-390 1-477 (477) 16 protein:vir:6594 Length: 666 # 100.0 1.7E-87 1E-90 496.3 35.4 379 4-393 1-666 (666) 17 protein:vir:106427 Length: 679 100.0 1.4E-87 8.8E-91 496.7 34.6 378 4-392 1-679 (679) 18 protein:vir:80984 Length: 666 100.0 2.9E-87 1.8E-90 495.0 34.6 379 4-393 1-666 (666) 19 protein:vir:6894 Length: 660 # 100.0 4.1E-87 2.5E-90 494.2 34.0 378 4-392 1-660 (660) 20 protein:vir:103456 Length: 659 100.0 6.2E-87 3.8E-90 493.2 33.8 377 4-393 1-657 (659) 21 protein:vir:98263 Length: 664 100.0 4.8E-87 3E-90 493.8 33.0 374 3-393 1-661 (664) 22 protein:vir:7206 Length: 659 # 100.0 8.3E-87 5.1E-90 492.5 33.8 377 4-393 1-657 (659) 23 protein:vir:108052 Length: 660 100.0 4.5E-86 2.8E-89 488.5 33.9 374 4-391 1-660 (660) 24 protein:vir:106984 Length: 743 100.0 5.5E-86 3.4E-89 488.0 33.8 377 3-389 1-743 (743) 25 protein:vir:101187 Length: 663 100.0 1.2E-85 7.7E-89 486.1 34.9 379 4-393 1-663 (663) 26 protein:vir:104858 Length: 729 100.0 2.8E-85 1.7E-88 484.1 33.0 379 1-390 1-729 (729) 27 protein:vir:101804 Length: 663 100.0 7.5E-85 4.6E-88 481.8 35.0 379 4-393 1-663 (663) 28 protein:vir:100539 Length: 663 100.0 5.2E-84 3.2E-87 477.2 32.9 379 4-393 1-663 (663) 29 protein:vir:5663 Length: 671 # 100.0 1.1E-83 6.7E-87 475.4 32.4 376 4-392 1-671 (671) 30 protein:vir:104477 Length: 749 100.0 1.5E-82 9.3E-86 469.2 33.5 375 3-388 1-749 (749) 31 protein:vir:98824 Length: 774 100.0 1.6E-80 9.9E-84 458.1 28.1 376 1-389 276-774 (774) 32 protein:vir:5833 Length: 742 # 100.0 1E-75 6.2E-79 431.7 26.6 365 1-388 347-742 (742) 33 protein:vir:79798 Length: 717 100.0 4.7E-53 2.9E-56 307.4 24.1 346 1-380 328-717 (717) 34 protein:vir:103168 Length: 641 100.0 2.2E-42 1.4E-45 249.0 19.8 264 1-281 1-641 (641) 35 protein:vir:63742 Length: 562 100.0 1.4E-40 8.6E-44 239.1 28.2 361 1-385 6-562 (562) 36 protein:vir:80779 Length: 569 100.0 3.5E-39 2.2E-42 231.4 27.2 361 1-385 1-569 (569) 37 protein:vir:80488 Length: 562 100.0 1.6E-38 9.7E-42 227.9 28.0 361 1-385 6-562 (562) 38 protein:vir:95741 Length: 587 100.0 5.7E-35 3.5E-38 208.3 26.7 361 1-385 1-587 (587) 39 protein:vir:102819 Length: 648 100.0 1.3E-34 8.3E-38 206.3 28.2 359 1-383 1-648 (648) 40 protein:vir:96586 Length: 587 100.0 5.9E-34 3.7E-37 202.7 27.8 361 1-385 6-587 (587) 41 protein:vir:99306 Length: 587 100.0 7.1E-34 4.4E-37 202.3 26.5 361 1-385 1-587 (587) 42 protein:vir:107310 Length: 581 100.0 1E-34 6.3E-38 207.0 21.0 366 1-393 177-581 (581) 43 protein:vir:7653 Length: 581 # 100.0 1.8E-34 1.1E-37 205.6 19.8 363 1-393 156-581 (581) 44 protein:vir:100829 Length: 607 100.0 1.7E-30 1E-33 183.9 25.1 368 1-391 15-607 (607) 45 protein:vir:102957 Length: 437 100.0 3.9E-29 2.4E-32 176.3 26.3 357 1-379 1-437 (437) 46 protein:vir:105470 Length: 451 99.9 3.8E-23 2.3E-26 143.5 27.8 358 1-379 1-451 (451) 47 protein:vir:101326 Length: 529 99.9 2.7E-23 1.7E-26 144.3 20.5 359 1-380 112-529 (529) 48 protein:vir:78986 Length: 436 99.8 4.3E-19 2.7E-22 121.3 26.4 355 1-379 1-436 (436) 49 protein:vir:102359 Length: 356 99.3 3E-12 1.9E-15 83.8 21.9 323 1-378 1-356 (356) 50 protein:vir:3751 Length: 376 # 98.9 4.3E-09 2.7E-12 66.5 21.4 343 7-385 1-376 (376) 51 protein:vir:3788 Length: 376 # 98.8 6.6E-09 4.1E-12 65.4 20.6 343 7-385 1-376 (376) 52 protein:vir:276 Length: 369 # 98.7 6.8E-08 4.2E-11 59.9 24.9 337 1-383 1-369 (369) 53 protein:vir:80052 Length: 331 98.6 1.5E-07 9.5E-11 58.0 25.9 316 3-380 1-331 (331) 54 protein:vir:78782 Length: 370 98.6 3E-08 1.8E-11 61.9 19.4 342 7-387 1-370 (370) 55 protein:vir:5260 Length: 502 # 98.4 7.9E-07 4.9E-10 54.0 26.5 361 1-380 1-502 (502) 56 protein:vir:4517 Length: 498 # 98.4 9.4E-07 5.8E-10 53.6 22.6 355 1-383 1-498 (498) 57 protein:vir:489 Length: 498 # 98.2 2.5E-06 1.6E-09 51.3 21.9 354 1-383 1-498 (498) 58 protein:vir:4463 Length: 498 # 98.1 5.7E-06 3.5E-09 49.3 22.5 355 1-383 1-498 (498) 59 protein:vir:95263 Length: 450 98.0 7.7E-06 4.8E-09 48.6 24.7 355 3-381 1-450 (450) 60 protein:vir:3165 Length: 426 # 97.8 1.5E-05 9.3E-09 47.0 22.6 362 1-380 1-426 (426) 61 protein:vir:1996 Length: 495 # 97.3 9.3E-05 5.7E-08 42.7 25.6 352 1-376 1-495 (495) 62 protein:vir:3636 Length: 501 # 73.5 0.17 0.00011 24.8 26.2 363 1-380 1-501 (501) 63 protein:vir:106730 Length: 501 69.6 0.22 0.00014 24.2 26.3 364 1-380 1-501 (501) 64 protein:vir:101576 Length: 501 66.6 0.26 0.00016 23.8 26.1 364 1-380 1-501 (501) 65 protein:vir:107720 Length: 515 53.6 0.52 0.00032 22.1 12.9 360 1-379 78-515 (515) 66 protein:vir:96104 Length: 504 49.3 0.64 0.0004 21.6 26.8 359 1-379 1-504 (504) 67 protein:vir:108311 Length: 249 40.0 0.99 0.00062 20.6 8.3 102 285-393 1-120 (249) 68 protein:vir:94073 Length: 494 29.8 1.6 0.001 19.4 23.8 356 1-380 1-494 (494) 69 protein:vir:78611 Length: 501 20.9 2.7 0.0017 18.2 27.1 364 1-380 1-501 (501) No 1 >protein:vir:100323 Length: 393 # NCBI annotation: tail sheath protein FI # Family: family:all:115 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655492;genbank:gi:109289960;genbank:GeneID:4157366 Probab=100.00 E-value=2.4e-119 Score=670.90 Aligned_cols=393 Identities=100% Similarity=1.423 Sum_probs=384.8 Q ss_pred CCCCcccCCCeEEEEcCCCcccccccccceeEEEEeecccccccccccceEEeecchhhhhhcccccchhhhhhhhhccc Q lcl|Aclame:pro 1 MSILDTYLHGVEVVEVNAGGVTISTAATSVIGVVCTGDQADAETFPLNTPVLITNPLNYLEKAGSTGTLRRTLNSIGSIV 80 (393) Q Consensus 1 m~m~~~~~~GV~v~ev~~~~~~i~~v~tav~g~vg~a~~~d~~~~p~~~~vl~t~~~~~~~~~g~~~tl~~~~~~~~~~~ 80 (393) |||+++|+|||||+|++++++|+.+++|++++|+|++++.+...+|+++|++++++.++...++..++|..++..++.++ T Consensus 1 m~m~~~~~~GV~v~e~~~g~~~i~~~~tav~~~vgta~~~~~~~~pln~pv~i~s~~~~~~~~g~~g~L~~al~~~~~~~ 80 (393) T protein:vir:10 1 MSILDTYLHGVEVVEVNAGGVTISTAATSVIGVVCTGDQADAETFPLNTPVLITNPLNYLEKAGSTGTLRRTLNSIGSIV 80 (393) T ss_pred CCCCCccCCCeEEEEcCCCcceecccCcceeEEEeeccCcCcccccCccceEecchHHHHHhhCCccchhhhhhhhhccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CceEEEEEecccccccccccchhcccccccccchhhhhhhhhhhhhccccccccccchHHHHHHHHHhhcccceEEEEec Q lcl|Aclame:pro 81 KTPTVIVRVAESDDSDTLTANIVGTQENGKFTGIKALLTAQSTVFVKPKLLCVPQHDNQAVATELLSVAKKLNAFAFISD 160 (393) Q Consensus 81 ~~~~~vv~~~~~~~~~~~~~~~~~~~~~~~~~gl~al~~~~~~~~~~~~~l~apg~s~~~v~~al~~~a~~~~~~~~i~~ 160 (393) +..++++++...++...+...+.+..++..++|+++++.++..+++.|+++++||+++.+++++|.++|+++++++++++ T Consensus 81 ~~~~~vv~v~~~~~~~~t~~~iig~~~~~~~tgl~al~~~~~~~~~~p~li~apg~~~~~~~~al~~~~~~~~~~~~v~d 160 (393) T protein:vir:10 81 KTPTVIVRVAESDDSDTLTANIVGTQENGKFTGIKALLTAQSTVFVKPKLLCVPQHDNQAVATELLSVAKKLNAFAFISD 160 (393) T ss_pred CceEEEeecccCccccccccccccccccchhhHHHHHHhhhhhcceeeeeeeeccccchHHHHHHHHHhhccCcEEEEEc Confidence 99999999999999999988888888888899999999999999999999999999999999999999999999999999 Q ss_pred CCCCcchhhhhhhcccccceEEEeccceeEeeccCCceEEechhHHHHHHHHhhhccCCceecCCCceecceeeceeecc Q lcl|Aclame:pro 161 NGATTKEQAYTYRQNFSQREGMMIFGDWKSYNTDKKAYDTDYAVARACALQAYIDKTVGWHKNISNVELDGVTGITKAVE 240 (393) Q Consensus 161 ~~~~~~~~a~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~S~~~ag~~a~~D~~~G~~~spaN~~l~gv~~~~~~~~ 240 (393) +|+++.+++++|++.++|.++++||||++.+++.++..+.+|||+++||++|++|.++|||+||||++|.||.+++..++ T Consensus 161 ~~~~t~~~ai~~~~~~~s~~~~~~~P~~~~~d~~~~~~~~~p~s~~~Ag~~a~~d~~~G~~~spaN~~l~gi~~~~~~~~ 240 (393) T protein:vir:10 161 NGATTKEQAYTYRQNFSQREGMMIFGDWKSYNTDKKAYDTDYAVARACALQAYIDKTVGWHKNISNVELDGVTGITKAVE 240 (393) T ss_pred CCCCCHHHHHHHhhhcCCceEEEEecccccccccCCceeEeehhHHHHHHHHHhhcCCCcEEccCCceeeceeecceecc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccCCCchhhhhhcccceEEEEeCCCEEEEecccCCCCcccceeehhhHHHHHHHHHHHHhHHhhcccCCHHHHHHHHHH Q lcl|Aclame:pro 241 FDINESSTEANYLNEKGITICLNHNGFRYWGSRTLATDTRWAFQQSVRTAQIIKETIGAGLAWAVDMPLTPLRVKTMLEA 320 (393) Q Consensus 241 ~~~~~~~~~~~~ln~~gi~~~~~~~G~~~wG~rT~~~d~~~~~i~~rR~~~~i~~~i~~~~~~~v~epn~~~~~~~i~~~ 320 (393) ++.+++++|+++||++||+++++++||++||+||+++||+|+||++|||+++|+++|++.++|++||||++.+|++|+++ T Consensus 241 ~~~~~~~~~~~~Ln~~gI~t~~~~~G~~~wG~rT~s~d~~~~~i~vrR~~~~i~~~i~~~~~~~v~e~~~~~~~~~i~~~ 320 (393) T protein:vir:10 241 FDINESSTEANYLNEKGITICLNHNGFRYWGSRTLATDTRWAFQQSVRTAQIIKETIGAGLAWAVDMPLTPLRVKTMLEA 320 (393) T ss_pred cccCCCcchhHhHhhcCceEEEcCCCEEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhhcccccccceEEEecCCCCHHHhhCCEEEEEEEEEecCcceeEEEEEEEcchHHHHHHHHHhcC Q lcl|Aclame:pro 321 INNKLRSWASGDDPRILGARVWVAEEITADIIKSGKFVIKYDYHWIPSLESLGLEQRVNDEYVVDLVNTLKAL 393 (393) Q Consensus 321 i~~~l~~l~~~g~~~~~~~~v~~~~~nt~~~i~~G~~~~~v~~~p~~p~e~i~~~~~~~~~~~~~~~~~~~~~ 393 (393) ++.||++||++|++++.+++++++++||++++++|+|+++|+++|++|+|||+|+++++++|+++||++|+|| T Consensus 321 i~~~L~~l~~~g~~al~g~~v~~~~~nt~~~i~~G~~~~~i~~~p~~p~e~I~~~~~~~~~~~~~l~~~v~a~ 393 (393) T protein:vir:10 321 INNKLRSWASGDDPRILGARVWVAEEITADIIKSGKFVIKYDYHWIPSLESLGLEQRVNDEYVVDLVNTLKAL 393 (393) T ss_pred HHHHHHHHHhccccccccceEEecCCCCHHHhhCCEEEEEEEEEecCCcceEEEEEEEchHHHHHHHHHHhcC Confidence 9999999999887778999999999999999999999999999999999999999999999999999999999 No 2 >protein:vir:78206 Length: 390 # NCBI annotation: gp33, phage tail sheath protein # Family: family:all:115 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111183;genbank:gi:134288694;genbank:GeneID:4960666 Probab=100.00 E-value=1.1e-112 Score=634.29 Aligned_cols=388 Identities=45% Similarity=0.748 Sum_probs=371.2 Q ss_pred CCcccCCCeEEEEcCCCcccccccccceeEEEEeecccccccccccceEEeecchhhhhhcccccchhhhhhhhhcccCc Q lcl|Aclame:pro 3 ILDTYLHGVEVVEVNAGGVTISTAATSVIGVVCTGDQADAETFPLNTPVLITNPLNYLEKAGSTGTLRRTLNSIGSIVKT 82 (393) Q Consensus 3 m~~~~~~GV~v~ev~~~~~~i~~v~tav~g~vg~a~~~d~~~~p~~~~vl~t~~~~~~~~~g~~~tl~~~~~~~~~~~~~ 82 (393) |+++|+|||||+|++++++|+.+++|++++|+|+++++++..+|+++|+++++..++...++..++|.+++..++.+++. T Consensus 1 M~~~~~~Gv~v~e~~~g~~~i~~~~tav~g~vg~a~~ad~~~~pln~pv~i~s~~~~~~~~g~~gtL~~al~~~~~~gg~ 80 (390) T protein:vir:78 1 MPQDYHHGVRVIEINEGGRPIRSVSTAVLGVVCTAADADASAFPLNTPVLLTNVVAALGKAGKKGTLRRTLDAIGKQTKP 80 (390) T ss_pred CcccccCCeEEEEcCCCcccccccCcceeEEEEcccCcCccccccccceEeccHHHHHhhcCCCceehhhhhhhccccCc Confidence 88999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eEEEEEecccccccccccchhccc-ccccccchhhhhhhhhhhhhccccccccccchHHHHHHHHHhhcccceEEEEecC Q lcl|Aclame:pro 83 PTVIVRVAESDDSDTLTANIVGTQ-ENGKFTGIKALLTAQSTVFVKPKLLCVPQHDNQAVATELLSVAKKLNAFAFISDN 161 (393) Q Consensus 83 ~~~vv~~~~~~~~~~~~~~~~~~~-~~~~~~gl~al~~~~~~~~~~~~~l~apg~s~~~v~~al~~~a~~~~~~~~i~~~ 161 (393) .++++++....+...+.....+.. ..+..+|+++++..+..++..|.++.+|++++.+|+++|..+|++++++++++.+ T Consensus 81 ~~~vv~v~~~~~~~~~~~~~ig~~~~~~~~tg~~al~~~~~~~~~~p~il~ap~~~~~~v~~~l~~~a~~~~~~aivD~p 160 (390) T protein:vir:78 81 LTVVVRVAEGKDADETTSNVIGTVTPDGKYTGIKALLAAQGALGVKPRILAAPGLDTQPVAAALAATAQSLRAMAYVSAS 160 (390) T ss_pred eEEEEEecccccccccccccccccccccccchhhhhhhhhhhhcceehhhcccccchHHHHHHHHHhhcccceEEEEecC Confidence 999999998888888877776644 4567899999999999999999999999999999999999999999999988777 Q ss_pred CCCcchhhhhhhcccccceEEEeccceeEeeccCCceEEechhHHHHHHHHhhhccCCceecCCCceecceeeceeeccc Q lcl|Aclame:pro 162 GATTKEQAYTYRQNFSQREGMMIFGDWKSYNTDKKAYDTDYAVARACALQAYIDKTVGWHKNISNVELDGVTGITKAVEF 241 (393) Q Consensus 162 ~~~~~~~a~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~S~~~ag~~a~~D~~~G~~~spaN~~l~gv~~~~~~~~~ 241 (393) ++.+.+++++++++++|.++++||||++.+++..+..+.+|||+++||++|++|.++||||||||+.|.||.+++.++++ T Consensus 161 ~~~t~~~a~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~Agl~a~~D~~~g~~~spaN~~l~gi~~~~~~~~~ 240 (390) T protein:vir:78 161 GCKTKEEAAAYRKQFGQREIMVIWPDWLGWDDTTNSTAVIPAPAIAAGLRAKIDNDIGWHKTISNVVVNGVSGISADVSW 240 (390) T ss_pred CCCCHHHHHHHhhccCCceEEEEcCceEeecccCCcccccchHHHHHHHHHHhhcCCCcEECcCCceeeceeecceeccc Confidence 78888999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccCCCchhhhhhcccceEEEEeCCCEEEEecccCCCCcccceeehhhHHHHHHHHHHHHhHHhhcccCCHHHHHHHHHHH Q lcl|Aclame:pro 242 DINESSTEANYLNEKGITICLNHNGFRYWGSRTLATDTRWAFQQSVRTAQIIKETIGAGLAWAVDMPLTPLRVKTMLEAI 321 (393) Q Consensus 242 ~~~~~~~~~~~ln~~gi~~~~~~~G~~~wG~rT~~~d~~~~~i~~rR~~~~i~~~i~~~~~~~v~epn~~~~~~~i~~~i 321 (393) ..++..+|+++||++||+++++++||++||+||+++||+|+||++|||+++|+++|+++++|+|||||++.+|++|++++ T Consensus 241 ~~~~~~~~~~~ln~~gi~t~~~~~G~~~wG~rT~s~d~~~~~i~~rR~~~~i~~~i~~~~~~~v~e~n~~~~~~~i~~~i 320 (390) T protein:vir:78 241 DLQDPATDAGYLNEHEVTTLVNRNGFRFWGERTCSDDPKFAFENYTRTAQVAGDSIAEAQMPVVDGPLNPSLARDIVESI 320 (390) T ss_pred ccccccchhhhhhhcCcEEEEcCCCEEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHhhcccccccceEEEec-CCCCHHHhhCCEEEEEEEEEecCcceeEEEEEEEcchHHHHHHHHHhc Q lcl|Aclame:pro 322 NNKLRSWASGDDPRILGARVWVA-EEITADIIKSGKFVIKYDYHWIPSLESLGLEQRVNDEYVVDLVNTLKA 392 (393) Q Consensus 322 ~~~l~~l~~~g~~~~~~~~v~~~-~~nt~~~i~~G~~~~~v~~~p~~p~e~i~~~~~~~~~~~~~~~~~~~~ 392 (393) ++||++||++|. +.|++|+|+ ++||++++++|+|+++|+++|++|+|||+|+++++++|+++|+++|+| T Consensus 321 ~~~L~~l~~~g~--l~g~~v~~d~~~nt~~~i~~G~~~~~v~~~p~~pae~I~~~~~~~~~~~~~~~~~~~~ 390 (390) T protein:vir:78 321 NGWFRQQVANGY--LIGGSAWIDPEPNTADILASGKAYIDYDYTPVPPLENLVLRQRITDRFLADFPARVAG 390 (390) T ss_pred HHHHHHHHhCCc--eeeeEEEEccCCCCHHHhhCCeEEEEEEEEecCCcceEEEEEEEchHHHHHHHHHhcC Confidence 999999999885 566777666 579999999999999999999999999999999999999999999999 No 3 >protein:vir:103993 Length: 390 # NCBI annotation: phage tail sheath protein # Family: family:all:115 # MgeID: mge:1665 # MgeName: phi52237 # Cross-refs: genbank:acc:YP_293729;genbank:gi:72537699;genbank:GeneID:3608123 Probab=100.00 E-value=1.1e-112 Score=634.29 Aligned_cols=388 Identities=45% Similarity=0.748 Sum_probs=371.2 Q ss_pred CCcccCCCeEEEEcCCCcccccccccceeEEEEeecccccccccccceEEeecchhhhhhcccccchhhhhhhhhcccCc Q lcl|Aclame:pro 3 ILDTYLHGVEVVEVNAGGVTISTAATSVIGVVCTGDQADAETFPLNTPVLITNPLNYLEKAGSTGTLRRTLNSIGSIVKT 82 (393) Q Consensus 3 m~~~~~~GV~v~ev~~~~~~i~~v~tav~g~vg~a~~~d~~~~p~~~~vl~t~~~~~~~~~g~~~tl~~~~~~~~~~~~~ 82 (393) |+++|+|||||+|++++++|+.+++|++++|+|+++++++..+|+++|+++++..++...++..++|.+++..++.+++. T Consensus 1 M~~~~~~Gv~v~e~~~g~~~i~~~~tav~g~vg~a~~ad~~~~pln~pv~i~s~~~~~~~~g~~gtL~~al~~~~~~gg~ 80 (390) T protein:vir:10 1 MPQDYHHGVRVIEINEGGRPIRSVSTAVLGVVCTAADADASAFPLNTPVLLTNVVAALGKAGKKGTLRRTLDAIGKQTKP 80 (390) T ss_pred CcccccCCeEEEEcCCCcccccccCcceeEEEEcccCcCccccccccceEeccHHHHHhhcCCCceehhhhhhhccccCc Confidence 88999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eEEEEEecccccccccccchhccc-ccccccchhhhhhhhhhhhhccccccccccchHHHHHHHHHhhcccceEEEEecC Q lcl|Aclame:pro 83 PTVIVRVAESDDSDTLTANIVGTQ-ENGKFTGIKALLTAQSTVFVKPKLLCVPQHDNQAVATELLSVAKKLNAFAFISDN 161 (393) Q Consensus 83 ~~~vv~~~~~~~~~~~~~~~~~~~-~~~~~~gl~al~~~~~~~~~~~~~l~apg~s~~~v~~al~~~a~~~~~~~~i~~~ 161 (393) .++++++....+...+.....+.. ..+..+|+++++..+..++..|.++.+|++++.+|+++|..+|++++++++++.+ T Consensus 81 ~~~vv~v~~~~~~~~~~~~~ig~~~~~~~~tg~~al~~~~~~~~~~p~il~ap~~~~~~v~~~l~~~a~~~~~~aivD~p 160 (390) T protein:vir:10 81 LTVVVRVAEGKDADETTSNVIGTVTPDGKYTGIKALLAAQGALGVKPRILAAPGLDTQPVAAALAATAQSLRAMAYVSAS 160 (390) T ss_pred eEEEEEecccccccccccccccccccccccchhhhhhhhhhhhcceehhhcccccchHHHHHHHHHhhcccceEEEEecC Confidence 999999998888888877776644 4567899999999999999999999999999999999999999999999988777 Q ss_pred CCCcchhhhhhhcccccceEEEeccceeEeeccCCceEEechhHHHHHHHHhhhccCCceecCCCceecceeeceeeccc Q lcl|Aclame:pro 162 GATTKEQAYTYRQNFSQREGMMIFGDWKSYNTDKKAYDTDYAVARACALQAYIDKTVGWHKNISNVELDGVTGITKAVEF 241 (393) Q Consensus 162 ~~~~~~~a~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~S~~~ag~~a~~D~~~G~~~spaN~~l~gv~~~~~~~~~ 241 (393) ++.+.+++++++++++|.++++||||++.+++..+..+.+|||+++||++|++|.++||||||||+.|.||.+++.++++ T Consensus 161 ~~~t~~~a~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~Agl~a~~D~~~g~~~spaN~~l~gi~~~~~~~~~ 240 (390) T protein:vir:10 161 GCKTKEEAAAYRKQFGQREIMVIWPDWLGWDDTTNSTAVIPAPAIAAGLRAKIDNDIGWHKTISNVVVNGVSGISADVSW 240 (390) T ss_pred CCCCHHHHHHHhhccCCceEEEEcCceEeecccCCcccccchHHHHHHHHHHhhcCCCcEECcCCceeeceeecceeccc Confidence 78888999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccCCCchhhhhhcccceEEEEeCCCEEEEecccCCCCcccceeehhhHHHHHHHHHHHHhHHhhcccCCHHHHHHHHHHH Q lcl|Aclame:pro 242 DINESSTEANYLNEKGITICLNHNGFRYWGSRTLATDTRWAFQQSVRTAQIIKETIGAGLAWAVDMPLTPLRVKTMLEAI 321 (393) Q Consensus 242 ~~~~~~~~~~~ln~~gi~~~~~~~G~~~wG~rT~~~d~~~~~i~~rR~~~~i~~~i~~~~~~~v~epn~~~~~~~i~~~i 321 (393) ..++..+|+++||++||+++++++||++||+||+++||+|+||++|||+++|+++|+++++|+|||||++.+|++|++++ T Consensus 241 ~~~~~~~~~~~ln~~gi~t~~~~~G~~~wG~rT~s~d~~~~~i~~rR~~~~i~~~i~~~~~~~v~e~n~~~~~~~i~~~i 320 (390) T protein:vir:10 241 DLQDPATDAGYLNEHEVTTLVNRNGFRFWGERTCSDDPKFAFENYTRTAQVAGDSIAEAQMPVVDGPLNPSLARDIVESI 320 (390) T ss_pred ccccccchhhhhhhcCcEEEEcCCCEEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHhhcccccccceEEEec-CCCCHHHhhCCEEEEEEEEEecCcceeEEEEEEEcchHHHHHHHHHhc Q lcl|Aclame:pro 322 NNKLRSWASGDDPRILGARVWVA-EEITADIIKSGKFVIKYDYHWIPSLESLGLEQRVNDEYVVDLVNTLKA 392 (393) Q Consensus 322 ~~~l~~l~~~g~~~~~~~~v~~~-~~nt~~~i~~G~~~~~v~~~p~~p~e~i~~~~~~~~~~~~~~~~~~~~ 392 (393) ++||++||++|. +.|++|+|+ ++||++++++|+|+++|+++|++|+|||+|+++++++|+++|+++|+| T Consensus 321 ~~~L~~l~~~g~--l~g~~v~~d~~~nt~~~i~~G~~~~~v~~~p~~pae~I~~~~~~~~~~~~~~~~~~~~ 390 (390) T protein:vir:10 321 NGWFRQQVANGY--LIGGSAWIDPEPNTADILASGKAYIDYDYTPVPPLENLVLRQRITDRFLADFPARVAG 390 (390) T ss_pred HHHHHHHHhCCc--eeeeEEEEccCCCCHHHhhCCeEEEEEEEEecCCcceEEEEEEEchHHHHHHHHHhcC Confidence 999999999885 566777666 579999999999999999999999999999999999999999999999 No 4 >protein:vir:79181 Length: 390 # NCBI annotation: gp30, phage tail sheath protein # Family: family:all:115 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111061;genbank:gi:134288772;genbank:GeneID:4960700 Probab=100.00 E-value=2.5e-111 Score=626.94 Aligned_cols=388 Identities=45% Similarity=0.750 Sum_probs=369.2 Q ss_pred CCcccCCCeEEEEcCCCcccccccccceeEEEEeecccccccccccceEEeecchhhhhhcccccchhhhhhhhhcccCc Q lcl|Aclame:pro 3 ILDTYLHGVEVVEVNAGGVTISTAATSVIGVVCTGDQADAETFPLNTPVLITNPLNYLEKAGSTGTLRRTLNSIGSIVKT 82 (393) Q Consensus 3 m~~~~~~GV~v~ev~~~~~~i~~v~tav~g~vg~a~~~d~~~~p~~~~vl~t~~~~~~~~~g~~~tl~~~~~~~~~~~~~ 82 (393) ||++|+|||||+|++++++|+.+++|++++|+|+++++++..+|+++|+++++..++..++|..++|+.+++.++.+++. T Consensus 1 M~~~~~~Gv~v~e~~~~~~~i~~~~tav~~~vg~a~dad~~~~p~n~pv~its~~~~~~~~g~~~tL~~al~~~~~~~~~ 80 (390) T protein:vir:79 1 MPQDYHHGVRVIEINEGGRPIRSVSTAVLGVVCTAADADASAFPLNTPVLLTNVVAALGKAGKKGTLRRTLDAIGKQTKP 80 (390) T ss_pred CccccCCCeEEEEcCCCcccccccCCceeEEEEecCCCCccccccccceEeecHHHHHHhcCCCccchhhhhhhcccccc Confidence 88999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eEEEEEecccccccccccchhcc-cccccccchhhhhhhhhhhhhccccccccccchHHHHHHHHHhhcccceEEEEecC Q lcl|Aclame:pro 83 PTVIVRVAESDDSDTLTANIVGT-QENGKFTGIKALLTAQSTVFVKPKLLCVPQHDNQAVATELLSVAKKLNAFAFISDN 161 (393) Q Consensus 83 ~~~vv~~~~~~~~~~~~~~~~~~-~~~~~~~gl~al~~~~~~~~~~~~~l~apg~s~~~v~~al~~~a~~~~~~~~i~~~ 161 (393) .++++++....+...+.....+. .+.+..+|+++++..++.+++.|.++++|++++.+++++|..+|++++++++++.+ T Consensus 81 ~~~vv~v~~~~~~~~~~~~~ig~~~~~~~~tgl~al~~~~~~~~~~p~il~ap~~~~~~v~~~l~~~a~~~~~~ai~D~p 160 (390) T protein:vir:79 81 LTVVVRVAEGKDADETTSNVIGTVTPDGKYTGIKALLAAQGALGVKPRILAAPGLDTQPVAAALAATAQSLRAMAYVSAS 160 (390) T ss_pred eEEEEeeccccccccccceeeecccccccchhhhhhhhhhhhhccccccccCCcccchHHHHHHHHhhhhcceEEEEEcc Confidence 99999998887777776665554 44678999999999999999999999999999999999999999999999888777 Q ss_pred CCCcchhhhhhhcccccceEEEeccceeEeeccCCceEEechhHHHHHHHHhhhccCCceecCCCceecceeeceeeccc Q lcl|Aclame:pro 162 GATTKEQAYTYRQNFSQREGMMIFGDWKSYNTDKKAYDTDYAVARACALQAYIDKTVGWHKNISNVELDGVTGITKAVEF 241 (393) Q Consensus 162 ~~~~~~~a~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~S~~~ag~~a~~D~~~G~~~spaN~~l~gv~~~~~~~~~ 241 (393) ++.+.+++.+|+++++|.++++||||++.+++..+..+.+|||+++||++|++|.++|||+||||++|+|+.+++..+++ T Consensus 161 ~~~t~~~a~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~Ag~~a~~D~~~g~~~spsN~~i~gi~~~~~~~~~ 240 (390) T protein:vir:79 161 GCKTKEEAAAYRRQFGQREIMVIWPDWLGWDDTTNSTAVIPAPAIAAGLRAKIDNDIGWHKTISNVVVNGVSGISADVSW 240 (390) T ss_pred CCCCHHHHHHHhcCCCCceEEEEcCceeecccccCceeEeehHHHHHHHHHhhhccCCcEEccCCceeeccceeeeeccc Confidence 77788999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccCCCchhhhhhcccceEEEEeCCCEEEEecccCCCCcccceeehhhHHHHHHHHHHHHhHHhhcccCCHHHHHHHHHHH Q lcl|Aclame:pro 242 DINESSTEANYLNEKGITICLNHNGFRYWGSRTLATDTRWAFQQSVRTAQIIKETIGAGLAWAVDMPLTPLRVKTMLEAI 321 (393) Q Consensus 242 ~~~~~~~~~~~ln~~gi~~~~~~~G~~~wG~rT~~~d~~~~~i~~rR~~~~i~~~i~~~~~~~v~epn~~~~~~~i~~~i 321 (393) ..++.++|+++||++||+++++++||++||+||+++||+|+||++||++++|+++|+++++|++||||++.+|++|++++ T Consensus 241 ~~~~~~~~a~~Ln~~gi~t~~~~~G~~~wG~rT~~~d~~~~~i~vrR~~~~i~~~i~~~~~~~v~e~~~~~~~~~i~~~i 320 (390) T protein:vir:79 241 DLQDPATDAGYLNEHEVTTLVNRNGFRFWGERTCSDDPKFAFENYTRTAQVAADSIAEAQMPVVDGPLNPSLARDIVESI 320 (390) T ss_pred cccccchhhhhhhhcCcEEEEcCCCEEEEeccccCCCcccceeeehhhHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHhhcccccccceEEEec-CCCCHHHhhCCEEEEEEEEEecCcceeEEEEEEEcchHHHHHHHHHhc Q lcl|Aclame:pro 322 NNKLRSWASGDDPRILGARVWVA-EEITADIIKSGKFVIKYDYHWIPSLESLGLEQRVNDEYVVDLVNTLKA 392 (393) Q Consensus 322 ~~~l~~l~~~g~~~~~~~~v~~~-~~nt~~~i~~G~~~~~v~~~p~~p~e~i~~~~~~~~~~~~~~~~~~~~ 392 (393) +.||++||++|+ +.|++++|+ ++||++++++|+|+++|+++|++|+|||+|+++++++|+++|+++|+| T Consensus 321 ~~~L~~l~~~ga--l~g~~v~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~~~~v~~ 390 (390) T protein:vir:79 321 NGWFRQQVANGY--LIGGSAWIDPEPNTADILASGKAYIDYDYTPVPPLENLVLRQRITDRFLADFPARVAG 390 (390) T ss_pred HHHHHHHHhCCc--eeeeEEEEecCCCCHHHhhCCEEEEEEEEEecCCcceEEEEEEEchHHHHHHHHHhcC Confidence 999999999885 456777665 679999999999999999999999999999999999999999999999 No 5 >protein:vir:79141 Length: 391 # NCBI annotation: major tail seath protein gpFI # Family: family:all:115 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165276;genbank:gi:145708101;genbank:GeneID:5247152 Probab=100.00 E-value=2e-110 Score=621.96 Aligned_cols=389 Identities=44% Similarity=0.761 Sum_probs=370.5 Q ss_pred CCcccCCCeEEEEcCCCcccccccccceeEEEEeecccccccccccceEEeecchhhhhhcccccchhhhhhhhhcccCc Q lcl|Aclame:pro 3 ILDTYLHGVEVVEVNAGGVTISTAATSVIGVVCTGDQADAETFPLNTPVLITNPLNYLEKAGSTGTLRRTLNSIGSIVKT 82 (393) Q Consensus 3 m~~~~~~GV~v~ev~~~~~~i~~v~tav~g~vg~a~~~d~~~~p~~~~vl~t~~~~~~~~~g~~~tl~~~~~~~~~~~~~ 82 (393) ||++|+|||||+|++++++|+.++++++++|+|+++++++..+|+++|+++++..++...++..+++..++..++++++. T Consensus 1 M~~~~~pGv~v~e~~~~~~~i~~~~tav~~~vg~a~~a~~~~~p~n~pv~iss~~~~~~~~g~~gtl~~al~~~~~~gg~ 80 (391) T protein:vir:79 1 MPTDYHHGVRVVELNDGTRPIRTIETAVAGIVCTADDADAATFPLDTPVLLTNPQAYIGKAGDKGTLAHTLDAITDQTNP 80 (391) T ss_pred CCCCCCCCeEEEECCCCcccccccCCceEEEEeecccccccccccccCEEeccHHHHHHhcCCccccchhhhhhhccccc Confidence 88999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eEEEEEecccccccccccchhccc-ccccccchhhhhhhhhhhhhccccccccccchHHHHHHHHHhhcccceEEEEecC Q lcl|Aclame:pro 83 PTVIVRVAESDDSDTLTANIVGTQ-ENGKFTGIKALLTAQSTVFVKPKLLCVPQHDNQAVATELLSVAKKLNAFAFISDN 161 (393) Q Consensus 83 ~~~vv~~~~~~~~~~~~~~~~~~~-~~~~~~gl~al~~~~~~~~~~~~~l~apg~s~~~v~~al~~~a~~~~~~~~i~~~ 161 (393) .++++++....+.........+.. .++..+|+++++.++..++..|.++++|++++.+++++|.++|++++++++++.+ T Consensus 81 ~~~vv~~~~~~~~~~~~~~~~g~~~~~~~~tGl~~l~~~~~~~~~~p~~l~~p~~~~~~v~~al~~~~~~~~~~ai~d~p 160 (391) T protein:vir:79 81 LTVVVRVAGGASEAETTSNLIGTTNAAGRYTGMKALLTARNRFGVAPRILAVPGLDSLPVGTELVTIAQKLRAFAYLSAY 160 (391) T ss_pred ceeeeccccccccccccccccccccchhhhHHHhhhhhhhhhhcccchhhcCCccchhHHHHHHHHHHhhcCcEEEEECC Confidence 999999998888777776665543 4677899999999999999999999999999999999999999999999988777 Q ss_pred CCCcchhhhhhhcccccceEEEeccceeEeeccCCceEEechhHHHHHHHHhhhccCCceecCCCceecceeeceeeccc Q lcl|Aclame:pro 162 GATTKEQAYTYRQNFSQREGMMIFGDWKSYNTDKKAYDTDYAVARACALQAYIDKTVGWHKNISNVELDGVTGITKAVEF 241 (393) Q Consensus 162 ~~~~~~~a~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~S~~~ag~~a~~D~~~G~~~spaN~~l~gv~~~~~~~~~ 241 (393) ++.+.++++++++.++|+++++||||++.+++..+..+.+|||+++||++|++|.++||||||||++|.||.++++++++ T Consensus 161 ~~~t~~~a~~~~~~~~s~~~a~~~P~~~~~d~~~~~~~~~p~s~~~AG~~a~~D~~~g~~~spaN~~l~gi~~~~~~~~~ 240 (391) T protein:vir:79 161 GCQTKEEAVAYRSNFGQREAMVMWPDFVGWDTAANAETTLWATARAVGLRAKIDNDTGWHKTLSNVAVGGVTGLSRDVFW 240 (391) T ss_pred CCCCHHHHHHHHhccCCceeEEecceeeeecCcCCceeeechHHHHHHHHHHhhhcccceeccCCceehhhhcccccccc Confidence 77888999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccCCCchhhhhhcccceEEEEeCCCEEEEecccCCCCcccceeehhhHHHHHHHHHHHHhHHhhcccCCHHHHHHHHHHH Q lcl|Aclame:pro 242 DINESSTEANYLNEKGITICLNHNGFRYWGSRTLATDTRWAFQQSVRTAQIIKETIGAGLAWAVDMPLTPLRVKTMLEAI 321 (393) Q Consensus 242 ~~~~~~~~~~~ln~~gi~~~~~~~G~~~wG~rT~~~d~~~~~i~~rR~~~~i~~~i~~~~~~~v~epn~~~~~~~i~~~i 321 (393) ..++..+++++||++||+++++++||++||+||+++||+|+||++||++++|+++|+++++|++||||++.+|++|++++ T Consensus 241 ~~~~~~~~~~~Ln~~~I~t~~~~~G~~~wG~rT~~~d~~~~~i~~rR~~~~i~~~i~~~~~~~v~epn~~~~~~~i~~~i 320 (391) T protein:vir:79 241 DLQDPATDAGYLNANEVTTLVHRDGYRFWGSRTCSADPLFAFENYTRTAQVLADTMAEAHMWANDLPMTPTLVRDLLEGI 320 (391) T ss_pred ccccccchhhhhhhcCceEEECCCcEEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHhhcccccccceEEEec-CCCCHHHhhCCEEEEEEEEEecCcceeEEEEEEEcchHHHHHHHHHhcC Q lcl|Aclame:pro 322 NNKLRSWASGDDPRILGARVWVA-EEITADIIKSGKFVIKYDYHWIPSLESLGLEQRVNDEYVVDLVNTLKAL 393 (393) Q Consensus 322 ~~~l~~l~~~g~~~~~~~~v~~~-~~nt~~~i~~G~~~~~v~~~p~~p~e~i~~~~~~~~~~~~~~~~~~~~~ 393 (393) ++||++||++|.+ .|++++|+ ++||++++++|+|+++|+++|++|+|||+|+++++++|+++|+++|+|- T Consensus 321 ~~~l~~l~~~g~l--~g~~v~~~~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~~~~v~~a 391 (391) T protein:vir:79 321 NAKLRMLTRNGYL--LGGAAWFDADANSKDTLKAGQLAIDYDYTPVPPLENLTFRQRITDRYLMQFAEAVKAA 391 (391) T ss_pred HHHHHHHHhCCce--eceEEEEecCCCCHHHhhCCEEEEEEEEEecCCcceEEEEEEEchHHHHHHHHHhhcC Confidence 9999999998854 55666555 6799999999999999999999999999999999999999999999999 No 6 >protein:vir:1172 Length: 391 # NCBI annotation: hypothetical protein # Family: family:all:115 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490621;genbank:gi:17313241;genbank:GeneID:927300 Probab=100.00 E-value=1.1e-109 Score=617.84 Aligned_cols=389 Identities=43% Similarity=0.716 Sum_probs=369.4 Q ss_pred CCCCcccCCCeEEEEcCCCcccccccccceeEEEEeecccccccccccceEEeecchhhhhhcccccchhhhhhhhhccc Q lcl|Aclame:pro 1 MSILDTYLHGVEVVEVNAGGVTISTAATSVIGVVCTGDQADAETFPLNTPVLITNPLNYLEKAGSTGTLRRTLNSIGSIV 80 (393) Q Consensus 1 m~m~~~~~~GV~v~ev~~~~~~i~~v~tav~g~vg~a~~~d~~~~p~~~~vl~t~~~~~~~~~g~~~tl~~~~~~~~~~~ 80 (393) |+ +++|+||||++|++++++++..+.+++++|+|+++++++..+|+++|+++++..++...++..+++..++..++.++ T Consensus 1 M~-~~~~~~GV~v~e~~~~~~~i~~v~tavig~vg~a~~a~~~~~~~~~p~~v~s~~~~~~~~g~~~tl~~al~~~~~~~ 79 (391) T protein:vir:11 1 MA-ADQYHHGVRVQEINDGTRPIRTIATAIIGLVATAEDADATAFPLDTPVLITNVQAAIGKAGTSGTLPASLQAIADQA 79 (391) T ss_pred CC-CCcCCCcEEEEECCCCcceecccCCceeEEEEecCCCCCccccccccEEEecchhhheecCCCccchhhhhhhhccc Confidence 65 55778999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CceEEEEEecccccccccccchhccc-ccccccchhhhhhhhhhhhhccccccccccchHHHHHHHHHhhcccceEEEEe Q lcl|Aclame:pro 81 KTPTVIVRVAESDDSDTLTANIVGTQ-ENGKFTGIKALLTAQSTVFVKPKLLCVPQHDNQAVATELLSVAKKLNAFAFIS 159 (393) Q Consensus 81 ~~~~~vv~~~~~~~~~~~~~~~~~~~-~~~~~~gl~al~~~~~~~~~~~~~l~apg~s~~~v~~al~~~a~~~~~~~~i~ 159 (393) +..++++++...++...+..+..+.. .....+++++++.++..++..|.++.+|++++.+++++|.++|++++++++++ T Consensus 80 g~~~~vv~~~~~~~~~~t~~d~~g~~~a~~~~~g~~a~~~~~~~~~~~p~~~~ap~~~~~~v~~al~~~~~~~~~~~i~D 159 (391) T protein:vir:11 80 NAATVVVRVKPGEDEAATNSAVIGGVSADGKYTGMKALLAAKARLGVVPRILGVPGLDTQPVATALIAIAQQLRAFAYVS 159 (391) T ss_pred cceeEEeeecccccccccchhhhcccccccchhhhhhhhhhhhhheeccccccccccccHHHHHHHHHhhcccceEEEEE Confidence 99999999999888888777766644 46678999999999999999999999999999999999999999999998888 Q ss_pred cCCCCcchhhhhhhcccccceEEEeccceeEeeccCCceEEechhHHHHHHHHhhhccCCceecCCCceecceeeceeec Q lcl|Aclame:pro 160 DNGATTKEQAYTYRQNFSQREGMMIFGDWKSYNTDKKAYDTDYAVARACALQAYIDKTVGWHKNISNVELDGVTGITKAV 239 (393) Q Consensus 160 ~~~~~~~~~a~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~S~~~ag~~a~~D~~~G~~~spaN~~l~gv~~~~~~~ 239 (393) .+++.+.+++++++++++|.++++||||++.+++..+..+.+|||+++||+++|+|.++|||+||||++|+||.+++.++ T Consensus 160 ~p~~~t~~~a~~~r~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~s~~~ag~~a~~d~~~g~~~span~~l~gi~~~~~~~ 239 (391) T protein:vir:11 160 ASGCKTKEEATAYRENFAAREAMVIWPDFLTWSTVVNQTVPAPAVAQALGLRARIDQEVGWHKTLSNVAVNGVTGISADV 239 (391) T ss_pred cCCCCCHHHHHHHhhhcCCceEEEEcCcceecccccCceEEechHHHHHHHHHHhhccCCcEEccCCceeeceeeccccc Confidence 77788889999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccCCCchhhhhhcccceEEEEeCCCEEEEecccCCCCcccceeehhhHHHHHHHHHHHHhHHhhcccCCHHHHHHHHH Q lcl|Aclame:pro 240 EFDINESSTEANYLNEKGITICLNHNGFRYWGSRTLATDTRWAFQQSVRTAQIIKETIGAGLAWAVDMPLTPLRVKTMLE 319 (393) Q Consensus 240 ~~~~~~~~~~~~~ln~~gi~~~~~~~G~~~wG~rT~~~d~~~~~i~~rR~~~~i~~~i~~~~~~~v~epn~~~~~~~i~~ 319 (393) +++.+++++|+++||++||+++++++||++||+||+++||+|+||++||++++|+++|++.++|+|||||++.+|++|++ T Consensus 240 ~~~~~~~~~~~~~Ln~~gi~~~~~~~G~~~wG~rT~~~d~~~~~i~vrR~~~~i~~~~~~~~~~~v~e~n~~~~~~~i~~ 319 (391) T protein:vir:11 240 FWDLQSPSTDANYLNENEVTTLVQEGGFRFWGSRTCSDDPLFAFENYTRTAQVLADTIAEAHMWAVDKPMHPSLVRDILE 319 (391) T ss_pred ccccCCCcchhhhhhhcCcEEEEcCCCEEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhhcccccccceEEEec-CCCCHHHhhCCEEEEEEEEEecCcceeEEEEEEEcchHHHHHHHHHhc Q lcl|Aclame:pro 320 AINNKLRSWASGDDPRILGARVWVA-EEITADIIKSGKFVIKYDYHWIPSLESLGLEQRVNDEYVVDLVNTLKA 392 (393) Q Consensus 320 ~i~~~l~~l~~~g~~~~~~~~v~~~-~~nt~~~i~~G~~~~~v~~~p~~p~e~i~~~~~~~~~~~~~~~~~~~~ 392 (393) +++.||++||++|.+ .|++++|+ ++||++++++|+|+++|+++|++|+|||+++++++++||++|+++|+| T Consensus 320 ~i~~~l~~l~~~g~l--~g~~~~~~~~~n~~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~~~~~~a 391 (391) T protein:vir:11 320 GVNAKFRELKGLGLI--IDAQAWYDPNVNDKDTLKAGKLRITYDYTPVPPLEDLTFFQKITDSYLVDFASRVNA 391 (391) T ss_pred HHHHHHHHHHhccce--eceEEEEecCCCCHHHhhCCeEEEEEEEEecCCcceEEEEEEEchHHHHHHHHHhcC Confidence 999999999998864 45666555 679999999999999999999999999999999999999999999999 No 7 >protein:vir:98553 Length: 395 # NCBI annotation: gp21 # Family: family:all:115 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958076;genbank:gi:41057373;genbank:GeneID:2744224 Probab=100.00 E-value=1.7e-109 Score=616.88 Aligned_cols=387 Identities=46% Similarity=0.759 Sum_probs=361.1 Q ss_pred CcccCCCeEEEEcCCCcccccccccceeEEEEeecccccccccccceEEeecchhhhhhcccccchhhhhhhhhcccCce Q lcl|Aclame:pro 4 LDTYLHGVEVVEVNAGGVTISTAATSVIGVVCTGDQADAETFPLNTPVLITNPLNYLEKAGSTGTLRRTLNSIGSIVKTP 83 (393) Q Consensus 4 ~~~~~~GV~v~ev~~~~~~i~~v~tav~g~vg~a~~~d~~~~p~~~~vl~t~~~~~~~~~g~~~tl~~~~~~~~~~~~~~ 83 (393) |++|+|||||+|++++++|+.+++|++++|+|++++.++..+|+++|+++++..++...+|..++|..++..++++++.. T Consensus 1 m~~~~~GV~v~e~~~g~~~v~~v~tav~~~vgta~~~~~~~~p~~~pv~v~s~~~~~~~~g~~~tl~~al~~~~~~~~~~ 80 (395) T protein:vir:98 1 MSDFHHGTQVIEINDGTRVISTVATAVVGMVCTASDADATLFPLNEPVLITNVQSAIAKAGKKGTLAASLQAIADQSKPV 80 (395) T ss_pred CCCCCCCeEEEEcCCCcccccccCcceEEEEeeccCCCccccccccceEeechHHhHhhcccccchhhHHHHHhhccCce Confidence 67889999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEEeccccccccc------ccchhc-ccccccccchhhhhhhhhhhhhccccccccccchHHHHHHHHHhhcccceEE Q lcl|Aclame:pro 84 TVIVRVAESDDSDTL------TANIVG-TQENGKFTGIKALLTAQSTVFVKPKLLCVPQHDNQAVATELLSVAKKLNAFA 156 (393) Q Consensus 84 ~~vv~~~~~~~~~~~------~~~~~~-~~~~~~~~gl~al~~~~~~~~~~~~~l~apg~s~~~v~~al~~~a~~~~~~~ 156 (393) +++++.......... ...+.+ ....+.++|+++++.++..+++.|.++++||+++.+++++|.++|+++++++ T Consensus 81 ~~vv~~~~~~~~~~~~~~a~~~~~i~g~~~~~~~~Tgl~al~~~~~~~~~~p~il~ap~~~~~~v~~al~~~~~~~~~~~ 160 (395) T protein:vir:98 81 TVVVRVEDGTGDDEEAALAQTVSNIIGGTDENGKYTGIKALLTAQAVTGVKPRILGVPGLDTKEVAVALASAAIKLRAFA 160 (395) T ss_pred EEEeeccccccccccccccccccccccccccccchhHHHHHhhhhhhhccchhhcccccccccHHHHHHHHHhhhcCcEE Confidence 999887654433322 222223 3346778999999999999999999999999999999999999999999988 Q ss_pred EEecCCCCcchhhhhhhcccccceEEEeccceeEeeccCCceEEechhHHHHHHHHhhhccCCceecCCCceecceeece Q lcl|Aclame:pro 157 FISDNGATTKEQAYTYRQNFSQREGMMIFGDWKSYNTDKKAYDTDYAVARACALQAYIDKTVGWHKNISNVELDGVTGIT 236 (393) Q Consensus 157 ~i~~~~~~~~~~a~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~S~~~ag~~a~~D~~~G~~~spaN~~l~gv~~~~ 236 (393) +++.+++.+.+++++|+++++|+++++||||++++++..+..+.+|||+++||++|++|.++|+||||||+.|+||.+++ T Consensus 161 ~~d~p~~~t~~~a~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~AG~~a~~d~~~g~~~spaN~~i~gi~~~~ 240 (395) T protein:vir:98 161 YVSAWGCKTISEAMEYRKNFSQRELMVIWPDFLAWDTVKNTTATAYATARALGLRAYIDQTVGWHKTLSNVGVQGVTGIS 240 (395) T ss_pred EEEcCCCCCHHHHHHHHhccCCceEEEEecceeEecccCCceeeechHHHHHHHHHHhhcccCcEeccCCceeecccccc Confidence 88776777889999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eecccccCCCchhhhhhcccceEEEEeCCCEEEEecccCCCCcccceeehhhHHHHHHHHHHHHhHHhhcccCCHHHHHH Q lcl|Aclame:pro 237 KAVEFDINESSTEANYLNEKGITICLNHNGFRYWGSRTLATDTRWAFQQSVRTAQIIKETIGAGLAWAVDMPLTPLRVKT 316 (393) Q Consensus 237 ~~~~~~~~~~~~~~~~ln~~gi~~~~~~~G~~~wG~rT~~~d~~~~~i~~rR~~~~i~~~i~~~~~~~v~epn~~~~~~~ 316 (393) .+++++.+++++|++.||++||+++++++||++||+||+++|++|+||++||++++|+++|++.++|++||||++.+|++ T Consensus 241 ~~~~~~~~~~~~~~~~Ln~~gI~~~~~~~G~~~wG~rT~s~d~~~~~i~~rR~~~~i~~~i~~~~~~~v~e~~~~~~~~~ 320 (395) T protein:vir:98 241 ASVFWDLQASGTDADLLNEAGVTTLVRKDGFRFWGNRTCSDDPLFLFENYTRTAQVLADTMAEAHMWAVDKPITATLIRD 320 (395) T ss_pred eecccccCCCcchHHhhhhcCcEEEEcCCCEEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHhhcccccccceEEEec-CCCCHHHhhCCEEEEEEEEEecCcceeEEEEEEEcchHHHHHHHHHhc Q lcl|Aclame:pro 317 MLEAINNKLRSWASGDDPRILGARVWVA-EEITADIIKSGKFVIKYDYHWIPSLESLGLEQRVNDEYVVDLVNTLKA 392 (393) Q Consensus 317 i~~~i~~~l~~l~~~g~~~~~~~~v~~~-~~nt~~~i~~G~~~~~v~~~p~~p~e~i~~~~~~~~~~~~~~~~~~~~ 392 (393) |+++++.||++||++|. +.|++++|+ ++||++++++|+|+++|+++|++|+|||+|+++++++|+++++++|+| T Consensus 321 i~~~i~~~L~~l~~~g~--l~g~~v~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~I~~~~~~~~~~~~~~~~~~~~ 395 (395) T protein:vir:98 321 IVDGINAKFRELKSNGY--IVEGKCWFDEESNDKETLKAGKLYIDYDYTPVPPLESLTLRQRITDKYLVNLAESVNS 395 (395) T ss_pred HHHHHHHHHHHHHhCCc--eeceEEEEecCCCCHHHhhCCeEEEEEEEEecCCcceEEEEEEEchHHHHHHHHHhcC Confidence 99999999999999885 455677665 679999999999999999999999999999999999999999999999 No 8 >protein:vir:1845 Length: 392 # NCBI annotation: tail sheath protein FI # Family: family:all:115 # MgeID: mge:324 # MgeName: 186 # Cross-refs: genbank:acc:NP_052269;genbank:gi:9634076;genbank:GeneID:1262448 Probab=100.00 E-value=2e-109 Score=616.48 Aligned_cols=387 Identities=46% Similarity=0.750 Sum_probs=363.5 Q ss_pred CcccCCCeEEEEcCCCcccccccccceeEEEEeecccccccccccceEEeecchhhhhhcccccchhhhhhhhhcccCce Q lcl|Aclame:pro 4 LDTYLHGVEVVEVNAGGVTISTAATSVIGVVCTGDQADAETFPLNTPVLITNPLNYLEKAGSTGTLRRTLNSIGSIVKTP 83 (393) Q Consensus 4 ~~~~~~GV~v~ev~~~~~~i~~v~tav~g~vg~a~~~d~~~~p~~~~vl~t~~~~~~~~~g~~~tl~~~~~~~~~~~~~~ 83 (393) |++|+|||||+|++++++|+.+++|++++|+|++++.++..+|+++|++++++.++...++..+++..++..++++++.. T Consensus 1 m~~~~~Gv~v~e~~~g~~~i~~~~tav~g~vgta~~~~~~~~~~~~p~~its~~~~~~~~g~~gtl~~al~~~~~ngg~~ 80 (392) T protein:vir:18 1 MSDFHHGTKVIEINDGTRVISTVATAIVGMVWTASDADAETFPLNEPVLITNVQSAIAKAGKKGTLSASLQAIADQSKPV 80 (392) T ss_pred CCCCCCCeEEEEcCCCceeeeccCcceeEEEEeccCCCCcccccccceEeechHHHHhhcCCCcchHHHHHHhhcccCce Confidence 66789999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEEeccc---ccccccccchhcc-cccccccchhhhhhhhhhhhhccccccccccchHHHHHHHHHhhcccceEEEEe Q lcl|Aclame:pro 84 TVIVRVAES---DDSDTLTANIVGT-QENGKFTGIKALLTAQSTVFVKPKLLCVPQHDNQAVATELLSVAKKLNAFAFIS 159 (393) Q Consensus 84 ~~vv~~~~~---~~~~~~~~~~~~~-~~~~~~~gl~al~~~~~~~~~~~~~l~apg~s~~~v~~al~~~a~~~~~~~~i~ 159 (393) ++++++... .+...+..+..+. ..++..+++++++.++..++..|.++++||+++.+|+++|.++|++++++++++ T Consensus 81 ~~vv~v~~~~~~~~~~~t~~dliG~~~~~~~~tg~~al~~~~~~~~~~p~il~ap~~~~~~v~~~l~~~~~~~~~~~~~d 160 (392) T protein:vir:18 81 TVVVRVAEGTGDDAEAQTTSNIIGGTDENGKYTGIKALLTAEAVTGVKPRILGVPGLDTQEVATALASVCISLRAFGYVS 160 (392) T ss_pred EEEecccccccccccccchhhheecccccchhhhHHHHHhhhhhhceeehhcccCccchHHHHHHHHHHHhhcCcEEEEe Confidence 988876443 3444454555554 346788999999999999999999999999999999999999999999999888 Q ss_pred cCCCCcchhhhhhhcccccceEEEeccceeEeeccCCceEEechhHHHHHHHHhhhccCCceecCCCceecceeeceeec Q lcl|Aclame:pro 160 DNGATTKEQAYTYRQNFSQREGMMIFGDWKSYNTDKKAYDTDYAVARACALQAYIDKTVGWHKNISNVELDGVTGITKAV 239 (393) Q Consensus 160 ~~~~~~~~~a~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~S~~~ag~~a~~D~~~G~~~spaN~~l~gv~~~~~~~ 239 (393) .+++.+.+++.+|+++++|.++++||||++.+++..+..+++|||+++||++|++|.++|||+||||++|+||.+++.++ T Consensus 161 ~~~~~~~~~a~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~AG~~a~~d~~~g~~~spaN~~l~gi~~~~~~~ 240 (392) T protein:vir:18 161 AWGCKTISEAMAYRENFSQRELMVIWPDFLAWDTTANATATAYATARALGLRAYIDQTIGWHKTLSNVGVQGVTGISASV 240 (392) T ss_pred cCCCCCHHHHHHHHhhccCceEEEEeCceeeecccCCceEEechHHHHHHHHHhhhccCCceEccCCceeeceeecceec Confidence 88888999999999999999999999999999998999999999999999999999999999999999999999999999 Q ss_pred ccccCCCchhhhhhcccceEEEEeCCCEEEEecccCCCCcccceeehhhHHHHHHHHHHHHhHHhhcccCCHHHHHHHHH Q lcl|Aclame:pro 240 EFDINESSTEANYLNEKGITICLNHNGFRYWGSRTLATDTRWAFQQSVRTAQIIKETIGAGLAWAVDMPLTPLRVKTMLE 319 (393) Q Consensus 240 ~~~~~~~~~~~~~ln~~gi~~~~~~~G~~~wG~rT~~~d~~~~~i~~rR~~~~i~~~i~~~~~~~v~epn~~~~~~~i~~ 319 (393) +++.+++++|++.||++||+++++++|+++||+||+++||+|+||++||++++|+++|+++++|+|||||++.+|++|++ T Consensus 241 ~~~~~~~~~~~~~Ln~~gI~t~~~~~G~~~wG~rT~~~d~~~~~i~~rR~~~~i~~~i~~~~~~~v~e~n~~~~~~~i~~ 320 (392) T protein:vir:18 241 FWDLQASGTDADLLNEAGVTTLVRKDGFRFWGNRTCSDDPLFLFENYTRTAQVLADTMAEAHMWAVDKPITASLIRDIVD 320 (392) T ss_pred ccccCCCcchhhhhhhcCceEEEcCCCEEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhhcccccccceEEEec-CCCCHHHhhCCEEEEEEEEEecCcceeEEEEEEEcchHHHHHHHHHhc Q lcl|Aclame:pro 320 AINNKLRSWASGDDPRILGARVWVA-EEITADIIKSGKFVIKYDYHWIPSLESLGLEQRVNDEYVVDLVNTLKA 392 (393) Q Consensus 320 ~i~~~l~~l~~~g~~~~~~~~v~~~-~~nt~~~i~~G~~~~~v~~~p~~p~e~i~~~~~~~~~~~~~~~~~~~~ 392 (393) ++++||++||++|.+ .|++++++ ++||++++++|+|+++|+++|++|+|||+|+++++++|+++++++|+| T Consensus 321 ~i~~~L~~l~~~gal--~g~~v~~d~~~nt~~~i~~G~~~~~v~~~p~~p~e~I~~~~~~~~~~~~~~~~~~~~ 392 (392) T protein:vir:18 321 GINAKFRELKSNGYI--VDGECWFDEESNDKETLKAGKLYIDYDYTPVPPLESLTLRQRITDKYLVNLAESVNS 392 (392) T ss_pred HHHHHHHHHHhcCcc--cceEEEEecCCCCHHHhhCCeEEEEEEEEecCCcceEEEEEEEchHHHHHHHHHhcC Confidence 999999999998854 55666655 679999999999999999999999999999999999999999999999 No 9 >protein:vir:6079 Length: 396 # NCBI annotation: gpFI # Family: family:all:115 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878220;genbank:gi:33438919;genbank:GeneID:1457754 Probab=100.00 E-value=6.2e-108 Score=608.32 Aligned_cols=388 Identities=45% Similarity=0.748 Sum_probs=362.4 Q ss_pred CcccCCCeEEEEcCCCcccccccccceeEEEEeecccccccccccceEEeecchhhhhhcccccchhhhhhhhhcccCce Q lcl|Aclame:pro 4 LDTYLHGVEVVEVNAGGVTISTAATSVIGVVCTGDQADAETFPLNTPVLITNPLNYLEKAGSTGTLRRTLNSIGSIVKTP 83 (393) Q Consensus 4 ~~~~~~GV~v~ev~~~~~~i~~v~tav~g~vg~a~~~d~~~~p~~~~vl~t~~~~~~~~~g~~~tl~~~~~~~~~~~~~~ 83 (393) |++|+|||||+|++++++|+..++|++++|+|++++.++..+|+++|++++++.++...++..++|..++..++++++.. T Consensus 1 m~~~~~Gv~v~e~~~~~~~v~~~~tav~~fvGta~~~~~~~~p~~~p~~v~s~~~~~~~~g~~~tl~~a~~~~~~~gg~~ 80 (396) T protein:vir:60 1 MSDYHHGVQVLEINEGTRVISTVSTAIVGMVCTASDADAEIFPLNKPVLITNVQSAIAKAGKKGTLAASLQAIADQSKPV 80 (396) T ss_pred CCCCCCCeEEEEcCCCcccccccCceeEEEEecccccccccccCccCeEeechHHHHHhhcCcchhHHHHHHHhhccCce Confidence 55688999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEEecccccccc------cccchhc-ccccccccchhhhhhhhhhhhhccccccccccchHHHHHHHHHhhcccceEE Q lcl|Aclame:pro 84 TVIVRVAESDDSDT------LTANIVG-TQENGKFTGIKALLTAQSTVFVKPKLLCVPQHDNQAVATELLSVAKKLNAFA 156 (393) Q Consensus 84 ~~vv~~~~~~~~~~------~~~~~~~-~~~~~~~~gl~al~~~~~~~~~~~~~l~apg~s~~~v~~al~~~a~~~~~~~ 156 (393) +++++......... +...+.+ ...++..+|+++++..+..++..|.++.+||+++..|+++|.++|+++++++ T Consensus 81 ~~vv~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~tg~~al~~~~~~~~~~~~il~ap~~~~~~v~~al~~~~~~~~~~~ 160 (396) T protein:vir:60 81 TVVVRVEDGTGEDEETKLAQTVSNIIGTTDENGQYTGLKALLAAESVTGVKPRILGVPGLDTKEVAVALASVCQKLRAFG 160 (396) T ss_pred EEEEecccccccccccccccccccccccccccccccchhhhhhcccceeeeeeeccccccccHHHHHHHHHHhccCCeEE Confidence 99998765544333 2223333 3345677899999999999999999999999999999999999999999999 Q ss_pred EEecCCCCcchhhhhhhcccccceEEEeccceeEeeccCCceEEechhHHHHHHHHhhhccCCceecCCCceecceeece Q lcl|Aclame:pro 157 FISDNGATTKEQAYTYRQNFSQREGMMIFGDWKSYNTDKKAYDTDYAVARACALQAYIDKTVGWHKNISNVELDGVTGIT 236 (393) Q Consensus 157 ~i~~~~~~~~~~a~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~S~~~ag~~a~~D~~~G~~~spaN~~l~gv~~~~ 236 (393) +++.+++.+.+++++++++++|.++++||||++.+++.++..+.+|||+++||++|++|.++|+|+||||++|+||.+++ T Consensus 161 i~d~p~~~~~~~a~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~AG~~a~~d~~~g~~~spaN~~l~gi~~~~ 240 (396) T protein:vir:60 161 YISAWGCKTISEVKAYRQNFSQRELMVIWPDFLAWDTVASTTATAYATARALGLRAKIDQEQGWHKTLSNVGVNGVTGIS 240 (396) T ss_pred EEeCCCCCCHHHHHHHHhhcCCceEEEEeCceeeecccCCceeEEchhHHHHHHHHHhhhccCcEeCcCCceecceeece Confidence 88877788889999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eecccccCCCchhhhhhcccceEEEEeCCCEEEEecccCCCCcccceeehhhHHHHHHHHHHHHhHHhhcccCCHHHHHH Q lcl|Aclame:pro 237 KAVEFDINESSTEANYLNEKGITICLNHNGFRYWGSRTLATDTRWAFQQSVRTAQIIKETIGAGLAWAVDMPLTPLRVKT 316 (393) Q Consensus 237 ~~~~~~~~~~~~~~~~ln~~gi~~~~~~~G~~~wG~rT~~~d~~~~~i~~rR~~~~i~~~i~~~~~~~v~epn~~~~~~~ 316 (393) .++++..+++++|+++||++||+++|+++|+++||+||+++||+|+||++||++++|+++|++.++|+|||||++.+|++ T Consensus 241 ~~~~~~~~~~~~~~~~Ln~~gI~~~~~~~G~~~wG~rT~~~d~~~~~i~~rR~~~~i~~~i~~~~~~~v~e~n~~~~~~~ 320 (396) T protein:vir:60 241 ASVFWDLQESGTDADLLNESGVTTLIRRDGFRFWGNRTCSDDPLFLFENYTRTAQVLADTMAEAHMWAVDKPITATLIRD 320 (396) T ss_pred eecccccCCCcchhhhhhhcCcEEEEcCCCEEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHhhcccccccceEEEec-CCCCHHHhhCCEEEEEEEEEecCcceeEEEEEEEcchHHHHHHHHHhcC Q lcl|Aclame:pro 317 MLEAINNKLRSWASGDDPRILGARVWVA-EEITADIIKSGKFVIKYDYHWIPSLESLGLEQRVNDEYVVDLVNTLKAL 393 (393) Q Consensus 317 i~~~i~~~l~~l~~~g~~~~~~~~v~~~-~~nt~~~i~~G~~~~~v~~~p~~p~e~i~~~~~~~~~~~~~~~~~~~~~ 393 (393) |++++++||++||++|. +.|++++|+ ++||++++++|+|+++|+++|++|+|||+|+++++++||+++|++|+|= T Consensus 321 i~~~i~~~l~~l~~~ga--l~g~~~~~d~~~nt~~~i~~G~~~~~i~~~p~~pae~I~~~~~~~~~~~~~~~~~~~~~ 396 (396) T protein:vir:60 321 IVDGINAKFRELKTNGY--IVDATCWFSEESNDAETLKAGKLYIDYDYTPVPPLENLTLRQRITDKYLANLVTSVNSN 396 (396) T ss_pred HHHHHHHHHHHHHhCCc--eeceEEEEecCCCCHHHhhCCEEEEEEEEEecCCcceEEEEEEEchHHHHHHHHHhhcC Confidence 99999999999999885 456677665 6799999999999999999999999999999999999999999999999 No 10 >protein:vir:2035 Length: 396 # NCBI annotation: gpFI # Family: family:all:115 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046778;genbank:gi:9630349;genbank:GeneID:1261516 Probab=100.00 E-value=5.7e-108 Score=608.55 Aligned_cols=388 Identities=44% Similarity=0.750 Sum_probs=361.6 Q ss_pred CcccCCCeEEEEcCCCcccccccccceeEEEEeecccccccccccceEEeecchhhhhhcccccchhhhhhhhhcccCce Q lcl|Aclame:pro 4 LDTYLHGVEVVEVNAGGVTISTAATSVIGVVCTGDQADAETFPLNTPVLITNPLNYLEKAGSTGTLRRTLNSIGSIVKTP 83 (393) Q Consensus 4 ~~~~~~GV~v~ev~~~~~~i~~v~tav~g~vg~a~~~d~~~~p~~~~vl~t~~~~~~~~~g~~~tl~~~~~~~~~~~~~~ 83 (393) |++|+|||||+|++++++|+.+++|++++|+|++++++...+|+++|+++++..++...++...+|..++..++++++.. T Consensus 1 m~~~~~GV~v~e~~~g~~~i~~v~tav~~~vg~a~~a~~~~~~l~~pvlvts~~~~~~~~g~~~tL~~al~~~~~ngg~~ 80 (396) T protein:vir:20 1 MSDYHHGVQVLEINEGTRVISTVSTAIVGMVCTASDADAETFPLNKPVLITNVQSAISKAGKKGTLAASLQAIADQSKPV 80 (396) T ss_pred CCCCCCCeEEEEcCCCcceeeecCCceeEEEeeeccCCCccccCccCEEeechHHHHhhcccccchhhhhhhhhccCcee Confidence 55688999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEEecccccccc------cccchhc-ccccccccchhhhhhhhhhhhhccccccccccchHHHHHHHHHhhcccceEE Q lcl|Aclame:pro 84 TVIVRVAESDDSDT------LTANIVG-TQENGKFTGIKALLTAQSTVFVKPKLLCVPQHDNQAVATELLSVAKKLNAFA 156 (393) Q Consensus 84 ~~vv~~~~~~~~~~------~~~~~~~-~~~~~~~~gl~al~~~~~~~~~~~~~l~apg~s~~~v~~al~~~a~~~~~~~ 156 (393) +++++......... +.....+ ...++..+++++++.++...+..|.++.+|++++++|+.+|.++|+++++++ T Consensus 81 ~~v~~~~~~~~~~~~~~~a~t~~~~~~~~~~~~~~tg~~al~~~~~~~~~~p~i~~ap~~~~~~v~~al~~~~~~~~~~~ 160 (396) T protein:vir:20 81 TVVMRVEDGTGDDEETKLAQTVSNIIGTTDENGQYTGLKAMLAAESVTGVKPRILGVPGLDTKEVAVALASVCQKLRAFG 160 (396) T ss_pred EEEEeccccccccccccccccccccccccccccccchhhhhhhhccccccchhhhhhhhhccHHHHHHHHHHHhcCCcEE Confidence 99888755443332 2222222 3346778999999999999999999999999999999999999999999999 Q ss_pred EEecCCCCcchhhhhhhcccccceEEEeccceeEeeccCCceEEechhHHHHHHHHhhhccCCceecCCCceecceeece Q lcl|Aclame:pro 157 FISDNGATTKEQAYTYRQNFSQREGMMIFGDWKSYNTDKKAYDTDYAVARACALQAYIDKTVGWHKNISNVELDGVTGIT 236 (393) Q Consensus 157 ~i~~~~~~~~~~a~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~S~~~ag~~a~~D~~~G~~~spaN~~l~gv~~~~ 236 (393) +++++.+.+.+++++|+++++|.++++||||++++++.++..+++|||+++||++|++|.++|+|+||||++|+||.+++ T Consensus 161 ~iD~p~~~~~~~a~~~r~~~~s~~~~~~~P~~~~~d~~~~~~~~~p~s~~~Ag~~a~~d~~~g~~~spaN~~l~gi~~~~ 240 (396) T protein:vir:20 161 YISAWGCKTISEVKAYRQNFSQRELMVIWPDFLAWDTVTSTTATAYATARALGLRAKIDQEQGWHKTLSNVGVNGVTGIS 240 (396) T ss_pred EEecCCCCCHHHHHHHhhCCCCceEEEEcCccccccCcCCcceeechhHHHHHHHHHhhhhcCcEeccCCceeccceecc Confidence 88777778889999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eecccccCCCchhhhhhcccceEEEEeCCCEEEEecccCCCCcccceeehhhHHHHHHHHHHHHhHHhhcccCCHHHHHH Q lcl|Aclame:pro 237 KAVEFDINESSTEANYLNEKGITICLNHNGFRYWGSRTLATDTRWAFQQSVRTAQIIKETIGAGLAWAVDMPLTPLRVKT 316 (393) Q Consensus 237 ~~~~~~~~~~~~~~~~ln~~gi~~~~~~~G~~~wG~rT~~~d~~~~~i~~rR~~~~i~~~i~~~~~~~v~epn~~~~~~~ 316 (393) +++.+...++++|+++||++||+++++++||++||+||+++|++|+||++||++++|+++|++.++|+|||||++.+|++ T Consensus 241 ~~~~~~~~~~~~~~~~Ln~~gi~~~~~~~G~~~wG~rT~s~d~~~~~i~~rR~~~~i~~~~~~~~~~~v~e~~~~~~~~~ 320 (396) T protein:vir:20 241 ASVFWDLQESGTDADLLNESGVTTLIRRDGFRFWGNRTCSDDPLFLFENYTRTAQVVADTMAEAHMWAVDKPITATLIRD 320 (396) T ss_pred eecccccCCCcchhhhhhhcCcEEEEcCCCEEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHhhcccccccceEEEec-CCCCHHHhhCCEEEEEEEEEecCcceeEEEEEEEcchHHHHHHHHHhcC Q lcl|Aclame:pro 317 MLEAINNKLRSWASGDDPRILGARVWVA-EEITADIIKSGKFVIKYDYHWIPSLESLGLEQRVNDEYVVDLVNTLKAL 393 (393) Q Consensus 317 i~~~i~~~l~~l~~~g~~~~~~~~v~~~-~~nt~~~i~~G~~~~~v~~~p~~p~e~i~~~~~~~~~~~~~~~~~~~~~ 393 (393) |++++++||++||++|. +.|++++|+ ++||++++++|+|+++|+++|++|+|||+|+++++++||++++++|+|= T Consensus 321 i~~~i~~~L~~l~~~G~--l~g~~v~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~~~~~~~~ 396 (396) T protein:vir:20 321 IVDGINAKFRELKTNGY--IVDATCWFSEESNDAETLKAGKLYIDYDYTPVPPLENLTLRQRITDKYLANLVTSVNSN 396 (396) T ss_pred HHHHHHHHHHHHHhCcc--eeceEEEEecCCCCHHHhhCCEEEEEEEEEecCCcceEEEEEEEchHHHHHHHHHhhcC Confidence 99999999999999885 455667665 6799999999999999999999999999999999999999999999999 No 11 >protein:vir:5711 Length: 396 # NCBI annotation: gpFI # Family: family:all:115 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839870;genbank:gi:30065725;genbank:GeneID:1260618 Probab=100.00 E-value=9.5e-108 Score=607.31 Aligned_cols=388 Identities=43% Similarity=0.734 Sum_probs=362.1 Q ss_pred CcccCCCeEEEEcCCCcccccccccceeEEEEeecccccccccccceEEeecchhhhhhcccccchhhhhhhhhcccCce Q lcl|Aclame:pro 4 LDTYLHGVEVVEVNAGGVTISTAATSVIGVVCTGDQADAETFPLNTPVLITNPLNYLEKAGSTGTLRRTLNSIGSIVKTP 83 (393) Q Consensus 4 ~~~~~~GV~v~ev~~~~~~i~~v~tav~g~vg~a~~~d~~~~p~~~~vl~t~~~~~~~~~g~~~tl~~~~~~~~~~~~~~ 83 (393) |++|+|||||+|++++++++.++++++++|+|++++.++..+|+++|++++++.++...++..++|..++..++++++.. T Consensus 1 m~~~~~GV~v~e~~~g~~~i~~v~tav~~~vg~a~~~d~~~~~~~~pv~i~s~~~~~~~~g~~~tl~~al~~~~~~~~~~ 80 (396) T protein:vir:57 1 MSDYHHGVQVLEINDGTRVISTVSTAIVGMVCTASDADAETFPLNKPVLITNVQSAIAKAGKKGTLAASLQAIADQSKPV 80 (396) T ss_pred CCCCCCceEEEEcCCCcccccccCCceEEEEEeccCCCcccccCccCeEeecchhhhhhcccccchHHHHHHhhhcCCce Confidence 66789999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEEecccccc------cccccchhc-ccccccccchhhhhhhhhhhhhccccccccccchHHHHHHHHHhhcccceEE Q lcl|Aclame:pro 84 TVIVRVAESDDS------DTLTANIVG-TQENGKFTGIKALLTAQSTVFVKPKLLCVPQHDNQAVATELLSVAKKLNAFA 156 (393) Q Consensus 84 ~~vv~~~~~~~~------~~~~~~~~~-~~~~~~~~gl~al~~~~~~~~~~~~~l~apg~s~~~v~~al~~~a~~~~~~~ 156 (393) +++++....... ..+...+.+ ...++.++|+++++.++..+++.|.++.+|++++.+++++|.++|+++++++ T Consensus 81 ~~vv~~~~~~~~~~~~~~a~t~~~iiG~~~~~~~~tgl~al~~~~~~~~~~p~i~~ap~~~~~~v~~al~~~~~~~~~~~ 160 (396) T protein:vir:57 81 TVVVRVEDGTGDDEETKLAQTVSNIIGTTDENGQYTGLKALMGAESVTGVKPRILGVPGLDTKEVAVALASVCQELNAFG 160 (396) T ss_pred eEeeeccccccccccccccccceeeeeeccccccchhhhhhhhcccceeEEeccccCcccchhHHHHHHHHHhhhCceEE Confidence 998876554332 223333333 3346788999999999999999999999999999999999999999999998 Q ss_pred EEecCCCCcchhhhhhhcccccceEEEeccceeEeeccCCceEEechhHHHHHHHHhhhccCCceecCCCceecceeece Q lcl|Aclame:pro 157 FISDNGATTKEQAYTYRQNFSQREGMMIFGDWKSYNTDKKAYDTDYAVARACALQAYIDKTVGWHKNISNVELDGVTGIT 236 (393) Q Consensus 157 ~i~~~~~~~~~~a~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~S~~~ag~~a~~D~~~G~~~spaN~~l~gv~~~~ 236 (393) +++.+++.+.+++++|+++++|.++++||||++.+++.++..+.+|||+++||++||+|.++|+||||||++|.||.+++ T Consensus 161 ~~d~p~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~Ag~~a~~d~~~g~~~spaN~~l~gi~~~~ 240 (396) T protein:vir:57 161 YISAWGCKTISEVKAYRQNFSQRELMVIWPDFLAWDTVTSTTATAYATARALGLRAKIDQEQGWHKTLSNVGVNGVTGIS 240 (396) T ss_pred EEcCCCCCCHHHHHHHHhccCCceEEEEcceeeeecccCCceeEEehhHHHHHHHHHhhhccCcEeccCCceeccccccc Confidence 88777788889999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eecccccCCCchhhhhhcccceEEEEeCCCEEEEecccCCCCcccceeehhhHHHHHHHHHHHHhHHhhcccCCHHHHHH Q lcl|Aclame:pro 237 KAVEFDINESSTEANYLNEKGITICLNHNGFRYWGSRTLATDTRWAFQQSVRTAQIIKETIGAGLAWAVDMPLTPLRVKT 316 (393) Q Consensus 237 ~~~~~~~~~~~~~~~~ln~~gi~~~~~~~G~~~wG~rT~~~d~~~~~i~~rR~~~~i~~~i~~~~~~~v~epn~~~~~~~ 316 (393) +.++++.+++++|+++||++||+++++++||++||+||+++||+|+||++||++++|+++|+++++|++||||++.+|++ T Consensus 241 ~~~~~~~~~~~~~~~~Ln~~gi~t~~~~~G~~~wG~rT~~~d~~~~~i~vrR~~~~i~~~i~~~~~~~v~e~n~~~~~~~ 320 (396) T protein:vir:57 241 ASVFWDLQKPGTDADLLNEAGVTTLVRRDGFRFWGNRTCSDDPLFLFESYTRTAQVLADTMAEAHMWAIDKPITATLIRD 320 (396) T ss_pred eecccccCCcchhhhhhhhcCcEEEEcCCCEEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHhhcccccccceEEEec-CCCCHHHhhCCEEEEEEEEEecCcceeEEEEEEEcchHHHHHHHHHhcC Q lcl|Aclame:pro 317 MLEAINNKLRSWASGDDPRILGARVWVA-EEITADIIKSGKFVIKYDYHWIPSLESLGLEQRVNDEYVVDLVNTLKAL 393 (393) Q Consensus 317 i~~~i~~~l~~l~~~g~~~~~~~~v~~~-~~nt~~~i~~G~~~~~v~~~p~~p~e~i~~~~~~~~~~~~~~~~~~~~~ 393 (393) |+++++.||++||++|+ +.|++++|+ ++||++++++|+|+++|+++|++|+|||+|+++++++|+++++++|+|- T Consensus 321 i~~~i~~~l~~l~~~ga--l~g~~v~~d~~~n~~~~i~~G~~~~~v~~~p~~p~e~I~~~~~~~~~~~~~~~~~~~~~ 396 (396) T protein:vir:57 321 IIDGINAKFRELKNNGY--IVDGTCWFSEESNDAETLKAGKLYIDYDYTPVPPLENLTLRQRITSRYLASLVTSVNSN 396 (396) T ss_pred HHHHHHHHHHHHHhCCc--eeceEEEEecCCCCHHHhhCCeEEEEEEEEecCCcceEEEEEEEchHHHHHHHHHhhcC Confidence 99999999999999885 456777666 5699999999999999999999999999999999999999999999999 No 12 >protein:vir:10336 Length: 386 # NCBI annotation: ORF39 # Family: family:all:115 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758931;genbank:gi:27311206;genbank:GeneID:956116 Probab=100.00 E-value=6.6e-105 Score=591.74 Aligned_cols=382 Identities=35% Similarity=0.559 Sum_probs=360.5 Q ss_pred CCcccCCCeEEEEcCCCcccccccccceeEEEEeecccccccccccceEEeecchhhhhhcccccchhhhhhhhhcccCc Q lcl|Aclame:pro 3 ILDTYLHGVEVVEVNAGGVTISTAATSVIGVVCTGDQADAETFPLNTPVLITNPLNYLEKAGSTGTLRRTLNSIGSIVKT 82 (393) Q Consensus 3 m~~~~~~GV~v~ev~~~~~~i~~v~tav~g~vg~a~~~d~~~~p~~~~vl~t~~~~~~~~~g~~~tl~~~~~~~~~~~~~ 82 (393) |+++|+|||||+|++++++|+.+++|++++|+|+++++++..+|+++|+++++..++...++..+++..++..++.+++. T Consensus 1 M~~~~~~Gv~v~ev~~~~~~i~~v~tav~~~vg~a~~a~~~~~~~~~pv~i~s~~~~~~~~g~~~tl~~a~~~~~~~gg~ 80 (386) T protein:vir:10 1 MAEQYLHGAEVVEIDNGARPIRTAQSGVIGLVGTAPDADATAFPLNTPVLIAGSRREAAKLGAGGTLPQAIDGIFDQTGA 80 (386) T ss_pred CccccCCCeEEEEcCCCcccccccCcceeEEEEecCCCCCcccccccceEecchHHHHhhcCCCcchhHHHHHHhccCce Confidence 77899999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eEEEEEecccccccccccchhcccc--cccccchhhhhhhhhhhhhccccccccccch-HHHHHHHHHhhcccceEEEEe Q lcl|Aclame:pro 83 PTVIVRVAESDDSDTLTANIVGTQE--NGKFTGIKALLTAQSTVFVKPKLLCVPQHDN-QAVATELLSVAKKLNAFAFIS 159 (393) Q Consensus 83 ~~~vv~~~~~~~~~~~~~~~~~~~~--~~~~~gl~al~~~~~~~~~~~~~l~apg~s~-~~v~~al~~~a~~~~~~~~i~ 159 (393) .+++++.....+...+.....+..+ +...+|+.++...+..++..|.++.+|++++ .+|.++|.++|+++..++.. T Consensus 81 ~~~vv~~~~~~~~~~t~~~~ig~~~~~t~~~tgl~~l~~~~~~~~~~p~i~~ap~~~~~~~v~~~l~~~~~~~~~~~~~- 159 (386) T protein:vir:10 81 VVVVIRVDEGVDSAATQSNVIGKVDADTEQYTGILALLSAENTVKVQPRILIAPGFSNQKAVADQLVSVADTAAWLCHS- 159 (386) T ss_pred eEEEeeccccccccccchhhhcccccccchhhhhHHhhhhcccccccccccccccccchhHHHHHHHHhhcceEEEEEe- Confidence 9999999888888877777666443 5678899999999999999999999999975 57899999999999988754 Q ss_pred cCCCCcchhhhhhhcccccceEEEeccceeEeeccCCceEEechhHHHHHHHHhhhccCCceecCCCceecceeeceeec Q lcl|Aclame:pro 160 DNGATTKEQAYTYRQNFSQREGMMIFGDWKSYNTDKKAYDTDYAVARACALQAYIDKTVGWHKNISNVELDGVTGITKAV 239 (393) Q Consensus 160 ~~~~~~~~~a~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~S~~~ag~~a~~D~~~G~~~spaN~~l~gv~~~~~~~ 239 (393) ++++.+.+++.+++..++|.++++||||++++++..+..+.+|||+++||++|++|.++||||||||++|.||.++++++ T Consensus 160 ~~~~~~~~~a~~~~~~~~s~~~~~~~p~~~v~~~~~~~~~~~p~s~~~ag~~a~~D~~~G~~~spaN~~l~gv~~~~~~~ 239 (386) T protein:vir:10 160 GWSNTTDAAAITYRELFGSRRCEVVDPWYKVWDVETSAHIIQPPSARHAGVMAKVHNTLGFWWSNSNQEILGIDGLCRPV 239 (386) T ss_pred CCCCCchHHHHHhhhcccccceEEecCceeeeccccccceeechHHHHHHHHHHhhhcCCcEEccCCceeecccccceec Confidence 67788889999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccCCCchhhhhhcccceEEEEeCCCEEEEecccCCCCcccceeehhhHHHHHHHHHHHHhHHhhcccCCHHHHHHHHH Q lcl|Aclame:pro 240 EFDINESSTEANYLNEKGITICLNHNGFRYWGSRTLATDTRWAFQQSVRTAQIIKETIGAGLAWAVDMPLTPLRVKTMLE 319 (393) Q Consensus 240 ~~~~~~~~~~~~~ln~~gi~~~~~~~G~~~wG~rT~~~d~~~~~i~~rR~~~~i~~~i~~~~~~~v~epn~~~~~~~i~~ 319 (393) +++.+++++|+++||++||+++++++|+++||+||+++||+|+||++|||+++|+++|+++++|+|||||++.+|++|++ T Consensus 240 ~~~~~~~~~~~~~l~~~gi~~~~~~~G~~~wG~rT~~~d~~~~~i~vrR~~~~i~~~~~~~~~~~v~e~~~~~~~~~i~~ 319 (386) T protein:vir:10 240 DFKLDDPTCRANLLNAKEVTTTIQQNGFRVWGDRTCSADSKWAFKNVVITNDMIADSLVRNHLWAVDRNITKTYVEDVTE 319 (386) T ss_pred ccccccCcchhhhhhhcCcEEEEcCCCEEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhhcccccccceEEEec-CCCCHHHhhCCEEEEEEEEEecCcceeEEEEEEEcchHHHHHH Q lcl|Aclame:pro 320 AINNKLRSWASGDDPRILGARVWVA-EEITADIIKSGKFVIKYDYHWIPSLESLGLEQRVNDEYVVDLV 387 (393) Q Consensus 320 ~i~~~l~~l~~~g~~~~~~~~v~~~-~~nt~~~i~~G~~~~~v~~~p~~p~e~i~~~~~~~~~~~~~~~ 387 (393) ++++||++||++|. +.|++|+|+ ++||++++++|+|+++|+++|++|+|||+|+++++++|+++++ T Consensus 320 ~i~~~L~~l~~~g~--l~g~~v~~d~~~nt~~~~~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~~ 386 (386) T protein:vir:10 320 GVNNYLRHLKNIGA--IAGGECWVDPELNSPDQIQQGKVYFDYDFSAYAPAEHITFRSHMVNGYLTEVV 386 (386) T ss_pred HHHHHHHHHHhCCc--eeeeEEEEcccCCCHHHhhCCeEEEEEEEEecCCceeEEEEEEEehhHHHhhC Confidence 99999999999885 456777665 6799999999999999999999999999999999999999999 No 13 >protein:vir:96740 Length: 388 # NCBI annotation: phage tail sheath protein FI-like # Family: family:all:115 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039839;genbank:gi:126010913;genbank:GeneID:5076250 Probab=100.00 E-value=5.2e-100 Score=564.89 Aligned_cols=374 Identities=21% Similarity=0.264 Sum_probs=335.9 Q ss_pred CCCCcccCCCeEEEEcCCCcccccccccceeEEEEeecccccccccccceEEeecchhhhhhcc---cccchhhhhhhhh Q lcl|Aclame:pro 1 MSILDTYLHGVEVVEVNAGGVTISTAATSVIGVVCTGDQADAETFPLNTPVLITNPLNYLEKAG---STGTLRRTLNSIG 77 (393) Q Consensus 1 m~m~~~~~~GV~v~ev~~~~~~i~~v~tav~g~vg~a~~~d~~~~p~~~~vl~t~~~~~~~~~g---~~~tl~~~~~~~~ 77 (393) ||.|++|+|||||+|++++++||.++++++++++|+++++++. +|+++|+++.+..++....+ ..+++..++..++ T Consensus 1 m~~~~~~~hGv~v~ev~~g~~~i~~~~tavi~~Vgta~~ad~~-~p~~~~~~i~~~~d~~~~~~~~~~~gtl~~al~~~~ 79 (388) T protein:vir:96 1 MPVIDQFEHNGISIETHEPPPPMGPPGDNVVAWVVTAPDKHAD-VAFSVPFRVANTADAQYLDSTGNELGTGWHAASETL 79 (388) T ss_pred CCCCCCCCCceEEEEcCCCcccccccCcceeEEEEecCCCccc-cccccceeeecchhhhhhhccccccccchhhhHhhh Confidence 9999999999999999999999999999999999999999876 68899999988888766654 4588999999999 Q ss_pred cccCceEEEEEecccccccccccchhcccc--cccccchhhhhhhhhhhhhccccccccccch-HHHHHHHHHhhcccce Q lcl|Aclame:pro 78 SIVKTPTVIVRVAESDDSDTLTANIVGTQE--NGKFTGIKALLTAQSTVFVKPKLLCVPQHDN-QAVATELLSVAKKLNA 154 (393) Q Consensus 78 ~~~~~~~~vv~~~~~~~~~~~~~~~~~~~~--~~~~~gl~al~~~~~~~~~~~~~l~apg~s~-~~v~~al~~~a~~~~~ 154 (393) .+++..++++++...++...+..++.+..+ ++.++|+++++.. ...|.++++||+++ ++|+++|.++|+++++ T Consensus 80 ~~~~~~~~vv~v~~g~~~~at~a~iig~~~~~tg~~~gl~al~~~----~~~p~il~aPg~s~~~~v~~al~~~~~~~~~ 155 (388) T protein:vir:96 80 KKTSVPQYFIVVPEGADDAATMANIIGGIDPTTGRRTGIAALTEC----TERPTLIGAPGFSQNKAVIDALASMAKRLKC 155 (388) T ss_pred ccCCceEEEEEeccccccccccceeeeecccccchhhHHHHhhhc----ccceeEEEeeccccchHHHHHHHHHHhhcCc Confidence 999999999999988888888887777554 4566777777654 45689999999975 5899999999999987 Q ss_pred EEEEecCCCCcchhhhhhh-----cccccceEEEeccceeEeeccCCceEEechhHHHHHHHHhhhccCCceecCCCcee Q lcl|Aclame:pro 155 FAFISDNGATTKEQAYTYR-----QNFSQREGMMIFGDWKSYNTDKKAYDTDYAVARACALQAYIDKTVGWHKNISNVEL 229 (393) Q Consensus 155 ~~~i~~~~~~~~~~a~~~~-----~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~S~~~ag~~a~~D~~~G~~~spaN~~l 229 (393) ++++ |.|..+.+++.+++ .+++|.++++||||++++++..+..+.+|||+++||++|++| +||||||+++ T Consensus 156 ~~i~-D~p~~~~~~~~~~~~~~~~~~~~s~~~~~~~P~~~~~d~~~~~~~~~p~s~~~AG~~a~~D----~~~spaN~~i 230 (388) T protein:vir:96 156 RAVI-DGPSGSTQDAIDLSGLLGGEGTGHDRVYMVDPMPAIYSRKAQGNIYVPPSTIAMGAVAAVK----PWESPGNQGV 230 (388) T ss_pred EEEE-eccCCchhHHHHHHhhhhccCcCcceEEEEeCceeeecccCCceeeechHHHHHHHHHhhc----CcccccCeeE Confidence 7655 55666666666554 367899999999999999999999999999999999999999 5999999999 Q ss_pred cceeeceeecccccCCCchhhhhhcccceEEEEe--CCCEEEEecccCCCCcccceeehhhHHHHHHHHHHHHhHHhhcc Q lcl|Aclame:pro 230 DGVTGITKAVEFDINESSTEANYLNEKGITICLN--HNGFRYWGSRTLATDTRWAFQQSVRTAQIIKETIGAGLAWAVDM 307 (393) Q Consensus 230 ~gv~~~~~~~~~~~~~~~~~~~~ln~~gi~~~~~--~~G~~~wG~rT~~~d~~~~~i~~rR~~~~i~~~i~~~~~~~v~e 307 (393) ++.|+++.++++++++++|+++||++||+++++ ++|+++||+||++ |+||++||++++|+++|+++++|+||| T Consensus 231 -~i~g~~~~~~~~~~~~~~~~~~Ln~~gI~~i~~~~~~G~~~wG~rT~~----~~~i~vrR~~~~i~~si~~~~~~~v~e 305 (388) T protein:vir:96 231 -LIQDVARVIDYNILDKSTEGDLLNRNGVSYFARTSMGGFSLIGNRTVT----GKFISFVGLEDAIARKLEAASQRAMSK 305 (388) T ss_pred -EeeeecccccccccCChhhHHhhhhcCceEEEEecCCcEEEEcccccC----CcceeehhhHHHHHHHHHHHHHHhccC Confidence 599999999999999999999999999999965 6899999999986 999999999999999999999999999 Q ss_pred cCCHHHHHHHHHHHHHHHHHHhhcccccccceEEEec-CCCCHHHhhCCEEEEEEEEEecCcceeEEEEEEEcchHHHHH Q lcl|Aclame:pro 308 PLTPLRVKTMLEAINNKLRSWASGDDPRILGARVWVA-EEITADIIKSGKFVIKYDYHWIPSLESLGLEQRVNDEYVVDL 386 (393) Q Consensus 308 pn~~~~~~~i~~~i~~~l~~l~~~g~~~~~~~~v~~~-~~nt~~~i~~G~~~~~v~~~p~~p~e~i~~~~~~~~~~~~~~ 386 (393) ||++.+|++|+++++.||++||++|++ .|++++++ ++||+++|++|+|+++|+++|++|+|||+|+++++++|+++| T Consensus 306 pn~~~~~~~i~~~i~~fL~~l~~~Gal--~g~~~~~d~~~nt~~~i~~G~~~~~i~~~p~~pae~I~~~~~~~~~~~~~~ 383 (388) T protein:vir:96 306 QLTKSFMEQEIKKINLFMQDLVAAEII--PGGEVYLHPTLNTVERYKNGSWYIVIDYGRYSPNEHMIFHLNAVDRIVEEF 383 (388) T ss_pred CCCHHHHHHHHHHHHHHHHHHHhCCce--eeeEEEEecCCCCHHHhhCCEEEEEEEEEecCCcceEEEEEEEchHHHHHH Confidence 999999999999999999999998864 45666555 679999999999999999999999999999999999999999 Q ss_pred HHHHh Q lcl|Aclame:pro 387 VNTLK 391 (393) Q Consensus 387 ~~~~~ 391 (393) |++|. T Consensus 384 ~~~~~ 388 (388) T protein:vir:96 384 IEEVL 388 (388) T ss_pred HHHhC Confidence 99999 No 14 >protein:vir:107865 Length: 477 # NCBI annotation: gp39 # Family: family:all:115 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024712;genbank:gi:48696949;genbank:GeneID:2845953 Probab=100.00 E-value=2.4e-99 Score=561.26 Aligned_cols=381 Identities=28% Similarity=0.409 Sum_probs=336.2 Q ss_pred CCcccCCCeEEEEcCCCcccccccccceeEEEEeecccccccccccceEEeecchhhhhhcc--cccchhhhhhhhhccc Q lcl|Aclame:pro 3 ILDTYLHGVEVVEVNAGGVTISTAATSVIGVVCTGDQADAETFPLNTPVLITNPLNYLEKAG--STGTLRRTLNSIGSIV 80 (393) Q Consensus 3 m~~~~~~GV~v~ev~~~~~~i~~v~tav~g~vg~a~~~d~~~~p~~~~vl~t~~~~~~~~~g--~~~tl~~~~~~~~~~~ 80 (393) |+++|+||||++|+++++++|..++|++++|+|+++.. |+|+|++++|+.++....+ ..++|..++..+|.++ T Consensus 1 M~~~~~pGVyv~E~~~~~~~i~~v~T~v~~~VG~a~~g-----p~n~pv~its~~d~~~~g~~~~~~tL~~Av~~~f~nG 75 (477) T protein:vir:10 1 MAANYLHGVETIEKETGSRPVKVVKSAVIGLIGTAPIG-----PVNTPVQSLSDVDAAQFGPQLAGFTIPQALDAVYDYG 75 (477) T ss_pred CcccCCCCeEEEEccCCcccccccCCceeEEEecccCC-----CCCcCEEEccHHHHHHhccCCCCCcHHHHHHHHHhcc Confidence 77889999999999999999999999999999998865 8899999999999976443 4589999999999999 Q ss_pred CceEEEEEeccccccccccc------------------------------------------------------------ Q lcl|Aclame:pro 81 KTPTVIVRVAESDDSDTLTA------------------------------------------------------------ 100 (393) Q Consensus 81 ~~~~~vv~~~~~~~~~~~~~------------------------------------------------------------ 100 (393) +..++++++........+.. T Consensus 76 g~~~~vVrV~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 155 (477) T protein:vir:10 76 SGTVIVINVLDPAVHKSNAANEPVTFDAATGRAKLAHPAAANLVLKNDSGGTTYAEGTDYAVDLINGVITRIKTGTIPPG 155 (477) T ss_pred ceEEEEEecCccccccccccccccccccccceecccccccccccccccccccccccchhhhhhhccccceeccccccccc Confidence 99999988754431111100 Q ss_pred -------------------chhc-ccccccccchhhhhhhhhhhhhccccccccccch-HHHHHHHHHhhcccceEEEEe Q lcl|Aclame:pro 101 -------------------NIVG-TQENGKFTGIKALLTAQSTVFVKPKLLCVPQHDN-QAVATELLSVAKKLNAFAFIS 159 (393) Q Consensus 101 -------------------~~~~-~~~~~~~~gl~al~~~~~~~~~~~~~l~apg~s~-~~v~~al~~~a~~~~~~~~i~ 159 (393) ...+ ...+..++|+++++.++..++..+.++.+||+++ .+|.++|.++|++++++++++ T Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~tGl~al~~~~~~~~~~~~~l~apg~~~~~~v~~~l~~~~~~~~~~~~~d 235 (477) T protein:vir:10 156 ATAAKATYDYADPTKVTAADIIGAVNAAGMRTGMKALKDTYNLYGYFSKILIAPAYCTQNSVSVELEAMAVQLGAIAYID 235 (477) T ss_pred ceeeeeccccccccccccccccccccccchhhhhhhhhhhhhhcchhcccccccccccchhhHHHHHHHHhhCCEEEEEe Confidence 0000 0123346789999999999999999999999976 569999999999999988887 Q ss_pred cCCCCcchhhhhhhc-------ccccceEEEeccceeEeeccCCceEEechhHHHHHHHHhhhccCCceecCCCceecce Q lcl|Aclame:pro 160 DNGATTKEQAYTYRQ-------NFSQREGMMIFGDWKSYNTDKKAYDTDYAVARACALQAYIDKTVGWHKNISNVELDGV 232 (393) Q Consensus 160 ~~~~~~~~~a~~~~~-------~~~s~~~~~~~p~~~~~~~~~~~~~~~p~S~~~ag~~a~~D~~~G~~~spaN~~l~gv 232 (393) .+++.+.++++++++ +++|.+++++|||++.+++..+..+.+|||+++||++||+|.++||||||||++|.|| T Consensus 236 ~p~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~ag~~a~~d~~~g~~~span~~~~gi 315 (477) T protein:vir:10 236 APIGTTLAQALAGRGPAGTINFNTSSDRVRLCYPHVKVYDTATNAERLEPLSSRAAGLRARVDLDKGYWWSSSNQQLVGV 315 (477) T ss_pred cCCCCCHHHHHhhhhhccccccccccceEEEEcCeEEEecccCCceeEEchHHHHHHHHHHhhhcCCceeccCCceeccc Confidence 766777788888876 5679999999999999999999999999999999999999999999999999999999 Q ss_pred eeceeecccccCCCchhhhhhcccceEEEEe--CCCEEEEecccCC---CCcccceeehhhHHHHHHHHHHHHhHHhhcc Q lcl|Aclame:pro 233 TGITKAVEFDINESSTEANYLNEKGITICLN--HNGFRYWGSRTLA---TDTRWAFQQSVRTAQIIKETIGAGLAWAVDM 307 (393) Q Consensus 233 ~~~~~~~~~~~~~~~~~~~~ln~~gi~~~~~--~~G~~~wG~rT~~---~d~~~~~i~~rR~~~~i~~~i~~~~~~~v~e 307 (393) .+++.++.++++++++|+++||++||+++++ ++||++||+||++ .|+.|+||++||++++|+++|++.++|+||| T Consensus 316 ~~~~~~~~~~~~~~~~~~~~L~~~gi~~i~~~~~~G~~~wG~rT~~~~~~~~~~~~~~vrR~~~~i~~~~~~~~~~~v~~ 395 (477) T protein:vir:10 316 TGVERPLSAMIDDPQSDVNMLNEQGITTVFSSYGSGLRLWGNRTAAWPTVTHMRNFENVRRTGDVINESLRYFSQQFVDA 395 (477) T ss_pred cccccccccccCCChhhHHHHhhCCceEEEEecCCcEEEEcccccCCCCCCcccceeehhhHHHHHHHHHHHHHHHhccC Confidence 9999999999999999999999999999964 6899999999994 4678999999999999999999999999999 Q ss_pred cCCHHHHHHHHHHHHHHHHHHhhcccccccceEEEec-CCCCHHHhhCCEEEEEEEEEecCcceeEEEEEEEcchHHHHH Q lcl|Aclame:pro 308 PLTPLRVKTMLEAINNKLRSWASGDDPRILGARVWVA-EEITADIIKSGKFVIKYDYHWIPSLESLGLEQRVNDEYVVDL 386 (393) Q Consensus 308 pn~~~~~~~i~~~i~~~l~~l~~~g~~~~~~~~v~~~-~~nt~~~i~~G~~~~~v~~~p~~p~e~i~~~~~~~~~~~~~~ 386 (393) ||++.+|++|++++++||++||++|++ .|++|+|+ ++||++||++|+|+++|+++|++|+|||+|+++++++||+++ T Consensus 396 ~~~~~~~~~i~~~i~~~l~~l~~~g~l--~g~~v~~~~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~ 473 (477) T protein:vir:10 396 PIDQGLIDSLVESVNGFGRKLIGDGAL--LGFKAWFDPARNPKEELAAGHLLINYKYTVPPPLERLTYETEITSEYLLTL 473 (477) T ss_pred CCCHHHHHHHHHHHHHHHHHHHhCCce--eeeEEEEecCCCCHHHhhCCeEEEEEEEEecCCcceEEEEEEEcchHHhhh Confidence 999999999999999999999998864 45666555 679999999999999999999999999999999999999988 Q ss_pred HHHH Q lcl|Aclame:pro 387 VNTL 390 (393) Q Consensus 387 ~~~~ 390 (393) +.-- T Consensus 474 ~~g~ 477 (477) T protein:vir:10 474 KGGN 477 (477) T ss_pred hcCC Confidence 8776 No 15 >protein:vir:79092 Length: 477 # NCBI annotation: gp13, phage tail sheath protein # Family: family:all:115 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111213;genbank:gi:134288804;genbank:GeneID:4960766 Probab=100.00 E-value=3.4e-99 Score=560.45 Aligned_cols=381 Identities=27% Similarity=0.405 Sum_probs=334.3 Q ss_pred CCcccCCCeEEEEcCCCcccccccccceeEEEEeecccccccccccceEEeecchhhhhhcc--cccchhhhhhhhhccc Q lcl|Aclame:pro 3 ILDTYLHGVEVVEVNAGGVTISTAATSVIGVVCTGDQADAETFPLNTPVLITNPLNYLEKAG--STGTLRRTLNSIGSIV 80 (393) Q Consensus 3 m~~~~~~GV~v~ev~~~~~~i~~v~tav~g~vg~a~~~d~~~~p~~~~vl~t~~~~~~~~~g--~~~tl~~~~~~~~~~~ 80 (393) |+++|+|||||+|++++++||.+++|++++|+|+++.. |+|+|++++++.++...++ ..++|..++..+|.++ T Consensus 1 M~~~~~pGVyv~E~~~g~~~I~~v~Tsv~~~VG~a~~~-----p~n~pv~its~~d~~~~g~~~~~~tL~~Av~~~f~ng 75 (477) T protein:vir:79 1 MAANYLHGVETIEKETGSRPVKVVKSAVIGLIGTAPIG-----PVNTPVQSLSDVDAAQFGPQLAGFTIPQALDAVYDYG 75 (477) T ss_pred CcCCCCCCeEEEEecCCcccccccCCceEEEEeecccC-----CCcccEEEccHHHHHHhcCCCCCCcHHHHHHHHhhcC Confidence 77899999999999999999999999999999999876 8899999999999987554 4589999999999999 Q ss_pred CceEEEEEecccccccccccc----------------------------------------------------------- Q lcl|Aclame:pro 81 KTPTVIVRVAESDDSDTLTAN----------------------------------------------------------- 101 (393) Q Consensus 81 ~~~~~vv~~~~~~~~~~~~~~----------------------------------------------------------- 101 (393) +..++++++............ T Consensus 76 g~~~~vvrV~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 155 (477) T protein:vir:79 76 SGTVIVINVLDPAVHKSNAASESVTFDAATGRAKLAHPAAANLVLKNDSGGTTYTEGTDYAVDLINGVITRIKTGTIPAA 155 (477) T ss_pred CceEEEEeccCCccccccccccccccccccccccccccccceeEEeecccccccccCccccccccchhhhhhhccccccc Confidence 999999887554321111000 Q ss_pred --------------------hhcc-cccccccchhhhhhhhhhhhhccccccccccch-HHHHHHHHHhhcccceEEEEe Q lcl|Aclame:pro 102 --------------------IVGT-QENGKFTGIKALLTAQSTVFVKPKLLCVPQHDN-QAVATELLSVAKKLNAFAFIS 159 (393) Q Consensus 102 --------------------~~~~-~~~~~~~gl~al~~~~~~~~~~~~~l~apg~s~-~~v~~al~~~a~~~~~~~~i~ 159 (393) ..+. ...+..+|+++++.++..++..+.++.+|++++ .+|.++|.++|++++++++++ T Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~g~~~a~~~~tg~~al~~~~~~~~~~~~iv~apg~~~~~~v~~~l~~~~~~~~~~a~~d 235 (477) T protein:vir:79 156 ATAAKATYDYADPTKVTAADIIGAVNAAGMRTGMKALKDTYNLYGYFSKILIAPAYCTQNSVSVELEAMAVQLGAIAYID 235 (477) T ss_pred cceeeceeccCCcccceeeeecccccccccchhhhhhhhhhhhcccccceeeccccccchhHHHHHHHHHhhcCeEEEEe Confidence 0000 012336678888899999999999999999975 569999999999999888776 Q ss_pred cCCCCcchhhhhhhc-------ccccceEEEeccceeEeeccCCceEEechhHHHHHHHHhhhccCCceecCCCceecce Q lcl|Aclame:pro 160 DNGATTKEQAYTYRQ-------NFSQREGMMIFGDWKSYNTDKKAYDTDYAVARACALQAYIDKTVGWHKNISNVELDGV 232 (393) Q Consensus 160 ~~~~~~~~~a~~~~~-------~~~s~~~~~~~p~~~~~~~~~~~~~~~p~S~~~ag~~a~~D~~~G~~~spaN~~l~gv 232 (393) .+.+.+.+++.+++. +++|.++++||||++++++.++..+.+|||+++||++||+|.++||||||+|+++.|| T Consensus 236 ~p~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~s~~~ag~~a~~d~~~g~~~span~~~~gv 315 (477) T protein:vir:79 236 APIGTTLAQALAGRGPAGTINFNTSSDRVRLCYPHVKVYDIATNAERLEPLSSRAAGLRARVDLDKGYWWSSSNQQLVGV 315 (477) T ss_pred cCCCCChHHHhhhhhhccccccccccceEEEEcCeeEEecccCCceeeechHHHHHHHHHHhhccCCceEccCCceeecc Confidence 666777777777775 4679999999999999999999999999999999999999999999999999999999 Q ss_pred eeceeecccccCCCchhhhhhcccceEEEEe--CCCEEEEecccCC---CCcccceeehhhHHHHHHHHHHHHhHHhhcc Q lcl|Aclame:pro 233 TGITKAVEFDINESSTEANYLNEKGITICLN--HNGFRYWGSRTLA---TDTRWAFQQSVRTAQIIKETIGAGLAWAVDM 307 (393) Q Consensus 233 ~~~~~~~~~~~~~~~~~~~~ln~~gi~~~~~--~~G~~~wG~rT~~---~d~~~~~i~~rR~~~~i~~~i~~~~~~~v~e 307 (393) .+++.++.++.+++++|+++||++|||++++ ++|+++||+||++ .++.|+||++||++++|+++|++.++|+||| T Consensus 316 ~~~~~~~~~~~~~~~~~~~~L~~~~i~~i~~~~~~G~~~wG~rT~~~~~~~~~~~~i~vrR~~~~i~~~~~~~~~~~v~e 395 (477) T protein:vir:79 316 TGVERPLSAMIDDPQSDVNMLNEQGITTVFSSYGSGLRLWGNRTAAWPTVTHMRNFENVRRTGDVINESLRYFSQQFVDA 395 (477) T ss_pred eecccccccccCCChhhHHHHhhCCceEEEEecCCcEEEEcccccCCCCCCccceeeehhhHHHHHHHHHHHHHHHhccC Confidence 9999999999999999999999999999964 6899999999994 4678999999999999999999999999999 Q ss_pred cCCHHHHHHHHHHHHHHHHHHhhcccccccceEEEec-CCCCHHHhhCCEEEEEEEEEecCcceeEEEEEEEcchHHHHH Q lcl|Aclame:pro 308 PLTPLRVKTMLEAINNKLRSWASGDDPRILGARVWVA-EEITADIIKSGKFVIKYDYHWIPSLESLGLEQRVNDEYVVDL 386 (393) Q Consensus 308 pn~~~~~~~i~~~i~~~l~~l~~~g~~~~~~~~v~~~-~~nt~~~i~~G~~~~~v~~~p~~p~e~i~~~~~~~~~~~~~~ 386 (393) ||++.+|++|++++++||++||++|.+ .+++++|+ ++||++++++|+|+++|+++|++|+|||+|+++++++||+++ T Consensus 396 ~~~~~~~~~i~~~i~~~l~~l~~~g~l--~g~~v~~~~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~ 473 (477) T protein:vir:79 396 PIDQGLIDSLVESVNGFGRKLIGDGAL--LGFKAWFDPARNPKEELAAGHLLINYKYTVPPPLERLTYETEITSEYLLTL 473 (477) T ss_pred CCCHHHHHHHHHHHHHHHHHHHhCCce--eeeEEEEecCCCCHHHhhCCeEEEEEEEEecCCceeEEEEEEEechHHhhh Confidence 999999999999999999999998854 45666554 679999999999999999999999999999999999999988 Q ss_pred HHHH Q lcl|Aclame:pro 387 VNTL 390 (393) Q Consensus 387 ~~~~ 390 (393) +.-- T Consensus 474 ~~~~ 477 (477) T protein:vir:79 474 KGGN 477 (477) T ss_pred ccCC Confidence 8766 No 16 >protein:vir:6594 Length: 666 # NCBI annotation: tail sheath protein # Family: family:all:661 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891725;genbank:gi:33620670;genbank:GeneID:1725344 Probab=100.00 E-value=1.7e-87 Score=496.30 Aligned_cols=379 Identities=14% Similarity=0.112 Sum_probs=299.6 Q ss_pred CcccCCCeEEEEcCCCcccccccccceeEEEEeecccccccccccceEEeecchhhhhhcccc---cchhhhhhhhhccc Q lcl|Aclame:pro 4 LDTYLHGVEVVEVNAGGVTISTAATSVIGVVCTGDQADAETFPLNTPVLITNPLNYLEKAGST---GTLRRTLNSIGSIV 80 (393) Q Consensus 4 ~~~~~~GV~v~ev~~~~~~i~~v~tav~g~vg~a~~~d~~~~p~~~~vl~t~~~~~~~~~g~~---~tl~~~~~~~~~~~ 80 (393) |++-.|||||+|+ +++++|..+.|++.+|+|+++.. |+++|++++++.+++..||.. ..+..++..+|.++ T Consensus 1 ~~~~~PgVyv~e~-~~~~~i~~v~ts~~~fvG~~~~G-----p~~~p~~v~s~~~~~~~fG~~~~~~~~~~~~~~~f~ng 74 (666) T protein:vir:65 1 MTLLSPGFETKET-TLSTTIVQSETGRAALVGKFQWG-----PAFQIIQVTNEVELVNKFGQPDNNTADYFMSGANFLQY 74 (666) T ss_pred CceecCceEEEEe-cCcccccccCcccceEEecccCC-----CCccCEEecCHHHHHHHcCCccccchhHHHHHHHHHhc Confidence 6677799999999 58889999999999999988776 889999999999999999953 23445566666666 Q ss_pred CceEEEEEeccccccccc---------------------------------------c----c-c-----------hh-- Q lcl|Aclame:pro 81 KTPTVIVRVAESDDSDTL---------------------------------------T----A-N-----------IV-- 103 (393) Q Consensus 81 ~~~~~vv~~~~~~~~~~~---------------------------------------~----~-~-----------~~-- 103 (393) +..++++++......... . . . .. T Consensus 75 g~~~~vvrv~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~~~~V~~~~~~~~~~~~~~~~~~~~~~~g~~~~t~~~~~~~ 154 (666) T protein:vir:65 75 GNDLRVVRVLNKEKAKNATALAGNVEFEITNEGSNYEVGDTIKIKHNRQDIETAGKVTKVDGDGKVKGVFIPTGKIIAHA 154 (666) T ss_pred CceEEEEEccCcccccccccccCceeeeEeeccccccccceEEEEecccccccccccccccccccccccccccceeeccc Confidence 666665554211000000 0 0 0 00 Q ss_pred ------------------------------------cc-------------cc-------------------c---cc-- Q lcl|Aclame:pro 104 ------------------------------------GT-------------QE-------------------N---GK-- 110 (393) Q Consensus 104 ------------------------------------~~-------------~~-------------------~---~~-- 110 (393) +. .. . +. T Consensus 155 ~~~g~~~~l~~~~~~~~~~~~~~~~~a~sv~~~~~~g~~~~~~~~~a~~~~~~~~~~~~~~~~~~~a~~A~~~g~~g~~i 234 (666) T protein:vir:65 155 KAIGVYPELDGGWTAEFTSSSGNGSAALSVTKIVTDSGLLLTDLETSRANITNQTFLTKLKKYDMPAVSAIYAGEIGNSL 234 (666) T ss_pred cccCcceeEeeccceeecccCcccccceeeeecccccceeeeeecccccccccccccccccccccceeeeeeccccccce Confidence 00 00 0 00 Q ss_pred ------------------------------------------------ccc----------------------------- Q lcl|Aclame:pro 111 ------------------------------------------------FTG----------------------------- 113 (393) Q Consensus 111 ------------------------------------------------~~g----------------------------- 113 (393) ..| T Consensus 235 ~v~i~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~e~~~~~~~~~~~~~~~~~~~~~~~~~ 314 (666) T protein:vir:65 235 EVEILARSAFKNTAPDLTMYPYGGERTAARNLIPYAPQNDNQYAFIVRRDGVVVESYVLSTLKGDKDVYGNSIYMDDFFA 314 (666) T ss_pred eEEeecccccccccccccccccccccccceeeecccccccccceeeeecCCcccceeecccCcccccccchhhhhhhhhc Confidence 000 Q ss_pred -----------------------------------------------hhhhhhhhhhhhhccccccccccch-----HHH Q lcl|Aclame:pro 114 -----------------------------------------------IKALLTAQSTVFVKPKLLCVPQHDN-----QAV 141 (393) Q Consensus 114 -----------------------------------------------l~al~~~~~~~~~~~~~l~apg~s~-----~~v 141 (393) ..++..........++++++|+++. .++ T Consensus 315 ~~~~~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~l~~p~~~~~~~~~~~v 394 (666) T protein:vir:65 315 RGSSQYIYATAQGWVDGFSGIISLAGGVSANEATTGGVGADPFIGAMMQGWDLFAERESIHVNLLIAGACAGEGDAFSTV 394 (666) T ss_pred ccccceeeeecccccccccceEEccCCCCcCcccccccccccccccHHHHHHHHhhhhhccCCceeecCcCCccchhHHH Confidence 0000000000011345667777653 578 Q ss_pred HHHHHHhhcccceEEEEecCC---------CCcchhhhhhhcc----------cccceEEEeccceeEeeccCCceEEec Q lcl|Aclame:pro 142 ATELLSVAKKLNAFAFISDNG---------ATTKEQAYTYRQN----------FSQREGMMIFGDWKSYNTDKKAYDTDY 202 (393) Q Consensus 142 ~~al~~~a~~~~~~~~i~~~~---------~~~~~~a~~~~~~----------~~s~~~~~~~p~~~~~~~~~~~~~~~p 202 (393) +.+|.++|+++++++.++++| ..+.+++++|+.. ++|.|+++||||++++++.++..+++| T Consensus 395 ~~~l~~~~~~~~~~~a~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p 474 (666) T protein:vir:65 395 QKHAVSIGDERQDCLVMVSPPRSTVVNIPVTTAIDNLIAWREGSGNYNENNMNINTTYAVIDGNYKYQYDKYNDVNRWVP 474 (666) T ss_pred HHHHHHHHhhccceEEEeccccceeeecCCCCCHHHHHHHHHhcccccccccccCcceEEEEcCceEEecccCCceeEec Confidence 999999999999988777665 5667888888864 679999999999999999999999999 Q ss_pred hhHHHHHHHHhhhccCCceecCCCceecceeeceeecccccCCCchhhhhhcccceEEEE--eCCCEEEEecccCCCC-c Q lcl|Aclame:pro 203 AVARACALQAYIDKTVGWHKNISNVELDGVTGITKAVEFDINESSTEANYLNEKGITICL--NHNGFRYWGSRTLATD-T 279 (393) Q Consensus 203 ~S~~~ag~~a~~D~~~G~~~spaN~~l~gv~~~~~~~~~~~~~~~~~~~~ln~~gi~~~~--~~~G~~~wG~rT~~~d-~ 279 (393) ||+++||++||+|.++||||||+|+++.||.+.. +.+..+++.|++.||++|||+++ +++|+++||+||++++ + T Consensus 475 ~sg~vAGl~Ar~D~~~g~~~span~~~~~i~g~~---~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~~~~s 551 (666) T protein:vir:65 475 LAADIAGLCARTDAVSQPWMSPAGYNRGQIMNVV---KLAIEPRKAHRDRLYQAAINPVIGAGGEGFILMGDKTATTVPS 551 (666) T ss_pred hHHHHHHHHHHHhccCCcEEccCCeecceeeccc---cceeecChhHHHhhhhCCceEEEEeCCCeEEEEecccCCCCCc Confidence 9999999999999999999999999988777753 34566788999999999999995 4689999999999876 5 Q ss_pred ccceeehhhHHHHHHHHHHHHhHHhhcccCCHHHHHHHHHHHHHHHHHHhhcccccccceEEEec-CCCCHHHhhCCEEE Q lcl|Aclame:pro 280 RWAFQQSVRTAQIIKETIGAGLAWAVDMPLTPLRVKTMLEAINNKLRSWASGDDPRILGARVWVA-EEITADIIKSGKFV 358 (393) Q Consensus 280 ~~~~i~~rR~~~~i~~~i~~~~~~~v~epn~~~~~~~i~~~i~~~l~~l~~~g~~~~~~~~v~~~-~~nt~~~i~~G~~~ 358 (393) +|+||++|||++||+++|+++++|+||||||+.||++|+++++.||++||++|+ +.||.|.|| ++||+++|++|+|+ T Consensus 552 ~~~~i~vrR~~~~i~~si~~~~~~~v~epn~~~l~~~i~~~i~~~L~~l~~~ga--l~g~~V~~d~~~nt~~~i~~G~~~ 629 (666) T protein:vir:65 552 PFDRINVRRLFNMLKKNIGDSSKYKLFENNDNFTRASFRMEVSQYLSTIRSLGG--IYDFRVQCDTTNNTPDVIDRNEFV 629 (666) T ss_pred ccceEehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCc--eeeeEEEEcCCCCCHHHhhCCeEE Confidence 899999999999999999999999999999999999999999999999999885 455667665 56999999999999 Q ss_pred EEEEEEecCcceeEEEEEEEcch--HHHHHHHHHhcC Q lcl|Aclame:pro 359 IKYDYHWIPSLESLGLEQRVNDE--YVVDLVNTLKAL 393 (393) Q Consensus 359 ~~v~~~p~~p~e~i~~~~~~~~~--~~~~~~~~~~~~ 393 (393) ++|+++|++|+|||+|++..... .++|++++++|- T Consensus 630 ~~i~~~p~~pae~i~~~~~~~~~~~~~~e~~~~~~~~ 666 (666) T protein:vir:65 630 ASMFIKPAKSINYIMLNFTAVATGSDFDEIIGPANQA 666 (666) T ss_pred EEEEEEecCCcceEEEEEEEeecCccHHHHHHHHhcC Confidence 99999999999999999888655 799999999999 No 17 >protein:vir:106427 Length: 679 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944106;genbank:gi:38640150;genbank:GeneID:2658188 Probab=100.00 E-value=1.4e-87 Score=496.72 Aligned_cols=378 Identities=13% Similarity=0.120 Sum_probs=299.8 Q ss_pred CcccCCCeEEEEcCCCcccccccccceeEEEEeecccccccccccceEEeecchhhhhhcccc---cchhhhhhhhhccc Q lcl|Aclame:pro 4 LDTYLHGVEVVEVNAGGVTISTAATSVIGVVCTGDQADAETFPLNTPVLITNPLNYLEKAGST---GTLRRTLNSIGSIV 80 (393) Q Consensus 4 ~~~~~~GV~v~ev~~~~~~i~~v~tav~g~vg~a~~~d~~~~p~~~~vl~t~~~~~~~~~g~~---~tl~~~~~~~~~~~ 80 (393) +++..|||||+|+ +++++|..+.|++.+|+|.++.. |+++|++++++.+++..||.. ..+..++..+|.++ T Consensus 1 ~~~~~Pgvyv~e~-~~~~~i~~~~t~~~~~vg~~~~g-----p~~~p~~i~s~~~~~~~fg~~~~~~~~~~~~~~~f~~g 74 (679) T protein:vir:10 1 MTLLSPGVETKEI-NLQTTIARSSTGRAALVGKFNWG-----PAYQISQVVSEVDLVDKFGRPDDQTADSFFSGVNFLNY 74 (679) T ss_pred CceecCceEEEee-cCCcccccCccccceeeecccCC-----CCccCEEecCHHHHHHHcCCcccccchHHHHHHHHHhC Confidence 6666799999999 58999999999999999998776 899999999999999999963 34666777788888 Q ss_pred CceEEEEEeccccccccc---------------------------------------c---------------------- Q lcl|Aclame:pro 81 KTPTVIVRVAESDDSDTL---------------------------------------T---------------------- 99 (393) Q Consensus 81 ~~~~~vv~~~~~~~~~~~---------------------------------------~---------------------- 99 (393) +..++++|+......... . T Consensus 75 g~~~~vvrv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~s~~~~~~~~~~~~~~~~~~~~v~~~~~~~~a~ 154 (679) T protein:vir:10 75 GNDLRLVRVLNETKSRNSSALYQSLSYTITSPGVDYKVGDVVNVLQGGNVIATGKVTVVNASGGIVAFYVPTAAIIDKAK 154 (679) T ss_pred CCeEEEEEccCcccccccccccccccccccccccccccccceeeeeCCCcccceeEEEeeccCceeeeeecccccccccc Confidence 888877765322110000 0 Q ss_pred --------cc--------------------hhc-ccccc-------------------------------------cccc Q lcl|Aclame:pro 100 --------AN--------------------IVG-TQENG-------------------------------------KFTG 113 (393) Q Consensus 100 --------~~--------------------~~~-~~~~~-------------------------------------~~~g 113 (393) .. +.. ..+.. ...+ T Consensus 155 ~~~~~~~l~~a~~~~~~~~t~~~g~~~~~~v~~v~~~~~~~~~~~~~a~~~i~~~~~~~~t~~~~~~~~~~~~~~A~~~g 234 (679) T protein:vir:10 155 SLNDYPALDNAWQIQFAAGGPGAGQAATATVVGINLDSTIFVPNDEYAMSAISERSETKRTFIDICEEMKVPAIVARYAG 234 (679) T ss_pred cccccceecccceeeeeeccccccceeeeeeeeeccCCceeeccccccccccccccccchhhhhhhhccccceeeeeccc Confidence 00 000 00000 0000 Q ss_pred ---------------------------------------------------------------------h--h------- Q lcl|Aclame:pro 114 ---------------------------------------------------------------------I--K------- 115 (393) Q Consensus 114 ---------------------------------------------------------------------l--~------- 115 (393) . + T Consensus 235 ~~gn~i~v~~va~~~~~~~~~~~a~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vvv~~~g~~~~~~~~~~~ 314 (679) T protein:vir:10 235 TYGDNIKVLMIAYKDYYKFNEAGKIVSVNTINPKVFPTGLDYGNVTPSSYLEFGPQNESQFAFIVFNNGVAVESKILSTK 314 (679) T ss_pred ccCCcceEEEEeecccccccccccccccccccccccccccccccceeeeecccccccccceeeEEecccccccceeeecc Confidence 0 0 Q ss_pred ---------------hhhhhh-------------------------------------hhh-------hhcccccccccc Q lcl|Aclame:pro 116 ---------------ALLTAQ-------------------------------------STV-------FVKPKLLCVPQH 136 (393) Q Consensus 116 ---------------al~~~~-------------------------------------~~~-------~~~~~~l~apg~ 136 (393) .+.... ..+ ...+.++++|++ T Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~p~~ 394 (679) T protein:vir:10 315 PGDRDIYGTSIYINEYFGNGYSSFVQGVAESWPVGYTGVLAFGGGQSSNTDISAAEFMKGWDMFADREHTDVNLFIAGAV 394 (679) T ss_pred cccccccchhhhhhhhhcCcccceeeeccccccccccceeeccCCccCCCccchhhhhhhhhhhhcccccccceEEecCC Confidence 000000 000 012345677776 Q ss_pred c------hHHHHHHHHHhhcccceEEEEecCCC---------Ccchhhhhhhc-------------ccccceEEEeccce Q lcl|Aclame:pro 137 D------NQAVATELLSVAKKLNAFAFISDNGA---------TTKEQAYTYRQ-------------NFSQREGMMIFGDW 188 (393) Q Consensus 137 s------~~~v~~al~~~a~~~~~~~~i~~~~~---------~~~~~a~~~~~-------------~~~s~~~~~~~p~~ 188 (393) + ..+|+.+|+++|+++++++++.|+|. .+.+++..|+. +++|.|+++||||+ T Consensus 395 ~~~~~~~~~~v~~~l~~~~~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~s~~~~~~~p~~ 474 (679) T protein:vir:10 395 AGEGAQIASTVQKAVVAIADERRDCLVLISPPREYMINQPAASVVRKLVDWRRGVNQAGISLDDNMNIGTTYASVDGNYK 474 (679) T ss_pred CCCchhhhHHHHHHHHHHHHhhCCeEEEEeccccccccccccchHHHHHHHHhhcccccchhhhhhccCcceEEEEccce Confidence 4 25689999999999998888888763 33456667764 46799999999999 Q ss_pred eEeeccCCceEEechhHHHHHHHHhhhccCCceecCCCceecceeeceeecccccCCCchhhhhhcccceEEEE--eCCC Q lcl|Aclame:pro 189 KSYNTDKKAYDTDYAVARACALQAYIDKTVGWHKNISNVELDGVTGITKAVEFDINESSTEANYLNEKGITICL--NHNG 266 (393) Q Consensus 189 ~~~~~~~~~~~~~p~S~~~ag~~a~~D~~~G~~~spaN~~l~gv~~~~~~~~~~~~~~~~~~~~ln~~gi~~~~--~~~G 266 (393) +++++.++..+++|||+++||++||+|.++||||||+|+++.+|.++. +.+..+++.|++.||++|||+++ +++| T Consensus 475 ~~~d~~~~~~~~~p~sg~vAGl~Ar~D~~~g~~~sPan~~~~~i~g~~---~~~~~~~~~~~~~Ln~~gin~i~~~~g~G 551 (679) T protein:vir:10 475 YQYDKYNDVNRWIPLAADIAGLCARTDTVGQPWQSPAGFNRGQIVNVI---KLAVDTRQAHRDEMYTNGINPIVGFAGQG 551 (679) T ss_pred eeecccCCceEEechHHHHHHHHHHhhccCCcEECcCCeeeccccccc---cceeecChhhHHhhhhCCceEEEEecCCe Confidence 999999999999999999999999999999999999999988887753 34456778899999999999995 5789 Q ss_pred EEEEecccCCCCc-ccceeehhhHHHHHHHHHHHHhHHhhcccCCHHHHHHHHHHHHHHHHHHhhcccccccceEEEec- Q lcl|Aclame:pro 267 FRYWGSRTLATDT-RWAFQQSVRTAQIIKETIGAGLAWAVDMPLTPLRVKTMLEAINNKLRSWASGDDPRILGARVWVA- 344 (393) Q Consensus 267 ~~~wG~rT~~~d~-~~~~i~~rR~~~~i~~~i~~~~~~~v~epn~~~~~~~i~~~i~~~l~~l~~~g~~~~~~~~v~~~- 344 (393) +++||+||+++|+ +|+||++|||+++|+++|+++++|+||||||+.+|.+|++++++||++||++|++ .+|.|.|| T Consensus 552 ~~~wG~rT~~~~~s~~~~i~vrR~~~~i~~si~~~~~~~v~epn~~~~~~~i~~~i~~fL~~l~~~gal--~gf~v~~d~ 629 (679) T protein:vir:10 552 YILYGDKTASQAPTPFDRINVRRLFNLLKKSISESAKYKLFELNDAFTRSSFRSEVGSYLDTIRSLGGI--YDFRVVCDE 629 (679) T ss_pred EEEEcccccCCCCcccceEehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCce--eeeEEEEcC Confidence 9999999998765 7999999999999999999999999999999999999999999999999998854 55667665 Q ss_pred CCCCHHHhhCCEEEEEEEEEecCcceeEEEEEEE--cchHHHHHHHHHhc Q lcl|Aclame:pro 345 EEITADIIKSGKFVIKYDYHWIPSLESLGLEQRV--NDEYVVDLVNTLKA 392 (393) Q Consensus 345 ~~nt~~~i~~G~~~~~v~~~p~~p~e~i~~~~~~--~~~~~~~~~~~~~~ 392 (393) ++||+++|++|+|+++|+++|++|+|||+|++.. +..+|+|+++++++ T Consensus 630 ~~nt~~~i~~G~~~~~i~~~p~~pae~i~~~~~~~~~~~~~~e~~~~~~~ 679 (679) T protein:vir:10 630 SNNTPAVIDRNEFVATILIKPARSINYITLSFVATSTGADFDELVGSFQQ 679 (679) T ss_pred CCCCHHHhhCCeEEEEEEEEecCCccEEEEEEEEeecCccHHHHHHHhcC Confidence 5699999999999999999999999999999777 55589999999999 No 18 >protein:vir:80984 Length: 666 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469499;genbank:gi:157311456;genbank:GeneID:5602117 Probab=100.00 E-value=2.9e-87 Score=495.00 Aligned_cols=379 Identities=15% Similarity=0.131 Sum_probs=297.9 Q ss_pred CcccCCCeEEEEcCCCcccccccccceeEEEEeecccccccccccceEEeecchhhhhhcccc---cchhhhhhhhhccc Q lcl|Aclame:pro 4 LDTYLHGVEVVEVNAGGVTISTAATSVIGVVCTGDQADAETFPLNTPVLITNPLNYLEKAGST---GTLRRTLNSIGSIV 80 (393) Q Consensus 4 ~~~~~~GV~v~ev~~~~~~i~~v~tav~g~vg~a~~~d~~~~p~~~~vl~t~~~~~~~~~g~~---~tl~~~~~~~~~~~ 80 (393) |++..|||||+|+ +++++|..+.|++.+|+|.++.. |+++|++++++.+++..||.. ..+..++..+|.++ T Consensus 1 ~~~~~Pgvyv~e~-~~~~~~~~~~t~~~~~vg~~~~g-----p~~~p~~i~~~~~~~~~fg~~~~~~~~~~~~~~~f~~~ 74 (666) T protein:vir:80 1 MTLLSPGFETKET-TLSTTIVQSATGRAALVGKFQWG-----PAFQIIQVTNEVELVNKFGQPDNNTADYFMSGANFLQY 74 (666) T ss_pred CceecCceEEEEe-cCCccccccCcccceEEeccccC-----CCccceEecCHHHHHHhcCCccCccchHHHHHHHHhcC Confidence 6666799999999 58899999999999999998776 889999999999999999953 23445666677777 Q ss_pred CceEEEEEeccccccccccc-------------------------------------------c---------------- Q lcl|Aclame:pro 81 KTPTVIVRVAESDDSDTLTA-------------------------------------------N---------------- 101 (393) Q Consensus 81 ~~~~~vv~~~~~~~~~~~~~-------------------------------------------~---------------- 101 (393) +..++++|+...+....... . T Consensus 75 g~~~~v~R~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~i~~~~~~~~~~~~ta~~~~~a 154 (666) T protein:vir:80 75 GNDLRVVRVLNKEKAKNATALAGNIEFEITNEGSNYEVGDTIKIKHNRQDIETAGKVTKVDGDGKVKGVFIPTGKIIAHA 154 (666) T ss_pred CCeEEEEEecCccccccccccccceeEEEeeccccccccccccccccCcccccCcceEEEeecceeeeeecchhhhcccc Confidence 77777766532110000000 0 Q ss_pred ----------------h------------h-c-ccccc---------c-----------------------c-------- Q lcl|Aclame:pro 102 ----------------I------------V-G-TQENG---------K-----------------------F-------- 111 (393) Q Consensus 102 ----------------~------------~-~-~~~~~---------~-----------------------~-------- 111 (393) + . + ..+.. . . T Consensus 155 ~~~~~~~~v~~~~~~~~~~~~~~~~~a~~V~~~~~~~~~~~~~~~~a~~~~t~~~~~~~~~~~~~~a~~a~~~g~~g~~l 234 (666) T protein:vir:80 155 KAIGVYPELDGDWTAEFTSSSGNGSAALSVTKIVTDSGLLLTDLETSRANITNQTFLTKLQKYDMPAVSAIYAGEIGNSL 234 (666) T ss_pred ccccccceeeccceeeeccccccceeeeeeeeeecCCccceeeeccccccccccccccccccccchhhhhhcccccccce Confidence 0 0 0 00000 0 0 Q ss_pred -------------------------------------------------cc--h--------------h----hhh---- Q lcl|Aclame:pro 112 -------------------------------------------------TG--I--------------K----ALL---- 118 (393) Q Consensus 112 -------------------------------------------------~g--l--------------~----al~---- 118 (393) .+ + . .+. T Consensus 235 ~v~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~e~~~~~~~~~~~~~~~~~~~~~~~~~ 314 (666) T protein:vir:80 235 EVEILARSAFKNTAPDLTMYPYGGERTAARNLIPYAPQNDNQYAFIVRRDGVVVESYVLSTLKGDKDVYGNSIYMDDFFG 314 (666) T ss_pred eeeeccccccccccccceeeeccccccccceeeeeccccccceeeEeccCCccceeeecccccccccccchhhhhhhhhc Confidence 00 0 0 000 Q ss_pred hhh-------------h--------------------------------hh-------hhccccccccccc-----hHHH Q lcl|Aclame:pro 119 TAQ-------------S--------------------------------TV-------FVKPKLLCVPQHD-----NQAV 141 (393) Q Consensus 119 ~~~-------------~--------------------------------~~-------~~~~~~l~apg~s-----~~~v 141 (393) ... . .+ ...+.++++|+++ ..++ T Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~l~~p~~~~~~~~~~~v 394 (666) T protein:vir:80 315 RGSSQYIYATAQGWVDGFSGIISLAGGVSANEATTGGVGADPFIGAMMQGWGLFAERESIHVNLLIAGACAGEGDAFSTV 394 (666) T ss_pred cccceeeeecccccccccceEEEecCCCCcccccccccccccccccchhhhhhhhhhcccccceEeecCcCCcccchHHH Confidence 000 0 00 0012345566654 3568 Q ss_pred HHHHHHhhcccceEEEEecCC---------CCcchhhhhhhcc----------cccceEEEeccceeEeeccCCceEEec Q lcl|Aclame:pro 142 ATELLSVAKKLNAFAFISDNG---------ATTKEQAYTYRQN----------FSQREGMMIFGDWKSYNTDKKAYDTDY 202 (393) Q Consensus 142 ~~al~~~a~~~~~~~~i~~~~---------~~~~~~a~~~~~~----------~~s~~~~~~~p~~~~~~~~~~~~~~~p 202 (393) +.+|+++|+++++++.+.++| +++.+++++|++. ++|.|+++||||++++|+.+++.+++| T Consensus 395 ~~~~~~~~~~~~~~~~~~~~~~~~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~l~~p~~~~~d~~~~~~~~~p 474 (666) T protein:vir:80 395 QKHAVSIGDERQDCLVMVSPPRSTVVNIPVTTAIDNLIAWREGSGNYNENNMNINTTYAVIDGNYKYQYDKYNDVNRWVP 474 (666) T ss_pred HHHHHHHHHhhcceEEEeecceeEEeecCCCCCHHHHHHHHHhcccchhhhcccCcceEEEEcCceEEecccCCceeEec Confidence 899999999998776655544 5677889988864 779999999999999999999999999 Q ss_pred hhHHHHHHHHhhhccCCceecCCCceecceeeceeecccccCCCchhhhhhcccceEEEE--eCCCEEEEecccCCCC-c Q lcl|Aclame:pro 203 AVARACALQAYIDKTVGWHKNISNVELDGVTGITKAVEFDINESSTEANYLNEKGITICL--NHNGFRYWGSRTLATD-T 279 (393) Q Consensus 203 ~S~~~ag~~a~~D~~~G~~~spaN~~l~gv~~~~~~~~~~~~~~~~~~~~ln~~gi~~~~--~~~G~~~wG~rT~~~d-~ 279 (393) ||+++||++||+|.++||||||||+++.|+.+. ++.+..+++.|++.||++|||+++ +++|+++||+||++++ + T Consensus 475 ~sg~~AGl~Ar~D~~~g~~~sPan~~~~~i~g~---~~~~~~~~~~~~~~Ln~~gIn~i~~~~g~G~~~wG~rT~~~~~s 551 (666) T protein:vir:80 475 LAADIAGLCARTDAVSQPWMSPAGYNRGQIMNV---VKLAIEPRKAHRDRLYQAAINPVIGAGGEGFILMGDKTATTVPS 551 (666) T ss_pred hHHHHHHHHHHHhhcCCceEccCCeecceeecc---ccceeecChhHHHhhhhCCeeEEEEeCCCeEEEEccccCCCCCc Confidence 999999999999999999999999998777764 344566788999999999999995 5789999999999876 4 Q ss_pred ccceeehhhHHHHHHHHHHHHhHHhhcccCCHHHHHHHHHHHHHHHHHHhhcccccccceEEEec-CCCCHHHhhCCEEE Q lcl|Aclame:pro 280 RWAFQQSVRTAQIIKETIGAGLAWAVDMPLTPLRVKTMLEAINNKLRSWASGDDPRILGARVWVA-EEITADIIKSGKFV 358 (393) Q Consensus 280 ~~~~i~~rR~~~~i~~~i~~~~~~~v~epn~~~~~~~i~~~i~~~l~~l~~~g~~~~~~~~v~~~-~~nt~~~i~~G~~~ 358 (393) +|+||++|||++||+++|++.++|+||||||+.||.+|++++++||++||++|+ +.||.|.|| ++||+++|++|+|+ T Consensus 552 ~~~~i~vRRl~~~i~~si~~~~~~~v~epn~~~l~~~i~~~i~~~L~~l~~~ga--l~g~~V~~d~~~nt~~di~~G~~~ 629 (666) T protein:vir:80 552 PFDRINVRRLFNMLKKNIGDSSKYKLFENNDNFTRASFRMEVSQYLSTIRSLGG--IYDFRVQCDTTNNTPDVIDRNEFV 629 (666) T ss_pred ccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhcCc--eeeeEEEEcCCCCCHHHhhCCeEE Confidence 899999999999999999999999999999999999999999999999999885 455777665 56999999999999 Q ss_pred EEEEEEecCcceeEEEEEEEcch--HHHHHHHHHhcC Q lcl|Aclame:pro 359 IKYDYHWIPSLESLGLEQRVNDE--YVVDLVNTLKAL 393 (393) Q Consensus 359 ~~v~~~p~~p~e~i~~~~~~~~~--~~~~~~~~~~~~ 393 (393) ++|+++|++|+|||+|++.+... .+++++++++|- T Consensus 630 ~~i~~~P~~Pae~I~~~~~~~~~~~~~~e~~~~~~~~ 666 (666) T protein:vir:80 630 ASMFIKPAKSINYIMLNFTAVATGSDFDEIIGPVNQA 666 (666) T ss_pred EEEEEEecCCcceEEEEEEEeecCccHHHHHHHHhcC Confidence 99999999999999999887555 799999999999 No 19 >protein:vir:6894 Length: 660 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861870;genbank:gi:32453661;genbank:GeneID:1494296 Probab=100.00 E-value=4.1e-87 Score=494.21 Aligned_cols=378 Identities=13% Similarity=0.127 Sum_probs=296.6 Q ss_pred CcccCCCeEEEEcCCCcccccccccceeEEEEeecccccccccccceEEeecchhhhhhcccc---cchhhhhhhhhccc Q lcl|Aclame:pro 4 LDTYLHGVEVVEVNAGGVTISTAATSVIGVVCTGDQADAETFPLNTPVLITNPLNYLEKAGST---GTLRRTLNSIGSIV 80 (393) Q Consensus 4 ~~~~~~GV~v~ev~~~~~~i~~v~tav~g~vg~a~~~d~~~~p~~~~vl~t~~~~~~~~~g~~---~tl~~~~~~~~~~~ 80 (393) |++-.|||||+|+ +++++|..+.|++.+|+|.++.. |+++|++++++.+++..||.. ..+..++..+|.++ T Consensus 1 ~~~~~PgVyv~e~-~~~~~i~~v~ts~~~fvG~~~~G-----p~~~p~~i~s~~~~~~~fG~~~~~~~~~~~~~~~f~~~ 74 (660) T protein:vir:68 1 MALLSPGVELKET-TVQSTVVNNSTGTAALAGKFQWG-----PAFQIKQITDEVALVDMFGTPNTDTADYFMSAMNFLQY 74 (660) T ss_pred CccccCceEEEEe-cCCcccccCCCcceeEEecccCC-----CCccCEEecCHHHHHHhcCCccCccchhHHHHHHHHhC Confidence 6666799999999 58999999999999999988776 889999999999999999953 34666777888888 Q ss_pred CceEEEEEeccccccccccc----------------------------------c--------------------h---- Q lcl|Aclame:pro 81 KTPTVIVRVAESDDSDTLTA----------------------------------N--------------------I---- 102 (393) Q Consensus 81 ~~~~~vv~~~~~~~~~~~~~----------------------------------~--------------------~---- 102 (393) +..++++++........... . + T Consensus 75 g~~~~vvRv~~~~~~~~~~~~~~~~~~t~~~~g~~~~~g~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~ta~~~~~a 154 (660) T protein:vir:68 75 GNDLRVVRAVDRDTAKNSSPVAGNINFTISSAGTNYRVGDKVVVKYSTDIIEPDGEVTSVDSDGKILNIFIPSGKIIAKA 154 (660) T ss_pred CCeEEEEEecccccccccccccccceeeeeccCcceeeeeeeeeecccccccccccceeeeecCceeeeeeccccccccc Confidence 88888877642111000000 0 0 Q ss_pred -------------h-----------------cc-cccc---------c-----c--------cc---hhh---------h Q lcl|Aclame:pro 103 -------------V-----------------GT-QENG---------K-----F--------TG---IKA---------L 117 (393) Q Consensus 103 -------------~-----------------~~-~~~~---------~-----~--------~g---l~a---------l 117 (393) . +. .+.. . . .+ +.+ + T Consensus 155 ~~~~~~~~~~~~~~~~v~~~~~~~~~~~~v~~~~~d~~~~~~~~~ta~~~~~~~~~~~~~~~~~~~~~~A~~~g~~G~~i 234 (660) T protein:vir:68 155 KEIGEYPELGSNWTAEMSGSSSGLSAVITIDSVVMDSGILLTEVETSEEAITSLTFQESIKKYGVPGVVALYPGELGDQL 234 (660) T ss_pred eeeccccccccceeEEeecccccceeeeeeccccccccceeeeeccccccccccceeeeecccCccccccccccccccce Confidence 0 00 0000 0 0 00 000 0 Q ss_pred -----------------------------h-------------------------------------------------- Q lcl|Aclame:pro 118 -----------------------------L-------------------------------------------------- 118 (393) Q Consensus 118 -----------------------------~-------------------------------------------------- 118 (393) . T Consensus 235 ~v~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 314 (660) T protein:vir:68 235 EIEIVSKADYDKGASAQLKIYPDGGTRYSTAKAIFGYGPQTDDQYAIIVRRNDSVVQSVVLSTKRGERDIYGSNIFIDDF 314 (660) T ss_pred EEEEeccccccccccccceeeecccccccceeeEeecccccccceeeeeecCCcceeeeeeecccccccccccceeeehh Confidence 0 Q ss_pred --hhhhhh--------------------h------------------------hccccccccccc------hHHHHHHHH Q lcl|Aclame:pro 119 --TAQSTV--------------------F------------------------VKPKLLCVPQHD------NQAVATELL 146 (393) Q Consensus 119 --~~~~~~--------------------~------------------------~~~~~l~apg~s------~~~v~~al~ 146 (393) .....+ + ..+.++++++.. ..+++.+|+ T Consensus 315 ~~~~~~~~v~~~~~~~~~~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~v~~~l~ 394 (660) T protein:vir:68 315 FAKGASNYIFATAQGWPKGFSGVIKLNGGLSSNETVEAGDLMEAWDLFADRESVNAQLFIAGSCAGESLEVASTVQKHVV 394 (660) T ss_pred hccCcccEEEEeecCCCccccceeeeccccccccccccchhhhHHHHhhhhhccccceeeccccCCCchHHHHHHHHHHH Confidence 000000 0 000001111111 135788999 Q ss_pred HhhcccceEEEEecCC---------CCcchhhhhhhc----------ccccceEEEeccceeEeeccCCceEEechhHHH Q lcl|Aclame:pro 147 SVAKKLNAFAFISDNG---------ATTKEQAYTYRQ----------NFSQREGMMIFGDWKSYNTDKKAYDTDYAVARA 207 (393) Q Consensus 147 ~~a~~~~~~~~i~~~~---------~~~~~~a~~~~~----------~~~s~~~~~~~p~~~~~~~~~~~~~~~p~S~~~ 207 (393) ++|+++++++.+.|+| +.+.+++++|+. +++|.++++||||++++|+.++..+++|||+++ T Consensus 395 ~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~sg~~ 474 (660) T protein:vir:68 395 AIGDSRQDCLVLCSPPRAAVVGIPVNRAVDNLVDWRTASGTYTDNNFNISSTYAAIDGNYKYQYDKYNDVNRWVPLAADI 474 (660) T ss_pred HHHHhhCCeEEEEcccceeEecCCCCCCHHHHHHHHhhcccccccccccCcceEEEEcCceEEecccCCceEEechhHHH Confidence 9999999888777654 456678888886 367999999999999999999999999999999 Q ss_pred HHHHHhhhccCCceecCCCceecceeeceeecccccCCCchhhhhhcccceEEE--EeCCCEEEEecccCCCCc-cccee Q lcl|Aclame:pro 208 CALQAYIDKTVGWHKNISNVELDGVTGITKAVEFDINESSTEANYLNEKGITIC--LNHNGFRYWGSRTLATDT-RWAFQ 284 (393) Q Consensus 208 ag~~a~~D~~~G~~~spaN~~l~gv~~~~~~~~~~~~~~~~~~~~ln~~gi~~~--~~~~G~~~wG~rT~~~d~-~~~~i 284 (393) ||++||+|.++||||||||+++.+|.+. ++.+..++++|++.||++|||++ ++++|+++||+||+++|+ .|+|| T Consensus 475 AGl~Ar~d~~~g~~~span~~~~~i~g~---~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~~~~s~~~~i 551 (660) T protein:vir:68 475 AGLCARTDNISQPWMSPAGYNRGQILNV---IKLAIETRQAQRDRLYQEAINPVTGTGGDGYVLYGDKTATSVPSPFDRI 551 (660) T ss_pred HHHHHHHhccCCcEEccCCeeeceeecc---ceeeecCChhHHHHHhhCCceEEEEecCCeEEEEcceecCCCCcccceE Confidence 9999999999999999999998888775 34566788999999999999999 567899999999998875 79999 Q ss_pred ehhhHHHHHHHHHHHHhHHhhcccCCHHHHHHHHHHHHHHHHHHhhcccccccceEEEe-cCCCCHHHhhCCEEEEEEEE Q lcl|Aclame:pro 285 QSVRTAQIIKETIGAGLAWAVDMPLTPLRVKTMLEAINNKLRSWASGDDPRILGARVWV-AEEITADIIKSGKFVIKYDY 363 (393) Q Consensus 285 ~~rR~~~~i~~~i~~~~~~~v~epn~~~~~~~i~~~i~~~l~~l~~~g~~~~~~~~v~~-~~~nt~~~i~~G~~~~~v~~ 363 (393) ++|||+++|+++|+++++|+||||||+.+|++|++++++||++||++|++ .||.|.| +++||+++|++|+|+++|++ T Consensus 552 ~vrR~~~~i~~si~~~~~~~v~epn~~~~~~~i~~~i~~~L~~l~~~gal--~gf~V~~d~~~nt~~~i~~G~~~~~i~~ 629 (660) T protein:vir:68 552 NVRRLFNMVKTNIGSASKYRLFELNNAFTRSSFRTETSQYLQGIKALGGV--YNFKVVCDTTNNTPAVIDRNEFVATFYL 629 (660) T ss_pred ehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhcCce--eeeEEEEecCCCCHHHhhCCeEEEEEEE Confidence 99999999999999999999999999999999999999999999998864 4566655 46799999999999999999 Q ss_pred EecCcceeEEEEEEEc--chHHHHHHHHHhc Q lcl|Aclame:pro 364 HWIPSLESLGLEQRVN--DEYVVDLVNTLKA 392 (393) Q Consensus 364 ~p~~p~e~i~~~~~~~--~~~~~~~~~~~~~ 392 (393) +|++|+|||+|++... ..+++++++++.+ T Consensus 630 ~p~~pae~i~l~~~~~~~~~~~~e~~~~v~~ 660 (660) T protein:vir:68 630 QPARSINYITLNFVATATGADFDELIGAVGG 660 (660) T ss_pred EecCCcceEEEEEEEeecCccHHHHHHhhcC Confidence 9999999999998775 5599999999999 No 20 >protein:vir:103456 Length: 659 # NCBI annotation: tail sheath monomer # Family: family:all:661 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803108;genbank:gi:116326388;genbank:GeneID:4405485 Probab=100.00 E-value=6.2e-87 Score=493.21 Aligned_cols=377 Identities=12% Similarity=0.094 Sum_probs=302.3 Q ss_pred CcccCCCeEEEEcCCCcccccccccceeEEEEeecccccccccccceEEeecchhhhhhcccc---cchhhhhhhhhccc Q lcl|Aclame:pro 4 LDTYLHGVEVVEVNAGGVTISTAATSVIGVVCTGDQADAETFPLNTPVLITNPLNYLEKAGST---GTLRRTLNSIGSIV 80 (393) Q Consensus 4 ~~~~~~GV~v~ev~~~~~~i~~v~tav~g~vg~a~~~d~~~~p~~~~vl~t~~~~~~~~~g~~---~tl~~~~~~~~~~~ 80 (393) |++-.|||||+|++.+++++.. .|++.+|+|+++.. |+++|++++++.+++..||.. ..+..++..+|.++ T Consensus 1 ~~~~~PgVyv~e~~~~~~~~~~-~ts~~~fvG~~~~G-----p~~~p~~i~s~~d~~~~fG~~~~~~~~~~~v~~~f~ng 74 (659) T protein:vir:10 1 MTLLSPGIELKETTVQSTVVNN-STGTAALAGKFQWG-----PAFQIKQVTNEVDLVNTFGQPTAETADYFMSAMNFLQY 74 (659) T ss_pred CceecCceEEEEecCCceeccc-CccceEEEecccCC-----CCCccEEecCHHHHHHHcCCcCCCcchhHHHHHHHhhC Confidence 6676799999999999987765 79999999998776 889999999999999999854 45677788888888 Q ss_pred CceEEEEEecccccccc---------------c--------------------c--c----------------------- Q lcl|Aclame:pro 81 KTPTVIVRVAESDDSDT---------------L--------------------T--A----------------------- 100 (393) Q Consensus 81 ~~~~~vv~~~~~~~~~~---------------~--------------------~--~----------------------- 100 (393) +..++++|+........ . . . T Consensus 75 g~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~v~~vd~~~~~~~~~i~~~~~~~~~ 154 (659) T protein:vir:10 75 GNDLRVVRAVDRDTAKNSSPIAGNIEYTISTPGSNYAVGDKITVKYVSDAIETEGKITEVDTDGKIKKINIPTAKIIAKA 154 (659) T ss_pred CCeEEEEEccCcccccccccccccceeeEeecccccccccceeeeecCCCccccceeeEEecccccceeeeccccccccc Confidence 88888877532110000 0 0 0 Q ss_pred ----chhc-----------------------cc-----------cc-----------------------c---------- Q lcl|Aclame:pro 101 ----NIVG-----------------------TQ-----------EN-----------------------G---------- 109 (393) Q Consensus 101 ----~~~~-----------------------~~-----------~~-----------------------~---------- 109 (393) .... .. .. . T Consensus 155 ~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~a~t~~~~~~~~~~~~~~~v~a~~~G~~g~~~ 234 (659) T protein:vir:10 155 KEVGEYPTLGSNWTAEISSSSSGLAAVITLGKIITDSGILLAEIENAEAAMTAVDFQANLKKYGIPGVVALYPGELGDKI 234 (659) T ss_pred ccccccceeeeeeeeeeeeeccccceeeEEeeeecCCceeEEeeccccccccccccccceeecccccccccccceecccc Confidence 0000 00 00 0 Q ss_pred --------c-----------------------------------------ccc-----------------------hhhh Q lcl|Aclame:pro 110 --------K-----------------------------------------FTG-----------------------IKAL 117 (393) Q Consensus 110 --------~-----------------------------------------~~g-----------------------l~al 117 (393) . ..+ +... T Consensus 235 tv~~~~~a~~~~~~~v~v~~~~~~~~~a~~~t~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 314 (659) T protein:vir:10 235 EIEIVSKADYAKGASALLPIYPGGGTRASTAKAVFGYGPQTDSQYAIIVRRNDAIVQSVVLSTKRGEKDIYDSNIYIDDF 314 (659) T ss_pred eEEEechhhccccceeeeeeeeecccccccceeeeeeccccccchhhccccccceeeeeeeeccccccccccchhhhhhh Confidence 0 000 0000 Q ss_pred -hh--------------------------------------------hhhhhhhccccccccccch------HHHHHHHH Q lcl|Aclame:pro 118 -LT--------------------------------------------AQSTVFVKPKLLCVPQHDN------QAVATELL 146 (393) Q Consensus 118 -~~--------------------------------------------~~~~~~~~~~~l~apg~s~------~~v~~al~ 146 (393) .. ....-...++++++|+++. .+|+.+|+ T Consensus 315 ~~~~~~~~v~~~~~~~~~~~~~~~~l~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~il~~p~~~~~~~~~~~~v~~al~ 394 (659) T protein:vir:10 315 FAKGGSEYIFATAQNWPEGFSGILTLSGGLSSNAEVTAGDLMEAWDFFADRESVDVQLFIAGSCAGESLETASTVQKHVV 394 (659) T ss_pred hccCcccEEEEeecccCCCccceeeecccccccccccchhHHHHHHHhhhccccceeEEEecCCCCcchhhhHHHHHHHH Confidence 00 0000011356777888743 46899999 Q ss_pred HhhcccceEEEEecCCC---------Ccchhhhhhhcc----------cccceEEEeccceeEeeccCCceEEechhHHH Q lcl|Aclame:pro 147 SVAKKLNAFAFISDNGA---------TTKEQAYTYRQN----------FSQREGMMIFGDWKSYNTDKKAYDTDYAVARA 207 (393) Q Consensus 147 ~~a~~~~~~~~i~~~~~---------~~~~~a~~~~~~----------~~s~~~~~~~p~~~~~~~~~~~~~~~p~S~~~ 207 (393) ++|+++++++++.|+|. .+.+++++|++. ++|+++++||||++++++.++..+++|||+++ T Consensus 395 ~~~~~~~~~~~~~d~p~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~s~~~~l~~p~~~~~d~~~~~~~~~p~sg~~ 474 (659) T protein:vir:10 395 SIGDARQDCLVLCSPPRETVVGIPVTRAVDNLVNWRTAAGSYTDNNFNISSTYAAIDGNYKYQYDKYNDVNRWVPLAADI 474 (659) T ss_pred HHHHhhCCeEEEEcCccccccCCCcccCHHHHHHHHHhcccccccccccCcceEEEEeCcEEEecccCCceEEechHHHH Confidence 99999999988888763 456788888864 77999999999999999999999999999999 Q ss_pred HHHHHhhhccCCceecCCCceecceeeceeecccccCCCchhhhhhcccceEEEE--eCCCEEEEecccCCCCc-cccee Q lcl|Aclame:pro 208 CALQAYIDKTVGWHKNISNVELDGVTGITKAVEFDINESSTEANYLNEKGITICL--NHNGFRYWGSRTLATDT-RWAFQ 284 (393) Q Consensus 208 ag~~a~~D~~~G~~~spaN~~l~gv~~~~~~~~~~~~~~~~~~~~ln~~gi~~~~--~~~G~~~wG~rT~~~d~-~~~~i 284 (393) ||++||+|.++||||||||+++.+|.++. +.+..+++.|++.||++|||+++ +++|+++||+||+++|+ +|+|| T Consensus 475 AGl~Ar~D~~~g~~~span~~~~~i~g~~---~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~~~~s~~~~i 551 (659) T protein:vir:10 475 AGLCARTDNVSQTWMSPAGYNRGQILNVI---KLAIETRQAQRDRLYQEAINPVTGTGGDGYVLYGDKTATSVPSPFDRI 551 (659) T ss_pred HHHHHHHhccCCceEccCCceeeeeeccc---cceecCCHhHHHHHhhCCeeEEEEeCCCeEEEEcccccCCCCcccceE Confidence 99999999999999999999988777764 34566788999999999999986 47899999999998765 89999 Q ss_pred ehhhHHHHHHHHHHHHhHHhhcccCCHHHHHHHHHHHHHHHHHHhhcccccccceEEEec-CCCCHHHhhCCEEEEEEEE Q lcl|Aclame:pro 285 QSVRTAQIIKETIGAGLAWAVDMPLTPLRVKTMLEAINNKLRSWASGDDPRILGARVWVA-EEITADIIKSGKFVIKYDY 363 (393) Q Consensus 285 ~~rR~~~~i~~~i~~~~~~~v~epn~~~~~~~i~~~i~~~l~~l~~~g~~~~~~~~v~~~-~~nt~~~i~~G~~~~~v~~ 363 (393) ++|||++||+++|+++++|+|||||++.||++|+++++.||++||++|++ .+|.|+|| ++||+++|++|+|+++|++ T Consensus 552 ~vrR~~~~i~~si~~~~~~~v~e~n~~~l~~~i~~~i~~fL~~l~~~gal--~~~~V~~d~~~nt~~~i~~G~~~~~i~~ 629 (659) T protein:vir:10 552 NVRRLFNMLKTNIGRSSKYRLFELNNAFTRSSFRTETAQYLQGIKALGGI--YEYRVVCDTTNNTPSVIDRNEFVATFYI 629 (659) T ss_pred ehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhcCce--eeEEEEEcCCCCCHHHhhCCeEEEEEEE Confidence 99999999999999999999999999999999999999999999998864 56677666 5699999999999999999 Q ss_pred EecCcceeEEEEEEEcchHHHHHHHHHhcC Q lcl|Aclame:pro 364 HWIPSLESLGLEQRVNDEYVVDLVNTLKAL 393 (393) Q Consensus 364 ~p~~p~e~i~~~~~~~~~~~~~~~~~~~~~ 393 (393) +|++|+|||+|++.+.....+ |+++.+. T Consensus 630 ~p~~pae~i~~~~~~~~~~~~--~~e~~~~ 657 (659) T protein:vir:10 630 QPARSINYITLNFVATATGAD--FDELTGL 657 (659) T ss_pred EecCCcceEEEEEEEEecCcc--hHHhhcc Confidence 999999999999999877766 8888888 No 21 >protein:vir:98263 Length: 664 # NCBI annotation: gp18 tail sheath monomer # Family: family:all:661 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239196;genbank:gi:66391671;genbank:GeneID:3416365 Probab=100.00 E-value=4.8e-87 Score=493.82 Aligned_cols=374 Identities=14% Similarity=0.112 Sum_probs=298.7 Q ss_pred CCcccCCCeEEEEcCCCcccccccccceeEEEEeecccccccccccceEEeecchhhhhhcccc---cchhhhhhhhhcc Q lcl|Aclame:pro 3 ILDTYLHGVEVVEVNAGGVTISTAATSVIGVVCTGDQADAETFPLNTPVLITNPLNYLEKAGST---GTLRRTLNSIGSI 79 (393) Q Consensus 3 m~~~~~~GV~v~ev~~~~~~i~~v~tav~g~vg~a~~~d~~~~p~~~~vl~t~~~~~~~~~g~~---~tl~~~~~~~~~~ 79 (393) |+ +-.|||||+|++ ++++|..++|++.+|+|.++.. |+++|++++++.+++..||.. ..+..++..+|.+ T Consensus 1 ma-~~~PgVyv~E~~-~~~~i~~~~ts~~~~vG~~~~G-----p~~~p~~i~s~~d~~~~fG~~~~~~~~~~~v~~~f~n 73 (664) T protein:vir:98 1 MA-LQSPGIETKETS-VQSTVVRNSTGRAAIVGKFSWG-----PAYQIRQISNEVELVNYFGAPDNLTADYFMSAVNFLQ 73 (664) T ss_pred Cc-eecCceEEEecC-CCcccccccccceEEEeeccCC-----CCCccEEecCHHHHHHhcCCccccchhHHHHHHHHHh Confidence 33 446999999994 8999999999999999998776 889999999999999999954 4567778888888 Q ss_pred cCceEEEEEeccccccccc--------------------------------------------c------------cch- Q lcl|Aclame:pro 80 VKTPTVIVRVAESDDSDTL--------------------------------------------T------------ANI- 102 (393) Q Consensus 80 ~~~~~~vv~~~~~~~~~~~--------------------------------------------~------------~~~- 102 (393) ++..++++|+......... . ... T Consensus 74 gg~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~gn~~~v~i~~~~~~~~ 153 (664) T protein:vir:98 74 YGNDLRLVRVVDKDAAKNASAIFNQIKTTIASQGSNYNVGDVIKVKYSNTVVEENGKVTQVDSNGKILAVTIPKRKKSLL 153 (664) T ss_pred cCCeEEEEEecCccccccccccccccceeeccCCcccccccceeEeecCcccccceeeccCCCCCceeeEeeccCcccee Confidence 8888888876321100000 0 000 Q ss_pred -------h-------------------------c-----c-----c---------------------------------- Q lcl|Aclame:pro 103 -------V-------------------------G-----T-----Q---------------------------------- 106 (393) Q Consensus 103 -------~-------------------------~-----~-----~---------------------------------- 106 (393) . + . . T Consensus 154 ~~~~~~~~~~~~~~~~~s~~~~s~g~a~a~~v~~v~~d~~~~~~~~~~a~~~i~~~~~~~~~~~~~~~~~~a~~~G~~Gn 233 (664) T protein:vir:98 154 VLNRSVLTQIFLLVGTTEIVSQSSGVSASITIDGIESDSGITLLNLDIAKETIQGTSFQTLTQKYQIPSVVALYPGELGS 233 (664) T ss_pred ecccccccccceecccceeeeeecccceeeecccccccceeeccccceeeeccccccceeeeeccccceeeeeecccccc Confidence 0 0 0 0 Q ss_pred ------------------------------------------c------------------------------------- Q lcl|Aclame:pro 107 ------------------------------------------E------------------------------------- 107 (393) Q Consensus 107 ------------------------------------------~------------------------------------- 107 (393) + T Consensus 234 ~isv~i~s~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~ 313 (664) T protein:vir:98 234 TVQVEIISKAAYDTGAMISGYPSGISVKNSGRSVMTYGPQTDNQYAFVVRRGGIVQESFIVSTDKTDKDIYGVNIYMDDF 313 (664) T ss_pred eeeeeecccccccCcceEeeccCceecccceeeeeeccccCccceeEEEecCCceeeeEEeecccCcccceeeeeechhh Confidence 0 Q ss_pred --------------------------------------cccccchhhhhhhhhhhhhccccccccccch------HHHHH Q lcl|Aclame:pro 108 --------------------------------------NGKFTGIKALLTAQSTVFVKPKLLCVPQHDN------QAVAT 143 (393) Q Consensus 108 --------------------------------------~~~~~gl~al~~~~~~~~~~~~~l~apg~s~------~~v~~ 143 (393) ...++|+++|+ ....+.++++++|++++ .+|+. T Consensus 314 ~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~~~~~~g~~~~~tgl~~l~---~~~~~~~~ll~~p~~~~~~~~~~~~v~~ 390 (664) T protein:vir:98 314 FANGGSQYVFGTSMNWPKGFSGILEFGGGLSSNDTVGADELMTGWDMFA---DREALHVPLLIAGGCAGESVEIASTVQK 390 (664) T ss_pred eecccceeeeeecccCCcccceeEeccCccccccccCchhHHHHHHhhh---cccccccceEEecCCCCCcHHHHHHHHH Confidence 00000011111 11123467888888864 36899 Q ss_pred HHHHhhcccceEEEEecCC---------CCcchhhhhhhc--------------ccccceEEEeccceeEeeccCCceEE Q lcl|Aclame:pro 144 ELLSVAKKLNAFAFISDNG---------ATTKEQAYTYRQ--------------NFSQREGMMIFGDWKSYNTDKKAYDT 200 (393) Q Consensus 144 al~~~a~~~~~~~~i~~~~---------~~~~~~a~~~~~--------------~~~s~~~~~~~p~~~~~~~~~~~~~~ 200 (393) +|+++|+++++++++.|+| .++.+++++|++ +++|.++++||||++++++.++..++ T Consensus 391 al~~~a~~~~~~~a~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~l~~p~~~~~d~~~~~~~~ 470 (664) T protein:vir:98 391 HVISIGDERQDCTVFVSPPRSLLVNIPLATAVDNIVEWRTGYKISGGTPVDNNLNVSSSYGFLDGNYKYQYDKYNDVNRW 470 (664) T ss_pred HHHHHHHhcCCeEEEEccccceeccCCccccHHHHHHHhhhccccccchhhhhcCCccceEEEEcCeEEEecccCCceEE Confidence 9999999999988888866 445567777765 57899999999999999999999999 Q ss_pred echhHHHHHHHHhhhccCCceecCCCceecceeeceeecccccCCCchhhhhhcccceEEEE--eC-CCEEEEecccCCC Q lcl|Aclame:pro 201 DYAVARACALQAYIDKTVGWHKNISNVELDGVTGITKAVEFDINESSTEANYLNEKGITICL--NH-NGFRYWGSRTLAT 277 (393) Q Consensus 201 ~p~S~~~ag~~a~~D~~~G~~~spaN~~l~gv~~~~~~~~~~~~~~~~~~~~ln~~gi~~~~--~~-~G~~~wG~rT~~~ 277 (393) +|||+++||++||+|.++||||||+|+++.+|.+.. +....+++.|++.||++|||++. ++ +||++||+||+++ T Consensus 471 ~p~sg~~AGl~A~~D~~~g~~~span~~~~~i~g~~---~~~~~~~~~~~~~Ln~~gIn~i~~~~~~~G~~~wG~rT~~~ 547 (664) T protein:vir:98 471 VPLAGDIAGLCVYTDSVANPWMSPAGYNRGQIRNCI---KLAIEPRTAHRDAMYQVQINPVTGFAGGSGFVLYGDKTLTS 547 (664) T ss_pred echHHHHHHHHHHhhhcCCcEECcCCceeeeeeccc---cceeecChhhHHHHHhCCCeEEEEeeCCCcEEEEcccccCC Confidence 999999999999999999999999999988777753 34456678899999999999984 45 7999999999987 Q ss_pred Cc-ccceeehhhHHHHHHHHHHHHhHHhhcccCCHHHHHHHHHHHHHHHHHHhhcccccccceEEEec-CCCCHHHhhCC Q lcl|Aclame:pro 278 DT-RWAFQQSVRTAQIIKETIGAGLAWAVDMPLTPLRVKTMLEAINNKLRSWASGDDPRILGARVWVA-EEITADIIKSG 355 (393) Q Consensus 278 d~-~~~~i~~rR~~~~i~~~i~~~~~~~v~epn~~~~~~~i~~~i~~~l~~l~~~g~~~~~~~~v~~~-~~nt~~~i~~G 355 (393) ++ +|+||++|||+++|+++|+++++|+|||||++.||++|+++++.||++||++|+ +.||.|+|| ++||+++|++| T Consensus 548 ~~s~~~~i~vrR~~~~i~~si~~~~~~~v~epn~~~l~~~i~~~i~~~L~~l~~~ga--l~g~~V~~d~~~nt~~~i~~G 625 (664) T protein:vir:98 548 VPSPFDRINVRRLFNMIKKDIGDNAKYKLFENNDDFTRASFRMDTGQYMTNIRALGG--CYDYRVICDTTNNTPDVIDRN 625 (664) T ss_pred CCcccceEeehhHHHHHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHHHHHHHhcCc--eeeeEEEEcCCCCCHHHhhCC Confidence 64 899999999999999999999999999999999999999999999999999885 455677666 56999999999 Q ss_pred EEEEEEEEEecCcceeEEEEEEEcchHHHHHHHHHhcC Q lcl|Aclame:pro 356 KFVIKYDYHWIPSLESLGLEQRVNDEYVVDLVNTLKAL 393 (393) Q Consensus 356 ~~~~~v~~~p~~p~e~i~~~~~~~~~~~~~~~~~~~~~ 393 (393) +|+++|+++|++|+|||+|++.+.....+ |+++..- T Consensus 626 ~~~~~i~~~p~~pae~I~~~~~q~~~~~~--~~e~~~~ 661 (664) T protein:vir:98 626 EFVATVYVKPPRSINYITLNFVATSTGAD--FDELVGP 661 (664) T ss_pred eEEEEEEEEecCCcceEEEEEEEeecCcc--hhHhccc Confidence 99999999999999999999998766644 5555544 No 22 >protein:vir:7206 Length: 659 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049780;genbank:gi:9632592;genbank:GeneID:1258597 Probab=100.00 E-value=8.3e-87 Score=492.52 Aligned_cols=377 Identities=13% Similarity=0.111 Sum_probs=299.0 Q ss_pred CcccCCCeEEEEcCCCcccccccccceeEEEEeecccccccccccceEEeecchhhhhhccc---ccchhhhhhhhhccc Q lcl|Aclame:pro 4 LDTYLHGVEVVEVNAGGVTISTAATSVIGVVCTGDQADAETFPLNTPVLITNPLNYLEKAGS---TGTLRRTLNSIGSIV 80 (393) Q Consensus 4 ~~~~~~GV~v~ev~~~~~~i~~v~tav~g~vg~a~~~d~~~~p~~~~vl~t~~~~~~~~~g~---~~tl~~~~~~~~~~~ 80 (393) |++-.|||||+|++.+++++ .+.|++.+|+|+++.. |+++|++++++.+++..||. ...+..++..+|.++ T Consensus 1 ~~~~~PgVyvee~~~~~~~~-~~~ts~~~fvG~~~~G-----p~~~p~~i~s~~~~~~~fG~~~~~~~~~~~~~~~f~ng 74 (659) T protein:vir:72 1 MTLLSPGIELKETTVQSTVV-NNSTGTAALAGKFQWG-----PAFQIKQVTNEVDLVNTFGQPTAETADYFMSAMNFLQY 74 (659) T ss_pred CceecCceEEEEecCCcccc-cCCCcceEEEeecCCC-----CCcccEEecCHHHHHHHcCCcCCCCchhHHHHHHHHhC Confidence 66777999999999999766 4589999999998876 88999999999999999994 355677888888888 Q ss_pred CceEEEEEeccccccccccc---------------------------c----------------------h--------- Q lcl|Aclame:pro 81 KTPTVIVRVAESDDSDTLTA---------------------------N----------------------I--------- 102 (393) Q Consensus 81 ~~~~~vv~~~~~~~~~~~~~---------------------------~----------------------~--------- 102 (393) +..++++++........... . . T Consensus 75 g~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~g~v~~~~~~~~~~~~~v~t~~~~a~~ 154 (659) T protein:vir:72 75 GNDLRVVRAVDRDTAKNSSPIAGNIDYTISTPGSNYAVGDKITVKYVSDDIETEGKITEVDADGKIKKINIPTGKNYAKA 154 (659) T ss_pred CceEEEEEccCCcccccccccccccceeecccccccccceeeeeeeccccccccceEEEeeccccceeeeeccccccccc Confidence 88888877632110000000 0 0 Q ss_pred --------------------hc---c-----------------c------------------------------c---c- Q lcl|Aclame:pro 103 --------------------VG---T-----------------Q------------------------------E---N- 108 (393) Q Consensus 103 --------------------~~---~-----------------~------------------------------~---~- 108 (393) .. . . . . T Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~v~v~~~~~~~~~~v~~~~~a~~~~~~~~~v~~~~~~~~~a~~~gt~g~~~ 234 (659) T protein:vir:72 155 KEVGEYPTLGSNWTAEISSSSSGLAAVITLGKIITDSGILLAEIENAEAAMTAVDFQANLKKYGIPGVVALYPGELGDKI 234 (659) T ss_pred cccccccccccceeeEEeeccccccceEEEEEeecCcceeeeeccccchhhhcccccccccccccceeeeccccccccce Confidence 00 0 0 0 0 Q ss_pred ------------------------cccc----------c-------------------------------------hhh- Q lcl|Aclame:pro 109 ------------------------GKFT----------G-------------------------------------IKA- 116 (393) Q Consensus 109 ------------------------~~~~----------g-------------------------------------l~a- 116 (393) .... + +.. T Consensus 235 tv~i~~~~~~~~~~~~~v~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 314 (659) T protein:vir:72 235 EIEIVSKADYAKGASALLPIYPGGGTRASTAKAVFGYGPQTDSQYAIIVRRNDAIVQSVVLSTKRGEKDIYDSNIYIDDF 314 (659) T ss_pred eEEEccccccccceeeeeecccccccccccceeeeeeecccccccceeeecccceeeeeeeeeccccccccchhhhhhhh Confidence 0000 0 000 Q ss_pred hhh----------------------------------------h----hhhhhhccccccccccch------HHHHHHHH Q lcl|Aclame:pro 117 LLT----------------------------------------A----QSTVFVKPKLLCVPQHDN------QAVATELL 146 (393) Q Consensus 117 l~~----------------------------------------~----~~~~~~~~~~l~apg~s~------~~v~~al~ 146 (393) +.. . ...-...+.++++|++++ .+++++|+ T Consensus 315 ~~~~~~~~v~~~~~~~~~~~~~~~~l~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~p~~~~~~~~~~~~v~~~l~ 394 (659) T protein:vir:72 315 FAKGGSEYIFATAQNWPEGFSGILTLSGGLSSNAEVTAGDLMEAWDFFADRESVDVQLFIAGSCAGESLETASTVQKHVV 394 (659) T ss_pred hhcCCceEEEEEecccCCcccccccccccccccccccchhHHHHHHHhhhccccceeEEEecCCCCcchhhhHHHHHHHH Confidence 000 0 000011356777888743 46899999 Q ss_pred HhhcccceEEEEecCC---------CCcchhhhhhhcc----------cccceEEEeccceeEeeccCCceEEechhHHH Q lcl|Aclame:pro 147 SVAKKLNAFAFISDNG---------ATTKEQAYTYRQN----------FSQREGMMIFGDWKSYNTDKKAYDTDYAVARA 207 (393) Q Consensus 147 ~~a~~~~~~~~i~~~~---------~~~~~~a~~~~~~----------~~s~~~~~~~p~~~~~~~~~~~~~~~p~S~~~ 207 (393) ++|+++++++++.|+| ..+.+++++|++. ++|+++++||||++++++.++..+++|||+++ T Consensus 395 ~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~sg~v 474 (659) T protein:vir:72 395 SIGDARQDCLVLCSPPRETVVGIPVTRAVDNLVNWRTAAGSYTDNNFNISSTYAAIDGNHKYQYDKYNDVNRWVPLAADI 474 (659) T ss_pred HHHhhhCCEEEEEcCccccccCCCcccCHHHHHHHHhhccccccccccccceeEEEEcCceeeccccCCceEEechHHHH Confidence 9999999999888876 3455788888864 67999999999999999999999999999999 Q ss_pred HHHHHhhhccCCceecCCCceecceeeceeecccccCCCchhhhhhcccceEEEE--eCCCEEEEecccCCCCc-cccee Q lcl|Aclame:pro 208 CALQAYIDKTVGWHKNISNVELDGVTGITKAVEFDINESSTEANYLNEKGITICL--NHNGFRYWGSRTLATDT-RWAFQ 284 (393) Q Consensus 208 ag~~a~~D~~~G~~~spaN~~l~gv~~~~~~~~~~~~~~~~~~~~ln~~gi~~~~--~~~G~~~wG~rT~~~d~-~~~~i 284 (393) ||++||+|.++||||||||+++.+|.++. +....++++|++.||++|||+++ +++|+++||+||+++|+ +|+|| T Consensus 475 AGl~Ar~D~~~G~~~span~~~~~i~g~~---~~~~~~~~~~~~~Ln~~gIn~i~~~~g~G~~~wG~rT~~~~~s~~~~i 551 (659) T protein:vir:72 475 AGLCARTDNVSQTWMSPAGYNRGQILNVI---KLAIETRQAQRDRLYQEAINPVTGTGGDGYVLYGDKTATSVPSPFDRI 551 (659) T ss_pred HHHHHHhhccCCcEEccCCeeeceeeccc---cccccCChhHHHHHhhCCceEEEEecCCeEEEEcccccCCCCcccceE Confidence 99999999999999999999988887753 34566789999999999999995 57899999999998875 89999 Q ss_pred ehhhHHHHHHHHHHHHhHHhhcccCCHHHHHHHHHHHHHHHHHHhhcccccccceEEEec-CCCCHHHhhCCEEEEEEEE Q lcl|Aclame:pro 285 QSVRTAQIIKETIGAGLAWAVDMPLTPLRVKTMLEAINNKLRSWASGDDPRILGARVWVA-EEITADIIKSGKFVIKYDY 363 (393) Q Consensus 285 ~~rR~~~~i~~~i~~~~~~~v~epn~~~~~~~i~~~i~~~l~~l~~~g~~~~~~~~v~~~-~~nt~~~i~~G~~~~~v~~ 363 (393) ++||++++|+++|+++++|+|||||++.||++|+++|++||++||++|++ .+|.|.|| ++||+++|++|+|+++|++ T Consensus 552 ~vrR~~~~i~~si~~~~~~~v~e~n~~~l~~~i~~~i~~fL~~l~~~gal--~~~~V~~d~~~nt~~~i~~G~~~~~i~~ 629 (659) T protein:vir:72 552 NVRRLFNMLKTNIGRSSKYRLFELNNAFTRSSFRTETAQYLQGNKALGGI--YEYRVVCDTTNNTPSVIDRNEFVATFYI 629 (659) T ss_pred eehhHHHHHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHHHHHHHhcCce--eeEEEEEcCCCCCHHHhhCCeEEEEEEE Confidence 99999999999999999999999999999999999999999999999865 56666665 5699999999999999999 Q ss_pred EecCcceeEEEEEEEcchHHHHHHHHHhcC Q lcl|Aclame:pro 364 HWIPSLESLGLEQRVNDEYVVDLVNTLKAL 393 (393) Q Consensus 364 ~p~~p~e~i~~~~~~~~~~~~~~~~~~~~~ 393 (393) +|++|+|||+|++.......+ |+|+.-+ T Consensus 630 ~p~~pae~I~~~~~~~~~~~~--~~e~~~~ 657 (659) T protein:vir:72 630 QPARSINYITLNFVATATGAD--FDELTGL 657 (659) T ss_pred EecCCccEEEEEEEEeecCcc--hHHhccc Confidence 999999999999988666544 5556555 No 23 >protein:vir:108052 Length: 660 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595294;genbank:gi:161622600;genbank:GeneID:5783771 Probab=100.00 E-value=4.5e-86 Score=488.47 Aligned_cols=374 Identities=14% Similarity=0.099 Sum_probs=296.7 Q ss_pred CcccCCCeEEEEcCCCcccccccccceeEEEEeecccccccccccceEEeecchhhhhhcccc---cchhhhhhhhhccc Q lcl|Aclame:pro 4 LDTYLHGVEVVEVNAGGVTISTAATSVIGVVCTGDQADAETFPLNTPVLITNPLNYLEKAGST---GTLRRTLNSIGSIV 80 (393) Q Consensus 4 ~~~~~~GV~v~ev~~~~~~i~~v~tav~g~vg~a~~~d~~~~p~~~~vl~t~~~~~~~~~g~~---~tl~~~~~~~~~~~ 80 (393) +++..|||||+|++ ++++|..++|++.+|+|.++.. |+++|++++++.+++..||.. ..+...+..+|.++ T Consensus 1 ~~~~~Pgvyv~e~~-~~~~i~~~~t~~~~~vg~~~~g-----p~~~p~~v~s~~~~~~~fg~~~~~~~~~~~~~~~f~~~ 74 (660) T protein:vir:10 1 MALLSPGIELKETS-VQSTVVRNATGRAALVGKFQWG-----PAFQVTQITNEVELVDLFGGPNNEVADYFMSGMNFLQY 74 (660) T ss_pred CceecCceEEEeec-CCccccCCCcccceEEeecCCC-----CCccCeEcCCHHHHHHHcCCcCCCchhHHHHHHHHHhC Confidence 66767999999995 8999999999999999988776 889999999999999999853 34556666777777 Q ss_pred CceEEEEEeccccccccccc--------------------------------------------chh------------- Q lcl|Aclame:pro 81 KTPTVIVRVAESDDSDTLTA--------------------------------------------NIV------------- 103 (393) Q Consensus 81 ~~~~~vv~~~~~~~~~~~~~--------------------------------------------~~~------------- 103 (393) +..++++++........... ... T Consensus 75 g~~~~vvrv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~v~~~~~~~~~~~~v~~~~a~g~~~~~~~~ta~~~~~a 154 (660) T protein:vir:10 75 GNDLRTVRVVSREFAKNASPIAGNIETTITTAGSNYAVGDKINIKYNQTVVESEGRVTSVDTDGKILSVFIPSAKIIAYA 154 (660) T ss_pred CceEEEEEecccccccccccccccceeEEeeccccccccceeeEeeccccccccccceeeccccceeeeccccccccccc Confidence 77777666533221000000 000 Q ss_pred ---------------------cc-----------cccc------------------------------------------ Q lcl|Aclame:pro 104 ---------------------GT-----------QENG------------------------------------------ 109 (393) Q Consensus 104 ---------------------~~-----------~~~~------------------------------------------ 109 (393) .. .+.. T Consensus 155 ~~v~~~~~~~~~~~~~~~~~~~~~~~a~sv~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~G~~i 234 (660) T protein:vir:10 155 RSLNQYPTLGPAWTAEVTSASSGVSGTITVGKIVTDSGILLTEAENSEEAITSLEFQAALKKFAMPGVVALYPGEIGSTL 234 (660) T ss_pred cccccccccccceeEEEecccCccccceeeeeeeccCcceEEeeeccccccccccceeeccccccceeeeecccccCcce Confidence 00 0000 Q ss_pred -------------------c------------------------------------------------------------ Q lcl|Aclame:pro 110 -------------------K------------------------------------------------------------ 110 (393) Q Consensus 110 -------------------~------------------------------------------------------------ 110 (393) . T Consensus 235 ~v~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 314 (660) T protein:vir:10 235 EVEIVSKAAYEAGSSKMLDVYPGGGTRASIAKAVFNYGPQTDDQYAIIVRRDGAIVESVVLSTKEGEKDVYGNNIYLDDY 314 (660) T ss_pred eEEEeeccccCCcceeEEeeeeccceeeEEeeeecccccccccccccccccCCcccceeeeeccccccccccceeeeehh Confidence 0 Q ss_pred -----------------------------------------ccchhhhhhhhhhhhhccccccccccc------hHHHHH Q lcl|Aclame:pro 111 -----------------------------------------FTGIKALLTAQSTVFVKPKLLCVPQHD------NQAVAT 143 (393) Q Consensus 111 -----------------------------------------~~gl~al~~~~~~~~~~~~~l~apg~s------~~~v~~ 143 (393) .+++.++. ......++++++|++. ..+|+. T Consensus 315 ~~~~~~~~v~~~~~~~~~~~~~~~~l~gg~~~~~~~t~~~~~~~~~~~~---~~~~~~~~~l~~p~~~~~~~~~~~~v~~ 391 (660) T protein:vir:10 315 FAKGTSNYIYATSLNWPKGFSGIINLSGGISANDKVTAGDLMQGWDLFA---DREALHINLLIAGAVAGEGDEVASTVQK 391 (660) T ss_pred hcCCCccEEEEEeccCCCCcccceeeeccccCccccccchhhhhhhhhh---hhhhcccceEEEcCcCCCchhhhHHHHH Confidence 00000000 0000124455666653 346899 Q ss_pred HHHHhhcccceEEEEecCCC---------Ccchhhhhhhc----------ccccceEEEeccceeEeeccCCceEEechh Q lcl|Aclame:pro 144 ELLSVAKKLNAFAFISDNGA---------TTKEQAYTYRQ----------NFSQREGMMIFGDWKSYNTDKKAYDTDYAV 204 (393) Q Consensus 144 al~~~a~~~~~~~~i~~~~~---------~~~~~a~~~~~----------~~~s~~~~~~~p~~~~~~~~~~~~~~~p~S 204 (393) +|.++|+++++++++.|+|. .+.+++++|++ +++|.++++||||++++++.+++.+++||| T Consensus 392 al~~~~~~~~~~~aiid~p~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s 471 (660) T protein:vir:10 392 HVVSIADERQDCLAFISPPKGLLVNVPLTRAVDNLIDWRTGAGTFDANNMNISTTYAAIDGNYKYQYDKYNDVNRWVPLA 471 (660) T ss_pred HHHHHHHhhCCEEEEEecCcccccccccccCHHHHHHHHhhcccccccccccCcceEEEEcCceEEecccCCceeEechh Confidence 99999999998888877762 35678888886 467999999999999999999999999999 Q ss_pred HHHHHHHHhhhccCCceecCCCceecceeeceeecccccCCCchhhhhhcccceEEEE--eC-CCEEEEecccCCCCc-c Q lcl|Aclame:pro 205 ARACALQAYIDKTVGWHKNISNVELDGVTGITKAVEFDINESSTEANYLNEKGITICL--NH-NGFRYWGSRTLATDT-R 280 (393) Q Consensus 205 ~~~ag~~a~~D~~~G~~~spaN~~l~gv~~~~~~~~~~~~~~~~~~~~ln~~gi~~~~--~~-~G~~~wG~rT~~~d~-~ 280 (393) +++||++||+|.++||||||||+++.++.+.. +.+..+++.|++.||++|||+++ ++ +||++||+||+++|+ . T Consensus 472 g~~AGl~Ar~D~~~g~~~sPan~~~~~i~g~~---~~~~~~~~~e~~~Ln~~gIn~i~~~~~~~G~~~wG~rT~~~~~s~ 548 (660) T protein:vir:10 472 ADLAGLCARTDDVSQPWMSPAGYNRGQILNVL---KLAIEPRQAQRDRMYQEAINPVVGFAGGDGFVLFGDKTATKVPSP 548 (660) T ss_pred HHHHHHHHHhhccCCcEEccCCeeeceeeccc---eeeecCChhhHHhHhhCCceEEEEeeCCCcEEEEcccccCCCCcc Confidence 99999999999999999999999987777653 34566888999999999999984 44 799999999998876 7 Q ss_pred cceeehhhHHHHHHHHHHHHhHHhhcccCCHHHHHHHHHHHHHHHHHHhhcccccccceEEEec-CCCCHHHhhCCEEEE Q lcl|Aclame:pro 281 WAFQQSVRTAQIIKETIGAGLAWAVDMPLTPLRVKTMLEAINNKLRSWASGDDPRILGARVWVA-EEITADIIKSGKFVI 359 (393) Q Consensus 281 ~~~i~~rR~~~~i~~~i~~~~~~~v~epn~~~~~~~i~~~i~~~l~~l~~~g~~~~~~~~v~~~-~~nt~~~i~~G~~~~ 359 (393) |+||++||||+||+++|+++++|+|||||++.||.+|+++++.||++||++|++ .||.|.|| ++||+++|++|+|++ T Consensus 549 ~~~i~vrR~~~~i~~si~~~~~~~v~epn~~~l~~~i~~~i~~fL~~l~~~gal--~g~~V~~d~~~nt~~di~~G~~~~ 626 (660) T protein:vir:10 549 MDHINVRRLFNMLKKNIGDASKYKLFELNDNFTRSSFRMEVSQYLDGIKALGGI--YEGRVVCDTTVNTPAVIDRNEFIA 626 (660) T ss_pred cceEehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCce--eeeEEEEcCCCCCHHHhhCCeEEE Confidence 999999999999999999999999999999999999999999999999998854 55667666 579999999999999 Q ss_pred EEEEEecCcceeEEEEEEEcch--HHHHHHHHHh Q lcl|Aclame:pro 360 KYDYHWIPSLESLGLEQRVNDE--YVVDLVNTLK 391 (393) Q Consensus 360 ~v~~~p~~p~e~i~~~~~~~~~--~~~~~~~~~~ 391 (393) +|+++|++|+|||+|++.+... .++|+++++. T Consensus 627 ~i~~~P~~pae~I~~~~~~~~~~~~~~e~~~~~~ 660 (660) T protein:vir:10 627 NIYVKPARSINYITLNFVATSTGADFDELIGPLV 660 (660) T ss_pred EEEEEecCCccEEEEEEEEeecCccHHHHhhhcC Confidence 9999999999999999887554 6889999888 No 24 >protein:vir:106984 Length: 743 # NCBI annotation: contractile sheath protein gp18 # Family: family:all:661 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195136;genbank:gi:58532913;uniprot:Q5GQN6;genbank:GeneID:3260481 Probab=100.00 E-value=5.5e-86 Score=488.02 Aligned_cols=377 Identities=13% Similarity=0.077 Sum_probs=299.2 Q ss_pred CCcccCCCeEEEEcCCCcccccccccceeEEEEeecccccccccccceEEeecchhhhhhcccc---cchhhhhhhhhcc Q lcl|Aclame:pro 3 ILDTYLHGVEVVEVNAGGVTISTAATSVIGVVCTGDQADAETFPLNTPVLITNPLNYLEKAGST---GTLRRTLNSIGSI 79 (393) Q Consensus 3 m~~~~~~GV~v~ev~~~~~~i~~v~tav~g~vg~a~~~d~~~~p~~~~vl~t~~~~~~~~~g~~---~tl~~~~~~~~~~ 79 (393) |+.+..|||||+|++.++++|..+.|++.+|+|+++.. |+++|++++|+.+++..||.. ..+..++..+|.+ T Consensus 1 m~~~~~Pgvyv~e~~~~~~~i~~~~t~~~~~vg~~~~G-----p~~~p~~i~s~~~~~~~fG~~~~~~~~~~~v~~~f~n 75 (743) T protein:vir:10 1 MASQVSPGILIKERDLTNAVVTGALQIRAAHASTFAKG-----PIGDIVNINTQKELVSVFGEPKEDNAEDWMVASEFLN 75 (743) T ss_pred CccccCCceEEEEecCCCceeccCCcceeEEEEeccCC-----CCCcCEEecCHHHHHHHcCCccCCcchHHHHHHHHHh Confidence 55666699999999999999999999999999998876 889999999999999999953 4577788888888 Q ss_pred cCceEEEEEecccccccccc------------------------------------------------------------ Q lcl|Aclame:pro 80 VKTPTVIVRVAESDDSDTLT------------------------------------------------------------ 99 (393) Q Consensus 80 ~~~~~~vv~~~~~~~~~~~~------------------------------------------------------------ 99 (393) ++..++++++.......... T Consensus 76 gg~~~~vvrv~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~G~~gN~i~V~v~~~~~d~~~~~~~~~~~~~~~ 155 (743) T protein:vir:10 76 YGGRLAVVRAETTGVLNATTGSAGVLVKNRESWDAGSGNGEVFVARTAGSWGNSLMGVLVDRGADYIVTFAATPTDTAVG 155 (743) T ss_pred CCceEEEEEccCccccccccccccccccccccccccccceeEEEEeeccccccceEEEEecCCCcceeeeeccccccccc Confidence 88777777764321000000 Q ss_pred -c-----------------------------------c-----h---------------------------hccccc--- Q lcl|Aclame:pro 100 -A-----------------------------------N-----I---------------------------VGTQEN--- 108 (393) Q Consensus 100 -~-----------------------------------~-----~---------------------------~~~~~~--- 108 (393) . . . .+..+. T Consensus 156 ~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 235 (743) T protein:vir:10 156 TQLLFSYSGTLVTGEILSYDSATNTATITASGTLTSQYLLDTPEQGLIGSFTDNSTTEVGRTPGTYSNVPASGGTGTGAT 235 (743) T ss_pred eeeeecccccccccceeeeeecCcceeeeeccccceeeecccccccccccccccccccccccccceeeEEeccccccccc Confidence 0 0 0 000000 Q ss_pred ----------------------------------------cc-------------------------------ccch--- Q lcl|Aclame:pro 109 ----------------------------------------GK-------------------------------FTGI--- 114 (393) Q Consensus 109 ----------------------------------------~~-------------------------------~~gl--- 114 (393) .. .++. T Consensus 236 ~tv~v~~~~~~vg~~v~~~~~~~~~~~~~~~~~~v~~~~~~~~t~~~~~~~~~~~g~~~~~a~~~~~~~~~~~~~~~~~~ 315 (743) T protein:vir:10 236 FNVVVADAGGGVGGSVVVTLANPGTGYNQGETLTIASAATGDGTDILVTVATLSDGTIAITELKDWYLNTEIGSTGIKLG 315 (743) T ss_pred ccccccccccccccccccccccccceeeeccccccccccccccccchhheecccccceeeeecccccccchhhccccccc Confidence 00 0000 Q ss_pred -----------------------------------------h---hhh-------------------------------- Q lcl|Aclame:pro 115 -----------------------------------------K---ALL-------------------------------- 118 (393) Q Consensus 115 -----------------------------------------~---al~-------------------------------- 118 (393) . .+. T Consensus 316 ~~~~~~~t~~~~~~~~~~~d~~~v~v~~~~~~~~~~~~~v~~~~~~~s~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~~~ 395 (743) T protein:vir:10 316 DIGPRPGTSQFATDNGITDDQVHFAVIDTTGELTGTANTIVERLTYLSKLSDARSEENANIYYKNVINEQSAYLYHGNDA 395 (743) T ss_pred cccccceeeeccccccccccceEEEEecCcceeeeccCceeEEEeeeecccccccccCcceeecceeccccceeeccCcc Confidence 0 000 Q ss_pred ---------------------------------------------------hh----hhhhhhccccccccccc-----h Q lcl|Aclame:pro 119 ---------------------------------------------------TA----QSTVFVKPKLLCVPQHD-----N 138 (393) Q Consensus 119 ---------------------------------------------------~~----~~~~~~~~~~l~apg~s-----~ 138 (393) .. ...-...+.++++|++. . T Consensus 396 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gG~d~~~~~~~~~~~~~~~~~~~~~~~~~ll~~p~~~~~~~~~ 475 (743) T protein:vir:10 396 AVQIAASGEAWGQSSDQVLADAGTAFSRTTGYWVNLAGGNDDFAYDAGEFGAAMDLFLDTEETEIDFVLMGGSMADEADT 475 (743) T ss_pred cceeeeccccCccccceeeeecccccccccceEEEeecCccccccchhHHHHHHHHhhhccccCcceEEecCcccCccch Confidence 00 00000113577788764 3 Q ss_pred HHHHHHHHHhhcccceEEEEecCCCC---------------cchhhhhhh-cccccceEEEeccceeEeeccCCceEEec Q lcl|Aclame:pro 139 QAVATELLSVAKKLNAFAFISDNGAT---------------TKEQAYTYR-QNFSQREGMMIFGDWKSYNTDKKAYDTDY 202 (393) Q Consensus 139 ~~v~~al~~~a~~~~~~~~i~~~~~~---------------~~~~a~~~~-~~~~s~~~~~~~p~~~~~~~~~~~~~~~p 202 (393) .+++.+|+++|+++++++++.|+|.. +..++..++ ..++|+++++||||++++++.++..+++| T Consensus 476 ~~v~~a~~~~~~~~~~~~a~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p 555 (743) T protein:vir:10 476 KSKATKVIAIAASRKDALAFVSPHKGNQIASTGNVALSSAQQKENTIAFFSDLTSTSYAVFDSGYKYVYDRFTDKYRYIP 555 (743) T ss_pred HHHHHHHHHHHHhhCCeEEEEecCCCccccccccccccccccchHHHHHHHhccCCeeEEEEccceeeeccccCceeEec Confidence 57899999999999977777777632 123444444 45789999999999999999999999999 Q ss_pred hhHHHHHHHHhhhccCCceecCCCceecceeeceeecccccCCCchhhhhhcccceEEEE--eCCCEEEEecccCC-CCc Q lcl|Aclame:pro 203 AVARACALQAYIDKTVGWHKNISNVELDGVTGITKAVEFDINESSTEANYLNEKGITICL--NHNGFRYWGSRTLA-TDT 279 (393) Q Consensus 203 ~S~~~ag~~a~~D~~~G~~~spaN~~l~gv~~~~~~~~~~~~~~~~~~~~ln~~gi~~~~--~~~G~~~wG~rT~~-~d~ 279 (393) ||+++||++||+|.++||||||||+++.||.++. +++..++++|++.||++|||+++ +++|+++||+||++ .|+ T Consensus 556 ~s~~~AGl~a~~D~~~g~~~span~~~~gi~g~~---~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~s~d~ 632 (743) T protein:vir:10 556 CNGDVAGLCVQTSNQLDDWYSPAGLNRGGILNAV---KLAYNPNKADRDELYQNRINPVVSLRGQGITLFGDKTALAAPS 632 (743) T ss_pred hhHHHHHHHHHhhccCCcEEccCCeeeeeeeccc---cceecCChhHHHhHhhCCceEEEEecCCeEEEEcccccCCCCc Confidence 9999999999999999999999999998888763 45567889999999999999995 57899999999985 589 Q ss_pred ccceeehhhHHHHHHHHHHHHhHHhhcccCCHHHHHHHHHHHHHHHHHHhhcccccccceEEEec-CCCCHHHhhCCEEE Q lcl|Aclame:pro 280 RWAFQQSVRTAQIIKETIGAGLAWAVDMPLTPLRVKTMLEAINNKLRSWASGDDPRILGARVWVA-EEITADIIKSGKFV 358 (393) Q Consensus 280 ~~~~i~~rR~~~~i~~~i~~~~~~~v~epn~~~~~~~i~~~i~~~l~~l~~~g~~~~~~~~v~~~-~~nt~~~i~~G~~~ 358 (393) +|+||++|||++||+++|+++++|+||||||+.+|++|++++++||++||++|+ +.+|.|+|| ++||+++|++|+|+ T Consensus 633 ~~~~i~vrR~~~~i~~si~~~~~~~v~e~n~~~~~~~i~~~i~~fL~~l~~~ga--l~~~~V~~d~~~nt~~~i~~G~~~ 710 (743) T protein:vir:10 633 AFDRINVRRLFLNLEKRARRLAEGVLFEQNDATTRAGFSSALNSYLSEVQARRG--VTDYLVICDESNNTPDIIDRNEFV 710 (743) T ss_pred ccceEeehhhHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhcCc--eeeeEEEEcCCCCCHHHhhCCeEE Confidence 999999999999999999999999999999999999999999999999999885 466777666 56999999999999 Q ss_pred EEEEEEecCcceeEEEEEE--EcchHHHHHHHH Q lcl|Aclame:pro 359 IKYDYHWIPSLESLGLEQR--VNDEYVVDLVNT 389 (393) Q Consensus 359 ~~v~~~p~~p~e~i~~~~~--~~~~~~~~~~~~ 389 (393) ++|+++|++|+|||+|++. ++..+|+|++++ T Consensus 711 ~~i~~~p~~pae~I~~~~~~~~~~~~~~e~~~~ 743 (743) T protein:vir:10 711 AEVYVKPTRSINFITITFTATKTGVTFSEVVGR 743 (743) T ss_pred EEEEEEecCCcceEEEEEEEeecCcchHhhhcC Confidence 9999999999999999977 467789999999 No 25 >protein:vir:101187 Length: 663 # NCBI annotation: tail sheath protein # Family: family:all:661 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932509;genbank:gi:37651635;genbank:GeneID:2610680 Probab=100.00 E-value=1.2e-85 Score=486.06 Aligned_cols=379 Identities=15% Similarity=0.120 Sum_probs=297.5 Q ss_pred CcccCCCeEEEEcCCCcccccccccceeEEEEeecccccccccccceEEeecchhhhhhccc---ccchhhhhhhhhccc Q lcl|Aclame:pro 4 LDTYLHGVEVVEVNAGGVTISTAATSVIGVVCTGDQADAETFPLNTPVLITNPLNYLEKAGS---TGTLRRTLNSIGSIV 80 (393) Q Consensus 4 ~~~~~~GV~v~ev~~~~~~i~~v~tav~g~vg~a~~~d~~~~p~~~~vl~t~~~~~~~~~g~---~~tl~~~~~~~~~~~ 80 (393) |++..|||||+|+ +++++|..+.|++.+|+|.++.. |+++|++++++.+++..||. ...+..++..+|.++ T Consensus 1 ~~~~~PgVyv~e~-~~~~~i~~v~ts~~~fvG~~~~G-----p~~~p~~i~s~~d~~~~fG~~~~~~~~~~~v~~~f~ng 74 (663) T protein:vir:10 1 MALLSPGIEMKET-SINSTVVRSATGRAAIVGKFAWG-----PAYEVRQVTNEVELVDMFGSPDNVTAPYFMSAMNFLQY 74 (663) T ss_pred CceecCceEEEEe-cCcccccccCccceeEEeeeccC-----CCCccEEecCHHHHHHHhCCcCccchhHHHHHHHHHhC Confidence 6666799999999 59999999999999999998876 88999999999999999996 345677888889999 Q ss_pred CceEEEEEeccccccccccc-----------------------------c-----------------h------------ Q lcl|Aclame:pro 81 KTPTVIVRVAESDDSDTLTA-----------------------------N-----------------I------------ 102 (393) Q Consensus 81 ~~~~~vv~~~~~~~~~~~~~-----------------------------~-----------------~------------ 102 (393) +..++++|+........... . . T Consensus 75 g~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~v~~~~~~~~~~~~~~~~~~n~~~~~v~~~~a~~~~~~ 154 (663) T protein:vir:10 75 GNDLRLVRVIDMEKAKNASPLVNQVSVTITTEGQGYTVGDAITVKYNNATITEAGKVTAVDSDGKIKSLFVPTAEIIAKT 154 (663) T ss_pred CCeEEEEEccCCcccccccccCCcceeeeeccccCccccccccccccccccccccceeeeccCCceEEEEeccccccccc Confidence 99999888643211100000 0 0 Q ss_pred --h------------------c-----------ccccc--------cccchh-----------------h---------- Q lcl|Aclame:pro 103 --V------------------G-----------TQENG--------KFTGIK-----------------A---------- 116 (393) Q Consensus 103 --~------------------~-----------~~~~~--------~~~gl~-----------------a---------- 116 (393) . . ..+.. ...++. + T Consensus 155 ~~v~~~~~~~~~~~~~~~~~~~~~~~~~~v~~vv~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~a~~~G~~Gn~i 234 (663) T protein:vir:10 155 RQLGTYPTLGDNWRIDVSGASGGSAAALALGNIVVDSGVTFGNSEDAPAVMTSPAVMEKYAKFGMPLVSAVYPGEIGSTV 234 (663) T ss_pred cccceeeeccccceeEeeeccccccccccccceecccceeeEeeccccccccccchhhhcccccceeeeeecccccccce Confidence 0 0 00000 000000 0 Q ss_pred --------------------------------------------------------------------------hhhh-- Q lcl|Aclame:pro 117 --------------------------------------------------------------------------LLTA-- 120 (393) Q Consensus 117 --------------------------------------------------------------------------l~~~-- 120 (393) .... T Consensus 235 ~v~i~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~~~~~~~~~~~~~~ 314 (663) T protein:vir:10 235 EVEIVSKTAFNSGAQQTIYPFGGTRTSNARGVIQYGPMTDDQFAIIVRRDGIVVESTVLSTRKGDRDVYGSNIFMDDYFR 314 (663) T ss_pred eEEecccccccccccccccccccccccccceeeeeccccccceeEEEecCCcceeeeeeeecccccccchhhhhhhhhhc Confidence 0000 Q ss_pred hh----------------------------------------------hhhhcccccccc--ccc----hHHHHHHHHHh Q lcl|Aclame:pro 121 QS----------------------------------------------TVFVKPKLLCVP--QHD----NQAVATELLSV 148 (393) Q Consensus 121 ~~----------------------------------------------~~~~~~~~l~ap--g~s----~~~v~~al~~~ 148 (393) .. .-.+.+.++++| +.. ..+|+.+|+++ T Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~l~gg~d~~~~~~~~~~~~~~~~l~~~~~~~~~~~i~~~~~~~~~~~~~~v~~al~~~ 394 (663) T protein:vir:10 315 NGGSNFIFASSEGWPAGFTGIIQLGGGTSANADVGADELIKGWDLFSDREALHVNLMIAGACGSDGAEIASTVQKYVVSL 394 (663) T ss_pred cCcceEEEEeecccCccccceeEcccccCCCccccchhhHHHHHhhhcccccceeEEEeccCCCCchhhHHHHHHHHHHH Confidence 00 000001111111 111 24588999999 Q ss_pred hcccceEEEEecCCC---------Ccchhhhhhhc-------------ccccceEEEeccceeEeeccCCceEEechhHH Q lcl|Aclame:pro 149 AKKLNAFAFISDNGA---------TTKEQAYTYRQ-------------NFSQREGMMIFGDWKSYNTDKKAYDTDYAVAR 206 (393) Q Consensus 149 a~~~~~~~~i~~~~~---------~~~~~a~~~~~-------------~~~s~~~~~~~p~~~~~~~~~~~~~~~p~S~~ 206 (393) |+++++++++.|+|. .+.+++++|++ +++|+++++||||++++|+.+++.+++|||++ T Consensus 395 a~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~l~~P~~~~~d~~~~~~~~~p~s~~ 474 (663) T protein:vir:10 395 ADDRQDCVAIVNPPAELMVGIPTSTAVKNIVEWRNGMTGSGEVVDNNMNISSTYAFIIGNYKYQYDKYNDINRWVPLAAD 474 (663) T ss_pred HHhhCCEEEEEecCcccccccccccchHHHHHHHHhccccccchhhccccCcceEEEEccceEEecccCCceEEechhHH Confidence 999998888888773 23456666653 57899999999999999999999999999999 Q ss_pred HHHHHHhhhccCCceecCCCceecceeeceeecccccCCCchhhhhhcccceEEEE--eC-CCEEEEecccCCCCc-ccc Q lcl|Aclame:pro 207 ACALQAYIDKTVGWHKNISNVELDGVTGITKAVEFDINESSTEANYLNEKGITICL--NH-NGFRYWGSRTLATDT-RWA 282 (393) Q Consensus 207 ~ag~~a~~D~~~G~~~spaN~~l~gv~~~~~~~~~~~~~~~~~~~~ln~~gi~~~~--~~-~G~~~wG~rT~~~d~-~~~ 282 (393) +||++||+|.++||||||||+.+.++.++. +.+..+++.|++.||++|||+++ ++ +|+++||+||+++|+ +|+ T Consensus 475 vAGl~Ar~D~~~g~~~sPan~~~~~i~g~~---~~~~~~~~~~~~~Ln~~gin~i~~~~~~~G~~~wG~rT~s~~~s~~~ 551 (663) T protein:vir:10 475 IAGLCAYTDQVSHPWMSPAGYRRGQIRNCI---KLAIEPKQSMRDTMYQVAINPVTGFAGGDGFVLFGDKMATQVPSPFD 551 (663) T ss_pred HHHHHHHhhccCCceEccCCceeccccccc---cceeccChhHHHHHhhCCceEEEEEeCCCcEEEEcccccCCCCcccc Confidence 999999999999999999999987777653 34566788999999999999984 55 799999999998775 899 Q ss_pred eeehhhHHHHHHHHHHHHhHHhhcccCCHHHHHHHHHHHHHHHHHHhhcccccccceEEEec-CCCCHHHhhCCEEEEEE Q lcl|Aclame:pro 283 FQQSVRTAQIIKETIGAGLAWAVDMPLTPLRVKTMLEAINNKLRSWASGDDPRILGARVWVA-EEITADIIKSGKFVIKY 361 (393) Q Consensus 283 ~i~~rR~~~~i~~~i~~~~~~~v~epn~~~~~~~i~~~i~~~l~~l~~~g~~~~~~~~v~~~-~~nt~~~i~~G~~~~~v 361 (393) ||++|||++||+++|+++++|+||||||+.+|++|++++++||++||++|+ +.+|.|+|| ++||+++|++|+|+++| T Consensus 552 ~i~vrR~~~~i~~si~~~~~~~v~e~n~~~l~~~i~~~i~~~L~~l~~~ga--l~g~~v~~d~~~nt~~~i~~G~~~~~i 629 (663) T protein:vir:10 552 RINVRRLFNMLKKNIGDTSKYELFENNDAFTRQSFRMETSQYLDGIRSLGG--CYDFRVVCDTTNNTPNVIDRNEFVGTI 629 (663) T ss_pred eEehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhcCc--eeeeEEEEcCCCCCHHHhhCCeEEEEE Confidence 999999999999999999999999999999999999999999999999885 456777666 57999999999999999 Q ss_pred EEEecCcceeEEEEEEEcc--hHHHHHHHHHhcC Q lcl|Aclame:pro 362 DYHWIPSLESLGLEQRVND--EYVVDLVNTLKAL 393 (393) Q Consensus 362 ~~~p~~p~e~i~~~~~~~~--~~~~~~~~~~~~~ 393 (393) +++|++|+|||+|++.+.. ..++|+++++++- T Consensus 630 ~~~p~~pae~i~~~~~~~~~~~~~~e~~~~~~~~ 663 (663) T protein:vir:10 630 YVKPPRSINYITLNMVATSTGANFDELIGPMQLA 663 (663) T ss_pred EEEecCCcceEEEEEEEeecCccHHHHHHHHhcC Confidence 9999999999999988755 4699999999999 No 26 >protein:vir:104858 Length: 729 # NCBI annotation: T4-like tail sheath protein # Family: family:all:661 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214361;genbank:gi:61806001;genbank:GeneID:3294378 Probab=100.00 E-value=2.8e-85 Score=484.13 Aligned_cols=379 Identities=16% Similarity=0.117 Sum_probs=293.3 Q ss_pred CCCCcccCCCeEEEEcCCCcccccccccceeEEEEeecccccccccccceEEeecchhhhhhcccc-----cchhhhhhh Q lcl|Aclame:pro 1 MSILDTYLHGVEVVEVNAGGVTISTAATSVIGVVCTGDQADAETFPLNTPVLITNPLNYLEKAGST-----GTLRRTLNS 75 (393) Q Consensus 1 m~m~~~~~~GV~v~ev~~~~~~i~~v~tav~g~vg~a~~~d~~~~p~~~~vl~t~~~~~~~~~g~~-----~tl~~~~~~ 75 (393) ||| ++-.|||||+|++.++++|..++|++.+|+|+++.. |+++|++++|+.+++..||.. ..+..++.. T Consensus 1 m~~-~~~~PgVyv~e~~~~~~~i~~v~ts~~~fvG~~~~G-----p~~~p~~i~s~~~~~~~fG~~~~~~~~~~~~~~~~ 74 (729) T protein:vir:10 1 MPL-NLASPGIVVREVDLTIGRVDPTSGSIGALVAPFAKG-----PVNDPQLIESEEDLLQTFGQPYSTDKHYEYWMVAS 74 (729) T ss_pred CCc-cccCCceEEEEecCCCcccccccccceeEEeccccC-----CCccCeEcCCHHHHHHHcCccccCCcchhHHHHHH Confidence 553 333599999999999999999999999999998866 889999999999999999974 235567888 Q ss_pred hhcccCceEEEEEeccccccccc--------------------------------------------------------- Q lcl|Aclame:pro 76 IGSIVKTPTVIVRVAESDDSDTL--------------------------------------------------------- 98 (393) Q Consensus 76 ~~~~~~~~~~vv~~~~~~~~~~~--------------------------------------------------------- 98 (393) +|.+++..|+++|+......... T Consensus 75 ~f~ngg~~~~vvRv~~~~~~~a~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~G~~gn~~~v~v 154 (729) T protein:vir:10 75 SYLAYGGTMQVVRADDYNTQTGVGLKNAFVTGGVGVGATMLRITSNTHYNQLGYDENTISGVSVAAKNPGTWANGIKVAI 154 (729) T ss_pred HHHhCCceEEEEecCcccccccccccccccccccccccccccccccccccccccccCCCcceEEEEeccCccccceeeEE Confidence 89999999999986431100000 Q ss_pred --------------------------------ccc-----------hhc-ccccc-c--------------------c-- Q lcl|Aclame:pro 99 --------------------------------TAN-----------IVG-TQENG-K--------------------F-- 111 (393) Q Consensus 99 --------------------------------~~~-----------~~~-~~~~~-~--------------------~-- 111 (393) ... ... ..+.. . . T Consensus 155 ~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~s~~~~~~~~~~~~~~~~~~ 234 (729) T protein:vir:10 155 IDGKADQILTVASGNTTAVGSAVTQSISKTIGTATGTTTIDGVLKGIVTGSTDTTLEVKVISHISAAGVETAVEYQQNGT 234 (729) T ss_pred ecccCcceeeeeccccccceeeeeeeccccccccccceeeeeeecccccccccccccceecccccccccceeccccccce Confidence 000 000 00000 0 0 Q ss_pred -----------------------------------------cc-----------hhhh---------------------- Q lcl|Aclame:pro 112 -----------------------------------------TG-----------IKAL---------------------- 117 (393) Q Consensus 112 -----------------------------------------~g-----------l~al---------------------- 117 (393) .+ ...+ T Consensus 235 ~~~~~~~s~~~~a~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~d~~~~~~~d~~~~~ 314 (729) T protein:vir:10 235 YTFDNSGSVNVIAAGSSGSGSAKSYTAQTDWFESQNIVLSNSTLEWDSIADAPGTSTYVSTRGGKNDEIHVLVIDDKGTI 314 (729) T ss_pred eeecccCccceeeeccccccccccceeeeccccccccccccccccccccccccccccccccccccccccceeeecccccc Confidence 00 0000 Q ss_pred -------h----------hhhh----------h----------------------------------------------- Q lcl|Aclame:pro 118 -------L----------TAQS----------T----------------------------------------------- 123 (393) Q Consensus 118 -------~----------~~~~----------~----------------------------------------------- 123 (393) . .... . T Consensus 315 ~~~~g~vve~~~~~s~~~~~~~~~~~~~~~~~vi~~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~ 394 (729) T protein:vir:10 315 TGNSGTILEKHLSLSKAKDAEYSVGSSSYWRDFLATNSKYIFGGGATSGITTTGYSVSSTNTLDTDSGWDQNAEGVNFGA 394 (729) T ss_pred ccCcccceeeeeeeeeccccccccccccccceeeccccceeeecccccccccccccccccceeccccccccccccccccc Confidence 0 0000 0 Q ss_pred ----------------------------------hhh---------ccccccc-----cccchHHHHHHHHHhhcccceE Q lcl|Aclame:pro 124 ----------------------------------VFV---------KPKLLCV-----PQHDNQAVATELLSVAKKLNAF 155 (393) Q Consensus 124 ----------------------------------~~~---------~~~~l~a-----pg~s~~~v~~al~~~a~~~~~~ 155 (393) .++ ....++. ++.....++.+|+++|++++++ T Consensus 395 ~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~g~~~l~~~~~~~~~~~~~~~~~~~~~~~~~v~~a~~~~~~~~~~~ 474 (729) T protein:vir:10 395 SGVATLTLAGGTNYGDKTDLTTSGALSSGVDDIISGYTLFENTEEIEVDFILMGAAHHPKEQSQAVAEKVTAVAEARKDA 474 (729) T ss_pred cceeEEEeecccccccccccccccccccchhHHHHHHHHhhcccccccceeeecCCCCCccchHHHHHHHHHHHHhcCCe Confidence 000 0000000 1112335778999999999988 Q ss_pred EEEecCCC------------------Ccchhhhhhhcccc-cceEEEeccceeEeeccCCceEEechhHHHHHHHHhhhc Q lcl|Aclame:pro 156 AFISDNGA------------------TTKEQAYTYRQNFS-QREGMMIFGDWKSYNTDKKAYDTDYAVARACALQAYIDK 216 (393) Q Consensus 156 ~~i~~~~~------------------~~~~~a~~~~~~~~-s~~~~~~~p~~~~~~~~~~~~~~~p~S~~~ag~~a~~D~ 216 (393) +++.++|. .+..++..++..+. +.++++||||++++++.++..+.+|||+++||++||+|. T Consensus 475 ~a~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~d~~~~~~~~~p~s~~~aGl~a~~d~ 554 (729) T protein:vir:10 475 VAFISPYRQAFLNDTVSGTVTVSNIDQTTENVVGFYAPLSSSTYSVFDSGYKYMFDRFNNTFRYVPLNGDIAGTCARTDI 554 (729) T ss_pred EEEecccccccccccccccccccccchhhHHHHHHHhhccCCceEEEEcCeeEEecccCCceEEechhHHHHHHHHHhhc Confidence 77776652 23345666776664 678999999999999999999999999999999999999 Q ss_pred cCCceecCCCceecceeeceeecccccCCCchhhhhhcccceEEEEe--CCCEEEEecccC-CCCcccceeehhhHHHHH Q lcl|Aclame:pro 217 TVGWHKNISNVELDGVTGITKAVEFDINESSTEANYLNEKGITICLN--HNGFRYWGSRTL-ATDTRWAFQQSVRTAQII 293 (393) Q Consensus 217 ~~G~~~spaN~~l~gv~~~~~~~~~~~~~~~~~~~~ln~~gi~~~~~--~~G~~~wG~rT~-~~d~~~~~i~~rR~~~~i 293 (393) ++||||||+|+++.||.++. ..+..++++|++.||++|||++++ ++|+++||+||+ +.|++|+||++|||+++| T Consensus 555 ~~g~~~span~~~~~i~g~~---~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~~~d~~~~~i~vrR~~~~i 631 (729) T protein:vir:10 555 EQFPWFSPAGTARGPILNSV---KLVYNPGKKQRDILYSNRINPVILSPGAGIILFGDKTGFGKSSAFDRINVRRLFIYL 631 (729) T ss_pred cCCcEEccCCccccceeccc---ceeeecChhhHhhhhhCCceEEEEecCCeEEEEcceecCCCCcccceeehhhhHHHH Confidence 99999999999988887754 345667889999999999999954 689999999998 579999999999999999 Q ss_pred HHHHHHHhHHhhcccCCHHHHHHHHHHHHHHHHHHhhcccccccceEEEec-CCCCHHHhhCCEEEEEEEEEecCcceeE Q lcl|Aclame:pro 294 KETIGAGLAWAVDMPLTPLRVKTMLEAINNKLRSWASGDDPRILGARVWVA-EEITADIIKSGKFVIKYDYHWIPSLESL 372 (393) Q Consensus 294 ~~~i~~~~~~~v~epn~~~~~~~i~~~i~~~l~~l~~~g~~~~~~~~v~~~-~~nt~~~i~~G~~~~~v~~~p~~p~e~i 372 (393) +++|+++++|+||||||+.+|++|++++++||++||++|. +.+|+|+|| ++||+++|++|+|+++|+++|++|+||| T Consensus 632 ~~si~~~~~~~v~epn~~~~~~~i~~~i~~~L~~l~~~g~--l~g~~v~~d~~~nt~~~i~~G~~~~~v~~~p~~p~e~i 709 (729) T protein:vir:10 632 EDAISAAAKDQLFEFNDELTRTNFVNIVEPFLRDVQAKRG--IFDFVVICDETNNTAAVIDSNEFVADIFIKPARSINFI 709 (729) T ss_pred HHHHHHHHHHhhcCCCCHHHHHHHHHHHHHHHHHHHhccc--eeeeEEEEcCCCCCHHHhhCCeEEEEEEEEecCCccEE Confidence 9999999999999999999999999999999999999885 456777666 5699999999999999999999999999 Q ss_pred EEEEEEcch--HHHHHHHHH Q lcl|Aclame:pro 373 GLEQRVNDE--YVVDLVNTL 390 (393) Q Consensus 373 ~~~~~~~~~--~~~~~~~~~ 390 (393) +|++++... +|+|++++| T Consensus 710 ~~~~~~~~~~~~~~e~~~~~ 729 (729) T protein:vir:10 710 GLTFVATRTGVAFEEVIGSV 729 (729) T ss_pred EEEEEEeecCccHHHHHhcC Confidence 999887654 789999999 No 27 >protein:vir:101804 Length: 663 # NCBI annotation: gp18 # Family: family:all:661 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238881;genbank:gi:66391956;genbank:GeneID:3416631 Probab=100.00 E-value=7.5e-85 Score=481.79 Aligned_cols=379 Identities=15% Similarity=0.118 Sum_probs=295.3 Q ss_pred CcccCCCeEEEEcCCCcccccccccceeEEEEeecccccccccccceEEeecchhhhhhcccc---cchhhhhhhhhccc Q lcl|Aclame:pro 4 LDTYLHGVEVVEVNAGGVTISTAATSVIGVVCTGDQADAETFPLNTPVLITNPLNYLEKAGST---GTLRRTLNSIGSIV 80 (393) Q Consensus 4 ~~~~~~GV~v~ev~~~~~~i~~v~tav~g~vg~a~~~d~~~~p~~~~vl~t~~~~~~~~~g~~---~tl~~~~~~~~~~~ 80 (393) |++..|||||+|+ +++++|..++|++.+|+|+++.. |+++|++++++.+++..||.. ..+..++..+|.++ T Consensus 1 ~~~~~Pgvyv~e~-~~~~~i~~~~t~~~~~vG~~~~G-----p~~~p~~v~~~~~~~~~fg~~~~~~~~~~~~~~~f~ng 74 (663) T protein:vir:10 1 MALLSPGIEMKET-SINSTVVRSATGRAAIVGKFAWG-----PAYEVRQVTNEVELVDMFGSPDNVTAPYFMSAMNFLQY 74 (663) T ss_pred CceecCceEEEEe-cCCccccccCcccceeEeecccC-----CCCccEEecCHHHHHHhcCCcCCcchhHHHHHHHHHhC Confidence 6666799999999 58999999999999999998876 889999999999999999964 34667778888888 Q ss_pred CceEEEEEecccccccccc-----------------------------c-------------c--hh------------- Q lcl|Aclame:pro 81 KTPTVIVRVAESDDSDTLT-----------------------------A-------------N--IV------------- 103 (393) Q Consensus 81 ~~~~~vv~~~~~~~~~~~~-----------------------------~-------------~--~~------------- 103 (393) +..++++|+.......... . . .. T Consensus 75 g~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~i~v~~~~~~~~~~~~~~~~~~~~~~~~v~~~ta~~~~~~ 154 (663) T protein:vir:10 75 GNDLRLVRVIDMEKAKNASPLVNQVSVTITTEGQGYTVGDAITVKYNNATITEAGKVTAVDSDGKIKSLFVPTAEIIAKT 154 (663) T ss_pred CCeEEEEEccCCccccccccccccceeEEeecccccccccccccccccccccccccceeeecccceEEEeeccccccccc Confidence 8888888763211100000 0 0 00 Q ss_pred -------------------------------cc-cccc--------cccchh-----------------h---------- Q lcl|Aclame:pro 104 -------------------------------GT-QENG--------KFTGIK-----------------A---------- 116 (393) Q Consensus 104 -------------------------------~~-~~~~--------~~~gl~-----------------a---------- 116 (393) +. .+.. ...++. + T Consensus 155 ~~v~~~~~~~~~~~~~~s~~s~~~~~a~~v~~v~~d~~~~v~~~~~a~~~~t~~~~~~~~~~~~~~~i~A~~~G~~Gn~i 234 (663) T protein:vir:10 155 RQLGTYPTLGDNWRIDVSGASGGSAAALALGNIVVDSGVTFGNSEDAPAVMTSPAVMEKYAKFGMPLISAVYPGEIGSTV 234 (663) T ss_pred cccccceeeccceeeEeeeccCccccccccceeccccceEEeeccccccccccccccccccccccceEEeccCCccccee Confidence 00 0000 000000 0 Q ss_pred ---h---------------------------------------------------------------------------h Q lcl|Aclame:pro 117 ---L---------------------------------------------------------------------------L 118 (393) Q Consensus 117 ---l---------------------------------------------------------------------------~ 118 (393) + . T Consensus 235 ~V~i~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~~~~~~~~~~~~~~ 314 (663) T protein:vir:10 235 EVEIVSKTAFNSGAQQTIYPFGGTRTSNARGVIQYGPMTDDQFAIIVRRDGIVVESTVLSTRKGDRDVYGSNIFMDDYFR 314 (663) T ss_pred eeeeccccccccccccceecccccccccccceeecccccccceeeEeecCCcceeeecccccccccccccchhhhhhhhc Confidence 0 0 Q ss_pred hh--------------------------------------------hhhhhhccccccccc--c----chHHHHHHHHHh Q lcl|Aclame:pro 119 TA--------------------------------------------QSTVFVKPKLLCVPQ--H----DNQAVATELLSV 148 (393) Q Consensus 119 ~~--------------------------------------------~~~~~~~~~~l~apg--~----s~~~v~~al~~~ 148 (393) .. ...-.+.+.+++++. . ...+|+.+|.++ T Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~l~gg~d~~~~~~~~~~~~~~~~l~~~~~~~~~~~i~~~~~~~~~~~~~~v~~~l~~~ 394 (663) T protein:vir:10 315 NGGSNFIFASSEGWPAGFTGIIQLGGGTSANADVGADELIKGWDLFSDREALHVNLMIAGACGSDGAEIASTVQKYVVSL 394 (663) T ss_pred CCcceEEEEeecccCccccceeEeccccCCccccchhhhHHHHHhhhcccccceeEEEeccCCCCchhhHHHHHHHHHHH Confidence 00 000000011112211 1 124588999999 Q ss_pred hcccceEEEEecCCC---------Ccchhhhhhh-------------cccccceEEEeccceeEeeccCCceEEechhHH Q lcl|Aclame:pro 149 AKKLNAFAFISDNGA---------TTKEQAYTYR-------------QNFSQREGMMIFGDWKSYNTDKKAYDTDYAVAR 206 (393) Q Consensus 149 a~~~~~~~~i~~~~~---------~~~~~a~~~~-------------~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~S~~ 206 (393) |+++++++++.|+|. .+.+++.+|+ .+++|.++++||||++++|+.++..+++|||++ T Consensus 395 a~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~sg~ 474 (663) T protein:vir:10 395 ADDRQDCVAIVNPPAELMVGIPTSTAVKNIVEWRNGMTGSGEVVDNNMNISSTYAFIIGNYKYQYDKYNDINRWVPLAAD 474 (663) T ss_pred HHhhCCEEEEEecCcccccccccccchHHHHHHHHhccccccchhhhcccCccceEEEcCceEEecccCCceEEechhHH Confidence 999998888888773 2334555555 457899999999999999999999999999999 Q ss_pred HHHHHHhhhccCCceecCCCceecceeeceeecccccCCCchhhhhhcccceEEE--EeC-CCEEEEecccCCCCc-ccc Q lcl|Aclame:pro 207 ACALQAYIDKTVGWHKNISNVELDGVTGITKAVEFDINESSTEANYLNEKGITIC--LNH-NGFRYWGSRTLATDT-RWA 282 (393) Q Consensus 207 ~ag~~a~~D~~~G~~~spaN~~l~gv~~~~~~~~~~~~~~~~~~~~ln~~gi~~~--~~~-~G~~~wG~rT~~~d~-~~~ 282 (393) +||++||+|.++||||||||+++.++.++. +.+..+++.|++.||++|||++ |++ +||++||+||+++|+ +|+ T Consensus 475 vAGl~Ar~D~~~g~~~sPan~~~~~i~g~~---~~~~~~~~~e~~~Ln~~gin~i~~~~~~~G~~~wG~rT~~~~~s~~~ 551 (663) T protein:vir:10 475 IAGLCAYTDQVSHPWMSPAGYRRGQIRNCI---KLAIEPKQSMRDTMYQVAINPVTGFAGGDGFVLFGDKMATQVPSPFD 551 (663) T ss_pred HHHHHHHhhccCCceEccCCceeccccccc---cceeecChhHHHHHhhCCceEEEEEeCCCcEEEEcccccCCCCcccc Confidence 999999999999999999999987776643 4456678899999999999998 455 799999999998775 899 Q ss_pred eeehhhHHHHHHHHHHHHhHHhhcccCCHHHHHHHHHHHHHHHHHHhhcccccccceEEEec-CCCCHHHhhCCEEEEEE Q lcl|Aclame:pro 283 FQQSVRTAQIIKETIGAGLAWAVDMPLTPLRVKTMLEAINNKLRSWASGDDPRILGARVWVA-EEITADIIKSGKFVIKY 361 (393) Q Consensus 283 ~i~~rR~~~~i~~~i~~~~~~~v~epn~~~~~~~i~~~i~~~l~~l~~~g~~~~~~~~v~~~-~~nt~~~i~~G~~~~~v 361 (393) ||++|||++||+++|+++++|+||||||+.+|.+|+++++.||++||++|+ +.+|.|.|| ++||+++|++|+|+++| T Consensus 552 ~i~vrR~~~~i~~si~~~~~~~v~epn~~~l~~~i~~~i~~~L~~l~~~ga--l~g~~v~~d~~~nt~~~i~~G~~~~~i 629 (663) T protein:vir:10 552 RINVRRLFNMLKKNIGDTSKYELFENNDAFTRQSFRMETSQYLDGIRSLGG--CYDFRVVCDTTNNTPNVIDRNEFVGTI 629 (663) T ss_pred eEehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCc--eeeeEEEEcCCCCCHHHhhCCeEEEEE Confidence 999999999999999999999999999999999999999999999999886 455777665 57999999999999999 Q ss_pred EEEecCcceeEEEEEEEcc--hHHHHHHHHHhcC Q lcl|Aclame:pro 362 DYHWIPSLESLGLEQRVND--EYVVDLVNTLKAL 393 (393) Q Consensus 362 ~~~p~~p~e~i~~~~~~~~--~~~~~~~~~~~~~ 393 (393) +++|++|+|||+|++.+.. ..++|+++++++- T Consensus 630 ~~~p~~pae~i~~~~~~~~~~~~~~e~~~~~~~~ 663 (663) T protein:vir:10 630 YVKPPRSINYITLNMVATSTGANFDELIGPMQLA 663 (663) T ss_pred EEEecCCcceEEEEEEEeecCccHHHHHHHHhcC Confidence 9999999999999998854 4699999999999 No 28 >protein:vir:100539 Length: 663 # NCBI annotation: gp18 tail sheath monomer # Family: family:all:661 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656380;genbank:gi:109290131;genbank:GeneID:4156517 Probab=100.00 E-value=5.2e-84 Score=477.16 Aligned_cols=379 Identities=13% Similarity=0.092 Sum_probs=294.5 Q ss_pred CcccCCCeEEEEcCCCcccccccccceeEEEEeecccccccccccceEEeecchhhhhhcccc---cchhhhhhhhhccc Q lcl|Aclame:pro 4 LDTYLHGVEVVEVNAGGVTISTAATSVIGVVCTGDQADAETFPLNTPVLITNPLNYLEKAGST---GTLRRTLNSIGSIV 80 (393) Q Consensus 4 ~~~~~~GV~v~ev~~~~~~i~~v~tav~g~vg~a~~~d~~~~p~~~~vl~t~~~~~~~~~g~~---~tl~~~~~~~~~~~ 80 (393) |++..|||||+|++ ++++|..+.|++.+|+|.++.. |+++|++++|+.+++..||.. ..+..++..+|.++ T Consensus 1 ~~~~~Pgvyv~e~~-~~~~~~~v~t~~~~fvG~~~~g-----p~~~p~~i~s~~~~~~~fG~~~~~~~~~~~v~~~f~ng 74 (663) T protein:vir:10 1 MALLSPGIEMKETS-INSTVVRSATGRAALVGKFAWG-----PAYEIRQVTNEVELVDMFGSPDNVTAPYFMSAMNFLQY 74 (663) T ss_pred CccccCceEEEEec-CcccccccccccceeeeccccC-----CCCcCEEecCHHHHHHHcCCcccccchHHHHHHHHHhC Confidence 66667999999995 8889999999999999988766 889999999999999999964 45677888889999 Q ss_pred CceEEEEEeccccccccccc------------------------------------------------------------ Q lcl|Aclame:pro 81 KTPTVIVRVAESDDSDTLTA------------------------------------------------------------ 100 (393) Q Consensus 81 ~~~~~vv~~~~~~~~~~~~~------------------------------------------------------------ 100 (393) +..|+++|+........... T Consensus 75 g~~~~vvRv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~a~~~~~a 154 (663) T protein:vir:10 75 GNDLRLVRVIDMEQAKNASPLFNQIEVTITTEGQGYTVGDTVSIKHNTTTVTEEGKVTKVDADGKIKALFVPSSAVIAKA 154 (663) T ss_pred CCeEEEEecCCcccccccccccccceeeEeecccCccccceeeecccccccccCcceeeeccCCceeEEEeccccccccc Confidence 99998888753211100000 Q ss_pred ----c--------------hh----------cc-c--------------cccc--------------------------- Q lcl|Aclame:pro 101 ----N--------------IV----------GT-Q--------------ENGK--------------------------- 110 (393) Q Consensus 101 ----~--------------~~----------~~-~--------------~~~~--------------------------- 110 (393) . .. +. . +... T Consensus 155 ~~~~~~~~~~~a~~~~v~~~~~~~~~a~av~~i~~dg~vt~~~~~~a~~~~~~~~~~~~~~~~~~~~~~a~~~g~~G~~i 234 (663) T protein:vir:10 155 KQLGTYPVLGDNWRAEVSGASGGSAATLTLGGIVVDSGVTFGNSEEAPDVMTSTKVLANFAKYGMPLISAVYPGEIGSTV 234 (663) T ss_pred cccccccccccceeeEEeeccccccccceeEeeecCCceeEEeeeccccccccceeeeeccccccceeeeecccccCcce Confidence 0 00 00 0 0000 Q ss_pred ------cc---------------------------c----------------------h-----------------hhhh Q lcl|Aclame:pro 111 ------FT---------------------------G----------------------I-----------------KALL 118 (393) Q Consensus 111 ------~~---------------------------g----------------------l-----------------~al~ 118 (393) .. + + ..+. T Consensus 235 ~v~~~~~~~~~~~~~~~v~~~~g~~~~~~~~~~~~g~~~~~~~~~~~~~~g~~~e~~~ls~~~~~~~~~~~~~~~~~~~~ 314 (663) T protein:vir:10 235 EVEVISKTAFQSGAAQPIYPFGGTRASNARSVIQYGPMTDDQFAIIVRRDGIVVESTVLSTRRGDRDVYGNNIFMDDYFR 314 (663) T ss_pred eEeecccccccccceeeecccCcccccccccccccccccchhhcccccCCCcccceeeeeccccccccchhhhhhhhhhc Confidence 00 0 0 0000 Q ss_pred hhhhhh----------------h----h-------------------------ccccccccccc-----hHHHHHHHHHh Q lcl|Aclame:pro 119 TAQSTV----------------F----V-------------------------KPKLLCVPQHD-----NQAVATELLSV 148 (393) Q Consensus 119 ~~~~~~----------------~----~-------------------------~~~~l~apg~s-----~~~v~~al~~~ 148 (393) .....+ . . ...+++.|+.. ..+|+++|+++ T Consensus 315 ~~~s~~v~~~~~~~~~~~~~~~~l~gg~~~~~~~~~~d~~~~~~~l~~~~~~~~~~~i~~~~~~~~~~~~~~v~~~l~~~ 394 (663) T protein:vir:10 315 NGSSNFIYASSVNWPAGFTGIIQLGGGASANNAVGSDELIAGWDLFADREALHVNLMIAGACKSDGVAVASTVQKHVVAL 394 (663) T ss_pred CcccceeEeeccccCcccceeEEecccccCcccchhhhhhhHHhhhccccccCceEEEeecCCCCchhhHHHHHHHHHHH Confidence 000000 0 0 00001111111 14588999999 Q ss_pred hcccceEEEEecCCCCc---------chhhh-------------hhhcccccceEEEeccceeEeeccCCceEEechhHH Q lcl|Aclame:pro 149 AKKLNAFAFISDNGATT---------KEQAY-------------TYRQNFSQREGMMIFGDWKSYNTDKKAYDTDYAVAR 206 (393) Q Consensus 149 a~~~~~~~~i~~~~~~~---------~~~a~-------------~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~S~~ 206 (393) |+++++++++.|+|... .+++. +++.+++|.++++||||++++++.+++.+++|||++ T Consensus 395 ~~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~l~~p~~~~~d~~~~~~~~~p~s~~ 474 (663) T protein:vir:10 395 ADDRQDCVAFVNPPSELLVGVPTTQAVKNIVEWRNGVTTGGEVVDNNMNISSTYAFISGNYKYQYDKYNDINRWVPLSAD 474 (663) T ss_pred HHhhCCEEEEEecCcccccccchhhhHHHHHHHhhhccccchhhhhhcccCcceEEEEecceeEecccCCceEEechHHH Confidence 99999888888877432 12333 344578999999999999999999999999999999 Q ss_pred HHHHHHhhhccCCceecCCCceecceeeceeecccccCCCchhhhhhcccceEEE--EeC-CCEEEEecccCCCCc-ccc Q lcl|Aclame:pro 207 ACALQAYIDKTVGWHKNISNVELDGVTGITKAVEFDINESSTEANYLNEKGITIC--LNH-NGFRYWGSRTLATDT-RWA 282 (393) Q Consensus 207 ~ag~~a~~D~~~G~~~spaN~~l~gv~~~~~~~~~~~~~~~~~~~~ln~~gi~~~--~~~-~G~~~wG~rT~~~d~-~~~ 282 (393) +||++||+|.++||||||||+++.+|.++. +....+++.|++.||++|||++ +++ +||++||+||+++++ +|+ T Consensus 475 vAGl~Ar~D~~~g~~~span~~~~~i~g~~---~~~~~~~~~~~~~Ln~~gin~i~~~~~~~G~~~wG~rT~s~~~s~~~ 551 (663) T protein:vir:10 475 IAGLCAYTDQVGHPWMSPAGYRRGQLRNTI---KLAIEPKQSLRDTMYQVSINPVTGFAGGDGFVLFGDKMATQVPSPFD 551 (663) T ss_pred HHHHHHHhhccCCcEEccCCeeecceeccc---cceeecCchhHHHHHhCCCcEEEEeeCCCcEEEEcccccCCCCcccc Confidence 999999999999999999999988777753 3445577889999999999988 455 799999999998764 899 Q ss_pred eeehhhHHHHHHHHHHHHhHHhhcccCCHHHHHHHHHHHHHHHHHHhhcccccccceEEEec-CCCCHHHhhCCEEEEEE Q lcl|Aclame:pro 283 FQQSVRTAQIIKETIGAGLAWAVDMPLTPLRVKTMLEAINNKLRSWASGDDPRILGARVWVA-EEITADIIKSGKFVIKY 361 (393) Q Consensus 283 ~i~~rR~~~~i~~~i~~~~~~~v~epn~~~~~~~i~~~i~~~l~~l~~~g~~~~~~~~v~~~-~~nt~~~i~~G~~~~~v 361 (393) ||++|||+++|+++|+++++|+||||||+.||++|++++++||++||++|+ +.||.|+|| ++||+++|++|+|+++| T Consensus 552 ~i~vrR~~~~i~~si~~~~~~~v~epn~~~l~~~i~~~i~~~L~~l~~~ga--l~gf~V~~d~~~nt~~~i~~G~~~~~i 629 (663) T protein:vir:10 552 RINVRRLFNMLKKNIGDTSKYELFENNDAFTRQSFRMEVSQYLDNIRSLGG--VYDFRVVCDTTNNTPQVIDSNEFVATI 629 (663) T ss_pred eEehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCc--eeeeEEEEcCCCCCHHHhhCCeEEEEE Confidence 999999999999999999999999999999999999999999999999885 455777665 57999999999999999 Q ss_pred EEEecCcceeEEEEEEEcch--HHHHHHHHHhcC Q lcl|Aclame:pro 362 DYHWIPSLESLGLEQRVNDE--YVVDLVNTLKAL 393 (393) Q Consensus 362 ~~~p~~p~e~i~~~~~~~~~--~~~~~~~~~~~~ 393 (393) +++|++|+|||+|++.+... .++|+++++++- T Consensus 630 ~~~p~~pae~I~~~~~~~~~~~~f~e~~~~~~~~ 663 (663) T protein:vir:10 630 YIKAPRSINYITLNFVATSTGANFDELIGPAQLA 663 (663) T ss_pred EEEecCCcceEEEEEEEEecCccHHHHHHHHhcC Confidence 99999999999999988654 588999999999 No 29 >protein:vir:5663 Length: 671 # NCBI annotation: tail sheath protein # Family: family:all:661 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899602;genbank:gi:34419589;genbank:GeneID:2545776 Probab=100.00 E-value=1.1e-83 Score=475.42 Aligned_cols=376 Identities=13% Similarity=0.118 Sum_probs=286.6 Q ss_pred CcccCCCeEEEEcCCCcccccccccceeEEEEeecccccccccccceEEeecchhhhhhccc---ccchhhhhhhhhccc Q lcl|Aclame:pro 4 LDTYLHGVEVVEVNAGGVTISTAATSVIGVVCTGDQADAETFPLNTPVLITNPLNYLEKAGS---TGTLRRTLNSIGSIV 80 (393) Q Consensus 4 ~~~~~~GV~v~ev~~~~~~i~~v~tav~g~vg~a~~~d~~~~p~~~~vl~t~~~~~~~~~g~---~~tl~~~~~~~~~~~ 80 (393) |++-.|||||+|++ ++++|..+.|++.+|+|+++.. |+++|++++++.+++..||. ...+..++..+|.++ T Consensus 1 ~~~~~Pgvyv~e~~-~~~~i~~v~t~~~~fvG~~~~G-----p~~~p~~v~s~~~~~~~fG~~~~~~~~~~~v~~~f~ng 74 (671) T protein:vir:56 1 MTLLSPGIENKEIN-LASAIGRAATGRAAMVGKFEWG-----PAYSITQVTSESDLVTIFGRPNDYTAASFMTANNFLKY 74 (671) T ss_pred CceecCceEEEeec-CcccccccCcccceEEecccCC-----CCccCEEcCCHHHHHHHcCCcCCCcchhHHHHHHHHhc Confidence 66666999999995 8999999999999999988876 88999999999999999996 345778888899999 Q ss_pred CceEEEEEecccccccccc------------------------------------------------------------- Q lcl|Aclame:pro 81 KTPTVIVRVAESDDSDTLT------------------------------------------------------------- 99 (393) Q Consensus 81 ~~~~~vv~~~~~~~~~~~~------------------------------------------------------------- 99 (393) +..++++++.......... T Consensus 75 g~~~~vvrv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~v~ 154 (671) T protein:vir:56 75 GNDLRLVRICDATTAQNATPLYNAVEYTIGASNGCVVGDDITITYSGVGALTAKGKVLEVDAGNNNAASKIFLPSAEIVA 154 (671) T ss_pred CCeEEEEEecCccccccchhhccccccccccCcceeeceeeeeecCcccccccCcceeEEeeeccceeeeeeccceeEEE Confidence 9999888864422100000 Q ss_pred -----------cc--h-----------hcccccc-----------------ccc-------------c------------ Q lcl|Aclame:pro 100 -----------AN--I-----------VGTQENG-----------------KFT-------------G------------ 113 (393) Q Consensus 100 -----------~~--~-----------~~~~~~~-----------------~~~-------------g------------ 113 (393) .. . ....+.. ... + T Consensus 155 ~~~~~~~~~~~~~~t~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~ 234 (671) T protein:vir:56 155 AAKSDGNYPSVGTITLQPTQGDIALTNIEIIDTGSVYFPNIELAFDALTAIETEGGALKYADLIEKQGFPRLSARYVGDF 234 (671) T ss_pred eeeccccccccccccccccccceeeeeecccccceEEEeccccccccccccccccccccchhhhhccccccccccccccc Confidence 00 0 0000000 000 0 Q ss_pred -------------------------------------------------------------------------------- Q lcl|Aclame:pro 114 -------------------------------------------------------------------------------- 113 (393) Q Consensus 114 -------------------------------------------------------------------------------- 113 (393) T Consensus 235 g~~~~v~v~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~~~~~~~~~~ 314 (671) T protein:vir:56 235 GDAISVEIINYADYQTAFAFAAGHTLGDIELPIYPDGGTRSINLSSYFTFGPSNSNQYAVIVRVSGEVEEAFIVSTNPGD 314 (671) T ss_pred CcceEEEEecccccccccccccceeeeeccccccccccccccccceeecccccccccceeEEeecCccceeEEEeecccc Confidence Q ss_pred ---------------------------------------------------hhhhhhhhhhhhhccccccccccchH--- Q lcl|Aclame:pro 114 ---------------------------------------------------IKALLTAQSTVFVKPKLLCVPQHDNQ--- 139 (393) Q Consensus 114 ---------------------------------------------------l~al~~~~~~~~~~~~~l~apg~s~~--- 139 (393) ..+++..+.+..+.+.++++|+.++. T Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~d~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~ 394 (671) T protein:vir:56 315 KDVNGQSIFIDEYFENSGSAYITAIAEGWKTESGAYNFGGGSDANAGADDWMFGLDMLSDPEVLYTNLVIAGNAAAEEVS 394 (671) T ss_pred cccchhhhhhhhhhcccCceEEEecCcccCCccccccccCccccccchhHHHHHHHhhhhccccceeEEEcCCCCCccch Confidence 00000000001111223333333221 Q ss_pred ----HHHHHHHHhhcccceEEEEecCCC---------Ccchhhhhhhc--------------ccccceEEEeccceeEee Q lcl|Aclame:pro 140 ----AVATELLSVAKKLNAFAFISDNGA---------TTKEQAYTYRQ--------------NFSQREGMMIFGDWKSYN 192 (393) Q Consensus 140 ----~v~~al~~~a~~~~~~~~i~~~~~---------~~~~~a~~~~~--------------~~~s~~~~~~~p~~~~~~ 192 (393) ....++.++|+..++++++.++|. .+.+++.+|+. +++|.++++||||+++++ T Consensus 395 ~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d 474 (671) T protein:vir:56 395 IASTVQKYAIDSVGNVRQDCVVFVSPPQSLIVNKQAGTAVANIQGWRTGIDPTNGQAVVDNLNVSTTYAVIDGNYKYQYD 474 (671) T ss_pred hHHHHHHHHHHHHHhhcCCEEEEEeccccccccccccccHHHHHHHhhhccccchhhhhhhccCCcceEEEecCceEEec Confidence 123446677777777777777662 34455666653 567899999999999999 Q ss_pred ccCCceEEechhHHHHHHHHhhhccCCceecCCCceecceeeceeecccccCCCchhhhhhcccceEEEE--eCCCEEEE Q lcl|Aclame:pro 193 TDKKAYDTDYAVARACALQAYIDKTVGWHKNISNVELDGVTGITKAVEFDINESSTEANYLNEKGITICL--NHNGFRYW 270 (393) Q Consensus 193 ~~~~~~~~~p~S~~~ag~~a~~D~~~G~~~spaN~~l~gv~~~~~~~~~~~~~~~~~~~~ln~~gi~~~~--~~~G~~~w 270 (393) +.++..+++|||+++||++||+|.++||||||||+.+.++.+... .+..+++.|++.||++|||+++ +++|+++| T Consensus 475 ~~~~~~~~~p~s~~~AGl~Ar~D~~~g~~~span~~~~~i~g~~~---~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~w 551 (671) T protein:vir:56 475 KYNDRNRWVPLAGDIAGLCAYTDQVSQPWMSPAGFNRGQIKGVNR---LAVDLRRAHRDALYQIGINPVVGFAGQGFVLY 551 (671) T ss_pred ccCCceeEechHHHHHHHHHHhhccCCcEECcCCceecccccccc---ceeecChhHHHHHhhCCceEEEEecCCeEEEE Confidence 999999999999999999999999999999999998877766543 2344567899999999999996 46899999 Q ss_pred ecccCCCC-cccceeehhhHHHHHHHHHHHHhHHhhcccCCHHHHHHHHHHHHHHHHHHhhcccccccceEEEec-CCCC Q lcl|Aclame:pro 271 GSRTLATD-TRWAFQQSVRTAQIIKETIGAGLAWAVDMPLTPLRVKTMLEAINNKLRSWASGDDPRILGARVWVA-EEIT 348 (393) Q Consensus 271 G~rT~~~d-~~~~~i~~rR~~~~i~~~i~~~~~~~v~epn~~~~~~~i~~~i~~~l~~l~~~g~~~~~~~~v~~~-~~nt 348 (393) |+||++++ ++|+||++|||++||+++|+++++|+|||||++.||++|++++++||++||++|+ +.||.|+|+ ++|| T Consensus 552 G~rT~~~~~~~~~~i~vrR~~~~i~~si~~~~~~~v~epn~~~~~~~i~~~i~~fL~~l~~~ga--l~g~~v~~d~~~nt 629 (671) T protein:vir:56 552 GDKTATQQASAFDRINVRRLFNLLKKAISDAAKYRLFELNDEFTRSSFKSEIDAYLTNIQDLGG--VYDFRVVCDETNNP 629 (671) T ss_pred cceecCCCCcccceEehhhHHHHHHHHHHHHHHHhcCCCCCHHHHHHHHHHHHHHHHHHHhCCc--eeeeEEEEcCCCCC Confidence 99999876 5899999999999999999999999999999999999999999999999999886 456777666 5699 Q ss_pred HHHhhCCEEEEEEEEEecCcceeEEEEEEEcchHHHHHHHHHhc Q lcl|Aclame:pro 349 ADIIKSGKFVIKYDYHWIPSLESLGLEQRVNDEYVVDLVNTLKA 392 (393) Q Consensus 349 ~~~i~~G~~~~~v~~~p~~p~e~i~~~~~~~~~~~~~~~~~~~~ 392 (393) +++|++|+|+++|+++|++|+|||+|++.+.....+ |+++.- T Consensus 630 ~~~i~~G~~~~~i~~~p~~Pae~I~~~~~~~~~~~~--f~e~~~ 671 (671) T protein:vir:56 630 GSVIDRNEFVASIYVKPAKSINFITLNFVATSTDAD--FAEIIG 671 (671) T ss_pred HHHhhCCeEEEEEEEEecCCcceEEEEEEEeecCcc--hhhhcC Confidence 999999999999999999999999999998766544 555444 No 30 >protein:vir:104477 Length: 749 # NCBI annotation: gp18 # Family: family:all:661 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214663;genbank:gi:61806304;genbank:GeneID:3294532 Probab=100.00 E-value=1.5e-82 Score=469.17 Aligned_cols=375 Identities=11% Similarity=0.056 Sum_probs=288.4 Q ss_pred CCcccC-CCeEEEEcCCCcccccccccceeEEEEeecccccccccccceEEeecchhhhhhcccc---cchhhhhhhhhc Q lcl|Aclame:pro 3 ILDTYL-HGVEVVEVNAGGVTISTAATSVIGVVCTGDQADAETFPLNTPVLITNPLNYLEKAGST---GTLRRTLNSIGS 78 (393) Q Consensus 3 m~~~~~-~GV~v~ev~~~~~~i~~v~tav~g~vg~a~~~d~~~~p~~~~vl~t~~~~~~~~~g~~---~tl~~~~~~~~~ 78 (393) |+.+|+ |||||+|++.+ ++|..+.|++.+|+|.++.. |+++|++++|+.++...||.. ..+..++..+|. T Consensus 1 M~~~~~~PgVyv~e~~~~-~~~~~~~t~~~~fvG~~~~G-----p~~~p~~v~s~~~~~~~fG~~~~~~~~~~~v~~~F~ 74 (749) T protein:vir:10 1 MATNQSSPGVVIQERDLT-TVSTIPTANVGVIAAPFTKG-----PVEEVIEITSERQLAEKFGEPNESNYEYWFSAAQFL 74 (749) T ss_pred CCccccCCeeEEEEecCC-cccccccCceeEEEeccCCC-----CCccCEEcCCHHHHHHHcCCccCCcccHHHHHHHHh Confidence 666665 99999999876 56888999999999988766 889999999999999999964 447788899999 Q ss_pred ccCceEEEEEeccccccccc------------------------------------------------------------ Q lcl|Aclame:pro 79 IVKTPTVIVRVAESDDSDTL------------------------------------------------------------ 98 (393) Q Consensus 79 ~~~~~~~vv~~~~~~~~~~~------------------------------------------------------------ 98 (393) +++..++++|+........+ T Consensus 75 ngg~~~~vvRv~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~pG~~gn~l~v~v~~~~~~~~~~~~~~~~~ 154 (749) T protein:vir:10 75 SYGGLLKTIRVNSSSLKNAVDTGTAPLVKNLQDYETSIEDASNNFSWVARTPGDTGNSIGIFVTDAGADQVVVVPAPGSG 154 (749) T ss_pred hcCCeEEEEEccCccccccccccccccccccccccccccccccceEEEeccCCCcCCceEEEEEcCCCceeeeeecCCcc Confidence 99999988886321100000 Q ss_pred -------------------c-----------------------------cc---hhccccc---------------cc-- Q lcl|Aclame:pro 99 -------------------T-----------------------------AN---IVGTQEN---------------GK-- 110 (393) Q Consensus 99 -------------------~-----------------------------~~---~~~~~~~---------------~~-- 110 (393) . .. .....+. +. T Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~a~~~~~~~~~~~~~~~~~~s~~~~~~~a 234 (749) T protein:vir:10 155 NEHEFVADAAVSAASGAAGKVFKYSIILTIDDVVGTFAPGSATTITIGGSAESVNVLAYDATNKKLEIGLPSGGVTGILA 234 (749) T ss_pred ceeeEEeeecccccccccccccccceeeeeccccceeecccceeeeccCcccccccccccCCcceEEEeeecccccceee Confidence 0 00 0000000 00 Q ss_pred -------------------------------------------------------------ccch--------------- Q lcl|Aclame:pro 111 -------------------------------------------------------------FTGI--------------- 114 (393) Q Consensus 111 -------------------------------------------------------------~~gl--------------- 114 (393) .++. T Consensus 235 ~~~~v~~~~~~~~~~~~i~~~~~~~~~~~~~~~a~~~~~~~~~g~~~~it~v~~~~~~~~~~t~~~~~~~a~~~gt~~~~ 314 (749) T protein:vir:10 235 DNQVITQGTNTAKINVTIERKLLVALNKSSIEFAASDVVQDTNSTNITITSVRDEYTEREYLPGVKWINVAPRPGTSLYA 314 (749) T ss_pred eeecccccccccccccccccchhhhhccccceeeccccccCCccceeEEEeeeccccccccccceeeccccccccceeee Confidence 0000 Q ss_pred -----------------------------hh---hhhh-hhh------------h------------------------- Q lcl|Aclame:pro 115 -----------------------------KA---LLTA-QST------------V------------------------- 124 (393) Q Consensus 115 -----------------------------~a---l~~~-~~~------------~------------------------- 124 (393) +. +... ..+ + T Consensus 315 ~~~~g~~D~~~v~v~~~~g~~~~~~g~v~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~v~~~~~~~~~~~~~~~~~~ 394 (749) T protein:vir:10 315 NGVGGHRDEMHVILVDIDGGVTGTVGALLERYIDVSKASDAKTSVGETNYYAEVIKQKSEFIYWAEHESTLYAATSSASD 394 (749) T ss_pred ecccCCCCceEEEEecCCCeeeecccceeeeeeeccccccccccccccchhhhhhccCCCEEEEEecccccccccccccc Confidence 00 0000 000 0 Q ss_pred ----------------------------------------------------------------------------hhcc Q lcl|Aclame:pro 125 ----------------------------------------------------------------------------FVKP 128 (393) Q Consensus 125 ----------------------------------------------------------------------------~~~~ 128 (393) .+.+ T Consensus 395 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~d~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~ 474 (749) T protein:vir:10 395 GLFGQTAANRQFNLFRSAAGSVDYPAGVTTLGSKNNATYYYRLSGGVNYTVSAGQYTITNTDIGSAYELIGDPESQIVDF 474 (749) T ss_pred cccccccccceeeccccccccceeccccccccccCCcEEEEEccCCcccccccccccccchhHHHHHHHhhhhhhcccce Confidence 0000 Q ss_pred ccccccccc---hHHHHHHHHHhhcccceEEEEecCCCCc----------chhhhhhhc-ccccceEEEeccceeEeecc Q lcl|Aclame:pro 129 KLLCVPQHD---NQAVATELLSVAKKLNAFAFISDNGATT----------KEQAYTYRQ-NFSQREGMMIFGDWKSYNTD 194 (393) Q Consensus 129 ~~l~apg~s---~~~v~~al~~~a~~~~~~~~i~~~~~~~----------~~~a~~~~~-~~~s~~~~~~~p~~~~~~~~ 194 (393) .++..|+++ ..+++.+|+++|++++++++++|+|..+ ..++..++. ..+|.++++||||++++|+. T Consensus 475 li~~~~~~~~~~~~~v~~al~~~~~~~~~~~~~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~ 554 (749) T protein:vir:10 475 IISGPSGTSDANALAKITSLVNIAEERRDCMVFVSPRRGNVIGISNTTTITTNIVDFFKKLPSSSYMVFDSGYKYIYDKY 554 (749) T ss_pred EEEecCCCCcchhHHHHHHHHHHHhhcCCEEEEEcCCCCcccccccchhhhhHHHHHHhhccCceeEEEEccceeeeccc Confidence 011112221 2358899999999999999999887532 234555554 45688999999999999999 Q ss_pred CCceEEechhHHHHHHHHhhhccCCceecCCCceecceeeceeecccccCCCchhhhhhcccceEEEE--eCCCEEEEec Q lcl|Aclame:pro 195 KKAYDTDYAVARACALQAYIDKTVGWHKNISNVELDGVTGITKAVEFDINESSTEANYLNEKGITICL--NHNGFRYWGS 272 (393) Q Consensus 195 ~~~~~~~p~S~~~ag~~a~~D~~~G~~~spaN~~l~gv~~~~~~~~~~~~~~~~~~~~ln~~gi~~~~--~~~G~~~wG~ 272 (393) ++..+++|||+++||++||+|.++||||||||+++.++.+.. +.+..+++.|++.||++|||+++ +++|+++||+ T Consensus 555 ~~~~~~~p~s~~vAGl~Ar~D~~~g~~~SPan~~~~~i~g~~---~~~~~~~~~e~~~Ln~~gIn~i~~~~g~G~~~wG~ 631 (749) T protein:vir:10 555 NDVYRYIPCNGDTAGLCLQTNEISEPWFSPAGFQRGVLRNAI---KLAYTPNKAQRDQLYANRVNPIVSFPGQGVVLYGD 631 (749) T ss_pred cCceEEechHHHHHHHHHHhhccCCcEECcCCceeeeeeccc---cceeecChhHHHhhhhCCceEEEEecCCeEEEEcc Confidence 999999999999999999999999999999999877666643 33456778899999999999996 5789999999 Q ss_pred ccC-CCCcccceeehhhHHHHHHHHHHHHhHHhhcccCCHHHHHHHHHHHHHHHHHHhhcccccccceEEEec-CCCCHH Q lcl|Aclame:pro 273 RTL-ATDTRWAFQQSVRTAQIIKETIGAGLAWAVDMPLTPLRVKTMLEAINNKLRSWASGDDPRILGARVWVA-EEITAD 350 (393) Q Consensus 273 rT~-~~d~~~~~i~~rR~~~~i~~~i~~~~~~~v~epn~~~~~~~i~~~i~~~l~~l~~~g~~~~~~~~v~~~-~~nt~~ 350 (393) ||+ +.|++|+||++|||++||+++|+++++|+||||||+.||++|++++++||++||++|. +.+|.|+|| ++||++ T Consensus 632 rT~~s~d~~~~~i~vRRl~~~ie~si~~~~~~~v~epn~~~l~~~i~~~i~~fL~~l~~~G~--i~~f~V~~d~~~Nt~~ 709 (749) T protein:vir:10 632 KTALGFASAFDRINIRRLFLTVERVISTAAKAQLFEQNDEAQRSLFINIVEPYLRDVQGRRG--VVDFLVKCDSTNNTPE 709 (749) T ss_pred eecCCCCcccceeehhhhHHHHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHHHHHHHhcCC--eeeeEEEEcCCCCCHH Confidence 998 5689999999999999999999999999999999999999999999999999999885 566777666 569999 Q ss_pred HhhCCEEEEEEEEEecCcceeEEEEEEEcch--HHHHHHH Q lcl|Aclame:pro 351 IIKSGKFVIKYDYHWIPSLESLGLEQRVNDE--YVVDLVN 388 (393) Q Consensus 351 ~i~~G~~~~~v~~~p~~p~e~i~~~~~~~~~--~~~~~~~ 388 (393) +|++|+|+++|+++|++|+|||+|++.+... +++|+++ T Consensus 710 ~i~~G~~~~~i~~~P~~pae~I~~~~~~~~~~~~~~e~~s 749 (749) T protein:vir:10 710 AVDRGEFYAEVFLKPTRTINYVQLTFVATRTGVSFAEVAS 749 (749) T ss_pred HhhCCEEEEEEEEEecCCccEEEEEEEEeecCcchHHHhC Confidence 9999999999999999999999999987654 6666666 No 31 >protein:vir:98824 Length: 774 # NCBI annotation: putative phage tail sheath protein # Family: family:all:661 # MgeID: mge:1530 # MgeName: Ma-LMM01 # Cross-refs: genbank:acc:YP_851105;genbank:gi:117530262;genbank:GeneID:4484488 Probab=100.00 E-value=1.6e-80 Score=458.07 Aligned_cols=376 Identities=16% Similarity=0.123 Sum_probs=274.5 Q ss_pred CCCCc-cc-CCCeEEEEcCCCcccccc-cccceeEEEEeecccccccccccceEEeecchhhhhhccccc-chhh---hh Q lcl|Aclame:pro 1 MSILD-TY-LHGVEVVEVNAGGVTIST-AATSVIGVVCTGDQADAETFPLNTPVLITNPLNYLEKAGSTG-TLRR---TL 73 (393) Q Consensus 1 m~m~~-~~-~~GV~v~ev~~~~~~i~~-v~tav~g~vg~a~~~d~~~~p~~~~vl~t~~~~~~~~~g~~~-tl~~---~~ 73 (393) ...++ ++ .|||||+|++++++||.. +.|++.+|+|.++.. |.++|++++++.++..+|+... .+.. +. T Consensus 276 ~~~~~~~v~~~GVYVEEVpSGvrtIeGGV~TSVAAFVG~A~rG-----Pvn~PvlITS~aD~~~~Fg~~~GGl~GassA~ 350 (774) T protein:vir:98 276 FGEITRNVEDNGVVIQLEPALTGSISNRFSFYVTANDNTANRG-----FTTSPALVTTIPDPAIHFTSFQGGLDGPRSAF 350 (774) T ss_pred ccceEEEEecCceEEEEeCCCCccccccccceeeeecccccCC-----CCCcCEEEeehhHhhhhhccccCCccccceee Confidence 22222 33 299999999999999987 999999999987766 8899999999999776664211 0110 10 Q ss_pred hhhhcccCc--------------eEEEEEecccccc--------------c----------------------ccccc-- Q lcl|Aclame:pro 74 NSIGSIVKT--------------PTVIVRVAESDDS--------------D----------------------TLTAN-- 101 (393) Q Consensus 74 ~~~~~~~~~--------------~~~vv~~~~~~~~--------------~----------------------~~~~~-- 101 (393) ..++...+. ..+.+.+...... . +..+. T Consensus 351 r~~~~~sG~~~L~i~A~~pGawGN~ItV~I~~~t~~~~~l~v~~~~~s~f~~~~a~e~~tv~~~~~~~~~~v~e~~dn~~ 430 (774) T protein:vir:98 351 RDFYTFNGTPLLRLQAVSEGNWGNQVTVSIYPVNNSEFRLNVQDLNGSAFNPPLADEVYTVKLGDTNESGELNALLDSKF 430 (774) T ss_pred eeeeeecccceEEEEEeecCcCCCceEEEEEecCCceeEEEEEecCCccccccccceeEEEecccccccceeeeeeceee Confidence 000000000 0011111000000 0 00000 Q ss_pred hhcccccccc--------------------------------------------------cch----hhhhhhhhhhhhc Q lcl|Aclame:pro 102 IVGTQENGKF--------------------------------------------------TGI----KALLTAQSTVFVK 127 (393) Q Consensus 102 ~~~~~~~~~~--------------------------------------------------~gl----~al~~~~~~~~~~ 127 (393) +.+....... .+. ..+.......... T Consensus 431 i~~~~~~~~~~~in~vs~lv~~~~~~~a~~d~~~~~~~~~~~~~~~~~~~~v~v~lagG~Dg~~tt~~~igg~~~~~~~t 510 (774) T protein:vir:98 431 IRGFFLPKSIDSINYDAALVRQSPLRLAPPDESETDVENPAHVDFYGPNVLVDVTLENGYDGPPVTNDDYVSIIRTLENQ 510 (774) T ss_pred EeecccccccccccccccccccchhcccccccccccccccccccccCCcceEEEeecCCCCcccccchheeccccccccc Confidence 0000000000 000 0000000000000 Q ss_pred cccccccccchHHHHHHHHHhhccc----c-eEEEEecCCCCcchhhhhhhcccccceEEEeccceeEeeccCCceEEec Q lcl|Aclame:pro 128 PKLLCVPQHDNQAVATELLSVAKKL----N-AFAFISDNGATTKEQAYTYRQNFSQREGMMIFGDWKSYNTDKKAYDTDY 202 (393) Q Consensus 128 ~~~l~apg~s~~~v~~al~~~a~~~----~-~~~~i~~~~~~~~~~a~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p 202 (393) .-.++..+....+++.+|+.+|+++ + ++++++++++.+.+++++|+++++|.++++||||++++|+.+++.+++| T Consensus 511 gi~aLl~a~~~~~V~~aii~~~e~~~~~~~~r~avid~p~g~t~~~Ai~~r~~f~S~~aal~~Pwvkv~D~~~g~~~~vP 590 (774) T protein:vir:98 511 PVHILLVGTTNVGVQQALITEAERASDSDGLRIAVLAAPPRTTPTLAASVTRGFNSTRAVMVAGWFTYAGQPNSSRYGVP 590 (774) T ss_pred ceeEEEcCccchhhHHHHHHHHHHhhhcccceEEEEECCCCCCHHHHHHHHhccCCceEEEEeCcEEEeccCCCceeecC Confidence 0011222334567888888888865 3 4555666778888999999999999999999999999999999999999 Q ss_pred hhHHHHHHHHhhhccCCceecCCCceecceeeceeecccccCCCchhhhhhcccceEEE---EeCCCEEEEecccCCCCc Q lcl|Aclame:pro 203 AVARACALQAYIDKTVGWHKNISNVELDGVTGITKAVEFDINESSTEANYLNEKGITIC---LNHNGFRYWGSRTLATDT 279 (393) Q Consensus 203 ~S~~~ag~~a~~D~~~G~~~spaN~~l~gv~~~~~~~~~~~~~~~~~~~~ln~~gi~~~---~~~~G~~~wG~rT~~~d~ 279 (393) ||+++||++|++| +||||+|++|.|+.++..++......++.+++.|++++||++ ++++|+++||+||+++|| T Consensus 591 pSg~vAGl~ArtD----v~kSPANk~I~Givg~ai~~~l~~~~t~ae~d~Ln~~gIN~i~itt~g~G~rvWG~RTlssDp 666 (774) T protein:vir:98 591 GAAVYAGKLAAID----FFVSPAARSLVGPLFNIIESDTDNYTSRSNQDIYSAARLEVLSLDTVDRTYRFASGVTLSTDP 666 (774) T ss_pred hhHHHHHHHHhcC----cccccCCceeecceeccccccccccccchhhhhhcccccceeEEEEcCCcEEEEcccccCCCc Confidence 9999999999999 999999999999999988877777788899999999999986 458999999999999999 Q ss_pred ccceeehhhHHHHHHHHHHHHhHHhhcccCCHHHHHHHHHHHHHHHHHHhhcccccccceE-EEec-CCCCHHHhhCCEE Q lcl|Aclame:pro 280 RWAFQQSVRTAQIIKETIGAGLAWAVDMPLTPLRVKTMLEAINNKLRSWASGDDPRILGAR-VWVA-EEITADIIKSGKF 357 (393) Q Consensus 280 ~~~~i~~rR~~~~i~~~i~~~~~~~v~epn~~~~~~~i~~~i~~~l~~l~~~g~~~~~~~~-v~~~-~~nt~~~i~~G~~ 357 (393) +|+||++|||++||+++|++.++|+|||||++.+|.+|+++++.||++||++|.+ .|++ ++|| ++||++++++|+| T Consensus 667 ~wr~InVRRlfd~Ie~SI~~~~~~~VfEPNd~~l~~~I~~sI~~fL~~L~~~GaL--~G~~~V~~D~etNt~~dI~~G~l 744 (774) T protein:vir:98 667 AWERIYLRRVHDVVRQGAHAILRNYVAMPNSRLVRNQIAAALNAFMGELKRNGNI--VSFRPAIIDGSNNSTAAYFSREL 744 (774) T ss_pred ccceEeehhhHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCce--ecceEEEEcCCCCCHHHhhCCEE Confidence 9999999999999999999999999999999999999999999999999999865 3443 5554 6799999999999 Q ss_pred EEEEEEEecCcceeEEEEEEEcchHHHHHHHH Q lcl|Aclame:pro 358 VIKYDYHWIPSLESLGLEQRVNDEYVVDLVNT 389 (393) Q Consensus 358 ~~~v~~~p~~p~e~i~~~~~~~~~~~~~~~~~ 389 (393) +++|+++|++|+|||+|++++++++.. |++ T Consensus 745 ~i~I~vaP~~PAEfIilri~q~t~~~~--l~E 774 (774) T protein:vir:98 745 YVSLQFQPLYSADYIYVTISRDTETSP--LGE 774 (774) T ss_pred EEEEEEEecCCcceEEEEEEEeeccee--ccC Confidence 999999999999999999999888633 222 No 32 >protein:vir:5833 Length: 742 # NCBI annotation: similar to tail sheath protein # Family: family:all:661 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835619;genbank:gi:30044022 Probab=100.00 E-value=1e-75 Score=431.75 Aligned_cols=365 Identities=12% Similarity=0.036 Sum_probs=250.0 Q ss_pred CCCCcccCCCeEEEEcCCCccc-----------------ccccccceeEEEEeecccccccccccceEEeecchhhhhhc Q lcl|Aclame:pro 1 MSILDTYLHGVEVVEVNAGGVT-----------------ISTAATSVIGVVCTGDQADAETFPLNTPVLITNPLNYLEKA 63 (393) Q Consensus 1 m~m~~~~~~GV~v~ev~~~~~~-----------------i~~v~tav~g~vg~a~~~d~~~~p~~~~vl~t~~~~~~~~~ 63 (393) .....+-...|++....+.++| ...++....++.......+....+++.+.++.......... T Consensus 347 ~~~~~~~~~~v~~~~t~~~~~pp~~~~~~e~v~~ngG~~f~v~s~~~~g~~i~~~~as~~~s~ln~~~~V~Gt~aa~~~~ 426 (742) T protein:vir:58 347 FTVIVNELTNVSIPVTDSAIIPPMRFTRIEQITLSGGASFSVISNQPYGFNIQDSRHSYWLSPFKDDELIIGTELVLPAL 426 (742) T ss_pred eeeeccccccceeeccccccCCcccccccceeecccCcceEEEEecccCcceeccCcceEEeccCCceEEEeehhhcccc Confidence 0000011123333333333322 11111122222222222222334666666554332222211 Q ss_pred ccc---------cchhhhhhhhhcccCceEEEEEecccccccccccchhcccccccccchhhhhhhhhhhhhcccccccc Q lcl|Aclame:pro 64 GST---------GTLRRTLNSIGSIVKTPTVIVRVAESDDSDTLTANIVGTQENGKFTGIKALLTAQSTVFVKPKLLCVP 134 (393) Q Consensus 64 g~~---------~tl~~~~~~~~~~~~~~~~vv~~~~~~~~~~~~~~~~~~~~~~~~~gl~al~~~~~~~~~~~~~l~ap 134 (393) ... ..+..........++..+.+.......+ ...... ......+.++|+++++..+ .+.++++| T Consensus 427 d~~t~~~v~s~~~alp~~a~sv~laGG~dg~v~v~~~~~D-~iG~~~-~~d~~~adrTGL~ALlev~-----eVtILiAP 499 (742) T protein:vir:58 427 DVSTEFGVSSWEEALPEFSFLMPFQGGSDGYIRVDENEPD-TIGRVK-ITPALLANYERLLPLLTED-----QFDLVLTP 499 (742) T ss_pred ccchheeccccccccceeeEEEeecCCccccccccCCCcc-cccccc-cccccccchhHHHHhhhcC-----CCcEEEEc Confidence 100 1111111011111122111111111000 000000 0011234578999998765 36899999 Q ss_pred ccchHHHHHHHHHhhcccceE-EEEecCCCC--cchhhhhhhcccccceEEEeccceeEeeccCCceEEechhHHHHHHH Q lcl|Aclame:pro 135 QHDNQAVATELLSVAKKLNAF-AFISDNGAT--TKEQAYTYRQNFSQREGMMIFGDWKSYNTDKKAYDTDYAVARACALQ 211 (393) Q Consensus 135 g~s~~~v~~al~~~a~~~~~~-~~i~~~~~~--~~~~a~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~S~~~ag~~ 211 (393) |+++..++.++.++|+.+..+ +.+.|+|.. +.+++.++++.++|.++++||||++..+ ++..+++|||+++||++ T Consensus 500 G~t~~~v~aav~A~la~a~~Rl~vL~D~P~~~tt~~~A~a~r~~~nSsraaly~PwVkv~d--~~~~r~vPpSgaIAGL~ 577 (742) T protein:vir:58 500 YLTFADHAGTVNAFINRAENRFLYLFDIAGDDDTENLAISLAGYINSSFATTFFPWVRRLT--NKGMRTVPASLAAYRSI 577 (742) T ss_pred CCCchHHHHHHHHHHHhhcCCeEEEEecCCCCchHHHHHHHHhccCCceEEEEeceeeecc--CCcceeechHHHHHHHH Confidence 999988888888888876543 345455554 3467888999999999999999999775 45678999999999999 Q ss_pred HhhhccCCceecCCCceecceeeceeecccccCCCchhhhhhcccceEEEEe-CCCEEEEecccC-CCCcccceeehhhH Q lcl|Aclame:pro 212 AYIDKTVGWHKNISNVELDGVTGITKAVEFDINESSTEANYLNEKGITICLN-HNGFRYWGSRTL-ATDTRWAFQQSVRT 289 (393) Q Consensus 212 a~~D~~~G~~~spaN~~l~gv~~~~~~~~~~~~~~~~~~~~ln~~gi~~~~~-~~G~~~wG~rT~-~~d~~~~~i~~rR~ 289 (393) ||+|.++|+|+||+|+.+.+. . ...++|++.||++|||++++ ++||++||+||+ +.|++|+||++||| T Consensus 578 ARtD~erGvw~SPANrgii~~---~-------~~s~se~d~LN~~GINtIrsfG~G~rlWGnRTlassDs~wryInVRRl 647 (742) T protein:vir:58 578 RTTDPETGLAPVGARRGVVTG---E-------PVRQVDWEDLYNNRINPIVRVGNDVLLFGQKTMLNVNSALNRINVRRL 647 (742) T ss_pred HHhccCCceEecCCcceeeec---c-------ccchhhHHHHhhCCceEEEECCCcEEEEcceecCCCCcccceEeehhh Confidence 999999999999999865321 1 23567999999999999987 589999999998 56999999999999 Q ss_pred HHHHHHHHHHHhHHhhcccCCHHHHHHHHHHHHHHHHHHhhcccccccceEEEecCCCCHHHhhCCEEEEEEEEEecCcc Q lcl|Aclame:pro 290 AQIIKETIGAGLAWAVDMPLTPLRVKTMLEAINNKLRSWASGDDPRILGARVWVAEEITADIIKSGKFVIKYDYHWIPSL 369 (393) Q Consensus 290 ~~~i~~~i~~~~~~~v~epn~~~~~~~i~~~i~~~l~~l~~~g~~~~~~~~v~~~~~nt~~~i~~G~~~~~v~~~p~~p~ 369 (393) +++|+++|+++++|+||||||+.+|++|++++++||++||++|+ +.|++|+||++||+++|++|+|+++|+++|++|+ T Consensus 648 fd~Ie~SI~~a~q~~VfEPNd~~L~~sIk~sInafL~~L~aqGA--LlGfrV~lDetNTpeDI~~Gklvv~I~vAP~~PA 725 (742) T protein:vir:58 648 LIVMRNRISQILSSYLFENNTSENRLRAEALVRQYLESLRLRGA--VTDYEVAIDSVTTPTDIDNNTLRARVTVQPARSI 725 (742) T ss_pred HHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCc--eeeeEEEEcCCCCHHHhhCCEEEEEEEEEccCCc Confidence 99999999999999999999999999999999999999999885 5678899999999999999999999999999999 Q ss_pred eeEEEEEEEcchHHHHHHH Q lcl|Aclame:pro 370 ESLGLEQRVNDEYVVDLVN 388 (393) Q Consensus 370 e~i~~~~~~~~~~~~~~~~ 388 (393) |||+|++.+...+.+ |+ T Consensus 726 EfI~lrf~it~tga~--Fs 742 (742) T protein:vir:58 726 EYIDITFVITPTGVE--IT 742 (742) T ss_pred ceEEEEEEEEecccc--cC Confidence 999999988777655 33 No 33 >protein:vir:79798 Length: 717 # NCBI annotation: tail sheath subunit # Family: family:all:12069 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429630;genbank:gi:156564120;genbank:GeneID:5525563 Probab=100.00 E-value=4.7e-53 Score=307.45 Aligned_cols=346 Identities=11% Similarity=0.008 Sum_probs=207.0 Q ss_pred CCCCcccCCCeEEEEcCCCcccccccccceeEEEEeecccccccccccceEEeecchhhhhhcccccchhhhhhhhhccc Q lcl|Aclame:pro 1 MSILDTYLHGVEVVEVNAGGVTISTAATSVIGVVCTGDQADAETFPLNTPVLITNPLNYLEKAGSTGTLRRTLNSIGSIV 80 (393) Q Consensus 1 m~m~~~~~~GV~v~ev~~~~~~i~~v~tav~g~vg~a~~~d~~~~p~~~~vl~t~~~~~~~~~g~~~tl~~~~~~~~~~~ 80 (393) -++++. -+|+..+. |....+...+.+..-.+..+. .|.++.+...+..++...+-... +. ..+..+..+ T Consensus 328 ~t~~~~-~~g~~~~~------pl~~ts~dy~~~~~~vdgI~~--~~~~~V~~~g~~s~a~a~~~~g~-~s-~d~a~f~Gg 396 (717) T protein:vir:79 328 ITKPES-KRGMISED------PLVFKSGDYTNFKMLVDAINN--HPFNNVVRARTKPEFEATFTSTL-QA-AADAKFSGG 396 (717) T ss_pred Eecccc-cCcceecc------ccccccCceeeeeeeeccccc--Cchhheeeeecccccceeeeecc-cC-chhhccCCC Confidence 222221 13444222 333333333333332222211 14455555555544444332211 00 011111111 Q ss_pred CceEEEEEecccccccccccchhcccccccccchhhhhhhhhhhhhccccccccccc--------hHHHHHHHHHhhccc Q lcl|Aclame:pro 81 KTPTVIVRVAESDDSDTLTANIVGTQENGKFTGIKALLTAQSTVFVKPKLLCVPQHD--------NQAVATELLSVAKKL 152 (393) Q Consensus 81 ~~~~~vv~~~~~~~~~~~~~~~~~~~~~~~~~gl~al~~~~~~~~~~~~~l~apg~s--------~~~v~~al~~~a~~~ 152 (393) ........-+..... .+.....+ +. ...+.+..++.+ ...+++.|+.. .+.++.++.++|+.+ T Consensus 397 ~dgl~~~~ee~Y~~l-Ggk~~d~g--~l-t~~aays~LE~~-----dVDlVil~ga~adtt~ga~~d~va~alad~caal 467 (717) T protein:vir:79 397 KDELSLDKEEMYKRL-GGEKNEEG--FV-TKQGAYQYLENY-----EVDYVIPLGVHADTKLIGKYDDFAYQLALACAVM 467 (717) T ss_pred ccccccchhhhhccc-cccccccc--cc-cchhhhhhcCcc-----eeEEEEecCccccccccchhhhHHHHHHHHHHHh Confidence 111111111100000 00000000 00 011222222211 23444444421 235677788888654 Q ss_pred c-----eEEEE--ecCCCCcchhhhhhhccc--------------------------ccceEEEeccceeEeeccCCceE Q lcl|Aclame:pro 153 N-----AFAFI--SDNGATTKEQAYTYRQNF--------------------------SQREGMMIFGDWKSYNTDKKAYD 199 (393) Q Consensus 153 ~-----~~~~i--~~~~~~~~~~a~~~~~~~--------------------------~s~~~~~~~p~~~~~~~~~~~~~ 199 (393) . ...++ ..+++.......++++.+ -+.+..++++++..+.+..+... T Consensus 468 Sal~r~ai~VI~l~sp~D~~~AtVe~~~~kLs~~Aaa~~~~d~~~a~a~~~~~~~idis~y~~vv~~~~~iv~~~~~~~~ 547 (717) T protein:vir:79 468 SHYNSVTIGIIPTTTPSDISLAGVEEHVKKLENYANEFYMRDRFGNIIFDADRNKIDLGQFIEVVAGPDFIVRNTRLGQM 547 (717) T ss_pred hhccccceeeeccccccccchhhHHHHHHHHHhhhhhhhhhcchhccccccccccccccceeeeeecceeEEEcCCCcee Confidence 2 11111 122232222111111111 02233344444444455566677 Q ss_pred EechhHHHHHHHHhhhccCCceecCCCceecceeeceeecccccCCCchhhhhhcccceEEEE--eCCCEEEEecccCCC Q lcl|Aclame:pro 200 TDYAVARACALQAYIDKTVGWHKNISNVELDGVTGITKAVEFDINESSTEANYLNEKGITICL--NHNGFRYWGSRTLAT 277 (393) Q Consensus 200 ~~p~S~~~ag~~a~~D~~~G~~~spaN~~l~gv~~~~~~~~~~~~~~~~~~~~ln~~gi~~~~--~~~G~~~wG~rT~~~ 277 (393) ..||+|++||+ |.++|+|+||+|++|.|+.++... +++.|++.||++||+++. +++|+++||+||+++ T Consensus 548 ~~p~AG~vAGl----dA~rGVwkSPANk~I~GVvgLa~~------lT~sE~d~Ln~aGIntIr~~~GrGirVWGaRTtas 617 (717) T protein:vir:79 548 ASTPDASYIGM----VSQLKTQSAPTNKPLPSVTALRYT------YSANQLNRLTKARFATFKYKQDGSIGVVDAPTSAH 617 (717) T ss_pred ecCHHHHHHHH----HhcCCcccccccceecccccCccc------CCHHHHHHHhhCCeEEEEEeCCceEEEEeeeecCC Confidence 77887666665 556799999999999999998765 467899999999999995 578999999999987 Q ss_pred Cc-ccceeehhhHHHHHHHHHHHHhHHhhcccCCHHHHHHHHHHHHHHHHHHhhcccccccceEEEecCCCCHHHhhCCE Q lcl|Aclame:pro 278 DT-RWAFQQSVRTAQIIKETIGAGLAWAVDMPLTPLRVKTMLEAINNKLRSWASGDDPRILGARVWVAEEITADIIKSGK 356 (393) Q Consensus 278 d~-~~~~i~~rR~~~~i~~~i~~~~~~~v~epn~~~~~~~i~~~i~~~l~~l~~~g~~~~~~~~v~~~~~nt~~~i~~G~ 356 (393) ++ .|+||++||++++|+++|+++++|+|||||++.+|.+|++++++||++||++|. +.++.+. .+||++++++|+ T Consensus 618 d~sdWryInVRRl~D~Ie~sIr~al~~yVgEPNd~~tr~~Ik~sI~afL~~L~r~GA--I~Gykvd--vtnT~~di~~G~ 693 (717) T protein:vir:79 618 AGSDYTRLSTARIVKEAVNAVREVADPFIGEPNDTGNRNALTAAVDKRLSKMIENKA--LLGFDFR--LVVTPQQELLGE 693 (717) T ss_pred CCcccceeehhhhHHHHHHHHHHHHHHhccccCCHHHHHHHHHHHHHHHHHHHhcCc--eecceee--EecChhHhhCCE Confidence 75 699999999999999999999999999999999999999999999999999885 5666653 489999999999 Q ss_pred EEEEEEEEecCcceeEEEEEEEcc Q lcl|Aclame:pro 357 FVIKYDYHWIPSLESLGLEQRVND 380 (393) Q Consensus 357 ~~~~v~~~p~~p~e~i~~~~~~~~ 380 (393) ++++|+++|++|+|||+|++.++. T Consensus 694 l~V~I~vaPv~PaEfI~ititITA 717 (717) T protein:vir:79 694 GSIELSLEAPNELRRLTTIVSLSA 717 (717) T ss_pred EEEEEEEEecCcccEEEEEEEEeC Confidence 999999999999999999999998 No 34 >protein:vir:103168 Length: 641 # NCBI annotation: gp122 # Family: family:all:661 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717789;genbank:gi:113200626;genbank:GeneID:4239177 Probab=100.00 E-value=2.2e-42 Score=248.95 Aligned_cols=264 Identities=11% Similarity=0.005 Sum_probs=187.1 Q ss_pred CCCCcccCCCeEEEEcCCCcccccccccceeEEEEeecccccccccccceEEeecchhhhhhcccc---cchhhhhhhhh Q lcl|Aclame:pro 1 MSILDTYLHGVEVVEVNAGGVTISTAATSVIGVVCTGDQADAETFPLNTPVLITNPLNYLEKAGST---GTLRRTLNSIG 77 (393) Q Consensus 1 m~m~~~~~~GV~v~ev~~~~~~i~~v~tav~g~vg~a~~~d~~~~p~~~~vl~t~~~~~~~~~g~~---~tl~~~~~~~~ 77 (393) |||+++..|||||+|++.+ ++|..+.|++.+|+|+++.. |+++|++++|+.+++..||.. ..+..++..+| T Consensus 1 ~~m~~~~sPGVyv~E~~~~-~~i~~v~tsvaafvG~~~~G-----P~~~p~~v~s~~d~~~~FG~~~~~~~l~~av~~fF 74 (641) T protein:vir:10 1 MSVSNQLSPGVVIQERDLT-AVTTPIGLNVGVLAAPFTKG-----PVEEIFEVSTERDLASVFGEPNDYNYEYWFTASQF 74 (641) T ss_pred CCCccccCCceEEEEecCC-CcccccCCccceEEecccCC-----CCCccEEecCHHHHHHHcCCcCCCcchHHHHHHHH Confidence 9999877799999999986 68999999999999988766 899999999999999999953 56888999999 Q ss_pred cccCceEEEEEeccccccccc-----------------------------------------------cc---------- Q lcl|Aclame:pro 78 SIVKTPTVIVRVAESDDSDTL-----------------------------------------------TA---------- 100 (393) Q Consensus 78 ~~~~~~~~vv~~~~~~~~~~~-----------------------------------------------~~---------- 100 (393) .+++..++++|+......... .. T Consensus 75 ~ngG~~~~vvRv~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~A~~pG~~gn~l~v~i~~~~~~~~~~~~~~~ 154 (641) T protein:vir:10 75 LSYGGVLKAIRLNAASLKNSVDSGTAPLIKNLQEYETTYESSNSNTFKFASRDAGALGNSVGIFITDAGPDQIAVLPAPG 154 (641) T ss_pred HhcCCEEEEEEecCccccccccccchhhccccccccccccCcCccccEEEeccCCCcCCceEEEEEcCCCcceeeeeccc Confidence 999999998887321000000 00 Q ss_pred -----------------------c------------------------------------------------hhc----- Q lcl|Aclame:pro 101 -----------------------N------------------------------------------------IVG----- 104 (393) Q Consensus 101 -----------------------~------------------------------------------------~~~----- 104 (393) . ..+ T Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~~~~~~~~~~a~~~~~~i~~~~~g~~g~ 234 (641) T protein:vir:10 155 TGNEWEFVADEAVTAASGASAKVYRYSIRLTLTNVVGTFVPGGATTISISGSDESVDVLAWDAGNKYLEIALPAGGVTGI 234 (641) T ss_pred ccccceeccceeeeeccCcccccccccccccccceeeecccCCcceeEecccccccccccccCCcceeeeeecCCcceee Confidence 0 000 Q ss_pred ----cccc-c----------cc-------------------------------------------cc-----------hh Q lcl|Aclame:pro 105 ----TQEN-G----------KF-------------------------------------------TG-----------IK 115 (393) Q Consensus 105 ----~~~~-~----------~~-------------------------------------------~g-----------l~ 115 (393) ...+ + .. .+ .. T Consensus 235 ~~~~~~~t~gt~~~t~a~~g~~~~~~~~~~~~~ia~aat~ag~~g~~~~v~~~~v~d~~~a~~~~~g~~~~~va~~~gts 314 (641) T protein:vir:10 235 FADAQVVTQGTNTAAIASSGIERRLYIGKDSGSINFAATDAVVDTNATSATISSVRNEYAEREYLPGSKWVNVAARPGTS 314 (641) T ss_pred eeeeeeccCCccceeeecccchhhhhhccccccceeeeecccccccceeeEeeeeeeeecccccccccccccccccchhh Confidence 0000 0 00 00 00 Q ss_pred h-----------------------------hhhhhhhhh----------------------------------------- Q lcl|Aclame:pro 116 A-----------------------------LLTAQSTVF----------------------------------------- 125 (393) Q Consensus 116 a-----------------------------l~~~~~~~~----------------------------------------- 125 (393) . +.+.+.... T Consensus 315 ~~a~~~g~~~D~~~~lv~d~~~~~~g~~g~v~e~~~~~s~~~~~~~~~~~~~~~~~~~~~~s~~v~~~~~~~~~~~~~~~ 394 (641) T protein:vir:10 315 LYANSVGGVNDELHVLVIDVDGKITGNPGSVLERFIGVSKASDAKTSIGEVNYYKEVIKQQSAYVYWGSHETAPFLGTAA 394 (641) T ss_pred hhhhhcCCcccceEEEEEeecceeeccccceeeeeecccccCCcccccccceeeeeeeccccceEEEecccccccccccc Confidence 0 000000000 Q ss_pred -------------------------------------------------------------------------------h Q lcl|Aclame:pro 126 -------------------------------------------------------------------------------V 126 (393) Q Consensus 126 -------------------------------------------------------------------------------~ 126 (393) . T Consensus 395 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~gG~d~~~~~~~~~~~~~~~~tg~~~~~~~e~~ 474 (641) T protein:vir:10 395 NAAAGDWGASALNRRYNLLRSTAGTTSFPAGAVTVGSKNNATHYYRLANGADYSASGALYNLSNVDIATAYELIEDPESQ 474 (641) T ss_pred cccccccccccccccccccccccccccccccccccCCCCcceeEEEeecCcccccccccccccchhHHHHHHHhhhhhhh Confidence 0 Q ss_pred cccccccccc-----chHHHHHHHHHhhcccceEEEEecCCCC----------cchhhhhhhcc-cccceEEEeccceeE Q lcl|Aclame:pro 127 KPKLLCVPQH-----DNQAVATELLSVAKKLNAFAFISDNGAT----------TKEQAYTYRQN-FSQREGMMIFGDWKS 190 (393) Q Consensus 127 ~~~~l~apg~-----s~~~v~~al~~~a~~~~~~~~i~~~~~~----------~~~~a~~~~~~-~~s~~~~~~~p~~~~ 190 (393) ...++++|+. ...+++.+|++|||++++++++.|+|.. ..+.++.|++. .+|+|+++||||+++ T Consensus 475 ~i~~l~~~~~~~~~~~~~~v~~~~i~~ce~~~d~~ailD~p~~~~~~~~~~~~~~~~~~~~~~~~~~s~yaa~y~P~~~v 554 (641) T protein:vir:10 475 VIDYVLSGPAGADEAAAIAKATTITTIVESRKDCMAFLSPLRSDVIGVSNTTTVTENLVNYFNQLPSSNYVVFDSGYKYI 554 (641) T ss_pred ccceeeecCCCCCcchhHHHHHHHHHHHHhcCCEEEEEcCCcccccCCCchhhHHHHHHHHHhhcCCCceEEEEeceeEe Confidence 0001111111 1245788899999999999989898742 12456677764 578999999999999 Q ss_pred eeccCCceEEechhHHHHHHHHhhhccCCceecCCCc---eecceeeceeecccccCCCchhhhhhcccceEEE--EeCC Q lcl|Aclame:pro 191 YNTDKKAYDTDYAVARACALQAYIDKTVGWHKNISNV---ELDGVTGITKAVEFDINESSTEANYLNEKGITIC--LNHN 265 (393) Q Consensus 191 ~~~~~~~~~~~p~S~~~ag~~a~~D~~~G~~~spaN~---~l~gv~~~~~~~~~~~~~~~~~~~~ln~~gi~~~--~~~~ 265 (393) +|+.+++.+++||||++||+|||+|.+|||||||||. .|+|+++++.. .++.|++.||++|||+| |+++ T Consensus 555 ~dp~~~~~~~vPpsG~iAGv~ArtD~~rGvwkAPAn~~~~~i~gv~~l~~~------~~~~e~~~Lnp~gIN~ir~fpg~ 628 (641) T protein:vir:10 555 YDKYNDVYRYVPCNGDIAGLILETGLEEEPWFSPAGFQRGVLRNAVKLAYS------PNKTQRDRLYANRINPVVSFPGH 628 (641) T ss_pred ecccCCceeEecCCHHHHHHHHhhhccCCceECcCCcccceeeeeeeeeEe------cChhHHhhhhhcccceEEecCCc Confidence 9999999999999999999999999999999999998 47888888765 46789999999999999 5677 Q ss_pred CEE--EEecccCCCCccc Q lcl|Aclame:pro 266 GFR--YWGSRTLATDTRW 281 (393) Q Consensus 266 G~~--~wG~rT~~~d~~~ 281 (393) |++ ...-+| +. T Consensus 629 G~v~~~~~~~~-----~~ 641 (641) T protein:vir:10 629 AMINNNIAFHT-----KL 641 (641) T ss_pred eeecceeeeee-----cC Confidence 765 222222 11 No 35 >protein:vir:63742 Length: 562 # NCBI annotation: Tsh # Family: family:all:2449 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547629;genbank:GeneID:3783475 Probab=100.00 E-value=1.4e-40 Score=239.09 Aligned_cols=361 Identities=12% Similarity=0.027 Sum_probs=254.4 Q ss_pred CCCCcccCCCeEEEEcCCCcccccccccceeEEEEeecccccccccccceEEeecchhhhhhcccccchhhhhhhhh--- Q lcl|Aclame:pro 1 MSILDTYLHGVEVVEVNAGGVTISTAATSVIGVVCTGDQADAETFPLNTPVLITNPLNYLEKAGSTGTLRRTLNSIG--- 77 (393) Q Consensus 1 m~m~~~~~~GV~v~ev~~~~~~i~~v~tav~g~vg~a~~~d~~~~p~~~~vl~t~~~~~~~~~g~~~tl~~~~~~~~--- 77 (393) -||-..-+||||++|.+++++++..+++++.+|+|.++.. |.++|++++++.++...||.. .|.+++...+ T Consensus 6 ~~~~~~~~Pgv~~~~~~s~~~~~~~~~~~~~~~iG~a~~G-----~~~~~~~~~~~~~~~~~fg~g-~l~~~i~~a~~~~ 79 (562) T protein:vir:63 6 YPRKPVSRPHTEISVDTSGIGGSSSGSEKILCLVGSATGG-----KPNAVYKVRNYSQAKSVFRSG-ELLDAIERAWNPG 79 (562) T ss_pred eCCCcccCCceEEEEecCCCcccCCCCCceEEEEEEeCCC-----CCceeEEEccHHHHHHHhcCC-chHHHHHHhcccc Confidence 4544555799999999999999999999999999999887 679999999999999999763 3666665544 Q ss_pred -cccCceEEEEEeccccccccccc---------------------------------------------chh-------- Q lcl|Aclame:pro 78 -SIVKTPTVIVRVAESDDSDTLTA---------------------------------------------NIV-------- 103 (393) Q Consensus 78 -~~~~~~~~vv~~~~~~~~~~~~~---------------------------------------------~~~-------- 103 (393) .+++..++++++........+.. .+. T Consensus 80 ~~~g~~~~~~~rv~~a~~a~~~~~~~~~~a~~~g~~~n~i~v~~~~~~~~~~~~~~v~~~~~~~~ev~~~~g~V~~i~y~ 159 (562) T protein:vir:63 80 EGTGAGDILAMRVEEAKEATFEAEGVKVSSTIYGADANDIQVALEDNTITGTKRLSIVFAKERVNQVYDNLGSIFSIKYK 159 (562) T ss_pred ccCCceEEEEEEcCCCccceeEecceeEEEeecccCCCeEEEEEecCCCCCCcceEEEecCCCcchhhhhccceeeeeee Confidence 57777777766533211110000 000 Q ss_pred -----------c---ccc-------cccc-------------cchhhhhhh----------------------------- Q lcl|Aclame:pro 104 -----------G---TQE-------NGKF-------------TGIKALLTA----------------------------- 120 (393) Q Consensus 104 -----------~---~~~-------~~~~-------------~gl~al~~~----------------------------- 120 (393) + ..+ .+.. +...+.... T Consensus 160 g~~~~~~~~v~~~~~~~~a~~l~~~~g~~~v~~~~L~~g~~~~~~~l~~~in~~~~~~aky~~~~gn~i~~~~~d~~~~~ 239 (562) T protein:vir:63 160 GTEASATFTVAVDPVTFKATKLTLKAGDKTVKEYDLGSGAYAETNVLISDINNLPDFEAKFFPIGDKNLTTDNFDAQIDV 239 (562) T ss_pred cccccceEEEEecCcceeEEEEEeecCCcceeEEEecCCccchhHHHHHhhccccceEEEeeccCCceeeeecccccccc Confidence 0 000 0000 000000000 Q ss_pred hhh-------------------hhh----------------------------------------ccccccccccchHHH Q lcl|Aclame:pro 121 QST-------------------VFV----------------------------------------KPKLLCVPQHDNQAV 141 (393) Q Consensus 121 ~~~-------------------~~~----------------------------------------~~~~l~apg~s~~~v 141 (393) ..+ ..+ .....+.+..++.++ T Consensus 240 ~vkt~~~~v~t~~~d~~~~~~~~~~v~~~~~~~~~la~~~~~~LtGG~dGt~~~~~~~al~ale~~~~~~i~~~t~d~av 319 (562) T protein:vir:63 240 DIKTKEAYVKAVGGDIEKQTAYNGYVDFEFDRSKEIANFPLTKLTGGDNGTIPESWADKFSYFANEGGYYLVPLTSKQAV 319 (562) T ss_pred chhhhhhhhhhhhhhhhhcccccceeeeeeccccceecccceeeecCCCCCchhhHHHHHHHHHhCCcEEEEecCCCHHH Confidence 000 000 000001111123456 Q ss_pred HHHHHHhhcccce-----EEEEecCCCCcchhhhhhhcccccceEEEeccceeEeeccCCceEEech---hHHHHHHHHh Q lcl|Aclame:pro 142 ATELLSVAKKLNA-----FAFISDNGATTKEQAYTYRQNFSQREGMMIFGDWKSYNTDKKAYDTDYA---VARACALQAY 213 (393) Q Consensus 142 ~~al~~~a~~~~~-----~~~i~~~~~~~~~~a~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~---S~~~ag~~a~ 213 (393) +.++.++|++++. ++++..+++.+.+++......+++.+.+++.|+....+. .+.....|+ ++.+||+++. T Consensus 320 ~~~l~a~vkr~~~~g~~~~aVlg~~~~~~~~~~~~~a~~~n~ervv~v~~~~~~~~~-~~~~~~~~~~~~aa~vAGl~A~ 398 (562) T protein:vir:63 320 HAEALQFVRDCSYNGNPMRVFVGGGIGESMEQLFTRAIGLQNERAGLIGFSGTVKMD-DGRSLKMPGYMFAAQVAGLTCG 398 (562) T ss_pred HHHHHHHHHHHHhCCCcEEEEecCCCCCCHHHHHHHhhhcCCCcEEEEecCeeEECC-CCceeeechhHHHHHHHHHhhc Confidence 7778888876654 444444556677788888889999999999998765443 456666777 7899999999 Q ss_pred hhccCCceecCCCceecceeeceeecccccCCCchhhhhhcccceEEEEe--CCCEEEEec-cc-----CCCCcccceee Q lcl|Aclame:pro 214 IDKTVGWHKNISNVELDGVTGITKAVEFDINESSTEANYLNEKGITICLN--HNGFRYWGS-RT-----LATDTRWAFQQ 285 (393) Q Consensus 214 ~D~~~G~~~spaN~~l~gv~~~~~~~~~~~~~~~~~~~~ln~~gi~~~~~--~~G~~~wG~-rT-----~~~d~~~~~i~ 285 (393) .| +++||.|+.+. ..++.. .+++.|.+.|++.|+.++.. +++.++|.. ++ ...|+.|++|+ T Consensus 399 ~~----~~~SlT~~~i~-~~~v~~------~~t~~e~~~li~~Gv~~l~~~~~~~v~~~~iv~~itT~t~~~~~~~~ki~ 467 (562) T protein:vir:63 399 LE----IGEAITFKNIA-IETLDT------IYEGSQLDQLNESGIITAEFVRNRAVTNFRIVDDVTTFNDKTDPVKSEIG 467 (562) T ss_pred Cc----hhcCccceeec-cccccc------cCCHHHHHHHHhCCeEEEEEecCCcEEEEEeeccceecCCCCCchhhhhh Confidence 87 88999999986 455543 46889999999999999953 566677754 33 34578899999 Q ss_pred hhhHHHHHHHHHHHHhH-HhhcccCCHHHHHHHHHHHHHHHHHHhhcccccccceEEEecCCCCHHHhhCCEEEEEEEEE Q lcl|Aclame:pro 286 SVRTAQIIKETIGAGLA-WAVDMPLTPLRVKTMLEAINNKLRSWASGDDPRILGARVWVAEEITADIIKSGKFVIKYDYH 364 (393) Q Consensus 286 ~rR~~~~i~~~i~~~~~-~~v~epn~~~~~~~i~~~i~~~l~~l~~~g~~~~~~~~v~~~~~nt~~~i~~G~~~~~v~~~ 364 (393) ++|++|+|.+++++.+. ||+++||+...|..++..+..||.+|++.|.+ .++.. ++-.-+..+++++|++.+. T Consensus 468 viRv~D~i~~dir~~~~~~yiGk~Nn~~~r~~v~~~i~~~L~~l~~~gaI--~~~~~----~dv~v~~~~d~~~v~~~v~ 541 (562) T protein:vir:63 468 VGEANDFLVSELKISLDNEYIGTKIIDTSASLVKNFVQSFLDRKKLAKEI--QDYSP----EEVQVVIEGDVARISLTVF 541 (562) T ss_pred hhHHHHHHHHHHHHHHHhcCCccccChHHHHHHHHHHHHHHHHHHhCCcc--cCCCc----cceEEEecCCEEEEEEEEE Confidence 99999999999988865 99999999999999999999999999998864 33321 1112235668899999999 Q ss_pred ecCcceeEEEEEEEcchHHHH Q lcl|Aclame:pro 365 WIPSLESLGLEQRVNDEYVVD 385 (393) Q Consensus 365 p~~p~e~i~~~~~~~~~~~~~ 385 (393) |+.|+|+|.+++.+.++-++. T Consensus 542 pv~~mekIy~ti~~~~~~~~~ 562 (562) T protein:vir:63 542 PIRSMKKIEVSLVYRQQILTA 562 (562) T ss_pred EcccceEEEEEEEEeeeeecC Confidence 999999999999999998887 No 36 >protein:vir:80779 Length: 569 # NCBI annotation: putative tail sheath protein # Family: family:all:2449 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504132;genbank:gi:158079319;genbank:GeneID:5666428 Probab=100.00 E-value=3.5e-39 Score=231.43 Aligned_cols=361 Identities=14% Similarity=0.076 Sum_probs=249.6 Q ss_pred CC-----CCcccCCCeEEEEcCCCcccccccccceeEEEEeecccccccccccceEEeecchhhhhhcccccchhhhhhh Q lcl|Aclame:pro 1 MS-----ILDTYLHGVEVVEVNAGGVTISTAATSVIGVVCTGDQADAETFPLNTPVLITNPLNYLEKAGSTGTLRRTLNS 75 (393) Q Consensus 1 m~-----m~~~~~~GV~v~ev~~~~~~i~~v~tav~g~vg~a~~~d~~~~p~~~~vl~t~~~~~~~~~g~~~tl~~~~~~ 75 (393) |+ +-..-+||||+++..++++++..+++.+.+|+|.++.. |.++|++++++.++...|+.. .|.+++.. T Consensus 1 ~~~~~~~~~~~~~Pgv~~~~~~~~~~~~~~~~~~~~~~ig~a~~G-----~~~~~~~~~~~~~~~~~f~~g-~l~~a~~~ 74 (569) T protein:vir:80 1 MAVEQFPRKKVSRPHTEITVDTSGIGGSSSSSDKTLMLVGSAKGG-----KPDTVYRFRNYQQAKQVLRSG-DLLDAIEL 74 (569) T ss_pred CeeeeecCCccccCceEEEEecCCCcCCCCCCceeEEEEEEeCCC-----CCceeEEecCHHHHHHHhcCC-chhHHHHh Confidence 43 33344699999999999999999999999999999887 668999999999999998763 35555544 Q ss_pred hh------cccCceEEEEEeccccccccc---------------------------------------------ccchh- Q lcl|Aclame:pro 76 IG------SIVKTPTVIVRVAESDDSDTL---------------------------------------------TANIV- 103 (393) Q Consensus 76 ~~------~~~~~~~~vv~~~~~~~~~~~---------------------------------------------~~~~~- 103 (393) .+ .+++..++++++........+ ...+. T Consensus 75 a~~~~~~~~~~~~~~~~~rv~~a~~a~~~~~~~~~~a~~~g~~~n~i~v~l~~~~~~~~~~~~v~~~~~~~~~~~~~ig~ 154 (569) T protein:vir:80 75 AWNASDVNTASAGDILAVRVEDAKNATLTKGGLTFASTIYGVDANEIQVALEDNNLTHTKRLTVAFSKDGYKKVFDNLGK 154 (569) T ss_pred hccCccccccCceEEEEEEcCCCeeeeeeccceeeeeeeccCCCceEEEEEecCcCCcceeeEEeeecCCCccccccccc Confidence 43 344445555444220000000 00000 Q ss_pred -------cc-------c--ccc--------cccc------------------hh---hhhh---------h--------- Q lcl|Aclame:pro 104 -------GT-------Q--ENG--------KFTG------------------IK---ALLT---------A--------- 120 (393) Q Consensus 104 -------~~-------~--~~~--------~~~g------------------l~---al~~---------~--------- 120 (393) +. . +.. .+.| .. .+.. + T Consensus 155 v~si~ytg~~~~a~~~~~~~~~~~~a~~l~~~~g~~~~~~~~v~~~~~~~~~~~~~~~lv~~~~~~~~f~a~~~~~~~~~ 234 (569) T protein:vir:80 155 IFSIQYKGSEAQANFTIAQDSISKKATTLTLNVGSEPESTTEVMKYELGQGVYSETNVLVSAINSLPDWEAKFFPIGDKN 234 (569) T ss_pred eeeEEEeeccccceEEeecCcCcceeEEEEEEecCCcceeEEEEeeccCCccchhhhhhhhhcCCccCceEEEEecCCCc Confidence 00 0 000 0000 00 0000 0 Q ss_pred ----------------------------hhh------hhh-------------------------------------ccc Q lcl|Aclame:pro 121 ----------------------------QST------VFV-------------------------------------KPK 129 (393) Q Consensus 121 ----------------------------~~~------~~~-------------------------------------~~~ 129 (393) ... ..+ ..- T Consensus 235 ~~~~~~d~~~~~~~~t~~~~~~~~~~di~~~~~~~~~v~~~~~~~~~l~~~~~~~LtGG~dG~~~~~~~~~l~~le~~~~ 314 (569) T protein:vir:80 235 LPTDALEAVTKVDVKTEAVFVGALAGDIAKQLEYNDYVTVAVDATKPVEDFELTNLTGGSDGTAPESWANKFPLLANEGG 314 (569) T ss_pred ceehhccchhheeccccceeeehhHHHHHHhhcCCceEEEEecCCcceeeecceeecCCCCCCccchHHHHHHHHhhCCc Confidence 000 000 000 Q ss_pred cccccccchHHHHHHHHHhhcccce-----EEEEecCCCCcchhhhhhhcccccceEEEeccceeEeeccCCceEEech- Q lcl|Aclame:pro 130 LLCVPQHDNQAVATELLSVAKKLNA-----FAFISDNGATTKEQAYTYRQNFSQREGMMIFGDWKSYNTDKKAYDTDYA- 203 (393) Q Consensus 130 ~l~apg~s~~~v~~al~~~a~~~~~-----~~~i~~~~~~~~~~a~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~- 203 (393) ..+.+...+++++.++.++|+++++ ++++..+++.+.+++......+++.+..+++||....+. .+....+|+ T Consensus 315 ~~i~~~t~d~av~~~l~a~vkr~r~~g~~~~aVvg~~~~~~~~~~~~~a~~~n~e~vv~v~~~~~~~~~-~g~~~~~~~~ 393 (569) T protein:vir:80 315 YYLVPLTDKQAVHSEALAFVKDRTDNGDPMRIIVGGGTNETVEESITRATNLRDPRASLVGFSGTRKMD-DGRLLKLPGY 393 (569) T ss_pred EEEEecCCChHHHHHHHHHHHHHHhCCCcEEEEecCCCCCCHHHHHHHHhhcCCCeEEEEecCceeecC-CCcceeechh Confidence 1111222235678889999988754 444555666778889999999999999999999887654 344455555 Q ss_pred --hHHHHHHHHhhhccCCceecCCCceecceeeceeecccccCCCchhhhhhcccceEEEEe--CCCEEEEec------c Q lcl|Aclame:pro 204 --VARACALQAYIDKTVGWHKNISNVELDGVTGITKAVEFDINESSTEANYLNEKGITICLN--HNGFRYWGS------R 273 (393) Q Consensus 204 --S~~~ag~~a~~D~~~G~~~spaN~~l~gv~~~~~~~~~~~~~~~~~~~~ln~~gi~~~~~--~~G~~~wG~------r 273 (393) ++.+||++|..+ +++||.|+.+. +.++.. .+++.|++.|++.|+.++.. +++.++|.. + T Consensus 394 ~~aa~vAG~~A~~~----~~~S~T~k~i~-~~~i~~------~lt~~e~~~li~~G~~~l~~~~~~~~~v~~~vn~itT~ 462 (569) T protein:vir:80 394 MMASQIAGIASGLE----VGEAITFKHFN-VTSVDR------VFESSQLDMLNESGVISIEFVRNRTLTAFRVVQDVTTY 462 (569) T ss_pred hHHHHHHHHHhcCc----cccCccceeec-cccccc------cCCHHHHHHHHhCCeEEEEEecCceEEEEEEeccceec Confidence 678888888775 88999999986 455554 36788999999999999954 455666643 2 Q ss_pred cCCCCcccceeehhhHHHHHHHHHHHHh-HHhhcccCCHHHHHHHHHHHHHHHHHHhhcccccccceEEEecCCCCHHHh Q lcl|Aclame:pro 274 TLATDTRWAFQQSVRTAQIIKETIGAGL-AWAVDMPLTPLRVKTMLEAINNKLRSWASGDDPRILGARVWVAEEITADII 352 (393) Q Consensus 274 T~~~d~~~~~i~~rR~~~~i~~~i~~~~-~~~v~epn~~~~~~~i~~~i~~~l~~l~~~g~~~~~~~~v~~~~~nt~~~i 352 (393) |...|+.|++|+++|++|+|.+++++.+ .+|+++||+...|..++..++.||.+|++.|.+ .++.. ++-.-++ T Consensus 463 t~~~~~~~~~i~viRv~D~i~~dir~~~~~~yiGk~nn~~~r~~v~~~i~~~L~~l~~~gaI--~~~~~----~dv~v~~ 536 (569) T protein:vir:80 463 NDKSDPVKNEMSVGEANDFLVSELKIELDNNFIGTKVIDTSASLIKNFIQSFLDNKKRAREI--QDYTP----EEVQVVL 536 (569) T ss_pred CCCCCchhhhhhhhHHHHHHHHHHHHHHHhhcCcccCChhHHHHHHHHHHHHHHHHHhCCcc--cCCCc----cceEEEe Confidence 2345778999999999999999999876 589999999999999999999999999998864 33321 1112235 Q ss_pred hCCEEEEEEEEEecCcceeEEEEEEEcchHHHH Q lcl|Aclame:pro 353 KSGKFVIKYDYHWIPSLESLGLEQRVNDEYVVD 385 (393) Q Consensus 353 ~~G~~~~~v~~~p~~p~e~i~~~~~~~~~~~~~ 385 (393) .+++++|++.+.|+.|+|+|.+++.+.++-++. T Consensus 537 ~~d~~~v~~~v~Pv~~~ekI~~ti~~~~~~~~~ 569 (569) T protein:vir:80 537 EGDVASISMTVMPIRSLNKITVQLVYKQQILTA 569 (569) T ss_pred cCCEEEEEEEEEEcccccEEEEEEEEeeeeecC Confidence 678999999999999999999999999998887 No 37 >protein:vir:80488 Length: 562 # NCBI annotation: Tsh # Family: family:all:2449 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468473;genbank:gi:157325048;genbank:GeneID:5601446 Probab=100.00 E-value=1.6e-38 Score=227.85 Aligned_cols=361 Identities=12% Similarity=0.043 Sum_probs=251.2 Q ss_pred CCCCcccCCCeEEEEcCCCcccccccccceeEEEEeecccccccccccceEEeecchhhhhhcccccchhhhhhhhh--- Q lcl|Aclame:pro 1 MSILDTYLHGVEVVEVNAGGVTISTAATSVIGVVCTGDQADAETFPLNTPVLITNPLNYLEKAGSTGTLRRTLNSIG--- 77 (393) Q Consensus 1 m~m~~~~~~GV~v~ev~~~~~~i~~v~tav~g~vg~a~~~d~~~~p~~~~vl~t~~~~~~~~~g~~~tl~~~~~~~~--- 77 (393) -|+-..-+||||++|..++.+++..+++++.+|+|.++.. |.++|++++++.++...|+.. .|.+++...+ T Consensus 6 ~~~~~~~~pgv~~~~~~s~~~~~~~~~~~~~~~ig~a~~G-----~~~~~~~~~~~~~~~~~f~~g-~l~~~i~~a~~~~ 79 (562) T protein:vir:80 6 YPRKPVSRPHTEISVDTSGIGGSSSGSEKILCLVGSATGG-----KPNAVYKVRNYSQAKSVFRSG-ELLDAIERAWNPG 79 (562) T ss_pred eCCCcccCCceEEEEecCCcccCCCCCCceEEEEEEeCCC-----CcceeEEEccHHHHHHHhcCC-ChHHHHHHhcccc Confidence 2333344699999999999999999999999999999877 679999999999999999763 3555554444 Q ss_pred -cccCceEEEEEeccccccccccc---------------------------------------------chh-------- Q lcl|Aclame:pro 78 -SIVKTPTVIVRVAESDDSDTLTA---------------------------------------------NIV-------- 103 (393) Q Consensus 78 -~~~~~~~~vv~~~~~~~~~~~~~---------------------------------------------~~~-------- 103 (393) .+++..++++++........+.. .+. T Consensus 80 ~~~g~~~~~~~rv~~a~~a~~~~~~~~~~~~~~g~~~n~i~v~~~~~~~~~~~~~~v~~~~~~~~ev~~~~g~v~~i~y~ 159 (562) T protein:vir:80 80 EGTGAGDILAMRVEEAKEATFEAEGVKVSSTIYGADANDIQVALEDNTITGTKRLSIVFAKERVNQVYDNLGSIFSIKYK 159 (562) T ss_pred cccCceEEEEEEcCCCCcceEEecceEEEEeecccCCCceEEEEecCCCCCCcceEEEecCCcceEEeeccCceeeeeec Confidence 47777777666533211111000 000 Q ss_pred -----------c---ccc-------cccc-------------cchhhhhhh----------------------------- Q lcl|Aclame:pro 104 -----------G---TQE-------NGKF-------------TGIKALLTA----------------------------- 120 (393) Q Consensus 104 -----------~---~~~-------~~~~-------------~gl~al~~~----------------------------- 120 (393) + ..+ .+.. +...+.... T Consensus 160 g~~~~a~~~i~~~~~~~~a~~l~~~~g~~~v~~~~l~~g~~~~~~~l~~~i~~~~~~tAky~g~~~n~i~~~~~d~~~~~ 239 (562) T protein:vir:80 160 GTEASATFTVAVDPVTFKATKLTLKAGDKTVKEYDLGSGAYAETNVLISDINNLPDFEAKFFPIGDKNLTTDNFDAQIDV 239 (562) T ss_pred cccccceeEEEecCccceEEEEEEecCCcceeEEEeCCCccchhhhhhhhhccccceEEEecccCCceeeecccccchhh Confidence 0 000 0000 000000000 Q ss_pred h----------------hhhh----hcc-----c----------------------------------cccccccchHHH Q lcl|Aclame:pro 121 Q----------------STVF----VKP-----K----------------------------------LLCVPQHDNQAV 141 (393) Q Consensus 121 ~----------------~~~~----~~~-----~----------------------------------~l~apg~s~~~v 141 (393) . .... ... . ..+.+...+.++ T Consensus 240 ~~kt~~~~v~~~~~d~~~~~~~n~~v~~~~~~~~~la~~~~~~LtGG~dG~~~~~~~dal~~Le~~~~~~i~~~t~d~ai 319 (562) T protein:vir:80 240 DIKTKEAYVKAVGGDIEKQTAYNGYVEFEFDRSKEIANFPLTKLTGGDNGTIPESWADKFSYFANEGGYYLVPLTSKQAV 319 (562) T ss_pred hcccceeeeeehhhhhhhcccccceEEEEeccCccccccceeeeeCCCCCCccccHHHHHHHHHhCCcEEEEecCCChHH Confidence 0 0000 000 0 000011122456 Q ss_pred HHHHHHhhcccce----E-EEEecCCCCcchhhhhhhcccccceEEEeccceeEeeccCCceEEech---hHHHHHHHHh Q lcl|Aclame:pro 142 ATELLSVAKKLNA----F-AFISDNGATTKEQAYTYRQNFSQREGMMIFGDWKSYNTDKKAYDTDYA---VARACALQAY 213 (393) Q Consensus 142 ~~al~~~a~~~~~----~-~~i~~~~~~~~~~a~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~---S~~~ag~~a~ 213 (393) +..+.++|++++. + +++..+++.+.+++......+++.+..++.|+...... .+.....|+ ++.+||++|. T Consensus 320 ~~~~~a~vkr~r~~g~~~~aVvg~~~~~~~~~~~~~a~~~n~e~vv~v~~~~~~~~~-~~~~~~~~~~~~aa~vAGl~Ag 398 (562) T protein:vir:80 320 HAEALQFVRDCSYNGNPMRVFVGGGIGESMEQLFTRAIGLQNERAGLIGFSGTVKMD-DGRSLKMPGYMFAAQVAGLTCG 398 (562) T ss_pred HHHHHHHHHHHHhCCCeEEEEecCCCCCCHHHHHHHhhhcCCCeEEEEecCeeEECC-CCceeeechhHHHHHHHHHHhc Confidence 7778888877654 3 44444566777888888899999999999998766544 445555666 8899999999 Q ss_pred hhccCCceecCCCceecceeeceeecccccCCCchhhhhhcccceEEEEe--CCCEEEEe-ccc---C--CCCcccceee Q lcl|Aclame:pro 214 IDKTVGWHKNISNVELDGVTGITKAVEFDINESSTEANYLNEKGITICLN--HNGFRYWG-SRT---L--ATDTRWAFQQ 285 (393) Q Consensus 214 ~D~~~G~~~spaN~~l~gv~~~~~~~~~~~~~~~~~~~~ln~~gi~~~~~--~~G~~~wG-~rT---~--~~d~~~~~i~ 285 (393) .| +++||.|+++.+ .++.. .+++.|.+.|++.|+.++.. +++.++|. -++ . ..|+.|++|+ T Consensus 399 ~~----~~~S~T~~~i~~-~~v~~------~lt~~e~~~li~~G~l~l~~~~~~~v~~~riv~~itT~t~~~~~~~~ki~ 467 (562) T protein:vir:80 399 LE----IGEAITFKNIAI-ETLDT------IYEGSQLDQLNESGIITAEFVRNRAVTNFRIVDDVTTFNDKTDPVKSEIG 467 (562) T ss_pred Cc----cccCccceeecc-ccccc------cCCHHHHHHHHhCCeEEEEEecCCcEEEEEeeccceeccCCCCchhhhhh Confidence 97 889999999875 34433 36788999999999999954 45566662 222 2 4478899999 Q ss_pred hhhHHHHHHHHHHHHh-HHhhcccCCHHHHHHHHHHHHHHHHHHhhcccccccceEEEecCCCCHHHhhCCEEEEEEEEE Q lcl|Aclame:pro 286 SVRTAQIIKETIGAGL-AWAVDMPLTPLRVKTMLEAINNKLRSWASGDDPRILGARVWVAEEITADIIKSGKFVIKYDYH 364 (393) Q Consensus 286 ~rR~~~~i~~~i~~~~-~~~v~epn~~~~~~~i~~~i~~~l~~l~~~g~~~~~~~~v~~~~~nt~~~i~~G~~~~~v~~~ 364 (393) ++|++|+|.+++++.+ .||+++||+...|..++..+..||.+|++.|.+ .++.. ++-.-+..+++++|++.+. T Consensus 468 viRv~D~i~~dir~~~~~~yIGk~Nn~~~r~~v~~~i~~~L~~l~~~gaI--~~~~~----~dv~v~~~~d~~~v~~~v~ 541 (562) T protein:vir:80 468 VGEANDFLVSELKISLDNEYIGTKIIDTSASLVKNFVQSFLDRKKLAKEI--QDYSP----EEVQVVIEGDIARISLTVF 541 (562) T ss_pred hhHHHHHHHHHHHHHHHhcCCccccChHHHHHHHHHHHHHHHHHHhCCcc--cCCCc----cceEEEecCCEEEEEEEEE Confidence 9999999999999887 589999999999999999999999999998864 33321 1112235678899999999 Q ss_pred ecCcceeEEEEEEEcchHHHH Q lcl|Aclame:pro 365 WIPSLESLGLEQRVNDEYVVD 385 (393) Q Consensus 365 p~~p~e~i~~~~~~~~~~~~~ 385 (393) |+.|+|+|.+++.+.++-++. T Consensus 542 Pv~~mekIy~ti~~~~~~~~~ 562 (562) T protein:vir:80 542 PIRSMKKIEVSLVYRQQILTA 562 (562) T ss_pred EcccceEEEEEEEEEeeeecC Confidence 999999999999999998887 No 38 >protein:vir:95741 Length: 587 # NCBI annotation: ORF011 # Family: family:all:2449 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240910;genbank:gi:66394960;genbank:GeneID:5132682 Probab=100.00 E-value=5.7e-35 Score=208.34 Aligned_cols=361 Identities=14% Similarity=0.086 Sum_probs=247.1 Q ss_pred CCC-----CcccCCCeEEEEcCCCcccccccccceeEEEEeecccccccccccceEEeecchhhhhhcccccchhhhhhh Q lcl|Aclame:pro 1 MSI-----LDTYLHGVEVVEVNAGGVTISTAATSVIGVVCTGDQADAETFPLNTPVLITNPLNYLEKAGSTGTLRRTLNS 75 (393) Q Consensus 1 m~m-----~~~~~~GV~v~ev~~~~~~i~~v~tav~g~vg~a~~~d~~~~p~~~~vl~t~~~~~~~~~g~~~tl~~~~~~ 75 (393) |+. -..-+||||+++..++.+++..+++.+.+|+|.++.. |.++|+++++++++...||.. .|.+++.. T Consensus 1 ~a~~~~~~~~~~~pgv~~~~~~~~~~~~~~~~~~~~~~~g~a~~G-----~~~~~~~~~~~~~~~~~~~~g-~l~~~~~~ 74 (587) T protein:vir:95 1 MAVEPFPRRPITRPHASIEVDTSGIGGSAGSSEKVFCLIGQAEGG-----EPNTVYELRNYSQAKRLFRSG-ELLDAIEL 74 (587) T ss_pred CcccccCCcccccCceEEEEecCCccccCCCCCceEEEEEEeCCC-----CCceeEEeccHHHHHHHhcCc-chHHHHHH Confidence 542 2233699999999999999999999999999999887 668999999999999999763 35566555 Q ss_pred hh----cccCceEEEEEecccccccccc---------------------------------------------cchh--- Q lcl|Aclame:pro 76 IG----SIVKTPTVIVRVAESDDSDTLT---------------------------------------------ANIV--- 103 (393) Q Consensus 76 ~~----~~~~~~~~vv~~~~~~~~~~~~---------------------------------------------~~~~--- 103 (393) .+ .+++..++.+++.......... +.+. T Consensus 75 a~~~~~~~g~~~~~~~rv~~~~~a~~~~~~l~~~a~~~G~~gN~i~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~v~ 154 (587) T protein:vir:95 75 AWGSNPNYTAGRILAMRIEDAKPASAEIGGLKITSKIYGNVANNIQVGLEKNTLSDSLRLRVIFQDDRFNEVYDNIGNIF 154 (587) T ss_pred HhccccCCCceEEEEEEcCCCceeEEEecCeEEEEecccccccceEEEEecCCCCCceeEEEEEecccceeeeeecccee Confidence 55 3555555544332211000000 0000 Q ss_pred -----ccc---------cc-c-------------------cccc-----hhhhhhhhh---------------------- Q lcl|Aclame:pro 104 -----GTQ---------EN-G-------------------KFTG-----IKALLTAQS---------------------- 122 (393) Q Consensus 104 -----~~~---------~~-~-------------------~~~g-----l~al~~~~~---------------------- 122 (393) |.. +. . ...| ..+...... T Consensus 155 si~y~g~~~~~~~~v~~~~~t~~a~~~~l~~g~~~v~~yrL~~g~~~~~~~~~~~in~~~~~tAky~g~~~~~i~~~~~~ 234 (587) T protein:vir:95 155 TIKYKGEEANATFSVEHDEETQKASRLVLKVGDQEVKSYDLTGGAYDYTNAIITDINQLPDFEAKLSPFGDKNLESSKLD 234 (587) T ss_pred eeeeeccccccceeeeecccceeeeeeeeecCCceEEEEEecCCchHHHHHHHHhhccccceEEEEecccCceeEEeecC Confidence 000 00 0 0000 000000000 Q ss_pred -------------------------------------------------------------------------hh----h Q lcl|Aclame:pro 123 -------------------------------------------------------------------------TV----F 125 (393) Q Consensus 123 -------------------------------------------------------------------------~~----~ 125 (393) .. + T Consensus 235 ~~~~~~v~~~~~~v~a~~~d~~~~~~~~~~v~~~~~~g~~~~~~~~~~~~~~~~a~~~~~~~~~~~a~~~~t~LtGG~dG 314 (587) T protein:vir:95 235 KIENANIKDKAVYVKAVFGDLEKQTAYNGIVSFEQLNAEGEVPSNVEVEAGEESATVTATSPIKTIEPFELTKLKGGTNG 314 (587) T ss_pred cccccceehhhhhhhhhhcceeeeeeceeeeeeecccccceeccchhhhhcccchheeccccccceeccceeeeecCCCC Confidence 00 0 Q ss_pred hcc--------------ccccccccchHHHHHHHHHhhcccce-----EEEEecCCCCcchhhhhhhcccccceEEEecc Q lcl|Aclame:pro 126 VKP--------------KLLCVPQHDNQAVATELLSVAKKLNA-----FAFISDNGATTKEQAYTYRQNFSQREGMMIFG 186 (393) Q Consensus 126 ~~~--------------~~l~apg~s~~~v~~al~~~a~~~~~-----~~~i~~~~~~~~~~a~~~~~~~~s~~~~~~~p 186 (393) ..+ -..+.+..+++++++++.+||++++. ++++..+++.+.+++...+..+++.+..++++ T Consensus 315 ~~~~~y~~~l~ale~~~~~~i~~~t~d~~v~a~l~a~vk~~~~~g~~~~aVvg~~~~~~~~~~~~~a~~~n~ervi~v~~ 394 (587) T protein:vir:95 315 EPPATWADKLDKFAHEGGYYIVPLSSKQSVHAEVASFVKERSDAGEPMRAIVGGGFNESKEQLFGRQESLSNPRVSLVAN 394 (587) T ss_pred CCcccHHHHHHHHHhCCcEEEEecCCCHHHHHHHHHHHHHHHhCCCcEEEEEcCCCCCCHHHHHHHHhhcCCCcEEEecc Confidence 000 00112222345678889999877654 44444456677788888999999999999888 Q ss_pred ceeEeeccCCceEEech---hHHHHHHHHhhhccCCceecCCCceecceeeceeecccccCCCchhhhhhcccceEEEE- Q lcl|Aclame:pro 187 DWKSYNTDKKAYDTDYA---VARACALQAYIDKTVGWHKNISNVELDGVTGITKAVEFDINESSTEANYLNEKGITICL- 262 (393) Q Consensus 187 ~~~~~~~~~~~~~~~p~---S~~~ag~~a~~D~~~G~~~spaN~~l~gv~~~~~~~~~~~~~~~~~~~~ln~~gi~~~~- 262 (393) +..+. ..++....+|+ ++.+||++|..| +.+||.|+++. ..++.. .+++.|++.|.++|+.++. T Consensus 395 ~~~~~-~~dg~~~~~~~~~~aa~vAGl~Ag~~----~~~SlT~~~i~-~~~v~~------~~t~~e~e~ai~~Gvl~l~~ 462 (587) T protein:vir:95 395 SGTFV-MDDGRKNHVPAYMVAVALGGLASGLE----IGESITFKPLR-VSSLDQ------IYESIDLDELNENGIISIEF 462 (587) T ss_pred cceEe-cCCCceeeechHHHHHHHHHHHhcCc----hhcCccceeee-cccccc------cCCHHHHHHHHhCCeEEEEE Confidence 75543 23455666777 789999999987 77899999986 344433 4678899999999999984 Q ss_pred -eCCC---EE-EEecccC--CCCcccceeehhhHHHHHHHHHHHHh-HHhhcccCCHHHHHHHHHHHHHHHHHHhhcccc Q lcl|Aclame:pro 263 -NHNG---FR-YWGSRTL--ATDTRWAFQQSVRTAQIIKETIGAGL-AWAVDMPLTPLRVKTMLEAINNKLRSWASGDDP 334 (393) Q Consensus 263 -~~~G---~~-~wG~rT~--~~d~~~~~i~~rR~~~~i~~~i~~~~-~~~v~epn~~~~~~~i~~~i~~~l~~l~~~g~~ 334 (393) ++++ ++ +.|-.|. ..|+.|++|+++|++|+|.+.+++.+ .+|+++||+...|..++..+..||..|++.|.+ T Consensus 463 ~~~~~~~~vriv~~itT~t~~~d~~~~~i~viRv~D~i~~dir~~~~~~~iGk~nn~~~r~~v~~~i~~~L~~l~~~gaI 542 (587) T protein:vir:95 463 VRNRTNTFFRIVDDVTTFNDKSDPVKAEMAVGEANDFLVSELKVQLEDQFIGTRTINTSASIIKDFIQSYLGRKKRDNEI 542 (587) T ss_pred ecCCcceEEEEeecceeccCCCCcchhhhhhhhhHHHHHHHHHHHHHhhCCccccchHHHHHHHHHHHHHHHHHHhCCcc Confidence 3443 33 2444554 45788999999999999999999886 599999999999999999999999999998854 Q ss_pred cccceEEEecCCCCHHHhhCCEEEEEEEEEecCcceeEEEEEEEcchHHHH Q lcl|Aclame:pro 335 RILGARVWVAEEITADIIKSGKFVIKYDYHWIPSLESLGLEQRVNDEYVVD 385 (393) Q Consensus 335 ~~~~~~v~~~~~nt~~~i~~G~~~~~v~~~p~~p~e~i~~~~~~~~~~~~~ 385 (393) .+|.. ++..-+....+++|++.+.|+.|+|+|.+++.+.++-++. T Consensus 543 --~~~~~----~dv~v~~~~d~~~v~~~v~Pv~~mekI~vt~~~~~~~~~~ 587 (587) T protein:vir:95 543 --QDFPA----EDVQVIVEGNEARISMTVYPIRSFKKISVSLVYKQQTLQA 587 (587) T ss_pred --cCCCc----cceEEEecCCEEEEEEEEEEcccceEEEEEEEEeeeeecC Confidence 33322 1111223556899999999999999999999999988887 No 39 >protein:vir:102819 Length: 648 # NCBI annotation: tail sheath protein # Family: family:all:2449 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874082;genbank:gi:118197689;genbank:GeneID:4496011 Probab=100.00 E-value=1.3e-34 Score=206.28 Aligned_cols=359 Identities=12% Similarity=0.082 Sum_probs=233.3 Q ss_pred CCCCcccC------CCeEEEEcCCCcccccccccceeEEEEeecccccccccccceEEeecchhhhhhcccccchhhhhh Q lcl|Aclame:pro 1 MSILDTYL------HGVEVVEVNAGGVTISTAATSVIGVVCTGDQADAETFPLNTPVLITNPLNYLEKAGSTGTLRRTLN 74 (393) Q Consensus 1 m~m~~~~~------~GV~v~ev~~~~~~i~~v~tav~g~vg~a~~~d~~~~p~~~~vl~t~~~~~~~~~g~~~tl~~~~~ 74 (393) |+|..+|- |||||+|++++.++|..+.|++.+|+|.++.. |+++|++++++.++...||. +.|..++. T Consensus 1 ma~~~yf~~~~~~~PGVyvee~~sg~~~i~gv~tsva~fvG~a~~G-----p~~~p~~v~s~~~~~~~fgg-g~l~~av~ 74 (648) T protein:vir:10 1 MAISVYFDGKLIKQLGAYVKTDLSAVKQINGVGTGIVALLGLAEGG-----ETYKPYRLTSFAEAVSIFKG-GPLLEHIK 74 (648) T ss_pred CeeeeeeCCCCccCCceEEEEeccccccccCCCCceEEEEEeeCCC-----CCceeEEecCHHHHHHHhcC-ccHHHHHH Confidence 88777665 99999999999999999999999999999877 78999999999999999975 56899999 Q ss_pred hhhcccCceEEEEEeccccccccccc---------------------------------------------chhccc--- Q lcl|Aclame:pro 75 SIGSIVKTPTVIVRVAESDDSDTLTA---------------------------------------------NIVGTQ--- 106 (393) Q Consensus 75 ~~~~~~~~~~~vv~~~~~~~~~~~~~---------------------------------------------~~~~~~--- 106 (393) .+|.+++..+++++++.......+.. .+.... T Consensus 75 ~~F~nGg~~~~~vRv~~~~~a~~~~~~~~~~a~~~g~~gn~i~~~v~~~~~~~~~~~~l~v~~~~~~~~~d~~v~~i~~~ 154 (648) T protein:vir:10 75 AAFIGGAGEVVAVRIGNPTTASVSIPVAQNTSDTSPANLNFVSYEASTRSNQIYVSFDLDENFTSANEADDTIIFTIYQK 154 (648) T ss_pred HHHhCCCcEEEEEEcCCCcccceecceeEEeecccCCCCCceEEEEEEcCCCcCceeEEEEEecCCCcccceeEEEeccC Confidence 99999999999888643221111000 000000 Q ss_pred -----------------ccc---cccc----------hh------------------hhh-------------------- Q lcl|Aclame:pro 107 -----------------ENG---KFTG----------IK------------------ALL-------------------- 118 (393) Q Consensus 107 -----------------~~~---~~~g----------l~------------------al~-------------------- 118 (393) ++. .+.+ +. ... T Consensus 155 ~~~y~gt~~~~t~~v~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~s~~~~~d~~ 234 (648) T protein:vir:10 155 HPDFSVTRETFTFPRKFTTPTVLVKRGSTLFFVDRSIVNAALAAGPAFQTALINLLKEQLQPTDVVQIFDASDTNPVDIP 234 (648) T ss_pred CCcccccceeccccccccccccccccccceeecCccchhhhhccCccchhhhhhchhhhhhhhhhheecccccccccccc Confidence 000 0000 00 000 Q ss_pred -------h-------------hhhhhhhc--------------------------------------------------- Q lcl|Aclame:pro 119 -------T-------------AQSTVFVK--------------------------------------------------- 127 (393) Q Consensus 119 -------~-------------~~~~~~~~--------------------------------------------------- 127 (393) . .....+-. T Consensus 235 ~~~~~~~a~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~~tp~~~~~~~~~~~~~~~~~~~~~v~~~~~~~l~~~~~ 314 (648) T protein:vir:10 235 LGLFVYEVLYGGLFGFTKSRLVKTSFGTVDDLLSNPLLFNLSATPFFDGSDYQDYTSLSDPANWFAKDAYTINHLVDTTI 314 (648) T ss_pred cccccccccchhhhcCCcchhhhhhhccccccccccceecccccccccccceeeeeccccccceeeeeccchhhcccccc Confidence 0 00000000 Q ss_pred -cccc-------------ccc-------------cc--------------------------------chHHHHHHHHHh Q lcl|Aclame:pro 128 -PKLL-------------CVP-------------QH--------------------------------DNQAVATELLSV 148 (393) Q Consensus 128 -~~~l-------------~ap-------------g~--------------------------------s~~~v~~al~~~ 148 (393) |... ..| .| ..+++++.+++| T Consensus 315 ~p~~~~~~~t~L~GGtdG~~p~s~~~~~~~~~~~d~~d~l~~~~~~~~~~ivp~~~~~~~~~~~~~lt~~q~i~a~a~sh 394 (648) T protein:vir:10 315 NPHILATRIFSLSGGTNGDDGTGYYQTAVSNYINIWSQGLATLEEEEVNFVIPAYKFTNVTQLNDRLTIFKGIASTFLSH 394 (648) T ss_pred cCcccccccceecccccCCCcccccccccccchhhHHHHhhhccCCCceEEEeecccccccccccccCCccchHHHHHHH Confidence 0000 000 00 113456666677 Q ss_pred hcccc----------eEEEEecCCCCcchhh--hhhhcccccceEE-----------EeccceeEeeccCCceEEech-- Q lcl|Aclame:pro 149 AKKLN----------AFAFISDNGATTKEQA--YTYRQNFSQREGM-----------MIFGDWKSYNTDKKAYDTDYA-- 203 (393) Q Consensus 149 a~~~~----------~~~~i~~~~~~~~~~a--~~~~~~~~s~~~~-----------~~~p~~~~~~~~~~~~~~~p~-- 203 (393) ++.+. .+..+..+|+.+..++ .-.+..+++.++. ..+.+.. ....++...+|| T Consensus 395 v~~~s~~~~~~~r~~~~~~vg~~~~es~~~se~~~~~~~~~~~~a~~~~~d~~~~~~~~~~~~~--~~~~G~~~~~p~~~ 472 (648) T protein:vir:10 395 VQTMSQVNRRKARVGVFGLPAPSPNESVTASEYLYNRNILNTISAMFGGTDRAQAVVFPFYSNV--FNDEGKVELLGGEF 472 (648) T ss_pred HHHhhhccccccccCeEEEeCCCCchhHHHHHHHhhhhcccccceeeeecCCceEEeeccccee--ECCCCcEEecchhh Confidence 76442 2333334444443222 2222333332221 1122222 223566677888 Q ss_pred -hHHHHHHHHhhhccCCceecCCCceecceeeceeecccccCCCchhhhhhcccceEEEEe--CC----CEEEEecccC- Q lcl|Aclame:pro 204 -VARACALQAYIDKTVGWHKNISNVELDGVTGITKAVEFDINESSTEANYLNEKGITICLN--HN----GFRYWGSRTL- 275 (393) Q Consensus 204 -S~~~ag~~a~~D~~~G~~~spaN~~l~gv~~~~~~~~~~~~~~~~~~~~ln~~gi~~~~~--~~----G~~~wG~rT~- 275 (393) .+++||+++.+ .++.||.||++++ .++.. ...+++.|.+.|++.||+++.. ++ ++++--+-|. T Consensus 473 ~Aa~VAGl~a~l----~~~~s~T~k~i~~-~~id~----~~~~t~~qld~L~~~Gv~~ie~~~~~~~~~~~rvv~gITT~ 543 (648) T protein:vir:10 473 FASYVAGMHANR----EPQDSITFLPISG-IGAEP----LYNWTYTQKDDLISNRVLFVEKVKTSFGGIVYRIHHNPTTW 543 (648) T ss_pred HHHHHHhhhhcc----ccccCcccceeec-ccccc----ccCCCHHHHHHHhcCCcEEEEEecCCcceeeEEEeccceee Confidence 67788888886 5999999999984 33332 1236788999999999999943 32 2333322222 Q ss_pred --CCCcccceeehhhHHHHHHHHHHHHh-HHhhcccCCHHHHHHHHHHHHHHHHHHhhccccccc-ceEEEecCCCCHHH Q lcl|Aclame:pro 276 --ATDTRWAFQQSVRTAQIIKETIGAGL-AWAVDMPLTPLRVKTMLEAINNKLRSWASGDDPRIL-GARVWVAEEITADI 351 (393) Q Consensus 276 --~~d~~~~~i~~rR~~~~i~~~i~~~~-~~~v~epn~~~~~~~i~~~i~~~l~~l~~~g~~~~~-~~~v~~~~~nt~~~ 351 (393) +.++.|+.|+++|+.|++.+.+++.+ .+|+++||+...|.++++.+.+||.++++.+....+ ...+.. + T Consensus 544 ~~~~~~~~~eisv~ri~D~l~~~vr~~l~~~fIG~~n~~~~~~~ik~~i~~~L~~~~~~~~I~~y~~~~v~~-------~ 616 (648) T protein:vir:10 544 LGPVTQGFQEFVLRRIDDFLQSYVYKNLQEQFIGRKSYGRKTENDIKVYTEALLSNLVGKQIVAYKDVKVTS-------N 616 (648) T ss_pred cCCCCcceeeeeeeehhhHHHHHHHHHHhhhcCcccccHHHHHHHHHHHHHHHhhHhhcCcccCcccceEEE-------E Confidence 35788999999999999999998754 599999999999999999999999999987654322 122222 2 Q ss_pred hhCCEEEEEEEEEecCcceeEEEEEEEcchHH Q lcl|Aclame:pro 352 IKSGKFVIKYDYHWIPSLESLGLEQRVNDEYV 383 (393) Q Consensus 352 i~~G~~~~~v~~~p~~p~e~i~~~~~~~~~~~ 383 (393) ..++++++++.+.|++|++||.+++.++.+.- T Consensus 617 ~~~~vv~V~~~v~Pv~~i~~I~vti~it~~~~ 648 (648) T protein:vir:10 617 EDKTVYYVEFFYQPVTEIKFILVTMKVTFDLE 648 (648) T ss_pred ecCCEEEEEEEEEecceeeEEEEEEEEEeccC Confidence 35699999999999999999999999877643 No 40 >protein:vir:96586 Length: 587 # NCBI annotation: ORF011 # Family: family:all:2449 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238553;genbank:gi:66391266;genbank:GeneID:5130368 Probab=100.00 E-value=5.9e-34 Score=202.74 Aligned_cols=361 Identities=12% Similarity=0.059 Sum_probs=244.9 Q ss_pred CCCCcccCCCeEEEEcCCCcccccccccceeEEEEeecccccccccccceEEeecchhhhhhcccccchhhhhhhhh--- Q lcl|Aclame:pro 1 MSILDTYLHGVEVVEVNAGGVTISTAATSVIGVVCTGDQADAETFPLNTPVLITNPLNYLEKAGSTGTLRRTLNSIG--- 77 (393) Q Consensus 1 m~m~~~~~~GV~v~ev~~~~~~i~~v~tav~g~vg~a~~~d~~~~p~~~~vl~t~~~~~~~~~g~~~tl~~~~~~~~--- 77 (393) -|+-..-+||||+++.+++..++...++.+.+|+|.++.. |.++|++++++.++...+|.. .|.+++...+ T Consensus 6 ~~~~~~~~Pgv~~~~~~~~~~~~~~~~~~~~~~iG~a~~g-----~~~~~~~~~~~~~~~~~~g~G-~l~~ai~~a~~~~ 79 (587) T protein:vir:96 6 FPRRPIQRPHASIEVDSSGIGGSASNSEKILCLIGKAEGG-----EPNTVYQVRNYAQAKSVFRSG-ELLDAIELAWGSN 79 (587) T ss_pred eCCCcccCCceEEEEecCCccCCCCCCCceEEEEEEecCC-----CCceeEEEcChHHHHHhhcCC-cHHHHHHHHhccC Confidence 3333344799999999999999999999999999999877 668999999999999998764 3666665544 Q ss_pred -cccCceEEEEEecccccccccc---------------------------------------------cchh--c----- Q lcl|Aclame:pro 78 -SIVKTPTVIVRVAESDDSDTLT---------------------------------------------ANIV--G----- 104 (393) Q Consensus 78 -~~~~~~~~vv~~~~~~~~~~~~---------------------------------------------~~~~--~----- 104 (393) .+++..++.+++........+. +... . T Consensus 80 ~~~g~~~~~a~rv~~~~~a~~~~~~~~~~~~~~g~~~n~i~v~~~~~~~~~~~~~~~~~~~~~~~~~~~n~G~v~~i~y~ 159 (587) T protein:vir:96 80 PQYTAGKILAMRVEDAKASQLEKGGLRVTSKIFGSVSNDIQVALEKNTITDSLRLRVVFQKDNYQEVFDNLGNIFSINYK 159 (587) T ss_pred cCCCceEEEEEecCCCccceeecccccccccccCCCCceEEEEEEeccCCCccceEEEEecCCceeeccccCceEEEEec Confidence 4555555555442211100000 0000 0 Q ss_pred ccc------------------------------------------------------cccccchh----------hhh-- Q lcl|Aclame:pro 105 TQE------------------------------------------------------NGKFTGIK----------ALL-- 118 (393) Q Consensus 105 ~~~------------------------------------------------------~~~~~gl~----------al~-- 118 (393) +.. +..+.|.. .+. T Consensus 160 g~~~~a~~~~~~~~~~~~A~~l~l~gg~~~v~~yrl~~g~~~~~~~~~~~~~~~~~~tAky~g~~~n~~~v~v~d~~~~~ 239 (587) T protein:vir:96 160 GEGEKATFSVEKDKETQEAKRLVLKVDEKEVKAYELNGGAYSFTNEIITDINELPDFEAKLSPFGDKNLESRKLDEATDV 239 (587) T ss_pred ccccceeEeeccCcccceeeeeEEEecCceEEEEEeCCCchhhhhhhhhhhccccceEEEeecccCceeEEEeecccccc Confidence 000 00000000 000 Q ss_pred --------------hhhhhhh-----------------------------------------------h---c----c-- Q lcl|Aclame:pro 119 --------------TAQSTVF-----------------------------------------------V---K----P-- 128 (393) Q Consensus 119 --------------~~~~~~~-----------------------------------------------~---~----~-- 128 (393) ....... + . + T Consensus 240 ~~k~~~~y~~t~~~di~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~aLtGG~dG~~~~~ 319 (587) T protein:vir:96 240 DIKGKAVYVKAVFGDIENQTQYNQYVKFEQLPEQASEPSDVEVHAETESATVTATSKPKAIEPFELTKLSGGTNGEPPTS 319 (587) T ss_pred ccceEEEeehhhhhhhhhhhccccceeeccccchhhhhhcccccccccceeeeecccccccccccceeeecCCCCCCccc Confidence 0000000 0 0 0 Q ss_pred ------------ccccccccchHHHHHHHHHhhcccce----EEE-EecCCCCcchhhhhhhcccccceEEEeccceeEe Q lcl|Aclame:pro 129 ------------KLLCVPQHDNQAVATELLSVAKKLNA----FAF-ISDNGATTKEQAYTYRQNFSQREGMMIFGDWKSY 191 (393) Q Consensus 129 ------------~~l~apg~s~~~v~~al~~~a~~~~~----~~~-i~~~~~~~~~~a~~~~~~~~s~~~~~~~p~~~~~ 191 (393) -..+++...+++++..+.++|++++. +.. +...++.+.+++...+..+++.+.++++++.... T Consensus 320 y~~~l~ale~~~~~~i~~~t~d~ai~~~l~a~vk~~r~~gk~~~aVlg~~~~~~~~~~~~~a~~~n~e~vi~v~~~~~~~ 399 (587) T protein:vir:96 320 WSAKLEKFKNEGGYYIVPLTDRQSVHSEVATFVKNRSDAGEPMRAIVGGGTSETKEKLFGRQAILNNPRVALVANSGKFV 399 (587) T ss_pred HHHHHHHHhhCCcEEEEecCCCHHHHHHHHHHHHHHHhCCCeEEEEecCCCCCCHHHHHHHHhhcCCCcEEEEecceEEe Confidence 00001111234577888889977653 444 4345566778888888999999999988887765 Q ss_pred eccCCceEEech---hHHHHHHHHhhhccCCceecCCCceecceeeceeecccccCCCchhhhhhcccceEEEEe--CCC Q lcl|Aclame:pro 192 NTDKKAYDTDYA---VARACALQAYIDKTVGWHKNISNVELDGVTGITKAVEFDINESSTEANYLNEKGITICLN--HNG 266 (393) Q Consensus 192 ~~~~~~~~~~p~---S~~~ag~~a~~D~~~G~~~spaN~~l~gv~~~~~~~~~~~~~~~~~~~~ln~~gi~~~~~--~~G 266 (393) +. .+.....|+ ++.+||++|..+ +++||.|+.+.+ .++.. .+++.|.+.+.++|+.++.. +++ T Consensus 400 ~~-~~~~~~~~~~~~aa~vAG~~Ag~~----~~~S~T~~~~~~-~~v~~------~~t~~e~~~~i~~G~~~l~~~~~~~ 467 (587) T protein:vir:96 400 MG-NGRILQAPAYMVASAVAGLVSGLD----IGESITFKPLFV-NSLDK------VYESEELDELNENGIITIEFVRNRM 467 (587) T ss_pred cC-CCceeeechhhHHHHHHHHHhcCc----cccCccceeeec-ccccc------cCCHHHHHHHHhCCeEEEEEecCCc Confidence 53 334444443 678999999886 889999999874 44443 36788999999999999853 555 Q ss_pred EEEEec-ccC-----CCCcccceeehhhHHHHHHHHHHHHh-HHhhcccCCHHHHHHHHHHHHHHHHHHhhcccccccce Q lcl|Aclame:pro 267 FRYWGS-RTL-----ATDTRWAFQQSVRTAQIIKETIGAGL-AWAVDMPLTPLRVKTMLEAINNKLRSWASGDDPRILGA 339 (393) Q Consensus 267 ~~~wG~-rT~-----~~d~~~~~i~~rR~~~~i~~~i~~~~-~~~v~epn~~~~~~~i~~~i~~~l~~l~~~g~~~~~~~ 339 (393) .++|.. +++ ..++.|++|+++|++|+|.+.+++.+ .+|+++||+...|..++..+..||.+|++.|.+ .+| T Consensus 468 ~~v~~~vnsitT~t~~~~~~~~~i~virv~D~i~~di~~~~~~~yiGk~nn~~~r~~v~~~i~~~L~~l~~~g~I--~~~ 545 (587) T protein:vir:96 468 TTMFRIVDDVTTFPDKNDPVKSEMALGEANDFLVSELKILLEEQYIGTRTINTSASQIKDFVQSYLGRKKRDNEI--QDF 545 (587) T ss_pred EEEEEeeccceecCCCCCchhhhhhhHHHHHHHHHHHHHHHHhcCCccccCHHHHHHHHHHHHHHHHHHHhCCcc--cCC Confidence 667733 443 33667999999999999999999987 589999999999999999999999999998854 333 Q ss_pred EEEecCCCCHHHhhCCEEEEEEEEEecCcceeEEEEEEEcchHHHH Q lcl|Aclame:pro 340 RVWVAEEITADIIKSGKFVIKYDYHWIPSLESLGLEQRVNDEYVVD 385 (393) Q Consensus 340 ~v~~~~~nt~~~i~~G~~~~~v~~~p~~p~e~i~~~~~~~~~~~~~ 385 (393) .. ++-.-++.+.+++|++.+.|+.|+|+|.+++.+.++-++. T Consensus 546 ~~----~dv~v~~~~D~~~v~~~v~Pv~~mekIy~tv~~~~~~~~~ 587 (587) T protein:vir:96 546 PP----EDVQVIIEGNEARISLTIFPIRALKKISVSLVYRQQTLQA 587 (587) T ss_pred Cc----cceEEEecCCEEEEEEEEEEcccceEEEEEEEEEeeeecC Confidence 22 1111123445799999999999999999999998888886 No 41 >protein:vir:99306 Length: 587 # NCBI annotation: putative major tail sheath protein # Family: family:all:2449 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024479;genbank:gi:48696438;genbank:GeneID:2948034 Probab=100.00 E-value=7.1e-34 Score=202.31 Aligned_cols=361 Identities=14% Similarity=0.079 Sum_probs=246.0 Q ss_pred CCC-----CcccCCCeEEEEcCCCcccccccccceeEEEEeecccccccccccceEEeecchhhhhhcccccchhhhhhh Q lcl|Aclame:pro 1 MSI-----LDTYLHGVEVVEVNAGGVTISTAATSVIGVVCTGDQADAETFPLNTPVLITNPLNYLEKAGSTGTLRRTLNS 75 (393) Q Consensus 1 m~m-----~~~~~~GV~v~ev~~~~~~i~~v~tav~g~vg~a~~~d~~~~p~~~~vl~t~~~~~~~~~g~~~tl~~~~~~ 75 (393) |+. -..-+||||+++..++.++...+++.+.+|+|.++.. |.++|++++++.++...|+.. .|.++++. T Consensus 1 ~a~~~~~~~~~~~pgv~~~~~~~~~~~~~~~~~~~~~~~g~a~~G-----~~~~~~~~~~~~~~~~~~~~g-~l~~~~~~ 74 (587) T protein:vir:99 1 MAVEPFPRRPITRPHASIEVDTSGIGGSAGSSEKVFCLIGQAEGG-----EPNTVYELRNYSQAKRLFRSG-ELLDAIEL 74 (587) T ss_pred CcccccCCcccccCceEEEEecCCccccCCCCCceEEEEEEecCC-----ccceeEEeccHHHHHHHhcCc-chHHHHHH Confidence 542 2233699999999999999999999999999999887 568999999999999999763 36666655 Q ss_pred hh----cccCceEEEEEeccccccc---------------------------------------------ccccchh--- Q lcl|Aclame:pro 76 IG----SIVKTPTVIVRVAESDDSD---------------------------------------------TLTANIV--- 103 (393) Q Consensus 76 ~~----~~~~~~~~vv~~~~~~~~~---------------------------------------------~~~~~~~--- 103 (393) .+ .+++..++++++....... ...+.+. T Consensus 75 a~~~~~~~g~~~~~~~rv~~~~~a~~~~~~l~~~a~~~G~~gN~i~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~v~ 154 (587) T protein:vir:99 75 AWGSNPNYTAGRILAMRIEDAKPASAEIGGLKITSKIYGNVANNIQVGLEKNTLSDSLRLRVIFQDDRFNEVYDNIGNIF 154 (587) T ss_pred HhccccCCCceEEEEEEcCCCceeEEEecCeEEEEeeccccccceEEEEccCCCCcceeEEEEEecccceeeeeecccee Confidence 55 3455555544332211000 0000000 Q ss_pred -----ccc---------cc-c-------------------cccc-----hhhhhhhhh---------------------- Q lcl|Aclame:pro 104 -----GTQ---------EN-G-------------------KFTG-----IKALLTAQS---------------------- 122 (393) Q Consensus 104 -----~~~---------~~-~-------------------~~~g-----l~al~~~~~---------------------- 122 (393) |.. +. . ...| ..+...... T Consensus 155 ~i~y~g~~~~a~~~v~~~~~t~~a~~~~l~~g~~~v~~yrL~~g~~~~~~~~~~~i~~~~~~tAky~~~~~~~i~~~~~~ 234 (587) T protein:vir:99 155 TIKYKGEEANATFSVEHDEETQKASRLVLKVGDQEVKSYDLTGGAYDYTNAIITDINQLPDFEAKLSPFGDKNLESSKLD 234 (587) T ss_pred eEEeecccccceeeEeecCcceeeeeeeeecCCceeEEEEecCCchHHHHHHHhhhccccceeEEeeccCCceeEeeccc Confidence 000 00 0 0000 000000000 Q ss_pred -------------------------------------------------------------------------hhhh-c- Q lcl|Aclame:pro 123 -------------------------------------------------------------------------TVFV-K- 127 (393) Q Consensus 123 -------------------------------------------------------------------------~~~~-~- 127 (393) ..+- . T Consensus 235 ~~~~~~v~~~~~~v~a~~~D~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~t~LtGG~dG 314 (587) T protein:vir:99 235 KIENANIKDKAVYVKAVFGDLEKQTAYNGIVSFEQLNAEGEVPSNVEVEAGEESATVTATSPIKTIEPFELTKLKGGTNG 314 (587) T ss_pred ccccceeeeeeeeeehhccceeeecccceeeeeeecccccchhhhhhhhhccccceeeeeccccceecccceeeecCCCC Confidence 0000 0 Q ss_pred ----------------cccccccccchHHHHHHHHHhhcccce----EE-EEecCCCCcchhhhhhhcccccceEEEecc Q lcl|Aclame:pro 128 ----------------PKLLCVPQHDNQAVATELLSVAKKLNA----FA-FISDNGATTKEQAYTYRQNFSQREGMMIFG 186 (393) Q Consensus 128 ----------------~~~l~apg~s~~~v~~al~~~a~~~~~----~~-~i~~~~~~~~~~a~~~~~~~~s~~~~~~~p 186 (393) .-..+.+..+++++++++.+||++++. +. ++..+++.+.+++......+++.+...+.+ T Consensus 315 ~~~~sy~~al~ale~~~~~~i~~~t~d~~i~a~l~a~vk~~r~~g~~~~aVlg~~~~~~~~~~~~~a~~~n~e~vi~v~~ 394 (587) T protein:vir:99 315 EPPATWADKLDKFAHEGGYYIVPLSSKQSVHAEVASFVKERSDAGEPMRAIVGGGFNESKEQLFGRQASLSNPRVSLVAN 394 (587) T ss_pred CccccHHHHHHHHhhCCcEEEEecCCCHHHHHHHHHHHHHHHhCCCcEEEEecCCCCCCHHHHHHHhhhcCCCcEEEEec Confidence 001112222345677888999877653 33 444456677788899999999999998888 Q ss_pred ceeEeeccCCceEEech---hHHHHHHHHhhhccCCceecCCCceecceeeceeecccccCCCchhhhhhcccceEEEE- Q lcl|Aclame:pro 187 DWKSYNTDKKAYDTDYA---VARACALQAYIDKTVGWHKNISNVELDGVTGITKAVEFDINESSTEANYLNEKGITICL- 262 (393) Q Consensus 187 ~~~~~~~~~~~~~~~p~---S~~~ag~~a~~D~~~G~~~spaN~~l~gv~~~~~~~~~~~~~~~~~~~~ln~~gi~~~~- 262 (393) +..... .++....+|+ ++.+||++|..| +++||.|+++. ..++.. .+++.|++.|.++|+.++. T Consensus 395 ~~~~~~-~dg~~~~~~~~~~aa~vAGl~Ag~~----~~~SlT~~~i~-~~~v~~------~~t~~e~e~li~~Gvl~l~~ 462 (587) T protein:vir:99 395 SGTFVM-DDGRKNHVPAYMVAVALGGLASGLE----IGESITFKPLR-VSSLDQ------IYESIDLDELNENGIISIEF 462 (587) T ss_pred cceEec-CCCceeeechHHHHHHHHHHHhcCc----hhcCccceeee-cccccc------cCCHHHHHHHHhCCeEEEEE Confidence 754432 3455566776 788999999887 88999999986 445443 4678899999999999984 Q ss_pred -eCCC---EEE-EecccC--CCCcccceeehhhHHHHHHHHHHHHh-HHhhcccCCHHHHHHHHHHHHHHHHHHhhcccc Q lcl|Aclame:pro 263 -NHNG---FRY-WGSRTL--ATDTRWAFQQSVRTAQIIKETIGAGL-AWAVDMPLTPLRVKTMLEAINNKLRSWASGDDP 334 (393) Q Consensus 263 -~~~G---~~~-wG~rT~--~~d~~~~~i~~rR~~~~i~~~i~~~~-~~~v~epn~~~~~~~i~~~i~~~l~~l~~~g~~ 334 (393) ++++ +++ .+-.|. ..|+.|++|+++|++|+|.+.+++.+ .+|+++||+...|..++..+..||..|++.|.+ T Consensus 463 ~~~~~~~~vriv~~ItT~t~~~~~~~~~i~viRv~D~i~~di~~~~~~~yiGk~Nn~~~r~~i~~~i~~~L~~l~~~gaI 542 (587) T protein:vir:99 463 VRNRTNTFFRIVDDVTTFNDKSDPVKAEMAVGEANDFLVSELKVQLEDQFIGTRTINTSASIIKDFIQSYLGRKKRDNEI 542 (587) T ss_pred ecCCcceEEEEeeceeeccCCCCchhhhhhhhhhHHHHHHHHHHHHHhhCCccccchHHHHHHHHHHHHHHHHHHhCCcc Confidence 3333 442 444443 45778999999999999999999886 689999999999999999999999999998854 Q ss_pred cccceEEEecCCCCHHHhhCCEEEEEEEEEecCcceeEEEEEEEcchHHHH Q lcl|Aclame:pro 335 RILGARVWVAEEITADIIKSGKFVIKYDYHWIPSLESLGLEQRVNDEYVVD 385 (393) Q Consensus 335 ~~~~~~v~~~~~nt~~~i~~G~~~~~v~~~p~~p~e~i~~~~~~~~~~~~~ 385 (393) .+|.. +.-.-+....+++|++.+.|+.|+|+|.+++.+.++-++. T Consensus 543 --~~~~~----~dv~v~~~~d~~~v~~~v~Pv~~mekIy~tv~~~~~~~~~ 587 (587) T protein:vir:99 543 --QDFPA----EDVQVIVEGNEARISMTVYPIRSFKKISVSLVYKQQTLQA 587 (587) T ss_pred --cCCCc----cceEEEecCCEEEEEEEEEEcccceEEEEEEEEEeeeecC Confidence 33322 1111123445799999999999999999999999998887 No 42 >protein:vir:107310 Length: 581 # NCBI annotation: gp123 # Family: family:all:1196 # MgeID: mge:1550 # MgeName: Catera # Cross-refs: genbank:acc:YP_656131;genbank:gi:109393333;genbank:GeneID:4156791 Probab=100.00 E-value=1e-34 Score=206.96 Aligned_cols=366 Identities=11% Similarity=0.039 Sum_probs=206.9 Q ss_pred CCCCccc-CC----CeEEEEcCCCcc-cccccccceeEEEE-eecc-cccc------cccc-cceEEeecchhhhhhccc Q lcl|Aclame:pro 1 MSILDTY-LH----GVEVVEVNAGGV-TISTAATSVIGVVC-TGDQ-ADAE------TFPL-NTPVLITNPLNYLEKAGS 65 (393) Q Consensus 1 m~m~~~~-~~----GV~v~ev~~~~~-~i~~v~tav~g~vg-~a~~-~d~~------~~p~-~~~vl~t~~~~~~~~~g~ 65 (393) |+-..++ .- |+.....++... .++.-.-+++..++ ..-+ .+.. .+|- .+.+..++..+.....+. T Consensus 177 ~~~~~~~~~~gsd~~~~~~~~~~~~~~~~~~D~~t~~~~~~g~~~~~~~v~~~~~~~~d~~~~~~v~~~~~~~~~~~~~~ 256 (581) T protein:vir:10 177 NPNSGQVYVLGTDYVVTRVNAGEDGEANTRDDLYTIQRVVDGGHIDPGDIVQLSYRYTDPNYHEVIRFTDPDDIQDFYGP 256 (581) T ss_pred ccccCcceeccccceeeecccCccccccccccceeeeeeecccccccceEEEEEEEeecCCcceeEEeecCcchhhhhhh Confidence 1111100 00 111111111110 00000000111110 0000 0000 0000 011222222222111111 Q ss_pred --------ccchhhhhhhhhcccCceEEEEEecccccccccccchhcccccccccchhhhhhhhhhhhhccccccccccc Q lcl|Aclame:pro 66 --------TGTLRRTLNSIGSIVKTPTVIVRVAESDDSDTLTANIVGTQENGKFTGIKALLTAQSTVFVKPKLLCVPQHD 137 (393) Q Consensus 66 --------~~tl~~~~~~~~~~~~~~~~vv~~~~~~~~~~~~~~~~~~~~~~~~~gl~al~~~~~~~~~~~~~l~apg~s 137 (393) ...+.........++....+...+....+ ..+......++.+++..+ ...++.|+.. T Consensus 257 ~~~~~g~~~~~~t~~~~~~~tn~~~~~l~~gvd~~g~---------tvt~~dy~~Al~ale~~~------~~~ivv~~t~ 321 (581) T protein:vir:10 257 AFDEAGNVQSEITLCAQLAITNGASTILACAVDPEGD---------TVTMGDYQNALNKFRDED------EIAIIVAGTG 321 (581) T ss_pred hhhccCccccchhhhheeeeecccceeEEeeccCCCC---------ccchHHHHHHHHHHhcCC------ceEEEEeCCC Confidence 01111111111111111111111100000 001112233444444322 2345577888 Q ss_pred hHHHHHHHHHhhcccc-----eEEE--EecCCC-CcchhhhhhhcccccceEEEeccceeEeeccC-CceEEechhHHHH Q lcl|Aclame:pro 138 NQAVATELLSVAKKLN-----AFAF--ISDNGA-TTKEQAYTYRQNFSQREGMMIFGDWKSYNTDK-KAYDTDYAVARAC 208 (393) Q Consensus 138 ~~~v~~al~~~a~~~~-----~~~~--i~~~~~-~~~~~a~~~~~~~~s~~~~~~~p~~~~~~~~~-~~~~~~p~S~~~a 208 (393) ..++++.+.+||+++. .+.. +...+. .+.+++++....+++.|..+++|+....+... ++...+|+ .++| T Consensus 322 ~~~v~a~l~ahv~~~s~~~~~~ravigV~g~~~~~~~~~~~~~a~~~n~~Rvvlv~p~~~~~~g~~~~~~v~lp~-y~~A 400 (581) T protein:vir:10 322 AQPIQALVQQHVSAQSNNKYERRAILGMDGSVTPVPSATRIANAQSIKDQRVALISPSSFVYYAPELNREVVLGG-QFMA 400 (581) T ss_pred CHHHHHHHHHHHHHHHhccCCcEEEEEecCCCCCccHHHHHHhhccCCCceEEEEecCceeecCcccCceeccch-hhHH Confidence 8889999999997653 1332 322333 34456677788999999999999987765543 34444555 3334 Q ss_pred HHHHhhhccCCceecCCCceecceeeceeecccccCCCchhhhhhcccceEEEE--eCCCEEE-EecccCCCCcccceee Q lcl|Aclame:pro 209 ALQAYIDKTVGWHKNISNVELDGVTGITKAVEFDINESSTEANYLNEKGITICL--NHNGFRY-WGSRTLATDTRWAFQQ 285 (393) Q Consensus 209 g~~a~~D~~~G~~~spaN~~l~gv~~~~~~~~~~~~~~~~~~~~ln~~gi~~~~--~~~G~~~-wG~rT~~~d~~~~~i~ 285 (393) +.+|-+-.+..+++||.|++++|+.++... +++.|++.|+++|++++. +++|+++ ||-+|+.+|++|++|+ T Consensus 401 A~vAGl~a~~~~~~slT~~~i~gi~~l~~~------~s~~e~e~ll~~Gv~~l~~~~~~~v~Iv~gItT~~s~~~~~~i~ 474 (581) T protein:vir:10 401 AAVAGKSVSAIAAMPLTRKVIRGFSGPAEV------QRDGEKSRESSEGLMVIEKTPRNLVHVRHGVTTDPTSLHTREWN 474 (581) T ss_pred HHHHHHhhccccccCccccccccccccccc------CCHHHHHHHHhCCeEEEEEecCCeEEEEeeeecCCCCCcceeee Confidence 444444444458899999999998877543 567899999999999994 5678775 6778888999999999 Q ss_pred hhhHHHHHHHHHHHHhH--HhhcccCCHHHHHHHHHHHHHHHHHHhhcccccccceEEEecCCCCHHHhhCCEEEEEEEE Q lcl|Aclame:pro 286 SVRTAQIIKETIGAGLA--WAVDMPLTPLRVKTMLEAINNKLRSWASGDDPRILGARVWVAEEITADIIKSGKFVIKYDY 363 (393) Q Consensus 286 ~rR~~~~i~~~i~~~~~--~~v~epn~~~~~~~i~~~i~~~l~~l~~~g~~~~~~~~v~~~~~nt~~~i~~G~~~~~v~~ 363 (393) +||++|++.+.+++.++ +|++|||++.+|.+|+..+++||..||+.|. +.++.. .+.+..+.+.+.+++++.+ T Consensus 475 ~iR~~D~v~~~ir~~~~~~~fIG~~n~~~~r~~ik~~i~~~L~~l~~~g~--I~~~~~---~~~~~~~~~~d~v~V~i~v 549 (581) T protein:vir:10 475 IIGQQDVMVYRIRDYLDADGLIGMPIYDTTIVQVKASAEAALVWLVDNNI--IRGYRN---LKARQIERQPDVIEVRYEW 549 (581) T ss_pred eehhhhHHHHHHHHHhhhhcCCCcccCHHHHHHHHHHHHHHHHHHHhcCc--ccCCcc---ceeeeeecCCCEEEEEEEE Confidence 99999999999999985 5888999999999999999999999999774 444432 2234556788999999999 Q ss_pred EecCcceeEEEEEEEcchH--HHHHHHHHhcC Q lcl|Aclame:pro 364 HWIPSLESLGLEQRVNDEY--VVDLVNTLKAL 393 (393) Q Consensus 364 ~p~~p~e~i~~~~~~~~~~--~~~~~~~~~~~ 393 (393) +|++|+|||.+++++.|+. ++..++---.. T Consensus 550 ~Pv~~i~~I~vti~~~p~~~~~~~~~~~~~~~ 581 (581) T protein:vir:10 550 RPAYPLNYIVVRYSIAPETGDITSTIEGTTSF 581 (581) T ss_pred EecccceEEEEEEEEecCCCceEEEEeccccC Confidence 9999999999999999882 11111111111 No 43 >protein:vir:7653 Length: 581 # NCBI annotation: gp124 # Family: family:all:1196 # MgeID: mge:148 # MgeName: Bxz1 # Cross-refs: genbank:acc:NP_818197;genbank:gi:29566631;genbank:GeneID:1259809 Probab=100.00 E-value=1.8e-34 Score=205.59 Aligned_cols=363 Identities=11% Similarity=0.065 Sum_probs=212.4 Q ss_pred CCCCccc---CCCeE---EEEcCCCccccccccc----------------------ceeEEEE------------eeccc Q lcl|Aclame:pro 1 MSILDTY---LHGVE---VVEVNAGGVTISTAAT----------------------SVIGVVC------------TGDQA 40 (393) Q Consensus 1 m~m~~~~---~~GV~---v~ev~~~~~~i~~v~t----------------------av~g~vg------------~a~~~ 40 (393) -+.+. | .-|++ ..++.+.....-...+ ++...++ .+.-. T Consensus 156 ~~~~~-~~l~~~g~~~~~~~~~~s~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~g~~~~~~~i~~~~~~~~ 234 (581) T protein:vir:76 156 VPAMN-RALAKKGIKTDTIRVVNPNSGQVYVLGTDYVVTRVNAGEDGEANTRDDLYTIQRVVDGGHIDPGDIVQLSYRYT 234 (581) T ss_pred cCCcC-ceeeeccccccccceeecCCcceeeecccccceeeccCcccceeeeeeeeeeEeecccccccceeEEEEEEEee Confidence 11111 1 01332 1111111100000000 0000000 00000 Q ss_pred ccccccccceEEeecchhhhhhcccc--------cchhhhhhhhhcccCceEEEEEecccccccccccchhccccccccc Q lcl|Aclame:pro 41 DAETFPLNTPVLITNPLNYLEKAGST--------GTLRRTLNSIGSIVKTPTVIVRVAESDDSDTLTANIVGTQENGKFT 112 (393) Q Consensus 41 d~~~~p~~~~vl~t~~~~~~~~~g~~--------~tl~~~~~~~~~~~~~~~~vv~~~~~~~~~~~~~~~~~~~~~~~~~ 112 (393) |+.. .+.+...+..+....++.. ..+.......+.++....+...+....+ ..+.....+ T Consensus 235 D~~~---~~~v~~~~~~~~~~~~~~~~~~~g~~~~e~~~~~~~~~t~~~~~~l~~gvd~~g~---------tvt~~dy~~ 302 (581) T protein:vir:76 235 DPNY---HEVIRFTDPDDIQDFYGPAFDEAGNVQSEITLCAQLAITNGASTILACAVDPEGD---------TVTMGDYQN 302 (581) T ss_pred cCCc---cceEEEecccccccceeeehhhcCccccchhhhhheeeccccceEEEeeecCCCC---------ccchHHHHH Confidence 1100 1222222222222221110 1111111111111111111111110000 011112233 Q ss_pred chhhhhhhhhhhhhccccccccccchHHHHHHHHHhhcccc-------eEEEEecCCC-CcchhhhhhhcccccceEEEe Q lcl|Aclame:pro 113 GIKALLTAQSTVFVKPKLLCVPQHDNQAVATELLSVAKKLN-------AFAFISDNGA-TTKEQAYTYRQNFSQREGMMI 184 (393) Q Consensus 113 gl~al~~~~~~~~~~~~~l~apg~s~~~v~~al~~~a~~~~-------~~~~i~~~~~-~~~~~a~~~~~~~~s~~~~~~ 184 (393) ++.+++..+ ...++.|+....++++.+.+||+++. .+..+...+. .+..++++....+++.|..++ T Consensus 303 aL~ale~~~------~~~ivvp~t~~~~i~a~l~ahv~~~s~~~~~~ra~igv~g~~~~~~~~~~~~~a~~~ns~Rvvlv 376 (581) T protein:vir:76 303 ALNKFRDED------EIAIIVAGTGAQPIQALVQQHVSAQSNNKYERRAILGMDGSVTPVPSATRIANAQSIKDQRVALI 376 (581) T ss_pred HHHHHhcCC------eEEEEEecCCChHHHHHHHHHHHHHHhccCCceEEEEeeCCCCCchHHHHHHhhcccCCCcEEEE Confidence 444444322 23445677788888888888886653 2223333333 344566777888999999999 Q ss_pred ccceeEeeccCCceEEechhHHHHHHHHhhhccCCceecCCCceecceeeceeecccccCCCchhhhhhcccceEEEE-- Q lcl|Aclame:pro 185 FGDWKSYNTDKKAYDTDYAVARACALQAYIDKTVGWHKNISNVELDGVTGITKAVEFDINESSTEANYLNEKGITICL-- 262 (393) Q Consensus 185 ~p~~~~~~~~~~~~~~~p~S~~~ag~~a~~D~~~G~~~spaN~~l~gv~~~~~~~~~~~~~~~~~~~~ln~~gi~~~~-- 262 (393) +|+....+..........|..++|+.+|.+..+..+++||.|++++|+.++... +++.|++.|+++|++++. T Consensus 377 ~p~~~~~~g~~~~~~~~lp~~~~AA~vAG~~a~~~~~~slT~~~i~g~~~~~~~------~s~~e~e~ll~~Gv~~l~~~ 450 (581) T protein:vir:76 377 SPSSFVYYAPELNREVVLGGQFMAAAVAGKSVSAIAAMPLTRKVIRGFSGPAEV------QRDGEKSRESSEGLMVIEKT 450 (581) T ss_pred EcCceEeccccCCcceecchhhhhhhHHhhhhccccccCccccccccccccccc------CCHHHHHHHHhCCeEEEEEe Confidence 999887765544444444555666666777777789999999999998876643 567899999999999994 Q ss_pred eCCCEE-EEecccCCCCcccceeehhhHHHHHHHHHHHHhH--HhhcccCCHHHHHHHHHHHHHHHHHHhhcccccccce Q lcl|Aclame:pro 263 NHNGFR-YWGSRTLATDTRWAFQQSVRTAQIIKETIGAGLA--WAVDMPLTPLRVKTMLEAINNKLRSWASGDDPRILGA 339 (393) Q Consensus 263 ~~~G~~-~wG~rT~~~d~~~~~i~~rR~~~~i~~~i~~~~~--~~v~epn~~~~~~~i~~~i~~~l~~l~~~g~~~~~~~ 339 (393) ++++++ +||-+|+.++++|++|++||++|++.+.+++.++ .|++|||++.+|.+|+..+..||..||+.|. +.++ T Consensus 451 ~~~~v~Iv~gItT~~s~~~~k~i~viR~~D~v~~~vr~~~~~~~fiG~~n~~~~r~~ik~~i~~~L~~l~~~g~--I~g~ 528 (581) T protein:vir:76 451 PRNLVHVRHGVTTDPTSLHTREWNIIGQQDVMVYRIRDYLDADGLIGMPIYDTTIVQVKASAEAALVWLVDNNI--IRGY 528 (581) T ss_pred cCCeEEEEEeeecCCCCCccceeeehhhhHHHHHHHHHHHhhhcCCCcccChHHHHHHHHHHHHHHHHHHhcCc--ccCc Confidence 567777 5888999999999999999999999999999986 5788999999999999999999999999774 4444 Q ss_pred EEEecCCCCHHHhhCCEEEEEEEEEecCcceeEEEEEEEcchH--HHHHHHHHhcC Q lcl|Aclame:pro 340 RVWVAEEITADIIKSGKFVIKYDYHWIPSLESLGLEQRVNDEY--VVDLVNTLKAL 393 (393) Q Consensus 340 ~v~~~~~nt~~~i~~G~~~~~v~~~p~~p~e~i~~~~~~~~~~--~~~~~~~~~~~ 393 (393) .. .+.+....+.+.+++++.++|++|+|||.+++++.|+. ++..++---.. T Consensus 529 ~~---~~~~~~~~~~d~v~V~i~v~Pv~~ie~I~vt~~~~p~~~~~~~~~~~~~~~ 581 (581) T protein:vir:76 529 RN---LKARQIERQPDVIEVRYEWRPAYPLNYIVVRYSIAPETGDITSTIEGTTSF 581 (581) T ss_pred cc---ceeeEEecCCCEEEEEEEEEecccceEEEEEEEEeeCCCceEEEEeccccC Confidence 32 23445566789999999999999999999999998872 11111111111 No 44 >protein:vir:100829 Length: 607 # NCBI annotation: hypothetical protein # Family: family:all:2449 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164738;genbank:gi:56693151;genbank:GeneID:3197462 Probab=99.96 E-value=1.7e-30 Score=183.86 Aligned_cols=368 Identities=14% Similarity=0.056 Sum_probs=247.5 Q ss_pred CCCCcccCCCeEEEEcCCCcccccccccceeEEEEeecccccccccccceEEeecchhhhhhcccccchhhhhhhhh--- Q lcl|Aclame:pro 1 MSILDTYLHGVEVVEVNAGGVTISTAATSVIGVVCTGDQADAETFPLNTPVLITNPLNYLEKAGSTGTLRRTLNSIG--- 77 (393) Q Consensus 1 m~m~~~~~~GV~v~ev~~~~~~i~~v~tav~g~vg~a~~~d~~~~p~~~~vl~t~~~~~~~~~g~~~tl~~~~~~~~--- 77 (393) -|+=..-+||||+++.+++..++...++.+.+|+|.++.. |.++|++++++.++...|+.. .|.+++...+ T Consensus 15 ~~~~~~~~pgv~~~~~~~~~~~~~~~~~~~~~~iG~a~~G-----~~~~~~~~~~~~~a~~~f~~g-~l~~a~~~a~~~~ 88 (607) T protein:vir:10 15 YPLFYDSRPHVETNFDDSRLSNTASDSAKNIFMLGSATNG-----DPTKVYEIRTSQQATKIFGSG-DLVDGIKLAFDPT 88 (607) T ss_pred hCCCCccCCceEEEEecCcCcCCCCCCcceEEEEEEeCCC-----CCceEEEEcchhHHHHhhcCc-chHHHHHHhhccc Confidence 4433444799999999999999999999999999999877 668999999999999998653 3445554444 Q ss_pred ---cccCceEEEEEecccccccc---------------------------------c-----------ccchh------- Q lcl|Aclame:pro 78 ---SIVKTPTVIVRVAESDDSDT---------------------------------L-----------TANIV------- 103 (393) Q Consensus 78 ---~~~~~~~~vv~~~~~~~~~~---------------------------------~-----------~~~~~------- 103 (393) .+++..++++++........ + ++... T Consensus 89 ~~~~~g~~~~~~~rv~~~~~a~~~~~~~~~~~~~~~~~~~~i~~~l~~~~~~~~~~~~~~~~d~~~~~~~n~g~~~~i~y 168 (607) T protein:vir:10 89 GNSVTNGGTVYALRVDNAKQASLVKDGLTFTSSIFGTNANQVSVALDNDVFGVPRITVNYSPDNYERTYTNIGQMFSITY 168 (607) T ss_pred cCCccCCceEEEEeCCCccccceecccccccccccccCCCceEEEEEecCCCccceeEEeecccceeeeeeccceeeccc Confidence 56666777666422110000 0 00000 Q ss_pred ----------------cc---------cccc-------------c-ccchhhhhh------------------------h Q lcl|Aclame:pro 104 ----------------GT---------QENG-------------K-FTGIKALLT------------------------A 120 (393) Q Consensus 104 ----------------~~---------~~~~-------------~-~~gl~al~~------------------------~ 120 (393) +. .+.. . .+...+... . T Consensus 169 ~g~~~~a~~~v~~~~~g~~~~lt~~~~~~~~~~~~V~~~~l~~~~~~t~~~l~~din~~~~~~A~~~g~~~i~tky~d~~ 248 (607) T protein:vir:10 169 SGKSASAGYTVSHDTDGKAILLTLGSGDSIDKLTNVATFDLTMSKYDTIAKLMQAISATPNFSASVVGSPSVNTSYLDEV 248 (607) T ss_pred CcccccccceeeecCCCceeEEEecCCCccceeeeeecccccccccchHHHHHHHhhcCCceEEEEecccceeeeccccc Confidence 00 0000 0 000000000 0 Q ss_pred hhhh--------------------hhc----------------------------------------------------- Q lcl|Aclame:pro 121 QSTV--------------------FVK----------------------------------------------------- 127 (393) Q Consensus 121 ~~~~--------------------~~~----------------------------------------------------- 127 (393) ...+ ... T Consensus 249 ~~~i~V~~~~~iv~a~~~D~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~a~~~LtGGtdG~~ 328 (607) T protein:vir:10 249 TSPVDVKTAPAVVTAKIGDAISKLGYDPYVVVTQTSNNKPIVNGVSAGTGSATASVTTAPESFPANFDTAFLTGGSTGDV 328 (607) T ss_pred cceeEEEEeeeeechhhhhhhhcccccceEEeeecccchhhhhhhhccccceeeeeeccccccccccceeeeeCCCCCCc Confidence 0000 000 Q ss_pred --------------cccccccccchHHHHHHHHHhhcccce----E-EEEecCCCCcchhhhhhhcccccceEEEeccce Q lcl|Aclame:pro 128 --------------PKLLCVPQHDNQAVATELLSVAKKLNA----F-AFISDNGATTKEQAYTYRQNFSQREGMMIFGDW 188 (393) Q Consensus 128 --------------~~~l~apg~s~~~v~~al~~~a~~~~~----~-~~i~~~~~~~~~~a~~~~~~~~s~~~~~~~p~~ 188 (393) ....+.+..++.+++.++.++|++++. + +++..+++.+.+++.+....+++.+..++.|+. T Consensus 329 ~~ty~dal~aLe~~e~~~i~~~t~d~ai~~~l~a~vkr~~~~g~~~~aVlg~~~~~t~~~~~t~a~~~N~ervv~V~~~~ 408 (607) T protein:vir:10 329 PVSWADKFNGAIGNNVYYIIPLTSEENIHAELQAFIDEQHVLGYNYHAFVGGGFAEPLEQILSRQVNINDSRFGLVGQSG 408 (607) T ss_pred hhhHHHHHHHHhhcCceEEEecCCCHHHHHHHHHHHHHHHhCCCcEEEEecCCCCCCHHHHHHHHHhhCCCcEEEEecCe Confidence 000001111224567888888876643 3 334445567778888899999999999999987 Q ss_pred eEeeccCCceEEech---hHHHHHHHHhhhccCCceecCCCceecceeeceeecccccCCCchhhhhhcccceEEEEe-- Q lcl|Aclame:pro 189 KSYNTDKKAYDTDYA---VARACALQAYIDKTVGWHKNISNVELDGVTGITKAVEFDINESSTEANYLNEKGITICLN-- 263 (393) Q Consensus 189 ~~~~~~~~~~~~~p~---S~~~ag~~a~~D~~~G~~~spaN~~l~gv~~~~~~~~~~~~~~~~~~~~ln~~gi~~~~~-- 263 (393) ...+ .+.....|+ ++.+||++|..| +.+||.|+.+. ..++.. .+++.|.+.+.++|+.++.. T Consensus 409 ~~~~--~G~~~~~~~~~~Aa~vAGl~Ag~~----~~~SlT~k~i~-~~~v~~------~lt~~e~e~ai~~Gv~~l~~~~ 475 (607) T protein:vir:10 409 HVQE--GGESVHVPAYLMAAYVGGLSSSLG----VAVPITNKKLA-LVDLDQ------NFSGDDLNTLNQNGVIGIEHLV 475 (607) T ss_pred eEee--CCcceeccHHHHHHHHHHHHhcCc----cccCcccceec-cccccc------cCCHHHHHHHHhCCeEEEEEcc Confidence 6644 344555554 688899999886 78899999986 445543 36788999999999999843 Q ss_pred ----CCCEEEEecccC---CCCcccceeehhhHHHHHHHHHHHHhH-HhhcccCCHHHHHHHHHHHHHHHHHHhhccccc Q lcl|Aclame:pro 264 ----HNGFRYWGSRTL---ATDTRWAFQQSVRTAQIIKETIGAGLA-WAVDMPLTPLRVKTMLEAINNKLRSWASGDDPR 335 (393) Q Consensus 264 ----~~G~~~wG~rT~---~~d~~~~~i~~rR~~~~i~~~i~~~~~-~~v~epn~~~~~~~i~~~i~~~l~~l~~~g~~~ 335 (393) +++++++.+-|+ ..++.|++|+++|++|+|.+.+++.+. +|++++|+...|.+++..+..||..+|++.... T Consensus 476 ~~~~~~~vrIv~~ItT~t~~~~~~~~~i~viRv~D~i~~dir~~~~~~yIGk~nnd~~~~~vk~~i~~~L~~~~l~~~ga 555 (607) T protein:vir:10 476 NRNATGGYYIVQDVSTNTVSSSHVDGSLYLGELTDFLFDNLRFVLRDTYIGSNIRSTSADDIKSTVASYLYSEMNNDDGL 555 (607) T ss_pred CccccceEEEeeeeeeccCCCCcchheeehhhhHHHHHHHHHHHHhhcCCcccCCcchHHHHHHHHHHHHHHHHHHhcCc Confidence 235888766554 346789999999999999999998875 899999999999999999999998877653333 Q ss_pred ccceEEEecCCCCHHHhhCCEEEEEEEEEecCcceeEEEEEEEcchHHHHHHHHHh Q lcl|Aclame:pro 336 ILGARVWVAEEITADIIKSGKFVIKYDYHWIPSLESLGLEQRVNDEYVVDLVNTLK 391 (393) Q Consensus 336 ~~~~~v~~~~~nt~~~i~~G~~~~~v~~~p~~p~e~i~~~~~~~~~~~~~~~~~~~ 391 (393) +.+|.. ++-.-...+.+++|++.+.|+.++|+|.+++.+.++-++.-=+.-. T Consensus 556 I~df~~----edv~v~~~~D~v~v~~~v~Pv~~iekIyvtv~v~~~~~~~~~~~~~ 607 (607) T protein:vir:10 556 IVDFSE----SDIVVTISGTVVYIQFAVAPTQEIKNIVVSGTYSNYSATSEDNTTK 607 (607) T ss_pred eeCCCc----cccEEeeCCCEEEEEEEEEEcccceEEEEEEEEEEEEEeeccCCCC Confidence 444321 1111123456899999999999999999999999887664333333 No 45 >protein:vir:102957 Length: 437 # NCBI annotation: sheath tail protein # Family: family:all:632 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945292;genbank:gi:39653727;uniprot:Q708M0;genbank:GeneID:2672871 Probab=99.95 E-value=3.9e-29 Score=176.31 Aligned_cols=357 Identities=13% Similarity=0.078 Sum_probs=215.8 Q ss_pred CCC-----CcccCCCeEEEEcCCCcccccccccceeEEEEeecccccccccccceEEeecchhhhhhcccccchh--hhh Q lcl|Aclame:pro 1 MSI-----LDTYLHGVEVVEVNAGGVTISTAATSVIGVVCTGDQADAETFPLNTPVLITNPLNYLEKAGSTGTLR--RTL 73 (393) Q Consensus 1 m~m-----~~~~~~GV~v~ev~~~~~~i~~v~tav~g~vg~a~~~d~~~~p~~~~vl~t~~~~~~~~~g~~~tl~--~~~ 73 (393) |+= ...-+||||++++..+.+++..+.+++.+|+|.++-. |+++|+.++++.++...||...+.. ... T Consensus 1 m~gg~~~~~~k~~PGvYi~~~~~~~~~i~~~~~~~~a~~~~~~~G-----p~~~~~~i~s~~d~~~~fG~~~~~~~~~~~ 75 (437) T protein:vir:10 1 MAGGIWKRQNKVRPGAYINVKSKDIAMTRLGGDGVVTVPLALSFG-----QSKKLMKIRRGEDLFKKLGYEQESPQLLLL 75 (437) T ss_pred CCcceecccceecCceeEEEecCCcceeeccCCcEEEEEEEecCC-----CCceeEEEecHHHHHHHcCCccchhHHHHH Confidence 331 1234699999999999999999999999999988666 8899999999999999999754421 122 Q ss_pred hhhhcccCceEEEEEecccccccccccchhcccccccccc-----------------------------------hhhhh Q lcl|Aclame:pro 74 NSIGSIVKTPTVIVRVAESDDSDTLTANIVGTQENGKFTG-----------------------------------IKALL 118 (393) Q Consensus 74 ~~~~~~~~~~~~vv~~~~~~~~~~~~~~~~~~~~~~~~~g-----------------------------------l~al~ 118 (393) ..++ +++..++++++..+.....+..+ +...+..+.| +.... T Consensus 76 ~~~~-~g~~~~~~~R~~~g~~a~~tl~~--~~~~~A~~~G~~gn~i~v~v~~~~~d~~~~~v~~~~~~~~~d~~~v~~~~ 152 (437) T protein:vir:10 76 NEAF-KRVSEVLLYRLNTGEKANVSLSD--NVTAQAKYSGVRGNDITVTVKTNVDDPSSFDVVTFLDTVVMDLQTVKVLA 152 (437) T ss_pred HHHh-cCCCEEEEEECCCCceeeEeecc--ceEEEeccCCcccceeEEEEeeccCCccceEEEEecCcceeeeeehhhhh Confidence 2233 56667788887553322111110 0000000000 00000 Q ss_pred hhhhhhhh----------ccccccccccc----hHHHHHHHHHhhcccceEEEEecCCCCcchhhhhhhccc---ccceE Q lcl|Aclame:pro 119 TAQSTVFV----------KPKLLCVPQHD----NQAVATELLSVAKKLNAFAFISDNGATTKEQAYTYRQNF---SQREG 181 (393) Q Consensus 119 ~~~~~~~~----------~~~~l~apg~s----~~~v~~al~~~a~~~~~~~~i~~~~~~~~~~a~~~~~~~---~s~~~ 181 (393) ......-. ........|.+ ......+|..+...--..+.+............+|.+.. ...+. T Consensus 153 ~~~~n~~v~~~~~~~l~~~a~~~LtGG~dg~~t~~dy~~al~~le~~~~n~l~~~~~d~~~~t~~~~~ik~~r~~~g~~~ 232 (437) T protein:vir:10 153 DLKNNALVEFSGTGELQPVAGAKLTGGTDGAISTQDYLEYFKALETVEFNYMALPVEDASIKKAAINFIKRMREDEGLGA 232 (437) T ss_pred hhhhhcccccccccccccccceeeeccccCCCChhHHHHHHHHhccCcceEEEecCCChhHHHHHHHHHHHHHhccCceE Confidence 00000000 00000011111 112344454443222222222111111123334443322 11222 Q ss_pred EEeccc-------eeEe-e----ccCCceEEechhHHHHHHHHhhhccCCceecCCCceecceeeceeecccccCCCchh Q lcl|Aclame:pro 182 MMIFGD-------WKSY-N----TDKKAYDTDYAVARACALQAYIDKTVGWHKNISNVELDGVTGITKAVEFDINESSTE 249 (393) Q Consensus 182 ~~~~p~-------~~~~-~----~~~~~~~~~p~S~~~ag~~a~~D~~~G~~~spaN~~l~gv~~~~~~~~~~~~~~~~~ 249 (393) .+.-+. +..+ + .........-..+.+||++|.. ++.+|+.|+.+.|+..+.. .+++.| T Consensus 233 ~~V~~~~~~d~e~Iin~~n~~~~~~~~~~~~~~~~a~vAG~~Ag~----~~~~S~t~~~~~~~~~v~~------~~t~~e 302 (437) T protein:vir:10 233 QLVVADSDADSEAVINVKNGVILSDKTVIDKTKATVWVAAASANA----GVEKSLTYEKYEDSVDVVG------RLSHTE 302 (437) T ss_pred EEEeCCCCCCCceEEEeecceeecCcceechhhHHHHHHHHhccC----ccccCccccccCCcccccc------cCCHHH Confidence 111111 1100 0 0111111223457788888877 4888999999988776654 357789 Q ss_pred hhhhcccceEEEEeC-CCE-EEEecccCCC-----CcccceeehhhHHHHHHHHHHHHhH-Hhhcc-cCCHHHHHHHHHH Q lcl|Aclame:pro 250 ANYLNEKGITICLNH-NGF-RYWGSRTLAT-----DTRWAFQQSVRTAQIIKETIGAGLA-WAVDM-PLTPLRVKTMLEA 320 (393) Q Consensus 250 ~~~ln~~gi~~~~~~-~G~-~~wG~rT~~~-----d~~~~~i~~rR~~~~i~~~i~~~~~-~~v~e-pn~~~~~~~i~~~ 320 (393) .+.+.++|+.++.+. +.+ .++|-.|+.+ ++.|++|.++|++|+|.+.+++.+. .|+++ ||+...|..++.. T Consensus 303 ~~~~i~~G~~vl~~~~~~v~i~~gInTltt~~~~~~~~~~ki~vir~~D~i~~di~~~~~~~yiGk~~N~~~~r~~~~~~ 382 (437) T protein:vir:10 303 TEDALLKGQFVFTARRGRAVVEQDINSHVSFTIEKNQDFRKNRILRTLDDIVNDTRYAFSEYFLGKVSNNEDGRQAFKAN 382 (437) T ss_pred HHHHHhCCcEEEEEeCCeEEEEEccccccccCCCCCchhhhhhHHHHHHHHHHHHHHHHHhccccccCCCHHHHHHHHHH Confidence 999999999998664 334 4578777643 5679999999999999999999887 49997 7999999999999 Q ss_pred HHHHHHHHhhcccccccceEEEecCCCCHHHhhCCEEEEEEEEEecCcceeEEEEEEEc Q lcl|Aclame:pro 321 INNKLRSWASGDDPRILGARVWVAEEITADIIKSGKFVIKYDYHWIPSLESLGLEQRVN 379 (393) Q Consensus 321 i~~~l~~l~~~g~~~~~~~~v~~~~~nt~~~i~~G~~~~~v~~~p~~p~e~i~~~~~~~ 379 (393) ++.||.+|++.|.+ ..+...+.+.. +......+++++.+.|+.+||+|.+++..+ T Consensus 383 i~~yl~~l~~~g~I--~~~~~~d~~v~--~~~~~~~v~v~~~v~~~dame~iy~ti~v~ 437 (437) T protein:vir:10 383 RIRYFKDLEARGAI--EDFKVEDIEVL--RGELKESVVVNVKVKPVDSMEKLYMTVTVE 437 (437) T ss_pred HHHHHHHHHhCCCc--cCCCceeEEee--cCCCCCEEEEEEEEEEeeeeeeEEEEEEec Confidence 99999999998854 44444332211 111346899999999999999999999999 No 46 >protein:vir:105470 Length: 451 # NCBI annotation: putative phage sheath tail protein # Family: family:all:632 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529880;genbank:gi:90592620;genbank:GeneID:3974534 Probab=99.88 E-value=3.8e-23 Score=143.52 Aligned_cols=358 Identities=9% Similarity=-0.004 Sum_probs=207.1 Q ss_pred CCC-----CcccCCCeEEEEcCCCcccccccccceeEEEEeecccccccccccceEEeecchhhhhhcccccchh--hhh Q lcl|Aclame:pro 1 MSI-----LDTYLHGVEVVEVNAGGVTISTAATSVIGVVCTGDQADAETFPLNTPVLITNPLNYLEKAGSTGTLR--RTL 73 (393) Q Consensus 1 m~m-----~~~~~~GV~v~ev~~~~~~i~~v~tav~g~vg~a~~~d~~~~p~~~~vl~t~~~~~~~~~g~~~tl~--~~~ 73 (393) |+= ...-+||||++++.++.+++.++++..+++++...+- +.+.|+.+.++.++...||...+.. ..+ T Consensus 1 magg~~~~~~K~~PGvYi~~~~~~~~~~~~~~~~~~~~i~~~~~~-----g~~~~v~i~~~~d~~~~fG~~~~~~~~~~~ 75 (451) T protein:vir:10 1 MAGGTWKAQDKRRPGTYINVVGNGQREAASSLGRVLLIRDKGLGW-----GKNGVIEVEANSDFTKKLGTTLDDPSLTAL 75 (451) T ss_pred CCceeeccceeecCceEEEEeccCcceeeccCCcEEEEEeeecCC-----CCcccEEeecHHHHHHHcCCcccchhHHHH Confidence 331 1233699999999999999999999999999865332 3366888999999999999665422 233 Q ss_pred hhhhcccCceEEEEEecccccccccccchhcccccccccch--------------------------------------h Q lcl|Aclame:pro 74 NSIGSIVKTPTVIVRVAESDDSDTLTANIVGTQENGKFTGI--------------------------------------K 115 (393) Q Consensus 74 ~~~~~~~~~~~~vv~~~~~~~~~~~~~~~~~~~~~~~~~gl--------------------------------------~ 115 (393) +.++ .++..+++.+...+.....+..... ..-+..+.|. . T Consensus 76 ~~~~-~g~~~v~~yrl~~g~~a~~t~~~~~-~~~~Aky~G~~Gn~i~v~v~~~~~d~~~~~v~t~~g~~~vd~qtv~~~~ 153 (451) T protein:vir:10 76 KETL-KGASKVLVLNPNEGTAATLTKEGLP-WTVTANYPGEKGNQITVSVEVSPADQNAATVSTIFGTKLVDEQSIKFNE 153 (451) T ss_pred HHHh-cCCcEEEEEEcCCCceEEEEeecCc-eEEEEeeCCcCCceEEEEEecccCCcCceEEEEEECCeEEEEEEeeccc Confidence 3333 3455566666544332221110000 0000000000 0 Q ss_pred hhhhhhhhhh-------hccc-----ccccc--ccc---hHHHHHHHHHhhcccceEEEEecCCC-Cc--chhhhhhhcc Q lcl|Aclame:pro 116 ALLTAQSTVF-------VKPK-----LLCVP--QHD---NQAVATELLSVAKKLNAFAFISDNGA-TT--KEQAYTYRQN 175 (393) Q Consensus 116 al~~~~~~~~-------~~~~-----~l~ap--g~s---~~~v~~al~~~a~~~~~~~~i~~~~~-~~--~~~a~~~~~~ 175 (393) +.+.....+. ..+. .+..+ |-. ...-....+..-|...-.....+..+ .+ .....+|.+. T Consensus 154 ~~el~~nd~V~a~~~~~g~~~~~~~~~l~~~~~gg~~~~~~~~~~~~l~~~e~~~~n~l~~~~~~~~~~i~~~~~a~ik~ 233 (451) T protein:vir:10 154 LDKFKGNDYITAKVVEEGSSKPVAFTNVSGTLTGGTTTESNKVESLLNDALENEEYAVVTTAGFEPSSNMNKLVVEAVKR 233 (451) T ss_pred hhhccCCceEEEEecccccccceeeeecccccccccccCCccchHHHHHHhccceeeEEEEccCCCchHHHHHHHHHHHH Confidence 0000000000 0000 00000 000 01112223333333332222222221 11 1223344332 Q ss_pred c----ccce-EEEecc--------ceeEe-ecc-CCceEEech---hHHHHHHHHhhhccCCceecCCCceecceeecee Q lcl|Aclame:pro 176 F----SQRE-GMMIFG--------DWKSY-NTD-KKAYDTDYA---VARACALQAYIDKTVGWHKNISNVELDGVTGITK 237 (393) Q Consensus 176 ~----~s~~-~~~~~p--------~~~~~-~~~-~~~~~~~p~---S~~~ag~~a~~D~~~G~~~spaN~~l~gv~~~~~ 237 (393) . +-.. +.++.+ .+..+ +.. ......+++ .+.+||++|.+ ++.+|+.|+.+.|+..+.. T Consensus 234 ~r~~~g~~~~aVl~~~~~~~~d~egiinv~n~~~~~dg~~~~~~~~~~~vAG~~Ag~----~~~~S~T~~~~~~~~~v~~ 309 (451) T protein:vir:10 234 LRENEGRKVRGVIPTDADTTYNYEGISTVVNGYTLSDGTNVDVKDATGYFAGISASA----DVATSLTYFEVEDAVSAYP 309 (451) T ss_pred HHHhcCCeEEEEecCccCCCCCCcceEEeecceEecCceeechhhhHHHHHHHHccc----ccccCccceecCCceeeee Confidence 2 1221 222211 11111 000 001122333 47888888887 4778999999988777654 Q ss_pred ecccccCCCchhhhhhcccceEEE-Ee-CCCEE-EEecccCCC-----CcccceeehhhHHHHHHHHHHHHhHH-hhcc- Q lcl|Aclame:pro 238 AVEFDINESSTEANYLNEKGITIC-LN-HNGFR-YWGSRTLAT-----DTRWAFQQSVRTAQIIKETIGAGLAW-AVDM- 307 (393) Q Consensus 238 ~~~~~~~~~~~~~~~ln~~gi~~~-~~-~~G~~-~wG~rT~~~-----d~~~~~i~~rR~~~~i~~~i~~~~~~-~v~e- 307 (393) . +++.|.+.+.++|..++ ++ +++++ .+|-.|+.+ ++.|+.|.++|++|.|.+.+++.+.. |+++ T Consensus 310 ~------~t~~e~~~~i~~G~lvl~~~~g~~v~i~~~INTltt~~~~k~~~~~ki~vir~~D~i~~di~~~~~~~yiGk~ 383 (451) T protein:vir:10 310 K------FDNEKTIKALDAGQIVFTTRPGQRVVIEQDINSLHKFTAEKPQAFSKNRVIRTLDEIATNTENTFERTYLGNV 383 (451) T ss_pred e------CCHHHHHHHHhCCeEEEEEEcCCeEEEEEccccceecCCCCCcchhhhhHHHHHHHHHHHHHHHhhhccceec Confidence 3 57889999999999887 34 45665 578888743 56799999999999999999999875 9886 Q ss_pred cCCHHHHHHHHHHHHHHHHHHhhcccccccceEEEecCCCCHHHhhCCEEEEEEEEEecCcceeEEEEEEEc Q lcl|Aclame:pro 308 PLTPLRVKTMLEAINNKLRSWASGDDPRILGARVWVAEEITADIIKSGKFVIKYDYHWIPSLESLGLEQRVN 379 (393) Q Consensus 308 pn~~~~~~~i~~~i~~~l~~l~~~g~~~~~~~~v~~~~~nt~~~i~~G~~~~~v~~~p~~p~e~i~~~~~~~ 379 (393) ||+...|..++..++.||.+|++.|.+. .+... |.+- ...-....+++++.+.|+..||+|.+.++++ T Consensus 384 ~N~~~gr~~~~~~i~~yl~~l~~~g~i~--~~~~~-d~~v-~~~~~~~~v~v~~~v~pvdame~iy~t~~v~ 451 (451) T protein:vir:10 384 GNNAAGRDLFKADRIAYLTSLQNRNMIQ--SFANT-DITV-EAGNDMDSIVVNLAVTPVDAMEKLYMTMVVR 451 (451) T ss_pred CCCHHHHHHHHHHHHHHHHHHHhCCCcc--CCCcc-ceEE-eecCCCCEEEEEEEEEEEeeeeeEEEEEEEc Confidence 7999999999999999999999988653 22211 1110 0011367799999999999999999999999 No 47 >protein:vir:101326 Length: 529 # NCBI annotation: gp22 # Family: family:all:9453 # MgeID: mge:1512 # MgeName: P1 # Cross-refs: genbank:acc:YP_006525;genbank:gi:46401679;genbank:GeneID:2777386 Probab=99.86 E-value=2.7e-23 Score=144.32 Aligned_cols=359 Identities=11% Similarity=-0.005 Sum_probs=199.6 Q ss_pred CCCCc-ccCCCeEEEEcCCCccccccccccee----EEEEee--ccccc--ccccccc----------eEE----eecch Q lcl|Aclame:pro 1 MSILD-TYLHGVEVVEVNAGGVTISTAATSVI----GVVCTG--DQADA--ETFPLNT----------PVL----ITNPL 57 (393) Q Consensus 1 m~m~~-~~~~GV~v~ev~~~~~~i~~v~tav~----g~vg~a--~~~d~--~~~p~~~----------~vl----~t~~~ 57 (393) |.... .|.+|.+++ .+.|....--++.+-+ +..-+. .++|. ..++... -++ ..=.. T Consensus 112 ~~~~~s~~~~s~~~~-l~~G~~~~iy~~Dgd~~~s~~~~l~i~~~~ads~g~e~~~l~~~~~~~~g~~~~let~~~sl~~ 190 (529) T protein:vir:10 112 GEPAYSALPYGSEIE-LDSGEAFAIYVDDGDPCISPTRELTIETATADSAGNERFLLKLTQTTSLGVVTTLETHTVSLAE 190 (529) T ss_pred ccchhhccccccccc-ccccceEEEEEecCcCccCCceEEEEEeeccccCCCccceeeEEEEeecCCceEEEEEEeeeee Confidence 32222 222444443 3333321111111111 111111 11111 0000000 000 00112 Q ss_pred hhhhhcccccchhhhhhhhhcccCceEEEEEeccccccccccc-chhcccccccccchh--hhhhh---hhhhhhccccc Q lcl|Aclame:pro 58 NYLEKAGSTGTLRRTLNSIGSIVKTPTVIVRVAESDDSDTLTA-NIVGTQENGKFTGIK--ALLTA---QSTVFVKPKLL 131 (393) Q Consensus 58 ~~~~~~g~~~tl~~~~~~~~~~~~~~~~vv~~~~~~~~~~~~~-~~~~~~~~~~~~gl~--al~~~---~~~~~~~~~~l 131 (393) ++....|....+...++.....--. .+.+......+.....+ ...++++.+. ..+. +.+.+ ....-+....+ T Consensus 191 ~a~dd~G~~~yl~svle~~s~~l~a-i~~~e~~~t~~~~t~~d~~f~~GtdG~~-~~i~s~~y~~A~~~L~n~p~d~~~i 268 (529) T protein:vir:10 191 EAKDDMGRLCYLPTALEARSKYLRA-VVNEELISTAKVTNKKSLAFTGGTNGDQ-SKISTAAYLRAVKVLNNAPYMYTAV 268 (529) T ss_pred chhhhcCCccchhHHHhhccCceee-eeeeccccccchhhhhhhhccCCccccc-cccchHHHHHHHHHhcCCcceeeee Confidence 2223333333333333221111100 01111111111000000 1122222211 1111 11111 11223344556 Q ss_pred cccccchHHHHHHHHHhhcccceEEEEecCCCCcchhhhhhhccccc---c---eEEEeccceeEeeccCCceEEechhH Q lcl|Aclame:pro 132 CVPQHDNQAVATELLSVAKKLNAFAFISDNGATTKEQAYTYRQNFSQ---R---EGMMIFGDWKSYNTDKKAYDTDYAVA 205 (393) Q Consensus 132 ~apg~s~~~v~~al~~~a~~~~~~~~i~~~~~~~~~~a~~~~~~~~s---~---~~~~~~p~~~~~~~~~~~~~~~p~S~ 205 (393) ++-|--..++..+|+.+|++.+..++.+.++..+.++|++|.+.++- . ....||||. .-|+.++....+++|| T Consensus 269 l~~g~y~~a~I~~L~~ic~~~~~d~f~DV~~~LT~~aA~~~~e~~gl~~~~~~~~s~y~~P~~-~~D~~tg~k~~~GlsG 347 (529) T protein:vir:10 269 LGLGCYDNAAITALGKICADRLIDGFFDVKPTLTYAEALPAVEDTGLLGTDYVSCSVYHYPFS-CKDKWTQSRVVFGLSG 347 (529) T ss_pred eccCCccHHHHHHHHHHHhhhhhcEEEcCCCCcCHHHHHHHHHhcCccccCceeeEEEEccee-eccccccCceeeCCCc Confidence 66666677889999999988777666777888999999999976542 2 245678887 6678888888999999 Q ss_pred HHHHHHHhhhccCC---------ceecCCCceecceeeceeecccccCCCchhhhhhcccceEEEEeC--C----CEEEE Q lcl|Aclame:pro 206 RACALQAYIDKTVG---------WHKNISNVELDGVTGITKAVEFDINESSTEANYLNEKGITICLNH--N----GFRYW 270 (393) Q Consensus 206 ~~ag~~a~~D~~~G---------~~~spaN~~l~gv~~~~~~~~~~~~~~~~~~~~ln~~gi~~~~~~--~----G~~~w 270 (393) . |+.+..|| +|.+||++.- +++.- ..|.--...++.|...|..++||.+.-+ + +-.+| T Consensus 348 ~-----A~~akargv~~na~v~g~hY~pAGe~r-~~inr-~~I~~ly~~d~~e~~~lv~~riNPV~~~~~g~~~idDsLt 420 (529) T protein:vir:10 348 V-----AYAAKARGVKKNSDVGGWHYSPAGEER-AVIAR-ASIQPLYPEDTPDEEAMVKGRLNKVSVGTSGQMIIDDALT 420 (529) T ss_pred c-----eeeccccceeecccccccccccCCCcc-ceeec-ccceeccCCCccCHHHHHhhccCeeeeeccCcceeeeeec Confidence 5 44444444 5999999863 33322 1122222234445555555666666432 2 34577 Q ss_pred ecccCCCCcccceeehhhHHHHHHHHHHHHhHHhhcccCCHHHHHHHHHHHHHHHHHHhhccccc---------ccceEE Q lcl|Aclame:pro 271 GSRTLATDTRWAFQQSVRTAQIIKETIGAGLAWAVDMPLTPLRVKTMLEAINNKLRSWASGDDPR---------ILGARV 341 (393) Q Consensus 271 G~rT~~~d~~~~~i~~rR~~~~i~~~i~~~~~~~v~epn~~~~~~~i~~~i~~~l~~l~~~g~~~---------~~~~~v 341 (393) |+|+ |+.|||+|+++|+++|++.+-+..+|.+|||++..+|. +++.++.+|..+|+.|.+. ..+|.+ T Consensus 421 ~~~k---nny~R~~hv~~lmn~I~~~~~k~a~~~~~~Pd~it~~g-l~~~l~~~L~r~~asgalv~prdp~~~G~epy~~ 496 (529) T protein:vir:10 421 CCTQ---DNYLHFQHVPSLMNAISRFFVQLARQMKHSPDGITAAG-LTKGMTKLLDRFVASGALVAPRDPDADGTEPYVL 496 (529) T ss_pred eeee---CCchhhhhHHHHHHHHHHHHHHHHHHHhhCCChHHHHH-HHHhHHHHHHHHHhcCceecccCccCCCCCceEE Confidence 7775 78899999999999999999999999999999999988 9999999999999988532 233444 Q ss_pred EecCCCCHHHhhCCEEEEEEEEEecCcceeEEEEEEEcc Q lcl|Aclame:pro 342 WVAEEITADIIKSGKFVIKYDYHWIPSLESLGLEQRVND 380 (393) Q Consensus 342 ~~~~~nt~~~i~~G~~~~~v~~~p~~p~e~i~~~~~~~~ 380 (393) .+ ...+.+++.+++.++|+-.+.+|...-.+-. T Consensus 497 ~V------~q~d~D~~~v~~~~~ptGv~Rri~~~p~l~~ 529 (529) T protein:vir:10 497 KV------TQAEFDKWEVVWACCPTGVARRIQGVPLLIK 529 (529) T ss_pred EE------eecccCeEEEEEEeecCCceeeEEeeeeecC Confidence 33 2334589999999999999999888776655 No 48 >protein:vir:78986 Length: 436 # NCBI annotation: putative sheath tail protein # Family: family:all:632 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110731;genbank:gi:134287348;genbank:GeneID:4955160 Probab=99.78 E-value=4.3e-19 Score=121.25 Aligned_cols=355 Identities=12% Similarity=0.019 Sum_probs=196.7 Q ss_pred CCCCcc-------cCCCeEEEEcCCCcccccccccceeEEEEeecccccccccccceEEeecc---hhhhhhcccccchh Q lcl|Aclame:pro 1 MSILDT-------YLHGVEVVEVNAGGVTISTAATSVIGVVCTGDQADAETFPLNTPVLITNP---LNYLEKAGSTGTLR 70 (393) Q Consensus 1 m~m~~~-------~~~GV~v~ev~~~~~~i~~v~tav~g~vg~a~~~d~~~~p~~~~vl~t~~---~~~~~~~g~~~tl~ 70 (393) |+|+-. -+||+|+.....+...+......++++...++=. |.++++.+++. .+....+|...+.. T Consensus 1 ~~magg~~~~~~K~~PG~Y~n~~~~~~~~~~~~~rGi~a~p~~~~wG-----p~~~v~~i~~~~~~~~~~~~~G~~~~~~ 75 (436) T protein:vir:78 1 MALGGGTFVTQNKVLPGSYINFVSATRATSSLSDRGIVAMPLELDWG-----IDEEVFQVTSDDFEKYSTKYFGYDYTHE 75 (436) T ss_pred CcccceeeccceeecCceEEEEEecCcceeeccCCeEEEEEEEecCC-----CCceeEEeecccchHHHHHHhcCccchH Confidence 998872 2599999999888888888889888888866443 77888877763 45666777665533 Q ss_pred hh--hhhhhcccCceEEEEEeccccccccc---------------------------ccch--hcccccccccchhhhhh Q lcl|Aclame:pro 71 RT--LNSIGSIVKTPTVIVRVAESDDSDTL---------------------------TANI--VGTQENGKFTGIKALLT 119 (393) Q Consensus 71 ~~--~~~~~~~~~~~~~vv~~~~~~~~~~~---------------------------~~~~--~~~~~~~~~~gl~al~~ 119 (393) .. +..++.+ ....+..+...+.....+ .+.. .+... ........... T Consensus 76 ~~~~l~~~~~~-~~tv~~yrl~~G~~a~~~v~~Aky~g~~gn~i~v~v~~~~~d~~~~dv~~~~g~~~-~d~~~~~~~~~ 153 (436) T protein:vir:78 76 KLKGLRDLFKN-IRLGYFYKLNKGVKASCSIATARCSGIRGNDLKVIVTTNIDDNAKFDVVTLLDNKK-VDTQIAKVITE 153 (436) T ss_pred HHHHHHHHhcC-CCEEEEEECCCcceeeeeeeeeecCCCCCcEEEEEecccccccCceEEEEEecchh-hhhhhHHHHhh Confidence 21 2222211 111222222111000000 0000 00000 00000000100 Q ss_pred hh-hhh------h---hccccccccccc-----hHHHHHHHHHhhcccceEEEEecCCCCcchhhhhhhccc----ccce Q lcl|Aclame:pro 120 AQ-STV------F---VKPKLLCVPQHD-----NQAVATELLSVAKKLNAFAFISDNGATTKEQAYTYRQNF----SQRE 180 (393) Q Consensus 120 ~~-~~~------~---~~~~~l~apg~s-----~~~v~~al~~~a~~~~~~~~i~~~~~~~~~~a~~~~~~~----~s~~ 180 (393) .. ..+ + .....-...|.+ ......+|..+...-...+.++...........+|.+.. +-+. T Consensus 154 l~~n~~V~~~~~g~la~~a~~~LtGG~dG~~~T~~dy~~al~~le~~~fn~l~~~~~d~~~~~~~~a~ikr~re~~g~~~ 233 (436) T protein:vir:78 154 LQDNDYVTWKKEATLEATAGLTFTNGTNGEAVTGTEYQAFLDKIESYSFNALGCLATTAEIKSLFVEFTKRMRDKVGAKF 233 (436) T ss_pred ccCCceEEEEecccccccceeeeeccccccccchHHHHHHHHHHcccceeEEEecCCChHHHHHHHHHHHHHHhhcCCeE Confidence 00 000 0 000011111211 122334444332222222212111111112233333221 1111 Q ss_pred EEEecc-------cee-EeeccCC-ceEEechhHHHHHHHHhhhccCCceecCCCceecceeeceeecccccCCCchhhh Q lcl|Aclame:pro 181 GMMIFG-------DWK-SYNTDKK-AYDTDYAVARACALQAYIDKTVGWHKNISNVELDGVTGITKAVEFDINESSTEAN 251 (393) Q Consensus 181 ~~~~~p-------~~~-~~~~~~~-~~~~~p~S~~~ag~~a~~D~~~G~~~spaN~~l~gv~~~~~~~~~~~~~~~~~~~ 251 (393) .++..+ .+. +-+...+ .....-..+.+||++|.++ +.+|+.|+.+.++..+... +++.|.+ T Consensus 234 ~aV~~~~~~~d~EgIInv~n~v~g~~~~~~~~~a~vAG~~Ag~~----~~~S~T~~~~~~~~~v~~~------~t~~e~~ 303 (436) T protein:vir:78 234 QTVLYKKNDADYEGVVSVENKIKDTGLLESSLIYWTTGAIAGCD----INKSNTNKRYDGEFDVDVN------YTQIHLE 303 (436) T ss_pred EEEecCCCCCCCceEEEeecccCCceechhHHHHHHHHHHhcCc----cccCccceecCcccccccc------CCHHHHH Confidence 111111 010 0011111 1112335677888888774 7789999998876665433 5778999 Q ss_pred hhcccceEEEEe-CCCEEEE-ecccCC-----CCcccceeehhhHHHHHHHHHHHHhH-Hhhcc-cCCHHHHHHHHHHHH Q lcl|Aclame:pro 252 YLNEKGITICLN-HNGFRYW-GSRTLA-----TDTRWAFQQSVRTAQIIKETIGAGLA-WAVDM-PLTPLRVKTMLEAIN 322 (393) Q Consensus 252 ~ln~~gi~~~~~-~~G~~~w-G~rT~~-----~d~~~~~i~~rR~~~~i~~~i~~~~~-~~v~e-pn~~~~~~~i~~~i~ 322 (393) .+.++|..++.+ ++++++- |-.|+. .+..|+.|.++|++|+|.+.+++.+. .|+++ ||+..-|..++..++ T Consensus 304 ~ai~~G~lvl~~d~~~v~I~~~VNTltt~~~~k~~~~~kI~vir~~D~i~~di~~~~~~~yiGKv~N~~dgr~~l~~~i~ 383 (436) T protein:vir:78 304 EALKTGKFIFHKVGDEVHVLEDINTFVSFTDEKNDDFSSNQSVRVLDQIANDIATLFNTKYLGEVPNDKSGRISFWNDVV 383 (436) T ss_pred HHHhCCeEEEEEeCCeEEEEEccccceecCCCCCcchhhhhHHHHHHHHHHHHHHHhhhccccccCCCHHHHHHHHHHHH Confidence 999999988865 4555544 444542 24579999999999999999999986 59996 699999999999999 Q ss_pred HHHHHHhhcccccccceE---EEecCCCCHHHhhCCEEEEEEEEEecCcceeEEEEEEEc Q lcl|Aclame:pro 323 NKLRSWASGDDPRILGAR---VWVAEEITADIIKSGKFVIKYDYHWIPSLESLGLEQRVN 379 (393) Q Consensus 323 ~~l~~l~~~g~~~~~~~~---v~~~~~nt~~~i~~G~~~~~v~~~p~~p~e~i~~~~~~~ 379 (393) .||.+|.+.|.+. .|. +...+.+ ....+++++.+.|+-.+|+|.++++.. T Consensus 384 ~yl~~L~~~g~I~--~f~~~Dv~v~~~~-----~~~~v~v~~~v~pvdamekiy~ti~v~ 436 (436) T protein:vir:78 384 KHHEQLQNMRAIE--DFKADDVSVEPGS-----DKKTVVVSDAVKVISAMSKLYMTVSVS 436 (436) T ss_pred HHHHHHHhCCccc--CCCCcceEEeecC-----CCCEEEEEEEEEEEEeeeeEEEEEEEC Confidence 9999999988543 222 1122111 356788999999999999999999998 No 49 >protein:vir:102359 Length: 356 # NCBI annotation: XkdK protein # Family: family:all:632 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529565;genbank:gi:90592650;genbank:GeneID:3974491 Probab=99.29 E-value=3e-12 Score=83.75 Aligned_cols=323 Identities=15% Similarity=0.111 Sum_probs=182.8 Q ss_pred CCCCcccCCCeEEEEcCCCcccccccccceeEEEEeecccccccccccceEEeecchhhhhhcccccchhhhhhhhhccc Q lcl|Aclame:pro 1 MSILDTYLHGVEVVEVNAGGVTISTAATSVIGVVCTGDQADAETFPLNTPVLITNPLNYLEKAGSTGTLRRTLNSIGSIV 80 (393) Q Consensus 1 m~m~~~~~~GV~v~ev~~~~~~i~~v~tav~g~vg~a~~~d~~~~p~~~~vl~t~~~~~~~~~g~~~tl~~~~~~~~~~~ 80 (393) |+ - +|++++....-+...+....-.++.++-.- .. . .....++..+........ ....+...+..+ T Consensus 1 ~~---g-lp~i~i~f~~~a~ta~~~g~rGiv~~il~d-~~--~-----~~~~~~~~~~v~~~~~~~--n~~~i~~~~~g~ 66 (356) T protein:vir:10 1 MA---G-LVNINIEFKELATSFIQRSKAGIVAIILKD-TT--K-----MYKELTSEDDIPISLSAD--NKKYIKYGFVGA 66 (356) T ss_pred CC---C-CCceeEEEeecceeeccCCccceEEEEEec-CC--c-----ceeEEeccccchhHHHHH--HHHHHHHHhhcc Confidence 33 2 489999888777776665555555554421 11 1 111122222222222111 112222222211 Q ss_pred CceEEEEEecccccccccccchhcccccccccchhhhhhhhhhhhhccccccccccchHHHHHHHHHhhcccce-----E Q lcl|Aclame:pro 81 KTPTVIVRVAESDDSDTLTANIVGTQENGKFTGIKALLTAQSTVFVKPKLLCVPQHDNQAVATELLSVAKKLNA-----F 155 (393) Q Consensus 81 ~~~~~vv~~~~~~~~~~~~~~~~~~~~~~~~~gl~al~~~~~~~~~~~~~l~apg~s~~~v~~al~~~a~~~~~-----~ 155 (393) ............... .+.+.......|.+++. .....++.|+. +.++...+.+.+++++. + T Consensus 67 ~~~~~~~~p~~~~~~-------~~~t~~~y~~aL~~le~------~~fn~l~~~~~-d~~~~~~~~a~ikr~r~~~~~~~ 132 (356) T protein:vir:10 67 TDNEKVLRPSKVIIS-------TFTEDGKVEDILEELES------VEFNYLCMPEA-IEAEKTKIVTWIKKIREEESTEA 132 (356) T ss_pred ccccccccceeeeee-------cccCchhHHHHHHHhcC------ccceEEEecCC-ChHHHHHHHHHHHHHHhcCCcEE Confidence 110000000000000 00111122233333332 34456677763 45677778888876653 2 Q ss_pred EEEecCCCCcchhhhhhhcccccceEEEeccceeEeeccCCceEEechhHHHHHHHHhhhccCCceecCCCceecceeec Q lcl|Aclame:pro 156 AFISDNGATTKEQAYTYRQNFSQREGMMIFGDWKSYNTDKKAYDTDYAVARACALQAYIDKTVGWHKNISNVELDGVTGI 235 (393) Q Consensus 156 ~~i~~~~~~~~~~a~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~S~~~ag~~a~~D~~~G~~~spaN~~l~gv~~~ 235 (393) .++. +... .+... .+....-... .+...-..-..+.+||++|.+. +.+|+.|+.+.++... T Consensus 133 ~~V~--~~~~----------aD~Eg-IInv~n~~~~--~g~~~t~~~~~~~vAG~~Ag~~----~n~S~T~~~~~~~~~~ 193 (356) T protein:vir:10 133 KAVL--ANIK----------ADNEA-IINFTENVVV--DGEEITAEKYTTRVASLIASTP----NTQSITYAPLDEVESI 193 (356) T ss_pred EEEe--cCCC----------CCCce-eEEeecCeEe--cceeechhHHHHHHHHHHhccc----hhccccceecCCcccc Confidence 2222 1111 01111 1111110111 1111122345678999999885 6678999888765433 Q ss_pred eeecccccCCCchhhhhhcccceEEEEeCCC-EE-EEecccCC-----CCcccceeehhhHHHHHHHHHHHHhH-Hhhcc Q lcl|Aclame:pro 236 TKAVEFDINESSTEANYLNEKGITICLNHNG-FR-YWGSRTLA-----TDTRWAFQQSVRTAQIIKETIGAGLA-WAVDM 307 (393) Q Consensus 236 ~~~~~~~~~~~~~~~~~ln~~gi~~~~~~~G-~~-~wG~rT~~-----~d~~~~~i~~rR~~~~i~~~i~~~~~-~~v~e 307 (393) . .+++.|.+..-.+|--.+.+.+| .+ .-|-.|+. .+..|+.|.+.|++|.|.+.+++.+. .|+++ T Consensus 194 ~-------~~t~~e~~~ai~~G~lvl~~d~~~V~I~~~VNSltt~t~~k~~~f~Kirvvr~~D~i~~Di~~~f~~~yiGK 266 (356) T protein:vir:10 194 V-------KIDKASADAKVQAGELILRRLSGKIRIARGINSLTTLTAEKGEIFQKIKLVDTKDLISKDIKNIYVEKYLRK 266 (356) T ss_pred c-------cCCHHHHHHHHhCCeEEEEEEcCeEEEEecCccceecCCCCCcchhhhHHHHHHHHHHHHHHHHHhhccccc Confidence 2 24567888888899999877544 44 34555552 13459999999999999999999986 69998 Q ss_pred -cCCHHHHHHHHHHHHHHHHHHhhcccccccceEEEecCCCCHH---------------Hhh----CCEEEEEEEEEecC Q lcl|Aclame:pro 308 -PLTPLRVKTMLEAINNKLRSWASGDDPRILGARVWVAEEITAD---------------IIK----SGKFVIKYDYHWIP 367 (393) Q Consensus 308 -pn~~~~~~~i~~~i~~~l~~l~~~g~~~~~~~~v~~~~~nt~~---------------~i~----~G~~~~~v~~~p~~ 367 (393) ||+..-+..++..++.||.+|.+.|.+ ..++.+..|.+.... .+. .-.+.+++.+.|+- T Consensus 267 v~N~~dgr~~l~~ai~~y~~~L~~~~~I-~~~~~~eid~e~q~~~~~~~g~d~~~~~d~~v~~~~~~~~v~~~~~v~~vd 345 (356) T protein:vir:10 267 CPNTYDNKCLFIVAVQSYLTELAKQELI-DSNFTVEIDLEKQKEYLEGKKIAVSKMKENEIKEANTGSNGFYLINLKLVD 345 (356) T ss_pred cCCCHHHHHHHHHHHHHHHHHHHhCCcc-ccCceeEecccchHHHhhhccccccccccceeecccCCcEEEEEEEEEEEe Confidence 699999999999999999999998865 345666555543322 111 24488999999999 Q ss_pred cceeEEEEEEE Q lcl|Aclame:pro 368 SLESLGLEQRV 378 (393) Q Consensus 368 p~e~i~~~~~~ 378 (393) .||.|.+.+.. T Consensus 346 amE~iy~ti~v 356 (356) T protein:vir:10 346 AMEDINIRVQM 356 (356) T ss_pred eeeeEEeEEeC Confidence 99999999999 No 50 >protein:vir:3751 Length: 376 # NCBI annotation: orf23 # Family: family:all:669 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043492;genbank:gi:9628627;genbank:GeneID:1261129 Probab=98.85 E-value=4.3e-09 Score=66.47 Aligned_cols=343 Identities=10% Similarity=0.030 Sum_probs=203.1 Q ss_pred cCCCeEEEEcCCCcccccccccceeEEEEeecccccccccccceEEeecchhhhhhcccc-cchhhhhhhhhcccCceEE Q lcl|Aclame:pro 7 YLHGVEVVEVNAGGVTISTAATSVIGVVCTGDQADAETFPLNTPVLITNPLNYLEKAGST-GTLRRTLNSIGSIVKTPTV 85 (393) Q Consensus 7 ~~~GV~v~ev~~~~~~i~~v~tav~g~vg~a~~~d~~~~p~~~~vl~t~~~~~~~~~g~~-~tl~~~~~~~~~~~~~~~~ 85 (393) -.|-|.|...+.+..++..+.-. .-|+|.++....+.++ ++.-++.-..+|.. ..|...+.+...+++.. + T Consensus 1 ~~~~v~vn~ln~~qg~~~~ver~-~lfig~~~~~~~~~~~------~~~~sdld~~lg~~ds~lk~~v~aa~~naG~~-w 72 (376) T protein:vir:37 1 MFPSVQINALNQLSGETKEIERH-ALFVGVGTTNQGKLLA------LTPDSDFDKVFGETDTDLKKQVRAAMLNAGQN-W 72 (376) T ss_pred CCCeEEEeeeeccCCCcccccce-EEEeeccccccCceEE------ecCCCChHHhhCCCchhHHHHHHHHHhCCCCc-e Confidence 34679999999988888877654 5678887654334333 33345555556544 56777777777766543 3 Q ss_pred EEEecccccccccccchhcccccccccchhhhhhhhhhhhhccccccccccchHHHHHHHHHhhc----c-cceEEEEec Q lcl|Aclame:pro 86 IVRVAESDDSDTLTANIVGTQENGKFTGIKALLTAQSTVFVKPKLLCVPQHDNQAVATELLSVAK----K-LNAFAFISD 160 (393) Q Consensus 86 vv~~~~~~~~~~~~~~~~~~~~~~~~~gl~al~~~~~~~~~~~~~l~apg~s~~~v~~al~~~a~----~-~~~~~~i~~ 160 (393) ...+..... .....+.|++.+.....+..-.++.|-.++.+.+.++.++++ + .+..|++.. T Consensus 73 ~a~~~~p~~--------------~~~~~~~Av~~a~~~~s~E~V~v~~p~~t~~a~i~a~qa~a~el~~~~~R~vffile 138 (376) T protein:vir:37 73 FAHVYIAQE--------------DGYDFVECVKKANQTASFEYCVNTRYLGVDKASIGKLQECYAELLAKFGRRTFFIQA 138 (376) T ss_pred EEEEEecCC--------------ChhhHHHHHHHHHhhCCeeEEEEecCcchhHHHHHHHHHHHHHHHHhcCCeEEEEEe Confidence 222211110 011355666665443333333333332233444444444443 3 356667766 Q ss_pred CCCCc--c---hhhhh-------hhcccccceEEEeccceeEeeccCCceEEechhHHHHHHHHhhhccCCceecCCCce Q lcl|Aclame:pro 161 NGATT--K---EQAYT-------YRQNFSQREGMMIFGDWKSYNTDKKAYDTDYAVARACALQAYIDKTVGWHKNISNVE 228 (393) Q Consensus 161 ~~~~~--~---~~a~~-------~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~S~~~ag~~a~~D~~~G~~~spaN~~ 228 (393) .++.. . +..-+ -+..+.+.+..++.. .| + -..|.+||.+|+. ..-++.||.... T Consensus 139 ~~g~d~~~~~ge~w~~y~~~l~a~~~gia~~~V~vV~~---~~---g------n~~G~~aGRl~na--aVsVadspgRV~ 204 (376) T protein:vir:37 139 VQGINHDQSDGETWDQYVQKLTTLQQTIVADHVCLVPL---LF---G------NETGVLAGRLANR--AVTVADSPARVQ 204 (376) T ss_pred ccCCCCcccccCCHHHHHHHHHHHhccccccceeeeee---ec---c------chHHHHHHHHHhC--CcchhcCcccee Confidence 55321 1 12222 234556666655422 11 1 1467889988763 334688998865 Q ss_pred ecceeece---eecc-cccCCCchhhhhhcccceEEE--EeC-CCEEEEecccCCC-CcccceeehhhHHHHHHHHHHHH Q lcl|Aclame:pro 229 LDGVTGIT---KAVE-FDINESSTEANYLNEKGITIC--LNH-NGFRYWGSRTLAT-DTRWAFQQSVRTAQIIKETIGAG 300 (393) Q Consensus 229 l~gv~~~~---~~~~-~~~~~~~~~~~~ln~~gi~~~--~~~-~G~~~wG~rT~~~-d~~~~~i~~rR~~~~i~~~i~~~ 300 (393) -..+.++. .+++ ....++..-...|..+|-.+. ++| .|+-+=.+||++. .+.+++|..+|.++-+.|.++.. T Consensus 205 tGai~gl~~~~~p~d~~g~el~~a~l~aLd~arysvpr~Y~gydG~Yw~dg~tl~~~gsDYq~ie~~RVvdKa~R~vR~~ 284 (376) T protein:vir:37 205 TGALVSLGSANKPLDKDGNELTLAHLKSLETARYSVPMWYPDYDGYYRADGRTLDVEGGDYQVIENLRVVDKVARKVRLL 284 (376) T ss_pred ecccccccccccccccCCcccchHHHHHHHhCCCeEEEeeCCCCceEEeCCeEeccCCCCeeeehhchHHHHHHHHHHHH Confidence 44444433 2222 122345677888999999998 456 5787778899865 36699999999999999988876 Q ss_pred hHHhhcc---cCCHHHHHHHHHHHHHHHHHHhhcccccc--cceEEEe--cCCCCHHHhhCCEEEEEEEEEecCcceeEE Q lcl|Aclame:pro 301 LAWAVDM---PLTPLRVKTMLEAINNKLRSWASGDDPRI--LGARVWV--AEEITADIIKSGKFVIKYDYHWIPSLESLG 373 (393) Q Consensus 301 ~~~~v~e---pn~~~~~~~i~~~i~~~l~~l~~~g~~~~--~~~~v~~--~~~nt~~~i~~G~~~~~v~~~p~~p~e~i~ 373 (393) .-..+.. ..++.-++..+..+..=||+|.+.+.... +..+|.. |.+-+-.-....++.+-+.+.|.--.+.|+ T Consensus 285 Ai~~i~Dr~lnstp~sia~~~~~~~~pLr~M~ks~ei~g~~fpgei~~P~d~dI~i~w~sk~~V~I~~~vrPy~cpk~i~ 364 (376) T protein:vir:37 285 AIGKIADRSFNSTTSSTEYHKNYFAKPLRDMSKSATINGKDFPGECMPPKDDAITIVWQSKTKVTIYIKVRPYDCPKEIT 364 (376) T ss_pred HHHHhcCccccCChhHHHHHHHHHhHHHHHHHhhhhhccccccceeecCCCCceEEEeccCceEEEEEEEeeecCcceeE Confidence 6655543 34566777778888888999987653221 1122322 111111112346677778888887789999 Q ss_pred EEEEEcchHHHH Q lcl|Aclame:pro 374 LEQRVNDEYVVD 385 (393) Q Consensus 374 ~~~~~~~~~~~~ 385 (393) ..+..|-.-+.+ T Consensus 365 ~~I~LDls~~~~ 376 (376) T protein:vir:37 365 ANIFLDLDSLGE 376 (376) T ss_pred EEEEEecCCCCC Confidence 999997664444 No 51 >protein:vir:3788 Length: 376 # NCBI annotation: tail sheath # Family: family:all:669 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536828;genbank:gi:17981837;genbank:GeneID:929216 Probab=98.80 E-value=6.6e-09 Score=65.44 Aligned_cols=343 Identities=10% Similarity=0.046 Sum_probs=195.3 Q ss_pred cCCCeEEEEcCCCcccccccccceeEEEEeecccccccccccceEEeecchhhhhhccc-ccchhhhhhhhhcccCceEE Q lcl|Aclame:pro 7 YLHGVEVVEVNAGGVTISTAATSVIGVVCTGDQADAETFPLNTPVLITNPLNYLEKAGS-TGTLRRTLNSIGSIVKTPTV 85 (393) Q Consensus 7 ~~~GV~v~ev~~~~~~i~~v~tav~g~vg~a~~~d~~~~p~~~~vl~t~~~~~~~~~g~-~~tl~~~~~~~~~~~~~~~~ 85 (393) -.|-|.|...+.+..++..+.-. .-|+|.+.....+.+++| .-++.-..+|. ...|...+.+...+++.. + T Consensus 1 ~~~~v~vn~~n~~~g~~~~~er~-~Lfig~~~~~~~~~~~~~------~~sdld~~lg~~~~~lk~~v~aa~~naG~~-~ 72 (376) T protein:vir:37 1 MFPSVQINALNQLSGETKEIERH-ALFVGVGTTNQGKLLALT------PDSDFDKVFGETDTDLKKQVRAAMLNAGQN-W 72 (376) T ss_pred CCCeEEEecccccCCCcccccce-EEeeccccccccceeeec------CccchHhhhCCCchHHHHHHHHHHhCCCCc-E Confidence 34679999999999888877654 667887764434433333 33444444453 366777777777776653 3 Q ss_pred EEEecccccccccccchhcccccccccchhhhhhhhhhhhhccccccccccchHHHHHHHHHhhc----c-cceEEEEec Q lcl|Aclame:pro 86 IVRVAESDDSDTLTANIVGTQENGKFTGIKALLTAQSTVFVKPKLLCVPQHDNQAVATELLSVAK----K-LNAFAFISD 160 (393) Q Consensus 86 vv~~~~~~~~~~~~~~~~~~~~~~~~~gl~al~~~~~~~~~~~~~l~apg~s~~~v~~al~~~a~----~-~~~~~~i~~ 160 (393) ...+..... .....+.|++.+.....+..-.++.|=-++.+-..++.++++ + .+..+++.. T Consensus 73 ~~~~~~~~~--------------~~~~~~~Av~~a~~~~s~E~V~v~~pv~t~~a~i~aa~~~a~el~~~~~Rpv~file 138 (376) T protein:vir:37 73 FAHVYIAQE--------------DGYDFVECVKKANQTASFEYCVNTRYLGVDKASIGKLQECYAELLAKFGRRTFFIQA 138 (376) T ss_pred EEEEEeecC--------------CchHHHHHHHHhhhhcCceEEEEeccccccHHHHHHHHHHHHHHHHhcCCeEEEEEe Confidence 222211110 001245555555444333333333341123444444444443 3 356677766 Q ss_pred CCCCc--c---hhhhhh-------hcccccceEEEeccceeEeeccCCceEEechhHHHHHHHHhhhccCCceecCCCce Q lcl|Aclame:pro 161 NGATT--K---EQAYTY-------RQNFSQREGMMIFGDWKSYNTDKKAYDTDYAVARACALQAYIDKTVGWHKNISNVE 228 (393) Q Consensus 161 ~~~~~--~---~~a~~~-------~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~S~~~ag~~a~~D~~~G~~~spaN~~ 228 (393) ..+.. . ++..+| +..+.+.+..+..- .|. -.-|.+||.+|+. ..-++.||.... T Consensus 139 ~r~~~~~~~~~e~w~~y~~~~~al~~gia~~~V~~V~~---~~g---------n~~G~~aGRl~~a--aVsVadspgRV~ 204 (376) T protein:vir:37 139 VQGINHDQSDGETWDQYVQKLTTLQQTIVADHVCLVPL---LFG---------NETGVLAGRLANR--AVTVADSPARVQ 204 (376) T ss_pred ccCcCcccccccCHHHHHHHHHHhhcccccccceeeee---ehh---------hhHHHHHHHHhhc--ccchhhCcccee Confidence 65321 1 111222 23334443332210 110 2367788887654 233677887754 Q ss_pred ecceeec---eeecc-cccCCCchhhhhhcccceEEEE--eC-CCEEEEecccCCC-CcccceeehhhHHHHHHHHHHHH Q lcl|Aclame:pro 229 LDGVTGI---TKAVE-FDINESSTEANYLNEKGITICL--NH-NGFRYWGSRTLAT-DTRWAFQQSVRTAQIIKETIGAG 300 (393) Q Consensus 229 l~gv~~~---~~~~~-~~~~~~~~~~~~ln~~gi~~~~--~~-~G~~~wG~rT~~~-d~~~~~i~~rR~~~~i~~~i~~~ 300 (393) -.-+.++ ..+.+ ....++....+.|..+|-++.. +| .|+-+=.+||++. .+.++||..+|+.+-+.|.++.. T Consensus 205 tG~l~gl~~~~lp~d~~~~~l~~a~l~aLd~agy~vp~~Y~gy~G~Y~~d~~tl~~~gsDY~~ie~~RVvdKa~R~vR~~ 284 (376) T protein:vir:37 205 TGALVSLGSANKPLDKDRNELTLAHLKSLETARYSVPMWYPDYDGYYWADGRTLDVEGGDYQVIENLRVVDKVARKVRLL 284 (376) T ss_pred ccccccccccccccCcCcccCCHHHHHHHHhCCCeEEEeeCCCCceEEeCceEeccCCCChhhhhhhhHHHHHHHHHHHH Confidence 3333333 23222 2234567788889999999984 56 5787778899865 36699999999999999998888 Q ss_pred hHHhhcccC---CHHHHHHHHHHHHHHHHHHhhccccc--ccceEEEecC--CCCHHHhhCCEEEEEEEEEecCcceeEE Q lcl|Aclame:pro 301 LAWAVDMPL---TPLRVKTMLEAINNKLRSWASGDDPR--ILGARVWVAE--EITADIIKSGKFVIKYDYHWIPSLESLG 373 (393) Q Consensus 301 ~~~~v~epn---~~~~~~~i~~~i~~~l~~l~~~g~~~--~~~~~v~~~~--~nt~~~i~~G~~~~~v~~~p~~p~e~i~ 373 (393) .-..+...- ++.-++..+.-+..=|+.|.+..... .+..++...+ +-+..-+...++.+.+.+.|.--.+.|+ T Consensus 285 ai~~i~D~~lnst~~sia~~~~yi~~pLr~M~~s~~i~g~~fpGeI~~p~d~Di~i~w~s~~~V~I~~~v~P~~~pk~It 364 (376) T protein:vir:37 285 AIGKIADRSFNSTTSSTEYHKNYFAKPLRDMSKSATINGKDFPGECMPPKDDAITIVWQSKTKVTIYIKVRPYDCPKEIT 364 (376) T ss_pred HHHHhCCcccCcchhhHHHHHHHHHHHHHHHHhcchhccccccceeecCCCCCceEEeeccceEEEEEEEEeccCCceEE Confidence 777765422 34445555555655677776543211 1111232222 2223334778899999999999999999 Q ss_pred EEEEEcchHHHH Q lcl|Aclame:pro 374 LEQRVNDEYVVD 385 (393) Q Consensus 374 ~~~~~~~~~~~~ 385 (393) ..+..+-.-..+ T Consensus 365 v~I~Ldlsn~~~ 376 (376) T protein:vir:37 365 ANIFLDLDSLGE 376 (376) T ss_pred EEEEeecCCCCC Confidence 887776443333 No 52 >protein:vir:276 Length: 369 # NCBI annotation: putative tail sheath protein # Family: family:all:669 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536656;genbank:gi:17975134;genbank:GeneID:929090 Probab=98.75 E-value=6.8e-08 Score=59.88 Aligned_cols=337 Identities=11% Similarity=0.042 Sum_probs=194.9 Q ss_pred CCCCcccCCCeEEEEcCCCcccccccccceeEEEEeec--ccccccccccceEEeecchhhhhhcccc-cchhhhhhhhh Q lcl|Aclame:pro 1 MSILDTYLHGVEVVEVNAGGVTISTAATSVIGVVCTGD--QADAETFPLNTPVLITNPLNYLEKAGST-GTLRRTLNSIG 77 (393) Q Consensus 1 m~m~~~~~~GV~v~ev~~~~~~i~~v~tav~g~vg~a~--~~d~~~~p~~~~vl~t~~~~~~~~~g~~-~tl~~~~~~~~ 77 (393) |+ .|-|.|...+.+..++..+.- ...|||+.. ....+. ..++..++....+|.. ..|...+.+.. T Consensus 1 m~-----~~~V~in~~n~~qg~~~~ver-~~lfig~g~~~~~~g~~------~~~~~~sdld~~lg~~ds~lk~~v~aa~ 68 (369) T protein:vir:27 1 MA-----WPTVIIKILNLMNGPIADIEC-HFLFVIRGTVSGEVRNL------IMVDSTSDLDDVLAEASAEGLAIVKAAQ 68 (369) T ss_pred CC-----CCceEEecccccCCCcccccc-eEEEEEeccccccccce------EEecCccchHhhcCCcChhHHHHHHHHH Confidence 55 477999999999888877664 467785543 222222 2344445555555543 45777777777 Q ss_pred cccCceEEEEEecccccccccccchhcccccccccchhhhhhhhhhhhhcccccccccc-chHHHHHHHHHhh----cc- Q lcl|Aclame:pro 78 SIVKTPTVIVRVAESDDSDTLTANIVGTQENGKFTGIKALLTAQSTVFVKPKLLCVPQH-DNQAVATELLSVA----KK- 151 (393) Q Consensus 78 ~~~~~~~~vv~~~~~~~~~~~~~~~~~~~~~~~~~gl~al~~~~~~~~~~~~~l~apg~-s~~~v~~al~~~a----~~- 151 (393) .+++.. +...+.. ... ......|++.+.....+ ..+...+- +..+...++.+.+ .+ T Consensus 69 ~naG~~-w~a~~~p---~~~------------~~~~~~Av~~a~~~~s~--E~V~v~~p~t~~a~i~aaq~~a~el~~~~ 130 (369) T protein:vir:27 69 LNGKQA-WTAGVMI---LSE------------EDNWQDAVKKANEVSSF--EFVVLGFDAETKAMIEDAITLRTELKNSL 130 (369) T ss_pred hCCCCc-eEEEEEE---eCC------------chhHHHHHHhhhhhCCc--cEEEEecCcccHHHHHHHHHHHHHHHHhc Confidence 766543 3222221 111 11234455555443333 33444443 2333333333333 33 Q ss_pred cceEEEEecCCC--Cc---c-------hhhhhhhcccccceEEEeccceeEeeccCCceEEechhHHHHHHHHhhhccCC Q lcl|Aclame:pro 152 LNAFAFISDNGA--TT---K-------EQAYTYRQNFSQREGMMIFGDWKSYNTDKKAYDTDYAVARACALQAYIDKTVG 219 (393) Q Consensus 152 ~~~~~~i~~~~~--~~---~-------~~a~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~S~~~ag~~a~~D~~~G 219 (393) .+..+++...++ .. . ....+-+..+.+.+..++..++.. + .-.|.++|.+|.. ..- T Consensus 131 ~R~vffi~e~~~~~~~~~~~e~w~dy~a~l~al~~g~a~~~V~vv~~~~~~----g------n~~G~~aGRl~n~--aVs 198 (369) T protein:vir:27 131 GREVGVLCQLPAINNDPTNGQTWSEWLADTVDIPKDVASEYISVVPNVHAA----G------DTLGKYAGRLANK--EVS 198 (369) T ss_pred CCeEEEEEeccccCCCccccCCHHHHHHHHHHHhhccCcccceeeeeeccc----c------chHHHHHHHHHhc--ccc Confidence 346666665332 11 1 122333456677777665322211 1 2467788888753 233 Q ss_pred ceecCCCceecceeecee-ec-ccccCCCchhhhhhcccceEEE--EeC-CCEEEEecccCCC-CcccceeehhhHHHHH Q lcl|Aclame:pro 220 WHKNISNVELDGVTGITK-AV-EFDINESSTEANYLNEKGITIC--LNH-NGFRYWGSRTLAT-DTRWAFQQSVRTAQII 293 (393) Q Consensus 220 ~~~spaN~~l~gv~~~~~-~~-~~~~~~~~~~~~~ln~~gi~~~--~~~-~G~~~wG~rT~~~-d~~~~~i~~rR~~~~i 293 (393) +..||....-..+.|+.. +. +....++.+....|..+|-++. ++| .|+-+=.+||++. .+.++||-.+|..+-+ T Consensus 199 Iadsp~RVktG~l~g~~~~p~d~~g~~l~~a~l~aLd~agysvp~~Y~gy~G~Yw~d~~tl~~~gsDYq~iE~~RVvdKa 278 (369) T protein:vir:27 199 IADSPARVQTGSVLGNTELMKDKAGKALDLATLKALESNRIAVPMWYPDYPGQYWTTGRTLDVPGGDYQDIRHIRVAMKA 278 (369) T ss_pred hhcCcceeeecccccccccccCCCCcccCHHHHHHHHhCCCeEEEeeCCCCceEEeCceEeccCCCCeehhhhhhHHHHH Confidence 678887765444444321 11 2223355677888999999998 456 5777778899865 4669999999999999 Q ss_pred HHHHHHHhHHhhccc---CCHHHHHHHHHHHHHHHHHHhhcccccccceEEEec--CCCCHHHhhCCEEEEEEEEEecCc Q lcl|Aclame:pro 294 KETIGAGLAWAVDMP---LTPLRVKTMLEAINNKLRSWASGDDPRILGARVWVA--EEITADIIKSGKFVIKYDYHWIPS 368 (393) Q Consensus 294 ~~~i~~~~~~~v~ep---n~~~~~~~i~~~i~~~l~~l~~~g~~~~~~~~v~~~--~~nt~~~i~~G~~~~~v~~~p~~p 368 (393) .|.++...-..+..| .++.-++..+..+..=|++|.+.+ +..++.-. .+-+-.-....++.+-+.+.|.-- T Consensus 279 ~R~vR~~Ai~~i~Dr~lnstp~sia~~~~~~~~pLr~M~ks~----fpgei~~P~d~dI~i~w~~k~~V~I~~~vrP~~~ 354 (369) T protein:vir:27 279 ARKVRIRAIARIADRTLNSTPQSIAAAKLYFTQDLRTMALTG----VPGEIYPPEDEDIQIKWVNSTDVEIYMSVQPYEC 354 (369) T ss_pred HHHHHHHHHHHhcCcccccChhHHHHHHHHHhhHHHHHHhhc----CCeEEecCCCCceEEEeeccceEEEEEEEeeccC Confidence 998887776666543 344556666666666788886532 33333321 122112224457888888888888 Q ss_pred ceeEEEEEEEcchHH Q lcl|Aclame:pro 369 LESLGLEQRVNDEYV 383 (393) Q Consensus 369 ~e~i~~~~~~~~~~~ 383 (393) .+.|+..+..|-.-. T Consensus 355 pk~it~~I~ldl~~~ 369 (369) T protein:vir:27 355 PVKITIAISVKQGDY 369 (369) T ss_pred CceEEEEEEEeccCC Confidence 899999999964433 No 53 >protein:vir:80052 Length: 331 # NCBI annotation: gp14 # Family: family:all:396 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468718;genbank:gi:157325298;genbank:GeneID:5601743 Probab=98.64 E-value=1.5e-07 Score=57.96 Aligned_cols=316 Identities=11% Similarity=0.050 Sum_probs=182.5 Q ss_pred CCcccCCCeEEEEcCCCcccccccccceeEEEEeecccccccccccceEEeecchhhhhhcccccchhhhhhhhhcccCc Q lcl|Aclame:pro 3 ILDTYLHGVEVVEVNAGGVTISTAATSVIGVVCTGDQADAETFPLNTPVLITNPLNYLEKAGSTGTLRRTLNSIGSIVKT 82 (393) Q Consensus 3 m~~~~~~GV~v~ev~~~~~~i~~v~tav~g~vg~a~~~d~~~~p~~~~vl~t~~~~~~~~~g~~~tl~~~~~~~~~~~~~ 82 (393) |.++-. +|.|.--...+.+. ..-.++.+....... .....++..+....++....++.+...++.++.. T Consensus 1 ~~~~iv-~V~v~~~~~~~~~~--~~~~~~~~~~~~t~~--------~~~~y~s~~~v~~d~~~~~~~Ykaa~~~f~Q~~~ 69 (331) T protein:vir:80 1 MVETIT-DVRVHISVLYPSPR--IGLGRPAIFVKGTAM--------GYKEYTTLEELKDTFADNTEVYAKAKAVFLQKDR 69 (331) T ss_pred Ccccee-cceeeecccccccc--cccCcceeEEecccc--------ceEEEechhhhccCCCCCcHHHHHHHHHHhccCc Confidence 666542 45444332222222 222333333322111 1334556666666777777788888888888765 Q ss_pred eEEEEEecccccccccccchhcccccccccchhhhhhhhhhhhhccccccccccchHHHHHHHHHhhcccceEEEEecCC Q lcl|Aclame:pro 83 PTVIVRVAESDDSDTLTANIVGTQENGKFTGIKALLTAQSTVFVKPKLLCVPQHDNQAVATELLSVAKKLNAFAFISDNG 162 (393) Q Consensus 83 ~~~vv~~~~~~~~~~~~~~~~~~~~~~~~~gl~al~~~~~~~~~~~~~l~apg~s~~~v~~al~~~a~~~~~~~~i~~~~ 162 (393) ...+........ +.+.++.... ...+ -.+...+.+ .+-..++....+..+.++...... T Consensus 70 ~~~i~v~~~~~~-----------------~~~~a~~a~~-~~~w--~~~~~~~~~-~~~~~a~a~~~~a~~~~f~~~~~~ 128 (331) T protein:vir:80 70 PDTVAVITYEDT-----------------KLLEAAEAYF-LKSW--HFALLAEFK-AADALALSNLIEEQKFKFAVFQVT 128 (331) T ss_pred cceEEEeccchH-----------------HHHHHHHHhc-cCce--eEEEeecCC-HHHHHHHHHHHhhCCcEEEEEecC Confidence 433322211110 1122222111 0111 122222333 232334555555544555443321 Q ss_pred CCcchhhhhhhcccccceEEEeccceeEeeccCCceEEechhHHHHHHHHhhhccCCceecCCCc-eecceeeceeeccc Q lcl|Aclame:pro 163 ATTKEQAYTYRQNFSQREGMMIFGDWKSYNTDKKAYDTDYAVARACALQAYIDKTVGWHKNISNV-ELDGVTGITKAVEF 241 (393) Q Consensus 163 ~~~~~~a~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~S~~~ag~~a~~D~~~G~~~spaN~-~l~gv~~~~~~~~~ 241 (393) . .....+..+ .+....++++. .. . -+.+.+.|..+..|.-+- .-.++ +|.||..- T Consensus 129 ~--~~~~~~~~~--~~~t~~~~~~~-------~~---~-~~~aa~~g~~~~~~~g~~---t~~fk~~l~GV~~~------ 184 (331) T protein:vir:80 129 A--VADITPLAK--NTRTIAIVHSK-------TG---E-KLDAALIGNVASLPVGSA---TWKGRHGLAGITSE------ 184 (331) T ss_pred c--hHHHHHhhc--cccEEEEEcCC-------cc---c-hhHHHHHHHHHhcCccce---eeeeecccCCCCCC------ Confidence 1 122222111 23333444331 11 1 235566677777765332 22444 35565532 Q ss_pred ccCCCchhhhhhcccceEEEEeCCC-EEEEecccCCCCcccceeehhhHHHHHHHHHHHHhHHhhcc----cCCHHHHHH Q lcl|Aclame:pro 242 DINESSTEANYLNEKGITICLNHNG-FRYWGSRTLATDTRWAFQQSVRTAQIIKETIGAGLAWAVDM----PLTPLRVKT 316 (393) Q Consensus 242 ~~~~~~~~~~~ln~~gi~~~~~~~G-~~~wG~rT~~~d~~~~~i~~rR~~~~i~~~i~~~~~~~v~e----pn~~~~~~~ 316 (393) .++..|.+.|..+|+|++...+| -.++.+.|++++ ||-+.+-.+|++..+++.+...+-. |-|+.=... T Consensus 185 --~lt~t~~~al~~~~~N~y~~~~~~~~~~~G~~~~G~----~iD~~~~~dWl~~~lq~~l~~ll~~~~kiPy~~~G~~~ 258 (331) T protein:vir:80 185 --ELKVSEIDAIQKAGGMCYIEKAGIAQTSEGKTVSGE----FIDSIHGDDWIKATIETRLQKLLTETDKLTFDARGIAL 258 (331) T ss_pred --CCCHHHHHHHHhcCceEEEEecCeeEEecceEeCch----hHHHHHHHHHHHHHHHHHHHHHHHhCCCCccChhhHHH Confidence 25788999999999999977555 457778888874 8999999999999999988876543 556666788 Q ss_pred HHHHHHHHHHHHhhccccc------ccceEEEec--CCCCHHHhhCCEEE-EEEEEEecCcceeEEEEEEEcc Q lcl|Aclame:pro 317 MLEAINNKLRSWASGDDPR------ILGARVWVA--EEITADIIKSGKFV-IKYDYHWIPSLESLGLEQRVND 380 (393) Q Consensus 317 i~~~i~~~l~~l~~~g~~~------~~~~~v~~~--~~nt~~~i~~G~~~-~~v~~~p~~p~e~i~~~~~~~~ 380 (393) |+..++.-|+..+.+|... ..++.+... ++.+++|+.+++.. +.+.+.+...+++|++....+- T Consensus 259 l~a~i~~~~~~av~~G~I~~g~~~~~~~~~v~~~~~~~~s~~dr~~R~~~~~~~~~~~~gaI~~v~i~~~v~~ 331 (331) T protein:vir:80 259 LQSELTTVLNEGFANGIIDSNDETGEPNFSITALQRSDLNDDDIAKRNYKGLSFRYKRSGAIHSVDVYGEVEV 331 (331) T ss_pred HHHHHHHHHHHHHhCCceecCccCCCcceEEEeCchhcCCHHHHhccCCCCeEEEEEEcceEEEEEEEEEEeC Confidence 8888888888888877432 346777764 46899999998877 8888999999999999999888 No 54 >protein:vir:78782 Length: 370 # NCBI annotation: putative tail sheath protein # Family: family:all:669 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285652;genbank:gi:148727158;genbank:GeneID:5220132 Probab=98.63 E-value=3e-08 Score=61.85 Aligned_cols=342 Identities=10% Similarity=0.045 Sum_probs=193.2 Q ss_pred cCCCeEEEEcCCCcccccccccceeEEEEeecccccccccccceEEeecchhhhhhcccc-cchhhhhhhhhcccCceEE Q lcl|Aclame:pro 7 YLHGVEVVEVNAGGVTISTAATSVIGVVCTGDQADAETFPLNTPVLITNPLNYLEKAGST-GTLRRTLNSIGSIVKTPTV 85 (393) Q Consensus 7 ~~~GV~v~ev~~~~~~i~~v~tav~g~vg~a~~~d~~~~p~~~~vl~t~~~~~~~~~g~~-~tl~~~~~~~~~~~~~~~~ 85 (393) -.|-|.|...+.+..++..+.-. .-|+|+++....+.++ ++..++....+|.. ..|...+.+...+++.. + T Consensus 1 ~~~~v~vn~~n~~~g~~~~~er~-~lfig~~~~~~g~~~~------~~~~sdld~~l~~~ds~lk~~v~aa~~naG~~-~ 72 (370) T protein:vir:78 1 MWPYVQIYNLNQMQGPVTEVERH-LLFIGSAASNTGKLLS------LNAQSDFDQLLGAADSELKANLLAARDNAGQN-W 72 (370) T ss_pred CCceEEEeeccccCCCcCcccee-EEEEecccccccceEe------ecCccCHHHhcCCcChhHHHHHHHHHhCCCCc-e Confidence 24789999999999888877654 6678888754444333 34445555666544 55777777766666543 3 Q ss_pred EEEecccccccccccchhcccccccccchhhhhhhhhhhhhcccccccccc-chHHHHHHHHHhhcc----c-ceEEEEe Q lcl|Aclame:pro 86 IVRVAESDDSDTLTANIVGTQENGKFTGIKALLTAQSTVFVKPKLLCVPQH-DNQAVATELLSVAKK----L-NAFAFIS 159 (393) Q Consensus 86 vv~~~~~~~~~~~~~~~~~~~~~~~~~gl~al~~~~~~~~~~~~~l~apg~-s~~~v~~al~~~a~~----~-~~~~~i~ 159 (393) ........ .....+.|++.+... ..+..+...|- ++.+.+.++.++++. + +..+++. T Consensus 73 ~~~~~p~~---------------~~~d~~~Av~~a~~~--~s~E~V~v~~~~s~~a~~~a~~~~a~el~n~~~Rpv~fil 135 (370) T protein:vir:78 73 SAAAYVLP---------------TDKPWLDAARDAQQT--QSFEGVVVLGQEWHQAAINAAHALNQELIAKWGRWQFMLL 135 (370) T ss_pred EEEEEEec---------------CchhHHHHHHHHHhh--CCccEEEEecCcchHHHHHHHHHHHHHHHHhcCCeEEEEE Confidence 32221111 111356666666443 33344445554 444555556555543 3 4666676 Q ss_pred cCCCCcchh--------hhhhhcccccceEEEeccceeEeeccCCceEEechhHHHHHHHHhhhccCCceecCCCceecc Q lcl|Aclame:pro 160 DNGATTKEQ--------AYTYRQNFSQREGMMIFGDWKSYNTDKKAYDTDYAVARACALQAYIDKTVGWHKNISNVELDG 231 (393) Q Consensus 160 ~~~~~~~~~--------a~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~S~~~ag~~a~~D~~~G~~~spaN~~l~g 231 (393) ..++....+ ..+-+..+.+.+..++.-|+. -.-|.+||.+|.. ..-+..||.-....- T Consensus 136 e~~~~~~~e~w~~y~~~l~al~~gia~~~V~vvp~~~g------------~~~G~~aGRL~na--avsVadsP~Rv~tG~ 201 (370) T protein:vir:78 136 AVPAIADEQDWATYEAELATLQDGIAASSVSLIPQLWP------------TLAGAYAGRLCNR--AVSIADSPCRVKTGA 201 (370) T ss_pred eecCCCCcCCHHHHHHHHHHhhhccccccceEEeeecc------------ccHHHHHHHHhcC--eeeecccceeeeccc Confidence 655433221 222334555666555533321 1136788876642 223667777544322 Q ss_pred eeece-ee-cccccCCCchhhhhhcccceEEEE--eC-CCEEEEecccCCC-CcccceeehhhHHHHHHHHHHHHhHHh- Q lcl|Aclame:pro 232 VTGIT-KA-VEFDINESSTEANYLNEKGITICL--NH-NGFRYWGSRTLAT-DTRWAFQQSVRTAQIIKETIGAGLAWA- 304 (393) Q Consensus 232 v~~~~-~~-~~~~~~~~~~~~~~ln~~gi~~~~--~~-~G~~~wG~rT~~~-d~~~~~i~~rR~~~~i~~~i~~~~~~~- 304 (393) +.++. .+ ......++....+.|..+|-++.. +| .|+-+=.+||++. .+.++||..+|+.+-+.|.++..+-.. T Consensus 202 l~gl~~~p~d~~~~~l~~a~l~aLd~agy~vp~~Y~gy~G~Y~~d~~tl~~~gsDYq~ie~~RVvdKa~R~vR~~ai~~i 281 (370) T protein:vir:78 202 LVGLGNKPVGKDGIPLPLATLQTLEANRYSVPMWYPDYDGIYWADGRTLDAEGGDYQVIENLRIAYKVARRMRLRAIARI 281 (370) T ss_pred cccccccccccCCcccCHHHHHHHHhCCCeEEEeeCCCCceEEeCceEeccCCCChhhhhhhhHHHHHHHHHHHHHHHHh Confidence 33321 11 123344666788899999999984 56 5787778899865 366999999999999999999554444 Q ss_pred hcccCCHH--HHHHHHHHHHHHHHHHhhcccccc--cceEEEe--cCCCCHHHhhCCEEEEEEEEEecCcceeEEEEEEE Q lcl|Aclame:pro 305 VDMPLTPL--RVKTMLEAINNKLRSWASGDDPRI--LGARVWV--AEEITADIIKSGKFVIKYDYHWIPSLESLGLEQRV 378 (393) Q Consensus 305 v~epn~~~--~~~~i~~~i~~~l~~l~~~g~~~~--~~~~v~~--~~~nt~~~i~~G~~~~~v~~~p~~p~e~i~~~~~~ 378 (393) .++-.++. .+...+..+..=|+++...+.... +...+.- |.+-+..-+..+++.+.+.+.|.--.+.|+..+.. T Consensus 282 ~D~~lnst~gsia~~~~~~~~~L~ema~s~~i~~~~fpgeI~~p~d~Di~i~w~s~~~v~I~~~v~P~~~pk~Itv~I~L 361 (370) T protein:vir:78 282 GDRSFNSTPGSTAAAITYFGKDLREMAKSTTINGQPFPGDIASPQDGDIRIQWVAKNLVSVFVVVRTVDCPKGITVNIML 361 (370) T ss_pred CCcccCCCCcchhHHHHHHHhhHHHHHhhhhhcccccceeEeccCCCcceEEeeccceEEEEEEEEeccCCceEEEEEEE Confidence 44333322 222222333333555544443211 1122222 12223333467889999999998888998888877 Q ss_pred cchHHHHHH Q lcl|Aclame:pro 379 NDEYVVDLV 387 (393) Q Consensus 379 ~~~~~~~~~ 387 (393) |-..=++-= T Consensus 362 Dls~e~~~~ 370 (370) T protein:vir:78 362 DLSLNNGEG 370 (370) T ss_pred eeccccCCC Confidence 533211100 No 55 >protein:vir:5260 Length: 502 # NCBI annotation: hypothetical protein # Family: family:all:396 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852765;genbank:gi:31544040;uniprot:Q7Y5T5;genbank:GeneID:2753559 Probab=98.41 E-value=7.9e-07 Score=54.04 Aligned_cols=361 Identities=12% Similarity=0.052 Sum_probs=186.0 Q ss_pred CCCCcccCCCeEEEEcCCCcccccccccceeEEEEeecccccccccccceEEe-ecchhhhhhcccccchhhhhhhhhcc Q lcl|Aclame:pro 1 MSILDTYLHGVEVVEVNAGGVTISTAATSVIGVVCTGDQADAETFPLNTPVLI-TNPLNYLEKAGSTGTLRRTLNSIGSI 79 (393) Q Consensus 1 m~m~~~~~~GV~v~ev~~~~~~i~~v~tav~g~vg~a~~~d~~~~p~~~~vl~-t~~~~~~~~~g~~~tl~~~~~~~~~~ 79 (393) |+.+-+- =|+|.. +-.+.++...+-..+.++++....... +..++++. ++..+....||.....+.+-..+|.+ T Consensus 1 msip~s~--ivnV~i-~~~~~a~~~~~f~~~l~l~~~~~~~~~--~~~~r~~~~~s~~~V~~~FG~~s~ey~aA~~yF~q 75 (502) T protein:vir:52 1 MALSISH--IVNVQL-NTVPKSAARKSFGIVALFTPEAGQAFA--DEKTRYVYVENQRDVEQLFGTNSETAKAAQPFFAQ 75 (502) T ss_pred CCCCccc--eeEEee-ccccccccccccCceEEEeeccCcccc--CCccceEEecCHHHHHHhcCCChHHHHHHHHHhcC Confidence 8866653 233332 222444555566667777755332211 22334433 45556666777666655555555554 Q ss_pred cCceE--EEEEecccc-------------cccccccc-----------hhcccc----------cc-------------- Q lcl|Aclame:pro 80 VKTPT--VIVRVAESD-------------DSDTLTAN-----------IVGTQE----------NG-------------- 109 (393) Q Consensus 80 ~~~~~--~vv~~~~~~-------------~~~~~~~~-----------~~~~~~----------~~-------------- 109 (393) ..... ++-+-.... ........ .+++.. .. T Consensus 76 ~p~P~~l~igR~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~l~i~i~g~~~t~~~i~lS~~ts~~~vA~~i~~~l~ 155 (502) T protein:vir:52 76 SPRAKQLIVARWQKSASTIEATKNTLSGATLSDDLERFKSVVNGRFSLTIGGDVKKVDGLSFARLADFNAVATKIQEKLT 155 (502) T ss_pred CCccceEEEEeccccccceeechhhhhhhhhHHhHHHhhhhcCceeEEEecceeeeeeccccccccchhHHHHHHHhhhc Confidence 32211 111110000 00000000 000000 00 Q ss_pred ----------------------------------------cccchhhhhhhhhhhhhccccccccccchHHHHHHHHHhh Q lcl|Aclame:pro 110 ----------------------------------------KFTGIKALLTAQSTVFVKPKLLCVPQHDNQAVATELLSVA 149 (393) Q Consensus 110 ----------------------------------------~~~gl~al~~~~~~~~~~~~~l~apg~s~~~v~~al~~~a 149 (393) ..+.+.+++.........+......|........+|.++. T Consensus 156 ~~~~~~tv~~d~~~~~F~i~s~ttg~~~~~~~~~a~~~~~~gt~~a~~l~l~~~~~av~v~~~~~g~~aet~~~al~a~~ 235 (502) T protein:vir:52 156 TLSVAVSIAYDETGNRFIVSANVAGEDKKTEIDYAIDEGGEGEYIGALLKLENGQASRKVGKNSVSLKKETLGEALFNVA 235 (502) T ss_pred ccccceEEEEecCCceEEEEeccCCCcceeEEEEeecCCcchhHHHHHhccccccceeeeeeecccccccCHHHHHHHHH Confidence 0000000110000000000000112222223444454444 Q ss_pred cccceEEEEecCCCCcch---hhhhhhcccccceEEEecccee-E--eec--------cCC---ceE-----EechhHHH Q lcl|Aclame:pro 150 KKLNAFAFISDNGATTKE---QAYTYRQNFSQREGMMIFGDWK-S--YNT--------DKK---AYD-----TDYAVARA 207 (393) Q Consensus 150 ~~~~~~~~i~~~~~~~~~---~a~~~~~~~~s~~~~~~~p~~~-~--~~~--------~~~---~~~-----~~p~S~~~ 207 (393) +.-++........+.+.+ +..+|.+... +...|.+|-. . ... ..+ .-. .-.+.+.+ T Consensus 236 ~~~~~w~~~~~a~~~~~~~~la~a~~iea~~--~~f~~~~~d~~~~~~~~~~i~~~l~a~~~~~t~~~y~~~~~~~~aa~ 313 (502) T protein:vir:52 236 EVNNTWYGFTVAAQLTDSEVEAAAKYAQANT--KLFGANVIRAEQIEWSADNIYKKLYDAGLDHTLAMFDKNDMYPVSSA 313 (502) T ss_pred hccCceEEEEEeecCChhHHHHHHHHHhhcC--cEEEEEecCcceeccccchHHHHHHhccCceeEEEecCCcchhHHHH Confidence 333322222222222222 2333433321 1122222110 0 000 000 000 11355667 Q ss_pred HHHHHhhhccCC-ceecCCCceecceeeceeecccccCCCchhhhhhcccceEEEEeCCC-EEEEecccCCCCcccceee Q lcl|Aclame:pro 208 CALQAYIDKTVG-WHKNISNVELDGVTGITKAVEFDINESSTEANYLNEKGITICLNHNG-FRYWGSRTLATDTRWAFQQ 285 (393) Q Consensus 208 ag~~a~~D~~~G-~~~spaN~~l~gv~~~~~~~~~~~~~~~~~~~~ln~~gi~~~~~~~G-~~~wG~rT~~~d~~~~~i~ 285 (393) .|..+.+|..+- -...-.+|.+.||..- .++..|.+.|..+|+|++...+| ..+..+++++++ ||- T Consensus 314 ~g~~as~~f~~~~g~iT~~fk~l~GV~~~--------~lt~t~~~al~~~~~N~y~~~~~~~~~~~G~~~~G~----~iD 381 (502) T protein:vir:52 314 LARLLSTNFAANNSTLTLKFKQQPTITAD--------EITATEFAKAKRLGINVYTYFDDVAMIAEGTVIGGK----FAD 381 (502) T ss_pred HHHHHhcCCCcCcceeeecccccCCcccC--------cCCHHHHHHHHhcCceEEEEecCeeEEecCeeeCCc----hhh Confidence 788888885432 2344556777776532 25788999999999999966555 346688888874 788 Q ss_pred hhhHHHHHHHHHHHHhHHhhcc-----cCCHHHHHHHHHHHHHHHHHHhhccccc------------------ccceEEE Q lcl|Aclame:pro 286 SVRTAQIIKETIGAGLAWAVDM-----PLTPLRVKTMLEAINNKLRSWASGDDPR------------------ILGARVW 342 (393) Q Consensus 286 ~rR~~~~i~~~i~~~~~~~v~e-----pn~~~~~~~i~~~i~~~l~~l~~~g~~~------------------~~~~~v~ 342 (393) +.+-.+|++..|++.+...++. |-|+.=...|+..++.-|+..+.+|... ..|+.+. T Consensus 382 ~~~~~~Wl~~~lq~~l~~~L~~s~~kIPy~~~G~~~l~a~i~~~l~~a~~~G~I~~G~~~~~~~g~~~~~d~~~~gy~v~ 461 (502) T protein:vir:52 382 EIVILDWFVDAVQKEVFARLYKSPTKIPLTDKGQAILIAAVEKVCLEGINNGAFAPGKWTGAGFGNLSTGDYLDKGFYVW 461 (502) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCcccChhHHHHHHHHHHHHHHHHHhcCccccccccCcccceeeecccccCceEEE Confidence 9999999999999998766542 5666667888888888888877777432 1356666 Q ss_pred ec--CCCCHHHhhCCEE-EEEEEEEecCcceeEEEEEEEcc Q lcl|Aclame:pro 343 VA--EEITADIIKSGKF-VIKYDYHWIPSLESLGLEQRVND 380 (393) Q Consensus 343 ~~--~~nt~~~i~~G~~-~~~v~~~p~~p~e~i~~~~~~~~ 380 (393) .. ++.+++|..+++. -+.+.+.+...+++|++.+..+. T Consensus 462 ~~~~~~~s~~dr~~R~~~~~~~~~~~aGaIh~v~i~~nv~~ 502 (502) T protein:vir:52 462 AAPMDTLSDSDRQARRATPIQTAVKLAGAIHSSDVIVNYNR 502 (502) T ss_pred eCchhhCCHHHHHcccCCCeEEEEEECceEEEEEEEEEEeC Confidence 54 4688999999888 89999999999999999999988 No 56 >protein:vir:4517 Length: 498 # NCBI annotation: tail sheath protein # Family: family:all:369 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599043;genbank:gi:19549001;genbank:GeneID:935236 Probab=98.38 E-value=9.4e-07 Score=53.63 Aligned_cols=355 Identities=13% Similarity=0.092 Sum_probs=164.7 Q ss_pred CCCC-----ccc-CCCeEEEEcCCCcccccccccceeEEEEeecccccccccccceEEeecchhhhhhcccccchhhhhh Q lcl|Aclame:pro 1 MSIL-----DTY-LHGVEVVEVNAGGVTISTAATSVIGVVCTGDQADAETFPLNTPVLITNPLNYLEKAGSTGTLRRTLN 74 (393) Q Consensus 1 m~m~-----~~~-~~GV~v~ev~~~~~~i~~v~tav~g~vg~a~~~d~~~~p~~~~vl~t~~~~~~~~~g~~~tl~~~~~ 74 (393) |++. .+. .||+|+|..++...... ....+-++|-.-. ... .+.++|++++|..++...+|...-|+.-.. T Consensus 1 M~IsF~~IP~~iRvP~~y~E~dns~A~~~~--~~q~vLiiGq~la-~gs-~~~~~~v~v~s~~~a~~lfG~GSml~~M~~ 76 (498) T protein:vir:45 1 MTISFNTIPSNTLVPLFYAEMDNQAANTAQ--DSGASLLIGHANN-GAE-IVANSLVLMPSADYARQICGAGSQLARMVE 76 (498) T ss_pred CCCchhhcCcccccCeEEEEEeCCCCCCCC--CCcceEEEEecCC-ccc-cccceeEEecCHHHHHHhcCcCcHHHHHHH Confidence 6554 344 49999998876663222 2334556664422 222 245899999999999999998766654444 Q ss_pred hhhcc-cCceEEEEEec-------------------------------------ccccccccccchhcc----cc----- Q lcl|Aclame:pro 75 SIGSI-VKTPTVIVRVA-------------------------------------ESDDSDTLTANIVGT----QE----- 107 (393) Q Consensus 75 ~~~~~-~~~~~~vv~~~-------------------------------------~~~~~~~~~~~~~~~----~~----- 107 (393) .+... .-...+++.+. ..++.........-. .+ T Consensus 77 a~~~~n~~~~l~~i~~~d~aG~aA~g~it~tg~at~~G~l~l~Igg~~v~v~V~~gdTaa~vA~al~aaina~~~lPVTA 156 (498) T protein:vir:45 77 AYRQTDPFGELYVIAVPEATGAAATVTLTVTGEATESGTVNVYVGRTRVQAPVTNGDNVTTIASSIQDAINAVPTLPFTA 156 (498) T ss_pred HHHHhCCcceEEEEeeCCcccceeEEEEEeecccCCCcEEEEEECCEEEEEEecCCCCHHHHHHHHHHHHhCCCCCceEE Confidence 33221 11111121111 111100000000000 00 Q ss_pred ---------cccccchhhhhhhhhhh--------hhcccccc------ccccchHHHHHHHHHhhcccceEEEEe----- Q lcl|Aclame:pro 108 ---------NGKFTGIKALLTAQSTV--------FVKPKLLC------VPQHDNQAVATELLSVAKKLNAFAFIS----- 159 (393) Q Consensus 108 ---------~~~~~gl~al~~~~~~~--------~~~~~~l~------apg~s~~~v~~al~~~a~~~~~~~~i~----- 159 (393) +...+|... -...... -..|..+. +.|.-.+++..+|.++.+...+..... T Consensus 157 ~~~~~~VtlTAr~kG~~G-N~I~l~~~~~~~~~ge~~p~Glt~~itamagGag~PD~a~alaal~~~~~~~I~~p~~D~a 235 (498) T protein:vir:45 157 SSSAGVVTLTARHKGLCG-NEIPVSLNYYGFGGGEVLPAGVQIAVATGTAGTGAPVLTGAVAAMADEPFDYIGLPFNDTA 235 (498) T ss_pred EecCceEEEEeeccCccc-cceeEEEeeccccccccccceeeEEEEccCCCccCchhHHHHHHhccCCccEEEEeeCCHH Confidence 000000000 0000000 00011110 001112334455555444443333221 Q ss_pred ---------------------------cCCCCcchhhhhhhcccccceEEEeccceeEeeccCCceEEechh---HHHHH Q lcl|Aclame:pro 160 ---------------------------DNGATTKEQAYTYRQNFSQREGMMIFGDWKSYNTDKKAYDTDYAV---ARACA 209 (393) Q Consensus 160 ---------------------------~~~~~~~~~a~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~S---~~~ag 209 (393) .....+..+...+....++.|..+.+ ...+ ..-|+- +.+|| T Consensus 236 sL~al~~~L~~~sgRw~~~~q~~g~~~~a~~gT~~~l~t~g~~~N~~~it~~~-------~~~~--~~sp~~~~AAa~aa 306 (498) T protein:vir:45 236 SVNTLVTEMNDTSGRWSYARQLYGHVYTAKTGTLSELVNAGDQFNQQHITLAG-------YEKE--TQTPADELAASRTA 306 (498) T ss_pred HHHHHHHHHhhhhhhhhHHhhcCeEEEEeccCCHHHHHHhhhccCCceEEEEe-------cCCC--CCChHHHHHHHHHH Confidence 11122233333344444444443321 0011 111322 23333 Q ss_pred HHH---hhhccCCceecCCCceecceeeceeecccccCCCchhhhhhcccceEEEEeCCC-EEEEecccC-------CCC Q lcl|Aclame:pro 210 LQA---YIDKTVGWHKNISNVELDGVTGITKAVEFDINESSTEANYLNEKGITICLNHNG-FRYWGSRTL-------ATD 278 (393) Q Consensus 210 ~~a---~~D~~~G~~~spaN~~l~gv~~~~~~~~~~~~~~~~~~~~ln~~gi~~~~~~~G-~~~wG~rT~-------~~d 278 (393) +.+ +.|..| .--...|.|+..+... ..++..|.|.|...||.++.-+.| ..+--..|. ..| T Consensus 307 ~~A~~l~~DPAr----PL~tl~L~Gi~~p~~~----~r~~~~ern~LL~~Gist~~V~~G~V~I~R~ITTY~~n~~G~~D 378 (498) T protein:vir:45 307 RAAVFIRNDPAR----PTQTGELVGMLPAPKG----KRFTMTEQQTLLSHGVATAYVESGVLRIQRDVTTYRKNAYGVAD 378 (498) T ss_pred HHHHHhhccccc----ccCceeecceecCCch----hcCChHHHHHHHhCCcceEEEcCCeEEEEeeeeeeeecCCCCcc Confidence 333 344322 2223567777765433 234677899999999999977777 333333332 247 Q ss_pred cccceeehhhHHHHHHHHHHHHhHHhhc-ccCCHH-----------HHHHHHHHHHHHHHHHhhccccc---c--cceEE Q lcl|Aclame:pro 279 TRWAFQQSVRTAQIIKETIGAGLAWAVD-MPLTPL-----------RVKTMLEAINNKLRSWASGDDPR---I--LGARV 341 (393) Q Consensus 279 ~~~~~i~~rR~~~~i~~~i~~~~~~~v~-epn~~~-----------~~~~i~~~i~~~l~~l~~~g~~~---~--~~~~v 341 (393) +.|..|...|+.+|+.+.++..+...-. +.+..+ +-..|+..+-.-++.|...|-.. . ..-.| T Consensus 379 ~syLDi~T~~tl~yvr~~~r~~i~~kfpR~KLa~dg~~~~~gq~IvTp~~ir~ell~~y~~le~~givEn~~~~~~~LiV 458 (498) T protein:vir:45 379 NSYLDSETLHTSAYVLRKLKSVITSKYGRHKLASDGTRFGPGQAIVTPAVIKGELLATYRQLERAGIVENYELFKQYLVV 458 (498) T ss_pred hhhhhhhhHHHHHHHHHHHHHHhhhhcCCeeecccCcccCCCCcccchHHHHHHHHHHHHhhhhhccccChhhhcceeEE Confidence 8899999999999999999988764422 121111 55677777777777777666211 1 12223 Q ss_pred EecCCCCHHHhhCCEEEEEEEEEecCc----ceeEEEEEEEcchHH Q lcl|Aclame:pro 342 WVAEEITADIIKSGKFVIKYDYHWIPS----LESLGLEQRVNDEYV 383 (393) Q Consensus 342 ~~~~~nt~~~i~~G~~~~~v~~~p~~p----~e~i~~~~~~~~~~~ 383 (393) .-+.+|+ .|+.+.+-.-.+-+ +-.|.|+++++.+.. T Consensus 459 erd~~dp------nRln~~~p~d~vn~L~V~A~~~~f~lq~~~~~~ 498 (498) T protein:vir:45 459 ERDASVP------NRLNTLFPPDYVNQLRVFAVVNQFRLQYSEESA 498 (498) T ss_pred EECCCCC------cEEEEEecccccCchhhhhhhhhhheehhhcCC Confidence 3333332 23333332222222 334566666655443 No 57 >protein:vir:489 Length: 498 # NCBI annotation: putative sheath protein # Family: family:all:369 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543096;swissprot:trembl:q8w623;genbank:gi:18249908;uniprot:Q8W623;genbank:GeneID:929698 Probab=98.21 E-value=2.5e-06 Score=51.29 Aligned_cols=354 Identities=12% Similarity=0.065 Sum_probs=163.9 Q ss_pred CCCC-----ccc-CCCeEEEEcCCCcccccccccceeEEEEeecccccccccccceEEeecchhhhhhcccccchhhhhh Q lcl|Aclame:pro 1 MSIL-----DTY-LHGVEVVEVNAGGVTISTAATSVIGVVCTGDQADAETFPLNTPVLITNPLNYLEKAGSTGTLRRTLN 74 (393) Q Consensus 1 m~m~-----~~~-~~GV~v~ev~~~~~~i~~v~tav~g~vg~a~~~d~~~~p~~~~vl~t~~~~~~~~~g~~~tl~~~~~ 74 (393) |++. .++ .||+|++..++....-.. +..+-++|..-.. ...+.++|+++.|..++...||...-++.-.. T Consensus 1 M~IsF~~IP~~iRvP~~y~E~dns~A~~~~~--~qrvLiiGq~la~--gt~~~~~~v~v~s~~~a~~~fG~GS~l~~M~~ 76 (498) T protein:vir:48 1 MTISFSAVPSDTLVPLFYAEMDNSAANTAVT--SAPALLIGHASND--AAIEVNSLVLMPSADYARQICGAGSQLARMVD 76 (498) T ss_pred CCccccccCcccccceEEEEEecCCCccccC--CcceEEEeecCcc--ccccccceEEecCHHHHHHhcCcccHHHHHHH Confidence 6543 344 499999998877654333 2345566644222 22356899999999999999998766654444 Q ss_pred hhhcc-cCceEEEEEec-------------------------------------ccccccccccchhcc----c------ Q lcl|Aclame:pro 75 SIGSI-VKTPTVIVRVA-------------------------------------ESDDSDTLTANIVGT----Q------ 106 (393) Q Consensus 75 ~~~~~-~~~~~~vv~~~-------------------------------------~~~~~~~~~~~~~~~----~------ 106 (393) .+... .-...+++.+. ..++...-.....-. . T Consensus 77 a~~~~n~~~~l~~i~~~D~ag~aA~g~it~tg~at~~G~l~l~Igg~~v~v~V~~gdTaa~vA~al~aai~a~~~lPVTA 156 (498) T protein:vir:48 77 VYRQTDPFGELYVIAVPEARGAAATVRVTVTGEAEESGTLSLYVGRSSVQVPVVNGDDATAVATAIKEAVNGVITLPFAA 156 (498) T ss_pred HHHHhCCCceeEEEeeCCcccceeEEEEEecccccCCceEEEEECCEEEEEeecCCCCHHHHHHHHHHHHhCCCCcceEE Confidence 33221 11111222111 111100000000000 0 Q ss_pred --------ccccccchhhhhhhhhhhh--------hccccc------cccccchHHHHHHHHHhhcccceEEEEe----- Q lcl|Aclame:pro 107 --------ENGKFTGIKALLTAQSTVF--------VKPKLL------CVPQHDNQAVATELLSVAKKLNAFAFIS----- 159 (393) Q Consensus 107 --------~~~~~~gl~al~~~~~~~~--------~~~~~l------~apg~s~~~v~~al~~~a~~~~~~~~i~----- 159 (393) -+...+|...= ....... ..|..+ .+-|.-.+++..+|.++.+...+..... T Consensus 157 ~~~~~~VtlTAr~kG~~GN-~I~l~~~~~~~~~ge~~p~Glt~~itamsgGag~PDia~aLaal~~~~~~~I~~p~~D~a 235 (498) T protein:vir:48 157 SSDAGVVTLTARHKGLYGN-ELPVCLNYYGSGGGEILPAGLQVVTEAGTAGSGAPDLTAAVAAMGDEAFDFIGLPFNDAA 235 (498) T ss_pred EecCcEEEEEeeecccccc-cceeeeeeccCcccccccceeeEEEEcccCCccCcchHHHHHhhccCCccEEEEeecCHH Confidence 00000111000 0000000 000000 0001112334444444444443333221 Q ss_pred ---------------------------cCCCCcchhhhhhhcccccceEEEeccceeEeeccCCceEEechhH---HHHH Q lcl|Aclame:pro 160 ---------------------------DNGATTKEQAYTYRQNFSQREGMMIFGDWKSYNTDKKAYDTDYAVA---RACA 209 (393) Q Consensus 160 ---------------------------~~~~~~~~~a~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~S~---~~ag 209 (393) .....+..+...+....++.|..+.+ ... ...-|+.. .+|+ T Consensus 236 sl~al~~~L~~~sgRw~~~~q~~g~~~~a~~gT~~~l~t~g~~~N~~~it~~~-------~~~--~~~~p~~~~AAa~a~ 306 (498) T protein:vir:48 236 SINMMMTEMNDSSGRWSYARQLYGHVYTAKLGTLSELVNAGDMHNQQHITLAG-------YEK--ETQSPVDELVASRLA 306 (498) T ss_pred HHHHHHHHHhhhhhhhhHHhhcCeEEEEeccCCHHHHHHhhhccCCceEEEEe-------cCC--CCCChHHHHHHHHHH Confidence 11222233344444444444443322 001 11123222 2233 Q ss_pred HHH---hhhccCCceecCCC-ceecceeeceeecccccCCCchhhhhhcccceEEEEeCCC-EEEEecccC-------CC Q lcl|Aclame:pro 210 LQA---YIDKTVGWHKNISN-VELDGVTGITKAVEFDINESSTEANYLNEKGITICLNHNG-FRYWGSRTL-------AT 277 (393) Q Consensus 210 ~~a---~~D~~~G~~~spaN-~~l~gv~~~~~~~~~~~~~~~~~~~~ln~~gi~~~~~~~G-~~~wG~rT~-------~~ 277 (393) +.+ +.|..| |-| ..|.|+..+... ..++..|.|.|.-.||.++.-..| ..+--..|. .. T Consensus 307 ~aA~~l~~DPAr-----PLqtl~L~Gi~~p~~~----~r~~~~ern~LL~~Gist~~V~~G~V~I~R~ITTY~~n~~G~~ 377 (498) T protein:vir:48 307 REAVFIRNDPAR-----PTQTGELVGMLPAPKG----KRFIMTEQQTLLSHGVATAYVEGGTLRIQRSVTTYKKNAYGVA 377 (498) T ss_pred HHHHhhhccccc-----cccceeeeccccCCch----hcCChHHHHHHHhcCcceEEEcCCeEEEEeeeeeeeecCCCCc Confidence 332 444333 333 567777755443 234677899999999999965555 444444443 24 Q ss_pred CcccceeehhhHHHHHHHHHHHHhHHhhc-ccCCHH-----------HHHHHHHHHHHHHHHHhhccccc---c--cceE Q lcl|Aclame:pro 278 DTRWAFQQSVRTAQIIKETIGAGLAWAVD-MPLTPL-----------RVKTMLEAINNKLRSWASGDDPR---I--LGAR 340 (393) Q Consensus 278 d~~~~~i~~rR~~~~i~~~i~~~~~~~v~-epn~~~-----------~~~~i~~~i~~~l~~l~~~g~~~---~--~~~~ 340 (393) |+.|..|...|+.+|+.+.++..+...-. +.+..+ +-..|+..+-.-++.|...|-.. . ..-. T Consensus 378 D~syLDi~T~~tl~yvr~~~r~~i~~kfpR~KLa~dg~~~~~gq~IvTp~~ir~eli~~y~~le~~given~~~~~~~Li 457 (498) T protein:vir:48 378 DNSYLDSETLHTSAYVLRKLKSVITSKYGRHKLANDGTRFGPGQAIVTPAVIKGELLATYRQMERAGIVENYDLFKQYLI 457 (498) T ss_pred chhhhhhhhHHHHHHHHHHHHHHhhhhcCCceecccCcccCCCCcccchHHHHHHHHHHHHhhhhhccccChhhhcceeE Confidence 78899999999999999999988763322 222222 55677777777777777666211 1 1222 Q ss_pred EEecCCCCHHHhhCCEEEEEEEEEecCc----ceeEEEEEEEcchHH Q lcl|Aclame:pro 341 VWVAEEITADIIKSGKFVIKYDYHWIPS----LESLGLEQRVNDEYV 383 (393) Q Consensus 341 v~~~~~nt~~~i~~G~~~~~v~~~p~~p----~e~i~~~~~~~~~~~ 383 (393) |..+.+|+ .|+.+.+-.-.+-+ +-.|.|+++++.... T Consensus 458 Verd~~dp------nRln~~~p~d~vn~L~V~A~~~~f~lq~~~~~~ 498 (498) T protein:vir:48 458 VERDADNP------NRLNTLFPPDYVNQLRVFAVVNQFRLQYSEESA 498 (498) T ss_pred EEECCCCC------cEEEEEecccccCchhhhhhhhhhhhhhhhcCC Confidence 33333332 33333333222222 223455555543332 No 58 >protein:vir:4463 Length: 498 # NCBI annotation: Tail Sheath protein # Family: family:all:369 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700386;genbank:gi:23505458;genbank:GeneID:955665 Probab=98.06 E-value=5.7e-06 Score=49.34 Aligned_cols=355 Identities=13% Similarity=0.092 Sum_probs=162.8 Q ss_pred CCCC-----ccc-CCCeEEEEcCCCcccccccccceeEEEEeecccccccccccceEEeecchhhhhhcccccchhhhhh Q lcl|Aclame:pro 1 MSIL-----DTY-LHGVEVVEVNAGGVTISTAATSVIGVVCTGDQADAETFPLNTPVLITNPLNYLEKAGSTGTLRRTLN 74 (393) Q Consensus 1 m~m~-----~~~-~~GV~v~ev~~~~~~i~~v~tav~g~vg~a~~~d~~~~p~~~~vl~t~~~~~~~~~g~~~tl~~~~~ 74 (393) |++. .+. .||+|+|..++.... ......+-++|..-. +. ..+.++|++++|..++...+|...-|+.-.. T Consensus 1 M~IsF~~IP~~iRvP~~y~E~dns~A~~--~~~~q~vLiiGq~la-~g-s~~~~~~v~v~s~~~a~~~fG~GSml~~M~~ 76 (498) T protein:vir:44 1 MAISFNSIPSDTRVPLFYAEMDNSAANT--ARDSGASLLIGHASN-DA-SIAVNSLVLVSSVDYARQICGAGSQLARMVG 76 (498) T ss_pred CCCchhhcCcccccCeEEEEEeCCCCCC--CcCCcceEEEEecCc-cc-ccccceeEeecCHHHHHHhcCcccHHHHHHH Confidence 6544 344 499999987766532 222234555664422 22 3356899999999999999998776654444 Q ss_pred hhhcc-cCceEEEEEecccc--------------cccccccchhccc---------------------------c----- Q lcl|Aclame:pro 75 SIGSI-VKTPTVIVRVAESD--------------DSDTLTANIVGTQ---------------------------E----- 107 (393) Q Consensus 75 ~~~~~-~~~~~~vv~~~~~~--------------~~~~~~~~~~~~~---------------------------~----- 107 (393) .+... .-...+++.+.... ....+....+++. + T Consensus 77 a~~~~n~~~~l~~i~~~D~aG~aAtg~it~tg~at~~G~l~l~Igg~~v~v~V~~gdTaa~vA~al~aaina~~~lPVTA 156 (498) T protein:vir:44 77 AYRKTDPFGELYVIAVPESTGAAATVALTVTGEATETGTVNVYTGRTRVQAPVTSGDDAAAVAVSIKDAVNANPDLPFTA 156 (498) T ss_pred HHHHhCCCceeEEEecCCcccceeEEEEEeecccCCCcEEEEEECCEEEEEEecCCCCHHHHHHHHHHHHhCCCCCceEE Confidence 43221 11111222111110 0000000000000 0 Q ss_pred ---------cccccchhhhhhhhhhh--------hhccccc---c---ccccchHHHHHHHHHhhcccceEEEEec---- Q lcl|Aclame:pro 108 ---------NGKFTGIKALLTAQSTV--------FVKPKLL---C---VPQHDNQAVATELLSVAKKLNAFAFISD---- 160 (393) Q Consensus 108 ---------~~~~~gl~al~~~~~~~--------~~~~~~l---~---apg~s~~~v~~al~~~a~~~~~~~~i~~---- 160 (393) +...+|... -...... -..|..+ + +.|...+++..+|.++.+...+...... T Consensus 157 ~~~~~~vtlTAr~kG~~G-N~I~l~~~~~~~~~ge~~p~Glt~titamsgGag~PDia~alaal~~~~~~~i~~p~~D~a 235 (498) T protein:vir:44 157 TSEAGVVTLTARHKGLYG-NEIPVTLNYYGFGGGEVLPAGVNITVASGVKGAGAPALNDAVAAMGDEPFDYIGLPFNDTA 235 (498) T ss_pred eeccceEEEEEeccCccc-CcceEEEeeccCccccccccceeEEEEcccCCccCchhHHHHHhhccCCccEEEEeecCHH Confidence 000001000 0000000 0001000 0 0111123455566665555544433211 Q ss_pred ----------------------------CCCCcchhhhhhhcccccceEEEeccceeEeeccCCceEEechhH---HHHH Q lcl|Aclame:pro 161 ----------------------------NGATTKEQAYTYRQNFSQREGMMIFGDWKSYNTDKKAYDTDYAVA---RACA 209 (393) Q Consensus 161 ----------------------------~~~~~~~~a~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~S~---~~ag 209 (393) .-..+..+...+....++.|..+.+ ...+ ..-|+-. .+|+ T Consensus 236 sl~al~~~L~~~sgRw~~~~q~~g~~~~a~~gT~a~l~t~g~~~N~~~it~~~-------~~~~--~~sp~~~~AAa~a~ 306 (498) T protein:vir:44 236 SVNSMATEMNDSSGRWSYVRQLYGHVYTAKTGTLSELVAAGDQFNLQHITLAG-------YEKD--TQTPADELAASRTA 306 (498) T ss_pred HHHHHHHHHhhhhcchHHHhhcCeEEEEeccCCHHHHHHhhhccCCceEEEEe-------cCCC--CCCHHHHHHHHHHH Confidence 1111222233333333333332211 1111 0112222 3333 Q ss_pred HHH---hhhccCCceecCCCceecceeeceeecccccCCCchhhhhhcccceEEEEeCCC-EEEEecccC-------CCC Q lcl|Aclame:pro 210 LQA---YIDKTVGWHKNISNVELDGVTGITKAVEFDINESSTEANYLNEKGITICLNHNG-FRYWGSRTL-------ATD 278 (393) Q Consensus 210 ~~a---~~D~~~G~~~spaN~~l~gv~~~~~~~~~~~~~~~~~~~~ln~~gi~~~~~~~G-~~~wG~rT~-------~~d 278 (393) +.+ +.|..| .--...|.|+..+... ..++..|.|.|...||.++.-+.| ..+--..|. ..| T Consensus 307 ~aA~~l~~DPAr----PL~tl~L~Gi~~p~~~----~r~~~~ern~LL~~Gist~~V~~G~V~I~R~ITTY~~n~~G~~D 378 (498) T protein:vir:44 307 RAAVFIRNDPAR----PTQTGELVDMLPAPKG----KRFTTTEQQTLLSHGVATAYVESGVLRIQRDITTYRKNAYGVAD 378 (498) T ss_pred HHHHHhhccccc----ccCceeecccccCCch----hcCChHHHHHHHhcCcceEEEcCCeEEEEeeeeeeeecCCCCcc Confidence 333 334322 2223567777765433 234677899999999999977777 333333332 247 Q ss_pred cccceeehhhHHHHHHHHHHHHhHHhhc-ccCCH-----------HHHHHHHHHHHHHHHHHhhccccc---c--cceEE Q lcl|Aclame:pro 279 TRWAFQQSVRTAQIIKETIGAGLAWAVD-MPLTP-----------LRVKTMLEAINNKLRSWASGDDPR---I--LGARV 341 (393) Q Consensus 279 ~~~~~i~~rR~~~~i~~~i~~~~~~~v~-epn~~-----------~~~~~i~~~i~~~l~~l~~~g~~~---~--~~~~v 341 (393) +.|..|...|+.+|+.+.++..+...-. +..-. -+-..|+..+-.-++.|...|-.. . ..-.| T Consensus 379 ~syLDi~T~~tl~yvr~~~r~~i~~kfpR~KLa~d~~~~~~gq~IvTp~~ir~eli~~y~~le~~givEn~~~~~~~LiV 458 (498) T protein:vir:44 379 NSYLDSETLHTSAYVLRRLKSVITSKYGRHKLANDGTRFGSGQAIVTPAVIRGELGSTYRQMEREGIVENFDLFQQHLIV 458 (498) T ss_pred hhhhhhhhHHHHHHHHHHHHHHhhhhcCCcccccCCcccCCCcccccHHHHHHHHHHHHHhhhhhccccChhhhcceeEE Confidence 8899999999999999999988753322 22111 255677888877788887766211 1 11222 Q ss_pred EecCCCCHHHhhCCEEEEEEEEEecCcc----eeEEEEEEEcchHH Q lcl|Aclame:pro 342 WVAEEITADIIKSGKFVIKYDYHWIPSL----ESLGLEQRVNDEYV 383 (393) Q Consensus 342 ~~~~~nt~~~i~~G~~~~~v~~~p~~p~----e~i~~~~~~~~~~~ 383 (393) .-+.+|+ .|+.+.+-.-.+-+. -.|.|+++++.+.. T Consensus 459 erd~~dp------nRln~~~p~d~vn~L~V~A~~~~f~lq~~~~~~ 498 (498) T protein:vir:44 459 ERNANDS------NRLDVLFPPDYVNQLRVFAVLNQFRLQYSEEAA 498 (498) T ss_pred EECCCCC------cEEEEEecccccCchhhhhhhhhhhhhhhhhcC Confidence 2233322 334333333322222 23445555544333 No 59 >protein:vir:95263 Length: 450 # NCBI annotation: Phage conserved protein # Family: family:all:396 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944896;genbank:gi:38707836;genbank:GeneID:2744049 Probab=98.00 E-value=7.7e-06 Score=48.63 Aligned_cols=355 Identities=12% Similarity=0.046 Sum_probs=186.6 Q ss_pred CCcccCCCeEEEEcCCCcccccccccceeEEEEeecccccccccccceE-EeecchhhhhhcccccchhhhhhhhhcccC Q lcl|Aclame:pro 3 ILDTYLHGVEVVEVNAGGVTISTAATSVIGVVCTGDQADAETFPLNTPV-LITNPLNYLEKAGSTGTLRRTLNSIGSIVK 81 (393) Q Consensus 3 m~~~~~~GV~v~ev~~~~~~i~~v~tav~g~vg~a~~~d~~~~p~~~~v-l~t~~~~~~~~~g~~~tl~~~~~~~~~~~~ 81 (393) |-++ =|.|..-- .+.++...+-..+.+++..... . .++ ..++..+....||.....+.+-..+|.+.. T Consensus 1 ~~s~---iVnV~i~~-~~~a~~~~~f~~~l~~~~~~~~------~-~r~~~yss~~~V~~~FG~~S~ey~aA~~yF~q~p 69 (450) T protein:vir:95 1 MWNP---IVNVDITL-NTAGTTREGFGLPLFLASTDNF------E-ERVRGYTSLTEVAEDFDENTAAYKAAKQLWSQTP 69 (450) T ss_pred CCCc---eEEEeecc-cccccccccceeEEEEcCCCCC------c-cceeeecCHHHHHHhcCCCcHHHHHHHHHHhCCC Confidence 4443 34444222 4455566666667777754322 1 333 455667777888888877777777777653 Q ss_pred ceEE--EEEecccccccc-------cc---c-chhc---------ccccccccc----hhhhhhhhhhh----------- Q lcl|Aclame:pro 82 TPTV--IVRVAESDDSDT-------LT---A-NIVG---------TQENGKFTG----IKALLTAQSTV----------- 124 (393) Q Consensus 82 ~~~~--vv~~~~~~~~~~-------~~---~-~~~~---------~~~~~~~~g----l~al~~~~~~~----------- 124 (393) .... +-+-........ +. . .+.| ........+ +++........ T Consensus 70 ~p~~l~igr~~~~~t~~~~~~~~~~~~g~lt~tv~G~~~~~~~i~~s~a~s~~~va~~~~tai~~~~~~~~~~~~~s~g~ 149 (450) T protein:vir:95 70 KVTQLYIGRRAMQYTVSIPDAVTESTDYSITVAAGGGISQPYQYTAQSSDTAENVLQQFKTQIEADPTIKDKVSVNVTGS 149 (450) T ss_pred cccEEEEEeeccchhhhhhhhhccccceeEEEEecceeeeeeEEEEEecCChhhHHHHhhhhhcccceeeeeeeeeeecc Confidence 3221 111111000000 00 0 0000 000000000 11100000000 Q ss_pred -------------------hhccccccccccchHHHHHHHHHhhcccceEEEEecCCCCcchhhhh---hhcccccceEE Q lcl|Aclame:pro 125 -------------------FVKPKLLCVPQHDNQAVATELLSVAKKLNAFAFISDNGATTKEQAYT---YRQNFSQREGM 182 (393) Q Consensus 125 -------------------~~~~~~l~apg~s~~~v~~al~~~a~~~~~~~~i~~~~~~~~~~a~~---~~~~~~s~~~~ 182 (393) .-....+...|.....+..+|.++.+...+.+ ....+..+.++..+ |.+..+ +.. T Consensus 150 ~~~~t~~~~~~~~~~~~~l~~~~~~~~~~g~~aet~~~a~~a~~~~~~~w~-~~~~~~~~~~~i~a~a~w~~a~~--~~f 226 (450) T protein:vir:95 150 NGSATMIIAKAGDNDFVKVTTTAQTVYIASTTADTASTALAAIEAYSTDWY-FIAAEDRTQQFVLAMASEIQARK--KIF 226 (450) T ss_pred cceeeeeeeccccchhhccccccceeEecccccccHHHHHHHHHHhhCCeE-EEEecCCCHHHHHHHHHHHhhcC--cEE Confidence 00011111122222234555555554443333 22334444443333 333221 222 Q ss_pred Eeccce-eEeecc--------------C--CceE-E-------echhHHHHHHHHhhhccCCceecCCCceecceeecee Q lcl|Aclame:pro 183 MIFGDW-KSYNTD--------------K--KAYD-T-------DYAVARACALQAYIDKTVGWHKNISNVELDGVTGITK 237 (393) Q Consensus 183 ~~~p~~-~~~~~~--------------~--~~~~-~-------~p~S~~~ag~~a~~D~~~G~~~spaN~~l~gv~~~~~ 237 (393) .|..|- ...+.. . .+.. . -.+.+.++|.....++-+ ..-.+|.+.||..-.. T Consensus 227 ~~~~~~~~~~~~~~~~~~~~i~~~l~~~~~~~t~~~y~~~~~~~~~~aa~~g~~~~~~~g~---~T~~fk~l~Gv~~~v~ 303 (450) T protein:vir:95 227 FTANSDVTALQGTELASANDVPAQLAKNMYTRTVCLWHHAAAEDYPEMAYIAYGAPYDAGS---IAWGNAQLTGVAASLQ 303 (450) T ss_pred EEEcCCchhhhhhhhhcccchHHHHHhccCCeeEEEeeCCCchhHHHHHHHHHhhhcccce---eeeccccccceeeecc Confidence 222221 111000 0 0111 1 123344444433332222 2334677777664322 Q ss_pred ecccccCCCchhhhhhcccceEEEEeCCC-EEEEecccCCCCcccceeehhhHHHHHHHHHHHHhHHhhc-----c-cCC Q lcl|Aclame:pro 238 AVEFDINESSTEANYLNEKGITICLNHNG-FRYWGSRTLATDTRWAFQQSVRTAQIIKETIGAGLAWAVD-----M-PLT 310 (393) Q Consensus 238 ~~~~~~~~~~~~~~~ln~~gi~~~~~~~G-~~~wG~rT~~~d~~~~~i~~rR~~~~i~~~i~~~~~~~v~-----e-pn~ 310 (393) . .....++..|.+.|..+|+|++...+| -.++.++|++++ ||-++|-.+|++..|++.+...+- + |-| T Consensus 304 ~-~~~~~lt~~~~~al~~~~~n~y~~~~~~~~~~~G~~~~G~----~iD~~~~~~wl~~~iq~~l~~ll~~~~~~KiPy~ 378 (450) T protein:vir:95 304 P-SNQRPLTSIQKSALDVRHCNFIDLDGGVPVVRRGITSGGE----WIDIIRGVDWLESDLKTSLRDLLINQKGGKITYD 378 (450) T ss_pred C-ccccccchHHHHHHHhCCcEEEEEecCceeeeCCeeeCcc----hhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCccC Confidence 1 122346788999999999998865444 457888998873 788999999999999999887662 2 778 Q ss_pred HHHHHHHHHHHHHHHHHHhhcccccccceEEEec--CCCCHHHhhCCEEE-EEEEEEecCcceeEEEEEEEcch Q lcl|Aclame:pro 311 PLRVKTMLEAINNKLRSWASGDDPRILGARVWVA--EEITADIIKSGKFV-IKYDYHWIPSLESLGLEQRVNDE 381 (393) Q Consensus 311 ~~~~~~i~~~i~~~l~~l~~~g~~~~~~~~v~~~--~~nt~~~i~~G~~~-~~v~~~p~~p~e~i~~~~~~~~~ 381 (393) +.-...|+..|+.-|+..+.+|. +.++.|... ++.+++|+.++++. +.+.+...-.++.+.++....=| T Consensus 379 ~~G~~~i~a~i~~~l~~a~~~G~--Ia~~~V~~~~~~~~~~~dr~~R~~~~i~~~~~laGAIh~~~i~~~v~~~ 450 (450) T protein:vir:95 379 DTGITRIRQVIETSLQRAVNRNF--LSSYTVNVPKASQVALADKKARILKDVTFAGILAGAILDVDLKGTVAYE 450 (450) T ss_pred hhhHHHHHHHHHHHHHHHHhcCc--ccceeEecCChHhcCHHHHhccCCCCeeEEEEEccceEEEEEEEEEEeC Confidence 88888899999999999888774 456767664 46888999998865 77778888899998877776555 No 60 >protein:vir:3165 Length: 426 # NCBI annotation: capsid protein CP67 # Family: family:all:28419 # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665936;genbank:gi:22091122;genbank:GeneID:951267 Probab=97.85 E-value=1.5e-05 Score=47.03 Aligned_cols=362 Identities=12% Similarity=0.007 Sum_probs=175.5 Q ss_pred CCCCcccCCCeEEEEcCCCcccccccccceeEEEEeecccccccccccceEEeecchhhhhhcccccchhhhhhhhhccc Q lcl|Aclame:pro 1 MSILDTYLHGVEVVEVNAGGVTISTAATSVIGVVCTGDQADAETFPLNTPVLITNPLNYLEKAGSTGTLRRTLNSIGSIV 80 (393) Q Consensus 1 m~m~~~~~~GV~v~ev~~~~~~i~~v~tav~g~vg~a~~~d~~~~p~~~~vl~t~~~~~~~~~g~~~tl~~~~~~~~~~~ 80 (393) || .+ =|.|. +.-.+.++.......+.|+|++...++... ++...+.++..+...-||.....+.+...+|.|+ T Consensus 1 m~--~~---iVnV~-Is~~t~A~~~~~Fg~~liigs~~~~~p~~~-f~~~~~Yss~~~V~~Dfg~~s~~Y~AA~~~f~Q~ 73 (426) T protein:vir:31 1 MP--KQ---IVEIE-LTAEIADRPQETFTDAAIVGTAEEEPPDAE-FGEVNQYSTSTSVGDDYGEDSDVYTASEAIEEMG 73 (426) T ss_pred CC--cc---eEEEE-eecccccccccccceeeeeeeccccccccc-cchhhhhhhHHHHHhcCCCChHHHHHHHHHHhCC Confidence 55 32 34333 444566788888899999999876654331 3455567888888888998888888888888876 Q ss_pred CceEEEEEe-------cccccccc-cccchhc-----ccccccccchhhhhhhhhhhhhc---------------ccccc Q lcl|Aclame:pro 81 KTPTVIVRV-------AESDDSDT-LTANIVG-----TQENGKFTGIKALLTAQSTVFVK---------------PKLLC 132 (393) Q Consensus 81 ~~~~~vv~~-------~~~~~~~~-~~~~~~~-----~~~~~~~~gl~al~~~~~~~~~~---------------~~~l~ 132 (393) .-....... +..++... ....+.+ ..+.....++.+-.......... ..+.. T Consensus 74 ~~~~r~~v~~at~~~~~~~t~~~tv~g~~~s~~a~~~~~a~~i~~~~~~~~~~~~~~~~~~~~t~~g~~t~~~~~~~~~~ 153 (426) T protein:vir:31 74 AEQWRVMVLEATEVTEEELSDGDTIDKVPILGNHEVESPDGDIEFTTDDDPDVEDFDAEIVINSATGDVATSEDSIELTY 153 (426) T ss_pred ceeEEeeccccceeeeccCCcceeecceeeeecccCcchHHHHHHhhccccccccceeeeEeccccceeeccccceeeee Confidence 222111000 00000000 0000000 00000111111111111100000 00000 Q ss_pred cc-cc------chHHHHHHHHHhhcccc---------------eEEEEe--cCCCCcchhhhhhhcccccce-EEEeccc Q lcl|Aclame:pro 133 VP-QH------DNQAVATELLSVAKKLN---------------AFAFIS--DNGATTKEQAYTYRQNFSQRE-GMMIFGD 187 (393) Q Consensus 133 ap-g~------s~~~v~~al~~~a~~~~---------------~~~~i~--~~~~~~~~~a~~~~~~~~s~~-~~~~~p~ 187 (393) .- ++ +......++........ ....+- ..+.+... -...+.+++ .+-|.|- T Consensus 154 s~~dw~~~~~~~s~~~~~~ia~~~~~~~~~~~~~~~~~wa~~~~i~~va~~~e~~~~~~----~~~~~a~~~~~~~y~p~ 229 (426) T protein:vir:31 154 FHADWSQLDEFPSDVNNFAVADRRFDLKGVGVLDETHSWASDEDMGMIANGVNVDDYDS----VDEAMDVAHEVAGYVPS 229 (426) T ss_pred ccCcchhhhcccccchhhhhhccccchhhhhhhHhhhhhhhhcceeeeeeccchhhhcc----hhhhhhhhhcccccccc Confidence 00 00 00000111111111000 011110 00000000 001122221 1223333 Q ss_pred eeEeeccCCceEEechhHHHHHHHHhhhccCCceecCCCceecceeece---eecccccCCCchhhhhhcccceEEEEe- Q lcl|Aclame:pro 188 WKSYNTDKKAYDTDYAVARACALQAYIDKTVGWHKNISNVELDGVTGIT---KAVEFDINESSTEANYLNEKGITICLN- 263 (393) Q Consensus 188 ~~~~~~~~~~~~~~p~S~~~ag~~a~~D~~~G~~~spaN~~l~gv~~~~---~~~~~~~~~~~~~~~~ln~~gi~~~~~- 263 (393) ......... ..--..+.+++.++..+. |..|.=..+.+..... ...+....+...++-.++ ...|.+.. T Consensus 230 ~~~~~~~~~--~~~~~~~~~~~~~aa~~~----~~~~~~~~~~~~~~~~~~~~~~gv~~t~~~~~~A~~~-~~~n~~~~~ 302 (426) T protein:vir:31 230 GDLMMIVDA--SDDDLAAYQLGKFAVSEP----WYNPLWNELPAGETVSKNVGDPEEQGTFEGGDEAEGE-GPVNVLIDV 302 (426) T ss_pred hhheeehhc--cccchhhHHhhhhhhhcc----ccchhhhhccccccceeeccccccccccchhhhhhhc-CCceEEEEe Confidence 211111000 001125678888888774 4444311111111111 111111111122333343 56788866 Q ss_pred CCCEEEEecccCCC-CcccceeehhhHHHHHHHHHHHHhHHhhc---c-cCCHHHHHHHHHHHHHHHHHHhhcccccccc Q lcl|Aclame:pro 264 HNGFRYWGSRTLAT-DTRWAFQQSVRTAQIIKETIGAGLAWAVD---M-PLTPLRVKTMLEAINNKLRSWASGDDPRILG 338 (393) Q Consensus 264 ~~G~~~wG~rT~~~-d~~~~~i~~rR~~~~i~~~i~~~~~~~v~---e-pn~~~~~~~i~~~i~~~l~~l~~~g~~~~~~ 338 (393) .++..+|-.-|.++ ...-.||-++|..+||++.++..++..+= + |-|..-+..|+..|+.=|++.+..|.....+ T Consensus 303 ~~~~~i~~~~~~~G~~~~G~~iD~~~g~dwl~~~iq~~l~~ll~~~~KIpyt~~Gi~~I~~~i~~~L~~~v~~~g~~~~~ 382 (426) T protein:vir:31 303 SDANRVSNAVTTAGADSDTSFFDIRRTKVYTAEMLELDLESLQVSDDDVPFTEDGQAMIEDAIKGTMSGLTGSVGQPLAE 382 (426) T ss_pred cCceeeecceeecccccchhhhhhHHHHHHHHHHHHHHHHHHhhcCCCCccchhHHHHHHHHHHHHHHHHhcCCCccccc Confidence 34566675555443 34456999999999999999999886653 3 8888899999999999999988755333556 Q ss_pred eEEEec-CCCCHHHhhCCEEE-EEEEEEecCcceeEEEEEEEcc Q lcl|Aclame:pro 339 ARVWVA-EEITADIIKSGKFV-IKYDYHWIPSLESLGLEQRVND 380 (393) Q Consensus 339 ~~v~~~-~~nt~~~i~~G~~~-~~v~~~p~~p~e~i~~~~~~~~ 380 (393) |.+-.. ......|..+.++. +++.....-.+.++.++....- T Consensus 383 y~v~~P~~~~~~~dra~R~~~~i~~~~~laGAIh~v~I~g~v~v 426 (426) T protein:vir:31 383 YEVDVPEWDDDDVDRVNRNWGGIDLDARLAQRAHTFSLGLNVSV 426 (426) T ss_pred eeecCCCccccchhhhhhccCCceEEEEEeCcEEEEEEEEEEeC Confidence 766543 22344566776666 7777778889999998888877 No 61 >protein:vir:1996 Length: 495 # NCBI annotation: major tail subunit # Family: family:all:369 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050643;genbank:gi:9633530;genbank:GeneID:2636269 Probab=97.33 E-value=9.3e-05 Score=42.70 Aligned_cols=352 Identities=12% Similarity=0.061 Sum_probs=166.4 Q ss_pred CCC------Cccc-CCCeEEEEcCCCcccccccccceeEEEEeecccccccccccceEEeecchhhhhhcccccchhhhh Q lcl|Aclame:pro 1 MSI------LDTY-LHGVEVVEVNAGGVTISTAATSVIGVVCTGDQADAETFPLNTPVLITNPLNYLEKAGSTGTLRRTL 73 (393) Q Consensus 1 m~m------~~~~-~~GV~v~ev~~~~~~i~~v~tav~g~vg~a~~~d~~~~p~~~~vl~t~~~~~~~~~g~~~tl~~~~ 73 (393) ||- +.+. .||+|+|.-++....-.......+-++|..-.. .. .+.++|++++|..++...||...-++.-. T Consensus 1 m~~i~F~~IP~~iRvP~~y~E~dns~A~~g~~~~~q~vLiiGq~la~-gs-~~~~~pv~v~s~~~a~~~fG~GS~la~M~ 78 (495) T protein:vir:19 1 MSDISFNAIPSDVRVPLTYIEFDNSNAVSGTPAPRQRVLMFGQSGSK-AS-AAPNVPVRIRSGSQASAAFGQGSMLALMA 78 (495) T ss_pred CCCCchhhCCcccccCeEEEEEccCCCCcCCcCCCceEEEEEecCcc-cc-cccceeEEecCHHHHHHhcCcCcHHHHHH Confidence 443 3344 499999998886654333444455666653221 22 24589999999999999999876554433 Q ss_pred hhhhcc-c-------------------------------------CceEEEEEecccccccccccch-----------h- Q lcl|Aclame:pro 74 NSIGSI-V-------------------------------------KTPTVIVRVAESDDSDTLTANI-----------V- 103 (393) Q Consensus 74 ~~~~~~-~-------------------------------------~~~~~vv~~~~~~~~~~~~~~~-----------~- 103 (393) ..+... . ++..+.+.+...++...-.... . T Consensus 79 ~a~~~~n~~~~l~~i~~~D~aG~aA~g~it~tg~at~~G~l~l~I~g~~v~v~V~~gdTaa~vA~al~aaina~~~lPvT 158 (495) T protein:vir:19 79 DAFLNANRVAELWCIPQGNGTGNAAVGEISLSGTAGENGSLVTYIAGQRLAVSVAAGATGAALADLLVARIKGQPDLPVT 158 (495) T ss_pred HHHHHhCCcceEEEEeeCChhhceeEEEEEEeecCCCCcEEEEEECCEEEEEEecCCCCHHHHHHHHHHHhcCCccCceE Confidence 332110 0 0011112222222111100000 0 Q ss_pred --------------cccccccccchhhhhhhhhhhh-----hcccccc------ccccchHHHHHHHHHhhcccceEEEE Q lcl|Aclame:pro 104 --------------GTQENGKFTGIKALLTAQSTVF-----VKPKLLC------VPQHDNQAVATELLSVAKKLNAFAFI 158 (393) Q Consensus 104 --------------~~~~~~~~~gl~al~~~~~~~~-----~~~~~l~------apg~s~~~v~~al~~~a~~~~~~~~i 158 (393) -.+-+...+|. . -..+..+. ..|..+. +.|.-.+++..+|.++.+........ T Consensus 159 A~~~~~~~~~~a~~~VtlTAr~kG~-~-n~idi~~~~~~ge~~p~Glt~titamsgGag~PDia~alaal~~~~~~~I~~ 236 (495) T protein:vir:19 159 AEVRADSGDDDTHADVVLSAKFTGA-L-SAVDVRWNYYAGETTPYGIITAFKAASGKNGNPDISASIAGMGDLQYKYIVM 236 (495) T ss_pred EEeeccCCCCcCceeEEEEEeeccc-c-ccceeEEEeecccccccceeEEEEecCCCCCCcchHHHHHHhccCCCcEEEE Confidence 00011122221 1 01111110 1111110 11222334555555554444433322 Q ss_pred e-----------------------------cCCCCcchhhhhhhcccccceEEEeccceeEeeccCCceEEechhHHHHH Q lcl|Aclame:pro 159 S-----------------------------DNGATTKEQAYTYRQNFSQREGMMIFGDWKSYNTDKKAYDTDYAVARACA 209 (393) Q Consensus 159 ~-----------------------------~~~~~~~~~a~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~S~~~ag 209 (393) . .....+..+...+....++.|..+.+ . ++ ..-||...+|+ T Consensus 237 P~tD~asL~al~~~l~~rw~~~~q~~g~~~~a~~gT~~~l~t~g~~~N~~~it~~~-------~-~g--sp~~~~~~AAA 306 (495) T protein:vir:19 237 PYTDEPNLNLLRTELQERWGPVNQADGFAVTVLSGTYGDISTFGVSRNDHLISCMG-------I-AG--APEPSYLYAAT 306 (495) T ss_pred ecCcHHHHHHHHHHHHHhhhHHHhcCeEEEEeecCCHHHHHHhhhccCCceEEEEe-------c-CC--CCCcHHHHHHH Confidence 0 11122233344444444444443321 0 11 12344333333 Q ss_pred HHHhhh--ccCCceecCCCceecceeeceeecccccCCCchhhhhhcccceEEEEeC-CC-EEEEecccC-------CCC Q lcl|Aclame:pro 210 LQAYID--KTVGWHKNISNVELDGVTGITKAVEFDINESSTEANYLNEKGITICLNH-NG-FRYWGSRTL-------ATD 278 (393) Q Consensus 210 ~~a~~D--~~~G~~~spaN~~l~gv~~~~~~~~~~~~~~~~~~~~ln~~gi~~~~~~-~G-~~~wG~rT~-------~~d 278 (393) +.++.- .+..|-..--...|.|+..+.... .++..|.|.|.-.||.++..+ .| ..+--..|. ..| T Consensus 307 ~aa~~A~~l~~DPArPL~tl~L~Gi~~p~~~~----r~~~~ern~LL~~Gist~~V~~~G~V~I~R~ITTY~~n~~G~~D 382 (495) T protein:vir:19 307 LCAVASQALSIDPARPLQTLTLPGRMPPAVGD----RFTWSERNALLFDGISTFNVNDGGEMQIERMITMYRTNKYGDSD 382 (495) T ss_pred HHHHHHHHhhcccccccCceeecceecCCccc----cCChHHHHHHHhCCcceEEECCCCeEEEEeeeeeeeecCCCCcc Confidence 333221 122232222346777777665433 346779999999999999653 55 334444443 237 Q ss_pred cccceeehhhHHHHHHHHHHHHhHHhhc-ccCCHH-----------HHHHHHHHHHHHHHHHhhccccc---c--cceEE Q lcl|Aclame:pro 279 TRWAFQQSVRTAQIIKETIGAGLAWAVD-MPLTPL-----------RVKTMLEAINNKLRSWASGDDPR---I--LGARV 341 (393) Q Consensus 279 ~~~~~i~~rR~~~~i~~~i~~~~~~~v~-epn~~~-----------~~~~i~~~i~~~l~~l~~~g~~~---~--~~~~v 341 (393) +.|..|++-|+.+|+.+.++..+...-. +.+..+ +=..|+..+-+-++.|...|-.. . ..-.| T Consensus 383 ~syLDi~T~~tl~yvr~~~r~~i~~kfpR~KLa~d~~~~~~gq~IvTp~~ir~ell~~~~~le~~given~~~~~~~LiV 462 (495) T protein:vir:19 383 PSYLNVNTIATLSYLRYSLRTRITQKFPNYKLASDGTRFATGQAVVTPSVIKTELLALFEEWENAGLVEDFDTFKEELYV 462 (495) T ss_pred hhhhhhHHHHHHHHHHHHHHHHHhhhcCCcccccCCCCCCCcccccChHHHHHHHHHHHHhhhhhccccChhhhcceeEE Confidence 8899999999999999999987764333 222222 45667777777777777666211 1 12223 Q ss_pred EecCCCCHHHhhCCEEEEEEEEEecCcce----eEEEEE Q lcl|Aclame:pro 342 WVAEEITADIIKSGKFVIKYDYHWIPSLE----SLGLEQ 376 (393) Q Consensus 342 ~~~~~nt~~~i~~G~~~~~v~~~p~~p~e----~i~~~~ 376 (393) .-+.+|+ +|+.+.+-...+-... .|.|++ T Consensus 463 erd~~dp------nRln~~~p~d~vn~L~V~A~~i~f~L 495 (495) T protein:vir:19 463 ARNKDDK------DRLDVLCGPNLINQFRIFAAQVQFIL 495 (495) T ss_pred EECCCCC------cEEEEEecceeeCceeeeeeeeeeeC Confidence 3333332 3444444443333333 244444 No 62 >protein:vir:3636 Length: 501 # NCBI annotation: gp04 # Family: family:all:396 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705631;genbank:gi:23752316;genbank:GeneID:955753 Probab=73.50 E-value=0.17 Score=24.81 Aligned_cols=363 Identities=11% Similarity=0.016 Sum_probs=158.3 Q ss_pred CCCCcccCCCeEEEEcCCCcccccccccceeEEEEeecccccccccccceEEeecchhhhhhcccccchhhhhhhhhc-- Q lcl|Aclame:pro 1 MSILDTYLHGVEVVEVNAGGVTISTAATSVIGVVCTGDQADAETFPLNTPVLITNPLNYLEKAGSTGTLRRTLNSIGS-- 78 (393) Q Consensus 1 m~m~~~~~~GV~v~ev~~~~~~i~~v~tav~g~vg~a~~~d~~~~p~~~~vl~t~~~~~~~~~g~~~tl~~~~~~~~~-- 78 (393) || -+=+|=-.+..|..+..+.....-...+++-+.... .|.+.....++..+....||.....+.....+|. T Consensus 1 m~--~~~ip~s~iV~V~~~v~~~~~~~~~~~~lllt~~~~----~~~~r~~~y~s~~~V~~~FG~~S~ey~aA~~yFs~~ 74 (501) T protein:vir:36 1 MP--TTTIPIDQIVQMLPGVIGAGGAPGRLTGLVLTQDTS----VQPGQLADFFQETDVENWFGALSNEAKIADAYFPGI 74 (501) T ss_pred CC--cCCcccceEEEEeeeeccCCCcceeeeeEEEeccCC----CCCcceeeecCHHHHHHhcCCChHHHHHHHHHhhcc Confidence 55 322233333333333333333333334444332221 2334334456677777888877666666666664 Q ss_pred --ccC-c-eEEEEEecccccccc-------------------cccchhccc------c-------cccccchhhhhhhhh Q lcl|Aclame:pro 79 --IVK-T-PTVIVRVAESDDSDT-------------------LTANIVGTQ------E-------NGKFTGIKALLTAQS 122 (393) Q Consensus 79 --~~~-~-~~~vv~~~~~~~~~~-------------------~~~~~~~~~------~-------~~~~~gl~al~~~~~ 122 (393) +.. + ..++-+-.+...... +....+++. + .+..+.+++...... T Consensus 75 ~~q~~~P~~l~igR~~~~a~~~~l~g~~l~~~~~a~~~~~sg~l~vti~g~~~~~~i~lS~~ts~~~vA~~i~~al~~~~ 154 (501) T protein:vir:36 75 VNGGQLPYDLKFARYVAADAPASVYGIPLTGVTLAQLQGYSGTLTVTTAAQHVSANISLAAATSFANAATLIEAAFTSPD 154 (501) T ss_pred cCCCccccEEEEEeecCcCcceeEeccchhhhhhhhccceeEEEEEEecceeeeeecccccccCHHHHHHHHhhhhcCcc Confidence 211 1 112211110000000 000000000 0 000001111111000 Q ss_pred h-----------------hhhcc----------------------ccccccccchHHHHHHHHHhhcccceEE--EEecC Q lcl|Aclame:pro 123 T-----------------VFVKP----------------------KLLCVPQHDNQAVATELLSVAKKLNAFA--FISDN 161 (393) Q Consensus 123 ~-----------------~~~~~----------------------~~l~apg~s~~~v~~al~~~a~~~~~~~--~i~~~ 161 (393) . .+... ..+...|........+|.++.+.-.+.. .+.+. T Consensus 155 ~tv~~d~~~~~f~i~s~t~G~~~~i~~~t~~~~ia~~l~Lt~~~~a~v~~~g~~~et~~~al~a~~~~s~~Wy~f~~a~~ 234 (501) T protein:vir:36 155 FVVAYDALRNRFTVVTNATGTAAAISAVTGTNNFADEIGLSAAAGATLQAAGVAADTPASAMNRAVGLSRNWATFTTAWT 234 (501) T ss_pred eEEEEcCcceeEEEEeccCCcceeeEeeecccchhhhhcccccCcceEEecccccccHHHHHHHHHhccCceEEEEEecC Confidence 0 00000 0011112212223444554443333322 22221 Q ss_pred C-CCcchhhhhhhcccccceEEEeccc---eeEeec---------cCC--ceE----EechhHHHHHHHHhhhccC--Cc Q lcl|Aclame:pro 162 G-ATTKEQAYTYRQNFSQREGMMIFGD---WKSYNT---------DKK--AYD----TDYAVARACALQAYIDKTV--GW 220 (393) Q Consensus 162 ~-~~~~~~a~~~~~~~~s~~~~~~~p~---~~~~~~---------~~~--~~~----~~p~S~~~ag~~a~~D~~~--G~ 220 (393) + +....+.-+|.+..+.++....+.. ...... ..+ +.. ...+.+.+.|..+.+|..+ | T Consensus 235 ~~~~~~la~A~wiea~~~~f~~~~~~~~~~~~~~~~~~~i~~~l~~~~y~~t~~~y~~~~~~aa~~g~~as~nf~~~~g- 313 (501) T protein:vir:36 235 AVIADRLAFASWNSGQAYKYMYVAPDLEAASIVSNNAASFGAQVFAAPYQGTLPLYGDQATAGAVMGYAASINFQLRNG- 313 (501) T ss_pred CChHHHHHHHHHHhhcCceEEEEEecCchhhhhccchhhHHHHHHhcCCCcEEEEcCCCCHHHHHHHHHHhcCcccCcc- Confidence 2 1112233334443333332221111 000000 000 011 1234566677777777543 2 Q ss_pred eecCCCcee-cceeeceeecccccCCCchhhhhhcccceEEE--EeC--CCEEEEecccCCCCcccceeehhhHHHHHHH Q lcl|Aclame:pro 221 HKNISNVEL-DGVTGITKAVEFDINESSTEANYLNEKGITIC--LNH--NGFRYWGSRTLATDTRWAFQQSVRTAQIIKE 295 (393) Q Consensus 221 ~~spaN~~l-~gv~~~~~~~~~~~~~~~~~~~~ln~~gi~~~--~~~--~G~~~wG~rT~~~d~~~~~i~~rR~~~~i~~ 295 (393) -..-.+|.+ .|+.. . .++..+++.|..+|.|++ +.+ ..+.+|-.-+++++ +.||-+.+-.+|++. T Consensus 314 ~~T~~fkq~~~Gi~a---~-----~l~~t~a~al~~~~~N~y~~~~~~~~~~~~~~~G~~sG~--~~wiD~~~g~dWL~~ 383 (501) T protein:vir:36 314 RTVLAFRQFNAGVPA---T-----VHDLPTANALRSNNYTYIGAYANAANNYTIAYDGKLSGK--FLWVDTYLDQIYLNA 383 (501) T ss_pred eeeeeccccCCCcCc---C-----cCCHHHHHHHHhcCCcEEEEEecccceeeEEEcCeeecc--chhhhHHHhHHHHHH Confidence 112233443 22222 1 246789999999999987 433 44888865577776 567888888899998 Q ss_pred HHHHHhHHhhcc----cCCHHHHHHHHHHHHHHHHHHhhccccc---------------------------ccceEEEec Q lcl|Aclame:pro 296 TIGAGLAWAVDM----PLTPLRVKTMLEAINNKLRSWASGDDPR---------------------------ILGARVWVA 344 (393) Q Consensus 296 ~i~~~~~~~v~e----pn~~~~~~~i~~~i~~~l~~l~~~g~~~---------------------------~~~~~v~~~ 344 (393) .++..+....-. |-|..=...|+..++.-|+.-+.+|... ..|+-++.+ T Consensus 384 ~iq~~l~~ll~~~~KIPytd~G~~~l~a~i~~~l~~av~nG~I~~Gv~~~~~q~~~I~~~~g~~~~~~~v~~~Gyy~~~~ 463 (501) T protein:vir:36 384 ELQRAEFEAMLAYNSLPYNEDGYTGLYRAGVDVIDAAVTSGIIRAGVTLTNSQLQQIDKAARVAGAGQLVQTRGWYFLIG 463 (501) T ss_pred HHHHHHHHHHhcCCCCccChhhHHHHHHHHHHHHHHHHhCceeecCCCCCcccceeecccccccccccceeccceEEeeC Confidence 888888765433 5666667777777777777776666321 113333333 Q ss_pred -CCCCHHHhhC-CEEEEEEEEEecCcceeEEEEEEEcc Q lcl|Aclame:pro 345 -EEITADIIKS-GKFVIKYDYHWIPSLESLGLEQRVND 380 (393) Q Consensus 345 -~~nt~~~i~~-G~~~~~v~~~p~~p~e~i~~~~~~~~ 380 (393) .+.++++..+ +...+.+.+.---.+++|++-..--- T Consensus 464 ~~~~~~~~R~~R~~p~~~~~y~~~gaIh~v~i~s~~v~ 501 (501) T protein:vir:36 464 DPANPGQARQNRTTPACTLWYSDGGSIQSLTIGSNAVI 501 (501) T ss_pred cccCChhhhhhcccCcEEEEEEeCCceeEEEeeeeeeC Confidence 2244444333 44555666666666676654333222 No 63 >protein:vir:106730 Length: 501 # NCBI annotation: gp04 # Family: family:all:396 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944312;genbank:gi:38638611;genbank:GeneID:2657359 Probab=69.61 E-value=0.22 Score=24.18 Aligned_cols=364 Identities=11% Similarity=0.005 Sum_probs=158.7 Q ss_pred CCCCcccCCCeEEEEcCCCcccccccccceeEEEEeecccccccccccceEEeecchhhhhhcccccchhhhhhhhhc-- Q lcl|Aclame:pro 1 MSILDTYLHGVEVVEVNAGGVTISTAATSVIGVVCTGDQADAETFPLNTPVLITNPLNYLEKAGSTGTLRRTLNSIGS-- 78 (393) Q Consensus 1 m~m~~~~~~GV~v~ev~~~~~~i~~v~tav~g~vg~a~~~d~~~~p~~~~vl~t~~~~~~~~~g~~~tl~~~~~~~~~-- 78 (393) || -+=+|=-.+..|..+..+.........+++-+.... .|.+.....++..+....||.....+.+-..+|. T Consensus 1 m~--~~~ip~s~iV~V~~~v~~~~~~~~~f~~lll~~~~~----~~~~r~~~y~s~~~V~~~FG~~S~ey~aA~~yFsg~ 74 (501) T protein:vir:10 1 MP--TTTIPIDQIVQMLPGVIGAGGAPGRLTGLVLTQDTS----VQPGQLADFFQKTDVENWFGALSNEAKIADAYFPGI 74 (501) T ss_pred CC--cCccccceEEEEeeecccCCCcccccceEEEecccC----CCccceeeecCHHHHHHhcCCChHHHHHHHHHhhhh Confidence 55 322233333333333322222223333333322111 2334334456777777788877666666666664 Q ss_pred --ccC-c-eEEEEEecccccccc---------cccc----------hhccc------c---c----ccccchhhhhhhhh Q lcl|Aclame:pro 79 --IVK-T-PTVIVRVAESDDSDT---------LTAN----------IVGTQ------E---N----GKFTGIKALLTAQS 122 (393) Q Consensus 79 --~~~-~-~~~vv~~~~~~~~~~---------~~~~----------~~~~~------~---~----~~~~gl~al~~~~~ 122 (393) +.. + ..++-+-.+...... +... .+++. + . +.-+.+++...... T Consensus 75 ~~q~p~P~~l~igR~~~~~~~~~l~g~~l~~~~la~~~~~~g~l~i~i~g~~~~~~i~~s~ats~~~vA~~i~~al~~~~ 154 (501) T protein:vir:10 75 VNGGQLPYDLKFARYVAADAPASVYGIPLTGITLAQLQGYSGTLTVTTAAQHVSANISLAAATSFANAATLIEAAFTSPD 154 (501) T ss_pred cCCCccccEEEEEeecccCccceeeeceehhhhhhhhhheeeEEEEeeccceeeeccccccccCHHHHHHHHHHhhcCCc Confidence 221 1 122222111100000 0000 00000 0 0 00000111111000 Q ss_pred -h----------------hhhc----------------------cccccccccchHHHHHHHHHhhcccceEE--EEecC Q lcl|Aclame:pro 123 -T----------------VFVK----------------------PKLLCVPQHDNQAVATELLSVAKKLNAFA--FISDN 161 (393) Q Consensus 123 -~----------------~~~~----------------------~~~l~apg~s~~~v~~al~~~a~~~~~~~--~i~~~ 161 (393) . .+.. +..+...|........+|.++.+.-.+.. .+.+. T Consensus 155 ~tv~~d~~~~~f~i~~~t~G~~~~i~~~t~~~d~a~~l~Lt~~~~a~v~~~g~~aet~~~Al~a~~~~~~~Wy~f~~a~~ 234 (501) T protein:vir:10 155 FVVAYDALRNRFTVVTNTTGTAAAISAVTGTNNLADELGLSAAAGATLQAAGVAADTPASAMNRAVGLSRNWATFTTAWT 234 (501) T ss_pred eEEEEecccceEEEEecccCcceeEEEeeccccchhhhcccccCceeEEecCcccccHHHHHHHHHhcccceEEEEEEec Confidence 0 0000 00111222222223445554443333221 12222 Q ss_pred C-CCcchhhhhhhcccccceEEEeccc---eeEeec---------cCC--ceEE----echhHHHHHHHHhhhccCCc-e Q lcl|Aclame:pro 162 G-ATTKEQAYTYRQNFSQREGMMIFGD---WKSYNT---------DKK--AYDT----DYAVARACALQAYIDKTVGW-H 221 (393) Q Consensus 162 ~-~~~~~~a~~~~~~~~s~~~~~~~p~---~~~~~~---------~~~--~~~~----~p~S~~~ag~~a~~D~~~G~-~ 221 (393) + +....+.-+|.+..+.++....+.. ...... ..+ +... ..|.+.+.|..+.+|..+-. - T Consensus 235 ~~~~~~la~A~wi~a~~~~f~~~~~~~~~~~~~~~~~~~i~~~l~~~~y~~t~~~y~~~~~~aa~~g~~as~nf~~~~g~ 314 (501) T protein:vir:10 235 AVIADRLAFAAWNSGQAYKYMYVAPDLEAASIVTNNAASFGAQVFAAPYQGTLPLYGDQATAGAVMGYAASINFQLRNGR 314 (501) T ss_pred CChHHHHHHHHHHHhcCceEEEEEecCcceeeecccchhHHHHHHhcCCCceEEECCCCCHHHHHHHHHHhcCcccCcce Confidence 2 2222233334444333333222111 110000 000 0111 24567777888888754321 1 Q ss_pred ecCCCcee-cceeeceeecccccCCCchhhhhhcccceEEE--EeC--CCEEEEecccCCCCcccceeehhhHHHHHHHH Q lcl|Aclame:pro 222 KNISNVEL-DGVTGITKAVEFDINESSTEANYLNEKGITIC--LNH--NGFRYWGSRTLATDTRWAFQQSVRTAQIIKET 296 (393) Q Consensus 222 ~spaN~~l-~gv~~~~~~~~~~~~~~~~~~~~ln~~gi~~~--~~~--~G~~~wG~rT~~~d~~~~~i~~rR~~~~i~~~ 296 (393) .+-..|.+ .|+.. + .++..+++.|..+|.|++ +.+ ..+.+|-.-+++++ |.||-+.+-.+|++.. T Consensus 315 ~T~~fkql~~Gv~a---~-----~l~~t~a~al~~~~~N~y~~~~~~~~~~~~~~~G~~sG~--~~wiD~~~g~dWl~~~ 384 (501) T protein:vir:10 315 TVLAFRQFNAGVPA---T-----AHDLPTANALRSNNYTYIGAYANAANNYTIAYDGKLSGK--FLWVDTYLDQIYLNAE 384 (501) T ss_pred eeeeecccCCCcCc---c-----cCCHHHHHHHHhcCCeEEEEEecccceeeEEEcceeecc--ceehhhHhhHHHHHHH Confidence 12223333 22221 1 256789999999999997 333 44889855556765 5678888888888888 Q ss_pred HHHHhHHhhcc----cCCHHHHHHHHHHHHHHHHHHhhccccc---------------------------ccceEEEec- Q lcl|Aclame:pro 297 IGAGLAWAVDM----PLTPLRVKTMLEAINNKLRSWASGDDPR---------------------------ILGARVWVA- 344 (393) Q Consensus 297 i~~~~~~~v~e----pn~~~~~~~i~~~i~~~l~~l~~~g~~~---------------------------~~~~~v~~~- 344 (393) ++..+....-. |-|..=...|...++.-|+.-+.+|-.. ..|+-++.+ T Consensus 385 iq~~l~~ll~~~~kIPyt~~G~~~l~a~i~~~l~~av~nG~Ia~Gv~l~~~q~~~i~~~~g~~~~~~~v~~~Gyy~~~~~ 464 (501) T protein:vir:10 385 LQRAEFEAMLAYNSLPYNEDGYTANYRAGVDVIDAAVTSGIIRAGVTLTNSQLQQIDAVAGVAGAGALVQTRGWYFLIGN 464 (501) T ss_pred HHHHHHHHHhcCCCcccCHHHHHHHHHHHHHHHHHHHhCcceecCcccCcccceeecccccccccccceeccceEEeeCc Confidence 88887764432 5556667777777777777777666321 113333333 Q ss_pred CCCCHHHhh-CCEEEEEEEEEecCcceeEEEEEEEcc Q lcl|Aclame:pro 345 EEITADIIK-SGKFVIKYDYHWIPSLESLGLEQRVND 380 (393) Q Consensus 345 ~~nt~~~i~-~G~~~~~v~~~p~~p~e~i~~~~~~~~ 380 (393) .+.++++.. .+...+.+.+.---.+++|++-..--- T Consensus 465 ~~~~~~~R~~R~~p~~~~~y~~~gaIh~v~i~s~~v~ 501 (501) T protein:vir:10 465 PANPGQARQNRTSPACTLWYSDGGSIQELTIGSNAVI 501 (501) T ss_pred ccCChhhhhhcccCceEEEEEeCCceeEEEeeeeecC Confidence 223333322 344555566666666666654333222 No 64 >protein:vir:101576 Length: 501 # NCBI annotation: gp04 # Family: family:all:396 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958108;genbank:gi:41057654;genbank:GeneID:2716834 Probab=66.64 E-value=0.26 Score=23.75 Aligned_cols=364 Identities=11% Similarity=0.003 Sum_probs=156.3 Q ss_pred CCCCcccCCCeEEEEcCCCcccccccccceeEEEEeecccccccccccceEEeecchhhhhhcccccchhhhhhhhhc-- Q lcl|Aclame:pro 1 MSILDTYLHGVEVVEVNAGGVTISTAATSVIGVVCTGDQADAETFPLNTPVLITNPLNYLEKAGSTGTLRRTLNSIGS-- 78 (393) Q Consensus 1 m~m~~~~~~GV~v~ev~~~~~~i~~v~tav~g~vg~a~~~d~~~~p~~~~vl~t~~~~~~~~~g~~~tl~~~~~~~~~-- 78 (393) || -+=+|=-.+..|..+..+.........+++-+.. ...|.+.....++..+....||.....+.+-..+|. T Consensus 1 m~--~~~ip~s~iV~V~~~v~~~~~~~~~~~~l~l~~~----~~~~~~~~~~~~s~~~V~~~FG~~S~ey~aA~~yFsg~ 74 (501) T protein:vir:10 1 MP--TTTIPIDQIVQMLPGVIGAGGAPGRLTGLVLTQD----TSVQPGQLADFFQETDVENWFGALSNEAKIADAYFPGI 74 (501) T ss_pred CC--CCCcccceEEEEeeecccCCCccccceeEEEecc----CCCCccceEEecCHHHHHHhcCCChHHHHHHHHHhhhh Confidence 55 2212222333333333322222233334333222 223555666677888888888887776666666664 Q ss_pred --ccCc--eEEEEEecccccc---------cccccc----------hhccccc-------------ccccchhhhhhhhh Q lcl|Aclame:pro 79 --IVKT--PTVIVRVAESDDS---------DTLTAN----------IVGTQEN-------------GKFTGIKALLTAQS 122 (393) Q Consensus 79 --~~~~--~~~vv~~~~~~~~---------~~~~~~----------~~~~~~~-------------~~~~gl~al~~~~~ 122 (393) +... ..++-+....... ..+... .+++... +.-+.+++...... T Consensus 75 ~~q~p~P~~l~igR~~~~~~~~~l~g~~l~~~~la~~~~~sg~l~vti~g~~~~~~i~ls~ats~~~vAs~i~~al~~~~ 154 (501) T protein:vir:10 75 VNGGQLPYDLKFARYVAADAPASVYGIPLTGVTLAQLQGYSGTLTVTTAAQHVSANISLAAATSFANAATLIEAAFTSPD 154 (501) T ss_pred cCCCccccEEEEEeecCCCccceEeccchhhhhhhhcceeeeEEEEeeccceeecccccccccCHHHHHHHHhhhccCCc Confidence 2211 1222221110000 000000 0000000 00000000000000 Q ss_pred h-----------------hhhccc----------------------cccccccchHHHHHHHHHhhcccceE--EEEecC Q lcl|Aclame:pro 123 T-----------------VFVKPK----------------------LLCVPQHDNQAVATELLSVAKKLNAF--AFISDN 161 (393) Q Consensus 123 ~-----------------~~~~~~----------------------~l~apg~s~~~v~~al~~~a~~~~~~--~~i~~~ 161 (393) . .+.... .+...|........+|.++.+.-... +.+.+. T Consensus 155 ~tv~~d~~~~~f~its~ttG~~~~i~~~~~~~~la~~l~Lt~~~~a~v~~~g~~aet~~~a~~a~~~~~~~Wy~f~~a~~ 234 (501) T protein:vir:10 155 FVVAYDALRNRFTVVTNATGTAAAISAVTGTNNLADELGLSAAAGATLQAAGVAADTPASAMNRAVGLSRNWATFTTAWT 234 (501) T ss_pred eEEEEcccCceEEEEeeccCCceeEEEeeCchhhhhhcCccccccceEEecCcccccHHHHHHHHHhccCceEEEEEecC Confidence 0 000011 11122222222344444433332222 112222 Q ss_pred CC-CcchhhhhhhcccccceEEEeccc---eeEeec---------cCCceEEe------chhHHHHHHHHhhhccCCc-e Q lcl|Aclame:pro 162 GA-TTKEQAYTYRQNFSQREGMMIFGD---WKSYNT---------DKKAYDTD------YAVARACALQAYIDKTVGW-H 221 (393) Q Consensus 162 ~~-~~~~~a~~~~~~~~s~~~~~~~p~---~~~~~~---------~~~~~~~~------p~S~~~ag~~a~~D~~~G~-~ 221 (393) +. ....+.-+|.+..+.++....+.. ...... ..+-.+.+ .+.+.+.|..+.+|..+-. - T Consensus 235 ~~~~~~la~A~wiea~~~~f~~~~~~~~~~~~~~~~~~~i~~~l~~~~y~~t~~~y~~~~~~aa~~g~~as~nf~~~~g~ 314 (501) T protein:vir:10 235 AVIADRLAFAAWNSGQAYKYMYVAPDLEAASIVTNNAASFGAQVFAAPYQGTLPLYGDQATAGAVMGYAASINFQLRNGR 314 (501) T ss_pred CChHHHHHHHHHHHhcCceEEEEEecCchhhhhhhhhhhHHHHHHhcCCCceEEECCCCcHHHHHHHHHHhhCcccCccc Confidence 22 222223334443333332221110 000000 00111122 2456777777877754321 1 Q ss_pred ecCCCceec-ceeeceeecccccCCCchhhhhhcccceEEEEe--C--CCEEEEecccCCCCcccceeehhhHHHHHHHH Q lcl|Aclame:pro 222 KNISNVELD-GVTGITKAVEFDINESSTEANYLNEKGITICLN--H--NGFRYWGSRTLATDTRWAFQQSVRTAQIIKET 296 (393) Q Consensus 222 ~spaN~~l~-gv~~~~~~~~~~~~~~~~~~~~ln~~gi~~~~~--~--~G~~~wG~rT~~~d~~~~~i~~rR~~~~i~~~ 296 (393) .+-..|.+. |+.. . .++..+++.|..+|.|+... + ..+.+|-.-+++++ |.+|-+-+-.+|++.. T Consensus 315 ~T~~fkq~~~Gi~a---~-----~lt~t~a~al~~~~~N~y~~~~~~~~~~~~~~~G~~sG~--~~wiD~~~~~~Wl~~~ 384 (501) T protein:vir:10 315 TVLAFRQFNAGVPA---T-----AHDLPTANALRSNNYTYIGAYANAANNYTIAYDGKLSGK--FLWVDTYLDQIYLNAE 384 (501) T ss_pred eeeeccccCCCcCc---c-----cCCHHHHHHHHhcCCeEEEEeccccceeeEEecCeeecc--ceeehhhhhHHHHHHH Confidence 122233332 2221 1 25788999999999999843 3 34888855566766 5567777766777776 Q ss_pred HHHHhHHhhc----ccCCHHHHHHHHHHHHHHHHHHhhccccc---------------------------ccceEEEec- Q lcl|Aclame:pro 297 IGAGLAWAVD----MPLTPLRVKTMLEAINNKLRSWASGDDPR---------------------------ILGARVWVA- 344 (393) Q Consensus 297 i~~~~~~~v~----epn~~~~~~~i~~~i~~~l~~l~~~g~~~---------------------------~~~~~v~~~- 344 (393) ++..+....- =|-|..=...|...++.-|+.-+.+|-.. ..|+-++.+ T Consensus 385 iq~~l~~ll~~~~kIPyt~~G~~~l~a~v~~~l~~av~nG~I~~Gv~l~~~q~~~i~~~~g~~~~~~~v~~~Gyy~~~~~ 464 (501) T protein:vir:10 385 LQRAEFEAMLAYNSLPYNEDGYTALYRAGVDVIDAAVTSGIIRAGVTLTNSQLQQIDAAAGVAGAGQLVQTRGWYFLIGD 464 (501) T ss_pred HHHHHHHHHHhcCCcccCHHHHHHHHHHHHHHHHHHHhCceeecCCCCCcccceeeccccCccccccceeccceeEeecc Confidence 6666654332 26677777777788877777777766321 113333332 Q ss_pred CCCCHHHhh-CCEEEEEEEEEecCcceeEEEEEEEcc Q lcl|Aclame:pro 345 EEITADIIK-SGKFVIKYDYHWIPSLESLGLEQRVND 380 (393) Q Consensus 345 ~~nt~~~i~-~G~~~~~v~~~p~~p~e~i~~~~~~~~ 380 (393) .+.++++.. .+...+.+.+.---.+++|++-..--- T Consensus 465 ~~~~~~~R~~R~~p~~~~~y~~~gaIh~v~i~s~~v~ 501 (501) T protein:vir:10 465 PANPGQARQNRTTPACTLWYSDGGSIQQLTIGSNAVI 501 (501) T ss_pred ccCChhhhhhccccceEEEEEeCCceeEEEeeeeecC Confidence 223444333 244555555555666666654333222 No 65 >protein:vir:107720 Length: 515 # NCBI annotation: gp17 # Family: family:all:396 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024865;genbank:gi:48697507;genbank:GeneID:2948336 Probab=53.64 E-value=0.52 Score=22.13 Aligned_cols=360 Identities=11% Similarity=0.039 Sum_probs=120.3 Q ss_pred CCCCc---ccC-C--CeEEEEcCCCc---ccccccccceeEEEEeeccccc-ccccccceEEeecchhhhhhcccccchh Q lcl|Aclame:pro 1 MSILD---TYL-H--GVEVVEVNAGG---VTISTAATSVIGVVCTGDQADA-ETFPLNTPVLITNPLNYLEKAGSTGTLR 70 (393) Q Consensus 1 m~m~~---~~~-~--GV~v~ev~~~~---~~i~~v~tav~g~vg~a~~~d~-~~~p~~~~vl~t~~~~~~~~~g~~~tl~ 70 (393) .|--- +|. . -.++....-.. .....++... +..+.+.... ...+++. ...++..+............ T Consensus 78 ~P~~L~igR~~~~a~~~~l~g~~~~~~~l~~~~~is~G~--ltitidG~~~~t~s~i~~-S~ats~~~vAs~i~tal~~~ 154 (515) T protein:vir:10 78 RPTSIQFARWQREAGPVAIYGGAKKAAALATLQAVTAGA--ISFLFGGATTVTVSGISF-SAATSLADVASELQTALRAN 154 (515) T ss_pred cccEEEEEeccCcccceEEEeccchhhhHHhhhccccee--EEEEEcceEEEEeecccc-ccccCHHHHHHHHHhhhccc Confidence 11000 000 0 00000000000 0011111010 0011110000 0001100 01122222221111110000 Q ss_pred -----hhhhhhhcccCceEEEEEecccccccccccchhcccccccccchhhhhhhhhhhhhccccccccccchHHHHHHH Q lcl|Aclame:pro 71 -----RTLNSIGSIVKTPTVIVRVAESDDSDTLTANIVGTQENGKFTGIKALLTAQSTVFVKPKLLCVPQHDNQAVATEL 145 (393) Q Consensus 71 -----~~~~~~~~~~~~~~~vv~~~~~~~~~~~~~~~~~~~~~~~~~gl~al~~~~~~~~~~~~~l~apg~s~~~v~~al 145 (393) ......++......++.....+..+........ .....+.+..++...... ..+...|........+| T Consensus 155 ~~~~~~~~tv~~d~~~~~F~v~s~~tG~~~~is~~~~t---~~~~~t~~a~~lglt~~~----~av~~~g~aaet~~~a~ 227 (515) T protein:vir:10 155 ADANLATCTVSYDPVGARFNFAGSPSDDTVQESISIVP---QSNPAIDVAQLLGWNSAQ----GASYIAASPVVSPVDTL 227 (515) T ss_pred cccccceeEEEEecCCCeEEEEEeecCCceeEEEEEec---CCCchhhHHHHhcccccc----ceEEecccccccHHHHH Confidence 001111222222222222222222111111000 000001111111111100 11111222222222222 Q ss_pred HHhhcccceE-EE-EecCC--CCcchhhh---hhhcccccceEEEecc---ceeEee------ccCCceEE------ech Q lcl|Aclame:pro 146 LSVAKKLNAF-AF-ISDNG--ATTKEQAY---TYRQNFSQREGMMIFG---DWKSYN------TDKKAYDT------DYA 203 (393) Q Consensus 146 ~~~a~~~~~~-~~-i~~~~--~~~~~~a~---~~~~~~~s~~~~~~~p---~~~~~~------~~~~~~~~------~p~ 203 (393) .++.+.-.+. .+ +.+.+ ..+..++. +|.+...-.+...... -...++ ...+.... ..+ T Consensus 228 ~a~~~~s~nWy~f~~a~~~~~~~~~a~~~a~a~~~e~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~ 307 (515) T protein:vir:10 228 IASVAGNNNFGSILFTKNGGTGITLSDAEAIALQNQSYNVAYKFQVGVDDTTYSSWQAALAAIGGVNMIYSPVALAAEYH 307 (515) T ss_pred HHHHhccCCeEEEEEeecCccccchhHHHHHHHHHhhcCceEEEEeccCccceechhhhhhhhhhcCceEEEEeccCcch Confidence 2222211111 11 11111 11111111 1111111000000000 000000 00000000 012 Q ss_pred hHHHHHHHHhhhccCC-ceecCCCceecceeeceeecccccCCCchhhhhhcccceEEE--EeC--CCEEEEecccC-CC Q lcl|Aclame:pro 204 VARACALQAYIDKTVG-WHKNISNVELDGVTGITKAVEFDINESSTEANYLNEKGITIC--LNH--NGFRYWGSRTL-AT 277 (393) Q Consensus 204 S~~~ag~~a~~D~~~G-~~~spaN~~l~gv~~~~~~~~~~~~~~~~~~~~ln~~gi~~~--~~~--~G~~~wG~rT~-~~ 277 (393) -...+|..+.+|..+- =...-..|.+.||.. + .+++.+.+.|..+|+|+. +.+ ..+.+|-.=++ ++ T Consensus 308 ~a~~~g~~asvnf~~~ng~iT~kfKq~~Gita---~-----~lt~t~a~al~~~~~N~Y~~~~~~~~~~~~~~~G~~~gG 379 (515) T protein:vir:10 308 DMQDGIIEAATDFTQQGGATGYMYVQFNNQTP---A-----VNDDTLSGILDDLNINYYGQTQVNGTNLSFYQDGVMMGG 379 (515) T ss_pred HHHHHHHHHhcCCCccchhheeccccCCCCcc---c-----cCCHHHHHHHHhcCCeEEEEEeccCceEEEEeCCeeeCC Confidence 3345666666664332 122334455544432 2 257889999999999998 333 45899855444 44 Q ss_pred CcccceeehhhHHHHHHHHHHHHhHHhhcc-----cCCHHHHHHHHHHH-HHHHHHHhhcccc----------------- Q lcl|Aclame:pro 278 DTRWAFQQSVRTAQIIKETIGAGLAWAVDM-----PLTPLRVKTMLEAI-NNKLRSWASGDDP----------------- 334 (393) Q Consensus 278 d~~~~~i~~rR~~~~i~~~i~~~~~~~v~e-----pn~~~~~~~i~~~i-~~~l~~l~~~g~~----------------- 334 (393) +..|++|-++|-.+|++..++..+.. ++. |-|..=...|+..+ +.-|+.-+.+|.. T Consensus 380 ~~~~~WiD~~~g~~WL~~~iq~~l~~-L~~s~~KIPytd~G~a~i~a~v~q~vl~~av~nG~I~~Gv~ls~~Q~~~i~~~ 458 (515) T protein:vir:10 380 PTDPRDSNVYANEQWLKSYAGASFMS-LQLAQGKIPANIEGRGLLLGKMTKDIIPAAKLNGTFSIGKTLTVDQQLFVTEL 458 (515) T ss_pred ccchhHHHHHhhHHHHHHHHHHHHHH-HHhcCCCCccChhhHHHHHHHHHHHHHHHHHhCCeeecCcccchhHHHHHHhh Confidence 44688999999999999999999876 443 33444444444433 2344443333311 Q ss_pred ----------cccceEEEec--CCCCHHHhhCCEEEEEEEEEecCcceeEEEEEEEc Q lcl|Aclame:pro 335 ----------RILGARVWVA--EEITADIIKSGKFVIKYDYHWIPSLESLGLEQRVN 379 (393) Q Consensus 335 ----------~~~~~~v~~~--~~nt~~~i~~G~~~~~v~~~p~~p~e~i~~~~~~~ 379 (393) ...|+-+... +..+..+...+++.+..-+.---.+++|+.....- T Consensus 459 ~g~d~~~~~~~~~Gyy~~~~~~~~~~~~~r~~~~~~~~~~y~~g~~i~~i~~~~~~v 515 (515) T protein:vir:10 459 TGDDTAWQKVQNLGYWYDVQISSFVDTGGTTKYQAVYSLVYSKDDLIRKVVGTHTLI 515 (515) T ss_pred hcCcccccchhhcceeEecCcCCCCCcccccccCceeEEEEEcCceEEEEEeeeecC Confidence 0112222221 22333445555555555555555666666555444 No 66 >protein:vir:96104 Length: 504 # NCBI annotation: hypothetical protein ORF029 # Family: family:all:396 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294446;genbank:gi:149408343;genbank:GeneID:5237223 Probab=49.28 E-value=0.64 Score=21.63 Aligned_cols=359 Identities=11% Similarity=0.042 Sum_probs=163.0 Q ss_pred CCCCcccCCCeEEEEcCCCcccccccccceeEEEEeecccccccccccceEEeecchhhhhhcccccchhhhhhhhhccc Q lcl|Aclame:pro 1 MSILDTYLHGVEVVEVNAGGVTISTAATSVIGVVCTGDQADAETFPLNTPVLITNPLNYLEKAGSTGTLRRTLNSIGSIV 80 (393) Q Consensus 1 m~m~~~~~~GV~v~ev~~~~~~i~~v~tav~g~vg~a~~~d~~~~p~~~~vl~t~~~~~~~~~g~~~tl~~~~~~~~~~~ 80 (393) |--.+++. -|.+.-. +...........-++++... +|.......++..+....||.....+.+-..+|.+. T Consensus 1 mip~s~iV-~V~~~v~---~~~~~~~~~~~~l~l~~~~~-----~~~~r~~~y~s~~~V~~~FG~~S~ey~aA~~yF~~~ 71 (504) T protein:vir:96 1 MISQSRYI-RIISGVG---AGAPVAGRKLILRVMTTNNV-----IPPGIVIEFDNANAVLSYFGAQSEEYQRAAAYFKFI 71 (504) T ss_pred CCCcccee-Eeeeccc---ccccccccccceeEeecccC-----CCccceEEecCHHHHHHhcCCChHHHHHHHHHhhcC Confidence 54344431 1211111 11222222333334443222 233333345566666777887777777777777653 Q ss_pred C------ceEEEEEeccccccccc-------------------ccchhcccc----------cccccchhhhhh----hh Q lcl|Aclame:pro 81 K------TPTVIVRVAESDDSDTL-------------------TANIVGTQE----------NGKFTGIKALLT----AQ 121 (393) Q Consensus 81 ~------~~~~vv~~~~~~~~~~~-------------------~~~~~~~~~----------~~~~~gl~al~~----~~ 121 (393) . ...++-+-...+....- ....+++.. .....+...+.. .. T Consensus 72 ~~~~~~P~~l~igR~~~~a~~~~l~g~~~~~~~~~~~~i~~G~lsitv~G~~~~~~~i~~S~~ts~~~vA~~i~~al~~~ 151 (504) T protein:vir:96 72 SKSVNSPSSISFARWVNTAIAPMVVGDNLPKTIADFAGFSAGVLTIMVGAAEKNITAIDTSAATSMDNVASIIQTEIRKN 151 (504) T ss_pred CCCCccccEEEEEeecCcCccceEEechhHHHHHHHhhhhceEEEEEEcceeeeecccccccccchHHHHHHHHhhhhcc Confidence 3 22333332211110000 000000000 000000000000 00 Q ss_pred -------hhh------------------------------------hhc-cccccccccchHHHHHHHHHhhcccce-EE Q lcl|Aclame:pro 122 -------STV------------------------------------FVK-PKLLCVPQHDNQAVATELLSVAKKLNA-FA 156 (393) Q Consensus 122 -------~~~------------------------------------~~~-~~~l~apg~s~~~v~~al~~~a~~~~~-~~ 156 (393) ... ++. +......|........+|.++.+.-.+ .. T Consensus 152 ~~~~~~~~tv~~d~~~~~f~its~~tg~~~~~~~~~a~~~~~~~~lgl~~~~~~~v~g~~aet~~~al~al~~~~~~Wy~ 231 (504) T protein:vir:96 152 TDPQLAQATVTWNPNTNQFTLVGATIGTGVLAVAKSADPQDMSTALGWSTSNVVNVAGQAADLPDAAVAKSTNVSNNFGS 231 (504) T ss_pred cccccccceEEEeccCCeEEEEeeccccceeEEEeeccccchhhhhhcccccceEEeecccccHHHHHHHHHhhcCCeEE Confidence 000 000 000111111111233344443333222 22 Q ss_pred EEecCCCCcch---hhhhhhcccccceEEEeccceeEeec-------cCCceEE----------echhHHHHHHHHhhhc Q lcl|Aclame:pro 157 FISDNGATTKE---QAYTYRQNFSQREGMMIFGDWKSYNT-------DKKAYDT----------DYAVARACALQAYIDK 216 (393) Q Consensus 157 ~i~~~~~~~~~---~a~~~~~~~~s~~~~~~~p~~~~~~~-------~~~~~~~----------~p~S~~~ag~~a~~D~ 216 (393) +.......+.+ +.-+|.+..+..+... .+....+. .....+. --++.+..|..+.+|. T Consensus 232 f~~a~~~~~dd~ilalA~w~ea~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~as~~f 309 (504) T protein:vir:96 232 FLFAGATLDNDQIKAVSAWNAAQNNQFIYT--VATSLANLGALFDLVKGNSGTALNVLSATASNDFVEQCPSEILAATNY 309 (504) T ss_pred EEEEeccCCHHHHHHHHHHHhhcCceEEEE--EeecccchhhHHHhhhhcceeEEEEeecCccchhHHHHHHHHHHhcCc Confidence 22221212222 2333444433333222 11110000 0001111 1234555677777774 Q ss_pred cC--CceecCCCceecceeeceeecccccCCCchhhhhhcccceEEEE--eCCC--EEEE-ecccCCCCcccceeehhhH Q lcl|Aclame:pro 217 TV--GWHKNISNVELDGVTGITKAVEFDINESSTEANYLNEKGITICL--NHNG--FRYW-GSRTLATDTRWAFQQSVRT 289 (393) Q Consensus 217 ~~--G~~~spaN~~l~gv~~~~~~~~~~~~~~~~~~~~ln~~gi~~~~--~~~G--~~~w-G~rT~~~d~~~~~i~~rR~ 289 (393) .+ | -.+-..|.+.||..- .++..+.+.|..+|+|++. .+.| +.+| .+.++++.-.|.+|.+-+- T Consensus 310 ~~~ng-~~T~~fk~l~GVta~--------~lt~t~~~aL~~~~~N~y~~~a~~~~~~~~~~~G~~~gG~~~~~wiDv~~~ 380 (504) T protein:vir:96 310 DEPGA-SQNYMYYQFPGRNIT--------VSDDTAANTVDKSRGNYIGVTQANGQQLAFYQRGILCGGPTDAVDMNVYAN 380 (504) T ss_pred Ccccc-cccccccccCCcCcc--------cCCHHHHHHHHhcCCeEEEEeecccceeeEEecCeeeCCccccchhhhhhh Confidence 33 3 123445666666532 2578899999999999983 3333 6666 5566665546888999999 Q ss_pred HHHHHHHHHHHhHHhhcc----cCCHHHHHHHHHHHHHHHHHHhhccccc---------------------------ccc Q lcl|Aclame:pro 290 AQIIKETIGAGLAWAVDM----PLTPLRVKTMLEAINNKLRSWASGDDPR---------------------------ILG 338 (393) Q Consensus 290 ~~~i~~~i~~~~~~~v~e----pn~~~~~~~i~~~i~~~l~~l~~~g~~~---------------------------~~~ 338 (393) .+||+..++..+....-. |-|+.=+..|+..++.-|+.-+.+|-.. .-| T Consensus 381 ~~WL~~~lq~~l~~l~~~~~kIPyt~~Gi~~l~a~i~~vl~~av~~G~I~~Gv~~~~~q~~~I~~~~~~d~~~~~~~~~G 460 (504) T protein:vir:96 381 EIWLKSAIAQALLDLFLNVNAVPASMVGEAMTLAVLQPVLDKATSNGTFTYGKDISAVQQQYITQITGDRRAWRQVQTLG 460 (504) T ss_pred HHHHHHHHHHHHHHHHhcCCCcccCHhhHHHHHHHHHHHHHHHHhcceeccCccCCccchheecccccccccccceeccc Confidence 999999999888764332 5566777777777777777777666321 123 Q ss_pred eEEEec--CCCCHHHhh-CCEEEEEEEEEecCcceeEEEEEEEc Q lcl|Aclame:pro 339 ARVWVA--EEITADIIK-SGKFVIKYDYHWIPSLESLGLEQRVN 379 (393) Q Consensus 339 ~~v~~~--~~nt~~~i~-~G~~~~~v~~~p~~p~e~i~~~~~~~ 379 (393) +-++.. ++-++++.. .+...+.+.+.---.+++|++....- T Consensus 461 Yyv~~~~~s~~s~~~r~~R~~~~~~~~y~~~gaI~~v~~~~~~v 504 (504) T protein:vir:96 461 YWINITFSSYTNSNTGLTEWKANYTLIYSKGDAIRFVEGSDVMI 504 (504) T ss_pred eEEEecChhccChhHhhhccccceEEEEEECCeEEEEEeccccC Confidence 445543 234444444 35556666666667777776665544 No 67 >protein:vir:108311 Length: 249 # NCBI annotation: hypothetical protein # Family: family:all:28027 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552279;genbank:gi:160700604;genbank:GeneID:5758827 Probab=39.99 E-value=0.99 Score=20.60 Aligned_cols=102 Identities=15% Similarity=0.181 Sum_probs=69.0 Q ss_pred ehhhHHHHHHHHHHHHhHHhhcccCCHHHHHHHHHHHHHHHHHHhhcccccccceEEEecCCCCHHHhhCCEEEEEEEE- Q lcl|Aclame:pro 285 QSVRTAQIIKETIGAGLAWAVDMPLTPLRVKTMLEAINNKLRSWASGDDPRILGARVWVAEEITADIIKSGKFVIKYDY- 363 (393) Q Consensus 285 ~~rR~~~~i~~~i~~~~~~~v~epn~~~~~~~i~~~i~~~l~~l~~~g~~~~~~~~v~~~~~nt~~~i~~G~~~~~v~~- 363 (393) -.|-.-+.|..++++..--.++|+-.....++....+|..|.+|...| ..|+...+.+. -+..|+....|++ T Consensus 1 ~sqt~~~II~~ALk~aGvla~Getp~aee~~DA~~~Ln~Ml~~W~~~r------l~V~~~~~~t~-vl~~G~~~YtVGi~ 73 (249) T protein:vir:10 1 MARTVGDIIRSSMRKIGVLAAGEPLPANEGDDALEVFAQMVDAWTNET------LLIPVVNVVTK-VLVENQPEYTIGIY 73 (249) T ss_pred CccCHHHHHHHHHHHccccccCCCCCHhHHHHHHHHHHHHHHHHHhCc------eeEEeeeeeee-eccCCcceEEeeec Confidence 334455889999999999999999999999999999999999988765 22333322221 1456777777772 Q ss_pred -------------EecCcc--eeEEEEEEEcchHHHHH--HHHHhcC Q lcl|Aclame:pro 364 -------------HWIPSL--ESLGLEQRVNDEYVVDL--VNTLKAL 393 (393) Q Consensus 364 -------------~p~~p~--e~i~~~~~~~~~~~~~~--~~~~~~~ 393 (393) --.+|. ++-.|+-..+.+|+.+. .+....+ T Consensus 74 ~~~~~~~~p~~~i~~~RP~~i~sA~~r~~~d~~~~~~~i~~EdY~rI 120 (249) T protein:vir:10 74 PEPVPDPLPSNHIETGRPERILSAFIRDRYDTDYIQEIIDVETYSRI 120 (249) T ss_pred cccccccCCCCceEeecchheeeeeeecccccchhhhhhchhhhhhc Confidence 224455 66667777777777766 2333333 No 68 >protein:vir:94073 Length: 494 # NCBI annotation: hypothetical protein # Family: family:all:396 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453619;genbank:gi:84662655;genbank:GeneID:5142583 Probab=29.83 E-value=1.6 Score=19.43 Aligned_cols=356 Identities=12% Similarity=0.021 Sum_probs=140.6 Q ss_pred CC-CCcccCCCeEEEEcCCCc--ccccccccceeEEEEeecccccccccccceEEeecchhhhhhcccccchhhhhhhhh Q lcl|Aclame:pro 1 MS-ILDTYLHGVEVVEVNAGG--VTISTAATSVIGVVCTGDQADAETFPLNTPVLITNPLNYLEKAGSTGTLRRTLNSIG 77 (393) Q Consensus 1 m~-m~~~~~~GV~v~ev~~~~--~~i~~v~tav~g~vg~a~~~d~~~~p~~~~vl~t~~~~~~~~~g~~~tl~~~~~~~~ 77 (393) || .+-+ ++..|..+. .++...+-..+.|.+ ....|...-...++..+....||.....+.+-..+| T Consensus 1 m~~ip~s-----~iV~V~~~v~~~~~~~~~f~~~l~~~------~~~~~~~r~~~y~s~~~V~~~FG~~S~ey~aA~~yF 69 (494) T protein:vir:94 1 MPNIPIS-----QIVSINPQVVSAGGTQGTLDGLLLTQ------ATGFPVTQPQVYFSAADVGTAFGLTSDEYNAALVYF 69 (494) T ss_pred CCCCCcc-----cEEEeeeeccccCCcccccceeEeec------CccCCccceeeecCHHHHHHhcCCChHHHHHHHHHh Confidence 55 1111 122222221 222222232222222 111232222334566667777887766666666666 Q ss_pred c----ccC-c-eEEEEEecccc--------ccccccc--------------------chh--cccc-cccccchhhhhh- Q lcl|Aclame:pro 78 S----IVK-T-PTVIVRVAESD--------DSDTLTA--------------------NIV--GTQE-NGKFTGIKALLT- 119 (393) Q Consensus 78 ~----~~~-~-~~~vv~~~~~~--------~~~~~~~--------------------~~~--~~~~-~~~~~gl~al~~- 119 (393) . +.. + ..++-+-...+ ....+.. .+. ..+. .+.-+.+++... T Consensus 70 s~~~~q~p~P~~l~igR~~~~a~~~~l~g~~~~~tl~~~~~~~g~l~iti~g~~~~~~i~lS~~ts~~~vA~~i~~ai~~ 149 (494) T protein:vir:94 70 AGILGGGQQPASLTIGRYASAATSAAVFGAPLTLSLAQLQTLSGTLIVTTDTQRTSAAINLSGATSFANAASLMTSGFTT 149 (494) T ss_pred hhccCCCccccEEEEEeecCccccceeeccchhhhHHhhhhcceEEEEEEcceEEEeeecccccCChhhHHHHHhhhhcc Confidence 5 221 1 12222211100 0000000 000 0000 000000000000 Q ss_pred hhh----------------hhhhcccc--------------------ccccccchHHHHHHHHHhhcccceE--EEEecC Q lcl|Aclame:pro 120 AQS----------------TVFVKPKL--------------------LCVPQHDNQAVATELLSVAKKLNAF--AFISDN 161 (393) Q Consensus 120 ~~~----------------~~~~~~~~--------------------l~apg~s~~~v~~al~~~a~~~~~~--~~i~~~ 161 (393) ... ..+....+ +...|........+|.++.+.-+.. +.+.+. T Consensus 150 a~~~v~~d~~~~~f~v~s~ttG~~s~is~~t~~~a~~l~lt~~~~a~v~~~g~~aet~~~a~~a~~~~~~~Wy~f~~~~~ 229 (494) T protein:vir:94 150 PNFAITYDAQRRRFVLSTTATGTTASVSAVTGTLADGVGLSTASGAYVEGSGLAADTAASALDRLAASSSTWAIFTTAWA 229 (494) T ss_pred ccceEEEcccCcEEEEEEccCCceeEEEEeccchhhhhhhhccccceEeecCcccccHHHHHHHHHhccCceEEEEEecC Confidence 000 00000000 1111111112333444433322222 222222 Q ss_pred CC-CcchhhhhhhcccccceEEEec---cceeEeeccC---------C--ceE----EechhHHHHHHHHhhhccCCcee Q lcl|Aclame:pro 162 GA-TTKEQAYTYRQNFSQREGMMIF---GDWKSYNTDK---------K--AYD----TDYAVARACALQAYIDKTVGWHK 222 (393) Q Consensus 162 ~~-~~~~~a~~~~~~~~s~~~~~~~---p~~~~~~~~~---------~--~~~----~~p~S~~~ag~~a~~D~~~G~~~ 222 (393) +. ....+.-+|.+..+-.+....+ +........+ + +.. ...|.+.+.|..+.+|-+. T Consensus 230 ~~~~~ilalA~wiea~~~~~~~~~~~~d~~~~~~~~~~~i~~~l~~~~y~~t~~~y~~~~~~aa~~g~~aa~~~~~---- 305 (494) T protein:vir:94 230 ASLSDRTALAQWTSDQVFRRIYAAWDQDAAGLSVNNVSSFGNIVKTTPFSNTIPVYGLLANAMIVLAWGASTNLQI---- 305 (494) T ss_pred CCHHHHHHHHHHHhhcCccEEEEEecCCcceeecccchhHHHHHHhhcCCceEEEcCCCChHHHHHHHHHhccccc---- Confidence 22 1122233344433322322211 1111100000 0 111 1235566777777776432 Q ss_pred cCCCceecc---eeeceeecccccCCCchhhhhhcccceEEEEeC--CC--EEEEecccCCCCcccceeehhhHHHHHHH Q lcl|Aclame:pro 223 NISNVELDG---VTGITKAVEFDINESSTEANYLNEKGITICLNH--NG--FRYWGSRTLATDTRWAFQQSVRTAQIIKE 295 (393) Q Consensus 223 spaN~~l~g---v~~~~~~~~~~~~~~~~~~~~ln~~gi~~~~~~--~G--~~~wG~rT~~~d~~~~~i~~rR~~~~i~~ 295 (393) .+.+..+.. ..++.. ..++..+++.|..+|+|++... .+ +.+|.+-+++++-.|- -+-+=.+|++. T Consensus 306 ~~g~~T~~~k~q~~gi~~-----~~l~~t~a~al~~~~~N~y~~~~~~~~~~~~~~gg~~sG~~~~i--d~~~~~~WL~~ 378 (494) T protein:vir:94 306 AEGRTTLALRSPVSSAGV-----RVDNLANANALLSNGYTYLGKYASATNTYTVTYNGAIGGQFLWA--DTALGWIALRR 378 (494) T ss_pred cCcceeEEeeccCCCCCC-----ccCCHHHHHHHHhcCCeEEEEecccCceEEEecCceecccccee--eeeccHHHHHH Confidence 344444331 111111 1246779999999999998543 22 7889777887764332 22222335555 Q ss_pred HHHHHhHHhh---c-ccCCHHHHHHHHHHHHHHHHHHhhccccc--------------------------ccceEEEecC Q lcl|Aclame:pro 296 TIGAGLAWAV---D-MPLTPLRVKTMLEAINNKLRSWASGDDPR--------------------------ILGARVWVAE 345 (393) Q Consensus 296 ~i~~~~~~~v---~-epn~~~~~~~i~~~i~~~l~~l~~~g~~~--------------------------~~~~~v~~~~ 345 (393) .++..+.... . =|-|..=...|+..++.-|+.-+.+|... ..|+-++... T Consensus 379 ~iq~~l~~ll~~~~KIPytd~G~~~l~a~i~~~l~~av~nG~I~~Gv~~~~~q~~~i~~~~G~~~~~~~~~kGyy~~~~~ 458 (494) T protein:vir:94 379 NLQQALFETLLAYRSLPYNADGYNALYQGAQDVVSQFVAAGVIRAGVALSASQRAQIDQAAGVPISGDVVDKGWYLQVID 458 (494) T ss_pred HHHHHHHHHHHhCCCcccChhhHHHHHHHHHHHHHHHHhCceeecccccCcchhhhhhhhhcCccccceeccceeeeccC Confidence 5555544322 2 37777777788888888887777766321 0122222222 Q ss_pred CCCHH---HhhCCEEEEEEEEEecCcceeEEEEEEEcc Q lcl|Aclame:pro 346 EITAD---IIKSGKFVIKYDYHWIPSLESLGLEQRVND 380 (393) Q Consensus 346 ~nt~~---~i~~G~~~~~v~~~p~~p~e~i~~~~~~~~ 380 (393) ..+++ +....++.+.+. ---.+++|++....-- T Consensus 459 ~~s~~~ra~R~~~~~~~~y~--~~GAIh~v~i~~~~v~ 494 (494) T protein:vir:94 459 PITTTVRTDRGSPTVNFWYC--DGGSIQRVVVSATTVI 494 (494) T ss_pred CCChhhhhccccCCceEEEE--ecCcEEEEEEeeEEeC Confidence 23333 333333433333 3566777666655443 No 69 >protein:vir:78611 Length: 501 # NCBI annotation: BcepNY3gp03 # Family: family:all:396 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294840;genbank:gi:149882903;genbank:GeneID:5291082 Probab=20.91 E-value=2.7 Score=18.23 Aligned_cols=364 Identities=10% Similarity=0.000 Sum_probs=152.9 Q ss_pred CCCCcccCCCeEEEEcCCCcccccccccceeEEEEeecccccccccccceEEeecchhhhhhcccccchhhhhhhhhc-- Q lcl|Aclame:pro 1 MSILDTYLHGVEVVEVNAGGVTISTAATSVIGVVCTGDQADAETFPLNTPVLITNPLNYLEKAGSTGTLRRTLNSIGS-- 78 (393) Q Consensus 1 m~m~~~~~~GV~v~ev~~~~~~i~~v~tav~g~vg~a~~~d~~~~p~~~~vl~t~~~~~~~~~g~~~tl~~~~~~~~~-- 78 (393) || -+=+|=-.+..|..+..+.........+++-+.... .|.+.....++..+....||.....+..-..+|. T Consensus 1 m~--~~~ip~s~iV~V~~~v~~~~~~~~~~~~lll~~~~~----~~~~r~~~y~s~~~V~~~FG~~S~ey~aA~~yFs~~ 74 (501) T protein:vir:78 1 MP--TTTIPIDQIVQMLPGVIGAGGAPGRLTGLVLTQDTS----IQPGQLADFFQKTDVENWFGGLSNEAVIADAYFPGI 74 (501) T ss_pred CC--cCccccceEEEEeeecccCCCcceeeeeEEEecCCC----CCccceeeecCHHHHHHhcCCChHHHHHHHHHhhcC Confidence 55 322233333444333333333333333443332211 1334333455667777788877766666666664 Q ss_pred --ccCc--eEEEEEecccccccc---------cccc----------hhcc------cc----c---ccccchhhhhhhhh Q lcl|Aclame:pro 79 --IVKT--PTVIVRVAESDDSDT---------LTAN----------IVGT------QE----N---GKFTGIKALLTAQS 122 (393) Q Consensus 79 --~~~~--~~~vv~~~~~~~~~~---------~~~~----------~~~~------~~----~---~~~~gl~al~~~~~ 122 (393) +... ..++-+-.+.+.... +... .+++ .+ + +..+.+.+...... T Consensus 75 ~~q~~~P~~l~igR~~~~a~~~~l~g~~l~~~~la~~~~~~G~l~iti~g~~~~~~i~~S~~ts~~~vA~~i~~al~a~~ 154 (501) T protein:vir:78 75 VNGGQLPYDLKFARYVAADAPASVYGIPLTGVTLTQLQGYSGTLTVTTAAQHVSSNISLAAATSFANAATLIEAAFTSPD 154 (501) T ss_pred CCCCcccceEEEEeecccCcceeEeccceeccchhhhceeeeEEEEEeccceeeeccccccccCHHHHHHHHHhhhcCcc Confidence 2211 112211111000000 0000 0000 00 0 00001111111000 Q ss_pred h-----------------hhhc----------------------cccccccccchHHHHHHHHHhhcccceE--EEEecC Q lcl|Aclame:pro 123 T-----------------VFVK----------------------PKLLCVPQHDNQAVATELLSVAKKLNAF--AFISDN 161 (393) Q Consensus 123 ~-----------------~~~~----------------------~~~l~apg~s~~~v~~al~~~a~~~~~~--~~i~~~ 161 (393) . .+.. +..+.+.|........+|.++.+.-.+. +.+.+. T Consensus 155 ~tv~~ds~~~~f~its~t~G~~~~i~~~t~~~~~a~~l~Lt~~~~a~v~~~g~~aet~~~a~~a~~~~~~~Wy~f~~a~~ 234 (501) T protein:vir:78 155 FVVSYDALRNRFVVNTNATGTAAAISAVTGTNNLADELGLSAAAGASLQAAGVAADTPASAMNRAVGLSRNWATFTTAWT 234 (501) T ss_pred eEEEEccccceEEEEeeecCCceeEEEEecccchhhhhcccccCceeeEeccccccCHHHHHHHHHhccCceEEEEEecC Confidence 0 0000 0111122222222334444443333222 222222 Q ss_pred CC-CcchhhhhhhcccccceEEEec---cceeEeec---------cCCceEEec------hhHHHHHHHHhhhccCCc-e Q lcl|Aclame:pro 162 GA-TTKEQAYTYRQNFSQREGMMIF---GDWKSYNT---------DKKAYDTDY------AVARACALQAYIDKTVGW-H 221 (393) Q Consensus 162 ~~-~~~~~a~~~~~~~~s~~~~~~~---p~~~~~~~---------~~~~~~~~p------~S~~~ag~~a~~D~~~G~-~ 221 (393) +. ....+.-+|.+..+.++....+ +....... ..+-.+.+| +.+.+.|..+.+|..+-. - T Consensus 235 ~~~~~~lalA~wiea~~~~f~~~~~~~~~~~~~~~~~~~i~~~l~a~~y~~t~~~y~~~~~~aa~~g~~as~nf~~~~g~ 314 (501) T protein:vir:78 235 AVIADRLALASWNSGQAYKYMYVAPDLEPASIVTNNSASFGAQVFAAPYQGTLPLYGDQATAGAVMGYAASINFQLRNGR 314 (501) T ss_pred CCHHHHHHHHHHHHhcCceEEEEEecCCcceeecccchhHHHHHhhcCCCceEEEcCCcchHHHHHHHHHhcCcccCcce Confidence 22 2222333344433333322211 11111000 001111222 355667777777754321 1 Q ss_pred ecCCCcee-cceeeceeecccccCCCchhhhhhcccceEEE--EeC--CCEEEEecccCCCCcccceeehhhHHHHHHHH Q lcl|Aclame:pro 222 KNISNVEL-DGVTGITKAVEFDINESSTEANYLNEKGITIC--LNH--NGFRYWGSRTLATDTRWAFQQSVRTAQIIKET 296 (393) Q Consensus 222 ~spaN~~l-~gv~~~~~~~~~~~~~~~~~~~~ln~~gi~~~--~~~--~G~~~wG~rT~~~d~~~~~i~~rR~~~~i~~~ 296 (393) .+-..|.+ .|+.. . .++..+++.|..+|.|++ +.+ ..+.+|-.-+++++ |.+|.+-+-.+|++.. T Consensus 315 ~T~~fkq~~~Gv~a---~-----~l~~t~a~al~~~~~N~y~~~~~~~~~~~~~~~G~~sG~--~~wiD~~~~~~Wl~~~ 384 (501) T protein:vir:78 315 TVLAFRQFNAGVPA---T-----AHDLGTANALRSNNYTYIGAYANAANNYTIAYDGKLSGK--FLWVDTYLDQIYLNAE 384 (501) T ss_pred eeeeccccCCCcCc---c-----cCCHHHHHHHHhcCCeEEEEEecccceeeEEEcCeeecc--ceeehhhhhHHHHHHH Confidence 12223332 22221 1 246789999999999998 333 44889855566766 5556666766777777 Q ss_pred HHHHhHHhhc----ccCCHHHHHHHHHHHHHHHHHHhhccccc---------------------------ccceEEEec- Q lcl|Aclame:pro 297 IGAGLAWAVD----MPLTPLRVKTMLEAINNKLRSWASGDDPR---------------------------ILGARVWVA- 344 (393) Q Consensus 297 i~~~~~~~v~----epn~~~~~~~i~~~i~~~l~~l~~~g~~~---------------------------~~~~~v~~~- 344 (393) ++..+....- =|-|..=...|...++.-|+.-+.+|-.. ..|+-++.+ T Consensus 385 iq~~l~~ll~~~~kIPyt~~G~~~l~a~v~~~l~~av~nG~I~~Gv~l~~~q~~~I~~~~g~~~~~~~~~~~Gyy~~~~~ 464 (501) T protein:vir:78 385 LQRAEFEAMLAYNSLPYNEDGYTALYRAGVDVIDAAVTSGIIRAGVTLTNSQLQQIDAAAGVAGAGQLVQMRGWYFLIGD 464 (501) T ss_pred HHHHHHHHHHhCCCcccCHHHHHHHHHHHHHHHHHHHhCceeecCCCCCCccceeeccccCccccccceeccceEEeecc Confidence 7666654332 26677777777777777777777766321 113333333 Q ss_pred CCCCHHHhh-CCEEEEEEEEEecCcceeEEEEEEEcc Q lcl|Aclame:pro 345 EEITADIIK-SGKFVIKYDYHWIPSLESLGLEQRVND 380 (393) Q Consensus 345 ~~nt~~~i~-~G~~~~~v~~~p~~p~e~i~~~~~~~~ 380 (393) .+.++++.. .+...+.+.+.---.+++|++-..--- T Consensus 465 ~~~~~~~R~~R~~p~~~~~y~~~gaIh~v~i~s~~v~ 501 (501) T protein:vir:78 465 PANPGQARQNRTTPTCTLWYSDGGSIQELTIGSNAVI 501 (501) T ss_pred ccCChhhhhhcccCcEEEEEEeCCceeEEEeeeeecC Confidence 223333322 244555555555566666654333222 Done!