Query         lcl|NC_013693.1_cdsid_YP_003358652.1 [gene=orf00195] [protein=Gp18 tail sheath protein] [protein_id=YP_003358652.1] [location=complement(108459..110354)]
Match_columns 631
No_of_seqs    225 out of 807
Neff          10.6
Searched_HMMs 1612
Date          Thu Nov 7 14:34:55 2013
Command       /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_165 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_165_vs_rec_db.hhr

 No Hit                             Prob  E-value  P-value  Score    SS Cols Query HMM  Template HMM
  1 protein:vir:104477 Length: 749 100.0   3E-143   2E-146  801.8  49.3  618     1-623  1-749 (749)
  2 protein:vir:103456 Length: 659 100.0   3E-142   2E-145  796.5  49.9  610     5-626  1-659 (659)
  3 protein:vir:106984 Length: 743 100.0   3E-141   2E-144  790.8  47.8  616     1-624  1-743 (743)
  4 protein:vir:7206 Length: 659 # 100.0   1E-140   7E-144  788.0  50.1  614     5-625  1-659 (659)
  5 protein:vir:104858 Length: 729 100.0   5E-141   3E-144  789.9  46.6  618     1-627  1-729 (729)
  6 protein:vir:98263 Length: 664  100.0   3E-140   2E-143  785.2  48.9  605     1-627  1-664 (664)
  7 protein:vir:6894 Length: 660 # 100.0   1E-139   7E-143  782.4  47.6  617     5-629  1-660 (660)
  8 protein:vir:80984 Length: 666  100.0   8E-140   5E-143  783.2  46.6  601     5-630  1-666 (666)
  9 protein:vir:101187 Length: 663 100.0   3E-139   2E-142  779.8  47.3  611     5-630  1-663 (663)
 10 protein:vir:6594 Length: 666 # 100.0   2E-138   1E-141  775.2  48.4  601     5-630  1-666 (666)
 11 protein:vir:106427 Length: 679 100.0   2E-138   1E-141  775.2  47.2  616     5-629  1-679 (679)
 12 protein:vir:101804 Length: 663 100.0   5E-138   3E-141  773.3  47.2  604     5-630  1-663 (663)
 13 protein:vir:108052 Length: 660 100.0   2E-137   1E-140  770.0  48.7  598     5-628  1-660 (660)
 14 protein:vir:100539 Length: 663 100.0   2E-137   1E-140  770.0  48.0  604     5-630  1-663 (663)
 15 protein:vir:5663 Length: 671 # 100.0   8E-134   5E-137  750.3  49.1  608     5-625  1-671 (671)
 16 protein:vir:79092 Length: 477  100.0   8E-110   5E-113  618.8  40.1  462     1-626  1-477 (477)
 17 protein:vir:107865 Length: 477 100.0   4E-109   3E-112  614.7  39.3  462     1-626  1-477 (477)
 18 protein:vir:98824 Length: 774  100.0   2E-105   1E-108  594.1  36.6  487     1-620  277-774 (774)
 19 protein:vir:103168 Length: 641 100.0  1.2E-96   8E-100  546.4  31.2  505     1-515  1-641 (641)
 20 protein:vir:6079 Length: 396 # 100.0  3.8E-95  2.4E-98  538.2  35.0  382     1-624  1-396 (396)
 21 protein:vir:79181 Length: 390  100.0  2.5E-95  1.6E-98  539.2  33.7  377     1-624  1-390 (390)
 22 protein:vir:79141 Length: 391  100.0    3E-95  1.9E-98  538.8  33.8  374     1-624  1-391 (391)
 23 protein:vir:98553 Length: 395  100.0  1.4E-94  8.5E-98  535.2  36.0  381     1-623  1-395 (395)
 24 protein:vir:1845 Length: 392 # 100.0  1.7E-94    1E-97  534.7  35.3  379     1-624  1-392 (392)
 25 protein:vir:100323 Length: 393 100.0  1.3E-94  8.4E-98  535.2  34.8  379     1-630  1-393 (393)
 26 protein:vir:5711 Length: 396 # 100.0  2.4E-94  1.5E-97  533.8  35.1  382     1-624  1-396 (396)
 27 protein:vir:2035 Length: 396 # 100.0  2.1E-94  1.3E-97  534.1  33.3  382     1-624  1-396 (396)
 28 protein:vir:1172 Length: 391 # 100.0  1.6E-94  9.7E-98  534.9  32.5  378     1-624  1-391 (391)
 29 protein:vir:103993 Length: 390 100.0  3.4E-94  2.1E-97  533.0  32.8  377     1-624  1-390 (390)
 30 protein:vir:78206 Length: 390  100.0  3.4E-94  2.1E-97  533.0  32.8  377     1-624  1-390 (390)
 31 protein:vir:96740 Length: 388  100.0  1.7E-92  1.1E-95  523.6  35.1  371     1-622  1-388 (388)
 32 protein:vir:10336 Length: 386  100.0  4.6E-92  2.9E-95  521.3  35.4  377     1-622  1-386 (386)
 33 protein:vir:5833 Length: 742 # 100.0  5.2E-77  3.2E-80  438.8  33.6  526     1-619  198-742 (742)
 34 protein:vir:63742 Length: 562  100.0    3E-71  1.9E-74  407.2  34.0  538     1-624  1-562 (562)
 35 protein:vir:102819 Length: 648 100.0  1.2E-69  7.2E-73  398.5  38.7  556     1-617  1-648 (648)
 36 protein:vir:80488 Length: 562  100.0  3.6E-69  2.2E-72  395.8  36.2  534     1-624  1-562 (562)
 37 protein:vir:95741 Length: 587  100.0  4.6E-67  2.9E-70  384.3  35.9  559     1-624  1-587 (587)
 38 protein:vir:80779 Length: 569  100.0  4.2E-67  2.6E-70  384.5  32.5  545     1-624  1-569 (569)
 39 protein:vir:99306 Length: 587  100.0  2.1E-65  1.3E-68  375.1  36.7  553     1-624  1-587 (587)
 40 protein:vir:96586 Length: 587  100.0  1.5E-62  9.6E-66  359.5  35.4  550     1-624  1-587 (587)
 41 protein:vir:79798 Length: 717  100.0  1.4E-60  8.7E-64  348.7  40.8  589     1-613  1-717 (717)
 42 protein:vir:100829 Length: 607 100.0  6.9E-57  4.3E-60  328.5  33.8  559     1-624  1-607 (607)
 43 protein:vir:102957 Length: 437 100.0  1.4E-53  8.8E-57  310.3  33.0  421     1-612  1-437 (437)
 44 protein:vir:101326 Length: 529 100.0  5.5E-46  3.4E-49  268.7  31.0  482     4-613  1-529 (529)
 45 protein:vir:105470 Length: 451 100.0  8.2E-43  5.1E-46  251.3  31.4  428     1-612  1-451 (451)
 46 protein:vir:7653 Length: 581 # 100.0  4.4E-39  2.7E-42  230.9  30.1  512    59-628  1-581 (581)
 47 protein:vir:107310 Length: 581 100.0  1.4E-38  8.7E-42  228.1  30.5  517    59-628  1-581 (581)
 48 protein:vir:78986 Length: 436  100.0  7.5E-32  4.7E-35  191.2  30.1  411     1-612  1-436 (436)
 49 protein:vir:102359 Length: 356  99.4  1.5E-12  9.4E-16   85.4  24.7  321   225-611  1-356 (356)
 50 protein:vir:489 Length: 498 #   99.2  2.5E-10  1.5E-13   73.2  27.4  453     1-616  1-498 (498)
 51 protein:vir:4517 Length: 498 #  99.1  1.2E-09  7.6E-13   69.5  27.8  447     1-616  1-498 (498)
 52 protein:vir:4463 Length: 498 #  99.1  1.5E-09  9.6E-13   68.9  26.9  454     1-616  1-498 (498)
 53 protein:vir:3751 Length: 376 #  99.0  5.3E-09  3.3E-12   65.9  27.3  340   238-620  1-376 (376)
 54 protein:vir:1996 Length: 495 #  99.0  3.3E-09  2.1E-12   67.1  26.1  451     1-613  1-495 (495)
 55 protein:vir:95263 Length: 450   99.0  8.2E-09  5.1E-12   64.9  29.8  436     1-614  1-450 (450)
 56 protein:vir:3788 Length: 376 #  98.9    9E-09  5.6E-12   64.7  26.2  343   238-617  1-376 (376)
 57 protein:vir:276 Length: 369 #   98.9  2.4E-08  1.5E-11   62.3  27.6  339   225-621  1-369 (369)
 58 protein:vir:78782 Length: 370   98.8  1.2E-08  7.4E-12   64.0  21.3  338   249-624  1-370 (370)
 59 protein:vir:80052 Length: 331   98.6  2.3E-07  1.4E-10   57.0  27.4  316   235-613  1-331 (331)
 60 protein:vir:5260 Length: 502 #  98.2  2.1E-06  1.3E-09   51.7  35.7  459     1-613  1-502 (502)
 61 protein:vir:3165 Length: 426 #  95.4   0.0022  1.4E-06   35.1  19.9  393   157-613  1-426 (426)
 62 protein:vir:96104 Length: 504   94.8   0.0036  2.2E-06   34.0  33.3  448     1-612  1-504 (504)
 63 protein:vir:99586 Length: 507   94.0   0.0056  3.5E-06   32.9  29.7  456     1-612  1-507 (507)
 64 protein:vir:101576 Length: 501  93.1   0.0087  5.4E-06   31.9  33.4  451     1-613  1-501 (501)
 65 protein:vir:3636 Length: 501 #  92.4    0.011  7.1E-06   31.2  35.0  450     1-613  1-501 (501)
 66 protein:vir:106730 Length: 501  90.9    0.019  1.2E-05   30.1  33.7  454     1-613  1-501 (501)
 67 protein:vir:78611 Length: 501   90.8    0.019  1.2E-05   30.0  34.9  454     1-613  1-501 (501)
 68 protein:vir:107720 Length: 515  81.1    0.089  5.5E-05   26.3  25.9  409   135-612  1-515 (515)
 69 protein:vir:94073 Length: 494   75.5     0.15    9E-05   25.2  32.6  444     1-613  1-494 (494)

No 1 >protein:vir:104477 Length: 749 # NCBI annotation: gp18 # Family: family:all:661 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214663;genbank:gi:61806304;genbank:GeneID:3294532 Probab=100.00 E-value=3.2e-143 Score=801.77 Aligned_cols=618 Identities=31% Similarity=0.457 Sum_probs=423.9 Q ss_pred CCCcchhcCCceEEEEecCCCceecccCCceEEEeeeccCcCCCCeEEecCHHHHHHHcCCCCccchhHHHHHHHHHhCC Q lcl|NC_013693. 1 MATQSFSVAPSVQWTERDATLQTSPSVVVQGATVGKFQWGEAELPVLVTGGETGLVKKFFKPNDATATDFLVIADFLSYS 80 (631) Q Consensus 1 m~~~~~ylsPGVyveEv~~~~~~~gv~tsv~afvG~~~~Gp~~~p~~i~s~~~e~~~~fG~~~~~~~~~~av~~fF~ngG 80 (631) |++ +||||||||||+|++.++.+|+||++||||+|+|||+++|++|+|| .||+++||++++.+|++|+|++||+||| T Consensus 1 M~~--~~~~PgVyv~e~~~~~~~~~~t~~~~fvG~~~~Gp~~~p~~v~s~-~~~~~~fG~~~~~~~~~~~v~~~F~ngg 77 (749) T protein:vir:10 1 MAT--NQSSPGVVIQERDLTTVSTIPTANVGVIAAPFTKGPVEEVIEITSE-RQLAEKFGEPNESNYEYWFSAAQFLSYG 77 (749) T ss_pred CCc--cccCCeeEEEEecCCcccccccCceeEEEeccCCCCCccCEEcCCH-HHHHHHcCCccCCcccHHHHHHHHhhcC Confidence 555 6999999999999987766999999999999999999999999887 7999999999999999999999999999 Q ss_pred ceEEEEEecccCCCcccccccchhhhccccccc--cccccceeeehhhhhhhhhhchhhhhccCcccceeeccceeeee- Q lcl|NC_013693. 81 SVAWVTRVVGPAARNAVTKGQTAILIRNKLDFE--TASPSASITWTGRYAGSLGNDVAINVCDAAGFPTWEFRNNFAYA- 157 (631) Q Consensus 81 ~~~~vvRv~~~~a~~a~~~~~~~~~~~~~~~~~--~~~~~~~l~~~a~~~G~~gn~l~v~v~~~~~~~~~~~~~~~~~~- 157 (631) ++||||||.+..++++.... 
....++....+. ..+....+++++++||.|||.+++.+.+................ T Consensus 78 ~~~~vvRv~~~~~~~a~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~a~~pG~~gn~l~v~v~~~~~~~~~~~~~~~~~~~ 156 (749) T protein:vir:10 78 GLLKTIRVNSSSLKNAVDTG-TAPLVKNLQDYETSIEDASNNFSWVARTPGDTGNSIGIFVTDAGADQVVVVPAPGSGNE 156 (749) T ss_pred CeEEEEEccCcccccccccc-ccccccccccccccccccccceEEEeccCCCcCCceEEEEEcCCCceeeeeecCCccce Confidence 99999999988877776443 233343333322 22445668899999999999999999887765443321100000 Q ss_pred --ccc--------ccc-eEEeeeeeeeeecccccccceee--e---------eecccccccceeEee--ccccccc---- Q lcl|NC_013693. 158 --PQA--------GEY-HIVIVDKVGRITDSSGAVGQVDR--I---------SVSGTATGAGSISVA--GEDVAYT---- 209 (631) Q Consensus 158 --~~~--------g~~-~~~~~~~~~~v~~~~~~~~~~~~--~---------~~~~~~~~~~~~~~~--~~~~~~~---- 209 (631) ... +.. .....+................. . ............... ....... T Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~a~~~~~~~~~~~~~~~~~~s~~~~~~~a~~ 236 (749) T protein:vir:10 157 HEFVADAAVSAASGAAGKVFKYSIILTIDDVVGTFAPGSATTITIGGSAESVNVLAYDATNKKLEIGLPSGGVTGILADN 236 (749) T ss_pred eeEEeeecccccccccccccccceeeeeccccceeecccceeeeccCcccccccccccCCcceEEEeeecccccceeeee Confidence 000 000 00000000000000000000000 0 000000000000000 0000000 Q ss_pred -----cccccccccc---------cccccccccccccccccccccccccc--------------ccccccccccceeecc Q lcl|NC_013693. 210 -----DTDTPATLAT---------KIGTALTALTDVYSSVVVKSNTVTVT--------------HKAIGPQTVTAIVPDA 261 (631) Q Consensus 210 -----~~~~~~~~~~---------~~~~~~~~~~~~~~~~~~~~~~~~v~--------------~~~~~~~~~~~~~~~~ 261 (631) .......... ......................+... .....+.+........ 
T Consensus 237 ~~v~~~~~~~~~~~~i~~~~~~~~~~~~~~~a~~~~~~~~~g~~~~it~v~~~~~~~~~~t~~~~~~~a~~~gt~~~~~~ 316 (749) T protein:vir:10 237 QVITQGTNTAKINVTIERKLLVALNKSSIEFAASDVVQDTNSTNITITSVRDEYTEREYLPGVKWINVAPRPGTSLYANG 316 (749) T ss_pred ecccccccccccccccccchhhhhccccceeeccccccCCccceeEEEeeeccccccccccceeeccccccccceeeeec Confidence 0000000000 00000000000000000000000000 0011111111111111 Q ss_pred cccccceeeee-c-------ccccccceeeeeeeeecccccccchhhhhhhhhccccceeeeeccccc------------ Q lcl|NC_013693. 262 NGLTATAVTTT-V-------GASGSIIEKYELMQATQGSKKSDGSNAYFKDVINDTSNWVYTFATTLA------------ 321 (631) Q Consensus 262 ~~~~~~~~~~~-v-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------------ 321 (631) .+...+.+++. + +..++++|.+..++...+.+...+...++...++..+..++....+.. T Consensus 317 ~~g~~D~~~v~v~~~~g~~~~~~g~v~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~v~~~~~~~~~~~~~~~~~~~~ 396 (749) T protein:vir:10 317 VGGHRDEMHVILVDIDGGVTGTVGALLERYIDVSKASDAKTSVGETNYYAEVIKQKSEFIYWAEHESTLYAATSSASDGL 396 (749) T ss_pred ccCCCCceEEEEecCCCeeeecccceeeeeeeccccccccccccccchhhhhhccCCCEEEEEecccccccccccccccc Confidence 11112222221 1 223456666666666666666666666666666665554443221100 Q ss_pred -----------------c------------------ccccccccccc---------chhhhhhHHHHHhhhhhcccceeE Q lcl|NC_013693. 322 -----------------A------------------GVTELEGGVDD---------YTGNRVAAIEALNNAEAYDAKPVF 357 (631) Q Consensus 322 -----------------~------------------~~~~l~gg~d~---------~~~~~~~~~~~l~~~~~~~~~~~i 357 (631) . ....+.+|.|. ...++.+.+..+...+.....++| T Consensus 397 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~d~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~li 476 (749) T protein:vir:10 397 FGQTAANRQFNLFRSAAGSVDYPAGVTTLGSKNNATYYYRLSGGVNYTVSAGQYTITNTDIGSAYELIGDPESQIVDFII 476 (749) T ss_pred cccccccceeeccccccccceeccccccccccCCcEEEEEccCCcccccccccccccchhHHHHHHHhhhhhhcccceEE Confidence 0 00112233322 234566777788877878888888 Q ss_pred Eecccc------chHHHHHHHHHhhccceEEeecccccccc--cccCCHHHHHHHHHhcCCCcceEEEecCeeEEEeccC Q lcl|NC_013693. 
358 AFCEEL------IEQQTLIDLSTERKDTVSFVSPLRDVVVG--NRGREMEDVVAWRESLVRDSSYFFMDDNWAYVYDKYN 429 (631) Q Consensus 358 ~~~~~~------~~~~~~~~~~~~~~~~~a~~d~~~~~~~~--~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~ 429 (631) +.++.. .++.+++++|+++++||+++|+|...... .......++..||..+. +|+|+++||||++++|+.+ T Consensus 477 ~~~~~~~~~~~~~v~~al~~~~~~~~~~~~~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~-~s~~~~~~~p~~~~~d~~~ 555 (749) T protein:vir:10 477 SGPSGTSDANALAKITSLVNIAEERRDCMVFVSPRRGNVIGISNTTTITTNIVDFFKKLP-SSSYMVFDSGYKYIYDKYN 555 (749) T ss_pred EecCCCCcchhHHHHHHHHHHHhhcCCEEEEEcCCCCcccccccchhhhhHHHHHHhhcc-CceeEEEEccceeeecccc Confidence 877653 36789999999999999999998765333 33445667788887754 6889999999999999999 Q ss_pred CceeEeehHHHHHHHHHHhhccCCceecccceeeceeeecccceecCChhHhhhhhhcCceEEEEEcCCcEEEEcceecC Q lcl|NC_013693. 430 DKMRWIPACGGTAGVWARSIEIAGIYKSPAFHNRGKYNNYNRMAWSASSDERAVLYRNQINSIVTFSNEGIVLYGDKTGL 509 (631) Q Consensus 430 ~~~~~~p~s~~~ag~~a~~d~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~L~~~gin~i~~~~~~G~~~wg~rT~~ 509 (631) +..+++|||+++||+|||+|.++||||||||+++++|.|+.++++.+++.|++.||++|||||++|+++|+++||+||++ T Consensus 556 ~~~~~~p~s~~vAGl~Ar~D~~~g~~~SPan~~~~~i~g~~~~~~~~~~~e~~~Ln~~gIn~i~~~~g~G~~~wG~rT~~ 635 (749) T protein:vir:10 556 DVYRYIPCNGDTAGLCLQTNEISEPWFSPAGFQRGVLRNAIKLAYTPNKAQRDQLYANRVNPIVSFPGQGVVLYGDKTAL 635 (749) T ss_pred CceEEechHHHHHHHHHHhhccCCcEECcCCceeeeeeccccceeecChhHHHhhhhCCceEEEEecCCeEEEEcceecC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999998 Q ss_pred CCChhhceehhhHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEccCCCCHHHhhCCe Q lcl|NC_013693. 
510 TRPSAFDRINVRGLFIMAEQNIAAIAKYYLGENNDEFTRSLFSNAVRPYIRQLANMGAIYDGQVKCDADNNTADIIAANQ 589 (631) Q Consensus 510 ~~~~~~~~i~vrR~~~~i~~~~~~~~~~~v~epn~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~~~~~nt~~~i~~G~ 589 (631) +.|++|+|||||||++||+++|+++++|+||||||+.+|++|+++|+.||++||++|+|.||+|+||+++||+++|++|+ T Consensus 636 s~d~~~~~i~vRRl~~~ie~si~~~~~~~v~epn~~~l~~~i~~~i~~fL~~l~~~G~i~~f~V~~d~~~Nt~~~i~~G~ 715 (749) T protein:vir:10 636 GFASAFDRINIRRLFLTVERVISTAAKAQLFEQNDEAQRSLFINIVEPYLRDVQGRRGVVDFLVKCDSTNNTPEAVDRGE 715 (749) T ss_pred CCCcccceeehhhhHHHHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHHHHHHHhcCCeeeeEEEEcCCCCCHHHhhCCE Confidence 88899999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEEEEEecCCceEEEEEEEEEecCceeeeeec Q lcl|NC_013693. 590 MVAGIWLKPEYSINWVYLDFAAVRPDMEFSEIET 623 (631) Q Consensus 590 ~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~~ 623 (631) |+++|+|+|++|||||+|||+|++++++|+|+.+ T Consensus 716 ~~~~i~~~P~~pae~I~~~~~~~~~~~~~~e~~s 749 (749) T protein:vir:10 716 FYAEVFLKPTRTINYVQLTFVATRTGVSFAEVAS 749 (749) T ss_pred EEEEEEEEecCCccEEEEEEEEeecCcchHHHhC Confidence 9999999999999999999999999999999998 No 2 >protein:vir:103456 Length: 659 # NCBI annotation: tail sheath monomer # Family: family:all:661 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803108;genbank:gi:116326388;genbank:GeneID:4405485 Probab=100.00 E-value=2.9e-142 Score=796.54 Aligned_cols=610 Identities=27% Similarity=0.417 Sum_probs=407.7 Q ss_pred chhcCCceEEEEecCCCceecccCCceEEEeeeccCcCCCCeEEecCHHHHHHHcCCCCccchhHHHHHHHHHhCCceEE Q lcl|NC_013693. 
5 SFSVAPSVQWTERDATLQTSPSVVVQGATVGKFQWGEAELPVLVTGGETGLVKKFFKPNDATATDFLVIADFLSYSSVAW 84 (631) Q Consensus 5 ~~ylsPGVyveEv~~~~~~~gv~tsv~afvG~~~~Gp~~~p~~i~s~~~e~~~~fG~~~~~~~~~~av~~fF~ngG~~~~ 84 (631) ..||+|||||||+|+++++++++|||+||||+|+|||+++|++|+|| .||+++||++++.+|++|+|++||+|||++|| T Consensus 1 ~~~~~PgVyv~e~~~~~~~~~~~ts~~~fvG~~~~Gp~~~p~~i~s~-~d~~~~fG~~~~~~~~~~~v~~~f~ngg~~~~ 79 (659) T protein:vir:10 1 MTLLSPGIELKETTVQSTVVNNSTGTAALAGKFQWGPAFQIKQVTNE-VDLVNTFGQPTAETADYFMSAMNFLQYGNDLR 79 (659) T ss_pred CceecCceEEEEecCCceecccCccceEEEecccCCCCCccEEecCH-HHHHHHcCCcCCCcchhHHHHHHHhhCCCeEE Confidence 45999999999999999999889999999999999999999999886 79999999999999999999999999999999 Q ss_pred EEEecccCCCcccccccchhhhccccccccc----cccceeeehhhhhhhhhhchhhhhccCcccceeec---cceeeee Q lcl|NC_013693. 85 VTRVVGPAARNAVTKGQTAILIRNKLDFETA----SPSASITWTGRYAGSLGNDVAINVCDAAGFPTWEF---RNNFAYA 157 (631) Q Consensus 85 vvRv~~~~a~~a~~~~~~~~~~~~~~~~~~~----~~~~~l~~~a~~~G~~gn~l~v~v~~~~~~~~~~~---~~~~~~~ 157 (631) |||+.+.++..++.......... ...... .....+++.+. .|++...+.+.+......... ....... T Consensus 80 vvRv~~~~~~~~a~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~---~~g~~~~v~~vd~~~~~~~~~i~~~~~~~~~ 154 (659) T protein:vir:10 80 VVRAVDRDTAKNSSPIAGNIEYT--ISTPGSNYAVGDKITVKYVSD---AIETEGKITEVDTDGKIKKINIPTAKIIAKA 154 (659) T ss_pred EEEccCcccccccccccccceee--EeecccccccccceeeeecCC---CccccceeeEEecccccceeeeccccccccc Confidence 99998765443332211111100 011111 11112233333 344444444443332211111 1111112 Q ss_pred cccccceEEeeeeeeeeecccccccceeeeeecccccccceeEeeccccccccccccccccccccccccccccccccccc Q lcl|NC_013693. 158 PQAGEYHIVIVDKVGRITDSSGAVGQVDRISVSGTATGAGSISVAGEDVAYTDTDTPATLATKIGTALTALTDVYSSVVV 237 (631) Q Consensus 158 ~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 237 (631) ...+........+..............................+.................... .........+... 
T Consensus 155 ~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~a~t~~~~~~~~~~~~---~~~v~a~~~G~~g 231 (659) T protein:vir:10 155 KEVGEYPTLGSNWTAEISSSSSGLAAVITLGKIITDSGILLAEIENAEAAMTAVDFQANLKKYG---IPGVVALYPGELG 231 (659) T ss_pred ccccccceeeeeeeeeeeeeccccceeeEEeeeecCCceeEEeeccccccccccccccceeecc---cccccccccceec Confidence 2223333333333222222222111111111111111100111111110000000000000000 0000001111111 Q ss_pred ccccccccccccccc----------------cccceeecccccccceeeeecccccccceeeeeeeeecccccccchhhh Q lcl|NC_013693. 238 KSNTVTVTHKAIGPQ----------------TVTAIVPDANGLTATAVTTTVGASGSIIEKYELMQATQGSKKSDGSNAY 301 (631) Q Consensus 238 ~~~~~~v~~~~~~~~----------------~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 301 (631) ....+.......... ..........+.....+...+...+...+.... ....+.....+...+ T Consensus 232 ~~~tv~~~~~a~~~~~~~v~v~~~~~~~~~a~~~t~~~~~~~~~~~~~~~~v~~~~~~~~~~~~-~~~~~~~~~~~~~~~ 310 (659) T protein:vir:10 232 DKIEIEIVSKADYAKGASALLPIYPGGGTRASTAKAVFGYGPQTDSQYAIIVRRNDAIVQSVVL-STKRGEKDIYDSNIY 310 (659) T ss_pred ccceEEEechhhccccceeeeeeeeecccccccceeeeeeccccccchhhccccccceeeeeee-eccccccccccchhh Confidence 111111000000000 000000111111112222233333444444322 222233334444445 Q ss_pred hhhhhccc-cceeeeeccccc---ccccccccccccc----hhhhhhHHHHHhhhhhcccceeEEeccc---------cc Q lcl|NC_013693. 302 FKDVINDT-SNWVYTFATTLA---AGVTELEGGVDDY----TGNRVAAIEALNNAEAYDAKPVFAFCEE---------LI 364 (631) Q Consensus 302 ~~~~~~~~-~~~~~~~~~~~~---~~~~~l~gg~d~~----~~~~~~~~~~l~~~~~~~~~~~i~~~~~---------~~ 364 (631) ....+.++ +.++.......+ .....+.+|.++. .++..+.+..+...+.++... +++|+ .. 
T Consensus 311 ~~~~~~~~~~~~v~~~~~~~~~~~~~~~~l~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~i--l~~p~~~~~~~~~~~~ 388 (659) T protein:vir:10 311 IDDFFAKGGSEYIFATAQNWPEGFSGILTLSGGLSSNAEVTAGDLMEAWDFFADRESVDVQL--FIAGSCAGESLETAST 388 (659) T ss_pred hhhhhccCcccEEEEeecccCCCccceeeecccccccccccchhHHHHHHHhhhccccceeE--EEecCCCCcchhhhHH Confidence 55555443 334433322222 2345677887653 345566666776666655443 33333 24 Q ss_pred hHHHHHHHHHhhccceEEeeccccc-ccccccCCHHHHHHHHHh--------cCCCcceEEEecCeeEEEeccCCceeEe Q lcl|NC_013693. 365 EQQTLIDLSTERKDTVSFVSPLRDV-VVGNRGREMEDVVAWRES--------LVRDSSYFFMDDNWAYVYDKYNDKMRWI 435 (631) Q Consensus 365 ~~~~~~~~~~~~~~~~a~~d~~~~~-~~~~~~~~~~~~~~~~~~--------~~~~s~~~~~~~p~~~~~d~~~~~~~~~ 435 (631) ++.++++||+++++||+++|+|+.. ++.+..++.+++++||+. ++++|+|+++||||++++|+.+++++++ T Consensus 389 v~~al~~~~~~~~~~~~~~d~p~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~s~~~~l~~p~~~~~d~~~~~~~~~ 468 (659) T protein:vir:10 389 VQKHVVSIGDARQDCLVLCSPPRETVVGIPVTRAVDNLVNWRTAAGSYTDNNFNISSTYAAIDGNYKYQYDKYNDVNRWV 468 (659) T ss_pred HHHHHHHHHHhhCCeEEEEcCccccccCCCcccCHHHHHHHHHhcccccccccccCcceEEEEeCcEEEecccCCceEEe Confidence 6788999999999999999998765 455677889999999985 3578999999999999999999999999 Q ss_pred ehHHHHHHHHHHhhccCCceecccceeeceeeecccceecCChhHhhhhhhcCceEEEEEcCCcEEEEcceecCCCChhh Q lcl|NC_013693. 
436 PACGGTAGVWARSIEIAGIYKSPAFHNRGKYNNYNRMAWSASSDERAVLYRNQINSIVTFSNEGIVLYGDKTGLTRPSAF 515 (631) Q Consensus 436 p~s~~~ag~~a~~d~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~L~~~gin~i~~~~~~G~~~wg~rT~~~~~~~~ 515 (631) |||+++||+|||+|.++||||||+|+++++|.|+.++++.+++.|++.||++|||||++|+++|+++||+||+++++++| T Consensus 469 p~sg~~AGl~Ar~D~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~~~~s~~ 548 (659) T protein:vir:10 469 PLAADIAGLCARTDNVSQTWMSPAGYNRGQILNVIKLAIETRQAQRDRLYQEAINPVTGTGGDGYVLYGDKTATSVPSPF 548 (659) T ss_pred chHHHHHHHHHHHhccCCceEccCCceeeeeeccccceecCCHhHHHHHhhCCeeEEEEeCCCeEEEEcccccCCCCccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999998888899 Q ss_pred ceehhhHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEccCCCCHHHhhCCeEEEEEE Q lcl|NC_013693. 516 DRINVRGLFIMAEQNIAAIAKYYLGENNDEFTRSLFSNAVRPYIRQLANMGAIYDGQVKCDADNNTADIIAANQMVAGIW 595 (631) Q Consensus 516 ~~i~vrR~~~~i~~~~~~~~~~~v~epn~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~~~~~nt~~~i~~G~~~~~i~ 595 (631) +|||||||++||+++|+++++|+||||||+.||++|+++|+.||++||++|+|+||+|+||+++||+++|++|+|+++|+ T Consensus 549 ~~i~vrR~~~~i~~si~~~~~~~v~e~n~~~l~~~i~~~i~~fL~~l~~~gal~~~~V~~d~~~nt~~~i~~G~~~~~i~ 628 (659) T protein:vir:10 549 DRINVRRLFNMLKTNIGRSSKYRLFELNNAFTRSSFRTETAQYLQGIKALGGIYEYRVVCDTTNNTPSVIDRNEFVATFY 628 (659) T ss_pred ceEehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhcCceeeEEEEEcCCCCCHHHhhCCeEEEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEecCCceEEEEEEEEEecCceeeeeeccCe Q lcl|NC_013693. 
596 LKPEYSINWVYLDFAAVRPDMEFSEIETGGG 626 (631) Q Consensus 596 ~~p~~p~e~i~~~~~~~~~~~~~~e~~~~g~ 626 (631) |+|++|+|||+|||+|++++++|+|+.-++| T Consensus 629 ~~p~~pae~i~~~~~~~~~~~~~~e~~~~~~ 659 (659) T protein:vir:10 629 IQPARSINYITLNFVATATGADFDELTGLAG 659 (659) T ss_pred EEecCCcceEEEEEEEEecCcchHHhhccCC Confidence 9999999999999999999999999988888 No 3 >protein:vir:106984 Length: 743 # NCBI annotation: contractile sheath protein gp18 # Family: family:all:661 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195136;genbank:gi:58532913;uniprot:Q5GQN6;genbank:GeneID:3260481 Probab=100.00 E-value=3.2e-141 Score=790.85 Aligned_cols=616 Identities=28% Similarity=0.434 Sum_probs=418.5 Q ss_pred CCCcchhcCCceEEEEecCCCcee-cccCCceEEEeeeccCcCCCCeEEecCHHHHHHHcCCCCccchhHHHHHHHHHhC Q lcl|NC_013693. 1 MATQSFSVAPSVQWTERDATLQTS-PSVVVQGATVGKFQWGEAELPVLVTGGETGLVKKFFKPNDATATDFLVIADFLSY 79 (631) Q Consensus 1 m~~~~~ylsPGVyveEv~~~~~~~-gv~tsv~afvG~~~~Gp~~~p~~i~s~~~e~~~~fG~~~~~~~~~~av~~fF~ng 79 (631) |+. ||||||||||+|+++++| ||+|||+||||+|+|||+++|++|+|| .||++.||++++.+|++|+|++||+|| T Consensus 1 m~~---~~~Pgvyv~e~~~~~~~i~~~~t~~~~~vg~~~~Gp~~~p~~i~s~-~~~~~~fG~~~~~~~~~~~v~~~f~ng 76 (743) T protein:vir:10 1 MAS---QVSPGILIKERDLTNAVVTGALQIRAAHASTFAKGPIGDIVNINTQ-KELVSVFGEPKEDNAEDWMVASEFLNY 76 (743) T ss_pred Ccc---ccCCceEEEEecCCCceeccCCcceeEEEEeccCCCCCcCEEecCH-HHHHHHcCCccCCcchHHHHHHHHHhC Confidence 654 999999999999999888 999999999999999999999999886 799999999999999999999999999 Q ss_pred CceEEEEEecccCCCcccccccchhhhccccccccccccceeeehhhhhhhhhhchhhhhccCcccceeeccceee---- Q lcl|NC_013693. 80 SSVAWVTRVVGPAARNAVTKGQTAILIRNKLDFETASPSASITWTGRYAGSLGNDVAINVCDAAGFPTWEFRNNFA---- 155 (631) Q Consensus 80 G~~~~vvRv~~~~a~~a~~~~~~~~~~~~~~~~~~~~~~~~l~~~a~~~G~~gn~l~v~v~~~~~~~~~~~~~~~~---- 155 (631) |++||||||.+.+..+++... ....+.+..... .+....+.+.++++|+|||.+++.+.++............. 
T Consensus 77 g~~~~vvrv~~~~~~~a~~~~-~~~~~~~~~~~~-~~~~~~~~~~a~~~G~~gN~i~V~v~~~~~d~~~~~~~~~~~~~~ 154 (743) T protein:vir:10 77 GGRLAVVRAETTGVLNATTGS-AGVLVKNRESWD-AGSGNGEVFVARTAGSWGNSLMGVLVDRGADYIVTFAATPTDTAV 154 (743) T ss_pred CceEEEEEccCcccccccccc-cccccccccccc-ccccceeEEEEeeccccccceEEEEecCCCcceeeeecccccccc Confidence 999999999988877776543 223333333332 33456789999999999999999998765544322211000 Q ss_pred ----ee-----cccccceEE-eeeeeeeeecccc-------------cc---cceee------------eeecccccc-- Q lcl|NC_013693. 156 ----YA-----PQAGEYHIV-IVDKVGRITDSSG-------------AV---GQVDR------------ISVSGTATG-- 195 (631) Q Consensus 156 ----~~-----~~~g~~~~~-~~~~~~~v~~~~~-------------~~---~~~~~------------~~~~~~~~~-- 195 (631) .. ...+..... ............. .. ..... ......... T Consensus 155 ~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 234 (743) T protein:vir:10 155 GTQLLFSYSGTLVTGEILSYDSATNTATITASGTLTSQYLLDTPEQGLIGSFTDNSTTEVGRTPGTYSNVPASGGTGTGA 234 (743) T ss_pred ceeeeecccccccccceeeeeecCcceeeeeccccceeeecccccccccccccccccccccccccceeeEEecccccccc Confidence 00 000000000 0000000000000 00 00000 000000000 Q ss_pred cceeEeecccccccccccccccccc----cc--cc-----cccccccc-cccccccccccc-----------------cc Q lcl|NC_013693. 196 AGSISVAGEDVAYTDTDTPATLATK----IG--TA-----LTALTDVY-SSVVVKSNTVTV-----------------TH 246 (631) Q Consensus 196 ~~~~~~~~~~~~~~~~~~~~~~~~~----~~--~~-----~~~~~~~~-~~~~~~~~~~~v-----------------~~ 246 (631) ......................... .. .. ........ ............ .. T Consensus 235 ~~tv~v~~~~~~vg~~v~~~~~~~~~~~~~~~~~~v~~~~~~~~t~~~~~~~~~~~g~~~~~a~~~~~~~~~~~~~~~~~ 314 (743) T protein:vir:10 235 TFNVVVADAGGGVGGSVVVTLANPGTGYNQGETLTIASAATGDGTDILVTVATLSDGTIAITELKDWYLNTEIGSTGIKL 314 (743) T ss_pred cccccccccccccccccccccccccceeeeccccccccccccccccchhheecccccceeeeecccccccchhhcccccc Confidence 0000000000000000000000000 00 00 00000000 000000000000 00 Q ss_pred cccccccccceeecccccccceeeee--------cccccccceeeeeeeeecccccccchhhhhhhhhccccceeeeec- Q lcl|NC_013693. 
247 KAIGPQTVTAIVPDANGLTATAVTTT--------VGASGSIIEKYELMQATQGSKKSDGSNAYFKDVINDTSNWVYTFA- 317 (631) Q Consensus 247 ~~~~~~~~~~~~~~~~~~~~~~~~~~--------v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~- 317 (631) ....+......+........+.+... ....+.+++.+..++.....+...+...++...+++.+..+.... T Consensus 315 ~~~~~~~~t~~~~~~~~~~~d~~~v~v~~~~~~~~~~~~~v~~~~~~~s~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~~ 394 (743) T protein:vir:10 315 GDIGPRPGTSQFATDNGITDDQVHFAVIDTTGELTGTANTIVERLTYLSKLSDARSEENANIYYKNVINEQSAYLYHGND 394 (743) T ss_pred ccccccceeeeccccccccccceEEEEecCcceeeeccCceeEEEeeeecccccccccCcceeecceeccccceeeccCc Confidence 00011111122222222222222221 223445566666666666655555555555444444333322110 Q ss_pred -----------------------c----cccccccccccccccch---hhhhhHHHHHhhhhhcccceeEEecc------ Q lcl|NC_013693. 318 -----------------------T----TLAAGVTELEGGVDDYT---GNRVAAIEALNNAEAYDAKPVFAFCE------ 361 (631) Q Consensus 318 -----------------------~----~~~~~~~~l~gg~d~~~---~~~~~~~~~l~~~~~~~~~~~i~~~~------ 361 (631) . .......++.||.|..+ .++.+++..+...+.++... +++|+ T Consensus 395 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gG~d~~~~~~~~~~~~~~~~~~~~~~~~~l-l~~p~~~~~~~ 473 (743) T protein:vir:10 395 AAVQIAASGEAWGQSSDQVLADAGTAFSRTTGYWVNLAGGNDDFAYDAGEFGAAMDLFLDTEETEIDF-VLMGGSMADEA 473 (743) T ss_pred ccceeeeccccCccccceeeeecccccccccceEEEeecCccccccchhHHHHHHHHhhhccccCcce-EEecCcccCcc Confidence 0 00112246778887644 44556666676666665433 33332 Q ss_pred -ccchHHHHHHHHHhhccceEEeecccccc-------cccccCCHHHHHHHHHhcCCCcceEEEecCeeEEEeccCCcee Q lcl|NC_013693. 362 -ELIEQQTLIDLSTERKDTVSFVSPLRDVV-------VGNRGREMEDVVAWRESLVRDSSYFFMDDNWAYVYDKYNDKMR 433 (631) Q Consensus 362 -~~~~~~~~~~~~~~~~~~~a~~d~~~~~~-------~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~ 433 (631) ...++.++++||+++++||+++|+|.... 
......+..++..|++.+ .+|+|+++||||++++|+.++..+ T Consensus 474 ~~~~v~~a~~~~~~~~~~~~a~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~s~~~~~~~p~~~~~d~~~~~~~ 552 (743) T protein:vir:10 474 DTKSKATKVIAIAASRKDALAFVSPHKGNQIASTGNVALSSAQQKENTIAFFSDL-TSTSYAVFDSGYKYVYDRFTDKYR 552 (743) T ss_pred chHHHHHHHHHHHHhhCCeEEEEecCCCccccccccccccccccchHHHHHHHhc-cCCeeEEEEccceeeeccccCcee Confidence 13468899999999999999999997532 233456677888888764 478999999999999999999999 Q ss_pred EeehHHHHHHHHHHhhccCCceecccceeeceeeecccceecCChhHhhhhhhcCceEEEEEcCCcEEEEcceecCCCCh Q lcl|NC_013693. 434 WIPACGGTAGVWARSIEIAGIYKSPAFHNRGKYNNYNRMAWSASSDERAVLYRNQINSIVTFSNEGIVLYGDKTGLTRPS 513 (631) Q Consensus 434 ~~p~s~~~ag~~a~~d~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~L~~~gin~i~~~~~~G~~~wg~rT~~~~~~ 513 (631) ++|||+++||++||+|.++||||||+|+++.+|.|+.++++.+++.|++.||++|||||++|+++|+++||+||++++|+ T Consensus 553 ~~p~s~~~AGl~a~~D~~~g~~~span~~~~gi~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~s~d~ 632 (743) T protein:vir:10 553 YIPCNGDVAGLCVQTSNQLDDWYSPAGLNRGGILNAVKLAYNPNKADRDELYQNRINPVVSLRGQGITLFGDKTALAAPS 632 (743) T ss_pred EechhHHHHHHHHHhhccCCcEEccCCeeeeeeeccccceecCChhHHHhHhhCCceEEEEecCCeEEEEcccccCCCCc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999988889 Q ss_pred hhceehhhHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEccCCCCHHHhhCCeEEEE Q lcl|NC_013693. 
514 AFDRINVRGLFIMAEQNIAAIAKYYLGENNDEFTRSLFSNAVRPYIRQLANMGAIYDGQVKCDADNNTADIIAANQMVAG 593 (631) Q Consensus 514 ~~~~i~vrR~~~~i~~~~~~~~~~~v~epn~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~~~~~nt~~~i~~G~~~~~ 593 (631) +|+|||||||++||+++|+++++|+||||||+.+|++|+++|++||++||++|+|.||+|+||+++||+++|++|+|+++ T Consensus 633 ~~~~i~vrR~~~~i~~si~~~~~~~v~e~n~~~~~~~i~~~i~~fL~~l~~~gal~~~~V~~d~~~nt~~~i~~G~~~~~ 712 (743) T protein:vir:10 633 AFDRINVRRLFLNLEKRARRLAEGVLFEQNDATTRAGFSSALNSYLSEVQARRGVTDYLVICDESNNTPDIIDRNEFVAE 712 (743) T ss_pred ccceEeehhhHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhcCceeeeEEEEcCCCCCHHHhhCCeEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEecCCceEEEEEEEEEecCceeeeeecc Q lcl|NC_013693. 594 IWLKPEYSINWVYLDFAAVRPDMEFSEIETG 624 (631) Q Consensus 594 i~~~p~~p~e~i~~~~~~~~~~~~~~e~~~~ 624 (631) |+++|++|+|||+|||+|+++|++|+|+.+= T Consensus 713 i~~~p~~pae~I~~~~~~~~~~~~~~e~~~~ 743 (743) T protein:vir:10 713 VYVKPTRSINFITITFTATKTGVTFSEVVGR 743 (743) T ss_pred EEEEecCCcceEEEEEEEeecCcchHhhhcC Confidence 9999999999999999999999999999554 No 4 >protein:vir:7206 Length: 659 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049780;genbank:gi:9632592;genbank:GeneID:1258597 Probab=100.00 E-value=1.1e-140 Score=787.96 Aligned_cols=614 Identities=27% Similarity=0.402 Sum_probs=406.5 Q ss_pred chhcCCceEEEEecCCCceecccCCceEEEeeeccCcCCCCeEEecCHHHHHHHcCCCCccchhHHHHHHHHHhCCceEE Q lcl|NC_013693. 
5 SFSVAPSVQWTERDATLQTSPSVVVQGATVGKFQWGEAELPVLVTGGETGLVKKFFKPNDATATDFLVIADFLSYSSVAW 84 (631) Q Consensus 5 ~~ylsPGVyveEv~~~~~~~gv~tsv~afvG~~~~Gp~~~p~~i~s~~~e~~~~fG~~~~~~~~~~av~~fF~ngG~~~~ 84 (631) .+||||||||||+|+++++++++|||+||||+|+|||+|+|++|+|| .||+++||++++.+|++|++++||+|||++|| T Consensus 1 ~~~~~PgVyvee~~~~~~~~~~~ts~~~fvG~~~~Gp~~~p~~i~s~-~~~~~~fG~~~~~~~~~~~~~~~f~ngg~~~~ 79 (659) T protein:vir:72 1 MTLLSPGIELKETTVQSTVVNNSTGTAALAGKFQWGPAFQIKQVTNE-VDLVNTFGQPTAETADYFMSAMNFLQYGNDLR 79 (659) T ss_pred CceecCceEEEEecCCcccccCCCcceEEEeecCCCCCcccEEecCH-HHHHHHcCCcCCCCchhHHHHHHHHhCCceEE Confidence 46999999999999999999889999999999999999999999887 79999999999999999999999999999999 Q ss_pred EEEecccCCCcccccccchhhhccccc--cccccccceeeehhhhhhhhhhchhhhhccCcccceeeccceeeeeccccc Q lcl|NC_013693. 85 VTRVVGPAARNAVTKGQTAILIRNKLD--FETASPSASITWTGRYAGSLGNDVAINVCDAAGFPTWEFRNNFAYAPQAGE 162 (631) Q Consensus 85 vvRv~~~~a~~a~~~~~~~~~~~~~~~--~~~~~~~~~l~~~a~~~G~~gn~l~v~v~~~~~~~~~~~~~~~~~~~~~g~ 162 (631) ||||++.+...++.+............ .........+++.++.+|.|++...+.......................+. T Consensus 80 vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~g~v~~~~~~~~~~~~~v~t~~~~a~~~~~~~ 159 (659) T protein:vir:72 80 VVRAVDRDTAKNSSPIAGNIDYTISTPGSNYAVGDKITVKYVSDDIETEGKITEVDADGKIKKINIPTGKNYAKAKEVGE 159 (659) T ss_pred EEEccCCcccccccccccccceeecccccccccceeeeeeeccccccccceEEEeeccccceeeeecccccccccccccc Confidence 999987654433322111111111111 111112233456667778777654432221111111111111112222222 Q ss_pred ceEEeeeeeeeeecccccccceeeeeecccccccceeEeecccccccccccccccccccccccccccccccccccccccc Q lcl|NC_013693. 163 YHIVIVDKVGRITDSSGAVGQVDRISVSGTATGAGSISVAGEDVAYTDTDTPATLATKIGTALTALTDVYSSVVVKSNTV 242 (631) Q Consensus 163 ~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 242 (631) +..........+............................................. 
...........+.......+ T Consensus 160 ~~~~~~~~~~~~~~~~~~~a~~~~~v~v~~~~~~~~~~v~~~~~a~~~~~~~~~v~~---~~~~~~~a~~~gt~g~~~tv 236 (659) T protein:vir:72 160 YPTLGSNWTAEISSSSSGLAAVITLGKIITDSGILLAEIENAEAAMTAVDFQANLKK---YGIPGVVALYPGELGDKIEI 236 (659) T ss_pred ccccccceeeEEeeccccccceEEEEEeecCcceeeeeccccchhhhcccccccccc---cccceeeeccccccccceeE Confidence 222222222222221111111111111110000000000000000000000000000 00000000111111111111 Q ss_pred ccccccccccccc----------------ceeecccccccceeeeecccccccceeeeeeeeecccccccchhhhhhhhh Q lcl|NC_013693. 243 TVTHKAIGPQTVT----------------AIVPDANGLTATAVTTTVGASGSIIEKYELMQATQGSKKSDGSNAYFKDVI 306 (631) Q Consensus 243 ~v~~~~~~~~~~~----------------~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 306 (631) .+........... .......+.........+...+...+... .....+.....+...+....+ T Consensus 237 ~i~~~~~~~~~~~~~v~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~ 315 (659) T protein:vir:72 237 EIVSKADYAKGASALLPIYPGGGTRASTAKAVFGYGPQTDSQYAIIVRRNDAIVQSVV-LSTKRGEKDIYDSNIYIDDFF 315 (659) T ss_pred EEccccccccceeeeeecccccccccccceeeeeeecccccccceeeecccceeeeee-eeeccccccccchhhhhhhhh Confidence 1111000000000 00001111111222222223333333332 222333344444555555655 Q ss_pred cccc-ceeeeecccc---cccccccccccccc----hhhhhhHHHHHhhhhhcccceeEEecccc---------chHHHH Q lcl|NC_013693. 307 NDTS-NWVYTFATTL---AAGVTELEGGVDDY----TGNRVAAIEALNNAEAYDAKPVFAFCEEL---------IEQQTL 369 (631) Q Consensus 307 ~~~~-~~~~~~~~~~---~~~~~~l~gg~d~~----~~~~~~~~~~l~~~~~~~~~~~i~~~~~~---------~~~~~~ 369 (631) .+++ .++....... ......+.+|.++. ..+..+++..+...+.++.. ++++|+. 
.++.++ T Consensus 316 ~~~~~~~v~~~~~~~~~~~~~~~~l~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~l~~p~~~~~~~~~~~~v~~~l 393 (659) T protein:vir:72 316 AKGGSEYIFATAQNWPEGFSGILTLSGGLSSNAEVTAGDLMEAWDFFADRESVDVQ--LFIAGSCAGESLETASTVQKHV 393 (659) T ss_pred hcCCceEEEEEecccCCcccccccccccccccccccchhHHHHHHHhhhcccccee--EEEecCCCCcchhhhHHHHHHH Confidence 5443 3433332222 23345677777653 23455566666666655443 3333332 367889 Q ss_pred HHHHHhhccceEEeeccccc-ccccccCCHHHHHHHHHhc--------CCCcceEEEecCeeEEEeccCCceeEeehHHH Q lcl|NC_013693. 370 IDLSTERKDTVSFVSPLRDV-VVGNRGREMEDVVAWRESL--------VRDSSYFFMDDNWAYVYDKYNDKMRWIPACGG 440 (631) Q Consensus 370 ~~~~~~~~~~~a~~d~~~~~-~~~~~~~~~~~~~~~~~~~--------~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~ 440 (631) ++||+++++||+++|+|+.. ++.+...+.+++++||+.+ +++|+|+++||||++++|+.+++++++|||++ T Consensus 394 ~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~sg~ 473 (659) T protein:vir:72 394 VSIGDARQDCLVLCSPPRETVVGIPVTRAVDNLVNWRTAAGSYTDNNFNISSTYAAIDGNHKYQYDKYNDVNRWVPLAAD 473 (659) T ss_pred HHHHhhhCCEEEEEcCccccccCCCcccCHHHHHHHHhhccccccccccccceeEEEEcCceeeccccCCceEEechHHH Confidence 99999999999999998765 4456778899999999864 56899999999999999999999999999999 Q ss_pred HHHHHHHhhccCCceecccceeeceeeecccceecCChhHhhhhhhcCceEEEEEcCCcEEEEcceecCCCChhhceehh Q lcl|NC_013693. 
441 TAGVWARSIEIAGIYKSPAFHNRGKYNNYNRMAWSASSDERAVLYRNQINSIVTFSNEGIVLYGDKTGLTRPSAFDRINV 520 (631) Q Consensus 441 ~ag~~a~~d~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~L~~~gin~i~~~~~~G~~~wg~rT~~~~~~~~~~i~v 520 (631) +||+|||+|.++||||||+|+++++|.|+.++++.+++.|++.||++|||||++|+++|+++||+||+++++++|+|||| T Consensus 474 vAGl~Ar~D~~~G~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~g~G~~~wG~rT~~~~~s~~~~i~v 553 (659) T protein:vir:72 474 IAGLCARTDNVSQTWMSPAGYNRGQILNVIKLAIETRQAQRDRLYQEAINPVTGTGGDGYVLYGDKTATSVPSPFDRINV 553 (659) T ss_pred HHHHHHHhhccCCcEEccCCeeeceeeccccccccCChhHHHHHhhCCceEEEEecCCeEEEEcccccCCCCcccceEee Confidence 99999999999999999999999999999999999999999999999999999999999999999999888889999999 Q ss_pred hHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEccCCCCHHHhhCCeEEEEEEEEecC Q lcl|NC_013693. 521 RGLFIMAEQNIAAIAKYYLGENNDEFTRSLFSNAVRPYIRQLANMGAIYDGQVKCDADNNTADIIAANQMVAGIWLKPEY 600 (631) Q Consensus 521 rR~~~~i~~~~~~~~~~~v~epn~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~~~~~nt~~~i~~G~~~~~i~~~p~~ 600 (631) |||++||+++|+++++|+|||||++.||++|+++|++||++||++|+|.||+|+||+++||+++|++|+|+++|+|+|++ T Consensus 554 rR~~~~i~~si~~~~~~~v~e~n~~~l~~~i~~~i~~fL~~l~~~gal~~~~V~~d~~~nt~~~i~~G~~~~~i~~~p~~ 633 (659) T protein:vir:72 554 RRLFNMLKTNIGRSSKYRLFELNNAFTRSSFRTETAQYLQGNKALGGIYEYRVVCDTTNNTPSVIDRNEFVATFYIQPAR 633 (659) T ss_pred hhHHHHHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHHHHHHHhcCceeeEEEEEcCCCCCHHHhhCCeEEEEEEEEecC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CceEEEEEEEEEecCceeeeee-ccC Q lcl|NC_013693. 601 SINWVYLDFAAVRPDMEFSEIE-TGG 625 (631) Q Consensus 601 p~e~i~~~~~~~~~~~~~~e~~-~~g 625 (631) |+|||+|||+|+++|++|+|+. 
++| T Consensus 634 pae~I~~~~~~~~~~~~~~e~~~~~~ 659 (659) T protein:vir:72 634 SINYITLNFVATATGADFDELTGLAG 659 (659) T ss_pred CccEEEEEEEEeecCcchHHhcccCC Confidence 9999999999999999999963 344 No 5 >protein:vir:104858 Length: 729 # NCBI annotation: T4-like tail sheath protein # Family: family:all:661 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214361;genbank:gi:61806001;genbank:GeneID:3294378 Probab=100.00 E-value=4.8e-141 Score=789.86 Aligned_cols=618 Identities=28% Similarity=0.414 Sum_probs=403.0 Q ss_pred CCCcchhcCCceEEEEecCCCcee-cccCCceEEEeeeccCcCCCCeEEecCHHHHHHHcCCC--CccchhHHHHHHHHH Q lcl|NC_013693. 1 MATQSFSVAPSVQWTERDATLQTS-PSVVVQGATVGKFQWGEAELPVLVTGGETGLVKKFFKP--NDATATDFLVIADFL 77 (631) Q Consensus 1 m~~~~~ylsPGVyveEv~~~~~~~-gv~tsv~afvG~~~~Gp~~~p~~i~s~~~e~~~~fG~~--~~~~~~~~av~~fF~ 77 (631) |++ +|++|||||||+|+++++| ||+||++||||+|+|||+++|++|+|| .||+++||+| ++.++++|++++||+ T Consensus 1 m~~--~~~~PgVyv~e~~~~~~~i~~v~ts~~~fvG~~~~Gp~~~p~~i~s~-~~~~~~fG~~~~~~~~~~~~~~~~~f~ 77 (729) T protein:vir:10 1 MPL--NLASPGIVVREVDLTIGRVDPTSGSIGALVAPFAKGPVNDPQLIESE-EDLLQTFGQPYSTDKHYEYWMVASSYL 77 (729) T ss_pred CCc--cccCCceEEEEecCCCcccccccccceeEEeccccCCCccCeEcCCH-HHHHHHcCccccCCcchhHHHHHHHHH Confidence 555 5999999999999999888 999999999999999999999999886 7999999998 456789999999999 Q ss_pred hCCceEEEEEecccCCCccccccc-----------chhhhccccc----cccccccceeeehhhhhhhhhhchhhhhccC Q lcl|NC_013693. 78 SYSSVAWVTRVVGPAARNAVTKGQ-----------TAILIRNKLD----FETASPSASITWTGRYAGSLGNDVAINVCDA 142 (631) Q Consensus 78 ngG~~~~vvRv~~~~a~~a~~~~~-----------~~~~~~~~~~----~~~~~~~~~l~~~a~~~G~~gn~l~v~v~~~ 142 (631) |||++||||||.+..+..+..... ....+..... .........+++.++++|.|||.+++.+.+. 
T Consensus 78 ngg~~~~vvRv~~~~~~~a~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~G~~gn~~~v~v~~~ 157 (729) T protein:vir:10 78 AYGGTMQVVRADDYNTQTGVGLKNAFVTGGVGVGATMLRITSNTHYNQLGYDENTISGVSVAAKNPGTWANGIKVAIIDG 157 (729) T ss_pred hCCceEEEEecCcccccccccccccccccccccccccccccccccccccccccCCCcceEEEEeccCccccceeeEEecc Confidence 999999999999876554432110 0000111111 1112234567889999999999999998887 Q ss_pred cccceeeccceeeeec------------ccccceEEeeee-eeeeecccccccceeeeeecccccccceeE-eecccccc Q lcl|NC_013693. 143 AGFPTWEFRNNFAYAP------------QAGEYHIVIVDK-VGRITDSSGAVGQVDRISVSGTATGAGSIS-VAGEDVAY 208 (631) Q Consensus 143 ~~~~~~~~~~~~~~~~------------~~g~~~~~~~~~-~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~ 208 (631) ................ ............ .................. ..+...... ........ T Consensus 158 ~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~s---~~~~~~~~~~~~~~~~~~ 234 (729) T protein:vir:10 158 KADQILTVASGNTTAVGSAVTQSISKTIGTATGTTTIDGVLKGIVTGSTDTTLEVKVIS---HISAAGVETAVEYQQNGT 234 (729) T ss_pred cCcceeeeeccccccceeeeeeeccccccccccceeeeeeecccccccccccccceecc---cccccccceeccccccce Confidence 6655433221110000 000000000000 000000000000000000 000000000 00000000 Q ss_pred --cccc-cccccccc-----cccccccccccccccccccccccccccccccccccceeeccccccccee--------eee Q lcl|NC_013693. 209 --TDTD-TPATLATK-----IGTALTALTDVYSSVVVKSNTVTVTHKAIGPQTVTAIVPDANGLTATAV--------TTT 272 (631) Q Consensus 209 --~~~~-~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~ 272 (631) .... ........ ..............................+...........+...+.. ... 
T Consensus 235 ~~~~~~~s~~~~a~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~d~~~~~~~d~~~~~ 314 (729) T protein:vir:10 235 YTFDNSGSVNVIAAGSSGSGSAKSYTAQTDWFESQNIVLSNSTLEWDSIADAPGTSTYVSTRGGKNDEIHVLVIDDKGTI 314 (729) T ss_pred eeecccCccceeeeccccccccccceeeeccccccccccccccccccccccccccccccccccccccccceeeecccccc Confidence 0000 00000000 0000000000000000000000000000000000111111111111111 111 Q ss_pred cccccccceeeeeeeeecccccccchhhhhhhhhccccceeeeeccc--------------------------------- Q lcl|NC_013693. 273 VGASGSIIEKYELMQATQGSKKSDGSNAYFKDVINDTSNWVYTFATT--------------------------------- 319 (631) Q Consensus 273 v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------------------------------- 319 (631) ....+..++.+..+..........+...+....+...+.++...... T Consensus 315 ~~~~g~vve~~~~~s~~~~~~~~~~~~~~~~~vi~~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~ 394 (729) T protein:vir:10 315 TGNSGTILEKHLSLSKAKDAEYSVGSSSYWRDFLATNSKYIFGGGATSGITTTGYSVSSTNTLDTDSGWDQNAEGVNFGA 394 (729) T ss_pred ccCcccceeeeeeeeeccccccccccccccceeeccccceeeecccccccccccccccccceeccccccccccccccccc Confidence 23334445554444444444444444444433333332221111000 Q ss_pred ccccccccccccc--------------cchhhhhhHHHHHhhhhhcccceeEEeccc------cchHHHHHHHHHhhccc Q lcl|NC_013693. 320 LAAGVTELEGGVD--------------DYTGNRVAAIEALNNAEAYDAKPVFAFCEE------LIEQQTLIDLSTERKDT 379 (631) Q Consensus 320 ~~~~~~~l~gg~d--------------~~~~~~~~~~~~l~~~~~~~~~~~i~~~~~------~~~~~~~~~~~~~~~~~ 379 (631) .......+.+|.+ ....+...++..|...+.+...++++.++. ..++.++++||+++++| T Consensus 395 ~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~g~~~l~~~~~~~~~~~~~~~~~~~~~~~~~v~~a~~~~~~~~~~~ 474 (729) T protein:vir:10 395 SGVATLTLAGGTNYGDKTDLTTSGALSSGVDDIISGYTLFENTEEIEVDFILMGAAHHPKEQSQAVAEKVTAVAEARKDA 474 (729) T ss_pred cceeEEEeecccccccccccccccccccchhHHHHHHHHhhcccccccceeeecCCCCCccchHHHHHHHHHHHHhcCCe Confidence 0001112223322 122233445556666666666666766543 24678999999999999 Q ss_pred eEEeeccccccc----------ccccCCHHHHHHHHHhcCCCcceEEEecCeeEEEeccCCceeEeehHHHHHHHHHHhh Q lcl|NC_013693. 
380 VSFVSPLRDVVV----------GNRGREMEDVVAWRESLVRDSSYFFMDDNWAYVYDKYNDKMRWIPACGGTAGVWARSI 449 (631) Q Consensus 380 ~a~~d~~~~~~~----------~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~ag~~a~~d 449 (631) ++++|+|+.... ....++.+++..||+.++ +++|+++||||++++|+.++..+++|||+++||+|||+| T Consensus 475 ~a~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~p~~~~~d~~~~~~~~~p~s~~~aGl~a~~d 553 (729) T protein:vir:10 475 VAFISPYRQAFLNDTVSGTVTVSNIDQTTENVVGFYAPLS-SSTYSVFDSGYKYMFDRFNNTFRYVPLNGDIAGTCARTD 553 (729) T ss_pred EEEecccccccccccccccccccccchhhHHHHHHHhhcc-CCceEEEEcCeeEEecccCCceEEechhHHHHHHHHHhh Confidence 999999865422 223456677888998876 688999999999999999999999999999999999999 Q ss_pred ccCCceecccceeeceeeecccceecCChhHhhhhhhcCceEEEEEcCCcEEEEcceecCCCChhhceehhhHHHHHHHH Q lcl|NC_013693. 450 EIAGIYKSPAFHNRGKYNNYNRMAWSASSDERAVLYRNQINSIVTFSNEGIVLYGDKTGLTRPSAFDRINVRGLFIMAEQ 529 (631) Q Consensus 450 ~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~L~~~gin~i~~~~~~G~~~wg~rT~~~~~~~~~~i~vrR~~~~i~~ 529 (631) .++||||||+|+++.+|.|+.++++.++++|++.||++|||+|++|+++|+++||+||+++.|++|+|||||||++||++ T Consensus 554 ~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~~~d~~~~~i~vrR~~~~i~~ 633 (729) T protein:vir:10 554 IEQFPWFSPAGTARGPILNSVKLVYNPGKKQRDILYSNRINPVILSPGAGIILFGDKTGFGKSSAFDRINVRRLFIYLED 633 (729) T ss_pred ccCCcEEccCCccccceecccceeeecChhhHhhhhhCCceEEEEecCCeEEEEcceecCCCCcccceeehhhhHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999888899999999999999999 Q ss_pred HHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEccCCCCHHHhhCCeEEEEEEEEecCCceEEEEEE Q lcl|NC_013693. 
530 NIAAIAKYYLGENNDEFTRSLFSNAVRPYIRQLANMGAIYDGQVKCDADNNTADIIAANQMVAGIWLKPEYSINWVYLDF 609 (631) Q Consensus 530 ~~~~~~~~~v~epn~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~~~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~ 609 (631) +|+++++|+||||||+.+|++|+++|++||++||++|+|.||+|+||+++||+++|++|+|+++|+++|.+|+|||+||| T Consensus 634 si~~~~~~~v~epn~~~~~~~i~~~i~~~L~~l~~~g~l~g~~v~~d~~~nt~~~i~~G~~~~~v~~~p~~p~e~i~~~~ 713 (729) T protein:vir:10 634 AISAAAKDQLFEFNDELTRTNFVNIVEPFLRDVQAKRGIFDFVVICDETNNTAAVIDSNEFVADIFIKPARSINFIGLTF 713 (729) T ss_pred HHHHHHHHhhcCCCCHHHHHHHHHHHHHHHHHHHhccceeeeEEEEcCCCCCHHHhhCCeEEEEEEEEecCCccEEEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEecCceeeeeeccCee Q lcl|NC_013693. 610 AAVRPDMEFSEIETGGGI 627 (631) Q Consensus 610 ~~~~~~~~~~e~~~~g~~ 627 (631) +|++++++|+|+ .+.| T Consensus 714 ~~~~~~~~~~e~--~~~~ 729 (729) T protein:vir:10 714 VATRTGVAFEEV--IGSV 729 (729) T ss_pred EEeecCccHHHH--HhcC Confidence 999999999999 4555 No 6 >protein:vir:98263 Length: 664 # NCBI annotation: gp18 tail sheath monomer # Family: family:all:661 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239196;genbank:gi:66391671;genbank:GeneID:3416365 Probab=100.00 E-value=3.4e-140 Score=785.21 Aligned_cols=605 Identities=27% Similarity=0.398 Sum_probs=401.9 Q ss_pred CCCcchhcCCceEEEEecCCCcee-cccCCceEEEeeeccCcCCCCeEEecCHHHHHHHcCCCCccchhHHHHHHHHHhC Q lcl|NC_013693. 
1 MATQSFSVAPSVQWTERDATLQTS-PSVVVQGATVGKFQWGEAELPVLVTGGETGLVKKFFKPNDATATDFLVIADFLSY 79 (631) Q Consensus 1 m~~~~~ylsPGVyveEv~~~~~~~-gv~tsv~afvG~~~~Gp~~~p~~i~s~~~e~~~~fG~~~~~~~~~~av~~fF~ng 79 (631) |+ ||||||||||++ ++++| ||+||++||||+|+|||+|+|++|+|| .||++.||++++.+|++|+|++||+|| T Consensus 1 ma----~~~PgVyv~E~~-~~~~i~~~~ts~~~~vG~~~~Gp~~~p~~i~s~-~d~~~~fG~~~~~~~~~~~v~~~f~ng 74 (664) T protein:vir:98 1 MA----LQSPGIETKETS-VQSTVVRNSTGRAAIVGKFSWGPAYQIRQISNE-VELVNYFGAPDNLTADYFMSAVNFLQY 74 (664) T ss_pred Cc----eecCceEEEecC-CCcccccccccceEEEeeccCCCCCccEEecCH-HHHHHhcCCccccchhHHHHHHHHHhc Confidence 44 889999999997 56777 999999999999999999999999887 799999999999999999999999999 Q ss_pred CceEEEEEecccCCCcccccccchhhhcccccc-----------ccccccceeeehhhhhhhhhhchhhhhccCccccee Q lcl|NC_013693. 80 SSVAWVTRVVGPAARNAVTKGQTAILIRNKLDF-----------ETASPSASITWTGRYAGSLGNDVAINVCDAAGFPTW 148 (631) Q Consensus 80 G~~~~vvRv~~~~a~~a~~~~~~~~~~~~~~~~-----------~~~~~~~~l~~~a~~~G~~gn~l~v~v~~~~~~~~~ 148 (631) |++||||||.+.+...++............... ..........+.++.+|.|||.+++.+.+....... T Consensus 75 g~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~gn~~~v~i~~~~~~~~~ 154 (664) T protein:vir:98 75 GNDLRLVRVVDKDAAKNASAIFNQIKTTIASQGSNYNVGDVIKVKYSNTVVEENGKVTQVDSNGKILAVTIPKRKKSLLV 154 (664) T ss_pred CCeEEEEEecCccccccccccccccceeeccCCcccccccceeEeecCcccccceeeccCCCCCceeeEeeccCccceee Confidence 999999999876543322211111111111110 001111223456788999999998887665433221 Q ss_pred eccceeeeecc-cccceEEeee--eeeeeecccccccceeeeeecccccccceeEeeccccccccccccccccccccccc Q lcl|NC_013693. 149 EFRNNFAYAPQ-AGEYHIVIVD--KVGRITDSSGAVGQVDRISVSGTATGAGSISVAGEDVAYTDTDTPATLATKIGTAL 225 (631) Q Consensus 149 ~~~~~~~~~~~-~g~~~~~~~~--~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 225 (631) ........... .........+ ...... ...... .......................... .... 
T Consensus 155 ~~~~~~~~~~~~~~~~s~~~~s~g~a~a~~-v~~v~~---d~~~~~~~~~~a~~~i~~~~~~~~~~----------~~~~ 220 (664) T protein:vir:98 155 LNRSVLTQIFLLVGTTEIVSQSSGVSASIT-IDGIES---DSGITLLNLDIAKETIQGTSFQTLTQ----------KYQI 220 (664) T ss_pred cccccccccceecccceeeeeecccceeee-cccccc---cceeeccccceeeeccccccceeeee----------cccc Confidence 11111000000 0000000000 000000 000000 00000000000000000000000000 0000 Q ss_pred cccccccccccccccc--------------ccccccccccccccceeecccccccceeeeecccccccceeeeeeeeecc Q lcl|NC_013693. 226 TALTDVYSSVVVKSNT--------------VTVTHKAIGPQTVTAIVPDANGLTATAVTTTVGASGSIIEKYELMQATQG 291 (631) Q Consensus 226 ~~~~~~~~~~~~~~~~--------------~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~ 291 (631) ......+.+....... +..........................+.+.+..++...+.+. +..... T Consensus 221 ~~~~a~~~G~~Gn~isv~i~s~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~-~~~~~~ 299 (664) T protein:vir:98 221 PSVVALYPGELGSTVQVEIISKAAYDTGAMISGYPSGISVKNSGRSVMTYGPQTDNQYAFVVRRGGIVQESFI-VSTDKT 299 (664) T ss_pred ceeeeeecccccceeeeeecccccccCcceEeeccCceecccceeeeeeccccCccceeEEEecCCceeeeEE-eecccC Confidence 0000000000000000 0000000000000001111112223344455555566555554 333334 Q ss_pred cccccchhhhhhhhhccccc-eeeeecccccc---cccccccccccch----hhhhhHHHHHhhhhhcccceeEEeccc- Q lcl|NC_013693. 292 SKKSDGSNAYFKDVINDTSN-WVYTFATTLAA---GVTELEGGVDDYT----GNRVAAIEALNNAEAYDAKPVFAFCEE- 362 (631) Q Consensus 292 ~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~---~~~~l~gg~d~~~----~~~~~~~~~l~~~~~~~~~~~i~~~~~- 362 (631) .+...+...+....+.++.. ++.......+. ....+.+|.+... .+...++..|...+.++.. ++++|+. 
T Consensus 300 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~~~~~~g~~~~~tgl~~l~~~~~~~~~-ll~~p~~~ 378 (664) T protein:vir:98 300 DKDIYGVNIYMDDFFANGGSQYVFGTSMNWPKGFSGILEFGGGLSSNDTVGADELMTGWDMFADREALHVP-LLIAGGCA 378 (664) T ss_pred cccceeeeeechhheecccceeeeeecccCCcccceeEeccCccccccccCchhHHHHHHhhhcccccccc-eEEecCCC Confidence 44444444444444443332 22222222221 2234566665331 1222233344433333332 2333331 Q ss_pred -------cchHHHHHHHHHhhccceEEeeccccc-ccccccCCHHHHHHHHHh------------cCCCcceEEEecCee Q lcl|NC_013693. 363 -------LIEQQTLIDLSTERKDTVSFVSPLRDV-VVGNRGREMEDVVAWRES------------LVRDSSYFFMDDNWA 422 (631) Q Consensus 363 -------~~~~~~~~~~~~~~~~~~a~~d~~~~~-~~~~~~~~~~~~~~~~~~------------~~~~s~~~~~~~p~~ 422 (631) ..++.++++||+++++||+++|+|+.. ++.+..++.+++++||+. .+++|+|+++||||+ T Consensus 379 ~~~~~~~~~v~~al~~~a~~~~~~~a~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~l~~p~~ 458 (664) T protein:vir:98 379 GESVEIASTVQKHVISIGDERQDCTVFVSPPRSLLVNIPLATAVDNIVEWRTGYKISGGTPVDNNLNVSSSYGFLDGNYK 458 (664) T ss_pred CCcHHHHHHHHHHHHHHHHhcCCeEEEEccccceeccCCccccHHHHHHHhhhccccccchhhhhcCCccceEEEEcCeE Confidence 136788999999999999999998754 556778899999999974 357899999999999 Q ss_pred EEEeccCCceeEeehHHHHHHHHHHhhccCCceecccceeeceeeecccceecCChhHhhhhhhcCceEEEEEcC-CcEE Q lcl|NC_013693. 
423 YVYDKYNDKMRWIPACGGTAGVWARSIEIAGIYKSPAFHNRGKYNNYNRMAWSASSDERAVLYRNQINSIVTFSN-EGIV 501 (631) Q Consensus 423 ~~~d~~~~~~~~~p~s~~~ag~~a~~d~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~L~~~gin~i~~~~~-~G~~ 501 (631) +++|+.+++++++|||+++||+|||+|.++||||||+|+++.+|.|+.++.+.+++.|++.||++|||+|++|++ +|++ T Consensus 459 ~~~d~~~~~~~~~p~sg~~AGl~A~~D~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~~G~~ 538 (664) T protein:vir:98 459 YQYDKYNDVNRWVPLAGDIAGLCVYTDSVANPWMSPAGYNRGQIRNCIKLAIEPRTAHRDAMYQVQINPVTGFAGGSGFV 538 (664) T ss_pred EEecccCCceEEechHHHHHHHHHHhhhcCCcEECcCCceeeeeeccccceeecChhhHHHHHhCCCeEEEEeeCCCcEE Confidence 999999999999999999999999999999999999999999999999999999999999999999999999997 7999 Q ss_pred EEcceecCCCChhhceehhhHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEccCCCC Q lcl|NC_013693. 502 LYGDKTGLTRPSAFDRINVRGLFIMAEQNIAAIAKYYLGENNDEFTRSLFSNAVRPYIRQLANMGAIYDGQVKCDADNNT 581 (631) Q Consensus 502 ~wg~rT~~~~~~~~~~i~vrR~~~~i~~~~~~~~~~~v~epn~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~~~~~nt 581 (631) +||+||+++++++|+||||||||+||+++|+++++|+||||||+.||++|+++|+.||++||++|+|+||+|+||+++|| T Consensus 539 ~wG~rT~~~~~s~~~~i~vrR~~~~i~~si~~~~~~~v~epn~~~l~~~i~~~i~~~L~~l~~~gal~g~~V~~d~~~nt 618 (664) T protein:vir:98 539 LYGDKTLTSVPSPFDRINVRRLFNMIKKDIGDNAKYKLFENNDDFTRASFRMDTGQYMTNIRALGGCYDYRVICDTTNNT 618 (664) T ss_pred EEcccccCCCCcccceEeehhHHHHHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHHHHHHHhcCceeeeEEEEcCCCCC Confidence 99999998888899999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHhhCCeEEEEEEEEecCCceEEEEEEEEEecCceeeeeeccCee Q lcl|NC_013693. 
582 ADIIAANQMVAGIWLKPEYSINWVYLDFAAVRPDMEFSEIETGGGI 627 (631) Q Consensus 582 ~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~~~g~~ 627 (631) +++|++|+|+++|+++|++|+|||+|||+|+++|++|+|+...=.| T Consensus 619 ~~~i~~G~~~~~i~~~p~~pae~I~~~~~q~~~~~~~~e~~~~~~~ 664 (664) T protein:vir:98 619 PDVIDRNEFVATVYVKPPRSINYITLNFVATSTGADFDELVGPQAV 664 (664) T ss_pred HHHhhCCeEEEEEEEEecCCcceEEEEEEEeecCcchhHhcccccC Confidence 9999999999999999999999999999999999999999877777 No 7 >protein:vir:6894 Length: 660 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861870;genbank:gi:32453661;genbank:GeneID:1494296 Probab=100.00 E-value=1.1e-139 Score=782.35 Aligned_cols=617 Identities=27% Similarity=0.391 Sum_probs=397.7 Q ss_pred chhcCCceEEEEecCCCcee-cccCCceEEEeeeccCcCCCCeEEecCHHHHHHHcCCCCccchhHHHHHHHHHhCCceE Q lcl|NC_013693. 5 SFSVAPSVQWTERDATLQTS-PSVVVQGATVGKFQWGEAELPVLVTGGETGLVKKFFKPNDATATDFLVIADFLSYSSVA 83 (631) Q Consensus 5 ~~ylsPGVyveEv~~~~~~~-gv~tsv~afvG~~~~Gp~~~p~~i~s~~~e~~~~fG~~~~~~~~~~av~~fF~ngG~~~ 83 (631) ..||||||||||++ ++++| ||+|||+||||+|+|||+|+|++|+|| .||++.||++++.+|++|++++||+|||++| T Consensus 1 ~~~~~PgVyv~e~~-~~~~i~~v~ts~~~fvG~~~~Gp~~~p~~i~s~-~~~~~~fG~~~~~~~~~~~~~~~f~~~g~~~ 78 (660) T protein:vir:68 1 MALLSPGVELKETT-VQSTVVNNSTGTAALAGKFQWGPAFQIKQITDE-VALVDMFGTPNTDTADYFMSAMNFLQYGNDL 78 (660) T ss_pred CccccCceEEEEec-CCcccccCCCcceeEEecccCCCCccCEEecCH-HHHHHhcCCccCccchhHHHHHHHHhCCCeE Confidence 35999999999997 45556 999999999999999999999999886 7999999999999999999999999999999 Q ss_pred EEEEecccCCCcccccccchhhhccccccc--cccccceeeehhhhhhhhhhchhhhhccCcccceeeccceeeeecccc Q lcl|NC_013693. 84 WVTRVVGPAARNAVTKGQTAILIRNKLDFE--TASPSASITWTGRYAGSLGNDVAINVCDAAGFPTWEFRNNFAYAPQAG 161 (631) Q Consensus 84 ~vvRv~~~~a~~a~~~~~~~~~~~~~~~~~--~~~~~~~l~~~a~~~G~~gn~l~v~v~~~~~~~~~~~~~~~~~~~~~g 161 (631) ||||+++.+...+................. 
.......++.....++.+++.+.+.......................+ T Consensus 79 ~vvRv~~~~~~~~~~~~~~~~~~t~~~~g~~~~~g~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~ta~~~~~a~~~~ 158 (660) T protein:vir:68 79 RVVRAVDRDTAKNSSPVAGNINFTISSAGTNYRVGDKVVVKYSTDIIEPDGEVTSVDSDGKILNIFIPSGKIIAKAKEIG 158 (660) T ss_pred EEEEecccccccccccccccceeeeeccCcceeeeeeeeeecccccccccccceeeeecCceeeeeeccccccccceeec Confidence 999998755433222111111000000000 000011111111222233332222111100000000000000001111 Q ss_pred cceEEeeeeeeeeecccccccceeeeeecccccccceeEeeccccccccccccccccccccccccccccccccccccccc Q lcl|NC_013693. 162 EYHIVIVDKVGRITDSSGAVGQVDRISVSGTATGAGSISVAGEDVAYTDTDTPATLATKIGTALTALTDVYSSVVVKSNT 241 (631) Q Consensus 162 ~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 241 (631) .+..........+....... .......+...+...................... .............+.+....... T Consensus 159 ~~~~~~~~~~~~v~~~~~~~--~~~~~v~~~~~d~~~~~~~~~ta~~~~~~~~~~~-~~~~~~~~~~~A~~~g~~G~~i~ 235 (660) T protein:vir:68 159 EYPELGSNWTAEMSGSSSGL--SAVITIDSVVMDSGILLTEVETSEEAITSLTFQE-SIKKYGVPGVVALYPGELGDQLE 235 (660) T ss_pred cccccccceeEEeecccccc--eeeeeeccccccccceeeeeccccccccccceee-eecccCccccccccccccccceE Confidence 11111111111111110000 0000000000000000000000000000000000 00000000000001111000000 Q ss_pred ccccccc---------------ccccc-ccceeecccccccceeeeecccccccceeeeeeeeecccccccchhhhhhhh Q lcl|NC_013693. 242 VTVTHKA---------------IGPQT-VTAIVPDANGLTATAVTTTVGASGSIIEKYELMQATQGSKKSDGSNAYFKDV 305 (631) Q Consensus 242 ~~v~~~~---------------~~~~~-~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 305 (631) +...... ..... ...............+.+.+..++...+.+. +....+.....+...++... 
T Consensus 236 v~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~ 314 (660) T protein:vir:68 236 IEIVSKADYDKGASAQLKIYPDGGTRYSTAKAIFGYGPQTDDQYAIIVRRNDSVVQSVV-LSTKRGERDIYGSNIFIDDF 314 (660) T ss_pred EEEeccccccccccccceeeecccccccceeeEeecccccccceeeeeecCCcceeeee-eecccccccccccceeeehh Confidence 0000000 00000 0001111122223344455555666666553 33333444444444444444 Q ss_pred hccc-cceeeeeccccc---ccccccccccccc----hhhhhhHHHHHhhhhhcccceeEEeccc-------cchHHHHH Q lcl|NC_013693. 306 INDT-SNWVYTFATTLA---AGVTELEGGVDDY----TGNRVAAIEALNNAEAYDAKPVFAFCEE-------LIEQQTLI 370 (631) Q Consensus 306 ~~~~-~~~~~~~~~~~~---~~~~~l~gg~d~~----~~~~~~~~~~l~~~~~~~~~~~i~~~~~-------~~~~~~~~ 370 (631) +.++ +..+.......+ ....++.||.++. .++..+.++++...+.++...+++.+.. ..++.+++ T Consensus 315 ~~~~~~~~v~~~~~~~~~~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~v~~~l~ 394 (660) T protein:vir:68 315 FAKGASNYIFATAQGWPKGFSGVIKLNGGLSSNETVEAGDLMEAWDLFADRESVNAQLFIAGSCAGESLEVASTVQKHVV 394 (660) T ss_pred hccCcccEEEEeecCCCccccceeeeccccccccccccchhhhHHHHhhhhhccccceeeccccCCCchHHHHHHHHHHH Confidence 4333 333333322222 2234567777653 3455667777887887776665543322 24678999 Q ss_pred HHHHhhccceEEeeccccc-ccccccCCHHHHHHHHHhc--------CCCcceEEEecCeeEEEeccCCceeEeehHHHH Q lcl|NC_013693. 371 DLSTERKDTVSFVSPLRDV-VVGNRGREMEDVVAWRESL--------VRDSSYFFMDDNWAYVYDKYNDKMRWIPACGGT 441 (631) Q Consensus 371 ~~~~~~~~~~a~~d~~~~~-~~~~~~~~~~~~~~~~~~~--------~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~ 441 (631) +||+++++||+++|+|+.. ++.+.+++++++++||+.. 
+++|+|+++||||++++|+.++..+++|||+++ T Consensus 395 ~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~sg~~ 474 (660) T protein:vir:68 395 AIGDSRQDCLVLCSPPRAAVVGIPVNRAVDNLVDWRTASGTYTDNNFNISSTYAAIDGNYKYQYDKYNDVNRWVPLAADI 474 (660) T ss_pred HHHHhhCCeEEEEcccceeEecCCCCCCHHHHHHHHhhcccccccccccCcceEEEEcCceEEecccCCceEEechhHHH Confidence 9999999999999998765 5667788999999999853 568999999999999999999999999999999 Q ss_pred HHHHHHhhccCCceecccceeeceeeecccceecCChhHhhhhhhcCceEEEEEcCCcEEEEcceecCCCChhhceehhh Q lcl|NC_013693. 442 AGVWARSIEIAGIYKSPAFHNRGKYNNYNRMAWSASSDERAVLYRNQINSIVTFSNEGIVLYGDKTGLTRPSAFDRINVR 521 (631) Q Consensus 442 ag~~a~~d~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~L~~~gin~i~~~~~~G~~~wg~rT~~~~~~~~~~i~vr 521 (631) ||+|||+|.++||||||+|+++.+|.|+.++++.+++.|++.||++|||+||+|+++|+++||+||+++++++|+||||| T Consensus 475 AGl~Ar~d~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~~~~s~~~~i~vr 554 (660) T protein:vir:68 475 AGLCARTDNISQPWMSPAGYNRGQILNVIKLAIETRQAQRDRLYQEAINPVTGTGGDGYVLYGDKTATSVPSPFDRINVR 554 (660) T ss_pred HHHHHHHhccCCcEEccCCeeeceeeccceeeecCChhHHHHHhhCCceEEEEecCCeEEEEcceecCCCCcccceEehh Confidence 99999999999999999999999999999999999999999999999999999999999999999998888899999999 Q ss_pred HHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEccCCCCHHHhhCCeEEEEEEEEecCC Q lcl|NC_013693. 
522 GLFIMAEQNIAAIAKYYLGENNDEFTRSLFSNAVRPYIRQLANMGAIYDGQVKCDADNNTADIIAANQMVAGIWLKPEYS 601 (631) Q Consensus 522 R~~~~i~~~~~~~~~~~v~epn~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~~~~~nt~~~i~~G~~~~~i~~~p~~p 601 (631) |||+||+++|+++++|+||||||+.+|++|+++|++||++||++|+|.||+|+||+++||+++|++|+|+++|+++|++| T Consensus 555 R~~~~i~~si~~~~~~~v~epn~~~~~~~i~~~i~~~L~~l~~~gal~gf~V~~d~~~nt~~~i~~G~~~~~i~~~p~~p 634 (660) T protein:vir:68 555 RLFNMVKTNIGSASKYRLFELNNAFTRSSFRTETSQYLQGIKALGGVYNFKVVCDTTNNTPAVIDRNEFVATFYLQPARS 634 (660) T ss_pred hHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhcCceeeeEEEEecCCCCHHHhhCCeEEEEEEEEecCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceEEEEEEEEEecCceeeeeeccCeeee Q lcl|NC_013693. 602 INWVYLDFAAVRPDMEFSEIETGGGIVA 629 (631) Q Consensus 602 ~e~i~~~~~~~~~~~~~~e~~~~g~~~~ 629 (631) +|||+|||+|++++++|+|+. |.|=+ T Consensus 635 ae~i~l~~~~~~~~~~~~e~~--~~v~~ 660 (660) T protein:vir:68 635 INYITLNFVATATGADFDELI--GAVGG 660 (660) T ss_pred cceEEEEEEEeecCccHHHHH--HhhcC Confidence 999999999999999999984 33333 No 8 >protein:vir:80984 Length: 666 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469499;genbank:gi:157311456;genbank:GeneID:5602117 Probab=100.00 E-value=7.9e-140 Score=783.18 Aligned_cols=601 Identities=30% Similarity=0.436 Sum_probs=396.6 Q ss_pred chhcCCceEEEEecCCCceecccCCceEEEeeeccCcCCCCeEEecCHHHHHHHcCCCCccchhHHHHHHHHHhCCceEE Q lcl|NC_013693. 
5 SFSVAPSVQWTERDATLQTSPSVVVQGATVGKFQWGEAELPVLVTGGETGLVKKFFKPNDATATDFLVIADFLSYSSVAW 84 (631) Q Consensus 5 ~~ylsPGVyveEv~~~~~~~gv~tsv~afvG~~~~Gp~~~p~~i~s~~~e~~~~fG~~~~~~~~~~av~~fF~ngG~~~~ 84 (631) ..||||||||||++.+..+.||+||++||||+|+|||+++|++|+|| .||++.||++++.+|++|++++||+|||++|| T Consensus 1 ~~~~~Pgvyv~e~~~~~~~~~~~t~~~~~vg~~~~gp~~~p~~i~~~-~~~~~~fg~~~~~~~~~~~~~~~f~~~g~~~~ 79 (666) T protein:vir:80 1 MTLLSPGFETKETTLSTTIVQSATGRAALVGKFQWGPAFQIIQVTNE-VELVNKFGQPDNNTADYFMSGANFLQYGNDLR 79 (666) T ss_pred CceecCceEEEEecCCccccccCcccceEEeccccCCCccceEecCH-HHHHHhcCCccCccchHHHHHHHHhcCCCeEE Confidence 45999999999997555555999999999999999999999999886 79999999999999999999999999999999 Q ss_pred EEEecccCCCcccccccchhhhccccccccccccceeeehhhhhh---hhhhchhhhhccC--------------cccce Q lcl|NC_013693. 85 VTRVVGPAARNAVTKGQTAILIRNKLDFETASPSASITWTGRYAG---SLGNDVAINVCDA--------------AGFPT 147 (631) Q Consensus 85 vvRv~~~~a~~a~~~~~~~~~~~~~~~~~~~~~~~~l~~~a~~~G---~~gn~l~v~v~~~--------------~~~~~ 147 (631) ||||++.+...+..... ..+.+.++.+| .||+.+.+..... ..... T Consensus 80 v~R~~~~~~~~~a~~~~-----------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~i~~~~~~~~ 142 (666) T protein:vir:80 80 VVRVLNKEKAKNATALA-----------------GNIEFEITNEGSNYEVGDTIKIKHNRQDIETAGKVTKVDGDGKVKG 142 (666) T ss_pred EEEecCccccccccccc-----------------cceeEEEeeccccccccccccccccCcccccCcceEEEeecceeee Confidence 99998754432221111 11222222222 2333222211100 00000 Q ss_pred ee--ccceeeeecccccceEEeeeeeeeeecccccccceeeeeecccccccceeEeeccccccccccccccccccccccc Q lcl|NC_013693. 148 WE--FRNNFAYAPQAGEYHIVIVDKVGRITDSSGAVGQVDRISVSGTATGAGSISVAGEDVAYTDTDTPATLATKIGTAL 225 (631) Q Consensus 148 ~~--~~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 225 (631) .. ....+......+........+.............. ........+...................... ....... 
T Consensus 143 ~~~~ta~~~~~a~~~~~~~~v~~~~~~~~~~~~~~~~~a--~~V~~~~~~~~~~~~~~~~a~~~~t~~~~~~-~~~~~~~ 219 (666) T protein:vir:80 143 VFIPTGKIIAHAKAIGVYPELDGDWTAEFTSSSGNGSAA--LSVTKIVTDSGLLLTDLETSRANITNQTFLT-KLQKYDM 219 (666) T ss_pred eecchhhhccccccccccceeeccceeeeccccccceee--eeeeeeecCCccceeeecccccccccccccc-ccccccc Confidence 00 00000000111111111111111111111110000 0010000000000000000000000000000 0000000 Q ss_pred cccccccccccccccccc-------------ccccccccc-cccceeecccccccceeeeecccccccceeeeeeeeecc Q lcl|NC_013693. 226 TALTDVYSSVVVKSNTVT-------------VTHKAIGPQ-TVTAIVPDANGLTATAVTTTVGASGSIIEKYELMQATQG 291 (631) Q Consensus 226 ~~~~~~~~~~~~~~~~~~-------------v~~~~~~~~-~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~ 291 (631) ......+.+.......+. +......+. ................+.+.+...+..+|.+.. ..... T Consensus 220 ~a~~a~~~g~~g~~l~v~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~e~~~~-~~~~~ 298 (666) T protein:vir:80 220 PAVSAIYAGEIGNSLEVEILARSAFKNTAPDLTMYPYGGERTAARNLIPYAPQNDNQYAFIVRRDGVVVESYVL-STLKG 298 (666) T ss_pred hhhhhhcccccccceeeeeccccccccccccceeeeccccccccceeeeeccccccceeeEeccCCccceeeec-ccccc Confidence 000000110000000000 000000100 011111112223334456666777777877753 34444 Q ss_pred cccccchhhhhhhhhccccceeeeecccc----ccccccccccc------------ccchhhhhhHHHHHhhhhhcccce Q lcl|NC_013693. 292 SKKSDGSNAYFKDVINDTSNWVYTFATTL----AAGVTELEGGV------------DDYTGNRVAAIEALNNAEAYDAKP 355 (631) Q Consensus 292 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~l~gg~------------d~~~~~~~~~~~~l~~~~~~~~~~ 355 (631) .+...+...++...++++........... ......+.+|. ++..++..+....+...+..+. . 
T Consensus 299 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~-~ 377 (666) T protein:vir:80 299 DKDVYGNSIYMDDFFGRGSSQYIYATAQGWVDGFSGIISLAGGVSANEATTGGVGADPFIGAMMQGWGLFAERESIHV-N 377 (666) T ss_pred cccccchhhhhhhhhccccceeeeecccccccccceEEEecCCCCcccccccccccccccccchhhhhhhhhhccccc-c Confidence 55555666666666655543322211111 11112233332 2333444444333333333333 3 Q ss_pred eEEecc-------ccchHHHHHHHHHhhccceEEeeccccc-ccccccCCHHHHHHHHHh--------cCCCcceEEEec Q lcl|NC_013693. 356 VFAFCE-------ELIEQQTLIDLSTERKDTVSFVSPLRDV-VVGNRGREMEDVVAWRES--------LVRDSSYFFMDD 419 (631) Q Consensus 356 ~i~~~~-------~~~~~~~~~~~~~~~~~~~a~~d~~~~~-~~~~~~~~~~~~~~~~~~--------~~~~s~~~~~~~ 419 (631) +++.+. ...++.++++||+++++||+++|+|+.. ++.+..++++++++||+. ++++|+|+++|| T Consensus 378 ~l~~p~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~l~~ 457 (666) T protein:vir:80 378 LLIAGACAGEGDAFSTVQKHAVSIGDERQDCLVMVSPPRSTVVNIPVTTAIDNLIAWREGSGNYNENNMNINTTYAVIDG 457 (666) T ss_pred eEeecCcCCcccchHHHHHHHHHHHHhhcceEEEeecceeEEeecCCCCCHHHHHHHHHhcccchhhhcccCcceEEEEc Confidence 333332 1246789999999999999999998764 566788999999999986 357899999999 Q ss_pred CeeEEEeccCCceeEeehHHHHHHHHHHhhccCCceecccceeeceeeecccceecCChhHhhhhhhcCceEEEEEcCCc Q lcl|NC_013693. 
420 NWAYVYDKYNDKMRWIPACGGTAGVWARSIEIAGIYKSPAFHNRGKYNNYNRMAWSASSDERAVLYRNQINSIVTFSNEG 499 (631) Q Consensus 420 p~~~~~d~~~~~~~~~p~s~~~ag~~a~~d~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~L~~~gin~i~~~~~~G 499 (631) ||++++|+.+++.+++|||+++||+|||+|.++||||||+|+++++|.|+.++++.+++.|++.||++|||||++|+++| T Consensus 458 p~~~~~d~~~~~~~~~p~sg~~AGl~Ar~D~~~g~~~sPan~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~g~G 537 (666) T protein:vir:80 458 NYKYQYDKYNDVNRWVPLAADIAGLCARTDAVSQPWMSPAGYNRGQIMNVVKLAIEPRKAHRDRLYQAAINPVIGAGGEG 537 (666) T ss_pred CceEEecccCCceeEechHHHHHHHHHHHhhcCCceEccCCeecceeeccccceeecChhHHHhhhhCCeeEEEEeCCCe Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEcceecCCCChhhceehhhHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEccCC Q lcl|NC_013693. 500 IVLYGDKTGLTRPSAFDRINVRGLFIMAEQNIAAIAKYYLGENNDEFTRSLFSNAVRPYIRQLANMGAIYDGQVKCDADN 579 (631) Q Consensus 500 ~~~wg~rT~~~~~~~~~~i~vrR~~~~i~~~~~~~~~~~v~epn~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~~~~~ 579 (631) +++||+||+++++++|+||||||||+||+++|++.++|+||||||+.+|++|+++|+.||++||++|+|.||+|+||+++ T Consensus 538 ~~~wG~rT~~~~~s~~~~i~vRRl~~~i~~si~~~~~~~v~epn~~~l~~~i~~~i~~~L~~l~~~gal~g~~V~~d~~~ 617 (666) T protein:vir:80 538 FILMGDKTATTVPSPFDRINVRRLFNMLKKNIGDSSKYKLFENNDNFTRASFRMEVSQYLSTIRSLGGIYDFRVQCDTTN 617 (666) T ss_pred EEEEccccCCCCCcccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhcCceeeeEEEEcCCC Confidence 99999999988888999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCHHHhhCCeEEEEEEEEecCCceEEEEEEEEEecCceeeeeeccCeeeec Q lcl|NC_013693. 
580 NTADIIAANQMVAGIWLKPEYSINWVYLDFAAVRPDMEFSEIETGGGIVAA 630 (631) Q Consensus 580 nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~~~g~~~~~ 630 (631) ||+++|++|+|+++|+++|++|||||+|||+|+++|++|+|+ .|.|-+| T Consensus 618 nt~~di~~G~~~~~i~~~P~~Pae~I~~~~~~~~~~~~~~e~--~~~~~~~ 666 (666) T protein:vir:80 618 NTPDVIDRNEFVASMFIKPAKSINYIMLNFTAVATGSDFDEI--IGPVNQA 666 (666) T ss_pred CCHHHhhCCeEEEEEEEEecCCcceEEEEEEEeecCccHHHH--HHHHhcC Confidence 999999999999999999999999999999999999999999 7888777 No 9 >protein:vir:101187 Length: 663 # NCBI annotation: tail sheath protein # Family: family:all:661 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932509;genbank:gi:37651635;genbank:GeneID:2610680 Probab=100.00 E-value=3.2e-139 Score=779.85 Aligned_cols=611 Identities=26% Similarity=0.398 Sum_probs=403.9 Q ss_pred chhcCCceEEEEecCCCcee-cccCCceEEEeeeccCcCCCCeEEecCHHHHHHHcCCCCccchhHHHHHHHHHhCCceE Q lcl|NC_013693. 5 SFSVAPSVQWTERDATLQTS-PSVVVQGATVGKFQWGEAELPVLVTGGETGLVKKFFKPNDATATDFLVIADFLSYSSVA 83 (631) Q Consensus 5 ~~ylsPGVyveEv~~~~~~~-gv~tsv~afvG~~~~Gp~~~p~~i~s~~~e~~~~fG~~~~~~~~~~av~~fF~ngG~~~ 83 (631) ..||+|||||||++ ++++| ||+||++||||+|+|||+|+|++|+|| .||++.||++++.+|++|+|++||+|||++| T Consensus 1 ~~~~~PgVyv~e~~-~~~~i~~v~ts~~~fvG~~~~Gp~~~p~~i~s~-~d~~~~fG~~~~~~~~~~~v~~~f~ngg~~~ 78 (663) T protein:vir:10 1 MALLSPGIEMKETS-INSTVVRSATGRAAIVGKFAWGPAYEVRQVTNE-VELVDMFGSPDNVTAPYFMSAMNFLQYGNDL 78 (663) T ss_pred CceecCceEEEEec-CcccccccCccceeEEeeeccCCCCccEEecCH-HHHHHHhCCcCccchhHHHHHHHHHhCCCeE Confidence 45999999999997 55555 999999999999999999999999886 7999999999999999999999999999999 Q ss_pred EEEEecccCCCccccccc--chhhhcccccc-------ccccccceeeehhhhhhhhhhchhhhhccCcccceeecccee Q lcl|NC_013693. 84 WVTRVVGPAARNAVTKGQ--TAILIRNKLDF-------ETASPSASITWTGRYAGSLGNDVAINVCDAAGFPTWEFRNNF 154 (631) Q Consensus 84 ~vvRv~~~~a~~a~~~~~--~~~~~~~~~~~-------~~~~~~~~l~~~a~~~G~~gn~l~v~v~~~~~~~~~~~~~~~ 154 (631) |||||++.+...+..... ....+...... ........+...++.++.|+|...+.+.......... .. 
T Consensus 79 ~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~v~~~~~~~~~~~~~~~~~~n~~~~~v~~~~a~~~~~-~~-- 155 (663) T protein:vir:10 79 RLVRVIDMEKAKNASPLVNQVSVTITTEGQGYTVGDAITVKYNNATITEAGKVTAVDSDGKIKSLFVPTAEIIAK-TR-- 155 (663) T ss_pred EEEEccCCcccccccccCCcceeeeeccccCccccccccccccccccccccceeeeccCCceEEEEecccccccc-cc-- Confidence 999998765433221110 00000000000 0000011112223344445554443332211111000 00 Q ss_pred eeecccccceEEeeeeeeeeecccccccceeeeeecccccccceeEeecccccccccccccccccccccccccccccccc Q lcl|NC_013693. 155 AYAPQAGEYHIVIVDKVGRITDSSGAVGQVDRISVSGTATGAGSISVAGEDVAYTDTDTPATLATKIGTALTALTDVYSS 234 (631) Q Consensus 155 ~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 234 (631) ..+.......................... .....+...................... .............+.+ T Consensus 156 ----~v~~~~~~~~~~~~~~~~~~~~~~~~~~v--~~vv~~~~~~~~~~~~a~~~~~~~~~~~-~~~~~~~~~~~a~~~G 228 (663) T protein:vir:10 156 ----QLGTYPTLGDNWRIDVSGASGGSAAALAL--GNIVVDSGVTFGNSEDAPAVMTSPAVME-KYAKFGMPLVSAVYPG 228 (663) T ss_pred ----ccceeeeccccceeEeeeccccccccccc--cceecccceeeEeeccccccccccchhh-hcccccceeeeeeccc Confidence 00000000000000000000000000000 0000000000000000000000000000 0000000001111111 Q ss_pred cccccccccccccc--------------cccccccceeecccccccceeeeecccccccceeeeeeeeecccccccchhh Q lcl|NC_013693. 235 VVVKSNTVTVTHKA--------------IGPQTVTAIVPDANGLTATAVTTTVGASGSIIEKYELMQATQGSKKSDGSNA 300 (631) Q Consensus 235 ~~~~~~~~~v~~~~--------------~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~ 300 (631) .......+.+.... ....................+...+..++... ++..++...+.+...+... 
T Consensus 229 ~~Gn~i~v~i~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~s~~~~~~~~~~~~~ 307 (663) T protein:vir:10 229 EIGSTVEVEIVSKTAFNSGAQQTIYPFGGTRTSNARGVIQYGPMTDDQFAIIVRRDGIVV-ESTVLSTRKGDRDVYGSNI 307 (663) T ss_pred ccccceeEEecccccccccccccccccccccccccceeeeeccccccceeEEEecCCcce-eeeeeeecccccccchhhh Confidence 11111111100000 00000000011111112222333333333333 3334455555555566665 Q ss_pred hhhhhhccc-cceeeeecccccc---cccccccccccc----hhhhhhHHHHHhhhhhcccceeEEeccc-------cch Q lcl|NC_013693. 301 YFKDVINDT-SNWVYTFATTLAA---GVTELEGGVDDY----TGNRVAAIEALNNAEAYDAKPVFAFCEE-------LIE 365 (631) Q Consensus 301 ~~~~~~~~~-~~~~~~~~~~~~~---~~~~l~gg~d~~----~~~~~~~~~~l~~~~~~~~~~~i~~~~~-------~~~ 365 (631) ++...+.++ +.++.......+. ....+.+|.|+. ..+..+++..|...+.++..++++.++. ..+ T Consensus 308 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~gg~d~~~~~~~~~~~~~~~~l~~~~~~~~~~~i~~~~~~~~~~~~~~v 387 (663) T protein:vir:10 308 FMDDYFRNGGSNFIFASSEGWPAGFTGIIQLGGGTSANADVGADELIKGWDLFSDREALHVNLMIAGACGSDGAEIASTV 387 (663) T ss_pred hhhhhhccCcceEEEEeecccCccccceeEcccccCCCccccchhhHHHHHhhhcccccceeEEEeccCCCCchhhHHHH Confidence 565555443 3444433332222 234677887754 3566677788888887777776665543 236 Q ss_pred HHHHHHHHHhhccceEEeeccccccc-ccccCCHHHHHHHHHhc-----------CCCcceEEEecCeeEEEeccCCcee Q lcl|NC_013693. 366 QQTLIDLSTERKDTVSFVSPLRDVVV-GNRGREMEDVVAWRESL-----------VRDSSYFFMDDNWAYVYDKYNDKMR 433 (631) Q Consensus 366 ~~~~~~~~~~~~~~~a~~d~~~~~~~-~~~~~~~~~~~~~~~~~-----------~~~s~~~~~~~p~~~~~d~~~~~~~ 433 (631) +.++++||+++++||+++|+|..... 
...+.+.+++++||+.+ +++|+|+++||||++++|+.+++.+ T Consensus 388 ~~al~~~a~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~l~~P~~~~~d~~~~~~~ 467 (663) T protein:vir:10 388 QKYVVSLADDRQDCVAIVNPPAELMVGIPTSTAVKNIVEWRNGMTGSGEVVDNNMNISSTYAFIIGNYKYQYDKYNDINR 467 (663) T ss_pred HHHHHHHHHhhCCEEEEEecCcccccccccccchHHHHHHHHhccccccchhhccccCcceEEEEccceEEecccCCceE Confidence 78999999999999999999976533 34567889999998753 5789999999999999999999999 Q ss_pred EeehHHHHHHHHHHhhccCCceecccceeeceeeecccceecCChhHhhhhhhcCceEEEEEcC-CcEEEEcceecCCCC Q lcl|NC_013693. 434 WIPACGGTAGVWARSIEIAGIYKSPAFHNRGKYNNYNRMAWSASSDERAVLYRNQINSIVTFSN-EGIVLYGDKTGLTRP 512 (631) Q Consensus 434 ~~p~s~~~ag~~a~~d~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~L~~~gin~i~~~~~-~G~~~wg~rT~~~~~ 512 (631) ++|||+++||+|||+|.++||||||+|+++++|.|+.++++.+++.|++.||++|||+|++|++ +|+++||+||+++++ T Consensus 468 ~~p~s~~vAGl~Ar~D~~~g~~~sPan~~~~~i~g~~~~~~~~~~~~~~~Ln~~gin~i~~~~~~~G~~~wG~rT~s~~~ 547 (663) T protein:vir:10 468 WVPLAADIAGLCAYTDQVSHPWMSPAGYRRGQIRNCIKLAIEPKQSMRDTMYQVAINPVTGFAGGDGFVLFGDKMATQVP 547 (663) T ss_pred EechhHHHHHHHHHhhccCCceEccCCceeccccccccceeccChhHHHHHhhCCceEEEEEeCCCcEEEEcccccCCCC Confidence 9999999999999999999999999999999999999999999999999999999999999997 799999999998888 Q ss_pred hhhceehhhHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEccCCCCHHHhhCCeEEE Q lcl|NC_013693. 
513 SAFDRINVRGLFIMAEQNIAAIAKYYLGENNDEFTRSLFSNAVRPYIRQLANMGAIYDGQVKCDADNNTADIIAANQMVA 592 (631) Q Consensus 513 ~~~~~i~vrR~~~~i~~~~~~~~~~~v~epn~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~~~~~nt~~~i~~G~~~~ 592 (631) ++|+||||||||+||+++|+++++|+||||||+.+|++|+++|+.||++||++|+|.||+|+||+++||+++|++|+|++ T Consensus 548 s~~~~i~vrR~~~~i~~si~~~~~~~v~e~n~~~l~~~i~~~i~~~L~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~ 627 (663) T protein:vir:10 548 SPFDRINVRRLFNMLKKNIGDTSKYELFENNDAFTRQSFRMETSQYLDGIRSLGGCYDFRVVCDTTNNTPNVIDRNEFVG 627 (663) T ss_pred cccceEehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhcCceeeeEEEEcCCCCCHHHhhCCeEEE Confidence 89999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEEecCCceEEEEEEEEEecCceeeeeeccCeeeec Q lcl|NC_013693. 593 GIWLKPEYSINWVYLDFAAVRPDMEFSEIETGGGIVAA 630 (631) Q Consensus 593 ~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~~~g~~~~~ 630 (631) +|+|+|++|+|||+|||+|++++++|+|+ .|+|-.| T Consensus 628 ~i~~~p~~pae~i~~~~~~~~~~~~~~e~--~~~~~~~ 663 (663) T protein:vir:10 628 TIYVKPPRSINYITLNMVATSTGANFDEL--IGPMQLA 663 (663) T ss_pred EEEEEecCCcceEEEEEEEeecCccHHHH--HHHHhcC Confidence 99999999999999999999999999999 7888888 No 10 >protein:vir:6594 Length: 666 # NCBI annotation: tail sheath protein # Family: family:all:661 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891725;genbank:gi:33620670;genbank:GeneID:1725344 Probab=100.00 E-value=2.3e-138 Score=775.21 Aligned_cols=601 Identities=30% Similarity=0.436 Sum_probs=399.5 Q ss_pred chhcCCceEEEEecCCCceecccCCceEEEeeeccCcCCCCeEEecCHHHHHHHcCCCCccchhHHHHHHHHHhCCceEE Q lcl|NC_013693. 
5 SFSVAPSVQWTERDATLQTSPSVVVQGATVGKFQWGEAELPVLVTGGETGLVKKFFKPNDATATDFLVIADFLSYSSVAW 84 (631) Q Consensus 5 ~~ylsPGVyveEv~~~~~~~gv~tsv~afvG~~~~Gp~~~p~~i~s~~~e~~~~fG~~~~~~~~~~av~~fF~ngG~~~~ 84 (631) .+||||||||||++.+..+.||+||++||||+|+|||+|+|++|+|| .||+++||++++.+|++|++++||+|||++|| T Consensus 1 ~~~~~PgVyv~e~~~~~~i~~v~ts~~~fvG~~~~Gp~~~p~~v~s~-~~~~~~fG~~~~~~~~~~~~~~~f~ngg~~~~ 79 (666) T protein:vir:65 1 MTLLSPGFETKETTLSTTIVQSETGRAALVGKFQWGPAFQIIQVTNE-VELVNKFGQPDNNTADYFMSGANFLQYGNDLR 79 (666) T ss_pred CceecCceEEEEecCcccccccCcccceEEecccCCCCccCEEecCH-HHHHHHcCCccccchhHHHHHHHHHhcCceEE Confidence 46999999999997545444999999999999999999999999886 79999999999999999999999999999999 Q ss_pred EEEecccCCCcccccccchhhhccccccccccccceeeehhhh---hhhhhhchhhhhccCc-----ccceee------- Q lcl|NC_013693. 85 VTRVVGPAARNAVTKGQTAILIRNKLDFETASPSASITWTGRY---AGSLGNDVAINVCDAA-----GFPTWE------- 149 (631) Q Consensus 85 vvRv~~~~a~~a~~~~~~~~~~~~~~~~~~~~~~~~l~~~a~~---~G~~gn~l~v~v~~~~-----~~~~~~------- 149 (631) |||+++.+...+...... .+...+++ .+.||+.+.+...... ...... T Consensus 80 vvrv~~~~~~~~~~~~~~-----------------~~~~~~~~~g~~~~~g~~~~V~~~~~~~~~~~~~~~~~~~~~~~g 142 (666) T protein:vir:65 80 VVRVLNKEKAKNATALAG-----------------NVEFEITNEGSNYEVGDTIKIKHNRQDIETAGKVTKVDGDGKVKG 142 (666) T ss_pred EEEccCcccccccccccC-----------------ceeeeEeeccccccccceEEEEecccccccccccccccccccccc Confidence 999987654433221100 11112222 2234544443221100 000000 Q ss_pred ----ccceeeeecccccceEEeeeeeeeeecccccccceeeeeecccccccceeEeeccccccccccccccccccccccc Q lcl|NC_013693. 150 ----FRNNFAYAPQAGEYHIVIVDKVGRITDSSGAVGQVDRISVSGTATGAGSISVAGEDVAYTDTDTPATLATKIGTAL 225 (631) Q Consensus 150 ----~~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 225 (631) ....+......+........+............... .......+......................... .... 
T Consensus 143 ~~~~t~~~~~~~~~~g~~~~l~~~~~~~~~~~~~~~~~a~--sv~~~~~~g~~~~~~~~~a~~~~~~~~~~~~~~-~~~~ 219 (666) T protein:vir:65 143 VFIPTGKIIAHAKAIGVYPELDGGWTAEFTSSSGNGSAAL--SVTKIVTDSGLLLTDLETSRANITNQTFLTKLK-KYDM 219 (666) T ss_pred cccccceeeccccccCcceeEeeccceeecccCcccccce--eeeecccccceeeeeeccccccccccccccccc-cccc Confidence 000011111122222222222222221111111100 010110000000000000000000000000000 0000 Q ss_pred ccccccccccccccccc-------------ccccccccccccc-ceeecccccccceeeeecccccccceeeeeeeeecc Q lcl|NC_013693. 226 TALTDVYSSVVVKSNTV-------------TVTHKAIGPQTVT-AIVPDANGLTATAVTTTVGASGSIIEKYELMQATQG 291 (631) Q Consensus 226 ~~~~~~~~~~~~~~~~~-------------~v~~~~~~~~~~~-~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~ 291 (631) ....+...+.......+ .+.....+..... .............+.+.+...+..+|.+. .....+ T Consensus 220 ~a~~A~~~g~~g~~i~v~i~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~e~~~-~~~~~~ 298 (666) T protein:vir:65 220 PAVSAIYAGEIGNSLEVEILARSAFKNTAPDLTMYPYGGERTAARNLIPYAPQNDNQYAFIVRRDGVVVESYV-LSTLKG 298 (666) T ss_pred ceeeeeeccccccceeEEeecccccccccccccccccccccccceeeecccccccccceeeeecCCcccceee-cccCcc Confidence 00000000000000000 0000001111110 01111112223445566667777777775 344445 Q ss_pred cccccchhhhhhhhhcccc-ceeeeeccccc---ccccccccc------------cccchhhhhhHHHHHhhhhhcccce Q lcl|NC_013693. 292 SKKSDGSNAYFKDVINDTS-NWVYTFATTLA---AGVTELEGG------------VDDYTGNRVAAIEALNNAEAYDAKP 355 (631) Q Consensus 292 ~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~---~~~~~l~gg------------~d~~~~~~~~~~~~l~~~~~~~~~~ 355 (631) .+...+...++.+.+.++. ..++....... .....+.+| .++..++..+.+..+...+.+.... 
T Consensus 299 ~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~ 378 (666) T protein:vir:65 299 DKDVYGNSIYMDDFFARGSSQYIYATAQGWVDGFSGIISLAGGVSANEATTGGVGADPFIGAMMQGWDLFAERESIHVNL 378 (666) T ss_pred cccccchhhhhhhhhcccccceeeeecccccccccceEEccCCCCcCcccccccccccccccHHHHHHHHhhhhhccCCc Confidence 5555555555555543332 23322221111 011122222 2333455556666666655544433 Q ss_pred eEEecc-------ccchHHHHHHHHHhhccceEEeeccccc-ccccccCCHHHHHHHHHhc--------CCCcceEEEec Q lcl|NC_013693. 356 VFAFCE-------ELIEQQTLIDLSTERKDTVSFVSPLRDV-VVGNRGREMEDVVAWRESL--------VRDSSYFFMDD 419 (631) Q Consensus 356 ~i~~~~-------~~~~~~~~~~~~~~~~~~~a~~d~~~~~-~~~~~~~~~~~~~~~~~~~--------~~~s~~~~~~~ 419 (631) ++.+. ...++.++++||+++++||+++|+|+.. ++++.+++++++++||+.+ +++|+|+++|| T Consensus 379 -l~~p~~~~~~~~~~~v~~~l~~~~~~~~~~~a~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~ 457 (666) T protein:vir:65 379 -LIAGACAGEGDAFSTVQKHAVSIGDERQDCLVMVSPPRSTVVNIPVTTAIDNLIAWREGSGNYNENNMNINTTYAVIDG 457 (666) T ss_pred -eeecCcCCccchhHHHHHHHHHHHhhccceEEEeccccceeeecCCCCCHHHHHHHHHhcccccccccccCcceEEEEc Confidence 33222 2357889999999999999999998754 6777889999999999864 46899999999 Q ss_pred CeeEEEeccCCceeEeehHHHHHHHHHHhhccCCceecccceeeceeeecccceecCChhHhhhhhhcCceEEEEEcCCc Q lcl|NC_013693. 
420 NWAYVYDKYNDKMRWIPACGGTAGVWARSIEIAGIYKSPAFHNRGKYNNYNRMAWSASSDERAVLYRNQINSIVTFSNEG 499 (631) Q Consensus 420 p~~~~~d~~~~~~~~~p~s~~~ag~~a~~d~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~L~~~gin~i~~~~~~G 499 (631) ||++++|+.+++.+++|||+++||+|||+|.++||||||+|+++++|.|+.++++.+++.|++.||++|||||++|+++| T Consensus 458 p~~~~~d~~~~~~~~~p~sg~vAGl~Ar~D~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G 537 (666) T protein:vir:65 458 NYKYQYDKYNDVNRWVPLAADIAGLCARTDAVSQPWMSPAGYNRGQIMNVVKLAIEPRKAHRDRLYQAAINPVIGAGGEG 537 (666) T ss_pred CceEEecccCCceeEechHHHHHHHHHHHhccCCcEEccCCeecceeeccccceeecChhHHHhhhhCCceEEEEeCCCe Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEcceecCCCChhhceehhhHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEccCC Q lcl|NC_013693. 500 IVLYGDKTGLTRPSAFDRINVRGLFIMAEQNIAAIAKYYLGENNDEFTRSLFSNAVRPYIRQLANMGAIYDGQVKCDADN 579 (631) Q Consensus 500 ~~~wg~rT~~~~~~~~~~i~vrR~~~~i~~~~~~~~~~~v~epn~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~~~~~ 579 (631) +++||+||+++++++|+||||||||+||+++|+++++|+||||||+.+|++|+++|+.||++||++|+|.||+|+||+++ T Consensus 538 ~~~wG~rT~~~~~s~~~~i~vrR~~~~i~~si~~~~~~~v~epn~~~l~~~i~~~i~~~L~~l~~~gal~g~~V~~d~~~ 617 (666) T protein:vir:65 538 FILMGDKTATTVPSPFDRINVRRLFNMLKKNIGDSSKYKLFENNDNFTRASFRMEVSQYLSTIRSLGGIYDFRVQCDTTN 617 (666) T ss_pred EEEEecccCCCCCcccceEehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEcCCC Confidence 99999999988888999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCHHHhhCCeEEEEEEEEecCCceEEEEEEEEEecCceeeeeeccCeeeec Q lcl|NC_013693. 
580 NTADIIAANQMVAGIWLKPEYSINWVYLDFAAVRPDMEFSEIETGGGIVAA 630 (631) Q Consensus 580 nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~~~g~~~~~ 630 (631) ||+++|++|+|+++|+++|++|||||+|||+|++++++|+|+ .|.+-.| T Consensus 618 nt~~~i~~G~~~~~i~~~p~~pae~i~~~~~~~~~~~~~~e~--~~~~~~~ 666 (666) T protein:vir:65 618 NTPDVIDRNEFVASMFIKPAKSINYIMLNFTAVATGSDFDEI--IGPANQA 666 (666) T ss_pred CCHHHhhCCeEEEEEEEEecCCcceEEEEEEEeecCccHHHH--HHHHhcC Confidence 999999999999999999999999999999999999999998 5666666 No 11 >protein:vir:106427 Length: 679 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944106;genbank:gi:38640150;genbank:GeneID:2658188 Probab=100.00 E-value=2.3e-138 Score=775.20 Aligned_cols=616 Identities=28% Similarity=0.395 Sum_probs=382.9 Q ss_pred chhcCCceEEEEecCCCcee-cccCCceEEEeeeccCcCCCCeEEecCHHHHHHHcCCCCccchhHHHHHHHHHhCCceE Q lcl|NC_013693. 5 SFSVAPSVQWTERDATLQTS-PSVVVQGATVGKFQWGEAELPVLVTGGETGLVKKFFKPNDATATDFLVIADFLSYSSVA 83 (631) Q Consensus 5 ~~ylsPGVyveEv~~~~~~~-gv~tsv~afvG~~~~Gp~~~p~~i~s~~~e~~~~fG~~~~~~~~~~av~~fF~ngG~~~ 83 (631) ..||||||||||++ ++++| ||+||++||||+|+|||+|+|++|+|| .||++.||++++.+|++|+|++||+|||++| T Consensus 1 ~~~~~Pgvyv~e~~-~~~~i~~~~t~~~~~vg~~~~gp~~~p~~i~s~-~~~~~~fg~~~~~~~~~~~~~~~f~~gg~~~ 78 (679) T protein:vir:10 1 MTLLSPGVETKEIN-LQTTIARSSTGRAALVGKFNWGPAYQISQVVSE-VDLVDKFGRPDDQTADSFFSGVNFLNYGNDL 78 (679) T ss_pred CceecCceEEEeec-CCcccccCccccceeeecccCCCCccCEEecCH-HHHHHHcCCcccccchHHHHHHHHHhCCCeE Confidence 46999999999997 45666 999999999999999999999999886 7999999999999999999999999999999 Q ss_pred EEEEecccCCCcccccccchhhhccccccccccccceeeehhhhhhhhhhchhhhhccCccc-ceee--ccceeeeeccc Q lcl|NC_013693. 84 WVTRVVGPAARNAVTKGQTAILIRNKLDFETASPSASITWTGRYAGSLGNDVAINVCDAAGF-PTWE--FRNNFAYAPQA 160 (631) Q Consensus 84 ~vvRv~~~~a~~a~~~~~~~~~~~~~~~~~~~~~~~~l~~~a~~~G~~gn~l~v~v~~~~~~-~~~~--~~~~~~~~~~~ 160 (631) |||||.+.+...+..+............ .........++++..+...+...+...+.... .... ........... 
T Consensus 79 ~vvrv~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~v~~~~~~s~~~~~~~~~~~~~~~~~~~~v~~~~~~~~a~~~ 156 (679) T protein:vir:10 79 RLVRVLNETKSRNSSALYQSLSYTITSP--GVDYKVGDVVNVLQGGNVIATGKVTVVNASGGIVAFYVPTAAIIDKAKSL 156 (679) T ss_pred EEEEccCccccccccccccccccccccc--ccccccccceeeeeCCCcccceeEEEeeccCceeeeeecccccccccccc Confidence 9999988765433321111111000000 00000011111111111111111111111110 0000 00000000111 Q ss_pred ccceEEeeeeeeeeecccccccceeeeeecccccccceeEeeccccc-ccccc----------------cccccccccc- Q lcl|NC_013693. 161 GEYHIVIVDKVGRITDSSGAVGQVDRISVSGTATGAGSISVAGEDVA-YTDTD----------------TPATLATKIG- 222 (631) Q Consensus 161 g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~----------------~~~~~~~~~~- 222 (631) +.................................+... ........ ..... .........+ T Consensus 157 ~~~~~l~~a~~~~~~~~t~~~g~~~~~~v~~v~~~~~~-~~~~~~~a~~~i~~~~~~~~t~~~~~~~~~~~~~~A~~~g~ 235 (679) T protein:vir:10 157 NDYPALDNAWQIQFAAGGPGAGQAATATVVGINLDSTI-FVPNDEYAMSAISERSETKRTFIDICEEMKVPAIVARYAGT 235 (679) T ss_pred cccceecccceeeeeeccccccceeeeeeeeeccCCce-eeccccccccccccccccchhhhhhhhccccceeeeecccc Confidence 11111111111111111111111111110000000000 00000000 00000 0000000000 Q ss_pred ------ccccccccccccccccc----c--ccccccccccccccccee-ecccccccceeeeecccccccceeeeeeeee Q lcl|NC_013693. 223 ------TALTALTDVYSSVVVKS----N--TVTVTHKAIGPQTVTAIV-PDANGLTATAVTTTVGASGSIIEKYELMQAT 289 (631) Q Consensus 223 ------~~~~~~~~~~~~~~~~~----~--~~~v~~~~~~~~~~~~~~-~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~ 289 (631) ................. . .................. ..........+...+..++...+.+... .. 
T Consensus 236 ~gn~i~v~~va~~~~~~~~~~~a~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vvv~~~g~~~~~~~~~-~~ 314 (679) T protein:vir:10 236 YGDNIKVLMIAYKDYYKFNEAGKIVSVNTINPKVFPTGLDYGNVTPSSYLEFGPQNESQFAFIVFNNGVAVESKILS-TK 314 (679) T ss_pred cCCcceEEEEeecccccccccccccccccccccccccccccccceeeeecccccccccceeeEEecccccccceeee-cc Confidence 00000000000000000 0 000000000000000000 0000111122333334444444443322 22 Q ss_pred cccccccchhhhhhhhhcccc-ceeeeeccc---ccccccccccccccch----hhhhhHHHHHhhhhhcccceeEEecc Q lcl|NC_013693. 290 QGSKKSDGSNAYFKDVINDTS-NWVYTFATT---LAAGVTELEGGVDDYT----GNRVAAIEALNNAEAYDAKPVFAFCE 361 (631) Q Consensus 290 ~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~---~~~~~~~l~gg~d~~~----~~~~~~~~~l~~~~~~~~~~~i~~~~ 361 (631) .......+...++...+.++. .++...... .......+.||.++.+ ++..+.+..+...+.... .+++.|+ T Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~l~~p~ 393 (679) T protein:vir:10 315 PGDRDIYGTSIYINEYFGNGYSSFVQGVAESWPVGYTGVLAFGGGQSSNTDISAAEFMKGWDMFADREHTDV-NLFIAGA 393 (679) T ss_pred cccccccchhhhhhhhhcCcccceeeeccccccccccceeeccCCccCCCccchhhhhhhhhhhhccccccc-ceEEecC Confidence 333333344444444444332 222221111 1123445667776543 333334444443333333 3333333 Q ss_pred c--------cchHHHHHHHHHhhccceEEeecccccc-cccccCCHHHHHHHHHhc-----------CCCcceEEEecCe Q lcl|NC_013693. 362 E--------LIEQQTLIDLSTERKDTVSFVSPLRDVV-VGNRGREMEDVVAWRESL-----------VRDSSYFFMDDNW 421 (631) Q Consensus 362 ~--------~~~~~~~~~~~~~~~~~~a~~d~~~~~~-~~~~~~~~~~~~~~~~~~-----------~~~s~~~~~~~p~ 421 (631) . ..++.++++||+++++||+++|+|+... 
.....++.+++++||+.+ +++|+|+++|||| T Consensus 394 ~~~~~~~~~~~v~~~l~~~~~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~s~~~~~~~p~ 473 (679) T protein:vir:10 394 VAGEGAQIASTVQKAVVAIADERRDCLVLISPPREYMINQPAASVVRKLVDWRRGVNQAGISLDDNMNIGTTYASVDGNY 473 (679) T ss_pred CCCCchhhhHHHHHHHHHHHHhhCCeEEEEeccccccccccccchHHHHHHHHhhcccccchhhhhhccCcceEEEEccc Confidence 2 2478899999999999999999998764 445667789999999743 4689999999999 Q ss_pred eEEEeccCCceeEeehHHHHHHHHHHhhccCCceecccceeeceeeecccceecCChhHhhhhhhcCceEEEEEcCCcEE Q lcl|NC_013693. 422 AYVYDKYNDKMRWIPACGGTAGVWARSIEIAGIYKSPAFHNRGKYNNYNRMAWSASSDERAVLYRNQINSIVTFSNEGIV 501 (631) Q Consensus 422 ~~~~d~~~~~~~~~p~s~~~ag~~a~~d~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~L~~~gin~i~~~~~~G~~ 501 (631) ++++|+.+++++++|||+++||+|||+|.++||||||+|+++++|.|+.++++.+++.|++.||++|||+|++|+++|++ T Consensus 474 ~~~~d~~~~~~~~~p~sg~vAGl~Ar~D~~~g~~~sPan~~~~~i~g~~~~~~~~~~~~~~~Ln~~gin~i~~~~g~G~~ 553 (679) T protein:vir:10 474 KYQYDKYNDVNRWIPLAADIAGLCARTDTVGQPWQSPAGFNRGQIVNVIKLAVDTRQAHRDEMYTNGINPIVGFAGQGYI 553 (679) T ss_pred eeeecccCCceEEechHHHHHHHHHHhhccCCcEECcCCeeeccccccccceeecChhhHHhhhhCCceEEEEecCCeEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEcceecCCCChhhceehhhHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEccCCCC Q lcl|NC_013693. 
502 LYGDKTGLTRPSAFDRINVRGLFIMAEQNIAAIAKYYLGENNDEFTRSLFSNAVRPYIRQLANMGAIYDGQVKCDADNNT 581 (631) Q Consensus 502 ~wg~rT~~~~~~~~~~i~vrR~~~~i~~~~~~~~~~~v~epn~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~~~~~nt 581 (631) +||+||+++++++|+|||||||++||+++|+++++|+||||||+.+|.+|+++|++||++||++|+|.||+|+||+++|| T Consensus 554 ~wG~rT~~~~~s~~~~i~vrR~~~~i~~si~~~~~~~v~epn~~~~~~~i~~~i~~fL~~l~~~gal~gf~v~~d~~~nt 633 (679) T protein:vir:10 554 LYGDKTASQAPTPFDRINVRRLFNLLKKSISESAKYKLFELNDAFTRSSFRSEVGSYLDTIRSLGGIYDFRVVCDESNNT 633 (679) T ss_pred EEcccccCCCCcccceEehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEcCCCCC Confidence 99999998888899999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHhhCCeEEEEEEEEecCCceEEEEEEEEEecCceeeeeeccCeeee Q lcl|NC_013693. 582 ADIIAANQMVAGIWLKPEYSINWVYLDFAAVRPDMEFSEIETGGGIVA 629 (631) Q Consensus 582 ~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~~~g~~~~ 629 (631) +++|++|+|+++|+++|++|+|||+|||+|++++++|+|+ .|.+-- T Consensus 634 ~~~i~~G~~~~~i~~~p~~pae~i~~~~~~~~~~~~~~e~--~~~~~~ 679 (679) T protein:vir:10 634 PAVIDRNEFVATILIKPARSINYITLSFVATSTGADFDEL--VGSFQQ 679 (679) T ss_pred HHHhhCCeEEEEEEEEecCCccEEEEEEEEeecCccHHHH--HHHhcC Confidence 9999999999999999999999999999999999999998 333332 No 12 >protein:vir:101804 Length: 663 # NCBI annotation: gp18 # Family: family:all:661 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238881;genbank:gi:66391956;genbank:GeneID:3416631 Probab=100.00 E-value=5e-138 Score=773.33 Aligned_cols=604 Identities=26% Similarity=0.427 Sum_probs=402.2 Q ss_pred chhcCCceEEEEecCCCcee-cccCCceEEEeeeccCcCCCCeEEecCHHHHHHHcCCCCccchhHHHHHHHHHhCCceE Q lcl|NC_013693. 
5 SFSVAPSVQWTERDATLQTS-PSVVVQGATVGKFQWGEAELPVLVTGGETGLVKKFFKPNDATATDFLVIADFLSYSSVA 83 (631) Q Consensus 5 ~~ylsPGVyveEv~~~~~~~-gv~tsv~afvG~~~~Gp~~~p~~i~s~~~e~~~~fG~~~~~~~~~~av~~fF~ngG~~~ 83 (631) ..||||||||||++ ++++| ||+||++||||+|+|||+|+|++|+|| .||++.||++.+.+|++|+|++||+|||++| T Consensus 1 ~~~~~Pgvyv~e~~-~~~~i~~~~t~~~~~vG~~~~Gp~~~p~~v~~~-~~~~~~fg~~~~~~~~~~~~~~~f~ngg~~~ 78 (663) T protein:vir:10 1 MALLSPGIEMKETS-INSTVVRSATGRAAIVGKFAWGPAYEVRQVTNE-VELVDMFGSPDNVTAPYFMSAMNFLQYGNDL 78 (663) T ss_pred CceecCceEEEEec-CCccccccCcccceeEeecccCCCCccEEecCH-HHHHHhcCCcCCcchhHHHHHHHHHhCCCeE Confidence 46999999999997 56666 999999999999999999999999886 7999999999999999999999999999999 Q ss_pred EEEEecccCCCcccccccchhhhccccccccccccceeeehhhhhhhhhhchhhhhccCcccceeec-----cc------ Q lcl|NC_013693. 84 WVTRVVGPAARNAVTKGQTAILIRNKLDFETASPSASITWTGRYAGSLGNDVAINVCDAAGFPTWEF-----RN------ 152 (631) Q Consensus 84 ~vvRv~~~~a~~a~~~~~~~~~~~~~~~~~~~~~~~~l~~~a~~~G~~gn~l~v~v~~~~~~~~~~~-----~~------ 152 (631) |||||++.+...++...... ...........+.||+.+.+............. .. T Consensus 79 ~vvRv~~~~~~~~a~~~~~~--------------~~~~~~~~~~~~~~g~~i~v~~~~~~~~~~~~~~~~~~~~~~~~v~ 144 (663) T protein:vir:10 79 RLVRVIDMEKAKNASPLVNQ--------------VSVTITTEGQGYTVGDAITVKYNNATITEAGKVTAVDSDGKIKSLF 144 (663) T ss_pred EEEEccCCcccccccccccc--------------ceeEEeecccccccccccccccccccccccccceeeecccceEEEe Confidence 99999876543322111000 000111122233566655554332211000000 00 Q ss_pred -----eeeeecccccceEEeeeeeeeeecccccccceeeeeecccccccceeEeeccccccccccccccccccccccccc Q lcl|NC_013693. 153 -----NFAYAPQAGEYHIVIVDKVGRITDSSGAVGQVDRISVSGTATGAGSISVAGEDVAYTDTDTPATLATKIGTALTA 227 (631) Q Consensus 153 -----~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 227 (631) ........+...................... .........+... .+.......................... 
T Consensus 145 ~~ta~~~~~~~~v~~~~~~~~~~~~~~s~~s~~~~~--a~~v~~v~~d~~~-~v~~~~~a~~~~t~~~~~~~~~~~~~~~ 221 (663) T protein:vir:10 145 VPTAEIIAKTRQLGTYPTLGDNWRIDVSGASGGSAA--ALALGNIVVDSGV-TFGNSEDAPAVMTSPAVMEKYAKFGMPL 221 (663) T ss_pred eccccccccccccccceeeccceeeEeeeccCcccc--ccccceeccccce-EEeeccccccccccccccccccccccce Confidence 0000000000111000000000000000000 0000000000000 0000000000000000000000000000 Q ss_pred cccccccccccccccccccccc-------------cccc-ccceeecccccccceeeeecccccccceeeeeeeeecccc Q lcl|NC_013693. 228 LTDVYSSVVVKSNTVTVTHKAI-------------GPQT-VTAIVPDANGLTATAVTTTVGASGSIIEKYELMQATQGSK 293 (631) Q Consensus 228 ~~~~~~~~~~~~~~~~v~~~~~-------------~~~~-~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~ 293 (631) ..+...+.......+.+..... +... ...............+...+..++...+.+ .+....+.. T Consensus 222 i~A~~~G~~Gn~i~V~i~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~-~~s~~~~~~ 300 (663) T protein:vir:10 222 ISAVYPGEIGSTVEVEIVSKTAFNSGAQQTIYPFGGTRTSNARGVIQYGPMTDDQFAIIVRRDGIVVEST-VLSTRKGDR 300 (663) T ss_pred EEeccCCcccceeeeeeccccccccccccceecccccccccccceeecccccccceeeEeecCCcceeee-ccccccccc Confidence 1111111111111111110000 0000 000011111112222333333333333332 334444445 Q ss_pred cccchhhhhhhhhccc-cceeeeeccccc---ccccccccccccc----hhhhhhHHHHHhhhhhcccceeEEecccc-- Q lcl|NC_013693. 294 KSDGSNAYFKDVINDT-SNWVYTFATTLA---AGVTELEGGVDDY----TGNRVAAIEALNNAEAYDAKPVFAFCEEL-- 363 (631) Q Consensus 294 ~~~~~~~~~~~~~~~~-~~~~~~~~~~~~---~~~~~l~gg~d~~----~~~~~~~~~~l~~~~~~~~~~~i~~~~~~-- 363 (631) ...+...++...+.++ +.+........+ .....+.+|.|+. ..+..+++..+.+.+.++..++++.++.. 
T Consensus 301 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~gg~d~~~~~~~~~~~~~~~~l~~~~~~~~~~~i~~~~~~~~ 380 (663) T protein:vir:10 301 DVYGSNIFMDDYFRNGGSNFIFASSEGWPAGFTGIIQLGGGTSANADVGADELIKGWDLFSDREALHVNLMIAGACGSDG 380 (663) T ss_pred ccccchhhhhhhhcCCcceEEEEeecccCccccceeEeccccCCccccchhhhHHHHHhhhcccccceeEEEeccCCCCc Confidence 5555555555555443 333333222221 2335677887754 34567778888888877777777665543 Q ss_pred -----chHHHHHHHHHhhccceEEeeccccccc-ccccCCHHHHHHHHHhc-----------CCCcceEEEecCeeEEEe Q lcl|NC_013693. 364 -----IEQQTLIDLSTERKDTVSFVSPLRDVVV-GNRGREMEDVVAWRESL-----------VRDSSYFFMDDNWAYVYD 426 (631) Q Consensus 364 -----~~~~~~~~~~~~~~~~~a~~d~~~~~~~-~~~~~~~~~~~~~~~~~-----------~~~s~~~~~~~p~~~~~d 426 (631) .++.++++||+++++||+++|+|..... .....+.+++.+||+.+ +++|+|+++||||++++| T Consensus 381 ~~~~~~v~~~l~~~a~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d 460 (663) T protein:vir:10 381 AEIASTVQKYVVSLADDRQDCVAIVNPPAELMVGIPTSTAVKNIVEWRNGMTGSGEVVDNNMNISSTYAFIIGNYKYQYD 460 (663) T ss_pred hhhHHHHHHHHHHHHHhhCCEEEEEecCcccccccccccchHHHHHHHHhccccccchhhhcccCccceEEEcCceEEec Confidence 3678899999999999999999976543 34567788899998753 578999999999999999 Q ss_pred ccCCceeEeehHHHHHHHHHHhhccCCceecccceeeceeeecccceecCChhHhhhhhhcCceEEEEEcC-CcEEEEcc Q lcl|NC_013693. 
427 KYNDKMRWIPACGGTAGVWARSIEIAGIYKSPAFHNRGKYNNYNRMAWSASSDERAVLYRNQINSIVTFSN-EGIVLYGD 505 (631) Q Consensus 427 ~~~~~~~~~p~s~~~ag~~a~~d~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~L~~~gin~i~~~~~-~G~~~wg~ 505 (631) +.+++.+++|||+++||+|||+|.++||||||+|+++++|.|+.++++.+++.|++.||++|||+|++|++ +|+++||+ T Consensus 461 ~~~~~~~~~p~sg~vAGl~Ar~D~~~g~~~sPan~~~~~i~g~~~~~~~~~~~e~~~Ln~~gin~i~~~~~~~G~~~wG~ 540 (663) T protein:vir:10 461 KYNDINRWVPLAADIAGLCAYTDQVSHPWMSPAGYRRGQIRNCIKLAIEPKQSMRDTMYQVAINPVTGFAGGDGFVLFGD 540 (663) T ss_pred ccCCceEEechhHHHHHHHHHhhccCCceEccCCceeccccccccceeecChhHHHHHhhCCceEEEEEeCCCcEEEEcc Confidence 99999999999999999999999999999999999999999999999999999999999999999999997 79999999 Q ss_pred eecCCCChhhceehhhHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEccCCCCHHHh Q lcl|NC_013693. 506 KTGLTRPSAFDRINVRGLFIMAEQNIAAIAKYYLGENNDEFTRSLFSNAVRPYIRQLANMGAIYDGQVKCDADNNTADII 585 (631) Q Consensus 506 rT~~~~~~~~~~i~vrR~~~~i~~~~~~~~~~~v~epn~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~~~~~nt~~~i 585 (631) ||+++++++|+||||||||+||+++|+++++|+||||||+.+|.+|+++|+.||++||++|+|.||+|+||+++||+++| T Consensus 541 rT~~~~~s~~~~i~vrR~~~~i~~si~~~~~~~v~epn~~~l~~~i~~~i~~~L~~l~~~gal~g~~v~~d~~~nt~~~i 620 (663) T protein:vir:10 541 KMATQVPSPFDRINVRRLFNMLKKNIGDTSKYELFENNDAFTRQSFRMETSQYLDGIRSLGGCYDFRVVCDTTNNTPNVI 620 (663) T ss_pred cccCCCCcccceEehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEcCCCCCHHHh Confidence 99988888999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hCCeEEEEEEEEecCCceEEEEEEEEEecCceeeeeeccCeeeec Q lcl|NC_013693. 
586 AANQMVAGIWLKPEYSINWVYLDFAAVRPDMEFSEIETGGGIVAA 630 (631) Q Consensus 586 ~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~~~g~~~~~ 630 (631) ++|+|+++|+|+|++|+|||+|||+|+++|++|+|+ .|+|-.| T Consensus 621 ~~G~~~~~i~~~p~~pae~i~~~~~~~~~~~~~~e~--~~~~~~~ 663 (663) T protein:vir:10 621 DRNEFVGTIYVKPPRSINYITLNMVATSTGANFDEL--IGPMQLA 663 (663) T ss_pred hCCeEEEEEEEEecCCcceEEEEEEEeecCccHHHH--HHHHhcC Confidence 999999999999999999999999999999999999 7888888 No 13 >protein:vir:108052 Length: 660 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595294;genbank:gi:161622600;genbank:GeneID:5783771 Probab=100.00 E-value=2e-137 Score=769.97 Aligned_cols=598 Identities=29% Similarity=0.457 Sum_probs=394.4 Q ss_pred chhcCCceEEEEecCCCcee-cccCCceEEEeeeccCcCCCCeEEecCHHHHHHHcCCCCccchhHHHHHHHHHhCCceE Q lcl|NC_013693. 5 SFSVAPSVQWTERDATLQTS-PSVVVQGATVGKFQWGEAELPVLVTGGETGLVKKFFKPNDATATDFLVIADFLSYSSVA 83 (631) Q Consensus 5 ~~ylsPGVyveEv~~~~~~~-gv~tsv~afvG~~~~Gp~~~p~~i~s~~~e~~~~fG~~~~~~~~~~av~~fF~ngG~~~ 83 (631) ..||||||||||++. +++| ||+||++||||+|+|||+++|++|+|| .||++.||++++.+|++|++++||+|||++| T Consensus 1 ~~~~~Pgvyv~e~~~-~~~i~~~~t~~~~~vg~~~~gp~~~p~~v~s~-~~~~~~fg~~~~~~~~~~~~~~~f~~~g~~~ 78 (660) T protein:vir:10 1 MALLSPGIELKETSV-QSTVVRNATGRAALVGKFQWGPAFQVTQITNE-VELVDLFGGPNNEVADYFMSGMNFLQYGNDL 78 (660) T ss_pred CceecCceEEEeecC-CccccCCCcccceEEeecCCCCCccCeEcCCH-HHHHHHcCCcCCCchhHHHHHHHHHhCCceE Confidence 569999999999974 5666 999999999999999999999999886 7999999999999999999999999999999 Q ss_pred EEEEecccCCCcccccccchhhhccccccccccccceeeehhhhhh---hhhhchhhhhccCcccceeec--cce----e Q lcl|NC_013693. 84 WVTRVVGPAARNAVTKGQTAILIRNKLDFETASPSASITWTGRYAG---SLGNDVAINVCDAAGFPTWEF--RNN----F 154 (631) Q Consensus 84 ~vvRv~~~~a~~a~~~~~~~~~~~~~~~~~~~~~~~~l~~~a~~~G---~~gn~l~v~v~~~~~~~~~~~--~~~----~ 154 (631) |||||.+.+...+... ....+.+++..+| .||+.+++............. ... . 
T Consensus 79 ~vvrv~~~~~~~~~~~-----------------~~~~~~~~~~~~~~~~~~g~~~~v~~~~~~~~~~~~v~~~~a~g~~~ 141 (660) T protein:vir:10 79 RTVRVVSREFAKNASP-----------------IAGNIETTITTAGSNYAVGDKINIKYNQTVVESEGRVTSVDTDGKIL 141 (660) T ss_pred EEEEeccccccccccc-----------------ccccceeEEeeccccccccceeeEeeccccccccccceeecccccee Confidence 9999987653221111 1112333333333 577766654432211100000 000 0 Q ss_pred ------e----eecccccceEEeeeeeeeeecccccccceeeeeecccccccceeEeecccccccccccccccccccccc Q lcl|NC_013693. 155 ------A----YAPQAGEYHIVIVDKVGRITDSSGAVGQVDRISVSGTATGAGSISVAGEDVAYTDTDTPATLATKIGTA 224 (631) Q Consensus 155 ------~----~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 224 (631) . .....+........+...+........... .......+.... ........................ T Consensus 142 ~~~~~ta~~~~~a~~v~~~~~~~~~~~~~~~~~~~~~~~a~--sv~~~v~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~ 218 (660) T protein:vir:10 142 SVFIPSAKIIAYARSLNQYPTLGPAWTAEVTSASSGVSGTI--TVGKIVTDSGIL-LTEAENSEEAITSLEFQAALKKFA 218 (660) T ss_pred eeccccccccccccccccccccccceeEEEecccCccccce--eeeeeeccCcce-EEeeeccccccccccceeeccccc Confidence 0 000000000000111111111000000000 000000000000 000000000000000000000000 Q ss_pred ccccccccccccccccccccccc---ccccc----------cccc---eeecccccccceeeeecccccccceeeeeeee Q lcl|NC_013693. 225 LTALTDVYSSVVVKSNTVTVTHK---AIGPQ----------TVTA---IVPDANGLTATAVTTTVGASGSIIEKYELMQA 288 (631) Q Consensus 225 ~~~~~~~~~~~~~~~~~~~v~~~---~~~~~----------~~~~---~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~ 288 (631) .........+.......+.+... ..++. .... ............+.+.+..++...+.+... . 
T Consensus 219 ~~~~~a~~~g~~G~~i~v~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~~~~~-~ 297 (660) T protein:vir:10 219 MPGVVALYPGEIGSTLEVEIVSKAAYEAGSSKMLDVYPGGGTRASIAKAVFNYGPQTDDQYAIIVRRDGAIVESVVLS-T 297 (660) T ss_pred cceeeeecccccCcceeEEEeeccccCCcceeEEeeeeccceeeEEeeeecccccccccccccccccCCcccceeeee-c Confidence 00000000000000000000000 00000 0000 000111112233344445555555554322 2 Q ss_pred ecccccccchhhhhhhhhccc-cceeeeeccccc---ccccccccccccc----hhhhhhHHHHHhhhhhcccceeEEec Q lcl|NC_013693. 289 TQGSKKSDGSNAYFKDVINDT-SNWVYTFATTLA---AGVTELEGGVDDY----TGNRVAAIEALNNAEAYDAKPVFAFC 360 (631) Q Consensus 289 ~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~---~~~~~l~gg~d~~----~~~~~~~~~~l~~~~~~~~~~~i~~~ 360 (631) ........+...++...+.++ +..+.......+ .....+.+|.++. .++..+++..+...+.+.... ++.| T Consensus 298 ~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~l~gg~~~~~~~t~~~~~~~~~~~~~~~~~~~~~-l~~p 376 (660) T protein:vir:10 298 KEGEKDVYGNNIYLDDYFAKGTSNYIYATSLNWPKGFSGIINLSGGISANDKVTAGDLMQGWDLFADREALHINL-LIAG 376 (660) T ss_pred cccccccccceeeeehhhcCCCccEEEEEeccCCCCcccceeeeccccCccccccchhhhhhhhhhhhhhcccce-EEEc Confidence 223333334444444444333 334444332222 2345677776653 234445555665555544433 3333 Q ss_pred c--------ccchHHHHHHHHHhhccceEEeeccccc-ccccccCCHHHHHHHHHhc--------CCCcceEEEecCeeE Q lcl|NC_013693. 361 E--------ELIEQQTLIDLSTERKDTVSFVSPLRDV-VVGNRGREMEDVVAWRESL--------VRDSSYFFMDDNWAY 423 (631) Q Consensus 361 ~--------~~~~~~~~~~~~~~~~~~~a~~d~~~~~-~~~~~~~~~~~~~~~~~~~--------~~~s~~~~~~~p~~~ 423 (631) + ..+++.++++||+++++||+++|+|... ......++++++++||+.. 
+++|+|+++||||++ T Consensus 377 ~~~~~~~~~~~~v~~al~~~~~~~~~~~aiid~p~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~s~~~~~~~p~~~ 456 (660) T protein:vir:10 377 AVAGEGDEVASTVQKHVVSIADERQDCLAFISPPKGLLVNVPLTRAVDNLIDWRTGAGTFDANNMNISTTYAAIDGNYKY 456 (660) T ss_pred CcCCCchhhhHHHHHHHHHHHHhhCCEEEEEecCcccccccccccCHHHHHHHHhhcccccccccccCcceEEEEcCceE Confidence 2 1247889999999999999999999764 5556778899999999853 578999999999999 Q ss_pred EEeccCCceeEeehHHHHHHHHHHhhccCCceecccceeeceeeecccceecCChhHhhhhhhcCceEEEEEcC-CcEEE Q lcl|NC_013693. 424 VYDKYNDKMRWIPACGGTAGVWARSIEIAGIYKSPAFHNRGKYNNYNRMAWSASSDERAVLYRNQINSIVTFSN-EGIVL 502 (631) Q Consensus 424 ~~d~~~~~~~~~p~s~~~ag~~a~~d~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~L~~~gin~i~~~~~-~G~~~ 502 (631) ++|+.+++++++|||+++||+|||+|.++||||||||+++++|.|+.++++.+++.|++.||++|||+|++|++ +|+++ T Consensus 457 ~~d~~~~~~~~~p~sg~~AGl~Ar~D~~~g~~~sPan~~~~~i~g~~~~~~~~~~~e~~~Ln~~gIn~i~~~~~~~G~~~ 536 (660) T protein:vir:10 457 QYDKYNDVNRWVPLAADLAGLCARTDDVSQPWMSPAGYNRGQILNVLKLAIEPRQAQRDRMYQEAINPVVGFAGGDGFVL 536 (660) T ss_pred EecccCCceeEechhHHHHHHHHHhhccCCcEEccCCeeeceeeccceeeecCChhhHHhHhhCCceEEEEeeCCCcEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999986 79999 Q ss_pred EcceecCCCChhhceehhhHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEccCCCCH Q lcl|NC_013693. 
503 YGDKTGLTRPSAFDRINVRGLFIMAEQNIAAIAKYYLGENNDEFTRSLFSNAVRPYIRQLANMGAIYDGQVKCDADNNTA 582 (631) Q Consensus 503 wg~rT~~~~~~~~~~i~vrR~~~~i~~~~~~~~~~~v~epn~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~~~~~nt~ 582 (631) ||+||+++++++|+||||||||+||+++|+++++|+||||||+.||++|+++|+.||++||++|+|.||+|+||+++||+ T Consensus 537 wG~rT~~~~~s~~~~i~vrR~~~~i~~si~~~~~~~v~epn~~~l~~~i~~~i~~fL~~l~~~gal~g~~V~~d~~~nt~ 616 (660) T protein:vir:10 537 FGDKTATKVPSPMDHINVRRLFNMLKKNIGDASKYKLFELNDNFTRSSFRMEVSQYLDGIKALGGIYEGRVVCDTTVNTP 616 (660) T ss_pred EcccccCCCCcccceEehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEcCCCCCH Confidence 99999988888999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHhhCCeEEEEEEEEecCCceEEEEEEEEEecCceeeeeeccCeee Q lcl|NC_013693. 583 DIIAANQMVAGIWLKPEYSINWVYLDFAAVRPDMEFSEIETGGGIV 628 (631) Q Consensus 583 ~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~~~g~~~ 628 (631) ++|++|+|+++|+++|++|||||+|||+|+++|++|+|+ .|.|| T Consensus 617 ~di~~G~~~~~i~~~P~~pae~I~~~~~~~~~~~~~~e~--~~~~~ 660 (660) T protein:vir:10 617 AVIDRNEFIANIYVKPARSINYITLNFVATSTGADFDEL--IGPLV 660 (660) T ss_pred HHhhCCeEEEEEEEEecCCccEEEEEEEEeecCccHHHH--hhhcC Confidence 999999999999999999999999999999999999999 78898 No 14 >protein:vir:100539 Length: 663 # NCBI annotation: gp18 tail sheath monomer # Family: family:all:661 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656380;genbank:gi:109290131;genbank:GeneID:4156517 Probab=100.00 E-value=2e-137 Score=770.04 Aligned_cols=604 Identities=28% Similarity=0.444 Sum_probs=405.3 Q ss_pred chhcCCceEEEEecCCCceecccCCceEEEeeeccCcCCCCeEEecCHHHHHHHcCCCCccchhHHHHHHHHHhCCceEE Q lcl|NC_013693. 
5 SFSVAPSVQWTERDATLQTSPSVVVQGATVGKFQWGEAELPVLVTGGETGLVKKFFKPNDATATDFLVIADFLSYSSVAW 84 (631) Q Consensus 5 ~~ylsPGVyveEv~~~~~~~gv~tsv~afvG~~~~Gp~~~p~~i~s~~~e~~~~fG~~~~~~~~~~av~~fF~ngG~~~~ 84 (631) ..||||||||||+|++..+.||+||++||||+|+|||+|+|++|+|| .||++.||++++.+|++|+|++||+|||++|| T Consensus 1 ~~~~~Pgvyv~e~~~~~~~~~v~t~~~~fvG~~~~gp~~~p~~i~s~-~~~~~~fG~~~~~~~~~~~v~~~f~ngg~~~~ 79 (663) T protein:vir:10 1 MALLSPGIEMKETSINSTVVRSATGRAALVGKFAWGPAYEIRQVTNE-VELVDMFGSPDNVTAPYFMSAMNFLQYGNDLR 79 (663) T ss_pred CccccCceEEEEecCcccccccccccceeeeccccCCCCcCEEecCH-HHHHHHcCCcccccchHHHHHHHHHhCCCeEE Confidence 45999999999998655555999999999999999999999999886 79999999999999999999999999999999 Q ss_pred EEEecccCCCccc-ccccchhhhccccccccccccceeeehhhhhhhhhhchhhhhccCccccee--ec----------- Q lcl|NC_013693. 85 VTRVVGPAARNAV-TKGQTAILIRNKLDFETASPSASITWTGRYAGSLGNDVAINVCDAAGFPTW--EF----------- 150 (631) Q Consensus 85 vvRv~~~~a~~a~-~~~~~~~~~~~~~~~~~~~~~~~l~~~a~~~G~~gn~l~v~v~~~~~~~~~--~~----------- 150 (631) ||||.+.+...+. +... ...+ ........+.|||.+.+........... .. T Consensus 80 vvRv~~~~~~~~~~~~~~-------~~~~--------~~~~~~~~~~~g~~~~v~~~~~~~~~~~~~~~~~~~g~~~~~~ 144 (663) T protein:vir:10 80 LVRVIDMEQAKNASPLFN-------QIEV--------TITTEGQGYTVGDTVSIKHNTTTVTEEGKVTKVDADGKIKALF 144 (663) T ss_pred EEecCCcccccccccccc-------ccee--------eEeecccCccccceeeecccccccccCcceeeeccCCceeEEE Confidence 9999875543222 1111 0000 1112334456666665443211100000 00 Q ss_pred -c--ceeeeecccccceEEeeeeeeeeecccccccceeeeeecccccccceeEeeccccccccccccccccccccccccc Q lcl|NC_013693. 151 -R--NNFAYAPQAGEYHIVIVDKVGRITDSSGAVGQVDRISVSGTATGAGSISVAGEDVAYTDTDTPATLATKIGTALTA 227 (631) Q Consensus 151 -~--~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 227 (631) . ..................+...+........... .......+.. ............................. 
T Consensus 145 ~~~a~~~~~a~~~~~~~~~~~a~~~~v~~~~~~~~~a~--av~~i~~dg~-vt~~~~~~a~~~~~~~~~~~~~~~~~~~~ 221 (663) T protein:vir:10 145 VPSSAVIAKAKQLGTYPVLGDNWRAEVSGASGGSAATL--TLGGIVVDSG-VTFGNSEEAPDVMTSTKVLANFAKYGMPL 221 (663) T ss_pred eccccccccccccccccccccceeeEEeeccccccccc--eeEeeecCCc-eeEEeeeccccccccceeeeeccccccce Confidence 0 0000000000011111111111111000000000 0000000000 00000000000000000000000000000 Q ss_pred cccccccccccccccccccc--------------ccccccccceeecccccccceeeeecccccccceeeeeeeeecccc Q lcl|NC_013693. 228 LTDVYSSVVVKSNTVTVTHK--------------AIGPQTVTAIVPDANGLTATAVTTTVGASGSIIEKYELMQATQGSK 293 (631) Q Consensus 228 ~~~~~~~~~~~~~~~~v~~~--------------~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~ 293 (631) ......+.......+..... .....................+.+.+..++...+.+ .+....+.. T Consensus 222 ~~a~~~g~~G~~i~v~~~~~~~~~~~~~~~v~~~~g~~~~~~~~~~~~g~~~~~~~~~~~~~~g~~~e~~-~ls~~~~~~ 300 (663) T protein:vir:10 222 ISAVYPGEIGSTVEVEVISKTAFQSGAAQPIYPFGGTRASNARSVIQYGPMTDDQFAIIVRRDGIVVEST-VLSTRRGDR 300 (663) T ss_pred eeeecccccCcceeEeecccccccccceeeecccCcccccccccccccccccchhhcccccCCCccccee-eeecccccc Confidence 00001111100000100000 000000000011112222334455555666665555 355555556 Q ss_pred cccchhhhhhhhhcc-ccceeeeeccccccc---ccccccccccc----hhhhhhHHHHHhhhhhcccceeEEecccc-- Q lcl|NC_013693. 294 KSDGSNAYFKDVIND-TSNWVYTFATTLAAG---VTELEGGVDDY----TGNRVAAIEALNNAEAYDAKPVFAFCEEL-- 363 (631) Q Consensus 294 ~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~---~~~l~gg~d~~----~~~~~~~~~~l~~~~~~~~~~~i~~~~~~-- 363 (631) ...+...++...+.+ .+.++.......+.. ...+.+|.++. ..++.++++++..++.++...+++++... 
T Consensus 301 ~~~~~~~~~~~~~~~~~s~~v~~~~~~~~~~~~~~~~l~gg~~~~~~~~~~d~~~~~~~l~~~~~~~~~~~i~~~~~~~~ 380 (663) T protein:vir:10 301 DVYGNNIFMDDYFRNGSSNFIYASSVNWPAGFTGIIQLGGGASANNAVGSDELIAGWDLFADREALHVNLMIAGACKSDG 380 (663) T ss_pred ccchhhhhhhhhhcCcccceeEeeccccCcccceeEEecccccCcccchhhhhhhHHhhhccccccCceEEEeecCCCCc Confidence 666666666666544 344444444333332 23567776643 34666778888888888887777655432 Q ss_pred -----chHHHHHHHHHhhccceEEeecccccccc-cccCCHHHHHHHHHh-----------cCCCcceEEEecCeeEEEe Q lcl|NC_013693. 364 -----IEQQTLIDLSTERKDTVSFVSPLRDVVVG-NRGREMEDVVAWRES-----------LVRDSSYFFMDDNWAYVYD 426 (631) Q Consensus 364 -----~~~~~~~~~~~~~~~~~a~~d~~~~~~~~-~~~~~~~~~~~~~~~-----------~~~~s~~~~~~~p~~~~~d 426 (631) .++.++++||+++++||+++|+|+..... ......+++.+||+. .+++|+|+++||||++++| T Consensus 381 ~~~~~~v~~~l~~~~~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~l~~p~~~~~d 460 (663) T protein:vir:10 381 VAVASTVQKHVVALADDRQDCVAFVNPPSELLVGVPTTQAVKNIVEWRNGVTTGGEVVDNNMNISSTYAFISGNYKYQYD 460 (663) T ss_pred hhhHHHHHHHHHHHHHhhCCEEEEEecCcccccccchhhhHHHHHHHhhhccccchhhhhhcccCcceEEEEecceeEec Confidence 46789999999999999999999765433 344567788888864 3568999999999999999 Q ss_pred ccCCceeEeehHHHHHHHHHHhhccCCceecccceeeceeeecccceecCChhHhhhhhhcCceEEEEEcC-CcEEEEcc Q lcl|NC_013693. 
427 KYNDKMRWIPACGGTAGVWARSIEIAGIYKSPAFHNRGKYNNYNRMAWSASSDERAVLYRNQINSIVTFSN-EGIVLYGD 505 (631) Q Consensus 427 ~~~~~~~~~p~s~~~ag~~a~~d~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~L~~~gin~i~~~~~-~G~~~wg~ 505 (631) +.+++++++|||+++||+|||+|.++||||||+|+++++|.|+.++.+.+++.|++.||++|||+|+.|++ +|+++||+ T Consensus 461 ~~~~~~~~~p~s~~vAGl~Ar~D~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gin~i~~~~~~~G~~~wG~ 540 (663) T protein:vir:10 461 KYNDINRWVPLSADIAGLCAYTDQVGHPWMSPAGYRRGQLRNTIKLAIEPKQSLRDTMYQVSINPVTGFAGGDGFVLFGD 540 (663) T ss_pred ccCCceEEechHHHHHHHHHHhhccCCcEEccCCeeecceeccccceeecCchhHHHHHhCCCcEEEEeeCCCcEEEEcc Confidence 99999999999999999999999999999999999999999999999999999999999999999999997 79999999 Q ss_pred eecCCCChhhceehhhHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEccCCCCHHHh Q lcl|NC_013693. 506 KTGLTRPSAFDRINVRGLFIMAEQNIAAIAKYYLGENNDEFTRSLFSNAVRPYIRQLANMGAIYDGQVKCDADNNTADII 585 (631) Q Consensus 506 rT~~~~~~~~~~i~vrR~~~~i~~~~~~~~~~~v~epn~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~~~~~nt~~~i 585 (631) ||+++++++|+||||||||+||+++|+++++|+||||||+.+|++|+++|++||++||++|+|+||+|+||+++||+++| T Consensus 541 rT~s~~~s~~~~i~vrR~~~~i~~si~~~~~~~v~epn~~~l~~~i~~~i~~~L~~l~~~gal~gf~V~~d~~~nt~~~i 620 (663) T protein:vir:10 541 KMATQVPSPFDRINVRRLFNMLKKNIGDTSKYELFENNDAFTRQSFRMEVSQYLDNIRSLGGVYDFRVVCDTTNNTPQVI 620 (663) T ss_pred cccCCCCcccceEehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEcCCCCCHHHh Confidence 99988888999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hCCeEEEEEEEEecCCceEEEEEEEEEecCceeeeeeccCeeeec Q lcl|NC_013693. 
586 AANQMVAGIWLKPEYSINWVYLDFAAVRPDMEFSEIETGGGIVAA 630 (631) Q Consensus 586 ~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~~~g~~~~~ 630 (631) ++|+|+++|+++|++|+|||+|||+|+++|++|+|+ .|.+--| T Consensus 621 ~~G~~~~~i~~~p~~pae~I~~~~~~~~~~~~f~e~--~~~~~~~ 663 (663) T protein:vir:10 621 DSNEFVATIYIKAPRSINYITLNFVATSTGANFDEL--IGPAQLA 663 (663) T ss_pred hCCeEEEEEEEEecCCcceEEEEEEEEecCccHHHH--HHHHhcC Confidence 999999999999999999999999999999999998 3444444 No 15 >protein:vir:5663 Length: 671 # NCBI annotation: tail sheath protein # Family: family:all:661 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899602;genbank:gi:34419589;genbank:GeneID:2545776 Probab=100.00 E-value=7.9e-134 Score=750.30 Aligned_cols=608 Identities=28% Similarity=0.452 Sum_probs=377.5 Q ss_pred chhcCCceEEEEecCCCcee-cccCCceEEEeeeccCcCCCCeEEecCHHHHHHHcCCCCccchhHHHHHHHHHhCCceE Q lcl|NC_013693. 5 SFSVAPSVQWTERDATLQTS-PSVVVQGATVGKFQWGEAELPVLVTGGETGLVKKFFKPNDATATDFLVIADFLSYSSVA 83 (631) Q Consensus 5 ~~ylsPGVyveEv~~~~~~~-gv~tsv~afvG~~~~Gp~~~p~~i~s~~~e~~~~fG~~~~~~~~~~av~~fF~ngG~~~ 83 (631) -+||||||||||++ ++++| ||+||++||||+|+|||+|+|++|+|| .||+++||++++.+|++|+|++||+|||++| T Consensus 1 ~~~~~Pgvyv~e~~-~~~~i~~v~t~~~~fvG~~~~Gp~~~p~~v~s~-~~~~~~fG~~~~~~~~~~~v~~~f~ngg~~~ 78 (671) T protein:vir:56 1 MTLLSPGIENKEIN-LASAIGRAATGRAAMVGKFEWGPAYSITQVTSE-SDLVTIFGRPNDYTAASFMTANNFLKYGNDL 78 (671) T ss_pred CceecCceEEEeec-CcccccccCcccceEEecccCCCCccCEEcCCH-HHHHHHcCCcCCCcchhHHHHHHHHhcCCeE Confidence 35999999999997 56667 999999999999999999999999886 7999999999999999999999999999999 Q ss_pred EEEEecccCCCcccccccchhhhccccccccc-----cccceeeehhhhhhhhhhchhhhhccCccccee--ecc---ce Q lcl|NC_013693. 84 WVTRVVGPAARNAVTKGQTAILIRNKLDFETA-----SPSASITWTGRYAGSLGNDVAINVCDAAGFPTW--EFR---NN 153 (631) Q Consensus 84 ~vvRv~~~~a~~a~~~~~~~~~~~~~~~~~~~-----~~~~~l~~~a~~~G~~gn~l~v~v~~~~~~~~~--~~~---~~ 153 (631) |||||++.+...++.... ........ .....+.+++..++.+.+..++...+....... ... .. 
T Consensus 79 ~vvrv~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~ 152 (671) T protein:vir:56 79 RLVRICDATTAQNATPLY------NAVEYTIGASNGCVVGDDITITYSGVGALTAKGKVLEVDAGNNNAASKIFLPSAEI 152 (671) T ss_pred EEEEecCccccccchhhc------cccccccccCcceeeceeeeeecCcccccccCcceeEEeeeccceeeeeeccceeE Confidence 999998765443321110 00000000 011123334444444444433333222111100 000 00 Q ss_pred eeeecccccceEEee----eeeee--eec-ccccccceeeeeecccccccceeEeeccccccc----cccccccccc--- Q lcl|NC_013693. 154 FAYAPQAGEYHIVIV----DKVGR--ITD-SSGAVGQVDRISVSGTATGAGSISVAGEDVAYT----DTDTPATLAT--- 219 (631) Q Consensus 154 ~~~~~~~g~~~~~~~----~~~~~--v~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~--- 219 (631) +......+.+..... +.... ... ................................. .......... T Consensus 153 v~~~~~~~~~~~~~~~t~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g 232 (671) T protein:vir:56 153 VAAAKSDGNYPSVGTITLQPTQGDIALTNIEIIDTGSVYFPNIELAFDALTAIETEGGALKYADLIEKQGFPRLSARYVG 232 (671) T ss_pred EEeeeccccccccccccccccccceeeeeecccccceEEEeccccccccccccccccccccchhhhhccccccccccccc Confidence 001111111100000 00000 000 000000000000000000000000000000000 0000000000 Q ss_pred ccccc----ccccccccccccc-cccc---ccccccc-cccccccc-eeecccccccceeeeecccccccceeeeeeeee Q lcl|NC_013693. 220 KIGTA----LTALTDVYSSVVV-KSNT---VTVTHKA-IGPQTVTA-IVPDANGLTATAVTTTVGASGSIIEKYELMQAT 289 (631) Q Consensus 220 ~~~~~----~~~~~~~~~~~~~-~~~~---~~v~~~~-~~~~~~~~-~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~ 289 (631) ..+.. ............. .... ....... ........ ............+...+..++...+.+. .... 
T Consensus 233 ~~g~~~~v~v~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~~~-~~~~ 311 (671) T protein:vir:56 233 DFGDAISVEIINYADYQTAFAFAAGHTLGDIELPIYPDGGTRSINLSSYFTFGPSNSNQYAVIVRVSGEVEEAFI-VSTN 311 (671) T ss_pred ccCcceEEEEecccccccccccccceeeeeccccccccccccccccceeecccccccccceeEEeecCccceeEE-Eeec Confidence 00000 0000000000000 0000 0000000 00000000 0000001111122223333344444432 2222 Q ss_pred cccccccchhhhhhhhhccccceee-ee--cccccccccccccccccchhhh--hhHHHHHhhhhhcccceeEEeccccc Q lcl|NC_013693. 290 QGSKKSDGSNAYFKDVINDTSNWVY-TF--ATTLAAGVTELEGGVDDYTGNR--VAAIEALNNAEAYDAKPVFAFCEELI 364 (631) Q Consensus 290 ~~~~~~~~~~~~~~~~~~~~~~~~~-~~--~~~~~~~~~~l~gg~d~~~~~~--~~~~~~l~~~~~~~~~~~i~~~~~~~ 364 (631) .......+...+......++..... .. ..........+.||.+...+.. ..++..+...+.+. +.++.++... T Consensus 312 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~d~~~~~~~~~~~~~~~~~~~~~~--~~~~~a~~~~ 389 (671) T protein:vir:56 312 PGDKDVNGQSIFIDEYFENSGSAYITAIAEGWKTESGAYNFGGGSDANAGADDWMFGLDMLSDPEVLY--TNLVIAGNAA 389 (671) T ss_pred ccccccchhhhhhhhhhcccCceEEEecCcccCCccccccccCccccccchhHHHHHHHhhhhccccc--eeEEEcCCCC Confidence 2333333333333333333322211 11 1223334456778887664432 33344443333332 3333333211 Q ss_pred ---------h-HHHHHHHHHhhccceEEeecccccc-cccccCCHHHHHHHHHhc------------CCCcceEEEecCe Q lcl|NC_013693. 365 ---------E-QQTLIDLSTERKDTVSFVSPLRDVV-VGNRGREMEDVVAWRESL------------VRDSSYFFMDDNW 421 (631) Q Consensus 365 ---------~-~~~~~~~~~~~~~~~a~~d~~~~~~-~~~~~~~~~~~~~~~~~~------------~~~s~~~~~~~p~ 421 (631) . +.++..+|+.+++|++++|+|+... 
....+.+.+++.+||+.+ +++|+|+++|||| T Consensus 390 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~ 469 (671) T protein:vir:56 390 AEEVSIASTVQKYAIDSVGNVRQDCVVFVSPPQSLIVNKQAGTAVANIQGWRTGIDPTNGQAVVDNLNVSTTYAVIDGNY 469 (671) T ss_pred CccchhHHHHHHHHHHHHHhhcCCEEEEEeccccccccccccccHHHHHHHhhhccccchhhhhhhccCCcceEEEecCc Confidence 2 2345566677889999999998764 445678899999998643 4678999999999 Q ss_pred eEEEeccCCceeEeehHHHHHHHHHHhhccCCceecccceeeceeeecccceecCChhHhhhhhhcCceEEEEEcCCcEE Q lcl|NC_013693. 422 AYVYDKYNDKMRWIPACGGTAGVWARSIEIAGIYKSPAFHNRGKYNNYNRMAWSASSDERAVLYRNQINSIVTFSNEGIV 501 (631) Q Consensus 422 ~~~~d~~~~~~~~~p~s~~~ag~~a~~d~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~L~~~gin~i~~~~~~G~~ 501 (631) ++++|+.+++.+++|||+++||+|||+|.++||||||||+++++|.|+.++++.+++.|++.||++|||+|++|+++|++ T Consensus 470 ~~~~d~~~~~~~~~p~s~~~AGl~Ar~D~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~ 549 (671) T protein:vir:56 470 KYQYDKYNDRNRWVPLAGDIAGLCAYTDQVSQPWMSPAGFNRGQIKGVNRLAVDLRRAHRDALYQIGINPVVGFAGQGFV 549 (671) T ss_pred eEEecccCCceeEechHHHHHHHHHHhhccCCcEECcCCceeccccccccceeecChhHHHHHhhCCceEEEEecCCeEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEcceecCCCChhhceehhhHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEccCCCC Q lcl|NC_013693. 
502 LYGDKTGLTRPSAFDRINVRGLFIMAEQNIAAIAKYYLGENNDEFTRSLFSNAVRPYIRQLANMGAIYDGQVKCDADNNT 581 (631) Q Consensus 502 ~wg~rT~~~~~~~~~~i~vrR~~~~i~~~~~~~~~~~v~epn~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~~~~~nt 581 (631) +||+||++++|++|+||||||||+||+++|+++++|+|||||++.||++|+++|+.||++||++|+|.||+|+||+++|| T Consensus 550 ~wG~rT~~~~~~~~~~i~vrR~~~~i~~si~~~~~~~v~epn~~~~~~~i~~~i~~fL~~l~~~gal~g~~v~~d~~~nt 629 (671) T protein:vir:56 550 LYGDKTATQQASAFDRINVRRLFNLLKKAISDAAKYRLFELNDEFTRSSFKSEIDAYLTNIQDLGGVYDFRVVCDETNNP 629 (671) T ss_pred EEcceecCCCCcccceEehhhHHHHHHHHHHHHHHHhcCCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEcCCCCC Confidence 99999998888999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHhhCCeEEEEEEEEecCCceEEEEEEEEEecCceeeeeeccC Q lcl|NC_013693. 582 ADIIAANQMVAGIWLKPEYSINWVYLDFAAVRPDMEFSEIETGG 625 (631) Q Consensus 582 ~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~~~g 625 (631) +++|++|+|+++|+|+|++|+|||+|||+|++++++|+|++ | T Consensus 630 ~~~i~~G~~~~~i~~~p~~Pae~I~~~~~~~~~~~~f~e~~--~ 671 (671) T protein:vir:56 630 GSVIDRNEFVASIYVKPAKSINFITLNFVATSTDADFAEII--G 671 (671) T ss_pred HHHhhCCeEEEEEEEEecCCcceEEEEEEEeecCcchhhhc--C Confidence 99999999999999999999999999999999999999994 4 No 16 >protein:vir:79092 Length: 477 # NCBI annotation: gp13, phage tail sheath protein # Family: family:all:115 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111213;genbank:gi:134288804;genbank:GeneID:4960766 Probab=100.00 E-value=7.7e-110 Score=618.80 Aligned_cols=462 Identities=15% Similarity=0.147 Sum_probs=320.3 Q ss_pred CCCcchhcCCceEEEEecCCCcee-cccCCceEEEeeeccCcCCCCeEEecCHHHHHHHcCCCCccchhHHHHHHHHHhC Q lcl|NC_013693. 
1 MATQSFSVAPSVQWTERDATLQTS-PSVVVQGATVGKFQWGEAELPVLVTGGETGLVKKFFKPNDATATDFLVIADFLSY 79 (631) Q Consensus 1 m~~~~~ylsPGVyveEv~~~~~~~-gv~tsv~afvG~~~~Gp~~~p~~i~s~~~e~~~~fG~~~~~~~~~~av~~fF~ng 79 (631) ||+ |++|||||||+++++++| +|+|+|++|||++++||+|+|++|+|| .||++ ||+....++|++|+++||.|| T Consensus 1 M~~---~~~pGVyv~E~~~g~~~I~~v~Tsv~~~VG~a~~~p~n~pv~its~-~d~~~-~g~~~~~~tL~~Av~~~f~ng 75 (477) T protein:vir:79 1 MAA---NYLHGVETIEKETGSRPVKVVKSAVIGLIGTAPIGPVNTPVQSLSD-VDAAQ-FGPQLAGFTIPQALDAVYDYG 75 (477) T ss_pred CcC---CCCCCeEEEEecCCcccccccCCceEEEEeecccCCCcccEEEccH-HHHHH-hcCCCCCCcHHHHHHHHhhcC Confidence 664 678999999999999888 999999999999999999999999887 69986 788888899999999999999 Q ss_pred CceEEEEEecccCCCcccccccchhhhccccccccccccceeeehhhhhhhhhhchhhhhccCcccceeeccceeeeecc Q lcl|NC_013693. 80 SSVAWVTRVVGPAARNAVTKGQTAILIRNKLDFETASPSASITWTGRYAGSLGNDVAINVCDAAGFPTWEFRNNFAYAPQ 159 (631) Q Consensus 80 G~~~~vvRv~~~~a~~a~~~~~~~~~~~~~~~~~~~~~~~~l~~~a~~~G~~gn~l~v~v~~~~~~~~~~~~~~~~~~~~ 159 (631) |++||||||.++.......+.... ............+.....+.+ T Consensus 76 g~~~~vvrV~~~~~~~~~~a~~~~-------------~~~~~~~~~~~~~~~~~~~~v---------------------- 120 (477) T protein:vir:79 76 SGTVIVINVLDPAVHKSNAASESV-------------TFDAATGRAKLAHPAAANLVL---------------------- 120 (477) T ss_pred CceEEEEeccCCcccccccccccc-------------ccccccccccccccccceeEE---------------------- Confidence 999999999765433222110000 000000000000000000000 Q ss_pred cccceEEeeeeeeeeecccccccceeeeeecccccccceeEeeccccccccccccccccccccccccccccccccccccc Q lcl|NC_013693. 160 AGEYHIVIVDKVGRITDSSGAVGQVDRISVSGTATGAGSISVAGEDVAYTDTDTPATLATKIGTALTALTDVYSSVVVKS 239 (631) Q Consensus 160 ~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 239 (631) ....... ... .. .... . .. ..... 
T Consensus 121 ---------------~~~~~~~--~~~---~~--~~~~-~-------------------~~-----------~~~~~--- 144 (477) T protein:vir:79 121 ---------------KNDSGGT--TYT---EG--TDYA-V-------------------DL-----------INGVI--- 144 (477) T ss_pred ---------------eeccccc--ccc---cC--cccc-c-------------------cc-----------cchhh--- Confidence 0000000 000 00 0000 0 00 00000 Q ss_pred ccccccccccccccccceeecccccccceeeeecccccccceeeeeeeeecccccccchhhhhhhhhccccceeeeeccc Q lcl|NC_013693. 240 NTVTVTHKAIGPQTVTAIVPDANGLTATAVTTTVGASGSIIEKYELMQATQGSKKSDGSNAYFKDVINDTSNWVYTFATT 319 (631) Q Consensus 240 ~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 319 (631) ........ +.......... .. . .. T Consensus 145 --~~~~~~~~-------------~~~~~~~~~~~-~~---------~-------------------------------~~ 168 (477) T protein:vir:79 145 --TRIKTGTI-------------PAAATAAKATY-DY---------A-------------------------------DP 168 (477) T ss_pred --hhhhcccc-------------ccccceeecee-cc---------C-------------------------------Cc Confidence 00000000 00000000000 00 0 00 Q ss_pred ccccccccccccccchhhhhhHHHHHhhhhhcccceeEEecccc----chHHHHHHHHHhhccceEEeeccccccccccc Q lcl|NC_013693. 320 LAAGVTELEGGVDDYTGNRVAAIEALNNAEAYDAKPVFAFCEEL----IEQQTLIDLSTERKDTVSFVSPLRDVVVGNRG 395 (631) Q Consensus 320 ~~~~~~~l~gg~d~~~~~~~~~~~~l~~~~~~~~~~~i~~~~~~----~~~~~~~~~~~~~~~~~a~~d~~~~~~~~~~~ 395 (631) .........+..+ ..+...+........+.....+.++..|+. .++.++.++|+++ ++|+++|.|. . T Consensus 169 ~~~~~~~~~g~~~-a~~~~tg~~al~~~~~~~~~~~~iv~apg~~~~~~v~~~l~~~~~~~-~~~a~~d~p~-------~ 239 (477) T protein:vir:79 169 TKVTAADIIGAVN-AAGMRTGMKALKDTYNLYGYFSKILIAPAYCTQNSVSVELEAMAVQL-GAIAYIDAPI-------G 239 (477) T ss_pred ccceeeeeccccc-ccccchhhhhhhhhhhhcccccceeeccccccchhHHHHHHHHHhhc-CeEEEEecCC-------C Confidence 0000000111111 011111122222223333334444444543 4677888999976 5899998774 4 Q ss_pred CCHHHHHHHHHh-----cCCCcceEEEecCeeEEEeccCCceeEeehHHHHHHHHHHhhccCCceecccceeeceeeecc Q lcl|NC_013693. 
396 REMEDVVAWRES-----LVRDSSYFFMDDNWAYVYDKYNDKMRWIPACGGTAGVWARSIEIAGIYKSPAFHNRGKYNNYN 470 (631) Q Consensus 396 ~~~~~~~~~~~~-----~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~ag~~a~~d~~~g~~~span~~~~~i~g~~ 470 (631) .+.+++.+||+. ++++|+|+++||||++++|+.++..+++|||+++||++||+|.++||||||+|+++.++.+.. T Consensus 240 ~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~s~~~ag~~a~~d~~~g~~~span~~~~gv~~~~ 319 (477) T protein:vir:79 240 TTLAQALAGRGPAGTINFNTSSDRVRLCYPHVKVYDIATNAERLEPLSSRAAGLRARVDLDKGYWWSSSNQQLVGVTGVE 319 (477) T ss_pred CChHHHhhhhhhccccccccccceEEEEcCeeEEecccCCceeeechHHHHHHHHHHhhccCCceEccCCceeecceecc Confidence 667888888864 457899999999999999999999999999999999999999999999999999976665543 Q ss_pred -cc--eecCChhHhhhhhhcCceEEEEEcCCcEEEEcceecC--CCChhhceehhhHHHHHHHHHHHHHHHHHhcCCCCH Q lcl|NC_013693. 471 -RM--AWSASSDERAVLYRNQINSIVTFSNEGIVLYGDKTGL--TRPSAFDRINVRGLFIMAEQNIAAIAKYYLGENNDE 545 (631) Q Consensus 471 -~~--~~~~~~~~~~~L~~~gin~i~~~~~~G~~~wg~rT~~--~~~~~~~~i~vrR~~~~i~~~~~~~~~~~v~epn~~ 545 (631) .+ ....++.|++.||++|||+|++|+++|+++||+||++ ++++.|+||||||++++|+++|++.++|+|||||++ T Consensus 320 ~~~~~~~~~~~~~~~~L~~~~i~~i~~~~~~G~~~wG~rT~~~~~~~~~~~~i~vrR~~~~i~~~~~~~~~~~v~e~~~~ 399 (477) T protein:vir:79 320 RPLSAMIDDPQSDVNMLNEQGITTVFSSYGSGLRLWGNRTAAWPTVTHMRNFENVRRTGDVINESLRYFSQQFVDAPIDQ 399 (477) T ss_pred cccccccCCChhhHHHHhhCCceEEEEecCCcEEEEcccccCCCCCCccceeeehhhHHHHHHHHHHHHHHHhccCCCCH Confidence 22 2333567999999999999999999999999999996 456789999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHhCCceeeeEEEEccCCCCHHHhhCCeEEEEEEEEecCCceEEEEEEEEEecCceeeeeeccC Q lcl|NC_013693. 
546 FTRSLFSNAVRPYIRQLANMGAIYDGQVKCDADNNTADIIAANQMVAGIWLKPEYSINWVYLDFAAVRPDMEFSEIETGG 625 (631) Q Consensus 546 ~~~~~i~~~i~~~l~~l~~~gal~g~~v~~~~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~~~g 625 (631) .+|++|+++|++||++||++|+|+||+|+||+++||+++|++|+|+++|+++|++|+|||+|++++...+ .+.+.|| T Consensus 400 ~~~~~i~~~i~~~l~~l~~~g~l~g~~v~~~~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~---~~~~~~~ 476 (477) T protein:vir:79 400 GLIDSLVESVNGFGRKLIGDGALLGFKAWFDPARNPKEELAAGHLLINYKYTVPPPLERLTYETEITSEY---LLTLKGG 476 (477) T ss_pred HHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHhhCCeEEEEEEEEecCCceeEEEEEEEechH---HhhhccC Confidence 9999999999999999999999999999999999999999999999999999999999999999987655 4566777 Q ss_pred e Q lcl|NC_013693. 626 G 626 (631) Q Consensus 626 ~ 626 (631) - T Consensus 477 ~ 477 (477) T protein:vir:79 477 N 477 (477) T ss_pred C Confidence 7 No 17 >protein:vir:107865 Length: 477 # NCBI annotation: gp39 # Family: family:all:115 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024712;genbank:gi:48696949;genbank:GeneID:2845953 Probab=100.00 E-value=4.3e-109 Score=614.71 Aligned_cols=462 Identities=15% Similarity=0.151 Sum_probs=326.6 Q ss_pred CCCcchhcCCceEEEEecCCCcee-cccCCceEEEeeeccCcCCCCeEEecCHHHHHHHcCCCCccchhHHHHHHHHHhC Q lcl|NC_013693. 1 MATQSFSVAPSVQWTERDATLQTS-PSVVVQGATVGKFQWGEAELPVLVTGGETGLVKKFFKPNDATATDFLVIADFLSY 79 (631) Q Consensus 1 m~~~~~ylsPGVyveEv~~~~~~~-gv~tsv~afvG~~~~Gp~~~p~~i~s~~~e~~~~fG~~~~~~~~~~av~~fF~ng 79 (631) |++ |++|||||||+++++++| +|+|+|++|||++++||+|+|++|+|| .||. 
.||+.....+|++|+++||+|| T Consensus 1 M~~---~~~pGVyv~E~~~~~~~i~~v~T~v~~~VG~a~~gp~n~pv~its~-~d~~-~~g~~~~~~tL~~Av~~~f~nG 75 (477) T protein:vir:10 1 MAA---NYLHGVETIEKETGSRPVKVVKSAVIGLIGTAPIGPVNTPVQSLSD-VDAA-QFGPQLAGFTIPQALDAVYDYG 75 (477) T ss_pred Ccc---cCCCCeEEEEccCCcccccccCCceeEEEecccCCCCCcCEEEccH-HHHH-HhccCCCCCcHHHHHHHHHhcc Confidence 665 568999999999999887 999999999999999999999999887 7995 6999999999999999999999 Q ss_pred CceEEEEEecccCCCcccccccchhhhccccccccccccceeeehhhhhhhhhhchhhhhccCcccceeeccceeeeecc Q lcl|NC_013693. 80 SSVAWVTRVVGPAARNAVTKGQTAILIRNKLDFETASPSASITWTGRYAGSLGNDVAINVCDAAGFPTWEFRNNFAYAPQ 159 (631) Q Consensus 80 G~~~~vvRv~~~~a~~a~~~~~~~~~~~~~~~~~~~~~~~~l~~~a~~~G~~gn~l~v~v~~~~~~~~~~~~~~~~~~~~ 159 (631) |++||||||.+.....+...... +.. .. . T Consensus 76 g~~~~vVrV~~~~~~~~~~~~~~--------------------------~~~--------~~-----------------~ 104 (477) T protein:vir:10 76 SGTVIVINVLDPAVHKSNAANEP--------------------------VTF--------DA-----------------A 104 (477) T ss_pred ceEEEEEecCccccccccccccc--------------------------ccc--------cc-----------------c Confidence 99999999976543211100000 000 00 0 Q ss_pred cccceEEeeeeeeeeecccccccceeeeeecccccccceeEeeccccccccccccccccccccccccccccccccccccc Q lcl|NC_013693. 160 AGEYHIVIVDKVGRITDSSGAVGQVDRISVSGTATGAGSISVAGEDVAYTDTDTPATLATKIGTALTALTDVYSSVVVKS 239 (631) Q Consensus 160 ~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 239 (631) .+.... . .. .. .. ... .. .... T Consensus 105 ~~~~~~------------~------------~~-~~--------------~~---~~v-~~----------~a~~----- 126 (477) T protein:vir:10 105 TGRAKL------------A------------HP-AA--------------AN---LVL-KN----------DSGG----- 126 (477) T ss_pred cceecc------------c------------cc-cc--------------cc---ccc-cc----------cccc----- Confidence 000000 0 00 00 00 000 00 0000 Q ss_pred ccccccccccccccccceeecccccccceeeeecccccccceeeeeeeeecccccccchhhhhhhhhccccceeeeeccc Q lcl|NC_013693. 
240 NTVTVTHKAIGPQTVTAIVPDANGLTATAVTTTVGASGSIIEKYELMQATQGSKKSDGSNAYFKDVINDTSNWVYTFATT 319 (631) Q Consensus 240 ~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 319 (631) .... .............. .... ...... .... ..... ..... T Consensus 127 ~~~~--------~~~~~~~~~~~~~~-~~~~--------------~~~~~~------~~~~-~~~~~--------~~~~~ 168 (477) T protein:vir:10 127 TTYA--------EGTDYAVDLINGVI-TRIK--------------TGTIPP------GATA-AKATY--------DYADP 168 (477) T ss_pred cccc--------cchhhhhhhccccc-eecc--------------cccccc------ccee-eeecc--------ccccc Confidence 0000 00000000000000 0000 000000 0000 00000 00001 Q ss_pred ccccccccccccccchhhhhhHHHHHhhhhhcccceeEEecccc----chHHHHHHHHHhhccceEEeeccccccccccc Q lcl|NC_013693. 320 LAAGVTELEGGVDDYTGNRVAAIEALNNAEAYDAKPVFAFCEEL----IEQQTLIDLSTERKDTVSFVSPLRDVVVGNRG 395 (631) Q Consensus 320 ~~~~~~~l~gg~d~~~~~~~~~~~~l~~~~~~~~~~~i~~~~~~----~~~~~~~~~~~~~~~~~a~~d~~~~~~~~~~~ 395 (631) .......+.+..+ .++..++........+.....+.++..++. .++.++.++|+++ ++++++|.|. . T Consensus 169 ~~~~~~~~~g~~~-~~~~~tGl~al~~~~~~~~~~~~~l~apg~~~~~~v~~~l~~~~~~~-~~~~~~d~p~-------~ 239 (477) T protein:vir:10 169 TKVTAADIIGAVN-AAGMRTGMKALKDTYNLYGYFSKILIAPAYCTQNSVSVELEAMAVQL-GAIAYIDAPI-------G 239 (477) T ss_pred ccccccccccccc-ccchhhhhhhhhhhhhhcchhcccccccccccchhhHHHHHHHHhhC-CEEEEEecCC-------C Confidence 1111122333322 344444444444444445555555555544 4677888999976 5899998763 4 Q ss_pred CCHHHHHHHHHh-----cCCCcceEEEecCeeEEEeccCCceeEeehHHHHHHHHHHhhccCCceecccceeeceeeecc Q lcl|NC_013693. 396 REMEDVVAWRES-----LVRDSSYFFMDDNWAYVYDKYNDKMRWIPACGGTAGVWARSIEIAGIYKSPAFHNRGKYNNYN 470 (631) Q Consensus 396 ~~~~~~~~~~~~-----~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~ag~~a~~d~~~g~~~span~~~~~i~g~~ 470 (631) .+.+++.+||+. ++++|+|++++|||++++|+.++..+++|||+++||++||+|.++||||||+|+++.++.++. 
T Consensus 240 ~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~ag~~a~~d~~~g~~~span~~~~gi~~~~ 319 (477) T protein:vir:10 240 TTLAQALAGRGPAGTINFNTSSDRVRLCYPHVKVYDTATNAERLEPLSSRAAGLRARVDLDKGYWWSSSNQQLVGVTGVE 319 (477) T ss_pred CCHHHHHhhhhhccccccccccceEEEEcCeEEEecccCCceeEEchHHHHHHHHHHhhhcCCceeccCCceeccccccc Confidence 667888888874 356789999999999999999999999999999999999999999999999999977776653 Q ss_pred c-ce--ecCChhHhhhhhhcCceEEEEEcCCcEEEEcceecCC--CChhhceehhhHHHHHHHHHHHHHHHHHhcCCCCH Q lcl|NC_013693. 471 R-MA--WSASSDERAVLYRNQINSIVTFSNEGIVLYGDKTGLT--RPSAFDRINVRGLFIMAEQNIAAIAKYYLGENNDE 545 (631) Q Consensus 471 ~-~~--~~~~~~~~~~L~~~gin~i~~~~~~G~~~wg~rT~~~--~~~~~~~i~vrR~~~~i~~~~~~~~~~~v~epn~~ 545 (631) . +. ...++.|++.||++|||+|++|+++|+++||+||++. +++.|+|+||||++++|+++|+++++|+|||||++ T Consensus 320 ~~~~~~~~~~~~~~~~L~~~gi~~i~~~~~~G~~~wG~rT~~~~~~~~~~~~~~vrR~~~~i~~~~~~~~~~~v~~~~~~ 399 (477) T protein:vir:10 320 RPLSAMIDDPQSDVNMLNEQGITTVFSSYGSGLRLWGNRTAAWPTVTHMRNFENVRRTGDVINESLRYFSQQFVDAPIDQ 399 (477) T ss_pred cccccccCCChhhHHHHhhCCceEEEEecCCcEEEEcccccCCCCCCcccceeehhhHHHHHHHHHHHHHHHhccCCCCH Confidence 2 22 3335678999999999999999999999999999964 56789999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHhCCceeeeEEEEccCCCCHHHhhCCeEEEEEEEEecCCceEEEEEEEEEecCceeeeeeccC Q lcl|NC_013693. 546 FTRSLFSNAVRPYIRQLANMGAIYDGQVKCDADNNTADIIAANQMVAGIWLKPEYSINWVYLDFAAVRPDMEFSEIETGG 625 (631) Q Consensus 546 ~~~~~i~~~i~~~l~~l~~~gal~g~~v~~~~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~~~g 625 (631) .+|++|+++|++||++||++|+|+||+|+||+++||++||++|+|+++|+++|++|+|||+|++++.. 
++.|.+.|| T Consensus 400 ~~~~~i~~~i~~~l~~l~~~g~l~g~~v~~~~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~---~~~~~~~~g 476 (477) T protein:vir:10 400 GLIDSLVESVNGFGRKLIGDGALLGFKAWFDPARNPKEELAAGHLLINYKYTVPPPLERLTYETEITS---EYLLTLKGG 476 (477) T ss_pred HHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHhhCCeEEEEEEEEecCCcceEEEEEEEcc---hHHhhhhcC Confidence 99999999999999999999999999999999999999999999999999999999999999998764 445666777 Q ss_pred e Q lcl|NC_013693. 626 G 626 (631) Q Consensus 626 ~ 626 (631) - T Consensus 477 ~ 477 (477) T protein:vir:10 477 N 477 (477) T ss_pred C Confidence 7 No 18 >protein:vir:98824 Length: 774 # NCBI annotation: putative phage tail sheath protein # Family: family:all:661 # MgeID: mge:1530 # MgeName: Ma-LMM01 # Cross-refs: genbank:acc:YP_851105;genbank:gi:117530262;genbank:GeneID:4484488 Probab=100.00 E-value=2.4e-105 Score=594.14 Aligned_cols=487 Identities=14% Similarity=0.086 Sum_probs=322.1 Q ss_pred CCCcchhcCCceEEEEecCCCcee-c-ccCCceEEEeeeccCcCCCCeEEecCHHHHHHHcCCCCccchhHHHHHHHHHh Q lcl|NC_013693. 1 MATQSFSVAPSVQWTERDATLQTS-P-SVVVQGATVGKFQWGEAELPVLVTGGETGLVKKFFKPNDATATDFLVIADFLS 78 (631) Q Consensus 1 m~~~~~ylsPGVyveEv~~~~~~~-g-v~tsv~afvG~~~~Gp~~~p~~i~s~~~e~~~~fG~~~~~~~~~~av~~fF~n 78 (631) =++.-+|..|||||||+++++++| | |+|||+||||.++|||+|+|++|+|| .||.+.||..... ++ T Consensus 277 ~~~~~~v~~~GVYVEEVpSGvrtIeGGV~TSVAAFVG~A~rGPvn~PvlITS~-aD~~~~Fg~~~GG-----------l~ 344 (774) T protein:vir:98 277 GEITRNVEDNGVVIQLEPALTGSISNRFSFYVTANDNTANRGFTTSPALVTTI-PDPAIHFTSFQGG-----------LD 344 (774) T ss_pred cceEEEEecCceEEEEeCCCCccccccccceeeeecccccCCCCCcCEEEeeh-hHhhhhhccccCC-----------cc Confidence 445557888999999999999998 7 99999999999999999999999998 6977777654332 36 Q ss_pred CCceEEEEEecccCCCcccccccchhhhccccccccccccceeeehhhhhhhhhhchhhhhccCcccceeeccceeeeec Q lcl|NC_013693. 
79 YSSVAWVTRVVGPAARNAVTKGQTAILIRNKLDFETASPSASITWTGRYAGSLGNDVAINVCDAAGFPTWEFRNNFAYAP 158 (631) Q Consensus 79 gG~~~~vvRv~~~~a~~a~~~~~~~~~~~~~~~~~~~~~~~~l~~~a~~~G~~gn~l~v~v~~~~~~~~~~~~~~~~~~~ 158 (631) |+++||.+.-. ....+.+++.|+.+|.|||.+++.+.+...... ........ T Consensus 345 GassA~r~~~~-------------------------~sG~~~L~i~A~~pGawGN~ItV~I~~~t~~~~---~l~v~~~~ 396 (774) T protein:vir:98 345 GPRSAFRDFYT-------------------------FNGTPLLRLQAVSEGNWGNQVTVSIYPVNNSEF---RLNVQDLN 396 (774) T ss_pred ccceeeeeeee-------------------------ecccceEEEEEeecCcCCCceEEEEEecCCcee---EEEEEecC Confidence 78887743211 112345789999999999999998876543211 11110000 Q ss_pred ccccceEEeeeeeeeeecccccccceeeeeecccccccceeEeecccccccccccccccccccccccccccccccccccc Q lcl|NC_013693. 159 QAGEYHIVIVDKVGRITDSSGAVGQVDRISVSGTATGAGSISVAGEDVAYTDTDTPATLATKIGTALTALTDVYSSVVVK 238 (631) Q Consensus 159 ~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 238 (631) .... .. ........................ .......... .. ..... T Consensus 397 ~s~f-----~~---------~~a~e~~tv~~~~~~~~~~v~e~~-dn~~i~~~~~------------~~---~~~~i--- 443 (774) T protein:vir:98 397 GSAF-----NP---------PLADEVYTVKLGDTNESGELNALL-DSKFIRGFFL------------PK---SIDSI--- 443 (774) T ss_pred Cccc-----cc---------cccceeEEEecccccccceeeeee-ceeeEeeccc------------cc---ccccc--- Confidence 0000 00 000000000000000000000000 0000000000 00 00000 Q ss_pred cccccccccccccccccceeecccccccceeeeecccccccceeeeeeeeecccccccchhhhhhhhhccccceeeeecc Q lcl|NC_013693. 239 SNTVTVTHKAIGPQTVTAIVPDANGLTATAVTTTVGASGSIIEKYELMQATQGSKKSDGSNAYFKDVINDTSNWVYTFAT 318 (631) Q Consensus 239 ~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 318 (631) ..... +. ....+.... .+. .......... ....... . 
T Consensus 444 -------------n~vs~-lv-----~~~~~~~a~-~d~----~~~~~~~~~~-----------~~~~~~~--~------ 480 (774) T protein:vir:98 444 -------------NYDAA-LV-----RQSPLRLAP-PDE----SETDVENPAH-----------VDFYGPN--V------ 480 (774) T ss_pred -------------ccccc-cc-----ccchhcccc-ccc----cccccccccc-----------ccccCCc--c------ Confidence 00000 00 000000000 000 0000000000 0000000 0 Q ss_pred cccccccccccccccchhhhhhHHHHHhhhhhcccceeEEeccccchHHHHHHHHHhh----ccceEEeecccccccccc Q lcl|NC_013693. 319 TLAAGVTELEGGVDDYTGNRVAAIEALNNAEAYDAKPVFAFCEELIEQQTLIDLSTER----KDTVSFVSPLRDVVVGNR 394 (631) Q Consensus 319 ~~~~~~~~l~gg~d~~~~~~~~~~~~l~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~----~~~~a~~d~~~~~~~~~~ 394 (631) .....+.+|.|+...........+...+..++..++.......++.+++++|+.+ ++|++++|.|. T Consensus 481 ---~v~v~lagG~Dg~~tt~~~igg~~~~~~~tgi~aLl~a~~~~~V~~aii~~~e~~~~~~~~r~avid~p~------- 550 (774) T protein:vir:98 481 ---LVDVTLENGYDGPPVTNDDYVSIIRTLENQPVHILLVGTTNVGVQQALITEAERASDSDGLRIAVLAAPP------- 550 (774) T ss_pred ---eEEEeecCCCCcccccchheecccccccccceeEEEcCccchhhHHHHHHHHHHhhhcccceEEEEECCC------- Confidence 0011122333322111000001111222223333333344566788888888875 78999998763 Q ss_pred cCCHHHHHHHHHhcCCCcceEEEecCeeEEEeccCCceeEeehHHHHHHHHHHhhccCCceecccceeeceeeecc---c Q lcl|NC_013693. 395 GREMEDVVAWRESLVRDSSYFFMDDNWAYVYDKYNDKMRWIPACGGTAGVWARSIEIAGIYKSPAFHNRGKYNNYN---R 471 (631) Q Consensus 395 ~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~ag~~a~~d~~~g~~~span~~~~~i~g~~---~ 471 (631) +.+.+++++||+.+ +|+|+++||||++++|+.+++.+++|||+++||++||+| +||||+|+++.++.+.. . 
T Consensus 551 g~t~~~Ai~~r~~f--~S~~aal~~Pwvkv~D~~~g~~~~vPpSg~vAGl~ArtD----v~kSPANk~I~Givg~ai~~~ 624 (774) T protein:vir:98 551 RTTPTLAASVTRGF--NSTRAVMVAGWFTYAGQPNSSRYGVPGAAVYAGKLAAID----FFVSPAARSLVGPLFNIIESD 624 (774) T ss_pred CCCHHHHHHHHhcc--CCceEEEEeCcEEEeccCCCceeecChhHHHHHHHHhcC----cccccCCceeecceecccccc Confidence 56889999999976 689999999999999999999999999999999999999 99999999976665432 2 Q ss_pred ceecCChhHhhhhhhcCceEEE-EEcCCcEEEEcceecCCCChhhceehhhHHHHHHHHHHHHHHHHHhcCCCCHHHHHH Q lcl|NC_013693. 472 MAWSASSDERAVLYRNQINSIV-TFSNEGIVLYGDKTGLTRPSAFDRINVRGLFIMAEQNIAAIAKYYLGENNDEFTRSL 550 (631) Q Consensus 472 ~~~~~~~~~~~~L~~~gin~i~-~~~~~G~~~wg~rT~~~~~~~~~~i~vrR~~~~i~~~~~~~~~~~v~epn~~~~~~~ 550 (631) +....++.|++.|++++||+++ .++++|+++||+||+++ |++|+||+||||++||+++|++.++|+||||||+.+|++ T Consensus 625 l~~~~t~ae~d~Ln~~gIN~i~itt~g~G~rvWG~RTlss-Dp~wr~InVRRlfd~Ie~SI~~~~~~~VfEPNd~~l~~~ 703 (774) T protein:vir:98 625 TDNYTSRSNQDIYSAARLEVLSLDTVDRTYRFASGVTLST-DPAWERIYLRRVHDVVRQGAHAILRNYVAMPNSRLVRNQ 703 (774) T ss_pred ccccccchhhhhhcccccceeEEEEcCCcEEEEcccccCC-CcccceEeehhhHHHHHHHHHHHHHHhccCCCCHHHHHH Confidence 3444578899999999999997 58899999999999865 689999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHhCCceeeeE-EEEccCCCCHHHhhCCeEEEEEEEEecCCceEEEEEEEEEecCceeee Q lcl|NC_013693. 
551 FSNAVRPYIRQLANMGAIYDGQ-VKCDADNNTADIIAANQMVAGIWLKPEYSINWVYLDFAAVRPDMEFSE 620 (631) Q Consensus 551 i~~~i~~~l~~l~~~gal~g~~-v~~~~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~e 620 (631) |+++++.||++||++|+|.||+ |+||+++||+++|++|+|+++|+++|++|+|||+|||+|..++..|+| T Consensus 704 I~~sI~~fL~~L~~~GaL~G~~~V~~D~etNt~~dI~~G~l~i~I~vaP~~PAEfIilri~q~t~~~~l~E 774 (774) T protein:vir:98 704 IAAALNAFMGELKRNGNIVSFRPAIIDGSNNSTAAYFSRELYVSLQFQPLYSADYIYVTISRDTETSPLGE 774 (774) T ss_pred HHHHHHHHHHHHHhCCceecceEEEEcCCCCCHHHhhCCEEEEEEEEEecCCcceEEEEEEEeecceeccC Confidence 9999999999999999999997 899999999999999999999999999999999999999999999999 No 19 >protein:vir:103168 Length: 641 # NCBI annotation: gp122 # Family: family:all:661 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717789;genbank:gi:113200626;genbank:GeneID:4239177 Probab=100.00 E-value=1.2e-96 Score=546.36 Aligned_cols=505 Identities=26% Similarity=0.358 Sum_probs=309.9 Q ss_pred CCCcchhcCCceEEEEecCCCceecccCCceEEEeeeccCcCCCCeEEecCHHHHHHHcCCCCccchhHHHHHHHHHhCC Q lcl|NC_013693. 1 MATQSFSVAPSVQWTERDATLQTSPSVVVQGATVGKFQWGEAELPVLVTGGETGLVKKFFKPNDATATDFLVIADFLSYS 80 (631) Q Consensus 1 m~~~~~ylsPGVyveEv~~~~~~~gv~tsv~afvG~~~~Gp~~~p~~i~s~~~e~~~~fG~~~~~~~~~~av~~fF~ngG 80 (631) |.|+ +|+||||||||+|++.++.||+||++||||+|+|||+++|++|+|| .||+++||++++.+|++|+|++||+||| T Consensus 1 ~~m~-~~~sPGVyv~E~~~~~~i~~v~tsvaafvG~~~~GP~~~p~~v~s~-~d~~~~FG~~~~~~~l~~av~~fF~ngG 78 (641) T protein:vir:10 1 MSVS-NQLSPGVVIQERDLTAVTTPIGLNVGVLAAPFTKGPVEEIFEVSTE-RDLASVFGEPNDYNYEYWFTASQFLSYG 78 (641) T ss_pred CCCc-cccCCceEEEEecCCCcccccCCccceEEecccCCCCCccEEecCH-HHHHHHcCCcCCCcchHHHHHHHHHhcC Confidence 9998 6999999999999876544999999999999999999999999887 7999999999999999999999999999 Q ss_pred ceEEEEEecccCCCcccccccchhhhccccccc---cccccceeeehhhhhhhhhhchhhhhccCcccceeeccceee-- Q lcl|NC_013693. 
81 SVAWVTRVVGPAARNAVTKGQTAILIRNKLDFE---TASPSASITWTGRYAGSLGNDVAINVCDAAGFPTWEFRNNFA-- 155 (631) Q Consensus 81 ~~~~vvRv~~~~a~~a~~~~~~~~~~~~~~~~~---~~~~~~~l~~~a~~~G~~gn~l~v~v~~~~~~~~~~~~~~~~-- 155 (631) ++||||||.+.++.++.... ....+++..... .......+++.|++||.|||.+++.+.+.............. T Consensus 79 ~~~~vvRv~~~~~~~a~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~A~~pG~~gn~l~v~i~~~~~~~~~~~~~~~~~~ 157 (641) T protein:vir:10 79 GVLKAIRLNAASLKNSVDSG-TAPLIKNLQEYETTYESSNSNTFKFASRDAGALGNSVGIFITDAGPDQIAVLPAPGTGN 157 (641) T ss_pred CEEEEEEecCcccccccccc-chhhccccccccccccCcCccccEEEeccCCCcCCceEEEEEcCCCcceeeeecccccc Confidence 99999999988777765443 333344433332 234556789999999999999999998887665443211000 Q ss_pred eec----------ccccceEEeee-------eeeeeeccc-------ccccceeee---------e---eccccccccee Q lcl|NC_013693. 156 YAP----------QAGEYHIVIVD-------KVGRITDSS-------GAVGQVDRI---------S---VSGTATGAGSI 199 (631) Q Consensus 156 ~~~----------~~g~~~~~~~~-------~~~~v~~~~-------~~~~~~~~~---------~---~~~~~~~~~~~ 199 (631) ... ........... ......... +........ . ...+....... T Consensus 158 ~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~~~~~~~~~~a~~~~~~i~~~~~g~~g~~~~ 237 (641) T protein:vir:10 158 EWEFVADEAVTAASGASAKVYRYSIRLTLTNVVGTFVPGGATTISISGSDESVDVLAWDAGNKYLEIALPAGGVTGIFAD 237 (641) T ss_pred cceeccceeeeeccCcccccccccccccccceeeecccCCcceeEecccccccccccccCCcceeeeeecCCcceeeeee Confidence 000 00000000000 000000000 000000000 0 00000000000 Q ss_pred Eeecccccccccccccccccc------ccccccccccccccccccccccccc----------------ccccccccccce Q lcl|NC_013693. 200 SVAGEDVAYTDTDTPATLATK------IGTALTALTDVYSSVVVKSNTVTVT----------------HKAIGPQTVTAI 257 (631) Q Consensus 200 ~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~v~----------------~~~~~~~~~~~~ 257 (631) ...... .....+....... .......+.............+... .....+++.... 
T Consensus 238 ~~~~t~--gt~~~t~a~~g~~~~~~~~~~~~~ia~aat~ag~~g~~~~v~~~~v~d~~~a~~~~~g~~~~~va~~~gts~ 315 (641) T protein:vir:10 238 AQVVTQ--GTNTAAIASSGIERRLYIGKDSGSINFAATDAVVDTNATSATISSVRNEYAEREYLPGSKWVNVAARPGTSL 315 (641) T ss_pred eeeccC--CccceeeecccchhhhhhccccccceeeeecccccccceeeEeeeeeeeecccccccccccccccccchhhh Confidence 000000 0000000000000 0000000000000000000000000 000011111111 Q ss_pred eeccccccccee-ee-------ecccccccceeeeeeeeecccccccchhhhhhhhhccccceeeeeccccc-------- Q lcl|NC_013693. 258 VPDANGLTATAV-TT-------TVGASGSIIEKYELMQATQGSKKSDGSNAYFKDVINDTSNWVYTFATTLA-------- 321 (631) Q Consensus 258 ~~~~~~~~~~~~-~~-------~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------- 321 (631) +....+...+.+ .+ ..+.+++++|++..++...+.+...+...++...++..+.+++....... T Consensus 316 ~a~~~g~~~D~~~~lv~d~~~~~~g~~g~v~e~~~~~s~~~~~~~~~~~~~~~~~~~~~~s~~v~~~~~~~~~~~~~~~~ 395 (641) T protein:vir:10 316 YANSVGGVNDELHVLVIDVDGKITGNPGSVLERFIGVSKASDAKTSIGEVNYYKEVIKQQSAYVYWGSHETAPFLGTAAN 395 (641) T ss_pred hhhhcCCcccceEEEEEeecceeeccccceeeeeecccccCCcccccccceeeeeeeccccceEEEeccccccccccccc Confidence 111111111222 12 23456678888877777777777777777777777776666543211100 Q ss_pred ----------------------------------------ccccccccccccc---------hhhhhhHHHHHhhhhhcc Q lcl|NC_013693. 322 ----------------------------------------AGVTELEGGVDDY---------TGNRVAAIEALNNAEAYD 352 (631) Q Consensus 322 ----------------------------------------~~~~~l~gg~d~~---------~~~~~~~~~~l~~~~~~~ 352 (631) .....+.+|.|.. ..+...++..+...+.++ T Consensus 396 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~gG~d~~~~~~~~~~~~~~~~tg~~~~~~~e~~~ 475 (641) T protein:vir:10 396 AAAGDWGASALNRRYNLLRSTAGTTSFPAGAVTVGSKNNATHYYRLANGADYSASGALYNLSNVDIATAYELIEDPESQV 475 (641) T ss_pred ccccccccccccccccccccccccccccccccccCCCCcceeEEEeecCcccccccccccccchhHHHHHHHhhhhhhhc Confidence 0001345555532 223345566666667666 Q ss_pred cceeEEecc------ccchHHHHHHHHHhhccceEEeecccccccc--cccCCHHHHHHHHHhcCCCcceEEEecCeeEE Q lcl|NC_013693. 
353 AKPVFAFCE------ELIEQQTLIDLSTERKDTVSFVSPLRDVVVG--NRGREMEDVVAWRESLVRDSSYFFMDDNWAYV 424 (631) Q Consensus 353 ~~~~i~~~~------~~~~~~~~~~~~~~~~~~~a~~d~~~~~~~~--~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~ 424 (631) +..+++.+. ..+++.++++|||.|++||+++|+|+..... ......+++++||+.+ .+|+|+++||||+++ T Consensus 476 i~~l~~~~~~~~~~~~~~v~~~~i~~ce~~~d~~ailD~p~~~~~~~~~~~~~~~~~~~~~~~~-~~s~yaa~y~P~~~v 554 (641) T protein:vir:10 476 IDYVLSGPAGADEAAAIAKATTITTIVESRKDCMAFLSPLRSDVIGVSNTTTVTENLVNYFNQL-PSSNYVVFDSGYKYI 554 (641) T ss_pred cceeeecCCCCCcchhHHHHHHHHHHHHhcCCEEEEEcCCcccccCCCchhhHHHHHHHHHhhc-CCCceEEEEeceeEe Confidence 544444332 1347889999999999999999999865433 3334578889999874 589999999999999 Q ss_pred EeccCCceeEeehHHHHHHHHHHhhccCCceecccceeeceeeecccceecCChhHhhhhhhcCceEEEEEcCCcEEEEc Q lcl|NC_013693. 425 YDKYNDKMRWIPACGGTAGVWARSIEIAGIYKSPAFHNRGKYNNYNRMAWSASSDERAVLYRNQINSIVTFSNEGIVLYG 504 (631) Q Consensus 425 ~d~~~~~~~~~p~s~~~ag~~a~~d~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~L~~~gin~i~~~~~~G~~~wg 504 (631) +||.+++.+++||||++||+|||+|.+|||||||||.+++.|+|++++++.+++.|++.||++||||||.|||+|++- T Consensus 555 ~dp~~~~~~~vPpsG~iAGv~ArtD~~rGvwkAPAn~~~~~i~gv~~l~~~~~~~e~~~Lnp~gIN~ir~fpg~G~v~-- 632 (641) T protein:vir:10 555 YDKYNDVYRYVPCNGDIAGLILETGLEEEPWFSPAGFQRGVLRNAVKLAYSPNKTQRDRLYANRINPVVSFPGHAMIN-- 632 (641) T ss_pred ecccCCceeEecCCHHHHHHHHhhhccCCceECcCCcccceeeeeeeeeEecChhHHhhhhhcccceEEecCCceeec-- Confidence 999999999999999999999999999999999999998889999999999999999999999999999999998852 Q ss_pred ceecCCCChhh Q lcl|NC_013693. 505 DKTGLTRPSAF 515 (631) Q Consensus 505 ~rT~~~~~~~~ 515 (631) +.-.-. .+. 
T Consensus 633 ~~~~~~--~~~ 641 (641) T protein:vir:10 633 NNIAFH--TKL 641 (641) T ss_pred ceeeee--ecC Confidence 221100 011 No 20 >protein:vir:6079 Length: 396 # NCBI annotation: gpFI # Family: family:all:115 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878220;genbank:gi:33438919;genbank:GeneID:1457754 Probab=100.00 E-value=3.8e-95 Score=538.22 Aligned_cols=382 Identities=14% Similarity=0.108 Sum_probs=299.0 Q ss_pred CCCcchhcCCceEEEEecCCCcee-cccCCceEEEeeeccC-----cCCCCeEEecCHHHHHHHcCCCCccchhHHHHHH Q lcl|NC_013693. 1 MATQSFSVAPSVQWTERDATLQTS-PSVVVQGATVGKFQWG-----EAELPVLVTGGETGLVKKFFKPNDATATDFLVIA 74 (631) Q Consensus 1 m~~~~~ylsPGVyveEv~~~~~~~-gv~tsv~afvG~~~~G-----p~~~p~~i~s~~~e~~~~fG~~~~~~~~~~av~~ 74 (631) |+. |+ |||||+|++.+++++ +++|++++|||.+++. |.++|++++|+ .+|...||. .+.+.+++.+ T Consensus 1 m~~---~~-~Gv~v~e~~~~~~~v~~~~tav~~fvGta~~~~~~~~p~~~p~~v~s~-~~~~~~~g~---~~tl~~a~~~ 72 (396) T protein:vir:60 1 MSD---YH-HGVQVLEINEGTRVISTVSTAIVGMVCTASDADAEIFPLNKPVLITNV-QSAIAKAGK---KGTLAASLQA 72 (396) T ss_pred CCC---CC-CCeEEEEcCCCcccccccCceeEEEEecccccccccccCccCeEeech-HHHHHhhcC---cchhHHHHHH Confidence 554 86 999999999999888 8999999999999664 89999999887 699999995 6689999999 Q ss_pred HHHhCCceEEEEEecccCCCcccccccchhhhccccccccccccceeeehhhhhhhhhhchhhhhccCcccceeecccee Q lcl|NC_013693. 75 DFLSYSSVAWVTRVVGPAARNAVTKGQTAILIRNKLDFETASPSASITWTGRYAGSLGNDVAINVCDAAGFPTWEFRNNF 154 (631) Q Consensus 75 fF~ngG~~~~vvRv~~~~a~~a~~~~~~~~~~~~~~~~~~~~~~~~l~~~a~~~G~~gn~l~v~v~~~~~~~~~~~~~~~ 154 (631) ||.|||..|||+|+.......... . . ... T Consensus 73 ~~~~gg~~~~vv~~~~~~~~~~~~--------------------~-~--------------~~~---------------- 101 (396) T protein:vir:60 73 IADQSKPVTVVVRVEDGTGEDEET--------------------K-L--------------AQT---------------- 101 (396) T ss_pred HhhccCceEEEEeccccccccccc--------------------c-c--------------ccc---------------- Confidence 999999999999874321000000 0 0 000 Q ss_pred eeecccccceEEeeeeeeeeecccccccceeeeeecccccccceeEeecccccccccccccccccccccccccccccccc Q lcl|NC_013693. 
155 AYAPQAGEYHIVIVDKVGRITDSSGAVGQVDRISVSGTATGAGSISVAGEDVAYTDTDTPATLATKIGTALTALTDVYSS 234 (631) Q Consensus 155 ~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 234 (631) . .. T Consensus 102 ----------------------------------------~-------------------------------------~~ 104 (396) T protein:vir:60 102 ----------------------------------------V-------------------------------------SN 104 (396) T ss_pred ----------------------------------------c-------------------------------------cc Confidence 0 00 Q ss_pred cccccccccccccccccccccceeecccccccceeeeecccccccceeeeeeeeecccccccchhhhhhhhhccccceee Q lcl|NC_013693. 235 VVVKSNTVTVTHKAIGPQTVTAIVPDANGLTATAVTTTVGASGSIIEKYELMQATQGSKKSDGSNAYFKDVINDTSNWVY 314 (631) Q Consensus 235 ~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 314 (631) T Consensus 105 -------------------------------------------------------------------------------- 104 (396) T protein:vir:60 105 -------------------------------------------------------------------------------- 104 (396) T ss_pred -------------------------------------------------------------------------------- Confidence Q ss_pred eecccccccccccccccccchhhhhhHHHHHhhhhhcccceeEEeccc---cchHHHHHHHHHhhccceEEeeccccccc Q lcl|NC_013693. 315 TFATTLAAGVTELEGGVDDYTGNRVAAIEALNNAEAYDAKPVFAFCEE---LIEQQTLIDLSTERKDTVSFVSPLRDVVV 391 (631) Q Consensus 315 ~~~~~~~~~~~~l~gg~d~~~~~~~~~~~~l~~~~~~~~~~~i~~~~~---~~~~~~~~~~~~~~~~~~a~~d~~~~~~~ 391 (631) ..++.+ .++...+........+.....+.++..++ ..++.++.++|++++ +++++|.|. 
T Consensus 105 ------------~~~~~d-~~~~~tg~~al~~~~~~~~~~~~il~ap~~~~~~v~~al~~~~~~~~-~~~i~d~p~---- 166 (396) T protein:vir:60 105 ------------IIGTTD-ENGQYTGLKALLAAESVTGVKPRILGVPGLDTKEVAVALASVCQKLR-AFGYISAWG---- 166 (396) T ss_pred ------------cccccc-ccccccchhhhhhcccceeeeeeeccccccccHHHHHHHHHHhccCC-eEEEEeCCC---- Confidence 000000 00000000000000011111122222222 236788899998765 788888763 Q ss_pred ccccCCHHHHHHHHHhcCCCcceEEEecCeeEEEeccCCceeEeehHHHHHHHHHHhhccCCceecccceeeceeeeccc Q lcl|NC_013693. 392 GNRGREMEDVVAWRESLVRDSSYFFMDDNWAYVYDKYNDKMRWIPACGGTAGVWARSIEIAGIYKSPAFHNRGKYNNYNR 471 (631) Q Consensus 392 ~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~ag~~a~~d~~~g~~~span~~~~~i~g~~~ 471 (631) ..+++++++||+++ +|+|+++||||++++|+.++..+++|||+++||++||+|.++|+|+||||+++.++.+... T Consensus 167 ---~~~~~~a~~~~~~~--~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~AG~~a~~d~~~g~~~spaN~~l~gi~~~~~ 241 (396) T protein:vir:60 167 ---CKTISEVKAYRQNF--SQRELMVIWPDFLAWDTVASTTATAYATARALGLRAKIDQEQGWHKTLSNVGVNGVTGISA 241 (396) T ss_pred ---CCCHHHHHHHHhhc--CCceEEEEeCceeeecccCCceeEEchhHHHHHHHHHhhhccCcEeCcCCceecceeecee Confidence 57889999999976 5889999999999999999999999999999999999999999999999998665554321 Q ss_pred -c--eecCChhHhhhhhhcCceEEEEEcCCcEEEEcceecCCCChhhceehhhHHHHHHHHHHHHHHHHHhcCCCCHHHH Q lcl|NC_013693. 
472 -M--AWSASSDERAVLYRNQINSIVTFSNEGIVLYGDKTGLTRPSAFDRINVRGLFIMAEQNIAAIAKYYLGENNDEFTR 548 (631) Q Consensus 472 -~--~~~~~~~~~~~L~~~gin~i~~~~~~G~~~wg~rT~~~~~~~~~~i~vrR~~~~i~~~~~~~~~~~v~epn~~~~~ 548 (631) + ....++.|++.||++|||+++ +++|+++||+||+++ |++|+||+|||++++|+++|+++++|+|||||++.+| T Consensus 242 ~~~~~~~~~~~~~~~Ln~~gI~~~~--~~~G~~~wG~rT~~~-d~~~~~i~~rR~~~~i~~~i~~~~~~~v~e~n~~~~~ 318 (396) T protein:vir:60 242 SVFWDLQESGTDADLLNESGVTTLI--RRDGFRFWGNRTCSD-DPLFLFENYTRTAQVLADTMAEAHMWAVDKPITATLI 318 (396) T ss_pred ecccccCCCcchhhhhhhcCcEEEE--cCCCEEEEcccccCC-CcccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHH Confidence 2 233456789999999999995 578999999999866 6789999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHhCCceeeeEEEEccCCCCHHHhhCCeEEEEEEEEecCCceEEEEEEEEEecCce--eeeeecc Q lcl|NC_013693. 549 SLFSNAVRPYIRQLANMGAIYDGQVKCDADNNTADIIAANQMVAGIWLKPEYSINWVYLDFAAVRPDME--FSEIETG 624 (631) Q Consensus 549 ~~i~~~i~~~l~~l~~~gal~g~~v~~~~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~--~~e~~~~ 624 (631) ++|+++|+.||++||++|+|+||+|+||+++||+++|++|+|+++|+++|++|+|||+++++++.+..+ |+|+.+. T Consensus 319 ~~i~~~i~~~l~~l~~~gal~g~~~~~d~~~nt~~~i~~G~~~~~i~~~p~~pae~I~~~~~~~~~~~~~~~~~~~~~ 396 (396) T protein:vir:60 319 RDIVDGINAKFRELKTNGYIVDATCWFSEESNDAETLKAGKLYIDYDYTPVPPLENLTLRQRITDKYLANLVTSVNSN 396 (396) T ss_pred HHHHHHHHHHHHHHHhCCceeceEEEEecCCCCHHHhhCCEEEEEEEEEecCCcceEEEEEEEchHHHHHHHHHhhcC Confidence 999999999999999999999999999999999999999999999999999999999999999887444 6666655 No 21 >protein:vir:79181 Length: 390 # NCBI annotation: gp30, phage tail sheath protein # Family: family:all:115 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111061;genbank:gi:134288772;genbank:GeneID:4960700 Probab=100.00 E-value=2.5e-95 Score=539.20 Aligned_cols=377 Identities=17% Similarity=0.133 Sum_probs=294.6 Q ss_pred CCCcchhcCCceEEEEecCCCcee-cccCCceEEEeeeccC-----cCCCCeEEecCHHHHHHHcCCCCccchhHHHHHH Q lcl|NC_013693. 
1 MATQSFSVAPSVQWTERDATLQTS-PSVVVQGATVGKFQWG-----EAELPVLVTGGETGLVKKFFKPNDATATDFLVIA 74 (631) Q Consensus 1 m~~~~~ylsPGVyveEv~~~~~~~-gv~tsv~afvG~~~~G-----p~~~p~~i~s~~~e~~~~fG~~~~~~~~~~av~~ 74 (631) |++ |++|||||+|++.+++++ +++|++++|||.++++ |+++|++|+|+ .+|...||. ..++.+++.+ T Consensus 1 M~~---~~~~Gv~v~e~~~~~~~i~~~~tav~~~vg~a~dad~~~~p~n~pv~its~-~~~~~~~g~---~~tL~~al~~ 73 (390) T protein:vir:79 1 MPQ---DYHHGVRVIEINEGGRPIRSVSTAVLGVVCTAADADASAFPLNTPVLLTNV-VAALGKAGK---KGTLRRTLDA 73 (390) T ss_pred Ccc---ccCCCeEEEEcCCCcccccccCCceeEEEEecCCCCccccccccceEeecH-HHHHHhcCC---Cccchhhhhh Confidence 666 678999999999999888 7999999999999886 89999999776 699999985 6778999999 Q ss_pred HHHhCCceEEEEEecccCCCcccccccchhhhccccccccccccceeeehhhhhhhhhhchhhhhccCcccceeecccee Q lcl|NC_013693. 75 DFLSYSSVAWVTRVVGPAARNAVTKGQTAILIRNKLDFETASPSASITWTGRYAGSLGNDVAINVCDAAGFPTWEFRNNF 154 (631) Q Consensus 75 fF~ngG~~~~vvRv~~~~a~~a~~~~~~~~~~~~~~~~~~~~~~~~l~~~a~~~G~~gn~l~v~v~~~~~~~~~~~~~~~ 154 (631) ||.|||+.|||||+.......... . + T Consensus 74 ~~~~~~~~~~vv~v~~~~~~~~~~----------------------~-------------------~------------- 99 (390) T protein:vir:79 74 IGKQTKPLTVVVRVAEGKDADETT----------------------S-------------------N------------- 99 (390) T ss_pred hcccccceEEEEeecccccccccc----------------------c-------------------e------------- Confidence 999999999999984321100000 0 0 Q ss_pred eeecccccceEEeeeeeeeeecccccccceeeeeecccccccceeEeecccccccccccccccccccccccccccccccc Q lcl|NC_013693. 155 AYAPQAGEYHIVIVDKVGRITDSSGAVGQVDRISVSGTATGAGSISVAGEDVAYTDTDTPATLATKIGTALTALTDVYSS 234 (631) Q Consensus 155 ~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 234 (631) ..+.... 
T Consensus 100 ----------------------------------~ig~~~~--------------------------------------- 106 (390) T protein:vir:79 100 ----------------------------------VIGTVTP--------------------------------------- 106 (390) T ss_pred ----------------------------------eeecccc--------------------------------------- Confidence 0000000 Q ss_pred cccccccccccccccccccccceeecccccccceeeeecccccccceeeeeeeeecccccccchhhhhhhhhccccceee Q lcl|NC_013693. 235 VVVKSNTVTVTHKAIGPQTVTAIVPDANGLTATAVTTTVGASGSIIEKYELMQATQGSKKSDGSNAYFKDVINDTSNWVY 314 (631) Q Consensus 235 ~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 314 (631) T Consensus 107 -------------------------------------------------------------------------------- 106 (390) T protein:vir:79 107 -------------------------------------------------------------------------------- 106 (390) T ss_pred -------------------------------------------------------------------------------- Confidence Q ss_pred eecccccccccccccccccchhhhhhHHHHHhhhhhcccceeEEeccc---cchHHHHHHHHHhhccceEEeeccccccc Q lcl|NC_013693. 315 TFATTLAAGVTELEGGVDDYTGNRVAAIEALNNAEAYDAKPVFAFCEE---LIEQQTLIDLSTERKDTVSFVSPLRDVVV 391 (631) Q Consensus 315 ~~~~~~~~~~~~l~gg~d~~~~~~~~~~~~l~~~~~~~~~~~i~~~~~---~~~~~~~~~~~~~~~~~~a~~d~~~~~~~ 391 (631) .+..++..........+...|.+++++. ..++.++..+|+++ ++++++|.| T Consensus 107 --------------------~~~~tgl~al~~~~~~~~~~p~il~ap~~~~~~v~~~l~~~a~~~-~~~ai~D~p----- 160 (390) T protein:vir:79 107 --------------------DGKYTGIKALLAAQGALGVKPRILAAPGLDTQPVAAALAATAQSL-RAMAYVSAS----- 160 (390) T ss_pred --------------------cccchhhhhhhhhhhhhccccccccCCcccchHHHHHHHHhhhhc-ceEEEEEcc----- Confidence 0000000000000000011111111111 12456677888866 479999876 Q ss_pred ccccCCHHHHHHHHHhcCCCcceEEEecCeeEEEeccCCceeEeehHHHHHHHHHHhhccCCceecccceeeceeeeccc Q lcl|NC_013693. 
392 GNRGREMEDVVAWRESLVRDSSYFFMDDNWAYVYDKYNDKMRWIPACGGTAGVWARSIEIAGIYKSPAFHNRGKYNNYNR 471 (631) Q Consensus 392 ~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~ag~~a~~d~~~g~~~span~~~~~i~g~~~ 471 (631) .+.+.+++++||+.+ +|+|+++||||++++|+.++..+++|||+++||++||+|.++|||+||||+++.++.++.. T Consensus 161 --~~~t~~~a~~~~~~~--~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~Ag~~a~~D~~~g~~~spsN~~i~gi~~~~~ 236 (390) T protein:vir:79 161 --GCKTKEEAAAYRRQF--GQREIMVIWPDWLGWDDTTNSTAVIPAPAIAAGLRAKIDNDIGWHKTISNVVVNGVSGISA 236 (390) T ss_pred --CCCCHHHHHHHhcCC--CCceEEEEcCceeecccccCceeEeehHHHHHHHHHhhhccCCcEEccCCceeeccceeee Confidence 356788999999866 6899999999999999999999999999999999999999999999999998655544321 Q ss_pred -ceecC--ChhHhhhhhhcCceEEEEEcCCcEEEEcceecCCCChhhceehhhHHHHHHHHHHHHHHHHHhcCCCCHHHH Q lcl|NC_013693. 472 -MAWSA--SSDERAVLYRNQINSIVTFSNEGIVLYGDKTGLTRPSAFDRINVRGLFIMAEQNIAAIAKYYLGENNDEFTR 548 (631) Q Consensus 472 -~~~~~--~~~~~~~L~~~gin~i~~~~~~G~~~wg~rT~~~~~~~~~~i~vrR~~~~i~~~~~~~~~~~v~epn~~~~~ 548 (631) ..+.. .+.|++.||++||++++ +++|+++||+||+++ |++|+||+||||+++|+++|+++++|+|||||++.+| T Consensus 237 ~~~~~~~~~~~~a~~Ln~~gi~t~~--~~~G~~~wG~rT~~~-d~~~~~i~vrR~~~~i~~~i~~~~~~~v~e~~~~~~~ 313 (390) T protein:vir:79 237 DVSWDLQDPATDAGYLNEHEVTTLV--NRNGFRFWGERTCSD-DPKFAFENYTRTAQVAADSIAEAQMPVVDGPLNPSLA 313 (390) T ss_pred eccccccccchhhhhhhhcCcEEEE--cCCCEEEEeccccCC-CcccceeeehhhHHHHHHHHHHHHHHhccCCCCHHHH Confidence 12222 34577899999999986 478999999999865 6789999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHhCCceeeeEEEEccCCCCHHHhhCCeEEEEEEEEecCCceEEEEEEEEEecCce-eeeeecc Q lcl|NC_013693. 
549 SLFSNAVRPYIRQLANMGAIYDGQVKCDADNNTADIIAANQMVAGIWLKPEYSINWVYLDFAAVRPDME-FSEIETG 624 (631) Q Consensus 549 ~~i~~~i~~~l~~l~~~gal~g~~v~~~~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~-~~e~~~~ 624 (631) ++|+++++.||++||++|+|+||+|+||+++||+++|++|+|+++|+++|++|+|||+|++.+..+..+ +.+...| T Consensus 314 ~~i~~~i~~~L~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~~~~v~~ 390 (390) T protein:vir:79 314 RDIVESINGWFRQQVANGYLIGGSAWIDPEPNTADILASGKAYIDYDYTPVPPLENLVLRQRITDRFLADFPARVAG 390 (390) T ss_pred HHHHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHhhCCEEEEEEEEEecCCcceEEEEEEEchHHHHHHHHHhcC Confidence 999999999999999999999999999999999999999999999999999999999999998877643 3333344 No 22 >protein:vir:79141 Length: 391 # NCBI annotation: major tail seath protein gpFI # Family: family:all:115 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165276;genbank:gi:145708101;genbank:GeneID:5247152 Probab=100.00 E-value=3e-95 Score=538.76 Aligned_cols=374 Identities=16% Similarity=0.137 Sum_probs=296.2 Q ss_pred CCCcchhcCCceEEEEecCCCcee-cccCCceEEEeeec-----cCcCCCCeEEecCHHHHHHHcCCCCccchhHHHHHH Q lcl|NC_013693. 1 MATQSFSVAPSVQWTERDATLQTS-PSVVVQGATVGKFQ-----WGEAELPVLVTGGETGLVKKFFKPNDATATDFLVIA 74 (631) Q Consensus 1 m~~~~~ylsPGVyveEv~~~~~~~-gv~tsv~afvG~~~-----~Gp~~~p~~i~s~~~e~~~~fG~~~~~~~~~~av~~ 74 (631) |++ |.+|||||+|++.+++++ +++|++++|||.++ .+|+++|++++|+ .||...||. ..++.+++.+ T Consensus 1 M~~---~~~pGv~v~e~~~~~~~i~~~~tav~~~vg~a~~a~~~~~p~n~pv~iss~-~~~~~~~g~---~gtl~~al~~ 73 (391) T protein:vir:79 1 MPT---DYHHGVRVVELNDGTRPIRTIETAVAGIVCTADDADAATFPLDTPVLLTNP-QAYIGKAGD---KGTLAHTLDA 73 (391) T ss_pred CCC---CCCCCeEEEECCCCcccccccCCceEEEEeecccccccccccccCEEeccH-HHHHHhcCC---ccccchhhhh Confidence 665 458999999999999888 89999999999986 6899999999775 799999996 6788999999 Q ss_pred HHHhCCceEEEEEecccCCCcccccccchhhhccccccccccccceeeehhhhhhhhhhchhhhhccCcccceeecccee Q lcl|NC_013693. 
75 DFLSYSSVAWVTRVVGPAARNAVTKGQTAILIRNKLDFETASPSASITWTGRYAGSLGNDVAINVCDAAGFPTWEFRNNF 154 (631) Q Consensus 75 fF~ngG~~~~vvRv~~~~a~~a~~~~~~~~~~~~~~~~~~~~~~~~l~~~a~~~G~~gn~l~v~v~~~~~~~~~~~~~~~ 154 (631) ||.|||..||++++.......... ..+ T Consensus 74 ~~~~gg~~~~vv~~~~~~~~~~~~----------------------------------~~~------------------- 100 (391) T protein:vir:79 74 ITDQTNPLTVVVRVAGGASEAETT----------------------------------SNL------------------- 100 (391) T ss_pred hhcccccceeeecccccccccccc----------------------------------ccc------------------- Confidence 999999999999874321100000 000 Q ss_pred eeecccccceEEeeeeeeeeecccccccceeeeeecccccccceeEeecccccccccccccccccccccccccccccccc Q lcl|NC_013693. 155 AYAPQAGEYHIVIVDKVGRITDSSGAVGQVDRISVSGTATGAGSISVAGEDVAYTDTDTPATLATKIGTALTALTDVYSS 234 (631) Q Consensus 155 ~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 234 (631) .+.... .....+...+ . T Consensus 101 -----------------------------------~g~~~~-----------------------~~~~tGl~~l---~-- 117 (391) T protein:vir:79 101 -----------------------------------IGTTNA-----------------------AGRYTGMKAL---L-- 117 (391) T ss_pred -----------------------------------cccccc-----------------------hhhhHHHhhh---h-- Confidence 000000 0000000000 0 Q ss_pred cccccccccccccccccccccceeecccccccceeeeecccccccceeeeeeeeecccccccchhhhhhhhhccccceee Q lcl|NC_013693. 235 VVVKSNTVTVTHKAIGPQTVTAIVPDANGLTATAVTTTVGASGSIIEKYELMQATQGSKKSDGSNAYFKDVINDTSNWVY 314 (631) Q Consensus 235 ~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 314 (631) T Consensus 118 -------------------------------------------------------------------------------- 117 (391) T protein:vir:79 118 -------------------------------------------------------------------------------- 117 (391) T ss_pred -------------------------------------------------------------------------------- Confidence Q ss_pred eecccccccccccccccccchhhhhhHHHHHhhhhhcccceeEEecc---ccchHHHHHHHHHhhccceEEeeccccccc Q lcl|NC_013693. 
315 TFATTLAAGVTELEGGVDDYTGNRVAAIEALNNAEAYDAKPVFAFCE---ELIEQQTLIDLSTERKDTVSFVSPLRDVVV 391 (631) Q Consensus 315 ~~~~~~~~~~~~l~gg~d~~~~~~~~~~~~l~~~~~~~~~~~i~~~~---~~~~~~~~~~~~~~~~~~~a~~d~~~~~~~ 391 (631) .........+.++..+ ...++.+++++|++++ +++++|.| T Consensus 118 -------------------------------~~~~~~~~~p~~l~~p~~~~~~v~~al~~~~~~~~-~~ai~d~p----- 160 (391) T protein:vir:79 118 -------------------------------TARNRFGVAPRILAVPGLDSLPVGTELVTIAQKLR-AFAYLSAY----- 160 (391) T ss_pred -------------------------------hhhhhhcccchhhcCCccchhHHHHHHHHHHhhcC-cEEEEECC----- Confidence 0000000000000000 1135667888998875 67888876 Q ss_pred ccccCCHHHHHHHHHhcCCCcceEEEecCeeEEEeccCCceeEeehHHHHHHHHHHhhccCCceecccceeeceeeeccc Q lcl|NC_013693. 392 GNRGREMEDVVAWRESLVRDSSYFFMDDNWAYVYDKYNDKMRWIPACGGTAGVWARSIEIAGIYKSPAFHNRGKYNNYNR 471 (631) Q Consensus 392 ~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~ag~~a~~d~~~g~~~span~~~~~i~g~~~ 471 (631) .+.+.+++++||+.+ +|+|+++||||++++|+.++..+++|||+++||++||+|.++|||+||+|+++.++ .+ T Consensus 161 --~~~t~~~a~~~~~~~--~s~~~a~~~P~~~~~d~~~~~~~~~p~s~~~AG~~a~~D~~~g~~~spaN~~l~gi---~~ 233 (391) T protein:vir:79 161 --GCQTKEEAVAYRSNF--GQREAMVMWPDFVGWDTAANAETTLWATARAVGLRAKIDNDTGWHKTLSNVAVGGV---TG 233 (391) T ss_pred --CCCCHHHHHHHHhcc--CCceeEEecceeeeecCcCCceeeechHHHHHHHHHHhhhcccceeccCCceehhh---hc Confidence 357889999999976 58899999999999999999999999999999999999999999999999985544 44 Q ss_pred ceecC------ChhHhhhhhhcCceEEEEEcCCcEEEEcceecCCCChhhceehhhHHHHHHHHHHHHHHHHHhcCCCCH Q lcl|NC_013693. 
472 MAWSA------SSDERAVLYRNQINSIVTFSNEGIVLYGDKTGLTRPSAFDRINVRGLFIMAEQNIAAIAKYYLGENNDE 545 (631) Q Consensus 472 ~~~~~------~~~~~~~L~~~gin~i~~~~~~G~~~wg~rT~~~~~~~~~~i~vrR~~~~i~~~~~~~~~~~v~epn~~ 545 (631) +...+ ...|.+.||++|||+++ +++|+++||+||+++ |++|+||+|||++++|+++|+++++|+|||||++ T Consensus 234 ~~~~~~~~~~~~~~~~~~Ln~~~I~t~~--~~~G~~~wG~rT~~~-d~~~~~i~~rR~~~~i~~~i~~~~~~~v~epn~~ 310 (391) T protein:vir:79 234 LSRDVFWDLQDPATDAGYLNANEVTTLV--HRDGYRFWGSRTCSA-DPLFAFENYTRTAQVLADTMAEAHMWANDLPMTP 310 (391) T ss_pred cccccccccccccchhhhhhhcCceEEE--CCCcEEEEcccccCC-CcccceeehhhHHHHHHHHHHHHHHHhccCCCCH Confidence 44333 34578899999999986 478999999999865 6799999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHhCCceeeeEEEEccCCCCHHHhhCCeEEEEEEEEecCCceEEEEEEEEEecCce--eeeeec Q lcl|NC_013693. 546 FTRSLFSNAVRPYIRQLANMGAIYDGQVKCDADNNTADIIAANQMVAGIWLKPEYSINWVYLDFAAVRPDME--FSEIET 623 (631) Q Consensus 546 ~~~~~i~~~i~~~l~~l~~~gal~g~~v~~~~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~--~~e~~~ 623 (631) .+|++|+++++.||++||++|+|.||+|+||+++||+++|++|+|+++|+++|++|+|||+|++++.....+ |+++.. T Consensus 311 ~~~~~i~~~i~~~l~~l~~~g~l~g~~v~~~~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~~~~v~~ 390 (391) T protein:vir:79 311 TLVRDLLEGINAKLRMLTRNGYLLGGAAWFDADANSKDTLKAGQLAIDYDYTPVPPLENLTFRQRITDRYLMQFAEAVKA 390 (391) T ss_pred HHHHHHHHHHHHHHHHHHhCCceeceEEEEecCCCCHHHhhCCEEEEEEEEEecCCcceEEEEEEEchHHHHHHHHHhhc Confidence 999999999999999999999999999999999999999999999999999999999999999999988766 666766 Q ss_pred c Q lcl|NC_013693. 
624 G 624 (631) Q Consensus 624 ~ 624 (631) + T Consensus 391 a 391 (391) T protein:vir:79 391 A 391 (391) T ss_pred C Confidence 6 No 23 >protein:vir:98553 Length: 395 # NCBI annotation: gp21 # Family: family:all:115 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958076;genbank:gi:41057373;genbank:GeneID:2744224 Probab=100.00 E-value=1.4e-94 Score=535.18 Aligned_cols=381 Identities=14% Similarity=0.114 Sum_probs=304.2 Q ss_pred CCCcchhcCCceEEEEecCCCcee-cccCCceEEEeeeccC-----cCCCCeEEecCHHHHHHHcCCCCccchhHHHHHH Q lcl|NC_013693. 1 MATQSFSVAPSVQWTERDATLQTS-PSVVVQGATVGKFQWG-----EAELPVLVTGGETGLVKKFFKPNDATATDFLVIA 74 (631) Q Consensus 1 m~~~~~ylsPGVyveEv~~~~~~~-gv~tsv~afvG~~~~G-----p~~~p~~i~s~~~e~~~~fG~~~~~~~~~~av~~ 74 (631) |+ +|+ |||||+|++.+++++ +|+|++++|||.++.+ |+++|++|+|+ .||+..||. ..++..++++ T Consensus 1 m~---~~~-~GV~v~e~~~g~~~v~~v~tav~~~vgta~~~~~~~~p~~~pv~v~s~-~~~~~~~g~---~~tl~~al~~ 72 (395) T protein:vir:98 1 MS---DFH-HGTQVIEINDGTRVISTVATAVVGMVCTASDADATLFPLNEPVLITNV-QSAIAKAGK---KGTLAASLQA 72 (395) T ss_pred CC---CCC-CCeEEEEcCCCcccccccCcceEEEEeeccCCCccccccccceEeech-HHhHhhccc---ccchhhHHHH Confidence 44 475 799999999999887 7999999999999865 88999999776 799999996 5788999999 Q ss_pred HHHhCCceEEEEEecccCCCcccccccchhhhccccccccccccceeeehhhhhhhhhhchhhhhccCcccceeecccee Q lcl|NC_013693. 75 DFLSYSSVAWVTRVVGPAARNAVTKGQTAILIRNKLDFETASPSASITWTGRYAGSLGNDVAINVCDAAGFPTWEFRNNF 154 (631) Q Consensus 75 fF~ngG~~~~vvRv~~~~a~~a~~~~~~~~~~~~~~~~~~~~~~~~l~~~a~~~G~~gn~l~v~v~~~~~~~~~~~~~~~ 154 (631) ||.|||+.|||+|+.......... .. .. 
T Consensus 73 ~~~~~~~~~~vv~~~~~~~~~~~~---------------------~~--------------a~----------------- 100 (395) T protein:vir:98 73 IADQSKPVTVVVRVEDGTGDDEEA---------------------AL--------------AQ----------------- 100 (395) T ss_pred HhhccCceEEEeeccccccccccc---------------------cc--------------cc----------------- Confidence 999999999999874321100000 00 00 Q ss_pred eeecccccceEEeeeeeeeeecccccccceeeeeecccccccceeEeecccccccccccccccccccccccccccccccc Q lcl|NC_013693. 155 AYAPQAGEYHIVIVDKVGRITDSSGAVGQVDRISVSGTATGAGSISVAGEDVAYTDTDTPATLATKIGTALTALTDVYSS 234 (631) Q Consensus 155 ~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 234 (631) T Consensus 101 -------------------------------------------------------------------------------- 100 (395) T protein:vir:98 101 -------------------------------------------------------------------------------- 100 (395) T ss_pred -------------------------------------------------------------------------------- Confidence Q ss_pred cccccccccccccccccccccceeecccccccceeeeecccccccceeeeeeeeecccccccchhhhhhhhhccccceee Q lcl|NC_013693. 235 VVVKSNTVTVTHKAIGPQTVTAIVPDANGLTATAVTTTVGASGSIIEKYELMQATQGSKKSDGSNAYFKDVINDTSNWVY 314 (631) Q Consensus 235 ~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 314 (631) T Consensus 101 -------------------------------------------------------------------------------- 100 (395) T protein:vir:98 101 -------------------------------------------------------------------------------- 100 (395) T ss_pred -------------------------------------------------------------------------------- Confidence Q ss_pred eecccccccccccccccccchhhhhhHHHHHhhhhhcccceeEEecccc---chHHHHHHHHHhhccceEEeeccccccc Q lcl|NC_013693. 315 TFATTLAAGVTELEGGVDDYTGNRVAAIEALNNAEAYDAKPVFAFCEEL---IEQQTLIDLSTERKDTVSFVSPLRDVVV 391 (631) Q Consensus 315 ~~~~~~~~~~~~l~gg~d~~~~~~~~~~~~l~~~~~~~~~~~i~~~~~~---~~~~~~~~~~~~~~~~~a~~d~~~~~~~ 391 (631) ....+.++ ....+..++.+......+.....+.++.+|++ .++.++.++|++++ +++++|.|. 
T Consensus 101 --------~~~~i~g~-~~~~~~~Tgl~al~~~~~~~~~~p~il~ap~~~~~~v~~al~~~~~~~~-~~~~~d~p~---- 166 (395) T protein:vir:98 101 --------TVSNIIGG-TDENGKYTGIKALLTAQAVTGVKPRILGVPGLDTKEVAVALASAAIKLR-AFAYVSAWG---- 166 (395) T ss_pred --------cccccccc-cccccchhHHHHHhhhhhhhccchhhcccccccccHHHHHHHHHhhhcC-cEEEEEcCC---- Confidence 00000000 00011111122222222223333434434433 35677888998764 788988763 Q ss_pred ccccCCHHHHHHHHHhcCCCcceEEEecCeeEEEeccCCceeEeehHHHHHHHHHHhhccCCceecccceeeceeeeccc Q lcl|NC_013693. 392 GNRGREMEDVVAWRESLVRDSSYFFMDDNWAYVYDKYNDKMRWIPACGGTAGVWARSIEIAGIYKSPAFHNRGKYNNYNR 471 (631) Q Consensus 392 ~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~ag~~a~~d~~~g~~~span~~~~~i~g~~~ 471 (631) +.+++++++||+++ +|+|+++||||++++|+.++..+++|||+++||++||+|.++|+||||+|+++.++.+... T Consensus 167 ---~~t~~~a~~~~~~~--~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~AG~~a~~d~~~g~~~spaN~~i~gi~~~~~ 241 (395) T protein:vir:98 167 ---CKTISEAMEYRKNF--SQRELMVIWPDFLAWDTVKNTTATAYATARALGLRAYIDQTVGWHKTLSNVGVQGVTGISA 241 (395) T ss_pred ---CCCHHHHHHHHhcc--CCceEEEEecceeEecccCCceeeechHHHHHHHHHHhhcccCcEeccCCceeecccccce Confidence 56889999999876 5899999999999999999999999999999999999999999999999998665555432 Q ss_pred -ce--ecCChhHhhhhhhcCceEEEEEcCCcEEEEcceecCCCChhhceehhhHHHHHHHHHHHHHHHHHhcCCCCHHHH Q lcl|NC_013693. 472 -MA--WSASSDERAVLYRNQINSIVTFSNEGIVLYGDKTGLTRPSAFDRINVRGLFIMAEQNIAAIAKYYLGENNDEFTR 548 (631) Q Consensus 472 -~~--~~~~~~~~~~L~~~gin~i~~~~~~G~~~wg~rT~~~~~~~~~~i~vrR~~~~i~~~~~~~~~~~v~epn~~~~~ 548 (631) +. 
...++.|++.||++|||+++ +++|+++||+||+++ |++|+||++||++++|+++|++.++|++||||++.+| T Consensus 242 ~~~~~~~~~~~~~~~Ln~~gI~~~~--~~~G~~~wG~rT~s~-d~~~~~i~~rR~~~~i~~~i~~~~~~~v~e~~~~~~~ 318 (395) T protein:vir:98 242 SVFWDLQASGTDADLLNEAGVTTLV--RKDGFRFWGNRTCSD-DPLFLFENYTRTAQVLADTMAEAHMWAVDKPITATLI 318 (395) T ss_pred ecccccCCCcchHHhhhhcCcEEEE--cCCCEEEEcccccCC-CcccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHH Confidence 22 33457899999999999995 578999999999865 6799999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHhCCceeeeEEEEccCCCCHHHhhCCeEEEEEEEEecCCceEEEEEEEEEecCce--eeeeec Q lcl|NC_013693. 549 SLFSNAVRPYIRQLANMGAIYDGQVKCDADNNTADIIAANQMVAGIWLKPEYSINWVYLDFAAVRPDME--FSEIET 623 (631) Q Consensus 549 ~~i~~~i~~~l~~l~~~gal~g~~v~~~~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~--~~e~~~ 623 (631) ++|+++++.||++||++|+|+||+|+||+++||+++|++|+|+++|+++|++|+|||+++++++.++.+ |+|+.+ T Consensus 319 ~~i~~~i~~~L~~l~~~g~l~g~~v~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~I~~~~~~~~~~~~~~~~~~~~ 395 (395) T protein:vir:98 319 RDIVDGINAKFRELKSNGYIVEGKCWFDEESNDKETLKAGKLYIDYDYTPVPPLESLTLRQRITDKYLVNLAESVNS 395 (395) T ss_pred HHHHHHHHHHHHHHHhCCceeceEEEEecCCCCHHHhhCCeEEEEEEEEecCCcceEEEEEEEchHHHHHHHHHhcC Confidence 999999999999999999999999999999999999999999999999999999999999999988866 777766 No 24 >protein:vir:1845 Length: 392 # NCBI annotation: tail sheath protein FI # Family: family:all:115 # MgeID: mge:324 # MgeName: 186 # Cross-refs: genbank:acc:NP_052269;genbank:gi:9634076;genbank:GeneID:1262448 Probab=100.00 E-value=1.7e-94 Score=534.69 Aligned_cols=379 Identities=15% Similarity=0.115 Sum_probs=302.4 Q ss_pred CCCcchhcCCceEEEEecCCCcee-cccCCceEEEeeeccC-----cCCCCeEEecCHHHHHHHcCCCCccchhHHHHHH Q lcl|NC_013693. 1 MATQSFSVAPSVQWTERDATLQTS-PSVVVQGATVGKFQWG-----EAELPVLVTGGETGLVKKFFKPNDATATDFLVIA 74 (631) Q Consensus 1 m~~~~~ylsPGVyveEv~~~~~~~-gv~tsv~afvG~~~~G-----p~~~p~~i~s~~~e~~~~fG~~~~~~~~~~av~~ 74 (631) |+ +|+ |||||+|++.+++++ +++|++.+|||.++++ |+++|++++++ .+|...||. 
.+.+.+++.+ T Consensus 1 m~---~~~-~Gv~v~e~~~g~~~i~~~~tav~g~vgta~~~~~~~~~~~~p~~its~-~~~~~~~g~---~gtl~~al~~ 72 (392) T protein:vir:18 1 MS---DFH-HGTKVIEINDGTRVISTVATAIVGMVWTASDADAETFPLNEPVLITNV-QSAIAKAGK---KGTLSASLQA 72 (392) T ss_pred CC---CCC-CCeEEEEcCCCceeeeccCcceeEEEEeccCCCCcccccccceEeech-HHHHhhcCC---CcchHHHHHH Confidence 44 586 699999999999988 7999999999999876 89999999887 699999986 6678999999 Q ss_pred HHHhCCceEEEEEecccCCCcccccccchhhhccccccccccccceeeehhhhhhhhhhchhhhhccCcccceeecccee Q lcl|NC_013693. 75 DFLSYSSVAWVTRVVGPAARNAVTKGQTAILIRNKLDFETASPSASITWTGRYAGSLGNDVAINVCDAAGFPTWEFRNNF 154 (631) Q Consensus 75 fF~ngG~~~~vvRv~~~~a~~a~~~~~~~~~~~~~~~~~~~~~~~~l~~~a~~~G~~gn~l~v~v~~~~~~~~~~~~~~~ 154 (631) ||.|||..|+++|+......... .+. T Consensus 73 ~~~ngg~~~~vv~v~~~~~~~~~--------------------------------------~~t---------------- 98 (392) T protein:vir:18 73 IADQSKPVTVVVRVAEGTGDDAE--------------------------------------AQT---------------- 98 (392) T ss_pred hhcccCceEEEeccccccccccc--------------------------------------ccc---------------- Confidence 99999999999986321100000 000 Q ss_pred eeecccccceEEeeeeeeeeecccccccceeeeeecccccccceeEeecccccccccccccccccccccccccccccccc Q lcl|NC_013693. 155 AYAPQAGEYHIVIVDKVGRITDSSGAVGQVDRISVSGTATGAGSISVAGEDVAYTDTDTPATLATKIGTALTALTDVYSS 234 (631) Q Consensus 155 ~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 234 (631) T Consensus 99 -------------------------------------------------------------------------------- 98 (392) T protein:vir:18 99 -------------------------------------------------------------------------------- 98 (392) T ss_pred -------------------------------------------------------------------------------- Confidence Q ss_pred cccccccccccccccccccccceeecccccccceeeeecccccccceeeeeeeeecccccccchhhhhhhhhccccceee Q lcl|NC_013693. 
235 VVVKSNTVTVTHKAIGPQTVTAIVPDANGLTATAVTTTVGASGSIIEKYELMQATQGSKKSDGSNAYFKDVINDTSNWVY 314 (631) Q Consensus 235 ~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 314 (631) T Consensus 99 -------------------------------------------------------------------------------- 98 (392) T protein:vir:18 99 -------------------------------------------------------------------------------- 98 (392) T ss_pred -------------------------------------------------------------------------------- Confidence Q ss_pred eecccccccccccccccccchhhhhhHHHHHhhhhhcccceeEEecccc---chHHHHHHHHHhhccceEEeeccccccc Q lcl|NC_013693. 315 TFATTLAAGVTELEGGVDDYTGNRVAAIEALNNAEAYDAKPVFAFCEEL---IEQQTLIDLSTERKDTVSFVSPLRDVVV 391 (631) Q Consensus 315 ~~~~~~~~~~~~l~gg~d~~~~~~~~~~~~l~~~~~~~~~~~i~~~~~~---~~~~~~~~~~~~~~~~~a~~d~~~~~~~ 391 (631) ..++.|+.+ .++...+..............+.++.+|++ .++.++.++|++++ +++++|.| T Consensus 99 ---------~~dliG~~~-~~~~~tg~~al~~~~~~~~~~p~il~ap~~~~~~v~~~l~~~~~~~~-~~~~~d~~----- 162 (392) T protein:vir:18 99 ---------TSNIIGGTD-ENGKYTGIKALLTAEAVTGVKPRILGVPGLDTQEVATALASVCISLR-AFGYVSAW----- 162 (392) T ss_pred ---------hhhheeccc-ccchhhhHHHHHhhhhhhceeehhcccCccchHHHHHHHHHHHhhcC-cEEEEecC----- Confidence 000000000 011111112222222223333444444443 35678889998765 78888765 Q ss_pred ccccCCHHHHHHHHHhcCCCcceEEEecCeeEEEeccCCceeEeehHHHHHHHHHHhhccCCceecccceeeceeeeccc Q lcl|NC_013693. 392 GNRGREMEDVVAWRESLVRDSSYFFMDDNWAYVYDKYNDKMRWIPACGGTAGVWARSIEIAGIYKSPAFHNRGKYNNYNR 471 (631) Q Consensus 392 ~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~ag~~a~~d~~~g~~~span~~~~~i~g~~~ 471 (631) .+.+++++++||+++ +|+|+++||||++++|+.++..+++|||+++||++||+|.++|||+||+|+++.++.++.. 
T Consensus 163 --~~~~~~~a~~~~~~~--~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~AG~~a~~d~~~g~~~spaN~~l~gi~~~~~ 238 (392) T protein:vir:18 163 --GCKTISEAMAYRENF--SQRELMVIWPDFLAWDTTANATATAYATARALGLRAYIDQTIGWHKTLSNVGVQGVTGISA 238 (392) T ss_pred --CCCCHHHHHHHHhhc--cCceEEEEeCceeeecccCCceEEechHHHHHHHHHhhhccCCceEccCCceeeceeecce Confidence 467899999999976 5899999999999999999999999999999999999999999999999999766655432 Q ss_pred -ce--ecCChhHhhhhhhcCceEEEEEcCCcEEEEcceecCCCChhhceehhhHHHHHHHHHHHHHHHHHhcCCCCHHHH Q lcl|NC_013693. 472 -MA--WSASSDERAVLYRNQINSIVTFSNEGIVLYGDKTGLTRPSAFDRINVRGLFIMAEQNIAAIAKYYLGENNDEFTR 548 (631) Q Consensus 472 -~~--~~~~~~~~~~L~~~gin~i~~~~~~G~~~wg~rT~~~~~~~~~~i~vrR~~~~i~~~~~~~~~~~v~epn~~~~~ 548 (631) +. ...++.|++.||++|||+++ +++|+++||+||+++ |++|+||+||||+++|+++|+++++|+|||||++.+| T Consensus 239 ~~~~~~~~~~~~~~~Ln~~gI~t~~--~~~G~~~wG~rT~~~-d~~~~~i~~rR~~~~i~~~i~~~~~~~v~e~n~~~~~ 315 (392) T protein:vir:18 239 SVFWDLQASGTDADLLNEAGVTTLV--RKDGFRFWGNRTCSD-DPLFLFENYTRTAQVLADTMAEAHMWAVDKPITASLI 315 (392) T ss_pred ecccccCCCcchhhhhhhcCceEEE--cCCCEEEEcccccCC-CcccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHH Confidence 22 33456799999999999995 578999999999865 6799999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHhCCceeeeEEEEccCCCCHHHhhCCeEEEEEEEEecCCceEEEEEEEEEecCce-eeeeecc Q lcl|NC_013693. 
549 SLFSNAVRPYIRQLANMGAIYDGQVKCDADNNTADIIAANQMVAGIWLKPEYSINWVYLDFAAVRPDME-FSEIETG 624 (631) Q Consensus 549 ~~i~~~i~~~l~~l~~~gal~g~~v~~~~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~-~~e~~~~ 624 (631) ++|+++++.||++||++|+|+||+|+||+++||+++|++|+|+++|+++|++|+|||+|++++..++.+ +.|.+.+ T Consensus 316 ~~i~~~i~~~L~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~~v~~~p~~p~e~I~~~~~~~~~~~~~~~~~~~~ 392 (392) T protein:vir:18 316 RDIVDGINAKFRELKSNGYIVDGECWFDEESNDKETLKAGKLYIDYDYTPVPPLESLTLRQRITDKYLVNLAESVNS 392 (392) T ss_pred HHHHHHHHHHHHHHHhcCcccceEEEEecCCCCHHHhhCCeEEEEEEEEecCCcceEEEEEEEchHHHHHHHHHhcC Confidence 999999999999999999999999999999999999999999999999999999999999999887755 3333333 No 25 >protein:vir:100323 Length: 393 # NCBI annotation: tail sheath protein FI # Family: family:all:115 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655492;genbank:gi:109289960;genbank:GeneID:4157366 Probab=100.00 E-value=1.3e-94 Score=535.21 Aligned_cols=379 Identities=14% Similarity=0.051 Sum_probs=299.5 Q ss_pred CCCcchhcCCceEEEEecCCCcee-cccCCceEEEeeeccC-----cCCCCeEEecCHHHHHHHcCCCCccchhHHHHHH Q lcl|NC_013693. 1 MATQSFSVAPSVQWTERDATLQTS-PSVVVQGATVGKFQWG-----EAELPVLVTGGETGLVKKFFKPNDATATDFLVIA 74 (631) Q Consensus 1 m~~~~~ylsPGVyveEv~~~~~~~-gv~tsv~afvG~~~~G-----p~~~p~~i~s~~~e~~~~fG~~~~~~~~~~av~~ 74 (631) |+|+.+|+ |||||+|++.+++++ +++|++++|||.++++ |+|+|++|+|+ .||.+.||. ..++.+++.+ T Consensus 1 m~m~~~~~-~GV~v~e~~~g~~~i~~~~tav~~~vgta~~~~~~~~pln~pv~i~s~-~~~~~~~g~---~g~L~~al~~ 75 (393) T protein:vir:10 1 MSILDTYL-HGVEVVEVNAGGVTISTAATSVIGVVCTGDQADAETFPLNTPVLITNP-LNYLEKAGS---TGTLRRTLNS 75 (393) T ss_pred CCCCCccC-CCeEEEEcCCCcceecccCcceeEEEeeccCcCcccccCccceEecch-HHHHHhhCC---ccchhhhhhh Confidence 99998887 899999999999888 8999999999999987 99999999776 799999995 6789999999 Q ss_pred HHHhCCceEEEEEecccCCCcccccccchhhhccccccccccccceeeehhhhhhhhhhchhhhhccCcccceeecccee Q lcl|NC_013693. 
75 DFLSYSSVAWVTRVVGPAARNAVTKGQTAILIRNKLDFETASPSASITWTGRYAGSLGNDVAINVCDAAGFPTWEFRNNF 154 (631) Q Consensus 75 fF~ngG~~~~vvRv~~~~a~~a~~~~~~~~~~~~~~~~~~~~~~~~l~~~a~~~G~~gn~l~v~v~~~~~~~~~~~~~~~ 154 (631) ||.|+|..||+||+......... . .+ T Consensus 76 ~~~~~~~~~~vv~v~~~~~~~~t---------------------------------~--------~~------------- 101 (393) T protein:vir:10 76 IGSIVKTPTVIVRVAESDDSDTL---------------------------------T--------AN------------- 101 (393) T ss_pred hhcccCceEEEeecccCcccccc---------------------------------c--------cc------------- Confidence 99999999999998422100000 0 00 Q ss_pred eeecccccceEEeeeeeeeeecccccccceeeeeecccccccceeEeecccccccccccccccccccccccccccccccc Q lcl|NC_013693. 155 AYAPQAGEYHIVIVDKVGRITDSSGAVGQVDRISVSGTATGAGSISVAGEDVAYTDTDTPATLATKIGTALTALTDVYSS 234 (631) Q Consensus 155 ~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 234 (631) ..+ .. . T Consensus 102 ----iig------------------------------~~-~--------------------------------------- 107 (393) T protein:vir:10 102 ----IVG------------------------------TQ-E--------------------------------------- 107 (393) T ss_pred ----ccc------------------------------cc-c--------------------------------------- Confidence 000 00 0 Q ss_pred cccccccccccccccccccccceeecccccccceeeeecccccccceeeeeeeeecccccccchhhhhhhhhccccceee Q lcl|NC_013693. 235 VVVKSNTVTVTHKAIGPQTVTAIVPDANGLTATAVTTTVGASGSIIEKYELMQATQGSKKSDGSNAYFKDVINDTSNWVY 314 (631) Q Consensus 235 ~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 314 (631) T Consensus 108 -------------------------------------------------------------------------------- 107 (393) T protein:vir:10 108 -------------------------------------------------------------------------------- 107 (393) T ss_pred -------------------------------------------------------------------------------- Confidence Q ss_pred eecccccccccccccccccchhhhhhHHHHHhhhhhcccceeEEeccc---cchHHHHHHHHHhhccceEEeeccccccc Q lcl|NC_013693. 
315 TFATTLAAGVTELEGGVDDYTGNRVAAIEALNNAEAYDAKPVFAFCEE---LIEQQTLIDLSTERKDTVSFVSPLRDVVV 391 (631) Q Consensus 315 ~~~~~~~~~~~~l~gg~d~~~~~~~~~~~~l~~~~~~~~~~~i~~~~~---~~~~~~~~~~~~~~~~~~a~~d~~~~~~~ 391 (631) ++..++....+.........|.++++|+ ..++.+++++|++++.++.+.|+| T Consensus 108 --------------------~~~~tgl~al~~~~~~~~~~p~li~apg~~~~~~~~al~~~~~~~~~~~~v~d~~----- 162 (393) T protein:vir:10 108 --------------------NGKFTGIKALLTAQSTVFVKPKLLCVPQHDNQAVATELLSVAKKLNAFAFISDNG----- 162 (393) T ss_pred --------------------cchhhHHHHHHhhhhhcceeeeeeeeccccchHHHHHHHHHhhccCcEEEEEcCC----- Confidence 0000000000000011111122222222 235678899999999888777654 Q ss_pred ccccCCHHHHHHHHHhcCCCcceEEEecCeeEEEeccCCceeEeehHHHHHHHHHHhhccCCceecccceeeceeeeccc Q lcl|NC_013693. 392 GNRGREMEDVVAWRESLVRDSSYFFMDDNWAYVYDKYNDKMRWIPACGGTAGVWARSIEIAGIYKSPAFHNRGKYNNYNR 471 (631) Q Consensus 392 ~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~ag~~a~~d~~~g~~~span~~~~~i~g~~~ 471 (631) .++.++++.||+.+ +|+|+++||||++++|+.++..+++|||+++||++||+|.++|||+||||+++.++.+... T Consensus 163 ---~~t~~~ai~~~~~~--~s~~~~~~~P~~~~~d~~~~~~~~~p~s~~~Ag~~a~~d~~~G~~~spaN~~l~gi~~~~~ 237 (393) T protein:vir:10 163 ---ATTKEQAYTYRQNF--SQREGMMIFGDWKSYNTDKKAYDTDYAVARACALQAYIDKTVGWHKNISNVELDGVTGITK 237 (393) T ss_pred ---CCCHHHHHHHhhhc--CCceEEEEecccccccccCCceeEeehhHHHHHHHHHhhcCCCcEEccCCceeeceeecce Confidence 56889999999976 5889999999999999999999999999999999999999999999999998766655432 Q ss_pred -ce--ecCChhHhhhhhhcCceEEEEEcCCcEEEEcceecCCCChhhceehhhHHHHHHHHHHHHHHHHHhcCCCCHHHH Q lcl|NC_013693. 472 -MA--WSASSDERAVLYRNQINSIVTFSNEGIVLYGDKTGLTRPSAFDRINVRGLFIMAEQNIAAIAKYYLGENNDEFTR 548 (631) Q Consensus 472 -~~--~~~~~~~~~~L~~~gin~i~~~~~~G~~~wg~rT~~~~~~~~~~i~vrR~~~~i~~~~~~~~~~~v~epn~~~~~ 548 (631) .. 
..+++.|++.||++|||+|+ +++|+++||+||+++ |++|+||+||||+++|+++|++.++|+|||||++.+| T Consensus 238 ~~~~~~~~~~~~~~~Ln~~gI~t~~--~~~G~~~wG~rT~s~-d~~~~~i~vrR~~~~i~~~i~~~~~~~v~e~~~~~~~ 314 (393) T protein:vir:10 238 AVEFDINESSTEANYLNEKGITICL--NHNGFRYWGSRTLAT-DTRWAFQQSVRTAQIIKETIGAGLAWAVDMPLTPLRV 314 (393) T ss_pred ecccccCCCcchhHhHhhcCceEEE--cCCCEEEEcccccCC-CcccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHH Confidence 22 33457899999999999995 578999999999865 6799999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHhCC--ceeeeEEEEccCCCCHHHhhCCeEEEEEEEEecCCceEEEEEEEEEecCceeeeeeccCe Q lcl|NC_013693. 549 SLFSNAVRPYIRQLANMG--AIYDGQVKCDADNNTADIIAANQMVAGIWLKPEYSINWVYLDFAAVRPDMEFSEIETGGG 626 (631) Q Consensus 549 ~~i~~~i~~~l~~l~~~g--al~g~~v~~~~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~~~g~ 626 (631) ++|+++++.||++||+.| +|.||+|+||++ ||+++|++|+|+++|+++|++|+|||+|+++++.++ |.|+ -+. T Consensus 315 ~~i~~~i~~~L~~l~~~g~~al~g~~v~~~~~-nt~~~i~~G~~~~~i~~~p~~p~e~I~~~~~~~~~~--~~~l--~~~ 389 (393) T protein:vir:10 315 KTMLEAINNKLRSWASGDDPRILGARVWVAEE-ITADIIKSGKFVIKYDYHWIPSLESLGLEQRVNDEY--VVDL--VNT 389 (393) T ss_pred HHHHHHHHHHHHHHHhccccccccceEEecCC-CCHHHhhCCEEEEEEEEEecCCcceEEEEEEEchHH--HHHH--HHH Confidence 999999999999999866 899999999875 888999999999999999999999999999988765 3333 122 Q ss_pred eeec Q lcl|NC_013693. 627 IVAA 630 (631) Q Consensus 627 ~~~~ 630 (631) |-|- T Consensus 390 v~a~ 393 (393) T protein:vir:10 390 LKAL 393 (393) T ss_pred HhcC Confidence 2222 No 26 >protein:vir:5711 Length: 396 # NCBI annotation: gpFI # Family: family:all:115 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839870;genbank:gi:30065725;genbank:GeneID:1260618 Probab=100.00 E-value=2.4e-94 Score=533.83 Aligned_cols=382 Identities=15% Similarity=0.118 Sum_probs=297.0 Q ss_pred CCCcchhcCCceEEEEecCCCcee-cccCCceEEEeeeccC-----cCCCCeEEecCHHHHHHHcCCCCccchhHHHHHH Q lcl|NC_013693. 
1 MATQSFSVAPSVQWTERDATLQTS-PSVVVQGATVGKFQWG-----EAELPVLVTGGETGLVKKFFKPNDATATDFLVIA 74 (631) Q Consensus 1 m~~~~~ylsPGVyveEv~~~~~~~-gv~tsv~afvG~~~~G-----p~~~p~~i~s~~~e~~~~fG~~~~~~~~~~av~~ 74 (631) |+ +|+ |||||+|++.+++++ +|.|++++|||.++++ |.++|++|+|+ .||...||. ..++.+++++ T Consensus 1 m~---~~~-~GV~v~e~~~g~~~i~~v~tav~~~vg~a~~~d~~~~~~~~pv~i~s~-~~~~~~~g~---~~tl~~al~~ 72 (396) T protein:vir:57 1 MS---DYH-HGVQVLEINDGTRVISTVSTAIVGMVCTASDADAETFPLNKPVLITNV-QSAIAKAGK---KGTLAASLQA 72 (396) T ss_pred CC---CCC-CceEEEEcCCCcccccccCCceEEEEEeccCCCcccccCccCeEeecc-hhhhhhccc---ccchHHHHHH Confidence 44 476 799999999999888 8999999999999876 88999999887 699999986 5689999999 Q ss_pred HHHhCCceEEEEEecccCCCcccccccchhhhccccccccccccceeeehhhhhhhhhhchhhhhccCcccceeecccee Q lcl|NC_013693. 75 DFLSYSSVAWVTRVVGPAARNAVTKGQTAILIRNKLDFETASPSASITWTGRYAGSLGNDVAINVCDAAGFPTWEFRNNF 154 (631) Q Consensus 75 fF~ngG~~~~vvRv~~~~a~~a~~~~~~~~~~~~~~~~~~~~~~~~l~~~a~~~G~~gn~l~v~v~~~~~~~~~~~~~~~ 154 (631) +|.|||..||++|+........... . ... T Consensus 73 ~~~~~~~~~~vv~~~~~~~~~~~~~---------------------~--------------a~t---------------- 101 (396) T protein:vir:57 73 IADQSKPVTVVVRVEDGTGDDEETK---------------------L--------------AQT---------------- 101 (396) T ss_pred hhhcCCceeEeeecccccccccccc---------------------c--------------ccc---------------- Confidence 9999999999998743221100000 0 000 Q ss_pred eeecccccceEEeeeeeeeeecccccccceeeeeecccccccceeEeecccccccccccccccccccccccccccccccc Q lcl|NC_013693. 155 AYAPQAGEYHIVIVDKVGRITDSSGAVGQVDRISVSGTATGAGSISVAGEDVAYTDTDTPATLATKIGTALTALTDVYSS 234 (631) Q Consensus 155 ~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 234 (631) ... ..+...... .. . 
T Consensus 102 ------~~~-------------------------iiG~~~~~~-----------------------~~----------t- 116 (396) T protein:vir:57 102 ------VSN-------------------------IIGTTDENG-----------------------QY----------T- 116 (396) T ss_pred ------cee-------------------------eeeeccccc-----------------------cc----------h- Confidence 000 000000000 00 0 Q ss_pred cccccccccccccccccccccceeecccccccceeeeecccccccceeeeeeeeecccccccchhhhhhhhhccccceee Q lcl|NC_013693. 235 VVVKSNTVTVTHKAIGPQTVTAIVPDANGLTATAVTTTVGASGSIIEKYELMQATQGSKKSDGSNAYFKDVINDTSNWVY 314 (631) Q Consensus 235 ~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 314 (631) T Consensus 117 -------------------------------------------------------------------------------- 116 (396) T protein:vir:57 117 -------------------------------------------------------------------------------- 116 (396) T ss_pred -------------------------------------------------------------------------------- Confidence Q ss_pred eecccccccccccccccccchhhhhhHHHHHhhhhhcccceeEEeccc---cchHHHHHHHHHhhccceEEeeccccccc Q lcl|NC_013693. 315 TFATTLAAGVTELEGGVDDYTGNRVAAIEALNNAEAYDAKPVFAFCEE---LIEQQTLIDLSTERKDTVSFVSPLRDVVV 391 (631) Q Consensus 315 ~~~~~~~~~~~~l~gg~d~~~~~~~~~~~~l~~~~~~~~~~~i~~~~~---~~~~~~~~~~~~~~~~~~a~~d~~~~~~~ 391 (631) +........+.....|.++++++ ..++.++.++|+++ ++++++|.|. T Consensus 117 -------------------------gl~al~~~~~~~~~~p~i~~ap~~~~~~v~~al~~~~~~~-~~~~~~d~p~---- 166 (396) T protein:vir:57 117 -------------------------GLKALMGAESVTGVKPRILGVPGLDTKEVAVALASVCQEL-NAFGYISAWG---- 166 (396) T ss_pred -------------------------hhhhhhhcccceeEEeccccCcccchhHHHHHHHHHhhhC-ceEEEEcCCC---- Confidence 00000000000001111111111 13677889999865 6899988763 Q ss_pred ccccCCHHHHHHHHHhcCCCcceEEEecCeeEEEeccCCceeEeehHHHHHHHHHHhhccCCceecccceeeceeeeccc Q lcl|NC_013693. 
392 GNRGREMEDVVAWRESLVRDSSYFFMDDNWAYVYDKYNDKMRWIPACGGTAGVWARSIEIAGIYKSPAFHNRGKYNNYNR 471 (631) Q Consensus 392 ~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~ag~~a~~d~~~g~~~span~~~~~i~g~~~ 471 (631) +.+++++++||+++ +|+|+++||||++++|+.++..+++|||+++||++||+|.++|+||||||+++.++.+... T Consensus 167 ---~~~~~~~~~~~~~~--~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~Ag~~a~~d~~~g~~~spaN~~l~gi~~~~~ 241 (396) T protein:vir:57 167 ---CKTISEVKAYRQNF--SQRELMVIWPDFLAWDTVTSTTATAYATARALGLRAKIDQEQGWHKTLSNVGVNGVTGISA 241 (396) T ss_pred ---CCCHHHHHHHHhcc--CCceEEEEcceeeeecccCCceeEEehhHHHHHHHHHhhhccCcEeccCCceeccccccce Confidence 57789999999976 5899999999999999999999999999999999999999999999999999665554322 Q ss_pred -cee--cCChhHhhhhhhcCceEEEEEcCCcEEEEcceecCCCChhhceehhhHHHHHHHHHHHHHHHHHhcCCCCHHHH Q lcl|NC_013693. 472 -MAW--SASSDERAVLYRNQINSIVTFSNEGIVLYGDKTGLTRPSAFDRINVRGLFIMAEQNIAAIAKYYLGENNDEFTR 548 (631) Q Consensus 472 -~~~--~~~~~~~~~L~~~gin~i~~~~~~G~~~wg~rT~~~~~~~~~~i~vrR~~~~i~~~~~~~~~~~v~epn~~~~~ 548 (631) +.+ ..++.|++.||++|||+++ +++|+++||+||+++ |++|+||+|||++++|+++|++.++|+|||||++.+| T Consensus 242 ~~~~~~~~~~~~~~~Ln~~gi~t~~--~~~G~~~wG~rT~~~-d~~~~~i~vrR~~~~i~~~i~~~~~~~v~e~n~~~~~ 318 (396) T protein:vir:57 242 SVFWDLQKPGTDADLLNEAGVTTLV--RRDGFRFWGNRTCSD-DPLFLFESYTRTAQVLADTMAEAHMWAIDKPITATLI 318 (396) T ss_pred ecccccCCcchhhhhhhhcCcEEEE--cCCCEEEEcccccCC-CcccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHH Confidence 222 2346799999999999996 468999999999865 6789999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHhCCceeeeEEEEccCCCCHHHhhCCeEEEEEEEEecCCceEEEEEEEEEecCce--eeeeecc Q lcl|NC_013693. 549 SLFSNAVRPYIRQLANMGAIYDGQVKCDADNNTADIIAANQMVAGIWLKPEYSINWVYLDFAAVRPDME--FSEIETG 624 (631) Q Consensus 549 ~~i~~~i~~~l~~l~~~gal~g~~v~~~~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~--~~e~~~~ 624 (631) ++|+++|+.||++||++|+|+||+|+||+++||+++|++|+|+++|+++|++|+|||+|++++..++.+ |+++.+. 
T Consensus 319 ~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~n~~~~i~~G~~~~~v~~~p~~p~e~I~~~~~~~~~~~~~~~~~~~~~ 396 (396) T protein:vir:57 319 RDIIDGINAKFRELKNNGYIVDGTCWFSEESNDAETLKAGKLYIDYDYTPVPPLENLTLRQRITSRYLASLVTSVNSN 396 (396) T ss_pred HHHHHHHHHHHHHHHhCCceeceEEEEecCCCCHHHhhCCeEEEEEEEEecCCcceEEEEEEEchHHHHHHHHHhhcC Confidence 999999999999999999999999999999999999999999999999999999999999999887644 4445444 No 27 >protein:vir:2035 Length: 396 # NCBI annotation: gpFI # Family: family:all:115 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046778;genbank:gi:9630349;genbank:GeneID:1261516 Probab=100.00 E-value=2.1e-94 Score=534.13 Aligned_cols=382 Identities=14% Similarity=0.099 Sum_probs=295.2 Q ss_pred CCCcchhcCCceEEEEecCCCcee-cccCCceEEEeeeccC-----cCCCCeEEecCHHHHHHHcCCCCccchhHHHHHH Q lcl|NC_013693. 1 MATQSFSVAPSVQWTERDATLQTS-PSVVVQGATVGKFQWG-----EAELPVLVTGGETGLVKKFFKPNDATATDFLVIA 74 (631) Q Consensus 1 m~~~~~ylsPGVyveEv~~~~~~~-gv~tsv~afvG~~~~G-----p~~~p~~i~s~~~e~~~~fG~~~~~~~~~~av~~ 74 (631) |+ +|+ |||||+|++.+++++ +++|++++|||.++++ |+++|++|+|+ .||...||+ ...+++++++ T Consensus 1 m~---~~~-~GV~v~e~~~g~~~i~~v~tav~~~vg~a~~a~~~~~~l~~pvlvts~-~~~~~~~g~---~~tL~~al~~ 72 (396) T protein:vir:20 1 MS---DYH-HGVQVLEINEGTRVISTVSTAIVGMVCTASDADAETFPLNKPVLITNV-QSAISKAGK---KGTLAASLQA 72 (396) T ss_pred CC---CCC-CCeEEEEcCCCcceeeecCCceeEEEeeeccCCCccccCccCEEeech-HHHHhhccc---ccchhhhhhh Confidence 43 485 999999999999888 8999999999999764 78999999876 799999996 5678899999 Q ss_pred HHHhCCceEEEEEecccCCCcccccccchhhhccccccccccccceeeehhhhhhhhhhchhhhhccCcccceeecccee Q lcl|NC_013693. 75 DFLSYSSVAWVTRVVGPAARNAVTKGQTAILIRNKLDFETASPSASITWTGRYAGSLGNDVAINVCDAAGFPTWEFRNNF 154 (631) Q Consensus 75 fF~ngG~~~~vvRv~~~~a~~a~~~~~~~~~~~~~~~~~~~~~~~~l~~~a~~~G~~gn~l~v~v~~~~~~~~~~~~~~~ 154 (631) ||.|||..||++|+.......... .. .. 
T Consensus 73 ~~~ngg~~~~v~~~~~~~~~~~~~---------------------~~--------------a~----------------- 100 (396) T protein:vir:20 73 IADQSKPVTVVMRVEDGTGDDEET---------------------KL--------------AQ----------------- 100 (396) T ss_pred hhccCceeEEEEeccccccccccc---------------------cc--------------cc----------------- Confidence 999999999999874321100000 00 00 Q ss_pred eeecccccceEEeeeeeeeeecccccccceeeeeecccccccceeEeecccccccccccccccccccccccccccccccc Q lcl|NC_013693. 155 AYAPQAGEYHIVIVDKVGRITDSSGAVGQVDRISVSGTATGAGSISVAGEDVAYTDTDTPATLATKIGTALTALTDVYSS 234 (631) Q Consensus 155 ~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 234 (631) ..... ... T Consensus 101 --------------------------------------------------------------t~~~~----------~~~ 108 (396) T protein:vir:20 101 --------------------------------------------------------------TVSNI----------IGT 108 (396) T ss_pred --------------------------------------------------------------ccccc----------ccc Confidence 00000 000 Q ss_pred cccccccccccccccccccccceeecccccccceeeeecccccccceeeeeeeeecccccccchhhhhhhhhccccceee Q lcl|NC_013693. 235 VVVKSNTVTVTHKAIGPQTVTAIVPDANGLTATAVTTTVGASGSIIEKYELMQATQGSKKSDGSNAYFKDVINDTSNWVY 314 (631) Q Consensus 235 ~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 314 (631) .. T Consensus 109 ~~------------------------------------------------------------------------------ 110 (396) T protein:vir:20 109 TD------------------------------------------------------------------------------ 110 (396) T ss_pred cc------------------------------------------------------------------------------ Confidence 00 Q ss_pred eecccccccccccccccccchhhhhhHHHHHhhhhhcccceeEEecc---ccchHHHHHHHHHhhccceEEeeccccccc Q lcl|NC_013693. 
315 TFATTLAAGVTELEGGVDDYTGNRVAAIEALNNAEAYDAKPVFAFCE---ELIEQQTLIDLSTERKDTVSFVSPLRDVVV 391 (631) Q Consensus 315 ~~~~~~~~~~~~l~gg~d~~~~~~~~~~~~l~~~~~~~~~~~i~~~~---~~~~~~~~~~~~~~~~~~~a~~d~~~~~~~ 391 (631) .++..++..............|.++..+ ...++.++.++|++++ +++++|.|. T Consensus 111 -------------------~~~~~tg~~al~~~~~~~~~~p~i~~ap~~~~~~v~~al~~~~~~~~-~~~~iD~p~---- 166 (396) T protein:vir:20 111 -------------------ENGQYTGLKAMLAAESVTGVKPRILGVPGLDTKEVAVALASVCQKLR-AFGYISAWG---- 166 (396) T ss_pred -------------------cccccchhhhhhhhccccccchhhhhhhhhccHHHHHHHHHHHhcCC-cEEEEecCC---- Confidence 0000000000000000000011111111 1236788999998865 788888774 Q ss_pred ccccCCHHHHHHHHHhcCCCcceEEEecCeeEEEeccCCceeEeehHHHHHHHHHHhhccCCceecccceeeceeeeccc Q lcl|NC_013693. 392 GNRGREMEDVVAWRESLVRDSSYFFMDDNWAYVYDKYNDKMRWIPACGGTAGVWARSIEIAGIYKSPAFHNRGKYNNYNR 471 (631) Q Consensus 392 ~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~ag~~a~~d~~~g~~~span~~~~~i~g~~~ 471 (631) ..+++++++||+++ +|+|+++||||++++|+.++..+++|||+++||++||+|.++|+|+||||+++.++.+... T Consensus 167 ---~~~~~~a~~~r~~~--~s~~~~~~~P~~~~~d~~~~~~~~~p~s~~~Ag~~a~~d~~~g~~~spaN~~l~gi~~~~~ 241 (396) T protein:vir:20 167 ---CKTISEVKAYRQNF--SQRELMVIWPDFLAWDTVTSTTATAYATARALGLRAKIDQEQGWHKTLSNVGVNGVTGISA 241 (396) T ss_pred ---CCCHHHHHHHhhCC--CCceEEEEcCccccccCcCCcceeechhHHHHHHHHHhhhhcCcEeccCCceeccceecce Confidence 56789999999976 5899999999999999999999999999999999999999999999999999766655432 Q ss_pred -ce--ecCChhHhhhhhhcCceEEEEEcCCcEEEEcceecCCCChhhceehhhHHHHHHHHHHHHHHHHHhcCCCCHHHH Q lcl|NC_013693. 472 -MA--WSASSDERAVLYRNQINSIVTFSNEGIVLYGDKTGLTRPSAFDRINVRGLFIMAEQNIAAIAKYYLGENNDEFTR 548 (631) Q Consensus 472 -~~--~~~~~~~~~~L~~~gin~i~~~~~~G~~~wg~rT~~~~~~~~~~i~vrR~~~~i~~~~~~~~~~~v~epn~~~~~ 548 (631) +. 
..+++.|++.||++|||+++ +++|+++||+||+++ |++|+||++||+++||+++|++.++|+|||||++.+| T Consensus 242 ~~~~~~~~~~~~~~~Ln~~gi~~~~--~~~G~~~wG~rT~s~-d~~~~~i~~rR~~~~i~~~~~~~~~~~v~e~~~~~~~ 318 (396) T protein:vir:20 242 SVFWDLQESGTDADLLNESGVTTLI--RRDGFRFWGNRTCSD-DPLFLFENYTRTAQVVADTMAEAHMWAVDKPITATLI 318 (396) T ss_pred ecccccCCCcchhhhhhhcCcEEEE--cCCCEEEEcccccCC-CcccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHH Confidence 22 23456799999999999995 478999999999865 6789999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHhCCceeeeEEEEccCCCCHHHhhCCeEEEEEEEEecCCceEEEEEEEEEecCce--eeeeecc Q lcl|NC_013693. 549 SLFSNAVRPYIRQLANMGAIYDGQVKCDADNNTADIIAANQMVAGIWLKPEYSINWVYLDFAAVRPDME--FSEIETG 624 (631) Q Consensus 549 ~~i~~~i~~~l~~l~~~gal~g~~v~~~~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~--~~e~~~~ 624 (631) ++|+++++.||++||++|+|+||+|+||+++||+++|++|+|+++|+++|++|+|||+|++++..+..+ |+++... T Consensus 319 ~~i~~~i~~~L~~l~~~G~l~g~~v~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~~~~~~~~ 396 (396) T protein:vir:20 319 RDIVDGINAKFRELKTNGYIVDATCWFSEESNDAETLKAGKLYIDYDYTPVPPLENLTLRQRITDKYLANLVTSVNSN 396 (396) T ss_pred HHHHHHHHHHHHHHHhCcceeceEEEEecCCCCHHHhhCCEEEEEEEEEecCCcceEEEEEEEchHHHHHHHHHhhcC Confidence 999999999999999999999999999999999999999999999999999999999999998866533 3333333 No 28 >protein:vir:1172 Length: 391 # NCBI annotation: hypothetical protein # Family: family:all:115 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490621;genbank:gi:17313241;genbank:GeneID:927300 Probab=100.00 E-value=1.6e-94 Score=534.87 Aligned_cols=378 Identities=16% Similarity=0.115 Sum_probs=297.3 Q ss_pred CCCcchhcCCceEEEEecCCCcee-cccCCceEEEeeec-----cCcCCCCeEEecCHHHHHHHcCCCCccchhHHHHHH Q lcl|NC_013693. 1 MATQSFSVAPSVQWTERDATLQTS-PSVVVQGATVGKFQ-----WGEAELPVLVTGGETGLVKKFFKPNDATATDFLVIA 74 (631) Q Consensus 1 m~~~~~ylsPGVyveEv~~~~~~~-gv~tsv~afvG~~~-----~Gp~~~p~~i~s~~~e~~~~fG~~~~~~~~~~av~~ 74 (631) |+| +|.+|||||+|++.+++++ .+.|++++|+|.++ .+|+++|++|+++ .+|...||. 
..++.+++.+ T Consensus 1 M~~--~~~~~GV~v~e~~~~~~~i~~v~tavig~vg~a~~a~~~~~~~~~p~~v~s~-~~~~~~~g~---~~tl~~al~~ 74 (391) T protein:vir:11 1 MAA--DQYHHGVRVQEINDGTRPIRTIATAIIGLVATAEDADATAFPLDTPVLITNV-QAAIGKAGT---SGTLPASLQA 74 (391) T ss_pred CCC--CcCCCcEEEEECCCCcceecccCCceeEEEEecCCCCCccccccccEEEecc-hhhheecCC---Cccchhhhhh Confidence 555 5889999999999999888 89999999999998 4699999999776 799988885 6678899999 Q ss_pred HHHhCCceEEEEEecccCCCcccccccchhhhccccccccccccceeeehhhhhhhhhhchhhhhccCcccceeecccee Q lcl|NC_013693. 75 DFLSYSSVAWVTRVVGPAARNAVTKGQTAILIRNKLDFETASPSASITWTGRYAGSLGNDVAINVCDAAGFPTWEFRNNF 154 (631) Q Consensus 75 fF~ngG~~~~vvRv~~~~a~~a~~~~~~~~~~~~~~~~~~~~~~~~l~~~a~~~G~~gn~l~v~v~~~~~~~~~~~~~~~ 154 (631) ||.|+|+.||++|+......... ..+. T Consensus 75 ~~~~~g~~~~vv~~~~~~~~~~t-----------------------------------------~~d~------------ 101 (391) T protein:vir:11 75 IADQANAATVVVRVKPGEDEAAT-----------------------------------------NSAV------------ 101 (391) T ss_pred hhccccceeEEeeeccccccccc-----------------------------------------chhh------------ Confidence 99999999999987321000000 0000 Q ss_pred eeecccccceEEeeeeeeeeecccccccceeeeeecccccccceeEeecccccccccccccccccccccccccccccccc Q lcl|NC_013693. 155 AYAPQAGEYHIVIVDKVGRITDSSGAVGQVDRISVSGTATGAGSISVAGEDVAYTDTDTPATLATKIGTALTALTDVYSS 234 (631) Q Consensus 155 ~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 234 (631) .+.. .. T Consensus 102 -----------------------------------~g~~---------------------------------------~a 107 (391) T protein:vir:11 102 -----------------------------------IGGV---------------------------------------SA 107 (391) T ss_pred -----------------------------------hccc---------------------------------------cc Confidence 0000 00 Q ss_pred cccccccccccccccccccccceeecccccccceeeeecccccccceeeeeeeeecccccccchhhhhhhhhccccceee Q lcl|NC_013693. 
235 VVVKSNTVTVTHKAIGPQTVTAIVPDANGLTATAVTTTVGASGSIIEKYELMQATQGSKKSDGSNAYFKDVINDTSNWVY 314 (631) Q Consensus 235 ~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 314 (631) T Consensus 108 -------------------------------------------------------------------------------- 107 (391) T protein:vir:11 108 -------------------------------------------------------------------------------- 107 (391) T ss_pred -------------------------------------------------------------------------------- Confidence Q ss_pred eecccccccccccccccccchhhhhhHHHHHhhhhhcccceeEEeccc---cchHHHHHHHHHhhccceEEeeccccccc Q lcl|NC_013693. 315 TFATTLAAGVTELEGGVDDYTGNRVAAIEALNNAEAYDAKPVFAFCEE---LIEQQTLIDLSTERKDTVSFVSPLRDVVV 391 (631) Q Consensus 315 ~~~~~~~~~~~~l~gg~d~~~~~~~~~~~~l~~~~~~~~~~~i~~~~~---~~~~~~~~~~~~~~~~~~a~~d~~~~~~~ 391 (631) .+...+....++....+...+.++.+++ ..++.+++++|+++ ++++++|.|. T Consensus 108 --------------------~~~~~g~~a~~~~~~~~~~~p~~~~ap~~~~~~v~~al~~~~~~~-~~~~i~D~p~---- 162 (391) T protein:vir:11 108 --------------------DGKYTGMKALLAAKARLGVVPRILGVPGLDTQPVATALIAIAQQL-RAFAYVSASG---- 162 (391) T ss_pred --------------------ccchhhhhhhhhhhhhheeccccccccccccHHHHHHHHHhhccc-ceEEEEEcCC---- Confidence 0000000000000000011111111111 23677888999876 6899998763 Q ss_pred ccccCCHHHHHHHHHhcCCCcceEEEecCeeEEEeccCCceeEeehHHHHHHHHHHhhccCCceecccceeeceeeeccc Q lcl|NC_013693. 392 GNRGREMEDVVAWRESLVRDSSYFFMDDNWAYVYDKYNDKMRWIPACGGTAGVWARSIEIAGIYKSPAFHNRGKYNNYNR 471 (631) Q Consensus 392 ~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~ag~~a~~d~~~g~~~span~~~~~i~g~~~ 471 (631) ..+++++++||+++ +|+|+++||||++++|+.++..+++|||+++||++||+|.++|||+||||+++.++.++.. 
T Consensus 163 ---~~t~~~a~~~r~~~--~s~~~~~~~p~~~~~~~~~~~~~~~p~s~~~ag~~a~~d~~~g~~~span~~l~gi~~~~~ 237 (391) T protein:vir:11 163 ---CKTKEEATAYRENF--AAREAMVIWPDFLTWSTVVNQTVPAPAVAQALGLRARIDQEVGWHKTLSNVAVNGVTGISA 237 (391) T ss_pred ---CCCHHHHHHHhhhc--CCceEEEEcCcceecccccCceEEechHHHHHHHHHHhhccCCcEEccCCceeeceeeccc Confidence 56789999999976 6899999999999999999999999999999999999999999999999999766655432 Q ss_pred -ceec--CChhHhhhhhhcCceEEEEEcCCcEEEEcceecCCCChhhceehhhHHHHHHHHHHHHHHHHHhcCCCCHHHH Q lcl|NC_013693. 472 -MAWS--ASSDERAVLYRNQINSIVTFSNEGIVLYGDKTGLTRPSAFDRINVRGLFIMAEQNIAAIAKYYLGENNDEFTR 548 (631) Q Consensus 472 -~~~~--~~~~~~~~L~~~gin~i~~~~~~G~~~wg~rT~~~~~~~~~~i~vrR~~~~i~~~~~~~~~~~v~epn~~~~~ 548 (631) +.+. .++.|++.||++|||+++ +++|+++||+||+++ |++|+||+|||++++|+++|++.++|+|||||++.+| T Consensus 238 ~~~~~~~~~~~~~~~Ln~~gi~~~~--~~~G~~~wG~rT~~~-d~~~~~i~vrR~~~~i~~~~~~~~~~~v~e~n~~~~~ 314 (391) T protein:vir:11 238 DVFWDLQSPSTDANYLNENEVTTLV--QEGGFRFWGSRTCSD-DPLFAFENYTRTAQVLADTIAEAHMWAVDKPMHPSLV 314 (391) T ss_pred ccccccCCCcchhhhhhhcCcEEEE--cCCCEEEEcccccCC-CcccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHH Confidence 2222 346799999999999985 578999999999865 6799999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHhCCceeeeEEEEccCCCCHHHhhCCeEEEEEEEEecCCceEEEEEEEEEecCc-eeeeeecc Q lcl|NC_013693. 549 SLFSNAVRPYIRQLANMGAIYDGQVKCDADNNTADIIAANQMVAGIWLKPEYSINWVYLDFAAVRPDM-EFSEIETG 624 (631) Q Consensus 549 ~~i~~~i~~~l~~l~~~gal~g~~v~~~~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~-~~~e~~~~ 624 (631) ++|+++++.||++||++|+|+||+|+||+++||+++|++|+|+++|+++|++|+|||++++++..+.. 
++.+.+.+ T Consensus 315 ~~i~~~i~~~l~~l~~~g~l~g~~~~~~~~~n~~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~~~~~~a 391 (391) T protein:vir:11 315 RDILEGVNAKFRELKGLGLIIDAQAWYDPNVNDKDTLKAGKLRITYDYTPVPPLEDLTFFQKITDSYLVDFASRVNA 391 (391) T ss_pred HHHHHHHHHHHHHHHhccceeceEEEEecCCCCHHHhhCCeEEEEEEEEecCCcceEEEEEEEchHHHHHHHHHhcC Confidence 99999999999999999999999999999999999999999999999999999999999999887652 23333333 No 29 >protein:vir:103993 Length: 390 # NCBI annotation: phage tail sheath protein # Family: family:all:115 # MgeID: mge:1665 # MgeName: phi52237 # Cross-refs: genbank:acc:YP_293729;genbank:gi:72537699;genbank:GeneID:3608123 Probab=100.00 E-value=3.4e-94 Score=532.99 Aligned_cols=377 Identities=17% Similarity=0.146 Sum_probs=293.3 Q ss_pred CCCcchhcCCceEEEEecCCCcee-cccCCceEEEeeeccC-----cCCCCeEEecCHHHHHHHcCCCCccchhHHHHHH Q lcl|NC_013693. 1 MATQSFSVAPSVQWTERDATLQTS-PSVVVQGATVGKFQWG-----EAELPVLVTGGETGLVKKFFKPNDATATDFLVIA 74 (631) Q Consensus 1 m~~~~~ylsPGVyveEv~~~~~~~-gv~tsv~afvG~~~~G-----p~~~p~~i~s~~~e~~~~fG~~~~~~~~~~av~~ 74 (631) |++ |++|||||+|++.+++++ .++|++++|||.++++ |+++|++++|+ .+|...||. .+++.+++.. T Consensus 1 M~~---~~~~Gv~v~e~~~g~~~i~~~~tav~g~vg~a~~ad~~~~pln~pv~i~s~-~~~~~~~g~---~gtL~~al~~ 73 (390) T protein:vir:10 1 MPQ---DYHHGVRVIEINEGGRPIRSVSTAVLGVVCTAADADASAFPLNTPVLLTNV-VAALGKAGK---KGTLRRTLDA 73 (390) T ss_pred Ccc---cccCCeEEEEcCCCcccccccCcceeEEEEcccCcCccccccccceEeccH-HHHHhhcCC---Cceehhhhhh Confidence 665 669999999999999888 7999999999999875 99999999776 699999995 6789999999 Q ss_pred HHHhCCceEEEEEecccCCCcccccccchhhhccccccccccccceeeehhhhhhhhhhchhhhhccCcccceeecccee Q lcl|NC_013693. 75 DFLSYSSVAWVTRVVGPAARNAVTKGQTAILIRNKLDFETASPSASITWTGRYAGSLGNDVAINVCDAAGFPTWEFRNNF 154 (631) Q Consensus 75 fF~ngG~~~~vvRv~~~~a~~a~~~~~~~~~~~~~~~~~~~~~~~~l~~~a~~~G~~gn~l~v~v~~~~~~~~~~~~~~~ 154 (631) ||.|||..|||||+.......+..+ .+ .|. 
T Consensus 74 ~~~~gg~~~~vv~v~~~~~~~~~~~----------------------~~----ig~------------------------ 103 (390) T protein:vir:10 74 IGKQTKPLTVVVRVAEGKDADETTS----------------------NV----IGT------------------------ 103 (390) T ss_pred hccccCceEEEEEeccccccccccc----------------------cc----ccc------------------------ Confidence 9999999999999853221100000 00 000 Q ss_pred eeecccccceEEeeeeeeeeecccccccceeeeeecccccccceeEeecccccccccccccccccccccccccccccccc Q lcl|NC_013693. 155 AYAPQAGEYHIVIVDKVGRITDSSGAVGQVDRISVSGTATGAGSISVAGEDVAYTDTDTPATLATKIGTALTALTDVYSS 234 (631) Q Consensus 155 ~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 234 (631) ....+. .+. .... T Consensus 104 --~~~~~~------------------------------~tg-----------------------------~~al------ 116 (390) T protein:vir:10 104 --VTPDGK------------------------------YTG-----------------------------IKAL------ 116 (390) T ss_pred --cccccc------------------------------cch-----------------------------hhhh------ Confidence 000000 000 0000 Q ss_pred cccccccccccccccccccccceeecccccccceeeeecccccccceeeeeeeeecccccccchhhhhhhhhccccceee Q lcl|NC_013693. 235 VVVKSNTVTVTHKAIGPQTVTAIVPDANGLTATAVTTTVGASGSIIEKYELMQATQGSKKSDGSNAYFKDVINDTSNWVY 314 (631) Q Consensus 235 ~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 314 (631) . .. T Consensus 117 --------------------~-------------------------~~-------------------------------- 119 (390) T protein:vir:10 117 --------------------L-------------------------AA-------------------------------- 119 (390) T ss_pred --------------------h-------------------------hh-------------------------------- Confidence 0 00 Q ss_pred eecccccccccccccccccchhhhhhHHHHHhhhhhcccceeEEecc---ccchHHHHHHHHHhhccceEEeeccccccc Q lcl|NC_013693. 
315 TFATTLAAGVTELEGGVDDYTGNRVAAIEALNNAEAYDAKPVFAFCE---ELIEQQTLIDLSTERKDTVSFVSPLRDVVV 391 (631) Q Consensus 315 ~~~~~~~~~~~~l~gg~d~~~~~~~~~~~~l~~~~~~~~~~~i~~~~---~~~~~~~~~~~~~~~~~~~a~~d~~~~~~~ 391 (631) .......|.++.++ ...++.++..+|++++ +++++|.| T Consensus 120 ---------------------------------~~~~~~~p~il~ap~~~~~~v~~~l~~~a~~~~-~~aivD~p----- 160 (390) T protein:vir:10 120 ---------------------------------QGALGVKPRILAAPGLDTQPVAAALAATAQSLR-AMAYVSAS----- 160 (390) T ss_pred ---------------------------------hhhhcceehhhcccccchHHHHHHHHHhhcccc-eEEEEecC----- Confidence 00000000000000 0125667888898765 68888876 Q ss_pred ccccCCHHHHHHHHHhcCCCcceEEEecCeeEEEeccCCceeEeehHHHHHHHHHHhhccCCceecccceeeceeeeccc Q lcl|NC_013693. 392 GNRGREMEDVVAWRESLVRDSSYFFMDDNWAYVYDKYNDKMRWIPACGGTAGVWARSIEIAGIYKSPAFHNRGKYNNYNR 471 (631) Q Consensus 392 ~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~ag~~a~~d~~~g~~~span~~~~~i~g~~~ 471 (631) ...+.+++++||+++ +|+|+++||||++++|+.++..+++|||+++||++||+|.++|||+||||+++.++.+... T Consensus 161 --~~~t~~~a~~~~~~~--~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~Agl~a~~D~~~g~~~spaN~~l~gi~~~~~ 236 (390) T protein:vir:10 161 --GCKTKEEAAAYRKQF--GQREIMVIWPDWLGWDDTTNSTAVIPAPAIAAGLRAKIDNDIGWHKTISNVVVNGVSGISA 236 (390) T ss_pred --CCCCHHHHHHHhhcc--CCceEEEEcCceEeecccCCcccccchHHHHHHHHHHhhcCCCcEECcCCceeeceeecce Confidence 357889999999976 5899999999999999999999999999999999999999999999999999665555321 Q ss_pred -ceec--CChhHhhhhhhcCceEEEEEcCCcEEEEcceecCCCChhhceehhhHHHHHHHHHHHHHHHHHhcCCCCHHHH Q lcl|NC_013693. 472 -MAWS--ASSDERAVLYRNQINSIVTFSNEGIVLYGDKTGLTRPSAFDRINVRGLFIMAEQNIAAIAKYYLGENNDEFTR 548 (631) Q Consensus 472 -~~~~--~~~~~~~~L~~~gin~i~~~~~~G~~~wg~rT~~~~~~~~~~i~vrR~~~~i~~~~~~~~~~~v~epn~~~~~ 548 (631) ..+. 
..+.|.+.||++||+++++ ++|+++||+||+++ |++|+||+||||+++|+++|+++++|+|||||++.+| T Consensus 237 ~~~~~~~~~~~~~~~ln~~gi~t~~~--~~G~~~wG~rT~s~-d~~~~~i~~rR~~~~i~~~i~~~~~~~v~e~n~~~~~ 313 (390) T protein:vir:10 237 DVSWDLQDPATDAGYLNEHEVTTLVN--RNGFRFWGERTCSD-DPKFAFENYTRTAQVAGDSIAEAQMPVVDGPLNPSLA 313 (390) T ss_pred ecccccccccchhhhhhhcCcEEEEc--CCCEEEEcccccCC-CcccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHH Confidence 2222 2345778999999999964 68999999999865 6899999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHhCCceeeeEEEEccCCCCHHHhhCCeEEEEEEEEecCCceEEEEEEEEEecCce-eeeeecc Q lcl|NC_013693. 549 SLFSNAVRPYIRQLANMGAIYDGQVKCDADNNTADIIAANQMVAGIWLKPEYSINWVYLDFAAVRPDME-FSEIETG 624 (631) Q Consensus 549 ~~i~~~i~~~l~~l~~~gal~g~~v~~~~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~-~~e~~~~ 624 (631) ++|++++++||++||++|+|+||+|+||+++||+++|++|+|+++|+++|++|+|||+|++++..+..+ +.+-+++ T Consensus 314 ~~i~~~i~~~L~~l~~~g~l~g~~v~~d~~~nt~~~i~~G~~~~~v~~~p~~pae~I~~~~~~~~~~~~~~~~~~~~ 390 (390) T protein:vir:10 314 RDIVESINGWFRQQVANGYLIGGSAWIDPEPNTADILASGKAYIDYDYTPVPPLENLVLRQRITDRFLADFPARVAG 390 (390) T ss_pred HHHHHHHHHHHHHHHhCCceeeeEEEEccCCCCHHHhhCCeEEEEEEEEecCCcceEEEEEEEchHHHHHHHHHhcC Confidence 999999999999999999999999999999999999999999999999999999999999998866432 3333333 No 30 >protein:vir:78206 Length: 390 # NCBI annotation: gp33, phage tail sheath protein # Family: family:all:115 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111183;genbank:gi:134288694;genbank:GeneID:4960666 Probab=100.00 E-value=3.4e-94 Score=532.99 Aligned_cols=377 Identities=17% Similarity=0.146 Sum_probs=293.3 Q ss_pred CCCcchhcCCceEEEEecCCCcee-cccCCceEEEeeeccC-----cCCCCeEEecCHHHHHHHcCCCCccchhHHHHHH Q lcl|NC_013693. 1 MATQSFSVAPSVQWTERDATLQTS-PSVVVQGATVGKFQWG-----EAELPVLVTGGETGLVKKFFKPNDATATDFLVIA 74 (631) Q Consensus 1 m~~~~~ylsPGVyveEv~~~~~~~-gv~tsv~afvG~~~~G-----p~~~p~~i~s~~~e~~~~fG~~~~~~~~~~av~~ 74 (631) |++ |++|||||+|++.+++++ .++|++++|||.++++ |+++|++++|+ .+|...||. .+++.+++.. 
T Consensus 1 M~~---~~~~Gv~v~e~~~g~~~i~~~~tav~g~vg~a~~ad~~~~pln~pv~i~s~-~~~~~~~g~---~gtL~~al~~ 73 (390) T protein:vir:78 1 MPQ---DYHHGVRVIEINEGGRPIRSVSTAVLGVVCTAADADASAFPLNTPVLLTNV-VAALGKAGK---KGTLRRTLDA 73 (390) T ss_pred Ccc---cccCCeEEEEcCCCcccccccCcceeEEEEcccCcCccccccccceEeccH-HHHHhhcCC---Cceehhhhhh Confidence 665 669999999999999888 7999999999999875 99999999776 699999995 6789999999 Q ss_pred HHHhCCceEEEEEecccCCCcccccccchhhhccccccccccccceeeehhhhhhhhhhchhhhhccCcccceeecccee Q lcl|NC_013693. 75 DFLSYSSVAWVTRVVGPAARNAVTKGQTAILIRNKLDFETASPSASITWTGRYAGSLGNDVAINVCDAAGFPTWEFRNNF 154 (631) Q Consensus 75 fF~ngG~~~~vvRv~~~~a~~a~~~~~~~~~~~~~~~~~~~~~~~~l~~~a~~~G~~gn~l~v~v~~~~~~~~~~~~~~~ 154 (631) ||.|||..|||||+.......+..+ .+ .|. T Consensus 74 ~~~~gg~~~~vv~v~~~~~~~~~~~----------------------~~----ig~------------------------ 103 (390) T protein:vir:78 74 IGKQTKPLTVVVRVAEGKDADETTS----------------------NV----IGT------------------------ 103 (390) T ss_pred hccccCceEEEEEeccccccccccc----------------------cc----ccc------------------------ Confidence 9999999999999853221100000 00 000 Q ss_pred eeecccccceEEeeeeeeeeecccccccceeeeeecccccccceeEeecccccccccccccccccccccccccccccccc Q lcl|NC_013693. 155 AYAPQAGEYHIVIVDKVGRITDSSGAVGQVDRISVSGTATGAGSISVAGEDVAYTDTDTPATLATKIGTALTALTDVYSS 234 (631) Q Consensus 155 ~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 234 (631) ....+. .+. .... T Consensus 104 --~~~~~~------------------------------~tg-----------------------------~~al------ 116 (390) T protein:vir:78 104 --VTPDGK------------------------------YTG-----------------------------IKAL------ 116 (390) T ss_pred --cccccc------------------------------cch-----------------------------hhhh------ Confidence 000000 000 0000 Q ss_pred cccccccccccccccccccccceeecccccccceeeeecccccccceeeeeeeeecccccccchhhhhhhhhccccceee Q lcl|NC_013693. 
235 VVVKSNTVTVTHKAIGPQTVTAIVPDANGLTATAVTTTVGASGSIIEKYELMQATQGSKKSDGSNAYFKDVINDTSNWVY 314 (631) Q Consensus 235 ~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 314 (631) . .. T Consensus 117 --------------------~-------------------------~~-------------------------------- 119 (390) T protein:vir:78 117 --------------------L-------------------------AA-------------------------------- 119 (390) T ss_pred --------------------h-------------------------hh-------------------------------- Confidence 0 00 Q ss_pred eecccccccccccccccccchhhhhhHHHHHhhhhhcccceeEEecc---ccchHHHHHHHHHhhccceEEeeccccccc Q lcl|NC_013693. 315 TFATTLAAGVTELEGGVDDYTGNRVAAIEALNNAEAYDAKPVFAFCE---ELIEQQTLIDLSTERKDTVSFVSPLRDVVV 391 (631) Q Consensus 315 ~~~~~~~~~~~~l~gg~d~~~~~~~~~~~~l~~~~~~~~~~~i~~~~---~~~~~~~~~~~~~~~~~~~a~~d~~~~~~~ 391 (631) .......|.++.++ ...++.++..+|++++ +++++|.| T Consensus 120 ---------------------------------~~~~~~~p~il~ap~~~~~~v~~~l~~~a~~~~-~~aivD~p----- 160 (390) T protein:vir:78 120 ---------------------------------QGALGVKPRILAAPGLDTQPVAAALAATAQSLR-AMAYVSAS----- 160 (390) T ss_pred ---------------------------------hhhhcceehhhcccccchHHHHHHHHHhhcccc-eEEEEecC----- Confidence 00000000000000 0125667888898765 68888876 Q ss_pred ccccCCHHHHHHHHHhcCCCcceEEEecCeeEEEeccCCceeEeehHHHHHHHHHHhhccCCceecccceeeceeeeccc Q lcl|NC_013693. 392 GNRGREMEDVVAWRESLVRDSSYFFMDDNWAYVYDKYNDKMRWIPACGGTAGVWARSIEIAGIYKSPAFHNRGKYNNYNR 471 (631) Q Consensus 392 ~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~ag~~a~~d~~~g~~~span~~~~~i~g~~~ 471 (631) ...+.+++++||+++ +|+|+++||||++++|+.++..+++|||+++||++||+|.++|||+||||+++.++.+... 
T Consensus 161 --~~~t~~~a~~~~~~~--~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~Agl~a~~D~~~g~~~spaN~~l~gi~~~~~ 236 (390) T protein:vir:78 161 --GCKTKEEAAAYRKQF--GQREIMVIWPDWLGWDDTTNSTAVIPAPAIAAGLRAKIDNDIGWHKTISNVVVNGVSGISA 236 (390) T ss_pred --CCCCHHHHHHHhhcc--CCceEEEEcCceEeecccCCcccccchHHHHHHHHHHhhcCCCcEECcCCceeeceeecce Confidence 357889999999976 5899999999999999999999999999999999999999999999999999665555321 Q ss_pred -ceec--CChhHhhhhhhcCceEEEEEcCCcEEEEcceecCCCChhhceehhhHHHHHHHHHHHHHHHHHhcCCCCHHHH Q lcl|NC_013693. 472 -MAWS--ASSDERAVLYRNQINSIVTFSNEGIVLYGDKTGLTRPSAFDRINVRGLFIMAEQNIAAIAKYYLGENNDEFTR 548 (631) Q Consensus 472 -~~~~--~~~~~~~~L~~~gin~i~~~~~~G~~~wg~rT~~~~~~~~~~i~vrR~~~~i~~~~~~~~~~~v~epn~~~~~ 548 (631) ..+. ..+.|.+.||++||+++++ ++|+++||+||+++ |++|+||+||||+++|+++|+++++|+|||||++.+| T Consensus 237 ~~~~~~~~~~~~~~~ln~~gi~t~~~--~~G~~~wG~rT~s~-d~~~~~i~~rR~~~~i~~~i~~~~~~~v~e~n~~~~~ 313 (390) T protein:vir:78 237 DVSWDLQDPATDAGYLNEHEVTTLVN--RNGFRFWGERTCSD-DPKFAFENYTRTAQVAGDSIAEAQMPVVDGPLNPSLA 313 (390) T ss_pred ecccccccccchhhhhhhcCcEEEEc--CCCEEEEcccccCC-CcccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHH Confidence 2222 2345778999999999964 68999999999865 6899999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHhCCceeeeEEEEccCCCCHHHhhCCeEEEEEEEEecCCceEEEEEEEEEecCce-eeeeecc Q lcl|NC_013693. 
549 SLFSNAVRPYIRQLANMGAIYDGQVKCDADNNTADIIAANQMVAGIWLKPEYSINWVYLDFAAVRPDME-FSEIETG 624 (631) Q Consensus 549 ~~i~~~i~~~l~~l~~~gal~g~~v~~~~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~-~~e~~~~ 624 (631) ++|++++++||++||++|+|+||+|+||+++||+++|++|+|+++|+++|++|+|||+|++++..+..+ +.+-+++ T Consensus 314 ~~i~~~i~~~L~~l~~~g~l~g~~v~~d~~~nt~~~i~~G~~~~~v~~~p~~pae~I~~~~~~~~~~~~~~~~~~~~ 390 (390) T protein:vir:78 314 RDIVESINGWFRQQVANGYLIGGSAWIDPEPNTADILASGKAYIDYDYTPVPPLENLVLRQRITDRFLADFPARVAG 390 (390) T ss_pred HHHHHHHHHHHHHHHhCCceeeeEEEEccCCCCHHHhhCCeEEEEEEEEecCCcceEEEEEEEchHHHHHHHHHhcC Confidence 999999999999999999999999999999999999999999999999999999999999998866432 3333333 No 31 >protein:vir:96740 Length: 388 # NCBI annotation: phage tail sheath protein FI-like # Family: family:all:115 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039839;genbank:gi:126010913;genbank:GeneID:5076250 Probab=100.00 E-value=1.7e-92 Score=523.63 Aligned_cols=371 Identities=14% Similarity=0.070 Sum_probs=308.2 Q ss_pred CCCcchhcCCceEEEEecCCCcee-cccCCceEEEeeeccC----cCCCCeEEecCHHHHHHHcCCCCccchhHHHHHHH Q lcl|NC_013693. 1 MATQSFSVAPSVQWTERDATLQTS-PSVVVQGATVGKFQWG----EAELPVLVTGGETGLVKKFFKPNDATATDFLVIAD 75 (631) Q Consensus 1 m~~~~~ylsPGVyveEv~~~~~~~-gv~tsv~afvG~~~~G----p~~~p~~i~s~~~e~~~~fG~~~~~~~~~~av~~f 75 (631) |+-+.+|+| ||||+|++.+++++ +++|++++|||.++.+ |.++|+++.++ .++++.||.......+..++..+ T Consensus 1 m~~~~~~~h-Gv~v~ev~~g~~~i~~~~tavi~~Vgta~~ad~~~p~~~~~~i~~~-~d~~~~~~~~~~~gtl~~al~~~ 78 (388) T protein:vir:96 1 MPVIDQFEH-NGISIETHEPPPPMGPPGDNVVAWVVTAPDKHADVAFSVPFRVANT-ADAQYLDSTGNELGTGWHAASET 78 (388) T ss_pred CCCCCCCCC-ceEEEEcCCCcccccccCcceeEEEEecCCCccccccccceeeecc-hhhhhhhccccccccchhhhHhh Confidence 998888985 99999999999988 7999999999999765 89999999887 69999999888888999999999 Q ss_pred HHhCCceEEEEEecccCCCcccccccchhhhccccccccccccceeeehhhhhhhhhhchhhhhccCcccceeeccceee Q lcl|NC_013693. 
76 FLSYSSVAWVTRVVGPAARNAVTKGQTAILIRNKLDFETASPSASITWTGRYAGSLGNDVAINVCDAAGFPTWEFRNNFA 155 (631) Q Consensus 76 F~ngG~~~~vvRv~~~~a~~a~~~~~~~~~~~~~~~~~~~~~~~~l~~~a~~~G~~gn~l~v~v~~~~~~~~~~~~~~~~ 155 (631) |.|+|..||++|+.......+.. T Consensus 79 ~~~~~~~~~vv~v~~g~~~~at~--------------------------------------------------------- 101 (388) T protein:vir:96 79 LKKTSVPQYFIVVPEGADDAATM--------------------------------------------------------- 101 (388) T ss_pred hccCCceEEEEEecccccccccc--------------------------------------------------------- Confidence 99999999999984321000000 Q ss_pred eecccccceEEeeeeeeeeecccccccceeeeeecccccccceeEeeccccccccccccccccccccccccccccccccc Q lcl|NC_013693. 156 YAPQAGEYHIVIVDKVGRITDSSGAVGQVDRISVSGTATGAGSISVAGEDVAYTDTDTPATLATKIGTALTALTDVYSSV 235 (631) Q Consensus 156 ~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 235 (631) T Consensus 102 -------------------------------------------------------------------------------- 101 (388) T protein:vir:96 102 -------------------------------------------------------------------------------- 101 (388) T ss_pred -------------------------------------------------------------------------------- Confidence Q ss_pred ccccccccccccccccccccceeecccccccceeeeecccccccceeeeeeeeecccccccchhhhhhhhhccccceeee Q lcl|NC_013693. 236 VVKSNTVTVTHKAIGPQTVTAIVPDANGLTATAVTTTVGASGSIIEKYELMQATQGSKKSDGSNAYFKDVINDTSNWVYT 315 (631) Q Consensus 236 ~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 315 (631) T Consensus 102 -------------------------------------------------------------------------------- 101 (388) T protein:vir:96 102 -------------------------------------------------------------------------------- 101 (388) T ss_pred -------------------------------------------------------------------------------- Confidence Q ss_pred ecccccccccccccccccchhhhhhHHHHHhhhhhcccceeEEecccc----chHHHHHHHHHhhccceEEeeccccccc Q lcl|NC_013693. 
316 FATTLAAGVTELEGGVDDYTGNRVAAIEALNNAEAYDAKPVFAFCEEL----IEQQTLIDLSTERKDTVSFVSPLRDVVV 391 (631) Q Consensus 316 ~~~~~~~~~~~l~gg~d~~~~~~~~~~~~l~~~~~~~~~~~i~~~~~~----~~~~~~~~~~~~~~~~~a~~d~~~~~~~ 391 (631) .+++|+.+..++ .+.++.+++.++..|.|+++|++ .++.++.++|+++ ++|+++|.|. T Consensus 102 ---------a~iig~~~~~tg----~~~gl~al~~~~~~p~il~aPg~s~~~~v~~al~~~~~~~-~~~~i~D~p~---- 163 (388) T protein:vir:96 102 ---------ANIIGGIDPTTG----RRTGIAALTECTERPTLIGAPGFSQNKAVIDALASMAKRL-KCRAVIDGPS---- 163 (388) T ss_pred ---------ceeeeecccccc----hhhHHHHhhhcccceeEEEeeccccchHHHHHHHHHHhhc-CcEEEEeccC---- Confidence 000001111111 12233334444445555555543 4678889999987 4899999873 Q ss_pred ccccCCHHHHHH---HHHhcCCCcceEEEecCeeEEEeccCCceeEeehHHHHHHHHHHhhccCCceecccceeeceeee Q lcl|NC_013693. 392 GNRGREMEDVVA---WRESLVRDSSYFFMDDNWAYVYDKYNDKMRWIPACGGTAGVWARSIEIAGIYKSPAFHNRGKYNN 468 (631) Q Consensus 392 ~~~~~~~~~~~~---~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~ag~~a~~d~~~g~~~span~~~~~i~g 468 (631) .+.+++.+ ++...+++|+|+++||||++++|+.++..+++|||+++||++||+| +||||||++++ +.| T Consensus 164 ----~~~~~~~~~~~~~~~~~~~s~~~~~~~P~~~~~d~~~~~~~~~p~s~~~AG~~a~~D----~~~spaN~~i~-i~g 234 (388) T protein:vir:96 164 ----GSTQDAIDLSGLLGGEGTGHDRVYMVDPMPAIYSRKAQGNIYVPPSTIAMGAVAAVK----PWESPGNQGVL-IQD 234 (388) T ss_pred ----CchhHHHHHHhhhhccCcCcceEEEEeCceeeecccCCceeeechHHHHHHHHHhhc----CcccccCeeEE-eee Confidence 23344444 4455677899999999999999999999999999999999999999 59999999984 666 Q ss_pred cc---cceecCChhHhhhhhhcCceEEEEEcCCcEEEEcceecCCCChhhceehhhHHHHHHHHHHHHHHHHHhcCCCCH Q lcl|NC_013693. 469 YN---RMAWSASSDERAVLYRNQINSIVTFSNEGIVLYGDKTGLTRPSAFDRINVRGLFIMAEQNIAAIAKYYLGENNDE 545 (631) Q Consensus 469 ~~---~~~~~~~~~~~~~L~~~gin~i~~~~~~G~~~wg~rT~~~~~~~~~~i~vrR~~~~i~~~~~~~~~~~v~epn~~ 545 (631) +. 
+....+++.|++.||++|||+|++|+++|+++||+||++ |+|||||||++||+++|++.++|+|||||++ T Consensus 235 ~~~~~~~~~~~~~~~~~~Ln~~gI~~i~~~~~~G~~~wG~rT~~-----~~~i~vrR~~~~i~~si~~~~~~~v~epn~~ 309 (388) T protein:vir:96 235 VARVIDYNILDKSTEGDLLNRNGVSYFARTSMGGFSLIGNRTVT-----GKFISFVGLEDAIARKLEAASQRAMSKQLTK 309 (388) T ss_pred ecccccccccCChhhHHhhhhcCceEEEEecCCcEEEEcccccC-----CcceeehhhHHHHHHHHHHHHHHhccCCCCH Confidence 54 234555678999999999999999999999999999973 9999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHhCCceeeeEEEEccCCCCHHHhhCCeEEEEEEEEecCCceEEEEEEEEEecCce--eeeee Q lcl|NC_013693. 546 FTRSLFSNAVRPYIRQLANMGAIYDGQVKCDADNNTADIIAANQMVAGIWLKPEYSINWVYLDFAAVRPDME--FSEIE 622 (631) Q Consensus 546 ~~~~~i~~~i~~~l~~l~~~gal~g~~v~~~~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~--~~e~~ 622 (631) .+|++|+++|+.||++||++|+|+||+|+||+++||+++|++|+|+++|+++|++|+|||+|+++++.+..+ |++++ T Consensus 310 ~~~~~i~~~i~~fL~~l~~~Gal~g~~~~~d~~~nt~~~i~~G~~~~~i~~~p~~pae~I~~~~~~~~~~~~~~~~~~~ 388 (388) T protein:vir:96 310 SFMEQEIKKINLFMQDLVAAEIIPGGEVYLHPTLNTVERYKNGSWYIVIDYGRYSPNEHMIFHLNAVDRIVEEFIEEVL 388 (388) T ss_pred HHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHhhCCEEEEEEEEEecCCcceEEEEEEEchHHHHHHHHHhC Confidence 999999999999999999999999999999999999999999999999999999999999999999988887 88887 No 32 >protein:vir:10336 Length: 386 # NCBI annotation: ORF39 # Family: family:all:115 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758931;genbank:gi:27311206;genbank:GeneID:956116 Probab=100.00 E-value=4.6e-92 Score=521.31 Aligned_cols=377 Identities=18% Similarity=0.178 Sum_probs=287.3 Q ss_pred CCCcchhcCCceEEEEecCCCcee-cccCCceEEEeeeccC-----cCCCCeEEecCHHHHHHHcCCCCccchhHHHHHH Q lcl|NC_013693. 
1 MATQSFSVAPSVQWTERDATLQTS-PSVVVQGATVGKFQWG-----EAELPVLVTGGETGLVKKFFKPNDATATDFLVIA 74 (631) Q Consensus 1 m~~~~~ylsPGVyveEv~~~~~~~-gv~tsv~afvG~~~~G-----p~~~p~~i~s~~~e~~~~fG~~~~~~~~~~av~~ 74 (631) |++ |++|||||+|++.+++++ +++|++.+|||.++.+ |+++|+++.++ .++...||+ ...+..++.+ T Consensus 1 M~~---~~~~Gv~v~ev~~~~~~i~~v~tav~~~vg~a~~a~~~~~~~~~pv~i~s~-~~~~~~~g~---~~tl~~a~~~ 73 (386) T protein:vir:10 1 MAE---QYLHGAEVVEIDNGARPIRTAQSGVIGLVGTAPDADATAFPLNTPVLIAGS-RREAAKLGA---GGTLPQAIDG 73 (386) T ss_pred Ccc---ccCCCeEEEEcCCCcccccccCcceeEEEEecCCCCCcccccccceEecch-HHHHhhcCC---CcchhHHHHH Confidence 664 668999999999999888 7999999999998865 89999999876 699999986 5678899999 Q ss_pred HHHhCCceEEEEEecccCCCcccccccchhhhccccccccccccceeeehhhhhhhhhhchhhhhccCcccceeecccee Q lcl|NC_013693. 75 DFLSYSSVAWVTRVVGPAARNAVTKGQTAILIRNKLDFETASPSASITWTGRYAGSLGNDVAINVCDAAGFPTWEFRNNF 154 (631) Q Consensus 75 fF~ngG~~~~vvRv~~~~a~~a~~~~~~~~~~~~~~~~~~~~~~~~l~~~a~~~G~~gn~l~v~v~~~~~~~~~~~~~~~ 154 (631) +|.|||..||++++......... .. + T Consensus 74 ~~~~gg~~~~vv~~~~~~~~~~t---------------------------------~~--------~------------- 99 (386) T protein:vir:10 74 IFDQTGAVVVVIRVDEGVDSAAT---------------------------------QS--------N------------- 99 (386) T ss_pred HhccCceeEEEeecccccccccc---------------------------------ch--------h------------- Confidence 99999999999986321110000 00 0 Q ss_pred eeecccccceEEeeeeeeeeecccccccceeeeeecccccccceeEeecccccccccccccccccccccccccccccccc Q lcl|NC_013693. 155 AYAPQAGEYHIVIVDKVGRITDSSGAVGQVDRISVSGTATGAGSISVAGEDVAYTDTDTPATLATKIGTALTALTDVYSS 234 (631) Q Consensus 155 ~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 234 (631) . .+ .. .. 
T Consensus 100 --~--ig------------------------------~~-~~-------------------------------------- 106 (386) T protein:vir:10 100 --V--IG------------------------------KV-DA-------------------------------------- 106 (386) T ss_pred --h--hc------------------------------cc-cc-------------------------------------- Confidence 0 00 00 00 Q ss_pred cccccccccccccccccccccceeecccccccceeeeecccccccceeeeeeeeecccccccchhhhhhhhhccccceee Q lcl|NC_013693. 235 VVVKSNTVTVTHKAIGPQTVTAIVPDANGLTATAVTTTVGASGSIIEKYELMQATQGSKKSDGSNAYFKDVINDTSNWVY 314 (631) Q Consensus 235 ~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 314 (631) T Consensus 107 -------------------------------------------------------------------------------- 106 (386) T protein:vir:10 107 -------------------------------------------------------------------------------- 106 (386) T ss_pred -------------------------------------------------------------------------------- Confidence Q ss_pred eecccccccccccccccccchhhhhhHHHHHhhhhhcccceeEEeccccchHHHHHHHHHhhccceEEeecccccccccc Q lcl|NC_013693. 315 TFATTLAAGVTELEGGVDDYTGNRVAAIEALNNAEAYDAKPVFAFCEELIEQQTLIDLSTERKDTVSFVSPLRDVVVGNR 394 (631) Q Consensus 315 ~~~~~~~~~~~~l~gg~d~~~~~~~~~~~~l~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~a~~d~~~~~~~~~~ 394 (631) .+...++...+......+...+.+..++....+..+.+.|+...+++..++. .... T Consensus 107 -------------------~t~~~tgl~~l~~~~~~~~~~p~i~~ap~~~~~~~v~~~l~~~~~~~~~~~~-----~~~~ 162 (386) T protein:vir:10 107 -------------------DTEQYTGILALLSAENTVKVQPRILIAPGFSNQKAVADQLVSVADTAAWLCH-----SGWS 162 (386) T ss_pred -------------------ccchhhhhHHhhhhcccccccccccccccccchhHHHHHHHHhhcceEEEEE-----eCCC Confidence 0000000000000000000001111122222233333444433333433322 1234 Q ss_pred cCCHHHHHHHHHhcCCCcceEEEecCeeEEEeccCCceeEeehHHHHHHHHHHhhccCCceecccceeeceeeeccc-ce Q lcl|NC_013693. 
395 GREMEDVVAWRESLVRDSSYFFMDDNWAYVYDKYNDKMRWIPACGGTAGVWARSIEIAGIYKSPAFHNRGKYNNYNR-MA 473 (631) Q Consensus 395 ~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~ag~~a~~d~~~g~~~span~~~~~i~g~~~-~~ 473 (631) ..+.+++.+||+.+ .|+|+++||||++++|+.++..+++|||+++||++||+|.++||||||+|+++.++.|... +. T Consensus 163 ~~~~~~a~~~~~~~--~s~~~~~~~p~~~v~~~~~~~~~~~p~s~~~ag~~a~~D~~~G~~~spaN~~l~gv~~~~~~~~ 240 (386) T protein:vir:10 163 NTTDAAAITYRELF--GSRRCEVVDPWYKVWDVETSAHIIQPPSARHAGVMAKVHNTLGFWWSNSNQEILGIDGLCRPVD 240 (386) T ss_pred CCchHHHHHhhhcc--cccceEEecCceeeeccccccceeechHHHHHHHHHHhhhcCCcEEccCCceeecccccceecc Confidence 67778899999976 5899999999999999999999999999999999999999999999999999766655432 22 Q ss_pred --ecCChhHhhhhhhcCceEEEEEcCCcEEEEcceecCCCChhhceehhhHHHHHHHHHHHHHHHHHhcCCCCHHHHHHH Q lcl|NC_013693. 474 --WSASSDERAVLYRNQINSIVTFSNEGIVLYGDKTGLTRPSAFDRINVRGLFIMAEQNIAAIAKYYLGENNDEFTRSLF 551 (631) Q Consensus 474 --~~~~~~~~~~L~~~gin~i~~~~~~G~~~wg~rT~~~~~~~~~~i~vrR~~~~i~~~~~~~~~~~v~epn~~~~~~~i 551 (631) ...++.|++.||++||+++. +++|+++||+||+++ |+.|+||+||||+++|+++|+++++|+|||||++.+|++| T Consensus 241 ~~~~~~~~~~~~l~~~gi~~~~--~~~G~~~wG~rT~~~-d~~~~~i~vrR~~~~i~~~~~~~~~~~v~e~~~~~~~~~i 317 (386) T protein:vir:10 241 FKLDDPTCRANLLNAKEVTTTI--QQNGFRVWGDRTCSA-DSKWAFKNVVITNDMIADSLVRNHLWAVDRNITKTYVEDV 317 (386) T ss_pred cccccCcchhhhhhhcCcEEEE--cCCCEEEEcccccCC-CcccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHH Confidence 33357799999999999874 689999999999865 6799999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHhCCceeeeEEEEccCCCCHHHhhCCeEEEEEEEEecCCceEEEEEEEEEecCceeeeee Q lcl|NC_013693. 
552 SNAVRPYIRQLANMGAIYDGQVKCDADNNTADIIAANQMVAGIWLKPEYSINWVYLDFAAVRPDMEFSEIE 622 (631) Q Consensus 552 ~~~i~~~l~~l~~~gal~g~~v~~~~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~ 622 (631) +++|++||++||++|+|+||+|+||+++||+++|++|+|+++|+++|++|+|||+|+++|+..+ |++++ T Consensus 318 ~~~i~~~L~~l~~~g~l~g~~v~~d~~~nt~~~~~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~--~~~~~ 386 (386) T protein:vir:10 318 TEGVNNYLRHLKNIGAIAGGECWVDPELNSPDQIQQGKVYFDYDFSAYAPAEHITFRSHMVNGY--LTEVV 386 (386) T ss_pred HHHHHHHHHHHHhCCceeeeEEEEcccCCCHHHhhCCeEEEEEEEEecCCceeEEEEEEEehhH--HHhhC Confidence 9999999999999999999999999999999999999999999999999999999999987654 66666 No 33 >protein:vir:5833 Length: 742 # NCBI annotation: similar to tail sheath protein # Family: family:all:661 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835619;genbank:gi:30044022 Probab=100.00 E-value=5.2e-77 Score=438.78 Aligned_cols=526 Identities=17% Similarity=0.126 Sum_probs=268.4 Q ss_pred CCCcchhcC--------CceEEEEecCCCceecccCCceEEEeeeccCcCCCCeEEecCHHHHHHHcCCCCccchhHHHH Q lcl|NC_013693. 1 MATQSFSVA--------PSVQWTERDATLQTSPSVVVQGATVGKFQWGEAELPVLVTGGETGLVKKFFKPNDATATDFLV 72 (631) Q Consensus 1 m~~~~~yls--------PGVyveEv~~~~~~~gv~tsv~afvG~~~~Gp~~~p~~i~s~~~e~~~~fG~~~~~~~~~~av 72 (631) ----+||.. -|.|.|-+++-..--.---+|-+.+---++||+--|-+- .- -.|+..-|-+...-| .-- T Consensus 198 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~-~~~~~~~~~~~~~~~--~~~ 273 (742) T protein:vir:58 198 WVYFAEYGTPTSSLTLYKGFYLEGIDLNSFNKQFVVSIENITVNREKGQVLYPSFD-VV-VHFRDIRGVSANTEY--IRF 273 (742) T ss_pred cccccccCCCccceeeeecccccccccCcccceeeEEEeeeeecccCCceecccee-EE-EEEeeccCCCCCccc--eee Confidence 001122333 377777776432100101124444444567775433211 10 123333333321111 111 Q ss_pred HHHHHhCCceEEEEEecccCCCcccccccchhhhccccccccccccceeeehhhhhhhhhhchhhhhccCcccceeeccc Q lcl|NC_013693. 
73 IADFLSYSSVAWVTRVVGPAARNAVTKGQTAILIRNKLDFETASPSASITWTGRYAGSLGNDVAINVCDAAGFPTWEFRN 152 (631) Q Consensus 73 ~~fF~ngG~~~~vvRv~~~~a~~a~~~~~~~~~~~~~~~~~~~~~~~~l~~~a~~~G~~gn~l~v~v~~~~~~~~~~~~~ 152 (631) |+-=+|--+.-||+||.+.- +... ++..+..... -.-..+.+.+..+... .++...+. .+.. . T Consensus 274 ~~~~~~~~~~~~~~~~~~~~----~~~~-~~~~~~~g~~--~~n~~~~~~~~~~~~~-~~~~~~~s--------~~~~-~ 336 (742) T protein:vir:58 274 RQVNLNPESPNYIERVIGNM----TFEF-DGERIVTGGE--YPNQVPFLRVVVSQDI-KQNVAGVE--------KWVP-V 336 (742) T ss_pred eeeecCCCCcceeeecccce----eeee-ccceeeeccc--ccccccceeeEecccc-CcCcccee--------EEEe-c Confidence 22224556777888886421 1110 0000000000 0000011212111100 00000000 0000 0 Q ss_pred eeeeecccccceEEeeee---eeeeeccc----ccccceeeeeecccccccceeEeeccccccccccccccccccccccc Q lcl|NC_013693. 153 NFAYAPQAGEYHIVIVDK---VGRITDSS----GAVGQVDRISVSGTATGAGSISVAGEDVAYTDTDTPATLATKIGTAL 225 (631) Q Consensus 153 ~~~~~~~~g~~~~~~~~~---~~~v~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 225 (631) ........++........ ........ ...........+++. ........ +.......+... T Consensus 337 ~~~~~~~v~d~~~~~~~~~~v~~~~t~~~~~pp~~~~~~e~v~~ngG~---~f~v~s~~---------~~g~~i~~~~as 404 (742) T protein:vir:58 337 GFEGIYSVGDFTVIVNELTNVSIPVTDSAIIPPMRFTRIEQITLSGGA---SFSVISNQ---------PYGFNIQDSRHS 404 (742) T ss_pred cccccccccceeeeccccccceeeccccccCCcccccccceeecccCc---ceEEEEec---------ccCcceeccCcc Confidence 000000000000000000 00000000 000000000000000 00000000 000000000000 Q ss_pred ccccccccccccccccccccccccccccccceeecccccccceeeeecccccccceeeeeeeeecccccccchhhhhhhh Q lcl|NC_013693. 226 TALTDVYSSVVVKSNTVTVTHKAIGPQTVTAIVPDANGLTATAVTTTVGASGSIIEKYELMQATQGSKKSDGSNAYFKDV 305 (631) Q Consensus 226 ~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 305 (631) ........... +.......... .... ................ ... 
T Consensus 405 ~~~s~ln~~~~-------V~Gt~aa~~~~---------d~~t--~~~v~s~~~alp~~a~-----------------sv~ 449 (742) T protein:vir:58 405 YWLSPFKDDEL-------IIGTELVLPAL---------DVST--EFGVSSWEEALPEFSF-----------------LMP 449 (742) T ss_pred eEEeccCCceE-------EEeehhhcccc---------ccch--heeccccccccceeeE-----------------EEe Confidence 00000000000 00000000000 0000 0000000000000000 000 Q ss_pred hccccceeeeecccccccccccccccccchhhhhhHHHHHhhhhhcccceeEEecccc---chHHHHHHHHHhhccceEE Q lcl|NC_013693. 306 INDTSNWVYTFATTLAAGVTELEGGVDDYTGNRVAAIEALNNAEAYDAKPVFAFCEEL---IEQQTLIDLSTERKDTVSF 382 (631) Q Consensus 306 ~~~~~~~~~~~~~~~~~~~~~l~gg~d~~~~~~~~~~~~l~~~~~~~~~~~i~~~~~~---~~~~~~~~~~~~~~~~~a~ 382 (631) +..+. ..................+.+...++++ ++.++...+...++++ |++ .++.++.++|+.+++|+.. T Consensus 450 laGG~-dg~v~v~~~~~D~iG~~~~~d~~~adrT----GL~ALlev~eVtILiA-PG~t~~~v~aav~A~la~a~~Rl~v 523 (742) T protein:vir:58 450 FQGGS-DGYIRVDENEPDTIGRVKITPALLANYE----RLLPLLTEDQFDLVLT-PYLTFADHAGTVNAFINRAENRFLY 523 (742) T ss_pred ecCCc-cccccccCCCcccccccccccccccchh----HHHHhhhcCCCcEEEE-cCCCchHHHHHHHHHHHhhcCCeEE Confidence 00000 0000000000111111112222233333 3444444444444444 444 3567888999987776654 Q ss_pred e-ecccccccccccCCHHHHHHHHHhcCCCcceEEEecCeeEEEeccCCceeEeehHHHHHHHHHHhhccCCceecccce Q lcl|NC_013693. 383 V-SPLRDVVVGNRGREMEDVVAWRESLVRDSSYFFMDDNWAYVYDKYNDKMRWIPACGGTAGVWARSIEIAGIYKSPAFH 461 (631) Q Consensus 383 ~-d~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~ag~~a~~d~~~g~~~span~ 461 (631) + |.|. 
..++.+++++|++.+ +|+|+++||||+++.+ ++..+++|||+++||++||+|.++|+|+||+|+ T Consensus 524 L~D~P~------~~tt~~~A~a~r~~~--nSsraaly~PwVkv~d--~~~~r~vPpSgaIAGL~ARtD~erGvw~SPANr 593 (742) T protein:vir:58 524 LFDIAG------DDDTENLAISLAGYI--NSSFATTFFPWVRRLT--NKGMRTVPASLAAYRSIRTTDPETGLAPVGARR 593 (742) T ss_pred EEecCC------CCchHHHHHHHHhcc--CCceEEEEeceeeecc--CCcceeechHHHHHHHHHHhccCCceEecCCcc Confidence 4 4442 234557788888876 5899999999998876 467889999999999999999999999999998 Q ss_pred eeceeeecccceecCChhHhhhhhhcCceEEEEEcCCcEEEEcceecCCCChhhceehhhHHHHHHHHHHHHHHHHHhcC Q lcl|NC_013693. 462 NRGKYNNYNRMAWSASSDERAVLYRNQINSIVTFSNEGIVLYGDKTGLTRPSAFDRINVRGLFIMAEQNIAAIAKYYLGE 541 (631) Q Consensus 462 ~~~~i~g~~~~~~~~~~~~~~~L~~~gin~i~~~~~~G~~~wg~rT~~~~~~~~~~i~vrR~~~~i~~~~~~~~~~~v~e 541 (631) .+ +.+ ..+++.|++.||++|||+|++| ++|+++||+||++++|++|+|||||||++||+++|+++++|+||| T Consensus 594 gi--i~~-----~~~s~se~d~LN~~GINtIrsf-G~G~rlWGnRTlassDs~wryInVRRlfd~Ie~SI~~a~q~~VfE 665 (742) T protein:vir:58 594 GV--VTG-----EPVRQVDWEDLYNNRINPIVRV-GNDVLLFGQKTMLNVNSALNRINVRRLLIVMRNRISQILSSYLFE 665 (742) T ss_pred ee--eec-----cccchhhHHHHhhCCceEEEEC-CCcEEEEcceecCCCCcccceEeehhhHHHHHHHHHHHHHHhccC Confidence 53 222 3467899999999999999987 789999999999888899999999999999999999999999999 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEccCCCCHHHhhCCeEEEEEEEEecCCceEEEEEEEEEecCceee Q lcl|NC_013693. 
542 NNDEFTRSLFSNAVRPYIRQLANMGAIYDGQVKCDADNNTADIIAANQMVAGIWLKPEYSINWVYLDFAAVRPDMEFS 619 (631) Q Consensus 542 pn~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~~~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~ 619 (631) |||+.+|++|++++++||++||++|+|.||+|+||+ +||+++|++|+|+++|+++|++|||||+|+|+++++|++|+ T Consensus 666 PNd~~L~~sIk~sInafL~~L~aqGALlGfrV~lDe-tNTpeDI~~Gklvv~I~vAP~~PAEfI~lrf~it~tga~Fs 742 (742) T protein:vir:58 666 NNTSENRLRAEALVRQYLESLRLRGAVTDYEVAIDS-VTTPTDIDNNTLRARVTVQPARSIEYIDITFVITPTGVEIT 742 (742) T ss_pred CCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEcC-CCCHHHhhCCEEEEEEEEEccCCcceEEEEEEEEecccccC Confidence 999999999999999999999999999999999995 68899999999999999999999999999999999999999 No 34 >protein:vir:63742 Length: 562 # NCBI annotation: Tsh # Family: family:all:2449 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547629;genbank:GeneID:3783475 Probab=100.00 E-value=3e-71 Score=407.21 Aligned_cols=538 Identities=12% Similarity=0.040 Sum_probs=322.4 Q ss_pred CCC----cchhcCCceEEEEecCCCcee-cccCCceEEEeeeccCcCCCCeEEecCHHHHHHHcCCCCccchhHHHHHHH Q lcl|NC_013693. 1 MAT----QSFSVAPSVQWTERDATLQTS-PSVVVQGATVGKFQWGEAELPVLVTGGETGLVKKFFKPNDATATDFLVIAD 75 (631) Q Consensus 1 m~~----~~~ylsPGVyveEv~~~~~~~-gv~tsv~afvG~~~~Gp~~~p~~i~s~~~e~~~~fG~~~~~~~~~~av~~f 75 (631) ||- ...|++|||||||.+++++++ +++|++++|||.+++||.+.|++|+|| +||++.||+.+...+..+|+..| T Consensus 1 ~~~~~~~~~~~~~Pgv~~~~~~s~~~~~~~~~~~~~~~iG~a~~G~~~~~~~~~~~-~~~~~~fg~g~l~~~i~~a~~~~ 79 (562) T protein:vir:63 1 MAIEIYPRKPVSRPHTEISVDTSGIGGSSSGSEKILCLVGSATGGKPNAVYKVRNY-SQAKSVFRSGELLDAIERAWNPG 79 (562) T ss_pred CeeeeeCCCcccCCceEEEEecCCCcccCCCCCceEEEEEEeCCCCCceeEEEccH-HHHHHHhcCCchHHHHHHhcccc Confidence 442 134999999999999999988 999999999999999999999999998 69999999866555566666777 Q ss_pred HHhCCceEEEEEecccCCCcccccccchhhhccccccccccccceeeehhhhhhhhhhchhhhhccCcccceeeccceee Q lcl|NC_013693. 
76 FLSYSSVAWVTRVVGPAARNAVTKGQTAILIRNKLDFETASPSASITWTGRYAGSLGNDVAINVCDAAGFPTWEFRNNFA 155 (631) Q Consensus 76 F~ngG~~~~vvRv~~~~a~~a~~~~~~~~~~~~~~~~~~~~~~~~l~~~a~~~G~~gn~l~v~v~~~~~~~~~~~~~~~~ 155 (631) |.|||++||+|||.+.. .++.+ ...+++++..+|.|+|.+++.+.+......- ...+. T Consensus 80 ~~~g~~~~~~~rv~~a~--~a~~~------------------~~~~~~~a~~~g~~~n~i~v~~~~~~~~~~~--~~~v~ 137 (562) T protein:vir:63 80 EGTGAGDILAMRVEEAK--EATFE------------------AEGVKVSSTIYGADANDIQVALEDNTITGTK--RLSIV 137 (562) T ss_pred ccCCceEEEEEEcCCCc--cceeE------------------ecceeEEEeecccCCCeEEEEEecCCCCCCc--ceEEE Confidence 79999999999994432 22211 1248899999999999999998765332211 11111 Q ss_pred eecccccceEEeeeeeeeeecccccccceeeeeecccccccceeEeeccccccccccccccccccccccccccccccccc Q lcl|NC_013693. 156 YAPQAGEYHIVIVDKVGRITDSSGAVGQVDRISVSGTATGAGSISVAGEDVAYTDTDTPATLATKIGTALTALTDVYSSV 235 (631) Q Consensus 156 ~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 235 (631) .....-.......+....+............+................... .............. ... T Consensus 138 ~~~~~~~ev~~~~g~V~~i~y~g~~~~~~~~v~~~~~~~~a~~l~~~~g~~--------~v~~~~L~~g~~~~----~~~ 205 (562) T protein:vir:63 138 FAKERVNQVYDNLGSIFSIKYKGTEASATFTVAVDPVTFKATKLTLKAGDK--------TVKEYDLGSGAYAE----TNV 205 (562) T ss_pred ecCCCcchhhhhccceeeeeeecccccceEEEEecCcceeEEEEEeecCCc--------ceeEEEecCCccch----hHH Confidence 111100000000000111100000000000000000000000000000000 00000000000000 000 Q ss_pred ccccccccccccccccccccceeecccccccceeeeecccccccceeeeeeeeecccccccchhhhhhhhh--cccccee Q lcl|NC_013693. 236 VVKSNTVTVTHKAIGPQTVTAIVPDANGLTATAVTTTVGASGSIIEKYELMQATQGSKKSDGSNAYFKDVI--NDTSNWV 313 (631) Q Consensus 236 ~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~ 313 (631) ...... ..+ ..+..+ .+.....+...+..... ..... ......++ ...+.. 
.....++ T Consensus 206 l~~~in-------~~~-~~~aky---~~~~gn~i~~~~~d~~~---~~~vk----t~~~~v~t--~~~d~~~~~~~~~~v 265 (562) T protein:vir:63 206 LISDIN-------NLP-DFEAKF---FPIGDKNLTTDNFDAQI---DVDIK----TKEAYVKA--VGGDIEKQTAYNGYV 265 (562) T ss_pred HHHhhc-------ccc-ceEEEe---eccCCceeeeecccccc---ccchh----hhhhhhhh--hhhhhhhccccccee Confidence 000000 000 000000 00000000000000000 00000 00000000 000000 0001111 Q ss_pred eee----cccccccccccccccccchhhhhhHHHHHhhhhhcccceeEEeccccchHHHHHHHHHhhcc----ceEEeec Q lcl|NC_013693. 314 YTF----ATTLAAGVTELEGGVDDYTGNRVAAIEALNNAEAYDAKPVFAFCEELIEQQTLIDLSTERKD----TVSFVSP 385 (631) Q Consensus 314 ~~~----~~~~~~~~~~l~gg~d~~~~~~~~~~~~l~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~----~~a~~d~ 385 (631) ... ..........|.||.++.... .....|..++..+...+++...+.++|.++.+||+++++ ++++++. T Consensus 266 ~~~~~~~~~la~~~~~~LtGG~dGt~~~--~~~~al~ale~~~~~~i~~~t~d~av~~~l~a~vkr~~~~g~~~~aVlg~ 343 (562) T protein:vir:63 266 DFEFDRSKEIANFPLTKLTGGDNGTIPE--SWADKFSYFANEGGYYLVPLTSKQAVHAEALQFVRDCSYNGNPMRVFVGG 343 (562) T ss_pred eeeeccccceecccceeeecCCCCCchh--hHHHHHHHHHhCCcEEEEecCCCHHHHHHHHHHHHHHHhCCCcEEEEecC Confidence 110 011112345778888875431 223445555555555555666677889999999987765 7888765 Q ss_pred ccccccccccCCHHHHHHHHHhcCCCcceEEEecCeeEEEeccCCceeEeeh---HHHHHHHHHHhhccCCceeccccee Q lcl|NC_013693. 386 LRDVVVGNRGREMEDVVAWRESLVRDSSYFFMDDNWAYVYDKYNDKMRWIPA---CGGTAGVWARSIEIAGIYKSPAFHN 462 (631) Q Consensus 386 ~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~---s~~~ag~~a~~d~~~g~~~span~~ 462 (631) + .+.+++++......+ +++++++++|+....+. 
.+..+.+|+ ++++||++|..| +++||.|++ T Consensus 344 ~-------~~~~~~~~~~~a~~~--n~ervv~v~~~~~~~~~-~~~~~~~~~~~~aa~vAGl~A~~~----~~~SlT~~~ 409 (562) T protein:vir:63 344 G-------IGESMEQLFTRAIGL--QNERAGLIGFSGTVKMD-DGRSLKMPGYMFAAQVAGLTCGLE----IGEAITFKN 409 (562) T ss_pred C-------CCCCHHHHHHHhhhc--CCCcEEEEecCeeEECC-CCceeeechhHHHHHHHHHhhcCc----hhcCcccee Confidence 4 356788888877766 58899999998776554 456666776 789999999988 889999998 Q ss_pred eceeeecccceecCChhHhhhhhhcCceEEEEEcCCcEEEEcc-eec----CCCChhhceehhhHHHHHHHHHHHHHH-H Q lcl|NC_013693. 463 RGKYNNYNRMAWSASSDERAVLYRNQINSIVTFSNEGIVLYGD-KTG----LTRPSAFDRINVRGLFIMAEQNIAAIA-K 536 (631) Q Consensus 463 ~~~i~g~~~~~~~~~~~~~~~L~~~gin~i~~~~~~G~~~wg~-rT~----~~~~~~~~~i~vrR~~~~i~~~~~~~~-~ 536 (631) + . ..++...+++.|++.|+++|+++++..++++.++|.. +++ ...++.|++|+++|++|+|++.+++.+ + T Consensus 410 i---~-~~~v~~~~t~~e~~~li~~Gv~~l~~~~~~~v~~~~iv~~itT~t~~~~~~~~ki~viRv~D~i~~dir~~~~~ 485 (562) T protein:vir:63 410 I---A-IETLDTIYEGSQLDQLNESGIITAEFVRNRAVTNFRIVDDVTTFNDKTDPVKSEIGVGEANDFLVSELKISLDN 485 (562) T ss_pred e---c-cccccccCCHHHHHHHHhCCeEEEEEecCCcEEEEEeeccceecCCCCCchhhhhhhhHHHHHHHHHHHHHHHh Confidence 4 3 3466778999999999999999999988887877754 322 245689999999999999999998876 5 Q ss_pred HHhcCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEccCCCCHHHhhCCeEEEEEEEEecCCceEEEEEEEEEecCc Q lcl|NC_013693. 537 YYLGENNDEFTRSLFSNAVRPYIRQLANMGAIYDGQVKCDADNNTADIIAANQMVAGIWLKPEYSINWVYLDFAAVRPDM 616 (631) Q Consensus 537 ~~v~epn~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~~~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~ 616 (631) ||+++||+...|.+|+..+..||.+|++.|+|.+|+.. 
+-+..+..++++|++.++|+.|+|+|.+++....+.+ T Consensus 486 ~yiGk~Nn~~~r~~v~~~i~~~L~~l~~~gaI~~~~~~-----dv~v~~~~d~~~v~~~v~pv~~mekIy~ti~~~~~~~ 560 (562) T protein:vir:63 486 EYIGTKIIDTSASLVKNFVQSFLDRKKLAKEIQDYSPE-----EVQVVIEGDVARISLTVFPIRSMKKIEVSLVYRQQIL 560 (562) T ss_pred cCCccccChHHHHHHHHHHHHHHHHHHhCCcccCCCcc-----ceEEEecCCEEEEEEEEEEcccceEEEEEEEEeeeee Confidence 99999999999999999999999999999999998532 1122345678999999999999999999998765443 Q ss_pred eeeeeecc Q lcl|NC_013693. 617 EFSEIETG 624 (631) Q Consensus 617 ~~~e~~~~ 624 (631) + + T Consensus 561 ~------~ 562 (562) T protein:vir:63 561 T------A 562 (562) T ss_pred c------C Confidence 3 2 No 35 >protein:vir:102819 Length: 648 # NCBI annotation: tail sheath protein # Family: family:all:2449 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874082;genbank:gi:118197689;genbank:GeneID:4496011 Probab=100.00 E-value=1.2e-69 Score=398.51 Aligned_cols=556 Identities=12% Similarity=0.027 Sum_probs=297.7 Q ss_pred CCCcchhcC------CceEEEEecCCCcee-cccCCceEEEeeeccCcCCCCeEEecCHHHHHHHcCCCCccchhHHHHH Q lcl|NC_013693. 1 MATQSFSVA------PSVQWTERDATLQTS-PSVVVQGATVGKFQWGEAELPVLVTGGETGLVKKFFKPNDATATDFLVI 73 (631) Q Consensus 1 m~~~~~yls------PGVyveEv~~~~~~~-gv~tsv~afvG~~~~Gp~~~p~~i~s~~~e~~~~fG~~~~~~~~~~av~ 73 (631) ||+. .|.. |||||||+|++.++| ||+|++++|||.++|||+|+|++|+|| .||++.||+ ++|.+||+ T Consensus 1 ma~~-~yf~~~~~~~PGVyvee~~sg~~~i~gv~tsva~fvG~a~~Gp~~~p~~v~s~-~~~~~~fgg----g~l~~av~ 74 (648) T protein:vir:10 1 MAIS-VYFDGKLIKQLGAYVKTDLSAVKQINGVGTGIVALLGLAEGGETYKPYRLTSF-AEAVSIFKG----GPLLEHIK 74 (648) T ss_pred Ceee-eeeCCCCccCCceEEEEeccccccccCCCCceEEEEEeeCCCCCceeEEecCH-HHHHHHhcC----ccHHHHHH Confidence 9885 6755 999999999999988 999999999999999999999999887 799999997 46999999 Q ss_pred HHHHhCCceEEEEEecccCCCcccccccchhhhccccccccccccceeeehhhhhhhhhhchhhhhccCcccceeeccce Q lcl|NC_013693. 
74 ADFLSYSSVAWVTRVVGPAARNAVTKGQTAILIRNKLDFETASPSASITWTGRYAGSLGNDVAINVCDAAGFPTWEFRNN 153 (631) Q Consensus 74 ~fF~ngG~~~~vvRv~~~~a~~a~~~~~~~~~~~~~~~~~~~~~~~~l~~~a~~~G~~gn~l~v~v~~~~~~~~~~~~~~ 153 (631) +||+|||++||+|||.+.+...+ ....+.++++.+|.|||.+++.+....+......... T Consensus 75 ~~F~nGg~~~~~vRv~~~~~a~~--------------------~~~~~~~~a~~~g~~gn~i~~~v~~~~~~~~~~~~l~ 134 (648) T protein:vir:10 75 AAFIGGAGEVVAVRIGNPTTASV--------------------SIPVAQNTSDTSPANLNFVSYEASTRSNQIYVSFDLD 134 (648) T ss_pred HHHhCCCcEEEEEEcCCCcccce--------------------ecceeEEeecccCCCCCceEEEEEEcCCCcCceeEEE Confidence 99999999999999965432111 1235889999999999998755542222222222222 Q ss_pred eeeecccccceEEeeeeeeeeec---ccccccceeeeeecccccccceeEeecccccccccccccccccccccccccccc Q lcl|NC_013693. 154 FAYAPQAGEYHIVIVDKVGRITD---SSGAVGQVDRISVSGTATGAGSISVAGEDVAYTDTDTPATLATKIGTALTALTD 230 (631) Q Consensus 154 ~~~~~~~g~~~~~~~~~~~~v~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 230 (631) +... ..++. +......+.. ............... .+............................ ....... T Consensus 135 v~~~-~~~~~---~d~~v~~i~~~~~~y~gt~~~~t~~v~~-~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~-~~~~~v~ 208 (648) T protein:vir:10 135 ENFT-SANEA---DDTIIFTIYQKHPDFSVTRETFTFPRKF-TTPTVLVKRGSTLFFVDRSIVNAALAAGPA-FQTALIN 208 (648) T ss_pred EEec-CCCcc---cceeEEEeccCCCcccccceeccccccc-cccccccccccceeecCccchhhhhccCcc-chhhhhh Confidence 2211 11111 0000000000 000000000000000 000000000000000000000000000000 0000000 Q ss_pred cccccccccccccc--------ccccccccccccee------------ecccccccc----eeeeecccccccceeeeee Q lcl|NC_013693. 231 VYSSVVVKSNTVTV--------THKAIGPQTVTAIV------------PDANGLTAT----AVTTTVGASGSIIEKYELM 286 (631) Q Consensus 231 ~~~~~~~~~~~~~v--------~~~~~~~~~~~~~~------------~~~~~~~~~----~~~~~v~~~~~~~~~~~~~ 286 (631) .............+ .............. ....+.... ...+.+ ..........+. 
T Consensus 209 ~~~~~~~~~~~~~~~~~s~~~~~d~~~~~~~~~a~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~-~~tp~~~~~~~~ 287 (648) T protein:vir:10 209 LLKEQLQPTDVVQIFDASDTNPVDIPLGLFVYEVLYGGLFGFTKSRLVKTSFGTVDDLLSNPLLFNL-SATPFFDGSDYQ 287 (648) T ss_pred chhhhhhhhhhheecccccccccccccccccccccchhhhcCCcchhhhhhhccccccccccceecc-ccccccccccee Confidence 00000000000000 00000000000000 000000000 000000 000000000000 Q ss_pred eeeccc--ccccchhhhhhhhhccccceeeeecccccccccccccccccchhh---------hhhHH-HHHhhhhhcccc Q lcl|NC_013693. 287 QATQGS--KKSDGSNAYFKDVINDTSNWVYTFATTLAAGVTELEGGVDDYTGN---------RVAAI-EALNNAEAYDAK 354 (631) Q Consensus 287 ~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~gg~d~~~~~---------~~~~~-~~l~~~~~~~~~ 354 (631) ...... .............+. .........+...+.|.||.|+..+. .++.| +.|..++..+.. T Consensus 288 ~~~~~~~~~~~~~v~~~~~~~l~----~~~~~p~~~~~~~t~L~GGtdG~~p~s~~~~~~~~~~~d~~d~l~~~~~~~~~ 363 (648) T protein:vir:10 288 DYTSLSDPANWFAKDAYTINHLV----DTTINPHILATRIFSLSGGTNGDDGTGYYQTAVSNYINIWSQGLATLEEEEVN 363 (648) T ss_pred eeeccccccceeeeeccchhhcc----cccccCcccccccceecccccCCCcccccccccccchhhHHHHhhhccCCCce Confidence 000000 000000000000000 00111122233355788998876541 22334 556666655554 Q ss_pred eeEE-------------eccccchHHHHHHHHHhhcc---------ceEEeecccccccccccCCHHHHHH--HHHhcCC Q lcl|NC_013693. 355 PVFA-------------FCEELIEQQTLIDLSTERKD---------TVSFVSPLRDVVVGNRGREMEDVVA--WRESLVR 410 (631) Q Consensus 355 ~~i~-------------~~~~~~~~~~~~~~~~~~~~---------~~a~~d~~~~~~~~~~~~~~~~~~~--~~~~~~~ 410 (631) .++. ..+...++..+++|+..+.. .++++. ..++.+..+... .|..++. T Consensus 364 ~ivp~~~~~~~~~~~~~lt~~q~i~a~a~shv~~~s~~~~~~~r~~~~~~vg-------~~~~es~~~se~~~~~~~~~~ 436 (648) T protein:vir:10 364 FVIPAYKFTNVTQLNDRLTIFKGIASTFLSHVQTMSQVNRRKARVGVFGLPA-------PSPNESVTASEYLYNRNILNT 436 (648) T ss_pred EEEeecccccccccccccCCccchHHHHHHHHHHhhhccccccccCeEEEeC-------CCCchhHHHHHHHhhhhcccc Confidence 4443 22345677888888875521 133322 223444433322 2333322 Q ss_pred Cc--------ceEE-EecCeeEEEeccCCceeEeeh---HHHHHHHHHHhhccCCceecccceeeceeeecccceecCCh Q lcl|NC_013693. 
411 DS--------SYFF-MDDNWAYVYDKYNDKMRWIPA---CGGTAGVWARSIEIAGIYKSPAFHNRGKYNNYNRMAWSASS 478 (631) Q Consensus 411 ~s--------~~~~-~~~p~~~~~d~~~~~~~~~p~---s~~~ag~~a~~d~~~g~~~span~~~~~i~g~~~~~~~~~~ 478 (631) .+ ..++ ..+.+.. + ..+++..++|| .+++||+++++. +++||.||++.+ .+ ..+.+.+++ T Consensus 437 ~~a~~~~~d~~~~~~~~~~~~~-~-~~~G~~~~~p~~~~Aa~VAGl~a~l~----~~~s~T~k~i~~-~~-id~~~~~t~ 508 (648) T protein:vir:10 437 ISAMFGGTDRAQAVVFPFYSNV-F-NDEGKVELLGGEFFASYVAGMHANRE----PQDSITFLPISG-IG-AEPLYNWTY 508 (648) T ss_pred cceeeeecCCceEEeeccccee-E-CCCCcEEecchhhHHHHHHhhhhccc----cccCcccceeec-cc-cccccCCCH Confidence 11 1111 1122222 2 22567777888 678899999865 888999999542 22 334578999 Q ss_pred hHhhhhhhcCceEEEEEcCC----cEEEEcceecC--CCChhhceehhhHHHHHHHHHHHH-HHHHHhcCCCCHHHHHHH Q lcl|NC_013693. 479 DERAVLYRNQINSIVTFSNE----GIVLYGDKTGL--TRPSAFDRINVRGLFIMAEQNIAA-IAKYYLGENNDEFTRSLF 551 (631) Q Consensus 479 ~~~~~L~~~gin~i~~~~~~----G~~~wg~rT~~--~~~~~~~~i~vrR~~~~i~~~~~~-~~~~~v~epn~~~~~~~i 551 (631) .|+|.|+++||+||.+++++ ++++-.+-|.. ++++.|+.|+++|+.|++...+++ ..++|+++||++..|.+| T Consensus 509 ~qld~L~~~Gv~~ie~~~~~~~~~~~rvv~gITT~~~~~~~~~~eisv~ri~D~l~~~vr~~l~~~fIG~~n~~~~~~~i 588 (648) T protein:vir:10 509 TQKDDLISNRVLFVEKVKTSFGGIVYRIHHNPTTWLGPVTQGFQEFVLRRIDDFLQSYVYKNLQEQFIGRKSYGRKTEND 588 (648) T ss_pred HHHHHHhcCCcEEEEEecCCcceeeEEEeccceeecCCCCcceeeeeeeehhhHHHHHHHHHHhhhcCcccccHHHHHHH Confidence 99999999999999988764 34455444432 357889999999999999999987 556999999999999999 Q ss_pred HHHHHHHHHHHHhCCceeeeE---EEEccCCCCHHHhhCCeEEEEEEEEecCCceEEEEEEEEEecCce Q lcl|NC_013693. 
552 SNAVRPYIRQLANMGAIYDGQ---VKCDADNNTADIIAANQMVAGIWLKPEYSINWVYLDFAAVRPDME 617 (631) Q Consensus 552 ~~~i~~~l~~l~~~gal~g~~---v~~~~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~ 617 (631) ++.|.+||.++++.++|++|+ |.++ ++++++++++.+.|++|++||.+++..+..- + T Consensus 589 k~~i~~~L~~~~~~~~I~~y~~~~v~~~--------~~~~vv~V~~~v~Pv~~i~~I~vti~it~~~-~ 648 (648) T protein:vir:10 589 IKVYTEALLSNLVGKQIVAYKDVKVTSN--------EDKTVYYVEFFYQPVTEIKFILVTMKVTFDL-E 648 (648) T ss_pred HHHHHHHHhhHhhcCcccCcccceEEEE--------ecCCEEEEEEEEEecceeeEEEEEEEEEecc-C Confidence 999999999999999999975 4443 2458999999999999999999999876432 2 No 36 >protein:vir:80488 Length: 562 # NCBI annotation: Tsh # Family: family:all:2449 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468473;genbank:gi:157325048;genbank:GeneID:5601446 Probab=100.00 E-value=3.6e-69 Score=395.83 Aligned_cols=534 Identities=13% Similarity=0.053 Sum_probs=325.3 Q ss_pred CCCc----chhcCCceEEEEecCCCcee-cccCCceEEEeeeccCcCCCCeEEecCHHHHHHHcCCCCccchhHHHHHHH Q lcl|NC_013693. 1 MATQ----SFSVAPSVQWTERDATLQTS-PSVVVQGATVGKFQWGEAELPVLVTGGETGLVKKFFKPNDATATDFLVIAD 75 (631) Q Consensus 1 m~~~----~~ylsPGVyveEv~~~~~~~-gv~tsv~afvG~~~~Gp~~~p~~i~s~~~e~~~~fG~~~~~~~~~~av~~f 75 (631) ||-. ..|..|||||||.+++.+++ +++|++++|||.+++||.+.|++|+|| +||++.||+.+-..+..+|+..| T Consensus 1 ~~~~~~~~~~~~~pgv~~~~~~s~~~~~~~~~~~~~~~ig~a~~G~~~~~~~~~~~-~~~~~~f~~g~l~~~i~~a~~~~ 79 (562) T protein:vir:80 1 MAIEIYPRKPVSRPHTEISVDTSGIGGSSSGSEKILCLVGSATGGKPNAVYKVRNY-SQAKSVFRSGELLDAIERAWNPG 79 (562) T ss_pred CeeeeeCCCcccCCceEEEEecCCcccCCCCCCceEEEEEEeCCCCcceeEEEccH-HHHHHHhcCCChHHHHHHhcccc Confidence 6532 24899999999999999887 999999999999999999999999997 69999999866666677778888 Q ss_pred HHhCCceEEEEEecccCCCcccccccchhhhccccccccccccceeeehhhhhhhhhhchhhhhccCcccceeeccceee Q lcl|NC_013693. 
76 FLSYSSVAWVTRVVGPAARNAVTKGQTAILIRNKLDFETASPSASITWTGRYAGSLGNDVAINVCDAAGFPTWEFRNNFA 155 (631) Q Consensus 76 F~ngG~~~~vvRv~~~~a~~a~~~~~~~~~~~~~~~~~~~~~~~~l~~~a~~~G~~gn~l~v~v~~~~~~~~~~~~~~~~ 155 (631) |.|||++||+|||.+..+ ++. ....+++++..+|.|+|.+++.+.+......- ...+. T Consensus 80 ~~~g~~~~~~~rv~~a~~--a~~------------------~~~~~~~~~~~~g~~~n~i~v~~~~~~~~~~~--~~~v~ 137 (562) T protein:vir:80 80 EGTGAGDILAMRVEEAKE--ATF------------------EAEGVKVSSTIYGADANDIQVALEDNTITGTK--RLSIV 137 (562) T ss_pred cccCceEEEEEEcCCCCc--ceE------------------EecceEEEEeecccCCCceEEEEecCCCCCCc--ceEEE Confidence 899999999999954332 221 01248899999999999999998764322221 11111 Q ss_pred eecccccceEEe--eeeeeeeecccccccceeeeeecccccccceeEeeccccccccccccccccccc-ccccccccccc Q lcl|NC_013693. 156 YAPQAGEYHIVI--VDKVGRITDSSGAVGQVDRISVSGTATGAGSISVAGEDVAYTDTDTPATLATKI-GTALTALTDVY 232 (631) Q Consensus 156 ~~~~~g~~~~~~--~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~ 232 (631) .... ...+.. .+....+............+................... ......+.... ........... T Consensus 138 ~~~~--~~~ev~~~~g~v~~i~y~g~~~~a~~~i~~~~~~~~a~~l~~~~g~~----~v~~~~l~~g~~~~~~~l~~~i~ 211 (562) T protein:vir:80 138 FAKE--RVNQVYDNLGSIFSIKYKGTEASATFTVAVDPVTFKATKLTLKAGDK----TVKEYDLGSGAYAETNVLISDIN 211 (562) T ss_pred ecCC--cceEEeeccCceeeeeeccccccceeEEEecCccceEEEEEEecCCc----ceeEEEeCCCccchhhhhhhhhc Confidence 1111 111110 001111100000000000000000000000000000000 00000000000 00000000000 Q ss_pred cccccccccccccccccccccccceeecccccccceeeeeccccc---ccceeeeeeeeecccccccchhhhhhhhhccc Q lcl|NC_013693. 233 SSVVVKSNTVTVTHKAIGPQTVTAIVPDANGLTATAVTTTVGASG---SIIEKYELMQATQGSKKSDGSNAYFKDVINDT 309 (631) Q Consensus 233 ~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 309 (631) . ...+..+-.+ .....+........ .............+. .. ..+.. 
T Consensus 212 ~-------~~~~tAky~g-------------~~~n~i~~~~~d~~~~~~~kt~~~~v~~~~~d------~~----~~~~~ 261 (562) T protein:vir:80 212 N-------LPDFEAKFFP-------------IGDKNLTTDNFDAQIDVDIKTKEAYVKAVGGD------IE----KQTAY 261 (562) T ss_pred c-------ccceEEEecc-------------cCCceeeecccccchhhhcccceeeeeehhhh------hh----hcccc Confidence 0 0000000000 00000000000000 000000000000000 00 00000 Q ss_pred cceeeee----cccccccccccccccccchhhhhhHHHHHhhhhhcccceeEEeccccchHHHHHHHHHhhcc----ceE Q lcl|NC_013693. 310 SNWVYTF----ATTLAAGVTELEGGVDDYTGNRVAAIEALNNAEAYDAKPVFAFCEELIEQQTLIDLSTERKD----TVS 381 (631) Q Consensus 310 ~~~~~~~----~~~~~~~~~~l~gg~d~~~~~~~~~~~~l~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~----~~a 381 (631) ..++... ..........|.||.|+.... ...+.|..++..+...+++...+.++|..+.+||+++++ +++ T Consensus 262 n~~v~~~~~~~~~la~~~~~~LtGG~dG~~~~--~~~dal~~Le~~~~~~i~~~t~d~ai~~~~~a~vkr~r~~g~~~~a 339 (562) T protein:vir:80 262 NGYVEFEFDRSKEIANFPLTKLTGGDNGTIPE--SWADKFSYFANEGGYYLVPLTSKQAVHAEALQFVRDCSYNGNPMRV 339 (562) T ss_pred cceEEEEeccCccccccceeeeeCCCCCCccc--cHHHHHHHHHhCCcEEEEecCCChHHHHHHHHHHHHHHhCCCeEEE Confidence 1111111 111112345788998875431 123445555555555556666677889999999988765 778 Q ss_pred EeecccccccccccCCHHHHHHHHHhcCCCcceEEEecCeeEEEeccCCceeEeeh---HHHHHHHHHHhhccCCceecc Q lcl|NC_013693. 382 FVSPLRDVVVGNRGREMEDVVAWRESLVRDSSYFFMDDNWAYVYDKYNDKMRWIPA---CGGTAGVWARSIEIAGIYKSP 458 (631) Q Consensus 382 ~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~---s~~~ag~~a~~d~~~g~~~sp 458 (631) +++.+ .+.+++++++....+ +++++++++|+..+.+. 
.+..+..|+ ++++||++|..| +++|| T Consensus 340 Vvg~~-------~~~~~~~~~~~a~~~--n~e~vv~v~~~~~~~~~-~~~~~~~~~~~~aa~vAGl~Ag~~----~~~S~ 405 (562) T protein:vir:80 340 FVGGG-------IGESMEQLFTRAIGL--QNERAGLIGFSGTVKMD-DGRSLKMPGYMFAAQVAGLTCGLE----IGEAI 405 (562) T ss_pred EecCC-------CCCCHHHHHHHhhhc--CCCeEEEEecCeeEECC-CCceeeechhHHHHHHHHHHhcCc----cccCc Confidence 87654 467888888888866 58899999998776654 455566666 889999999988 78899 Q ss_pred cceeeceeeecccceecCChhHhhhhhhcCceEEEEEcCCcEEEEc----ceec-CCCChhhceehhhHHHHHHHHHHHH Q lcl|NC_013693. 459 AFHNRGKYNNYNRMAWSASSDERAVLYRNQINSIVTFSNEGIVLYG----DKTG-LTRPSAFDRINVRGLFIMAEQNIAA 533 (631) Q Consensus 459 an~~~~~i~g~~~~~~~~~~~~~~~L~~~gin~i~~~~~~G~~~wg----~rT~-~~~~~~~~~i~vrR~~~~i~~~~~~ 533 (631) .|+++ .+ .++...+++.|++.|+++|++++++.++++.++|. -.|. ..+++.|++|+++|++|+|++.|++ T Consensus 406 T~~~i---~~-~~v~~~lt~~e~~~li~~G~l~l~~~~~~~v~~~riv~~itT~t~~~~~~~~ki~viRv~D~i~~dir~ 481 (562) T protein:vir:80 406 TFKNI---AI-ETLDTIYEGSQLDQLNESGIITAEFVRNRAVTNFRIVDDVTTFNDKTDPVKSEIGVGEANDFLVSELKI 481 (562) T ss_pred cceee---cc-ccccccCCHHHHHHHHhCCeEEEEEecCCcEEEEEeeccceeccCCCCchhhhhhhhHHHHHHHHHHHH Confidence 99985 33 35677899999999999999999998887777772 2332 2457899999999999999999998 Q ss_pred HH-HHHhcCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEccCCCCHHHhhCCeEEEEEEEEecCCceEEEEEEEEE Q lcl|NC_013693. 534 IA-KYYLGENNDEFTRSLFSNAVRPYIRQLANMGAIYDGQVKCDADNNTADIIAANQMVAGIWLKPEYSINWVYLDFAAV 612 (631) Q Consensus 534 ~~-~~~v~epn~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~~~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~ 612 (631) .+ +||+++||+...|..|+..+..||.+|++.|+|.+|... +-+.++.+++++|++.++|+.|+|||.+++... 
T Consensus 482 ~~~~~yIGk~Nn~~~r~~v~~~i~~~L~~l~~~gaI~~~~~~-----dv~v~~~~d~~~v~~~v~Pv~~mekIy~ti~~~ 556 (562) T protein:vir:80 482 SLDNEYIGTKIIDTSASLVKNFVQSFLDRKKLAKEIQDYSPE-----EVQVVIEGDIARISLTVFPIRSMKKIEVSLVYR 556 (562) T ss_pred HHHhcCCccccChHHHHHHHHHHHHHHHHHHhCCcccCCCcc-----ceEEEecCCEEEEEEEEEEcccceEEEEEEEEE Confidence 87 699999999999999999999999999999999998532 112235667899999999999999999999876 Q ss_pred ecCceeeeeecc Q lcl|NC_013693. 613 RPDMEFSEIETG 624 (631) Q Consensus 613 ~~~~~~~e~~~~ 624 (631) .+.++ + T Consensus 557 ~~~~~------~ 562 (562) T protein:vir:80 557 QQILT------A 562 (562) T ss_pred eeeec------C Confidence 54433 2 No 37 >protein:vir:95741 Length: 587 # NCBI annotation: ORF011 # Family: family:all:2449 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240910;genbank:gi:66394960;genbank:GeneID:5132682 Probab=100.00 E-value=4.6e-67 Score=384.26 Aligned_cols=559 Identities=12% Similarity=0.048 Sum_probs=319.6 Q ss_pred CCCc----chhcCCceEEEEecCCCcee-cccCCceEEEeeeccCcCCCCeEEecCHHHHHHHcCCCCccchhHHHHHHH Q lcl|NC_013693. 1 MATQ----SFSVAPSVQWTERDATLQTS-PSVVVQGATVGKFQWGEAELPVLVTGGETGLVKKFFKPNDATATDFLVIAD 75 (631) Q Consensus 1 m~~~----~~ylsPGVyveEv~~~~~~~-gv~tsv~afvG~~~~Gp~~~p~~i~s~~~e~~~~fG~~~~~~~~~~av~~f 75 (631) ||-. ..|..|||||||.+++.+++ +++|++++|||.++|||+++|++++++ +||++.||+.+....+-+|..+| T Consensus 1 ~a~~~~~~~~~~~pgv~~~~~~~~~~~~~~~~~~~~~~~g~a~~G~~~~~~~~~~~-~~~~~~~~~g~l~~~~~~a~~~~ 79 (587) T protein:vir:95 1 MAVEPFPRRPITRPHASIEVDTSGIGGSAGSSEKVFCLIGQAEGGEPNTVYELRNY-SQAKRLFRSGELLDAIELAWGSN 79 (587) T ss_pred CcccccCCcccccCceEEEEecCCccccCCCCCceEEEEEEeCCCCCceeEEeccH-HHHHHHhcCcchHHHHHHHhccc Confidence 7643 24789999999999999877 999999999999999999999999886 79999999866444455555666 Q ss_pred HHhCCceEEEEEecccCCCcccccccchhhhccccccccccccceeeehhhhhhhhhhchhhhhccCcccceeeccceee Q lcl|NC_013693. 
76 FLSYSSVAWVTRVVGPAARNAVTKGQTAILIRNKLDFETASPSASITWTGRYAGSLGNDVAINVCDAAGFPTWEFRNNFA 155 (631) Q Consensus 76 F~ngG~~~~vvRv~~~~a~~a~~~~~~~~~~~~~~~~~~~~~~~~l~~~a~~~G~~gn~l~v~v~~~~~~~~~~~~~~~~ 155 (631) |.|||++||++||.+..+..++ ...++++++.||.|||.+++.+.+...... ....+. T Consensus 80 ~~~g~~~~~~~rv~~~~~a~~~--------------------~~~l~~~a~~~G~~gN~i~v~~~~~~~~~~--~~~~~~ 137 (587) T protein:vir:95 80 PNYTAGRILAMRIEDAKPASAE--------------------IGGLKITSKIYGNVANNIQVGLEKNTLSDS--LRLRVI 137 (587) T ss_pred cCCCceEEEEEEcCCCceeEEE--------------------ecCeEEEEecccccccceEEEEecCCCCCc--eeEEEE Confidence 6899999999999544432111 124889999999999999998775432211 111111 Q ss_pred eecccccceEEeeeeeeeeecccccccceeeeeecccccccceeEeeccccc---ccccccc---ccccccccccccccc Q lcl|NC_013693. 156 YAPQAGEYHIVIVDKVGRITDSSGAVGQVDRISVSGTATGAGSISVAGEDVA---YTDTDTP---ATLATKIGTALTALT 229 (631) Q Consensus 156 ~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~---~~~~~~~~~~~~~~~ 229 (631) ..+......-........+................................. +...... ..............+ T Consensus 138 ~~~~~~~~~~~~~g~v~si~y~g~~~~~~~~v~~~~~t~~a~~~~l~~g~~~v~~yrL~~g~~~~~~~~~~~in~~~~~t 217 (587) T protein:vir:95 138 FQDDRFNEVYDNIGNIFTIKYKGEEANATFSVEHDEETQKASRLVLKVGDQEVKSYDLTGGAYDYTNAIITDINQLPDFE 217 (587) T ss_pred EecccceeeeeeccceeeeeeeccccccceeeeecccceeeeeeeeecCCceEEEEEecCCchHHHHHHHHhhccccceE Confidence 1110000000000000001100000000000000000000000000000000 0000000 000000000000011 Q ss_pred ccccccccccccccccccccccccccceeecccccccceeeeecccccccceeeeeeeeecccccccchhhhhhhhhccc Q lcl|NC_013693. 230 DVYSSVVVKSNTVTVTHKAIGPQTVTAIVPDANGLTATAVTTTVGASGSIIEKYELMQATQGSKKSDGSNAYFKDVINDT 309 (631) Q Consensus 230 ~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 309 (631) +.+.+........... +...... +... .........+....+....................... .... 
T Consensus 218 Aky~g~~~~~i~~~~~---~~~~~~~--v~~~-----~~~v~a~~~d~~~~~~~~~~v~~~~~~g~~~~~~~~~~-~~~~ 286 (587) T protein:vir:95 218 AKLSPFGDKNLESSKL---DKIENAN--IKDK-----AVYVKAVFGDLEKQTAYNGIVSFEQLNAEGEVPSNVEV-EAGE 286 (587) T ss_pred EEEecccCceeEEeec---Ccccccc--eehh-----hhhhhhhhcceeeeeeceeeeeeecccccceeccchhh-hhcc Confidence 1111111111100000 0000000 0000 00000000000000000000000000000000000000 0000 Q ss_pred cceee----eecccccccccccccccccchhhhhhHHHHHhhhhhcccceeEEeccccchHHHHHHHHHhhcc----ceE Q lcl|NC_013693. 310 SNWVY----TFATTLAAGVTELEGGVDDYTGNRVAAIEALNNAEAYDAKPVFAFCEELIEQQTLIDLSTERKD----TVS 381 (631) Q Consensus 310 ~~~~~----~~~~~~~~~~~~l~gg~d~~~~~~~~~~~~l~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~----~~a 381 (631) ..... ..........+.|.||.|+.... .....|..++..+...++++..+.++|.++.+||+++++ +++ T Consensus 287 ~~a~~~~~~~~~~~a~~~~t~LtGG~dG~~~~--~y~~~l~ale~~~~~~i~~~t~d~~v~a~l~a~vk~~~~~g~~~~a 364 (587) T protein:vir:95 287 ESATVTATSPIKTIEPFELTKLKGGTNGEPPA--TWADKLDKFAHEGGYYIVPLSSKQSVHAEVASFVKERSDAGEPMRA 364 (587) T ss_pred cchheeccccccceeccceeeeecCCCCCCcc--cHHHHHHHHHhCCcEEEEecCCCHHHHHHHHHHHHHHHhCCCcEEE Confidence 00000 00111112234688998875431 223445556666655555555667888999999988765 778 Q ss_pred EeecccccccccccCCHHHHHHHHHhcCCCcceEEEecCeeEEEeccCCceeEeeh---HHHHHHHHHHhhccCCceecc Q lcl|NC_013693. 382 FVSPLRDVVVGNRGREMEDVVAWRESLVRDSSYFFMDDNWAYVYDKYNDKMRWIPA---CGGTAGVWARSIEIAGIYKSP 458 (631) Q Consensus 382 ~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~---s~~~ag~~a~~d~~~g~~~sp 458 (631) +++.+ .+.++++++..+..+ +++++++++|+..+. 
..++....+|+ ++++||++|..| +++|| T Consensus 365 Vvg~~-------~~~~~~~~~~~a~~~--n~ervi~v~~~~~~~-~~dg~~~~~~~~~~aa~vAGl~Ag~~----~~~Sl 430 (587) T protein:vir:95 365 IVGGG-------FNESKEQLFGRQESL--SNPRVSLVANSGTFV-MDDGRKNHVPAYMVAVALGGLASGLE----IGESI 430 (587) T ss_pred EEcCC-------CCCCHHHHHHHHhhc--CCCcEEEecccceEe-cCCCceeeechHHHHHHHHHHHhcCc----hhcCc Confidence 87644 457888898888876 588999999876543 23466677777 688999999888 77899 Q ss_pred cceeeceeeecccceecCChhHhhhhhhcCceEEEEEcCCcEEE----EcceecC-CCChhhceehhhHHHHHHHHHHHH Q lcl|NC_013693. 459 AFHNRGKYNNYNRMAWSASSDERAVLYRNQINSIVTFSNEGIVL----YGDKTGL-TRPSAFDRINVRGLFIMAEQNIAA 533 (631) Q Consensus 459 an~~~~~i~g~~~~~~~~~~~~~~~L~~~gin~i~~~~~~G~~~----wg~rT~~-~~~~~~~~i~vrR~~~~i~~~~~~ 533 (631) .|+++. ..++...+++.|++.|+++|+++++..++++... .+-.|.. .++..|++++++|++|+|.+.+++ T Consensus 431 T~~~i~----~~~v~~~~t~~e~e~ai~~Gvl~l~~~~~~~~~~vriv~~itT~t~~~d~~~~~i~viRv~D~i~~dir~ 506 (587) T protein:vir:95 431 TFKPLR----VSSLDQIYESIDLDELNENGIISIEFVRNRTNTFFRIVDDVTTFNDKSDPVKAEMAVGEANDFLVSELKV 506 (587) T ss_pred cceeee----cccccccCCHHHHHHHHhCCeEEEEEecCCcceEEEEeecceeccCCCCcchhhhhhhhhHHHHHHHHHH Confidence 999853 3456778999999999999999998877665333 3445543 456889999999999999999998 Q ss_pred HH-HHHhcCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEccCCCCHHHhhCCeEEEEEEEEecCCceEEEEEEEEE Q lcl|NC_013693. 534 IA-KYYLGENNDEFTRSLFSNAVRPYIRQLANMGAIYDGQVKCDADNNTADIIAANQMVAGIWLKPEYSINWVYLDFAAV 612 (631) Q Consensus 534 ~~-~~~v~epn~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~~~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~ 612 (631) .+ +||+++||++..|..|+..+..||.+|++.|+|.+|... +.+-++...+++|++.+.|+.|+|+|.++++.. 
T Consensus 507 ~~~~~~iGk~nn~~~r~~v~~~i~~~L~~l~~~gaI~~~~~~-----dv~v~~~~d~~~v~~~v~Pv~~mekI~vt~~~~ 581 (587) T protein:vir:95 507 QLEDQFIGTRTINTSASIIKDFIQSYLGRKKRDNEIQDFPAE-----DVQVIVEGNEARISMTVYPIRSFKKISVSLVYK 581 (587) T ss_pred HHHhhCCccccchHHHHHHHHHHHHHHHHHHhCCcccCCCcc-----ceEEEecCCEEEEEEEEEEcccceEEEEEEEEe Confidence 86 699999999999999999999999999999999998542 112233455799999999999999999999976 Q ss_pred ecCceeeeeecc Q lcl|NC_013693. 613 RPDMEFSEIETG 624 (631) Q Consensus 613 ~~~~~~~e~~~~ 624 (631) .+.+. + T Consensus 582 ~~~~~------~ 587 (587) T protein:vir:95 582 QQTLQ------A 587 (587) T ss_pred eeeec------C Confidence 54433 2 No 38 >protein:vir:80779 Length: 569 # NCBI annotation: putative tail sheath protein # Family: family:all:2449 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504132;genbank:gi:158079319;genbank:GeneID:5666428 Probab=100.00 E-value=4.2e-67 Score=384.47 Aligned_cols=545 Identities=12% Similarity=0.067 Sum_probs=321.5 Q ss_pred CCCc----chhcCCceEEEEecCCCcee-cccCCceEEEeeeccCcCCCCeEEecCHHHHHHHcCC--CCccchhHHHHH Q lcl|NC_013693. 1 MATQ----SFSVAPSVQWTERDATLQTS-PSVVVQGATVGKFQWGEAELPVLVTGGETGLVKKFFK--PNDATATDFLVI 73 (631) Q Consensus 1 m~~~----~~ylsPGVyveEv~~~~~~~-gv~tsv~afvG~~~~Gp~~~p~~i~s~~~e~~~~fG~--~~~~~~~~~av~ 73 (631) ||-. ..+-.|||||||.+++++++ +++|++++|||.+++||.++|++|++| +||++.||+ +.+...+.|... T Consensus 1 ~~~~~~~~~~~~~Pgv~~~~~~~~~~~~~~~~~~~~~~ig~a~~G~~~~~~~~~~~-~~~~~~f~~g~l~~a~~~a~~~~ 79 (569) T protein:vir:80 1 MAVEQFPRKKVSRPHTEITVDTSGIGGSSSSSDKTLMLVGSAKGGKPDTVYRFRNY-QQAKQVLRSGDLLDAIELAWNAS 79 (569) T ss_pred CeeeeecCCccccCceEEEEecCCCcCCCCCCceeEEEEEEeCCCCCceeEEecCH-HHHHHHhcCCchhHHHHhhccCc Confidence 5532 23678999999999999887 999999999999999999999999887 699999966 344455666666 Q ss_pred HHHHhCCceEEEEEecccCCCcccccccchhhhccccccccccccceeeehhhhhhhhhhchhhhhccCcccceeeccce Q lcl|NC_013693. 
74 ADFLSYSSVAWVTRVVGPAARNAVTKGQTAILIRNKLDFETASPSASITWTGRYAGSLGNDVAINVCDAAGFPTWEFRNN 153 (631) Q Consensus 74 ~fF~ngG~~~~vvRv~~~~a~~a~~~~~~~~~~~~~~~~~~~~~~~~l~~~a~~~G~~gn~l~v~v~~~~~~~~~~~~~~ 153 (631) .||.|||++||++|+.+..+..+. ...+++++...|.|+|.+++.+.+......... . T Consensus 80 ~~~~~~~~~~~~~rv~~a~~a~~~--------------------~~~~~~~a~~~g~~~n~i~v~l~~~~~~~~~~~--~ 137 (569) T protein:vir:80 80 DVNTASAGDILAVRVEDAKNATLT--------------------KGGLTFASTIYGVDANEIQVALEDNNLTHTKRL--T 137 (569) T ss_pred cccccCceEEEEEEcCCCeeeeee--------------------ccceeeeeeeccCCCceEEEEEecCcCCcceee--E Confidence 677899999999999443321111 124789999999999999998876432221111 1 Q ss_pred eeeecccccceEEeeeeeeeeecccccccceeeeeecccccccceeEeeccccccccccccccccccccccccccccccc Q lcl|NC_013693. 154 FAYAPQAGEYHIVIVDKVGRITDSSGAVGQVDRISVSGTATGAGSISVAGEDVAYTDTDTPATLATKIGTALTALTDVYS 233 (631) Q Consensus 154 ~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 233 (631) +......+.......+. +......+...... ..+........+.......... ............ T Consensus 138 v~~~~~~~~~~~~~ig~-------------v~si~ytg~~~~a~-~~~~~~~~~~~a~~l~~~~g~~-~~~~~~v~~~~~ 202 (569) T protein:vir:80 138 VAFSKDGYKKVFDNLGK-------------IFSIQYKGSEAQAN-FTIAQDSISKKATTLTLNVGSE-PESTTEVMKYEL 202 (569) T ss_pred EeeecCCCccccccccc-------------eeeEEEeeccccce-EEeecCcCcceeEEEEEEecCC-cceeEEEEeecc Confidence 11000101000000000 00001111100000 0000000000000000000000 000000000000 Q ss_pred ccccccccccccccccccccccceeecccccccceeeeecccccccceeeeeeeeecccccccchhhhhhhhhcccccee Q lcl|NC_013693. 234 SVVVKSNTVTVTHKAIGPQTVTAIVPDANGLTATAVTTTVGASGSIIEKYELMQATQGSKKSDGSNAYFKDVINDTSNWV 313 (631) Q Consensus 234 ~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 313 (631) ..........+..+-.........+...... ..........+. 
................+...+ +...++ T Consensus 203 ~~~~~~~~~~lv~~~~~~~~f~a~~~~~~~~---~~~~~~~d~~~~------~~~~t~~~~~~~~~~di~~~~-~~~~~v 272 (569) T protein:vir:80 203 GQGVYSETNVLVSAINSLPDWEAKFFPIGDK---NLPTDALEAVTK------VDVKTEAVFVGALAGDIAKQL-EYNDYV 272 (569) T ss_pred CCccchhhhhhhhhcCCccCceEEEEecCCC---cceehhccchhh------eeccccceeeehhHHHHHHhh-cCCceE Confidence 0000000000000000000000000000000 000000000000 000000000000000011111 111122 Q ss_pred eeec----ccccccccccccccccchhhhhhHHHHHhhhhhcccceeEEeccccchHHHHHHHHHhhcc----ceEEeec Q lcl|NC_013693. 314 YTFA----TTLAAGVTELEGGVDDYTGNRVAAIEALNNAEAYDAKPVFAFCEELIEQQTLIDLSTERKD----TVSFVSP 385 (631) Q Consensus 314 ~~~~----~~~~~~~~~l~gg~d~~~~~~~~~~~~l~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~----~~a~~d~ 385 (631) .... .........|.||.|+... ......|..++..+...+++...+.++|.++.+||+++++ ++++++. T Consensus 273 ~~~~~~~~~l~~~~~~~LtGG~dG~~~--~~~~~~l~~le~~~~~~i~~~t~d~av~~~l~a~vkr~r~~g~~~~aVvg~ 350 (569) T protein:vir:80 273 TVAVDATKPVEDFELTNLTGGSDGTAP--ESWANKFPLLANEGGYYLVPLTDKQAVHSEALAFVKDRTDNGDPMRIIVGG 350 (569) T ss_pred EEEecCCcceeeecceeecCCCCCCcc--chHHHHHHHHhhCCcEEEEecCCChHHHHHHHHHHHHHHhCCCcEEEEecC Confidence 2111 1112233568899887543 1233445556666555555556667889999999998865 7888875 Q ss_pred ccccccccccCCHHHHHHHHHhcCCCcceEEEecCeeEEEeccCCceeEeeh---HHHHHHHHHHhhccCCceeccccee Q lcl|NC_013693. 386 LRDVVVGNRGREMEDVVAWRESLVRDSSYFFMDDNWAYVYDKYNDKMRWIPA---CGGTAGVWARSIEIAGIYKSPAFHN 462 (631) Q Consensus 386 ~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~---s~~~ag~~a~~d~~~g~~~span~~ 462 (631) + .+.+++++.++++.+ +++++++++||..+++. .+..+.+|+ ++++||++|..+ +++||.|+. 
T Consensus 351 ~-------~~~~~~~~~~~a~~~--n~e~vv~v~~~~~~~~~-~g~~~~~~~~~~aa~vAG~~A~~~----~~~S~T~k~ 416 (569) T protein:vir:80 351 G-------TNETVEESITRATNL--RDPRASLVGFSGTRKMD-DGRLLKLPGYMMASQIAGIASGLE----VGEAITFKH 416 (569) T ss_pred C-------CCCCHHHHHHHHhhc--CCCeEEEEecCceeecC-CCcceeechhhHHHHHHHHHhcCc----cccCcccee Confidence 4 457788999888866 58899999999988874 345555665 678888888776 889999998 Q ss_pred eceeeecccceecCChhHhhhhhhcCceEEEEEcCCcEEEEcc----eec-CCCChhhceehhhHHHHHHHHHHHHHH-H Q lcl|NC_013693. 463 RGKYNNYNRMAWSASSDERAVLYRNQINSIVTFSNEGIVLYGD----KTG-LTRPSAFDRINVRGLFIMAEQNIAAIA-K 536 (631) Q Consensus 463 ~~~i~g~~~~~~~~~~~~~~~L~~~gin~i~~~~~~G~~~wg~----rT~-~~~~~~~~~i~vrR~~~~i~~~~~~~~-~ 536 (631) + . ..++...++++|++.|+++|++++++.++++.++|.. .|. ..+++.|++++++|++|+|++.|++.+ + T Consensus 417 i---~-~~~i~~~lt~~e~~~li~~G~~~l~~~~~~~~~v~~~vn~itT~t~~~~~~~~~i~viRv~D~i~~dir~~~~~ 492 (569) T protein:vir:80 417 F---N-VTSVDRVFESSQLDMLNESGVISIEFVRNRTLTAFRVVQDVTTYNDKSDPVKNEMSVGEANDFLVSELKIELDN 492 (569) T ss_pred e---c-cccccccCCHHHHHHHHhCCeEEEEEecCceEEEEEEeccceecCCCCCchhhhhhhhHHHHHHHHHHHHHHHh Confidence 5 3 3466778999999999999999999988877777743 222 245688999999999999999999876 6 Q ss_pred HHhcCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEccCCCCHHHhhCCeEEEEEEEEecCCceEEEEEEEEEecCc Q lcl|NC_013693. 537 YYLGENNDEFTRSLFSNAVRPYIRQLANMGAIYDGQVKCDADNNTADIIAANQMVAGIWLKPEYSINWVYLDFAAVRPDM 616 (631) Q Consensus 537 ~~v~epn~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~~~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~ 616 (631) ||+++||+...|..|+..++.||.+|+++|+|.+|... +-+.++..++++|++.++|+.|+|||++|++...+.+ T Consensus 493 ~yiGk~nn~~~r~~v~~~i~~~L~~l~~~gaI~~~~~~-----dv~v~~~~d~~~v~~~v~Pv~~~ekI~~ti~~~~~~~ 567 (569) T protein:vir:80 493 NFIGTKVIDTSASLIKNFIQSFLDNKKRAREIQDYTPE-----EVQVVLEGDVASISMTVMPIRSLNKITVQLVYKQQIL 567 (569) T ss_pred hcCcccCChhHHHHHHHHHHHHHHHHHhCCcccCCCcc-----ceEEEecCCEEEEEEEEEEcccccEEEEEEEEeeeee Confidence 99999999999999999999999999999999998531 1223456679999999999999999999999765443 Q ss_pred eeeeeecc Q lcl|NC_013693. 
617 EFSEIETG 624 (631) Q Consensus 617 ~~~e~~~~ 624 (631) + + T Consensus 568 ~------~ 569 (569) T protein:vir:80 568 T------A 569 (569) T ss_pred c------C Confidence 3 2 No 39 >protein:vir:99306 Length: 587 # NCBI annotation: putative major tail sheath protein # Family: family:all:2449 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024479;genbank:gi:48696438;genbank:GeneID:2948034 Probab=100.00 E-value=2.1e-65 Score=375.13 Aligned_cols=553 Identities=12% Similarity=0.043 Sum_probs=316.1 Q ss_pred CCCc----chhcCCceEEEEecCCCcee-cccCCceEEEeeeccCcCCCCeEEecCHHHHHHHcCCCCccchhHHHHHHH Q lcl|NC_013693. 1 MATQ----SFSVAPSVQWTERDATLQTS-PSVVVQGATVGKFQWGEAELPVLVTGGETGLVKKFFKPNDATATDFLVIAD 75 (631) Q Consensus 1 m~~~----~~ylsPGVyveEv~~~~~~~-gv~tsv~afvG~~~~Gp~~~p~~i~s~~~e~~~~fG~~~~~~~~~~av~~f 75 (631) ||-. ..|..|||||||.+++.+++ +++|++++|||.++|||+++|++++++ +||++.||+.+ |..++.++ T Consensus 1 ~a~~~~~~~~~~~pgv~~~~~~~~~~~~~~~~~~~~~~~g~a~~G~~~~~~~~~~~-~~~~~~~~~g~----l~~~~~~a 75 (587) T protein:vir:99 1 MAVEPFPRRPITRPHASIEVDTSGIGGSAGSSEKVFCLIGQAEGGEPNTVYELRNY-SQAKRLFRSGE----LLDAIELA 75 (587) T ss_pred CcccccCCcccccCceEEEEecCCccccCCCCCceEEEEEEecCCccceeEEeccH-HHHHHHhcCcc----hHHHHHHH Confidence 7643 25889999999999999877 999999999999999999999999886 79999998844 55555555 Q ss_pred H----HhCCceEEEEEecccCCCcccccccchhhhccccccccccccceeeehhhhhhhhhhchhhhhccCcccceeecc Q lcl|NC_013693. 76 F----LSYSSVAWVTRVVGPAARNAVTKGQTAILIRNKLDFETASPSASITWTGRYAGSLGNDVAINVCDAAGFPTWEFR 151 (631) Q Consensus 76 F----~ngG~~~~vvRv~~~~a~~a~~~~~~~~~~~~~~~~~~~~~~~~l~~~a~~~G~~gn~l~v~v~~~~~~~~~~~~ 151 (631) | .|||++||++||.+..+ |+. ....++++++.||.|||.+++.+.+......... 
T Consensus 76 ~~~~~~~g~~~~~~~rv~~~~~--a~~------------------~~~~l~~~a~~~G~~gN~i~v~~~~~~~~~~~~~- 134 (587) T protein:vir:99 76 WGSNPNYTAGRILAMRIEDAKP--ASA------------------EIGGLKITSKIYGNVANNIQVGLEKNTLSDSLRL- 134 (587) T ss_pred hccccCCCceEEEEEEcCCCce--eEE------------------EecCeEEEEeeccccccceEEEEccCCCCcceeE- Confidence 5 79999999999954432 221 1124899999999999999998876543221111 Q ss_pred ceeeeecccccceEEeeeeeeeeecccccccceeeeeecccccccceeEeecccc--ccccc-ccc---ccccccccccc Q lcl|NC_013693. 152 NNFAYAPQAGEYHIVIVDKVGRITDSSGAVGQVDRISVSGTATGAGSISVAGEDV--AYTDT-DTP---ATLATKIGTAL 225 (631) Q Consensus 152 ~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~-~~~---~~~~~~~~~~~ 225 (631) .+..........-...+....+............+................... ..... ... ........... T Consensus 135 -~~~~~~~~~~~~~~~~g~v~~i~y~g~~~~a~~~v~~~~~t~~a~~~~l~~g~~~v~~yrL~~g~~~~~~~~~~~i~~~ 213 (587) T protein:vir:99 135 -RVIFQDDRFNEVYDNIGNIFTIKYKGEEANATFSVEHDEETQKASRLVLKVGDQEVKSYDLTGGAYDYTNAIITDINQL 213 (587) T ss_pred -EEEEecccceeeeeeccceeeEEeecccccceeeEeecCcceeeeeeeeecCCceeEEEEecCCchHHHHHHHhhhccc Confidence 111111000000000000000110000000000000000000000000000000 00000 000 00000000000 Q ss_pred ccccccccccccccccccccccccccccccceeecccccccceeeeeccc---ccccceeeeeeeeecccccccchhhhh Q lcl|NC_013693. 226 TALTDVYSSVVVKSNTVTVTHKAIGPQTVTAIVPDANGLTATAVTTTVGA---SGSIIEKYELMQATQGSKKSDGSNAYF 302 (631) Q Consensus 226 ~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~ 302 (631) ...++.+.+.......... ..... ..........+.. +....+..................... 
T Consensus 214 ~~~tAky~~~~~~~i~~~~---~~~~~----------~~~v~~~~~~v~a~~~D~~~~~~~~~~~~~~~~~g~~~~~~~~ 280 (587) T protein:vir:99 214 PDFEAKLSPFGDKNLESSK---LDKIE----------NANIKDKAVYVKAVFGDLEKQTAYNGIVSFEQLNAEGEVPSNV 280 (587) T ss_pred cceeEEeeccCCceeEeec---ccccc----------cceeeeeeeeeehhccceeeecccceeeeeeecccccchhhhh Confidence 0011111111110000000 00000 0000000000000 000000000000000000000000000 Q ss_pred hhhhccccceeee---ecccccccccccccccccchhhhhhHHHHHhhhhhcccceeEEeccccchHHHHHHHHHhhcc- Q lcl|NC_013693. 303 KDVINDTSNWVYT---FATTLAAGVTELEGGVDDYTGNRVAAIEALNNAEAYDAKPVFAFCEELIEQQTLIDLSTERKD- 378 (631) Q Consensus 303 ~~~~~~~~~~~~~---~~~~~~~~~~~l~gg~d~~~~~~~~~~~~l~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~- 378 (631) ............. .........+.|.||.|+.... .....|..++..+...++++..+.++|.++.+||+++++ T Consensus 281 ~~~~~~~~~~~~~~~~~~~~a~~~~t~LtGG~dG~~~~--sy~~al~ale~~~~~~i~~~t~d~~i~a~l~a~vk~~r~~ 358 (587) T protein:vir:99 281 EVEAGEESATVTATSPIKTIEPFELTKLKGGTNGEPPA--TWADKLDKFAHEGGYYIVPLSSKQSVHAEVASFVKERSDA 358 (587) T ss_pred hhhhccccceeeeeccccceecccceeeecCCCCCccc--cHHHHHHHHhhCCcEEEEecCCCHHHHHHHHHHHHHHHhC Confidence 0000000000000 0111122234688998875431 223445556666655555555667888999999988765 Q ss_pred ---ceEEeecccccccccccCCHHHHHHHHHhcCCCcceEEEecCeeEEEeccCCceeEeeh---HHHHHHHHHHhhccC Q lcl|NC_013693. 379 ---TVSFVSPLRDVVVGNRGREMEDVVAWRESLVRDSSYFFMDDNWAYVYDKYNDKMRWIPA---CGGTAGVWARSIEIA 452 (631) Q Consensus 379 ---~~a~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~---s~~~ag~~a~~d~~~ 452 (631) ++++++.+ .+.+++++...+..+ ++.++++++++..+. 
..++....+|+ ++++||++|..| T Consensus 359 g~~~~aVlg~~-------~~~~~~~~~~~a~~~--n~e~vi~v~~~~~~~-~~dg~~~~~~~~~~aa~vAGl~Ag~~--- 425 (587) T protein:vir:99 359 GEPMRAIVGGG-------FNESKEQLFGRQASL--SNPRVSLVANSGTFV-MDDGRKNHVPAYMVAVALGGLASGLE--- 425 (587) T ss_pred CCcEEEEecCC-------CCCCHHHHHHHhhhc--CCCcEEEEeccceEe-cCCCceeeechHHHHHHHHHHHhcCc--- Confidence 78887654 356888899888766 588999999876543 23456667776 688999999887 Q ss_pred CceecccceeeceeeecccceecCChhHhhhhhhcCceEEEEEcCCc---EEE-EcceecC-CCChhhceehhhHHHHHH Q lcl|NC_013693. 453 GIYKSPAFHNRGKYNNYNRMAWSASSDERAVLYRNQINSIVTFSNEG---IVL-YGDKTGL-TRPSAFDRINVRGLFIMA 527 (631) Q Consensus 453 g~~~span~~~~~i~g~~~~~~~~~~~~~~~L~~~gin~i~~~~~~G---~~~-wg~rT~~-~~~~~~~~i~vrR~~~~i 527 (631) +++||.|+++. ..++...+++.|++.|+++|+++++..++++ +++ .+-.|.. .++..|++++++|++|+| T Consensus 426 -~~~SlT~~~i~----~~~v~~~~t~~e~e~li~~Gvl~l~~~~~~~~~~vriv~~ItT~t~~~~~~~~~i~viRv~D~i 500 (587) T protein:vir:99 426 -IGESITFKPLR----VSSLDQIYESIDLDELNENGIISIEFVRNRTNTFFRIVDDVTTFNDKSDPVKAEMAVGEANDFL 500 (587) T ss_pred -hhcCccceeee----cccccccCCHHHHHHHHhCCeEEEEEecCCcceEEEEeeceeeccCCCCchhhhhhhhhhHHHH Confidence 78899999853 3466778999999999999999998877664 333 3445543 456889999999999999 Q ss_pred HHHHHHHH-HHHhcCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEccCCCCHHHhhCCeEEEEEEEEecCCceEEE Q lcl|NC_013693. 528 EQNIAAIA-KYYLGENNDEFTRSLFSNAVRPYIRQLANMGAIYDGQVKCDADNNTADIIAANQMVAGIWLKPEYSINWVY 606 (631) Q Consensus 528 ~~~~~~~~-~~~v~epn~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~~~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~ 606 (631) ++.+++.+ ++|+++||++..|..|+..+..||.+|++.|+|.+|... +.+-+....+++|++.+.|+.|+|+|. 
T Consensus 501 ~~di~~~~~~~yiGk~Nn~~~r~~i~~~i~~~L~~l~~~gaI~~~~~~-----dv~v~~~~d~~~v~~~v~Pv~~mekIy 575 (587) T protein:vir:99 501 VSELKVQLEDQFIGTRTINTSASIIKDFIQSYLGRKKRDNEIQDFPAE-----DVQVIVEGNEARISMTVYPIRSFKKIS 575 (587) T ss_pred HHHHHHHHHhhCCccccchHHHHHHHHHHHHHHHHHHhCCcccCCCcc-----ceEEEecCCEEEEEEEEEEcccceEEE Confidence 99999886 699999999999999999999999999999999998642 011122344799999999999999999 Q ss_pred EEEEEEecCceeeeeecc Q lcl|NC_013693. 607 LDFAAVRPDMEFSEIETG 624 (631) Q Consensus 607 ~~~~~~~~~~~~~e~~~~ 624 (631) ++++...+.+. + T Consensus 576 ~tv~~~~~~~~------~ 587 (587) T protein:vir:99 576 VSLVYKQQTLQ------A 587 (587) T ss_pred EEEEEEeeeec------C Confidence 99987654332 2 No 40 >protein:vir:96586 Length: 587 # NCBI annotation: ORF011 # Family: family:all:2449 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238553;genbank:gi:66391266;genbank:GeneID:5130368 Probab=100.00 E-value=1.5e-62 Score=359.46 Aligned_cols=550 Identities=13% Similarity=0.070 Sum_probs=318.5 Q ss_pred CCCc----chhcCCceEEEEecCCCcee-cccCCceEEEeeeccCcCCCCeEEecCHHHHHHHcCCCCccchhHHHHHHH Q lcl|NC_013693. 1 MATQ----SFSVAPSVQWTERDATLQTS-PSVVVQGATVGKFQWGEAELPVLVTGGETGLVKKFFKPNDATATDFLVIAD 75 (631) Q Consensus 1 m~~~----~~ylsPGVyveEv~~~~~~~-gv~tsv~afvG~~~~Gp~~~p~~i~s~~~e~~~~fG~~~~~~~~~~av~~f 75 (631) ||-. ..|.+||||||+.+++..++ ++++++.+|||.+++||+++|++|+++ +||++.||+.+ |..|+.++ T Consensus 1 ~~~~~~~~~~~~~Pgv~~~~~~~~~~~~~~~~~~~~~~iG~a~~g~~~~~~~~~~~-~~~~~~~g~G~----l~~ai~~a 75 (587) T protein:vir:96 1 MAKDIFPRRPIQRPHASIEVDSSGIGGSASNSEKILCLIGKAEGGEPNTVYQVRNY-AQAKSVFRSGE----LLDAIELA 75 (587) T ss_pred CeeeeeCCCcccCCceEEEEecCCccCCCCCCCceEEEEEEecCCCCceeEEEcCh-HHHHHhhcCCc----HHHHHHHH Confidence 6643 36999999999999999877 899999999999999999999999886 79999999854 66666666 Q ss_pred H----HhCCceEEEEEecccCCCcccccccchhhhccccccccccccceeeehhhhhhhhhhchhhhhccCcccceeecc Q lcl|NC_013693. 
76 F----LSYSSVAWVTRVVGPAARNAVTKGQTAILIRNKLDFETASPSASITWTGRYAGSLGNDVAINVCDAAGFPTWEFR 151 (631) Q Consensus 76 F----~ngG~~~~vvRv~~~~a~~a~~~~~~~~~~~~~~~~~~~~~~~~l~~~a~~~G~~gn~l~v~v~~~~~~~~~~~~ 151 (631) | .|||++||++||.++.+..+. ...+++++..+|+|||.+++++.+......-.. T Consensus 76 ~~~~~~~g~~~~~a~rv~~~~~a~~~--------------------~~~~~~~~~~~g~~~n~i~v~~~~~~~~~~~~~- 134 (587) T protein:vir:96 76 WGSNPQYTAGKILAMRVEDAKASQLE--------------------KGGLRVTSKIFGSVSNDIQVALEKNTITDSLRL- 134 (587) T ss_pred hccCcCCCceEEEEEecCCCccceee--------------------cccccccccccCCCCceEEEEEEeccCCCccce- Confidence 6 799999999999554332111 234677888999999999999976533222111 Q ss_pred ceeeeecccccceEEeeee--eeeeecccccccceeeeeecccccccceeEeecc-c-cc-ccccccc--c-cccccccc Q lcl|NC_013693. 152 NNFAYAPQAGEYHIVIVDK--VGRITDSSGAVGQVDRISVSGTATGAGSISVAGE-D-VA-YTDTDTP--A-TLATKIGT 223 (631) Q Consensus 152 ~~~~~~~~~g~~~~~~~~~--~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~-~~-~~~~~~~--~-~~~~~~~~ 223 (631) ... ...+.+...+..+ ...+............+........+........ . .. +...... . ........ T Consensus 135 ~~~---~~~~~~~~~~~n~G~v~~i~y~g~~~~a~~~~~~~~~~~~A~~l~l~gg~~~v~~yrl~~g~~~~~~~~~~~~~ 211 (587) T protein:vir:96 135 RVV---FQKDNYQEVFDNLGNIFSINYKGEGEKATFSVEKDKETQEAKRLVLKVDEKEVKAYELNGGAYSFTNEIITDIN 211 (587) T ss_pred EEE---EecCCceeeccccCceEEEEecccccceeEeeccCcccceeeeeEEEecCceEEEEEeCCCchhhhhhhhhhhc Confidence 100 0111111111111 0000000000000000000000000000000000 0 00 0000000 0 00000000 Q ss_pred ccccccccccccccccccccccccccccccccceeecccccccceeeeecccc-cccceee--eeeeeecccccccchhh Q lcl|NC_013693. 224 ALTALTDVYSSVVVKSNTVTVTHKAIGPQTVTAIVPDANGLTATAVTTTVGAS-GSIIEKY--ELMQATQGSKKSDGSNA 300 (631) Q Consensus 224 ~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~-~~~~~~~--~~~~~~~~~~~~~~~~~ 300 (631) .....++.+.+.......+.+.......+. .........- +.+.... ................. 
T Consensus 212 ~~~~~tAky~g~~~n~~~v~v~d~~~~~~~-------------k~~~~y~~t~~~di~~~~~~~~~~~~~~~~~~~~~~~ 278 (587) T protein:vir:96 212 ELPDFEAKLSPFGDKNLESRKLDEATDVDI-------------KGKAVYVKAVFGDIENQTQYNQYVKFEQLPEQASEPS 278 (587) T ss_pred cccceEEEeecccCceeEEEeecccccccc-------------ceEEEeehhhhhhhhhhhccccceeeccccchhhhhh Confidence 000011112211111111111000000000 0000000000 0000000 00000000000000000 Q ss_pred hhhhhhccccceeeee----cccccccccccccccccchhhhhhHHHHHhhhhhcccceeEEeccccchHHHHHHHHHhh Q lcl|NC_013693. 301 YFKDVINDTSNWVYTF----ATTLAAGVTELEGGVDDYTGNRVAAIEALNNAEAYDAKPVFAFCEELIEQQTLIDLSTER 376 (631) Q Consensus 301 ~~~~~~~~~~~~~~~~----~~~~~~~~~~l~gg~d~~~~~~~~~~~~l~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~ 376 (631) . .............. ..........|.||.|+..+. ...+.|..++..+...+++...+.++|..+.+||+++ T Consensus 279 ~-~~v~~~~~~~~~~~~~~~~~~~~~~~~aLtGG~dG~~~~--~y~~~l~ale~~~~~~i~~~t~d~ai~~~l~a~vk~~ 355 (587) T protein:vir:96 279 D-VEVHAETESATVTATSKPKAIEPFELTKLSGGTNGEPPT--SWSAKLEKFKNEGGYYIVPLTDRQSVHSEVATFVKNR 355 (587) T ss_pred c-ccccccccceeeeecccccccccccceeeecCCCCCCcc--cHHHHHHHHhhCCcEEEEecCCCHHHHHHHHHHHHHH Confidence 0 00000000000000 011112234588888875532 2234455556666555555555667889999999887 Q ss_pred cc----ceEEeecccccccccccCCHHHHHHHHHhcCCCcceEEEecCeeEEEeccCCceeEeeh---HHHHHHHHHHhh Q lcl|NC_013693. 377 KD----TVSFVSPLRDVVVGNRGREMEDVVAWRESLVRDSSYFFMDDNWAYVYDKYNDKMRWIPA---CGGTAGVWARSI 449 (631) Q Consensus 377 ~~----~~a~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~---s~~~ag~~a~~d 449 (631) ++ ++++++.+ .+.+++++.+.+..+ +++++++++++..+.+.. 
+.....|+ ++++||++|..+ T Consensus 356 r~~gk~~~aVlg~~-------~~~~~~~~~~~a~~~--n~e~vi~v~~~~~~~~~~-~~~~~~~~~~~aa~vAG~~Ag~~ 425 (587) T protein:vir:96 356 SDAGEPMRAIVGGG-------TSETKEKLFGRQAIL--NNPRVALVANSGKFVMGN-GRILQAPAYMVASAVAGLVSGLD 425 (587) T ss_pred HhCCCeEEEEecCC-------CCCCHHHHHHHHhhc--CCCcEEEEecceEEecCC-CceeeechhhHHHHHHHHHhcCc Confidence 65 77887644 457888888888877 578999999988877653 44444543 678999999887 Q ss_pred ccCCceecccceeeceeeecccceecCChhHhhhhhhcCceEEEEEcCCcEEEEcc-eecC----CCChhhceehhhHHH Q lcl|NC_013693. 450 EIAGIYKSPAFHNRGKYNNYNRMAWSASSDERAVLYRNQINSIVTFSNEGIVLYGD-KTGL----TRPSAFDRINVRGLF 524 (631) Q Consensus 450 ~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~L~~~gin~i~~~~~~G~~~wg~-rT~~----~~~~~~~~i~vrR~~ 524 (631) +++||.|+++ .+ .++...+++.|++.|+++|+.+++..++++.++|.. +++. .++..|++|+++|++ T Consensus 426 ----~~~S~T~~~~---~~-~~v~~~~t~~e~~~~i~~G~~~l~~~~~~~~~v~~~vnsitT~t~~~~~~~~~i~virv~ 497 (587) T protein:vir:96 426 ----IGESITFKPL---FV-NSLDKVYESEELDELNENGIITIEFVRNRMTTMFRIVDDVTTFPDKNDPVKSEMALGEAN 497 (587) T ss_pred ----cccCccceee---ec-ccccccCCHHHHHHHHhCCeEEEEEecCCcEEEEEeeccceecCCCCCchhhhhhhHHHH Confidence 7889999985 33 356778999999999999999999888877777743 3332 346789999999999 Q ss_pred HHHHHHHHHHH-HHHhcCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEccCCCCHHHhhCCeEEEEEEEEecCCce Q lcl|NC_013693. 525 IMAEQNIAAIA-KYYLGENNDEFTRSLFSNAVRPYIRQLANMGAIYDGQVKCDADNNTADIIAANQMVAGIWLKPEYSIN 603 (631) Q Consensus 525 ~~i~~~~~~~~-~~~v~epn~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~~~~~nt~~~i~~G~~~~~i~~~p~~p~e 603 (631) |+|.+.|++.+ ++|+++||+...|..|+..+..||.+|++.|+|.+|+.. 
+-+-++...+++|++.++|+.|+| T Consensus 498 D~i~~di~~~~~~~yiGk~nn~~~r~~v~~~i~~~L~~l~~~g~I~~~~~~-----dv~v~~~~D~~~v~~~v~Pv~~me 572 (587) T protein:vir:96 498 DFLVSELKILLEEQYIGTRTINTSASQIKDFVQSYLGRKKRDNEIQDFPPE-----DVQVIIEGNEARISLTIFPIRALK 572 (587) T ss_pred HHHHHHHHHHHHhcCCccccCHHHHHHHHHHHHHHHHHHHhCCcccCCCcc-----ceEEEecCCEEEEEEEEEEcccce Confidence 99999999987 699999999999999999999999999999999998541 111123344799999999999999 Q ss_pred EEEEEEEEEecCceeeeeecc Q lcl|NC_013693. 604 WVYLDFAAVRPDMEFSEIETG 624 (631) Q Consensus 604 ~i~~~~~~~~~~~~~~e~~~~ 624 (631) ||.++++...+.++ + T Consensus 573 kIy~tv~~~~~~~~------~ 587 (587) T protein:vir:96 573 KISVSLVYRQQTLQ------A 587 (587) T ss_pred EEEEEEEEEeeeec------C Confidence 99999986544332 2 No 41 >protein:vir:79798 Length: 717 # NCBI annotation: tail sheath subunit # Family: family:all:12069 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429630;genbank:gi:156564120;genbank:GeneID:5525563 Probab=100.00 E-value=1.4e-60 Score=348.72 Aligned_cols=589 Identities=13% Similarity=0.073 Sum_probs=288.2 Q ss_pred CCCcchhcC-CceEEEEec----CCCceecccCCceEEEeeeccCcCCCCeEEecCHHHHHHHcCCCCcc------chhH Q lcl|NC_013693. 1 MATQSFSVA-PSVQWTERD----ATLQTSPSVVVQGATVGKFQWGEAELPVLVTGGETGLVKKFFKPNDA------TATD 69 (631) Q Consensus 1 m~~~~~yls-PGVyveEv~----~~~~~~gv~tsv~afvG~~~~Gp~~~p~~i~s~~~e~~~~fG~~~~~------~~~~ 69 (631) |+---+|.. ||+-+.-.| .+..+..-.|--..+.|.+--||+.+||+++ -. -.+..||+---. ..|- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~-~~~~~~~~~~~~~~~~~~~~~~ 78 (717) T protein:vir:79 1 MAGFDQYQAIPGHNARFKDGNLNLKSDPNPRETESVVLLGTATDGPVMQPVRVT-PE-TAYNIFGKVAHENGVYNGATLL 78 (717) T ss_pred CCchhhhhcCCCceeeeecCceecCCCCCccccceEEEEeeccCCcccCceeeC-hh-HHHhhhhhhhhhcccccchhhh Confidence 887777764 999887544 3445556778888899999999999999995 34 477899974221 1233 Q ss_pred HHHHHHHHhCCceEEEEEecccCCCcccccc--cchhhhccccccccccccceeeehhhhh--hhhh------------- Q lcl|NC_013693. 
70 FLVIADFLSYSSVAWVTRVVGPAARNAVTKG--QTAILIRNKLDFETASPSASITWTGRYA--GSLG------------- 132 (631) Q Consensus 70 ~av~~fF~ngG~~~~vvRv~~~~a~~a~~~~--~~~~~~~~~~~~~~~~~~~~l~~~a~~~--G~~g------------- 132 (631) .+......-|-.+....|+....+-+.-... ......-....+........+.++-+-| |-.. T Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (717) T protein:vir:79 79 PKFEELWAAGNRDIRLMRTTGVNAVSSLLGTSYSKNSKEVAEDKLGGAQARGNVAATFTLPNGGIVEATFLLKARGVIIP 158 (717) T ss_pred HHHHHHHhcCCcceEEEEecchhHHHHHhhcccccchhhHHHHhhcccccccceEEEEEcCCCceeeeeeeeeecceEeC Confidence 3444455678889999998754332211000 0000000111111111111122222111 1000 Q ss_pred -hchhhhhc-----cCcccceeeccceeeeeccccc-ceEEeeeeeeeeecccccccceeee--eecccccccceeEeec Q lcl|NC_013693. 133 -NDVAINVC-----DAAGFPTWEFRNNFAYAPQAGE-YHIVIVDKVGRITDSSGAVGQVDRI--SVSGTATGAGSISVAG 203 (631) Q Consensus 133 -n~l~v~v~-----~~~~~~~~~~~~~~~~~~~~g~-~~~~~~~~~~~v~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~ 203 (631) |...+.+- .++..++ ..++.......+ ..+...+......... ++.... ..+.+..+.-...... T Consensus 159 ~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~ 232 (717) T protein:vir:79 159 PNNYTLDVGTEEDMKAGTQPT---FAQVLLNENVADMESEITVSYEFTYKDAQ---GETKTSEVLDNNTDKDGKPMIAKG 232 (717) T ss_pred CCcceEeccChhhhhcCCCch---hhhhhhccchhhccceeEEEEEEEeeccc---CcchhhhhhcCCCCCCCceeEEec Confidence 00000000 0000000 000000000000 0000000000000000 000000 0000000100111111 Q ss_pred cccccccccccccccccccccccccccccccccccccccccccc-c--------------cccccccceeecccccccce Q lcl|NC_013693. 204 EDVAYTDTDTPATLATKIGTALTALTDVYSSVVVKSNTVTVTHK-A--------------IGPQTVTAIVPDANGLTATA 268 (631) Q Consensus 204 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~-~--------------~~~~~~~~~~~~~~~~~~~~ 268 (631) .+....-.-...........++-...+..-.......+++.... . .-.+++...-+...+...+. 
T Consensus 233 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~n~ 312 (717) T protein:vir:79 233 ADVTIKLEHVALAGLKLYADGIEVVDAKAFTVAGDQLTIHSNSKMKLGASLEAQYAYNLVEVIQPVIELESIFGGGVYND 312 (717) T ss_pred ccceeehhhhhhhhhHHhhcchhhhhhhheeeecceEEEEecCCcccchhhHHHHHhhHHHhhccceEEeecccCceeee Confidence 10000000000000000000000000000000000011111100 0 00012222222222333333 Q ss_pred eeeecccccc-cceeeeeeeeeccccccc----chhhh------hhhhhc----------cccceee-eecccccccccc Q lcl|NC_013693. 269 VTTTVGASGS-IIEKYELMQATQGSKKSD----GSNAY------FKDVIN----------DTSNWVY-TFATTLAAGVTE 326 (631) Q Consensus 269 ~~~~v~~~~~-~~~~~~~~~~~~~~~~~~----~~~~~------~~~~~~----------~~~~~~~-~~~~~~~~~~~~ 326 (631) +.+.+..+.. ..-+++......+..... .+..| +..+.+ +.+.... ........+... T Consensus 313 ~~~~v~~~D~~~~~~~t~~~~~~g~~~~~pl~~ts~dy~~~~~~vdgI~~~~~~~V~~~g~~s~a~a~~~~g~~s~d~a~ 392 (717) T protein:vir:79 313 IMRKVESKDGAVTVTITKPESKRGMISEDPLVFKSGDYTNFKMLVDAINNHPFNNVVRARTKPEFEATFTSTLQAAADAK 392 (717) T ss_pred eeeEEecCCceEEEEEecccccCcceeccccccccCceeeeeeeecccccCchhheeeeecccccceeeeecccCchhhc Confidence 3333333221 111111111111111111 00001 001100 0000000 000111111222 Q ss_pred cccccccchhhhhhH----------------HHHHhhhhhcccceeEEeccc----------cchHHHHHHHHHhhc--- Q lcl|NC_013693. 327 LEGGVDDYTGNRVAA----------------IEALNNAEAYDAKPVFAFCEE----------LIEQQTLIDLSTERK--- 377 (631) Q Consensus 327 l~gg~d~~~~~~~~~----------------~~~l~~~~~~~~~~~i~~~~~----------~~~~~~~~~~~~~~~--- 377 (631) +.||.|......... ...+..++..++..+++ +.. -..+.++.+||+.+. T Consensus 393 f~Gg~dgl~~~~ee~Y~~lGgk~~d~g~lt~~aays~LE~~dVDlVil-~ga~adtt~ga~~d~va~alad~caalSal~ 471 (717) T protein:vir:79 393 FSGGKDELSLDKEEMYKRLGGEKNEEGFVTKQGAYQYLENYEVDYVIP-LGVHADTKLIGKYDDFAYQLALACAVMSHYN 471 (717) T ss_pred cCCCccccccchhhhhccccccccccccccchhhhhhcCcceeEEEEe-cCccccccccchhhhHHHHHHHHHHHhhhcc Confidence 333433322211110 11222223333333222 221 124568889997642 Q ss_pred -cceEEeecccccccccccCCHHHHHHHHHhcC-----------------------C-CcceEEEecCeeEEEeccCCce Q lcl|NC_013693. 
378 -DTVSFVSPLRDVVVGNRGREMEDVVAWRESLV-----------------------R-DSSYFFMDDNWAYVYDKYNDKM 432 (631) Q Consensus 378 -~~~a~~d~~~~~~~~~~~~~~~~~~~~~~~~~-----------------------~-~s~~~~~~~p~~~~~d~~~~~~ 432 (631) .++.+++... +.....+...+|+..+. . -+.|..+++++..++.+..+.. T Consensus 472 r~ai~VI~l~s-----p~D~~~AtVe~~~~kLs~~Aaa~~~~d~~~a~a~~~~~~~idis~y~~vv~~~~~iv~~~~~~~ 546 (717) T protein:vir:79 472 SVTIGIIPTTT-----PSDISLAGVEEHVKKLENYANEFYMRDRFGNIIFDADRNKIDLGQFIEVVAGPDFIVRNTRLGQ 546 (717) T ss_pred ccceeeecccc-----ccccchhhHHHHHHHHHhhhhhhhhhcchhccccccccccccccceeeeeecceeEEEcCCCce Confidence 2344443211 11111122222221110 0 1234555555555556666777 Q ss_pred eEeehHHHHHHHHHHhhccCCceecccceeeceeeecccceecCChhHhhhhhhcCceEEEEEcCCcEEEEcceecCCCC Q lcl|NC_013693. 433 RWIPACGGTAGVWARSIEIAGIYKSPAFHNRGKYNNYNRMAWSASSDERAVLYRNQINSIVTFSNEGIVLYGDKTGLTRP 512 (631) Q Consensus 433 ~~~p~s~~~ag~~a~~d~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~L~~~gin~i~~~~~~G~~~wg~rT~~~~~ 512 (631) ...||+|++||+ |..+|+|+||+|++ |.|+.++.+.+++.|++.||++|||||++++++|+++||+||+++++ T Consensus 547 ~~~p~AG~vAGl----dA~rGVwkSPANk~---I~GVvgLa~~lT~sE~d~Ln~aGIntIr~~~GrGirVWGaRTtasd~ 619 (717) T protein:vir:79 547 MASTPDASYIGM----VSQLKTQSAPTNKP---LPSVTALRYTYSANQLNRLTKARFATFKYKQDGSIGVVDAPTSAHAG 619 (717) T ss_pred eecCHHHHHHHH----HhcCCcccccccce---ecccccCcccCCHHHHHHHhhCCeEEEEEeCCceEEEEeeeecCCCC Confidence 788886666655 55679999999997 66778889999999999999999999999999999999999998888 Q ss_pred hhhceehhhHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEccCCCCHHHhhCCeEEE Q lcl|NC_013693. 
513 SAFDRINVRGLFIMAEQNIAAIAKYYLGENNDEFTRSLFSNAVRPYIRQLANMGAIYDGQVKCDADNNTADIIAANQMVA 592 (631) Q Consensus 513 ~~~~~i~vrR~~~~i~~~~~~~~~~~v~epn~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~~~~~nt~~~i~~G~~~~ 592 (631) +.|+||+|||++++|+++|+++++|+|||||++.+|.+|+.+|++||++||++|+|.||++++ +||++++++|++++ T Consensus 620 sdWryInVRRl~D~Ie~sIr~al~~yVgEPNd~~tr~~Ik~sI~afL~~L~r~GAI~Gykvdv---tnT~~di~~G~l~V 696 (717) T protein:vir:79 620 SDYTRLSTARIVKEAVNAVREVADPFIGEPNDTGNRNALTAAVDKRLSKMIENKALLGFDFRL---VVTPQQELLGEGSI 696 (717) T ss_pred cccceeehhhhHHHHHHHHHHHHHHhccccCCHHHHHHHHHHHHHHHHHHHhcCceecceeeE---ecChhHhhCCEEEE Confidence 899999999999999999999999999999999999999999999999999999999999876 79999999999999 Q ss_pred EEEEEecCCceEEEEEEEEEe Q lcl|NC_013693. 593 GIWLKPEYSINWVYLDFAAVR 613 (631) Q Consensus 593 ~i~~~p~~p~e~i~~~~~~~~ 613 (631) +|.++|++|+|||+++++.+. T Consensus 697 ~I~vaPv~PaEfI~ititITA 717 (717) T protein:vir:79 697 ELSLEAPNELRRLTTIVSLSA 717 (717) T ss_pred EEEEEecCcccEEEEEEEEeC Confidence 999999999999999998775 No 42 >protein:vir:100829 Length: 607 # NCBI annotation: hypothetical protein # Family: family:all:2449 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164738;genbank:gi:56693151;genbank:GeneID:3197462 Probab=100.00 E-value=6.9e-57 Score=328.47 Aligned_cols=559 Identities=16% Similarity=0.118 Sum_probs=315.7 Q ss_pred CCCc-------------chhcCCceEEEEecCCCcee-cccCCceEEEeeeccCcCCCCeEEecCHHHHHHHcCCC--Cc Q lcl|NC_013693. 1 MATQ-------------SFSVAPSVQWTERDATLQTS-PSVVVQGATVGKFQWGEAELPVLVTGGETGLVKKFFKP--ND 64 (631) Q Consensus 1 m~~~-------------~~ylsPGVyveEv~~~~~~~-gv~tsv~afvG~~~~Gp~~~p~~i~s~~~e~~~~fG~~--~~ 64 (631) |.+. -.+-+||||+++.+++..++ ++++++.+|||.+++||+++|++++++ +|+++.||+. 
.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~pgv~~~~~~~~~~~~~~~~~~~~~~iG~a~~G~~~~~~~~~~~-~~a~~~f~~g~l~~ 79 (607) T protein:vir:10 1 MTTTITSAESYKRIYPLFYDSRPHVETNFDDSRLSNTASDSAKNIFMLGSATNGDPTKVYEIRTS-QQATKIFGSGDLVD 79 (607) T ss_pred CcceecchhhHHHHhCCCCccCCceEEEEecCcCcCCCCCCcceEEEEEEeCCCCCceEEEEcch-hHHHHhhcCcchHH Confidence 3221 12678999999999999877 899999999999999999999999887 7999999763 34 Q ss_pred cchhHHHHHHHHHhCCceEEEEEecccCCCcccccccchhhhccccccccccccceeeehhhhhhhhhhchhhhhccCcc Q lcl|NC_013693. 65 ATATDFLVIADFLSYSSVAWVTRVVGPAARNAVTKGQTAILIRNKLDFETASPSASITWTGRYAGSLGNDVAINVCDAAG 144 (631) Q Consensus 65 ~~~~~~av~~fF~ngG~~~~vvRv~~~~a~~a~~~~~~~~~~~~~~~~~~~~~~~~l~~~a~~~G~~gn~l~v~v~~~~~ 144 (631) .-.|.|.+..||.|||+.||+|||.+..+..+. ...+.+++...|.|+|.+++.++.... T Consensus 80 a~~~a~~~~~~~~~g~~~~~~~rv~~~~~a~~~--------------------~~~~~~~~~~~~~~~~~i~~~l~~~~~ 139 (607) T protein:vir:10 80 GIKLAFDPTGNSVTNGGTVYALRVDNAKQASLV--------------------KDGLTFTSSIFGTNANQVSVALDNDVF 139 (607) T ss_pred HHHHhhccccCCccCCceEEEEeCCCcccccee--------------------cccccccccccccCCCceEEEEEecCC Confidence 556777778888999999999999655443222 123567888999999999988842111 Q ss_pred cceeeccceeeeecccccceEEee--eeeeeeecccccccceeeee-----------ecccccccceeEeeccccccccc Q lcl|NC_013693. 145 FPTWEFRNNFAYAPQAGEYHIVIV--DKVGRITDSSGAVGQVDRIS-----------VSGTATGAGSISVAGEDVAYTDT 211 (631) Q Consensus 145 ~~~~~~~~~~~~~~~~g~~~~~~~--~~~~~v~~~~~~~~~~~~~~-----------~~~~~~~~~~~~~~~~~~~~~~~ 211 (631) ... ...+.... + .+..... +....+............+. ...+........+.......... 
T Consensus 140 ~~~---~~~~~~~~-d-~~~~~~~n~g~~~~i~y~g~~~~a~~~v~~~~~g~~~~lt~~~~~~~~~~~~V~~~~l~~~~~ 214 (607) T protein:vir:10 140 GVP---RITVNYSP-D-NYERTYTNIGQMFSITYSGKSASAGYTVSHDTDGKAILLTLGSGDSIDKLTNVATFDLTMSKY 214 (607) T ss_pred Ccc---ceeEEeec-c-cceeeeeeccceeecccCcccccccceeeecCCCceeEEEecCCCccceeeeeeccccccccc Confidence 110 00000000 0 0000000 00000000000000000000 00000000000000000000000 Q ss_pred ccccccccccccccccccccccccccccccccccccccccccccceeecccccccceeeeecccccccceee-eeeeeec Q lcl|NC_013693. 212 DTPATLATKIGTALTALTDVYSSVVVKSNTVTVTHKAIGPQTVTAIVPDANGLTATAVTTTVGASGSIIEKY-ELMQATQ 290 (631) Q Consensus 212 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~-~~~~~~~ 290 (631) .+...+.. ......++..... ....+..+-.++......+........... .+-...+.. .+..... T Consensus 215 ~t~~~l~~----din~~~~~~A~~~---g~~~i~tky~d~~~~~i~V~~~~~iv~a~~-----~D~~~~~~~~~~~~~t~ 282 (607) T protein:vir:10 215 DTIAKLMQ----AISATPNFSASVV---GSPSVNTSYLDEVTSPVDVKTAPAVVTAKI-----GDAISKLGYDPYVVVTQ 282 (607) T ss_pred chHHHHHH----HhhcCCceEEEEe---cccceeeeccccccceeEEEEeeeeechhh-----hhhhhcccccceEEeee Confidence 00000000 0000000000000 000000000000000000000000000000 000000000 0000000 Q ss_pred ccccccchhhhhhh--hhccccceeeeecccccccccccccccccchhhhhhHHHHHhhhhhcccceeEEeccccchHHH Q lcl|NC_013693. 291 GSKKSDGSNAYFKD--VINDTSNWVYTFATTLAAGVTELEGGVDDYTGNRVAAIEALNNAEAYDAKPVFAFCEELIEQQT 368 (631) Q Consensus 291 ~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~l~gg~d~~~~~~~~~~~~l~~~~~~~~~~~i~~~~~~~~~~~ 368 (631) ... .......... ......................|.||.|+.... 
...+.+..++..+...++++..+.++|.+ T Consensus 283 ~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~a~~~LtGGtdG~~~~--ty~dal~aLe~~e~~~i~~~t~d~ai~~~ 359 (607) T protein:vir:10 283 TSN-NKPIVNGVSAGTGSATASVTTAPESFPANFDTAFLTGGSTGDVPV--SWADKFNGAIGNNVYYIIPLTSEENIHAE 359 (607) T ss_pred ccc-chhhhhhhhccccceeeeeeccccccccccceeeeeCCCCCCchh--hHHHHHHHHhhcCceEEEecCCCHHHHHH Confidence 000 0000000000 000000000001111122235688998875431 22344445555555555555667788999 Q ss_pred HHHHHHhhcc----ceEEeecccccccccccCCHHHHHHHHHhcCCCcceEEEecCeeEEEeccCCceeEeeh---HHHH Q lcl|NC_013693. 369 LIDLSTERKD----TVSFVSPLRDVVVGNRGREMEDVVAWRESLVRDSSYFFMDDNWAYVYDKYNDKMRWIPA---CGGT 441 (631) Q Consensus 369 ~~~~~~~~~~----~~a~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~---s~~~ 441 (631) +.+||+++++ +++++..+ .+.+++++.+++..+ ++++++++.|+..+.|. +..+..|+ ++++ T Consensus 360 l~a~vkr~~~~g~~~~aVlg~~-------~~~t~~~~~t~a~~~--N~ervv~V~~~~~~~~~--G~~~~~~~~~~Aa~v 428 (607) T protein:vir:10 360 LQAFIDEQHVLGYNYHAFVGGG-------FAEPLEQILSRQVNI--NDSRFGLVGQSGHVQEG--GESVHVPAYLMAAYV 428 (607) T ss_pred HHHHHHHHHhCCCcEEEEecCC-------CCCCHHHHHHHHHhh--CCCcEEEEecCeeEeeC--CcceeccHHHHHHHH Confidence 9999988765 67777544 467889999988877 57899999999877663 45555664 6889 Q ss_pred HHHHHHhhccCCceecccceeeceeeecccceecCChhHhhhhhhcCceEEEEEcC----CcEEEEcceec--CCCChhh Q lcl|NC_013693. 442 AGVWARSIEIAGIYKSPAFHNRGKYNNYNRMAWSASSDERAVLYRNQINSIVTFSN----EGIVLYGDKTG--LTRPSAF 515 (631) Q Consensus 442 ag~~a~~d~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~L~~~gin~i~~~~~----~G~~~wg~rT~--~~~~~~~ 515 (631) ||++|..+ +.+||.|+.+. ..++..++++.|++.|+++|+.++...++ ++++++...|. 
..++..| T Consensus 429 AGl~Ag~~----~~~SlT~k~i~----~~~v~~~lt~~e~e~ai~~Gv~~l~~~~~~~~~~~vrIv~~ItT~t~~~~~~~ 500 (607) T protein:vir:10 429 GGLSSSLG----VAVPITNKKLA----LVDLDQNFSGDDLNTLNQNGVIGIEHLVNRNATGGYYIVQDVSTNTVSSSHVD 500 (607) T ss_pred HHHHhcCc----cccCcccceec----cccccccCCHHHHHHHHhCCeEEEEEccCccccceEEEeeeeeeccCCCCcch Confidence 99999887 77899999853 34677789999999999999999976543 36888777664 2456899 Q ss_pred ceehhhHHHHHHHHHHHHHH-HHHhcCCCCHHHHHHHHHHHHHHHHHHHh--CCceeeeEEEEccCCCCHHHhhCCeEEE Q lcl|NC_013693. 516 DRINVRGLFIMAEQNIAAIA-KYYLGENNDEFTRSLFSNAVRPYIRQLAN--MGAIYDGQVKCDADNNTADIIAANQMVA 592 (631) Q Consensus 516 ~~i~vrR~~~~i~~~~~~~~-~~~v~epn~~~~~~~i~~~i~~~l~~l~~--~gal~g~~v~~~~~~nt~~~i~~G~~~~ 592 (631) ++++++|++|+|.+.+++.+ ++|++++|++..|..++..+..||..+|. .|+|.+|..+ +-+-.....+++| T Consensus 501 ~~i~viRv~D~i~~dir~~~~~~yIGk~nnd~~~~~vk~~i~~~L~~~~l~~~gaI~df~~e-----dv~v~~~~D~v~v 575 (607) T protein:vir:10 501 GSLYLGELTDFLFDNLRFVLRDTYIGSNIRSTSADDIKSTVASYLYSEMNNDDGLIVDFSES-----DIVVTISGTVVYI 575 (607) T ss_pred heeehhhhHHHHHHHHHHHHhhcCCcccCCcchHHHHHHHHHHHHHHHHHHhcCceeCCCcc-----ccEEeeCCCEEEE Confidence 99999999999999999886 58999999999999999999999976554 6899997421 1111234568999 Q ss_pred EEEEEecCCceEEEEEEEEEecCceeeeeecc Q lcl|NC_013693. 
593 GIWLKPEYSINWVYLDFAAVRPDMEFSEIETG 624 (631) Q Consensus 593 ~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~~~ 624 (631) ++.+.|+.++|+|.+++....+.++=+....- T Consensus 576 ~~~v~Pv~~iekIyvtv~v~~~~~~~~~~~~~ 607 (607) T protein:vir:10 576 QFAVAPTQEIKNIVVSGTYSNYSATSEDNTTK 607 (607) T ss_pred EEEEEEcccceEEEEEEEEEEEEEeeccCCCC Confidence 99999999999999999987766553333222 No 43 >protein:vir:102957 Length: 437 # NCBI annotation: sheath tail protein # Family: family:all:632 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945292;genbank:gi:39653727;uniprot:Q708M0;genbank:GeneID:2672871 Probab=100.00 E-value=1.4e-53 Score=310.33 Aligned_cols=421 Identities=17% Similarity=0.162 Sum_probs=273.3 Q ss_pred CCCc----chhcCCceEEEEecCCCcee-cccCCceEEEeeeccCcCCCCeEEecCHHHHHHHcCCCCccchhHHHHHHH Q lcl|NC_013693. 1 MATQ----SFSVAPSVQWTERDATLQTS-PSVVVQGATVGKFQWGEAELPVLVTGGETGLVKKFFKPNDATATDFLVIAD 75 (631) Q Consensus 1 m~~~----~~ylsPGVyveEv~~~~~~~-gv~tsv~afvG~~~~Gp~~~p~~i~s~~~e~~~~fG~~~~~~~~~~av~~f 75 (631) |+-- -+-.-|||||||++.+.+.| +++|++++|+|.++|||+++|++|+|+ .||++.||+.. .+..+.+..+ T Consensus 1 m~gg~~~~~~k~~PGvYi~~~~~~~~~i~~~~~~~~a~~~~~~~Gp~~~~~~i~s~-~d~~~~fG~~~--~~~~~~~~~~ 77 (437) T protein:vir:10 1 MAGGIWKRQNKVRPGAYINVKSKDIAMTRLGGDGVVTVPLALSFGQSKKLMKIRRG-EDLFKKLGYEQ--ESPQLLLLNE 77 (437) T ss_pred CCcceecccceecCceeEEEecCCcceeeccCCcEEEEEEEecCCCCceeEEEecH-HHHHHHcCCcc--chhHHHHHHH Confidence 6641 12346999999999999887 899999999999999999999999886 79999999743 3445556666 Q ss_pred HHhCCceEEEEEecccCCCcccccccchhhhccccccccccccceeeehhhhhhhhhhchhhhhccCcccceeeccceee Q lcl|NC_013693. 76 FLSYSSVAWVTRVVGPAARNAVTKGQTAILIRNKLDFETASPSASITWTGRYAGSLGNDVAINVCDAAGFPTWEFRNNFA 155 (631) Q Consensus 76 F~ngG~~~~vvRv~~~~a~~a~~~~~~~~~~~~~~~~~~~~~~~~l~~~a~~~G~~gn~l~v~v~~~~~~~~~~~~~~~~ 155 (631) |+|||++||++|+.++.. |+. +....++++|++||.|||.+++.+.+......-.. 
T Consensus 78 ~~~g~~~~~~~R~~~g~~--a~~-----------------tl~~~~~~~A~~~G~~gn~i~v~v~~~~~d~~~~~----- 133 (437) T protein:vir:10 78 AFKRVSEVLLYRLNTGEK--ANV-----------------SLSDNVTAQAKYSGVRGNDITVTVKTNVDDPSSFD----- 133 (437) T ss_pred HhcCCCEEEEEECCCCce--eeE-----------------eeccceEEEeccCCcccceeEEEEeeccCCccceE----- Confidence 779999999999965321 110 01224788999999999999888765422111000 Q ss_pred eecccccceEEeeeeeeeeecccccccceeeeeecccccccceeEeeccccccccccccccccccccccccccccccccc Q lcl|NC_013693. 156 YAPQAGEYHIVIVDKVGRITDSSGAVGQVDRISVSGTATGAGSISVAGEDVAYTDTDTPATLATKIGTALTALTDVYSSV 235 (631) Q Consensus 156 ~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 235 (631) +. ...+. .... . ..+ ........ T Consensus 134 ------------------v~------------~~~~~------~~~d-~----------~~v--------~~~~~~~~-- 156 (437) T protein:vir:10 134 ------------------VV------------TFLDT------VVMD-L----------QTV--------KVLADLKN-- 156 (437) T ss_pred ------------------EE------------EecCc------ceee-e----------eeh--------hhhhhhhh-- Confidence 00 00000 0000 0 000 00000000 Q ss_pred ccccccccccccccccccccceeecccccccceeeeecccccccceeeeeeeeecccccccchhhhhhhhhccccceeee Q lcl|NC_013693. 236 VVKSNTVTVTHKAIGPQTVTAIVPDANGLTATAVTTTVGASGSIIEKYELMQATQGSKKSDGSNAYFKDVINDTSNWVYT 315 (631) Q Consensus 236 ~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 315 (631) .. ..... ..+ T Consensus 157 -------------------------------n~----------------~v~~~-----~~~------------------ 166 (437) T protein:vir:10 157 -------------------------------NA----------------LVEFS-----GTG------------------ 166 (437) T ss_pred -------------------------------hc----------------ccccc-----ccc------------------ Confidence 00 00000 000 Q ss_pred ecccccccccccccccccchhhhhhHHHHHhhhhhcccceeEEeccccchHHHHHHHHHhhccc-----eEEeecccccc Q lcl|NC_013693. 
316 FATTLAAGVTELEGGVDDYTGNRVAAIEALNNAEAYDAKPVFAFCEELIEQQTLIDLSTERKDT-----VSFVSPLRDVV 390 (631) Q Consensus 316 ~~~~~~~~~~~l~gg~d~~~~~~~~~~~~l~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~-----~a~~d~~~~~~ 390 (631) ...+.....+.||.++.... ......|..++..+...+++...+.+.+.++.+||+++++. .+++..+ T Consensus 167 --~l~~~a~~~LtGG~dg~~t~-~dy~~al~~le~~~~n~l~~~~~d~~~~t~~~~~ik~~r~~~g~~~~~V~~~~---- 239 (437) T protein:vir:10 167 --ELQPVAGAKLTGGTDGAIST-QDYLEYFKALETVEFNYMALPVEDASIKKAAINFIKRMREDEGLGAQLVVADS---- 239 (437) T ss_pred --ccccccceeeeccccCCCCh-hHHHHHHHHhccCcceEEEecCCChhHHHHHHHHHHHHHhccCceEEEEeCCC---- Confidence 00001112456676654321 11234555566555443333334556788999999887642 2333211 Q ss_pred cccccCCHHHHHHHHHhcCCCcceEEEecCeeEEEeccCCceeEeehHHHHHHHHHHhhccCCceecccceeeceeeecc Q lcl|NC_013693. 391 VGNRGREMEDVVAWRESLVRDSSYFFMDDNWAYVYDKYNDKMRWIPACGGTAGVWARSIEIAGIYKSPAFHNRGKYNNYN 470 (631) Q Consensus 391 ~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~ag~~a~~d~~~g~~~span~~~~~i~g~~ 470 (631) . .++.....+.+.....|- ....-.-..+.+||++|..+ +++|+.|+. +.++. T Consensus 240 ----~--------------~d~e~Iin~~n~~~~~~~--~~~~~~~~~a~vAG~~Ag~~----~~~S~t~~~---~~~~~ 292 (437) T protein:vir:10 240 ----D--------------ADSEAVINVKNGVILSDK--TVIDKTKATVWVAAASANAG----VEKSLTYEK---YEDSV 292 (437) T ss_pred ----C--------------CCCceEEEeecceeecCc--ceechhhHHHHHHHHhccCc----cccCccccc---cCCcc Confidence 1 123333334343332221 11111223578899999875 777999987 55666 Q ss_pred cceecCChhHhhhhhhcCceEEEEEcCCcEEEEcceecC----CCChhhceehhhHHHHHHHHHHHHHHH-HHhcC-CCC Q lcl|NC_013693. 471 RMAWSASSDERAVLYRNQINSIVTFSNEGIVLYGDKTGL----TRPSAFDRINVRGLFIMAEQNIAAIAK-YYLGE-NND 544 (631) Q Consensus 471 ~~~~~~~~~~~~~L~~~gin~i~~~~~~G~~~wg~rT~~----~~~~~~~~i~vrR~~~~i~~~~~~~~~-~~v~e-pn~ 544 (631) ++..+++++|++.|.++|+.++.+..++-+.++|-.|+. ..+++|+.|.++|++|+|.+.+++.+. 
+|+++ ||+ T Consensus 293 ~v~~~~t~~e~~~~i~~G~~vl~~~~~~v~i~~gInTltt~~~~~~~~~~ki~vir~~D~i~~di~~~~~~~yiGk~~N~ 372 (437) T protein:vir:10 293 DVVGRLSHTETEDALLKGQFVFTARRGRAVVEQDINSHVSFTIEKNQDFRKNRILRTLDDIVNDTRYAFSEYFLGKVSNN 372 (437) T ss_pred cccccCCHHHHHHHHhCCcEEEEEeCCeEEEEEccccccccCCCCCchhhhhhHHHHHHHHHHHHHHHHHhccccccCCC Confidence 777789999999999999999976543334447766654 346799999999999999999999876 59998 699 Q ss_pred HHHHHHHHHHHHHHHHHHHhCCceeeeEEEEccCCCCHHHhhCCeEEEEEEEEecCCceEEEEEEEEE Q lcl|NC_013693. 545 EFTRSLFSNAVRPYIRQLANMGAIYDGQVKCDADNNTADIIAANQMVAGIWLKPEYSINWVYLDFAAV 612 (631) Q Consensus 545 ~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~~~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~ 612 (631) +..|..++..|+.||++|+++|+|.+|.++..+..+. .....+++++.++|+.++|+|.+++... T Consensus 373 ~~~r~~~~~~i~~yl~~l~~~g~I~~~~~~d~~v~~~---~~~~~v~v~~~v~~~dame~iy~ti~v~ 437 (437) T protein:vir:10 373 EDGRQAFKANRIRYFKDLEARGAIEDFKVEDIEVLRG---ELKESVVVNVKVKPVDSMEKLYMTVTVE 437 (437) T ss_pred HHHHHHHHHHHHHHHHHHHhCCCccCCCceeEEeecC---CCCCEEEEEEEEEEeeeeeeEEEEEEec Confidence 9999999999999999999999999998876544322 1356889999999999999999999865 No 44 >protein:vir:101326 Length: 529 # NCBI annotation: gp22 # Family: family:all:9453 # MgeID: mge:1512 # MgeName: P1 # Cross-refs: genbank:acc:YP_006525;genbank:gi:46401679;genbank:GeneID:2777386 Probab=100.00 E-value=5.5e-46 Score=268.69 Aligned_cols=482 Identities=12% Similarity=0.069 Sum_probs=309.0 Q ss_pred cchh-------cCCceEEEEecCCC--c-eecccCCceEEEeeeccCcCCCCeEEecCHHHHHHHcCCC---CccchhHH Q lcl|NC_013693. 4 QSFS-------VAPSVQWTERDATL--Q-TSPSVVVQGATVGKFQWGEAELPVLVTGGETGLVKKFFKP---NDATATDF 70 (631) Q Consensus 4 ~~~y-------lsPGVyveEv~~~~--~-~~gv~tsv~afvG~~~~Gp~~~p~~i~s~~~e~~~~fG~~---~~~~~~~~ 70 (631) +++| .+.||-|.+++.-. + .+|+++++.|+||.|+||+.++|++++.+ .+++.-|.| ....+... 
T Consensus 1 ~~~ysi~q~ig~aSGvav~pi~~d~t~~~~~g~g~~v~a~Vgif~RG~i~k~~~Vt~~--n~~~~LGep~~~~~ga~~E~ 78 (529) T protein:vir:10 1 MSQYSIQQSLGNASGVAVSPINADATLSTGVALNSSLWAGIGVFARGKPFTVLAVTES--NYEDVLGEPLKPSSGSQFEP 78 (529) T ss_pred CCceehhhhhhhhcccccCCcCcccccchheecCceEEEEEEEeecCCCcceEEEchh--HHHHHhccccCCCcchhhhh Confidence 2334 24699999887332 3 34789999999999999999999999742 466666665 55677888 Q ss_pred HHHHHHHhCCceEEEEEecccCCCcccccccchhhhccccccccccccceeeehhhhhhhhhhchhhhhccCcccceeec Q lcl|NC_013693. 71 LVIADFLSYSSVAWVTRVVGPAARNAVTKGQTAILIRNKLDFETASPSASITWTGRYAGSLGNDVAINVCDAAGFPTWEF 150 (631) Q Consensus 71 av~~fF~ngG~~~~vvRv~~~~a~~a~~~~~~~~~~~~~~~~~~~~~~~~l~~~a~~~G~~gn~l~v~v~~~~~~~~~~~ 150 (631) .++.|+.-+++.|||||+++.+++-..-.-. .+ .......+.+..+..++||+.+.+-+.|+.+...... T Consensus 79 ~~h~~eA~~~~s~yVVRvv~~dak~p~i~~~--------~~--~~~~~s~~~~s~~~~l~~G~~~~iy~~Dgd~~~s~~~ 148 (529) T protein:vir:10 79 IRHVYEAIQQTSGYVVRAVPDDAKFPIIMFD--------ES--GEPAYSALPYGSEIELDSGEAFAIYVDDGDPCISPTR 148 (529) T ss_pred HhhhhhhhcCCceEEEEEcccccCCceEEec--------CC--ccchhhcccccccccccccceEEEEEecCcCccCCce Confidence 8999998888889999998877654321100 00 1112233455556667888888877777665433222 Q ss_pred cceeeeecccccceEEeeeeeeeeecccccccceeeeeecccccccceeEeecccccccccccccccccccccccccccc Q lcl|NC_013693. 151 RNNFAYAPQAGEYHIVIVDKVGRITDSSGAVGQVDRISVSGTATGAGSISVAGEDVAYTDTDTPATLATKIGTALTALTD 230 (631) Q Consensus 151 ~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 230 (631) ...+.....+....+. ..+...... T Consensus 149 ~l~i~~~~ads~g~e~------------------~~l~~~~~~------------------------------------- 173 (529) T protein:vir:10 149 ELTIETATADSAGNER------------------FLLKLTQTT------------------------------------- 173 (529) T ss_pred EEEEEeeccccCCCcc------------------ceeeEEEEe------------------------------------- Confidence 2222111111000000 000000000 Q ss_pred cccccccccccccccccccccccccceeecccccccceeeeecccccccceeeeeeeeecccccccchhhhhhhhhcccc Q lcl|NC_013693. 
231 VYSSVVVKSNTVTVTHKAIGPQTVTAIVPDANGLTATAVTTTVGASGSIIEKYELMQATQGSKKSDGSNAYFKDVINDTS 310 (631) Q Consensus 231 ~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 310 (631) . ...+..+|+++ .+....+.+..|...++.+.+...+ T Consensus 174 ---~---------------------------------------~g~~~~let~~-~sl~~~a~dd~G~~~yl~svle~~s 210 (529) T protein:vir:10 174 ---S---------------------------------------LGVVTTLETHT-VSLAEEAKDDMGRLCYLPTALEARS 210 (529) T ss_pred ---e---------------------------------------cCCceEEEEEE-eeeeechhhhcCCccchhHHHhhcc Confidence 0 00001111111 1122222333333344444444433 Q ss_pred ceeeeecccccc--------cccccccccccchh-----hhhhHHHHHhhhhhcccceeEEecc-ccchHHHHHHHHHhh Q lcl|NC_013693. 311 NWVYTFATTLAA--------GVTELEGGVDDYTG-----NRVAAIEALNNAEAYDAKPVFAFCE-ELIEQQTLIDLSTER 376 (631) Q Consensus 311 ~~~~~~~~~~~~--------~~~~l~gg~d~~~~-----~~~~~~~~l~~~~~~~~~~~i~~~~-~~~~~~~~~~~~~~~ 376 (631) ...+..-..+-. ....+.+|+|+... ++..++..|.+ ..++...+|.... +.++..+|+.+|+++ T Consensus 211 ~~l~ai~~~e~~~t~~~~t~~d~~f~~GtdG~~~~i~s~~y~~A~~~L~n-~p~d~~~il~~g~y~~a~I~~L~~ic~~~ 289 (529) T protein:vir:10 211 KYLRAVVNEELISTAKVTNKKSLAFTGGTNGDQSKISTAAYLRAVKVLNN-APYMYTAVLGLGCYDNAAITALGKICADR 289 (529) T ss_pred CceeeeeeeccccccchhhhhhhhccCCccccccccchHHHHHHHHHhcC-CcceeeeeeccCCccHHHHHHHHHHHhhh Confidence 332221111000 01255677765432 33334444432 3455555554333 667889999999775 Q ss_pred ccceEEeecccccccccccCCHHHHHHHHHhcCCC---cce-EEEecCeeEEEeccCCceeEeehHHH--HHHHH--HHh Q lcl|NC_013693. 377 KDTVSFVSPLRDVVVGNRGREMEDVVAWRESLVRD---SSY-FFMDDNWAYVYDKYNDKMRWIPACGG--TAGVW--ARS 448 (631) Q Consensus 377 ~~~~a~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~---s~~-~~~~~p~~~~~d~~~~~~~~~p~s~~--~ag~~--a~~ 448 (631) +.-| +.|.| ...|+.++++|++.+|+. +-+ +.++|||. .-||.++....+++||. +|... ++. 
T Consensus 290 ~~d~-f~DV~-------~~LT~~aA~~~~e~~gl~~~~~~~~s~y~~P~~-~~D~~tg~k~~~GlsG~A~~akargv~~n 360 (529) T protein:vir:10 290 LIDG-FFDVK-------PTLTYAEALPAVEDTGLLGTDYVSCSVYHYPFS-CKDKWTQSRVVFGLSGVAYAAKARGVKKN 360 (529) T ss_pred hhcE-EEcCC-------CCcCHHHHHHHHHhcCccccCceeeEEEEccee-eccccccCceeeCCCcceeeccccceeec Confidence 5333 33655 578999999999998872 222 45777886 78999999999999994 33321 333 Q ss_pred hccCCceecccceeeceeee-cccceecCChhHhhhhhhcCceEEEEEcCCcEEEEcceecCCCChhhceehhhHHHHHH Q lcl|NC_013693. 449 IEIAGIYKSPAFHNRGKYNN-YNRMAWSASSDERAVLYRNQINSIVTFSNEGIVLYGDKTGLTRPSAFDRINVRGLFIMA 527 (631) Q Consensus 449 d~~~g~~~span~~~~~i~g-~~~~~~~~~~~~~~~L~~~gin~i~~~~~~G~~~wg~rT~~~~~~~~~~i~vrR~~~~i 527 (631) ....|+|++|||+.++-|.- -+.+.+..++.|..+|-.++||++..-.++++++-.+.|++..++.|||+|+++|+++| T Consensus 361 a~v~g~hY~pAGe~r~~inr~~I~~ly~~d~~e~~~lv~~riNPV~~~~~g~~~idDsLt~~~knny~R~~hv~~lmn~I 440 (529) T protein:vir:10 361 SDVGGWHYSPAGEERAVIARASIQPLYPEDTPDEEAMVKGRLNKVSVGTSGQMIIDDALTCCTQDNYLHFQHVPSLMNAI 440 (529) T ss_pred ccccccccccCCCccceeecccceeccCCCccCHHHHHhhccCeeeeeccCcceeeeeeceeeeCCchhhhhHHHHHHHH Confidence 34445699999998775532 12455777888999999999999987666666665666767778999999999999999 Q ss_pred HHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhCCceee-----------eEEEEccCCCCHHHhhCCeEEEEEEE Q lcl|NC_013693. 528 EQNIAAIAKYYLGENNDEFTRSLFSNAVRPYIRQLANMGAIYD-----------GQVKCDADNNTADIIAANQMVAGIWL 596 (631) Q Consensus 528 ~~~~~~~~~~~v~epn~~~~~~~i~~~i~~~l~~l~~~gal~g-----------~~v~~~~~~nt~~~i~~G~~~~~i~~ 596 (631) ++.+.+..+|.+|||++..+|. |++.++.+|+.+|+.|+|++ |++.+ +|.+ .++|.+++.+ T Consensus 441 ~~~~~k~a~~~~~~Pd~it~~g-l~~~l~~~L~r~~asgalv~prdp~~~G~epy~~~V-----~q~d--~D~~~v~~~~ 512 (529) T protein:vir:10 441 SRFFVQLARQMKHSPDGITAAG-LTKGMTKLLDRFVASGALVAPRDPDADGTEPYVLKV-----TQAE--FDKWEVVWAC 512 (529) T ss_pred HHHHHHHHHHHhhCCChHHHHH-HHHhHHHHHHHHHhcCceecccCccCCCCCceEEEE-----eecc--cCeEEEEEEe Confidence 9999999999999999999987 99999999999999999975 66666 2333 4899999999 Q ss_pred EecCCceEEEEEEEEEe Q lcl|NC_013693. 
597 KPEYSINWVYLDFAAVR 613 (631) Q Consensus 597 ~p~~p~e~i~~~~~~~~ 613 (631) +|..-+++|.+.-...+ T Consensus 513 ~ptGv~Rri~~~p~l~~ 529 (529) T protein:vir:10 513 CPTGVARRIQGVPLLIK 529 (529) T ss_pred ecCCceeeEEeeeeecC Confidence 99999999987655444 No 45 >protein:vir:105470 Length: 451 # NCBI annotation: putative phage sheath tail protein # Family: family:all:632 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529880;genbank:gi:90592620;genbank:GeneID:3974534 Probab=100.00 E-value=8.2e-43 Score=251.31 Aligned_cols=428 Identities=15% Similarity=0.094 Sum_probs=260.2 Q ss_pred CCCcc----hhcCCceEEEEecCCCcee-cccCCceEEEeeec-cCcCCCCeEEecCHHHHHHHcCCCCccchhHHHHHH Q lcl|NC_013693. 1 MATQS----FSVAPSVQWTERDATLQTS-PSVVVQGATVGKFQ-WGEAELPVLVTGGETGLVKKFFKPNDATATDFLVIA 74 (631) Q Consensus 1 m~~~~----~ylsPGVyveEv~~~~~~~-gv~tsv~afvG~~~-~Gp~~~p~~i~s~~~e~~~~fG~~~~~~~~~~av~~ 74 (631) |+--- +=.-|||||||++++.+++ |++|++++|+|.+. ||| ++|+.|+|+ .||++.||..... ..+.... T Consensus 1 magg~~~~~~K~~PGvYi~~~~~~~~~~~~~~~~~~~~i~~~~~~g~-~~~v~i~~~-~d~~~~fG~~~~~--~~~~~~~ 76 (451) T protein:vir:10 1 MAGGTWKAQDKRRPGTYINVVGNGQREAASSLGRVLLIRDKGLGWGK-NGVIEVEAN-SDFTKKLGTTLDD--PSLTALK 76 (451) T ss_pred CCceeeccceeecCceEEEEeccCcceeeccCCcEEEEEeeecCCCC-cccEEeecH-HHHHHHcCCcccc--hhHHHHH Confidence 66520 1134999999999988765 99999999999765 666 789999887 7999999975432 2333334 Q ss_pred HHHhCCceEEEEEecccCCCcccccccchhhhccccccccccccceeeehhhhhhhhhhchhhhhccCcccceeecccee Q lcl|NC_013693. 75 DFLSYSSVAWVTRVVGPAARNAVTKGQTAILIRNKLDFETASPSASITWTGRYAGSLGNDVAINVCDAAGFPTWEFRNNF 154 (631) Q Consensus 75 fF~ngG~~~~vvRv~~~~a~~a~~~~~~~~~~~~~~~~~~~~~~~~l~~~a~~~G~~gn~l~v~v~~~~~~~~~~~~~~~ 154 (631) +|++||++||+.|+.+.+...++. ....++++|++||.|||.+++++.+...+..-.... 
T Consensus 77 ~~~~g~~~v~~yrl~~g~~a~~t~------------------~~~~~~~~Aky~G~~Gn~i~v~v~~~~~d~~~~~v~-- 136 (451) T protein:vir:10 77 ETLKGASKVLVLNPNEGTAATLTK------------------EGLPWTVTANYPGEKGNQITVSVEVSPADQNAATVS-- 136 (451) T ss_pred HHhcCCcEEEEEEcCCCceEEEEe------------------ecCceEEEEeeCCcCCceEEEEEecccCCcCceEEE-- Confidence 555799999999997543221110 122367899999999999999887643221100000 Q ss_pred eeecccccceEEeeeeeeeeecccccccceeeeeecccccccceeEeecccccccccccccccccccccccccccccccc Q lcl|NC_013693. 155 AYAPQAGEYHIVIVDKVGRITDSSGAVGQVDRISVSGTATGAGSISVAGEDVAYTDTDTPATLATKIGTALTALTDVYSS 234 (631) Q Consensus 155 ~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 234 (631) ..... . .... ..+.. .. ...... T Consensus 137 -----------t~~g~-~----------~vd~------------qtv~~------------------~~----~~el~~- 159 (451) T protein:vir:10 137 -----------TIFGT-K----------LVDE------------QSIKF------------------NE----LDKFKG- 159 (451) T ss_pred -----------EEECC-e----------EEEE------------EEeec------------------cc----hhhccC- Confidence 00000 0 0000 00000 00 000000 Q ss_pred cccccccccccccccccccccceeecccccccceeeeecccccccceeeeeeeeecccccccchhhhhhhhhccccceee Q lcl|NC_013693. 235 VVVKSNTVTVTHKAIGPQTVTAIVPDANGLTATAVTTTVGASGSIIEKYELMQATQGSKKSDGSNAYFKDVINDTSNWVY 314 (631) Q Consensus 235 ~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 314 (631) ...+.+.....+... . .. T Consensus 160 -------------------------------nd~V~a~~~~~g~~~------------------~-------------~~ 177 (451) T protein:vir:10 160 -------------------------------NDYITAKVVEEGSSK------------------P-------------VA 177 (451) T ss_pred -------------------------------CceEEEEeccccccc------------------c-------------ee Confidence 000000000000000 0 00 Q ss_pred eecccccccccccccccccchhhhhhHHHHHhhhhhcccceeEEecc--ccchHHHHHHHHHhhcc-----ceEEeeccc Q lcl|NC_013693. 
315 TFATTLAAGVTELEGGVDDYTGNRVAAIEALNNAEAYDAKPVFAFCE--ELIEQQTLIDLSTERKD-----TVSFVSPLR 387 (631) Q Consensus 315 ~~~~~~~~~~~~l~gg~d~~~~~~~~~~~~l~~~~~~~~~~~i~~~~--~~~~~~~~~~~~~~~~~-----~~a~~d~~~ 387 (631) .... .....+|.+..+.. .....|..++..+...+.+.+. ...++..+.++|+++++ +.+++..+. T Consensus 178 ~~~l-----~~~~~gg~~~~~~~--~~~~~l~~~e~~~~n~l~~~~~~~~~~i~~~~~a~ik~~r~~~g~~~~aVl~~~~ 250 (451) T protein:vir:10 178 FTNV-----SGTLTGGTTTESNK--VESLLNDALENEEYAVVTTAGFEPSSNMNKLVVEAVKRLRENEGRKVRGVIPTDA 250 (451) T ss_pred eeec-----ccccccccccCCcc--chHHHHHHhccceeeEEEEccCCCchHHHHHHHHHHHHHHHhcCCeEEEEecCcc Confidence 0000 00112222222111 1123455555555443333222 23467788899988653 245553221 Q ss_pred ccccccccCCHHHHHHHHHhcCCCcceEEEecCeeEEEeccCCceeEeeh---HHHHHHHHHHhhccCCceecccceeec Q lcl|NC_013693. 388 DVVVGNRGREMEDVVAWRESLVRDSSYFFMDDNWAYVYDKYNDKMRWIPA---CGGTAGVWARSIEIAGIYKSPAFHNRG 464 (631) Q Consensus 388 ~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~---s~~~ag~~a~~d~~~g~~~span~~~~ 464 (631) ... .+++....+.+.....| + ..+++ .+.+||++|..+ +.+|+.|+. T Consensus 251 -------~~~------------~d~egiinv~n~~~~~d---g--~~~~~~~~~~~vAG~~Ag~~----~~~S~T~~~-- 300 (451) T protein:vir:10 251 -------DTT------------YNYEGISTVVNGYTLSD---G--TNVDVKDATGYFAGISASAD----VATSLTYFE-- 300 (451) T ss_pred -------CCC------------CCCcceEEeecceEecC---c--eeechhhhHHHHHHHHcccc----cccCcccee-- Confidence 110 13344444444443322 1 12233 478899999876 667999987 Q ss_pred eeeecccceecCChhHhhhhhhcCceEEEEEcCCcEEE-EcceecC----CCChhhceehhhHHHHHHHHHHHHHHHH-H Q lcl|NC_013693. 465 KYNNYNRMAWSASSDERAVLYRNQINSIVTFSNEGIVL-YGDKTGL----TRPSAFDRINVRGLFIMAEQNIAAIAKY-Y 538 (631) Q Consensus 465 ~i~g~~~~~~~~~~~~~~~L~~~gin~i~~~~~~G~~~-wg~rT~~----~~~~~~~~i~vrR~~~~i~~~~~~~~~~-~ 538 (631) +.++.++..+++++|++.+.++|..+++...++++++ +|-.|+. ..+..|+.|.++|++|+|.+.+++.+.. 
| T Consensus 301 -~~~~~~v~~~~t~~e~~~~i~~G~lvl~~~~g~~v~i~~~INTltt~~~~k~~~~~ki~vir~~D~i~~di~~~~~~~y 379 (451) T protein:vir:10 301 -VEDAVSAYPKFDNEKTIKALDAGQIVFTTRPGQRVVIEQDINSLHKFTAEKPQAFSKNRVIRTLDEIATNTENTFERTY 379 (451) T ss_pred -cCCceeeeeeCCHHHHHHHHhCCeEEEEEEcCCeEEEEEccccceecCCCCCcchhhhhHHHHHHHHHHHHHHHhhhcc Confidence 5566677789999999999999999887656766665 7777763 3457899999999999999999999864 8 Q ss_pred hcC-CCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEccCCCCHHHhhCCeEEEEEEEEecCCceEEEEEEEEE Q lcl|NC_013693. 539 LGE-NNDEFTRSLFSNAVRPYIRQLANMGAIYDGQVKCDADNNTADIIAANQMVAGIWLKPEYSINWVYLDFAAV 612 (631) Q Consensus 539 v~e-pn~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~~~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~ 612 (631) +++ |||..-|..++..|+.||++|+++|+|..|... |.+. ...-....+++.+.++|+..||+|.+++... T Consensus 380 iGk~~N~~~gr~~~~~~i~~yl~~l~~~g~i~~~~~~-d~~v--~~~~~~~~v~v~~~v~pvdame~iy~t~~v~ 451 (451) T protein:vir:10 380 LGNVGNNAAGRDLFKADRIAYLTSLQNRNMIQSFANT-DITV--EAGNDMDSIVVNLAVTPVDAMEKLYMTMVVR 451 (451) T ss_pred ceecCCCHHHHHHHHHHHHHHHHHHHhCCCccCCCcc-ceEE--eecCCCCEEEEEEEEEEEeeeeeEEEEEEEc Confidence 875 599999999999999999999999999998621 2111 1112367899999999999999999998754 No 46 >protein:vir:7653 Length: 581 # NCBI annotation: gp124 # Family: family:all:1196 # MgeID: mge:148 # MgeName: Bxz1 # Cross-refs: genbank:acc:NP_818197;genbank:gi:29566631;genbank:GeneID:1259809 Probab=100.00 E-value=4.4e-39 Score=230.89 Aligned_cols=512 Identities=12% Similarity=0.015 Sum_probs=236.6 Q ss_pred cCCCCccchhHHHHHHHHH-----hCCceEEEEEecccCCCcccccccchhhhccccccccccccceeeehhhhhhhhhh Q lcl|NC_013693. 59 FFKPNDATATDFLVIADFL-----SYSSVAWVTRVVGPAARNAVTKGQTAILIRNKLDFETASPSASITWTGRYAGSLGN 133 (631) Q Consensus 59 fG~~~~~~~~~~av~~fF~-----ngG~~~~vvRv~~~~a~~a~~~~~~~~~~~~~~~~~~~~~~~~l~~~a~~~G~~gn 133 (631) .|-. .-.+...-.|=+ .+|.+|.+ ...+.+.....|--|. 
T Consensus 1 ~~~~---~~~~~~~~~~t~~~~~~~~g~~~~~--------------------------------~~~~~i~g~~~g~~g~ 45 (581) T protein:vir:76 1 MAID---FSQYQTPGVYTEAVGAPQLGIRSSV--------------------------------PTAVAIFGTAVGYQTY 45 (581) T ss_pred Cccc---ccccccchhhhhhccccccCcceee--------------------------------eeeeeecccccccccc Confidence 0000 000000000001 12222211 0111222222232222 Q ss_pred chhhhhccCcccc-eeeccceeeeecccccceEEeeeeeee-eecccccc-----------cceeeeeecccccccceeE Q lcl|NC_013693. 134 DVAINVCDAAGFP-TWEFRNNFAYAPQAGEYHIVIVDKVGR-ITDSSGAV-----------GQVDRISVSGTATGAGSIS 200 (631) Q Consensus 134 ~l~v~v~~~~~~~-~~~~~~~~~~~~~~g~~~~~~~~~~~~-v~~~~~~~-----------~~~~~~~~~~~~~~~~~~~ 200 (631) ...++.+...... ..+.. ........|.+...+.+.... +...+... .........+.. ..... T Consensus 46 ~~s~r~~p~~~~~~evq~v-~~~~~~t~G~ftLt~~g~tT~~I~~~asa~~v~~AL~~L~~i~~~~v~vtg~~--~~~~~ 122 (581) T protein:vir:76 46 RESIRINPDTGETITTQIL-ALVGEPTGGSFKLSLAGEPTGNIPFNATQGQVQSALRALPNVEDDEVTVLGDP--GGPWT 122 (581) T ss_pred cceeeecCCCCCCCceEEE-EEeecCCcceEEEEeCceeccccccCCCHHHHHHHHhhccCCCCceEEEEcCC--CceEE Confidence 2222222111100 00000 000000111121111111100 00000000 000000000000 00000 Q ss_pred eeccccccccccccccccccccccccccccccccccccccccccccccccccccc-------ceeec--------ccc-- Q lcl|NC_013693. 201 VAGEDVAYTDTDTPATLATKIGTALTALTDVYSSVVVKSNTVTVTHKAIGPQTVT-------AIVPD--------ANG-- 263 (631) Q Consensus 201 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~-------~~~~~--------~~~-- 263 (631) +............... ...+... .........+.......+...+....... ...+. ... T Consensus 123 V~F~g~~~~~~~~~~~--ltg~~~~-~~~V~~~~~G~~~~~~~l~~~g~~~~~~~~~~s~~~~~~~l~~~~~~~~~~~~~ 199 (581) T protein:vir:76 123 VTFTKAVAALTKDVTG--LTGGDNP-DLNIASEQTGVPAMNRALAKKGIKTDTIRVVNPNSGQVYVLGTDYVVTRVNAGE 199 (581) T ss_pred EEEcCCccceeEeeee--eecCCcc-eeEEEEEecCcCCcCceeeeccccccccceeecCCcceeeecccccceeeccCc Confidence 0000000000000000 0000000 00000000000000000000000000000 00000 000 Q ss_pred -----cccceeeeeccccc------cccee-eee--------eeeecccccccchhhhhhhhhcccc---ceeeee-ccc Q lcl|NC_013693. 
264 -----LTATAVTTTVGASG------SIIEK-YEL--------MQATQGSKKSDGSNAYFKDVINDTS---NWVYTF-ATT 319 (631) Q Consensus 264 -----~~~~~~~~~v~~~~------~~~~~-~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~-~~~ 319 (631) ...+........++ .+.+. +.. ..+...... .....+..-..+. ...... ... T Consensus 200 ~~~~~~~~~~~t~~~~~~g~~~~~~~i~~~~~~~~D~~~~~~v~~~~~~~~---~~~~~~~~~~~g~~~~e~~~~~~~~~ 276 (581) T protein:vir:76 200 DGEANTRDDLYTIQRVVDGGHIDPGDIVQLSYRYTDPNYHEVIRFTDPDDI---QDFYGPAFDEAGNVQSEITLCAQLAI 276 (581) T ss_pred ccceeeeeeeeeeEeecccccccceeEEEEEEEeecCCccceEEEeccccc---ccceeeehhhcCccccchhhhhheee Confidence 00000000000000 00000 000 000000000 0000000000000 000000 001 Q ss_pred ccccccccccccccchh--hhhhHHHHHhhhhhcccceeE-EeccccchHHHHHHHHHhhc----cceEEeecccccccc Q lcl|NC_013693. 320 LAAGVTELEGGVDDYTG--NRVAAIEALNNAEAYDAKPVF-AFCEELIEQQTLIDLSTERK----DTVSFVSPLRDVVVG 392 (631) Q Consensus 320 ~~~~~~~l~gg~d~~~~--~~~~~~~~l~~~~~~~~~~~i-~~~~~~~~~~~~~~~~~~~~----~~~a~~d~~~~~~~~ 392 (631) .......+.+|.++... .......+|..++..+...++ +...+..+|..+.+||+.+. ++.+++..+. . T Consensus 277 t~~~~~~l~~gvd~~g~tvt~~dy~~aL~ale~~~~~~ivvp~t~~~~i~a~l~ahv~~~s~~~~~~ra~igv~g----~ 352 (581) T protein:vir:76 277 TNGASTILACAVDPEGDTVTMGDYQNALNKFRDEDEIAIIVAGTGAQPIQALVQQHVSAQSNNKYERRAILGMDG----S 352 (581) T ss_pred ccccceEEEeeecCCCCccchHHHHHHHHHHhcCCeEEEEEecCCChHHHHHHHHHHHHHHhccCCceEEEEeeC----C Confidence 11222345555554211 111223455666665554433 33445567888999987764 3455554321 1 Q ss_pred cccCCHHHHHHHHHhcCCCcceEEEecCeeEEEeccCCceeEeehHHHHHHHHHHhhccCCceecccceeeceeeecccc Q lcl|NC_013693. 393 NRGREMEDVVAWRESLVRDSSYFFMDDNWAYVYDKYNDKMRWIPACGGTAGVWARSIEIAGIYKSPAFHNRGKYNNYNRM 472 (631) Q Consensus 393 ~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~ag~~a~~d~~~g~~~span~~~~~i~g~~~~ 472 (631) ....+.+++++.... 
++++|+.+++||.++++..........|..++|+.+|.+.....+++||.|++ +.|+.++ T Consensus 353 ~~~~~~~~~~~~a~~--~ns~Rvvlv~p~~~~~~g~~~~~~~~lp~~~~AA~vAG~~a~~~~~~slT~~~---i~g~~~~ 427 (581) T protein:vir:76 353 VTPVPSATRIANAQS--IKDQRVALISPSSFVYYAPELNREVVLGGQFMAAAVAGKSVSAIAAMPLTRKV---IRGFSGP 427 (581) T ss_pred CCCchHHHHHHhhcc--cCCCcEEEEEcCceEeccccCCcceecchhhhhhhHHhhhhccccccCccccc---ccccccc Confidence 223344555554444 46899999999999988765444444455566666666666777999999998 5566778 Q ss_pred eecCChhHhhhhhhcCceEEEEEcCCcEEE-EcceecCCCChhhceehhhHHHHHHHHHHHHHHH--HHhcCCCCHHHHH Q lcl|NC_013693. 473 AWSASSDERAVLYRNQINSIVTFSNEGIVL-YGDKTGLTRPSAFDRINVRGLFIMAEQNIAAIAK--YYLGENNDEFTRS 549 (631) Q Consensus 473 ~~~~~~~~~~~L~~~gin~i~~~~~~G~~~-wg~rT~~~~~~~~~~i~vrR~~~~i~~~~~~~~~--~~v~epn~~~~~~ 549 (631) ...+++.|++.|+++|+++++.++++++++ ||-+|+. .++.|++|++||++|++++.+++.++ +|++|||++.+|. T Consensus 428 ~~~~s~~e~e~ll~~Gv~~l~~~~~~~v~Iv~gItT~~-s~~~~k~i~viR~~D~v~~~vr~~~~~~~fiG~~n~~~~r~ 506 (581) T protein:vir:76 428 AEVQRDGEKSRESSEGLMVIEKTPRNLVHVRHGVTTDP-TSLHTREWNIIGQQDVMVYRIRDYLDADGLIGMPIYDTTIV 506 (581) T ss_pred cccCCHHHHHHHHhCCeEEEEEecCCeEEEEEeeecCC-CCCccceeeehhhhHHHHHHHHHHHhhhcCCCcccChHHHH Confidence 889999999999999999999989989975 6767764 46789999999999999999999986 5888999999999 Q ss_pred HHHHHHHHHHHHHHhCCceeeeEEEEccCCCCHHHhhCCeEEEEEEEEecCCceEEEEEEEEEecCceeeeeeccCeee Q lcl|NC_013693. 550 LFSNAVRPYIRQLANMGAIYDGQVKCDADNNTADIIAANQMVAGIWLKPEYSINWVYLDFAAVRPDMEFSEIETGGGIV 628 (631) Q Consensus 550 ~i~~~i~~~l~~l~~~gal~g~~v~~~~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~~~g~~~ 628 (631) +|+..+..||..||+.|+|.||+.. ..++.+++.+++++++.++|++|+|||.+++.++..+-+|+-..+|---. 
T Consensus 507 ~ik~~i~~~L~~l~~~g~I~g~~~~----~~~~~~~~~d~v~V~i~v~Pv~~ie~I~vt~~~~p~~~~~~~~~~~~~~~ 581 (581) T protein:vir:76 507 QVKASAEAALVWLVDNNIIRGYRNL----KARQIERQPDVIEVRYEWRPAYPLNYIVVRYSIAPETGDITSTIEGTTSF 581 (581) T ss_pred HHHHHHHHHHHHHHhcCcccCcccc----eeeEEecCCCEEEEEEEEEecccceEEEEEEEEeeCCCceEEEEeccccC Confidence 9999999999999999999998632 33556678899999999999999999999999998888888777665433 No 47 >protein:vir:107310 Length: 581 # NCBI annotation: gp123 # Family: family:all:1196 # MgeID: mge:1550 # MgeName: Catera # Cross-refs: genbank:acc:YP_656131;genbank:gi:109393333;genbank:GeneID:4156791 Probab=100.00 E-value=1.4e-38 Score=228.11 Aligned_cols=517 Identities=12% Similarity=0.078 Sum_probs=230.3 Q ss_pred cCCCCccchhH--HHHHHHHHhCCceEEEEEecccCCCcccccccchhhhccccccccccccceeeehhhhhhhhhhchh Q lcl|NC_013693. 59 FFKPNDATATD--FLVIADFLSYSSVAWVTRVVGPAARNAVTKGQTAILIRNKLDFETASPSASITWTGRYAGSLGNDVA 136 (631) Q Consensus 59 fG~~~~~~~~~--~av~~fF~ngG~~~~vvRv~~~~a~~a~~~~~~~~~~~~~~~~~~~~~~~~l~~~a~~~G~~gn~l~ 136 (631) .|-.-..++.. |.-+--.-..|.+|.+- ..+.+..-..|--|-... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~--------------------------------~~~~i~g~~~g~~g~~~s 48 (581) T protein:vir:10 1 MAIDFSQYQTPGVYTEAVGAPQLGIRSSVP--------------------------------TAVAIFGTAVGYQTYRES 48 (581) T ss_pred Ceeeeccccccchhhhhccccccceeeeec--------------------------------cccccccccccccccccc Confidence 11000000000 10000011233333110 001111111111111111 Q ss_pred hhhccCcccc-eeeccceeeeecccccceEEeeeeeee-ee--ccccccc---------ceeeeeecccccccceeEeec Q lcl|NC_013693. 137 INVCDAAGFP-TWEFRNNFAYAPQAGEYHIVIVDKVGR-IT--DSSGAVG---------QVDRISVSGTATGAGSISVAG 203 (631) Q Consensus 137 v~v~~~~~~~-~~~~~~~~~~~~~~g~~~~~~~~~~~~-v~--~~~~~~~---------~~~~~~~~~~~~~~~~~~~~~ 203 (631) .+.+...... ..+. .........|.+...+.+.+.. +. .+...+. ........+.. .....+.. 
T Consensus 49 ~~~~p~~~~~~e~q~-v~~~~~~t~GtFtLsf~G~tT~~I~~~asa~~v~~AL~~L~~i~~~~v~v~g~~--g~~~~VtF 125 (581) T protein:vir:10 49 IRINPDTGETITTQI-LALVGEPTGGSFKLSLAGEPTGNIPFNATQGQVQSALRALPNVEDDEVTVLGDP--GGPWTVTF 125 (581) T ss_pred cccCCCCCCccceEE-EEEEecCCCceEEEEeCceecccccccCCHHHHHHHHhccCCCCcceEEEECCC--CceEEEEE Confidence 1111100000 0000 0000000111111111111100 00 0000000 00000000000 00000000 Q ss_pred ccccccccc-ccc-----ccccccccccccccccccccc---cccccccccccc------ccccccccee-ec---cccc Q lcl|NC_013693. 204 EDVAYTDTD-TPA-----TLATKIGTALTALTDVYSSVV---VKSNTVTVTHKA------IGPQTVTAIV-PD---ANGL 264 (631) Q Consensus 204 ~~~~~~~~~-~~~-----~~~~~~~~~~~~~~~~~~~~~---~~~~~~~v~~~~------~~~~~~~~~~-~~---~~~~ 264 (631) ......... ... ..................... .......+..-. .+.+...... .. .... T Consensus 126 ~g~~~~l~~~~~~lt~g~~~~vtV~~~~~g~~~~~~~~s~~gi~~~~~~l~~~~~~~~~~~gsd~~~~~~~~~~~~~~~~ 205 (581) T protein:vir:10 126 TKAVAALTKDVTGLTGGDDPDLNIASEQTGVPAMNRALAKKGIKTDTIRVVNPNSGQVYVLGTDYVVTRVNAGEDGEANT 205 (581) T ss_pred cCCccceeeeeceecCCCceeEEEeccccCcccccccccccccccccccccccccCcceeccccceeeecccCccccccc Confidence 000000000 000 000000000000000000000 000000000000 0000000000 00 0000 Q ss_pred ccceeeeecccccccceeeeee--eee-ccc--------ccccchhhhhhhhhcccc----ceee-eecccccccccccc Q lcl|NC_013693. 265 TATAVTTTVGASGSIIEKYELM--QAT-QGS--------KKSDGSNAYFKDVINDTS----NWVY-TFATTLAAGVTELE 328 (631) Q Consensus 265 ~~~~~~~~v~~~~~~~~~~~~~--~~~-~~~--------~~~~~~~~~~~~~~~~~~----~~~~-~~~~~~~~~~~~l~ 328 (631) ..+.........+.+....... ... .+. .........+....+..+ .... +...........+. 
T Consensus 206 ~~D~~t~~~~~~g~~~~~~~v~~~~~~~~d~~~~~~v~~~~~~~~~~~~~~~~~~~g~~~~~~t~~~~~~~tn~~~~~l~ 285 (581) T protein:vir:10 206 RDDLYTIQRVVDGGHIDPGDIVQLSYRYTDPNYHEVIRFTDPDDIQDFYGPAFDEAGNVQSEITLCAQLAITNGASTILA 285 (581) T ss_pred cccceeeeeeecccccccceEEEEEEEeecCCcceeEEeecCcchhhhhhhhhhccCccccchhhhheeeeecccceeEE Confidence 0000000000011100000000 000 000 000000000000000000 0000 00001112223444 Q ss_pred cccccchh--hhhhHHHHHhhhhhcccceeE-EeccccchHHHHHHHHHhhc----cceEEeecccccccccccCCHHHH Q lcl|NC_013693. 329 GGVDDYTG--NRVAAIEALNNAEAYDAKPVF-AFCEELIEQQTLIDLSTERK----DTVSFVSPLRDVVVGNRGREMEDV 401 (631) Q Consensus 329 gg~d~~~~--~~~~~~~~l~~~~~~~~~~~i-~~~~~~~~~~~~~~~~~~~~----~~~a~~d~~~~~~~~~~~~~~~~~ 401 (631) +|.++..+ .......+|..++..+...++ +.....++|.++.+||+.+. ++.++++.+. .....+.+++ T Consensus 286 ~gvd~~g~tvt~~dy~~Al~ale~~~~~~ivv~~t~~~~v~a~l~ahv~~~s~~~~~~ravigV~g----~~~~~~~~~~ 361 (581) T protein:vir:10 286 CAVDPEGDTVTMGDYQNALNKFRDEDEIAIIVAGTGAQPIQALVQQHVSAQSNNKYERRAILGMDG----SVTPVPSATR 361 (581) T ss_pred eeccCCCCccchHHHHHHHHHHhcCCceEEEEeCCCCHHHHHHHHHHHHHHHhccCCcEEEEEecC----CCCCccHHHH Confidence 55554221 111223456666666554433 34445567888999998764 3456654321 1223344555 Q ss_pred HHHHHhcCCCcceEEEecCeeEEEeccC-CceeEeeh---HHHHHHHHHHhhccCCceecccceeeceeeecccceecCC Q lcl|NC_013693. 402 VAWRESLVRDSSYFFMDDNWAYVYDKYN-DKMRWIPA---CGGTAGVWARSIEIAGIYKSPAFHNRGKYNNYNRMAWSAS 477 (631) Q Consensus 402 ~~~~~~~~~~s~~~~~~~p~~~~~d~~~-~~~~~~p~---s~~~ag~~a~~d~~~g~~~span~~~~~i~g~~~~~~~~~ 477 (631) ++.... ++++|+.+++|+..+++... +....+|+ .+++||+++.. 
.+++||.|++ +.|+.++...++ T Consensus 362 ~~~a~~--~n~~Rvvlv~p~~~~~~g~~~~~~v~lp~y~~AA~vAGl~a~~----~~~~slT~~~---i~gi~~l~~~~s 432 (581) T protein:vir:10 362 IANAQS--IKDQRVALISPSSFVYYAPELNREVVLGGQFMAAAVAGKSVSA----IAAMPLTRKV---IRGFSGPAEVQR 432 (581) T ss_pred HHhhcc--CCCceEEEEecCceeecCcccCceeccchhhHHHHHHHHhhcc----ccccCccccc---ccccccccccCC Confidence 554444 46899999999998887654 34444555 34455555555 4888999998 556667888999 Q ss_pred hhHhhhhhhcCceEEEEEcCCcEEEEcc-eecCCCChhhceehhhHHHHHHHHHHHHHHH--HHhcCCCCHHHHHHHHHH Q lcl|NC_013693. 478 SDERAVLYRNQINSIVTFSNEGIVLYGD-KTGLTRPSAFDRINVRGLFIMAEQNIAAIAK--YYLGENNDEFTRSLFSNA 554 (631) Q Consensus 478 ~~~~~~L~~~gin~i~~~~~~G~~~wg~-rT~~~~~~~~~~i~vrR~~~~i~~~~~~~~~--~~v~epn~~~~~~~i~~~ 554 (631) +.|++.|+++|+++++..+++++++|.+ +|+ ..++.|++|++||++|++.+.+++.++ +|++|||++..|.+|+.. T Consensus 433 ~~e~e~ll~~Gv~~l~~~~~~~v~Iv~gItT~-~s~~~~~~i~~iR~~D~v~~~ir~~~~~~~fIG~~n~~~~r~~ik~~ 511 (581) T protein:vir:10 433 DGEKSRESSEGLMVIEKTPRNLVHVRHGVTTD-PTSLHTREWNIIGQQDVMVYRIRDYLDADGLIGMPIYDTTIVQVKAS 511 (581) T ss_pred HHHHHHHHhCCeEEEEEecCCeEEEEeeeecC-CCCCcceeeeeehhhhHHHHHHHHHhhhhcCCCcccCHHHHHHHHHH Confidence 9999999999999999989999987544 665 446789999999999999999999985 588999999999999999 Q ss_pred HHHHHHHHHhCCceeeeEEEEccCCCCHHHhhCCeEEEEEEEEecCCceEEEEEEEEEecCceeeeeeccCeee Q lcl|NC_013693. 555 VRPYIRQLANMGAIYDGQVKCDADNNTADIIAANQMVAGIWLKPEYSINWVYLDFAAVRPDMEFSEIETGGGIV 628 (631) Q Consensus 555 i~~~l~~l~~~gal~g~~v~~~~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~~~g~~~ 628 (631) +..||.+||+.|+|.||+.. ..++.+.+.+++++++.++|++|+|||.+|++++..+-+|+-..+|---. 
T Consensus 512 i~~~L~~l~~~g~I~~~~~~----~~~~~~~~~d~v~V~i~v~Pv~~i~~I~vti~~~p~~~~~~~~~~~~~~~ 581 (581) T protein:vir:10 512 AEAALVWLVDNNIIRGYRNL----KARQIERQPDVIEVRYEWRPAYPLNYIVVRYSIAPETGDITSTIEGTTSF 581 (581) T ss_pred HHHHHHHHHhcCcccCCccc----eeeeeecCCCEEEEEEEEEecccceEEEEEEEEecCCCceEEEEeccccC Confidence 99999999999999998632 23455678899999999999999999999999998888888777665433 No 48 >protein:vir:78986 Length: 436 # NCBI annotation: putative sheath tail protein # Family: family:all:632 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110731;genbank:gi:134287348;genbank:GeneID:4955160 Probab=100.00 E-value=7.5e-32 Score=191.22 Aligned_cols=411 Identities=18% Similarity=0.102 Sum_probs=261.3 Q ss_pred CCCcchh------cCCceEEEEecCCCc-eecccCCceEEEeeeccCcCCCCeEEecC--HHHHHHHcCCCCccchhHHH Q lcl|NC_013693. 1 MATQSFS------VAPSVQWTERDATLQ-TSPSVVVQGATVGKFQWGEAELPVLVTGG--ETGLVKKFFKPNDATATDFL 71 (631) Q Consensus 1 m~~~~~y------lsPGVyveEv~~~~~-~~gv~tsv~afvG~~~~Gp~~~p~~i~s~--~~e~~~~fG~~~~~~~~~~a 71 (631) |||.--+ .-||+|++-++.... +.+....+.++...+.|||+++++.|++. ..++.+.||..... +.... T Consensus 1 ~~magg~~~~~~K~~PG~Y~n~~~~~~~~~~~~~rGi~a~p~~~~wGp~~~v~~i~~~~~~~~~~~~~G~~~~~-~~~~~ 79 (436) T protein:vir:78 1 MALGGGTFVTQNKVLPGSYINFVSATRATSSLSDRGIVAMPLELDWGIDEEVFQVTSDDFEKYSTKYFGYDYTH-EKLKG 79 (436) T ss_pred CcccceeeccceeecCceEEEEEecCcceeeccCCeEEEEEEEecCCCCceeEEeecccchHHHHHHhcCccch-HHHHH Confidence 8887633 249999999976654 44888999999999999999999999763 35788899964222 22224 Q ss_pred HHHHHHhCCceEEEEEecccCCCcccccccchhhhccccccccccccceeeehhhhhhhhhhchhhhhccCcccceeecc Q lcl|NC_013693. 72 VIADFLSYSSVAWVTRVVGPAARNAVTKGQTAILIRNKLDFETASPSASITWTGRYAGSLGNDVAINVCDAAGFPTWEFR 151 (631) Q Consensus 72 v~~fF~ngG~~~~vvRv~~~~a~~a~~~~~~~~~~~~~~~~~~~~~~~~l~~~a~~~G~~gn~l~v~v~~~~~~~~~~~~ 151 (631) ++..| .|++.+|..|+.+.....+ ...+++++|..||.++|.+.+...+.+-... 
T Consensus 80 l~~~~-~~~~tv~~yrl~~G~~a~~------------------------~v~~Aky~g~~gn~i~v~v~~~~~d~~~~dv 134 (436) T protein:vir:78 80 LRDLF-KNIRLGYFYKLNKGVKASC------------------------SIATARCSGIRGNDLKVIVTTNIDDNAKFDV 134 (436) T ss_pred HHHHh-cCCCEEEEEECCCcceeee------------------------eeeeeecCCCCCcEEEEEecccccccCceEE Confidence 55554 6778999999965322111 1257899999999999887653222110000 Q ss_pred ceeeeecccccceEEeeeeeeeeecccccccceeeeeecccccccceeEeeccccccccccccccccccccccccccccc Q lcl|NC_013693. 152 NNFAYAPQAGEYHIVIVDKVGRITDSSGAVGQVDRISVSGTATGAGSISVAGEDVAYTDTDTPATLATKIGTALTALTDV 231 (631) Q Consensus 152 ~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 231 (631) . . ..+. ..+ . T Consensus 135 ~-------------~----------------------~~g~------~~~----------d------------------- 144 (436) T protein:vir:78 135 V-------------T----------------------LLDN------KKV----------D------------------- 144 (436) T ss_pred E-------------E----------------------Eecc------hhh----------h------------------- Confidence 0 0 0000 000 0 Q ss_pred ccccccccccccccccccccccccceeecccccccceeeeecccccccceeeeeeeeecccccccchhhhhhhhhccccc Q lcl|NC_013693. 232 YSSVVVKSNTVTVTHKAIGPQTVTAIVPDANGLTATAVTTTVGASGSIIEKYELMQATQGSKKSDGSNAYFKDVINDTSN 311 (631) Q Consensus 232 ~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 311 (631) . .... .. +.+. +. ..+.... . T Consensus 145 -~-~~~~----~~----------------------~~l~-----~n------~~V~~~~-----~--------------- 165 (436) T protein:vir:78 145 -T-QIAK----VI----------------------TELQ-----DN------DYVTWKK-----E--------------- 165 (436) T ss_pred -h-hhHH----HH----------------------hhcc-----CC------ceEEEEe-----c--------------- Confidence 0 0000 00 0000 00 0011100 0 Q ss_pred eeeeecccccccccccccccccchhhhhhHHHHHhhhhhcccceeEEeccccchHHHHHHHHHhhccc-----eEEeecc Q lcl|NC_013693. 
312 WVYTFATTLAAGVTELEGGVDDYTGNRVAAIEALNNAEAYDAKPVFAFCEELIEQQTLIDLSTERKDT-----VSFVSPL 386 (631) Q Consensus 312 ~~~~~~~~~~~~~~~l~gg~d~~~~~~~~~~~~l~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~-----~a~~d~~ 386 (631) ..........+.||.++.+.........|..++......+.+...+.+.+..+.+++.++++. -+++.. T Consensus 166 -----g~la~~a~~~LtGG~dG~~~T~~dy~~al~~le~~~fn~l~~~~~d~~~~~~~~a~ikr~re~~g~~~~aV~~~- 239 (436) T protein:vir:78 166 -----ATLEATAGLTFTNGTNGEAVTGTEYQAFLDKIESYSFNALGCLATTAEIKSLFVEFTKRMRDKVGAKFQTVLYK- 239 (436) T ss_pred -----ccccccceeeeeccccccccchHHHHHHHHHHcccceeEEEecCCChHHHHHHHHHHHHHHhhcCCeEEEEecC- Confidence 011112234578888865443344455666667666543333333566788899999887642 122210 Q ss_pred cccccccccCCHHHHHHHHHhcCCCcceEEEecCeeEEEeccCCce-eEeehHHHHHHHHHHhhccCCceecccceeece Q lcl|NC_013693. 387 RDVVVGNRGREMEDVVAWRESLVRDSSYFFMDDNWAYVYDKYNDKM-RWIPACGGTAGVWARSIEIAGIYKSPAFHNRGK 465 (631) Q Consensus 387 ~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~-~~~p~s~~~ag~~a~~d~~~g~~~span~~~~~ 465 (631) .... +++....+... +.+.. ...-..+.+||++|..+ +.+|+.|+. T Consensus 240 ------~~~~--------------d~EgIInv~n~------v~g~~~~~~~~~a~vAG~~Ag~~----~~~S~T~~~--- 286 (436) T protein:vir:78 240 ------KNDA--------------DYEGVVSVENK------IKDTGLLESSLIYWTTGAIAGCD----INKSNTNKR--- 286 (436) T ss_pred ------CCCC--------------CCceEEEeecc------cCCceechhHHHHHHHHHHhcCc----cccCcccee--- Confidence 0011 22222222221 11211 11124578899999877 556999987 Q ss_pred eeecccceecCChhHhhhhhhcCceEEEEEcCCcEEEEcc-eec----CCCChhhceehhhHHHHHHHHHHHHHHH-HHh Q lcl|NC_013693. 466 YNNYNRMAWSASSDERAVLYRNQINSIVTFSNEGIVLYGD-KTG----LTRPSAFDRINVRGLFIMAEQNIAAIAK-YYL 539 (631) Q Consensus 466 i~g~~~~~~~~~~~~~~~L~~~gin~i~~~~~~G~~~wg~-rT~----~~~~~~~~~i~vrR~~~~i~~~~~~~~~-~~v 539 (631) +.++.++...++++|.+.+.++|.-++.+. ++++++--. .|+ ...+..|+.|.++|++|+|.+.+++.+. 
.|+ T Consensus 287 ~~~~~~v~~~~t~~e~~~ai~~G~lvl~~d-~~~v~I~~~VNTltt~~~~k~~~~~kI~vir~~D~i~~di~~~~~~~yi 365 (436) T protein:vir:78 287 YDGEFDVDVNYTQIHLEEALKTGKFIFHKV-GDEVHVLEDINTFVSFTDEKNDDFSSNQSVRVLDQIANDIATLFNTKYL 365 (436) T ss_pred cCccccccccCCHHHHHHHHhCCeEEEEEe-CCeEEEEEccccceecCCCCCcchhhhhHHHHHHHHHHHHHHHhhhccc Confidence 555667778899999999999999888754 566666544 343 2346799999999999999999999875 699 Q ss_pred cC-CCCHHHHHHHHHHHHHHHHHHHhCCceeeeE---EEEccCCCCHHHhhCCeEEEEEEEEecCCceEEEEEEEEE Q lcl|NC_013693. 540 GE-NNDEFTRSLFSNAVRPYIRQLANMGAIYDGQ---VKCDADNNTADIIAANQMVAGIWLKPEYSINWVYLDFAAV 612 (631) Q Consensus 540 ~e-pn~~~~~~~i~~~i~~~l~~l~~~gal~g~~---v~~~~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~ 612 (631) ++ ||+..-|..++..++.||++|.+.|+|..|. +.+.+. + ....+++++.++|+..+|+|.+++... T Consensus 366 GKv~N~~dgr~~l~~~i~~yl~~L~~~g~I~~f~~~Dv~v~~~-~-----~~~~v~v~~~v~pvdamekiy~ti~v~ 436 (436) T protein:vir:78 366 GEVPNDKSGRISFWNDVVKHHEQLQNMRAIEDFKADDVSVEPG-S-----DKKTVVVSDAVKVISAMSKLYMTVSVS 436 (436) T ss_pred cccCCCHHHHHHHHHHHHHHHHHHHhCCcccCCCCcceEEeec-C-----CCCEEEEEEEEEEEEeeeeEEEEEEEC Confidence 95 5899999999999999999999999999886 444322 1 356788999999999999999998754 No 49 >protein:vir:102359 Length: 356 # NCBI annotation: XkdK protein # Family: family:all:632 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529565;genbank:gi:90592650;genbank:GeneID:3974491 Probab=99.37 E-value=1.5e-12 Score=85.37 Aligned_cols=321 Identities=12% Similarity=0.055 Sum_probs=169.4 Q ss_pred cccccccccccccccccccccc-----cccccccccceeecccccccceeeeecccccccceeeeeeeeecccccccchh Q lcl|NC_013693. 225 LTALTDVYSSVVVKSNTVTVTH-----KAIGPQTVTAIVPDANGLTATAVTTTVGASGSIIEKYELMQATQGSKKSDGSN 299 (631) Q Consensus 225 ~~~~~~~~~~~~~~~~~~~v~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~ 299 (631) +. +.+...+.+.. ...+.+.. +.+...........+........ ....... 
T Consensus 1 ~~---------glp~i~i~f~~~a~ta~~~g~rGi--------------v~~il~d~~~~~~~~~~~~~v~~-~~~~~n~ 56 (356) T protein:vir:10 1 MA---------GLVNINIEFKELATSFIQRSKAGI--------------VAIILKDTTKMYKELTSEDDIPI-SLSADNK 56 (356) T ss_pred CC---------CCCceeEEEeecceeeccCCccce--------------EEEEEecCCcceeEEeccccchh-HHHHHHH Confidence 00 00111111111 01111111 01111000000111111110000 0000111 Q ss_pred hhhhhhhccccceeeeecccccccccccccccccchhhhhhHHHHHhhhhhcccceeEEeccccchHHHHHHHHHhhccc Q lcl|NC_013693. 300 AYFKDVINDTSNWVYTFATTLAAGVTELEGGVDDYTGNRVAAIEALNNAEAYDAKPVFAFCEELIEQQTLIDLSTERKDT 379 (631) Q Consensus 300 ~~~~~~~~~~~~~~~~~~~~~~~~~~~l~gg~d~~~~~~~~~~~~l~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~ 379 (631) .++...+..+.... .. ........+..+...++. ..|..++.+....+.+...+.+.+..+.+++.++++. T Consensus 57 ~~i~~~~~g~~~~~---~~---~~p~~~~~~~~~t~~~y~---~aL~~le~~~fn~l~~~~~d~~~~~~~~a~ikr~r~~ 127 (356) T protein:vir:10 57 KYIKYGFVGATDNE---KV---LRPSKVIISTFTEDGKVE---DILEELESVEFNYLCMPEAIEAEKTKIVTWIKKIREE 127 (356) T ss_pred HHHHHHhhcccccc---cc---ccceeeeeecccCchhHH---HHHHHhcCccceEEEecCCChHHHHHHHHHHHHHHhc Confidence 12222222110000 00 000011111111123333 4455555554443333223456778888999887742 Q ss_pred ----eEEeecccccccccccCCHHHHHHHHHhcCCCcceEEEecCeeEEEeccCCceeEeehHHHHHHHHHHhhccCCce Q lcl|NC_013693. 380 ----VSFVSPLRDVVVGNRGREMEDVVAWRESLVRDSSYFFMDDNWAYVYDKYNDKMRWIPACGGTAGVWARSIEIAGIY 455 (631) Q Consensus 380 ----~a~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~ag~~a~~d~~~g~~ 455 (631) +..+-. .... +++...-+-..+. ++- ......-..+++||++|..+ .. T Consensus 128 ~~~~~~~V~~-------~~~a--------------D~EgIInv~n~~~-~~g--~~~t~~~~~~~vAG~~Ag~~----~n 179 (356) T protein:vir:10 128 ESTEAKAVLA-------NIKA--------------DNEAIINFTENVV-VDG--EEITAEKYTTRVASLIASTP----NT 179 (356) T ss_pred CCcEEEEEec-------CCCC--------------CCceeEEeecCeE-ecc--eeechhHHHHHHHHHHhccc----hh Confidence 222210 1111 2222222222211 110 01111223578999999887 45 Q ss_pred ecccceeeceeeecccceecCChhHhhhhhhcCceEEEEEcCCcEEEE-cceecC----CCChhhceehhhHHHHHHHHH Q lcl|NC_013693. 
456 KSPAFHNRGKYNNYNRMAWSASSDERAVLYRNQINSIVTFSNEGIVLY-GDKTGL----TRPSAFDRINVRGLFIMAEQN 530 (631) Q Consensus 456 ~span~~~~~i~g~~~~~~~~~~~~~~~L~~~gin~i~~~~~~G~~~w-g~rT~~----~~~~~~~~i~vrR~~~~i~~~ 530 (631) +|+.|+.+. ++... .+++++|.+.+..+|--++.+. ++.+++- |-.|+. ..+..|+.|.+.|++|.|.+. T Consensus 180 ~S~T~~~~~---~~~~~-~~~t~~e~~~ai~~G~lvl~~d-~~~V~I~~~VNSltt~t~~k~~~f~Kirvvr~~D~i~~D 254 (356) T protein:vir:10 180 QSITYAPLD---EVESI-VKIDKASADAKVQAGELILRRL-SGKIRIARGINSLTTLTAEKGEIFQKIKLVDTKDLISKD 254 (356) T ss_pred ccccceecC---Ccccc-ccCCHHHHHHHHhCCeEEEEEE-cCeEEEEecCccceecCCCCCcchhhhHHHHHHHHHHHH Confidence 589998744 33332 4688999999999999988765 4445554 444442 345789999999999999999 Q ss_pred HHHHHH-HHhcCC-CCHHHHHHHHHHHHHHHHHHHhCCcee-eeEEEEccCCC--------------CHHHhh----CCe Q lcl|NC_013693. 531 IAAIAK-YYLGEN-NDEFTRSLFSNAVRPYIRQLANMGAIY-DGQVKCDADNN--------------TADIIA----ANQ 589 (631) Q Consensus 531 ~~~~~~-~~v~ep-n~~~~~~~i~~~i~~~l~~l~~~gal~-g~~v~~~~~~n--------------t~~~i~----~G~ 589 (631) +++... .|+++- |+..-|..++..++.||.+|.+.|+|. +|.+.+|.+.. +...+. .-. T Consensus 255 i~~~f~~~yiGKv~N~~dgr~~l~~ai~~y~~~L~~~~~I~~~~~~eid~e~q~~~~~~~g~d~~~~~d~~v~~~~~~~~ 334 (356) T protein:vir:10 255 IKNIYVEKYLRKCPNTYDNKCLFIVAVQSYLTELAKQELIDSNFTVEIDLEKQKEYLEGKKIAVSKMKENEIKEANTGSN 334 (356) T ss_pred HHHHHhhccccccCCCHHHHHHHHHHHHHHHHHHHhCCccccCceeEecccchHHHhhhccccccccccceeecccCCcE Confidence 999986 699987 999999999999999999999999995 67777775431 111111 245 Q ss_pred EEEEEEEEecCCceEEEEEEEE Q lcl|NC_013693. 590 MVAGIWLKPEYSINWVYLDFAA 611 (631) Q Consensus 590 ~~~~i~~~p~~p~e~i~~~~~~ 611 (631) +++.+.+.|+-.+|.|.+++.. 
T Consensus 335 v~~~~~v~~vdamE~iy~ti~v 356 (356) T protein:vir:10 335 GFYLINLKLVDAMEDINIRVQM 356 (356) T ss_pred EEEEEEEEEEeeeeeEEeEEeC Confidence 7899999999999999999986 No 50 >protein:vir:489 Length: 498 # NCBI annotation: putative sheath protein # Family: family:all:369 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543096;swissprot:trembl:q8w623;genbank:gi:18249908;uniprot:Q8W623;genbank:GeneID:929698 Probab=99.19 E-value=2.5e-10 Score=73.24 Aligned_cols=453 Identities=10% Similarity=0.041 Sum_probs=218.8 Q ss_pred CCC-----cchhcCCceEEEEecCCCceecccCCceEEEeeecc---CcCCCCeEEecCHHHHHHHcCCCCccchhHHHH Q lcl|NC_013693. 1 MAT-----QSFSVAPSVQWTERDATLQTSPSVVVQGATVGKFQW---GEAELPVLVTGGETGLVKKFFKPNDATATDFLV 72 (631) Q Consensus 1 m~~-----~~~ylsPGVyveEv~~~~~~~gv~tsv~afvG~~~~---Gp~~~p~~i~s~~~e~~~~fG~~~~~~~~~~av 72 (631) |.. ++.++-||+|+|--.+... .+..+--.-+||..-- .|.++|++|.| ..|-...||. .+.+..++ T Consensus 1 M~IsF~~IP~~iRvP~~y~E~dns~A~-~~~~~qrvLiiGq~la~gt~~~~~~v~v~s-~~~a~~~fG~---GS~l~~M~ 75 (498) T protein:vir:48 1 MTISFSAVPSDTLVPLFYAEMDNSAAN-TAVTSAPALLIGHASNDAAIEVNSLVLMPS-ADYARQICGA---GSQLARMV 75 (498) T ss_pred CCccccccCcccccceEEEEEecCCCc-cccCCcceEEEeecCccccccccceEEecC-HHHHHHhcCc---ccHHHHHH Confidence 554 4678899999995444332 2333356778887653 47899999966 5799999995 78888889 Q ss_pred HHHHHhCC-ceEEEEEecccCCCcccccccchhhhccccccccccccceeeehhhhhhhhhhchhhhhccCcccceeecc Q lcl|NC_013693. 73 IADFLSYS-SVAWVTRVVGPAARNAVTKGQTAILIRNKLDFETASPSASITWTGRYAGSLGNDVAINVCDAAGFPTWEFR 151 (631) Q Consensus 73 ~~fF~ngG-~~~~vvRv~~~~a~~a~~~~~~~~~~~~~~~~~~~~~~~~l~~~a~~~G~~gn~l~v~v~~~~~~~~~~~~ 151 (631) +.|..+.- .++|++-+.|... . .+++. ..+..++...|.. 
T Consensus 76 ~a~~~~n~~~~l~~i~~~D~ag-~-aA~g~-----------------it~tg~at~~G~l-------------------- 116 (498) T protein:vir:48 76 DVYRQTDPFGELYVIAVPEARG-A-AATVR-----------------VTVTGEAEESGTL-------------------- 116 (498) T ss_pred HHHHHhCCCceeEEEeeCCccc-c-eeEEE-----------------EEecccccCCceE-------------------- Confidence 99888765 7999998865311 1 11110 0111111111110 Q ss_pred ceeeeecccccceEEeeeeeeeeecccccccceeeeeecccccccceeEeeccccccccccccccccccccccccccccc Q lcl|NC_013693. 152 NNFAYAPQAGEYHIVIVDKVGRITDSSGAVGQVDRISVSGTATGAGSISVAGEDVAYTDTDTPATLATKIGTALTALTDV 231 (631) Q Consensus 152 ~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 231 (631) ..........+ .+..+.+. ..........+.+..+. T Consensus 117 ------------~l~Igg~~v~v-------------~V~~gdTa-------------------a~vA~al~aai~a~~~l 152 (498) T protein:vir:48 117 ------------SLYVGRSSVQV-------------PVVNGDDA-------------------TAVATAIKEAVNGVITL 152 (498) T ss_pred ------------EEEECCEEEEE-------------eecCCCCH-------------------HHHHHHHHHHHhCCCCc Confidence 00000000000 00000000 00000000000000000 Q ss_pred ccccccccccccccccccccccccceeecccccccceeeeecccccccceeeeeeeeecccccccchhhhhhhhhccccc Q lcl|NC_013693. 232 YSSVVVKSNTVTVTHKAIGPQTVTAIVPDANGLTATAVTTTVGASGSIIEKYELMQATQGSKKSDGSNAYFKDVINDTSN 311 (631) Q Consensus 232 ~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 311 (631) -....+....+.++.+..+. ..+...+.+. +.... ++. T Consensus 153 PVTA~~~~~~VtlTAr~kG~-------------~GN~I~l~~~----------~~~~~-------------------~ge 190 (498) T protein:vir:48 153 PFAASSDAGVVTLTARHKGL-------------YGNELPVCLN----------YYGSG-------------------GGE 190 (498) T ss_pred ceEEEecCcEEEEEeeeccc-------------ccccceeeee----------eccCc-------------------ccc Confidence 00000000111111111111 1110000000 00000 000 Q ss_pred eeeeecccccccccccccccccchhhhhhHHHHHhhhhhcccceeEEeccccchHHHHHHHHHhhccceEEeeccccccc Q lcl|NC_013693. 
312 WVYTFATTLAAGVTELEGGVDDYTGNRVAAIEALNNAEAYDAKPVFAFCEELIEQQTLIDLSTERKDTVSFVSPLRDVVV 391 (631) Q Consensus 312 ~~~~~~~~~~~~~~~l~gg~d~~~~~~~~~~~~l~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~a~~d~~~~~~~ 391 (631) ..........+...||.. .++...+.+.+ .......+++...+.+...++.+|++....|+..+........ T Consensus 191 ---~~p~Glt~~itamsgGag--~PDia~aLaal---~~~~~~~I~~p~~D~asl~al~~~L~~~sgRw~~~~q~~g~~~ 262 (498) T protein:vir:48 191 ---ILPAGLQVVTEAGTAGSG--APDLTAAVAAM---GDEAFDFIGLPFNDAASINMMMTEMNDSSGRWSYARQLYGHVY 262 (498) T ss_pred ---cccceeeEEEEcccCCcc--CcchHHHHHhh---ccCCccEEEEeecCHHHHHHHHHHHhhhhhhhhHHhhcCeEEE Confidence 000011122344556642 23444444433 3333444555455666677777777542111111111111111 Q ss_pred ccccCCHHHHHHHHHhcCCCcceEEEecCeeEEEeccCCceeEeehH---HHHHHHHH---HhhccCCceecccc-eeec Q lcl|NC_013693. 392 GNRGREMEDVVAWRESLVRDSSYFFMDDNWAYVYDKYNDKMRWIPAC---GGTAGVWA---RSIEIAGIYKSPAF-HNRG 464 (631) Q Consensus 392 ~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s---~~~ag~~a---~~d~~~g~~~span-~~~~ 464 (631) .....+..+..++-... ++.+..+.+.. +. ...|+. +.+|++.| +.|..| |-| .. T Consensus 263 ~a~~gT~~~l~t~g~~~--N~~~it~~~~~--------~~-~~~p~~~~AAa~a~~aA~~l~~DPAr-----PLqtl~-- 324 (498) T protein:vir:48 263 TAKLGTLSELVNAGDMH--NQQHITLAGYE--------KE-TQSPVDELVASRLAREAVFIRNDPAR-----PTQTGE-- 324 (498) T ss_pred EeccCCHHHHHHhhhcc--CCceEEEEecC--------CC-CCChHHHHHHHHHHHHHHhhhccccc-----ccccee-- Confidence 22345788888887765 46666544311 11 112332 23333333 445433 222 23 Q ss_pred eeeecc--cceecCChhHhhhhhhcCceEEEEEcCCcEEEEcceec-----C-CCChhhceehhhHHHHHHHHHHHHHHH Q lcl|NC_013693. 465 KYNNYN--RMAWSASSDERAVLYRNQINSIVTFSNEGIVLYGDKTG-----L-TRPSAFDRINVRGLFIMAEQNIAAIAK 536 (631) Q Consensus 465 ~i~g~~--~~~~~~~~~~~~~L~~~gin~i~~~~~~G~~~wg~rT~-----~-~~~~~~~~i~vrR~~~~i~~~~~~~~~ 536 (631) +.|+. .+.-+++..|+|.|.-+||.++.. .+.-..+--..|. . ..|.-|..|+..|+.+|+++.++..+. 
T Consensus 325 -L~Gi~~p~~~~r~~~~ern~LL~~Gist~~V-~~G~V~I~R~ITTY~~n~~G~~D~syLDi~T~~tl~yvr~~~r~~i~ 402 (498) T protein:vir:48 325 -LVGMLPAPKGKRFIMTEQQTLLSHGVATAYV-EGGTLRIQRSVTTYKKNAYGVADNSYLDSETLHTSAYVLRKLKSVIT 402 (498) T ss_pred -eeccccCCchhcCChHHHHHHHhcCcceEEE-cCCeEEEEeeeeeeeecCCCCcchhhhhhhhHHHHHHHHHHHHHHhh Confidence 45554 446678999999999999999976 5545556565554 1 246789999999999999999998775 Q ss_pred H-HhcCCCCHH-----------HHHHHHHHHHHHHHHHHhCCceeee---E--EEEccCCCCHHHhhCCeEEEEEEEEec Q lcl|NC_013693. 537 Y-YLGENNDEF-----------TRSLFSNAVRPYIRQLANMGAIYDG---Q--VKCDADNNTADIIAANQMVAGIWLKPE 599 (631) Q Consensus 537 ~-~v~epn~~~-----------~~~~i~~~i~~~l~~l~~~gal~g~---~--v~~~~~~nt~~~i~~G~~~~~i~~~p~ 599 (631) . |--+..-++ +-..||..+-+.+++|..+|-+..+ + +.|.++-+. ..|+.+.+-...+ T Consensus 403 ~kfpR~KLa~dg~~~~~gq~IvTp~~ir~eli~~y~~le~~given~~~~~~~LiVerd~~d-----pnRln~~~p~d~v 477 (498) T protein:vir:48 403 SKYGRHKLANDGTRFGPGQAIVTPAVIKGELLATYRQMERAGIVENYDLFKQYLIVERDADN-----PNRLNTLFPPDYV 477 (498) T ss_pred hhcCCceecccCcccCCCCcccchHHHHHHHHHHHHhhhhhccccChhhhcceeEEEECCCC-----CcEEEEEeccccc Confidence 3 222222222 6678899999999999999988763 2 334332222 2456565555555 Q ss_pred CCceE----EEEEEEEEecCc Q lcl|NC_013693. 600 YSINW----VYLDFAAVRPDM 616 (631) Q Consensus 600 ~p~e~----i~~~~~~~~~~~ 616 (631) .+++- |.|+++.....+ T Consensus 478 n~L~V~A~~~~f~lq~~~~~~ 498 (498) T protein:vir:48 478 NQLRVFAVVNQFRLQYSEESA 498 (498) T ss_pred CchhhhhhhhhhhhhhhhcCC Confidence 55433 334444333333 No 51 >protein:vir:4517 Length: 498 # NCBI annotation: tail sheath protein # Family: family:all:369 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599043;genbank:gi:19549001;genbank:GeneID:935236 Probab=99.11 E-value=1.2e-09 Score=69.45 Aligned_cols=447 Identities=11% Similarity=0.040 Sum_probs=216.3 Q ss_pred CCC-----cchhcCCceEEEEecCCCceecccCCceEEEeeecc---CcCCCCeEEecCHHHHHHHcCCCCccchhHHHH Q lcl|NC_013693. 
1 MAT-----QSFSVAPSVQWTERDATLQTSPSVVVQGATVGKFQW---GEAELPVLVTGGETGLVKKFFKPNDATATDFLV 72 (631) Q Consensus 1 m~~-----~~~ylsPGVyveEv~~~~~~~gv~tsv~afvG~~~~---Gp~~~p~~i~s~~~e~~~~fG~~~~~~~~~~av 72 (631) |.. ++..+-||+|+|--.+.. ..+...--.-+||..-- .+.++|++|+| ..|-...||. .+.+..++ T Consensus 1 M~IsF~~IP~~iRvP~~y~E~dns~A-~~~~~~q~vLiiGq~la~gs~~~~~~v~v~s-~~~a~~lfG~---GSml~~M~ 75 (498) T protein:vir:45 1 MTISFNTIPSNTLVPLFYAEMDNQAA-NTAQDSGASLLIGHANNGAEIVANSLVLMPS-ADYARQICGA---GSQLARMV 75 (498) T ss_pred CCCchhhcCcccccCeEEEEEeCCCC-CCCCCCcceEEEEecCCccccccceeEEecC-HHHHHHhcCc---CcHHHHHH Confidence 655 466788999999444444 22334456778887643 47899999966 5799999996 78888899 Q ss_pred HHHHHhCC-ceEEEEEecccCCCcccccccchhhhccccccccccccceeeehhhhhhhhhhchhhhhccCcccceeecc Q lcl|NC_013693. 73 IADFLSYS-SVAWVTRVVGPAARNAVTKGQTAILIRNKLDFETASPSASITWTGRYAGSLGNDVAINVCDAAGFPTWEFR 151 (631) Q Consensus 73 ~~fF~ngG-~~~~vvRv~~~~a~~a~~~~~~~~~~~~~~~~~~~~~~~~l~~~a~~~G~~gn~l~v~v~~~~~~~~~~~~ 151 (631) +.|..+.- .++|++-+.+... . .+++ ...+..++...|.. T Consensus 76 ~a~~~~n~~~~l~~i~~~d~aG-~-aA~g-----------------~it~tg~at~~G~l-------------------- 116 (498) T protein:vir:45 76 EAYRQTDPFGELYVIAVPEATG-A-AATV-----------------TLTVTGEATESGTV-------------------- 116 (498) T ss_pred HHHHHhCCcceEEEEeeCCccc-c-eeEE-----------------EEEeecccCCCcEE-------------------- Confidence 99988764 6999998854211 1 1100 00111111111110 Q ss_pred ceeeeecccccceEEeeeeeeeeecccccccceeeeeecccccccceeEeeccccccccccccccccccccccccccccc Q lcl|NC_013693. 152 NNFAYAPQAGEYHIVIVDKVGRITDSSGAVGQVDRISVSGTATGAGSISVAGEDVAYTDTDTPATLATKIGTALTALTDV 231 (631) Q Consensus 152 ~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 231 (631) ..........+ .+..+.+. ..........+.+..+. 
T Consensus 117 ------------~l~Igg~~v~v-------------~V~~gdTa-------------------a~vA~al~aaina~~~l 152 (498) T protein:vir:45 117 ------------NVYVGRTRVQA-------------PVTNGDNV-------------------TTIASSIQDAINAVPTL 152 (498) T ss_pred ------------EEEECCEEEEE-------------EecCCCCH-------------------HHHHHHHHHHHhCCCCC Confidence 00000000000 00000000 00000000000000000 Q ss_pred ccccccccccccccccccccccccceeecccccccceeeeecccccccceeeeeeeeecccccccchhhhhhhhhccccc Q lcl|NC_013693. 232 YSSVVVKSNTVTVTHKAIGPQTVTAIVPDANGLTATAVTTTVGASGSIIEKYELMQATQGSKKSDGSNAYFKDVINDTSN 311 (631) Q Consensus 232 ~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 311 (631) -....+....+.++.++.+. ..+...+.++. ... .++. T Consensus 153 PVTA~~~~~~VtlTAr~kG~-------------~GN~I~l~~~~----------~~~-------------------~~ge 190 (498) T protein:vir:45 153 PFTASSSAGVVTLTARHKGL-------------CGNEIPVSLNY----------YGF-------------------GGGE 190 (498) T ss_pred ceEEEecCceEEEEeeccCc-------------cccceeEEEee----------ccc-------------------cccc Confidence 00000000111121111111 11111110000 000 0000 Q ss_pred eeeeecccccccccccccccccchhhhhhHHHHHhhhhhcccceeEEeccccchHHHHHHHHHh-------hccceEEee Q lcl|NC_013693. 312 WVYTFATTLAAGVTELEGGVDDYTGNRVAAIEALNNAEAYDAKPVFAFCEELIEQQTLIDLSTE-------RKDTVSFVS 384 (631) Q Consensus 312 ~~~~~~~~~~~~~~~l~gg~d~~~~~~~~~~~~l~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~-------~~~~~a~~d 384 (631) ..........+...||.. .++...+.+.+ .......+++...+.+...++.+|++. ++.+.++. T Consensus 191 ---~~p~Glt~~itamagGag--~PD~a~alaal---~~~~~~~I~~p~~D~asL~al~~~L~~~sgRw~~~~q~~g~~- 261 (498) T protein:vir:45 191 ---VLPAGVQIAVATGTAGTG--APVLTGAVAAM---ADEPFDYIGLPFNDTASVNTLVTEMNDTSGRWSYARQLYGHV- 261 (498) T ss_pred ---cccceeeEEEEccCCCcc--CchhHHHHHHh---ccCCccEEEEeeCCHHHHHHHHHHHhhhhhhhhHHhhcCeEE- Confidence 000001122334555542 23444444443 333334445544566666777777754 22222222 Q ss_pred cccccccccccCCHHHHHHHHHhcCCCcceEEEecCeeEEEeccCCceeEeehH---HHHHHHHH---HhhccCCceecc Q lcl|NC_013693. 
385 PLRDVVVGNRGREMEDVVAWRESLVRDSSYFFMDDNWAYVYDKYNDKMRWIPAC---GGTAGVWA---RSIEIAGIYKSP 458 (631) Q Consensus 385 ~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s---~~~ag~~a---~~d~~~g~~~sp 458 (631) ......+.++..++-...| +.+..+.+.. + + ..-|+- +.+|+..| +.|..| .- T Consensus 262 ------~~a~~gT~~~l~t~g~~~N--~~~it~~~~~-----~--~--~~sp~~~~AAa~aa~~A~~l~~DPAr----PL 320 (498) T protein:vir:45 262 ------YTAKTGTLSELVNAGDQFN--QQHITLAGYE-----K--E--TQTPADELAASRTARAAVFIRNDPAR----PT 320 (498) T ss_pred ------EEeccCCHHHHHHhhhccC--CceEEEEecC-----C--C--CCChHHHHHHHHHHHHHHHhhccccc----cc Confidence 1223457888888877654 6666554321 0 0 112332 33333443 344333 11 Q ss_pred cceeeceeeecc--cceecCChhHhhhhhhcCceEEEEEcCCcEEEEcceec-----C-CCChhhceehhhHHHHHHHHH Q lcl|NC_013693. 459 AFHNRGKYNNYN--RMAWSASSDERAVLYRNQINSIVTFSNEGIVLYGDKTG-----L-TRPSAFDRINVRGLFIMAEQN 530 (631) Q Consensus 459 an~~~~~i~g~~--~~~~~~~~~~~~~L~~~gin~i~~~~~~G~~~wg~rT~-----~-~~~~~~~~i~vrR~~~~i~~~ 530 (631) -..+ +.|+. .+.-+++..|+|.|.-+||.++..-.| -..+--..|. . ..|..|..|+..|+.+|+++. T Consensus 321 ~tl~---L~Gi~~p~~~~r~~~~ern~LL~~Gist~~V~~G-~V~I~R~ITTY~~n~~G~~D~syLDi~T~~tl~yvr~~ 396 (498) T protein:vir:45 321 QTGE---LVGMLPAPKGKRFTMTEQQTLLSHGVATAYVESG-VLRIQRDVTTYRKNAYGVADNSYLDSETLHTSAYVLRK 396 (498) T ss_pred Ccee---ecceecCCchhcCChHHHHHHHhCCcceEEEcCC-eEEEEeeeeeeeecCCCCcchhhhhhhhHHHHHHHHHH Confidence 2233 44544 446778999999999999999976433 2444444443 1 246789999999999999999 Q ss_pred HHHHHHHH-hcCCCCHH-----------HHHHHHHHHHHHHHHHHhCCceeee---E--EEEccCCCCHHHhhCCeEEEE Q lcl|NC_013693. 531 IAAIAKYY-LGENNDEF-----------TRSLFSNAVRPYIRQLANMGAIYDG---Q--VKCDADNNTADIIAANQMVAG 593 (631) Q Consensus 531 ~~~~~~~~-v~epn~~~-----------~~~~i~~~i~~~l~~l~~~gal~g~---~--v~~~~~~nt~~~i~~G~~~~~ 593 (631) ++..+... --+.+-++ +-..||..+-+.+++|..+|-+..+ + +.|.++-+. ..|+.+. 
T Consensus 397 ~r~~i~~kfpR~KLa~dg~~~~~gq~IvTp~~ir~ell~~y~~le~~givEn~~~~~~~LiVerd~~d-----pnRln~~ 471 (498) T protein:vir:45 397 LKSVITSKYGRHKLASDGTRFGPGQAIVTPAVIKGELLATYRQLERAGIVENYELFKQYLVVERDASV-----PNRLNTL 471 (498) T ss_pred HHHHhhhhcCCeeecccCcccCCCCcccchHHHHHHHHHHHHhhhhhccccChhhhcceeEEEECCCC-----CcEEEEE Confidence 99877533 22222222 5678899999999999999988763 2 334332222 2456565 Q ss_pred EEEEecCCceE----EEEEEEEEecCc Q lcl|NC_013693. 594 IWLKPEYSINW----VYLDFAAVRPDM 616 (631) Q Consensus 594 i~~~p~~p~e~----i~~~~~~~~~~~ 616 (631) +-...+.+++- |.|+++.....+ T Consensus 472 ~p~d~vn~L~V~A~~~~f~lq~~~~~~ 498 (498) T protein:vir:45 472 FPPDYVNQLRVFAVVNQFRLQYSEESA 498 (498) T ss_pred ecccccCchhhhhhhhhhheehhhcCC Confidence 55555555433 344444433333 No 52 >protein:vir:4463 Length: 498 # NCBI annotation: Tail Sheath protein # Family: family:all:369 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700386;genbank:gi:23505458;genbank:GeneID:955665 Probab=99.08 E-value=1.5e-09 Score=68.89 Aligned_cols=454 Identities=12% Similarity=0.040 Sum_probs=215.6 Q ss_pred CCC-----cchhcCCceEEEEecCCCceecccCCceEEEeeecc---CcCCCCeEEecCHHHHHHHcCCCCccchhHHHH Q lcl|NC_013693. 1 MAT-----QSFSVAPSVQWTERDATLQTSPSVVVQGATVGKFQW---GEAELPVLVTGGETGLVKKFFKPNDATATDFLV 72 (631) Q Consensus 1 m~~-----~~~ylsPGVyveEv~~~~~~~gv~tsv~afvG~~~~---Gp~~~p~~i~s~~~e~~~~fG~~~~~~~~~~av 72 (631) |.. ++..+-||+|+|--.+.. ......--.-+||..-- .|.++|++|+| ..|-...||. .+.+..++ T Consensus 1 M~IsF~~IP~~iRvP~~y~E~dns~A-~~~~~~q~vLiiGq~la~gs~~~~~~v~v~s-~~~a~~~fG~---GSml~~M~ 75 (498) T protein:vir:44 1 MAISFNSIPSDTRVPLFYAEMDNSAA-NTARDSGASLLIGHASNDASIAVNSLVLVSS-VDYARQICGA---GSQLARMV 75 (498) T ss_pred CCCchhhcCcccccCeEEEEEeCCCC-CCCcCCcceEEEEecCcccccccceeEeecC-HHHHHHhcCc---ccHHHHHH Confidence 655 466888999999533333 22233345677887653 37899999966 5799999995 78889999 Q ss_pred HHHHHhCC-ceEEEEEecccCCCcccccccchhhhccccccccccccceeeehhhhhhhhhhchhhhhccCcccceeecc Q lcl|NC_013693. 
73 IADFLSYS-SVAWVTRVVGPAARNAVTKGQTAILIRNKLDFETASPSASITWTGRYAGSLGNDVAINVCDAAGFPTWEFR 151 (631) Q Consensus 73 ~~fF~ngG-~~~~vvRv~~~~a~~a~~~~~~~~~~~~~~~~~~~~~~~~l~~~a~~~G~~gn~l~v~v~~~~~~~~~~~~ 151 (631) +.|..+.- .++|++=+.|. +. ..+++. ..+..++...|.. T Consensus 76 ~a~~~~n~~~~l~~i~~~D~-aG-~aAtg~-----------------it~tg~at~~G~l-------------------- 116 (498) T protein:vir:44 76 GAYRKTDPFGELYVIAVPES-TG-AAATVA-----------------LTVTGEATETGTV-------------------- 116 (498) T ss_pred HHHHHhCCCceeEEEecCCc-cc-ceeEEE-----------------EEeecccCCCcEE-------------------- Confidence 99988765 79999977542 11 111110 0111111111100 Q ss_pred ceeeeecccccceEEeeeeeeeeecccccccceeeeeecccccccceeEeeccccccccccccccccccccccccccccc Q lcl|NC_013693. 152 NNFAYAPQAGEYHIVIVDKVGRITDSSGAVGQVDRISVSGTATGAGSISVAGEDVAYTDTDTPATLATKIGTALTALTDV 231 (631) Q Consensus 152 ~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 231 (631) ..........+ .+..+.+. ..........+.+..+. T Consensus 117 ------------~l~Igg~~v~v-------------~V~~gdTa-------------------a~vA~al~aaina~~~l 152 (498) T protein:vir:44 117 ------------NVYTGRTRVQA-------------PVTSGDDA-------------------AAVAVSIKDAVNANPDL 152 (498) T ss_pred ------------EEEECCEEEEE-------------EecCCCCH-------------------HHHHHHHHHHHhCCCCC Confidence 00000000000 00000000 00000000000000000 Q ss_pred ccccccccccccccccccccccccceeecccccccceeeeecccccccceeeeeeeeecccccccchhhhhhhhhccccc Q lcl|NC_013693. 232 YSSVVVKSNTVTVTHKAIGPQTVTAIVPDANGLTATAVTTTVGASGSIIEKYELMQATQGSKKSDGSNAYFKDVINDTSN 311 (631) Q Consensus 232 ~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 311 (631) -....+....+.++.++.+. ..+...+.++ +.... ++. 
T Consensus 153 PVTA~~~~~~vtlTAr~kG~-------------~GN~I~l~~~----------~~~~~-------------------~ge 190 (498) T protein:vir:44 153 PFTATSEAGVVTLTARHKGL-------------YGNEIPVTLN----------YYGFG-------------------GGE 190 (498) T ss_pred ceEEeeccceEEEEEeccCc-------------ccCcceEEEe----------eccCc-------------------ccc Confidence 00000000111111111111 1111100000 00000 000 Q ss_pred eeeeecccccccccccccccccchhhhhhHHHHHhhhhhcccceeEEeccccchHHHHHHHHHhhccceEEeeccccccc Q lcl|NC_013693. 312 WVYTFATTLAAGVTELEGGVDDYTGNRVAAIEALNNAEAYDAKPVFAFCEELIEQQTLIDLSTERKDTVSFVSPLRDVVV 391 (631) Q Consensus 312 ~~~~~~~~~~~~~~~l~gg~d~~~~~~~~~~~~l~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~a~~d~~~~~~~ 391 (631) ..........+...||.. .++...+.+.+ .......+++...+.+...++.+|++....|+..+........ T Consensus 191 ---~~p~Glt~titamsgGag--~PDia~alaal---~~~~~~~i~~p~~D~asl~al~~~L~~~sgRw~~~~q~~g~~~ 262 (498) T protein:vir:44 191 ---VLPAGVNITVASGVKGAG--APALNDAVAAM---GDEPFDYIGLPFNDTASVNSMATEMNDSSGRWSYVRQLYGHVY 262 (498) T ss_pred ---ccccceeEEEEcccCCcc--CchhHHHHHhh---ccCCccEEEEeecCHHHHHHHHHHHhhhhcchHHHhhcCeEEE Confidence 000011122344555542 33444444443 3333344455455666777777777542211111111111111 Q ss_pred ccccCCHHHHHHHHHhcCCCcceEEEecCeeEEEeccCCceeEeehHH---HHHHHHH---HhhccCCceecccceeece Q lcl|NC_013693. 392 GNRGREMEDVVAWRESLVRDSSYFFMDDNWAYVYDKYNDKMRWIPACG---GTAGVWA---RSIEIAGIYKSPAFHNRGK 465 (631) Q Consensus 392 ~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~---~~ag~~a---~~d~~~g~~~span~~~~~ 465 (631) .....+..++.++-...| +.+..+.+... + ..-|+-. 
.+|+..| +.|..| .--..+ T Consensus 263 ~a~~gT~a~l~t~g~~~N--~~~it~~~~~~-------~--~~sp~~~~AAa~a~~aA~~l~~DPAr----PL~tl~--- 324 (498) T protein:vir:44 263 TAKTGTLSELVAAGDQFN--LQHITLAGYEK-------D--TQTPADELAASRTARAAVFIRNDPAR----PTQTGE--- 324 (498) T ss_pred EeccCCHHHHHHhhhccC--CceEEEEecCC-------C--CCCHHHHHHHHHHHHHHHHhhccccc----ccCcee--- Confidence 223456888888877654 55555433210 0 0113222 3333333 344333 112223 Q ss_pred eeecc--cceecCChhHhhhhhhcCceEEEEEcCCcEEEEcceec-----C-CCChhhceehhhHHHHHHHHHHHHHHHH Q lcl|NC_013693. 466 YNNYN--RMAWSASSDERAVLYRNQINSIVTFSNEGIVLYGDKTG-----L-TRPSAFDRINVRGLFIMAEQNIAAIAKY 537 (631) Q Consensus 466 i~g~~--~~~~~~~~~~~~~L~~~gin~i~~~~~~G~~~wg~rT~-----~-~~~~~~~~i~vrR~~~~i~~~~~~~~~~ 537 (631) +.|+. .+.-+++..|+|.|.-+||.++..-.| -..+--..|. . ..|..|..|+..|+.+|+++.++..+.. T Consensus 325 L~Gi~~p~~~~r~~~~ern~LL~~Gist~~V~~G-~V~I~R~ITTY~~n~~G~~D~syLDi~T~~tl~yvr~~~r~~i~~ 403 (498) T protein:vir:44 325 LVDMLPAPKGKRFTTTEQQTLLSHGVATAYVESG-VLRIQRDITTYRKNAYGVADNSYLDSETLHTSAYVLRRLKSVITS 403 (498) T ss_pred ecccccCCchhcCChHHHHHHHhcCcceEEEcCC-eEEEEeeeeeeeecCCCCcchhhhhhhhHHHHHHHHHHHHHHhhh Confidence 45554 446778999999999999999976433 2444444443 1 2467899999999999999999987742 Q ss_pred -HhcCCCCH-----------HHHHHHHHHHHHHHHHHHhCCceeee---E--EEEccCCCCHHHhhCCeEEEEEEEEecC Q lcl|NC_013693. 538 -YLGENNDE-----------FTRSLFSNAVRPYIRQLANMGAIYDG---Q--VKCDADNNTADIIAANQMVAGIWLKPEY 600 (631) Q Consensus 538 -~v~epn~~-----------~~~~~i~~~i~~~l~~l~~~gal~g~---~--v~~~~~~nt~~~i~~G~~~~~i~~~p~~ 600 (631) |--+..-+ -+-+.|+..+-+.+++|..+|-+..+ + +.|.++-+. ..|+.+.+-...+. T Consensus 404 kfpR~KLa~d~~~~~~gq~IvTp~~ir~eli~~y~~le~~givEn~~~~~~~LiVerd~~d-----pnRln~~~p~d~vn 478 (498) T protein:vir:44 404 KYGRHKLANDGTRFGSGQAIVTPAVIRGELGSTYRQMEREGIVENFDLFQQHLIVERNAND-----SNRLDVLFPPDYVN 478 (498) T ss_pred hcCCcccccCCcccCCCcccccHHHHHHHHHHHHHhhhhhccccChhhhcceeEEEECCCC-----CcEEEEEecccccC Confidence 22222211 26678999999999999999988763 2 344332211 24566665555555 Q ss_pred CceE----EEEEEEEEecCc Q lcl|NC_013693. 
601 SINW----VYLDFAAVRPDM 616 (631) Q Consensus 601 p~e~----i~~~~~~~~~~~ 616 (631) +++- |.|+++.....+ T Consensus 479 ~L~V~A~~~~f~lq~~~~~~ 498 (498) T protein:vir:44 479 QLRVFAVLNQFRLQYSEEAA 498 (498) T ss_pred chhhhhhhhhhhhhhhhhcC Confidence 5443 334444333333 No 53 >protein:vir:3751 Length: 376 # NCBI annotation: orf23 # Family: family:all:669 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043492;genbank:gi:9628627;genbank:GeneID:1261129 Probab=99.01 E-value=5.3e-09 Score=65.95 Aligned_cols=340 Identities=11% Similarity=0.040 Sum_probs=175.6 Q ss_pred ccccccccccccc---ccccc--ceeecccccccceeeeecccccccceeeeeeeeecccccccchhhhhhhhhcc-ccc Q lcl|NC_013693. 238 KSNTVTVTHKAIG---PQTVT--AIVPDANGLTATAVTTTVGASGSIIEKYELMQATQGSKKSDGSNAYFKDVIND-TSN 311 (631) Q Consensus 238 ~~~~~~v~~~~~~---~~~~~--~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~ 311 (631) ....+.+...... ..... ..+... +.......+.++... .+....+........ .+.....+ |.. T Consensus 1 ~~~~v~vn~ln~~qg~~~~ver~~lfig~-~~~~~~~~~~~~~~s-------dld~~lg~~ds~lk~-~v~aa~~naG~~ 71 (376) T protein:vir:37 1 MFPSVQINALNQLSGETKEIERHALFVGV-GTTNQGKLLALTPDS-------DFDKVFGETDTDLKK-QVRAAMLNAGQN 71 (376) T ss_pred CCCeEEEeeeeccCCCcccccceEEEeec-cccccCceEEecCCC-------ChHHhhCCCchhHHH-HHHHHHhCCCCc Confidence 0000000000000 00000 000000 000000111111111 111111111111111 12222222 322 Q ss_pred eeeeecccccccccccccccccchhhhhhHHHHHhhhhhcccceeEEeccc---cchHHHHHH----HHHh-hccceEEe Q lcl|NC_013693. 312 WVYTFATTLAAGVTELEGGVDDYTGNRVAAIEALNNAEAYDAKPVFAFCEE---LIEQQTLID----LSTE-RKDTVSFV 383 (631) Q Consensus 312 ~~~~~~~~~~~~~~~l~gg~d~~~~~~~~~~~~l~~~~~~~~~~~i~~~~~---~~~~~~~~~----~~~~-~~~~~a~~ 383 (631) +......+ +.+ ..+...+.... .+.+......++-|. 
.+...++.+ +..+ ++-.|.++ T Consensus 72 w~a~~~~p----------~~~--~~~~~~Av~~a--~~~~s~E~V~v~~p~~t~~a~i~a~qa~a~el~~~~~R~vffil 137 (376) T protein:vir:37 72 WFAHVYIA----------QED--GYDFVECVKKA--NQTASFEYCVNTRYLGVDKASIGKLQECYAELLAKFGRRTFFIQ 137 (376) T ss_pred eEEEEEec----------CCC--hhhHHHHHHHH--HhhCCeeEEEEecCcchhHHHHHHHHHHHHHHHHhcCCeEEEEE Confidence 21111110 000 11223333322 234444444443332 222223322 2233 35677777 Q ss_pred ecccccccccccCCHHHHHHHHHh--cCCCcceEEEecCeeEEEeccCCceeEeehHHHHHHHHHHhhccCCceecccce Q lcl|NC_013693. 384 SPLRDVVVGNRGREMEDVVAWRES--LVRDSSYFFMDDNWAYVYDKYNDKMRWIPACGGTAGVWARSIEIAGIYKSPAFH 461 (631) Q Consensus 384 d~~~~~~~~~~~~~~~~~~~~~~~--~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~ag~~a~~d~~~g~~~span~ 461 (631) ..+.-......+.++++-.+.... -++.+.++.++.... + ...|.+||.+++. ..-++.||.-. T Consensus 138 e~~g~d~~~~~ge~w~~y~~~l~a~~~gia~~~V~vV~~~~-------g-----n~~G~~aGRl~na--aVsVadspgRV 203 (376) T protein:vir:37 138 AVQGINHDQSDGETWDQYVQKLTTLQQTIVADHVCLVPLLF-------G-----NETGVLAGRLANR--AVTVADSPARV 203 (376) T ss_pred eccCCCCcccccCCHHHHHHHHHHHhccccccceeeeeeec-------c-----chHHHHHHHHHhC--CcchhcCccce Confidence 764211112234466555444332 244566776663211 1 2468889988752 22368899988 Q ss_pred eeceeeecccce-------ecCChhHhhhhhhcCceEEEEEcC-CcEEEEcceecCCCChhhceehhhHHHHHHHHHHHH Q lcl|NC_013693. 462 NRGKYNNYNRMA-------WSASSDERAVLYRNQINSIVTFSN-EGIVLYGDKTGLTRPSAFDRINVRGLFIMAEQNIAA 533 (631) Q Consensus 462 ~~~~i~g~~~~~-------~~~~~~~~~~L~~~gin~i~~~~~-~G~~~wg~rT~~~~~~~~~~i~vrR~~~~i~~~~~~ 533 (631) .-+.+.|+..+. ..++.+..+.|..+|-.+.+.++| .|+++-+.||++...++|++|..+|.++-+.+.++. 
T Consensus 204 ~tGai~gl~~~~~p~d~~g~el~~a~l~aLd~arysvpr~Y~gydG~Yw~dg~tl~~~gsDYq~ie~~RVvdKa~R~vR~ 283 (376) T protein:vir:37 204 QTGALVSLGSANKPLDKDGNELTLAHLKSLETARYSVPMWYPDYDGYYRADGRTLDVEGGDYQVIENLRVVDKVARKVRL 283 (376) T ss_pred eecccccccccccccccCCcccchHHHHHHHhCCCeEEEeeCCCCceEEeCCeEeccCCCCeeeehhchHHHHHHHHHHH Confidence 777776654432 235667888999999999999997 477777889999888999999999999988887776 Q ss_pred HHHHHhcCC-C--CHHHHHHHHHHHHHHHHHHHhCCceeee----EEEEccCCCCHHHh-----hCCeEEEEEEEEecCC Q lcl|NC_013693. 534 IAKYYLGEN-N--DEFTRSLFSNAVRPYIRQLANMGAIYDG----QVKCDADNNTADII-----AANQMVAGIWLKPEYS 601 (631) Q Consensus 534 ~~~~~v~ep-n--~~~~~~~i~~~i~~~l~~l~~~gal~g~----~v~~~~~~nt~~~i-----~~G~~~~~i~~~p~~p 601 (631) ..-..+..+ . ++.-.+..+..++.-|+.|.+.+.|.|. +|...++ +|| ...++.+.+.++|.-- T Consensus 284 ~Ai~~i~Dr~lnstp~sia~~~~~~~~pLr~M~ks~ei~g~~fpgei~~P~d----~dI~i~w~sk~~V~I~~~vrPy~c 359 (376) T protein:vir:37 284 LAIGKIADRSFNSTTSSTEYHKNYFAKPLRDMSKSATINGKDFPGECMPPKD----DAITIVWQSKTKVTIYIKVRPYDC 359 (376) T ss_pred HHHHHhcCccccCChhHHHHHHHHHhHHHHHHHhhhhhccccccceeecCCC----CceEEEeccCceEEEEEEEeeecC Confidence 655444433 2 4677788889999999999999999984 3555433 233 3577888899999988 Q ss_pred ceEEEEEEEEEecCceeee Q lcl|NC_013693. 602 INWVYLDFAAVRPDMEFSE 620 (631) Q Consensus 602 ~e~i~~~~~~~~~~~~~~e 620 (631) .+.|+..|--.-. +..| T Consensus 360 pk~i~~~I~LDls--~~~~ 376 (376) T protein:vir:37 360 PKEITANIFLDLD--SLGE 376 (376) T ss_pred cceeEEEEEEecC--CCCC Confidence 8889888764321 1222 No 54 >protein:vir:1996 Length: 495 # NCBI annotation: major tail subunit # Family: family:all:369 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050643;genbank:gi:9633530;genbank:GeneID:2636269 Probab=99.01 E-value=3.3e-09 Score=67.06 Aligned_cols=451 Identities=13% Similarity=0.033 Sum_probs=216.7 Q ss_pred CCC------cchhcCCceEEEEecCCC-ceecccCCceEEEeeec---cCcCCCCeEEecCHHHHHHHcCCCCccchhHH Q lcl|NC_013693. 
1 MAT------QSFSVAPSVQWTERDATL-QTSPSVVVQGATVGKFQ---WGEAELPVLVTGGETGLVKKFFKPNDATATDF 70 (631) Q Consensus 1 m~~------~~~ylsPGVyveEv~~~~-~~~gv~tsv~afvG~~~---~Gp~~~p~~i~s~~~e~~~~fG~~~~~~~~~~ 70 (631) |++ ++..+-||+|+|--.+.. +....-..-.-+||..- ..|.++|++|+| ..|-...||. .+.+.. T Consensus 1 m~~i~F~~IP~~iRvP~~y~E~dns~A~~g~~~~~q~vLiiGq~la~gs~~~~~pv~v~s-~~~a~~~fG~---GS~la~ 76 (495) T protein:vir:19 1 MSDISFNAIPSDVRVPLTYIEFDNSNAVSGTPAPRQRVLMFGQSGSKASAAPNVPVRIRS-GSQASAAFGQ---GSMLAL 76 (495) T ss_pred CCCCchhhCCcccccCeEEEEEccCCCCcCCcCCCceEEEEEecCcccccccceeEEecC-HHHHHHhcCc---CcHHHH Confidence 554 556889999999443322 22222234557788753 347899999976 5799999996 788888 Q ss_pred HHHHHHHhC-CceEEEEEecccCCCcccccccchhhhccccccccccccceeeehhhhhhhhhhchhhhhccCcccceee Q lcl|NC_013693. 71 LVIADFLSY-SSVAWVTRVVGPAARNAVTKGQTAILIRNKLDFETASPSASITWTGRYAGSLGNDVAINVCDAAGFPTWE 149 (631) Q Consensus 71 av~~fF~ng-G~~~~vvRv~~~~a~~a~~~~~~~~~~~~~~~~~~~~~~~~l~~~a~~~G~~gn~l~v~v~~~~~~~~~~ 149 (631) +++.|..+. -.++|++-+.|... . .+++ ...+..++...|.. .+ T Consensus 77 M~~a~~~~n~~~~l~~i~~~D~aG-~-aA~g-----------------~it~tg~at~~G~l----~l------------ 121 (495) T protein:vir:19 77 MADAFLNANRVAELWCIPQGNGTG-N-AAVG-----------------EISLSGTAGENGSL----VT------------ 121 (495) T ss_pred HHHHHHHhCCcceEEEEeeCChhh-c-eeEE-----------------EEEEeecCCCCcEE----EE------------ Confidence 888888766 47999998865311 1 1111 00111111111110 00 Q ss_pred ccceeeeecccccceEEeeeeeeeeecccccccceeeeeecccccccceeEeeccccccccccccccccccccccccccc Q lcl|NC_013693. 150 FRNNFAYAPQAGEYHIVIVDKVGRITDSSGAVGQVDRISVSGTATGAGSISVAGEDVAYTDTDTPATLATKIGTALTALT 229 (631) Q Consensus 150 ~~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 229 (631) ........ +.+..+.+.... +...............++.. 
T Consensus 122 ----------------~I~g~~v~-------------v~V~~gdTaa~v-----------A~al~aaina~~~lPvTA~~ 161 (495) T protein:vir:19 122 ----------------YIAGQRLA-------------VSVAAGATGAAL-----------ADLLVARIKGQPDLPVTAEV 161 (495) T ss_pred ----------------EECCEEEE-------------EEecCCCCHHHH-----------HHHHHHHhcCCccCceEEEe Confidence 00000000 000000000000 00000000000000000000 Q ss_pred ccccccccccccccccccccccccccceeecccccccceeeeecccccccceeeeeeeeecccccccchhhhhhhhhccc Q lcl|NC_013693. 230 DVYSSVVVKSNTVTVTHKAIGPQTVTAIVPDANGLTATAVTTTVGASGSIIEKYELMQATQGSKKSDGSNAYFKDVINDT 309 (631) Q Consensus 230 ~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 309 (631) ............++++.++.+.. .. .. ....+. . + T Consensus 162 ~~~~~~~~a~~~VtlTAr~kG~~-n~-------------id----------i~~~~~--------------------~-g 196 (495) T protein:vir:19 162 RADSGDDDTHADVVLSAKFTGAL-SA-------------VD----------VRWNYY--------------------A-G 196 (495) T ss_pred eccCCCCcCceeEEEEEeecccc-cc-------------ce----------eEEEee--------------------c-c Confidence 00000000001111111111100 00 00 000000 0 0 Q ss_pred cceeeeecccccccccccccccccchhhhhhHHHHHhhhhhcccceeEEeccccchHHHHHHHHHhhc------cceEEe Q lcl|NC_013693. 310 SNWVYTFATTLAAGVTELEGGVDDYTGNRVAAIEALNNAEAYDAKPVFAFCEELIEQQTLIDLSTERK------DTVSFV 383 (631) Q Consensus 310 ~~~~~~~~~~~~~~~~~l~gg~d~~~~~~~~~~~~l~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~------~~~a~~ 383 (631) . ..........+...||.. .++...+...+ .......+++.-.+.+...++.+|++.+- +.+++. T Consensus 197 e----~~p~Glt~titamsgGag--~PDia~alaal---~~~~~~~I~~P~tD~asL~al~~~l~~rw~~~~q~~g~~~~ 267 (495) T protein:vir:19 197 E----TTPYGIITAFKAASGKNG--NPDISASIAGM---GDLQYKYIVMPYTDEPNLNLLRTELQERWGPVNQADGFAVT 267 (495) T ss_pred c----ccccceeEEEEecCCCCC--CcchHHHHHHh---ccCCCcEEEEecCcHHHHHHHHHHHHHhhhHHHhcCeEEEE Confidence 0 000111122345566642 23444444444 33344444444446666778888877632 223332 Q ss_pred ecccccccccccCCHHHHHHHHHhcCCCcceEEEecCeeEEEeccCCceeEeehHHHHHHHHHHhh--ccCCceecccce Q lcl|NC_013693. 
384 SPLRDVVVGNRGREMEDVVAWRESLVRDSSYFFMDDNWAYVYDKYNDKMRWIPACGGTAGVWARSI--EIAGIYKSPAFH 461 (631) Q Consensus 384 d~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~ag~~a~~d--~~~g~~~span~ 461 (631) ....+..+..++-...| +.+..+.+- ++. .-||....|++.++.- .+..|-..--.. T Consensus 268 ---------a~~gT~~~l~t~g~~~N--~~~it~~~~--------~gs--p~~~~~~AAA~aa~~A~~l~~DPArPL~tl 326 (495) T protein:vir:19 268 ---------VLSGTYGDISTFGVSRN--DHLISCMGI--------AGA--PEPSYLYAATLCAVASQALSIDPARPLQTL 326 (495) T ss_pred ---------eecCCHHHHHHhhhccC--CceEEEEec--------CCC--CCcHHHHHHHHHHHHHHHhhcccccccCce Confidence 23457788888877554 555554421 111 1334443333333321 122231122233 Q ss_pred eeceeeecc--cceecCChhHhhhhhhcCceEEEEEcCCcEEEEcceecC------CCChhhceehhhHHHHHHHHHHHH Q lcl|NC_013693. 462 NRGKYNNYN--RMAWSASSDERAVLYRNQINSIVTFSNEGIVLYGDKTGL------TRPSAFDRINVRGLFIMAEQNIAA 533 (631) Q Consensus 462 ~~~~i~g~~--~~~~~~~~~~~~~L~~~gin~i~~~~~~G~~~wg~rT~~------~~~~~~~~i~vrR~~~~i~~~~~~ 533 (631) . +.|+. .+.-+++..|+|.|.-+||.++..-.++-..+--..|.- ..|..|..|+.-|+.+|+++.++. T Consensus 327 ~---L~Gi~~p~~~~r~~~~ern~LL~~Gist~~V~~~G~V~I~R~ITTY~~n~~G~~D~syLDi~T~~tl~yvr~~~r~ 403 (495) T protein:vir:19 327 T---LPGRMPPAVGDRFTWSERNALLFDGISTFNVNDGGEMQIERMITMYRTNKYGDSDPSYLNVNTIATLSYLRYSLRT 403 (495) T ss_pred e---ecceecCCccccCChHHHHHHHhCCcceEEECCCCeEEEEeeeeeeeecCCCCcchhhhhhHHHHHHHHHHHHHHH Confidence 3 44544 446778999999999999999876544445555555541 245789999999999999999998 Q ss_pred HHHHHhc-CCCCHH-----------HHHHHHHHHHHHHHHHHhCCceeee---E--EEEccCCCCHHHhhCCeEEEEEEE Q lcl|NC_013693. 534 IAKYYLG-ENNDEF-----------TRSLFSNAVRPYIRQLANMGAIYDG---Q--VKCDADNNTADIIAANQMVAGIWL 596 (631) Q Consensus 534 ~~~~~v~-epn~~~-----------~~~~i~~~i~~~l~~l~~~gal~g~---~--v~~~~~~nt~~~i~~G~~~~~i~~ 596 (631) .....-. +..-++ +-+.||..+-+.+++|..+|-+..+ + +.|.++-+. .+|+.+.+-. 
T Consensus 404 ~i~~kfpR~KLa~d~~~~~~gq~IvTp~~ir~ell~~~~~le~~given~~~~~~~LiVerd~~d-----pnRln~~~p~ 478 (495) T protein:vir:19 404 RITQKFPNYKLASDGTRFATGQAVVTPSVIKTELLALFEEWENAGLVEDFDTFKEELYVARNKDD-----KDRLDVLCGP 478 (495) T ss_pred HHhhhcCCcccccCCCCCCCcccccChHHHHHHHHHHHHhhhhhccccChhhhcceeEEEECCCC-----CcEEEEEecc Confidence 7753322 222222 5677899999999999999988763 2 334332211 3567777766 Q ss_pred EecCCceEEEEEEEEEe Q lcl|NC_013693. 597 KPEYSINWVYLDFAAVR 613 (631) Q Consensus 597 ~p~~p~e~i~~~~~~~~ 613 (631) ..+..++-+-.+++--- T Consensus 479 d~vn~L~V~A~~i~f~L 495 (495) T protein:vir:19 479 NLINQFRIFAAQVQFIL 495 (495) T ss_pred eeeCceeeeeeeeeeeC Confidence 66666654444333111 No 55 >protein:vir:95263 Length: 450 # NCBI annotation: Phage conserved protein # Family: family:all:396 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944896;genbank:gi:38707836;genbank:GeneID:2744049 Probab=98.98 E-value=8.2e-09 Score=64.91 Aligned_cols=436 Identities=11% Similarity=-0.050 Sum_probs=204.5 Q ss_pred CCCcchhcCCceEEEEecCCCcee-cccCCceEEEeeeccCcCCCCeEEecCHHHHHHHcCCCCccchhHHHHHHHHHhC Q lcl|NC_013693. 1 MATQSFSVAPSVQWTERDATLQTS-PSVVVQGATVGKFQWGEAELPVLVTGGETGLVKKFFKPNDATATDFLVIADFLSY 79 (631) Q Consensus 1 m~~~~~ylsPGVyveEv~~~~~~~-gv~tsv~afvG~~~~Gp~~~p~~i~s~~~e~~~~fG~~~~~~~~~~av~~fF~ng 79 (631) |=+ +=|-|. +...+..+ +..=+...|+|... -|. ++++..++..+-...||. .+..+.+.+.||.+. T Consensus 1 ~~s------~iVnV~-i~~~~~a~~~~~f~~~l~~~~~~-~~~-~r~~~yss~~~V~~~FG~---~S~ey~aA~~yF~q~ 68 (450) T protein:vir:95 1 MWN------PIVNVD-ITLNTAGTTREGFGLPLFLASTD-NFE-ERVRGYTSLTEVAEDFDE---NTAAYKAAKQLWSQT 68 (450) T ss_pred CCC------ceEEEe-ecccccccccccceeEEEEcCCC-CCc-cceeeecCHHHHHHhcCC---CcHHHHHHHHHHhCC Confidence 222 112222 11111112 23334556666543 343 567777777888999996 667788999999875 Q ss_pred --CceEEEEEecccCCCcccccccchhhhccccccccccccceeeehhhhhhhhhhchhhhhccCcccceeeccceeeee Q lcl|NC_013693. 
80 --SSVAWVTRVVGPAARNAVTKGQTAILIRNKLDFETASPSASITWTGRYAGSLGNDVAINVCDAAGFPTWEFRNNFAYA 157 (631) Q Consensus 80 --G~~~~vvRv~~~~a~~a~~~~~~~~~~~~~~~~~~~~~~~~l~~~a~~~G~~gn~l~v~v~~~~~~~~~~~~~~~~~~ 157 (631) -+++||-|-.... ++ .. ...+.+ T Consensus 69 p~p~~l~igr~~~~~----t~----~~---------------~~~~~~-------------------------------- 93 (450) T protein:vir:95 69 PKVTQLYIGRRAMQY----TV----SI---------------PDAVTE-------------------------------- 93 (450) T ss_pred CcccEEEEEeeccch----hh----hh---------------hhhhcc-------------------------------- Confidence 3577777653210 00 00 000000 Q ss_pred cccccceEEeeeeeeeeecccccccceeeeeecccccccceeEeeccccccccccccccccccccccccccccccccccc Q lcl|NC_013693. 158 PQAGEYHIVIVDKVGRITDSSGAVGQVDRISVSGTATGAGSISVAGEDVAYTDTDTPATLATKIGTALTALTDVYSSVVV 237 (631) Q Consensus 158 ~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 237 (631) ...+. .. +.+.+.... ...+. ... .+ .+........ T Consensus 94 ~~~g~---------lt-------------~tv~G~~~~--~~~i~------------~s~----a~---s~~~va~~~~- 129 (450) T protein:vir:95 94 STDYS---------IT-------------VAAGGGISQ--PYQYT------------AQS----SD---TAENVLQQFK- 129 (450) T ss_pred cccee---------EE-------------EEecceeee--eeEEE------------EEe----cC---ChhhHHHHhh- Confidence 00000 00 000000000 00000 000 00 0000000000 Q ss_pred ccccccccccccccccccceeecccccccceeeeecccccccceeeeeeeeecccccccchhhhhhhhhccccceeeeec Q lcl|NC_013693. 238 KSNTVTVTHKAIGPQTVTAIVPDANGLTATAVTTTVGASGSIIEKYELMQATQGSKKSDGSNAYFKDVINDTSNWVYTFA 317 (631) Q Consensus 238 ~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 317 (631) . .+. .... ....+.... .+..... .+. ..... .... .. 
T Consensus 130 t--ai~------~~~~-----------~~~~~~~~s--~g~~~~~-t~~-~~~~~------~~~~-------------~~ 167 (450) T protein:vir:95 130 T--QIE------ADPT-----------IKDKVSVNV--TGSNGSA-TMI-IAKAG------DNDF-------------VK 167 (450) T ss_pred h--hhc------ccce-----------eeeeeeeee--eccccee-eee-eeccc------cchh-------------hc Confidence 0 000 0000 000011000 0000000 000 00000 0000 00 Q ss_pred ccccccccccccccccchhhhhhHHHHHhhhhhcccceeEEeccc-cchHHHHHHHHHhhccceEEeecccccccccccC Q lcl|NC_013693. 318 TTLAAGVTELEGGVDDYTGNRVAAIEALNNAEAYDAKPVFAFCEE-LIEQQTLIDLSTERKDTVSFVSPLRDVVVGNRGR 396 (631) Q Consensus 318 ~~~~~~~~~l~gg~d~~~~~~~~~~~~l~~~~~~~~~~~i~~~~~-~~~~~~~~~~~~~~~~~~a~~d~~~~~~~~~~~~ 396 (631) ....... -...|... .....+...+..... +. ..+..+.. .+...++.++++....+|.....-.......... T Consensus 168 l~~~~~~-~~~~g~~a--et~~~a~~a~~~~~~-~w-~~~~~~~~~~~~i~a~a~w~~a~~~~f~~~~~~~~~~~~~~~~ 242 (450) T protein:vir:95 168 VTTTAQT-VYIASTTA--DTASTALAAIEAYST-DW-YFIAAEDRTQQFVLAMASEIQARKKIFFTANSDVTALQGTELA 242 (450) T ss_pred cccccce-eEeccccc--ccHHHHHHHHHHhhC-Ce-EEEEecCCCHHHHHHHHHHHhhcCcEEEEEcCCchhhhhhhhh Confidence 0000000 01111111 111122222222111 11 22222222 1233456666776665665542211111110001 Q ss_pred CHHHHHHHHHhcCCCcceEEEecCeeEEEeccCCceeEeehHHHHHHHHHHhhccCCceecccceeeceeee-cc-ccee Q lcl|NC_013693. 397 EMEDVVAWRESLVRDSSYFFMDDNWAYVYDKYNDKMRWIPACGGTAGVWARSIEIAGIYKSPAFHNRGKYNN-YN-RMAW 474 (631) Q Consensus 397 ~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~ag~~a~~d~~~g~~~span~~~~~i~g-~~-~~~~ 474 (631) ...+....-...++ +.-+.+|++ .+. .-.+.+.++|.....+.-+-- -.+|.+.||.. +. +... 
T Consensus 243 ~~~~i~~~l~~~~~-~~t~~~y~~-------~~~---~~~~~aa~~g~~~~~~~g~~T---~~fk~l~Gv~~~v~~~~~~ 308 (450) T protein:vir:95 243 SANDVPAQLAKNMY-TRTVCLWHH-------AAA---EDYPEMAYIAYGAPYDAGSIA---WGNAQLTGVAASLQPSNQR 308 (450) T ss_pred cccchHHHHHhccC-CeeEEEeeC-------CCc---hhHHHHHHHHHhhhcccceee---eccccccceeeeccCcccc Confidence 11111111111111 112333332 111 122556666665544332212 23555444432 11 1234 Q ss_pred cCChhHhhhhhhcCceEEEEEcCCcEEEEcceecCCCChhhceehhhHHHHHHHHHHHHHHHHHhc------CCCCHHHH Q lcl|NC_013693. 475 SASSDERAVLYRNQINSIVTFSNEGIVLYGDKTGLTRPSAFDRINVRGLFIMAEQNIAAIAKYYLG------ENNDEFTR 548 (631) Q Consensus 475 ~~~~~~~~~L~~~gin~i~~~~~~G~~~wg~rT~~~~~~~~~~i~vrR~~~~i~~~~~~~~~~~v~------epn~~~~~ 548 (631) .+++.|.+.|..+++|++.++.+.++ ++..+|+.+ .||-++|-.+|++..|++.+...+- =|-|+.-. T Consensus 309 ~lt~~~~~al~~~~~n~y~~~~~~~~-~~~G~~~~G-----~~iD~~~~~~wl~~~iq~~l~~ll~~~~~~KiPy~~~G~ 382 (450) T protein:vir:95 309 PLTSIQKSALDVRHCNFIDLDGGVPV-VRRGITSGG-----EWIDIIRGVDWLESDLKTSLRDLLINQKGGKITYDDTGI 382 (450) T ss_pred ccchHHHHHHHhCCcEEEEEecCcee-eeCCeeeCc-----chhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCccChhhH Confidence 68899999999999999999876654 777788655 2788999999999999999886652 26677888 Q ss_pred HHHHHHHHHHHHHHHhCCceeeeEEEEc-cCCCCHHHhhCCeEE-EEEEEEecCCceEEEEEEEEEec Q lcl|NC_013693. 549 SLFSNAVRPYIRQLANMGAIYDGQVKCD-ADNNTADIIAANQMV-AGIWLKPEYSINWVYLDFAAVRP 614 (631) Q Consensus 549 ~~i~~~i~~~l~~l~~~gal~g~~v~~~-~~~nt~~~i~~G~~~-~~i~~~p~~p~e~i~~~~~~~~~ 614 (631) ..|+..|+.-|++.+++|.|.||+|.+. .+..++.|..++++. 
+.+.++....++++.+++..+-+ T Consensus 383 ~~i~a~i~~~l~~a~~~G~Ia~~~V~~~~~~~~~~~dr~~R~~~~i~~~~~laGAIh~~~i~~~v~~~ 450 (450) T protein:vir:95 383 TRIRQVIETSLQRAVNRNFLSSYTVNVPKASQVALADKKARILKDVTFAGILAGAILDVDLKGTVAYE 450 (450) T ss_pred HHHHHHHHHHHHHHHhcCcccceeEecCChHhcCHHHHhccCCCCeeEEEEEccceEEEEEEEEEEeC Confidence 8999999999999999999999999986 578889999988876 88889999999999998876544 No 56 >protein:vir:3788 Length: 376 # NCBI annotation: tail sheath # Family: family:all:669 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536828;genbank:gi:17981837;genbank:GeneID:929216 Probab=98.94 E-value=9e-09 Score=64.69 Aligned_cols=343 Identities=10% Similarity=0.022 Sum_probs=171.4 Q ss_pred ccccccccccccc--c-cccc--ceeecccccccceeeeecccccccceeeeeeeeecccccccchhhhhhhhhcc-ccc Q lcl|NC_013693. 238 KSNTVTVTHKAIG--P-QTVT--AIVPDANGLTATAVTTTVGASGSIIEKYELMQATQGSKKSDGSNAYFKDVIND-TSN 311 (631) Q Consensus 238 ~~~~~~v~~~~~~--~-~~~~--~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~ 311 (631) ....+.+...... + .... ..+..... ......+.++... .+....+....... ..+...-.+ +.. T Consensus 1 ~~~~v~vn~~n~~~g~~~~~er~~Lfig~~~-~~~~~~~~~~~~s-------dld~~lg~~~~~lk-~~v~aa~~naG~~ 71 (376) T protein:vir:37 1 MFPSVQINALNQLSGETKEIERHALFVGVGT-TNQGKLLALTPDS-------DFDKVFGETDTDLK-KQVRAAMLNAGQN 71 (376) T ss_pred CCCeEEEecccccCCCcccccceEEeecccc-ccccceeeecCcc-------chHhhhCCCchHHH-HHHHHHHhCCCCc Confidence 0000111000000 0 0000 00000000 0000001111111 11111111111111 122222222 222 Q ss_pred eeeeecccccccccccccccccchhhhhhHHHHHhhhhhcccceeEEeccc---cchHHHHHHH----HHh-hccceEEe Q lcl|NC_013693. 312 WVYTFATTLAAGVTELEGGVDDYTGNRVAAIEALNNAEAYDAKPVFAFCEE---LIEQQTLIDL----STE-RKDTVSFV 383 (631) Q Consensus 312 ~~~~~~~~~~~~~~~l~gg~d~~~~~~~~~~~~l~~~~~~~~~~~i~~~~~---~~~~~~~~~~----~~~-~~~~~a~~ 383 (631) +... ......+ + .+...+.... .+.++.....++-+. 
.+...++.++ ..+ ++-.|.++ T Consensus 72 ~~~~-~~~~~~~------~-----~~~~~Av~~a--~~~~s~E~V~v~~pv~t~~a~i~aa~~~a~el~~~~~Rpv~fil 137 (376) T protein:vir:37 72 WFAH-VYIAQED------G-----YDFVECVKKA--NQTASFEYCVNTRYLGVDKASIGKLQECYAELLAKFGRRTFFIQ 137 (376) T ss_pred EEEE-EEeecCC------c-----hHHHHHHHHh--hhhcCceEEEEeccccccHHHHHHHHHHHHHHHHhcCCeEEEEE Confidence 2111 1110000 0 1122222211 233343333333321 2222333333 333 45677777 Q ss_pred ecccccccccccCCHHHHHHHHHhc--CCCcceEEEecCeeEEEeccCCceeEeehHHHHHHHHHHhhccCCceecccce Q lcl|NC_013693. 384 SPLRDVVVGNRGREMEDVVAWRESL--VRDSSYFFMDDNWAYVYDKYNDKMRWIPACGGTAGVWARSIEIAGIYKSPAFH 461 (631) Q Consensus 384 d~~~~~~~~~~~~~~~~~~~~~~~~--~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~ag~~a~~d~~~g~~~span~ 461 (631) ..+.-..+...+.++++-.+.+..+ ++.+.+..++.- .+ -...|.+||.+|+.. .-++.||.-. T Consensus 138 e~r~~~~~~~~~e~w~~y~~~~~al~~gia~~~V~~V~~---~~---------gn~~G~~aGRl~~aa--VsVadspgRV 203 (376) T protein:vir:37 138 AVQGINHDQSDGETWDQYVQKLTTLQQTIVADHVCLVPL---LF---------GNETGVLAGRLANRA--VTVADSPARV 203 (376) T ss_pred eccCcCcccccccCHHHHHHHHHHhhcccccccceeeee---eh---------hhhHHHHHHHHhhcc--cchhhCccce Confidence 7652211222345666555444332 334444443311 00 123588888876532 2267789887 Q ss_pred eeceeeecccce-------ecCChhHhhhhhhcCceEEEEEcCC-cEEEEcceecCCCChhhceehhhHHHHHHHHHHHH Q lcl|NC_013693. 462 NRGKYNNYNRMA-------WSASSDERAVLYRNQINSIVTFSNE-GIVLYGDKTGLTRPSAFDRINVRGLFIMAEQNIAA 533 (631) Q Consensus 462 ~~~~i~g~~~~~-------~~~~~~~~~~L~~~gin~i~~~~~~-G~~~wg~rT~~~~~~~~~~i~vrR~~~~i~~~~~~ 533 (631) ..+.+.|....+ ..++...++.|..+|-.+.+.++|. |+++-+.||++...++++||..+|.++-+.+.++. 
T Consensus 204 ~tG~l~gl~~~~lp~d~~~~~l~~a~l~aLd~agy~vp~~Y~gy~G~Y~~d~~tl~~~gsDY~~ie~~RVvdKa~R~vR~ 283 (376) T protein:vir:37 204 QTGALVSLGSANKPLDKDRNELTLAHLKSLETARYSVPMWYPDYDGYYWADGRTLDVEGGDYQVIENLRVVDKVARKVRL 283 (376) T ss_pred eccccccccccccccCcCcccCCHHHHHHHHhCCCeEEEeeCCCCceEEeCceEeccCCCChhhhhhhhHHHHHHHHHHH Confidence 766676653322 3467888999999999999999984 77777889998888999999999999999888887 Q ss_pred HHHHHhcCCC---CHHHHHHHHHHHHHHHHHHHhCCceeee----EEEEccCCC-CHHHhhCCeEEEEEEEEecCCceEE Q lcl|NC_013693. 534 IAKYYLGENN---DEFTRSLFSNAVRPYIRQLANMGAIYDG----QVKCDADNN-TADIIAANQMVAGIWLKPEYSINWV 605 (631) Q Consensus 534 ~~~~~v~epn---~~~~~~~i~~~i~~~l~~l~~~gal~g~----~v~~~~~~n-t~~~i~~G~~~~~i~~~p~~p~e~i 605 (631) .+-..+.... ++.-.+..+.-+..-|+.|.+..-+.|. +|...++.+ +..-+...++.+.+.+.|.--.+.| T Consensus 284 ~ai~~i~D~~lnst~~sia~~~~yi~~pLr~M~~s~~i~g~~fpGeI~~p~d~Di~i~w~s~~~V~I~~~v~P~~~pk~I 363 (376) T protein:vir:37 284 LAIGKIADRSFNSTTSSTEYHKNYFAKPLRDMSKSATINGKDFPGECMPPKDDAITIVWQSKTKVTIYIKVRPYDCPKEI 363 (376) T ss_pred HHHHHhCCcccCcchhhHHHHHHHHHHHHHHHHhcchhccccccceeecCCCCCceEEeeccceEEEEEEEEeccCCceE Confidence 7766654432 3444566666688889999998888872 455544321 2223478899999999999999999 Q ss_pred EEEEEEEe-cCce Q lcl|NC_013693. 606 YLDFAAVR-PDME 617 (631) Q Consensus 606 ~~~~~~~~-~~~~ 617 (631) +..|--.- .-.+ T Consensus 364 tv~I~Ldlsn~~~ 376 (376) T protein:vir:37 364 TANIFLDLDSLGE 376 (376) T ss_pred EEEEEeecCCCCC Confidence 87765321 1222 No 57 >protein:vir:276 Length: 369 # NCBI annotation: putative tail sheath protein # Family: family:all:669 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536656;genbank:gi:17975134;genbank:GeneID:929090 Probab=98.86 E-value=2.4e-08 Score=62.34 Aligned_cols=339 Identities=12% Similarity=0.037 Sum_probs=168.3 Q ss_pred cccccccccccccccccccccccccccccccceeecccccccceeeeec-c----ccccc--ceeeeeeeeecccccccc Q lcl|NC_013693. 
225 LTALTDVYSSVVVKSNTVTVTHKAIGPQTVTAIVPDANGLTATAVTTTV-G----ASGSI--IEKYELMQATQGSKKSDG 297 (631) Q Consensus 225 ~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~v-~----~~~~~--~~~~~~~~~~~~~~~~~~ 297 (631) +.. ..+.+.......... ......+.++. + ..+.+ +.+...+....+..+... T Consensus 1 m~~------------~~V~in~~n~~qg~~--------~~ver~~lfig~g~~~~~~g~~~~~~~~sdld~~lg~~ds~l 60 (369) T protein:vir:27 1 MAW------------PTVIIKILNLMNGPI--------ADIECHFLFVIRGTVSGEVRNLIMVDSTSDLDDVLAEASAEG 60 (369) T ss_pred CCC------------CceEEecccccCCCc--------ccccceEEEEEeccccccccceEEecCccchHhhcCCcChhH Confidence 000 000000000000000 00000011110 0 00000 011111111111111111 Q ss_pred hhhhhhhhhccccceeeeecccccccccccccccccchhhhhhHHHHHhhhhhcccceeEEecccc--chHHHH----HH Q lcl|NC_013693. 298 SNAYFKDVINDTSNWVYTFATTLAAGVTELEGGVDDYTGNRVAAIEALNNAEAYDAKPVFAFCEEL--IEQQTL----ID 371 (631) Q Consensus 298 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~gg~d~~~~~~~~~~~~l~~~~~~~~~~~i~~~~~~--~~~~~~----~~ 371 (631) . ..+.....+++........+ +. ...+...+.... .+.++.....++-|.. +...++ .+ T Consensus 61 k-~~v~aa~~naG~~w~a~~~p-------~~-----~~~~~~~Av~~a--~~~~s~E~V~v~~p~t~~a~i~aaq~~a~e 125 (369) T protein:vir:27 61 L-AIVKAAQLNGKQAWTAGVMI-------LS-----EEDNWQDAVKKA--NEVSSFEFVVLGFDAETKAMIEDAITLRTE 125 (369) T ss_pred H-HHHHHHHhCCCCceEEEEEE-------eC-----CchhHHHHHHhh--hhhCCccEEEEecCcccHHHHHHHHHHHHH Confidence 1 11222222222211111111 11 011222222221 2334444443333322 222233 33 Q ss_pred HHHh-hccceEEeecccccccccccCCHHHHHHHHHh--cCCCcceEEEecCeeEEEeccCCceeEeehHHHHHHHHHHh Q lcl|NC_013693. 372 LSTE-RKDTVSFVSPLRDVVVGNRGREMEDVVAWRES--LVRDSSYFFMDDNWAYVYDKYNDKMRWIPACGGTAGVWARS 448 (631) Q Consensus 372 ~~~~-~~~~~a~~d~~~~~~~~~~~~~~~~~~~~~~~--~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~ag~~a~~ 448 (631) +..+ ++..|.++..+.-......+.++++-.+.... -++.+.++.++.-+.... .-.|.+||.++.. 
T Consensus 126 l~~~~~R~vffi~e~~~~~~~~~~~e~w~dy~a~l~al~~g~a~~~V~vv~~~~~~g----------n~~G~~aGRl~n~ 195 (369) T protein:vir:27 126 LKNSLGREVGVLCQLPAINNDPTNGQTWSEWLADTVDIPKDVASEYISVVPNVHAAG----------DTLGKYAGRLANK 195 (369) T ss_pred HHHhcCCeEEEEEeccccCCCccccCCHHHHHHHHHHHhhccCcccceeeeeecccc----------chHHHHHHHHHhc Confidence 3333 34567777644211122234455554433221 244677777774333222 2357788888752 Q ss_pred hccCCceecccceeeceeeecccce-----ecCChhHhhhhhhcCceEEEEEcC-CcEEEEcceecCCCChhhceehhhH Q lcl|NC_013693. 449 IEIAGIYKSPAFHNRGKYNNYNRMA-----WSASSDERAVLYRNQINSIVTFSN-EGIVLYGDKTGLTRPSAFDRINVRG 522 (631) Q Consensus 449 d~~~g~~~span~~~~~i~g~~~~~-----~~~~~~~~~~L~~~gin~i~~~~~-~G~~~wg~rT~~~~~~~~~~i~vrR 522 (631) ..-++.||.-..-+.+.|...+- ..++.+.++.|..+|-.+.+.++| .|+++-+.||++...++|+||..+| T Consensus 196 --aVsIadsp~RVktG~l~g~~~~p~d~~g~~l~~a~l~aLd~agysvp~~Y~gy~G~Yw~d~~tl~~~gsDYq~iE~~R 273 (369) T protein:vir:27 196 --EVSIADSPARVQTGSVLGNTELMKDKAGKALDLATLKALESNRIAVPMWYPDYPGQYWTTGRTLDVPGGDYQDIRHIR 273 (369) T ss_pred --ccchhcCcceeeecccccccccccCCCCcccCHHHHHHHHhCCCeEEEeeCCCCceEEeCceEeccCCCCeehhhhhh Confidence 22267889877666666644221 335667889999999999999998 4777778899998889999999999 Q ss_pred HHHHHHHHHHHHHHHHhcCCC---CHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEccCCCCHHHh-----hCCeEEEEE Q lcl|NC_013693. 523 LFIMAEQNIAAIAKYYLGENN---DEFTRSLFSNAVRPYIRQLANMGAIYDGQVKCDADNNTADII-----AANQMVAGI 594 (631) Q Consensus 523 ~~~~i~~~~~~~~~~~v~epn---~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~~~~~nt~~~i-----~~G~~~~~i 594 (631) .++-+.+.++...-..+..+. 
++.-.+..+..+..-|+.|.+.+ ..+.|...++ +|| .+.++.|.+ T Consensus 274 VvdKa~R~vR~~Ai~~i~Dr~lnstp~sia~~~~~~~~pLr~M~ks~--fpgei~~P~d----~dI~i~w~~k~~V~I~~ 347 (369) T protein:vir:27 274 VAMKAARKVRIRAIARIADRTLNSTPQSIAAAKLYFTQDLRTMALTG--VPGEIYPPED----EDIQIKWVNSTDVEIYM 347 (369) T ss_pred HHHHHHHHHHHHHHHHhcCcccccChhHHHHHHHHHhhHHHHHHhhc--CCeEEecCCC----CceEEEeeccceEEEEE Confidence 999888888766655554443 45566777778888899987653 2334444332 244 566888888 Q ss_pred EEEecCCceEEEEEEEEEecCceeeee Q lcl|NC_013693. 595 WLKPEYSINWVYLDFAAVRPDMEFSEI 621 (631) Q Consensus 595 ~~~p~~p~e~i~~~~~~~~~~~~~~e~ 621 (631) .+.|.--.+.|+.+|.-. .++. T Consensus 348 ~vrP~~~pk~it~~I~ld-----l~~~ 369 (369) T protein:vir:27 348 SVQPYECPVKITIAISVK-----QGDY 369 (369) T ss_pred EEeeccCCceEEEEEEEe-----ccCC Confidence 899999889999988743 2223 No 58 >protein:vir:78782 Length: 370 # NCBI annotation: putative tail sheath protein # Family: family:all:669 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285652;genbank:gi:148727158;genbank:GeneID:5220132 Probab=98.77 E-value=1.2e-08 Score=64.01 Aligned_cols=338 Identities=10% Similarity=0.002 Sum_probs=164.5 Q ss_pred cccccccceeeccccc---ccceeeeecccccccceeeeeeeeecc------cccccchhhhhhhhhccccceeeeeccc Q lcl|NC_013693. 249 IGPQTVTAIVPDANGL---TATAVTTTVGASGSIIEKYELMQATQG------SKKSDGSNAYFKDVINDTSNWVYTFATT 319 (631) Q Consensus 249 ~~~~~~~~~~~~~~~~---~~~~~~~~v~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 319 (631) --+.-....+..-.+. ....+.++ +....-..+...+....+ ..+.... ..+...-.+++........ 
T Consensus 1 ~~~~v~vn~~n~~~g~~~~~er~~lfi-g~~~~~~g~~~~~~~~sdld~~l~~~ds~lk-~~v~aa~~naG~~~~~~~~- 77 (370) T protein:vir:78 1 MWPYVQIYNLNQMQGPVTEVERHLLFI-GSAASNTGKLLSLNAQSDFDQLLGAADSELK-ANLLAARDNAGQNWSAAAY- 77 (370) T ss_pred CCceEEEeeccccCCCcCccceeEEEE-ecccccccceEeecCccCHHHhcCCcChhHH-HHHHHHHhCCCCceEEEEE- Confidence 0000000000000011 11111111 111111111101111111 1111000 0111111111111111000 Q ss_pred ccccccccccccccchhhhhhHHHHHhhhhhcccceeEEecc--ccchHHHHHHHHH----hh-ccceEEeecccccccc Q lcl|NC_013693. 320 LAAGVTELEGGVDDYTGNRVAAIEALNNAEAYDAKPVFAFCE--ELIEQQTLIDLST----ER-KDTVSFVSPLRDVVVG 392 (631) Q Consensus 320 ~~~~~~~l~gg~d~~~~~~~~~~~~l~~~~~~~~~~~i~~~~--~~~~~~~~~~~~~----~~-~~~~a~~d~~~~~~~~ 392 (631) .+. ...+...+.... .+.++......+-+ +.+...++.++++ ++ +..+.++..+.- T Consensus 78 ------p~~-----~~~d~~~Av~~a--~~~~s~E~V~v~~~~s~~a~~~a~~~~a~el~n~~~Rpv~file~~~~---- 140 (370) T protein:vir:78 78 ------VLP-----TDKPWLDAARDA--QQTQSFEGVVVLGQEWHQAAINAAHALNQELIAKWGRWQFMLLAVPAI---- 140 (370) T ss_pred ------Eec-----CchhHHHHHHHH--HhhCCccEEEEecCcchHHHHHHHHHHHHHHHHhcCCeEEEEEeecCC---- Confidence 010 111222333222 22333333333222 2233334434333 33 456777765421 Q ss_pred cccCCHHHHHHHHHh--cCCCcceEEEecCeeEEEeccCCceeEeehHHHHHHHHHHhhccCCceecccceeeceeeecc Q lcl|NC_013693. 393 NRGREMEDVVAWRES--LVRDSSYFFMDDNWAYVYDKYNDKMRWIPACGGTAGVWARSIEIAGIYKSPAFHNRGKYNNYN 470 (631) Q Consensus 393 ~~~~~~~~~~~~~~~--~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~ag~~a~~d~~~g~~~span~~~~~i~g~~ 470 (631) ..+.++++-.+.... -++.+.++.++.-|.. ..-|.+||.++.. .--+..+|.-...+.+.|.. 
T Consensus 141 ~~~e~w~~y~~~l~al~~gia~~~V~vvp~~~g------------~~~G~~aGRL~na--avsVadsP~Rv~tG~l~gl~ 206 (370) T protein:vir:78 141 ADEQDWATYEAELATLQDGIAASSVSLIPQLWP------------TLAGAYAGRLCNR--AVSIADSPCRVKTGALVGLG 206 (370) T ss_pred CCcCCHHHHHHHHHHhhhccccccceEEeeecc------------ccHHHHHHHHhcC--eeeecccceeeecccccccc Confidence 234555554433321 2345666666644321 1137778876542 11267788876666665532 Q ss_pred c-----ceecCChhHhhhhhhcCceEEEEEcCC-cEEEEcceecCCCChhhceehhhHHHHHHHHHHH-HHHHHHhcCCC Q lcl|NC_013693. 471 R-----MAWSASSDERAVLYRNQINSIVTFSNE-GIVLYGDKTGLTRPSAFDRINVRGLFIMAEQNIA-AIAKYYLGENN 543 (631) Q Consensus 471 ~-----~~~~~~~~~~~~L~~~gin~i~~~~~~-G~~~wg~rT~~~~~~~~~~i~vrR~~~~i~~~~~-~~~~~~v~epn 543 (631) . ....++.+.++.|..+|-.+.+.++|. |+++-+.|||+...++++||..+|+.+-+.+.++ ++++...+|-. T Consensus 207 ~~p~d~~~~~l~~a~l~aLd~agy~vp~~Y~gy~G~Y~~d~~tl~~~gsDYq~ie~~RVvdKa~R~vR~~ai~~i~D~~l 286 (370) T protein:vir:78 207 NKPVGKDGIPLPLATLQTLEANRYSVPMWYPDYDGIYWADGRTLDAEGGDYQVIENLRIAYKVARRMRLRAIARIGDRSF 286 (370) T ss_pred ccccccCCcccCHHHHHHHHhCCCeEEEeeCCCCceEEeCceEeccCCCChhhhhhhhHHHHHHHHHHHHHHHHhCCccc Confidence 2 124466788999999999999999984 7777788999888899999999999999998888 44455555433 Q ss_pred CH--HHHHHHHHHHHHHHHHHHhCCceee--eEEEEccCCC---CHHHhhCCeEEEEEEEEecCCceEEEEEEEEEecCc Q lcl|NC_013693. 544 DE--FTRSLFSNAVRPYIRQLANMGAIYD--GQVKCDADNN---TADIIAANQMVAGIWLKPEYSINWVYLDFAAVRPDM 616 (631) Q Consensus 544 ~~--~~~~~i~~~i~~~l~~l~~~gal~g--~~v~~~~~~n---t~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~ 616 (631) |+ ......+..+..-|++|...+.+.| |.-+|....+ ++.-+...++.+.+.+.|.--...|+..|.-. . T Consensus 287 nst~gsia~~~~~~~~~L~ema~s~~i~~~~fpgeI~~p~d~Di~i~w~s~~~v~I~~~v~P~~~pk~Itv~I~LD---l 363 (370) T protein:vir:78 287 NSTPGSTAAAITYFGKDLREMAKSTTINGQPFPGDIASPQDGDIRIQWVAKNLVSVFVVVRTVDCPKGITVNIMLD---L 363 (370) T ss_pred CCCCcchhHHHHHHHhhHHHHHhhhhhcccccceeEeccCCCcceEEeeccceEEEEEEEEeccCCceEEEEEEEe---e Confidence 32 2233444555556777777887777 4444432111 22234788899999999999999999988643 2 Q ss_pred eeeeeecc Q lcl|NC_013693. 
617 EFSEIETG 624 (631) Q Consensus 617 ~~~e~~~~ 624 (631) ++++. +| T Consensus 364 s~e~~-~~ 370 (370) T protein:vir:78 364 SLNNG-EG 370 (370) T ss_pred ccccC-CC Confidence 32222 22 No 59 >protein:vir:80052 Length: 331 # NCBI annotation: gp14 # Family: family:all:396 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468718;genbank:gi:157325298;genbank:GeneID:5601743 Probab=98.59 E-value=2.3e-07 Score=56.96 Aligned_cols=316 Identities=11% Similarity=-0.057 Sum_probs=160.1 Q ss_pred cccccccccccccccccccccceeecccccccceeeeecccccccceeeeeeeee-cccccccchhhhhhhhhcccccee Q lcl|NC_013693. 235 VVVKSNTVTVTHKAIGPQTVTAIVPDANGLTATAVTTTVGASGSIIEKYELMQAT-QGSKKSDGSNAYFKDVINDTSNWV 313 (631) Q Consensus 235 ~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~ 313 (631) ....-..+.+......+... ........+..+.. .....+...... .+..............+..+.... T Consensus 1 ~~~~iv~V~v~~~~~~~~~~--------~~~~~~~~~~~~t~-~~~~~y~s~~~v~~d~~~~~~~Ykaa~~~f~Q~~~~~ 71 (331) T protein:vir:80 1 MVETITDVRVHISVLYPSPR--------IGLGRPAIFVKGTA-MGYKEYTTLEELKDTFADNTEVYAKAKAVFLQKDRPD 71 (331) T ss_pred Cccceecceeeecccccccc--------cccCcceeEEeccc-cceEEEechhhhccCCCCCcHHHHHHHHHHhccCccc Confidence 00000001100000000000 00000000111111 111111111111 011111111111112222221111 Q ss_pred eeecccccccccccccccccchhhhhhHHHHHhhhhhcccceeEEeccccchHHHHHHHHHhhccceEEeeccccccccc Q lcl|NC_013693. 314 YTFATTLAAGVTELEGGVDDYTGNRVAAIEALNNAEAYDAKPVFAFCEELIEQQTLIDLSTERKDTVSFVSPLRDVVVGN 393 (631) Q Consensus 314 ~~~~~~~~~~~~~l~gg~d~~~~~~~~~~~~l~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~a~~d~~~~~~~~~ 393 (631) .+..+....+. .......... -+...+++.-...+...++.+.++..+.+|.+.+. 
T Consensus 72 ------------~i~v~~~~~~~---~~~a~~a~~~-~~w~~~~~~~~~~~~~~a~a~~~~a~~~~f~~~~~-------- 127 (331) T protein:vir:80 72 ------------TVAVITYEDTK---LLEAAEAYFL-KSWHFALLAEFKAADALALSNLIEEQKFKFAVFQV-------- 127 (331) T ss_pred ------------eEEEeccchHH---HHHHHHHhcc-CceeEEEeecCCHHHHHHHHHHHhhCCcEEEEEec-------- Confidence 11111111111 1111111111 12223333222333445666777777777765532 Q ss_pred ccCCHHHHHHHHHhcCCCcceEEEecCeeEEEeccCCceeEeehHHHHHHHHHHhhccCCceecccceeeceeeecccce Q lcl|NC_013693. 394 RGREMEDVVAWRESLVRDSSYFFMDDNWAYVYDKYNDKMRWIPACGGTAGVWARSIEIAGIYKSPAFHNRGKYNNYNRMA 473 (631) Q Consensus 394 ~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~ag~~a~~d~~~g~~~span~~~~~i~g~~~~~ 473 (631) +...++....+ .+....++++. .+ --+.+.+.|.++.++.-+--| .+|. .+.|+.. T Consensus 128 --~~~~~~~~~~~----~~~t~~~~~~~-------~~----~~~~aa~~g~~~~~~~g~~t~---~fk~--~l~GV~~-- 183 (331) T protein:vir:80 128 --TAVADITPLAK----NTRTIAIVHSK-------TG----EKLDAALIGNVASLPVGSATW---KGRH--GLAGITS-- 183 (331) T ss_pred --CchHHHHHhhc----cccEEEEEcCC-------cc----chhHHHHHHHHHhcCccceee---eeec--ccCCCCC-- Confidence 12222222211 22334444432 11 114566667777666433222 2331 2445443 Q ss_pred ecCChhHhhhhhhcCceEEEEEcCCcEEEEcceecCCCChhhceehhhHHHHHHHHHHHHHHHHHhcC----CCCHHHHH Q lcl|NC_013693. 474 WSASSDERAVLYRNQINSIVTFSNEGIVLYGDKTGLTRPSAFDRINVRGLFIMAEQNIAAIAKYYLGE----NNDEFTRS 549 (631) Q Consensus 474 ~~~~~~~~~~L~~~gin~i~~~~~~G~~~wg~rT~~~~~~~~~~i~vrR~~~~i~~~~~~~~~~~v~e----pn~~~~~~ 549 (631) -.++..|++.|..+++|++.++.+.. .+....|+.++ ||.+.+-.+|++..|++.+...+-. |-|+.=.. 
T Consensus 184 ~~lt~t~~~al~~~~~N~y~~~~~~~-~~~~G~~~~G~-----~iD~~~~~dWl~~~lq~~l~~ll~~~~kiPy~~~G~~ 257 (331) T protein:vir:80 184 EELKVSEIDAIQKAGGMCYIEKAGIA-QTSEGKTVSGE-----FIDSIHGDDWIKATIETRLQKLLTETDKLTFDARGIA 257 (331) T ss_pred CCCCHHHHHHHHhcCceEEEEecCee-EEecceEeCch-----hHHHHHHHHHHHHHHHHHHHHHHHhCCCCccChhhHH Confidence 35789999999999999999987654 45666776552 7899999999999999988765533 23666778 Q ss_pred HHHHHHHHHHHHHHhCCcee--------eeEEEEc-cCCCCHHHhhCCeEE-EEEEEEecCCceEEEEEEEEEe Q lcl|NC_013693. 550 LFSNAVRPYIRQLANMGAIY--------DGQVKCD-ADNNTADIIAANQMV-AGIWLKPEYSINWVYLDFAAVR 613 (631) Q Consensus 550 ~i~~~i~~~l~~l~~~gal~--------g~~v~~~-~~~nt~~~i~~G~~~-~~i~~~p~~p~e~i~~~~~~~~ 613 (631) .|+..++.-|++.+++|.|. ||+|.+. .++.+++|..++++. +.+.+++...+++|++++..+- T Consensus 258 ~l~a~i~~~~~~av~~G~I~~g~~~~~~~~~v~~~~~~~~s~~dr~~R~~~~~~~~~~~~gaI~~v~i~~~v~~ 331 (331) T protein:vir:80 258 LLQSELTTVLNEGFANGIIDSNDETGEPNFSITALQRSDLNDDDIAKRNYKGLSFRYKRSGAIHSVDVYGEVEV 331 (331) T ss_pred HHHHHHHHHHHHHHhCCceecCccCCCcceEEEeCchhcCCHHHHhccCCCCeEEEEEEcceEEEEEEEEEEeC Confidence 99999999999999999995 6889886 577899999999887 8888999999999999877543 No 60 >protein:vir:5260 Length: 502 # NCBI annotation: hypothetical protein # Family: family:all:396 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852765;genbank:gi:31544040;uniprot:Q7Y5T5;genbank:GeneID:2753559 Probab=98.25 E-value=2.1e-06 Score=51.73 Aligned_cols=459 Identities=13% Similarity=0.019 Sum_probs=214.9 Q ss_pred CCCcchhcCCceEEEEecCCCcee-cccCCceEEEeee-ccCcC--CCCeEEecCHHHHHHHcCCCCccchhHHHHHHHH Q lcl|NC_013693. 1 MATQSFSVAPSVQWTERDATLQTS-PSVVVQGATVGKF-QWGEA--ELPVLVTGGETGLVKKFFKPNDATATDFLVIADF 76 (631) Q Consensus 1 m~~~~~ylsPGVyveEv~~~~~~~-gv~tsv~afvG~~-~~Gp~--~~p~~i~s~~~e~~~~fG~~~~~~~~~~av~~fF 76 (631) |+-+. +.=|.|. +...+..+ +..=+...|+|.. ..-|. .+.++..++..|-...||. 
.+..+.+.+.|| T Consensus 1 msip~---s~ivnV~-i~~~~~a~~~~~f~~~l~l~~~~~~~~~~~~~r~~~~~s~~~V~~~FG~---~s~ey~aA~~yF 73 (502) T protein:vir:52 1 MALSI---SHIVNVQ-LNTVPKSAARKSFGIVALFTPEAGQAFADEKTRYVYVENQRDVEQLFGT---NSETAKAAQPFF 73 (502) T ss_pred CCCCc---cceeEEe-eccccccccccccCceEEEeeccCccccCCccceEEecCHHHHHHhcCC---ChHHHHHHHHHh Confidence 77763 2222232 12222223 3445677888874 34333 3456667777889999995 667788999999 Q ss_pred HhCC--ceEEEEEecccCCCcccccccchhhhccccccccccccceeeehhhhhh-hhhhch-hhh-hccCcccceeecc Q lcl|NC_013693. 77 LSYS--SVAWVTRVVGPAARNAVTKGQTAILIRNKLDFETASPSASITWTGRYAG-SLGNDV-AIN-VCDAAGFPTWEFR 151 (631) Q Consensus 77 ~ngG--~~~~vvRv~~~~a~~a~~~~~~~~~~~~~~~~~~~~~~~~l~~~a~~~G-~~gn~l-~v~-v~~~~~~~~~~~~ 151 (631) .+-= +++||-|-......... .+ +...| .....+ .++ +.++ . .+ T Consensus 74 ~q~p~P~~l~igR~~~~~~~~~~-~~------------------------~~~~~~~~~~~~~~~~~~~~G-~-l~---- 122 (502) T protein:vir:52 74 AQSPRAKQLIVARWQKSASTIEA-TK------------------------NTLSGATLSDDLERFKSVVNG-R-FS---- 122 (502) T ss_pred cCCCccceEEEEeccccccceee-ch------------------------hhhhhhhhHHhHHHhhhhcCc-e-eE---- Confidence 5432 47888776432211000 00 00000 000000 000 0000 0 00 Q ss_pred ceeeeecccccceEEeeeeeeeeecccccccceeeeeecccccccceeEeeccccccccccccccccccccccccccccc Q lcl|NC_013693. 152 NNFAYAPQAGEYHIVIVDKVGRITDSSGAVGQVDRISVSGTATGAGSISVAGEDVAYTDTDTPATLATKIGTALTALTDV 231 (631) Q Consensus 152 ~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 231 (631) +.-.|..... . .+..+....- .. +.......... T Consensus 123 -----i~i~g~~~t~--------~----------~i~lS~~ts~--------~~-----------vA~~i~~~l~~---- 156 (502) T protein:vir:52 123 -----LTIGGDVKKV--------D----------GLSFARLADF--------NA-----------VATKIQEKLTT---- 156 (502) T ss_pred -----EEecceeeee--------e----------ccccccccch--------hH-----------HHHHHHhhhcc---- Confidence 0000000000 0 0000000000 00 00000000000 Q ss_pred ccccccccccccccccccccccccceeecccccccceeeeecccccccce-eeeeeeeecccccccchhhhhhhhh--cc Q lcl|NC_013693. 
232 YSSVVVKSNTVTVTHKAIGPQTVTAIVPDANGLTATAVTTTVGASGSIIE-KYELMQATQGSKKSDGSNAYFKDVI--ND 308 (631) Q Consensus 232 ~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~--~~ 308 (631) ......+.. +.. ...+.+.....+.... ..... .. ....+. ++...+ .. T Consensus 157 ------~~~~~tv~~-----d~~-----------~~~F~i~s~ttg~~~~~~~~~a-~~---~~~~gt--~~a~~l~l~~ 208 (502) T protein:vir:52 157 ------LSVAVSIAY-----DET-----------GNRFIVSANVAGEDKKTEIDYA-ID---EGGEGE--YIGALLKLEN 208 (502) T ss_pred ------cccceEEEE-----ecC-----------CceEEEEeccCCCcceeEEEEe-ec---CCcchh--HHHHHhcccc Confidence 000000000 000 0001111111110000 00000 00 000000 010000 00 Q ss_pred ccceeeeecccccccccccccccccchhhhhhHHHHHhhhhhcccc-eeEEecccc--chHHHHHHHHHhhccceEEeec Q lcl|NC_013693. 309 TSNWVYTFATTLAAGVTELEGGVDDYTGNRVAAIEALNNAEAYDAK-PVFAFCEEL--IEQQTLIDLSTERKDTVSFVSP 385 (631) Q Consensus 309 ~~~~~~~~~~~~~~~~~~l~gg~d~~~~~~~~~~~~l~~~~~~~~~-~~i~~~~~~--~~~~~~~~~~~~~~~~~a~~d~ 385 (631) ......... ...|... ... .+.+.++...... ..+..+... +...++.++++..+.+|.+... T Consensus 209 ~~~av~v~~---------~~~g~~a--et~---~~al~a~~~~~~~w~~~~~a~~~~~~~~la~a~~iea~~~~f~~~~~ 274 (502) T protein:vir:52 209 GQASRKVGK---------NSVSLKK--ETL---GEALFNVAEVNNTWYGFTVAAQLTDSEVEAAAKYAQANTKLFGANVI 274 (502) T ss_pred ccceeeeee---------ecccccc--cCH---HHHHHHHHhccCceEEEEEeecCChhHHHHHHHHHhhcCcEEEEEec Confidence 000000000 0111111 111 1222222222211 122223332 3345666778877766665432 Q ss_pred ccccccccccCCHHHHHHHHHhcCCCcceEEEecCeeEEEeccCCceeEeehHHHHHHHHHHhhccCC-ceecccceeec Q lcl|NC_013693. 386 LRDVVVGNRGREMEDVVAWRESLVRDSSYFFMDDNWAYVYDKYNDKMRWIPACGGTAGVWARSIEIAG-IYKSPAFHNRG 464 (631) Q Consensus 386 ~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~ag~~a~~d~~~g-~~~span~~~~ 464 (631) .....+.. ..+....-...++. .-+.+|++ .+ -.+.+.++|.++.+|-.+- -...-.+|. 
T Consensus 275 d~~~~~~~----~~~i~~~l~a~~~~-~t~~~y~~-------~~-----~~~~aa~~g~~as~~f~~~~g~iT~~fk~-- 335 (502) T protein:vir:52 275 RAEQIEWS----ADNIYKKLYDAGLD-HTLAMFDK-------ND-----MYPVSSALARLLSTNFAANNSTLTLKFKQ-- 335 (502) T ss_pred Ccceeccc----cchHHHHHHhccCc-eeEEEecC-------Cc-----chhHHHHHHHHHhcCCCcCcceeeecccc-- Confidence 22222111 11222211222221 12333332 11 1256777888888874331 122233444 Q ss_pred eeeecccceecCChhHhhhhhhcCceEEEEEcCCcEEEEcceecCCCChhhceehhhHHHHHHHHHHHHHHHHHhcC--- Q lcl|NC_013693. 465 KYNNYNRMAWSASSDERAVLYRNQINSIVTFSNEGIVLYGDKTGLTRPSAFDRINVRGLFIMAEQNIAAIAKYYLGE--- 541 (631) Q Consensus 465 ~i~g~~~~~~~~~~~~~~~L~~~gin~i~~~~~~G~~~wg~rT~~~~~~~~~~i~vrR~~~~i~~~~~~~~~~~v~e--- 541 (631) +.|+.. -.+++.|++.|..+++|++.++.+.++ +...+++.+ .||-+.+-.+||+..|++.+...++. T Consensus 336 -l~GV~~--~~lt~t~~~al~~~~~N~y~~~~~~~~-~~~G~~~~G-----~~iD~~~~~~Wl~~~lq~~l~~~L~~s~~ 406 (502) T protein:vir:52 336 -QPTITA--DEITATEFAKAKRLGINVYTYFDDVAM-IAEGTVIGG-----KFADEIVILDWFVDAVQKEVFARLYKSPT 406 (502) T ss_pred -cCCccc--CcCCHHHHHHHHhcCceEEEEecCeeE-EecCeeeCC-----chhhHHHHHHHHHHHHHHHHHHHHHhcCC Confidence 444432 357899999999999999999866544 556677655 27778899999999999998765542 Q ss_pred --CCCHHHHHHHHHHHHHHHHHHHhCCcee--------------------eeEEEEc-cCCCCHHHhhCCeE-EEEEEEE Q lcl|NC_013693. 542 --NNDEFTRSLFSNAVRPYIRQLANMGAIY--------------------DGQVKCD-ADNNTADIIAANQM-VAGIWLK 597 (631) Q Consensus 542 --pn~~~~~~~i~~~i~~~l~~l~~~gal~--------------------g~~v~~~-~~~nt~~~i~~G~~-~~~i~~~ 597 (631) |-|+.=...|+..|+.-|++.+++|.|. ||.|.+. .++.+++|..+++. -+.+.++ T Consensus 407 kIPy~~~G~~~l~a~i~~~l~~a~~~G~I~~G~~~~~~~g~~~~~d~~~~gy~v~~~~~~~~s~~dr~~R~~~~~~~~~~ 486 (502) T protein:vir:52 407 KIPLTDKGQAILIAAVEKVCLEGINNGAFAPGKWTGAGFGNLSTGDYLDKGFYVWAAPMDTLSDSDRQARRATPIQTAVK 486 (502) T ss_pred CcccChhHHHHHHHHHHHHHHHHHhcCccccccccCcccceeeecccccCceEEEeCchhhCCHHHHHcccCCCeEEEEE Confidence 3467778999999999999999999984 6888886 57789999999998 8999999 Q ss_pred ecCCceEEEEEEEEEe Q lcl|NC_013693. 
598 PEYSINWVYLDFAAVR 613 (631) Q Consensus 598 p~~p~e~i~~~~~~~~ 613 (631) +...+++|+|.+...| T Consensus 487 ~aGaIh~v~i~~nv~~ 502 (502) T protein:vir:52 487 LAGAIHSSDVIVNYNR 502 (502) T ss_pred ECceEEEEEEEEEEeC Confidence 9999999999888666 No 61 >protein:vir:3165 Length: 426 # NCBI annotation: capsid protein CP67 # Family: family:all:28419 # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665936;genbank:gi:22091122;genbank:GeneID:951267 Probab=95.35 E-value=0.0022 Score=35.11 Aligned_cols=393 Identities=8% Similarity=-0.003 Sum_probs=145.1 Q ss_pred ecccccceEEeeeeeeeeecccc-------cccceeeeeecccccccceeEeeccccccccccccccccccccccccc-- Q lcl|NC_013693. 157 APQAGEYHIVIVDKVGRITDSSG-------AVGQVDRISVSGTATGAGSISVAGEDVAYTDTDTPATLATKIGTALTA-- 227 (631) Q Consensus 157 ~~~~g~~~~~~~~~~~~v~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-- 227 (631) .+ . ...++..... .++....+......+ ........ ........+..+.+..... T Consensus 1 m~---~-------~iVnV~Is~~t~A~~~~~Fg~~liigs~~~~~----p~~~f~~~--~~Yss~~~V~~Dfg~~s~~Y~ 64 (426) T protein:vir:31 1 MP---K-------QIVEIELTAEIADRPQETFTDAAIVGTAEEEP----PDAEFGEV--NQYSTSTSVGDDYGEDSDVYT 64 (426) T ss_pred CC---c-------ceEEEEeecccccccccccceeeeeeeccccc----cccccchh--hhhhhHHHHHhcCCCChHHHH Confidence 00 0 0000000000 000000000000000 00000000 0000000011111111000 Q ss_pred -ccccccccccccccccccccccccccccceeecccccccceeeeecccccccceeeeeeeeecccccccchhhhhhhhh Q lcl|NC_013693. 228 -LTDVYSSVVVKSNTVTVTHKAIGPQTVTAIVPDANGLTATAVTTTVGASGSIIEKYELMQATQGSKKSDGSNAYFKDVI 306 (631) Q Consensus 228 -~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 306 (631) ....+... .... +.. ............ .++..+...... ...+. 
......+...+ T Consensus 65 AA~~~f~Q~-~~~~--r~~-------v~~at~~~~~~~----------t~~~tv~g~~~s-~~a~~---~~~a~~i~~~~ 120 (426) T protein:vir:31 65 ASEAIEEMG-AEQW--RVM-------VLEATEVTEEEL----------SDGDTIDKVPIL-GNHEV---ESPDGDIEFTT 120 (426) T ss_pred HHHHHHhCC-ceeE--Eee-------ccccceeeeccC----------Ccceeecceeee-ecccC---cchHHHHHHhh Confidence 00111100 0000 000 000000000000 000000000000 00000 00011111100 Q ss_pred c----cccce-eeeecccccccccccccccccchh-hhhhHHHHHhhhhh-cccceeEEeccccchHHH---HHHHHHhh Q lcl|NC_013693. 307 N----DTSNW-VYTFATTLAAGVTELEGGVDDYTG-NRVAAIEALNNAEA-YDAKPVFAFCEELIEQQT---LIDLSTER 376 (631) Q Consensus 307 ~----~~~~~-~~~~~~~~~~~~~~l~gg~d~~~~-~~~~~~~~l~~~~~-~~~~~~i~~~~~~~~~~~---~~~~~~~~ 376 (631) . ..... ........ ..+..+...... .....|.++..+.. ++...+.........+.. +.+.++. T Consensus 121 ~~~~~~~~~~~~~~~~t~~----g~~t~~~~~~~~~~s~~dw~~~~~~~s~~~~~~ia~~~~~~~~~~~~~~~~~wa~~- 195 (426) T protein:vir:31 121 DDDPDVEDFDAEIVINSAT----GDVATSEDSIELTYFHADWSQLDEFPSDVNNFAVADRRFDLKGVGVLDETHSWASD- 195 (426) T ss_pred ccccccccceeeeEecccc----ceeeccccceeeeeccCcchhhhcccccchhhhhhccccchhhhhhhHhhhhhhhh- Confidence 0 00000 00000000 000111000000 00111222221111 111111111111111111 1111111 Q ss_pred ccceEEeecccccccccccCCHHHHHHHHHhcCCCcceEEEecCeeEEEeccCCceeEeehHHHHHHHHHHhhccCCcee Q lcl|NC_013693. 377 KDTVSFVSPLRDVVVGNRGREMEDVVAWRESLVRDSSYFFMDDNWAYVYDKYNDKMRWIPACGGTAGVWARSIEIAGIYK 456 (631) Q Consensus 377 ~~~~a~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~ag~~a~~d~~~g~~~ 456 (631) .+.+.+...- ....-...+..++++..- .-|.|........ ....--..+++++.++..+ ||. 
T Consensus 196 ~~i~~va~~~----e~~~~~~~~~~~a~~~~~-------~~y~p~~~~~~~~--~~~~~~~~~~~~~~~aa~~----~~~ 258 (426) T protein:vir:31 196 EDMGMIANGV----NVDDYDSVDEAMDVAHEV-------AGYVPSGDLMMIV--DASDDDLAAYQLGKFAVSE----PWY 258 (426) T ss_pred cceeeeeecc----chhhhcchhhhhhhhhcc-------cccccchhheeeh--hccccchhhHHhhhhhhhc----ccc Confidence 1222222110 001111122334443321 1122321111000 0000113567888888776 566 Q ss_pred cccceeeceeee----cc--cceecCChhHhhhhhhcCceEEEEEcCCcEEEEcceecCCCChhhceehhhHHHHHHHHH Q lcl|NC_013693. 457 SPAFHNRGKYNN----YN--RMAWSASSDERAVLYRNQINSIVTFSNEGIVLYGDKTGLTRPSAFDRINVRGLFIMAEQN 530 (631) Q Consensus 457 span~~~~~i~g----~~--~~~~~~~~~~~~~L~~~gin~i~~~~~~G~~~wg~rT~~~~~~~~~~i~vrR~~~~i~~~ 530 (631) .|.-+...+-.. .. +..-.+...++-.++ +..|+++.+.+ +.++|-.-|..+....-.||-++|..+||++. T Consensus 259 ~~~~~~~~~~~~~~~~~~~~gv~~t~~~~~~A~~~-~~~n~~~~~~~-~~~i~~~~~~~G~~~~G~~iD~~~g~dwl~~~ 336 (426) T protein:vir:31 259 NPLWNELPAGETVSKNVGDPEEQGTFEGGDEAEGE-GPVNVLIDVSD-ANRVSNAVTTAGADSDTSFFDIRRTKVYTAEM 336 (426) T ss_pred chhhhhccccccceeeccccccccccchhhhhhhc-CCceEEEEecC-ceeeecceeecccccchhhhhhHHHHHHHHHH Confidence 654222111111 11 111122233444555 77899998854 67788766766666666799999999999999 Q ss_pred HHHHHHHHhc---C-CCCHHHHHHHHHHHHHHHHHHHhCCc--eeeeEEEEccCCCCHHHhhCCeEE-EEEEEEecCCce Q lcl|NC_013693. 531 IAAIAKYYLG---E-NNDEFTRSLFSNAVRPYIRQLANMGA--IYDGQVKCDADNNTADIIAANQMV-AGIWLKPEYSIN 603 (631) Q Consensus 531 ~~~~~~~~v~---e-pn~~~~~~~i~~~i~~~l~~l~~~ga--l~g~~v~~~~~~nt~~~i~~G~~~-~~i~~~p~~p~e 603 (631) ++..++..+= + |-++.-...|+..|+.-|++.++.|. +.+|.|....-..++.|..+.++. +++..+....++ T Consensus 337 iq~~l~~ll~~~~KIpyt~~Gi~~I~~~i~~~L~~~v~~~g~~~~~y~v~~P~~~~~~~dra~R~~~~i~~~~~laGAIh 416 (426) T protein:vir:31 337 LELDLESLQVSDDDVPFTEDGQAMIEDAIKGTMSGLTGSVGQPLAEYEVDVPEWDDDDVDRVNRNWGGIDLDARLAQRAH 416 (426) T ss_pred HHHHHHHHhhcCCCCccchhHHHHHHHHHHHHHHHHhcCCCccccceeecCCCccccchhhhhhccCCceEEEEEeCcEE Confidence 9999976652 2 55778888999999999999988643 457998877555566787877777 888889999999 Q ss_pred EEEEEEEEEe Q lcl|NC_013693. 
604 WVYLDFAAVR 613 (631) Q Consensus 604 ~i~~~~~~~~ 613 (631) ++.|+...+- T Consensus 417 ~v~I~g~v~v 426 (426) T protein:vir:31 417 TFSLGLNVSV 426 (426) T ss_pred EEEEEEEEeC Confidence 9999877543 No 62 >protein:vir:96104 Length: 504 # NCBI annotation: hypothetical protein ORF029 # Family: family:all:396 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294446;genbank:gi:149408343;genbank:GeneID:5237223 Probab=94.75 E-value=0.0036 Score=33.99 Aligned_cols=448 Identities=11% Similarity=-0.006 Sum_probs=192.2 Q ss_pred CCCcchhcCCceEEEEecCCCcee---cccCCceEEEeeeccCcCCCCeEEecCHHHHHHHcCCCCccchhHHHHHHHHH Q lcl|NC_013693. 1 MATQSFSVAPSVQWTERDATLQTS---PSVVVQGATVGKFQWGEAELPVLVTGGETGLVKKFFKPNDATATDFLVIADFL 77 (631) Q Consensus 1 m~~~~~ylsPGVyveEv~~~~~~~---gv~tsv~afvG~~~~Gp~~~p~~i~s~~~e~~~~fG~~~~~~~~~~av~~fF~ 77 (631) |...+.| |.-.+ +..+. .-.-.+..|++....=|++. ++..++..|-...||. .+..+.+.+.||- T Consensus 1 mip~s~i------V~V~~-~v~~~~~~~~~~~~~l~l~~~~~~~~~r-~~~y~s~~~V~~~FG~---~S~ey~aA~~yF~ 69 (504) T protein:vir:96 1 MISQSRY------IRIIS-GVGAGAPVAGRKLILRVMTTNNVIPPGI-VIEFDNANAVLSYFGA---QSEEYQRAAAYFK 69 (504) T ss_pred CCCccce------eEeee-cccccccccccccceeEeecccCCCccc-eEEecCHHHHHHhcCC---ChHHHHHHHHHhh Confidence 8887654 44322 22222 22345678888777777755 5666777889999997 5577888999998 Q ss_pred hCC------ceEEEEEecccCCCcccccccchhhhccccccccccccceeeehhhhhhhhhhchhhhhccCcc------c Q lcl|NC_013693. 78 SYS------SVAWVTRVVGPAARNAVTKGQTAILIRNKLDFETASPSASITWTGRYAGSLGNDVAINVCDAAG------F 145 (631) Q Consensus 78 ngG------~~~~vvRv~~~~a~~a~~~~~~~~~~~~~~~~~~~~~~~~l~~~a~~~G~~gn~l~v~v~~~~~------~ 145 (631) +-- +++||-|=... +..+.-.+.. .. .....+.+...|. +++++.-... . 
T Consensus 70 ~~~~~~~~P~~l~igR~~~~-a~~~~l~g~~--~~-----------~~~~~~~~i~~G~----lsitv~G~~~~~~~i~~ 131 (504) T protein:vir:96 70 FISKSVNSPSSISFARWVNT-AIAPMVVGDN--LP-----------KTIADFAGFSAGV----LTIMVGAAEKNITAIDT 131 (504) T ss_pred cCCCCCccccEEEEEeecCc-CccceEEech--hH-----------HHHHHHhhhhceE----EEEEEcceeeeeccccc Confidence 743 79999996432 1111100000 00 0001111111111 1222211000 0 Q ss_pred ceeeccceeeeecccccceEEeeeeeeeeecccccccceeeeeecccccccceeEeeccccccccccccccccccccccc Q lcl|NC_013693. 146 PTWEFRNNFAYAPQAGEYHIVIVDKVGRITDSSGAVGQVDRISVSGTATGAGSISVAGEDVAYTDTDTPATLATKIGTAL 225 (631) Q Consensus 146 ~~~~~~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 225 (631) ...+....++...+..-..... .... .. ....... .......... +.. ... T Consensus 132 S~~ts~~~vA~~i~~al~~~~~------~~~~-~~-----tv~~d~~--~~~f~its~~------tg~-~~~-------- 182 (504) T protein:vir:96 132 SAATSMDNVASIIQTEIRKNTD------PQLA-QA-----TVTWNPN--TNQFTLVGAT------IGT-GVL-------- 182 (504) T ss_pred ccccchHHHHHHHHhhhhcccc------cccc-cc-----eEEEecc--CCeEEEEeec------ccc-cee-------- Confidence 0000000000000000000000 0000 00 0000000 0000000000 000 000 Q ss_pred ccccccccccccccccccccccccccccccceeecccccccceeeeecccccccceeeeeeeeecccccccchhhhhhhh Q lcl|NC_013693. 226 TALTDVYSSVVVKSNTVTVTHKAIGPQTVTAIVPDANGLTATAVTTTVGASGSIIEKYELMQATQGSKKSDGSNAYFKDV 305 (631) Q Consensus 226 ~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 305 (631) ......... ..... +.+... ... T Consensus 183 ------~~~~~a~~~------------~~~~~-----------lgl~~~-~~~--------------------------- 205 (504) T protein:vir:96 183 ------AVAKSADPQ------------DMSTA-----------LGWSTS-NVV--------------------------- 205 (504) T ss_pred ------EEEeecccc------------chhhh-----------hhcccc-cce--------------------------- Confidence 000000000 00000 000000 000 Q ss_pred hccccceeeeecccccccccccccccccchhhhhhHHHHHhhhhhcccceeEEeccc-cchHHHHHHHHHhhccceEEee Q lcl|NC_013693. 
306 INDTSNWVYTFATTLAAGVTELEGGVDDYTGNRVAAIEALNNAEAYDAKPVFAFCEE-LIEQQTLIDLSTERKDTVSFVS 384 (631) Q Consensus 306 ~~~~~~~~~~~~~~~~~~~~~l~gg~d~~~~~~~~~~~~l~~~~~~~~~~~i~~~~~-~~~~~~~~~~~~~~~~~~a~~d 384 (631) ...|.+..+ ...+...+.++..--..+.++..+. -+...++.++++....++.+.- T Consensus 206 ---------------------~v~g~~aet--~~~al~al~~~~~~Wy~f~~a~~~~~dd~ilalA~w~ea~~~~~~~~~ 262 (504) T protein:vir:96 206 ---------------------NVAGQAADL--PDAAVAKSTNVSNNFGSFLFAGATLDNDQIKAVSAWNAAQNNQFIYTV 262 (504) T ss_pred ---------------------EEeeccccc--HHHHHHHHHhhcCCeEEEEEEeccCCHHHHHHHHHHHhhcCceEEEEE Confidence 000000000 0001111111110001111111111 1122355566666555543331 Q ss_pred cccccccccccCCHHHHHHHHHhcCCCcceEEEecCeeEEEeccCCceeEeehHHHHHHHHHHhhccC--Cceeccccee Q lcl|NC_013693. 385 PLRDVVVGNRGREMEDVVAWRESLVRDSSYFFMDDNWAYVYDKYNDKMRWIPACGGTAGVWARSIEIA--GIYKSPAFHN 462 (631) Q Consensus 385 ~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~ag~~a~~d~~~--g~~~span~~ 462 (631) . ...+.. ..........+ .....+++.. ... --+..+.++.++.+|-++ | -..-.+|. T Consensus 263 ~------~~~~~~-~~~~~~~~~~~--~~~~~~~~~~------~~~----~~~~~~~~~~~as~~f~~~ng-~~T~~fk~ 322 (504) T protein:vir:96 263 A------TSLANL-GALFDLVKGNS--GTALNVLSAT------ASN----DFVEQCPSEILAATNYDEPGA-SQNYMYYQ 322 (504) T ss_pred e------ecccch-hhHHHhhhhcc--eeEEEEeecC------ccc----hhHHHHHHHHHHhcCcCcccc-cccccccc Confidence 1 001111 11211111111 1111222111 001 124555677777776333 2 01122333 Q ss_pred eceeeecccceecCChhHhhhhhhcCceEEEEEcCCc--EEEE-cceecCCCChhhceehhhHHHHHHHHHHHHHHHHHh Q lcl|NC_013693. 463 RGKYNNYNRMAWSASSDERAVLYRNQINSIVTFSNEG--IVLY-GDKTGLTRPSAFDRINVRGLFIMAEQNIAAIAKYYL 539 (631) Q Consensus 463 ~~~i~g~~~~~~~~~~~~~~~L~~~gin~i~~~~~~G--~~~w-g~rT~~~~~~~~~~i~vrR~~~~i~~~~~~~~~~~v 539 (631) +.|+.. ..+++.|.+.|..+|+|++..+.+.| +.+| ...++.+ ..+|.+|.+-+-.+||+..|+..+.... 
T Consensus 323 ---l~GVta--~~lt~t~~~aL~~~~~N~y~~~a~~~~~~~~~~~G~~~gG-~~~~~wiDv~~~~~WL~~~lq~~l~~l~ 396 (504) T protein:vir:96 323 ---FPGRNI--TVSDDTAANTVDKSRGNYIGVTQANGQQLAFYQRGILCGG-PTDAVDMNVYANEIWLKSAIAQALLDLF 396 (504) T ss_pred ---cCCcCc--ccCCHHHHHHHHhcCCeEEEEeecccceeeEEecCeeeCC-ccccchhhhhhhHHHHHHHHHHHHHHHH Confidence 444432 36789999999999999998886544 4555 3444433 2368889999999999999999997754 Q ss_pred cCCC----CHHHHHHHHHHHHHHHHHHHhCCcee-----------------------------eeEEEEc-cCCCC-HHH Q lcl|NC_013693. 540 GENN----DEFTRSLFSNAVRPYIRQLANMGAIY-----------------------------DGQVKCD-ADNNT-ADI 584 (631) Q Consensus 540 ~epn----~~~~~~~i~~~i~~~l~~l~~~gal~-----------------------------g~~v~~~-~~~nt-~~~ 584 (631) -.++ |+.=...|+..++.-|++-+++|.|. ||.|.++ .++-+ .+. T Consensus 397 ~~~~kIPyt~~Gi~~l~a~i~~vl~~av~~G~I~~Gv~~~~~q~~~I~~~~~~d~~~~~~~~~GYyv~~~~~s~~s~~~r 476 (504) T protein:vir:96 397 LNVNAVPASMVGEAMTLAVLQPVLDKATSNGTFTYGKDISAVQQQYITQITGDRRAWRQVQTLGYWINITFSSYTNSNTG 476 (504) T ss_pred hcCCCcccCHhhHHHHHHHHHHHHHHHHhcceeccCccCCccchheecccccccccccceeccceEEEecChhccChhHh Confidence 4432 67788899999999999999999873 3667664 22323 334 Q ss_pred hhCCeEEEEEEEEecCCceEEEEEEEEE Q lcl|NC_013693. 585 IAANQMVAGIWLKPEYSINWVYLDFAAV 612 (631) Q Consensus 585 i~~G~~~~~i~~~p~~p~e~i~~~~~~~ 612 (631) -.++-..+.+.+.--..+++|++.-... T Consensus 477 ~~R~~~~~~~~y~~~gaI~~v~~~~~~v 504 (504) T protein:vir:96 477 LTEWKANYTLIYSKGDAIRFVEGSDVMI 504 (504) T ss_pred hhccccceEEEEEECCeEEEEEeccccC Confidence 4455555666666666677776543222 No 63 >protein:vir:99586 Length: 507 # NCBI annotation: hypothetical protein # Family: family:all:396 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039795;genbank:gi:126011045;genbank:GeneID:4818281 Probab=94.03 E-value=0.0056 Score=32.92 Aligned_cols=456 Identities=11% Similarity=0.032 Sum_probs=195.0 Q ss_pred CCCcchhcCCceEEEEecCCCcee-cc--cCCceEEEeeeccCcCCCCeEEecCHHHHHHHcCCCCccchhHHHHHHHHH Q lcl|NC_013693. 
1 MATQSFSVAPSVQWTERDATLQTS-PS--VVVQGATVGKFQWGEAELPVLVTGGETGLVKKFFKPNDATATDFLVIADFL 77 (631) Q Consensus 1 m~~~~~ylsPGVyveEv~~~~~~~-gv--~tsv~afvG~~~~Gp~~~p~~i~s~~~e~~~~fG~~~~~~~~~~av~~fF~ 77 (631) |...+.| |.-. .+..+. ++ .-..+.|++....=|++. ++..++..|-...||. .+..+.+.+.||- T Consensus 1 mip~s~i------VnV~-~~v~~~a~~~~~~~~~lilt~~~~~~~~r-~~~y~s~~~V~~~FG~---~S~ey~aA~~yFs 69 (507) T protein:vir:99 1 MISQSRY------VRIV-SGVGAGAPVAQRRLIMRVMTTNAVLPPGV-VFESSSADAVGAYFGM---ASEEYKRAKAYMS 69 (507) T ss_pred CCCccce------eEEe-eeccccCcccccccceeeeccccCCCccc-eEeecCHHHHHHhcCC---ChHHHHHHHHHhc Confidence 8887654 3322 222222 22 235777777766667765 5566777889999997 5667778888888 Q ss_pred hCC------ceEEEEEecccCCCcccccccchhhhccccccccccccceeeehhhhhhhhhhchhhhhccCcccceeecc Q lcl|NC_013693. 78 SYS------SVAWVTRVVGPAARNAVTKGQTAILIRNKLDFETASPSASITWTGRYAGSLGNDVAINVCDAAGFPTWEFR 151 (631) Q Consensus 78 ngG------~~~~vvRv~~~~a~~a~~~~~~~~~~~~~~~~~~~~~~~~l~~~a~~~G~~gn~l~v~v~~~~~~~~~~~~ 151 (631) +-- +++||-|-..... .+.-.+. .+ ......+.+...| . +++++. T Consensus 70 q~p~~~~~P~~L~igR~~~~~~-~a~l~g~---~~----------~~~l~~~~~~~~G---~-lti~v~----------- 120 (507) T protein:vir:99 70 FISKSINSPSYISFARWVNAAI-ASMIVGD---SL----------VKNLPALKAVATP---T-LSLSIG----------- 120 (507) T ss_pred cCCCCCcccceEEEEeecCccc-cceeecc---hh----------hhhHHHHhhhcce---e-EEEEEc----------- Confidence 764 4999999854211 0100000 00 0000001111001 0 111110 Q ss_pred ceeeeecccccceEEeeeeeeeeecccccccceeeeeecccccccceeEeeccccccccccccccccccccccccccccc Q lcl|NC_013693. 152 NNFAYAPQAGEYHIVIVDKVGRITDSSGAVGQVDRISVSGTATGAGSISVAGEDVAYTDTDTPATLATKIGTALTALTDV 231 (631) Q Consensus 152 ~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 231 (631) |..... . .+..+... ...... ............. 
T Consensus 121 ---------G~~~t~--~---~i~lS~~t-------s~~~vA-------------------------s~i~~~l~a~~~~ 154 (507) T protein:vir:99 121 ---------GTVVPI--A---GIDLTAAL-------TLTDVA-------------------------ATLQTKIRASANA 154 (507) T ss_pred ---------CceeEe--c---cccccccC-------CHHHHH-------------------------HHHHHhhhccccc Confidence 000000 0 00000000 000000 0000000000000 Q ss_pred ccccccccccccccccccccccccceeecccccccceeeeecccccccceeeeeeeeecccccccchhh-hhhhhhcccc Q lcl|NC_013693. 232 YSSVVVKSNTVTVTHKAIGPQTVTAIVPDANGLTATAVTTTVGASGSIIEKYELMQATQGSKKSDGSNA-YFKDVINDTS 310 (631) Q Consensus 232 ~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~ 310 (631) ... ...+.. ... ...|.+.....+.... ....... ..++.. .+.. ....+ T Consensus 155 ~~~------~~tv~~--d~~--------------~~~F~v~s~~tG~~s~----i~~at~~--~~gt~~s~l~~-~~~~~ 205 (507) T protein:vir:99 155 ELA------TATVTF--NTT--------------TNQFVLNGTTTGALAP----TITAVRT--DPATDISSLLG-WTNTG 205 (507) T ss_pred ccc------ceEEEE--ecC--------------CceEEEEeeeccccce----eEEEEcC--CchhhHHHHhc-ccccc Confidence 000 000000 000 0001111111110000 0000000 000000 0000 00000 Q ss_pred ceeeeecccccccccccccccccchhhhhhHHHHHhhhhhcccceeEEecccc--chHHHHHHHHHhhccceEEeecccc Q lcl|NC_013693. 311 NWVYTFATTLAAGVTELEGGVDDYTGNRVAAIEALNNAEAYDAKPVFAFCEEL--IEQQTLIDLSTERKDTVSFVSPLRD 388 (631) Q Consensus 311 ~~~~~~~~~~~~~~~~l~gg~d~~~~~~~~~~~~l~~~~~~~~~~~i~~~~~~--~~~~~~~~~~~~~~~~~a~~d~~~~ 388 (631) . ....|.+. .....+...+.....--........+.. +...++.+.+|....+|.+.-.. T Consensus 206 -------------a-~~~~g~~a--et~~~a~~a~~~~~~nW~~~~~a~~~~~td~~~lalA~wiea~~~~f~~~~~~-- 267 (507) T protein:vir:99 206 -------------T-VFVKGQAA--ETPDTSISKSAAISTNFGSFIYTSTPALTNDQITAVASWNASQNNMYMYSVPT-- 267 (507) T ss_pred -------------c-eEeecccc--cCHHHHHHHHHhhcCCeEEEEEEeccccChHHHHHHHHHHhhcCcEEEEEEec-- Confidence 0 01111111 1111222222222111111111222222 23356677778777776544211 Q ss_pred cccccccCCHHHHHHHHHhcCCCcceEEEecCeeEEEeccCCceeEeehHHHHHHHHHHhhccC--Cceecccceeecee Q lcl|NC_013693. 
389 VVVGNRGREMEDVVAWRESLVRDSSYFFMDDNWAYVYDKYNDKMRWIPACGGTAGVWARSIEIA--GIYKSPAFHNRGKY 466 (631) Q Consensus 389 ~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~ag~~a~~d~~~--g~~~span~~~~~i 466 (631) .+ ...............+...++.+ ......+.+.+.|.++.+|-++ | -..-..|. + T Consensus 268 ---~~-----a~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~aa~~g~~as~nf~~~ng-~~T~~fk~---l 326 (507) T protein:vir:99 268 ---TI-----ANIGTLYAAVKGFSGCALNITSD---------SLPVDYIEQSPCEILAATDYTRVNA-TQNYMYYQ---F 326 (507) T ss_pred ---Cc-----hhhhhhhhhhhhcceeEEEeecc---------cccchhHHHHHHHHHHhhccCcCcc-ceeecccc---c Confidence 00 11111111110011111111111 1111234567777788777433 2 01112232 3 Q ss_pred eecccceecCChhHhhhhhhcCceEEEEEcCC--cEEEEcceecCCCChhhceehhhHHHHHHHHHHHHHHHHHhcC--- Q lcl|NC_013693. 467 NNYNRMAWSASSDERAVLYRNQINSIVTFSNE--GIVLYGDKTGLTRPSAFDRINVRGLFIMAEQNIAAIAKYYLGE--- 541 (631) Q Consensus 467 ~g~~~~~~~~~~~~~~~L~~~gin~i~~~~~~--G~~~wg~rT~~~~~~~~~~i~vrR~~~~i~~~~~~~~~~~v~e--- 541 (631) .|+. ...+++.|.+.|..+++|++..+.+. .+.+|-.-.+++-..+|.++.+-+=.+||+..++..+....-. T Consensus 327 ~GV~--a~~lt~t~a~al~~~n~N~y~~~a~~~~~~~~~~~G~~~gG~~~fid~d~~~~~~WL~~~iq~~l~~l~~~~~k 404 (507) T protein:vir:99 327 PSRN--ITVSDDTTANLVDANRGNYIGQTQSAGQSLAFYQRGILCGGPNDAVDMNIYANEIWLKSAISAQILSLFLNVPR 404 (507) T ss_pred CCcc--cccCCHHHHHHHHhcCCeEEEEeccccceeeEEecCeeeCCcccceeeeeecchHHHHHHHHHHHHHHHhcCCC Confidence 3333 23588999999999999999988654 3677755444442236778777777788888888888764433 Q ss_pred -CCCHHHHHHHHHHHHHHHHHHHhCCceee-----------------------------eEEEEc-cCCCC-HHHhhCCe Q lcl|NC_013693. 542 -NNDEFTRSLFSNAVRPYIRQLANMGAIYD-----------------------------GQVKCD-ADNNT-ADIIAANQ 589 (631) Q Consensus 542 -pn~~~~~~~i~~~i~~~l~~l~~~gal~g-----------------------------~~v~~~-~~~nt-~~~i~~G~ 589 (631) |-|+.=...|+..|+.-|++-+++|.|.. |.|.++ .++.+ .+...++. 
T Consensus 405 IPyt~~G~~~l~a~i~~~l~~av~nG~I~~Gvtl~~~q~~~in~~~~~~~~~~~~~~~Gyy~~~~~~s~~~~~~r~~r~~ 484 (507) T protein:vir:99 405 VPANETGESMLLSVIQSVVNTAKNNGTISAGKNLNVIQQQYITQISGDANAWRQVANIGYWLNITFSSYTNPNTQLTEWK 484 (507) T ss_pred CccChhhHHHHHHHHHHHHHHHHhccccccCCcccccchheecccccccccccceeccceEEEeCChHhcChhhhhcccc Confidence 33677788889999999999999988742 556654 23333 34444666 Q ss_pred EEEEEEEEecCCceEEEEEEEEE Q lcl|NC_013693. 590 MVAGIWLKPEYSINWVYLDFAAV 612 (631) Q Consensus 590 ~~~~i~~~p~~p~e~i~~~~~~~ 612 (631) ..+.+.+.--..+++|++.-... T Consensus 485 ~~~~~~y~~~gaI~~v~~~~~~v 507 (507) T protein:vir:99 485 ASYQLIYSKDDAIRFVEGTDTLI 507 (507) T ss_pred ceEEEEEEeCCeEEEEEeeeecC Confidence 66666677777777776644332 No 64 >protein:vir:101576 Length: 501 # NCBI annotation: gp04 # Family: family:all:396 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958108;genbank:gi:41057654;genbank:GeneID:2716834 Probab=93.13 E-value=0.0087 Score=31.88 Aligned_cols=451 Identities=13% Similarity=0.023 Sum_probs=182.7 Q ss_pred CCCcchhcCCceEEEEecCCCceecccCC-ceEEEeeeccCcCCCCeEEecCHHHHHHHcCCCCccchhHHHHHHHHH-- Q lcl|NC_013693. 1 MATQSFSVAPSVQWTERDATLQTSPSVVV-QGATVGKFQWGEAELPVLVTGGETGLVKKFFKPNDATATDFLVIADFL-- 77 (631) Q Consensus 1 m~~~~~ylsPGVyveEv~~~~~~~gv~ts-v~afvG~~~~Gp~~~p~~i~s~~~e~~~~fG~~~~~~~~~~av~~fF~-- 77 (631) |+-+ =+.=--||.-.+.-....++.-+ ..-|++....=|+++..+. ++..|-...||. .+..+.+.+.||- T Consensus 1 m~~~--~ip~s~iV~V~~~v~~~~~~~~~~~~l~l~~~~~~~~~~~~~~-~s~~~V~~~FG~---~S~ey~aA~~yFsg~ 74 (501) T protein:vir:10 1 MPTT--TIPIDQIVQMLPGVIGAGGAPGRLTGLVLTQDTSVQPGQLADF-FQETDVENWFGA---LSNEAKIADAYFPGI 74 (501) T ss_pred CCCC--CcccceEEEEeeecccCCCccccceeEEEeccCCCCccceEEe-cCHHHHHHhcCC---ChHHHHHHHHHhhhh Confidence 6631 12223455533321111122222 2335555555688887776 556899999997 5556677777776 Q ss_pred -hC---CceEEEEEecccCCCcccccccchhhhccccccccccccceeeehhhhhhhhhhchhhhhccCcccceeeccce Q lcl|NC_013693. 
78 -SY---SSVAWVTRVVGPAARNAVTKGQTAILIRNKLDFETASPSASITWTGRYAGSLGNDVAINVCDAAGFPTWEFRNN 153 (631) Q Consensus 78 -ng---G~~~~vvRv~~~~a~~a~~~~~~~~~~~~~~~~~~~~~~~~l~~~a~~~G~~gn~l~v~v~~~~~~~~~~~~~~ 153 (631) |- =+++||-|-...... +.-.+. .+.. ...-.+.+.+ |. +.+++.. ..... . T Consensus 75 ~~q~p~P~~l~igR~~~~~~~-~~l~g~---~l~~---------~~la~~~~~s-g~----l~vti~g-~~~~~-~---- 130 (501) T protein:vir:10 75 VNGGQLPYDLKFARYVAADAP-ASVYGI---PLTG---------VTLAQLQGYS-GT----LTVTTAA-QHVSA-N---- 130 (501) T ss_pred cCCCccccEEEEEeecCCCcc-ceEecc---chhh---------hhhhhcceee-eE----EEEeecc-ceeec-c---- Confidence 32 368999997542111 100000 0000 0000001100 10 0111100 00000 0 Q ss_pred eeeecccccceEEeeeeeeeeecccccccceee---eeecccccccceeEeecccccccccccccccccccccccccccc Q lcl|NC_013693. 154 FAYAPQAGEYHIVIVDKVGRITDSSGAVGQVDR---ISVSGTATGAGSISVAGEDVAYTDTDTPATLATKIGTALTALTD 230 (631) Q Consensus 154 ~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 230 (631) +..+................. ....-............. ....... .... T Consensus 131 ------------i~ls~ats~~~vAs~i~~al~~~~~tv~~d~~~~~f~its~t------tG~~~~i--------~~~~- 183 (501) T protein:vir:10 131 ------------ISLAAATSFANAATLIEAAFTSPDFVVAYDALRNRFTVVTNA------TGTAAAI--------SAVT- 183 (501) T ss_pred ------------cccccccCHHHHHHHHhhhccCCceEEEEcccCceEEEEeec------cCCceeE--------EEee- Confidence 000000000000000000000 000000000000000000 0000000 0000 Q ss_pred cccccccccccccccccccccccccceeecccccccceeeeecccccccceeeeeeeeecccccccchhhhhhhhhcccc Q lcl|NC_013693. 231 VYSSVVVKSNTVTVTHKAIGPQTVTAIVPDANGLTATAVTTTVGASGSIIEKYELMQATQGSKKSDGSNAYFKDVINDTS 310 (631) Q Consensus 231 ~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 310 (631) .+ ..+. 
..+.+.......+ T Consensus 184 --~~-----------------~~la-----------~~l~Lt~~~~a~v------------------------------- 202 (501) T protein:vir:10 184 --GT-----------------NNLA-----------DELGLSAAAGATL------------------------------- 202 (501) T ss_pred --Cc-----------------hhhh-----------hhcCccccccceE------------------------------- Confidence 00 0000 0000000000000 Q ss_pred ceeeeecccccccccccccccccchhhhhhHHHHHhhhhhcccceeEEeccccchHHHHHHHHHhhccceEEe--ecccc Q lcl|NC_013693. 311 NWVYTFATTLAAGVTELEGGVDDYTGNRVAAIEALNNAEAYDAKPVFAFCEELIEQQTLIDLSTERKDTVSFV--SPLRD 388 (631) Q Consensus 311 ~~~~~~~~~~~~~~~~l~gg~d~~~~~~~~~~~~l~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~a~~--d~~~~ 388 (631) ... |.+ ......+...+.+...--..+..+..+..+...++.+.++....+|.+. |.... T Consensus 203 ----~~~------------g~~--aet~~~a~~a~~~~~~~Wy~f~~a~~~~~~~~la~A~wiea~~~~f~~~~~~~~~~ 264 (501) T protein:vir:10 203 ----QAA------------GVA--ADTPASAMNRAVGLSRNWATFTTAWTAVIADRLAFAAWNSGQAYKYMYVAPDLEAA 264 (501) T ss_pred ----Eec------------Ccc--cccHHHHHHHHHhccCceEEEEEecCCChHHHHHHHHHHHhcCceEEEEEecCchh Confidence 000 000 0000001111111110000111111112222334555666555444332 11111 Q ss_pred cccccccCCHHHHHHHHHhcCCCcceEEEecCeeEEEeccCCceeEeehHHHHHHHHHHhhccCCc-eecccceeeceee Q lcl|NC_013693. 389 VVVGNRGREMEDVVAWRESLVRDSSYFFMDDNWAYVYDKYNDKMRWIPACGGTAGVWARSIEIAGI-YKSPAFHNRGKYN 467 (631) Q Consensus 389 ~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~ag~~a~~d~~~g~-~~span~~~~~i~ 467 (631) ..+. ....+....-...++ .+...+|+. ..+.+.+.|.++.+|-++-. -..-.+|. +. T Consensus 265 ~~~~---~~~~~i~~~l~~~~y--~~t~~~y~~-------------~~~~aa~~g~~as~nf~~~~g~~T~~fkq---~~ 323 (501) T protein:vir:10 265 SIVT---NNAASFGAQVFAAPY--QGTLPLYGD-------------QATAGAVMGYAASINFQLRNGRTVLAFRQ---FN 323 (501) T ss_pred hhhh---hhhhhHHHHHHhcCC--CceEEECCC-------------CcHHHHHHHHHHhhCcccCccceeeeccc---cC Confidence 1100 111112122222222 233333321 12456777888888754311 01112222 11 Q ss_pred ecccc-eecCChhHhhhhhhcCceEEEEEcCC--cEEEEcceecCCCChhhceehhhHHHHHHHHHHHHHHHHHhcC--- Q lcl|NC_013693. 
468 NYNRM-AWSASSDERAVLYRNQINSIVTFSNE--GIVLYGDKTGLTRPSAFDRINVRGLFIMAEQNIAAIAKYYLGE--- 541 (631) Q Consensus 468 g~~~~-~~~~~~~~~~~L~~~gin~i~~~~~~--G~~~wg~rT~~~~~~~~~~i~vrR~~~~i~~~~~~~~~~~v~e--- 541 (631) .++ .-.+++.|.+.|..+|+|+...+.+. -+.+|-.-++++ .|.+|.+-+=.+|+++.++..+...+-. T Consensus 324 --~Gi~a~~lt~t~a~al~~~~~N~y~~~~~~~~~~~~~~~G~~sG---~~~wiD~~~~~~Wl~~~iq~~l~~ll~~~~k 398 (501) T protein:vir:10 324 --AGVPATAHDLPTANALRSNNYTYIGAYANAANNYTIAYDGKLSG---KFLWVDTYLDQIYLNAELQRAEFEAMLAYNS 398 (501) T ss_pred --CCcCcccCCHHHHHHHHhcCCeEEEEeccccceeeEEecCeeec---cceeehhhhhHHHHHHHHHHHHHHHHHhcCC Confidence 112 23578999999999999999988643 477885555565 3677888887888888888887654432 Q ss_pred -CCCHHHHHHHHHHHHHHHHHHHhCCcee-----------------------------eeEEEEcc-CCCCHHHhhCCeE Q lcl|NC_013693. 542 -NNDEFTRSLFSNAVRPYIRQLANMGAIY-----------------------------DGQVKCDA-DNNTADIIAANQM 590 (631) Q Consensus 542 -pn~~~~~~~i~~~i~~~l~~l~~~gal~-----------------------------g~~v~~~~-~~nt~~~i~~G~~ 590 (631) |-|..=...|+..|+.-|++-+++|.|. ||.|.++. ++.+++...+.-. T Consensus 399 IPyt~~G~~~l~a~v~~~l~~av~nG~I~~Gv~l~~~q~~~i~~~~g~~~~~~~v~~~Gyy~~~~~~~~~~~~R~~R~~p 478 (501) T protein:vir:10 399 LPYNEDGYTALYRAGVDVIDAAVTSGIIRAGVTLTNSQLQQIDAAAGVAGAGQLVQTRGWYFLIGDPANPGQARQNRTTP 478 (501) T ss_pred cccCHHHHHHHHHHHHHHHHHHHhCceeecCCCCCcccceeeccccCccccccceeccceeEeeccccCChhhhhhcccc Confidence 3467778889999999999999999883 35555543 2333344444445 Q ss_pred EEEEEEEecCCceEEEEEEEEEe Q lcl|NC_013693. 591 VAGIWLKPEYSINWVYLDFAAVR 613 (631) Q Consensus 591 ~~~i~~~p~~p~e~i~~~~~~~~ 613 (631) .+.+.++--..+++|++-..... 
T Consensus 479 ~~~~~y~~~gaIh~v~i~s~~v~ 501 (501) T protein:vir:10 479 ACTLWYSDGGSIQQLTIGSNAVI 501 (501) T ss_pred ceEEEEEeCCceeEEEeeeeecC Confidence 56666666666666665332221 No 65 >protein:vir:3636 Length: 501 # NCBI annotation: gp04 # Family: family:all:396 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705631;genbank:gi:23752316;genbank:GeneID:955753 Probab=92.42 E-value=0.011 Score=31.22 Aligned_cols=450 Identities=12% Similarity=-0.005 Sum_probs=184.6 Q ss_pred CCCcchhcCCceEEEEecCCCcee-cccCCc-eEEEeeeccCcCCCCeEEecCHHHHHHHcCCCCccchhHHHHHHHHH- Q lcl|NC_013693. 1 MATQSFSVAPSVQWTERDATLQTS-PSVVVQ-GATVGKFQWGEAELPVLVTGGETGLVKKFFKPNDATATDFLVIADFL- 77 (631) Q Consensus 1 m~~~~~ylsPGVyveEv~~~~~~~-gv~tsv-~afvG~~~~Gp~~~p~~i~s~~~e~~~~fG~~~~~~~~~~av~~fF~- 77 (631) |+-+ =+.=--||.-.+ +..+. +.+-+. +-+++.-..=|++. ++..++..|-...||. .+..+.+.+.||- T Consensus 1 m~~~--~ip~s~iV~V~~-~v~~~~~~~~~~~~lllt~~~~~~~~r-~~~y~s~~~V~~~FG~---~S~ey~aA~~yFs~ 73 (501) T protein:vir:36 1 MPTT--TIPIDQIVQMLP-GVIGAGGAPGRLTGLVLTQDTSVQPGQ-LADFFQETDVENWFGA---LSNEAKIADAYFPG 73 (501) T ss_pred CCcC--CcccceEEEEee-eeccCCCcceeeeeEEEeccCCCCCcc-eeeecCHHHHHHhcCC---ChHHHHHHHHHhhc Confidence 6642 122234454333 22111 222221 22333334447764 5666777899999997 5667778888886 Q ss_pred --hC---CceEEEEEecccCCCcccccccchhhhccccccccccccceeeehhhhhhhhhhchhhhhccCcccceeeccc Q lcl|NC_013693. 78 --SY---SSVAWVTRVVGPAARNAVTKGQTAILIRNKLDFETASPSASITWTGRYAGSLGNDVAINVCDAAGFPTWEFRN 152 (631) Q Consensus 78 --ng---G~~~~vvRv~~~~a~~a~~~~~~~~~~~~~~~~~~~~~~~~l~~~a~~~G~~gn~l~v~v~~~~~~~~~~~~~ 152 (631) |- =+++||-|-........- ....+.. ...-.+.+.+ | .+.+++.. .... . 
T Consensus 74 ~~~q~~~P~~l~igR~~~~a~~~~l----~g~~l~~---------~~~a~~~~~s-g----~l~vti~g-~~~~-~---- 129 (501) T protein:vir:36 74 IVNGGQLPYDLKFARYVAADAPASV----YGIPLTG---------VTLAQLQGYS-G----TLTVTTAA-QHVS-A---- 129 (501) T ss_pred ccCCCccccEEEEEeecCcCcceeE----eccchhh---------hhhhhcccee-E----EEEEEecc-eeee-e---- Confidence 32 358999998643211110 0000000 0000000000 1 01111100 0000 0 Q ss_pred eeeeecccccceEEeeeeeeeeecccccccceee---eeecccccccceeEeeccccccccccccccccccccccccccc Q lcl|NC_013693. 153 NFAYAPQAGEYHIVIVDKVGRITDSSGAVGQVDR---ISVSGTATGAGSISVAGEDVAYTDTDTPATLATKIGTALTALT 229 (631) Q Consensus 153 ~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 229 (631) ....+................. ....-........... ........... .+ ... T Consensus 130 ------------~i~lS~~ts~~~vA~~i~~al~~~~~tv~~d~~~~~f~i~s-------~t~G~~~~i~~-~t---~~~ 186 (501) T protein:vir:36 130 ------------NISLAAATSFANAATLIEAAFTSPDFVVAYDALRNRFTVVT-------NATGTAAAISA-VT---GTN 186 (501) T ss_pred ------------ecccccccCHHHHHHHHhhhhcCcceEEEEcCcceeEEEEe-------ccCCcceeeEe-ee---ccc Confidence 0000000000000000000000 0000000000000000 00000000000 00 000 Q ss_pred ccccccccccccccccccccccccccceeecccccccceeeeecccccccceeeeeeeeecccccccchhhhhhhhhccc Q lcl|NC_013693. 230 DVYSSVVVKSNTVTVTHKAIGPQTVTAIVPDANGLTATAVTTTVGASGSIIEKYELMQATQGSKKSDGSNAYFKDVINDT 309 (631) Q Consensus 230 ~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 309 (631) + +.. .+.+.......+ T Consensus 187 ~-----------------------ia~-----------~l~Lt~~~~a~v------------------------------ 202 (501) T protein:vir:36 187 N-----------------------FAD-----------EIGLSAAAGATL------------------------------ 202 (501) T ss_pred c-----------------------hhh-----------hhcccccCcceE------------------------------ Confidence 0 000 000000000000 Q ss_pred cceeeeecccccccccccccccccchhhhhhHHHHHhhhhhcccceeEEeccccchHHHHHHHHHhhccceEEee--ccc Q lcl|NC_013693. 
310 SNWVYTFATTLAAGVTELEGGVDDYTGNRVAAIEALNNAEAYDAKPVFAFCEELIEQQTLIDLSTERKDTVSFVS--PLR 387 (631) Q Consensus 310 ~~~~~~~~~~~~~~~~~l~gg~d~~~~~~~~~~~~l~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~a~~d--~~~ 387 (631) . ..|.+. .....+...+.+...--..+.++.....+...++.+.++....+|.+.- ... T Consensus 203 ----~-------------~~g~~~--et~~~al~a~~~~s~~Wy~f~~a~~~~~~~~la~A~wiea~~~~f~~~~~~~~~ 263 (501) T protein:vir:36 203 ----Q-------------AAGVAA--DTPASAMNRAVGLSRNWATFTTAWTAVIADRLAFASWNSGQAYKYMYVAPDLEA 263 (501) T ss_pred ----E-------------eccccc--ccHHHHHHHHHhccCceEEEEEecCCChHHHHHHHHHHhhcCceEEEEEecCch Confidence 0 000000 0000011111111100000111111111223355566666665543331 111 Q ss_pred ccccccccCCHHHHHHHHHhcCCCcceEEEecCeeEEEeccCCceeEeehHHHHHHHHHHhhccC--Cceecccceeece Q lcl|NC_013693. 388 DVVVGNRGREMEDVVAWRESLVRDSSYFFMDDNWAYVYDKYNDKMRWIPACGGTAGVWARSIEIA--GIYKSPAFHNRGK 465 (631) Q Consensus 388 ~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~ag~~a~~d~~~--g~~~span~~~~~ 465 (631) ... ......+....-...++ .+....|. + ..+.+.+.|..+.+|-++ | -..-.+|.+. T Consensus 264 ~~~---~~~~~~~i~~~l~~~~y--~~t~~~y~------~-------~~~~aa~~g~~as~nf~~~~g-~~T~~fkq~~- 323 (501) T protein:vir:36 264 ASI---VSNNAASFGAQVFAAPY--QGTLPLYG------D-------QATAGAVMGYAASINFQLRNG-RTVLAFRQFN- 323 (501) T ss_pred hhh---hccchhhHHHHHHhcCC--CcEEEEcC------C-------CCHHHHHHHHHHhcCcccCcc-eeeeeccccC- Confidence 111 11111222222222232 22332221 1 225567788888877433 2 0011222210 Q ss_pred eeecccceecCChhHhhhhhhcCceEEEEEcC--CcEEEEcceecCCCChhhceehhhHHHHHHHHHHHHHHHHHhcC-- Q lcl|NC_013693. 466 YNNYNRMAWSASSDERAVLYRNQINSIVTFSN--EGIVLYGDKTGLTRPSAFDRINVRGLFIMAEQNIAAIAKYYLGE-- 541 (631) Q Consensus 466 i~g~~~~~~~~~~~~~~~L~~~gin~i~~~~~--~G~~~wg~rT~~~~~~~~~~i~vrR~~~~i~~~~~~~~~~~v~e-- 541 (631) .|+. ...+++.|.+.|..+|+|++..|.+ ..+.+|-.-++++ +|.||.+.+-.+||+..|++.+...+-. 
T Consensus 324 -~Gi~--a~~l~~t~a~al~~~~~N~y~~~~~~~~~~~~~~~G~~sG---~~~wiD~~~g~dWL~~~iq~~l~~ll~~~~ 397 (501) T protein:vir:36 324 -AGVP--ATVHDLPTANALRSNNYTYIGAYANAANNYTIAYDGKLSG---KFLWVDTYLDQIYLNAELQRAEFEAMLAYN 397 (501) T ss_pred -CCcC--cCcCCHHHHHHHHhcCCcEEEEEecccceeeEEEcCeeec---cchhhhHHHhHHHHHHHHHHHHHHHHhcCC Confidence 1111 2346789999999999999887764 4477775556665 3678889999999999999999876544 Q ss_pred --CCCHHHHHHHHHHHHHHHHHHHhCCcee-----------------------------eeEEEEccC-CCCHHHhhCCe Q lcl|NC_013693. 542 --NNDEFTRSLFSNAVRPYIRQLANMGAIY-----------------------------DGQVKCDAD-NNTADIIAANQ 589 (631) Q Consensus 542 --pn~~~~~~~i~~~i~~~l~~l~~~gal~-----------------------------g~~v~~~~~-~nt~~~i~~G~ 589 (631) |-|+.=...|+..|+.-|++-+++|.|. ||.+.++.. +.+++...+.- T Consensus 398 KIPytd~G~~~l~a~i~~~l~~av~nG~I~~Gv~~~~~q~~~I~~~~g~~~~~~~v~~~Gyy~~~~~~~~~~~~R~~R~~ 477 (501) T protein:vir:36 398 SLPYNEDGYTGLYRAGVDVIDAAVTSGIIRAGVTLTNSQLQQIDKAARVAGAGQLVQTRGWYFLIGDPANPGQARQNRTT 477 (501) T ss_pred CCccChhhHHHHHHHHHHHHHHHHhCceeecCCCCCcccceeecccccccccccceeccceEEeeCcccCChhhhhhccc Confidence 3367778889999999999999999883 355666532 33344444455 Q ss_pred EEEEEEEEecCCceEEEEEEEEEe Q lcl|NC_013693. 590 MVAGIWLKPEYSINWVYLDFAAVR 613 (631) Q Consensus 590 ~~~~i~~~p~~p~e~i~~~~~~~~ 613 (631) ..+.+.++--..+++|++-..... T Consensus 478 p~~~~~y~~~gaIh~v~i~s~~v~ 501 (501) T protein:vir:36 478 PACTLWYSDGGSIQSLTIGSNAVI 501 (501) T ss_pred CcEEEEEEeCCceeEEEeeeeeeC Confidence 566666666667777765333222 No 66 >protein:vir:106730 Length: 501 # NCBI annotation: gp04 # Family: family:all:396 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944312;genbank:gi:38638611;genbank:GeneID:2657359 Probab=90.86 E-value=0.019 Score=30.06 Aligned_cols=454 Identities=12% Similarity=-0.004 Sum_probs=184.1 Q ss_pred CCCcchhcCCceEEEEecCCCcee-cccCC-ceEEEeeeccCcCCCCeEEecCHHHHHHHcCCCCccchhHHHHHHHHH- Q lcl|NC_013693. 
1 MATQSFSVAPSVQWTERDATLQTS-PSVVV-QGATVGKFQWGEAELPVLVTGGETGLVKKFFKPNDATATDFLVIADFL- 77 (631) Q Consensus 1 m~~~~~ylsPGVyveEv~~~~~~~-gv~ts-v~afvG~~~~Gp~~~p~~i~s~~~e~~~~fG~~~~~~~~~~av~~fF~- 77 (631) |+-+ =+.=--||.-.+. ..+. ++.-+ ..-+++....=|++. ++..++..|-...||. .+..+.+.+.||- T Consensus 1 m~~~--~ip~s~iV~V~~~-v~~~~~~~~~f~~lll~~~~~~~~~r-~~~y~s~~~V~~~FG~---~S~ey~aA~~yFsg 73 (501) T protein:vir:10 1 MPTT--TIPIDQIVQMLPG-VIGAGGAPGRLTGLVLTQDTSVQPGQ-LADFFQKTDVENWFGA---LSNEAKIADAYFPG 73 (501) T ss_pred CCcC--ccccceEEEEeee-cccCCCcccccceEEEecccCCCccc-eeeecCHHHHHHhcCC---ChHHHHHHHHHhhh Confidence 6642 1222344553332 2111 22222 122444444557765 4555667899999997 5566777788875 Q ss_pred --hC---CceEEEEEecccCCCcccccccchhhhccccccccccccceeeehhhhhhhhhhchhhhhccCcccceeeccc Q lcl|NC_013693. 78 --SY---SSVAWVTRVVGPAARNAVTKGQTAILIRNKLDFETASPSASITWTGRYAGSLGNDVAINVCDAAGFPTWEFRN 152 (631) Q Consensus 78 --ng---G~~~~vvRv~~~~a~~a~~~~~~~~~~~~~~~~~~~~~~~~l~~~a~~~G~~gn~l~v~v~~~~~~~~~~~~~ 152 (631) |- =+++||-|-...... +.-.+. .++. .....+.+.+ | .+.+++.. ..... ... T Consensus 74 ~~~q~p~P~~l~igR~~~~~~~-~~l~g~---~l~~---------~~la~~~~~~-g----~l~i~i~g-~~~~~-~i~- 132 (501) T protein:vir:10 74 IVNGGQLPYDLKFARYVAADAP-ASVYGI---PLTG---------ITLAQLQGYS-G----TLTVTTAA-QHVSA-NIS- 132 (501) T ss_pred hcCCCccccEEEEEeecccCcc-ceeeec---eehh---------hhhhhhhhee-e----EEEEeecc-ceeee-ccc- Confidence 32 368999997543211 100000 0000 0000001100 1 01111100 00000 000 Q ss_pred eeeeecccccceEEeeeeeeeeecccccccceeeeeecccccccceeEeecccccccccccccccccccccccccccccc Q lcl|NC_013693. 153 NFAYAPQAGEYHIVIVDKVGRITDSSGAVGQVDRISVSGTATGAGSISVAGEDVAYTDTDTPATLATKIGTALTALTDVY 232 (631) Q Consensus 153 ~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 232 (631) .+.................. ....+............. ...+.... 
T Consensus 133 ---------------~s~ats~~~vA~~i~~al~~-------~~~tv~~d~~~~~f~i~~------~t~G~~~~------ 178 (501) T protein:vir:10 133 ---------------LAAATSFANAATLIEAAFTS-------PDFVVAYDALRNRFTVVT------NTTGTAAA------ 178 (501) T ss_pred ---------------cccccCHHHHHHHHHHhhcC-------CceEEEEecccceEEEEe------cccCccee------ Confidence 00000000000000000000 000000000000000000 00000000 Q ss_pred cccccccccccccccccccccccceeecccccccceeeeecccccccceeeeeeeeecccccccchhhhhhhhhccccce Q lcl|NC_013693. 233 SSVVVKSNTVTVTHKAIGPQTVTAIVPDANGLTATAVTTTVGASGSIIEKYELMQATQGSKKSDGSNAYFKDVINDTSNW 312 (631) Q Consensus 233 ~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 312 (631) +... .++.... ..+.+.......+ T Consensus 179 ---------i~~~---t~~~d~a-----------~~l~Lt~~~~a~v--------------------------------- 202 (501) T protein:vir:10 179 ---------ISAV---TGTNNLA-----------DELGLSAAAGATL--------------------------------- 202 (501) T ss_pred ---------EEEe---eccccch-----------hhhcccccCceeE--------------------------------- Confidence 0000 0000000 0000000000000 Q ss_pred eeeecccccccccccccccccchhhhhhHHHHHhhhhhcccceeEEeccccchHHHHHHHHHhhccceEEe--ecccccc Q lcl|NC_013693. 313 VYTFATTLAAGVTELEGGVDDYTGNRVAAIEALNNAEAYDAKPVFAFCEELIEQQTLIDLSTERKDTVSFV--SPLRDVV 390 (631) Q Consensus 313 ~~~~~~~~~~~~~~l~gg~d~~~~~~~~~~~~l~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~a~~--d~~~~~~ 390 (631) . . .|... .....+...+.+...--..+..+.....+...++.+.++....+|.+. |...... T Consensus 203 -~-~------------~g~~a--et~~~Al~a~~~~~~~Wy~f~~a~~~~~~~~la~A~wi~a~~~~f~~~~~~~~~~~~ 266 (501) T protein:vir:10 203 -Q-A------------AGVAA--DTPASAMNRAVGLSRNWATFTTAWTAVIADRLAFAAWNSGQAYKYMYVAPDLEAASI 266 (501) T ss_pred -E-e------------cCccc--ccHHHHHHHHHhcccceEEEEEEecCChHHHHHHHHHHHhcCceEEEEEecCcceee Confidence 0 0 00000 000011111111110001111111112223334556666655444332 2211111 Q ss_pred cccccCCHHHHHHHHHhcCCCcceEEEecCeeEEEeccCCceeEeehHHHHHHHHHHhhccCCc-eecccceeeceeeec Q lcl|NC_013693. 
391 VGNRGREMEDVVAWRESLVRDSSYFFMDDNWAYVYDKYNDKMRWIPACGGTAGVWARSIEIAGI-YKSPAFHNRGKYNNY 469 (631) Q Consensus 391 ~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~ag~~a~~d~~~g~-~~span~~~~~i~g~ 469 (631) +.. ...+....-...++ .+...+|+. ..|.+.+.|..+.+|-++-. -..-.+|.+ ..|+ T Consensus 267 ~~~---~~~~i~~~l~~~~y--~~t~~~y~~-------------~~~~aa~~g~~as~nf~~~~g~~T~~fkql--~~Gv 326 (501) T protein:vir:10 267 VTN---NAASFGAQVFAAPY--QGTLPLYGD-------------QATAGAVMGYAASINFQLRNGRTVLAFRQF--NAGV 326 (501) T ss_pred ecc---cchhHHHHHHhcCC--CceEEECCC-------------CCHHHHHHHHHHhcCcccCcceeeeeeccc--CCCc Confidence 111 11122122122232 233333321 23667788888888754311 001122221 0111 Q ss_pred ccceecCChhHhhhhhhcCceEEEEEcCC--cEEEEcceecCCCChhhceehhhHHHHHHHHHHHHHHHHHhcC----CC Q lcl|NC_013693. 470 NRMAWSASSDERAVLYRNQINSIVTFSNE--GIVLYGDKTGLTRPSAFDRINVRGLFIMAEQNIAAIAKYYLGE----NN 543 (631) Q Consensus 470 ~~~~~~~~~~~~~~L~~~gin~i~~~~~~--G~~~wg~rT~~~~~~~~~~i~vrR~~~~i~~~~~~~~~~~v~e----pn 543 (631) . ...+++.|.+.|..+|+|++..+.+. .+.+|-.-++++ +|.||.+.+-.+|+++.|+..+....-. |- T Consensus 327 ~--a~~l~~t~a~al~~~~~N~y~~~~~~~~~~~~~~~G~~sG---~~~wiD~~~g~dWl~~~iq~~l~~ll~~~~kIPy 401 (501) T protein:vir:10 327 P--ATAHDLPTANALRSNNYTYIGAYANAANNYTIAYDGKLSG---KFLWVDTYLDQIYLNAELQRAEFEAMLAYNSLPY 401 (501) T ss_pred C--cccCCHHHHHHHHhcCCeEEEEEecccceeeEEEcceeec---cceehhhHhhHHHHHHHHHHHHHHHHhcCCCccc Confidence 1 23577899999999999999888654 477885445565 3678889899999999999988765433 22 Q ss_pred CHHHHHHHHHHHHHHHHHHHhCCcee-----------------------------eeEEEEcc-CCCCHHHhhCCeEEEE Q lcl|NC_013693. 544 DEFTRSLFSNAVRPYIRQLANMGAIY-----------------------------DGQVKCDA-DNNTADIIAANQMVAG 593 (631) Q Consensus 544 ~~~~~~~i~~~i~~~l~~l~~~gal~-----------------------------g~~v~~~~-~~nt~~~i~~G~~~~~ 593 (631) |..=...|+..|+.-|++-+++|.|. ||.|.++. ++.+++...+.-..+. 
T Consensus 402 t~~G~~~l~a~i~~~l~~av~nG~Ia~Gv~l~~~q~~~i~~~~g~~~~~~~v~~~Gyy~~~~~~~~~~~~R~~R~~p~~~ 481 (501) T protein:vir:10 402 NEDGYTANYRAGVDVIDAAVTSGIIRAGVTLTNSQLQQIDAVAGVAGAGALVQTRGWYFLIGNPANPGQARQNRTSPACT 481 (501) T ss_pred CHHHHHHHHHHHHHHHHHHHhCcceecCcccCcccceeecccccccccccceeccceEEeeCcccCChhhhhhcccCceE Confidence 56778888999999999999999883 35565653 2333444444445566 Q ss_pred EEEEecCCceEEEEEEEEEe Q lcl|NC_013693. 594 IWLKPEYSINWVYLDFAAVR 613 (631) Q Consensus 594 i~~~p~~p~e~i~~~~~~~~ 613 (631) +.++--..+++|++-..... T Consensus 482 ~~y~~~gaIh~v~i~s~~v~ 501 (501) T protein:vir:10 482 LWYSDGGSIQELTIGSNAVI 501 (501) T ss_pred EEEEeCCceeEEEeeeeecC Confidence 66666666776665333221 No 67 >protein:vir:78611 Length: 501 # NCBI annotation: BcepNY3gp03 # Family: family:all:396 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294840;genbank:gi:149882903;genbank:GeneID:5291082 Probab=90.83 E-value=0.019 Score=30.04 Aligned_cols=454 Identities=12% Similarity=0.005 Sum_probs=181.1 Q ss_pred CCCcchhcCCceEEEEecCCCceecccCC-ceEEEeeeccCcCCCCeEEecCHHHHHHHcCCCCccchhHHHHHHHHH-- Q lcl|NC_013693. 1 MATQSFSVAPSVQWTERDATLQTSPSVVV-QGATVGKFQWGEAELPVLVTGGETGLVKKFFKPNDATATDFLVIADFL-- 77 (631) Q Consensus 1 m~~~~~ylsPGVyveEv~~~~~~~gv~ts-v~afvG~~~~Gp~~~p~~i~s~~~e~~~~fG~~~~~~~~~~av~~fF~-- 77 (631) |+-+ =+.=--||.-.+.-....+..-+ .+-+++....=|++. ++..++..|-...||. .+..+.+.+.||. T Consensus 1 m~~~--~ip~s~iV~V~~~v~~~~~~~~~~~~lll~~~~~~~~~r-~~~y~s~~~V~~~FG~---~S~ey~aA~~yFs~~ 74 (501) T protein:vir:78 1 MPTT--TIPIDQIVQMLPGVIGAGGAPGRLTGLVLTQDTSIQPGQ-LADFFQKTDVENWFGG---LSNEAVIADAYFPGI 74 (501) T ss_pred CCcC--ccccceEEEEeeecccCCCcceeeeeEEEecCCCCCccc-eeeecCHHHHHHhcCC---ChHHHHHHHHHhhcC Confidence 6642 12223445433321111122222 123344444457764 5555777889999997 5666778888886 Q ss_pred --hC--CceEEEEEecccCCCcccccccchhhhccccccccccccceeeehhhhhhhhhhchhhhhccCcccceeeccce Q lcl|NC_013693. 
78 --SY--SSVAWVTRVVGPAARNAVTKGQTAILIRNKLDFETASPSASITWTGRYAGSLGNDVAINVCDAAGFPTWEFRNN 153 (631) Q Consensus 78 --ng--G~~~~vvRv~~~~a~~a~~~~~~~~~~~~~~~~~~~~~~~~l~~~a~~~G~~gn~l~v~v~~~~~~~~~~~~~~ 153 (631) +- =+++||-|-...... +...+. .+....... +.+.+ |. +.+++.. ..... . T Consensus 75 ~~q~~~P~~l~igR~~~~a~~-~~l~g~-~l~~~~la~-----------~~~~~-G~----l~iti~g-~~~~~-~---- 130 (501) T protein:vir:78 75 VNGGQLPYDLKFARYVAADAP-ASVYGI-PLTGVTLTQ-----------LQGYS-GT----LTVTTAA-QHVSS-N---- 130 (501) T ss_pred CCCCcccceEEEEeecccCcc-eeEecc-ceeccchhh-----------hceee-eE----EEEEecc-ceeee-c---- Confidence 32 247899997543211 100000 000000000 00000 10 0111100 00000 0 Q ss_pred eeeecccccceEEeeeeeeeeecccccccceeeeeecccccccceeEeeccccc-ccccccccccccccccccccccccc Q lcl|NC_013693. 154 FAYAPQAGEYHIVIVDKVGRITDSSGAVGQVDRISVSGTATGAGSISVAGEDVA-YTDTDTPATLATKIGTALTALTDVY 232 (631) Q Consensus 154 ~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~ 232 (631) ...+............... + .. .... ...+.... ..... ...+... T Consensus 131 ------------i~~S~~ts~~~vA~~i~~a--l----~a-~~~t-v~~ds~~~~f~its------~t~G~~~------- 177 (501) T protein:vir:78 131 ------------ISLAAATSFANAATLIEAA--F----TS-PDFV-VSYDALRNRFVVNT------NATGTAA------- 177 (501) T ss_pred ------------cccccccCHHHHHHHHHhh--h----cC-cceE-EEEccccceEEEEe------eecCCce------- Confidence 0000000000000000000 0 00 0000 00000000 00000 0000000 Q ss_pred cccccccccccccccccccccccceeecccccccceeeeecccccccceeeeeeeeecccccccchhhhhhhhhccccce Q lcl|NC_013693. 233 SSVVVKSNTVTVTHKAIGPQTVTAIVPDANGLTATAVTTTVGASGSIIEKYELMQATQGSKKSDGSNAYFKDVINDTSNW 312 (631) Q Consensus 233 ~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 312 (631) .+.... +..... ..+.+..... .. 
T Consensus 178 --------~i~~~t---~~~~~a-----------~~l~Lt~~~~-a~--------------------------------- 201 (501) T protein:vir:78 178 --------AISAVT---GTNNLA-----------DELGLSAAAG-AS--------------------------------- 201 (501) T ss_pred --------eEEEEe---cccchh-----------hhhcccccCc-ee--------------------------------- Confidence 000000 000000 0000000000 00 Q ss_pred eeeecccccccccccccccccchhhhhhHHHHHhhhhhcccceeEEeccccchHHHHHHHHHhhccceEEe--ecccccc Q lcl|NC_013693. 313 VYTFATTLAAGVTELEGGVDDYTGNRVAAIEALNNAEAYDAKPVFAFCEELIEQQTLIDLSTERKDTVSFV--SPLRDVV 390 (631) Q Consensus 313 ~~~~~~~~~~~~~~l~gg~d~~~~~~~~~~~~l~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~a~~--d~~~~~~ 390 (631) +. ..|.. +.....+...+.+...--..+.++..+..+...++.+.++....+|.+. |...... T Consensus 202 v~-------------~~g~~--aet~~~a~~a~~~~~~~Wy~f~~a~~~~~~~~lalA~wiea~~~~f~~~~~~~~~~~~ 266 (501) T protein:vir:78 202 LQ-------------AAGVA--ADTPASAMNRAVGLSRNWATFTTAWTAVIADRLALASWNSGQAYKYMYVAPDLEPASI 266 (501) T ss_pred eE-------------ecccc--ccCHHHHHHHHHhccCceEEEEEecCCCHHHHHHHHHHHHhcCceEEEEEecCCccee Confidence 00 00000 0000011111111110001111121122223345556666655554332 2111111 Q ss_pred cccccCCHHHHHHHHHhcCCCcceEEEecCeeEEEeccCCceeEeehHHHHHHHHHHhhccCCc-eecccceeeceeeec Q lcl|NC_013693. 391 VGNRGREMEDVVAWRESLVRDSSYFFMDDNWAYVYDKYNDKMRWIPACGGTAGVWARSIEIAGI-YKSPAFHNRGKYNNY 469 (631) Q Consensus 391 ~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~ag~~a~~d~~~g~-~~span~~~~~i~g~ 469 (631) +.. ...+....-...++ .+...+|+. ..+.+.+.|..+.+|-++-. -..-.+|.+ ..|+ T Consensus 267 ~~~---~~~~i~~~l~a~~y--~~t~~~y~~-------------~~~~aa~~g~~as~nf~~~~g~~T~~fkq~--~~Gv 326 (501) T protein:vir:78 267 VTN---NSASFGAQVFAAPY--QGTLPLYGD-------------QATAGAVMGYAASINFQLRNGRTVLAFRQF--NAGV 326 (501) T ss_pred ecc---cchhHHHHHhhcCC--CceEEEcCC-------------cchHHHHHHHHHhcCcccCcceeeeecccc--CCCc Confidence 111 11112111112222 233333321 12456677777777744311 001112221 0111 Q ss_pred ccceecCChhHhhhhhhcCceEEEEEcCC--cEEEEcceecCCCChhhceehhhHHHHHHHHHHHHHHHHHhcC----CC Q lcl|NC_013693. 
470 NRMAWSASSDERAVLYRNQINSIVTFSNE--GIVLYGDKTGLTRPSAFDRINVRGLFIMAEQNIAAIAKYYLGE----NN 543 (631) Q Consensus 470 ~~~~~~~~~~~~~~L~~~gin~i~~~~~~--G~~~wg~rT~~~~~~~~~~i~vrR~~~~i~~~~~~~~~~~v~e----pn 543 (631) . .-.+++.|.+.|..+|+|++..+.+. .+.+|-.-++++ +|.+|.+-+=.+|+++.++..+....-. |- T Consensus 327 ~--a~~l~~t~a~al~~~~~N~y~~~~~~~~~~~~~~~G~~sG---~~~wiD~~~~~~Wl~~~iq~~l~~ll~~~~kIPy 401 (501) T protein:vir:78 327 P--ATAHDLGTANALRSNNYTYIGAYANAANNYTIAYDGKLSG---KFLWVDTYLDQIYLNAELQRAEFEAMLAYNSLPY 401 (501) T ss_pred C--cccCCHHHHHHHHhcCCeEEEEEecccceeeEEEcCeeec---cceeehhhhhHHHHHHHHHHHHHHHHHhCCCccc Confidence 1 23478899999999999999888654 477885545565 4677888888888888888888765422 33 Q ss_pred CHHHHHHHHHHHHHHHHHHHhCCcee-----------------------------eeEEEEcc-CCCCHHHhhCCeEEEE Q lcl|NC_013693. 544 DEFTRSLFSNAVRPYIRQLANMGAIY-----------------------------DGQVKCDA-DNNTADIIAANQMVAG 593 (631) Q Consensus 544 ~~~~~~~i~~~i~~~l~~l~~~gal~-----------------------------g~~v~~~~-~~nt~~~i~~G~~~~~ 593 (631) |..=...|+..|+.-|++-+++|.|. ||.+.++. ++.+++...+.-..+. T Consensus 402 t~~G~~~l~a~v~~~l~~av~nG~I~~Gv~l~~~q~~~I~~~~g~~~~~~~~~~~Gyy~~~~~~~~~~~~R~~R~~p~~~ 481 (501) T protein:vir:78 402 NEDGYTALYRAGVDVIDAAVTSGIIRAGVTLTNSQLQQIDAAAGVAGAGQLVQMRGWYFLIGDPANPGQARQNRTTPTCT 481 (501) T ss_pred CHHHHHHHHHHHHHHHHHHHhCceeecCCCCCCccceeeccccCccccccceeccceEEeeccccCChhhhhhcccCcEE Confidence 67778889999999999999999883 35555543 2333444444445566 Q ss_pred EEEEecCCceEEEEEEEEEe Q lcl|NC_013693. 594 IWLKPEYSINWVYLDFAAVR 613 (631) Q Consensus 594 i~~~p~~p~e~i~~~~~~~~ 613 (631) +.++--..+++|++-..... 
T Consensus 482 ~~y~~~gaIh~v~i~s~~v~ 501 (501) T protein:vir:78 482 LWYSDGGSIQELTIGSNAVI 501 (501) T ss_pred EEEEeCCceeEEEeeeeecC Confidence 66666666666665332221 No 68 >protein:vir:107720 Length: 515 # NCBI annotation: gp17 # Family: family:all:396 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024865;genbank:gi:48697507;genbank:GeneID:2948336 Probab=81.06 E-value=0.089 Score=26.34 Aligned_cols=409 Identities=11% Similarity=0.004 Sum_probs=123.0 Q ss_pred hhhhhccCcccceeeccceeeeecccccceEEeeeeeeeeecccccccc---------eeeeeecccccccceeEeeccc Q lcl|NC_013693. 135 VAINVCDAAGFPTWEFRNNFAYAPQAGEYHIVIVDKVGRITDSSGAVGQ---------VDRISVSGTATGAGSISVAGED 205 (631) Q Consensus 135 l~v~v~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~---------~~~~~~~~~~~~~~~~~~~~~~ 205 (631) +. .+....+...++.... ...+..+...+. ......... T Consensus 1 m~-------------------------------I~~~~~V~i~~~v~aa~~~~~~~f~~li~t~~~~~p~-~r~~~y~s~ 48 (515) T protein:vir:10 1 MP-------------------------------ISFDKYVAITSGVAAQQQIAARSFAIRVYTPNPMVSV-DRLITATSA 48 (515) T ss_pred CC-------------------------------CCceeEEEeecccccCCccccccceeeeeecccCCCc-cceeeecCH Confidence 00 0000000000000000 000000000000 000000000 Q ss_pred cccccccccccccccccccccc---cccccc---ccccccccccccccccccccc--------cceeecccccccceeee Q lcl|NC_013693. 206 VAYTDTDTPATLATKIGTALTA---LTDVYS---SVVVKSNTVTVTHKAIGPQTV--------TAIVPDANGLTATAVTT 271 (631) Q Consensus 206 ~~~~~~~~~~~~~~~~~~~~~~---~~~~~~---~~~~~~~~~~v~~~~~~~~~~--------~~~~~~~~~~~~~~~~~ 271 (631) ..+....+..-.. ....+. +.......+.+.-....+... 
...+..........+.+ T Consensus 49 ---------~~V~~~FG~~S~ey~aA~~yFsg~~~q~p~P~~L~igR~~~~a~~~~l~g~~~~~~~l~~~~~is~G~lti 119 (515) T protein:vir:10 49 ---------ADVGAYFGTASEEYKRAVKNFGFISKKTRRPTSIQFARWQREAGPVAIYGGAKKAAALATLQAVTAGAISF 119 (515) T ss_pred ---------HHHHHhcCCChHHHHHHHHHhhhccCCcccccEEEEEeccCcccceEEEeccchhhhHHhhhcccceeEEE Confidence 0000000000000 000000 000000000000000000000 00011111111111222 Q ss_pred ecccccccceeeeeeeeecccccccchhhhhhhhhcc----------------ccceeeeecccccccccccccccccch Q lcl|NC_013693. 272 TVGASGSIIEKYELMQATQGSKKSDGSNAYFKDVIND----------------TSNWVYTFATTLAAGVTELEGGVDDYT 335 (631) Q Consensus 272 ~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------------~~~~~~~~~~~~~~~~~~l~gg~d~~~ 335 (631) .+ +|..+.....+...... ........+...+.. ....+...............-+....+ T Consensus 120 ti--dG~~~~t~s~i~~S~at-s~~~vAs~i~tal~~~~~~~~~~~tv~~d~~~~~F~v~s~~tG~~~~is~~~~t~~~~ 196 (515) T protein:vir:10 120 LF--GGATTVTVSGISFSAAT-SLADVASELQTALRANADANLATCTVSYDPVGARFNFAGSPSDDTVQESISIVPQSNP 196 (515) T ss_pred EE--cceEEEEeecccccccc-CHHHHHHHHHhhhccccccccceeEEEEecCCCeEEEEEeecCCceeEEEEEecCCCc Confidence 22 22111011011111000 000111111111110 000000000000000001111111111 Q ss_pred hhhhhHH---------------------HHHhhhhhccc-ceeEEeccc--c----chHHHHHHHHHhhccceEEeeccc Q lcl|NC_013693. 336 GNRVAAI---------------------EALNNAEAYDA-KPVFAFCEE--L----IEQQTLIDLSTERKDTVSFVSPLR 387 (631) Q Consensus 336 ~~~~~~~---------------------~~l~~~~~~~~-~~~i~~~~~--~----~~~~~~~~~~~~~~~~~a~~d~~~ 387 (631) +.+.... +.+........ -..++.... . +...++.+..+....++-...... 
T Consensus 197 ~t~~a~~lglt~~~~av~~~g~aaet~~~a~~a~~~~s~nWy~f~~a~~~~~~~~~a~~~a~a~~~e~~~~~~~~~~~~~ 276 (515) T protein:vir:10 197 AIDVAQLLGWNSAQGASYIAASPVVSPVDTLIASVAGNNNFGSILFTKNGGTGITLSDAEAIALQNQSYNVAYKFQVGVD 276 (515) T ss_pred hhhHHHHhccccccceEEecccccccHHHHHHHHHhccCCeEEEEEeecCccccchhHHHHHHHHHhhcCceEEEEeccC Confidence 1111111 11111111110 011111111 0 111122223332222221111000 Q ss_pred ccccccccCCHHHHHHHHHhcCCCcceEEEecCeeEEEeccCCceeEeehHHHHHHHHHHhhccCC-ceecccceeecee Q lcl|NC_013693. 388 DVVVGNRGREMEDVVAWRESLVRDSSYFFMDDNWAYVYDKYNDKMRWIPACGGTAGVWARSIEIAG-IYKSPAFHNRGKY 466 (631) Q Consensus 388 ~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~ag~~a~~d~~~g-~~~span~~~~~i 466 (631) .....+.-........ ..+...+++-. + ..+....+|.++.+|-++- =...-..|. + T Consensus 277 ------~~~~~~~~a~~~~~~~--~~~~~~~~~~~-------~----~~~~a~~~g~~asvnf~~~ng~iT~kfKq---~ 334 (515) T protein:vir:10 277 ------DTTYSSWQAALAAIGG--VNMIYSPVALA-------A----EYHDMQDGIIEAATDFTQQGGATGYMYVQ---F 334 (515) T ss_pred ------ccceechhhhhhhhhh--cCceEEEEecc-------C----cchHHHHHHHHHhcCCCccchhheecccc---C Confidence 0000000000010000 00011111100 0 0123455666677663321 111222333 3 Q ss_pred eecccceecCChhHhhhhhhcCceEEEEEcC--CcEEEEcceecCCCChhhceehhhHHHHHHHHHHHHHHHHHhcCCC- Q lcl|NC_013693. 467 NNYNRMAWSASSDERAVLYRNQINSIVTFSN--EGIVLYGDKTGLTRPSAFDRINVRGLFIMAEQNIAAIAKYYLGENN- 543 (631) Q Consensus 467 ~g~~~~~~~~~~~~~~~L~~~gin~i~~~~~--~G~~~wg~rT~~~~~~~~~~i~vrR~~~~i~~~~~~~~~~~v~epn- 543 (631) .|+.. -.+++.|.+.|..+|+|+...+.+ ..+.+|-.-++++-..+|+||.+.|-.+|++..|+..+.... 
.-+ T Consensus 335 ~Gita--~~lt~t~a~al~~~~~N~Y~~~~~~~~~~~~~~~G~~~gG~~~~~WiD~~~g~~WL~~~iq~~l~~L~-~s~~ 411 (515) T protein:vir:10 335 NNQTP--AVNDDTLSGILDDLNINYYGQTQVNGTNLSFYQDGVMMGGPTDPRDSNVYANEQWLKSYAGASFMSLQ-LAQG 411 (515) T ss_pred CCCcc--ccCCHHHHHHHHhcCCeEEEEEeccCceEEEEeCCeeeCCccchhHHHHHhhHHHHHHHHHHHHHHHH-hcCC Confidence 33322 347899999999999999998865 458888665555544568899999999999999999997644 332 Q ss_pred ----CHHHHHHHHHHH-HHHHHHHHhCCceeee-----------------------------EEEEc-cCCCCHHHhhCC Q lcl|NC_013693. 544 ----DEFTRSLFSNAV-RPYIRQLANMGAIYDG-----------------------------QVKCD-ADNNTADIIAAN 588 (631) Q Consensus 544 ----~~~~~~~i~~~i-~~~l~~l~~~gal~g~-----------------------------~v~~~-~~~nt~~~i~~G 588 (631) ++.=...|+..| ++-|++-+++|.|.-. .+... .+..++.+...+ T Consensus 412 KIPytd~G~a~i~a~v~q~vl~~av~nG~I~~Gv~ls~~Q~~~i~~~~g~d~~~~~~~~~Gyy~~~~~~~~~~~~~r~~~ 491 (515) T protein:vir:10 412 KIPANIEGRGLLLGKMTKDIIPAAKLNGTFSIGKTLTVDQQLFVTELTGDDTAWQKVQNLGYWYDVQISSFVDTGGTTKY 491 (515) T ss_pred CCccChhhHHHHHHHHHHHHHHHHHhCCeeecCcccchhHHHHHHhhhcCcccccchhhcceeEecCcCCCCCccccccc Confidence 455556777666 4688888888887532 22211 111112222222 Q ss_pred eEEEEEEEEecCCceEEEEEEEEE Q lcl|NC_013693. 589 QMVAGIWLKPEYSINWVYLDFAAV 612 (631) Q Consensus 589 ~~~~~i~~~p~~p~e~i~~~~~~~ 612 (631) .+.+.+-+.-=-.+++|+...... T Consensus 492 ~~~~~~~y~~g~~i~~i~~~~~~v 515 (515) T protein:vir:10 492 QAVYSLVYSKDDLIRKVVGTHTLI 515 (515) T ss_pred CceeEEEEEcCceEEEEEeeeecC Confidence 222222222223333333222211 No 69 >protein:vir:94073 Length: 494 # NCBI annotation: hypothetical protein # Family: family:all:396 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453619;genbank:gi:84662655;genbank:GeneID:5142583 Probab=75.52 E-value=0.15 Score=25.17 Aligned_cols=444 Identities=11% Similarity=0.029 Sum_probs=174.9 Q ss_pred CCCcchhcCCceEEEEecCCCceecccCCceEEEee----eccCcCCCCeEEecCHHHHHHHcCCCCccchhHHHHHHHH Q lcl|NC_013693. 
1 MATQSFSVAPSVQWTERDATLQTSPSVVVQGATVGK----FQWGEAELPVLVTGGETGLVKKFFKPNDATATDFLVIADF 76 (631) Q Consensus 1 m~~~~~ylsPGVyveEv~~~~~~~gv~tsv~afvG~----~~~Gp~~~p~~i~s~~~e~~~~fG~~~~~~~~~~av~~fF 76 (631) |++. .=--||.-.+ +. ++.+-+.-.|.|. ...=|++ .++..++..|-...||. .+..+.+.+.|| T Consensus 1 m~~i----p~s~iV~V~~-~v--~~~~~~~~~f~~~l~~~~~~~~~~-r~~~y~s~~~V~~~FG~---~S~ey~aA~~yF 69 (494) T protein:vir:94 1 MPNI----PISQIVSINP-QV--VSAGGTQGTLDGLLLTQATGFPVT-QPQVYFSAADVGTAFGL---TSDEYNAALVYF 69 (494) T ss_pred CCCC----CcccEEEeee-ec--cccCCcccccceeEeecCccCCcc-ceeeecCHHHHHHhcCC---ChHHHHHHHHHh Confidence 5543 1123444222 22 2222222333333 2334654 46666777889999997 556677888888 Q ss_pred H----hC--CceEEEEEecccCCCcccccccchhhhccccccccccccceeeehhhhhhhhhhchhhhhccCcccceeec Q lcl|NC_013693. 77 L----SY--SSVAWVTRVVGPAARNAVTKGQTAILIRNKLDFETASPSASITWTGRYAGSLGNDVAINVCDAAGFPTWEF 150 (631) Q Consensus 77 ~----ng--G~~~~vvRv~~~~a~~a~~~~~~~~~~~~~~~~~~~~~~~~l~~~a~~~G~~gn~l~v~v~~~~~~~~~~~ 150 (631) - +- =+++||-|-.... ..+.-.+. ....+++.....+ | .+++++ T Consensus 70 s~~~~q~p~P~~l~igR~~~~a-~~~~l~g~----------------~~~~tl~~~~~~~-g-~l~iti----------- 119 (494) T protein:vir:94 70 AGILGGGQQPASLTIGRYASAA-TSAAVFGA----------------PLTLSLAQLQTLS-G-TLIVTT----------- 119 (494) T ss_pred hhccCCCccccEEEEEeecCcc-ccceeecc----------------chhhhHHhhhhcc-e-EEEEEE----------- Confidence 6 22 3689999975321 11100000 0000000000000 0 001111 Q ss_pred cceeeeecccccceEEeeeeeeeeecccccccceeeeeecccccccceeEeecccccccccccccccccccccccccccc Q lcl|NC_013693. 151 RNNFAYAPQAGEYHIVIVDKVGRITDSSGAVGQVDRISVSGTATGAGSISVAGEDVAYTDTDTPATLATKIGTALTALTD 230 (631) Q Consensus 151 ~~~~~~~~~~g~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 230 (631) .|........ .+... ...+...... ..+..... 
.+ T Consensus 120 ---------~g~~~~~~i~------lS~~t-------s~~~vA~~i~-~ai~~a~~---------~v------------- 154 (494) T protein:vir:94 120 ---------DTQRTSAAIN------LSGAT-------SFANAASLMT-SGFTTPNF---------AI------------- 154 (494) T ss_pred ---------cceEEEeeec------ccccC-------ChhhHHHHHh-hhhccccc---------eE------------- Confidence 0100000000 00000 0000000000 00000000 00 Q ss_pred cccccccccccccccccccccccccceeecccccccceeeeecccccccceeeeeeeeecccccccchhhhhhhhhcccc Q lcl|NC_013693. 231 VYSSVVVKSNTVTVTHKAIGPQTVTAIVPDANGLTATAVTTTVGASGSIIEKYELMQATQGSKKSDGSNAYFKDVINDTS 310 (631) Q Consensus 231 ~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 310 (631) .+.... ....+... ..+.. ...... . +.+... + ....... T Consensus 155 ~~d~~~-~~f~v~s~--ttG~~-s~is~~--t--------------~~~a~~---l-----------------~lt~~~~ 194 (494) T protein:vir:94 155 TYDAQR-RRFVLSTT--ATGTT-ASVSAV--T--------------GTLADG---V-----------------GLSTASG 194 (494) T ss_pred EEcccC-cEEEEEEc--cCCce-eEEEEe--c--------------cchhhh---h-----------------hhhcccc Confidence 000000 00000000 00000 000000 0 000000 0 0000000 Q ss_pred ceeeeecccccccccccccccccchhhhhhHHHHHhhhhhcccceeEEeccccchHHHHHHHHHhhccceEEe--ecccc Q lcl|NC_013693. 311 NWVYTFATTLAAGVTELEGGVDDYTGNRVAAIEALNNAEAYDAKPVFAFCEELIEQQTLIDLSTERKDTVSFV--SPLRD 388 (631) Q Consensus 311 ~~~~~~~~~~~~~~~~l~gg~d~~~~~~~~~~~~l~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~a~~--d~~~~ 388 (631) ..+ ...|.+. .....+...+.+...--..+.++.....+...++.+..+....++.+. +.-.. T Consensus 195 a~v-------------~~~g~~a--et~~~a~~a~~~~~~~Wy~f~~~~~~~~~~ilalA~wiea~~~~~~~~~~~~d~~ 259 (494) T protein:vir:94 195 AYV-------------EGSGLAA--DTAASALDRLAASSSTWAIFTTAWAASLSDRTALAQWTSDQVFRRIYAAWDQDAA 259 (494) T ss_pred ceE-------------eecCccc--ccHHHHHHHHHhccCceEEEEEecCCCHHHHHHHHHHHhhcCccEEEEEecCCcc Confidence 000 0001110 111112222222111111122222222234445666666655544332 11111 Q ss_pred cccccccCCHHHHHHHHHhcCCCcceEEEecCeeEEEeccCCceeEeehHHHHHHHHHHhhccCCceecccceeeceeee Q lcl|NC_013693. 
389 VVVGNRGREMEDVVAWRESLVRDSSYFFMDDNWAYVYDKYNDKMRWIPACGGTAGVWARSIEIAGIYKSPAFHNRGKYNN 468 (631) Q Consensus 389 ~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~ag~~a~~d~~~g~~~span~~~~~i~g 468 (631) ..+. ....+....-...++ .+..+.|+. ..|.+.+.|..+.+|-++ .+.+..+. .+. T Consensus 260 ~~~~---~~~~~i~~~l~~~~y--~~t~~~y~~-------------~~~~aa~~g~~aa~~~~~----~~g~~T~~-~k~ 316 (494) T protein:vir:94 260 GLSV---NNVSSFGNIVKTTPF--SNTIPVYGL-------------LANAMIVLAWGASTNLQI----AEGRTTLA-LRS 316 (494) T ss_pred eeec---ccchhHHHHHHhhcC--CceEEEcCC-------------CChHHHHHHHHHhccccc----cCcceeEE-eec Confidence 1111 111222222222222 233333331 124466777777777433 33433322 111 Q ss_pred -cccce-ecCChhHhhhhhhcCceEEEEEcC--CcEEEEcceecCCCChhhceehhhHHHHHHHHHHHHHHHHHhc---- Q lcl|NC_013693. 469 -YNRMA-WSASSDERAVLYRNQINSIVTFSN--EGIVLYGDKTGLTRPSAFDRINVRGLFIMAEQNIAAIAKYYLG---- 540 (631) Q Consensus 469 -~~~~~-~~~~~~~~~~L~~~gin~i~~~~~--~G~~~wg~rT~~~~~~~~~~i~vrR~~~~i~~~~~~~~~~~v~---- 540 (631) ..++. -.+++.|.+.|..+|+|++..+.+ .=+.+|..-++++ +|.+|-+-+=.+|+++.|++.+...+- T Consensus 317 q~~gi~~~~l~~t~a~al~~~~~N~y~~~~~~~~~~~~~~gg~~sG---~~~~id~~~~~~WL~~~iq~~l~~ll~~~~K 393 (494) T protein:vir:94 317 PVSSAGVRVDNLANANALLSNGYTYLGKYASATNTYTVTYNGAIGG---QFLWADTALGWIALRRNLQQALFETLLAYRS 393 (494) T ss_pred cCCCCCCccCCHHHHHHHHhcCCeEEEEecccCceEEEecCceecc---ccceeeeeccHHHHHHHHHHHHHHHHHhCCC Confidence 11222 346789999999999999988853 3467776666654 344443333445777777777755432 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHhCCcee----------------------------eeEEEE-c-cCCCCHHHhhCCeE Q lcl|NC_013693. 541 ENNDEFTRSLFSNAVRPYIRQLANMGAIY----------------------------DGQVKC-D-ADNNTADIIAANQM 590 (631) Q Consensus 541 epn~~~~~~~i~~~i~~~l~~l~~~gal~----------------------------g~~v~~-~-~~~nt~~~i~~G~~ 590 (631) =|-|+.=...|+..|+.-|++-+++|.|. ||+|.. 
+ .+.+.+.+....+ T Consensus 394 IPytd~G~~~l~a~i~~~l~~av~nG~I~~Gv~~~~~q~~~i~~~~G~~~~~~~~~kGyy~~~~~~~s~~~ra~R~~~~- 472 (494) T protein:vir:94 394 LPYNADGYNALYQGAQDVVSQFVAAGVIRAGVALSASQRAQIDQAAGVPISGDVVDKGWYLQVIDPITTTVRTDRGSPT- 472 (494) T ss_pred cccChhhHHHHHHHHHHHHHHHHhCceeecccccCcchhhhhhhhhcCccccceeccceeeeccCCCChhhhhccccCC- Confidence 34477778899999999999999999984 244543 2 2333443333333 Q ss_pred EEEEEEEecCCceEEEEEEEEEe Q lcl|NC_013693. 591 VAGIWLKPEYSINWVYLDFAAVR 613 (631) Q Consensus 591 ~~~i~~~p~~p~e~i~~~~~~~~ 613 (631) +.+.+.--..+++|++...... T Consensus 473 -~~~~y~~~GAIh~v~i~~~~v~ 494 (494) T protein:vir:94 473 -VNFWYCDGGSIQRVVVSATTVI 494 (494) T ss_pred -ceEEEEecCcEEEEEEeeEEeC Confidence 3333344666677766655443 Done!