Query         lcl|NC_016163.1_cdsid_YP_004934383.1 [gene=g149] [protein=putative contractile tail sheath structural protein] [protein_id=YP_004934383.1] [location=96008..97780]
Match_columns 590
No_of_seqs    205 out of 684
Neff          9.5
Searched_HMMs 1612
Date          Thu Nov 7 15:44:42 2013
Command       /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_150 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_150_vs_rec_db.hhr

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM Template HMM
  1 protein:vir:106984 Length: 743 100.0  5E-131  3E-134  734.7  50.9  544   1-590   1-732 (743)
  2 protein:vir:104477 Length: 749 100.0  2E-130  1E-133  731.3  50.9  568   1-590   1-739 (749)
  3 protein:vir:106427 Length: 679 100.0  6E-129  4E-132  723.6  48.6  576   1-590   1-665 (679)
  4 protein:vir:104858 Length: 729 100.0  9E-129  5E-132  722.7  49.1  558   1-590   1-717 (729)
  5 protein:vir:98263 Length: 664  100.0  2E-128  1E-131  721.1  49.6  579   1-590   1-650 (664)
  6 protein:vir:103456 Length: 659 100.0  8E-128  5E-131  717.5  51.7  576   1-590   1-646 (659)
  7 protein:vir:108052 Length: 660 100.0  1E-127  7E-131  716.6  51.0  584   1-590   1-647 (660)
  8 protein:vir:6894 Length: 660 # 100.0  8E-128  5E-131  717.4  50.0  576   1-590   1-646 (660)
  9 protein:vir:7206 Length: 659 # 100.0  1E-127  7E-131  716.4  50.8  571   1-590   1-646 (659)
 10 protein:vir:101187 Length: 663 100.0  6E-128  3E-131  718.3  48.6  578   1-590   1-648 (663)
 11 protein:vir:101804 Length: 663 100.0  1E-126  7E-130  711.0  47.8  580   1-590   1-648 (663)
 12 protein:vir:6594 Length: 666 # 100.0  3E-126  2E-129  708.6  50.2  576   1-590   1-651 (666)
 13 protein:vir:80984 Length: 666  100.0  6E-126  4E-129  707.1  49.4  573   1-590   1-651 (666)
 14 protein:vir:5663 Length: 671 # 100.0  8E-126  5E-129  706.5  49.7  569   1-590   1-661 (671)
 15 protein:vir:100539 Length: 663 100.0  6E-125  4E-128  701.7  48.4  578   1-590   1-648 (663)
 16 protein:vir:79092 Length: 477  100.0  1E-109  9E-113  617.3  42.3  451   1-590   1-467 (477)
 17 protein:vir:107865 Length: 477 100.0  7E-108  4E-111  608.0  40.4  450   1-590   1-467 (477)
 18 protein:vir:98824 Length: 774  100.0  5E-103  3E-106  581.4  39.8  475   1-590 279-767 (774)
 19 protein:vir:6079 Length: 396 # 100.0   6E-97  4E-100  548.1  36.8  367   1-590   1-383 (396)
 20 protein:vir:103993 Length: 390 100.0 2.2E-96 1.4E-99  545.0  36.0  362   1-590   1-378 (390)
 21 protein:vir:78206 Length: 390  100.0 2.2E-96 1.4E-99  545.0  36.0  362   1-590   1-378 (390)
 22 protein:vir:79181 Length: 390  100.0 3.4E-96 2.1E-99  544.0  36.3  362   1-590   1-378 (390)
 23 protein:vir:79141 Length: 391  100.0 3.2E-96   2E-99  544.1  36.1  362   1-590   1-378 (391)
 24 protein:vir:5711 Length: 396 # 100.0   9E-96 5.6E-99  541.7  37.1  367   1-590   1-383 (396)
 25 protein:vir:2035 Length: 396 # 100.0 4.7E-96 2.9E-99  543.2  35.4  367   1-590   1-383 (396)
 26 protein:vir:1845 Length: 392 # 100.0 1.6E-95 9.7E-99  540.3  36.9  367   1-590   1-380 (392)
 27 protein:vir:98553 Length: 395  100.0 6.6E-95 4.1E-98  536.9  37.8  370   1-590   1-383 (395)
 28 protein:vir:1172 Length: 391 # 100.0 5.1E-95 3.1E-98  537.5  34.6  365   1-590   2-379 (391)
 29 protein:vir:100323 Length: 393 100.0 3.2E-92   2E-95  522.2  37.5  363   1-590   3-380 (393)
 30 protein:vir:103168 Length: 641 100.0   1E-90 6.4E-94  513.9  37.5  466   1-495   3-641 (641)
 31 protein:vir:10336 Length: 386  100.0 1.5E-90 9.2E-94  513.1  35.6  365   1-590   1-379 (386)
 32 protein:vir:96740 Length: 388  100.0 2.5E-89 1.5E-92  506.4  37.0  359   1-590   1-377 (388)
 33 protein:vir:5833 Length: 742 # 100.0 3.4E-75 2.1E-78  428.9  38.6  502   1-590 201-736 (742)
 34 protein:vir:63742 Length: 562  100.0 8.2E-70 5.1E-73  399.3  38.9  531   1-590   8-557 (562)
 35 protein:vir:80488 Length: 562  100.0 2.4E-67 1.5E-70  385.8  39.4  529   1-590   8-557 (562)
 36 protein:vir:102819 Length: 648 100.0 2.7E-66 1.6E-69  380.1  40.2  546   1-590   1-645 (648)
 37 protein:vir:80779 Length: 569  100.0 8.2E-66 5.1E-69  377.4  39.0  530   1-590   1-564 (569)
 38 protein:vir:95741 Length: 587  100.0 1.6E-62 9.6E-66  359.4  39.4  544   1-590   1-582 (587)
 39 protein:vir:99306 Length: 587  100.0 3.6E-60 2.2E-63  346.5  42.9  540   1-590   1-582 (587)
 40 protein:vir:79798 Length: 717  100.0   2E-59 1.2E-62  342.4  40.0  571   1-590   1-717 (717)
 41 protein:vir:96586 Length: 587  100.0 9.3E-59 5.8E-62  338.7  40.8  541   1-590   8-582 (587)
 42 protein:vir:100829 Length: 607 100.0 1.5E-54   9E-58  315.7  38.9  545   1-590  17-596 (607)
 43 protein:vir:101326 Length: 529 100.0 3.7E-51 2.3E-54  297.1  33.4  496   1-590   1-529 (529)
 44 protein:vir:102957 Length: 437 100.0 1.6E-47   1E-50  277.1  35.6  415   1-589   1-437 (437)
 45 protein:vir:105470 Length: 451 100.0 9.1E-40 5.7E-43  234.6  36.3  426   1-589   1-451 (451)
 46 protein:vir:107310 Length: 581 100.0 1.9E-38 1.2E-41  227.3  30.2  520   7-590   1-566 (581)
 47 protein:vir:7653 Length: 581 # 100.0 7.8E-38 4.9E-41  224.0  32.6  526   1-590   1-566 (581)
 48 protein:vir:78986 Length: 436   99.9   1E-27 6.2E-31  168.6  28.2  411   1-589   3-436 (436)
 49 protein:vir:102359 Length: 356  99.2 3.8E-12 2.4E-15   83.2  18.6  322 200-588   1-356 (356)
 50 protein:vir:3788 Length: 376 #  98.7 6.5E-09   4E-12   65.5  19.2  335 215-590   1-371 (376)
 51 protein:vir:489 Length: 498 #   98.7 7.7E-08 4.8E-11   59.6  28.9  433   1-590   8-495 (498)
 52 protein:vir:276 Length: 369 #   98.6 1.5E-07 9.1E-11   58.1  26.0  333 223-590   1-366 (369)
 53 protein:vir:78782 Length: 370   98.6 1.8E-07 1.1E-10   57.6  21.7  334 215-590   1-363 (370)
 54 protein:vir:3751 Length: 376 #  98.5 2.1E-07 1.3E-10   57.2  20.7  335 215-590   1-371 (376)
 55 protein:vir:4463 Length: 498 #  98.4 8.2E-07 5.1E-10   53.9  30.5  434   1-590   8-495 (498)
 56 protein:vir:4517 Length: 498 #  98.4   1E-06 6.3E-10   53.5  30.9  434   1-590   8-495 (498)
 57 protein:vir:1996 Length: 495 #  98.3 1.9E-06 1.2E-09   51.9  33.0  434   1-586   9-495 (495)
 58 protein:vir:80052 Length: 331   97.9 1.1E-05 6.7E-09   47.8  23.4  313 219-590   1-331 (331)
 59 protein:vir:5260 Length: 502 #  97.4 7.7E-05 4.8E-08   43.1  32.6  458   1-590   1-502 (502)
 60 protein:vir:95263 Length: 450   96.5 0.00055 3.4E-07   38.4  25.6  391 125-590   1-449 (450)
 61 protein:vir:101576 Length: 501  72.8    0.18 0.00011   24.7  32.3  441   1-590   1-501 (501)
 62 protein:vir:3636 Length: 501 #  60.7    0.37 0.00023   23.0  32.8  441   1-590   1-501 (501)
 63 protein:vir:106730 Length: 501
58.5 0.41 0.00026 22.7 32.7 441 1-590 1-501 (501) 64 protein:vir:107720 Length: 515 52.1 0.56 0.00035 22.0 31.0 457 1-589 1-515 (515) 65 protein:vir:3165 Length: 426 # 49.6 0.64 0.00039 21.7 19.6 395 125-590 1-426 (426) 66 protein:vir:78611 Length: 501 48.2 0.68 0.00042 21.5 32.5 440 1-590 1-501 (501) 67 protein:vir:99586 Length: 507 46.3 0.74 0.00046 21.3 27.8 448 1-589 1-507 (507) 68 protein:vir:94073 Length: 494 33.1 1.4 0.00085 19.8 34.5 442 1-590 1-494 (494) No 1 >protein:vir:106984 Length: 743 # NCBI annotation: contractile sheath protein gp18 # Family: family:all:661 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195136;genbank:gi:58532913;uniprot:Q5GQN6;genbank:GeneID:3260481 Probab=100.00 E-value=5.4e-131 Score=734.74 Aligned_cols=544 Identities=18% Similarity=0.175 Sum_probs=353.0 Q ss_pred CccccCCceEEEEecCCCceecccccceeEEEeecCCCCCCccEEecCHHHHHHhcCCccccccccHHHHHHHHHcCCCc Q lcl|NC_016163. 1 MADYLHPSVSSRIVDNSAVYATAAGNTVLYAAIHSAIGRDNAVEFVTTTDEFLFKFGNPNLSKYGQTSYNILNWLQSGGT 80 (590) Q Consensus 1 Mp~yl~PGVYveEi~s~~~~i~gv~Tsv~~~vg~~~~Gp~~~p~~v~s~~e~~~~fG~~~~~~~~~l~~av~~ff~nGG~ 80 (590) ||+||||||||||+|+++++|+||+|++++|+|.++|||+++|++|+||.||++.||+++. ++++.|+|++||+|||+ T Consensus 1 m~~~~~Pgvyv~e~~~~~~~i~~~~t~~~~~vg~~~~Gp~~~p~~i~s~~~~~~~fG~~~~--~~~~~~~v~~~f~ngg~ 78 (743) T protein:vir:10 1 MASQVSPGILIKERDLTNAVVTGALQIRAAHASTFAKGPIGDIVNINTQKELVSVFGEPKE--DNAEDWMVASEFLNYGG 78 (743) T ss_pred CccccCCceEEEEecCCCceeccCCcceeEEEEeccCCCCCcCEEecCHHHHHHHcCCccC--CcchHHHHHHHHHhCCc Confidence 9999999999999999999999999999999999999999999999999999999999864 47899999999999995 Q ss_pred -EEEEEEecCCccccccccc----------------cceeecccccccceeeeeeccccc-------------------- Q lcl|NC_016163. 81 -AYVLRVMPDDAKFANSLIS----------------IKTTAAADPAKATVLVTAKAQTTN-------------------- 123 (590) Q Consensus 81 -~~vvRv~~~~a~~a~~~~~----------------~~~~~a~~~~~~~~~v~~~~~~~~-------------------- 123 (590) ||||||.+.++++++.... .....+..++.|.+.+.+...... 
T Consensus 79 ~~~vvrv~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~G~~gN~i~V~v~~~~~d~~~~~~~~~~~~~~~~~~ 158 (743) T protein:vir:10 79 RLAVVRAETTGVLNATTGSAGVLVKNRESWDAGSGNGEVFVARTAGSWGNSLMGVLVDRGADYIVTFAATPTDTAVGTQL 158 (743) T ss_pred eEEEEEccCccccccccccccccccccccccccccceeEEEEeeccccccceEEEEecCCCcceeeeeccccccccceee Confidence 9999998776665543221 222334555555543332110000 Q ss_pred ------c-ccccceEEEee--------c--cccC---------C----cc------------eeeEeecccccccccceE Q lcl|NC_016163. 124 ------T-ASKNAMKTILS--------G--GTAG---------E----TP------------LCFIVPKGRGENYNGYGF 161 (590) Q Consensus 124 ------~-a~~~~~~~~~~--------~--~t~~---------~----~~------------~~~~~~~~~g~~~~~~~~ 161 (590) . ...+....... . .+.. . .. .......+.......... T Consensus 159 ~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tv 238 (743) T protein:vir:10 159 LFSYSGTLVTGEILSYDSATNTATITASGTLTSQYLLDTPEQGLIGSFTDNSTTEVGRTPGTYSNVPASGGTGTGATFNV 238 (743) T ss_pred eecccccccccceeeeeecCcceeeeeccccceeeecccccccccccccccccccccccccceeeEEecccccccccccc Confidence 0 00000000000 0 0000 0 00 000000000000000000 Q ss_pred EE------------------------Eeecccc-----cccc----c--------------------------------- Q lcl|NC_016163. 162 RL------------------------SLRSDYD-----NTYN----F--------------------------------- 175 (590) Q Consensus 162 ~~------------------------~~~~~~~-----~~~~----~--------------------------------- 175 (590) .. ....... ...+ . T Consensus 239 ~v~~~~~~vg~~v~~~~~~~~~~~~~~~~~~v~~~~~~~~t~~~~~~~~~~~g~~~~~a~~~~~~~~~~~~~~~~~~~~~ 318 (743) T protein:vir:10 239 VVADAGGGVGGSVVVTLANPGTGYNQGETLTIASAATGDGTDILVTVATLSDGTIAITELKDWYLNTEIGSTGIKLGDIG 318 (743) T ss_pred cccccccccccccccccccccceeeeccccccccccccccccchhheecccccceeeeecccccccchhhcccccccccc Confidence 00 0000000 0000 0 Q ss_pred --------------cccceeeeeecccC-----CCceeeeeeeeeccccccccccccceeeeeeccccceeeecCccccc Q lcl|NC_016163. 
176 --------------RTYNLSVTVKDSTG-----ADVVVEGPYIVSFDPEAKDKSRQSIYYANIINKYSQYVEIVDNRSAF 236 (590) Q Consensus 176 --------------~~~~l~i~v~d~~~-----~~~v~e~~~~ls~~~da~~~~~~~~~~~~vv~~~s~~v~~~~~~~~~ 236 (590) ....+.+.+.+..+ ..+++|.+..++.+++++...+...++..+++..+.++...+.... T Consensus 319 ~~~~t~~~~~~~~~~~d~~~v~v~~~~~~~~~~~~~v~~~~~~~s~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~~~~~- 397 (743) T protein:vir:10 319 PRPGTSQFATDNGITDDQVHFAVIDTTGELTGTANTIVERLTYLSKLSDARSEENANIYYKNVINEQSAYLYHGNDAAV- 397 (743) T ss_pred ccceeeeccccccccccceEEEEecCcceeeeccCceeEEEeeeecccccccccCcceeecceeccccceeeccCcccc- Confidence 00000000000000 0011122222222222222222222222222211111111100000 Q ss_pred eeeeeecccccccCccccceecccccccccccccccccccceeecccccccccccccccccceeeeeccccccccccccc Q lcl|NC_016163. 237 ETISEFVVGDSEADPQKVDIIFGQERAVTPAETIHANVVWKSSSVETDDPSYDATAANFNNIQYLTEGSEGTWTGGNEES 316 (590) Q Consensus 237 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 316 (590) .... .. ..+.. ....... .................+|.++. T Consensus 398 ---------------------------~~~~-~~---~~~~~---~~~~~~~-----~~~~~~~~~~~~~~~~~gG~d~~ 438 (743) T protein:vir:10 398 ---------------------------QIAA-SG---EAWGQ---SSDQVLA-----DAGTAFSRTTGYWVNLAGGNDDF 438 (743) T ss_pred ---------------------------eeee-cc---ccCcc---ccceeee-----ecccccccccceEEEeecCcccc Confidence 0000 00 00000 0000000 00000000001112334455554 Q ss_pred eeccchhhHHHHHHHhhhccCCceeeeccc------chhHHHHHHHHHHHHhcCCeEEEEecCCCC-------------- Q lcl|NC_016163. 317 ALLVKGYSGVLAPEILDKQQYEIDVLLDGN------NEVAVKNAMSDLCSEQRGDCIAILDCSFQG-------------- 376 (590) Q Consensus 317 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~a~~~~~~~~~~~~~a~~d~p~~~-------------- 376 (590) .+...+... ....+...+..++++++.|+ +..+++.+++.+|+++ ++||+++|+|++. 
T Consensus 439 ~~~~~~~~~-~~~~~~~~~~~~~~ll~~p~~~~~~~~~~~v~~a~~~~~~~~-~~~~a~~d~p~~~~~~~~~~~~~~~~~ 516 (743) T protein:vir:10 439 AYDAGEFGA-AMDLFLDTEETEIDFVLMGGSMADEADTKSKATKVIAIAASR-KDALAFVSPHKGNQIASTGNVALSSAQ 516 (743) T ss_pred ccchhHHHH-HHHHhhhccccCcceEEecCcccCccchHHHHHHHHHHHHhh-CCeEEEEecCCCccccccccccccccc Confidence 444433332 33345555666677887765 3468899999999764 5899999999753 Q ss_pred CHHHHHHHHHhhcCcccceEEEEcCeEEEeecccCceeeecHHHHHHHHHHHhhccCCceECcCCcccceeeccccceee Q lcl|NC_016163. 377 DAQQTIDYRTGNISMSTYFTAIFGQHMNVYDEYNGETITVTSTYFLASMIPSNDDQNGIQWTFVGPRRGVISGFTDINFY 456 (590) Q Consensus 377 ~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~ppsg~~AG~~A~~D~~~G~~~sPan~~~~~i~g~~~~~~~ 456 (590) +..++..|+..+ ++|+|+++||||++++|+.+++.+++|||+++||+|||+|.++||||||||+++.+|.|++++++. T Consensus 517 ~~~~~~~~~~~~--~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~AGl~a~~D~~~g~~~span~~~~gi~g~~~~~~~ 594 (743) T protein:vir:10 517 QKENTIAFFSDL--TSTSYAVFDSGYKYVYDRFTDKYRYIPCNGDVAGLCVQTSNQLDDWYSPAGLNRGGILNAVKLAYN 594 (743) T ss_pred cchHHHHHHHhc--cCCeeEEEEccceeeeccccCceeEechhHHHHHHHHHhhccCCcEEccCCeeeeeeeccccceec Confidence 234666677543 589999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cChhHHhhhhhcCceEEEEecCCeEEEecceecC-CCcccceehhhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHH Q lcl|NC_016163. 
457 PNEPWKEKLYLAQVNYIERDPKKISFATQLTSQT-SRSALSYINNVRVLLRIRREVEKMMADYRQEFQDNTTYDSMSYSL 535 (590) Q Consensus 457 ~~~~e~~~Ln~~gIn~i~~~~~~G~~~wG~rT~s-~d~~~~~i~vrR~~~~i~~si~~~~~~~vfepn~~~l~~~v~~~i 535 (590) ++++|++.||++||||||+|+++|+++||+||++ .|++||||||||||+|||++|+++++|+||||||+.||++|+++| T Consensus 595 ~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~s~d~~~~~i~vrR~~~~i~~si~~~~~~~v~e~n~~~~~~~i~~~i 674 (743) T protein:vir:10 595 PNKADRDELYQNRINPVVSLRGQGITLFGDKTALAAPSAFDRINVRRLFLNLEKRARRLAEGVLFEQNDATTRAGFSSAL 674 (743) T ss_pred CChhHHHhHhhCCceEEEEecCCeEEEEcccccCCCCcccceEeehhhHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHH Confidence 9999999999999999999999999999999985 589999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhCCceEE---EecCCCCCHHHhhCCEEEEEEEEEecCccceEEEEEEeeC Q lcl|NC_016163. 536 NNYLQQWVANRACSS---ISGTVYASDYDKQQSIARVKVELVFTGVIERIAIDLVVNK 590 (590) Q Consensus 536 ~~~L~~l~~~ga~~~---~~d~~~nt~~~i~~G~l~~~i~~ap~~paefi~~~~~~~~ 590 (590) ++||++||++|+|.+ +||+++||++||++|+|+++|+++|++|||||+|||.|.| T Consensus 675 ~~fL~~l~~~gal~~~~V~~d~~~nt~~~i~~G~~~~~i~~~p~~pae~I~~~~~~~~ 732 (743) T protein:vir:10 675 NSYLSEVQARRGVTDYLVICDESNNTPDIIDRNEFVAEVYVKPTRSINFITITFTATK 732 (743) T ss_pred HHHHHHHHhcCceeeeEEEEcCCCCCHHHhhCCeEEEEEEEEecCCcceEEEEEEEee Confidence 999999999998754 7999999999999999999999999999999999999999 No 2 >protein:vir:104477 Length: 749 # NCBI annotation: gp18 # Family: family:all:661 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214663;genbank:gi:61806304;genbank:GeneID:3294532 Probab=100.00 E-value=2.4e-130 Score=731.25 Aligned_cols=568 Identities=18% Similarity=0.177 Sum_probs=370.5 Q ss_pred Ccc-ccCCceEEEEecCCCceecccccceeEEEeecCCCCCCccEEecCHHHHHHhcCCccccccccHHHHHHHHHcCCC Q lcl|NC_016163. 1 MAD-YLHPSVSSRIVDNSAVYATAAGNTVLYAAIHSAIGRDNAVEFVTTTDEFLFKFGNPNLSKYGQTSYNILNWLQSGG 79 (590) Q Consensus 1 Mp~-yl~PGVYveEi~s~~~~i~gv~Tsv~~~vg~~~~Gp~~~p~~v~s~~e~~~~fG~~~~~~~~~l~~av~~ff~nGG 79 (590) ||. 
||||||||||+|++ ++|++|+|++++|+|.++|||+++|++|+||.||++.||+++ .+++++|+|++||.||| T Consensus 1 M~~~~~~PgVyv~e~~~~-~~~~~~~t~~~~fvG~~~~Gp~~~p~~v~s~~~~~~~fG~~~--~~~~~~~~v~~~F~ngg 77 (749) T protein:vir:10 1 MATNQSSPGVVIQERDLT-TVSTIPTANVGVIAAPFTKGPVEEVIEITSERQLAEKFGEPN--ESNYEYWFSAAQFLSYG 77 (749) T ss_pred CCccccCCeeEEEEecCC-cccccccCceeEEEeccCCCCCccCEEcCCHHHHHHHcCCcc--CCcccHHHHHHHHhhcC Confidence 995 99999999999987 569999999999999999999999999999999999999986 44679999999999999 Q ss_pred c-EEEEEEecCCccccccccc-------------------cceeecccccccceeeeeeccccc-----------cccc- Q lcl|NC_016163. 80 T-AYVLRVMPDDAKFANSLIS-------------------IKTTAAADPAKATVLVTAKAQTTN-----------TASK- 127 (590) Q Consensus 80 ~-~~vvRv~~~~a~~a~~~~~-------------------~~~~~a~~~~~~~~~v~~~~~~~~-----------~a~~- 127 (590) + ||||||++++++++..... .....+..|+.|.+.+.+...... .... T Consensus 78 ~~~~vvRv~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~pG~~gn~l~v~v~~~~~~~~~~~~~~~~~~~~ 157 (749) T protein:vir:10 78 GLLKTIRVNSSSLKNAVDTGTAPLVKNLQDYETSIEDASNNFSWVARTPGDTGNSIGIFVTDAGADQVVVVPAPGSGNEH 157 (749) T ss_pred CeEEEEEccCccccccccccccccccccccccccccccccceEEEeccCCCcCCceEEEEEcCCCceeeeeecCCcccee Confidence 5 9999998777666532211 122345556666554332110000 0000 Q ss_pred -------------cceE------EE-e-----------------eccc-----c---CC-cceeeEeecc---------- Q lcl|NC_016163. 128 -------------NAMK------TI-L-----------------SGGT-----A---GE-TPLCFIVPKG---------- 151 (590) Q Consensus 128 -------------~~~~------~~-~-----------------~~~t-----~---~~-~~~~~~~~~~---------- 151 (590) .... .+ . ..+. . +. 
.......+.+ T Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~a~~~~~~~~~~~~~~~~~~s~~~~~~~a~~~ 237 (749) T protein:vir:10 158 EFVADAAVSAASGAAGKVFKYSIILTIDDVVGTFAPGSATTITIGGSAESVNVLAYDATNKKLEIGLPSGGVTGILADNQ 237 (749) T ss_pred eEEeeecccccccccccccccceeeeeccccceeecccceeeeccCcccccccccccCCcceEEEeeecccccceeeeee Confidence 0000 00 0 0000 0 00 0000000000 Q ss_pred ----------------------------------cccccccceEEEEeecc------------------------ccccc Q lcl|NC_016163. 152 ----------------------------------RGENYNGYGFRLSLRSD------------------------YDNTY 173 (590) Q Consensus 152 ----------------------------------~g~~~~~~~~~~~~~~~------------------------~~~~~ 173 (590) .....++....+..... +.... T Consensus 238 ~v~~~~~~~~~~~~i~~~~~~~~~~~~~~~a~~~~~~~~~g~~~~it~v~~~~~~~~~~t~~~~~~~a~~~gt~~~~~~~ 317 (749) T protein:vir:10 238 VITQGTNTAKINVTIERKLLVALNKSSIEFAASDVVQDTNSTNITITSVRDEYTEREYLPGVKWINVAPRPGTSLYANGV 317 (749) T ss_pred cccccccccccccccccchhhhhccccceeeccccccCCccceeEEEeeeccccccccccceeeccccccccceeeeecc Confidence 00000000000000000 00000 Q ss_pred cccccceeeeeecccCC-----CceeeeeeeeeccccccccccccceeeeeeccccceeeecCccccceeeeeecccccc Q lcl|NC_016163. 174 NFRTYNLSVTVKDSTGA-----DVVVEGPYIVSFDPEAKDKSRQSIYYANIINKYSQYVEIVDNRSAFETISEFVVGDSE 248 (590) Q Consensus 174 ~~~~~~l~i~v~d~~~~-----~~v~e~~~~ls~~~da~~~~~~~~~~~~vv~~~s~~v~~~~~~~~~~~~~~~~~~~~~ 248 (590) +.....+++.+.+..+. .+++|.+.+++...+++...+...++.++++..+..+...+............ .... T Consensus 318 ~g~~D~~~v~v~~~~g~~~~~~g~v~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~v~~~~~~~~~~~~~~~~-~~~~ 396 (749) T protein:vir:10 318 GGHRDEMHVILVDIDGGVTGTVGALLERYIDVSKASDAKTSVGETNYYAEVIKQKSEFIYWAEHESTLYAATSSA-SDGL 396 (749) T ss_pred cCCCCceEEEEecCCCeeeecccceeeeeeeccccccccccccccchhhhhhccCCCEEEEEecccccccccccc-cccc Confidence 01112233334433321 34567777777777777777777778888887777776654432211111000 0000 Q ss_pred cCccccceecccccccccccccccccccceeecccccccccccccccccceeeeeccccccccccccceeccchhhHHHH Q lcl|NC_016163. 
249 ADPQKVDIIFGQERAVTPAETIHANVVWKSSSVETDDPSYDATAANFNNIQYLTEGSEGTWTGGNEESALLVKGYSGVLA 328 (590) Q Consensus 249 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 328 (590) .............. ........ ....+. .. .............+.+.+... +.......++. ... T Consensus 397 ~~~~~~~~~~~~~~------~~~~~~~~---~~~~~~-~~--~~~~~~~~~~~~gg~d~~~~~--~~~~~~~~~~~-~~~ 461 (749) T protein:vir:10 397 FGQTAANRQFNLFR------SAAGSVDY---PAGVTT-LG--SKNNATYYYRLSGGVNYTVSA--GQYTITNTDIG-SAY 461 (749) T ss_pred cccccccceeeccc------ccccccee---cccccc-cc--ccCCcEEEEEccCCccccccc--ccccccchhHH-HHH Confidence 00000000000000 00000000 000000 00 000000000111111111111 11112222222 233 Q ss_pred HHHhhhccCCceeeec--cc----chhHHHHHHHHHHHHhcCCeEEEEecCCCCC---------HHHHHHHHHhhcCccc Q lcl|NC_016163. 329 PEILDKQQYEIDVLLD--GN----NEVAVKNAMSDLCSEQRGDCIAILDCSFQGD---------AQQTIDYRTGNISMST 393 (590) Q Consensus 329 ~~~~~~~~~~~~~~~~--~~----~~~~~~~a~~~~~~~~~~~~~a~~d~p~~~~---------~~~~~~~~~~~~~~~s 393 (590) ..+...+...+++++. +. +..+++.+++.+|++ |++||+++|+|.+.. ..++..|+.++ .+| T Consensus 462 ~~l~~~~~~~~~~li~~~~~~~~~~~~~v~~al~~~~~~-~~~~~~~~d~p~~~~~~~~~~~~~~~~~~~~~~~~--~~s 538 (749) T protein:vir:10 462 ELIGDPESQIVDFIISGPSGTSDANALAKITSLVNIAEE-RRDCMVFVSPRRGNVIGISNTTTITTNIVDFFKKL--PSS 538 (749) T ss_pred HHhhhhhhcccceEEEecCCCCcchhHHHHHHHHHHHhh-cCCEEEEEcCCCCcccccccchhhhhHHHHHHhhc--cCc Confidence 3344444444444433 22 345788999999966 568999999987532 24556666543 478 Q ss_pred ceEEEEcCeEEEeecccCceeeecHHHHHHHHHHHhhccCCceECcCCcccceeeccccceeecChhHHhhhhhcCceEE Q lcl|NC_016163. 
394 YFTAIFGQHMNVYDEYNGETITVTSTYFLASMIPSNDDQNGIQWTFVGPRRGVISGFTDINFYPNEPWKEKLYLAQVNYI 473 (590) Q Consensus 394 ~~~~~~~p~~~~~d~~~~~~~~~ppsg~~AG~~A~~D~~~G~~~sPan~~~~~i~g~~~~~~~~~~~e~~~Ln~~gIn~i 473 (590) .|+++||||++++|+.+++.+++|||||+||+|||+|.++||||||||+++++|+|++++++.++++|++.||++||||| T Consensus 539 ~~~~~~~p~~~~~d~~~~~~~~~p~s~~vAGl~Ar~D~~~g~~~SPan~~~~~i~g~~~~~~~~~~~e~~~Ln~~gIn~i 618 (749) T protein:vir:10 539 SYMVFDSGYKYIYDKYNDVYRYIPCNGDTAGLCLQTNEISEPWFSPAGFQRGVLRNAIKLAYTPNKAQRDQLYANRVNPI 618 (749) T ss_pred eeEEEEccceeeeccccCceEEechHHHHHHHHHHhhccCCcEECcCCceeeeeeccccceeecChhHHHhhhhCCceEE Confidence 89999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEecCCeEEEecceec-CCCcccceehhhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhCCceE--- Q lcl|NC_016163. 474 ERDPKKISFATQLTSQ-TSRSALSYINNVRVLLRIRREVEKMMADYRQEFQDNTTYDSMSYSLNNYLQQWVANRACS--- 549 (590) Q Consensus 474 ~~~~~~G~~~wG~rT~-s~d~~~~~i~vrR~~~~i~~si~~~~~~~vfepn~~~l~~~v~~~i~~~L~~l~~~ga~~--- 549 (590) ++|+++|+++||+||+ +.|++|+||||||||+|||++|+++++|+||||||+.||++|+++|+.||++||++|+|. T Consensus 619 ~~~~g~G~~~wG~rT~~s~d~~~~~i~vRRl~~~ie~si~~~~~~~v~epn~~~l~~~i~~~i~~fL~~l~~~G~i~~f~ 698 (749) T protein:vir:10 619 VSFPGQGVVLYGDKTALGFASAFDRINIRRLFLTVERVISTAAKAQLFEQNDEAQRSLFINIVEPYLRDVQGRRGVVDFL 698 (749) T ss_pred EEecCCeEEEEcceecCCCCcccceeehhhhHHHHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHHHHHHHhcCCeeeeE Confidence 9999999999999998 568999999999999999999999999999999999999999999999999999999875 Q ss_pred EEecCCCCCHHHhhCCEEEEEEEEEecCccceEEEEEEeeC Q lcl|NC_016163. 
550 SISGTVYASDYDKQQSIARVKVELVFTGVIERIAIDLVVNK 590 (590) Q Consensus 550 ~~~d~~~nt~~~i~~G~l~~~i~~ap~~paefi~~~~~~~~ 590 (590) ++||+++||+++|++|+|+++|+|+|++|||||+|||.|+| T Consensus 699 V~~d~~~Nt~~~i~~G~~~~~i~~~P~~pae~I~~~~~~~~ 739 (749) T protein:vir:10 699 VKCDSTNNTPEAVDRGEFYAEVFLKPTRTINYVQLTFVATR 739 (749) T ss_pred EEEcCCCCCHHHhhCCEEEEEEEEEecCCccEEEEEEEEee Confidence 47999999999999999999999999999999999999999 No 3 >protein:vir:106427 Length: 679 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944106;genbank:gi:38640150;genbank:GeneID:2658188 Probab=100.00 E-value=5.8e-129 Score=723.61 Aligned_cols=576 Identities=13% Similarity=0.127 Sum_probs=343.7 Q ss_pred CccccCCceEEEEecCCCceecccccceeEEEeecCCCCCCccEEecCHHHHHHhcCCccccccccHHHHHHHHHcCCCc Q lcl|NC_016163. 1 MADYLHPSVSSRIVDNSAVYATAAGNTVLYAAIHSAIGRDNAVEFVTTTDEFLFKFGNPNLSKYGQTSYNILNWLQSGGT 80 (590) Q Consensus 1 Mp~yl~PGVYveEi~s~~~~i~gv~Tsv~~~vg~~~~Gp~~~p~~v~s~~e~~~~fG~~~~~~~~~l~~av~~ff~nGG~ 80 (590) |. ||||||||||++ ++++|+||+|++++|+|.++|||+++|++|+||.||++.||+++ .++++.|+|++||.|||+ T Consensus 1 ~~-~~~Pgvyv~e~~-~~~~i~~~~t~~~~~vg~~~~gp~~~p~~i~s~~~~~~~fg~~~--~~~~~~~~~~~~f~~gg~ 76 (679) T protein:vir:10 1 MT-LLSPGVETKEIN-LQTTIARSSTGRAALVGKFNWGPAYQISQVVSEVDLVDKFGRPD--DQTADSFFSGVNFLNYGN 76 (679) T ss_pred Cc-eecCceEEEeec-CCcccccCccccceeeecccCCCCccCEEecCHHHHHHHcCCcc--cccchHHHHHHHHHhCCC Confidence 76 999999999995 89999999999999999999999999999999999999999986 447899999999999995 Q ss_pred -EEEEEEecCCc-cccccccccceeecccccc---cceeeeeeccccccccccceEEEeeccccCCcceeeEeeccc--- Q lcl|NC_016163. 81 -AYVLRVMPDDA-KFANSLISIKTTAAADPAK---ATVLVTAKAQTTNTASKNAMKTILSGGTAGETPLCFIVPKGR--- 152 (590) Q Consensus 81 -~~vvRv~~~~a-~~a~~~~~~~~~~a~~~~~---~~~~v~~~~~~~~~a~~~~~~~~~~~~t~~~~~~~~~~~~~~--- 152 (590) ||||||.+..+ +++..........+..++. +.+.++.........+... .....++.. .. ...+... 
T Consensus 77 ~~~vvrv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~s~~~~~~~-~~~~~~~~~--~~--~~v~~~~~~~ 151 (679) T protein:vir:10 77 DLRLVRVLNETKSRNSSALYQSLSYTITSPGVDYKVGDVVNVLQGGNVIATGKV-TVVNASGGI--VA--FYVPTAAIID 151 (679) T ss_pred eEEEEEccCcccccccccccccccccccccccccccccceeeeeCCCcccceeE-EEeeccCce--ee--eeeccccccc Confidence 99999987543 3444333333333333332 2222222211111111110 001111100 00 0000000 Q ss_pred -cccccc---ceEEEEeeccccccccccccceeeeeecccCCCceeeeeeeeeccccccc-cccccceeeeeeccc-cce Q lcl|NC_016163. 153 -GENYNG---YGFRLSLRSDYDNTYNFRTYNLSVTVKDSTGADVVVEGPYIVSFDPEAKD-KSRQSIYYANIINKY-SQY 226 (590) Q Consensus 153 -g~~~~~---~~~~~~~~~~~~~~~~~~~~~l~i~v~d~~~~~~v~e~~~~ls~~~da~~-~~~~~~~~~~vv~~~-s~~ 226 (590) ...... .......+...............+........-.+ ........... ............... ... T Consensus 152 ~a~~~~~~~~l~~a~~~~~~~~t~~~g~~~~~~v~~v~~~~~~~~----~~~~~a~~~i~~~~~~~~t~~~~~~~~~~~~ 227 (679) T protein:vir:10 152 KAKSLNDYPALDNAWQIQFAAGGPGAGQAATATVVGINLDSTIFV----PNDEYAMSAISERSETKRTFIDICEEMKVPA 227 (679) T ss_pred ccccccccceecccceeeeeeccccccceeeeeeeeeccCCceee----ccccccccccccccccchhhhhhhhccccce Confidence 000000 00000000000000000000000000000000000 00000000000 000000000000000 000 Q ss_pred ee--ecCccccceeeee--ec------ccccccCccccceeccccc----ccc--c-ccccccccccceeecccccc--- Q lcl|NC_016163. 227 VE--IVDNRSAFETISE--FV------VGDSEADPQKVDIIFGQER----AVT--P-AETIHANVVWKSSSVETDDP--- 286 (590) Q Consensus 227 v~--~~~~~~~~~~~~~--~~------~~~~~~~~~~~~~~~~~~~----~~~--~-~~~~~~~~~~~~~~~~~~~~--- 286 (590) +. ..+.......... .. ................... ... . .................... 
T Consensus 228 ~~A~~~g~~gn~i~v~~va~~~~~~~~~~~a~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vvv~~~g~~~~ 307 (679) T protein:vir:10 228 IVARYAGTYGDNIKVLMIAYKDYYKFNEAGKIVSVNTINPKVFPTGLDYGNVTPSSYLEFGPQNESQFAFIVFNNGVAVE 307 (679) T ss_pred eeeecccccCCcceEEEEeecccccccccccccccccccccccccccccccceeeeecccccccccceeeEEeccccccc Confidence 00 0000000000000 00 0000000000000000000 000 0 00000000000000000000 Q ss_pred cc----ccccccc------------c-cceee---------eeccccccccccccceeccchhhHHHHHHHhhhccCCce Q lcl|NC_016163. 287 SY----DATAANF------------N-NIQYL---------TEGSEGTWTGGNEESALLVKGYSGVLAPEILDKQQYEID 340 (590) Q Consensus 287 ~~----~~~~~~~------------~-~~~~~---------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 340 (590) .. ....... . ....+ .......+.++.++...............+...+...++ T Consensus 308 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 387 (679) T protein:vir:10 308 SKILSTKPGDRDIYGTSIYINEYFGNGYSSFVQGVAESWPVGYTGVLAFGGGQSSNTDISAAEFMKGWDMFADREHTDVN 387 (679) T ss_pred ceeeecccccccccchhhhhhhhhcCcccceeeeccccccccccceeeccCCccCCCccchhhhhhhhhhhhcccccccc Confidence 00 0000000 0 00000 000111222333332221212122222333444556677 Q ss_pred eeecccc-------hhHHHHHHHHHHHHhcCCeEEEEecCCC--------CCHHHHHHHHHhh----------cCcccce Q lcl|NC_016163. 341 VLLDGNN-------EVAVKNAMSDLCSEQRGDCIAILDCSFQ--------GDAQQTIDYRTGN----------ISMSTYF 395 (590) Q Consensus 341 ~~~~~~~-------~~~~~~a~~~~~~~~~~~~~a~~d~p~~--------~~~~~~~~~~~~~----------~~~~s~~ 395 (590) +++.|+. ..+++.+++.||+++ ++||+|+|+|.+ .+.+++.+||... .+++|.| T Consensus 388 ~l~~p~~~~~~~~~~~~v~~~l~~~~~~~-~~~~ai~d~p~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~s~~ 466 (679) T protein:vir:10 388 LFIAGAVAGEGAQIASTVQKAVVAIADER-RDCLVLISPPREYMINQPAASVVRKLVDWRRGVNQAGISLDDNMNIGTTY 466 (679) T ss_pred eEEecCCCCCchhhhHHHHHHHHHHHHhh-CCeEEEEeccccccccccccchHHHHHHHHhhcccccchhhhhhccCcce Confidence 7877753 357899999999764 689999999865 3457888998642 2568999 Q ss_pred EEEEcCeEEEeecccCceeeecHHHHHHHHHHHhhccCCceECcCCcccceeeccccceeecChhHHhhhhhcCceEEEE Q lcl|NC_016163. 
396 TAIFGQHMNVYDEYNGETITVTSTYFLASMIPSNDDQNGIQWTFVGPRRGVISGFTDINFYPNEPWKEKLYLAQVNYIER 475 (590) Q Consensus 396 ~~~~~p~~~~~d~~~~~~~~~ppsg~~AG~~A~~D~~~G~~~sPan~~~~~i~g~~~~~~~~~~~e~~~Ln~~gIn~i~~ 475 (590) +++||||++++|+.+++.+++||||++||+|||+|.++||||||||+++++|.|++++++.++++|++.||++||||||+ T Consensus 467 ~~~~~p~~~~~d~~~~~~~~~p~sg~vAGl~Ar~D~~~g~~~sPan~~~~~i~g~~~~~~~~~~~~~~~Ln~~gin~i~~ 546 (679) T protein:vir:10 467 ASVDGNYKYQYDKYNDVNRWIPLAADIAGLCARTDTVGQPWQSPAGFNRGQIVNVIKLAVDTRQAHRDEMYTNGINPIVG 546 (679) T ss_pred EEEEccceeeecccCCceEEechHHHHHHHHHHhhccCCcEECcCCeeeccccccccceeecChhhHHhhhhCCceEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ecCCeEEEecceecCCC-cccceehhhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhCCceEE---E Q lcl|NC_016163. 476 DPKKISFATQLTSQTSR-SALSYINNVRVLLRIRREVEKMMADYRQEFQDNTTYDSMSYSLNNYLQQWVANRACSS---I 551 (590) Q Consensus 476 ~~~~G~~~wG~rT~s~d-~~~~~i~vrR~~~~i~~si~~~~~~~vfepn~~~l~~~v~~~i~~~L~~l~~~ga~~~---~ 551 (590) |+++|+++||+||++++ ++|+||||||||+|||++|+++++|+||||||+.||++|+++|++||++||++|+|.| + T Consensus 547 ~~g~G~~~wG~rT~~~~~s~~~~i~vrR~~~~i~~si~~~~~~~v~epn~~~~~~~i~~~i~~fL~~l~~~gal~gf~v~ 626 (679) T protein:vir:10 547 FAGQGYILYGDKTASQAPTPFDRINVRRLFNLLKKSISESAKYKLFELNDAFTRSSFRSEVGSYLDTIRSLGGIYDFRVV 626 (679) T ss_pred ecCCeEEEEcccccCCCCcccceEehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEE Confidence 99999999999999876 5899999999999999999999999999999999999999999999999999999876 6 Q ss_pred ecCCCCCHHHhhCCEEEEEEEEEecCccceEEEEEEeeC Q lcl|NC_016163. 
552 SGTVYASDYDKQQSIARVKVELVFTGVIERIAIDLVVNK 590 (590) Q Consensus 552 ~d~~~nt~~~i~~G~l~~~i~~ap~~paefi~~~~~~~~ 590 (590) ||+++||++||++|+|+++|+|+|++|||||+|||.|.+ T Consensus 627 ~d~~~nt~~~i~~G~~~~~i~~~p~~pae~i~~~~~~~~ 665 (679) T protein:vir:10 627 CDESNNTPAVIDRNEFVATILIKPARSINYITLSFVATS 665 (679) T ss_pred EcCCCCCHHHhhCCeEEEEEEEEecCCccEEEEEEEEee Confidence 999999999999999999999999999999999999999 No 4 >protein:vir:104858 Length: 729 # NCBI annotation: T4-like tail sheath protein # Family: family:all:661 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214361;genbank:gi:61806001;genbank:GeneID:3294378 Probab=100.00 E-value=8.6e-129 Score=722.68 Aligned_cols=558 Identities=16% Similarity=0.131 Sum_probs=350.1 Q ss_pred Cc-cccCCceEEEEecCCCceecccccceeEEEeecCCCCCCccEEecCHHHHHHhcCCccccccccHHHHHHHHHcCCC Q lcl|NC_016163. 1 MA-DYLHPSVSSRIVDNSAVYATAAGNTVLYAAIHSAIGRDNAVEFVTTTDEFLFKFGNPNLSKYGQTSYNILNWLQSGG 79 (590) Q Consensus 1 Mp-~yl~PGVYveEi~s~~~~i~gv~Tsv~~~vg~~~~Gp~~~p~~v~s~~e~~~~fG~~~~~~~~~l~~av~~ff~nGG 79 (590) || +||||||||||+|+++++|+||+|++++|+|.++|||+++|++|+||.||+++||++....+.++.|+|++||.||| T Consensus 1 m~~~~~~PgVyv~e~~~~~~~i~~v~ts~~~fvG~~~~Gp~~~p~~i~s~~~~~~~fG~~~~~~~~~~~~~~~~~f~ngg 80 (729) T protein:vir:10 1 MPLNLASPGIVVREVDLTIGRVDPTSGSIGALVAPFAKGPVNDPQLIESEEDLLQTFGQPYSTDKHYEYWMVASSYLAYG 80 (729) T ss_pred CCccccCCceEEEEecCCCcccccccccceeEEeccccCCCccCeEcCCHHHHHHHcCccccCCcchhHHHHHHHHHhCC Confidence 99 79999999999999999999999999999999999999999999999999999999865566788999999999999 Q ss_pred -cEEEEEEecCCccccccccc---------------------------------cceeecccccccceeeeeeccccccc Q lcl|NC_016163. 80 -TAYVLRVMPDDAKFANSLIS---------------------------------IKTTAAADPAKATVLVTAKAQTTNTA 125 (590) Q Consensus 80 -~~~vvRv~~~~a~~a~~~~~---------------------------------~~~~~a~~~~~~~~~v~~~~~~~~~a 125 (590) +||||||++++++.+..... .....+..++.|.+.+.......... 
T Consensus 81 ~~~~vvRv~~~~~~~a~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~G~~gn~~~v~v~~~~~~ 160 (729) T protein:vir:10 81 GTMQVVRADDYNTQTGVGLKNAFVTGGVGVGATMLRITSNTHYNQLGYDENTISGVSVAAKNPGTWANGIKVAIIDGKAD 160 (729) T ss_pred ceEEEEecCcccccccccccccccccccccccccccccccccccccccccCCCcceEEEEeccCccccceeeEEecccCc Confidence 59999998765544332211 01112233333332221110000000 Q ss_pred c----------ccceEE-EeeccccCCcceeeE---eeccccccccc--ceEE-----------EEeecccccccc---- Q lcl|NC_016163. 126 S----------KNAMKT-ILSGGTAGETPLCFI---VPKGRGENYNG--YGFR-----------LSLRSDYDNTYN---- 174 (590) Q Consensus 126 ~----------~~~~~~-~~~~~t~~~~~~~~~---~~~~~g~~~~~--~~~~-----------~~~~~~~~~~~~---- 174 (590) . ...... ............... ........... .... ............ T Consensus 161 ~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~s~~~~~~~~~~~~~~~~~~~~~~~~ 240 (729) T protein:vir:10 161 QILTVASGNTTAVGSAVTQSISKTIGTATGTTTIDGVLKGIVTGSTDTTLEVKVISHISAAGVETAVEYQQNGTYTFDNS 240 (729) T ss_pred ceeeeeccccccceeeeeeeccccccccccceeeeeeecccccccccccccceecccccccccceeccccccceeeeccc Confidence 0 000000 000000000000000 00000000000 0000 000000000000 Q ss_pred ---------------------------------------------------------ccccceeeeeecc-----cCCCc Q lcl|NC_016163. 175 ---------------------------------------------------------FRTYNLSVTVKDS-----TGADV 192 (590) Q Consensus 175 ---------------------------------------------------------~~~~~l~i~v~d~-----~~~~~ 192 (590) .....+...+.|. ..... T Consensus 241 ~s~~~~a~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~d~~~~~~~d~~~~~~~~~g~ 320 (729) T protein:vir:10 241 GSVNVIAAGSSGSGSAKSYTAQTDWFESQNIVLSNSTLEWDSIADAPGTSTYVSTRGGKNDEIHVLVIDDKGTITGNSGT 320 (729) T ss_pred CccceeeeccccccccccceeeeccccccccccccccccccccccccccccccccccccccccceeeeccccccccCccc Confidence 0000000000110 00112 Q ss_pred eeeeeeeeeccccccccccccceeeeeeccccceeeecCccccceeeeeecccccccCccccceeccccccccccccccc Q lcl|NC_016163. 
193 VVEGPYIVSFDPEAKDKSRQSIYYANIINKYSQYVEIVDNRSAFETISEFVVGDSEADPQKVDIIFGQERAVTPAETIHA 272 (590) Q Consensus 193 v~e~~~~ls~~~da~~~~~~~~~~~~vv~~~s~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 272 (590) ++|.+..++.+.+.....+...++.++++..+.++...+................ T Consensus 321 vve~~~~~s~~~~~~~~~~~~~~~~~vi~~~s~~~~~~~~~~~~~~~~~~~~~~~------------------------- 375 (729) T protein:vir:10 321 ILEKHLSLSKAKDAEYSVGSSSYWRDFLATNSKYIFGGGATSGITTTGYSVSSTN------------------------- 375 (729) T ss_pred ceeeeeeeeeccccccccccccccceeeccccceeeecccccccccccccccccc------------------------- Confidence 2333334444444333333444444444444444333322211111000000000 Q ss_pred ccccceeecccccccccccc-cccccceeeeec---cccccccccccceeccchhhHHHHHHHhhhccCCceeeeccc-- Q lcl|NC_016163. 273 NVVWKSSSVETDDPSYDATA-ANFNNIQYLTEG---SEGTWTGGNEESALLVKGYSGVLAPEILDKQQYEIDVLLDGN-- 346 (590) Q Consensus 273 ~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-- 346 (590) ................. ...........+ .+.......++... ..+.....+..+.+.+...+..++.+. T Consensus 376 ---~~~~~~~~~~~a~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~-~~~~~~~g~~~l~~~~~~~~~~~~~~~~~ 451 (729) T protein:vir:10 376 ---TLDTDSGWDQNAEGVNFGASGVATLTLAGGTNYGDKTDLTTSGALSS-GVDDIISGYTLFENTEEIEVDFILMGAAH 451 (729) T ss_pred ---eeccccccccccccccccccceeEEEeeccccccccccccccccccc-chhHHHHHHHHhhcccccccceeeecCCC Confidence 00000000000000000 000000000000 00000000001001 111112234445555555555444332 Q ss_pred ----chhHHHHHHHHHHHHhcCCeEEEEecCCCC-----------------CHHHHHHHHHhhcCcccceEEEEcCeEEE Q lcl|NC_016163. 347 ----NEVAVKNAMSDLCSEQRGDCIAILDCSFQG-----------------DAQQTIDYRTGNISMSTYFTAIFGQHMNV 405 (590) Q Consensus 347 ----~~~~~~~a~~~~~~~~~~~~~a~~d~p~~~-----------------~~~~~~~~~~~~~~~~s~~~~~~~p~~~~ 405 (590) +...++.+++.+|++ +++||+++|+|... 
..+++..|+..+ .++.|+++||||+++ T Consensus 452 ~~~~~~~~v~~a~~~~~~~-~~~~~a~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~p~~~~ 528 (729) T protein:vir:10 452 HPKEQSQAVAEKVTAVAEA-RKDAVAFISPYRQAFLNDTVSGTVTVSNIDQTTENVVGFYAPL--SSSTYSVFDSGYKYM 528 (729) T ss_pred CCccchHHHHHHHHHHHHh-cCCeEEEecccccccccccccccccccccchhhHHHHHHHhhc--cCCceEEEEcCeeEE Confidence 346788899999975 57899999988432 234566677654 368899999999999 Q ss_pred eecccCceeeecHHHHHHHHHHHhhccCCceECcCCcccceeeccccceeecChhHHhhhhhcCceEEEEecCCeEEEec Q lcl|NC_016163. 406 YDEYNGETITVTSTYFLASMIPSNDDQNGIQWTFVGPRRGVISGFTDINFYPNEPWKEKLYLAQVNYIERDPKKISFATQ 485 (590) Q Consensus 406 ~d~~~~~~~~~ppsg~~AG~~A~~D~~~G~~~sPan~~~~~i~g~~~~~~~~~~~e~~~Ln~~gIn~i~~~~~~G~~~wG 485 (590) +|+.++..+++|||+++||+|||+|.++||||||||+++.+|.|+.++++.++++|++.||++||||||+|+++|+++|| T Consensus 529 ~d~~~~~~~~~p~s~~~aGl~a~~d~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG 608 (729) T protein:vir:10 529 FDRFNNTFRYVPLNGDIAGTCARTDIEQFPWFSPAGTARGPILNSVKLVYNPGKKQRDILYSNRINPVILSPGAGIILFG 608 (729) T ss_pred ecccCCceEEechhHHHHHHHHHhhccCCcEEccCCccccceecccceeeecChhhHhhhhhCCceEEEEecCCeEEEEc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceec-CCCcccceehhhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhCCceEE---EecCCCCCHHH Q lcl|NC_016163. 
486 LTSQ-TSRSALSYINNVRVLLRIRREVEKMMADYRQEFQDNTTYDSMSYSLNNYLQQWVANRACSS---ISGTVYASDYD 561 (590) Q Consensus 486 ~rT~-s~d~~~~~i~vrR~~~~i~~si~~~~~~~vfepn~~~l~~~v~~~i~~~L~~l~~~ga~~~---~~d~~~nt~~~ 561 (590) +||+ +.|++|+||||||||+||+++|+++++|+||||||+.||++|+++|++||++||++|+|.| +||+++||++| T Consensus 609 ~rT~~~~d~~~~~i~vrR~~~~i~~si~~~~~~~v~epn~~~~~~~i~~~i~~~L~~l~~~g~l~g~~v~~d~~~nt~~~ 688 (729) T protein:vir:10 609 DKTGFGKSSAFDRINVRRLFIYLEDAISAAAKDQLFEFNDELTRTNFVNIVEPFLRDVQAKRGIFDFVVICDETNNTAAV 688 (729) T ss_pred ceecCCCCcccceeehhhhHHHHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHHHHHHHhccceeeeEEEEcCCCCCHHH Confidence 9998 5799999999999999999999999999999999999999999999999999999999866 69999999999 Q ss_pred hhCCEEEEEEEEEecCccceEEEEEEeeC Q lcl|NC_016163. 562 KQQSIARVKVELVFTGVIERIAIDLVVNK 590 (590) Q Consensus 562 i~~G~l~~~i~~ap~~paefi~~~~~~~~ 590 (590) |++|+|+++|+++|++|+|||+|||.|.| T Consensus 689 i~~G~~~~~v~~~p~~p~e~i~~~~~~~~ 717 (729) T protein:vir:10 689 IDSNEFVADIFIKPARSINFIGLTFVATR 717 (729) T ss_pred hhCCeEEEEEEEEecCCccEEEEEEEEee Confidence 99999999999999999999999999999 No 5 >protein:vir:98263 Length: 664 # NCBI annotation: gp18 tail sheath monomer # Family: family:all:661 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239196;genbank:gi:66391671;genbank:GeneID:3416365 Probab=100.00 E-value=1.6e-128 Score=721.14 Aligned_cols=579 Identities=14% Similarity=0.113 Sum_probs=348.1 Q ss_pred CccccCCceEEEEecCCCceecccccceeEEEeecCCCCCCccEEecCHHHHHHhcCCccccccccHHHHHHHHHcCCCc Q lcl|NC_016163. 
1 MADYLHPSVSSRIVDNSAVYATAAGNTVLYAAIHSAIGRDNAVEFVTTTDEFLFKFGNPNLSKYGQTSYNILNWLQSGGT 80 (590) Q Consensus 1 Mp~yl~PGVYveEi~s~~~~i~gv~Tsv~~~vg~~~~Gp~~~p~~v~s~~e~~~~fG~~~~~~~~~l~~av~~ff~nGG~ 80 (590) |+ ||||||||||++ ++++|+||+|++++|+|.++|||+++|++|+||.||++.||+++ .+++++|+|++||.|||+ T Consensus 1 ma-~~~PgVyv~E~~-~~~~i~~~~ts~~~~vG~~~~Gp~~~p~~i~s~~d~~~~fG~~~--~~~~~~~~v~~~f~ngg~ 76 (664) T protein:vir:98 1 MA-LQSPGIETKETS-VQSTVVRNSTGRAAIVGKFSWGPAYQIRQISNEVELVNYFGAPD--NLTADYFMSAVNFLQYGN 76 (664) T ss_pred Cc-eecCceEEEecC-CCcccccccccceEEEeeccCCCCCccEEecCHHHHHHhcCCcc--ccchhHHHHHHHHHhcCC Confidence 99 999999999995 89999999999999999999999999999999999999999986 447799999999999995 Q ss_pred -EEEEEEecC-Cccccccccccceeecccccc---cceeeeeecc----------ccccccccceEEEeeccccCCccee Q lcl|NC_016163. 81 -AYVLRVMPD-DAKFANSLISIKTTAAADPAK---ATVLVTAKAQ----------TTNTASKNAMKTILSGGTAGETPLC 145 (590) Q Consensus 81 -~~vvRv~~~-~a~~a~~~~~~~~~~a~~~~~---~~~~v~~~~~----------~~~~a~~~~~~~~~~~~t~~~~~~~ 145 (590) ||||||++. .++++.............++. ..+.+..... .....+.+............. . T Consensus 77 ~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~gn~~~v~i~~~~~~~---~ 153 (664) T protein:vir:98 77 DLRLVRVVDKDAAKNASAIFNQIKTTIASQGSNYNVGDVIKVKYSNTVVEENGKVTQVDSNGKILAVTIPKRKKSL---L 153 (664) T ss_pred eEEEEEecCccccccccccccccceeeccCCcccccccceeEeecCcccccceeeccCCCCCceeeEeeccCccce---e Confidence 999999864 455555444444444333331 1111111110 011122222221111110000 0 Q ss_pred eEeeccccccc----ccceEEEEeeccccccc---cccc-cceeeeeecccC----CCceeeeeeeeec-ccccccc-cc Q lcl|NC_016163. 146 FIVPKGRGENY----NGYGFRLSLRSDYDNTY---NFRT-YNLSVTVKDSTG----ADVVVEGPYIVSF-DPEAKDK-SR 211 (590) Q Consensus 146 ~~~~~~~g~~~----~~~~~~~~~~~~~~~~~---~~~~-~~l~i~v~d~~~----~~~v~e~~~~ls~-~~da~~~-~~ 211 (590) .......... ....+.. ......... .... ........+... ............. ...+... .. 
T Consensus 154 -~~~~~~~~~~~~~~~~~s~~~-~s~g~a~a~~v~~v~~d~~~~~~~~~~a~~~i~~~~~~~~~~~~~~~~~~a~~~G~~ 231 (664) T protein:vir:98 154 -VLNRSVLTQIFLLVGTTEIVS-QSSGVSASITIDGIESDSGITLLNLDIAKETIQGTSFQTLTQKYQIPSVVALYPGEL 231 (664) T ss_pred -ecccccccccceecccceeee-eecccceeeecccccccceeeccccceeeeccccccceeeeeccccceeeeeecccc Confidence 0000000000 0000000 000000000 0000 000000000000 0000000000000 0000000 00 Q ss_pred ccceeeeeeccc--cc--eeeecCccccceeeeeecccccccCccccceecccccccc---cccccccccccceeecccc Q lcl|NC_016163. 212 QSIYYANIINKY--SQ--YVEIVDNRSAFETISEFVVGDSEADPQKVDIIFGQERAVT---PAETIHANVVWKSSSVETD 284 (590) Q Consensus 212 ~~~~~~~vv~~~--s~--~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~ 284 (590) .......+.... .. .+............................+.+..++... ................... T Consensus 232 Gn~isv~i~s~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~ 311 (664) T protein:vir:98 232 GSTVQVEIISKAAYDTGAMISGYPSGISVKNSGRSVMTYGPQTDNQYAFVVRRGGIVQESFIVSTDKTDKDIYGVNIYMD 311 (664) T ss_pred cceeeeeecccccccCcceEeeccCceecccceeeeeeccccCccceeEEEecCCceeeeEEeecccCcccceeeeeech Confidence 000000000000 00 0000000000000000000000000000011111111000 0000000000000000000 Q ss_pred cccccccccccccceeeee----ccccccccccccceeccchhhHHHHHHHhhhccCCceeeecccch-------hHHHH Q lcl|NC_016163. 285 DPSYDATAANFNNIQYLTE----GSEGTWTGGNEESALLVKGYSGVLAPEILDKQQYEIDVLLDGNNE-------VAVKN 353 (590) Q Consensus 285 ~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~ 353 (590) ...... ............ .......++.+.......+.....+..+.+.+...+++++.|+.. .+++. 
T Consensus 312 ~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~gg~~~~~~~g~~~~~tgl~~l~~~~~~~~~ll~~p~~~~~~~~~~~~v~~ 390 (664) T protein:vir:98 312 DFFANG-GSQYVFGTSMNWPKGFSGILEFGGGLSSNDTVGADELMTGWDMFADREALHVPLLIAGGCAGESVEIASTVQK 390 (664) T ss_pred hheecc-cceeeeeecccCCcccceeEeccCccccccccCchhHHHHHHhhhcccccccceEEecCCCCCcHHHHHHHHH Confidence 000000 000000000000 000122233332222222222334455666677778888777632 46888 Q ss_pred HHHHHHHHhcCCeEEEEecCC--------CCCHHHHHHHHHh-----------hcCcccceEEEEcCeEEEeecccCcee Q lcl|NC_016163. 354 AMSDLCSEQRGDCIAILDCSF--------QGDAQQTIDYRTG-----------NISMSTYFTAIFGQHMNVYDEYNGETI 414 (590) Q Consensus 354 a~~~~~~~~~~~~~a~~d~p~--------~~~~~~~~~~~~~-----------~~~~~s~~~~~~~p~~~~~d~~~~~~~ 414 (590) +++.||++ +++||+++|+|. +.+.+++++||+. ..+++|.|+++||||++++|+.+++.+ T Consensus 391 al~~~a~~-~~~~~a~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~l~~p~~~~~d~~~~~~~ 469 (664) T protein:vir:98 391 HVISIGDE-RQDCTVFVSPPRSLLVNIPLATAVDNIVEWRTGYKISGGTPVDNNLNVSSSYGFLDGNYKYQYDKYNDVNR 469 (664) T ss_pred HHHHHHHh-cCCeEEEEccccceeccCCccccHHHHHHHhhhccccccchhhhhcCCccceEEEEcCeEEEecccCCceE Confidence 89999976 568999999873 5678899999975 235789999999999999999999999 Q ss_pred eecHHHHHHHHHHHhhccCCceECcCCcccceeeccccceeecChhHHhhhhhcCceEEEEecC-CeEEEecceecCCC- Q lcl|NC_016163. 
415 TVTSTYFLASMIPSNDDQNGIQWTFVGPRRGVISGFTDINFYPNEPWKEKLYLAQVNYIERDPK-KISFATQLTSQTSR- 492 (590) Q Consensus 415 ~~ppsg~~AG~~A~~D~~~G~~~sPan~~~~~i~g~~~~~~~~~~~e~~~Ln~~gIn~i~~~~~-~G~~~wG~rT~s~d- 492 (590) ++||||++||+|||+|.++||||||||+++.+|.|+.++++.+++.|++.||++|||||+.|++ +|+++||+||++++ T Consensus 470 ~~p~sg~~AGl~A~~D~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~~G~~~wG~rT~~~~~ 549 (664) T protein:vir:98 470 WVPLAGDIAGLCVYTDSVANPWMSPAGYNRGQIRNCIKLAIEPRTAHRDAMYQVQINPVTGFAGGSGFVLYGDKTLTSVP 549 (664) T ss_pred EechHHHHHHHHHHhhhcCCcEECcCCceeeeeeccccceeecChhhHHHHHhCCCeEEEEeeCCCcEEEEcccccCCCC Confidence 9999999999999999999999999999999999999999999999999999999999999998 69999999999875 Q ss_pred cccceehhhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhCCceEE---EecCCCCCHHHhhCCEEEE Q lcl|NC_016163. 493 SALSYINNVRVLLRIRREVEKMMADYRQEFQDNTTYDSMSYSLNNYLQQWVANRACSS---ISGTVYASDYDKQQSIARV 569 (590) Q Consensus 493 ~~~~~i~vrR~~~~i~~si~~~~~~~vfepn~~~l~~~v~~~i~~~L~~l~~~ga~~~---~~d~~~nt~~~i~~G~l~~ 569 (590) ++|+||||||||+||+++|+++++|+||||||+.||++|+++|+.||++||++|+|.| +||+++||++||++|+|++ T Consensus 550 s~~~~i~vrR~~~~i~~si~~~~~~~v~epn~~~l~~~i~~~i~~~L~~l~~~gal~g~~V~~d~~~nt~~~i~~G~~~~ 629 (664) T protein:vir:98 550 SPFDRINVRRLFNMIKKDIGDNAKYKLFENNDDFTRASFRMDTGQYMTNIRALGGCYDYRVICDTTNNTPDVIDRNEFVA 629 (664) T ss_pred cccceEeehhHHHHHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHHHHHHHhcCceeeeEEEEcCCCCCHHHhhCCeEEE Confidence 5899999999999999999999999999999999999999999999999999999866 6999999999999999999 Q ss_pred EEEEEecCccceEEEEEEeeC Q lcl|NC_016163. 
570 KVELVFTGVIERIAIDLVVNK 590 (590) Q Consensus 570 ~i~~ap~~paefi~~~~~~~~ 590 (590) +|+++|++|||||+|||.|++ T Consensus 630 ~i~~~p~~pae~I~~~~~q~~ 650 (664) T protein:vir:98 630 TVYVKPPRSINYITLNFVATS 650 (664) T ss_pred EEEEEecCCcceEEEEEEEee Confidence 999999999999999999999 No 6 >protein:vir:103456 Length: 659 # NCBI annotation: tail sheath monomer # Family: family:all:661 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803108;genbank:gi:116326388;genbank:GeneID:4405485 Probab=100.00 E-value=7.5e-128 Score=717.52 Aligned_cols=576 Identities=14% Similarity=0.124 Sum_probs=345.8 Q ss_pred CccccCCceEEEEecCCCceecccccceeEEEeecCCCCCCccEEecCHHHHHHhcCCccccccccHHHHHHHHHcCCCc Q lcl|NC_016163. 1 MADYLHPSVSSRIVDNSAVYATAAGNTVLYAAIHSAIGRDNAVEFVTTTDEFLFKFGNPNLSKYGQTSYNILNWLQSGGT 80 (590) Q Consensus 1 Mp~yl~PGVYveEi~s~~~~i~gv~Tsv~~~vg~~~~Gp~~~p~~v~s~~e~~~~fG~~~~~~~~~l~~av~~ff~nGG~ 80 (590) |. ||||||||||+|+++++|++ +|++++|+|.++|||+++|++|+||.||+++||+++ .+++++|+|++||.|||+ T Consensus 1 ~~-~~~PgVyv~e~~~~~~~~~~-~ts~~~fvG~~~~Gp~~~p~~i~s~~d~~~~fG~~~--~~~~~~~~v~~~f~ngg~ 76 (659) T protein:vir:10 1 MT-LLSPGIELKETTVQSTVVNN-STGTAALAGKFQWGPAFQIKQVTNEVDLVNTFGQPT--AETADYFMSAMNFLQYGN 76 (659) T ss_pred Cc-eecCceEEEEecCCceeccc-CccceEEEecccCCCCCccEEecCHHHHHHHcCCcC--CCcchhHHHHHHHhhCCC Confidence 87 99999999999999998876 899999999999999999999999999999999986 447899999999999995 Q ss_pred -EEEEEEecCC-ccccccccccceeecccccc-c--ceeeeeeccccccccccceEE--EeeccccCCcce---eeEeec Q lcl|NC_016163. 81 -AYVLRVMPDD-AKFANSLISIKTTAAADPAK-A--TVLVTAKAQTTNTASKNAMKT--ILSGGTAGETPL---CFIVPK 150 (590) Q Consensus 81 -~~vvRv~~~~-a~~a~~~~~~~~~~a~~~~~-~--~~~v~~~~~~~~~a~~~~~~~--~~~~~t~~~~~~---~~~~~~ 150 (590) ||||||++.+ ++++.............++. . ......... ...+.+...+ ....+....... ...... 
T Consensus 77 ~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~g~~~~v~~vd~~~~~~~~~i~~~~~~~~~ 154 (659) T protein:vir:10 77 DLRVVRAVDRDTAKNSSPIAGNIEYTISTPGSNYAVGDKITVKYV--SDAIETEGKITEVDTDGKIKKINIPTAKIIAKA 154 (659) T ss_pred eEEEEEccCcccccccccccccceeeEeecccccccccceeeeec--CCCccccceeeEEecccccceeeeccccccccc Confidence 9999998754 33443333332222222211 1 111111110 0111111111 111111000000 000000 Q ss_pred ccccccccceEEEEeeccccccccccccceeeeeecccCCCceeeeeeeeeccccccccc-------------cccceee Q lcl|NC_016163. 151 GRGENYNGYGFRLSLRSDYDNTYNFRTYNLSVTVKDSTGADVVVEGPYIVSFDPEAKDKS-------------RQSIYYA 217 (590) Q Consensus 151 ~~g~~~~~~~~~~~~~~~~~~~~~~~~~~l~i~v~d~~~~~~v~e~~~~ls~~~da~~~~-------------~~~~~~~ 217 (590) .....+.........+..... ......+............ ...+.....+.... ....... T Consensus 155 ~~~g~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~----~~~v~~~~~a~t~~~~~~~~~~~~~~~v~a~~~G 228 (659) T protein:vir:10 155 KEVGEYPTLGSNWTAEISSSS--SGLAAVITLGKIITDSGIL----LAEIENAEAAMTAVDFQANLKKYGIPGVVALYPG 228 (659) T ss_pred ccccccceeeeeeeeeeeeec--cccceeeEEeeeecCCcee----EEeeccccccccccccccceeecccccccccccc Confidence 000000000000000000000 0000000000000000000 00000000000000 0000000 Q ss_pred eeeccccceeeecCccccc--eeeeeecc------------cccccCccccceeccccccccccc---ccccccccceee Q lcl|NC_016163. 218 NIINKYSQYVEIVDNRSAF--ETISEFVV------------GDSEADPQKVDIIFGQERAVTPAE---TIHANVVWKSSS 280 (590) Q Consensus 218 ~vv~~~s~~v~~~~~~~~~--~~~~~~~~------------~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~ 280 (590) ...+..+..+......... ........ .................+...... ............ 
T Consensus 229 ~~g~~~tv~~~~~a~~~~~~~v~v~~~~~~~~~a~~~t~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~ 308 (659) T protein:vir:10 229 ELGDKIEIEIVSKADYAKGASALLPIYPGGGTRASTAKAVFGYGPQTDSQYAIIVRRNDAIVQSVVLSTKRGEKDIYDSN 308 (659) T ss_pred eecccceEEEechhhccccceeeeeeeeecccccccceeeeeeccccccchhhccccccceeeeeeeeccccccccccch Confidence 0000000000000000000 00000000 000000000000000000000000 000000000000 Q ss_pred ccccccccccccccccccee----eeeccccccccccccceeccchhhHHHHHHHhhhccCCceeeecccc-------hh Q lcl|NC_016163. 281 VETDDPSYDATAANFNNIQY----LTEGSEGTWTGGNEESALLVKGYSGVLAPEILDKQQYEIDVLLDGNN-------EV 349 (590) Q Consensus 281 ~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~ 349 (590) ..... .............. ........+.++.+................+...+..++++++.|+. .. T Consensus 309 ~~~~~-~~~~~~~~~v~~~~~~~~~~~~~~~~l~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~il~~p~~~~~~~~~~~ 387 (659) T protein:vir:10 309 IYIDD-FFAKGGSEYIFATAQNWPEGFSGILTLSGGLSSNAEVTAGDLMEAWDFFADRESVDVQLFIAGSCAGESLETAS 387 (659) T ss_pred hhhhh-hhccCcccEEEEeecccCCCccceeeecccccccccccchhHHHHHHHhhhccccceeEEEecCCCCcchhhhH Confidence 00000 00000000000000 00011122334433322222222333444556666677888888763 35 Q ss_pred HHHHHHHHHHHHhcCCeEEEEecCC--------CCCHHHHHHHHHhh-------cCcccceEEEEcCeEEEeecccCcee Q lcl|NC_016163. 350 AVKNAMSDLCSEQRGDCIAILDCSF--------QGDAQQTIDYRTGN-------ISMSTYFTAIFGQHMNVYDEYNGETI 414 (590) Q Consensus 350 ~~~~a~~~~~~~~~~~~~a~~d~p~--------~~~~~~~~~~~~~~-------~~~~s~~~~~~~p~~~~~d~~~~~~~ 414 (590) +++.+++.||+++ ++||+++|+|. +.+.+++.+||+.. 
.+++|+|+++||||++++|+.+++.+ T Consensus 388 ~v~~al~~~~~~~-~~~~~~~d~p~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~s~~~~l~~p~~~~~d~~~~~~~ 466 (659) T protein:vir:10 388 TVQKHVVSIGDAR-QDCLVLCSPPRETVVGIPVTRAVDNLVNWRTAAGSYTDNNFNISSTYAAIDGNYKYQYDKYNDVNR 466 (659) T ss_pred HHHHHHHHHHHhh-CCeEEEEcCccccccCCCcccCHHHHHHHHHhcccccccccccCcceEEEEeCcEEEecccCCceE Confidence 7889999999765 58999999874 45678999999753 25789999999999999999999999 Q ss_pred eecHHHHHHHHHHHhhccCCceECcCCcccceeeccccceeecChhHHhhhhhcCceEEEEecCCeEEEecceecCCC-c Q lcl|NC_016163. 415 TVTSTYFLASMIPSNDDQNGIQWTFVGPRRGVISGFTDINFYPNEPWKEKLYLAQVNYIERDPKKISFATQLTSQTSR-S 493 (590) Q Consensus 415 ~~ppsg~~AG~~A~~D~~~G~~~sPan~~~~~i~g~~~~~~~~~~~e~~~Ln~~gIn~i~~~~~~G~~~wG~rT~s~d-~ 493 (590) ++||||++||+|||+|.++||||||||+++++|.|+.++++.++++|++.||++||||||+|+++|+++||+||++++ + T Consensus 467 ~~p~sg~~AGl~Ar~D~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~~~~s 546 (659) T protein:vir:10 467 WVPLAADIAGLCARTDNVSQTWMSPAGYNRGQILNVIKLAIETRQAQRDRLYQEAINPVTGTGGDGYVLYGDKTATSVPS 546 (659) T ss_pred EechHHHHHHHHHHHhccCCceEccCCceeeeeeccccceecCCHhHHHHHhhCCeeEEEEeCCCeEEEEcccccCCCCc Confidence 999999999999999999999999999999999999999999999999999999999999999999999999999876 5 Q ss_pred ccceehhhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhCCceEE---EecCCCCCHHHhhCCEEEEE Q lcl|NC_016163. 
494 ALSYINNVRVLLRIRREVEKMMADYRQEFQDNTTYDSMSYSLNNYLQQWVANRACSS---ISGTVYASDYDKQQSIARVK 570 (590) Q Consensus 494 ~~~~i~vrR~~~~i~~si~~~~~~~vfepn~~~l~~~v~~~i~~~L~~l~~~ga~~~---~~d~~~nt~~~i~~G~l~~~ 570 (590) +|+||||||||+||+++|+++++|+||||||+.||++|+++|+.||++||++|+|++ +||+++||+++|++|+|+++ T Consensus 547 ~~~~i~vrR~~~~i~~si~~~~~~~v~e~n~~~l~~~i~~~i~~fL~~l~~~gal~~~~V~~d~~~nt~~~i~~G~~~~~ 626 (659) T protein:vir:10 547 PFDRINVRRLFNMLKTNIGRSSKYRLFELNNAFTRSSFRTETAQYLQGIKALGGIYEYRVVCDTTNNTPSVIDRNEFVAT 626 (659) T ss_pred ccceEehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhcCceeeEEEEEcCCCCCHHHhhCCeEEEE Confidence 899999999999999999999999999999999999999999999999999998765 79999999999999999999 Q ss_pred EEEEecCccceEEEEEEeeC Q lcl|NC_016163. 571 VELVFTGVIERIAIDLVVNK 590 (590) Q Consensus 571 i~~ap~~paefi~~~~~~~~ 590 (590) |+|+|++|||||+|||.|+| T Consensus 627 i~~~p~~pae~i~~~~~~~~ 646 (659) T protein:vir:10 627 FYIQPARSINYITLNFVATA 646 (659) T ss_pred EEEEecCCcceEEEEEEEEe Confidence 99999999999999999999 No 7 >protein:vir:108052 Length: 660 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595294;genbank:gi:161622600;genbank:GeneID:5783771 Probab=100.00 E-value=1.1e-127 Score=716.59 Aligned_cols=584 Identities=16% Similarity=0.125 Sum_probs=351.9 Q ss_pred CccccCCceEEEEecCCCceecccccceeEEEeecCCCCCCccEEecCHHHHHHhcCCccccccccHHHHHHHHHcCCC- Q lcl|NC_016163. 1 MADYLHPSVSSRIVDNSAVYATAAGNTVLYAAIHSAIGRDNAVEFVTTTDEFLFKFGNPNLSKYGQTSYNILNWLQSGG- 79 (590) Q Consensus 1 Mp~yl~PGVYveEi~s~~~~i~gv~Tsv~~~vg~~~~Gp~~~p~~v~s~~e~~~~fG~~~~~~~~~l~~av~~ff~nGG- 79 (590) |. ||||||||||++ ++++|+||+|++++|+|.++|||+++|++|+||.||++.||+++. 
+.++.|++++||.||| T Consensus 1 ~~-~~~Pgvyv~e~~-~~~~i~~~~t~~~~~vg~~~~gp~~~p~~v~s~~~~~~~fg~~~~--~~~~~~~~~~~f~~~g~ 76 (660) T protein:vir:10 1 MA-LLSPGIELKETS-VQSTVVRNATGRAALVGKFQWGPAFQVTQITNEVELVDLFGGPNN--EVADYFMSGMNFLQYGN 76 (660) T ss_pred Cc-eecCceEEEeec-CCccccCCCcccceEEeecCCCCCccCeEcCCHHHHHHHcCCcCC--CchhHHHHHHHHHhCCc Confidence 65 999999999995 789999999999999999999999999999999999999999864 4679999999999999 Q ss_pred cEEEEEEecCC-ccccccccccceeeccccc---ccceeeeeeccccccccccceEEEeeccccCCcce-ee---Eeecc Q lcl|NC_016163. 80 TAYVLRVMPDD-AKFANSLISIKTTAAADPA---KATVLVTAKAQTTNTASKNAMKTILSGGTAGETPL-CF---IVPKG 151 (590) Q Consensus 80 ~~~vvRv~~~~-a~~a~~~~~~~~~~a~~~~---~~~~~v~~~~~~~~~a~~~~~~~~~~~~t~~~~~~-~~---~~~~~ 151 (590) +||||||++.+ +++++.........+..++ .+++.++....................+....... .. ..... T Consensus 77 ~~~vvrv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~v~~~~~~~~~~~~v~~~~a~g~~~~~~~~ta~~~~~a~~ 156 (660) T protein:vir:10 77 DLRTVRVVSREFAKNASPIAGNIETTITTAGSNYAVGDKINIKYNQTVVESEGRVTSVDTDGKILSVFIPSAKIIAYARS 156 (660) T ss_pred eEEEEEecccccccccccccccceeEEeeccccccccceeeEeeccccccccccceeeccccceeeeccccccccccccc Confidence 59999998754 4555555444444444333 34444443322111111111111111110000000 00 00000 Q ss_pred ccccc---ccceEEEEeeccccc-ccc----ccccceeeeeecccCCCce-eeeeeeeeccc----ccccccccccee-e Q lcl|NC_016163. 152 RGENY---NGYGFRLSLRSDYDN-TYN----FRTYNLSVTVKDSTGADVV-VEGPYIVSFDP----EAKDKSRQSIYY-A 217 (590) Q Consensus 152 ~g~~~---~~~~~~~~~~~~~~~-~~~----~~~~~l~i~v~d~~~~~~v-~e~~~~ls~~~----da~~~~~~~~~~-~ 217 (590) .+... ............... ... .......+...+....... ........... .+.......... . 
T Consensus 157 v~~~~~~~~~~~~~~~~~~~~~~~a~sv~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~G~~i~v 236 (660) T protein:vir:10 157 LNQYPTLGPAWTAEVTSASSGVSGTITVGKIVTDSGILLTEAENSEEAITSLEFQAALKKFAMPGVVALYPGEIGSTLEV 236 (660) T ss_pred cccccccccceeEEEecccCccccceeeeeeeccCcceEEeeeccccccccccceeeccccccceeeeecccccCcceeE Confidence 00000 000000000000000 000 0000000000000000000 00000000000 000000000000 0 Q ss_pred eeeccc----cceeee--cCccccceeeeeecccccccCccccceeccccccccc---ccccccccccceeecccccccc Q lcl|NC_016163. 218 NIINKY----SQYVEI--VDNRSAFETISEFVVGDSEADPQKVDIIFGQERAVTP---AETIHANVVWKSSSVETDDPSY 288 (590) Q Consensus 218 ~vv~~~----s~~v~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~ 288 (590) .+.... .+.... ............................+..++.... ...................... T Consensus 237 ~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 316 (660) T protein:vir:10 237 EIVSKAAYEAGSSKMLDVYPGGGTRASIAKAVFNYGPQTDDQYAIIVRRDGAIVESVVLSTKEGEKDVYGNNIYLDDYFA 316 (660) T ss_pred EEeeccccCCcceeEEeeeeccceeeEEeeeecccccccccccccccccCCcccceeeeeccccccccccceeeeehhhc Confidence 000000 000000 0000000000000000000000000011111110000 0000000000000000000000 Q ss_pred cccccccccce----eeeeccccccccccccceeccchhhHHHHHHHhhhccCCceeeecccc-------hhHHHHHHHH Q lcl|NC_016163. 289 DATAANFNNIQ----YLTEGSEGTWTGGNEESALLVKGYSGVLAPEILDKQQYEIDVLLDGNN-------EVAVKNAMSD 357 (590) Q Consensus 289 ~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~a~~~ 357 (590) . ......... ........++.++.++......+........+...+...+++++.++. ..+++++++. 
T Consensus 317 ~-~~~~~v~~~~~~~~~~~~~~~~l~gg~~~~~~~t~~~~~~~~~~~~~~~~~~~~~l~~p~~~~~~~~~~~~v~~al~~ 395 (660) T protein:vir:10 317 K-GTSNYIYATSLNWPKGFSGIINLSGGISANDKVTAGDLMQGWDLFADREALHINLLIAGAVAGEGDEVASTVQKHVVS 395 (660) T ss_pred C-CCccEEEEEeccCCCCcccceeeeccccCccccccchhhhhhhhhhhhhhcccceEEEcCcCCCchhhhHHHHHHHHH Confidence 0 000000000 000011123444444433333333333444455555666777766542 3568889999 Q ss_pred HHHHhcCCeEEEEecCCC--------CCHHHHHHHHHhhc-------CcccceEEEEcCeEEEeecccCceeeecHHHHH Q lcl|NC_016163. 358 LCSEQRGDCIAILDCSFQ--------GDAQQTIDYRTGNI-------SMSTYFTAIFGQHMNVYDEYNGETITVTSTYFL 422 (590) Q Consensus 358 ~~~~~~~~~~a~~d~p~~--------~~~~~~~~~~~~~~-------~~~s~~~~~~~p~~~~~d~~~~~~~~~ppsg~~ 422 (590) ||+++ ++||+++|+|.+ .+.+++.+||+..+ +++|.|+++||||++++|+.+++.+++||||++ T Consensus 396 ~~~~~-~~~~aiid~p~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~sg~~ 474 (660) T protein:vir:10 396 IADER-QDCLAFISPPKGLLVNVPLTRAVDNLIDWRTGAGTFDANNMNISTTYAAIDGNYKYQYDKYNDVNRWVPLAADL 474 (660) T ss_pred HHHhh-CCEEEEEecCcccccccccccCHHHHHHHHhhcccccccccccCcceEEEEcCceEEecccCCceeEechhHHH Confidence 99764 689999999965 36789999997543 578999999999999999999999999999999 Q ss_pred HHHHHHhhccCCceECcCCcccceeeccccceeecChhHHhhhhhcCceEEEEecC-CeEEEecceecCCCc-ccceehh Q lcl|NC_016163. 
423 ASMIPSNDDQNGIQWTFVGPRRGVISGFTDINFYPNEPWKEKLYLAQVNYIERDPK-KISFATQLTSQTSRS-ALSYINN 500 (590) Q Consensus 423 AG~~A~~D~~~G~~~sPan~~~~~i~g~~~~~~~~~~~e~~~Ln~~gIn~i~~~~~-~G~~~wG~rT~s~d~-~~~~i~v 500 (590) ||+|||+|.++||||||||+++.+|.|+.++++.+++.|++.||++|||||++|++ +|+++||+||+++++ +|||||| T Consensus 475 AGl~Ar~D~~~g~~~sPan~~~~~i~g~~~~~~~~~~~e~~~Ln~~gIn~i~~~~~~~G~~~wG~rT~~~~~s~~~~i~v 554 (660) T protein:vir:10 475 AGLCARTDDVSQPWMSPAGYNRGQILNVLKLAIEPRQAQRDRMYQEAINPVVGFAGGDGFVLFGDKTATKVPSPMDHINV 554 (660) T ss_pred HHHHHHhhccCCcEEccCCeeeceeeccceeeecCChhhHHhHhhCCceEEEEeeCCCcEEEEcccccCCCCcccceEeh Confidence 99999999999999999999998999999999999999999999999999999987 799999999998875 8999999 Q ss_pred hhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhCCceEE---EecCCCCCHHHhhCCEEEEEEEEEecC Q lcl|NC_016163. 501 VRVLLRIRREVEKMMADYRQEFQDNTTYDSMSYSLNNYLQQWVANRACSS---ISGTVYASDYDKQQSIARVKVELVFTG 577 (590) Q Consensus 501 rR~~~~i~~si~~~~~~~vfepn~~~l~~~v~~~i~~~L~~l~~~ga~~~---~~d~~~nt~~~i~~G~l~~~i~~ap~~ 577 (590) ||||+||+++|+++++|+||||||+.||++|+++|+.||++||++|+|.| +||+++||++||++|+|+|+|+++|++ T Consensus 555 rR~~~~i~~si~~~~~~~v~epn~~~l~~~i~~~i~~fL~~l~~~gal~g~~V~~d~~~nt~~di~~G~~~~~i~~~P~~ 634 (660) T protein:vir:10 555 RRLFNMLKKNIGDASKYKLFELNDNFTRSSFRMEVSQYLDGIKALGGIYEGRVVCDTTVNTPAVIDRNEFIANIYVKPAR 634 (660) T ss_pred hhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEcCCCCCHHHhhCCeEEEEEEEEecC Confidence 99999999999999999999999999999999999999999999999876 599999999999999999999999999 Q ss_pred ccceEEEEEEeeC Q lcl|NC_016163. 
578 VIERIAIDLVVNK 590 (590) Q Consensus 578 paefi~~~~~~~~ 590 (590) |||||+|||.|+| T Consensus 635 pae~I~~~~~~~~ 647 (660) T protein:vir:10 635 SINYITLNFVATS 647 (660) T ss_pred CccEEEEEEEEee Confidence 9999999999999 No 8 >protein:vir:6894 Length: 660 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861870;genbank:gi:32453661;genbank:GeneID:1494296 Probab=100.00 E-value=7.9e-128 Score=717.41 Aligned_cols=576 Identities=15% Similarity=0.098 Sum_probs=347.2 Q ss_pred CccccCCceEEEEecCCCceecccccceeEEEeecCCCCCCccEEecCHHHHHHhcCCccccccccHHHHHHHHHcCCC- Q lcl|NC_016163. 1 MADYLHPSVSSRIVDNSAVYATAAGNTVLYAAIHSAIGRDNAVEFVTTTDEFLFKFGNPNLSKYGQTSYNILNWLQSGG- 79 (590) Q Consensus 1 Mp~yl~PGVYveEi~s~~~~i~gv~Tsv~~~vg~~~~Gp~~~p~~v~s~~e~~~~fG~~~~~~~~~l~~av~~ff~nGG- 79 (590) |. ||||||||||+ +++++|+||+|++++|+|.++|||+++|++|+||.||++.||+++. ++++.|+|++||.||| T Consensus 1 ~~-~~~PgVyv~e~-~~~~~i~~v~ts~~~fvG~~~~Gp~~~p~~i~s~~~~~~~fG~~~~--~~~~~~~~~~~f~~~g~ 76 (660) T protein:vir:68 1 MA-LLSPGVELKET-TVQSTVVNNSTGTAALAGKFQWGPAFQIKQITDEVALVDMFGTPNT--DTADYFMSAMNFLQYGN 76 (660) T ss_pred Cc-cccCceEEEEe-cCCcccccCCCcceeEEecccCCCCccCEEecCHHHHHHhcCCccC--ccchhHHHHHHHHhCCC Confidence 76 99999999999 5899999999999999999999999999999999999999999864 4679999999999998 Q ss_pred cEEEEEEecC-Cccccccccccceeecccccc---cceeeeeeccccccccccceEEEeeccccCCcceeeEeeccccc- Q lcl|NC_016163. 80 TAYVLRVMPD-DAKFANSLISIKTTAAADPAK---ATVLVTAKAQTTNTASKNAMKTILSGGTAGETPLCFIVPKGRGE- 154 (590) Q Consensus 80 ~~~vvRv~~~-~a~~a~~~~~~~~~~a~~~~~---~~~~v~~~~~~~~~a~~~~~~~~~~~~t~~~~~~~~~~~~~~g~- 154 (590) +||||||+++ .++++....+....+...++. +...+.....................+..... ..+..... 
T Consensus 77 ~~~vvRv~~~~~~~~~~~~~~~~~~t~~~~g~~~~~g~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~----~~~ta~~~~ 152 (660) T protein:vir:68 77 DLRVVRAVDRDTAKNSSPVAGNINFTISSAGTNYRVGDKVVVKYSTDIIEPDGEVTSVDSDGKILNI----FIPSGKIIA 152 (660) T ss_pred eEEEEEecccccccccccccccceeeeeccCcceeeeeeeeeecccccccccccceeeeecCceeee----eeccccccc Confidence 5999999853 455555554444444444432 22222222111111100001111111100000 00000000 Q ss_pred ----------ccccceEEEEeecccccccc-------ccccceeeeeecccCCCceeeeeeeeec--c---cccccc-cc Q lcl|NC_016163. 155 ----------NYNGYGFRLSLRSDYDNTYN-------FRTYNLSVTVKDSTGADVVVEGPYIVSF--D---PEAKDK-SR 211 (590) Q Consensus 155 ----------~~~~~~~~~~~~~~~~~~~~-------~~~~~l~i~v~d~~~~~~v~e~~~~ls~--~---~da~~~-~~ 211 (590) ........+ ......... .......+...+..+.....-....... . ..+... .. T Consensus 153 ~a~~~~~~~~~~~~~~~~v--~~~~~~~~~~~~v~~~~~d~~~~~~~~~ta~~~~~~~~~~~~~~~~~~~~~~A~~~g~~ 230 (660) T protein:vir:68 153 KAKEIGEYPELGSNWTAEM--SGSSSGLSAVITIDSVVMDSGILLTEVETSEEAITSLTFQESIKKYGVPGVVALYPGEL 230 (660) T ss_pred cceeeccccccccceeEEe--ecccccceeeeeeccccccccceeeeeccccccccccceeeeecccCcccccccccccc Confidence 000000000 000000000 0000000001111000000000000000 0 000000 00 Q ss_pred ccceeeeeeccccceee--------ecCccccceeeeeecccccccCccccceeccccccccc---ccccccccccceee Q lcl|NC_016163. 212 QSIYYANIINKYSQYVE--------IVDNRSAFETISEFVVGDSEADPQKVDIIFGQERAVTP---AETIHANVVWKSSS 280 (590) Q Consensus 212 ~~~~~~~vv~~~s~~v~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~ 280 (590) .......+......... ................. ........+....+..... ............ . 
T Consensus 231 G~~i~v~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~-~ 307 (660) T protein:vir:68 231 GDQLEIEIVSKADYDKGASAQLKIYPDGGTRYSTAKAIFGYG--PQTDDQYAIIVRRNDSVVQSVVLSTKRGERDIYG-S 307 (660) T ss_pred ccceEEEEeccccccccccccceeeecccccccceeeEeecc--cccccceeeeeecCCcceeeeeeecccccccccc-c Confidence 00000000000000000 00000000000000000 0000000011111100000 000000000000 0 Q ss_pred ccccccccccccccccccee----eeeccccccccccccceeccchhhHHHHHHHhhhccCCceeeecccc-------hh Q lcl|NC_016163. 281 VETDDPSYDATAANFNNIQY----LTEGSEGTWTGGNEESALLVKGYSGVLAPEILDKQQYEIDVLLDGNN-------EV 349 (590) Q Consensus 281 ~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~ 349 (590) .................... ..........+|.++...............+...+...+.+++++.. .. T Consensus 308 ~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~ 387 (660) T protein:vir:68 308 NIFIDDFFAKGASNYIFATAQGWPKGFSGVIKLNGGLSSNETVEAGDLMEAWDLFADRESVNAQLFIAGSCAGESLEVAS 387 (660) T ss_pred ceeeehhhccCcccEEEEeecCCCccccceeeeccccccccccccchhhhHHHHhhhhhccccceeeccccCCCchHHHH Confidence 00000000000000000000 00001122334444332222222223334455556666666665542 34 Q ss_pred HHHHHHHHHHHHhcCCeEEEEecC--------CCCCHHHHHHHHHhhc-------CcccceEEEEcCeEEEeecccCcee Q lcl|NC_016163. 350 AVKNAMSDLCSEQRGDCIAILDCS--------FQGDAQQTIDYRTGNI-------SMSTYFTAIFGQHMNVYDEYNGETI 414 (590) Q Consensus 350 ~~~~a~~~~~~~~~~~~~a~~d~p--------~~~~~~~~~~~~~~~~-------~~~s~~~~~~~p~~~~~d~~~~~~~ 414 (590) +++.+++.||++ +++||+++|+| .+.+.+++.+||+... 
+++|.|+++||||++++|+.+++.+ T Consensus 388 ~v~~~l~~~~~~-~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~ 466 (660) T protein:vir:68 388 TVQKHVVAIGDS-RQDCLVLCSPPRAAVVGIPVNRAVDNLVDWRTASGTYTDNNFNISSTYAAIDGNYKYQYDKYNDVNR 466 (660) T ss_pred HHHHHHHHHHHh-hCCeEEEEcccceeEecCCCCCCHHHHHHHHhhcccccccccccCcceEEEEcCceEEecccCCceE Confidence 788899999976 56899888865 4677899999997532 5789999999999999999999999 Q ss_pred eecHHHHHHHHHHHhhccCCceECcCCcccceeeccccceeecChhHHhhhhhcCceEEEEecCCeEEEecceecCCC-c Q lcl|NC_016163. 415 TVTSTYFLASMIPSNDDQNGIQWTFVGPRRGVISGFTDINFYPNEPWKEKLYLAQVNYIERDPKKISFATQLTSQTSR-S 493 (590) Q Consensus 415 ~~ppsg~~AG~~A~~D~~~G~~~sPan~~~~~i~g~~~~~~~~~~~e~~~Ln~~gIn~i~~~~~~G~~~wG~rT~s~d-~ 493 (590) ++||||++||+|||+|.++||||||||+++.+|.|++++++.++++|++.||++||||||+|+++|+++||+||++++ + T Consensus 467 ~~p~sg~~AGl~Ar~d~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~~~~s 546 (660) T protein:vir:68 467 WVPLAADIAGLCARTDNISQPWMSPAGYNRGQILNVIKLAIETRQAQRDRLYQEAINPVTGTGGDGYVLYGDKTATSVPS 546 (660) T ss_pred EechhHHHHHHHHHHhccCCcEEccCCeeeceeeccceeeecCChhHHHHHhhCCceEEEEecCCeEEEEcceecCCCCc Confidence 999999999999999999999999999999999999999999999999999999999999999999999999999886 4 Q ss_pred ccceehhhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhCCceEE---EecCCCCCHHHhhCCEEEEE Q lcl|NC_016163. 
494 ALSYINNVRVLLRIRREVEKMMADYRQEFQDNTTYDSMSYSLNNYLQQWVANRACSS---ISGTVYASDYDKQQSIARVK 570 (590) Q Consensus 494 ~~~~i~vrR~~~~i~~si~~~~~~~vfepn~~~l~~~v~~~i~~~L~~l~~~ga~~~---~~d~~~nt~~~i~~G~l~~~ 570 (590) +||||||||||+|||++|+++++|+||||||+.||++|+++|+.||++||++|+|.| +||+++||++||++|+|+|+ T Consensus 547 ~~~~i~vrR~~~~i~~si~~~~~~~v~epn~~~~~~~i~~~i~~~L~~l~~~gal~gf~V~~d~~~nt~~~i~~G~~~~~ 626 (660) T protein:vir:68 547 PFDRINVRRLFNMVKTNIGSASKYRLFELNNAFTRSSFRTETSQYLQGIKALGGVYNFKVVCDTTNNTPAVIDRNEFVAT 626 (660) T ss_pred ccceEehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhcCceeeeEEEEecCCCCHHHhhCCeEEEE Confidence 899999999999999999999999999999999999999999999999999999876 59999999999999999999 Q ss_pred EEEEecCccceEEEEEEeeC Q lcl|NC_016163. 571 VELVFTGVIERIAIDLVVNK 590 (590) Q Consensus 571 i~~ap~~paefi~~~~~~~~ 590 (590) |+++|++|||||+|||.|.| T Consensus 627 i~~~p~~pae~i~l~~~~~~ 646 (660) T protein:vir:68 627 FYLQPARSINYITLNFVATA 646 (660) T ss_pred EEEEecCCcceEEEEEEEee Confidence 99999999999999999999 No 9 >protein:vir:7206 Length: 659 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049780;genbank:gi:9632592;genbank:GeneID:1258597 Probab=100.00 E-value=1.2e-127 Score=716.45 Aligned_cols=571 Identities=15% Similarity=0.130 Sum_probs=345.2 Q ss_pred CccccCCceEEEEecCCCceecccccceeEEEeecCCCCCCccEEecCHHHHHHhcCCccccccccHHHHHHHHHcCCC- Q lcl|NC_016163. 1 MADYLHPSVSSRIVDNSAVYATAAGNTVLYAAIHSAIGRDNAVEFVTTTDEFLFKFGNPNLSKYGQTSYNILNWLQSGG- 79 (590) Q Consensus 1 Mp~yl~PGVYveEi~s~~~~i~gv~Tsv~~~vg~~~~Gp~~~p~~v~s~~e~~~~fG~~~~~~~~~l~~av~~ff~nGG- 79 (590) |. ||||||||||+|++++++++ +|++++|+|.++|||+++|++|+||.||+++||+++. 
++++.|+|++||.||| T Consensus 1 ~~-~~~PgVyvee~~~~~~~~~~-~ts~~~fvG~~~~Gp~~~p~~i~s~~~~~~~fG~~~~--~~~~~~~~~~~f~ngg~ 76 (659) T protein:vir:72 1 MT-LLSPGIELKETTVQSTVVNN-STGTAALAGKFQWGPAFQIKQVTNEVDLVNTFGQPTA--ETADYFMSAMNFLQYGN 76 (659) T ss_pred Cc-eecCceEEEEecCCcccccC-CCcceEEEeecCCCCCcccEEecCHHHHHHHcCCcCC--CCchhHHHHHHHHhCCc Confidence 76 99999999999999987765 9999999999999999999999999999999999864 4679999999999999 Q ss_pred cEEEEEEecCC-ccccccccccceeecccccc-cceeeeeeccccccccccceEEE--eeccccCCcce---eeEeeccc Q lcl|NC_016163. 80 TAYVLRVMPDD-AKFANSLISIKTTAAADPAK-ATVLVTAKAQTTNTASKNAMKTI--LSGGTAGETPL---CFIVPKGR 152 (590) Q Consensus 80 ~~~vvRv~~~~-a~~a~~~~~~~~~~a~~~~~-~~~~v~~~~~~~~~a~~~~~~~~--~~~~t~~~~~~---~~~~~~~~ 152 (590) +||||||++.+ ++++.............++. .....+..............++. ...+....... ........ T Consensus 77 ~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~g~v~~~~~~~~~~~~~v~t~~~~a~~~~ 156 (659) T protein:vir:72 77 DLRVVRAVDRDTAKNSSPIAGNIDYTISTPGSNYAVGDKITVKYVSDDIETEGKITEVDADGKIKKINIPTGKNYAKAKE 156 (659) T ss_pred eEEEEEccCCcccccccccccccceeecccccccccceeeeeeeccccccccceEEEeeccccceeeeeccccccccccc Confidence 59999998754 44444433333333222221 11111111111110000000111 11110000000 00000000 Q ss_pred ccccccce--EEEEeeccccccccccccceeeeeecccCCCceeeeeeeeecccccccc--------c-cc----cceee Q lcl|NC_016163. 153 GENYNGYG--FRLSLRSDYDNTYNFRTYNLSVTVKDSTGADVVVEGPYIVSFDPEAKDK--------S-RQ----SIYYA 217 (590) Q Consensus 153 g~~~~~~~--~~~~~~~~~~~~~~~~~~~l~i~v~d~~~~~~v~e~~~~ls~~~da~~~--------~-~~----~~~~~ 217 (590) ...+.... ........... ....+.+........ +... .+.....+... . .. ..+.. 
T Consensus 157 ~~~~~~~~~~~~~~~~~~~~~----~a~~~~~v~v~~~~~--~~~~--~v~~~~~a~~~~~~~~~v~~~~~~~~~a~~~g 228 (659) T protein:vir:72 157 VGEYPTLGSNWTAEISSSSSG----LAAVITLGKIITDSG--ILLA--EIENAEAAMTAVDFQANLKKYGIPGVVALYPG 228 (659) T ss_pred cccccccccceeeEEeecccc----ccceEEEEEeecCcc--eeee--eccccchhhhcccccccccccccceeeecccc Confidence 00000000 00000000000 000000000000000 0000 00000000000 0 00 00000 Q ss_pred eeeccccceeeecCccccceeeeee--ccccccc------------Cccccceecccccccccccc---cccccccceee Q lcl|NC_016163. 218 NIINKYSQYVEIVDNRSAFETISEF--VVGDSEA------------DPQKVDIIFGQERAVTPAET---IHANVVWKSSS 280 (590) Q Consensus 218 ~vv~~~s~~v~~~~~~~~~~~~~~~--~~~~~~~------------~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~ 280 (590) ...+..+..+............... ....... ............+....... ........... T Consensus 229 t~g~~~tv~i~~~~~~~~~~~~~v~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 308 (659) T protein:vir:72 229 ELGDKIEIEIVSKADYAKGASALLPIYPGGGTRASTAKAVFGYGPQTDSQYAIIVRRNDAIVQSVVLSTKRGEKDIYDSN 308 (659) T ss_pred ccccceeEEEccccccccceeeeeecccccccccccceeeeeeecccccccceeeecccceeeeeeeeeccccccccchh Confidence 0000000000000000000000000 0000000 00000000000000000000 00000000000 Q ss_pred cccccccccccccccccceeee---------eccccccccccccceeccchhhHHHHHHHhhhccCCceeeecccc---- Q lcl|NC_016163. 281 VETDDPSYDATAANFNNIQYLT---------EGSEGTWTGGNEESALLVKGYSGVLAPEILDKQQYEIDVLLDGNN---- 347 (590) Q Consensus 281 ~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---- 347 (590) . .......+ ....++. ......+.++.+................+...+..++++++.|+. 
T Consensus 309 ~-----~~~~~~~~-~~~~~v~~~~~~~~~~~~~~~~l~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~p~~~~~~ 382 (659) T protein:vir:72 309 I-----YIDDFFAK-GGSEYIFATAQNWPEGFSGILTLSGGLSSNAEVTAGDLMEAWDFFADRESVDVQLFIAGSCAGES 382 (659) T ss_pred h-----hhhhhhhc-CCceEEEEEecccCCcccccccccccccccccccchhHHHHHHHhhhccccceeEEEecCCCCcc Confidence 0 00000000 0000000 011122334433322222222333445556666777888887763 Q ss_pred ---hhHHHHHHHHHHHHhcCCeEEEEecCC--------CCCHHHHHHHHHhhc-------CcccceEEEEcCeEEEeecc Q lcl|NC_016163. 348 ---EVAVKNAMSDLCSEQRGDCIAILDCSF--------QGDAQQTIDYRTGNI-------SMSTYFTAIFGQHMNVYDEY 409 (590) Q Consensus 348 ---~~~~~~a~~~~~~~~~~~~~a~~d~p~--------~~~~~~~~~~~~~~~-------~~~s~~~~~~~p~~~~~d~~ 409 (590) ..+++.+++.||++ +++||+++|+|. +.+.+++.+||+... +++|+|+++||||++++|+. T Consensus 383 ~~~~~~v~~~l~~~~~~-~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~ 461 (659) T protein:vir:72 383 LETASTVQKHVVSIGDA-RQDCLVLCSPPRETVVGIPVTRAVDNLVNWRTAAGSYTDNNFNISSTYAAIDGNHKYQYDKY 461 (659) T ss_pred hhhhHHHHHHHHHHHhh-hCCEEEEEcCccccccCCCcccCHHHHHHHHhhccccccccccccceeEEEEcCceeecccc Confidence 34688889999976 468999999884 456789999997642 47899999999999999999 Q ss_pred cCceeeecHHHHHHHHHHHhhccCCceECcCCcccceeeccccceeecChhHHhhhhhcCceEEEEecCCeEEEecceec Q lcl|NC_016163. 
410 NGETITVTSTYFLASMIPSNDDQNGIQWTFVGPRRGVISGFTDINFYPNEPWKEKLYLAQVNYIERDPKKISFATQLTSQ 489 (590) Q Consensus 410 ~~~~~~~ppsg~~AG~~A~~D~~~G~~~sPan~~~~~i~g~~~~~~~~~~~e~~~Ln~~gIn~i~~~~~~G~~~wG~rT~ 489 (590) +++.+++||||++||+|||+|.++|+||||||+++.+|.|++++++.++++|++.||++||||||+|+++|+++||+||+ T Consensus 462 ~~~~~~~p~sg~vAGl~Ar~D~~~G~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~g~G~~~wG~rT~ 541 (659) T protein:vir:72 462 NDVNRWVPLAADIAGLCARTDNVSQTWMSPAGYNRGQILNVIKLAIETRQAQRDRLYQEAINPVTGTGGDGYVLYGDKTA 541 (659) T ss_pred CCceEEechHHHHHHHHHHhhccCCcEEccCCeeeceeeccccccccCChhHHHHHhhCCceEEEEecCCeEEEEccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCc-ccceehhhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhCCceEE---EecCCCCCHHHhhCC Q lcl|NC_016163. 490 TSRS-ALSYINNVRVLLRIRREVEKMMADYRQEFQDNTTYDSMSYSLNNYLQQWVANRACSS---ISGTVYASDYDKQQS 565 (590) Q Consensus 490 s~d~-~~~~i~vrR~~~~i~~si~~~~~~~vfepn~~~l~~~v~~~i~~~L~~l~~~ga~~~---~~d~~~nt~~~i~~G 565 (590) ++++ +|+||||||||+|||++|+++++|+||||||+.||++|+++|+.||++||++|+|.+ +||+++||++||++| T Consensus 542 ~~~~s~~~~i~vrR~~~~i~~si~~~~~~~v~e~n~~~l~~~i~~~i~~fL~~l~~~gal~~~~V~~d~~~nt~~~i~~G 621 (659) T protein:vir:72 542 TSVPSPFDRINVRRLFNMLKTNIGRSSKYRLFELNNAFTRSSFRTETAQYLQGNKALGGIYEYRVVCDTTNNTPSVIDRN 621 (659) T ss_pred CCCCcccceEeehhHHHHHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHHHHHHHhcCceeeEEEEEcCCCCCHHHhhCC Confidence 8775 899999999999999999999999999999999999999999999999999998765 799999999999999 Q ss_pred EEEEEEEEEecCccceEEEEEEeeC Q lcl|NC_016163. 
566 IARVKVELVFTGVIERIAIDLVVNK 590 (590) Q Consensus 566 ~l~~~i~~ap~~paefi~~~~~~~~ 590 (590) +|+++|+|+|++|||||+|||.|.| T Consensus 622 ~~~~~i~~~p~~pae~I~~~~~~~~ 646 (659) T protein:vir:72 622 EFVATFYIQPARSINYITLNFVATA 646 (659) T ss_pred eEEEEEEEEecCCccEEEEEEEEee Confidence 9999999999999999999999999 No 10 >protein:vir:101187 Length: 663 # NCBI annotation: tail sheath protein # Family: family:all:661 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932509;genbank:gi:37651635;genbank:GeneID:2610680 Probab=100.00 E-value=5.5e-128 Score=718.25 Aligned_cols=578 Identities=16% Similarity=0.147 Sum_probs=345.2 Q ss_pred CccccCCceEEEEecCCCceecccccceeEEEeecCCCCCCccEEecCHHHHHHhcCCccccccccHHHHHHHHHcCCC- Q lcl|NC_016163. 1 MADYLHPSVSSRIVDNSAVYATAAGNTVLYAAIHSAIGRDNAVEFVTTTDEFLFKFGNPNLSKYGQTSYNILNWLQSGG- 79 (590) Q Consensus 1 Mp~yl~PGVYveEi~s~~~~i~gv~Tsv~~~vg~~~~Gp~~~p~~v~s~~e~~~~fG~~~~~~~~~l~~av~~ff~nGG- 79 (590) |. ||||||||||+ +++++|+||+|++++|+|.++|||+++|++|+||.||++.||++.. +++++|+|++||.||| T Consensus 1 ~~-~~~PgVyv~e~-~~~~~i~~v~ts~~~fvG~~~~Gp~~~p~~i~s~~d~~~~fG~~~~--~~~~~~~v~~~f~ngg~ 76 (663) T protein:vir:10 1 MA-LLSPGIEMKET-SINSTVVRSATGRAAIVGKFAWGPAYEVRQVTNEVELVDMFGSPDN--VTAPYFMSAMNFLQYGN 76 (663) T ss_pred Cc-eecCceEEEEe-cCcccccccCccceeEEeeeccCCCCccEEecCHHHHHHHhCCcCc--cchhHHHHHHHHHhCCC Confidence 76 99999999999 5999999999999999999999999999999999999999999763 4789999999999999 Q ss_pred cEEEEEEecCC-ccccccccccceeecccc---cccceeeeeeccccccccccceEEEeeccccCCcceeeEeecccccc Q lcl|NC_016163. 80 TAYVLRVMPDD-AKFANSLISIKTTAAADP---AKATVLVTAKAQTTNTASKNAMKTILSGGTAGETPLCFIVPKGRGEN 155 (590) Q Consensus 80 ~~~vvRv~~~~-a~~a~~~~~~~~~~a~~~---~~~~~~v~~~~~~~~~a~~~~~~~~~~~~t~~~~~~~~~~~~~~g~~ 155 (590) +||||||++++ ++++.............+ ..+.+.+.....................+. ......+...... 
T Consensus 77 ~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~v~~~~~~~~~~~~~~~~~~n~~----~~~v~~~~a~~~~ 152 (663) T protein:vir:10 77 DLRLVRVIDMEKAKNASPLVNQVSVTITTEGQGYTVGDAITVKYNNATITEAGKVTAVDSDGK----IKSLFVPTAEIIA 152 (663) T ss_pred eEEEEEccCCcccccccccCCcceeeeeccccCccccccccccccccccccccceeeeccCCc----eEEEEeccccccc Confidence 59999998653 444433332222222111 122222222211111100000000000000 0000000000000 Q ss_pred cc---------cceEEEEeeccccc---cc---cccccc-eeeeeecccCCCceeeeeeeeecc-----cccccccc-cc Q lcl|NC_016163. 156 YN---------GYGFRLSLRSDYDN---TY---NFRTYN-LSVTVKDSTGADVVVEGPYIVSFD-----PEAKDKSR-QS 213 (590) Q Consensus 156 ~~---------~~~~~~~~~~~~~~---~~---~~~~~~-l~i~v~d~~~~~~v~e~~~~ls~~-----~da~~~~~-~~ 213 (590) .+ .............. .. ...... ..+...+................. ..+..... .. T Consensus 153 ~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~v~~vv~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~a~~~G~~Gn 232 (663) T protein:vir:10 153 KTRQLGTYPTLGDNWRIDVSGASGGSAAALALGNIVVDSGVTFGNSEDAPAVMTSPAVMEKYAKFGMPLVSAVYPGEIGS 232 (663) T ss_pred cccccceeeeccccceeEeeeccccccccccccceecccceeeEeeccccccccccchhhhcccccceeeeeeccccccc Confidence 00 00000000000000 00 000000 000000000000000000000000 00000000 00 Q ss_pred ceeeeeecccc--ce----eeecCccccceeeeeecccccccCccccceeccccccc---ccccccccccccceeecccc Q lcl|NC_016163. 214 IYYANIINKYS--QY----VEIVDNRSAFETISEFVVGDSEADPQKVDIIFGQERAV---TPAETIHANVVWKSSSVETD 284 (590) Q Consensus 214 ~~~~~vv~~~s--~~----v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~ 284 (590) .....+....+ .. +............... ............+...+... .................... 
T Consensus 233 ~i~v~i~~~~~~~~~~~~~v~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~~~~~~~~~~ 310 (663) T protein:vir:10 233 TVEVEIVSKTAFNSGAQQTIYPFGGTRTSNARGVI--QYGPMTDDQFAIIVRRDGIVVESTVLSTRKGDRDVYGSNIFMD 310 (663) T ss_pred ceeEEecccccccccccccccccccccccccceee--eeccccccceeEEEecCCcceeeeeeeecccccccchhhhhhh Confidence 00000000000 00 0000000000000000 00000000000000000000 00000000000000000000 Q ss_pred ccccccccccccccee----eeeccccccccccccceeccchhhHHHHHHHhhhccCCceeeeccc-------chhHHHH Q lcl|NC_016163. 285 DPSYDATAANFNNIQY----LTEGSEGTWTGGNEESALLVKGYSGVLAPEILDKQQYEIDVLLDGN-------NEVAVKN 353 (590) Q Consensus 285 ~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~ 353 (590) . .............. ........+.+|.++......+........+.+.+...+++++.+. ...+++. T Consensus 311 ~-~~~~~~~~~~~~~~~~~~~~~~~~~~l~gg~d~~~~~~~~~~~~~~~~l~~~~~~~~~~~i~~~~~~~~~~~~~~v~~ 389 (663) T protein:vir:10 311 D-YFRNGGSNFIFASSEGWPAGFTGIIQLGGGTSANADVGADELIKGWDLFSDREALHVNLMIAGACGSDGAEIASTVQK 389 (663) T ss_pred h-hhccCcceEEEEeecccCccccceeEcccccCCCccccchhhHHHHHhhhcccccceeEEEeccCCCCchhhHHHHHH Confidence 0 00000000000000 0001112344555544333333333344455566666676666543 2367889 Q ss_pred HHHHHHHHhcCCeEEEEecCCC--------CCHHHHHHHHHhh----------cCcccceEEEEcCeEEEeecccCceee Q lcl|NC_016163. 354 AMSDLCSEQRGDCIAILDCSFQ--------GDAQQTIDYRTGN----------ISMSTYFTAIFGQHMNVYDEYNGETIT 415 (590) Q Consensus 354 a~~~~~~~~~~~~~a~~d~p~~--------~~~~~~~~~~~~~----------~~~~s~~~~~~~p~~~~~d~~~~~~~~ 415 (590) +++.||++ +++||+++|+|.+ .+.+++.+|++.. 
.+++|+|+++||||++++|+.+++.++ T Consensus 390 al~~~a~~-~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~l~~P~~~~~d~~~~~~~~ 468 (663) T protein:vir:10 390 YVVSLADD-RQDCVAIVNPPAELMVGIPTSTAVKNIVEWRNGMTGSGEVVDNNMNISSTYAFIIGNYKYQYDKYNDINRW 468 (663) T ss_pred HHHHHHHh-hCCEEEEEecCcccccccccccchHHHHHHHHhccccccchhhccccCcceEEEEccceEEecccCCceEE Confidence 99999976 4689999999964 3567889998653 357899999999999999999999999 Q ss_pred ecHHHHHHHHHHHhhccCCceECcCCcccceeeccccceeecChhHHhhhhhcCceEEEEecC-CeEEEecceecCCC-c Q lcl|NC_016163. 416 VTSTYFLASMIPSNDDQNGIQWTFVGPRRGVISGFTDINFYPNEPWKEKLYLAQVNYIERDPK-KISFATQLTSQTSR-S 493 (590) Q Consensus 416 ~ppsg~~AG~~A~~D~~~G~~~sPan~~~~~i~g~~~~~~~~~~~e~~~Ln~~gIn~i~~~~~-~G~~~wG~rT~s~d-~ 493 (590) +||||++||+|||+|.++|+||||||+++++|.|++++++.++++|++.||++|||||++|++ +|+++||+||++++ + T Consensus 469 ~p~s~~vAGl~Ar~D~~~g~~~sPan~~~~~i~g~~~~~~~~~~~~~~~Ln~~gin~i~~~~~~~G~~~wG~rT~s~~~s 548 (663) T protein:vir:10 469 VPLAADIAGLCAYTDQVSHPWMSPAGYRRGQIRNCIKLAIEPKQSMRDTMYQVAINPVTGFAGGDGFVLFGDKMATQVPS 548 (663) T ss_pred echhHHHHHHHHHhhccCCceEccCCceeccccccccceeccChhHHHHHhhCCceEEEEEeCCCcEEEEcccccCCCCc Confidence 999999999999999999999999999999999999999999999999999999999999997 79999999999876 5 Q ss_pred ccceehhhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhCCceEE---EecCCCCCHHHhhCCEEEEE Q lcl|NC_016163. 
494 ALSYINNVRVLLRIRREVEKMMADYRQEFQDNTTYDSMSYSLNNYLQQWVANRACSS---ISGTVYASDYDKQQSIARVK 570 (590) Q Consensus 494 ~~~~i~vrR~~~~i~~si~~~~~~~vfepn~~~l~~~v~~~i~~~L~~l~~~ga~~~---~~d~~~nt~~~i~~G~l~~~ 570 (590) +||||||||||+||+++|+++++|+||||||+.||++|+++|+.||++||++|+|.| +||+++||++||++|+|+|+ T Consensus 549 ~~~~i~vrR~~~~i~~si~~~~~~~v~e~n~~~l~~~i~~~i~~~L~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~~ 628 (663) T protein:vir:10 549 PFDRINVRRLFNMLKKNIGDTSKYELFENNDAFTRQSFRMETSQYLDGIRSLGGCYDFRVVCDTTNNTPNVIDRNEFVGT 628 (663) T ss_pred ccceEehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhcCceeeeEEEEcCCCCCHHHhhCCeEEEE Confidence 899999999999999999999999999999999999999999999999999999866 69999999999999999999 Q ss_pred EEEEecCccceEEEEEEeeC Q lcl|NC_016163. 571 VELVFTGVIERIAIDLVVNK 590 (590) Q Consensus 571 i~~ap~~paefi~~~~~~~~ 590 (590) |+|+|++|||||+|||.|++ T Consensus 629 i~~~p~~pae~i~~~~~~~~ 648 (663) T protein:vir:10 629 IYVKPPRSINYITLNMVATS 648 (663) T ss_pred EEEEecCCcceEEEEEEEee Confidence 99999999999999999999 No 11 >protein:vir:101804 Length: 663 # NCBI annotation: gp18 # Family: family:all:661 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238881;genbank:gi:66391956;genbank:GeneID:3416631 Probab=100.00 E-value=1.1e-126 Score=711.03 Aligned_cols=580 Identities=16% Similarity=0.128 Sum_probs=345.5 Q ss_pred CccccCCceEEEEecCCCceecccccceeEEEeecCCCCCCccEEecCHHHHHHhcCCccccccccHHHHHHHHHcCCC- Q lcl|NC_016163. 1 MADYLHPSVSSRIVDNSAVYATAAGNTVLYAAIHSAIGRDNAVEFVTTTDEFLFKFGNPNLSKYGQTSYNILNWLQSGG- 79 (590) Q Consensus 1 Mp~yl~PGVYveEi~s~~~~i~gv~Tsv~~~vg~~~~Gp~~~p~~v~s~~e~~~~fG~~~~~~~~~l~~av~~ff~nGG- 79 (590) |. ||||||||||++ ++++|+||+|++++|+|.++|||+++|++|+||.||++.||++. 
.+.++.|+|++||.||| T Consensus 1 ~~-~~~Pgvyv~e~~-~~~~i~~~~t~~~~~vG~~~~Gp~~~p~~v~~~~~~~~~fg~~~--~~~~~~~~~~~~f~ngg~ 76 (663) T protein:vir:10 1 MA-LLSPGIEMKETS-INSTVVRSATGRAAIVGKFAWGPAYEVRQVTNEVELVDMFGSPD--NVTAPYFMSAMNFLQYGN 76 (663) T ss_pred Cc-eecCceEEEEec-CCccccccCcccceeEeecccCCCCccEEecCHHHHHHhcCCcC--CcchhHHHHHHHHHhCCC Confidence 76 999999999995 89999999999999999999999999999999999999999976 44689999999999999 Q ss_pred cEEEEEEecCC-ccccccccccceeeccccc---ccceeeeeeccccccccc--------cceEEEeeccccCCcceeeE Q lcl|NC_016163. 80 TAYVLRVMPDD-AKFANSLISIKTTAAADPA---KATVLVTAKAQTTNTASK--------NAMKTILSGGTAGETPLCFI 147 (590) Q Consensus 80 ~~~vvRv~~~~-a~~a~~~~~~~~~~a~~~~---~~~~~v~~~~~~~~~a~~--------~~~~~~~~~~t~~~~~~~~~ 147 (590) +||||||++.. ++++.............++ .+.+.+............ +.........++....... T Consensus 77 ~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~i~v~~~~~~~~~~~~~~~~~~~~~~~~v~~~ta~~~~~~~- 155 (663) T protein:vir:10 77 DLRLVRVIDMEKAKNASPLVNQVSVTITTEGQGYTVGDAITVKYNNATITEAGKVTAVDSDGKIKSLFVPTAEIIAKTR- 155 (663) T ss_pred eEEEEEccCCccccccccccccceeEEeecccccccccccccccccccccccccceeeecccceEEEeecccccccccc- Confidence 59999998643 4444333333222222211 222333222111110000 0000000000000000000 Q ss_pred eeccc-ccccccceEEEEeecccc-cc---ccccc-cceeeeeecccCCCc-eeeeeeeee---ccc-ccccc-ccccce Q lcl|NC_016163. 148 VPKGR-GENYNGYGFRLSLRSDYD-NT---YNFRT-YNLSVTVKDSTGADV-VVEGPYIVS---FDP-EAKDK-SRQSIY 215 (590) Q Consensus 148 ~~~~~-g~~~~~~~~~~~~~~~~~-~~---~~~~~-~~l~i~v~d~~~~~~-v~e~~~~ls---~~~-da~~~-~~~~~~ 215 (590) +.+. ..........+....... .. ..... ....+...+...... ..+...... ... .+... ...... 
T Consensus 156 -~v~~~~~~~~~~~~~~s~~s~~~~~a~~v~~v~~d~~~~v~~~~~a~~~~t~~~~~~~~~~~~~~~i~A~~~G~~Gn~i 234 (663) T protein:vir:10 156 -QLGTYPTLGDNWRIDVSGASGGSAAALALGNIVVDSGVTFGNSEDAPAVMTSPAVMEKYAKFGMPLISAVYPGEIGSTV 234 (663) T ss_pred -ccccceeeccceeeEeeeccCccccccccceeccccceEEeeccccccccccccccccccccccceEEeccCCccccee Confidence 0000 000000000000000000 00 00000 000000000000000 000000000 000 00000 000000 Q ss_pred eeeeecccc------ceeeecCccccceeeeeecccccccCccccceeccccccc---ccccccccccccceeecccccc Q lcl|NC_016163. 216 YANIINKYS------QYVEIVDNRSAFETISEFVVGDSEADPQKVDIIFGQERAV---TPAETIHANVVWKSSSVETDDP 286 (590) Q Consensus 216 ~~~vv~~~s------~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~ 286 (590) ...+....+ ..+........ ...................+...++.. ..................... T Consensus 235 ~V~i~~~~~~~~~~~~~~~~~~~~~~--~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~~~~~~~~~~~- 311 (663) T protein:vir:10 235 EVEIVSKTAFNSGAQQTIYPFGGTRT--SNARGVIQYGPMTDDQFAIIVRRDGIVVESTVLSTRKGDRDVYGSNIFMDD- 311 (663) T ss_pred eeeeccccccccccccceeccccccc--ccccceeecccccccceeeEeecCCcceeeecccccccccccccchhhhhh- Confidence 000000000 00000000000 000000000000000000000000000 000000000000000000000 Q ss_pred cccccccccccce----eeeeccccccccccccceeccchhhHHHHHHHhhhccCCceeeeccc-------chhHHHHHH Q lcl|NC_016163. 287 SYDATAANFNNIQ----YLTEGSEGTWTGGNEESALLVKGYSGVLAPEILDKQQYEIDVLLDGN-------NEVAVKNAM 355 (590) Q Consensus 287 ~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~a~ 355 (590) ............. ........++.+|.++......+........+.+.+...+++++.+. 
...+++.++ T Consensus 312 ~~~~~~~~~~~~~~~~~~~~~~~~~~l~gg~d~~~~~~~~~~~~~~~~l~~~~~~~~~~~i~~~~~~~~~~~~~~v~~~l 391 (663) T protein:vir:10 312 YFRNGGSNFIFASSEGWPAGFTGIIQLGGGTSANADVGADELIKGWDLFSDREALHVNLMIAGACGSDGAEIASTVQKYV 391 (663) T ss_pred hhcCCcceEEEEeecccCccccceeEeccccCCccccchhhhHHHHHhhhcccccceeEEEeccCCCCchhhHHHHHHHH Confidence 0000000000000 00001112444555554433333334444555566666776666553 225688889 Q ss_pred HHHHHHhcCCeEEEEecCCC--------CCHHHHHHHHHhh----------cCcccceEEEEcCeEEEeecccCceeeec Q lcl|NC_016163. 356 SDLCSEQRGDCIAILDCSFQ--------GDAQQTIDYRTGN----------ISMSTYFTAIFGQHMNVYDEYNGETITVT 417 (590) Q Consensus 356 ~~~~~~~~~~~~a~~d~p~~--------~~~~~~~~~~~~~----------~~~~s~~~~~~~p~~~~~d~~~~~~~~~p 417 (590) +.+|++ +++||+++|+|.+ .+.+++.+|++.. .+++|+|+++||||++++|+.+++.+++| T Consensus 392 ~~~a~~-~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p 470 (663) T protein:vir:10 392 VSLADD-RQDCVAIVNPPAELMVGIPTSTAVKNIVEWRNGMTGSGEVVDNNMNISSTYAFIIGNYKYQYDKYNDINRWVP 470 (663) T ss_pred HHHHHh-hCCEEEEEecCcccccccccccchHHHHHHHHhccccccchhhhcccCccceEEEcCceEEecccCCceEEec Confidence 999976 5689999999964 3567888898653 35789999999999999999999999999 Q ss_pred HHHHHHHHHHHhhccCCceECcCCcccceeeccccceeecChhHHhhhhhcCceEEEEecC-CeEEEecceecCCC-ccc Q lcl|NC_016163. 
418 STYFLASMIPSNDDQNGIQWTFVGPRRGVISGFTDINFYPNEPWKEKLYLAQVNYIERDPK-KISFATQLTSQTSR-SAL 495 (590) Q Consensus 418 psg~~AG~~A~~D~~~G~~~sPan~~~~~i~g~~~~~~~~~~~e~~~Ln~~gIn~i~~~~~-~G~~~wG~rT~s~d-~~~ 495 (590) |||++||+|||+|.++|+||||||+++++|.|++++++.+++.|++.||++||||||+|++ +|+++||+||++++ ++| T Consensus 471 ~sg~vAGl~Ar~D~~~g~~~sPan~~~~~i~g~~~~~~~~~~~e~~~Ln~~gin~i~~~~~~~G~~~wG~rT~~~~~s~~ 550 (663) T protein:vir:10 471 LAADIAGLCAYTDQVSHPWMSPAGYRRGQIRNCIKLAIEPKQSMRDTMYQVAINPVTGFAGGDGFVLFGDKMATQVPSPF 550 (663) T ss_pred hhHHHHHHHHHhhccCCceEccCCceeccccccccceeecChhHHHHHhhCCceEEEEEeCCCcEEEEcccccCCCCccc Confidence 9999999999999999999999999999999999999999999999999999999999997 79999999999876 599 Q ss_pred ceehhhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhCCceEE---EecCCCCCHHHhhCCEEEEEEE Q lcl|NC_016163. 496 SYINNVRVLLRIRREVEKMMADYRQEFQDNTTYDSMSYSLNNYLQQWVANRACSS---ISGTVYASDYDKQQSIARVKVE 572 (590) Q Consensus 496 ~~i~vrR~~~~i~~si~~~~~~~vfepn~~~l~~~v~~~i~~~L~~l~~~ga~~~---~~d~~~nt~~~i~~G~l~~~i~ 572 (590) |||||||||+|||++|+++++|+||||||+.||++|+++|+.||++||++|+|.| +||+++||++||++|+|+++|+ T Consensus 551 ~~i~vrR~~~~i~~si~~~~~~~v~epn~~~l~~~i~~~i~~~L~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~~i~ 630 (663) T protein:vir:10 551 DRINVRRLFNMLKKNIGDTSKYELFENNDAFTRQSFRMETSQYLDGIRSLGGCYDFRVVCDTTNNTPNVIDRNEFVGTIY 630 (663) T ss_pred ceEehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEcCCCCCHHHhhCCeEEEEEE Confidence 9999999999999999999999999999999999999999999999999999866 6999999999999999999999 Q ss_pred EEecCccceEEEEEEeeC Q lcl|NC_016163. 
573 LVFTGVIERIAIDLVVNK 590 (590) Q Consensus 573 ~ap~~paefi~~~~~~~~ 590 (590) |+|++|+|||+|||.|++ T Consensus 631 ~~p~~pae~i~~~~~~~~ 648 (663) T protein:vir:10 631 VKPPRSINYITLNMVATS 648 (663) T ss_pred EEecCCcceEEEEEEEee Confidence 999999999999999999 No 12 >protein:vir:6594 Length: 666 # NCBI annotation: tail sheath protein # Family: family:all:661 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891725;genbank:gi:33620670;genbank:GeneID:1725344 Probab=100.00 E-value=3.2e-126 Score=708.58 Aligned_cols=576 Identities=14% Similarity=0.126 Sum_probs=341.6 Q ss_pred CccccCCceEEEEecCCCceecccccceeEEEeecCCCCCCccEEecCHHHHHHhcCCccccccccHHHHHHHHHcCCC- Q lcl|NC_016163. 1 MADYLHPSVSSRIVDNSAVYATAAGNTVLYAAIHSAIGRDNAVEFVTTTDEFLFKFGNPNLSKYGQTSYNILNWLQSGG- 79 (590) Q Consensus 1 Mp~yl~PGVYveEi~s~~~~i~gv~Tsv~~~vg~~~~Gp~~~p~~v~s~~e~~~~fG~~~~~~~~~l~~av~~ff~nGG- 79 (590) |. ||||||||||++ ++++|+||+|++++|+|.++|||+++|++|+||.||++.||+++ .++++.|+|++||.||| T Consensus 1 ~~-~~~PgVyv~e~~-~~~~i~~v~ts~~~fvG~~~~Gp~~~p~~v~s~~~~~~~fG~~~--~~~~~~~~~~~~f~ngg~ 76 (666) T protein:vir:65 1 MT-LLSPGFETKETT-LSTTIVQSETGRAALVGKFQWGPAFQIIQVTNEVELVNKFGQPD--NNTADYFMSGANFLQYGN 76 (666) T ss_pred Cc-eecCceEEEEec-CcccccccCcccceEEecccCCCCccCEEecCHHHHHHHcCCcc--ccchhHHHHHHHHHhcCc Confidence 75 999999999995 78899999999999999999999999999999999999999986 44779999999999999 Q ss_pred cEEEEEEecC-Cccccccccccceeecccc---cccceeeeeeccccccccccceEEEeeccccCCc---ceeeEe-ecc Q lcl|NC_016163. 80 TAYVLRVMPD-DAKFANSLISIKTTAAADP---AKATVLVTAKAQTTNTASKNAMKTILSGGTAGET---PLCFIV-PKG 151 (590) Q Consensus 80 ~~~vvRv~~~-~a~~a~~~~~~~~~~a~~~---~~~~~~v~~~~~~~~~a~~~~~~~~~~~~t~~~~---~~~~~~-~~~ 151 (590) +||||||++. +++++..........+..+ ..+.+.+.....................+..... ...... ... 
T Consensus 77 ~~~vvrv~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~~~~V~~~~~~~~~~~~~~~~~~~~~~~g~~~~t~~~~~~~~~ 156 (666) T protein:vir:65 77 DLRVVRVLNKEKAKNATALAGNVEFEITNEGSNYEVGDTIKIKHNRQDIETAGKVTKVDGDGKVKGVFIPTGKIIAHAKA 156 (666) T ss_pred eEEEEEccCcccccccccccCceeeeEeeccccccccceEEEEecccccccccccccccccccccccccccceeeccccc Confidence 5999999865 3444444333333322222 2233333332211110000000000000000000 000000 000 Q ss_pred ccc---ccccceEEEEeecc-ccccccc----cccceeeeeecccC---CCceee-eeeeeeccc-ccccccc-ccceee Q lcl|NC_016163. 152 RGE---NYNGYGFRLSLRSD-YDNTYNF----RTYNLSVTVKDSTG---ADVVVE-GPYIVSFDP-EAKDKSR-QSIYYA 217 (590) Q Consensus 152 ~g~---~~~~~~~~~~~~~~-~~~~~~~----~~~~l~i~v~d~~~---~~~v~e-~~~~ls~~~-da~~~~~-~~~~~~ 217 (590) .+. ........+..... ....... ..........+... ...... ......... .+..... ....-. T Consensus 157 ~g~~~~l~~~~~~~~~~~~~~~~~a~sv~~~~~~g~~~~~~~~~a~~~~~~~~~~~~~~~~~~~a~~A~~~g~~g~~i~v 236 (666) T protein:vir:65 157 IGVYPELDGGWTAEFTSSSGNGSAALSVTKIVTDSGLLLTDLETSRANITNQTFLTKLKKYDMPAVSAIYAGEIGNSLEV 236 (666) T ss_pred cCcceeEeeccceeecccCcccccceeeeecccccceeeeeecccccccccccccccccccccceeeeeeccccccceeE Confidence 000 00000000000000 0000000 00000000000000 000000 000000000 0000000 000000 Q ss_pred eeecccc-----cee--eecCccccceeeeeecccccccCccccceecccccccccc---cccccccccceee------- Q lcl|NC_016163. 218 NIINKYS-----QYV--EIVDNRSAFETISEFVVGDSEADPQKVDIIFGQERAVTPA---ETIHANVVWKSSS------- 280 (590) Q Consensus 218 ~vv~~~s-----~~v--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~------- 280 (590) .+..... ..+ ...+..... ................++....+..... ............. 
T Consensus 237 ~i~~~~~~~~~~~~l~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~v~~~g~~~e~~~~~~~~~~~~~~~~~~~~~~~~ 313 (666) T protein:vir:65 237 EILARSAFKNTAPDLTMYPYGGERTA---ARNLIPYAPQNDNQYAFIVRRDGVVVESYVLSTLKGDKDVYGNSIYMDDFF 313 (666) T ss_pred Eeeccccccccccccccccccccccc---ceeeecccccccccceeeeecCCcccceeecccCcccccccchhhhhhhhh Confidence 0000000 000 000000000 0000000000000011111111100000 0000000000000 Q ss_pred -cccccccccccccccccceeeeeccccccccccccc---------eeccchhhHHHHHHHhhhccCCceeeecccc--- Q lcl|NC_016163. 281 -VETDDPSYDATAANFNNIQYLTEGSEGTWTGGNEES---------ALLVKGYSGVLAPEILDKQQYEIDVLLDGNN--- 347 (590) Q Consensus 281 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--- 347 (590) .+................... ..+.++.+.. .....+ ....+..+...+...+++++.|+. T Consensus 314 ~~~~~~~v~~~~~~~~~~~~~~-----~~~~~g~~~~~~~~~~~g~~~~~~~-~~~~~~~~~~~~~~~~~~l~~p~~~~~ 387 (666) T protein:vir:65 314 ARGSSQYIYATAQGWVDGFSGI-----ISLAGGVSANEATTGGVGADPFIGA-MMQGWDLFAERESIHVNLLIAGACAGE 387 (666) T ss_pred cccccceeeeecccccccccce-----EEccCCCCcCccccccccccccccc-HHHHHHHHhhhhhccCCceeecCcCCc Confidence 000000000000000000000 0111111111 111111 112333444445556777776643 Q ss_pred ---hhHHHHHHHHHHHHhcCCeEEEEecC--------CCCCHHHHHHHHHhhc-------CcccceEEEEcCeEEEeecc Q lcl|NC_016163. 348 ---EVAVKNAMSDLCSEQRGDCIAILDCS--------FQGDAQQTIDYRTGNI-------SMSTYFTAIFGQHMNVYDEY 409 (590) Q Consensus 348 ---~~~~~~a~~~~~~~~~~~~~a~~d~p--------~~~~~~~~~~~~~~~~-------~~~s~~~~~~~p~~~~~d~~ 409 (590) ..+++.+++.+|+++ ++||+++|+| ++.+++++.+||+.+. +++|.|+++||||++++|+. 
T Consensus 388 ~~~~~~v~~~l~~~~~~~-~~~~a~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~ 466 (666) T protein:vir:65 388 GDAFSTVQKHAVSIGDER-QDCLVMVSPPRSTVVNIPVTTAIDNLIAWREGSGNYNENNMNINTTYAVIDGNYKYQYDKY 466 (666) T ss_pred cchhHHHHHHHHHHHhhc-cceEEEeccccceeeecCCCCCHHHHHHHHHhcccccccccccCcceEEEEcCceEEeccc Confidence 468889999999764 6899888876 4578899999998643 47899999999999999999 Q ss_pred cCceeeecHHHHHHHHHHHhhccCCceECcCCcccceeeccccceeecChhHHhhhhhcCceEEEEecCCeEEEecceec Q lcl|NC_016163. 410 NGETITVTSTYFLASMIPSNDDQNGIQWTFVGPRRGVISGFTDINFYPNEPWKEKLYLAQVNYIERDPKKISFATQLTSQ 489 (590) Q Consensus 410 ~~~~~~~ppsg~~AG~~A~~D~~~G~~~sPan~~~~~i~g~~~~~~~~~~~e~~~Ln~~gIn~i~~~~~~G~~~wG~rT~ 489 (590) +++.+++||||++||+|||+|.++||||||||+++.+|.|++++++.+++.|++.||++||||||+|+++|+++||+||+ T Consensus 467 ~~~~~~~p~sg~vAGl~Ar~D~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~ 546 (666) T protein:vir:65 467 NDVNRWVPLAADIAGLCARTDAVSQPWMSPAGYNRGQIMNVVKLAIEPRKAHRDRLYQAAINPVIGAGGEGFILMGDKTA 546 (666) T ss_pred CCceeEechHHHHHHHHHHHhccCCcEEccCCeecceeeccccceeecChhHHHhhhhCCceEEEEeCCCeEEEEecccC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCC-cccceehhhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhCCceEE---EecCCCCCHHHhhCC Q lcl|NC_016163. 
490 TSR-SALSYINNVRVLLRIRREVEKMMADYRQEFQDNTTYDSMSYSLNNYLQQWVANRACSS---ISGTVYASDYDKQQS 565 (590) Q Consensus 490 s~d-~~~~~i~vrR~~~~i~~si~~~~~~~vfepn~~~l~~~v~~~i~~~L~~l~~~ga~~~---~~d~~~nt~~~i~~G 565 (590) +++ ++|+||||||||+||+++|+++++|+||||||+.||++|+++|+.||++||++|+|.| +||+++||++||++| T Consensus 547 ~~~~s~~~~i~vrR~~~~i~~si~~~~~~~v~epn~~~l~~~i~~~i~~~L~~l~~~gal~g~~V~~d~~~nt~~~i~~G 626 (666) T protein:vir:65 547 TTVPSPFDRINVRRLFNMLKKNIGDSSKYKLFENNDNFTRASFRMEVSQYLSTIRSLGGIYDFRVQCDTTNNTPDVIDRN 626 (666) T ss_pred CCCCcccceEehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEcCCCCCHHHhhCC Confidence 875 5899999999999999999999999999999999999999999999999999999876 699999999999999 Q ss_pred EEEEEEEEEecCccceEEEEEEeeC Q lcl|NC_016163. 566 IARVKVELVFTGVIERIAIDLVVNK 590 (590) Q Consensus 566 ~l~~~i~~ap~~paefi~~~~~~~~ 590 (590) +|+++|+|+|++|||||+|||.|.+ T Consensus 627 ~~~~~i~~~p~~pae~i~~~~~~~~ 651 (666) T protein:vir:65 627 EFVASMFIKPAKSINYIMLNFTAVA 651 (666) T ss_pred eEEEEEEEEecCCcceEEEEEEEee Confidence 9999999999999999999999999 No 13 >protein:vir:80984 Length: 666 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469499;genbank:gi:157311456;genbank:GeneID:5602117 Probab=100.00 E-value=6e-126 Score=707.11 Aligned_cols=573 Identities=14% Similarity=0.121 Sum_probs=337.1 Q ss_pred CccccCCceEEEEecCCCceecccccceeEEEeecCCCCCCccEEecCHHHHHHhcCCccccccccHHHHHHHHHcCCC- Q lcl|NC_016163. 1 MADYLHPSVSSRIVDNSAVYATAAGNTVLYAAIHSAIGRDNAVEFVTTTDEFLFKFGNPNLSKYGQTSYNILNWLQSGG- 79 (590) Q Consensus 1 Mp~yl~PGVYveEi~s~~~~i~gv~Tsv~~~vg~~~~Gp~~~p~~v~s~~e~~~~fG~~~~~~~~~l~~av~~ff~nGG- 79 (590) |. ||||||||||+ ++.++|+||+|++++|+|.++|||+++|++|+||.||++.||+++. 
++++.|+|.+||.||| T Consensus 1 ~~-~~~Pgvyv~e~-~~~~~~~~~~t~~~~~vg~~~~gp~~~p~~i~~~~~~~~~fg~~~~--~~~~~~~~~~~f~~~g~ 76 (666) T protein:vir:80 1 MT-LLSPGFETKET-TLSTTIVQSATGRAALVGKFQWGPAFQIIQVTNEVELVNKFGQPDN--NTADYFMSGANFLQYGN 76 (666) T ss_pred Cc-eecCceEEEEe-cCCccccccCcccceEEeccccCCCccceEecCHHHHHHhcCCccC--ccchHHHHHHHHhcCCC Confidence 76 99999999999 4788999999999999999999999999999999999999999863 4678999999999999 Q ss_pred cEEEEEEecC-Cccccccccccceeecccccc---cceeeeeeccccccccccceEEEeeccccC----CcceeeEeecc Q lcl|NC_016163. 80 TAYVLRVMPD-DAKFANSLISIKTTAAADPAK---ATVLVTAKAQTTNTASKNAMKTILSGGTAG----ETPLCFIVPKG 151 (590) Q Consensus 80 ~~~vvRv~~~-~a~~a~~~~~~~~~~a~~~~~---~~~~v~~~~~~~~~a~~~~~~~~~~~~t~~----~~~~~~~~~~~ 151 (590) +||||||++. +++++.............++. +...+.....................+... ........... T Consensus 77 ~~~v~R~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~i~~~~~~~~~~~~ta~~~~~a~~ 156 (666) T protein:vir:80 77 DLRVVRVLNKEKAKNATALAGNIEFEITNEGSNYEVGDTIKIKHNRQDIETAGKVTKVDGDGKVKGVFIPTGKIIAHAKA 156 (666) T ss_pred eEEEEEecCccccccccccccceeEEEeeccccccccccccccccCcccccCcceEEEeecceeeeeecchhhhcccccc Confidence 5999999864 555655544443333322221 111111110000000000000000000000 00000000000 Q ss_pred cccc---cccceEEEEeeccccc----ccccc-ccceeeeeeccc-CCCceeeeeeeeec--c---cccccccc-cccee Q lcl|NC_016163. 152 RGEN---YNGYGFRLSLRSDYDN----TYNFR-TYNLSVTVKDST-GADVVVEGPYIVSF--D---PEAKDKSR-QSIYY 216 (590) Q Consensus 152 ~g~~---~~~~~~~~~~~~~~~~----~~~~~-~~~l~i~v~d~~-~~~~v~e~~~~ls~--~---~da~~~~~-~~~~~ 216 (590) .+.. ................ ..... .........+.. ...+.. .+..... . ..+..... ..... 
T Consensus 157 ~~~~~~v~~~~~~~~~~~~~~~~~a~~V~~~~~~~~~~~~~~~~a~~~~t~~-~~~~~~~~~~~~a~~a~~~g~~g~~l~ 235 (666) T protein:vir:80 157 IGVYPELDGDWTAEFTSSSGNGSAALSVTKIVTDSGLLLTDLETSRANITNQ-TFLTKLQKYDMPAVSAIYAGEIGNSLE 235 (666) T ss_pred ccccceeeccceeeeccccccceeeeeeeeeecCCccceeeecccccccccc-ccccccccccchhhhhhccccccccee Confidence 0000 0000000000000000 00000 000000000000 000000 0000000 0 00000000 00000 Q ss_pred eeeeccc-----ccee--eecCccc--cceeeeeecccccccCccccceecccccccc---cccccccccccceeec--- Q lcl|NC_016163. 217 ANIINKY-----SQYV--EIVDNRS--AFETISEFVVGDSEADPQKVDIIFGQERAVT---PAETIHANVVWKSSSV--- 281 (590) Q Consensus 217 ~~vv~~~-----s~~v--~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~--- 281 (590) ..+.... ...+ ...+... ......... ........+....+... ................ T Consensus 236 v~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~v~~~g~~~e~~~~~~~~~~~~~~~~~~~~~ 310 (666) T protein:vir:80 236 VEILARSAFKNTAPDLTMYPYGGERTAARNLIPYAP-----QNDNQYAFIVRRDGVVVESYVLSTLKGDKDVYGNSIYMD 310 (666) T ss_pred eeeccccccccccccceeeeccccccccceeeeecc-----ccccceeeEeccCCccceeeecccccccccccchhhhhh Confidence 0000000 0000 0000000 000000000 00000001111110000 0000000000000000 Q ss_pred --ccccc---cccccccccccceeeeeccccccccccccc---------eeccchhhHHHHHHHhhhccCCceeeecccc Q lcl|NC_016163. 282 --ETDDP---SYDATAANFNNIQYLTEGSEGTWTGGNEES---------ALLVKGYSGVLAPEILDKQQYEIDVLLDGNN 347 (590) Q Consensus 282 --~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 347 (590) ..... ................ .+.++.+.. +....+.. .....+...+..++++++.++. 
T Consensus 311 ~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~g~~~~~~~~~~~~~~~~~g~~~-~~~~~~~~~~~~~~~~l~~p~~ 384 (666) T protein:vir:80 311 DFFGRGSSQYIYATAQGWVDGFSGII-----SLAGGVSANEATTGGVGADPFIGAMM-QGWGLFAERESIHVNLLIAGAC 384 (666) T ss_pred hhhccccceeeeecccccccccceEE-----EecCCCCcccccccccccccccccch-hhhhhhhhhcccccceEeecCc Confidence 00000 0000000000000000 011111110 01111111 1122233334455667776653 Q ss_pred ------hhHHHHHHHHHHHHhcCCeEEEEecC--------CCCCHHHHHHHHHhhc-------CcccceEEEEcCeEEEe Q lcl|NC_016163. 348 ------EVAVKNAMSDLCSEQRGDCIAILDCS--------FQGDAQQTIDYRTGNI-------SMSTYFTAIFGQHMNVY 406 (590) Q Consensus 348 ------~~~~~~a~~~~~~~~~~~~~a~~d~p--------~~~~~~~~~~~~~~~~-------~~~s~~~~~~~p~~~~~ 406 (590) ..+++.+++.||++ +++||+++|+| ++.+++++.+||+..+ +++|.|+++||||++++ T Consensus 385 ~~~~~~~~~v~~~~~~~~~~-~~~~~~~~~~~~~~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~l~~p~~~~~ 463 (666) T protein:vir:80 385 AGEGDAFSTVQKHAVSIGDE-RQDCLVMVSPPRSTVVNIPVTTAIDNLIAWREGSGNYNENNMNINTTYAVIDGNYKYQY 463 (666) T ss_pred CCcccchHHHHHHHHHHHHh-hcceEEEeecceeEEeecCCCCCHHHHHHHHHhcccchhhhcccCcceEEEEcCceEEe Confidence 46788899999976 46787777654 5678999999997643 57899999999999999 Q ss_pred ecccCceeeecHHHHHHHHHHHhhccCCceECcCCcccceeeccccceeecChhHHhhhhhcCceEEEEecCCeEEEecc Q lcl|NC_016163. 
407 DEYNGETITVTSTYFLASMIPSNDDQNGIQWTFVGPRRGVISGFTDINFYPNEPWKEKLYLAQVNYIERDPKKISFATQL 486 (590) Q Consensus 407 d~~~~~~~~~ppsg~~AG~~A~~D~~~G~~~sPan~~~~~i~g~~~~~~~~~~~e~~~Ln~~gIn~i~~~~~~G~~~wG~ 486 (590) |+.+++.+++||||++||+|||+|.++||||||||+++++|.|++++++.+++.|++.||++||||||+|+++|+++||+ T Consensus 464 d~~~~~~~~~p~sg~~AGl~Ar~D~~~g~~~sPan~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~g~G~~~wG~ 543 (666) T protein:vir:80 464 DKYNDVNRWVPLAADIAGLCARTDAVSQPWMSPAGYNRGQIMNVVKLAIEPRKAHRDRLYQAAINPVIGAGGEGFILMGD 543 (666) T ss_pred cccCCceeEechHHHHHHHHHHHhhcCCceEccCCeecceeeccccceeecChhHHHhhhhCCeeEEEEeCCCeEEEEcc Confidence 99999999999999999999999999999999999999899999999999999999999999999999999999999999 Q ss_pred eecCCC-cccceehhhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhCCceEE---EecCCCCCHHHh Q lcl|NC_016163. 487 TSQTSR-SALSYINNVRVLLRIRREVEKMMADYRQEFQDNTTYDSMSYSLNNYLQQWVANRACSS---ISGTVYASDYDK 562 (590) Q Consensus 487 rT~s~d-~~~~~i~vrR~~~~i~~si~~~~~~~vfepn~~~l~~~v~~~i~~~L~~l~~~ga~~~---~~d~~~nt~~~i 562 (590) ||++++ ++||||||||||+|||++|+++++|+||||||+.||++|+++|+.||++||++|+|.| +||+++||++|| T Consensus 544 rT~~~~~s~~~~i~vRRl~~~i~~si~~~~~~~v~epn~~~l~~~i~~~i~~~L~~l~~~gal~g~~V~~d~~~nt~~di 623 (666) T protein:vir:80 544 KTATTVPSPFDRINVRRLFNMLKKNIGDSSKYKLFENNDNFTRASFRMEVSQYLSTIRSLGGIYDFRVQCDTTNNTPDVI 623 (666) T ss_pred ccCCCCCcccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhcCceeeeEEEEcCCCCCHHHh Confidence 999876 5999999999999999999999999999999999999999999999999999999876 699999999999 Q ss_pred hCCEEEEEEEEEecCccceEEEEEEeeC Q lcl|NC_016163. 
563 QQSIARVKVELVFTGVIERIAIDLVVNK 590 (590) Q Consensus 563 ~~G~l~~~i~~ap~~paefi~~~~~~~~ 590 (590) ++|+|+|+|+|+|++|||||+|||.|.+ T Consensus 624 ~~G~~~~~i~~~P~~Pae~I~~~~~~~~ 651 (666) T protein:vir:80 624 DRNEFVASMFIKPAKSINYIMLNFTAVA 651 (666) T ss_pred hCCeEEEEEEEEecCCcceEEEEEEEee Confidence 9999999999999999999999999999 No 14 >protein:vir:5663 Length: 671 # NCBI annotation: tail sheath protein # Family: family:all:661 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899602;genbank:gi:34419589;genbank:GeneID:2545776 Probab=100.00 E-value=7.8e-126 Score=706.47 Aligned_cols=569 Identities=15% Similarity=0.121 Sum_probs=342.8 Q ss_pred CccccCCceEEEEecCCCceecccccceeEEEeecCCCCCCccEEecCHHHHHHhcCCccccccccHHHHHHHHHcCCCc Q lcl|NC_016163. 1 MADYLHPSVSSRIVDNSAVYATAAGNTVLYAAIHSAIGRDNAVEFVTTTDEFLFKFGNPNLSKYGQTSYNILNWLQSGGT 80 (590) Q Consensus 1 Mp~yl~PGVYveEi~s~~~~i~gv~Tsv~~~vg~~~~Gp~~~p~~v~s~~e~~~~fG~~~~~~~~~l~~av~~ff~nGG~ 80 (590) |. ||||||||||++ ++++|+||+|++++|+|.++|||+++|++|+||.||++.||+++. +++++|+|++||.|||+ T Consensus 1 ~~-~~~Pgvyv~e~~-~~~~i~~v~t~~~~fvG~~~~Gp~~~p~~v~s~~~~~~~fG~~~~--~~~~~~~v~~~f~ngg~ 76 (671) T protein:vir:56 1 MT-LLSPGIENKEIN-LASAIGRAATGRAAMVGKFEWGPAYSITQVTSESDLVTIFGRPND--YTAASFMTANNFLKYGN 76 (671) T ss_pred Cc-eecCceEEEeec-CcccccccCcccceEEecccCCCCccCEEcCCHHHHHHHcCCcCC--CcchhHHHHHHHHhcCC Confidence 76 999999999995 899999999999999999999999999999999999999999864 47799999999999995 Q ss_pred -EEEEEEecCC-ccccccccccce--eecccccccceeeeeeccccccccccc-eEEEee--ccccC--Cc---ceeeE- Q lcl|NC_016163. 81 -AYVLRVMPDD-AKFANSLISIKT--TAAADPAKATVLVTAKAQTTNTASKNA-MKTILS--GGTAG--ET---PLCFI- 147 (590) Q Consensus 81 -~~vvRv~~~~-a~~a~~~~~~~~--~~a~~~~~~~~~v~~~~~~~~~a~~~~-~~~~~~--~~t~~--~~---~~~~~- 147 (590) ||||||++.+ ++++........ ..+.....+.+.+.............. ...... .+... .. ..... 
T Consensus 77 ~~~vvrv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~v~~~ 156 (671) T protein:vir:56 77 DLRLVRICDATTAQNATPLYNAVEYTIGASNGCVVGDDITITYSGVGALTAKGKVLEVDAGNNNAASKIFLPSAEIVAAA 156 (671) T ss_pred eEEEEEecCccccccchhhccccccccccCcceeeceeeeeecCcccccccCcceeEEeeeccceeeeeeccceeEEEee Confidence 9999998754 333322221111 111222222223332222111111100 000000 00000 00 00000 Q ss_pred eecccc---------cccccceEEEEeeccccccccccccceeeee---ecccCCCceeeeeeeeecc---ccccccccc Q lcl|NC_016163. 148 VPKGRG---------ENYNGYGFRLSLRSDYDNTYNFRTYNLSVTV---KDSTGADVVVEGPYIVSFD---PEAKDKSRQ 212 (590) Q Consensus 148 ~~~~~g---------~~~~~~~~~~~~~~~~~~~~~~~~~~l~i~v---~d~~~~~~v~e~~~~ls~~---~da~~~~~~ 212 (590) ...... ........... ..... ...... ........+......++.. .....+... T Consensus 157 ~~~~~~~~~~~~t~~~~~~~~~v~~~-~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 227 (671) T protein:vir:56 157 KSDGNYPSVGTITLQPTQGDIALTNI-EIIDT--------GSVYFPNIELAFDALTAIETEGGALKYADLIEKQGFPRLS 227 (671) T ss_pred eccccccccccccccccccceeeeee-ccccc--------ceEEEeccccccccccccccccccccchhhhhcccccccc Confidence 000000 00000000000 00000 000000 0000000000000000000 000000000 Q ss_pred -------cce-eeeeeccccce-eeecCccccceeeeeecccccccCc--------------cccceeccccccccc--- Q lcl|NC_016163. 213 -------SIY-YANIINKYSQY-VEIVDNRSAFETISEFVVGDSEADP--------------QKVDIIFGQERAVTP--- 266 (590) Q Consensus 213 -------~~~-~~~vv~~~s~~-v~~~~~~~~~~~~~~~~~~~~~~~~--------------~~~~~~~~~~~~~~~--- 266 (590) ... ...++...... ......................... .....+....+.... 
T Consensus 228 a~~~g~~g~~~~v~v~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~~~ 307 (671) T protein:vir:56 228 ARYVGDFGDAISVEIINYADYQTAFAFAAGHTLGDIELPIYPDGGTRSINLSSYFTFGPSNSNQYAVIVRVSGEVEEAFI 307 (671) T ss_pred cccccccCcceEEEEecccccccccccccceeeeeccccccccccccccccceeecccccccccceeEEeecCccceeEE Confidence 000 00000000000 0000000000000000000000000 000000000000000 Q ss_pred ccccccccccceeecccccccccccccccccceeee--------eccccccccccccceeccchhhHHHHHHHhhhccCC Q lcl|NC_016163. 267 AETIHANVVWKSSSVETDDPSYDATAANFNNIQYLT--------EGSEGTWTGGNEESALLVKGYSGVLAPEILDKQQYE 338 (590) Q Consensus 267 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 338 (590) ............... ........ ....... ......+.++.+... ........+..+.+..... T Consensus 308 ~~~~~~~~~~~~~~~--~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~gg~d~~~--~~~~~~~~~~~~~~~~~~~ 379 (671) T protein:vir:56 308 VSTNPGDKDVNGQSI--FIDEYFEN----SGSAYITAIAEGWKTESGAYNFGGGSDANA--GADDWMFGLDMLSDPEVLY 379 (671) T ss_pred Eeecccccccchhhh--hhhhhhcc----cCceEEEecCcccCCccccccccCcccccc--chhHHHHHHHhhhhccccc Confidence 000000000000000 00000000 0000000 001112223333221 1122233445566667777 Q ss_pred ceeeecccc-------hhHHHHHHHHHHHHhcCCeEEEEecCCC--------CCHHHHHHHHHhh-----------cCcc Q lcl|NC_016163. 339 IDVLLDGNN-------EVAVKNAMSDLCSEQRGDCIAILDCSFQ--------GDAQQTIDYRTGN-----------ISMS 392 (590) Q Consensus 339 ~~~~~~~~~-------~~~~~~a~~~~~~~~~~~~~a~~d~p~~--------~~~~~~~~~~~~~-----------~~~~ 392 (590) ++++++++. ...++.+++.+|++.+++||+++|+|.. .+.+++.+|+... 
.+++ T Consensus 380 ~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 459 (671) T protein:vir:56 380 TNLVIAGNAAAEEVSIASTVQKYAIDSVGNVRQDCVVFVSPPQSLIVNKQAGTAVANIQGWRTGIDPTNGQAVVDNLNVS 459 (671) T ss_pred eeEEEcCCCCCccchhHHHHHHHHHHHHHhhcCCEEEEEeccccccccccccccHHHHHHHhhhccccchhhhhhhccCC Confidence 887776642 2346677888888888899999999853 4678899998643 3578 Q ss_pred cceEEEEcCeEEEeecccCceeeecHHHHHHHHHHHhhccCCceECcCCcccceeeccccceeecChhHHhhhhhcCceE Q lcl|NC_016163. 393 TYFTAIFGQHMNVYDEYNGETITVTSTYFLASMIPSNDDQNGIQWTFVGPRRGVISGFTDINFYPNEPWKEKLYLAQVNY 472 (590) Q Consensus 393 s~~~~~~~p~~~~~d~~~~~~~~~ppsg~~AG~~A~~D~~~G~~~sPan~~~~~i~g~~~~~~~~~~~e~~~Ln~~gIn~ 472 (590) |.|+++||||++++|+.+++.+++||||++||+|||+|.++||||||||+++++|.|+.++++.+++.|++.||++|||| T Consensus 460 s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~AGl~Ar~D~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~ 539 (671) T protein:vir:56 460 TTYAVIDGNYKYQYDKYNDRNRWVPLAGDIAGLCAYTDQVSQPWMSPAGFNRGQIKGVNRLAVDLRRAHRDALYQIGINP 539 (671) T ss_pred cceEEEecCceEEecccCCceeEechHHHHHHHHHHhhccCCcEECcCCceeccccccccceeecChhHHHHHhhCCceE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEecCCeEEEecceecCCC-cccceehhhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhCCceEE- Q lcl|NC_016163. 
473 IERDPKKISFATQLTSQTSR-SALSYINNVRVLLRIRREVEKMMADYRQEFQDNTTYDSMSYSLNNYLQQWVANRACSS- 550 (590) Q Consensus 473 i~~~~~~G~~~wG~rT~s~d-~~~~~i~vrR~~~~i~~si~~~~~~~vfepn~~~l~~~v~~~i~~~L~~l~~~ga~~~- 550 (590) ||+|+++|+++||+||++++ ++|+||||||||+|||++|+++++|+||||||+.||++|+++|+.||++||++|+|.| T Consensus 540 i~~~~~~G~~~wG~rT~~~~~~~~~~i~vrR~~~~i~~si~~~~~~~v~epn~~~~~~~i~~~i~~fL~~l~~~gal~g~ 619 (671) T protein:vir:56 540 VVGFAGQGFVLYGDKTATQQASAFDRINVRRLFNLLKKAISDAAKYRLFELNDEFTRSSFKSEIDAYLTNIQDLGGVYDF 619 (671) T ss_pred EEEecCCeEEEEcceecCCCCcccceEehhhHHHHHHHHHHHHHHHhcCCCCCHHHHHHHHHHHHHHHHHHHhCCceeee Confidence 99999999999999999876 5899999999999999999999999999999999999999999999999999999866 Q ss_pred --EecCCCCCHHHhhCCEEEEEEEEEecCccceEEEEEEeeC Q lcl|NC_016163. 551 --ISGTVYASDYDKQQSIARVKVELVFTGVIERIAIDLVVNK 590 (590) Q Consensus 551 --~~d~~~nt~~~i~~G~l~~~i~~ap~~paefi~~~~~~~~ 590 (590) +||+++||++||++|+|+++|+|+|++|||||+|||.|++ T Consensus 620 ~v~~d~~~nt~~~i~~G~~~~~i~~~p~~Pae~I~~~~~~~~ 661 (671) T protein:vir:56 620 RVVCDETNNPGSVIDRNEFVASIYVKPAKSINFITLNFVATS 661 (671) T ss_pred EEEEcCCCCCHHHhhCCeEEEEEEEEecCCcceEEEEEEEee Confidence 6999999999999999999999999999999999999999 No 15 >protein:vir:100539 Length: 663 # NCBI annotation: gp18 tail sheath monomer # Family: family:all:661 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656380;genbank:gi:109290131;genbank:GeneID:4156517 Probab=100.00 E-value=5.8e-125 Score=701.69 Aligned_cols=578 Identities=15% Similarity=0.140 Sum_probs=340.5 Q ss_pred CccccCCceEEEEecCCCceecccccceeEEEeecCCCCCCccEEecCHHHHHHhcCCccccccccHHHHHHHHHcCCC- Q lcl|NC_016163. 1 MADYLHPSVSSRIVDNSAVYATAAGNTVLYAAIHSAIGRDNAVEFVTTTDEFLFKFGNPNLSKYGQTSYNILNWLQSGG- 79 (590) Q Consensus 1 Mp~yl~PGVYveEi~s~~~~i~gv~Tsv~~~vg~~~~Gp~~~p~~v~s~~e~~~~fG~~~~~~~~~l~~av~~ff~nGG- 79 (590) |. ||||||||||+| ++++|+||+|++++|+|.++|||+++|++|+||.||++.||+++.. 
.+++|+|++||.||| T Consensus 1 ~~-~~~Pgvyv~e~~-~~~~~~~v~t~~~~fvG~~~~gp~~~p~~i~s~~~~~~~fG~~~~~--~~~~~~v~~~f~ngg~ 76 (663) T protein:vir:10 1 MA-LLSPGIEMKETS-INSTVVRSATGRAALVGKFAWGPAYEIRQVTNEVELVDMFGSPDNV--TAPYFMSAMNFLQYGN 76 (663) T ss_pred Cc-cccCceEEEEec-CcccccccccccceeeeccccCCCCcCEEecCHHHHHHHcCCcccc--cchHHHHHHHHHhCCC Confidence 76 999999999996 7889999999999999999999999999999999999999998644 679999999999999 Q ss_pred cEEEEEEecCC-cccccccccccee---ecccccccceeeeeeccccccccccceEEEeeccccCCcceeeEeecc---- Q lcl|NC_016163. 80 TAYVLRVMPDD-AKFANSLISIKTT---AAADPAKATVLVTAKAQTTNTASKNAMKTILSGGTAGETPLCFIVPKG---- 151 (590) Q Consensus 80 ~~~vvRv~~~~-a~~a~~~~~~~~~---~a~~~~~~~~~v~~~~~~~~~a~~~~~~~~~~~~t~~~~~~~~~~~~~---- 151 (590) +||||||.+++ +++++........ +.+..+.+.+.+.....................+.. .....+.+ T Consensus 77 ~~~vvRv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~v~~~~~~~~~~~~~~~~~~~g~~----~~~~~~~a~~~~ 152 (663) T protein:vir:10 77 DLRLVRVIDMEQAKNASPLFNQIEVTITTEGQGYTVGDTVSIKHNTTTVTEEGKVTKVDADGKI----KALFVPSSAVIA 152 (663) T ss_pred eEEEEecCCcccccccccccccceeeEeecccCccccceeeecccccccccCcceeeeccCCce----eEEEeccccccc Confidence 59999998653 4444443322211 122223334444333222111111111111111100 00000000 Q ss_pred ----cccc---cccceEEEEeecccccccccccc------ceeeeeecccCCCceeeeeeeeecc---c--ccccc-ccc Q lcl|NC_016163. 152 ----RGEN---YNGYGFRLSLRSDYDNTYNFRTY------NLSVTVKDSTGADVVVEGPYIVSFD---P--EAKDK-SRQ 212 (590) Q Consensus 152 ----~g~~---~~~~~~~~~~~~~~~~~~~~~~~------~l~i~v~d~~~~~~v~e~~~~ls~~---~--da~~~-~~~ 212 (590) .... ......... ............. ...+...+................. + .+... ... 
T Consensus 153 ~a~~~~~~~~~~~a~~~~v~-~~~~~~~~a~av~~i~~dg~vt~~~~~~a~~~~~~~~~~~~~~~~~~~~~~a~~~g~~G 231 (663) T protein:vir:10 153 KAKQLGTYPVLGDNWRAEVS-GASGGSAATLTLGGIVVDSGVTFGNSEEAPDVMTSTKVLANFAKYGMPLISAVYPGEIG 231 (663) T ss_pred cccccccccccccceeeEEe-eccccccccceeEeeecCCceeEEeeeccccccccceeeeeccccccceeeeecccccC Confidence 0000 000000000 0000000000000 0000000000000000000000000 0 00000 000 Q ss_pred cceeeeeeccccce-----eeecCccccceeeeeecccccccCccccceecccccccc---cccccccccccceeecccc Q lcl|NC_016163. 213 SIYYANIINKYSQY-----VEIVDNRSAFETISEFVVGDSEADPQKVDIIFGQERAVT---PAETIHANVVWKSSSVETD 284 (590) Q Consensus 213 ~~~~~~vv~~~s~~-----v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~ 284 (590) ......+.. .+.. .............................++...++... ................. . T Consensus 232 ~~i~v~~~~-~~~~~~~~~~~v~~~~g~~~~~~~~~~~~g~~~~~~~~~~~~~~g~~~e~~~ls~~~~~~~~~~~~~~-~ 309 (663) T protein:vir:10 232 STVEVEVIS-KTAFQSGAAQPIYPFGGTRASNARSVIQYGPMTDDQFAIIVRRDGIVVESTVLSTRRGDRDVYGNNIF-M 309 (663) T ss_pred cceeEeecc-cccccccceeeecccCcccccccccccccccccchhhcccccCCCcccceeeeeccccccccchhhhh-h Confidence 000000000 0000 000000000000000000000000000011111110000 00000000000000000 0 Q ss_pred ccccccccccccccee----eeeccccccccccccceeccchhhHHHHHHHhhhccCCceeeecc-c------chhHHHH Q lcl|NC_016163. 285 DPSYDATAANFNNIQY----LTEGSEGTWTGGNEESALLVKGYSGVLAPEILDKQQYEIDVLLDG-N------NEVAVKN 353 (590) Q Consensus 285 ~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~------~~~~~~~ 353 (590) ................ ........+.++.++......+........+...+..+..+++.+ . ...+++. 
T Consensus 310 ~~~~~~~~s~~v~~~~~~~~~~~~~~~~l~gg~~~~~~~~~~d~~~~~~~l~~~~~~~~~~~i~~~~~~~~~~~~~~v~~ 389 (663) T protein:vir:10 310 DDYFRNGSSNFIYASSVNWPAGFTGIIQLGGGASANNAVGSDELIAGWDLFADREALHVNLMIAGACKSDGVAVASTVQK 389 (663) T ss_pred hhhhcCcccceeEeeccccCcccceeEEecccccCcccchhhhhhhHHhhhccccccCceEEEeecCCCCchhhHHHHHH Confidence 0000000000000000 000001133344443322222222223334444444444344332 2 2356888 Q ss_pred HHHHHHHHhcCCeEEEEecCCCC--------CHHHHHHHHHh----------hcCcccceEEEEcCeEEEeecccCceee Q lcl|NC_016163. 354 AMSDLCSEQRGDCIAILDCSFQG--------DAQQTIDYRTG----------NISMSTYFTAIFGQHMNVYDEYNGETIT 415 (590) Q Consensus 354 a~~~~~~~~~~~~~a~~d~p~~~--------~~~~~~~~~~~----------~~~~~s~~~~~~~p~~~~~d~~~~~~~~ 415 (590) +++.||++ +++||+|+|+|.+. ..+++.+||.. ..+++|.|+++||||++++|+.+++.++ T Consensus 390 ~l~~~~~~-~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~l~~p~~~~~d~~~~~~~~ 468 (663) T protein:vir:10 390 HVVALADD-RQDCVAFVNPPSELLVGVPTTQAVKNIVEWRNGVTTGGEVVDNNMNISSTYAFISGNYKYQYDKYNDINRW 468 (663) T ss_pred HHHHHHHh-hCCEEEEEecCcccccccchhhhHHHHHHHhhhccccchhhhhhcccCcceEEEEecceeEecccCCceEE Confidence 99999976 46899999999764 34678888854 2367899999999999999999999999 Q ss_pred ecHHHHHHHHHHHhhccCCceECcCCcccceeeccccceeecChhHHhhhhhcCceEEEEecC-CeEEEecceecCCC-c Q lcl|NC_016163. 
416 VTSTYFLASMIPSNDDQNGIQWTFVGPRRGVISGFTDINFYPNEPWKEKLYLAQVNYIERDPK-KISFATQLTSQTSR-S 493 (590) Q Consensus 416 ~ppsg~~AG~~A~~D~~~G~~~sPan~~~~~i~g~~~~~~~~~~~e~~~Ln~~gIn~i~~~~~-~G~~~wG~rT~s~d-~ 493 (590) +||||++||+|||+|.++||||||||+++.+|.|++++++.+++.|++.||++|||||+.|++ +|+++||+||++++ + T Consensus 469 ~p~s~~vAGl~Ar~D~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gin~i~~~~~~~G~~~wG~rT~s~~~s 548 (663) T protein:vir:10 469 VPLSADIAGLCAYTDQVGHPWMSPAGYRRGQLRNTIKLAIEPKQSLRDTMYQVSINPVTGFAGGDGFVLFGDKMATQVPS 548 (663) T ss_pred echHHHHHHHHHHhhccCCcEEccCCeeecceeccccceeecCchhHHHHHhCCCcEEEEeeCCCcEEEEcccccCCCCc Confidence 999999999999999999999999999999999999999999999999999999999999997 79999999999876 5 Q ss_pred ccceehhhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhCCceEE---EecCCCCCHHHhhCCEEEEE Q lcl|NC_016163. 494 ALSYINNVRVLLRIRREVEKMMADYRQEFQDNTTYDSMSYSLNNYLQQWVANRACSS---ISGTVYASDYDKQQSIARVK 570 (590) Q Consensus 494 ~~~~i~vrR~~~~i~~si~~~~~~~vfepn~~~l~~~v~~~i~~~L~~l~~~ga~~~---~~d~~~nt~~~i~~G~l~~~ 570 (590) +|+||||||||+||+++|+++++|+||||||+.||++|+++|+.||++||++|+|.| +||+++||++||++|+|+++ T Consensus 549 ~~~~i~vrR~~~~i~~si~~~~~~~v~epn~~~l~~~i~~~i~~~L~~l~~~gal~gf~V~~d~~~nt~~~i~~G~~~~~ 628 (663) T protein:vir:10 549 PFDRINVRRLFNMLKKNIGDTSKYELFENNDAFTRQSFRMEVSQYLDNIRSLGGVYDFRVVCDTTNNTPQVIDSNEFVAT 628 (663) T ss_pred ccceEehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEcCCCCCHHHhhCCeEEEE Confidence 899999999999999999999999999999999999999999999999999999876 69999999999999999999 Q ss_pred EEEEecCccceEEEEEEeeC Q lcl|NC_016163. 
571 VELVFTGVIERIAIDLVVNK 590 (590) Q Consensus 571 i~~ap~~paefi~~~~~~~~ 590 (590) |+++|++|+|||+|||.|+| T Consensus 629 i~~~p~~pae~I~~~~~~~~ 648 (663) T protein:vir:10 629 IYIKAPRSINYITLNFVATS 648 (663) T ss_pred EEEEecCCcceEEEEEEEEe Confidence 99999999999999999999 No 16 >protein:vir:79092 Length: 477 # NCBI annotation: gp13, phage tail sheath protein # Family: family:all:115 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111213;genbank:gi:134288804;genbank:GeneID:4960766 Probab=100.00 E-value=1.4e-109 Score=617.32 Aligned_cols=451 Identities=13% Similarity=0.066 Sum_probs=306.1 Q ss_pred CccccCCceEEEEecCCCceecccccceeEEEeecCCCCCCccEEecCHHHHHHhcCCccccccccHHHHHHHHHcCCC- Q lcl|NC_016163. 1 MADYLHPSVSSRIVDNSAVYATAAGNTVLYAAIHSAIGRDNAVEFVTTTDEFLFKFGNPNLSKYGQTSYNILNWLQSGG- 79 (590) Q Consensus 1 Mp~yl~PGVYveEi~s~~~~i~gv~Tsv~~~vg~~~~Gp~~~p~~v~s~~e~~~~fG~~~~~~~~~l~~av~~ff~nGG- 79 (590) ||+|++|||||||+++++++|++|+|+|.+|+|.+++||+|+|++|+||.||++ ||+... .+.|++||++||+||| T Consensus 1 M~~~~~pGVyv~E~~~g~~~I~~v~Tsv~~~VG~a~~~p~n~pv~its~~d~~~-~g~~~~--~~tL~~Av~~~f~ngg~ 77 (477) T protein:vir:79 1 MAANYLHGVETIEKETGSRPVKVVKSAVIGLIGTAPIGPVNTPVQSLSDVDAAQ-FGPQLA--GFTIPQALDAVYDYGSG 77 (477) T ss_pred CcCCCCCCeEEEEecCCcccccccCCceEEEEeecccCCCcccEEEccHHHHHH-hcCCCC--CCcHHHHHHHHhhcCCc Confidence 999999999999999999999999999999999999999999999999999997 555432 3579999999999999 Q ss_pred cEEEEEEecCCccccccccccceeecccccccceeeeeeccccccccccceEEEeeccccCCcceeeEeecccccccccc Q lcl|NC_016163. 80 TAYVLRVMPDDAKFANSLISIKTTAAADPAKATVLVTAKAQTTNTASKNAMKTILSGGTAGETPLCFIVPKGRGENYNGY 159 (590) Q Consensus 80 ~~~vvRv~~~~a~~a~~~~~~~~~~a~~~~~~~~~v~~~~~~~~~a~~~~~~~~~~~~t~~~~~~~~~~~~~~g~~~~~~ 159 (590) +||||||.+.+...+............ .......... 
T Consensus 78 ~~~vvrV~~~~~~~~~~a~~~~~~~~~-------------------------------------------~~~~~~~~~~ 114 (477) T protein:vir:79 78 TVIVINVLDPAVHKSNAASESVTFDAA-------------------------------------------TGRAKLAHPA 114 (477) T ss_pred eEEEEeccCCccccccccccccccccc-------------------------------------------cccccccccc Confidence 599999964322111100000000000 0000000000 Q ss_pred eEEEEeeccccccccccccceeeeeecccCCCceeeeeeeeeccccccccccccceeeeeeccccceeeecCccccceee Q lcl|NC_016163. 160 GFRLSLRSDYDNTYNFRTYNLSVTVKDSTGADVVVEGPYIVSFDPEAKDKSRQSIYYANIINKYSQYVEIVDNRSAFETI 239 (590) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~l~i~v~d~~~~~~v~e~~~~ls~~~da~~~~~~~~~~~~vv~~~s~~v~~~~~~~~~~~~ 239 (590) ...+.+................ ...+.............. T Consensus 115 ~~~~~v~~~~~~~~~~~~~~~~-----------------------------------~~~~~~~~~~~~~~~~~~----- 154 (477) T protein:vir:79 115 AANLVLKNDSGGTTYTEGTDYA-----------------------------------VDLINGVITRIKTGTIPA----- 154 (477) T ss_pred cceeEEeecccccccccCcccc-----------------------------------ccccchhhhhhhcccccc----- Confidence 0000000000000000000000 000000000000000000 Q ss_pred eeecccccccCccccceecccccccccccccccccccceeecccccccccccccccccceeeeeccccccccccccceec Q lcl|NC_016163. 240 SEFVVGDSEADPQKVDIIFGQERAVTPAETIHANVVWKSSSVETDDPSYDATAANFNNIQYLTEGSEGTWTGGNEESALL 319 (590) Q Consensus 240 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 319 (590) ........... . .... ......... .+.. ... .+ T Consensus 155 ----------~~~~~~~~~~~-~------------~~~~-----------~~~~~~~g~---~~a~-~~~---tg----- 188 (477) T protein:vir:79 155 ----------AATAAKATYDY-A------------DPTK-----------VTAADIIGA---VNAA-GMR---TG----- 188 (477) T ss_pred ----------ccceeeceecc-C------------Cccc-----------ceeeeeccc---cccc-ccc---hh----- Confidence 00000000000 0 0000 000000000 0000 000 00 Q ss_pred cchhhHHHHHHHhhhccCCceeeeccc--chhHHHHHHHHHHHHhcCCeEEEEecCCCCCHHHHHHHHHhh----cCccc Q lcl|NC_016163. 
320 VKGYSGVLAPEILDKQQYEIDVLLDGN--NEVAVKNAMSDLCSEQRGDCIAILDCSFQGDAQQTIDYRTGN----ISMST 393 (590) Q Consensus 320 ~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~a~~~~~~~~~~~~~a~~d~p~~~~~~~~~~~~~~~----~~~~s 393 (590) ..+...........+.++..++ ...++..++..+|++++ ||+++|+|.+.+.+++.+|+... .+++| T Consensus 189 -----~~al~~~~~~~~~~~~iv~apg~~~~~~v~~~l~~~~~~~~--~~a~~d~p~~~~~~~~~~~~~~~~~~~~~~~s 261 (477) T protein:vir:79 189 -----MKALKDTYNLYGYFSKILIAPAYCTQNSVSVELEAMAVQLG--AIAYIDAPIGTTLAQALAGRGPAGTINFNTSS 261 (477) T ss_pred -----hhhhhhhhhhcccccceeeccccccchhHHHHHHHHHhhcC--eEEEEecCCCCChHHHhhhhhhcccccccccc Confidence 0000111111222233444444 34568888888887654 99999999999999999998643 35689 Q ss_pred ceEEEEcCeEEEeecccCceeeecHHHHHHHHHHHhhccCCceECcCCcccceeecc-ccce--eecChhHHhhhhhcCc Q lcl|NC_016163. 394 YFTAIFGQHMNVYDEYNGETITVTSTYFLASMIPSNDDQNGIQWTFVGPRRGVISGF-TDIN--FYPNEPWKEKLYLAQV 470 (590) Q Consensus 394 ~~~~~~~p~~~~~d~~~~~~~~~ppsg~~AG~~A~~D~~~G~~~sPan~~~~~i~g~-~~~~--~~~~~~e~~~Ln~~gI 470 (590) .|+++||||++++|+.++..+++|||+++||+|||+|.++|+||||||+.+.++.++ ..+. ...++.|++.||++|| T Consensus 262 ~~~~~~~p~~~~~~~~~~~~~~~p~s~~~ag~~a~~d~~~g~~~span~~~~gv~~~~~~~~~~~~~~~~~~~~L~~~~i 341 (477) T protein:vir:79 262 DRVRLCYPHVKVYDIATNAERLEPLSSRAAGLRARVDLDKGYWWSSSNQQLVGVTGVERPLSAMIDDPQSDVNMLNEQGI 341 (477) T ss_pred ceEEEEcCeeEEecccCCceeeechHHHHHHHHHHhhccCCceEccCCceeecceecccccccccCCChhhHHHHhhCCc Confidence 999999999999999999999999999999999999999999999999875444443 2222 2235678999999999 Q ss_pred eEEEEecCCeEEEecceecC---CCcccceehhhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhCCc Q lcl|NC_016163. 
471 NYIERDPKKISFATQLTSQT---SRSALSYINNVRVLLRIRREVEKMMADYRQEFQDNTTYDSMSYSLNNYLQQWVANRA 547 (590) Q Consensus 471 n~i~~~~~~G~~~wG~rT~s---~d~~~~~i~vrR~~~~i~~si~~~~~~~vfepn~~~l~~~v~~~i~~~L~~l~~~ga 547 (590) ||||+|+++|+++||+||++ .++.|+||||||+|++|+++|+++++|+|||||++.||++|+++|+.||++||++|+ T Consensus 342 ~~i~~~~~~G~~~wG~rT~~~~~~~~~~~~i~vrR~~~~i~~~~~~~~~~~v~e~~~~~~~~~i~~~i~~~l~~l~~~g~ 421 (477) T protein:vir:79 342 TTVFSSYGSGLRLWGNRTAAWPTVTHMRNFENVRRTGDVINESLRYFSQQFVDAPIDQGLIDSLVESVNGFGRKLIGDGA 421 (477) T ss_pred eEEEEecCCcEEEEcccccCCCCCCccceeeehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCc Confidence 99999999999999999994 467899999999999999999999999999999999999999999999999999999 Q ss_pred eEE---EecCCCCCHHHhhCCEEEEEEEEEecCccceEEEEEEeeC Q lcl|NC_016163. 548 CSS---ISGTVYASDYDKQQSIARVKVELVFTGVIERIAIDLVVNK 590 (590) Q Consensus 548 ~~~---~~d~~~nt~~~i~~G~l~~~i~~ap~~paefi~~~~~~~~ 590 (590) |.| +||+++||++||++|+|+++|+++|++|+|||+|++.++. T Consensus 422 l~g~~v~~~~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~ 467 (477) T protein:vir:79 422 LLGFKAWFDPARNPKEELAAGHLLINYKYTVPPPLERLTYETEITS 467 (477) T ss_pred eeeeEEEEecCCCCHHHhhCCeEEEEEEEEecCCceeEEEEEEEec Confidence 876 5999999999999999999999999999999999999999 No 17 >protein:vir:107865 Length: 477 # NCBI annotation: gp39 # Family: family:all:115 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024712;genbank:gi:48696949;genbank:GeneID:2845953 Probab=100.00 E-value=7.2e-108 Score=607.98 Aligned_cols=450 Identities=12% Similarity=0.051 Sum_probs=306.4 Q ss_pred CccccCCceEEEEecCCCceecccccceeEEEeecCCCCCCccEEecCHHHHHHhcCCccccccccHHHHHHHHHcCCCc Q lcl|NC_016163. 1 MADYLHPSVSSRIVDNSAVYATAAGNTVLYAAIHSAIGRDNAVEFVTTTDEFLFKFGNPNLSKYGQTSYNILNWLQSGGT 80 (590) Q Consensus 1 Mp~yl~PGVYveEi~s~~~~i~gv~Tsv~~~vg~~~~Gp~~~p~~v~s~~e~~~~fG~~~~~~~~~l~~av~~ff~nGG~ 80 (590) ||+|++|||||||+++++++|++|+|+|.+|+|++.+||+|+|++|+||.||. .||+...+ +.|++||++||+|||. 
T Consensus 1 M~~~~~pGVyv~E~~~~~~~i~~v~T~v~~~VG~a~~gp~n~pv~its~~d~~-~~g~~~~~--~tL~~Av~~~f~nGg~ 77 (477) T protein:vir:10 1 MAANYLHGVETIEKETGSRPVKVVKSAVIGLIGTAPIGPVNTPVQSLSDVDAA-QFGPQLAG--FTIPQALDAVYDYGSG 77 (477) T ss_pred CcccCCCCeEEEEccCCcccccccCCceeEEEecccCCCCCcCEEEccHHHHH-HhccCCCC--CcHHHHHHHHHhccce Confidence 99999999999999999999999999999999999999999999999999995 57775433 6799999999999994 Q ss_pred -EEEEEEecCCccccccccccceeecccccccceeeeeeccccccccccceEEEeeccccCCcceeeEeecccccccccc Q lcl|NC_016163. 81 -AYVLRVMPDDAKFANSLISIKTTAAADPAKATVLVTAKAQTTNTASKNAMKTILSGGTAGETPLCFIVPKGRGENYNGY 159 (590) Q Consensus 81 -~~vvRv~~~~a~~a~~~~~~~~~~a~~~~~~~~~v~~~~~~~~~a~~~~~~~~~~~~t~~~~~~~~~~~~~~g~~~~~~ 159 (590) ||||||.......+...... . . .....+. T Consensus 78 ~~~vVrV~~~~~~~~~~~~~~---------------~-----------------------~-------~~~~~~~----- 107 (477) T protein:vir:10 78 TVIVINVLDPAVHKSNAANEP---------------V-----------------------T-------FDAATGR----- 107 (477) T ss_pred EEEEEecCccccccccccccc---------------c-----------------------c-------cccccce----- Confidence 99999954321111000000 0 0 0000000 Q ss_pred eEEEEeeccccccccccccceeeeeecccCCCceeeeeeeeeccccccccccccceeeeeeccccceeeecCccccceee Q lcl|NC_016163. 160 GFRLSLRSDYDNTYNFRTYNLSVTVKDSTGADVVVEGPYIVSFDPEAKDKSRQSIYYANIINKYSQYVEIVDNRSAFETI 239 (590) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~l~i~v~d~~~~~~v~e~~~~ls~~~da~~~~~~~~~~~~vv~~~s~~v~~~~~~~~~~~~ 239 (590) ....... ... . .+...+........ ..... T Consensus 108 ---~~~~~~~--------------------~~~--~-----------------------~v~~~a~~~~~~~~--~~~~~ 137 (477) T protein:vir:10 108 ---AKLAHPA--------------------AAN--L-----------------------VLKNDSGGTTYAEG--TDYAV 137 (477) T ss_pred ---ecccccc--------------------ccc--c-----------------------cccccccccccccc--hhhhh Confidence 0000000 000 0 00000000000000 00000 Q ss_pred eeecccccccCccccceecccccccccccccccccccceeecccccccccccccccccceeeeeccccccccccccceec Q lcl|NC_016163. 
240 SEFVVGDSEADPQKVDIIFGQERAVTPAETIHANVVWKSSSVETDDPSYDATAANFNNIQYLTEGSEGTWTGGNEESALL 319 (590) Q Consensus 240 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 319 (590) .... .. ......... .. ........... ...... ..... .+..+. T Consensus 138 ~~~~-----~~--~~~~~~~~~-----~~------~~~~~~~~~~~--~~~~~~--~~~~~---------~g~~~~---- 182 (477) T protein:vir:10 138 DLIN-----GV--ITRIKTGTI-----PP------GATAAKATYDY--ADPTKV--TAADI---------IGAVNA---- 182 (477) T ss_pred hhcc-----cc--ceecccccc-----cc------cceeeeecccc--cccccc--ccccc---------cccccc---- Confidence 0000 00 000000000 00 00000000000 000000 00000 000000 Q ss_pred cchhhH-HHHHHHhhhccCCceeeeccc--chhHHHHHHHHHHHHhcCCeEEEEecCCCCCHHHHHHHHHhh----cCcc Q lcl|NC_016163. 320 VKGYSG-VLAPEILDKQQYEIDVLLDGN--NEVAVKNAMSDLCSEQRGDCIAILDCSFQGDAQQTIDYRTGN----ISMS 392 (590) Q Consensus 320 ~~~~~~-~~~~~~~~~~~~~~~~~~~~~--~~~~~~~a~~~~~~~~~~~~~a~~d~p~~~~~~~~~~~~~~~----~~~~ 392 (590) .....+ ...............++..++ ...++..++..+|++++ ||+++|+|.+.+.+++.+|+... .+++ T Consensus 183 ~~~~tGl~al~~~~~~~~~~~~~l~apg~~~~~~v~~~l~~~~~~~~--~~~~~d~p~~~~~~~~~~~~~~~~~~~~~~~ 260 (477) T protein:vir:10 183 AGMRTGMKALKDTYNLYGYFSKILIAPAYCTQNSVSVELEAMAVQLG--AIAYIDAPIGTTLAQALAGRGPAGTINFNTS 260 (477) T ss_pred cchhhhhhhhhhhhhhcchhcccccccccccchhhHHHHHHHHhhCC--EEEEEecCCCCCHHHHHhhhhhccccccccc Confidence 000001 111111111222223444343 23467788888887654 99999999999999999999643 2567 Q ss_pred cceEEEEcCeEEEeecccCceeeecHHHHHHHHHHHhhccCCceECcCCcccceeecc-cccee--ecChhHHhhhhhcC Q lcl|NC_016163. 
393 TYFTAIFGQHMNVYDEYNGETITVTSTYFLASMIPSNDDQNGIQWTFVGPRRGVISGF-TDINF--YPNEPWKEKLYLAQ 469 (590) Q Consensus 393 s~~~~~~~p~~~~~d~~~~~~~~~ppsg~~AG~~A~~D~~~G~~~sPan~~~~~i~g~-~~~~~--~~~~~e~~~Ln~~g 469 (590) |.|+++||||++++|+.++..+++|||+++||+|||+|.++|+||||||+.+.+|.++ ..+.+ ..++.|++.||++| T Consensus 261 s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~ag~~a~~d~~~g~~~span~~~~gi~~~~~~~~~~~~~~~~~~~~L~~~g 340 (477) T protein:vir:10 261 SDRVRLCYPHVKVYDTATNAERLEPLSSRAAGLRARVDLDKGYWWSSSNQQLVGVTGVERPLSAMIDDPQSDVNMLNEQG 340 (477) T ss_pred cceEEEEcCeEEEecccCCceeEEchHHHHHHHHHHhhhcCCceeccCCceeccccccccccccccCCChhhHHHHhhCC Confidence 9999999999999999999999999999999999999999999999999876555554 22322 23567899999999 Q ss_pred ceEEEEecCCeEEEecceecC---CCcccceehhhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhCC Q lcl|NC_016163. 470 VNYIERDPKKISFATQLTSQT---SRSALSYINNVRVLLRIRREVEKMMADYRQEFQDNTTYDSMSYSLNNYLQQWVANR 546 (590) Q Consensus 470 In~i~~~~~~G~~~wG~rT~s---~d~~~~~i~vrR~~~~i~~si~~~~~~~vfepn~~~l~~~v~~~i~~~L~~l~~~g 546 (590) |||||+|+++|+++||+||++ .++.|+||||||||++|+++|+++++|+||||||+.+|++|+++|++||++||++| T Consensus 341 i~~i~~~~~~G~~~wG~rT~~~~~~~~~~~~~~vrR~~~~i~~~~~~~~~~~v~~~~~~~~~~~i~~~i~~~l~~l~~~g 420 (477) T protein:vir:10 341 ITTVFSSYGSGLRLWGNRTAAWPTVTHMRNFENVRRTGDVINESLRYFSQQFVDAPIDQGLIDSLVESVNGFGRKLIGDG 420 (477) T ss_pred ceEEEEecCCcEEEEcccccCCCCCCcccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCC Confidence 999999999999999999995 46789999999999999999999999999999999999999999999999999999 Q ss_pred ceEE---EecCCCCCHHHhhCCEEEEEEEEEecCccceEEEEEEeeC Q lcl|NC_016163. 547 ACSS---ISGTVYASDYDKQQSIARVKVELVFTGVIERIAIDLVVNK 590 (590) Q Consensus 547 a~~~---~~d~~~nt~~~i~~G~l~~~i~~ap~~paefi~~~~~~~~ 590 (590) +|.| +||+++||++||++|+|+++|+++|++|||||+|++.++. 
T Consensus 421 ~l~g~~v~~~~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~ 467 (477) T protein:vir:10 421 ALLGFKAWFDPARNPKEELAAGHLLINYKYTVPPPLERLTYETEITS 467 (477) T ss_pred ceeeeEEEEecCCCCHHHhhCCeEEEEEEEEecCCcceEEEEEEEcc Confidence 9876 5999999999999999999999999999999999999998 No 18 >protein:vir:98824 Length: 774 # NCBI annotation: putative phage tail sheath protein # Family: family:all:661 # MgeID: mge:1530 # MgeName: Ma-LMM01 # Cross-refs: genbank:acc:YP_851105;genbank:gi:117530262;genbank:GeneID:4484488 Probab=100.00 E-value=5.1e-103 Score=581.37 Aligned_cols=475 Identities=13% Similarity=0.033 Sum_probs=301.0 Q ss_pred Cc-cccCCceEEEEecCCCceecc-cccceeEEEeecCCCCCCccEEecCHHHHHHhcCCccccccccHHHHHHHHHcCC Q lcl|NC_016163. 1 MA-DYLHPSVSSRIVDNSAVYATA-AGNTVLYAAIHSAIGRDNAVEFVTTTDEFLFKFGNPNLSKYGQTSYNILNWLQSG 78 (590) Q Consensus 1 Mp-~yl~PGVYveEi~s~~~~i~g-v~Tsv~~~vg~~~~Gp~~~p~~v~s~~e~~~~fG~~~~~~~~~l~~av~~ff~nG 78 (590) |- +|-.|||||||+|+++|+|+| |+|++++|+|.++|||+++|++|+||.||.+.||..... .+| T Consensus 279 ~~~~v~~~GVYVEEVpSGvrtIeGGV~TSVAAFVG~A~rGPvn~PvlITS~aD~~~~Fg~~~GG-------------l~G 345 (774) T protein:vir:98 279 ITRNVEDNGVVIQLEPALTGSISNRFSFYVTANDNTANRGFTTSPALVTTIPDPAIHFTSFQGG-------------LDG 345 (774) T ss_pred eEEEEecCceEEEEeCCCCccccccccceeeeecccccCCCCCcCEEEeehhHhhhhhccccCC-------------ccc Confidence 44 578899999999999999998 999999999999999999999999999977777653211 135 Q ss_pred C-cEEEEEEecCCccccccccccceeecccccccceeeeeeccccccccccceEEEeeccccCCcceeeEeecccccccc Q lcl|NC_016163. 79 G-TAYVLRVMPDDAKFANSLISIKTTAAADPAKATVLVTAKAQTTNTASKNAMKTILSGGTAGETPLCFIVPKGRGENYN 157 (590) Q Consensus 79 G-~~~vvRv~~~~a~~a~~~~~~~~~~a~~~~~~~~~v~~~~~~~~~a~~~~~~~~~~~~t~~~~~~~~~~~~~~g~~~~ 157 (590) + +||.+.-.-.. 
...+...+.++|.+.+.+++...+ .+.+.............- T Consensus 346 assA~r~~~~~sG-------~~~L~i~A~~pGawGN~ItV~I~~---------------~t~~~~~l~v~~~~~s~f--- 400 (774) T protein:vir:98 346 PRSAFRDFYTFNG-------TPLLRLQAVSEGNWGNQVTVSIYP---------------VNNSEFRLNVQDLNGSAF--- 400 (774) T ss_pred cceeeeeeeeecc-------cceEEEEEeecCcCCCceEEEEEe---------------cCCceeEEEEEecCCccc--- Confidence 5 35532211000 011222233333333222221111 000000000000000000 Q ss_pred cceEEEEeeccccccccccccceeeeeecccCCCceeeeeeeeeccccccccccccceeeeeeccccceeeecCccccce Q lcl|NC_016163. 158 GYGFRLSLRSDYDNTYNFRTYNLSVTVKDSTGADVVVEGPYIVSFDPEAKDKSRQSIYYANIINKYSQYVEIVDNRSAFE 237 (590) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~l~i~v~d~~~~~~v~e~~~~ls~~~da~~~~~~~~~~~~vv~~~s~~v~~~~~~~~~~ 237 (590) ........+.+...+......+.|.+.++.+........ ...++..+.++.... T Consensus 401 --------------~~~~a~e~~tv~~~~~~~~~~v~e~~dn~~i~~~~~~~~------~~~in~vs~lv~~~~------ 454 (774) T protein:vir:98 401 --------------NPPLADEVYTVKLGDTNESGELNALLDSKFIRGFFLPKS------IDSINYDAALVRQSP------ 454 (774) T ss_pred --------------cccccceeEEEecccccccceeeeeeceeeEeecccccc------cccccccccccccch------ Confidence 000000001111111111111122222111110000000 000000000000000 Q ss_pred eeeeecccccccCccccceecccccccccccccccccccceeecccccccccccccccccceeeeeccccccccccccce Q lcl|NC_016163. 238 TISEFVVGDSEADPQKVDIIFGQERAVTPAETIHANVVWKSSSVETDDPSYDATAANFNNIQYLTEGSEGTWTGGNEESA 317 (590) Q Consensus 238 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 317 (590) .... ..+.. ..........+. ...... ...+..+|.++.. T Consensus 455 ----~~~a-------------~~d~~----------~~~~~~~~~~~~--------~~~~~~-----v~v~lagG~Dg~~ 494 (774) T protein:vir:98 455 ----LRLA-------------PPDES----------ETDVENPAHVDF--------YGPNVL-----VDVTLENGYDGPP 494 (774) T ss_pred ----hccc-------------ccccc----------cccccccccccc--------cCCcce-----EEEeecCCCCccc Confidence 0000 00000 000000000000 000000 0011222222222 Q ss_pred eccchhhHHHHHHHhhhccCCceeeecccchhHHHHHHHHHHHHh---cCCeEEEEecCCCCCHHHHHHHHHhhcCcccc Q lcl|NC_016163. 
318 LLVKGYSGVLAPEILDKQQYEIDVLLDGNNEVAVKNAMSDLCSEQ---RGDCIAILDCSFQGDAQQTIDYRTGNISMSTY 394 (590) Q Consensus 318 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~---~~~~~a~~d~p~~~~~~~~~~~~~~~~~~~s~ 394 (590) .+...+ +.. ....+...+.++..+.....++.+++.||+.. +++||+++|.|++.+.+++++|++.+ +|. T Consensus 495 tt~~~i-gg~---~~~~~~tgi~aLl~a~~~~~V~~aii~~~e~~~~~~~~r~avid~p~g~t~~~Ai~~r~~f---~S~ 567 (774) T protein:vir:98 495 VTNDDY-VSI---IRTLENQPVHILLVGTTNVGVQQALITEAERASDSDGLRIAVLAAPPRTTPTLAASVTRGF---NST 567 (774) T ss_pred ccchhe-ecc---cccccccceeEEEcCccchhhHHHHHHHHHHhhhcccceEEEEECCCCCCHHHHHHHHhcc---CCc Confidence 211111 111 11122344566667777788888999998753 46799999999999999999999754 799 Q ss_pred eEEEEcCeEEEeecccCceeeecHHHHHHHHHHHhhccCCceECcCCcccceeecc-c--cceeecChhHHhhhhhcCce Q lcl|NC_016163. 395 FTAIFGQHMNVYDEYNGETITVTSTYFLASMIPSNDDQNGIQWTFVGPRRGVISGF-T--DINFYPNEPWKEKLYLAQVN 471 (590) Q Consensus 395 ~~~~~~p~~~~~d~~~~~~~~~ppsg~~AG~~A~~D~~~G~~~sPan~~~~~i~g~-~--~~~~~~~~~e~~~Ln~~gIn 471 (590) |+++||||++++|+.+++.+++||||++||+|||+| +||||+|+.+.++.|. + .+....++.|++.||+++|| T Consensus 568 ~aal~~Pwvkv~D~~~g~~~~vPpSg~vAGl~ArtD----v~kSPANk~I~Givg~ai~~~l~~~~t~ae~d~Ln~~gIN 643 (774) T protein:vir:98 568 RAVMVAGWFTYAGQPNSSRYGVPGAAVYAGKLAAID----FFVSPAARSLVGPLFNIIESDTDNYTSRSNQDIYSAARLE 643 (774) T ss_pred eEEEEeCcEEEeccCCCceeecChhHHHHHHHHhcC----cccccCCceeecceeccccccccccccchhhhhhcccccc Confidence 999999999999999999999999999999999999 8999999764333332 1 23344578899999999999 Q ss_pred EEE-EecCCeEEEecceecCCCcccceehhhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhCCceEE Q lcl|NC_016163. 
472 YIE-RDPKKISFATQLTSQTSRSALSYINNVRVLLRIRREVEKMMADYRQEFQDNTTYDSMSYSLNNYLQQWVANRACSS 550 (590) Q Consensus 472 ~i~-~~~~~G~~~wG~rT~s~d~~~~~i~vrR~~~~i~~si~~~~~~~vfepn~~~l~~~v~~~i~~~L~~l~~~ga~~~ 550 (590) |++ .++++|+++||+||++.|++||||+|||||+||+++|.++++|+||||||+.||++|+++|+.||++||++|+|.| T Consensus 644 ~i~itt~g~G~rvWG~RTlssDp~wr~InVRRlfd~Ie~SI~~~~~~~VfEPNd~~l~~~I~~sI~~fL~~L~~~GaL~G 723 (774) T protein:vir:98 644 VLSLDTVDRTYRFASGVTLSTDPAWERIYLRRVHDVVRQGAHAILRNYVAMPNSRLVRNQIAAALNAFMGELKRNGNIVS 723 (774) T ss_pred eeEEEEcCCcEEEEcccccCCCcccceEeehhhHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceec Confidence 998 6889999999999999999999999999999999999999999999999999999999999999999999999876 Q ss_pred ----EecCCCCCHHHhhCCEEEEEEEEEecCccceEEEEEEeeC Q lcl|NC_016163. 551 ----ISGTVYASDYDKQQSIARVKVELVFTGVIERIAIDLVVNK 590 (590) Q Consensus 551 ----~~d~~~nt~~~i~~G~l~~~i~~ap~~paefi~~~~~~~~ 590 (590) +||+++||+++|++|+|+++|+++|++|||||+|||.|+. T Consensus 724 ~~~V~~D~etNt~~dI~~G~l~i~I~vaP~~PAEfIilri~q~t 767 (774) T protein:vir:98 724 FRPAIIDGSNNSTAAYFSRELYVSLQFQPLYSADYIYVTISRDT 767 (774) T ss_pred ceEEEEcCCCCCHHHhhCCEEEEEEEEEecCCcceEEEEEEEee Confidence 6999999999999999999999999999999999999999 No 19 >protein:vir:6079 Length: 396 # NCBI annotation: gpFI # Family: family:all:115 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878220;genbank:gi:33438919;genbank:GeneID:1457754 Probab=100.00 E-value=6e-97 Score=548.10 Aligned_cols=367 Identities=12% Similarity=0.075 Sum_probs=292.8 Q ss_pred CccccCCceEEEEecCCCceecccccceeEEEeecCC-----CCCCccEEecCHHHHHHhcCCccccccccHHHHHHHHH Q lcl|NC_016163. 1 MADYLHPSVSSRIVDNSAVYATAAGNTVLYAAIHSAI-----GRDNAVEFVTTTDEFLFKFGNPNLSKYGQTSYNILNWL 75 (590) Q Consensus 1 Mp~yl~PGVYveEi~s~~~~i~gv~Tsv~~~vg~~~~-----Gp~~~p~~v~s~~e~~~~fG~~~~~~~~~l~~av~~ff 75 (590) |++|+ |||||||+++++++|.+++|++.+|+|++.. .|.++|++|+|+.||...||. . 
+.|.++++.|| T Consensus 1 m~~~~-~Gv~v~e~~~~~~~v~~~~tav~~fvGta~~~~~~~~p~~~p~~v~s~~~~~~~~g~---~--~tl~~a~~~~~ 74 (396) T protein:vir:60 1 MSDYH-HGVQVLEINEGTRVISTVSTAIVGMVCTASDADAEIFPLNKPVLITNVQSAIAKAGK---K--GTLAASLQAIA 74 (396) T ss_pred CCCCC-CCeEEEEcCCCcccccccCceeEEEEecccccccccccCccCeEeechHHHHHhhcC---c--chhHHHHHHHh Confidence 99997 8999999999999999999999999997743 378999999999999999995 2 56999999999 Q ss_pred cCCCc-EEEEEEecCCccccccccccceeecccccccceeeeeeccccccccccceEEEeeccccCCcceeeEeeccccc Q lcl|NC_016163. 76 QSGGT-AYVLRVMPDDAKFANSLISIKTTAAADPAKATVLVTAKAQTTNTASKNAMKTILSGGTAGETPLCFIVPKGRGE 154 (590) Q Consensus 76 ~nGG~-~~vvRv~~~~a~~a~~~~~~~~~~a~~~~~~~~~v~~~~~~~~~a~~~~~~~~~~~~t~~~~~~~~~~~~~~g~ 154 (590) +|||. |||+|+...... ..... T Consensus 75 ~~gg~~~~vv~~~~~~~~---------------------------------------------------------~~~~~ 97 (396) T protein:vir:60 75 DQSKPVTVVVRVEDGTGE---------------------------------------------------------DEETK 97 (396) T ss_pred hccCceEEEEeccccccc---------------------------------------------------------ccccc Confidence 99995 999998321000 00000 Q ss_pred ccccceEEEEeeccccccccccccceeeeeecccCCCceeeeeeeeeccccccccccccceeeeeeccccceeeecCccc Q lcl|NC_016163. 155 NYNGYGFRLSLRSDYDNTYNFRTYNLSVTVKDSTGADVVVEGPYIVSFDPEAKDKSRQSIYYANIINKYSQYVEIVDNRS 234 (590) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~l~i~v~d~~~~~~v~e~~~~ls~~~da~~~~~~~~~~~~vv~~~s~~v~~~~~~~ 234 (590) . ... . + ..... .+... T Consensus 98 ~--------------~~~----------------------~-------~--------------~~~~~-------~d~~~ 113 (396) T protein:vir:60 98 L--------------AQT----------------------V-------S--------------NIIGT-------TDENG 113 (396) T ss_pred c--------------ccc----------------------c-------c--------------ccccc-------ccccc Confidence 0 000 0 0 00000 00000 Q ss_pred cceeeeeecccccccCccccceecccccccccccccccccccceeecccccccccccccccccceeeeeccccccccccc Q lcl|NC_016163. 
235 AFETISEFVVGDSEADPQKVDIIFGQERAVTPAETIHANVVWKSSSVETDDPSYDATAANFNNIQYLTEGSEGTWTGGNE 314 (590) Q Consensus 235 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 314 (590) .. .+...+. T Consensus 114 ------------------------------------------~~-----------------tg~~al~------------ 122 (396) T protein:vir:60 114 ------------------------------------------QY-----------------TGLKALL------------ 122 (396) T ss_pred ------------------------------------------cc-----------------cchhhhh------------ Confidence 00 0000000 Q ss_pred cceeccchhhHHHHHHHhhhccCCceeee-cccchhHHHHHHHHHHHHhcCCeEEEEecCCCCCHHHHHHHHHhhcCccc Q lcl|NC_016163. 315 ESALLVKGYSGVLAPEILDKQQYEIDVLL-DGNNEVAVKNAMSDLCSEQRGDCIAILDCSFQGDAQQTIDYRTGNISMST 393 (590) Q Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~a~~~~~~~~~~~~~a~~d~p~~~~~~~~~~~~~~~~~~~s 393 (590) .........+.+++ ++.....++.+++.+|++++ +++++|+|.+.+++++.+||+.+ +| T Consensus 123 ---------------~~~~~~~~~~~il~ap~~~~~~v~~al~~~~~~~~--~~~i~d~p~~~~~~~a~~~~~~~---~s 182 (396) T protein:vir:60 123 ---------------AAESVTGVKPRILGVPGLDTKEVAVALASVCQKLR--AFGYISAWGCKTISEVKAYRQNF---SQ 182 (396) T ss_pred ---------------hcccceeeeeeeccccccccHHHHHHHHHHhccCC--eEEEEeCCCCCCHHHHHHHHhhc---CC Confidence 00000001112222 23345578888889987654 89999999999999999999754 78 Q ss_pred ceEEEEcCeEEEeecccCceeeecHHHHHHHHHHHhhccCCceECcCCcccceeeccccceeec------ChhHHhhhhh Q lcl|NC_016163. 394 YFTAIFGQHMNVYDEYNGETITVTSTYFLASMIPSNDDQNGIQWTFVGPRRGVISGFTDINFYP------NEPWKEKLYL 467 (590) Q Consensus 394 ~~~~~~~p~~~~~d~~~~~~~~~ppsg~~AG~~A~~D~~~G~~~sPan~~~~~i~g~~~~~~~~------~~~e~~~Ln~ 467 (590) .|+++||||++++|+.++..+++|||+++||+|||+|.++|+||||||+. 
+.|+.++...+ ++.|+++||+ T Consensus 183 ~~~~~~~p~~~~~d~~~~~~~~~p~s~~~AG~~a~~d~~~g~~~spaN~~---l~gi~~~~~~~~~~~~~~~~~~~~Ln~ 259 (396) T protein:vir:60 183 RELMVIWPDFLAWDTVASTTATAYATARALGLRAKIDQEQGWHKTLSNVG---VNGVTGISASVFWDLQESGTDADLLNE 259 (396) T ss_pred ceEEEEeCceeeecccCCceeEEchhHHHHHHHHHhhhccCcEeCcCCce---ecceeeceeecccccCCCcchhhhhhh Confidence 99999999999999999999999999999999999999999999999976 44555444333 4678999999 Q ss_pred cCceEEEEecCCeEEEecceecCCCcccceehhhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhCCc Q lcl|NC_016163. 468 AQVNYIERDPKKISFATQLTSQTSRSALSYINNVRVLLRIRREVEKMMADYRQEFQDNTTYDSMSYSLNNYLQQWVANRA 547 (590) Q Consensus 468 ~gIn~i~~~~~~G~~~wG~rT~s~d~~~~~i~vrR~~~~i~~si~~~~~~~vfepn~~~l~~~v~~~i~~~L~~l~~~ga 547 (590) +||||++ +++|+++||+||++.|++|+||+|||||+||+++|+++++|+||||||+.||++|+++|+.||++||++|+ T Consensus 260 ~gI~~~~--~~~G~~~wG~rT~~~d~~~~~i~~rR~~~~i~~~i~~~~~~~v~e~n~~~~~~~i~~~i~~~l~~l~~~ga 337 (396) T protein:vir:60 260 SGVTTLI--RRDGFRFWGNRTCSDDPLFLFENYTRTAQVLADTMAEAHMWAVDKPITATLIRDIVDGINAKFRELKTNGY 337 (396) T ss_pred cCcEEEE--cCCCEEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCc Confidence 9999995 47899999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eEE---EecCCCCCHHHhhCCEEEEEEEEEecCccceEEEEEEeeC Q lcl|NC_016163. 548 CSS---ISGTVYASDYDKQQSIARVKVELVFTGVIERIAIDLVVNK 590 (590) Q Consensus 548 ~~~---~~d~~~nt~~~i~~G~l~~~i~~ap~~paefi~~~~~~~~ 590 (590) |.| +||+++||+++|++|+|+++|+++|++|||||+|++.++. 
T Consensus 338 l~g~~~~~d~~~nt~~~i~~G~~~~~i~~~p~~pae~I~~~~~~~~ 383 (396) T protein:vir:60 338 IVDATCWFSEESNDAETLKAGKLYIDYDYTPVPPLENLTLRQRITD 383 (396) T ss_pred eeceEEEEecCCCCHHHhhCCEEEEEEEEEecCCcceEEEEEEEch Confidence 987 6999999999999999999999999999999999999999 No 20 >protein:vir:103993 Length: 390 # NCBI annotation: phage tail sheath protein # Family: family:all:115 # MgeID: mge:1665 # MgeName: phi52237 # Cross-refs: genbank:acc:YP_293729;genbank:gi:72537699;genbank:GeneID:3608123 Probab=100.00 E-value=2.2e-96 Score=545.00 Aligned_cols=362 Identities=12% Similarity=0.054 Sum_probs=290.4 Q ss_pred CccccCCceEEEEecCCCceecccccceeEEEeecCCC-----CCCccEEecCHHHHHHhcCCccccccccHHHHHHHHH Q lcl|NC_016163. 1 MADYLHPSVSSRIVDNSAVYATAAGNTVLYAAIHSAIG-----RDNAVEFVTTTDEFLFKFGNPNLSKYGQTSYNILNWL 75 (590) Q Consensus 1 Mp~yl~PGVYveEi~s~~~~i~gv~Tsv~~~vg~~~~G-----p~~~p~~v~s~~e~~~~fG~~~~~~~~~l~~av~~ff 75 (590) ||+|++|||||+|++.++++|+.++|++..|+|.+..+ |+++|++|+|+.+|...||. + +.|.++++.|| T Consensus 1 M~~~~~~Gv~v~e~~~g~~~i~~~~tav~g~vg~a~~ad~~~~pln~pv~i~s~~~~~~~~g~---~--gtL~~al~~~~ 75 (390) T protein:vir:10 1 MPQDYHHGVRVIEINEGGRPIRSVSTAVLGVVCTAADADASAFPLNTPVLLTNVVAALGKAGK---K--GTLRRTLDAIG 75 (390) T ss_pred CcccccCCeEEEEcCCCcccccccCcceeEEEEcccCcCccccccccceEeccHHHHHhhcCC---C--ceehhhhhhhc Confidence 99999999999999999999999999999999987554 89999999999999999995 2 56899999999 Q ss_pred cCCCc-EEEEEEecCCccccccccccceeecccccccceeeeeeccccccccccceEEEeeccccCCcceeeEeeccccc Q lcl|NC_016163. 76 QSGGT-AYVLRVMPDDAKFANSLISIKTTAAADPAKATVLVTAKAQTTNTASKNAMKTILSGGTAGETPLCFIVPKGRGE 154 (590) Q Consensus 76 ~nGG~-~~vvRv~~~~a~~a~~~~~~~~~~a~~~~~~~~~v~~~~~~~~~a~~~~~~~~~~~~t~~~~~~~~~~~~~~g~ 154 (590) .|||. |||||+...+-..+ + . ... .+. 
T Consensus 76 ~~gg~~~~vv~v~~~~~~~~-~----------------------------------~-------------~~~----ig~ 103 (390) T protein:vir:10 76 KQTKPLTVVVRVAEGKDADE-T----------------------------------T-------------SNV----IGT 103 (390) T ss_pred cccCceEEEEEecccccccc-c----------------------------------c-------------ccc----ccc Confidence 99995 99999842100000 0 0 000 000 Q ss_pred ccccceEEEEeeccccccccccccceeeeeecccCCCceeeeeeeeeccccccccccccceeeeeeccccceeeecCccc Q lcl|NC_016163. 155 NYNGYGFRLSLRSDYDNTYNFRTYNLSVTVKDSTGADVVVEGPYIVSFDPEAKDKSRQSIYYANIINKYSQYVEIVDNRS 234 (590) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~l~i~v~d~~~~~~v~e~~~~ls~~~da~~~~~~~~~~~~vv~~~s~~v~~~~~~~ 234 (590) . +..+..+..+. +.. T Consensus 104 ~------------------------------~~~~~~tg~~a-----------------------l~~------------ 118 (390) T protein:vir:10 104 V------------------------------TPDGKYTGIKA-----------------------LLA------------ 118 (390) T ss_pred c------------------------------ccccccchhhh-----------------------hhh------------ Confidence 0 00000000000 000 Q ss_pred cceeeeeecccccccCccccceecccccccccccccccccccceeecccccccccccccccccceeeeeccccccccccc Q lcl|NC_016163. 235 AFETISEFVVGDSEADPQKVDIIFGQERAVTPAETIHANVVWKSSSVETDDPSYDATAANFNNIQYLTEGSEGTWTGGNE 314 (590) Q Consensus 235 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 314 (590) T Consensus 119 -------------------------------------------------------------------------------- 118 (390) T protein:vir:10 119 -------------------------------------------------------------------------------- 118 (390) T ss_pred -------------------------------------------------------------------------------- Confidence Q ss_pred cceeccchhhHHHHHHHhhhccCCcee-eecccchhHHHHHHHHHHHHhcCCeEEEEecCCCCCHHHHHHHHHhhcCccc Q lcl|NC_016163. 
315 ESALLVKGYSGVLAPEILDKQQYEIDV-LLDGNNEVAVKNAMSDLCSEQRGDCIAILDCSFQGDAQQTIDYRTGNISMST 393 (590) Q Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~a~~~~~~~~~~~~~a~~d~p~~~~~~~~~~~~~~~~~~~s 393 (590) ........+.+ +.++....+++.+++.+|++++ +++++|+|.+.+.+++.+||+++ +| T Consensus 119 ----------------~~~~~~~~p~il~ap~~~~~~v~~~l~~~a~~~~--~~aivD~p~~~t~~~a~~~~~~~---~s 177 (390) T protein:vir:10 119 ----------------AQGALGVKPRILAAPGLDTQPVAAALAATAQSLR--AMAYVSASGCKTKEEAAAYRKQF---GQ 177 (390) T ss_pred ----------------hhhhhcceehhhcccccchHHHHHHHHHhhcccc--eEEEEecCCCCCHHHHHHHhhcc---CC Confidence 00000000000 1111223456777888887654 89999999999999999999754 79 Q ss_pred ceEEEEcCeEEEeecccCceeeecHHHHHHHHHHHhhccCCceECcCCcccceeeccccceeecC------hhHHhhhhh Q lcl|NC_016163. 394 YFTAIFGQHMNVYDEYNGETITVTSTYFLASMIPSNDDQNGIQWTFVGPRRGVISGFTDINFYPN------EPWKEKLYL 467 (590) Q Consensus 394 ~~~~~~~p~~~~~d~~~~~~~~~ppsg~~AG~~A~~D~~~G~~~sPan~~~~~i~g~~~~~~~~~------~~e~~~Ln~ 467 (590) .|+++||||++++|+.++..+++|||+++||+|||+|.++||||||||+. |.|+.++...++ +.|.+.||+ T Consensus 178 ~~~~~~~p~~~~~d~~~~~~~~~p~s~~~Agl~a~~D~~~g~~~spaN~~---l~gi~~~~~~~~~~~~~~~~~~~~ln~ 254 (390) T protein:vir:10 178 REIMVIWPDWLGWDDTTNSTAVIPAPAIAAGLRAKIDNDIGWHKTISNVV---VNGVSGISADVSWDLQDPATDAGYLNE 254 (390) T ss_pred ceEEEEcCceEeecccCCcccccchHHHHHHHHHHhhcCCCcEECcCCce---eeceeecceecccccccccchhhhhhh Confidence 99999999999999999999999999999999999999999999999976 445555444333 456789999 Q ss_pred cCceEEEEecCCeEEEecceecCCCcccceehhhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhCCc Q lcl|NC_016163. 
468 AQVNYIERDPKKISFATQLTSQTSRSALSYINNVRVLLRIRREVEKMMADYRQEFQDNTTYDSMSYSLNNYLQQWVANRA 547 (590) Q Consensus 468 ~gIn~i~~~~~~G~~~wG~rT~s~d~~~~~i~vrR~~~~i~~si~~~~~~~vfepn~~~l~~~v~~~i~~~L~~l~~~ga 547 (590) +|||++++ ++|+++||+||++.|++|+||+|||||+||+++|+++++|+|||||++.||++|+++|+.||++||++|+ T Consensus 255 ~gi~t~~~--~~G~~~wG~rT~s~d~~~~~i~~rR~~~~i~~~i~~~~~~~v~e~n~~~~~~~i~~~i~~~L~~l~~~g~ 332 (390) T protein:vir:10 255 HEVTTLVN--RNGFRFWGERTCSDDPKFAFENYTRTAQVAGDSIAEAQMPVVDGPLNPSLARDIVESINGWFRQQVANGY 332 (390) T ss_pred cCcEEEEc--CCCEEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCc Confidence 99999965 6799999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eEE---EecCCCCCHHHhhCCEEEEEEEEEecCccceEEEEEEeeC Q lcl|NC_016163. 548 CSS---ISGTVYASDYDKQQSIARVKVELVFTGVIERIAIDLVVNK 590 (590) Q Consensus 548 ~~~---~~d~~~nt~~~i~~G~l~~~i~~ap~~paefi~~~~~~~~ 590 (590) |.| +||+++||++||++|+|+++|+++|++|+|||+|++.++. T Consensus 333 l~g~~v~~d~~~nt~~~i~~G~~~~~v~~~p~~pae~I~~~~~~~~ 378 (390) T protein:vir:10 333 LIGGSAWIDPEPNTADILASGKAYIDYDYTPVPPLENLVLRQRITD 378 (390) T ss_pred eeeeEEEEccCCCCHHHhhCCeEEEEEEEEecCCcceEEEEEEEch Confidence 976 5999999999999999999999999999999999999999 No 21 >protein:vir:78206 Length: 390 # NCBI annotation: gp33, phage tail sheath protein # Family: family:all:115 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111183;genbank:gi:134288694;genbank:GeneID:4960666 Probab=100.00 E-value=2.2e-96 Score=545.00 Aligned_cols=362 Identities=12% Similarity=0.054 Sum_probs=290.4 Q ss_pred CccccCCceEEEEecCCCceecccccceeEEEeecCCC-----CCCccEEecCHHHHHHhcCCccccccccHHHHHHHHH Q lcl|NC_016163. 1 MADYLHPSVSSRIVDNSAVYATAAGNTVLYAAIHSAIG-----RDNAVEFVTTTDEFLFKFGNPNLSKYGQTSYNILNWL 75 (590) Q Consensus 1 Mp~yl~PGVYveEi~s~~~~i~gv~Tsv~~~vg~~~~G-----p~~~p~~v~s~~e~~~~fG~~~~~~~~~l~~av~~ff 75 (590) ||+|++|||||+|++.++++|+.++|++..|+|.+..+ |+++|++|+|+.+|...||. 
+ +.|.++++.|| T Consensus 1 M~~~~~~Gv~v~e~~~g~~~i~~~~tav~g~vg~a~~ad~~~~pln~pv~i~s~~~~~~~~g~---~--gtL~~al~~~~ 75 (390) T protein:vir:78 1 MPQDYHHGVRVIEINEGGRPIRSVSTAVLGVVCTAADADASAFPLNTPVLLTNVVAALGKAGK---K--GTLRRTLDAIG 75 (390) T ss_pred CcccccCCeEEEEcCCCcccccccCcceeEEEEcccCcCccccccccceEeccHHHHHhhcCC---C--ceehhhhhhhc Confidence 99999999999999999999999999999999987554 89999999999999999995 2 56899999999 Q ss_pred cCCCc-EEEEEEecCCccccccccccceeecccccccceeeeeeccccccccccceEEEeeccccCCcceeeEeeccccc Q lcl|NC_016163. 76 QSGGT-AYVLRVMPDDAKFANSLISIKTTAAADPAKATVLVTAKAQTTNTASKNAMKTILSGGTAGETPLCFIVPKGRGE 154 (590) Q Consensus 76 ~nGG~-~~vvRv~~~~a~~a~~~~~~~~~~a~~~~~~~~~v~~~~~~~~~a~~~~~~~~~~~~t~~~~~~~~~~~~~~g~ 154 (590) .|||. |||||+...+-..+ + . ... .+. T Consensus 76 ~~gg~~~~vv~v~~~~~~~~-~----------------------------------~-------------~~~----ig~ 103 (390) T protein:vir:78 76 KQTKPLTVVVRVAEGKDADE-T----------------------------------T-------------SNV----IGT 103 (390) T ss_pred cccCceEEEEEecccccccc-c----------------------------------c-------------ccc----ccc Confidence 99995 99999842100000 0 0 000 000 Q ss_pred ccccceEEEEeeccccccccccccceeeeeecccCCCceeeeeeeeeccccccccccccceeeeeeccccceeeecCccc Q lcl|NC_016163. 155 NYNGYGFRLSLRSDYDNTYNFRTYNLSVTVKDSTGADVVVEGPYIVSFDPEAKDKSRQSIYYANIINKYSQYVEIVDNRS 234 (590) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~l~i~v~d~~~~~~v~e~~~~ls~~~da~~~~~~~~~~~~vv~~~s~~v~~~~~~~ 234 (590) . +..+..+..+. +.. T Consensus 104 ~------------------------------~~~~~~tg~~a-----------------------l~~------------ 118 (390) T protein:vir:78 104 V------------------------------TPDGKYTGIKA-----------------------LLA------------ 118 (390) T ss_pred c------------------------------ccccccchhhh-----------------------hhh------------ Confidence 0 00000000000 000 Q ss_pred cceeeeeecccccccCccccceecccccccccccccccccccceeecccccccccccccccccceeeeeccccccccccc Q lcl|NC_016163. 
235 AFETISEFVVGDSEADPQKVDIIFGQERAVTPAETIHANVVWKSSSVETDDPSYDATAANFNNIQYLTEGSEGTWTGGNE 314 (590) Q Consensus 235 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 314 (590) T Consensus 119 -------------------------------------------------------------------------------- 118 (390) T protein:vir:78 119 -------------------------------------------------------------------------------- 118 (390) T ss_pred -------------------------------------------------------------------------------- Confidence Q ss_pred cceeccchhhHHHHHHHhhhccCCcee-eecccchhHHHHHHHHHHHHhcCCeEEEEecCCCCCHHHHHHHHHhhcCccc Q lcl|NC_016163. 315 ESALLVKGYSGVLAPEILDKQQYEIDV-LLDGNNEVAVKNAMSDLCSEQRGDCIAILDCSFQGDAQQTIDYRTGNISMST 393 (590) Q Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~a~~~~~~~~~~~~~a~~d~p~~~~~~~~~~~~~~~~~~~s 393 (590) ........+.+ +.++....+++.+++.+|++++ +++++|+|.+.+.+++.+||+++ +| T Consensus 119 ----------------~~~~~~~~p~il~ap~~~~~~v~~~l~~~a~~~~--~~aivD~p~~~t~~~a~~~~~~~---~s 177 (390) T protein:vir:78 119 ----------------AQGALGVKPRILAAPGLDTQPVAAALAATAQSLR--AMAYVSASGCKTKEEAAAYRKQF---GQ 177 (390) T ss_pred ----------------hhhhhcceehhhcccccchHHHHHHHHHhhcccc--eEEEEecCCCCCHHHHHHHhhcc---CC Confidence 00000000000 1111223456777888887654 89999999999999999999754 79 Q ss_pred ceEEEEcCeEEEeecccCceeeecHHHHHHHHHHHhhccCCceECcCCcccceeeccccceeecC------hhHHhhhhh Q lcl|NC_016163. 394 YFTAIFGQHMNVYDEYNGETITVTSTYFLASMIPSNDDQNGIQWTFVGPRRGVISGFTDINFYPN------EPWKEKLYL 467 (590) Q Consensus 394 ~~~~~~~p~~~~~d~~~~~~~~~ppsg~~AG~~A~~D~~~G~~~sPan~~~~~i~g~~~~~~~~~------~~e~~~Ln~ 467 (590) .|+++||||++++|+.++..+++|||+++||+|||+|.++||||||||+. 
|.|+.++...++ +.|.+.||+ T Consensus 178 ~~~~~~~p~~~~~d~~~~~~~~~p~s~~~Agl~a~~D~~~g~~~spaN~~---l~gi~~~~~~~~~~~~~~~~~~~~ln~ 254 (390) T protein:vir:78 178 REIMVIWPDWLGWDDTTNSTAVIPAPAIAAGLRAKIDNDIGWHKTISNVV---VNGVSGISADVSWDLQDPATDAGYLNE 254 (390) T ss_pred ceEEEEcCceEeecccCCcccccchHHHHHHHHHHhhcCCCcEECcCCce---eeceeecceecccccccccchhhhhhh Confidence 99999999999999999999999999999999999999999999999976 445555444333 456789999 Q ss_pred cCceEEEEecCCeEEEecceecCCCcccceehhhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhCCc Q lcl|NC_016163. 468 AQVNYIERDPKKISFATQLTSQTSRSALSYINNVRVLLRIRREVEKMMADYRQEFQDNTTYDSMSYSLNNYLQQWVANRA 547 (590) Q Consensus 468 ~gIn~i~~~~~~G~~~wG~rT~s~d~~~~~i~vrR~~~~i~~si~~~~~~~vfepn~~~l~~~v~~~i~~~L~~l~~~ga 547 (590) +|||++++ ++|+++||+||++.|++|+||+|||||+||+++|+++++|+|||||++.||++|+++|+.||++||++|+ T Consensus 255 ~gi~t~~~--~~G~~~wG~rT~s~d~~~~~i~~rR~~~~i~~~i~~~~~~~v~e~n~~~~~~~i~~~i~~~L~~l~~~g~ 332 (390) T protein:vir:78 255 HEVTTLVN--RNGFRFWGERTCSDDPKFAFENYTRTAQVAGDSIAEAQMPVVDGPLNPSLARDIVESINGWFRQQVANGY 332 (390) T ss_pred cCcEEEEc--CCCEEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCc Confidence 99999965 6799999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eEE---EecCCCCCHHHhhCCEEEEEEEEEecCccceEEEEEEeeC Q lcl|NC_016163. 548 CSS---ISGTVYASDYDKQQSIARVKVELVFTGVIERIAIDLVVNK 590 (590) Q Consensus 548 ~~~---~~d~~~nt~~~i~~G~l~~~i~~ap~~paefi~~~~~~~~ 590 (590) |.| +||+++||++||++|+|+++|+++|++|+|||+|++.++. 
T Consensus 333 l~g~~v~~d~~~nt~~~i~~G~~~~~v~~~p~~pae~I~~~~~~~~ 378 (390) T protein:vir:78 333 LIGGSAWIDPEPNTADILASGKAYIDYDYTPVPPLENLVLRQRITD 378 (390) T ss_pred eeeeEEEEccCCCCHHHhhCCeEEEEEEEEecCCcceEEEEEEEch Confidence 976 5999999999999999999999999999999999999999 No 22 >protein:vir:79181 Length: 390 # NCBI annotation: gp30, phage tail sheath protein # Family: family:all:115 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111061;genbank:gi:134288772;genbank:GeneID:4960700 Probab=100.00 E-value=3.4e-96 Score=544.01 Aligned_cols=362 Identities=12% Similarity=0.060 Sum_probs=290.8 Q ss_pred CccccCCceEEEEecCCCceecccccceeEEEeecCCC-----CCCccEEecCHHHHHHhcCCccccccccHHHHHHHHH Q lcl|NC_016163. 1 MADYLHPSVSSRIVDNSAVYATAAGNTVLYAAIHSAIG-----RDNAVEFVTTTDEFLFKFGNPNLSKYGQTSYNILNWL 75 (590) Q Consensus 1 Mp~yl~PGVYveEi~s~~~~i~gv~Tsv~~~vg~~~~G-----p~~~p~~v~s~~e~~~~fG~~~~~~~~~l~~av~~ff 75 (590) ||+|++|||||||+++++++|+.++|++..|++.+..+ |+++|++|+|+.+|.+.||. + +.|.++++.|| T Consensus 1 M~~~~~~Gv~v~e~~~~~~~i~~~~tav~~~vg~a~dad~~~~p~n~pv~its~~~~~~~~g~---~--~tL~~al~~~~ 75 (390) T protein:vir:79 1 MPQDYHHGVRVIEINEGGRPIRSVSTAVLGVVCTAADADASAFPLNTPVLLTNVVAALGKAGK---K--GTLRRTLDAIG 75 (390) T ss_pred CccccCCCeEEEEcCCCcccccccCCceeEEEEecCCCCccccccccceEeecHHHHHHhcCC---C--ccchhhhhhhc Confidence 99999999999999999999999999999999988665 89999999999999999995 2 45889999999 Q ss_pred cCCCc-EEEEEEecCCccccccccccceeecccccccceeeeeeccccccccccceEEEeeccccCCcceeeEeeccccc Q lcl|NC_016163. 76 QSGGT-AYVLRVMPDDAKFANSLISIKTTAAADPAKATVLVTAKAQTTNTASKNAMKTILSGGTAGETPLCFIVPKGRGE 154 (590) Q Consensus 76 ~nGG~-~~vvRv~~~~a~~a~~~~~~~~~~a~~~~~~~~~v~~~~~~~~~a~~~~~~~~~~~~t~~~~~~~~~~~~~~g~ 154 (590) .|||. |||||+...... .. + .... 
T Consensus 76 ~~~~~~~~vv~v~~~~~~-----------------~~--------------------------~----~~~~-------- 100 (390) T protein:vir:79 76 KQTKPLTVVVRVAEGKDA-----------------DE--------------------------T----TSNV-------- 100 (390) T ss_pred ccccceEEEEeecccccc-----------------cc--------------------------c----ccee-------- Confidence 99995 999998321000 00 0 0000 Q ss_pred ccccceEEEEeeccccccccccccceeeeeecccCCCceeeeeeeeeccccccccccccceeeeeeccccceeeecCccc Q lcl|NC_016163. 155 NYNGYGFRLSLRSDYDNTYNFRTYNLSVTVKDSTGADVVVEGPYIVSFDPEAKDKSRQSIYYANIINKYSQYVEIVDNRS 234 (590) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~l~i~v~d~~~~~~v~e~~~~ls~~~da~~~~~~~~~~~~vv~~~s~~v~~~~~~~ 234 (590) +. . .+..+..+..+ .+ T Consensus 101 --------ig---~----------------~~~~~~~tgl~-----------------------al-------------- 116 (390) T protein:vir:79 101 --------IG---T----------------VTPDGKYTGIK-----------------------AL-------------- 116 (390) T ss_pred --------ee---c----------------ccccccchhhh-----------------------hh-------------- Confidence 00 0 00000000000 00 Q ss_pred cceeeeeecccccccCccccceecccccccccccccccccccceeecccccccccccccccccceeeeeccccccccccc Q lcl|NC_016163. 235 AFETISEFVVGDSEADPQKVDIIFGQERAVTPAETIHANVVWKSSSVETDDPSYDATAANFNNIQYLTEGSEGTWTGGNE 314 (590) Q Consensus 235 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 314 (590) T Consensus 117 -------------------------------------------------------------------------------- 116 (390) T protein:vir:79 117 -------------------------------------------------------------------------------- 116 (390) T ss_pred -------------------------------------------------------------------------------- Confidence Q ss_pred cceeccchhhHHHHHHHhhhccCCceeeec-ccchhHHHHHHHHHHHHhcCCeEEEEecCCCCCHHHHHHHHHhhcCccc Q lcl|NC_016163. 315 ESALLVKGYSGVLAPEILDKQQYEIDVLLD-GNNEVAVKNAMSDLCSEQRGDCIAILDCSFQGDAQQTIDYRTGNISMST 393 (590) Q Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~a~~~~~~~~~~~~~a~~d~p~~~~~~~~~~~~~~~~~~~s 393 (590) ..........+.++++ +....+++.++..+|++. 
++|+++|+|.+.+.+++.+||+. ++| T Consensus 117 --------------~~~~~~~~~~p~il~ap~~~~~~v~~~l~~~a~~~--~~~ai~D~p~~~t~~~a~~~~~~---~~s 177 (390) T protein:vir:79 117 --------------LAAQGALGVKPRILAAPGLDTQPVAAALAATAQSL--RAMAYVSASGCKTKEEAAAYRRQ---FGQ 177 (390) T ss_pred --------------hhhhhhhccccccccCCcccchHHHHHHHHhhhhc--ceEEEEEccCCCCHHHHHHHhcC---CCC Confidence 0000000001111121 222345666777777654 49999999999999999999975 489 Q ss_pred ceEEEEcCeEEEeecccCceeeecHHHHHHHHHHHhhccCCceECcCCcccceeeccccceeecC------hhHHhhhhh Q lcl|NC_016163. 394 YFTAIFGQHMNVYDEYNGETITVTSTYFLASMIPSNDDQNGIQWTFVGPRRGVISGFTDINFYPN------EPWKEKLYL 467 (590) Q Consensus 394 ~~~~~~~p~~~~~d~~~~~~~~~ppsg~~AG~~A~~D~~~G~~~sPan~~~~~i~g~~~~~~~~~------~~e~~~Ln~ 467 (590) .|+++||||++++|+.++..+++|||+++||+|||+|.++||||||||+. |.|+.++...++ +.|+++||+ T Consensus 178 ~~~~~~~p~~~~~d~~~~~~~~~p~s~~~Ag~~a~~D~~~g~~~spsN~~---i~gi~~~~~~~~~~~~~~~~~a~~Ln~ 254 (390) T protein:vir:79 178 REIMVIWPDWLGWDDTTNSTAVIPAPAIAAGLRAKIDNDIGWHKTISNVV---VNGVSGISADVSWDLQDPATDAGYLNE 254 (390) T ss_pred ceEEEEcCceeecccccCceeEeehHHHHHHHHHhhhccCCcEEccCCce---eeccceeeeeccccccccchhhhhhhh Confidence 99999999999999999999999999999999999999999999999976 555655554433 346789999 Q ss_pred cCceEEEEecCCeEEEecceecCCCcccceehhhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhCCc Q lcl|NC_016163. 
468 AQVNYIERDPKKISFATQLTSQTSRSALSYINNVRVLLRIRREVEKMMADYRQEFQDNTTYDSMSYSLNNYLQQWVANRA 547 (590) Q Consensus 468 ~gIn~i~~~~~~G~~~wG~rT~s~d~~~~~i~vrR~~~~i~~si~~~~~~~vfepn~~~l~~~v~~~i~~~L~~l~~~ga 547 (590) +|||++++ ++|+++||+||++.|++|+||+|||||+||+++|+++++|+|||||++.||++|+++|+.||++||++|+ T Consensus 255 ~gi~t~~~--~~G~~~wG~rT~~~d~~~~~i~vrR~~~~i~~~i~~~~~~~v~e~~~~~~~~~i~~~i~~~L~~l~~~ga 332 (390) T protein:vir:79 255 HEVTTLVN--RNGFRFWGERTCSDDPKFAFENYTRTAQVAADSIAEAQMPVVDGPLNPSLARDIVESINGWFRQQVANGY 332 (390) T ss_pred cCcEEEEc--CCCEEEEeccccCCCcccceeeehhhHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCc Confidence 99999965 7899999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eEE---EecCCCCCHHHhhCCEEEEEEEEEecCccceEEEEEEeeC Q lcl|NC_016163. 548 CSS---ISGTVYASDYDKQQSIARVKVELVFTGVIERIAIDLVVNK 590 (590) Q Consensus 548 ~~~---~~d~~~nt~~~i~~G~l~~~i~~ap~~paefi~~~~~~~~ 590 (590) |.| +||+++||++||++|+|+++|+++|++|+|||+|++..+. T Consensus 333 l~g~~v~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~ 378 (390) T protein:vir:79 333 LIGGSAWIDPEPNTADILASGKAYIDYDYTPVPPLENLVLRQRITD 378 (390) T ss_pred eeeeEEEEecCCCCHHHhhCCEEEEEEEEEecCCcceEEEEEEEch Confidence 877 5999999999999999999999999999999999999999 No 23 >protein:vir:79141 Length: 391 # NCBI annotation: major tail seath protein gpFI # Family: family:all:115 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165276;genbank:gi:145708101;genbank:GeneID:5247152 Probab=100.00 E-value=3.2e-96 Score=544.09 Aligned_cols=362 Identities=11% Similarity=0.052 Sum_probs=292.4 Q ss_pred CccccCCceEEEEecCCCceecccccceeEEEeecC-----CCCCCccEEecCHHHHHHhcCCccccccccHHHHHHHHH Q lcl|NC_016163. 1 MADYLHPSVSSRIVDNSAVYATAAGNTVLYAAIHSA-----IGRDNAVEFVTTTDEFLFKFGNPNLSKYGQTSYNILNWL 75 (590) Q Consensus 1 Mp~yl~PGVYveEi~s~~~~i~gv~Tsv~~~vg~~~-----~Gp~~~p~~v~s~~e~~~~fG~~~~~~~~~l~~av~~ff 75 (590) ||+|.+|||||+|+++++++|.+++|.+.+|+|++. .+|+++|++|+|+.||...||. 
+ +.|.++++.|| T Consensus 1 M~~~~~pGv~v~e~~~~~~~i~~~~tav~~~vg~a~~a~~~~~p~n~pv~iss~~~~~~~~g~---~--gtl~~al~~~~ 75 (391) T protein:vir:79 1 MPTDYHHGVRVVELNDGTRPIRTIETAVAGIVCTADDADAATFPLDTPVLLTNPQAYIGKAGD---K--GTLAHTLDAIT 75 (391) T ss_pred CCCCCCCCeEEEECCCCcccccccCCceEEEEeecccccccccccccCEEeccHHHHHHhcCC---c--cccchhhhhhh Confidence 999999999999999999999999999999998764 6899999999999999999995 2 56899999999 Q ss_pred cCCC-cEEEEEEecCCccccccccccceeecccccccceeeeeeccccccccccceEEEeeccccCCcceeeEeeccccc Q lcl|NC_016163. 76 QSGG-TAYVLRVMPDDAKFANSLISIKTTAAADPAKATVLVTAKAQTTNTASKNAMKTILSGGTAGETPLCFIVPKGRGE 154 (590) Q Consensus 76 ~nGG-~~~vvRv~~~~a~~a~~~~~~~~~~a~~~~~~~~~v~~~~~~~~~a~~~~~~~~~~~~t~~~~~~~~~~~~~~g~ 154 (590) +||| .||+|++........ . . ....+. T Consensus 76 ~~gg~~~~vv~~~~~~~~~~--------------------------~----~----------------------~~~~g~ 103 (391) T protein:vir:79 76 DQTNPLTVVVRVAGGASEAE--------------------------T----T----------------------SNLIGT 103 (391) T ss_pred cccccceeeecccccccccc--------------------------c----c----------------------cccccc Confidence 9999 599999732110000 0 0 000000 Q ss_pred ccccceEEEEeeccccccccccccceeeeeecccCCCceeeeeeeeeccccccccccccceeeeeeccccceeeecCccc Q lcl|NC_016163. 155 NYNGYGFRLSLRSDYDNTYNFRTYNLSVTVKDSTGADVVVEGPYIVSFDPEAKDKSRQSIYYANIINKYSQYVEIVDNRS 234 (590) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~l~i~v~d~~~~~~v~e~~~~ls~~~da~~~~~~~~~~~~vv~~~s~~v~~~~~~~ 234 (590) .+.++..+..+. +.. T Consensus 104 ------------------------------~~~~~~~tGl~~-----------------------l~~------------ 118 (391) T protein:vir:79 104 ------------------------------TNAAGRYTGMKA-----------------------LLT------------ 118 (391) T ss_pred ------------------------------ccchhhhHHHhh-----------------------hhh------------ Confidence 000000000000 000 Q ss_pred cceeeeeecccccccCccccceecccccccccccccccccccceeecccccccccccccccccceeeeeccccccccccc Q lcl|NC_016163. 
235 AFETISEFVVGDSEADPQKVDIIFGQERAVTPAETIHANVVWKSSSVETDDPSYDATAANFNNIQYLTEGSEGTWTGGNE 314 (590) Q Consensus 235 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 314 (590) T Consensus 119 -------------------------------------------------------------------------------- 118 (391) T protein:vir:79 119 -------------------------------------------------------------------------------- 118 (391) T ss_pred -------------------------------------------------------------------------------- Confidence Q ss_pred cceeccchhhHHHHHHHhhhccCCce-eeecccchhHHHHHHHHHHHHhcCCeEEEEecCCCCCHHHHHHHHHhhcCccc Q lcl|NC_016163. 315 ESALLVKGYSGVLAPEILDKQQYEID-VLLDGNNEVAVKNAMSDLCSEQRGDCIAILDCSFQGDAQQTIDYRTGNISMST 393 (590) Q Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~a~~~~~~~~~~~~~a~~d~p~~~~~~~~~~~~~~~~~~~s 393 (590) ........+. ++.++....+++.+++.+|++++ +++++|+|.+.+.+++.+|++.+ +| T Consensus 119 ----------------~~~~~~~~p~~l~~p~~~~~~v~~al~~~~~~~~--~~ai~d~p~~~t~~~a~~~~~~~---~s 177 (391) T protein:vir:79 119 ----------------ARNRFGVAPRILAVPGLDSLPVGTELVTIAQKLR--AFAYLSAYGCQTKEEAVAYRSNF---GQ 177 (391) T ss_pred ----------------hhhhhcccchhhcCCccchhHHHHHHHHHHhhcC--cEEEEECCCCCCHHHHHHHHhcc---CC Confidence 0000000000 01122233467778889997654 89999999999999999999864 78 Q ss_pred ceEEEEcCeEEEeecccCceeeecHHHHHHHHHHHhhccCCceECcCCcccceeeccccceeecC------hhHHhhhhh Q lcl|NC_016163. 394 YFTAIFGQHMNVYDEYNGETITVTSTYFLASMIPSNDDQNGIQWTFVGPRRGVISGFTDINFYPN------EPWKEKLYL 467 (590) Q Consensus 394 ~~~~~~~p~~~~~d~~~~~~~~~ppsg~~AG~~A~~D~~~G~~~sPan~~~~~i~g~~~~~~~~~------~~e~~~Ln~ 467 (590) +|+++||||++++|+.++..+++|||+++||+|||+|.++||||||||+. 
|.|+.+++..++ +.|.+.||+ T Consensus 178 ~~~a~~~P~~~~~d~~~~~~~~~p~s~~~AG~~a~~D~~~g~~~spaN~~---l~gi~~~~~~~~~~~~~~~~~~~~Ln~ 254 (391) T protein:vir:79 178 REAMVMWPDFVGWDTAANAETTLWATARAVGLRAKIDNDTGWHKTLSNVA---VGGVTGLSRDVFWDLQDPATDAGYLNA 254 (391) T ss_pred ceeEEecceeeeecCcCCceeeechHHHHHHHHHHhhhcccceeccCCce---ehhhhccccccccccccccchhhhhhh Confidence 99999999999999999999999999999999999999999999999976 556666555443 346789999 Q ss_pred cCceEEEEecCCeEEEecceecCCCcccceehhhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhCCc Q lcl|NC_016163. 468 AQVNYIERDPKKISFATQLTSQTSRSALSYINNVRVLLRIRREVEKMMADYRQEFQDNTTYDSMSYSLNNYLQQWVANRA 547 (590) Q Consensus 468 ~gIn~i~~~~~~G~~~wG~rT~s~d~~~~~i~vrR~~~~i~~si~~~~~~~vfepn~~~l~~~v~~~i~~~L~~l~~~ga 547 (590) +||||+++ ++|+++||+||++.|++|+||+|||||+||+++|+++++|+|||||++.||++|+++|+.||++||++|+ T Consensus 255 ~~I~t~~~--~~G~~~wG~rT~~~d~~~~~i~~rR~~~~i~~~i~~~~~~~v~epn~~~~~~~i~~~i~~~l~~l~~~g~ 332 (391) T protein:vir:79 255 NEVTTLVH--RDGYRFWGSRTCSADPLFAFENYTRTAQVLADTMAEAHMWANDLPMTPTLVRDLLEGINAKLRMLTRNGY 332 (391) T ss_pred cCceEEEC--CCcEEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCc Confidence 99999964 6899999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eEE---EecCCCCCHHHhhCCEEEEEEEEEecCccceEEEEEEeeC Q lcl|NC_016163. 548 CSS---ISGTVYASDYDKQQSIARVKVELVFTGVIERIAIDLVVNK 590 (590) Q Consensus 548 ~~~---~~d~~~nt~~~i~~G~l~~~i~~ap~~paefi~~~~~~~~ 590 (590) |.| +||+++||+++|++|+|+++|+++|++|+|||+|++.++. 
T Consensus 333 l~g~~v~~~~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~ 378 (391) T protein:vir:79 333 LLGGAAWFDADANSKDTLKAGQLAIDYDYTPVPPLENLTFRQRITD 378 (391) T ss_pred eeceEEEEecCCCCHHHhhCCEEEEEEEEEecCCcceEEEEEEEch Confidence 976 5999999999999999999999999999999999999999 No 24 >protein:vir:5711 Length: 396 # NCBI annotation: gpFI # Family: family:all:115 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839870;genbank:gi:30065725;genbank:GeneID:1260618 Probab=100.00 E-value=9e-96 Score=541.67 Aligned_cols=367 Identities=11% Similarity=0.065 Sum_probs=290.9 Q ss_pred CccccCCceEEEEecCCCceecccccceeEEEeecCCC-----CCCccEEecCHHHHHHhcCCccccccccHHHHHHHHH Q lcl|NC_016163. 1 MADYLHPSVSSRIVDNSAVYATAAGNTVLYAAIHSAIG-----RDNAVEFVTTTDEFLFKFGNPNLSKYGQTSYNILNWL 75 (590) Q Consensus 1 Mp~yl~PGVYveEi~s~~~~i~gv~Tsv~~~vg~~~~G-----p~~~p~~v~s~~e~~~~fG~~~~~~~~~l~~av~~ff 75 (590) |++|+ |||||||+++++++|.+++|++.+|+|.+..+ |.++|++|+|+.||...||. . +.|.+++++|| T Consensus 1 m~~~~-~GV~v~e~~~g~~~i~~v~tav~~~vg~a~~~d~~~~~~~~pv~i~s~~~~~~~~g~---~--~tl~~al~~~~ 74 (396) T protein:vir:57 1 MSDYH-HGVQVLEINDGTRVISTVSTAIVGMVCTASDADAETFPLNKPVLITNVQSAIAKAGK---K--GTLAASLQAIA 74 (396) T ss_pred CCCCC-CceEEEEcCCCcccccccCCceEEEEEeccCCCcccccCccCeEeecchhhhhhccc---c--cchHHHHHHhh Confidence 99998 69999999999999999999999999987655 77899999999999999995 2 56999999999 Q ss_pred cCCCc-EEEEEEecCCccccccccccceeecccccccceeeeeeccccccccccceEEEeeccccCCcceeeEeeccccc Q lcl|NC_016163. 76 QSGGT-AYVLRVMPDDAKFANSLISIKTTAAADPAKATVLVTAKAQTTNTASKNAMKTILSGGTAGETPLCFIVPKGRGE 154 (590) Q Consensus 76 ~nGG~-~~vvRv~~~~a~~a~~~~~~~~~~a~~~~~~~~~v~~~~~~~~~a~~~~~~~~~~~~t~~~~~~~~~~~~~~g~ 154 (590) +|||. |||+|+.......... ...... 
T Consensus 75 ~~~~~~~~vv~~~~~~~~~~~~----------------------------------------------------~~a~t~ 102 (396) T protein:vir:57 75 DQSKPVTVVVRVEDGTGDDEET----------------------------------------------------KLAQTV 102 (396) T ss_pred hcCCceeEeeeccccccccccc----------------------------------------------------cccccc Confidence 99995 9999873211000000 000000 Q ss_pred ccccceEEEEeeccccccccccccceeeeeecccCCCceeeeeeeeeccccccccccccceeeeeeccccceeeecCccc Q lcl|NC_016163. 155 NYNGYGFRLSLRSDYDNTYNFRTYNLSVTVKDSTGADVVVEGPYIVSFDPEAKDKSRQSIYYANIINKYSQYVEIVDNRS 234 (590) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~l~i~v~d~~~~~~v~e~~~~ls~~~da~~~~~~~~~~~~vv~~~s~~v~~~~~~~ 234 (590) . . .+. ..+..+..+ . +. T Consensus 103 ~--~---iiG-------------------~~~~~~~~t------g--------------------l~------------- 119 (396) T protein:vir:57 103 S--N---IIG-------------------TTDENGQYT------G--------------------LK------------- 119 (396) T ss_pred e--e---eee-------------------eccccccch------h--------------------hh------------- Confidence 0 0 000 000000000 0 00 Q ss_pred cceeeeeecccccccCccccceecccccccccccccccccccceeecccccccccccccccccceeeeeccccccccccc Q lcl|NC_016163. 235 AFETISEFVVGDSEADPQKVDIIFGQERAVTPAETIHANVVWKSSSVETDDPSYDATAANFNNIQYLTEGSEGTWTGGNE 314 (590) Q Consensus 235 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 314 (590) .. T Consensus 120 ------------------------------------------------al------------------------------ 121 (396) T protein:vir:57 120 ------------------------------------------------AL------------------------------ 121 (396) T ss_pred ------------------------------------------------hh------------------------------ Confidence 00 Q ss_pred cceeccchhhHHHHHHHhhhccCCceee-ecccchhHHHHHHHHHHHHhcCCeEEEEecCCCCCHHHHHHHHHhhcCccc Q lcl|NC_016163. 
315 ESALLVKGYSGVLAPEILDKQQYEIDVL-LDGNNEVAVKNAMSDLCSEQRGDCIAILDCSFQGDAQQTIDYRTGNISMST 393 (590) Q Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~a~~~~~~~~~~~~~a~~d~p~~~~~~~~~~~~~~~~~~~s 393 (590) ..........+.++ .++.....++.+++.+|+++ ++|+++|+|.+.+++++.+||+. ++| T Consensus 122 --------------~~~~~~~~~~p~i~~ap~~~~~~v~~al~~~~~~~--~~~~~~d~p~~~~~~~~~~~~~~---~~s 182 (396) T protein:vir:57 122 --------------MGAESVTGVKPRILGVPGLDTKEVAVALASVCQEL--NAFGYISAWGCKTISEVKAYRQN---FSQ 182 (396) T ss_pred --------------hhcccceeEEeccccCcccchhHHHHHHHHHhhhC--ceEEEEcCCCCCCHHHHHHHHhc---cCC Confidence 00000000000111 12223346778888888654 49999999999999999999985 479 Q ss_pred ceEEEEcCeEEEeecccCceeeecHHHHHHHHHHHhhccCCceECcCCcccceeeccccceeec------ChhHHhhhhh Q lcl|NC_016163. 394 YFTAIFGQHMNVYDEYNGETITVTSTYFLASMIPSNDDQNGIQWTFVGPRRGVISGFTDINFYP------NEPWKEKLYL 467 (590) Q Consensus 394 ~~~~~~~p~~~~~d~~~~~~~~~ppsg~~AG~~A~~D~~~G~~~sPan~~~~~i~g~~~~~~~~------~~~e~~~Ln~ 467 (590) .|+++||||++++|+.++..+++|||+++||+|||+|.++|+||||||+.+ .|+.++...+ ++.|+++||+ T Consensus 183 ~~~~~~~p~~~~~d~~~~~~~~~p~s~~~Ag~~a~~d~~~g~~~spaN~~l---~gi~~~~~~~~~~~~~~~~~~~~Ln~ 259 (396) T protein:vir:57 183 RELMVIWPDFLAWDTVTSTTATAYATARALGLRAKIDQEQGWHKTLSNVGV---NGVTGISASVFWDLQKPGTDADLLNE 259 (396) T ss_pred ceEEEEcceeeeecccCCceeEEehhHHHHHHHHHhhhccCcEeccCCcee---ccccccceecccccCCcchhhhhhhh Confidence 999999999999999999999999999999999999999999999999764 4554443333 4678999999 Q ss_pred cCceEEEEecCCeEEEecceecCCCcccceehhhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhCCc Q lcl|NC_016163. 
468 AQVNYIERDPKKISFATQLTSQTSRSALSYINNVRVLLRIRREVEKMMADYRQEFQDNTTYDSMSYSLNNYLQQWVANRA 547 (590) Q Consensus 468 ~gIn~i~~~~~~G~~~wG~rT~s~d~~~~~i~vrR~~~~i~~si~~~~~~~vfepn~~~l~~~v~~~i~~~L~~l~~~ga 547 (590) +||||+++ ++|+++||+||++.|++|+||+|||||+||+++|+++++|+|||||++.||++|+++|+.||++||++|+ T Consensus 260 ~gi~t~~~--~~G~~~wG~rT~~~d~~~~~i~vrR~~~~i~~~i~~~~~~~v~e~n~~~~~~~i~~~i~~~l~~l~~~ga 337 (396) T protein:vir:57 260 AGVTTLVR--RDGFRFWGNRTCSDDPLFLFESYTRTAQVLADTMAEAHMWAIDKPITATLIRDIIDGINAKFRELKNNGY 337 (396) T ss_pred cCcEEEEc--CCCEEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCc Confidence 99999965 6799999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eEE---EecCCCCCHHHhhCCEEEEEEEEEecCccceEEEEEEeeC Q lcl|NC_016163. 548 CSS---ISGTVYASDYDKQQSIARVKVELVFTGVIERIAIDLVVNK 590 (590) Q Consensus 548 ~~~---~~d~~~nt~~~i~~G~l~~~i~~ap~~paefi~~~~~~~~ 590 (590) |.| +||+++||+++|++|+|+++|+++|++|+|||+|++.++. T Consensus 338 l~g~~v~~d~~~n~~~~i~~G~~~~~v~~~p~~p~e~I~~~~~~~~ 383 (396) T protein:vir:57 338 IVDGTCWFSEESNDAETLKAGKLYIDYDYTPVPPLENLTLRQRITS 383 (396) T ss_pred eeceEEEEecCCCCHHHhhCCeEEEEEEEEecCCcceEEEEEEEch Confidence 977 6999999999999999999999999999999999999999 No 25 >protein:vir:2035 Length: 396 # NCBI annotation: gpFI # Family: family:all:115 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046778;genbank:gi:9630349;genbank:GeneID:1261516 Probab=100.00 E-value=4.7e-96 Score=543.22 Aligned_cols=367 Identities=12% Similarity=0.072 Sum_probs=290.8 Q ss_pred CccccCCceEEEEecCCCceecccccceeEEEeecCC-----CCCCccEEecCHHHHHHhcCCccccccccHHHHHHHHH Q lcl|NC_016163. 1 MADYLHPSVSSRIVDNSAVYATAAGNTVLYAAIHSAI-----GRDNAVEFVTTTDEFLFKFGNPNLSKYGQTSYNILNWL 75 (590) Q Consensus 1 Mp~yl~PGVYveEi~s~~~~i~gv~Tsv~~~vg~~~~-----Gp~~~p~~v~s~~e~~~~fG~~~~~~~~~l~~av~~ff 75 (590) |++|+ |||||||++++++++.++.|++.+|+|++.. .|+++|++|+|+.||...||+. 
..|++++++|| T Consensus 1 m~~~~-~GV~v~e~~~g~~~i~~v~tav~~~vg~a~~a~~~~~~l~~pvlvts~~~~~~~~g~~-----~tL~~al~~~~ 74 (396) T protein:vir:20 1 MSDYH-HGVQVLEINEGTRVISTVSTAIVGMVCTASDADAETFPLNKPVLITNVQSAISKAGKK-----GTLAASLQAIA 74 (396) T ss_pred CCCCC-CCeEEEEcCCCcceeeecCCceeEEEeeeccCCCccccCccCEEeechHHHHhhcccc-----cchhhhhhhhh Confidence 99996 8999999999999999999999999997744 3788999999999999999962 46899999999 Q ss_pred cCCCc-EEEEEEecCCccccccccccceeecccccccceeeeeeccccccccccceEEEeeccccCCcceeeEeeccccc Q lcl|NC_016163. 76 QSGGT-AYVLRVMPDDAKFANSLISIKTTAAADPAKATVLVTAKAQTTNTASKNAMKTILSGGTAGETPLCFIVPKGRGE 154 (590) Q Consensus 76 ~nGG~-~~vvRv~~~~a~~a~~~~~~~~~~a~~~~~~~~~v~~~~~~~~~a~~~~~~~~~~~~t~~~~~~~~~~~~~~g~ 154 (590) .|||. ||++|+....-. ..... T Consensus 75 ~ngg~~~~v~~~~~~~~~---------------------------------------------------------~~~~~ 97 (396) T protein:vir:20 75 DQSKPVTVVMRVEDGTGD---------------------------------------------------------DEETK 97 (396) T ss_pred ccCceeEEEEeccccccc---------------------------------------------------------ccccc Confidence 99995 999997321000 00000 Q ss_pred ccccceEEEEeeccccccccccccceeeeeecccCCCceeeeeeeeeccccccccccccceeeeeeccccceeeecCccc Q lcl|NC_016163. 155 NYNGYGFRLSLRSDYDNTYNFRTYNLSVTVKDSTGADVVVEGPYIVSFDPEAKDKSRQSIYYANIINKYSQYVEIVDNRS 234 (590) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~l~i~v~d~~~~~~v~e~~~~ls~~~da~~~~~~~~~~~~vv~~~s~~v~~~~~~~ 234 (590) . ..+. . . ...... ..+. T Consensus 98 ~--------------a~t~-----------------~----~----------------------~~~~~~----~~~~-- 114 (396) T protein:vir:20 98 L--------------AQTV-----------------S----N----------------------IIGTTD----ENGQ-- 114 (396) T ss_pred c--------------cccc-----------------c----c----------------------cccccc----cccc-- Confidence 0 0000 0 0 000000 0000 Q ss_pred cceeeeeecccccccCccccceecccccccccccccccccccceeecccccccccccccccccceeeeeccccccccccc Q lcl|NC_016163. 
235 AFETISEFVVGDSEADPQKVDIIFGQERAVTPAETIHANVVWKSSSVETDDPSYDATAANFNNIQYLTEGSEGTWTGGNE 314 (590) Q Consensus 235 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 314 (590) .. ..... .. T Consensus 115 -----------------------------------------~t--g~~al-----------------~~----------- 123 (396) T protein:vir:20 115 -----------------------------------------YT--GLKAM-----------------LA----------- 123 (396) T ss_pred -----------------------------------------cc--hhhhh-----------------hh----------- Confidence 00 00000 00 Q ss_pred cceeccchhhHHHHHHHhhhccCCceee-ecccchhHHHHHHHHHHHHhcCCeEEEEecCCCCCHHHHHHHHHhhcCccc Q lcl|NC_016163. 315 ESALLVKGYSGVLAPEILDKQQYEIDVL-LDGNNEVAVKNAMSDLCSEQRGDCIAILDCSFQGDAQQTIDYRTGNISMST 393 (590) Q Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~a~~~~~~~~~~~~~a~~d~p~~~~~~~~~~~~~~~~~~~s 393 (590) ........+.++ .++.....++.+++.+|++++ +|+++|+|.+.+++++.+||+. ++| T Consensus 124 ----------------~~~~~~~~p~i~~ap~~~~~~v~~al~~~~~~~~--~~~~iD~p~~~~~~~a~~~r~~---~~s 182 (396) T protein:vir:20 124 ----------------AESVTGVKPRILGVPGLDTKEVAVALASVCQKLR--AFGYISAWGCKTISEVKAYRQN---FSQ 182 (396) T ss_pred ----------------hccccccchhhhhhhhhccHHHHHHHHHHHhcCC--cEEEEecCCCCCHHHHHHHhhC---CCC Confidence 000000000111 122334578888999997644 8999999999999999999975 478 Q ss_pred ceEEEEcCeEEEeecccCceeeecHHHHHHHHHHHhhccCCceECcCCcccceeeccccceee------cChhHHhhhhh Q lcl|NC_016163. 394 YFTAIFGQHMNVYDEYNGETITVTSTYFLASMIPSNDDQNGIQWTFVGPRRGVISGFTDINFY------PNEPWKEKLYL 467 (590) Q Consensus 394 ~~~~~~~p~~~~~d~~~~~~~~~ppsg~~AG~~A~~D~~~G~~~sPan~~~~~i~g~~~~~~~------~~~~e~~~Ln~ 467 (590) .|+++||||++++|+.++..+++|||+++||+|||+|.++|+||||||+.+ .|+.++... 
.++.|+++||+ T Consensus 183 ~~~~~~~P~~~~~d~~~~~~~~~p~s~~~Ag~~a~~d~~~g~~~spaN~~l---~gi~~~~~~~~~~~~~~~~~~~~Ln~ 259 (396) T protein:vir:20 183 RELMVIWPDFLAWDTVTSTTATAYATARALGLRAKIDQEQGWHKTLSNVGV---NGVTGISASVFWDLQESGTDADLLNE 259 (396) T ss_pred ceEEEEcCccccccCcCCcceeechhHHHHHHHHHhhhhcCcEeccCCcee---ccceecceecccccCCCcchhhhhhh Confidence 999999999999999999999999999999999999999999999999864 455444333 34678999999 Q ss_pred cCceEEEEecCCeEEEecceecCCCcccceehhhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhCCc Q lcl|NC_016163. 468 AQVNYIERDPKKISFATQLTSQTSRSALSYINNVRVLLRIRREVEKMMADYRQEFQDNTTYDSMSYSLNNYLQQWVANRA 547 (590) Q Consensus 468 ~gIn~i~~~~~~G~~~wG~rT~s~d~~~~~i~vrR~~~~i~~si~~~~~~~vfepn~~~l~~~v~~~i~~~L~~l~~~ga 547 (590) +|||++++ ++|+++||+||++.|++|+||++||+|+||+++|.++++|+|||||++.||++|+++|+.||++||++|+ T Consensus 260 ~gi~~~~~--~~G~~~wG~rT~s~d~~~~~i~~rR~~~~i~~~~~~~~~~~v~e~~~~~~~~~i~~~i~~~L~~l~~~G~ 337 (396) T protein:vir:20 260 SGVTTLIR--RDGFRFWGNRTCSDDPLFLFENYTRTAQVVADTMAEAHMWAVDKPITATLIRDIVDGINAKFRELKTNGY 337 (396) T ss_pred cCcEEEEc--CCCEEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCcc Confidence 99999965 7899999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eEE---EecCCCCCHHHhhCCEEEEEEEEEecCccceEEEEEEeeC Q lcl|NC_016163. 548 CSS---ISGTVYASDYDKQQSIARVKVELVFTGVIERIAIDLVVNK 590 (590) Q Consensus 548 ~~~---~~d~~~nt~~~i~~G~l~~~i~~ap~~paefi~~~~~~~~ 590 (590) |.| +||+++||+++|++|+|+++|+++|++|+|||+|++..+. 
T Consensus 338 l~g~~v~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~ 383 (396) T protein:vir:20 338 IVDATCWFSEESNDAETLKAGKLYIDYDYTPVPPLENLTLRQRITD 383 (396) T ss_pred eeceEEEEecCCCCHHHhhCCEEEEEEEEEecCCcceEEEEEEEch Confidence 876 6999999999999999999999999999999999999999 No 26 >protein:vir:1845 Length: 392 # NCBI annotation: tail sheath protein FI # Family: family:all:115 # MgeID: mge:324 # MgeName: 186 # Cross-refs: genbank:acc:NP_052269;genbank:gi:9634076;genbank:GeneID:1262448 Probab=100.00 E-value=1.6e-95 Score=540.34 Aligned_cols=367 Identities=11% Similarity=0.050 Sum_probs=293.5 Q ss_pred CccccCCceEEEEecCCCceecccccceeEEEeecCCC-----CCCccEEecCHHHHHHhcCCccccccccHHHHHHHHH Q lcl|NC_016163. 1 MADYLHPSVSSRIVDNSAVYATAAGNTVLYAAIHSAIG-----RDNAVEFVTTTDEFLFKFGNPNLSKYGQTSYNILNWL 75 (590) Q Consensus 1 Mp~yl~PGVYveEi~s~~~~i~gv~Tsv~~~vg~~~~G-----p~~~p~~v~s~~e~~~~fG~~~~~~~~~l~~av~~ff 75 (590) |++|+ |||||+|+++++++|.+++|++.+|+|++..+ |+++|++|+|+.+|...||. . +.++++++.|| T Consensus 1 m~~~~-~Gv~v~e~~~g~~~i~~~~tav~g~vgta~~~~~~~~~~~~p~~its~~~~~~~~g~---~--gtl~~al~~~~ 74 (392) T protein:vir:18 1 MSDFH-HGTKVIEINDGTRVISTVATAIVGMVWTASDADAETFPLNEPVLITNVQSAIAKAGK---K--GTLSASLQAIA 74 (392) T ss_pred CCCCC-CCeEEEEcCCCceeeeccCcceeEEEEeccCCCCcccccccceEeechHHHHhhcCC---C--cchHHHHHHhh Confidence 99997 59999999999999999999999999987544 78999999999999999995 2 56899999999 Q ss_pred cCCCc-EEEEEEecCCccccccccccceeecccccccceeeeeeccccccccccceEEEeeccccCCcceeeEeeccccc Q lcl|NC_016163. 76 QSGGT-AYVLRVMPDDAKFANSLISIKTTAAADPAKATVLVTAKAQTTNTASKNAMKTILSGGTAGETPLCFIVPKGRGE 154 (590) Q Consensus 76 ~nGG~-~~vvRv~~~~a~~a~~~~~~~~~~a~~~~~~~~~v~~~~~~~~~a~~~~~~~~~~~~t~~~~~~~~~~~~~~g~ 154 (590) +|||. |+++|+...... + +. 
T Consensus 75 ~ngg~~~~vv~v~~~~~~--------------~-----------------------------------------~~---- 95 (392) T protein:vir:18 75 DQSKPVTVVVRVAEGTGD--------------D-----------------------------------------AE---- 95 (392) T ss_pred cccCceEEEecccccccc--------------c-----------------------------------------cc---- Confidence 99995 999987311000 0 00 Q ss_pred ccccceEEEEeeccccccccccccceeeeeecccCCCceeeeeeeeeccccccccccccceeeeeeccccceeeecCccc Q lcl|NC_016163. 155 NYNGYGFRLSLRSDYDNTYNFRTYNLSVTVKDSTGADVVVEGPYIVSFDPEAKDKSRQSIYYANIINKYSQYVEIVDNRS 234 (590) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~l~i~v~d~~~~~~v~e~~~~ls~~~da~~~~~~~~~~~~vv~~~s~~v~~~~~~~ 234 (590) .... .+ ++...+. ... T Consensus 96 -----------~~t~---------------~d--------------------------------liG~~~~---~~~--- 111 (392) T protein:vir:18 96 -----------AQTT---------------SN--------------------------------IIGGTDE---NGK--- 111 (392) T ss_pred -----------ccch---------------hh--------------------------------heecccc---cch--- Confidence 0000 00 0000000 000 Q ss_pred cceeeeeecccccccCccccceecccccccccccccccccccceeecccccccccccccccccceeeeeccccccccccc Q lcl|NC_016163. 235 AFETISEFVVGDSEADPQKVDIIFGQERAVTPAETIHANVVWKSSSVETDDPSYDATAANFNNIQYLTEGSEGTWTGGNE 314 (590) Q Consensus 235 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 314 (590) .. . T Consensus 112 -----------------------------------------~t-------------------g----------------- 114 (392) T protein:vir:18 112 -----------------------------------------YT-------------------G----------------- 114 (392) T ss_pred -----------------------------------------hh-------------------h----------------- Confidence 00 0 Q ss_pred cceeccchhhHHHHHHHhhhccCCceeeeccc-chhHHHHHHHHHHHHhcCCeEEEEecCCCCCHHHHHHHHHhhcCccc Q lcl|NC_016163. 
315 ESALLVKGYSGVLAPEILDKQQYEIDVLLDGN-NEVAVKNAMSDLCSEQRGDCIAILDCSFQGDAQQTIDYRTGNISMST 393 (590) Q Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~a~~~~~~~~~~~~~a~~d~p~~~~~~~~~~~~~~~~~~~s 393 (590) ..+...........+.+++.++ ...+++.+++.+|++.+ +|+++|+|++.+.+++.+||+. ++| T Consensus 115 ----------~~al~~~~~~~~~~p~il~ap~~~~~~v~~~l~~~~~~~~--~~~~~d~~~~~~~~~a~~~~~~---~~s 179 (392) T protein:vir:18 115 ----------IKALLTAEAVTGVKPRILGVPGLDTQEVATALASVCISLR--AFGYVSAWGCKTISEAMAYREN---FSQ 179 (392) T ss_pred ----------HHHHHhhhhhhceeehhcccCccchHHHHHHHHHHHhhcC--cEEEEecCCCCCHHHHHHHHhh---ccC Confidence 0000000011111123333333 45678888888887654 8999999999999999999975 479 Q ss_pred ceEEEEcCeEEEeecccCceeeecHHHHHHHHHHHhhccCCceECcCCcccceeecc-ccce--eecChhHHhhhhhcCc Q lcl|NC_016163. 394 YFTAIFGQHMNVYDEYNGETITVTSTYFLASMIPSNDDQNGIQWTFVGPRRGVISGF-TDIN--FYPNEPWKEKLYLAQV 470 (590) Q Consensus 394 ~~~~~~~p~~~~~d~~~~~~~~~ppsg~~AG~~A~~D~~~G~~~sPan~~~~~i~g~-~~~~--~~~~~~e~~~Ln~~gI 470 (590) .|+++||||++++|+.++..+++|||+++||++||+|.++||||||||+.+.+|.++ ..+. ...++.|++.||++|| T Consensus 180 ~~~~~~~p~~~~~d~~~~~~~~~p~s~~~AG~~a~~d~~~g~~~spaN~~l~gi~~~~~~~~~~~~~~~~~~~~Ln~~gI 259 (392) T protein:vir:18 180 RELMVIWPDFLAWDTTANATATAYATARALGLRAYIDQTIGWHKTLSNVGVQGVTGISASVFWDLQASGTDADLLNEAGV 259 (392) T ss_pred ceEEEEeCceeeecccCCceEEechHHHHHHHHHhhhccCCceEccCCceeeceeecceecccccCCCcchhhhhhhcCc Confidence 999999999999999999999999999999999999999999999999864444333 1222 2234678999999999 Q ss_pred eEEEEecCCeEEEecceecCCCcccceehhhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhCCceEE Q lcl|NC_016163. 
471 NYIERDPKKISFATQLTSQTSRSALSYINNVRVLLRIRREVEKMMADYRQEFQDNTTYDSMSYSLNNYLQQWVANRACSS 550 (590) Q Consensus 471 n~i~~~~~~G~~~wG~rT~s~d~~~~~i~vrR~~~~i~~si~~~~~~~vfepn~~~l~~~v~~~i~~~L~~l~~~ga~~~ 550 (590) |+++ +++|+++||+||++.|++||||+|||+|+||+++|+++++|+|||||++.+|++|+++|+.||++||++|+|.| T Consensus 260 ~t~~--~~~G~~~wG~rT~~~d~~~~~i~~rR~~~~i~~~i~~~~~~~v~e~n~~~~~~~i~~~i~~~L~~l~~~gal~g 337 (392) T protein:vir:18 260 TTLV--RKDGFRFWGNRTCSDDPLFLFENYTRTAQVLADTMAEAHMWAVDKPITASLIRDIVDGINAKFRELKSNGYIVD 337 (392) T ss_pred eEEE--cCCCEEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhcCcccc Confidence 9996 47899999999999999999999999999999999999999999999999999999999999999999999977 Q ss_pred ---EecCCCCCHHHhhCCEEEEEEEEEecCccceEEEEEEeeC Q lcl|NC_016163. 551 ---ISGTVYASDYDKQQSIARVKVELVFTGVIERIAIDLVVNK 590 (590) Q Consensus 551 ---~~d~~~nt~~~i~~G~l~~~i~~ap~~paefi~~~~~~~~ 590 (590) +||+++||+++|++|+|+++|+++|++|+|||+|++.++. T Consensus 338 ~~v~~d~~~nt~~~i~~G~~~~~v~~~p~~p~e~I~~~~~~~~ 380 (392) T protein:vir:18 338 GECWFDEESNDKETLKAGKLYIDYDYTPVPPLESLTLRQRITD 380 (392) T ss_pred eEEEEecCCCCHHHhhCCeEEEEEEEEecCCcceEEEEEEEch Confidence 7999999999999999999999999999999999999999 No 27 >protein:vir:98553 Length: 395 # NCBI annotation: gp21 # Family: family:all:115 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958076;genbank:gi:41057373;genbank:GeneID:2744224 Probab=100.00 E-value=6.6e-95 Score=536.91 Aligned_cols=370 Identities=12% Similarity=0.048 Sum_probs=292.1 Q ss_pred CccccCCceEEEEecCCCceecccccceeEEEeecCCC-----CCCccEEecCHHHHHHhcCCccccccccHHHHHHHHH Q lcl|NC_016163. 1 MADYLHPSVSSRIVDNSAVYATAAGNTVLYAAIHSAIG-----RDNAVEFVTTTDEFLFKFGNPNLSKYGQTSYNILNWL 75 (590) Q Consensus 1 Mp~yl~PGVYveEi~s~~~~i~gv~Tsv~~~vg~~~~G-----p~~~p~~v~s~~e~~~~fG~~~~~~~~~l~~av~~ff 75 (590) |++|+ |||||+|++++++++..++|++.+|+|++..+ |.++|++|+|+.||+..||. . 
+.|.+++++|| T Consensus 1 m~~~~-~GV~v~e~~~g~~~v~~v~tav~~~vgta~~~~~~~~p~~~pv~v~s~~~~~~~~g~---~--~tl~~al~~~~ 74 (395) T protein:vir:98 1 MSDFH-HGTQVIEINDGTRVISTVATAVVGMVCTASDADATLFPLNEPVLITNVQSAIAKAGK---K--GTLAASLQAIA 74 (395) T ss_pred CCCCC-CCeEEEEcCCCcccccccCcceEEEEeeccCCCccccccccceEeechHHhHhhccc---c--cchhhHHHHHh Confidence 99996 69999999999999999999998888877533 68999999999999999995 2 46899999999 Q ss_pred cCCCc-EEEEEEecCCccccccccccceeecccccccceeeeeeccccccccccceEEEeeccccCCcceeeEeeccccc Q lcl|NC_016163. 76 QSGGT-AYVLRVMPDDAKFANSLISIKTTAAADPAKATVLVTAKAQTTNTASKNAMKTILSGGTAGETPLCFIVPKGRGE 154 (590) Q Consensus 76 ~nGG~-~~vvRv~~~~a~~a~~~~~~~~~~a~~~~~~~~~v~~~~~~~~~a~~~~~~~~~~~~t~~~~~~~~~~~~~~g~ 154 (590) +|||. |||+|+........ . . T Consensus 75 ~~~~~~~~vv~~~~~~~~~~---------------------------------------------------------~-~ 96 (395) T protein:vir:98 75 DQSKPVTVVVRVEDGTGDDE---------------------------------------------------------E-A 96 (395) T ss_pred hccCceEEEeeccccccccc---------------------------------------------------------c-c Confidence 99995 99999732100000 0 0 Q ss_pred ccccceEEEEeeccccccccccccceeeeeecccCCCceeeeeeeeeccccccccccccceeeeeeccccceeeecCccc Q lcl|NC_016163. 155 NYNGYGFRLSLRSDYDNTYNFRTYNLSVTVKDSTGADVVVEGPYIVSFDPEAKDKSRQSIYYANIINKYSQYVEIVDNRS 234 (590) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~l~i~v~d~~~~~~v~e~~~~ls~~~da~~~~~~~~~~~~vv~~~s~~v~~~~~~~ 234 (590) . . . ..... +... .+.. T Consensus 97 ~-------~------a------------------------~~~~~-------------------i~g~-------~~~~- 112 (395) T protein:vir:98 97 A-------L------A------------------------QTVSN-------------------IIGG-------TDEN- 112 (395) T ss_pred c-------c------c------------------------ccccc-------------------cccc-------cccc- Confidence 0 0 0 00000 0000 0000 Q ss_pred cceeeeeecccccccCccccceecccccccccccccccccccceeecccccccccccccccccceeeeeccccccccccc Q lcl|NC_016163. 
235 AFETISEFVVGDSEADPQKVDIIFGQERAVTPAETIHANVVWKSSSVETDDPSYDATAANFNNIQYLTEGSEGTWTGGNE 314 (590) Q Consensus 235 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 314 (590) .. . .+ T Consensus 113 -------------------------------~~--------~-------------------Tg----------------- 117 (395) T protein:vir:98 113 -------------------------------GK--------Y-------------------TG----------------- 117 (395) T ss_pred -------------------------------cc--------h-------------------hH----------------- Confidence 00 0 00 Q ss_pred cceeccchhhHHHHHHHhhhccCCceeeec-ccchhHHHHHHHHHHHHhcCCeEEEEecCCCCCHHHHHHHHHhhcCccc Q lcl|NC_016163. 315 ESALLVKGYSGVLAPEILDKQQYEIDVLLD-GNNEVAVKNAMSDLCSEQRGDCIAILDCSFQGDAQQTIDYRTGNISMST 393 (590) Q Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~a~~~~~~~~~~~~~a~~d~p~~~~~~~~~~~~~~~~~~~s 393 (590) ..+...........+.+++. +....+++.+++.+|++++ +|+++|+|.+.+.+++.+|++. ++| T Consensus 118 ----------l~al~~~~~~~~~~p~il~ap~~~~~~v~~al~~~~~~~~--~~~~~d~p~~~t~~~a~~~~~~---~~s 182 (395) T protein:vir:98 118 ----------IKALLTAQAVTGVKPRILGVPGLDTKEVAVALASAAIKLR--AFAYVSAWGCKTISEAMEYRKN---FSQ 182 (395) T ss_pred ----------HHHHhhhhhhhccchhhcccccccccHHHHHHHHHhhhcC--cEEEEEcCCCCCHHHHHHHHhc---cCC Confidence 00000000011111222322 3345567888888887654 8999999999999999999975 478 Q ss_pred ceEEEEcCeEEEeecccCceeeecHHHHHHHHHHHhhccCCceECcCCcccceeecc-ccce--eecChhHHhhhhhcCc Q lcl|NC_016163. 394 YFTAIFGQHMNVYDEYNGETITVTSTYFLASMIPSNDDQNGIQWTFVGPRRGVISGF-TDIN--FYPNEPWKEKLYLAQV 470 (590) Q Consensus 394 ~~~~~~~p~~~~~d~~~~~~~~~ppsg~~AG~~A~~D~~~G~~~sPan~~~~~i~g~-~~~~--~~~~~~e~~~Ln~~gI 470 (590) +|+++||||++++|+.++..+++|||+++||+|||+|.++|+||||||+.+.++.|+ ..+. 
...+++|++.||++|| T Consensus 183 ~~~~~~~p~~~~~d~~~~~~~~~p~s~~~AG~~a~~d~~~g~~~spaN~~i~gi~~~~~~~~~~~~~~~~~~~~Ln~~gI 262 (395) T protein:vir:98 183 RELMVIWPDFLAWDTVKNTTATAYATARALGLRAYIDQTVGWHKTLSNVGVQGVTGISASVFWDLQASGTDADLLNEAGV 262 (395) T ss_pred ceEEEEecceeEecccCCceeeechHHHHHHHHHHhhcccCcEeccCCceeecccccceecccccCCCcchHHhhhhcCc Confidence 999999999999999999999999999999999999999999999999864434343 1222 2235789999999999 Q ss_pred eEEEEecCCeEEEecceecCCCcccceehhhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhCCceEE Q lcl|NC_016163. 471 NYIERDPKKISFATQLTSQTSRSALSYINNVRVLLRIRREVEKMMADYRQEFQDNTTYDSMSYSLNNYLQQWVANRACSS 550 (590) Q Consensus 471 n~i~~~~~~G~~~wG~rT~s~d~~~~~i~vrR~~~~i~~si~~~~~~~vfepn~~~l~~~v~~~i~~~L~~l~~~ga~~~ 550 (590) ||+++ ++|+++||+||+++|++|+||+|||||+||+++|+++++|+||||||+.||.+|+++|+.||++||++|+|.| T Consensus 263 ~~~~~--~~G~~~wG~rT~s~d~~~~~i~~rR~~~~i~~~i~~~~~~~v~e~~~~~~~~~i~~~i~~~L~~l~~~g~l~g 340 (395) T protein:vir:98 263 TTLVR--KDGFRFWGNRTCSDDPLFLFENYTRTAQVLADTMAEAHMWAVDKPITATLIRDIVDGINAKFRELKSNGYIVE 340 (395) T ss_pred EEEEc--CCCEEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceec Confidence 99954 7899999999999999999999999999999999999999999999999999999999999999999999876 Q ss_pred ---EecCCCCCHHHhhCCEEEEEEEEEecCccceEEEEEEeeC Q lcl|NC_016163. 551 ---ISGTVYASDYDKQQSIARVKVELVFTGVIERIAIDLVVNK 590 (590) Q Consensus 551 ---~~d~~~nt~~~i~~G~l~~~i~~ap~~paefi~~~~~~~~ 590 (590) +||+++||+++|++|+|+++|+++|++|+|||+|++..+. 
T Consensus 341 ~~v~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~I~~~~~~~~ 383 (395) T protein:vir:98 341 GKCWFDEESNDKETLKAGKLYIDYDYTPVPPLESLTLRQRITD 383 (395) T ss_pred eEEEEecCCCCHHHhhCCeEEEEEEEEecCCcceEEEEEEEch Confidence 7999999999999999999999999999999999999999 No 28 >protein:vir:1172 Length: 391 # NCBI annotation: hypothetical protein # Family: family:all:115 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490621;genbank:gi:17313241;genbank:GeneID:927300 Probab=100.00 E-value=5.1e-95 Score=537.55 Aligned_cols=365 Identities=10% Similarity=0.059 Sum_probs=292.5 Q ss_pred CccccCCceEEEEecCCCceecccccceeEEEeecC-----CCCCCccEEecCHHHHHHhcCCccccccccHHHHHHHHH Q lcl|NC_016163. 1 MADYLHPSVSSRIVDNSAVYATAAGNTVLYAAIHSA-----IGRDNAVEFVTTTDEFLFKFGNPNLSKYGQTSYNILNWL 75 (590) Q Consensus 1 Mp~yl~PGVYveEi~s~~~~i~gv~Tsv~~~vg~~~-----~Gp~~~p~~v~s~~e~~~~fG~~~~~~~~~l~~av~~ff 75 (590) |++|.+|||||+|+++++++|..+.|++.+|+++++ .+|+++|++|+|+.||...||. + +.+.+++++|| T Consensus 2 ~~~~~~~GV~v~e~~~~~~~i~~v~tavig~vg~a~~a~~~~~~~~~p~~v~s~~~~~~~~g~---~--~tl~~al~~~~ 76 (391) T protein:vir:11 2 AADQYHHGVRVQEINDGTRPIRTIATAIIGLVATAEDADATAFPLDTPVLITNVQAAIGKAGT---S--GTLPASLQAIA 76 (391) T ss_pred CCCcCCCcEEEEECCCCcceecccCCceeEEEEecCCCCCccccccccEEEecchhhheecCC---C--ccchhhhhhhh Confidence 778999999999999999999999999988888876 5689999999999999999995 2 46889999999 Q ss_pred cCCCc-EEEEEEecCCccccccccccceeecccccccceeeeeeccccccccccceEEEeeccccCCcceeeEeeccccc Q lcl|NC_016163. 76 QSGGT-AYVLRVMPDDAKFANSLISIKTTAAADPAKATVLVTAKAQTTNTASKNAMKTILSGGTAGETPLCFIVPKGRGE 154 (590) Q Consensus 76 ~nGG~-~~vvRv~~~~a~~a~~~~~~~~~~a~~~~~~~~~v~~~~~~~~~a~~~~~~~~~~~~t~~~~~~~~~~~~~~g~ 154 (590) +|||. ||++|+...+.. .. . 
T Consensus 77 ~~~g~~~~vv~~~~~~~~-----------------~~------------------------------------------t 97 (391) T protein:vir:11 77 DQANAATVVVRVKPGEDE-----------------AA------------------------------------------T 97 (391) T ss_pred ccccceeEEeeecccccc-----------------cc------------------------------------------c Confidence 99995 999998311000 00 0 Q ss_pred ccccceEEEEeeccccccccccccceeeeeecccCCCceeeeeeeeeccccccccccccceeeeeeccccceeeecCccc Q lcl|NC_016163. 155 NYNGYGFRLSLRSDYDNTYNFRTYNLSVTVKDSTGADVVVEGPYIVSFDPEAKDKSRQSIYYANIINKYSQYVEIVDNRS 234 (590) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~l~i~v~d~~~~~~v~e~~~~ls~~~da~~~~~~~~~~~~vv~~~s~~v~~~~~~~ 234 (590) . .+..+. .+...... .+. T Consensus 98 ~-----------~d~~g~------------~~a~~~~~--------------------------g~~------------- 115 (391) T protein:vir:11 98 N-----------SAVIGG------------VSADGKYT--------------------------GMK------------- 115 (391) T ss_pred c-----------hhhhcc------------cccccchh--------------------------hhh------------- Confidence 0 000000 00000000 000 Q ss_pred cceeeeeecccccccCccccceecccccccccccccccccccceeecccccccccccccccccceeeeeccccccccccc Q lcl|NC_016163. 235 AFETISEFVVGDSEADPQKVDIIFGQERAVTPAETIHANVVWKSSSVETDDPSYDATAANFNNIQYLTEGSEGTWTGGNE 314 (590) Q Consensus 235 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 314 (590) . ..... T Consensus 116 ----------------------------------------------a--~~~~~-------------------------- 121 (391) T protein:vir:11 116 ----------------------------------------------A--LLAAK-------------------------- 121 (391) T ss_pred ----------------------------------------------h--hhhhh-------------------------- Confidence 0 00000 Q ss_pred cceeccchhhHHHHHHHhhhccCCceee-ecccchhHHHHHHHHHHHHhcCCeEEEEecCCCCCHHHHHHHHHhhcCccc Q lcl|NC_016163. 
315 ESALLVKGYSGVLAPEILDKQQYEIDVL-LDGNNEVAVKNAMSDLCSEQRGDCIAILDCSFQGDAQQTIDYRTGNISMST 393 (590) Q Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~a~~~~~~~~~~~~~a~~d~p~~~~~~~~~~~~~~~~~~~s 393 (590) ......+.++ .++....+++.+++.+|+++ ++|+++|+|++.+.+++.+||+. ++| T Consensus 122 ------------------~~~~~~p~~~~ap~~~~~~v~~al~~~~~~~--~~~~i~D~p~~~t~~~a~~~r~~---~~s 178 (391) T protein:vir:11 122 ------------------ARLGVVPRILGVPGLDTQPVATALIAIAQQL--RAFAYVSASGCKTKEEATAYREN---FAA 178 (391) T ss_pred ------------------hhheeccccccccccccHHHHHHHHHhhccc--ceEEEEEcCCCCCHHHHHHHhhh---cCC Confidence 0000001111 12233456888888888664 49999999999999999999975 489 Q ss_pred ceEEEEcCeEEEeecccCceeeecHHHHHHHHHHHhhccCCceECcCCcccceeecc-ccceee--cChhHHhhhhhcCc Q lcl|NC_016163. 394 YFTAIFGQHMNVYDEYNGETITVTSTYFLASMIPSNDDQNGIQWTFVGPRRGVISGF-TDINFY--PNEPWKEKLYLAQV 470 (590) Q Consensus 394 ~~~~~~~p~~~~~d~~~~~~~~~ppsg~~AG~~A~~D~~~G~~~sPan~~~~~i~g~-~~~~~~--~~~~e~~~Ln~~gI 470 (590) .|+++||||++++|+.++..+++|||+++||+|||+|.++||||||||+++.++.++ ..+.+. .++.|++.||++|| T Consensus 179 ~~~~~~~p~~~~~~~~~~~~~~~p~s~~~ag~~a~~d~~~g~~~span~~l~gi~~~~~~~~~~~~~~~~~~~~Ln~~gi 258 (391) T protein:vir:11 179 REAMVIWPDFLTWSTVVNQTVPAPAVAQALGLRARIDQEVGWHKTLSNVAVNGVTGISADVFWDLQSPSTDANYLNENEV 258 (391) T ss_pred ceEEEEcCcceecccccCceEEechHHHHHHHHHHhhccCCcEEccCCceeeceeecccccccccCCCcchhhhhhhcCc Confidence 999999999999999999999999999999999999999999999999875444443 222222 34678999999999 Q ss_pred eEEEEecCCeEEEecceecCCCcccceehhhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhCCceEE Q lcl|NC_016163. 
471 NYIERDPKKISFATQLTSQTSRSALSYINNVRVLLRIRREVEKMMADYRQEFQDNTTYDSMSYSLNNYLQQWVANRACSS 550 (590) Q Consensus 471 n~i~~~~~~G~~~wG~rT~s~d~~~~~i~vrR~~~~i~~si~~~~~~~vfepn~~~l~~~v~~~i~~~L~~l~~~ga~~~ 550 (590) |+++ +++|+++||+||++.|++|+||+|||||+||+++|+++++|+|||||++.||++|+++|+.||++||++|+|.| T Consensus 259 ~~~~--~~~G~~~wG~rT~~~d~~~~~i~vrR~~~~i~~~~~~~~~~~v~e~n~~~~~~~i~~~i~~~l~~l~~~g~l~g 336 (391) T protein:vir:11 259 TTLV--QEGGFRFWGSRTCSDDPLFAFENYTRTAQVLADTIAEAHMWAVDKPMHPSLVRDILEGVNAKFRELKGLGLIID 336 (391) T ss_pred EEEE--cCCCEEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhccceec Confidence 9995 57899999999999999999999999999999999999999999999999999999999999999999999876 Q ss_pred ---EecCCCCCHHHhhCCEEEEEEEEEecCccceEEEEEEeeC Q lcl|NC_016163. 551 ---ISGTVYASDYDKQQSIARVKVELVFTGVIERIAIDLVVNK 590 (590) Q Consensus 551 ---~~d~~~nt~~~i~~G~l~~~i~~ap~~paefi~~~~~~~~ 590 (590) +||+++||+++|++|+|+++|+++|++|+|||+|++.++. T Consensus 337 ~~~~~~~~~n~~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~ 379 (391) T protein:vir:11 337 AQAWYDPNVNDKDTLKAGKLRITYDYTPVPPLEDLTFFQKITD 379 (391) T ss_pred eEEEEecCCCCHHHhhCCeEEEEEEEEecCCcceEEEEEEEch Confidence 6999999999999999999999999999999999999999 No 29 >protein:vir:100323 Length: 393 # NCBI annotation: tail sheath protein FI # Family: family:all:115 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655492;genbank:gi:109289960;genbank:GeneID:4157366 Probab=100.00 E-value=3.2e-92 Score=522.17 Aligned_cols=363 Identities=13% Similarity=0.070 Sum_probs=283.3 Q ss_pred CccccCCceEEEEecCCCceecccccceeEEEeecCCC-----CCCccEEecCHHHHHHhcCCccccccccHHHHHHHHH Q lcl|NC_016163. 1 MADYLHPSVSSRIVDNSAVYATAAGNTVLYAAIHSAIG-----RDNAVEFVTTTDEFLFKFGNPNLSKYGQTSYNILNWL 75 (590) Q Consensus 1 Mp~yl~PGVYveEi~s~~~~i~gv~Tsv~~~vg~~~~G-----p~~~p~~v~s~~e~~~~fG~~~~~~~~~l~~av~~ff 75 (590) |+++.+|||||+|+++++++|..++|++.+|+|++..+ |+++|++|+|+.||.+.||. . 
+.|.+++++|| T Consensus 3 m~~~~~~GV~v~e~~~g~~~i~~~~tav~~~vgta~~~~~~~~pln~pv~i~s~~~~~~~~g~---~--g~L~~al~~~~ 77 (393) T protein:vir:10 3 ILDTYLHGVEVVEVNAGGVTISTAATSVIGVVCTGDQADAETFPLNTPVLITNPLNYLEKAGS---T--GTLRRTLNSIG 77 (393) T ss_pred CCCccCCCeEEEEcCCCcceecccCcceeEEEeeccCcCcccccCccceEecchHHHHHhhCC---c--cchhhhhhhhh Confidence 88887899999999999999999999999999988766 89999999999999999995 2 57999999999 Q ss_pred cCCC-cEEEEEEecCCccccccccccceeecccccccceeeeeeccccccccccceEEEeeccccCCcceeeEeeccccc Q lcl|NC_016163. 76 QSGG-TAYVLRVMPDDAKFANSLISIKTTAAADPAKATVLVTAKAQTTNTASKNAMKTILSGGTAGETPLCFIVPKGRGE 154 (590) Q Consensus 76 ~nGG-~~~vvRv~~~~a~~a~~~~~~~~~~a~~~~~~~~~v~~~~~~~~~a~~~~~~~~~~~~t~~~~~~~~~~~~~~g~ 154 (590) +|+| .||+||+...+.. .. + .... .+. T Consensus 78 ~~~~~~~~vv~v~~~~~~-----------------~~--------------------------t-----~~~i----ig~ 105 (393) T protein:vir:10 78 SIVKTPTVIVRVAESDDS-----------------DT--------------------------L-----TANI----VGT 105 (393) T ss_pred cccCceEEEeecccCccc-----------------cc--------------------------c-----cccc----ccc Confidence 9999 4999998311100 00 0 0000 000 Q ss_pred ccccceEEEEeeccccccccccccceeeeeecccCCCceeeeeeeeeccccccccccccceeeeeeccccceeeecCccc Q lcl|NC_016163. 155 NYNGYGFRLSLRSDYDNTYNFRTYNLSVTVKDSTGADVVVEGPYIVSFDPEAKDKSRQSIYYANIINKYSQYVEIVDNRS 234 (590) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~l~i~v~d~~~~~~v~e~~~~ls~~~da~~~~~~~~~~~~vv~~~s~~v~~~~~~~ 234 (590) . + ++..+..+ T Consensus 106 -----------~-------------------~-~~~~tgl~--------------------------------------- 115 (393) T protein:vir:10 106 -----------Q-------------------E-NGKFTGIK--------------------------------------- 115 (393) T ss_pred -----------c-------------------c-cchhhHHH--------------------------------------- Confidence 0 0 00000000 Q ss_pred cceeeeeecccccccCccccceecccccccccccccccccccceeecccccccccccccccccceeeeeccccccccccc Q lcl|NC_016163. 
235 AFETISEFVVGDSEADPQKVDIIFGQERAVTPAETIHANVVWKSSSVETDDPSYDATAANFNNIQYLTEGSEGTWTGGNE 314 (590) Q Consensus 235 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 314 (590) T Consensus 116 -------------------------------------------------------------------------------- 115 (393) T protein:vir:10 116 -------------------------------------------------------------------------------- 115 (393) T ss_pred -------------------------------------------------------------------------------- Confidence Q ss_pred cceeccchhhHHHHHHHhhhccCCceeeeccc-chhHHHHHHHHHHHHhcCCeEEEEecCCCCCHHHHHHHHHhhcCccc Q lcl|NC_016163. 315 ESALLVKGYSGVLAPEILDKQQYEIDVLLDGN-NEVAVKNAMSDLCSEQRGDCIAILDCSFQGDAQQTIDYRTGNISMST 393 (590) Q Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~a~~~~~~~~~~~~~a~~d~p~~~~~~~~~~~~~~~~~~~s 393 (590) +...........++++++++ +..+++.+++.+|+++ +.++ +++.|++.+.++++.|+..+ +| T Consensus 116 ------------al~~~~~~~~~~p~li~apg~~~~~~~~al~~~~~~~-~~~~-~v~d~~~~t~~~ai~~~~~~---~s 178 (393) T protein:vir:10 116 ------------ALLTAQSTVFVKPKLLCVPQHDNQAVATELLSVAKKL-NAFA-FISDNGATTKEQAYTYRQNF---SQ 178 (393) T ss_pred ------------HHHhhhhhcceeeeeeeeccccchHHHHHHHHHhhcc-CcEE-EEEcCCCCCHHHHHHHhhhc---CC Confidence 00000000000011222222 2345677888888765 4455 45555667888999999754 78 Q ss_pred ceEEEEcCeEEEeecccCceeeecHHHHHHHHHHHhhccCCceECcCCcccceeeccc-ccee--ecChhHHhhhhhcCc Q lcl|NC_016163. 394 YFTAIFGQHMNVYDEYNGETITVTSTYFLASMIPSNDDQNGIQWTFVGPRRGVISGFT-DINF--YPNEPWKEKLYLAQV 470 (590) Q Consensus 394 ~~~~~~~p~~~~~d~~~~~~~~~ppsg~~AG~~A~~D~~~G~~~sPan~~~~~i~g~~-~~~~--~~~~~e~~~Ln~~gI 470 (590) .|+++||||++++|+.++..+++|||+++||+|||+|.++||||||||+.+.++.|.. 
.+.+ .+++.|++.||++|| T Consensus 179 ~~~~~~~P~~~~~d~~~~~~~~~p~s~~~Ag~~a~~d~~~G~~~spaN~~l~gi~~~~~~~~~~~~~~~~~~~~Ln~~gI 258 (393) T protein:vir:10 179 REGMMIFGDWKSYNTDKKAYDTDYAVARACALQAYIDKTVGWHKNISNVELDGVTGITKAVEFDINESSTEANYLNEKGI 258 (393) T ss_pred ceEEEEecccccccccCCceeEeehhHHHHHHHHHhhcCCCcEEccCCceeeceeecceecccccCCCcchhHhHhhcCc Confidence 9999999999999999999999999999999999999999999999998654444432 2222 334779999999999 Q ss_pred eEEEEecCCeEEEecceecCCCcccceehhhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhCC--ce Q lcl|NC_016163. 471 NYIERDPKKISFATQLTSQTSRSALSYINNVRVLLRIRREVEKMMADYRQEFQDNTTYDSMSYSLNNYLQQWVANR--AC 548 (590) Q Consensus 471 n~i~~~~~~G~~~wG~rT~s~d~~~~~i~vrR~~~~i~~si~~~~~~~vfepn~~~l~~~v~~~i~~~L~~l~~~g--a~ 548 (590) |||+ +++|+++||+||++.|++|+||+||||+++|+++|+++++|+|||||++.||++|+++|+.||++||+.| +| T Consensus 259 ~t~~--~~~G~~~wG~rT~s~d~~~~~i~vrR~~~~i~~~i~~~~~~~v~e~~~~~~~~~i~~~i~~~L~~l~~~g~~al 336 (393) T protein:vir:10 259 TICL--NHNGFRYWGSRTLATDTRWAFQQSVRTAQIIKETIGAGLAWAVDMPLTPLRVKTMLEAINNKLRSWASGDDPRI 336 (393) T ss_pred eEEE--cCCCEEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhcccccc Confidence 9995 4689999999999999999999999999999999999999999999999999999999999999999966 56 Q ss_pred EE---EecCCCCCHHHhhCCEEEEEEEEEecCccceEEEEEEeeC Q lcl|NC_016163. 
549 SS---ISGTVYASDYDKQQSIARVKVELVFTGVIERIAIDLVVNK 590 (590) Q Consensus 549 ~~---~~d~~~nt~~~i~~G~l~~~i~~ap~~paefi~~~~~~~~ 590 (590) .| +||++ ||++||++|+|+++|+++|++|+|||+|++..++ T Consensus 337 ~g~~v~~~~~-nt~~~i~~G~~~~~i~~~p~~p~e~I~~~~~~~~ 380 (393) T protein:vir:10 337 LGARVWVAEE-ITADIIKSGKFVIKYDYHWIPSLESLGLEQRVND 380 (393) T ss_pred ccceEEecCC-CCHHHhhCCEEEEEEEEEecCCcceEEEEEEEch Confidence 44 68775 8889999999999999999999999999999999 No 30 >protein:vir:103168 Length: 641 # NCBI annotation: gp122 # Family: family:all:661 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717789;genbank:gi:113200626;genbank:GeneID:4239177 Probab=100.00 E-value=1e-90 Score=513.92 Aligned_cols=466 Identities=18% Similarity=0.143 Sum_probs=282.2 Q ss_pred CccccCCceEEEEecCCCceecccccceeEEEeecCCCCCCccEEecCHHHHHHhcCCccccccccHHHHHHHHHcCCCc Q lcl|NC_016163. 1 MADYLHPSVSSRIVDNSAVYATAAGNTVLYAAIHSAIGRDNAVEFVTTTDEFLFKFGNPNLSKYGQTSYNILNWLQSGGT 80 (590) Q Consensus 1 Mp~yl~PGVYveEi~s~~~~i~gv~Tsv~~~vg~~~~Gp~~~p~~v~s~~e~~~~fG~~~~~~~~~l~~av~~ff~nGG~ 80 (590) ||+||||||||||+|++ ++|+||+|++++|+|.++|||+++|++|+||.||+++||+++ .+.++.|+|++||+|||+ T Consensus 3 m~~~~sPGVyv~E~~~~-~~i~~v~tsvaafvG~~~~GP~~~p~~v~s~~d~~~~FG~~~--~~~~l~~av~~fF~ngG~ 79 (641) T protein:vir:10 3 VSNQLSPGVVIQERDLT-AVTTPIGLNVGVLAAPFTKGPVEEIFEVSTERDLASVFGEPN--DYNYEYWFTASQFLSYGG 79 (641) T ss_pred CccccCCceEEEEecCC-CcccccCCccceEEecccCCCCCccEEecCHHHHHHHcCCcC--CCcchHHHHHHHHHhcCC Confidence 99999999999999987 689999999999999999999999999999999999999986 447899999999999995 Q ss_pred -EEEEEEecCCccccccccc--------------------cceeecccccccceeeeeecccccc--------------- Q lcl|NC_016163. 81 -AYVLRVMPDDAKFANSLIS--------------------IKTTAAADPAKATVLVTAKAQTTNT--------------- 124 (590) Q Consensus 81 -~~vvRv~~~~a~~a~~~~~--------------------~~~~~a~~~~~~~~~v~~~~~~~~~--------------- 124 (590) ||||||++.+++++..... .....+..|+.|.+.+.+....... 
T Consensus 80 ~~~vvRv~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~A~~pG~~gn~l~v~i~~~~~~~~~~~~~~~~~~~~ 159 (641) T protein:vir:10 80 VLKAIRLNAASLKNSVDSGTAPLIKNLQEYETTYESSNSNTFKFASRDAGALGNSVGIFITDAGPDQIAVLPAPGTGNEW 159 (641) T ss_pred EEEEEEecCccccccccccchhhccccccccccccCcCccccEEEeccCCCcCCceEEEEEcCCCcceeeeecccccccc Confidence 9999998776665532211 1223455666665543221100000 Q ss_pred ---------cc-------cc------------------ceEEEeec--c-------ccCCcceee--------------- Q lcl|NC_016163. 125 ---------AS-------KN------------------AMKTILSG--G-------TAGETPLCF--------------- 146 (590) Q Consensus 125 ---------a~-------~~------------------~~~~~~~~--~-------t~~~~~~~~--------------- 146 (590) .. .. ......+. . ..+...... T Consensus 160 ~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~~~~~~~~~~a~~~~~~i~~~~~g~~g~~~~~~ 239 (641) T protein:vir:10 160 EFVADEAVTAASGASAKVYRYSIRLTLTNVVGTFVPGGATTISISGSDESVDVLAWDAGNKYLEIALPAGGVTGIFADAQ 239 (641) T ss_pred eeccceeeeeccCcccccccccccccccceeeecccCCcceeEecccccccccccccCCcceeeeeecCCcceeeeeeee Confidence 00 00 00000000 0 000000000 Q ss_pred ------------------------------EeecccccccccceEEEEeeccc------------------cc--c---- Q lcl|NC_016163. 147 ------------------------------IVPKGRGENYNGYGFRLSLRSDY------------------DN--T---- 172 (590) Q Consensus 147 ------------------------------~~~~~~g~~~~~~~~~~~~~~~~------------------~~--~---- 172 (590) +.....+.+............+. .. . T Consensus 240 ~~t~gt~~~t~a~~g~~~~~~~~~~~~~ia~aat~ag~~g~~~~v~~~~v~d~~~a~~~~~g~~~~~va~~~gts~~a~~ 319 (641) T protein:vir:10 240 VVTQGTNTAAIASSGIERRLYIGKDSGSINFAATDAVVDTNATSATISSVRNEYAEREYLPGSKWVNVAARPGTSLYANS 319 (641) T ss_pred eccCCccceeeecccchhhhhhccccccceeeeecccccccceeeEeeeeeeeecccccccccccccccccchhhhhhhh Confidence 00000000000000000000000 00 0 Q ss_pred ccccccceeeeeecccC-----CCceeeeeeeeeccccccccccccceeeeeeccccceeeecCccccceeeeeeccccc Q lcl|NC_016163. 
173 YNFRTYNLSVTVKDSTG-----ADVVVEGPYIVSFDPEAKDKSRQSIYYANIINKYSQYVEIVDNRSAFETISEFVVGDS 247 (590) Q Consensus 173 ~~~~~~~l~i~v~d~~~-----~~~v~e~~~~ls~~~da~~~~~~~~~~~~vv~~~s~~v~~~~~~~~~~~~~~~~~~~~ 247 (590) .......++..+.|.++ ..+++|.+.++++..+++...+...++.+++++.|.++...+.............. T Consensus 320 ~g~~~D~~~~lv~d~~~~~~g~~g~v~e~~~~~s~~~~~~~~~~~~~~~~~~~~~~s~~v~~~~~~~~~~~~~~~~~~-- 397 (641) T protein:vir:10 320 VGGVNDELHVLVIDVDGKITGNPGSVLERFIGVSKASDAKTSIGEVNYYKEVIKQQSAYVYWGSHETAPFLGTAANAA-- 397 (641) T ss_pred cCCcccceEEEEEeecceeeccccceeeeeecccccCCcccccccceeeeeeeccccceEEEeccccccccccccccc-- Confidence 01112223334444332 34688999999999998888888899999999999998776543221110000000 Q ss_pred ccCccccceecccccccccccccccccccceeecccccccccccccccccceeeeeccccccccccccceec-----cch Q lcl|NC_016163. 248 EADPQKVDIIFGQERAVTPAETIHANVVWKSSSVETDDPSYDATAANFNNIQYLTEGSEGTWTGGNEESALL-----VKG 322 (590) Q Consensus 248 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~ 322 (590) ............... .......................... ..+.+|.++..+. ... T Consensus 398 --~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~l~gG~d~~~~~~~~~~~~~ 459 (641) T protein:vir:10 398 --AGDWGASALNRRYNL--------LRSTAGTTSFPAGAVTVGSKNNATHY--------YRLANGADYSASGALYNLSNV 459 (641) T ss_pred --ccccccccccccccc--------cccccccccccccccccCCCCcceeE--------EEeecCcccccccccccccch Confidence 000000000000000 00000000000000000000000111 1122222222111 111 Q ss_pred hhHHHHHHHhhhccCCceeeeccc------chhHHHHHHHHHHHHhcCCeEEEEecCCCC---------CHHHHHHHHHh Q lcl|NC_016163. 323 YSGVLAPEILDKQQYEIDVLLDGN------NEVAVKNAMSDLCSEQRGDCIAILDCSFQG---------DAQQTIDYRTG 387 (590) Q Consensus 323 ~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~a~~~~~~~~~~~~~a~~d~p~~~---------~~~~~~~~~~~ 387 (590) .....+..+.+.+..++++++.+. ...+++.+++.||++ |+|||+|+|+|.+. ..+++++||.. 
T Consensus 460 ~~~tg~~~~~~~e~~~i~~l~~~~~~~~~~~~~~v~~~~i~~ce~-~~d~~ailD~p~~~~~~~~~~~~~~~~~~~~~~~ 538 (641) T protein:vir:10 460 DIATAYELIEDPESQVIDYVLSGPAGADEAAAIAKATTITTIVES-RKDCMAFLSPLRSDVIGVSNTTTVTENLVNYFNQ 538 (641) T ss_pred hHHHHHHHhhhhhhhccceeeecCCCCCcchhHHHHHHHHHHHHh-cCCEEEEEcCCcccccCCCchhhHHHHHHHHHhh Confidence 123345556666776777766543 346799999999976 56999999999764 24667788865 Q ss_pred hcCcccceEEEEcCeEEEeecccCceeeecHHHHHHHHHHHhhccCCceECcCCcccceeeccccceeecChhHHhhhhh Q lcl|NC_016163. 388 NISMSTYFTAIFGQHMNVYDEYNGETITVTSTYFLASMIPSNDDQNGIQWTFVGPRRGVISGFTDINFYPNEPWKEKLYL 467 (590) Q Consensus 388 ~~~~~s~~~~~~~p~~~~~d~~~~~~~~~ppsg~~AG~~A~~D~~~G~~~sPan~~~~~i~g~~~~~~~~~~~e~~~Ln~ 467 (590) +.+|+|+++||||++++||.+++.+++|||||+||+|||+|.+|||||||||.+++.|+|++++++.+++.||+.||+ T Consensus 539 --~~~s~yaa~y~P~~~v~dp~~~~~~~vPpsG~iAGv~ArtD~~rGvwkAPAn~~~~~i~gv~~l~~~~~~~e~~~Lnp 616 (641) T protein:vir:10 539 --LPSSNYVVFDSGYKYIYDKYNDVYRYVPCNGDIAGLILETGLEEEPWFSPAGFQRGVLRNAVKLAYSPNKTQRDRLYA 616 (641) T ss_pred --cCCCceEEEEeceeEeecccCCceeEecCCHHHHHHHHhhhccCCceECcCCcccceeeeeeeeeEecChhHHhhhhh Confidence 368999999999999999999999999999999999999999999999999999899999999999999999999999 Q ss_pred cCceEEEEecCCeEEEecceecCCCccc Q lcl|NC_016163. 468 AQVNYIERDPKKISFATQLTSQTSRSAL 495 (590) Q Consensus 468 ~gIn~i~~~~~~G~~~wG~rT~s~d~~~ 495 (590) +||||||.|||+|++- +. ....... T Consensus 617 ~gIN~ir~fpg~G~v~--~~-~~~~~~~ 641 (641) T protein:vir:10 617 NRINPVVSFPGHAMIN--NN-IAFHTKL 641 (641) T ss_pred cccceEEecCCceeec--ce-eeeeecC Confidence 9999999999999752 11 1111111 No 31 >protein:vir:10336 Length: 386 # NCBI annotation: ORF39 # Family: family:all:115 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758931;genbank:gi:27311206;genbank:GeneID:956116 Probab=100.00 E-value=1.5e-90 Score=513.05 Aligned_cols=365 Identities=15% Similarity=0.079 Sum_probs=280.6 Q ss_pred CccccCCceEEEEecCCCceecccccceeEEEeecCCC-----CCCccEEecCHHHHHHhcCCccccccccHHHHHHHHH Q lcl|NC_016163. 
1 MADYLHPSVSSRIVDNSAVYATAAGNTVLYAAIHSAIG-----RDNAVEFVTTTDEFLFKFGNPNLSKYGQTSYNILNWL 75 (590) Q Consensus 1 Mp~yl~PGVYveEi~s~~~~i~gv~Tsv~~~vg~~~~G-----p~~~p~~v~s~~e~~~~fG~~~~~~~~~l~~av~~ff 75 (590) ||+|++|||||+|+++++++++.++|.+.+|+|++..+ |.++|++++|+.++...||+. ..+.+++++|| T Consensus 1 M~~~~~~Gv~v~ev~~~~~~i~~v~tav~~~vg~a~~a~~~~~~~~~pv~i~s~~~~~~~~g~~-----~tl~~a~~~~~ 75 (386) T protein:vir:10 1 MAEQYLHGAEVVEIDNGARPIRTAQSGVIGLVGTAPDADATAFPLNTPVLIAGSRREAAKLGAG-----GTLPQAIDGIF 75 (386) T ss_pred CccccCCCeEEEEcCCCcccccccCcceeEEEEecCCCCCcccccccceEecchHHHHhhcCCC-----cchhHHHHHHh Confidence 99999999999999999999999999998888876543 889999999999999999962 45889999999 Q ss_pred cCCC-cEEEEEEecCCccccccccccceeecccccccceeeeeeccccccccccceEEEeeccccCCcceeeEeeccccc Q lcl|NC_016163. 76 QSGG-TAYVLRVMPDDAKFANSLISIKTTAAADPAKATVLVTAKAQTTNTASKNAMKTILSGGTAGETPLCFIVPKGRGE 154 (590) Q Consensus 76 ~nGG-~~~vvRv~~~~a~~a~~~~~~~~~~a~~~~~~~~~v~~~~~~~~~a~~~~~~~~~~~~t~~~~~~~~~~~~~~g~ 154 (590) +||| .||++++...+.. .. + .....+. T Consensus 76 ~~gg~~~~vv~~~~~~~~-----------------~~--------------------------t---------~~~~ig~ 103 (386) T protein:vir:10 76 DQTGAVVVVIRVDEGVDS-----------------AA--------------------------T---------QSNVIGK 103 (386) T ss_pred ccCceeEEEeeccccccc-----------------cc--------------------------c---------chhhhcc Confidence 9999 5999997321100 00 0 0000000 Q ss_pred ccccceEEEEeeccccccccccccceeeeeecccCCCceeeeeeeeeccccccccccccceeeeeeccccceeeecCccc Q lcl|NC_016163. 155 NYNGYGFRLSLRSDYDNTYNFRTYNLSVTVKDSTGADVVVEGPYIVSFDPEAKDKSRQSIYYANIINKYSQYVEIVDNRS 234 (590) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~l~i~v~d~~~~~~v~e~~~~ls~~~da~~~~~~~~~~~~vv~~~s~~v~~~~~~~ 234 (590) . .+.+.. + ..+. .+. 
T Consensus 104 ------------------~-----------~~~t~~----~--tgl~-----------------~l~------------- 118 (386) T protein:vir:10 104 ------------------V-----------DADTEQ----Y--TGIL-----------------ALL------------- 118 (386) T ss_pred ------------------c-----------ccccch----h--hhhH-----------------Hhh------------- Confidence 0 000000 0 0000 000 Q ss_pred cceeeeeecccccccCccccceecccccccccccccccccccceeecccccccccccccccccceeeeeccccccccccc Q lcl|NC_016163. 235 AFETISEFVVGDSEADPQKVDIIFGQERAVTPAETIHANVVWKSSSVETDDPSYDATAANFNNIQYLTEGSEGTWTGGNE 314 (590) Q Consensus 235 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 314 (590) . ... .. .. T Consensus 119 ----------------------------------~------~~~--------~~--------~~---------------- 126 (386) T protein:vir:10 119 ----------------------------------S------AEN--------TV--------KV---------------- 126 (386) T ss_pred ----------------------------------h------hcc--------cc--------cc---------------- Confidence 0 000 00 00 Q ss_pred cceeccchhhHHHHHHHhhhccCCceeeecc--cchhHHHHHHHHHHHHhcCCeEEEEecCCCCCHHHHHHHHHhhcCcc Q lcl|NC_016163. 315 ESALLVKGYSGVLAPEILDKQQYEIDVLLDG--NNEVAVKNAMSDLCSEQRGDCIAILDCSFQGDAQQTIDYRTGNISMS 392 (590) Q Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~a~~~~~~~~~~~~~a~~d~p~~~~~~~~~~~~~~~~~~~ 392 (590) .+.++.++ .+..++..++...|.+.+ ++.+.|++ +.+.+++.+|+.. +. T Consensus 127 -----------------------~p~i~~ap~~~~~~~v~~~l~~~~~~~~--~~~~~~~~-~~~~~~a~~~~~~---~~ 177 (386) T protein:vir:10 127 -----------------------QPRILIAPGFSNQKAVADQLVSVADTAA--WLCHSGWS-NTTDAAAITYREL---FG 177 (386) T ss_pred -----------------------cccccccccccchhHHHHHHHHhhcceE--EEEEeCCC-CCchHHHHHhhhc---cc Confidence 00000011 112344444444443332 56666665 5666788888865 47 Q ss_pred cceEEEEcCeEEEeecccCceeeecHHHHHHHHHHHhhccCCceECcCCcccceeecc-ccce--eecChhHHhhhhhcC Q lcl|NC_016163. 
393 TYFTAIFGQHMNVYDEYNGETITVTSTYFLASMIPSNDDQNGIQWTFVGPRRGVISGF-TDIN--FYPNEPWKEKLYLAQ 469 (590) Q Consensus 393 s~~~~~~~p~~~~~d~~~~~~~~~ppsg~~AG~~A~~D~~~G~~~sPan~~~~~i~g~-~~~~--~~~~~~e~~~Ln~~g 469 (590) |.|+++||||++++|+.++..+++|||+++||+|||+|.++|+||||||+.+.+|.|. ..+. ...++.|+++||++| T Consensus 178 s~~~~~~~p~~~v~~~~~~~~~~~p~s~~~ag~~a~~D~~~G~~~spaN~~l~gv~~~~~~~~~~~~~~~~~~~~l~~~g 257 (386) T protein:vir:10 178 SRRCEVVDPWYKVWDVETSAHIIQPPSARHAGVMAKVHNTLGFWWSNSNQEILGIDGLCRPVDFKLDDPTCRANLLNAKE 257 (386) T ss_pred ccceEEecCceeeeccccccceeechHHHHHHHHHHhhhcCCcEEccCCceeecccccceecccccccCcchhhhhhhcC Confidence 9999999999999999999999999999999999999999999999999875444444 1222 233577999999999 Q ss_pred ceEEEEecCCeEEEecceecCCCcccceehhhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhCCceE Q lcl|NC_016163. 470 VNYIERDPKKISFATQLTSQTSRSALSYINNVRVLLRIRREVEKMMADYRQEFQDNTTYDSMSYSLNNYLQQWVANRACS 549 (590) Q Consensus 470 In~i~~~~~~G~~~wG~rT~s~d~~~~~i~vrR~~~~i~~si~~~~~~~vfepn~~~l~~~v~~~i~~~L~~l~~~ga~~ 549 (590) |++++ +++|+++||+||++.|++|+||+|||||++|+++|+++++|+|||||++.||++|+++|++||++||++|+|. T Consensus 258 i~~~~--~~~G~~~wG~rT~~~d~~~~~i~vrR~~~~i~~~~~~~~~~~v~e~~~~~~~~~i~~~i~~~L~~l~~~g~l~ 335 (386) T protein:vir:10 258 VTTTI--QQNGFRVWGDRTCSADSKWAFKNVVITNDMIADSLVRNHLWAVDRNITKTYVEDVTEGVNNYLRHLKNIGAIA 335 (386) T ss_pred cEEEE--cCCCEEEEcccccCCCcccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCcee Confidence 99885 6899999999999999999999999999999999999999999999999999999999999999999999987 Q ss_pred E---EecCCCCCHHHhhCCEEEEEEEEEecCccceEEEEEEeeC Q lcl|NC_016163. 550 S---ISGTVYASDYDKQQSIARVKVELVFTGVIERIAIDLVVNK 590 (590) Q Consensus 550 ~---~~d~~~nt~~~i~~G~l~~~i~~ap~~paefi~~~~~~~~ 590 (590) | +||+++||+++|++|+|+++|+++|++|+|||+|++.++. 
T Consensus 336 g~~v~~d~~~nt~~~~~~G~~~~~i~~~p~~p~e~i~~~~~~~~ 379 (386) T protein:vir:10 336 GGECWVDPELNSPDQIQQGKVYFDYDFSAYAPAEHITFRSHMVN 379 (386) T ss_pred eeEEEEcccCCCHHHhhCCeEEEEEEEEecCCceeEEEEEEEeh Confidence 7 6999999999999999999999999999999999999999 No 32 >protein:vir:96740 Length: 388 # NCBI annotation: phage tail sheath protein FI-like # Family: family:all:115 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039839;genbank:gi:126010913;genbank:GeneID:5076250 Probab=100.00 E-value=2.5e-89 Score=506.36 Aligned_cols=359 Identities=13% Similarity=0.065 Sum_probs=284.8 Q ss_pred Cc---cccCCceEEEEecCCCceecccccceeEEEeecCCC----CCCccEEecCHHHHHHhcCCccccccccHHHHHHH Q lcl|NC_016163. 1 MA---DYLHPSVSSRIVDNSAVYATAAGNTVLYAAIHSAIG----RDNAVEFVTTTDEFLFKFGNPNLSKYGQTSYNILN 73 (590) Q Consensus 1 Mp---~yl~PGVYveEi~s~~~~i~gv~Tsv~~~vg~~~~G----p~~~p~~v~s~~e~~~~fG~~~~~~~~~l~~av~~ 73 (590) || +|+| ||||+|+++++++|+.++|++.+|++++..+ |.++|++|.++.|+.+.+|... ..+.+..+++. T Consensus 1 m~~~~~~~h-Gv~v~ev~~g~~~i~~~~tavi~~Vgta~~ad~~~p~~~~~~i~~~~d~~~~~~~~~--~~gtl~~al~~ 77 (388) T protein:vir:96 1 MPVIDQFEH-NGISIETHEPPPPMGPPGDNVVAWVVTAPDKHADVAFSVPFRVANTADAQYLDSTGN--ELGTGWHAASE 77 (388) T ss_pred CCCCCCCCC-ceEEEEcCCCcccccccCcceeEEEEecCCCccccccccceeeecchhhhhhhcccc--ccccchhhhHh Confidence 66 5775 9999999999999999999999999987543 8899999999999999998643 33668899999 Q ss_pred HHcCCC-cEEEEEEecCCccccccccccceeecccccccceeeeeeccccccccccceEEEeeccccCCcceeeEeeccc Q lcl|NC_016163. 
74 WLQSGG-TAYVLRVMPDDAKFANSLISIKTTAAADPAKATVLVTAKAQTTNTASKNAMKTILSGGTAGETPLCFIVPKGR 152 (590) Q Consensus 74 ff~nGG-~~~vvRv~~~~a~~a~~~~~~~~~~a~~~~~~~~~v~~~~~~~~~a~~~~~~~~~~~~t~~~~~~~~~~~~~~ 152 (590) ||++|| .|+|+|+..++...+ + T Consensus 78 ~~~~~~~~~~vv~v~~g~~~~a-------------------------------------------t-------------- 100 (388) T protein:vir:96 78 TLKKTSVPQYFIVVPEGADDAA-------------------------------------------T-------------- 100 (388) T ss_pred hhccCCceEEEEEecccccccc-------------------------------------------c-------------- Confidence 999999 599999832100000 0 Q ss_pred ccccccceEEEEeeccccccccccccceeeeeecccCCCceeeeeeeeeccccccccccccceeeeeeccccceeeecCc Q lcl|NC_016163. 153 GENYNGYGFRLSLRSDYDNTYNFRTYNLSVTVKDSTGADVVVEGPYIVSFDPEAKDKSRQSIYYANIINKYSQYVEIVDN 232 (590) Q Consensus 153 g~~~~~~~~~~~~~~~~~~~~~~~~~~l~i~v~d~~~~~~v~e~~~~ls~~~da~~~~~~~~~~~~vv~~~s~~v~~~~~ 232 (590) .. + ++. ..+..+ + T Consensus 101 ----------~a---------~---------iig----------------~~~~~t----------------------g- 113 (388) T protein:vir:96 101 ----------MA---------N---------IIG----------------GIDPTT----------------------G- 113 (388) T ss_pred ----------cc---------e---------eee----------------eccccc----------------------c- Confidence 00 0 000 000000 0 Q ss_pred cccceeeeeecccccccCccccceecccccccccccccccccccceeecccccccccccccccccceeeeeccccccccc Q lcl|NC_016163. 233 RSAFETISEFVVGDSEADPQKVDIIFGQERAVTPAETIHANVVWKSSSVETDDPSYDATAANFNNIQYLTEGSEGTWTGG 312 (590) Q Consensus 233 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 312 (590) . . T Consensus 114 ----------------------------------~--------~------------------------------------ 115 (388) T protein:vir:96 114 ----------------------------------R--------R------------------------------------ 115 (388) T ss_pred ----------------------------------h--------h------------------------------------ Confidence 0 0 Q ss_pred cccceeccchhhHHHHHHHhhhccCCceeeeccc--chhHHHHHHHHHHHHhcCCeEEEEecCCCCCHHHHHHHHHh--h Q lcl|NC_016163. 
313 NEESALLVKGYSGVLAPEILDKQQYEIDVLLDGN--NEVAVKNAMSDLCSEQRGDCIAILDCSFQGDAQQTIDYRTG--N 388 (590) Q Consensus 313 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~a~~~~~~~~~~~~~a~~d~p~~~~~~~~~~~~~~--~ 388 (590) .+ +..+.. ....+.+++.|+ +..+++.+++.+|++++ ||+++|+|.+.. +++.+|+.. . T Consensus 116 -----------~g--l~al~~-~~~~p~il~aPg~s~~~~v~~al~~~~~~~~--~~~i~D~p~~~~-~~~~~~~~~~~~ 178 (388) T protein:vir:96 116 -----------TG--IAALTE-CTERPTLIGAPGFSQNKAVIDALASMAKRLK--CRAVIDGPSGST-QDAIDLSGLLGG 178 (388) T ss_pred -----------hH--HHHhhh-cccceeEEEeeccccchHHHHHHHHHHhhcC--cEEEEeccCCch-hHHHHHHhhhhc Confidence 00 000000 000122333333 23468888999997654 999999996644 555666643 3 Q ss_pred cCcccceEEEEcCeEEEeecccCceeeecHHHHHHHHHHHhhccCCceECcCCcccceeeccc---cceeecChhHHhhh Q lcl|NC_016163. 389 ISMSTYFTAIFGQHMNVYDEYNGETITVTSTYFLASMIPSNDDQNGIQWTFVGPRRGVISGFT---DINFYPNEPWKEKL 465 (590) Q Consensus 389 ~~~~s~~~~~~~p~~~~~d~~~~~~~~~ppsg~~AG~~A~~D~~~G~~~sPan~~~~~i~g~~---~~~~~~~~~e~~~L 465 (590) .+++|.|+++||||++++|+.++..+++|||+++||+|||+| +||||||+.+. +.|+. +.....++.|++.| T Consensus 179 ~~~~s~~~~~~~P~~~~~d~~~~~~~~~p~s~~~AG~~a~~D----~~~spaN~~i~-i~g~~~~~~~~~~~~~~~~~~L 253 (388) T protein:vir:96 179 EGTGHDRVYMVDPMPAIYSRKAQGNIYVPPSTIAMGAVAAVK----PWESPGNQGVL-IQDVARVIDYNILDKSTEGDLL 253 (388) T ss_pred cCcCcceEEEEeCceeeecccCCceeeechHHHHHHHHHhhc----CcccccCeeEE-eeeecccccccccCChhhHHhh Confidence 357899999999999999999999999999999999999999 49999998863 55652 33445567899999 Q ss_pred hhcCceEEEEecCCeEEEecceecCCCcccceehhhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhC Q lcl|NC_016163. 
466 YLAQVNYIERDPKKISFATQLTSQTSRSALSYINNVRVLLRIRREVEKMMADYRQEFQDNTTYDSMSYSLNNYLQQWVAN 545 (590) Q Consensus 466 n~~gIn~i~~~~~~G~~~wG~rT~s~d~~~~~i~vrR~~~~i~~si~~~~~~~vfepn~~~l~~~v~~~i~~~L~~l~~~ 545 (590) |++|||||++|+++|+++||+||++ |+||||||||+||+++|+++++|+|||||++.||++|+++|+.||++||++ T Consensus 254 n~~gI~~i~~~~~~G~~~wG~rT~~----~~~i~vrR~~~~i~~si~~~~~~~v~epn~~~~~~~i~~~i~~fL~~l~~~ 329 (388) T protein:vir:96 254 NRNGVSYFARTSMGGFSLIGNRTVT----GKFISFVGLEDAIARKLEAASQRAMSKQLTKSFMEQEIKKINLFMQDLVAA 329 (388) T ss_pred hhcCceEEEEecCCcEEEEcccccC----CcceeehhhHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhC Confidence 9999999999999999999999986 999999999999999999999999999999999999999999999999999 Q ss_pred CceEE---EecCCCCCHHHhhCCEEEEEEEEEecCccceEEEEEEeeC Q lcl|NC_016163. 546 RACSS---ISGTVYASDYDKQQSIARVKVELVFTGVIERIAIDLVVNK 590 (590) Q Consensus 546 ga~~~---~~d~~~nt~~~i~~G~l~~~i~~ap~~paefi~~~~~~~~ 590 (590) |+|.| +||+++||++||++|+|+++|+++|++|+|||+||+..+. T Consensus 330 Gal~g~~~~~d~~~nt~~~i~~G~~~~~i~~~p~~pae~I~~~~~~~~ 377 (388) T protein:vir:96 330 EIIPGGEVYLHPTLNTVERYKNGSWYIVIDYGRYSPNEHMIFHLNAVD 377 (388) T ss_pred CceeeeEEEEecCCCCHHHhhCCEEEEEEEEEecCCcceEEEEEEEch Confidence 99977 6999999999999999999999999999999999999999 No 33 >protein:vir:5833 Length: 742 # NCBI annotation: similar to tail sheath protein # Family: family:all:661 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835619;genbank:gi:30044022 Probab=100.00 E-value=3.4e-75 Score=428.86 Aligned_cols=502 Identities=16% Similarity=0.140 Sum_probs=253.7 Q ss_pred CccccCC--------ceEEEEecCCCceecccccceeEEEeecCCCCCCccEEecCHH---HHHHhcCCccccccccHHH Q lcl|NC_016163. 1 MADYLHP--------SVSSRIVDNSAVYATAAGNTVLYAAIHSAIGRDNAVEFVTTTD---EFLFKFGNPNLSKYGQTSY 69 (590) Q Consensus 1 Mp~yl~P--------GVYveEi~s~~~~i~gv~Tsv~~~vg~~~~Gp~~~p~~v~s~~---e~~~~fG~~~~~~~~~l~~ 69 (590) .+||-.| |.|.|-|+.-..--.=|- ++-.++--...||. +-.|++ .|+..-|-+..-.|-. 
T Consensus 201 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~--- 272 (742) T protein:vir:58 201 FAEYGTPTSSLTLYKGFYLEGIDLNSFNKQFVV-SIENITVNREKGQV----LYPSFDVVVHFRDIRGVSANTEYIR--- 272 (742) T ss_pred ccccCCCccceeeeecccccccccCcccceeeE-EEeeeeecccCCce----eccceeEEEEEeeccCCCCCcccee--- Confidence 5556544 778777753221111111 12112222234553 222322 2444444332111110 Q ss_pred HHHHHHcCCCc-EEEEEEecCCccccccccccc-eeecc-cccccceeeeeeccccccccccceEEEeecc-ccCCccee Q lcl|NC_016163. 70 NILNWLQSGGT-AYVLRVMPDDAKFANSLISIK-TTAAA-DPAKATVLVTAKAQTTNTASKNAMKTILSGG-TAGETPLC 145 (590) Q Consensus 70 av~~ff~nGG~-~~vvRv~~~~a~~a~~~~~~~-~~~a~-~~~~~~~~v~~~~~~~~~a~~~~~~~~~~~~-t~~~~~~~ 145 (590) .|+-=-|--+ -||+||+.. ++...... ..+.+ -+..+ .+ ........ +.....+. T Consensus 273 -~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~g~~~~n~~-~~---------------~~~~~~~~~~~~~~~~s 331 (742) T protein:vir:58 273 -FRQVNLNPESPNYIERVIGN----MTFEFDGERIVTGGEYPNQV-PF---------------LRVVVSQDIKQNVAGVE 331 (742) T ss_pred -eeeeecCCCCcceeeecccc----eeeeeccceeeecccccccc-cc---------------eeeEeccccCcCcccee Confidence 0111112223 689998532 11110000 00000 00000 00 00000000 00000000 Q ss_pred eEeecccccccccceEEEEeeccccccccccccceeeeeecccCCCceeeeeeeeeccccccccccccceeeeeeccccc Q lcl|NC_016163. 146 FIVPKGRGENYNGYGFRLSLRSDYDNTYNFRTYNLSVTVKDSTGADVVVEGPYIVSFDPEAKDKSRQSIYYANIINKYSQ 225 (590) Q Consensus 146 ~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~l~i~v~d~~~~~~v~e~~~~ls~~~da~~~~~~~~~~~~vv~~~s~ 225 (590) ...+.+....+.-..+.+. .....+..+.+.+....+ .+.+...+. . .......+..+.+.... 
T Consensus 332 ~~~~~~~~~~~~v~d~~~~---------~~~~~~v~~~~t~~~~~p--p~~~~~~e~----v-~~ngG~~f~v~s~~~~g 395 (742) T protein:vir:58 332 KWVPVGFEGIYSVGDFTVI---------VNELTNVSIPVTDSAIIP--PMRFTRIEQ----I-TLSGGASFSVISNQPYG 395 (742) T ss_pred EEEeccccccccccceeee---------ccccccceeeccccccCC--cccccccce----e-ecccCcceEEEEecccC Confidence 0000000000000000000 000001111111111000 001111110 0 00011111111110000 Q ss_pred e-eeecCccccceeeeeecccccccCccccceeccccccc-----------ccccccc-cccccceeecccccccccccc Q lcl|NC_016163. 226 Y-VEIVDNRSAFETISEFVVGDSEADPQKVDIIFGQERAV-----------TPAETIH-ANVVWKSSSVETDDPSYDATA 292 (590) Q Consensus 226 ~-v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----------~~~~~~~-~~~~~~~~~~~~~~~~~~~~~ 292 (590) . +.... .. ... ........+.+.+... ....... .........++.+... T Consensus 396 ~~i~~~~--as-~~~---------s~ln~~~~V~Gt~aa~~~~d~~t~~~v~s~~~alp~~a~sv~laGG~dg~v----- 458 (742) T protein:vir:58 396 FNIQDSR--HS-YWL---------SPFKDDELIIGTELVLPALDVSTEFGVSSWEEALPEFSFLMPFQGGSDGYI----- 458 (742) T ss_pred cceeccC--cc-eEE---------eccCCceEEEeehhhccccccchheeccccccccceeeEEEeecCCccccc----- Confidence 0 00000 00 000 0000000000000000 0000000 0000000000100000 Q ss_pred cccccceeeeeccccccccccccceeccchhhHHHHHHHhhhccCCceeeecccc-hhHHHHHHHHHHHHhcCCeEEEEe Q lcl|NC_016163. 293 ANFNNIQYLTEGSEGTWTGGNEESALLVKGYSGVLAPEILDKQQYEIDVLLDGNN-EVAVKNAMSDLCSEQRGDCIAILD 371 (590) Q Consensus 293 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~a~~~~~~~~~~~~~a~~d 371 (590) . .........++.+.......+..+. ..+...+ .+.+++.|+. 
...++.++.++|+......++++| T Consensus 459 -------~-v~~~~~D~iG~~~~~d~~~adrTGL--~ALlev~--eVtILiAPG~t~~~v~aav~A~la~a~~Rl~vL~D 526 (742) T protein:vir:58 459 -------R-VDENEPDTIGRVKITPALLANYERL--LPLLTED--QFDLVLTPYLTFADHAGTVNAFINRAENRFLYLFD 526 (742) T ss_pred -------c-ccCCCcccccccccccccccchhHH--HHhhhcC--CCcEEEEcCCCchHHHHHHHHHHHhhcCCeEEEEe Confidence 0 0000000111111111122222222 2233322 3566766654 345666777777654433566778 Q ss_pred cCCCCCH-HHHHHHHHhhcCcccceEEEEcCeEEEeecccCceeeecHHHHHHHHHHHhhccCCceECcCCcccceeecc Q lcl|NC_016163. 372 CSFQGDA-QQTIDYRTGNISMSTYFTAIFGQHMNVYDEYNGETITVTSTYFLASMIPSNDDQNGIQWTFVGPRRGVISGF 450 (590) Q Consensus 372 ~p~~~~~-~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~ppsg~~AG~~A~~D~~~G~~~sPan~~~~~i~g~ 450 (590) +|.+.+. +++.+|+.. ++|.|+++||||+++.| ++..+++||||++||+|||+|.++|+|+||+|+. .+.+ T Consensus 527 ~P~~~tt~~~A~a~r~~---~nSsraaly~PwVkv~d--~~~~r~vPpSgaIAGL~ARtD~erGvw~SPANrg--ii~~- 598 (742) T protein:vir:58 527 IAGDDDTENLAISLAGY---INSSFATTFFPWVRRLT--NKGMRTVPASLAAYRSIRTTDPETGLAPVGARRG--VVTG- 598 (742) T ss_pred cCCCCchHHHHHHHHhc---cCCceEEEEeceeeecc--CCcceeechHHHHHHHHHHhccCCceEecCCcce--eeec- Confidence 8876553 556777754 47999999999999876 4778899999999999999999999999999963 2322 Q ss_pred ccceeecChhHHhhhhhcCceEEEEecCCeEEEecceec-CCCcccceehhhhHHHHHHHHHHHHHHHHhcCCCCHHHHH Q lcl|NC_016163. 
451 TDINFYPNEPWKEKLYLAQVNYIERDPKKISFATQLTSQ-TSRSALSYINNVRVLLRIRREVEKMMADYRQEFQDNTTYD 529 (590) Q Consensus 451 ~~~~~~~~~~e~~~Ln~~gIn~i~~~~~~G~~~wG~rT~-s~d~~~~~i~vrR~~~~i~~si~~~~~~~vfepn~~~l~~ 529 (590) ..+++.|++.||++||||||+| ++|+++||+||+ +.|++|+||||||||+||+++|+++++|+||||||+.||+ T Consensus 599 ----~~~s~se~d~LN~~GINtIrsf-G~G~rlWGnRTlassDs~wryInVRRlfd~Ie~SI~~a~q~~VfEPNd~~L~~ 673 (742) T protein:vir:58 599 ----EPVRQVDWEDLYNNRINPIVRV-GNDVLLFGQKTMLNVNSALNRINVRRLLIVMRNRISQILSSYLFENNTSENRL 673 (742) T ss_pred ----cccchhhHHHHhhCCceEEEEC-CCcEEEEcceecCCCCcccceEeehhhHHHHHHHHHHHHHHhccCCCCHHHHH Confidence 2457889999999999999997 689999999998 5699999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHhCCceEE---EecCCCCCHHHhhCCEEEEEEEEEecCccceEEEEEEeeC Q lcl|NC_016163. 530 SMSYSLNNYLQQWVANRACSS---ISGTVYASDYDKQQSIARVKVELVFTGVIERIAIDLVVNK 590 (590) Q Consensus 530 ~v~~~i~~~L~~l~~~ga~~~---~~d~~~nt~~~i~~G~l~~~i~~ap~~paefi~~~~~~~~ 590 (590) +|+++|++||++||++|+|.| +||+ +||++||++|+|+++|+++|++|||||+|||+++| T Consensus 674 sIk~sInafL~~L~aqGALlGfrV~lDe-tNTpeDI~~Gklvv~I~vAP~~PAEfI~lrf~it~ 736 (742) T protein:vir:58 674 RAEALVRQYLESLRLRGAVTDYEVAIDS-VTTPTDIDNNTLRARVTVQPARSIEYIDITFVITP 736 (742) T ss_pred HHHHHHHHHHHHHHhCCceeeeEEEEcC-CCCHHHhhCCEEEEEEEEEccCCcceEEEEEEEEe Confidence 999999999999999999876 5995 58899999999999999999999999999999999 No 34 >protein:vir:63742 Length: 562 # NCBI annotation: Tsh # Family: family:all:2449 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547629;genbank:GeneID:3783475 Probab=100.00 E-value=8.2e-70 Score=399.33 Aligned_cols=531 Identities=11% Similarity=0.061 Sum_probs=325.9 Q ss_pred CccccCCceEEEEecCCCceecccccceeEEEeecCCCCCCccEEecCHHHHHHhcCCccccccccHHHHHHHHHcCCC- Q lcl|NC_016163. 1 MADYLHPSVSSRIVDNSAVYATAAGNTVLYAAIHSAIGRDNAVEFVTTTDEFLFKFGNPNLSKYGQTSYNILNWLQSGG- 79 (590) Q Consensus 1 Mp~yl~PGVYveEi~s~~~~i~gv~Tsv~~~vg~~~~Gp~~~p~~v~s~~e~~~~fG~~~~~~~~~l~~av~~ff~nGG- 79 (590) |--|++|||||||.+|+.+++++++|++++|+|.+++||+++|++|+||+||++.|||..+. 
..+.+|+..||.||| T Consensus 8 ~~~~~~Pgv~~~~~~s~~~~~~~~~~~~~~~iG~a~~G~~~~~~~~~~~~~~~~~fg~g~l~--~~i~~a~~~~~~~g~~ 85 (562) T protein:vir:63 8 RKPVSRPHTEISVDTSGIGGSSSGSEKILCLVGSATGGKPNAVYKVRNYSQAKSVFRSGELL--DAIERAWNPGEGTGAG 85 (562) T ss_pred CCcccCCceEEEEecCCCcccCCCCCceEEEEEEeCCCCCceeEEEccHHHHHHHhcCCchH--HHHHHhccccccCCce Confidence 77899999999999999999999999999999999999999999999999999999984322 224555555668988 Q ss_pred cEEEEEEecCCccccccccccceeecccccccceeeeeeccccccccccceEEEeeccccCCcceeeEeecccccccccc Q lcl|NC_016163. 80 TAYVLRVMPDDAKFANSLISIKTTAAADPAKATVLVTAKAQTTNTASKNAMKTILSGGTAGETPLCFIVPKGRGENYNGY 159 (590) Q Consensus 80 ~~~vvRv~~~~a~~a~~~~~~~~~~a~~~~~~~~~v~~~~~~~~~a~~~~~~~~~~~~t~~~~~~~~~~~~~~g~~~~~~ 159 (590) +||+||| ++++.++.....+.+.+..+|.|.+.++++.........+...+........+. .++-|. -+ T Consensus 86 ~~~~~rv--~~a~~a~~~~~~~~~~a~~~g~~~n~i~v~~~~~~~~~~~~~~v~~~~~~~~ev------~~~~g~---V~ 154 (562) T protein:vir:63 86 DILAMRV--EEAKEATFEAEGVKVSSTIYGADANDIQVALEDNTITGTKRLSIVFAKERVNQV------YDNLGS---IF 154 (562) T ss_pred EEEEEEc--CCCccceeEecceeEEEeecccCCCeEEEEEecCCCCCCcceEEEecCCCcchh------hhhccc---ee Confidence 6999999 778888888888888999999999888887655443333333333222221111 000000 00 Q ss_pred eEEEEeeccccc--cc--cccccceeeeeecccCCCceeeeeeeeeccccccccccccceeeeeecccccee-eecCccc Q lcl|NC_016163. 160 GFRLSLRSDYDN--TY--NFRTYNLSVTVKDSTGADVVVEGPYIVSFDPEAKDKSRQSIYYANIINKYSQYV-EIVDNRS 234 (590) Q Consensus 160 ~~~~~~~~~~~~--~~--~~~~~~l~i~v~d~~~~~~v~e~~~~ls~~~da~~~~~~~~~~~~vv~~~s~~v-~~~~~~~ 234 (590) +++..-+..... .+ ......+.+..+. +..++.. +..-+..... .......++....+. ...+... 
T Consensus 155 ~i~y~g~~~~~~~~v~~~~~~~~a~~l~~~~--g~~~v~~-~~L~~g~~~~------~~~l~~~in~~~~~~aky~~~~g 225 (562) T protein:vir:63 155 SIKYKGTEASATFTVAVDPVTFKATKLTLKA--GDKTVKE-YDLGSGAYAE------TNVLISDINNLPDFEAKFFPIGD 225 (562) T ss_pred eeeeecccccceEEEEecCcceeEEEEEeec--CCcceeE-EEecCCccch------hHHHHHhhccccceEEEeeccCC Confidence 000000000000 00 0000001111111 1111111 1100000000 000000011000000 0000000 Q ss_pred cceeeeeecccccccCccccceecccccccccccccccccccceeecccccccccccccccccceeeeeccccccccccc Q lcl|NC_016163. 235 AFETISEFVVGDSEADPQKVDIIFGQERAVTPAETIHANVVWKSSSVETDDPSYDATAANFNNIQYLTEGSEGTWTGGNE 314 (590) Q Consensus 235 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 314 (590) -......... ....+.. ...... .+..... .. ..... .... ................+++|.+ T Consensus 226 n~i~~~~~d~------~~~~~vk--t~~~~v--~t~~~d~--~~-~~~~~-~~v~---~~~~~~~~la~~~~~~LtGG~d 288 (562) T protein:vir:63 226 KNLTTDNFDA------QIDVDIK--TKEAYV--KAVGGDI--EK-QTAYN-GYVD---FEFDRSKEIANFPLTKLTGGDN 288 (562) T ss_pred ceeeeecccc------ccccchh--hhhhhh--hhhhhhh--hh-ccccc-ceee---eeeccccceecccceeeecCCC Confidence 0000000000 0000000 000000 0000000 00 00000 0000 0000111122333456667777 Q ss_pred cceeccchhhHHHHHHHhhhccCCceeeecccchhHHHHHHHHHHHHhc---CCeEEEEecCCCCCHHHHHHHHHhhcCc Q lcl|NC_016163. 315 ESALLVKGYSGVLAPEILDKQQYEIDVLLDGNNEVAVKNAMSDLCSEQR---GDCIAILDCSFQGDAQQTIDYRTGNISM 391 (590) Q Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~---~~~~a~~d~p~~~~~~~~~~~~~~~~~~ 391 (590) +..... ...+...+ +..+...+++...+.+++.++.+||.++| +.++++++.+.+.+++++...... 
+ T Consensus 289 Gt~~~~---~~~al~al---e~~~~~~i~~~t~d~av~~~l~a~vkr~~~~g~~~~aVlg~~~~~~~~~~~~~a~~---~ 359 (562) T protein:vir:63 289 GTIPES---WADKFSYF---ANEGGYYLVPLTSKQAVHAEALQFVRDCSYNGNPMRVFVGGGIGESMEQLFTRAIG---L 359 (562) T ss_pred CCchhh---HHHHHHHH---HhCCcEEEEecCCCHHHHHHHHHHHHHHHhCCCcEEEEecCCCCCCHHHHHHHhhh---c Confidence 643322 11122222 23344556666667778888888886543 348999999999888888775543 4 Q ss_pred ccceEEEEcCeEEEeecccCceeeecH---HHHHHHHHHHhhccCCceECcCCcccceeeccccceeecChhHHhhhhhc Q lcl|NC_016163. 392 STYFTAIFGQHMNVYDEYNGETITVTS---TYFLASMIPSNDDQNGIQWTFVGPRRGVISGFTDINFYPNEPWKEKLYLA 468 (590) Q Consensus 392 ~s~~~~~~~p~~~~~d~~~~~~~~~pp---sg~~AG~~A~~D~~~G~~~sPan~~~~~i~g~~~~~~~~~~~e~~~Ln~~ 468 (590) ++.+.++++|+....+. +++.+..|+ ++++||++|..| +++||.|+. +. ..++...+++.|++.|+++ T Consensus 360 n~ervv~v~~~~~~~~~-~~~~~~~~~~~~aa~vAGl~A~~~----~~~SlT~~~---i~-~~~v~~~~t~~e~~~li~~ 430 (562) T protein:vir:63 360 QNERAGLIGFSGTVKMD-DGRSLKMPGYMFAAQVAGLTCGLE----IGEAITFKN---IA-IETLDTIYEGSQLDQLNES 430 (562) T ss_pred CCCcEEEEecCeeEECC-CCceeeechhHHHHHHHHHhhcCc----hhcCcccee---ec-cccccccCCHHHHHHHHhC Confidence 88999999999876654 566666776 789999999877 789999954 54 5677888999999999999 Q ss_pred CceEEEEecCCeEEEecc------eecCCCcccceehhhhHHHHHHHHHHHHH-HHHhcCCCCHHHHHHHHHHHHHHHHH Q lcl|NC_016163. 469 QVNYIERDPKKISFATQL------TSQTSRSALSYINNVRVLLRIRREVEKMM-ADYRQEFQDNTTYDSMSYSLNNYLQQ 541 (590) Q Consensus 469 gIn~i~~~~~~G~~~wG~------rT~s~d~~~~~i~vrR~~~~i~~si~~~~-~~~vfepn~~~l~~~v~~~i~~~L~~ 541 (590) |+++++.+.+++.++|.. 
+|...++.|++|+++|++|+|++.|++.+ +||+++||+..+|..|+..|..||.+ T Consensus 431 Gv~~l~~~~~~~v~~~~iv~~itT~t~~~~~~~~ki~viRv~D~i~~dir~~~~~~yiGk~Nn~~~r~~v~~~i~~~L~~ 510 (562) T protein:vir:63 431 GIITAEFVRNRAVTNFRIVDDVTTFNDKTDPVKSEIGVGEANDFLVSELKISLDNEYIGTKIIDTSASLVKNFVQSFLDR 510 (562) T ss_pred CeEEEEEecCCcEEEEEeeccceecCCCCCchhhhhhhhHHHHHHHHHHHHHHHhcCCccccChHHHHHHHHHHHHHHHH Confidence 999999998888777753 22355789999999999999999998775 59999999999999999999999999 Q ss_pred HHhCCceEEEecCCCCCHHHhhCCEEEEEEEEEecCccceEEEEEEeeC Q lcl|NC_016163. 542 WVANRACSSISGTVYASDYDKQQSIARVKVELVFTGVIERIAIDLVVNK 590 (590) Q Consensus 542 l~~~ga~~~~~d~~~nt~~~i~~G~l~~~i~~ap~~paefi~~~~~~~~ 590 (590) ||++|+|++...+ +-+.+++.+++++++.++|+.|+|||.+++++.. T Consensus 511 l~~~gaI~~~~~~--dv~v~~~~d~~~v~~~v~pv~~mekIy~ti~~~~ 557 (562) T protein:vir:63 511 KKLAKEIQDYSPE--EVQVVIEGDVARISLTVFPIRSMKKIEVSLVYRQ 557 (562) T ss_pred HHhCCcccCCCcc--ceEEEecCCEEEEEEEEEEcccceEEEEEEEEee Confidence 9999999876432 2344467889999999999999999999999988 No 35 >protein:vir:80488 Length: 562 # NCBI annotation: Tsh # Family: family:all:2449 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468473;genbank:gi:157325048;genbank:GeneID:5601446 Probab=100.00 E-value=2.4e-67 Score=385.82 Aligned_cols=529 Identities=11% Similarity=0.057 Sum_probs=324.4 Q ss_pred CccccCCceEEEEecCCCceecccccceeEEEeecCCCCCCccEEecCHHHHHHhcCCccccccccHHHHHHHHHcCCC- Q lcl|NC_016163. 1 MADYLHPSVSSRIVDNSAVYATAAGNTVLYAAIHSAIGRDNAVEFVTTTDEFLFKFGNPNLSKYGQTSYNILNWLQSGG- 79 (590) Q Consensus 1 Mp~yl~PGVYveEi~s~~~~i~gv~Tsv~~~vg~~~~Gp~~~p~~v~s~~e~~~~fG~~~~~~~~~l~~av~~ff~nGG- 79 (590) |--|.+|||||||.+|+.++++++++++.+|+|.+++||+++|++|+||+||++.||+.++. 
..+.+|+..||.||| T Consensus 8 ~~~~~~pgv~~~~~~s~~~~~~~~~~~~~~~ig~a~~G~~~~~~~~~~~~~~~~~f~~g~l~--~~i~~a~~~~~~~g~~ 85 (562) T protein:vir:80 8 RKPVSRPHTEISVDTSGIGGSSSGSEKILCLVGSATGGKPNAVYKVRNYSQAKSVFRSGELL--DAIERAWNPGEGTGAG 85 (562) T ss_pred CCcccCCceEEEEecCCcccCCCCCCceEEEEEEeCCCCcceeEEEccHHHHHHHhcCCChH--HHHHHhcccccccCce Confidence 44688999999999999999999999999999999999999999999999999999984433 235667777778988 Q ss_pred cEEEEEEecCCccccccccccceeecccccccceeeeeeccccccccccceEEEeeccccCCcceeeEeecccccccccc Q lcl|NC_016163. 80 TAYVLRVMPDDAKFANSLISIKTTAAADPAKATVLVTAKAQTTNTASKNAMKTILSGGTAGETPLCFIVPKGRGENYNGY 159 (590) Q Consensus 80 ~~~vvRv~~~~a~~a~~~~~~~~~~a~~~~~~~~~v~~~~~~~~~a~~~~~~~~~~~~t~~~~~~~~~~~~~~g~~~~~~ 159 (590) +||+||| ++++.++.....+.+.+...|.+.+.++++.........+...+....+...+. .++.|. T Consensus 86 ~~~~~rv--~~a~~a~~~~~~~~~~~~~~g~~~n~i~v~~~~~~~~~~~~~~v~~~~~~~~ev------~~~~g~----- 152 (562) T protein:vir:80 86 DILAMRV--EEAKEATFEAEGVKVSSTIYGADANDIQVALEDNTITGTKRLSIVFAKERVNQV------YDNLGS----- 152 (562) T ss_pred EEEEEEc--CCCCcceEEecceEEEEeecccCCCceEEEEecCCCCCCcceEEEecCCcceEE------eeccCc----- Confidence 6999999 667777777778888888888888888777654433333333222222221111 000110 Q ss_pred eEEEEeeccccc-ccc-----ccccceeeeeecccCCCceeeeeeeeeccccccccccccceeeeeeccccceee-ecCc Q lcl|NC_016163. 160 GFRLSLRSDYDN-TYN-----FRTYNLSVTVKDSTGADVVVEGPYIVSFDPEAKDKSRQSIYYANIINKYSQYVE-IVDN 232 (590) Q Consensus 160 ~~~~~~~~~~~~-~~~-----~~~~~l~i~v~d~~~~~~v~e~~~~ls~~~da~~~~~~~~~~~~vv~~~s~~v~-~~~~ 232 (590) .+.+........ ... .....+.+.... +..++. .+..-+..... .......++....... ..+. 
T Consensus 153 v~~i~y~g~~~~a~~~i~~~~~~~~a~~l~~~~--g~~~v~-~~~l~~g~~~~------~~~l~~~i~~~~~~tAky~g~ 223 (562) T protein:vir:80 153 IFSIKYKGTEASATFTVAVDPVTFKATKLTLKA--GDKTVK-EYDLGSGAYAE------TNVLISDINNLPDFEAKFFPI 223 (562) T ss_pred eeeeeeccccccceeEEEecCccceEEEEEEec--CCccee-EEEeCCCccch------hhhhhhhhccccceEEEeccc Confidence 111110000000 000 000001111111 111111 11100000000 0000000110000000 0000 Q ss_pred cccceeeeeecccccccCccccceecccccccccccccccccccceeecccccccccccccccccceeeeeccccccccc Q lcl|NC_016163. 233 RSAFETISEFVVGDSEADPQKVDIIFGQERAVTPAETIHANVVWKSSSVETDDPSYDATAANFNNIQYLTEGSEGTWTGG 312 (590) Q Consensus 233 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 312 (590) ..-........... ..+.......+.. ...+. .. ..... .... ................++|| T Consensus 224 ~~n~i~~~~~d~~~-~~~~kt~~~~v~~---------~~~d~--~~-~~~~n-~~v~---~~~~~~~~la~~~~~~LtGG 286 (562) T protein:vir:80 224 GDKNLTTDNFDAQI-DVDIKTKEAYVKA---------VGGDI--EK-QTAYN-GYVE---FEFDRSKEIANFPLTKLTGG 286 (562) T ss_pred CCceeeecccccch-hhhcccceeeeee---------hhhhh--hh-ccccc-ceEE---EEeccCccccccceeeeeCC Confidence 00000000000000 0000000000000 00000 00 00000 0000 00001111222334566677 Q ss_pred cccceeccchhhHHHHHHHhhhccCCceeeecccchhHHHHHHHHHHHHhc---CCeEEEEecCCCCCHHHHHHHHHhhc Q lcl|NC_016163. 313 NEESALLVKGYSGVLAPEILDKQQYEIDVLLDGNNEVAVKNAMSDLCSEQR---GDCIAILDCSFQGDAQQTIDYRTGNI 389 (590) Q Consensus 313 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~---~~~~a~~d~p~~~~~~~~~~~~~~~~ 389 (590) .++...... ..+...+ +..+...+++...+.+++.++.+||.++| +.++++++.+.+.+++++...... 
T Consensus 287 ~dG~~~~~~---~dal~~L---e~~~~~~i~~~t~d~ai~~~~~a~vkr~r~~g~~~~aVvg~~~~~~~~~~~~~a~~-- 358 (562) T protein:vir:80 287 DNGTIPESW---ADKFSYF---ANEGGYYLVPLTSKQAVHAEALQFVRDCSYNGNPMRVFVGGGIGESMEQLFTRAIG-- 358 (562) T ss_pred CCCCccccH---HHHHHHH---HhCCcEEEEecCCChHHHHHHHHHHHHHHhCCCeEEEEecCCCCCCHHHHHHHhhh-- Confidence 776543221 1122222 23344555566667788888888886654 248999999999998888776654 Q ss_pred CcccceEEEEcCeEEEeecccCceeeecH---HHHHHHHHHHhhccCCceECcCCcccceeeccccceeecChhHHhhhh Q lcl|NC_016163. 390 SMSTYFTAIFGQHMNVYDEYNGETITVTS---TYFLASMIPSNDDQNGIQWTFVGPRRGVISGFTDINFYPNEPWKEKLY 466 (590) Q Consensus 390 ~~~s~~~~~~~p~~~~~d~~~~~~~~~pp---sg~~AG~~A~~D~~~G~~~sPan~~~~~i~g~~~~~~~~~~~e~~~Ln 466 (590) +++.+.++++|+..+.+. +++.+..|| ++++||++|..| +++||.|+. +.+ .++...+++.|++.|+ T Consensus 359 -~n~e~vv~v~~~~~~~~~-~~~~~~~~~~~~aa~vAGl~Ag~~----~~~S~T~~~---i~~-~~v~~~lt~~e~~~li 428 (562) T protein:vir:80 359 -LQNERAGLIGFSGTVKMD-DGRSLKMPGYMFAAQVAGLTCGLE----IGEAITFKN---IAI-ETLDTIYEGSQLDQLN 428 (562) T ss_pred -cCCCeEEEEecCeeEECC-CCceeeechhHHHHHHHHHHhcCc----cccCcccee---ecc-ccccccCCHHHHHHHH Confidence 478899999998876654 455555566 899999999887 788999954 554 4777889999999999 Q ss_pred hcCceEEEEecCCeEEEec----ceec--CCCcccceehhhhHHHHHHHHHHHHH-HHHhcCCCCHHHHHHHHHHHHHHH Q lcl|NC_016163. 467 LAQVNYIERDPKKISFATQ----LTSQ--TSRSALSYINNVRVLLRIRREVEKMM-ADYRQEFQDNTTYDSMSYSLNNYL 539 (590) Q Consensus 467 ~~gIn~i~~~~~~G~~~wG----~rT~--s~d~~~~~i~vrR~~~~i~~si~~~~-~~~vfepn~~~l~~~v~~~i~~~L 539 (590) ++|+++++.+.+++.++|. -+|. 
..++.|++|++||++|+|.+.|++.+ +||+++||+...|..|+..|..|| T Consensus 429 ~~G~l~l~~~~~~~v~~~riv~~itT~t~~~~~~~~ki~viRv~D~i~~dir~~~~~~yIGk~Nn~~~r~~v~~~i~~~L 508 (562) T protein:vir:80 429 ESGIITAEFVRNRAVTNFRIVDDVTTFNDKTDPVKSEIGVGEANDFLVSELKISLDNEYIGTKIIDTSASLVKNFVQSFL 508 (562) T ss_pred hCCeEEEEEecCCcEEEEEeeccceeccCCCCchhhhhhhhHHHHHHHHHHHHHHHhcCCccccChHHHHHHHHHHHHHH Confidence 9999999999888777772 2333 45789999999999999999998876 699999999999999999999999 Q ss_pred HHHHhCCceEEEecCCCCCHHHhhCCEEEEEEEEEecCccceEEEEEEeeC Q lcl|NC_016163. 540 QQWVANRACSSISGTVYASDYDKQQSIARVKVELVFTGVIERIAIDLVVNK 590 (590) Q Consensus 540 ~~l~~~ga~~~~~d~~~nt~~~i~~G~l~~~i~~ap~~paefi~~~~~~~~ 590 (590) .+||++|+|.+..++ +-+.++++++++|++.++|+.|+|||.+++++.. T Consensus 509 ~~l~~~gaI~~~~~~--dv~v~~~~d~~~v~~~v~Pv~~mekIy~ti~~~~ 557 (562) T protein:vir:80 509 DRKKLAKEIQDYSPE--EVQVVIEGDIARISLTVFPIRSMKKIEVSLVYRQ 557 (562) T ss_pred HHHHhCCcccCCCcc--ceEEEecCCEEEEEEEEEEcccceEEEEEEEEEe Confidence 999999999876432 2344577889999999999999999999999988 No 36 >protein:vir:102819 Length: 648 # NCBI annotation: tail sheath protein # Family: family:all:2449 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874082;genbank:gi:118197689;genbank:GeneID:4496011 Probab=100.00 E-value=2.7e-66 Score=380.10 Aligned_cols=546 Identities=12% Similarity=0.019 Sum_probs=292.4 Q ss_pred Cc--ccc------CCceEEEEecCCCceecccccceeEEEeecCCCCCCccEEecCHHHHHHhcCCccccccccHHHHHH Q lcl|NC_016163. 1 MA--DYL------HPSVSSRIVDNSAVYATAAGNTVLYAAIHSAIGRDNAVEFVTTTDEFLFKFGNPNLSKYGQTSYNIL 72 (590) Q Consensus 1 Mp--~yl------~PGVYveEi~s~~~~i~gv~Tsv~~~vg~~~~Gp~~~p~~v~s~~e~~~~fG~~~~~~~~~l~~av~ 72 (590) || .|. 
+|||||||+||+.++|+||+|++++|+|.+++||+++|++|+||.||++.||| ++|+|||+ T Consensus 1 ma~~~yf~~~~~~~PGVyvee~~sg~~~i~gv~tsva~fvG~a~~Gp~~~p~~v~s~~~~~~~fgg------g~l~~av~ 74 (648) T protein:vir:10 1 MAISVYFDGKLIKQLGAYVKTDLSAVKQINGVGTGIVALLGLAEGGETYKPYRLTSFAEAVSIFKG------GPLLEHIK 74 (648) T ss_pred CeeeeeeCCCCccCCceEEEEeccccccccCCCCceEEEEEeeCCCCCceeEEecCHHHHHHHhcC------ccHHHHHH Confidence 76 363 49999999999999999999999999999999999999999999999999997 56999999 Q ss_pred HHHcCCC-cEEEEEEecCCccccccccccceeecccccccceeeeeeccccccccccceEEEeeccccCCcceeeEeecc Q lcl|NC_016163. 73 NWLQSGG-TAYVLRVMPDDAKFANSLISIKTTAAADPAKATVLVTAKAQTTNTASKNAMKTILSGGTAGETPLCFIVPKG 151 (590) Q Consensus 73 ~ff~nGG-~~~vvRv~~~~a~~a~~~~~~~~~~a~~~~~~~~~v~~~~~~~~~a~~~~~~~~~~~~t~~~~~~~~~~~~~ 151 (590) +||+||| +||+|||.+ ++.++........++..++.|.+.+....... ++.........+.... T Consensus 75 ~~F~nGg~~~~~vRv~~--~~~a~~~~~~~~~~a~~~g~~gn~i~~~v~~~-------------~~~~~~~~~l~v~~~~ 139 (648) T protein:vir:10 75 AAFIGGAGEVVAVRIGN--PTTASVSIPVAQNTSDTSPANLNFVSYEASTR-------------SNQIYVSFDLDENFTS 139 (648) T ss_pred HHHhCCCcEEEEEEcCC--CcccceecceeEEeecccCCCCCceEEEEEEc-------------CCCcCceeEEEEEecC Confidence 9999999 599999943 44454455556666666676666544222111 1111111111111111 Q ss_pred cccccccceEEEEee-ccccccccccccceeeeeecccCCCceeeeeeeeeccccc----cccccccceeeeeeccccce Q lcl|NC_016163. 152 RGENYNGYGFRLSLR-SDYDNTYNFRTYNLSVTVKDSTGADVVVEGPYIVSFDPEA----KDKSRQSIYYANIINKYSQY 226 (590) Q Consensus 152 ~g~~~~~~~~~~~~~-~~~~~~~~~~~~~l~i~v~d~~~~~~v~e~~~~ls~~~da----~~~~~~~~~~~~vv~~~s~~ 226 (590) .+..+.++.+.+... ..+.+. .......+.+.+ ..+..++..+....+. ........+...+++..... 
T Consensus 140 ~~~~~d~~v~~i~~~~~~y~gt----~~~~t~~v~~~~--~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~v~~~~~~ 213 (648) T protein:vir:10 140 ANEADDTIIFTIYQKHPDFSVT----RETFTFPRKFTT--PTVLVKRGSTLFFVDRSIVNAALAAGPAFQTALINLLKEQ 213 (648) T ss_pred CCcccceeEEEeccCCCccccc----ceeccccccccc--cccccccccceeecCccchhhhhccCccchhhhhhchhhh Confidence 222222222222100 000000 000000000000 0000011000000000 00000000111111100000 Q ss_pred e----ee----------cCccccceeeeeecccccccCccccceecccccc--------ccccccc-------------c Q lcl|NC_016163. 227 V----EI----------VDNRSAFETISEFVVGDSEADPQKVDIIFGQERA--------VTPAETI-------------H 271 (590) Q Consensus 227 v----~~----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~-------------~ 271 (590) . .. .+............................-+.. .....+. . T Consensus 214 ~~~~~~~~~~~~s~~~~~d~~~~~~~~~a~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~~tp~~~~~~~~~~~~~~ 293 (648) T protein:vir:10 214 LQPTDVVQIFDASDTNPVDIPLGLFVYEVLYGGLFGFTKSRLVKTSFGTVDDLLSNPLLFNLSATPFFDGSDYQDYTSLS 293 (648) T ss_pred hhhhhhheecccccccccccccccccccccchhhhcCCcchhhhhhhccccccccccceecccccccccccceeeeeccc Confidence 0 00 0000000000000000000000000000000000 0000000 0 Q ss_pred cccccceeecccccccccccccccccceeeeeccccccccccccceeccch------hhHHHHHHHhhhccCCceeeec- Q lcl|NC_016163. 272 ANVVWKSSSVETDDPSYDATAANFNNIQYLTEGSEGTWTGGNEESALLVKG------YSGVLAPEILDKQQYEIDVLLD- 344 (590) Q Consensus 272 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~- 344 (590) ....+..............+... . .....--.++||+++..+..-. 
...-+.+.+......+...+++ T Consensus 294 ~~~~~~~v~~~~~~~l~~~~~~p-~----~~~~~~t~L~GGtdG~~p~s~~~~~~~~~~~d~~d~l~~~~~~~~~~ivp~ 368 (648) T protein:vir:10 294 DPANWFAKDAYTINHLVDTTINP-H----ILATRIFSLSGGTNGDDGTGYYQTAVSNYINIWSQGLATLEEEEVNFVIPA 368 (648) T ss_pred cccceeeeeccchhhcccccccC-c----ccccccceecccccCCCcccccccccccchhhHHHHhhhccCCCceEEEee Confidence 00000000000000000000000 0 0000001366788887763311 1112223333222323323322 Q ss_pred ------------ccchhHHHHHHHHHHHHh--------cCCeEEEEecCCCCCHHHHHHHHHhhcCccc----------- Q lcl|NC_016163. 345 ------------GNNEVAVKNAMSDLCSEQ--------RGDCIAILDCSFQGDAQQTIDYRTGNISMST----------- 393 (590) Q Consensus 345 ------------~~~~~~~~~a~~~~~~~~--------~~~~~a~~d~p~~~~~~~~~~~~~~~~~~~s----------- 393 (590) ....++++..++.||.++ |...++++.++++.+..++..-+... .+++ T Consensus 369 ~~~~~~~~~~~~lt~~q~i~a~a~shv~~~s~~~~~~~r~~~~~~vg~~~~es~~~se~~~~~~-~~~~~~a~~~~~d~~ 447 (648) T protein:vir:10 369 YKFTNVTQLNDRLTIFKGIASTFLSHVQTMSQVNRRKARVGVFGLPAPSPNESVTASEYLYNRN-ILNTISAMFGGTDRA 447 (648) T ss_pred cccccccccccccCCccchHHHHHHHHHHhhhccccccccCeEEEeCCCCchhHHHHHHHhhhh-cccccceeeeecCCc Confidence 234567888888888643 21236666666666654432222221 1122 Q ss_pred ceEEEEcCeEEEeecccCceeeecH---HHHHHHHHHHhhccCCceECcCCcccceeecc-ccceeecChhHHhhhhhcC Q lcl|NC_016163. 394 YFTAIFGQHMNVYDEYNGETITVTS---TYFLASMIPSNDDQNGIQWTFVGPRRGVISGF-TDINFYPNEPWKEKLYLAQ 469 (590) Q Consensus 394 ~~~~~~~p~~~~~d~~~~~~~~~pp---sg~~AG~~A~~D~~~G~~~sPan~~~~~i~g~-~~~~~~~~~~e~~~Ln~~g 469 (590) ..+...+.+... .-+|+...+|| ++++||++++ ..+++||.|+. 
+.++ +++.+.+++.|+|.|+++| T Consensus 448 ~~~~~~~~~~~~--~~~G~~~~~p~~~~Aa~VAGl~a~----l~~~~s~T~k~---i~~~~id~~~~~t~~qld~L~~~G 518 (648) T protein:vir:10 448 QAVVFPFYSNVF--NDEGKVELLGGEFFASYVAGMHAN----REPQDSITFLP---ISGIGAEPLYNWTYTQKDDLISNR 518 (648) T ss_pred eEEeecccceeE--CCCCcEEecchhhHHHHHHhhhhc----cccccCcccce---eeccccccccCCCHHHHHHHhcCC Confidence 122222333322 22677888998 7889999987 45899999965 4444 4445789999999999999 Q ss_pred ceEEEEecCC----eEEEe-cceec--CCCcccceehhhhHHHHHHHHHHH-HHHHHhcCCCCHHHHHHHHHHHHHHHHH Q lcl|NC_016163. 470 VNYIERDPKK----ISFAT-QLTSQ--TSRSALSYINNVRVLLRIRREVEK-MMADYRQEFQDNTTYDSMSYSLNNYLQQ 541 (590) Q Consensus 470 In~i~~~~~~----G~~~w-G~rT~--s~d~~~~~i~vrR~~~~i~~si~~-~~~~~vfepn~~~l~~~v~~~i~~~L~~ 541 (590) |+||.++.++ ++|+- |=+|. +.++.|+.|+++|++||+.+.+++ ...+|+++||++..|..|++.|.+||.+ T Consensus 519 v~~ie~~~~~~~~~~~rvv~gITT~~~~~~~~~~eisv~ri~D~l~~~vr~~l~~~fIG~~n~~~~~~~ik~~i~~~L~~ 598 (648) T protein:vir:10 519 VLFVEKVKTSFGGIVYRIHHNPTTWLGPVTQGFQEFVLRRIDDFLQSYVYKNLQEQFIGRKSYGRKTENDIKVYTEALLS 598 (648) T ss_pred cEEEEEecCCcceeeEEEeccceeecCCCCcceeeeeeeehhhHHHHHHHHHHhhhcCcccccHHHHHHHHHHHHHHHhh Confidence 9999998775 34444 32332 357889999999999999999987 5569999999999999999999999999 Q ss_pred HHhCCceEEEecCCCCCHHHhhCCEEEEEEEEEecCccceEEEEEEeeC Q lcl|NC_016163. 
542 WVANRACSSISGTVYASDYDKQQSIARVKVELVFTGVIERIAIDLVVNK 590 (590) Q Consensus 542 l~~~ga~~~~~d~~~nt~~~i~~G~l~~~i~~ap~~paefi~~~~~~~~ 590 (590) +++.++|+++.+.+ -..++++++++|++.+.|++|++||.+++.++- T Consensus 599 ~~~~~~I~~y~~~~--v~~~~~~~vv~V~~~v~Pv~~i~~I~vti~it~ 645 (648) T protein:vir:10 599 NLVGKQIVAYKDVK--VTSNEDKTVYYVEFFYQPVTEIKFILVTMKVTF 645 (648) T ss_pred HhhcCcccCcccce--EEEEecCCEEEEEEEEEecceeeEEEEEEEEEe Confidence 99999999876432 223456799999999999999999999999999 No 37 >protein:vir:80779 Length: 569 # NCBI annotation: putative tail sheath protein # Family: family:all:2449 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504132;genbank:gi:158079319;genbank:GeneID:5666428 Probab=100.00 E-value=8.2e-66 Score=377.42 Aligned_cols=530 Identities=12% Similarity=0.081 Sum_probs=320.2 Q ss_pred Cc-------cccCCceEEEEecCCCceecccccceeEEEeecCCCCCCccEEecCHHHHHHhcCCccccccccHHHHHHH Q lcl|NC_016163. 1 MA-------DYLHPSVSSRIVDNSAVYATAAGNTVLYAAIHSAIGRDNAVEFVTTTDEFLFKFGNPNLSKYGQTSYNILN 73 (590) Q Consensus 1 Mp-------~yl~PGVYveEi~s~~~~i~gv~Tsv~~~vg~~~~Gp~~~p~~v~s~~e~~~~fG~~~~~~~~~l~~av~~ 73 (590) || .+-+||||+||.+++.++++|+++++++|+|.+++||.++|++|+||.||++.||+..+....+|+|.... T Consensus 1 ~~~~~~~~~~~~~Pgv~~~~~~~~~~~~~~~~~~~~~~ig~a~~G~~~~~~~~~~~~~~~~~f~~g~l~~a~~~a~~~~~ 80 (569) T protein:vir:80 1 MAVEQFPRKKVSRPHTEITVDTSGIGGSSSSSDKTLMLVGSAKGGKPDTVYRFRNYQQAKQVLRSGDLLDAIELAWNASD 80 (569) T ss_pred CeeeeecCCccccCceEEEEecCCCcCCCCCCceeEEEEEEeCCCCCceeEEecCHHHHHHHhcCCchhHHHHhhccCcc Confidence 44 56789999999999999999999999999999999999999999999999999987444444567777777 Q ss_pred HHcCCC-cEEEEEEecCCccccccccccceeecccccccceeeeeeccccccccccceEEEeeccccCCcceeeEeeccc Q lcl|NC_016163. 74 WLQSGG-TAYVLRVMPDDAKFANSLISIKTTAAADPAKATVLVTAKAQTTNTASKNAMKTILSGGTAGETPLCFIVPKGR 152 (590) Q Consensus 74 ff~nGG-~~~vvRv~~~~a~~a~~~~~~~~~~a~~~~~~~~~v~~~~~~~~~a~~~~~~~~~~~~t~~~~~~~~~~~~~~ 152 (590) ||.||| +||++|| .+|+.++.....+..++...+.+.+.+++.......-......+.. ...+. 
T Consensus 81 ~~~~~~~~~~~~rv--~~a~~a~~~~~~~~~~a~~~g~~~n~i~v~l~~~~~~~~~~~~v~~-------------~~~~~ 145 (569) T protein:vir:80 81 VNTASAGDILAVRV--EDAKNATLTKGGLTFASTIYGVDANEIQVALEDNNLTHTKRLTVAF-------------SKDGY 145 (569) T ss_pred ccccCceEEEEEEc--CCCeeeeeeccceeeeeeeccCCCceEEEEEecCcCCcceeeEEee-------------ecCCC Confidence 778888 5999999 6666676666677777777777766666554322111111111111 11111 Q ss_pred ccccccceEEEEeeccccccc--------cccccceeeeeecccCCCceeee-eeeeeccccccccccccceeeeeeccc Q lcl|NC_016163. 153 GENYNGYGFRLSLRSDYDNTY--------NFRTYNLSVTVKDSTGADVVVEG-PYIVSFDPEAKDKSRQSIYYANIINKY 223 (590) Q Consensus 153 g~~~~~~~~~~~~~~~~~~~~--------~~~~~~l~i~v~d~~~~~~v~e~-~~~ls~~~da~~~~~~~~~~~~vv~~~ 223 (590) ...++..+-............ ........+..+.........+. ...+.......... ....++.. T Consensus 146 ~~~~~~ig~v~si~ytg~~~~a~~~~~~~~~~~~a~~l~~~~g~~~~~~~~v~~~~~~~~~~~~~~~-----lv~~~~~~ 220 (569) T protein:vir:80 146 KKVFDNLGKIFSIQYKGSEAQANFTIAQDSISKKATTLTLNVGSEPESTTEVMKYELGQGVYSETNV-----LVSAINSL 220 (569) T ss_pred ccccccccceeeEEEeeccccceEEeecCcCcceeEEEEEEecCCcceeEEEEeeccCCccchhhhh-----hhhhcCCc Confidence 111211110000000000000 00000000001100000000000 00000000000000 00000000 Q ss_pred ---cceee-ecCccccceeeeeecccccccCccccceecccccccccccccccccccceeecccccccccccccccccce Q lcl|NC_016163. 224 ---SQYVE-IVDNRSAFETISEFVVGDSEADPQKVDIIFGQERAVTPAETIHANVVWKSSSVETDDPSYDATAANFNNIQ 299 (590) Q Consensus 224 ---s~~v~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 299 (590) +.... ..+.. .. .. ........... ........ ..... .... ........ ...+.. 
T Consensus 221 ~~f~a~~~~~~~~~--~~-~~------~~d~~~~~~~~--t~~~~~~~--~~~di-~~~~---~~~~~v~~---~~~~~~ 280 (569) T protein:vir:80 221 PDWEAKFFPIGDKN--LP-TD------ALEAVTKVDVK--TEAVFVGA--LAGDI-AKQL---EYNDYVTV---AVDATK 280 (569) T ss_pred cCceEEEEecCCCc--ce-eh------hccchhheecc--ccceeeeh--hHHHH-HHhh---cCCceEEE---EecCCc Confidence 00000 00000 00 00 00000000000 00000000 00000 0000 00000000 001111 Q ss_pred eeeeccccccccccccceeccchhhHHHHHHHhhhccCCceeeecccchhHHHHHHHHHHHHhcC---CeEEEEecCCCC Q lcl|NC_016163. 300 YLTEGSEGTWTGGNEESALLVKGYSGVLAPEILDKQQYEIDVLLDGNNEVAVKNAMSDLCSEQRG---DCIAILDCSFQG 376 (590) Q Consensus 300 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~---~~~a~~d~p~~~ 376 (590) .........++||.++.... ++. .+ +...+..+...+++...+.+++.++..||.++|. .++++++.+.+. T Consensus 281 ~l~~~~~~~LtGG~dG~~~~--~~~-~~---l~~le~~~~~~i~~~t~d~av~~~l~a~vkr~r~~g~~~~aVvg~~~~~ 354 (569) T protein:vir:80 281 PVEDFELTNLTGGSDGTAPE--SWA-NK---FPLLANEGGYYLVPLTDKQAVHSEALAFVKDRTDNGDPMRIIVGGGTNE 354 (569) T ss_pred ceeeecceeecCCCCCCccc--hHH-HH---HHHHhhCCcEEEEecCCChHHHHHHHHHHHHHHhCCCcEEEEecCCCCC Confidence 12223334567777764322 111 12 2223344566677777778899999999976543 489999999999 Q ss_pred CHHHHHHHHHhhcCcccceEEEEcCeEEEeecccCceeeecH---HHHHHHHHHHhhccCCceECcCCcccceeeccccc Q lcl|NC_016163. 377 DAQQTIDYRTGNISMSTYFTAIFGQHMNVYDEYNGETITVTS---TYFLASMIPSNDDQNGIQWTFVGPRRGVISGFTDI 453 (590) Q Consensus 377 ~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~pp---sg~~AG~~A~~D~~~G~~~sPan~~~~~i~g~~~~ 453 (590) +++++..++.. +++.+.++++||..+.+. +++.+..|+ ++++||++|..+ +++||.|+. +. 
+.++ T Consensus 355 ~~~~~~~~a~~---~n~e~vv~v~~~~~~~~~-~g~~~~~~~~~~aa~vAG~~A~~~----~~~S~T~k~---i~-~~~i 422 (569) T protein:vir:80 355 TVEESITRATN---LRDPRASLVGFSGTRKMD-DGRLLKLPGYMMASQIAGIASGLE----VGEAITFKH---FN-VTSV 422 (569) T ss_pred CHHHHHHHHhh---cCCCeEEEEecCceeecC-CCcceeechhhHHHHHHHHHhcCc----cccCcccee---ec-cccc Confidence 99998887764 489999999999988764 455556665 678888888665 889999954 54 5678 Q ss_pred eeecChhHHhhhhhcCceEEEEecCCeEEEecc----ee--cCCCcccceehhhhHHHHHHHHHHHHH-HHHhcCCCCHH Q lcl|NC_016163. 454 NFYPNEPWKEKLYLAQVNYIERDPKKISFATQL----TS--QTSRSALSYINNVRVLLRIRREVEKMM-ADYRQEFQDNT 526 (590) Q Consensus 454 ~~~~~~~e~~~Ln~~gIn~i~~~~~~G~~~wG~----rT--~s~d~~~~~i~vrR~~~~i~~si~~~~-~~~vfepn~~~ 526 (590) ...+++.|++.|+++|++++++++++..++|.. +| ...++.|++|+++|++|+|++.|++.+ +||+++||+.. T Consensus 423 ~~~lt~~e~~~li~~G~~~l~~~~~~~~~v~~~vn~itT~t~~~~~~~~~i~viRv~D~i~~dir~~~~~~yiGk~nn~~ 502 (569) T protein:vir:80 423 DRVFESSQLDMLNESGVISIEFVRNRTLTAFRVVQDVTTYNDKSDPVKNEMSVGEANDFLVSELKIELDNNFIGTKVIDT 502 (569) T ss_pred cccCCHHHHHHHHhCCeEEEEEecCceEEEEEEeccceecCCCCCchhhhhhhhHHHHHHHHHHHHHHHhhcCcccCChh Confidence 888999999999999999999998887777743 33 345778999999999999999998875 68999999999 Q ss_pred HHHHHHHHHHHHHHHHHhCCceEEEecCCCCCHHHhhCCEEEEEEEEEecCccceEEEEEEeeC Q lcl|NC_016163. 527 TYDSMSYSLNNYLQQWVANRACSSISGTVYASDYDKQQSIARVKVELVFTGVIERIAIDLVVNK 590 (590) Q Consensus 527 l~~~v~~~i~~~L~~l~~~ga~~~~~d~~~nt~~~i~~G~l~~~i~~ap~~paefi~~~~~~~~ 590 (590) .|..|+..|..||.+||++|+|.++.++ +-+.+++.++++|++.++|+.|+|||++|+++.. 
T Consensus 503 ~r~~v~~~i~~~L~~l~~~gaI~~~~~~--dv~v~~~~d~~~v~~~v~Pv~~~ekI~~ti~~~~ 564 (569) T protein:vir:80 503 SASLIKNFIQSFLDNKKRAREIQDYTPE--EVQVVLEGDVASISMTVMPIRSLNKITVQLVYKQ 564 (569) T ss_pred HHHHHHHHHHHHHHHHHhCCcccCCCcc--ceEEEecCCEEEEEEEEEEcccccEEEEEEEEee Confidence 9999999999999999999999875322 2344577889999999999999999999999988 No 38 >protein:vir:95741 Length: 587 # NCBI annotation: ORF011 # Family: family:all:2449 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240910;genbank:gi:66394960;genbank:GeneID:5132682 Probab=100.00 E-value=1.6e-62 Score=359.44 Aligned_cols=544 Identities=12% Similarity=0.034 Sum_probs=315.8 Q ss_pred Cc-------cccCCceEEEEecCCCceecccccceeEEEeecCCCCCCccEEecCHHHHHHhcCCccccccccHHHHHHH Q lcl|NC_016163. 1 MA-------DYLHPSVSSRIVDNSAVYATAAGNTVLYAAIHSAIGRDNAVEFVTTTDEFLFKFGNPNLSKYGQTSYNILN 73 (590) Q Consensus 1 Mp-------~yl~PGVYveEi~s~~~~i~gv~Tsv~~~vg~~~~Gp~~~p~~v~s~~e~~~~fG~~~~~~~~~l~~av~~ 73 (590) || -|.+|||||||.+|+.++++|+++++++|+|.+++||+++|++++||+||++.|||..+. ..+.||..+ T Consensus 1 ~a~~~~~~~~~~~pgv~~~~~~~~~~~~~~~~~~~~~~~g~a~~G~~~~~~~~~~~~~~~~~~~~g~l~--~~~~~a~~~ 78 (587) T protein:vir:95 1 MAVEPFPRRPITRPHASIEVDTSGIGGSAGSSEKVFCLIGQAEGGEPNTVYELRNYSQAKRLFRSGELL--DAIELAWGS 78 (587) T ss_pred CcccccCCcccccCceEEEEecCCccccCCCCCceEEEEEEeCCCCCceeEEeccHHHHHHHhcCcchH--HHHHHHhcc Confidence 54 578899999999999999999999999999999999999999999999999999984322 234555566 Q ss_pred HHcCCC-cEEEEEEecCCccccccccccceeecccccccceeeeeeccccccccccceEEEeeccccCCc-----ceeeE Q lcl|NC_016163. 74 WLQSGG-TAYVLRVMPDDAKFANSLISIKTTAAADPAKATVLVTAKAQTTNTASKNAMKTILSGGTAGET-----PLCFI 147 (590) Q Consensus 74 ff~nGG-~~~vvRv~~~~a~~a~~~~~~~~~~a~~~~~~~~~v~~~~~~~~~a~~~~~~~~~~~~t~~~~-----~~~~~ 147 (590) ||.||| +||++|| .+++.|+.....+.+++..+|.|.+.+++..............+...+...... 
.+..+ T Consensus 79 ~~~~g~~~~~~~rv--~~~~~a~~~~~~l~~~a~~~G~~gN~i~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~v~si 156 (587) T protein:vir:95 79 NPNYTAGRILAMRI--EDAKPASAEIGGLKITSKIYGNVANNIQVGLEKNTLSDSLRLRVIFQDDRFNEVYDNIGNIFTI 156 (587) T ss_pred ccCCCceEEEEEEc--CCCceeEEEecCeEEEEecccccccceEEEEecCCCCCceeEEEEEecccceeeeeeccceeee Confidence 668988 5999999 566677777788889999999999988876543322222222222221110000 00001 Q ss_pred eecccccccccceEEEEeeccccccccccccceeeeeecccCCCceeeeeeeeec-------ccccc--ccccccceeee Q lcl|NC_016163. 148 VPKGRGENYNGYGFRLSLRSDYDNTYNFRTYNLSVTVKDSTGADVVVEGPYIVSF-------DPEAK--DKSRQSIYYAN 218 (590) Q Consensus 148 ~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~l~i~v~d~~~~~~v~e~~~~ls~-------~~da~--~~~~~~~~~~~ 218 (590) ...+..... ......+. .. .....+..+. +..++. .+...+. ..+.. .+.....|... T Consensus 157 ~y~g~~~~~-------~~~v~~~~-~t--~~a~~~~l~~--g~~~v~-~yrL~~g~~~~~~~~~~~in~~~~~tAky~g~ 223 (587) T protein:vir:95 157 KYKGEEANA-------TFSVEHDE-ET--QKASRLVLKV--GDQEVK-SYDLTGGAYDYTNAIITDINQLPDFEAKLSPF 223 (587) T ss_pred eeecccccc-------ceeeeecc-cc--eeeeeeeeec--CCceEE-EEEecCCchHHHHHHHHhhccccceEEEEecc Confidence 111110000 00000000 00 0000000000 001110 0000000 00000 00000000000 Q ss_pred eeccccceeeecCccccceeeeeecccccccCccccceeccc--cccc-ccccccccccccceeeccccccccccccccc Q lcl|NC_016163. 219 IINKYSQYVEIVDNRSAFETISEFVVGDSEADPQKVDIIFGQ--ERAV-TPAETIHANVVWKSSSVETDDPSYDATAANF 295 (590) Q Consensus 219 vv~~~s~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 295 (590) .-+. ..+...+............... ....+..... .... ............ ........ ........ 
T Consensus 224 ~~~~--i~~~~~~~~~~~~v~~~~~~v~----a~~~d~~~~~~~~~~v~~~~~~g~~~~~~-~~~~~~~~--~~a~~~~~ 294 (587) T protein:vir:95 224 GDKN--LESSKLDKIENANIKDKAVYVK----AVFGDLEKQTAYNGIVSFEQLNAEGEVPS-NVEVEAGE--ESATVTAT 294 (587) T ss_pred cCce--eEEeecCcccccceehhhhhhh----hhhcceeeeeeceeeeeeecccccceecc-chhhhhcc--cchheecc Confidence 0000 0000000000000000000000 0000000000 0000 000000000000 00000000 00000000 Q ss_pred ccceeeeeccccccccccccceeccchhhHHHHHHHhhhccCCceeeecccchhHHHHHHHHHHHHhcC---CeEEEEec Q lcl|NC_016163. 296 NNIQYLTEGSEGTWTGGNEESALLVKGYSGVLAPEILDKQQYEIDVLLDGNNEVAVKNAMSDLCSEQRG---DCIAILDC 372 (590) Q Consensus 296 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~---~~~a~~d~ 372 (590) .............++||.++.... ++ ..+ +...+..+...++++..+.+++.++..||.+++. .++++++. T Consensus 295 ~~~~~~a~~~~t~LtGG~dG~~~~--~y-~~~---l~ale~~~~~~i~~~t~d~~v~a~l~a~vk~~~~~g~~~~aVvg~ 368 (587) T protein:vir:95 295 SPIKTIEPFELTKLKGGTNGEPPA--TW-ADK---LDKFAHEGGYYIVPLSSKQSVHAEVASFVKERSDAGEPMRAIVGG 368 (587) T ss_pred ccccceeccceeeeecCCCCCCcc--cH-HHH---HHHHHhCCcEEEEecCCCHHHHHHHHHHHHHHHhCCCcEEEEEcC Confidence 000001111122366666664321 11 112 2223344556666677777888889999866543 48999999 Q ss_pred CCCCCHHHHHHHHHhhcCcccceEEEEcCeEEEeecccCceeeecH---HHHHHHHHHHhhccCCceECcCCcccceeec Q lcl|NC_016163. 373 SFQGDAQQTIDYRTGNISMSTYFTAIFGQHMNVYDEYNGETITVTS---TYFLASMIPSNDDQNGIQWTFVGPRRGVISG 449 (590) Q Consensus 373 p~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~pp---sg~~AG~~A~~D~~~G~~~sPan~~~~~i~g 449 (590) +.+.+++++...+.. +++.+.++++|+..+. ..+++.+..|| ++++||++|..| +++||.|.. +. 
T Consensus 369 ~~~~~~~~~~~~a~~---~n~ervi~v~~~~~~~-~~dg~~~~~~~~~~aa~vAGl~Ag~~----~~~SlT~~~---i~- 436 (587) T protein:vir:95 369 GFNESKEQLFGRQES---LSNPRVSLVANSGTFV-MDDGRKNHVPAYMVAVALGGLASGLE----IGESITFKP---LR- 436 (587) T ss_pred CCCCCHHHHHHHHhh---cCCCcEEEecccceEe-cCCCceeeechHHHHHHHHHHHhcCc----hhcCcccee---ee- Confidence 988888888776544 4888999998886543 34567777777 789999999887 778999854 44 Q ss_pred cccceeecChhHHhhhhhcCceEEEEecCCeEEE----ecceec--CCCcccceehhhhHHHHHHHHHHHHH-HHHhcCC Q lcl|NC_016163. 450 FTDINFYPNEPWKEKLYLAQVNYIERDPKKISFA----TQLTSQ--TSRSALSYINNVRVLLRIRREVEKMM-ADYRQEF 522 (590) Q Consensus 450 ~~~~~~~~~~~e~~~Ln~~gIn~i~~~~~~G~~~----wG~rT~--s~d~~~~~i~vrR~~~~i~~si~~~~-~~~vfep 522 (590) ..++...+++.|++.|+++|++++..+++++... .+-+|. ..|+.|++|+++|++|+|.+.|++.+ +||+++| T Consensus 437 ~~~v~~~~t~~e~e~ai~~Gvl~l~~~~~~~~~~vriv~~itT~t~~~d~~~~~i~viRv~D~i~~dir~~~~~~~iGk~ 516 (587) T protein:vir:95 437 VSSLDQIYESIDLDELNENGIISIEFVRNRTNTFFRIVDDVTTFNDKSDPVKAEMAVGEANDFLVSELKVQLEDQFIGTR 516 (587) T ss_pred cccccccCCHHHHHHHHhCCeEEEEEecCCcceEEEEeecceeccCCCCcchhhhhhhhhHHHHHHHHHHHHHhhCCccc Confidence 4577778999999999999999999887765333 354554 45778999999999999999999876 6999999 Q ss_pred CCHHHHHHHHHHHHHHHHHHHhCCceEEEecCCCCCHHHhhCCEEEEEEEEEecCccceEEEEEEeeC Q lcl|NC_016163. 523 QDNTTYDSMSYSLNNYLQQWVANRACSSISGTVYASDYDKQQSIARVKVELVFTGVIERIAIDLVVNK 590 (590) Q Consensus 523 n~~~l~~~v~~~i~~~L~~l~~~ga~~~~~d~~~nt~~~i~~G~l~~~i~~ap~~paefi~~~~~~~~ 590 (590) |++..|..|+..|..||.+||++|+|.+.-. .+.+-++...++++++.+.|+.|+|||.+++++.. 
T Consensus 517 nn~~~r~~v~~~i~~~L~~l~~~gaI~~~~~--~dv~v~~~~d~~~v~~~v~Pv~~mekI~vt~~~~~ 582 (587) T protein:vir:95 517 TINTSASIIKDFIQSYLGRKKRDNEIQDFPA--EDVQVIVEGNEARISMTVYPIRSFKKISVSLVYKQ 582 (587) T ss_pred cchHHHHHHHHHHHHHHHHHHhCCcccCCCc--cceEEEecCCEEEEEEEEEEcccceEEEEEEEEee Confidence 9999999999999999999999999987532 22333456678999999999999999999999977 No 39 >protein:vir:99306 Length: 587 # NCBI annotation: putative major tail sheath protein # Family: family:all:2449 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024479;genbank:gi:48696438;genbank:GeneID:2948034 Probab=100.00 E-value=3.6e-60 Score=346.50 Aligned_cols=540 Identities=12% Similarity=0.046 Sum_probs=315.1 Q ss_pred Cc-------cccCCceEEEEecCCCceecccccceeEEEeecCCCCCCccEEecCHHHHHHhcCCccccccccHHHHHHH Q lcl|NC_016163. 1 MA-------DYLHPSVSSRIVDNSAVYATAAGNTVLYAAIHSAIGRDNAVEFVTTTDEFLFKFGNPNLSKYGQTSYNILN 73 (590) Q Consensus 1 Mp-------~yl~PGVYveEi~s~~~~i~gv~Tsv~~~vg~~~~Gp~~~p~~v~s~~e~~~~fG~~~~~~~~~l~~av~~ 73 (590) || -|.+|||||||.+|+.++++++++++++|+|.+++||+++|++|+||+||+++||| ++|..++++ T Consensus 1 ~a~~~~~~~~~~~pgv~~~~~~~~~~~~~~~~~~~~~~~g~a~~G~~~~~~~~~~~~~~~~~~~~------g~l~~~~~~ 74 (587) T protein:vir:99 1 MAVEPFPRRPITRPHASIEVDTSGIGGSAGSSEKVFCLIGQAEGGEPNTVYELRNYSQAKRLFRS------GELLDAIEL 74 (587) T ss_pred CcccccCCcccccCceEEEEecCCccccCCCCCceEEEEEEecCCccceeEEeccHHHHHHHhcC------cchHHHHHH Confidence 44 57899999999999999999999999999999999999999999999999999998 446666655 Q ss_pred HH----cCCC-cEEEEEEecCCccccccccccceeecccccccceeeeeeccccccccccceEEEeeccccCC-----cc Q lcl|NC_016163. 74 WL----QSGG-TAYVLRVMPDDAKFANSLISIKTTAAADPAKATVLVTAKAQTTNTASKNAMKTILSGGTAGE-----TP 143 (590) Q Consensus 74 ff----~nGG-~~~vvRv~~~~a~~a~~~~~~~~~~a~~~~~~~~~v~~~~~~~~~a~~~~~~~~~~~~t~~~-----~~ 143 (590) +| .||| +||++|| .+++.|+.....+.+++..+|.|.+.+++..............+......... .. 
T Consensus 75 a~~~~~~~g~~~~~~~rv--~~~~~a~~~~~~l~~~a~~~G~~gN~i~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 152 (587) T protein:vir:99 75 AWGSNPNYTAGRILAMRI--EDAKPASAEIGGLKITSKIYGNVANNIQVGLEKNTLSDSLRLRVIFQDDRFNEVYDNIGN 152 (587) T ss_pred HhccccCCCceEEEEEEc--CCCceeEEEecCeEEEEeeccccccceEEEEccCCCCcceeEEEEEecccceeeeeeccc Confidence 55 7887 5999999 56677888888889999999999998887654332222222222111111000 00 Q ss_pred eeeEeecccccccccceEEEEeeccccccccccccceeeeeecccCCCceeeeeeeee-------cccccc--ccccccc Q lcl|NC_016163. 144 LCFIVPKGRGENYNGYGFRLSLRSDYDNTYNFRTYNLSVTVKDSTGADVVVEGPYIVS-------FDPEAK--DKSRQSI 214 (590) Q Consensus 144 ~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~l~i~v~d~~~~~~v~e~~~~ls-------~~~da~--~~~~~~~ 214 (590) +..+...+...... ..+-.+ ..+.. ...+..+. +..++. .+...+ .+.+.. .+..... T Consensus 153 v~~i~y~g~~~~a~-----~~v~~~-~~t~~----a~~~~l~~--g~~~v~-~yrL~~g~~~~~~~~~~~i~~~~~~tAk 219 (587) T protein:vir:99 153 IFTIKYKGEEANAT-----FSVEHD-EETQK----ASRLVLKV--GDQEVK-SYDLTGGAYDYTNAIITDINQLPDFEAK 219 (587) T ss_pred eeeEEeecccccce-----eeEeec-Cccee----eeeeeeec--CCceeE-EEEecCCchHHHHHHHhhhccccceeEE Confidence 11111111110000 000000 00000 00000000 001110 000000 000000 0000000 Q ss_pred eeeeeeccccceeeecCccccceeeeeecccccccCccccceeccccccc---ccccccccccccceeeccccccccccc Q lcl|NC_016163. 215 YYANIINKYSQYVEIVDNRSAFETISEFVVGDSEADPQKVDIIFGQERAV---TPAETIHANVVWKSSSVETDDPSYDAT 291 (590) Q Consensus 215 ~~~~vv~~~s~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~ 291 (590) |....-+ +..+...+................ ...+......... ............. ............. 
T Consensus 220 y~~~~~~--~i~~~~~~~~~~~~v~~~~~~v~a----~~~D~~~~~~~~~~~~~~~~~g~~~~~~~-~~~~~~~~~~~~~ 292 (587) T protein:vir:99 220 LSPFGDK--NLESSKLDKIENANIKDKAVYVKA----VFGDLEKQTAYNGIVSFEQLNAEGEVPSN-VEVEAGEESATVT 292 (587) T ss_pred eeccCCc--eeEeecccccccceeeeeeeeeeh----hccceeeecccceeeeeeecccccchhhh-hhhhhccccceee Confidence 0000000 000000000000000000000000 0000000000000 0000000000000 0000000000000 Q ss_pred ccccccceeeeeccccccccccccceeccchhhHHHHHHHhhhccCCceeeecccchhHHHHHHHHHHHHhcC---CeEE Q lcl|NC_016163. 292 AANFNNIQYLTEGSEGTWTGGNEESALLVKGYSGVLAPEILDKQQYEIDVLLDGNNEVAVKNAMSDLCSEQRG---DCIA 368 (590) Q Consensus 292 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~---~~~a 368 (590) . ..............++||.++.... ++ ..+ +...+..+...++++..+.+++.++.+||.+.|. .+++ T Consensus 293 ~--~~~~~~~a~~~~t~LtGG~dG~~~~--sy-~~a---l~ale~~~~~~i~~~t~d~~i~a~l~a~vk~~r~~g~~~~a 364 (587) T protein:vir:99 293 A--TSPIKTIEPFELTKLKGGTNGEPPA--TW-ADK---LDKFAHEGGYYIVPLSSKQSVHAEVASFVKERSDAGEPMRA 364 (587) T ss_pred e--eccccceecccceeeecCCCCCccc--cH-HHH---HHHHhhCCcEEEEecCCCHHHHHHHHHHHHHHHhCCCcEEE Confidence 0 0000001111122366666654321 11 112 2223334556666677778888889999866543 4899 Q ss_pred EEecCCCCCHHHHHHHHHhhcCcccceEEEEcCeEEEeecccCceeeecH---HHHHHHHHHHhhccCCceECcCCcccc Q lcl|NC_016163. 369 ILDCSFQGDAQQTIDYRTGNISMSTYFTAIFGQHMNVYDEYNGETITVTS---TYFLASMIPSNDDQNGIQWTFVGPRRG 445 (590) Q Consensus 369 ~~d~p~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~pp---sg~~AG~~A~~D~~~G~~~sPan~~~~ 445 (590) +++.+.+.+++++...... +++.+.++++|+..+. ..+++.+..|| ++++||++|..| +++||.|.. 
T Consensus 365 Vlg~~~~~~~~~~~~~a~~---~n~e~vi~v~~~~~~~-~~dg~~~~~~~~~~aa~vAGl~Ag~~----~~~SlT~~~-- 434 (587) T protein:vir:99 365 IVGGGFNESKEQLFGRQAS---LSNPRVSLVANSGTFV-MDDGRKNHVPAYMVAVALGGLASGLE----IGESITFKP-- 434 (587) T ss_pred EecCCCCCCHHHHHHHhhh---cCCCcEEEEeccceEe-cCCCceeeechHHHHHHHHHHHhcCc----hhcCcccee-- Confidence 9999999998888876654 4888999998875543 23566677777 789999999876 788999954 Q ss_pred eeeccccceeecChhHHhhhhhcCceEEEEecCCeEEE----ecceec--CCCcccceehhhhHHHHHHHHHHHHH-HHH Q lcl|NC_016163. 446 VISGFTDINFYPNEPWKEKLYLAQVNYIERDPKKISFA----TQLTSQ--TSRSALSYINNVRVLLRIRREVEKMM-ADY 518 (590) Q Consensus 446 ~i~g~~~~~~~~~~~e~~~Ln~~gIn~i~~~~~~G~~~----wG~rT~--s~d~~~~~i~vrR~~~~i~~si~~~~-~~~ 518 (590) +. ..++...+++.|++.|+++|++++..+++++... .+-.|. ..++.|++|+++|++|+|.+.|++.+ +|| T Consensus 435 -i~-~~~v~~~~t~~e~e~li~~Gvl~l~~~~~~~~~~vriv~~ItT~t~~~~~~~~~i~viRv~D~i~~di~~~~~~~y 512 (587) T protein:vir:99 435 -LR-VSSLDQIYESIDLDELNENGIISIEFVRNRTNTFFRIVDDVTTFNDKSDPVKAEMAVGEANDFLVSELKVQLEDQF 512 (587) T ss_pred -ee-cccccccCCHHHHHHHHhCCeEEEEEecCCcceEEEEeeceeeccCCCCchhhhhhhhhhHHHHHHHHHHHHHhhC Confidence 44 5577788999999999999999999887764322 454554 45778999999999999999999886 689 Q ss_pred hcCCCCHHHHHHHHHHHHHHHHHHHhCCceEEEecCCCCCHHHhhCCEEEEEEEEEecCccceEEEEEEeeC Q lcl|NC_016163. 519 RQEFQDNTTYDSMSYSLNNYLQQWVANRACSSISGTVYASDYDKQQSIARVKVELVFTGVIERIAIDLVVNK 590 (590) Q Consensus 519 vfepn~~~l~~~v~~~i~~~L~~l~~~ga~~~~~d~~~nt~~~i~~G~l~~~i~~ap~~paefi~~~~~~~~ 590 (590) +++||++..|..|+..|..||.+||++|+|.++-.+ ..+-+....++++++.+.|+.|+|||.+++++.. 
T Consensus 513 iGk~Nn~~~r~~i~~~i~~~L~~l~~~gaI~~~~~~--dv~v~~~~d~~~v~~~v~Pv~~mekIy~tv~~~~ 582 (587) T protein:vir:99 513 IGTRTINTSASIIKDFIQSYLGRKKRDNEIQDFPAE--DVQVIVEGNEARISMTVYPIRSFKKISVSLVYKQ 582 (587) T ss_pred CccccchHHHHHHHHHHHHHHHHHHhCCcccCCCcc--ceEEEecCCEEEEEEEEEEcccceEEEEEEEEEe Confidence 999999999999999999999999999999875321 2233356668999999999999999999999987 No 40 >protein:vir:79798 Length: 717 # NCBI annotation: tail sheath subunit # Family: family:all:12069 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429630;genbank:gi:156564120;genbank:GeneID:5525563 Probab=100.00 E-value=2e-59 Score=342.42 Aligned_cols=571 Identities=13% Similarity=0.098 Sum_probs=269.3 Q ss_pred Ccc---ccC-CceEEEEec----CCCceecccccceeEEEeecCCCCCCccEEecCHHHHHHhcCCcccc----ccccHH Q lcl|NC_016163. 1 MAD---YLH-PSVSSRIVD----NSAVYATAAGNTVLYAAIHSAIGRDNAVEFVTTTDEFLFKFGNPNLS----KYGQTS 68 (590) Q Consensus 1 Mp~---yl~-PGVYveEi~----s~~~~i~gv~Tsv~~~vg~~~~Gp~~~p~~v~s~~e~~~~fG~~~~~----~~~~l~ 68 (590) |+. |-. |||-+.=-+ .+..+ +--.|-.-.+-|++--||+++|++|+ ..--+.+||...-. +.+-|- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~ 78 (717) T protein:vir:79 1 MAGFDQYQAIPGHNARFKDGNLNLKSDP-NPRETESVVLLGTATDGPVMQPVRVT-PETAYNIFGKVAHENGVYNGATLL 78 (717) T ss_pred CCchhhhhcCCCceeeeecCceecCCCC-CccccceEEEEeeccCCcccCceeeC-hhHHHhhhhhhhhhcccccchhhh Confidence 883 665 999887322 22222 22234333445666679999999998 67778999963111 112233 Q ss_pred HHHHHHHcCCCc-EEEEEEecCCc----------cccccccccceeecccccccceeee------------eeccccccc Q lcl|NC_016163. 
69 YNILNWLQSGGT-AYVLRVMPDDA----------KFANSLISIKTTAAADPAKATVLVT------------AKAQTTNTA 125 (590) Q Consensus 69 ~av~~ff~nGG~-~~vvRv~~~~a----------~~a~~~~~~~~~~a~~~~~~~~~v~------------~~~~~~~~a 125 (590) -+.+.-..-|.+ ....|+-...| ++..-........+..++++....+ .+++.--.- T Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (717) T protein:vir:79 79 PKFEELWAAGNRDIRLMRTTGVNAVSSLLGTSYSKNSKEVAEDKLGGAQARGNVAATFTLPNGGIVEATFLLKARGVIIP 158 (717) T ss_pred HHHHHHHhcCCcceEEEEecchhHHHHHhhcccccchhhHHHHhhcccccccceEEEEEcCCCceeeeeeeeeecceEeC Confidence 344445566775 89999842221 1110000111111111221111110 000000000 Q ss_pred cccceEEEeeccc-----cCCcceeeE--eecccccc----cccceEEEEeecc---ccccc----ccccc-----ceee Q lcl|NC_016163. 126 SKNAMKTILSGGT-----AGETPLCFI--VPKGRGEN----YNGYGFRLSLRSD---YDNTY----NFRTY-----NLSV 182 (590) Q Consensus 126 ~~~~~~~~~~~~t-----~~~~~~~~~--~~~~~g~~----~~~~~~~~~~~~~---~~~~~----~~~~~-----~l~i 182 (590) ..+- .+..++ ++..+-+.+ ...+.... -..+.+...+... ...++ +.... ++.+ T Consensus 159 ~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 235 (717) T protein:vir:79 159 PNNY---TLDVGTEEDMKAGTQPTFAQVLLNENVADMESEITVSYEFTYKDAQGETKTSEVLDNNTDKDGKPMIAKGADV 235 (717) T ss_pred CCcc---eEeccChhhhhcCCCchhhhhhhccchhhccceeEEEEEEEeecccCcchhhhhhcCCCCCCCceeEEecccc Confidence 0000 000000 000000000 00000000 0000000000000 00000 00000 0000 Q ss_pred eee----------------cccCCCceeeeeeeeeccccccccccc---------------------cceeeeeeccccc Q lcl|NC_016163. 183 TVK----------------DSTGADVVVEGPYIVSFDPEAKDKSRQ---------------------SIYYANIINKYSQ 225 (590) Q Consensus 183 ~v~----------------d~~~~~~v~e~~~~ls~~~da~~~~~~---------------------~~~~~~vv~~~s~ 225 (590) .++ ....+....-.+..++.-.++....++ ..+-..+.++.+. 
T Consensus 236 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~n~~~~ 315 (717) T protein:vir:79 236 TIKLEHVALAGLKLYADGIEVVDAKAFTVAGDQLTIHSNSKMKLGASLEAQYAYNLVEVIQPVIELESIFGGGVYNDIMR 315 (717) T ss_pred eeehhhhhhhhhHHhhcchhhhhhhheeeecceEEEEecCCcccchhhHHHHHhhHHHhhccceEEeecccCceeeeeee Confidence 000 000000000000011111111110000 1111122233333 Q ss_pred eeeecCccccceeeeeecccccccCccccceecccccccccccccccc-cccceeecccccccccccccccccceeeeec Q lcl|NC_016163. 226 YVEIVDNRSAFETISEFVVGDSEADPQKVDIIFGQERAVTPAETIHAN-VVWKSSSVETDDPSYDATAANFNNIQYLTEG 304 (590) Q Consensus 226 ~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 304 (590) .+...|...... .+..................+.-.......+.... ........+. .+...+ ......... T Consensus 316 ~v~~~D~~~~~~-~t~~~~~~g~~~~~pl~~ts~dy~~~~~~vdgI~~~~~~~V~~~g~-~s~a~a-----~~~~g~~s~ 388 (717) T protein:vir:79 316 KVESKDGAVTVT-ITKPESKRGMISEDPLVFKSGDYTNFKMLVDAINNHPFNNVVRART-KPEFEA-----TFTSTLQAA 388 (717) T ss_pred EEecCCceEEEE-EecccccCcceeccccccccCceeeeeeeecccccCchhheeeeec-ccccce-----eeeecccCc Confidence 333322211100 00000000000000000000000000000000000 0000000000 000000 000000001 Q ss_pred cccccccccccceeccchhhH----------HHHH--HHhhhccCCceeeecccc---------hhHHHHHHHHHHHHhc Q lcl|NC_016163. 305 SEGTWTGGNEESALLVKGYSG----------VLAP--EILDKQQYEIDVLLDGNN---------EVAVKNAMSDLCSEQR 363 (590) Q Consensus 305 ~~~~~~~~~~~~~~~~~~~~~----------~~~~--~~~~~~~~~~~~~~~~~~---------~~~~~~a~~~~~~~~~ 363 (590) ++....++.++..+....... ...+ .+...+..++++++.++. ...+..++.+||+... 
T Consensus 389 d~a~f~Gg~dgl~~~~ee~Y~~lGgk~~d~g~lt~~aays~LE~~dVDlVil~ga~adtt~ga~~d~va~alad~caalS 468 (717) T protein:vir:79 389 ADAKFSGGKDELSLDKEEMYKRLGGEKNEEGFVTKQGAYQYLENYEVDYVIPLGVHADTKLIGKYDDFAYQLALACAVMS 468 (717) T ss_pred hhhccCCCccccccchhhhhccccccccccccccchhhhhhcCcceeEEEEecCccccccccchhhhHHHHHHHHHHHhh Confidence 111122222222111111000 0000 011112334556554431 2356778889997642 Q ss_pred ---CCeEEEEe--cCCCCCHHHHHHHHHhhcC----------------------cc-cceEEEEcCeEEEeecccCceee Q lcl|NC_016163. 364 ---GDCIAILD--CSFQGDAQQTIDYRTGNIS----------------------MS-TYFTAIFGQHMNVYDEYNGETIT 415 (590) Q Consensus 364 ---~~~~a~~d--~p~~~~~~~~~~~~~~~~~----------------------~~-s~~~~~~~p~~~~~d~~~~~~~~ 415 (590) +.++.+++ .|.+...+...+|+....+ .+ +.|...++++..+..+..+..+. T Consensus 469 al~r~ai~VI~l~sp~D~~~AtVe~~~~kLs~~Aaa~~~~d~~~a~a~~~~~~~idis~y~~vv~~~~~iv~~~~~~~~~ 548 (717) T protein:vir:79 469 HYNSVTIGIIPTTTPSDISLAGVEEHVKKLENYANEFYMRDRFGNIIFDADRNKIDLGQFIEVVAGPDFIVRNTRLGQMA 548 (717) T ss_pred hccccceeeeccccccccchhhHHHHHHHHHhhhhhhhhhcchhccccccccccccccceeeeeecceeEEEcCCCceee Confidence 12444454 3434433333444332110 00 23444444444444555666777 Q ss_pred ecHHHHHHHHHHHhhccCCceECcCCcccceeeccccceeecChhHHhhhhhcCceEEEEecCCeEEEecceecCCCc-c Q lcl|NC_016163. 416 VTSTYFLASMIPSNDDQNGIQWTFVGPRRGVISGFTDINFYPNEPWKEKLYLAQVNYIERDPKKISFATQLTSQTSRS-A 494 (590) Q Consensus 416 ~ppsg~~AG~~A~~D~~~G~~~sPan~~~~~i~g~~~~~~~~~~~e~~~Ln~~gIn~i~~~~~~G~~~wG~rT~s~d~-~ 494 (590) .||+|++||+ |..+|+||||+|+. |.|+.++++.++++|++.||++|||||++++++|+++||+||++.++ . 
T Consensus 549 ~p~AG~vAGl----dA~rGVwkSPANk~---I~GVvgLa~~lT~sE~d~Ln~aGIntIr~~~GrGirVWGaRTtasd~sd 621 (717) T protein:vir:79 549 STPDASYIGM----VSQLKTQSAPTNKP---LPSVTALRYTYSANQLNRLTKARFATFKYKQDGSIGVVDAPTSAHAGSD 621 (717) T ss_pred cCHHHHHHHH----HhcCCcccccccce---ecccccCcccCCHHHHHHHhhCCeEEEEEeCCceEEEEeeeecCCCCcc Confidence 8887666665 55689999999964 88999999999999999999999999999999999999999998764 7 Q ss_pred cceehhhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhCCceEEEecCCCCCHHHhhCCEEEEEEEEE Q lcl|NC_016163. 495 LSYINNVRVLLRIRREVEKMMADYRQEFQDNTTYDSMSYSLNNYLQQWVANRACSSISGTVYASDYDKQQSIARVKVELV 574 (590) Q Consensus 495 ~~~i~vrR~~~~i~~si~~~~~~~vfepn~~~l~~~v~~~i~~~L~~l~~~ga~~~~~d~~~nt~~~i~~G~l~~~i~~a 574 (590) |+||+|||++++|+++|+++++|+|||||++.+|.+|+.+|++||++||++|+|.|+.-..+||++++++|+|+|+|+++ T Consensus 622 WryInVRRl~D~Ie~sIr~al~~yVgEPNd~~tr~~Ik~sI~afL~~L~r~GAI~GykvdvtnT~~di~~G~l~V~I~va 701 (717) T protein:vir:79 622 YTRLSTARIVKEAVNAVREVADPFIGEPNDTGNRNALTAAVDKRLSKMIENKALLGFDFRLVVTPQQELLGEGSIELSLE 701 (717) T ss_pred cceeehhhhHHHHHHHHHHHHHHhccccCCHHHHHHHHHHHHHHHHHHHhcCceecceeeEecChhHhhCCEEEEEEEEE Confidence 99999999999999999999999999999999999999999999999999999998543458999999999999999999 Q ss_pred ecCccceEEEEEEeeC Q lcl|NC_016163. 575 FTGVIERIAIDLVVNK 590 (590) Q Consensus 575 p~~paefi~~~~~~~~ 590 (590) |++|+|||+|+++++= T Consensus 702 Pv~PaEfI~ititITA 717 (717) T protein:vir:79 702 APNELRRLTTIVSLSA 717 (717) T ss_pred ecCcccEEEEEEEEeC Confidence 9999999999999999 No 41 >protein:vir:96586 Length: 587 # NCBI annotation: ORF011 # Family: family:all:2449 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238553;genbank:gi:66391266;genbank:GeneID:5130368 Probab=100.00 E-value=9.3e-59 Score=338.73 Aligned_cols=541 Identities=10% Similarity=0.030 Sum_probs=312.2 Q ss_pred CccccCCceEEEEecCCCceecccccceeEEEeecCCCCCCccEEecCHHHHHHhcCCccccccccHHHHHHHHH----c Q lcl|NC_016163. 
1 MADYLHPSVSSRIVDNSAVYATAAGNTVLYAAIHSAIGRDNAVEFVTTTDEFLFKFGNPNLSKYGQTSYNILNWL----Q 76 (590) Q Consensus 1 Mp~yl~PGVYveEi~s~~~~i~gv~Tsv~~~vg~~~~Gp~~~p~~v~s~~e~~~~fG~~~~~~~~~l~~av~~ff----~ 76 (590) |--|.+||||||+.+++..+++|+++++.+|+|.+++||+++|++|++|+||++.||+ ++|..|++++| . T Consensus 8 ~~~~~~Pgv~~~~~~~~~~~~~~~~~~~~~~iG~a~~g~~~~~~~~~~~~~~~~~~g~------G~l~~ai~~a~~~~~~ 81 (587) T protein:vir:96 8 RRPIQRPHASIEVDSSGIGGSASNSEKILCLIGKAEGGEPNTVYQVRNYAQAKSVFRS------GELLDAIELAWGSNPQ 81 (587) T ss_pred CCcccCCceEEEEecCCccCCCCCCCceEEEEEEecCCCCceeEEEcChHHHHHhhcC------CcHHHHHHHHhccCcC Confidence 5579999999999999999999999999999999999999999999999999999997 45777777777 6 Q ss_pred CCC-cEEEEEEecCCccccccccccceeecccccccceeeeeeccccccccccceEEEeeccccCCcceeeEeecccccc Q lcl|NC_016163. 77 SGG-TAYVLRVMPDDAKFANSLISIKTTAAADPAKATVLVTAKAQTTNTASKNAMKTILSGGTAGETPLCFIVPKGRGEN 155 (590) Q Consensus 77 nGG-~~~vvRv~~~~a~~a~~~~~~~~~~a~~~~~~~~~v~~~~~~~~~a~~~~~~~~~~~~t~~~~~~~~~~~~~~g~~ 155 (590) ||| .||.+|| +++..++.....+.+++...+.|.+.++++......-............... ...++.|. T Consensus 82 ~g~~~~~a~rv--~~~~~a~~~~~~~~~~~~~~g~~~n~i~v~~~~~~~~~~~~~~~~~~~~~~~------~~~~n~G~- 152 (587) T protein:vir:96 82 YTAGKILAMRV--EDAKASQLEKGGLRVTSKIFGSVSNDIQVALEKNTITDSLRLRVVFQKDNYQ------EVFDNLGN- 152 (587) T ss_pred CCceEEEEEec--CCCccceeecccccccccccCCCCceEEEEEEeccCCCccceEEEEecCCce------eeccccCc- Confidence 888 5999999 4566677666667777777777777766655322111111111111111100 00111110 Q ss_pred cccceEEEEeecccc-ccccccccc-----eeeeeecccCCCceeeeeeeeec-------ccccc--ccccccceeeeee Q lcl|NC_016163. 156 YNGYGFRLSLRSDYD-NTYNFRTYN-----LSVTVKDSTGADVVVEGPYIVSF-------DPEAK--DKSRQSIYYANII 220 (590) Q Consensus 156 ~~~~~~~~~~~~~~~-~~~~~~~~~-----l~i~v~d~~~~~~v~e~~~~ls~-------~~da~--~~~~~~~~~~~vv 220 (590) -+.++. ..+.. ......... ..+.. .. +..++. .+...+. ..+.. 
.+.....|....- T Consensus 153 --v~~i~y--~g~~~~a~~~~~~~~~~~~A~~l~l-~g-g~~~v~-~yrl~~g~~~~~~~~~~~~~~~~~~tAky~g~~~ 225 (587) T protein:vir:96 153 --IFSINY--KGEGEKATFSVEKDKETQEAKRLVL-KV-DEKEVK-AYELNGGAYSFTNEIITDINELPDFEAKLSPFGD 225 (587) T ss_pred --eEEEEe--cccccceeEeeccCcccceeeeeEE-Ee-cCceEE-EEEeCCCchhhhhhhhhhhccccceEEEeecccC Confidence 000110 00000 000000000 00000 00 000000 0000000 00000 0000001100000 Q ss_pred ccccceeeecCccccceeeeeecccccccCccccceeccccccc-ccccccccccccceeecccccccccccccccccce Q lcl|NC_016163. 221 NKYSQYVEIVDNRSAFETISEFVVGDSEADPQKVDIIFGQERAV-TPAETIHANVVWKSSSVETDDPSYDATAANFNNIQ 299 (590) Q Consensus 221 ~~~s~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 299 (590) + +..+...+................. . .++........ .......... ........................ T Consensus 226 n--~~~v~v~d~~~~~~~k~~~~y~~t~-~---~di~~~~~~~~~~~~~~~~~~~-~~~~~~~v~~~~~~~~~~~~~~~~ 298 (587) T protein:vir:96 226 K--NLESRKLDEATDVDIKGKAVYVKAV-F---GDIENQTQYNQYVKFEQLPEQA-SEPSDVEVHAETESATVTATSKPK 298 (587) T ss_pred c--eeEEEeeccccccccceEEEeehhh-h---hhhhhhhccccceeeccccchh-hhhhcccccccccceeeeeccccc Confidence 0 0000000000000000000000000 0 00000000000 0000000000 000000000000000000000000 Q ss_pred eeeeccccccccccccceeccchhhHHHHHHHhhhccCCceeeecccchhHHHHHHHHHHHHhcC---CeEEEEecCCCC Q lcl|NC_016163. 300 YLTEGSEGTWTGGNEESALLVKGYSGVLAPEILDKQQYEIDVLLDGNNEVAVKNAMSDLCSEQRG---DCIAILDCSFQG 376 (590) Q Consensus 300 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~---~~~a~~d~p~~~ 376 (590) .........++||.++.... ++ ..+ +...+..+...++++..+.+++.++.+||.++|. .+++++..+.+. 
T Consensus 299 ~~~~~~~~aLtGG~dG~~~~--~y-~~~---l~ale~~~~~~i~~~t~d~ai~~~l~a~vk~~r~~gk~~~aVlg~~~~~ 372 (587) T protein:vir:96 299 AIEPFELTKLSGGTNGEPPT--SW-SAK---LEKFKNEGGYYIVPLTDRQSVHSEVATFVKNRSDAGEPMRAIVGGGTSE 372 (587) T ss_pred ccccccceeeecCCCCCCcc--cH-HHH---HHHHhhCCcEEEEecCCCHHHHHHHHHHHHHHHhCCCeEEEEecCCCCC Confidence 11112223466776664321 11 112 2222334556677777778889999999976543 389999999888 Q ss_pred CHHHHHHHHHhhcCcccceEEEEcCeEEEeecccCceeeecH---HHHHHHHHHHhhccCCceECcCCcccceeeccccc Q lcl|NC_016163. 377 DAQQTIDYRTGNISMSTYFTAIFGQHMNVYDEYNGETITVTS---TYFLASMIPSNDDQNGIQWTFVGPRRGVISGFTDI 453 (590) Q Consensus 377 ~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~pp---sg~~AG~~A~~D~~~G~~~sPan~~~~~i~g~~~~ 453 (590) +++++...+.. +++.+.++++++..+.+. +++....|+ ++++||++|..+ +++||.|+. +.+ .++ T Consensus 373 ~~~~~~~~a~~---~n~e~vi~v~~~~~~~~~-~~~~~~~~~~~~aa~vAG~~Ag~~----~~~S~T~~~---~~~-~~v 440 (587) T protein:vir:96 373 TKEKLFGRQAI---LNNPRVALVANSGKFVMG-NGRILQAPAYMVASAVAGLVSGLD----IGESITFKP---LFV-NSL 440 (587) T ss_pred CHHHHHHHHhh---cCCCcEEEEecceEEecC-CCceeeechhhHHHHHHHHHhcCc----cccCcccee---eec-ccc Confidence 88888776544 488899999998887765 344444443 689999999776 788999954 544 577 Q ss_pred eeecChhHHhhhhhcCceEEEEecCCeEEEecc-ee---c--CCCcccceehhhhHHHHHHHHHHHHH-HHHhcCCCCHH Q lcl|NC_016163. 454 NFYPNEPWKEKLYLAQVNYIERDPKKISFATQL-TS---Q--TSRSALSYINNVRVLLRIRREVEKMM-ADYRQEFQDNT 526 (590) Q Consensus 454 ~~~~~~~e~~~Ln~~gIn~i~~~~~~G~~~wG~-rT---~--s~d~~~~~i~vrR~~~~i~~si~~~~-~~~vfepn~~~ 526 (590) ...+++.|++.|+++|+.+++...+++.++|.. ++ . ..+..|++|+++|++|+|.+.|++.+ +||+++||+.. 
T Consensus 441 ~~~~t~~e~~~~i~~G~~~l~~~~~~~~~v~~~vnsitT~t~~~~~~~~~i~virv~D~i~~di~~~~~~~yiGk~nn~~ 520 (587) T protein:vir:96 441 DKVYESEELDELNENGIITIEFVRNRMTTMFRIVDDVTTFPDKNDPVKSEMALGEANDFLVSELKILLEEQYIGTRTINT 520 (587) T ss_pred cccCCHHHHHHHHhCCeEEEEEecCCcEEEEEeeccceecCCCCCchhhhhhhHHHHHHHHHHHHHHHHhcCCccccCHH Confidence 778999999999999999999988887777743 22 2 34668999999999999999999886 68999999999 Q ss_pred HHHHHHHHHHHHHHHHHhCCceEEEecCCCCCHHHhhCCEEEEEEEEEecCccceEEEEEEeeC Q lcl|NC_016163. 527 TYDSMSYSLNNYLQQWVANRACSSISGTVYASDYDKQQSIARVKVELVFTGVIERIAIDLVVNK 590 (590) Q Consensus 527 l~~~v~~~i~~~L~~l~~~ga~~~~~d~~~nt~~~i~~G~l~~~i~~ap~~paefi~~~~~~~~ 590 (590) .|..|+..|..||.+||++|+|.++-. .+.+-++...+++|++.+.|+.|+|||.+++++.. T Consensus 521 ~r~~v~~~i~~~L~~l~~~g~I~~~~~--~dv~v~~~~D~~~v~~~v~Pv~~mekIy~tv~~~~ 582 (587) T protein:vir:96 521 SASQIKDFVQSYLGRKKRDNEIQDFPP--EDVQVIIEGNEARISLTIFPIRALKKISVSLVYRQ 582 (587) T ss_pred HHHHHHHHHHHHHHHHHhCCcccCCCc--cceEEEecCCEEEEEEEEEEcccceEEEEEEEEEe Confidence 999999999999999999999987532 23333456678999999999999999999999988 No 42 >protein:vir:100829 Length: 607 # NCBI annotation: hypothetical protein # Family: family:all:2449 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164738;genbank:gi:56693151;genbank:GeneID:3197462 Probab=100.00 E-value=1.5e-54 Score=315.73 Aligned_cols=545 Identities=13% Similarity=0.080 Sum_probs=313.6 Q ss_pred CccccCCceEEEEecCCCceecccccceeEEEeecCCCCCCccEEecCHHHHHHhcCCccccccccHHHHHHHHHcCCC- Q lcl|NC_016163. 
1 MADYLHPSVSSRIVDNSAVYATAAGNTVLYAAIHSAIGRDNAVEFVTTTDEFLFKFGNPNLSKYGQTSYNILNWLQSGG- 79 (590) Q Consensus 1 Mp~yl~PGVYveEi~s~~~~i~gv~Tsv~~~vg~~~~Gp~~~p~~v~s~~e~~~~fG~~~~~~~~~l~~av~~ff~nGG- 79 (590) |=-+-+||||+++.+|+..+++++++.+.+|+|.+++||+++|++|+||+|++++|||..+...-+|.|++..||.||| T Consensus 17 ~~~~~~pgv~~~~~~~~~~~~~~~~~~~~~~iG~a~~G~~~~~~~~~~~~~a~~~f~~g~l~~a~~~a~~~~~~~~~g~~ 96 (607) T protein:vir:10 17 LFYDSRPHVETNFDDSRLSNTASDSAKNIFMLGSATNGDPTKVYEIRTSQQATKIFGSGDLVDGIKLAFDPTGNSVTNGG 96 (607) T ss_pred CCCccCCceEEEEecCcCcCCCCCCcceEEEEEEeCCCCCceEEEEcchhHHHHhhcCcchHHHHHHhhccccCCccCCc Confidence 4457799999999999999999999999999999999999999999999999999987655556678999999999997 Q ss_pred cEEEEEEecCCccccccccccceeecccccccceeeeeeccccccccccceEEEeeccccCCcceeeEeecccccc---- Q lcl|NC_016163. 80 TAYVLRVMPDDAKFANSLISIKTTAAADPAKATVLVTAKAQTTNTASKNAMKTILSGGTAGETPLCFIVPKGRGEN---- 155 (590) Q Consensus 80 ~~~vvRv~~~~a~~a~~~~~~~~~~a~~~~~~~~~v~~~~~~~~~a~~~~~~~~~~~~t~~~~~~~~~~~~~~g~~---- 155 (590) .||+||| +++..++.......++....+.+.+-++++...+..+.+....+...+.. ....++-|.. T Consensus 97 ~~~~~rv--~~~~~a~~~~~~~~~~~~~~~~~~~~i~~~l~~~~~~~~~~~~~~~~d~~-------~~~~~n~g~~~~i~ 167 (607) T protein:vir:10 97 TVYALRV--DNAKQASLVKDGLTFTSSIFGTNANQVSVALDNDVFGVPRITVNYSPDNY-------ERTYTNIGQMFSIT 167 (607) T ss_pred eEEEEeC--CCccccceecccccccccccccCCCceEEEEEecCCCccceeEEeecccc-------eeeeeeccceeecc Confidence 5999999 44455555555555566666666665555543333333333222222110 0011111110 Q ss_pred cccceE--EEEeeccccccccccccceeeeeecccCCCceeee--ee-------eeeccccccccccccceeeeeecccc Q lcl|NC_016163. 156 YNGYGF--RLSLRSDYDNTYNFRTYNLSVTVKDSTGADVVVEG--PY-------IVSFDPEAKDKSRQSIYYANIINKYS 224 (590) Q Consensus 156 ~~~~~~--~~~~~~~~~~~~~~~~~~l~i~v~d~~~~~~v~e~--~~-------~ls~~~da~~~~~~~~~~~~vv~~~s 224 (590) |.+... .+.+..+.++ ....+...-....+..++. +. ..+.+..... 
....+...++...+ T Consensus 168 y~g~~~~a~~~v~~~~~g------~~~~lt~~~~~~~~~~~~V~~~~l~~~~~~t~~~l~~din--~~~~~~A~~~g~~~ 239 (607) T protein:vir:10 168 YSGKSASAGYTVSHDTDG------KAILLTLGSGDSIDKLTNVATFDLTMSKYDTIAKLMQAIS--ATPNFSASVVGSPS 239 (607) T ss_pred cCcccccccceeeecCCC------ceeEEEecCCCccceeeeeecccccccccchHHHHHHHhh--cCCceEEEEecccc Confidence 000000 0001111100 0001111100111111110 00 0000000000 00000000010000 Q ss_pred ceeeecCccccceeeeeecccccccCccccceeccccccc---ccccccccccccceeecccccccccccccccccceee Q lcl|NC_016163. 225 QYVEIVDNRSAFETISEFVVGDSEADPQKVDIIFGQERAV---TPAETIHANVVWKSSSVETDDPSYDATAANFNNIQYL 301 (590) Q Consensus 225 ~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 301 (590) .-....+.......+... .........+......... .......... .......... ...... ....... T Consensus 240 i~tky~d~~~~~i~V~~~---~~iv~a~~~D~~~~~~~~~~~~~t~~~~~~~~-~~~~~~~~~~--~~~~~~-~~~~~~~ 312 (607) T protein:vir:10 240 VNTSYLDEVTSPVDVKTA---PAVVTAKIGDAISKLGYDPYVVVTQTSNNKPI-VNGVSAGTGS--ATASVT-TAPESFP 312 (607) T ss_pred eeeeccccccceeEEEEe---eeeechhhhhhhhcccccceEEeeecccchhh-hhhhhccccc--eeeeee-ccccccc Confidence 000000000000000000 0000000000000000000 0000000000 0000000000 000000 0000011 Q ss_pred eeccccccccccccceeccchhhHHHHHHHhhhccCCceeeecccchhHHHHHHHHHHHHhcC---CeEEEEecCCCCCH Q lcl|NC_016163. 302 TEGSEGTWTGGNEESALLVKGYSGVLAPEILDKQQYEIDVLLDGNNEVAVKNAMSDLCSEQRG---DCIAILDCSFQGDA 378 (590) Q Consensus 302 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~---~~~a~~d~p~~~~~ 378 (590) .......++||.++..... ...+ +...+..+...+++...+.+++.++..||.+++. 
.+++++..+.+.++ T Consensus 313 a~~a~~~LtGGtdG~~~~t---y~da---l~aLe~~e~~~i~~~t~d~ai~~~l~a~vkr~~~~g~~~~aVlg~~~~~t~ 386 (607) T protein:vir:10 313 ANFDTAFLTGGSTGDVPVS---WADK---FNGAIGNNVYYIIPLTSEENIHAELQAFIDEQHVLGYNYHAFVGGGFAEPL 386 (607) T ss_pred cccceeeeeCCCCCCchhh---HHHH---HHHHhhcCceEEEecCCCHHHHHHHHHHHHHHHhCCCcEEEEecCCCCCCH Confidence 1222344667777643211 1112 2222233455666666777888889999876543 48999999999999 Q ss_pred HHHHHHHHhhcCcccceEEEEcCeEEEeecccCceeeecH---HHHHHHHHHHhhccCCceECcCCcccceeecccccee Q lcl|NC_016163. 379 QQTIDYRTGNISMSTYFTAIFGQHMNVYDEYNGETITVTS---TYFLASMIPSNDDQNGIQWTFVGPRRGVISGFTDINF 455 (590) Q Consensus 379 ~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~pp---sg~~AG~~A~~D~~~G~~~sPan~~~~~i~g~~~~~~ 455 (590) +++.++...+ ++.+..+++|+.++.| .|+.+..|+ ++++||++|..+ +++||.|.. +. ..++.. T Consensus 387 ~~~~t~a~~~---N~ervv~V~~~~~~~~--~G~~~~~~~~~~Aa~vAGl~Ag~~----~~~SlT~k~---i~-~~~v~~ 453 (607) T protein:vir:10 387 EQILSRQVNI---NDSRFGLVGQSGHVQE--GGESVHVPAYLMAAYVGGLSSSLG----VAVPITNKK---LA-LVDLDQ 453 (607) T ss_pred HHHHHHHHhh---CCCcEEEEecCeeEee--CCcceeccHHHHHHHHHHHHhcCc----cccCcccce---ec-cccccc Confidence 9988877654 8889999999987766 356666665 689999999776 788998854 53 557888 Q ss_pred ecChhHHhhhhhcCceEEEEecC----CeEEEecceec---CCCcccceehhhhHHHHHHHHHHHHH-HHHhcCCCCHHH Q lcl|NC_016163. 456 YPNEPWKEKLYLAQVNYIERDPK----KISFATQLTSQ---TSRSALSYINNVRVLLRIRREVEKMM-ADYRQEFQDNTT 527 (590) Q Consensus 456 ~~~~~e~~~Ln~~gIn~i~~~~~----~G~~~wG~rT~---s~d~~~~~i~vrR~~~~i~~si~~~~-~~~vfepn~~~l 527 (590) .+++.|++.|+++|+.++...++ ++++++..-|. ..+..|++|+++|++|+|.+.|++.+ +||++++|++.. 
T Consensus 454 ~lt~~e~e~ai~~Gv~~l~~~~~~~~~~~vrIv~~ItT~t~~~~~~~~~i~viRv~D~i~~dir~~~~~~yIGk~nnd~~ 533 (607) T protein:vir:10 454 NFSGDDLNTLNQNGVIGIEHLVNRNATGGYYIVQDVSTNTVSSSHVDGSLYLGELTDFLFDNLRFVLRDTYIGSNIRSTS 533 (607) T ss_pred cCCHHHHHHHHhCCeEEEEEccCccccceEEEeeeeeeccCCCCcchheeehhhhHHHHHHHHHHHHhhcCCcccCCcch Confidence 99999999999999999976544 36777755443 34678999999999999999999876 589999999999 Q ss_pred HHHHHHHHHHHHHHHHh--CCceEEEecCCCCCHHHhhCCEEEEEEEEEecCccceEEEEEEeeC Q lcl|NC_016163. 528 YDSMSYSLNNYLQQWVA--NRACSSISGTVYASDYDKQQSIARVKVELVFTGVIERIAIDLVVNK 590 (590) Q Consensus 528 ~~~v~~~i~~~L~~l~~--~ga~~~~~d~~~nt~~~i~~G~l~~~i~~ap~~paefi~~~~~~~~ 590 (590) |..++..+..||..+|+ .|+|.++-. + +-+-..+..++++++.+.|+.++|+|.+++++.- T Consensus 534 ~~~vk~~i~~~L~~~~l~~~gaI~df~~-e-dv~v~~~~D~v~v~~~v~Pv~~iekIyvtv~v~~ 596 (607) T protein:vir:10 534 ADDIKSTVASYLYSEMNNDDGLIVDFSE-S-DIVVTISGTVVYIQFAVAPTQEIKNIVVSGTYSN 596 (607) T ss_pred HHHHHHHHHHHHHHHHHHhcCceeCCCc-c-ccEEeeCCCEEEEEEEEEEcccceEEEEEEEEEE Confidence 99999999999987766 578876532 1 2222355678999999999999999999999887 No 43 >protein:vir:101326 Length: 529 # NCBI annotation: gp22 # Family: family:all:9453 # MgeID: mge:1512 # MgeName: P1 # Cross-refs: genbank:acc:YP_006525;genbank:gi:46401679;genbank:GeneID:2777386 Probab=100.00 E-value=3.7e-51 Score=297.07 Aligned_cols=496 Identities=17% Similarity=0.161 Sum_probs=323.7 Q ss_pred Ccccc-------CCceEEEEecCCC--ceecccccceeEEEeecCCCCCCccEEec--CHHHHHHhcCCccccccccHHH Q lcl|NC_016163. 1 MADYL-------HPSVSSRIVDNSA--VYATAAGNTVLYAAIHSAIGRDNAVEFVT--TTDEFLFKFGNPNLSKYGQTSY 69 (590) Q Consensus 1 Mp~yl-------~PGVYveEi~s~~--~~i~gv~Tsv~~~vg~~~~Gp~~~p~~v~--s~~e~~~~fG~~~~~~~~~l~~ 69 (590) |..|- .-||-|.+++.-. +.-.|+++++.+++|.++||++++|.+|+ +|.+|.-.++++. 
+.....+ T Consensus 1 ~~~ysi~q~ig~aSGvav~pi~~d~t~~~~~g~g~~v~a~Vgif~RG~i~k~~~Vt~~n~~~~LGep~~~~--~ga~~E~ 78 (529) T protein:vir:10 1 MSQYSIQQSLGNASGVAVSPINADATLSTGVALNSSLWAGIGVFARGKPFTVLAVTESNYEDVLGEPLKPS--SGSQFEP 78 (529) T ss_pred CCceehhhhhhhhcccccCCcCcccccchheecCceEEEEEEEeecCCCcceEEEchhHHHHHhccccCCC--cchhhhh Confidence 99885 4799999886333 33345678999999999999999999999 8999999999865 3356888 Q ss_pred HHHHHHcCC-CcEEEEEEecCCccccccccccceeecccccccceeeeeeccccccccccceEEEeeccccCCcceeeEe Q lcl|NC_016163. 70 NILNWLQSG-GTAYVLRVMPDDAKFANSLISIKTTAAADPAKATVLVTAKAQTTNTASKNAMKTILSGGTAGETPLCFIV 148 (590) Q Consensus 70 av~~ff~nG-G~~~vvRv~~~~a~~a~~~~~~~~~~a~~~~~~~~~v~~~~~~~~~a~~~~~~~~~~~~t~~~~~~~~~~ 148 (590) .++.|++-+ |+||||||+++|||...-.... ...++.-...+.. +....+.........++.... T Consensus 79 ~~h~~eA~~~~s~yVVRvv~~dak~p~i~~~~----~~~~~~s~~~~s~---~~~l~~G~~~~iy~~Dgd~~~------- 144 (529) T protein:vir:10 79 IRHVYEAIQQTSGYVVRAVPDDAKFPIIMFDE----SGEPAYSALPYGS---EIELDSGEAFAIYVDDGDPCI------- 144 (529) T ss_pred HhhhhhhhcCCceEEEEEcccccCCceEEecC----Cccchhhcccccc---cccccccceEEEEEecCcCcc------- Confidence 888888554 4899999999998865222211 1111111000000 000111111122222222110 Q ss_pred ecccccccccceEEEEeeccccccccccccceeeeeecccCCCceeeeeeeeeccccccccccccceeeeeeccccceee Q lcl|NC_016163. 149 PKGRGENYNGYGFRLSLRSDYDNTYNFRTYNLSVTVKDSTGADVVVEGPYIVSFDPEAKDKSRQSIYYANIINKYSQYVE 228 (590) Q Consensus 149 ~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~l~i~v~d~~~~~~v~e~~~~ls~~~da~~~~~~~~~~~~vv~~~s~~v~ 228 (590) .+++.+.+..+ ..+.....-+.+.+...+..+..+++|.|+ .+...++.+..+..-|++++++..|.... 
T Consensus 145 -------s~~~~l~i~~~--~ads~g~e~~~l~~~~~~~~g~~~~let~~-~sl~~~a~dd~G~~~yl~svle~~s~~l~ 214 (529) T protein:vir:10 145 -------SPTRELTIETA--TADSAGNERFLLKLTQTTSLGVVTTLETHT-VSLAEEAKDDMGRLCYLPTALEARSKYLR 214 (529) T ss_pred -------CCceEEEEEee--ccccCCCccceeeEEEEeecCCceEEEEEE-eeeeechhhhcCCccchhHHHhhccCcee Confidence 11222222222 112222223344455555566677777777 45666777777777777777766554332 Q ss_pred ecCccccceeeeeecccccccCccccceecccccccccccccccccccceeecccccccccccccccccceeeeeccccc Q lcl|NC_016163. 229 IVDNRSAFETISEFVVGDSEADPQKVDIIFGQERAVTPAETIHANVVWKSSSVETDDPSYDATAANFNNIQYLTEGSEGT 308 (590) Q Consensus 229 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 308 (590) .-........ .. +... .+-. T Consensus 215 ai~~~e~~~t-----------------~~-------------------------------------~~t~------~d~~ 234 (529) T protein:vir:10 215 AVVNEELIST-----------------AK-------------------------------------VTNK------KSLA 234 (529) T ss_pred eeeeeccccc-----------------cc-------------------------------------hhhh------hhhh Confidence 1110000000 00 0000 0012 Q ss_pred ccccccccee--ccchhhHHHHHHHhhhccCCceeeecccchhHHHHHHHHHHHHhcCCeEEEEecCCCCCHHHHHHHHH Q lcl|NC_016163. 309 WTGGNEESAL--LVKGYSGVLAPEILDKQQYEIDVLLDGNNEVAVKNAMSDLCSEQRGDCIAILDCSFQGDAQQTIDYRT 386 (590) Q Consensus 309 ~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~a~~d~p~~~~~~~~~~~~~ 386 (590) .++|+++..+ ...+ ...++..+.+.+.....++..++++.++..+++..|.+.++++| .|+|+++++.++++|++ T Consensus 235 f~~GtdG~~~~i~s~~-y~~A~~~L~n~p~d~~~il~~g~y~~a~I~~L~~ic~~~~~d~f--~DV~~~LT~~aA~~~~e 311 (529) T protein:vir:10 235 FTGGTNGDQSKISTAA-YLRAVKVLNNAPYMYTAVLGLGCYDNAAITALGKICADRLIDGF--FDVKPTLTYAEALPAVE 311 (529) T ss_pred ccCCccccccccchHH-HHHHHHHhcCCcceeeeeeccCCccHHHHHHHHHHHhhhhhcEE--EcCCCCcCHHHHHHHHH Confidence 2233333222 1222 33455556666666677778888899999999999998886655 59999999999999999 Q ss_pred hhcCcccc--eEEE-EcCeEEEeecccCceeeecHHHH--HHHH--HHHhhccCCceECcCCcccceeeccccce--eec Q lcl|NC_016163. 
387 GNISMSTY--FTAI-FGQHMNVYDEYNGETITVTSTYF--LASM--IPSNDDQNGIQWTFVGPRRGVISGFTDIN--FYP 457 (590) Q Consensus 387 ~~~~~~s~--~~~~-~~p~~~~~d~~~~~~~~~ppsg~--~AG~--~A~~D~~~G~~~sPan~~~~~i~g~~~~~--~~~ 457 (590) +++...+. ++.. |||| +..||.++.+..+++||. +|+. .++.....|+|++||+.+++.|.- .+++ +.. T Consensus 312 ~~gl~~~~~~~~s~y~~P~-~~~D~~tg~k~~~GlsG~A~~akargv~~na~v~g~hY~pAGe~r~~inr-~~I~~ly~~ 389 (529) T protein:vir:10 312 DTGLLGTDYVSCSVYHYPF-SCKDKWTQSRVVFGLSGVAYAAKARGVKKNSDVGGWHYSPAGEERAVIAR-ASIQPLYPE 389 (529) T ss_pred hcCccccCceeeEEEEcce-eeccccccCceeeCCCcceeeccccceeecccccccccccCCCccceeec-ccceeccCC Confidence 87654444 4554 4665 499999999999999995 3332 133333344599999998776643 3333 445 Q ss_pred ChhHHhhhhhcCceEEEEecCCeEEEecceec-CCCcccceehhhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHH Q lcl|NC_016163. 458 NEPWKEKLYLAQVNYIERDPKKISFATQLTSQ-TSRSALSYINNVRVLLRIRREVEKMMADYRQEFQDNTTYDSMSYSLN 536 (590) Q Consensus 458 ~~~e~~~Ln~~gIn~i~~~~~~G~~~wG~rT~-s~d~~~~~i~vrR~~~~i~~si~~~~~~~vfepn~~~l~~~v~~~i~ 536 (590) ++.|...|-.++||+|.--.++++.+-.+-|+ ..++.|||+|+++|+++|++.+.+..+|.+|||++..+|. +++.++ T Consensus 390 d~~e~~~lv~~riNPV~~~~~g~~~idDsLt~~~knny~R~~hv~~lmn~I~~~~~k~a~~~~~~Pd~it~~g-l~~~l~ 468 (529) T protein:vir:10 390 DTPDEEAMVKGRLNKVSVGTSGQMIIDDALTCCTQDNYLHFQHVPSLMNAISRFFVQLARQMKHSPDGITAAG-LTKGMT 468 (529) T ss_pred CccCHHHHHhhccCeeeeeccCcceeeeeeceeeeCCchhhhhHHHHHHHHHHHHHHHHHHHhhCCChHHHHH-HHHhHH Confidence 66677778888888887655554433223333 2489999999999999999999999999999999998887 999999 Q ss_pred HHHHHHHhCCceEEEecCC---------CCCHHHhhCCEEEEEEEEEecCccceEEEEEEeeC Q lcl|NC_016163. 537 NYLQQWVANRACSSISGTV---------YASDYDKQQSIARVKVELVFTGVIERIAIDLVVNK 590 (590) Q Consensus 537 ~~L~~l~~~ga~~~~~d~~---------~nt~~~i~~G~l~~~i~~ap~~paefi~~~~~~~~ 590 (590) .+|+.+|+.|+|.+..|.. 
+-+|.| .+++.++++++|...++.|.+.-..=| T Consensus 469 ~~L~r~~asgalv~prdp~~~G~epy~~~V~q~d--~D~~~v~~~~~ptGv~Rri~~~p~l~~ 529 (529) T protein:vir:10 469 KLLDRFVASGALVAPRDPDADGTEPYVLKVTQAE--FDKWEVVWACCPTGVARRIQGVPLLIK 529 (529) T ss_pred HHHHHHHhcCceecccCccCCCCCceEEEEeecc--cCeEEEEEEeecCCceeeEEeeeeecC Confidence 9999999999999987732 124544 489999999999999999988777777 No 44 >protein:vir:102957 Length: 437 # NCBI annotation: sheath tail protein # Family: family:all:632 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945292;genbank:gi:39653727;uniprot:Q708M0;genbank:GeneID:2672871 Probab=100.00 E-value=1.6e-47 Score=277.13 Aligned_cols=415 Identities=13% Similarity=0.114 Sum_probs=252.4 Q ss_pred Ccc-------ccCCceEEEEecCCCceecccccceeEEEeecCCCCCCccEEecCHHHHHHhcCCccccccccHHHH-HH Q lcl|NC_016163. 1 MAD-------YLHPSVSSRIVDNSAVYATAAGNTVLYAAIHSAIGRDNAVEFVTTTDEFLFKFGNPNLSKYGQTSYN-IL 72 (590) Q Consensus 1 Mp~-------yl~PGVYveEi~s~~~~i~gv~Tsv~~~vg~~~~Gp~~~p~~v~s~~e~~~~fG~~~~~~~~~l~~a-v~ 72 (590) |+- -.-|||||||++++.++|++++|++++|++.+.|||+++|++|+||+||++.||+.. .+..|. ++ T Consensus 1 m~gg~~~~~~k~~PGvYi~~~~~~~~~i~~~~~~~~a~~~~~~~Gp~~~~~~i~s~~d~~~~fG~~~----~~~~~~~~~ 76 (437) T protein:vir:10 1 MAGGIWKRQNKVRPGAYINVKSKDIAMTRLGGDGVVTVPLALSFGQSKKLMKIRRGEDLFKKLGYEQ----ESPQLLLLN 76 (437) T ss_pred CCcceecccceecCceeEEEecCCcceeeccCCcEEEEEEEecCCCCceeEEEecHHHHHHHcCCcc----chhHHHHHH Confidence 773 235999999999999999999999999999999999999999999999999999743 234444 45 Q ss_pred HHHcCCC-cEEEEEEecCCccccccccccceeecccccccceeeeeeccccccccccceEEEeeccccCCcceeeEeecc Q lcl|NC_016163. 73 NWLQSGG-TAYVLRVMPDDAKFANSLISIKTTAAADPAKATVLVTAKAQTTNTASKNAMKTILSGGTAGETPLCFIVPKG 151 (590) Q Consensus 73 ~ff~nGG-~~~vvRv~~~~a~~a~~~~~~~~~~a~~~~~~~~~v~~~~~~~~~a~~~~~~~~~~~~t~~~~~~~~~~~~~ 151 (590) .|| ||| +||++|+.++ +++....... 
+..++ .+ T Consensus 77 ~~~-~g~~~~~~~R~~~g-~~a~~tl~~~--------------~~~~A------------------------------~~ 110 (437) T protein:vir:10 77 EAF-KRVSEVLLYRLNTG-EKANVSLSDN--------------VTAQA------------------------------KY 110 (437) T ss_pred HHh-cCCCEEEEEECCCC-ceeeEeeccc--------------eEEEe------------------------------cc Confidence 555 566 6999998532 2111111001 11111 12 Q ss_pred cccccccceEEEEeeccccccccccccceeeeeecccCCCceeeeeeeeeccccccccccccceeeeeeccccceeeecC Q lcl|NC_016163. 152 RGENYNGYGFRLSLRSDYDNTYNFRTYNLSVTVKDSTGADVVVEGPYIVSFDPEAKDKSRQSIYYANIINKYSQYVEIVD 231 (590) Q Consensus 152 ~g~~~~~~~~~~~~~~~~~~~~~~~~~~l~i~v~d~~~~~~v~e~~~~ls~~~da~~~~~~~~~~~~vv~~~s~~v~~~~ 231 (590) +|.|++.+.+.+......... ..+.....+ ...+. ..+...+.. .. T Consensus 111 ~G~~gn~i~v~v~~~~~d~~~-------~~v~~~~~~---~~~d~---------------------~~v~~~~~~---~~ 156 (437) T protein:vir:10 111 SGVRGNDITVTVKTNVDDPSS-------FDVVTFLDT---VVMDL---------------------QTVKVLADL---KN 156 (437) T ss_pred CCcccceeEEEEeeccCCccc-------eEEEEecCc---ceeee---------------------eehhhhhhh---hh Confidence 333444333322211111000 000000000 00000 000000000 00 Q ss_pred ccccceeeeeecccccccCccccceecccccccccccccccccccceeecccccccccccccccccceeeeecccccccc Q lcl|NC_016163. 232 NRSAFETISEFVVGDSEADPQKVDIIFGQERAVTPAETIHANVVWKSSSVETDDPSYDATAANFNNIQYLTEGSEGTWTG 311 (590) Q Consensus 232 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 311 (590) . .+.... ............+++ T Consensus 157 ----------------------------n--------------~~v~~~----------------~~~~l~~~a~~~LtG 178 (437) T protein:vir:10 157 ----------------------------N--------------ALVEFS----------------GTGELQPVAGAKLTG 178 (437) T ss_pred ----------------------------h--------------cccccc----------------cccccccccceeeec Confidence 0 000000 000000011123334 Q ss_pred ccccceeccchhhHHHHHHHhhhccCCceeeecccchhHHHHHHHHHHHHhcC---CeE-EEEecCCCCCHHHHHHHHHh Q lcl|NC_016163. 
312 GNEESALLVKGYSGVLAPEILDKQQYEIDVLLDGNNEVAVKNAMSDLCSEQRG---DCI-AILDCSFQGDAQQTIDYRTG 387 (590) Q Consensus 312 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~---~~~-a~~d~p~~~~~~~~~~~~~~ 387 (590) |.++. .+..++. +.+...+....+.++.+..+.+++.++.+||.+.|. ..+ +++ ++... T Consensus 179 G~dg~-~t~~dy~----~al~~le~~~~n~l~~~~~d~~~~t~~~~~ik~~r~~~g~~~~~V~-~~~~~----------- 241 (437) T protein:vir:10 179 GTDGA-ISTQDYL----EYFKALETVEFNYMALPVEDASIKKAAINFIKRMREDEGLGAQLVV-ADSDA----------- 241 (437) T ss_pred cccCC-CChhHHH----HHHHHhccCcceEEEecCCChhHHHHHHHHHHHHHhccCceEEEEe-CCCCC----------- Confidence 44443 2222322 223333455677888888888899999999876542 233 444 33222 Q ss_pred hcCcccceEEEEcCeEEEeecccCceee--ecHHHHHHHHHHHhhccCCceECcCCcccceeeccccceeecChhHHhhh Q lcl|NC_016163. 388 NISMSTYFTAIFGQHMNVYDEYNGETIT--VTSTYFLASMIPSNDDQNGIQWTFVGPRRGVISGFTDINFYPNEPWKEKL 465 (590) Q Consensus 388 ~~~~~s~~~~~~~p~~~~~d~~~~~~~~--~ppsg~~AG~~A~~D~~~G~~~sPan~~~~~i~g~~~~~~~~~~~e~~~L 465 (590) +.....-+.+-....| ...+ --..+++||++|..+ +++|+.|. .+.++.++...+++.|++.| T Consensus 242 ----d~e~Iin~~n~~~~~~----~~~~~~~~~~a~vAG~~Ag~~----~~~S~t~~---~~~~~~~v~~~~t~~e~~~~ 306 (437) T protein:vir:10 242 ----DSEAVINVKNGVILSD----KTVIDKTKATVWVAAASANAG----VEKSLTYE---KYEDSVDVVGRLSHTETEDA 306 (437) T ss_pred ----CCceEEEeecceeecC----cceechhhHHHHHHHHhccCc----cccCcccc---ccCCcccccccCCHHHHHHH Confidence 1222222222222221 1111 113588999999764 67788884 47788888889999999999 Q ss_pred hhcCceEEEEecCCeEEEecceecC-----CCcccceehhhhHHHHHHHHHHHHHH-HHhcC-CCCHHHHHHHHHHHHHH Q lcl|NC_016163. 466 YLAQVNYIERDPKKISFATQLTSQT-----SRSALSYINNVRVLLRIRREVEKMMA-DYRQE-FQDNTTYDSMSYSLNNY 538 (590) Q Consensus 466 n~~gIn~i~~~~~~G~~~wG~rT~s-----~d~~~~~i~vrR~~~~i~~si~~~~~-~~vfe-pn~~~l~~~v~~~i~~~ 538 (590) +++|+.++.+..++-..++|-.|+. .++.|++|.++|++|+|.+.|++.+. 
+|+++ |||+..|..++..|..| T Consensus 307 i~~G~~vl~~~~~~v~i~~gInTltt~~~~~~~~~~ki~vir~~D~i~~di~~~~~~~yiGk~~N~~~~r~~~~~~i~~y 386 (437) T protein:vir:10 307 LLKGQFVFTARRGRAVVEQDINSHVSFTIEKNQDFRKNRILRTLDDIVNDTRYAFSEYFLGKVSNNEDGRQAFKANRIRY 386 (437) T ss_pred HhCCcEEEEEeCCeEEEEEccccccccCCCCCchhhhhhHHHHHHHHHHHHHHHHHhccccccCCCHHHHHHHHHHHHHH Confidence 9999999977544334447755543 25789999999999999999999877 48887 79999999999999999 Q ss_pred HHHHHhCCceEEEecCCCCCHHHhhCCEEEEEEEEEecCccceEEEEEEee Q lcl|NC_016163. 539 LQQWVANRACSSISGTVYASDYDKQQSIARVKVELVFTGVIERIAIDLVVN 589 (590) Q Consensus 539 L~~l~~~ga~~~~~d~~~nt~~~i~~G~l~~~i~~ap~~paefi~~~~~~~ 589 (590) |++|+++|+|...-..+....+......+++.+.+.|+.+||+|.+++.+. T Consensus 387 l~~l~~~g~I~~~~~~d~~v~~~~~~~~v~v~~~v~~~dame~iy~ti~v~ 437 (437) T protein:vir:10 387 FKDLEARGAIEDFKVEDIEVLRGELKESVVVNVKVKPVDSMEKLYMTVTVE 437 (437) T ss_pred HHHHHhCCCccCCCceeEEeecCCCCCEEEEEEEEEEeeeeeeEEEEEEec Confidence 999999999986432221111112346889999999999999999999999 No 45 >protein:vir:105470 Length: 451 # NCBI annotation: putative phage sheath tail protein # Family: family:all:632 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529880;genbank:gi:90592620;genbank:GeneID:3974534 Probab=100.00 E-value=9.1e-40 Score=234.61 Aligned_cols=426 Identities=16% Similarity=0.114 Sum_probs=247.0 Q ss_pred Ccc--cc-----CCceEEEEecCCCceecccccceeEEEee-cCCCCCCccEEecCHHHHHHhcCCccccccccHHHHHH Q lcl|NC_016163. 1 MAD--YL-----HPSVSSRIVDNSAVYATAAGNTVLYAAIH-SAIGRDNAVEFVTTTDEFLFKFGNPNLSKYGQTSYNIL 72 (590) Q Consensus 1 Mp~--yl-----~PGVYveEi~s~~~~i~gv~Tsv~~~vg~-~~~Gp~~~p~~v~s~~e~~~~fG~~~~~~~~~l~~av~ 72 (590) |+- .+ -|||||||++++.++++++++++.+|++. 
..||| ++|+.|+|++||++.||....+ .-..+++ T Consensus 1 magg~~~~~~K~~PGvYi~~~~~~~~~~~~~~~~~~~~i~~~~~~g~-~~~v~i~~~~d~~~~fG~~~~~---~~~~~~~ 76 (451) T protein:vir:10 1 MAGGTWKAQDKRRPGTYINVVGNGQREAASSLGRVLLIRDKGLGWGK-NGVIEVEANSDFTKKLGTTLDD---PSLTALK 76 (451) T ss_pred CCceeeccceeecCceEEEEeccCcceeeccCCcEEEEEeeecCCCC-cccEEeecHHHHHHHcCCcccc---hhHHHHH Confidence 773 22 49999999999999999999988888874 56777 6799999999999999974321 1223778 Q ss_pred HHHcCCCcEEEEEEecCCccccccccccceeecccccccceeeeeeccccccccccceEEEeeccccCCcceeeEeeccc Q lcl|NC_016163. 73 NWLQSGGTAYVLRVMPDDAKFANSLISIKTTAAADPAKATVLVTAKAQTTNTASKNAMKTILSGGTAGETPLCFIVPKGR 152 (590) Q Consensus 73 ~ff~nGG~~~vvRv~~~~a~~a~~~~~~~~~~a~~~~~~~~~v~~~~~~~~~a~~~~~~~~~~~~t~~~~~~~~~~~~~~ 152 (590) .||++|.+||+.|+.++.+..+...... +. +.+.++ T Consensus 77 ~~~~g~~~v~~yrl~~g~~a~~t~~~~~--------------~~------------------------------~~Aky~ 112 (451) T protein:vir:10 77 ETLKGASKVLVLNPNEGTAATLTKEGLP--------------WT------------------------------VTANYP 112 (451) T ss_pred HHhcCCcEEEEEEcCCCceEEEEeecCc--------------eE------------------------------EEEeeC Confidence 7886554799999854322111110000 00 113334 Q ss_pred ccccccceEEEEeeccccccccccccceeeeeecccCCCceeeeeeeeeccccccccccccceeeeeeccccceeeecCc Q lcl|NC_016163. 153 GENYNGYGFRLSLRSDYDNTYNFRTYNLSVTVKDSTGADVVVEGPYIVSFDPEAKDKSRQSIYYANIINKYSQYVEIVDN 232 (590) Q Consensus 153 g~~~~~~~~~~~~~~~~~~~~~~~~~~l~i~v~d~~~~~~v~e~~~~ls~~~da~~~~~~~~~~~~vv~~~s~~v~~~~~ 232 (590) |.+++.+.+.+....+....+ .+..... ...++... +... ... .. ..+.++..... T Consensus 113 G~~Gn~i~v~v~~~~~d~~~~-------~v~t~~g---~~~vd~qt-v~~~-~~~--el----------~~nd~V~a~~~ 168 (451) T protein:vir:10 113 GEKGNQITVSVEVSPADQNAA-------TVSTIFG---TKLVDEQS-IKFN-ELD--KF----------KGNDYITAKVV 168 (451) T ss_pred CcCCceEEEEEecccCCcCce-------EEEEEEC---CeEEEEEE-eecc-chh--hc----------cCCceEEEEec Confidence 555555444443222211111 1111100 00001000 0000 000 00 00001100000 Q ss_pred cccceeeeeecccccccCccccceecccccccccccccccccccceeecccccccccccccccccceeeeeccccccccc Q lcl|NC_016163. 
233 RSAFETISEFVVGDSEADPQKVDIIFGQERAVTPAETIHANVVWKSSSVETDDPSYDATAANFNNIQYLTEGSEGTWTGG 312 (590) Q Consensus 233 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 312 (590) ..+... . ...... . +...++ T Consensus 169 ---------------------------~~g~~~------------------~-----------~~~~~l---~-~~~~gg 188 (451) T protein:vir:10 169 ---------------------------EEGSSK------------------P-----------VAFTNV---S-GTLTGG 188 (451) T ss_pred ---------------------------cccccc------------------c-----------eeeeec---c-cccccc Confidence 000000 0 000000 0 000011 Q ss_pred cccceeccchhhHHHHHHHhhhccCCceeeecc-c-chhHHHHHHHHHHHHhc----CCeEEEEecCCCCCHHHHHHHHH Q lcl|NC_016163. 313 NEESALLVKGYSGVLAPEILDKQQYEIDVLLDG-N-NEVAVKNAMSDLCSEQR----GDCIAILDCSFQGDAQQTIDYRT 386 (590) Q Consensus 313 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~-~~~~~~~a~~~~~~~~~----~~~~a~~d~p~~~~~~~~~~~~~ 386 (590) .... ...++. . .+...+....+.++.+ . ....++..+.++|.+.| ....+++..++... T Consensus 189 ~~~~--~~~~~~-~---~l~~~e~~~~n~l~~~~~~~~~~i~~~~~a~ik~~r~~~g~~~~aVl~~~~~~~--------- 253 (451) T protein:vir:10 189 TTTE--SNKVES-L---LNDALENEEYAVVTTAGFEPSSNMNKLVVEAVKRLRENEGRKVRGVIPTDADTT--------- 253 (451) T ss_pred cccC--CccchH-H---HHHHhccceeeEEEEccCCCchHHHHHHHHHHHHHHHhcCCeEEEEecCccCCC--------- Confidence 1110 011111 1 1223344455555433 2 33456666777776544 23457776543221 Q ss_pred hhcCcccceEEEEcCeEEEeecccCceeeecH---HHHHHHHHHHhhccCCceECcCCcccceeeccccceeecChhHHh Q lcl|NC_016163. 387 GNISMSTYFTAIFGQHMNVYDEYNGETITVTS---TYFLASMIPSNDDQNGIQWTFVGPRRGVISGFTDINFYPNEPWKE 463 (590) Q Consensus 387 ~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~pp---sg~~AG~~A~~D~~~G~~~sPan~~~~~i~g~~~~~~~~~~~e~~ 463 (590) .+......+.......| + +..++ .+++||++|..+ +.+|+.|. 
.+.|+.++...+++.|++ T Consensus 254 ----~d~egiinv~n~~~~~d---g--~~~~~~~~~~~vAG~~Ag~~----~~~S~T~~---~~~~~~~v~~~~t~~e~~ 317 (451) T protein:vir:10 254 ----YNYEGISTVVNGYTLSD---G--TNVDVKDATGYFAGISASAD----VATSLTYF---EVEDAVSAYPKFDNEKTI 317 (451) T ss_pred ----CCCcceEEeecceEecC---c--eeechhhhHHHHHHHHcccc----cccCccce---ecCCceeeeeeCCHHHHH Confidence 12222333333332222 1 12233 489999999764 67788884 477888888899999999 Q ss_pred hhhhcCceEEEEecCCeEEE-ecceecC-----CCcccceehhhhHHHHHHHHHHHHHHH-HhcC-CCCHHHHHHHHHHH Q lcl|NC_016163. 464 KLYLAQVNYIERDPKKISFA-TQLTSQT-----SRSALSYINNVRVLLRIRREVEKMMAD-YRQE-FQDNTTYDSMSYSL 535 (590) Q Consensus 464 ~Ln~~gIn~i~~~~~~G~~~-wG~rT~s-----~d~~~~~i~vrR~~~~i~~si~~~~~~-~vfe-pn~~~l~~~v~~~i 535 (590) .+.++|..+++...++++++ +|-.|+. .+..|+.|.++|++|+|.+.|++.+.. |+.+ |||...|..++..| T Consensus 318 ~~i~~G~lvl~~~~g~~v~i~~~INTltt~~~~k~~~~~ki~vir~~D~i~~di~~~~~~~yiGk~~N~~~gr~~~~~~i 397 (451) T protein:vir:10 318 KALDAGQIVFTTRPGQRVVIEQDINSLHKFTAEKPQAFSKNRVIRTLDEIATNTENTFERTYLGNVGNNAAGRDLFKADR 397 (451) T ss_pred HHHhCCeEEEEEEcCCeEEEEEccccceecCCCCCcchhhhhHHHHHHHHHHHHHHHhhhccceecCCCHHHHHHHHHHH Confidence 99999999987667777765 6765653 256899999999999999999999874 7775 69999999999999 Q ss_pred HHHHHHHHhCCceEEEecCCCCCHHHhhCCEEEEEEEEEecCccceEEEEEEee Q lcl|NC_016163. 536 NNYLQQWVANRACSSISGTVYASDYDKQQSIARVKVELVFTGVIERIAIDLVVN 589 (590) Q Consensus 536 ~~~L~~l~~~ga~~~~~d~~~nt~~~i~~G~l~~~i~~ap~~paefi~~~~~~~ 589 (590) ..||++|+++|+|....+.+-....-.....+++.+.+.|+..||+|.+++++. 
T Consensus 398 ~~yl~~l~~~g~i~~~~~~d~~v~~~~~~~~v~v~~~v~pvdame~iy~t~~v~ 451 (451) T protein:vir:10 398 IAYLTSLQNRNMIQSFANTDITVEAGNDMDSIVVNLAVTPVDAMEKLYMTMVVR 451 (451) T ss_pred HHHHHHHHhCCCccCCCccceEEeecCCCCEEEEEEEEEEEeeeeeEEEEEEEc Confidence 999999999999976543322222223467899999999999999999999988 No 46 >protein:vir:107310 Length: 581 # NCBI annotation: gp123 # Family: family:all:1196 # MgeID: mge:1550 # MgeName: Catera # Cross-refs: genbank:acc:YP_656131;genbank:gi:109393333;genbank:GeneID:4156791 Probab=100.00 E-value=1.9e-38 Score=227.33 Aligned_cols=520 Identities=11% Similarity=0.107 Sum_probs=231.4 Q ss_pred CceEEEEecCCCceecccccceeEEEeecCCCCCCccEEecCHHHHHHhcCCccccccccHHHHHHHHHcCC---C-cEE Q lcl|NC_016163. 7 PSVSSRIVDNSAVYATAAGNTVLYAAIHSAIGRDNAVEFVTTTDEFLFKFGNPNLSKYGQTSYNILNWLQSG---G-TAY 82 (590) Q Consensus 7 PGVYveEi~s~~~~i~gv~Tsv~~~vg~~~~Gp~~~p~~v~s~~e~~~~fG~~~~~~~~~l~~av~~ff~nG---G-~~~ 82 (590) -||-++ +- ..-.||-..|+.+....+...++++.-|=-.. | +|. T Consensus 1 ~~~~~~------------------------------~~--~~~~~~~~~~~~~~~g~~~~~~~~~~i~g~~~g~~g~~~s 48 (581) T protein:vir:10 1 MAIDFS------------------------------QY--QTPGVYTEAVGAPQLGIRSSVPTAVAIFGTAVGYQTYRES 48 (581) T ss_pred Ceeeec------------------------------cc--cccchhhhhccccccceeeeeccccccccccccccccccc Confidence 011111 00 11122333333332222233333333322221 2 343 Q ss_pred EEEEecC-Cccccccccccceeecccccccceeeeeecccccccc--------ccceEEEeeccccCCcceeeEeecccc Q lcl|NC_016163. 83 VLRVMPD-DAKFANSLISIKTTAAADPAKATVLVTAKAQTTNTAS--------KNAMKTILSGGTAGETPLCFIVPKGRG 153 (590) Q Consensus 83 vvRv~~~-~a~~a~~~~~~~~~~a~~~~~~~~~v~~~~~~~~~a~--------~~~~~~~~~~~t~~~~~~~~~~~~~~g 153 (590) .|..|+ +..+....... ......|.. .+...+.+...-. +.++..+ . +.+.....+ . 
+ T Consensus 49 -~~~~p~~~~~~e~q~v~~--~~~~t~GtF--tLsf~G~tT~~I~~~asa~~v~~AL~~L-~--~i~~~~v~v-~----g 115 (581) T protein:vir:10 49 -IRINPDTGETITTQILAL--VGEPTGGSF--KLSLAGEPTGNIPFNATQGQVQSALRAL-P--NVEDDEVTV-L----G 115 (581) T ss_pred -cccCCCCCCccceEEEEE--EecCCCceE--EEEeCceecccccccCCHHHHHHHHhcc-C--CCCcceEEE-E----C Confidence 344442 22222111100 010111111 1111111100000 0000000 0 000000000 0 0 Q ss_pred cccccceEEEEeecccccc-ccc----cccceeeeeeccc-CCCceee---------eeeeeeccccc-cccccccceee Q lcl|NC_016163. 154 ENYNGYGFRLSLRSDYDNT-YNF----RTYNLSVTVKDST-GADVVVE---------GPYIVSFDPEA-KDKSRQSIYYA 217 (590) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~-~~~----~~~~l~i~v~d~~-~~~~v~e---------~~~~ls~~~da-~~~~~~~~~~~ 217 (590) .. +..+.+.-..+.... .+. ......+.+.... +...... ....++..... ....+.. +.. T Consensus 116 ~~--g~~~~VtF~g~~~~l~~~~~~lt~g~~~~vtV~~~~~g~~~~~~~~s~~gi~~~~~~l~~~~~~~~~~~gsd-~~~ 192 (581) T protein:vir:10 116 DP--GGPWTVTFTKAVAALTKDVTGLTGGDDPDLNIASEQTGVPAMNRALAKKGIKTDTIRVVNPNSGQVYVLGTD-YVV 192 (581) T ss_pred CC--CceEEEEEcCCccceeeeeceecCCCceeEEEeccccCcccccccccccccccccccccccccCcceecccc-cee Confidence 00 001111111000000 000 0000111111100 0000000 00000000000 0000000 000 Q ss_pred eeeccccce-eeecCccccceeeeeecccccccCccccceec-ccccccccccccccccccceeecccccccccccccc- Q lcl|NC_016163. 218 NIINKYSQY-VEIVDNRSAFETISEFVVGDSEADPQKVDIIF-GQERAVTPAETIHANVVWKSSSVETDDPSYDATAAN- 294 (590) Q Consensus 218 ~vv~~~s~~-v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~- 294 (590) ...+..... ....+.. .+......+..........+.. -.++........ .+... ......+..+..+.. 
T Consensus 193 ~~~~~~~~~~~~~~~D~---~t~~~~~~g~~~~~~~v~~~~~~~~d~~~~~~v~~---~~~~~-~~~~~~~~~~~~g~~~ 265 (581) T protein:vir:10 193 TRVNAGEDGEANTRDDL---YTIQRVVDGGHIDPGDIVQLSYRYTDPNYHEVIRF---TDPDD-IQDFYGPAFDEAGNVQ 265 (581) T ss_pred eecccCccccccccccc---eeeeeeecccccccceEEEEEEEeecCCcceeEEe---ecCcc-hhhhhhhhhhccCccc Confidence 000000000 0000000 0000000000000000000000 000000000000 00000 000000000000000 Q ss_pred ----cccceeeeeccccccccccccce--eccchhhHHHHHHHhhhccCC-ceeeecccchhHHHHHHHHHHHHhc---C Q lcl|NC_016163. 295 ----FNNIQYLTEGSEGTWTGGNEESA--LLVKGYSGVLAPEILDKQQYE-IDVLLDGNNEVAVKNAMSDLCSEQR---G 364 (590) Q Consensus 295 ----~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~a~~~~~~~~~---~ 364 (590) ..............++++.++.. ++..++. ..+...+..+ ..++++...+.+++.++..||.+.+ . T Consensus 266 ~~~t~~~~~~~tn~~~~~l~~gvd~~g~tvt~~dy~----~Al~ale~~~~~~ivv~~t~~~~v~a~l~ahv~~~s~~~~ 341 (581) T protein:vir:10 266 SEITLCAQLAITNGASTILACAVDPEGDTVTMGDYQ----NALNKFRDEDEIAIIVAGTGAQPIQALVQQHVSAQSNNKY 341 (581) T ss_pred cchhhhheeeeecccceeEEeeccCCCCccchHHHH----HHHHHHhcCCceEEEEeCCCCHHHHHHHHHHHHHHHhccC Confidence 00001112222223333333221 1222322 2233333333 3345666666777777877775542 3 Q ss_pred CeEEEEecCCCCCHHHHHHHHHhhcCcccceEEEEcCeEEEeecccC-ceeeecHHHHHHHHHHHhhccCCceECcCCcc Q lcl|NC_016163. 365 DCIAILDCSFQGDAQQTIDYRTGNISMSTYFTAIFGQHMNVYDEYNG-ETITVTSTYFLASMIPSNDDQNGIQWTFVGPR 443 (590) Q Consensus 365 ~~~a~~d~p~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~-~~~~~ppsg~~AG~~A~~D~~~G~~~sPan~~ 443 (590) ++.+++..+.........++.....++++.+..++||+..+++...+ ..+.+|+ .++|+.+|-+-...++++||.|+. 
T Consensus 342 ~~ravigV~g~~~~~~~~~~~~~a~~~n~~Rvvlv~p~~~~~~g~~~~~~v~lp~-y~~AA~vAGl~a~~~~~~slT~~~ 420 (581) T protein:vir:10 342 ERRAILGMDGSVTPVPSATRIANAQSIKDQRVALISPSSFVYYAPELNREVVLGG-QFMAAAVAGKSVSAIAAMPLTRKV 420 (581) T ss_pred CcEEEEEecCCCCCccHHHHHHhhccCCCceEEEEecCceeecCcccCceeccch-hhHHHHHHHHhhccccccCccccc Confidence 45566665433322222233333346789999999999998887543 4444555 333434444444556889999854 Q ss_pred cceeeccccceeecChhHHhhhhhcCceEEEEecCCeEEE-ecceecCCCcccceehhhhHHHHHHHHHHHHHHH--Hhc Q lcl|NC_016163. 444 RGVISGFTDINFYPNEPWKEKLYLAQVNYIERDPKKISFA-TQLTSQTSRSALSYINNVRVLLRIRREVEKMMAD--YRQ 520 (590) Q Consensus 444 ~~~i~g~~~~~~~~~~~e~~~Ln~~gIn~i~~~~~~G~~~-wG~rT~s~d~~~~~i~vrR~~~~i~~si~~~~~~--~vf 520 (590) +.|+.++...+++.|++.|+++|+++++.++++++++ ||-+|+..+++|++|++||++++|.+.+++.++| |++ T Consensus 421 ---i~gi~~l~~~~s~~e~e~ll~~Gv~~l~~~~~~~v~Iv~gItT~~s~~~~~~i~~iR~~D~v~~~ir~~~~~~~fIG 497 (581) T protein:vir:10 421 ---IRGFSGPAEVQRDGEKSRESSEGLMVIEKTPRNLVHVRHGVTTDPTSLHTREWNIIGQQDVMVYRIRDYLDADGLIG 497 (581) T ss_pred ---ccccccccccCCHHHHHHHHhCCeEEEEEecCCeEEEEeeeecCCCCCcceeeeeehhhhHHHHHHHHHhhhhcCCC Confidence 7788888999999999999999999999999999886 5778889999999999999999999999999864 778 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHhCCceEEEecCCCCCHHHhhCCEEEEEEEEEecCccceEEEEEEeeC Q lcl|NC_016163. 521 EFQDNTTYDSMSYSLNNYLQQWVANRACSSISGTVYASDYDKQQSIARVKVELVFTGVIERIAIDLVVNK 590 (590) Q Consensus 521 epn~~~l~~~v~~~i~~~L~~l~~~ga~~~~~d~~~nt~~~i~~G~l~~~i~~ap~~paefi~~~~~~~~ 590 (590) |||++.+|..|+..+.+||.+||+.|+|.++.+. 
..++.+++.++++++|.++|++|+|||.+|+.++= T Consensus 498 ~~n~~~~r~~ik~~i~~~L~~l~~~g~I~~~~~~-~~~~~~~~~d~v~V~i~v~Pv~~i~~I~vti~~~p 566 (581) T protein:vir:10 498 MPIYDTTIVQVKASAEAALVWLVDNNIIRGYRNL-KARQIERQPDVIEVRYEWRPAYPLNYIVVRYSIAP 566 (581) T ss_pred cccCHHHHHHHHHHHHHHHHHHHhcCcccCCccc-eeeeeecCCCEEEEEEEEEecccceEEEEEEEEec Confidence 9999999999999999999999999999987533 35777889999999999999999999999999998 No 47 >protein:vir:7653 Length: 581 # NCBI annotation: gp124 # Family: family:all:1196 # MgeID: mge:148 # MgeName: Bxz1 # Cross-refs: genbank:acc:NP_818197;genbank:gi:29566631;genbank:GeneID:1259809 Probab=100.00 E-value=7.8e-38 Score=224.01 Aligned_cols=526 Identities=12% Similarity=0.083 Sum_probs=236.4 Q ss_pred CccccCCceEEEEecCCCceecccccceeEEEeecCCCC-CCccEEecCHHHHHHhcCCccccccccHHHHHHHHHcCCC Q lcl|NC_016163. 1 MADYLHPSVSSRIVDNSAVYATAAGNTVLYAAIHSAIGR-DNAVEFVTTTDEFLFKFGNPNLSKYGQTSYNILNWLQSGG 79 (590) Q Consensus 1 Mp~yl~PGVYveEi~s~~~~i~gv~Tsv~~~vg~~~~Gp-~~~p~~v~s~~e~~~~fG~~~~~~~~~l~~av~~ff~nGG 79 (590) |+ ||...-.- -+....++ +....|- ...|+.+..+..+. | +.| T Consensus 1 ~~-----------~~~~~~~~--~~~~t~~~-~~~~~g~~~~~~~~~~i~g~~~----g------------------~~g 44 (581) T protein:vir:76 1 MA-----------IDFSQYQT--PGVYTEAV-GAPQLGIRSSVPTAVAIFGTAV----G------------------YQT 44 (581) T ss_pred Cc-----------cccccccc--chhhhhhc-cccccCcceeeeeeeeeccccc----c------------------ccc Confidence 22 11111000 01111111 1112221 22344332222221 1 012 Q ss_pred -cEEEEEEecC-Cccccccccccceeecccccccceeeeeecccccccc--------ccceEEEeeccccCCcceeeEee Q lcl|NC_016163. 80 -TAYVLRVMPD-DAKFANSLISIKTTAAADPAKATVLVTAKAQTTNTAS--------KNAMKTILSGGTAGETPLCFIVP 149 (590) Q Consensus 80 -~~~vvRv~~~-~a~~a~~~~~~~~~~a~~~~~~~~~v~~~~~~~~~a~--------~~~~~~~~~~~t~~~~~~~~~~~ 149 (590) +|- +|..|. +..++...... +.....+.. .+...+.+...-. +.++..+. +.+.....+ . 
T Consensus 45 ~~~s-~r~~p~~~~~~evq~v~~--~~~~t~G~f--tLt~~g~tT~~I~~~asa~~v~~AL~~L~---~i~~~~v~v-t- 114 (581) T protein:vir:76 45 YRES-IRINPDTGETITTQILAL--VGEPTGGSF--KLSLAGEPTGNIPFNATQGQVQSALRALP---NVEDDEVTV-L- 114 (581) T ss_pred ccce-eeecCCCCCCCceEEEEE--eecCCcceE--EEEeCceeccccccCCCHHHHHHHHhhcc---CCCCceEEE-E- Confidence 343 455443 22222111110 100111111 1111111100000 00000000 000000000 0 Q ss_pred cccccccccceEEEEeeccccc-------cccccccceeeeee-cccCCCceeeeeeeeeccccccccccccceeeeeec Q lcl|NC_016163. 150 KGRGENYNGYGFRLSLRSDYDN-------TYNFRTYNLSVTVK-DSTGADVVVEGPYIVSFDPEAKDKSRQSIYYANIIN 221 (590) Q Consensus 150 ~~~g~~~~~~~~~~~~~~~~~~-------~~~~~~~~l~i~v~-d~~~~~~v~e~~~~ls~~~da~~~~~~~~~~~~vv~ 221 (590) +...+ .+.+.-..+... ........+.+... ...+.....-....+..............++... . T Consensus 115 ---g~~~~--~~~V~F~g~~~~~~~~~~~ltg~~~~~~~V~~~~~G~~~~~~~l~~~g~~~~~~~~~~s~~~~~~~l~-~ 188 (581) T protein:vir:76 115 ---GDPGG--PWTVTFTKAVAALTKDVTGLTGGDNPDLNIASEQTGVPAMNRALAKKGIKTDTIRVVNPNSGQVYVLG-T 188 (581) T ss_pred ---cCCCc--eEEEEEcCCccceeEeeeeeecCCcceeEEEEEecCcCCcCceeeeccccccccceeecCCcceeeec-c Confidence 00000 111111100000 00000000011000 0000000000000000000000000111111100 0 Q ss_pred cccceeeecCccccc------eeeeeecccccccCccccceec-ccccccccccccccccccceeeccccccccccccc- Q lcl|NC_016163. 222 KYSQYVEIVDNRSAF------ETISEFVVGDSEADPQKVDIIF-GQERAVTPAETIHANVVWKSSSVETDDPSYDATAA- 293 (590) Q Consensus 222 ~~s~~v~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~- 293 (590) .........+..... .+.................+.. ..++...... ...++... ...........+. 
T Consensus 189 ~~~~~~~~~~~~~~~~~~~~~~t~~~~~~g~~~~~~~i~~~~~~~~D~~~~~~v---~~~~~~~~-~~~~~~~~~~~g~~ 264 (581) T protein:vir:76 189 DYVVTRVNAGEDGEANTRDDLYTIQRVVDGGHIDPGDIVQLSYRYTDPNYHEVI---RFTDPDDI-QDFYGPAFDEAGNV 264 (581) T ss_pred cccceeeccCcccceeeeeeeeeeEeecccccccceeEEEEEEEeecCCccceE---EEeccccc-ccceeeehhhcCcc Confidence 000000000000000 0000000000000000000000 0000000000 00000000 0000000000000 Q ss_pred ----ccccceeeeeccccccccccccce--eccchhhHHHHHHHhhhccCCc-eeeecccchhHHHHHHHHHHHHh---c Q lcl|NC_016163. 294 ----NFNNIQYLTEGSEGTWTGGNEESA--LLVKGYSGVLAPEILDKQQYEI-DVLLDGNNEVAVKNAMSDLCSEQ---R 363 (590) Q Consensus 294 ----~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~a~~~~~~~~---~ 363 (590) ...............++++.++.. ++..++. ..+...+..+. .++++...+.+++..+..||.+. + T Consensus 265 ~~e~~~~~~~~~t~~~~~~l~~gvd~~g~tvt~~dy~----~aL~ale~~~~~~ivvp~t~~~~i~a~l~ahv~~~s~~~ 340 (581) T protein:vir:76 265 QSEITLCAQLAITNGASTILACAVDPEGDTVTMGDYQ----NALNKFRDEDEIAIIVAGTGAQPIQALVQQHVSAQSNNK 340 (581) T ss_pred ccchhhhhheeeccccceEEEeeecCCCCccchHHHH----HHHHHHhcCCeEEEEEecCCChHHHHHHHHHHHHHHhcc Confidence 000001122222233333333211 2222322 22333333333 34455556666666676776543 2 Q ss_pred CCeEEEEecCCCCCHHHHHHHHHhhcCcccceEEEEcCeEEEeecccCceeeecHHHHHHHHHHHhhccCCceECcCCcc Q lcl|NC_016163. 364 GDCIAILDCSFQGDAQQTIDYRTGNISMSTYFTAIFGQHMNVYDEYNGETITVTSTYFLASMIPSNDDQNGIQWTFVGPR 443 (590) Q Consensus 364 ~~~~a~~d~p~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~ppsg~~AG~~A~~D~~~G~~~sPan~~ 443 (590) .++.+++..+.........++......+++.|..++|||.++++...+......|..++|+.+|....+.++++||.|+. 
T Consensus 341 ~~~ra~igv~g~~~~~~~~~~~~~a~~~ns~Rvvlv~p~~~~~~g~~~~~~~~lp~~~~AA~vAG~~a~~~~~~slT~~~ 420 (581) T protein:vir:76 341 YERRAILGMDGSVTPVPSATRIANAQSIKDQRVALISPSSFVYYAPELNREVVLGGQFMAAAVAGKSVSAIAAMPLTRKV 420 (581) T ss_pred CCceEEEEeeCCCCCchHHHHHHhhcccCCCcEEEEEcCceEeccccCCcceecchhhhhhhHHhhhhccccccCccccc Confidence 34555555443332222222333334678999999999999998765444444456666777777777888999999854 Q ss_pred cceeeccccceeecChhHHhhhhhcCceEEEEecCCeEEE-ecceecCCCcccceehhhhHHHHHHHHHHHHHHH--Hhc Q lcl|NC_016163. 444 RGVISGFTDINFYPNEPWKEKLYLAQVNYIERDPKKISFA-TQLTSQTSRSALSYINNVRVLLRIRREVEKMMAD--YRQ 520 (590) Q Consensus 444 ~~~i~g~~~~~~~~~~~e~~~Ln~~gIn~i~~~~~~G~~~-wG~rT~s~d~~~~~i~vrR~~~~i~~si~~~~~~--~vf 520 (590) +.|+.++...+++.|++.|+++|+++++.++++++++ ||-+|+..+++|++|++||+++++.+.+++.++| |++ T Consensus 421 ---i~g~~~~~~~~s~~e~e~ll~~Gv~~l~~~~~~~v~Iv~gItT~~s~~~~k~i~viR~~D~v~~~vr~~~~~~~fiG 497 (581) T protein:vir:76 421 ---IRGFSGPAEVQRDGEKSRESSEGLMVIEKTPRNLVHVRHGVTTDPTSLHTREWNIIGQQDVMVYRIRDYLDADGLIG 497 (581) T ss_pred ---ccccccccccCCHHHHHHHHhCCeEEEEEecCCeEEEEEeeecCCCCCccceeeehhhhHHHHHHHHHHHhhhcCCC Confidence 7788899999999999999999999999999999875 7889999999999999999999999999999874 677 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHhCCceEEEecCCCCCHHHhhCCEEEEEEEEEecCccceEEEEEEeeC Q lcl|NC_016163. 521 EFQDNTTYDSMSYSLNNYLQQWVANRACSSISGTVYASDYDKQQSIARVKVELVFTGVIERIAIDLVVNK 590 (590) Q Consensus 521 epn~~~l~~~v~~~i~~~L~~l~~~ga~~~~~d~~~nt~~~i~~G~l~~~i~~ap~~paefi~~~~~~~~ 590 (590) |||++.+|.+|+..+.+||..||+.|+|.+.. 
....++.+++.+++++++.++|++|+|||.+++.+.- T Consensus 498 ~~n~~~~r~~ik~~i~~~L~~l~~~g~I~g~~-~~~~~~~~~~~d~v~V~i~v~Pv~~ie~I~vt~~~~p 566 (581) T protein:vir:76 498 MPIYDTTIVQVKASAEAALVWLVDNNIIRGYR-NLKARQIERQPDVIEVRYEWRPAYPLNYIVVRYSIAP 566 (581) T ss_pred cccChHHHHHHHHHHHHHHHHHHhcCcccCcc-cceeeEEecCCCEEEEEEEEEecccceEEEEEEEEee Confidence 99999999999999999999999999999864 3346777889999999999999999999999999998 No 48 >protein:vir:78986 Length: 436 # NCBI annotation: putative sheath tail protein # Family: family:all:632 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110731;genbank:gi:134287348;genbank:GeneID:4955160 Probab=99.94 E-value=1e-27 Score=168.59 Aligned_cols=411 Identities=13% Similarity=0.131 Sum_probs=245.2 Q ss_pred Ccc--cc-----CCceEEEEecCCCceecccccceeEEEeecCCCCCCccEEecC---HHHHHHhcCCccccccccHHHH Q lcl|NC_016163. 1 MAD--YL-----HPSVSSRIVDNSAVYATAAGNTVLYAAIHSAIGRDNAVEFVTT---TDEFLFKFGNPNLSKYGQTSYN 70 (590) Q Consensus 1 Mp~--yl-----~PGVYveEi~s~~~~i~gv~Tsv~~~vg~~~~Gp~~~p~~v~s---~~e~~~~fG~~~~~~~~~l~~a 70 (590) |+- .+ -||+|++-+......+++...-+.++.....|||+++++.|++ ..|+.+.||... . .+.... T Consensus 3 magg~~~~~~K~~PG~Y~n~~~~~~~~~~~~~rGi~a~p~~~~wGp~~~v~~i~~~~~~~~~~~~~G~~~-~--~~~~~~ 79 (436) T protein:vir:78 3 LGGGTFVTQNKVLPGSYINFVSATRATSSLSDRGIVAMPLELDWGIDEEVFQVTSDDFEKYSTKYFGYDY-T--HEKLKG 79 (436) T ss_pred ccceeeccceeecCceEEEEEecCcceeeccCCeEEEEEEEecCCCCceeEEeecccchHHHHHHhcCcc-c--hHHHHH Confidence 552 22 4999999887666667777777777888889999999999998 568999999531 2 233446 Q ss_pred HHHHHcCCCcEEEEEEecCCccccccccccceeecccccccceeeeeeccccccccccceEEEeeccccCCcceeeEeec Q lcl|NC_016163. 71 ILNWLQSGGTAYVLRVMPDDAKFANSLISIKTTAAADPAKATVLVTAKAQTTNTASKNAMKTILSGGTAGETPLCFIVPK 150 (590) Q Consensus 71 v~~ff~nGG~~~vvRv~~~~a~~a~~~~~~~~~~a~~~~~~~~~v~~~~~~~~~a~~~~~~~~~~~~t~~~~~~~~~~~~ 150 (590) ++..|.+...+|+.|+..+.++.+. + ..+. 
T Consensus 80 l~~~~~~~~tv~~yrl~~G~~a~~~-------------------v-------------------------------~~Ak 109 (436) T protein:vir:78 80 LRDLFKNIRLGYFYKLNKGVKASCS-------------------I-------------------------------ATAR 109 (436) T ss_pred HHHHhcCCCEEEEEECCCcceeeee-------------------e-------------------------------eeee Confidence 7778877767999998432111000 0 0112 Q ss_pred ccccccccceEEEEeeccccccccccccceeeeeecccCCCceeeeeeeeeccccccccccccceeeeeeccccceeeec Q lcl|NC_016163. 151 GRGENYNGYGFRLSLRSDYDNTYNFRTYNLSVTVKDSTGADVVVEGPYIVSFDPEAKDKSRQSIYYANIINKYSQYVEIV 230 (590) Q Consensus 151 ~~g~~~~~~~~~~~~~~~~~~~~~~~~~~l~i~v~d~~~~~~v~e~~~~ls~~~da~~~~~~~~~~~~vv~~~s~~v~~~ 230 (590) ..|..++.+.+.+....+....+ .+.... +...+ +... ...+++. . T Consensus 110 y~g~~gn~i~v~v~~~~~d~~~~-------dv~~~~--g~~~~-----------d~~~--------~~~~~~l------~ 155 (436) T protein:vir:78 110 CSGIRGNDLKVIVTTNIDDNAKF-------DVVTLL--DNKKV-----------DTQI--------AKVITEL------Q 155 (436) T ss_pred cCCCCCcEEEEEecccccccCce-------EEEEEe--cchhh-----------hhhh--------HHHHhhc------c Confidence 22222222222221111111100 000000 00000 0000 0000000 0 Q ss_pred CccccceeeeeecccccccCccccceecccccccccccccccccccceeecccccccccccccccccceeeeeccccccc Q lcl|NC_016163. 231 DNRSAFETISEFVVGDSEADPQKVDIIFGQERAVTPAETIHANVVWKSSSVETDDPSYDATAANFNNIQYLTEGSEGTWT 310 (590) Q Consensus 231 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 310 (590) ..++..... .......+...++ T Consensus 156 ------------------------------------------~n~~V~~~~----------------~g~la~~a~~~Lt 177 (436) T protein:vir:78 156 ------------------------------------------DNDYVTWKK----------------EATLEATAGLTFT 177 (436) T ss_pred ------------------------------------------CCceEEEEe----------------cccccccceeeee Confidence 000000000 0001112223456 Q ss_pred cccccceeccchhhHHHHHHHhhhccCCceeeecccchhHHHHHHHHHHHHhcCC----eEEEEecCCCCCHHHHHHHHH Q lcl|NC_016163. 
311 GGNEESALLVKGYSGVLAPEILDKQQYEIDVLLDGNNEVAVKNAMSDLCSEQRGD----CIAILDCSFQGDAQQTIDYRT 386 (590) Q Consensus 311 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~----~~a~~d~p~~~~~~~~~~~~~ 386 (590) +|.++..++..++... +...+....+.++.+..+.+++..+.++|.+.|.+ .-+++......+.+.++. T Consensus 178 GG~dG~~~T~~dy~~a----l~~le~~~fn~l~~~~~d~~~~~~~~a~ikr~re~~g~~~~aV~~~~~~~d~EgIIn--- 250 (436) T protein:vir:78 178 NGTNGEAVTGTEYQAF----LDKIESYSFNALGCLATTAEIKSLFVEFTKRMRDKVGAKFQTVLYKKNDADYEGVVS--- 250 (436) T ss_pred ccccccccchHHHHHH----HHHHcccceeEEEecCCChHHHHHHHHHHHHHHhhcCCeEEEEecCCCCCCCceEEE--- Confidence 6666665554444433 33345566778887777888899999998766522 234432211222221111 Q ss_pred hhcCcccceEEEEcCeEEEeecccCcee-eecHHHHHHHHHHHhhccCCceECcCCcccceeeccccceeecChhHHhhh Q lcl|NC_016163. 387 GNISMSTYFTAIFGQHMNVYDEYNGETI-TVTSTYFLASMIPSNDDQNGIQWTFVGPRRGVISGFTDINFYPNEPWKEKL 465 (590) Q Consensus 387 ~~~~~~s~~~~~~~p~~~~~d~~~~~~~-~~ppsg~~AG~~A~~D~~~G~~~sPan~~~~~i~g~~~~~~~~~~~e~~~L 465 (590) +.. ...+... .--..+++||++|..+ +.+|+-|. .+.++.++...+++.|.+.+ T Consensus 251 ------------v~n------~v~g~~~~~~~~~a~vAG~~Ag~~----~~~S~T~~---~~~~~~~v~~~~t~~e~~~a 305 (436) T protein:vir:78 251 ------------VEN------KIKDTGLLESSLIYWTTGAIAGCD----INKSNTNK---RYDGEFDVDVNYTQIHLEEA 305 (436) T ss_pred ------------eec------ccCCceechhHHHHHHHHHHhcCc----cccCccce---ecCccccccccCCHHHHHHH Confidence 111 1112111 0114688999999765 45577774 47778888888999999999 Q ss_pred hhcCceEEEEecCCeEEEe-cceec---C--CCcccceehhhhHHHHHHHHHHHHHH-HHhcC-CCCHHHHHHHHHHHHH Q lcl|NC_016163. 466 YLAQVNYIERDPKKISFAT-QLTSQ---T--SRSALSYINNVRVLLRIRREVEKMMA-DYRQE-FQDNTTYDSMSYSLNN 537 (590) Q Consensus 466 n~~gIn~i~~~~~~G~~~w-G~rT~---s--~d~~~~~i~vrR~~~~i~~si~~~~~-~~vfe-pn~~~l~~~v~~~i~~ 537 (590) ..+|.-++.+. ++++++- |=.|+ . .+..|+.|.++|++|+|.+.|++.+. .|+++ ||+..-|..++..|.. 
T Consensus 306 i~~G~lvl~~d-~~~v~I~~~VNTltt~~~~k~~~~~kI~vir~~D~i~~di~~~~~~~yiGKv~N~~dgr~~l~~~i~~ 384 (436) T protein:vir:78 306 LKTGKFIFHKV-GDEVHVLEDINTFVSFTDEKNDDFSSNQSVRVLDQIANDIATLFNTKYLGEVPNDKSGRISFWNDVVK 384 (436) T ss_pred HhCCeEEEEEe-CCeEEEEEccccceecCCCCCcchhhhhHHHHHHHHHHHHHHHhhhccccccCCCHHHHHHHHHHHHH Confidence 99999988764 5566555 32333 2 25689999999999999999999876 58996 6999999999999999 Q ss_pred HHHHHHhCCceEEEecCCCCCHHHhhCCEEEEEEEEEecCccceEEEEEEee Q lcl|NC_016163. 538 YLQQWVANRACSSISGTVYASDYDKQQSIARVKVELVFTGVIERIAIDLVVN 589 (590) Q Consensus 538 ~L~~l~~~ga~~~~~d~~~nt~~~i~~G~l~~~i~~ap~~paefi~~~~~~~ 589 (590) ||++|.++|+|..+-+.+.--...-....+++.+.+.|+..||+|.+++.++ T Consensus 385 yl~~L~~~g~I~~f~~~Dv~v~~~~~~~~v~v~~~v~pvdamekiy~ti~v~ 436 (436) T protein:vir:78 385 HHEQLQNMRAIEDFKADDVSVEPGSDKKTVVVSDAVKVISAMSKLYMTVSVS 436 (436) T ss_pred HHHHHHhCCcccCCCCcceEEeecCCCCEEEEEEEEEEEEeeeeEEEEEEEC Confidence 9999999999975432221111112356788999999999999999999999 No 49 >protein:vir:102359 Length: 356 # NCBI annotation: XkdK protein # Family: family:all:632 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529565;genbank:gi:90592650;genbank:GeneID:3974491 Probab=99.22 E-value=3.8e-12 Score=83.18 Aligned_cols=322 Identities=12% Similarity=0.054 Sum_probs=161.6 Q ss_pred eeccccccccccccceeeeeeccccceeeecCccccceeeeeecccccccCccccceeccccccccccccccccccccee Q lcl|NC_016163. 200 VSFDPEAKDKSRQSIYYANIINKYSQYVEIVDNRSAFETISEFVVGDSEADPQKVDIIFGQERAVTPAETIHANVVWKSS 279 (590) Q Consensus 200 ls~~~da~~~~~~~~~~~~vv~~~s~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 279 (590) ...+|...- .+.......--.+..+ ............ .......+. .... .......+.. 
T Consensus 1 ~~glp~i~i----------~f~~~a~ta~~~g~rG----iv~~il~d~~~~---~~~~~~~~~-v~~~-~~~~n~~~i~- 60 (356) T protein:vir:10 1 MAGLVNINI----------EFKELATSFIQRSKAG----IVAIILKDTTKM---YKELTSEDD-IPIS-LSADNKKYIK- 60 (356) T ss_pred CCCCCceeE----------EEeecceeeccCCccc----eEEEEEecCCcc---eeEEecccc-chhH-HHHHHHHHHH- Confidence 111111100 0000000000000000 000000000000 000000000 0000 0000000000 Q ss_pred ecccccccccccccccccceeeeeccccccccccccceeccchhhHHHHHHHhhhccCCceeeecccchhHHHHHHHHHH Q lcl|NC_016163. 280 SVETDDPSYDATAANFNNIQYLTEGSEGTWTGGNEESALLVKGYSGVLAPEILDKQQYEIDVLLDGNNEVAVKNAMSDLC 359 (590) Q Consensus 280 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~ 359 (590) ..+ . +...+.........+....+ +..++ ...+...+....+.++.+..+.+.+..+.+++ T Consensus 61 ------~~~-~--g~~~~~~~~~p~~~~~~~~~------t~~~y----~~aL~~le~~~fn~l~~~~~d~~~~~~~~a~i 121 (356) T protein:vir:10 61 ------YGF-V--GATDNEKVLRPSKVIISTFT------EDGKV----EDILEELESVEFNYLCMPEAIEAEKTKIVTWI 121 (356) T ss_pred ------HHh-h--ccccccccccceeeeeeccc------CchhH----HHHHHHhcCccceEEEecCCChHHHHHHHHHH Confidence 000 0 00000000000000011111 11121 22233345566677777777778888888888 Q ss_pred HHhcC---CeEEEEecCCCCCHHHHHHHHHhhcCcccceEEEEcCeEEEeecccCcee-eecHHHHHHHHHHHhhccCCc Q lcl|NC_016163. 360 SEQRG---DCIAILDCSFQGDAQQTIDYRTGNISMSTYFTAIFGQHMNVYDEYNGETI-TVTSTYFLASMIPSNDDQNGI 435 (590) Q Consensus 360 ~~~~~---~~~a~~d~p~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~-~~ppsg~~AG~~A~~D~~~G~ 435 (590) .+.|. ..+..+-.....+.+.++.+ .-.+. + ++... .--.++++||++|... . 
T Consensus 122 kr~r~~~~~~~~~V~~~~~aD~EgIInv---------------~n~~~-~---~g~~~t~~~~~~~vAG~~Ag~~----~ 178 (356) T protein:vir:10 122 KKIREEESTEAKAVLANIKADNEAIINF---------------TENVV-V---DGEEITAEKYTTRVASLIASTP----N 178 (356) T ss_pred HHHHhcCCcEEEEEecCCCCCCceeEEe---------------ecCeE-e---cceeechhHHHHHHHHHHhccc----h Confidence 76552 23444433222222222111 11111 1 11111 1123679999999776 4 Q ss_pred eECcCCcccceeeccccceeecChhHHhhhhhcCceEEEEecCCeEEEe-cceec---C--CCcccceehhhhHHHHHHH Q lcl|NC_016163. 436 QWTFVGPRRGVISGFTDINFYPNEPWKEKLYLAQVNYIERDPKKISFAT-QLTSQ---T--SRSALSYINNVRVLLRIRR 509 (590) Q Consensus 436 ~~sPan~~~~~i~g~~~~~~~~~~~e~~~Ln~~gIn~i~~~~~~G~~~w-G~rT~---s--~d~~~~~i~vrR~~~~i~~ 509 (590) .+|+-|.. +.++.... .+++.|.+.+-.+|--++.+. ++.+++- |=-|+ . .+..|+.|.+.|+++.|.+ T Consensus 179 n~S~T~~~---~~~~~~~~-~~t~~e~~~ai~~G~lvl~~d-~~~V~I~~~VNSltt~t~~k~~~f~Kirvvr~~D~i~~ 253 (356) T protein:vir:10 179 TQSITYAP---LDEVESIV-KIDKASADAKVQAGELILRRL-SGKIRIARGINSLTTLTAEKGEIFQKIKLVDTKDLISK 253 (356) T ss_pred hcccccee---cCCccccc-cCCHHHHHHHHhCCeEEEEEE-cCeEEEEecCccceecCCCCCcchhhhHHHHHHHHHHH Confidence 45666643 44544333 588999999999999998765 4455554 43333 2 2457999999999999999 Q ss_pred HHHHHHH-HHhcC-CCCHHHHHHHHHHHHHHHHHHHhCCceEEEec------------------CCCCCHHHhhC----C Q lcl|NC_016163. 510 EVEKMMA-DYRQE-FQDNTTYDSMSYSLNNYLQQWVANRACSSISG------------------TVYASDYDKQQ----S 565 (590) Q Consensus 510 si~~~~~-~~vfe-pn~~~l~~~v~~~i~~~L~~l~~~ga~~~~~d------------------~~~nt~~~i~~----G 565 (590) .|++... .|+++ ||+..-|..+...|..||.+|.++|+|.-.++ .++|+...+.+ - T Consensus 254 Di~~~f~~~yiGKv~N~~dgr~~l~~ai~~y~~~L~~~~~I~~~~~~eid~e~q~~~~~~~g~d~~~~~d~~v~~~~~~~ 333 (356) T protein:vir:10 254 DIKNIYVEKYLRKCPNTYDNKCLFIVAVQSYLTELAKQELIDSNFTVEIDLEKQKEYLEGKKIAVSKMKENEIKEANTGS 333 (356) T ss_pred HHHHHHhhccccccCCCHHHHHHHHHHHHHHHHHHHhCCccccCceeEecccchHHHhhhccccccccccceeecccCCc Confidence 9999986 59999 59999999999999999999999999953222 22333333333 3 Q ss_pred EEEEEEEEEecCccceEEEEEEe Q lcl|NC_016163. 
566 IARVKVELVFTGVIERIAIDLVV 588 (590) Q Consensus 566 ~l~~~i~~ap~~paefi~~~~~~ 588 (590) .+.+.+.+.|+-.||.|.+++.+ T Consensus 334 ~v~~~~~v~~vdamE~iy~ti~v 356 (356) T protein:vir:10 334 NGFYLINLKLVDAMEDINIRVQM 356 (356) T ss_pred EEEEEEEEEEEeeeeeEEeEEeC Confidence 57799999999999999999999 No 50 >protein:vir:3788 Length: 376 # NCBI annotation: tail sheath # Family: family:all:669 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536828;genbank:gi:17981837;genbank:GeneID:929216 Probab=98.75 E-value=6.5e-09 Score=65.48 Aligned_cols=335 Identities=11% Similarity=0.039 Sum_probs=157.9 Q ss_pred eeeee-eccccceeeecCccccceeeeeecccccccCccccceecccccccccccccccccccceeeccccccc-ccccc Q lcl|NC_016163. 215 YYANI-INKYSQYVEIVDNRSAFETISEFVVGDSEADPQKVDIIFGQERAVTPAETIHANVVWKSSSVETDDPS-YDATA 292 (590) Q Consensus 215 ~~~~v-v~~~s~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~ 292 (590) .|+.| +|..... .+......-.-.+++........ .+.+.. . .+.....+..+... ....+ T Consensus 1 ~~~~v~vn~~n~~---~g~~~~~er~~Lfig~~~~~~~~--~~~~~~----------~--sdld~~lg~~~~~lk~~v~a 63 (376) T protein:vir:37 1 MFPSVQINALNQL---SGETKEIERHALFVGVGTTNQGK--LLALTP----------D--SDFDKVFGETDTDLKKQVRA 63 (376) T ss_pred CCCeEEEeccccc---CCCcccccceEEeeccccccccc--eeeecC----------c--cchHhhhCCCchHHHHHHHH Confidence 11111 2211110 01100000000111110000000 000000 0 00000000000000 00000 Q ss_pred cccccceeeeeccccccccccccceecc-chhhHHHHHHHhhhccCCceeeecc-cchhHHHHHHHHHHHH----hcCCe Q lcl|NC_016163. 293 ANFNNIQYLTEGSEGTWTGGNEESALLV-KGYSGVLAPEILDKQQYEIDVLLDG-NNEVAVKNAMSDLCSE----QRGDC 366 (590) Q Consensus 293 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~a~~~~~~~----~~~~~ 366 (590) +..++.. +.... ...+.. .+....++...........-.++.+ ..+.+.+.++...+++ ..+.. 
T Consensus 64 a~~naG~--------~~~~~--~~~~~~~~~~~~~Av~~a~~~~s~E~V~v~~pv~t~~a~i~aa~~~a~el~~~~~Rpv 133 (376) T protein:vir:37 64 AMLNAGQ--------NWFAH--VYIAQEDGYDFVECVKKANQTASFEYCVNTRYLGVDKASIGKLQECYAELLAKFGRRT 133 (376) T ss_pred HHhCCCC--------cEEEE--EEeecCCchHHHHHHHHhhhhcCceEEEEeccccccHHHHHHHHHHHHHHHHhcCCeE Confidence 0001000 00000 000111 1112223322222222222233333 2344444444444433 34568 Q ss_pred EEEEecCC-CCC---HHHHHHHHHhh----cCcccceEEEEcCeEEEeecccCceeeecHHHHHHHHHHHhhccCCceEC Q lcl|NC_016163. 367 IAILDCSF-QGD---AQQTIDYRTGN----ISMSTYFTAIFGQHMNVYDEYNGETITVTSTYFLASMIPSNDDQNGIQWT 438 (590) Q Consensus 367 ~a~~d~p~-~~~---~~~~~~~~~~~----~~~~s~~~~~~~p~~~~~d~~~~~~~~~ppsg~~AG~~A~~D~~~G~~~s 438 (590) |.++.++. ..+ .++..+|.... .++.+.+..+. |- .+ | -.-|.+||.+++. ..-++.| T Consensus 134 ~file~r~~~~~~~~~e~w~~y~~~~~al~~gia~~~V~~V-~~--~~----g-----n~~G~~aGRl~~a--aVsVads 199 (376) T protein:vir:37 134 FFIQAVQGINHDQSDGETWDQYVQKLTTLQQTIVADHVCLV-PL--LF----G-----NETGVLAGRLANR--AVTVADS 199 (376) T ss_pred EEEEeccCcCcccccccCHHHHHHHHHHhhcccccccceee-ee--eh----h-----hhHHHHHHHHhhc--ccchhhC Confidence 88888762 111 12222232221 12233322211 10 00 1 1258888888643 3337889 Q ss_pred cCCcccceeecccccee-------ecChhHHhhhhhcCceEEEEecCC-eEEEecceecCC-CcccceehhhhHHHHHHH Q lcl|NC_016163. 439 FVGPRRGVISGFTDINF-------YPNEPWKEKLYLAQVNYIERDPKK-ISFATQLTSQTS-RSALSYINNVRVLLRIRR 509 (590) Q Consensus 439 Pan~~~~~i~g~~~~~~-------~~~~~e~~~Ln~~gIn~i~~~~~~-G~~~wG~rT~s~-d~~~~~i~vrR~~~~i~~ 509 (590) |+.+..+.+.|...++. .++.+.++.|..+|-.+.+.++|+ |+.+-..|||+. 
.++|+||..+|..+-+.| T Consensus 200 pgRV~tG~l~gl~~~~lp~d~~~~~l~~a~l~aLd~agy~vp~~Y~gy~G~Y~~d~~tl~~~gsDY~~ie~~RVvdKa~R 279 (376) T protein:vir:37 200 PARVQTGALVSLGSANKPLDKDRNELTLAHLKSLETARYSVPMWYPDYDGYYWADGRTLDVEGGDYQVIENLRVVDKVAR 279 (376) T ss_pred ccceeccccccccccccccCcCcccCCHHHHHHHHhCCCeEEEeeCCCCceEEeCceEeccCCCChhhhhhhhHHHHHHH Confidence 99888777777643332 457788999999999999999985 664446799876 479999999999999988 Q ss_pred HHHHHHHHHhcCCC---CHHHHHHHHHHHHHHHHHHHhCCceEEE-----ecCC---CCCHHHhhCCEEEEEEEEEecCc Q lcl|NC_016163. 510 EVEKMMADYRQEFQ---DNTTYDSMSYSLNNYLQQWVANRACSSI-----SGTV---YASDYDKQQSIARVKVELVFTGV 578 (590) Q Consensus 510 si~~~~~~~vfepn---~~~l~~~v~~~i~~~L~~l~~~ga~~~~-----~d~~---~nt~~~i~~G~l~~~i~~ap~~p 578 (590) .++...-..+...- ++.-.+..+.-+..-|++|-+..-+.+. +... .-++.-+...++.|.+-+.|..- T Consensus 280 ~vR~~ai~~i~D~~lnst~~sia~~~~yi~~pLr~M~~s~~i~g~~fpGeI~~p~d~Di~i~w~s~~~V~I~~~v~P~~~ 359 (376) T protein:vir:37 280 KVRLLAIGKIADRSFNSTTSSTEYHKNYFAKPLRDMSKSATINGKDFPGECMPPKDDAITIVWQSKTKVTIYIKVRPYDC 359 (376) T ss_pred HHHHHHHHHhCCcccCcchhhHHHHHHHHHHHHHHHHhcchhccccccceeecCCCCCceEEeeccceEEEEEEEEeccC Confidence 88776666554422 3334455555577778888776655442 3111 11222347788999999999999 Q ss_pred cceEEEEEEeeC Q lcl|NC_016163. 579 IERIAIDLVVNK 590 (590) Q Consensus 579 aefi~~~~~~~~ 590 (590) ..+|+..|..+= T Consensus 360 pk~Itv~I~Ldl 371 (376) T protein:vir:37 360 PKEITANIFLDL 371 (376) T ss_pred CceEEEEEEeec Confidence 999998877766 No 51 >protein:vir:489 Length: 498 # NCBI annotation: putative sheath protein # Family: family:all:369 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543096;swissprot:trembl:q8w623;genbank:gi:18249908;uniprot:Q8W623;genbank:GeneID:929698 Probab=98.73 E-value=7.7e-08 Score=59.57 Aligned_cols=433 Identities=12% Similarity=0.018 Sum_probs=197.6 Q ss_pred Ccc-ccCCceEEEEecCCCceecccccceeEEEee-c--CCCCCCccEEecCHHHHHHhcCCccccccccHHHHHHHHHc Q lcl|NC_016163. 
1 MAD-YLHPSVSSRIVDNSAVYATAAGNTVLYAAIH-S--AIGRDNAVEFVTTTDEFLFKFGNPNLSKYGQTSYNILNWLQ 76 (590) Q Consensus 1 Mp~-yl~PGVYveEi~s~~~~i~gv~Tsv~~~vg~-~--~~Gp~~~p~~v~s~~e~~~~fG~~~~~~~~~l~~av~~ff~ 76 (590) .|. ++-||+|+|=-.+.+ ..+.++---.++|- . -..+.++|++|+|-+|-...||. .|.+..-++.|.+ T Consensus 8 IP~~iRvP~~y~E~dns~A--~~~~~~qrvLiiGq~la~gt~~~~~~v~v~s~~~a~~~fG~-----GS~l~~M~~a~~~ 80 (498) T protein:vir:48 8 VPSDTLVPLFYAEMDNSAA--NTAVTSAPALLIGHASNDAAIEVNSLVLMPSADYARQICGA-----GSQLARMVDVYRQ 80 (498) T ss_pred cCcccccceEEEEEecCCC--ccccCCcceEEEeecCccccccccceEEecCHHHHHHhcCc-----ccHHHHHHHHHHH Confidence 564 888999999544444 33444433333332 2 23467999999999999999994 2667777888887 Q ss_pred CCC--cEEEEEEecCCccccccccccceeecccccccceeeeeeccccccccccceEEEeeccccCCcceeeEeeccccc Q lcl|NC_016163. 77 SGG--TAYVLRVMPDDAKFANSLISIKTTAAADPAKATVLVTAKAQTTNTASKNAMKTILSGGTAGETPLCFIVPKGRGE 154 (590) Q Consensus 77 nGG--~~~vvRv~~~~a~~a~~~~~~~~~~a~~~~~~~~~v~~~~~~~~~a~~~~~~~~~~~~t~~~~~~~~~~~~~~g~ 154 (590) +-- .+|++=+.+ .+.. ...+.+..+.......+..+.+.++ .+.+....+..+......+.. ... T Consensus 81 ~n~~~~l~~i~~~D-~ag~--aA~g~it~tg~at~~G~l~l~Igg~--------~v~v~V~~gdTaa~vA~al~a--ai~ 147 (498) T protein:vir:48 81 TDPFGELYVIAVPE-ARGA--AATVRVTVTGEAEESGTLSLYVGRS--------SVQVPVVNGDDATAVATAIKE--AVN 147 (498) T ss_pred hCCCceeEEEeeCC-cccc--eeEEEEEecccccCCceEEEEECCE--------EEEEeecCCCCHHHHHHHHHH--HHh Confidence 764 599998843 1211 1111111111111111111221110 011111111100000000000 000 Q ss_pred ccccceEEEEeeccccccccccccceeeeeecccCCCceeeeeeeeeccccccccccccceeeeeeccccceeeecCccc Q lcl|NC_016163. 155 NYNGYGFRLSLRSDYDNTYNFRTYNLSVTVKDSTGADVVVEGPYIVSFDPEAKDKSRQSIYYANIINKYSQYVEIVDNRS 234 (590) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~l~i~v~d~~~~~~v~e~~~~ls~~~da~~~~~~~~~~~~vv~~~s~~v~~~~~~~ 234 (590) ......+.. . .+.....++...+...+ +.+.... +|+.... 
+ T Consensus 148 a~~~lPVTA--~------~~~~~VtlTAr~kG~~G-N~I~l~~----------------~~~~~~~----------g--- 189 (498) T protein:vir:48 148 GVITLPFAA--S------SDAGVVTLTARHKGLYG-NELPVCL----------------NYYGSGG----------G--- 189 (498) T ss_pred CCCCcceEE--E------ecCcEEEEEeeeccccc-ccceeee----------------eeccCcc----------c--- Confidence 000000000 0 00011112222221111 1111000 0000000 0 Q ss_pred cceeeeeecccccccCccccceecccccccccccccccccccceeecccccccccccccccccceeeeeccccccccccc Q lcl|NC_016163. 235 AFETISEFVVGDSEADPQKVDIIFGQERAVTPAETIHANVVWKSSSVETDDPSYDATAANFNNIQYLTEGSEGTWTGGNE 314 (590) Q Consensus 235 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 314 (590) +..+....+. ...++++.-.+ T Consensus 190 -------------e~~p~Glt~~------------------itamsgGag~P---------------------------- 210 (498) T protein:vir:48 190 -------------EILPAGLQVV------------------TEAGTAGSGAP---------------------------- 210 (498) T ss_pred -------------ccccceeeEE------------------EEcccCCccCc---------------------------- Confidence 0000000000 00000000000 Q ss_pred cceeccchhhHHHHHHHhhhccCCceeeecccchhHHHHHHHHHHHH--------hcCCeEEEEecCCCCCHHHHHHHHH Q lcl|NC_016163. 315 ESALLVKGYSGVLAPEILDKQQYEIDVLLDGNNEVAVKNAMSDLCSE--------QRGDCIAILDCSFQGDAQQTIDYRT 386 (590) Q Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~--------~~~~~~a~~d~p~~~~~~~~~~~~~ 386 (590) +. .+.+........+.++.|..+.+-..++-+|+.+ ++++++++ .+...+..+...+-. T Consensus 211 -------Di----a~aLaal~~~~~~~I~~p~~D~asl~al~~~L~~~sgRw~~~~q~~g~~~--~a~~gT~~~l~t~g~ 277 (498) T protein:vir:48 211 -------DL----TAAVAAMGDEAFDFIGLPFNDAASINMMMTEMNDSSGRWSYARQLYGHVY--TAKLGTLSELVNAGD 277 (498) T ss_pred -------ch----HHHHHhhccCCccEEEEeecCHHHHHHHHHHHhhhhhhhhHHhhcCeEEE--EeccCCHHHHHHhhh Confidence 00 0011111222233455566666666666666643 22345544 445567888877764 Q ss_pred hhcCcccceEEEEcCeEEEeecccCceeeecHH---HHHHHHHH---HhhccCCceECcCCcccceeeccc--cceeecC Q lcl|NC_016163. 
387 GNISMSTYFTAIFGQHMNVYDEYNGETITVTST---YFLASMIP---SNDDQNGIQWTFVGPRRGVISGFT--DINFYPN 458 (590) Q Consensus 387 ~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~pps---g~~AG~~A---~~D~~~G~~~sPan~~~~~i~g~~--~~~~~~~ 458 (590) . .++.|..+.+. ++.. .-|+. +..|++.| +.|..| |=|. -.+.|+. ++...++ T Consensus 278 ~---~N~~~it~~~~--------~~~~-~~p~~~~AAa~a~~aA~~l~~DPAr-----PLqt--l~L~Gi~~p~~~~r~~ 338 (498) T protein:vir:48 278 M---HNQQHITLAGY--------EKET-QSPVDELVASRLAREAVFIRNDPAR-----PTQT--GELVGMLPAPKGKRFI 338 (498) T ss_pred c---cCCceEEEEec--------CCCC-CChHHHHHHHHHHHHHHhhhccccc-----cccc--eeeeccccCCchhcCC Confidence 3 46776654431 1111 12332 34444444 556555 2221 3466775 4556678 Q ss_pred hhHHhhhhhcCceEEEEecCCeEEEecceec-------CCCcccceehhhhHHHHHHHHHHHHHHHH-hcCCCCHH---- Q lcl|NC_016163. 459 EPWKEKLYLAQVNYIERDPKKISFATQLTSQ-------TSRSALSYINNVRVLLRIRREVEKMMADY-RQEFQDNT---- 526 (590) Q Consensus 459 ~~e~~~Ln~~gIn~i~~~~~~G~~~wG~rT~-------s~d~~~~~i~vrR~~~~i~~si~~~~~~~-vfepn~~~---- 526 (590) ..|+|.|.-.||.++.- .++-..+--..|. ..|+.|..|+..|+.+|+.+.++...... --+.+-+. T Consensus 339 ~~ern~LL~~Gist~~V-~~G~V~I~R~ITTY~~n~~G~~D~syLDi~T~~tl~yvr~~~r~~i~~kfpR~KLa~dg~~~ 417 (498) T protein:vir:48 339 MTEQQTLLSHGVATAYV-EGGTLRIQRSVTTYKKNAYGVADNSYLDSETLHTSAYVLRKLKSVITSKYGRHKLANDGTRF 417 (498) T ss_pred hHHHHHHHhcCcceEEE-cCCeEEEEeeeeeeeecCCCCcchhhhhhhhHHHHHHHHHHHHHHhhhhcCCceecccCccc Confidence 99999999999999976 5543444444333 23788999999999999999999877642 22222222 Q ss_pred -------HHHHHHHHHHHHHHHHHhCCce----------EEEecCCCCCHHHhhCCEEEEEEEEEecCcc----ceEEEE Q lcl|NC_016163. 527 -------TYDSMSYSLNNYLQQWVANRAC----------SSISGTVYASDYDKQQSIARVKVELVFTGVI----ERIAID 585 (590) Q Consensus 527 -------l~~~v~~~i~~~L~~l~~~ga~----------~~~~d~~~nt~~~i~~G~l~~~i~~ap~~pa----efi~~~ 585 (590) +-..||..+-+-++.|..+|-+ .+-.|.++ ..||.+.+-.-.+-+. 
-.|.|+ T Consensus 418 ~~gq~IvTp~~ir~eli~~y~~le~~given~~~~~~~LiVerd~~d-------pnRln~~~p~d~vn~L~V~A~~~~f~ 490 (498) T protein:vir:48 418 GPGQAIVTPAVIKGELLATYRQMERAGIVENYDLFKQYLIVERDADN-------PNRLNTLFPPDYVNQLRVFAVVNQFR 490 (498) T ss_pred CCCCcccchHHHHHHHHHHHHhhhhhccccChhhhcceeEEEECCCC-------CcEEEEEecccccCchhhhhhhhhhh Confidence 6678999988888888887743 23333332 1344443332222222 112222 Q ss_pred EEeeC Q lcl|NC_016163. 586 LVVNK 590 (590) Q Consensus 586 ~~~~~ 590 (590) +..+. T Consensus 491 lq~~~ 495 (498) T protein:vir:48 491 LQYSE 495 (498) T ss_pred hhhhh Confidence 22222 No 52 >protein:vir:276 Length: 369 # NCBI annotation: putative tail sheath protein # Family: family:all:669 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536656;genbank:gi:17975134;genbank:GeneID:929090 Probab=98.65 E-value=1.5e-07 Score=58.05 Aligned_cols=333 Identities=11% Similarity=0.038 Sum_probs=159.8 Q ss_pred ccceeeecCccccceeeeeec-ccccccCccccceecccccccccc---cccccccccceeecccccccc-ccccccccc Q lcl|NC_016163. 223 YSQYVEIVDNRSAFETISEFV-VGDSEADPQKVDIIFGQERAVTPA---ETIHANVVWKSSSVETDDPSY-DATAANFNN 297 (590) Q Consensus 223 ~s~~v~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~ 297 (590) .+ ..... +.... ........+-..+..+...+.... ...-...+.....+..+...- ...++..++ T Consensus 1 m~--------~~~V~-in~~n~~qg~~~~ver~~lfig~g~~~~~~g~~~~~~~~sdld~~lg~~ds~lk~~v~aa~~na 71 (369) T protein:vir:27 1 MA--------WPTVI-IKILNLMNGPIADIECHFLFVIRGTVSGEVRNLIMVDSTSDLDDVLAEASAEGLAIVKAAQLNG 71 (369) T ss_pred CC--------CCceE-EecccccCCCcccccceEEEEEeccccccccceEEecCccchHhhcCCcChhHHHHHHHHHhCC Confidence 00 00000 00000 000111111111111111000000 000000000000000000000 000000000 Q ss_pred ceeeeeccccccccccccceec-cchhhHHHHHHHhhhccCCceeeecccchhHHHHHHHHHH----HHhcCCeEEEEec Q lcl|NC_016163. 298 IQYLTEGSEGTWTGGNEESALL-VKGYSGVLAPEILDKQQYEIDVLLDGNNEVAVKNAMSDLC----SEQRGDCIAILDC 372 (590) Q Consensus 298 ~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~----~~~~~~~~a~~d~ 372 (590) +..... ...+. 
..+....+++..........-.++.+..+++.+.++.... .+..+..|.++.+ T Consensus 72 --------G~~w~a---~~~p~~~~~~~~~Av~~a~~~~s~E~V~v~~p~t~~a~i~aaq~~a~el~~~~~R~vffi~e~ 140 (369) T protein:vir:27 72 --------KQAWTA---GVMILSEEDNWQDAVKKANEVSSFEFVVLGFDAETKAMIEDAITLRTELKNSLGREVGVLCQL 140 (369) T ss_pred --------CCceEE---EEEEeCCchhHHHHHHhhhhhCCccEEEEecCcccHHHHHHHHHHHHHHHHhcCCeEEEEEec Confidence 000000 01111 1122233333332222233333334434444444443333 3334567888875 Q ss_pred CC-CC---CHHHHHHHHH----hhcCcccceEEEEcCeEEEeecccCceeeecHHHHHHHHHHHhhccCCceECcCCccc Q lcl|NC_016163. 373 SF-QG---DAQQTIDYRT----GNISMSTYFTAIFGQHMNVYDEYNGETITVTSTYFLASMIPSNDDQNGIQWTFVGPRR 444 (590) Q Consensus 373 p~-~~---~~~~~~~~~~----~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~ppsg~~AG~~A~~D~~~G~~~sPan~~~ 444 (590) +. +. +-+...+|.. ...++.+.+..++--+... | .-.|.+||.++.. ..-++.+|+.+.. T Consensus 141 ~~~~~~~~~~e~w~dy~a~l~al~~g~a~~~V~vv~~~~~~-----g-----n~~G~~aGRl~n~--aVsIadsp~RVkt 208 (369) T protein:vir:27 141 PAINNDPTNGQTWSEWLADTVDIPKDVASEYISVVPNVHAA-----G-----DTLGKYAGRLANK--EVSIADSPARVQT 208 (369) T ss_pred cccCCCccccCCHHHHHHHHHHHhhccCcccceeeeeeccc-----c-----chHHHHHHHHHhc--ccchhcCcceeee Confidence 42 11 1122223322 2233456666555211111 1 1358888888752 2336889998877 Q ss_pred ceeeccccce-----eecChhHHhhhhhcCceEEEEecCC-eEEEecceecCC-CcccceehhhhHHHHHHHHHHHHHHH Q lcl|NC_016163. 445 GVISGFTDIN-----FYPNEPWKEKLYLAQVNYIERDPKK-ISFATQLTSQTS-RSALSYINNVRVLLRIRREVEKMMAD 517 (590) Q Consensus 445 ~~i~g~~~~~-----~~~~~~e~~~Ln~~gIn~i~~~~~~-G~~~wG~rT~s~-d~~~~~i~vrR~~~~i~~si~~~~~~ 517 (590) +.+.|...+. ..++.+.+..|..+|..+.+.++|. |+.+-..||+.. .++|+||..+|..+-+.|.++...-. 
T Consensus 209 G~l~g~~~~p~d~~g~~l~~a~l~aLd~agysvp~~Y~gy~G~Yw~d~~tl~~~gsDYq~iE~~RVvdKa~R~vR~~Ai~ 288 (369) T protein:vir:27 209 GSVLGNTELMKDKAGKALDLATLKALESNRIAVPMWYPDYPGQYWTTGRTLDVPGGDYQDIRHIRVAMKAARKVRIRAIA 288 (369) T ss_pred cccccccccccCCCCcccCHHHHHHHHhCCCeEEEeeCCCCceEEeCceEeccCCCCeehhhhhhHHHHHHHHHHHHHHH Confidence 7777764332 2356678899999999999999985 664445799876 47999999999999888877765554 Q ss_pred HhcCC---CCHHHHHHHHHHHHHHHHHHHhCCceEEEecCCCCCHHHh-----hCCEEEEEEEEEecCccceEEEEEEee Q lcl|NC_016163. 518 YRQEF---QDNTTYDSMSYSLNNYLQQWVANRACSSISGTVYASDYDK-----QQSIARVKVELVFTGVIERIAIDLVVN 589 (590) Q Consensus 518 ~vfep---n~~~l~~~v~~~i~~~L~~l~~~ga~~~~~d~~~nt~~~i-----~~G~l~~~i~~ap~~paefi~~~~~~~ 589 (590) .+-.| .++.-.+..+..+..=|++|.+.+ |.|-+..- ..+|| ...++.|-+-+.|.--...|+.+|..+ T Consensus 289 ~i~Dr~lnstp~sia~~~~~~~~pLr~M~ks~-fpgei~~P--~d~dI~i~w~~k~~V~I~~~vrP~~~pk~it~~I~ld 365 (369) T protein:vir:27 289 RIADRTLNSTPQSIAAAKLYFTQDLRTMALTG-VPGEIYPP--EDEDIQIKWVNSTDVEIYMSVQPYECPVKITIAISVK 365 (369) T ss_pred HhcCcccccChhHHHHHHHHHhhHHHHHHhhc-CCeEEecC--CCCceEEEeeccceEEEEEEEeeccCCceEEEEEEEe Confidence 44433 245566667777888888886653 55554211 01233 455777778888888888999999999 Q ss_pred C Q lcl|NC_016163. 590 K 590 (590) Q Consensus 590 ~ 590 (590) - T Consensus 366 l 366 (369) T protein:vir:27 366 Q 366 (369) T ss_pred c Confidence 9 No 53 >protein:vir:78782 Length: 370 # NCBI annotation: putative tail sheath protein # Family: family:all:669 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285652;genbank:gi:148727158;genbank:GeneID:5220132 Probab=98.55 E-value=1.8e-07 Score=57.61 Aligned_cols=334 Identities=12% Similarity=0.029 Sum_probs=152.9 Q ss_pred eeeee-eccccceeeecCccccceeeeeecccccccCccccceecccccccccccccccccccceeecccccccc-cccc Q lcl|NC_016163. 215 YYANI-INKYSQYVEIVDNRSAFETISEFVVGDSEADPQKVDIIFGQERAVTPAETIHANVVWKSSSVETDDPSY-DATA 292 (590) Q Consensus 215 ~~~~v-v~~~s~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~ 292 (590) .|+.| +|..... 
.+......-.-.+.+.... +...+..-+. ..+.....+..+.... ...+ T Consensus 1 ~~~~v~vn~~n~~---~g~~~~~er~~lfig~~~~---~~g~~~~~~~-----------~sdld~~l~~~ds~lk~~v~a 63 (370) T protein:vir:78 1 MWPYVQIYNLNQM---QGPVTEVERHLLFIGSAAS---NTGKLLSLNA-----------QSDFDQLLGAADSELKANLLA 63 (370) T ss_pred CCceEEEeecccc---CCCcCccceeEEEEecccc---cccceEeecC-----------ccCHHHhcCCcChhHHHHHHH Confidence 22222 2211110 0111000000011111000 0000000000 0000000000000000 0000 Q ss_pred cccccceeeeecccccccccccccee-ccchhhHHHHHHHhhhccCCceeeecccchhHHHHHHHHHHHH----hcCCeE Q lcl|NC_016163. 293 ANFNNIQYLTEGSEGTWTGGNEESAL-LVKGYSGVLAPEILDKQQYEIDVLLDGNNEVAVKNAMSDLCSE----QRGDCI 367 (590) Q Consensus 293 ~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~----~~~~~~ 367 (590) +..++..... . ...+ ...+....+++.+........-.++.+....+.+.++...+++ .++..| T Consensus 64 a~~naG~~~~--------~---~~~p~~~~~d~~~Av~~a~~~~s~E~V~v~~~~s~~a~~~a~~~~a~el~n~~~Rpv~ 132 (370) T protein:vir:78 64 ARDNAGQNWS--------A---AAYVLPTDKPWLDAARDAQQTQSFEGVVVLGQEWHQAAINAAHALNQELIAKWGRWQF 132 (370) T ss_pred HHhCCCCceE--------E---EEEEecCchhHHHHHHHHHhhCCccEEEEecCcchHHHHHHHHHHHHHHHHhcCCeEE Confidence 0001100000 0 0001 1112233333333222222222333443444555554444433 335678 Q ss_pred EEEecCCCCCHHHHHHHHHhh----cCcccceEEEEcCeEEEeecccCceeeecHHHHHHHHHHHhhccCCceECcCCcc Q lcl|NC_016163. 368 AILDCSFQGDAQQTIDYRTGN----ISMSTYFTAIFGQHMNVYDEYNGETITVTSTYFLASMIPSNDDQNGIQWTFVGPR 443 (590) Q Consensus 368 a~~d~p~~~~~~~~~~~~~~~----~~~~s~~~~~~~p~~~~~d~~~~~~~~~ppsg~~AG~~A~~D~~~G~~~sPan~~ 443 (590) .++.++.-.+-+...+|.... .++.+.+..++--|. +. .-|.+||.++.. .--++.+|.-+. 
T Consensus 133 file~~~~~~~e~w~~y~~~l~al~~gia~~~V~vvp~~~-------g~-----~~G~~aGRL~na--avsVadsP~Rv~ 198 (370) T protein:vir:78 133 MLLAVPAIADEQDWATYEAELATLQDGIAASSVSLIPQLW-------PT-----LAGAYAGRLCNR--AVSIADSPCRVK 198 (370) T ss_pred EEEeecCCCCcCCHHHHHHHHHHhhhccccccceEEeeec-------cc-----cHHHHHHHHhcC--eeeecccceeee Confidence 888875422222333333221 233444444442111 11 137788876532 223788898877 Q ss_pred cceeeccccc-----eeecChhHHhhhhhcCceEEEEecCC-eEEEecceecCC-CcccceehhhhHHHHHHHHHH-HHH Q lcl|NC_016163. 444 RGVISGFTDI-----NFYPNEPWKEKLYLAQVNYIERDPKK-ISFATQLTSQTS-RSALSYINNVRVLLRIRREVE-KMM 515 (590) Q Consensus 444 ~~~i~g~~~~-----~~~~~~~e~~~Ln~~gIn~i~~~~~~-G~~~wG~rT~s~-d~~~~~i~vrR~~~~i~~si~-~~~ 515 (590) .+.+.|...+ ...++.+.++.|..+|-.+.+.++|+ |+.+-..|||+. .+.|+||..+|..+-+.|.++ +++ T Consensus 199 tG~l~gl~~~p~d~~~~~l~~a~l~aLd~agy~vp~~Y~gy~G~Y~~d~~tl~~~gsDYq~ie~~RVvdKa~R~vR~~ai 278 (370) T protein:vir:78 199 TGALVGLGNKPVGKDGIPLPLATLQTLEANRYSVPMWYPDYDGIYWADGRTLDAEGGDYQVIENLRIAYKVARRMRLRAI 278 (370) T ss_pred ccccccccccccccCCcccCHHHHHHHHhCCCeEEEeeCCCCceEEeCceEeccCCCChhhhhhhhHHHHHHHHHHHHHH Confidence 7777665322 23467788999999999999999985 664445799876 479999999999999999888 555 Q ss_pred HHHhcCCCCHH--HHHHHHHHHHHHHHHHHhCCc-----eEEEecCC---CCCHHHhhCCEEEEEEEEEecCccceEEEE Q lcl|NC_016163. 516 ADYRQEFQDNT--TYDSMSYSLNNYLQQWVANRA-----CSSISGTV---YASDYDKQQSIARVKVELVFTGVIERIAID 585 (590) Q Consensus 516 ~~~vfepn~~~--l~~~v~~~i~~~L~~l~~~ga-----~~~~~d~~---~nt~~~i~~G~l~~~i~~ap~~paefi~~~ 585 (590) +...+|-.|+. ..+..+.-...=|+++-..+- |.|.+... .-++.-+...++.|-+.+.|..-...|+.+ T Consensus 279 ~~i~D~~lnst~gsia~~~~~~~~~L~ema~s~~i~~~~fpgeI~~p~d~Di~i~w~s~~~v~I~~~v~P~~~pk~Itv~ 358 (370) T protein:vir:78 279 ARIGDRSFNSTPGSTAAAITYFGKDLREMAKSTTINGQPFPGDIASPQDGDIRIQWVAKNLVSVFVVVRTVDCPKGITVN 358 (370) T ss_pred HHhCCcccCCCCcchhHHHHHHHhhHHHHHhhhhhcccccceeEeccCCCcceEEeeccceEEEEEEEEeccCCceEEEE Confidence 55555433221 112222233333444444443 33433211 112223477889999999998888889888 Q ss_pred EEeeC Q lcl|NC_016163. 
586 LVVNK 590 (590) Q Consensus 586 ~~~~~ 590 (590) |..+= T Consensus 359 I~LDl 363 (370) T protein:vir:78 359 IMLDL 363 (370) T ss_pred EEEee Confidence 87766 No 54 >protein:vir:3751 Length: 376 # NCBI annotation: orf23 # Family: family:all:669 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043492;genbank:gi:9628627;genbank:GeneID:1261129 Probab=98.49 E-value=2.1e-07 Score=57.20 Aligned_cols=335 Identities=11% Similarity=0.037 Sum_probs=163.4 Q ss_pred eeeee-eccccceeeecCccccceeeeeecccccccCccccceecccccccccccccccccccceeecccccccc-cccc Q lcl|NC_016163. 215 YYANI-INKYSQYVEIVDNRSAFETISEFVVGDSEADPQKVDIIFGQERAVTPAETIHANVVWKSSSVETDDPSY-DATA 292 (590) Q Consensus 215 ~~~~v-v~~~s~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~ 292 (590) .|+.| +|.... ..+......-.-.+.+........ .+.+.. ..+.....+..+.... ...+ T Consensus 1 ~~~~v~vn~ln~---~qg~~~~ver~~lfig~~~~~~~~--~~~~~~------------~sdld~~lg~~ds~lk~~v~a 63 (376) T protein:vir:37 1 MFPSVQINALNQ---LSGETKEIERHALFVGVGTTNQGK--LLALTP------------DSDFDKVFGETDTDLKKQVRA 63 (376) T ss_pred CCCeEEEeeeec---cCCCcccccceEEEeeccccccCc--eEEecC------------CCChHHhhCCCchhHHHHHHH Confidence 11111 111100 011111110001111111000000 000000 0000000000000000 0000 Q ss_pred cccccceeeeeccccccccccccceec-cchhhHHHHHHHhhhccCCceeeecc-cchhHHHHHHHHHHH----HhcCCe Q lcl|NC_016163. 293 ANFNNIQYLTEGSEGTWTGGNEESALL-VKGYSGVLAPEILDKQQYEIDVLLDG-NNEVAVKNAMSDLCS----EQRGDC 366 (590) Q Consensus 293 ~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~a~~~~~~----~~~~~~ 366 (590) +..++. .+.... ...+. ..+....+++.+.....+..-.++.+ ..+.+.+.++....+ +..+.. 
T Consensus 64 a~~naG--------~~w~a~--~~~p~~~~~~~~~Av~~a~~~~s~E~V~v~~p~~t~~a~i~a~qa~a~el~~~~~R~v 133 (376) T protein:vir:37 64 AMLNAG--------QNWFAH--VYIAQEDGYDFVECVKKANQTASFEYCVNTRYLGVDKASIGKLQECYAELLAKFGRRT 133 (376) T ss_pred HHhCCC--------CceEEE--EEecCCChhhHHHHHHHHHhhCCeeEEEEecCcchhHHHHHHHHHHHHHHHHhcCCeE Confidence 111100 000000 00111 11223334443332222233233333 233444444433322 234568 Q ss_pred EEEEecCC-CC---CHHHHHHHHHh----hcCcccceEEEEcCeEEEeecccCceeeecHHHHHHHHHHHhhccCCceEC Q lcl|NC_016163. 367 IAILDCSF-QG---DAQQTIDYRTG----NISMSTYFTAIFGQHMNVYDEYNGETITVTSTYFLASMIPSNDDQNGIQWT 438 (590) Q Consensus 367 ~a~~d~p~-~~---~~~~~~~~~~~----~~~~~s~~~~~~~p~~~~~d~~~~~~~~~ppsg~~AG~~A~~D~~~G~~~s 438 (590) |.++.++. +. +-+...+|.+. ..++.+.+..++- .+ -| -..|.+||.+++ ...-++.| T Consensus 134 ffile~~g~d~~~~~ge~w~~y~~~l~a~~~gia~~~V~vV~-~~------~g-----n~~G~~aGRl~n--aaVsVads 199 (376) T protein:vir:37 134 FFIQAVQGINHDQSDGETWDQYVQKLTTLQQTIVADHVCLVP-LL------FG-----NETGVLAGRLAN--RAVTVADS 199 (376) T ss_pred EEEEeccCCCCcccccCCHHHHHHHHHHHhccccccceeeee-ee------cc-----chHHHHHHHHHh--CCcchhcC Confidence 88888762 11 11222333322 2244555555441 11 11 136888998875 23346999 Q ss_pred cCCcccceeecccccee-------ecChhHHhhhhhcCceEEEEecCC-eEEEecceecCC-CcccceehhhhHHHHHHH Q lcl|NC_016163. 439 FVGPRRGVISGFTDINF-------YPNEPWKEKLYLAQVNYIERDPKK-ISFATQLTSQTS-RSALSYINNVRVLLRIRR 509 (590) Q Consensus 439 Pan~~~~~i~g~~~~~~-------~~~~~e~~~Ln~~gIn~i~~~~~~-G~~~wG~rT~s~-d~~~~~i~vrR~~~~i~~ 509 (590) |..+..+.|.|+..++. .++.+..+.|..+|..+.+.++|+ |+.+-+.||++. 
.++|+||..+|..+-+.| T Consensus 200 pgRV~tGai~gl~~~~~p~d~~g~el~~a~l~aLd~arysvpr~Y~gydG~Yw~dg~tl~~~gsDYq~ie~~RVvdKa~R 279 (376) T protein:vir:37 200 PARVQTGALVSLGSANKPLDKDGNELTLAHLKSLETARYSVPMWYPDYDGYYRADGRTLDVEGGDYQVIENLRVVDKVAR 279 (376) T ss_pred ccceeecccccccccccccccCCcccchHHHHHHHhCCCeEEEeeCCCCceEEeCCeEeccCCCCeeeehhchHHHHHHH Confidence 99988777777643332 356678899999999999999985 664445799876 479999999999998888 Q ss_pred HHHHHHH-HHhcC--CCCHHHHHHHHHHHHHHHHHHHhCCceEEEec-CCCCCH--HHh-----hCCEEEEEEEEEecCc Q lcl|NC_016163. 510 EVEKMMA-DYRQE--FQDNTTYDSMSYSLNNYLQQWVANRACSSISG-TVYASD--YDK-----QQSIARVKVELVFTGV 578 (590) Q Consensus 510 si~~~~~-~~vfe--pn~~~l~~~v~~~i~~~L~~l~~~ga~~~~~d-~~~nt~--~~i-----~~G~l~~~i~~ap~~p 578 (590) .++...- +..++ ..++.-.+..+..++.=|+.|.+.+-|.|.-- .+-.+| +|| ...++.|-+-+.|.-- T Consensus 280 ~vR~~Ai~~i~Dr~lnstp~sia~~~~~~~~pLr~M~ks~ei~g~~fpgei~~P~d~dI~i~w~sk~~V~I~~~vrPy~c 359 (376) T protein:vir:37 280 KVRLLAIGKIADRSFNSTTSSTEYHKNYFAKPLRDMSKSATINGKDFPGECMPPKDDAITIVWQSKTKVTIYIKVRPYDC 359 (376) T ss_pred HHHHHHHHHhcCccccCChhHHHHHHHHHhHHHHHHHhhhhhccccccceeecCCCCceEEEeccCceEEEEEEEeeecC Confidence 7775544 44333 23577778888889999999999887766421 111222 233 3567778888888877 Q ss_pred cceEEEEEEeeC Q lcl|NC_016163. 579 IERIAIDLVVNK 590 (590) Q Consensus 579 aefi~~~~~~~~ 590 (590) ...|+..|..+= T Consensus 360 pk~i~~~I~LDl 371 (376) T protein:vir:37 360 PKEITANIFLDL 371 (376) T ss_pred cceeEEEEEEec Confidence 788888888887 No 55 >protein:vir:4463 Length: 498 # NCBI annotation: Tail Sheath protein # Family: family:all:369 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700386;genbank:gi:23505458;genbank:GeneID:955665 Probab=98.40 E-value=8.2e-07 Score=53.94 Aligned_cols=434 Identities=12% Similarity=0.028 Sum_probs=196.0 Q ss_pred Ccc-ccCCceEEEEecCCCceecccccceeEEEeec--CCCCCCccEEecCHHHHHHhcCCccccccccHHHHHHHHHcC Q lcl|NC_016163. 
1 MAD-YLHPSVSSRIVDNSAVYATAAGNTVLYAAIHS--AIGRDNAVEFVTTTDEFLFKFGNPNLSKYGQTSYNILNWLQS 77 (590) Q Consensus 1 Mp~-yl~PGVYveEi~s~~~~i~gv~Tsv~~~vg~~--~~Gp~~~p~~v~s~~e~~~~fG~~~~~~~~~l~~av~~ff~n 77 (590) .|. .+-||+|+|--.|.+ ....-+--+-.++-.. -..+.++|++|+|-+|-...||. .|.|..-++.|.++ T Consensus 8 IP~~iRvP~~y~E~dns~A-~~~~~~q~vLiiGq~la~gs~~~~~~v~v~s~~~a~~~fG~-----GSml~~M~~a~~~~ 81 (498) T protein:vir:44 8 IPSDTRVPLFYAEMDNSAA-NTARDSGASLLIGHASNDASIAVNSLVLVSSVDYARQICGA-----GSQLARMVGAYRKT 81 (498) T ss_pred cCcccccCeEEEEEeCCCC-CCCcCCcceEEEEecCcccccccceeEeecCHHHHHHhcCc-----ccHHHHHHHHHHHh Confidence 554 778999999433444 2222222333333322 23467999999999999999994 26777788889987 Q ss_pred CC--cEEEEEEecCCccccccccccceeecccccccceeeeeeccccccccccceEEEeeccccCCcceeeEeecccccc Q lcl|NC_016163. 78 GG--TAYVLRVMPDDAKFANSLISIKTTAAADPAKATVLVTAKAQTTNTASKNAMKTILSGGTAGETPLCFIVPKGRGEN 155 (590) Q Consensus 78 GG--~~~vvRv~~~~a~~a~~~~~~~~~~a~~~~~~~~~v~~~~~~~~~a~~~~~~~~~~~~t~~~~~~~~~~~~~~g~~ 155 (590) -- .+|++=+.+ .+..+ ..+.+..+....+..+..+.+.++ .+.+....+..+......+. . .... T Consensus 82 n~~~~l~~i~~~D-~aG~a--Atg~it~tg~at~~G~l~l~Igg~--------~v~v~V~~gdTaa~vA~al~-a-aina 148 (498) T protein:vir:44 82 DPFGELYVIAVPE-STGAA--ATVALTVTGEATETGTVNVYTGRT--------RVQAPVTSGDDAAAVAVSIK-D-AVNA 148 (498) T ss_pred CCCceeEEEecCC-cccce--eEEEEEeecccCCCcEEEEEECCE--------EEEEEecCCCCHHHHHHHHH-H-HHhC Confidence 65 599997732 22111 111111111111111222222111 11111111110000000000 0 0000 Q ss_pred cccceEEEEeeccccccccccccceeeeeecccCCCceeeeeeeeeccccccccccccceeeeeeccccceeeecCcccc Q lcl|NC_016163. 156 YNGYGFRLSLRSDYDNTYNFRTYNLSVTVKDSTGADVVVEGPYIVSFDPEAKDKSRQSIYYANIINKYSQYVEIVDNRSA 235 (590) Q Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~~l~i~v~d~~~~~~v~e~~~~ls~~~da~~~~~~~~~~~~vv~~~s~~v~~~~~~~~ 235 (590) .....+.. . .+.....++...+...+ +.+.... +|+.... 
+ T Consensus 149 ~~~lPVTA--~------~~~~~vtlTAr~kG~~G-N~I~l~~----------------~~~~~~~----------g---- 189 (498) T protein:vir:44 149 NPDLPFTA--T------SEAGVVTLTARHKGLYG-NEIPVTL----------------NYYGFGG----------G---- 189 (498) T ss_pred CCCCceEE--e------eccceEEEEEeccCccc-CcceEEE----------------eeccCcc----------c---- Confidence 00000000 0 00011112222221111 1111100 0000000 0 Q ss_pred ceeeeeecccccccCccccceecccccccccccccccccccceeecccccccccccccccccceeeeecccccccccccc Q lcl|NC_016163. 236 FETISEFVVGDSEADPQKVDIIFGQERAVTPAETIHANVVWKSSSVETDDPSYDATAANFNNIQYLTEGSEGTWTGGNEE 315 (590) Q Consensus 236 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 315 (590) +..+....+. ...++++.-. T Consensus 190 ------------e~~p~Glt~t------------------itamsgGag~------------------------------ 209 (498) T protein:vir:44 190 ------------EVLPAGVNIT------------------VASGVKGAGA------------------------------ 209 (498) T ss_pred ------------cccccceeEE------------------EEcccCCccC------------------------------ Confidence 0000000000 0000000000 Q ss_pred ceeccchhhHHHHHHHhhhccCCceeeecccchhHHHHHHHHHHHH--------hcCCeEEEEecCCCCCHHHHHHHHHh Q lcl|NC_016163. 316 SALLVKGYSGVLAPEILDKQQYEIDVLLDGNNEVAVKNAMSDLCSE--------QRGDCIAILDCSFQGDAQQTIDYRTG 387 (590) Q Consensus 316 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~--------~~~~~~a~~d~p~~~~~~~~~~~~~~ 387 (590) | +. .+.+........++++.|..+.+-..++-+|+.+ ++++++++ .....+..++..+-.. T Consensus 210 --P---Di----a~alaal~~~~~~~i~~p~~D~asl~al~~~L~~~sgRw~~~~q~~g~~~--~a~~gT~a~l~t~g~~ 278 (498) T protein:vir:44 210 --P---AL----NDAVAAMGDEPFDYIGLPFNDTASVNSMATEMNDSSGRWSYVRQLYGHVY--TAKTGTLSELVAAGDQ 278 (498) T ss_pred --c---hh----HHHHHhhccCCccEEEEeecCHHHHHHHHHHHhhhhcchHHHhhcCeEEE--EeccCCHHHHHHhhhc Confidence 0 00 0011112222334555666666666666666643 22344544 3445677887777643 Q ss_pred hcCcccceEEEEcCeEEEeecccCceeeecHH---HHHHHHHH---HhhccCCceECcCCcccceeeccc--cceeecCh Q lcl|NC_016163. 
388 NISMSTYFTAIFGQHMNVYDEYNGETITVTST---YFLASMIP---SNDDQNGIQWTFVGPRRGVISGFT--DINFYPNE 459 (590) Q Consensus 388 ~~~~~s~~~~~~~p~~~~~d~~~~~~~~~pps---g~~AG~~A---~~D~~~G~~~sPan~~~~~i~g~~--~~~~~~~~ 459 (590) .++.|..+.+.. + ...-|+- +.+||+.| +.|..| |=| .-.+.|+. ++...++. T Consensus 279 ---~N~~~it~~~~~--------~-~~~sp~~~~AAa~a~~aA~~l~~DPAr-----PL~--tl~L~Gi~~p~~~~r~~~ 339 (498) T protein:vir:44 279 ---FNLQHITLAGYE--------K-DTQTPADELAASRTARAAVFIRNDPAR-----PTQ--TGELVDMLPAPKGKRFTT 339 (498) T ss_pred ---cCCceEEEEecC--------C-CCCCHHHHHHHHHHHHHHHHhhccccc-----ccC--ceeecccccCCchhcCCh Confidence 467666554211 1 0111332 34444444 456544 222 13466776 44567899 Q ss_pred hHHhhhhhcCceEEEEecCCeEEEecceec-------CCCcccceehhhhHHHHHHHHHHHHHHH-HhcCCCCH------ Q lcl|NC_016163. 460 PWKEKLYLAQVNYIERDPKKISFATQLTSQ-------TSRSALSYINNVRVLLRIRREVEKMMAD-YRQEFQDN------ 525 (590) Q Consensus 460 ~e~~~Ln~~gIn~i~~~~~~G~~~wG~rT~-------s~d~~~~~i~vrR~~~~i~~si~~~~~~-~vfepn~~------ 525 (590) .|+|.|.-.||.++.--.|. ..+--..|. ..|+.|..|+..|+.+|+.+.++..... |--++.-+ T Consensus 340 ~ern~LL~~Gist~~V~~G~-V~I~R~ITTY~~n~~G~~D~syLDi~T~~tl~yvr~~~r~~i~~kfpR~KLa~d~~~~~ 418 (498) T protein:vir:44 340 TEQQTLLSHGVATAYVESGV-LRIQRDITTYRKNAYGVADNSYLDSETLHTSAYVLRRLKSVITSKYGRHKLANDGTRFG 418 (498) T ss_pred HHHHHHHhcCcceEEEcCCe-EEEEeeeeeeeecCCCCcchhhhhhhhHHHHHHHHHHHHHHhhhhcCCcccccCCcccC Confidence 99999999999999764432 333333332 2378899999999999999999987764 22222111 Q ss_pred -----HHHHHHHHHHHHHHHHHHhCCce----------EEEecCCCCCHHHhhCCEEEEEEEEEecCccc----eEEEEE Q lcl|NC_016163. 526 -----TTYDSMSYSLNNYLQQWVANRAC----------SSISGTVYASDYDKQQSIARVKVELVFTGVIE----RIAIDL 586 (590) Q Consensus 526 -----~l~~~v~~~i~~~L~~l~~~ga~----------~~~~d~~~nt~~~i~~G~l~~~i~~ap~~pae----fi~~~~ 586 (590) .+-..||..+-+-++.|..+|-+ .+-.|.++ ..||.+.+-...+-+.. 
.|.|++ T Consensus 419 ~gq~IvTp~~ir~eli~~y~~le~~givEn~~~~~~~LiVerd~~d-------pnRln~~~p~d~vn~L~V~A~~~~f~l 491 (498) T protein:vir:44 419 SGQAIVTPAVIRGELGSTYRQMEREGIVENFDLFQQHLIVERNAND-------SNRLDVLFPPDYVNQLRVFAVLNQFRL 491 (498) T ss_pred CCcccccHHHHHHHHHHHHHhhhhhccccChhhhcceeEEEECCCC-------CcEEEEEecccccCchhhhhhhhhhhh Confidence 26678999998888888887743 22333332 23444443332222221 122222 Q ss_pred EeeC Q lcl|NC_016163. 587 VVNK 590 (590) Q Consensus 587 ~~~~ 590 (590) ..+. T Consensus 492 q~~~ 495 (498) T protein:vir:44 492 QYSE 495 (498) T ss_pred hhhh Confidence 2222 No 56 >protein:vir:4517 Length: 498 # NCBI annotation: tail sheath protein # Family: family:all:369 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599043;genbank:gi:19549001;genbank:GeneID:935236 Probab=98.37 E-value=1e-06 Score=53.45 Aligned_cols=434 Identities=11% Similarity=0.020 Sum_probs=195.7 Q ss_pred Ccc-ccCCceEEEEecCCCceecccccceeEEEee--cCCCCCCccEEecCHHHHHHhcCCccccccccHHHHHHHHHcC Q lcl|NC_016163. 1 MAD-YLHPSVSSRIVDNSAVYATAAGNTVLYAAIH--SAIGRDNAVEFVTTTDEFLFKFGNPNLSKYGQTSYNILNWLQS 77 (590) Q Consensus 1 Mp~-yl~PGVYveEi~s~~~~i~gv~Tsv~~~vg~--~~~Gp~~~p~~v~s~~e~~~~fG~~~~~~~~~l~~av~~ff~n 77 (590) .|. .+-||+|+|-=.|.+. .....--+-.++-. .-..+.++|++|+|-+|-...||. .|.|..-++.|.++ T Consensus 8 IP~~iRvP~~y~E~dns~A~-~~~~~q~vLiiGq~la~gs~~~~~~v~v~s~~~a~~lfG~-----GSml~~M~~a~~~~ 81 (498) T protein:vir:45 8 IPSNTLVPLFYAEMDNQAAN-TAQDSGASLLIGHANNGAEIVANSLVLMPSADYARQICGA-----GSQLARMVEAYRQT 81 (498) T ss_pred cCcccccCeEEEEEeCCCCC-CCCCCcceEEEEecCCccccccceeEEecCHHHHHHhcCc-----CcHHHHHHHHHHHh Confidence 554 7789999994344442 22222233333322 223467999999999999999994 26777778888877 Q ss_pred CC--cEEEEEEecCCccccccccccceeecccccccceeeeeeccccccccccceEEEeeccccCCcceeeEeecccccc Q lcl|NC_016163. 
78 GG--TAYVLRVMPDDAKFANSLISIKTTAAADPAKATVLVTAKAQTTNTASKNAMKTILSGGTAGETPLCFIVPKGRGEN 155 (590) Q Consensus 78 GG--~~~vvRv~~~~a~~a~~~~~~~~~~a~~~~~~~~~v~~~~~~~~~a~~~~~~~~~~~~t~~~~~~~~~~~~~~g~~ 155 (590) -- .+|++=+.+ .+..+ ..+.+..+.......+..+.+.+. .+.+....+..+......+.. .... T Consensus 82 n~~~~l~~i~~~d-~aG~a--A~g~it~tg~at~~G~l~l~Igg~--------~v~v~V~~gdTaa~vA~al~a--aina 148 (498) T protein:vir:45 82 DPFGELYVIAVPE-ATGAA--ATVTLTVTGEATESGTVNVYVGRT--------RVQAPVTNGDNVTTIASSIQD--AINA 148 (498) T ss_pred CCcceEEEEeeCC-cccce--eEEEEEeecccCCCcEEEEEECCE--------EEEEEecCCCCHHHHHHHHHH--HHhC Confidence 64 599998842 11111 111111111111111111111110 111111111100000000000 0000 Q ss_pred cccceEEEEeeccccccccccccceeeeeecccCCCceeeeeeeeeccccccccccccceeeeeeccccceeeecCcccc Q lcl|NC_016163. 156 YNGYGFRLSLRSDYDNTYNFRTYNLSVTVKDSTGADVVVEGPYIVSFDPEAKDKSRQSIYYANIINKYSQYVEIVDNRSA 235 (590) Q Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~~l~i~v~d~~~~~~v~e~~~~ls~~~da~~~~~~~~~~~~vv~~~s~~v~~~~~~~~ 235 (590) .....+.. . .+.....++...+...+ +.+.-. + +|+.... +. T Consensus 149 ~~~lPVTA--~------~~~~~VtlTAr~kG~~G-N~I~l~---~-------------~~~~~~~----------ge--- 190 (498) T protein:vir:45 149 VPTLPFTA--S------SSAGVVTLTARHKGLCG-NEIPVS---L-------------NYYGFGG----------GE--- 190 (498) T ss_pred CCCCceEE--E------ecCceEEEEeeccCccc-cceeEE---E-------------eeccccc----------cc--- Confidence 00000000 0 00011112222221111 111000 0 0000000 00 Q ss_pred ceeeeeecccccccCccccceecccccccccccccccccccceeecccccccccccccccccceeeeecccccccccccc Q lcl|NC_016163. 236 FETISEFVVGDSEADPQKVDIIFGQERAVTPAETIHANVVWKSSSVETDDPSYDATAANFNNIQYLTEGSEGTWTGGNEE 315 (590) Q Consensus 236 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 315 (590) ..+....+. ...++++.-. 
T Consensus 191 -------------~~p~Glt~~------------------itamagGag~------------------------------ 209 (498) T protein:vir:45 191 -------------VLPAGVQIA------------------VATGTAGTGA------------------------------ 209 (498) T ss_pred -------------cccceeeEE------------------EEccCCCccC------------------------------ Confidence 000000000 0000000000 Q ss_pred ceeccchhhHHHHHHHhhhccCCceeeecccchhHHHHHHHHHHHH--------hcCCeEEEEecCCCCCHHHHHHHHHh Q lcl|NC_016163. 316 SALLVKGYSGVLAPEILDKQQYEIDVLLDGNNEVAVKNAMSDLCSE--------QRGDCIAILDCSFQGDAQQTIDYRTG 387 (590) Q Consensus 316 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~--------~~~~~~a~~d~p~~~~~~~~~~~~~~ 387 (590) | +.. +.+........+.++.|..+.+-..++-+|+.+ ++++++++ .....+..+...+-.. T Consensus 210 --P---D~a----~alaal~~~~~~~I~~p~~D~asL~al~~~L~~~sgRw~~~~q~~g~~~--~a~~gT~~~l~t~g~~ 278 (498) T protein:vir:45 210 --P---VLT----GAVAAMADEPFDYIGLPFNDTASVNTLVTEMNDTSGRWSYARQLYGHVY--TAKTGTLSELVNAGDQ 278 (498) T ss_pred --c---hhH----HHHHHhccCCccEEEEeeCCHHHHHHHHHHHhhhhhhhhHHhhcCeEEE--EeccCCHHHHHHhhhc Confidence 0 000 011111222334555566666666666666643 12344444 4455678888777643 Q ss_pred hcCcccceEEEEcCeEEEeecccCceeeecH---HHHHHHHHH---HhhccCCceECcCCcccceeeccc--cceeecCh Q lcl|NC_016163. 388 NISMSTYFTAIFGQHMNVYDEYNGETITVTS---TYFLASMIP---SNDDQNGIQWTFVGPRRGVISGFT--DINFYPNE 459 (590) Q Consensus 388 ~~~~~s~~~~~~~p~~~~~d~~~~~~~~~pp---sg~~AG~~A---~~D~~~G~~~sPan~~~~~i~g~~--~~~~~~~~ 459 (590) .++.|..+.+. ++ ...-|| ++.+||..| +.|..| |=| .-.+.|+. ++...++. T Consensus 279 ---~N~~~it~~~~--------~~-~~~sp~~~~AAa~aa~~A~~l~~DPAr-----PL~--tl~L~Gi~~p~~~~r~~~ 339 (498) T protein:vir:45 279 ---FNQQHITLAGY--------EK-ETQTPADELAASRTARAAVFIRNDPAR-----PTQ--TGELVGMLPAPKGKRFTM 339 (498) T ss_pred ---cCCceEEEEec--------CC-CCCChHHHHHHHHHHHHHHHhhccccc-----ccC--ceeecceecCCchhcCCh Confidence 46776655431 11 111133 344444444 456544 222 23466765 45566889 Q ss_pred hHHhhhhhcCceEEEEecCCeEEEecceec-------CCCcccceehhhhHHHHHHHHHHHHHHHH-hcCCCCHH----- Q lcl|NC_016163. 
460 PWKEKLYLAQVNYIERDPKKISFATQLTSQ-------TSRSALSYINNVRVLLRIRREVEKMMADY-RQEFQDNT----- 526 (590) Q Consensus 460 ~e~~~Ln~~gIn~i~~~~~~G~~~wG~rT~-------s~d~~~~~i~vrR~~~~i~~si~~~~~~~-vfepn~~~----- 526 (590) .|+|.|.-.||.++.-..|. ..+--..|. ..|+.|..|+..|+.+|+.+.++...... --+.+-+. T Consensus 340 ~ern~LL~~Gist~~V~~G~-V~I~R~ITTY~~n~~G~~D~syLDi~T~~tl~yvr~~~r~~i~~kfpR~KLa~dg~~~~ 418 (498) T protein:vir:45 340 TEQQTLLSHGVATAYVESGV-LRIQRDVTTYRKNAYGVADNSYLDSETLHTSAYVLRKLKSVITSKYGRHKLASDGTRFG 418 (498) T ss_pred HHHHHHHhCCcceEEEcCCe-EEEEeeeeeeeecCCCCcchhhhhhhhHHHHHHHHHHHHHHhhhhcCCeeecccCcccC Confidence 99999999999999764432 333333332 24788999999999999999999887753 22222222 Q ss_pred ------HHHHHHHHHHHHHHHHHhCCce----------EEEecCCCCCHHHhhCCEEEEEEEEEecCcc----ceEEEEE Q lcl|NC_016163. 527 ------TYDSMSYSLNNYLQQWVANRAC----------SSISGTVYASDYDKQQSIARVKVELVFTGVI----ERIAIDL 586 (590) Q Consensus 527 ------l~~~v~~~i~~~L~~l~~~ga~----------~~~~d~~~nt~~~i~~G~l~~~i~~ap~~pa----efi~~~~ 586 (590) |-..||..+-+-++.|..+|-+ .+-.|.++ ..||.+.+-.-.+-+. -.|.|++ T Consensus 419 ~gq~IvTp~~ir~ell~~y~~le~~givEn~~~~~~~LiVerd~~d-------pnRln~~~p~d~vn~L~V~A~~~~f~l 491 (498) T protein:vir:45 419 PGQAIVTPAVIKGELLATYRQLERAGIVENYELFKQYLVVERDASV-------PNRLNTLFPPDYVNQLRVFAVVNQFRL 491 (498) T ss_pred CCCcccchHHHHHHHHHHHHhhhhhccccChhhhcceeEEEECCCC-------CcEEEEEecccccCchhhhhhhhhhhe Confidence 6678999988888888887743 23334332 1344433332222222 1222333 Q ss_pred EeeC Q lcl|NC_016163. 587 VVNK 590 (590) Q Consensus 587 ~~~~ 590 (590) ..+. 
T Consensus 492 q~~~ 495 (498) T protein:vir:45 492 QYSE 495 (498) T ss_pred ehhh Confidence 3333 No 57 >protein:vir:1996 Length: 495 # NCBI annotation: major tail subunit # Family: family:all:369 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050643;genbank:gi:9633530;genbank:GeneID:2636269 Probab=98.26 E-value=1.9e-06 Score=51.89 Aligned_cols=434 Identities=12% Similarity=0.102 Sum_probs=195.2 Q ss_pred Ccc-ccCCceEEEEecCCCce-ecccccceeEEEee--cCCCCCCccEEecCHHHHHHhcCCccccccccHHHHHHHHHc Q lcl|NC_016163. 1 MAD-YLHPSVSSRIVDNSAVY-ATAAGNTVLYAAIH--SAIGRDNAVEFVTTTDEFLFKFGNPNLSKYGQTSYNILNWLQ 76 (590) Q Consensus 1 Mp~-yl~PGVYveEi~s~~~~-i~gv~Tsv~~~vg~--~~~Gp~~~p~~v~s~~e~~~~fG~~~~~~~~~l~~av~~ff~ 76 (590) .|. .+-||+|+|=-.|.+.+ ...-.-.+-.++-. .-..+.++|++|+|-+|-...||. .|.+..-++.|.+ T Consensus 9 IP~~iRvP~~y~E~dns~A~~g~~~~~q~vLiiGq~la~gs~~~~~pv~v~s~~~a~~~fG~-----GS~la~M~~a~~~ 83 (495) T protein:vir:19 9 IPSDVRVPLTYIEFDNSNAVSGTPAPRQRVLMFGQSGSKASAAPNVPVRIRSGSQASAAFGQ-----GSMLALMADAFLN 83 (495) T ss_pred CCcccccCeEEEEEccCCCCcCCcCCCceEEEEEecCcccccccceeEEecCHHHHHHhcCc-----CcHHHHHHHHHHH Confidence 443 67899999943343321 12212233333332 233467999999999999999994 2667777788886 Q ss_pred CCC--cEEEEEEecCCccccccccccceeecccccccceeeeeeccccccccccceEEEeeccccCCcceeeEeeccccc Q lcl|NC_016163. 77 SGG--TAYVLRVMPDDAKFANSLISIKTTAAADPAKATVLVTAKAQTTNTASKNAMKTILSGGTAGETPLCFIVPKGRGE 154 (590) Q Consensus 77 nGG--~~~vvRv~~~~a~~a~~~~~~~~~~a~~~~~~~~~v~~~~~~~~~a~~~~~~~~~~~~t~~~~~~~~~~~~~~g~ 154 (590) +-- .+|++=+.+ .+..+ ..+....+.......+..+.+.+ ..+.+....+..+.. +++.-.. 
T Consensus 84 ~n~~~~l~~i~~~D-~aG~a--A~g~it~tg~at~~G~l~l~I~g--------~~v~v~V~~gdTaa~-----vA~al~a 147 (495) T protein:vir:19 84 ANRVAELWCIPQGN-GTGNA--AVGEISLSGTAGENGSLVTYIAG--------QRLAVSVAAGATGAA-----LADLLVA 147 (495) T ss_pred hCCcceEEEEeeCC-hhhce--eEEEEEEeecCCCCcEEEEEECC--------EEEEEEecCCCCHHH-----HHHHHHH Confidence 654 599998842 11111 11111111111111111122111 011111111110000 0000000 Q ss_pred ccccceEEEEeeccccccccccccceeeee-ecccCCCceeeeeeeeeccccccccccccceeeeeeccccceeeecCcc Q lcl|NC_016163. 155 NYNGYGFRLSLRSDYDNTYNFRTYNLSVTV-KDSTGADVVVEGPYIVSFDPEAKDKSRQSIYYANIINKYSQYVEIVDNR 233 (590) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~l~i~v-~d~~~~~~v~e~~~~ls~~~da~~~~~~~~~~~~vv~~~s~~v~~~~~~ 233 (590) .-+. .. ..++...+ .+.+.... .....++ ++. ++. .|..+..+...+ T Consensus 148 aina---~~-------------~lPvTA~~~~~~~~~~a--~~~VtlT----Ar~-kG~-------~n~idi~~~~~~-- 195 (495) T protein:vir:19 148 RIKG---QP-------------DLPVTAEVRADSGDDDT--HADVVLS----AKF-TGA-------LSAVDVRWNYYA-- 195 (495) T ss_pred HhcC---Cc-------------cCceEEEeeccCCCCcC--ceeEEEE----Eee-ccc-------cccceeEEEeec-- Confidence 0000 00 00000000 00000000 0000000 000 000 000000000000 Q ss_pred ccceeeeeecccccccCccccceecccccccccccccccccccceeecccccccccccccccccceeeeecccccccccc Q lcl|NC_016163. 234 SAFETISEFVVGDSEADPQKVDIIFGQERAVTPAETIHANVVWKSSSVETDDPSYDATAANFNNIQYLTEGSEGTWTGGN 313 (590) Q Consensus 234 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 313 (590) ++..+...... ...++ +|. T Consensus 196 -------------ge~~p~Glt~t------------------itams------------------------------gGa 214 (495) T protein:vir:19 196 -------------GETTPYGIITA------------------FKAAS------------------------------GKN 214 (495) T ss_pred -------------ccccccceeEE------------------EEecC------------------------------CCC Confidence 00000000000 00000 100 Q ss_pred ccceeccchhhHHHHHHHhhhccCCceeeecccchhHHHHHHHHHHHHh-----cCCeEEEEecCCCCCHHHHHHHHHhh Q lcl|NC_016163. 
314 EESALLVKGYSGVLAPEILDKQQYEIDVLLDGNNEVAVKNAMSDLCSEQ-----RGDCIAILDCSFQGDAQQTIDYRTGN 388 (590) Q Consensus 314 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~-----~~~~~a~~d~p~~~~~~~~~~~~~~~ 388 (590) +. | |. .+.+........++++.|-.+.+...++-+|++.+ +++++++ .....+..+...+-.. T Consensus 215 g~--P---Di----a~alaal~~~~~~~I~~P~tD~asL~al~~~l~~rw~~~~q~~g~~~--~a~~gT~~~l~t~g~~- 282 (495) T protein:vir:19 215 GN--P---DI----SASIAGMGDLQYKYIVMPYTDEPNLNLLRTELQERWGPVNQADGFAV--TVLSGTYGDISTFGVS- 282 (495) T ss_pred CC--c---ch----HHHHHHhccCCCcEEEEecCcHHHHHHHHHHHHHhhhHHHhcCeEEE--EeecCCHHHHHHhhhc- Confidence 00 0 00 01122222333445666766667777788887652 3345555 3345577777776543 Q ss_pred cCcccceEEEEcCeEEEeecccCceeeecHHHHHHHH---HH---HhhccCCceECcCCcccceeeccc--cceeecChh Q lcl|NC_016163. 389 ISMSTYFTAIFGQHMNVYDEYNGETITVTSTYFLASM---IP---SNDDQNGIQWTFVGPRRGVISGFT--DINFYPNEP 460 (590) Q Consensus 389 ~~~~s~~~~~~~p~~~~~d~~~~~~~~~ppsg~~AG~---~A---~~D~~~G~~~sPan~~~~~i~g~~--~~~~~~~~~ 460 (590) .++.|..+.+ .++. .-||....|++ .| +.|..| |=| .-.+.|+. ++...++.. T Consensus 283 --~N~~~it~~~--------~~gs--p~~~~~~AAA~aa~~A~~l~~DPAr-----PL~--tl~L~Gi~~p~~~~r~~~~ 343 (495) T protein:vir:19 283 --RNDHLISCMG--------IAGA--PEPSYLYAATLCAVASQALSIDPAR-----PLQ--TLTLPGRMPPAVGDRFTWS 343 (495) T ss_pred --cCCceEEEEe--------cCCC--CCcHHHHHHHHHHHHHHHhhccccc-----ccC--ceeecceecCCccccCChH Confidence 4676665542 1221 22444333333 32 455544 222 23466775 456668999 Q ss_pred HHhhhhhcCceEEEEecCCeEEEecceec-------CCCcccceehhhhHHHHHHHHHHHHHHHHhc-CCCCHH------ Q lcl|NC_016163. 461 WKEKLYLAQVNYIERDPKKISFATQLTSQ-------TSRSALSYINNVRVLLRIRREVEKMMADYRQ-EFQDNT------ 526 (590) Q Consensus 461 e~~~Ln~~gIn~i~~~~~~G~~~wG~rT~-------s~d~~~~~i~vrR~~~~i~~si~~~~~~~vf-epn~~~------ 526 (590) |+|.|.-+||.++.--+++=..+--..|. ..|+.|..|+.-|+.+|+.+.++......-. +++-+. 
T Consensus 344 ern~LL~~Gist~~V~~~G~V~I~R~ITTY~~n~~G~~D~syLDi~T~~tl~yvr~~~r~~i~~kfpR~KLa~d~~~~~~ 423 (495) T protein:vir:19 344 ERNALLFDGISTFNVNDGGEMQIERMITMYRTNKYGDSDPSYLNVNTIATLSYLRYSLRTRITQKFPNYKLASDGTRFAT 423 (495) T ss_pred HHHHHHhCCcceEEECCCCeEEEEeeeeeeeecCCCCcchhhhhhHHHHHHHHHHHHHHHHHhhhcCCcccccCCCCCCC Confidence 99999999999988655433333333332 2377899999999999999999987764222 222222 Q ss_pred -----HHHHHHHHHHHHHHHHHhCCce----------EEEecCCCCCHHHhhCCEEEEEEEEEecCccce----EEEEE Q lcl|NC_016163. 527 -----TYDSMSYSLNNYLQQWVANRAC----------SSISGTVYASDYDKQQSIARVKVELVFTGVIER----IAIDL 586 (590) Q Consensus 527 -----l~~~v~~~i~~~L~~l~~~ga~----------~~~~d~~~nt~~~i~~G~l~~~i~~ap~~paef----i~~~~ 586 (590) |-..||..+-+-++.|..+|-+ .+-.|.++ .+||.+.+-...+-+..- |.|++ T Consensus 424 gq~IvTp~~ir~ell~~~~~le~~given~~~~~~~LiVerd~~d-------pnRln~~~p~d~vn~L~V~A~~i~f~L 495 (495) T protein:vir:19 424 GQAVVTPSVIKTELLALFEEWENAGLVEDFDTFKEELYVARNKDD-------KDRLDVLCGPNLINQFRIFAAQVQFIL 495 (495) T ss_pred cccccChHHHHHHHHHHHHhhhhhccccChhhhcceeEEEECCCC-------CcEEEEEecceeeCceeeeeeeeeeeC Confidence 5677899988888888887743 23334332 245554444333333221 12222 No 58 >protein:vir:80052 Length: 331 # NCBI annotation: gp14 # Family: family:all:396 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468718;genbank:gi:157325298;genbank:GeneID:5601743 Probab=97.92 E-value=1.1e-05 Score=47.79 Aligned_cols=313 Identities=13% Similarity=0.036 Sum_probs=150.6 Q ss_pred eeccccceeeecCccccceeeeeecccccccCccccceeccccccccccccccc-ccccceeeccccccccccccccccc Q lcl|NC_016163. 219 IINKYSQYVEIVDNRSAFETISEFVVGDSEADPQKVDIIFGQERAVTPAETIHA-NVVWKSSSVETDDPSYDATAANFNN 297 (590) Q Consensus 219 vv~~~s~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~ 297 (590) ++ +.++.+.-+..... ............++.+.........+... ..+. +.....+.+....+.. 
T Consensus 1 ~~---~~iv~V~v~~~~~~------~~~~~~~~~~~~~~~~t~~~~~~y~s~~~v~~d~-----~~~~~~Ykaa~~~f~Q 66 (331) T protein:vir:80 1 MV---ETITDVRVHISVLY------PSPRIGLGRPAIFVKGTAMGYKEYTTLEELKDTF-----ADNTEVYAKAKAVFLQ 66 (331) T ss_pred Cc---cceecceeeecccc------cccccccCcceeEEeccccceEEEechhhhccCC-----CCCcHHHHHHHHHHhc Confidence 11 11111111100000 00000000000000000000000000000 0000 0000000000000000 Q ss_pred ceeeeeccccccccccccceeccchhhHHHHHHHhhhccCCceeeecccchhHHHHHHHHHHHHhcCCeEEEEecCCCCC Q lcl|NC_016163. 298 IQYLTEGSEGTWTGGNEESALLVKGYSGVLAPEILDKQQYEIDVLLDGNNEVAVKNAMSDLCSEQRGDCIAILDCSFQGD 377 (590) Q Consensus 298 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~a~~d~p~~~~ 377 (590) .. ....... ...... ..+..+.......--.+.......+-..++...++. +...|.+++... T Consensus 67 ~~-----~~~~i~v------~~~~~~--~~~~a~~a~~~~~w~~~~~~~~~~~~~~a~a~~~~a-~~~~f~~~~~~~--- 129 (331) T protein:vir:80 67 KD-----RPDTVAV------ITYEDT--KLLEAAEAYFLKSWHFALLAEFKAADALALSNLIEE-QKFKFAVFQVTA--- 129 (331) T ss_pred cC-----ccceEEE------eccchH--HHHHHHHHhccCceeEEEeecCCHHHHHHHHHHHhh-CCcEEEEEecCc--- Confidence 00 0000000 000000 011111111111111222222233334445555544 345666665421 Q ss_pred HHHHHHHHHhhcCcccceEEEEcCeEEEeecccCceeeecHHHHHHHHHHHhhccCCceECcCCcccceeeccccceeec Q lcl|NC_016163. 378 AQQTIDYRTGNISMSTYFTAIFGQHMNVYDEYNGETITVTSTYFLASMIPSNDDQNGIQWTFVGPRRGVISGFTDINFYP 457 (590) Q Consensus 378 ~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~ppsg~~AG~~A~~D~~~G~~~sPan~~~~~i~g~~~~~~~~ 457 (590) ..++..... .+....++|+ .... . +.+.+.|.++..|.-+--|+ ++ ..+.|+.. 
-.+ T Consensus 130 ~~~~~~~~~-----~~~t~~~~~~-------~~~~---~-~~aa~~g~~~~~~~g~~t~~---fk--~~l~GV~~--~~l 186 (331) T protein:vir:80 130 VADITPLAK-----NTRTIAIVHS-------KTGE---K-LDAALIGNVASLPVGSATWK---GR--HGLAGITS--EEL 186 (331) T ss_pred hHHHHHhhc-----cccEEEEEcC-------Cccc---h-hHHHHHHHHHhcCccceeee---ee--cccCCCCC--CCC Confidence 122221111 2223333332 1111 1 35666777776665332232 22 12455543 257 Q ss_pred ChhHHhhhhhcCceEEEEecCCeEEEecceecCCCcccceehhhhHHHHHHHHHHHHHHHHhcC----CCCHHHHHHHHH Q lcl|NC_016163. 458 NEPWKEKLYLAQVNYIERDPKKISFATQLTSQTSRSALSYINNVRVLLRIRREVEKMMADYRQE----FQDNTTYDSMSY 533 (590) Q Consensus 458 ~~~e~~~Ln~~gIn~i~~~~~~G~~~wG~rT~s~d~~~~~i~vrR~~~~i~~si~~~~~~~vfe----pn~~~l~~~v~~ 533 (590) +..|++.|..+++|++.++.+..+ +....+++++ ||.+.+-.+||+..|++.+...+-. |-|+.=...|+. T Consensus 187 t~t~~~al~~~~~N~y~~~~~~~~-~~~G~~~~G~----~iD~~~~~dWl~~~lq~~l~~ll~~~~kiPy~~~G~~~l~a 261 (331) T protein:vir:80 187 KVSEIDAIQKAGGMCYIEKAGIAQ-TSEGKTVSGE----FIDSIHGDDWIKATIETRLQKLLTETDKLTFDARGIALLQS 261 (331) T ss_pred CHHHHHHHHhcCceEEEEecCeeE-EecceEeCch----hHHHHHHHHHHHHHHHHHHHHHHHhCCCCccChhhHHHHHH Confidence 899999999999999999877654 4555777774 8999999999999999888876544 456666788999 Q ss_pred HHHHHHHHHHhCCceEE-----------E-ecCCCCCHHHhhCCEEE-EEEEEEecCccceEEEEEEeeC Q lcl|NC_016163. 534 SLNNYLQQWVANRACSS-----------I-SGTVYASDYDKQQSIAR-VKVELVFTGVIERIAIDLVVNK 590 (590) Q Consensus 534 ~i~~~L~~l~~~ga~~~-----------~-~d~~~nt~~~i~~G~l~-~~i~~ap~~paefi~~~~~~~~ 590 (590) .|+.-|++-.+.|.+.. . -+.++.+++|+.++++. 
+.+.+.+..-+++|.|++.++= T Consensus 262 ~i~~~~~~av~~G~I~~g~~~~~~~~~v~~~~~~~~s~~dr~~R~~~~~~~~~~~~gaI~~v~i~~~v~~ 331 (331) T protein:vir:80 262 ELTTVLNEGFANGIIDSNDETGEPNFSITALQRSDLNDDDIAKRNYKGLSFRYKRSGAIHSVDVYGEVEV 331 (331) T ss_pred HHHHHHHHHHhCCceecCccCCCcceEEEeCchhcCCHHHHhccCCCCeEEEEEEcceEEEEEEEEEEeC Confidence 99999999999998842 2 24567899999999886 8899999999999999999888 No 59 >protein:vir:5260 Length: 502 # NCBI annotation: hypothetical protein # Family: family:all:396 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852765;genbank:gi:31544040;uniprot:Q7Y5T5;genbank:GeneID:2753559 Probab=97.39 E-value=7.7e-05 Score=43.12 Aligned_cols=458 Identities=10% Similarity=0.037 Sum_probs=199.4 Q ss_pred CccccCCceEEEEecCCCceecccccceeEEEeecC----CCCCCccEEecCHHHHHHhcCCccccccccHHHHHHHHHc Q lcl|NC_016163. 1 MADYLHPSVSSRIVDNSAVYATAAGNTVLYAAIHSA----IGRDNAVEFVTTTDEFLFKFGNPNLSKYGQTSYNILNWLQ 76 (590) Q Consensus 1 Mp~yl~PGVYveEi~s~~~~i~gv~Tsv~~~vg~~~----~Gp~~~p~~v~s~~e~~~~fG~~~~~~~~~l~~av~~ff~ 76 (590) |+==++.=|.|.- ...+..+.+.+=....|.+... .+|..+-+...|..|-...||. . ++...++..||. T Consensus 1 msip~s~ivnV~i-~~~~~a~~~~~f~~~l~l~~~~~~~~~~~~~r~~~~~s~~~V~~~FG~---~--s~ey~aA~~yF~ 74 (502) T protein:vir:52 1 MALSISHIVNVQL-NTVPKSAARKSFGIVALFTPEAGQAFADEKTRYVYVENQRDVEQLFGT---N--SETAKAAQPFFA 74 (502) T ss_pred CCCCccceeEEee-ccccccccccccCceEEEeeccCccccCCccceEEecCHHHHHHhcCC---C--hHHHHHHHHHhc Confidence 8865565555542 2223333333323334444332 2233334446788999999994 1 445567888894 Q ss_pred CC---CcEEEEEEecCCccccccccccceeecccccccceeeeeeccccccccccceEEEeeccccCCcceeeEeecccc Q lcl|NC_016163. 77 SG---GTAYVLRVMPDDAKFANSLISIKTTAAADPAKATVLVTAKAQTTNTASKNAMKTILSGGTAGETPLCFIVPKGRG 153 (590) Q Consensus 77 nG---G~~~vvRv~~~~a~~a~~~~~~~~~~a~~~~~~~~~v~~~~~~~~~a~~~~~~~~~~~~t~~~~~~~~~~~~~~g 153 (590) .- +++||.|-....+... .... ++..........+ .. 
...+|+ T Consensus 75 q~p~P~~l~igR~~~~~~~~~--~~~~-------------~~~~~~~~~~~~~---~~-~~~~G~--------------- 120 (502) T protein:vir:52 75 QSPRAKQLIVARWQKSASTIE--ATKN-------------TLSGATLSDDLER---FK-SVVNGR--------------- 120 (502) T ss_pred CCCccceEEEEecccccccee--echh-------------hhhhhhhHHhHHH---hh-hhcCce--------------- Confidence 32 2378888643211000 0000 0000000000000 00 000000 Q ss_pred cccccceEEEEeeccccccccccccceeeeeecccCCCceeeeeeeeeccccccccccccceeeeeeccccceeeecCcc Q lcl|NC_016163. 154 ENYNGYGFRLSLRSDYDNTYNFRTYNLSVTVKDSTGADVVVEGPYIVSFDPEAKDKSRQSIYYANIINKYSQYVEIVDNR 233 (590) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~l~i~v~d~~~~~~v~e~~~~ls~~~da~~~~~~~~~~~~vv~~~s~~v~~~~~~ 233 (590) +.+..+.. .......+++- .+....+... ..+.-...... +....+. .. T Consensus 121 ---------l~i~i~g~-~~t~~~i~lS~----~ts~~~vA~~-------i~~~l~~~~~~-~tv~~d~---------~~ 169 (502) T protein:vir:52 121 ---------FSLTIGGD-VKKVDGLSFAR----LADFNAVATK-------IQEKLTTLSVA-VSIAYDE---------TG 169 (502) T ss_pred ---------eEEEecce-eeeeecccccc----ccchhHHHHH-------HHhhhcccccc-eEEEEec---------CC Confidence 00000000 00000000000 0000000000 00000000000 0000000 00 Q ss_pred ccceeeeeecccccccCccccceecccccccccccccccccccceeeccccc-ccccccccccccceeeeeccccccccc Q lcl|NC_016163. 234 SAFETISEFVVGDSEADPQKVDIIFGQERAVTPAETIHANVVWKSSSVETDD-PSYDATAANFNNIQYLTEGSEGTWTGG 312 (590) Q Consensus 234 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~ 312 (590) ......+... ............. +...+++. ........ .....+..... T Consensus 170 ~~F~i~s~tt-----g~~~~~~~~~a~~----------------~~~~gt~~a~~l~l~~~--~~av~v~~~~~------ 220 (502) T protein:vir:52 170 NRFIVSANVA-----GEDKKTEIDYAID----------------EGGEGEYIGALLKLENG--QASRKVGKNSV------ 220 (502) T ss_pred ceEEEEeccC-----CCcceeEEEEeec----------------CCcchhHHHHHhccccc--cceeeeeeecc------ Confidence 0000000000 0000000000000 00000000 00000000 00000000000 Q ss_pred cccceeccchhhHHHHHHHhhhc-cCCceeeecccchhHHHHHHHHHHHHhcCCeEEEEecCCCC---CHHHHH-HHHHh Q lcl|NC_016163. 
313 NEESALLVKGYSGVLAPEILDKQ-QYEIDVLLDGNNEVAVKNAMSDLCSEQRGDCIAILDCSFQG---DAQQTI-DYRTG 387 (590) Q Consensus 313 ~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~a~~d~p~~~---~~~~~~-~~~~~ 387 (590) + ...+....++..+.+.. .+.. +++......+...++..+++. +...|.+....... ...+.. ..+. T Consensus 221 --g---~~aet~~~al~a~~~~~~~w~~-~~~a~~~~~~~~la~a~~iea-~~~~f~~~~~d~~~~~~~~~~i~~~l~a- 292 (502) T protein:vir:52 221 --S---LKKETLGEALFNVAEVNNTWYG-FTVAAQLTDSEVEAAAKYAQA-NTKLFGANVIRAEQIEWSADNIYKKLYD- 292 (502) T ss_pred --c---ccccCHHHHHHHHHhccCceEE-EEEeecCChhHHHHHHHHHhh-cCcEEEEEecCcceeccccchHHHHHHh- Confidence 0 00011111222222211 1221 222222233444455666654 34455543221111 111111 1111 Q ss_pred hcCcccceEEEEcCeEEEeecccCceeeecHHHHHHHHHHHhhccCC-ceECcCCcccceeeccccceeecChhHHhhhh Q lcl|NC_016163. 388 NISMSTYFTAIFGQHMNVYDEYNGETITVTSTYFLASMIPSNDDQNG-IQWTFVGPRRGVISGFTDINFYPNEPWKEKLY 466 (590) Q Consensus 388 ~~~~~s~~~~~~~p~~~~~d~~~~~~~~~ppsg~~AG~~A~~D~~~G-~~~sPan~~~~~i~g~~~~~~~~~~~e~~~Ln 466 (590) .+++ .-..+|++ .+ --|.+.+.|.+|.+|-.+- -...-.+ +++.|+..- .+++.|.+.|. T Consensus 293 -~~~~-~t~~~y~~-------~~-----~~~~aa~~g~~as~~f~~~~g~iT~~f---k~l~GV~~~--~lt~t~~~al~ 353 (502) T protein:vir:52 293 -AGLD-HTLAMFDK-------ND-----MYPVSSALARLLSTNFAANNSTLTLKF---KQQPTITAD--EITATEFAKAK 353 (502) T ss_pred -ccCc-eeEEEecC-------Cc-----chhHHHHHHHHHhcCCCcCcceeeecc---cccCCcccC--cCCHHHHHHHH Confidence 1111 11233332 11 1256777888888874331 1111223 235555432 47899999999 Q ss_pred hcCceEEEEecCCeEEEecceecCCCcccceehhhhHHHHHHHHHHHHHHHHhcC-----CCCHHHHHHHHHHHHHHHHH Q lcl|NC_016163. 467 LAQVNYIERDPKKISFATQLTSQTSRSALSYINNVRVLLRIRREVEKMMADYRQE-----FQDNTTYDSMSYSLNNYLQQ 541 (590) Q Consensus 467 ~~gIn~i~~~~~~G~~~wG~rT~s~d~~~~~i~vrR~~~~i~~si~~~~~~~vfe-----pn~~~l~~~v~~~i~~~L~~ 541 (590) .+++|++.++.+.++ +....+++++ ||-+.+-.+||+..|++.+...++. 
|-|+.=...|+..|+.-|++ T Consensus 354 ~~~~N~y~~~~~~~~-~~~G~~~~G~----~iD~~~~~~Wl~~~lq~~l~~~L~~s~~kIPy~~~G~~~l~a~i~~~l~~ 428 (502) T protein:vir:52 354 RLGINVYTYFDDVAM-IAEGTVIGGK----FADEIVILDWFVDAVQKEVFARLYKSPTKIPLTDKGQAILIAAVEKVCLE 428 (502) T ss_pred hcCceEEEEecCeeE-EecCeeeCCc----hhhHHHHHHHHHHHHHHHHHHHHHhcCCCcccChhHHHHHHHHHHHHHHH Confidence 999999999876654 4567888873 8889999999999999988876663 45666688899999999999 Q ss_pred HHhCCceEE--------------------E---e-cCCCCCHHHhhCCEE-EEEEEEEecCccceEEEEEEeeC Q lcl|NC_016163. 542 WVANRACSS--------------------I---S-GTVYASDYDKQQSIA-RVKVELVFTGVIERIAIDLVVNK 590 (590) Q Consensus 542 l~~~ga~~~--------------------~---~-d~~~nt~~~i~~G~l-~~~i~~ap~~paefi~~~~~~~~ 590 (590) -.++|.|.. + . ..+..+++|+.+.++ -+.+.+.+..-+++|.|.+.++| T Consensus 429 a~~~G~I~~G~~~~~~~g~~~~~d~~~~gy~v~~~~~~~~s~~dr~~R~~~~~~~~~~~aGaIh~v~i~~nv~~ 502 (502) T protein:vir:52 429 GINNGAFAPGKWTGAGFGNLSTGDYLDKGFYVWAAPMDTLSDSDRQARRATPIQTAVKLAGAIHSSDVIVNYNR 502 (502) T ss_pred HHhcCccccccccCcccceeeecccccCceEEEeCchhhCCHHHHHcccCCCeEEEEEECceEEEEEEEEEEeC Confidence 999998742 1 1 245788999999988 79999999999999999999999 No 60 >protein:vir:95263 Length: 450 # NCBI annotation: Phage conserved protein # Family: family:all:396 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944896;genbank:gi:38707836;genbank:GeneID:2744049 Probab=96.51 E-value=0.00055 Score=38.44 Aligned_cols=391 Identities=12% Similarity=0.032 Sum_probs=151.6 Q ss_pred ccccceEEEeeccccCCcceeeEeecccccccccceEEEEeecccccccccccccee--eeeecccCCCceeeeeeeeec Q lcl|NC_016163. 125 ASKNAMKTILSGGTAGETPLCFIVPKGRGENYNGYGFRLSLRSDYDNTYNFRTYNLS--VTVKDSTGADVVVEGPYIVSF 202 (590) Q Consensus 125 a~~~~~~~~~~~~t~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~l~--i~v~d~~~~~~v~e~~~~ls~ 202 (590) -+.+++++...-.+.+-.... +...+-.. ...........+. -.|.+.-+.+. 
T Consensus 1 ~~s~iVnV~i~~~~~a~~~~~-------------f~~~l~~~--~~~~~~~r~~~yss~~~V~~~FG~~S---------- 55 (450) T protein:vir:95 1 MWNPIVNVDITLNTAGTTREG-------------FGLPLFLA--STDNFEERVRGYTSLTEVAEDFDENT---------- 55 (450) T ss_pred CCCceEEEeeccccccccccc-------------ceeEEEEc--CCCCCccceeeecCHHHHHHhcCCCc---------- Confidence 455555554433322211100 00000000 0000000000000 00001000000 Q ss_pred cccccccccccceeeeeeccccceeeecCc-----------cccceeeeeecccccccCccccceecccccccccccccc Q lcl|NC_016163. 203 DPEAKDKSRQSIYYANIINKYSQYVEIVDN-----------RSAFETISEFVVGDSEADPQKVDIIFGQERAVTPAETIH 271 (590) Q Consensus 203 ~~da~~~~~~~~~~~~vv~~~s~~v~~~~~-----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 271 (590) .....+..|+.-.-.....++..-.. ............+. ......+.+........... T Consensus 56 ----~ey~aA~~yF~q~p~p~~l~igr~~~~~t~~~~~~~~~~~~g~lt~tv~G~---~~~~~~i~~s~a~s~~~va~-- 126 (450) T protein:vir:95 56 ----AAYKAAKQLWSQTPKVTQLYIGRRAMQYTVSIPDAVTESTDYSITVAAGGG---ISQPYQYTAQSSDTAENVLQ-- 126 (450) T ss_pred ----HHHHHHHHHHhCCCcccEEEEEeeccchhhhhhhhhccccceeEEEEecce---eeeeeEEEEEecCChhhHHH-- Confidence 00000001111000000000000000 00000000000000 00000000000000000000 Q ss_pred cccccceeecccccccccccccccccceeeeeccccccccccccceeccchhhHHHHHHHhhhccCCceeeecccchhHH Q lcl|NC_016163. 272 ANVVWKSSSVETDDPSYDATAANFNNIQYLTEGSEGTWTGGNEESALLVKGYSGVLAPEILDKQQYEIDVLLDGNNEVAV 351 (590) Q Consensus 272 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 351 (590) ........ ..... +.......+.....+.+ ..... .. .+.... .....+...+..... T Consensus 127 --~~~tai~~--~~~~~-----~~~~~~s~g~~~~~t~~-----~~~~~------~~-~~~~l~-~~~~~~~~~g~~aet 184 (450) T protein:vir:95 127 --QFKTQIEA--DPTIK-----DKVSVNVTGSNGSATMI-----IAKAG------DN-DFVKVT-TTAQTVYIASTTADT 184 (450) T ss_pred --Hhhhhhcc--cceee-----eeeeeeeecccceeeee-----eeccc------cc-hhhccc-cccceeEeccccccc Confidence 00000000 00000 00000000000000000 00000 00 000000 001111111111111 Q ss_pred HHHHHHHHHHhcCCeEEEEec-CCCCCHHHHHHHHHhhcCcccceEEEEcCe-E-------------------------- Q lcl|NC_016163. 
352 KNAMSDLCSEQRGDCIAILDC-SFQGDAQQTIDYRTGNISMSTYFTAIFGQH-M-------------------------- 403 (590) Q Consensus 352 ~~a~~~~~~~~~~~~~a~~d~-p~~~~~~~~~~~~~~~~~~~s~~~~~~~p~-~-------------------------- 403 (590) ...++..|.+...+.+.+.-. +...+..++.+|.+.. +. ...|.+| . T Consensus 185 ~~~a~~a~~~~~~~w~~~~~~~~~~~~i~a~a~w~~a~----~~-~f~~~~~~~~~~~~~~~~~~~~i~~~l~~~~~~~t 259 (450) T protein:vir:95 185 ASTALAAIEAYSTDWYFIAAEDRTQQFVLAMASEIQAR----KK-IFFTANSDVTALQGTELASANDVPAQLAKNMYTRT 259 (450) T ss_pred HHHHHHHHHHhhCCeEEEEecCCCHHHHHHHHHHHhhc----Cc-EEEEEcCCchhhhhhhhhcccchHHHHHhccCCee Confidence 222222232222222211110 0001111222232211 00 0011010 0 Q ss_pred -EEeecccCceeeecHHHHHHHHHHHhhccCCceECcCCcccceeecccc-----ceeecChhHHhhhhhcCceEEEEec Q lcl|NC_016163. 404 -NVYDEYNGETITVTSTYFLASMIPSNDDQNGIQWTFVGPRRGVISGFTD-----INFYPNEPWKEKLYLAQVNYIERDP 477 (590) Q Consensus 404 -~~~d~~~~~~~~~ppsg~~AG~~A~~D~~~G~~~sPan~~~~~i~g~~~-----~~~~~~~~e~~~Ln~~gIn~i~~~~ 477 (590) -+|++.+.. -.|.+.++|...-.+.-+=-|+ ++ .+.|+.. ....++..|.+.|..+++|++.++. T Consensus 260 ~~~y~~~~~~---~~~~aa~~g~~~~~~~g~~T~~---fk---~l~Gv~~~v~~~~~~~lt~~~~~al~~~~~n~y~~~~ 330 (450) T protein:vir:95 260 VCLWHHAAAE---DYPEMAYIAYGAPYDAGSIAWG---NA---QLTGVAASLQPSNQRPLTSIQKSALDVRHCNFIDLDG 330 (450) T ss_pred EEEeeCCCch---hHHHHHHHHHhhhcccceeeec---cc---cccceeeeccCccccccchHHHHHHHhCCcEEEEEec Confidence 111111111 1245555655544333221232 33 2344432 1236889999999999999999987 Q ss_pred CCeEEEecceecCCCcccceehhhhHHHHHHHHHHHHHHHHhc------CCCCHHHHHHHHHHHHHHHHHHHhCCceEEE Q lcl|NC_016163. 478 KKISFATQLTSQTSRSALSYINNVRVLLRIRREVEKMMADYRQ------EFQDNTTYDSMSYSLNNYLQQWVANRACSSI 551 (590) Q Consensus 478 ~~G~~~wG~rT~s~d~~~~~i~vrR~~~~i~~si~~~~~~~vf------epn~~~l~~~v~~~i~~~L~~l~~~ga~~~~ 551 (590) +.++ ++...+++++ ||-++|-.+||+..|++.+...+. 
=|-|+.-...|+..|+.-|++..++|.|.++ T Consensus 331 ~~~~-~~~G~~~~G~----~iD~~~~~~wl~~~iq~~l~~ll~~~~~~KiPy~~~G~~~i~a~i~~~l~~a~~~G~Ia~~ 405 (450) T protein:vir:95 331 GVPV-VRRGITSGGE----WIDIIRGVDWLESDLKTSLRDLLINQKGGKITYDDTGITRIRQVIETSLQRAVNRNFLSSY 405 (450) T ss_pred Ccee-eeCCeeeCcc----hhHHHHHHHHHHHHHHHHHHHHHHhcCCCCCccChhhHHHHHHHHHHHHHHHHhcCcccce Confidence 7764 6777888873 889999999999999999887652 2667777888999999999999999988764 Q ss_pred ----ecCCCCCHHHhhCCEEE-EEEEEEecCccceEEEEEEeeC Q lcl|NC_016163. 552 ----SGTVYASDYDKQQSIAR-VKVELVFTGVIERIAIDLVVNK 590 (590) Q Consensus 552 ----~d~~~nt~~~i~~G~l~-~~i~~ap~~paefi~~~~~~~~ 590 (590) -+.+..+++|+.+.++. +.+.+.....+.++.|++.++= T Consensus 406 ~V~~~~~~~~~~~dr~~R~~~~i~~~~~laGAIh~~~i~~~v~~ 449 (450) T protein:vir:95 406 TVNVPKASQVALADKKARILKDVTFAGILAGAILDVDLKGTVAY 449 (450) T ss_pred eEecCChHhcCHHHHhccCCCCeeEEEEEccceEEEEEEEEEEe Confidence 25677899999998875 8899999999999999999998 No 61 >protein:vir:101576 Length: 501 # NCBI annotation: gp04 # Family: family:all:396 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958108;genbank:gi:41057654;genbank:GeneID:2716834 Probab=72.81 E-value=0.18 Score=24.69 Aligned_cols=441 Identities=10% Similarity=-0.009 Sum_probs=171.0 Q ss_pred Ccc-ccCCceEEEEecCCCceeccccc-ceeEEEeecCCCCCCccEEecCHHHHHHhcCCccccccccHHHHHHHHHc-- Q lcl|NC_016163. 1 MAD-YLHPSVSSRIVDNSAVYATAAGN-TVLYAAIHSAIGRDNAVEFVTTTDEFLFKFGNPNLSKYGQTSYNILNWLQ-- 76 (590) Q Consensus 1 Mp~-yl~PGVYveEi~s~~~~i~gv~T-sv~~~vg~~~~Gp~~~p~~v~s~~e~~~~fG~~~~~~~~~l~~av~~ff~-- 76 (590) ||. =|.=--||.-.++. -+-.+..- +.+-+.+....=|+++.....|..|-...||.- ++-..+++.||. 
T Consensus 1 m~~~~ip~s~iV~V~~~v-~~~~~~~~~~~~l~l~~~~~~~~~~~~~~~s~~~V~~~FG~~-----S~ey~aA~~yFsg~ 74 (501) T protein:vir:10 1 MPTTTIPIDQIVQMLPGV-IGAGGAPGRLTGLVLTQDTSVQPGQLADFFQETDVENWFGAL-----SNEAKIADAYFPGI 74 (501) T ss_pred CCCCCcccceEEEEeeec-ccCCCccccceeEEEeccCCCCccceEEecCHHHHHHhcCCC-----hHHHHHHHHHhhhh Confidence 993 24445566644432 22222222 456666666666789889999999999999952 334456677775 Q ss_pred -CC----CcEEEEEEecCCccccccccccceeecccccccceeeeeeccccccccccceEEEeeccccCCcceeeEeecc Q lcl|NC_016163. 77 -SG----GTAYVLRVMPDDAKFANSLISIKTTAAADPAKATVLVTAKAQTTNTASKNAMKTILSGGTAGETPLCFIVPKG 151 (590) Q Consensus 77 -nG----G~~~vvRv~~~~a~~a~~~~~~~~~~a~~~~~~~~~v~~~~~~~~~a~~~~~~~~~~~~t~~~~~~~~~~~~~ 151 (590) |- +++||.|-..+... + .+.. +..... T Consensus 75 ~~q~p~P~~l~igR~~~~~~~-~-------------------~l~g-------------------~~l~~~--------- 106 (501) T protein:vir:10 75 VNGGQLPYDLKFARYVAADAP-A-------------------SVYG-------------------IPLTGV--------- 106 (501) T ss_pred cCCCccccEEEEEeecCCCcc-c-------------------eEec-------------------cchhhh--------- Confidence 43 34888886421100 0 0000 000000 Q ss_pred cccccccceEEEEeeccccccccccccceeeeeecccCCCceeeeeeeeeccccccccccccceeeeeeccccceeeecC Q lcl|NC_016163. 152 RGENYNGYGFRLSLRSDYDNTYNFRTYNLSVTVKDSTGADVVVEGPYIVSFDPEAKDKSRQSIYYANIINKYSQYVEIVD 231 (590) Q Consensus 152 ~g~~~~~~~~~~~~~~~~~~~~~~~~~~l~i~v~d~~~~~~v~e~~~~ls~~~da~~~~~~~~~~~~vv~~~s~~v~~~~ 231 (590) .... +. .....+.+.+ + +..+ . .-.++|... +..........-+...+..+.. + T Consensus 107 ~la~-------~~----------~~sg~l~vti-~--g~~~-~-~~i~ls~at---s~~~vAs~i~~al~~~~~tv~~-d 160 (501) T protein:vir:10 107 TLAQ-------LQ----------GYSGTLTVTT-A--AQHV-S-ANISLAAAT---SFANAATLIEAAFTSPDFVVAY-D 160 (501) T ss_pred hhhh-------cc----------eeeeEEEEee-c--ccee-e-ccccccccc---CHHHHHHHHhhhccCCceEEEE-c Confidence 0000 00 0000011110 1 0000 0 111111110 0000000000000000000000 0 Q ss_pred ccccceeeeeecccccccCccccceecccccccccccccccccccceeecccccccccccccccccceeeeecccccccc Q lcl|NC_016163. 
232 NRSAFETISEFVVGDSEADPQKVDIIFGQERAVTPAETIHANVVWKSSSVETDDPSYDATAANFNNIQYLTEGSEGTWTG 311 (590) Q Consensus 232 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 311 (590) .....+.+.....+. ........ .++ +......+......... T Consensus 161 -----------------~~~~~f~its~ttG~--------~~~i~~~~-~~~----------~la~~l~Lt~~~~a~v~- 203 (501) T protein:vir:10 161 -----------------ALRNRFTVVTNATGT--------AAAISAVT-GTN----------NLADELGLSAAAGATLQ- 203 (501) T ss_pred -----------------ccCceEEEEeeccCC--------ceeEEEee-Cch----------hhhhhcCccccccceEE- Confidence 000000000000000 00000000 000 00000000000000000 Q ss_pred ccccceeccchhhHHHHHHHhhhc-cCCceeeecccchhHHHHHHHHHHHHhcCCeEEEEecCCCCC------HHHHHHH Q lcl|NC_016163. 312 GNEESALLVKGYSGVLAPEILDKQ-QYEIDVLLDGNNEVAVKNAMSDLCSEQRGDCIAILDCSFQGD------AQQTIDY 384 (590) Q Consensus 312 ~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~a~~d~p~~~~------~~~~~~~ 384 (590) ......+....++..+.... .+- .+........+...++..+++... ..|.+.-...... ..+.... T Consensus 204 ----~~g~~aet~~~a~~a~~~~~~~Wy-~f~~a~~~~~~~~la~A~wiea~~-~~f~~~~~~~~~~~~~~~~~~~i~~~ 277 (501) T protein:vir:10 204 ----AAGVAADTPASAMNRAVGLSRNWA-TFTTAWTAVIADRLAFAAWNSGQA-YKYMYVAPDLEAASIVTNNAASFGAQ 277 (501) T ss_pred ----ecCcccccHHHHHHHHHhccCceE-EEEEecCCChHHHHHHHHHHHhcC-ceEEEEEecCchhhhhhhhhhhHHHH Confidence 00000111111222222211 121 111122223344445555555432 2332221111100 0111111 Q ss_pred HHhhcCcccceEEEEcCeEEEeecccCceeeecHHHHHHHHHHHhhccC--C--ceECcCCcccceeeccccce-eecCh Q lcl|NC_016163. 385 RTGNISMSTYFTAIFGQHMNVYDEYNGETITVTSTYFLASMIPSNDDQN--G--IQWTFVGPRRGVISGFTDIN-FYPNE 459 (590) Q Consensus 385 ~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~ppsg~~AG~~A~~D~~~--G--~~~sPan~~~~~i~g~~~~~-~~~~~ 459 (590) ... . ...+....|+. -.+.+.+.|.+|.+|-++ | -|| .+ ++.+ ++. 
-.+++ T Consensus 278 l~~-~--~y~~t~~~y~~-------------~~~~aa~~g~~as~nf~~~~g~~T~~---fk---q~~~--Gi~a~~lt~ 333 (501) T protein:vir:10 278 VFA-A--PYQGTLPLYGD-------------QATAGAVMGYAASINFQLRNGRTVLA---FR---QFNA--GVPATAHDL 333 (501) T ss_pred HHh-c--CCCceEEECCC-------------CcHHHHHHHHHHhhCcccCccceeee---cc---ccCC--CcCcccCCH Confidence 111 1 11223322221 125677788888887543 2 121 11 1111 121 24789 Q ss_pred hHHhhhhhcCceEEEEecCCe--EEEecceecCCCcccceehhhhHHHHHHHHHHHHHHHHhcC----CCCHHHHHHHHH Q lcl|NC_016163. 460 PWKEKLYLAQVNYIERDPKKI--SFATQLTSQTSRSALSYINNVRVLLRIRREVEKMMADYRQE----FQDNTTYDSMSY 533 (590) Q Consensus 460 ~e~~~Ln~~gIn~i~~~~~~G--~~~wG~rT~s~d~~~~~i~vrR~~~~i~~si~~~~~~~vfe----pn~~~l~~~v~~ 533 (590) .|.+.|..+|+|+...+.+.+ +.+|-.-+++++ |.+|.+-+-.+|++..++..+....-. |-|..=...|+. T Consensus 334 t~a~al~~~~~N~y~~~~~~~~~~~~~~~G~~sG~--~~wiD~~~~~~Wl~~~iq~~l~~ll~~~~kIPyt~~G~~~l~a 411 (501) T protein:vir:10 334 PTANALRSNNYTYIGAYANAANNYTIAYDGKLSGK--FLWVDTYLDQIYLNAELQRAEFEAMLAYNSLPYNEDGYTALYR 411 (501) T ss_pred HHHHHHHhcCCeEEEEeccccceeeEEecCeeecc--ceeehhhhhHHHHHHHHHHHHHHHHHhcCCcccCHHHHHHHHH Confidence 999999999999999986544 677844455665 555666666666666666666543322 667888888999 Q ss_pred HHHHHHHHHHhCCceEEEecC---------------------------------CCCCHHHhhCCEEEEEEEEEecCccc Q lcl|NC_016163. 534 SLNNYLQQWVANRACSSISGT---------------------------------VYASDYDKQQSIARVKVELVFTGVIE 580 (590) Q Consensus 534 ~i~~~L~~l~~~ga~~~~~d~---------------------------------~~nt~~~i~~G~l~~~i~~ap~~pae 580 (590) .|+.-|++-+++|.|...-+. ++.+++.-.++...+.+.++--..+. T Consensus 412 ~v~~~l~~av~nG~I~~Gv~l~~~q~~~i~~~~g~~~~~~~v~~~Gyy~~~~~~~~~~~~R~~R~~p~~~~~y~~~gaIh 491 (501) T protein:vir:10 412 AGVDVIDAAVTSGIIRAGVTLTNSQLQQIDAAAGVAGAGQLVQTRGWYFLIGDPANPGQARQNRTTPACTLWYSDGGSIQ 491 (501) T ss_pred HHHHHHHHHHhCceeecCCCCCcccceeeccccCccccccceeccceeEeeccccCChhhhhhccccceEEEEEeCCcee Confidence 999999999999988542110 01111111122222333333333333 Q ss_pred eEEEEEEeeC Q lcl|NC_016163. 
581 RIAIDLVVNK 590 (590) Q Consensus 581 fi~~~~~~~~ 590 (590) +|.+-..--= T Consensus 492 ~v~i~s~~v~ 501 (501) T protein:vir:10 492 QLTIGSNAVI 501 (501) T ss_pred EEEeeeeecC Confidence 3322110000 No 62 >protein:vir:3636 Length: 501 # NCBI annotation: gp04 # Family: family:all:396 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705631;genbank:gi:23752316;genbank:GeneID:955753 Probab=60.70 E-value=0.37 Score=22.97 Aligned_cols=441 Identities=11% Similarity=0.020 Sum_probs=175.5 Q ss_pred Ccc-ccCCceEEEEecCCCceeccccc-ceeEEEeecCCCCCCccEEecCHHHHHHhcCCccccccccHHHHHHHHHc-- Q lcl|NC_016163. 1 MAD-YLHPSVSSRIVDNSAVYATAAGN-TVLYAAIHSAIGRDNAVEFVTTTDEFLFKFGNPNLSKYGQTSYNILNWLQ-- 76 (590) Q Consensus 1 Mp~-yl~PGVYveEi~s~~~~i~gv~T-sv~~~vg~~~~Gp~~~p~~v~s~~e~~~~fG~~~~~~~~~l~~av~~ff~-- 76 (590) ||. =|.=--||.-.++ .-+-.+..- +.+.+.+....=|+++.+...|.+|-...||.- ++-..+++.||. T Consensus 1 m~~~~ip~s~iV~V~~~-v~~~~~~~~~~~~lllt~~~~~~~~r~~~y~s~~~V~~~FG~~-----S~ey~aA~~yFs~~ 74 (501) T protein:vir:36 1 MPTTTIPIDQIVQMLPG-VIGAGGAPGRLTGLVLTQDTSVQPGQLADFFQETDVENWFGAL-----SNEAKIADAYFPGI 74 (501) T ss_pred CCcCCcccceEEEEeee-eccCCCcceeeeeEEEeccCCCCCcceeeecCHHHHHHhcCCC-----hHHHHHHHHHhhcc Confidence 994 1333455554432 222222222 445555555556778888888999999999952 344557778886 Q ss_pred -CCC----cEEEEEEecCCccccccccccceeecccccccceeeeeeccccccccccceEEEeeccccCCcceeeEeecc Q lcl|NC_016163. 77 -SGG----TAYVLRVMPDDAKFANSLISIKTTAAADPAKATVLVTAKAQTTNTASKNAMKTILSGGTAGETPLCFIVPKG 151 (590) Q Consensus 77 -nGG----~~~vvRv~~~~a~~a~~~~~~~~~~a~~~~~~~~~v~~~~~~~~~a~~~~~~~~~~~~t~~~~~~~~~~~~~ 151 (590) |-. ++||.|-..+... + .+...-.... +.+. . 
T Consensus 75 ~~q~~~P~~l~igR~~~~a~~-~-------------------~l~g~~l~~~--------------~~a~--~------- 111 (501) T protein:vir:36 75 VNGGQLPYDLKFARYVAADAP-A-------------------SVYGIPLTGV--------------TLAQ--L------- 111 (501) T ss_pred cCCCccccEEEEEeecCcCcc-e-------------------eEeccchhhh--------------hhhh--c------- Confidence 433 4899987432110 0 0000000000 0000 0 Q ss_pred cccccccceEEEEeeccccccccccccceeeeeecccCCCceeeeeeeeeccccccccccccceeeeeeccccceeeecC Q lcl|NC_016163. 152 RGENYNGYGFRLSLRSDYDNTYNFRTYNLSVTVKDSTGADVVVEGPYIVSFDPEAKDKSRQSIYYANIINKYSQYVEIVD 231 (590) Q Consensus 152 ~g~~~~~~~~~~~~~~~~~~~~~~~~~~l~i~v~d~~~~~~v~e~~~~ls~~~da~~~~~~~~~~~~vv~~~s~~v~~~~ 231 (590) + .....+.+..+. ..+ . .-.++|..... .........-+...+..+. .+ T Consensus 112 ~-----~~sg~l~vti~g--------------------~~~-~-~~i~lS~~ts~---~~vA~~i~~al~~~~~tv~-~d 160 (501) T protein:vir:36 112 Q-----GYSGTLTVTTAA--------------------QHV-S-ANISLAAATSF---ANAATLIEAAFTSPDFVVA-YD 160 (501) T ss_pred c-----ceeEEEEEEecc--------------------eee-e-eecccccccCH---HHHHHHHhhhhcCcceEEE-Ec Confidence 0 000000000000 000 0 00011100000 0000000000000000000 01 Q ss_pred ccccceeeeeecccccccCccccceecccccccccccccccccccceeecccccccccccccccccceeeeecccccccc Q lcl|NC_016163. 232 NRSAFETISEFVVGDSEADPQKVDIIFGQERAVTPAETIHANVVWKSSSVETDDPSYDATAANFNNIQYLTEGSEGTWTG 311 (590) Q Consensus 232 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 311 (590) .......+... ... ....+.... .+++ ......+......... T Consensus 161 ~~~~~f~i~s~-----t~G-~~~~i~~~t--------------------~~~~----------ia~~l~Lt~~~~a~v~- 203 (501) T protein:vir:36 161 ALRNRFTVVTN-----ATG-TAAAISAVT--------------------GTNN----------FADEIGLSAAAGATLQ- 203 (501) T ss_pred CcceeEEEEec-----cCC-cceeeEeee--------------------cccc----------hhhhhcccccCcceEE- Confidence 00000000000 000 000000000 0000 0000000000000000 Q ss_pred ccccceeccchhhHHHHHHHhhhccCCceeeecccchhHHHHHHHHHHHHhcCCeEEEEecCCCC------CHHHHHHHH Q lcl|NC_016163. 
312 GNEESALLVKGYSGVLAPEILDKQQYEIDVLLDGNNEVAVKNAMSDLCSEQRGDCIAILDCSFQG------DAQQTIDYR 385 (590) Q Consensus 312 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~a~~d~p~~~------~~~~~~~~~ 385 (590) ......+....++..+.+....--.+........+...++..+++.. ...|.+.-..... ...+..... T Consensus 204 ----~~g~~~et~~~al~a~~~~s~~Wy~f~~a~~~~~~~~la~A~wiea~-~~~f~~~~~~~~~~~~~~~~~~~i~~~l 278 (501) T protein:vir:36 204 ----AAGVAADTPASAMNRAVGLSRNWATFTTAWTAVIADRLAFASWNSGQ-AYKYMYVAPDLEAASIVSNNAASFGAQV 278 (501) T ss_pred ----ecccccccHHHHHHHHHhccCceEEEEEecCCChHHHHHHHHHHhhc-CceEEEEEecCchhhhhccchhhHHHHH Confidence 00000011111222222221111112222333334444555665543 2333332111110 111121111 Q ss_pred HhhcCcccceEE-EEcCeEEEeecccCceeeecHHHHHHHHHHHhhccC--C--ceECcCCcccceee-ccccceeecCh Q lcl|NC_016163. 386 TGNISMSTYFTA-IFGQHMNVYDEYNGETITVTSTYFLASMIPSNDDQN--G--IQWTFVGPRRGVIS-GFTDINFYPNE 459 (590) Q Consensus 386 ~~~~~~~s~~~~-~~~p~~~~~d~~~~~~~~~ppsg~~AG~~A~~D~~~--G--~~~sPan~~~~~i~-g~~~~~~~~~~ 459 (590) ... ...+.. +|++ -.|.+++.|..|.+|-++ | -|| ++ ++. |+.. -.+++ T Consensus 279 ~~~---~y~~t~~~y~~--------------~~~~aa~~g~~as~nf~~~~g~~T~~---fk---q~~~Gi~a--~~l~~ 333 (501) T protein:vir:36 279 FAA---PYQGTLPLYGD--------------QATAGAVMGYAASINFQLRNGRTVLA---FR---QFNAGVPA--TVHDL 333 (501) T ss_pred Hhc---CCCcEEEEcCC--------------CCHHHHHHHHHHhcCcccCcceeeee---cc---ccCCCcCc--CcCCH Confidence 111 122222 2321 125667788888887543 2 121 11 111 2211 24688 Q ss_pred hHHhhhhhcCceEEEEecCCe--EEEecceecCCCcccceehhhhHHHHHHHHHHHHHHHHhcC----CCCHHHHHHHHH Q lcl|NC_016163. 460 PWKEKLYLAQVNYIERDPKKI--SFATQLTSQTSRSALSYINNVRVLLRIRREVEKMMADYRQE----FQDNTTYDSMSY 533 (590) Q Consensus 460 ~e~~~Ln~~gIn~i~~~~~~G--~~~wG~rT~s~d~~~~~i~vrR~~~~i~~si~~~~~~~vfe----pn~~~l~~~v~~ 533 (590) .|.+.|..+|+|++..|.+.+ +.+|-.=+++++ |.||.+.+-.+||+..++..+....-. |-|..=...|+. 
T Consensus 334 t~a~al~~~~~N~y~~~~~~~~~~~~~~~G~~sG~--~~wiD~~~g~dWL~~~iq~~l~~ll~~~~KIPytd~G~~~l~a 411 (501) T protein:vir:36 334 PTANALRSNNYTYIGAYANAANNYTIAYDGKLSGK--FLWVDTYLDQIYLNAELQRAEFEAMLAYNSLPYNEDGYTGLYR 411 (501) T ss_pred HHHHHHHhcCCcEEEEEecccceeeEEEcCeeecc--chhhhHHHhHHHHHHHHHHHHHHHHhcCCCCccChhhHHHHHH Confidence 999999999999998886544 777744456665 556788888888888888888765433 567777888999 Q ss_pred HHHHHHHHHHhCCceEEEecCC---------------------------------CCCHHHhhCCEEEEEEEEEecCccc Q lcl|NC_016163. 534 SLNNYLQQWVANRACSSISGTV---------------------------------YASDYDKQQSIARVKVELVFTGVIE 580 (590) Q Consensus 534 ~i~~~L~~l~~~ga~~~~~d~~---------------------------------~nt~~~i~~G~l~~~i~~ap~~pae 580 (590) .|+.-|++-.++|.|...-+.+ +.+++.-.+....+.+.++--..+. T Consensus 412 ~i~~~l~~av~nG~I~~Gv~~~~~q~~~I~~~~g~~~~~~~v~~~Gyy~~~~~~~~~~~~R~~R~~p~~~~~y~~~gaIh 491 (501) T protein:vir:36 412 AGVDVIDAAVTSGIIRAGVTLTNSQLQQIDKAARVAGAGQLVQTRGWYFLIGDPANPGQARQNRTTPACTLWYSDGGSIQ 491 (501) T ss_pred HHHHHHHHHHhCceeecCCCCCcccceeecccccccccccceeccceEEeeCcccCChhhhhhcccCcEEEEEEeCCcee Confidence 9999999999999885421110 1111111122223333333333333 Q ss_pred eEEEEEEeeC Q lcl|NC_016163. 581 RIAIDLVVNK 590 (590) Q Consensus 581 fi~~~~~~~~ 590 (590) +|.+-..--= T Consensus 492 ~v~i~s~~v~ 501 (501) T protein:vir:36 492 SLTIGSNAVI 501 (501) T ss_pred EEEeeeeeeC Confidence 3332110000 No 63 >protein:vir:106730 Length: 501 # NCBI annotation: gp04 # Family: family:all:396 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944312;genbank:gi:38638611;genbank:GeneID:2657359 Probab=58.45 E-value=0.41 Score=22.69 Aligned_cols=441 Identities=10% Similarity=0.008 Sum_probs=174.0 Q ss_pred Ccc-ccCCceEEEEecCCCceeccccc-ceeEEEeecCCCCCCccEEecCHHHHHHhcCCccccccccHHHHHHHHHc-- Q lcl|NC_016163. 1 MAD-YLHPSVSSRIVDNSAVYATAAGN-TVLYAAIHSAIGRDNAVEFVTTTDEFLFKFGNPNLSKYGQTSYNILNWLQ-- 76 (590) Q Consensus 1 Mp~-yl~PGVYveEi~s~~~~i~gv~T-sv~~~vg~~~~Gp~~~p~~v~s~~e~~~~fG~~~~~~~~~l~~av~~ff~-- 76 (590) ||. 
=|.=--||.-.++ .-+-.+..- +.+-+.+....=|+++.+...|.+|-...||-- ++-..+++.||. T Consensus 1 m~~~~ip~s~iV~V~~~-v~~~~~~~~~f~~lll~~~~~~~~~r~~~y~s~~~V~~~FG~~-----S~ey~aA~~yFsg~ 74 (501) T protein:vir:10 1 MPTTTIPIDQIVQMLPG-VIGAGGAPGRLTGLVLTQDTSVQPGQLADFFQKTDVENWFGAL-----SNEAKIADAYFPGI 74 (501) T ss_pred CCcCccccceEEEEeee-cccCCCcccccceEEEecccCCCccceeeecCHHHHHHhcCCC-----hHHHHHHHHHhhhh Confidence 994 2333455664433 222222222 445555555556778888899999999999952 344456777785 Q ss_pred -CC----CcEEEEEEecCCccccccccccceeecccccccceeeeeeccccccccccceEEEeeccccCCcceeeEeecc Q lcl|NC_016163. 77 -SG----GTAYVLRVMPDDAKFANSLISIKTTAAADPAKATVLVTAKAQTTNTASKNAMKTILSGGTAGETPLCFIVPKG 151 (590) Q Consensus 77 -nG----G~~~vvRv~~~~a~~a~~~~~~~~~~a~~~~~~~~~v~~~~~~~~~a~~~~~~~~~~~~t~~~~~~~~~~~~~ 151 (590) |- +++||.|-..+... + .+.....+.. + +..+ T Consensus 75 ~~q~p~P~~l~igR~~~~~~~-~-------------------~l~g~~l~~~--------------~-----la~~---- 111 (501) T protein:vir:10 75 VNGGQLPYDLKFARYVAADAP-A-------------------SVYGIPLTGI--------------T-----LAQL---- 111 (501) T ss_pred cCCCccccEEEEEeecccCcc-c-------------------eeeeceehhh--------------h-----hhhh---- Confidence 43 34899986432110 0 0000000000 0 0000 Q ss_pred cccccccceEEEEeeccccccccccccceeeeeecccCCCceeeeeeeeeccccccccccccceeeeeeccccceeeecC Q lcl|NC_016163. 152 RGENYNGYGFRLSLRSDYDNTYNFRTYNLSVTVKDSTGADVVVEGPYIVSFDPEAKDKSRQSIYYANIINKYSQYVEIVD 231 (590) Q Consensus 152 ~g~~~~~~~~~~~~~~~~~~~~~~~~~~l~i~v~d~~~~~~v~e~~~~ls~~~da~~~~~~~~~~~~vv~~~s~~v~~~~ 231 (590) +. .+ ..+.+..+ +... . .-.++|..... .........-+...+..+.. 
+ T Consensus 112 ~~--~~---g~l~i~i~--------------------g~~~-~-~~i~~s~ats~---~~vA~~i~~al~~~~~tv~~-d 160 (501) T protein:vir:10 112 QG--YS---GTLTVTTA--------------------AQHV-S-ANISLAAATSF---ANAATLIEAAFTSPDFVVAY-D 160 (501) T ss_pred hh--ee---eEEEEeec--------------------ccee-e-eccccccccCH---HHHHHHHHHhhcCCceEEEE-e Confidence 00 00 00000000 0000 0 00011111000 00000000000000000100 1 Q ss_pred ccccceeeeeecccccccCccccceecccccccccccccccccccceeecccccccccccccccccceeeeecccccccc Q lcl|NC_016163. 232 NRSAFETISEFVVGDSEADPQKVDIIFGQERAVTPAETIHANVVWKSSSVETDDPSYDATAANFNNIQYLTEGSEGTWTG 311 (590) Q Consensus 232 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 311 (590) ..... +.+.....+.. ..................+.. ...... T Consensus 161 ~~~~~-----------------f~i~~~t~G~~--------~~i~~~t~~~d~a~~l~Lt~~-----------~~a~v~- 203 (501) T protein:vir:10 161 ALRNR-----------------FTVVTNTTGTA--------AAISAVTGTNNLADELGLSAA-----------AGATLQ- 203 (501) T ss_pred cccce-----------------EEEEecccCcc--------eeEEEeeccccchhhhccccc-----------CceeEE- Confidence 00000 00000000000 000000000000000000000 000000 Q ss_pred ccccceeccchhhHHHHHHHhhhc-cCCceeeecccchhHHHHHHHHHHHHhcCCeEEEE--ecCCCC----CHHHHHHH Q lcl|NC_016163. 312 GNEESALLVKGYSGVLAPEILDKQ-QYEIDVLLDGNNEVAVKNAMSDLCSEQRGDCIAIL--DCSFQG----DAQQTIDY 384 (590) Q Consensus 312 ~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~a~~--d~p~~~----~~~~~~~~ 384 (590) ..+ ...+....++..+.+.. .+- .+........+...++..+++... ..|.+. |..... ...+.... T Consensus 204 ~~g----~~aet~~~Al~a~~~~~~~Wy-~f~~a~~~~~~~~la~A~wi~a~~-~~f~~~~~~~~~~~~~~~~~~~i~~~ 277 (501) T protein:vir:10 204 AAG----VAADTPASAMNRAVGLSRNWA-TFTTAWTAVIADRLAFAAWNSGQA-YKYMYVAPDLEAASIVTNNAASFGAQ 277 (501) T ss_pred ecC----cccccHHHHHHHHHhcccceE-EEEEEecCChHHHHHHHHHHHhcC-ceEEEEEecCcceeeecccchhHHHH Confidence 000 00011111222222211 111 111222223344445555554432 222222 221110 11111111 Q ss_pred HHhhcCcccceEEEEcCeEEEeecccCceeeecHHHHHHHHHHHhhccC--C--ceECcCCccccee-eccccceeecCh Q lcl|NC_016163. 
385 RTGNISMSTYFTAIFGQHMNVYDEYNGETITVTSTYFLASMIPSNDDQN--G--IQWTFVGPRRGVI-SGFTDINFYPNE 459 (590) Q Consensus 385 ~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~ppsg~~AG~~A~~D~~~--G--~~~sPan~~~~~i-~g~~~~~~~~~~ 459 (590) .... ...+....|. . -.|.+++.|..|.+|-++ | -|| .+ ++ .|+. .-.+++ T Consensus 278 l~~~---~y~~t~~~y~------~-------~~~~aa~~g~~as~nf~~~~g~~T~~---fk---ql~~Gv~--a~~l~~ 333 (501) T protein:vir:10 278 VFAA---PYQGTLPLYG------D-------QATAGAVMGYAASINFQLRNGRTVLA---FR---QFNAGVP--ATAHDL 333 (501) T ss_pred HHhc---CCCceEEECC------C-------CCHHHHHHHHHHhcCcccCcceeeee---ec---ccCCCcC--cccCCH Confidence 1111 1222222221 1 236778888888887544 2 121 11 11 1221 124688 Q ss_pred hHHhhhhhcCceEEEEecCCe--EEEecceecCCCcccceehhhhHHHHHHHHHHHHHHHHhcC----CCCHHHHHHHHH Q lcl|NC_016163. 460 PWKEKLYLAQVNYIERDPKKI--SFATQLTSQTSRSALSYINNVRVLLRIRREVEKMMADYRQE----FQDNTTYDSMSY 533 (590) Q Consensus 460 ~e~~~Ln~~gIn~i~~~~~~G--~~~wG~rT~s~d~~~~~i~vrR~~~~i~~si~~~~~~~vfe----pn~~~l~~~v~~ 533 (590) .|.+.|..+|+|++..|.+.+ +.+|-.-.++++ |.||.+.+-.+|++..|+..+....-. |-|..=...|+. T Consensus 334 t~a~al~~~~~N~y~~~~~~~~~~~~~~~G~~sG~--~~wiD~~~g~dWl~~~iq~~l~~ll~~~~kIPyt~~G~~~l~a 411 (501) T protein:vir:10 334 PTANALRSNNYTYIGAYANAANNYTIAYDGKLSGK--FLWVDTYLDQIYLNAELQRAEFEAMLAYNSLPYNEDGYTANYR 411 (501) T ss_pred HHHHHHHhcCCeEEEEEecccceeeEEEcceeecc--ceehhhHhhHHHHHHHHHHHHHHHHhcCCCcccCHHHHHHHHH Confidence 999999999999999987655 778844345665 556778888888888888777664322 556778888999 Q ss_pred HHHHHHHHHHhCCceEEEec---------------------------------CCCCCHHHhhCCEEEEEEEEEecCccc Q lcl|NC_016163. 534 SLNNYLQQWVANRACSSISG---------------------------------TVYASDYDKQQSIARVKVELVFTGVIE 580 (590) Q Consensus 534 ~i~~~L~~l~~~ga~~~~~d---------------------------------~~~nt~~~i~~G~l~~~i~~ap~~pae 580 (590) .|+.-|++-+++|.|...-+ .++.++..-.+....+.+.++--..+. 
T Consensus 412 ~i~~~l~~av~nG~Ia~Gv~l~~~q~~~i~~~~g~~~~~~~v~~~Gyy~~~~~~~~~~~~R~~R~~p~~~~~y~~~gaIh 491 (501) T protein:vir:10 412 AGVDVIDAAVTSGIIRAGVTLTNSQLQQIDAVAGVAGAGALVQTRGWYFLIGNPANPGQARQNRTSPACTLWYSDGGSIQ 491 (501) T ss_pred HHHHHHHHHHhCcceecCcccCcccceeecccccccccccceeccceEEeeCcccCChhhhhhcccCceEEEEEeCCcee Confidence 99999999999998854211 011112222222233333333334444 Q ss_pred eEEEEEEeeC Q lcl|NC_016163. 581 RIAIDLVVNK 590 (590) Q Consensus 581 fi~~~~~~~~ 590 (590) +|.+-..--= T Consensus 492 ~v~i~s~~v~ 501 (501) T protein:vir:10 492 ELTIGSNAVI 501 (501) T ss_pred EEEeeeeecC Confidence 4432111000 No 64 >protein:vir:107720 Length: 515 # NCBI annotation: gp17 # Family: family:all:396 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024865;genbank:gi:48697507;genbank:GeneID:2948336 Probab=52.13 E-value=0.56 Score=21.95 Aligned_cols=457 Identities=8% Similarity=-0.070 Sum_probs=170.2 Q ss_pred CccccCCceEEEEecCCCceeccccc--ceeEEEeecCCCCCCccEEecCHHHHHHhcCCccccccccHHHHHHHHHc-- Q lcl|NC_016163. 1 MADYLHPSVSSRIVDNSAVYATAAGN--TVLYAAIHSAIGRDNAVEFVTTTDEFLFKFGNPNLSKYGQTSYNILNWLQ-- 76 (590) Q Consensus 1 Mp~yl~PGVYveEi~s~~~~i~gv~T--sv~~~vg~~~~Gp~~~p~~v~s~~e~~~~fG~~~~~~~~~l~~av~~ff~-- 76 (590) || ++.=|||.-.| ++..-.|+.. +.+.+......=|.++.+...|.+|-...||.- ++-..+++.||. T Consensus 1 m~--I~~~~~V~i~~-~v~aa~~~~~~~f~~li~t~~~~~p~~r~~~y~s~~~V~~~FG~~-----S~ey~aA~~yFsg~ 72 (515) T protein:vir:10 1 MP--ISFDKYVAITS-GVAAQQQIAARSFAIRVYTPNPMVSVDRLITATSAADVGAYFGTA-----SEEYKRAVKNFGFI 72 (515) T ss_pred CC--CCceeEEEeec-ccccCCccccccceeeeeecccCCCccceeeecCHHHHHHhcCCC-----hHHHHHHHHHhhhc Confidence 99 77889999654 3433344433 445545555555677888899999999999952 333446667775 Q ss_pred -CC----CcEEEEEEecCCccccccccccceeecccccccceeeeeeccccccccccceEEEeeccccCCcceeeEeecc Q lcl|NC_016163. 77 -SG----GTAYVLRVMPDDAKFANSLISIKTTAAADPAKATVLVTAKAQTTNTASKNAMKTILSGGTAGETPLCFIVPKG 151 (590) Q Consensus 77 -nG----G~~~vvRv~~~~a~~a~~~~~~~~~~a~~~~~~~~~v~~~~~~~~~a~~~~~~~~~~~~t~~~~~~~~~~~~~ 151 (590) |. +++||.|=..... 
. ..+.... .....+. . T Consensus 73 ~~q~p~P~~L~igR~~~~a~----~----------------~~l~g~~-------------------~~~~~l~----~- 108 (515) T protein:vir:10 73 SKKTRRPTSIQFARWQREAG----P----------------VAIYGGA-------------------KKAAALA----T- 108 (515) T ss_pred cCCcccccEEEEEeccCccc----c----------------eEEEecc-------------------chhhhHH----h- Confidence 43 3488888532100 0 0000000 0000000 0 Q ss_pred cccccccceEEEEeeccccccccccccceeeeeecccCCCceeeeeeeeeccccccccccccceeeeeeccccceeeecC Q lcl|NC_016163. 152 RGENYNGYGFRLSLRSDYDNTYNFRTYNLSVTVKDSTGADVVVEGPYIVSFDPEAKDKSRQSIYYANIINKYSQYVEIVD 231 (590) Q Consensus 152 ~g~~~~~~~~~~~~~~~~~~~~~~~~~~l~i~v~d~~~~~~v~e~~~~ls~~~da~~~~~~~~~~~~vv~~~s~~v~~~~ 231 (590) +... .++ .+++.+ |. .....-.-.++|.... ..........-+.... T Consensus 109 -----------~~~i--s~G-------~ltiti-dG--~~~~t~s~i~~S~ats---~~~vAs~i~tal~~~~------- 155 (515) T protein:vir:10 109 -----------LQAV--TAG-------AISFLF-GG--ATTVTVSGISFSAATS---LADVASELQTALRANA------- 155 (515) T ss_pred -----------hhcc--cce-------eEEEEE-cc--eEEEEeeccccccccC---HHHHHHHHHhhhcccc------- Confidence 0000 000 011111 00 0000000001111100 0000000000000000 Q ss_pred ccccceeeeeecccccccCccccceecccccccccccccccccccc-eeecccccccccccccccccceeeeeccccccc Q lcl|NC_016163. 232 NRSAFETISEFVVGDSEADPQKVDIIFGQERAVTPAETIHANVVWK-SSSVETDDPSYDATAANFNNIQYLTEGSEGTWT 310 (590) Q Consensus 232 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 310 (590) ........+........+............ ........ ..+.+...........+.... T Consensus 156 ----------------~~~~~~~tv~~d~~~~~F~v~s~~tG~~~~is~~~~t~~----~~~t~~a~~lglt~~~~av~~ 215 (515) T protein:vir:10 156 ----------------DANLATCTVSYDPVGARFNFAGSPSDDTVQESISIVPQS----NPAIDVAQLLGWNSAQGASYI 215 (515) T ss_pred ----------------ccccceeEEEEecCCCeEEEEEeecCCceeEEEEEecCC----CchhhHHHHhccccccceEEe Confidence 000000000000000000000000000000 00000000 000000000000000000000 Q ss_pred cccccceeccchhhHHHHHHHhhh-ccCCceeeecc----cchhHHHHHHHHHHHHhcCCeEEEEecCCCCCHHHHHHHH Q lcl|NC_016163. 
311 GGNEESALLVKGYSGVLAPEILDK-QQYEIDVLLDG----NNEVAVKNAMSDLCSEQRGDCIAILDCSFQGDAQQTIDYR 385 (590) Q Consensus 311 ~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~----~~~~~~~~a~~~~~~~~~~~~~a~~d~p~~~~~~~~~~~~ 385 (590) .|. ..+....++..+.+. ..+. .+.... ....+...++-.+.+. ....+.+...-...+..+..+-. T Consensus 216 ~g~------aaet~~~a~~a~~~~s~nWy-~f~~a~~~~~~~~~a~~~a~a~~~e~-~~~~~~~~~~~~~~~~~~~~a~~ 287 (515) T protein:vir:10 216 AAS------PVVSPVDTLIASVAGNNNFG-SILFTKNGGTGITLSDAEAIALQNQS-YNVAYKFQVGVDDTTYSSWQAAL 287 (515) T ss_pred ccc------ccccHHHHHHHHHhccCCeE-EEEEeecCccccchhHHHHHHHHHhh-cCceEEEEeccCccceechhhhh Confidence 000 001111111122221 1111 111111 1111222222233322 22233222211111111000000 Q ss_pred HhhcCcccceEEEEcCeEEEeecccCceeeecHHHHHHHHHHHhhccCCc-eECcCCcccceeeccccceeecChhHHhh Q lcl|NC_016163. 386 TGNISMSTYFTAIFGQHMNVYDEYNGETITVTSTYFLASMIPSNDDQNGI-QWTFVGPRRGVISGFTDINFYPNEPWKEK 464 (590) Q Consensus 386 ~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~ppsg~~AG~~A~~D~~~G~-~~sPan~~~~~i~g~~~~~~~~~~~e~~~ 464 (590) ..... ......+++-. +. -+....+|.+|.+|-++-. ...-.. ++..|+.. -.+++.|.+. T Consensus 288 ~~~~~--~~~~~~~~~~~-------~~----~~~a~~~g~~asvnf~~~ng~iT~kf---Kq~~Gita--~~lt~t~a~a 349 (515) T protein:vir:10 288 AAIGG--VNMIYSPVALA-------AE----YHDMQDGIIEAATDFTQQGGATGYMY---VQFNNQTP--AVNDDTLSGI 349 (515) T ss_pred hhhhh--cCceEEEEecc-------Cc----chHHHHHHHHHhcCCCccchhheecc---ccCCCCcc--ccCCHHHHHH Confidence 01100 00011111100 00 1234566677776633210 011112 22334332 2478899999 Q ss_pred hhhcCceEEEEecCC--eEEEec-ceecCCCcccceehhhhHHHHHHHHHHHHHHHHhcC-----CCCHHHHHHHHHHHH Q lcl|NC_016163. 465 LYLAQVNYIERDPKK--ISFATQ-LTSQTSRSALSYINNVRVLLRIRREVEKMMADYRQE-----FQDNTTYDSMSYSLN 536 (590) Q Consensus 465 Ln~~gIn~i~~~~~~--G~~~wG-~rT~s~d~~~~~i~vrR~~~~i~~si~~~~~~~vfe-----pn~~~l~~~v~~~i~ 536 (590) |-.+|+|+...+.+. .+.+|. ..+++++..|++|.+.|-.+||+..++..+.. +|. |-++.=...|+..|. 
T Consensus 350 l~~~~~N~Y~~~~~~~~~~~~~~~G~~~gG~~~~~WiD~~~g~~WL~~~iq~~l~~-L~~s~~KIPytd~G~a~i~a~v~ 428 (515) T protein:vir:10 350 LDDLNINYYGQTQVNGTNLSFYQDGVMMGGPTDPRDSNVYANEQWLKSYAGASFMS-LQLAQGKIPANIEGRGLLLGKMT 428 (515) T ss_pred HHhcCCeEEEEEeccCceEEEEeCCeeeCCccchhHHHHHhhHHHHHHHHHHHHHH-HHhcCCCCccChhhHHHHHHHHH Confidence 999999999988664 478885 45556666889999999999999999998876 455 345555566776664 Q ss_pred -HHHHHHHhCCceEEEec--------------CCCCCHHHhhCCE-------------------EEEEEEEEecCccceE Q lcl|NC_016163. 537 -NYLQQWVANRACSSISG--------------TVYASDYDKQQSI-------------------ARVKVELVFTGVIERI 582 (590) Q Consensus 537 -~~L~~l~~~ga~~~~~d--------------~~~nt~~~i~~G~-------------------l~~~i~~ap~~paefi 582 (590) +-|++-+++|.|...-+ .+...++-...|. +.+..-+.=-..+++| T Consensus 429 q~vl~~av~nG~I~~Gv~ls~~Q~~~i~~~~g~d~~~~~~~~~Gyy~~~~~~~~~~~~~r~~~~~~~~~~y~~g~~i~~i 508 (515) T protein:vir:10 429 KDIIPAAKLNGTFSIGKTLTVDQQLFVTELTGDDTAWQKVQNLGYWYDVQISSFVDTGGTTKYQAVYSLVYSKDDLIRKV 508 (515) T ss_pred HHHHHHHHhCCeeecCcccchhHHHHHHhhhcCcccccchhhcceeEecCcCCCCCcccccccCceeEEEEEcCceEEEE Confidence 67888888998864210 1111222233333 2222222222233333 Q ss_pred EEEEEee Q lcl|NC_016163. 583 AIDLVVN 589 (590) Q Consensus 583 ~~~~~~~ 589 (590) +.....- T Consensus 509 ~~~~~~v 515 (515) T protein:vir:10 509 VGTHTLI 515 (515) T ss_pred EeeeecC Confidence 3332222 No 65 >protein:vir:3165 Length: 426 # NCBI annotation: capsid protein CP67 # Family: family:all:28419 # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665936;genbank:gi:22091122;genbank:GeneID:951267 Probab=49.56 E-value=0.64 Score=21.66 Aligned_cols=395 Identities=10% Similarity=0.036 Sum_probs=140.5 Q ss_pred ccccceEEEeeccccCCcceee----EeecccccccccceEEEEeeccccccccccccceeeeeecccCCCceeeeeeee Q lcl|NC_016163. 125 ASKNAMKTILSGGTAGETPLCF----IVPKGRGENYNGYGFRLSLRSDYDNTYNFRTYNLSVTVKDSTGADVVVEGPYIV 200 (590) Q Consensus 125 a~~~~~~~~~~~~t~~~~~~~~----~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~l~i~v~d~~~~~~v~e~~~~l 200 (590) =+++++++..+-.+.+-+...+ +........++.. 
+. .++. +.+.. .+.+.-+.+.. .+. T Consensus 1 m~~~iVnV~Is~~t~A~~~~~Fg~~liigs~~~~~p~~~-f~-~~~~-Yss~~---------~V~~Dfg~~s~--~Y~-- 64 (426) T protein:vir:31 1 MPKQIVEIELTAEIADRPQETFTDAAIVGTAEEEPPDAE-FG-EVNQ-YSTST---------SVGDDYGEDSD--VYT-- 64 (426) T ss_pred CCcceEEEEeecccccccccccceeeeeeeccccccccc-cc-hhhh-hhhHH---------HHHhcCCCChH--HHH-- Confidence 1246665555544433221110 1111000000000 00 0000 00000 00110000000 000 Q ss_pred ecccccccccccccee-eeeeccccceeeecCccccceeeeeecccccccCccccceeccccccccccccccccccccee Q lcl|NC_016163. 201 SFDPEAKDKSRQSIYY-ANIINKYSQYVEIVDNRSAFETISEFVVGDSEADPQKVDIIFGQERAVTPAETIHANVVWKSS 279 (590) Q Consensus 201 s~~~da~~~~~~~~~~-~~vv~~~s~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 279 (590) +... ...+..+. ...+...+..+....... .+.- ... +...... ........... T Consensus 65 --AA~~--~f~Q~~~~~r~~v~~at~~~~~~~t~~-~tv~-------------g~~-~s~~a~~-----~~~a~~i~~~~ 120 (426) T protein:vir:31 65 --ASEA--IEEMGAEQWRVMVLEATEVTEEELSDG-DTID-------------KVP-ILGNHEV-----ESPDGDIEFTT 120 (426) T ss_pred --HHHH--HHhCCceeEEeeccccceeeeccCCcc-eeec-------------cee-eeecccC-----cchHHHHHHhh Confidence 0000 00000000 000000000000000000 0000 000 0000000 00000000000 Q ss_pred ecccccccccccccccccceeeeeccccccccccccc--eeccchhhHHHHHHHhhhccCCceeee--ccc-chhHHHHH Q lcl|NC_016163. 280 SVETDDPSYDATAANFNNIQYLTEGSEGTWTGGNEES--ALLVKGYSGVLAPEILDKQQYEIDVLL--DGN-NEVAVKNA 354 (590) Q Consensus 280 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~-~~~~~~~a 354 (590) ... .... ..... .......++.+...... .....+..+.. .+.. ......+. .+. ........ 
T Consensus 121 ~~~-----~~~~--~~~~~-~~~~t~~g~~t~~~~~~~~~~s~~dw~~~~--~~~s--~~~~~~ia~~~~~~~~~~~~~~ 188 (426) T protein:vir:31 121 DDD-----PDVE--DFDAE-IVINSATGDVATSEDSIELTYFHADWSQLD--EFPS--DVNNFAVADRRFDLKGVGVLDE 188 (426) T ss_pred ccc-----cccc--cceee-eEeccccceeeccccceeeeeccCcchhhh--cccc--cchhhhhhccccchhhhhhhHh Confidence 000 0000 00000 00000011111100000 00011111000 0000 00000000 000 11111122 Q ss_pred HHHHHHHhcCCeEEEEe-cC-CC-CCHHHHHHHHHhhcCcccceEEEEcCeEEEeecccCceeeecHHHHHHHHHHHhhc Q lcl|NC_016163. 355 MSDLCSEQRGDCIAILD-CS-FQ-GDAQQTIDYRTGNISMSTYFTAIFGQHMNVYDEYNGETITVTSTYFLASMIPSNDD 431 (590) Q Consensus 355 ~~~~~~~~~~~~~a~~d-~p-~~-~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~ppsg~~AG~~A~~D~ 431 (590) .....+..+ -+.+.. .. .. ...+....++..- .-|.|-..+....... .--..++++|.|+..+ T Consensus 189 ~~~wa~~~~--i~~va~~~e~~~~~~~~~~~a~~~~~--------~~y~p~~~~~~~~~~~--~~~~~~~~~~~~aa~~- 255 (426) T protein:vir:31 189 THSWASDED--MGMIANGVNVDDYDSVDEAMDVAHEV--------AGYVPSGDLMMIVDAS--DDDLAAYQLGKFAVSE- 255 (426) T ss_pred hhhhhhhcc--eeeeeeccchhhhcchhhhhhhhhcc--------cccccchhheeehhcc--ccchhhHHhhhhhhhc- Confidence 222222222 122211 11 10 1112233333221 1122221111100000 0013578888888776 Q ss_pred cCCceECcCCcccceee----cc--ccceeecChhHHhhhhhcCceEEEEecCCeEEEecceec-CCCcccceehhhhHH Q lcl|NC_016163. 432 QNGIQWTFVGPRRGVIS----GF--TDINFYPNEPWKEKLYLAQVNYIERDPKKISFATQLTSQ-TSRSALSYINNVRVL 504 (590) Q Consensus 432 ~~G~~~sPan~~~~~i~----g~--~~~~~~~~~~e~~~Ln~~gIn~i~~~~~~G~~~wG~rT~-s~d~~~~~i~vrR~~ 504 (590) +|+.|.-....... .. .++...+..+++..|+ +..|.+..+.+ +..+|-.-+. .....-.||-++|.. 
T Consensus 256 ---~~~~~~~~~~~~~~~~~~~~~~~gv~~t~~~~~~A~~~-~~~n~~~~~~~-~~~i~~~~~~~G~~~~G~~iD~~~g~ 330 (426) T protein:vir:31 256 ---PWYNPLWNELPAGETVSKNVGDPEEQGTFEGGDEAEGE-GPVNVLIDVSD-ANRVSNAVTTAGADSDTSFFDIRRTK 330 (426) T ss_pred ---cccchhhhhccccccceeeccccccccccchhhhhhhc-CCceEEEEecC-ceeeecceeecccccchhhhhhHHHH Confidence 46665421111010 11 1222233334555565 77799998876 4556644444 345667799999999 Q ss_pred HHHHHHHHHHHHHHhcC----CCCHHHHHHHHHHHHHHHHHHHhCCc-eE-EE----ecCCCCCHHHhhCCEEE-EEEEE Q lcl|NC_016163. 505 LRIRREVEKMMADYRQE----FQDNTTYDSMSYSLNNYLQQWVANRA-CS-SI----SGTVYASDYDKQQSIAR-VKVEL 573 (590) Q Consensus 505 ~~i~~si~~~~~~~vfe----pn~~~l~~~v~~~i~~~L~~l~~~ga-~~-~~----~d~~~nt~~~i~~G~l~-~~i~~ 573 (590) +||+..++..++..+=. |-|..=...|+..|+.-|++..+.|+ +. ++ -..+. ++.|..+-++. +++.. T Consensus 331 dwl~~~iq~~l~~ll~~~~KIpyt~~Gi~~I~~~i~~~L~~~v~~~g~~~~~y~v~~P~~~~-~~~dra~R~~~~i~~~~ 409 (426) T protein:vir:31 331 VYTAEMLELDLESLQVSDDDVPFTEDGQAMIEDAIKGTMSGLTGSVGQPLAEYEVDVPEWDD-DDVDRVNRNWGGIDLDA 409 (426) T ss_pred HHHHHHHHHHHHHHhhcCCCCccchhHHHHHHHHHHHHHHHHhcCCCccccceeecCCCccc-cchhhhhhccCCceEEE Confidence 99999999999876532 67788888999999999998888654 22 22 11222 33466665554 78888 Q ss_pred EecCccceEEEEEEeeC Q lcl|NC_016163. 574 VFTGVIERIAIDLVVNK 590 (590) Q Consensus 574 ap~~paefi~~~~~~~~ 590 (590) .....+.++.|+..++= T Consensus 410 ~laGAIh~v~I~g~v~v 426 (426) T protein:vir:31 410 RLAQRAHTFSLGLNVSV 426 (426) T ss_pred EEeCcEEEEEEEEEEeC Confidence 89999999999988888 No 66 >protein:vir:78611 Length: 501 # NCBI annotation: BcepNY3gp03 # Family: family:all:396 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294840;genbank:gi:149882903;genbank:GeneID:5291082 Probab=48.16 E-value=0.68 Score=21.51 Aligned_cols=440 Identities=10% Similarity=0.008 Sum_probs=168.7 Q ss_pred Ccc-ccCCceEEEEecCCCceeccccc-ceeEEEeecCCCCCCccEEecCHHHHHHhcCCccccccccHHHHHHHHHc-- Q lcl|NC_016163. 
1 MAD-YLHPSVSSRIVDNSAVYATAAGN-TVLYAAIHSAIGRDNAVEFVTTTDEFLFKFGNPNLSKYGQTSYNILNWLQ-- 76 (590) Q Consensus 1 Mp~-yl~PGVYveEi~s~~~~i~gv~T-sv~~~vg~~~~Gp~~~p~~v~s~~e~~~~fG~~~~~~~~~l~~av~~ff~-- 76 (590) ||. =|.=--||.-.++ .-+-.+..- +.+.+.+....=|+++.+...|-+|-...||.- ++-..+++.||. T Consensus 1 m~~~~ip~s~iV~V~~~-v~~~~~~~~~~~~lll~~~~~~~~~r~~~y~s~~~V~~~FG~~-----S~ey~aA~~yFs~~ 74 (501) T protein:vir:78 1 MPTTTIPIDQIVQMLPG-VIGAGGAPGRLTGLVLTQDTSIQPGQLADFFQKTDVENWFGGL-----SNEAVIADAYFPGI 74 (501) T ss_pred CCcCccccceEEEEeee-cccCCCcceeeeeEEEecCCCCCccceeeecCHHHHHHhcCCC-----hHHHHHHHHHhhcC Confidence 994 1333455664433 222222222 445555555556778888888999999999952 344457777886 Q ss_pred -CCC----cEEEEEEecCCccccccccccceeecccccccceeeeeeccccccccccceEEEeeccccCCcceeeEeecc Q lcl|NC_016163. 77 -SGG----TAYVLRVMPDDAKFANSLISIKTTAAADPAKATVLVTAKAQTTNTASKNAMKTILSGGTAGETPLCFIVPKG 151 (590) Q Consensus 77 -nGG----~~~vvRv~~~~a~~a~~~~~~~~~~a~~~~~~~~~v~~~~~~~~~a~~~~~~~~~~~~t~~~~~~~~~~~~~ 151 (590) |-. ++||.|-..+... + .+.. +......+ T Consensus 75 ~~q~~~P~~l~igR~~~~a~~-~-------------------~l~g-------------------~~l~~~~l------- 108 (501) T protein:vir:78 75 VNGGQLPYDLKFARYVAADAP-A-------------------SVYG-------------------IPLTGVTL------- 108 (501) T ss_pred CCCCcccceEEEEeecccCcc-e-------------------eEec-------------------cceeccch------- Confidence 433 4788886422100 0 0000 00000000 Q ss_pred cccccccceEEEEeeccccccccccccceeeeeecccCCCceeeeeeeeeccccccccccccceeeeeeccccceeeecC Q lcl|NC_016163. 152 RGENYNGYGFRLSLRSDYDNTYNFRTYNLSVTVKDSTGADVVVEGPYIVSFDPEAKDKSRQSIYYANIINKYSQYVEIVD 231 (590) Q Consensus 152 ~g~~~~~~~~~~~~~~~~~~~~~~~~~~l~i~v~d~~~~~~v~e~~~~ls~~~da~~~~~~~~~~~~vv~~~s~~v~~~~ 231 (590) ......... +++.+ +. ..+. .-.++|..... .........-+...+..+.. 
+ T Consensus 109 --a~~~~~~G~-----------------l~iti-~g--~~~~--~~i~~S~~ts~---~~vA~~i~~al~a~~~tv~~-d 160 (501) T protein:vir:78 109 --TQLQGYSGT-----------------LTVTT-AA--QHVS--SNISLAAATSF---ANAATLIEAAFTSPDFVVSY-D 160 (501) T ss_pred --hhhceeeeE-----------------EEEEe-cc--ceee--eccccccccCH---HHHHHHHHhhhcCcceEEEE-c Confidence 000000000 11100 00 0000 00111111000 00000000001000000100 0 Q ss_pred ccccceeeeeecccccccCccccceecccccccccccccccccccceeecccc-cccccccccccccceeeeeccccccc Q lcl|NC_016163. 232 NRSAFETISEFVVGDSEADPQKVDIIFGQERAVTPAETIHANVVWKSSSVETD-DPSYDATAANFNNIQYLTEGSEGTWT 310 (590) Q Consensus 232 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~ 310 (590) ..... +.+.....+. ........ .+++ ...... .... +... T Consensus 161 s~~~~-----------------f~its~t~G~--------~~~i~~~t-~~~~~a~~l~L-----------t~~~-~a~v 202 (501) T protein:vir:78 161 ALRNR-----------------FVVNTNATGT--------AAAISAVT-GTNNLADELGL-----------SAAA-GASL 202 (501) T ss_pred cccce-----------------EEEEeeecCC--------ceeEEEEe-cccchhhhhcc-----------cccC-ceee Confidence 00000 0000000000 00000000 0000 000000 0000 0000 Q ss_pred cccccceeccchhhHHHHHHHhhhc-cCCceeeecccchhHHHHHHHHHHHHhcCCeEEEE--ecCCCC----CHHHHHH Q lcl|NC_016163. 311 GGNEESALLVKGYSGVLAPEILDKQ-QYEIDVLLDGNNEVAVKNAMSDLCSEQRGDCIAIL--DCSFQG----DAQQTID 383 (590) Q Consensus 311 ~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~a~~--d~p~~~----~~~~~~~ 383 (590) ...+ ...+....++..+.+.. .+-. +........+...++..+++... .+|.+. |..... ...+... T Consensus 203 ~~~g----~~aet~~~a~~a~~~~~~~Wy~-f~~a~~~~~~~~lalA~wiea~~-~~f~~~~~~~~~~~~~~~~~~~i~~ 276 (501) T protein:vir:78 203 QAAG----VAADTPASAMNRAVGLSRNWAT-FTTAWTAVIADRLALASWNSGQA-YKYMYVAPDLEPASIVTNNSASFGA 276 (501) T ss_pred Eecc----ccccCHHHHHHHHHhccCceEE-EEEecCCCHHHHHHHHHHHHhcC-ceEEEEEecCCcceeecccchhHHH Confidence 0000 00011111222222221 1111 11222223344445555555432 232222 221110 0011111 Q ss_pred HHHhhcCcccceEEEEcCeEEEeecccCceeeecHHHHHHHHHHHhhccC--C--ceECcCCccccee-eccccceeecC Q lcl|NC_016163. 
384 YRTGNISMSTYFTAIFGQHMNVYDEYNGETITVTSTYFLASMIPSNDDQN--G--IQWTFVGPRRGVI-SGFTDINFYPN 458 (590) Q Consensus 384 ~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~ppsg~~AG~~A~~D~~~--G--~~~sPan~~~~~i-~g~~~~~~~~~ 458 (590) ..... ...+....|. |+ .+.+++.|..|.+|-++ | -|| .+ ++ .|+. .-.++ T Consensus 277 ~l~a~---~y~~t~~~y~-----~~--------~~~aa~~g~~as~nf~~~~g~~T~~---fk---q~~~Gv~--a~~l~ 332 (501) T protein:vir:78 277 QVFAA---PYQGTLPLYG-----DQ--------ATAGAVMGYAASINFQLRNGRTVLA---FR---QFNAGVP--ATAHD 332 (501) T ss_pred HHhhc---CCCceEEEcC-----Cc--------chHHHHHHHHHhcCcccCcceeeee---cc---ccCCCcC--cccCC Confidence 11111 1122222221 10 14567778888777543 2 121 11 11 1111 12468 Q ss_pred hhHHhhhhhcCceEEEEecCCe--EEEecceecCCCcccceehhhhHHHHHHHHHHHHHHHHhcC----CCCHHHHHHHH Q lcl|NC_016163. 459 EPWKEKLYLAQVNYIERDPKKI--SFATQLTSQTSRSALSYINNVRVLLRIRREVEKMMADYRQE----FQDNTTYDSMS 532 (590) Q Consensus 459 ~~e~~~Ln~~gIn~i~~~~~~G--~~~wG~rT~s~d~~~~~i~vrR~~~~i~~si~~~~~~~vfe----pn~~~l~~~v~ 532 (590) +.|.+.|..+|+|++..|.+.+ +.+|-.-+++++ |.+|.+-+-.+|++..++..+....-. |-|..=...|+ T Consensus 333 ~t~a~al~~~~~N~y~~~~~~~~~~~~~~~G~~sG~--~~wiD~~~~~~Wl~~~iq~~l~~ll~~~~kIPyt~~G~~~l~ 410 (501) T protein:vir:78 333 LGTANALRSNNYTYIGAYANAANNYTIAYDGKLSGK--FLWVDTYLDQIYLNAELQRAEFEAMLAYNSLPYNEDGYTALY 410 (501) T ss_pred HHHHHHHHhcCCeEEEEEecccceeeEEEcCeeecc--ceeehhhhhHHHHHHHHHHHHHHHHHhCCCcccCHHHHHHHH Confidence 8999999999999999887655 778844345665 455666666666666666666543222 66788888899 Q ss_pred HHHHHHHHHHHhCCceEEEec---------------------------------CCCCCHHHhhCCEEEEEEEEEecCcc Q lcl|NC_016163. 
533 YSLNNYLQQWVANRACSSISG---------------------------------TVYASDYDKQQSIARVKVELVFTGVI 579 (590) Q Consensus 533 ~~i~~~L~~l~~~ga~~~~~d---------------------------------~~~nt~~~i~~G~l~~~i~~ap~~pa 579 (590) ..|+.-|++-+++|.|...-+ .++.++..-.++...+.+.++--..+ T Consensus 411 a~v~~~l~~av~nG~I~~Gv~l~~~q~~~I~~~~g~~~~~~~~~~~Gyy~~~~~~~~~~~~R~~R~~p~~~~~y~~~gaI 490 (501) T protein:vir:78 411 RAGVDVIDAAVTSGIIRAGVTLTNSQLQQIDAAAGVAGAGQLVQMRGWYFLIGDPANPGQARQNRTTPTCTLWYSDGGSI 490 (501) T ss_pred HHHHHHHHHHHhCceeecCCCCCCccceeeccccCccccccceeccceEEeeccccCChhhhhhcccCcEEEEEEeCCce Confidence 999999999999998854211 00111111122222333333333333 Q ss_pred ceEEEEEEeeC Q lcl|NC_016163. 580 ERIAIDLVVNK 590 (590) Q Consensus 580 efi~~~~~~~~ 590 (590) .+|.+-..--= T Consensus 491 h~v~i~s~~v~ 501 (501) T protein:vir:78 491 QELTIGSNAVI 501 (501) T ss_pred eEEEeeeeecC Confidence 33332110000 No 67 >protein:vir:99586 Length: 507 # NCBI annotation: hypothetical protein # Family: family:all:396 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039795;genbank:gi:126011045;genbank:GeneID:4818281 Probab=46.30 E-value=0.74 Score=21.30 Aligned_cols=448 Identities=8% Similarity=-0.049 Sum_probs=176.5 Q ss_pred CccccCCceEEEEecCCCceeccccc--ceeEEEeecCCCCCCccEEecCHHHHHHhcCCccccccccHHHHHHHHHcC- Q lcl|NC_016163. 1 MADYLHPSVSSRIVDNSAVYATAAGN--TVLYAAIHSAIGRDNAVEFVTTTDEFLFKFGNPNLSKYGQTSYNILNWLQS- 77 (590) Q Consensus 1 Mp~yl~PGVYveEi~s~~~~i~gv~T--sv~~~vg~~~~Gp~~~p~~v~s~~e~~~~fG~~~~~~~~~l~~av~~ff~n- 77 (590) |-.. --||.-.++ .-+-.+..- ..+.+.+....=|+++.+...|-+|-...||.- ++-..+++.||.. 
T Consensus 1 mip~---s~iVnV~~~-v~~~a~~~~~~~~~lilt~~~~~~~~r~~~y~s~~~V~~~FG~~-----S~ey~aA~~yFsq~ 71 (507) T protein:vir:99 1 MISQ---SRYVRIVSG-VGAGAPVAQRRLIMRVMTTNAVLPPGVVFESSSADAVGAYFGMA-----SEEYKRAKAYMSFI 71 (507) T ss_pred CCCc---cceeEEeee-ccccCcccccccceeeeccccCCCccceEeecCHHHHHHhcCCC-----hHHHHHHHHHhccC Confidence 5432 345553332 222222222 345555444444678888899999999999952 3444567778852 Q ss_pred --C----CcEEEEEEecCCccccccccccceeecccccccceeeeeeccccccccccceEEEeeccccCCcceeeEeecc Q lcl|NC_016163. 78 --G----GTAYVLRVMPDDAKFANSLISIKTTAAADPAKATVLVTAKAQTTNTASKNAMKTILSGGTAGETPLCFIVPKG 151 (590) Q Consensus 78 --G----G~~~vvRv~~~~a~~a~~~~~~~~~~a~~~~~~~~~v~~~~~~~~~a~~~~~~~~~~~~t~~~~~~~~~~~~~ 151 (590) . +++||.|-..+... + . +........... +. T Consensus 72 p~~~~~P~~L~igR~~~~~~~-a-~------------------l~g~~~~~~l~~--------------------~~--- 108 (507) T protein:vir:99 72 SKSINSPSYISFARWVNAAIA-S-M------------------IVGDSLVKNLPA--------------------LK--- 108 (507) T ss_pred CCCCcccceEEEEeecCcccc-c-e------------------eecchhhhhHHH--------------------Hh--- Confidence 1 25888887432110 0 0 000000000000 00 Q ss_pred cccccccceEEEEeeccccccccccccceeeeeecccCCCceeeeeeeeeccccccccccccceeeeeeccccceeeecC Q lcl|NC_016163. 152 RGENYNGYGFRLSLRSDYDNTYNFRTYNLSVTVKDSTGADVVVEGPYIVSFDPEAKDKSRQSIYYANIINKYSQYVEIVD 231 (590) Q Consensus 152 ~g~~~~~~~~~~~~~~~~~~~~~~~~~~l~i~v~d~~~~~~v~e~~~~ls~~~da~~~~~~~~~~~~vv~~~s~~v~~~~ 231 (590) ...+ + . +.+..+.. .... .-.++|....- .........-+...+ T Consensus 109 ~~~~--G-~--lti~v~G~--------~~t~-------------~~i~lS~~ts~---~~vAs~i~~~l~a~~------- 152 (507) T protein:vir:99 109 AVAT--P-T--LSLSIGGT--------VVPI-------------AGIDLTAALTL---TDVAATLQTKIRASA------- 152 (507) T ss_pred hhcc--e-e--EEEEEcCc--------eeEe-------------ccccccccCCH---HHHHHHHHHhhhccc------- Confidence 0000 0 0 00000000 0000 00001100000 000000000000000 Q ss_pred ccccceeeeeecccccccCccccceecccccccccccccccccccceeecccccccccccc---cccccceeeeeccccc Q lcl|NC_016163. 
232 NRSAFETISEFVVGDSEADPQKVDIIFGQERAVTPAETIHANVVWKSSSVETDDPSYDATA---ANFNNIQYLTEGSEGT 308 (590) Q Consensus 232 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~ 308 (590) . ................+... ....+........+. +............... T Consensus 153 ~----------------~~~~~~tv~~d~~~~~F~v~---------s~~tG~~s~i~~at~~~~gt~~s~l~~~~~~~a~ 207 (507) T protein:vir:99 153 N----------------AELATATVTFNTTTNQFVLN---------GTTTGALAPTITAVRTDPATDISSLLGWTNTGTV 207 (507) T ss_pred c----------------ccccceEEEEecCCceEEEE---------eeeccccceeEEEEcCCchhhHHHHhccccccce Confidence 0 00000000000000000000 000000000000000 0000000000000001 Q ss_pred cccccccceeccchhhHHHHHHHhhh-ccCCceee-ecccchhHHHHHHHHHHHHhcCCeEEEEecCCCCCHHHHHHHHH Q lcl|NC_016163. 309 WTGGNEESALLVKGYSGVLAPEILDK-QQYEIDVL-LDGNNEVAVKNAMSDLCSEQRGDCIAILDCSFQGDAQQTIDYRT 386 (590) Q Consensus 309 ~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~-~~~~~~~~~~~a~~~~~~~~~~~~~a~~d~p~~~~~~~~~~~~~ 386 (590) ...+. ..+....++..+... ..+..-.. ..+......+.++..+++.. ..+|.++-..... ....... T Consensus 208 ~~~g~------~aet~~~a~~a~~~~~~nW~~~~~a~~~~~td~~~lalA~wiea~-~~~f~~~~~~~~a---~~~~~~~ 277 (507) T protein:vir:99 208 FVKGQ------AAETPDTSISKSAAISTNFGSFIYTSTPALTNDQITAVASWNASQ-NNMYMYSVPTTIA---NIGTLYA 277 (507) T ss_pred Eeecc------cccCHHHHHHHHHhhcCCeEEEEEEeccccChHHHHHHHHHHhhc-CcEEEEEEecCch---hhhhhhh Confidence 10000 111111122222221 11111111 11212223334444444432 2244333211111 1111100 Q ss_pred hhcCcccceEEEEcCeEEEeecccCceeeecHHHHHHHHHHHhhccC--C--ceECcCCcccceeeccccceeecChhHH Q lcl|NC_016163. 387 GNISMSTYFTAIFGQHMNVYDEYNGETITVTSTYFLASMIPSNDDQN--G--IQWTFVGPRRGVISGFTDINFYPNEPWK 462 (590) Q Consensus 387 ~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~ppsg~~AG~~A~~D~~~--G--~~~sPan~~~~~i~g~~~~~~~~~~~e~ 462 (590) .. .....+...++.+ ..-.-.+.+.+.|.+|.+|-.+ | -|| . +++.|+.. -.+++.|. 
T Consensus 278 ~~-~~~~~~~~~~~~~---------~~~~~~~~aa~~g~~as~nf~~~ng~~T~~---f---k~l~GV~a--~~lt~t~a 339 (507) T protein:vir:99 278 AV-KGFSGCALNITSD---------SLPVDYIEQSPCEILAATDYTRVNATQNYM---Y---YQFPSRNI--TVSDDTTA 339 (507) T ss_pred hh-hhcceeEEEeecc---------cccchhHHHHHHHHHHhhccCcCccceeec---c---cccCCccc--ccCCHHHH Confidence 00 0011111111111 0001124677777888776433 2 121 1 22344432 24789999 Q ss_pred hhhhhcCceEEEEecCCe--EEEec-ceecCCCcccceehhhhHHHHHHHHHHHHHHHHhcC----CCCHHHHHHHHHHH Q lcl|NC_016163. 463 EKLYLAQVNYIERDPKKI--SFATQ-LTSQTSRSALSYINNVRVLLRIRREVEKMMADYRQE----FQDNTTYDSMSYSL 535 (590) Q Consensus 463 ~~Ln~~gIn~i~~~~~~G--~~~wG-~rT~s~d~~~~~i~vrR~~~~i~~si~~~~~~~vfe----pn~~~l~~~v~~~i 535 (590) +.|-.+++|+...+.+.+ +.+|- ..++++.-+|.++.+-+=.+||+..++..+....-. |-|..=...|+..| T Consensus 340 ~al~~~n~N~y~~~a~~~~~~~~~~~G~~~gG~~~fid~d~~~~~~WL~~~iq~~l~~l~~~~~kIPyt~~G~~~l~a~i 419 (507) T protein:vir:99 340 NLVDANRGNYIGQTQSAGQSLAFYQRGILCGGPNDAVDMNIYANEIWLKSAISAQILSLFLNVPRVPANETGESMLLSVI 419 (507) T ss_pred HHHHhcCCeEEEEeccccceeeEEecCeeeCCcccceeeeeecchHHHHHHHHHHHHHHHhcCCCCccChhhHHHHHHHH Confidence 999999999999987644 66774 466666557877777777777777777777653222 56778888899999 Q ss_pred HHHHHHHHhCCceEEEe---------------------------------cCCCCCHH-HhhCCEEEEEEEEEecCccce Q lcl|NC_016163. 536 NNYLQQWVANRACSSIS---------------------------------GTVYASDY-DKQQSIARVKVELVFTGVIER 581 (590) Q Consensus 536 ~~~L~~l~~~ga~~~~~---------------------------------d~~~nt~~-~i~~G~l~~~i~~ap~~paef 581 (590) +.-|++-+++|.|...- +.++.+++ ...++...+.+.+.--..+++ T Consensus 420 ~~~l~~av~nG~I~~Gvtl~~~q~~~in~~~~~~~~~~~~~~~Gyy~~~~~~s~~~~~~r~~r~~~~~~~~y~~~gaI~~ 499 (507) T protein:vir:99 420 QSVVNTAKNNGTISAGKNLNVIQQQYITQISGDANAWRQVANIGYWLNITFSSYTNPNTQLTEWKASYQLIYSKDDAIRF 499 (507) T ss_pred HHHHHHHHhccccccCCcccccchheecccccccccccceeccceEEEeCChHhcChhhhhccccceEEEEEEeCCeEEE Confidence 99999999999874310 11222332 333555556666666666666 Q ss_pred EEEEEEee Q lcl|NC_016163. 
582 IAIDLVVN 589 (590) Q Consensus 582 i~~~~~~~ 589 (590) |++....- T Consensus 500 v~~~~~~v 507 (507) T protein:vir:99 500 VEGTDTLI 507 (507) T ss_pred EEeeeecC Confidence 66655444 No 68 >protein:vir:94073 Length: 494 # NCBI annotation: hypothetical protein # Family: family:all:396 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453619;genbank:gi:84662655;genbank:GeneID:5142583 Probab=33.11 E-value=1.4 Score=19.82 Aligned_cols=442 Identities=10% Similarity=0.001 Sum_probs=162.8 Q ss_pred CccccCCceEEEEecCCCceecccc-cceeEEEeecCCCCCCccEEecCHHHHHHhcCCccccccccHHHHHHHHHc--- Q lcl|NC_016163. 1 MADYLHPSVSSRIVDNSAVYATAAG-NTVLYAAIHSAIGRDNAVEFVTTTDEFLFKFGNPNLSKYGQTSYNILNWLQ--- 76 (590) Q Consensus 1 Mp~yl~PGVYveEi~s~~~~i~gv~-Tsv~~~vg~~~~Gp~~~p~~v~s~~e~~~~fG~~~~~~~~~l~~av~~ff~--- 76 (590) ||. |.=--||.-.++. -+-.+-. .+.+.+......=|.++.+...|.+|-...||.- ++-..+++.||. T Consensus 1 m~~-ip~s~iV~V~~~v-~~~~~~~~~f~~~l~~~~~~~~~~r~~~y~s~~~V~~~FG~~-----S~ey~aA~~yFs~~~ 73 (494) T protein:vir:94 1 MPN-IPISQIVSINPQV-VSAGGTQGTLDGLLLTQATGFPVTQPQVYFSAADVGTAFGLT-----SDEYNAALVYFAGIL 73 (494) T ss_pred CCC-CCcccEEEeeeec-cccCCcccccceeEeecCccCCccceeeecCHHHHHHhcCCC-----hHHHHHHHHHhhhcc Confidence 883 3223455533332 2221111 1344444444455667777888999999999952 344456777886 Q ss_pred CC----CcEEEEEEecCCccccccccccceeecccccccceeeeeeccccccccccceEEEeeccccCCcceeeEeeccc Q lcl|NC_016163. 77 SG----GTAYVLRVMPDDAKFANSLISIKTTAAADPAKATVLVTAKAQTTNTASKNAMKTILSGGTAGETPLCFIVPKGR 152 (590) Q Consensus 77 nG----G~~~vvRv~~~~a~~a~~~~~~~~~~a~~~~~~~~~v~~~~~~~~~a~~~~~~~~~~~~t~~~~~~~~~~~~~~ 152 (590) |- +++||.|-..+.. .+... +.+... .. ..+. 
T Consensus 74 ~q~p~P~~l~igR~~~~a~-~~~l~--------g~~~~~----tl----------~~~~--------------------- 109 (494) T protein:vir:94 74 GGGQQPASLTIGRYASAAT-SAAVF--------GAPLTL----SL----------AQLQ--------------------- 109 (494) T ss_pred CCCccccEEEEEeecCccc-cceee--------ccchhh----hH----------Hhhh--------------------- Confidence 43 3489999743210 00000 000000 00 0000 Q ss_pred ccccccceEEEEeeccccccccccccceeeeeecccCCCceeeeeeeeeccccccccccccceeeeeeccccceeeecCc Q lcl|NC_016163. 153 GENYNGYGFRLSLRSDYDNTYNFRTYNLSVTVKDSTGADVVVEGPYIVSFDPEAKDKSRQSIYYANIINKYSQYVEIVDN 232 (590) Q Consensus 153 g~~~~~~~~~~~~~~~~~~~~~~~~~~l~i~v~d~~~~~~v~e~~~~ls~~~da~~~~~~~~~~~~vv~~~s~~v~~~~~ 232 (590) ..+ ..+....+...+ . .-.++|..... .+........+...+..+..... T Consensus 110 --~~~---g~l~iti~g~~~---------~-------------~~i~lS~~ts~---~~vA~~i~~ai~~a~~~v~~d~~ 159 (494) T protein:vir:94 110 --TLS---GTLIVTTDTQRT---------S-------------AAINLSGATSF---ANAASLMTSGFTTPNFAITYDAQ 159 (494) T ss_pred --hcc---eEEEEEEcceEE---------E-------------eeecccccCCh---hhHHHHHhhhhccccceEEEccc Confidence 000 000000000000 0 00000000000 00000000000000000000000 Q ss_pred cccceeeeeecccccccCccccceecccccccccccccccccccceeecccccccccccccccccceeeeeccccccccc Q lcl|NC_016163. 233 RSAFETISEFVVGDSEADPQKVDIIFGQERAVTPAETIHANVVWKSSSVETDDPSYDATAANFNNIQYLTEGSEGTWTGG 312 (590) Q Consensus 233 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 312 (590) ...+...+. ... ....+... .+........+...... . ...+ T Consensus 160 ~~~f~v~s~------ttG-~~s~is~~---------------------t~~~a~~l~lt~~~~a~--v--------~~~g 201 (494) T protein:vir:94 160 RRRFVLSTT------ATG-TTASVSAV---------------------TGTLADGVGLSTASGAY--V--------EGSG 201 (494) T ss_pred CcEEEEEEc------cCC-ceeEEEEe---------------------ccchhhhhhhhccccce--E--------eecC Confidence 000000000 000 00000000 00000000000000000 0 0000 Q ss_pred cccceeccchhhHHHHHHHhhhc-cCCceeeecccchhHHHHHHHHHHHHhcCCeEEEE--ecCCC----CCHHHHHHHH Q lcl|NC_016163. 
313 NEESALLVKGYSGVLAPEILDKQ-QYEIDVLLDGNNEVAVKNAMSDLCSEQRGDCIAIL--DCSFQ----GDAQQTIDYR 385 (590) Q Consensus 313 ~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~a~~--d~p~~----~~~~~~~~~~ 385 (590) . ..+....++..+.+.. .+. .+.+......+.+.++..+.+... .++.+. +.... ....+..... T Consensus 202 ~------~aet~~~a~~a~~~~~~~Wy-~f~~~~~~~~~~ilalA~wiea~~-~~~~~~~~~~d~~~~~~~~~~~i~~~l 273 (494) T protein:vir:94 202 L------AADTAASALDRLAASSSTWA-IFTTAWAASLSDRTALAQWTSDQV-FRRIYAAWDQDAAGLSVNNVSSFGNIV 273 (494) T ss_pred c------ccccHHHHHHHHHhccCceE-EEEEecCCCHHHHHHHHHHHhhcC-ccEEEEEecCCcceeecccchhHHHHH Confidence 0 0011111222222211 111 122222222334444555544322 222222 21111 1111221111 Q ss_pred HhhcCcccceEEEEcCeEEEeecccCceeeecHHHHHHHHHHHhhccCCceECcCCcccceeec-ccccee-ecChhHHh Q lcl|NC_016163. 386 TGNISMSTYFTAIFGQHMNVYDEYNGETITVTSTYFLASMIPSNDDQNGIQWTFVGPRRGVISG-FTDINF-YPNEPWKE 463 (590) Q Consensus 386 ~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~ppsg~~AG~~A~~D~~~G~~~sPan~~~~~i~g-~~~~~~-~~~~~e~~ 463 (590) ... ...+....|.. -.|.+++.|..|.+|-++ .+.+.... .+. .-++.. .++..|.+ T Consensus 274 ~~~---~y~~t~~~y~~-------------~~~~aa~~g~~aa~~~~~----~~g~~T~~-~k~q~~gi~~~~l~~t~a~ 332 (494) T protein:vir:94 274 KTT---PFSNTIPVYGL-------------LANAMIVLAWGASTNLQI----AEGRTTLA-LRSPVSSAGVRVDNLANAN 332 (494) T ss_pred Hhh---cCCceEEEcCC-------------CChHHHHHHHHHhccccc----cCcceeEE-eeccCCCCCCccCCHHHHH Confidence 111 22233322221 114567777777776433 22232211 111 112222 36788999 Q ss_pred hhhhcCceEEEEecCC--eEEEecceecCCCcccc--eehhhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHH Q lcl|NC_016163. 464 KLYLAQVNYIERDPKK--ISFATQLTSQTSRSALS--YINNVRVLLRIRREVEKMMADYRQEFQDNTTYDSMSYSLNNYL 539 (590) Q Consensus 464 ~Ln~~gIn~i~~~~~~--G~~~wG~rT~s~d~~~~--~i~vrR~~~~i~~si~~~~~~~vfepn~~~l~~~v~~~i~~~L 539 (590) .|..+|+|++..+.+. 
=+.+|..-+++++-.|- +++.--|-+.|+++|...+...-==|-|..=...|+..|+.-| T Consensus 333 al~~~~~N~y~~~~~~~~~~~~~~gg~~sG~~~~id~~~~~~WL~~~iq~~l~~ll~~~~KIPytd~G~~~l~a~i~~~l 412 (494) T protein:vir:94 333 ALLSNGYTYLGKYASATNTYTVTYNGAIGGQFLWADTALGWIALRRNLQQALFETLLAYRSLPYNADGYNALYQGAQDVV 412 (494) T ss_pred HHHhcCCeEEEEecccCceEEEecCceeccccceeeeeccHHHHHHHHHHHHHHHHHhCCCcccChhhHHHHHHHHHHHH Confidence 9999999999988643 35777656777765552 3333344445555554444332112678888889999999999 Q ss_pred HHHHhCCceEEE--ecC----------------------------CCCCHHHhhC-CEEEEEEEEEecCccceEEEEEEe Q lcl|NC_016163. 540 QQWVANRACSSI--SGT----------------------------VYASDYDKQQ-SIARVKVELVFTGVIERIAIDLVV 588 (590) Q Consensus 540 ~~l~~~ga~~~~--~d~----------------------------~~nt~~~i~~-G~l~~~i~~ap~~paefi~~~~~~ 588 (590) ++-.++|.|... -+. +..++++..+ .--.+.+...--..+++|.+.... T Consensus 413 ~~av~nG~I~~Gv~~~~~q~~~i~~~~G~~~~~~~~~kGyy~~~~~~~s~~~ra~R~~~~~~~~y~~~GAIh~v~i~~~~ 492 (494) T protein:vir:94 413 SQFVAAGVIRAGVALSASQRAQIDQAAGVPISGDVVDKGWYLQVIDPITTTVRTDRGSPTVNFWYCDGGSIQRVVVSATT 492 (494) T ss_pred HHHHhCceeecccccCcchhhhhhhhhcCccccceeccceeeeccCCCChhhhhccccCCceEEEEecCcEEEEEEeeEE Confidence 999999988531 110 1122222111 111122222224444555444333 Q ss_pred eC Q lcl|NC_016163. 589 NK 590 (590) Q Consensus 589 ~~ 590 (590) -= T Consensus 493 v~ 494 (494) T protein:vir:94 493 VI 494 (494) T ss_pred eC Confidence 33 Done!