Query lcl|NC_012740.1_cdsid_YP_002922230.1 [gene=18] [protein=tail sheath protein] [protein_id=YP_002922230.1] [location=91506..93509] Match_columns 667 No_of_seqs 203 out of 825 Neff 9.4 Searched_HMMs 1612 Date Thu Nov 7 15:42:05 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_158 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_158_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:80984 Length: 666 100.0 3E-170 2E-173 949.8 55.9 666 1-667 1-666 (666) 2 protein:vir:6594 Length: 666 # 100.0 9E-170 5E-173 947.4 58.2 666 1-667 1-666 (666) 3 protein:vir:6894 Length: 660 # 100.0 3E-163 2E-166 911.5 58.1 658 1-666 1-660 (660) 4 protein:vir:108052 Length: 660 100.0 2E-162 1E-165 906.8 58.4 656 1-665 1-660 (660) 5 protein:vir:101804 Length: 663 100.0 8E-162 5E-165 903.9 57.3 656 1-667 1-663 (663) 6 protein:vir:101187 Length: 663 100.0 2E-161 2E-164 901.1 57.9 657 1-667 1-663 (663) 7 protein:vir:100539 Length: 663 100.0 5E-161 3E-164 899.2 56.5 658 1-667 1-663 (663) 8 protein:vir:103456 Length: 659 100.0 9E-161 6E-164 897.9 57.3 657 1-665 1-659 (659) 9 protein:vir:106427 Length: 679 100.0 2E-160 1E-163 896.1 56.8 656 1-666 1-679 (679) 10 protein:vir:7206 Length: 659 # 100.0 8E-160 5E-163 892.7 58.4 657 1-665 1-659 (659) 11 protein:vir:98263 Length: 664 100.0 1E-157 7E-161 880.9 56.2 655 1-666 1-664 (664) 12 protein:vir:5663 Length: 671 # 100.0 4E-147 3E-150 823.1 52.1 642 1-662 1-671 (671) 13 protein:vir:104477 Length: 749 100.0 1E-143 6E-147 804.5 57.7 641 1-662 1-749 (749) 14 protein:vir:106984 Length: 743 100.0 3E-141 2E-144 791.2 52.6 635 1-663 1-743 (743) 15 protein:vir:104858 Length: 729 100.0 7E-140 4E-143 783.5 53.1 650 1-664 3-729 (729) 16 protein:vir:79092 Length: 477 100.0 1E-107 8E-111 606.5 40.3 467 1-664 1-477 (477) 17 protein:vir:107865 Length: 477 100.0 1E-106 9E-110 600.8 39.7 467 1-664 1-477 (477) 18 protein:vir:98824 Length: 774 100.0 5E-105 3E-108 592.5 40.4 467 1-659 281-774 (774) 19 protein:vir:103168 Length: 641 100.0 1.3E-97 8E-101 551.8 38.8 531 1-554 3-641 (641) 20 protein:vir:103993 Length: 390 100.0 4.7E-95 2.9E-98 537.7 35.8 380 1-666 2-390 (390) 21 protein:vir:78206 Length: 390 100.0 4.7E-95 2.9E-98 537.7 35.8 380 1-666 2-390 (390) 22 protein:vir:79181 Length: 390 100.0 1.2E-94 7.7E-98 535.4 36.4 380 1-666 2-390 (390) 23 protein:vir:6079 Length: 396 # 100.0 5.3E-94 3.3E-97 531.9 38.3 387 1-667 1-396 (396) 24 protein:vir:79141 Length: 391 100.0 2.4E-94 1.5E-97 533.8 36.3 381 1-667 2-391 (391) 25 protein:vir:5711 Length: 396 # 100.0 1.2E-93 7.3E-97 530.0 39.0 387 1-667 1-396 (396) 26 protein:vir:98553 Length: 395 100.0 7.4E-94 4.6E-97 531.2 37.6 383 1-666 1-395 (395) 27 protein:vir:2035 Length: 396 # 100.0 1.7E-93 1E-96 529.2 36.9 387 1-667 1-396 (396) 28 protein:vir:1172 Length: 391 # 100.0 2.4E-93 1.5E-96 528.3 35.2 380 1-666 3-391 (391) 29 protein:vir:1845 Length: 392 # 100.0 9.9E-93 6.2E-96 525.0 37.5 383 1-666 1-392 (392) 30 protein:vir:100323 Length: 393 100.0 1.6E-92 9.9E-96 523.8 38.1 379 1-667 4-393 (393) 31 protein:vir:96740 Length: 388 100.0 1E-90 6.3E-94 514.0 35.6 375 1-661 4-388 (388) 32 protein:vir:10336 Length: 386 100.0 2.6E-88 1.6E-91 500.7 35.7 376 1-661 2-386 (386) 33 protein:vir:5833 Length: 742 # 100.0 4E-77 2.5E-80 439.4 40.2 561 1-658 147-742 (742) 34 protein:vir:63742 Length: 562 100.0 3.4E-68 2.1E-71 390.4 36.3 538 1-657 9-562 (562) 35 protein:vir:79798 Length: 717 100.0 1.6E-66 9.8E-70 381.3 41.5 628 1-652 1-717 (717) 36 protein:vir:80488 Length: 562 100.0 2.5E-66 1.5E-69 380.3 40.3 538 1-657 9-562 (562) 37 protein:vir:95741 Length: 587 100.0 1.7E-65 1.1E-68 375.6 38.0 564 1-657 9-587 (587) 38 protein:vir:102819 Length: 648 100.0 1.7E-63 1E-66 364.8 38.6 598 1-656 1-648 (648) 39 protein:vir:99306 Length: 587 100.0 1.9E-63 1.2E-66 364.4 38.8 564 1-657 9-587 (587) 40 protein:vir:80779 Length: 569 100.0 3.6E-63 2.2E-66 362.9 39.2 541 1-657 9-569 (569) 41 protein:vir:96586 Length: 587 100.0 2.2E-61 1.4E-64 353.1 38.1 560 1-657 9-587 (587) 42 protein:vir:100829 Length: 607 100.0 6.2E-55 3.9E-58 317.8 35.4 552 1-663 18-607 (607) 43 protein:vir:102957 Length: 437 100.0 3.6E-53 2.2E-56 308.1 36.7 417 1-651 9-437 (437) 44 protein:vir:101326 Length: 529 100.0 4.4E-48 2.7E-51 280.2 32.4 487 1-652 1-529 (529) 45 protein:vir:105470 Length: 451 100.0 6.2E-43 3.9E-46 252.0 37.4 424 1-651 9-451 (451) 46 protein:vir:107310 Length: 581 100.0 1.3E-36 8.1E-40 217.3 32.8 536 40-664 1-581 (581) 47 protein:vir:7653 Length: 581 # 100.0 1.2E-36 7.6E-40 217.5 32.6 549 54-664 1-581 (581) 48 protein:vir:78986 Length: 436 100.0 1.7E-30 1.1E-33 183.8 35.2 406 1-651 11-436 (436) 49 protein:vir:102359 Length: 356 99.4 3.6E-14 2.2E-17 94.3 19.3 315 268-650 1-356 (356) 50 protein:vir:4517 Length: 498 # 99.2 2.1E-10 1.3E-13 73.7 27.7 441 1-655 10-498 (498) 51 protein:vir:489 Length: 498 # 99.2 1.5E-10 9.3E-14 74.4 26.8 440 1-667 10-496 (498) 52 protein:vir:4463 Length: 498 # 99.2 3.4E-10 2.1E-13 72.5 27.0 439 1-667 10-496 (498) 53 protein:vir:3788 Length: 376 # 99.1 1.2E-10 7.5E-14 75.0 20.9 351 248-656 1-376 (376) 54 protein:vir:276 Length: 369 # 99.0 2.3E-09 1.4E-12 67.9 25.3 345 275-655 1-369 (369) 55 protein:vir:3751 Length: 376 # 99.0 6E-10 3.7E-13 71.1 22.1 357 229-659 1-376 (376) 56 protein:vir:1996 Length: 495 # 99.0 4.7E-09 2.9E-12 66.3 26.4 438 1-652 11-495 (495) 57 protein:vir:78782 Length: 370 98.9 3.3E-09 2E-12 67.1 23.0 355 229-662 1-370 (370) 58 protein:vir:95263 Length: 450 98.5 5.8E-07 3.6E-10 54.8 30.4 411 201-658 1-450 (450) 59 protein:vir:80052 Length: 331 98.2 3.2E-06 2E-09 50.7 26.4 305 288-652 1-331 (331) 60 protein:vir:5260 Length: 502 # 98.0 6.9E-06 4.3E-09 48.9 34.1 464 1-652 1-502 (502) 61 protein:vir:3165 Length: 426 # 94.7 0.0036 2.2E-06 34.0 19.8 362 244-652 1-426 (426) 62 protein:vir:101576 Length: 501 86.2 0.047 2.9E-05 27.9 34.1 451 1-652 1-501 (501) 63 protein:vir:106730 Length: 501 73.5 0.17 0.00011 24.8 35.2 451 1-652 1-501 (501) 64 protein:vir:96104 Length: 504 70.7 0.21 0.00013 24.3 32.2 454 1-651 1-504 (504) 65 protein:vir:3636 Length: 501 # 65.8 0.28 0.00017 23.6 33.4 450 1-652 1-501 (501) 66 protein:vir:99586 Length: 507 61.9 0.34 0.00021 23.1 33.0 457 1-651 1-507 (507) 67 protein:vir:78611 Length: 501 48.8 0.66 0.00041 21.6 35.8 451 1-652 1-501 (501) No 1 >protein:vir:80984 Length: 666 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469499;genbank:gi:157311456;genbank:GeneID:5602117 Probab=100.00 E-value=3.2e-170 Score=949.82 Aligned_cols=666 Identities=96% Similarity=1.416 Sum_probs=575.3 Q ss_pred CceecCceEEEEecCCCcccccCCCceEEEeeccCCCCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHHcCCCeEEE Q lcl|NC_012740. 1 MTLLSPGFETKETTLSTTIVQSATGRAALVGKFQWGPAFQIVQVTNEVELVNKFGQPDNNTADYFMSGANFLQYGNDLRV 80 (667) Q Consensus 1 ~~~~~PGVyvee~~~~~~~~~~~ts~~afvG~~~~Gp~~~p~~i~s~~e~~~~FG~~~~~~~~~~~v~~~f~ngG~~~~v 80 (667) |+|++|||||||++.+..+.+++||++||||+|+|||+|+|++|+||.||+++||++.+.++++|++++||+|||++||| T Consensus 1 ~~~~~Pgvyv~e~~~~~~~~~~~t~~~~~vg~~~~gp~~~p~~i~~~~~~~~~fg~~~~~~~~~~~~~~~f~~~g~~~~v 80 (666) T protein:vir:80 1 MTLLSPGFETKETTLSTTIVQSATGRAALVGKFQWGPAFQIIQVTNEVELVNKFGQPDNNTADYFMSGANFLQYGNDLRV 80 (666) T ss_pred CceecCceEEEEecCCccccccCcccceEEeccccCCCccceEecCHHHHHHhcCCccCccchHHHHHHHHhcCCCeEEE Confidence 99999999999998666666678999999999999999999999999999999999999999999999999999999999 Q ss_pred EEcCCcccccccccccccccceeeeccccccccceeeEeeeccceeccccceeecccccceeeeeccccccccccccccc Q lcl|NC_012740. 81 VRVLNKEKAKNATALAGNVEFEITNEGSNYEVGDTIKIKHNRQDIETAGKVTKVDGDGKVKGVFIPTGKIIAHAKAIGVY 160 (667) Q Consensus 81 vRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~a~~~~~~ 160 (667) |||.+.++++++......+..+...++....+++.+.+..........+....+....+....+.+.......+...... T Consensus 81 ~R~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~i~~~~~~~~~~~~ta~~~~~a~~~~~~ 160 (666) T protein:vir:80 81 VRVLNKEKAKNATALAGNIEFEITNEGSNYEVGDTIKIKHNRQDIETAGKVTKVDGDGKVKGVFIPTGKIIAHAKAIGVY 160 (666) T ss_pred EEecCccccccccccccceeEEEeeccccccccccccccccCcccccCcceEEEeecceeeeeecchhhhcccccccccc Confidence 99998888888877777777777788877778887777766665555566666666777666666666666666555555 Q ss_pred ccccccceEEEEEeecccccceeeeceeeeceeeeeeccccchhhhccccccccccccceeeeeeccccccceeEEEEee Q lcl|NC_012740. 161 PELDGGWTAEFTSSSGNGSAALSVTKIVTDSGLLLTDLETSRANITNQDFLTKLKKYDMPAVSAIYAGEIGNSLEVEILA 240 (667) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~t~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~A~~~G~~gn~i~v~i~~ 240 (667) .........++....+...........+.+...........................+.+...+..+|.|++.+++.+.. T Consensus 161 ~~v~~~~~~~~~~~~~~~~~a~~V~~~~~~~~~~~~~~~~a~~~~t~~~~~~~~~~~~~~a~~a~~~g~~g~~l~v~i~~ 240 (666) T protein:vir:80 161 PELDGDWTAEFTSSSGNGSAALSVTKIVTDSGLLLTDLETSRANITNQTFLTKLQKYDMPAVSAIYAGEIGNSLEVEILA 240 (666) T ss_pred ceeeccceeeeccccccceeeeeeeeeecCCccceeeeccccccccccccccccccccchhhhhhcccccccceeeeecc Confidence 55555556666665555554444444455554444444444444444444445555566677788899999999888765 Q ss_pred cccccccceeeeeeeecccccccceeeeeccccccccceeeeeccceeeeeEeeeccCCccccccccccchhhhcccccc Q lcl|NC_012740. 241 RSSFSGAVAPELTMYPFGGTRAAARNLIPYAPQNDNQYAFIVRRDGVVVESYVLSTLKGDKDVYGNSIYMDDFFARGSSQ 320 (667) Q Consensus 241 ~a~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~v~e~~~~s~~~~~~~~~~~~~~~~~~~~~~~s~ 320 (667) ..... ...+..+....+.............+..++++.+++..++.++|+|.++...+++...+...++.+++.++.+. T Consensus 241 ~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 319 (666) T protein:vir:80 241 RSAFK-NTAPDLTMYPYGGERTAARNLIPYAPQNDNQYAFIVRRDGVVVESYVLSTLKGDKDVYGNSIYMDDFFGRGSSQ 319 (666) T ss_pred ccccc-cccccceeeeccccccccceeeeeccccccceeeEeccCCccceeeecccccccccccchhhhhhhhhccccce Confidence 54433 23334445555555555666777777778889999999999999999999988988888888888888888888 Q ss_pred eEEEecccccCcccceEEecCCccccccccccccccccccchhHHHHHHhhhcccccccEEecCcCCcchhhHHHHHHHH Q lcl|NC_012740. 321 YIYATAQGWVDGFSGIISLAGGVSANEASTGDRGNDPFIGAMMQGWDLFAERESIHVNLLIAGACAGEGDAFSTVQKHAV 400 (667) Q Consensus 321 ~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~v~~~~~ 400 (667) ++..............+.+.+|.+.........+..+..++..++++++...+.+++++|++|+..+++.+..+++.+|+ T Consensus 320 ~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~l~~p~~~~~~~~~~~v~~~~~ 399 (666) T protein:vir:80 320 YIYATAQGWVDGFSGIISLAGGVSANEATTGGVGADPFIGAMMQGWGLFAERESIHVNLLIAGACAGEGDAFSTVQKHAV 399 (666) T ss_pred eeeecccccccccceEEEecCCCCcccccccccccccccccchhhhhhhhhhcccccceEeecCcCCcccchHHHHHHHH Confidence 88887777777777788899998887777666777788888889999999999999999999999988888899999999 Q ss_pred HHHhhcCcEEEEEccCccccccccccCCHHHHHHHhhhhccccccccccCcceEEEEehhhcccccccCceeEechHHHH Q lcl|NC_012740. 401 SIGDERQDCLVMVSPPRSTVVNIPVTTAIDNLIAWREGNSNYSDNNMNINTTYAVIDGNYKYQYDKYNDVNRWVPLAADI 480 (667) Q Consensus 401 ~~~~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~v~d~~~~~~~~~p~sg~v 480 (667) +||+++++||+++|+|+..+++.++.++++++++||+..+.......+++|+|+++||||++++|+.+++.+++||||++ T Consensus 400 ~~~~~~~~~~~~~~~~~~~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~l~~p~~~~~d~~~~~~~~~p~sg~~ 479 (666) T protein:vir:80 400 SIGDERQDCLVMVSPPRSTVVNIPVTTAIDNLIAWREGSGNYNENNMNINTTYAVIDGNYKYQYDKYNDVNRWVPLAADI 479 (666) T ss_pred HHHHhhcceEEEeecceeEEeecCCCCCHHHHHHHHHhcccchhhhcccCcceEEEEcCceEEecccCCceeEechHHHH Confidence 99999999999999999999999999999999999999998888889999999999999999999999999999999999 Q ss_pred HHHHHHhhhcCCceeeecceeccceeccccccccCChhhhhhhhhcCceEEEEecCCeEEEEcceecCCCcccceeeehh Q lcl|NC_012740. 481 AGLCARTDAVSQPWMSPAGYNRGQIMNVVKLAIEPRKAHRDRLYQAAINPVIGAGGEGFILMGDKTATTVPSPFDRINVR 560 (667) Q Consensus 481 Ag~~a~~d~~~g~~~span~~~~~i~g~~~~~~~~~~~e~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~~~~~~~~~i~vr 560 (667) ||+|||+|.++||||||+|+++.+|.|++++++.+++.|++.||++|||||++|+++|+++||+||+++++++||||||| T Consensus 480 AGl~Ar~D~~~g~~~sPan~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~g~G~~~wG~rT~~~~~s~~~~i~vR 559 (666) T protein:vir:80 480 AGLCARTDAVSQPWMSPAGYNRGQIMNVVKLAIEPRKAHRDRLYQAAINPVIGAGGEGFILMGDKTATTVPSPFDRINVR 559 (666) T ss_pred HHHHHHHhhcCCceEccCCeecceeeccccceeecChhHHHhhhhCCeeEEEEeCCCeEEEEccccCCCCCcccceeehh Confidence 99999999999999999999999999999999999999999999999999999999999999999999998999999999 Q ss_pred hhhHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhcCCeeeeEEEEcccCCCHHHhhCCeEEEEEEEEecCC Q lcl|NC_012740. 561 RLFNMLKKNIGDSSKYKLFENNDNFTRASFRMEVSQYLSTIRSLGGIYDFRVQCDTTNNTPDVIDRNEFVASMFIKPAKS 640 (667) Q Consensus 561 R~~~~i~~~l~~~~~~~v~epn~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~~i~~~p~~p 640 (667) |||+||+++|++.++|+||||||+.+|++|+++|++||++||++|+|.||+|+||+++||+++|++|+|+|+|+++|++| T Consensus 560 Rl~~~i~~si~~~~~~~v~epn~~~l~~~i~~~i~~~L~~l~~~gal~g~~V~~d~~~nt~~di~~G~~~~~i~~~P~~P 639 (666) T protein:vir:80 560 RLFNMLKKNIGDSSKYKLFENNDNFTRASFRMEVSQYLSTIRSLGGIYDFRVQCDTTNNTPDVIDRNEFVASMFIKPAKS 639 (666) T ss_pred hHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhcCceeeeEEEEcCCCCCHHHhhCCeEEEEEEEEecCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceEEEEEEEEeecCeeHHHHHHHHhcC Q lcl|NC_012740. 641 INYIMLNFTAVATGADFDEIIGPANQA 667 (667) Q Consensus 641 ~e~i~~~~~~~~~~~~~~e~~~~~~~~ 667 (667) ||||+|||+|++++++|+|++++|||| T Consensus 640 ae~I~~~~~~~~~~~~~~e~~~~~~~~ 666 (666) T protein:vir:80 640 INYIMLNFTAVATGSDFDEIIGPVNQA 666 (666) T ss_pred cceEEEEEEEeecCccHHHHHHHHhcC Confidence 999999999999999999999999999 No 2 >protein:vir:6594 Length: 666 # NCBI annotation: tail sheath protein # Family: family:all:661 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891725;genbank:gi:33620670;genbank:GeneID:1725344 Probab=100.00 E-value=8.6e-170 Score=947.45 Aligned_cols=666 Identities=97% Similarity=1.423 Sum_probs=566.5 Q ss_pred CceecCceEEEEecCCCcccccCCCceEEEeeccCCCCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHHcCCCeEEE Q lcl|NC_012740. 1 MTLLSPGFETKETTLSTTIVQSATGRAALVGKFQWGPAFQIVQVTNEVELVNKFGQPDNNTADYFMSGANFLQYGNDLRV 80 (667) Q Consensus 1 ~~~~~PGVyvee~~~~~~~~~~~ts~~afvG~~~~Gp~~~p~~i~s~~e~~~~FG~~~~~~~~~~~v~~~f~ngG~~~~v 80 (667) |+|+||||||||++.+..+.+++||++||||+|+|||+|+|++|+||.||+++||++.+.+|++|++++||+|||++||| T Consensus 1 ~~~~~PgVyv~e~~~~~~i~~v~ts~~~fvG~~~~Gp~~~p~~v~s~~~~~~~fG~~~~~~~~~~~~~~~f~ngg~~~~v 80 (666) T protein:vir:65 1 MTLLSPGFETKETTLSTTIVQSETGRAALVGKFQWGPAFQIIQVTNEVELVNKFGQPDNNTADYFMSGANFLQYGNDLRV 80 (666) T ss_pred CceecCceEEEEecCcccccccCcccceEEecccCCCCccCEEecCHHHHHHHcCCccccchhHHHHHHHHHhcCceEEE Confidence 99999999999997555555567999999999999999999999999999999999999999999999999999999999 Q ss_pred EEcCCcccccccccccccccceeeeccccccccceeeEeeeccceeccccceeecccccceeeeeccccccccccccccc Q lcl|NC_012740. 81 VRVLNKEKAKNATALAGNVEFEITNEGSNYEVGDTIKIKHNRQDIETAGKVTKVDGDGKVKGVFIPTGKIIAHAKAIGVY 160 (667) Q Consensus 81 vRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~a~~~~~~ 160 (667) ||+.+.++++++.........+...++....+|+.+.+.........+.....++..++....+.++............. T Consensus 81 vrv~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~~~~V~~~~~~~~~~~~~~~~~~~~~~~g~~~~t~~~~~~~~~~g~~ 160 (666) T protein:vir:65 81 VRVLNKEKAKNATALAGNVEFEITNEGSNYEVGDTIKIKHNRQDIETAGKVTKVDGDGKVKGVFIPTGKIIAHAKAIGVY 160 (666) T ss_pred EEccCcccccccccccCceeeeEeeccccccccceEEEEecccccccccccccccccccccccccccceeeccccccCcc Confidence 99999888888887777777777788888888999988887776666666666666666666666665555555555555 Q ss_pred ccccccceEEEEEeecccccceeeeceeeeceeeeeeccccchhhhccccccccccccceeeeeeccccccceeEEEEee Q lcl|NC_012740. 161 PELDGGWTAEFTSSSGNGSAALSVTKIVTDSGLLLTDLETSRANITNQDFLTKLKKYDMPAVSAIYAGEIGNSLEVEILA 240 (667) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~t~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~A~~~G~~gn~i~v~i~~ 240 (667) ......+..++................+.+.............................+...+..++.||+.+++.+.. T Consensus 161 ~~l~~~~~~~~~~~~~~~~~a~sv~~~~~~g~~~~~~~~~a~~~~~~~~~~~~~~~~~~~a~~A~~~g~~g~~i~v~i~~ 240 (666) T protein:vir:65 161 PELDGGWTAEFTSSSGNGSAALSVTKIVTDSGLLLTDLETSRANITNQTFLTKLKKYDMPAVSAIYAGEIGNSLEVEILA 240 (666) T ss_pred eeEeeccceeecccCcccccceeeeecccccceeeeeecccccccccccccccccccccceeeeeeccccccceeEEeec Confidence 44444444454444433333333333333333333333332222222223333344556677788999999999888766 Q ss_pred cccccccceeeeeeeecccccccceeeeeccccccccceeeeeccceeeeeEeeeccCCccccccccccchhhhcccccc Q lcl|NC_012740. 241 RSSFSGAVAPELTMYPFGGTRAAARNLIPYAPQNDNQYAFIVRRDGVVVESYVLSTLKGDKDVYGNSIYMDDFFARGSSQ 320 (667) Q Consensus 241 ~a~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~v~e~~~~s~~~~~~~~~~~~~~~~~~~~~~~s~ 320 (667) ........ +..+.....................++.+.+++..+|.+.|+|.+++..+.+...+...++.+++.++.+. T Consensus 241 ~~~~~~~~-~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 319 (666) T protein:vir:65 241 RSAFKNTA-PDLTMYPYGGERTAARNLIPYAPQNDNQYAFIVRRDGVVVESYVLSTLKGDKDVYGNSIYMDDFFARGSSQ 319 (666) T ss_pred cccccccc-ccccccccccccccceeeecccccccccceeeeecCCcccceeecccCcccccccchhhhhhhhhcccccc Confidence 55443322 22333334444444555666667777889999999999999999999889998888888888888899999 Q ss_pred eEEEecccccCcccceEEecCCccccccccccccccccccchhHHHHHHhhhcccccccEEecCcCCcchhhHHHHHHHH Q lcl|NC_012740. 321 YIYATAQGWVDGFSGIISLAGGVSANEASTGDRGNDPFIGAMMQGWDLFAERESIHVNLLIAGACAGEGDAFSTVQKHAV 400 (667) Q Consensus 321 ~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~v~~~~~ 400 (667) ++++.....+......+.+.+|.+.........+..+..++..+++++++..+.+.+++|++|++.+.+.+..+++.+|+ T Consensus 320 ~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~l~~p~~~~~~~~~~~v~~~l~ 399 (666) T protein:vir:65 320 YIYATAQGWVDGFSGIISLAGGVSANEATTGGVGADPFIGAMMQGWDLFAERESIHVNLLIAGACAGEGDAFSTVQKHAV 399 (666) T ss_pred eeeeecccccccccceEEccCCCCcCcccccccccccccccHHHHHHHHhhhhhccCCceeecCcCCccchhHHHHHHHH Confidence 99988887777777888999999988877777788888888999999999988889999999999988888999999999 Q ss_pred HHHhhcCcEEEEEccCccccccccccCCHHHHHHHhhhhccccccccccCcceEEEEehhhcccccccCceeEechHHHH Q lcl|NC_012740. 401 SIGDERQDCLVMVSPPRSTVVNIPVTTAIDNLIAWREGNSNYSDNNMNINTTYAVIDGNYKYQYDKYNDVNRWVPLAADI 480 (667) Q Consensus 401 ~~~~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~v~d~~~~~~~~~p~sg~v 480 (667) +||+++++||+++|+|+..+++.+++++.+++++||+..+.+.....+++|+|+++||||++++|+.+++.+++|||+++ T Consensus 400 ~~~~~~~~~~a~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~sg~v 479 (666) T protein:vir:65 400 SIGDERQDCLVMVSPPRSTVVNIPVTTAIDNLIAWREGSGNYNENNMNINTTYAVIDGNYKYQYDKYNDVNRWVPLAADI 479 (666) T ss_pred HHHhhccceEEEeccccceeeecCCCCCHHHHHHHHHhcccccccccccCcceEEEEcCceEEecccCCceeEechHHHH Confidence 99999999999999999999999999999999999999999988899999999999999999999999999999999999 Q ss_pred HHHHHHhhhcCCceeeecceeccceeccccccccCChhhhhhhhhcCceEEEEecCCeEEEEcceecCCCcccceeeehh Q lcl|NC_012740. 481 AGLCARTDAVSQPWMSPAGYNRGQIMNVVKLAIEPRKAHRDRLYQAAINPVIGAGGEGFILMGDKTATTVPSPFDRINVR 560 (667) Q Consensus 481 Ag~~a~~d~~~g~~~span~~~~~i~g~~~~~~~~~~~e~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~~~~~~~~~i~vr 560 (667) ||+|||+|.++||||||+|+++.+|.|++++++.+++.|++.||++|||||++|+++|+++||+||+++++++|+||||| T Consensus 480 AGl~Ar~D~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~~~~s~~~~i~vr 559 (666) T protein:vir:65 480 AGLCARTDAVSQPWMSPAGYNRGQIMNVVKLAIEPRKAHRDRLYQAAINPVIGAGGEGFILMGDKTATTVPSPFDRINVR 559 (666) T ss_pred HHHHHHHhccCCcEEccCCeecceeeccccceeecChhHHHhhhhCCceEEEEeCCCeEEEEecccCCCCCcccceEehh Confidence 99999999999999999999999999999999999999999999999999999999999999999999988899999999 Q ss_pred hhhHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhcCCeeeeEEEEcccCCCHHHhhCCeEEEEEEEEecCC Q lcl|NC_012740. 561 RLFNMLKKNIGDSSKYKLFENNDNFTRASFRMEVSQYLSTIRSLGGIYDFRVQCDTTNNTPDVIDRNEFVASMFIKPAKS 640 (667) Q Consensus 561 R~~~~i~~~l~~~~~~~v~epn~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~~i~~~p~~p 640 (667) |||+||+++|++.++|+||||||+.+|++|+++|+.||++||++|+|.||+|+||+++||+++|++|+|+|+|+++|++| T Consensus 560 R~~~~i~~si~~~~~~~v~epn~~~l~~~i~~~i~~~L~~l~~~gal~g~~V~~d~~~nt~~~i~~G~~~~~i~~~p~~p 639 (666) T protein:vir:65 560 RLFNMLKKNIGDSSKYKLFENNDNFTRASFRMEVSQYLSTIRSLGGIYDFRVQCDTTNNTPDVIDRNEFVASMFIKPAKS 639 (666) T ss_pred hHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEcCCCCCHHHhhCCeEEEEEEEEecCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceEEEEEEEEeecCeeHHHHHHHHhcC Q lcl|NC_012740. 641 INYIMLNFTAVATGADFDEIIGPANQA 667 (667) Q Consensus 641 ~e~i~~~~~~~~~~~~~~e~~~~~~~~ 667 (667) ||||+|||+|++++++|+|++++|||| T Consensus 640 ae~i~~~~~~~~~~~~~~e~~~~~~~~ 666 (666) T protein:vir:65 640 INYIMLNFTAVATGSDFDEIIGPANQA 666 (666) T ss_pred cceEEEEEEEeecCccHHHHHHHHhcC Confidence 999999999999999999999999999 No 3 >protein:vir:6894 Length: 660 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861870;genbank:gi:32453661;genbank:GeneID:1494296 Probab=100.00 E-value=3.2e-163 Score=911.46 Aligned_cols=658 Identities=62% Similarity=1.079 Sum_probs=543.6 Q ss_pred CceecCceEEEEecCCCcccccCCCceEEEeeccCCCCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHHcCCCeEEE Q lcl|NC_012740. 1 MTLLSPGFETKETTLSTTIVQSATGRAALVGKFQWGPAFQIVQVTNEVELVNKFGQPDNNTADYFMSGANFLQYGNDLRV 80 (667) Q Consensus 1 ~~~~~PGVyvee~~~~~~~~~~~ts~~afvG~~~~Gp~~~p~~i~s~~e~~~~FG~~~~~~~~~~~v~~~f~ngG~~~~v 80 (667) |+|+||||||||+++...+.+++|||+||||+|+|||+|+|++|+||.||+|.||++++.++++|++++||+|||++||| T Consensus 1 ~~~~~PgVyv~e~~~~~~i~~v~ts~~~fvG~~~~Gp~~~p~~i~s~~~~~~~fG~~~~~~~~~~~~~~~f~~~g~~~~v 80 (660) T protein:vir:68 1 MALLSPGVELKETTVQSTVVNNSTGTAALAGKFQWGPAFQIKQITDEVALVDMFGTPNTDTADYFMSAMNFLQYGNDLRV 80 (660) T ss_pred CccccCceEEEEecCCcccccCCCcceeEEecccCCCCccCEEecCHHHHHHhcCCccCccchhHHHHHHHHhCCCeEEE Confidence 99999999999997444444457999999999999999999999999999999999999999999999999999999999 Q ss_pred EEcCCcccccccccccccccceeeeccccccccceeeEeeeccceeccccceeecccccceeeeeccccccccccccccc Q lcl|NC_012740. 81 VRVLNKEKAKNATALAGNVEFEITNEGSNYEVGDTIKIKHNRQDIETAGKVTKVDGDGKVKGVFIPTGKIIAHAKAIGVY 160 (667) Q Consensus 81 vRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~a~~~~~~ 160 (667) ||+.+.+.++++.........+...++....+++.+.++.........+..+.++.+++....+.+++.....+..+..+ T Consensus 81 vRv~~~~~~~~~~~~~~~~~~t~~~~g~~~~~g~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~ta~~~~~a~~~~~~ 160 (660) T protein:vir:68 81 VRAVDRDTAKNSSPVAGNINFTISSAGTNYRVGDKVVVKYSTDIIEPDGEVTSVDSDGKILNIFIPSGKIIAKAKEIGEY 160 (660) T ss_pred EEecccccccccccccccceeeeeccCcceeeeeeeeeecccccccccccceeeeecCceeeeeeccccccccceeeccc Confidence 99998887777776666777777778877788888887777665555566677777777777666666666555555555 Q ss_pred ccccccceEEEEEeecccccceeeeceeeeceeeeeeccccchhhhccccccccccccceeeeeeccccccceeEEEEee Q lcl|NC_012740. 161 PELDGGWTAEFTSSSGNGSAALSVTKIVTDSGLLLTDLETSRANITNQDFLTKLKKYDMPAVSAIYAGEIGNSLEVEILA 240 (667) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~t~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~A~~~G~~gn~i~v~i~~ 240 (667) +.........+................+.+...........+...............+.....+...|.||+.+++.+.. T Consensus 161 ~~~~~~~~~~v~~~~~~~~~~~~v~~~~~d~~~~~~~~~ta~~~~~~~~~~~~~~~~~~~~~~A~~~g~~G~~i~v~~~~ 240 (660) T protein:vir:68 161 PELGSNWTAEMSGSSSGLSAVITIDSVVMDSGILLTEVETSEEAITSLTFQESIKKYGVPGVVALYPGELGDQLEIEIVS 240 (660) T ss_pred cccccceeEEeecccccceeeeeeccccccccceeeeeccccccccccceeeeecccCccccccccccccccceEEEEec Confidence 55444444444443333333333333344444333344444433333333444444555667788889999999998877 Q ss_pred cccccccceeeeeeeecccc-cccceeeeeccccccccceeeeeccceeeeeEeeeccCCccccccccccchhhhccccc Q lcl|NC_012740. 241 RSSFSGAVAPELTMYPFGGT-RAAARNLIPYAPQNDNQYAFIVRRDGVVVESYVLSTLKGDKDVYGNSIYMDDFFARGSS 319 (667) Q Consensus 241 ~a~~~~~~~~~~t~~~~~~~-~~~~~~~~~~~~~~~~~~~~~v~~~g~v~e~~~~s~~~~~~~~~~~~~~~~~~~~~~~s 319 (667) .+..........+....... ......+....+..++.+.+.+..++.+.|++.+++..+.+...+...++.+...++.+ T Consensus 241 ~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 320 (660) T protein:vir:68 241 KADYDKGASAQLKIYPDGGTRYSTAKAIFGYGPQTDDQYAIIVRRNDSVVQSVVLSTKRGERDIYGSNIFIDDFFAKGAS 320 (660) T ss_pred cccccccccccceeeecccccccceeeEeecccccccceeeeeecCCcceeeeeeecccccccccccceeeehhhccCcc Confidence 66655444333333332222 23344555666677778889999999999999999988888888888888888888888 Q ss_pred ceEEEecccccCcccceEEecCCccccccccccccccccccchhHHHHHHhhhcccccccEEecCcCC-cchhhHHHHHH Q lcl|NC_012740. 320 QYIYATAQGWVDGFSGIISLAGGVSANEASTGDRGNDPFIGAMMQGWDLFAERESIHVNLLIAGACAG-EGDAFSTVQKH 398 (667) Q Consensus 320 ~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~-~~~~~~~v~~~ 398 (667) .++.+.....+.......++.+|.++... ...++...++.++...+.+.+.++++++... ...+..+++.+ T Consensus 321 ~~v~~~~~~~~~~~~~~~~~~gg~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~v~~~ 392 (660) T protein:vir:68 321 NYIFATAQGWPKGFSGVIKLNGGLSSNET--------VEAGDLMEAWDLFADRESVNAQLFIAGSCAGESLEVASTVQKH 392 (660) T ss_pred cEEEEeecCCCccccceeeeccccccccc--------cccchhhhHHHHhhhhhccccceeeccccCCCchHHHHHHHHH Confidence 88888776666666666677777664322 1233456677788887888888877776554 34556789999 Q ss_pred HHHHHhhcCcEEEEEccCccccccccccCCHHHHHHHhhhhccccccccccCcceEEEEehhhcccccccCceeEechHH Q lcl|NC_012740. 399 AVSIGDERQDCLVMVSPPRSTVVNIPVTTAIDNLIAWREGNSNYSDNNMNINTTYAVIDGNYKYQYDKYNDVNRWVPLAA 478 (667) Q Consensus 399 ~~~~~~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~v~d~~~~~~~~~p~sg 478 (667) |++||+++++||+++|+|+.++++.+.+++.+++++||+..+.......+++|+|+++||||++++|+.++..+++|||| T Consensus 393 l~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~sg 472 (660) T protein:vir:68 393 VVAIGDSRQDCLVLCSPPRAAVVGIPVNRAVDNLVDWRTASGTYTDNNFNISSTYAAIDGNYKYQYDKYNDVNRWVPLAA 472 (660) T ss_pred HHHHHHhhCCeEEEEcccceeEecCCCCCCHHHHHHHHhhcccccccccccCcceEEEEcCceEEecccCCceEEechhH Confidence 99999999999999999999999999999999999999999988888889999999999999999999999999999999 Q ss_pred HHHHHHHHhhhcCCceeeecceeccceeccccccccCChhhhhhhhhcCceEEEEecCCeEEEEcceecCCCcccceeee Q lcl|NC_012740. 479 DIAGLCARTDAVSQPWMSPAGYNRGQIMNVVKLAIEPRKAHRDRLYQAAINPVIGAGGEGFILMGDKTATTVPSPFDRIN 558 (667) Q Consensus 479 ~vAg~~a~~d~~~g~~~span~~~~~i~g~~~~~~~~~~~e~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~~~~~~~~~i~ 558 (667) ++||+|||+|.++||||||+|+++.+|.|++++++.+++.|++.||++||||||+|+++|+++||+||+++++++||||| T Consensus 473 ~~AGl~Ar~d~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~~~~s~~~~i~ 552 (660) T protein:vir:68 473 DIAGLCARTDNISQPWMSPAGYNRGQILNVIKLAIETRQAQRDRLYQEAINPVTGTGGDGYVLYGDKTATSVPSPFDRIN 552 (660) T ss_pred HHHHHHHHHhccCCcEEccCCeeeceeeccceeeecCChhHHHHHhhCCceEEEEecCCeEEEEcceecCCCCcccceEe Confidence 99999999999999999999999999999999999999999999999999999999999999999999999989999999 Q ss_pred hhhhhHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhcCCeeeeEEEEcccCCCHHHhhCCeEEEEEEEEec Q lcl|NC_012740. 559 VRRLFNMLKKNIGDSSKYKLFENNDNFTRASFRMEVSQYLSTIRSLGGIYDFRVQCDTTNNTPDVIDRNEFVASMFIKPA 638 (667) Q Consensus 559 vrR~~~~i~~~l~~~~~~~v~epn~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~~i~~~p~ 638 (667) |||||+||+++|+++++|+||||||+.+|++|+++|++||++||++|+|.||+|+||+++||+++|++|+|+|+|+++|+ T Consensus 553 vrR~~~~i~~si~~~~~~~v~epn~~~~~~~i~~~i~~~L~~l~~~gal~gf~V~~d~~~nt~~~i~~G~~~~~i~~~p~ 632 (660) T protein:vir:68 553 VRRLFNMVKTNIGSASKYRLFELNNAFTRSSFRTETSQYLQGIKALGGVYNFKVVCDTTNNTPAVIDRNEFVATFYLQPA 632 (660) T ss_pred hhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhcCceeeeEEEEecCCCCHHHhhCCeEEEEEEEEec Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCceEEEEEEEEeecCeeHHHHHHHHhc Q lcl|NC_012740. 639 KSINYIMLNFTAVATGADFDEIIGPANQ 666 (667) Q Consensus 639 ~p~e~i~~~~~~~~~~~~~~e~~~~~~~ 666 (667) +|+|||+|||+|++++++|+|++++|.+ T Consensus 633 ~pae~i~l~~~~~~~~~~~~e~~~~v~~ 660 (660) T protein:vir:68 633 RSINYITLNFVATATGADFDELIGAVGG 660 (660) T ss_pred CCcceEEEEEEEeecCccHHHHHHhhcC Confidence 9999999999999999999999999999 No 4 >protein:vir:108052 Length: 660 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595294;genbank:gi:161622600;genbank:GeneID:5783771 Probab=100.00 E-value=2.2e-162 Score=906.80 Aligned_cols=656 Identities=67% Similarity=1.097 Sum_probs=546.0 Q ss_pred CceecCceEEEEecCCCcccc-cCCCceEEEeeccCCCCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHHcCCCeEE Q lcl|NC_012740. 1 MTLLSPGFETKETTLSTTIVQ-SATGRAALVGKFQWGPAFQIVQVTNEVELVNKFGQPDNNTADYFMSGANFLQYGNDLR 79 (667) Q Consensus 1 ~~~~~PGVyvee~~~~~~~~~-~~ts~~afvG~~~~Gp~~~p~~i~s~~e~~~~FG~~~~~~~~~~~v~~~f~ngG~~~~ 79 (667) |+|+||||||||++. +++++ ++||++||||+|+|||+|+|++|+||.||++.||+|++.+|++|++++||+|||++|| T Consensus 1 ~~~~~Pgvyv~e~~~-~~~i~~~~t~~~~~vg~~~~gp~~~p~~v~s~~~~~~~fg~~~~~~~~~~~~~~~f~~~g~~~~ 79 (660) T protein:vir:10 1 MALLSPGIELKETSV-QSTVVRNATGRAALVGKFQWGPAFQVTQITNEVELVDLFGGPNNEVADYFMSGMNFLQYGNDLR 79 (660) T ss_pred CceecCceEEEeecC-CccccCCCcccceEEeecCCCCCccCeEcCCHHHHHHHcCCcCCCchhHHHHHHHHHhCCceEE Confidence 999999999999975 45555 5799999999999999999999999999999999999999999999999999999999 Q ss_pred EEEcCCcccccccccccccccceeeeccccccccceeeEeeeccceeccccceeecccccceeeeecccccccccccccc Q lcl|NC_012740. 80 VVRVLNKEKAKNATALAGNVEFEITNEGSNYEVGDTIKIKHNRQDIETAGKVTKVDGDGKVKGVFIPTGKIIAHAKAIGV 159 (667) Q Consensus 80 vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~a~~~~~ 159 (667) ||||.+++.+++++.....+..++..++....||+.+.+..........+.+...+..+.....+.+++.....+..... T Consensus 80 vvrv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~v~~~~~~~~~~~~v~~~~a~g~~~~~~~~ta~~~~~a~~v~~ 159 (660) T protein:vir:10 80 TVRVVSREFAKNASPIAGNIETTITTAGSNYAVGDKINIKYNQTVVESEGRVTSVDTDGKILSVFIPSAKIIAYARSLNQ 159 (660) T ss_pred EEEecccccccccccccccceeEEeeccccccccceeeEeeccccccccccceeeccccceeeecccccccccccccccc Confidence 99999988777777777778888888888889999998888777666666667777776666666666666666666666 Q ss_pred cccccccceEEEEEeecccccceeeeceeeeceeeeeeccccchhhhccccccccccccceeeeeeccccccceeEEEEe Q lcl|NC_012740. 160 YPELDGGWTAEFTSSSGNGSAALSVTKIVTDSGLLLTDLETSRANITNQDFLTKLKKYDMPAVSAIYAGEIGNSLEVEIL 239 (667) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~t~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~A~~~G~~gn~i~v~i~ 239 (667) .......+..++....+......+....+.+.+...........................+...+...|.+|+.+.+.+. T Consensus 160 ~~~~~~~~~~~~~~~~~~~~~a~sv~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~G~~i~v~i~ 239 (660) T protein:vir:10 160 YPTLGPAWTAEVTSASSGVSGTITVGKIVTDSGILLTEAENSEEAITSLEFQAALKKFAMPGVVALYPGEIGSTLEVEIV 239 (660) T ss_pred ccccccceeEEEecccCccccceeeeeeeccCcceEEeeeccccccccccceeeccccccceeeeecccccCcceeEEEe Confidence 55555555555555554444444444444444444443333333222223333344455566777889999999988887 Q ss_pred ecccccccceeeeeeeecccccc-cceeeeeccccccccceeeeeccceeeeeEeeeccCCccccccccccchhhhcccc Q lcl|NC_012740. 240 ARSSFSGAVAPELTMYPFGGTRA-AARNLIPYAPQNDNQYAFIVRRDGVVVESYVLSTLKGDKDVYGNSIYMDDFFARGS 318 (667) Q Consensus 240 ~~a~~~~~~~~~~t~~~~~~~~~-~~~~~~~~~~~~~~~~~~~v~~~g~v~e~~~~s~~~~~~~~~~~~~~~~~~~~~~~ 318 (667) ...+................... ............++.+.+++..++...|++.++...+.+...+...+..+.+.++. T Consensus 240 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 319 (660) T protein:vir:10 240 SKAAYEAGSSKMLDVYPGGGTRASIAKAVFNYGPQTDDQYAIIVRRDGAIVESVVLSTKEGEKDVYGNNIYLDDYFAKGT 319 (660) T ss_pred eccccCCcceeEEeeeeccceeeEEeeeecccccccccccccccccCCcccceeeeeccccccccccceeeeehhhcCCC Confidence 65554443333333332221111 11222233455667788888899999999999888888887788888888777888 Q ss_pred cceEEEecccccCcccceEEecCCccccccccccccccccccchhHHHHHHhhhcccccccEEecCcCCc-chhhHHHHH Q lcl|NC_012740. 319 SQYIYATAQGWVDGFSGIISLAGGVSANEASTGDRGNDPFIGAMMQGWDLFAERESIHVNLLIAGACAGE-GDAFSTVQK 397 (667) Q Consensus 319 s~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~-~~~~~~v~~ 397 (667) +.++.+............+.+.+|.++... ...++...++++++..+.+.++++++|+..+. +....+|++ T Consensus 320 ~~~v~~~~~~~~~~~~~~~~l~gg~~~~~~--------~t~~~~~~~~~~~~~~~~~~~~~l~~p~~~~~~~~~~~~v~~ 391 (660) T protein:vir:10 320 SNYIYATSLNWPKGFSGIINLSGGISANDK--------VTAGDLMQGWDLFADREALHINLLIAGAVAGEGDEVASTVQK 391 (660) T ss_pred ccEEEEEeccCCCCcccceeeeccccCccc--------cccchhhhhhhhhhhhhhcccceEEEcCcCCCchhhhHHHHH Confidence 888888776666666667788887765432 23456778899999888889999999988763 456788999 Q ss_pred HHHHHHhhcCcEEEEEccCccccccccccCCHHHHHHHhhhhccccccccccCcceEEEEehhhcccccccCceeEechH Q lcl|NC_012740. 398 HAVSIGDERQDCLVMVSPPRSTVVNIPVTTAIDNLIAWREGNSNYSDNNMNINTTYAVIDGNYKYQYDKYNDVNRWVPLA 477 (667) Q Consensus 398 ~~~~~~~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~v~d~~~~~~~~~p~s 477 (667) +|++||++|++||+++|+|.....+....++.+++++||+..+.....+.+++|+|+++||||++++|+.+++++++||| T Consensus 392 al~~~~~~~~~~~aiid~p~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s 471 (660) T protein:vir:10 392 HVVSIADERQDCLAFISPPKGLLVNVPLTRAVDNLIDWRTGAGTFDANNMNISTTYAAIDGNYKYQYDKYNDVNRWVPLA 471 (660) T ss_pred HHHHHHHhhCCEEEEEecCcccccccccccCHHHHHHHHhhcccccccccccCcceEEEEcCceEEecccCCceeEechh Confidence 99999999999999999999998999999999999999999998888888999999999999999999999999999999 Q ss_pred HHHHHHHHHhhhcCCceeeecceeccceeccccccccCChhhhhhhhhcCceEEEEecC-CeEEEEcceecCCCccccee Q lcl|NC_012740. 478 ADIAGLCARTDAVSQPWMSPAGYNRGQIMNVVKLAIEPRKAHRDRLYQAAINPVIGAGG-EGFILMGDKTATTVPSPFDR 556 (667) Q Consensus 478 g~vAg~~a~~d~~~g~~~span~~~~~i~g~~~~~~~~~~~e~~~Ln~~gIn~i~~~~~-~G~~~wG~rT~~~~~~~~~~ 556 (667) |++||+|||+|.++||||||+|+++.+|.|+.++++.+++.|++.||++|||||++|++ +|+++||+||+++++++||| T Consensus 472 g~~AGl~Ar~D~~~g~~~sPan~~~~~i~g~~~~~~~~~~~e~~~Ln~~gIn~i~~~~~~~G~~~wG~rT~~~~~s~~~~ 551 (660) T protein:vir:10 472 ADLAGLCARTDDVSQPWMSPAGYNRGQILNVLKLAIEPRQAQRDRMYQEAINPVVGFAGGDGFVLFGDKTATKVPSPMDH 551 (660) T ss_pred HHHHHHHHHhhccCCcEEccCCeeeceeeccceeeecCChhhHHhHhhCCceEEEEeeCCCcEEEEcccccCCCCcccce Confidence 99999999999999999999999999999999999999999999999999999999986 79999999999999899999 Q ss_pred eehhhhhHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhcCCeeeeEEEEcccCCCHHHhhCCeEEEEEEEE Q lcl|NC_012740. 557 INVRRLFNMLKKNIGDSSKYKLFENNDNFTRASFRMEVSQYLSTIRSLGGIYDFRVQCDTTNNTPDVIDRNEFVASMFIK 636 (667) Q Consensus 557 i~vrR~~~~i~~~l~~~~~~~v~epn~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~~i~~~ 636 (667) |||||||+||+++|++.++|+||||||+.||++|+++|++||++||++|+|.||+|+||+++||++||++|+|+|+|+++ T Consensus 552 i~vrR~~~~i~~si~~~~~~~v~epn~~~l~~~i~~~i~~fL~~l~~~gal~g~~V~~d~~~nt~~di~~G~~~~~i~~~ 631 (660) T protein:vir:10 552 INVRRLFNMLKKNIGDASKYKLFELNDNFTRSSFRMEVSQYLDGIKALGGIYEGRVVCDTTVNTPAVIDRNEFIANIYVK 631 (660) T ss_pred EehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEcCCCCCHHHhhCCeEEEEEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ecCCceEEEEEEEEeecCeeHHHHHHHHh Q lcl|NC_012740. 637 PAKSINYIMLNFTAVATGADFDEIIGPAN 665 (667) Q Consensus 637 p~~p~e~i~~~~~~~~~~~~~~e~~~~~~ 665 (667) |++|||||+|||+|++++++|+|++++|- T Consensus 632 P~~pae~I~~~~~~~~~~~~~~e~~~~~~ 660 (660) T protein:vir:10 632 PARSINYITLNFVATSTGADFDELIGPLV 660 (660) T ss_pred ecCCccEEEEEEEEeecCccHHHHhhhcC Confidence 99999999999999999999999999999 No 5 >protein:vir:101804 Length: 663 # NCBI annotation: gp18 # Family: family:all:661 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238881;genbank:gi:66391956;genbank:GeneID:3416631 Probab=100.00 E-value=7.6e-162 Score=903.89 Aligned_cols=656 Identities=61% Similarity=1.041 Sum_probs=539.8 Q ss_pred CceecCceEEEEecCCCcccc-cCCCceEEEeeccCCCCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHHcCCCeEE Q lcl|NC_012740. 1 MTLLSPGFETKETTLSTTIVQ-SATGRAALVGKFQWGPAFQIVQVTNEVELVNKFGQPDNNTADYFMSGANFLQYGNDLR 79 (667) Q Consensus 1 ~~~~~PGVyvee~~~~~~~~~-~~ts~~afvG~~~~Gp~~~p~~i~s~~e~~~~FG~~~~~~~~~~~v~~~f~ngG~~~~ 79 (667) |+|+||||||||++ ++++++ ++||++||||+|+|||+|+|++|+||.||++.||++.+.+|++|++++||+|||++|| T Consensus 1 ~~~~~Pgvyv~e~~-~~~~i~~~~t~~~~~vG~~~~Gp~~~p~~v~~~~~~~~~fg~~~~~~~~~~~~~~~f~ngg~~~~ 79 (663) T protein:vir:10 1 MALLSPGIEMKETS-INSTVVRSATGRAAIVGKFAWGPAYEVRQVTNEVELVDMFGSPDNVTAPYFMSAMNFLQYGNDLR 79 (663) T ss_pred CceecCceEEEEec-CCccccccCcccceeEeecccCCCCccEEecCHHHHHHhcCCcCCcchhHHHHHHHHHhCCCeEE Confidence 99999999999997 555555 5799999999999999999999999999999999999999999999999999999999 Q ss_pred EEEcCCcccccccccccccccceeeeccccccccceeeEeeeccceeccccceeecccccceeeeecccccccccccccc Q lcl|NC_012740. 80 VVRVLNKEKAKNATALAGNVEFEITNEGSNYEVGDTIKIKHNRQDIETAGKVTKVDGDGKVKGVFIPTGKIIAHAKAIGV 159 (667) Q Consensus 80 vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~a~~~~~ 159 (667) ||||.++++++++.........+...++....+|+.+.+............+...+..+.....+...+........... T Consensus 80 vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~i~v~~~~~~~~~~~~~~~~~~~~~~~~v~~~ta~~~~~~~~v~~ 159 (663) T protein:vir:10 80 LVRVIDMEKAKNASPLVNQVSVTITTEGQGYTVGDAITVKYNNATITEAGKVTAVDSDGKIKSLFVPTAEIIAKTRQLGT 159 (663) T ss_pred EEEccCCccccccccccccceeEEeecccccccccccccccccccccccccceeeecccceEEEeecccccccccccccc Confidence 99999888887777777777777777777778888887776665555555555555555555555555554444455555 Q ss_pred cccccccceEEEEEeecccccceeeeceeeeceeeeeeccccchhhhccccccccccccceeeeeeccccccceeEEEEe Q lcl|NC_012740. 160 YPELDGGWTAEFTSSSGNGSAALSVTKIVTDSGLLLTDLETSRANITNQDFLTKLKKYDMPAVSAIYAGEIGNSLEVEIL 239 (667) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~t~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~A~~~G~~gn~i~v~i~ 239 (667) ..........++................+.+...........................+.+.+.+..+|.|||.+++.+. T Consensus 160 ~~~~~~~~~~~~s~~s~~~~~a~~v~~v~~d~~~~v~~~~~a~~~~t~~~~~~~~~~~~~~~i~A~~~G~~Gn~i~V~i~ 239 (663) T protein:vir:10 160 YPTLGDNWRIDVSGASGGSAAALALGNIVVDSGVTFGNSEDAPAVMTSPAVMEKYAKFGMPLISAVYPGEIGSTVEVEIV 239 (663) T ss_pred ceeeccceeeEeeeccCccccccccceeccccceEEeeccccccccccccccccccccccceEEeccCCcccceeeeeec Confidence 55555555555555444443334444455555555544444444444444445555667778889999999999999887 Q ss_pred ecccccccceeeeeeeecccc-cccceeeeeccccccccceeeeeccceeeeeEeeeccCCccccccccccchhhhcccc Q lcl|NC_012740. 240 ARSSFSGAVAPELTMYPFGGT-RAAARNLIPYAPQNDNQYAFIVRRDGVVVESYVLSTLKGDKDVYGNSIYMDDFFARGS 318 (667) Q Consensus 240 ~~a~~~~~~~~~~t~~~~~~~-~~~~~~~~~~~~~~~~~~~~~v~~~g~v~e~~~~s~~~~~~~~~~~~~~~~~~~~~~~ 318 (667) ...+......... ...... ......+.......++.+.+++..++.+.|.+.++...+.+...+...+..+.+.++. T Consensus 240 ~~~~~~~~~~~~~--~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~~~~~~~~~~~~~~~~~ 317 (663) T protein:vir:10 240 SKTAFNSGAQQTI--YPFGGTRTSNARGVIQYGPMTDDQFAIIVRRDGIVVESTVLSTRKGDRDVYGSNIFMDDYFRNGG 317 (663) T ss_pred cccccccccccce--ecccccccccccceeecccccccceeeEeecCCcceeeecccccccccccccchhhhhhhhcCCc Confidence 7655544333222 222222 2233445556667778888889889998899999988888888888888888888888 Q ss_pred cceEEEecccccCcccceEEecCCccccccccccccccccccchhHHHHHHhhhcccccccEEecCcCC-cchhhHHHHH Q lcl|NC_012740. 319 SQYIYATAQGWVDGFSGIISLAGGVSANEASTGDRGNDPFIGAMMQGWDLFAERESIHVNLLIAGACAG-EGDAFSTVQK 397 (667) Q Consensus 319 s~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~-~~~~~~~v~~ 397 (667) +.++.+.....+......+.+++|.++.... ...+...++..+...+.+.+++++++.... ......+++. T Consensus 318 ~~~~~~~~~~~~~~~~~~~~l~gg~d~~~~~--------~~~~~~~~~~~l~~~~~~~~~~~i~~~~~~~~~~~~~~v~~ 389 (663) T protein:vir:10 318 SNFIFASSEGWPAGFTGIIQLGGGTSANADV--------GADELIKGWDLFSDREALHVNLMIAGACGSDGAEIASTVQK 389 (663) T ss_pred ceEEEEeecccCccccceeEeccccCCcccc--------chhhhHHHHHhhhcccccceeEEEeccCCCCchhhHHHHHH Confidence 8888777666666666778888888765322 234567778888888888888888876543 3455678999 Q ss_pred HHHHHHhhcCcEEEEEccCccccccccccCCHHHHHHHhhhhcc---ccccccccCcceEEEEehhhcccccccCceeEe Q lcl|NC_012740. 398 HAVSIGDERQDCLVMVSPPRSTVVNIPVTTAIDNLIAWREGNSN---YSDNNMNINTTYAVIDGNYKYQYDKYNDVNRWV 474 (667) Q Consensus 398 ~~~~~~~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~s~~~~~~~p~~~v~d~~~~~~~~~ 474 (667) +|++||+++++||+++|+|..........++.+++++||+.... ......+++|+|+++||||++++|+.+++.+++ T Consensus 390 ~l~~~a~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~ 469 (663) T protein:vir:10 390 YVVSLADDRQDCVAIVNPPAELMVGIPTSTAVKNIVEWRNGMTGSGEVVDNNMNISSTYAFIIGNYKYQYDKYNDINRWV 469 (663) T ss_pred HHHHHHHhhCCEEEEEecCcccccccccccchHHHHHHHHhccccccchhhhcccCccceEEEcCceEEecccCCceEEe Confidence 99999999999999999999988888888999999999986542 234456789999999999999999999999999 Q ss_pred chHHHHHHHHHHhhhcCCceeeecceeccceeccccccccCChhhhhhhhhcCceEEEEecC-CeEEEEcceecCCCccc Q lcl|NC_012740. 475 PLAADIAGLCARTDAVSQPWMSPAGYNRGQIMNVVKLAIEPRKAHRDRLYQAAINPVIGAGG-EGFILMGDKTATTVPSP 553 (667) Q Consensus 475 p~sg~vAg~~a~~d~~~g~~~span~~~~~i~g~~~~~~~~~~~e~~~Ln~~gIn~i~~~~~-~G~~~wG~rT~~~~~~~ 553 (667) ||||++||+|||+|.++||||||+|+++.+|.|++++++.+++.|++.||++|||||+.|++ +|+++||+||+++++++ T Consensus 470 p~sg~vAGl~Ar~D~~~g~~~sPan~~~~~i~g~~~~~~~~~~~e~~~Ln~~gin~i~~~~~~~G~~~wG~rT~~~~~s~ 549 (663) T protein:vir:10 470 PLAADIAGLCAYTDQVSHPWMSPAGYRRGQIRNCIKLAIEPKQSMRDTMYQVAINPVTGFAGGDGFVLFGDKMATQVPSP 549 (663) T ss_pred chhHHHHHHHHHhhccCCceEccCCceeccccccccceeecChhHHHHHhhCCceEEEEEeCCCcEEEEcccccCCCCcc Confidence 99999999999999999999999999999999999999999999999999999999999997 79999999999999899 Q ss_pred ceeeehhhhhHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhcCCeeeeEEEEcccCCCHHHhhCCeEEEEE Q lcl|NC_012740. 554 FDRINVRRLFNMLKKNIGDSSKYKLFENNDNFTRASFRMEVSQYLSTIRSLGGIYDFRVQCDTTNNTPDVIDRNEFVASM 633 (667) Q Consensus 554 ~~~i~vrR~~~~i~~~l~~~~~~~v~epn~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~~i 633 (667) |+||||||||+||+++|++.++|+||||||+.+|.+|+++|+.||++||++|+|.||+|+||+++||+++|++|+|+|+| T Consensus 550 ~~~i~vrR~~~~i~~si~~~~~~~v~epn~~~l~~~i~~~i~~~L~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~~i 629 (663) T protein:vir:10 550 FDRINVRRLFNMLKKNIGDTSKYELFENNDAFTRQSFRMETSQYLDGIRSLGGCYDFRVVCDTTNNTPNVIDRNEFVGTI 629 (663) T ss_pred cceEehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEcCCCCCHHHhhCCeEEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEecCCceEEEEEEEEeecCeeHHHHHHHHhcC Q lcl|NC_012740. 634 FIKPAKSINYIMLNFTAVATGADFDEIIGPANQA 667 (667) Q Consensus 634 ~~~p~~p~e~i~~~~~~~~~~~~~~e~~~~~~~~ 667 (667) +++|++|+|||+|||+|++++++|+|++++|||| T Consensus 630 ~~~p~~pae~i~~~~~~~~~~~~~~e~~~~~~~~ 663 (663) T protein:vir:10 630 YVKPPRSINYITLNMVATSTGANFDELIGPMQLA 663 (663) T ss_pred EEEecCCcceEEEEEEEeecCccHHHHHHHHhcC Confidence 9999999999999999999999999999999999 No 6 >protein:vir:101187 Length: 663 # NCBI annotation: tail sheath protein # Family: family:all:661 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932509;genbank:gi:37651635;genbank:GeneID:2610680 Probab=100.00 E-value=2.4e-161 Score=901.12 Aligned_cols=657 Identities=61% Similarity=1.046 Sum_probs=530.2 Q ss_pred CceecCceEEEEecCCCcccccCCCceEEEeeccCCCCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHHcCCCeEEE Q lcl|NC_012740. 1 MTLLSPGFETKETTLSTTIVQSATGRAALVGKFQWGPAFQIVQVTNEVELVNKFGQPDNNTADYFMSGANFLQYGNDLRV 80 (667) Q Consensus 1 ~~~~~PGVyvee~~~~~~~~~~~ts~~afvG~~~~Gp~~~p~~i~s~~e~~~~FG~~~~~~~~~~~v~~~f~ngG~~~~v 80 (667) |+|+||||||||++....+.+++||++||||+|+|||+|+|++|+||.||++.||++.+.+|++|++++||+|||++||| T Consensus 1 ~~~~~PgVyv~e~~~~~~i~~v~ts~~~fvG~~~~Gp~~~p~~i~s~~d~~~~fG~~~~~~~~~~~v~~~f~ngg~~~~v 80 (663) T protein:vir:10 1 MALLSPGIEMKETSINSTVVRSATGRAAIVGKFAWGPAYEVRQVTNEVELVDMFGSPDNVTAPYFMSAMNFLQYGNDLRL 80 (663) T ss_pred CceecCceEEEEecCcccccccCccceeEEeeeccCCCCccEEecCHHHHHHHhCCcCccchhHHHHHHHHHhCCCeEEE Confidence 99999999999997444445567999999999999999999999999999999999999999999999999999999999 Q ss_pred EEcCCcccccccccccccccceeeeccccccccceeeEeeeccceeccccceeecccccceeeeeccccccccccccccc Q lcl|NC_012740. 81 VRVLNKEKAKNATALAGNVEFEITNEGSNYEVGDTIKIKHNRQDIETAGKVTKVDGDGKVKGVFIPTGKIIAHAKAIGVY 160 (667) Q Consensus 81 vRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~a~~~~~~ 160 (667) |||.++++++++.........+....+....+|+.+.+................+.++.....+.+.+............ T Consensus 81 vRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~v~~~~~~~~~~~~~~~~~~n~~~~~v~~~~a~~~~~~~~v~~~ 160 (663) T protein:vir:10 81 VRVIDMEKAKNASPLVNQVSVTITTEGQGYTVGDAITVKYNNATITEAGKVTAVDSDGKIKSLFVPTAEIIAKTRQLGTY 160 (663) T ss_pred EEccCCcccccccccCCcceeeeeccccCccccccccccccccccccccceeeeccCCceEEEEecccccccccccccee Confidence 99998887777777666666777777777788888877665555444445555555555545555544444444444444 Q ss_pred ccccccceEEEEEeecccccceeeeceeeeceeeeeeccccchhhhccccccccccccceeeeeeccccccceeEEEEee Q lcl|NC_012740. 161 PELDGGWTAEFTSSSGNGSAALSVTKIVTDSGLLLTDLETSRANITNQDFLTKLKKYDMPAVSAIYAGEIGNSLEVEILA 240 (667) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~t~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~A~~~G~~gn~i~v~i~~ 240 (667) ..........+................+.+.+........+..................+.+.+.++|.|||.+++.+.. T Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~v~~vv~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~a~~~G~~Gn~i~v~i~~ 240 (663) T protein:vir:10 161 PTLGDNWRIDVSGASGGSAAALALGNIVVDSGVTFGNSEDAPAVMTSPAVMEKYAKFGMPLVSAVYPGEIGSTVEVEIVS 240 (663) T ss_pred eeccccceeEeeeccccccccccccceecccceeeEeeccccccccccchhhhcccccceeeeeecccccccceeEEecc Confidence 44444444444333332222223333333444444444444444444444444555666778899999999999998876 Q ss_pred cccccccceeeeeeeeccc-ccccceeeeeccccccccceeeeeccceeeeeEeeeccCCccccccccccchhhhccccc Q lcl|NC_012740. 241 RSSFSGAVAPELTMYPFGG-TRAAARNLIPYAPQNDNQYAFIVRRDGVVVESYVLSTLKGDKDVYGNSIYMDDFFARGSS 319 (667) Q Consensus 241 ~a~~~~~~~~~~t~~~~~~-~~~~~~~~~~~~~~~~~~~~~~v~~~g~v~e~~~~s~~~~~~~~~~~~~~~~~~~~~~~s 319 (667) .......... ....... .......+.......++.+.+++..++.+.+.+.++...+.+...+...++.+.+.++.+ T Consensus 241 ~~~~~~~~~~--~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~~~~~~~~~~~~~~~~~~ 318 (663) T protein:vir:10 241 KTAFNSGAQQ--TIYPFGGTRTSNARGVIQYGPMTDDQFAIIVRRDGIVVESTVLSTRKGDRDVYGSNIFMDDYFRNGGS 318 (663) T ss_pred cccccccccc--cccccccccccccceeeeeccccccceeEEEecCCcceeeeeeeecccccccchhhhhhhhhhccCcc Confidence 6544333222 2222222 222333444556666677888898888888888899999888888888888888888888 Q ss_pred ceEEEecccccCcccceEEecCCccccccccccccccccccchhHHHHHHhhhcccccccEEecCcCC-cchhhHHHHHH Q lcl|NC_012740. 320 QYIYATAQGWVDGFSGIISLAGGVSANEASTGDRGNDPFIGAMMQGWDLFAERESIHVNLLIAGACAG-EGDAFSTVQKH 398 (667) Q Consensus 320 ~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~-~~~~~~~v~~~ 398 (667) .++.+.....+......+.+++|.++.... ...++..++..+...+.+++++++++.... ......+|+.+ T Consensus 319 ~~~~~~~~~~~~~~~~~~~l~gg~d~~~~~--------~~~~~~~~~~~l~~~~~~~~~~~i~~~~~~~~~~~~~~v~~a 390 (663) T protein:vir:10 319 NFIFASSEGWPAGFTGIIQLGGGTSANADV--------GADELIKGWDLFSDREALHVNLMIAGACGSDGAEIASTVQKY 390 (663) T ss_pred eEEEEeecccCccccceeEcccccCCCccc--------cchhhHHHHHhhhcccccceeEEEeccCCCCchhhHHHHHHH Confidence 888877766666666677888888765332 234566777888888888888888876543 44566889999 Q ss_pred HHHHHhhcCcEEEEEccCccccccccccCCHHHHHHHhhhhccc---cccccccCcceEEEEehhhcccccccCceeEec Q lcl|NC_012740. 399 AVSIGDERQDCLVMVSPPRSTVVNIPVTTAIDNLIAWREGNSNY---SDNNMNINTTYAVIDGNYKYQYDKYNDVNRWVP 475 (667) Q Consensus 399 ~~~~~~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~s~~~~~~~p~~~v~d~~~~~~~~~p 475 (667) |++||+++++||+|+|+|...........+.+++++||+..... .....+++|+|+++||||++++|+.+++.+++| T Consensus 391 l~~~a~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~l~~P~~~~~d~~~~~~~~~p 470 (663) T protein:vir:10 391 VVSLADDRQDCVAIVNPPAELMVGIPTSTAVKNIVEWRNGMTGSGEVVDNNMNISSTYAFIIGNYKYQYDKYNDINRWVP 470 (663) T ss_pred HHHHHHhhCCEEEEEecCcccccccccccchHHHHHHHHhccccccchhhccccCcceEEEEccceEEecccCCceEEec Confidence 99999999999999999999888888889999999999875432 234567899999999999999999999999999 Q ss_pred hHHHHHHHHHHhhhcCCceeeecceeccceeccccccccCChhhhhhhhhcCceEEEEecC-CeEEEEcceecCCCcccc Q lcl|NC_012740. 476 LAADIAGLCARTDAVSQPWMSPAGYNRGQIMNVVKLAIEPRKAHRDRLYQAAINPVIGAGG-EGFILMGDKTATTVPSPF 554 (667) Q Consensus 476 ~sg~vAg~~a~~d~~~g~~~span~~~~~i~g~~~~~~~~~~~e~~~Ln~~gIn~i~~~~~-~G~~~wG~rT~~~~~~~~ 554 (667) |||++||+|||+|.++||||||+|+++.+|.|+.++++.+++.|++.||++|||||+.|++ +|+++||+||+++++++| T Consensus 471 ~s~~vAGl~Ar~D~~~g~~~sPan~~~~~i~g~~~~~~~~~~~~~~~Ln~~gin~i~~~~~~~G~~~wG~rT~s~~~s~~ 550 (663) T protein:vir:10 471 LAADIAGLCAYTDQVSHPWMSPAGYRRGQIRNCIKLAIEPKQSMRDTMYQVAINPVTGFAGGDGFVLFGDKMATQVPSPF 550 (663) T ss_pred hhHHHHHHHHHhhccCCceEccCCceeccccccccceeccChhHHHHHhhCCceEEEEEeCCCcEEEEcccccCCCCccc Confidence 9999999999999999999999999999999999999999999999999999999999997 799999999999998899 Q ss_pred eeeehhhhhHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhcCCeeeeEEEEcccCCCHHHhhCCeEEEEEE Q lcl|NC_012740. 555 DRINVRRLFNMLKKNIGDSSKYKLFENNDNFTRASFRMEVSQYLSTIRSLGGIYDFRVQCDTTNNTPDVIDRNEFVASMF 634 (667) Q Consensus 555 ~~i~vrR~~~~i~~~l~~~~~~~v~epn~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~~i~ 634 (667) +||||||||+||+++|++.++|+||||||+.+|++|+++|++||++||++|+|.||+|+||+++||+++|++|+|+|+|+ T Consensus 551 ~~i~vrR~~~~i~~si~~~~~~~v~e~n~~~l~~~i~~~i~~~L~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~~i~ 630 (663) T protein:vir:10 551 DRINVRRLFNMLKKNIGDTSKYELFENNDAFTRQSFRMETSQYLDGIRSLGGCYDFRVVCDTTNNTPNVIDRNEFVGTIY 630 (663) T ss_pred ceEehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhcCceeeeEEEEcCCCCCHHHhhCCeEEEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEecCCceEEEEEEEEeecCeeHHHHHHHHhcC Q lcl|NC_012740. 635 IKPAKSINYIMLNFTAVATGADFDEIIGPANQA 667 (667) Q Consensus 635 ~~p~~p~e~i~~~~~~~~~~~~~~e~~~~~~~~ 667 (667) ++|++|+|||+|||+|++++++|+|++++|||| T Consensus 631 ~~p~~pae~i~~~~~~~~~~~~~~e~~~~~~~~ 663 (663) T protein:vir:10 631 VKPPRSINYITLNMVATSTGANFDELIGPMQLA 663 (663) T ss_pred EEecCCcceEEEEEEEeecCccHHHHHHHHhcC Confidence 999999999999999999999999999999999 No 7 >protein:vir:100539 Length: 663 # NCBI annotation: gp18 tail sheath monomer # Family: family:all:661 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656380;genbank:gi:109290131;genbank:GeneID:4156517 Probab=100.00 E-value=5.4e-161 Score=899.23 Aligned_cols=658 Identities=60% Similarity=1.028 Sum_probs=540.1 Q ss_pred CceecCceEEEEecCCCcccccCCCceEEEeeccCCCCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHHcCCCeEEE Q lcl|NC_012740. 1 MTLLSPGFETKETTLSTTIVQSATGRAALVGKFQWGPAFQIVQVTNEVELVNKFGQPDNNTADYFMSGANFLQYGNDLRV 80 (667) Q Consensus 1 ~~~~~PGVyvee~~~~~~~~~~~ts~~afvG~~~~Gp~~~p~~i~s~~e~~~~FG~~~~~~~~~~~v~~~f~ngG~~~~v 80 (667) |+|+||||||||+|++..+.+++||++||||+|+|||+|+|++|+||.||++.||++++.+|++|+|++||+|||++||| T Consensus 1 ~~~~~Pgvyv~e~~~~~~~~~v~t~~~~fvG~~~~gp~~~p~~i~s~~~~~~~fG~~~~~~~~~~~v~~~f~ngg~~~~v 80 (663) T protein:vir:10 1 MALLSPGIEMKETSINSTVVRSATGRAALVGKFAWGPAYEIRQVTNEVELVDMFGSPDNVTAPYFMSAMNFLQYGNDLRL 80 (663) T ss_pred CccccCceEEEEecCcccccccccccceeeeccccCCCCcCEEecCHHHHHHHcCCcccccchHHHHHHHHHhCCCeEEE Confidence 99999999999998666666678999999999999999999999999999999999999999999999999999999999 Q ss_pred EEcCCcccccccccccccccceeeeccccccccceeeEeeeccceeccccceeecccccceeeeeccccccccccccccc Q lcl|NC_012740. 81 VRVLNKEKAKNATALAGNVEFEITNEGSNYEVGDTIKIKHNRQDIETAGKVTKVDGDGKVKGVFIPTGKIIAHAKAIGVY 160 (667) Q Consensus 81 vRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~a~~~~~~ 160 (667) |||.+.++++++++.......+...++..+.+|+.+.+................++.+.....+.+.+.....+...... T Consensus 81 vRv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~a~~~~~a~~~~~~ 160 (663) T protein:vir:10 81 VRVIDMEQAKNASPLFNQIEVTITTEGQGYTVGDTVSIKHNTTTVTEEGKVTKVDADGKIKALFVPSSAVIAKAKQLGTY 160 (663) T ss_pred EecCCcccccccccccccceeeEeecccCccccceeeecccccccccCcceeeeccCCceeEEEeccccccccccccccc Confidence 99999888888887777777777778888889998887776665555555666666666666666655554455555555 Q ss_pred ccccccceEEEEEeecccccceeeeceeeeceeeeeeccccchhhhccccccccccccceeeeeeccccccceeEEEEee Q lcl|NC_012740. 161 PELDGGWTAEFTSSSGNGSAALSVTKIVTDSGLLLTDLETSRANITNQDFLTKLKKYDMPAVSAIYAGEIGNSLEVEILA 240 (667) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~t~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~A~~~G~~gn~i~v~i~~ 240 (667) ......+...+................+.+.+.........................+.+...+..+|.+|+.+.+.+.. T Consensus 161 ~~~~~a~~~~v~~~~~~~~~a~av~~i~~dg~vt~~~~~~a~~~~~~~~~~~~~~~~~~~~~~a~~~g~~G~~i~v~~~~ 240 (663) T protein:vir:10 161 PVLGDNWRAEVSGASGGSAATLTLGGIVVDSGVTFGNSEEAPDVMTSTKVLANFAKYGMPLISAVYPGEIGSTVEVEVIS 240 (663) T ss_pred cccccceeeEEeeccccccccceeEeeecCCceeEEeeeccccccccceeeeeccccccceeeeecccccCcceeEeecc Confidence 55555555555554444444444444444444544444444444444455555566677788889999999999998877 Q ss_pred cccccccceeeeeeeecccccccceeeeeccccccccceeeeeccceeeeeEeeeccCCccccccccccchhhhcccccc Q lcl|NC_012740. 241 RSSFSGAVAPELTMYPFGGTRAAARNLIPYAPQNDNQYAFIVRRDGVVVESYVLSTLKGDKDVYGNSIYMDDFFARGSSQ 320 (667) Q Consensus 241 ~a~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~v~e~~~~s~~~~~~~~~~~~~~~~~~~~~~~s~ 320 (667) ................ +............++..++++.+++..++.+.|++.+++..+++...+...++.+.+.++.+. T Consensus 241 ~~~~~~~~~~~v~~~~-g~~~~~~~~~~~~g~~~~~~~~~~~~~~g~~~e~~~ls~~~~~~~~~~~~~~~~~~~~~~~s~ 319 (663) T protein:vir:10 241 KTAFQSGAAQPIYPFG-GTRASNARSVIQYGPMTDDQFAIIVRRDGIVVESTVLSTRRGDRDVYGNNIFMDDYFRNGSSN 319 (663) T ss_pred cccccccceeeecccC-cccccccccccccccccchhhcccccCCCcccceeeeeccccccccchhhhhhhhhhcCcccc Confidence 6655444332222111 111222233444566667788899999999999999999999988888888888888888899 Q ss_pred eEEEecccccCcccceEEecCCccccccccccccccccccchhHHHHHHhhhcccccccEEecC-cCCcchhhHHHHHHH Q lcl|NC_012740. 321 YIYATAQGWVDGFSGIISLAGGVSANEASTGDRGNDPFIGAMMQGWDLFAERESIHVNLLIAGA-CAGEGDAFSTVQKHA 399 (667) Q Consensus 321 ~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~-~~~~~~~~~~v~~~~ 399 (667) ++.+.....+......+.+++|.++.... ...+...++..+...+.++..++++++ ..+..+...+|+.+| T Consensus 320 ~v~~~~~~~~~~~~~~~~l~gg~~~~~~~--------~~~d~~~~~~~l~~~~~~~~~~~i~~~~~~~~~~~~~~v~~~l 391 (663) T protein:vir:10 320 FIYASSVNWPAGFTGIIQLGGGASANNAV--------GSDELIAGWDLFADREALHVNLMIAGACKSDGVAVASTVQKHV 391 (663) T ss_pred eeEeeccccCcccceeEEecccccCcccc--------hhhhhhhHHhhhccccccCceEEEeecCCCCchhhHHHHHHHH Confidence 98887776666666667888887654322 234455666777766666665555544 444455668899999 Q ss_pred HHHHhhcCcEEEEEccCccccccccccCCHHHHHHHhhhhcccc---ccccccCcceEEEEehhhcccccccCceeEech Q lcl|NC_012740. 400 VSIGDERQDCLVMVSPPRSTVVNIPVTTAIDNLIAWREGNSNYS---DNNMNINTTYAVIDGNYKYQYDKYNDVNRWVPL 476 (667) Q Consensus 400 ~~~~~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~s~~~~~~~p~~~v~d~~~~~~~~~p~ 476 (667) ++||+++++||+|+|+|.....+.......+++.+||+...... ....+++|+|+++||||++++|+.+++.+++|| T Consensus 392 ~~~~~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~l~~p~~~~~d~~~~~~~~~p~ 471 (663) T protein:vir:10 392 VALADDRQDCVAFVNPPSELLVGVPTTQAVKNIVEWRNGVTTGGEVVDNNMNISSTYAFISGNYKYQYDKYNDINRWVPL 471 (663) T ss_pred HHHHHhhCCEEEEEecCcccccccchhhhHHHHHHHhhhccccchhhhhhcccCcceEEEEecceeEecccCCceEEech Confidence 99999999999999999998888888888999999998653322 345679999999999999999999999999999 Q ss_pred HHHHHHHHHHhhhcCCceeeecceeccceeccccccccCChhhhhhhhhcCceEEEEecC-CeEEEEcceecCCCcccce Q lcl|NC_012740. 477 AADIAGLCARTDAVSQPWMSPAGYNRGQIMNVVKLAIEPRKAHRDRLYQAAINPVIGAGG-EGFILMGDKTATTVPSPFD 555 (667) Q Consensus 477 sg~vAg~~a~~d~~~g~~~span~~~~~i~g~~~~~~~~~~~e~~~Ln~~gIn~i~~~~~-~G~~~wG~rT~~~~~~~~~ 555 (667) ||++||+|||+|.++||||||+|+++.+|.|+.++.+.+++.|++.||++|||+|+.|++ +|+++||+||+++++++|+ T Consensus 472 s~~vAGl~Ar~D~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gin~i~~~~~~~G~~~wG~rT~s~~~s~~~ 551 (663) T protein:vir:10 472 SADIAGLCAYTDQVGHPWMSPAGYRRGQLRNTIKLAIEPKQSLRDTMYQVSINPVTGFAGGDGFVLFGDKMATQVPSPFD 551 (663) T ss_pred HHHHHHHHHHhhccCCcEEccCCeeecceeccccceeecCchhHHHHHhCCCcEEEEeeCCCcEEEEcccccCCCCcccc Confidence 999999999999999999999999999999999999999999999999999999999997 7999999999999988999 Q ss_pred eeehhhhhHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhcCCeeeeEEEEcccCCCHHHhhCCeEEEEEEE Q lcl|NC_012740. 556 RINVRRLFNMLKKNIGDSSKYKLFENNDNFTRASFRMEVSQYLSTIRSLGGIYDFRVQCDTTNNTPDVIDRNEFVASMFI 635 (667) Q Consensus 556 ~i~vrR~~~~i~~~l~~~~~~~v~epn~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~~i~~ 635 (667) ||||||||+||+++|+++++|+||||||+.+|++|+++|++||++||++|+|.||+|+||+++||+++|++|+|+|+|++ T Consensus 552 ~i~vrR~~~~i~~si~~~~~~~v~epn~~~l~~~i~~~i~~~L~~l~~~gal~gf~V~~d~~~nt~~~i~~G~~~~~i~~ 631 (663) T protein:vir:10 552 RINVRRLFNMLKKNIGDTSKYELFENNDAFTRQSFRMEVSQYLDNIRSLGGVYDFRVVCDTTNNTPQVIDSNEFVATIYI 631 (663) T ss_pred eEehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEcCCCCCHHHhhCCeEEEEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EecCCceEEEEEEEEeecCeeHHHHHHHHhcC Q lcl|NC_012740. 636 KPAKSINYIMLNFTAVATGADFDEIIGPANQA 667 (667) Q Consensus 636 ~p~~p~e~i~~~~~~~~~~~~~~e~~~~~~~~ 667 (667) +|++|+|||+|||+|++++++|+|++++|||| T Consensus 632 ~p~~pae~I~~~~~~~~~~~~f~e~~~~~~~~ 663 (663) T protein:vir:10 632 KAPRSINYITLNFVATSTGANFDELIGPAQLA 663 (663) T ss_pred EecCCcceEEEEEEEEecCccHHHHHHHHhcC Confidence 99999999999999999999999999999999 No 8 >protein:vir:103456 Length: 659 # NCBI annotation: tail sheath monomer # Family: family:all:661 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803108;genbank:gi:116326388;genbank:GeneID:4405485 Probab=100.00 E-value=9.4e-161 Score=897.91 Aligned_cols=657 Identities=64% Similarity=1.086 Sum_probs=523.1 Q ss_pred CceecCceEEEEecCCCcccccCCCceEEEeeccCCCCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHHcCCCeEEE Q lcl|NC_012740. 1 MTLLSPGFETKETTLSTTIVQSATGRAALVGKFQWGPAFQIVQVTNEVELVNKFGQPDNNTADYFMSGANFLQYGNDLRV 80 (667) Q Consensus 1 ~~~~~PGVyvee~~~~~~~~~~~ts~~afvG~~~~Gp~~~p~~i~s~~e~~~~FG~~~~~~~~~~~v~~~f~ngG~~~~v 80 (667) |+|+||||||||+|+++++++++||++||||+|+|||+|+|++|+||.||+++||++++.++++|+|++||+|||++||| T Consensus 1 ~~~~~PgVyv~e~~~~~~~~~~~ts~~~fvG~~~~Gp~~~p~~i~s~~d~~~~fG~~~~~~~~~~~v~~~f~ngg~~~~v 80 (659) T protein:vir:10 1 MTLLSPGIELKETTVQSTVVNNSTGTAALAGKFQWGPAFQIKQVTNEVDLVNTFGQPTAETADYFMSAMNFLQYGNDLRV 80 (659) T ss_pred CceecCceEEEEecCCceecccCccceEEEecccCCCCCccEEecCHHHHHHHcCCcCCCcchhHHHHHHHhhCCCeEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEcCCcccccccccccccccceeeeccccccccceeeEeeeccceeccccceeecccccceeeeeccccccccccccccc Q lcl|NC_012740. 81 VRVLNKEKAKNATALAGNVEFEITNEGSNYEVGDTIKIKHNRQDIETAGKVTKVDGDGKVKGVFIPTGKIIAHAKAIGVY 160 (667) Q Consensus 81 vRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~a~~~~~~ 160 (667) ||+.+.+++.++.........+...++.....+..+..+...........+..++......................+.. T Consensus 81 vRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~v~~vd~~~~~~~~~i~~~~~~~~~~~~g~~ 160 (659) T protein:vir:10 81 VRAVDRDTAKNSSPIAGNIEYTISTPGSNYAVGDKITVKYVSDAIETEGKITEVDTDGKIKKINIPTAKIIAKAKEVGEY 160 (659) T ss_pred EEccCcccccccccccccceeeEeecccccccccceeeeecCCCccccceeeEEecccccceeeeccccccccccccccc Confidence 99998887777777666677666666665555555555443322222223334444333333333333332333323333 Q ss_pred ccccccceEEEEEeecccccceeeeceeeeceeeeeeccccchhhhccccccccccccceeeeeeccccccceeEEEEee Q lcl|NC_012740. 161 PELDGGWTAEFTSSSGNGSAALSVTKIVTDSGLLLTDLETSRANITNQDFLTKLKKYDMPAVSAIYAGEIGNSLEVEILA 240 (667) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~t~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~A~~~G~~gn~i~v~i~~ 240 (667) ...............+...........+.+.............................+...+..+|++++.+++.+.. T Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~a~t~~~~~~~~~~~~~~~v~a~~~G~~g~~~tv~~~~ 240 (659) T protein:vir:10 161 PTLGSNWTAEISSSSSGLAAVITLGKIITDSGILLAEIENAEAAMTAVDFQANLKKYGIPGVVALYPGELGDKIEIEIVS 240 (659) T ss_pred ceeeeeeeeeeeeeccccceeeEEeeeecCCceeEEeeccccccccccccccceeecccccccccccceecccceEEEec Confidence 22222233333333322222222222222222222222222222222233333333445556777889999999998887 Q ss_pred cccccccceeeeeeeeccccc-ccceeeeeccccccccceeeeeccceeeeeEeeeccCCccccccccccchhhhccccc Q lcl|NC_012740. 241 RSSFSGAVAPELTMYPFGGTR-AAARNLIPYAPQNDNQYAFIVRRDGVVVESYVLSTLKGDKDVYGNSIYMDDFFARGSS 319 (667) Q Consensus 241 ~a~~~~~~~~~~t~~~~~~~~-~~~~~~~~~~~~~~~~~~~~v~~~g~v~e~~~~s~~~~~~~~~~~~~~~~~~~~~~~s 319 (667) ..++................. ..........+..++.+.+.+...+.+.+++.++...+.........+....+.+..+ T Consensus 241 ~a~~~~~~~v~v~~~~~~~~~a~~~t~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 320 (659) T protein:vir:10 241 KADYAKGASALLPIYPGGGTRASTAKAVFGYGPQTDSQYAIIVRRNDAIVQSVVLSTKRGEKDIYDSNIYIDDFFAKGGS 320 (659) T ss_pred hhhccccceeeeeeeeecccccccceeeeeeccccccchhhccccccceeeeeeeeccccccccccchhhhhhhhccCcc Confidence 777665555544444333222 2333444455556667778888888889999888888888877888888888888888 Q ss_pred ceEEEecccccCcccceEEecCCccccccccccccccccccchhHHHHHHhhhcccccccEEecCcCCcc-hhhHHHHHH Q lcl|NC_012740. 320 QYIYATAQGWVDGFSGIISLAGGVSANEASTGDRGNDPFIGAMMQGWDLFAERESIHVNLLIAGACAGEG-DAFSTVQKH 398 (667) Q Consensus 320 ~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~-~~~~~v~~~ 398 (667) .++.+.....+......+.+.+|.+... ....++...++.++...+.+++++|++|++.... ....+|+.+ T Consensus 321 ~~v~~~~~~~~~~~~~~~~l~gg~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~il~~p~~~~~~~~~~~~v~~a 392 (659) T protein:vir:10 321 EYIFATAQNWPEGFSGILTLSGGLSSNA--------EVTAGDLMEAWDFFADRESVDVQLFIAGSCAGESLETASTVQKH 392 (659) T ss_pred cEEEEeecccCCCccceeeecccccccc--------cccchhHHHHHHHhhhccccceeEEEecCCCCcchhhhHHHHHH Confidence 8888877766666666777888776432 2234567788888888888899999999987643 356789999 Q ss_pred HHHHHhhcCcEEEEEccCccccccccccCCHHHHHHHhhhhccccccccccCcceEEEEehhhcccccccCceeEechHH Q lcl|NC_012740. 399 AVSIGDERQDCLVMVSPPRSTVVNIPVTTAIDNLIAWREGNSNYSDNNMNINTTYAVIDGNYKYQYDKYNDVNRWVPLAA 478 (667) Q Consensus 399 ~~~~~~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~v~d~~~~~~~~~p~sg 478 (667) |++||+++++||+++|+|....++.+..++.+++++||+..+.......+++|+|+++||||++++|+.+++++++|||| T Consensus 393 l~~~~~~~~~~~~~~d~p~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~s~~~~l~~p~~~~~d~~~~~~~~~p~sg 472 (659) T protein:vir:10 393 VVSIGDARQDCLVLCSPPRETVVGIPVTRAVDNLVNWRTAAGSYTDNNFNISSTYAAIDGNYKYQYDKYNDVNRWVPLAA 472 (659) T ss_pred HHHHHHhhCCeEEEEcCccccccCCCcccCHHHHHHHHHhcccccccccccCcceEEEEeCcEEEecccCCceEEechHH Confidence 99999999999999999999999999999999999999999988888899999999999999999999999999999999 Q ss_pred HHHHHHHHhhhcCCceeeecceeccceeccccccccCChhhhhhhhhcCceEEEEecCCeEEEEcceecCCCcccceeee Q lcl|NC_012740. 479 DIAGLCARTDAVSQPWMSPAGYNRGQIMNVVKLAIEPRKAHRDRLYQAAINPVIGAGGEGFILMGDKTATTVPSPFDRIN 558 (667) Q Consensus 479 ~vAg~~a~~d~~~g~~~span~~~~~i~g~~~~~~~~~~~e~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~~~~~~~~~i~ 558 (667) ++||+|||+|.++||||||+|+++.+|.|+.++++.+++.|++.||++|||||++|+++|+++||+||+++++++|+||| T Consensus 473 ~~AGl~Ar~D~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~~~~s~~~~i~ 552 (659) T protein:vir:10 473 DIAGLCARTDNVSQTWMSPAGYNRGQILNVIKLAIETRQAQRDRLYQEAINPVTGTGGDGYVLYGDKTATSVPSPFDRIN 552 (659) T ss_pred HHHHHHHHHhccCCceEccCCceeeeeeccccceecCCHhHHHHHhhCCeeEEEEeCCCeEEEEcccccCCCCcccceEe Confidence 99999999999999999999999999999999999999999999999999999999999999999999998888999999 Q ss_pred hhhhhHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhcCCeeeeEEEEcccCCCHHHhhCCeEEEEEEEEec Q lcl|NC_012740. 559 VRRLFNMLKKNIGDSSKYKLFENNDNFTRASFRMEVSQYLSTIRSLGGIYDFRVQCDTTNNTPDVIDRNEFVASMFIKPA 638 (667) Q Consensus 559 vrR~~~~i~~~l~~~~~~~v~epn~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~~i~~~p~ 638 (667) |||||+||+++|+++++|+||||||+.||++|+++|+.||++||++|+|+||+|+||+++||+++|++|+|+|+|+++|+ T Consensus 553 vrR~~~~i~~si~~~~~~~v~e~n~~~l~~~i~~~i~~fL~~l~~~gal~~~~V~~d~~~nt~~~i~~G~~~~~i~~~p~ 632 (659) T protein:vir:10 553 VRRLFNMLKTNIGRSSKYRLFELNNAFTRSSFRTETAQYLQGIKALGGIYEYRVVCDTTNNTPSVIDRNEFVATFYIQPA 632 (659) T ss_pred hhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhcCceeeEEEEEcCCCCCHHHhhCCeEEEEEEEEec Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCceEEEEEEEEeecCeeHHHHHHHHh Q lcl|NC_012740. 639 KSINYIMLNFTAVATGADFDEIIGPAN 665 (667) Q Consensus 639 ~p~e~i~~~~~~~~~~~~~~e~~~~~~ 665 (667) +|+|||+|||+|++++++|+|+++..- T Consensus 633 ~pae~i~~~~~~~~~~~~~~e~~~~~~ 659 (659) T protein:vir:10 633 RSINYITLNFVATATGADFDELTGLAG 659 (659) T ss_pred CCcceEEEEEEEEecCcchHHhhccCC Confidence 999999999999999999999987655 No 9 >protein:vir:106427 Length: 679 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944106;genbank:gi:38640150;genbank:GeneID:2658188 Probab=100.00 E-value=2e-160 Score=896.11 Aligned_cols=656 Identities=53% Similarity=0.930 Sum_probs=517.0 Q ss_pred CceecCceEEEEecCCCcccc-cCCCceEEEeeccCCCCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHHcCCCeEE Q lcl|NC_012740. 1 MTLLSPGFETKETTLSTTIVQ-SATGRAALVGKFQWGPAFQIVQVTNEVELVNKFGQPDNNTADYFMSGANFLQYGNDLR 79 (667) Q Consensus 1 ~~~~~PGVyvee~~~~~~~~~-~~ts~~afvG~~~~Gp~~~p~~i~s~~e~~~~FG~~~~~~~~~~~v~~~f~ngG~~~~ 79 (667) |+|+||||||||++ ++++++ ++||++||||+|+|||+|+|++|+||.||+++||++++.+|++|++++||+|||++|| T Consensus 1 ~~~~~Pgvyv~e~~-~~~~i~~~~t~~~~~vg~~~~gp~~~p~~i~s~~~~~~~fg~~~~~~~~~~~~~~~f~~gg~~~~ 79 (679) T protein:vir:10 1 MTLLSPGVETKEIN-LQTTIARSSTGRAALVGKFNWGPAYQISQVVSEVDLVDKFGRPDDQTADSFFSGVNFLNYGNDLR 79 (679) T ss_pred CceecCceEEEeec-CCcccccCccccceeeecccCCCCccCEEecCHHHHHHHcCCcccccchHHHHHHHHHhCCCeEE Confidence 99999999999997 455555 5799999999999999999999999999999999999999999999999999999999 Q ss_pred EEEcCCcccccccccccccccceeeeccccccccceeeEeeeccceeccccceeecccccceeeeecccccccccccccc Q lcl|NC_012740. 80 VVRVLNKEKAKNATALAGNVEFEITNEGSNYEVGDTIKIKHNRQDIETAGKVTKVDGDGKVKGVFIPTGKIIAHAKAIGV 159 (667) Q Consensus 80 vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~a~~~~~ 159 (667) ||||.++++.+++.+....+..++..++....+++.+++....... ..+.+..++........+.+.......++..+. T Consensus 80 vvrv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~s~~-~~~~~~~~~~~~~~~~~~v~~~~~~~~a~~~~~ 158 (679) T protein:vir:10 80 LVRVLNETKSRNSSALYQSLSYTITSPGVDYKVGDVVNVLQGGNVI-ATGKVTVVNASGGIVAFYVPTAAIIDKAKSLND 158 (679) T ss_pred EEEccCcccccccccccccccccccccccccccccceeeeeCCCcc-cceeEEEeeccCceeeeeecccccccccccccc Confidence 9999998888787777777777788888878888877765543222 122334445555555555555555555555555 Q ss_pred cccccccceEEEEEeeccc--ccceeeeceeeeceeeeeeccccchhh-----hccccccccccccceeeeeeccccccc Q lcl|NC_012740. 160 YPELDGGWTAEFTSSSGNG--SAALSVTKIVTDSGLLLTDLETSRANI-----TNQDFLTKLKKYDMPAVSAIYAGEIGN 232 (667) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~--~~~~t~~~~v~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~A~~~G~~gn 232 (667) .+.....+..++....... ....++...+.+..........+...+ ..............+...+..++.+|+ T Consensus 159 ~~~l~~a~~~~~~~~t~~~g~~~~~~v~~v~~~~~~~~~~~~~a~~~i~~~~~~~~t~~~~~~~~~~~~~~A~~~g~~gn 238 (679) T protein:vir:10 159 YPALDNAWQIQFAAGGPGAGQAATATVVGINLDSTIFVPNDEYAMSAISERSETKRTFIDICEEMKVPAIVARYAGTYGD 238 (679) T ss_pred cceecccceeeeeeccccccceeeeeeeeeccCCceeeccccccccccccccccchhhhhhhhccccceeeeecccccCC Confidence 4444444444333222111 111112222222222222222222211 112223333445666778888999999 Q ss_pred eeEEEEeecccccccceeeeeeeec--cc---------ccccceeeeeccccccccceeeeeccceeeeeEeeeccCCcc Q lcl|NC_012740. 233 SLEVEILARSSFSGAVAPELTMYPF--GG---------TRAAARNLIPYAPQNDNQYAFIVRRDGVVVESYVLSTLKGDK 301 (667) Q Consensus 233 ~i~v~i~~~a~~~~~~~~~~t~~~~--~~---------~~~~~~~~~~~~~~~~~~~~~~v~~~g~v~e~~~~s~~~~~~ 301 (667) .+.+.+................... .. .............+....+.+++..++.+.|.+.++...++. T Consensus 239 ~i~v~~va~~~~~~~~~~~a~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vvv~~~g~~~~~~~~~~~~~~~ 318 (679) T protein:vir:10 239 NIKVLMIAYKDYYKFNEAGKIVSVNTINPKVFPTGLDYGNVTPSSYLEFGPQNESQFAFIVFNNGVAVESKILSTKPGDR 318 (679) T ss_pred cceEEEEeecccccccccccccccccccccccccccccccceeeeecccccccccceeeEEecccccccceeeecccccc Confidence 9888765443322111100000000 00 001111222333455566788888888888999998888888 Q ss_pred ccccccccchhhhcccccceEEEecccccCcccceEEecCCccccccccccccccccccchhHHHHHHhhhcccccccEE Q lcl|NC_012740. 302 DVYGNSIYMDDFFARGSSQYIYATAQGWVDGFSGIISLAGGVSANEASTGDRGNDPFIGAMMQGWDLFAERESIHVNLLI 381 (667) Q Consensus 302 ~~~~~~~~~~~~~~~~~s~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~ 381 (667) ...+...++.+.+.++.+.++.......+......+.+.+|.++... ...++...+++++...+.+.+++|+ T Consensus 319 ~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~gg~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~l~ 390 (679) T protein:vir:10 319 DIYGTSIYINEYFGNGYSSFVQGVAESWPVGYTGVLAFGGGQSSNTD--------ISAAEFMKGWDMFADREHTDVNLFI 390 (679) T ss_pred cccchhhhhhhhhcCcccceeeeccccccccccceeeccCCccCCCc--------cchhhhhhhhhhhhcccccccceEE Confidence 88888888888888888888877766666666777888888776432 2235667788889988889999999 Q ss_pred ecCcCCcc-hhhHHHHHHHHHHHhhcCcEEEEEccCccccccccccCCHHHHHHHhhhhcc---ccccccccCcceEEEE Q lcl|NC_012740. 382 AGACAGEG-DAFSTVQKHAVSIGDERQDCLVMVSPPRSTVVNIPVTTAIDNLIAWREGNSN---YSDNNMNINTTYAVID 457 (667) Q Consensus 382 ~~~~~~~~-~~~~~v~~~~~~~~~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~s~~~~~~ 457 (667) +|+..... +...+|+.+|++||+++++||+|+|+|+...++.+..++.+++.+||+.... ......+++|.|+++| T Consensus 391 ~p~~~~~~~~~~~~v~~~l~~~~~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~s~~~~~~ 470 (679) T protein:vir:10 391 AGAVAGEGAQIASTVQKAVVAIADERRDCLVLISPPREYMINQPAASVVRKLVDWRRGVNQAGISLDDNMNIGTTYASVD 470 (679) T ss_pred ecCCCCCchhhhHHHHHHHHHHHHhhCCeEEEEeccccccccccccchHHHHHHHHhhcccccchhhhhhccCcceEEEE Confidence 99987643 4567899999999999999999999999999998999999999999986542 2234567899999999 Q ss_pred ehhhcccccccCceeEechHHHHHHHHHHhhhcCCceeeecceeccceeccccccccCChhhhhhhhhcCceEEEEecCC Q lcl|NC_012740. 458 GNYKYQYDKYNDVNRWVPLAADIAGLCARTDAVSQPWMSPAGYNRGQIMNVVKLAIEPRKAHRDRLYQAAINPVIGAGGE 537 (667) Q Consensus 458 ~p~~~v~d~~~~~~~~~p~sg~vAg~~a~~d~~~g~~~span~~~~~i~g~~~~~~~~~~~e~~~Ln~~gIn~i~~~~~~ 537 (667) |||++++|+.+++++++||||++||+|||+|.++||||||+|+++.+|.|+.++++.+++.|++.||++||||||+|+++ T Consensus 471 ~p~~~~~d~~~~~~~~~p~sg~vAGl~Ar~D~~~g~~~sPan~~~~~i~g~~~~~~~~~~~~~~~Ln~~gin~i~~~~g~ 550 (679) T protein:vir:10 471 GNYKYQYDKYNDVNRWIPLAADIAGLCARTDTVGQPWQSPAGFNRGQIVNVIKLAVDTRQAHRDEMYTNGINPIVGFAGQ 550 (679) T ss_pred ccceeeecccCCceEEechHHHHHHHHHHhhccCCcEECcCCeeeccccccccceeecChhhHHhhhhCCceEEEEecCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eEEEEcceecCCCcccceeeehhhhhHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhcCCeeeeEEEEccc Q lcl|NC_012740. 538 GFILMGDKTATTVPSPFDRINVRRLFNMLKKNIGDSSKYKLFENNDNFTRASFRMEVSQYLSTIRSLGGIYDFRVQCDTT 617 (667) Q Consensus 538 G~~~wG~rT~~~~~~~~~~i~vrR~~~~i~~~l~~~~~~~v~epn~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~ 617 (667) |+++||+||+++++++|+||||||||+|||++|+++++|+||||||+.+|++|+++|++||++||++|+|.||+|+||++ T Consensus 551 G~~~wG~rT~~~~~s~~~~i~vrR~~~~i~~si~~~~~~~v~epn~~~~~~~i~~~i~~fL~~l~~~gal~gf~v~~d~~ 630 (679) T protein:vir:10 551 GYILYGDKTASQAPTPFDRINVRRLFNLLKKSISESAKYKLFELNDAFTRSSFRSEVGSYLDTIRSLGGIYDFRVVCDES 630 (679) T ss_pred eEEEEcccccCCCCcccceEehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEcCC Confidence 99999999999998899999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCHHHhhCCeEEEEEEEEecCCceEEEEEEEEeecCeeHHHHHHHHhc Q lcl|NC_012740. 618 NNTPDVIDRNEFVASMFIKPAKSINYIMLNFTAVATGADFDEIIGPANQ 666 (667) Q Consensus 618 ~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~~~~~~ 666 (667) +||+++|++|+|+|+|+++|++|||||+|||+|++++++|+|++++||| T Consensus 631 ~nt~~~i~~G~~~~~i~~~p~~pae~i~~~~~~~~~~~~~~e~~~~~~~ 679 (679) T protein:vir:10 631 NNTPAVIDRNEFVATILIKPARSINYITLSFVATSTGADFDELVGSFQQ 679 (679) T ss_pred CCCHHHhhCCeEEEEEEEEecCCccEEEEEEEEeecCccHHHHHHHhcC Confidence 9999999999999999999999999999999999999999999999999 No 10 >protein:vir:7206 Length: 659 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049780;genbank:gi:9632592;genbank:GeneID:1258597 Probab=100.00 E-value=8.5e-160 Score=892.67 Aligned_cols=657 Identities=63% Similarity=1.073 Sum_probs=514.9 Q ss_pred CceecCceEEEEecCCCcccccCCCceEEEeeccCCCCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHHcCCCeEEE Q lcl|NC_012740. 1 MTLLSPGFETKETTLSTTIVQSATGRAALVGKFQWGPAFQIVQVTNEVELVNKFGQPDNNTADYFMSGANFLQYGNDLRV 80 (667) Q Consensus 1 ~~~~~PGVyvee~~~~~~~~~~~ts~~afvG~~~~Gp~~~p~~i~s~~e~~~~FG~~~~~~~~~~~v~~~f~ngG~~~~v 80 (667) |+|+||||||||+|+++++++++|||+||||+|+|||+|+|++|+||.||+++||++++.++++|++++||+|||++||| T Consensus 1 ~~~~~PgVyvee~~~~~~~~~~~ts~~~fvG~~~~Gp~~~p~~i~s~~~~~~~fG~~~~~~~~~~~~~~~f~ngg~~~~v 80 (659) T protein:vir:72 1 MTLLSPGIELKETTVQSTVVNNSTGTAALAGKFQWGPAFQIKQVTNEVDLVNTFGQPTAETADYFMSAMNFLQYGNDLRV 80 (659) T ss_pred CceecCceEEEEecCCcccccCCCcceEEEeecCCCCCcccEEecCHHHHHHHcCCcCCCCchhHHHHHHHHhCCceEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEcCCcccccccccccccccceeeeccccccccceeeEeeeccceeccccceeecccccceeeeeccccccccccccccc Q lcl|NC_012740. 81 VRVLNKEKAKNATALAGNVEFEITNEGSNYEVGDTIKIKHNRQDIETAGKVTKVDGDGKVKGVFIPTGKIIAHAKAIGVY 160 (667) Q Consensus 81 vRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~a~~~~~~ 160 (667) |||.+.++++++.+....+..+...++.....+.....+.........+.+...+..........+.............+ T Consensus 81 vRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~g~v~~~~~~~~~~~~~v~t~~~~a~~~~~~~~ 160 (659) T protein:vir:72 81 VRAVDRDTAKNSSPIAGNIDYTISTPGSNYAVGDKITVKYVSDDIETEGKITEVDADGKIKKINIPTGKNYAKAKEVGEY 160 (659) T ss_pred EEccCCcccccccccccccceeecccccccccceeeeeeeccccccccceEEEeeccccceeeeeccccccccccccccc Confidence 99998887777776666666555555544444443333332211111122233333332222222222222222222222 Q ss_pred ccccccceEEEEEeecccccceeeeceeeeceeeeeeccccchhhhccccccccccccceeeeeeccccccceeEEEEee Q lcl|NC_012740. 161 PELDGGWTAEFTSSSGNGSAALSVTKIVTDSGLLLTDLETSRANITNQDFLTKLKKYDMPAVSAIYAGEIGNSLEVEILA 240 (667) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~t~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~A~~~G~~gn~i~v~i~~ 240 (667) ..........+...........++...+.+.............................+...+..++.+++.+++.+.. T Consensus 161 ~~~~~~~~~~~~~~~~~~a~~~~~v~v~~~~~~~~~~v~~~~~a~~~~~~~~~v~~~~~~~~~a~~~gt~g~~~tv~i~~ 240 (659) T protein:vir:72 161 PTLGSNWTAEISSSSSGLAAVITLGKIITDSGILLAEIENAEAAMTAVDFQANLKKYGIPGVVALYPGELGDKIEIEIVS 240 (659) T ss_pred cccccceeeEEeeccccccceEEEEEeecCcceeeeeccccchhhhcccccccccccccceeeeccccccccceeEEEcc Confidence 22222222233322222222223333333333333222322222222222222333344556677888999999988877 Q ss_pred cccccccceeeeeeeecccc-cccceeeeeccccccccceeeeeccceeeeeEeeeccCCccccccccccchhhhccccc Q lcl|NC_012740. 241 RSSFSGAVAPELTMYPFGGT-RAAARNLIPYAPQNDNQYAFIVRRDGVVVESYVLSTLKGDKDVYGNSIYMDDFFARGSS 319 (667) Q Consensus 241 ~a~~~~~~~~~~t~~~~~~~-~~~~~~~~~~~~~~~~~~~~~v~~~g~v~e~~~~s~~~~~~~~~~~~~~~~~~~~~~~s 319 (667) ..+................. ......+....+..++.+.+.+...+.+.+.+.++...+.........+....+.++.+ T Consensus 241 ~~~~~~~~~~~v~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 320 (659) T protein:vir:72 241 KADYAKGASALLPIYPGGGTRASTAKAVFGYGPQTDSQYAIIVRRNDAIVQSVVLSTKRGEKDIYDSNIYIDDFFAKGGS 320 (659) T ss_pred ccccccceeeeeecccccccccccceeeeeeecccccccceeeecccceeeeeeeeeccccccccchhhhhhhhhhcCCc Confidence 66665444444333332222 22233344455556667777777888888999888888888777778888888878888 Q ss_pred ceEEEecccccCcccceEEecCCccccccccccccccccccchhHHHHHHhhhcccccccEEecCcCCcc-hhhHHHHHH Q lcl|NC_012740. 320 QYIYATAQGWVDGFSGIISLAGGVSANEASTGDRGNDPFIGAMMQGWDLFAERESIHVNLLIAGACAGEG-DAFSTVQKH 398 (667) Q Consensus 320 ~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~-~~~~~v~~~ 398 (667) .++.+............+.+.+|.+.... ....+...++.++...+.+++++|++|++.+.. ....+++.+ T Consensus 321 ~~v~~~~~~~~~~~~~~~~l~gg~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~l~~p~~~~~~~~~~~~v~~~ 392 (659) T protein:vir:72 321 EYIFATAQNWPEGFSGILTLSGGLSSNAE--------VTAGDLMEAWDFFADRESVDVQLFIAGSCAGESLETASTVQKH 392 (659) T ss_pred eEEEEEecccCCccccccccccccccccc--------ccchhHHHHHHHhhhccccceeEEEecCCCCcchhhhHHHHHH Confidence 88888776666666666777777654321 233556788888888888899999999987643 345789999 Q ss_pred HHHHHhhcCcEEEEEccCccccccccccCCHHHHHHHhhhhccccccccccCcceEEEEehhhcccccccCceeEechHH Q lcl|NC_012740. 399 AVSIGDERQDCLVMVSPPRSTVVNIPVTTAIDNLIAWREGNSNYSDNNMNINTTYAVIDGNYKYQYDKYNDVNRWVPLAA 478 (667) Q Consensus 399 ~~~~~~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~v~d~~~~~~~~~p~sg 478 (667) |++||+++++||+++|+|+...++.+..++.+++++||+..+.+.....+++|+|+++||||++++|+.+++++++|||| T Consensus 393 l~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~sg 472 (659) T protein:vir:72 393 VVSIGDARQDCLVLCSPPRETVVGIPVTRAVDNLVNWRTAAGSYTDNNFNISSTYAAIDGNHKYQYDKYNDVNRWVPLAA 472 (659) T ss_pred HHHHHhhhCCEEEEEcCccccccCCCcccCHHHHHHHHhhccccccccccccceeEEEEcCceeeccccCCceEEechHH Confidence 99999999999999999999999999999999999999999998888999999999999999999999999999999999 Q ss_pred HHHHHHHHhhhcCCceeeecceeccceeccccccccCChhhhhhhhhcCceEEEEecCCeEEEEcceecCCCcccceeee Q lcl|NC_012740. 479 DIAGLCARTDAVSQPWMSPAGYNRGQIMNVVKLAIEPRKAHRDRLYQAAINPVIGAGGEGFILMGDKTATTVPSPFDRIN 558 (667) Q Consensus 479 ~vAg~~a~~d~~~g~~~span~~~~~i~g~~~~~~~~~~~e~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~~~~~~~~~i~ 558 (667) ++||+|||+|.++||||||+|+++.+|.|+.++++.+++.|++.||++|||||++|+++|+++||+||+++++++|+||| T Consensus 473 ~vAGl~Ar~D~~~G~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~g~G~~~wG~rT~~~~~s~~~~i~ 552 (659) T protein:vir:72 473 DIAGLCARTDNVSQTWMSPAGYNRGQILNVIKLAIETRQAQRDRLYQEAINPVTGTGGDGYVLYGDKTATSVPSPFDRIN 552 (659) T ss_pred HHHHHHHHhhccCCcEEccCCeeeceeeccccccccCChhHHHHHhhCCceEEEEecCCeEEEEcccccCCCCcccceEe Confidence 99999999999999999999999999999999999999999999999999999999999999999999999988999999 Q ss_pred hhhhhHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhcCCeeeeEEEEcccCCCHHHhhCCeEEEEEEEEec Q lcl|NC_012740. 559 VRRLFNMLKKNIGDSSKYKLFENNDNFTRASFRMEVSQYLSTIRSLGGIYDFRVQCDTTNNTPDVIDRNEFVASMFIKPA 638 (667) Q Consensus 559 vrR~~~~i~~~l~~~~~~~v~epn~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~~i~~~p~ 638 (667) |||||+||+++|+++++|+|||||++.+|++|+++|++||++||++|+|.+|+|+||+++||++||++|+|+|+|+|+|+ T Consensus 553 vrR~~~~i~~si~~~~~~~v~e~n~~~l~~~i~~~i~~fL~~l~~~gal~~~~V~~d~~~nt~~~i~~G~~~~~i~~~p~ 632 (659) T protein:vir:72 553 VRRLFNMLKTNIGRSSKYRLFELNNAFTRSSFRTETAQYLQGNKALGGIYEYRVVCDTTNNTPSVIDRNEFVATFYIQPA 632 (659) T ss_pred ehhHHHHHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHHHHHHHhcCceeeEEEEEcCCCCCHHHhhCCeEEEEEEEEec Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCceEEEEEEEEeecCeeHHHHHHHHh Q lcl|NC_012740. 639 KSINYIMLNFTAVATGADFDEIIGPAN 665 (667) Q Consensus 639 ~p~e~i~~~~~~~~~~~~~~e~~~~~~ 665 (667) +|+|||+|||+|++++++|+|+++..- T Consensus 633 ~pae~I~~~~~~~~~~~~~~e~~~~~~ 659 (659) T protein:vir:72 633 RSINYITLNFVATATGADFDELTGLAG 659 (659) T ss_pred CCccEEEEEEEEeecCcchHHhcccCC Confidence 999999999999999999999999877 No 11 >protein:vir:98263 Length: 664 # NCBI annotation: gp18 tail sheath monomer # Family: family:all:661 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239196;genbank:gi:66391671;genbank:GeneID:3416365 Probab=100.00 E-value=1.2e-157 Score=880.90 Aligned_cols=655 Identities=58% Similarity=0.981 Sum_probs=514.0 Q ss_pred CceecCceEEEEecCCCccccc-CCCceEEEeeccCCCCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHHcCCCeEE Q lcl|NC_012740. 1 MTLLSPGFETKETTLSTTIVQS-ATGRAALVGKFQWGPAFQIVQVTNEVELVNKFGQPDNNTADYFMSGANFLQYGNDLR 79 (667) Q Consensus 1 ~~~~~PGVyvee~~~~~~~~~~-~ts~~afvG~~~~Gp~~~p~~i~s~~e~~~~FG~~~~~~~~~~~v~~~f~ngG~~~~ 79 (667) |+|+||||||||++ ++++|++ +||++||||+|+|||+|+|++|+||.||++.||++++.+|++|+|++||+|||++|| T Consensus 1 ma~~~PgVyv~E~~-~~~~i~~~~ts~~~~vG~~~~Gp~~~p~~i~s~~d~~~~fG~~~~~~~~~~~v~~~f~ngg~~~~ 79 (664) T protein:vir:98 1 MALQSPGIETKETS-VQSTVVRNSTGRAAIVGKFSWGPAYQIRQISNEVELVNYFGAPDNLTADYFMSAVNFLQYGNDLR 79 (664) T ss_pred CceecCceEEEecC-CCcccccccccceEEEeeccCCCCCccEEecCHHHHHHhcCCccccchhHHHHHHHHHhcCCeEE Confidence 99999999999997 5666665 799999999999999999999999999999999999999999999999999999999 Q ss_pred EEEcCCcccccccccccccccceeeeccccccccceeeEeeeccceeccccceeecccccceeeeecc--cccccccccc Q lcl|NC_012740. 80 VVRVLNKEKAKNATALAGNVEFEITNEGSNYEVGDTIKIKHNRQDIETAGKVTKVDGDGKVKGVFIPT--GKIIAHAKAI 157 (667) Q Consensus 80 vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~--~~~~~~a~~~ 157 (667) ||||.+.+++++++........++..+++...+++.+.+................+..+.....+++. .......... T Consensus 80 vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~gn~~~v~i~~~~~~~~~~~~~~ 159 (664) T protein:vir:98 80 LVRVVDKDAAKNASAIFNQIKTTIASQGSNYNVGDVIKVKYSNTVVEENGKVTQVDSNGKILAVTIPKRKKSLLVLNRSV 159 (664) T ss_pred EEEecCccccccccccccccceeeccCCcccccccceeEeecCcccccceeeccCCCCCceeeEeeccCccceeeccccc Confidence 99999888887777777777766777776667777776665443222222222222222211111111 0011111111 Q ss_pred cccccccccceEEEEEeecccccceeeeceeeeceeeeeeccccchhhhccccccccccccceeeeeeccccccceeEEE Q lcl|NC_012740. 158 GVYPELDGGWTAEFTSSSGNGSAALSVTKIVTDSGLLLTDLETSRANITNQDFLTKLKKYDMPAVSAIYAGEIGNSLEVE 237 (667) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~A~~~G~~gn~i~v~ 237 (667) ..... ......++...+.+.....+....+.+..............+..............+.+.+..+|.+||.+++. T Consensus 160 ~~~~~-~~~~~~s~~~~s~g~a~a~~v~~v~~d~~~~~~~~~~a~~~i~~~~~~~~~~~~~~~~~~a~~~G~~Gn~isv~ 238 (664) T protein:vir:98 160 LTQIF-LLVGTTEIVSQSSGVSASITIDGIESDSGITLLNLDIAKETIQGTSFQTLTQKYQIPSVVALYPGELGSTVQVE 238 (664) T ss_pred ccccc-eecccceeeeeecccceeeecccccccceeeccccceeeeccccccceeeeeccccceeeeeecccccceeeee Confidence 11100 01111223333333333333333334433333333333333444444444555667788889999999999999 Q ss_pred EeecccccccceeeeeeeecccccccceeeeeccccccccceeeeeccceeeeeEeeeccCCccccccccccchhhhccc Q lcl|NC_012740. 238 ILARSSFSGAVAPELTMYPFGGTRAAARNLIPYAPQNDNQYAFIVRRDGVVVESYVLSTLKGDKDVYGNSIYMDDFFARG 317 (667) Q Consensus 238 i~~~a~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~v~e~~~~s~~~~~~~~~~~~~~~~~~~~~~ 317 (667) +....++.......... ................+..++.+.+++..++.+.|++.++...++++......+..+.+.++ T Consensus 239 i~s~~~~~~~~~i~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~ 317 (664) T protein:vir:98 239 IISKAAYDTGAMISGYP-SGISVKNSGRSVMTYGPQTDNQYAFVVRRGGIVQESFIVSTDKTDKDIYGVNIYMDDFFANG 317 (664) T ss_pred ecccccccCcceEeecc-CceecccceeeeeeccccCccceeEEEecCCceeeeEEeecccCcccceeeeeechhheecc Confidence 88776665544222111 11111223344555666677888999999999999999999998888888888888887788 Q ss_pred ccceEEEecccccCcccceEEecCCccccccccccccccccccchhHHHHHHhhhcccccccEEecCcCCc-chhhHHHH Q lcl|NC_012740. 318 SSQYIYATAQGWVDGFSGIISLAGGVSANEASTGDRGNDPFIGAMMQGWDLFAERESIHVNLLIAGACAGE-GDAFSTVQ 396 (667) Q Consensus 318 ~s~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~-~~~~~~v~ 396 (667) .+.++.......+........+.+|.+..... ...+..+++.++...+.+.+++|++|++.+. .....+++ T Consensus 318 ~~~~~~~~~~~~~~~~~~~~~~~gg~~~~~~~--------g~~~~~tgl~~l~~~~~~~~~ll~~p~~~~~~~~~~~~v~ 389 (664) T protein:vir:98 318 GSQYVFGTSMNWPKGFSGILEFGGGLSSNDTV--------GADELMTGWDMFADREALHVPLLIAGGCAGESVEIASTVQ 389 (664) T ss_pred cceeeeeecccCCcccceeEeccCcccccccc--------CchhHHHHHHhhhcccccccceEEecCCCCCcHHHHHHHH Confidence 88887766665555555666777776543221 1235677888888888889999999998754 44567899 Q ss_pred HHHHHHHhhcCcEEEEEccCccccccccccCCHHHHHHHhhhhcc----ccccccccCcceEEEEehhhcccccccCcee Q lcl|NC_012740. 397 KHAVSIGDERQDCLVMVSPPRSTVVNIPVTTAIDNLIAWREGNSN----YSDNNMNINTTYAVIDGNYKYQYDKYNDVNR 472 (667) Q Consensus 397 ~~~~~~~~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~s~~~~~~~p~~~v~d~~~~~~~ 472 (667) .+|++||+++++||+++|+|+...++.+..++.+++++||+.... ......+++|+|+++||||++++|+.+++++ T Consensus 390 ~al~~~a~~~~~~~a~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~l~~p~~~~~d~~~~~~~ 469 (664) T protein:vir:98 390 KHVISIGDERQDCTVFVSPPRSLLVNIPLATAVDNIVEWRTGYKISGGTPVDNNLNVSSSYGFLDGNYKYQYDKYNDVNR 469 (664) T ss_pred HHHHHHHHhcCCeEEEEccccceeccCCccccHHHHHHHhhhccccccchhhhhcCCccceEEEEcCeEEEecccCCceE Confidence 999999999999999999999999999999999999999986533 2234557999999999999999999999999 Q ss_pred EechHHHHHHHHHHhhhcCCceeeecceeccceeccccccccCChhhhhhhhhcCceEEEEecC-CeEEEEcceecCCCc Q lcl|NC_012740. 473 WVPLAADIAGLCARTDAVSQPWMSPAGYNRGQIMNVVKLAIEPRKAHRDRLYQAAINPVIGAGG-EGFILMGDKTATTVP 551 (667) Q Consensus 473 ~~p~sg~vAg~~a~~d~~~g~~~span~~~~~i~g~~~~~~~~~~~e~~~Ln~~gIn~i~~~~~-~G~~~wG~rT~~~~~ 551 (667) ++||||++||+|||+|.++||||||+|+++.+|.|+.++.+.+++.|++.||++|||||+.|++ +|+++||+||+++++ T Consensus 470 ~~p~sg~~AGl~A~~D~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~~G~~~wG~rT~~~~~ 549 (664) T protein:vir:98 470 WVPLAGDIAGLCVYTDSVANPWMSPAGYNRGQIRNCIKLAIEPRTAHRDAMYQVQINPVTGFAGGSGFVLYGDKTLTSVP 549 (664) T ss_pred EechHHHHHHHHHHhhhcCCcEECcCCceeeeeeccccceeecChhhHHHHHhCCCeEEEEeeCCCcEEEEcccccCCCC Confidence 9999999999999999999999999999999999999999999999999999999999999997 799999999999888 Q ss_pred ccceeeehhhhhHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhcCCeeeeEEEEcccCCCHHHhhCCeEEE Q lcl|NC_012740. 552 SPFDRINVRRLFNMLKKNIGDSSKYKLFENNDNFTRASFRMEVSQYLSTIRSLGGIYDFRVQCDTTNNTPDVIDRNEFVA 631 (667) Q Consensus 552 ~~~~~i~vrR~~~~i~~~l~~~~~~~v~epn~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~ 631 (667) ++|+||||||||+||+++|++.++|+||||||+.+|++|+++|++||++||++|+|.||+|+||+++||+++|++|+|+| T Consensus 550 s~~~~i~vrR~~~~i~~si~~~~~~~v~epn~~~l~~~i~~~i~~~L~~l~~~gal~g~~V~~d~~~nt~~~i~~G~~~~ 629 (664) T protein:vir:98 550 SPFDRINVRRLFNMIKKDIGDNAKYKLFENNDDFTRASFRMDTGQYMTNIRALGGCYDYRVICDTTNNTPDVIDRNEFVA 629 (664) T ss_pred cccceEeehhHHHHHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHHHHHHHhcCceeeeEEEEcCCCCCHHHhhCCeEEE Confidence 89999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEEecCCceEEEEEEEEeecCeeHHHHHHHHhc Q lcl|NC_012740. 632 SMFIKPAKSINYIMLNFTAVATGADFDEIIGPANQ 666 (667) Q Consensus 632 ~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~~~~~~ 666 (667) +|+++|++|+|||+|||+|++++++|+|++++-.- T Consensus 630 ~i~~~p~~pae~I~~~~~q~~~~~~~~e~~~~~~~ 664 (664) T protein:vir:98 630 TVYVKPPRSINYITLNFVATSTGADFDELVGPQAV 664 (664) T ss_pred EEEEEecCCcceEEEEEEEeecCcchhHhcccccC Confidence 99999999999999999999999999999997555 No 12 >protein:vir:5663 Length: 671 # NCBI annotation: tail sheath protein # Family: family:all:661 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899602;genbank:gi:34419589;genbank:GeneID:2545776 Probab=100.00 E-value=4.1e-147 Score=823.10 Aligned_cols=642 Identities=50% Similarity=0.825 Sum_probs=457.3 Q ss_pred CceecCceEEEEecCCCcccc-cCCCceEEEeeccCCCCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHHcCCCeEE Q lcl|NC_012740. 1 MTLLSPGFETKETTLSTTIVQ-SATGRAALVGKFQWGPAFQIVQVTNEVELVNKFGQPDNNTADYFMSGANFLQYGNDLR 79 (667) Q Consensus 1 ~~~~~PGVyvee~~~~~~~~~-~~ts~~afvG~~~~Gp~~~p~~i~s~~e~~~~FG~~~~~~~~~~~v~~~f~ngG~~~~ 79 (667) |+|+||||||||++ ++++++ ++||++||||+|+|||+|+|++|+||.||+++||++++.+|++|+|++||+|||++|| T Consensus 1 ~~~~~Pgvyv~e~~-~~~~i~~v~t~~~~fvG~~~~Gp~~~p~~v~s~~~~~~~fG~~~~~~~~~~~v~~~f~ngg~~~~ 79 (671) T protein:vir:56 1 MTLLSPGIENKEIN-LASAIGRAATGRAAMVGKFEWGPAYSITQVTSESDLVTIFGRPNDYTAASFMTANNFLKYGNDLR 79 (671) T ss_pred CceecCceEEEeec-CcccccccCcccceEEecccCCCCccCEEcCCHHHHHHHcCCcCCCcchhHHHHHHHHhcCCeEE Confidence 99999999999997 555555 5799999999999999999999999999999999999999999999999999999999 Q ss_pred EEEcCCcccccccccccccccceeeeccccccccceeeEeeeccceecc-ccceeecc--cccceeeeeccccccccccc Q lcl|NC_012740. 80 VVRVLNKEKAKNATALAGNVEFEITNEGSNYEVGDTIKIKHNRQDIETA-GKVTKVDG--DGKVKGVFIPTGKIIAHAKA 156 (667) Q Consensus 80 vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~-~~~~~~d~--~~~~~~~~~~~~~~~~~a~~ 156 (667) ||||.+.++.+++.........+. ..+....+++.+.+.........+ +.....+. .......+.+.......... T Consensus 80 vvrv~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~v~~~~~ 158 (671) T protein:vir:56 80 LVRICDATTAQNATPLYNAVEYTI-GASNGCVVGDDITITYSGVGALTAKGKVLEVDAGNNNAASKIFLPSAEIVAAAKS 158 (671) T ss_pred EEEecCccccccchhhcccccccc-ccCcceeeceeeeeecCcccccccCcceeEEeeeccceeeeeeccceeEEEeeec Confidence 999998877777665554443332 223344455555544332221111 11122111 11111222222222222222 Q ss_pred ccccccccccceEEEEEeecccccceeeece-eeec-eeeeeecc-------ccchhhhccccccccccccceeeeeecc Q lcl|NC_012740. 157 IGVYPELDGGWTAEFTSSSGNGSAALSVTKI-VTDS-GLLLTDLE-------TSRANITNQDFLTKLKKYDMPAVSAIYA 227 (667) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~-v~~~-~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~A~~~ 227 (667) ...+........ ............ +.+. ........ ....................+.+.+... T Consensus 159 ~~~~~~~~~~t~-------~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~ 231 (671) T protein:vir:56 159 DGNYPSVGTITL-------QPTQGDIALTNIEIIDTGSVYFPNIELAFDALTAIETEGGALKYADLIEKQGFPRLSARYV 231 (671) T ss_pred cccccccccccc-------cccccceeeeeecccccceEEEeccccccccccccccccccccchhhhhcccccccccccc Confidence 221111100000 000000000000 0000 00000000 0000000111112222334556677788 Q ss_pred ccccceeEEEEeecccccccceeee--eeeecc-------cc-cccceeeeeccccccccceeeeeccceeeeeEeeecc Q lcl|NC_012740. 228 GEIGNSLEVEILARSSFSGAVAPEL--TMYPFG-------GT-RAAARNLIPYAPQNDNQYAFIVRRDGVVVESYVLSTL 297 (667) Q Consensus 228 G~~gn~i~v~i~~~a~~~~~~~~~~--t~~~~~-------~~-~~~~~~~~~~~~~~~~~~~~~v~~~g~v~e~~~~s~~ 297 (667) +.+++.+.+.+.............. ...... .. ...............+.+.+.+..++.+.|++.++.. T Consensus 232 g~~g~~~~v~v~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~~~~~~~ 311 (671) T protein:vir:56 232 GDFGDAISVEIINYADYQTAFAFAAGHTLGDIELPIYPDGGTRSINLSSYFTFGPSNSNQYAVIVRVSGEVEEAFIVSTN 311 (671) T ss_pred cccCcceEEEEecccccccccccccceeeeeccccccccccccccccceeecccccccccceeEEeecCccceeEEEeec Confidence 8999998888765444332221111 111110 00 1111222333445556677788888889999988887 Q ss_pred CCccccccccccchhhhcccccceEEEecccccCcccceEEecCCccccccccccccccccccchhHHHHHHhhhccccc Q lcl|NC_012740. 298 KGDKDVYGNSIYMDDFFARGSSQYIYATAQGWVDGFSGIISLAGGVSANEASTGDRGNDPFIGAMMQGWDLFAERESIHV 377 (667) Q Consensus 298 ~~~~~~~~~~~~~~~~~~~~~s~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 377 (667) .+.........+......++.+.++........ .......+.+|.+.. ....+...+++.+...+.+.+ T Consensus 312 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~gg~d~~----------~~~~~~~~~~~~~~~~~~~~~ 380 (671) T protein:vir:56 312 PGDKDVNGQSIFIDEYFENSGSAYITAIAEGWK-TESGAYNFGGGSDAN----------AGADDWMFGLDMLSDPEVLYT 380 (671) T ss_pred ccccccchhhhhhhhhhcccCceEEEecCcccC-CccccccccCccccc----------cchhHHHHHHHhhhhccccce Confidence 777776666666655555555544433322222 222333455554432 123345677788877777788 Q ss_pred ccEEecCcCCcc-hhhHHH-HHHHHHHHhhcCcEEEEEccCccccccccccCCHHHHHHHhhhhcccc----ccccccCc Q lcl|NC_012740. 378 NLLIAGACAGEG-DAFSTV-QKHAVSIGDERQDCLVMVSPPRSTVVNIPVTTAIDNLIAWREGNSNYS----DNNMNINT 451 (667) Q Consensus 378 ~~l~~~~~~~~~-~~~~~v-~~~~~~~~~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~s 451 (667) +++++|+..... .....+ +.++..+|+.+++|++++|+|+...++.+...+.+++.+||+...... ....+++| T Consensus 381 ~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s 460 (671) T protein:vir:56 381 NLVIAGNAAAEEVSIASTVQKYAIDSVGNVRQDCVVFVSPPQSLIVNKQAGTAVANIQGWRTGIDPTNGQAVVDNLNVST 460 (671) T ss_pred eEEEcCCCCCccchhHHHHHHHHHHHHHhhcCCEEEEEeccccccccccccccHHHHHHHhhhccccchhhhhhhccCCc Confidence 888888766532 223333 445666777889999999999999999999999999999998654332 24567899 Q ss_pred ceEEEEehhhcccccccCceeEechHHHHHHHHHHhhhcCCceeeecceeccceeccccccccCChhhhhhhhhcCceEE Q lcl|NC_012740. 452 TYAVIDGNYKYQYDKYNDVNRWVPLAADIAGLCARTDAVSQPWMSPAGYNRGQIMNVVKLAIEPRKAHRDRLYQAAINPV 531 (667) Q Consensus 452 ~~~~~~~p~~~v~d~~~~~~~~~p~sg~vAg~~a~~d~~~g~~~span~~~~~i~g~~~~~~~~~~~e~~~Ln~~gIn~i 531 (667) .|+++||||++++|+.+++.+++|||+++||+|||+|.++||||||+|+++.+|.|+.++++.+++.|++.||++||||| T Consensus 461 ~~~~~~~p~~~~~d~~~~~~~~~p~s~~~AGl~Ar~D~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i 540 (671) T protein:vir:56 461 TYAVIDGNYKYQYDKYNDRNRWVPLAGDIAGLCAYTDQVSQPWMSPAGFNRGQIKGVNRLAVDLRRAHRDALYQIGINPV 540 (671) T ss_pred ceEEEecCceEEecccCCceeEechHHHHHHHHHHhhccCCcEECcCCceeccccccccceeecChhHHHHHhhCCceEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEecCCeEEEEcceecCCCcccceeeehhhhhHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhcCCeeeeE Q lcl|NC_012740. 532 IGAGGEGFILMGDKTATTVPSPFDRINVRRLFNMLKKNIGDSSKYKLFENNDNFTRASFRMEVSQYLSTIRSLGGIYDFR 611 (667) Q Consensus 532 ~~~~~~G~~~wG~rT~~~~~~~~~~i~vrR~~~~i~~~l~~~~~~~v~epn~~~~~~~i~~~i~~~l~~l~~~gal~g~~ 611 (667) ++|+++|+++||+||+++++++|+||||||||+||+++|++.++|+||||||+.||++|+++|++||++||++|+|.||+ T Consensus 541 ~~~~~~G~~~wG~rT~~~~~~~~~~i~vrR~~~~i~~si~~~~~~~v~epn~~~~~~~i~~~i~~fL~~l~~~gal~g~~ 620 (671) T protein:vir:56 541 VGFAGQGFVLYGDKTATQQASAFDRINVRRLFNLLKKAISDAAKYRLFELNDEFTRSSFKSEIDAYLTNIQDLGGVYDFR 620 (671) T ss_pred EEecCCeEEEEcceecCCCCcccceEehhhHHHHHHHHHHHHHHHhcCCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeE Confidence 99999999999999999888899999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEcccCCCHHHhhCCeEEEEEEEEecCCceEEEEEEEEeecCeeHHHHHH Q lcl|NC_012740. 612 VQCDTTNNTPDVIDRNEFVASMFIKPAKSINYIMLNFTAVATGADFDEIIG 662 (667) Q Consensus 612 v~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~~ 662 (667) |+||+++||+++|++|+|+++|+++|++|+|||+|||+|++++++|+|+++ T Consensus 621 v~~d~~~nt~~~i~~G~~~~~i~~~p~~Pae~I~~~~~~~~~~~~f~e~~~ 671 (671) T protein:vir:56 621 VVCDETNNPGSVIDRNEFVASIYVKPAKSINFITLNFVATSTDADFAEIIG 671 (671) T ss_pred EEEcCCCCCHHHhhCCeEEEEEEEEecCCcceEEEEEEEeecCcchhhhcC Confidence 999999999999999999999999999999999999999999999999999 No 13 >protein:vir:104477 Length: 749 # NCBI annotation: gp18 # Family: family:all:661 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214663;genbank:gi:61806304;genbank:GeneID:3294532 Probab=100.00 E-value=1e-143 Score=804.53 Aligned_cols=641 Identities=30% Similarity=0.483 Sum_probs=422.7 Q ss_pred Cc--eecCceEEEEecCCCcccccCCCceEEEeeccCCCCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHHcCCCeE Q lcl|NC_012740. 1 MT--LLSPGFETKETTLSTTIVQSATGRAALVGKFQWGPAFQIVQVTNEVELVNKFGQPDNNTADYFMSGANFLQYGNDL 78 (667) Q Consensus 1 ~~--~~~PGVyvee~~~~~~~~~~~ts~~afvG~~~~Gp~~~p~~i~s~~e~~~~FG~~~~~~~~~~~v~~~f~ngG~~~ 78 (667) |. |+||||||||+|++..+.+++||++||||+|+|||+|+|++|+||.||+++||+|++.+|++|+|++||+|||++| T Consensus 1 M~~~~~~PgVyv~e~~~~~~~~~~~t~~~~fvG~~~~Gp~~~p~~v~s~~~~~~~fG~~~~~~~~~~~v~~~F~ngg~~~ 80 (749) T protein:vir:10 1 MATNQSSPGVVIQERDLTTVSTIPTANVGVIAAPFTKGPVEEVIEITSERQLAEKFGEPNESNYEYWFSAAQFLSYGGLL 80 (749) T ss_pred CCccccCCeeEEEEecCCcccccccCceeEEEeccCCCCCccCEEcCCHHHHHHHcCCccCCcccHHHHHHHHhhcCCeE Confidence 66 9999999999999887777789999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEcCCcccccccccccccc----------------cceeeeccccccccceeeEeeeccceeccccceeec-ccccce Q lcl|NC_012740. 79 RVVRVLNKEKAKNATALAGNV----------------EFEITNEGSNYEVGDTIKIKHNRQDIETAGKVTKVD-GDGKVK 141 (667) Q Consensus 79 ~vvRv~~~~~~~~a~~~~~~~----------------~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~d-~~~~~~ 141 (667) |||||.+... ++++...... ..........+.||+.+.+........ ...... ...... T Consensus 81 ~vvRv~~~~~-~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~pG~~gn~l~v~v~~~~~~---~~~~~~~~~~~~~ 156 (749) T protein:vir:10 81 KTIRVNSSSL-KNAVDTGTAPLVKNLQDYETSIEDASNNFSWVARTPGDTGNSIGIFVTDAGAD---QVVVVPAPGSGNE 156 (749) T ss_pred EEEEccCccc-cccccccccccccccccccccccccccceEEEeccCCCcCCceEEEEEcCCCc---eeeeeecCCccce Confidence 9999976542 2222111100 001112234566777776655322111 000000 000000 Q ss_pred eeeeccccccc------------------------------------ccccccccccccccceEEEEEeecccccceeee Q lcl|NC_012740. 142 GVFIPTGKIIA------------------------------------HAKAIGVYPELDGGWTAEFTSSSGNGSAALSVT 185 (667) Q Consensus 142 ~~~~~~~~~~~------------------------------------~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~ 185 (667) ..+........ .................++....+......... T Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~a~~~~~~~~~~~~~~~~~~s~~~~~~~a~~ 236 (749) T protein:vir:10 157 HEFVADAAVSAASGAAGKVFKYSIILTIDDVVGTFAPGSATTITIGGSAESVNVLAYDATNKKLEIGLPSGGVTGILADN 236 (749) T ss_pred eeEEeeecccccccccccccccceeeeeccccceeecccceeeeccCcccccccccccCCcceEEEeeecccccceeeee Confidence 00000000000 000000000000000000100000000000000 Q ss_pred ceeeeceeeeeeccccchhhhccccccccccccceeeeeeccccccceeEEEEeecccccccceeeeeeeecccccccce Q lcl|NC_012740. 186 KIVTDSGLLLTDLETSRANITNQDFLTKLKKYDMPAVSAIYAGEIGNSLEVEILARSSFSGAVAPELTMYPFGGTRAAAR 265 (667) Q Consensus 186 ~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~A~~~G~~gn~i~v~i~~~a~~~~~~~~~~t~~~~~~~~~~~~ 265 (667) ..+... .........+......................+..|..+.+........................+.... T Consensus 237 ~~v~~~----~~~~~~~~~i~~~~~~~~~~~~~~~a~~~~~~~~~g~~~~it~v~~~~~~~~~~t~~~~~~~a~~~gt~~ 312 (749) T protein:vir:10 237 QVITQG----TNTAKINVTIERKLLVALNKSSIEFAASDVVQDTNSTNITITSVRDEYTEREYLPGVKWINVAPRPGTSL 312 (749) T ss_pred eccccc----ccccccccccccchhhhhccccceeeccccccCCccceeEEEeeeccccccccccceeecccccccccee Confidence 000000 0000000000000000100111111111112222223332222211111110100000111111111211 Q ss_pred eeeeccccccccceeee-------eccceeeeeEe-eeccCCccccccccccchhhhcccccceEEEecccccCcc---- Q lcl|NC_012740. 266 NLIPYAPQNDNQYAFIV-------RRDGVVVESYV-LSTLKGDKDVYGNSIYMDDFFARGSSQYIYATAQGWVDGF---- 333 (667) Q Consensus 266 ~~~~~~~~~~~~~~~~v-------~~~g~v~e~~~-~s~~~~~~~~~~~~~~~~~~~~~~~s~~v~~~~~~~~~~~---- 333 (667) .........+..+.+++ ...+.++|++. ++...+.+...+...++.+++.. .|.++++...+..... T Consensus 313 ~~~~~~g~~D~~~v~v~~~~g~~~~~~g~v~e~~~~~~~~~~~~~~~~~~~~~~~~~~~-~s~~v~~~~~~~~~~~~~~~ 391 (749) T protein:vir:10 313 YANGVGGHRDEMHVILVDIDGGVTGTVGALLERYIDVSKASDAKTSVGETNYYAEVIKQ-KSEFIYWAEHESTLYAATSS 391 (749) T ss_pred eeecccCCCCceEEEEecCCCeeeecccceeeeeeeccccccccccccccchhhhhhcc-CCCEEEEEeccccccccccc Confidence 22222223332222222 23356778886 66667777777777787777644 5667665433211000 Q ss_pred ----------------------------------------cceEEecCCccccccccccccccccccchhHHHHHHhhhc Q lcl|NC_012740. 334 ----------------------------------------SGIISLAGGVSANEASTGDRGNDPFIGAMMQGWDLFAERE 373 (667) Q Consensus 334 ----------------------------------------~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 373 (667) ...+.+.++.+..... ........+...+++++...+ T Consensus 392 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~d~~~~~---~~~~~~~~~~~~~~~~l~~~~ 468 (749) T protein:vir:10 392 ASDGLFGQTAANRQFNLFRSAAGSVDYPAGVTTLGSKNNATYYYRLSGGVNYTVSA---GQYTITNTDIGSAYELIGDPE 468 (749) T ss_pred ccccccccccccceeeccccccccceeccccccccccCCcEEEEEccCCccccccc---ccccccchhHHHHHHHhhhhh Confidence 0011222222221111 111233456788888898888 Q ss_pred ccccccEEecCcCCcchhhHHHHHHHHHHHhhcCcEEEEEccCccccccccc-cCCHHHHHHHhhhhccccccccccCcc Q lcl|NC_012740. 374 SIHVNLLIAGACAGEGDAFSTVQKHAVSIGDERQDCLVMVSPPRSTVVNIPV-TTAIDNLIAWREGNSNYSDNNMNINTT 452 (667) Q Consensus 374 ~~~~~~l~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~ai~d~p~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~s~ 452 (667) ...+++++++.....+.+..+++++|++||++|++||+++|+|.....+... .....++..|++. .++|+ T Consensus 469 ~~~~~~li~~~~~~~~~~~~~v~~al~~~~~~~~~~~~~~d~p~~~~~~~~~~~~~~~~~~~~~~~---------~~~s~ 539 (749) T protein:vir:10 469 SQIVDFIISGPSGTSDANALAKITSLVNIAEERRDCMVFVSPRRGNVIGISNTTTITTNIVDFFKK---------LPSSS 539 (749) T ss_pred hcccceEEEecCCCCcchhHHHHHHHHHHHhhcCCEEEEEcCCCCcccccccchhhhhHHHHHHhh---------ccCce Confidence 8889998887766666778899999999999999999999999877665443 3445667777754 35789 Q ss_pred eEEEEehhhcccccccCceeEechHHHHHHHHHHhhhcCCceeeecceeccceeccccccccCChhhhhhhhhcCceEEE Q lcl|NC_012740. 453 YAVIDGNYKYQYDKYNDVNRWVPLAADIAGLCARTDAVSQPWMSPAGYNRGQIMNVVKLAIEPRKAHRDRLYQAAINPVI 532 (667) Q Consensus 453 ~~~~~~p~~~v~d~~~~~~~~~p~sg~vAg~~a~~d~~~g~~~span~~~~~i~g~~~~~~~~~~~e~~~Ln~~gIn~i~ 532 (667) |+++||||++++|+.+++.+++|||+++||+|||+|.++||||||+|+++.+|.|++++++.+++.|++.||++|||||+ T Consensus 540 ~~~~~~p~~~~~d~~~~~~~~~p~s~~vAGl~Ar~D~~~g~~~SPan~~~~~i~g~~~~~~~~~~~e~~~Ln~~gIn~i~ 619 (749) T protein:vir:10 540 YMVFDSGYKYIYDKYNDVYRYIPCNGDTAGLCLQTNEISEPWFSPAGFQRGVLRNAIKLAYTPNKAQRDQLYANRVNPIV 619 (749) T ss_pred eEEEEccceeeeccccCceEEechHHHHHHHHHHhhccCCcEECcCCceeeeeeccccceeecChhHHHhhhhCCceEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EecCCeEEEEcceecCCCcccceeeehhhhhHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhcCCeeeeEE Q lcl|NC_012740. 533 GAGGEGFILMGDKTATTVPSPFDRINVRRLFNMLKKNIGDSSKYKLFENNDNFTRASFRMEVSQYLSTIRSLGGIYDFRV 612 (667) Q Consensus 533 ~~~~~G~~~wG~rT~~~~~~~~~~i~vrR~~~~i~~~l~~~~~~~v~epn~~~~~~~i~~~i~~~l~~l~~~gal~g~~v 612 (667) +|+++|+++||+||+++.+++|+||||||||+||+++|++.++|+||||||+.+|++|+++|++||++||++|+|.||+| T Consensus 620 ~~~g~G~~~wG~rT~~s~d~~~~~i~vRRl~~~ie~si~~~~~~~v~epn~~~l~~~i~~~i~~fL~~l~~~G~i~~f~V 699 (749) T protein:vir:10 620 SFPGQGVVLYGDKTALGFASAFDRINIRRLFLTVERVISTAAKAQLFEQNDEAQRSLFINIVEPYLRDVQGRRGVVDFLV 699 (749) T ss_pred EecCCeEEEEcceecCCCCcccceeehhhhHHHHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHHHHHHHhcCCeeeeEE Confidence 99999999999999977667999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEcccCCCHHHhhCCeEEEEEEEEecCCceEEEEEEEEeecCeeHHHHHH Q lcl|NC_012740. 613 QCDTTNNTPDVIDRNEFVASMFIKPAKSINYIMLNFTAVATGADFDEIIG 662 (667) Q Consensus 613 ~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~~ 662 (667) +||+++||+++|++|+|+|+|+++|++|||||+|||+|++++++|+|+++ T Consensus 700 ~~d~~~Nt~~~i~~G~~~~~i~~~P~~pae~I~~~~~~~~~~~~~~e~~s 749 (749) T protein:vir:10 700 KCDSTNNTPEAVDRGEFYAEVFLKPTRTINYVQLTFVATRTGVSFAEVAS 749 (749) T ss_pred EEcCCCCCHHHhhCCEEEEEEEEEecCCccEEEEEEEEeecCcchHHHhC Confidence 99999999999999999999999999999999999999999999999999 No 14 >protein:vir:106984 Length: 743 # NCBI annotation: contractile sheath protein gp18 # Family: family:all:661 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195136;genbank:gi:58532913;uniprot:Q5GQN6;genbank:GeneID:3260481 Probab=100.00 E-value=2.7e-141 Score=791.22 Aligned_cols=635 Identities=31% Similarity=0.477 Sum_probs=412.6 Q ss_pred Cc-eecCceEEEEecCCCccccc-CCCceEEEeeccCCCCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHHcCCCeE Q lcl|NC_012740. 1 MT-LLSPGFETKETTLSTTIVQS-ATGRAALVGKFQWGPAFQIVQVTNEVELVNKFGQPDNNTADYFMSGANFLQYGNDL 78 (667) Q Consensus 1 ~~-~~~PGVyvee~~~~~~~~~~-~ts~~afvG~~~~Gp~~~p~~i~s~~e~~~~FG~~~~~~~~~~~v~~~f~ngG~~~ 78 (667) |. ||||||||||+|++++++++ +|+++||||+|+|||+|+|++|+||.||++.||++++.+|++|+|++||+|||++| T Consensus 1 m~~~~~Pgvyv~e~~~~~~~i~~~~t~~~~~vg~~~~Gp~~~p~~i~s~~~~~~~fG~~~~~~~~~~~v~~~f~ngg~~~ 80 (743) T protein:vir:10 1 MASQVSPGILIKERDLTNAVVTGALQIRAAHASTFAKGPIGDIVNINTQKELVSVFGEPKEDNAEDWMVASEFLNYGGRL 80 (743) T ss_pred CccccCCceEEEEecCCCceeccCCcceeEEEEeccCCCCCcCEEecCHHHHHHHcCCccCCcchHHHHHHHHHhCCceE Confidence 65 99999999999999988776 69999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEcCCccccccccccccc-------------ccceeeeccccccccceeeEeeeccceecc--ccceeecccccceee Q lcl|NC_012740. 79 RVVRVLNKEKAKNATALAGN-------------VEFEITNEGSNYEVGDTIKIKHNRQDIETA--GKVTKVDGDGKVKGV 143 (667) Q Consensus 79 ~vvRv~~~~~~~~a~~~~~~-------------~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~--~~~~~~d~~~~~~~~ 143 (667) |||||.+++.. +++..... ...........+.||+.+.+.......... ......+........ T Consensus 81 ~vvrv~~~~~~-~a~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~G~~gN~i~V~v~~~~~d~~~~~~~~~~~~~~~~~~~ 159 (743) T protein:vir:10 81 AVVRAETTGVL-NATTGSAGVLVKNRESWDAGSGNGEVFVARTAGSWGNSLMGVLVDRGADYIVTFAATPTDTAVGTQLL 159 (743) T ss_pred EEEEccCcccc-ccccccccccccccccccccccceeEEEEeeccccccceEEEEecCCCcceeeeeccccccccceeee Confidence 99999876532 22211111 011111223455677776665543211000 000000000000000 Q ss_pred eecccc-----c-----cccccccccc-----------------ccccccceEEEEE---------eecccccceeeece Q lcl|NC_012740. 144 FIPTGK-----I-----IAHAKAIGVY-----------------PELDGGWTAEFTS---------SSGNGSAALSVTKI 187 (667) Q Consensus 144 ~~~~~~-----~-----~~~a~~~~~~-----------------~~~~~~~~~~~~~---------~~~~~~~~~t~~~~ 187 (667) ...... . ......+... ............. ..+........... T Consensus 160 ~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tv~ 239 (743) T protein:vir:10 160 FSYSGTLVTGEILSYDSATNTATITASGTLTSQYLLDTPEQGLIGSFTDNSTTEVGRTPGTYSNVPASGGTGTGATFNVV 239 (743) T ss_pred ecccccccccceeeeeecCcceeeeeccccceeeecccccccccccccccccccccccccceeeEEeccccccccccccc Confidence 000000 0 0000000000 0000000000000 00000000000000 Q ss_pred eeeceeeeeeccccchhhhccccccccccccceee------------eeeccccccceeEEEEeecccccccce---eee Q lcl|NC_012740. 188 VTDSGLLLTDLETSRANITNQDFLTKLKKYDMPAV------------SAIYAGEIGNSLEVEILARSSFSGAVA---PEL 252 (667) Q Consensus 188 v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------------~A~~~G~~gn~i~v~i~~~a~~~~~~~---~~~ 252 (667) +.+.....+...... ...............+ .......++ .+.+........... ... T Consensus 240 v~~~~~~vg~~v~~~----~~~~~~~~~~~~~~~v~~~~~~~~t~~~~~~~~~~~g---~~~~~a~~~~~~~~~~~~~~~ 312 (743) T protein:vir:10 240 VADAGGGVGGSVVVT----LANPGTGYNQGETLTIASAATGDGTDILVTVATLSDG---TIAITELKDWYLNTEIGSTGI 312 (743) T ss_pred ccccccccccccccc----cccccceeeeccccccccccccccccchhheeccccc---ceeeeecccccccchhhcccc Confidence 000000000000000 0000000000000000 000000000 000100000000000 000 Q ss_pred eeeecccccccceeeeeccccccccce--------eeeeccceeeeeEe-eeccCCccccccccccchhhhcccccceEE Q lcl|NC_012740. 253 TMYPFGGTRAAARNLIPYAPQNDNQYA--------FIVRRDGVVVESYV-LSTLKGDKDVYGNSIYMDDFFARGSSQYIY 323 (667) Q Consensus 253 t~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~v~~~g~v~e~~~-~s~~~~~~~~~~~~~~~~~~~~~~~s~~v~ 323 (667) +............. ........+.+. ......+.++|++. ++...+.++..+...++..++ +..+.++. T Consensus 313 ~~~~~~~~~~t~~~-~~~~~~~~d~~~v~v~~~~~~~~~~~~~v~~~~~~~s~~~~~~~~~~~~~~~~~~~-~~~s~~~~ 390 (743) T protein:vir:10 313 KLGDIGPRPGTSQF-ATDNGITDDQVHFAVIDTTGELTGTANTIVERLTYLSKLSDARSEENANIYYKNVI-NEQSAYLY 390 (743) T ss_pred ccccccccceeeec-cccccccccceEEEEecCcceeeeccCceeEEEeeeecccccccccCcceeeccee-ccccceee Confidence 00000000000000 000001111111 12344567778886 677777777777777777665 33455543 Q ss_pred EecccccC-------------------------cccceEEecCCccccccccccccccccccchhHHHHHHhhhcccccc Q lcl|NC_012740. 324 ATAQGWVD-------------------------GFSGIISLAGGVSANEASTGDRGNDPFIGAMMQGWDLFAERESIHVN 378 (667) Q Consensus 324 ~~~~~~~~-------------------------~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 378 (667) ........ .....+.+.+|.+ +......+...++.++...+.++++ T Consensus 391 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gG~d---------~~~~~~~~~~~~~~~~~~~~~~~~~ 461 (743) T protein:vir:10 391 HGNDAAVQIAASGEAWGQSSDQVLADAGTAFSRTTGYWVNLAGGND---------DFAYDAGEFGAAMDLFLDTEETEID 461 (743) T ss_pred ccCcccceeeeccccCccccceeeeecccccccccceEEEeecCcc---------ccccchhHHHHHHHHhhhccccCcc Confidence 32211100 0011112222222 2223445667788888888888899 Q ss_pred cEEecCcCCcchhhHHHHHHHHHHHhhcCcEEEEEccCcccccc------ccccCCHHHHHHHhhhhccccccccccCcc Q lcl|NC_012740. 379 LLIAGACAGEGDAFSTVQKHAVSIGDERQDCLVMVSPPRSTVVN------IPVTTAIDNLIAWREGNSNYSDNNMNINTT 452 (667) Q Consensus 379 ~l~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~ai~d~p~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~ 452 (667) +|++|+..+...+..+++++|++||+++++||+++|+|...... .....+.+++..|++. .++|+ T Consensus 462 ll~~p~~~~~~~~~~~v~~a~~~~~~~~~~~~a~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~s~ 532 (743) T protein:vir:10 462 FVLMGGSMADEADTKSKATKVIAIAASRKDALAFVSPHKGNQIASTGNVALSSAQQKENTIAFFSD---------LTSTS 532 (743) T ss_pred eEEecCcccCccchHHHHHHHHHHHHhhCCeEEEEecCCCccccccccccccccccchHHHHHHHh---------ccCCe Confidence 99999998888888999999999999999999999999765432 2334455666666653 46899 Q ss_pred eEEEEehhhcccccccCceeEechHHHHHHHHHHhhhcCCceeeecceeccceeccccccccCChhhhhhhhhcCceEEE Q lcl|NC_012740. 453 YAVIDGNYKYQYDKYNDVNRWVPLAADIAGLCARTDAVSQPWMSPAGYNRGQIMNVVKLAIEPRKAHRDRLYQAAINPVI 532 (667) Q Consensus 453 ~~~~~~p~~~v~d~~~~~~~~~p~sg~vAg~~a~~d~~~g~~~span~~~~~i~g~~~~~~~~~~~e~~~Ln~~gIn~i~ 532 (667) |+++||||++++|+.++..+++|||+++||+|||+|.++||||||+|+++.+|.|++++++.+++.|++.||++|||||+ T Consensus 533 ~~~~~~p~~~~~d~~~~~~~~~p~s~~~AGl~a~~D~~~g~~~span~~~~gi~g~~~~~~~~~~~~~~~Ln~~gIn~i~ 612 (743) T protein:vir:10 533 YAVFDSGYKYVYDRFTDKYRYIPCNGDVAGLCVQTSNQLDDWYSPAGLNRGGILNAVKLAYNPNKADRDELYQNRINPVV 612 (743) T ss_pred eEEEEccceeeeccccCceeEechhHHHHHHHHHhhccCCcEEccCCeeeeeeeccccceecCChhHHHhHhhCCceEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EecCCeEEEEcceecCCCcccceeeehhhhhHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhcCCeeeeEE Q lcl|NC_012740. 533 GAGGEGFILMGDKTATTVPSPFDRINVRRLFNMLKKNIGDSSKYKLFENNDNFTRASFRMEVSQYLSTIRSLGGIYDFRV 612 (667) Q Consensus 533 ~~~~~G~~~wG~rT~~~~~~~~~~i~vrR~~~~i~~~l~~~~~~~v~epn~~~~~~~i~~~i~~~l~~l~~~gal~g~~v 612 (667) +|+++|+++||+||+++++++||||||||||+||+++|++.++|+|||||++.+|++|+++|++||++||++|+|.||+| T Consensus 613 ~~~~~G~~~wG~rT~~s~d~~~~~i~vrR~~~~i~~si~~~~~~~v~e~n~~~~~~~i~~~i~~fL~~l~~~gal~~~~V 692 (743) T protein:vir:10 613 SLRGQGITLFGDKTALAAPSAFDRINVRRLFLNLEKRARRLAEGVLFEQNDATTRAGFSSALNSYLSEVQARRGVTDYLV 692 (743) T ss_pred EecCCeEEEEcccccCCCCcccceEeehhhHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhcCceeeeEE Confidence 99999999999999987778999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEcccCCCHHHhhCCeEEEEEEEEecCCceEEEEEEEEeecCeeHHHHHHH Q lcl|NC_012740. 613 QCDTTNNTPDVIDRNEFVASMFIKPAKSINYIMLNFTAVATGADFDEIIGP 663 (667) Q Consensus 613 ~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~~~ 663 (667) +||+++||+++|++|+|+|+|+++|++|+|||+|||+|++++++|+|++++ T Consensus 693 ~~d~~~nt~~~i~~G~~~~~i~~~p~~pae~I~~~~~~~~~~~~~~e~~~~ 743 (743) T protein:vir:10 693 ICDESNNTPDIIDRNEFVAEVYVKPTRSINFITITFTATKTGVTFSEVVGR 743 (743) T ss_pred EEcCCCCCHHHhhCCeEEEEEEEEecCCcceEEEEEEEeecCcchHhhhcC Confidence 999999999999999999999999999999999999999999999999999 No 15 >protein:vir:104858 Length: 729 # NCBI annotation: T4-like tail sheath protein # Family: family:all:661 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214361;genbank:gi:61806001;genbank:GeneID:3294378 Probab=100.00 E-value=6.8e-140 Score=783.54 Aligned_cols=650 Identities=30% Similarity=0.449 Sum_probs=403.7 Q ss_pred CceecCceEEEEecCCCccccc-CCCceEEEeeccCCCCCccEEecCHHHHHHHcCCcC--ccchhHHHHHHHHHcCCCe Q lcl|NC_012740. 1 MTLLSPGFETKETTLSTTIVQS-ATGRAALVGKFQWGPAFQIVQVTNEVELVNKFGQPD--NNTADYFMSGANFLQYGND 77 (667) Q Consensus 1 ~~~~~PGVyvee~~~~~~~~~~-~ts~~afvG~~~~Gp~~~p~~i~s~~e~~~~FG~~~--~~~~~~~~v~~~f~ngG~~ 77 (667) |+|+||||||||+|+++++|++ +||++||||+|+|||+|+|++|+||.||+|+||+|. +.++++|+++.||+|||++ T Consensus 3 ~~~~~PgVyv~e~~~~~~~i~~v~ts~~~fvG~~~~Gp~~~p~~i~s~~~~~~~fG~~~~~~~~~~~~~~~~~f~ngg~~ 82 (729) T protein:vir:10 3 LNLASPGIVVREVDLTIGRVDPTSGSIGALVAPFAKGPVNDPQLIESEEDLLQTFGQPYSTDKHYEYWMVASSYLAYGGT 82 (729) T ss_pred ccccCCceEEEEecCCCcccccccccceeEEeccccCCCccCeEcCCHHHHHHHcCccccCCcchhHHHHHHHHHhCCce Confidence 7799999999999999988876 699999999999999999999999999999999984 5678899999999999999 Q ss_pred EEEEEcCCcccccccccccccccceeeeccc---------------cccccceeeEeeeccceeccccce-eecccccce Q lcl|NC_012740. 78 LRVVRVLNKEKAKNATALAGNVEFEITNEGS---------------NYEVGDTIKIKHNRQDIETAGKVT-KVDGDGKVK 141 (667) Q Consensus 78 ~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~---------------~~~~g~~~~~~~~~~~~~~~~~~~-~~d~~~~~~ 141 (667) ||||||.++++...+................ ....+..+.+.......+.+.... ..+...... T Consensus 83 ~~vvRv~~~~~~~a~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~G~~gn~~~v~v~~~~~~~~ 162 (729) T protein:vir:10 83 MQVVRADDYNTQTGVGLKNAFVTGGVGVGATMLRITSNTHYNQLGYDENTISGVSVAAKNPGTWANGIKVAIIDGKADQI 162 (729) T ss_pred EEEEecCcccccccccccccccccccccccccccccccccccccccccCCCcceEEEEeccCccccceeeEEecccCcce Confidence 9999998765433221111000000000000 000111122222222222221111 111100000 Q ss_pred eeeecccccccccccccc--ccc-ccccceEEEEEeec----ccccceeeeceeeeceeeeeeccccchhhhcccccccc Q lcl|NC_012740. 142 GVFIPTGKIIAHAKAIGV--YPE-LDGGWTAEFTSSSG----NGSAALSVTKIVTDSGLLLTDLETSRANITNQDFLTKL 214 (667) Q Consensus 142 ~~~~~~~~~~~~a~~~~~--~~~-~~~~~~~~~~~~~~----~~~~~~t~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~ 214 (667) ..+. ............. ... ........+..... .................... ........ ....... T Consensus 163 ~~~~-~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~s~~~~~~~--~~~~~~~~-~~~~~~~ 238 (729) T protein:vir:10 163 LTVA-SGNTTAVGSAVTQSISKTIGTATGTTTIDGVLKGIVTGSTDTTLEVKVISHISAAGV--ETAVEYQQ-NGTYTFD 238 (729) T ss_pred eeee-ccccccceeeeeeeccccccccccceeeeeeecccccccccccccceeccccccccc--ceeccccc-cceeeec Confidence 0000 0000000000000 000 00000000000000 00000000000000000000 00000000 0000000 Q ss_pred ccccceeeeeeccccccceeEEEEeecccccccceeeeeeee-cccccccceeeeecccccc--------ccceeeeecc Q lcl|NC_012740. 215 KKYDMPAVSAIYAGEIGNSLEVEILARSSFSGAVAPELTMYP-FGGTRAAARNLIPYAPQND--------NQYAFIVRRD 285 (667) Q Consensus 215 ~~~~~~~~~A~~~G~~gn~i~v~i~~~a~~~~~~~~~~t~~~-~~~~~~~~~~~~~~~~~~~--------~~~~~~v~~~ 285 (667) .............+............................ ................... +.....+... T Consensus 239 ~~~s~~~~a~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~d~~~~~~~d~~~~~~~~~ 318 (729) T protein:vir:10 239 NSGSVNVIAAGSSGSGSAKSYTAQTDWFESQNIVLSNSTLEWDSIADAPGTSTYVSTRGGKNDEIHVLVIDDKGTITGNS 318 (729) T ss_pred ccCccceeeeccccccccccceeeeccccccccccccccccccccccccccccccccccccccccceeeeccccccccCc Confidence 000111111111110000000000000000000000000000 0000000000000000000 0001123445 Q ss_pred ceeeeeEe-eeccCCccccccccccchhhhcccccceEEEecccc------------------------------cCccc Q lcl|NC_012740. 286 GVVVESYV-LSTLKGDKDVYGNSIYMDDFFARGSSQYIYATAQGW------------------------------VDGFS 334 (667) Q Consensus 286 g~v~e~~~-~s~~~~~~~~~~~~~~~~~~~~~~~s~~v~~~~~~~------------------------------~~~~~ 334 (667) +.++|.+. ++...+.+...+...++.+++. ..|.++....... ..... T Consensus 319 g~vve~~~~~s~~~~~~~~~~~~~~~~~vi~-~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~ 397 (729) T protein:vir:10 319 GTILEKHLSLSKAKDAEYSVGSSSYWRDFLA-TNSKYIFGGGATSGITTTGYSVSSTNTLDTDSGWDQNAEGVNFGASGV 397 (729) T ss_pred ccceeeeeeeeeccccccccccccccceeec-cccceeeecccccccccccccccccceeccccccccccccccccccce Confidence 66677764 6777777777777766666653 2344443321110 00112 Q ss_pred ceEEecCCcccccccccccc--ccccccchhHHHHHHhhhcccccccEEecCcCCcchhhHHHHHHHHHHHhhcCcEEEE Q lcl|NC_012740. 335 GIISLAGGVSANEASTGDRG--NDPFIGAMMQGWDLFAERESIHVNLLIAGACAGEGDAFSTVQKHAVSIGDERQDCLVM 412 (667) Q Consensus 335 ~~~~~~~g~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~ai 412 (667) ..+.+++|.+.......... ......+...++.++...+.+.++++++++...++.+...++.+|++||+++++|+++ T Consensus 398 ~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~g~~~l~~~~~~~~~~~~~~~~~~~~~~~~~v~~a~~~~~~~~~~~~a~ 477 (729) T protein:vir:10 398 ATLTLAGGTNYGDKTDLTTSGALSSGVDDIISGYTLFENTEEIEVDFILMGAAHHPKEQSQAVAEKVTAVAEARKDAVAF 477 (729) T ss_pred eEEEeecccccccccccccccccccchhHHHHHHHHhhcccccccceeeecCCCCCccchHHHHHHHHHHHHhcCCeEEE Confidence 23345555443322221111 1122344577888888888888888888877777778899999999999999999999 Q ss_pred EccCcccccccc---------ccCCHHHHHHHhhhhccccccccccCcceEEEEehhhcccccccCceeEechHHHHHHH Q lcl|NC_012740. 413 VSPPRSTVVNIP---------VTTAIDNLIAWREGNSNYSDNNMNINTTYAVIDGNYKYQYDKYNDVNRWVPLAADIAGL 483 (667) Q Consensus 413 ~d~p~~~~~~~~---------~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~v~d~~~~~~~~~p~sg~vAg~ 483 (667) +|+|+...+.-. ..+..+++..|++.+ .+|+|+++||||++++|+.++..+++|||+++||+ T Consensus 478 ~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~p~~~~~d~~~~~~~~~p~s~~~aGl 548 (729) T protein:vir:10 478 ISPYRQAFLNDTVSGTVTVSNIDQTTENVVGFYAPL---------SSSTYSVFDSGYKYMFDRFNNTFRYVPLNGDIAGT 548 (729) T ss_pred ecccccccccccccccccccccchhhHHHHHHHhhc---------cCCceEEEEcCeeEEecccCCceEEechhHHHHHH Confidence 999976544321 223445556666532 36889999999999999999999999999999999 Q ss_pred HHHhhhcCCceeeecceeccceeccccccccCChhhhhhhhhcCceEEEEecCCeEEEEcceecCCCcccceeeehhhhh Q lcl|NC_012740. 484 CARTDAVSQPWMSPAGYNRGQIMNVVKLAIEPRKAHRDRLYQAAINPVIGAGGEGFILMGDKTATTVPSPFDRINVRRLF 563 (667) Q Consensus 484 ~a~~d~~~g~~~span~~~~~i~g~~~~~~~~~~~e~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~~~~~~~~~i~vrR~~ 563 (667) |||+|.++||||||+|+++.+|.|+.++++.+++.|++.||++|||||++|+++|+++||+||+++.+++|+|||||||| T Consensus 549 ~a~~d~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~~~d~~~~~i~vrR~~ 628 (729) T protein:vir:10 549 CARTDIEQFPWFSPAGTARGPILNSVKLVYNPGKKQRDILYSNRINPVILSPGAGIILFGDKTGFGKSSAFDRINVRRLF 628 (729) T ss_pred HHHhhccCCcEEccCCccccceecccceeeecChhhHhhhhhCCceEEEEecCCeEEEEcceecCCCCcccceeehhhhH Confidence 99999999999999999999999999999999999999999999999999999999999999997666799999999999 Q ss_pred HHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhcCCeeeeEEEEcccCCCHHHhhCCeEEEEEEEEecCCceE Q lcl|NC_012740. 564 NMLKKNIGDSSKYKLFENNDNFTRASFRMEVSQYLSTIRSLGGIYDFRVQCDTTNNTPDVIDRNEFVASMFIKPAKSINY 643 (667) Q Consensus 564 ~~i~~~l~~~~~~~v~epn~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~ 643 (667) +||+++|++.++|+||||||+.+|++|+++|++||++||++|+|.||+|+||+++||++||++|+|+|+|+++|++|+|| T Consensus 629 ~~i~~si~~~~~~~v~epn~~~~~~~i~~~i~~~L~~l~~~g~l~g~~v~~d~~~nt~~~i~~G~~~~~v~~~p~~p~e~ 708 (729) T protein:vir:10 629 IYLEDAISAAAKDQLFEFNDELTRTNFVNIVEPFLRDVQAKRGIFDFVVICDETNNTAAVIDSNEFVADIFIKPARSINF 708 (729) T ss_pred HHHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHHHHHHHhccceeeeEEEEcCCCCCHHHhhCCeEEEEEEEEecCCccE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEEEEeecCeeHHHHHHHH Q lcl|NC_012740. 644 IMLNFTAVATGADFDEIIGPA 664 (667) Q Consensus 644 i~~~~~~~~~~~~~~e~~~~~ 664 (667) |+|||+|++++++|+|++++| T Consensus 709 i~~~~~~~~~~~~~~e~~~~~ 729 (729) T protein:vir:10 709 IGLTFVATRTGVAFEEVIGSV 729 (729) T ss_pred EEEEEEEeecCccHHHHHhcC Confidence 999999999999999999999 No 16 >protein:vir:79092 Length: 477 # NCBI annotation: gp13, phage tail sheath protein # Family: family:all:115 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111213;genbank:gi:134288804;genbank:GeneID:4960766 Probab=100.00 E-value=1.3e-107 Score=606.51 Aligned_cols=467 Identities=18% Similarity=0.163 Sum_probs=327.0 Q ss_pred Cc-eecCceEEEEecCCCccccc-CCCceEEEeeccCCCCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHHcCCCeE Q lcl|NC_012740. 1 MT-LLSPGFETKETTLSTTIVQS-ATGRAALVGKFQWGPAFQIVQVTNEVELVNKFGQPDNNTADYFMSGANFLQYGNDL 78 (667) Q Consensus 1 ~~-~~~PGVyvee~~~~~~~~~~-~ts~~afvG~~~~Gp~~~p~~i~s~~e~~~~FG~~~~~~~~~~~v~~~f~ngG~~~ 78 (667) |. |++|||||||++++++++.. +|+|++|||++++||+|+|++|+||.||++ ||+.....+++++++.||.|||++| T Consensus 1 M~~~~~pGVyv~E~~~g~~~I~~v~Tsv~~~VG~a~~~p~n~pv~its~~d~~~-~g~~~~~~tL~~Av~~~f~ngg~~~ 79 (477) T protein:vir:79 1 MAANYLHGVETIEKETGSRPVKVVKSAVIGLIGTAPIGPVNTPVQSLSDVDAAQ-FGPQLAGFTIPQALDAVYDYGSGTV 79 (477) T ss_pred CcCCCCCCeEEEEecCCcccccccCCceEEEEeecccCCCcccEEEccHHHHHH-hcCCCCCCcHHHHHHHHhhcCCceE Confidence 65 77999999999999887764 799999999999999999999999999986 8888888999999999999999999 Q ss_pred EEEEcCCcccccccccccccccceeeeccccccccceeeEeeeccceeccccceeecccccceeeeeccccccccccccc Q lcl|NC_012740. 79 RVVRVLNKEKAKNATALAGNVEFEITNEGSNYEVGDTIKIKHNRQDIETAGKVTKVDGDGKVKGVFIPTGKIIAHAKAIG 158 (667) Q Consensus 79 ~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~a~~~~ 158 (667) |||||.++.......... .. ... .. .... T Consensus 80 ~vvrV~~~~~~~~~~a~~---------~~-------------------------~~~--~~---------------~~~~ 108 (477) T protein:vir:79 80 IVINVLDPAVHKSNAASE---------SV-------------------------TFD--AA---------------TGRA 108 (477) T ss_pred EEEeccCCcccccccccc---------cc-------------------------ccc--cc---------------cccc Confidence 999997654322110000 00 000 00 0000 Q ss_pred ccccccccceEEEEEeecccccceeeeceeeeceeeeeeccccchhhhccccccccccccceeeeeeccccccceeEEEE Q lcl|NC_012740. 159 VYPELDGGWTAEFTSSSGNGSAALSVTKIVTDSGLLLTDLETSRANITNQDFLTKLKKYDMPAVSAIYAGEIGNSLEVEI 238 (667) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~t~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~A~~~G~~gn~i~v~i 238 (667) .. ... ....+. + T Consensus 109 ~~------------------------~~~------------------------------------------~~~~~~--v 120 (477) T protein:vir:79 109 KL------------------------AHP------------------------------------------AAANLV--L 120 (477) T ss_pred cc------------------------ccc------------------------------------------ccceeE--E Confidence 00 000 000000 0 Q ss_pred eecccccccceeeeeeeecccccccceeeeeccccccccceeeeeccceeeeeEeeeccCCccccccccccchhhhcccc Q lcl|NC_012740. 239 LARSSFSGAVAPELTMYPFGGTRAAARNLIPYAPQNDNQYAFIVRRDGVVVESYVLSTLKGDKDVYGNSIYMDDFFARGS 318 (667) Q Consensus 239 ~~~a~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~v~e~~~~s~~~~~~~~~~~~~~~~~~~~~~~ 318 (667) ....... .. .... ...+.. ..... ... ... ..... T Consensus 121 ~~~~~~~--------~~-------------~~~~----~~~~~~-~~~~~-~~~--~~~-~~~~~--------------- 155 (477) T protein:vir:79 121 KNDSGGT--------TY-------------TEGT----DYAVDL-INGVI-TRI--KTG-TIPAA--------------- 155 (477) T ss_pred eeccccc--------cc-------------ccCc----cccccc-cchhh-hhh--hcc-ccccc--------------- Confidence 0000000 00 0000 000000 00000 000 000 00000 Q ss_pred cceEEEecccccCcccceEEecCCccccccccccccccccccchhHHHHHHhhhc---ccccccEEecCcCCcchhhHHH Q lcl|NC_012740. 319 SQYIYATAQGWVDGFSGIISLAGGVSANEASTGDRGNDPFIGAMMQGWDLFAERE---SIHVNLLIAGACAGEGDAFSTV 395 (667) Q Consensus 319 s~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~l~~~~~~~~~~~~~~v 395 (667) ...+....... ++.........+.........++.++...+ ...+++++.|+. .+..+| T Consensus 156 ~~~~~~~~~~~--------------~~~~~~~~~~~g~~~a~~~~tg~~al~~~~~~~~~~~~iv~apg~----~~~~~v 217 (477) T protein:vir:79 156 ATAAKATYDYA--------------DPTKVTAADIIGAVNAAGMRTGMKALKDTYNLYGYFSKILIAPAY----CTQNSV 217 (477) T ss_pred cceeeceeccC--------------Ccccceeeeecccccccccchhhhhhhhhhhhcccccceeecccc----ccchhH Confidence 00000000000 000000000011111112233333333222 234566777765 345678 Q ss_pred HHHHHHHHhhcCcEEEEEccCccccccccccCCHHHHHHHhhhhccccccccccCcceEEEEehhhcccccccCceeEec Q lcl|NC_012740. 396 QKHAVSIGDERQDCLVMVSPPRSTVVNIPVTTAIDNLIAWREGNSNYSDNNMNINTTYAVIDGNYKYQYDKYNDVNRWVP 475 (667) Q Consensus 396 ~~~~~~~~~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~v~d~~~~~~~~~p 475 (667) +.+|.++|+++ +||+++|+|. ..+.+++.+|++..... ..+++|+|+++||||++++|+.++..+++| T Consensus 218 ~~~l~~~~~~~-~~~a~~d~p~--------~~~~~~~~~~~~~~~~~---~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p 285 (477) T protein:vir:79 218 SVELEAMAVQL-GAIAYIDAPI--------GTTLAQALAGRGPAGTI---NFNTSSDRVRLCYPHVKVYDIATNAERLEP 285 (477) T ss_pred HHHHHHHHhhc-CeEEEEecCC--------CCChHHHhhhhhhcccc---ccccccceEEEEcCeeEEecccCCceeeec Confidence 99999999987 4899999874 45677888998866543 357899999999999999999999999999 Q ss_pred hHHHHHHHHHHhhhcCCceeeecceeccceeccc---cccccCChhhhhhhhhcCceEEEEecCCeEEEEcceecCC--C Q lcl|NC_012740. 476 LAADIAGLCARTDAVSQPWMSPAGYNRGQIMNVV---KLAIEPRKAHRDRLYQAAINPVIGAGGEGFILMGDKTATT--V 550 (667) Q Consensus 476 ~sg~vAg~~a~~d~~~g~~~span~~~~~i~g~~---~~~~~~~~~e~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~~--~ 550 (667) ||+++||+|||+|.++||||||+|+++.++.++. ......++.|++.||++|||||++|+++|+++||+||++. + T Consensus 286 ~s~~~ag~~a~~d~~~g~~~span~~~~gv~~~~~~~~~~~~~~~~~~~~L~~~~i~~i~~~~~~G~~~wG~rT~~~~~~ 365 (477) T protein:vir:79 286 LSSRAAGLRARVDLDKGYWWSSSNQQLVGVTGVERPLSAMIDDPQSDVNMLNEQGITTVFSSYGSGLRLWGNRTAAWPTV 365 (477) T ss_pred hHHHHHHHHHHhhccCCceEccCCceeecceecccccccccCCChhhHHHHhhCCceEEEEecCCcEEEEcccccCCCCC Confidence 9999999999999999999999999987777653 2334456689999999999999999999999999999963 3 Q ss_pred cccceeeehhhhhHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhcCCeeeeEEEEcccCCCHHHhhCCeEE Q lcl|NC_012740. 551 PSPFDRINVRRLFNMLKKNIGDSSKYKLFENNDNFTRASFRMEVSQYLSTIRSLGGIYDFRVQCDTTNNTPDVIDRNEFV 630 (667) Q Consensus 551 ~~~~~~i~vrR~~~~i~~~l~~~~~~~v~epn~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~ 630 (667) +++||||||||+|++|+++|++.++|+|||||++.+|++|+++|++||++||++|+|+||+|+||+++||++||++|+|+ T Consensus 366 ~~~~~~i~vrR~~~~i~~~~~~~~~~~v~e~~~~~~~~~i~~~i~~~l~~l~~~g~l~g~~v~~~~~~nt~~~i~~G~~~ 445 (477) T protein:vir:79 366 THMRNFENVRRTGDVINESLRYFSQQFVDAPIDQGLIDSLVESVNGFGRKLIGDGALLGFKAWFDPARNPKEELAAGHLL 445 (477) T ss_pred CccceeeehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHhhCCeEE Confidence 45799999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEEEecCCceEEEEEEEEeecCeeHHHHHHHH Q lcl|NC_012740. 631 ASMFIKPAKSINYIMLNFTAVATGADFDEIIGPA 664 (667) Q Consensus 631 ~~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~~~~ 664 (667) ++|+++|++|+|||+|++++.... |+++.+== T Consensus 446 ~~i~~~p~~p~e~i~~~~~~~~~~--~~~~~~~~ 477 (477) T protein:vir:79 446 INYKYTVPPPLERLTYETEITSEY--LLTLKGGN 477 (477) T ss_pred EEEEEEecCCceeEEEEEEEechH--HhhhccCC Confidence 999999999999999999998777 44433221 No 17 >protein:vir:107865 Length: 477 # NCBI annotation: gp39 # Family: family:all:115 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024712;genbank:gi:48696949;genbank:GeneID:2845953 Probab=100.00 E-value=1.5e-106 Score=600.75 Aligned_cols=467 Identities=18% Similarity=0.158 Sum_probs=329.1 Q ss_pred Cc-eecCceEEEEecCCCcccc-cCCCceEEEeeccCCCCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHHcCCCeE Q lcl|NC_012740. 1 MT-LLSPGFETKETTLSTTIVQ-SATGRAALVGKFQWGPAFQIVQVTNEVELVNKFGQPDNNTADYFMSGANFLQYGNDL 78 (667) Q Consensus 1 ~~-~~~PGVyvee~~~~~~~~~-~~ts~~afvG~~~~Gp~~~p~~i~s~~e~~~~FG~~~~~~~~~~~v~~~f~ngG~~~ 78 (667) |. |++|||||||++++++++. ++|+|++|||++++||+|+|++|+||.|| +.||+.....++++|+++||.|||++| T Consensus 1 M~~~~~pGVyv~E~~~~~~~i~~v~T~v~~~VG~a~~gp~n~pv~its~~d~-~~~g~~~~~~tL~~Av~~~f~nGg~~~ 79 (477) T protein:vir:10 1 MAANYLHGVETIEKETGSRPVKVVKSAVIGLIGTAPIGPVNTPVQSLSDVDA-AQFGPQLAGFTIPQALDAVYDYGSGTV 79 (477) T ss_pred CcccCCCCeEEEEccCCcccccccCCceeEEEecccCCCCCcCEEEccHHHH-HHhccCCCCCcHHHHHHHHHhccceEE Confidence 65 6789999999999987765 47999999999999999999999999999 569999999999999999999999999 Q ss_pred EEEEcCCcccccccccccccccceeeeccccccccceeeEeeeccceeccccceeecccccceeeeeccccccccccccc Q lcl|NC_012740. 79 RVVRVLNKEKAKNATALAGNVEFEITNEGSNYEVGDTIKIKHNRQDIETAGKVTKVDGDGKVKGVFIPTGKIIAHAKAIG 158 (667) Q Consensus 79 ~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~a~~~~ 158 (667) |||||.++.......... . . ..... .. T Consensus 80 ~vVrV~~~~~~~~~~~~~-------------------~------------~--~~~~~--------------------~~ 106 (477) T protein:vir:10 80 IVINVLDPAVHKSNAANE-------------------P------------V--TFDAA--------------------TG 106 (477) T ss_pred EEEecCcccccccccccc-------------------c------------c--ccccc--------------------cc Confidence 999997654321110000 0 0 00000 00 Q ss_pred ccccccccceEEEEEeecccccceeeeceeeeceeeeeeccccchhhhccccccccccccceeeeeeccccccceeEEEE Q lcl|NC_012740. 159 VYPELDGGWTAEFTSSSGNGSAALSVTKIVTDSGLLLTDLETSRANITNQDFLTKLKKYDMPAVSAIYAGEIGNSLEVEI 238 (667) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~t~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~A~~~G~~gn~i~v~i 238 (667) .. . ..+.+.+...+.. T Consensus 107 ~~------------------------~----------------------------------------~~~~~~~~~~v~~ 122 (477) T protein:vir:10 107 RA------------------------K----------------------------------------LAHPAAANLVLKN 122 (477) T ss_pred ee------------------------c----------------------------------------ccccccccccccc Confidence 00 0 0000000000000 Q ss_pred eecccccccceeeeeeeecccccccceeeeeccccccccceeeeeccceeeeeEeeeccCCccccccccccchhhhcccc Q lcl|NC_012740. 239 LARSSFSGAVAPELTMYPFGGTRAAARNLIPYAPQNDNQYAFIVRRDGVVVESYVLSTLKGDKDVYGNSIYMDDFFARGS 318 (667) Q Consensus 239 ~~~a~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~v~e~~~~s~~~~~~~~~~~~~~~~~~~~~~~ 318 (667) .... . ...... ...+.....+ ........ ..... T Consensus 123 ~a~~---------------------~----~~~~~~--~~~~~~~~~~-~~~~~~~~------------------~~~~~ 156 (477) T protein:vir:10 123 DSGG---------------------T----TYAEGT--DYAVDLINGV-ITRIKTGT------------------IPPGA 156 (477) T ss_pred cccc---------------------c----ccccch--hhhhhhcccc-ceeccccc------------------ccccc Confidence 0000 0 000000 0000000000 00000000 00000 Q ss_pred cceEEEecccccCcccceEEecCCccccccccccccccccccchhHHHHHHhhhc---ccccccEEecCcCCcchhhHHH Q lcl|NC_012740. 319 SQYIYATAQGWVDGFSGIISLAGGVSANEASTGDRGNDPFIGAMMQGWDLFAERE---SIHVNLLIAGACAGEGDAFSTV 395 (667) Q Consensus 319 s~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~l~~~~~~~~~~~~~~v 395 (667) . .+.+ ....+. ..........+.........++.++...+ ...+.++++|+.. +..+| T Consensus 157 ~-~~~~-------------~~~~~~-~~~~~~~~~~g~~~~~~~~tGl~al~~~~~~~~~~~~~l~apg~~----~~~~v 217 (477) T protein:vir:10 157 T-AAKA-------------TYDYAD-PTKVTAADIIGAVNAAGMRTGMKALKDTYNLYGYFSKILIAPAYC----TQNSV 217 (477) T ss_pred e-eeee-------------cccccc-ccccccccccccccccchhhhhhhhhhhhhhcchhcccccccccc----cchhh Confidence 0 0000 000000 00111111111111222334444443322 1234666777653 45678 Q ss_pred HHHHHHHHhhcCcEEEEEccCccccccccccCCHHHHHHHhhhhccccccccccCcceEEEEehhhcccccccCceeEec Q lcl|NC_012740. 396 QKHAVSIGDERQDCLVMVSPPRSTVVNIPVTTAIDNLIAWREGNSNYSDNNMNINTTYAVIDGNYKYQYDKYNDVNRWVP 475 (667) Q Consensus 396 ~~~~~~~~~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~v~d~~~~~~~~~p 475 (667) +.+|.++|++++ |++++|.|. .++.+++++|++..... ..+++|+|+++||||++++|+.++..+++| T Consensus 218 ~~~l~~~~~~~~-~~~~~d~p~--------~~~~~~~~~~~~~~~~~---~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p 285 (477) T protein:vir:10 218 SVELEAMAVQLG-AIAYIDAPI--------GTTLAQALAGRGPAGTI---NFNTSSDRVRLCYPHVKVYDTATNAERLEP 285 (477) T ss_pred HHHHHHHHhhCC-EEEEEecCC--------CCCHHHHHhhhhhcccc---ccccccceEEEEcCeEEEecccCCceeEEc Confidence 999999999874 889999874 45678899999865443 446889999999999999999999999999 Q ss_pred hHHHHHHHHHHhhhcCCceeeecceeccceeccc---cccccCChhhhhhhhhcCceEEEEecCCeEEEEcceecCCC-- Q lcl|NC_012740. 476 LAADIAGLCARTDAVSQPWMSPAGYNRGQIMNVV---KLAIEPRKAHRDRLYQAAINPVIGAGGEGFILMGDKTATTV-- 550 (667) Q Consensus 476 ~sg~vAg~~a~~d~~~g~~~span~~~~~i~g~~---~~~~~~~~~e~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~~~-- 550 (667) ||+++||++||+|.++||||||+|+++.+|.++. ......++.|++.||++|||+|++|+++|+++||+||++.+ T Consensus 286 ~s~~~ag~~a~~d~~~g~~~span~~~~gi~~~~~~~~~~~~~~~~~~~~L~~~gi~~i~~~~~~G~~~wG~rT~~~~~~ 365 (477) T protein:vir:10 286 LSSRAAGLRARVDLDKGYWWSSSNQQLVGVTGVERPLSAMIDDPQSDVNMLNEQGITTVFSSYGSGLRLWGNRTAAWPTV 365 (477) T ss_pred hHHHHHHHHHHhhhcCCceeccCCceeccccccccccccccCCChhhHHHHhhCCceEEEEecCCcEEEEcccccCCCCC Confidence 9999999999999999999999999988787763 23334466799999999999999999999999999999643 Q ss_pred cccceeeehhhhhHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhcCCeeeeEEEEcccCCCHHHhhCCeEE Q lcl|NC_012740. 551 PSPFDRINVRRLFNMLKKNIGDSSKYKLFENNDNFTRASFRMEVSQYLSTIRSLGGIYDFRVQCDTTNNTPDVIDRNEFV 630 (667) Q Consensus 551 ~~~~~~i~vrR~~~~i~~~l~~~~~~~v~epn~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~ 630 (667) ++.|+||+|||+|++|+++|++.++|+|||||++.+|++|+++|++||++||++|+|.||+|+||+++||++||++|+|+ T Consensus 366 ~~~~~~~~vrR~~~~i~~~~~~~~~~~v~~~~~~~~~~~i~~~i~~~l~~l~~~g~l~g~~v~~~~~~nt~~~i~~G~~~ 445 (477) T protein:vir:10 366 THMRNFENVRRTGDVINESLRYFSQQFVDAPIDQGLIDSLVESVNGFGRKLIGDGALLGFKAWFDPARNPKEELAAGHLL 445 (477) T ss_pred CcccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHhhCCeEE Confidence 35799999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEEEecCCceEEEEEEEEeecCeeHHHHHHHH Q lcl|NC_012740. 631 ASMFIKPAKSINYIMLNFTAVATGADFDEIIGPA 664 (667) Q Consensus 631 ~~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~~~~ 664 (667) ++|+++|++|+|||+|++++.... |+++.+-= T Consensus 446 ~~i~~~p~~p~e~i~~~~~~~~~~--~~~~~~g~ 477 (477) T protein:vir:10 446 INYKYTVPPPLERLTYETEITSEY--LLTLKGGN 477 (477) T ss_pred EEEEEEecCCcceEEEEEEEcchH--HhhhhcCC Confidence 999999999999999999987766 44443322 No 18 >protein:vir:98824 Length: 774 # NCBI annotation: putative phage tail sheath protein # Family: family:all:661 # MgeID: mge:1530 # MgeName: Ma-LMM01 # Cross-refs: genbank:acc:YP_851105;genbank:gi:117530262;genbank:GeneID:4484488 Probab=100.00 E-value=4.8e-105 Score=592.47 Aligned_cols=467 Identities=15% Similarity=0.132 Sum_probs=319.7 Q ss_pred CceecCceEEEEecCCCccccc--CCCceEEEeeccCCCCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHHcCCCeE Q lcl|NC_012740. 1 MTLLSPGFETKETTLSTTIVQS--ATGRAALVGKFQWGPAFQIVQVTNEVELVNKFGQPDNNTADYFMSGANFLQYGNDL 78 (667) Q Consensus 1 ~~~~~PGVyvee~~~~~~~~~~--~ts~~afvG~~~~Gp~~~p~~i~s~~e~~~~FG~~~~~~~~~~~v~~~f~ngG~~~ 78 (667) ++|-.|||||||+|++++++++ +||++||||.++|||+|+|++|+||.||.+.||..... ++|+..| T Consensus 281 ~~v~~~GVYVEEVpSGvrtIeGGV~TSVAAFVG~A~rGPvn~PvlITS~aD~~~~Fg~~~GG-----------l~GassA 349 (774) T protein:vir:98 281 RNVEDNGVVIQLEPALTGSISNRFSFYVTANDNTANRGFTTSPALVTTIPDPAIHFTSFQGG-----------LDGPRSA 349 (774) T ss_pred EEEecCceEEEEeCCCCccccccccceeeeecccccCCCCCcCEEEeehhHhhhhhccccCC-----------cccccee Confidence 8899999999999999998854 69999999999999999999999999987777643211 1344333 Q ss_pred EEEEcCCcccccccccccccccceeeeccccccccceeeEeeeccceeccccceeecccccceeeeeccccccccccccc Q lcl|NC_012740. 79 RVVRVLNKEKAKNATALAGNVEFEITNEGSNYEVGDTIKIKHNRQDIETAGKVTKVDGDGKVKGVFIPTGKIIAHAKAIG 158 (667) Q Consensus 79 ~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~a~~~~ 158 (667) |.+..... T Consensus 350 ~r~~~~~s------------------------------------------------------------------------ 357 (774) T protein:vir:98 350 FRDFYTFN------------------------------------------------------------------------ 357 (774) T ss_pred eeeeeeec------------------------------------------------------------------------ Confidence 31110000 Q ss_pred ccccccccceEEEEEeecccccceeeeceeeeceeeeeeccccchhhhccccccccccccceeeeeeccccccceeEEEE Q lcl|NC_012740. 159 VYPELDGGWTAEFTSSSGNGSAALSVTKIVTDSGLLLTDLETSRANITNQDFLTKLKKYDMPAVSAIYAGEIGNSLEVEI 238 (667) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~t~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~A~~~G~~gn~i~v~i 238 (667) ......+.|..+|.|||.+++++ T Consensus 358 ---------------------------------------------------------G~~~L~i~A~~pGawGN~ItV~I 380 (774) T protein:vir:98 358 ---------------------------------------------------------GTPLLRLQAVSEGNWGNQVTVSI 380 (774) T ss_pred ---------------------------------------------------------ccceEEEEEeecCcCCCceEEEE Confidence 00112345667777777777776 Q ss_pred eecccccccceeeeeeeecccccccceeeeeccccccccceeeeec---cceeeeeEeee----ccCCcc----cccccc Q lcl|NC_012740. 239 LARSSFSGAVAPELTMYPFGGTRAAARNLIPYAPQNDNQYAFIVRR---DGVVVESYVLS----TLKGDK----DVYGNS 307 (667) Q Consensus 239 ~~~a~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~---~g~v~e~~~~s----~~~~~~----~~~~~~ 307 (667) .......... ........... . ...++.+++.... .+.+.|.+.-. ...... .+.... T Consensus 381 ~~~t~~~~~l----~v~~~~~s~f~--~-----~~a~e~~tv~~~~~~~~~~v~e~~dn~~i~~~~~~~~~~~in~vs~l 449 (774) T protein:vir:98 381 YPVNNSEFRL----NVQDLNGSAFN--P-----PLADEVYTVKLGDTNESGELNALLDSKFIRGFFLPKSIDSINYDAAL 449 (774) T ss_pred EecCCceeEE----EEEecCCcccc--c-----cccceeEEEecccccccceeeeeeceeeEeecccccccccccccccc Confidence 5432211111 11111100000 0 0000011111100 01111211100 000000 000000 Q ss_pred ccchhhhc----ccccceE-EEecccccCcccceEEecCCccccccccccccccccccchhHHHHHHhhhcccccccEEe Q lcl|NC_012740. 308 IYMDDFFA----RGSSQYI-YATAQGWVDGFSGIISLAGGVSANEASTGDRGNDPFIGAMMQGWDLFAERESIHVNLLIA 382 (667) Q Consensus 308 ~~~~~~~~----~~~s~~v-~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~ 382 (667) ........ ......- ..............+++++|.++.......+++.... .+...+.+|+. T Consensus 450 v~~~~~~~a~~d~~~~~~~~~~~~~~~~~~~~v~v~lagG~Dg~~tt~~~igg~~~~------------~~~tgi~aLl~ 517 (774) T protein:vir:98 450 VRQSPLRLAPPDESETDVENPAHVDFYGPNVLVDVTLENGYDGPPVTNDDYVSIIRT------------LENQPVHILLV 517 (774) T ss_pred cccchhcccccccccccccccccccccCCcceEEEeecCCCCcccccchheeccccc------------ccccceeEEEc Confidence 00000000 0000000 0000001111223356777777665544443332211 11223444444 Q ss_pred cCcCCcchhhHHHHHHHHHHHhhc----CcEEEEEccCccccccccccCCHHHHHHHhhhhccccccccccCcceEEEEe Q lcl|NC_012740. 383 GACAGEGDAFSTVQKHAVSIGDER----QDCLVMVSPPRSTVVNIPVTTAIDNLIAWREGNSNYSDNNMNINTTYAVIDG 458 (667) Q Consensus 383 ~~~~~~~~~~~~v~~~~~~~~~~~----~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~ 458 (667) +. ....++.+|+.+|+++ ++|++++|+|. +.+.+++++||+ +++|+|+++|| T Consensus 518 a~------~~~~V~~aii~~~e~~~~~~~~r~avid~p~--------g~t~~~Ai~~r~----------~f~S~~aal~~ 573 (774) T protein:vir:98 518 GT------TNVGVQQALITEAERASDSDGLRIAVLAAPP--------RTTPTLAASVTR----------GFNSTRAVMVA 573 (774) T ss_pred Cc------cchhhHHHHHHHHHHhhhcccceEEEEECCC--------CCCHHHHHHHHh----------ccCCceEEEEe Confidence 32 2344667777777654 88999999874 467889999996 47899999999 Q ss_pred hhhcccccccCceeEechHHHHHHHHHHhhhcCCceeeecceeccceeccc---cccccCChhhhhhhhhcCceEEE-Ee Q lcl|NC_012740. 459 NYKYQYDKYNDVNRWVPLAADIAGLCARTDAVSQPWMSPAGYNRGQIMNVV---KLAIEPRKAHRDRLYQAAINPVI-GA 534 (667) Q Consensus 459 p~~~v~d~~~~~~~~~p~sg~vAg~~a~~d~~~g~~~span~~~~~i~g~~---~~~~~~~~~e~~~Ln~~gIn~i~-~~ 534 (667) ||++++|+.+++.+++|||+++||++||+| ||+||+|+++.|+.|.. .+....++.|++.|++++||+++ .+ T Consensus 574 Pwvkv~D~~~g~~~~vPpSg~vAGl~ArtD----v~kSPANk~I~Givg~ai~~~l~~~~t~ae~d~Ln~~gIN~i~itt 649 (774) T protein:vir:98 574 GWFTYAGQPNSSRYGVPGAAVYAGKLAAID----FFVSPAARSLVGPLFNIIESDTDNYTSRSNQDIYSAARLEVLSLDT 649 (774) T ss_pred CcEEEeccCCCceeecChhHHHHHHHHhcC----cccccCCceeecceeccccccccccccchhhhhhcccccceeEEEE Confidence 999999999999999999999999999999 99999999987776642 24455678999999999999998 68 Q ss_pred cCCeEEEEcceecCCCcccceeeehhhhhHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhcCCeeeeE-EE Q lcl|NC_012740. 535 GGEGFILMGDKTATTVPSPFDRINVRRLFNMLKKNIGDSSKYKLFENNDNFTRASFRMEVSQYLSTIRSLGGIYDFR-VQ 613 (667) Q Consensus 535 ~~~G~~~wG~rT~~~~~~~~~~i~vrR~~~~i~~~l~~~~~~~v~epn~~~~~~~i~~~i~~~l~~l~~~gal~g~~-v~ 613 (667) +++|+++||+||+++|+ +||||++|||++||+++|++.++|+|||||++.+|++|+++++.||++||++|+|.||+ |+ T Consensus 650 ~g~G~rvWG~RTlssDp-~wr~InVRRlfd~Ie~SI~~~~~~~VfEPNd~~l~~~I~~sI~~fL~~L~~~GaL~G~~~V~ 728 (774) T protein:vir:98 650 VDRTYRFASGVTLSTDP-AWERIYLRRVHDVVRQGAHAILRNYVAMPNSRLVRNQIAAALNAFMGELKRNGNIVSFRPAI 728 (774) T ss_pred cCCcEEEEcccccCCCc-ccceEeehhhHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceecceEEE Confidence 89999999999998875 89999999999999999999999999999999999999999999999999999999997 89 Q ss_pred EcccCCCHHHhhCCeEEEEEEEEecCCceEEEEEEEEeecCeeHHH Q lcl|NC_012740. 614 CDTTNNTPDVIDRNEFVASMFIKPAKSINYIMLNFTAVATGADFDE 659 (667) Q Consensus 614 ~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~e 659 (667) ||+++||+++|++|+|+|+|+++|++|+|||+|||+|.+++.+|+| T Consensus 729 ~D~etNt~~dI~~G~l~i~I~vaP~~PAEfIilri~q~t~~~~l~E 774 (774) T protein:vir:98 729 IDGSNNSTAAYFSRELYVSLQFQPLYSADYIYVTISRDTETSPLGE 774 (774) T ss_pred EcCCCCCHHHhhCCEEEEEEEEEecCCcceEEEEEEEeecceeccC Confidence 9999999999999999999999999999999999999999999999 No 19 >protein:vir:103168 Length: 641 # NCBI annotation: gp122 # Family: family:all:661 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717789;genbank:gi:113200626;genbank:GeneID:4239177 Probab=100.00 E-value=1.3e-97 Score=551.76 Aligned_cols=531 Identities=24% Similarity=0.344 Sum_probs=326.7 Q ss_pred Cc-eecCceEEEEecCCCcccccCCCceEEEeeccCCCCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHHcCCCeEE Q lcl|NC_012740. 1 MT-LLSPGFETKETTLSTTIVQSATGRAALVGKFQWGPAFQIVQVTNEVELVNKFGQPDNNTADYFMSGANFLQYGNDLR 79 (667) Q Consensus 1 ~~-~~~PGVyvee~~~~~~~~~~~ts~~afvG~~~~Gp~~~p~~i~s~~e~~~~FG~~~~~~~~~~~v~~~f~ngG~~~~ 79 (667) |. |+||||||||+|++..+.+++|+++||||+|+|||+|+|++|+||.||+++||+|++.+|++|+|++||+|||++|| T Consensus 3 m~~~~sPGVyv~E~~~~~~i~~v~tsvaafvG~~~~GP~~~p~~v~s~~d~~~~FG~~~~~~~l~~av~~fF~ngG~~~~ 82 (641) T protein:vir:10 3 VSNQLSPGVVIQERDLTAVTTPIGLNVGVLAAPFTKGPVEEIFEVSTERDLASVFGEPNDYNYEYWFTASQFLSYGGVLK 82 (641) T ss_pred CccccCCceEEEEecCCCcccccCCccceEEecccCCCCCccEEecCHHHHHHHcCCcCCCcchHHHHHHHHHhcCCEEE Confidence 55 99999999999988655556799999999999999999999999999999999999999999999999999999999 Q ss_pred EEEcCCcccccccccccccc-----------------cceeeeccccccccceeeEeeeccceeccccce-eeccccc-- Q lcl|NC_012740. 80 VVRVLNKEKAKNATALAGNV-----------------EFEITNEGSNYEVGDTIKIKHNRQDIETAGKVT-KVDGDGK-- 139 (667) Q Consensus 80 vvRv~~~~~~~~a~~~~~~~-----------------~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~-~~d~~~~-- 139 (667) ||||.+.+. .+++...... ..........+.||+.+.+............+. ....... T Consensus 83 vvRv~~~~~-~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~A~~pG~~gn~l~v~i~~~~~~~~~~~~~~~~~~~~~~ 161 (641) T protein:vir:10 83 AIRLNAASL-KNSVDSGTAPLIKNLQEYETTYESSNSNTFKFASRDAGALGNSVGIFITDAGPDQIAVLPAPGTGNEWEF 161 (641) T ss_pred EEEecCccc-cccccccchhhccccccccccccCcCccccEEEeccCCCcCCceEEEEEcCCCcceeeeeccccccccee Confidence 999986542 2222111110 001112234567777776654322111000000 0000000 Q ss_pred -ceeeeecccccccccc---cc----ccccccc--ccceEEEEEee---------------------cccccceeeecee Q lcl|NC_012740. 140 -VKGVFIPTGKIIAHAK---AI----GVYPELD--GGWTAEFTSSS---------------------GNGSAALSVTKIV 188 (667) Q Consensus 140 -~~~~~~~~~~~~~~a~---~~----~~~~~~~--~~~~~~~~~~~---------------------~~~~~~~t~~~~v 188 (667) ....+........... .. ....... ......+.... ++........... T Consensus 162 ~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~~~~~~~~~~a~~~~~~i~~~~~g~~g~~~~~~~~ 241 (641) T protein:vir:10 162 VADEAVTAASGASAKVYRYSIRLTLTNVVGTFVPGGATTISISGSDESVDVLAWDAGNKYLEIALPAGGVTGIFADAQVV 241 (641) T ss_pred ccceeeeeccCcccccccccccccccceeeecccCCcceeEecccccccccccccCCcceeeeeecCCcceeeeeeeeec Confidence 0000000000000000 00 0000000 00000000000 0000000000000 Q ss_pred eeceeeeeeccccchhhhccccccccccccceeeeeeccccccceeEEEEeeccccccc-ceeeeeeeecccccc-ccee Q lcl|NC_012740. 189 TDSGLLLTDLETSRANITNQDFLTKLKKYDMPAVSAIYAGEIGNSLEVEILARSSFSGA-VAPELTMYPFGGTRA-AARN 266 (667) Q Consensus 189 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~A~~~G~~gn~i~v~i~~~a~~~~~-~~~~~t~~~~~~~~~-~~~~ 266 (667) ..... ..+........ ..........+.+.+...+.+++...+.+....++... .......+....+.. .... T Consensus 242 t~gt~----~~t~a~~g~~~-~~~~~~~~~~ia~aat~ag~~g~~~~v~~~~v~d~~~a~~~~~g~~~~~va~~~gts~~ 316 (641) T protein:vir:10 242 TQGTN----TAAIASSGIER-RLYIGKDSGSINFAATDAVVDTNATSATISSVRNEYAEREYLPGSKWVNVAARPGTSLY 316 (641) T ss_pred cCCcc----ceeeecccchh-hhhhccccccceeeeecccccccceeeEeeeeeeeecccccccccccccccccchhhhh Confidence 00000 00000000000 00111123344556667777777666655443332211 111111111111111 1122 Q ss_pred eeeccccccccceeeee-------ccceeeeeEe-eeccCCccccccccccchhhhcccccceEEEecccccC------- Q lcl|NC_012740. 267 LIPYAPQNDNQYAFIVR-------RDGVVVESYV-LSTLKGDKDVYGNSIYMDDFFARGSSQYIYATAQGWVD------- 331 (667) Q Consensus 267 ~~~~~~~~~~~~~~~v~-------~~g~v~e~~~-~s~~~~~~~~~~~~~~~~~~~~~~~s~~v~~~~~~~~~------- 331 (667) .....++.++.+.+++. .+|+++|+|. +++..++++..++..++.+++. ..|+++++....... T Consensus 317 a~~~g~~~D~~~~lv~d~~~~~~g~~g~v~e~~~~~s~~~~~~~~~~~~~~~~~~~~-~~s~~v~~~~~~~~~~~~~~~~ 395 (641) T protein:vir:10 317 ANSVGGVNDELHVLVIDVDGKITGNPGSVLERFIGVSKASDAKTSIGEVNYYKEVIK-QQSAYVYWGSHETAPFLGTAAN 395 (641) T ss_pred hhhcCCcccceEEEEEeecceeeccccceeeeeecccccCCcccccccceeeeeeec-cccceEEEeccccccccccccc Confidence 22334455555555443 4567899997 7888888888888888888874 468887653221100 Q ss_pred --------------------------------------cccceEEecCCccccccccccccccccccchhHHHHHHhhhc Q lcl|NC_012740. 332 --------------------------------------GFSGIISLAGGVSANEASTGDRGNDPFIGAMMQGWDLFAERE 373 (667) Q Consensus 332 --------------------------------------~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 373 (667) .......|.+|.++....... .....+..+++.++...+ T Consensus 396 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~gG~d~~~~~~~~---~~~~~~~~tg~~~~~~~e 472 (641) T protein:vir:10 396 AAAGDWGASALNRRYNLLRSTAGTTSFPAGAVTVGSKNNATHYYRLANGADYSASGALY---NLSNVDIATAYELIEDPE 472 (641) T ss_pred ccccccccccccccccccccccccccccccccccCCCCcceeEEEeecCcccccccccc---cccchhHHHHHHHhhhhh Confidence 001112344444433322111 122345678999999999 Q ss_pred ccccccEEecCcCCcchhhHHHHHHHHHHHhhcCcEEEEEccCccccccccc-cCCHHHHHHHhhhhccccccccccCcc Q lcl|NC_012740. 374 SIHVNLLIAGACAGEGDAFSTVQKHAVSIGDERQDCLVMVSPPRSTVVNIPV-TTAIDNLIAWREGNSNYSDNNMNINTT 452 (667) Q Consensus 374 ~~~~~~l~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~ai~d~p~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~s~ 452 (667) .+++++||+++...+.....+++.++++|||+||+||+|+|+|+...++.+. ....+++++||+. +++|+ T Consensus 473 ~~~i~~l~~~~~~~~~~~~~~v~~~~i~~ce~~~d~~ailD~p~~~~~~~~~~~~~~~~~~~~~~~---------~~~s~ 543 (641) T protein:vir:10 473 SQVIDYVLSGPAGADEAAAIAKATTITTIVESRKDCMAFLSPLRSDVIGVSNTTTVTENLVNYFNQ---------LPSSN 543 (641) T ss_pred hhccceeeecCCCCCcchhHHHHHHHHHHHHhcCCEEEEEcCCcccccCCCchhhHHHHHHHHHhh---------cCCCc Confidence 9999999999988888888999999999999999999999999887766544 3456888899864 46899 Q ss_pred eEEEEehhhcccccccCceeEechHHHHHHHHHHhhhcCCceeeecceeccceeccccccccCChhhhhhhhhcCceEEE Q lcl|NC_012740. 453 YAVIDGNYKYQYDKYNDVNRWVPLAADIAGLCARTDAVSQPWMSPAGYNRGQIMNVVKLAIEPRKAHRDRLYQAAINPVI 532 (667) Q Consensus 453 ~~~~~~p~~~v~d~~~~~~~~~p~sg~vAg~~a~~d~~~g~~~span~~~~~i~g~~~~~~~~~~~e~~~Ln~~gIn~i~ 532 (667) |+++||||++++||.+++.+++||||+|||+|||+|.+|||||||||.+++.|+|++++++.+++.||+.||++|||||| T Consensus 544 yaa~y~P~~~v~dp~~~~~~~vPpsG~iAGv~ArtD~~rGvwkAPAn~~~~~i~gv~~l~~~~~~~e~~~Lnp~gIN~ir 623 (641) T protein:vir:10 544 YVVFDSGYKYIYDKYNDVYRYVPCNGDIAGLILETGLEEEPWFSPAGFQRGVLRNAVKLAYSPNKTQRDRLYANRINPVV 623 (641) T ss_pred eEEEEeceeEeecccCCceeEecCCHHHHHHHHhhhccCCceECcCCcccceeeeeeeeeEecChhHHhhhhhcccceEE Confidence 99999999999999999999999999999999999999999999999998889999999999999999999999999999 Q ss_pred EecCCeEEEEcceecCCCcccc Q lcl|NC_012740. 533 GAGGEGFILMGDKTATTVPSPF 554 (667) Q Consensus 533 ~~~~~G~~~wG~rT~~~~~~~~ 554 (667) .|||+|++- +.-.- .. .. T Consensus 624 ~fpg~G~v~--~~~~~-~~-~~ 641 (641) T protein:vir:10 624 SFPGHAMIN--NNIAF-HT-KL 641 (641) T ss_pred ecCCceeec--ceeee-ee-cC Confidence 999999863 11100 00 00 No 20 >protein:vir:103993 Length: 390 # NCBI annotation: phage tail sheath protein # Family: family:all:115 # MgeID: mge:1665 # MgeName: phi52237 # Cross-refs: genbank:acc:YP_293729;genbank:gi:72537699;genbank:GeneID:3608123 Probab=100.00 E-value=4.7e-95 Score=537.73 Aligned_cols=380 Identities=15% Similarity=0.117 Sum_probs=307.3 Q ss_pred CceecCceEEEEecCCCccccc-CCCceEEEeeccCC-----CCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHHcC Q lcl|NC_012740. 1 MTLLSPGFETKETTLSTTIVQS-ATGRAALVGKFQWG-----PAFQIVQVTNEVELVNKFGQPDNNTADYFMSGANFLQY 74 (667) Q Consensus 1 ~~~~~PGVyvee~~~~~~~~~~-~ts~~afvG~~~~G-----p~~~p~~i~s~~e~~~~FG~~~~~~~~~~~v~~~f~ng 74 (667) ++|.+|||||+|++.+++++.. .|++++|||+++++ |+|+|++|+|+.+|...||. ..++.+++..+|.|| T Consensus 2 ~~~~~~Gv~v~e~~~g~~~i~~~~tav~g~vg~a~~ad~~~~pln~pv~i~s~~~~~~~~g~---~gtL~~al~~~~~~g 78 (390) T protein:vir:10 2 PQDYHHGVRVIEINEGGRPIRSVSTAVLGVVCTAADADASAFPLNTPVLLTNVVAALGKAGK---KGTLRRTLDAIGKQT 78 (390) T ss_pred cccccCCeEEEEcCCCcccccccCcceeEEEEcccCcCccccccccceEeccHHHHHhhcCC---Cceehhhhhhhcccc Confidence 7889999999999999888765 69999999999875 99999999999999999996 678899999999999 Q ss_pred CCeEEEEEcCCcccccccccccccccceeeeccccccccceeeEeeeccceeccccceeecccccceeeeeccccccccc Q lcl|NC_012740. 75 GNDLRVVRVLNKEKAKNATALAGNVEFEITNEGSNYEVGDTIKIKHNRQDIETAGKVTKVDGDGKVKGVFIPTGKIIAHA 154 (667) Q Consensus 75 G~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~a 154 (667) |..|||||+....+...... T Consensus 79 g~~~~vv~v~~~~~~~~~~~------------------------------------------------------------ 98 (390) T protein:vir:10 79 KPLTVVVRVAEGKDADETTS------------------------------------------------------------ 98 (390) T ss_pred CceEEEEEeccccccccccc------------------------------------------------------------ Confidence 99999999854322100000 Q ss_pred ccccccccccccceEEEEEeecccccceeeeceeeeceeeeeeccccchhhhccccccccccccceeeeeecccccccee Q lcl|NC_012740. 155 KAIGVYPELDGGWTAEFTSSSGNGSAALSVTKIVTDSGLLLTDLETSRANITNQDFLTKLKKYDMPAVSAIYAGEIGNSL 234 (667) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~A~~~G~~gn~i 234 (667) ..++... . . +...|. T Consensus 99 ~~ig~~~------------------~--------------------------------~----------~~~tg~----- 113 (390) T protein:vir:10 99 NVIGTVT------------------P--------------------------------D----------GKYTGI----- 113 (390) T ss_pred ccccccc------------------c--------------------------------c----------cccchh----- Confidence 0000000 0 0 000000 Q ss_pred EEEEeecccccccceeeeeeeecccccccceeeeeccccccccceeeeeccceeeeeEeeeccCCccccccccccchhhh Q lcl|NC_012740. 235 EVEILARSSFSGAVAPELTMYPFGGTRAAARNLIPYAPQNDNQYAFIVRRDGVVVESYVLSTLKGDKDVYGNSIYMDDFF 314 (667) Q Consensus 235 ~v~i~~~a~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~v~e~~~~s~~~~~~~~~~~~~~~~~~~ 314 (667) + .+ T Consensus 114 -------------------------------------------------------~----------------------al 116 (390) T protein:vir:10 114 -------------------------------------------------------K----------------------AL 116 (390) T ss_pred -------------------------------------------------------h----------------------hh Confidence 0 00 Q ss_pred cccccceEEEecccccCcccceEEecCCccccccccccccccccccchhHHHHHHhhhcccccccEEecCcCCcchhhHH Q lcl|NC_012740. 315 ARGSSQYIYATAQGWVDGFSGIISLAGGVSANEASTGDRGNDPFIGAMMQGWDLFAERESIHVNLLIAGACAGEGDAFST 394 (667) Q Consensus 315 ~~~~s~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~ 394 (667) ..... .....++++++|+.. ..+ T Consensus 117 ~~~~~----------------------------------------------------~~~~~p~il~ap~~~-----~~~ 139 (390) T protein:vir:10 117 LAAQG----------------------------------------------------ALGVKPRILAAPGLD-----TQP 139 (390) T ss_pred hhhhh----------------------------------------------------hhcceehhhcccccc-----hHH Confidence 00000 000011223333332 245 Q ss_pred HHHHHHHHHhhcCcEEEEEccCccccccccccCCHHHHHHHhhhhccccccccccCcceEEEEehhhcccccccCceeEe Q lcl|NC_012740. 395 VQKHAVSIGDERQDCLVMVSPPRSTVVNIPVTTAIDNLIAWREGNSNYSDNNMNINTTYAVIDGNYKYQYDKYNDVNRWV 474 (667) Q Consensus 395 v~~~~~~~~~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~v~d~~~~~~~~~ 474 (667) |+.+|+.+|++++ +++++|.|. ..+.+++++||+. ++|+|+++||||++++|+.++..+++ T Consensus 140 v~~~l~~~a~~~~-~~aivD~p~--------~~t~~~a~~~~~~----------~~s~~~~~~~p~~~~~d~~~~~~~~~ 200 (390) T protein:vir:10 140 VAAALAATAQSLR-AMAYVSASG--------CKTKEEAAAYRKQ----------FGQREIMVIWPDWLGWDDTTNSTAVI 200 (390) T ss_pred HHHHHHHhhcccc-eEEEEecCC--------CCCHHHHHHHhhc----------cCCceEEEEcCceEeecccCCccccc Confidence 8899999999876 678888763 4678899999963 67999999999999999999999999 Q ss_pred chHHHHHHHHHHhhhcCCceeeecceeccceeccc-cccc--cCChhhhhhhhhcCceEEEEecCCeEEEEcceecCCCc Q lcl|NC_012740. 475 PLAADIAGLCARTDAVSQPWMSPAGYNRGQIMNVV-KLAI--EPRKAHRDRLYQAAINPVIGAGGEGFILMGDKTATTVP 551 (667) Q Consensus 475 p~sg~vAg~~a~~d~~~g~~~span~~~~~i~g~~-~~~~--~~~~~e~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~~~~ 551 (667) |||+++||+++|+|.++|||+||+|+.+.++.++. .+.+ .....|.+.||++||+++++ ++|+++||+||++.|+ T Consensus 201 p~s~~~Agl~a~~D~~~g~~~spaN~~l~gi~~~~~~~~~~~~~~~~~~~~ln~~gi~t~~~--~~G~~~wG~rT~s~d~ 278 (390) T protein:vir:10 201 PAPAIAAGLRAKIDNDIGWHKTISNVVVNGVSGISADVSWDLQDPATDAGYLNEHEVTTLVN--RNGFRFWGERTCSDDP 278 (390) T ss_pred chHHHHHHHHHHhhcCCCcEECcCCceeeceeecceecccccccccchhhhhhhcCcEEEEc--CCCEEEEcccccCCCc Confidence 99999999999999999999999999987776642 2232 33456788999999999864 6899999999998875 Q ss_pred ccceeeehhhhhHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhcCCeeeeEEEEcccCCCHHHhhCCeEEE Q lcl|NC_012740. 552 SPFDRINVRRLFNMLKKNIGDSSKYKLFENNDNFTRASFRMEVSQYLSTIRSLGGIYDFRVQCDTTNNTPDVIDRNEFVA 631 (667) Q Consensus 552 ~~~~~i~vrR~~~~i~~~l~~~~~~~v~epn~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~ 631 (667) +|+||++|||++||+++|++.++|+|||||++.+|.+|++++++||++||++|+|.||+|+||+++||+++|++|+|++ T Consensus 279 -~~~~i~~rR~~~~i~~~i~~~~~~~v~e~n~~~~~~~i~~~i~~~L~~l~~~g~l~g~~v~~d~~~nt~~~i~~G~~~~ 357 (390) T protein:vir:10 279 -KFAFENYTRTAQVAGDSIAEAQMPVVDGPLNPSLARDIVESINGWFRQQVANGYLIGGSAWIDPEPNTADILASGKAYI 357 (390) T ss_pred -ccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEccCCCCHHHhhCCeEEE Confidence 8999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEEecCCceEEEEEEEEeecCeeHHHHHHHHhc Q lcl|NC_012740. 632 SMFIKPAKSINYIMLNFTAVATGADFDEIIGPANQ 666 (667) Q Consensus 632 ~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~~~~~~ 666 (667) +|+++|++|+|||+|++++.... ++++++.|++ T Consensus 358 ~v~~~p~~pae~I~~~~~~~~~~--~~~~~~~~~~ 390 (390) T protein:vir:10 358 DYDYTPVPPLENLVLRQRITDRF--LADFPARVAG 390 (390) T ss_pred EEEEEecCCcceEEEEEEEchHH--HHHHHHHhcC Confidence 99999999999999999987766 8999999999 No 21 >protein:vir:78206 Length: 390 # NCBI annotation: gp33, phage tail sheath protein # Family: family:all:115 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111183;genbank:gi:134288694;genbank:GeneID:4960666 Probab=100.00 E-value=4.7e-95 Score=537.73 Aligned_cols=380 Identities=15% Similarity=0.117 Sum_probs=307.3 Q ss_pred CceecCceEEEEecCCCccccc-CCCceEEEeeccCC-----CCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHHcC Q lcl|NC_012740. 1 MTLLSPGFETKETTLSTTIVQS-ATGRAALVGKFQWG-----PAFQIVQVTNEVELVNKFGQPDNNTADYFMSGANFLQY 74 (667) Q Consensus 1 ~~~~~PGVyvee~~~~~~~~~~-~ts~~afvG~~~~G-----p~~~p~~i~s~~e~~~~FG~~~~~~~~~~~v~~~f~ng 74 (667) ++|.+|||||+|++.+++++.. .|++++|||+++++ |+|+|++|+|+.+|...||. ..++.+++..+|.|| T Consensus 2 ~~~~~~Gv~v~e~~~g~~~i~~~~tav~g~vg~a~~ad~~~~pln~pv~i~s~~~~~~~~g~---~gtL~~al~~~~~~g 78 (390) T protein:vir:78 2 PQDYHHGVRVIEINEGGRPIRSVSTAVLGVVCTAADADASAFPLNTPVLLTNVVAALGKAGK---KGTLRRTLDAIGKQT 78 (390) T ss_pred cccccCCeEEEEcCCCcccccccCcceeEEEEcccCcCccccccccceEeccHHHHHhhcCC---Cceehhhhhhhcccc Confidence 7889999999999999888765 69999999999875 99999999999999999996 678899999999999 Q ss_pred CCeEEEEEcCCcccccccccccccccceeeeccccccccceeeEeeeccceeccccceeecccccceeeeeccccccccc Q lcl|NC_012740. 75 GNDLRVVRVLNKEKAKNATALAGNVEFEITNEGSNYEVGDTIKIKHNRQDIETAGKVTKVDGDGKVKGVFIPTGKIIAHA 154 (667) Q Consensus 75 G~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~a 154 (667) |..|||||+....+...... T Consensus 79 g~~~~vv~v~~~~~~~~~~~------------------------------------------------------------ 98 (390) T protein:vir:78 79 KPLTVVVRVAEGKDADETTS------------------------------------------------------------ 98 (390) T ss_pred CceEEEEEeccccccccccc------------------------------------------------------------ Confidence 99999999854322100000 Q ss_pred ccccccccccccceEEEEEeecccccceeeeceeeeceeeeeeccccchhhhccccccccccccceeeeeecccccccee Q lcl|NC_012740. 155 KAIGVYPELDGGWTAEFTSSSGNGSAALSVTKIVTDSGLLLTDLETSRANITNQDFLTKLKKYDMPAVSAIYAGEIGNSL 234 (667) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~A~~~G~~gn~i 234 (667) ..++... . . +...|. T Consensus 99 ~~ig~~~------------------~--------------------------------~----------~~~tg~----- 113 (390) T protein:vir:78 99 NVIGTVT------------------P--------------------------------D----------GKYTGI----- 113 (390) T ss_pred ccccccc------------------c--------------------------------c----------cccchh----- Confidence 0000000 0 0 000000 Q ss_pred EEEEeecccccccceeeeeeeecccccccceeeeeccccccccceeeeeccceeeeeEeeeccCCccccccccccchhhh Q lcl|NC_012740. 235 EVEILARSSFSGAVAPELTMYPFGGTRAAARNLIPYAPQNDNQYAFIVRRDGVVVESYVLSTLKGDKDVYGNSIYMDDFF 314 (667) Q Consensus 235 ~v~i~~~a~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~v~e~~~~s~~~~~~~~~~~~~~~~~~~ 314 (667) + .+ T Consensus 114 -------------------------------------------------------~----------------------al 116 (390) T protein:vir:78 114 -------------------------------------------------------K----------------------AL 116 (390) T ss_pred -------------------------------------------------------h----------------------hh Confidence 0 00 Q ss_pred cccccceEEEecccccCcccceEEecCCccccccccccccccccccchhHHHHHHhhhcccccccEEecCcCCcchhhHH Q lcl|NC_012740. 315 ARGSSQYIYATAQGWVDGFSGIISLAGGVSANEASTGDRGNDPFIGAMMQGWDLFAERESIHVNLLIAGACAGEGDAFST 394 (667) Q Consensus 315 ~~~~s~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~ 394 (667) ..... .....++++++|+.. ..+ T Consensus 117 ~~~~~----------------------------------------------------~~~~~p~il~ap~~~-----~~~ 139 (390) T protein:vir:78 117 LAAQG----------------------------------------------------ALGVKPRILAAPGLD-----TQP 139 (390) T ss_pred hhhhh----------------------------------------------------hhcceehhhcccccc-----hHH Confidence 00000 000011223333332 245 Q ss_pred HHHHHHHHHhhcCcEEEEEccCccccccccccCCHHHHHHHhhhhccccccccccCcceEEEEehhhcccccccCceeEe Q lcl|NC_012740. 395 VQKHAVSIGDERQDCLVMVSPPRSTVVNIPVTTAIDNLIAWREGNSNYSDNNMNINTTYAVIDGNYKYQYDKYNDVNRWV 474 (667) Q Consensus 395 v~~~~~~~~~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~v~d~~~~~~~~~ 474 (667) |+.+|+.+|++++ +++++|.|. ..+.+++++||+. ++|+|+++||||++++|+.++..+++ T Consensus 140 v~~~l~~~a~~~~-~~aivD~p~--------~~t~~~a~~~~~~----------~~s~~~~~~~p~~~~~d~~~~~~~~~ 200 (390) T protein:vir:78 140 VAAALAATAQSLR-AMAYVSASG--------CKTKEEAAAYRKQ----------FGQREIMVIWPDWLGWDDTTNSTAVI 200 (390) T ss_pred HHHHHHHhhcccc-eEEEEecCC--------CCCHHHHHHHhhc----------cCCceEEEEcCceEeecccCCccccc Confidence 8899999999876 678888763 4678899999963 67999999999999999999999999 Q ss_pred chHHHHHHHHHHhhhcCCceeeecceeccceeccc-cccc--cCChhhhhhhhhcCceEEEEecCCeEEEEcceecCCCc Q lcl|NC_012740. 475 PLAADIAGLCARTDAVSQPWMSPAGYNRGQIMNVV-KLAI--EPRKAHRDRLYQAAINPVIGAGGEGFILMGDKTATTVP 551 (667) Q Consensus 475 p~sg~vAg~~a~~d~~~g~~~span~~~~~i~g~~-~~~~--~~~~~e~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~~~~ 551 (667) |||+++||+++|+|.++|||+||+|+.+.++.++. .+.+ .....|.+.||++||+++++ ++|+++||+||++.|+ T Consensus 201 p~s~~~Agl~a~~D~~~g~~~spaN~~l~gi~~~~~~~~~~~~~~~~~~~~ln~~gi~t~~~--~~G~~~wG~rT~s~d~ 278 (390) T protein:vir:78 201 PAPAIAAGLRAKIDNDIGWHKTISNVVVNGVSGISADVSWDLQDPATDAGYLNEHEVTTLVN--RNGFRFWGERTCSDDP 278 (390) T ss_pred chHHHHHHHHHHhhcCCCcEECcCCceeeceeecceecccccccccchhhhhhhcCcEEEEc--CCCEEEEcccccCCCc Confidence 99999999999999999999999999987776642 2232 33456788999999999864 6899999999998875 Q ss_pred ccceeeehhhhhHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhcCCeeeeEEEEcccCCCHHHhhCCeEEE Q lcl|NC_012740. 552 SPFDRINVRRLFNMLKKNIGDSSKYKLFENNDNFTRASFRMEVSQYLSTIRSLGGIYDFRVQCDTTNNTPDVIDRNEFVA 631 (667) Q Consensus 552 ~~~~~i~vrR~~~~i~~~l~~~~~~~v~epn~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~ 631 (667) +|+||++|||++||+++|++.++|+|||||++.+|.+|++++++||++||++|+|.||+|+||+++||+++|++|+|++ T Consensus 279 -~~~~i~~rR~~~~i~~~i~~~~~~~v~e~n~~~~~~~i~~~i~~~L~~l~~~g~l~g~~v~~d~~~nt~~~i~~G~~~~ 357 (390) T protein:vir:78 279 -KFAFENYTRTAQVAGDSIAEAQMPVVDGPLNPSLARDIVESINGWFRQQVANGYLIGGSAWIDPEPNTADILASGKAYI 357 (390) T ss_pred -ccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEccCCCCHHHhhCCeEEE Confidence 8999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEEecCCceEEEEEEEEeecCeeHHHHHHHHhc Q lcl|NC_012740. 632 SMFIKPAKSINYIMLNFTAVATGADFDEIIGPANQ 666 (667) Q Consensus 632 ~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~~~~~~ 666 (667) +|+++|++|+|||+|++++.... ++++++.|++ T Consensus 358 ~v~~~p~~pae~I~~~~~~~~~~--~~~~~~~~~~ 390 (390) T protein:vir:78 358 DYDYTPVPPLENLVLRQRITDRF--LADFPARVAG 390 (390) T ss_pred EEEEEecCCcceEEEEEEEchHH--HHHHHHHhcC Confidence 99999999999999999987766 8999999999 No 22 >protein:vir:79181 Length: 390 # NCBI annotation: gp30, phage tail sheath protein # Family: family:all:115 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111061;genbank:gi:134288772;genbank:GeneID:4960700 Probab=100.00 E-value=1.2e-94 Score=535.40 Aligned_cols=380 Identities=15% Similarity=0.121 Sum_probs=306.6 Q ss_pred CceecCceEEEEecCCCccccc-CCCceEEEeeccCC-----CCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHHcC Q lcl|NC_012740. 1 MTLLSPGFETKETTLSTTIVQS-ATGRAALVGKFQWG-----PAFQIVQVTNEVELVNKFGQPDNNTADYFMSGANFLQY 74 (667) Q Consensus 1 ~~~~~PGVyvee~~~~~~~~~~-~ts~~afvG~~~~G-----p~~~p~~i~s~~e~~~~FG~~~~~~~~~~~v~~~f~ng 74 (667) |+|++|||||+|++.+++++.. +|++++|+|+++++ |+|+|++|+|+.+|.+.||. ..++.+++..||.|| T Consensus 2 ~~~~~~Gv~v~e~~~~~~~i~~~~tav~~~vg~a~dad~~~~p~n~pv~its~~~~~~~~g~---~~tL~~al~~~~~~~ 78 (390) T protein:vir:79 2 PQDYHHGVRVIEINEGGRPIRSVSTAVLGVVCTAADADASAFPLNTPVLLTNVVAALGKAGK---KGTLRRTLDAIGKQT 78 (390) T ss_pred ccccCCCeEEEEcCCCcccccccCCceeEEEEecCCCCccccccccceEeecHHHHHHhcCC---Cccchhhhhhhcccc Confidence 8899999999999999987766 69999999999876 89999999999999999986 567888999999999 Q ss_pred CCeEEEEEcCCcccccccccccccccceeeeccccccccceeeEeeeccceeccccceeecccccceeeeeccccccccc Q lcl|NC_012740. 75 GNDLRVVRVLNKEKAKNATALAGNVEFEITNEGSNYEVGDTIKIKHNRQDIETAGKVTKVDGDGKVKGVFIPTGKIIAHA 154 (667) Q Consensus 75 G~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~a 154 (667) |+.|||||+..+.+..... . T Consensus 79 ~~~~~vv~v~~~~~~~~~~----------------------~-------------------------------------- 98 (390) T protein:vir:79 79 KPLTVVVRVAEGKDADETT----------------------S-------------------------------------- 98 (390) T ss_pred cceEEEEeecccccccccc----------------------c-------------------------------------- Confidence 9999999985432110000 0 Q ss_pred ccccccccccccceEEEEEeecccccceeeeceeeeceeeeeeccccchhhhccccccccccccceeeeeecccccccee Q lcl|NC_012740. 155 KAIGVYPELDGGWTAEFTSSSGNGSAALSVTKIVTDSGLLLTDLETSRANITNQDFLTKLKKYDMPAVSAIYAGEIGNSL 234 (667) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~A~~~G~~gn~i 234 (667) ..++.. . . .+...| T Consensus 99 ~~ig~~----------------------------~-----------------------~---------~~~~tg------ 112 (390) T protein:vir:79 99 NVIGTV----------------------------T-----------------------P---------DGKYTG------ 112 (390) T ss_pred eeeecc----------------------------c-----------------------c---------cccchh------ Confidence 000000 0 0 000000 Q ss_pred EEEEeecccccccceeeeeeeecccccccceeeeeccccccccceeeeeccceeeeeEeeeccCCccccccccccchhhh Q lcl|NC_012740. 235 EVEILARSSFSGAVAPELTMYPFGGTRAAARNLIPYAPQNDNQYAFIVRRDGVVVESYVLSTLKGDKDVYGNSIYMDDFF 314 (667) Q Consensus 235 ~v~i~~~a~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~v~e~~~~s~~~~~~~~~~~~~~~~~~~ 314 (667) + + .+ T Consensus 113 ---------------------------------------------l---------~----------------------al 116 (390) T protein:vir:79 113 ---------------------------------------------I---------K----------------------AL 116 (390) T ss_pred ---------------------------------------------h---------h----------------------hh Confidence 0 0 00 Q ss_pred cccccceEEEecccccCcccceEEecCCccccccccccccccccccchhHHHHHHhhhcccccccEEecCcCCcchhhHH Q lcl|NC_012740. 315 ARGSSQYIYATAQGWVDGFSGIISLAGGVSANEASTGDRGNDPFIGAMMQGWDLFAERESIHVNLLIAGACAGEGDAFST 394 (667) Q Consensus 315 ~~~~s~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~ 394 (667) .... ......++++++|+.. ..+ T Consensus 117 ~~~~----------------------------------------------------~~~~~~p~il~ap~~~-----~~~ 139 (390) T protein:vir:79 117 LAAQ----------------------------------------------------GALGVKPRILAAPGLD-----TQP 139 (390) T ss_pred hhhh----------------------------------------------------hhhccccccccCCccc-----chH Confidence 0000 0000123344444432 245 Q ss_pred HHHHHHHHHhhcCcEEEEEccCccccccccccCCHHHHHHHhhhhccccccccccCcceEEEEehhhcccccccCceeEe Q lcl|NC_012740. 395 VQKHAVSIGDERQDCLVMVSPPRSTVVNIPVTTAIDNLIAWREGNSNYSDNNMNINTTYAVIDGNYKYQYDKYNDVNRWV 474 (667) Q Consensus 395 v~~~~~~~~~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~v~d~~~~~~~~~ 474 (667) ++.+|..+|++++ +++++|.|. ..+.+++.+||+ +++|+|+++||||++++|+.++..+++ T Consensus 140 v~~~l~~~a~~~~-~~ai~D~p~--------~~t~~~a~~~~~----------~~~s~~~~~~~p~~~~~d~~~~~~~~~ 200 (390) T protein:vir:79 140 VAAALAATAQSLR-AMAYVSASG--------CKTKEEAAAYRR----------QFGQREIMVIWPDWLGWDDTTNSTAVI 200 (390) T ss_pred HHHHHHHhhhhcc-eEEEEEccC--------CCCHHHHHHHhc----------CCCCceEEEEcCceeecccccCceeEe Confidence 7888999999875 889999873 456788999986 367999999999999999999999999 Q ss_pred chHHHHHHHHHHhhhcCCceeeecceeccceeccc-ccccc--CChhhhhhhhhcCceEEEEecCCeEEEEcceecCCCc Q lcl|NC_012740. 475 PLAADIAGLCARTDAVSQPWMSPAGYNRGQIMNVV-KLAIE--PRKAHRDRLYQAAINPVIGAGGEGFILMGDKTATTVP 551 (667) Q Consensus 475 p~sg~vAg~~a~~d~~~g~~~span~~~~~i~g~~-~~~~~--~~~~e~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~~~~ 551 (667) |||+++||++||+|.++|||+||+|+++.++.++. .+.+. ....|++.||++||++++ +++|+++||+||++.|+ T Consensus 201 p~s~~~Ag~~a~~D~~~g~~~spsN~~i~gi~~~~~~~~~~~~~~~~~a~~Ln~~gi~t~~--~~~G~~~wG~rT~~~d~ 278 (390) T protein:vir:79 201 PAPAIAAGLRAKIDNDIGWHKTISNVVVNGVSGISADVSWDLQDPATDAGYLNEHEVTTLV--NRNGFRFWGERTCSDDP 278 (390) T ss_pred ehHHHHHHHHHhhhccCCcEEccCCceeeccceeeeeccccccccchhhhhhhhcCcEEEE--cCCCEEEEeccccCCCc Confidence 99999999999999999999999999877666543 22222 334577899999999985 46899999999998875 Q ss_pred ccceeeehhhhhHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhcCCeeeeEEEEcccCCCHHHhhCCeEEE Q lcl|NC_012740. 552 SPFDRINVRRLFNMLKKNIGDSSKYKLFENNDNFTRASFRMEVSQYLSTIRSLGGIYDFRVQCDTTNNTPDVIDRNEFVA 631 (667) Q Consensus 552 ~~~~~i~vrR~~~~i~~~l~~~~~~~v~epn~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~ 631 (667) +|+||+||||++||+++|++.++|+|||||++.+|++|+++++.||++||++|+|.||+|+||+++||+++|++|+|++ T Consensus 279 -~~~~i~vrR~~~~i~~~i~~~~~~~v~e~~~~~~~~~i~~~i~~~L~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~ 357 (390) T protein:vir:79 279 -KFAFENYTRTAQVAADSIAEAQMPVVDGPLNPSLARDIVESINGWFRQQVANGYLIGGSAWIDPEPNTADILASGKAYI 357 (390) T ss_pred -ccceeeehhhHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHhhCCEEEE Confidence 7999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEEecCCceEEEEEEEEeecCeeHHHHHHHHhc Q lcl|NC_012740. 632 SMFIKPAKSINYIMLNFTAVATGADFDEIIGPANQ 666 (667) Q Consensus 632 ~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~~~~~~ 666 (667) +|+++|++|+|||+|++...... ++++.+.|+| T Consensus 358 ~i~~~p~~p~e~i~~~~~~~~~~--~~~~~~~v~~ 390 (390) T protein:vir:79 358 DYDYTPVPPLENLVLRQRITDRF--LADFPARVAG 390 (390) T ss_pred EEEEEecCCcceEEEEEEEchHH--HHHHHHHhcC Confidence 99999999999999999987666 7899999999 No 23 >protein:vir:6079 Length: 396 # NCBI annotation: gpFI # Family: family:all:115 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878220;genbank:gi:33438919;genbank:GeneID:1457754 Probab=100.00 E-value=5.3e-94 Score=531.95 Aligned_cols=387 Identities=14% Similarity=0.114 Sum_probs=310.9 Q ss_pred CceecCceEEEEecCCCccccc-CCCceEEEeeccC-----CCCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHHcC Q lcl|NC_012740. 1 MTLLSPGFETKETTLSTTIVQS-ATGRAALVGKFQW-----GPAFQIVQVTNEVELVNKFGQPDNNTADYFMSGANFLQY 74 (667) Q Consensus 1 ~~~~~PGVyvee~~~~~~~~~~-~ts~~afvG~~~~-----Gp~~~p~~i~s~~e~~~~FG~~~~~~~~~~~v~~~f~ng 74 (667) |..-+|||||+|++.+++++.. +|++++|||++++ .|.++|++|+|+.+|...||. ...+.+++..+|.|| T Consensus 1 m~~~~~Gv~v~e~~~~~~~v~~~~tav~~fvGta~~~~~~~~p~~~p~~v~s~~~~~~~~g~---~~tl~~a~~~~~~~g 77 (396) T protein:vir:60 1 MSDYHHGVQVLEINEGTRVISTVSTAIVGMVCTASDADAEIFPLNKPVLITNVQSAIAKAGK---KGTLAASLQAIADQS 77 (396) T ss_pred CCCCCCCeEEEEcCCCcccccccCceeEEEEecccccccccccCccCeEeechHHHHHhhcC---cchhHHHHHHHhhcc Confidence 9999999999999999877665 7999999999965 488999999999999999995 567889999999999 Q ss_pred CCeEEEEEcCCcccccccccccccccceeeeccccccccceeeEeeeccceeccccceeecccccceeeeeccccccccc Q lcl|NC_012740. 75 GNDLRVVRVLNKEKAKNATALAGNVEFEITNEGSNYEVGDTIKIKHNRQDIETAGKVTKVDGDGKVKGVFIPTGKIIAHA 154 (667) Q Consensus 75 G~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~a 154 (667) |..|||+|+..+...... T Consensus 78 g~~~~vv~~~~~~~~~~~-------------------------------------------------------------- 95 (396) T protein:vir:60 78 KPVTVVVRVEDGTGEDEE-------------------------------------------------------------- 95 (396) T ss_pred CceEEEEecccccccccc-------------------------------------------------------------- Confidence 999999997432100000 Q ss_pred ccccccccccccceEEEEEeecccccceeeeceeeeceeeeeeccccchhhhccccccccccccceeeeeecccccccee Q lcl|NC_012740. 155 KAIGVYPELDGGWTAEFTSSSGNGSAALSVTKIVTDSGLLLTDLETSRANITNQDFLTKLKKYDMPAVSAIYAGEIGNSL 234 (667) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~A~~~G~~gn~i 234 (667) .. . +... ..+ T Consensus 96 --~~---------------------------~-------------------------------------~~~~----~~~ 105 (396) T protein:vir:60 96 --TK---------------------------L-------------------------------------AQTV----SNI 105 (396) T ss_pred --cc---------------------------c-------------------------------------cccc----ccc Confidence 00 0 0000 000 Q ss_pred EEEEeecccccccceeeeeeeecccccccceeeeeccccccccceeeeeccceeeeeEeeeccCCccccccccccchhhh Q lcl|NC_012740. 235 EVEILARSSFSGAVAPELTMYPFGGTRAAARNLIPYAPQNDNQYAFIVRRDGVVVESYVLSTLKGDKDVYGNSIYMDDFF 314 (667) Q Consensus 235 ~v~i~~~a~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~v~e~~~~s~~~~~~~~~~~~~~~~~~~ 314 (667) .... +. .+. T Consensus 106 ~~~~------------------------------------d~---------------------~~~-------------- 114 (396) T protein:vir:60 106 IGTT------------------------------------DE---------------------NGQ-------------- 114 (396) T ss_pred cccc------------------------------------cc---------------------ccc-------------- Confidence 0000 00 000 Q ss_pred cccccceEEEecccccCcccceEEecCCccccccccccccccccccchhHHHHHHhhhcccccccEEecCcCCcchhhHH Q lcl|NC_012740. 315 ARGSSQYIYATAQGWVDGFSGIISLAGGVSANEASTGDRGNDPFIGAMMQGWDLFAERESIHVNLLIAGACAGEGDAFST 394 (667) Q Consensus 315 ~~~~s~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~ 394 (667) . +.+ ..+..........+.++++|+. .... T Consensus 115 -~--tg~------------------------------------------~al~~~~~~~~~~~~il~ap~~-----~~~~ 144 (396) T protein:vir:60 115 -Y--TGL------------------------------------------KALLAAESVTGVKPRILGVPGL-----DTKE 144 (396) T ss_pred -c--cch------------------------------------------hhhhhcccceeeeeeecccccc-----ccHH Confidence 0 000 0000000001122445555544 3467 Q ss_pred HHHHHHHHHhhcCcEEEEEccCccccccccccCCHHHHHHHhhhhccccccccccCcceEEEEehhhcccccccCceeEe Q lcl|NC_012740. 395 VQKHAVSIGDERQDCLVMVSPPRSTVVNIPVTTAIDNLIAWREGNSNYSDNNMNINTTYAVIDGNYKYQYDKYNDVNRWV 474 (667) Q Consensus 395 v~~~~~~~~~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~v~d~~~~~~~~~ 474 (667) ++.+++++|++++ +++++|.|. ..+.+++++||+. ++|+|+++||||++++|+.++..+++ T Consensus 145 v~~al~~~~~~~~-~~~i~d~p~--------~~~~~~a~~~~~~----------~~s~~~~~~~p~~~~~d~~~~~~~~~ 205 (396) T protein:vir:60 145 VAVALASVCQKLR-AFGYISAWG--------CKTISEVKAYRQN----------FSQRELMVIWPDFLAWDTVASTTATA 205 (396) T ss_pred HHHHHHHHhccCC-eEEEEeCCC--------CCCHHHHHHHHhh----------cCCceEEEEeCceeeecccCCceeEE Confidence 8999999999876 778888874 4688899999974 57899999999999999999999999 Q ss_pred chHHHHHHHHHHhhhcCCceeeecceeccceeccc-cc--cccCChhhhhhhhhcCceEEEEecCCeEEEEcceecCCCc Q lcl|NC_012740. 475 PLAADIAGLCARTDAVSQPWMSPAGYNRGQIMNVV-KL--AIEPRKAHRDRLYQAAINPVIGAGGEGFILMGDKTATTVP 551 (667) Q Consensus 475 p~sg~vAg~~a~~d~~~g~~~span~~~~~i~g~~-~~--~~~~~~~e~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~~~~ 551 (667) |||+++||++||+|.++|+|+||||+++.|+.+.. .+ ....+..|++.||++|||++. +++|+++||+||+++++ T Consensus 206 p~s~~~AG~~a~~d~~~g~~~spaN~~l~gi~~~~~~~~~~~~~~~~~~~~Ln~~gI~~~~--~~~G~~~wG~rT~~~d~ 283 (396) T protein:vir:60 206 YATARALGLRAKIDQEQGWHKTLSNVGVNGVTGISASVFWDLQESGTDADLLNESGVTTLI--RRDGFRFWGNRTCSDDP 283 (396) T ss_pred chhHHHHHHHHHhhhccCcEeCcCCceecceeeceeecccccCCCcchhhhhhhcCcEEEE--cCCCEEEEcccccCCCc Confidence 99999999999999999999999999987776642 22 234567899999999999994 57899999999999875 Q ss_pred ccceeeehhhhhHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhcCCeeeeEEEEcccCCCHHHhhCCeEEE Q lcl|NC_012740. 552 SPFDRINVRRLFNMLKKNIGDSSKYKLFENNDNFTRASFRMEVSQYLSTIRSLGGIYDFRVQCDTTNNTPDVIDRNEFVA 631 (667) Q Consensus 552 ~~~~~i~vrR~~~~i~~~l~~~~~~~v~epn~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~ 631 (667) +|+||++||+++||+++|++.++|+|||||++.+|++|+++|++||++||++|+|.||+|+||+++||+++|++|+|++ T Consensus 284 -~~~~i~~rR~~~~i~~~i~~~~~~~v~e~n~~~~~~~i~~~i~~~l~~l~~~gal~g~~~~~d~~~nt~~~i~~G~~~~ 362 (396) T protein:vir:60 284 -LFLFENYTRTAQVLADTMAEAHMWAVDKPITATLIRDIVDGINAKFRELKTNGYIVDATCWFSEESNDAETLKAGKLYI 362 (396) T ss_pred -ccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeceEEEEecCCCCHHHhhCCEEEE Confidence 7999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEEecCCceEEEEEEEEeecCeeHHHHHHHHhcC Q lcl|NC_012740. 632 SMFIKPAKSINYIMLNFTAVATGADFDEIIGPANQA 667 (667) Q Consensus 632 ~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~~~~~~~ 667 (667) +|+++|++|+|||+|++++.... ++++++.|++= T Consensus 363 ~i~~~p~~pae~I~~~~~~~~~~--~~~~~~~~~~~ 396 (396) T protein:vir:60 363 DYDYTPVPPLENLTLRQRITDKY--LANLVTSVNSN 396 (396) T ss_pred EEEEEecCCcceEEEEEEEchHH--HHHHHHHhhcC Confidence 99999999999999999988776 78888888888 No 24 >protein:vir:79141 Length: 391 # NCBI annotation: major tail seath protein gpFI # Family: family:all:115 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165276;genbank:gi:145708101;genbank:GeneID:5247152 Probab=100.00 E-value=2.4e-94 Score=533.79 Aligned_cols=381 Identities=14% Similarity=0.120 Sum_probs=305.4 Q ss_pred CceecCceEEEEecCCCccccc-CCCceEEEeecc-----CCCCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHHcC Q lcl|NC_012740. 1 MTLLSPGFETKETTLSTTIVQS-ATGRAALVGKFQ-----WGPAFQIVQVTNEVELVNKFGQPDNNTADYFMSGANFLQY 74 (667) Q Consensus 1 ~~~~~PGVyvee~~~~~~~~~~-~ts~~afvG~~~-----~Gp~~~p~~i~s~~e~~~~FG~~~~~~~~~~~v~~~f~ng 74 (667) |++.+|||||+|++.+++++.. +|++++|||+++ .+|+|+|++|+|+.||...||. ..++.+++..+|.|| T Consensus 2 ~~~~~pGv~v~e~~~~~~~i~~~~tav~~~vg~a~~a~~~~~p~n~pv~iss~~~~~~~~g~---~gtl~~al~~~~~~g 78 (391) T protein:vir:79 2 PTDYHHGVRVVELNDGTRPIRTIETAVAGIVCTADDADAATFPLDTPVLLTNPQAYIGKAGD---KGTLAHTLDAITDQT 78 (391) T ss_pred CCCCCCCeEEEECCCCcccccccCCceEEEEeecccccccccccccCEEeccHHHHHHhcCC---ccccchhhhhhhccc Confidence 8999999999999998877665 799999999986 6899999999999999999996 567888999999999 Q ss_pred CCeEEEEEcCCcccccccccccccccceeeeccccccccceeeEeeeccceeccccceeecccccceeeeeccccccccc Q lcl|NC_012740. 75 GNDLRVVRVLNKEKAKNATALAGNVEFEITNEGSNYEVGDTIKIKHNRQDIETAGKVTKVDGDGKVKGVFIPTGKIIAHA 154 (667) Q Consensus 75 G~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~a 154 (667) |..||+|++.......... .. T Consensus 79 g~~~~vv~~~~~~~~~~~~----------------------------------~~------------------------- 99 (391) T protein:vir:79 79 NPLTVVVRVAGGASEAETT----------------------------------SN------------------------- 99 (391) T ss_pred ccceeeecccccccccccc----------------------------------cc------------------------- Confidence 9999999975321100000 00 Q ss_pred ccccccccccccceEEEEEeecccccceeeeceeeeceeeeeeccccchhhhccccccccccccceeeeeecccccccee Q lcl|NC_012740. 155 KAIGVYPELDGGWTAEFTSSSGNGSAALSVTKIVTDSGLLLTDLETSRANITNQDFLTKLKKYDMPAVSAIYAGEIGNSL 234 (667) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~A~~~G~~gn~i 234 (667) . ..... . .....|- T Consensus 100 --~---------------------------~g~~~-----------------------~---------~~~~tGl----- 113 (391) T protein:vir:79 100 --L---------------------------IGTTN-----------------------A---------AGRYTGM----- 113 (391) T ss_pred --c---------------------------ccccc-----------------------c---------hhhhHHH----- Confidence 0 00000 0 0000000 Q ss_pred EEEEeecccccccceeeeeeeecccccccceeeeeccccccccceeeeeccceeeeeEeeeccCCccccccccccchhhh Q lcl|NC_012740. 235 EVEILARSSFSGAVAPELTMYPFGGTRAAARNLIPYAPQNDNQYAFIVRRDGVVVESYVLSTLKGDKDVYGNSIYMDDFF 314 (667) Q Consensus 235 ~v~i~~~a~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~v~e~~~~s~~~~~~~~~~~~~~~~~~~ 314 (667) ..+ T Consensus 114 -----------------------------------------------------------------------------~~l 116 (391) T protein:vir:79 114 -----------------------------------------------------------------------------KAL 116 (391) T ss_pred -----------------------------------------------------------------------------hhh Confidence 000 Q ss_pred cccccceEEEecccccCcccceEEecCCccccccccccccccccccchhHHHHHHhhhcccccccEEecCcCCcchhhHH Q lcl|NC_012740. 315 ARGSSQYIYATAQGWVDGFSGIISLAGGVSANEASTGDRGNDPFIGAMMQGWDLFAERESIHVNLLIAGACAGEGDAFST 394 (667) Q Consensus 315 ~~~~s~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~ 394 (667) ..... .....+.++++|+. ...+ T Consensus 117 ~~~~~----------------------------------------------------~~~~~p~~l~~p~~-----~~~~ 139 (391) T protein:vir:79 117 LTARN----------------------------------------------------RFGVAPRILAVPGL-----DSLP 139 (391) T ss_pred hhhhh----------------------------------------------------hhcccchhhcCCcc-----chhH Confidence 00000 00001112223322 2346 Q ss_pred HHHHHHHHHhhcCcEEEEEccCccccccccccCCHHHHHHHhhhhccccccccccCcceEEEEehhhcccccccCceeEe Q lcl|NC_012740. 395 VQKHAVSIGDERQDCLVMVSPPRSTVVNIPVTTAIDNLIAWREGNSNYSDNNMNINTTYAVIDGNYKYQYDKYNDVNRWV 474 (667) Q Consensus 395 v~~~~~~~~~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~v~d~~~~~~~~~ 474 (667) ++.+|+.+|++++ +++++|.|. .++.+++++||+. ++|+|+++||||++++|+.++..+++ T Consensus 140 v~~al~~~~~~~~-~~ai~d~p~--------~~t~~~a~~~~~~----------~~s~~~a~~~P~~~~~d~~~~~~~~~ 200 (391) T protein:vir:79 140 VGTELVTIAQKLR-AFAYLSAYG--------CQTKEEAVAYRSN----------FGQREAMVMWPDFVGWDTAANAETTL 200 (391) T ss_pred HHHHHHHHHhhcC-cEEEEECCC--------CCCHHHHHHHHhc----------cCCceeEEecceeeeecCcCCceeee Confidence 8899999999987 667888763 4678899999974 67899999999999999999999999 Q ss_pred chHHHHHHHHHHhhhcCCceeeecceeccceecccc-cc--ccCChhhhhhhhhcCceEEEEecCCeEEEEcceecCCCc Q lcl|NC_012740. 475 PLAADIAGLCARTDAVSQPWMSPAGYNRGQIMNVVK-LA--IEPRKAHRDRLYQAAINPVIGAGGEGFILMGDKTATTVP 551 (667) Q Consensus 475 p~sg~vAg~~a~~d~~~g~~~span~~~~~i~g~~~-~~--~~~~~~e~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~~~~ 551 (667) |||+++||+|||+|.++|||+||+|+++.++.+... +. ......|.+.||++||||++ +++|+++||+||+++++ T Consensus 201 p~s~~~AG~~a~~D~~~g~~~spaN~~l~gi~~~~~~~~~~~~~~~~~~~~Ln~~~I~t~~--~~~G~~~wG~rT~~~d~ 278 (391) T protein:vir:79 201 WATARAVGLRAKIDNDTGWHKTLSNVAVGGVTGLSRDVFWDLQDPATDAGYLNANEVTTLV--HRDGYRFWGSRTCSADP 278 (391) T ss_pred chHHHHHHHHHHhhhcccceeccCCceehhhhccccccccccccccchhhhhhhcCceEEE--CCCcEEEEcccccCCCc Confidence 999999999999999999999999998766555321 22 22334578899999999985 46899999999998875 Q ss_pred ccceeeehhhhhHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhcCCeeeeEEEEcccCCCHHHhhCCeEEE Q lcl|NC_012740. 552 SPFDRINVRRLFNMLKKNIGDSSKYKLFENNDNFTRASFRMEVSQYLSTIRSLGGIYDFRVQCDTTNNTPDVIDRNEFVA 631 (667) Q Consensus 552 ~~~~~i~vrR~~~~i~~~l~~~~~~~v~epn~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~ 631 (667) +|+||++||++++|+++|+++++|+|||||++.+|.+|++++++||++||++|+|.||+|+||+++||+++|++|+|++ T Consensus 279 -~~~~i~~rR~~~~i~~~i~~~~~~~v~epn~~~~~~~i~~~i~~~l~~l~~~g~l~g~~v~~~~~~nt~~~i~~G~~~~ 357 (391) T protein:vir:79 279 -LFAFENYTRTAQVLADTMAEAHMWANDLPMTPTLVRDLLEGINAKLRMLTRNGYLLGGAAWFDADANSKDTLKAGQLAI 357 (391) T ss_pred -ccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeceEEEEecCCCCHHHhhCCEEEE Confidence 8999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEEecCCceEEEEEEEEeecCeeHHHHHHHHhcC Q lcl|NC_012740. 632 SMFIKPAKSINYIMLNFTAVATGADFDEIIGPANQA 667 (667) Q Consensus 632 ~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~~~~~~~ 667 (667) +|+++|++|+|||+|+++..... ++++++.|+|| T Consensus 358 ~i~~~p~~p~e~i~~~~~~~~~~--~~~~~~~v~~a 391 (391) T protein:vir:79 358 DYDYTPVPPLENLTFRQRITDRY--LMQFAEAVKAA 391 (391) T ss_pred EEEEEecCCcceEEEEEEEchHH--HHHHHHHhhcC Confidence 99999999999999999988776 79999999999 No 25 >protein:vir:5711 Length: 396 # NCBI annotation: gpFI # Family: family:all:115 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839870;genbank:gi:30065725;genbank:GeneID:1260618 Probab=100.00 E-value=1.2e-93 Score=530.04 Aligned_cols=387 Identities=15% Similarity=0.133 Sum_probs=308.5 Q ss_pred CceecCceEEEEecCCCccccc-CCCceEEEeeccCC-----CCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHHcC Q lcl|NC_012740. 1 MTLLSPGFETKETTLSTTIVQS-ATGRAALVGKFQWG-----PAFQIVQVTNEVELVNKFGQPDNNTADYFMSGANFLQY 74 (667) Q Consensus 1 ~~~~~PGVyvee~~~~~~~~~~-~ts~~afvG~~~~G-----p~~~p~~i~s~~e~~~~FG~~~~~~~~~~~v~~~f~ng 74 (667) |++-+|||||+|++.+++++.. .|++++|+|+++++ |.++|++|+|+.||...||. ..++.+++.++|.|| T Consensus 1 m~~~~~GV~v~e~~~g~~~i~~v~tav~~~vg~a~~~d~~~~~~~~pv~i~s~~~~~~~~g~---~~tl~~al~~~~~~~ 77 (396) T protein:vir:57 1 MSDYHHGVQVLEINDGTRVISTVSTAIVGMVCTASDADAETFPLNKPVLITNVQSAIAKAGK---KGTLAASLQAIADQS 77 (396) T ss_pred CCCCCCceEEEEcCCCcccccccCCceEEEEEeccCCCcccccCccCeEeecchhhhhhccc---ccchHHHHHHhhhcC Confidence 9999999999999999887765 69999999999876 78999999999999999986 567889999999999 Q ss_pred CCeEEEEEcCCcccccccccccccccceeeeccccccccceeeEeeeccceeccccceeecccccceeeeeccccccccc Q lcl|NC_012740. 75 GNDLRVVRVLNKEKAKNATALAGNVEFEITNEGSNYEVGDTIKIKHNRQDIETAGKVTKVDGDGKVKGVFIPTGKIIAHA 154 (667) Q Consensus 75 G~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~a 154 (667) |..||++|+..+........ . + T Consensus 78 ~~~~~vv~~~~~~~~~~~~~---------------------~-------------------------------------a 99 (396) T protein:vir:57 78 KPVTVVVRVEDGTGDDEETK---------------------L-------------------------------------A 99 (396) T ss_pred CceeEeeecccccccccccc---------------------c-------------------------------------c Confidence 99999998754321100000 0 0 Q ss_pred ccccccccccccceEEEEEeecccccceeeeceeeeceeeeeeccccchhhhccccccccccccceeeeeecccccccee Q lcl|NC_012740. 155 KAIGVYPELDGGWTAEFTSSSGNGSAALSVTKIVTDSGLLLTDLETSRANITNQDFLTKLKKYDMPAVSAIYAGEIGNSL 234 (667) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~A~~~G~~gn~i 234 (667) . +.... .+ T Consensus 100 --~-------------------------t~~~i-------iG-------------------------------------- 107 (396) T protein:vir:57 100 --Q-------------------------TVSNI-------IG-------------------------------------- 107 (396) T ss_pred --c-------------------------cceee-------ee-------------------------------------- Confidence 0 00000 00 Q ss_pred EEEEeecccccccceeeeeeeecccccccceeeeeccccccccceeeeeccceeeeeEeeeccCCccccccccccchhhh Q lcl|NC_012740. 235 EVEILARSSFSGAVAPELTMYPFGGTRAAARNLIPYAPQNDNQYAFIVRRDGVVVESYVLSTLKGDKDVYGNSIYMDDFF 314 (667) Q Consensus 235 ~v~i~~~a~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~v~e~~~~s~~~~~~~~~~~~~~~~~~~ 314 (667) .+.. . . . . .+. ..+ T Consensus 108 --~~~~-~----------------------------~-~---~--------------------tgl-----------~al 121 (396) T protein:vir:57 108 --TTDE-N----------------------------G-Q---Y--------------------TGL-----------KAL 121 (396) T ss_pred --eccc-c----------------------------c-c---c--------------------hhh-----------hhh Confidence 0000 0 0 0 0 000 000 Q ss_pred cccccceEEEecccccCcccceEEecCCccccccccccccccccccchhHHHHHHhhhcccccccEEecCcCCcchhhHH Q lcl|NC_012740. 315 ARGSSQYIYATAQGWVDGFSGIISLAGGVSANEASTGDRGNDPFIGAMMQGWDLFAERESIHVNLLIAGACAGEGDAFST 394 (667) Q Consensus 315 ~~~~s~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~ 394 (667) ......+ ...++++++|+. .... T Consensus 122 ~~~~~~~----------------------------------------------------~~~p~i~~ap~~-----~~~~ 144 (396) T protein:vir:57 122 MGAESVT----------------------------------------------------GVKPRILGVPGL-----DTKE 144 (396) T ss_pred hhcccce----------------------------------------------------eEEeccccCccc-----chhH Confidence 0000000 011223333333 2346 Q ss_pred HHHHHHHHHhhcCcEEEEEccCccccccccccCCHHHHHHHhhhhccccccccccCcceEEEEehhhcccccccCceeEe Q lcl|NC_012740. 395 VQKHAVSIGDERQDCLVMVSPPRSTVVNIPVTTAIDNLIAWREGNSNYSDNNMNINTTYAVIDGNYKYQYDKYNDVNRWV 474 (667) Q Consensus 395 v~~~~~~~~~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~v~d~~~~~~~~~ 474 (667) ++.+|+++|+++ .+++++|.|. .++.+++++||+. ++|.|+++||||++++|+.++..+++ T Consensus 145 v~~al~~~~~~~-~~~~~~d~p~--------~~~~~~~~~~~~~----------~~s~~~~~~~p~~~~~d~~~~~~~~~ 205 (396) T protein:vir:57 145 VAVALASVCQEL-NAFGYISAWG--------CKTISEVKAYRQN----------FSQRELMVIWPDFLAWDTVTSTTATA 205 (396) T ss_pred HHHHHHHHhhhC-ceEEEEcCCC--------CCCHHHHHHHHhc----------cCCceEEEEcceeeeecccCCceeEE Confidence 889999999876 5889998874 4678899999974 67999999999999999999999999 Q ss_pred chHHHHHHHHHHhhhcCCceeeecceeccceeccc-cc--cccCChhhhhhhhhcCceEEEEecCCeEEEEcceecCCCc Q lcl|NC_012740. 475 PLAADIAGLCARTDAVSQPWMSPAGYNRGQIMNVV-KL--AIEPRKAHRDRLYQAAINPVIGAGGEGFILMGDKTATTVP 551 (667) Q Consensus 475 p~sg~vAg~~a~~d~~~g~~~span~~~~~i~g~~-~~--~~~~~~~e~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~~~~ 551 (667) |||+++||++||+|.++|+|+||+|+++.|+.+.. .+ ....++.|++.||++|||++++ ++|+++||+||+++++ T Consensus 206 p~s~~~Ag~~a~~d~~~g~~~spaN~~l~gi~~~~~~~~~~~~~~~~~~~~Ln~~gi~t~~~--~~G~~~wG~rT~~~d~ 283 (396) T protein:vir:57 206 YATARALGLRAKIDQEQGWHKTLSNVGVNGVTGISASVFWDLQKPGTDADLLNEAGVTTLVR--RDGFRFWGNRTCSDDP 283 (396) T ss_pred ehhHHHHHHHHHhhhccCcEeccCCceeccccccceecccccCCcchhhhhhhhcCcEEEEc--CCCEEEEcccccCCCc Confidence 99999999999999999999999999987776642 22 2334567999999999999954 6899999999998875 Q ss_pred ccceeeehhhhhHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhcCCeeeeEEEEcccCCCHHHhhCCeEEE Q lcl|NC_012740. 552 SPFDRINVRRLFNMLKKNIGDSSKYKLFENNDNFTRASFRMEVSQYLSTIRSLGGIYDFRVQCDTTNNTPDVIDRNEFVA 631 (667) Q Consensus 552 ~~~~~i~vrR~~~~i~~~l~~~~~~~v~epn~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~ 631 (667) +|+||++||+++||+++|++.++|+|||||++.+|++|+++|+.||++||++|+|+||+|+||+++||+++|++|+|++ T Consensus 284 -~~~~i~vrR~~~~i~~~i~~~~~~~v~e~n~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~n~~~~i~~G~~~~ 362 (396) T protein:vir:57 284 -LFLFESYTRTAQVLADTMAEAHMWAIDKPITATLIRDIIDGINAKFRELKNNGYIVDGTCWFSEESNDAETLKAGKLYI 362 (396) T ss_pred -ccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeceEEEEecCCCCHHHhhCCeEEE Confidence 7999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEEecCCceEEEEEEEEeecCeeHHHHHHHHhcC Q lcl|NC_012740. 632 SMFIKPAKSINYIMLNFTAVATGADFDEIIGPANQA 667 (667) Q Consensus 632 ~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~~~~~~~ 667 (667) +|+++|++|+|||+|++++.... ++++++.|++- T Consensus 363 ~v~~~p~~p~e~I~~~~~~~~~~--~~~~~~~~~~~ 396 (396) T protein:vir:57 363 DYDYTPVPPLENLTLRQRITSRY--LASLVTSVNSN 396 (396) T ss_pred EEEEEecCCcceEEEEEEEchHH--HHHHHHHhhcC Confidence 99999999999999999988777 78888888777 No 26 >protein:vir:98553 Length: 395 # NCBI annotation: gp21 # Family: family:all:115 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958076;genbank:gi:41057373;genbank:GeneID:2744224 Probab=100.00 E-value=7.4e-94 Score=531.16 Aligned_cols=383 Identities=15% Similarity=0.114 Sum_probs=311.3 Q ss_pred CceecCceEEEEecCCCcccc-cCCCceEEEeeccCC-----CCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHHcC Q lcl|NC_012740. 1 MTLLSPGFETKETTLSTTIVQ-SATGRAALVGKFQWG-----PAFQIVQVTNEVELVNKFGQPDNNTADYFMSGANFLQY 74 (667) Q Consensus 1 ~~~~~PGVyvee~~~~~~~~~-~~ts~~afvG~~~~G-----p~~~p~~i~s~~e~~~~FG~~~~~~~~~~~v~~~f~ng 74 (667) |..-+|||||+|++.+++++. +.|++++|||+++.+ |+++|++|+|+.+|...||. ..++..++..+|.|| T Consensus 1 m~~~~~GV~v~e~~~g~~~v~~v~tav~~~vgta~~~~~~~~p~~~pv~v~s~~~~~~~~g~---~~tl~~al~~~~~~~ 77 (395) T protein:vir:98 1 MSDFHHGTQVIEINDGTRVISTVATAVVGMVCTASDADATLFPLNEPVLITNVQSAIAKAGK---KGTLAASLQAIADQS 77 (395) T ss_pred CCCCCCCeEEEEcCCCcccccccCcceEEEEeeccCCCccccccccceEeechHHhHhhccc---ccchhhHHHHHhhcc Confidence 999999999999999988766 479999999999764 78999999999999999996 567888999999999 Q ss_pred CCeEEEEEcCCcccccccccccccccceeeeccccccccceeeEeeeccceeccccceeecccccceeeeeccccccccc Q lcl|NC_012740. 75 GNDLRVVRVLNKEKAKNATALAGNVEFEITNEGSNYEVGDTIKIKHNRQDIETAGKVTKVDGDGKVKGVFIPTGKIIAHA 154 (667) Q Consensus 75 G~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~a 154 (667) |+.|+|+++..+...... . T Consensus 78 ~~~~~vv~~~~~~~~~~~------------------------------------~------------------------- 96 (395) T protein:vir:98 78 KPVTVVVRVEDGTGDDEE------------------------------------A------------------------- 96 (395) T ss_pred CceEEEeecccccccccc------------------------------------c------------------------- Confidence 999999987532210000 0 Q ss_pred ccccccccccccceEEEEEeecccccceeeeceeeeceeeeeeccccchhhhccccccccccccceeeeeecccccccee Q lcl|NC_012740. 155 KAIGVYPELDGGWTAEFTSSSGNGSAALSVTKIVTDSGLLLTDLETSRANITNQDFLTKLKKYDMPAVSAIYAGEIGNSL 234 (667) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~A~~~G~~gn~i 234 (667) .. ... . . ...|. T Consensus 97 ----~~--------------------------a~~-----------------------~---~-------~i~g~----- 108 (395) T protein:vir:98 97 ----AL--------------------------AQT-----------------------V---S-------NIIGG----- 108 (395) T ss_pred ----cc--------------------------ccc-----------------------c---c-------ccccc----- Confidence 00 000 0 0 00000 Q ss_pred EEEEeecccccccceeeeeeeecccccccceeeeeccccccccceeeeeccceeeeeEeeeccCCccccccccccchhhh Q lcl|NC_012740. 235 EVEILARSSFSGAVAPELTMYPFGGTRAAARNLIPYAPQNDNQYAFIVRRDGVVVESYVLSTLKGDKDVYGNSIYMDDFF 314 (667) Q Consensus 235 ~v~i~~~a~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~v~e~~~~s~~~~~~~~~~~~~~~~~~~ 314 (667) . .. .+ T Consensus 109 ---~---------------------------------~~------------------------~~--------------- 113 (395) T protein:vir:98 109 ---T---------------------------------DE------------------------NG--------------- 113 (395) T ss_pred ---c---------------------------------cc------------------------cc--------------- Confidence 0 00 00 Q ss_pred cccccceEEEecccccCcccceEEecCCccccccccccccccccccchhHHHHHHh---hhcccccccEEecCcCCcchh Q lcl|NC_012740. 315 ARGSSQYIYATAQGWVDGFSGIISLAGGVSANEASTGDRGNDPFIGAMMQGWDLFA---ERESIHVNLLIAGACAGEGDA 391 (667) Q Consensus 315 ~~~~s~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~l~~~~~~~~~~~ 391 (667) ..+++.++. ......+.++++|+.. T Consensus 114 -----------------------------------------------~~Tgl~al~~~~~~~~~~p~il~ap~~~----- 141 (395) T protein:vir:98 114 -----------------------------------------------KYTGIKALLTAQAVTGVKPRILGVPGLD----- 141 (395) T ss_pred -----------------------------------------------chhHHHHHhhhhhhhccchhhccccccc----- Confidence 000000000 0111234456666653 Q ss_pred hHHHHHHHHHHHhhcCcEEEEEccCccccccccccCCHHHHHHHhhhhccccccccccCcceEEEEehhhcccccccCce Q lcl|NC_012740. 392 FSTVQKHAVSIGDERQDCLVMVSPPRSTVVNIPVTTAIDNLIAWREGNSNYSDNNMNINTTYAVIDGNYKYQYDKYNDVN 471 (667) Q Consensus 392 ~~~v~~~~~~~~~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~v~d~~~~~~ 471 (667) ..+++.+|..+|++++ +++++|.|. +.+.+++++||+. ++|+|+++||||++++|+.++.. T Consensus 142 ~~~v~~al~~~~~~~~-~~~~~d~p~--------~~t~~~a~~~~~~----------~~s~~~~~~~p~~~~~d~~~~~~ 202 (395) T protein:vir:98 142 TKEVAVALASAAIKLR-AFAYVSAWG--------CKTISEAMEYRKN----------FSQRELMVIWPDFLAWDTVKNTT 202 (395) T ss_pred ccHHHHHHHHHhhhcC-cEEEEEcCC--------CCCHHHHHHHHhc----------cCCceEEEEecceeEecccCCce Confidence 3468899999999876 788888874 4678899999963 67899999999999999999999 Q ss_pred eEechHHHHHHHHHHhhhcCCceeeecceeccceeccc---cccccCChhhhhhhhhcCceEEEEecCCeEEEEcceecC Q lcl|NC_012740. 472 RWVPLAADIAGLCARTDAVSQPWMSPAGYNRGQIMNVV---KLAIEPRKAHRDRLYQAAINPVIGAGGEGFILMGDKTAT 548 (667) Q Consensus 472 ~~~p~sg~vAg~~a~~d~~~g~~~span~~~~~i~g~~---~~~~~~~~~e~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~ 548 (667) +++|||+++||++||+|.++|||+||+|+++.++.++. ..+..+++.|++.||++|||++. +++|+++||+||++ T Consensus 203 ~~~p~s~~~AG~~a~~d~~~g~~~spaN~~i~gi~~~~~~~~~~~~~~~~~~~~Ln~~gI~~~~--~~~G~~~wG~rT~s 280 (395) T protein:vir:98 203 ATAYATARALGLRAYIDQTVGWHKTLSNVGVQGVTGISASVFWDLQASGTDADLLNEAGVTTLV--RKDGFRFWGNRTCS 280 (395) T ss_pred eeechHHHHHHHHHHhhcccCcEeccCCceeecccccceecccccCCCcchHHhhhhcCcEEEE--cCCCEEEEcccccC Confidence 99999999999999999999999999999987777653 22344567899999999999995 57899999999998 Q ss_pred CCcccceeeehhhhhHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhcCCeeeeEEEEcccCCCHHHhhCCe Q lcl|NC_012740. 549 TVPSPFDRINVRRLFNMLKKNIGDSSKYKLFENNDNFTRASFRMEVSQYLSTIRSLGGIYDFRVQCDTTNNTPDVIDRNE 628 (667) Q Consensus 549 ~~~~~~~~i~vrR~~~~i~~~l~~~~~~~v~epn~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~nt~~~i~~G~ 628 (667) +++ +|+||++||+++||+++|++.++|++||||++.+|.+|+++++.||++||++|+|.||+|+||+++||+++|++|+ T Consensus 281 ~d~-~~~~i~~rR~~~~i~~~i~~~~~~~v~e~~~~~~~~~i~~~i~~~L~~l~~~g~l~g~~v~~d~~~nt~~~i~~G~ 359 (395) T protein:vir:98 281 DDP-LFLFENYTRTAQVLADTMAEAHMWAVDKPITATLIRDIVDGINAKFRELKSNGYIVEGKCWFDEESNDKETLKAGK 359 (395) T ss_pred CCc-ccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeceEEEEecCCCCHHHhhCCe Confidence 775 8999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEEEEEecCCceEEEEEEEEeecCeeHHHHHHHHhc Q lcl|NC_012740. 629 FVASMFIKPAKSINYIMLNFTAVATGADFDEIIGPANQ 666 (667) Q Consensus 629 ~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~~~~~~ 666 (667) |+++|+++|++|+|||+|+++..... +++++++|++ T Consensus 360 ~~~~i~~~p~~p~e~I~~~~~~~~~~--~~~~~~~~~~ 395 (395) T protein:vir:98 360 LYIDYDYTPVPPLESLTLRQRITDKY--LVNLAESVNS 395 (395) T ss_pred EEEEEEEEecCCcceEEEEEEEchHH--HHHHHHHhcC Confidence 99999999999999999999988777 7788888888 No 27 >protein:vir:2035 Length: 396 # NCBI annotation: gpFI # Family: family:all:115 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046778;genbank:gi:9630349;genbank:GeneID:1261516 Probab=100.00 E-value=1.7e-93 Score=529.23 Aligned_cols=387 Identities=13% Similarity=0.123 Sum_probs=309.5 Q ss_pred CceecCceEEEEecCCCccccc-CCCceEEEeeccCC-----CCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHHcC Q lcl|NC_012740. 1 MTLLSPGFETKETTLSTTIVQS-ATGRAALVGKFQWG-----PAFQIVQVTNEVELVNKFGQPDNNTADYFMSGANFLQY 74 (667) Q Consensus 1 ~~~~~PGVyvee~~~~~~~~~~-~ts~~afvG~~~~G-----p~~~p~~i~s~~e~~~~FG~~~~~~~~~~~v~~~f~ng 74 (667) |+.-+|||||+|++.+++++.. .|++++|||+++++ |+++|++|+|+.+|...||+ ...+++++.++|.|| T Consensus 1 m~~~~~GV~v~e~~~g~~~i~~v~tav~~~vg~a~~a~~~~~~l~~pvlvts~~~~~~~~g~---~~tL~~al~~~~~ng 77 (396) T protein:vir:20 1 MSDYHHGVQVLEINEGTRVISTVSTAIVGMVCTASDADAETFPLNKPVLITNVQSAISKAGK---KGTLAASLQAIADQS 77 (396) T ss_pred CCCCCCCeEEEEcCCCcceeeecCCceeEEEeeeccCCCccccCccCEEeechHHHHhhccc---ccchhhhhhhhhccC Confidence 9988999999999999887765 69999999998654 78999999999999999996 566888999999999 Q ss_pred CCeEEEEEcCCcccccccccccccccceeeeccccccccceeeEeeeccceeccccceeecccccceeeeeccccccccc Q lcl|NC_012740. 75 GNDLRVVRVLNKEKAKNATALAGNVEFEITNEGSNYEVGDTIKIKHNRQDIETAGKVTKVDGDGKVKGVFIPTGKIIAHA 154 (667) Q Consensus 75 G~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~a 154 (667) |..||++|+......... .. T Consensus 78 g~~~~v~~~~~~~~~~~~------------------------------------~~------------------------ 97 (396) T protein:vir:20 78 KPVTVVMRVEDGTGDDEE------------------------------------TK------------------------ 97 (396) T ss_pred ceeEEEEecccccccccc------------------------------------cc------------------------ Confidence 999999997432110000 00 Q ss_pred ccccccccccccceEEEEEeecccccceeeeceeeeceeeeeeccccchhhhccccccccccccceeeeeecccccccee Q lcl|NC_012740. 155 KAIGVYPELDGGWTAEFTSSSGNGSAALSVTKIVTDSGLLLTDLETSRANITNQDFLTKLKKYDMPAVSAIYAGEIGNSL 234 (667) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~A~~~G~~gn~i 234 (667) . .. ... .+ T Consensus 98 --~-----------------------------a~------------------------------------t~~-----~~ 105 (396) T protein:vir:20 98 --L-----------------------------AQ------------------------------------TVS-----NI 105 (396) T ss_pred --c-----------------------------cc------------------------------------ccc-----cc Confidence 0 00 000 00 Q ss_pred EEEEeecccccccceeeeeeeecccccccceeeeeccccccccceeeeeccceeeeeEeeeccCCccccccccccchhhh Q lcl|NC_012740. 235 EVEILARSSFSGAVAPELTMYPFGGTRAAARNLIPYAPQNDNQYAFIVRRDGVVVESYVLSTLKGDKDVYGNSIYMDDFF 314 (667) Q Consensus 235 ~v~i~~~a~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~v~e~~~~s~~~~~~~~~~~~~~~~~~~ 314 (667) .... . . .+. ..+. ..+ T Consensus 106 ~~~~--------------------------------~-~-----------~~~---------~tg~-----------~al 121 (396) T protein:vir:20 106 IGTT--------------------------------D-E-----------NGQ---------YTGL-----------KAM 121 (396) T ss_pred cccc--------------------------------c-c-----------ccc---------cchh-----------hhh Confidence 0000 0 0 000 0000 000 Q ss_pred cccccceEEEecccccCcccceEEecCCccccccccccccccccccchhHHHHHHhhhcccccccEEecCcCCcchhhHH Q lcl|NC_012740. 315 ARGSSQYIYATAQGWVDGFSGIISLAGGVSANEASTGDRGNDPFIGAMMQGWDLFAERESIHVNLLIAGACAGEGDAFST 394 (667) Q Consensus 315 ~~~~s~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~ 394 (667) .... ......+.++++|+. .... T Consensus 122 ~~~~----------------------------------------------------~~~~~~p~i~~ap~~-----~~~~ 144 (396) T protein:vir:20 122 LAAE----------------------------------------------------SVTGVKPRILGVPGL-----DTKE 144 (396) T ss_pred hhhc----------------------------------------------------cccccchhhhhhhhh-----ccHH Confidence 0000 000011223344443 3356 Q ss_pred HHHHHHHHHhhcCcEEEEEccCccccccccccCCHHHHHHHhhhhccccccccccCcceEEEEehhhcccccccCceeEe Q lcl|NC_012740. 395 VQKHAVSIGDERQDCLVMVSPPRSTVVNIPVTTAIDNLIAWREGNSNYSDNNMNINTTYAVIDGNYKYQYDKYNDVNRWV 474 (667) Q Consensus 395 v~~~~~~~~~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~v~d~~~~~~~~~ 474 (667) |+.+|.++|++++ +++++|.|. ..+.+++++||+. ++|+|+++||||++++|+.++..+++ T Consensus 145 v~~al~~~~~~~~-~~~~iD~p~--------~~~~~~a~~~r~~----------~~s~~~~~~~P~~~~~d~~~~~~~~~ 205 (396) T protein:vir:20 145 VAVALASVCQKLR-AFGYISAWG--------CKTISEVKAYRQN----------FSQRELMVIWPDFLAWDTVTSTTATA 205 (396) T ss_pred HHHHHHHHHhcCC-cEEEEecCC--------CCCHHHHHHHhhC----------CCCceEEEEcCccccccCcCCcceee Confidence 8999999999876 678888874 4578899999963 67899999999999999999999999 Q ss_pred chHHHHHHHHHHhhhcCCceeeecceeccceecccc-cc--ccCChhhhhhhhhcCceEEEEecCCeEEEEcceecCCCc Q lcl|NC_012740. 475 PLAADIAGLCARTDAVSQPWMSPAGYNRGQIMNVVK-LA--IEPRKAHRDRLYQAAINPVIGAGGEGFILMGDKTATTVP 551 (667) Q Consensus 475 p~sg~vAg~~a~~d~~~g~~~span~~~~~i~g~~~-~~--~~~~~~e~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~~~~ 551 (667) |||+++||++||+|.++|+|+||+|+++.||.+... +. ..+++.|++.||++|||+++ +++|+++||+||+++++ T Consensus 206 p~s~~~Ag~~a~~d~~~g~~~spaN~~l~gi~~~~~~~~~~~~~~~~~~~~Ln~~gi~~~~--~~~G~~~wG~rT~s~d~ 283 (396) T protein:vir:20 206 YATARALGLRAKIDQEQGWHKTLSNVGVNGVTGISASVFWDLQESGTDADLLNESGVTTLI--RRDGFRFWGNRTCSDDP 283 (396) T ss_pred chhHHHHHHHHHhhhhcCcEeccCCceeccceecceecccccCCCcchhhhhhhcCcEEEE--cCCCEEEEcccccCCCc Confidence 999999999999999999999999999877776532 22 33567899999999999995 47899999999998875 Q ss_pred ccceeeehhhhhHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhcCCeeeeEEEEcccCCCHHHhhCCeEEE Q lcl|NC_012740. 552 SPFDRINVRRLFNMLKKNIGDSSKYKLFENNDNFTRASFRMEVSQYLSTIRSLGGIYDFRVQCDTTNNTPDVIDRNEFVA 631 (667) Q Consensus 552 ~~~~~i~vrR~~~~i~~~l~~~~~~~v~epn~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~ 631 (667) +|+||++||+++||+++|++.++|+|||||++.+|++|+++++.||++||++|+|.||+|+||+++||+++|++|+|++ T Consensus 284 -~~~~i~~rR~~~~i~~~~~~~~~~~v~e~~~~~~~~~i~~~i~~~L~~l~~~G~l~g~~v~~d~~~nt~~~i~~G~~~~ 362 (396) T protein:vir:20 284 -LFLFENYTRTAQVVADTMAEAHMWAVDKPITATLIRDIVDGINAKFRELKTNGYIVDATCWFSEESNDAETLKAGKLYI 362 (396) T ss_pred -ccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCcceeceEEEEecCCCCHHHhhCCEEEE Confidence 8999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEEecCCceEEEEEEEEeecCeeHHHHHHHHhcC Q lcl|NC_012740. 632 SMFIKPAKSINYIMLNFTAVATGADFDEIIGPANQA 667 (667) Q Consensus 632 ~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~~~~~~~ 667 (667) +|+++|++|+|||+|+++..... +++++++|++= T Consensus 363 ~i~~~p~~p~e~i~~~~~~~~~~--~~~~~~~~~~~ 396 (396) T protein:vir:20 363 DYDYTPVPPLENLTLRQRITDKY--LANLVTSVNSN 396 (396) T ss_pred EEEEEecCCcceEEEEEEEchHH--HHHHHHHhhcC Confidence 99999999999999999987776 88999999988 No 28 >protein:vir:1172 Length: 391 # NCBI annotation: hypothetical protein # Family: family:all:115 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490621;genbank:gi:17313241;genbank:GeneID:927300 Probab=100.00 E-value=2.4e-93 Score=528.31 Aligned_cols=380 Identities=16% Similarity=0.142 Sum_probs=309.2 Q ss_pred CceecCceEEEEecCCCccccc-CCCceEEEeecc-----CCCCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHHcC Q lcl|NC_012740. 1 MTLLSPGFETKETTLSTTIVQS-ATGRAALVGKFQ-----WGPAFQIVQVTNEVELVNKFGQPDNNTADYFMSGANFLQY 74 (667) Q Consensus 1 ~~~~~PGVyvee~~~~~~~~~~-~ts~~afvG~~~-----~Gp~~~p~~i~s~~e~~~~FG~~~~~~~~~~~v~~~f~ng 74 (667) |+|.+|||||+|++.+++++.. .|++++|+|+++ .+|+++|++|+|+.+|...||. ..++.+++.++|.|| T Consensus 3 ~~~~~~GV~v~e~~~~~~~i~~v~tavig~vg~a~~a~~~~~~~~~p~~v~s~~~~~~~~g~---~~tl~~al~~~~~~~ 79 (391) T protein:vir:11 3 ADQYHHGVRVQEINDGTRPIRTIATAIIGLVATAEDADATAFPLDTPVLITNVQAAIGKAGT---SGTLPASLQAIADQA 79 (391) T ss_pred CCcCCCcEEEEECCCCcceecccCCceeEEEEecCCCCCccccccccEEEecchhhheecCC---Cccchhhhhhhhccc Confidence 8899999999999999887765 699999999997 4799999999999999999985 566888999999999 Q ss_pred CCeEEEEEcCCcccccccccccccccceeeeccccccccceeeEeeeccceeccccceeecccccceeeeeccccccccc Q lcl|NC_012740. 75 GNDLRVVRVLNKEKAKNATALAGNVEFEITNEGSNYEVGDTIKIKHNRQDIETAGKVTKVDGDGKVKGVFIPTGKIIAHA 154 (667) Q Consensus 75 G~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~a 154 (667) |+.||+||+..+...... . . T Consensus 80 g~~~~vv~~~~~~~~~~t-------------------~---------------------~-------------------- 99 (391) T protein:vir:11 80 NAATVVVRVKPGEDEAAT-------------------N---------------------S-------------------- 99 (391) T ss_pred cceeEEeeeccccccccc-------------------c---------------------h-------------------- Confidence 999999997432110000 0 0 Q ss_pred ccccccccccccceEEEEEeecccccceeeeceeeeceeeeeeccccchhhhccccccccccccceeeeeecccccccee Q lcl|NC_012740. 155 KAIGVYPELDGGWTAEFTSSSGNGSAALSVTKIVTDSGLLLTDLETSRANITNQDFLTKLKKYDMPAVSAIYAGEIGNSL 234 (667) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~A~~~G~~gn~i 234 (667) +..+ . T Consensus 100 d~~g----------------------------------------------------------------------~----- 104 (391) T protein:vir:11 100 AVIG----------------------------------------------------------------------G----- 104 (391) T ss_pred hhhc----------------------------------------------------------------------c----- Confidence 0000 0 Q ss_pred EEEEeecccccccceeeeeeeecccccccceeeeeccccccccceeeeeccceeeeeEeeeccCCccccccccccchhhh Q lcl|NC_012740. 235 EVEILARSSFSGAVAPELTMYPFGGTRAAARNLIPYAPQNDNQYAFIVRRDGVVVESYVLSTLKGDKDVYGNSIYMDDFF 314 (667) Q Consensus 235 ~v~i~~~a~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~v~e~~~~s~~~~~~~~~~~~~~~~~~~ 314 (667) +.. .... .+.. .+ T Consensus 105 ---~~a------------------------------~~~~-----------------------~g~~-----------a~ 117 (391) T protein:vir:11 105 ---VSA------------------------------DGKY-----------------------TGMK-----------AL 117 (391) T ss_pred ---ccc------------------------------ccch-----------------------hhhh-----------hh Confidence 000 0000 0000 00 Q ss_pred cccccceEEEecccccCcccceEEecCCccccccccccccccccccchhHHHHHHhhhcccccccEEecCcCCcchhhHH Q lcl|NC_012740. 315 ARGSSQYIYATAQGWVDGFSGIISLAGGVSANEASTGDRGNDPFIGAMMQGWDLFAERESIHVNLLIAGACAGEGDAFST 394 (667) Q Consensus 315 ~~~~s~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~ 394 (667) .+....+ ...+.++++|+. ...+ T Consensus 118 ~~~~~~~----------------------------------------------------~~~p~~~~ap~~-----~~~~ 140 (391) T protein:vir:11 118 LAAKARL----------------------------------------------------GVVPRILGVPGL-----DTQP 140 (391) T ss_pred hhhhhhh----------------------------------------------------eecccccccccc-----ccHH Confidence 0000000 001223344443 2356 Q ss_pred HHHHHHHHHhhcCcEEEEEccCccccccccccCCHHHHHHHhhhhccccccccccCcceEEEEehhhcccccccCceeEe Q lcl|NC_012740. 395 VQKHAVSIGDERQDCLVMVSPPRSTVVNIPVTTAIDNLIAWREGNSNYSDNNMNINTTYAVIDGNYKYQYDKYNDVNRWV 474 (667) Q Consensus 395 v~~~~~~~~~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~v~d~~~~~~~~~ 474 (667) ++.+|+++|+++ ++++++|.|. ..+.+++++||+. ++|+|+++||||++++|+.++..+++ T Consensus 141 v~~al~~~~~~~-~~~~i~D~p~--------~~t~~~a~~~r~~----------~~s~~~~~~~p~~~~~~~~~~~~~~~ 201 (391) T protein:vir:11 141 VATALIAIAQQL-RAFAYVSASG--------CKTKEEATAYREN----------FAAREAMVIWPDFLTWSTVVNQTVPA 201 (391) T ss_pred HHHHHHHhhccc-ceEEEEEcCC--------CCCHHHHHHHhhh----------cCCceEEEEcCcceecccccCceEEe Confidence 899999999887 5889999873 4578899999973 67999999999999999999999999 Q ss_pred chHHHHHHHHHHhhhcCCceeeecceeccceecccc-cc--ccCChhhhhhhhhcCceEEEEecCCeEEEEcceecCCCc Q lcl|NC_012740. 475 PLAADIAGLCARTDAVSQPWMSPAGYNRGQIMNVVK-LA--IEPRKAHRDRLYQAAINPVIGAGGEGFILMGDKTATTVP 551 (667) Q Consensus 475 p~sg~vAg~~a~~d~~~g~~~span~~~~~i~g~~~-~~--~~~~~~e~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~~~~ 551 (667) |||+++||++||+|.++|||+||||+++.|+.++.. +. ...++.|++.||++|||+++ +++|+++||+||++.++ T Consensus 202 p~s~~~ag~~a~~d~~~g~~~span~~l~gi~~~~~~~~~~~~~~~~~~~~Ln~~gi~~~~--~~~G~~~wG~rT~~~d~ 279 (391) T protein:vir:11 202 PAVAQALGLRARIDQEVGWHKTLSNVAVNGVTGISADVFWDLQSPSTDANYLNENEVTTLV--QEGGFRFWGSRTCSDDP 279 (391) T ss_pred chHHHHHHHHHHhhccCCcEEccCCceeeceeecccccccccCCCcchhhhhhhcCcEEEE--cCCCEEEEcccccCCCc Confidence 999999999999999999999999999877776532 22 33457899999999999984 57899999999998875 Q ss_pred ccceeeehhhhhHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhcCCeeeeEEEEcccCCCHHHhhCCeEEE Q lcl|NC_012740. 552 SPFDRINVRRLFNMLKKNIGDSSKYKLFENNDNFTRASFRMEVSQYLSTIRSLGGIYDFRVQCDTTNNTPDVIDRNEFVA 631 (667) Q Consensus 552 ~~~~~i~vrR~~~~i~~~l~~~~~~~v~epn~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~ 631 (667) +|+||+|||+|+||+++|++.++|+|||||++.+|++|+++++.||++||++|+|+||+|+||+++||+++|++|+|++ T Consensus 280 -~~~~i~vrR~~~~i~~~~~~~~~~~v~e~n~~~~~~~i~~~i~~~l~~l~~~g~l~g~~~~~~~~~n~~~~i~~G~~~~ 358 (391) T protein:vir:11 280 -LFAFENYTRTAQVLADTIAEAHMWAVDKPMHPSLVRDILEGVNAKFRELKGLGLIIDAQAWYDPNVNDKDTLKAGKLRI 358 (391) T ss_pred -ccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhccceeceEEEEecCCCCHHHhhCCeEEE Confidence 8999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEEecCCceEEEEEEEEeecCeeHHHHHHHHhc Q lcl|NC_012740. 632 SMFIKPAKSINYIMLNFTAVATGADFDEIIGPANQ 666 (667) Q Consensus 632 ~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~~~~~~ 666 (667) +|+++|++|+|||++++++.... ++|+++.|+| T Consensus 359 ~i~~~p~~p~e~i~~~~~~~~~~--~~~~~~~~~a 391 (391) T protein:vir:11 359 TYDYTPVPPLEDLTFFQKITDSY--LVDFASRVNA 391 (391) T ss_pred EEEEEecCCcceEEEEEEEchHH--HHHHHHHhcC Confidence 99999999999999999987776 8999999999 No 29 >protein:vir:1845 Length: 392 # NCBI annotation: tail sheath protein FI # Family: family:all:115 # MgeID: mge:324 # MgeName: 186 # Cross-refs: genbank:acc:NP_052269;genbank:gi:9634076;genbank:GeneID:1262448 Probab=100.00 E-value=9.9e-93 Score=524.98 Aligned_cols=383 Identities=15% Similarity=0.119 Sum_probs=311.1 Q ss_pred CceecCceEEEEecCCCccccc-CCCceEEEeeccCC-----CCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHHcC Q lcl|NC_012740. 1 MTLLSPGFETKETTLSTTIVQS-ATGRAALVGKFQWG-----PAFQIVQVTNEVELVNKFGQPDNNTADYFMSGANFLQY 74 (667) Q Consensus 1 ~~~~~PGVyvee~~~~~~~~~~-~ts~~afvG~~~~G-----p~~~p~~i~s~~e~~~~FG~~~~~~~~~~~v~~~f~ng 74 (667) |.--+|||||+|++++++++.. +|++.+|+|+++++ |+++|++|+|+.+|...||. ...+.+++..+|.|| T Consensus 1 m~~~~~Gv~v~e~~~g~~~i~~~~tav~g~vgta~~~~~~~~~~~~p~~its~~~~~~~~g~---~gtl~~al~~~~~ng 77 (392) T protein:vir:18 1 MSDFHHGTKVIEINDGTRVISTVATAIVGMVWTASDADAETFPLNEPVLITNVQSAIAKAGK---KGTLSASLQAIADQS 77 (392) T ss_pred CCCCCCCeEEEEcCCCceeeeccCcceeEEEEeccCCCCcccccccceEeechHHHHhhcCC---CcchHHHHHHhhccc Confidence 8776799999999999988876 69999999999765 89999999999999999986 567888999999999 Q ss_pred CCeEEEEEcCCcccccccccccccccceeeeccccccccceeeEeeeccceeccccceeecccccceeeeeccccccccc Q lcl|NC_012740. 75 GNDLRVVRVLNKEKAKNATALAGNVEFEITNEGSNYEVGDTIKIKHNRQDIETAGKVTKVDGDGKVKGVFIPTGKIIAHA 154 (667) Q Consensus 75 G~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~a 154 (667) |..|+++++..+....... T Consensus 78 g~~~~vv~v~~~~~~~~~~------------------------------------------------------------- 96 (392) T protein:vir:18 78 KPVTVVVRVAEGTGDDAEA------------------------------------------------------------- 96 (392) T ss_pred CceEEEecccccccccccc------------------------------------------------------------- Confidence 9999999874321000000 Q ss_pred ccccccccccccceEEEEEeecccccceeeeceeeeceeeeeeccccchhhhccccccccccccceeeeeecccccccee Q lcl|NC_012740. 155 KAIGVYPELDGGWTAEFTSSSGNGSAALSVTKIVTDSGLLLTDLETSRANITNQDFLTKLKKYDMPAVSAIYAGEIGNSL 234 (667) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~A~~~G~~gn~i 234 (667) .. .. .+ .|. T Consensus 97 --~t-------------------------~~------------------dl---------------------iG~----- 105 (392) T protein:vir:18 97 --QT-------------------------TS------------------NI---------------------IGG----- 105 (392) T ss_pred --cc-------------------------hh------------------hh---------------------eec----- Confidence 00 00 00 000 Q ss_pred EEEEeecccccccceeeeeeeecccccccceeeeeccccccccceeeeeccceeeeeEeeeccCCccccccccccchhhh Q lcl|NC_012740. 235 EVEILARSSFSGAVAPELTMYPFGGTRAAARNLIPYAPQNDNQYAFIVRRDGVVVESYVLSTLKGDKDVYGNSIYMDDFF 314 (667) Q Consensus 235 ~v~i~~~a~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~v~e~~~~s~~~~~~~~~~~~~~~~~~~ 314 (667) . . . .+. T Consensus 106 ---~--------------------------------~-~------------------------~~~-------------- 111 (392) T protein:vir:18 106 ---T--------------------------------D-E------------------------NGK-------------- 111 (392) T ss_pred ---c--------------------------------c-c------------------------cch-------------- Confidence 0 0 0 000 Q ss_pred cccccceEEEecccccCcccceEEecCCccccccccccccccccccchhHHHHHHhhhcccccccEEecCcCCcchhhHH Q lcl|NC_012740. 315 ARGSSQYIYATAQGWVDGFSGIISLAGGVSANEASTGDRGNDPFIGAMMQGWDLFAERESIHVNLLIAGACAGEGDAFST 394 (667) Q Consensus 315 ~~~~s~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~ 394 (667) . + ...++..........++++++|+.. ..+ T Consensus 112 -~--t------------------------------------------g~~al~~~~~~~~~~p~il~ap~~~-----~~~ 141 (392) T protein:vir:18 112 -Y--T------------------------------------------GIKALLTAEAVTGVKPRILGVPGLD-----TQE 141 (392) T ss_pred -h--h------------------------------------------hHHHHHhhhhhhceeehhcccCccc-----hHH Confidence 0 0 0000000000011235667777653 357 Q ss_pred HHHHHHHHHhhcCcEEEEEccCccccccccccCCHHHHHHHhhhhccccccccccCcceEEEEehhhcccccccCceeEe Q lcl|NC_012740. 395 VQKHAVSIGDERQDCLVMVSPPRSTVVNIPVTTAIDNLIAWREGNSNYSDNNMNINTTYAVIDGNYKYQYDKYNDVNRWV 474 (667) Q Consensus 395 v~~~~~~~~~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~v~d~~~~~~~~~ 474 (667) ++.+|+++|++++ +++++|.| ++.+.+++.+||+. ++|+|+++||||++++|+.++..+++ T Consensus 142 v~~~l~~~~~~~~-~~~~~d~~--------~~~~~~~a~~~~~~----------~~s~~~~~~~p~~~~~d~~~~~~~~~ 202 (392) T protein:vir:18 142 VATALASVCISLR-AFGYVSAW--------GCKTISEAMAYREN----------FSQRELMVIWPDFLAWDTTANATATA 202 (392) T ss_pred HHHHHHHHHhhcC-cEEEEecC--------CCCCHHHHHHHHhh----------ccCceEEEEeCceeeecccCCceEEe Confidence 8999999999876 67788875 35688999999973 67899999999999999999999999 Q ss_pred chHHHHHHHHHHhhhcCCceeeecceeccceeccc-cc--cccCChhhhhhhhhcCceEEEEecCCeEEEEcceecCCCc Q lcl|NC_012740. 475 PLAADIAGLCARTDAVSQPWMSPAGYNRGQIMNVV-KL--AIEPRKAHRDRLYQAAINPVIGAGGEGFILMGDKTATTVP 551 (667) Q Consensus 475 p~sg~vAg~~a~~d~~~g~~~span~~~~~i~g~~-~~--~~~~~~~e~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~~~~ 551 (667) |||+++||+++++|.++|||+||+|+++.+|.++. .+ +..+++.|++.||++|||+++ +++|+++||+||+++++ T Consensus 203 p~s~~~AG~~a~~d~~~g~~~spaN~~l~gi~~~~~~~~~~~~~~~~~~~~Ln~~gI~t~~--~~~G~~~wG~rT~~~d~ 280 (392) T protein:vir:18 203 YATARALGLRAYIDQTIGWHKTLSNVGVQGVTGISASVFWDLQASGTDADLLNEAGVTTLV--RKDGFRFWGNRTCSDDP 280 (392) T ss_pred chHHHHHHHHHhhhccCCceEccCCceeeceeecceecccccCCCcchhhhhhhcCceEEE--cCCCEEEEcccccCCCc Confidence 99999999999999999999999999987776653 22 334567899999999999995 57899999999998875 Q ss_pred ccceeeehhhhhHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhcCCeeeeEEEEcccCCCHHHhhCCeEEE Q lcl|NC_012740. 552 SPFDRINVRRLFNMLKKNIGDSSKYKLFENNDNFTRASFRMEVSQYLSTIRSLGGIYDFRVQCDTTNNTPDVIDRNEFVA 631 (667) Q Consensus 552 ~~~~~i~vrR~~~~i~~~l~~~~~~~v~epn~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~ 631 (667) +||||++||+++||+++|++.++|+|||||++.+|.+|++++++||++||++|+|.||+|.||+++||+++|++|+|++ T Consensus 281 -~~~~i~~rR~~~~i~~~i~~~~~~~v~e~n~~~~~~~i~~~i~~~L~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~ 359 (392) T protein:vir:18 281 -LFLFENYTRTAQVLADTMAEAHMWAVDKPITASLIRDIVDGINAKFRELKSNGYIVDGECWFDEESNDKETLKAGKLYI 359 (392) T ss_pred -ccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhcCcccceEEEEecCCCCHHHhhCCeEEE Confidence 8999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEEecCCceEEEEEEEEeecCeeHHHHHHHHhc Q lcl|NC_012740. 632 SMFIKPAKSINYIMLNFTAVATGADFDEIIGPANQ 666 (667) Q Consensus 632 ~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~~~~~~ 666 (667) +|+++|++|+|||+|+++..... ++++.+.|++ T Consensus 360 ~v~~~p~~p~e~I~~~~~~~~~~--~~~~~~~~~~ 392 (392) T protein:vir:18 360 DYDYTPVPPLESLTLRQRITDKY--LVNLAESVNS 392 (392) T ss_pred EEEEEecCCcceEEEEEEEchHH--HHHHHHHhcC Confidence 99999999999999999988777 7888888888 No 30 >protein:vir:100323 Length: 393 # NCBI annotation: tail sheath protein FI # Family: family:all:115 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655492;genbank:gi:109289960;genbank:GeneID:4157366 Probab=100.00 E-value=1.6e-92 Score=523.84 Aligned_cols=379 Identities=15% Similarity=0.104 Sum_probs=309.2 Q ss_pred CceecCceEEEEecCCCccccc-CCCceEEEeeccCC-----CCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHHcC Q lcl|NC_012740. 1 MTLLSPGFETKETTLSTTIVQS-ATGRAALVGKFQWG-----PAFQIVQVTNEVELVNKFGQPDNNTADYFMSGANFLQY 74 (667) Q Consensus 1 ~~~~~PGVyvee~~~~~~~~~~-~ts~~afvG~~~~G-----p~~~p~~i~s~~e~~~~FG~~~~~~~~~~~v~~~f~ng 74 (667) |++.+|||||+|++.+++++.. +|++++|||+++++ |+|+|++|+|+.||.+.||. ..++.+++..+|.|+ T Consensus 4 ~~~~~~GV~v~e~~~g~~~i~~~~tav~~~vgta~~~~~~~~pln~pv~i~s~~~~~~~~g~---~g~L~~al~~~~~~~ 80 (393) T protein:vir:10 4 LDTYLHGVEVVEVNAGGVTISTAATSVIGVVCTGDQADAETFPLNTPVLITNPLNYLEKAGS---TGTLRRTLNSIGSIV 80 (393) T ss_pred CCccCCCeEEEEcCCCcceecccCcceeEEEeeccCcCcccccCccceEecchHHHHHhhCC---ccchhhhhhhhhccc Confidence 5666799999999999987765 69999999999887 99999999999999999995 567889999999999 Q ss_pred CCeEEEEEcCCcccccccccccccccceeeeccccccccceeeEeeeccceeccccceeecccccceeeeeccccccccc Q lcl|NC_012740. 75 GNDLRVVRVLNKEKAKNATALAGNVEFEITNEGSNYEVGDTIKIKHNRQDIETAGKVTKVDGDGKVKGVFIPTGKIIAHA 154 (667) Q Consensus 75 G~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~a 154 (667) |..||+||+...+...... . T Consensus 81 ~~~~~vv~v~~~~~~~~t~----------------------~-------------------------------------- 100 (393) T protein:vir:10 81 KTPTVIVRVAESDDSDTLT----------------------A-------------------------------------- 100 (393) T ss_pred CceEEEeecccCccccccc----------------------c-------------------------------------- Confidence 9999999985332110000 0 Q ss_pred ccccccccccccceEEEEEeecccccceeeeceeeeceeeeeeccccchhhhccccccccccccceeeeeecccccccee Q lcl|NC_012740. 155 KAIGVYPELDGGWTAEFTSSSGNGSAALSVTKIVTDSGLLLTDLETSRANITNQDFLTKLKKYDMPAVSAIYAGEIGNSL 234 (667) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~A~~~G~~gn~i 234 (667) ..++.. . ..... T Consensus 101 ~iig~~--------------------------------------~-------------~~~~t----------------- 112 (393) T protein:vir:10 101 NIVGTQ--------------------------------------E-------------NGKFT----------------- 112 (393) T ss_pred cccccc--------------------------------------c-------------cchhh----------------- Confidence 000000 0 00000 Q ss_pred EEEEeecccccccceeeeeeeecccccccceeeeeccccccccceeeeeccceeeeeEeeeccCCccccccccccchhhh Q lcl|NC_012740. 235 EVEILARSSFSGAVAPELTMYPFGGTRAAARNLIPYAPQNDNQYAFIVRRDGVVVESYVLSTLKGDKDVYGNSIYMDDFF 314 (667) Q Consensus 235 ~v~i~~~a~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~v~e~~~~s~~~~~~~~~~~~~~~~~~~ 314 (667) + ...+ T Consensus 113 ----------------------------------------------------------------g-----------l~al 117 (393) T protein:vir:10 113 ----------------------------------------------------------------G-----------IKAL 117 (393) T ss_pred ----------------------------------------------------------------H-----------HHHH Confidence 0 0000 Q ss_pred cccccceEEEecccccCcccceEEecCCccccccccccccccccccchhHHHHHHhhhcccccccEEecCcCCcchhhHH Q lcl|NC_012740. 315 ARGSSQYIYATAQGWVDGFSGIISLAGGVSANEASTGDRGNDPFIGAMMQGWDLFAERESIHVNLLIAGACAGEGDAFST 394 (667) Q Consensus 315 ~~~~s~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~ 394 (667) .... ......++++++|++. ..+ T Consensus 118 ~~~~----------------------------------------------------~~~~~~p~li~apg~~-----~~~ 140 (393) T protein:vir:10 118 LTAQ----------------------------------------------------STVFVKPKLLCVPQHD-----NQA 140 (393) T ss_pred Hhhh----------------------------------------------------hhcceeeeeeeecccc-----chH Confidence 0000 0000123345555543 235 Q ss_pred HHHHHHHHHhhcCcEEEEEccCccccccccccCCHHHHHHHhhhhccccccccccCcceEEEEehhhcccccccCceeEe Q lcl|NC_012740. 395 VQKHAVSIGDERQDCLVMVSPPRSTVVNIPVTTAIDNLIAWREGNSNYSDNNMNINTTYAVIDGNYKYQYDKYNDVNRWV 474 (667) Q Consensus 395 v~~~~~~~~~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~v~d~~~~~~~~~ 474 (667) ++.+|+++|++++++++++|+| .++.++++.|++. ++|+|+++||||++++|+.++..+++ T Consensus 141 ~~~al~~~~~~~~~~~~v~d~~---------~~t~~~ai~~~~~----------~~s~~~~~~~P~~~~~d~~~~~~~~~ 201 (393) T protein:vir:10 141 VATELLSVAKKLNAFAFISDNG---------ATTKEQAYTYRQN----------FSQREGMMIFGDWKSYNTDKKAYDTD 201 (393) T ss_pred HHHHHHHHhhccCcEEEEEcCC---------CCCHHHHHHHhhh----------cCCceEEEEecccccccccCCceeEe Confidence 7889999999999999988876 4678899999974 67899999999999999999999999 Q ss_pred chHHHHHHHHHHhhhcCCceeeecceeccceeccc---cccccCChhhhhhhhhcCceEEEEecCCeEEEEcceecCCCc Q lcl|NC_012740. 475 PLAADIAGLCARTDAVSQPWMSPAGYNRGQIMNVV---KLAIEPRKAHRDRLYQAAINPVIGAGGEGFILMGDKTATTVP 551 (667) Q Consensus 475 p~sg~vAg~~a~~d~~~g~~~span~~~~~i~g~~---~~~~~~~~~e~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~~~~ 551 (667) |||+++||++||+|.++|||+||+|+.+.|+.++. .....+++.|++.||++|||||. +++|+++||+||++.++ T Consensus 202 p~s~~~Ag~~a~~d~~~G~~~spaN~~l~gi~~~~~~~~~~~~~~~~~~~~Ln~~gI~t~~--~~~G~~~wG~rT~s~d~ 279 (393) T protein:vir:10 202 YAVARACALQAYIDKTVGWHKNISNVELDGVTGITKAVEFDINESSTEANYLNEKGITICL--NHNGFRYWGSRTLATDT 279 (393) T ss_pred ehhHHHHHHHHHhhcCCCcEEccCCceeeceeecceecccccCCCcchhHhHhhcCceEEE--cCCCEEEEcccccCCCc Confidence 99999999999999999999999999987777753 23344567899999999999994 56899999999998875 Q ss_pred ccceeeehhhhhHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhcC--CeeeeEEEEcccCCCHHHhhCCeE Q lcl|NC_012740. 552 SPFDRINVRRLFNMLKKNIGDSSKYKLFENNDNFTRASFRMEVSQYLSTIRSLG--GIYDFRVQCDTTNNTPDVIDRNEF 629 (667) Q Consensus 552 ~~~~~i~vrR~~~~i~~~l~~~~~~~v~epn~~~~~~~i~~~i~~~l~~l~~~g--al~g~~v~~d~~~nt~~~i~~G~~ 629 (667) +|+||++|||+++|+++|++.++|++||||++.+|++|+++++.||++||+.| +|.||+|.||++ ||++||++|+| T Consensus 280 -~~~~i~vrR~~~~i~~~i~~~~~~~v~e~~~~~~~~~i~~~i~~~L~~l~~~g~~al~g~~v~~~~~-nt~~~i~~G~~ 357 (393) T protein:vir:10 280 -RWAFQQSVRTAQIIKETIGAGLAWAVDMPLTPLRVKTMLEAINNKLRSWASGDDPRILGARVWVAEE-ITADIIKSGKF 357 (393) T ss_pred -ccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhccccccccceEEecCC-CCHHHhhCCEE Confidence 89999999999999999999999999999999999999999999999999866 899999999885 88899999999 Q ss_pred EEEEEEEecCCceEEEEEEEEeecCeeHHHHHHHHhcC Q lcl|NC_012740. 630 VASMFIKPAKSINYIMLNFTAVATGADFDEIIGPANQA 667 (667) Q Consensus 630 ~~~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~~~~~~~ 667 (667) +++|+++|++|+|||+|+++..... ++++++.|+|- T Consensus 358 ~~~i~~~p~~p~e~I~~~~~~~~~~--~~~l~~~v~a~ 393 (393) T protein:vir:10 358 VIKYDYHWIPSLESLGLEQRVNDEY--VVDLVNTLKAL 393 (393) T ss_pred EEEEEEEecCCcceEEEEEEEchHH--HHHHHHHHhcC Confidence 9999999999999999999988777 99999999999 No 31 >protein:vir:96740 Length: 388 # NCBI annotation: phage tail sheath protein FI-like # Family: family:all:115 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039839;genbank:gi:126010913;genbank:GeneID:5076250 Probab=100.00 E-value=1e-90 Score=513.97 Aligned_cols=375 Identities=13% Similarity=0.111 Sum_probs=308.0 Q ss_pred CceecCceEEEEecCCCccccc-CCCceEEEeeccCC----CCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHHcCC Q lcl|NC_012740. 1 MTLLSPGFETKETTLSTTIVQS-ATGRAALVGKFQWG----PAFQIVQVTNEVELVNKFGQPDNNTADYFMSGANFLQYG 75 (667) Q Consensus 1 ~~~~~PGVyvee~~~~~~~~~~-~ts~~afvG~~~~G----p~~~p~~i~s~~e~~~~FG~~~~~~~~~~~v~~~f~ngG 75 (667) |+.-+|||||+|++.++++++. .|++++|||+++.+ |.++|++|.++.++...||.......+..++..+|.++| T Consensus 4 ~~~~~hGv~v~ev~~g~~~i~~~~tavi~~Vgta~~ad~~~p~~~~~~i~~~~d~~~~~~~~~~~gtl~~al~~~~~~~~ 83 (388) T protein:vir:96 4 IDQFEHNGISIETHEPPPPMGPPGDNVVAWVVTAPDKHADVAFSVPFRVANTADAQYLDSTGNELGTGWHAASETLKKTS 83 (388) T ss_pred CCCCCCceEEEEcCCCcccccccCcceeEEEEecCCCccccccccceeeecchhhhhhhccccccccchhhhHhhhccCC Confidence 7676789999999999988875 69999999999764 899999999999999999988888889999999999999 Q ss_pred CeEEEEEcCCcccccccccccccccceeeeccccccccceeeEeeeccceeccccceeecccccceeeeecccccccccc Q lcl|NC_012740. 76 NDLRVVRVLNKEKAKNATALAGNVEFEITNEGSNYEVGDTIKIKHNRQDIETAGKVTKVDGDGKVKGVFIPTGKIIAHAK 155 (667) Q Consensus 76 ~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~a~ 155 (667) ..|+++|+..+.+... +. . T Consensus 84 ~~~~vv~v~~g~~~~a-------------------t~-----------------------------------a------- 102 (388) T protein:vir:96 84 VPQYFIVVPEGADDAA-------------------TM-----------------------------------A------- 102 (388) T ss_pred ceEEEEEecccccccc-------------------cc-----------------------------------c------- Confidence 9999999843211000 00 0 Q ss_pred cccccccccccceEEEEEeecccccceeeeceeeeceeeeeeccccchhhhccccccccccccceeeeeeccccccceeE Q lcl|NC_012740. 156 AIGVYPELDGGWTAEFTSSSGNGSAALSVTKIVTDSGLLLTDLETSRANITNQDFLTKLKKYDMPAVSAIYAGEIGNSLE 235 (667) Q Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~A~~~G~~gn~i~ 235 (667) .+ .|. T Consensus 103 --------------~i-------------------------------------------------------ig~------ 107 (388) T protein:vir:96 103 --------------NI-------------------------------------------------------IGG------ 107 (388) T ss_pred --------------ee-------------------------------------------------------eee------ Confidence 00 000 Q ss_pred EEEeecccccccceeeeeeeecccccccceeeeeccccccccceeeeeccceeeeeEeeeccCCccccccccccchhhhc Q lcl|NC_012740. 236 VEILARSSFSGAVAPELTMYPFGGTRAAARNLIPYAPQNDNQYAFIVRRDGVVVESYVLSTLKGDKDVYGNSIYMDDFFA 315 (667) Q Consensus 236 v~i~~~a~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~v~e~~~~s~~~~~~~~~~~~~~~~~~~~ 315 (667) . +. T Consensus 108 --~------------------------------------------------------------~~--------------- 110 (388) T protein:vir:96 108 --I------------------------------------------------------------DP--------------- 110 (388) T ss_pred --c------------------------------------------------------------cc--------------- Confidence 0 00 Q ss_pred ccccceEEEecccccCcccceEEecCCccccccccccccccccccchhHHHHHHhhhcccccccEEecCcCCcchhhHHH Q lcl|NC_012740. 316 RGSSQYIYATAQGWVDGFSGIISLAGGVSANEASTGDRGNDPFIGAMMQGWDLFAERESIHVNLLIAGACAGEGDAFSTV 395 (667) Q Consensus 316 ~~~s~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~v 395 (667) .....+++.++...+ ..+++|++|++. +..+| T Consensus 111 -------------------------------------------~tg~~~gl~al~~~~-~~p~il~aPg~s----~~~~v 142 (388) T protein:vir:96 111 -------------------------------------------TTGRRTGIAALTECT-ERPTLIGAPGFS----QNKAV 142 (388) T ss_pred -------------------------------------------ccchhhHHHHhhhcc-cceeEEEeeccc----cchHH Confidence 000001111111111 124677777754 44679 Q ss_pred HHHHHHHHhhcCcEEEEEccCccccccccccCCHHHHHHHhhhhccccccccccCcceEEEEehhhcccccccCceeEec Q lcl|NC_012740. 396 QKHAVSIGDERQDCLVMVSPPRSTVVNIPVTTAIDNLIAWREGNSNYSDNNMNINTTYAVIDGNYKYQYDKYNDVNRWVP 475 (667) Q Consensus 396 ~~~~~~~~~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~v~d~~~~~~~~~p 475 (667) +.+|+++|++++ ||+++|+|. .+.+++.+|+.... ..+++|.|+++||||++++|+.++..+++| T Consensus 143 ~~al~~~~~~~~-~~~i~D~p~---------~~~~~~~~~~~~~~-----~~~~~s~~~~~~~P~~~~~d~~~~~~~~~p 207 (388) T protein:vir:96 143 IDALASMAKRLK-CRAVIDGPS---------GSTQDAIDLSGLLG-----GEGTGHDRVYMVDPMPAIYSRKAQGNIYVP 207 (388) T ss_pred HHHHHHHHhhcC-cEEEEeccC---------CchhHHHHHHhhhh-----ccCcCcceEEEEeCceeeecccCCceeeec Confidence 999999999875 899999984 34455666665432 346889999999999999999999999999 Q ss_pred hHHHHHHHHHHhhhcCCceeeecceeccceeccc---cccccCChhhhhhhhhcCceEEEEecCCeEEEEcceecCCCcc Q lcl|NC_012740. 476 LAADIAGLCARTDAVSQPWMSPAGYNRGQIMNVV---KLAIEPRKAHRDRLYQAAINPVIGAGGEGFILMGDKTATTVPS 552 (667) Q Consensus 476 ~sg~vAg~~a~~d~~~g~~~span~~~~~i~g~~---~~~~~~~~~e~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~~~~~ 552 (667) ||+++||++||+| +||||+|+++ ++.|+. ..+..+++.|++.||++|||||++|+++|+++||+||++ T Consensus 208 ~s~~~AG~~a~~D----~~~spaN~~i-~i~g~~~~~~~~~~~~~~~~~~Ln~~gI~~i~~~~~~G~~~wG~rT~~---- 278 (388) T protein:vir:96 208 PSTIAMGAVAAVK----PWESPGNQGV-LIQDVARVIDYNILDKSTEGDLLNRNGVSYFARTSMGGFSLIGNRTVT---- 278 (388) T ss_pred hHHHHHHHHHhhc----CcccccCeeE-EeeeecccccccccCChhhHHhhhhcCceEEEEecCCcEEEEcccccC---- Confidence 9999999999999 5999999987 466653 344566778999999999999999999999999999974 Q ss_pred cceeeehhhhhHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhcCCeeeeEEEEcccCCCHHHhhCCeEEEE Q lcl|NC_012740. 553 PFDRINVRRLFNMLKKNIGDSSKYKLFENNDNFTRASFRMEVSQYLSTIRSLGGIYDFRVQCDTTNNTPDVIDRNEFVAS 632 (667) Q Consensus 553 ~~~~i~vrR~~~~i~~~l~~~~~~~v~epn~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~~ 632 (667) |+||+|||+++||+++|++.++|+|||||++.+|.+|+++|+.||++||++|+|.||++.||+++||+++|++|+|+++ T Consensus 279 -~~~i~vrR~~~~i~~si~~~~~~~v~epn~~~~~~~i~~~i~~fL~~l~~~Gal~g~~~~~d~~~nt~~~i~~G~~~~~ 357 (388) T protein:vir:96 279 -GKFISFVGLEDAIARKLEAASQRAMSKQLTKSFMEQEIKKINLFMQDLVAAEIIPGGEVYLHPTLNTVERYKNGSWYIV 357 (388) T ss_pred -CcceeehhhHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHhhCCEEEEE Confidence 9999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEecCCceEEEEEEEEeecCee--HHHHH Q lcl|NC_012740. 633 MFIKPAKSINYIMLNFTAVATGAD--FDEII 661 (667) Q Consensus 633 i~~~p~~p~e~i~~~~~~~~~~~~--~~e~~ 661 (667) |+++|++|+|||+|+++.....++ |++++ T Consensus 358 i~~~p~~pae~I~~~~~~~~~~~~~~~~~~~ 388 (388) T protein:vir:96 358 IDYGRYSPNEHMIFHLNAVDRIVEEFIEEVL 388 (388) T ss_pred EEEEecCCcceEEEEEEEchHHHHHHHHHhC Confidence 999999999999999999888766 66666 No 32 >protein:vir:10336 Length: 386 # NCBI annotation: ORF39 # Family: family:all:115 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758931;genbank:gi:27311206;genbank:GeneID:956116 Probab=100.00 E-value=2.6e-88 Score=500.74 Aligned_cols=376 Identities=18% Similarity=0.190 Sum_probs=296.6 Q ss_pred CceecCceEEEEecCCCccccc-CCCceEEEeeccCC-----CCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHHcC Q lcl|NC_012740. 1 MTLLSPGFETKETTLSTTIVQS-ATGRAALVGKFQWG-----PAFQIVQVTNEVELVNKFGQPDNNTADYFMSGANFLQY 74 (667) Q Consensus 1 ~~~~~PGVyvee~~~~~~~~~~-~ts~~afvG~~~~G-----p~~~p~~i~s~~e~~~~FG~~~~~~~~~~~v~~~f~ng 74 (667) .+|.+|||||+|++.+++++.. +|++.+|||+++.+ |+++|++++|+.++...||+ ...+..++.++|.|| T Consensus 2 ~~~~~~Gv~v~ev~~~~~~i~~v~tav~~~vg~a~~a~~~~~~~~~pv~i~s~~~~~~~~g~---~~tl~~a~~~~~~~g 78 (386) T protein:vir:10 2 AEQYLHGAEVVEIDNGARPIRTAQSGVIGLVGTAPDADATAFPLNTPVLIAGSRREAAKLGA---GGTLPQAIDGIFDQT 78 (386) T ss_pred ccccCCCeEEEEcCCCcccccccCcceeEEEEecCCCCCcccccccceEecchHHHHhhcCC---CcchhHHHHHHhccC Confidence 4588999999999999988775 69999999998764 89999999999999999986 456788999999999 Q ss_pred CCeEEEEEcCCcccccccccccccccceeeeccccccccceeeEeeeccceeccccceeecccccceeeeeccccccccc Q lcl|NC_012740. 75 GNDLRVVRVLNKEKAKNATALAGNVEFEITNEGSNYEVGDTIKIKHNRQDIETAGKVTKVDGDGKVKGVFIPTGKIIAHA 154 (667) Q Consensus 75 G~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~a 154 (667) |+.||++++....+..... . T Consensus 79 g~~~~vv~~~~~~~~~~t~------------------------------------------------------------~ 98 (386) T protein:vir:10 79 GAVVVVIRVDEGVDSAATQ------------------------------------------------------------S 98 (386) T ss_pred ceeEEEeeccccccccccc------------------------------------------------------------h Confidence 9999999974332110000 0 Q ss_pred ccccccccccccceEEEEEeecccccceeeeceeeeceeeeeeccccchhhhccccccccccccceeeeeecccccccee Q lcl|NC_012740. 155 KAIGVYPELDGGWTAEFTSSSGNGSAALSVTKIVTDSGLLLTDLETSRANITNQDFLTKLKKYDMPAVSAIYAGEIGNSL 234 (667) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~A~~~G~~gn~i 234 (667) ..++... .. .....| T Consensus 99 ~~ig~~~--------------------------------------------------~~---------t~~~tg------ 113 (386) T protein:vir:10 99 NVIGKVD--------------------------------------------------AD---------TEQYTG------ 113 (386) T ss_pred hhhcccc--------------------------------------------------cc---------cchhhh------ Confidence 0000000 00 000000 Q ss_pred EEEEeecccccccceeeeeeeecccccccceeeeeccccccccceeeeeccceeeeeEeeeccCCccccccccccchhhh Q lcl|NC_012740. 235 EVEILARSSFSGAVAPELTMYPFGGTRAAARNLIPYAPQNDNQYAFIVRRDGVVVESYVLSTLKGDKDVYGNSIYMDDFF 314 (667) Q Consensus 235 ~v~i~~~a~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~v~e~~~~s~~~~~~~~~~~~~~~~~~~ 314 (667) +. .+ T Consensus 114 ---------------------------------------------l~-------------------------------~l 117 (386) T protein:vir:10 114 ---------------------------------------------IL-------------------------------AL 117 (386) T ss_pred ---------------------------------------------hH-------------------------------Hh Confidence 00 00 Q ss_pred cccccceEEEecccccCcccceEEecCCccccccccccccccccccchhHHHHHHhhhcccccccEEecCcCCcchhhHH Q lcl|NC_012740. 315 ARGSSQYIYATAQGWVDGFSGIISLAGGVSANEASTGDRGNDPFIGAMMQGWDLFAERESIHVNLLIAGACAGEGDAFST 394 (667) Q Consensus 315 ~~~~s~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~ 394 (667) ......+. ..++++.+|+. .+..+ T Consensus 118 ~~~~~~~~----------------------------------------------------~~p~i~~ap~~----~~~~~ 141 (386) T protein:vir:10 118 LSAENTVK----------------------------------------------------VQPRILIAPGF----SNQKA 141 (386) T ss_pred hhhccccc----------------------------------------------------ccccccccccc----cchhH Confidence 00000000 00011112221 23446 Q ss_pred HHHHHHHHHhhcCcEEEEEccCccccccccccCCHHHHHHHhhhhccccccccccCcceEEEEehhhcccccccCceeEe Q lcl|NC_012740. 395 VQKHAVSIGDERQDCLVMVSPPRSTVVNIPVTTAIDNLIAWREGNSNYSDNNMNINTTYAVIDGNYKYQYDKYNDVNRWV 474 (667) Q Consensus 395 v~~~~~~~~~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~v~d~~~~~~~~~ 474 (667) +..+|..+|++++ .+++.+++ .++.+++.+|++. ++|+|+++||||++++|+.++..+++ T Consensus 142 v~~~l~~~~~~~~-~~~~~~~~---------~~~~~~a~~~~~~----------~~s~~~~~~~p~~~v~~~~~~~~~~~ 201 (386) T protein:vir:10 142 VADQLVSVADTAA-WLCHSGWS---------NTTDAAAITYREL----------FGSRRCEVVDPWYKVWDVETSAHIIQ 201 (386) T ss_pred HHHHHHHhhcceE-EEEEeCCC---------CCchHHHHHhhhc----------ccccceEEecCceeeeccccccceee Confidence 7788888888765 44555544 4577788888863 67899999999999999999999999 Q ss_pred chHHHHHHHHHHhhhcCCceeeecceeccceeccc---cccccCChhhhhhhhhcCceEEEEecCCeEEEEcceecCCCc Q lcl|NC_012740. 475 PLAADIAGLCARTDAVSQPWMSPAGYNRGQIMNVV---KLAIEPRKAHRDRLYQAAINPVIGAGGEGFILMGDKTATTVP 551 (667) Q Consensus 475 p~sg~vAg~~a~~d~~~g~~~span~~~~~i~g~~---~~~~~~~~~e~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~~~~ 551 (667) |||+++||++||+|.++||||||+|+++.++.|+. ..+...++.|++.||++||+++ ++++|+++||+||++.+ T Consensus 202 p~s~~~ag~~a~~D~~~G~~~spaN~~l~gv~~~~~~~~~~~~~~~~~~~~l~~~gi~~~--~~~~G~~~wG~rT~~~d- 278 (386) T protein:vir:10 202 PPSARHAGVMAKVHNTLGFWWSNSNQEILGIDGLCRPVDFKLDDPTCRANLLNAKEVTTT--IQQNGFRVWGDRTCSAD- 278 (386) T ss_pred chHHHHHHHHHHhhhcCCcEEccCCceeecccccceecccccccCcchhhhhhhcCcEEE--EcCCCEEEEcccccCCC- Confidence 99999999999999999999999999988777753 2234456789999999999987 56899999999999876 Q ss_pred ccceeeehhhhhHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhcCCeeeeEEEEcccCCCHHHhhCCeEEE Q lcl|NC_012740. 552 SPFDRINVRRLFNMLKKNIGDSSKYKLFENNDNFTRASFRMEVSQYLSTIRSLGGIYDFRVQCDTTNNTPDVIDRNEFVA 631 (667) Q Consensus 552 ~~~~~i~vrR~~~~i~~~l~~~~~~~v~epn~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~ 631 (667) ++|+||++|||++||+++|++.++|+|||||++.+|.+|++++++||++||++|+|.||+|+||+++||+++|++|+|++ T Consensus 279 ~~~~~i~vrR~~~~i~~~~~~~~~~~v~e~~~~~~~~~i~~~i~~~L~~l~~~g~l~g~~v~~d~~~nt~~~~~~G~~~~ 358 (386) T protein:vir:10 279 SKWAFKNVVITNDMIADSLVRNHLWAVDRNITKTYVEDVTEGVNNYLRHLKNIGAIAGGECWVDPELNSPDQIQQGKVYF 358 (386) T ss_pred cccceeehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEcccCCCHHHhhCCeEEE Confidence 58999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEEecCCceEEEEEEEEeecCeeHHHHH Q lcl|NC_012740. 632 SMFIKPAKSINYIMLNFTAVATGADFDEII 661 (667) Q Consensus 632 ~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~ 661 (667) +|+++|++|+|||+|++++.... |++++ T Consensus 359 ~i~~~p~~p~e~i~~~~~~~~~~--~~~~~ 386 (386) T protein:vir:10 359 DYDFSAYAPAEHITFRSHMVNGY--LTEVV 386 (386) T ss_pred EEEEEecCCceeEEEEEEEehhH--HHhhC Confidence 99999999999999999998877 88888 No 33 >protein:vir:5833 Length: 742 # NCBI annotation: similar to tail sheath protein # Family: family:all:661 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835619;genbank:gi:30044022 Probab=100.00 E-value=4e-77 Score=439.42 Aligned_cols=561 Identities=20% Similarity=0.187 Sum_probs=282.7 Q ss_pred CceecCceEEEEecCCCcccc----cCCCceEEEe-eccCCCC-CccEEec--CHHHHHHHcCCcCccchh--HHHHHHH Q lcl|NC_012740. 1 MTLLSPGFETKETTLSTTIVQ----SATGRAALVG-KFQWGPA-FQIVQVT--NEVELVNKFGQPDNNTAD--YFMSGAN 70 (667) Q Consensus 1 ~~~~~PGVyvee~~~~~~~~~----~~ts~~afvG-~~~~Gp~-~~p~~i~--s~~e~~~~FG~~~~~~~~--~~~v~~~ 70 (667) +++-+|.|-+ ++...|.. -.|.-.-|-- .-+.||. |.-+.++ -|- |...||-|...-.+ .+...+. T Consensus 147 ~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~ 222 (742) T protein:vir:58 147 INLNAPSVTL---PSNIVPLFFYYEPYTGSITLQSSVNYSGLTLNYTVSKATTPWV-YFAEYGTPTSSLTLYKGFYLEGI 222 (742) T ss_pred EEeeceeEee---ccccceeeeEeccccceEEEeeecccCCCcccceeeeeecCcc-cccccCCCccceeeeeccccccc Confidence 3333333322 21111110 1111111111 1244664 3333332 233 33556766654333 3334444 Q ss_pred HHcCCCeEEEEEcCCcccccccc----c-ccccccceeeeccccccccceeeEeeeccceeccccceeecccccceeeee Q lcl|NC_012740. 71 FLQYGNDLRVVRVLNKEKAKNAT----A-LAGNVEFEITNEGSNYEVGDTIKIKHNRQDIETAGKVTKVDGDGKVKGVFI 145 (667) Q Consensus 71 f~ngG~~~~vvRv~~~~~~~~a~----~-~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~ 145 (667) =+||=.+-+||.+-+........ . ....+.+......+. -.+.++++..+..-. .+.-..+.-++..-.++ T Consensus 223 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~ 298 (742) T protein:vir:58 223 DLNSFNKQFVVSIENITVNREKGQVLYPSFDVVVHFRDIRGVSA--NTEYIRFRQVNLNPE--SPNYIERVIGNMTFEFD 298 (742) T ss_pred ccCcccceeeEEEeeeeecccCCceeccceeEEEEEeeccCCCC--CccceeeeeeecCCC--Ccceeeecccceeeeec Confidence 45777888888764321110000 0 000000000000000 011233332222111 11111111111111110 Q ss_pred ccccccccccccccccc---ccccceEEEEEeec---ccccceeee------ceeeeceeeeeeccccchhhhccccccc Q lcl|NC_012740. 146 PTGKIIAHAKAIGVYPE---LDGGWTAEFTSSSG---NGSAALSVT------KIVTDSGLLLTDLETSRANITNQDFLTK 213 (667) Q Consensus 146 ~~~~~~~~a~~~~~~~~---~~~~~~~~~~~~~~---~~~~~~t~~------~~v~~~~~~~~~~~~~~~~~~~~~~~~~ 213 (667) .+.+...+. ..+... .+...+. .......+. +.+.++..................... T Consensus 299 --------~~~~~~g~~~~n~~~~~~-~~~~~~~~~~~~~~s~~~~~~~~~~~~v~d~~~~~~~~~~v~~~~t~~~~~p- 368 (742) T protein:vir:58 299 --------GERIVTGGEYPNQVPFLR-VVVSQDIKQNVAGVEKWVPVGFEGIYSVGDFTVIVNELTNVSIPVTDSAIIP- 368 (742) T ss_pred --------cceeeeccccccccccee-eEeccccCcCccceeEEEeccccccccccceeeeccccccceeeccccccCC- Confidence 110000000 000000 0000000 000000000 011111111000000000000000000 Q ss_pred cccccceeeeeeccccccceeEEEEeecccccccceeeeeeeecccccccceeeeeccccccccceeeeeccceeeeeEe Q lcl|NC_012740. 214 LKKYDMPAVSAIYAGEIGNSLEVEILARSSFSGAVAPELTMYPFGGTRAAARNLIPYAPQNDNQYAFIVRRDGVVVESYV 293 (667) Q Consensus 214 ~~~~~~~~~~A~~~G~~gn~i~v~i~~~a~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~v~e~~~ 293 (667) ......... .+...|.++.+....... +++....... . T Consensus 369 --p~~~~~~e~-v~~ngG~~f~v~s~~~~g----------------------------------~~i~~~~as~-----~ 406 (742) T protein:vir:58 369 --PMRFTRIEQ-ITLSGGASFSVISNQPYG----------------------------------FNIQDSRHSY-----W 406 (742) T ss_pred --cccccccce-eecccCcceEEEEecccC----------------------------------cceeccCcce-----E Confidence 000000000 011222222221111000 0000000000 0 Q ss_pred eeccCCccccccccccchhhhcccccceEEEecccccCcccceEEecCCcccccccc-------ccccccccccchhHHH Q lcl|NC_012740. 294 LSTLKGDKDVYGNSIYMDDFFARGSSQYIYATAQGWVDGFSGIISLAGGVSANEAST-------GDRGNDPFIGAMMQGW 366 (667) Q Consensus 294 ~s~~~~~~~~~~~~~~~~~~~~~~~s~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~-------~~~~~~~~~~~~~~~~ 366 (667) ++...+.+...+.... .+.. .....+.................+.+|.++..... +.....+.......++ T Consensus 407 ~s~ln~~~~V~Gt~aa-~~~~-d~~t~~~v~s~~~alp~~a~sv~laGG~dg~v~v~~~~~D~iG~~~~~d~~~adrTGL 484 (742) T protein:vir:58 407 LSPFKDDELIIGTELV-LPAL-DVSTEFGVSSWEEALPEFSFLMPFQGGSDGYIRVDENEPDTIGRVKITPALLANYERL 484 (742) T ss_pred EeccCCceEEEeehhh-cccc-ccchheeccccccccceeeEEEeecCCccccccccCCCcccccccccccccccchhHH Confidence 0000000000000000 0000 00000000000000011112233444444321110 0011111122234555 Q ss_pred HHHhhhcccccccEEecCcCCcchhhHHHHHHHHHHHhhcCcEE-EEEccCccccccccccCCHHHHHHHhhhhcccccc Q lcl|NC_012740. 367 DLFAERESIHVNLLIAGACAGEGDAFSTVQKHAVSIGDERQDCL-VMVSPPRSTVVNIPVTTAIDNLIAWREGNSNYSDN 445 (667) Q Consensus 367 ~~~~~~~~~~~~~l~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~-ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 445 (667) .++. +..+++||++|++.. ..++.++.++|+..++|+ +++|+|.. ..+.+++.+|++ T Consensus 485 ~ALl--ev~eVtILiAPG~t~-----~~v~aav~A~la~a~~Rl~vL~D~P~~-------~tt~~~A~a~r~-------- 542 (742) T protein:vir:58 485 LPLL--TEDQFDLVLTPYLTF-----ADHAGTVNAFINRAENRFLYLFDIAGD-------DDTENLAISLAG-------- 542 (742) T ss_pred HHhh--hcCCCcEEEEcCCCc-----hHHHHHHHHHHHhhcCCeEEEEecCCC-------CchHHHHHHHHh-------- Confidence 5554 344689999998753 345667777777655544 45566532 244567777775 Q ss_pred ccccCcceEEEEehhhcccccccCceeEechHHHHHHHHHHhhhcCCceeeecceeccceeccccccccCChhhhhhhhh Q lcl|NC_012740. 446 NMNINTTYAVIDGNYKYQYDKYNDVNRWVPLAADIAGLCARTDAVSQPWMSPAGYNRGQIMNVVKLAIEPRKAHRDRLYQ 525 (667) Q Consensus 446 ~~~~~s~~~~~~~p~~~v~d~~~~~~~~~p~sg~vAg~~a~~d~~~g~~~span~~~~~i~g~~~~~~~~~~~e~~~Ln~ 525 (667) .++|+|+++||||+++.| ++..+++|||+++||++||+|.++|+|+||+|+.+ +.+ ..+++.|++.||+ T Consensus 543 --~~nSsraaly~PwVkv~d--~~~~r~vPpSgaIAGL~ARtD~erGvw~SPANrgi--i~~-----~~~s~se~d~LN~ 611 (742) T protein:vir:58 543 --YINSSFATTFFPWVRRLT--NKGMRTVPASLAAYRSIRTTDPETGLAPVGARRGV--VTG-----EPVRQVDWEDLYN 611 (742) T ss_pred --ccCCceEEEEeceeeecc--CCcceeechHHHHHHHHHHhccCCceEecCCccee--eec-----cccchhhHHHHhh Confidence 367999999999999876 46788999999999999999999999999999853 222 3457889999999 Q ss_pred cCceEEEEecCCeEEEEcceecCCCcccceeeehhhhhHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_012740. 526 AAINPVIGAGGEGFILMGDKTATTVPSPFDRINVRRLFNMLKKNIGDSSKYKLFENNDNFTRASFRMEVSQYLSTIRSLG 605 (667) Q Consensus 526 ~gIn~i~~~~~~G~~~wG~rT~~~~~~~~~~i~vrR~~~~i~~~l~~~~~~~v~epn~~~~~~~i~~~i~~~l~~l~~~g 605 (667) +|||||+++ ++|+++||+||+++.+++|+||||||||+||+++|+++++|+||||||+.+|++|+++|++||++||++| T Consensus 612 ~GINtIrsf-G~G~rlWGnRTlassDs~wryInVRRlfd~Ie~SI~~a~q~~VfEPNd~~L~~sIk~sInafL~~L~aqG 690 (742) T protein:vir:58 612 NRINPIVRV-GNDVLLFGQKTMLNVNSALNRINVRRLLIVMRNRISQILSSYLFENNTSENRLRAEALVRQYLESLRLRG 690 (742) T ss_pred CCceEEEEC-CCcEEEEcceecCCCCcccceEeehhhHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCC Confidence 999999987 6899999999997666799999999999999999999999999999999999999999999999999999 Q ss_pred CeeeeEEEEcccCCCHHHhhCCeEEEEEEEEecCCceEEEEEEEEeecCeeHH Q lcl|NC_012740. 606 GIYDFRVQCDTTNNTPDVIDRNEFVASMFIKPAKSINYIMLNFTAVATGADFD 658 (667) Q Consensus 606 al~g~~v~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~ 658 (667) +|.||+|+||+ +||++||++|+|+++|+++|++|||||+|+|.+.+++++|. T Consensus 691 ALlGfrV~lDe-tNTpeDI~~Gklvv~I~vAP~~PAEfI~lrf~it~tga~Fs 742 (742) T protein:vir:58 691 AVTDYEVAIDS-VTTPTDIDNNTLRARVTVQPARSIEYIDITFVITPTGVEIT 742 (742) T ss_pred ceeeeEEEEcC-CCCHHHhhCCEEEEEEEEEccCCcceEEEEEEEEecccccC Confidence 99999999995 68899999999999999999999999999999999999999 No 34 >protein:vir:63742 Length: 562 # NCBI annotation: Tsh # Family: family:all:2449 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547629;genbank:GeneID:3783475 Probab=100.00 E-value=3.4e-68 Score=390.44 Aligned_cols=538 Identities=15% Similarity=0.047 Sum_probs=329.6 Q ss_pred CceecCceEEEEecCCCccccc-CCCceEEEeeccCCCCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHHcCCCeEE Q lcl|NC_012740. 1 MTLLSPGFETKETTLSTTIVQS-ATGRAALVGKFQWGPAFQIVQVTNEVELVNKFGQPDNNTADYFMSGANFLQYGNDLR 79 (667) Q Consensus 1 ~~~~~PGVyvee~~~~~~~~~~-~ts~~afvG~~~~Gp~~~p~~i~s~~e~~~~FG~~~~~~~~~~~v~~~f~ngG~~~~ 79 (667) =-|++|||||||.+++.+++.+ +|++++|||.+++||+++|++|+||+||++.||+-....+..+|+..||.|||++|| T Consensus 9 ~~~~~Pgv~~~~~~s~~~~~~~~~~~~~~~iG~a~~G~~~~~~~~~~~~~~~~~fg~g~l~~~i~~a~~~~~~~g~~~~~ 88 (562) T protein:vir:63 9 KPVSRPHTEISVDTSGIGGSSSGSEKILCLVGSATGGKPNAVYKVRNYSQAKSVFRSGELLDAIERAWNPGEGTGAGDIL 88 (562) T ss_pred CcccCCceEEEEecCCCcccCCCCCceEEEEEEeCCCCCceeEEEccHHHHHHHhcCCchHHHHHHhccccccCCceEEE Confidence 4578999999999999988776 699999999999999999999999999999998866555566677777899999999 Q ss_pred EEEcCCcccccccccccccccceeeeccccccccceeeEeeeccceeccccceeecccccceeeeecccccccccccccc Q lcl|NC_012740. 80 VVRVLNKEKAKNATALAGNVEFEITNEGSNYEVGDTIKIKHNRQDIETAGKVTKVDGDGKVKGVFIPTGKIIAHAKAIGV 159 (667) Q Consensus 80 vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~a~~~~~ 159 (667) +|||.+ ++.++....++.+++... ++|++.+++.......+....+.+.-.......++...+...... T Consensus 89 ~~rv~~---a~~a~~~~~~~~~~a~~~---g~~~n~i~v~~~~~~~~~~~~~~v~~~~~~~~ev~~~~g~V~~i~----- 157 (562) T protein:vir:63 89 AMRVEE---AKEATFEAEGVKVSSTIY---GADANDIQVALEDNTITGTKRLSIVFAKERVNQVYDNLGSIFSIK----- 157 (562) T ss_pred EEEcCC---CccceeEecceeEEEeec---ccCCCeEEEEEecCCCCCCcceEEEecCCCcchhhhhccceeeee----- Confidence 999943 344555555555555444 467888888776665555444333222222222221111110000 Q ss_pred cccccccceEEEEEeeccccccee-eeceeeeceeeeeeccccchhhhccccccccccccceeeeeeccccccceeEEEE Q lcl|NC_012740. 160 YPELDGGWTAEFTSSSGNGSAALS-VTKIVTDSGLLLTDLETSRANITNQDFLTKLKKYDMPAVSAIYAGEIGNSLEVEI 238 (667) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~t-~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~A~~~G~~gn~i~v~i 238 (667) +..........+.. ........ +...........-......... .......-+..+.+.|.+++..+|.+++.. T Consensus 158 y~g~~~~~~~~v~~--~~~~~~a~~l~~~~g~~~v~~~~L~~g~~~~---~~~l~~~in~~~~~~aky~~~~gn~i~~~~ 232 (562) T protein:vir:63 158 YKGTEASATFTVAV--DPVTFKATKLTLKAGDKTVKEYDLGSGAYAE---TNVLISDINNLPDFEAKFFPIGDKNLTTDN 232 (562) T ss_pred eecccccceEEEEe--cCcceeEEEEEeecCCcceeEEEecCCccch---hHHHHHhhccccceEEEeeccCCceeeeec Confidence 00000000000000 00000000 0000000000000000000000 000000112233456666666666665432 Q ss_pred eecccccccceeeeeeeecccccccceeeeeccccccccceeeeeccceeeeeEeeeccCCccccccccccchhhhcccc Q lcl|NC_012740. 239 LARSSFSGAVAPELTMYPFGGTRAAARNLIPYAPQNDNQYAFIVRRDGVVVESYVLSTLKGDKDVYGNSIYMDDFFARGS 318 (667) Q Consensus 239 ~~~a~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~v~e~~~~s~~~~~~~~~~~~~~~~~~~~~~~ 318 (667) ....... .. .+..... +++. +.. ...... T Consensus 233 ~d~~~~~-------~v-----------------------kt~~~~v-~t~~---------~d~-----------~~~~~~ 261 (562) T protein:vir:63 233 FDAQIDV-------DI-----------------------KTKEAYV-KAVG---------GDI-----------EKQTAY 261 (562) T ss_pred ccccccc-------ch-----------------------hhhhhhh-hhhh---------hhh-----------hhcccc Confidence 2110000 00 0000000 0000 000 000011 Q ss_pred cceEEEecccc-cCcccceEEecCCccccccccccccccccccchhHHHHHHhhhcccccccEEecCcCCcchhhHHHHH Q lcl|NC_012740. 319 SQYIYATAQGW-VDGFSGIISLAGGVSANEASTGDRGNDPFIGAMMQGWDLFAERESIHVNLLIAGACAGEGDAFSTVQK 397 (667) Q Consensus 319 s~~v~~~~~~~-~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~v~~ 397 (667) ..++.+..... .-.......|.||.++... .++.+.++.++.. +..++++. ...++++. T Consensus 262 ~~~v~~~~~~~~~la~~~~~~LtGG~dGt~~-----------~~~~~al~ale~~---~~~~i~~~------t~d~av~~ 321 (562) T protein:vir:63 262 NGYVDFEFDRSKEIANFPLTKLTGGDNGTIP-----------ESWADKFSYFANE---GGYYLVPL------TSKQAVHA 321 (562) T ss_pred cceeeeeeccccceecccceeeecCCCCCch-----------hhHHHHHHHHHhC---CcEEEEec------CCCHHHHH Confidence 12222211110 0001122456777765421 1345566666543 44444432 24567888 Q ss_pred HHHHHHhhcCc----EEEEEccCccccccccccCCHHHHHHHhhhhccccccccccCcceEEEEehhhcccccccCceeE Q lcl|NC_012740. 398 HAVSIGDERQD----CLVMVSPPRSTVVNIPVTTAIDNLIAWREGNSNYSDNNMNINTTYAVIDGNYKYQYDKYNDVNRW 473 (667) Q Consensus 398 ~~~~~~~~~~~----~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~v~d~~~~~~~~ 473 (667) ++.+||+++++ ++++++.+. +.+++++..+.+ .+++.+.++++|+....+. ++..+. T Consensus 322 ~l~a~vkr~~~~g~~~~aVlg~~~--------~~~~~~~~~~a~----------~~n~ervv~v~~~~~~~~~-~~~~~~ 382 (562) T protein:vir:63 322 EALQFVRDCSYNGNPMRVFVGGGI--------GESMEQLFTRAI----------GLQNERAGLIGFSGTVKMD-DGRSLK 382 (562) T ss_pred HHHHHHHHHHhCCCcEEEEecCCC--------CCCHHHHHHHhh----------hcCCCcEEEEecCeeEECC-CCceee Confidence 89999987765 888887653 456677766554 3678999999999876655 455666 Q ss_pred ech---HHHHHHHHHHhhhcCCceeeecceeccceeccccccccCChhhhhhhhhcCceEEEEecCCeEEEEcc-ee--c Q lcl|NC_012740. 474 VPL---AADIAGLCARTDAVSQPWMSPAGYNRGQIMNVVKLAIEPRKAHRDRLYQAAINPVIGAGGEGFILMGD-KT--A 547 (667) Q Consensus 474 ~p~---sg~vAg~~a~~d~~~g~~~span~~~~~i~g~~~~~~~~~~~e~~~Ln~~gIn~i~~~~~~G~~~wG~-rT--~ 547 (667) .|+ ++++||++|+.| +++||.|+.+. ..++...+++.|++.|+++|++++++.++++.++|.. ++ . T Consensus 383 ~~~~~~aa~vAGl~A~~~----~~~SlT~~~i~----~~~v~~~~t~~e~~~li~~Gv~~l~~~~~~~v~~~~iv~~itT 454 (562) T protein:vir:63 383 MPGYMFAAQVAGLTCGLE----IGEAITFKNIA----IETLDTIYEGSQLDQLNESGIITAEFVRNRAVTNFRIVDDVTT 454 (562) T ss_pred echhHHHHHHHHHhhcCc----hhcCccceeec----cccccccCCHHHHHHHHhCCeEEEEEecCCcEEEEEeecccee Confidence 776 788999999877 78899999753 3467778999999999999999999988887777744 22 2 Q ss_pred --CCCcccceeeehhhhhHHHHHHHHHHH-HHHhcCCCCHHHHHHHHHHHHHHHHHHHhcCCeeeeEEEEcccCCCHHHh Q lcl|NC_012740. 548 --TTVPSPFDRINVRRLFNMLKKNIGDSS-KYKLFENNDNFTRASFRMEVSQYLSTIRSLGGIYDFRVQCDTTNNTPDVI 624 (667) Q Consensus 548 --~~~~~~~~~i~vrR~~~~i~~~l~~~~-~~~v~epn~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~nt~~~i 624 (667) ..++..|++|+++|++|+|++.|++.. +||++|||+...|.+|+..|..||.+||+.|+|.+|... +-+.++ T Consensus 455 ~t~~~~~~~~ki~viRv~D~i~~dir~~~~~~yiGk~Nn~~~r~~v~~~i~~~L~~l~~~gaI~~~~~~-----dv~v~~ 529 (562) T protein:vir:63 455 FNDKTDPVKSEIGVGEANDFLVSELKISLDNEYIGTKIIDTSASLVKNFVQSFLDRKKLAKEIQDYSPE-----EVQVVI 529 (562) T ss_pred cCCCCCchhhhhhhhHHHHHHHHHHHHHHHhcCCccccChHHHHHHHHHHHHHHHHHHhCCcccCCCcc-----ceEEEe Confidence 223457999999999999999998776 599999999999999999999999999999999998531 233346 Q ss_pred hCCeEEEEEEEEecCCceEEEEEEEEeecCeeH Q lcl|NC_012740. 625 DRNEFVASMFIKPAKSINYIMLNFTAVATGADF 657 (667) Q Consensus 625 ~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~ 657 (667) +.++++|++.++|+.|+|||++++......++- T Consensus 530 ~~d~~~v~~~v~pv~~mekIy~ti~~~~~~~~~ 562 (562) T protein:vir:63 530 EGDVARISLTVFPIRSMKKIEVSLVYRQQILTA 562 (562) T ss_pred cCCEEEEEEEEEEcccceEEEEEEEEeeeeecC Confidence 678899999999999999999998877666554 No 35 >protein:vir:79798 Length: 717 # NCBI annotation: tail sheath subunit # Family: family:all:12069 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429630;genbank:gi:156564120;genbank:GeneID:5525563 Probab=100.00 E-value=1.6e-66 Score=381.33 Aligned_cols=628 Identities=14% Similarity=0.088 Sum_probs=314.0 Q ss_pred Cc----e-ecCceEEEEec----CCCcccccCCCceEEEeeccCCCCCccEEecCHHHHHHHcCCcCccch------hHH Q lcl|NC_012740. 1 MT----L-LSPGFETKETT----LSTTIVQSATGRAALVGKFQWGPAFQIVQVTNEVELVNKFGQPDNNTA------DYF 65 (667) Q Consensus 1 ~~----~-~~PGVyvee~~----~~~~~~~~~ts~~afvG~~~~Gp~~~p~~i~s~~e~~~~FG~~~~~~~------~~~ 65 (667) |. | -.||+-+.-.| .+..+.+..|-...+.|.+-.|||.+||+|+-.. .++.||+....+- +-- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~ 79 (717) T protein:vir:79 1 MAGFDQYQAIPGHNARFKDGNLNLKSDPNPRETESVVLLGTATDGPVMQPVRVTPET-AYNIFGKVAHENGVYNGATLLP 79 (717) T ss_pred CCchhhhhcCCCceeeeecCceecCCCCCccccceEEEEeeccCCcccCceeeChhH-HHhhhhhhhhhcccccchhhhH Confidence 43 2 36999886554 3445666678888899999999999999999554 4599997655443 333 Q ss_pred HHHHHHHcCCCeEEEEEcCCcccccccccccccccceeeec-----cccccccceee-Eeeeccceeccccceeecc--- Q lcl|NC_012740. 66 MSGANFLQYGNDLRVVRVLNKEKAKNATALAGNVEFEITNE-----GSNYEVGDTIK-IKHNRQDIETAGKVTKVDG--- 136 (667) Q Consensus 66 ~v~~~f~ngG~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~-----~~~~~~g~~~~-~~~~~~~~~~~~~~~~~d~--- 136 (667) +....+..|..+...+|..+.+..+ +.+...-+.....+ +.....|+... .+..+.+.-.+...++..+ T Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (717) T protein:vir:79 80 KFEELWAAGNRDIRLMRTTGVNAVS--SLLGTSYSKNSKEVAEDKLGGAQARGNVAATFTLPNGGIVEATFLLKARGVII 157 (717) T ss_pred HHHHHHhcCCcceEEEEecchhHHH--HHhhcccccchhhHHHHhhcccccccceEEEEEcCCCceeeeeeeeeecceEe Confidence 5556677899999999986532211 11111000000010 11111111111 0111111111111111000 Q ss_pred -cccce-eeee----cccccccccccccccccccccceEEEEEeecccccceee-ecee----ee---ceeeeeeccccc Q lcl|NC_012740. 137 -DGKVK-GVFI----PTGKIIAHAKAIGVYPELDGGWTAEFTSSSGNGSAALSV-TKIV----TD---SGLLLTDLETSR 202 (667) Q Consensus 137 -~~~~~-~~~~----~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~-~~~v----~~---~~~~~~~~~~~~ 202 (667) .+.++ +..+ ..+.....+....+...........++-.+.+....+.. ..-+ ++ .+...++.+... T Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 237 (717) T protein:vir:79 158 PPNNYTLDVGTEEDMKAGTQPTFAQVLLNENVADMESEITVSYEFTYKDAQGETKTSEVLDNNTDKDGKPMIAKGADVTI 237 (717) T ss_pred CCCcceEeccChhhhhcCCCchhhhhhhccchhhccceeEEEEEEEeecccCcchhhhhhcCCCCCCCceeEEeccccee Confidence 00000 0000 000000011111111111111222223333333322211 1001 11 111222211100 Q ss_pred hhhhccccccccccccceeeeeeccccccceeEEEEeecccccccceeeeee--eecccccccc--eeeeeccccccccc Q lcl|NC_012740. 203 ANITNQDFLTKLKKYDMPAVSAIYAGEIGNSLEVEILARSSFSGAVAPELTM--YPFGGTRAAA--RNLIPYAPQNDNQY 278 (667) Q Consensus 203 ~~~~~~~~~~~~~~~~~~~~~A~~~G~~gn~i~v~i~~~a~~~~~~~~~~t~--~~~~~~~~~~--~~~~~~~~~~~~~~ 278 (667) .--..+......-.+++-...|+.....++.+++-.. +....+.+..... +..--.+... ..+..... -+.+ T Consensus 238 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~--~n~~ 313 (717) T protein:vir:79 238 KLEHVALAGLKLYADGIEVVDAKAFTVAGDQLTIHSN--SKMKLGASLEAQYAYNLVEVIQPVIELESIFGGGV--YNDI 313 (717) T ss_pred ehhhhhhhhhHHhhcchhhhhhhheeeecceEEEEec--CCcccchhhHHHHHhhHHHhhccceEEeecccCce--eeee Confidence 0000000011111122222223333333344433221 1111111000000 0000001111 11111111 1223 Q ss_pred eeeeeccc-eeeeeEe-eeccCCcccccc----cccc-----chhhhc-ccccceEEEecccccCcc-------cceEEe Q lcl|NC_012740. 279 AFIVRRDG-VVVESYV-LSTLKGDKDVYG----NSIY-----MDDFFA-RGSSQYIYATAQGWVDGF-------SGIISL 339 (667) Q Consensus 279 ~~~v~~~g-~v~e~~~-~s~~~~~~~~~~----~~~~-----~~~~~~-~~~s~~v~~~~~~~~~~~-------~~~~~~ 339 (667) .+.+...+ .+.-++. ...+.+...... +..| ..+.+. .+.++.+.....+.+.+. .....+ T Consensus 314 ~~~v~~~D~~~~~~~t~~~~~~g~~~~~pl~~ts~dy~~~~~~vdgI~~~~~~~V~~~g~~s~a~a~~~~g~~s~d~a~f 393 (717) T protein:vir:79 314 MRKVESKDGAVTVTITKPESKRGMISEDPLVFKSGDYTNFKMLVDAINNHPFNNVVRARTKPEFEATFTSTLQAAADAKF 393 (717) T ss_pred eeEEecCCceEEEEEecccccCcceeccccccccCceeeeeeeecccccCchhheeeeecccccceeeeecccCchhhcc Confidence 33343333 2222222 111112111111 0000 001110 011111111111111110 011123 Q ss_pred cCCccccccccccc----cccccccchhHHHHHHhhhcccccccEEecCcCCc---chhhHHHHHHHHHHHhhc----Cc Q lcl|NC_012740. 340 AGGVSANEASTGDR----GNDPFIGAMMQGWDLFAERESIHVNLLIAGACAGE---GDAFSTVQKHAVSIGDER----QD 408 (667) Q Consensus 340 ~~g~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~---~~~~~~v~~~~~~~~~~~----~~ 408 (667) .++.++..+..... ++.....+.......+...+..++++++.++...+ ......++.++++||+.+ +. T Consensus 394 ~Gg~dgl~~~~ee~Y~~lGgk~~d~g~lt~~aays~LE~~dVDlVil~ga~adtt~ga~~d~va~alad~caalSal~r~ 473 (717) T protein:vir:79 394 SGGKDELSLDKEEMYKRLGGEKNEEGFVTKQGAYQYLENYEVDYVIPLGVHADTKLIGKYDDFAYQLALACAVMSHYNSV 473 (717) T ss_pred CCCccccccchhhhhccccccccccccccchhhhhhcCcceeEEEEecCccccccccchhhhHHHHHHHHHHHhhhcccc Confidence 44444433322111 11111111111113444445567888888875432 234456788999999754 23 Q ss_pred EEEEEccCccccccccccCCHHHHHHHhhhhccccc-----------------cccccCcceEEEEehhhcccccccCce Q lcl|NC_012740. 409 CLVMVSPPRSTVVNIPVTTAIDNLIAWREGNSNYSD-----------------NNMNINTTYAVIDGNYKYQYDKYNDVN 471 (667) Q Consensus 409 ~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----------------~~~~~~s~~~~~~~p~~~v~d~~~~~~ 471 (667) ++.+++... +.........+|+.....+.. ....+ +.|...++++..+..+..+.. T Consensus 474 ai~VI~l~s------p~D~~~AtVe~~~~kLs~~Aaa~~~~d~~~a~a~~~~~~~idi-s~y~~vv~~~~~iv~~~~~~~ 546 (717) T protein:vir:79 474 TIGIIPTTT------PSDISLAGVEEHVKKLENYANEFYMRDRFGNIIFDADRNKIDL-GQFIEVVAGPDFIVRNTRLGQ 546 (717) T ss_pred ceeeecccc------ccccchhhHHHHHHHHHhhhhhhhhhcchhccccccccccccc-cceeeeeecceeEEEcCCCce Confidence 444443211 111122222333332211110 00111 334444444444444555667 Q ss_pred eEechHHHHHHHHHHhhhcCCceeeecceeccceeccccccccCChhhhhhhhhcCceEEEEecCCeEEEEcceecCCCc Q lcl|NC_012740. 472 RWVPLAADIAGLCARTDAVSQPWMSPAGYNRGQIMNVVKLAIEPRKAHRDRLYQAAINPVIGAGGEGFILMGDKTATTVP 551 (667) Q Consensus 472 ~~~p~sg~vAg~~a~~d~~~g~~~span~~~~~i~g~~~~~~~~~~~e~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~~~~ 551 (667) +++||+|++||+ |..+|+|+||+|+++. |+.++++.+++.|++.||++|||||+.++++|+++||+||+++++ T Consensus 547 ~~~p~AG~vAGl----dA~rGVwkSPANk~I~---GVvgLa~~lT~sE~d~Ln~aGIntIr~~~GrGirVWGaRTtasd~ 619 (717) T protein:vir:79 547 MASTPDASYIGM----VSQLKTQSAPTNKPLP---SVTALRYTYSANQLNRLTKARFATFKYKQDGSIGVVDAPTSAHAG 619 (717) T ss_pred eecCHHHHHHHH----HhcCCcccccccceec---ccccCcccCCHHHHHHHhhCCeEEEEEeCCceEEEEeeeecCCCC Confidence 788887766665 5568999999999754 667789999999999999999999999999999999999999888 Q ss_pred ccceeeehhhhhHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhcCCeeeeEEEEcccCCCHHHhhCCeEEE Q lcl|NC_012740. 552 SPFDRINVRRLFNMLKKNIGDSSKYKLFENNDNFTRASFRMEVSQYLSTIRSLGGIYDFRVQCDTTNNTPDVIDRNEFVA 631 (667) Q Consensus 552 ~~~~~i~vrR~~~~i~~~l~~~~~~~v~epn~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~ 631 (667) ++|+||++||++++|+++|++.++|+|||||++.+|.+|+.+|++||++||++|+|.||++++ +||++++++|+++| T Consensus 620 sdWryInVRRl~D~Ie~sIr~al~~yVgEPNd~~tr~~Ik~sI~afL~~L~r~GAI~Gykvdv---tnT~~di~~G~l~V 696 (717) T protein:vir:79 620 SDYTRLSTARIVKEAVNAVREVADPFIGEPNDTGNRNALTAAVDKRLSKMIENKALLGFDFRL---VVTPQQELLGEGSI 696 (717) T ss_pred cccceeehhhhHHHHHHHHHHHHHHhccccCCHHHHHHHHHHHHHHHHHHHhcCceecceeeE---ecChhHhhCCEEEE Confidence 899999999999999999999999999999999999999999999999999999999999765 89999999999999 Q ss_pred EEEEEecCCceEEEEEEEEee Q lcl|NC_012740. 632 SMFIKPAKSINYIMLNFTAVA 652 (667) Q Consensus 632 ~i~~~p~~p~e~i~~~~~~~~ 652 (667) +|+++|++|+|||+|+++... T Consensus 697 ~I~vaPv~PaEfI~ititITA 717 (717) T protein:vir:79 697 ELSLEAPNELRRLTTIVSLSA 717 (717) T ss_pred EEEEEecCcccEEEEEEEEeC Confidence 999999999999999998776 No 36 >protein:vir:80488 Length: 562 # NCBI annotation: Tsh # Family: family:all:2449 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468473;genbank:gi:157325048;genbank:GeneID:5601446 Probab=100.00 E-value=2.5e-66 Score=380.28 Aligned_cols=538 Identities=15% Similarity=0.056 Sum_probs=334.7 Q ss_pred CceecCceEEEEecCCCccccc-CCCceEEEeeccCCCCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHHcCCCeEE Q lcl|NC_012740. 1 MTLLSPGFETKETTLSTTIVQS-ATGRAALVGKFQWGPAFQIVQVTNEVELVNKFGQPDNNTADYFMSGANFLQYGNDLR 79 (667) Q Consensus 1 ~~~~~PGVyvee~~~~~~~~~~-~ts~~afvG~~~~Gp~~~p~~i~s~~e~~~~FG~~~~~~~~~~~v~~~f~ngG~~~~ 79 (667) =.+..|||||||.+++.+++.+ +|++++|||.+++||+++|++|+||+||++.||+-....+..+|+..||.|||++|| T Consensus 9 ~~~~~pgv~~~~~~s~~~~~~~~~~~~~~~ig~a~~G~~~~~~~~~~~~~~~~~f~~g~l~~~i~~a~~~~~~~g~~~~~ 88 (562) T protein:vir:80 9 KPVSRPHTEISVDTSGIGGSSSGSEKILCLVGSATGGKPNAVYKVRNYSQAKSVFRSGELLDAIERAWNPGEGTGAGDIL 88 (562) T ss_pred CcccCCceEEEEecCCcccCCCCCCceEEEEEEeCCCCcceeEEEccHHHHHHHhcCCChHHHHHHhcccccccCceEEE Confidence 4577999999999999987766 799999999999999999999999999999998866556667788888899999999 Q ss_pred EEEcCCcccccccccccccccceeeeccccccccceeeEeeeccceeccccceeecccccceeeeecccccccccccccc Q lcl|NC_012740. 80 VVRVLNKEKAKNATALAGNVEFEITNEGSNYEVGDTIKIKHNRQDIETAGKVTKVDGDGKVKGVFIPTGKIIAHAKAIGV 159 (667) Q Consensus 80 vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~a~~~~~ 159 (667) +|||.+ +..++....++.++.... ++|++.+++.......+....+.+.-...+...++...+.......... T Consensus 89 ~~rv~~---a~~a~~~~~~~~~~~~~~---g~~~n~i~v~~~~~~~~~~~~~~v~~~~~~~~ev~~~~g~v~~i~y~g~- 161 (562) T protein:vir:80 89 AMRVEE---AKEATFEAEGVKVSSTIY---GADANDIQVALEDNTITGTKRLSIVFAKERVNQVYDNLGSIFSIKYKGT- 161 (562) T ss_pred EEEcCC---CCcceEEecceEEEEeec---ccCCCceEEEEecCCCCCCcceEEEecCCcceEEeeccCceeeeeeccc- Confidence 999954 334454455555544444 4678888887766655554443333233333333332221111100000 Q ss_pred cccccccceEEEEEeecccc-cceeeeceeeeceeeeeeccccchhhhccccccccccccceeeeeeccccccceeEEEE Q lcl|NC_012740. 160 YPELDGGWTAEFTSSSGNGS-AALSVTKIVTDSGLLLTDLETSRANITNQDFLTKLKKYDMPAVSAIYAGEIGNSLEVEI 238 (667) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~-~~~t~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~A~~~G~~gn~i~v~i 238 (667) .......+........ ....+... ......-...... ..........-+..+.+.|.++|..+|.+++.. T Consensus 162 ----~~~a~~~i~~~~~~~~a~~l~~~~g--~~~v~~~~l~~g~---~~~~~~l~~~i~~~~~~tAky~g~~~n~i~~~~ 232 (562) T protein:vir:80 162 ----EASATFTVAVDPVTFKATKLTLKAG--DKTVKEYDLGSGA---YAETNVLISDINNLPDFEAKFFPIGDKNLTTDN 232 (562) T ss_pred ----cccceeEEEecCccceEEEEEEecC--CcceeEEEeCCCc---cchhhhhhhhhccccceEEEecccCCceeeecc Confidence 0000000000000000 00000000 0000000000000 000000001112223456667777666665422 Q ss_pred eecccccccceeeeeeeecccccccceeeeeccccccccceeeeeccceeeeeEeeeccCCccccccccccchhhhcccc Q lcl|NC_012740. 239 LARSSFSGAVAPELTMYPFGGTRAAARNLIPYAPQNDNQYAFIVRRDGVVVESYVLSTLKGDKDVYGNSIYMDDFFARGS 318 (667) Q Consensus 239 ~~~a~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~v~e~~~~s~~~~~~~~~~~~~~~~~~~~~~~ 318 (667) .....- +. ..... ..+....+.. . ..+.. T Consensus 233 ~d~~~~---------------------------------~~--~kt~~-----~~v~~~~~d~---------~--~~~~~ 261 (562) T protein:vir:80 233 FDAQID---------------------------------VD--IKTKE-----AYVKAVGGDI---------E--KQTAY 261 (562) T ss_pred cccchh---------------------------------hh--cccce-----eeeeehhhhh---------h--hcccc Confidence 110000 00 00000 0000000000 0 00111 Q ss_pred cceEEEecccc-cCcccceEEecCCccccccccccccccccccchhHHHHHHhhhcccccccEEecCcCCcchhhHHHHH Q lcl|NC_012740. 319 SQYIYATAQGW-VDGFSGIISLAGGVSANEASTGDRGNDPFIGAMMQGWDLFAERESIHVNLLIAGACAGEGDAFSTVQK 397 (667) Q Consensus 319 s~~v~~~~~~~-~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~v~~ 397 (667) +.++.+..... .-.......|.||.++... .++.++++.++.. +...++++ ...++++. T Consensus 262 n~~v~~~~~~~~~la~~~~~~LtGG~dG~~~-----------~~~~dal~~Le~~---~~~~i~~~------t~d~ai~~ 321 (562) T protein:vir:80 262 NGYVEFEFDRSKEIANFPLTKLTGGDNGTIP-----------ESWADKFSYFANE---GGYYLVPL------TSKQAVHA 321 (562) T ss_pred cceEEEEeccCccccccceeeeeCCCCCCcc-----------ccHHHHHHHHHhC---CcEEEEec------CCChHHHH Confidence 22332221111 1111223467788776432 2345566666543 34444432 23567889 Q ss_pred HHHHHHhhcCc----EEEEEccCccccccccccCCHHHHHHHhhhhccccccccccCcceEEEEehhhcccccccCceeE Q lcl|NC_012740. 398 HAVSIGDERQD----CLVMVSPPRSTVVNIPVTTAIDNLIAWREGNSNYSDNNMNINTTYAVIDGNYKYQYDKYNDVNRW 473 (667) Q Consensus 398 ~~~~~~~~~~~----~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~v~d~~~~~~~~ 473 (667) ++.+||++|++ ++++++.+ .+.+++++.++.+ .+++.+.++++|+..+.+. ++..+. T Consensus 322 ~~~a~vkr~r~~g~~~~aVvg~~--------~~~~~~~~~~~a~----------~~n~e~vv~v~~~~~~~~~-~~~~~~ 382 (562) T protein:vir:80 322 EALQFVRDCSYNGNPMRVFVGGG--------IGESMEQLFTRAI----------GLQNERAGLIGFSGTVKMD-DGRSLK 382 (562) T ss_pred HHHHHHHHHHhCCCeEEEEecCC--------CCCCHHHHHHHhh----------hcCCCeEEEEecCeeEECC-CCceee Confidence 99999988765 88888765 3457777777665 3678999999998776655 355555 Q ss_pred ech---HHHHHHHHHHhhhcCCceeeecceeccceeccccccccCChhhhhhhhhcCceEEEEecCCeEEEEc----cee Q lcl|NC_012740. 474 VPL---AADIAGLCARTDAVSQPWMSPAGYNRGQIMNVVKLAIEPRKAHRDRLYQAAINPVIGAGGEGFILMG----DKT 546 (667) Q Consensus 474 ~p~---sg~vAg~~a~~d~~~g~~~span~~~~~i~g~~~~~~~~~~~e~~~Ln~~gIn~i~~~~~~G~~~wG----~rT 546 (667) .|+ ++++||++|+.+ +++||.|+.+. + .++...+++.|++.|+++|++++++.++++.++|. -.| T Consensus 383 ~~~~~~aa~vAGl~Ag~~----~~~S~T~~~i~---~-~~v~~~lt~~e~~~li~~G~l~l~~~~~~~v~~~riv~~itT 454 (562) T protein:vir:80 383 MPGYMFAAQVAGLTCGLE----IGEAITFKNIA---I-ETLDTIYEGSQLDQLNESGIITAEFVRNRAVTNFRIVDDVTT 454 (562) T ss_pred echhHHHHHHHHHHhcCc----cccCccceeec---c-ccccccCCHHHHHHHHhCCeEEEEEecCCcEEEEEeecccee Confidence 565 889999999887 77899999864 2 35677899999999999999999998888777762 223 Q ss_pred cC-CCcccceeeehhhhhHHHHHHHHHHH-HHHhcCCCCHHHHHHHHHHHHHHHHHHHhcCCeeeeEEEEcccCCCHHHh Q lcl|NC_012740. 547 AT-TVPSPFDRINVRRLFNMLKKNIGDSS-KYKLFENNDNFTRASFRMEVSQYLSTIRSLGGIYDFRVQCDTTNNTPDVI 624 (667) Q Consensus 547 ~~-~~~~~~~~i~vrR~~~~i~~~l~~~~-~~~v~epn~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~nt~~~i 624 (667) .. .++..|++|+++|++|+|++.|++.. +||++|||+...|.+|+..|..||.+||+.|+|.+|... +-+.++ T Consensus 455 ~t~~~~~~~~ki~viRv~D~i~~dir~~~~~~yIGk~Nn~~~r~~v~~~i~~~L~~l~~~gaI~~~~~~-----dv~v~~ 529 (562) T protein:vir:80 455 FNDKTDPVKSEIGVGEANDFLVSELKISLDNEYIGTKIIDTSASLVKNFVQSFLDRKKLAKEIQDYSPE-----EVQVVI 529 (562) T ss_pred ccCCCCchhhhhhhhHHHHHHHHHHHHHHHhcCCccccChHHHHHHHHHHHHHHHHHHhCCcccCCCcc-----ceEEEe Confidence 22 33468999999999999999998886 699999999999999999999999999999999998531 223346 Q ss_pred hCCeEEEEEEEEecCCceEEEEEEEEeecCeeH Q lcl|NC_012740. 625 DRNEFVASMFIKPAKSINYIMLNFTAVATGADF 657 (667) Q Consensus 625 ~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~ 657 (667) ++++++|++.++|+.|||||++++......++- T Consensus 530 ~~d~~~v~~~v~Pv~~mekIy~ti~~~~~~~~~ 562 (562) T protein:vir:80 530 EGDIARISLTVFPIRSMKKIEVSLVYRQQILTA 562 (562) T ss_pred cCCEEEEEEEEEEcccceEEEEEEEEEeeeecC Confidence 778999999999999999999998877666555 No 37 >protein:vir:95741 Length: 587 # NCBI annotation: ORF011 # Family: family:all:2449 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240910;genbank:gi:66394960;genbank:GeneID:5132682 Probab=100.00 E-value=1.7e-65 Score=375.63 Aligned_cols=564 Identities=14% Similarity=0.072 Sum_probs=335.0 Q ss_pred CceecCceEEEEecCCCccccc-CCCceEEEeeccCCCCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHHcCCCeEE Q lcl|NC_012740. 1 MTLLSPGFETKETTLSTTIVQS-ATGRAALVGKFQWGPAFQIVQVTNEVELVNKFGQPDNNTADYFMSGANFLQYGNDLR 79 (667) Q Consensus 1 ~~~~~PGVyvee~~~~~~~~~~-~ts~~afvG~~~~Gp~~~p~~i~s~~e~~~~FG~~~~~~~~~~~v~~~f~ngG~~~~ 79 (667) =.+.+|||||||.+++.+++.+ +|++++|||.++|||+++|++++||+||++.||+.+....+.+|..+||.|||++|| T Consensus 9 ~~~~~pgv~~~~~~~~~~~~~~~~~~~~~~~g~a~~G~~~~~~~~~~~~~~~~~~~~g~l~~~~~~a~~~~~~~g~~~~~ 88 (587) T protein:vir:95 9 RPITRPHASIEVDTSGIGGSAGSSEKVFCLIGQAEGGEPNTVYELRNYSQAKRLFRSGELLDAIELAWGSNPNYTAGRIL 88 (587) T ss_pred cccccCceEEEEecCCccccCCCCCceEEEEEEeCCCCCceeEEeccHHHHHHHhcCcchHHHHHHHhccccCCCceEEE Confidence 4578999999999999987665 799999999999999999999999999999998866444455566666789999999 Q ss_pred EEEcCCcccccccccccccccceeeeccccccccceeeEeeeccceeccccceeecccccceeeeecccccccccccccc Q lcl|NC_012740. 80 VVRVLNKEKAKNATALAGNVEFEITNEGSNYEVGDTIKIKHNRQDIETAGKVTKVDGDGKVKGVFIPTGKIIAHAKAIGV 159 (667) Q Consensus 80 vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~a~~~~~ 159 (667) +|||.+.. +|+....++..++..+ +.||+.+++........+.-++...........++...+.......... T Consensus 89 ~~rv~~~~---~a~~~~~~l~~~a~~~---G~~gN~i~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~v~si~y~g~- 161 (587) T protein:vir:95 89 AMRIEDAK---PASAEIGGLKITSKIY---GNVANNIQVGLEKNTLSDSLRLRVIFQDDRFNEVYDNIGNIFTIKYKGE- 161 (587) T ss_pred EEEcCCCc---eeEEEecCeEEEEecc---cccccceEEEEecCCCCCceeEEEEEecccceeeeeeccceeeeeeecc- Confidence 99995443 4555555666666555 4688889888776555444333333333333344433222111110000 Q ss_pred cccccccceEEEEEeeccccc-ceeeeceeeeceeeeeeccccchhhhccccccccccccceeeeeeccccccceeEEEE Q lcl|NC_012740. 160 YPELDGGWTAEFTSSSGNGSA-ALSVTKIVTDSGLLLTDLETSRANITNQDFLTKLKKYDMPAVSAIYAGEIGNSLEVEI 238 (667) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~-~~t~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~A~~~G~~gn~i~v~i 238 (667) .. .....+......... ...+... ....+.-...... ..........-+..+.+.|.+.|..++.+.+.. T Consensus 162 --~~--~~~~~v~~~~~t~~a~~~~l~~g--~~~v~~yrL~~g~---~~~~~~~~~~in~~~~~tAky~g~~~~~i~~~~ 232 (587) T protein:vir:95 162 --EA--NATFSVEHDEETQKASRLVLKVG--DQEVKSYDLTGGA---YDYTNAIITDINQLPDFEAKLSPFGDKNLESSK 232 (587) T ss_pred --cc--ccceeeeecccceeeeeeeeecC--CceEEEEEecCCc---hHHHHHHHHhhccccceEEEEecccCceeEEee Confidence 00 000000000000000 0000000 0000000000000 000111111223455678888888888777654 Q ss_pred eecccccccceeeeeeeecccccccceeeeeccccccccceeeeeccceeeeeEeeeccCCccccccccccchhhhcccc Q lcl|NC_012740. 239 LARSSFSGAVAPELTMYPFGGTRAAARNLIPYAPQNDNQYAFIVRRDGVVVESYVLSTLKGDKDVYGNSIYMDDFFARGS 318 (667) Q Consensus 239 ~~~a~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~v~e~~~~s~~~~~~~~~~~~~~~~~~~~~~~ 318 (667) ..... +... ................. . ....+.......+...+..... . ..... T Consensus 233 ~~~~~-~~~v--~~~~~~v~a~~~d~~~~---~--~~~~~v~~~~~~g~~~~~~~~~-------------~----~~~~~ 287 (587) T protein:vir:95 233 LDKIE-NANI--KDKAVYVKAVFGDLEKQ---T--AYNGIVSFEQLNAEGEVPSNVE-------------V----EAGEE 287 (587) T ss_pred cCccc-ccce--ehhhhhhhhhhcceeee---e--eceeeeeeecccccceeccchh-------------h----hhccc Confidence 22110 0000 00000000000000000 0 0000000000001000000000 0 00000 Q ss_pred cceEEEecccccCcccceEEecCCccccccccccccccccccchhHHHHHHhhhcccccccEEecCcCCcchhhHHHHHH Q lcl|NC_012740. 319 SQYIYATAQGWVDGFSGIISLAGGVSANEASTGDRGNDPFIGAMMQGWDLFAERESIHVNLLIAGACAGEGDAFSTVQKH 398 (667) Q Consensus 319 s~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~v~~~ 398 (667) ....................|.||.++... .++.+++++++. .++++|+++ ...++++.+ T Consensus 288 ~a~~~~~~~~~~~a~~~~t~LtGG~dG~~~-----------~~y~~~l~ale~---~~~~~i~~~------t~d~~v~a~ 347 (587) T protein:vir:95 288 SATVTATSPIKTIEPFELTKLKGGTNGEPP-----------ATWADKLDKFAH---EGGYYIVPL------SSKQSVHAE 347 (587) T ss_pred chheeccccccceeccceeeeecCCCCCCc-----------ccHHHHHHHHHh---CCcEEEEec------CCCHHHHHH Confidence 000111000000011112346777765321 245566666654 345555432 245678899 Q ss_pred HHHHHhhcCc----EEEEEccCccccccccccCCHHHHHHHhhhhccccccccccCcceEEEEehhhcccccccCceeEe Q lcl|NC_012740. 399 AVSIGDERQD----CLVMVSPPRSTVVNIPVTTAIDNLIAWREGNSNYSDNNMNINTTYAVIDGNYKYQYDKYNDVNRWV 474 (667) Q Consensus 399 ~~~~~~~~~~----~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~v~d~~~~~~~~~ 474 (667) +.+||+++++ ++++++.+ .+.+.+++.+.++ .+++.+.++++|+.++. ..++....+ T Consensus 348 l~a~vk~~~~~g~~~~aVvg~~--------~~~~~~~~~~~a~----------~~n~ervi~v~~~~~~~-~~dg~~~~~ 408 (587) T protein:vir:95 348 VASFVKERSDAGEPMRAIVGGG--------FNESKEQLFGRQE----------SLSNPRVSLVANSGTFV-MDDGRKNHV 408 (587) T ss_pred HHHHHHHHHhCCCcEEEEEcCC--------CCCCHHHHHHHHh----------hcCCCcEEEecccceEe-cCCCceeee Confidence 9999988765 88888764 2457777777664 36788999998886543 235666777 Q ss_pred ch---HHHHHHHHHHhhhcCCceeeecceeccceeccccccccCChhhhhhhhhcCceEEEEecCCe---EEE-Ecceec Q lcl|NC_012740. 475 PL---AADIAGLCARTDAVSQPWMSPAGYNRGQIMNVVKLAIEPRKAHRDRLYQAAINPVIGAGGEG---FIL-MGDKTA 547 (667) Q Consensus 475 p~---sg~vAg~~a~~d~~~g~~~span~~~~~i~g~~~~~~~~~~~e~~~Ln~~gIn~i~~~~~~G---~~~-wG~rT~ 547 (667) || ++++||++|..| +++||.|+++. ..++...+++.|++.|+++|+++++..++++ +++ .+-.|. T Consensus 409 ~~~~~aa~vAGl~Ag~~----~~~SlT~~~i~----~~~v~~~~t~~e~e~ai~~Gvl~l~~~~~~~~~~vriv~~itT~ 480 (587) T protein:vir:95 409 PAYMVAVALGGLASGLE----IGESITFKPLR----VSSLDQIYESIDLDELNENGIISIEFVRNRTNTFFRIVDDVTTF 480 (587) T ss_pred chHHHHHHHHHHHhcCc----hhcCccceeee----cccccccCCHHHHHHHHhCCeEEEEEecCCcceEEEEeecceec Confidence 87 788999999887 67799998764 3456778999999999999999999887664 443 344444 Q ss_pred C-CCcccceeeehhhhhHHHHHHHHHHH-HHHhcCCCCHHHHHHHHHHHHHHHHHHHhcCCeeeeEEEEcccCCCHHHhh Q lcl|NC_012740. 548 T-TVPSPFDRINVRRLFNMLKKNIGDSS-KYKLFENNDNFTRASFRMEVSQYLSTIRSLGGIYDFRVQCDTTNNTPDVID 625 (667) Q Consensus 548 ~-~~~~~~~~i~vrR~~~~i~~~l~~~~-~~~v~epn~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~nt~~~i~ 625 (667) . .++..|++|+++|++|+|.+.|++.+ +||++|||++..|..|+..|..||.+||+.|+|.+|.. .+.+-++. T Consensus 481 t~~~d~~~~~i~viRv~D~i~~dir~~~~~~~iGk~nn~~~r~~v~~~i~~~L~~l~~~gaI~~~~~-----~dv~v~~~ 555 (587) T protein:vir:95 481 NDKSDPVKAEMAVGEANDFLVSELKVQLEDQFIGTRTINTSASIIKDFIQSYLGRKKRDNEIQDFPA-----EDVQVIVE 555 (587) T ss_pred cCCCCcchhhhhhhhhHHHHHHHHHHHHHhhCCccccchHHHHHHHHHHHHHHHHHHhCCcccCCCc-----cceEEEec Confidence 2 33457999999999999999999886 69999999999999999999999999999999999854 22333456 Q ss_pred CCeEEEEEEEEecCCceEEEEEEEEeecCeeH Q lcl|NC_012740. 626 RNEFVASMFIKPAKSINYIMLNFTAVATGADF 657 (667) Q Consensus 626 ~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~ 657 (667) ..+++|++.++|+.|+|+|.++++.....++- T Consensus 556 ~d~~~v~~~v~Pv~~mekI~vt~~~~~~~~~~ 587 (587) T protein:vir:95 556 GNEARISMTVYPIRSFKKISVSLVYKQQTLQA 587 (587) T ss_pred CCEEEEEEEEEEcccceEEEEEEEEeeeeecC Confidence 67899999999999999999998866555443 No 38 >protein:vir:102819 Length: 648 # NCBI annotation: tail sheath protein # Family: family:all:2449 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874082;genbank:gi:118197689;genbank:GeneID:4496011 Probab=100.00 E-value=1.7e-63 Score=364.78 Aligned_cols=598 Identities=13% Similarity=0.045 Sum_probs=299.1 Q ss_pred Cce---------ecCceEEEEecCCCccccc-CCCceEEEeeccCCCCCccEEecCHHHHHHHcCCcCccchhHHHHHHH Q lcl|NC_012740. 1 MTL---------LSPGFETKETTLSTTIVQS-ATGRAALVGKFQWGPAFQIVQVTNEVELVNKFGQPDNNTADYFMSGAN 70 (667) Q Consensus 1 ~~~---------~~PGVyvee~~~~~~~~~~-~ts~~afvG~~~~Gp~~~p~~i~s~~e~~~~FG~~~~~~~~~~~v~~~ 70 (667) |+. .+|||||||+|++.+++++ +|++++|||.++|||+|+|++|+||.||++.||+ .+|.||+++| T Consensus 1 ma~~~yf~~~~~~~PGVyvee~~sg~~~i~gv~tsva~fvG~a~~Gp~~~p~~v~s~~~~~~~fgg----g~l~~av~~~ 76 (648) T protein:vir:10 1 MAISVYFDGKLIKQLGAYVKTDLSAVKQINGVGTGIVALLGLAEGGETYKPYRLTSFAEAVSIFKG----GPLLEHIKAA 76 (648) T ss_pred CeeeeeeCCCCccCCceEEEEeccccccccCCCCceEEEEEeeCCCCCceeEEecCHHHHHHHhcC----ccHHHHHHHH Confidence 433 3599999999999988876 6999999999999999999999999999999996 4699999999 Q ss_pred HHcCCCeEEEEEcCCcccccccccccccccceeeeccccccccceeeEeee--ccceeccccceeecccccceeeeeccc Q lcl|NC_012740. 71 FLQYGNDLRVVRVLNKEKAKNATALAGNVEFEITNEGSNYEVGDTIKIKHN--RQDIETAGKVTKVDGDGKVKGVFIPTG 148 (667) Q Consensus 71 f~ngG~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~~~~~--~~~~~~~~~~~~~d~~~~~~~~~~~~~ 148 (667) |+|||++||+|||.+.+. ++........++.. .+.||+.+..... .........+........ ..+.... T Consensus 77 F~nGg~~~~~vRv~~~~~---a~~~~~~~~~~a~~---~g~~gn~i~~~v~~~~~~~~~~~~l~v~~~~~~--~~~d~~v 148 (648) T protein:vir:10 77 FIGGAGEVVAVRIGNPTT---ASVSIPVAQNTSDT---SPANLNFVSYEASTRSNQIYVSFDLDENFTSAN--EADDTII 148 (648) T ss_pred HhCCCcEEEEEEcCCCcc---cceecceeEEeecc---cCCCCCceEEEEEEcCCCcCceeEEEEEecCCC--cccceeE Confidence 999999999999976543 23223333333333 3456776653332 111221112222111111 0100000 Q ss_pred ccccccccccccccccccceEEEEEeecccc-cceeeeceeeeceeeeeeccccchhhhccccccccccccceeeeeecc Q lcl|NC_012740. 149 KIIAHAKAIGVYPELDGGWTAEFTSSSGNGS-AALSVTKIVTDSGLLLTDLETSRANITNQDFLTKLKKYDMPAVSAIYA 227 (667) Q Consensus 149 ~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~-~~~t~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~A~~~ 227 (667) + ........+.......+..+........ ...+....+......... .... .......................+ T Consensus 149 -~-~i~~~~~~y~gt~~~~t~~v~~~~~~~~~~~~~~~~~~~~~~v~~~~-~~~~-~~~~~~v~~~~~~~~~~~~~~~~~ 224 (648) T protein:vir:10 149 -F-TIYQKHPDFSVTRETFTFPRKFTTPTVLVKRGSTLFFVDRSIVNAAL-AAGP-AFQTALINLLKEQLQPTDVVQIFD 224 (648) T ss_pred -E-EeccCCCcccccceeccccccccccccccccccceeecCccchhhhh-ccCc-cchhhhhhchhhhhhhhhhheecc Confidence 0 0000000000000000000000000000 000000000000000000 0000 000000000000000000000000 Q ss_pred ccccceeEEEEeecccccccceeeeeeeeccc--ccccc--eeeeeccccccccceeeeeccceeeeeEee---eccCCc Q lcl|NC_012740. 228 GEIGNSLEVEILARSSFSGAVAPELTMYPFGG--TRAAA--RNLIPYAPQNDNQYAFIVRRDGVVVESYVL---STLKGD 300 (667) Q Consensus 228 G~~gn~i~v~i~~~a~~~~~~~~~~t~~~~~~--~~~~~--~~~~~~~~~~~~~~~~~v~~~g~v~e~~~~---s~~~~~ 300 (667) ... ...+.+.. ............. ..... ...-....+......+.+. .........+ +...+. T Consensus 225 ~s~--~~~~d~~~------~~~~~~a~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~-~tp~~~~~~~~~~~~~~~~ 295 (648) T protein:vir:10 225 ASD--TNPVDIPL------GLFVYEVLYGGLFGFTKSRLVKTSFGTVDDLLSNPLLFNLS-ATPFFDGSDYQDYTSLSDP 295 (648) T ss_pred ccc--cccccccc------ccccccccchhhhcCCcchhhhhhhccccccccccceeccc-ccccccccceeeeeccccc Confidence 000 00000000 0000000000000 00000 0000000000000000000 0000000000 000000 Q ss_pred cccccccccchhhhcccccceEEEecccccCcccceEEecCCccccccccccc-cccccccchhHHHHHHhhhccccc-- Q lcl|NC_012740. 301 KDVYGNSIYMDDFFARGSSQYIYATAQGWVDGFSGIISLAGGVSANEASTGDR-GNDPFIGAMMQGWDLFAERESIHV-- 377 (667) Q Consensus 301 ~~~~~~~~~~~~~~~~~~s~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~-- 377 (667) .+... +.....+.++.....+.. .......|+||+++..+..... +......++.+++.+++..+.+.+ T Consensus 296 ~~~~~-------v~~~~~~~l~~~~~~p~~-~~~~~t~L~GGtdG~~p~s~~~~~~~~~~~d~~d~l~~~~~~~~~~ivp 367 (648) T protein:vir:10 296 ANWFA-------KDAYTINHLVDTTINPHI-LATRIFSLSGGTNGDDGTGYYQTAVSNYINIWSQGLATLEEEEVNFVIP 367 (648) T ss_pred cceee-------eeccchhhcccccccCcc-cccccceecccccCCCcccccccccccchhhHHHHhhhccCCCceEEEe Confidence 00000 000111222222211111 0111235889998877654322 334456778888888877665431 Q ss_pred --ccEEecCcCCcchhhHHHHHHHHHHHhhcC---------cEEEEEccCccccccccccCCHHHHHHHh--hhhccccc Q lcl|NC_012740. 378 --NLLIAGACAGEGDAFSTVQKHAVSIGDERQ---------DCLVMVSPPRSTVVNIPVTTAIDNLIAWR--EGNSNYSD 444 (667) Q Consensus 378 --~~l~~~~~~~~~~~~~~v~~~~~~~~~~~~---------~~~ai~d~p~~~~~~~~~~~~~~~~~~~~--~~~~~~~~ 444 (667) .+.+...........++++.++++||.+|. ..++++.++ .+.+.++....+ ......+. T Consensus 368 ~~~~~~~~~~~~~lt~~q~i~a~a~shv~~~s~~~~~~~r~~~~~~vg~~--------~~es~~~se~~~~~~~~~~~~a 439 (648) T protein:vir:10 368 AYKFTNVTQLNDRLTIFKGIASTFLSHVQTMSQVNRRKARVGVFGLPAPS--------PNESVTASEYLYNRNILNTISA 439 (648) T ss_pred ecccccccccccccCCccchHHHHHHHHHHhhhccccccccCeEEEeCCC--------CchhHHHHHHHhhhhcccccce Confidence 111112222223455788999999997542 124444332 233333322222 22211111 Q ss_pred cccccCcceE-EEEehhhcccccccCceeEech---HHHHHHHHHHhhhcCCceeeecceeccceeccccccccCChhhh Q lcl|NC_012740. 445 NNMNINTTYA-VIDGNYKYQYDKYNDVNRWVPL---AADIAGLCARTDAVSQPWMSPAGYNRGQIMNVVKLAIEPRKAHR 520 (667) Q Consensus 445 ~~~~~~s~~~-~~~~p~~~v~d~~~~~~~~~p~---sg~vAg~~a~~d~~~g~~~span~~~~~i~g~~~~~~~~~~~e~ 520 (667) .....+-.++ ...+.+.+..| +|+..++|| ++++||+++++ .++.||.||++.+ .+ +++.+.++++|+ T Consensus 440 ~~~~~d~~~~~~~~~~~~~~~~--~G~~~~~p~~~~Aa~VAGl~a~l----~~~~s~T~k~i~~-~~-id~~~~~t~~ql 511 (648) T protein:vir:10 440 MFGGTDRAQAVVFPFYSNVFND--EGKVELLGGEFFASYVAGMHANR----EPQDSITFLPISG-IG-AEPLYNWTYTQK 511 (648) T ss_pred eeeecCCceEEeecccceeECC--CCcEEecchhhHHHHHHhhhhcc----ccccCcccceeec-cc-cccccCCCHHHH Confidence 1112221222 22233333222 567778898 67788888875 4888999998752 23 334478999999 Q ss_pred hhhhhcCceEEEEecCC----eEEEEcceecCC--CcccceeeehhhhhHHHHHHHHHHH-HHHhcCCCCHHHHHHHHHH Q lcl|NC_012740. 521 DRLYQAAINPVIGAGGE----GFILMGDKTATT--VPSPFDRINVRRLFNMLKKNIGDSS-KYKLFENNDNFTRASFRME 593 (667) Q Consensus 521 ~~Ln~~gIn~i~~~~~~----G~~~wG~rT~~~--~~~~~~~i~vrR~~~~i~~~l~~~~-~~~v~epn~~~~~~~i~~~ 593 (667) +.|+++||+||.+.+++ ++++-.+.|... ++..|+.|+++|+.||+...|++.+ ++|+++||++..|.+||+. T Consensus 512 d~L~~~Gv~~ie~~~~~~~~~~~rvv~gITT~~~~~~~~~~eisv~ri~D~l~~~vr~~l~~~fIG~~n~~~~~~~ik~~ 591 (648) T protein:vir:10 512 DDLISNRVLFVEKVKTSFGGIVYRIHHNPTTWLGPVTQGFQEFVLRRIDDFLQSYVYKNLQEQFIGRKSYGRKTENDIKV 591 (648) T ss_pred HHHhcCCcEEEEEecCCcceeeEEEeccceeecCCCCcceeeeeeeehhhHHHHHHHHHHhhhcCcccccHHHHHHHHHH Confidence 99999999999988764 566666666543 3457999999999999999998755 5999999999999999999 Q ss_pred HHHHHHHHHhcCCeeeeE---EEEcccCCCHHHhhCCeEEEEEEEEecCCceEEEEEEEEeecCee Q lcl|NC_012740. 594 VSQYLSTIRSLGGIYDFR---VQCDTTNNTPDVIDRNEFVASMFIKPAKSINYIMLNFTAVATGAD 656 (667) Q Consensus 594 i~~~l~~l~~~gal~g~~---v~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~ 656 (667) |.+||.++++.++|++|. |.+ ++++++++|++.++|++|++||.+++.-... ++ T Consensus 592 i~~~L~~~~~~~~I~~y~~~~v~~--------~~~~~vv~V~~~v~Pv~~i~~I~vti~it~~-~~ 648 (648) T protein:vir:10 592 YTEALLSNLVGKQIVAYKDVKVTS--------NEDKTVYYVEFFYQPVTEIKFILVTMKVTFD-LE 648 (648) T ss_pred HHHHHhhHhhcCcccCcccceEEE--------EecCCEEEEEEEEEecceeeEEEEEEEEEec-cC Confidence 999999999999999975 333 2356899999999999999999999765543 33 No 39 >protein:vir:99306 Length: 587 # NCBI annotation: putative major tail sheath protein # Family: family:all:2449 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024479;genbank:gi:48696438;genbank:GeneID:2948034 Probab=100.00 E-value=1.9e-63 Score=364.44 Aligned_cols=564 Identities=15% Similarity=0.089 Sum_probs=330.0 Q ss_pred CceecCceEEEEecCCCccccc-CCCceEEEeeccCCCCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHHcCCCeEE Q lcl|NC_012740. 1 MTLLSPGFETKETTLSTTIVQS-ATGRAALVGKFQWGPAFQIVQVTNEVELVNKFGQPDNNTADYFMSGANFLQYGNDLR 79 (667) Q Consensus 1 ~~~~~PGVyvee~~~~~~~~~~-~ts~~afvG~~~~Gp~~~p~~i~s~~e~~~~FG~~~~~~~~~~~v~~~f~ngG~~~~ 79 (667) =.+++|||||||.+++.++..+ ++++++|||.++|||+++|++|+||+||++.||+-.......+|...||.|||++|| T Consensus 9 ~~~~~pgv~~~~~~~~~~~~~~~~~~~~~~~g~a~~G~~~~~~~~~~~~~~~~~~~~g~l~~~~~~a~~~~~~~g~~~~~ 88 (587) T protein:vir:99 9 RPITRPHASIEVDTSGIGGSAGSSEKVFCLIGQAEGGEPNTVYELRNYSQAKRLFRSGELLDAIELAWGSNPNYTAGRIL 88 (587) T ss_pred cccccCceEEEEecCCccccCCCCCceEEEEEEecCCccceeEEeccHHHHHHHhcCcchHHHHHHHhccccCCCceEEE Confidence 4578999999999999887665 799999999999999999999999999999998744222223333444479999999 Q ss_pred EEEcCCcccccccccccccccceeeeccccccccceeeEeeeccceeccccceeecccccceeeeecccccccccccccc Q lcl|NC_012740. 80 VVRVLNKEKAKNATALAGNVEFEITNEGSNYEVGDTIKIKHNRQDIETAGKVTKVDGDGKVKGVFIPTGKIIAHAKAIGV 159 (667) Q Consensus 80 vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~a~~~~~ 159 (667) ++||.+. .+|+....++..++..+| .||+.+++........+.-++....+..+...++...+.......... T Consensus 89 ~~rv~~~---~~a~~~~~~l~~~a~~~G---~~gN~i~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~v~~i~y~g~- 161 (587) T protein:vir:99 89 AMRIEDA---KPASAEIGGLKITSKIYG---NVANNIQVGLEKNTLSDSLRLRVIFQDDRFNEVYDNIGNIFTIKYKGE- 161 (587) T ss_pred EEEcCCC---ceeEEEecCeEEEEeecc---ccccceEEEEccCCCCcceeEEEEEecccceeeeeeccceeeEEeecc- Confidence 9999543 345555666666665554 688989887776655554444333333343344433221111100000 Q ss_pred cccccccceEEEEEeeccccc-ceeeeceeeeceeeeeeccccchhhhccccccccccccceeeeeeccccccceeEEEE Q lcl|NC_012740. 160 YPELDGGWTAEFTSSSGNGSA-ALSVTKIVTDSGLLLTDLETSRANITNQDFLTKLKKYDMPAVSAIYAGEIGNSLEVEI 238 (667) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~-~~t~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~A~~~G~~gn~i~v~i 238 (667) . ......+......... ...+... ....+.-...... ..........-+..+.+.|.+.+..++++.... T Consensus 162 --~--~~a~~~v~~~~~t~~a~~~~l~~g--~~~v~~yrL~~g~---~~~~~~~~~~i~~~~~~tAky~~~~~~~i~~~~ 232 (587) T protein:vir:99 162 --E--ANATFSVEHDEETQKASRLVLKVG--DQEVKSYDLTGGA---YDYTNAIITDINQLPDFEAKLSPFGDKNLESSK 232 (587) T ss_pred --c--ccceeeEeecCcceeeeeeeeecC--CceeEEEEecCCc---hHHHHHHHhhhccccceeEEeeccCCceeEeec Confidence 0 0000000000000000 0000000 0000000000000 000011111223344567777777777665432 Q ss_pred eecccccccceeeeeeeecccccccceeeeeccccccccceeeeeccceeeeeEeeeccCCccccccccccchhhhcccc Q lcl|NC_012740. 239 LARSSFSGAVAPELTMYPFGGTRAAARNLIPYAPQNDNQYAFIVRRDGVVVESYVLSTLKGDKDVYGNSIYMDDFFARGS 318 (667) Q Consensus 239 ~~~a~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~v~e~~~~s~~~~~~~~~~~~~~~~~~~~~~~ 318 (667) .... ..... ............ .+........... +....+........ ....... T Consensus 233 ~~~~-~~~~v--~~~~~~v~a~~~----------------D~~~~~~~~~~~~--~~~~~g~~~~~~~~----~~~~~~~ 287 (587) T protein:vir:99 233 LDKI-ENANI--KDKAVYVKAVFG----------------DLEKQTAYNGIVS--FEQLNAEGEVPSNV----EVEAGEE 287 (587) T ss_pred cccc-cccee--eeeeeeeehhcc----------------ceeeecccceeee--eeecccccchhhhh----hhhhccc Confidence 2110 00000 000000000000 0000000000000 00001100000000 0000001 Q ss_pred cceEEEecccccCcccceEEecCCccccccccccccccccccchhHHHHHHhhhcccccccEEecCcCCcchhhHHHHHH Q lcl|NC_012740. 319 SQYIYATAQGWVDGFSGIISLAGGVSANEASTGDRGNDPFIGAMMQGWDLFAERESIHVNLLIAGACAGEGDAFSTVQKH 398 (667) Q Consensus 319 s~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~v~~~ 398 (667) ...+................|.||.++... .++.+++++++. .++++|+++ ...++++.+ T Consensus 288 ~~~~~~~~~~~~~a~~~~t~LtGG~dG~~~-----------~sy~~al~ale~---~~~~~i~~~------t~d~~i~a~ 347 (587) T protein:vir:99 288 SATVTATSPIKTIEPFELTKLKGGTNGEPP-----------ATWADKLDKFAH---EGGYYIVPL------SSKQSVHAE 347 (587) T ss_pred cceeeeeccccceecccceeeecCCCCCcc-----------ccHHHHHHHHhh---CCcEEEEec------CCCHHHHHH Confidence 111111111111111122346677665321 235566666654 345555432 245678899 Q ss_pred HHHHHhhcCc----EEEEEccCccccccccccCCHHHHHHHhhhhccccccccccCcceEEEEehhhcccccccCceeEe Q lcl|NC_012740. 399 AVSIGDERQD----CLVMVSPPRSTVVNIPVTTAIDNLIAWREGNSNYSDNNMNINTTYAVIDGNYKYQYDKYNDVNRWV 474 (667) Q Consensus 399 ~~~~~~~~~~----~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~v~d~~~~~~~~~ 474 (667) +.+||+++++ ++++++.+. +.+++++.+++.. +++.+.+.++|+..+. ..++....+ T Consensus 348 l~a~vk~~r~~g~~~~aVlg~~~--------~~~~~~~~~~a~~----------~n~e~vi~v~~~~~~~-~~dg~~~~~ 408 (587) T protein:vir:99 348 VASFVKERSDAGEPMRAIVGGGF--------NESKEQLFGRQAS----------LSNPRVSLVANSGTFV-MDDGRKNHV 408 (587) T ss_pred HHHHHHHHHhCCCcEEEEecCCC--------CCCHHHHHHHhhh----------cCCCcEEEEeccceEe-cCCCceeee Confidence 9999988765 888887653 4577788776653 6788999998876543 234666677 Q ss_pred ch---HHHHHHHHHHhhhcCCceeeecceeccceeccccccccCChhhhhhhhhcCceEEEEecCCe---EEE-Ecceec Q lcl|NC_012740. 475 PL---AADIAGLCARTDAVSQPWMSPAGYNRGQIMNVVKLAIEPRKAHRDRLYQAAINPVIGAGGEG---FIL-MGDKTA 547 (667) Q Consensus 475 p~---sg~vAg~~a~~d~~~g~~~span~~~~~i~g~~~~~~~~~~~e~~~Ln~~gIn~i~~~~~~G---~~~-wG~rT~ 547 (667) |+ ++++||++|..+ +++||.|+.+. ..++...+++.|++.|+++|+++++..++++ +++ .+-.|. T Consensus 409 ~~~~~aa~vAGl~Ag~~----~~~SlT~~~i~----~~~v~~~~t~~e~e~li~~Gvl~l~~~~~~~~~~vriv~~ItT~ 480 (587) T protein:vir:99 409 PAYMVAVALGGLASGLE----IGESITFKPLR----VSSLDQIYESIDLDELNENGIISIEFVRNRTNTFFRIVDDVTTF 480 (587) T ss_pred chHHHHHHHHHHHhcCc----hhcCccceeee----cccccccCCHHHHHHHHhCCeEEEEEecCCcceEEEEeeceeec Confidence 77 688999999877 77899998753 3457778999999999999999999887664 443 344443 Q ss_pred C-CCcccceeeehhhhhHHHHHHHHHHH-HHHhcCCCCHHHHHHHHHHHHHHHHHHHhcCCeeeeEEEEcccCCCHHHhh Q lcl|NC_012740. 548 T-TVPSPFDRINVRRLFNMLKKNIGDSS-KYKLFENNDNFTRASFRMEVSQYLSTIRSLGGIYDFRVQCDTTNNTPDVID 625 (667) Q Consensus 548 ~-~~~~~~~~i~vrR~~~~i~~~l~~~~-~~~v~epn~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~nt~~~i~ 625 (667) . .++..|++|+++|++|+|++.|++.+ ++|++|||++..|..|+..|..||.+||+.|+|.+|... ..+-+.. T Consensus 481 t~~~~~~~~~i~viRv~D~i~~di~~~~~~~yiGk~Nn~~~r~~i~~~i~~~L~~l~~~gaI~~~~~~-----dv~v~~~ 555 (587) T protein:vir:99 481 NDKSDPVKAEMAVGEANDFLVSELKVQLEDQFIGTRTINTSASIIKDFIQSYLGRKKRDNEIQDFPAE-----DVQVIVE 555 (587) T ss_pred cCCCCchhhhhhhhhhHHHHHHHHHHHHHhhCCccccchHHHHHHHHHHHHHHHHHHhCCcccCCCcc-----ceEEEec Confidence 2 33457999999999999999999886 699999999999999999999999999999999998641 1222345 Q ss_pred CCeEEEEEEEEecCCceEEEEEEEEeecCeeH Q lcl|NC_012740. 626 RNEFVASMFIKPAKSINYIMLNFTAVATGADF 657 (667) Q Consensus 626 ~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~ 657 (667) ..+++|++.++|+.|+|+|.++++.....++- T Consensus 556 ~d~~~v~~~v~Pv~~mekIy~tv~~~~~~~~~ 587 (587) T protein:vir:99 556 GNEARISMTVYPIRSFKKISVSLVYKQQTLQA 587 (587) T ss_pred CCEEEEEEEEEEcccceEEEEEEEEEeeeecC Confidence 56899999999999999999998876665544 No 40 >protein:vir:80779 Length: 569 # NCBI annotation: putative tail sheath protein # Family: family:all:2449 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504132;genbank:gi:158079319;genbank:GeneID:5666428 Probab=100.00 E-value=3.6e-63 Score=362.91 Aligned_cols=541 Identities=14% Similarity=0.078 Sum_probs=314.1 Q ss_pred CceecCceEEEEecCCCccccc-CCCceEEEeeccCCCCCccEEecCHHHHHHHcCC--cCccchhHHHHHHHHHcCCCe Q lcl|NC_012740. 1 MTLLSPGFETKETTLSTTIVQS-ATGRAALVGKFQWGPAFQIVQVTNEVELVNKFGQ--PDNNTADYFMSGANFLQYGND 77 (667) Q Consensus 1 ~~~~~PGVyvee~~~~~~~~~~-~ts~~afvG~~~~Gp~~~p~~i~s~~e~~~~FG~--~~~~~~~~~~v~~~f~ngG~~ 77 (667) =.+..||||++|.+++.+++++ ++++++|||.+++||+|+|++|+||+||++.||+ +.+..++.|....+|.|||++ T Consensus 9 ~~~~~Pgv~~~~~~~~~~~~~~~~~~~~~~ig~a~~G~~~~~~~~~~~~~~~~~f~~g~l~~a~~~a~~~~~~~~~~~~~ 88 (569) T protein:vir:80 9 KKVSRPHTEITVDTSGIGGSSSSSDKTLMLVGSAKGGKPDTVYRFRNYQQAKQVLRSGDLLDAIELAWNASDVNTASAGD 88 (569) T ss_pred CccccCceEEEEecCCCcCCCCCCceeEEEEEEeCCCCCceeEEecCHHHHHHHhcCCchhHHHHhhccCccccccCceE Confidence 3456999999999999988776 6999999999999999999999999999999965 455556666677778999999 Q ss_pred EEEEEcCCcccccccccccccccceeeeccccccccceeeEeeeccceeccccceee-cccccceeeeeccccccccccc Q lcl|NC_012740. 78 LRVVRVLNKEKAKNATALAGNVEFEITNEGSNYEVGDTIKIKHNRQDIETAGKVTKV-DGDGKVKGVFIPTGKIIAHAKA 156 (667) Q Consensus 78 ~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~-d~~~~~~~~~~~~~~~~~~a~~ 156 (667) ||+|||.+. ..++....+...+.. ..++|++.+.++.......+.-.+... .... . T Consensus 89 ~~~~rv~~a---~~a~~~~~~~~~~a~---~~g~~~n~i~v~l~~~~~~~~~~~~v~~~~~~-----------------~ 145 (569) T protein:vir:80 89 ILAVRVEDA---KNATLTKGGLTFAST---IYGVDANEIQVALEDNNLTHTKRLTVAFSKDG-----------------Y 145 (569) T ss_pred EEEEEcCCC---eeeeeeccceeeeee---eccCCCceEEEEEecCcCCcceeeEEeeecCC-----------------C Confidence 999999432 333333333333333 334567777666544332221111100 0000 0 Q ss_pred ccccccccccceEEEEEeecccccceeeeceeeeceeeeeeccccchhhhccccccccccccceeeeeeccccccceeEE Q lcl|NC_012740. 157 IGVYPELDGGWTAEFTSSSGNGSAALSVTKIVTDSGLLLTDLETSRANITNQDFLTKLKKYDMPAVSAIYAGEIGNSLEV 236 (667) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~A~~~G~~gn~i~v 236 (667) ...+.... ..++....+......... .. ........ .+.....+......++ T Consensus 146 ~~~~~~ig----~v~si~ytg~~~~a~~~~-~~-------~~~~~~a~----------------~l~~~~g~~~~~~~~v 197 (569) T protein:vir:80 146 KKVFDNLG----KIFSIQYKGSEAQANFTI-AQ-------DSISKKAT----------------TLTLNVGSEPESTTEV 197 (569) T ss_pred cccccccc----ceeeEEEeeccccceEEe-ec-------CcCcceeE----------------EEEEEecCCcceeEEE Confidence 00000000 001111111111000000 00 00000000 0000000000001111 Q ss_pred EEeecccccccce-eeeeeeecccccccceeeeeccccccccceeeeeccceeeeeEeeeccCCccccccccccchhhhc Q lcl|NC_012740. 237 EILARSSFSGAVA-PELTMYPFGGTRAAARNLIPYAPQNDNQYAFIVRRDGVVVESYVLSTLKGDKDVYGNSIYMDDFFA 315 (667) Q Consensus 237 ~i~~~a~~~~~~~-~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~v~e~~~~s~~~~~~~~~~~~~~~~~~~~ 315 (667) ............. ...+.+ ...+........... ...........+ .+.+.+... .......+... T Consensus 198 ~~~~~~~~~~~~~~~lv~~~--~~~~~f~a~~~~~~~---~~~~~~~~d~~~---~~~~~t~~~-----~~~~~~~di~~ 264 (569) T protein:vir:80 198 MKYELGQGVYSETNVLVSAI--NSLPDWEAKFFPIGD---KNLPTDALEAVT---KVDVKTEAV-----FVGALAGDIAK 264 (569) T ss_pred EeeccCCccchhhhhhhhhc--CCccCceEEEEecCC---Ccceehhccchh---heeccccce-----eeehhHHHHHH Confidence 1000000000000 000000 000000000000000 000000000000 001110000 00000111110 Q ss_pred -ccccceEEEecccc-cCcccceEEecCCccccccccccccccccccchhHHHHHHhhhcccccccEEecCcCCcchhhH Q lcl|NC_012740. 316 -RGSSQYIYATAQGW-VDGFSGIISLAGGVSANEASTGDRGNDPFIGAMMQGWDLFAERESIHVNLLIAGACAGEGDAFS 393 (667) Q Consensus 316 -~~~s~~v~~~~~~~-~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~ 393 (667) ...+.++.+..... .-.......|.||.++... .++.+.++.++. +++++++++ ...+ T Consensus 265 ~~~~~~~v~~~~~~~~~l~~~~~~~LtGG~dG~~~-----------~~~~~~l~~le~---~~~~~i~~~------t~d~ 324 (569) T protein:vir:80 265 QLEYNDYVTVAVDATKPVEDFELTNLTGGSDGTAP-----------ESWANKFPLLAN---EGGYYLVPL------TDKQ 324 (569) T ss_pred hhcCCceEEEEecCCcceeeecceeecCCCCCCcc-----------chHHHHHHHHhh---CCcEEEEec------CCCh Confidence 11234554433221 1111122356677664321 235556666553 455665543 2356 Q ss_pred HHHHHHHHHHhhcCc----EEEEEccCccccccccccCCHHHHHHHhhhhccccccccccCcceEEEEehhhcccccccC Q lcl|NC_012740. 394 TVQKHAVSIGDERQD----CLVMVSPPRSTVVNIPVTTAIDNLIAWREGNSNYSDNNMNINTTYAVIDGNYKYQYDKYND 469 (667) Q Consensus 394 ~v~~~~~~~~~~~~~----~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~v~d~~~~ 469 (667) +++.++.+||+++++ ++++++.+. +.+++++.++++ .+++.|.++++||..+++. ++ T Consensus 325 av~~~l~a~vkr~r~~g~~~~aVvg~~~--------~~~~~~~~~~a~----------~~n~e~vv~v~~~~~~~~~-~g 385 (569) T protein:vir:80 325 AVHSEALAFVKDRTDNGDPMRIIVGGGT--------NETVEESITRAT----------NLRDPRASLVGFSGTRKMD-DG 385 (569) T ss_pred HHHHHHHHHHHHHHhCCCcEEEEecCCC--------CCCHHHHHHHHh----------hcCCCeEEEEecCceeecC-CC Confidence 799999999998865 889887653 457788877765 4789999999999988774 45 Q ss_pred ceeEech---HHHHHHHHHHhhhcCCceeeecceeccceeccccccccCChhhhhhhhhcCceEEEEecCCeEEEEcc-- Q lcl|NC_012740. 470 VNRWVPL---AADIAGLCARTDAVSQPWMSPAGYNRGQIMNVVKLAIEPRKAHRDRLYQAAINPVIGAGGEGFILMGD-- 544 (667) Q Consensus 470 ~~~~~p~---sg~vAg~~a~~d~~~g~~~span~~~~~i~g~~~~~~~~~~~e~~~Ln~~gIn~i~~~~~~G~~~wG~-- 544 (667) ..+..|+ ++++||++|..+ +++||.|+.+. ..++...+++.|++.|+++|+.++++.+++..++|.. T Consensus 386 ~~~~~~~~~~aa~vAG~~A~~~----~~~S~T~k~i~----~~~i~~~lt~~e~~~li~~G~~~l~~~~~~~~~v~~~vn 457 (569) T protein:vir:80 386 RLLKLPGYMMASQIAGIASGLE----VGEAITFKHFN----VTSVDRVFESSQLDMLNESGVISIEFVRNRTLTAFRVVQ 457 (569) T ss_pred cceeechhhHHHHHHHHHhcCc----cccCccceeec----cccccccCCHHHHHHHHhCCeEEEEEecCceEEEEEEec Confidence 5555665 667788887665 88899999763 3457778999999999999999999998887777733 Q ss_pred --eecC-CCcccceeeehhhhhHHHHHHHHHHH-HHHhcCCCCHHHHHHHHHHHHHHHHHHHhcCCeeeeEEEEcccCCC Q lcl|NC_012740. 545 --KTAT-TVPSPFDRINVRRLFNMLKKNIGDSS-KYKLFENNDNFTRASFRMEVSQYLSTIRSLGGIYDFRVQCDTTNNT 620 (667) Q Consensus 545 --rT~~-~~~~~~~~i~vrR~~~~i~~~l~~~~-~~~v~epn~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~nt 620 (667) .|.. .++..|++|+++|++|+|++.|++.. +||++|||+...|..|+..|..||.+||++|+|.+|.. .+- T Consensus 458 ~itT~t~~~~~~~~~i~viRv~D~i~~dir~~~~~~yiGk~nn~~~r~~v~~~i~~~L~~l~~~gaI~~~~~-----~dv 532 (569) T protein:vir:80 458 DVTTYNDKSDPVKNEMSVGEANDFLVSELKIELDNNFIGTKVIDTSASLIKNFIQSFLDNKKRAREIQDYTP-----EEV 532 (569) T ss_pred cceecCCCCCchhhhhhhhHHHHHHHHHHHHHHHhhcCcccCChhHHHHHHHHHHHHHHHHHhCCcccCCCc-----cce Confidence 2222 23457999999999999999998875 68999999999999999999999999999999999852 123 Q ss_pred HHHhhCCeEEEEEEEEecCCceEEEEEEEEeecCeeH Q lcl|NC_012740. 621 PDVIDRNEFVASMFIKPAKSINYIMLNFTAVATGADF 657 (667) Q Consensus 621 ~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~ 657 (667) +.++..++++|++.++|+.|||||+++++.....++- T Consensus 533 ~v~~~~d~~~v~~~v~Pv~~~ekI~~ti~~~~~~~~~ 569 (569) T protein:vir:80 533 QVVLEGDVASISMTVMPIRSLNKITVQLVYKQQILTA 569 (569) T ss_pred EEEecCCEEEEEEEEEEcccccEEEEEEEEeeeeecC Confidence 4446778999999999999999999999977776555 No 41 >protein:vir:96586 Length: 587 # NCBI annotation: ORF011 # Family: family:all:2449 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238553;genbank:gi:66391266;genbank:GeneID:5130368 Probab=100.00 E-value=2.2e-61 Score=353.09 Aligned_cols=560 Identities=14% Similarity=0.065 Sum_probs=324.2 Q ss_pred CceecCceEEEEecCCCccccc-CCCceEEEeeccCCCCCccEEecCHHHHHHHcCCcCccchhHHHHHHHH----HcCC Q lcl|NC_012740. 1 MTLLSPGFETKETTLSTTIVQS-ATGRAALVGKFQWGPAFQIVQVTNEVELVNKFGQPDNNTADYFMSGANF----LQYG 75 (667) Q Consensus 1 ~~~~~PGVyvee~~~~~~~~~~-~ts~~afvG~~~~Gp~~~p~~i~s~~e~~~~FG~~~~~~~~~~~v~~~f----~ngG 75 (667) =.|.+|||||++.+++..++.+ ++++.+|||.+++||+++|++|++|+||++.||+.. |..|+.+.| .||| T Consensus 9 ~~~~~Pgv~~~~~~~~~~~~~~~~~~~~~~iG~a~~g~~~~~~~~~~~~~~~~~~g~G~----l~~ai~~a~~~~~~~g~ 84 (587) T protein:vir:96 9 RPIQRPHASIEVDSSGIGGSASNSEKILCLIGKAEGGEPNTVYQVRNYAQAKSVFRSGE----LLDAIELAWGSNPQYTA 84 (587) T ss_pred CcccCCceEEEEecCCccCCCCCCCceEEEEEEecCCCCceeEEEcChHHHHHhhcCCc----HHHHHHHHhccCcCCCc Confidence 4678999999999999887766 699999999999999999999999999999998754 555565555 7999 Q ss_pred CeEEEEEcCCcccccccccccccccceeeeccccccccceeeEeeeccceeccccceeecccccceeeeecccccccccc Q lcl|NC_012740. 76 NDLRVVRVLNKEKAKNATALAGNVEFEITNEGSNYEVGDTIKIKHNRQDIETAGKVTKVDGDGKVKGVFIPTGKIIAHAK 155 (667) Q Consensus 76 ~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~a~ 155 (667) +.||.|||.+. ..++........+... .++||+.+.+........+...+...-...+...++...+...... T Consensus 85 ~~~~a~rv~~~---~~a~~~~~~~~~~~~~---~g~~~n~i~v~~~~~~~~~~~~~~~~~~~~~~~~~~~n~G~v~~i~- 157 (587) T protein:vir:96 85 GKILAMRVEDA---KASQLEKGGLRVTSKI---FGSVSNDIQVALEKNTITDSLRLRVVFQKDNYQEVFDNLGNIFSIN- 157 (587) T ss_pred eEEEEEecCCC---ccceeecccccccccc---cCCCCceEEEEEEeccCCCccceEEEEecCCceeeccccCceEEEE- Confidence 99999999543 3444444444443333 3568888888776554433322221111111111111110000000 Q ss_pred cccccccccccceEEEEEeeccccc-ceeeeceeeeceeeeeeccccchhhhccccccccccccceeeeeecccccccee Q lcl|NC_012740. 156 AIGVYPELDGGWTAEFTSSSGNGSA-ALSVTKIVTDSGLLLTDLETSRANITNQDFLTKLKKYDMPAVSAIYAGEIGNSL 234 (667) Q Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~~~-~~t~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~A~~~G~~gn~i 234 (667) ..+. . ......+.......++ ...+..... ....-...... ...........+..+.+.|.++|..+|++ T Consensus 158 y~g~--~--~~a~~~~~~~~~~~~A~~l~l~gg~~--~v~~yrl~~g~---~~~~~~~~~~~~~~~~~tAky~g~~~n~~ 228 (587) T protein:vir:96 158 YKGE--G--EKATFSVEKDKETQEAKRLVLKVDEK--EVKAYELNGGA---YSFTNEIITDINELPDFEAKLSPFGDKNL 228 (587) T ss_pred eccc--c--cceeEeeccCcccceeeeeEEEecCc--eEEEEEeCCCc---hhhhhhhhhhhccccceEEEeecccCcee Confidence 0000 0 0000000000000000 000000000 00000000000 00001111122345567888999988888 Q ss_pred EEEEeecccccccceeeeeeeecccccccceeeeeccccccccceeeeeccceeeeeEeeeccCCccccccccccchhhh Q lcl|NC_012740. 235 EVEILARSSFSGAVAPELTMYPFGGTRAAARNLIPYAPQNDNQYAFIVRRDGVVVESYVLSTLKGDKDVYGNSIYMDDFF 314 (667) Q Consensus 235 ~v~i~~~a~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~v~e~~~~s~~~~~~~~~~~~~~~~~~~ 314 (667) ++............ ....+ ........... ..+...+...+. .... .... ...+. T Consensus 229 ~v~v~d~~~~~~~k--~~~~y-~~t~~~di~~~--------~~~~~~~~~~~~----------~~~~---~~~~-~~~v~ 283 (587) T protein:vir:96 229 ESRKLDEATDVDIK--GKAVY-VKAVFGDIENQ--------TQYNQYVKFEQL----------PEQA---SEPS-DVEVH 283 (587) T ss_pred EEEeeccccccccc--eEEEe-ehhhhhhhhhh--------hccccceeeccc----------cchh---hhhh-ccccc Confidence 87653211100000 00000 00000000000 000000000000 0000 0000 00000 Q ss_pred cccccceEEEecccccCcccceEEecCCccccccccccccccccccchhHHHHHHhhhcccccccEEecCcCCcchhhHH Q lcl|NC_012740. 315 ARGSSQYIYATAQGWVDGFSGIISLAGGVSANEASTGDRGNDPFIGAMMQGWDLFAERESIHVNLLIAGACAGEGDAFST 394 (667) Q Consensus 315 ~~~~s~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~ 394 (667) ........................|.||.++... .++.+.+++++. .++++|+++ ...++ T Consensus 284 ~~~~~~~~~~~~~~~~~~~~~~~aLtGG~dG~~~-----------~~y~~~l~ale~---~~~~~i~~~------t~d~a 343 (587) T protein:vir:96 284 AETESATVTATSKPKAIEPFELTKLSGGTNGEPP-----------TSWSAKLEKFKN---EGGYYIVPL------TDRQS 343 (587) T ss_pred ccccceeeeecccccccccccceeeecCCCCCCc-----------ccHHHHHHHHhh---CCcEEEEec------CCCHH Confidence 0000111111111111111122346677665321 234555666644 456666543 23467 Q ss_pred HHHHHHHHHhhcCc----EEEEEccCccccccccccCCHHHHHHHhhhhccccccccccCcceEEEEehhhcccccccCc Q lcl|NC_012740. 395 VQKHAVSIGDERQD----CLVMVSPPRSTVVNIPVTTAIDNLIAWREGNSNYSDNNMNINTTYAVIDGNYKYQYDKYNDV 470 (667) Q Consensus 395 v~~~~~~~~~~~~~----~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~v~d~~~~~ 470 (667) ++.++.+||+++++ ++++++.+ .+.+++++.+.++ .+++.+.++++++..+.+.. +. T Consensus 344 i~~~l~a~vk~~r~~gk~~~aVlg~~--------~~~~~~~~~~~a~----------~~n~e~vi~v~~~~~~~~~~-~~ 404 (587) T protein:vir:96 344 VHSEVATFVKNRSDAGEPMRAIVGGG--------TSETKEKLFGRQA----------ILNNPRVALVANSGKFVMGN-GR 404 (587) T ss_pred HHHHHHHHHHHHHhCCCeEEEEecCC--------CCCCHHHHHHHHh----------hcCCCcEEEEecceEEecCC-Cc Confidence 88999999988765 88888765 3457777777664 36789999999988877653 44 Q ss_pred eeEech---HHHHHHHHHHhhhcCCceeeecceeccceeccccccccCChhhhhhhhhcCceEEEEecCCeEEEEcc-ee Q lcl|NC_012740. 471 NRWVPL---AADIAGLCARTDAVSQPWMSPAGYNRGQIMNVVKLAIEPRKAHRDRLYQAAINPVIGAGGEGFILMGD-KT 546 (667) Q Consensus 471 ~~~~p~---sg~vAg~~a~~d~~~g~~~span~~~~~i~g~~~~~~~~~~~e~~~Ln~~gIn~i~~~~~~G~~~wG~-rT 546 (667) ....|+ ++++||++|..+ +++||.|+.+. + .++...+++.|++.|.++|+.+++..++++.++|.. ++ T Consensus 405 ~~~~~~~~~aa~vAG~~Ag~~----~~~S~T~~~~~---~-~~v~~~~t~~e~~~~i~~G~~~l~~~~~~~~~v~~~vns 476 (587) T protein:vir:96 405 ILQAPAYMVASAVAGLVSGLD----IGESITFKPLF---V-NSLDKVYESEELDELNENGIITIEFVRNRMTTMFRIVDD 476 (587) T ss_pred eeeechhhHHHHHHHHHhcCc----cccCccceeee---c-ccccccCCHHHHHHHHhCCeEEEEEecCCcEEEEEeecc Confidence 444443 688999999776 77899998764 2 356778999999999999999999988877777733 33 Q ss_pred c---C-CCcccceeeehhhhhHHHHHHHHHHH-HHHhcCCCCHHHHHHHHHHHHHHHHHHHhcCCeeeeEEEEcccCCCH Q lcl|NC_012740. 547 A---T-TVPSPFDRINVRRLFNMLKKNIGDSS-KYKLFENNDNFTRASFRMEVSQYLSTIRSLGGIYDFRVQCDTTNNTP 621 (667) Q Consensus 547 ~---~-~~~~~~~~i~vrR~~~~i~~~l~~~~-~~~v~epn~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~nt~ 621 (667) + . .++..|++|+++|++|+|.+.|++.. ++|++|||++..|..|+..|..||.+|++.|+|.+|.. .+-+ T Consensus 477 itT~t~~~~~~~~~i~virv~D~i~~di~~~~~~~yiGk~nn~~~r~~v~~~i~~~L~~l~~~g~I~~~~~-----~dv~ 551 (587) T protein:vir:96 477 VTTFPDKNDPVKSEMALGEANDFLVSELKILLEEQYIGTRTINTSASQIKDFVQSYLGRKKRDNEIQDFPP-----EDVQ 551 (587) T ss_pred ceecCCCCCchhhhhhhHHHHHHHHHHHHHHHHhcCCccccCHHHHHHHHHHHHHHHHHHHhCCcccCCCc-----cceE Confidence 2 1 23457999999999999999999886 68999999999999999999999999999999999864 1222 Q ss_pred HHhhCCeEEEEEEEEecCCceEEEEEEEEeecCeeH Q lcl|NC_012740. 622 DVIDRNEFVASMFIKPAKSINYIMLNFTAVATGADF 657 (667) Q Consensus 622 ~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~ 657 (667) -++...+++|++.++|+.|||||.++++.....++- T Consensus 552 v~~~~D~~~v~~~v~Pv~~mekIy~tv~~~~~~~~~ 587 (587) T protein:vir:96 552 VIIEGNEARISLTIFPIRALKKISVSLVYRQQTLQA 587 (587) T ss_pred EEecCCEEEEEEEEEEcccceEEEEEEEEEeeeecC Confidence 234556899999999999999999998855544433 No 42 >protein:vir:100829 Length: 607 # NCBI annotation: hypothetical protein # Family: family:all:2449 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164738;genbank:gi:56693151;genbank:GeneID:3197462 Probab=100.00 E-value=6.2e-55 Score=317.75 Aligned_cols=552 Identities=14% Similarity=0.129 Sum_probs=309.1 Q ss_pred CceecCceEEEEecCCCccccc-CCCceEEEeeccCCCCCccEEecCHHHHHHHcCC--cCccchhHHHHHHHHHcCCCe Q lcl|NC_012740. 1 MTLLSPGFETKETTLSTTIVQS-ATGRAALVGKFQWGPAFQIVQVTNEVELVNKFGQ--PDNNTADYFMSGANFLQYGND 77 (667) Q Consensus 1 ~~~~~PGVyvee~~~~~~~~~~-~ts~~afvG~~~~Gp~~~p~~i~s~~e~~~~FG~--~~~~~~~~~~v~~~f~ngG~~ 77 (667) =-+.+||||+++.+++..++.+ ++++.+|||.+++||+++|++|+||+|+++.||+ +.+...+.|.+..||.|||+. T Consensus 18 ~~~~~pgv~~~~~~~~~~~~~~~~~~~~~~iG~a~~G~~~~~~~~~~~~~a~~~f~~g~l~~a~~~a~~~~~~~~~g~~~ 97 (607) T protein:vir:10 18 FYDSRPHVETNFDDSRLSNTASDSAKNIFMLGSATNGDPTKVYEIRTSQQATKIFGSGDLVDGIKLAFDPTGNSVTNGGT 97 (607) T ss_pred CCccCCceEEEEecCcCcCCCCCCcceEEEEEEeCCCCCceEEEEcchhHHHHhhcCcchHHHHHHhhccccCCccCCce Confidence 3456999999999999987766 7999999999999999999999999999999965 444566788888889999999 Q ss_pred EEEEEcCCcccccccccccccccceeeeccccccccceeeEeeeccceecccccee-ecccccceeeeeccccccccccc Q lcl|NC_012740. 78 LRVVRVLNKEKAKNATALAGNVEFEITNEGSNYEVGDTIKIKHNRQDIETAGKVTK-VDGDGKVKGVFIPTGKIIAHAKA 156 (667) Q Consensus 78 ~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~-~d~~~~~~~~~~~~~~~~~~a~~ 156 (667) ||+|||.+. .+++....+...+.... +++++.+++... +..+..-.... +.... ....+...+.... T Consensus 98 ~~~~rv~~~---~~a~~~~~~~~~~~~~~---~~~~~~i~~~l~-~~~~~~~~~~~~~~~d~-~~~~~~n~g~~~~---- 165 (607) T protein:vir:10 98 VYALRVDNA---KQASLVKDGLTFTSSIF---GTNANQVSVALD-NDVFGVPRITVNYSPDN-YERTYTNIGQMFS---- 165 (607) T ss_pred EEEEeCCCc---cccceeccccccccccc---ccCCCceEEEEE-ecCCCccceeEEeeccc-ceeeeeeccceee---- Confidence 999999543 33444444444433333 345666665542 11111111110 11110 0111111000000 Q ss_pred ccccccccccceEEEEEeecccccceeeeceeeeceeeeeeccccchhhhcccc----ccccccccceeeeee------- Q lcl|NC_012740. 157 IGVYPELDGGWTAEFTSSSGNGSAALSVTKIVTDSGLLLTDLETSRANITNQDF----LTKLKKYDMPAVSAI------- 225 (667) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~v~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~A~------- 225 (667) +. +..........+.....+.....++..... ....... ....+..... .....-+..+.+.|. T Consensus 166 i~-y~g~~~~a~~~v~~~~~g~~~~lt~~~~~~-~~~~~~V---~~~~l~~~~~~t~~~l~~din~~~~~~A~~~g~~~i 240 (607) T protein:vir:10 166 IT-YSGKSASAGYTVSHDTDGKAILLTLGSGDS-IDKLTNV---ATFDLTMSKYDTIAKLMQAISATPNFSASVVGSPSV 240 (607) T ss_pred cc-cCcccccccceeeecCCCceeEEEecCCCc-cceeeee---ecccccccccchHHHHHHHhhcCCceEEEEecccce Confidence 00 000000000000000000000011000000 0000000 0000000000 000000111112222 Q ss_pred ---ccccccceeEEEEeecccccccceeeeeeeecccccccceeeeeccccccccceeeeeccceeeeeEe-eeccCCcc Q lcl|NC_012740. 226 ---YAGEIGNSLEVEILARSSFSGAVAPELTMYPFGGTRAAARNLIPYAPQNDNQYAFIVRRDGVVVESYV-LSTLKGDK 301 (667) Q Consensus 226 ---~~G~~gn~i~v~i~~~a~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~v~e~~~-~s~~~~~~ 301 (667) +++..++.+.+.... ...+. ...+ +....... .+. +....+.. T Consensus 241 ~tky~d~~~~~i~V~~~~--------------~iv~a---~~~D-------------~~~~~~~~---~~~~~t~~~~~~ 287 (607) T protein:vir:10 241 NTSYLDEVTSPVDVKTAP--------------AVVTA---KIGD-------------AISKLGYD---PYVVVTQTSNNK 287 (607) T ss_pred eeeccccccceeEEEEee--------------eeech---hhhh-------------hhhccccc---ceEEeeecccch Confidence 222222222211100 00000 0000 00000000 000 00000000 Q ss_pred ccccccccchhhhcccccceEEEeccccc---CcccceEEecCCccccccccccccccccccchhHHHHHHhhhcccccc Q lcl|NC_012740. 302 DVYGNSIYMDDFFARGSSQYIYATAQGWV---DGFSGIISLAGGVSANEASTGDRGNDPFIGAMMQGWDLFAERESIHVN 378 (667) Q Consensus 302 ~~~~~~~~~~~~~~~~~s~~v~~~~~~~~---~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 378 (667) .... ... ....+......... ........|.||.++... .++.+.++.++. .+.+ T Consensus 288 ~~~~------~~~--~~~~~~~~~~~~~~~~~~a~~a~~~LtGGtdG~~~-----------~ty~dal~aLe~---~e~~ 345 (607) T protein:vir:10 288 PIVN------GVS--AGTGSATASVTTAPESFPANFDTAFLTGGSTGDVP-----------VSWADKFNGAIG---NNVY 345 (607) T ss_pred hhhh------hhh--ccccceeeeeeccccccccccceeeeeCCCCCCch-----------hhHHHHHHHHhh---cCce Confidence 0000 000 00011111000000 011112346666655321 234555565554 3455 Q ss_pred cEEecCcCCcchhhHHHHHHHHHHHhhcCc----EEEEEccCccccccccccCCHHHHHHHhhhhccccccccccCcceE Q lcl|NC_012740. 379 LLIAGACAGEGDAFSTVQKHAVSIGDERQD----CLVMVSPPRSTVVNIPVTTAIDNLIAWREGNSNYSDNNMNINTTYA 454 (667) Q Consensus 379 ~l~~~~~~~~~~~~~~v~~~~~~~~~~~~~----~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~ 454 (667) +++++ ...++++.++.+||++|++ +++++..+ .+.+++++.++++. +++.+. T Consensus 346 ~i~~~------t~d~ai~~~l~a~vkr~~~~g~~~~aVlg~~--------~~~t~~~~~t~a~~----------~N~erv 401 (607) T protein:vir:10 346 YIIPL------TSEENIHAELQAFIDEQHVLGYNYHAFVGGG--------FAEPLEQILSRQVN----------INDSRF 401 (607) T ss_pred EEEec------CCCHHHHHHHHHHHHHHHhCCCcEEEEecCC--------CCCCHHHHHHHHHh----------hCCCcE Confidence 55443 2346789999999988765 88887765 35678888887754 678899 Q ss_pred EEEehhhcccccccCceeEech---HHHHHHHHHHhhhcCCceeeecceeccceeccccccccCChhhhhhhhhcCceEE Q lcl|NC_012740. 455 VIDGNYKYQYDKYNDVNRWVPL---AADIAGLCARTDAVSQPWMSPAGYNRGQIMNVVKLAIEPRKAHRDRLYQAAINPV 531 (667) Q Consensus 455 ~~~~p~~~v~d~~~~~~~~~p~---sg~vAg~~a~~d~~~g~~~span~~~~~i~g~~~~~~~~~~~e~~~Ln~~gIn~i 531 (667) +++.|+.++.| .+..+..|+ ++++||++|..+ +.+||.|+.+. ..++...+++.|++.|.++|+.++ T Consensus 402 v~V~~~~~~~~--~G~~~~~~~~~~Aa~vAGl~Ag~~----~~~SlT~k~i~----~~~v~~~lt~~e~e~ai~~Gv~~l 471 (607) T protein:vir:10 402 GLVGQSGHVQE--GGESVHVPAYLMAAYVGGLSSSLG----VAVPITNKKLA----LVDLDQNFSGDDLNTLNQNGVIGI 471 (607) T ss_pred EEEecCeeEee--CCcceeccHHHHHHHHHHHHhcCc----cccCcccceec----cccccccCCHHHHHHHHhCCeEEE Confidence 99999987765 355555665 688999999776 67799998763 346777899999999999999999 Q ss_pred EEecC----CeEEEEcceecC--CCcccceeeehhhhhHHHHHHHHHHH-HHHhcCCCCHHHHHHHHHHHHHHHHHHHh- Q lcl|NC_012740. 532 IGAGG----EGFILMGDKTAT--TVPSPFDRINVRRLFNMLKKNIGDSS-KYKLFENNDNFTRASFRMEVSQYLSTIRS- 603 (667) Q Consensus 532 ~~~~~----~G~~~wG~rT~~--~~~~~~~~i~vrR~~~~i~~~l~~~~-~~~v~epn~~~~~~~i~~~i~~~l~~l~~- 603 (667) ...++ ++++++.+.|.. .++..|++|+++|++|+|.+.+++.. ++|++|+|++..|.+++..+..||..+|+ T Consensus 472 ~~~~~~~~~~~vrIv~~ItT~t~~~~~~~~~i~viRv~D~i~~dir~~~~~~yIGk~nnd~~~~~vk~~i~~~L~~~~l~ 551 (607) T protein:vir:10 472 EHLVNRNATGGYYIVQDVSTNTVSSSHVDGSLYLGELTDFLFDNLRFVLRDTYIGSNIRSTSADDIKSTVASYLYSEMNN 551 (607) T ss_pred EEccCccccceEEEeeeeeeccCCCCcchheeehhhhHHHHHHHHHHHHhhcCCcccCCcchHHHHHHHHHHHHHHHHHH Confidence 76553 368888877763 33468999999999999999999876 58999999999999999999999976655 Q ss_pred -cCCeeeeEEEEcccCCCHHHhhCCeEEEEEEEEecCCceEEEEEEEEeecCeeHHHHHHH Q lcl|NC_012740. 604 -LGGIYDFRVQCDTTNNTPDVIDRNEFVASMFIKPAKSINYIMLNFTAVATGADFDEIIGP 663 (667) Q Consensus 604 -~gal~g~~v~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~~~ 663 (667) .|+|.+|.. .+-+-..+..+++|++.++|+.++|+|.+++......++-+.--.. T Consensus 552 ~~gaI~df~~-----edv~v~~~~D~v~v~~~v~Pv~~iekIyvtv~v~~~~~~~~~~~~~ 607 (607) T protein:vir:10 552 DDGLIVDFSE-----SDIVVTISGTVVYIQFAVAPTQEIKNIVVSGTYSNYSATSEDNTTK 607 (607) T ss_pred hcCceeCCCc-----cccEEeeCCCEEEEEEEEEEcccceEEEEEEEEEEEEEeeccCCCC Confidence 699999752 1222234566899999999999999999998888777664432222 No 43 >protein:vir:102957 Length: 437 # NCBI annotation: sheath tail protein # Family: family:all:632 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945292;genbank:gi:39653727;uniprot:Q708M0;genbank:GeneID:2672871 Probab=100.00 E-value=3.6e-53 Score=308.09 Aligned_cols=417 Identities=14% Similarity=0.163 Sum_probs=277.3 Q ss_pred CceecCceEEEEecCCCccccc-CCCceEEEeeccCCCCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHHcCCCeEE Q lcl|NC_012740. 1 MTLLSPGFETKETTLSTTIVQS-ATGRAALVGKFQWGPAFQIVQVTNEVELVNKFGQPDNNTADYFMSGANFLQYGNDLR 79 (667) Q Consensus 1 ~~~~~PGVyvee~~~~~~~~~~-~ts~~afvG~~~~Gp~~~p~~i~s~~e~~~~FG~~~~~~~~~~~v~~~f~ngG~~~~ 79 (667) .+-.-|||||||++.+.+++++ .|++++|+|.++|||+++|++|+||.||++.||.... +..+.+..+|++||++|| T Consensus 9 ~~k~~PGvYi~~~~~~~~~i~~~~~~~~a~~~~~~~Gp~~~~~~i~s~~d~~~~fG~~~~--~~~~~~~~~~~~g~~~~~ 86 (437) T protein:vir:10 9 QNKVRPGAYINVKSKDIAMTRLGGDGVVTVPLALSFGQSKKLMKIRRGEDLFKKLGYEQE--SPQLLLLNEAFKRVSEVL 86 (437) T ss_pred cceecCceeEEEecCCcceeeccCCcEEEEEEEecCCCCceeEEEecHHHHHHHcCCccc--hhHHHHHHHHhcCCCEEE Confidence 8889999999999999887765 6999999999999999999999999999999997543 445555566779999999 Q ss_pred EEEcCCcccccccccccccccceeeeccccccccceeeEeeeccceeccccceeecccccceeeeecccccccccccccc Q lcl|NC_012740. 80 VVRVLNKEKAKNATALAGNVEFEITNEGSNYEVGDTIKIKHNRQDIETAGKVTKVDGDGKVKGVFIPTGKIIAHAKAIGV 159 (667) Q Consensus 80 vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~a~~~~~ 159 (667) ++|+.++.. |+ . + + T Consensus 87 ~~R~~~g~~---a~---------~-------t----l------------------------------------------- 100 (437) T protein:vir:10 87 LYRLNTGEK---AN---------V-------S----L------------------------------------------- 100 (437) T ss_pred EEECCCCce---ee---------E-------e----e------------------------------------------- Confidence 999854310 00 0 0 0 Q ss_pred cccccccceEEEEEeecccccceeeeceeeeceeeeeeccccchhhhccccccccccccceeeeeeccccccceeEEEEe Q lcl|NC_012740. 160 YPELDGGWTAEFTSSSGNGSAALSVTKIVTDSGLLLTDLETSRANITNQDFLTKLKKYDMPAVSAIYAGEIGNSLEVEIL 239 (667) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~t~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~A~~~G~~gn~i~v~i~ 239 (667) .+...+.|.++|.|||.+++.+. T Consensus 101 ---------------------------------------------------------~~~~~~~A~~~G~~gn~i~v~v~ 123 (437) T protein:vir:10 101 ---------------------------------------------------------SDNVTAQAKYSGVRGNDITVTVK 123 (437) T ss_pred ---------------------------------------------------------ccceEEEeccCCcccceeEEEEe Confidence 00012357789999999988775 Q ss_pred ecccccccceeeeeeeecccccccceeeeeccccccccceeeeeccceeeeeEeeeccCCccccccccccchhhhccccc Q lcl|NC_012740. 240 ARSSFSGAVAPELTMYPFGGTRAAARNLIPYAPQNDNQYAFIVRRDGVVVESYVLSTLKGDKDVYGNSIYMDDFFARGSS 319 (667) Q Consensus 240 ~~a~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~v~e~~~~s~~~~~~~~~~~~~~~~~~~~~~~s 319 (667) ...+. ...+.+.+......++...+....+.. .+ T Consensus 124 ~~~~d------------------------------~~~~~v~~~~~~~~~d~~~v~~~~~~~----------------~n 157 (437) T protein:vir:10 124 TNVDD------------------------------PSSFDVVTFLDTVVMDLQTVKVLADLK----------------NN 157 (437) T ss_pred eccCC------------------------------ccceEEEEecCcceeeeeehhhhhhhh----------------hh Confidence 43211 112233333333333332222111100 01 Q ss_pred ceEEEecccccCcccceEEecCCccccccccccccccccccchhHHHHHHhhhcccccccEEecCcCCcchhhHHHHHHH Q lcl|NC_012740. 320 QYIYATAQGWVDGFSGIISLAGGVSANEASTGDRGNDPFIGAMMQGWDLFAERESIHVNLLIAGACAGEGDAFSTVQKHA 399 (667) Q Consensus 320 ~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~v~~~~ 399 (667) .++...... .........|.+|.++.. ...++.+++..+ +.++++.+++|.. ..++++++ T Consensus 158 ~~v~~~~~~-~l~~~a~~~LtGG~dg~~----------t~~dy~~al~~l---e~~~~n~l~~~~~------d~~~~t~~ 217 (437) T protein:vir:10 158 ALVEFSGTG-ELQPVAGAKLTGGTDGAI----------STQDYLEYFKAL---ETVEFNYMALPVE------DASIKKAA 217 (437) T ss_pred ccccccccc-ccccccceeeeccccCCC----------ChhHHHHHHHHh---ccCcceEEEecCC------ChhHHHHH Confidence 111111110 011122245777776532 123455666655 4457888888752 34677888 Q ss_pred HHHHhhcCc----E-EEEEccCccccccccccCCHHHHHHHhhhhccccccccccCcceEEEEehhhcccccccCceeEe Q lcl|NC_012740. 400 VSIGDERQD----C-LVMVSPPRSTVVNIPVTTAIDNLIAWREGNSNYSDNNMNINTTYAVIDGNYKYQYDKYNDVNRWV 474 (667) Q Consensus 400 ~~~~~~~~~----~-~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~v~d~~~~~~~~~ 474 (667) .+||+++|+ . .+++..+ . .+....+-+.+.....|. ...--. T Consensus 218 ~~~ik~~r~~~g~~~~~V~~~~---------~----------------------~d~e~Iin~~n~~~~~~~--~~~~~~ 264 (437) T protein:vir:10 218 INFIKRMREDEGLGAQLVVADS---------D----------------------ADSEAVINVKNGVILSDK--TVIDKT 264 (437) T ss_pred HHHHHHHHhccCceEEEEeCCC---------C----------------------CCCceEEEeecceeecCc--ceechh Confidence 899887654 2 3333221 0 011112222222222221 000011 Q ss_pred chHHHHHHHHHHhhhcCCceeeecceeccceeccccccccCChhhhhhhhhcCceEEEEecCCeEEEEcceecC----CC Q lcl|NC_012740. 475 PLAADIAGLCARTDAVSQPWMSPAGYNRGQIMNVVKLAIEPRKAHRDRLYQAAINPVIGAGGEGFILMGDKTAT----TV 550 (667) Q Consensus 475 p~sg~vAg~~a~~d~~~g~~~span~~~~~i~g~~~~~~~~~~~e~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~----~~ 550 (667) -.++.+||++|..+ +++|+.|+.+. ++..+...++++|++.|.++|+.++.+..++-+.++|-.|+. .. T Consensus 265 ~~~a~vAG~~Ag~~----~~~S~t~~~~~---~~~~v~~~~t~~e~~~~i~~G~~vl~~~~~~v~i~~gInTltt~~~~~ 337 (437) T protein:vir:10 265 KATVWVAAASANAG----VEKSLTYEKYE---DSVDVVGRLSHTETEDALLKGQFVFTARRGRAVVEQDINSHVSFTIEK 337 (437) T ss_pred hHHHHHHHHhccCc----cccCccccccC---CcccccccCCHHHHHHHHhCCcEEEEEeCCeEEEEEccccccccCCCC Confidence 23577899998775 67799998754 555677789999999999999999977654445557776754 23 Q ss_pred cccceeeehhhhhHHHHHHHHHHHH-HHhcC-CCCHHHHHHHHHHHHHHHHHHHhcCCeeeeEEEEcccCCCHHHhhCCe Q lcl|NC_012740. 551 PSPFDRINVRRLFNMLKKNIGDSSK-YKLFE-NNDNFTRASFRMEVSQYLSTIRSLGGIYDFRVQCDTTNNTPDVIDRNE 628 (667) Q Consensus 551 ~~~~~~i~vrR~~~~i~~~l~~~~~-~~v~e-pn~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~nt~~~i~~G~ 628 (667) +.+|++|.++|++|+|.+.|++..+ +|+++ |||...|..++..|..||.+|+++|+|.+|.+...+..+.. .... T Consensus 338 ~~~~~ki~vir~~D~i~~di~~~~~~~yiGk~~N~~~~r~~~~~~i~~yl~~l~~~g~I~~~~~~d~~v~~~~---~~~~ 414 (437) T protein:vir:10 338 NQDFRKNRILRTLDDIVNDTRYAFSEYFLGKVSNNEDGRQAFKANRIRYFKDLEARGAIEDFKVEDIEVLRGE---LKES 414 (437) T ss_pred CchhhhhhHHHHHHHHHHHHHHHHHhccccccCCCHHHHHHHHHHHHHHHHHHHhCCCccCCCceeEEeecCC---CCCE Confidence 4689999999999999999999876 59997 79999999999999999999999999999987765543322 3468 Q ss_pred EEEEEEEEecCCceEEEEEEEEe Q lcl|NC_012740. 629 FVASMFIKPAKSINYIMLNFTAV 651 (667) Q Consensus 629 ~~~~i~~~p~~p~e~i~~~~~~~ 651 (667) +++.+.++|+.+||+|.+++.-. T Consensus 415 v~v~~~v~~~dame~iy~ti~v~ 437 (437) T protein:vir:10 415 VVVNVKVKPVDSMEKLYMTVTVE 437 (437) T ss_pred EEEEEEEEEeeeeeeEEEEEEec Confidence 89999999999999999998877 No 44 >protein:vir:101326 Length: 529 # NCBI annotation: gp22 # Family: family:all:9453 # MgeID: mge:1512 # MgeName: P1 # Cross-refs: genbank:acc:YP_006525;genbank:gi:46401679;genbank:GeneID:2777386 Probab=100.00 E-value=4.4e-48 Score=280.22 Aligned_cols=487 Identities=14% Similarity=0.117 Sum_probs=318.0 Q ss_pred Cc-e-------ecCceEEEEecCCCcc---cccCCCceEEEeeccCCCCCccEEec--CHHHHHHHcCCcCccchhHHHH Q lcl|NC_012740. 1 MT-L-------LSPGFETKETTLSTTI---VQSATGRAALVGKFQWGPAFQIVQVT--NEVELVNKFGQPDNNTADYFMS 67 (667) Q Consensus 1 ~~-~-------~~PGVyvee~~~~~~~---~~~~ts~~afvG~~~~Gp~~~p~~i~--s~~e~~~~FG~~~~~~~~~~~v 67 (667) |. | -.-||.|.+++.-... ++..+++.|+||.|+||++++|.+|+ +|.+|+-.++++....+..+.+ T Consensus 1 ~~~ysi~q~ig~aSGvav~pi~~d~t~~~~~g~g~~v~a~Vgif~RG~i~k~~~Vt~~n~~~~LGep~~~~~ga~~E~~~ 80 (529) T protein:vir:10 1 MSQYSIQQSLGNASGVAVSPINADATLSTGVALNSSLWAGIGVFARGKPFTVLAVTESNYEDVLGEPLKPSSGSQFEPIR 80 (529) T ss_pred CCceehhhhhhhhcccccCCcCcccccchheecCceEEEEEEEeecCCCcceEEEchhHHHHHhccccCCCcchhhhhHh Confidence 22 1 1459999998844322 33368999999999999999999999 7999999999999999999999 Q ss_pred HHHHHcCCCeEEEEEcCCcccccccccccccccceeeeccccccccceeeEeeeccceeccccceeecccccceeeeecc Q lcl|NC_012740. 68 GANFLQYGNDLRVVRVLNKEKAKNATALAGNVEFEITNEGSNYEVGDTIKIKHNRQDIETAGKVTKVDGDGKVKGVFIPT 147 (667) Q Consensus 68 ~~~f~ngG~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~ 147 (667) ..|+.-+++.||||||+..+. +... +.+. .+.. T Consensus 81 h~~eA~~~~s~yVVRvv~~da-k~p~----------------------i~~~--------------~~~~---------- 113 (529) T protein:vir:10 81 HVYEAIQQTSGYVVRAVPDDA-KFPI----------------------IMFD--------------ESGE---------- 113 (529) T ss_pred hhhhhhcCCceEEEEEccccc-CCce----------------------EEec--------------CCcc---------- Confidence 999998788899999875432 1110 0000 0000 Q ss_pred cccccccccccccccccccceEEEEEeecccccceeeeceeeeceeeeeeccccchhhhccccccccccccceeeeeecc Q lcl|NC_012740. 148 GKIIAHAKAIGVYPELDGGWTAEFTSSSGNGSAALSVTKIVTDSGLLLTDLETSRANITNQDFLTKLKKYDMPAVSAIYA 227 (667) Q Consensus 148 ~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~A~~~ 227 (667) .....+..+.+ .. T Consensus 114 ----------~~~s~~~~s~~---------------------------------------------------------~~ 126 (529) T protein:vir:10 114 ----------PAYSALPYGSE---------------------------------------------------------IE 126 (529) T ss_pred ----------chhhccccccc---------------------------------------------------------cc Confidence 00000000000 00 Q ss_pred ccccceeEEEEeecccccccceeeeeeeecccccccceeeeecccccccc----ceeeeec-cceeeeeEeeeccCCccc Q lcl|NC_012740. 228 GEIGNSLEVEILARSSFSGAVAPELTMYPFGGTRAAARNLIPYAPQNDNQ----YAFIVRR-DGVVVESYVLSTLKGDKD 302 (667) Q Consensus 228 G~~gn~i~v~i~~~a~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~----~~~~v~~-~g~v~e~~~~s~~~~~~~ 302 (667) ..+|+.+.+.+... +.-....-..+.... ......... ++.++.. .+..+|+|.+|...++++ T Consensus 127 l~~G~~~~iy~~Dg-d~~~s~~~~l~i~~~-----------~ads~g~e~~~l~~~~~~~~g~~~~let~~~sl~~~a~d 194 (529) T protein:vir:10 127 LDSGEAFAIYVDDG-DPCISPTRELTIETA-----------TADSAGNERFLLKLTQTTSLGVVTTLETHTVSLAEEAKD 194 (529) T ss_pred ccccceEEEEEecC-cCccCCceEEEEEee-----------ccccCCCccceeeEEEEeecCCceEEEEEEeeeeechhh Confidence 11111111111100 000000000000000 000001111 2233332 356779999999999999 Q ss_pred cccccccchhhhcccccceEEEecccccCcc----cceEEecCCccccccccccccccccc--cchhHHHHHHhhhcccc Q lcl|NC_012740. 303 VYGNSIYMDDFFARGSSQYIYATAQGWVDGF----SGIISLAGGVSANEASTGDRGNDPFI--GAMMQGWDLFAERESIH 376 (667) Q Consensus 303 ~~~~~~~~~~~~~~~~s~~v~~~~~~~~~~~----~~~~~~~~g~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~ 376 (667) ..+...|+.+.+.+....+.-+.......-. ....++.+|+|+ +.... .++..++.+|... .++ T Consensus 195 d~G~~~yl~svle~~s~~l~ai~~~e~~~t~~~~t~~d~~f~~GtdG---------~~~~i~s~~y~~A~~~L~n~-p~d 264 (529) T protein:vir:10 195 DMGRLCYLPTALEARSKYLRAVVNEELISTAKVTNKKSLAFTGGTNG---------DQSKISTAAYLRAVKVLNNA-PYM 264 (529) T ss_pred hcCCccchhHHHhhccCceeeeeeeccccccchhhhhhhhccCCccc---------cccccchHHHHHHHHHhcCC-cce Confidence 9999999999886654444332222111100 011245555554 33322 3366677777643 456 Q ss_pred cccEEecCcCCcchhhHHHHHHHHHHHhhc-CcEEEEEccCccccccccccCCHHHHHHHhhhhccccccccccCcce-E Q lcl|NC_012740. 377 VNLLIAGACAGEGDAFSTVQKHAVSIGDER-QDCLVMVSPPRSTVVNIPVTTAIDNLIAWREGNSNYSDNNMNINTTY-A 454 (667) Q Consensus 377 ~~~l~~~~~~~~~~~~~~v~~~~~~~~~~~-~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~-~ 454 (667) +..++..+ +..+++..+|+.+|+++ +++| .|+| +.+|+.++++|+++.+..... +-+ + T Consensus 265 ~~~il~~g-----~y~~a~I~~L~~ic~~~~~d~f--~DV~--------~~LT~~aA~~~~e~~gl~~~~-----~~~~s 324 (529) T protein:vir:10 265 YTAVLGLG-----CYDNAAITALGKICADRLIDGF--FDVK--------PTLTYAEALPAVEDTGLLGTD-----YVSCS 324 (529) T ss_pred eeeeeccC-----CccHHHHHHHHHHHhhhhhcEE--EcCC--------CCcCHHHHHHHHHhcCccccC-----ceeeE Confidence 66666554 35678899999999764 4443 3654 678999999999987754322 222 3 Q ss_pred EEEehhhcccccccCceeEechHHH--HHHH--HHHhhhcCCceeeecceeccceecc-ccccccCChhhhhhhhhcCce Q lcl|NC_012740. 455 VIDGNYKYQYDKYNDVNRWVPLAAD--IAGL--CARTDAVSQPWMSPAGYNRGQIMNV-VKLAIEPRKAHRDRLYQAAIN 529 (667) Q Consensus 455 ~~~~p~~~v~d~~~~~~~~~p~sg~--vAg~--~a~~d~~~g~~~span~~~~~i~g~-~~~~~~~~~~e~~~Ln~~gIn 529 (667) .++|||. .-||.++.....++||. .|+. .++.....|.|++|||+.++.|.-. +.+-+..++.|...|-.++|| T Consensus 325 ~y~~P~~-~~D~~tg~k~~~GlsG~A~~akargv~~na~v~g~hY~pAGe~r~~inr~~I~~ly~~d~~e~~~lv~~riN 403 (529) T protein:vir:10 325 VYHYPFS-CKDKWTQSRVVFGLSGVAYAAKARGVKKNSDVGGWHYSPAGEERAVIARASIQPLYPEDTPDEEAMVKGRLN 403 (529) T ss_pred EEEccee-eccccccCceeeCCCcceeeccccceeecccccccccccCCCccceeecccceeccCCCccCHHHHHhhccC Confidence 4667887 89999999999999994 3332 1333334445999999987645331 234466777788889999999 Q ss_pred EEEEecCCeEEEEcceecCCCcccceeeehhhhhHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhcCCeee Q lcl|NC_012740. 530 PVIGAGGEGFILMGDKTATTVPSPFDRINVRRLFNMLKKNIGDSSKYKLFENNDNFTRASFRMEVSQYLSTIRSLGGIYD 609 (667) Q Consensus 530 ~i~~~~~~G~~~wG~rT~~~~~~~~~~i~vrR~~~~i~~~l~~~~~~~v~epn~~~~~~~i~~~i~~~l~~l~~~gal~g 609 (667) ++..-.++++.+-.+.|++..++.|||+|+++||++|++.+.+..+|.+||||+..+|. +++.++.+|..+|+.|+|++ T Consensus 404 PV~~~~~g~~~idDsLt~~~knny~R~~hv~~lmn~I~~~~~k~a~~~~~~Pd~it~~g-l~~~l~~~L~r~~asgalv~ 482 (529) T protein:vir:10 404 KVSVGTSGQMIIDDALTCCTQDNYLHFQHVPSLMNAISRFFVQLARQMKHSPDGITAAG-LTKGMTKLLDRFVASGALVA 482 (529) T ss_pred eeeeeccCcceeeeeeceeeeCCchhhhhHHHHHHHHHHHHHHHHHHHhhCCChHHHHH-HHHhHHHHHHHHHhcCceec Confidence 99776666666656666665678999999999999999999999999999999999987 99999999999999999986 Q ss_pred -----------eEEEEcccCCCHHHhhCCeEEEEEEEEecCCceEEEEEEEEee Q lcl|NC_012740. 610 -----------FRVQCDTTNNTPDVIDRNEFVASMFIKPAKSINYIMLNFTAVA 652 (667) Q Consensus 610 -----------~~v~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~ 652 (667) |.+++ +|.|+ +++.|++.++|.-.+++|.+.=...+ T Consensus 483 prdp~~~G~epy~~~V-----~q~d~--D~~~v~~~~~ptGv~Rri~~~p~l~~ 529 (529) T protein:vir:10 483 PRDPDADGTEPYVLKV-----TQAEF--DKWEVVWACCPTGVARRIQGVPLLIK 529 (529) T ss_pred ccCccCCCCCceEEEE-----eeccc--CeEEEEEEeecCCceeeEEeeeeecC Confidence 66665 34444 79999999999999999988644444 No 45 >protein:vir:105470 Length: 451 # NCBI annotation: putative phage sheath tail protein # Family: family:all:632 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529880;genbank:gi:90592620;genbank:GeneID:3974534 Probab=100.00 E-value=6.2e-43 Score=251.96 Aligned_cols=424 Identities=15% Similarity=0.116 Sum_probs=274.2 Q ss_pred CceecCceEEEEecCCCccc-ccCCCceEEEeecc-CCCCCccEEecCHHHHHHHcCCcCccchhHH-HHHHHHHcCCCe Q lcl|NC_012740. 1 MTLLSPGFETKETTLSTTIV-QSATGRAALVGKFQ-WGPAFQIVQVTNEVELVNKFGQPDNNTADYF-MSGANFLQYGND 77 (667) Q Consensus 1 ~~~~~PGVyvee~~~~~~~~-~~~ts~~afvG~~~-~Gp~~~p~~i~s~~e~~~~FG~~~~~~~~~~-~v~~~f~ngG~~ 77 (667) .+=.-|||||||++++.+++ ++++++++|+|.+. ||| ++|+.|.|++||++.||...... .+ +++ +|++||++ T Consensus 9 ~~K~~PGvYi~~~~~~~~~~~~~~~~~~~~i~~~~~~g~-~~~v~i~~~~d~~~~fG~~~~~~--~~~~~~-~~~~g~~~ 84 (451) T protein:vir:10 9 QDKRRPGTYINVVGNGQREAASSLGRVLLIRDKGLGWGK-NGVIEVEANSDFTKKLGTTLDDP--SLTALK-ETLKGASK 84 (451) T ss_pred ceeecCceEEEEeccCcceeeccCCcEEEEEeeecCCCC-cccEEeecHHHHHHHcCCcccch--hHHHHH-HHhcCCcE Confidence 66679999999999987665 56899999999764 566 78999999999999999754332 33 454 45578999 Q ss_pred EEEEEcCCcccccccccccccccceeeeccccccccceeeEeeeccceeccccceeecccccceeeeecccccccccccc Q lcl|NC_012740. 78 LRVVRVLNKEKAKNATALAGNVEFEITNEGSNYEVGDTIKIKHNRQDIETAGKVTKVDGDGKVKGVFIPTGKIIAHAKAI 157 (667) Q Consensus 78 ~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~a~~~ 157 (667) ||+.|+.+++.+. ++ + . . T Consensus 85 v~~yrl~~g~~a~-~t----------------------~------------------~---------------------~ 102 (451) T protein:vir:10 85 VLVLNPNEGTAAT-LT----------------------K------------------E---------------------G 102 (451) T ss_pred EEEEEcCCCceEE-EE----------------------e------------------e---------------------c Confidence 9999985432100 00 0 0 0 Q ss_pred cccccccccceEEEEEeecccccceeeeceeeeceeeeeeccccchhhhccccccccccccceeeeeeccccccceeEEE Q lcl|NC_012740. 158 GVYPELDGGWTAEFTSSSGNGSAALSVTKIVTDSGLLLTDLETSRANITNQDFLTKLKKYDMPAVSAIYAGEIGNSLEVE 237 (667) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~A~~~G~~gn~i~v~ 237 (667) ....+.|.++|.|||.+++. T Consensus 103 ------------------------------------------------------------~~~~~~Aky~G~~Gn~i~v~ 122 (451) T protein:vir:10 103 ------------------------------------------------------------LPWTVTANYPGEKGNQITVS 122 (451) T ss_pred ------------------------------------------------------------CceEEEEeeCCcCCceEEEE Confidence 00013578999999999998 Q ss_pred EeecccccccceeeeeeeecccccccceeeeeccccccccceeeeeccceeeeeEeeeccCCccccccccccchhhhccc Q lcl|NC_012740. 238 ILARSSFSGAVAPELTMYPFGGTRAAARNLIPYAPQNDNQYAFIVRRDGVVVESYVLSTLKGDKDVYGNSIYMDDFFARG 317 (667) Q Consensus 238 i~~~a~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~v~e~~~~s~~~~~~~~~~~~~~~~~~~~~~ 317 (667) +....+ ....+.+.+..+...++...++...- .... T Consensus 123 v~~~~~------------------------------d~~~~~v~t~~g~~~vd~qtv~~~~~--------------~el~ 158 (451) T protein:vir:10 123 VEVSPA------------------------------DQNAATVSTIFGTKLVDEQSIKFNEL--------------DKFK 158 (451) T ss_pred EecccC------------------------------CcCceEEEEEECCeEEEEEEeeccch--------------hhcc Confidence 754221 12334455555555555433221110 0011 Q ss_pred ccceEEEeccc-ccCcccceEEecCCccccccccccccccccccchhHHHHHHhhhcccccccEEecCcCCcchhhHHHH Q lcl|NC_012740. 318 SSQYIYATAQG-WVDGFSGIISLAGGVSANEASTGDRGNDPFIGAMMQGWDLFAERESIHVNLLIAGACAGEGDAFSTVQ 396 (667) Q Consensus 318 ~s~~v~~~~~~-~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~v~ 396 (667) .+.++.+.... ..........+.++.++.... . ...++.. .+...+.+.++.+++|+... ...++ T Consensus 159 ~nd~V~a~~~~~g~~~~~~~~~l~~~~~gg~~~-----~--~~~~~~~---~l~~~e~~~~n~l~~~~~~~----~~~i~ 224 (451) T protein:vir:10 159 GNDYITAKVVEEGSSKPVAFTNVSGTLTGGTTT-----E--SNKVESL---LNDALENEEYAVVTTAGFEP----SSNMN 224 (451) T ss_pred CCceEEEEecccccccceeeeeccccccccccc-----C--CccchHH---HHHHhccceeeEEEEccCCC----chHHH Confidence 24455443221 111222223344332221111 0 1122233 34445667788888876532 24567 Q ss_pred HHHHHHHhhcCc-----EEEEEccCccccccccccCCHHHHHHHhhhhccccccccccCcceEEEEehhhcccccccCce Q lcl|NC_012740. 397 KHAVSIGDERQD-----CLVMVSPPRSTVVNIPVTTAIDNLIAWREGNSNYSDNNMNINTTYAVIDGNYKYQYDKYNDVN 471 (667) Q Consensus 397 ~~~~~~~~~~~~-----~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~v~d~~~~~~ 471 (667) ..+.++|+++|+ +.+++..+.. +. +|....+-+.......| + T Consensus 225 ~~~~a~ik~~r~~~g~~~~aVl~~~~~--------~~--------------------~d~egiinv~n~~~~~d---g-- 271 (451) T protein:vir:10 225 KLVVEAVKRLRENEGRKVRGVIPTDAD--------TT--------------------YNYEGISTVVNGYTLSD---G-- 271 (451) T ss_pred HHHHHHHHHHHHhcCCeEEEEecCccC--------CC--------------------CCCcceEEeecceEecC---c-- Confidence 778888887653 3566643211 00 12222233333332222 1 Q ss_pred eEech---HHHHHHHHHHhhhcCCceeeecceeccceeccccccccCChhhhhhhhhcCceEEEEecCCeEEE-Ecceec Q lcl|NC_012740. 472 RWVPL---AADIAGLCARTDAVSQPWMSPAGYNRGQIMNVVKLAIEPRKAHRDRLYQAAINPVIGAGGEGFIL-MGDKTA 547 (667) Q Consensus 472 ~~~p~---sg~vAg~~a~~d~~~g~~~span~~~~~i~g~~~~~~~~~~~e~~~Ln~~gIn~i~~~~~~G~~~-wG~rT~ 547 (667) ..+++ ++.+||++|..+ +.+|+.|+.+. ++.++...++++|++.+.++|..+++...++++++ +|-.|+ T Consensus 272 ~~~~~~~~~~~vAG~~Ag~~----~~~S~T~~~~~---~~~~v~~~~t~~e~~~~i~~G~lvl~~~~g~~v~i~~~INTl 344 (451) T protein:vir:10 272 TNVDVKDATGYFAGISASAD----VATSLTYFEVE---DAVSAYPKFDNEKTIKALDAGQIVFTTRPGQRVVIEQDINSL 344 (451) T ss_pred eeechhhhHHHHHHHHcccc----cccCccceecC---CceeeeeeCCHHHHHHHHhCCeEEEEEEcCCeEEEEEccccc Confidence 11232 578899999875 56699998754 56667788999999999999999887667777765 676665 Q ss_pred C----CCcccceeeehhhhhHHHHHHHHHHHHH-HhcC-CCCHHHHHHHHHHHHHHHHHHHhcCCeeeeEEEEcccCCCH Q lcl|NC_012740. 548 T----TVPSPFDRINVRRLFNMLKKNIGDSSKY-KLFE-NNDNFTRASFRMEVSQYLSTIRSLGGIYDFRVQCDTTNNTP 621 (667) Q Consensus 548 ~----~~~~~~~~i~vrR~~~~i~~~l~~~~~~-~v~e-pn~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~nt~ 621 (667) . ..+.+|++|.++|++|+|.+.+++..+. |+++ |||..-|..++..|..||.+|++.|+|..|... |.+- . T Consensus 345 tt~~~~k~~~~~ki~vir~~D~i~~di~~~~~~~yiGk~~N~~~gr~~~~~~i~~yl~~l~~~g~i~~~~~~-d~~v--~ 421 (451) T protein:vir:10 345 HKFTAEKPQAFSKNRVIRTLDEIATNTENTFERTYLGNVGNNAAGRDLFKADRIAYLTSLQNRNMIQSFANT-DITV--E 421 (451) T ss_pred eecCCCCCcchhhhhHHHHHHHHHHHHHHHhhhccceecCCCHHHHHHHHHHHHHHHHHHHhCCCccCCCcc-ceEE--e Confidence 3 2346899999999999999999999864 8885 699999999999999999999999999998631 2111 1 Q ss_pred HHhhCCeEEEEEEEEecCCceEEEEEEEEe Q lcl|NC_012740. 622 DVIDRNEFVASMFIKPAKSINYIMLNFTAV 651 (667) Q Consensus 622 ~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~ 651 (667) .-.....+++.+.++|+..||+|.+++.-. T Consensus 422 ~~~~~~~v~v~~~v~pvdame~iy~t~~v~ 451 (451) T protein:vir:10 422 AGNDMDSIVVNLAVTPVDAMEKLYMTMVVR 451 (451) T ss_pred ecCCCCEEEEEEEEEEEeeeeeEEEEEEEc Confidence 112356899999999999999999997766 No 46 >protein:vir:107310 Length: 581 # NCBI annotation: gp123 # Family: family:all:1196 # MgeID: mge:1550 # MgeName: Catera # Cross-refs: genbank:acc:YP_656131;genbank:gi:109393333;genbank:GeneID:4156791 Probab=100.00 E-value=1.3e-36 Score=217.32 Aligned_cols=536 Identities=12% Similarity=0.041 Sum_probs=235.3 Q ss_pred ccEEecC---HHHHHHHcCCcCcc--chhHHHH--HHHHH-cCCCeEEEEEcCCcccccccccccccccceeeecccccc Q lcl|NC_012740. 40 QIVQVTN---EVELVNKFGQPDNN--TADYFMS--GANFL-QYGNDLRVVRVLNKEKAKNATALAGNVEFEITNEGSNYE 111 (667) Q Consensus 40 ~p~~i~s---~~e~~~~FG~~~~~--~~~~~~v--~~~f~-ngG~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~ 111 (667) ..+.+.- -.||...|+.+.-. ...++++ .+.+- -.|.+|..-.+-..+.. +. ...+ +....+..++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~i~g~~~g~~g~~~s~~~~p~~~~~-~e---~q~v--~~~~~~t~Gt 74 (581) T protein:vir:10 1 MAIDFSQYQTPGVYTEAVGAPQLGIRSSVPTAVAIFGTAVGYQTYRESIRINPDTGET-IT---TQIL--ALVGEPTGGS 74 (581) T ss_pred CeeeeccccccchhhhhccccccceeeeeccccccccccccccccccccccCCCCCCc-cc---eEEE--EEEecCCCce Confidence 2222221 12334444433211 0111111 11111 11334433222111000 00 0000 0000000000 Q ss_pred ccceeeEeeeccceeccccceeecccccceeeeecccccccccc-cc--------cccccccccceEEEEEeecccccce Q lcl|NC_012740. 112 VGDTIKIKHNRQDIETAGKVTKVDGDGKVKGVFIPTGKIIAHAK-AI--------GVYPELDGGWTAEFTSSSGNGSAAL 182 (667) Q Consensus 112 ~g~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~a~-~~--------~~~~~~~~~~~~~~~~~~~~~~~~~ 182 (667) . ++ .+.+.....+.+..++.....+- .+ .+.+.....+.+++....+... T Consensus 75 F--tL----------------sf~G~tT~~I~~~asa~~v~~AL~~L~~i~~~~v~v~g~~g~~~~VtF~g~~~~l~--- 133 (581) T protein:vir:10 75 F--KL----------------SLAGEPTGNIPFNATQGQVQSALRALPNVEDDEVTVLGDPGGPWTVTFTKAVAALT--- 133 (581) T ss_pred E--EE----------------EeCceecccccccCCHHHHHHHHhccCCCCcceEEEECCCCceEEEEEcCCcccee--- Confidence 0 01 11100000000000000000000 00 0000111112222211100000 Q ss_pred eeeceeeeceeeeeeccccchhhhccccccccccccceeeeee--ccccccceeEEEEeecccccccceeeeeeeecccc Q lcl|NC_012740. 183 SVTKIVTDSGLLLTDLETSRANITNQDFLTKLKKYDMPAVSAI--YAGEIGNSLEVEILARSSFSGAVAPELTMYPFGGT 260 (667) Q Consensus 183 t~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~A~--~~G~~gn~i~v~i~~~a~~~~~~~~~~t~~~~~~~ 260 (667) ... ...+...... +.......+.+...+. ..|..+....++-...+.. ....+.+... T Consensus 134 -~~~------~~lt~g~~~~-------vtV~~~~~g~~~~~~~~s~~gi~~~~~~l~~~~~~~~----~~~gsd~~~~-- 193 (581) T protein:vir:10 134 -KDV------TGLTGGDDPD-------LNIASEQTGVPAMNRALAKKGIKTDTIRVVNPNSGQV----YVLGTDYVVT-- 193 (581) T ss_pred -eee------ceecCCCcee-------EEEeccccCcccccccccccccccccccccccccCcc----eeccccceee-- Confidence 000 0000000000 0000000000000000 0000000000000000000 0000000000 Q ss_pred cccceeeeeccccccccceeeeeccceeeeeEe---eeccCCcccccccccc-----chhh-------hcccccceEEEe Q lcl|NC_012740. 261 RAAARNLIPYAPQNDNQYAFIVRRDGVVVESYV---LSTLKGDKDVYGNSIY-----MDDF-------FARGSSQYIYAT 325 (667) Q Consensus 261 ~~~~~~~~~~~~~~~~~~~~~v~~~g~v~e~~~---~s~~~~~~~~~~~~~~-----~~~~-------~~~~~s~~v~~~ 325 (667) .............++..+++....|.++++.+ ++.....+.......+ ..++ ..+..+.+.... T Consensus 194 -~~~~~~~~~~~~~~D~~t~~~~~~g~~~~~~~v~~~~~~~~d~~~~~~v~~~~~~~~~~~~~~~~~~~g~~~~~~t~~~ 272 (581) T protein:vir:10 194 -RVNAGEDGEANTRDDLYTIQRVVDGGHIDPGDIVQLSYRYTDPNYHEVIRFTDPDDIQDFYGPAFDEAGNVQSEITLCA 272 (581) T ss_pred -ecccCccccccccccceeeeeeecccccccceEEEEEEEeecCCcceeEEeecCcchhhhhhhhhhccCccccchhhhh Confidence 00000000001111223333333443333221 1211111100000000 0000 001111111000 Q ss_pred cccccCcccceEEecCCccccccccccccccccccchhHHHHHHhhhcccccccEEecCcCCcchhhHHHHHHHHHHHhh Q lcl|NC_012740. 326 AQGWVDGFSGIISLAGGVSANEASTGDRGNDPFIGAMMQGWDLFAERESIHVNLLIAGACAGEGDAFSTVQKHAVSIGDE 405 (667) Q Consensus 326 ~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~v~~~~~~~~~~ 405 (667) ......+. ...|.++.++.. ......++.+++.+++.. +.+.+++|. ....+++.++.+||++ T Consensus 273 ~~~~tn~~--~~~l~~gvd~~g-------~tvt~~dy~~Al~ale~~---~~~~ivv~~-----t~~~~v~a~l~ahv~~ 335 (581) T protein:vir:10 273 QLAITNGA--STILACAVDPEG-------DTVTMGDYQNALNKFRDE---DEIAIIVAG-----TGAQPIQALVQQHVSA 335 (581) T ss_pred eeeeeccc--ceeEEeeccCCC-------CccchHHHHHHHHHHhcC---CceEEEEeC-----CCCHHHHHHHHHHHHH Confidence 00000011 112333333211 112344667777777653 444455554 3446788889999977 Q ss_pred cC----cEEEEEccCccccccccccCCHHHHHHHhhhhccccccccccCcceEEEEehhhccccccc-CceeEechHHHH Q lcl|NC_012740. 406 RQ----DCLVMVSPPRSTVVNIPVTTAIDNLIAWREGNSNYSDNNMNINTTYAVIDGNYKYQYDKYN-DVNRWVPLAADI 480 (667) Q Consensus 406 ~~----~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~v~d~~~-~~~~~~p~sg~v 480 (667) +. .+.++++.+. .+...+.+.+.+.. .++++.|.+++||+..+++... +....+|+ .++ T Consensus 336 ~s~~~~~~ravigV~g-----~~~~~~~~~~~~~a----------~~~n~~Rvvlv~p~~~~~~g~~~~~~v~lp~-y~~ 399 (581) T protein:vir:10 336 QSNNKYERRAILGMDG-----SVTPVPSATRIANA----------QSIKDQRVALISPSSFVYYAPELNREVVLGG-QFM 399 (581) T ss_pred HHhccCCcEEEEEecC-----CCCCccHHHHHHhh----------ccCCCceEEEEecCceeecCcccCceeccch-hhH Confidence 53 3445444321 11223334444332 2578999999999998888754 44455565 333 Q ss_pred HHHHHHhhhcCCceeeecceeccceeccccccccCChhhhhhhhhcCceEEEEecCCeEEEEcc-eecCCCcccceeeeh Q lcl|NC_012740. 481 AGLCARTDAVSQPWMSPAGYNRGQIMNVVKLAIEPRKAHRDRLYQAAINPVIGAGGEGFILMGD-KTATTVPSPFDRINV 559 (667) Q Consensus 481 Ag~~a~~d~~~g~~~span~~~~~i~g~~~~~~~~~~~e~~~Ln~~gIn~i~~~~~~G~~~wG~-rT~~~~~~~~~~i~v 559 (667) |+.+|.+....++++||.|+++.+ +.++...+++.|++.|+++|+++++..+++++++|.+ +|+.++ .+|++|++ T Consensus 400 AA~vAGl~a~~~~~~slT~~~i~g---i~~l~~~~s~~e~e~ll~~Gv~~l~~~~~~~v~Iv~gItT~~s~-~~~~~i~~ 475 (581) T protein:vir:10 400 AAAVAGKSVSAIAAMPLTRKVIRG---FSGPAEVQRDGEKSRESSEGLMVIEKTPRNLVHVRHGVTTDPTS-LHTREWNI 475 (581) T ss_pred HHHHHHHhhccccccCcccccccc---cccccccCCHHHHHHHHhCCeEEEEEecCCeEEEEeeeecCCCC-Ccceeeee Confidence 444444444555888999998654 5568888999999999999999999999999987555 565555 48999999 Q ss_pred hhhhHHHHHHHHHHHH--HHhcCCCCHHHHHHHHHHHHHHHHHHHhcCCeeeeEEEEcccCCCHHHhhCCeEEEEEEEEe Q lcl|NC_012740. 560 RRLFNMLKKNIGDSSK--YKLFENNDNFTRASFRMEVSQYLSTIRSLGGIYDFRVQCDTTNNTPDVIDRNEFVASMFIKP 637 (667) Q Consensus 560 rR~~~~i~~~l~~~~~--~~v~epn~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~~i~~~p 637 (667) ||++|++.+.+++.++ +|++|||++.+|.+|+..+..||.+||++|+|.||+. ...++.+.+.++++|+|.++| T Consensus 476 iR~~D~v~~~ir~~~~~~~fIG~~n~~~~r~~ik~~i~~~L~~l~~~g~I~~~~~----~~~~~~~~~~d~v~V~i~v~P 551 (581) T protein:vir:10 476 IGQQDVMVYRIRDYLDADGLIGMPIYDTTIVQVKASAEAALVWLVDNNIIRGYRN----LKARQIERQPDVIEVRYEWRP 551 (581) T ss_pred ehhhhHHHHHHHHHhhhhcCCCcccCHHHHHHHHHHHHHHHHHHHhcCcccCCcc----ceeeeeecCCCEEEEEEEEEe Confidence 9999999999999985 5888999999999999999999999999999999863 234667788999999999999 Q ss_pred cCCceEEEEEEEEeecCeeHHHH---HHHH Q lcl|NC_012740. 638 AKSINYIMLNFTAVATGADFDEI---IGPA 664 (667) Q Consensus 638 ~~p~e~i~~~~~~~~~~~~~~e~---~~~~ 664 (667) ++|+|||.+|++.....=.+.-- -... T Consensus 552 v~~i~~I~vti~~~p~~~~~~~~~~~~~~~ 581 (581) T protein:vir:10 552 AYPLNYIVVRYSIAPETGDITSTIEGTTSF 581 (581) T ss_pred cccceEEEEEEEEecCCCceEEEEeccccC Confidence 99999999998766432111100 0000 No 47 >protein:vir:7653 Length: 581 # NCBI annotation: gp124 # Family: family:all:1196 # MgeID: mge:148 # MgeName: Bxz1 # Cross-refs: genbank:acc:NP_818197;genbank:gi:29566631;genbank:GeneID:1259809 Probab=100.00 E-value=1.2e-36 Score=217.47 Aligned_cols=549 Identities=13% Similarity=0.030 Sum_probs=231.8 Q ss_pred cCC-cCccc-hhHHHHHHHHHcCCCeEEEE-EcCCcccccccccccccccceeeeccccccccceeeEeeecc--ceecc Q lcl|NC_012740. 54 FGQ-PDNNT-ADYFMSGANFLQYGNDLRVV-RVLNKEKAKNATALAGNVEFEITNEGSNYEVGDTIKIKHNRQ--DIETA 128 (667) Q Consensus 54 FG~-~~~~~-~~~~~v~~~f~ngG~~~~vv-Rv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~--~~~~~ 128 (667) .|- ...++ .-.|..+.-=..+|.+|.+- ++.-..... +..........+.+.+..-.+..... ...+. T Consensus 1 ~~~~~~~~~~~~~~t~~~~~~~~g~~~~~~~~~~i~g~~~-------g~~g~~~s~r~~p~~~~~~evq~v~~~~~~t~G 73 (581) T protein:vir:76 1 MAIDFSQYQTPGVYTEAVGAPQLGIRSSVPTAVAIFGTAV-------GYQTYRESIRINPDTGETITTQILALVGEPTGG 73 (581) T ss_pred CcccccccccchhhhhhccccccCcceeeeeeeeeccccc-------ccccccceeeecCCCCCCCceEEEEEeecCCcc Confidence 110 00000 00111110000233333220 000000000 00000000000000000000000000 00000 Q ss_pred ccceeecccccceeeeecccccccccccccccccccccceEEEEEeecccccceeeeceeeeceeeeeeccccchhhhcc Q lcl|NC_012740. 129 GKVTKVDGDGKVKGVFIPTGKIIAHAKAIGVYPELDGGWTAEFTSSSGNGSAALSVTKIVTDSGLLLTDLETSRANITNQ 208 (667) Q Consensus 129 ~~~~~~d~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~v~~~~~~~~~~~~~~~~~~~~ 208 (667) ...+.+.+.....+.+..++.....+ +...+... ...+.+.... +.....++............... ...-. T Consensus 74 ~ftLt~~g~tT~~I~~~asa~~v~~A--L~~L~~i~-~~~v~vtg~~-~~~~~V~F~g~~~~~~~~~~~lt--g~~~~-- 145 (581) T protein:vir:76 74 SFKLSLAGEPTGNIPFNATQGQVQSA--LRALPNVE-DDEVTVLGDP-GGPWTVTFTKAVAALTKDVTGLT--GGDNP-- 145 (581) T ss_pred eEEEEeCceeccccccCCCHHHHHHH--HhhccCCC-CceEEEEcCC-CceEEEEEcCCccceeEeeeeee--cCCcc-- Confidence 00011111000000000000000000 00000000 0000111000 00000000000000000000000 00000 Q ss_pred cccccccccccee--eeeeccccccceeEEEEeecccccccceeeeeeeecccccccceeeeeccccccccceeeeeccc Q lcl|NC_012740. 209 DFLTKLKKYDMPA--VSAIYAGEIGNSLEVEILARSSFSGAVAPELTMYPFGGTRAAARNLIPYAPQNDNQYAFIVRRDG 286 (667) Q Consensus 209 ~~~~~~~~~~~~~--~~A~~~G~~gn~i~v~i~~~a~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g 286 (667) .........+.+. +.-...|..++.+........ ...+..+.+........ ........++.++++...++ T Consensus 146 ~~~V~~~~~G~~~~~~~l~~~g~~~~~~~~~~s~~~----~~~~l~~~~~~~~~~~~---~~~~~~~~~~~~t~~~~~~g 218 (581) T protein:vir:76 146 DLNIASEQTGVPAMNRALAKKGIKTDTIRVVNPNSG----QVYVLGTDYVVTRVNAG---EDGEANTRDDLYTIQRVVDG 218 (581) T ss_pred eeEEEEEecCcCCcCceeeeccccccccceeecCCc----ceeeecccccceeeccC---cccceeeeeeeeeeEeeccc Confidence 0000000000000 000000111111111110000 00011000000000000 00000011111222222222 Q ss_pred eeeee---EeeeccCCccccccccccc------------hhhhcccccceEEEecccccCcccceEEecCCccccccccc Q lcl|NC_012740. 287 VVVES---YVLSTLKGDKDVYGNSIYM------------DDFFARGSSQYIYATAQGWVDGFSGIISLAGGVSANEASTG 351 (667) Q Consensus 287 ~v~e~---~~~s~~~~~~~~~~~~~~~------------~~~~~~~~s~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~ 351 (667) ..+.. +.++....++.......+. .+...+..+.+..........+. ...|.++.++.. T Consensus 219 ~~~~~~~i~~~~~~~~D~~~~~~v~~~~~~~~~~~~~~~~~~~g~~~~e~~~~~~~~~t~~~--~~~l~~gvd~~g---- 292 (581) T protein:vir:76 219 GHIDPGDIVQLSYRYTDPNYHEVIRFTDPDDIQDFYGPAFDEAGNVQSEITLCAQLAITNGA--STILACAVDPEG---- 292 (581) T ss_pred ccccceeEEEEEEEeecCCccceEEEecccccccceeeehhhcCccccchhhhhheeecccc--ceEEEeeecCCC---- Confidence 21111 0111111110000000000 00000111111110000011111 123444444321 Q ss_pred cccccccccchhHHHHHHhhhcccccccEEecCcCCcchhhHHHHHHHHHHHhhcC----cEEEEEccCccccccccccC Q lcl|NC_012740. 352 DRGNDPFIGAMMQGWDLFAERESIHVNLLIAGACAGEGDAFSTVQKHAVSIGDERQ----DCLVMVSPPRSTVVNIPVTT 427 (667) Q Consensus 352 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~v~~~~~~~~~~~~----~~~ai~d~p~~~~~~~~~~~ 427 (667) ......++.+++.+++.. +...+++|. ....++++++.+||++++ .+.++++.+. .+... T Consensus 293 ---~tvt~~dy~~aL~ale~~---~~~~ivvp~-----t~~~~i~a~l~ahv~~~s~~~~~~ra~igv~g-----~~~~~ 356 (581) T protein:vir:76 293 ---DTVTMGDYQNALNKFRDE---DEIAIIVAG-----TGAQPIQALVQQHVSAQSNNKYERRAILGMDG-----SVTPV 356 (581) T ss_pred ---CccchHHHHHHHHHHhcC---CeEEEEEec-----CCChHHHHHHHHHHHHHHhccCCceEEEEeeC-----CCCCc Confidence 122345677777777654 444445553 234568888888887653 3444443221 11223 Q ss_pred CHHHHHHHhhhhccccccccccCcceEEEEehhhcccccccCceeEechHHHHHHHHHHhhhcCCceeeecceeccceec Q lcl|NC_012740. 428 AIDNLIAWREGNSNYSDNNMNINTTYAVIDGNYKYQYDKYNDVNRWVPLAADIAGLCARTDAVSQPWMSPAGYNRGQIMN 507 (667) Q Consensus 428 ~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~v~d~~~~~~~~~p~sg~vAg~~a~~d~~~g~~~span~~~~~i~g 507 (667) +.+.+.+.. ..+++.|..++|||.++++...+......|..++|+.+|.+....++++||.|+++. | T Consensus 357 ~~~~~~~~a----------~~~ns~Rvvlv~p~~~~~~g~~~~~~~~lp~~~~AA~vAG~~a~~~~~~slT~~~i~---g 423 (581) T protein:vir:76 357 PSATRIANA----------QSIKDQRVALISPSSFVYYAPELNREVVLGGQFMAAAVAGKSVSAIAAMPLTRKVIR---G 423 (581) T ss_pred hHHHHHHhh----------cccCCCcEEEEEcCceEeccccCCcceecchhhhhhhHHhhhhccccccCccccccc---c Confidence 334444332 257899999999999998876443333334455666667777777899999999865 4 Q ss_pred cccccccCChhhhhhhhhcCceEEEEecCCeEEE-EcceecCCCcccceeeehhhhhHHHHHHHHHHHH--HHhcCCCCH Q lcl|NC_012740. 508 VVKLAIEPRKAHRDRLYQAAINPVIGAGGEGFIL-MGDKTATTVPSPFDRINVRRLFNMLKKNIGDSSK--YKLFENNDN 584 (667) Q Consensus 508 ~~~~~~~~~~~e~~~Ln~~gIn~i~~~~~~G~~~-wG~rT~~~~~~~~~~i~vrR~~~~i~~~l~~~~~--~~v~epn~~ 584 (667) +.++...+++.|++.|+++|+++++.++++++++ ||-+|+.++ .+|++|++||++|++++.+++.++ +|++|||++ T Consensus 424 ~~~~~~~~s~~e~e~ll~~Gv~~l~~~~~~~v~Iv~gItT~~s~-~~~k~i~viR~~D~v~~~vr~~~~~~~fiG~~n~~ 502 (581) T protein:vir:76 424 FSGPAEVQRDGEKSRESSEGLMVIEKTPRNLVHVRHGVTTDPTS-LHTREWNIIGQQDVMVYRIRDYLDADGLIGMPIYD 502 (581) T ss_pred cccccccCCHHHHHHHHhCCeEEEEEecCCeEEEEEeeecCCCC-CccceeeehhhhHHHHHHHHHHHhhhcCCCcccCh Confidence 5568888999999999999999999999999986 666777665 489999999999999999999976 588899999 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCeeeeEEEEcccCCCHHHhhCCeEEEEEEEEecCCceEEEEEEEEeec--Cee-HHHHH Q lcl|NC_012740. 585 FTRASFRMEVSQYLSTIRSLGGIYDFRVQCDTTNNTPDVIDRNEFVASMFIKPAKSINYIMLNFTAVAT--GAD-FDEII 661 (667) Q Consensus 585 ~~~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~--~~~-~~e~~ 661 (667) .+|.+|+..+..||.+||++|+|.||. ..+.++.+++.++++|++.++|++|+|||.++++.... +++ ..|=- T Consensus 503 ~~r~~ik~~i~~~L~~l~~~g~I~g~~----~~~~~~~~~~~d~v~V~i~v~Pv~~ie~I~vt~~~~p~~~~~~~~~~~~ 578 (581) T protein:vir:76 503 TTIVQVKASAEAALVWLVDNNIIRGYR----NLKARQIERQPDVIEVRYEWRPAYPLNYIVVRYSIAPETGDITSTIEGT 578 (581) T ss_pred HHHHHHHHHHHHHHHHHHhcCcccCcc----cceeeEEecCCCEEEEEEEEEecccceEEEEEEEEeeCCCceEEEEecc Confidence 999999999999999999999999986 23456777889999999999999999999999776543 222 00000 Q ss_pred HHH Q lcl|NC_012740. 662 GPA 664 (667) Q Consensus 662 ~~~ 664 (667) ... T Consensus 579 ~~~ 581 (581) T protein:vir:76 579 TSF 581 (581) T ss_pred ccC Confidence 000 No 48 >protein:vir:78986 Length: 436 # NCBI annotation: putative sheath tail protein # Family: family:all:632 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110731;genbank:gi:134287348;genbank:GeneID:4955160 Probab=100.00 E-value=1.7e-30 Score=183.78 Aligned_cols=406 Identities=16% Similarity=0.131 Sum_probs=259.6 Q ss_pred CceecCceEEEEecCCCc-ccccCCCceEEEeeccCCCCCccEEecC---HHHHHHHcCCcCccchhHHHHHHHHHcCCC Q lcl|NC_012740. 1 MTLLSPGFETKETTLSTT-IVQSATGRAALVGKFQWGPAFQIVQVTN---EVELVNKFGQPDNNTADYFMSGANFLQYGN 76 (667) Q Consensus 1 ~~~~~PGVyvee~~~~~~-~~~~~ts~~afvG~~~~Gp~~~p~~i~s---~~e~~~~FG~~~~~~~~~~~v~~~f~ngG~ 76 (667) .+=.-||+|++-+..... +..+...+.++...+.|||+++++.|++ ..++.+.||.. ...+....++..| .|+. T Consensus 11 ~~K~~PG~Y~n~~~~~~~~~~~~~rGi~a~p~~~~wGp~~~v~~i~~~~~~~~~~~~~G~~-~~~~~~~~l~~~~-~~~~ 88 (436) T protein:vir:78 11 QNKVLPGSYINFVSATRATSSLSDRGIVAMPLELDWGIDEEVFQVTSDDFEKYSTKYFGYD-YTHEKLKGLRDLF-KNIR 88 (436) T ss_pred ceeecCceEEEEEecCcceeeccCCeEEEEEEEecCCCCceeEEeecccchHHHHHHhcCc-cchHHHHHHHHHh-cCCC Confidence 455689999999876654 4456799999999999999999999998 56899999953 2233333566554 6678 Q ss_pred eEEEEEcCCcccccccccccccccceeeeccccccccceeeEeeeccceeccccceeecccccceeeeeccccccccccc Q lcl|NC_012740. 77 DLRVVRVLNKEKAKNATALAGNVEFEITNEGSNYEVGDTIKIKHNRQDIETAGKVTKVDGDGKVKGVFIPTGKIIAHAKA 156 (667) Q Consensus 77 ~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~a~~ 156 (667) .+|+.|+.++..+.. + T Consensus 89 tv~~yrl~~G~~a~~-~--------------------------------------------------------------- 104 (436) T protein:vir:78 89 LGYFYKLNKGVKASC-S--------------------------------------------------------------- 104 (436) T ss_pred EEEEEECCCcceeee-e--------------------------------------------------------------- Confidence 999999854321100 0 Q ss_pred ccccccccccceEEEEEeecccccceeeeceeeeceeeeeeccccchhhhccccccccccccceeeeeeccccccceeEE Q lcl|NC_012740. 157 IGVYPELDGGWTAEFTSSSGNGSAALSVTKIVTDSGLLLTDLETSRANITNQDFLTKLKKYDMPAVSAIYAGEIGNSLEV 236 (667) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~A~~~G~~gn~i~v 236 (667) + ..|.++|..||.+++ T Consensus 105 --------------v--------------------------------------------------~~Aky~g~~gn~i~v 120 (436) T protein:vir:78 105 --------------I--------------------------------------------------ATARCSGIRGNDLKV 120 (436) T ss_pred --------------e--------------------------------------------------eeeecCCCCCcEEEE Confidence 0 124567777777777 Q ss_pred EEeecccccccceeeeeeeecccccccceeeeeccccccccceeeeeccceeeeeEeeeccCCccccccccccchhhhcc Q lcl|NC_012740. 237 EILARSSFSGAVAPELTMYPFGGTRAAARNLIPYAPQNDNQYAFIVRRDGVVVESYVLSTLKGDKDVYGNSIYMDDFFAR 316 (667) Q Consensus 237 ~i~~~a~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~v~e~~~~s~~~~~~~~~~~~~~~~~~~~~ 316 (667) .+....+ ....|.+.+..+...+++..+.. +.+ . T Consensus 121 ~v~~~~~------------------------------d~~~~dv~~~~g~~~~d~~~~~~-------------~~~---l 154 (436) T protein:vir:78 121 IVTTNID------------------------------DNAKFDVVTLLDNKKVDTQIAKV-------------ITE---L 154 (436) T ss_pred Eeccccc------------------------------ccCceEEEEEecchhhhhhhHHH-------------Hhh---c Confidence 6643211 11122233322222222111110 000 0 Q ss_pred cccceEEEecccccCcccceEEecCCccccccccccccccccccchhHHHHHHhhhcccccccEEecCcCCcchhhHHHH Q lcl|NC_012740. 317 GSSQYIYATAQGWVDGFSGIISLAGGVSANEASTGDRGNDPFIGAMMQGWDLFAERESIHVNLLIAGACAGEGDAFSTVQ 396 (667) Q Consensus 317 ~~s~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~v~ 396 (667) ..+.++....... -.......|.||.++..+ +..++.+.+..+ +.+.++.|++|.. .++++ T Consensus 155 ~~n~~V~~~~~g~-la~~a~~~LtGG~dG~~~---------T~~dy~~al~~l---e~~~fn~l~~~~~------d~~~~ 215 (436) T protein:vir:78 155 QDNDYVTWKKEAT-LEATAGLTFTNGTNGEAV---------TGTEYQAFLDKI---ESYSFNALGCLAT------TAEIK 215 (436) T ss_pred cCCceEEEEeccc-ccccceeeeecccccccc---------chHHHHHHHHHH---cccceeEEEecCC------ChHHH Confidence 1234444432211 122223457777776432 234455665555 4557888888853 34678 Q ss_pred HHHHHHHhhcCcE-----EEEEccCccccccccccCCHHHHHHHhhhhccccccccccCcceEEEEehhhcccccccCce Q lcl|NC_012740. 397 KHAVSIGDERQDC-----LVMVSPPRSTVVNIPVTTAIDNLIAWREGNSNYSDNNMNINTTYAVIDGNYKYQYDKYNDVN 471 (667) Q Consensus 397 ~~~~~~~~~~~~~-----~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~v~d~~~~~~ 471 (667) +.+.++++++|+. -+++... .. .+.....-+... .++.. T Consensus 216 ~~~~a~ikr~re~~g~~~~aV~~~~--------~~----------------------~d~EgIInv~n~------v~g~~ 259 (436) T protein:vir:78 216 SLFVEFTKRMRDKVGAKFQTVLYKK--------ND----------------------ADYEGVVSVENK------IKDTG 259 (436) T ss_pred HHHHHHHHHHHhhcCCeEEEEecCC--------CC----------------------CCCceEEEeecc------cCCce Confidence 8889999887642 2222110 00 111111111111 11211 Q ss_pred -eEechHHHHHHHHHHhhhcCCceeeecceeccceeccccccccCChhhhhhhhhcCceEEEEecCCeEEEEcce-ec-- Q lcl|NC_012740. 472 -RWVPLAADIAGLCARTDAVSQPWMSPAGYNRGQIMNVVKLAIEPRKAHRDRLYQAAINPVIGAGGEGFILMGDK-TA-- 547 (667) Q Consensus 472 -~~~p~sg~vAg~~a~~d~~~g~~~span~~~~~i~g~~~~~~~~~~~e~~~Ln~~gIn~i~~~~~~G~~~wG~r-T~-- 547 (667) --.-.++.+||++|..+. .+|+.|+.+. ++.++...++++|.+.+.++|.-++.+. ++++++--+- |+ T Consensus 260 ~~~~~~~a~vAG~~Ag~~~----~~S~T~~~~~---~~~~v~~~~t~~e~~~ai~~G~lvl~~d-~~~v~I~~~VNTltt 331 (436) T protein:vir:78 260 LLESSLIYWTTGAIAGCDI----NKSNTNKRYD---GEFDVDVNYTQIHLEEALKTGKFIFHKV-GDEVHVLEDINTFVS 331 (436) T ss_pred echhHHHHHHHHHHhcCcc----ccCccceecC---ccccccccCCHHHHHHHHhCCeEEEEEe-CCeEEEEEcccccee Confidence 011246789999998764 5588888754 4556777899999999999999988754 5666665544 33 Q ss_pred --CCCcccceeeehhhhhHHHHHHHHHHHH-HHhcC-CCCHHHHHHHHHHHHHHHHHHHhcCCeeeeEE---EEcccCCC Q lcl|NC_012740. 548 --TTVPSPFDRINVRRLFNMLKKNIGDSSK-YKLFE-NNDNFTRASFRMEVSQYLSTIRSLGGIYDFRV---QCDTTNNT 620 (667) Q Consensus 548 --~~~~~~~~~i~vrR~~~~i~~~l~~~~~-~~v~e-pn~~~~~~~i~~~i~~~l~~l~~~gal~g~~v---~~d~~~nt 620 (667) ...+.+|+.|.++|++|+|.+.+++... .|+++ ||+..-|..++..|..||.+|++.|+|..|.. ..++. T Consensus 332 ~~~~k~~~~~kI~vir~~D~i~~di~~~~~~~yiGKv~N~~dgr~~l~~~i~~yl~~L~~~g~I~~f~~~Dv~v~~~--- 408 (436) T protein:vir:78 332 FTDEKNDDFSSNQSVRVLDQIANDIATLFNTKYLGEVPNDKSGRISFWNDVVKHHEQLQNMRAIEDFKADDVSVEPG--- 408 (436) T ss_pred cCCCCCcchhhhhHHHHHHHHHHHHHHHhhhccccccCCCHHHHHHHHHHHHHHHHHHHhCCcccCCCCcceEEeec--- Confidence 2334689999999999999999998875 59996 69999999999999999999999999998863 22211 Q ss_pred HHHhhCCeEEEEEEEEecCCceEEEEEEEEe Q lcl|NC_012740. 621 PDVIDRNEFVASMFIKPAKSINYIMLNFTAV 651 (667) Q Consensus 621 ~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~ 651 (667) -....+++.+.++|+..||+|.+++.-- T Consensus 409 ---~~~~~v~v~~~v~pvdamekiy~ti~v~ 436 (436) T protein:vir:78 409 ---SDKKTVVVSDAVKVISAMSKLYMTVSVS 436 (436) T ss_pred ---CCCCEEEEEEEEEEEEeeeeEEEEEEEC Confidence 1356788999999999999999997755 No 49 >protein:vir:102359 Length: 356 # NCBI annotation: XkdK protein # Family: family:all:632 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529565;genbank:gi:90592650;genbank:GeneID:3974491 Probab=99.43 E-value=3.6e-14 Score=94.30 Aligned_cols=315 Identities=15% Similarity=0.103 Sum_probs=171.9 Q ss_pred eeccccccccc---eeeeec---cceeeeeEeeeccCCccccccccccchhhhcccccceEEEecccccCcccceEEecC Q lcl|NC_012740. 268 IPYAPQNDNQY---AFIVRR---DGVVVESYVLSTLKGDKDVYGNSIYMDDFFARGSSQYIYATAQGWVDGFSGIISLAG 341 (667) Q Consensus 268 ~~~~~~~~~~~---~~~v~~---~g~v~e~~~~s~~~~~~~~~~~~~~~~~~~~~~~s~~v~~~~~~~~~~~~~~~~~~~ 341 (667) ....|...-.| ..+... -|.+. .++.-.............+..........++. ..+.+ T Consensus 1 ~~glp~i~i~f~~~a~ta~~~g~rGiv~--~il~d~~~~~~~~~~~~~v~~~~~~~n~~~i~-------------~~~~g 65 (356) T protein:vir:10 1 MAGLVNINIEFKELATSFIQRSKAGIVA--IILKDTTKMYKELTSEDDIPISLSADNKKYIK-------------YGFVG 65 (356) T ss_pred CCCCCceeEEEeecceeeccCCccceEE--EEEecCCcceeEEeccccchhHHHHHHHHHHH-------------HHhhc Confidence 11111111001 111111 12211 11111100000000000000000000001110 01112 Q ss_pred Cccccc---cccccccccccccchhHHHHHHhhhcccccccEEecCcCCcchhhHHHHHHHHHHHhhcCc----EEEEEc Q lcl|NC_012740. 342 GVSANE---ASTGDRGNDPFIGAMMQGWDLFAERESIHVNLLIAGACAGEGDAFSTVQKHAVSIGDERQD----CLVMVS 414 (667) Q Consensus 342 g~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~v~~~~~~~~~~~~~----~~ai~d 414 (667) +..... +.....+......++.+.+..+ +.+.++.|++|.. ..++++.+.+++.++|+ .+..+- T Consensus 66 ~~~~~~~~~p~~~~~~~~~t~~~y~~aL~~l---e~~~fn~l~~~~~------d~~~~~~~~a~ikr~r~~~~~~~~~V~ 136 (356) T protein:vir:10 66 ATDNEKVLRPSKVIISTFTEDGKVEDILEEL---ESVEFNYLCMPEA------IEAEKTKIVTWIKKIREEESTEAKAVL 136 (356) T ss_pred cccccccccceeeeeecccCchhHHHHHHHh---cCccceEEEecCC------ChHHHHHHHHHHHHHHhcCCcEEEEEe Confidence 211110 1111111122334566666665 4568898988853 34577888888887664 333322 Q ss_pred cCccccccccccCCHHHHHHHhhhhccccccccccCcceEEEEehhhcccccccCceeE--echHHHHHHHHHHhhhcCC Q lcl|NC_012740. 415 PPRSTVVNIPVTTAIDNLIAWREGNSNYSDNNMNINTTYAVIDGNYKYQYDKYNDVNRW--VPLAADIAGLCARTDAVSQ 492 (667) Q Consensus 415 ~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~v~d~~~~~~~~--~p~sg~vAg~~a~~d~~~g 492 (667) +. ...+.+.++ + +.-.+ +.| +. .+ .-.++.+||++|....++ T Consensus 137 ~~--------~~aD~EgII--------------n--------v~n~~-~~~---g~-~~t~~~~~~~vAG~~Ag~~~n~- 180 (356) T protein:vir:10 137 AN--------IKADNEAII--------------N--------FTENV-VVD---GE-EITAEKYTTRVASLIASTPNTQ- 180 (356) T ss_pred cC--------CCCCCceeE--------------E--------eecCe-Eec---ce-eechhHHHHHHHHHHhccchhc- Confidence 11 001111111 1 11111 111 11 11 223678999999887544 Q ss_pred ceeeecceeccceeccccccccCChhhhhhhhhcCceEEEEecCCeEEEE-cceec----CCCcccceeeehhhhhHHHH Q lcl|NC_012740. 493 PWMSPAGYNRGQIMNVVKLAIEPRKAHRDRLYQAAINPVIGAGGEGFILM-GDKTA----TTVPSPFDRINVRRLFNMLK 567 (667) Q Consensus 493 ~~~span~~~~~i~g~~~~~~~~~~~e~~~Ln~~gIn~i~~~~~~G~~~w-G~rT~----~~~~~~~~~i~vrR~~~~i~ 567 (667) |+.|+.+.. +... ..++++|.+.+.++|--++.+.. +.+++- |-.|+ ...+.+|+.|.+.|++|.|. T Consensus 181 ---S~T~~~~~~---~~~~-~~~t~~e~~~ai~~G~lvl~~d~-~~V~I~~~VNSltt~t~~k~~~f~Kirvvr~~D~i~ 252 (356) T protein:vir:10 181 ---SITYAPLDE---VESI-VKIDKASADAKVQAGELILRRLS-GKIRIARGINSLTTLTAEKGEIFQKIKLVDTKDLIS 252 (356) T ss_pred ---cccceecCC---cccc-ccCCHHHHHHHHhCCeEEEEEEc-CeEEEEecCccceecCCCCCcchhhhHHHHHHHHHH Confidence 888887553 2222 35789999999999999987654 445444 44444 23346799999999999999 Q ss_pred HHHHHHHH-HHhcC-CCCHHHHHHHHHHHHHHHHHHHhcCCee-eeEEEEcccCC--------------CHHHhh----C Q lcl|NC_012740. 568 KNIGDSSK-YKLFE-NNDNFTRASFRMEVSQYLSTIRSLGGIY-DFRVQCDTTNN--------------TPDVID----R 626 (667) Q Consensus 568 ~~l~~~~~-~~v~e-pn~~~~~~~i~~~i~~~l~~l~~~gal~-g~~v~~d~~~n--------------t~~~i~----~ 626 (667) +.+++... .|+++ ||+..-|..++..+..||.+|.+.|+|. +|.+..|.+.. +...|. . T Consensus 253 ~Di~~~f~~~yiGKv~N~~dgr~~l~~ai~~y~~~L~~~~~I~~~~~~eid~e~q~~~~~~~g~d~~~~~d~~v~~~~~~ 332 (356) T protein:vir:10 253 KDIKNIYVEKYLRKCPNTYDNKCLFIVAVQSYLTELAKQELIDSNFTVEIDLEKQKEYLEGKKIAVSKMKENEIKEANTG 332 (356) T ss_pred HHHHHHHhhccccccCCCHHHHHHHHHHHHHHHHHHHhCCccccCceeEecccchHHHhhhccccccccccceeecccCC Confidence 99998876 69998 5999999999999999999999999996 57777776432 222222 2 Q ss_pred CeEEEEEEEEecCCceEEEEEEEE Q lcl|NC_012740. 627 NEFVASMFIKPAKSINYIMLNFTA 650 (667) Q Consensus 627 G~~~~~i~~~p~~p~e~i~~~~~~ 650 (667) -.+++.+.++|+-.||.|.+++.. T Consensus 333 ~~v~~~~~v~~vdamE~iy~ti~v 356 (356) T protein:vir:10 333 SNGFYLINLKLVDAMEDINIRVQM 356 (356) T ss_pred cEEEEEEEEEEEeeeeeEEeEEeC Confidence 457899999999999999999887 No 50 >protein:vir:4517 Length: 498 # NCBI annotation: tail sheath protein # Family: family:all:369 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599043;genbank:gi:19549001;genbank:GeneID:935236 Probab=99.21 E-value=2.1e-10 Score=73.69 Aligned_cols=441 Identities=13% Similarity=0.100 Sum_probs=211.3 Q ss_pred CceecCceEEEEecCCCcccccCCCceEEEeec---cCCCCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHHcC-CC Q lcl|NC_012740. 1 MTLLSPGFETKETTLSTTIVQSATGRAALVGKF---QWGPAFQIVQVTNEVELVNKFGQPDNNTADYFMSGANFLQY-GN 76 (667) Q Consensus 1 ~~~~~PGVyvee~~~~~~~~~~~ts~~afvG~~---~~Gp~~~p~~i~s~~e~~~~FG~~~~~~~~~~~v~~~f~ng-G~ 76 (667) =....||+|+|--++.. .........-+||.. ...+.++|++|+|-.|-...||. .+.+..+++.|..+. =. T Consensus 10 ~~iRvP~~y~E~dns~A-~~~~~~q~vLiiGq~la~gs~~~~~~v~v~s~~~a~~lfG~---GSml~~M~~a~~~~n~~~ 85 (498) T protein:vir:45 10 SNTLVPLFYAEMDNQAA-NTAQDSGASLLIGHANNGAEIVANSLVLMPSADYARQICGA---GSQLARMVEAYRQTDPFG 85 (498) T ss_pred cccccCeEEEEEeCCCC-CCCCCCcceEEEEecCCccccccceeEEecCHHHHHHhcCc---CcHHHHHHHHHHHhCCcc Confidence 45668999999544444 222234566788876 34578999999999999999996 567777788887753 37 Q ss_pred eEEEEEcCCcccccccccccccccceeeeccccccccceeeEeeeccceeccccceeecccccceeeeeccccccccccc Q lcl|NC_012740. 77 DLRVVRVLNKEKAKNATALAGNVEFEITNEGSNYEVGDTIKIKHNRQDIETAGKVTKVDGDGKVKGVFIPTGKIIAHAKA 156 (667) Q Consensus 77 ~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~a~~ 156 (667) ++|++-+.|. ....++.. + +.+..+ +.+..+.+.+... T Consensus 86 ~l~~i~~~d~-aG~aA~g~---i--t~tg~a---t~~G~l~l~Igg~--------------------------------- 123 (498) T protein:vir:45 86 ELYVIAVPEA-TGAAATVT---L--TVTGEA---TESGTVNVYVGRT--------------------------------- 123 (498) T ss_pred eEEEEeeCCc-ccceeEEE---E--Eeeccc---CCCcEEEEEECCE--------------------------------- Confidence 9999998653 22111110 0 000000 0001111111100 Q ss_pred ccccccccccceEEEEEeeccc--ccceeeeceeeeceeeeeeccccchhhhccccccccccccceeeeeecccccccee Q lcl|NC_012740. 157 IGVYPELDGGWTAEFTSSSGNG--SAALSVTKIVTDSGLLLTDLETSRANITNQDFLTKLKKYDMPAVSAIYAGEIGNSL 234 (667) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~--~~~~t~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~A~~~G~~gn~i 234 (667) .+.+....+.. .....+...+... .+.+ .+.....+...++|.--|..||.+ T Consensus 124 -----------~v~v~V~~gdTaa~vA~al~aaina~-------~~lP--------VTA~~~~~~VtlTAr~kG~~GN~I 177 (498) T protein:vir:45 124 -----------RVQAPVTNGDNVTTIASSIQDAINAV-------PTLP--------FTASSSAGVVTLTARHKGLCGNEI 177 (498) T ss_pred -----------EEEEEecCCCCHHHHHHHHHHHHhCC-------CCCc--------eEEEecCceEEEEeeccCccccce Confidence 00000000000 0000000000000 0000 000000122233444445555554 Q ss_pred EEEEeecccccccceeeeeeeecccccccceeeeeccccccccceeeeeccceeeeeEeeeccCCccccccccccchhhh Q lcl|NC_012740. 235 EVEILARSSFSGAVAPELTMYPFGGTRAAARNLIPYAPQNDNQYAFIVRRDGVVVESYVLSTLKGDKDVYGNSIYMDDFF 314 (667) Q Consensus 235 ~v~i~~~a~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~v~e~~~~s~~~~~~~~~~~~~~~~~~~ 314 (667) .+++...... .|. .. T Consensus 178 ~l~~~~~~~~----------------------------------------~ge-------------------------~~ 192 (498) T protein:vir:45 178 PVSLNYYGFG----------------------------------------GGE-------------------------VL 192 (498) T ss_pred eEEEeecccc----------------------------------------ccc-------------------------cc Confidence 4433210000 000 00 Q ss_pred cccccceEEEecccccCcccceEEecCCccccccccccccccccccchhHHHHHHhhhcccccccEEecCcCCcchhhHH Q lcl|NC_012740. 315 ARGSSQYIYATAQGWVDGFSGIISLAGGVSANEASTGDRGNDPFIGAMMQGWDLFAERESIHVNLLIAGACAGEGDAFST 394 (667) Q Consensus 315 ~~~~s~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~ 394 (667) ..+ -.+.++ .+++|.. ..+....+.++.. ..++++++|-... ....+ T Consensus 193 p~G--lt~~it------------amagGag--------------~PD~a~alaal~~---~~~~~I~~p~~D~--asL~a 239 (498) T protein:vir:45 193 PAG--VQIAVA------------TGTAGTG--------------APVLTGAVAAMAD---EPFDYIGLPFNDT--ASVNT 239 (498) T ss_pred cce--eeEEEE------------ccCCCcc--------------CchhHHHHHHhcc---CCccEEEEeeCCH--HHHHH Confidence 000 001111 1222211 1233444444443 3567778764321 12222 Q ss_pred HHHHHHHHH-----hhcCcEEEEEccCccccccccccCCHHHHHHHhhhhccccccccccCcceEEEEehhhcccccccC Q lcl|NC_012740. 395 VQKHAVSIG-----DERQDCLVMVSPPRSTVVNIPVTTAIDNLIAWREGNSNYSDNNMNINTTYAVIDGNYKYQYDKYND 469 (667) Q Consensus 395 v~~~~~~~~-----~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~v~d~~~~ 469 (667) ...+|.+.. -++++++++.-. .-+..+..++.. ..|+.|..+.+...-. T Consensus 240 l~~~L~~~sgRw~~~~q~~g~~~~a~----------~gT~~~l~t~g~----------~~N~~~it~~~~~~~~------ 293 (498) T protein:vir:45 240 LVTEMNDTSGRWSYARQLYGHVYTAK----------TGTLSELVNAGD----------QFNQQHITLAGYEKET------ 293 (498) T ss_pred HHHHHhhhhhhhhHHhhcCeEEEEec----------cCCHHHHHHhhh----------ccCCceEEEEecCCCC------ Confidence 223332211 234555555432 124566666654 3577777765421110 Q ss_pred ceeEech---HHHHHHHHH---HhhhcCCceeeecc-eeccceeccc--cccccCChhhhhhhhhcCceEEEEecCCeEE Q lcl|NC_012740. 470 VNRWVPL---AADIAGLCA---RTDAVSQPWMSPAG-YNRGQIMNVV--KLAIEPRKAHRDRLYQAAINPVIGAGGEGFI 540 (667) Q Consensus 470 ~~~~~p~---sg~vAg~~a---~~d~~~g~~~span-~~~~~i~g~~--~~~~~~~~~e~~~Ln~~gIn~i~~~~~~G~~ 540 (667) .-|| ++.+|++.| +.|..| |-+ .. +.|+. .+.-.++..|++.|..+||.++..-.|+ .. T Consensus 294 ---~sp~~~~AAa~aa~~A~~l~~DPAr-----PL~tl~---L~Gi~~p~~~~r~~~~ern~LL~~Gist~~V~~G~-V~ 361 (498) T protein:vir:45 294 ---QTPADELAASRTARAAVFIRNDPAR-----PTQTGE---LVGMLPAPKGKRFTMTEQQTLLSHGVATAYVESGV-LR 361 (498) T ss_pred ---CChHHHHHHHHHHHHHHHhhccccc-----ccCcee---ecceecCCchhcCChHHHHHHHhCCcceEEEcCCe-EE Confidence 1133 233444444 445433 322 23 33443 4556678999999999999999765443 44 Q ss_pred EEcceec-------CCCcccceeeehhhhhHHHHHHHHHHHHH-HhcCCCCHH-----------HHHHHHHHHHHHHHHH Q lcl|NC_012740. 541 LMGDKTA-------TTVPSPFDRINVRRLFNMLKKNIGDSSKY-KLFENNDNF-----------TRASFRMEVSQYLSTI 601 (667) Q Consensus 541 ~wG~rT~-------~~~~~~~~~i~vrR~~~~i~~~l~~~~~~-~v~epn~~~-----------~~~~i~~~i~~~l~~l 601 (667) +--..|. ..| ..|..|+..|+.+|+++.++..... |--+..-+. +-..||..+-.-+++| T Consensus 362 I~R~ITTY~~n~~G~~D-~syLDi~T~~tl~yvr~~~r~~i~~kfpR~KLa~dg~~~~~gq~IvTp~~ir~ell~~y~~l 440 (498) T protein:vir:45 362 IQRDVTTYRKNAYGVAD-NSYLDSETLHTSAYVLRKLKSVITSKYGRHKLASDGTRFGPGQAIVTPAVIKGELLATYRQL 440 (498) T ss_pred EEeeeeeeeecCCCCcc-hhhhhhhhHHHHHHHHHHHHHHhhhhcCCeeecccCcccCCCCcccchHHHHHHHHHHHHhh Confidence 4444443 234 4699999999999999999988753 222222221 6678999999999999 Q ss_pred HhcCCeeee---E--EEEcccCCCHHHhhCCeEEEEEEEEecCCceEE----EEEEEEeecCe Q lcl|NC_012740. 602 RSLGGIYDF---R--VQCDTTNNTPDVIDRNEFVASMFIKPAKSINYI----MLNFTAVATGA 655 (667) Q Consensus 602 ~~~gal~g~---~--v~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i----~~~~~~~~~~~ 655 (667) ...|-+..+ + +.+.++-+. ..|+.+.+-...+-+..-+ .|+++.....+ T Consensus 441 e~~givEn~~~~~~~LiVerd~~d-----pnRln~~~p~d~vn~L~V~A~~~~f~lq~~~~~~ 498 (498) T protein:vir:45 441 ERAGIVENYELFKQYLVVERDASV-----PNRLNTLFPPDYVNQLRVFAVVNQFRLQYSEESA 498 (498) T ss_pred hhhccccChhhhcceeEEEECCCC-----CcEEEEEecccccCchhhhhhhhhhheehhhcCC Confidence 999988874 2 333332221 1466666544444443322 22222222222 No 51 >protein:vir:489 Length: 498 # NCBI annotation: putative sheath protein # Family: family:all:369 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543096;swissprot:trembl:q8w623;genbank:gi:18249908;uniprot:Q8W623;genbank:GeneID:929698 Probab=99.20 E-value=1.5e-10 Score=74.43 Aligned_cols=440 Identities=13% Similarity=0.085 Sum_probs=211.7 Q ss_pred CceecCceEEEEecCCCcccccCCCceEEEeecc---CCCCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHHcC-CC Q lcl|NC_012740. 1 MTLLSPGFETKETTLSTTIVQSATGRAALVGKFQ---WGPAFQIVQVTNEVELVNKFGQPDNNTADYFMSGANFLQY-GN 76 (667) Q Consensus 1 ~~~~~PGVyvee~~~~~~~~~~~ts~~afvG~~~---~Gp~~~p~~i~s~~e~~~~FG~~~~~~~~~~~v~~~f~ng-G~ 76 (667) =.+..||+|+|--++..... ..+.-.-+||..- ..|.++|++|+|..|-...||. .+.+..+++.|..+. =. T Consensus 10 ~~iRvP~~y~E~dns~A~~~-~~~qrvLiiGq~la~gt~~~~~~v~v~s~~~a~~~fG~---GS~l~~M~~a~~~~n~~~ 85 (498) T protein:vir:48 10 SDTLVPLFYAEMDNSAANTA-VTSAPALLIGHASNDAAIEVNSLVLMPSADYARQICGA---GSQLARMVDVYRQTDPFG 85 (498) T ss_pred cccccceEEEEEecCCCccc-cCCcceEEEeecCccccccccceEEecCHHHHHHhcCc---ccHHHHHHHHHHHhCCCc Confidence 45678999999655443322 2234577888753 3478999999999999999996 566666777777754 48 Q ss_pred eEEEEEcCCcccccccccccccccceeeeccccccccceeeEeeeccceeccccceeecccccceeeeeccccccccccc Q lcl|NC_012740. 77 DLRVVRVLNKEKAKNATALAGNVEFEITNEGSNYEVGDTIKIKHNRQDIETAGKVTKVDGDGKVKGVFIPTGKIIAHAKA 156 (667) Q Consensus 77 ~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~a~~ 156 (667) ++|++-+.|. ....++.. + +.+..+ +.+..+.+.+... T Consensus 86 ~l~~i~~~D~-ag~aA~g~---i--t~tg~a---t~~G~l~l~Igg~--------------------------------- 123 (498) T protein:vir:48 86 ELYVIAVPEA-RGAAATVR---V--TVTGEA---EESGTLSLYVGRS--------------------------------- 123 (498) T ss_pred eeEEEeeCCc-ccceeEEE---E--Eecccc---cCCceEEEEECCE--------------------------------- Confidence 9999998763 22111110 0 000000 0001111111100 Q ss_pred ccccccccccceEEEEEeecccc--cceeeeceeeeceeeeeeccccchhhhccccccccccccceeeeeecccccccee Q lcl|NC_012740. 157 IGVYPELDGGWTAEFTSSSGNGS--AALSVTKIVTDSGLLLTDLETSRANITNQDFLTKLKKYDMPAVSAIYAGEIGNSL 234 (667) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~--~~~t~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~A~~~G~~gn~i 234 (667) .+.+....+... ....+...+... .+.+. +.....+...++|.--|..||.+ T Consensus 124 -----------~v~v~V~~gdTaa~vA~al~aai~a~-------~~lPV--------TA~~~~~~VtlTAr~kG~~GN~I 177 (498) T protein:vir:48 124 -----------SVQVPVVNGDDATAVATAIKEAVNGV-------ITLPF--------AASSDAGVVTLTARHKGLYGNEL 177 (498) T ss_pred -----------EEEEeecCCCCHHHHHHHHHHHHhCC-------CCcce--------EEEecCcEEEEEeeecccccccc Confidence 000000000000 000000000000 00000 00000111223333334444444 Q ss_pred EEEEeecccccccceeeeeeeecccccccceeeeeccccccccceeeeeccceeeeeEeeeccCCccccccccccchhhh Q lcl|NC_012740. 235 EVEILARSSFSGAVAPELTMYPFGGTRAAARNLIPYAPQNDNQYAFIVRRDGVVVESYVLSTLKGDKDVYGNSIYMDDFF 314 (667) Q Consensus 235 ~v~i~~~a~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~v~e~~~~s~~~~~~~~~~~~~~~~~~~ 314 (667) .+++...+. ..+. .. T Consensus 178 ~l~~~~~~~-----------------------------------------------------~~ge------------~~ 192 (498) T protein:vir:48 178 PVCLNYYGS-----------------------------------------------------GGGE------------IL 192 (498) T ss_pred eeeeeeccC-----------------------------------------------------cccc------------cc Confidence 333211000 0000 00 Q ss_pred cccccceEEEecccccCcccceEEecCCccccccccccccccccccchhHHHHHHhhhcccccccEEecCcCCcchhhHH Q lcl|NC_012740. 315 ARGSSQYIYATAQGWVDGFSGIISLAGGVSANEASTGDRGNDPFIGAMMQGWDLFAERESIHVNLLIAGACAGEGDAFST 394 (667) Q Consensus 315 ~~~~s~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~ 394 (667) -.+ -.+.+ ..++||.. ..+....+.++.. ..++++++|-.. .+ T Consensus 193 p~G--lt~~i------------tamsgGag--------------~PDia~aLaal~~---~~~~~I~~p~~D------~a 235 (498) T protein:vir:48 193 PAG--LQVVT------------EAGTAGSG--------------APDLTAAVAAMGD---EAFDFIGLPFND------AA 235 (498) T ss_pred cce--eeEEE------------EcccCCcc--------------CcchHHHHHhhcc---CCccEEEEeecC------HH Confidence 000 00111 11222211 1233444444433 356778886432 12 Q ss_pred HHHHHHHHHh---------hcCcEEEEEccCccccccccccCCHHHHHHHhhhhccccccccccCcceEEEEehhhcccc Q lcl|NC_012740. 395 VQKHAVSIGD---------ERQDCLVMVSPPRSTVVNIPVTTAIDNLIAWREGNSNYSDNNMNINTTYAVIDGNYKYQYD 465 (667) Q Consensus 395 v~~~~~~~~~---------~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~v~d 465 (667) -..++.+|++ ++++++++.-. .-+..+..+|.. ..|+.|..+.+-. T Consensus 236 sl~al~~~L~~~sgRw~~~~q~~g~~~~a~----------~gT~~~l~t~g~----------~~N~~~it~~~~~----- 290 (498) T protein:vir:48 236 SINMMMTEMNDSSGRWSYARQLYGHVYTAK----------LGTLSELVNAGD----------MHNQQHITLAGYE----- 290 (498) T ss_pred HHHHHHHHHhhhhhhhhHHhhcCeEEEEec----------cCCHHHHHHhhh----------ccCCceEEEEecC----- Confidence 2233444442 34555555432 124566666654 3567777665421 Q ss_pred cccCceeEechH---HHHHHHHH---HhhhcCCceeeecceeccceeccc--cccccCChhhhhhhhhcCceEEEEecCC Q lcl|NC_012740. 466 KYNDVNRWVPLA---ADIAGLCA---RTDAVSQPWMSPAGYNRGQIMNVV--KLAIEPRKAHRDRLYQAAINPVIGAGGE 537 (667) Q Consensus 466 ~~~~~~~~~p~s---g~vAg~~a---~~d~~~g~~~span~~~~~i~g~~--~~~~~~~~~e~~~Ln~~gIn~i~~~~~~ 537 (667) +. ..-|+. +.+|++.+ +.|..| |-+. -.+.|+. .+.-.++..|++.|..+||.++.. .+. T Consensus 291 ---~~-~~~p~~~~AAa~a~~aA~~l~~DPAr-----PLqt--l~L~Gi~~p~~~~r~~~~ern~LL~~Gist~~V-~~G 358 (498) T protein:vir:48 291 ---KE-TQSPVDELVASRLAREAVFIRNDPAR-----PTQT--GELVGMLPAPKGKRFIMTEQQTLLSHGVATAYV-EGG 358 (498) T ss_pred ---CC-CCChHHHHHHHHHHHHHHhhhccccc-----cccc--eeeeccccCCchhcCChHHHHHHHhcCcceEEE-cCC Confidence 11 011332 23444443 455443 3222 1234443 455567899999999999999976 554 Q ss_pred eEEEEcceec-------CCCcccceeeehhhhhHHHHHHHHHHHHH-HhcCCCCHH-----------HHHHHHHHHHHHH Q lcl|NC_012740. 538 GFILMGDKTA-------TTVPSPFDRINVRRLFNMLKKNIGDSSKY-KLFENNDNF-----------TRASFRMEVSQYL 598 (667) Q Consensus 538 G~~~wG~rT~-------~~~~~~~~~i~vrR~~~~i~~~l~~~~~~-~v~epn~~~-----------~~~~i~~~i~~~l 598 (667) -..+--..|. ..| ..|..|+..|+.+|+++.++..... |--+..-+. +-..||..+-.-+ T Consensus 359 ~V~I~R~ITTY~~n~~G~~D-~syLDi~T~~tl~yvr~~~r~~i~~kfpR~KLa~dg~~~~~gq~IvTp~~ir~eli~~y 437 (498) T protein:vir:48 359 TLRIQRSVTTYKKNAYGVAD-NSYLDSETLHTSAYVLRKLKSVITSKYGRHKLANDGTRFGPGQAIVTPAVIKGELLATY 437 (498) T ss_pred eEEEEeeeeeeeecCCCCcc-hhhhhhhhHHHHHHHHHHHHHHhhhhcCCceecccCcccCCCCcccchHHHHHHHHHHH Confidence 4555555554 233 4699999999999999999988753 322222222 6678999999999 Q ss_pred HHHHhcCCeeee---E--EEEcccCCCHHHhhCCeEEEEEEEEecCCceEEEEEEEEeecCeeHHHHHHHHhcC Q lcl|NC_012740. 599 STIRSLGGIYDF---R--VQCDTTNNTPDVIDRNEFVASMFIKPAKSINYIMLNFTAVATGADFDEIIGPANQA 667 (667) Q Consensus 599 ~~l~~~gal~g~---~--v~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~~~~~~~ 667 (667) ++|...|-+..+ + +.+.++-+. ..|+.+.+-...+-+..-+ +...++.-+.++| T Consensus 438 ~~le~~given~~~~~~~LiVerd~~d-----pnRln~~~p~d~vn~L~V~----------A~~~~f~lq~~~~ 496 (498) T protein:vir:48 438 RQMERAGIVENYDLFKQYLIVERDADN-----PNRLNTLFPPDYVNQLRVF----------AVVNQFRLQYSEE 496 (498) T ss_pred HhhhhhccccChhhhcceeEEEECCCC-----CcEEEEEecccccCchhhh----------hhhhhhhhhhhhc Confidence 999999988873 2 334332222 1466666544444433221 1122333333333 No 52 >protein:vir:4463 Length: 498 # NCBI annotation: Tail Sheath protein # Family: family:all:369 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700386;genbank:gi:23505458;genbank:GeneID:955665 Probab=99.17 E-value=3.4e-10 Score=72.52 Aligned_cols=439 Identities=14% Similarity=0.102 Sum_probs=210.1 Q ss_pred CceecCceEEEEecCCCcccccCCCceEEEeecc---CCCCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHHcC-CC Q lcl|NC_012740. 1 MTLLSPGFETKETTLSTTIVQSATGRAALVGKFQ---WGPAFQIVQVTNEVELVNKFGQPDNNTADYFMSGANFLQY-GN 76 (667) Q Consensus 1 ~~~~~PGVyvee~~~~~~~~~~~ts~~afvG~~~---~Gp~~~p~~i~s~~e~~~~FG~~~~~~~~~~~v~~~f~ng-G~ 76 (667) =....||+|+|--++.. .........-+||..- ..|.++|++|+|.+|-...||. .+.+..+++.|..+. =. T Consensus 10 ~~iRvP~~y~E~dns~A-~~~~~~q~vLiiGq~la~gs~~~~~~v~v~s~~~a~~~fG~---GSml~~M~~a~~~~n~~~ 85 (498) T protein:vir:44 10 SDTRVPLFYAEMDNSAA-NTARDSGASLLIGHASNDASIAVNSLVLVSSVDYARQICGA---GSQLARMVGAYRKTDPFG 85 (498) T ss_pred cccccCeEEEEEeCCCC-CCCcCCcceEEEEecCcccccccceeEeecCHHHHHHhcCc---ccHHHHHHHHHHHhCCCc Confidence 44668999999533333 2222344566788753 3478999999999999999996 667777888887764 48 Q ss_pred eEEEEEcCCcccccccccccccccceeeeccccccccceeeEeeeccceeccccceeecccccceeeeeccccccccccc Q lcl|NC_012740. 77 DLRVVRVLNKEKAKNATALAGNVEFEITNEGSNYEVGDTIKIKHNRQDIETAGKVTKVDGDGKVKGVFIPTGKIIAHAKA 156 (667) Q Consensus 77 ~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~a~~ 156 (667) ++|++-+.|. ....++.. + +.+... +.+..+.+.+... T Consensus 86 ~l~~i~~~D~-aG~aAtg~---i--t~tg~a---t~~G~l~l~Igg~--------------------------------- 123 (498) T protein:vir:44 86 ELYVIAVPES-TGAAATVA---L--TVTGEA---TETGTVNVYTGRT--------------------------------- 123 (498) T ss_pred eeEEEecCCc-ccceeEEE---E--Eeeccc---CCCcEEEEEECCE--------------------------------- Confidence 9999988653 22222111 0 000000 0001111111100 Q ss_pred ccccccccccceEEEEEeeccc--ccceeeeceeeeceeeeeeccccchhhhccccccccccccceeeeeecccccccee Q lcl|NC_012740. 157 IGVYPELDGGWTAEFTSSSGNG--SAALSVTKIVTDSGLLLTDLETSRANITNQDFLTKLKKYDMPAVSAIYAGEIGNSL 234 (667) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~--~~~~t~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~A~~~G~~gn~i 234 (667) .+.+....+.. .....+...+.... ..+ .+.....+...++|.--|..||.+ T Consensus 124 -----------~v~v~V~~gdTaa~vA~al~aaina~~-------~lP--------VTA~~~~~~vtlTAr~kG~~GN~I 177 (498) T protein:vir:44 124 -----------RVQAPVTSGDDAAAVAVSIKDAVNANP-------DLP--------FTATSEAGVVTLTARHKGLYGNEI 177 (498) T ss_pred -----------EEEEEecCCCCHHHHHHHHHHHHhCCC-------CCc--------eEEeeccceEEEEEeccCcccCcc Confidence 00000000000 00000000000000 000 000000011223344344444444 Q ss_pred EEEEeecccccccceeeeeeeecccccccceeeeeccccccccceeeeeccceeeeeEeeeccCCccccccccccchhhh Q lcl|NC_012740. 235 EVEILARSSFSGAVAPELTMYPFGGTRAAARNLIPYAPQNDNQYAFIVRRDGVVVESYVLSTLKGDKDVYGNSIYMDDFF 314 (667) Q Consensus 235 ~v~i~~~a~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~v~e~~~~s~~~~~~~~~~~~~~~~~~~ 314 (667) .+++...+.. .|. .. T Consensus 178 ~l~~~~~~~~----------------------------------------~ge-------------------------~~ 192 (498) T protein:vir:44 178 PVTLNYYGFG----------------------------------------GGE-------------------------VL 192 (498) T ss_pred eEEEeeccCc----------------------------------------ccc-------------------------cc Confidence 4433210000 000 00 Q ss_pred cccccceEEEecccccCcccceEEecCCccccccccccccccccccchhHHHHHHhhhcccccccEEecCcCCcchhhHH Q lcl|NC_012740. 315 ARGSSQYIYATAQGWVDGFSGIISLAGGVSANEASTGDRGNDPFIGAMMQGWDLFAERESIHVNLLIAGACAGEGDAFST 394 (667) Q Consensus 315 ~~~~s~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~ 394 (667) .. .-.+.++ .+++|. ...+....+.++.. ..++++++|-.. .+ T Consensus 193 p~--Glt~tit------------amsgGa--------------g~PDia~alaal~~---~~~~~i~~p~~D------~a 235 (498) T protein:vir:44 193 PA--GVNITVA------------SGVKGA--------------GAPALNDAVAAMGD---EPFDYIGLPFND------TA 235 (498) T ss_pred cc--ceeEEEE------------cccCCc--------------cCchhHHHHHhhcc---CCccEEEEeecC------HH Confidence 00 0001111 122221 11234444455543 356777776431 12 Q ss_pred HHHHHHHHHh---------hcCcEEEEEccCccccccccccCCHHHHHHHhhhhccccccccccCcceEEEEehhhcccc Q lcl|NC_012740. 395 VQKHAVSIGD---------ERQDCLVMVSPPRSTVVNIPVTTAIDNLIAWREGNSNYSDNNMNINTTYAVIDGNYKYQYD 465 (667) Q Consensus 395 v~~~~~~~~~---------~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~v~d 465 (667) -..++.+|++ ++++.+++... .-+..++.++.. ..|+.|..+.+...-. T Consensus 236 sl~al~~~L~~~sgRw~~~~q~~g~~~~a~----------~gT~a~l~t~g~----------~~N~~~it~~~~~~~~-- 293 (498) T protein:vir:44 236 SVNSMATEMNDSSGRWSYVRQLYGHVYTAK----------TGTLSELVAAGD----------QFNLQHITLAGYEKDT-- 293 (498) T ss_pred HHHHHHHHHhhhhcchHHHhhcCeEEEEec----------cCCHHHHHHhhh----------ccCCceEEEEecCCCC-- Confidence 2233444442 24455555432 224556666654 3567777664321110 Q ss_pred cccCceeEechH---HHHHHHHH---HhhhcCCceeeecc-eeccceeccc--cccccCChhhhhhhhhcCceEEEEecC Q lcl|NC_012740. 466 KYNDVNRWVPLA---ADIAGLCA---RTDAVSQPWMSPAG-YNRGQIMNVV--KLAIEPRKAHRDRLYQAAINPVIGAGG 536 (667) Q Consensus 466 ~~~~~~~~~p~s---g~vAg~~a---~~d~~~g~~~span-~~~~~i~g~~--~~~~~~~~~e~~~Ln~~gIn~i~~~~~ 536 (667) .-|+. +.+|++.+ +.|..| |-+ .. +.|+. .+.-.++..|++.|..+||.++..-.| T Consensus 294 -------~sp~~~~AAa~a~~aA~~l~~DPAr-----PL~tl~---L~Gi~~p~~~~r~~~~ern~LL~~Gist~~V~~G 358 (498) T protein:vir:44 294 -------QTPADELAASRTARAAVFIRNDPAR-----PTQTGE---LVDMLPAPKGKRFTTTEQQTLLSHGVATAYVESG 358 (498) T ss_pred -------CCHHHHHHHHHHHHHHHHhhccccc-----ccCcee---ecccccCCchhcCChHHHHHHHhcCcceEEEcCC Confidence 01322 33444444 444433 322 22 33443 445667899999999999999976544 Q ss_pred CeEEEEcceec-------CCCcccceeeehhhhhHHHHHHHHHHHHH-HhcCCCCH-----------HHHHHHHHHHHHH Q lcl|NC_012740. 537 EGFILMGDKTA-------TTVPSPFDRINVRRLFNMLKKNIGDSSKY-KLFENNDN-----------FTRASFRMEVSQY 597 (667) Q Consensus 537 ~G~~~wG~rT~-------~~~~~~~~~i~vrR~~~~i~~~l~~~~~~-~v~epn~~-----------~~~~~i~~~i~~~ 597 (667) + ..+--..|. ..| ..|..|+..|+.+|+++.++..... |--+..-+ -+-..||..+-.- T Consensus 359 ~-V~I~R~ITTY~~n~~G~~D-~syLDi~T~~tl~yvr~~~r~~i~~kfpR~KLa~d~~~~~~gq~IvTp~~ir~eli~~ 436 (498) T protein:vir:44 359 V-LRIQRDITTYRKNAYGVAD-NSYLDSETLHTSAYVLRRLKSVITSKYGRHKLANDGTRFGSGQAIVTPAVIRGELGST 436 (498) T ss_pred e-EEEEeeeeeeeecCCCCcc-hhhhhhhhHHHHHHHHHHHHHHhhhhcCCcccccCCcccCCCcccccHHHHHHHHHHH Confidence 3 444444443 234 4699999999999999999988742 32222211 2677899999999 Q ss_pred HHHHHhcCCeeee---E--EEEcccCCCHHHhhCCeEEEEEEEEecCCceEEEEEEEEeecCeeHHHHHHHHhcC Q lcl|NC_012740. 598 LSTIRSLGGIYDF---R--VQCDTTNNTPDVIDRNEFVASMFIKPAKSINYIMLNFTAVATGADFDEIIGPANQA 667 (667) Q Consensus 598 l~~l~~~gal~g~---~--v~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~~~~~~~ 667 (667) +++|...|-+..+ + +.+.++-+. ..|+.+.+-...+-...-+- ...++.-+.+++ T Consensus 437 y~~le~~givEn~~~~~~~LiVerd~~d-----pnRln~~~p~d~vn~L~V~A----------~~~~f~lq~~~~ 496 (498) T protein:vir:44 437 YRQMEREGIVENFDLFQQHLIVERNAND-----SNRLDVLFPPDYVNQLRVFA----------VLNQFRLQYSEE 496 (498) T ss_pred HHhhhhhccccChhhhcceeEEEECCCC-----CcEEEEEecccccCchhhhh----------hhhhhhhhhhhh Confidence 9999999988874 2 334333221 24666665444444433221 122223333333 No 53 >protein:vir:3788 Length: 376 # NCBI annotation: tail sheath # Family: family:all:669 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536828;genbank:gi:17981837;genbank:GeneID:929216 Probab=99.08 E-value=1.2e-10 Score=74.97 Aligned_cols=351 Identities=10% Similarity=0.020 Sum_probs=177.5 Q ss_pred ceeeeeeeecccccccceeeeeccccccccceeeeeccceeeeeEeeeccCCcccc-----ccccccchhhhcccccceE Q lcl|NC_012740. 248 VAPELTMYPFGGTRAAARNLIPYAPQNDNQYAFIVRRDGVVVESYVLSTLKGDKDV-----YGNSIYMDDFFARGSSQYI 322 (667) Q Consensus 248 ~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~v~e~~~~s~~~~~~~~-----~~~~~~~~~~~~~~~s~~v 322 (667) .-+..+++..+.-++.. . -+|++.+-...+.+.. ...+.-.+..+....|.+. T Consensus 1 ~~~~v~vn~~n~~~g~~--------~--------------~~er~~Lfig~~~~~~~~~~~~~~~sdld~~lg~~~~~lk 58 (376) T protein:vir:37 1 MFPSVQINALNQLSGET--------K--------------EIERHALFVGVGTTNQGKLLALTPDSDFDKVFGETDTDLK 58 (376) T ss_pred CCCeEEEecccccCCCc--------c--------------cccceEEeeccccccccceeeecCccchHhhhCCCchHHH Confidence 11111111111100000 0 0122222111111110 0000111111111111110 Q ss_pred EEecccccCcccceEEecCCccccccccccccccccccchhHHHHHHhhhcccccccEEecCc-CCcchhhHHHHHHHHH Q lcl|NC_012740. 323 YATAQGWVDGFSGIISLAGGVSANEASTGDRGNDPFIGAMMQGWDLFAERESIHVNLLIAGAC-AGEGDAFSTVQKHAVS 401 (667) Q Consensus 323 ~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-~~~~~~~~~v~~~~~~ 401 (667) .-.. . ..+.+|.+=.. ....-.....++.++.+.. .+.+.+-.+..-+. ..+.....+.++.... T Consensus 59 ~~v~--a-------a~~naG~~~~~---~~~~~~~~~~~~~~Av~~a--~~~~s~E~V~v~~pv~t~~a~i~aa~~~a~e 124 (376) T protein:vir:37 59 KQVR--A-------AMLNAGQNWFA---HVYIAQEDGYDFVECVKKA--NQTASFEYCVNTRYLGVDKASIGKLQECYAE 124 (376) T ss_pred HHHH--H-------HHhCCCCcEEE---EEEeecCCchHHHHHHHHh--hhhcCceEEEEeccccccHHHHHHHHHHHHH Confidence 0000 0 00111110000 0000001112344444443 33455554443332 2223333444444444 Q ss_pred HHhh-cCcEEEEEccCccccccccccCCHHHHHHHhhhhccccccccccCcceEEEEe-hhhcccccccCceeEechHHH Q lcl|NC_012740. 402 IGDE-RQDCLVMVSPPRSTVVNIPVTTAIDNLIAWREGNSNYSDNNMNINTTYAVIDG-NYKYQYDKYNDVNRWVPLAAD 479 (667) Q Consensus 402 ~~~~-~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~-p~~~v~d~~~~~~~~~p~sg~ 479 (667) +..+ +|-.+.++..+.... +...+.+..+-.+.+. ....++.+.+..++. -| -...|. T Consensus 125 l~~~~~Rpv~file~r~~~~-~~~~~e~w~~y~~~~~------al~~gia~~~V~~V~~~~-------------gn~~G~ 184 (376) T protein:vir:37 125 LLAKFGRRTFFIQAVQGINH-DQSDGETWDQYVQKLT------TLQQTIVADHVCLVPLLF-------------GNETGV 184 (376) T ss_pred HHHhcCCeEEEEEeccCcCc-ccccccCHHHHHHHHH------Hhhcccccccceeeeeeh-------------hhhHHH Confidence 4444 577788877652110 1111233333222222 222345555544321 11 122577 Q ss_pred HHHHHHHhhhcCCceeeecceeccceecccccc-------ccCChhhhhhhhhcCceEEEEecCC-eEEEEcceecCCCc Q lcl|NC_012740. 480 IAGLCARTDAVSQPWMSPAGYNRGQIMNVVKLA-------IEPRKAHRDRLYQAAINPVIGAGGE-GFILMGDKTATTVP 551 (667) Q Consensus 480 vAg~~a~~d~~~g~~~span~~~~~i~g~~~~~-------~~~~~~e~~~Ln~~gIn~i~~~~~~-G~~~wG~rT~~~~~ 551 (667) +||.+++. ..-++.||.-...+.+.|....+ ..++...++.|..+|..+.+.++|. |+.+-++||++... T Consensus 185 ~aGRl~~a--aVsVadspgRV~tG~l~gl~~~~lp~d~~~~~l~~a~l~aLd~agy~vp~~Y~gy~G~Y~~d~~tl~~~g 262 (376) T protein:vir:37 185 LAGRLANR--AVTVADSPARVQTGALVSLGSANKPLDKDRNELTLAHLKSLETARYSVPMWYPDYDGYYWADGRTLDVEG 262 (376) T ss_pred HHHHHhhc--ccchhhCccceeccccccccccccccCcCcccCCHHHHHHHHhCCCeEEEeeCCCCceEEeCceEeccCC Confidence 88877643 22367889887766676653332 2456788999999999999999984 99999999999888 Q ss_pred ccceeeehhhhhHHHHHHHHHHHHHHhcCCC---CHHHHHHHHHHHHHHHHHHHhcCCeeee----EEEEcccCC-CHHH Q lcl|NC_012740. 552 SPFDRINVRRLFNMLKKNIGDSSKYKLFENN---DNFTRASFRMEVSQYLSTIRSLGGIYDF----RVQCDTTNN-TPDV 623 (667) Q Consensus 552 ~~~~~i~vrR~~~~i~~~l~~~~~~~v~epn---~~~~~~~i~~~i~~~l~~l~~~gal~g~----~v~~d~~~n-t~~~ 623 (667) +++++|..+|.+|.+.+.++..+-.++...- .+.-.+..+.-+..=|+.|.+..-+.|. +|...++.+ ++.- T Consensus 263 sDY~~ie~~RVvdKa~R~vR~~ai~~i~D~~lnst~~sia~~~~yi~~pLr~M~~s~~i~g~~fpGeI~~p~d~Di~i~w 342 (376) T protein:vir:37 263 GDYQVIENLRVVDKVARKVRLLAIGKIADRSFNSTTSSTEYHKNYFAKPLRDMSKSATINGKDFPGECMPPKDDAITIVW 342 (376) T ss_pred CChhhhhhhhHHHHHHHHHHHHHHHHhCCcccCcchhhHHHHHHHHHHHHHHHHhcchhccccccceeecCCCCCceEEe Confidence 9999999999999999999988877765422 3444556666677789999888888873 344433221 2233 Q ss_pred hhCCeEEEEEEEEecCCceEEEEEEEEeec-Cee Q lcl|NC_012740. 624 IDRNEFVASMFIKPAKSINYIMLNFTAVAT-GAD 656 (667) Q Consensus 624 i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~-~~~ 656 (667) +...++.|.+.+.|.--..+|+..|--.-. ..+ T Consensus 343 ~s~~~V~I~~~v~P~~~pk~Itv~I~Ldlsn~~~ 376 (376) T protein:vir:37 343 QSKTKVTIYIKVRPYDCPKEITANIFLDLDSLGE 376 (376) T ss_pred eccceEEEEEEEEeccCCceEEEEEEeecCCCCC Confidence 477889999999999988999866443222 222 No 54 >protein:vir:276 Length: 369 # NCBI annotation: putative tail sheath protein # Family: family:all:669 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536656;genbank:gi:17975134;genbank:GeneID:929090 Probab=99.01 E-value=2.3e-09 Score=67.92 Aligned_cols=345 Identities=13% Similarity=0.064 Sum_probs=181.8 Q ss_pred cccceeeeecc----ce--eeeeEeeeccCCcc-ccccc------cccchhhhcccccceEEEecccccCcccceEEecC Q lcl|NC_012740. 275 DNQYAFIVRRD----GV--VVESYVLSTLKGDK-DVYGN------SIYMDDFFARGSSQYIYATAQGWVDGFSGIISLAG 341 (667) Q Consensus 275 ~~~~~~~v~~~----g~--v~e~~~~s~~~~~~-~~~~~------~~~~~~~~~~~~s~~v~~~~~~~~~~~~~~~~~~~ 341 (667) -..-++++..- |. -+|.+.+-...+.. ...+. +.-.+..+....|.+.. .... ..+.+ T Consensus 1 m~~~~V~in~~n~~qg~~~~ver~~lfig~g~~~~~~g~~~~~~~~sdld~~lg~~ds~lk~--------~v~a-a~~na 71 (369) T protein:vir:27 1 MAWPTVIIKILNLMNGPIADIECHFLFVIRGTVSGEVRNLIMVDSTSDLDDVLAEASAEGLA--------IVKA-AQLNG 71 (369) T ss_pred CCCCceEEecccccCCCcccccceEEEEEeccccccccceEEecCccchHhhcCCcChhHHH--------HHHH-HHhCC Confidence 11112232211 11 13444332222221 11111 11111222222222100 0000 01111 Q ss_pred CccccccccccccccccccchhHHHHHHhhhcccccccEEecCcCCcchhhHHHHHHHHHHHhh-cCcEEEEEccCcccc Q lcl|NC_012740. 342 GVSANEASTGDRGNDPFIGAMMQGWDLFAERESIHVNLLIAGACAGEGDAFSTVQKHAVSIGDE-RQDCLVMVSPPRSTV 420 (667) Q Consensus 342 g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~v~~~~~~~~~~-~~~~~ai~d~p~~~~ 420 (667) |.+-.... .......++.++.+.. .+.+++-.+.+-+-..+.....+.++....+-.+ +|-.+.++..+... T Consensus 72 G~~w~a~~----~p~~~~~~~~~Av~~a--~~~~s~E~V~v~~p~t~~a~i~aaq~~a~el~~~~~R~vffi~e~~~~~- 144 (369) T protein:vir:27 72 KQAWTAGV----MILSEEDNWQDAVKKA--NEVSSFEFVVLGFDAETKAMIEDAITLRTELKNSLGREVGVLCQLPAIN- 144 (369) T ss_pred CCceEEEE----EEeCCchhHHHHHHhh--hhhCCccEEEEecCcccHHHHHHHHHHHHHHHHhcCCeEEEEEeccccC- Confidence 11111000 0011223455555443 3445555555544322222333444444444444 46677777654211 Q ss_pred ccccccCCHHHHHHHhhhhccccccccccCcceEEEEehhhcccccccCceeEechHHHHHHHHHHhhhcCCceeeecce Q lcl|NC_012740. 421 VNIPVTTAIDNLIAWREGNSNYSDNNMNINTTYAVIDGNYKYQYDKYNDVNRWVPLAADIAGLCARTDAVSQPWMSPAGY 500 (667) Q Consensus 421 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~v~d~~~~~~~~~p~sg~vAg~~a~~d~~~g~~~span~ 500 (667) -+...+.+.. +|..... ....++.+.|..++--+... -.-.|.+||.++.. ..-++.||+-. T Consensus 145 ~~~~~~e~w~---dy~a~l~---al~~g~a~~~V~vv~~~~~~----------gn~~G~~aGRl~n~--aVsIadsp~RV 206 (369) T protein:vir:27 145 NDPTNGQTWS---EWLADTV---DIPKDVASEYISVVPNVHAA----------GDTLGKYAGRLANK--EVSIADSPARV 206 (369) T ss_pred CCccccCCHH---HHHHHHH---HHhhccCcccceeeeeeccc----------cchHHHHHHHHHhc--ccchhcCccee Confidence 0111223333 3333222 22345677777776322211 12357788887642 22358889877 Q ss_pred eccceeccccccc-----cCChhhhhhhhhcCceEEEEecCC-eEEEEcceecCCCcccceeeehhhhhHHHHHHHHHHH Q lcl|NC_012740. 501 NRGQIMNVVKLAI-----EPRKAHRDRLYQAAINPVIGAGGE-GFILMGDKTATTVPSPFDRINVRRLFNMLKKNIGDSS 574 (667) Q Consensus 501 ~~~~i~g~~~~~~-----~~~~~e~~~Ln~~gIn~i~~~~~~-G~~~wG~rT~~~~~~~~~~i~vrR~~~~i~~~l~~~~ 574 (667) ..+.+.|...+.. .++.+.++.|..+|..+.+.++|. |+.+-++||++...+++++|..+|.+|.+.+.++..+ T Consensus 207 ktG~l~g~~~~p~d~~g~~l~~a~l~aLd~agysvp~~Y~gy~G~Yw~d~~tl~~~gsDYq~iE~~RVvdKa~R~vR~~A 286 (369) T protein:vir:27 207 QTGSVLGNTELMKDKAGKALDLATLKALESNRIAVPMWYPDYPGQYWTTGRTLDVPGGDYQDIRHIRVAMKAARKVRIRA 286 (369) T ss_pred eecccccccccccCCCCcccCHHHHHHHHhCCCeEEEeeCCCCceEEeCceEeccCCCCeehhhhhhHHHHHHHHHHHHH Confidence 7666666543322 255678889999999999999984 9999999999988899999999999999999998887 Q ss_pred HHHhcCCC---CHHHHHHHHHHHHHHHHHHHhcCCeeeeEEEEcccC-CCHHHhhCCeEEEEEEEEecCCceEEEEEEEE Q lcl|NC_012740. 575 KYKLFENN---DNFTRASFRMEVSQYLSTIRSLGGIYDFRVQCDTTN-NTPDVIDRNEFVASMFIKPAKSINYIMLNFTA 650 (667) Q Consensus 575 ~~~v~epn---~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~-nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~ 650 (667) -..+..|- ++.-.+..+..+..=|+.|.+.+ +..+|.-.++. -+..-....++.|-+.+.|.--...|+.+|.- T Consensus 287 i~~i~Dr~lnstp~sia~~~~~~~~pLr~M~ks~--fpgei~~P~d~dI~i~w~~k~~V~I~~~vrP~~~pk~it~~I~l 364 (369) T protein:vir:27 287 IARIADRTLNSTPQSIAAAKLYFTQDLRTMALTG--VPGEIYPPEDEDIQIKWVNSTDVEIYMSVQPYECPVKITIAISV 364 (369) T ss_pred HHHhcCcccccChhHHHHHHHHHhhHHHHHHhhc--CCeEEecCCCCceEEEeeccceEEEEEEEeeccCCceEEEEEEE Confidence 76666543 55667777888888888887664 22333332211 01111145577788888888888889988877 Q ss_pred eecCe Q lcl|NC_012740. 651 VATGA 655 (667) Q Consensus 651 ~~~~~ 655 (667) .-+.. T Consensus 365 dl~~~ 369 (369) T protein:vir:27 365 KQGDY 369 (369) T ss_pred eccCC Confidence 66655 No 55 >protein:vir:3751 Length: 376 # NCBI annotation: orf23 # Family: family:all:669 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043492;genbank:gi:9628627;genbank:GeneID:1261129 Probab=99.01 E-value=6e-10 Score=71.13 Aligned_cols=357 Identities=11% Similarity=0.039 Sum_probs=181.5 Q ss_pred cccceeEEEEeecccccccceeeeeeeecccccccceeeeeccccccccceeeeeccceeeeeEeeeccCCccccccccc Q lcl|NC_012740. 229 EIGNSLEVEILARSSFSGAVAPELTMYPFGGTRAAARNLIPYAPQNDNQYAFIVRRDGVVVESYVLSTLKGDKDVYGNSI 308 (667) Q Consensus 229 ~~gn~i~v~i~~~a~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~v~e~~~~s~~~~~~~~~~~~~ 308 (667) -|| +|++......+..... -+.+-+.++.+-. +.+......++. T Consensus 1 ~~~---~v~vn~ln~~qg~~~~------------------------ver~~lfig~~~~---------~~~~~~~~~~~s 44 (376) T protein:vir:37 1 MFP---SVQINALNQLSGETKE------------------------IERHALFVGVGTT---------NQGKLLALTPDS 44 (376) T ss_pred CCC---eEEEeeeeccCCCccc------------------------ccceEEEeecccc---------ccCceEEecCCC Confidence 222 1222111111100000 0011111111100 000000011111 Q ss_pred cchhhhcccccceEEEeccc-ccCcccceEEecCCccccccccccccccccccchhHHHHHHhhhcccccccEEecCcCC Q lcl|NC_012740. 309 YMDDFFARGSSQYIYATAQG-WVDGFSGIISLAGGVSANEASTGDRGNDPFIGAMMQGWDLFAERESIHVNLLIAGACAG 387 (667) Q Consensus 309 ~~~~~~~~~~s~~v~~~~~~-~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~ 387 (667) -.+..+....|.+..-.... .-.+........ .-.....+..++.+.. .+.+++-.+.+-+... T Consensus 45 dld~~lg~~ds~lk~~v~aa~~naG~~w~a~~~-------------~p~~~~~~~~~Av~~a--~~~~s~E~V~v~~p~~ 109 (376) T protein:vir:37 45 DFDKVFGETDTDLKKQVRAAMLNAGQNWFAHVY-------------IAQEDGYDFVECVKKA--NQTASFEYCVNTRYLG 109 (376) T ss_pred ChHHhhCCCchhHHHHHHHHHhCCCCceEEEEE-------------ecCCChhhHHHHHHHH--HhhCCeeEEEEecCcc Confidence 11112222222211000000 000011100000 0001123455665554 3445665555544321 Q ss_pred -cchhhHHHHHHHHHHHhh-cCcEEEEEccCccccccccccCCHHHHHHHhhhhccccccccccCcceEEEEehhhcccc Q lcl|NC_012740. 388 -EGDAFSTVQKHAVSIGDE-RQDCLVMVSPPRSTVVNIPVTTAIDNLIAWREGNSNYSDNNMNINTTYAVIDGNYKYQYD 465 (667) Q Consensus 388 -~~~~~~~v~~~~~~~~~~-~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~v~d 465 (667) +.....+.+.....+..+ +|-.|.++..+.-. .+...+.+..+ |..... ....++.+.+..++-. +. T Consensus 110 t~~a~i~a~qa~a~el~~~~~R~vffile~~g~d-~~~~~ge~w~~---y~~~l~---a~~~gia~~~V~vV~~-~~--- 178 (376) T protein:vir:37 110 VDKASIGKLQECYAELLAKFGRRTFFIQAVQGIN-HDQSDGETWDQ---YVQKLT---TLQQTIVADHVCLVPL-LF--- 178 (376) T ss_pred hhHHHHHHHHHHHHHHHHhcCCeEEEEEeccCCC-CcccccCCHHH---HHHHHH---HHhccccccceeeeee-ec--- Confidence 222223333333333344 46677787765211 01112233333 333222 2234566777766532 11 Q ss_pred cccCceeEechHHHHHHHHHHhhhcCCceeeecceeccceecccccc-------ccCChhhhhhhhhcCceEEEEecCC- Q lcl|NC_012740. 466 KYNDVNRWVPLAADIAGLCARTDAVSQPWMSPAGYNRGQIMNVVKLA-------IEPRKAHRDRLYQAAINPVIGAGGE- 537 (667) Q Consensus 466 ~~~~~~~~~p~sg~vAg~~a~~d~~~g~~~span~~~~~i~g~~~~~-------~~~~~~e~~~Ln~~gIn~i~~~~~~- 537 (667) + ...|.+||.++.. ..-++.||.-..-+.|.|+..++ ..++....+.|..+|..+.+.++|. T Consensus 179 ---g-----n~~G~~aGRl~na--aVsVadspgRV~tGai~gl~~~~~p~d~~g~el~~a~l~aLd~arysvpr~Y~gyd 248 (376) T protein:vir:37 179 ---G-----NETGVLAGRLANR--AVTVADSPARVQTGALVSLGSANKPLDKDGNELTLAHLKSLETARYSVPMWYPDYD 248 (376) T ss_pred ---c-----chHHHHHHHHHhC--CcchhcCccceeecccccccccccccccCCcccchHHHHHHHhCCCeEEEeeCCCC Confidence 1 2368888887642 33368899888777776654332 2345678888999999999999984 Q ss_pred eEEEEcceecCCCcccceeeehhhhhHHHHHHHHHHHHHHhcCC---CCHHHHHHHHHHHHHHHHHHHhcCCeeeeE--- Q lcl|NC_012740. 538 GFILMGDKTATTVPSPFDRINVRRLFNMLKKNIGDSSKYKLFEN---NDNFTRASFRMEVSQYLSTIRSLGGIYDFR--- 611 (667) Q Consensus 538 G~~~wG~rT~~~~~~~~~~i~vrR~~~~i~~~l~~~~~~~v~ep---n~~~~~~~i~~~i~~~l~~l~~~gal~g~~--- 611 (667) |+.+-++||++...+++++|..+|.+|.+.+.++..+-..+..+ .++.-.+..+..++.=|+.|.+.+.|.|.. T Consensus 249 G~Yw~dg~tl~~~gsDYq~ie~~RVvdKa~R~vR~~Ai~~i~Dr~lnstp~sia~~~~~~~~pLr~M~ks~ei~g~~fpg 328 (376) T protein:vir:37 249 GYYRADGRTLDVEGGDYQVIENLRVVDKVARKVRLLAIGKIADRSFNSTTSSTEYHKNYFAKPLRDMSKSATINGKDFPG 328 (376) T ss_pred ceEEeCCeEeccCCCCeeeehhchHHHHHHHHHHHHHHHHhcCccccCChhHHHHHHHHHhHHHHHHHhhhhhccccccc Confidence 99999999999888999999999999999999988776666543 367778888999999999999999999843 Q ss_pred -EEEcccC-CCHHHhhCCeEEEEEEEEecCCceEEEEEEEEeecCeeHHH Q lcl|NC_012740. 612 -VQCDTTN-NTPDVIDRNEFVASMFIKPAKSINYIMLNFTAVATGADFDE 659 (667) Q Consensus 612 -v~~d~~~-nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~e 659 (667) |+..++. -+..-....++.|-+.+.|.--...|+..|--.-.. .-| T Consensus 329 ei~~P~d~dI~i~w~sk~~V~I~~~vrPy~cpk~i~~~I~LDls~--~~~ 376 (376) T protein:vir:37 329 ECMPPKDDAITIVWQSKTKVTIYIKVRPYDCPKEITANIFLDLDS--LGE 376 (376) T ss_pred eeecCCCCceEEEeccCceEEEEEEEeeecCcceeEEEEEEecCC--CCC Confidence 3332210 011111356677888888887677788776543321 112 No 56 >protein:vir:1996 Length: 495 # NCBI annotation: major tail subunit # Family: family:all:369 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050643;genbank:gi:9633530;genbank:GeneID:2636269 Probab=98.99 E-value=4.7e-09 Score=66.26 Aligned_cols=438 Identities=10% Similarity=0.060 Sum_probs=210.7 Q ss_pred CceecCceEEEEecCCCc-ccccCCCceEEEeec---cCCCCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHHc-CC Q lcl|NC_012740. 1 MTLLSPGFETKETTLSTT-IVQSATGRAALVGKF---QWGPAFQIVQVTNEVELVNKFGQPDNNTADYFMSGANFLQ-YG 75 (667) Q Consensus 1 ~~~~~PGVyvee~~~~~~-~~~~~ts~~afvG~~---~~Gp~~~p~~i~s~~e~~~~FG~~~~~~~~~~~v~~~f~n-gG 75 (667) =....||+|+|--++... ....-....-+||.. ...|.++|++|+|.+|-...||. .+.+..+++.|..+ -- T Consensus 11 ~~iRvP~~y~E~dns~A~~g~~~~~q~vLiiGq~la~gs~~~~~pv~v~s~~~a~~~fG~---GS~la~M~~a~~~~n~~ 87 (495) T protein:vir:19 11 SDVRVPLTYIEFDNSNAVSGTPAPRQRVLMFGQSGSKASAAPNVPVRIRSGSQASAAFGQ---GSMLALMADAFLNANRV 87 (495) T ss_pred cccccCeEEEEEccCCCCcCCcCCCceEEEEEecCcccccccceeEEecCHHHHHHhcCc---CcHHHHHHHHHHHhCCc Confidence 567799999985443322 222234455678874 34578999999999999999996 56666677777664 55 Q ss_pred CeEEEEEcCCcccccccccccccccceeeeccccccccceeeEeeeccceeccccceeecccccceeeeecccccccccc Q lcl|NC_012740. 76 NDLRVVRVLNKEKAKNATALAGNVEFEITNEGSNYEVGDTIKIKHNRQDIETAGKVTKVDGDGKVKGVFIPTGKIIAHAK 155 (667) Q Consensus 76 ~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~a~ 155 (667) .++|++-+.|. ....++.. + +.+... +....+.+.+.. T Consensus 88 ~~l~~i~~~D~-aG~aA~g~---i--t~tg~a---t~~G~l~l~I~g--------------------------------- 125 (495) T protein:vir:19 88 AELWCIPQGNG-TGNAAVGE---I--SLSGTA---GENGSLVTYIAG--------------------------------- 125 (495) T ss_pred ceEEEEeeCCh-hhceeEEE---E--EEeecC---CCCcEEEEEECC--------------------------------- Confidence 89999998653 11111100 0 000000 000001111000 Q ss_pred cccccccccccceEEEEEeecccccceeeeceeeeceeeeeeccccchhhhccccccccccccceeeeeecc--ccccce Q lcl|NC_012740. 156 AIGVYPELDGGWTAEFTSSSGNGSAALSVTKIVTDSGLLLTDLETSRANITNQDFLTKLKKYDMPAVSAIYA--GEIGNS 233 (667) Q Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~A~~~--G~~gn~ 233 (667) -.+.+....+ +++ ..+..+..........+|...+... +..+.. T Consensus 126 -----------~~v~v~V~~g----------------------dTa-a~vA~al~aaina~~~lPvTA~~~~~~~~~~a~ 171 (495) T protein:vir:19 126 -----------QRLAVSVAAG----------------------ATG-AALADLLVARIKGQPDLPVTAEVRADSGDDDTH 171 (495) T ss_pred -----------EEEEEEecCC----------------------CCH-HHHHHHHHHHhcCCccCceEEEeeccCCCCcCc Confidence 0000000000 000 0000000000001111111111100 000000 Q ss_pred eEEEEeecccccccceeeeeeeecccccccceeeeeccccccccceeeeeccceeeeeEeeeccCCccccccccccchhh Q lcl|NC_012740. 234 LEVEILARSSFSGAVAPELTMYPFGGTRAAARNLIPYAPQNDNQYAFIVRRDGVVVESYVLSTLKGDKDVYGNSIYMDDF 313 (667) Q Consensus 234 i~v~i~~~a~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~v~e~~~~s~~~~~~~~~~~~~~~~~~ 313 (667) ...+++.+..|.. .-.++..+ ++. T Consensus 172 ------------------------------------------~~VtlTAr~kG~~-n~idi~~~-----------~~~-- 195 (495) T protein:vir:19 172 ------------------------------------------ADVVLSAKFTGAL-SAVDVRWN-----------YYA-- 195 (495) T ss_pred ------------------------------------------eeEEEEEeecccc-ccceeEEE-----------eec-- Confidence 0012222222210 00000000 000 Q ss_pred hcccccceEEEecccccCcccc-eEEecCCccccccccccccccccccchhHHHHHHhhhcccccccEEecCcCCcchhh Q lcl|NC_012740. 314 FARGSSQYIYATAQGWVDGFSG-IISLAGGVSANEASTGDRGNDPFIGAMMQGWDLFAERESIHVNLLIAGACAGEGDAF 392 (667) Q Consensus 314 ~~~~~s~~v~~~~~~~~~~~~~-~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~ 392 (667) .+..+.+... ...++||.. ..+....+.++. ...++++++|-.. T Consensus 196 ------------ge~~p~Glt~titamsgGag--------------~PDia~alaal~---~~~~~~I~~P~tD------ 240 (495) T protein:vir:19 196 ------------GETTPYGIITAFKAASGKNG--------------NPDISASIAGMG---DLQYKYIVMPYTD------ 240 (495) T ss_pred ------------ccccccceeEEEEecCCCCC--------------CcchHHHHHHhc---cCCCcEEEEecCc------ Confidence 0000111111 111233321 123445555554 3467788886431 Q ss_pred HHHHHHHHHHHhh------cCcEEEEEccCccccccccccCCHHHHHHHhhhhccccccccccCcceEEEEehhhccccc Q lcl|NC_012740. 393 STVQKHAVSIGDE------RQDCLVMVSPPRSTVVNIPVTTAIDNLIAWREGNSNYSDNNMNINTTYAVIDGNYKYQYDK 466 (667) Q Consensus 393 ~~v~~~~~~~~~~------~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~v~d~ 466 (667) .+-..++.+|++. +++++++.-. .-+..+..+|.. ..|+.|..+.+ + T Consensus 241 ~asL~al~~~l~~rw~~~~q~~g~~~~a~----------~gT~~~l~t~g~----------~~N~~~it~~~--~----- 293 (495) T protein:vir:19 241 EPNLNLLRTELQERWGPVNQADGFAVTVL----------SGTYGDISTFGV----------SRNDHLISCMG--I----- 293 (495) T ss_pred HHHHHHHHHHHHHhhhHHHhcCeEEEEee----------cCCHHHHHHhhh----------ccCCceEEEEe--c----- Confidence 2334567777765 3445555432 124455555554 35677766643 1 Q ss_pred ccCceeEechHH---HHHHHHH---HhhhcCCceeeecc-eeccceeccc--cccccCChhhhhhhhhcCceEEEEecCC Q lcl|NC_012740. 467 YNDVNRWVPLAA---DIAGLCA---RTDAVSQPWMSPAG-YNRGQIMNVV--KLAIEPRKAHRDRLYQAAINPVIGAGGE 537 (667) Q Consensus 467 ~~~~~~~~p~sg---~vAg~~a---~~d~~~g~~~span-~~~~~i~g~~--~~~~~~~~~e~~~Ln~~gIn~i~~~~~~ 537 (667) ++. .-||.. .+|+..+ +.|..| |-+ .. +.|+. .+.-.++..|++.|..+||.++..-.+. T Consensus 294 -~gs--p~~~~~~AAA~aa~~A~~l~~DPAr-----PL~tl~---L~Gi~~p~~~~r~~~~ern~LL~~Gist~~V~~~G 362 (495) T protein:vir:19 294 -AGA--PEPSYLYAATLCAVASQALSIDPAR-----PLQTLT---LPGRMPPAVGDRFTWSERNALLFDGISTFNVNDGG 362 (495) T ss_pred -CCC--CCcHHHHHHHHHHHHHHHhhccccc-----ccCcee---ecceecCCccccCChHHHHHHHhCCcceEEECCCC Confidence 111 123333 3333332 344433 322 23 33443 4455678999999999999999765544 Q ss_pred eEEEEcceec-------CCCcccceeeehhhhhHHHHHHHHHHHHH-HhcCCCCHH-----------HHHHHHHHHHHHH Q lcl|NC_012740. 538 GFILMGDKTA-------TTVPSPFDRINVRRLFNMLKKNIGDSSKY-KLFENNDNF-----------TRASFRMEVSQYL 598 (667) Q Consensus 538 G~~~wG~rT~-------~~~~~~~~~i~vrR~~~~i~~~l~~~~~~-~v~epn~~~-----------~~~~i~~~i~~~l 598 (667) =..+--..|. ..| ..|..|++-|+.+|+++.++..... |--+..-+. +-..||..+-+-+ T Consensus 363 ~V~I~R~ITTY~~n~~G~~D-~syLDi~T~~tl~yvr~~~r~~i~~kfpR~KLa~d~~~~~~gq~IvTp~~ir~ell~~~ 441 (495) T protein:vir:19 363 EMQIERMITMYRTNKYGDSD-PSYLNVNTIATLSYLRYSLRTRITQKFPNYKLASDGTRFATGQAVVTPSVIKTELLALF 441 (495) T ss_pred eEEEEeeeeeeeecCCCCcc-hhhhhhHHHHHHHHHHHHHHHHHhhhcCCcccccCCCCCCCcccccChHHHHHHHHHHH Confidence 3455444444 133 3699999999999999999987753 322322222 5678999999999 Q ss_pred HHHHhcCCeeee---E--EEEcccCCCHHHhhCCeEEEEEEEEecCCceEEEEEEEEee Q lcl|NC_012740. 599 STIRSLGGIYDF---R--VQCDTTNNTPDVIDRNEFVASMFIKPAKSINYIMLNFTAVA 652 (667) Q Consensus 599 ~~l~~~gal~g~---~--v~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~ 652 (667) ++|...|-+..+ + +.+.++-+. .+|+.+.+-...+-...-+-.+++-.= T Consensus 442 ~~le~~given~~~~~~~LiVerd~~d-----pnRln~~~p~d~vn~L~V~A~~i~f~L 495 (495) T protein:vir:19 442 EEWENAGLVEDFDTFKEELYVARNKDD-----KDRLDVLCGPNLINQFRIFAAQVQFIL 495 (495) T ss_pred HhhhhhccccChhhhcceeEEEECCCC-----CcEEEEEecceeeCceeeeeeeeeeeC Confidence 999999988874 2 333332221 257777776665555543333322111 No 57 >protein:vir:78782 Length: 370 # NCBI annotation: putative tail sheath protein # Family: family:all:669 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285652;genbank:gi:148727158;genbank:GeneID:5220132 Probab=98.92 E-value=3.3e-09 Score=67.10 Aligned_cols=355 Identities=10% Similarity=0.010 Sum_probs=171.0 Q ss_pred cccceeEEEEeecccccccceeeeeeeecccccccceeeeeccccccccceeeeeccceeeeeEeeeccCCccccccccc Q lcl|NC_012740. 229 EIGNSLEVEILARSSFSGAVAPELTMYPFGGTRAAARNLIPYAPQNDNQYAFIVRRDGVVVESYVLSTLKGDKDVYGNSI 308 (667) Q Consensus 229 ~~gn~i~v~i~~~a~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~v~e~~~~s~~~~~~~~~~~~~ 308 (667) -|+ +|.+......+... ..-+.+-+.++....-..+. .....+. T Consensus 1 ~~~---~v~vn~~n~~~g~~------------------------~~~er~~lfig~~~~~~g~~---------~~~~~~s 44 (370) T protein:vir:78 1 MWP---YVQIYNLNQMQGPV------------------------TEVERHLLFIGSAASNTGKL---------LSLNAQS 44 (370) T ss_pred CCc---eEEEeeccccCCCc------------------------CccceeEEEEecccccccce---------EeecCcc Confidence 222 12221111111000 00001111111111000000 0000000 Q ss_pred cchhhhcccccceEEEecccccCcccceEEecCCccccccccccccccccccchhHHHHHHhhhcccccccEEecCcCCc Q lcl|NC_012740. 309 YMDDFFARGSSQYIYATAQGWVDGFSGIISLAGGVSANEASTGDRGNDPFIGAMMQGWDLFAERESIHVNLLIAGACAGE 388 (667) Q Consensus 309 ~~~~~~~~~~s~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~ 388 (667) -.+..+....|.+..-.. . ..+.+|.+=.... .......++.++.+.. .+.+++-.+.+-+-..+ T Consensus 45 dld~~l~~~ds~lk~~v~--a-------a~~naG~~~~~~~----~p~~~~~d~~~Av~~a--~~~~s~E~V~v~~~~s~ 109 (370) T protein:vir:78 45 DFDQLLGAADSELKANLL--A-------ARDNAGQNWSAAA----YVLPTDKPWLDAARDA--QQTQSFEGVVVLGQEWH 109 (370) T ss_pred CHHHhcCCcChhHHHHHH--H-------HHhCCCCceEEEE----EEecCchhHHHHHHHH--HhhCCccEEEEecCcch Confidence 111111111111100000 0 0000111000000 0001122455555444 34455555555443332 Q ss_pred chhhHHHHHHHHHHHhhc-CcEEEEEccCccccccccccCCHHHHHHHhhhhccccccccccCcceEEEEehhhcccccc Q lcl|NC_012740. 389 GDAFSTVQKHAVSIGDER-QDCLVMVSPPRSTVVNIPVTTAIDNLIAWREGNSNYSDNNMNINTTYAVIDGNYKYQYDKY 467 (667) Q Consensus 389 ~~~~~~v~~~~~~~~~~~-~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~v~d~~ 467 (667) .....+.++....+..++ |-.+.++..+.. ..+.+..+ |.... .....++.+.+..++--|-. T Consensus 110 ~a~~~a~~~~a~el~n~~~Rpv~file~~~~-----~~~e~w~~---y~~~l---~al~~gia~~~V~vvp~~~g----- 173 (370) T protein:vir:78 110 QAAINAAHALNQELIAKWGRWQFMLLAVPAI-----ADEQDWAT---YEAEL---ATLQDGIAASSVSLIPQLWP----- 173 (370) T ss_pred HHHHHHHHHHHHHHHHhcCCeEEEEEeecCC-----CCcCCHHH---HHHHH---HHhhhccccccceEEeeecc----- Confidence 233333444444444443 667777766532 12233333 22222 12234556666666533311 Q ss_pred cCceeEechHHHHHHHHHHhhhcCCceeeecceeccceeccccc-----cccCChhhhhhhhhcCceEEEEecCC-eEEE Q lcl|NC_012740. 468 NDVNRWVPLAADIAGLCARTDAVSQPWMSPAGYNRGQIMNVVKL-----AIEPRKAHRDRLYQAAINPVIGAGGE-GFIL 541 (667) Q Consensus 468 ~~~~~~~p~sg~vAg~~a~~d~~~g~~~span~~~~~i~g~~~~-----~~~~~~~e~~~Ln~~gIn~i~~~~~~-G~~~ 541 (667) -.-|.+||.++.. .--+..+|.-...+.+.|...+ ...++...++.|..+|..+.+.++|. |+.+ T Consensus 174 -------~~~G~~aGRL~na--avsVadsP~Rv~tG~l~gl~~~p~d~~~~~l~~a~l~aLd~agy~vp~~Y~gy~G~Y~ 244 (370) T protein:vir:78 174 -------TLAGAYAGRLCNR--AVSIADSPCRVKTGALVGLGNKPVGKDGIPLPLATLQTLEANRYSVPMWYPDYDGIYW 244 (370) T ss_pred -------ccHHHHHHHHhcC--eeeecccceeeeccccccccccccccCCcccCHHHHHHHHhCCCeEEEeeCCCCceEE Confidence 1137778865432 2226778887766666664322 23466788999999999999999984 9999 Q ss_pred EcceecCCCcccceeeehhhhhHHHHHHHHHHHHHHh-cCCC--CHHHHHHHHHHHHHHHHHHHhcCCeee--eEEEEcc Q lcl|NC_012740. 542 MGDKTATTVPSPFDRINVRRLFNMLKKNIGDSSKYKL-FENN--DNFTRASFRMEVSQYLSTIRSLGGIYD--FRVQCDT 616 (667) Q Consensus 542 wG~rT~~~~~~~~~~i~vrR~~~~i~~~l~~~~~~~v-~epn--~~~~~~~i~~~i~~~l~~l~~~gal~g--~~v~~d~ 616 (667) -++||++...+++++|..+|.+|.+.+.++..+-..+ +|-. .+...+..+.....=|++|.+.+.+.+ |.-+|.. T Consensus 245 ~d~~tl~~~gsDYq~ie~~RVvdKa~R~vR~~ai~~i~D~~lnst~gsia~~~~~~~~~L~ema~s~~i~~~~fpgeI~~ 324 (370) T protein:vir:78 245 ADGRTLDAEGGDYQVIENLRIAYKVARRMRLRAIARIGDRSFNSTPGSTAAAITYFGKDLREMAKSTTINGQPFPGDIAS 324 (370) T ss_pred eCceEeccCCCChhhhhhhhHHHHHHHHHHHHHHHHhCCcccCCCCcchhHHHHHHHhhHHHHHhhhhhcccccceeEec Confidence 9999999888999999999999999999995554443 3321 122223444455555666677887777 4444443 Q ss_pred cCC---CHHHhhCCeEEEEEEEEecCCceEEEEEEEEeecCeeHHHHHH Q lcl|NC_012740. 617 TNN---TPDVIDRNEFVASMFIKPAKSINYIMLNFTAVATGADFDEIIG 662 (667) Q Consensus 617 ~~n---t~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~~ 662 (667) ..+ ++.-....++.|.+.+.|.--...|+..|.-.-. +++==+ T Consensus 325 p~d~Di~i~w~s~~~v~I~~~v~P~~~pk~Itv~I~LDls---~e~~~~ 370 (370) T protein:vir:78 325 PQDGDIRIQWVAKNLVSVFVVVRTVDCPKGITVNIMLDLS---LNNGEG 370 (370) T ss_pred cCCCcceEEeeccceEEEEEEEEeccCCceEEEEEEEeec---cccCCC Confidence 221 2223477888999999998888888887753322 111111 No 58 >protein:vir:95263 Length: 450 # NCBI annotation: Phage conserved protein # Family: family:all:396 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944896;genbank:gi:38707836;genbank:GeneID:2744049 Probab=98.45 E-value=5.8e-07 Score=54.76 Aligned_cols=411 Identities=11% Similarity=0.028 Sum_probs=171.6 Q ss_pred cchhhhcccccccc-----ccccceeeeeeccccccceeEEEEeeccccccc----------------ceeeeeeeeccc Q lcl|NC_012740. 201 SRANITNQDFLTKL-----KKYDMPAVSAIYAGEIGNSLEVEILARSSFSGA----------------VAPELTMYPFGG 259 (667) Q Consensus 201 ~~~~~~~~~~~~~~-----~~~~~~~~~A~~~G~~gn~i~v~i~~~a~~~~~----------------~~~~~t~~~~~~ 259 (667) .-..+......... ..-+.+.+...... ...++. ......+.... ..+.......+. T Consensus 1 ~~s~iVnV~i~~~~~a~~~~~f~~~l~~~~~~~-~~~r~~-~yss~~~V~~~FG~~S~ey~aA~~yF~q~p~p~~l~igr 78 (450) T protein:vir:95 1 MWNPIVNVDITLNTAGTTREGFGLPLFLASTDN-FEERVR-GYTSLTEVAEDFDENTAAYKAAKQLWSQTPKVTQLYIGR 78 (450) T ss_pred CCCceEEEeecccccccccccceeEEEEcCCCC-Ccccee-eecCHHHHHHhcCCCcHHHHHHHHHHhCCCcccEEEEEe Confidence 00011000000000 00011111111100 001111 01000000000 000000000000 Q ss_pred ccccceeeeeccccccccceeeeeccceeeeeE--eeeccCCccccccccccchhhhcccccceEEEecccccCcccceE Q lcl|NC_012740. 260 TRAAARNLIPYAPQNDNQYAFIVRRDGVVVESY--VLSTLKGDKDVYGNSIYMDDFFARGSSQYIYATAQGWVDGFSGII 337 (667) Q Consensus 260 ~~~~~~~~~~~~~~~~~~~~~~v~~~g~v~e~~--~~s~~~~~~~~~~~~~~~~~~~~~~~s~~v~~~~~~~~~~~~~~~ 337 (667) -..................++++..+|...... .++...... .....+...+....+..-............... T Consensus 79 ~~~~~t~~~~~~~~~~~~g~lt~tv~G~~~~~~~i~~s~a~s~~---~va~~~~tai~~~~~~~~~~~~~s~g~~~~~t~ 155 (450) T protein:vir:95 79 RAMQYTVSIPDAVTESTDYSITVAAGGGISQPYQYTAQSSDTAE---NVLQQFKTQIEADPTIKDKVSVNVTGSNGSATM 155 (450) T ss_pred eccchhhhhhhhhccccceeEEEEecceeeeeeEEEEEecCChh---hHHHHhhhhhcccceeeeeeeeeeecccceeee Confidence 000000000000011111223333344332222 222211110 000001111100000000000000011111111 Q ss_pred EecCCcccc--cccc--c-cccccccccchhHHHHHHhhhcccccccEEecCcCCcchhhHHHHHHHHHHHhhcCcEEEE Q lcl|NC_012740. 338 SLAGGVSAN--EAST--G-DRGNDPFIGAMMQGWDLFAERESIHVNLLIAGACAGEGDAFSTVQKHAVSIGDERQDCLVM 412 (667) Q Consensus 338 ~~~~g~~~~--~~~~--~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~ai 412 (667) ....+.... .+.. . ..........+.+.+..+..... +.-.++.+.. ..+-..+|...++....++.. T Consensus 156 ~~~~~~~~~~~~l~~~~~~~~~~g~~aet~~~a~~a~~~~~~-~w~~~~~~~~------~~~~i~a~a~w~~a~~~~f~~ 228 (450) T protein:vir:95 156 IIAKAGDNDFVKVTTTAQTVYIASTTADTASTALAAIEAYST-DWYFIAAEDR------TQQFVLAMASEIQARKKIFFT 228 (450) T ss_pred eeeccccchhhccccccceeEecccccccHHHHHHHHHHhhC-CeEEEEecCC------CHHHHHHHHHHHhhcCcEEEE Confidence 111111000 0000 0 00011112234444444443221 1112222211 112234455555554444444 Q ss_pred EccCccccccccccCCHHHHHHHhhhhccccccccccCcceE-EEEehhhcccccccCceeEechHHHHHHHHHHhhhcC Q lcl|NC_012740. 413 VSPPRSTVVNIPVTTAIDNLIAWREGNSNYSDNNMNINTTYA-VIDGNYKYQYDKYNDVNRWVPLAADIAGLCARTDAVS 491 (667) Q Consensus 413 ~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~-~~~~p~~~v~d~~~~~~~~~p~sg~vAg~~a~~d~~~ 491 (667) ...-...........+ .+.....+. -+..|. .+|++ .+. .-.+.+.++|.....+..+ T Consensus 229 ~~~~~~~~~~~~~~~~-~~i~~~l~~----------~~~~~t~~~y~~-------~~~---~~~~~aa~~g~~~~~~~g~ 287 (450) T protein:vir:95 229 ANSDVTALQGTELASA-NDVPAQLAK----------NMYTRTVCLWHH-------AAA---EDYPEMAYIAYGAPYDAGS 287 (450) T ss_pred EcCCchhhhhhhhhcc-cchHHHHHh----------ccCCeeEEEeeC-------CCc---hhHHHHHHHHHhhhcccce Confidence 3221100000000000 000000000 111222 23322 111 1234566666655443332 Q ss_pred Cceeeecceeccceecc-c-cccccCChhhhhhhhhcCceEEEEecCCeEEEEcceecCCCcccceeeehhhhhHHHHHH Q lcl|NC_012740. 492 QPWMSPAGYNRGQIMNV-V-KLAIEPRKAHRDRLYQAAINPVIGAGGEGFILMGDKTATTVPSPFDRINVRRLFNMLKKN 569 (667) Q Consensus 492 g~~~span~~~~~i~g~-~-~~~~~~~~~e~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~~~~~~~~~i~vrR~~~~i~~~ 569 (667) =-| .+|.+.||..- . .....++..|.+.|..+++|++.++.+.+ .++.++|+++ .||-++|-.+||+.. T Consensus 288 ~T~---~fk~l~Gv~~~v~~~~~~~lt~~~~~al~~~~~n~y~~~~~~~-~~~~G~~~~G-----~~iD~~~~~~wl~~~ 358 (450) T protein:vir:95 288 IAW---GNAQLTGVAASLQPSNQRPLTSIQKSALDVRHCNFIDLDGGVP-VVRRGITSGG-----EWIDIIRGVDWLESD 358 (450) T ss_pred eee---ccccccceeeeccCccccccchHHHHHHHhCCcEEEEEecCce-eeeCCeeeCc-----chhHHHHHHHHHHHH Confidence 233 36665555421 1 12246889999999999999999987775 4778888775 368899999999999 Q ss_pred HHHHHHHHhc------CCCCHHHHHHHHHHHHHHHHHHHhcCCeeeeEEEEc-ccCCCHHHhhCCeEE-EEEEEEecCCc Q lcl|NC_012740. 570 IGDSSKYKLF------ENNDNFTRASFRMEVSQYLSTIRSLGGIYDFRVQCD-TTNNTPDVIDRNEFV-ASMFIKPAKSI 641 (667) Q Consensus 570 l~~~~~~~v~------epn~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~d-~~~nt~~~i~~G~~~-~~i~~~p~~p~ 641 (667) |++.+...+- =|-|+.-...|+..|+.-|++..++|.|.||+|.+. .+..++.|+.++++. +.+.++....+ T Consensus 359 iq~~l~~ll~~~~~~KiPy~~~G~~~i~a~i~~~l~~a~~~G~Ia~~~V~~~~~~~~~~~dr~~R~~~~i~~~~~laGAI 438 (450) T protein:vir:95 359 LKTSLRDLLINQKGGKITYDDTGITRIRQVIETSLQRAVNRNFLSSYTVNVPKASQVALADKKARILKDVTFAGILAGAI 438 (450) T ss_pred HHHHHHHHHHhcCCCCCccChhhHHHHHHHHHHHHHHHHhcCcccceeEecCChHhcCHHHHhccCCCCeeEEEEEccce Confidence 9999987652 266778888999999999999999999999999997 588899999988875 88999999999 Q ss_pred eEEEEEEEEeecCeeHH Q lcl|NC_012740. 642 NYIMLNFTAVATGADFD 658 (667) Q Consensus 642 e~i~~~~~~~~~~~~~~ 658 (667) +++.|++.-+= | T Consensus 439 h~~~i~~~v~~-----~ 450 (450) T protein:vir:95 439 LDVDLKGTVAY-----E 450 (450) T ss_pred EEEEEEEEEEe-----C Confidence 99999865443 3 No 59 >protein:vir:80052 Length: 331 # NCBI annotation: gp14 # Family: family:all:396 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468718;genbank:gi:157325298;genbank:GeneID:5601743 Probab=98.17 E-value=3.2e-06 Score=50.74 Aligned_cols=305 Identities=14% Similarity=0.070 Sum_probs=152.9 Q ss_pred eeeeEe-eeccCCccccc-cccccchhhhcccc-cceEEEecccccCcc-c--------ceEEecCCccccccccccccc Q lcl|NC_012740. 288 VVESYV-LSTLKGDKDVY-GNSIYMDDFFARGS-SQYIYATAQGWVDGF-S--------GIISLAGGVSANEASTGDRGN 355 (667) Q Consensus 288 v~e~~~-~s~~~~~~~~~-~~~~~~~~~~~~~~-s~~v~~~~~~~~~~~-~--------~~~~~~~g~~~~~~~~~~~~~ 355 (667) ++++.. +.......... .........+..+. .++..-......... . ....+..+..+.... .. T Consensus 1 ~~~~iv~V~v~~~~~~~~~~~~~~~~~~~~~~t~~~~~~y~s~~~v~~d~~~~~~~Ykaa~~~f~Q~~~~~~i~----v~ 76 (331) T protein:vir:80 1 MVETITDVRVHISVLYPSPRIGLGRPAIFVKGTAMGYKEYTTLEELKDTFADNTEVYAKAKAVFLQKDRPDTVA----VI 76 (331) T ss_pred CccceecceeeecccccccccccCcceeEEeccccceEEEechhhhccCCCCCcHHHHHHHHHHhccCccceEE----Ee Confidence 222221 21111100000 00000111111111 111100000000000 0 000011111000000 00 Q ss_pred cccccchhHHHHHHhhhcccccccEEecCcCCcchhhHHHHHHHHHHHhhcCcEEEEEccCccccccccccCCHHHHHHH Q lcl|NC_012740. 356 DPFIGAMMQGWDLFAERESIHVNLLIAGACAGEGDAFSTVQKHAVSIGDERQDCLVMVSPPRSTVVNIPVTTAIDNLIAW 435 (667) Q Consensus 356 ~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~ 435 (667) ..........+...... .--.++.... ..+-..++...++....+|.+++.. ....+... T Consensus 77 ~~~~~~~~~a~~a~~~~---~w~~~~~~~~------~~~~~~a~a~~~~a~~~~f~~~~~~-----------~~~~~~~~ 136 (331) T protein:vir:80 77 TYEDTKLLEAAEAYFLK---SWHFALLAEF------KAADALALSNLIEEQKFKFAVFQVT-----------AVADITPL 136 (331) T ss_pred ccchHHHHHHHHHhccC---ceeEEEeecC------CHHHHHHHHHHHhhCCcEEEEEecC-----------chHHHHHh Confidence 00000011111110000 1001222111 1122345556666655566554321 11111111 Q ss_pred hhhhccccccccccCcceEEEEehhhcccccccCceeEechHHHHHHHHHHhhhcCCceeeecceeccceeccccccccC Q lcl|NC_012740. 436 REGNSNYSDNNMNINTTYAVIDGNYKYQYDKYNDVNRWVPLAADIAGLCARTDAVSQPWMSPAGYNRGQIMNVVKLAIEP 515 (667) Q Consensus 436 ~~~~~~~~~~~~~~~s~~~~~~~p~~~v~d~~~~~~~~~p~sg~vAg~~a~~d~~~g~~~span~~~~~i~g~~~~~~~~ 515 (667) .+ ++....++++. .+ --+.+.++|.++..+..+--| +++. .+.|+.. -.+ T Consensus 137 ~~------------~~~t~~~~~~~-------~~----~~~~aa~~g~~~~~~~g~~t~---~fk~--~l~GV~~--~~l 186 (331) T protein:vir:80 137 AK------------NTRTIAIVHSK-------TG----EKLDAALIGNVASLPVGSATW---KGRH--GLAGITS--EEL 186 (331) T ss_pred hc------------cccEEEEEcCC-------cc----chhHHHHHHHHHhcCccceee---eeec--ccCCCCC--CCC Confidence 10 12233444331 11 113455667766666433223 3442 2334322 357 Q ss_pred ChhhhhhhhhcCceEEEEecCCeEEEEcceecCCCcccceeeehhhhhHHHHHHHHHHHHHHhcC----CCCHHHHHHHH Q lcl|NC_012740. 516 RKAHRDRLYQAAINPVIGAGGEGFILMGDKTATTVPSPFDRINVRRLFNMLKKNIGDSSKYKLFE----NNDNFTRASFR 591 (667) Q Consensus 516 ~~~e~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~~~~~~~~~i~vrR~~~~i~~~l~~~~~~~v~e----pn~~~~~~~i~ 591 (667) +..|++.|..+++|++.++.+.. .++.+.|+++ .||.+.+-.+||+..|++.+...+-. |-|+.=...|+ T Consensus 187 t~t~~~al~~~~~N~y~~~~~~~-~~~~G~~~~G-----~~iD~~~~~dWl~~~lq~~l~~ll~~~~kiPy~~~G~~~l~ 260 (331) T protein:vir:80 187 KVSEIDAIQKAGGMCYIEKAGIA-QTSEGKTVSG-----EFIDSIHGDDWIKATIETRLQKLLTETDKLTFDARGIALLQ 260 (331) T ss_pred CHHHHHHHHhcCceEEEEecCee-EEecceEeCc-----hhHHHHHHHHHHHHHHHHHHHHHHHhCCCCccChhhHHHHH Confidence 89999999999999999987664 4567777766 27899999999999999998776543 34677788999 Q ss_pred HHHHHHHHHHHhcCCee--------eeEEEEc-ccCCCHHHhhCCeEE-EEEEEEecCCceEEEEEEEEee Q lcl|NC_012740. 592 MEVSQYLSTIRSLGGIY--------DFRVQCD-TTNNTPDVIDRNEFV-ASMFIKPAKSINYIMLNFTAVA 652 (667) Q Consensus 592 ~~i~~~l~~l~~~gal~--------g~~v~~d-~~~nt~~~i~~G~~~-~~i~~~p~~p~e~i~~~~~~~~ 652 (667) ..++.-|++-+++|.|. +|+|.+. .++.+++|+.+++.. +.+.+++...+++|+|++.-.= T Consensus 261 a~i~~~~~~av~~G~I~~g~~~~~~~~~v~~~~~~~~s~~dr~~R~~~~~~~~~~~~gaI~~v~i~~~v~~ 331 (331) T protein:vir:80 261 SELTTVLNEGFANGIIDSNDETGEPNFSITALQRSDLNDDDIAKRNYKGLSFRYKRSGAIHSVDVYGEVEV 331 (331) T ss_pred HHHHHHHHHHHhCCceecCccCCCcceEEEeCchhcCCHHHHhccCCCCeEEEEEEcceEEEEEEEEEEeC Confidence 99999999999999996 6888886 577899999998886 8889999999999999765443 No 60 >protein:vir:5260 Length: 502 # NCBI annotation: hypothetical protein # Family: family:all:396 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852765;genbank:gi:31544040;uniprot:Q7Y5T5;genbank:GeneID:2753559 Probab=98.02 E-value=6.9e-06 Score=48.89 Aligned_cols=464 Identities=13% Similarity=0.034 Sum_probs=211.8 Q ss_pred Ccee-cCceEEEEecCCCccccc-CCCceEEEeec-cCCCCC---ccEEecCHHHHHHHcCCcCccchhHHHHHHHHHcC Q lcl|NC_012740. 1 MTLL-SPGFETKETTLSTTIVQS-ATGRAALVGKF-QWGPAF---QIVQVTNEVELVNKFGQPDNNTADYFMSGANFLQY 74 (667) Q Consensus 1 ~~~~-~PGVyvee~~~~~~~~~~-~ts~~afvG~~-~~Gp~~---~p~~i~s~~e~~~~FG~~~~~~~~~~~v~~~f~ng 74 (667) |.+= +.=|.|.. ...+..++. .=+...|+|.. ..-|.+ .-...+|..|-...||. .+.++.+.+.+|-+- T Consensus 1 msip~s~ivnV~i-~~~~~a~~~~~f~~~l~l~~~~~~~~~~~~~r~~~~~s~~~V~~~FG~---~s~ey~aA~~yF~q~ 76 (502) T protein:vir:52 1 MALSISHIVNVQL-NTVPKSAARKSFGIVALFTPEAGQAFADEKTRYVYVENQRDVEQLFGT---NSETAKAAQPFFAQS 76 (502) T ss_pred CCCCccceeEEee-ccccccccccccCceEEEeeccCccccCCccceEEecCHHHHHHhcCC---ChHHHHHHHHHhcCC Confidence 6653 22222321 122222222 24567788874 333333 33445789999999994 667777888888421 Q ss_pred --CCeEEEEEcCCcccccccccccccccceeeeccccccccceeeEeeeccceeccccceeecccccceeeeeccccccc Q lcl|NC_012740. 75 --GNDLRVVRVLNKEKAKNATALAGNVEFEITNEGSNYEVGDTIKIKHNRQDIETAGKVTKVDGDGKVKGVFIPTGKIIA 152 (667) Q Consensus 75 --G~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~ 152 (667) =+++||-|-........ .... +..+. .+. ..........+ T Consensus 77 p~P~~l~igR~~~~~~~~~--~~~~------~~~~~------~~~-------~~~~~~~~~~~----------------- 118 (502) T protein:vir:52 77 PRAKQLIVARWQKSASTIE--ATKN------TLSGA------TLS-------DDLERFKSVVN----------------- 118 (502) T ss_pred CccceEEEEecccccccee--echh------hhhhh------hhH-------HhHHHhhhhcC----------------- Confidence 13455555322110000 0000 00000 000 00000000000 Q ss_pred ccccccccccccccceEEEEEeecccccceeeeceeeeceeeeeeccccchhhhccccccccccccceeeeeeccccccc Q lcl|NC_012740. 153 HAKAIGVYPELDGGWTAEFTSSSGNGSAALSVTKIVTDSGLLLTDLETSRANITNQDFLTKLKKYDMPAVSAIYAGEIGN 232 (667) Q Consensus 153 ~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~A~~~G~~gn 232 (667) +. .+...++... +...+ + ....+........+ +...+..+. T Consensus 119 -----G~-----------l~i~i~g~~~--t~~~i-~--lS~~ts~~~vA~~i------------------~~~l~~~~~ 159 (502) T protein:vir:52 119 -----GR-----------FSLTIGGDVK--KVDGL-S--FARLADFNAVATKI------------------QEKLTTLSV 159 (502) T ss_pred -----ce-----------eEEEecceee--eeecc-c--cccccchhHHHHHH------------------Hhhhccccc Confidence 00 0000000000 00000 0 00000000000000 000000000 Q ss_pred eeEEEEeecccccccceeeeeeeecccccccceeeeeccccccccceeeeeccceeee-eEeeeccCCccccccccccch Q lcl|NC_012740. 233 SLEVEILARSSFSGAVAPELTMYPFGGTRAAARNLIPYAPQNDNQYAFIVRRDGVVVE-SYVLSTLKGDKDVYGNSIYMD 311 (667) Q Consensus 233 ~i~v~i~~~a~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~v~e-~~~~s~~~~~~~~~~~~~~~~ 311 (667) ..++.... ....|.++....|.... .+.....+. ....+.. T Consensus 160 ~~tv~~d~---------------------------------~~~~F~i~s~ttg~~~~~~~~~a~~~~-----~~gt~~a 201 (502) T protein:vir:52 160 AVSIAYDE---------------------------------TGNRFIVSANVAGEDKKTEIDYAIDEG-----GEGEYIG 201 (502) T ss_pred ceEEEEec---------------------------------CCceEEEEeccCCCcceeEEEEeecCC-----cchhHHH Confidence 11111100 00111111111110000 000000000 0000000 Q ss_pred hhhc-ccccceEEEecccccCcccceEEecCCccccccccccccccccccchhHHHHHHhhhcccccccEEecCcCCcch Q lcl|NC_012740. 312 DFFA-RGSSQYIYATAQGWVDGFSGIISLAGGVSANEASTGDRGNDPFIGAMMQGWDLFAERESIHVNLLIAGACAGEGD 390 (667) Q Consensus 312 ~~~~-~~~s~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~ 390 (667) ..+. ...+..+.+. ....| .....+.+.+..+......-.-+++.-. T Consensus 202 ~~l~l~~~~~av~v~------------~~~~g--------------~~aet~~~al~a~~~~~~~w~~~~~a~~------ 249 (502) T protein:vir:52 202 ALLKLENGQASRKVG------------KNSVS--------------LKKETLGEALFNVAEVNNTWYGFTVAAQ------ 249 (502) T ss_pred HHhccccccceeeee------------eeccc--------------ccccCHHHHHHHHHhccCceEEEEEeec------ Confidence 0000 0000000000 00001 1122345555555543221222233211 Q ss_pred hhHHHHHHHHHHHhhcCcEEEEEccCccccccccccCCHHHHHHHhhhhccccccccccCcceEEEEehhhcccccccCc Q lcl|NC_012740. 391 AFSTVQKHAVSIGDERQDCLVMVSPPRSTVVNIPVTTAIDNLIAWREGNSNYSDNNMNINTTYAVIDGNYKYQYDKYNDV 470 (667) Q Consensus 391 ~~~~v~~~~~~~~~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~v~d~~~~~ 470 (667) ...+...++..++|....+|....... ...+.. . .+.....+. .+..+.++.|- +.+ T Consensus 250 ~~~~~~la~a~~iea~~~~f~~~~~d~-~~~~~~---~-~~i~~~l~a----------~~~~~t~~~y~------~~~-- 306 (502) T protein:vir:52 250 LTDSEVEAAAKYAQANTKLFGANVIRA-EQIEWS---A-DNIYKKLYD----------AGLDHTLAMFD------KND-- 306 (502) T ss_pred CChhHHHHHHHHHhhcCcEEEEEecCc-ceeccc---c-chHHHHHHh----------ccCceeEEEec------CCc-- Confidence 112334566677776666665533211 111111 1 111111111 11223333322 111 Q ss_pred eeEechHHHHHHHHHHhhhcCC-ceeeecceeccceeccccccccCChhhhhhhhhcCceEEEEecCCeEEEEcceecCC Q lcl|NC_012740. 471 NRWVPLAADIAGLCARTDAVSQ-PWMSPAGYNRGQIMNVVKLAIEPRKAHRDRLYQAAINPVIGAGGEGFILMGDKTATT 549 (667) Q Consensus 471 ~~~~p~sg~vAg~~a~~d~~~g-~~~span~~~~~i~g~~~~~~~~~~~e~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~~ 549 (667) -.+.+.++|.++.+|-.+- -...-.+|.+.|+. .-.++..|++.|..+++|++.++.+.++ +..++++++ T Consensus 307 ---~~~~aa~~g~~as~~f~~~~g~iT~~fk~l~GV~-----~~~lt~t~~~al~~~~~N~y~~~~~~~~-~~~G~~~~G 377 (502) T protein:vir:52 307 ---MYPVSSALARLLSTNFAANNSTLTLKFKQQPTIT-----ADEITATEFAKAKRLGINVYTYFDDVAM-IAEGTVIGG 377 (502) T ss_pred ---chhHHHHHHHHHhcCCCcCcceeeecccccCCcc-----cCcCCHHHHHHHHhcCceEEEEecCeeE-EecCeeeCC Confidence 1256667788887774321 22233466544432 2357899999999999999999877654 567788776 Q ss_pred CcccceeeehhhhhHHHHHHHHHHHHHHhcC-----CCCHHHHHHHHHHHHHHHHHHHhcCCee---------------- Q lcl|NC_012740. 550 VPSPFDRINVRRLFNMLKKNIGDSSKYKLFE-----NNDNFTRASFRMEVSQYLSTIRSLGGIY---------------- 608 (667) Q Consensus 550 ~~~~~~~i~vrR~~~~i~~~l~~~~~~~v~e-----pn~~~~~~~i~~~i~~~l~~l~~~gal~---------------- 608 (667) + ||-+.+-.+||+..|++.+...++. |-|+.=...|+..|+.-|++-+++|.|. T Consensus 378 ~-----~iD~~~~~~Wl~~~lq~~l~~~L~~s~~kIPy~~~G~~~l~a~i~~~l~~a~~~G~I~~G~~~~~~~g~~~~~d 452 (502) T protein:vir:52 378 K-----FADEIVILDWFVDAVQKEVFARLYKSPTKIPLTDKGQAILIAAVEKVCLEGINNGAFAPGKWTGAGFGNLSTGD 452 (502) T ss_pred c-----hhhHHHHHHHHHHHHHHHHHHHHHhcCCCcccChhHHHHHHHHHHHHHHHHHhcCccccccccCcccceeeecc Confidence 2 6778889999999999998776652 4477778999999999999999999984 Q ss_pred ----eeEEEEc-ccCCCHHHhhCCeE-EEEEEEEecCCceEEEEEEEEee Q lcl|NC_012740. 609 ----DFRVQCD-TTNNTPDVIDRNEF-VASMFIKPAKSINYIMLNFTAVA 652 (667) Q Consensus 609 ----g~~v~~d-~~~nt~~~i~~G~~-~~~i~~~p~~p~e~i~~~~~~~~ 652 (667) ||+|.+. .++.++.|+.+++. -|.+.+++...+++|+|.+.-.+ T Consensus 453 ~~~~gy~v~~~~~~~~s~~dr~~R~~~~~~~~~~~aGaIh~v~i~~nv~~ 502 (502) T protein:vir:52 453 YLDKGFYVWAAPMDTLSDSDRQARRATPIQTAVKLAGAIHSSDVIVNYNR 502 (502) T ss_pred cccCceEEEeCchhhCCHHHHHcccCCCeEEEEEECceEEEEEEEEEEeC Confidence 5889887 57889999999988 89999999999999999877666 No 61 >protein:vir:3165 Length: 426 # NCBI annotation: capsid protein CP67 # Family: family:all:28419 # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665936;genbank:gi:22091122;genbank:GeneID:951267 Probab=94.75 E-value=0.0036 Score=33.98 Aligned_cols=362 Identities=11% Similarity=-0.024 Sum_probs=140.2 Q ss_pred ccccceeeeeeeecccccccceeeeeccccccccceeeeecccee--------eeeEeeeccCCccccccccccchhhhc Q lcl|NC_012740. 244 FSGAVAPELTMYPFGGTRAAARNLIPYAPQNDNQYAFIVRRDGVV--------VESYVLSTLKGDKDVYGNSIYMDDFFA 315 (667) Q Consensus 244 ~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~v--------~e~~~~s~~~~~~~~~~~~~~~~~~~~ 315 (667) +.. ....+.+...+.+..+..+.. -+.+.....+ +.+|.-...-..-....+..|-..... T Consensus 1 m~~---~iVnV~Is~~t~A~~~~~Fg~--------~liigs~~~~~p~~~f~~~~~Yss~~~V~~Dfg~~s~~Y~AA~~~ 69 (426) T protein:vir:31 1 MPK---QIVEIELTAEIADRPQETFTD--------AAIVGTAEEEPPDAEFGEVNQYSTSTSVGDDYGEDSDVYTASEAI 69 (426) T ss_pred CCc---ceEEEEeecccccccccccce--------eeeeeeccccccccccchhhhhhhHHHHHhcCCCChHHHHHHHHH Confidence 000 000011111111111111110 0111111000 001100000000000111111111000 Q ss_pred ccccc-eEEEecccccC----cccceEEecCCccccccccccccccccccchhHHHHHHhhhcccccccEEecC------ Q lcl|NC_012740. 316 RGSSQ-YIYATAQGWVD----GFSGIISLAGGVSANEASTGDRGNDPFIGAMMQGWDLFAERESIHVNLLIAGA------ 384 (667) Q Consensus 316 ~~~s~-~v~~~~~~~~~----~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~------ 384 (667) -.++- ..+........ ......++ ++.+... ..+..........++..- .+..++...+... T Consensus 70 f~Q~~~~~r~~v~~at~~~~~~~t~~~tv----~g~~~s~-~a~~~~~a~~i~~~~~~~--~~~~~~~~~~~~~t~~g~~ 142 (426) T protein:vir:31 70 EEMGAEQWRVMVLEATEVTEEELSDGDTI----DKVPILG-NHEVESPDGDIEFTTDDD--PDVEDFDAEIVINSATGDV 142 (426) T ss_pred HhCCceeEEeeccccceeeeccCCcceee----cceeeee-cccCcchHHHHHHhhccc--cccccceeeeEecccccee Confidence 00110 00100000000 00000000 0000000 000000111111111100 0000110000000 Q ss_pred ----------cC----C------------------cchhhHHHHHHHHHHHhhcCcEEEEEccCccccccccccCCHHHH Q lcl|NC_012740. 385 ----------CA----G------------------EGDAFSTVQKHAVSIGDERQDCLVMVSPPRSTVVNIPVTTAIDNL 432 (667) Q Consensus 385 ----------~~----~------------------~~~~~~~v~~~~~~~~~~~~~~~ai~d~p~~~~~~~~~~~~~~~~ 432 (667) .. . .+.....+..++...++..+ -+.+... .....-...+.. T Consensus 143 t~~~~~~~~~~s~~dw~~~~~~~s~~~~~~ia~~~~~~~~~~~~~~~~~wa~~~~-i~~va~~-----~e~~~~~~~~~~ 216 (426) T protein:vir:31 143 ATSEDSIELTYFHADWSQLDEFPSDVNNFAVADRRFDLKGVGVLDETHSWASDED-MGMIANG-----VNVDDYDSVDEA 216 (426) T ss_pred eccccceeeeeccCcchhhhcccccchhhhhhccccchhhhhhhHhhhhhhhhcc-eeeeeec-----cchhhhcchhhh Confidence 00 0 00000111111111111110 0000000 000000011111 Q ss_pred HHHhhhhccccccccccCcceEEEEehhhcccccccCceeEechHHHHHHHHHHhhhcCCceeeecceeccceecc---- Q lcl|NC_012740. 433 IAWREGNSNYSDNNMNINTTYAVIDGNYKYQYDKYNDVNRWVPLAADIAGLCARTDAVSQPWMSPAGYNRGQIMNV---- 508 (667) Q Consensus 433 ~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~v~d~~~~~~~~~p~sg~vAg~~a~~d~~~g~~~span~~~~~i~g~---- 508 (667) ..++-. +.-|.|-...+...... .--..+.+++.++..+ ||..|.-+...+-..+ T Consensus 217 ~a~~~~---------------~~~y~p~~~~~~~~~~~--~~~~~~~~~~~~aa~~----~~~~~~~~~~~~~~~~~~~~ 275 (426) T protein:vir:31 217 MDVAHE---------------VAGYVPSGDLMMIVDAS--DDDLAAYQLGKFAVSE----PWYNPLWNELPAGETVSKNV 275 (426) T ss_pred hhhhhc---------------ccccccchhheeehhcc--ccchhhHHhhhhhhhc----cccchhhhhccccccceeec Confidence 222111 11122221111100000 0012567778877665 5766643222111111 Q ss_pred --ccccccCChhhhhhhhhcCceEEEEecCCeEEEEcceecCCCcccceeeehhhhhHHHHHHHHHHHHHHhcC----CC Q lcl|NC_012740. 509 --VKLAIEPRKAHRDRLYQAAINPVIGAGGEGFILMGDKTATTVPSPFDRINVRRLFNMLKKNIGDSSKYKLFE----NN 582 (667) Q Consensus 509 --~~~~~~~~~~e~~~Ln~~gIn~i~~~~~~G~~~wG~rT~~~~~~~~~~i~vrR~~~~i~~~l~~~~~~~v~e----pn 582 (667) .+..-.+..+++-.|+ +..|.++.+.+ +..+|-+-|..+....-.||-++|..+||++.++..++..+-. |- T Consensus 276 ~~~gv~~t~~~~~~A~~~-~~~n~~~~~~~-~~~i~~~~~~~G~~~~G~~iD~~~g~dwl~~~iq~~l~~ll~~~~KIpy 353 (426) T protein:vir:31 276 GDPEEQGTFEGGDEAEGE-GPVNVLIDVSD-ANRVSNAVTTAGADSDTSFFDIRRTKVYTAEMLELDLESLQVSDDDVPF 353 (426) T ss_pred cccccccccchhhhhhhc-CCceEEEEecC-ceeeecceeecccccchhhhhhHHHHHHHHHHHHHHHHHHhhcCCCCcc Confidence 1111122333445565 67799988864 5677766676666667789999999999999999999876632 66 Q ss_pred CHHHHHHHHHHHHHHHHHHHhcCC--eeeeEEEEcccCCCHHHhhCCeEE-EEEEEEecCCceEEEEEEEEee Q lcl|NC_012740. 583 DNFTRASFRMEVSQYLSTIRSLGG--IYDFRVQCDTTNNTPDVIDRNEFV-ASMFIKPAKSINYIMLNFTAVA 652 (667) Q Consensus 583 ~~~~~~~i~~~i~~~l~~l~~~ga--l~g~~v~~d~~~nt~~~i~~G~~~-~~i~~~p~~p~e~i~~~~~~~~ 652 (667) ++.=...|+..|+.-|++..+.|. +.+|.|...+...++.|..+.++. +++..+..-.+.++.|+..-+- T Consensus 354 t~~Gi~~I~~~i~~~L~~~v~~~g~~~~~y~v~~P~~~~~~~dra~R~~~~i~~~~~laGAIh~v~I~g~v~v 426 (426) T protein:vir:31 354 TEDGQAMIEDAIKGTMSGLTGSVGQPLAEYEVDVPEWDDDDVDRVNRNWGGIDLDARLAQRAHTFSLGLNVSV 426 (426) T ss_pred chhHHHHHHHHHHHHHHHHhcCCCccccceeecCCCccccchhhhhhccCCceEEEEEeCcEEEEEEEEEEeC Confidence 788888999999999999998644 557988877655566788877776 8899999999999999866443 No 62 >protein:vir:101576 Length: 501 # NCBI annotation: gp04 # Family: family:all:396 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958108;genbank:gi:41057654;genbank:GeneID:2716834 Probab=86.24 E-value=0.047 Score=27.87 Aligned_cols=451 Identities=10% Similarity=0.005 Sum_probs=193.1 Q ss_pred Cce--ecCceEEEEecCCCcccccC-CCceEEEeeccCCCCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHH---cC Q lcl|NC_012740. 1 MTL--LSPGFETKETTLSTTIVQSA-TGRAALVGKFQWGPAFQIVQVTNEVELVNKFGQPDNNTADYFMSGANFL---QY 74 (667) Q Consensus 1 ~~~--~~PGVyvee~~~~~~~~~~~-ts~~afvG~~~~Gp~~~p~~i~s~~e~~~~FG~~~~~~~~~~~v~~~f~---ng 74 (667) |-+ +-=--+|+..+.-....... .-.+-|++....=|+++..+.+|..|-...||. .+.++.+.+.+|- |- T Consensus 1 m~~~~ip~s~iV~V~~~v~~~~~~~~~~~~l~l~~~~~~~~~~~~~~~s~~~V~~~FG~---~S~ey~aA~~yFsg~~~q 77 (501) T protein:vir:10 1 MPTTTIPIDQIVQMLPGVIGAGGAPGRLTGLVLTQDTSVQPGQLADFFQETDVENWFGA---LSNEAKIADAYFPGIVNG 77 (501) T ss_pred CCCCCcccceEEEEeeecccCCCccccceeEEEeccCCCCccceEEecCHHHHHHhcCC---ChHHHHHHHHHhhhhcCC Confidence 765 33345555444222211122 223446666666788999999999999999996 6667777777775 22 Q ss_pred ---CCeEEEEEcCCcccccccccccccccceeeeccccccccceeeEeeeccceeccccceeecccccceeeeecccccc Q lcl|NC_012740. 75 ---GNDLRVVRVLNKEKAKNATALAGNVEFEITNEGSNYEVGDTIKIKHNRQDIETAGKVTKVDGDGKVKGVFIPTGKII 151 (667) Q Consensus 75 ---G~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~ 151 (667) =+++||-|-..... .+...+..+...........+ | .+.+..... .....++.+. ... +. T Consensus 78 ~p~P~~l~igR~~~~~~--~~~l~g~~l~~~~la~~~~~s-g-~l~vti~g~-----~~~~~i~ls~--ats------~~ 140 (501) T protein:vir:10 78 GQLPYDLKFARYVAADA--PASVYGIPLTGVTLAQLQGYS-G-TLTVTTAAQ-----HVSANISLAA--ATS------FA 140 (501) T ss_pred CccccEEEEEeecCCCc--cceEeccchhhhhhhhcceee-e-EEEEeeccc-----eeeccccccc--ccC------HH Confidence 36899999653211 111111001000000000000 0 000000000 0000000000 000 00 Q ss_pred cccccccccccccccceEEEEEeecccccceeeece-eeeceeeeeeccccchhhhccccccccccccceeeeeeccccc Q lcl|NC_012740. 152 AHAKAIGVYPELDGGWTAEFTSSSGNGSAALSVTKI-VTDSGLLLTDLETSRANITNQDFLTKLKKYDMPAVSAIYAGEI 230 (667) Q Consensus 152 ~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~-v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~A~~~G~~ 230 (667) +.+..+.... . ... .+..+. ....+....... . T Consensus 141 ~vAs~i~~al------~--------~~~--~tv~~d~~~~~f~its~tt-----------------G------------- 174 (501) T protein:vir:10 141 NAATLIEAAF------T--------SPD--FVVAYDALRNRFTVVTNAT-----------------G------------- 174 (501) T ss_pred HHHHHHhhhc------c--------CCc--eEEEEcccCceEEEEeecc-----------------C------------- Confidence 0011110000 0 000 000000 000000000000 0 Q ss_pred cceeEEEEeecccccccceeeeeeeecccccccceeeeeccccccccceeeeeccceeeeeEeeeccCCccccccccccc Q lcl|NC_012740. 231 GNSLEVEILARSSFSGAVAPELTMYPFGGTRAAARNLIPYAPQNDNQYAFIVRRDGVVVESYVLSTLKGDKDVYGNSIYM 310 (667) Q Consensus 231 gn~i~v~i~~~a~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~v~e~~~~s~~~~~~~~~~~~~~~ 310 (667) ....+.+..... ...+...++...+ T Consensus 175 -~~~~i~~~~~~~-----------------------------~la~~l~Lt~~~~------------------------- 199 (501) T protein:vir:10 175 -TAAAISAVTGTN-----------------------------NLADELGLSAAAG------------------------- 199 (501) T ss_pred -CceeEEEeeCch-----------------------------hhhhhcCcccccc------------------------- Confidence 000011100000 0000000000000 Q ss_pred hhhhcccccceEEEecccccCcccceEEecCCccccccccccccccccccchhHHHHHHhhhcccccccEEecCcCCcch Q lcl|NC_012740. 311 DDFFARGSSQYIYATAQGWVDGFSGIISLAGGVSANEASTGDRGNDPFIGAMMQGWDLFAERESIHVNLLIAGACAGEGD 390 (667) Q Consensus 311 ~~~~~~~~s~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~ 390 (667) .++.+ . |.. ...+.+.+..+......=..+..+.. T Consensus 200 ---------a~v~~--~--------------g~~--------------aet~~~a~~a~~~~~~~Wy~f~~a~~------ 234 (501) T protein:vir:10 200 ---------ATLQA--A--------------GVA--------------ADTPASAMNRAVGLSRNWATFTTAWT------ 234 (501) T ss_pred ---------ceEEe--c--------------Ccc--------------cccHHHHHHHHHhccCceEEEEEecC------ Confidence 00000 0 000 00011111111111100001111100 Q ss_pred hhHHHHHHHHHHHhhcCcEEEEE--ccCccccccccccCCHHHHHHHhhhhccccccccccCcceEEEEehhhccccccc Q lcl|NC_012740. 391 AFSTVQKHAVSIGDERQDCLVMV--SPPRSTVVNIPVTTAIDNLIAWREGNSNYSDNNMNINTTYAVIDGNYKYQYDKYN 468 (667) Q Consensus 391 ~~~~v~~~~~~~~~~~~~~~ai~--d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~v~d~~~ 468 (667) ...+-..++...+|....++.+. |..... .... ........... -+..+....|+. T Consensus 235 ~~~~~~la~A~wiea~~~~f~~~~~~~~~~~-~~~~---~~~~i~~~l~~----------~~y~~t~~~y~~-------- 292 (501) T protein:vir:10 235 AVIADRLAFAAWNSGQAYKYMYVAPDLEAAS-IVTN---NAASFGAQVFA----------APYQGTLPLYGD-------- 292 (501) T ss_pred CChHHHHHHHHHHHhcCceEEEEEecCchhh-hhhh---hhhhHHHHHHh----------cCCCceEEECCC-------- Confidence 01112234455555443333222 111000 0000 00011111111 122344444431 Q ss_pred CceeEechHHHHHHHHHHhhhcCCc-eeeecceec-cceeccccccccCChhhhhhhhhcCceEEEEecCC--eEEEEcc Q lcl|NC_012740. 469 DVNRWVPLAADIAGLCARTDAVSQP-WMSPAGYNR-GQIMNVVKLAIEPRKAHRDRLYQAAINPVIGAGGE--GFILMGD 544 (667) Q Consensus 469 ~~~~~~p~sg~vAg~~a~~d~~~g~-~~span~~~-~~i~g~~~~~~~~~~~e~~~Ln~~gIn~i~~~~~~--G~~~wG~ 544 (667) ..+...+.|.++.+|-++-. -.+-.+|.+ .|+ ..-.++..|.+.|..+|+|+...+.+. -+.+|-. T Consensus 293 -----~~~~aa~~g~~as~nf~~~~g~~T~~fkq~~~Gi-----~a~~lt~t~a~al~~~~~N~y~~~~~~~~~~~~~~~ 362 (501) T protein:vir:10 293 -----QATAGAVMGYAASINFQLRNGRTVLAFRQFNAGV-----PATAHDLPTANALRSNNYTYIGAYANAANNYTIAYD 362 (501) T ss_pred -----CcHHHHHHHHHHhhCcccCccceeeeccccCCCc-----CcccCCHHHHHHHHhcCCeEEEEeccccceeeEEec Confidence 12456677777777743321 111223332 122 123478899999999999999988654 4778855 Q ss_pred eecCCCcccceeeehhhhhHHHHHHHHHHHHHHhcC----CCCHHHHHHHHHHHHHHHHHHHhcCCee------------ Q lcl|NC_012740. 545 KTATTVPSPFDRINVRRLFNMLKKNIGDSSKYKLFE----NNDNFTRASFRMEVSQYLSTIRSLGGIY------------ 608 (667) Q Consensus 545 rT~~~~~~~~~~i~vrR~~~~i~~~l~~~~~~~v~e----pn~~~~~~~i~~~i~~~l~~l~~~gal~------------ 608 (667) -++++ +|.+|.+-+-.+||+..++..+...+-. |-++.=...|+..|+.-|++-+++|.|. T Consensus 363 G~~sG---~~~wiD~~~~~~Wl~~~iq~~l~~ll~~~~kIPyt~~G~~~l~a~v~~~l~~av~nG~I~~Gv~l~~~q~~~ 439 (501) T protein:vir:10 363 GKLSG---KFLWVDTYLDQIYLNAELQRAEFEAMLAYNSLPYNEDGYTALYRAGVDVIDAAVTSGIIRAGVTLTNSQLQQ 439 (501) T ss_pred Ceeec---cceeehhhhhHHHHHHHHHHHHHHHHHhcCCcccCHHHHHHHHHHHHHHHHHHHhCceeecCCCCCccccee Confidence 55665 3677888887888888888887654433 5588888899999999999999999883 Q ss_pred -----------------eeEEEEcccCCC-HHHhhCCeEEEEEEEEecCCceEEEEEEEEee Q lcl|NC_012740. 609 -----------------DFRVQCDTTNNT-PDVIDRNEFVASMFIKPAKSINYIMLNFTAVA 652 (667) Q Consensus 609 -----------------g~~v~~d~~~nt-~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~ 652 (667) ||++.++...++ ++--.+.-..+.+.++---.+++|++-....- T Consensus 440 i~~~~g~~~~~~~v~~~Gyy~~~~~~~~~~~~R~~R~~p~~~~~y~~~gaIh~v~i~s~~v~ 501 (501) T protein:vir:10 440 IDAAAGVAGAGQLVQTRGWYFLIGDPANPGQARQNRTTPACTLWYSDGGSIQQLTIGSNAVI 501 (501) T ss_pred eccccCccccccceeccceeEeeccccCChhhhhhccccceEEEEEeCCceeEEEeeeeecC Confidence 377777754334 43334455677777788888888877433222 No 63 >protein:vir:106730 Length: 501 # NCBI annotation: gp04 # Family: family:all:396 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944312;genbank:gi:38638611;genbank:GeneID:2657359 Probab=73.49 E-value=0.17 Score=24.81 Aligned_cols=451 Identities=10% Similarity=0.005 Sum_probs=195.2 Q ss_pred Cce--ecCceEEEEecCCCcccccC-CCceEEEeeccCCCCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHH---cC Q lcl|NC_012740. 1 MTL--LSPGFETKETTLSTTIVQSA-TGRAALVGKFQWGPAFQIVQVTNEVELVNKFGQPDNNTADYFMSGANFL---QY 74 (667) Q Consensus 1 ~~~--~~PGVyvee~~~~~~~~~~~-ts~~afvG~~~~Gp~~~p~~i~s~~e~~~~FG~~~~~~~~~~~v~~~f~---ng 74 (667) |-+ +-=--+|+..+.-....... .-.+-+++.-..=|++.....+|..|-...||. .+.++.+.+.||- |- T Consensus 1 m~~~~ip~s~iV~V~~~v~~~~~~~~~f~~lll~~~~~~~~~r~~~y~s~~~V~~~FG~---~S~ey~aA~~yFsg~~~q 77 (501) T protein:vir:10 1 MPTTTIPIDQIVQMLPGVIGAGGAPGRLTGLVLTQDTSVQPGQLADFFQKTDVENWFGA---LSNEAKIADAYFPGIVNG 77 (501) T ss_pred CCcCccccceEEEEeeecccCCCcccccceEEEecccCCCccceeeecCHHHHHHhcCC---ChHHHHHHHHHhhhhcCC Confidence 776 33345555444221111112 223345555556688888888999999999996 6667777777774 22 Q ss_pred ---CCeEEEEEcCCcccccccccccccccceeeeccccccccceeeEeeeccceeccccceeecccccceeeeecccccc Q lcl|NC_012740. 75 ---GNDLRVVRVLNKEKAKNATALAGNVEFEITNEGSNYEVGDTIKIKHNRQDIETAGKVTKVDGDGKVKGVFIPTGKII 151 (667) Q Consensus 75 ---G~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~ 151 (667) =+++||-|-..... .+...+..+...........+ | .+.+..... .....++.+.. .. +. T Consensus 78 ~p~P~~l~igR~~~~~~--~~~l~g~~l~~~~la~~~~~~-g-~l~i~i~g~-----~~~~~i~~s~a--ts------~~ 140 (501) T protein:vir:10 78 GQLPYDLKFARYVAADA--PASVYGIPLTGITLAQLQGYS-G-TLTVTTAAQ-----HVSANISLAAA--TS------FA 140 (501) T ss_pred CccccEEEEEeecccCc--cceeeeceehhhhhhhhhhee-e-EEEEeeccc-----eeeeccccccc--cC------HH Confidence 36899999654221 111111111000000000000 0 000000000 00000000000 00 00 Q ss_pred cccccccccccccccceEEEEEeecccccceeeece-eeeceeeeeeccccchhhhccccccccccccceeeeeeccccc Q lcl|NC_012740. 152 AHAKAIGVYPELDGGWTAEFTSSSGNGSAALSVTKI-VTDSGLLLTDLETSRANITNQDFLTKLKKYDMPAVSAIYAGEI 230 (667) Q Consensus 152 ~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~-v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~A~~~G~~ 230 (667) +.+..+... .. ... .+..+. ....+.... ...+ T Consensus 141 ~vA~~i~~a------l~--------~~~--~tv~~d~~~~~f~i~~------------------~t~G------------ 174 (501) T protein:vir:10 141 NAATLIEAA------FT--------SPD--FVVAYDALRNRFTVVT------------------NTTG------------ 174 (501) T ss_pred HHHHHHHHh------hc--------CCc--eEEEEecccceEEEEe------------------cccC------------ Confidence 000001000 00 000 000000 000000000 0000 Q ss_pred cceeEEEEeecccccccceeeeeeeecccccccceeeeeccccccccceeeeeccceeeeeEeeeccCCccccccccccc Q lcl|NC_012740. 231 GNSLEVEILARSSFSGAVAPELTMYPFGGTRAAARNLIPYAPQNDNQYAFIVRRDGVVVESYVLSTLKGDKDVYGNSIYM 310 (667) Q Consensus 231 gn~i~v~i~~~a~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~v~e~~~~s~~~~~~~~~~~~~~~ 310 (667) ....+...... ........++. T Consensus 175 -~~~~i~~~t~~-----------------------------~d~a~~l~Lt~---------------------------- 196 (501) T protein:vir:10 175 -TAAAISAVTGT-----------------------------NNLADELGLSA---------------------------- 196 (501) T ss_pred -cceeEEEeecc-----------------------------ccchhhhcccc---------------------------- Confidence 00001100000 00000000000 Q ss_pred hhhhcccccceEEEecccccCcccceEEecCCccccccccccccccccccchhHHHHHHhhhcccccccEEecCcCCcch Q lcl|NC_012740. 311 DDFFARGSSQYIYATAQGWVDGFSGIISLAGGVSANEASTGDRGNDPFIGAMMQGWDLFAERESIHVNLLIAGACAGEGD 390 (667) Q Consensus 311 ~~~~~~~~s~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~ 390 (667) ....++.+. |... ..+.+.+..+......-..+..+.. T Consensus 197 ------~~~a~v~~~----------------g~~a--------------et~~~Al~a~~~~~~~Wy~f~~a~~------ 234 (501) T protein:vir:10 197 ------AAGATLQAA----------------GVAA--------------DTPASAMNRAVGLSRNWATFTTAWT------ 234 (501) T ss_pred ------cCceeEEec----------------Cccc--------------ccHHHHHHHHHhcccceEEEEEEec------ Confidence 000000000 0000 0011122222211110011111100 Q ss_pred hhHHHHHHHHHHHhhcCcEEEE--EccCccccccccccCCHHHHHHHhhhhccccccccccCcceEEEEehhhccccccc Q lcl|NC_012740. 391 AFSTVQKHAVSIGDERQDCLVM--VSPPRSTVVNIPVTTAIDNLIAWREGNSNYSDNNMNINTTYAVIDGNYKYQYDKYN 468 (667) Q Consensus 391 ~~~~v~~~~~~~~~~~~~~~ai--~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~v~d~~~ 468 (667) ...+-..++...+|....++.+ -|..... ......+ ........ -+..+....|+. T Consensus 235 ~~~~~~la~A~wi~a~~~~f~~~~~~~~~~~-~~~~~~~---~i~~~l~~----------~~y~~t~~~y~~-------- 292 (501) T protein:vir:10 235 AVIADRLAFAAWNSGQAYKYMYVAPDLEAAS-IVTNNAA---SFGAQVFA----------APYQGTLPLYGD-------- 292 (501) T ss_pred CChHHHHHHHHHHHhcCceEEEEEecCccee-eecccch---hHHHHHHh----------cCCCceEEECCC-------- Confidence 0112223445555544333322 2221111 1111111 11111111 122344444431 Q ss_pred CceeEechHHHHHHHHHHhhhcCCc-eeeecceec-cceeccccccccCChhhhhhhhhcCceEEEEecCC--eEEEEcc Q lcl|NC_012740. 469 DVNRWVPLAADIAGLCARTDAVSQP-WMSPAGYNR-GQIMNVVKLAIEPRKAHRDRLYQAAINPVIGAGGE--GFILMGD 544 (667) Q Consensus 469 ~~~~~~p~sg~vAg~~a~~d~~~g~-~~span~~~-~~i~g~~~~~~~~~~~e~~~Ln~~gIn~i~~~~~~--G~~~wG~ 544 (667) .+|.+.+.|..+.+|-++-+ -.+-.+|.+ .|+ ..-.++..|.+.|..+|+|++..+.+. .+.+|-. T Consensus 293 -----~~~~aa~~g~~as~nf~~~~g~~T~~fkql~~Gv-----~a~~l~~t~a~al~~~~~N~y~~~~~~~~~~~~~~~ 362 (501) T protein:vir:10 293 -----QATAGAVMGYAASINFQLRNGRTVLAFRQFNAGV-----PATAHDLPTANALRSNNYTYIGAYANAANNYTIAYD 362 (501) T ss_pred -----CCHHHHHHHHHHhcCcccCcceeeeeecccCCCc-----CcccCCHHHHHHHHhcCCeEEEEEecccceeeEEEc Confidence 23567778888888754311 111223332 222 123478899999999999999888754 4778855 Q ss_pred eecCCCcccceeeehhhhhHHHHHHHHHHHHHHhcC----CCCHHHHHHHHHHHHHHHHHHHhcCCee------------ Q lcl|NC_012740. 545 KTATTVPSPFDRINVRRLFNMLKKNIGDSSKYKLFE----NNDNFTRASFRMEVSQYLSTIRSLGGIY------------ 608 (667) Q Consensus 545 rT~~~~~~~~~~i~vrR~~~~i~~~l~~~~~~~v~e----pn~~~~~~~i~~~i~~~l~~l~~~gal~------------ 608 (667) -++++ +|.+|.+-+-.+||+..|++.+....-. |-++.=...|+..|+.-|++-+++|.|. T Consensus 363 G~~sG---~~~wiD~~~g~dWl~~~iq~~l~~ll~~~~kIPyt~~G~~~l~a~i~~~l~~av~nG~Ia~Gv~l~~~q~~~ 439 (501) T protein:vir:10 363 GKLSG---KFLWVDTYLDQIYLNAELQRAEFEAMLAYNSLPYNEDGYTANYRAGVDVIDAAVTSGIIRAGVTLTNSQLQQ 439 (501) T ss_pred ceeec---cceehhhHhhHHHHHHHHHHHHHHHHhcCCCcccCHHHHHHHHHHHHHHHHHHHhCcceecCcccCccccee Confidence 55665 4678888888999999999988765433 4477888899999999999999999883 Q ss_pred -----------------eeEEEEcccCC-CHHHhhCCeEEEEEEEEecCCceEEEEEEEEee Q lcl|NC_012740. 609 -----------------DFRVQCDTTNN-TPDVIDRNEFVASMFIKPAKSINYIMLNFTAVA 652 (667) Q Consensus 609 -----------------g~~v~~d~~~n-t~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~ 652 (667) ||++.++...+ +++--.+.-..+.+.++---.+++|++-....- T Consensus 440 i~~~~g~~~~~~~v~~~Gyy~~~~~~~~~~~~R~~R~~p~~~~~y~~~gaIh~v~i~s~~v~ 501 (501) T protein:vir:10 440 IDAVAGVAGAGALVQTRGWYFLIGNPANPGQARQNRTSPACTLWYSDGGSIQELTIGSNAVI 501 (501) T ss_pred ecccccccccccceeccceEEeeCcccCChhhhhhcccCceEEEEEeCCceeEEEeeeeecC Confidence 37777775433 344344555677777888888888877533222 No 64 >protein:vir:96104 Length: 504 # NCBI annotation: hypothetical protein ORF029 # Family: family:all:396 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294446;genbank:gi:149408343;genbank:GeneID:5237223 Probab=70.68 E-value=0.21 Score=24.35 Aligned_cols=454 Identities=11% Similarity=0.038 Sum_probs=199.5 Q ss_pred CceecCceEEEEecCCCcc---cccCCCceEEEeeccCCCCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHHcCC-- Q lcl|NC_012740. 1 MTLLSPGFETKETTLSTTI---VQSATGRAALVGKFQWGPAFQIVQVTNEVELVNKFGQPDNNTADYFMSGANFLQYG-- 75 (667) Q Consensus 1 ~~~~~PGVyvee~~~~~~~---~~~~ts~~afvG~~~~Gp~~~p~~i~s~~e~~~~FG~~~~~~~~~~~v~~~f~ngG-- 75 (667) |--++ -||+..+ +..+ ......++-|++.-..=|++.....+|..|-...||. .+.++.+.+.+|-+-- T Consensus 1 mip~s--~iV~V~~-~v~~~~~~~~~~~~~l~l~~~~~~~~~r~~~y~s~~~V~~~FG~---~S~ey~aA~~yF~~~~~~ 74 (504) T protein:vir:96 1 MISQS--RYIRIIS-GVGAGAPVAGRKLILRVMTTNNVIPPGIVIEFDNANAVLSYFGA---QSEEYQRAAAYFKFISKS 74 (504) T ss_pred CCCcc--ceeEeee-cccccccccccccceeEeecccCCCccceEEecCHHHHHHhcCC---ChHHHHHHHHHhhcCCCC Confidence 65555 3444333 2222 2223567788888777788888889999999999996 5577778888887633 Q ss_pred ----CeEEEEEcCCcccccccccccccccceeeeccccccccceeeEeeeccceeccccceeecccccceeeeecccccc Q lcl|NC_012740. 76 ----NDLRVVRVLNKEKAKNATALAGNVEFEITNEGSNYEVGDTIKIKHNRQDIETAGKVTKVDGDGKVKGVFIPTGKII 151 (667) Q Consensus 76 ----~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~ 151 (667) +++||-|-... +..+...+..+..... .......| .+.+........ +..++.+.. ..+...+... T Consensus 75 ~~~P~~l~igR~~~~--a~~~~l~g~~~~~~~~-~~~~i~~G-~lsitv~G~~~~----~~~i~~S~~--ts~~~vA~~i 144 (504) T protein:vir:96 75 VNSPSSISFARWVNT--AIAPMVVGDNLPKTIA-DFAGFSAG-VLTIMVGAAEKN----ITAIDTSAA--TSMDNVASII 144 (504) T ss_pred CccccEEEEEeecCc--CccceEEechhHHHHH-HHhhhhce-EEEEEEcceeee----ecccccccc--cchHHHHHHH Confidence 79999996432 1111111111110000 00000111 011111100000 011110000 0000000000 Q ss_pred cccccccccccccccceEEEEEeecccccceeeeceee-eceeeeeeccccchhhhccccccccccccceeeeeeccccc Q lcl|NC_012740. 152 AHAKAIGVYPELDGGWTAEFTSSSGNGSAALSVTKIVT-DSGLLLTDLETSRANITNQDFLTKLKKYDMPAVSAIYAGEI 230 (667) Q Consensus 152 ~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~v~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~A~~~G~~ 230 (667) ..+ +..... ......+..+.-. ..+. +....+ +....... T Consensus 145 ~~a--l~~~~~--------------~~~~~~tv~~d~~~~~f~-its~~t-----------------g~~~~~~~----- 185 (504) T protein:vir:96 145 QTE--IRKNTD--------------PQLAQATVTWNPNTNQFT-LVGATI-----------------GTGVLAVA----- 185 (504) T ss_pred Hhh--hhcccc--------------cccccceEEEeccCCeEE-EEeecc-----------------ccceeEEE----- Confidence 000 000000 0000000110000 0000 000000 00000000 Q ss_pred cceeEEEEeecccccccceeeeeeeecccccccceeeeeccccccccceeeeeccceeeeeEeeeccCCccccccccccc Q lcl|NC_012740. 231 GNSLEVEILARSSFSGAVAPELTMYPFGGTRAAARNLIPYAPQNDNQYAFIVRRDGVVVESYVLSTLKGDKDVYGNSIYM 310 (667) Q Consensus 231 gn~i~v~i~~~a~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~v~e~~~~s~~~~~~~~~~~~~~~ 310 (667) ...... .......++. ++ T Consensus 186 ------~~a~~~------------------------------~~~~~lgl~~--~~------------------------ 203 (504) T protein:vir:96 186 ------KSADPQ------------------------------DMSTALGWST--SN------------------------ 203 (504) T ss_pred ------eecccc------------------------------chhhhhhccc--cc------------------------ Confidence 000000 0000000000 00 Q ss_pred hhhhcccccceEEEecccccCcccceEEecCCccccccccccccccccccchhHHHHHHhhhcccccccEEecCcCCcch Q lcl|NC_012740. 311 DDFFARGSSQYIYATAQGWVDGFSGIISLAGGVSANEASTGDRGNDPFIGAMMQGWDLFAERESIHVNLLIAGACAGEGD 390 (667) Q Consensus 311 ~~~~~~~~s~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~ 390 (667) .+.+ .|.+. ..+.+.+..+......-..++.+... T Consensus 204 ----------~~~v----------------~g~~a--------------et~~~al~al~~~~~~Wy~f~~a~~~----- 238 (504) T protein:vir:96 204 ----------VVNV----------------AGQAA--------------DLPDAAVAKSTNVSNNFGSFLFAGAT----- 238 (504) T ss_pred ----------ceEE----------------eeccc--------------ccHHHHHHHHHhhcCCeEEEEEEecc----- Confidence 0000 00000 00111122221111100111111100 Q ss_pred hhHHHHHHHHHHHhhcCcEEEEEccCccccccccccCCHHHHHHHhhhhccccccccccCcceEEEEehhhcccccccCc Q lcl|NC_012740. 391 AFSTVQKHAVSIGDERQDCLVMVSPPRSTVVNIPVTTAIDNLIAWREGNSNYSDNNMNINTTYAVIDGNYKYQYDKYNDV 470 (667) Q Consensus 391 ~~~~v~~~~~~~~~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~v~d~~~~~ 470 (667) .......++..++|....++...-. +.. ... ........ ........+++... +. T Consensus 239 ~~dd~ilalA~w~ea~~~~~~~~~~------~~~-~~~-~~~~~~~~----------~~~~~~~~~~~~~~------~~- 293 (504) T protein:vir:96 239 LDNDQIKAVSAWNAAQNNQFIYTVA------TSL-ANL-GALFDLVK----------GNSGTALNVLSATA------SN- 293 (504) T ss_pred CCHHHHHHHHHHHhhcCceEEEEEe------ecc-cch-hhHHHhhh----------hcceeEEEEeecCc------cc- Confidence 0111223445555543333322211 000 001 11111110 01111122222210 00 Q ss_pred eeEechHHHHHHHHHHhhhcC--CceeeecceeccceeccccccccCChhhhhhhhhcCceEEEEecCCe--EEEE-cce Q lcl|NC_012740. 471 NRWVPLAADIAGLCARTDAVS--QPWMSPAGYNRGQIMNVVKLAIEPRKAHRDRLYQAAINPVIGAGGEG--FILM-GDK 545 (667) Q Consensus 471 ~~~~p~sg~vAg~~a~~d~~~--g~~~span~~~~~i~g~~~~~~~~~~~e~~~Ln~~gIn~i~~~~~~G--~~~w-G~r 545 (667) . -+..+.++.++.+|-++ | -..-.+|.+. |+. .-.++..|.+.|..+|+|++..+.+.| +.+| .+. T Consensus 294 --~-~~~~~~~~~~as~~f~~~ng-~~T~~fk~l~---GVt--a~~lt~t~~~aL~~~~~N~y~~~a~~~~~~~~~~~G~ 364 (504) T protein:vir:96 294 --D-FVEQCPSEILAATNYDEPGA-SQNYMYYQFP---GRN--ITVSDDTAANTVDKSRGNYIGVTQANGQQLAFYQRGI 364 (504) T ss_pred --h-hHHHHHHHHHHhcCcCcccc-cccccccccC---CcC--cccCCHHHHHHHHhcCCeEEEEeecccceeeEEecCe Confidence 1 23455567777776322 2 1112244433 332 235789999999999999998887554 4454 556 Q ss_pred ecCCCcccceeeehhhhhHHHHHHHHHHHHHHhcC----CCCHHHHHHHHHHHHHHHHHHHhcCCee------------- Q lcl|NC_012740. 546 TATTVPSPFDRINVRRLFNMLKKNIGDSSKYKLFE----NNDNFTRASFRMEVSQYLSTIRSLGGIY------------- 608 (667) Q Consensus 546 T~~~~~~~~~~i~vrR~~~~i~~~l~~~~~~~v~e----pn~~~~~~~i~~~i~~~l~~l~~~gal~------------- 608 (667) ++++.. +|.+|.+-+-.+||+..|+..+....-. |-++.=..+|+..|+.-|++-+++|.|. T Consensus 365 ~~gG~~-~~~wiDv~~~~~WL~~~lq~~l~~l~~~~~kIPyt~~Gi~~l~a~i~~vl~~av~~G~I~~Gv~~~~~q~~~I 443 (504) T protein:vir:96 365 LCGGPT-DAVDMNVYANEIWLKSAIAQALLDLFLNVNAVPASMVGEAMTLAVLQPVLDKATSNGTFTYGKDISAVQQQYI 443 (504) T ss_pred eeCCcc-ccchhhhhhhHHHHHHHHHHHHHHHHhcCCCcccCHhhHHHHHHHHHHHHHHHHhcceeccCccCCccchhee Confidence 666542 5888999999999999999999775443 3478888999999999999999999772 Q ss_pred ----------------eeEEEEc-ccCCCHHHhh-CCeEEEEEEEEecCCceEEEEEEEEe Q lcl|NC_012740. 609 ----------------DFRVQCD-TTNNTPDVID-RNEFVASMFIKPAKSINYIMLNFTAV 651 (667) Q Consensus 609 ----------------g~~v~~d-~~~nt~~~i~-~G~~~~~i~~~p~~p~e~i~~~~~~~ 651 (667) ||+|.++ .++.++++.. ++-..|.+.++---.+++|++.-.-. T Consensus 444 ~~~~~~d~~~~~~~~~GYyv~~~~~s~~s~~~r~~R~~~~~~~~y~~~gaI~~v~~~~~~v 504 (504) T protein:vir:96 444 TQITGDRRAWRQVQTLGYWINITFSSYTNSNTGLTEWKANYTLIYSKGDAIRFVEGSDVMI 504 (504) T ss_pred cccccccccccceeccceEEEecChhccChhHhhhccccceEEEEEECCeEEEEEeccccC Confidence 4888886 4555655554 44557788888888888888863333 No 65 >protein:vir:3636 Length: 501 # NCBI annotation: gp04 # Family: family:all:396 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705631;genbank:gi:23752316;genbank:GeneID:955753 Probab=65.75 E-value=0.28 Score=23.63 Aligned_cols=450 Identities=10% Similarity=0.021 Sum_probs=195.3 Q ss_pred Cce--ecCceEEEEecCCCcccccC-CCceEEEeeccCCCCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHH---cC Q lcl|NC_012740. 1 MTL--LSPGFETKETTLSTTIVQSA-TGRAALVGKFQWGPAFQIVQVTNEVELVNKFGQPDNNTADYFMSGANFL---QY 74 (667) Q Consensus 1 ~~~--~~PGVyvee~~~~~~~~~~~-ts~~afvG~~~~Gp~~~p~~i~s~~e~~~~FG~~~~~~~~~~~v~~~f~---ng 74 (667) |-+ +-=--+|+..+.-....... .-.+-+++.-..=|++.....+|..|-...||. .+.++.+.+.+|- |- T Consensus 1 m~~~~ip~s~iV~V~~~v~~~~~~~~~~~~lllt~~~~~~~~r~~~y~s~~~V~~~FG~---~S~ey~aA~~yFs~~~~q 77 (501) T protein:vir:36 1 MPTTTIPIDQIVQMLPGVIGAGGAPGRLTGLVLTQDTSVQPGQLADFFQETDVENWFGA---LSNEAKIADAYFPGIVNG 77 (501) T ss_pred CCcCCcccceEEEEeeeeccCCCcceeeeeEEEeccCCCCCcceeeecCHHHHHHhcCC---ChHHHHHHHHHhhcccCC Confidence 776 33345555444221212222 223344444455578888888899999999996 6677778888885 22 Q ss_pred ---CCeEEEEEcCCcccccccccccccccceeeeccccccccceeeEeeeccceeccccceeecccccceeeeecccccc Q lcl|NC_012740. 75 ---GNDLRVVRVLNKEKAKNATALAGNVEFEITNEGSNYEVGDTIKIKHNRQDIETAGKVTKVDGDGKVKGVFIPTGKII 151 (667) Q Consensus 75 ---G~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~ 151 (667) =+++||-|-..... .+...+..+...........+ | .+.+..... .....++.+.. . .+. T Consensus 78 ~~~P~~l~igR~~~~a~--~~~l~g~~l~~~~~a~~~~~s-g-~l~vti~g~-----~~~~~i~lS~~--t------s~~ 140 (501) T protein:vir:36 78 GQLPYDLKFARYVAADA--PASVYGIPLTGVTLAQLQGYS-G-TLTVTTAAQ-----HVSANISLAAA--T------SFA 140 (501) T ss_pred CccccEEEEEeecCcCc--ceeEeccchhhhhhhhcccee-E-EEEEEecce-----eeeeecccccc--c------CHH Confidence 35789999753211 111111101000000000000 0 000000000 00000000000 0 000 Q ss_pred cccccccccccccccceEEEEEeecccccceeeece-eeeceeeeeeccccchhhhccccccccccccceeeeeeccccc Q lcl|NC_012740. 152 AHAKAIGVYPELDGGWTAEFTSSSGNGSAALSVTKI-VTDSGLLLTDLETSRANITNQDFLTKLKKYDMPAVSAIYAGEI 230 (667) Q Consensus 152 ~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~-v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~A~~~G~~ 230 (667) +.+..+... .. .. ..+..+. ....+........ T Consensus 141 ~vA~~i~~a------l~--------~~--~~tv~~d~~~~~f~i~s~t~G------------------------------ 174 (501) T protein:vir:36 141 NAATLIEAA------FT--------SP--DFVVAYDALRNRFTVVTNATG------------------------------ 174 (501) T ss_pred HHHHHHhhh------hc--------Cc--ceEEEEcCcceeEEEEeccCC------------------------------ Confidence 000001000 00 00 0000000 0000000000000 Q ss_pred cceeEEEEeecccccccceeeeeeeecccccccceeeeeccccccccceeeeeccceeeeeEeeeccCCccccccccccc Q lcl|NC_012740. 231 GNSLEVEILARSSFSGAVAPELTMYPFGGTRAAARNLIPYAPQNDNQYAFIVRRDGVVVESYVLSTLKGDKDVYGNSIYM 310 (667) Q Consensus 231 gn~i~v~i~~~a~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~v~e~~~~s~~~~~~~~~~~~~~~ 310 (667) ....+...... ....+...++. T Consensus 175 -~~~~i~~~t~~-----------------------------~~ia~~l~Lt~---------------------------- 196 (501) T protein:vir:36 175 -TAAAISAVTGT-----------------------------NNFADEIGLSA---------------------------- 196 (501) T ss_pred -cceeeEeeecc-----------------------------cchhhhhcccc---------------------------- Confidence 00000000000 00000000000 Q ss_pred hhhhcccccceEEEecccccCcccceEEecCCccccccccccccccccccchhHHHHHHhhhcccccccEEecCcCCcch Q lcl|NC_012740. 311 DDFFARGSSQYIYATAQGWVDGFSGIISLAGGVSANEASTGDRGNDPFIGAMMQGWDLFAERESIHVNLLIAGACAGEGD 390 (667) Q Consensus 311 ~~~~~~~~s~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~ 390 (667) ....++... |.+. ..+.+.+..+......-..+..+.. T Consensus 197 ------~~~a~v~~~----------------g~~~--------------et~~~al~a~~~~s~~Wy~f~~a~~------ 234 (501) T protein:vir:36 197 ------AAGATLQAA----------------GVAA--------------DTPASAMNRAVGLSRNWATFTTAWT------ 234 (501) T ss_pred ------cCcceEEec----------------cccc--------------ccHHHHHHHHHhccCceEEEEEecC------ Confidence 000000000 0000 0111222222211111111111111 Q ss_pred hhHHHHHHHHHHHhhcCcEEEEE--ccCccccccccccCCHHHHHHHhhhhccccccccccCcceEEEEehhhccccccc Q lcl|NC_012740. 391 AFSTVQKHAVSIGDERQDCLVMV--SPPRSTVVNIPVTTAIDNLIAWREGNSNYSDNNMNINTTYAVIDGNYKYQYDKYN 468 (667) Q Consensus 391 ~~~~v~~~~~~~~~~~~~~~ai~--d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~v~d~~~ 468 (667) ...+-..++...+|....++.++ |..... .. ............. -+..+.+..|. + T Consensus 235 ~~~~~~la~A~wiea~~~~f~~~~~~~~~~~-~~---~~~~~~i~~~l~~----------~~y~~t~~~y~------~-- 292 (501) T protein:vir:36 235 AVIADRLAFASWNSGQAYKYMYVAPDLEAAS-IV---SNNAASFGAQVFA----------APYQGTLPLYG------D-- 292 (501) T ss_pred CChHHHHHHHHHHhhcCceEEEEEecCchhh-hh---ccchhhHHHHHHh----------cCCCcEEEEcC------C-- Confidence 01122334555665554443332 111000 00 0011111111111 12233444332 1 Q ss_pred CceeEechHHHHHHHHHHhhhcC--Cceeeecceec-cceeccccccccCChhhhhhhhhcCceEEEEecC--CeEEEEc Q lcl|NC_012740. 469 DVNRWVPLAADIAGLCARTDAVS--QPWMSPAGYNR-GQIMNVVKLAIEPRKAHRDRLYQAAINPVIGAGG--EGFILMG 543 (667) Q Consensus 469 ~~~~~~p~sg~vAg~~a~~d~~~--g~~~span~~~-~~i~g~~~~~~~~~~~e~~~Ln~~gIn~i~~~~~--~G~~~wG 543 (667) ..+..++.|..+.+|-++ | -..-.+|.+ .|+ ..-.++..|.+.|..+|+|++..+.+ ..+.+|- T Consensus 293 -----~~~~aa~~g~~as~nf~~~~g-~~T~~fkq~~~Gi-----~a~~l~~t~a~al~~~~~N~y~~~~~~~~~~~~~~ 361 (501) T protein:vir:36 293 -----QATAGAVMGYAASINFQLRNG-RTVLAFRQFNAGV-----PATVHDLPTANALRSNNYTYIGAYANAANNYTIAY 361 (501) T ss_pred -----CCHHHHHHHHHHhcCcccCcc-eeeeeccccCCCc-----CcCcCCHHHHHHHHhcCCcEEEEEecccceeeEEE Confidence 234566777888777433 2 111124432 222 12346789999999999999988764 4477775 Q ss_pred ceecCCCcccceeeehhhhhHHHHHHHHHHHHHHhcC----CCCHHHHHHHHHHHHHHHHHHHhcCCee----------- Q lcl|NC_012740. 544 DKTATTVPSPFDRINVRRLFNMLKKNIGDSSKYKLFE----NNDNFTRASFRMEVSQYLSTIRSLGGIY----------- 608 (667) Q Consensus 544 ~rT~~~~~~~~~~i~vrR~~~~i~~~l~~~~~~~v~e----pn~~~~~~~i~~~i~~~l~~l~~~gal~----------- 608 (667) .-++++ +|.||.+.+-.+||+..|++.+...+-. |-++.=...|+..|+.-|++-+++|.|. T Consensus 362 ~G~~sG---~~~wiD~~~g~dWL~~~iq~~l~~ll~~~~KIPytd~G~~~l~a~i~~~l~~av~nG~I~~Gv~~~~~q~~ 438 (501) T protein:vir:36 362 DGKLSG---KFLWVDTYLDQIYLNAELQRAEFEAMLAYNSLPYNEDGYTGLYRAGVDVIDAAVTSGIIRAGVTLTNSQLQ 438 (501) T ss_pred cCeeec---cchhhhHHHhHHHHHHHHHHHHHHHHhcCCCCccChhhHHHHHHHHHHHHHHHHhCceeecCCCCCcccce Confidence 556665 3678889999999999999999876543 4477788899999999999999999883 Q ss_pred ------------------eeEEEEcccCCCHHHh-hCCeEEEEEEEEecCCceEEEEEEEEee Q lcl|NC_012740. 609 ------------------DFRVQCDTTNNTPDVI-DRNEFVASMFIKPAKSINYIMLNFTAVA 652 (667) Q Consensus 609 ------------------g~~v~~d~~~nt~~~i-~~G~~~~~i~~~p~~p~e~i~~~~~~~~ 652 (667) ||++.++...+++++. .+.-..+.+.++---.+++|++-....- T Consensus 439 ~I~~~~g~~~~~~~v~~~Gyy~~~~~~~~~~~~R~~R~~p~~~~~y~~~gaIh~v~i~s~~v~ 501 (501) T protein:vir:36 439 QIDKAARVAGAGQLVQTRGWYFLIGDPANPGQARQNRTTPACTLWYSDGGSIQSLTIGSNAVI 501 (501) T ss_pred eecccccccccccceeccceEEeeCcccCChhhhhhcccCcEEEEEEeCCceeEEEeeeeeeC Confidence 4777777554444443 4455577778888888888877533222 No 66 >protein:vir:99586 Length: 507 # NCBI annotation: hypothetical protein # Family: family:all:396 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039795;genbank:gi:126011045;genbank:GeneID:4818281 Probab=61.95 E-value=0.34 Score=23.13 Aligned_cols=457 Identities=9% Similarity=0.012 Sum_probs=204.5 Q ss_pred CceecCceEEEEecCCCcccc--cC-CCceEEEeeccCCCCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHHcCC-- Q lcl|NC_012740. 1 MTLLSPGFETKETTLSTTIVQ--SA-TGRAALVGKFQWGPAFQIVQVTNEVELVNKFGQPDNNTADYFMSGANFLQYG-- 75 (667) Q Consensus 1 ~~~~~PGVyvee~~~~~~~~~--~~-ts~~afvG~~~~Gp~~~p~~i~s~~e~~~~FG~~~~~~~~~~~v~~~f~ngG-- 75 (667) |--++ -+|+..+ ++.+.. .. -.++.|++....=|++.....+|..|-...||. .+.++.+.+.||-+-= T Consensus 1 mip~s--~iVnV~~-~v~~~a~~~~~~~~~lilt~~~~~~~~r~~~y~s~~~V~~~FG~---~S~ey~aA~~yFsq~p~~ 74 (507) T protein:vir:99 1 MISQS--RYVRIVS-GVGAGAPVAQRRLIMRVMTTNAVLPPGVVFESSSADAVGAYFGM---ASEEYKRAKAYMSFISKS 74 (507) T ss_pred CCCcc--ceeEEee-eccccCcccccccceeeeccccCCCccceEeecCHHHHHHhcCC---ChHHHHHHHHHhccCCCC Confidence 55555 3444333 322222 22 457778877666688888899999999999996 6667778888887543 Q ss_pred ----CeEEEEEcCCcccccccccccccccceeeeccccccccceeeEeeeccceeccccceeecccccceeeeecccccc Q lcl|NC_012740. 76 ----NDLRVVRVLNKEKAKNATALAGNVEFEITNEGSNYEVGDTIKIKHNRQDIETAGKVTKVDGDGKVKGVFIPTGKII 151 (667) Q Consensus 76 ----~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~ 151 (667) +++||-|-..... .+...+..+.. ... ......+ +.+ T Consensus 75 ~~~P~~L~igR~~~~~~--~a~l~g~~~~~------------~l~------------~~~~~~~------G~l------- 115 (507) T protein:vir:99 75 INSPSYISFARWVNAAI--ASMIVGDSLVK------------NLP------------ALKAVAT------PTL------- 115 (507) T ss_pred CcccceEEEEeecCccc--cceeecchhhh------------hHH------------HHhhhcc------eeE------- Confidence 4889998743211 11100000000 000 0000000 000 Q ss_pred cccccccccccccccceEEEEEeecccccceeeeceeeeceeeeeeccccchhhhccccccccccccceeeeeecccccc Q lcl|NC_012740. 152 AHAKAIGVYPELDGGWTAEFTSSSGNGSAALSVTKIVTDSGLLLTDLETSRANITNQDFLTKLKKYDMPAVSAIYAGEIG 231 (667) Q Consensus 152 ~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~A~~~G~~g 231 (667) +...++.. .+... + +. ...+........+. ...... .+.. T Consensus 116 --------------------ti~v~G~~--~t~~~-i-~l-S~~ts~~~vAs~i~-----~~l~a~-----~~~~----- 155 (507) T protein:vir:99 116 --------------------SLSIGGTV--VPIAG-I-DL-TAALTLTDVAATLQ-----TKIRAS-----ANAE----- 155 (507) T ss_pred --------------------EEEEcCce--eEecc-c-cc-cccCCHHHHHHHHH-----Hhhhcc-----cccc----- Confidence 00000000 00000 0 00 00000000000000 000000 0000 Q ss_pred ceeEEEEeecccccccceeeeeeeecccccccceeeeeccccccccceeeeeccceee-eeEeeeccCCccccccccccc Q lcl|NC_012740. 232 NSLEVEILARSSFSGAVAPELTMYPFGGTRAAARNLIPYAPQNDNQYAFIVRRDGVVV-ESYVLSTLKGDKDVYGNSIYM 310 (667) Q Consensus 232 n~i~v~i~~~a~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~v~-e~~~~s~~~~~~~~~~~~~~~ 310 (667) ...+.+.-+ .....|.++....|.-. -.+......+ + . T Consensus 156 -~~~~tv~~d-------------------------------~~~~~F~v~s~~tG~~s~i~~at~~~~g-t-------~- 194 (507) T protein:vir:99 156 -LATATVTFN-------------------------------TTTNQFVLNGTTTGALAPTITAVRTDPA-T-------D- 194 (507) T ss_pred -ccceEEEEe-------------------------------cCCceEEEEeeeccccceeEEEEcCCch-h-------h- Confidence 000000000 00011222221111000 0000000000 0 0 Q ss_pred hhhhcccccceEEEecccccCcccceEEecCCccccccccccccccccccchhHHHHHHhhhcccccccEEecCcCCcch Q lcl|NC_012740. 311 DDFFARGSSQYIYATAQGWVDGFSGIISLAGGVSANEASTGDRGNDPFIGAMMQGWDLFAERESIHVNLLIAGACAGEGD 390 (667) Q Consensus 311 ~~~~~~~~s~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~ 390 (667) -+.++...+ ...+ ...|.+ ...+.+.+..+......=.-++.+-. .. T Consensus 195 -------~s~l~~~~~-------~~a~-~~~g~~--------------aet~~~a~~a~~~~~~nW~~~~~a~~----~~ 241 (507) T protein:vir:99 195 -------ISSLLGWTN-------TGTV-FVKGQA--------------AETPDTSISKSAAISTNFGSFIYTST----PA 241 (507) T ss_pred -------HHHHhcccc-------ccce-Eeeccc--------------ccCHHHHHHHHHhhcCCeEEEEEEec----cc Confidence 000000000 0000 011111 12234444544432221122222111 01 Q ss_pred hhHHHHHHHHHHHhhcCcEEEEEccCccccccccccCCHHHHHHHhhhhccccccccccCcceEEEEehhhcccccccCc Q lcl|NC_012740. 391 AFSTVQKHAVSIGDERQDCLVMVSPPRSTVVNIPVTTAIDNLIAWREGNSNYSDNNMNINTTYAVIDGNYKYQYDKYNDV 470 (667) Q Consensus 391 ~~~~v~~~~~~~~~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~v~d~~~~~ 470 (667) .......++...+|....++.+.-.-. + .. ....-.... ....+..-++.+.. T Consensus 242 ~td~~~lalA~wiea~~~~f~~~~~~~----~--a~-----~~~~~~~~~--------~~~~~~~~~~~~~~-------- 294 (507) T protein:vir:99 242 LTNDQITAVASWNASQNNMYMYSVPTT----I--AN-----IGTLYAAVK--------GFSGCALNITSDSL-------- 294 (507) T ss_pred cChHHHHHHHHHHhhcCcEEEEEEecC----c--hh-----hhhhhhhhh--------hcceeEEEeecccc-------- Confidence 122334566777777666665432110 0 00 000000000 00111112222110 Q ss_pred eeEechHHHHHHHHHHhhhcC--CceeeecceeccceeccccccccCChhhhhhhhhcCceEEEEecCC--eEEEEc-ce Q lcl|NC_012740. 471 NRWVPLAADIAGLCARTDAVS--QPWMSPAGYNRGQIMNVVKLAIEPRKAHRDRLYQAAINPVIGAGGE--GFILMG-DK 545 (667) Q Consensus 471 ~~~~p~sg~vAg~~a~~d~~~--g~~~span~~~~~i~g~~~~~~~~~~~e~~~Ln~~gIn~i~~~~~~--G~~~wG-~r 545 (667) -...+...+.|.++.+|-++ | -.+-..|.+. |+. .-.++..|.+.|..+|+|+...+.+. .+.+|- +. T Consensus 295 -~~~~~~aa~~g~~as~nf~~~ng-~~T~~fk~l~---GV~--a~~lt~t~a~al~~~n~N~y~~~a~~~~~~~~~~~G~ 367 (507) T protein:vir:99 295 -PVDYIEQSPCEILAATDYTRVNA-TQNYMYYQFP---SRN--ITVSDDTTANLVDANRGNYIGQTQSAGQSLAFYQRGI 367 (507) T ss_pred -cchhHHHHHHHHHHhhccCcCcc-ceeecccccC---Ccc--cccCCHHHHHHHHhcCCeEEEEeccccceeeEEecCe Confidence 01124566777777776322 2 1112244433 332 23588999999999999999988664 366664 44 Q ss_pred ecCCCcccceeeehhhhhHHHHHHHHHHHHHHhcC----CCCHHHHHHHHHHHHHHHHHHHhcCCee------------- Q lcl|NC_012740. 546 TATTVPSPFDRINVRRLFNMLKKNIGDSSKYKLFE----NNDNFTRASFRMEVSQYLSTIRSLGGIY------------- 608 (667) Q Consensus 546 T~~~~~~~~~~i~vrR~~~~i~~~l~~~~~~~v~e----pn~~~~~~~i~~~i~~~l~~l~~~gal~------------- 608 (667) ++.+. .+|.++.+-+=.+||+..|+..+....-. |-++.=...|+..|+.-|++-+++|.|. T Consensus 368 ~~gG~-~~fid~d~~~~~~WL~~~iq~~l~~l~~~~~kIPyt~~G~~~l~a~i~~~l~~av~nG~I~~Gvtl~~~q~~~i 446 (507) T protein:vir:99 368 LCGGP-NDAVDMNIYANEIWLKSAISAQILSLFLNVPRVPANETGESMLLSVIQSVVNTAKNNGTISAGKNLNVIQQQYI 446 (507) T ss_pred eeCCc-ccceeeeeecchHHHHHHHHHHHHHHHhcCCCCccChhhHHHHHHHHHHHHHHHHhccccccCCcccccchhee Confidence 44443 35778777777788888888888764332 4478888899999999999999998774 Q ss_pred ----------------eeEEEEc-ccCCCHHHh-hCCeEEEEEEEEecCCceEEEEEEEEe Q lcl|NC_012740. 609 ----------------DFRVQCD-TTNNTPDVI-DRNEFVASMFIKPAKSINYIMLNFTAV 651 (667) Q Consensus 609 ----------------g~~v~~d-~~~nt~~~i-~~G~~~~~i~~~p~~p~e~i~~~~~~~ 651 (667) ||++.++ .++.++++. .++-..|.+.++---.+++|++.-.-. T Consensus 447 n~~~~~~~~~~~~~~~Gyy~~~~~~s~~~~~~r~~r~~~~~~~~y~~~gaI~~v~~~~~~v 507 (507) T protein:vir:99 447 TQISGDANAWRQVANIGYWLNITFSSYTNPNTQLTEWKASYQLIYSKDDAIRFVEGTDTLI 507 (507) T ss_pred cccccccccccceeccceEEEeCChHhcChhhhhccccceEEEEEEeCCeEEEEEeeeecC Confidence 2777775 455555444 467778888888888899888864444 No 67 >protein:vir:78611 Length: 501 # NCBI annotation: BcepNY3gp03 # Family: family:all:396 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294840;genbank:gi:149882903;genbank:GeneID:5291082 Probab=48.77 E-value=0.66 Score=21.58 Aligned_cols=451 Identities=10% Similarity=0.005 Sum_probs=192.2 Q ss_pred Cce--ecCceEEEEecCCCcccccC-CCceEEEeeccCCCCCccEEecCHHHHHHHcCCcCccchhHHHHHHHHH---cC Q lcl|NC_012740. 1 MTL--LSPGFETKETTLSTTIVQSA-TGRAALVGKFQWGPAFQIVQVTNEVELVNKFGQPDNNTADYFMSGANFL---QY 74 (667) Q Consensus 1 ~~~--~~PGVyvee~~~~~~~~~~~-ts~~afvG~~~~Gp~~~p~~i~s~~e~~~~FG~~~~~~~~~~~v~~~f~---ng 74 (667) |-+ +-=--+|+..+.-....... .-.+-+++....=|++.....+|..|-...||. .+.++.+.+.+|- |- T Consensus 1 m~~~~ip~s~iV~V~~~v~~~~~~~~~~~~lll~~~~~~~~~r~~~y~s~~~V~~~FG~---~S~ey~aA~~yFs~~~~q 77 (501) T protein:vir:78 1 MPTTTIPIDQIVQMLPGVIGAGGAPGRLTGLVLTQDTSIQPGQLADFFQKTDVENWFGG---LSNEAVIADAYFPGIVNG 77 (501) T ss_pred CCcCccccceEEEEeeecccCCCcceeeeeEEEecCCCCCccceeeecCHHHHHHhcCC---ChHHHHHHHHHhhcCCCC Confidence 776 33345555444221211112 223445555555678888888899999999996 6667778888886 22 Q ss_pred ---CCeEEEEEcCCcccccccccccccccceeeeccccccccceeeEeeeccceeccccceeecccccceeeeecccccc Q lcl|NC_012740. 75 ---GNDLRVVRVLNKEKAKNATALAGNVEFEITNEGSNYEVGDTIKIKHNRQDIETAGKVTKVDGDGKVKGVFIPTGKII 151 (667) Q Consensus 75 ---G~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~ 151 (667) =+++||-|-..... .+...+..+...........+ ..+.+..... .....++.+.. .. +. T Consensus 78 ~~~P~~l~igR~~~~a~--~~~l~g~~l~~~~la~~~~~~--G~l~iti~g~-----~~~~~i~~S~~--ts------~~ 140 (501) T protein:vir:78 78 GQLPYDLKFARYVAADA--PASVYGIPLTGVTLTQLQGYS--GTLTVTTAAQ-----HVSSNISLAAA--TS------FA 140 (501) T ss_pred CcccceEEEEeecccCc--ceeEeccceeccchhhhceee--eEEEEEeccc-----eeeeccccccc--cC------HH Confidence 24679998654211 111111111000000000000 0000000000 00000000000 00 00 Q ss_pred cccccccccccccccceEEEEEeecccccceeeece-eeeceeeeeeccccchhhhccccccccccccceeeeeeccccc Q lcl|NC_012740. 152 AHAKAIGVYPELDGGWTAEFTSSSGNGSAALSVTKI-VTDSGLLLTDLETSRANITNQDFLTKLKKYDMPAVSAIYAGEI 230 (667) Q Consensus 152 ~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~-v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~A~~~G~~ 230 (667) +.+..+... .. . ...+..+. ....+....... . T Consensus 141 ~vA~~i~~a------l~--------a--~~~tv~~ds~~~~f~its~t~-----------------G------------- 174 (501) T protein:vir:78 141 NAATLIEAA------FT--------S--PDFVVSYDALRNRFVVNTNAT-----------------G------------- 174 (501) T ss_pred HHHHHHHhh------hc--------C--cceEEEEccccceEEEEeeec-----------------C------------- Confidence 000000000 00 0 00000000 000000000000 0 Q ss_pred cceeEEEEeecccccccceeeeeeeecccccccceeeeeccccccccceeeeeccceeeeeEeeeccCCccccccccccc Q lcl|NC_012740. 231 GNSLEVEILARSSFSGAVAPELTMYPFGGTRAAARNLIPYAPQNDNQYAFIVRRDGVVVESYVLSTLKGDKDVYGNSIYM 310 (667) Q Consensus 231 gn~i~v~i~~~a~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~v~e~~~~s~~~~~~~~~~~~~~~ 310 (667) ....+...... ........++. T Consensus 175 -~~~~i~~~t~~-----------------------------~~~a~~l~Lt~---------------------------- 196 (501) T protein:vir:78 175 -TAAAISAVTGT-----------------------------NNLADELGLSA---------------------------- 196 (501) T ss_pred -CceeEEEEecc-----------------------------cchhhhhcccc---------------------------- Confidence 00000000000 00000000000 Q ss_pred hhhhcccccceEEEecccccCcccceEEecCCccccccccccccccccccchhHHHHHHhhhcccccccEEecCcCCcch Q lcl|NC_012740. 311 DDFFARGSSQYIYATAQGWVDGFSGIISLAGGVSANEASTGDRGNDPFIGAMMQGWDLFAERESIHVNLLIAGACAGEGD 390 (667) Q Consensus 311 ~~~~~~~~s~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~ 390 (667) ....++.+ .|.. ...+.+.+..+......-..+..+.. T Consensus 197 ------~~~a~v~~----------------~g~~--------------aet~~~a~~a~~~~~~~Wy~f~~a~~------ 234 (501) T protein:vir:78 197 ------AAGASLQA----------------AGVA--------------ADTPASAMNRAVGLSRNWATFTTAWT------ 234 (501) T ss_pred ------cCceeeEe----------------cccc--------------ccCHHHHHHHHHhccCceEEEEEecC------ Confidence 00000000 0000 00011222222211111111111110 Q ss_pred hhHHHHHHHHHHHhhcCcEEEE--EccCccccccccccCCHHHHHHHhhhhccccccccccCcceEEEEehhhccccccc Q lcl|NC_012740. 391 AFSTVQKHAVSIGDERQDCLVM--VSPPRSTVVNIPVTTAIDNLIAWREGNSNYSDNNMNINTTYAVIDGNYKYQYDKYN 468 (667) Q Consensus 391 ~~~~v~~~~~~~~~~~~~~~ai--~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~v~d~~~ 468 (667) ...+-..++...+|....++.+ .|..... ......++ ....... -+..+.+..|+. T Consensus 235 ~~~~~~lalA~wiea~~~~f~~~~~~~~~~~-~~~~~~~~---i~~~l~a----------~~y~~t~~~y~~-------- 292 (501) T protein:vir:78 235 AVIADRLALASWNSGQAYKYMYVAPDLEPAS-IVTNNSAS---FGAQVFA----------APYQGTLPLYGD-------- 292 (501) T ss_pred CCHHHHHHHHHHHHhcCceEEEEEecCCcce-eecccchh---HHHHHhh----------cCCCceEEEcCC-------- Confidence 1112233455566554443322 2221111 11111111 1111111 122344444431 Q ss_pred CceeEechHHHHHHHHHHhhhcCCc-eeeecceec-cceeccccccccCChhhhhhhhhcCceEEEEecCC--eEEEEcc Q lcl|NC_012740. 469 DVNRWVPLAADIAGLCARTDAVSQP-WMSPAGYNR-GQIMNVVKLAIEPRKAHRDRLYQAAINPVIGAGGE--GFILMGD 544 (667) Q Consensus 469 ~~~~~~p~sg~vAg~~a~~d~~~g~-~~span~~~-~~i~g~~~~~~~~~~~e~~~Ln~~gIn~i~~~~~~--G~~~wG~ 544 (667) ..+...+.|..+.+|-++-. -.+-.+|.+ .|+ ..-.++..|.+.|..+|+|++..+.+. .+.+|-. T Consensus 293 -----~~~~aa~~g~~as~nf~~~~g~~T~~fkq~~~Gv-----~a~~l~~t~a~al~~~~~N~y~~~~~~~~~~~~~~~ 362 (501) T protein:vir:78 293 -----QATAGAVMGYAASINFQLRNGRTVLAFRQFNAGV-----PATAHDLGTANALRSNNYTYIGAYANAANNYTIAYD 362 (501) T ss_pred -----cchHHHHHHHHHhcCcccCcceeeeeccccCCCc-----CcccCCHHHHHHHHhcCCeEEEEEecccceeeEEEc Confidence 12345667777777643311 111123332 222 123478899999999999999888654 4788855 Q ss_pred eecCCCcccceeeehhhhhHHHHHHHHHHHHHHhcC----CCCHHHHHHHHHHHHHHHHHHHhcCCee------------ Q lcl|NC_012740. 545 KTATTVPSPFDRINVRRLFNMLKKNIGDSSKYKLFE----NNDNFTRASFRMEVSQYLSTIRSLGGIY------------ 608 (667) Q Consensus 545 rT~~~~~~~~~~i~vrR~~~~i~~~l~~~~~~~v~e----pn~~~~~~~i~~~i~~~l~~l~~~gal~------------ 608 (667) -++++ +|.+|.+-+-.+||+..++..+....-. |-++.=...|+..|+.-|++-+++|.|. T Consensus 363 G~~sG---~~~wiD~~~~~~Wl~~~iq~~l~~ll~~~~kIPyt~~G~~~l~a~v~~~l~~av~nG~I~~Gv~l~~~q~~~ 439 (501) T protein:vir:78 363 GKLSG---KFLWVDTYLDQIYLNAELQRAEFEAMLAYNSLPYNEDGYTALYRAGVDVIDAAVTSGIIRAGVTLTNSQLQQ 439 (501) T ss_pred Ceeec---cceeehhhhhHHHHHHHHHHHHHHHHHhCCCcccCHHHHHHHHHHHHHHHHHHHhCceeecCCCCCCcccee Confidence 55665 4677888888888888888888764432 4588888899999999999999999883 Q ss_pred -----------------eeEEEEcccCC-CHHHhhCCeEEEEEEEEecCCceEEEEEEEEee Q lcl|NC_012740. 609 -----------------DFRVQCDTTNN-TPDVIDRNEFVASMFIKPAKSINYIMLNFTAVA 652 (667) Q Consensus 609 -----------------g~~v~~d~~~n-t~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~ 652 (667) ||++.++...+ +++--.+.-..+.+.++---.+++|++-....- T Consensus 440 I~~~~g~~~~~~~~~~~Gyy~~~~~~~~~~~~R~~R~~p~~~~~y~~~gaIh~v~i~s~~v~ 501 (501) T protein:vir:78 440 IDAAAGVAGAGQLVQMRGWYFLIGDPANPGQARQNRTTPTCTLWYSDGGSIQELTIGSNAVI 501 (501) T ss_pred eccccCccccccceeccceEEeeccccCChhhhhhcccCcEEEEEEeCCceeEEEeeeeecC Confidence 37777775433 344334455577777777888888877433222 Done!