Query lcl|NC_014792.1_cdsid_YP_004063862.1 [gene=18] [protein=tail sheath protein] [protein_id=YP_004063862.1] [location=94338..96317] Match_columns 659 No_of_seqs 233 out of 831 Neff 9.3 Searched_HMMs 1612 Date Thu Nov 7 15:36:15 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_181 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_181_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:108052 Length: 660 100.0 3E-166 2E-169 928.3 57.0 659 1-659 1-659 (660) 2 protein:vir:103456 Length: 659 100.0 6E-164 4E-167 915.5 58.5 658 1-659 1-658 (659) 3 protein:vir:7206 Length: 659 # 100.0 5E-163 3E-166 910.4 59.4 658 1-659 1-658 (659) 4 protein:vir:101187 Length: 663 100.0 2E-162 1E-165 907.5 56.6 657 1-659 1-661 (663) 5 protein:vir:6894 Length: 660 # 100.0 2E-162 1E-165 907.0 55.4 658 1-659 1-658 (660) 6 protein:vir:101804 Length: 663 100.0 7E-162 4E-165 904.0 55.9 657 1-659 1-661 (663) 7 protein:vir:100539 Length: 663 100.0 1E-161 6E-165 903.2 55.8 657 1-659 1-660 (663) 8 protein:vir:98263 Length: 664 100.0 4E-161 2E-164 900.1 57.1 655 1-659 1-662 (664) 9 protein:vir:80984 Length: 666 100.0 7E-160 4E-163 893.3 53.1 655 1-659 1-663 (666) 10 protein:vir:106427 Length: 679 100.0 1E-159 9E-163 891.3 55.0 657 1-659 1-677 (679) 11 protein:vir:6594 Length: 666 # 100.0 4E-159 2E-162 889.2 53.7 655 1-659 1-665 (666) 12 protein:vir:5663 Length: 671 # 100.0 1E-147 9E-151 825.7 54.2 645 1-657 1-671 (671) 13 protein:vir:104477 Length: 749 100.0 1E-139 7E-143 782.2 56.6 642 1-657 1-749 (749) 14 protein:vir:106984 Length: 743 100.0 2E-139 1E-142 780.7 51.8 643 1-658 1-743 (743) 15 protein:vir:104858 Length: 729 100.0 3E-136 2E-139 763.7 50.5 629 1-659 3-729 (729) 16 protein:vir:79092 Length: 477 100.0 6E-106 4E-109 597.3 42.4 467 1-659 1-477 (477) 17 protein:vir:107865 Length: 477 100.0 3E-105 2E-108 593.8 39.8 467 1-659 1-477 (477) 18 protein:vir:98824 Length: 774 100.0 4E-102 2E-105 576.8 41.3 482 1-654 281-774 (774) 19 protein:vir:103168 Length: 641 100.0 1.1E-94 6.9E-98 535.7 36.4 526 1-549 3-641 (641) 20 protein:vir:79181 Length: 390 100.0 5.1E-94 3.2E-97 532.0 33.2 378 1-657 2-390 (390) 21 protein:vir:79141 Length: 391 100.0 9.5E-94 5.9E-97 530.6 33.0 376 1-658 2-391 (391) 22 protein:vir:103993 Length: 390 100.0 1.1E-93 6.8E-97 530.2 33.3 378 1-657 2-390 (390) 23 protein:vir:78206 Length: 390 100.0 1.1E-93 6.8E-97 530.2 33.3 378 1-657 2-390 (390) 24 protein:vir:6079 Length: 396 # 100.0 3.2E-93 2E-96 527.7 35.5 382 1-658 1-396 (396) 25 protein:vir:5711 Length: 396 # 100.0 7.2E-93 4.5E-96 525.7 36.1 385 1-658 1-396 (396) 26 protein:vir:98553 Length: 395 100.0 1.3E-92 8E-96 524.4 35.1 381 1-657 1-395 (395) 27 protein:vir:2035 Length: 396 # 100.0 1.2E-91 7.4E-95 519.0 33.8 385 1-658 1-396 (396) 28 protein:vir:1172 Length: 391 # 100.0 1.1E-91 7E-95 519.2 32.0 379 1-658 3-391 (391) 29 protein:vir:1845 Length: 392 # 100.0 4.6E-91 2.9E-94 515.8 34.0 381 1-657 1-392 (392) 30 protein:vir:100323 Length: 393 100.0 2E-90 1.2E-93 512.4 34.7 377 1-658 4-393 (393) 31 protein:vir:96740 Length: 388 100.0 1E-89 6.2E-93 508.5 34.9 375 1-656 4-388 (388) 32 protein:vir:10336 Length: 386 100.0 2.9E-87 1.8E-90 495.0 34.6 376 1-656 1-386 (386) 33 protein:vir:5833 Length: 742 # 100.0 1.1E-76 7.1E-80 436.9 47.6 590 5-653 1-742 (742) 34 protein:vir:63742 Length: 562 100.0 5.7E-64 3.5E-67 367.3 40.8 530 1-652 8-562 (562) 35 protein:vir:79798 Length: 717 100.0 2.4E-63 1.5E-66 363.9 38.6 606 1-647 1-717 (717) 36 protein:vir:80488 Length: 562 100.0 1.9E-62 1.2E-65 359.0 40.8 537 1-652 1-562 (562) 37 protein:vir:95741 Length: 587 100.0 8.2E-62 5.1E-65 355.5 39.1 565 1-652 1-587 (587) 38 protein:vir:102819 Length: 648 100.0 1.4E-61 8.8E-65 354.2 38.3 584 1-650 1-648 (648) 39 protein:vir:80779 Length: 569 100.0 3.5E-60 2.1E-63 346.6 41.1 544 1-652 1-569 (569) 40 protein:vir:99306 Length: 587 100.0 1.2E-59 7.5E-63 343.6 40.3 560 1-652 1-587 (587) 41 protein:vir:96586 Length: 587 100.0 6.5E-57 4E-60 328.6 44.3 560 1-652 9-587 (587) 42 protein:vir:102957 Length: 437 100.0 1.6E-50 1E-53 293.5 37.9 417 1-646 9-437 (437) 43 protein:vir:100829 Length: 607 100.0 3.7E-50 2.3E-53 291.6 39.6 542 1-658 17-607 (607) 44 protein:vir:101326 Length: 529 100.0 6.9E-46 4.3E-49 268.2 33.8 494 1-647 1-529 (529) 45 protein:vir:105470 Length: 451 100.0 1.6E-39 1E-42 233.3 38.2 424 1-646 9-451 (451) 46 protein:vir:7653 Length: 581 # 100.0 5.3E-36 3.3E-39 214.0 36.0 542 40-659 1-579 (581) 47 protein:vir:107310 Length: 581 100.0 7.4E-35 4.6E-38 207.7 36.6 533 40-659 1-579 (581) 48 protein:vir:78986 Length: 436 100.0 4E-28 2.5E-31 170.8 38.4 406 1-646 1-436 (436) 49 protein:vir:102359 Length: 356 99.4 2.4E-13 1.5E-16 89.7 22.4 325 245-645 1-356 (356) 50 protein:vir:4463 Length: 498 # 99.2 2.2E-10 1.4E-13 73.5 26.7 436 1-650 1-498 (498) 51 protein:vir:4517 Length: 498 # 99.2 2.6E-10 1.6E-13 73.1 26.7 437 1-650 1-498 (498) 52 protein:vir:489 Length: 498 # 99.2 4.2E-10 2.6E-13 72.0 26.9 438 1-650 1-498 (498) 53 protein:vir:3788 Length: 376 # 99.0 2.6E-09 1.6E-12 67.6 26.3 358 229-650 1-376 (376) 54 protein:vir:78782 Length: 370 98.9 4.3E-09 2.7E-12 66.5 21.9 352 229-657 1-370 (370) 55 protein:vir:276 Length: 369 # 98.8 3.1E-08 1.9E-11 61.7 24.8 358 228-650 1-369 (369) 56 protein:vir:3751 Length: 376 # 98.8 4.7E-08 2.9E-11 60.8 29.2 360 229-654 1-376 (376) 57 protein:vir:95263 Length: 450 98.7 5.4E-08 3.4E-11 60.4 23.9 412 127-648 1-450 (450) 58 protein:vir:1996 Length: 495 # 98.5 4.3E-07 2.7E-10 55.5 33.5 439 1-647 11-495 (495) 59 protein:vir:80052 Length: 331 98.0 8.7E-06 5.4E-09 48.3 28.2 314 229-647 1-331 (331) 60 protein:vir:5260 Length: 502 # 97.8 2.2E-05 1.3E-08 46.2 37.1 464 1-647 1-502 (502) 61 protein:vir:3165 Length: 426 # 96.5 0.00057 3.6E-07 38.4 16.0 355 245-647 1-426 (426) 62 protein:vir:96104 Length: 504 96.1 0.0011 6.5E-07 36.9 20.9 426 144-646 1-504 (504) 63 protein:vir:99586 Length: 507 95.8 0.0015 9.3E-07 36.1 23.4 423 144-646 1-507 (507) 64 protein:vir:101576 Length: 501 86.5 0.045 2.8E-05 27.9 35.8 455 1-656 1-501 (501) 65 protein:vir:106730 Length: 501 85.8 0.05 3.1E-05 27.7 34.8 452 1-656 1-501 (501) 66 protein:vir:3636 Length: 501 # 84.3 0.062 3.8E-05 27.2 35.8 452 1-656 1-501 (501) 67 protein:vir:78611 Length: 501 69.1 0.23 0.00014 24.1 35.8 453 1-656 1-501 (501) 68 protein:vir:107720 Length: 515 50.3 0.61 0.00038 21.7 31.0 469 1-646 1-515 (515) 69 protein:vir:94073 Length: 494 37.1 1.1 0.00071 20.3 35.2 447 1-647 1-494 (494) No 1 >protein:vir:108052 Length: 660 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595294;genbank:gi:161622600;genbank:GeneID:5783771 Probab=100.00 E-value=2.7e-166 Score=928.33 Aligned_cols=659 Identities=90% Similarity=1.357 Sum_probs=557.5 Q ss_pred CceecCceEEEEecCCCcccccCCcceEEEeecccCCCCccEEeCCHHHHHHHcCCcCCCchhHHHHHHHHHcCCCeEEE Q lcl|NC_014792. 1 MALLSPGIELKETTVQSTVVRNATGRAALVGKFQWGPAFQVTQITNEVELVDLFGGPNNITADYFMSGMNFLQYGNDLRT 80 (659) Q Consensus 1 ~~~~~PGVyveE~~~~~~~~~~~ts~~afvG~~~~Gp~~~p~~i~s~~~~~~~fG~~~~~~~~~~~~~~~f~ngG~~~~v 80 (659) |+|+||||||||++....+.+++||++||||+|+|||+|+|++|+||.||++.||++++.+|++|++++||+|||++||| T Consensus 1 ~~~~~Pgvyv~e~~~~~~i~~~~t~~~~~vg~~~~gp~~~p~~v~s~~~~~~~fg~~~~~~~~~~~~~~~f~~~g~~~~v 80 (660) T protein:vir:10 1 MALLSPGIELKETSVQSTVVRNATGRAALVGKFQWGPAFQVTQITNEVELVDLFGGPNNEVADYFMSGMNFLQYGNDLRT 80 (660) T ss_pred CceecCceEEEeecCCccccCCCcccceEEeecCCCCCccCeEcCCHHHHHHHcCCcCCCchhHHHHHHHHHhCCceEEE Confidence 99999999999998544455667999999999999999999999999999999999999999999999999999999999 Q ss_pred EeccCCcccccccccccccccccccccccccccceeeeeeccccccccceeeeeeccCcceeeeecccccccccccccee Q lcl|NC_014792. 81 VRVVNRDHAKNASPVAGNIESTIATAGSNYAVGDVIQVKHNQTVVETSGRITKVDVDGKILAVFIPSDKIIAFAKSVNQY 160 (659) Q Consensus 81 vRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~ 160 (659) ||+.+.+.++++....+.+..+...++....||+.+++.+..............+.++.....+............+... T Consensus 81 vrv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~v~~~~~~~~~~~~v~~~~a~g~~~~~~~~ta~~~~~a~~v~~~ 160 (660) T protein:vir:10 81 VRVVSREFAKNASPIAGNIETTITTAGSNYAVGDKINIKYNQTVVESEGRVTSVDTDGKILSVFIPSAKIIAYARSLNQY 160 (660) T ss_pred EEecccccccccccccccceeEEeeccccccccceeeEeeccccccccccceeeccccceeeeccccccccccccccccc Confidence 99998877777777777777777777788889999999887776665565666666655444444443333333444444 Q ss_pred eeeccceeeEEEeecCCccccccccceeccccceeeecccccccccccceeecccccccceeeeccccccceeEEEEEee Q lcl|NC_014792. 161 PDLGPAWTAEILTTSSGVSGTITLGKIVTDSGILLTEAENSEEAITSLEFQASLQKYAMPGVVALYPGEIGSTLEVEIVS 240 (659) Q Consensus 161 ~~~~~~~~~~v~~~~~g~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~g~~i~V~v~~ 240 (659) +.....+...+............+...+.+.+...........................+...+.+.+.+++.+.+++.. T Consensus 161 ~~~~~~~~~~~~~~~~~~~~a~sv~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~G~~i~v~i~~ 240 (660) T protein:vir:10 161 PTLGPAWTAEVTSASSGVSGTITVGKIVTDSGILLTEAENSEEAITSLEFQAALKKFAMPGVVALYPGEIGSTLEVEIVS 240 (660) T ss_pred cccccceeEEEecccCccccceeeeeeeccCcceEEeeeccccccccccceeeccccccceeeeecccccCcceeEEEee Confidence 44444444444333333333333444444444333332222222222333333444445556677888999999988866 Q ss_pred cccccccceeeeeeeccccccccceeeeeeeccccccceeeeeccCCceeeeeeeeccccccccccchhhhhhhhhcccc Q lcl|NC_014792. 241 KAAYDVGASKMLDIYPNGGSRASVARAVFNYGPQTDDQYAIIVRRDGAIVENVVLSTKEGDKDVYGNNIYLDDYFAKGTS 320 (659) Q Consensus 241 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~et~~~~~~~~~~~~~~~~~~~~~~~~~~~s 320 (659) ......+........+...+...........++..++.+.+++..++...|++.++.+.+++.......++.+.+.++.+ T Consensus 241 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 320 (660) T protein:vir:10 241 KAAYEAGSSKMLDVYPGGGTRASIAKAVFNYGPQTDDQYAIIVRRDGAIVESVVLSTKEGEKDVYGNNIYLDDYFAKGTS 320 (660) T ss_pred ccccCCcceeEEeeeeccceeeEEeeeecccccccccccccccccCCcccceeeeeccccccccccceeeeehhhcCCCc Confidence 55555555555666655555555555566666677778888999999999999999988888887877778788888888 Q ss_pred cceEEeecccCCccceeEEeecccccccccchhhhhhhHhhhhhcccccceEEEeccccccchhhhHHHHHHHHHHHHhh Q lcl|NC_014792. 321 NYIYATSLNWPKGFAGIINLMGGISANDQVTAGDLMQGWDLFADREALHINLLIAGAVAGEGDATASTVQKHVVSIADER 400 (659) Q Consensus 321 ~~v~~~~~~~~~~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~v~~~l~~~~~~~ 400 (659) .++.+.....+.......++.||.++.+.++..++.+++++++..+.++++++++|+..+.++.+..+|+++|++||+++ T Consensus 321 ~~v~~~~~~~~~~~~~~~~l~gg~~~~~~~t~~~~~~~~~~~~~~~~~~~~~l~~p~~~~~~~~~~~~v~~al~~~~~~~ 400 (660) T protein:vir:10 321 NYIYATSLNWPKGFSGIINLSGGISANDKVTAGDLMQGWDLFADREALHINLLIAGAVAGEGDEVASTVQKHVVSIADER 400 (660) T ss_pred cEEEEEeccCCCCcccceeeeccccCccccccchhhhhhhhhhhhhhcccceEEEcCcCCCchhhhHHHHHHHHHHHHhh Confidence 89998888888888889999999999988899999999999999888999999999999888888999999999999999 Q ss_pred CCEEEEEecCccccccccccCCHHHHHHHhhccccccccccccccceEEEEcCceeEecccCCcceeecHHHHHHHHHHH Q lcl|NC_014792. 401 QDCLAFISPPKGLLVNVPLTRAVDNLIDWRTGGGSFDTDNMNISTTYAAIDGNYKYQYDKYNDVNRWVPLAADMAGLCAR 480 (659) Q Consensus 401 ~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~Ag~~a~ 480 (659) ++||+++|+|++...+....++.+++.+||+..+..+..+++++|+|+++||||++++|+.+++.+++|||+++||+||| T Consensus 401 ~~~~aiid~p~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~sg~~AGl~Ar 480 (660) T protein:vir:10 401 QDCLAFISPPKGLLVNVPLTRAVDNLIDWRTGAGTFDANNMNISTTYAAIDGNYKYQYDKYNDVNRWVPLAADLAGLCAR 480 (660) T ss_pred CCEEEEEecCcccccccccccCHHHHHHHHhhcccccccccccCcceEEEEcCceEEecccCCceeEechhHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhcCCceECcCCcchhheeccccceeecChhHHHhhhhCCceEEEEEeCCCeEEEEcccccCCCccccceeehhhHHHH Q lcl|NC_014792. 481 TDDVSQPWMSPPGYNRGQILNVLKLAIEPRQTQRDRMYQEAINPVVGFAGGDGFVLYGDKTATKVPSPMDHINVRRLTNM 560 (659) Q Consensus 481 ~d~~~g~~~span~~~~~i~g~~~~~~~~~~~e~~~Ln~~gin~i~~~~~~~G~~~wG~rT~~~~~~~~~~i~vrR~~~~ 560 (659) +|.++||||||||++++++.|+.++++.+++.|++.||++|||+|++|++++||++||+||+++++++|+||||||||+| T Consensus 481 ~D~~~g~~~sPan~~~~~i~g~~~~~~~~~~~e~~~Ln~~gIn~i~~~~~~~G~~~wG~rT~~~~~s~~~~i~vrR~~~~ 560 (660) T protein:vir:10 481 TDDVSQPWMSPAGYNRGQILNVLKLAIEPRQAQRDRMYQEAINPVVGFAGGDGFVLFGDKTATKVPSPMDHINVRRLFNM 560 (660) T ss_pred hhccCCcEEccCCeeeceeeccceeeecCChhhHHhHhhCCceEEEEeeCCCcEEEEcccccCCCCcccceEehhhHHHH Confidence 99999999999999999999999999999999999999999999999998789999999999999999999999999999 Q ss_pred HHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhccceeeeEEEEccCCCCHHHhhCCEEEEEEEEEecCCceEEE Q lcl|NC_014792. 561 LKKNIGDASKYKLFELNDNFTRASFRMETSQYLDGIRALGGIYEGRVVCDTTNNTPSVIDRNEFVASIYYKPARSINYIV 640 (659) Q Consensus 561 i~~~i~~~~~~~v~epn~~~l~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~ 640 (659) |+++|+++++|+||||||+.||++|+++|+.||++||++|+|.||+|+||+++||++||++|+|+++|+++|++|||||+ T Consensus 561 i~~si~~~~~~~v~epn~~~l~~~i~~~i~~fL~~l~~~gal~g~~V~~d~~~nt~~di~~G~~~~~i~~~P~~pae~I~ 640 (660) T protein:vir:10 561 LKKNIGDASKYKLFELNDNFTRSSFRMEVSQYLDGIKALGGIYEGRVVCDTTVNTPAVIDRNEFIANIYVKPARSINYIT 640 (660) T ss_pred HHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEcCCCCCHHHhhCCeEEEEEEEEecCCccEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEEeecCeeEEEecCCC Q lcl|NC_014792. 641 LNFVATSTGADFDELIGVQ 659 (659) Q Consensus 641 ~~~~~~~~~~~~~e~~~~~ 659 (659) |||+|+++|++|+||+|.= T Consensus 641 ~~~~~~~~~~~~~e~~~~~ 659 (660) T protein:vir:10 641 LNFVATSTGADFDELIGPL 659 (660) T ss_pred EEEEEeecCccHHHHhhhc Confidence 9999999999999999966 No 2 >protein:vir:103456 Length: 659 # NCBI annotation: tail sheath monomer # Family: family:all:661 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803108;genbank:gi:116326388;genbank:GeneID:4405485 Probab=100.00 E-value=5.9e-164 Score=915.47 Aligned_cols=658 Identities=72% Similarity=1.152 Sum_probs=548.5 Q ss_pred CceecCceEEEEecCCCcccccCCcceEEEeecccCCCCccEEeCCHHHHHHHcCCcCCCchhHHHHHHHHHcCCCeEEE Q lcl|NC_014792. 1 MALLSPGIELKETTVQSTVVRNATGRAALVGKFQWGPAFQVTQITNEVELVDLFGGPNNITADYFMSGMNFLQYGNDLRT 80 (659) Q Consensus 1 ~~~~~PGVyveE~~~~~~~~~~~ts~~afvG~~~~Gp~~~p~~i~s~~~~~~~fG~~~~~~~~~~~~~~~f~ngG~~~~v 80 (659) |+|+||||||||+|+++++++++||++||||+|+|||+|+|++|+||.||++.||++++.+|++|++++||+|||++||| T Consensus 1 ~~~~~PgVyv~e~~~~~~~~~~~ts~~~fvG~~~~Gp~~~p~~i~s~~d~~~~fG~~~~~~~~~~~v~~~f~ngg~~~~v 80 (659) T protein:vir:10 1 MTLLSPGIELKETTVQSTVVNNSTGTAALAGKFQWGPAFQIKQVTNEVDLVNTFGQPTAETADYFMSAMNFLQYGNDLRV 80 (659) T ss_pred CceecCceEEEEecCCceecccCccceEEEecccCCCCCccEEecCHHHHHHHcCCcCCCcchhHHHHHHHhhCCCeEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EeccCCcccccccccccccccccccccccccccceeeeeeccccccccceeeeeeccCcceeeeecccccccccccccee Q lcl|NC_014792. 81 VRVVNRDHAKNASPVAGNIESTIATAGSNYAVGDVIQVKHNQTVVETSGRITKVDVDGKILAVFIPSDKIIAFAKSVNQY 160 (659) Q Consensus 81 vRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~ 160 (659) ||+++.+.+.++....+....+....+.....++.+.+++..........+..++.++......+............... T Consensus 81 vRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~v~~vd~~~~~~~~~i~~~~~~~~~~~~g~~ 160 (659) T protein:vir:10 81 VRAVDRDTAKNSSPIAGNIEYTISTPGSNYAVGDKITVKYVSDAIETEGKITEVDTDGKIKKINIPTAKIIAKAKEVGEY 160 (659) T ss_pred EEccCcccccccccccccceeeEeecccccccccceeeeecCCCccccceeeEEecccccceeeeccccccccccccccc Confidence 99998887777777666666655555555566777777777666666667766666655444443333222222222222 Q ss_pred eeeccceeeEEEeecCCccccccccceeccccceeeecccccccccccceeecccccccceeeeccccccceeEEEEEee Q lcl|NC_014792. 161 PDLGPAWTAEILTTSSGVSGTITLGKIVTDSGILLTEAENSEEAITSLEFQASLQKYAMPGVVALYPGEIGSTLEVEIVS 240 (659) Q Consensus 161 ~~~~~~~~~~v~~~~~g~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~g~~i~V~v~~ 240 (659) ..+.............+............+......+......................+.....+++.+++.+.+++.. T Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~a~t~~~~~~~~~~~~~~~v~a~~~G~~g~~~tv~~~~ 240 (659) T protein:vir:10 161 PTLGSNWTAEISSSSSGLAAVITLGKIITDSGILLAEIENAEAAMTAVDFQANLKKYGIPGVVALYPGELGDKIEIEIVS 240 (659) T ss_pred ceeeeeeeeeeeeeccccceeeEEeeeecCCceeEEeeccccccccccccccceeecccccccccccceecccceEEEec Confidence 22222222222222222222222222333333333333332222333333333344444555667788888888888776 Q ss_pred cccccccceeeeeeeccccccccceeeeeeeccccccceeeeeccCCceeeeeeeeccccccccccchhhhhhhhhcccc Q lcl|NC_014792. 241 KAAYDVGASKMLDIYPNGGSRASVARAVFNYGPQTDDQYAIIVRRDGAIVENVVLSTKEGDKDVYGNNIYLDDYFAKGTS 320 (659) Q Consensus 241 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~et~~~~~~~~~~~~~~~~~~~~~~~~~~~s 320 (659) ...............+...+........+...+..+..+.+.+...+.+.+++.++.+.++........++...+.++.+ T Consensus 241 ~a~~~~~~~v~v~~~~~~~~~a~~~t~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 320 (659) T protein:vir:10 241 KADYAKGASALLPIYPGGGTRASTAKAVFGYGPQTDSQYAIIVRRNDAIVQSVVLSTKRGEKDIYDSNIYIDDFFAKGGS 320 (659) T ss_pred hhhccccceeeeeeeeecccccccceeeeeeccccccchhhccccccceeeeeeeeccccccccccchhhhhhhhccCcc Confidence 66666666666666666666666667777777777777777888888889999888888888888888888888888888 Q ss_pred cceEEeecccCCccceeEEeecccccccccchhhhhhhHhhhhhcccccceEEEeccccccchhhhHHHHHHHHHHHHhh Q lcl|NC_014792. 321 NYIYATSLNWPKGFAGIINLMGGISANDQVTAGDLMQGWDLFADREALHINLLIAGAVAGEGDATASTVQKHVVSIADER 400 (659) Q Consensus 321 ~~v~~~~~~~~~~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~v~~~l~~~~~~~ 400 (659) .++.+.....+.......++.||.++...++..++.++++++...+..+++|+++|++.+....+..+|+.+|++||+++ T Consensus 321 ~~v~~~~~~~~~~~~~~~~l~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~il~~p~~~~~~~~~~~~v~~al~~~~~~~ 400 (659) T protein:vir:10 321 EYIFATAQNWPEGFSGILTLSGGLSSNAEVTAGDLMEAWDFFADRESVDVQLFIAGSCAGESLETASTVQKHVVSIGDAR 400 (659) T ss_pred cEEEEeecccCCCccceeeecccccccccccchhHHHHHHHhhhccccceeEEEecCCCCcchhhhHHHHHHHHHHHHhh Confidence 99988888888888888999999999888899999999999998888999999999998877777889999999999999 Q ss_pred CCEEEEEecCccccccccccCCHHHHHHHhhccccccccccccccceEEEEcCceeEecccCCcceeecHHHHHHHHHHH Q lcl|NC_014792. 401 QDCLAFISPPKGLLVNVPLTRAVDNLIDWRTGGGSFDTDNMNISTTYAAIDGNYKYQYDKYNDVNRWVPLAADMAGLCAR 480 (659) Q Consensus 401 ~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~Ag~~a~ 480 (659) ++||+++|+|....++....++.+++.+||+..+.......+++|+|+++||||++++|+.+++++++|||+++||+||| T Consensus 401 ~~~~~~~d~p~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~s~~~~l~~p~~~~~d~~~~~~~~~p~sg~~AGl~Ar 480 (659) T protein:vir:10 401 QDCLVLCSPPRETVVGIPVTRAVDNLVNWRTAAGSYTDNNFNISSTYAAIDGNYKYQYDKYNDVNRWVPLAADIAGLCAR 480 (659) T ss_pred CCeEEEEcCccccccCCCcccCHHHHHHHHHhcccccccccccCcceEEEEeCcEEEecccCCceEEechHHHHHHHHHH Confidence 99999999999999999999999999999999999988899999999999999999999999999999999999999999 Q ss_pred hhhcCCceECcCCcchhheeccccceeecChhHHHhhhhCCceEEEEEeCCCeEEEEcccccCCCccccceeehhhHHHH Q lcl|NC_014792. 481 TDDVSQPWMSPPGYNRGQILNVLKLAIEPRQTQRDRMYQEAINPVVGFAGGDGFVLYGDKTATKVPSPMDHINVRRLTNM 560 (659) Q Consensus 481 ~d~~~g~~~span~~~~~i~g~~~~~~~~~~~e~~~Ln~~gin~i~~~~~~~G~~~wG~rT~~~~~~~~~~i~vrR~~~~ 560 (659) +|.++||||||+|+++++|.|++++++.+++.|++.||++|||||++|++ +|+++||+||+++++++|+||||||||+| T Consensus 481 ~D~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~-~G~~~wG~rT~~~~~s~~~~i~vrR~~~~ 559 (659) T protein:vir:10 481 TDNVSQTWMSPAGYNRGQILNVIKLAIETRQAQRDRLYQEAINPVTGTGG-DGYVLYGDKTATSVPSPFDRINVRRLFNM 559 (659) T ss_pred HhccCCceEccCCceeeeeeccccceecCCHhHHHHHhhCCeeEEEEeCC-CeEEEEcccccCCCCcccceEehhhHHHH Confidence 99999999999999999999999999999999999999999999999987 79999999999989899999999999999 Q ss_pred HHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhccceeeeEEEEccCCCCHHHhhCCEEEEEEEEEecCCceEEE Q lcl|NC_014792. 561 LKKNIGDASKYKLFELNDNFTRASFRMETSQYLDGIRALGGIYEGRVVCDTTNNTPSVIDRNEFVASIYYKPARSINYIV 640 (659) Q Consensus 561 i~~~i~~~~~~~v~epn~~~l~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~ 640 (659) |+++|+++++|+||||||+.||++|+++|+.||++||++|+|+||+|+||+++||+++|++|+|+++|+|+|++|+|||+ T Consensus 560 i~~si~~~~~~~v~e~n~~~l~~~i~~~i~~fL~~l~~~gal~~~~V~~d~~~nt~~~i~~G~~~~~i~~~p~~pae~i~ 639 (659) T protein:vir:10 560 LKTNIGRSSKYRLFELNNAFTRSSFRTETAQYLQGIKALGGIYEYRVVCDTTNNTPSVIDRNEFVATFYIQPARSINYIT 639 (659) T ss_pred HHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhcCceeeEEEEEcCCCCCHHHhhCCeEEEEEEEEecCCcceEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEEeecCeeEEEecCCC Q lcl|NC_014792. 641 LNFVATSTGADFDELIGVQ 659 (659) Q Consensus 641 ~~~~~~~~~~~~~e~~~~~ 659 (659) |||+|++++++|+|++|.- T Consensus 640 ~~~~~~~~~~~~~e~~~~~ 658 (659) T protein:vir:10 640 LNFVATATGADFDELTGLA 658 (659) T ss_pred EEEEEEecCcchHHhhccC Confidence 9999999999999999999 No 3 >protein:vir:7206 Length: 659 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049780;genbank:gi:9632592;genbank:GeneID:1258597 Probab=100.00 E-value=4.9e-163 Score=910.42 Aligned_cols=658 Identities=71% Similarity=1.136 Sum_probs=544.4 Q ss_pred CceecCceEEEEecCCCcccccCCcceEEEeecccCCCCccEEeCCHHHHHHHcCCcCCCchhHHHHHHHHHcCCCeEEE Q lcl|NC_014792. 1 MALLSPGIELKETTVQSTVVRNATGRAALVGKFQWGPAFQVTQITNEVELVDLFGGPNNITADYFMSGMNFLQYGNDLRT 80 (659) Q Consensus 1 ~~~~~PGVyveE~~~~~~~~~~~ts~~afvG~~~~Gp~~~p~~i~s~~~~~~~fG~~~~~~~~~~~~~~~f~ngG~~~~v 80 (659) |+|++|||||||+|+++++++++|||+||||+|+|||+|+|++|+||.||+++||+++..++++|++++||+|||++||| T Consensus 1 ~~~~~PgVyvee~~~~~~~~~~~ts~~~fvG~~~~Gp~~~p~~i~s~~~~~~~fG~~~~~~~~~~~~~~~f~ngg~~~~v 80 (659) T protein:vir:72 1 MTLLSPGIELKETTVQSTVVNNSTGTAALAGKFQWGPAFQIKQVTNEVDLVNTFGQPTAETADYFMSAMNFLQYGNDLRV 80 (659) T ss_pred CceecCceEEEEecCCcccccCCCcceEEEeecCCCCCcccEEecCHHHHHHHcCCcCCCCchhHHHHHHHHhCCceEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EeccCCcccccccccccccccccccccccccccceeeeeeccccccccceeeeeeccCcceeeeecccccccccccccee Q lcl|NC_014792. 81 VRVVNRDHAKNASPVAGNIESTIATAGSNYAVGDVIQVKHNQTVVETSGRITKVDVDGKILAVFIPSDKIIAFAKSVNQY 160 (659) Q Consensus 81 vRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~ 160 (659) |||++.+.+.++......+.......+.....+.....++..........+...+.++......+..............+ T Consensus 81 vRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~g~v~~~~~~~~~~~~~v~t~~~~a~~~~~~~~ 160 (659) T protein:vir:72 81 VRAVDRDTAKNSSPIAGNIDYTISTPGSNYAVGDKITVKYVSDDIETEGKITEVDADGKIKKINIPTGKNYAKAKEVGEY 160 (659) T ss_pred EEccCCcccccccccccccceeecccccccccceeeeeeeccccccccceEEEeeccccceeeeeccccccccccccccc Confidence 99998877777776666665555555555556666666666666666666666555554433333322222222222222 Q ss_pred eeeccceeeEEEeecCCccccccccceeccccceeeecccccccccccceeecccccccceeeeccccccceeEEEEEee Q lcl|NC_014792. 161 PDLGPAWTAEILTTSSGVSGTITLGKIVTDSGILLTEAENSEEAITSLEFQASLQKYAMPGVVALYPGEIGSTLEVEIVS 240 (659) Q Consensus 161 ~~~~~~~~~~v~~~~~g~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~g~~i~V~v~~ 240 (659) ................+......+.....+.................................+..++.+++.+.+.+.. T Consensus 161 ~~~~~~~~~~~~~~~~~~a~~~~~v~v~~~~~~~~~~v~~~~~a~~~~~~~~~v~~~~~~~~~a~~~gt~g~~~tv~i~~ 240 (659) T protein:vir:72 161 PTLGSNWTAEISSSSSGLAAVITLGKIITDSGILLAEIENAEAAMTAVDFQANLKKYGIPGVVALYPGELGDKIEIEIVS 240 (659) T ss_pred cccccceeeEEeeccccccceEEEEEeecCcceeeeeccccchhhhcccccccccccccceeeeccccccccceeEEEcc Confidence 22222222222222222223333333333333333333333332233333333333344445566778888888888766 Q ss_pred cccccccceeeeeeeccccccccceeeeeeeccccccceeeeeccCCceeeeeeeeccccccccccchhhhhhhhhcccc Q lcl|NC_014792. 241 KAAYDVGASKMLDIYPNGGSRASVARAVFNYGPQTDDQYAIIVRRDGAIVENVVLSTKEGDKDVYGNNIYLDDYFAKGTS 320 (659) Q Consensus 241 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~et~~~~~~~~~~~~~~~~~~~~~~~~~~~s 320 (659) ...............+...........+....+..+..+.+.+...+...+.+.++...++........++...+.++.+ T Consensus 241 ~~~~~~~~~~~v~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 320 (659) T protein:vir:72 241 KADYAKGASALLPIYPGGGTRASTAKAVFGYGPQTDSQYAIIVRRNDAIVQSVVLSTKRGEKDIYDSNIYIDDFFAKGGS 320 (659) T ss_pred ccccccceeeeeecccccccccccceeeeeeecccccccceeeecccceeeeeeeeeccccccccchhhhhhhhhhcCCc Confidence 66555555555665666666666666777777777777777888888889999888888888888888888888888888 Q ss_pred cceEEeecccCCccceeEEeecccccccccchhhhhhhHhhhhhcccccceEEEeccccccchhhhHHHHHHHHHHHHhh Q lcl|NC_014792. 321 NYIYATSLNWPKGFAGIINLMGGISANDQVTAGDLMQGWDLFADREALHINLLIAGAVAGEGDATASTVQKHVVSIADER 400 (659) Q Consensus 321 ~~v~~~~~~~~~~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~v~~~l~~~~~~~ 400 (659) .++.+.....+.......++.||.++...++..++.++++++...+..+++||++|++.+....+..+|+++|++||+++ T Consensus 321 ~~v~~~~~~~~~~~~~~~~l~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~p~~~~~~~~~~~~v~~~l~~~~~~~ 400 (659) T protein:vir:72 321 EYIFATAQNWPEGFSGILTLSGGLSSNAEVTAGDLMEAWDFFADRESVDVQLFIAGSCAGESLETASTVQKHVVSIGDAR 400 (659) T ss_pred eEEEEEecccCCcccccccccccccccccccchhHHHHHHHhhhccccceeEEEecCCCCcchhhhHHHHHHHHHHHhhh Confidence 99988888888888888999999999888889999999999998888999999999998877777889999999999999 Q ss_pred CCEEEEEecCccccccccccCCHHHHHHHhhccccccccccccccceEEEEcCceeEecccCCcceeecHHHHHHHHHHH Q lcl|NC_014792. 401 QDCLAFISPPKGLLVNVPLTRAVDNLIDWRTGGGSFDTDNMNISTTYAAIDGNYKYQYDKYNDVNRWVPLAADMAGLCAR 480 (659) Q Consensus 401 ~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~Ag~~a~ 480 (659) ++||+++|+|.....+.....+.+++.+||+..+.......+++|+|+++||||++++|+.+++++++|||+++||+||| T Consensus 401 ~~~~~~~d~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~sg~vAGl~Ar 480 (659) T protein:vir:72 401 QDCLVLCSPPRETVVGIPVTRAVDNLVNWRTAAGSYTDNNFNISSTYAAIDGNHKYQYDKYNDVNRWVPLAADIAGLCAR 480 (659) T ss_pred CCEEEEEcCccccccCCCcccCHHHHHHHHhhccccccccccccceeEEEEcCceeeccccCCceEEechHHHHHHHHHH Confidence 99999999999988888888999999999999999888889999999999999999999999999999999999999999 Q ss_pred hhhcCCceECcCCcchhheeccccceeecChhHHHhhhhCCceEEEEEeCCCeEEEEcccccCCCccccceeehhhHHHH Q lcl|NC_014792. 481 TDDVSQPWMSPPGYNRGQILNVLKLAIEPRQTQRDRMYQEAINPVVGFAGGDGFVLYGDKTATKVPSPMDHINVRRLTNM 560 (659) Q Consensus 481 ~d~~~g~~~span~~~~~i~g~~~~~~~~~~~e~~~Ln~~gin~i~~~~~~~G~~~wG~rT~~~~~~~~~~i~vrR~~~~ 560 (659) +|.++|+||||+|+++++|.|++++++.+++.|++.||++|||||++|++ +|+++||+||+++++++|+||||||||+| T Consensus 481 ~D~~~G~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~g-~G~~~wG~rT~~~~~s~~~~i~vrR~~~~ 559 (659) T protein:vir:72 481 TDNVSQTWMSPAGYNRGQILNVIKLAIETRQAQRDRLYQEAINPVTGTGG-DGYVLYGDKTATSVPSPFDRINVRRLFNM 559 (659) T ss_pred hhccCCcEEccCCeeeceeeccccccccCChhHHHHHhhCCceEEEEecC-CeEEEEcccccCCCCcccceEeehhHHHH Confidence 99999999999999999999999999999999999999999999999997 79999999999999889999999999999 Q ss_pred HHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhccceeeeEEEEccCCCCHHHhhCCEEEEEEEEEecCCceEEE Q lcl|NC_014792. 561 LKKNIGDASKYKLFELNDNFTRASFRMETSQYLDGIRALGGIYEGRVVCDTTNNTPSVIDRNEFVASIYYKPARSINYIV 640 (659) Q Consensus 561 i~~~i~~~~~~~v~epn~~~l~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~ 640 (659) |+++|+++++|+||||||+.||++|+++|++||++||++|+|.||+|+||+++||++||++|+|+++|+|+|++|+|||+ T Consensus 560 i~~si~~~~~~~v~e~n~~~l~~~i~~~i~~fL~~l~~~gal~~~~V~~d~~~nt~~~i~~G~~~~~i~~~p~~pae~I~ 639 (659) T protein:vir:72 560 LKTNIGRSSKYRLFELNNAFTRSSFRTETAQYLQGNKALGGIYEYRVVCDTTNNTPSVIDRNEFVATFYIQPARSINYIT 639 (659) T ss_pred HHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHHHHHHHhcCceeeEEEEEcCCCCCHHHhhCCeEEEEEEEEecCCccEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEEeecCeeEEEecCCC Q lcl|NC_014792. 641 LNFVATSTGADFDELIGVQ 659 (659) Q Consensus 641 ~~~~~~~~~~~~~e~~~~~ 659 (659) |||+|+++|++|+||.|-- T Consensus 640 ~~~~~~~~~~~~~e~~~~~ 658 (659) T protein:vir:72 640 LNFVATATGADFDELTGLA 658 (659) T ss_pred EEEEEeecCcchHHhcccC Confidence 9999999999999999999 No 4 >protein:vir:101187 Length: 663 # NCBI annotation: tail sheath protein # Family: family:all:661 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932509;genbank:gi:37651635;genbank:GeneID:2610680 Probab=100.00 E-value=1.7e-162 Score=907.51 Aligned_cols=657 Identities=64% Similarity=1.066 Sum_probs=552.9 Q ss_pred CceecCceEEEEecCCCcccccCCcceEEEeecccCCCCccEEeCCHHHHHHHcCCcCCCchhHHHHHHHHHcCCCeEEE Q lcl|NC_014792. 1 MALLSPGIELKETTVQSTVVRNATGRAALVGKFQWGPAFQVTQITNEVELVDLFGGPNNITADYFMSGMNFLQYGNDLRT 80 (659) Q Consensus 1 ~~~~~PGVyveE~~~~~~~~~~~ts~~afvG~~~~Gp~~~p~~i~s~~~~~~~fG~~~~~~~~~~~~~~~f~ngG~~~~v 80 (659) |+|++|||||||++....+.+++||++||||+|+|||+|+|++|+||.||++.||++.+.+|++|++++||+|||++||| T Consensus 1 ~~~~~PgVyv~e~~~~~~i~~v~ts~~~fvG~~~~Gp~~~p~~i~s~~d~~~~fG~~~~~~~~~~~v~~~f~ngg~~~~v 80 (663) T protein:vir:10 1 MALLSPGIEMKETSINSTVVRSATGRAAIVGKFAWGPAYEVRQVTNEVELVDMFGSPDNVTAPYFMSAMNFLQYGNDLRL 80 (663) T ss_pred CceecCceEEEEecCcccccccCccceeEEeeeccCCCCccEEecCHHHHHHHhCCcCccchhHHHHHHHHHhCCCeEEE Confidence 99999999999997444566778999999999999999999999999999999999999999999999999999999999 Q ss_pred EeccCCcccccccccccccccccccccccccccceeeeeeccccccccceeeeeeccCcceeeeecccccccccccccee Q lcl|NC_014792. 81 VRVVNRDHAKNASPVAGNIESTIATAGSNYAVGDVIQVKHNQTVVETSGRITKVDVDGKILAVFIPSDKIIAFAKSVNQY 160 (659) Q Consensus 81 vRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~ 160 (659) |||.+.+.++++..+.+....+....+....+|+.+++................++++.....+............+... T Consensus 81 vRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~v~~~~~~~~~~~~~~~~~~n~~~~~v~~~~a~~~~~~~~v~~~ 160 (663) T protein:vir:10 81 VRVIDMEKAKNASPLVNQVSVTITTEGQGYTVGDAITVKYNNATITEAGKVTAVDSDGKIKSLFVPTAEIIAKTRQLGTY 160 (663) T ss_pred EEccCCcccccccccCCcceeeeeccccCccccccccccccccccccccceeeeccCCceEEEEecccccccccccccee Confidence 99998877777776666666666666777788999988877766666677777888776666665555444444445555 Q ss_pred eeeccceeeEEEeecCCccccccccceeccccceeeecccccccccccceeecccccccceeeeccccccceeEEEEEee Q lcl|NC_014792. 161 PDLGPAWTAEILTTSSGVSGTITLGKIVTDSGILLTEAENSEEAITSLEFQASLQKYAMPGVVALYPGEIGSTLEVEIVS 240 (659) Q Consensus 161 ~~~~~~~~~~v~~~~~g~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~g~~i~V~v~~ 240 (659) ......+.........+......+..++.+.+...........................+...+.++|.+|+.+.+++.. T Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~v~~vv~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~a~~~G~~Gn~i~v~i~~ 240 (663) T protein:vir:10 161 PTLGDNWRIDVSGASGGSAAALALGNIVVDSGVTFGNSEDAPAVMTSPAVMEKYAKFGMPLVSAVYPGEIGSTVEVEIVS 240 (663) T ss_pred eeccccceeEeeeccccccccccccceecccceeeEeeccccccccccchhhhcccccceeeeeecccccccceeEEecc Confidence 55554444444444444434444444555554444344444444444444444445555667788999999999988765 Q ss_pred cccccccceeeeeeeccccccccceeeeeeeccccccceeeeeccCCceeeeeeeeccccccccccchhhhhhhhhcccc Q lcl|NC_014792. 241 KAAYDVGASKMLDIYPNGGSRASVARAVFNYGPQTDDQYAIIVRRDGAIVENVVLSTKEGDKDVYGNNIYLDDYFAKGTS 320 (659) Q Consensus 241 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~et~~~~~~~~~~~~~~~~~~~~~~~~~~~s 320 (659) .... .......+.+..+.............+..+..+.+++..++...+.+.++...+++...+...++...+.++.+ T Consensus 241 ~~~~--~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~~~~~~~~~~~~~~~~~~ 318 (663) T protein:vir:10 241 KTAF--NSGAQQTIYPFGGTRTSNARGVIQYGPMTDDQFAIIVRRDGIVVESTVLSTRKGDRDVYGSNIFMDDYFRNGGS 318 (663) T ss_pred cccc--cccccccccccccccccccceeeeeccccccceeEEEecCCcceeeeeeeecccccccchhhhhhhhhhccCcc Confidence 4333 22233444555566666666777777788888888888899888888888888888888888888888888888 Q ss_pred cceEEeecccCCccceeEEeecccccccccchhhhhhhHhhhhhcccccceEEEeccccccchhhhHHHHHHHHHHHHhh Q lcl|NC_014792. 321 NYIYATSLNWPKGFAGIINLMGGISANDQVTAGDLMQGWDLFADREALHINLLIAGAVAGEGDATASTVQKHVVSIADER 400 (659) Q Consensus 321 ~~v~~~~~~~~~~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~v~~~l~~~~~~~ 400 (659) .++.+.....+......+++.||.|+...++..++.++++++...+.++++++++|........+..+|+.+|++||+++ T Consensus 319 ~~~~~~~~~~~~~~~~~~~l~gg~d~~~~~~~~~~~~~~~~l~~~~~~~~~~~i~~~~~~~~~~~~~~v~~al~~~a~~~ 398 (663) T protein:vir:10 319 NFIFASSEGWPAGFTGIIQLGGGTSANADVGADELIKGWDLFSDREALHVNLMIAGACGSDGAEIASTVQKYVVSLADDR 398 (663) T ss_pred eEEEEeecccCccccceeEcccccCCCccccchhhHHHHHhhhcccccceeEEEeccCCCCchhhHHHHHHHHHHHHHhh Confidence 88888888888878888999999999888889999999999988888889999998777666677889999999999999 Q ss_pred CCEEEEEecCccccccccccCCHHHHHHHhhccccc---cccccccccceEEEEcCceeEecccCCcceeecHHHHHHHH Q lcl|NC_014792. 401 QDCLAFISPPKGLLVNVPLTRAVDNLIDWRTGGGSF---DTDNMNISTTYAAIDGNYKYQYDKYNDVNRWVPLAADMAGL 477 (659) Q Consensus 401 ~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~Ag~ 477 (659) ++||+|+|+|.++........+.+++.+||+..... ...+.+++|+|+++||||++++|+.+++.+++|||+++||+ T Consensus 399 ~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~l~~P~~~~~d~~~~~~~~~p~s~~vAGl 478 (663) T protein:vir:10 399 QDCVAIVNPPAELMVGIPTSTAVKNIVEWRNGMTGSGEVVDNNMNISSTYAFIIGNYKYQYDKYNDINRWVPLAADIAGL 478 (663) T ss_pred CCEEEEEecCcccccccccccchHHHHHHHHhccccccchhhccccCcceEEEEccceEEecccCCceEEechhHHHHHH Confidence 999999999999988888888999999999875432 23345789999999999999999999999999999999999 Q ss_pred HHHhhhcCCceECcCCcchhheeccccceeecChhHHHhhhhCCceEEEEEeCCCeEEEEcccccCCCccccceeehhhH Q lcl|NC_014792. 478 CARTDDVSQPWMSPPGYNRGQILNVLKLAIEPRQTQRDRMYQEAINPVVGFAGGDGFVLYGDKTATKVPSPMDHINVRRL 557 (659) Q Consensus 478 ~a~~d~~~g~~~span~~~~~i~g~~~~~~~~~~~e~~~Ln~~gin~i~~~~~~~G~~~wG~rT~~~~~~~~~~i~vrR~ 557 (659) |||+|.++|+||||||+++++|.|++++++.+++.|++.||++|||||++|++++|+++||+||+++++++|+|||+||| T Consensus 479 ~Ar~D~~~g~~~sPan~~~~~i~g~~~~~~~~~~~~~~~Ln~~gin~i~~~~~~~G~~~wG~rT~s~~~s~~~~i~vrR~ 558 (663) T protein:vir:10 479 CAYTDQVSHPWMSPAGYRRGQIRNCIKLAIEPKQSMRDTMYQVAINPVTGFAGGDGFVLFGDKMATQVPSPFDRINVRRL 558 (663) T ss_pred HHHhhccCCceEccCCceeccccccccceeccChhHHHHHhhCCceEEEEEeCCCcEEEEcccccCCCCcccceEehhhH Confidence 99999999999999999999999999999999999999999999999999998789999999999999899999999999 Q ss_pred HHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhccceeeeEEEEccCCCCHHHhhCCEEEEEEEEEecCCce Q lcl|NC_014792. 558 TNMLKKNIGDASKYKLFELNDNFTRASFRMETSQYLDGIRALGGIYEGRVVCDTTNNTPSVIDRNEFVASIYYKPARSIN 637 (659) Q Consensus 558 ~~~i~~~i~~~~~~~v~epn~~~l~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e 637 (659) |+||+++|+++++|+||||||+.||++|+++|+.||++||++|+|+||+|+||+++||+++|++|+|+++|+|+|++|+| T Consensus 559 ~~~i~~si~~~~~~~v~e~n~~~l~~~i~~~i~~~L~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~~i~~~p~~pae 638 (663) T protein:vir:10 559 FNMLKKNIGDTSKYELFENNDAFTRQSFRMETSQYLDGIRSLGGCYDFRVVCDTTNNTPNVIDRNEFVGTIYVKPPRSIN 638 (663) T ss_pred HHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhcCceeeeEEEEcCCCCCHHHhhCCeEEEEEEEEecCCcc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEEEEEeecCeeEEEecCC-C Q lcl|NC_014792. 638 YIVLNFVATSTGADFDELIGV-Q 659 (659) Q Consensus 638 ~i~~~~~~~~~~~~~~e~~~~-~ 659 (659) ||+|||+|++++++|+|++|- | T Consensus 639 ~i~~~~~~~~~~~~~~e~~~~~~ 661 (663) T protein:vir:10 639 YITLNMVATSTGANFDELIGPMQ 661 (663) T ss_pred eEEEEEEEeecCccHHHHHHHHh Confidence 999999999999999999984 4 No 5 >protein:vir:6894 Length: 660 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861870;genbank:gi:32453661;genbank:GeneID:1494296 Probab=100.00 E-value=2.1e-162 Score=906.97 Aligned_cols=658 Identities=71% Similarity=1.157 Sum_probs=542.8 Q ss_pred CceecCceEEEEecCCCcccccCCcceEEEeecccCCCCccEEeCCHHHHHHHcCCcCCCchhHHHHHHHHHcCCCeEEE Q lcl|NC_014792. 1 MALLSPGIELKETTVQSTVVRNATGRAALVGKFQWGPAFQVTQITNEVELVDLFGGPNNITADYFMSGMNFLQYGNDLRT 80 (659) Q Consensus 1 ~~~~~PGVyveE~~~~~~~~~~~ts~~afvG~~~~Gp~~~p~~i~s~~~~~~~fG~~~~~~~~~~~~~~~f~ngG~~~~v 80 (659) |+|+||||||||+++...+.+++||++||||+|+|||+|+|++|+||.||+|.||++++.++++|++++||+|||++||| T Consensus 1 ~~~~~PgVyv~e~~~~~~i~~v~ts~~~fvG~~~~Gp~~~p~~i~s~~~~~~~fG~~~~~~~~~~~~~~~f~~~g~~~~v 80 (660) T protein:vir:68 1 MALLSPGVELKETTVQSTVVNNSTGTAALAGKFQWGPAFQIKQITDEVALVDMFGTPNTDTADYFMSAMNFLQYGNDLRV 80 (660) T ss_pred CccccCceEEEEecCCcccccCCCcceeEEecccCCCCccCEEecCHHHHHHhcCCccCccchhHHHHHHHHhCCCeEEE Confidence 99999999999997555566778999999999999999999999999999999999999999999999999999999999 Q ss_pred EeccCCcccccccccccccccccccccccccccceeeeeeccccccccceeeeeeccCcceeeeecccccccccccccee Q lcl|NC_014792. 81 VRVVNRDHAKNASPVAGNIESTIATAGSNYAVGDVIQVKHNQTVVETSGRITKVDVDGKILAVFIPSDKIIAFAKSVNQY 160 (659) Q Consensus 81 vRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~ 160 (659) ||+++.+.++++....+.+..+....+....+++.++++..............++..+.....+................ T Consensus 81 vRv~~~~~~~~~~~~~~~~~~t~~~~g~~~~~g~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~ta~~~~~a~~~~~~ 160 (660) T protein:vir:68 81 VRAVDRDTAKNSSPVAGNINFTISSAGTNYRVGDKVVVKYSTDIIEPDGEVTSVDSDGKILNIFIPSGKIIAKAKEIGEY 160 (660) T ss_pred EEecccccccccccccccceeeeeccCcceeeeeeeeeecccccccccccceeeeecCceeeeeeccccccccceeeccc Confidence 99998777777776666666666666777788988888877766666666666666666555444333222222222222 Q ss_pred eeeccceeeEEEeecCCccccccccceeccccceeeecccccccccccceeecccccccceeeeccccccceeEEEEEee Q lcl|NC_014792. 161 PDLGPAWTAEILTTSSGVSGTITLGKIVTDSGILLTEAENSEEAITSLEFQASLQKYAMPGVVALYPGEIGSTLEVEIVS 240 (659) Q Consensus 161 ~~~~~~~~~~v~~~~~g~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~g~~i~V~v~~ 240 (659) .......................+.+.+.+.+.........+.....................+.+.+.+|+.+.+++.. T Consensus 161 ~~~~~~~~~~v~~~~~~~~~~~~v~~~~~d~~~~~~~~~ta~~~~~~~~~~~~~~~~~~~~~~A~~~g~~G~~i~v~~~~ 240 (660) T protein:vir:68 161 PELGSNWTAEMSGSSSGLSAVITIDSVVMDSGILLTEVETSEEAITSLTFQESIKKYGVPGVVALYPGELGDQLEIEIVS 240 (660) T ss_pred cccccceeEEeecccccceeeeeeccccccccceeeeeccccccccccceeeeecccCccccccccccccccceEEEEec Confidence 22222222222222222222223333334444333333333332222332333333333445567788899999888766 Q ss_pred cccccccceeeeeeeccccccccceeeeeeeccccccceeeeeccCCceeeeeeeeccccccccccchhhhhhhhhcccc Q lcl|NC_014792. 241 KAAYDVGASKMLDIYPNGGSRASVARAVFNYGPQTDDQYAIIVRRDGAIVENVVLSTKEGDKDVYGNNIYLDDYFAKGTS 320 (659) Q Consensus 241 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~et~~~~~~~~~~~~~~~~~~~~~~~~~~~s 320 (659) .....................+......+...+..+..+.+.+..++..++++.++...+.....+...++...+.++.+ T Consensus 241 ~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 320 (660) T protein:vir:68 241 KADYDKGASAQLKIYPDGGTRYSTAKAIFGYGPQTDDQYAIIVRRNDSVVQSVVLSTKRGERDIYGSNIFIDDFFAKGAS 320 (660) T ss_pred cccccccccccceeeecccccccceeeEeecccccccceeeeeecCCcceeeeeeecccccccccccceeeehhhccCcc Confidence 65555444444444445555566666666666777778888888999999999988887777777777777777778888 Q ss_pred cceEEeecccCCccceeEEeecccccccccchhhhhhhHhhhhhcccccceEEEeccccccchhhhHHHHHHHHHHHHhh Q lcl|NC_014792. 321 NYIYATSLNWPKGFAGIINLMGGISANDQVTAGDLMQGWDLFADREALHINLLIAGAVAGEGDATASTVQKHVVSIADER 400 (659) Q Consensus 321 ~~v~~~~~~~~~~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~v~~~l~~~~~~~ 400 (659) .++.+.....+.......++.||.++...++.+++.++++++...+.+++.++++++...+...+..+|+.+|++||+++ T Consensus 321 ~~v~~~~~~~~~~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~v~~~l~~~~~~~ 400 (660) T protein:vir:68 321 NYIFATAQGWPKGFSGVIKLNGGLSSNETVEAGDLMEAWDLFADRESVNAQLFIAGSCAGESLEVASTVQKHVVAIGDSR 400 (660) T ss_pred cEEEEeecCCCccccceeeeccccccccccccchhhhHHHHhhhhhccccceeeccccCCCchHHHHHHHHHHHHHHHhh Confidence 88888888888888888899999999888888899999999988888888999888888777778889999999999999 Q ss_pred CCEEEEEecCccccccccccCCHHHHHHHhhccccccccccccccceEEEEcCceeEecccCCcceeecHHHHHHHHHHH Q lcl|NC_014792. 401 QDCLAFISPPKGLLVNVPLTRAVDNLIDWRTGGGSFDTDNMNISTTYAAIDGNYKYQYDKYNDVNRWVPLAADMAGLCAR 480 (659) Q Consensus 401 ~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~Ag~~a~ 480 (659) ++||+++|+|..+..+.+..++.+++.+||+..+.....+.+++|+|+++||||++++|+.+++.+++|||+++||+||| T Consensus 401 ~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~sg~~AGl~Ar 480 (660) T protein:vir:68 401 QDCLVLCSPPRAAVVGIPVNRAVDNLVDWRTASGTYTDNNFNISSTYAAIDGNYKYQYDKYNDVNRWVPLAADIAGLCAR 480 (660) T ss_pred CCeEEEEcccceeEecCCCCCCHHHHHHHHhhcccccccccccCcceEEEEcCceEEecccCCceEEechhHHHHHHHHH Confidence 99999999999999999989999999999999999888889999999999999999999999999999999999999999 Q ss_pred hhhcCCceECcCCcchhheeccccceeecChhHHHhhhhCCceEEEEEeCCCeEEEEcccccCCCccccceeehhhHHHH Q lcl|NC_014792. 481 TDDVSQPWMSPPGYNRGQILNVLKLAIEPRQTQRDRMYQEAINPVVGFAGGDGFVLYGDKTATKVPSPMDHINVRRLTNM 560 (659) Q Consensus 481 ~d~~~g~~~span~~~~~i~g~~~~~~~~~~~e~~~Ln~~gin~i~~~~~~~G~~~wG~rT~~~~~~~~~~i~vrR~~~~ 560 (659) +|.++||||||+|+++.+|.|++++++.+++.|++.||++|||+|++|++ +|+++||+||+++++++|+||||||||+| T Consensus 481 ~d~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~-~G~~~wG~rT~~~~~s~~~~i~vrR~~~~ 559 (660) T protein:vir:68 481 TDNISQPWMSPAGYNRGQILNVIKLAIETRQAQRDRLYQEAINPVTGTGG-DGYVLYGDKTATSVPSPFDRINVRRLFNM 559 (660) T ss_pred HhccCCcEEccCCeeeceeeccceeeecCChhHHHHHhhCCceEEEEecC-CeEEEEcceecCCCCcccceEehhhHHHH Confidence 99999999999999999999999999999999999999999999999987 79999999999999999999999999999 Q ss_pred HHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhccceeeeEEEEccCCCCHHHhhCCEEEEEEEEEecCCceEEE Q lcl|NC_014792. 561 LKKNIGDASKYKLFELNDNFTRASFRMETSQYLDGIRALGGIYEGRVVCDTTNNTPSVIDRNEFVASIYYKPARSINYIV 640 (659) Q Consensus 561 i~~~i~~~~~~~v~epn~~~l~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~ 640 (659) |+++|+++++|+||||||+.||++|+++|++||++||++|+|+||+|+||+++||+++|++|+|+++|+|+|++|+|||+ T Consensus 560 i~~si~~~~~~~v~epn~~~~~~~i~~~i~~~L~~l~~~gal~gf~V~~d~~~nt~~~i~~G~~~~~i~~~p~~pae~i~ 639 (660) T protein:vir:68 560 VKTNIGSASKYRLFELNNAFTRSSFRTETSQYLQGIKALGGVYNFKVVCDTTNNTPAVIDRNEFVATFYLQPARSINYIT 639 (660) T ss_pred HHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhcCceeeeEEEEecCCCCHHHhhCCeEEEEEEEEecCCcceEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEEeecCeeEEEecCCC Q lcl|NC_014792. 641 LNFVATSTGADFDELIGVQ 659 (659) Q Consensus 641 ~~~~~~~~~~~~~e~~~~~ 659 (659) |||+|++++++|+|++|.= T Consensus 640 l~~~~~~~~~~~~e~~~~v 658 (660) T protein:vir:68 640 LNFVATATGADFDELIGAV 658 (660) T ss_pred EEEEEeecCccHHHHHHhh Confidence 9999999999999998877 No 6 >protein:vir:101804 Length: 663 # NCBI annotation: gp18 # Family: family:all:661 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238881;genbank:gi:66391956;genbank:GeneID:3416631 Probab=100.00 E-value=7.2e-162 Score=904.02 Aligned_cols=657 Identities=64% Similarity=1.070 Sum_probs=551.6 Q ss_pred CceecCceEEEEecCCCcccccCCcceEEEeecccCCCCccEEeCCHHHHHHHcCCcCCCchhHHHHHHHHHcCCCeEEE Q lcl|NC_014792. 1 MALLSPGIELKETTVQSTVVRNATGRAALVGKFQWGPAFQVTQITNEVELVDLFGGPNNITADYFMSGMNFLQYGNDLRT 80 (659) Q Consensus 1 ~~~~~PGVyveE~~~~~~~~~~~ts~~afvG~~~~Gp~~~p~~i~s~~~~~~~fG~~~~~~~~~~~~~~~f~ngG~~~~v 80 (659) |+|+||||||||++....+.+++||++||||+|+|||+|+|++|+||.||++.||++++.+|++|++++||+|||++||| T Consensus 1 ~~~~~Pgvyv~e~~~~~~i~~~~t~~~~~vG~~~~Gp~~~p~~v~~~~~~~~~fg~~~~~~~~~~~~~~~f~ngg~~~~v 80 (663) T protein:vir:10 1 MALLSPGIEMKETSINSTVVRSATGRAAIVGKFAWGPAYEVRQVTNEVELVDMFGSPDNVTAPYFMSAMNFLQYGNDLRL 80 (663) T ss_pred CceecCceEEEEecCCccccccCcccceeEeecccCCCCccEEecCHHHHHHhcCCcCCcchhHHHHHHHHHhCCCeEEE Confidence 99999999999997444455678999999999999999999999999999999999999999999999999999999999 Q ss_pred EeccCCcccccccccccccccccccccccccccceeeeeeccccccccceeeeeeccCcceeeeecccccccccccccee Q lcl|NC_014792. 81 VRVVNRDHAKNASPVAGNIESTIATAGSNYAVGDVIQVKHNQTVVETSGRITKVDVDGKILAVFIPSDKIIAFAKSVNQY 160 (659) Q Consensus 81 vRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~ 160 (659) |||.+.+.++++....+....+....+....+|+.++++...........+...++++.....+............+... T Consensus 81 vRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~i~v~~~~~~~~~~~~~~~~~~~~~~~~v~~~ta~~~~~~~~v~~~ 160 (663) T protein:vir:10 81 VRVIDMEKAKNASPLVNQVSVTITTEGQGYTVGDAITVKYNNATITEAGKVTAVDSDGKIKSLFVPTAEIIAKTRQLGTY 160 (663) T ss_pred EEccCCccccccccccccceeEEeecccccccccccccccccccccccccceeeecccceEEEeeccccccccccccccc Confidence 99998877777777777766666666777789999988777666666666677777766666555555444444455555 Q ss_pred eeeccceeeEEEeecCCccccccccceeccccceeeecccccccccccceeecccccccceeeeccccccceeEEEEEee Q lcl|NC_014792. 161 PDLGPAWTAEILTTSSGVSGTITLGKIVTDSGILLTEAENSEEAITSLEFQASLQKYAMPGVVALYPGEIGSTLEVEIVS 240 (659) Q Consensus 161 ~~~~~~~~~~v~~~~~g~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~g~~i~V~v~~ 240 (659) +........+.............+..++.+.+.............+.............+.+.+.+.|.+|+.++|++.. T Consensus 161 ~~~~~~~~~~~s~~s~~~~~a~~v~~v~~d~~~~v~~~~~a~~~~t~~~~~~~~~~~~~~~i~A~~~G~~Gn~i~V~i~~ 240 (663) T protein:vir:10 161 PTLGDNWRIDVSGASGGSAAALALGNIVVDSGVTFGNSEDAPAVMTSPAVMEKYAKFGMPLISAVYPGEIGSTVEVEIVS 240 (663) T ss_pred eeeccceeeEeeeccCccccccccceeccccceEEeeccccccccccccccccccccccceEEeccCCcccceeeeeecc Confidence 55555555554444444444444444555555544444433333444444444444555667778899999999988765 Q ss_pred cccccccceeeeeeeccccccccceeeeeeeccccccceeeeeccCCceeeeeeeeccccccccccchhhhhhhhhcccc Q lcl|NC_014792. 241 KAAYDVGASKMLDIYPNGGSRASVARAVFNYGPQTDDQYAIIVRRDGAIVENVVLSTKEGDKDVYGNNIYLDDYFAKGTS 320 (659) Q Consensus 241 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~et~~~~~~~~~~~~~~~~~~~~~~~~~~~s 320 (659) ....... ......+................+..++.+.+++..++...|.+.++...++....+...++...+.++.+ T Consensus 241 ~~~~~~~--~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~~~~~~~~~~~~~~~~~~ 318 (663) T protein:vir:10 241 KTAFNSG--AQQTIYPFGGTRTSNARGVIQYGPMTDDQFAIIVRRDGIVVESTVLSTRKGDRDVYGSNIFMDDYFRNGGS 318 (663) T ss_pred ccccccc--cccceecccccccccccceeecccccccceeeEeecCCcceeeecccccccccccccchhhhhhhhcCCcc Confidence 4433322 22334445555556666667777778888888888899999999888888888888888888888888888 Q ss_pred cceEEeecccCCccceeEEeecccccccccchhhhhhhHhhhhhcccccceEEEeccccccchhhhHHHHHHHHHHHHhh Q lcl|NC_014792. 321 NYIYATSLNWPKGFAGIINLMGGISANDQVTAGDLMQGWDLFADREALHINLLIAGAVAGEGDATASTVQKHVVSIADER 400 (659) Q Consensus 321 ~~v~~~~~~~~~~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~v~~~l~~~~~~~ 400 (659) .++.+.....+......++++||.|+...++..++.++++++...+...++++++|...........+|+.+|++||+++ T Consensus 319 ~~~~~~~~~~~~~~~~~~~l~gg~d~~~~~~~~~~~~~~~~l~~~~~~~~~~~i~~~~~~~~~~~~~~v~~~l~~~a~~~ 398 (663) T protein:vir:10 319 NFIFASSEGWPAGFTGIIQLGGGTSANADVGADELIKGWDLFSDREALHVNLMIAGACGSDGAEIASTVQKYVVSLADDR 398 (663) T ss_pred eEEEEeecccCccccceeEeccccCCccccchhhhHHHHHhhhcccccceeEEEeccCCCCchhhHHHHHHHHHHHHHhh Confidence 88888888888888888999999999888889999999999988888889999998877666677789999999999999 Q ss_pred CCEEEEEecCccccccccccCCHHHHHHHhhcccc---ccccccccccceEEEEcCceeEecccCCcceeecHHHHHHHH Q lcl|NC_014792. 401 QDCLAFISPPKGLLVNVPLTRAVDNLIDWRTGGGS---FDTDNMNISTTYAAIDGNYKYQYDKYNDVNRWVPLAADMAGL 477 (659) Q Consensus 401 ~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~Ag~ 477 (659) ++||+++|+|.++........+.+++.+|++.... ......+++|+|+++||||++++|+.+++.+++|||+++||+ T Consensus 399 ~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~sg~vAGl 478 (663) T protein:vir:10 399 QDCVAIVNPPAELMVGIPTSTAVKNIVEWRNGMTGSGEVVDNNMNISSTYAFIIGNYKYQYDKYNDINRWVPLAADIAGL 478 (663) T ss_pred CCEEEEEecCcccccccccccchHHHHHHHHhccccccchhhhcccCccceEEEcCceEEecccCCceEEechhHHHHHH Confidence 99999999999998888888899999999986532 233455788999999999999999999999999999999999 Q ss_pred HHHhhhcCCceECcCCcchhheeccccceeecChhHHHhhhhCCceEEEEEeCCCeEEEEcccccCCCccccceeehhhH Q lcl|NC_014792. 478 CARTDDVSQPWMSPPGYNRGQILNVLKLAIEPRQTQRDRMYQEAINPVVGFAGGDGFVLYGDKTATKVPSPMDHINVRRL 557 (659) Q Consensus 478 ~a~~d~~~g~~~span~~~~~i~g~~~~~~~~~~~e~~~Ln~~gin~i~~~~~~~G~~~wG~rT~~~~~~~~~~i~vrR~ 557 (659) |||+|.++|+||||+|+++++|.|++++++.+++.|++.||++|||||++|++++||++||+||+++++++|+||||||| T Consensus 479 ~Ar~D~~~g~~~sPan~~~~~i~g~~~~~~~~~~~e~~~Ln~~gin~i~~~~~~~G~~~wG~rT~~~~~s~~~~i~vrR~ 558 (663) T protein:vir:10 479 CAYTDQVSHPWMSPAGYRRGQIRNCIKLAIEPKQSMRDTMYQVAINPVTGFAGGDGFVLFGDKMATQVPSPFDRINVRRL 558 (663) T ss_pred HHHhhccCCceEccCCceeccccccccceeecChhHHHHHhhCCceEEEEEeCCCcEEEEcccccCCCCcccceEehhhH Confidence 99999999999999999999999999999999999999999999999999998789999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhccceeeeEEEEccCCCCHHHhhCCEEEEEEEEEecCCce Q lcl|NC_014792. 558 TNMLKKNIGDASKYKLFELNDNFTRASFRMETSQYLDGIRALGGIYEGRVVCDTTNNTPSVIDRNEFVASIYYKPARSIN 637 (659) Q Consensus 558 ~~~i~~~i~~~~~~~v~epn~~~l~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e 637 (659) |+||+++|+++++|+||||||+.||++|+++|+.||++||++|+|.||+|+||+++||+++|++|+|+++|+|+|++|+| T Consensus 559 ~~~i~~si~~~~~~~v~epn~~~l~~~i~~~i~~~L~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~~i~~~p~~pae 638 (663) T protein:vir:10 559 FNMLKKNIGDTSKYELFENNDAFTRQSFRMETSQYLDGIRSLGGCYDFRVVCDTTNNTPNVIDRNEFVGTIYVKPPRSIN 638 (663) T ss_pred HHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEcCCCCCHHHhhCCeEEEEEEEEecCCcc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEEEEEeecCeeEEEecCC-C Q lcl|NC_014792. 638 YIVLNFVATSTGADFDELIGV-Q 659 (659) Q Consensus 638 ~i~~~~~~~~~~~~~~e~~~~-~ 659 (659) ||+|||+|+++|++|+|++|. | T Consensus 639 ~i~~~~~~~~~~~~~~e~~~~~~ 661 (663) T protein:vir:10 639 YITLNMVATSTGANFDELIGPMQ 661 (663) T ss_pred eEEEEEEEeecCccHHHHHHHHh Confidence 999999999999999999985 4 No 7 >protein:vir:100539 Length: 663 # NCBI annotation: gp18 tail sheath monomer # Family: family:all:661 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656380;genbank:gi:109290131;genbank:GeneID:4156517 Probab=100.00 E-value=1e-161 Score=903.16 Aligned_cols=657 Identities=66% Similarity=1.093 Sum_probs=549.7 Q ss_pred CceecCceEEEEecCCCcccccCCcceEEEeecccCCCCccEEeCCHHHHHHHcCCcCCCchhHHHHHHHHHcCCCeEEE Q lcl|NC_014792. 1 MALLSPGIELKETTVQSTVVRNATGRAALVGKFQWGPAFQVTQITNEVELVDLFGGPNNITADYFMSGMNFLQYGNDLRT 80 (659) Q Consensus 1 ~~~~~PGVyveE~~~~~~~~~~~ts~~afvG~~~~Gp~~~p~~i~s~~~~~~~fG~~~~~~~~~~~~~~~f~ngG~~~~v 80 (659) |+|+||||||||++++..+.+++||++||||+|+|||+|+|++|+||.||++.||++++.+|++|++++||+|||++||| T Consensus 1 ~~~~~Pgvyv~e~~~~~~~~~v~t~~~~fvG~~~~gp~~~p~~i~s~~~~~~~fG~~~~~~~~~~~v~~~f~ngg~~~~v 80 (663) T protein:vir:10 1 MALLSPGIEMKETSINSTVVRSATGRAALVGKFAWGPAYEIRQVTNEVELVDMFGSPDNVTAPYFMSAMNFLQYGNDLRL 80 (663) T ss_pred CccccCceEEEEecCcccccccccccceeeeccccCCCCcCEEecCHHHHHHHcCCcccccchHHHHHHHHHhCCCeEEE Confidence 99999999999998766777888999999999999999999999999999999999999999999999999999999999 Q ss_pred EeccCCcccccccccccccccccccccccccccceeeeeeccccccccceeeeeeccCcceeeeecccccccccccccee Q lcl|NC_014792. 81 VRVVNRDHAKNASPVAGNIESTIATAGSNYAVGDVIQVKHNQTVVETSGRITKVDVDGKILAVFIPSDKIIAFAKSVNQY 160 (659) Q Consensus 81 vRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~ 160 (659) |||.+.+.+.+++.+.+....+....+..+.+|+.+.+................+..+.....+++.............. T Consensus 81 vRv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~a~~~~~a~~~~~~ 160 (663) T protein:vir:10 81 VRVIDMEQAKNASPLFNQIEVTITTEGQGYTVGDTVSIKHNTTTVTEEGKVTKVDADGKIKALFVPSSAVIAKAKQLGTY 160 (663) T ss_pred EecCCcccccccccccccceeeEeecccCccccceeeecccccccccCcceeeeccCCceeEEEeccccccccccccccc Confidence 99998877777777666666555666778889999998877776666666666666766666666554433333333333 Q ss_pred eeeccceeeEEEeecCCccccccccceeccccceeeecccccccccccceeecccccccceeeeccccccceeEEEEEee Q lcl|NC_014792. 161 PDLGPAWTAEILTTSSGVSGTITLGKIVTDSGILLTEAENSEEAITSLEFQASLQKYAMPGVVALYPGEIGSTLEVEIVS 240 (659) Q Consensus 161 ~~~~~~~~~~v~~~~~g~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~g~~i~V~v~~ 240 (659) ......+...+............+.++..+.+...........................+...+.+.+.+|+.+.+.+.. T Consensus 161 ~~~~~a~~~~v~~~~~~~~~a~av~~i~~dg~vt~~~~~~a~~~~~~~~~~~~~~~~~~~~~~a~~~g~~G~~i~v~~~~ 240 (663) T protein:vir:10 161 PVLGDNWRAEVSGASGGSAATLTLGGIVVDSGVTFGNSEEAPDVMTSTKVLANFAKYGMPLISAVYPGEIGSTVEVEVIS 240 (663) T ss_pred cccccceeeEEeeccccccccceeEeeecCCceeEEeeeccccccccceeeeeccccccceeeeecccccCcceeEeecc Confidence 33333344444333333333444445555555555444444444444444555555566666777889999999887755 Q ss_pred cccccccceeeeeeeccccccccceeeeeeeccccccceeeeeccCCceeeeeeeeccccccccccchhhhhhhhhcccc Q lcl|NC_014792. 241 KAAYDVGASKMLDIYPNGGSRASVARAVFNYGPQTDDQYAIIVRRDGAIVENVVLSTKEGDKDVYGNNIYLDDYFAKGTS 320 (659) Q Consensus 241 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~et~~~~~~~~~~~~~~~~~~~~~~~~~~~s 320 (659) ...... .......+..+.+..........++..++.+.+++..++...|++.++...++....+...++...+.++.+ T Consensus 241 ~~~~~~--~~~~~v~~~~g~~~~~~~~~~~~g~~~~~~~~~~~~~~g~~~e~~~ls~~~~~~~~~~~~~~~~~~~~~~~s 318 (663) T protein:vir:10 241 KTAFQS--GAAQPIYPFGGTRASNARSVIQYGPMTDDQFAIIVRRDGIVVESTVLSTRRGDRDVYGNNIFMDDYFRNGSS 318 (663) T ss_pred cccccc--cceeeecccCcccccccccccccccccchhhcccccCCCcccceeeeeccccccccchhhhhhhhhhcCccc Confidence 443322 234445555666666666666777777788889999999999999988888888787777788888889899 Q ss_pred cceEEeecccCCccceeEEeecccccccccchhhhhhhHhhhhhcccccceEEEeccccccchhhhHHHHHHHHHHHHhh Q lcl|NC_014792. 321 NYIYATSLNWPKGFAGIINLMGGISANDQVTAGDLMQGWDLFADREALHINLLIAGAVAGEGDATASTVQKHVVSIADER 400 (659) Q Consensus 321 ~~v~~~~~~~~~~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~v~~~l~~~~~~~ 400 (659) .++.+.....+......++++||.++.+.++..++.++++++...+..+..++++++...+..++..+|+++|++||+++ T Consensus 319 ~~v~~~~~~~~~~~~~~~~l~gg~~~~~~~~~~d~~~~~~~l~~~~~~~~~~~i~~~~~~~~~~~~~~v~~~l~~~~~~~ 398 (663) T protein:vir:10 319 NFIYASSVNWPAGFTGIIQLGGGASANNAVGSDELIAGWDLFADREALHVNLMIAGACKSDGVAVASTVQKHVVALADDR 398 (663) T ss_pred ceeEeeccccCcccceeEEecccccCcccchhhhhhhHHhhhccccccCceEEEeecCCCCchhhHHHHHHHHHHHHHhh Confidence 99999998888888888999999999988999999999998877766666666666666666777889999999999999 Q ss_pred CCEEEEEecCccccccccccCCHHHHHHHhhcccc---ccccccccccceEEEEcCceeEecccCCcceeecHHHHHHHH Q lcl|NC_014792. 401 QDCLAFISPPKGLLVNVPLTRAVDNLIDWRTGGGS---FDTDNMNISTTYAAIDGNYKYQYDKYNDVNRWVPLAADMAGL 477 (659) Q Consensus 401 ~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~Ag~ 477 (659) ++||+|+|+|+++..+.......+++.+||+.... ....+.+++|+|+++||||++++|+.+++++++|||+++||+ T Consensus 399 ~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~l~~p~~~~~d~~~~~~~~~p~s~~vAGl 478 (663) T protein:vir:10 399 QDCVAFVNPPSELLVGVPTTQAVKNIVEWRNGVTTGGEVVDNNMNISSTYAFISGNYKYQYDKYNDINRWVPLSADIAGL 478 (663) T ss_pred CCEEEEEecCcccccccchhhhHHHHHHHhhhccccchhhhhhcccCcceEEEEecceeEecccCCceEEechHHHHHHH Confidence 99999999999998888888888999999986432 223456789999999999999999999999999999999999 Q ss_pred HHHhhhcCCceECcCCcchhheeccccceeecChhHHHhhhhCCceEEEEEeCCCeEEEEcccccCCCccccceeehhhH Q lcl|NC_014792. 478 CARTDDVSQPWMSPPGYNRGQILNVLKLAIEPRQTQRDRMYQEAINPVVGFAGGDGFVLYGDKTATKVPSPMDHINVRRL 557 (659) Q Consensus 478 ~a~~d~~~g~~~span~~~~~i~g~~~~~~~~~~~e~~~Ln~~gin~i~~~~~~~G~~~wG~rT~~~~~~~~~~i~vrR~ 557 (659) |||+|.++||||||+|+++++|.|++++++.+++.|++.||++|||+|++|++++||++||+||+++++++|+|||+||| T Consensus 479 ~Ar~D~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gin~i~~~~~~~G~~~wG~rT~s~~~s~~~~i~vrR~ 558 (663) T protein:vir:10 479 CAYTDQVGHPWMSPAGYRRGQLRNTIKLAIEPKQSLRDTMYQVSINPVTGFAGGDGFVLFGDKMATQVPSPFDRINVRRL 558 (663) T ss_pred HHHhhccCCcEEccCCeeecceeccccceeecCchhHHHHHhCCCcEEEEeeCCCcEEEEcccccCCCCcccceEehhhH Confidence 99999999999999999999999999999999999999999999999999998789999999999999899999999999 Q ss_pred HHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhccceeeeEEEEccCCCCHHHhhCCEEEEEEEEEecCCce Q lcl|NC_014792. 558 TNMLKKNIGDASKYKLFELNDNFTRASFRMETSQYLDGIRALGGIYEGRVVCDTTNNTPSVIDRNEFVASIYYKPARSIN 637 (659) Q Consensus 558 ~~~i~~~i~~~~~~~v~epn~~~l~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e 637 (659) |+||+++|+++++|+||||||+.||++|+++|++||++||++|+|+||+|+||+++||+++|++|+|+++|+|+|++|+| T Consensus 559 ~~~i~~si~~~~~~~v~epn~~~l~~~i~~~i~~~L~~l~~~gal~gf~V~~d~~~nt~~~i~~G~~~~~i~~~p~~pae 638 (663) T protein:vir:10 559 FNMLKKNIGDTSKYELFENNDAFTRQSFRMEVSQYLDNIRSLGGVYDFRVVCDTTNNTPQVIDSNEFVATIYIKAPRSIN 638 (663) T ss_pred HHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEcCCCCCHHHhhCCeEEEEEEEEecCCcc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEEEEEeecCeeEEEecCCC Q lcl|NC_014792. 638 YIVLNFVATSTGADFDELIGVQ 659 (659) Q Consensus 638 ~i~~~~~~~~~~~~~~e~~~~~ 659 (659) ||+|||+|+++|++|+|++|.+ T Consensus 639 ~I~~~~~~~~~~~~f~e~~~~~ 660 (663) T protein:vir:10 639 YITLNFVATSTGANFDELIGPA 660 (663) T ss_pred eEEEEEEEEecCccHHHHHHHH Confidence 9999999999999999999987 No 8 >protein:vir:98263 Length: 664 # NCBI annotation: gp18 tail sheath monomer # Family: family:all:661 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239196;genbank:gi:66391671;genbank:GeneID:3416365 Probab=100.00 E-value=3.7e-161 Score=900.11 Aligned_cols=655 Identities=65% Similarity=1.056 Sum_probs=530.6 Q ss_pred CceecCceEEEEecCCCcccccCCcceEEEeecccCCCCccEEeCCHHHHHHHcCCcCCCchhHHHHHHHHHcCCCeEEE Q lcl|NC_014792. 1 MALLSPGIELKETTVQSTVVRNATGRAALVGKFQWGPAFQVTQITNEVELVDLFGGPNNITADYFMSGMNFLQYGNDLRT 80 (659) Q Consensus 1 ~~~~~PGVyveE~~~~~~~~~~~ts~~afvG~~~~Gp~~~p~~i~s~~~~~~~fG~~~~~~~~~~~~~~~f~ngG~~~~v 80 (659) |+|+||||||||++....+.+++||++||||+|+|||+|+|++|+||.||++.||++++.+|++|++++||+|||++||| T Consensus 1 ma~~~PgVyv~E~~~~~~i~~~~ts~~~~vG~~~~Gp~~~p~~i~s~~d~~~~fG~~~~~~~~~~~v~~~f~ngg~~~~v 80 (664) T protein:vir:98 1 MALQSPGIETKETSVQSTVVRNSTGRAAIVGKFSWGPAYQIRQISNEVELVNYFGAPDNLTADYFMSAVNFLQYGNDLRL 80 (664) T ss_pred CceecCceEEEecCCCcccccccccceEEEeeccCCCCCccEEecCHHHHHHhcCCccccchhHHHHHHHHHhcCCeEEE Confidence 99999999999997433455567999999999999999999999999999999999999999999999999999999999 Q ss_pred EeccCCcccccccccccccccccccccccccccceeeeeeccccccccceeeeeeccCcceeeeeccccccc---ccccc Q lcl|NC_014792. 81 VRVVNRDHAKNASPVAGNIESTIATAGSNYAVGDVIQVKHNQTVVETSGRITKVDVDGKILAVFIPSDKIIA---FAKSV 157 (659) Q Consensus 81 vRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~---~~~~~ 157 (659) |||.+.+.++++....+.+..+....+....+++.+++.+..............+..++....++....... ..... T Consensus 81 vRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~gn~~~v~i~~~~~~~~~~~~~~~ 160 (664) T protein:vir:98 81 VRVVDKDAAKNASAIFNQIKTTIASQGSNYNVGDVIKVKYSNTVVEENGKVTQVDSNGKILAVTIPKRKKSLLVLNRSVL 160 (664) T ss_pred EEecCccccccccccccccceeeccCCcccccccceeEeecCcccccceeeccCCCCCceeeEeeccCccceeecccccc Confidence 999988877777776666666666666667788888888776655554444444555555544443221110 00000 Q ss_pred ceeeeeccceeeEEEeecCCccccccccceeccccceeeecccccccccccceeecccccccceeeeccccccceeEEEE Q lcl|NC_014792. 158 NQYPDLGPAWTAEILTTSSGVSGTITLGKIVTDSGILLTEAENSEEAITSLEFQASLQKYAMPGVVALYPGEIGSTLEVE 237 (659) Q Consensus 158 ~~~~~~~~~~~~~v~~~~~g~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~g~~i~V~ 237 (659) ....... ...+......+..+...+...+.+..............+..................+.+++.+|+.+++. T Consensus 161 ~~~~~~~--~~~s~~~~s~g~a~a~~v~~v~~d~~~~~~~~~~a~~~i~~~~~~~~~~~~~~~~~~a~~~G~~Gn~isv~ 238 (664) T protein:vir:98 161 TQIFLLV--GTTEIVSQSSGVSASITIDGIESDSGITLLNLDIAKETIQGTSFQTLTQKYQIPSVVALYPGELGSTVQVE 238 (664) T ss_pred cccceec--ccceeeeeecccceeeecccccccceeeccccceeeeccccccceeeeeccccceeeeeecccccceeeee Confidence 0000000 11112222222222222223333323222222222222333333333444455566677889999999888 Q ss_pred EeecccccccceeeeeeeccccccccceeeeeeeccccccceeeeeccCCceeeeeeeeccccccccccchhhhhhhhhc Q lcl|NC_014792. 238 IVSKAAYDVGASKMLDIYPNGGSRASVARAVFNYGPQTDDQYAIIVRRDGAIVENVVLSTKEGDKDVYGNNIYLDDYFAK 317 (659) Q Consensus 238 v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~et~~~~~~~~~~~~~~~~~~~~~~~~~ 317 (659) +......... ....+.................++..++.+.+++..++.+.|++.++...++........+....+.+ T Consensus 239 i~s~~~~~~~--~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~ 316 (664) T protein:vir:98 239 IISKAAYDTG--AMISGYPSGISVKNSGRSVMTYGPQTDNQYAFVVRRGGIVQESFIVSTDKTDKDIYGVNIYMDDFFAN 316 (664) T ss_pred ecccccccCc--ceEeeccCceecccceeeeeeccccCccceeEEEecCCceeeeEEeecccCcccceeeeeechhheec Confidence 7655443322 23444445555666667777777888888999999999999999998888888888887777778888 Q ss_pred ccccceEEeecccCCccceeEEeecccccccccchhhhhhhHhhhhhcccccceEEEeccccccchhhhHHHHHHHHHHH Q lcl|NC_014792. 318 GTSNYIYATSLNWPKGFAGIINLMGGISANDQVTAGDLMQGWDLFADREALHINLLIAGAVAGEGDATASTVQKHVVSIA 397 (659) Q Consensus 318 ~~s~~v~~~~~~~~~~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~v~~~l~~~~ 397 (659) +.+.++.+.....+.......++.||.++.+.++..+..+++.+++..+..+++||++|++...+.....+|+.+|++|| T Consensus 317 ~~~~~~~~~~~~~~~~~~~~~~~~gg~~~~~~~g~~~~~tgl~~l~~~~~~~~~ll~~p~~~~~~~~~~~~v~~al~~~a 396 (664) T protein:vir:98 317 GGSQYVFGTSMNWPKGFSGILEFGGGLSSNDTVGADELMTGWDMFADREALHVPLLIAGGCAGESVEIASTVQKHVISIG 396 (664) T ss_pred ccceeeeeecccCCcccceeEeccCccccccccCchhHHHHHHhhhcccccccceEEecCCCCCcHHHHHHHHHHHHHHH Confidence 88888877777777777778889999988877888888899999888888889999999998877778889999999999 Q ss_pred HhhCCEEEEEecCccccccccccCCHHHHHHHhhccccc----cccccccccceEEEEcCceeEecccCCcceeecHHHH Q lcl|NC_014792. 398 DERQDCLAFISPPKGLLVNVPLTRAVDNLIDWRTGGGSF----DTDNMNISTTYAAIDGNYKYQYDKYNDVNRWVPLAAD 473 (659) Q Consensus 398 ~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~ 473 (659) +++++||+++|+|.....+....++.+++.+||+..... .....+++|+|+++||||++++|+.+++++++|||++ T Consensus 397 ~~~~~~~a~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~l~~p~~~~~d~~~~~~~~~p~sg~ 476 (664) T protein:vir:98 397 DERQDCTVFVSPPRSLLVNIPLATAVDNIVEWRTGYKISGGTPVDNNLNVSSSYGFLDGNYKYQYDKYNDVNRWVPLAGD 476 (664) T ss_pred HhcCCeEEEEccccceeccCCccccHHHHHHHhhhccccccchhhhhcCCccceEEEEcCeEEEecccCCceEEechHHH Confidence 999999999999999999999999999999999864332 2234578899999999999999999999999999999 Q ss_pred HHHHHHHhhhcCCceECcCCcchhheeccccceeecChhHHHhhhhCCceEEEEEeCCCeEEEEcccccCCCccccceee Q lcl|NC_014792. 474 MAGLCARTDDVSQPWMSPPGYNRGQILNVLKLAIEPRQTQRDRMYQEAINPVVGFAGGDGFVLYGDKTATKVPSPMDHIN 553 (659) Q Consensus 474 ~Ag~~a~~d~~~g~~~span~~~~~i~g~~~~~~~~~~~e~~~Ln~~gin~i~~~~~~~G~~~wG~rT~~~~~~~~~~i~ 553 (659) +||+|||+|.++||||||+|+++.++.|++++++.+++.|++.||++|||+|++|++++|+++||+||++++|++|+||| T Consensus 477 ~AGl~A~~D~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~~G~~~wG~rT~~~~~s~~~~i~ 556 (664) T protein:vir:98 477 IAGLCVYTDSVANPWMSPAGYNRGQIRNCIKLAIEPRTAHRDAMYQVQINPVTGFAGGSGFVLYGDKTLTSVPSPFDRIN 556 (664) T ss_pred HHHHHHHhhhcCCcEECcCCceeeeeeccccceeecChhhHHHHHhCCCeEEEEeeCCCcEEEEcccccCCCCcccceEe Confidence 99999999999999999999999999999999999999999999999999999999878999999999999988999999 Q ss_pred hhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhccceeeeEEEEccCCCCHHHhhCCEEEEEEEEEec Q lcl|NC_014792. 554 VRRLTNMLKKNIGDASKYKLFELNDNFTRASFRMETSQYLDGIRALGGIYEGRVVCDTTNNTPSVIDRNEFVASIYYKPA 633 (659) Q Consensus 554 vrR~~~~i~~~i~~~~~~~v~epn~~~l~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~~i~~~p~ 633 (659) +||||+||+++|+++++|+||||||+.||++|+++|+.||++||++|+|+||+|+||+++||+++|++|+|+++|+++|+ T Consensus 557 vrR~~~~i~~si~~~~~~~v~epn~~~l~~~i~~~i~~~L~~l~~~gal~g~~V~~d~~~nt~~~i~~G~~~~~i~~~p~ 636 (664) T protein:vir:98 557 VRRLFNMIKKDIGDNAKYKLFENNDDFTRASFRMDTGQYMTNIRALGGCYDYRVICDTTNNTPDVIDRNEFVATVYVKPP 636 (664) T ss_pred ehhHHHHHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHHHHHHHhcCceeeeEEEEcCCCCCHHHhhCCeEEEEEEEEec Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCceEEEEEEEEeecCeeEEEecCCC Q lcl|NC_014792. 634 RSINYIVLNFVATSTGADFDELIGVQ 659 (659) Q Consensus 634 ~p~e~i~~~~~~~~~~~~~~e~~~~~ 659 (659) +|+|||+|||+|+++|++|+|++|.| T Consensus 637 ~pae~I~~~~~q~~~~~~~~e~~~~~ 662 (664) T protein:vir:98 637 RSINYITLNFVATSTGADFDELVGPQ 662 (664) T ss_pred CCcceEEEEEEEeecCcchhHhcccc Confidence 99999999999999999999999999 No 9 >protein:vir:80984 Length: 666 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469499;genbank:gi:157311456;genbank:GeneID:5602117 Probab=100.00 E-value=6.5e-160 Score=893.30 Aligned_cols=655 Identities=67% Similarity=1.102 Sum_probs=532.1 Q ss_pred CceecCceEEEEecCCCcccccCCcceEEEeecccCCCCccEEeCCHHHHHHHcCCcCCCchhHHHHHHHHHcCCCeEEE Q lcl|NC_014792. 1 MALLSPGIELKETTVQSTVVRNATGRAALVGKFQWGPAFQVTQITNEVELVDLFGGPNNITADYFMSGMNFLQYGNDLRT 80 (659) Q Consensus 1 ~~~~~PGVyveE~~~~~~~~~~~ts~~afvG~~~~Gp~~~p~~i~s~~~~~~~fG~~~~~~~~~~~~~~~f~ngG~~~~v 80 (659) |+|+||||||||++.+..+.+++||++||||+|+|||+|+|++|+||.||++.||++++.+|++|++++||+|||++||| T Consensus 1 ~~~~~Pgvyv~e~~~~~~~~~~~t~~~~~vg~~~~gp~~~p~~i~~~~~~~~~fg~~~~~~~~~~~~~~~f~~~g~~~~v 80 (666) T protein:vir:80 1 MTLLSPGFETKETTLSTTIVQSATGRAALVGKFQWGPAFQIIQVTNEVELVNKFGQPDNNTADYFMSGANFLQYGNDLRV 80 (666) T ss_pred CceecCceEEEEecCCccccccCcccceEEeccccCCCccceEecCHHHHHHhcCCccCccchHHHHHHHHhcCCCeEEE Confidence 99999999999998666777888999999999999999999999999999999999999999999999999999999999 Q ss_pred EeccCCcccccccccccccccccccccccccccceeeeeeccccccccceeeeeeccCcceeeeecccccccccccccee Q lcl|NC_014792. 81 VRVVNRDHAKNASPVAGNIESTIATAGSNYAVGDVIQVKHNQTVVETSGRITKVDVDGKILAVFIPSDKIIAFAKSVNQY 160 (659) Q Consensus 81 vRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~ 160 (659) |||++.+.++++....+.+.......+....+++.+++++.............+...++..................... T Consensus 81 ~R~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~i~~~~~~~~~~~~ta~~~~~a~~~~~~ 160 (666) T protein:vir:80 81 VRVLNKEKAKNATALAGNIEFEITNEGSNYEVGDTIKIKHNRQDIETAGKVTKVDGDGKVKGVFIPTGKIIAHAKAIGVY 160 (666) T ss_pred EEecCccccccccccccceeEEEeeccccccccccccccccCcccccCcceEEEeecceeeeeecchhhhcccccccccc Confidence 99998888888887777777777777777778888887776655554455555555555444444444444444445555 Q ss_pred eeeccceeeEEEeecCCccccccccceeccccceeeecccccccccccceeecccccccceeeeccccccceeEEEEEee Q lcl|NC_014792. 161 PDLGPAWTAEILTTSSGVSGTITLGKIVTDSGILLTEAENSEEAITSLEFQASLQKYAMPGVVALYPGEIGSTLEVEIVS 240 (659) Q Consensus 161 ~~~~~~~~~~v~~~~~g~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~g~~i~V~v~~ 240 (659) ..+...+..+......+......+.+++.+.+...........................+...+.+.+.+++.+.+++.. T Consensus 161 ~~v~~~~~~~~~~~~~~~~~a~~V~~~~~~~~~~~~~~~~a~~~~t~~~~~~~~~~~~~~a~~a~~~g~~g~~l~v~i~~ 240 (666) T protein:vir:80 161 PELDGDWTAEFTSSSGNGSAALSVTKIVTDSGLLLTDLETSRANITNQTFLTKLQKYDMPAVSAIYAGEIGNSLEVEILA 240 (666) T ss_pred ceeeccceeeeccccccceeeeeeeeeecCCccceeeeccccccccccccccccccccchhhhhhcccccccceeeeecc Confidence 55555555555555554444444555555555444333333333333333333334444445566788888888776543 Q ss_pred cccccccceeeeeeeccccccccceeeeeeeccccccceeeeeccCCceeeeeeeeccccccccccchhhhhhhhhcccc Q lcl|NC_014792. 241 KAAYDVGASKMLDIYPNGGSRASVARAVFNYGPQTDDQYAIIVRRDGAIVENVVLSTKEGDKDVYGNNIYLDDYFAKGTS 320 (659) Q Consensus 241 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~et~~~~~~~~~~~~~~~~~~~~~~~~~~~s 320 (659) ...... ............ ............+..+.++.+++..++.++|+|.++...+++...+...++...+.++.+ T Consensus 241 ~~~~~~-~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 318 (666) T protein:vir:80 241 RSAFKN-TAPDLTMYPYGG-ERTAARNLIPYAPQNDNQYAFIVRRDGVVVESYVLSTLKGDKDVYGNSIYMDDFFGRGSS 318 (666) T ss_pred cccccc-ccccceeeeccc-cccccceeeeeccccccceeeEeccCCccceeeecccccccccccchhhhhhhhhccccc Confidence 322221 111111111111 112234445555666777888999999999999998888888888888888888878877 Q ss_pred cceEEeecccCCccceeEEeecccccccc--------cchhhhhhhHhhhhhcccccceEEEeccccccchhhhHHHHHH Q lcl|NC_014792. 321 NYIYATSLNWPKGFAGIINLMGGISANDQ--------VTAGDLMQGWDLFADREALHINLLIAGAVAGEGDATASTVQKH 392 (659) Q Consensus 321 ~~v~~~~~~~~~~~~~~~~~~gg~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~v~~~ 392 (659) .++...............++.+|.+.... ...+++.++++++++.+.++++++++|++.+.. .+..+++.+ T Consensus 319 ~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~l~~p~~~~~~-~~~~~v~~~ 397 (666) T protein:vir:80 319 QYIYATAQGWVDGFSGIISLAGGVSANEATTGGVGADPFIGAMMQGWGLFAERESIHVNLLIAGACAGEG-DAFSTVQKH 397 (666) T ss_pred eeeeecccccccccceEEEecCCCCcccccccccccccccccchhhhhhhhhhcccccceEeecCcCCcc-cchHHHHHH Confidence 77777666666666667788887664332 223455667888888888999999999988654 456789999 Q ss_pred HHHHHHhhCCEEEEEecCccccccccccCCHHHHHHHhhccccccccccccccceEEEEcCceeEecccCCcceeecHHH Q lcl|NC_014792. 393 VVSIADERQDCLAFISPPKGLLVNVPLTRAVDNLIDWRTGGGSFDTDNMNISTTYAAIDGNYKYQYDKYNDVNRWVPLAA 472 (659) Q Consensus 393 l~~~~~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~ 472 (659) |++||+++++||+++|+|+...++.+..++++++.+||+..+.......+++|+|+++||||++++|+.+++.+++|||+ T Consensus 398 ~~~~~~~~~~~~~~~~~~~~~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~l~~p~~~~~d~~~~~~~~~p~sg 477 (666) T protein:vir:80 398 AVSIGDERQDCLVMVSPPRSTVVNIPVTTAIDNLIAWREGSGNYNENNMNINTTYAVIDGNYKYQYDKYNDVNRWVPLAA 477 (666) T ss_pred HHHHHHhhcceEEEeecceeEEeecCCCCCHHHHHHHHHhcccchhhhcccCcceEEEEcCceEEecccCCceeEechHH Confidence 99999999999999999999998988899999999999999999998999999999999999999999999999999999 Q ss_pred HHHHHHHHhhhcCCceECcCCcchhheeccccceeecChhHHHhhhhCCceEEEEEeCCCeEEEEcccccCCCcccccee Q lcl|NC_014792. 473 DMAGLCARTDDVSQPWMSPPGYNRGQILNVLKLAIEPRQTQRDRMYQEAINPVVGFAGGDGFVLYGDKTATKVPSPMDHI 552 (659) Q Consensus 473 ~~Ag~~a~~d~~~g~~~span~~~~~i~g~~~~~~~~~~~e~~~Ln~~gin~i~~~~~~~G~~~wG~rT~~~~~~~~~~i 552 (659) ++||+|||+|.++||||||||++++++.|++++++.+++.|++.||++|||||++|++ +|+++||+||+++++++|+|| T Consensus 478 ~~AGl~Ar~D~~~g~~~sPan~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~g-~G~~~wG~rT~~~~~s~~~~i 556 (666) T protein:vir:80 478 DIAGLCARTDAVSQPWMSPAGYNRGQIMNVVKLAIEPRKAHRDRLYQAAINPVIGAGG-EGFILMGDKTATTVPSPFDRI 556 (666) T ss_pred HHHHHHHHHhhcCCceEccCCeecceeeccccceeecChhHHHhhhhCCeeEEEEeCC-CeEEEEccccCCCCCccccee Confidence 9999999999999999999999999999999999999999999999999999999987 799999999999998999999 Q ss_pred ehhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhccceeeeEEEEccCCCCHHHhhCCEEEEEEEEEe Q lcl|NC_014792. 553 NVRRLTNMLKKNIGDASKYKLFELNDNFTRASFRMETSQYLDGIRALGGIYEGRVVCDTTNNTPSVIDRNEFVASIYYKP 632 (659) Q Consensus 553 ~vrR~~~~i~~~i~~~~~~~v~epn~~~l~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~~i~~~p 632 (659) ||||||+||+++|+++++|+||||||+.||++|+++|+.||++||++|+|+||+|+||+++||++||++|+|+++|+|+| T Consensus 557 ~vRRl~~~i~~si~~~~~~~v~epn~~~l~~~i~~~i~~~L~~l~~~gal~g~~V~~d~~~nt~~di~~G~~~~~i~~~P 636 (666) T protein:vir:80 557 NVRRLFNMLKKNIGDSSKYKLFENNDNFTRASFRMEVSQYLSTIRSLGGIYDFRVQCDTTNNTPDVIDRNEFVASMFIKP 636 (666) T ss_pred ehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhcCceeeeEEEEcCCCCCHHHhhCCeEEEEEEEEe Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cCCceEEEEEEEEeecCeeEEEecCCC Q lcl|NC_014792. 633 ARSINYIVLNFVATSTGADFDELIGVQ 659 (659) Q Consensus 633 ~~p~e~i~~~~~~~~~~~~~~e~~~~~ 659 (659) ++|||||+|||+|+++|++|+||+|.= T Consensus 637 ~~Pae~I~~~~~~~~~~~~~~e~~~~~ 663 (666) T protein:vir:80 637 AKSINYIMLNFTAVATGSDFDEIIGPV 663 (666) T ss_pred cCCcceEEEEEEEeecCccHHHHHHHH Confidence 999999999999999999999999842 No 10 >protein:vir:106427 Length: 679 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944106;genbank:gi:38640150;genbank:GeneID:2658188 Probab=100.00 E-value=1.5e-159 Score=891.33 Aligned_cols=657 Identities=53% Similarity=0.928 Sum_probs=519.3 Q ss_pred CceecCceEEEEecCCCcccccCCcceEEEeecccCCCCccEEeCCHHHHHHHcCCcCCCchhHHHHHHHHHcCCCeEEE Q lcl|NC_014792. 1 MALLSPGIELKETTVQSTVVRNATGRAALVGKFQWGPAFQVTQITNEVELVDLFGGPNNITADYFMSGMNFLQYGNDLRT 80 (659) Q Consensus 1 ~~~~~PGVyveE~~~~~~~~~~~ts~~afvG~~~~Gp~~~p~~i~s~~~~~~~fG~~~~~~~~~~~~~~~f~ngG~~~~v 80 (659) |+|+||||||||++....+.+++||++||||+|+|||+|+|++|+||.||++.||++++.+|++|++++||+|||++||| T Consensus 1 ~~~~~Pgvyv~e~~~~~~i~~~~t~~~~~vg~~~~gp~~~p~~i~s~~~~~~~fg~~~~~~~~~~~~~~~f~~gg~~~~v 80 (679) T protein:vir:10 1 MTLLSPGVETKEINLQTTIARSSTGRAALVGKFNWGPAYQISQVVSEVDLVDKFGRPDDQTADSFFSGVNFLNYGNDLRL 80 (679) T ss_pred CceecCceEEEeecCCcccccCccccceeeecccCCCCccCEEecCHHHHHHHcCCcccccchHHHHHHHHHhCCCeEEE Confidence 99999999999997544455667999999999999999999999999999999999999999999999999999999999 Q ss_pred EeccCCcccccccccccccccccccccccccccceeeeeeccccccccceeeeeeccCcceeeeecccccccccccccee Q lcl|NC_014792. 81 VRVVNRDHAKNASPVAGNIESTIATAGSNYAVGDVIQVKHNQTVVETSGRITKVDVDGKILAVFIPSDKIIAFAKSVNQY 160 (659) Q Consensus 81 vRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~ 160 (659) |||.+.+..+++....+.+..+....+....+++.+++...... .........+..++....+++...........+.. T Consensus 81 vrv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~s~-~~~~~~~~~~~~~~~~~~~v~~~~~~~~a~~~~~~ 159 (679) T protein:vir:10 81 VRVLNETKSRNSSALYQSLSYTITSPGVDYKVGDVVNVLQGGNV-IATGKVTVVNASGGIVAFYVPTAAIIDKAKSLNDY 159 (679) T ss_pred EEccCcccccccccccccccccccccccccccccceeeeeCCCc-ccceeEEEeeccCceeeeeeccccccccccccccc Confidence 99999887777777666666666666666778888877655433 34445555666666555555554444444444444 Q ss_pred eeeccceeeEEEeecCCcc--ccccccceeccccceeeecccccccccc-----cceeecccccccceeeecccccccee Q lcl|NC_014792. 161 PDLGPAWTAEILTTSSGVS--GTITLGKIVTDSGILLTEAENSEEAITS-----LEFQASLQKYAMPGVVALYPGEIGST 233 (659) Q Consensus 161 ~~~~~~~~~~v~~~~~g~~--~~~~~~~~v~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~a~~~g~~g~~ 233 (659) ..++..+..+......... ....+..+..+.+........+...... ............+...+..++.+++. T Consensus 160 ~~l~~a~~~~~~~~t~~~g~~~~~~v~~v~~~~~~~~~~~~~a~~~i~~~~~~~~t~~~~~~~~~~~~~~A~~~g~~gn~ 239 (679) T protein:vir:10 160 PALDNAWQIQFAAGGPGAGQAATATVVGINLDSTIFVPNDEYAMSAISERSETKRTFIDICEEMKVPAIVARYAGTYGDN 239 (679) T ss_pred ceecccceeeeeeccccccceeeeeeeeeccCCceeeccccccccccccccccchhhhhhhhccccceeeeecccccCCc Confidence 4444444443333222211 1112222223333332222222221111 11111122223334455667788888 Q ss_pred EEEEEeeccccccc--ceeeeeeecccc--------ccccceeeeeeeccccccceeeeeccCCceeeeeeeeccccccc Q lcl|NC_014792. 234 LEVEIVSKAAYDVG--ASKMLDIYPNGG--------SRASVARAVFNYGPQTDDQYAIIVRRDGAIVENVVLSTKEGDKD 303 (659) Q Consensus 234 i~V~v~~~~~~~~~--~~~~~~~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~et~~~~~~~~~~~ 303 (659) +.+.+......... ............ ............+......+.+++..++...|.+.++...++.. T Consensus 240 i~v~~va~~~~~~~~~~~a~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vvv~~~g~~~~~~~~~~~~~~~~ 319 (679) T protein:vir:10 240 IKVLMIAYKDYYKFNEAGKIVSVNTINPKVFPTGLDYGNVTPSSYLEFGPQNESQFAFIVFNNGVAVESKILSTKPGDRD 319 (679) T ss_pred ceEEEEeecccccccccccccccccccccccccccccccceeeeecccccccccceeeEEecccccccceeeeccccccc Confidence 77665432221110 111111100000 01111122223334455667777888888899999998888888 Q ss_pred cccchhhhhhhhhcccccceEEeecccCCccceeEEeecccccccccchhhhhhhHhhhhhcccccceEEEeccccccch Q lcl|NC_014792. 304 VYGNNIYLDDYFAKGTSNYIYATSLNWPKGFAGIINLMGGISANDQVTAGDLMQGWDLFADREALHINLLIAGAVAGEGD 383 (659) Q Consensus 304 ~~~~~~~~~~~~~~~~s~~v~~~~~~~~~~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~ 383 (659) ......++...+.++.+.++.......+......++++||.++...++.+++..+++++++.+.++++|+|+|+..+... T Consensus 320 ~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~p~~~~~~~ 399 (679) T protein:vir:10 320 IYGTSIYINEYFGNGYSSFVQGVAESWPVGYTGVLAFGGGQSSNTDISAAEFMKGWDMFADREHTDVNLFIAGAVAGEGA 399 (679) T ss_pred ccchhhhhhhhhcCcccceeeeccccccccccceeeccCCccCCCccchhhhhhhhhhhhcccccccceEEecCCCCCch Confidence 88888888888888888888888777777778889999999998888889999999999999999999999999988777 Q ss_pred hhhHHHHHHHHHHHHhhCCEEEEEecCccccccccccCCHHHHHHHhhcccc---ccccccccccceEEEEcCceeEecc Q lcl|NC_014792. 384 ATASTVQKHVVSIADERQDCLAFISPPKGLLVNVPLTRAVDNLIDWRTGGGS---FDTDNMNISTTYAAIDGNYKYQYDK 460 (659) Q Consensus 384 ~~~~~v~~~l~~~~~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~s~~~~~~~p~~~~~d~ 460 (659) .+..+|+.+|++||+++++||+|+|+|.+...+.....+.+++.+||+.... ......+++|+|+++||||++++|+ T Consensus 400 ~~~~~v~~~l~~~~~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~ 479 (679) T protein:vir:10 400 QIASTVQKAVVAIADERRDCLVLISPPREYMINQPAASVVRKLVDWRRGVNQAGISLDDNMNIGTTYASVDGNYKYQYDK 479 (679) T ss_pred hhhHHHHHHHHHHHHhhCCeEEEEeccccccccccccchHHHHHHHHhhcccccchhhhhhccCcceEEEEccceeeecc Confidence 7888999999999999999999999999998888888899999999986432 2334557889999999999999999 Q ss_pred cCCcceeecHHHHHHHHHHHhhhcCCceECcCCcchhheeccccceeecChhHHHhhhhCCceEEEEEeCCCeEEEEccc Q lcl|NC_014792. 461 YNDVNRWVPLAADMAGLCARTDDVSQPWMSPPGYNRGQILNVLKLAIEPRQTQRDRMYQEAINPVVGFAGGDGFVLYGDK 540 (659) Q Consensus 461 ~~~~~~~~p~s~~~Ag~~a~~d~~~g~~~span~~~~~i~g~~~~~~~~~~~e~~~Ln~~gin~i~~~~~~~G~~~wG~r 540 (659) .+++.+++|||+++||+|||+|.++||||||+|+++++|.|+.++++.+++.|++.||++|||+|++|++ +|+++||+| T Consensus 480 ~~~~~~~~p~sg~vAGl~Ar~D~~~g~~~sPan~~~~~i~g~~~~~~~~~~~~~~~Ln~~gin~i~~~~g-~G~~~wG~r 558 (679) T protein:vir:10 480 YNDVNRWIPLAADIAGLCARTDTVGQPWQSPAGFNRGQIVNVIKLAVDTRQAHRDEMYTNGINPIVGFAG-QGYILYGDK 558 (679) T ss_pred cCCceEEechHHHHHHHHHHhhccCCcEECcCCeeeccccccccceeecChhhHHhhhhCCceEEEEecC-CeEEEEccc Confidence 9999999999999999999999999999999999999999999999999999999999999999999997 799999999 Q ss_pred ccCCCccccceeehhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhccceeeeEEEEccCCCCHHHhh Q lcl|NC_014792. 541 TATKVPSPMDHINVRRLTNMLKKNIGDASKYKLFELNDNFTRASFRMETSQYLDGIRALGGIYEGRVVCDTTNNTPSVID 620 (659) Q Consensus 541 T~~~~~~~~~~i~vrR~~~~i~~~i~~~~~~~v~epn~~~l~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~nt~~~i~ 620 (659) |+++++++|+||||||||+|||++|+++++|+||||||+.||++|+++|++||++||++|+|+||+|+||+++||+++|+ T Consensus 559 T~~~~~s~~~~i~vrR~~~~i~~si~~~~~~~v~epn~~~~~~~i~~~i~~fL~~l~~~gal~gf~v~~d~~~nt~~~i~ 638 (679) T protein:vir:10 559 TASQAPTPFDRINVRRLFNLLKKSISESAKYKLFELNDAFTRSSFRSEVGSYLDTIRSLGGIYDFRVVCDESNNTPAVID 638 (679) T ss_pred ccCCCCcccceEehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEcCCCCCHHHhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCEEEEEEEEEecCCceEEEEEEEEeecCeeEEEecCCC Q lcl|NC_014792. 621 RNEFVASIYYKPARSINYIVLNFVATSTGADFDELIGVQ 659 (659) Q Consensus 621 ~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~~~~ 659 (659) +|+|+++|+|+|++|||||+|||+|++++++|+||+|.+ T Consensus 639 ~G~~~~~i~~~p~~pae~i~~~~~~~~~~~~~~e~~~~~ 677 (679) T protein:vir:10 639 RNEFVATILIKPARSINYITLSFVATSTGADFDELVGSF 677 (679) T ss_pred CCeEEEEEEEEecCCccEEEEEEEEeecCccHHHHHHHh Confidence 999999999999999999999999999999999999987 No 11 >protein:vir:6594 Length: 666 # NCBI annotation: tail sheath protein # Family: family:all:661 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891725;genbank:gi:33620670;genbank:GeneID:1725344 Probab=100.00 E-value=3.6e-159 Score=889.24 Aligned_cols=655 Identities=66% Similarity=1.105 Sum_probs=522.1 Q ss_pred CceecCceEEEEecCCCcccccCCcceEEEeecccCCCCccEEeCCHHHHHHHcCCcCCCchhHHHHHHHHHcCCCeEEE Q lcl|NC_014792. 1 MALLSPGIELKETTVQSTVVRNATGRAALVGKFQWGPAFQVTQITNEVELVDLFGGPNNITADYFMSGMNFLQYGNDLRT 80 (659) Q Consensus 1 ~~~~~PGVyveE~~~~~~~~~~~ts~~afvG~~~~Gp~~~p~~i~s~~~~~~~fG~~~~~~~~~~~~~~~f~ngG~~~~v 80 (659) |+|+||||||||++.+..+.+++||++||||+|+|||+|+|++|+||.||+++||++++.+|++|++++||+|||++||| T Consensus 1 ~~~~~PgVyv~e~~~~~~i~~v~ts~~~fvG~~~~Gp~~~p~~v~s~~~~~~~fG~~~~~~~~~~~~~~~f~ngg~~~~v 80 (666) T protein:vir:65 1 MTLLSPGFETKETTLSTTIVQSETGRAALVGKFQWGPAFQIIQVTNEVELVNKFGQPDNNTADYFMSGANFLQYGNDLRV 80 (666) T ss_pred CceecCceEEEEecCcccccccCcccceEEecccCCCCccCEEecCHHHHHHHcCCccccchhHHHHHHHHHhcCceEEE Confidence 99999999999998666666778999999999999999999999999999999999999999999999999999999999 Q ss_pred EeccCCcccccccccccccccccccccccccccceeeeeeccccccccceeeeeeccCcceeeeecccccccccccccee Q lcl|NC_014792. 81 VRVVNRDHAKNASPVAGNIESTIATAGSNYAVGDVIQVKHNQTVVETSGRITKVDVDGKILAVFIPSDKIIAFAKSVNQY 160 (659) Q Consensus 81 vRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~ 160 (659) ||+.+.+.++++....+........++....+|+.+.++...............+..+.....+.+.............. T Consensus 81 vrv~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~~~~V~~~~~~~~~~~~~~~~~~~~~~~g~~~~t~~~~~~~~~~g~~ 160 (666) T protein:vir:65 81 VRVLNKEKAKNATALAGNVEFEITNEGSNYEVGDTIKIKHNRQDIETAGKVTKVDGDGKVKGVFIPTGKIIAHAKAIGVY 160 (666) T ss_pred EEccCcccccccccccCceeeeEeeccccccccceEEEEecccccccccccccccccccccccccccceeeccccccCcc Confidence 99998888888877777776666777777889999998877655444343444444443333333322222222333333 Q ss_pred eeeccceeeEEEeecCCccccccccceeccccceeeecccccccccccceeecccccccceeeeccccccceeEEEEEee Q lcl|NC_014792. 161 PDLGPAWTAEILTTSSGVSGTITLGKIVTDSGILLTEAENSEEAITSLEFQASLQKYAMPGVVALYPGEIGSTLEVEIVS 240 (659) Q Consensus 161 ~~~~~~~~~~v~~~~~g~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~g~~i~V~v~~ 240 (659) ..+...+..+......+......+.....+.+...............................+...+.+++.+.+++.. T Consensus 161 ~~l~~~~~~~~~~~~~~~~~a~sv~~~~~~g~~~~~~~~~a~~~~~~~~~~~~~~~~~~~a~~A~~~g~~g~~i~v~i~~ 240 (666) T protein:vir:65 161 PELDGGWTAEFTSSSGNGSAALSVTKIVTDSGLLLTDLETSRANITNQTFLTKLKKYDMPAVSAIYAGEIGNSLEVEILA 240 (666) T ss_pred eeEeeccceeecccCcccccceeeeecccccceeeeeecccccccccccccccccccccceeeeeeccccccceeEEeec Confidence 33333333444333333333333333333333333333222222222222333333344445567788888888877654 Q ss_pred cccccccceeeeeeeccccccccceeeeeeeccccccceeeeeccCCceeeeeeeeccccccccccchhhhhhhhhcccc Q lcl|NC_014792. 241 KAAYDVGASKMLDIYPNGGSRASVARAVFNYGPQTDDQYAIIVRRDGAIVENVVLSTKEGDKDVYGNNIYLDDYFAKGTS 320 (659) Q Consensus 241 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~et~~~~~~~~~~~~~~~~~~~~~~~~~~~s 320 (659) ..........+ ..... ..........+......+..+.+++...|..+|+|.++...+++...+...++.+.+.++.+ T Consensus 241 ~~~~~~~~~~l-~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 318 (666) T protein:vir:65 241 RSAFKNTAPDL-TMYPY-GGERTAARNLIPYAPQNDNQYAFIVRRDGVVVESYVLSTLKGDKDVYGNSIYMDDFFARGSS 318 (666) T ss_pred ccccccccccc-ccccc-ccccccceeeecccccccccceeeeecCCcccceeecccCcccccccchhhhhhhhhccccc Confidence 43322221111 11111 11112233344445556667888888999999999988888888888888888888888889 Q ss_pred cceEEeecccCCccceeEEeeccccccccc--------chhhhhhhHhhhhhcccccceEEEeccccccchhhhHHHHHH Q lcl|NC_014792. 321 NYIYATSLNWPKGFAGIINLMGGISANDQV--------TAGDLMQGWDLFADREALHINLLIAGAVAGEGDATASTVQKH 392 (659) Q Consensus 321 ~~v~~~~~~~~~~~~~~~~~~gg~~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~v~~~ 392 (659) .++++.....+......+++.+|.+..... ..++..+++++++..+...++++++|++.+.+ .+..+|+.+ T Consensus 319 ~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~l~~p~~~~~~-~~~~~v~~~ 397 (666) T protein:vir:65 319 QYIYATAQGWVDGFSGIISLAGGVSANEATTGGVGADPFIGAMMQGWDLFAERESIHVNLLIAGACAGEG-DAFSTVQKH 397 (666) T ss_pred ceeeeecccccccccceEEccCCCCcCcccccccccccccccHHHHHHHHhhhhhccCCceeecCcCCcc-chhHHHHHH Confidence 998888877776667778888887654322 23456678888888887889999999987644 467899999 Q ss_pred HHHHHHhhCCEEEEEecCccccccccccCCHHHHHHHhhccccccccccccccceEEEEcCceeEecccCCcceeecHHH Q lcl|NC_014792. 393 VVSIADERQDCLAFISPPKGLLVNVPLTRAVDNLIDWRTGGGSFDTDNMNISTTYAAIDGNYKYQYDKYNDVNRWVPLAA 472 (659) Q Consensus 393 l~~~~~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~ 472 (659) |++||+++++||+++|+|+...++.....+.+++.+||+..+.....+.+++|+|+++||||++++|+.+++.+++|||+ T Consensus 398 l~~~~~~~~~~~a~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~sg 477 (666) T protein:vir:65 398 AVSIGDERQDCLVMVSPPRSTVVNIPVTTAIDNLIAWREGSGNYNENNMNINTTYAVIDGNYKYQYDKYNDVNRWVPLAA 477 (666) T ss_pred HHHHHhhccceEEEeccccceeeecCCCCCHHHHHHHHHhcccccccccccCcceEEEEcCceEEecccCCceeEechHH Confidence 99999999999999999999999999999999999999999999988899999999999999999999999999999999 Q ss_pred HHHHHHHHhhhcCCceECcCCcchhheeccccceeecChhHHHhhhhCCceEEEEEeCCCeEEEEcccccCCCcccccee Q lcl|NC_014792. 473 DMAGLCARTDDVSQPWMSPPGYNRGQILNVLKLAIEPRQTQRDRMYQEAINPVVGFAGGDGFVLYGDKTATKVPSPMDHI 552 (659) Q Consensus 473 ~~Ag~~a~~d~~~g~~~span~~~~~i~g~~~~~~~~~~~e~~~Ln~~gin~i~~~~~~~G~~~wG~rT~~~~~~~~~~i 552 (659) ++||+|||+|.++||||||+|+++++|.|++++++.+++.|++.||++|||||++|++ +|+++||+||+++++++|+|| T Consensus 478 ~vAGl~Ar~D~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~-~G~~~wG~rT~~~~~s~~~~i 556 (666) T protein:vir:65 478 DIAGLCARTDAVSQPWMSPAGYNRGQIMNVVKLAIEPRKAHRDRLYQAAINPVIGAGG-EGFILMGDKTATTVPSPFDRI 556 (666) T ss_pred HHHHHHHHHhccCCcEEccCCeecceeeccccceeecChhHHHhhhhCCceEEEEeCC-CeEEEEecccCCCCCcccceE Confidence 9999999999999999999999999999999999999999999999999999999986 799999999999998999999 Q ss_pred ehhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhccceeeeEEEEccCCCCHHHhhCCEEEEEEEEEe Q lcl|NC_014792. 553 NVRRLTNMLKKNIGDASKYKLFELNDNFTRASFRMETSQYLDGIRALGGIYEGRVVCDTTNNTPSVIDRNEFVASIYYKP 632 (659) Q Consensus 553 ~vrR~~~~i~~~i~~~~~~~v~epn~~~l~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~~i~~~p 632 (659) ||||||+||+++|+++++|+||||||+.||++|+++|+.||++||++|+|+||+|+||+++||+++|++|+|+++|+|+| T Consensus 557 ~vrR~~~~i~~si~~~~~~~v~epn~~~l~~~i~~~i~~~L~~l~~~gal~g~~V~~d~~~nt~~~i~~G~~~~~i~~~p 636 (666) T protein:vir:65 557 NVRRLFNMLKKNIGDSSKYKLFENNDNFTRASFRMEVSQYLSTIRSLGGIYDFRVQCDTTNNTPDVIDRNEFVASMFIKP 636 (666) T ss_pred ehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEcCCCCCHHHhhCCeEEEEEEEEe Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cCCceEEEEEEEEeecCeeEEEecCC--C Q lcl|NC_014792. 633 ARSINYIVLNFVATSTGADFDELIGV--Q 659 (659) Q Consensus 633 ~~p~e~i~~~~~~~~~~~~~~e~~~~--~ 659 (659) ++|||||+|||+|++++++|+||++. | T Consensus 637 ~~pae~i~~~~~~~~~~~~~~e~~~~~~~ 665 (666) T protein:vir:65 637 AKSINYIMLNFTAVATGSDFDEIIGPANQ 665 (666) T ss_pred cCCcceEEEEEEEeecCccHHHHHHHHhc Confidence 99999999999999999999999984 3 No 12 >protein:vir:5663 Length: 671 # NCBI annotation: tail sheath protein # Family: family:all:661 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899602;genbank:gi:34419589;genbank:GeneID:2545776 Probab=100.00 E-value=1.4e-147 Score=825.73 Aligned_cols=645 Identities=52% Similarity=0.841 Sum_probs=465.6 Q ss_pred CceecCceEEEEecCCCcccccCCcceEEEeecccCCCCccEEeCCHHHHHHHcCCcCCCchhHHHHHHHHHcCCCeEEE Q lcl|NC_014792. 1 MALLSPGIELKETTVQSTVVRNATGRAALVGKFQWGPAFQVTQITNEVELVDLFGGPNNITADYFMSGMNFLQYGNDLRT 80 (659) Q Consensus 1 ~~~~~PGVyveE~~~~~~~~~~~ts~~afvG~~~~Gp~~~p~~i~s~~~~~~~fG~~~~~~~~~~~~~~~f~ngG~~~~v 80 (659) |+|++|||||||++....+.+++||++||||+|+|||+|+|++|+||.||++.||++++.+|++|++++||+|||++||| T Consensus 1 ~~~~~Pgvyv~e~~~~~~i~~v~t~~~~fvG~~~~Gp~~~p~~v~s~~~~~~~fG~~~~~~~~~~~v~~~f~ngg~~~~v 80 (671) T protein:vir:56 1 MTLLSPGIENKEINLASAIGRAATGRAAMVGKFEWGPAYSITQVTSESDLVTIFGRPNDYTAASFMTANNFLKYGNDLRL 80 (671) T ss_pred CceecCceEEEeecCcccccccCcccceEEecccCCCCccCEEcCCHHHHHHHcCCcCCCcchhHHHHHHHHhcCCeEEE Confidence 99999999999997444455667999999999999999999999999999999999999999999999999999999999 Q ss_pred EeccCCcccccccccccccccccccccccccccceeeeeecccccc-ccceeeeeec--cCcceeeeecccccccccccc Q lcl|NC_014792. 81 VRVVNRDHAKNASPVAGNIESTIATAGSNYAVGDVIQVKHNQTVVE-TSGRITKVDV--DGKILAVFIPSDKIIAFAKSV 157 (659) Q Consensus 81 vRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~-~~~~~~~~~~--~g~~~~~~~~~~~~~~~~~~~ 157 (659) |||.+.+..+++....+.... ....+....+++.+.+........ ....+...+. ++.....+............. T Consensus 81 vrv~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~v~~~~~~ 159 (671) T protein:vir:56 81 VRICDATTAQNATPLYNAVEY-TIGASNGCVVGDDITITYSGVGALTAKGKVLEVDAGNNNAASKIFLPSAEIVAAAKSD 159 (671) T ss_pred EEecCccccccchhhcccccc-ccccCcceeeceeeeeecCcccccccCcceeEEeeeccceeeeeeccceeEEEeeecc Confidence 999988766666554443332 233445566677776655433211 1111211111 111111111111100000000 Q ss_pred ceeeeeccceeeEEEeecCCcccccccccee-ccccc-eeeecccc-------cccccccceeecccccccceeeecccc Q lcl|NC_014792. 158 NQYPDLGPAWTAEILTTSSGVSGTITLGKIV-TDSGI-LLTEAENS-------EEAITSLEFQASLQKYAMPGVVALYPG 228 (659) Q Consensus 158 ~~~~~~~~~~~~~v~~~~~g~~~~~~~~~~v-~~~~~-~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~a~~~g 228 (659) ....... .....+......+.... .+.+. ........ ........+.........+...+.+.+ T Consensus 160 ~~~~~~~-------~~t~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g 232 (671) T protein:vir:56 160 GNYPSVG-------TITLQPTQGDIALTNIEIIDTGSVYFPNIELAFDALTAIETEGGALKYADLIEKQGFPRLSARYVG 232 (671) T ss_pred ccccccc-------cccccccccceeeeeecccccceEEEeccccccccccccccccccccchhhhhccccccccccccc Confidence 0000000 00001111000000000 00000 00000000 000111111122222223344556778 Q ss_pred ccceeEEEEEeeccccccc--cee--eeeee-----ccccccccceeeeeeeccccccceeeeeccCCceeeeeeeeccc Q lcl|NC_014792. 229 EIGSTLEVEIVSKAAYDVG--ASK--MLDIY-----PNGGSRASVARAVFNYGPQTDDQYAIIVRRDGAIVENVVLSTKE 299 (659) Q Consensus 229 ~~g~~i~V~v~~~~~~~~~--~~~--~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~et~~~~~~~ 299 (659) .+++.+.+.+......... ... ..... ................+...+..+.+++..++...|++.++.+. T Consensus 233 ~~g~~~~v~v~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~~~~~~~~~~ 312 (671) T protein:vir:56 233 DFGDAISVEIINYADYQTAFAFAAGHTLGDIELPIYPDGGTRSINLSSYFTFGPSNSNQYAVIVRVSGEVEEAFIVSTNP 312 (671) T ss_pred ccCcceEEEEecccccccccccccceeeeeccccccccccccccccceeecccccccccceeEEeecCccceeEEEeecc Confidence 8888888776543221111 111 11111 11122222233334445556666777788888999999988888 Q ss_pred cccccccchhhhhhhhhcccccceEEeecccCCccceeEEeecccccccccchhhhhhhHhhhhhcccccceEEEecccc Q lcl|NC_014792. 300 GDKDVYGNNIYLDDYFAKGTSNYIYATSLNWPKGFAGIINLMGGISANDQVTAGDLMQGWDLFADREALHINLLIAGAVA 379 (659) Q Consensus 300 ~~~~~~~~~~~~~~~~~~~~s~~v~~~~~~~~~~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~ 379 (659) ++........++.....++.+.++....... .......++.||.++. ....++.++++.+...+...++++++|+.. T Consensus 313 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~gg~d~~--~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~ 389 (671) T protein:vir:56 313 GDKDVNGQSIFIDEYFENSGSAYITAIAEGW-KTESGAYNFGGGSDAN--AGADDWMFGLDMLSDPEVLYTNLVIAGNAA 389 (671) T ss_pred cccccchhhhhhhhhhcccCceEEEecCccc-CCccccccccCccccc--cchhHHHHHHHhhhhccccceeEEEcCCCC Confidence 8777777777666666666554443333332 2344556788887663 345667888888887777889999999877 Q ss_pred ccchhhhHHHHHH-HHHHHHhhCCEEEEEecCccccccccccCCHHHHHHHhhccccccc----cccccccceEEEEcCc Q lcl|NC_014792. 380 GEGDATASTVQKH-VVSIADERQDCLAFISPPKGLLVNVPLTRAVDNLIDWRTGGGSFDT----DNMNISTTYAAIDGNY 454 (659) Q Consensus 380 ~~~~~~~~~v~~~-l~~~~~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~s~~~~~~~p~ 454 (659) .........++.+ +..+|+.++++|+++|+|.....+.....+.+++.+||+.....+. .+.+++|+|+++|||| T Consensus 390 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~ 469 (671) T protein:vir:56 390 AEEVSIASTVQKYAIDSVGNVRQDCVVFVSPPQSLIVNKQAGTAVANIQGWRTGIDPTNGQAVVDNLNVSTTYAVIDGNY 469 (671) T ss_pred CccchhHHHHHHHHHHHHHhhcCCEEEEEeccccccccccccccHHHHHHHhhhccccchhhhhhhccCCcceEEEecCc Confidence 6655555555555 5556677889999999999988888888999999999987654433 3457889999999999 Q ss_pred eeEecccCCcceeecHHHHHHHHHHHhhhcCCceECcCCcchhheeccccceeecChhHHHhhhhCCceEEEEEeCCCeE Q lcl|NC_014792. 455 KYQYDKYNDVNRWVPLAADMAGLCARTDDVSQPWMSPPGYNRGQILNVLKLAIEPRQTQRDRMYQEAINPVVGFAGGDGF 534 (659) Q Consensus 455 ~~~~d~~~~~~~~~p~s~~~Ag~~a~~d~~~g~~~span~~~~~i~g~~~~~~~~~~~e~~~Ln~~gin~i~~~~~~~G~ 534 (659) ++++|+.+++.+++|||+++||+|||+|.++||||||||+++++|.|+.++++.+++.|++.||++|||+|++|++ +|+ T Consensus 470 ~~~~d~~~~~~~~~p~s~~~AGl~Ar~D~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~-~G~ 548 (671) T protein:vir:56 470 KYQYDKYNDRNRWVPLAGDIAGLCAYTDQVSQPWMSPAGFNRGQIKGVNRLAVDLRRAHRDALYQIGINPVVGFAG-QGF 548 (671) T ss_pred eEEecccCCceeEechHHHHHHHHHHhhccCCcEECcCCceeccccccccceeecChhHHHHHhhCCceEEEEecC-CeE Confidence 9999999999999999999999999999999999999999999999999999999999999999999999999997 799 Q ss_pred EEEcccccCCCccccceeehhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhccceeeeEEEEccCCC Q lcl|NC_014792. 535 VLYGDKTATKVPSPMDHINVRRLTNMLKKNIGDASKYKLFELNDNFTRASFRMETSQYLDGIRALGGIYEGRVVCDTTNN 614 (659) Q Consensus 535 ~~wG~rT~~~~~~~~~~i~vrR~~~~i~~~i~~~~~~~v~epn~~~l~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~n 614 (659) ++||+||+++++++|+|||+||||+||+++|+++++|+||||||+.||++|+++|+.||++||++|+|+||+|+||+++| T Consensus 549 ~~wG~rT~~~~~~~~~~i~vrR~~~~i~~si~~~~~~~v~epn~~~~~~~i~~~i~~fL~~l~~~gal~g~~v~~d~~~n 628 (671) T protein:vir:56 549 VLYGDKTATQQASAFDRINVRRLFNLLKKAISDAAKYRLFELNDEFTRSSFKSEIDAYLTNIQDLGGVYDFRVVCDETNN 628 (671) T ss_pred EEEcceecCCCCcccceEehhhHHHHHHHHHHHHHHHhcCCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEcCCCC Confidence 99999999988889999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CHHHhhCCEEEEEEEEEecCCceEEEEEEEEeecCeeEEEecC Q lcl|NC_014792. 615 TPSVIDRNEFVASIYYKPARSINYIVLNFVATSTGADFDELIG 657 (659) Q Consensus 615 t~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~~ 657 (659) |+++|++|+|+++|+|+|++|+|||+|||+|++++++|+||+| T Consensus 629 t~~~i~~G~~~~~i~~~p~~Pae~I~~~~~~~~~~~~f~e~~~ 671 (671) T protein:vir:56 629 PGSVIDRNEFVASIYVKPAKSINFITLNFVATSTDADFAEIIG 671 (671) T ss_pred CHHHhhCCeEEEEEEEEecCCcceEEEEEEEeecCcchhhhcC Confidence 9999999999999999999999999999999999999999999 No 13 >protein:vir:104477 Length: 749 # NCBI annotation: gp18 # Family: family:all:661 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214663;genbank:gi:61806304;genbank:GeneID:3294532 Probab=100.00 E-value=1.2e-139 Score=782.22 Aligned_cols=642 Identities=29% Similarity=0.450 Sum_probs=415.0 Q ss_pred Cc--eecCceEEEEecCCCcccccCCcceEEEeecccCCCCccEEeCCHHHHHHHcCCcCCCchhHHHHHHHHHcCCCeE Q lcl|NC_014792. 1 MA--LLSPGIELKETTVQSTVVRNATGRAALVGKFQWGPAFQVTQITNEVELVDLFGGPNNITADYFMSGMNFLQYGNDL 78 (659) Q Consensus 1 ~~--~~~PGVyveE~~~~~~~~~~~ts~~afvG~~~~Gp~~~p~~i~s~~~~~~~fG~~~~~~~~~~~~~~~f~ngG~~~ 78 (659) |+ |+||||||||+|++.++.+++||++||||+|+|||+|+|++|+||.||++.||+|++.+|++|++++||+|||++| T Consensus 1 M~~~~~~PgVyv~e~~~~~~~~~~~t~~~~fvG~~~~Gp~~~p~~v~s~~~~~~~fG~~~~~~~~~~~v~~~F~ngg~~~ 80 (749) T protein:vir:10 1 MATNQSSPGVVIQERDLTTVSTIPTANVGVIAAPFTKGPVEEVIEITSERQLAEKFGEPNESNYEYWFSAAQFLSYGGLL 80 (749) T ss_pred CCccccCCeeEEEEecCCcccccccCceeEEEeccCCCCCccCEEcCCHHHHHHHcCCccCCcccHHHHHHHHhhcCCeE Confidence 88 9999999999999888888889999999999999999999999999999999999999999999999999999999 Q ss_pred EEEeccCCccccccccccccc----------------ccccccccccccccceeeeeeccccccccceeeeeeccCccee Q lcl|NC_014792. 79 RTVRVVNRDHAKNASPVAGNI----------------ESTIATAGSNYAVGDVIQVKHNQTVVETSGRITKVDVDGKILA 142 (659) Q Consensus 79 ~vvRv~~~~~~~~a~~~~~~~----------------~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~g~~~~ 142 (659) ||||+.+.+ ++++....+.. ......+..++.||+.+++............+... ....... T Consensus 81 ~vvRv~~~~-~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~pG~~gn~l~v~v~~~~~~~~~~~~~~-~~~~~~~ 158 (749) T protein:vir:10 81 KTIRVNSSS-LKNAVDTGTAPLVKNLQDYETSIEDASNNFSWVARTPGDTGNSIGIFVTDAGADQVVVVPAP-GSGNEHE 158 (749) T ss_pred EEEEccCcc-ccccccccccccccccccccccccccccceEEEeccCCCcCCceEEEEEcCCCceeeeeecC-Cccceee Confidence 999997654 22222211110 01112345678899988876543322111111100 0000000 Q ss_pred eeeccccc-----c--ccccccceeeeeccceeeEEEe-ecCCccccccccceeccccc---eeeecccccc-------- Q lcl|NC_014792. 143 VFIPSDKI-----I--AFAKSVNQYPDLGPAWTAEILT-TSSGVSGTITLGKIVTDSGI---LLTEAENSEE-------- 203 (659) Q Consensus 143 ~~~~~~~~-----~--~~~~~~~~~~~~~~~~~~~v~~-~~~g~~~~~~~~~~v~~~~~---~~~~~~~~~~-------- 203 (659) ........ . ....+................. ..........+.....+... .+........ T Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~a~~~~~~~~~~~~~~~~~~s~~~~~~~a~~~~ 238 (749) T protein:vir:10 159 FVADAAVSAASGAAGKVFKYSIILTIDDVVGTFAPGSATTITIGGSAESVNVLAYDATNKKLEIGLPSGGVTGILADNQV 238 (749) T ss_pred EEeeecccccccccccccccceeeeeccccceeecccceeeeccCcccccccccccCCcceEEEeeecccccceeeeeec Confidence 00000000 0 0000000000000000000000 00000000000000000000 0000000000 Q ss_pred ------------cccccceeecccccccceeeeccccccceeEEEEEeecccccccceeeeeeeccccccccceeeeeee Q lcl|NC_014792. 204 ------------AITSLEFQASLQKYAMPGVVALYPGEIGSTLEVEIVSKAAYDVGASKMLDIYPNGGSRASVARAVFNY 271 (659) Q Consensus 204 ------------~~~~~~~~~~~~~~~~~~~~a~~~g~~g~~i~V~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 271 (659) .+.........................+..+.+............. ...........+......... T Consensus 239 v~~~~~~~~~~~~i~~~~~~~~~~~~~~~a~~~~~~~~~g~~~~it~v~~~~~~~~~~-t~~~~~~~a~~~gt~~~~~~~ 317 (749) T protein:vir:10 239 ITQGTNTAKINVTIERKLLVALNKSSIEFAASDVVQDTNSTNITITSVRDEYTEREYL-PGVKWINVAPRPGTSLYANGV 317 (749) T ss_pred ccccccccccccccccchhhhhccccceeeccccccCCccceeEEEeeeccccccccc-cceeeccccccccceeeeecc Confidence 0000000000000000000001112222222222111100000000 000011111112222222223 Q ss_pred ccccccceeeeeccC-------Cceeeeeeeecccc-ccccccchhhhhhhhhcccccceEEeecccC------------ Q lcl|NC_014792. 272 GPQTDDQYAIIVRRD-------GAIVENVVLSTKEG-DKDVYGNNIYLDDYFAKGTSNYIYATSLNWP------------ 331 (659) Q Consensus 272 ~~~~~~~~~~~v~~~-------g~~~et~~~~~~~~-~~~~~~~~~~~~~~~~~~~s~~v~~~~~~~~------------ 331 (659) +...+..+.+++..+ +.++|.+....++. .+...+...++...+.. .+.++++.....+ T Consensus 318 ~g~~D~~~v~v~~~~g~~~~~~g~v~e~~~~~~~~~~~~~~~~~~~~~~~~~~~-~s~~v~~~~~~~~~~~~~~~~~~~~ 396 (749) T protein:vir:10 318 GGHRDEMHVILVDIDGGVTGTVGALLERYIDVSKASDAKTSVGETNYYAEVIKQ-KSEFIYWAEHESTLYAATSSASDGL 396 (749) T ss_pred cCCCCceEEEEecCCCeeeecccceeeeeeeccccccccccccccchhhhhhcc-CCCEEEEEecccccccccccccccc Confidence 333343443344333 45677776654443 34444444444444433 2344432111100 Q ss_pred --------------------------------CccceeEEeecccccc-----cccchhhhhhhHhhhhhcccccceEEE Q lcl|NC_014792. 332 --------------------------------KGFAGIINLMGGISAN-----DQVTAGDLMQGWDLFADREALHINLLI 374 (659) Q Consensus 332 --------------------------------~~~~~~~~~~gg~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~ 374 (659) ......+.+.+|.|.. ...+..++.++++++...+...+++++ T Consensus 397 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~d~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~li 476 (749) T protein:vir:10 397 FGQTAANRQFNLFRSAAGSVDYPAGVTTLGSKNNATYYYRLSGGVNYTVSAGQYTITNTDIGSAYELIGDPESQIVDFII 476 (749) T ss_pred cccccccceeeccccccccceeccccccccccCCcEEEEEccCCcccccccccccccchhHHHHHHHhhhhhhcccceEE Confidence 0000123345554432 234556788889999888888888887 Q ss_pred eccccccchhhhHHHHHHHHHHHHhhCCEEEEEecCcccccccccc-CCHHHHHHHhhccccccccccccccceEEEEcC Q lcl|NC_014792. 375 AGAVAGEGDATASTVQKHVVSIADERQDCLAFISPPKGLLVNVPLT-RAVDNLIDWRTGGGSFDTDNMNISTTYAAIDGN 453 (659) Q Consensus 375 ~p~~~~~~~~~~~~v~~~l~~~~~~~~~~~ai~d~p~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p 453 (659) ++... ..+.+..+|+.+|++||+++++||+++|+|.+...+.... ....++..|+.. +.+|+|+++||| T Consensus 477 ~~~~~-~~~~~~~~v~~al~~~~~~~~~~~~~~d~p~~~~~~~~~~~~~~~~~~~~~~~---------~~~s~~~~~~~p 546 (749) T protein:vir:10 477 SGPSG-TSDANALAKITSLVNIAEERRDCMVFVSPRRGNVIGISNTTTITTNIVDFFKK---------LPSSSYMVFDSG 546 (749) T ss_pred EecCC-CCcchhHHHHHHHHHHHhhcCCEEEEEcCCCCcccccccchhhhhHHHHHHhh---------ccCceeEEEEcc Confidence 76533 2345667899999999999999999999998876554332 334566777654 346899999999 Q ss_pred ceeEecccCCcceeecHHHHHHHHHHHhhhcCCceECcCCcchhheeccccceeecChhHHHhhhhCCceEEEEEeCCCe Q lcl|NC_014792. 454 YKYQYDKYNDVNRWVPLAADMAGLCARTDDVSQPWMSPPGYNRGQILNVLKLAIEPRQTQRDRMYQEAINPVVGFAGGDG 533 (659) Q Consensus 454 ~~~~~d~~~~~~~~~p~s~~~Ag~~a~~d~~~g~~~span~~~~~i~g~~~~~~~~~~~e~~~Ln~~gin~i~~~~~~~G 533 (659) |++++|+.+++.+++|||+++||+|||+|.++||||||||+++.+|.|++++++.+++.|++.||++|||||++|++ +| T Consensus 547 ~~~~~d~~~~~~~~~p~s~~vAGl~Ar~D~~~g~~~SPan~~~~~i~g~~~~~~~~~~~e~~~Ln~~gIn~i~~~~g-~G 625 (749) T protein:vir:10 547 YKYIYDKYNDVYRYIPCNGDTAGLCLQTNEISEPWFSPAGFQRGVLRNAIKLAYTPNKAQRDQLYANRVNPIVSFPG-QG 625 (749) T ss_pred ceeeeccccCceEEechHHHHHHHHHHhhccCCcEECcCCceeeeeeccccceeecChhHHHhhhhCCceEEEEecC-Ce Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999997 79 Q ss_pred EEEEcccccCCCccccceeehhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhccceeeeEEEEccCC Q lcl|NC_014792. 534 FVLYGDKTATKVPSPMDHINVRRLTNMLKKNIGDASKYKLFELNDNFTRASFRMETSQYLDGIRALGGIYEGRVVCDTTN 613 (659) Q Consensus 534 ~~~wG~rT~~~~~~~~~~i~vrR~~~~i~~~i~~~~~~~v~epn~~~l~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~ 613 (659) +++||+||+++.|++|+||||||||+||+++|+++++|+||||||+.||++|+++|+.||++||++|+|.||+|+||+++ T Consensus 626 ~~~wG~rT~~s~d~~~~~i~vRRl~~~ie~si~~~~~~~v~epn~~~l~~~i~~~i~~fL~~l~~~G~i~~f~V~~d~~~ 705 (749) T protein:vir:10 626 VVLYGDKTALGFASAFDRINIRRLFLTVERVISTAAKAQLFEQNDEAQRSLFINIVEPYLRDVQGRRGVVDFLVKCDSTN 705 (749) T ss_pred EEEEcceecCCCCcccceeehhhhHHHHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHHHHHHHhcCCeeeeEEEEcCCC Confidence 99999999977778999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCHHHhhCCEEEEEEEEEecCCceEEEEEEEEeecCeeEEEecC Q lcl|NC_014792. 614 NTPSVIDRNEFVASIYYKPARSINYIVLNFVATSTGADFDELIG 657 (659) Q Consensus 614 nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~~ 657 (659) ||+++|++|+|+++|+|+|++|||||+|||+|++++++|+|+.. T Consensus 706 Nt~~~i~~G~~~~~i~~~P~~pae~I~~~~~~~~~~~~~~e~~s 749 (749) T protein:vir:10 706 NTPEAVDRGEFYAEVFLKPTRTINYVQLTFVATRTGVSFAEVAS 749 (749) T ss_pred CCHHHhhCCEEEEEEEEEecCCccEEEEEEEEeecCcchHHHhC Confidence 99999999999999999999999999999999999999999999 No 14 >protein:vir:106984 Length: 743 # NCBI annotation: contractile sheath protein gp18 # Family: family:all:661 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195136;genbank:gi:58532913;uniprot:Q5GQN6;genbank:GeneID:3260481 Probab=100.00 E-value=2.3e-139 Score=780.69 Aligned_cols=643 Identities=30% Similarity=0.464 Sum_probs=409.0 Q ss_pred Cc-eecCceEEEEecCCCcccc-cCCcceEEEeecccCCCCccEEeCCHHHHHHHcCCcCCCchhHHHHHHHHHcCCCeE Q lcl|NC_014792. 1 MA-LLSPGIELKETTVQSTVVR-NATGRAALVGKFQWGPAFQVTQITNEVELVDLFGGPNNITADYFMSGMNFLQYGNDL 78 (659) Q Consensus 1 ~~-~~~PGVyveE~~~~~~~~~-~~ts~~afvG~~~~Gp~~~p~~i~s~~~~~~~fG~~~~~~~~~~~~~~~f~ngG~~~ 78 (659) |. ||+|||||||+|+++++++ +.|+++||||+|+|||+|+|++|+||.||++.||++++.+|++|++++||+|||++| T Consensus 1 m~~~~~Pgvyv~e~~~~~~~i~~~~t~~~~~vg~~~~Gp~~~p~~i~s~~~~~~~fG~~~~~~~~~~~v~~~f~ngg~~~ 80 (743) T protein:vir:10 1 MASQVSPGILIKERDLTNAVVTGALQIRAAHASTFAKGPIGDIVNINTQKELVSVFGEPKEDNAEDWMVASEFLNYGGRL 80 (743) T ss_pred CccccCCceEEEEecCCCceeccCCcceeEEEEeccCCCCCcCEEecCHHHHHHHcCCccCCcchHHHHHHHHHhCCceE Confidence 87 9999999999999997665 569999999999999999999999999999999999999999999999999999999 Q ss_pred EEEeccCCcccccccccccc-------------cccccccccccccccceeeeeeccccccccceeeeeeccCcc-eeee Q lcl|NC_014792. 79 RTVRVVNRDHAKNASPVAGN-------------IESTIATAGSNYAVGDVIQVKHNQTVVETSGRITKVDVDGKI-LAVF 144 (659) Q Consensus 79 ~vvRv~~~~~~~~a~~~~~~-------------~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~g~~-~~~~ 144 (659) |||||.+++.. +++..... .......+..++.||+.+++...................... .... T Consensus 81 ~vvrv~~~~~~-~a~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~G~~gN~i~V~v~~~~~d~~~~~~~~~~~~~~~~~~~ 159 (743) T protein:vir:10 81 AVVRAETTGVL-NATTGSAGVLVKNRESWDAGSGNGEVFVARTAGSWGNSLMGVLVDRGADYIVTFAATPTDTAVGTQLL 159 (743) T ss_pred EEEEccCcccc-ccccccccccccccccccccccceeEEEEeeccccccceEEEEecCCCcceeeeeccccccccceeee Confidence 99999876532 22221111 111122345677888888876543221111111100000000 0000 Q ss_pred eccccccccccccceeeeeccceeeEEE--e-----ecCCcccccc-ccce----ecc-cc-ce-eeeccccccc-cccc Q lcl|NC_014792. 145 IPSDKIIAFAKSVNQYPDLGPAWTAEIL--T-----TSSGVSGTIT-LGKI----VTD-SG-IL-LTEAENSEEA-ITSL 208 (659) Q Consensus 145 ~~~~~~~~~~~~~~~~~~~~~~~~~~v~--~-----~~~g~~~~~~-~~~~----v~~-~~-~~-~~~~~~~~~~-~~~~ 208 (659) ........ ...+............... . .......... .... ... .+ .. .......... .... T Consensus 160 ~~~~~~~~-~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tv 238 (743) T protein:vir:10 160 FSYSGTLV-TGEILSYDSATNTATITASGTLTSQYLLDTPEQGLIGSFTDNSTTEVGRTPGTYSNVPASGGTGTGATFNV 238 (743) T ss_pred eccccccc-ccceeeeeecCcceeeeeccccceeeecccccccccccccccccccccccccceeeEEecccccccccccc Confidence 00000000 0000000000000000000 0 0000000000 0000 000 00 00 0000000000 0000 Q ss_pred ceeec----ccccccceeeeccccccceeEEEE-------------Eeecccccccceee----------eee--ecccc Q lcl|NC_014792. 209 EFQAS----LQKYAMPGVVALYPGEIGSTLEVE-------------IVSKAAYDVGASKM----------LDI--YPNGG 259 (659) Q Consensus 209 ~~~~~----~~~~~~~~~~a~~~g~~g~~i~V~-------------v~~~~~~~~~~~~~----------~~~--~~~~~ 259 (659) ..... .................+....+. .............. ... ..+.. T Consensus 239 ~v~~~~~~vg~~v~~~~~~~~~~~~~~~~~~v~~~~~~~~t~~~~~~~~~~~g~~~~~a~~~~~~~~~~~~~~~~~~~~~ 318 (743) T protein:vir:10 239 VVADAGGGVGGSVVVTLANPGTGYNQGETLTIASAATGDGTDILVTVATLSDGTIAITELKDWYLNTEIGSTGIKLGDIG 318 (743) T ss_pred cccccccccccccccccccccceeeeccccccccccccccccchhheecccccceeeeecccccccchhhcccccccccc Confidence 00000 000000000000000000000000 00000000000000 000 00000 Q ss_pred ccccceeeeeeeccccccceee-------eeccCCceeeeeeeecc-ccccccccchhhhhhhhhcccccceEE------ Q lcl|NC_014792. 260 SRASVARAVFNYGPQTDDQYAI-------IVRRDGAIVENVVLSTK-EGDKDVYGNNIYLDDYFAKGTSNYIYA------ 325 (659) Q Consensus 260 ~~~~~~~~~~~~~~~~~~~~~~-------~v~~~g~~~et~~~~~~-~~~~~~~~~~~~~~~~~~~~~s~~v~~------ 325 (659) ..+.........+...+..... .....+..+|++.+... ...++..+...++...+.. .+.++.. T Consensus 319 ~~~~t~~~~~~~~~~~d~~~v~v~~~~~~~~~~~~~v~~~~~~~s~~~~~~~~~~~~~~~~~~~~~-~s~~~~~~~~~~~ 397 (743) T protein:vir:10 319 PRPGTSQFATDNGITDDQVHFAVIDTTGELTGTANTIVERLTYLSKLSDARSEENANIYYKNVINE-QSAYLYHGNDAAV 397 (743) T ss_pred ccceeeeccccccccccceEEEEecCcceeeeccCceeEEEeeeecccccccccCcceeecceecc-ccceeeccCcccc Confidence 0000000000001111111111 12234566777766443 3333333333333333221 2222211 Q ss_pred ------------------eec-ccCCccceeEEeecccccccccchhhhhhhHhhhhhcccccceEEEeccccccchhhh Q lcl|NC_014792. 326 ------------------TSL-NWPKGFAGIINLMGGISANDQVTAGDLMQGWDLFADREALHINLLIAGAVAGEGDATA 386 (659) Q Consensus 326 ------------------~~~-~~~~~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~ 386 (659) ... ..........++.||.|+. .++..++.++++++...+..+++|+++|+.... ..+. T Consensus 398 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gG~d~~-~~~~~~~~~~~~~~~~~~~~~~~ll~~p~~~~~-~~~~ 475 (743) T protein:vir:10 398 QIAASGEAWGQSSDQVLADAGTAFSRTTGYWVNLAGGNDDF-AYDAGEFGAAMDLFLDTEETEIDFVLMGGSMAD-EADT 475 (743) T ss_pred eeeeccccCccccceeeeecccccccccceEEEeecCcccc-ccchhHHHHHHHHhhhccccCcceEEecCcccC-ccch Confidence 110 1112223446788887754 346777888999998888888999999998654 3456 Q ss_pred HHHHHHHHHHHHhhCCEEEEEecCcccccccc------ccCCHHHHHHHhhccccccccccccccceEEEEcCceeEecc Q lcl|NC_014792. 387 STVQKHVVSIADERQDCLAFISPPKGLLVNVP------LTRAVDNLIDWRTGGGSFDTDNMNISTTYAAIDGNYKYQYDK 460 (659) Q Consensus 387 ~~v~~~l~~~~~~~~~~~ai~d~p~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~ 460 (659) .+|+++|++||+++++||+|+|+|++...... ...+..+...|++. +++|+|+++||||++++|+ T Consensus 476 ~~v~~a~~~~~~~~~~~~a~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~s~~~~~~~p~~~~~d~ 546 (743) T protein:vir:10 476 KSKATKVIAIAASRKDALAFVSPHKGNQIASTGNVALSSAQQKENTIAFFSD---------LTSTSYAVFDSGYKYVYDR 546 (743) T ss_pred HHHHHHHHHHHHhhCCeEEEEecCCCccccccccccccccccchHHHHHHHh---------ccCCeeEEEEccceeeecc Confidence 78999999999999999999999987654332 22334555666543 4578999999999999999 Q ss_pred cCCcceeecHHHHHHHHHHHhhhcCCceECcCCcchhheeccccceeecChhHHHhhhhCCceEEEEEeCCCeEEEEccc Q lcl|NC_014792. 461 YNDVNRWVPLAADMAGLCARTDDVSQPWMSPPGYNRGQILNVLKLAIEPRQTQRDRMYQEAINPVVGFAGGDGFVLYGDK 540 (659) Q Consensus 461 ~~~~~~~~p~s~~~Ag~~a~~d~~~g~~~span~~~~~i~g~~~~~~~~~~~e~~~Ln~~gin~i~~~~~~~G~~~wG~r 540 (659) .+++++++|||+++||++||+|.++||||||+|+++.+|.|++++++.+++.|++.||++|||||++|++ +|+++||+| T Consensus 547 ~~~~~~~~p~s~~~AGl~a~~D~~~g~~~span~~~~gi~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~-~G~~~wG~r 625 (743) T protein:vir:10 547 FTDKYRYIPCNGDVAGLCVQTSNQLDDWYSPAGLNRGGILNAVKLAYNPNKADRDELYQNRINPVVSLRG-QGITLFGDK 625 (743) T ss_pred ccCceeEechhHHHHHHHHHhhccCCcEEccCCeeeeeeeccccceecCChhHHHhHhhCCceEEEEecC-CeEEEEccc Confidence 9999999999999999999999999999999999999999999999999999999999999999999987 799999999 Q ss_pred ccCCCccccceeehhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhccceeeeEEEEccCCCCHHHhh Q lcl|NC_014792. 541 TATKVPSPMDHINVRRLTNMLKKNIGDASKYKLFELNDNFTRASFRMETSQYLDGIRALGGIYEGRVVCDTTNNTPSVID 620 (659) Q Consensus 541 T~~~~~~~~~~i~vrR~~~~i~~~i~~~~~~~v~epn~~~l~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~nt~~~i~ 620 (659) |++++|++|+|||+||||+||+++|+++++|+||||||+.+|++|+++|++||++||++|+|.||+|+||+++||+++|+ T Consensus 626 T~~s~d~~~~~i~vrR~~~~i~~si~~~~~~~v~e~n~~~~~~~i~~~i~~fL~~l~~~gal~~~~V~~d~~~nt~~~i~ 705 (743) T protein:vir:10 626 TALAAPSAFDRINVRRLFLNLEKRARRLAEGVLFEQNDATTRAGFSSALNSYLSEVQARRGVTDYLVICDESNNTPDIID 705 (743) T ss_pred ccCCCCcccceEeehhhHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhcCceeeeEEEEcCCCCCHHHhh Confidence 99877889999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCEEEEEEEEEecCCceEEEEEEEEeecCeeEEEecCC Q lcl|NC_014792. 621 RNEFVASIYYKPARSINYIVLNFVATSTGADFDELIGV 658 (659) Q Consensus 621 ~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~~~ 658 (659) +|+|+++|+++|++|+|||+|||+|+++|++|+||++. T Consensus 706 ~G~~~~~i~~~p~~pae~I~~~~~~~~~~~~~~e~~~~ 743 (743) T protein:vir:10 706 RNEFVAEVYVKPTRSINFITITFTATKTGVTFSEVVGR 743 (743) T ss_pred CCeEEEEEEEEecCCcceEEEEEEEeecCcchHhhhcC Confidence 99999999999999999999999999999999999999 No 15 >protein:vir:104858 Length: 729 # NCBI annotation: T4-like tail sheath protein # Family: family:all:661 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214361;genbank:gi:61806001;genbank:GeneID:3294378 Probab=100.00 E-value=2.9e-136 Score=763.66 Aligned_cols=629 Identities=30% Similarity=0.479 Sum_probs=384.2 Q ss_pred CceecCceEEEEecCCCcccc-cCCcceEEEeecccCCCCccEEeCCHHHHHHHcCCcC--CCchhHHHHHHHHHcCCCe Q lcl|NC_014792. 1 MALLSPGIELKETTVQSTVVR-NATGRAALVGKFQWGPAFQVTQITNEVELVDLFGGPN--NITADYFMSGMNFLQYGND 77 (659) Q Consensus 1 ~~~~~PGVyveE~~~~~~~~~-~~ts~~afvG~~~~Gp~~~p~~i~s~~~~~~~fG~~~--~~~~~~~~~~~~f~ngG~~ 77 (659) |+|++|||||||+++++++++ ++||++||||+|+|||+|+|++|+||.||+++||+|. +.++++|++++||+|||++ T Consensus 3 ~~~~~PgVyv~e~~~~~~~i~~v~ts~~~fvG~~~~Gp~~~p~~i~s~~~~~~~fG~~~~~~~~~~~~~~~~~f~ngg~~ 82 (729) T protein:vir:10 3 LNLASPGIVVREVDLTIGRVDPTSGSIGALVAPFAKGPVNDPQLIESEEDLLQTFGQPYSTDKHYEYWMVASSYLAYGGT 82 (729) T ss_pred ccccCCceEEEEecCCCcccccccccceeEEeccccCCCccCeEcCCHHHHHHHcCccccCCcchhHHHHHHHHHhCCce Confidence 669999999999999987665 5699999999999999999999999999999999984 5678899999999999999 Q ss_pred EEEEeccCCccccccccccccccc-c-----------------------------cccccccccccceeeeeeccccccc Q lcl|NC_014792. 78 LRTVRVVNRDHAKNASPVAGNIES-T-----------------------------IATAGSNYAVGDVIQVKHNQTVVET 127 (659) Q Consensus 78 ~~vvRv~~~~~~~~a~~~~~~~~~-~-----------------------------~~~~~~~~~~~~~~~v~~~~~~~~~ 127 (659) |||||+.+.+++.++. ..+.... . ......++.|++.+++......... T Consensus 83 ~~vvRv~~~~~~~a~~-~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~G~~gn~~~v~v~~~~~~~ 161 (729) T protein:vir:10 83 MQVVRADDYNTQTGVG-LKNAFVTGGVGVGATMLRITSNTHYNQLGYDENTISGVSVAAKNPGTWANGIKVAIIDGKADQ 161 (729) T ss_pred EEEEecCccccccccc-ccccccccccccccccccccccccccccccccCCCcceEEEEeccCccccceeeEEecccCcc Confidence 9999998865433221 1111000 0 0011122233333333221110000 Q ss_pred cceeeeeeccCcceeeeeccccccccccccceeeeeccceeeEEEeecCCccccccccceec--cccceeeecccccccc Q lcl|NC_014792. 128 SGRITKVDVDGKILAVFIPSDKIIAFAKSVNQYPDLGPAWTAEILTTSSGVSGTITLGKIVT--DSGILLTEAENSEEAI 205 (659) Q Consensus 128 ~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~g~~~~~~~~~~v~--~~~~~~~~~~~~~~~~ 205 (659) ...+. ...+........... ..... .............+............ ................ T Consensus 162 ~~~~~--~~~~~~~~t~~~~~~-------~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~v~~~s~~~~~~~~~~~~~~ 230 (729) T protein:vir:10 162 ILTVA--SGNTTAVGSAVTQSI-------SKTIG--TATGTTTIDGVLKGIVTGSTDTTLEVKVISHISAAGVETAVEYQ 230 (729) T ss_pred eeeee--ccccccceeeeeeec-------ccccc--ccccceeeeeeecccccccccccccceecccccccccceecccc Confidence 00000 000000000000000 00000 00000000000000000000000000 0000000000000000 Q ss_pred cccceeecccccccceeeeccccccceeEEEEEeec--cccc----ccceeeeeeeccccccccceeeeeeeccccccce Q lcl|NC_014792. 206 TSLEFQASLQKYAMPGVVALYPGEIGSTLEVEIVSK--AAYD----VGASKMLDIYPNGGSRASVARAVFNYGPQTDDQY 279 (659) Q Consensus 206 ~~~~~~~~~~~~~~~~~~a~~~g~~g~~i~V~v~~~--~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 279 (659) ..... ................+. +.......... .... ........+.+...+..... ......+... T Consensus 231 ~~~~~-~~~~~~s~~~~a~~~~~~-~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~~----~~~~~~d~~~ 304 (729) T protein:vir:10 231 QNGTY-TFDNSGSVNVIAAGSSGS-GSAKSYTAQTDWFESQNIVLSNSTLEWDSIADAPGTSTYVS----TRGGKNDEIH 304 (729) T ss_pred cccee-eecccCccceeeeccccc-cccccceeeeccccccccccccccccccccccccccccccc----cccccccccc Confidence 00000 000000000000000000 00000000000 0000 00000000000000000000 0000000000 Q ss_pred e-------eeeccCCceeeeeee-eccccccccccchhhhhhhhhcccccceEEe------------------------- Q lcl|NC_014792. 280 A-------IIVRRDGAIVENVVL-STKEGDKDVYGNNIYLDDYFAKGTSNYIYAT------------------------- 326 (659) Q Consensus 280 ~-------~~v~~~g~~~et~~~-~~~~~~~~~~~~~~~~~~~~~~~~s~~v~~~------------------------- 326 (659) . ......+.++|.+.. +.........+...+....+.. .+.++.+. T Consensus 305 ~~~~d~~~~~~~~~g~vve~~~~~s~~~~~~~~~~~~~~~~~vi~~-~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 383 (729) T protein:vir:10 305 VLVIDDKGTITGNSGTILEKHLSLSKAKDAEYSVGSSSYWRDFLAT-NSKYIFGGGATSGITTTGYSVSSTNTLDTDSGW 383 (729) T ss_pred eeeeccccccccCcccceeeeeeeeeccccccccccccccceeecc-ccceeeecccccccccccccccccceecccccc Confidence 0 112233444455432 2222222222222222222211 11111110 Q ss_pred -----ecccCCccceeEEeecccccccc----------cchhhhhhhHhhhhhcccccceEEEeccccccchhhhHHHHH Q lcl|NC_014792. 327 -----SLNWPKGFAGIINLMGGISANDQ----------VTAGDLMQGWDLFADREALHINLLIAGAVAGEGDATASTVQK 391 (659) Q Consensus 327 -----~~~~~~~~~~~~~~~gg~~~~~~----------~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~v~~ 391 (659) ....+......+++++|.+..+. ....++.+++.+++..+...++++++++.. .++.+...++. T Consensus 384 ~~~a~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~g~~~l~~~~~~~~~~~~~~~~~-~~~~~~~~v~~ 462 (729) T protein:vir:10 384 DQNAEGVNFGASGVATLTLAGGTNYGDKTDLTTSGALSSGVDDIISGYTLFENTEEIEVDFILMGAAH-HPKEQSQAVAE 462 (729) T ss_pred ccccccccccccceeEEEeecccccccccccccccccccchhHHHHHHHHhhcccccccceeeecCCC-CCccchHHHHH Confidence 01111223345667777664433 123455677887777666667766666543 34457789999 Q ss_pred HHHHHHHhhCCEEEEEecCcccccccc---------ccCCHHHHHHHhhccccccccccccccceEEEEcCceeEecccC Q lcl|NC_014792. 392 HVVSIADERQDCLAFISPPKGLLVNVP---------LTRAVDNLIDWRTGGGSFDTDNMNISTTYAAIDGNYKYQYDKYN 462 (659) Q Consensus 392 ~l~~~~~~~~~~~ai~d~p~~~~~~~~---------~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~ 462 (659) +|++||+++++||+++|+|+...+... .....+++..|++.+ .+++|+++||||++++|+.+ T Consensus 463 a~~~~~~~~~~~~a~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~p~~~~~d~~~ 533 (729) T protein:vir:10 463 KVTAVAEARKDAVAFISPYRQAFLNDTVSGTVTVSNIDQTTENVVGFYAPL---------SSSTYSVFDSGYKYMFDRFN 533 (729) T ss_pred HHHHHHHhcCCeEEEecccccccccccccccccccccchhhHHHHHHHhhc---------cCCceEEEEcCeeEEecccC Confidence 999999999999999999976554322 223345566666543 35789999999999999999 Q ss_pred CcceeecHHHHHHHHHHHhhhcCCceECcCCcchhheeccccceeecChhHHHhhhhCCceEEEEEeCCCeEEEEccccc Q lcl|NC_014792. 463 DVNRWVPLAADMAGLCARTDDVSQPWMSPPGYNRGQILNVLKLAIEPRQTQRDRMYQEAINPVVGFAGGDGFVLYGDKTA 542 (659) Q Consensus 463 ~~~~~~p~s~~~Ag~~a~~d~~~g~~~span~~~~~i~g~~~~~~~~~~~e~~~Ln~~gin~i~~~~~~~G~~~wG~rT~ 542 (659) +..+++|||+++||++||+|.++||||||+|+++++|.|+.++++.+++.|++.||++|||+|++|++ +|+++||+||+ T Consensus 534 ~~~~~~p~s~~~aGl~a~~d~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~-~G~~~wG~rT~ 612 (729) T protein:vir:10 534 NTFRYVPLNGDIAGTCARTDIEQFPWFSPAGTARGPILNSVKLVYNPGKKQRDILYSNRINPVILSPG-AGIILFGDKTG 612 (729) T ss_pred CceEEechhHHHHHHHHHhhccCCcEEccCCccccceecccceeeecChhhHhhhhhCCceEEEEecC-CeEEEEcceec Confidence 99999999999999999999999999999999999999999999999999999999999999999997 79999999999 Q ss_pred CCCccccceeehhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhccceeeeEEEEccCCCCHHHhhCC Q lcl|NC_014792. 543 TKVPSPMDHINVRRLTNMLKKNIGDASKYKLFELNDNFTRASFRMETSQYLDGIRALGGIYEGRVVCDTTNNTPSVIDRN 622 (659) Q Consensus 543 ~~~~~~~~~i~vrR~~~~i~~~i~~~~~~~v~epn~~~l~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~nt~~~i~~G 622 (659) ++.+++|+|||+||||+||+++|+++++|+||||||+.||++|+++|++||++||++|+|+||+|+||+++||++||++| T Consensus 613 ~~~d~~~~~i~vrR~~~~i~~si~~~~~~~v~epn~~~~~~~i~~~i~~~L~~l~~~g~l~g~~v~~d~~~nt~~~i~~G 692 (729) T protein:vir:10 613 FGKSSAFDRINVRRLFIYLEDAISAAAKDQLFEFNDELTRTNFVNIVEPFLRDVQAKRGIFDFVVICDETNNTAAVIDSN 692 (729) T ss_pred CCCCcccceeehhhhHHHHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHHHHHHHhccceeeeEEEEcCCCCCHHHhhCC Confidence 77778999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEEEEEEecCCceEEEEEEEEeecCeeEEEecCCC Q lcl|NC_014792. 623 EFVASIYYKPARSINYIVLNFVATSTGADFDELIGVQ 659 (659) Q Consensus 623 ~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~~~~ 659 (659) +|+++|+|+|++|+|||+|||+|++++++|+|+++.= T Consensus 693 ~~~~~v~~~p~~p~e~i~~~~~~~~~~~~~~e~~~~~ 729 (729) T protein:vir:10 693 EFVADIFIKPARSINFIGLTFVATRTGVAFEEVIGSV 729 (729) T ss_pred eEEEEEEEEecCCccEEEEEEEEeecCccHHHHHhcC Confidence 9999999999999999999999999999999999988 No 16 >protein:vir:79092 Length: 477 # NCBI annotation: gp13, phage tail sheath protein # Family: family:all:115 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111213;genbank:gi:134288804;genbank:GeneID:4960766 Probab=100.00 E-value=6.3e-106 Score=597.35 Aligned_cols=467 Identities=20% Similarity=0.225 Sum_probs=323.7 Q ss_pred Cc-eecCceEEEEecCCCccc-ccCCcceEEEeecccCCCCccEEeCCHHHHHHHcCCcCCCchhHHHHHHHHHcCCCeE Q lcl|NC_014792. 1 MA-LLSPGIELKETTVQSTVV-RNATGRAALVGKFQWGPAFQVTQITNEVELVDLFGGPNNITADYFMSGMNFLQYGNDL 78 (659) Q Consensus 1 ~~-~~~PGVyveE~~~~~~~~-~~~ts~~afvG~~~~Gp~~~p~~i~s~~~~~~~fG~~~~~~~~~~~~~~~f~ngG~~~ 78 (659) |+ |++|||||||++++++++ +++|+|++|||++++||+|+|++|+||.||++ ||+.....++++++..||.|||++| T Consensus 1 M~~~~~pGVyv~E~~~g~~~I~~v~Tsv~~~VG~a~~~p~n~pv~its~~d~~~-~g~~~~~~tL~~Av~~~f~ngg~~~ 79 (477) T protein:vir:79 1 MAANYLHGVETIEKETGSRPVKVVKSAVIGLIGTAPIGPVNTPVQSLSDVDAAQ-FGPQLAGFTIPQALDAVYDYGSGTV 79 (477) T ss_pred CcCCCCCCeEEEEecCCcccccccCCceEEEEeecccCCCcccEEEccHHHHHH-hcCCCCCCcHHHHHHHHhhcCCceE Confidence 87 779999999999999765 56799999999999999999999999999986 7778888899999999999999999 Q ss_pred EEEeccCCcccccccccccccccccccccccccccceeeeeeccccccccceeeeeeccCcceeeeeccccccccccccc Q lcl|NC_014792. 79 RTVRVVNRDHAKNASPVAGNIESTIATAGSNYAVGDVIQVKHNQTVVETSGRITKVDVDGKILAVFIPSDKIIAFAKSVN 158 (659) Q Consensus 79 ~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~ 158 (659) |||||.+.+.......... . ..... ........ .......... T Consensus 80 ~vvrV~~~~~~~~~~a~~~---------~---~~~~~----------~~~~~~~~---~~~~~~~v~~------------ 122 (477) T protein:vir:79 80 IVINVLDPAVHKSNAASES---------V---TFDAA----------TGRAKLAH---PAAANLVLKN------------ 122 (477) T ss_pred EEEeccCCccccccccccc---------c---ccccc----------cccccccc---cccceeEEee------------ Confidence 9999976542221110000 0 00000 00000000 0000000000 Q ss_pred eeeeeccceeeEEEeecCCccccccccceeccccceeeecccccccccccceeecccccccceeeeccccccceeEEEEE Q lcl|NC_014792. 159 QYPDLGPAWTAEILTTSSGVSGTITLGKIVTDSGILLTEAENSEEAITSLEFQASLQKYAMPGVVALYPGEIGSTLEVEI 238 (659) Q Consensus 159 ~~~~~~~~~~~~v~~~~~g~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~g~~i~V~v 238 (659) ..... ..... .... .......... ...+..+.. T Consensus 123 ---------------~~~~~--~~~~~-----~~~~---~~~~~~~~~~-----------------~~~~~~~~~----- 155 (477) T protein:vir:79 123 ---------------DSGGT--TYTEG-----TDYA---VDLINGVITR-----------------IKTGTIPAA----- 155 (477) T ss_pred ---------------ccccc--ccccC-----cccc---ccccchhhhh-----------------hhccccccc----- Confidence 00000 00000 0000 0000000000 000000000 Q ss_pred eecccccccceeeeeeeccccccccceeeeeeeccccccceeeeeccCCceeeeeeeeccccccccccchhhhhhhhhcc Q lcl|NC_014792. 239 VSKAAYDVGASKMLDIYPNGGSRASVARAVFNYGPQTDDQYAIIVRRDGAIVENVVLSTKEGDKDVYGNNIYLDDYFAKG 318 (659) Q Consensus 239 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~et~~~~~~~~~~~~~~~~~~~~~~~~~~ 318 (659) .......... . .+... T Consensus 156 -----------------------~~~~~~~~~~---~----------~~~~~---------------------------- 171 (477) T protein:vir:79 156 -----------------------ATAAKATYDY---A----------DPTKV---------------------------- 171 (477) T ss_pred -----------------------cceeeceecc---C----------Ccccc---------------------------- Confidence 0000000000 0 00000 Q ss_pred cccceEEeecccCCccceeEEeecccccccccchhhhhhhHhhhhh---cccccceEEEeccccccchhhhHHHHHHHHH Q lcl|NC_014792. 319 TSNYIYATSLNWPKGFAGIINLMGGISANDQVTAGDLMQGWDLFAD---REALHINLLIAGAVAGEGDATASTVQKHVVS 395 (659) Q Consensus 319 ~s~~v~~~~~~~~~~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~p~~~~~~~~~~~~v~~~l~~ 395 (659) ......+..+. ....+++.++.. .....+.+++.|+. ++..+|+.+|.+ T Consensus 172 -----------------~~~~~~g~~~a------~~~~tg~~al~~~~~~~~~~~~iv~apg~-----~~~~~v~~~l~~ 223 (477) T protein:vir:79 172 -----------------TAADIIGAVNA------AGMRTGMKALKDTYNLYGYFSKILIAPAY-----CTQNSVSVELEA 223 (477) T ss_pred -----------------eeeeecccccc------cccchhhhhhhhhhhhcccccceeecccc-----ccchhHHHHHHH Confidence 00000000000 000111111111 11124567777765 345679999999 Q ss_pred HHHhhCCEEEEEecCccccccccccCCHHHHHHHhhccccccccccccccceEEEEcCceeEecccCCcceeecHHHHHH Q lcl|NC_014792. 396 IADERQDCLAFISPPKGLLVNVPLTRAVDNLIDWRTGGGSFDTDNMNISTTYAAIDGNYKYQYDKYNDVNRWVPLAADMA 475 (659) Q Consensus 396 ~~~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~A 475 (659) +|++++ +|+++|+|.+ .+.+++.+|++..+.. ..+++|.|+++||||++++|+.++..+++|||+++| T Consensus 224 ~~~~~~-~~a~~d~p~~--------~~~~~~~~~~~~~~~~---~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~s~~~a 291 (477) T protein:vir:79 224 MAVQLG-AIAYIDAPIG--------TTLAQALAGRGPAGTI---NFNTSSDRVRLCYPHVKVYDIATNAERLEPLSSRAA 291 (477) T ss_pred HHhhcC-eEEEEecCCC--------CChHHHhhhhhhcccc---ccccccceEEEEcCeeEEecccCCceeeechHHHHH Confidence 999875 9999999965 3567788888765543 346789999999999999999999999999999999 Q ss_pred HHHHHhhhcCCceECcCCcchhheeccc---cceeecChhHHHhhhhCCceEEEEEeCCCeEEEEcccccC--CCccccc Q lcl|NC_014792. 476 GLCARTDDVSQPWMSPPGYNRGQILNVL---KLAIEPRQTQRDRMYQEAINPVVGFAGGDGFVLYGDKTAT--KVPSPMD 550 (659) Q Consensus 476 g~~a~~d~~~g~~~span~~~~~i~g~~---~~~~~~~~~e~~~Ln~~gin~i~~~~~~~G~~~wG~rT~~--~~~~~~~ 550 (659) |++||+|.++|+||||+|+++.++.++. ......++.|++.||++|||+|++|++ +|+++||+||++ +++++|+ T Consensus 292 g~~a~~d~~~g~~~span~~~~gv~~~~~~~~~~~~~~~~~~~~L~~~~i~~i~~~~~-~G~~~wG~rT~~~~~~~~~~~ 370 (477) T protein:vir:79 292 GLRARVDLDKGYWWSSSNQQLVGVTGVERPLSAMIDDPQSDVNMLNEQGITTVFSSYG-SGLRLWGNRTAAWPTVTHMRN 370 (477) T ss_pred HHHHHhhccCCceEccCCceeecceecccccccccCCChhhHHHHhhCCceEEEEecC-CcEEEEcccccCCCCCCccce Confidence 9999999999999999999977666643 223444678999999999999999987 799999999996 3446899 Q ss_pred eeehhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhccceeeeEEEEccCCCCHHHhhCCEEEEEEEE Q lcl|NC_014792. 551 HINVRRLTNMLKKNIGDASKYKLFELNDNFTRASFRMETSQYLDGIRALGGIYEGRVVCDTTNNTPSVIDRNEFVASIYY 630 (659) Q Consensus 551 ~i~vrR~~~~i~~~i~~~~~~~v~epn~~~l~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~~i~~ 630 (659) |||+||+|++|+++|++.++|+|||||++.+|++|+++|+.||++||++|+|+||+|+||+++||++||++|+|+++|++ T Consensus 371 ~i~vrR~~~~i~~~~~~~~~~~v~e~~~~~~~~~i~~~i~~~l~~l~~~g~l~g~~v~~~~~~nt~~~i~~G~~~~~i~~ 450 (477) T protein:vir:79 371 FENVRRTGDVINESLRYFSQQFVDAPIDQGLIDSLVESVNGFGRKLIGDGALLGFKAWFDPARNPKEELAAGHLLINYKY 450 (477) T ss_pred eeehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHhhCCeEEEEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EecCCceEEEEEEEEeecCeeEEEecCCC Q lcl|NC_014792. 631 KPARSINYIVLNFVATSTGADFDELIGVQ 659 (659) Q Consensus 631 ~p~~p~e~i~~~~~~~~~~~~~~e~~~~~ 659 (659) +|++|+|||+|+|++...+.. ++.|-. T Consensus 451 ~p~~p~e~i~~~~~~~~~~~~--~~~~~~ 477 (477) T protein:vir:79 451 TVPPPLERLTYETEITSEYLL--TLKGGN 477 (477) T ss_pred EecCCceeEEEEEEEechHHh--hhccCC Confidence 999999999999999888755 444444 No 17 >protein:vir:107865 Length: 477 # NCBI annotation: gp39 # Family: family:all:115 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024712;genbank:gi:48696949;genbank:GeneID:2845953 Probab=100.00 E-value=2.8e-105 Score=593.77 Aligned_cols=467 Identities=18% Similarity=0.176 Sum_probs=327.7 Q ss_pred Cc-eecCceEEEEecCCCccc-ccCCcceEEEeecccCCCCccEEeCCHHHHHHHcCCcCCCchhHHHHHHHHHcCCCeE Q lcl|NC_014792. 1 MA-LLSPGIELKETTVQSTVV-RNATGRAALVGKFQWGPAFQVTQITNEVELVDLFGGPNNITADYFMSGMNFLQYGNDL 78 (659) Q Consensus 1 ~~-~~~PGVyveE~~~~~~~~-~~~ts~~afvG~~~~Gp~~~p~~i~s~~~~~~~fG~~~~~~~~~~~~~~~f~ngG~~~ 78 (659) |. |++|||||||++++++++ +++|+|++|||++++||+|+|++|+||.|| +.||+.....++.++++.||.|||++| T Consensus 1 M~~~~~pGVyv~E~~~~~~~i~~v~T~v~~~VG~a~~gp~n~pv~its~~d~-~~~g~~~~~~tL~~Av~~~f~nGg~~~ 79 (477) T protein:vir:10 1 MAANYLHGVETIEKETGSRPVKVVKSAVIGLIGTAPIGPVNTPVQSLSDVDA-AQFGPQLAGFTIPQALDAVYDYGSGTV 79 (477) T ss_pred CcccCCCCeEEEEccCCcccccccCCceeEEEecccCCCCCcCEEEccHHHH-HHhccCCCCCcHHHHHHHHHhccceEE Confidence 87 678999999999998755 567999999999999999999999999999 569999888999999999999999999 Q ss_pred EEEeccCCcccccccccccccccccccccccccccceeeeeeccccccccceeeeeeccCcceeeeeccccccccccccc Q lcl|NC_014792. 79 RTVRVVNRDHAKNASPVAGNIESTIATAGSNYAVGDVIQVKHNQTVVETSGRITKVDVDGKILAVFIPSDKIIAFAKSVN 158 (659) Q Consensus 79 ~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~ 158 (659) |||||.+.+....... . . . ... ..+ . T Consensus 80 ~vVrV~~~~~~~~~~~--~-------~-~------------------------~~~-~~~-~------------------ 105 (477) T protein:vir:10 80 IVINVLDPAVHKSNAA--N-------E-P------------------------VTF-DAA-T------------------ 105 (477) T ss_pred EEEecCcccccccccc--c-------c-c------------------------ccc-ccc-c------------------ Confidence 9999976432110000 0 0 0 000 000 0 Q ss_pred eeeeeccceeeEEEeecCCccccccccceeccccceeeecccccccccccceeecccccccceeeeccccccceeEEEEE Q lcl|NC_014792. 159 QYPDLGPAWTAEILTTSSGVSGTITLGKIVTDSGILLTEAENSEEAITSLEFQASLQKYAMPGVVALYPGEIGSTLEVEI 238 (659) Q Consensus 159 ~~~~~~~~~~~~v~~~~~g~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~g~~i~V~v 238 (659) + . . .. .+.+.....+.. T Consensus 106 ------------------~---~------~----------~~--------------------------~~~~~~~~~v~~ 122 (477) T protein:vir:10 106 ------------------G---R------A----------KL--------------------------AHPAAANLVLKN 122 (477) T ss_pred ------------------c---e------e----------cc--------------------------cccccccccccc Confidence 0 0 0 00 000000000000 Q ss_pred eecccccccceeeeeeeccccccccceeeeeeeccccccceeeeeccCCceeeeeeeeccccccccccchhhhhhhhhcc Q lcl|NC_014792. 239 VSKAAYDVGASKMLDIYPNGGSRASVARAVFNYGPQTDDQYAIIVRRDGAIVENVVLSTKEGDKDVYGNNIYLDDYFAKG 318 (659) Q Consensus 239 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~et~~~~~~~~~~~~~~~~~~~~~~~~~~ 318 (659) . ... .. ... .. ...+.....+. .......... . T Consensus 123 ~--a~~--------------------~~--~~~--~~--~~~~~~~~~~~-~~~~~~~~~~------------------~ 155 (477) T protein:vir:10 123 D--SGG--------------------TT--YAE--GT--DYAVDLINGVI-TRIKTGTIPP------------------G 155 (477) T ss_pred c--ccc--------------------cc--ccc--ch--hhhhhhccccc-eecccccccc------------------c Confidence 0 000 00 000 00 00000000000 0000000000 0 Q ss_pred cccceEEeecccCCccceeEEeecccccccccchhhhhhhHhhhhhcc---cccceEEEeccccccchhhhHHHHHHHHH Q lcl|NC_014792. 319 TSNYIYATSLNWPKGFAGIINLMGGISANDQVTAGDLMQGWDLFADRE---ALHINLLIAGAVAGEGDATASTVQKHVVS 395 (659) Q Consensus 319 ~s~~v~~~~~~~~~~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~p~~~~~~~~~~~~v~~~l~~ 395 (659) ... .................+.+..+. ....+++.++...+ ...+.++++|+. ++..+|+.+|.+ T Consensus 156 ~~~-~~~~~~~~~~~~~~~~~~~g~~~~------~~~~tGl~al~~~~~~~~~~~~~l~apg~-----~~~~~v~~~l~~ 223 (477) T protein:vir:10 156 ATA-AKATYDYADPTKVTAADIIGAVNA------AGMRTGMKALKDTYNLYGYFSKILIAPAY-----CTQNSVSVELEA 223 (477) T ss_pred cee-eeeccccccccccccccccccccc------cchhhhhhhhhhhhhhcchhccccccccc-----ccchhhHHHHHH Confidence 000 000000000000011112222111 11122333322211 123466677765 345679999999 Q ss_pred HHHhhCCEEEEEecCccccccccccCCHHHHHHHhhccccccccccccccceEEEEcCceeEecccCCcceeecHHHHHH Q lcl|NC_014792. 396 IADERQDCLAFISPPKGLLVNVPLTRAVDNLIDWRTGGGSFDTDNMNISTTYAAIDGNYKYQYDKYNDVNRWVPLAADMA 475 (659) Q Consensus 396 ~~~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~A 475 (659) +|++++ +|+++|+|.+ .+.+++.+|++..... ..+++|+|++++|||++++|+.++..+++|||+++| T Consensus 224 ~~~~~~-~~~~~d~p~~--------~~~~~~~~~~~~~~~~---~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~a 291 (477) T protein:vir:10 224 MAVQLG-AIAYIDAPIG--------TTLAQALAGRGPAGTI---NFNTSSDRVRLCYPHVKVYDTATNAERLEPLSSRAA 291 (477) T ss_pred HHhhCC-EEEEEecCCC--------CCHHHHHhhhhhcccc---ccccccceEEEEcCeEEEecccCCceeEEchHHHHH Confidence 999885 9999999864 3567889999876543 346789999999999999999999999999999999 Q ss_pred HHHHHhhhcCCceECcCCcchhheecccc---ceeecChhHHHhhhhCCceEEEEEeCCCeEEEEcccccCC--Cccccc Q lcl|NC_014792. 476 GLCARTDDVSQPWMSPPGYNRGQILNVLK---LAIEPRQTQRDRMYQEAINPVVGFAGGDGFVLYGDKTATK--VPSPMD 550 (659) Q Consensus 476 g~~a~~d~~~g~~~span~~~~~i~g~~~---~~~~~~~~e~~~Ln~~gin~i~~~~~~~G~~~wG~rT~~~--~~~~~~ 550 (659) |++||+|.++|+||||+|+++.++.++.. .....++.|++.||++|||+|++|++ +|+++||+||++. ++..|+ T Consensus 292 g~~a~~d~~~g~~~span~~~~gi~~~~~~~~~~~~~~~~~~~~L~~~gi~~i~~~~~-~G~~~wG~rT~~~~~~~~~~~ 370 (477) T protein:vir:10 292 GLRARVDLDKGYWWSSSNQQLVGVTGVERPLSAMIDDPQSDVNMLNEQGITTVFSSYG-SGLRLWGNRTAAWPTVTHMRN 370 (477) T ss_pred HHHHHhhhcCCceeccCCceeccccccccccccccCCChhhHHHHhhCCceEEEEecC-CcEEEEcccccCCCCCCcccc Confidence 99999999999999999999877777532 33444678999999999999999987 7999999999954 345799 Q ss_pred eeehhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhccceeeeEEEEccCCCCHHHhhCCEEEEEEEE Q lcl|NC_014792. 551 HINVRRLTNMLKKNIGDASKYKLFELNDNFTRASFRMETSQYLDGIRALGGIYEGRVVCDTTNNTPSVIDRNEFVASIYY 630 (659) Q Consensus 551 ~i~vrR~~~~i~~~i~~~~~~~v~epn~~~l~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~~i~~ 630 (659) ||++||+|++|+++|+++++|+|||||++.+|++|+++++.||++||++|+|+||+|+||+++||++||++|+|+++|++ T Consensus 371 ~~~vrR~~~~i~~~~~~~~~~~v~~~~~~~~~~~i~~~i~~~l~~l~~~g~l~g~~v~~~~~~nt~~~i~~G~~~~~i~~ 450 (477) T protein:vir:10 371 FENVRRTGDVINESLRYFSQQFVDAPIDQGLIDSLVESVNGFGRKLIGDGALLGFKAWFDPARNPKEELAAGHLLINYKY 450 (477) T ss_pred eeehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHhhCCeEEEEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EecCCceEEEEEEEEeecCeeEEEecCCC Q lcl|NC_014792. 631 KPARSINYIVLNFVATSTGADFDELIGVQ 659 (659) Q Consensus 631 ~p~~p~e~i~~~~~~~~~~~~~~e~~~~~ 659 (659) +|++|+|||+|++++.... ++|+.+-. T Consensus 451 ~p~~p~e~i~~~~~~~~~~--~~~~~~g~ 477 (477) T protein:vir:10 451 TVPPPLERLTYETEITSEY--LLTLKGGN 477 (477) T ss_pred EecCCcceEEEEEEEcchH--HhhhhcCC Confidence 9999999999999987666 55555544 No 18 >protein:vir:98824 Length: 774 # NCBI annotation: putative phage tail sheath protein # Family: family:all:661 # MgeID: mge:1530 # MgeName: Ma-LMM01 # Cross-refs: genbank:acc:YP_851105;genbank:gi:117530262;genbank:GeneID:4484488 Probab=100.00 E-value=3.6e-102 Score=576.76 Aligned_cols=482 Identities=15% Similarity=0.111 Sum_probs=308.9 Q ss_pred CceecCceEEEEecCCCcccc--cCCcceEEEeecccCCCCccEEeCCHHHHHHHcCCcCCCchhHHHHHHHHHcCCCeE Q lcl|NC_014792. 1 MALLSPGIELKETTVQSTVVR--NATGRAALVGKFQWGPAFQVTQITNEVELVDLFGGPNNITADYFMSGMNFLQYGNDL 78 (659) Q Consensus 1 ~~~~~PGVyveE~~~~~~~~~--~~ts~~afvG~~~~Gp~~~p~~i~s~~~~~~~fG~~~~~~~~~~~~~~~f~ngG~~~ 78 (659) -+|-+|||||||+++++++++ ++||++||||.++|||+|+|++|+||.||.+.||..... ++|+++| T Consensus 281 ~~v~~~GVYVEEVpSGvrtIeGGV~TSVAAFVG~A~rGPvn~PvlITS~aD~~~~Fg~~~GG-----------l~GassA 349 (774) T protein:vir:98 281 RNVEDNGVVIQLEPALTGSISNRFSFYVTANDNTANRGFTTSPALVTTIPDPAIHFTSFQGG-----------LDGPRSA 349 (774) T ss_pred EEEecCceEEEEeCCCCccccccccceeeeecccccCCCCCcCEEEeehhHhhhhhccccCC-----------cccccee Confidence 458899999999999998774 469999999999999999999999999977777654322 2566666 Q ss_pred EEEeccCCcccccccccccccccccccccccccccceeeeeeccccccccceeeeeeccCcceeeeeccccccccccccc Q lcl|NC_014792. 79 RTVRVVNRDHAKNASPVAGNIESTIATAGSNYAVGDVIQVKHNQTVVETSGRITKVDVDGKILAVFIPSDKIIAFAKSVN 158 (659) Q Consensus 79 ~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~ 158 (659) |.+.....+ ... ....+..++.||+.+++..... T Consensus 350 ~r~~~~~sG--------~~~---L~i~A~~pGawGN~ItV~I~~~----------------------------------- 383 (774) T protein:vir:98 350 FRDFYTFNG--------TPL---LRLQAVSEGNWGNQVTVSIYPV----------------------------------- 383 (774) T ss_pred eeeeeeecc--------cce---EEEEEeecCcCCCceEEEEEec----------------------------------- Confidence 532211000 000 0011111222222222110000 Q ss_pred eeeeeccceeeEEEeecCCccccccccceeccccceeeecccccccccccceeecccccccceeeeccccccceeEEEEE Q lcl|NC_014792. 159 QYPDLGPAWTAEILTTSSGVSGTITLGKIVTDSGILLTEAENSEEAITSLEFQASLQKYAMPGVVALYPGEIGSTLEVEI 238 (659) Q Consensus 159 ~~~~~~~~~~~~v~~~~~g~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~g~~i~V~v 238 (659) . .... .......... .. .... .+....+. T Consensus 384 -----------------t--------------~~~~------------~l~v~~~~~s-~f---~~~~---a~e~~tv~- 412 (774) T protein:vir:98 384 -----------------N--------------NSEF------------RLNVQDLNGS-AF---NPPL---ADEVYTVK- 412 (774) T ss_pred -----------------C--------------Ccee------------EEEEEecCCc-cc---cccc---cceeEEEe- Confidence 0 0000 0000000000 00 0000 01111110 Q ss_pred eecccccccceeeeeeeccccccccceeeeeeeccccccceeeeeccCCc-eeeeeeeeccccccccccchhhhhhhhhc Q lcl|NC_014792. 239 VSKAAYDVGASKMLDIYPNGGSRASVARAVFNYGPQTDDQYAIIVRRDGA-IVENVVLSTKEGDKDVYGNNIYLDDYFAK 317 (659) Q Consensus 239 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~-~~et~~~~~~~~~~~~~~~~~~~~~~~~~ 317 (659) ........ .....+.+. ............-.-..... ..+.+........... . T Consensus 413 -~~~~~~~~--~v~e~~dn~----------~i~~~~~~~~~~~in~vs~lv~~~~~~~a~~d~~~~~------~------ 467 (774) T protein:vir:98 413 -LGDTNESG--ELNALLDSK----------FIRGFFLPKSIDSINYDAALVRQSPLRLAPPDESETD------V------ 467 (774) T ss_pred -cccccccc--eeeeeecee----------eEeecccccccccccccccccccchhccccccccccc------c------ Confidence 00000000 000000000 00000000000000000000 0000000000000000 0 Q ss_pred ccccceEEeecccCCccceeEEeecccccccccchhhhhhhHhhhhhcccccceEEEeccccccchhhhHHHHHHHHHHH Q lcl|NC_014792. 318 GTSNYIYATSLNWPKGFAGIINLMGGISANDQVTAGDLMQGWDLFADREALHINLLIAGAVAGEGDATASTVQKHVVSIA 397 (659) Q Consensus 318 ~~s~~v~~~~~~~~~~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~v~~~l~~~~ 397 (659) .. ..............+++++|.|+.. .+..++....+.. +...+++++.+. ....++.+|+.|| T Consensus 468 --~~--~~~~~~~~~~~~v~v~lagG~Dg~~-tt~~~igg~~~~~---~~tgi~aLl~a~-------~~~~V~~aii~~~ 532 (774) T protein:vir:98 468 --EN--PAHVDFYGPNVLVDVTLENGYDGPP-VTNDDYVSIIRTL---ENQPVHILLVGT-------TNVGVQQALITEA 532 (774) T ss_pred --cc--cccccccCCcceEEEeecCCCCccc-ccchheecccccc---cccceeEEEcCc-------cchhhHHHHHHHH Confidence 00 0000001111223456778877643 3444444433333 234566666532 2345677777777 Q ss_pred Hhh----CCEEEEEecCccccccccccCCHHHHHHHhhccccccccccccccceEEEEcCceeEecccCCcceeecHHHH Q lcl|NC_014792. 398 DER----QDCLAFISPPKGLLVNVPLTRAVDNLIDWRTGGGSFDTDNMNISTTYAAIDGNYKYQYDKYNDVNRWVPLAAD 473 (659) Q Consensus 398 ~~~----~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~ 473 (659) +++ ++||+++|.|++ .+.+++.+|++. ++|+|+++||||++++|+.+++.+++|||++ T Consensus 533 e~~~~~~~~r~avid~p~g--------~t~~~Ai~~r~~----------f~S~~aal~~Pwvkv~D~~~g~~~~vPpSg~ 594 (774) T protein:vir:98 533 ERASDSDGLRIAVLAAPPR--------TTPTLAASVTRG----------FNSTRAVMVAGWFTYAGQPNSSRYGVPGAAV 594 (774) T ss_pred HHhhhcccceEEEEECCCC--------CCHHHHHHHHhc----------cCCceEEEEeCcEEEeccCCCceeecChhHH Confidence 765 789999999865 357889999963 5689999999999999999999999999999 Q ss_pred HHHHHHHhhhcCCceECcCCcchhheeccc---cceeecChhHHHhhhhCCceEEE-EEeCCCeEEEEcccccCCCcccc Q lcl|NC_014792. 474 MAGLCARTDDVSQPWMSPPGYNRGQILNVL---KLAIEPRQTQRDRMYQEAINPVV-GFAGGDGFVLYGDKTATKVPSPM 549 (659) Q Consensus 474 ~Ag~~a~~d~~~g~~~span~~~~~i~g~~---~~~~~~~~~e~~~Ln~~gin~i~-~~~~~~G~~~wG~rT~~~~~~~~ 549 (659) +||++||+| +||||+|+.+.++.|.. .+....++.|++.|++++||+++ .+++ +|+++||+||+++|+ +| T Consensus 595 vAGl~ArtD----v~kSPANk~I~Givg~ai~~~l~~~~t~ae~d~Ln~~gIN~i~itt~g-~G~rvWG~RTlssDp-~w 668 (774) T protein:vir:98 595 YAGKLAAID----FFVSPAARSLVGPLFNIIESDTDNYTSRSNQDIYSAARLEVLSLDTVD-RTYRFASGVTLSTDP-AW 668 (774) T ss_pred HHHHHHhcC----cccccCCceeecceeccccccccccccchhhhhhcccccceeEEEEcC-CcEEEEcccccCCCc-cc Confidence 999999999 89999999876665532 23455678999999999999998 5776 799999999998765 89 Q ss_pred ceeehhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhccceeeeE-EEEccCCCCHHHhhCCEEEEEE Q lcl|NC_014792. 550 DHINVRRLTNMLKKNIGDASKYKLFELNDNFTRASFRMETSQYLDGIRALGGIYEGR-VVCDTTNNTPSVIDRNEFVASI 628 (659) Q Consensus 550 ~~i~vrR~~~~i~~~i~~~~~~~v~epn~~~l~~~i~~~i~~~l~~l~~~gal~g~~-v~~d~~~nt~~~i~~G~~~~~i 628 (659) +||++||||+||+++|+++++|+||||||+.+|++|+++++.||++||++|+|+||+ |+||+++||+++|++|+|+++| T Consensus 669 r~InVRRlfd~Ie~SI~~~~~~~VfEPNd~~l~~~I~~sI~~fL~~L~~~GaL~G~~~V~~D~etNt~~dI~~G~l~i~I 748 (774) T protein:vir:98 669 ERIYLRRVHDVVRQGAHAILRNYVAMPNSRLVRNQIAAALNAFMGELKRNGNIVSFRPAIIDGSNNSTAAYFSRELYVSL 748 (774) T ss_pred ceEeehhhHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceecceEEEEcCCCCCHHHhhCCEEEEEE Confidence 999999999999999999999999999999999999999999999999999999997 8999999999999999999999 Q ss_pred EEEecCCceEEEEEEEEeecCeeEEE Q lcl|NC_014792. 629 YYKPARSINYIVLNFVATSTGADFDE 654 (659) Q Consensus 629 ~~~p~~p~e~i~~~~~~~~~~~~~~e 654 (659) +++|++|+|||+|||+|++++.+|+| T Consensus 749 ~vaP~~PAEfIilri~q~t~~~~l~E 774 (774) T protein:vir:98 749 QFQPLYSADYIYVTISRDTETSPLGE 774 (774) T ss_pred EEEecCCcceEEEEEEEeecceeccC Confidence 99999999999999999999999999 No 19 >protein:vir:103168 Length: 641 # NCBI annotation: gp122 # Family: family:all:661 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717789;genbank:gi:113200626;genbank:GeneID:4239177 Probab=100.00 E-value=1.1e-94 Score=535.69 Aligned_cols=526 Identities=23% Similarity=0.374 Sum_probs=311.7 Q ss_pred Cc-eecCceEEEEecCCCcccccCCcceEEEeecccCCCCccEEeCCHHHHHHHcCCcCCCchhHHHHHHHHHcCCCeEE Q lcl|NC_014792. 1 MA-LLSPGIELKETTVQSTVVRNATGRAALVGKFQWGPAFQVTQITNEVELVDLFGGPNNITADYFMSGMNFLQYGNDLR 79 (659) Q Consensus 1 ~~-~~~PGVyveE~~~~~~~~~~~ts~~afvG~~~~Gp~~~p~~i~s~~~~~~~fG~~~~~~~~~~~~~~~f~ngG~~~~ 79 (659) |. |++|||||||+|++..+.+++||++||||+|+|||+|+|++|+||.||+++||++++.+|++|++++||+|||++|| T Consensus 3 m~~~~sPGVyv~E~~~~~~i~~v~tsvaafvG~~~~GP~~~p~~v~s~~d~~~~FG~~~~~~~l~~av~~fF~ngG~~~~ 82 (641) T protein:vir:10 3 VSNQLSPGVVIQERDLTAVTTPIGLNVGVLAAPFTKGPVEEIFEVSTERDLASVFGEPNDYNYEYWFTASQFLSYGGVLK 82 (641) T ss_pred CccccCCceEEEEecCCCcccccCCccceEEecccCCCCCccEEecCHHHHHHHcCCcCCCcchHHHHHHHHHhcCCEEE Confidence 76 99999999999998766677899999999999999999999999999999999999999999999999999999999 Q ss_pred EEeccCCccccccccccccc-----------------ccccccccccccccceeeeeeccccccccceeeeeeccCccee Q lcl|NC_014792. 80 TVRVVNRDHAKNASPVAGNI-----------------ESTIATAGSNYAVGDVIQVKHNQTVVETSGRITKVDVDGKILA 142 (659) Q Consensus 80 vvRv~~~~~~~~a~~~~~~~-----------------~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~g~~~~ 142 (659) |||+.+.+.. ++....... ......+..++.||+.+++............... .+.+.... T Consensus 83 vvRv~~~~~~-~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~A~~pG~~gn~l~v~i~~~~~~~~~~~~~-~~~~~~~~ 160 (641) T protein:vir:10 83 AIRLNAASLK-NSVDSGTAPLIKNLQEYETTYESSNSNTFKFASRDAGALGNSVGIFITDAGPDQIAVLPA-PGTGNEWE 160 (641) T ss_pred EEEecCcccc-ccccccchhhccccccccccccCcCccccEEEeccCCCcCCceEEEEEcCCCcceeeeec-ccccccce Confidence 9999875422 222111110 1112234557889998887654322111110000 00000000 Q ss_pred -----eeeccccccccccccc---eeeeeccce---------------------------eeEEEeecCCccccccccce Q lcl|NC_014792. 143 -----VFIPSDKIIAFAKSVN---QYPDLGPAW---------------------------TAEILTTSSGVSGTITLGKI 187 (659) Q Consensus 143 -----~~~~~~~~~~~~~~~~---~~~~~~~~~---------------------------~~~v~~~~~g~~~~~~~~~~ 187 (659) .+.............. ......... ..+......+..+....... T Consensus 161 ~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~~~~~~~~~~a~~~~~~i~~~~~g~~g~~~~~~~ 240 (641) T protein:vir:10 161 FVADEAVTAASGASAKVYRYSIRLTLTNVVGTFVPGGATTISISGSDESVDVLAWDAGNKYLEIALPAGGVTGIFADAQV 240 (641) T ss_pred eccceeeeeccCcccccccccccccccceeeecccCCcceeEecccccccccccccCCcceeeeeecCCcceeeeeeeee Confidence 0000000000000000 000000000 00000000000000000000 Q ss_pred eccccceeeecccccccccccceeecccccccceeeeccccccceeEEEEEeecccccccceeeee-eecccccccccee Q lcl|NC_014792. 188 VTDSGILLTEAENSEEAITSLEFQASLQKYAMPGVVALYPGEIGSTLEVEIVSKAAYDVGASKMLD-IYPNGGSRASVAR 266 (659) Q Consensus 188 v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~g~~i~V~v~~~~~~~~~~~~~~~-~~~~~~~~~~~~~ 266 (659) ........... ........ ............+...+..+....+.......+......... ........+.... T Consensus 241 ~t~gt~~~t~a---~~g~~~~~--~~~~~~~~ia~aat~ag~~g~~~~v~~~~v~d~~~a~~~~~g~~~~~va~~~gts~ 315 (641) T protein:vir:10 241 VTQGTNTAAIA---SSGIERRL--YIGKDSGSINFAATDAVVDTNATSATISSVRNEYAEREYLPGSKWVNVAARPGTSL 315 (641) T ss_pred ccCCccceeee---cccchhhh--hhccccccceeeeecccccccceeeEeeeeeeeecccccccccccccccccchhhh Confidence 00000000000 00000000 000000000111222334444444433222111111111000 0000111111122 Q ss_pred eeeeeccccccceeeeecc-------CCceeeeeeeecc-ccccccccchhhhhhhhhcccccceEEeecc--------- Q lcl|NC_014792. 267 AVFNYGPQTDDQYAIIVRR-------DGAIVENVVLSTK-EGDKDVYGNNIYLDDYFAKGTSNYIYATSLN--------- 329 (659) Q Consensus 267 ~~~~~~~~~~~~~~~~v~~-------~g~~~et~~~~~~-~~~~~~~~~~~~~~~~~~~~~s~~v~~~~~~--------- 329 (659) .....++..+..+.++++. +|+++|++....+ .++++..+...++...+. ..|.+++..... T Consensus 316 ~a~~~g~~~D~~~~lv~d~~~~~~g~~g~v~e~~~~~s~~~~~~~~~~~~~~~~~~~~-~~s~~v~~~~~~~~~~~~~~~ 394 (641) T protein:vir:10 316 YANSVGGVNDELHVLVIDVDGKITGNPGSVLERFIGVSKASDAKTSIGEVNYYKEVIK-QQSAYVYWGSHETAPFLGTAA 394 (641) T ss_pred hhhhcCCcccceEEEEEeecceeeccccceeeeeecccccCCcccccccceeeeeeec-cccceEEEecccccccccccc Confidence 2233455666666666654 5578899875443 344445555555555443 345555421100 Q ss_pred ------------------------------------cCCccceeEEeeccccccc-----ccchhhhhhhHhhhhhcccc Q lcl|NC_014792. 330 ------------------------------------WPKGFAGIINLMGGISAND-----QVTAGDLMQGWDLFADREAL 368 (659) Q Consensus 330 ------------------------------------~~~~~~~~~~~~gg~~~~~-----~~~~~~~~~~~~~~~~~~~~ 368 (659) ........+.++||.|+.. .....++.+++++++..+.. T Consensus 395 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~gG~d~~~~~~~~~~~~~~~~tg~~~~~~~e~~ 474 (641) T protein:vir:10 395 NAAAGDWGASALNRRYNLLRSTAGTTSFPAGAVTVGSKNNATHYYRLANGADYSASGALYNLSNVDIATAYELIEDPESQ 474 (641) T ss_pred cccccccccccccccccccccccccccccccccccCCCCcceeEEEeecCcccccccccccccchhHHHHHHHhhhhhhh Confidence 0011123467888877643 23456778899999988888 Q ss_pred cceEEEeccccccchhhhHHHHHHHHHHHHhhCCEEEEEecCcccccccccc-CCHHHHHHHhhccccccccccccccce Q lcl|NC_014792. 369 HINLLIAGAVAGEGDATASTVQKHVVSIADERQDCLAFISPPKGLLVNVPLT-RAVDNLIDWRTGGGSFDTDNMNISTTY 447 (659) Q Consensus 369 ~~~~~~~p~~~~~~~~~~~~v~~~l~~~~~~~~~~~ai~d~p~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~s~~ 447 (659) .++++|+|+... +..+..+++.+|++|||+|||||+|+|+|++..++.... ...+++.+||+. +.+|+| T Consensus 475 ~i~~l~~~~~~~-~~~~~~~v~~~~i~~ce~~~d~~ailD~p~~~~~~~~~~~~~~~~~~~~~~~---------~~~s~y 544 (641) T protein:vir:10 475 VIDYVLSGPAGA-DEAAAIAKATTITTIVESRKDCMAFLSPLRSDVIGVSNTTTVTENLVNYFNQ---------LPSSNY 544 (641) T ss_pred ccceeeecCCCC-CcchhHHHHHHHHHHHHhcCCEEEEEcCCcccccCCCchhhHHHHHHHHHhh---------cCCCce Confidence 899999987643 445678999999999999999999999999877665433 346888999864 457899 Q ss_pred EEEEcCceeEecccCCcceeecHHHHHHHHHHHhhhcCCceECcCCcchhheeccccceeecChhHHHhhhhCCceEEEE Q lcl|NC_014792. 448 AAIDGNYKYQYDKYNDVNRWVPLAADMAGLCARTDDVSQPWMSPPGYNRGQILNVLKLAIEPRQTQRDRMYQEAINPVVG 527 (659) Q Consensus 448 ~~~~~p~~~~~d~~~~~~~~~p~s~~~Ag~~a~~d~~~g~~~span~~~~~i~g~~~~~~~~~~~e~~~Ln~~gin~i~~ 527 (659) +++||||++++||.+++.+++||||++||+|||+|.+|||||||||++++.|+|++++++.+++.|++.||++||||||. T Consensus 545 aa~y~P~~~v~dp~~~~~~~vPpsG~iAGv~ArtD~~rGvwkAPAn~~~~~i~gv~~l~~~~~~~e~~~Lnp~gIN~ir~ 624 (641) T protein:vir:10 545 VVFDSGYKYIYDKYNDVYRYVPCNGDIAGLILETGLEEEPWFSPAGFQRGVLRNAVKLAYSPNKTQRDRLYANRINPVVS 624 (641) T ss_pred EEEEeceeEeecccCCceeEecCCHHHHHHHHhhhccCCceECcCCcccceeeeeeeeeEecChhHHhhhhhcccceEEe Confidence 99999999999999999999999999999999999999999999999988899999999999999999999999999999 Q ss_pred EeCCCeEEEEcccccCCCcccc Q lcl|NC_014792. 528 FAGGDGFVLYGDKTATKVPSPM 549 (659) Q Consensus 528 ~~~~~G~~~wG~rT~~~~~~~~ 549 (659) |++ +|++- |.-.-.. +. T Consensus 625 fpg-~G~v~--~~~~~~~--~~ 641 (641) T protein:vir:10 625 FPG-HAMIN--NNIAFHT--KL 641 (641) T ss_pred cCC-ceeec--ceeeeee--cC Confidence 997 78653 2221110 11 No 20 >protein:vir:79181 Length: 390 # NCBI annotation: gp30, phage tail sheath protein # Family: family:all:115 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111061;genbank:gi:134288772;genbank:GeneID:4960700 Probab=100.00 E-value=5.1e-94 Score=532.04 Aligned_cols=378 Identities=15% Similarity=0.131 Sum_probs=302.3 Q ss_pred CceecCceEEEEecCCCccccc-CCcceEEEeecccC-----CCCccEEeCCHHHHHHHcCCcCCCchhHHHHHHHHHcC Q lcl|NC_014792. 1 MALLSPGIELKETTVQSTVVRN-ATGRAALVGKFQWG-----PAFQVTQITNEVELVDLFGGPNNITADYFMSGMNFLQY 74 (659) Q Consensus 1 ~~~~~PGVyveE~~~~~~~~~~-~ts~~afvG~~~~G-----p~~~p~~i~s~~~~~~~fG~~~~~~~~~~~~~~~f~ng 74 (659) |+|.+|||||+|++.+++++.. .|++.+|+|.++.+ |+|+|++|+|+.+|.+.||. ..++.+++..+|.|| T Consensus 2 ~~~~~~Gv~v~e~~~~~~~i~~~~tav~~~vg~a~dad~~~~p~n~pv~its~~~~~~~~g~---~~tL~~al~~~~~~~ 78 (390) T protein:vir:79 2 PQDYHHGVRVIEINEGGRPIRSVSTAVLGVVCTAADADASAFPLNTPVLLTNVVAALGKAGK---KGTLRRTLDAIGKQT 78 (390) T ss_pred ccccCCCeEEEEcCCCcccccccCCceeEEEEecCCCCccccccccceEeecHHHHHHhcCC---Cccchhhhhhhcccc Confidence 6799999999999999986655 69999999999886 89999999999999999994 566778999999999 Q ss_pred CCeEEEEeccCCcccccccccccccccccccccccccccceeeeeeccccccccceeeeeeccCcceeeeeccccccccc Q lcl|NC_014792. 75 GNDLRTVRVVNRDHAKNASPVAGNIESTIATAGSNYAVGDVIQVKHNQTVVETSGRITKVDVDGKILAVFIPSDKIIAFA 154 (659) Q Consensus 75 G~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~ 154 (659) |+.||+||+........ . . . . T Consensus 79 ~~~~~vv~v~~~~~~~~-------------~-~------------------------~----------~----------- 99 (390) T protein:vir:79 79 KPLTVVVRVAEGKDADE-------------T-T------------------------S----------N----------- 99 (390) T ss_pred cceEEEEeecccccccc-------------c-c------------------------c----------e----------- Confidence 99999999743210000 0 0 0 0 Q ss_pred cccceeeeeccceeeEEEeecCCccccccccceeccccceeeecccccccccccceeecccccccceeeeccccccceeE Q lcl|NC_014792. 155 KSVNQYPDLGPAWTAEILTTSSGVSGTITLGKIVTDSGILLTEAENSEEAITSLEFQASLQKYAMPGVVALYPGEIGSTL 234 (659) Q Consensus 155 ~~~~~~~~~~~~~~~~v~~~~~g~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~g~~i 234 (659) . .+. .. T Consensus 100 --------------------------------~---------------------------------------ig~---~~ 105 (390) T protein:vir:79 100 --------------------------------V---------------------------------------IGT---VT 105 (390) T ss_pred --------------------------------e---------------------------------------eec---cc Confidence 0 000 00 Q ss_pred EEEEeecccccccceeeeeeeccccccccceeeeeeeccccccceeeeeccCCceeeeeeeeccccccccccchhhhhhh Q lcl|NC_014792. 235 EVEIVSKAAYDVGASKMLDIYPNGGSRASVARAVFNYGPQTDDQYAIIVRRDGAIVENVVLSTKEGDKDVYGNNIYLDDY 314 (659) Q Consensus 235 ~V~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~et~~~~~~~~~~~~~~~~~~~~~~ 314 (659) . . + . ..|. + . T Consensus 106 ----~---~-------------------------------~--~------~tgl--~-------------------a--- 115 (390) T protein:vir:79 106 ----P---D-------------------------------G--K------YTGI--K-------------------A--- 115 (390) T ss_pred ----c---c-------------------------------c--c------chhh--h-------------------h--- Confidence 0 0 0 0 0000 0 0 Q ss_pred hhcccccceEEeecccCCccceeEEeecccccccccchhhhhhhHhhhhhcccccceEEEeccccccchhhhHHHHHHHH Q lcl|NC_014792. 315 FAKGTSNYIYATSLNWPKGFAGIINLMGGISANDQVTAGDLMQGWDLFADREALHINLLIAGAVAGEGDATASTVQKHVV 394 (659) Q Consensus 315 ~~~~~s~~v~~~~~~~~~~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~v~~~l~ 394 (659) +... ...-...++++++|+.. ..+|+.+|. T Consensus 116 l~~~--------------------------------------------~~~~~~~p~il~ap~~~------~~~v~~~l~ 145 (390) T protein:vir:79 116 LLAA--------------------------------------------QGALGVKPRILAAPGLD------TQPVAAALA 145 (390) T ss_pred hhhh--------------------------------------------hhhhccccccccCCccc------chHHHHHHH Confidence 0000 00000124555555542 346888999 Q ss_pred HHHHhhCCEEEEEecCccccccccccCCHHHHHHHhhccccccccccccccceEEEEcCceeEecccCCcceeecHHHHH Q lcl|NC_014792. 395 SIADERQDCLAFISPPKGLLVNVPLTRAVDNLIDWRTGGGSFDTDNMNISTTYAAIDGNYKYQYDKYNDVNRWVPLAADM 474 (659) Q Consensus 395 ~~~~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~ 474 (659) .+|++++ +|+++|+|.+ .+.+++.+||+. ++|+|+++||||++++|+.++..+++|||+++ T Consensus 146 ~~a~~~~-~~ai~D~p~~--------~t~~~a~~~~~~----------~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~ 206 (390) T protein:vir:79 146 ATAQSLR-AMAYVSASGC--------KTKEEAAAYRRQ----------FGQREIMVIWPDWLGWDDTTNSTAVIPAPAIA 206 (390) T ss_pred Hhhhhcc-eEEEEEccCC--------CCHHHHHHHhcC----------CCCceEEEEcCceeecccccCceeEeehHHHH Confidence 9999876 8999999853 456789999863 46899999999999999999999999999999 Q ss_pred HHHHHHhhhcCCceECcCCcchhheecccc-c--eeecChhHHHhhhhCCceEEEEEeCCCeEEEEcccccCCCccccce Q lcl|NC_014792. 475 AGLCARTDDVSQPWMSPPGYNRGQILNVLK-L--AIEPRQTQRDRMYQEAINPVVGFAGGDGFVLYGDKTATKVPSPMDH 551 (659) Q Consensus 475 Ag~~a~~d~~~g~~~span~~~~~i~g~~~-~--~~~~~~~e~~~Ln~~gin~i~~~~~~~G~~~wG~rT~~~~~~~~~~ 551 (659) ||++||+|.++||||||||+.+.++.++.. . .......|.+.||++||+++++ ++||++||+||+++|+ +|+| T Consensus 207 Ag~~a~~D~~~g~~~spsN~~i~gi~~~~~~~~~~~~~~~~~a~~Ln~~gi~t~~~---~~G~~~wG~rT~~~d~-~~~~ 282 (390) T protein:vir:79 207 AGLRAKIDNDIGWHKTISNVVVNGVSGISADVSWDLQDPATDAGYLNEHEVTTLVN---RNGFRFWGERTCSDDP-KFAF 282 (390) T ss_pred HHHHHhhhccCCcEEccCCceeeccceeeeeccccccccchhhhhhhhcCcEEEEc---CCCEEEEeccccCCCc-ccce Confidence 999999999999999999998766555432 1 1233456778999999999854 3799999999998765 7999 Q ss_pred eehhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhccceeeeEEEEccCCCCHHHhhCCEEEEEEEEE Q lcl|NC_014792. 552 INVRRLTNMLKKNIGDASKYKLFELNDNFTRASFRMETSQYLDGIRALGGIYEGRVVCDTTNNTPSVIDRNEFVASIYYK 631 (659) Q Consensus 552 i~vrR~~~~i~~~i~~~~~~~v~epn~~~l~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~~i~~~ 631 (659) |++||||+||+++|+++++|+|||||++.+|++|+++++.||++||++|+|+||+|+||+++||+++|++|+|+++|+++ T Consensus 283 i~vrR~~~~i~~~i~~~~~~~v~e~~~~~~~~~i~~~i~~~L~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~~i~~~ 362 (390) T protein:vir:79 283 ENYTRTAQVAADSIAEAQMPVVDGPLNPSLARDIVESINGWFRQQVANGYLIGGSAWIDPEPNTADILASGKAYIDYDYT 362 (390) T ss_pred eeehhhHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHhhCCEEEEEEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ecCCceEEEEEEEEeecCee--EEEecC Q lcl|NC_014792. 632 PARSINYIVLNFVATSTGAD--FDELIG 657 (659) Q Consensus 632 p~~p~e~i~~~~~~~~~~~~--~~e~~~ 657 (659) |++|+|||+|++..+....+ +++|.+ T Consensus 363 p~~p~e~i~~~~~~~~~~~~~~~~~v~~ 390 (390) T protein:vir:79 363 PVPPLENLVLRQRITDRFLADFPARVAG 390 (390) T ss_pred ecCCcceEEEEEEEchHHHHHHHHHhcC Confidence 99999999999999988866 666666 No 21 >protein:vir:79141 Length: 391 # NCBI annotation: major tail seath protein gpFI # Family: family:all:115 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165276;genbank:gi:145708101;genbank:GeneID:5247152 Probab=100.00 E-value=9.5e-94 Score=530.56 Aligned_cols=376 Identities=14% Similarity=0.129 Sum_probs=301.2 Q ss_pred CceecCceEEEEecCCCccc-ccCCcceEEEeecc-----cCCCCccEEeCCHHHHHHHcCCcCCCchhHHHHHHHHHcC Q lcl|NC_014792. 1 MALLSPGIELKETTVQSTVV-RNATGRAALVGKFQ-----WGPAFQVTQITNEVELVDLFGGPNNITADYFMSGMNFLQY 74 (659) Q Consensus 1 ~~~~~PGVyveE~~~~~~~~-~~~ts~~afvG~~~-----~Gp~~~p~~i~s~~~~~~~fG~~~~~~~~~~~~~~~f~ng 74 (659) |++.+|||||+|++.+++++ ++.|++++|||+++ .+|+|+|++|+|+.||...||. ..++.+++..+|.|| T Consensus 2 ~~~~~pGv~v~e~~~~~~~i~~~~tav~~~vg~a~~a~~~~~p~n~pv~iss~~~~~~~~g~---~gtl~~al~~~~~~g 78 (391) T protein:vir:79 2 PTDYHHGVRVVELNDGTRPIRTIETAVAGIVCTADDADAATFPLDTPVLLTNPQAYIGKAGD---KGTLAHTLDAITDQT 78 (391) T ss_pred CCCCCCCeEEEECCCCcccccccCCceEEEEeecccccccccccccCEEeccHHHHHHhcCC---ccccchhhhhhhccc Confidence 77889999999999888755 45699999999986 6899999999999999999994 567788999999999 Q ss_pred CCeEEEEeccCCcccccccccccccccccccccccccccceeeeeeccccccccceeeeeeccCcceeeeeccccccccc Q lcl|NC_014792. 75 GNDLRTVRVVNRDHAKNASPVAGNIESTIATAGSNYAVGDVIQVKHNQTVVETSGRITKVDVDGKILAVFIPSDKIIAFA 154 (659) Q Consensus 75 G~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~ 154 (659) |..||+|++........ . ... T Consensus 79 g~~~~vv~~~~~~~~~~-------------~------------------------------------~~~---------- 99 (391) T protein:vir:79 79 NPLTVVVRVAGGASEAE-------------T------------------------------------TSN---------- 99 (391) T ss_pred ccceeeecccccccccc-------------c------------------------------------ccc---------- Confidence 99999998743210000 0 000 Q ss_pred cccceeeeeccceeeEEEeecCCccccccccceeccccceeeecccccccccccceeecccccccceeeeccccccceeE Q lcl|NC_014792. 155 KSVNQYPDLGPAWTAEILTTSSGVSGTITLGKIVTDSGILLTEAENSEEAITSLEFQASLQKYAMPGVVALYPGEIGSTL 234 (659) Q Consensus 155 ~~~~~~~~~~~~~~~~v~~~~~g~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~g~~i 234 (659) . .+.... T Consensus 100 -----------------------------~------------------------------------------~g~~~~-- 106 (391) T protein:vir:79 100 -----------------------------L------------------------------------------IGTTNA-- 106 (391) T ss_pred -----------------------------c------------------------------------------cccccc-- Confidence 0 000000 Q ss_pred EEEEeecccccccceeeeeeeccccccccceeeeeeeccccccceeeeeccCCceeeeeeeeccccccccccchhhhhhh Q lcl|NC_014792. 235 EVEIVSKAAYDVGASKMLDIYPNGGSRASVARAVFNYGPQTDDQYAIIVRRDGAIVENVVLSTKEGDKDVYGNNIYLDDY 314 (659) Q Consensus 235 ~V~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~et~~~~~~~~~~~~~~~~~~~~~~ 314 (659) .. T Consensus 107 -------~~----------------------------------------------------------------------- 108 (391) T protein:vir:79 107 -------AG----------------------------------------------------------------------- 108 (391) T ss_pred -------hh----------------------------------------------------------------------- Confidence 00 Q ss_pred hhcccccceEEeecccCCccceeEEeecccccccccchhhhhhhHhhhhh---cccccceEEEeccccccchhhhHHHHH Q lcl|NC_014792. 315 FAKGTSNYIYATSLNWPKGFAGIINLMGGISANDQVTAGDLMQGWDLFAD---REALHINLLIAGAVAGEGDATASTVQK 391 (659) Q Consensus 315 ~~~~~s~~v~~~~~~~~~~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~p~~~~~~~~~~~~v~~ 391 (659) ..+++..+.. .....+.++++|+. +..+++. T Consensus 109 ----------------------------------------~~tGl~~l~~~~~~~~~~p~~l~~p~~------~~~~v~~ 142 (391) T protein:vir:79 109 ----------------------------------------RYTGMKALLTARNRFGVAPRILAVPGL------DSLPVGT 142 (391) T ss_pred ----------------------------------------hhHHHhhhhhhhhhhcccchhhcCCcc------chhHHHH Confidence 0000000000 00001122233332 3456889 Q ss_pred HHHHHHHhhCCEEEEEecCccccccccccCCHHHHHHHhhccccccccccccccceEEEEcCceeEecccCCcceeecHH Q lcl|NC_014792. 392 HVVSIADERQDCLAFISPPKGLLVNVPLTRAVDNLIDWRTGGGSFDTDNMNISTTYAAIDGNYKYQYDKYNDVNRWVPLA 471 (659) Q Consensus 392 ~l~~~~~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s 471 (659) +|+.+|++++ +++++|+|.+ .+.+++.+|++. ++|+|+++||||++++|+.++..+++||| T Consensus 143 al~~~~~~~~-~~ai~d~p~~--------~t~~~a~~~~~~----------~~s~~~a~~~P~~~~~d~~~~~~~~~p~s 203 (391) T protein:vir:79 143 ELVTIAQKLR-AFAYLSAYGC--------QTKEEAVAYRSN----------FGQREAMVMWPDFVGWDTAANAETTLWAT 203 (391) T ss_pred HHHHHHhhcC-cEEEEECCCC--------CCHHHHHHHHhc----------cCCceeEEecceeeeecCcCCceeeechH Confidence 9999999987 7899999854 467889999874 45889999999999999999999999999 Q ss_pred HHHHHHHHHhhhcCCceECcCCcchhheecccc-c--eeecChhHHHhhhhCCceEEEEEeCCCeEEEEcccccCCCccc Q lcl|NC_014792. 472 ADMAGLCARTDDVSQPWMSPPGYNRGQILNVLK-L--AIEPRQTQRDRMYQEAINPVVGFAGGDGFVLYGDKTATKVPSP 548 (659) Q Consensus 472 ~~~Ag~~a~~d~~~g~~~span~~~~~i~g~~~-~--~~~~~~~e~~~Ln~~gin~i~~~~~~~G~~~wG~rT~~~~~~~ 548 (659) +++||++||+|.++||||||||+.+.++.|... + .......|.+.||.+|||++++ ++||++||+||+++++ + T Consensus 204 ~~~AG~~a~~D~~~g~~~spaN~~l~gi~~~~~~~~~~~~~~~~~~~~Ln~~~I~t~~~---~~G~~~wG~rT~~~d~-~ 279 (391) T protein:vir:79 204 ARAVGLRAKIDNDTGWHKTLSNVAVGGVTGLSRDVFWDLQDPATDAGYLNANEVTTLVH---RDGYRFWGSRTCSADP-L 279 (391) T ss_pred HHHHHHHHHhhhcccceeccCCceehhhhccccccccccccccchhhhhhhcCceEEEC---CCcEEEEcccccCCCc-c Confidence 999999999999999999999998665554332 1 2233456788999999999854 4799999999998764 8 Q ss_pred cceeehhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhccceeeeEEEEccCCCCHHHhhCCEEEEEE Q lcl|NC_014792. 549 MDHINVRRLTNMLKKNIGDASKYKLFELNDNFTRASFRMETSQYLDGIRALGGIYEGRVVCDTTNNTPSVIDRNEFVASI 628 (659) Q Consensus 549 ~~~i~vrR~~~~i~~~i~~~~~~~v~epn~~~l~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~~i 628 (659) |+||++||++++|+++|+++++|+|||||++.+|++|+++++.||++||++|+|+||+|+||+++||+++|++|+|+++| T Consensus 280 ~~~i~~rR~~~~i~~~i~~~~~~~v~epn~~~~~~~i~~~i~~~l~~l~~~g~l~g~~v~~~~~~nt~~~i~~G~~~~~i 359 (391) T protein:vir:79 280 FAFENYTRTAQVLADTMAEAHMWANDLPMTPTLVRDLLEGINAKLRMLTRNGYLLGGAAWFDADANSKDTLKAGQLAIDY 359 (391) T ss_pred cceeehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeceEEEEecCCCCHHHhhCCEEEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEecCCceEEEEEEEEeecCee--EEEecCC Q lcl|NC_014792. 629 YYKPARSINYIVLNFVATSTGAD--FDELIGV 658 (659) Q Consensus 629 ~~~p~~p~e~i~~~~~~~~~~~~--~~e~~~~ 658 (659) +++|++|+|||+|++.++..... ++||.-+ T Consensus 360 ~~~p~~p~e~i~~~~~~~~~~~~~~~~~v~~a 391 (391) T protein:vir:79 360 DYTPVPPLENLTFRQRITDRYLMQFAEAVKAA 391 (391) T ss_pred EEEecCCcceEEEEEEEchHHHHHHHHHhhcC Confidence 99999999999999999999877 7777777 No 22 >protein:vir:103993 Length: 390 # NCBI annotation: phage tail sheath protein # Family: family:all:115 # MgeID: mge:1665 # MgeName: phi52237 # Cross-refs: genbank:acc:YP_293729;genbank:gi:72537699;genbank:GeneID:3608123 Probab=100.00 E-value=1.1e-93 Score=530.22 Aligned_cols=378 Identities=16% Similarity=0.150 Sum_probs=299.5 Q ss_pred CceecCceEEEEecCCCccccc-CCcceEEEeecccC-----CCCccEEeCCHHHHHHHcCCcCCCchhHHHHHHHHHcC Q lcl|NC_014792. 1 MALLSPGIELKETTVQSTVVRN-ATGRAALVGKFQWG-----PAFQVTQITNEVELVDLFGGPNNITADYFMSGMNFLQY 74 (659) Q Consensus 1 ~~~~~PGVyveE~~~~~~~~~~-~ts~~afvG~~~~G-----p~~~p~~i~s~~~~~~~fG~~~~~~~~~~~~~~~f~ng 74 (659) |+|.+|||||+|++.+++++.. .|++++|||.++++ |+|+|++|+|+.+|...||. ..++.+++..+|.|| T Consensus 2 ~~~~~~Gv~v~e~~~g~~~i~~~~tav~g~vg~a~~ad~~~~pln~pv~i~s~~~~~~~~g~---~gtL~~al~~~~~~g 78 (390) T protein:vir:10 2 PQDYHHGVRVIEINEGGRPIRSVSTAVLGVVCTAADADASAFPLNTPVLLTNVVAALGKAGK---KGTLRRTLDAIGKQT 78 (390) T ss_pred cccccCCeEEEEcCCCcccccccCcceeEEEEcccCcCccccccccceEeccHHHHHhhcCC---Cceehhhhhhhcccc Confidence 6688999999999999886655 69999999999875 99999999999999999994 567888999999999 Q ss_pred CCeEEEEeccCCcccccccccccccccccccccccccccceeeeeeccccccccceeeeeeccCcceeeeeccccccccc Q lcl|NC_014792. 75 GNDLRTVRVVNRDHAKNASPVAGNIESTIATAGSNYAVGDVIQVKHNQTVVETSGRITKVDVDGKILAVFIPSDKIIAFA 154 (659) Q Consensus 75 G~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~ 154 (659) |+.||+||+...+....... ..+ T Consensus 79 g~~~~vv~v~~~~~~~~~~~------------------------------------------------~~i--------- 101 (390) T protein:vir:10 79 KPLTVVVRVAEGKDADETTS------------------------------------------------NVI--------- 101 (390) T ss_pred CceEEEEEeccccccccccc------------------------------------------------ccc--------- Confidence 99999999853211000000 000 Q ss_pred cccceeeeeccceeeEEEeecCCccccccccceeccccceeeecccccccccccceeecccccccceeeeccccccceeE Q lcl|NC_014792. 155 KSVNQYPDLGPAWTAEILTTSSGVSGTITLGKIVTDSGILLTEAENSEEAITSLEFQASLQKYAMPGVVALYPGEIGSTL 234 (659) Q Consensus 155 ~~~~~~~~~~~~~~~~v~~~~~g~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~g~~i 234 (659) .+. ...+. ..+ T Consensus 102 ---------------------g~~----------~~~~~--------------------------------~tg------ 112 (390) T protein:vir:10 102 ---------------------GTV----------TPDGK--------------------------------YTG------ 112 (390) T ss_pred ---------------------ccc----------ccccc--------------------------------cch------ Confidence 000 00000 000 Q ss_pred EEEEeecccccccceeeeeeeccccccccceeeeeeeccccccceeeeeccCCceeeeeeeeccccccccccchhhhhhh Q lcl|NC_014792. 235 EVEIVSKAAYDVGASKMLDIYPNGGSRASVARAVFNYGPQTDDQYAIIVRRDGAIVENVVLSTKEGDKDVYGNNIYLDDY 314 (659) Q Consensus 235 ~V~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~et~~~~~~~~~~~~~~~~~~~~~~ 314 (659) .+ . T Consensus 113 -------------------------------------------------------~~----------------------a 115 (390) T protein:vir:10 113 -------------------------------------------------------IK----------------------A 115 (390) T ss_pred -------------------------------------------------------hh----------------------h Confidence 00 0 Q ss_pred hhcccccceEEeecccCCccceeEEeecccccccccchhhhhhhHhhhhhcccccceEEEeccccccchhhhHHHHHHHH Q lcl|NC_014792. 315 FAKGTSNYIYATSLNWPKGFAGIINLMGGISANDQVTAGDLMQGWDLFADREALHINLLIAGAVAGEGDATASTVQKHVV 394 (659) Q Consensus 315 ~~~~~s~~v~~~~~~~~~~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~v~~~l~ 394 (659) +... ...-...++++++|+. +..+|+++|+ T Consensus 116 l~~~--------------------------------------------~~~~~~~p~il~ap~~------~~~~v~~~l~ 145 (390) T protein:vir:10 116 LLAA--------------------------------------------QGALGVKPRILAAPGL------DTQPVAAALA 145 (390) T ss_pred hhhh--------------------------------------------hhhhcceehhhccccc------chHHHHHHHH Confidence 0000 0000011233444443 2346889999 Q ss_pred HHHHhhCCEEEEEecCccccccccccCCHHHHHHHhhccccccccccccccceEEEEcCceeEecccCCcceeecHHHHH Q lcl|NC_014792. 395 SIADERQDCLAFISPPKGLLVNVPLTRAVDNLIDWRTGGGSFDTDNMNISTTYAAIDGNYKYQYDKYNDVNRWVPLAADM 474 (659) Q Consensus 395 ~~~~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~ 474 (659) .+|++++ +++++|+|.+ .+.+++.+||+. ++|+|+++||||++++|+.++..+++|||+++ T Consensus 146 ~~a~~~~-~~aivD~p~~--------~t~~~a~~~~~~----------~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~ 206 (390) T protein:vir:10 146 ATAQSLR-AMAYVSASGC--------KTKEEAAAYRKQ----------FGQREIMVIWPDWLGWDDTTNSTAVIPAPAIA 206 (390) T ss_pred Hhhcccc-eEEEEecCCC--------CCHHHHHHHhhc----------cCCceEEEEcCceEeecccCCcccccchHHHH Confidence 9999887 7999999853 467889999874 46899999999999999999999999999999 Q ss_pred HHHHHHhhhcCCceECcCCcchhheecccc-c--eeecChhHHHhhhhCCceEEEEEeCCCeEEEEcccccCCCccccce Q lcl|NC_014792. 475 AGLCARTDDVSQPWMSPPGYNRGQILNVLK-L--AIEPRQTQRDRMYQEAINPVVGFAGGDGFVLYGDKTATKVPSPMDH 551 (659) Q Consensus 475 Ag~~a~~d~~~g~~~span~~~~~i~g~~~-~--~~~~~~~e~~~Ln~~gin~i~~~~~~~G~~~wG~rT~~~~~~~~~~ 551 (659) ||++||+|.++||||||||+.+.++.++.. . .......|.+.||++||+++++ ++||++||+||++.|+ +|+| T Consensus 207 Agl~a~~D~~~g~~~spaN~~l~gi~~~~~~~~~~~~~~~~~~~~ln~~gi~t~~~---~~G~~~wG~rT~s~d~-~~~~ 282 (390) T protein:vir:10 207 AGLRAKIDNDIGWHKTISNVVVNGVSGISADVSWDLQDPATDAGYLNEHEVTTLVN---RNGFRFWGERTCSDDP-KFAF 282 (390) T ss_pred HHHHHHhhcCCCcEECcCCceeeceeecceecccccccccchhhhhhhcCcEEEEc---CCCEEEEcccccCCCc-ccce Confidence 999999999999999999998776666432 2 2333456788999999999864 3799999999998764 8999 Q ss_pred eehhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhccceeeeEEEEccCCCCHHHhhCCEEEEEEEEE Q lcl|NC_014792. 552 INVRRLTNMLKKNIGDASKYKLFELNDNFTRASFRMETSQYLDGIRALGGIYEGRVVCDTTNNTPSVIDRNEFVASIYYK 631 (659) Q Consensus 552 i~vrR~~~~i~~~i~~~~~~~v~epn~~~l~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~~i~~~ 631 (659) |++||||++|+++|+++++|+|||||++.+|++|++++++||++||++|+|+||+|+||+++||+++|++|+|+++|+++ T Consensus 283 i~~rR~~~~i~~~i~~~~~~~v~e~n~~~~~~~i~~~i~~~L~~l~~~g~l~g~~v~~d~~~nt~~~i~~G~~~~~v~~~ 362 (390) T protein:vir:10 283 ENYTRTAQVAGDSIAEAQMPVVDGPLNPSLARDIVESINGWFRQQVANGYLIGGSAWIDPEPNTADILASGKAYIDYDYT 362 (390) T ss_pred eehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEccCCCCHHHhhCCeEEEEEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ecCCceEEEEEEEEeecCee--EEEecC Q lcl|NC_014792. 632 PARSINYIVLNFVATSTGAD--FDELIG 657 (659) Q Consensus 632 p~~p~e~i~~~~~~~~~~~~--~~e~~~ 657 (659) |++|+|||+|+++++..... +++|.. T Consensus 363 p~~pae~I~~~~~~~~~~~~~~~~~~~~ 390 (390) T protein:vir:10 363 PVPPLENLVLRQRITDRFLADFPARVAG 390 (390) T ss_pred ecCCcceEEEEEEEchHHHHHHHHHhcC Confidence 99999999999998887654 333333 No 23 >protein:vir:78206 Length: 390 # NCBI annotation: gp33, phage tail sheath protein # Family: family:all:115 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111183;genbank:gi:134288694;genbank:GeneID:4960666 Probab=100.00 E-value=1.1e-93 Score=530.22 Aligned_cols=378 Identities=16% Similarity=0.150 Sum_probs=299.5 Q ss_pred CceecCceEEEEecCCCccccc-CCcceEEEeecccC-----CCCccEEeCCHHHHHHHcCCcCCCchhHHHHHHHHHcC Q lcl|NC_014792. 1 MALLSPGIELKETTVQSTVVRN-ATGRAALVGKFQWG-----PAFQVTQITNEVELVDLFGGPNNITADYFMSGMNFLQY 74 (659) Q Consensus 1 ~~~~~PGVyveE~~~~~~~~~~-~ts~~afvG~~~~G-----p~~~p~~i~s~~~~~~~fG~~~~~~~~~~~~~~~f~ng 74 (659) |+|.+|||||+|++.+++++.. .|++++|||.++++ |+|+|++|+|+.+|...||. ..++.+++..+|.|| T Consensus 2 ~~~~~~Gv~v~e~~~g~~~i~~~~tav~g~vg~a~~ad~~~~pln~pv~i~s~~~~~~~~g~---~gtL~~al~~~~~~g 78 (390) T protein:vir:78 2 PQDYHHGVRVIEINEGGRPIRSVSTAVLGVVCTAADADASAFPLNTPVLLTNVVAALGKAGK---KGTLRRTLDAIGKQT 78 (390) T ss_pred cccccCCeEEEEcCCCcccccccCcceeEEEEcccCcCccccccccceEeccHHHHHhhcCC---Cceehhhhhhhcccc Confidence 6688999999999999886655 69999999999875 99999999999999999994 567888999999999 Q ss_pred CCeEEEEeccCCcccccccccccccccccccccccccccceeeeeeccccccccceeeeeeccCcceeeeeccccccccc Q lcl|NC_014792. 75 GNDLRTVRVVNRDHAKNASPVAGNIESTIATAGSNYAVGDVIQVKHNQTVVETSGRITKVDVDGKILAVFIPSDKIIAFA 154 (659) Q Consensus 75 G~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~ 154 (659) |+.||+||+...+....... ..+ T Consensus 79 g~~~~vv~v~~~~~~~~~~~------------------------------------------------~~i--------- 101 (390) T protein:vir:78 79 KPLTVVVRVAEGKDADETTS------------------------------------------------NVI--------- 101 (390) T ss_pred CceEEEEEeccccccccccc------------------------------------------------ccc--------- Confidence 99999999853211000000 000 Q ss_pred cccceeeeeccceeeEEEeecCCccccccccceeccccceeeecccccccccccceeecccccccceeeeccccccceeE Q lcl|NC_014792. 155 KSVNQYPDLGPAWTAEILTTSSGVSGTITLGKIVTDSGILLTEAENSEEAITSLEFQASLQKYAMPGVVALYPGEIGSTL 234 (659) Q Consensus 155 ~~~~~~~~~~~~~~~~v~~~~~g~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~g~~i 234 (659) .+. ...+. ..+ T Consensus 102 ---------------------g~~----------~~~~~--------------------------------~tg------ 112 (390) T protein:vir:78 102 ---------------------GTV----------TPDGK--------------------------------YTG------ 112 (390) T ss_pred ---------------------ccc----------ccccc--------------------------------cch------ Confidence 000 00000 000 Q ss_pred EEEEeecccccccceeeeeeeccccccccceeeeeeeccccccceeeeeccCCceeeeeeeeccccccccccchhhhhhh Q lcl|NC_014792. 235 EVEIVSKAAYDVGASKMLDIYPNGGSRASVARAVFNYGPQTDDQYAIIVRRDGAIVENVVLSTKEGDKDVYGNNIYLDDY 314 (659) Q Consensus 235 ~V~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~et~~~~~~~~~~~~~~~~~~~~~~ 314 (659) .+ . T Consensus 113 -------------------------------------------------------~~----------------------a 115 (390) T protein:vir:78 113 -------------------------------------------------------IK----------------------A 115 (390) T ss_pred -------------------------------------------------------hh----------------------h Confidence 00 0 Q ss_pred hhcccccceEEeecccCCccceeEEeecccccccccchhhhhhhHhhhhhcccccceEEEeccccccchhhhHHHHHHHH Q lcl|NC_014792. 315 FAKGTSNYIYATSLNWPKGFAGIINLMGGISANDQVTAGDLMQGWDLFADREALHINLLIAGAVAGEGDATASTVQKHVV 394 (659) Q Consensus 315 ~~~~~s~~v~~~~~~~~~~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~v~~~l~ 394 (659) +... ...-...++++++|+. +..+|+++|+ T Consensus 116 l~~~--------------------------------------------~~~~~~~p~il~ap~~------~~~~v~~~l~ 145 (390) T protein:vir:78 116 LLAA--------------------------------------------QGALGVKPRILAAPGL------DTQPVAAALA 145 (390) T ss_pred hhhh--------------------------------------------hhhhcceehhhccccc------chHHHHHHHH Confidence 0000 0000011233444443 2346889999 Q ss_pred HHHHhhCCEEEEEecCccccccccccCCHHHHHHHhhccccccccccccccceEEEEcCceeEecccCCcceeecHHHHH Q lcl|NC_014792. 395 SIADERQDCLAFISPPKGLLVNVPLTRAVDNLIDWRTGGGSFDTDNMNISTTYAAIDGNYKYQYDKYNDVNRWVPLAADM 474 (659) Q Consensus 395 ~~~~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~ 474 (659) .+|++++ +++++|+|.+ .+.+++.+||+. ++|+|+++||||++++|+.++..+++|||+++ T Consensus 146 ~~a~~~~-~~aivD~p~~--------~t~~~a~~~~~~----------~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~ 206 (390) T protein:vir:78 146 ATAQSLR-AMAYVSASGC--------KTKEEAAAYRKQ----------FGQREIMVIWPDWLGWDDTTNSTAVIPAPAIA 206 (390) T ss_pred Hhhcccc-eEEEEecCCC--------CCHHHHHHHhhc----------cCCceEEEEcCceEeecccCCcccccchHHHH Confidence 9999887 7999999853 467889999874 46899999999999999999999999999999 Q ss_pred HHHHHHhhhcCCceECcCCcchhheecccc-c--eeecChhHHHhhhhCCceEEEEEeCCCeEEEEcccccCCCccccce Q lcl|NC_014792. 475 AGLCARTDDVSQPWMSPPGYNRGQILNVLK-L--AIEPRQTQRDRMYQEAINPVVGFAGGDGFVLYGDKTATKVPSPMDH 551 (659) Q Consensus 475 Ag~~a~~d~~~g~~~span~~~~~i~g~~~-~--~~~~~~~e~~~Ln~~gin~i~~~~~~~G~~~wG~rT~~~~~~~~~~ 551 (659) ||++||+|.++||||||||+.+.++.++.. . .......|.+.||++||+++++ ++||++||+||++.|+ +|+| T Consensus 207 Agl~a~~D~~~g~~~spaN~~l~gi~~~~~~~~~~~~~~~~~~~~ln~~gi~t~~~---~~G~~~wG~rT~s~d~-~~~~ 282 (390) T protein:vir:78 207 AGLRAKIDNDIGWHKTISNVVVNGVSGISADVSWDLQDPATDAGYLNEHEVTTLVN---RNGFRFWGERTCSDDP-KFAF 282 (390) T ss_pred HHHHHHhhcCCCcEECcCCceeeceeecceecccccccccchhhhhhhcCcEEEEc---CCCEEEEcccccCCCc-ccce Confidence 999999999999999999998776666432 2 2333456788999999999864 3799999999998764 8999 Q ss_pred eehhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhccceeeeEEEEccCCCCHHHhhCCEEEEEEEEE Q lcl|NC_014792. 552 INVRRLTNMLKKNIGDASKYKLFELNDNFTRASFRMETSQYLDGIRALGGIYEGRVVCDTTNNTPSVIDRNEFVASIYYK 631 (659) Q Consensus 552 i~vrR~~~~i~~~i~~~~~~~v~epn~~~l~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~~i~~~ 631 (659) |++||||++|+++|+++++|+|||||++.+|++|++++++||++||++|+|+||+|+||+++||+++|++|+|+++|+++ T Consensus 283 i~~rR~~~~i~~~i~~~~~~~v~e~n~~~~~~~i~~~i~~~L~~l~~~g~l~g~~v~~d~~~nt~~~i~~G~~~~~v~~~ 362 (390) T protein:vir:78 283 ENYTRTAQVAGDSIAEAQMPVVDGPLNPSLARDIVESINGWFRQQVANGYLIGGSAWIDPEPNTADILASGKAYIDYDYT 362 (390) T ss_pred eehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEccCCCCHHHhhCCeEEEEEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ecCCceEEEEEEEEeecCee--EEEecC Q lcl|NC_014792. 632 PARSINYIVLNFVATSTGAD--FDELIG 657 (659) Q Consensus 632 p~~p~e~i~~~~~~~~~~~~--~~e~~~ 657 (659) |++|+|||+|+++++..... +++|.. T Consensus 363 p~~pae~I~~~~~~~~~~~~~~~~~~~~ 390 (390) T protein:vir:78 363 PVPPLENLVLRQRITDRFLADFPARVAG 390 (390) T ss_pred ecCCcceEEEEEEEchHHHHHHHHHhcC Confidence 99999999999998887654 333333 No 24 >protein:vir:6079 Length: 396 # NCBI annotation: gpFI # Family: family:all:115 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878220;genbank:gi:33438919;genbank:GeneID:1457754 Probab=100.00 E-value=3.2e-93 Score=527.68 Aligned_cols=382 Identities=13% Similarity=0.116 Sum_probs=305.7 Q ss_pred CceecCceEEEEecCCCccc-ccCCcceEEEeeccc-----CCCCccEEeCCHHHHHHHcCCcCCCchhHHHHHHHHHcC Q lcl|NC_014792. 1 MALLSPGIELKETTVQSTVV-RNATGRAALVGKFQW-----GPAFQVTQITNEVELVDLFGGPNNITADYFMSGMNFLQY 74 (659) Q Consensus 1 ~~~~~PGVyveE~~~~~~~~-~~~ts~~afvG~~~~-----Gp~~~p~~i~s~~~~~~~fG~~~~~~~~~~~~~~~f~ng 74 (659) |+.-.|||||+|++.+++++ +++|++++|||.+++ .|.++|++|+|+.+|...|| ....+.+++..+|.|| T Consensus 1 m~~~~~Gv~v~e~~~~~~~v~~~~tav~~fvGta~~~~~~~~p~~~p~~v~s~~~~~~~~g---~~~tl~~a~~~~~~~g 77 (396) T protein:vir:60 1 MSDYHHGVQVLEINEGTRVISTVSTAIVGMVCTASDADAEIFPLNKPVLITNVQSAIAKAG---KKGTLAASLQAIADQS 77 (396) T ss_pred CCCCCCCeEEEEcCCCcccccccCceeEEEEecccccccccccCccCeEeechHHHHHhhc---CcchhHHHHHHHhhcc Confidence 99778999999999988755 557999999999865 48899999999999999999 4567888999999999 Q ss_pred CCeEEEEeccCCcccccccccccccccccccccccccccceeeeeeccccccccceeeeeeccCcceeeeeccccccccc Q lcl|NC_014792. 75 GNDLRTVRVVNRDHAKNASPVAGNIESTIATAGSNYAVGDVIQVKHNQTVVETSGRITKVDVDGKILAVFIPSDKIIAFA 154 (659) Q Consensus 75 G~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~ 154 (659) |..||++|+....... . T Consensus 78 g~~~~vv~~~~~~~~~----------------~----------------------------------------------- 94 (396) T protein:vir:60 78 KPVTVVVRVEDGTGED----------------E----------------------------------------------- 94 (396) T ss_pred CceEEEEecccccccc----------------c----------------------------------------------- Confidence 9999999874321000 0 Q ss_pred cccceeeeeccceeeEEEeecCCccccccccceeccccceeeecccccccccccceeecccccccceeeeccccccceeE Q lcl|NC_014792. 155 KSVNQYPDLGPAWTAEILTTSSGVSGTITLGKIVTDSGILLTEAENSEEAITSLEFQASLQKYAMPGVVALYPGEIGSTL 234 (659) Q Consensus 155 ~~~~~~~~~~~~~~~~v~~~~~g~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~g~~i 234 (659) +.... .. ........ T Consensus 95 ----------------------~~~~~-----------~~--------------------------------~~~~~~~~ 109 (396) T protein:vir:60 95 ----------------------ETKLA-----------QT--------------------------------VSNIIGTT 109 (396) T ss_pred ----------------------ccccc-----------cc--------------------------------cccccccc Confidence 00000 00 00000000 Q ss_pred EEEEeecccccccceeeeeeeccccccccceeeeeeeccccccceeeeeccCCceeeeeeeeccccccccccchhhhhhh Q lcl|NC_014792. 235 EVEIVSKAAYDVGASKMLDIYPNGGSRASVARAVFNYGPQTDDQYAIIVRRDGAIVENVVLSTKEGDKDVYGNNIYLDDY 314 (659) Q Consensus 235 ~V~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~et~~~~~~~~~~~~~~~~~~~~~~ 314 (659) . . . + . T Consensus 110 d-------~-------------------------------~-----------~----------~---------------- 114 (396) T protein:vir:60 110 D-------E-------------------------------N-----------G----------Q---------------- 114 (396) T ss_pred c-------c-------------------------------c-----------c----------c---------------- Confidence 0 0 0 0 0 Q ss_pred hhcccccceEEeecccCCccceeEEeecccccccccchhhhhhhHhhh---hhcccccceEEEeccccccchhhhHHHHH Q lcl|NC_014792. 315 FAKGTSNYIYATSLNWPKGFAGIINLMGGISANDQVTAGDLMQGWDLF---ADREALHINLLIAGAVAGEGDATASTVQK 391 (659) Q Consensus 315 ~~~~~s~~v~~~~~~~~~~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~p~~~~~~~~~~~~v~~ 391 (659) .+++.++ .......++++++|+. +...|++ T Consensus 115 -----------------------------------------~tg~~al~~~~~~~~~~~~il~ap~~------~~~~v~~ 147 (396) T protein:vir:60 115 -----------------------------------------YTGLKALLAAESVTGVKPRILGVPGL------DTKEVAV 147 (396) T ss_pred -----------------------------------------ccchhhhhhcccceeeeeeecccccc------ccHHHHH Confidence 0000000 0001123456666654 3467999 Q ss_pred HHHHHHHhhCCEEEEEecCccccccccccCCHHHHHHHhhccccccccccccccceEEEEcCceeEecccCCcceeecHH Q lcl|NC_014792. 392 HVVSIADERQDCLAFISPPKGLLVNVPLTRAVDNLIDWRTGGGSFDTDNMNISTTYAAIDGNYKYQYDKYNDVNRWVPLA 471 (659) Q Consensus 392 ~l~~~~~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s 471 (659) +|+++|++++ +++++|+|.+ .+.+++.+||+. ++|.|+++||||++++|+.++..+++||| T Consensus 148 al~~~~~~~~-~~~i~d~p~~--------~~~~~a~~~~~~----------~~s~~~~~~~p~~~~~d~~~~~~~~~p~s 208 (396) T protein:vir:60 148 ALASVCQKLR-AFGYISAWGC--------KTISEVKAYRQN----------FSQRELMVIWPDFLAWDTVASTTATAYAT 208 (396) T ss_pred HHHHHhccCC-eEEEEeCCCC--------CCHHHHHHHHhh----------cCCceEEEEeCceeeecccCCceeEEchh Confidence 9999999886 8999999864 467889999974 45889999999999999999999999999 Q ss_pred HHHHHHHHHhhhcCCceECcCCcchhheecccc---ceeecChhHHHhhhhCCceEEEEEeCCCeEEEEcccccCCCccc Q lcl|NC_014792. 472 ADMAGLCARTDDVSQPWMSPPGYNRGQILNVLK---LAIEPRQTQRDRMYQEAINPVVGFAGGDGFVLYGDKTATKVPSP 548 (659) Q Consensus 472 ~~~Ag~~a~~d~~~g~~~span~~~~~i~g~~~---~~~~~~~~e~~~Ln~~gin~i~~~~~~~G~~~wG~rT~~~~~~~ 548 (659) +++||++||+|.++|+||||||+.+.++.+... .....++.|++.||++|||++.+ + +|+++||+||+++++ + T Consensus 209 ~~~AG~~a~~d~~~g~~~spaN~~l~gi~~~~~~~~~~~~~~~~~~~~Ln~~gI~~~~~--~-~G~~~wG~rT~~~d~-~ 284 (396) T protein:vir:60 209 ARALGLRAKIDQEQGWHKTLSNVGVNGVTGISASVFWDLQESGTDADLLNESGVTTLIR--R-DGFRFWGNRTCSDDP-L 284 (396) T ss_pred HHHHHHHHHhhhccCcEeCcCCceecceeeceeecccccCCCcchhhhhhhcCcEEEEc--C-CCEEEEcccccCCCc-c Confidence 999999999999999999999998776655432 23445678999999999999954 4 799999999998765 7 Q ss_pred cceeehhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhccceeeeEEEEccCCCCHHHhhCCEEEEEE Q lcl|NC_014792. 549 MDHINVRRLTNMLKKNIGDASKYKLFELNDNFTRASFRMETSQYLDGIRALGGIYEGRVVCDTTNNTPSVIDRNEFVASI 628 (659) Q Consensus 549 ~~~i~vrR~~~~i~~~i~~~~~~~v~epn~~~l~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~~i 628 (659) |+||++||+++||+++|+++++|+||||||+.+|++|+++|+.||++||++|+|+||+|+||+++||+++|++|+|+++| T Consensus 285 ~~~i~~rR~~~~i~~~i~~~~~~~v~e~n~~~~~~~i~~~i~~~l~~l~~~gal~g~~~~~d~~~nt~~~i~~G~~~~~i 364 (396) T protein:vir:60 285 FLFENYTRTAQVLADTMAEAHMWAVDKPITATLIRDIVDGINAKFRELKTNGYIVDATCWFSEESNDAETLKAGKLYIDY 364 (396) T ss_pred cceeehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeceEEEEecCCCCHHHhhCCEEEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEecCCceEEEEEEEEeecCee--EEEecCC Q lcl|NC_014792. 629 YYKPARSINYIVLNFVATSTGAD--FDELIGV 658 (659) Q Consensus 629 ~~~p~~p~e~i~~~~~~~~~~~~--~~e~~~~ 658 (659) +++|++|+|||+|++.++....+ |+||..- T Consensus 365 ~~~p~~pae~I~~~~~~~~~~~~~~~~~~~~~ 396 (396) T protein:vir:60 365 DYTPVPPLENLTLRQRITDKYLANLVTSVNSN 396 (396) T ss_pred EEEecCCcceEEEEEEEchHHHHHHHHHhhcC Confidence 99999999999999999888655 6666655 No 25 >protein:vir:5711 Length: 396 # NCBI annotation: gpFI # Family: family:all:115 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839870;genbank:gi:30065725;genbank:GeneID:1260618 Probab=100.00 E-value=7.2e-93 Score=525.74 Aligned_cols=385 Identities=14% Similarity=0.120 Sum_probs=303.1 Q ss_pred CceecCceEEEEecCCCcccc-cCCcceEEEeecccC-----CCCccEEeCCHHHHHHHcCCcCCCchhHHHHHHHHHcC Q lcl|NC_014792. 1 MALLSPGIELKETTVQSTVVR-NATGRAALVGKFQWG-----PAFQVTQITNEVELVDLFGGPNNITADYFMSGMNFLQY 74 (659) Q Consensus 1 ~~~~~PGVyveE~~~~~~~~~-~~ts~~afvG~~~~G-----p~~~p~~i~s~~~~~~~fG~~~~~~~~~~~~~~~f~ng 74 (659) |+.-+|||||+|++.+++++. +.|++++|||.++++ |.++|++|+|+.+|...||. ..++.+++..+|.|| T Consensus 1 m~~~~~GV~v~e~~~g~~~i~~v~tav~~~vg~a~~~d~~~~~~~~pv~i~s~~~~~~~~g~---~~tl~~al~~~~~~~ 77 (396) T protein:vir:57 1 MSDYHHGVQVLEINDGTRVISTVSTAIVGMVCTASDADAETFPLNKPVLITNVQSAIAKAGK---KGTLAASLQAIADQS 77 (396) T ss_pred CCCCCCceEEEEcCCCcccccccCCceEEEEEeccCCCcccccCccCeEeecchhhhhhccc---ccchHHHHHHhhhcC Confidence 998889999999999987654 569999999999876 78999999999999999984 567888999999999 Q ss_pred CCeEEEEeccCCcccccccccccccccccccccccccccceeeeeeccccccccceeeeeeccCcceeeeeccccccccc Q lcl|NC_014792. 75 GNDLRTVRVVNRDHAKNASPVAGNIESTIATAGSNYAVGDVIQVKHNQTVVETSGRITKVDVDGKILAVFIPSDKIIAFA 154 (659) Q Consensus 75 G~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~ 154 (659) |..||++|+.......... . .. T Consensus 78 ~~~~~vv~~~~~~~~~~~~-------------~-----------------------------~a---------------- 99 (396) T protein:vir:57 78 KPVTVVVRVEDGTGDDEET-------------K-----------------------------LA---------------- 99 (396) T ss_pred CceeEeeeccccccccccc-------------c-----------------------------cc---------------- Confidence 9999999864321000000 0 00 Q ss_pred cccceeeeeccceeeEEEeecCCccccccccceeccccceeeecccccccccccceeecccccccceeeeccccccceeE Q lcl|NC_014792. 155 KSVNQYPDLGPAWTAEILTTSSGVSGTITLGKIVTDSGILLTEAENSEEAITSLEFQASLQKYAMPGVVALYPGEIGSTL 234 (659) Q Consensus 155 ~~~~~~~~~~~~~~~~v~~~~~g~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~g~~i 234 (659) . . . . .. .|. T Consensus 100 --~-------------------t---~---~-----------~i----------------------------iG~----- 108 (396) T protein:vir:57 100 --Q-------------------T---V---S-----------NI----------------------------IGT----- 108 (396) T ss_pred --c-------------------c---c---e-----------ee----------------------------eee----- Confidence 0 0 0 0 00 000 Q ss_pred EEEEeecccccccceeeeeeeccccccccceeeeeeeccccccceeeeeccCCceeeeeeeeccccccccccchhhhhhh Q lcl|NC_014792. 235 EVEIVSKAAYDVGASKMLDIYPNGGSRASVARAVFNYGPQTDDQYAIIVRRDGAIVENVVLSTKEGDKDVYGNNIYLDDY 314 (659) Q Consensus 235 ~V~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~et~~~~~~~~~~~~~~~~~~~~~~ 314 (659) +. . . .. ..| +. . T Consensus 109 ---~~---~-------------------------------~-~~------~tg-----------------------l~-a 120 (396) T protein:vir:57 109 ---TD---E-------------------------------N-GQ------YTG-----------------------LK-A 120 (396) T ss_pred ---cc---c-------------------------------c-cc------chh-----------------------hh-h Confidence 00 0 0 00 000 00 0 Q ss_pred hhcccccceEEeecccCCccceeEEeecccccccccchhhhhhhHhhhhhcccccceEEEeccccccchhhhHHHHHHHH Q lcl|NC_014792. 315 FAKGTSNYIYATSLNWPKGFAGIINLMGGISANDQVTAGDLMQGWDLFADREALHINLLIAGAVAGEGDATASTVQKHVV 394 (659) Q Consensus 315 ~~~~~s~~v~~~~~~~~~~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~v~~~l~ 394 (659) +.+. .......++++++|+. ....++++|. T Consensus 121 l~~~--------------------------------------------~~~~~~~p~i~~ap~~------~~~~v~~al~ 150 (396) T protein:vir:57 121 LMGA--------------------------------------------ESVTGVKPRILGVPGL------DTKEVAVALA 150 (396) T ss_pred hhhc--------------------------------------------ccceeEEeccccCccc------chhHHHHHHH Confidence 0000 0000012334444443 3456899999 Q ss_pred HHHHhhCCEEEEEecCccccccccccCCHHHHHHHhhccccccccccccccceEEEEcCceeEecccCCcceeecHHHHH Q lcl|NC_014792. 395 SIADERQDCLAFISPPKGLLVNVPLTRAVDNLIDWRTGGGSFDTDNMNISTTYAAIDGNYKYQYDKYNDVNRWVPLAADM 474 (659) Q Consensus 395 ~~~~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~ 474 (659) .+|++++ +|+++|+|.+ .+.+++.+||+. ++|.|+++||||++++|+.++..+++|||+++ T Consensus 151 ~~~~~~~-~~~~~d~p~~--------~~~~~~~~~~~~----------~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~ 211 (396) T protein:vir:57 151 SVCQELN-AFGYISAWGC--------KTISEVKAYRQN----------FSQRELMVIWPDFLAWDTVTSTTATAYATARA 211 (396) T ss_pred HHhhhCc-eEEEEcCCCC--------CCHHHHHHHHhc----------cCCceEEEEcceeeeecccCCceeEEehhHHH Confidence 9998775 9999999864 467889999974 46899999999999999999999999999999 Q ss_pred HHHHHHhhhcCCceECcCCcchhheecccc---ceeecChhHHHhhhhCCceEEEEEeCCCeEEEEcccccCCCccccce Q lcl|NC_014792. 475 AGLCARTDDVSQPWMSPPGYNRGQILNVLK---LAIEPRQTQRDRMYQEAINPVVGFAGGDGFVLYGDKTATKVPSPMDH 551 (659) Q Consensus 475 Ag~~a~~d~~~g~~~span~~~~~i~g~~~---~~~~~~~~e~~~Ln~~gin~i~~~~~~~G~~~wG~rT~~~~~~~~~~ 551 (659) ||++||+|.++|+||||||+.+.++.++.. .....++.|++.||++|||++++ + +||++||+||+++++ +|+| T Consensus 212 Ag~~a~~d~~~g~~~spaN~~l~gi~~~~~~~~~~~~~~~~~~~~Ln~~gi~t~~~--~-~G~~~wG~rT~~~d~-~~~~ 287 (396) T protein:vir:57 212 LGLRAKIDQEQGWHKTLSNVGVNGVTGISASVFWDLQKPGTDADLLNEAGVTTLVR--R-DGFRFWGNRTCSDDP-LFLF 287 (396) T ss_pred HHHHHHhhhccCcEeccCCceeccccccceecccccCCcchhhhhhhhcCcEEEEc--C-CCEEEEcccccCCCc-ccce Confidence 999999999999999999998776665432 23344578999999999999854 3 799999999998765 7999 Q ss_pred eehhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhccceeeeEEEEccCCCCHHHhhCCEEEEEEEEE Q lcl|NC_014792. 552 INVRRLTNMLKKNIGDASKYKLFELNDNFTRASFRMETSQYLDGIRALGGIYEGRVVCDTTNNTPSVIDRNEFVASIYYK 631 (659) Q Consensus 552 i~vrR~~~~i~~~i~~~~~~~v~epn~~~l~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~~i~~~ 631 (659) |++||++++|+++|+++++|+|||||++.+|++|+++++.||++||++|+|+||+|+||+++||+++|++|+|+++|+++ T Consensus 288 i~vrR~~~~i~~~i~~~~~~~v~e~n~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~n~~~~i~~G~~~~~v~~~ 367 (396) T protein:vir:57 288 ESYTRTAQVLADTMAEAHMWAIDKPITATLIRDIIDGINAKFRELKNNGYIVDGTCWFSEESNDAETLKAGKLYIDYDYT 367 (396) T ss_pred eehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeceEEEEecCCCCHHHhhCCeEEEEEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ecCCceEEEEEEEEeecCee--EEEecCC Q lcl|NC_014792. 632 PARSINYIVLNFVATSTGAD--FDELIGV 658 (659) Q Consensus 632 p~~p~e~i~~~~~~~~~~~~--~~e~~~~ 658 (659) |++|+|||+|+++++....+ +++|+.- T Consensus 368 p~~p~e~I~~~~~~~~~~~~~~~~~~~~~ 396 (396) T protein:vir:57 368 PVPPLENLTLRQRITSRYLASLVTSVNSN 396 (396) T ss_pred ecCCcceEEEEEEEchHHHHHHHHHhhcC Confidence 99999999999999988865 4444444 No 26 >protein:vir:98553 Length: 395 # NCBI annotation: gp21 # Family: family:all:115 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958076;genbank:gi:41057373;genbank:GeneID:2744224 Probab=100.00 E-value=1.3e-92 Score=524.37 Aligned_cols=381 Identities=15% Similarity=0.129 Sum_probs=308.5 Q ss_pred CceecCceEEEEecCCCcccc-cCCcceEEEeecccC-----CCCccEEeCCHHHHHHHcCCcCCCchhHHHHHHHHHcC Q lcl|NC_014792. 1 MALLSPGIELKETTVQSTVVR-NATGRAALVGKFQWG-----PAFQVTQITNEVELVDLFGGPNNITADYFMSGMNFLQY 74 (659) Q Consensus 1 ~~~~~PGVyveE~~~~~~~~~-~~ts~~afvG~~~~G-----p~~~p~~i~s~~~~~~~fG~~~~~~~~~~~~~~~f~ng 74 (659) |+.-.|||||+|++.+++++. +.|++.+|||+++.+ |+++|++|+|+.+|...||. ..++..++..+|.|| T Consensus 1 m~~~~~GV~v~e~~~g~~~v~~v~tav~~~vgta~~~~~~~~p~~~pv~v~s~~~~~~~~g~---~~tl~~al~~~~~~~ 77 (395) T protein:vir:98 1 MSDFHHGTQVIEINDGTRVISTVATAVVGMVCTASDADATLFPLNEPVLITNVQSAIAKAGK---KGTLAASLQAIADQS 77 (395) T ss_pred CCCCCCCeEEEEcCCCcccccccCcceEEEEeeccCCCccccccccceEeechHHhHhhccc---ccchhhHHHHHhhcc Confidence 998889999999999997654 569999999999765 78999999999999999994 457788999999999 Q ss_pred CCeEEEEeccCCcccccccccccccccccccccccccccceeeeeeccccccccceeeeeeccCcceeeeeccccccccc Q lcl|NC_014792. 75 GNDLRTVRVVNRDHAKNASPVAGNIESTIATAGSNYAVGDVIQVKHNQTVVETSGRITKVDVDGKILAVFIPSDKIIAFA 154 (659) Q Consensus 75 G~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~ 154 (659) |+.||++|+........ . ... T Consensus 78 ~~~~~vv~~~~~~~~~~---------------~----------------------------------~~~---------- 98 (395) T protein:vir:98 78 KPVTVVVRVEDGTGDDE---------------E----------------------------------AAL---------- 98 (395) T ss_pred CceEEEeeccccccccc---------------c----------------------------------ccc---------- Confidence 99999998743210000 0 000 Q ss_pred cccceeeeeccceeeEEEeecCCccccccccceeccccceeeecccccccccccceeecccccccceeeeccccccceeE Q lcl|NC_014792. 155 KSVNQYPDLGPAWTAEILTTSSGVSGTITLGKIVTDSGILLTEAENSEEAITSLEFQASLQKYAMPGVVALYPGEIGSTL 234 (659) Q Consensus 155 ~~~~~~~~~~~~~~~~v~~~~~g~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~g~~i 234 (659) . ...... .+... T Consensus 99 --------------------------a-----------~~~~~i----------------------------~g~~~--- 110 (395) T protein:vir:98 99 --------------------------A-----------QTVSNI----------------------------IGGTD--- 110 (395) T ss_pred --------------------------c-----------cccccc----------------------------ccccc--- Confidence 0 000000 00000 Q ss_pred EEEEeecccccccceeeeeeeccccccccceeeeeeeccccccceeeeeccCCceeeeeeeeccccccccccchhhhhhh Q lcl|NC_014792. 235 EVEIVSKAAYDVGASKMLDIYPNGGSRASVARAVFNYGPQTDDQYAIIVRRDGAIVENVVLSTKEGDKDVYGNNIYLDDY 314 (659) Q Consensus 235 ~V~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~et~~~~~~~~~~~~~~~~~~~~~~ 314 (659) .. + T Consensus 111 ---------------------------------------~~-----------~--------------------------- 113 (395) T protein:vir:98 111 ---------------------------------------EN-----------G--------------------------- 113 (395) T ss_pred ---------------------------------------cc-----------c--------------------------- Confidence 00 0 Q ss_pred hhcccccceEEeecccCCccceeEEeecccccccccchhhhhhhHhhhh---hcccccceEEEeccccccchhhhHHHHH Q lcl|NC_014792. 315 FAKGTSNYIYATSLNWPKGFAGIINLMGGISANDQVTAGDLMQGWDLFA---DREALHINLLIAGAVAGEGDATASTVQK 391 (659) Q Consensus 315 ~~~~~s~~v~~~~~~~~~~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~p~~~~~~~~~~~~v~~ 391 (659) ..+++.++. ......+.++++|+.. ..+++. T Consensus 114 ----------------------------------------~~Tgl~al~~~~~~~~~~p~il~ap~~~------~~~v~~ 147 (395) T protein:vir:98 114 ----------------------------------------KYTGIKALLTAQAVTGVKPRILGVPGLD------TKEVAV 147 (395) T ss_pred ----------------------------------------chhHHHHHhhhhhhhccchhhccccccc------ccHHHH Confidence 000000000 0001234566666653 356889 Q ss_pred HHHHHHHhhCCEEEEEecCccccccccccCCHHHHHHHhhccccccccccccccceEEEEcCceeEecccCCcceeecHH Q lcl|NC_014792. 392 HVVSIADERQDCLAFISPPKGLLVNVPLTRAVDNLIDWRTGGGSFDTDNMNISTTYAAIDGNYKYQYDKYNDVNRWVPLA 471 (659) Q Consensus 392 ~l~~~~~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s 471 (659) +|.++|++++ +|+++|+|.+ .+.+++.+||+. ++|+|+++||||++++|+.++..+++||| T Consensus 148 al~~~~~~~~-~~~~~d~p~~--------~t~~~a~~~~~~----------~~s~~~~~~~p~~~~~d~~~~~~~~~p~s 208 (395) T protein:vir:98 148 ALASAAIKLR-AFAYVSAWGC--------KTISEAMEYRKN----------FSQRELMVIWPDFLAWDTVKNTTATAYAT 208 (395) T ss_pred HHHHHhhhcC-cEEEEEcCCC--------CCHHHHHHHHhc----------cCCceEEEEecceeEecccCCceeeechH Confidence 9999999887 8999999864 467899999974 45889999999999999999999999999 Q ss_pred HHHHHHHHHhhhcCCceECcCCcchhheecccc---ceeecChhHHHhhhhCCceEEEEEeCCCeEEEEcccccCCCccc Q lcl|NC_014792. 472 ADMAGLCARTDDVSQPWMSPPGYNRGQILNVLK---LAIEPRQTQRDRMYQEAINPVVGFAGGDGFVLYGDKTATKVPSP 548 (659) Q Consensus 472 ~~~Ag~~a~~d~~~g~~~span~~~~~i~g~~~---~~~~~~~~e~~~Ln~~gin~i~~~~~~~G~~~wG~rT~~~~~~~ 548 (659) +++||++||+|.++|+||||||+.+.++.|+.. ....+++.|++.||++|||++.+ + +|+++||+||++++ ++ T Consensus 209 ~~~AG~~a~~d~~~g~~~spaN~~i~gi~~~~~~~~~~~~~~~~~~~~Ln~~gI~~~~~--~-~G~~~wG~rT~s~d-~~ 284 (395) T protein:vir:98 209 ARALGLRAYIDQTVGWHKTLSNVGVQGVTGISASVFWDLQASGTDADLLNEAGVTTLVR--K-DGFRFWGNRTCSDD-PL 284 (395) T ss_pred HHHHHHHHHhhcccCcEeccCCceeecccccceecccccCCCcchHHhhhhcCcEEEEc--C-CCEEEEcccccCCC-cc Confidence 999999999999999999999998776666432 23445688999999999999853 4 79999999999876 48 Q ss_pred cceeehhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhccceeeeEEEEccCCCCHHHhhCCEEEEEE Q lcl|NC_014792. 549 MDHINVRRLTNMLKKNIGDASKYKLFELNDNFTRASFRMETSQYLDGIRALGGIYEGRVVCDTTNNTPSVIDRNEFVASI 628 (659) Q Consensus 549 ~~~i~vrR~~~~i~~~i~~~~~~~v~epn~~~l~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~~i 628 (659) |+||++||++++|+++|++.++|++|||||+.||++|+++++.||++||++|+|+||+|+||+++||+++|++|+|+++| T Consensus 285 ~~~i~~rR~~~~i~~~i~~~~~~~v~e~~~~~~~~~i~~~i~~~L~~l~~~g~l~g~~v~~d~~~nt~~~i~~G~~~~~i 364 (395) T protein:vir:98 285 FLFENYTRTAQVLADTMAEAHMWAVDKPITATLIRDIVDGINAKFRELKSNGYIVEGKCWFDEESNDKETLKAGKLYIDY 364 (395) T ss_pred cceeehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeceEEEEecCCCCHHHhhCCeEEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEecCCceEEEEEEEEeecCee--EEEecC Q lcl|NC_014792. 629 YYKPARSINYIVLNFVATSTGAD--FDELIG 657 (659) Q Consensus 629 ~~~p~~p~e~i~~~~~~~~~~~~--~~e~~~ 657 (659) +++|++|+|||+|+++.+..+.. |+||.. T Consensus 365 ~~~p~~p~e~I~~~~~~~~~~~~~~~~~~~~ 395 (395) T protein:vir:98 365 DYTPVPPLESLTLRQRITDKYLVNLAESVNS 395 (395) T ss_pred EEEecCCcceEEEEEEEchHHHHHHHHHhcC Confidence 99999999999999999999977 888888 No 27 >protein:vir:2035 Length: 396 # NCBI annotation: gpFI # Family: family:all:115 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046778;genbank:gi:9630349;genbank:GeneID:1261516 Probab=100.00 E-value=1.2e-91 Score=519.04 Aligned_cols=385 Identities=12% Similarity=0.106 Sum_probs=300.4 Q ss_pred CceecCceEEEEecCCCcccc-cCCcceEEEeecccC-----CCCccEEeCCHHHHHHHcCCcCCCchhHHHHHHHHHcC Q lcl|NC_014792. 1 MALLSPGIELKETTVQSTVVR-NATGRAALVGKFQWG-----PAFQVTQITNEVELVDLFGGPNNITADYFMSGMNFLQY 74 (659) Q Consensus 1 ~~~~~PGVyveE~~~~~~~~~-~~ts~~afvG~~~~G-----p~~~p~~i~s~~~~~~~fG~~~~~~~~~~~~~~~f~ng 74 (659) |+.-.|||||+|++.+++++. +.|++++|||.++++ |+++|++|+|+.+|...||+ ...+..++..+|.|| T Consensus 1 m~~~~~GV~v~e~~~g~~~i~~v~tav~~~vg~a~~a~~~~~~l~~pvlvts~~~~~~~~g~---~~tL~~al~~~~~ng 77 (396) T protein:vir:20 1 MSDYHHGVQVLEINEGTRVISTVSTAIVGMVCTASDADAETFPLNKPVLITNVQSAISKAGK---KGTLAASLQAIADQS 77 (396) T ss_pred CCCCCCCeEEEEcCCCcceeeecCCceeEEEeeeccCCCccccCccCEEeechHHHHhhccc---ccchhhhhhhhhccC Confidence 997779999999999987665 569999999998664 78999999999999999994 456778899999999 Q ss_pred CCeEEEEeccCCcccccccccccccccccccccccccccceeeeeeccccccccceeeeeeccCcceeeeeccccccccc Q lcl|NC_014792. 75 GNDLRTVRVVNRDHAKNASPVAGNIESTIATAGSNYAVGDVIQVKHNQTVVETSGRITKVDVDGKILAVFIPSDKIIAFA 154 (659) Q Consensus 75 G~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~ 154 (659) |..||++|+........ . T Consensus 78 g~~~~v~~~~~~~~~~~---------------~----------------------------------------------- 95 (396) T protein:vir:20 78 KPVTVVMRVEDGTGDDE---------------E----------------------------------------------- 95 (396) T ss_pred ceeEEEEeccccccccc---------------c----------------------------------------------- Confidence 99999998743210000 0 Q ss_pred cccceeeeeccceeeEEEeecCCccccccccceeccccceeeecccccccccccceeecccccccceeeeccccccceeE Q lcl|NC_014792. 155 KSVNQYPDLGPAWTAEILTTSSGVSGTITLGKIVTDSGILLTEAENSEEAITSLEFQASLQKYAMPGVVALYPGEIGSTL 234 (659) Q Consensus 155 ~~~~~~~~~~~~~~~~v~~~~~g~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~g~~i 234 (659) .+. . .. ....... T Consensus 96 ---------------------~~~--a-----------~t--------------------------------~~~~~~~- 108 (396) T protein:vir:20 96 ---------------------TKL--A-----------QT--------------------------------VSNIIGT- 108 (396) T ss_pred ---------------------ccc--c-----------cc--------------------------------ccccccc- Confidence 000 0 00 0000000 Q ss_pred EEEEeecccccccceeeeeeeccccccccceeeeeeeccccccceeeeeccCCceeeeeeeeccccccccccchhhhhhh Q lcl|NC_014792. 235 EVEIVSKAAYDVGASKMLDIYPNGGSRASVARAVFNYGPQTDDQYAIIVRRDGAIVENVVLSTKEGDKDVYGNNIYLDDY 314 (659) Q Consensus 235 ~V~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~et~~~~~~~~~~~~~~~~~~~~~~ 314 (659) .. .. . . ..+ + .. T Consensus 109 ---~~--~~-------------------------------~-~-------~tg-----------------------~-~a 120 (396) T protein:vir:20 109 ---TD--EN-------------------------------G-Q-------YTG-----------------------L-KA 120 (396) T ss_pred ---cc--cc-------------------------------c-c-------cch-----------------------h-hh Confidence 00 00 0 0 000 0 00 Q ss_pred hhcccccceEEeecccCCccceeEEeecccccccccchhhhhhhHhhhhhcccccceEEEeccccccchhhhHHHHHHHH Q lcl|NC_014792. 315 FAKGTSNYIYATSLNWPKGFAGIINLMGGISANDQVTAGDLMQGWDLFADREALHINLLIAGAVAGEGDATASTVQKHVV 394 (659) Q Consensus 315 ~~~~~s~~v~~~~~~~~~~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~v~~~l~ 394 (659) +.+. .......+.++++|+. ....|+.+|+ T Consensus 121 l~~~--------------------------------------------~~~~~~~p~i~~ap~~------~~~~v~~al~ 150 (396) T protein:vir:20 121 MLAA--------------------------------------------ESVTGVKPRILGVPGL------DTKEVAVALA 150 (396) T ss_pred hhhh--------------------------------------------ccccccchhhhhhhhh------ccHHHHHHHH Confidence 0000 0000012234445543 3456899999 Q ss_pred HHHHhhCCEEEEEecCccccccccccCCHHHHHHHhhccccccccccccccceEEEEcCceeEecccCCcceeecHHHHH Q lcl|NC_014792. 395 SIADERQDCLAFISPPKGLLVNVPLTRAVDNLIDWRTGGGSFDTDNMNISTTYAAIDGNYKYQYDKYNDVNRWVPLAADM 474 (659) Q Consensus 395 ~~~~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~ 474 (659) ++|++++ +|+++|+|.+ .+.+++.+||+. ++|+|+++||||++++|+.++..+++|||+++ T Consensus 151 ~~~~~~~-~~~~iD~p~~--------~~~~~a~~~r~~----------~~s~~~~~~~P~~~~~d~~~~~~~~~p~s~~~ 211 (396) T protein:vir:20 151 SVCQKLR-AFGYISAWGC--------KTISEVKAYRQN----------FSQRELMVIWPDFLAWDTVTSTTATAYATARA 211 (396) T ss_pred HHHhcCC-cEEEEecCCC--------CCHHHHHHHhhC----------CCCceEEEEcCccccccCcCCcceeechhHHH Confidence 9999887 7899999964 467899999964 45889999999999999999999999999999 Q ss_pred HHHHHHhhhcCCceECcCCcchhheecccc---ceeecChhHHHhhhhCCceEEEEEeCCCeEEEEcccccCCCccccce Q lcl|NC_014792. 475 AGLCARTDDVSQPWMSPPGYNRGQILNVLK---LAIEPRQTQRDRMYQEAINPVVGFAGGDGFVLYGDKTATKVPSPMDH 551 (659) Q Consensus 475 Ag~~a~~d~~~g~~~span~~~~~i~g~~~---~~~~~~~~e~~~Ln~~gin~i~~~~~~~G~~~wG~rT~~~~~~~~~~ 551 (659) ||++||+|.++|+||||||+.+.++.|... ....+++.|++.||++|||++++ + +||++||+||++++ ++|+| T Consensus 212 Ag~~a~~d~~~g~~~spaN~~l~gi~~~~~~~~~~~~~~~~~~~~Ln~~gi~~~~~--~-~G~~~wG~rT~s~d-~~~~~ 287 (396) T protein:vir:20 212 LGLRAKIDQEQGWHKTLSNVGVNGVTGISASVFWDLQESGTDADLLNESGVTTLIR--R-DGFRFWGNRTCSDD-PLFLF 287 (396) T ss_pred HHHHHHhhhhcCcEeccCCceeccceecceecccccCCCcchhhhhhhcCcEEEEc--C-CCEEEEcccccCCC-cccce Confidence 999999999999999999998776666432 23445678999999999999854 4 79999999999876 47999 Q ss_pred eehhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhccceeeeEEEEccCCCCHHHhhCCEEEEEEEEE Q lcl|NC_014792. 552 INVRRLTNMLKKNIGDASKYKLFELNDNFTRASFRMETSQYLDGIRALGGIYEGRVVCDTTNNTPSVIDRNEFVASIYYK 631 (659) Q Consensus 552 i~vrR~~~~i~~~i~~~~~~~v~epn~~~l~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~~i~~~ 631 (659) |++||+++||+++|++.++|+|||||++.+|++|+++++.||++||++|+|+||+|+||+++||+++|++|+|+++|+++ T Consensus 288 i~~rR~~~~i~~~~~~~~~~~v~e~~~~~~~~~i~~~i~~~L~~l~~~G~l~g~~v~~d~~~nt~~~i~~G~~~~~i~~~ 367 (396) T protein:vir:20 288 ENYTRTAQVVADTMAEAHMWAVDKPITATLIRDIVDGINAKFRELKTNGYIVDATCWFSEESNDAETLKAGKLYIDYDYT 367 (396) T ss_pred eehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCcceeceEEEEecCCCCHHHhhCCEEEEEEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ecCCceEEEEEEEEeecCee--EEEecCC Q lcl|NC_014792. 632 PARSINYIVLNFVATSTGAD--FDELIGV 658 (659) Q Consensus 632 p~~p~e~i~~~~~~~~~~~~--~~e~~~~ 658 (659) |++|+|||+|+++++....+ |++|..- T Consensus 368 p~~p~e~i~~~~~~~~~~~~~~~~~~~~~ 396 (396) T protein:vir:20 368 PVPPLENLTLRQRITDKYLANLVTSVNSN 396 (396) T ss_pred ecCCcceEEEEEEEchHHHHHHHHHhhcC Confidence 99999999999998877754 2222222 No 28 >protein:vir:1172 Length: 391 # NCBI annotation: hypothetical protein # Family: family:all:115 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490621;genbank:gi:17313241;genbank:GeneID:927300 Probab=100.00 E-value=1.1e-91 Score=519.18 Aligned_cols=379 Identities=14% Similarity=0.141 Sum_probs=301.0 Q ss_pred CceecCceEEEEecCCCcccc-cCCcceEEEeecc-----cCCCCccEEeCCHHHHHHHcCCcCCCchhHHHHHHHHHcC Q lcl|NC_014792. 1 MALLSPGIELKETTVQSTVVR-NATGRAALVGKFQ-----WGPAFQVTQITNEVELVDLFGGPNNITADYFMSGMNFLQY 74 (659) Q Consensus 1 ~~~~~PGVyveE~~~~~~~~~-~~ts~~afvG~~~-----~Gp~~~p~~i~s~~~~~~~fG~~~~~~~~~~~~~~~f~ng 74 (659) |++.+|||||+|++.+++++. +.|++.+|+|.++ .+|+++|++|+|+.+|...||. ..++.+++..+|.|+ T Consensus 3 ~~~~~~GV~v~e~~~~~~~i~~v~tavig~vg~a~~a~~~~~~~~~p~~v~s~~~~~~~~g~---~~tl~~al~~~~~~~ 79 (391) T protein:vir:11 3 ADQYHHGVRVQEINDGTRPIRTIATAIIGLVATAEDADATAFPLDTPVLITNVQAAIGKAGT---SGTLPASLQAIADQA 79 (391) T ss_pred CCcCCCcEEEEECCCCcceecccCCceeEEEEecCCCCCccccccccEEEecchhhheecCC---Cccchhhhhhhhccc Confidence 779999999999999987665 5699999999998 4699999999999999999984 556778999999999 Q ss_pred CCeEEEEeccCCcccccccccccccccccccccccccccceeeeeeccccccccceeeeeeccCcceeeeeccccccccc Q lcl|NC_014792. 75 GNDLRTVRVVNRDHAKNASPVAGNIESTIATAGSNYAVGDVIQVKHNQTVVETSGRITKVDVDGKILAVFIPSDKIIAFA 154 (659) Q Consensus 75 G~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~ 154 (659) |+.||+||+...+... . .. T Consensus 80 g~~~~vv~~~~~~~~~----------------~-----------------------------------t~---------- 98 (391) T protein:vir:11 80 NAATVVVRVKPGEDEA----------------A-----------------------------------TN---------- 98 (391) T ss_pred cceeEEeeeccccccc----------------c-----------------------------------cc---------- Confidence 9999999974211000 0 00 Q ss_pred cccceeeeeccceeeEEEeecCCccccccccceeccccceeeecccccccccccceeecccccccceeeeccccccceeE Q lcl|NC_014792. 155 KSVNQYPDLGPAWTAEILTTSSGVSGTITLGKIVTDSGILLTEAENSEEAITSLEFQASLQKYAMPGVVALYPGEIGSTL 234 (659) Q Consensus 155 ~~~~~~~~~~~~~~~~v~~~~~g~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~g~~i 234 (659) . +.. +... . T Consensus 99 --------------------------~---------------d~~----------------------------g~~~-a- 107 (391) T protein:vir:11 99 --------------------------S---------------AVI----------------------------GGVS-A- 107 (391) T ss_pred --------------------------h---------------hhh----------------------------cccc-c- Confidence 0 000 0000 0 Q ss_pred EEEEeecccccccceeeeeeeccccccccceeeeeeeccccccceeeeeccCCceeeeeeeeccccccccccchhhhhhh Q lcl|NC_014792. 235 EVEIVSKAAYDVGASKMLDIYPNGGSRASVARAVFNYGPQTDDQYAIIVRRDGAIVENVVLSTKEGDKDVYGNNIYLDDY 314 (659) Q Consensus 235 ~V~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~et~~~~~~~~~~~~~~~~~~~~~~ 314 (659) .. .. .+ .. . T Consensus 108 --------~~-----------------------------~~----------~g-----------------------~~-a 116 (391) T protein:vir:11 108 --------DG-----------------------------KY----------TG-----------------------MK-A 116 (391) T ss_pred --------cc-----------------------------ch----------hh-----------------------hh-h Confidence 00 00 00 00 0 Q ss_pred hhcccccceEEeecccCCccceeEEeecccccccccchhhhhhhHhhhhhcccccceEEEeccccccchhhhHHHHHHHH Q lcl|NC_014792. 315 FAKGTSNYIYATSLNWPKGFAGIINLMGGISANDQVTAGDLMQGWDLFADREALHINLLIAGAVAGEGDATASTVQKHVV 394 (659) Q Consensus 315 ~~~~~s~~v~~~~~~~~~~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~v~~~l~ 394 (659) +.... + . -...+.++++|+. +..+++.+|+ T Consensus 117 ~~~~~-----------~-----------------------------~----~~~~p~~~~ap~~------~~~~v~~al~ 146 (391) T protein:vir:11 117 LLAAK-----------A-----------------------------R----LGVVPRILGVPGL------DTQPVATALI 146 (391) T ss_pred hhhhh-----------h-----------------------------h----heecccccccccc------ccHHHHHHHH Confidence 00000 0 0 0011234444543 3456899999 Q ss_pred HHHHhhCCEEEEEecCccccccccccCCHHHHHHHhhccccccccccccccceEEEEcCceeEecccCCcceeecHHHHH Q lcl|NC_014792. 395 SIADERQDCLAFISPPKGLLVNVPLTRAVDNLIDWRTGGGSFDTDNMNISTTYAAIDGNYKYQYDKYNDVNRWVPLAADM 474 (659) Q Consensus 395 ~~~~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~ 474 (659) ++|++++ +|+++|+|.+ .+.+++.+||+. ++|+|+++||||++++|+.++..+++|||+++ T Consensus 147 ~~~~~~~-~~~i~D~p~~--------~t~~~a~~~r~~----------~~s~~~~~~~p~~~~~~~~~~~~~~~p~s~~~ 207 (391) T protein:vir:11 147 AIAQQLR-AFAYVSASGC--------KTKEEATAYREN----------FAAREAMVIWPDFLTWSTVVNQTVPAPAVAQA 207 (391) T ss_pred Hhhcccc-eEEEEEcCCC--------CCHHHHHHHhhh----------cCCceEEEEcCcceecccccCceEEechHHHH Confidence 9998874 9999999854 467889999963 56899999999999999999999999999999 Q ss_pred HHHHHHhhhcCCceECcCCcchhheecccc---ceeecChhHHHhhhhCCceEEEEEeCCCeEEEEcccccCCCccccce Q lcl|NC_014792. 475 AGLCARTDDVSQPWMSPPGYNRGQILNVLK---LAIEPRQTQRDRMYQEAINPVVGFAGGDGFVLYGDKTATKVPSPMDH 551 (659) Q Consensus 475 Ag~~a~~d~~~g~~~span~~~~~i~g~~~---~~~~~~~~e~~~Ln~~gin~i~~~~~~~G~~~wG~rT~~~~~~~~~~ 551 (659) ||++||+|.++||||||||+.+.++.++.. .+...++.|++.||++|||++++ + +||++||+||++.++ +|+| T Consensus 208 ag~~a~~d~~~g~~~span~~l~gi~~~~~~~~~~~~~~~~~~~~Ln~~gi~~~~~--~-~G~~~wG~rT~~~d~-~~~~ 283 (391) T protein:vir:11 208 LGLRARIDQEVGWHKTLSNVAVNGVTGISADVFWDLQSPSTDANYLNENEVTTLVQ--E-GGFRFWGSRTCSDDP-LFAF 283 (391) T ss_pred HHHHHHhhccCCcEEccCCceeeceeecccccccccCCCcchhhhhhhcCcEEEEc--C-CCEEEEcccccCCCc-ccce Confidence 999999999999999999999877766532 23444678999999999999843 4 799999999998765 7999 Q ss_pred eehhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhccceeeeEEEEccCCCCHHHhhCCEEEEEEEEE Q lcl|NC_014792. 552 INVRRLTNMLKKNIGDASKYKLFELNDNFTRASFRMETSQYLDGIRALGGIYEGRVVCDTTNNTPSVIDRNEFVASIYYK 631 (659) Q Consensus 552 i~vrR~~~~i~~~i~~~~~~~v~epn~~~l~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~~i~~~ 631 (659) |++||+|++|+++|++.++|+|||||++.+|++|+++++.||++||++|+|+||+|+||+++||+++|++|+|+++|+++ T Consensus 284 i~vrR~~~~i~~~~~~~~~~~v~e~n~~~~~~~i~~~i~~~l~~l~~~g~l~g~~~~~~~~~n~~~~i~~G~~~~~i~~~ 363 (391) T protein:vir:11 284 ENYTRTAQVLADTIAEAHMWAVDKPMHPSLVRDILEGVNAKFRELKGLGLIIDAQAWYDPNVNDKDTLKAGKLRITYDYT 363 (391) T ss_pred eehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhccceeceEEEEecCCCCHHHhhCCeEEEEEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ecCCceEEEEEEEEeecCee-EEEecCC Q lcl|NC_014792. 632 PARSINYIVLNFVATSTGAD-FDELIGV 658 (659) Q Consensus 632 p~~p~e~i~~~~~~~~~~~~-~~e~~~~ 658 (659) |++|+|||+++++++..... +.+-+++ T Consensus 364 p~~p~e~i~~~~~~~~~~~~~~~~~~~a 391 (391) T protein:vir:11 364 PVPPLEDLTFFQKITDSYLVDFASRVNA 391 (391) T ss_pred ecCCcceEEEEEEEchHHHHHHHHHhcC Confidence 99999999999999888743 3333333 No 29 >protein:vir:1845 Length: 392 # NCBI annotation: tail sheath protein FI # Family: family:all:115 # MgeID: mge:324 # MgeName: 186 # Cross-refs: genbank:acc:NP_052269;genbank:gi:9634076;genbank:GeneID:1262448 Probab=100.00 E-value=4.6e-91 Score=515.83 Aligned_cols=381 Identities=14% Similarity=0.132 Sum_probs=306.3 Q ss_pred CceecCceEEEEecCCCccccc-CCcceEEEeecccC-----CCCccEEeCCHHHHHHHcCCcCCCchhHHHHHHHHHcC Q lcl|NC_014792. 1 MALLSPGIELKETTVQSTVVRN-ATGRAALVGKFQWG-----PAFQVTQITNEVELVDLFGGPNNITADYFMSGMNFLQY 74 (659) Q Consensus 1 ~~~~~PGVyveE~~~~~~~~~~-~ts~~afvG~~~~G-----p~~~p~~i~s~~~~~~~fG~~~~~~~~~~~~~~~f~ng 74 (659) |+--.|||||+|++++++++.. .|++.+|||.++++ |+++|++|+|+.+|...||. ...+.+++..+|.|| T Consensus 1 m~~~~~Gv~v~e~~~g~~~i~~~~tav~g~vgta~~~~~~~~~~~~p~~its~~~~~~~~g~---~gtl~~al~~~~~ng 77 (392) T protein:vir:18 1 MSDFHHGTKVIEINDGTRVISTVATAIVGMVWTASDADAETFPLNEPVLITNVQSAIAKAGK---KGTLSASLQAIADQS 77 (392) T ss_pred CCCCCCCeEEEEcCCCceeeeccCcceeEEEEeccCCCCcccccccceEeechHHHHhhcCC---CcchHHHHHHhhccc Confidence 9854689999999999987765 59999999999875 89999999999999999995 556788999999999 Q ss_pred CCeEEEEeccCCcccccccccccccccccccccccccccceeeeeeccccccccceeeeeeccCcceeeeeccccccccc Q lcl|NC_014792. 75 GNDLRTVRVVNRDHAKNASPVAGNIESTIATAGSNYAVGDVIQVKHNQTVVETSGRITKVDVDGKILAVFIPSDKIIAFA 154 (659) Q Consensus 75 G~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~ 154 (659) |..|+++|+....... T Consensus 78 g~~~~vv~v~~~~~~~---------------------------------------------------------------- 93 (392) T protein:vir:18 78 KPVTVVVRVAEGTGDD---------------------------------------------------------------- 93 (392) T ss_pred CceEEEeccccccccc---------------------------------------------------------------- Confidence 9999999864311000 Q ss_pred cccceeeeeccceeeEEEeecCCccccccccceeccccceeeecccccccccccceeecccccccceeeeccccccceeE Q lcl|NC_014792. 155 KSVNQYPDLGPAWTAEILTTSSGVSGTITLGKIVTDSGILLTEAENSEEAITSLEFQASLQKYAMPGVVALYPGEIGSTL 234 (659) Q Consensus 155 ~~~~~~~~~~~~~~~~v~~~~~g~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~g~~i 234 (659) + . . ....+. .| .. T Consensus 94 ----------------------~-~-~-----------~t~~dl----------------------------iG---~~- 106 (392) T protein:vir:18 94 ----------------------A-E-A-----------QTTSNI----------------------------IG---GT- 106 (392) T ss_pred ----------------------c-c-c-----------cchhhh----------------------------ee---cc- Confidence 0 0 0 000000 00 00 Q ss_pred EEEEeecccccccceeeeeeeccccccccceeeeeeeccccccceeeeeccCCceeeeeeeeccccccccccchhhhhhh Q lcl|NC_014792. 235 EVEIVSKAAYDVGASKMLDIYPNGGSRASVARAVFNYGPQTDDQYAIIVRRDGAIVENVVLSTKEGDKDVYGNNIYLDDY 314 (659) Q Consensus 235 ~V~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~et~~~~~~~~~~~~~~~~~~~~~~ 314 (659) . .. + ... + .. . T Consensus 107 ----~----------------------------------~~-----------~----------~~t-----g----~~-a 117 (392) T protein:vir:18 107 ----D----------------------------------EN-----------G----------KYT-----G----IK-A 117 (392) T ss_pred ----c----------------------------------cc-----------c----------hhh-----h----HH-H Confidence 0 00 0 000 0 00 0 Q ss_pred hhcccccceEEeecccCCccceeEEeecccccccccchhhhhhhHhhhhhcccccceEEEeccccccchhhhHHHHHHHH Q lcl|NC_014792. 315 FAKGTSNYIYATSLNWPKGFAGIINLMGGISANDQVTAGDLMQGWDLFADREALHINLLIAGAVAGEGDATASTVQKHVV 394 (659) Q Consensus 315 ~~~~~s~~v~~~~~~~~~~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~v~~~l~ 394 (659) + ... ...-...++++++|+.. ..+|+++|. T Consensus 118 l----------------------------------------~~~----~~~~~~~p~il~ap~~~------~~~v~~~l~ 147 (392) T protein:vir:18 118 L----------------------------------------LTA----EAVTGVKPRILGVPGLD------TQEVATALA 147 (392) T ss_pred H----------------------------------------Hhh----hhhhceeehhcccCccc------hHHHHHHHH Confidence 0 000 00001235677777753 457999999 Q ss_pred HHHHhhCCEEEEEecCccccccccccCCHHHHHHHhhccccccccccccccceEEEEcCceeEecccCCcceeecHHHHH Q lcl|NC_014792. 395 SIADERQDCLAFISPPKGLLVNVPLTRAVDNLIDWRTGGGSFDTDNMNISTTYAAIDGNYKYQYDKYNDVNRWVPLAADM 474 (659) Q Consensus 395 ~~~~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~ 474 (659) ++|++++ +|+++|+|.+ .+.+++.+||+. ++|+|+++||||++++|+.++..+++|||+++ T Consensus 148 ~~~~~~~-~~~~~d~~~~--------~~~~~a~~~~~~----------~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~ 208 (392) T protein:vir:18 148 SVCISLR-AFGYVSAWGC--------KTISEAMAYREN----------FSQRELMVIWPDFLAWDTTANATATAYATARA 208 (392) T ss_pred HHHhhcC-cEEEEecCCC--------CCHHHHHHHHhh----------ccCceEEEEeCceeeecccCCceEEechHHHH Confidence 9999887 8999998754 567889999963 46899999999999999999999999999999 Q ss_pred HHHHHHhhhcCCceECcCCcchhheecccc---ceeecChhHHHhhhhCCceEEEEEeCCCeEEEEcccccCCCccccce Q lcl|NC_014792. 475 AGLCARTDDVSQPWMSPPGYNRGQILNVLK---LAIEPRQTQRDRMYQEAINPVVGFAGGDGFVLYGDKTATKVPSPMDH 551 (659) Q Consensus 475 Ag~~a~~d~~~g~~~span~~~~~i~g~~~---~~~~~~~~e~~~Ln~~gin~i~~~~~~~G~~~wG~rT~~~~~~~~~~ 551 (659) ||++||+|.++||||||+|+.+.++.++.. .+..+++.|++.||++|||++++ + +|+++||+||+++++ +|+| T Consensus 209 AG~~a~~d~~~g~~~spaN~~l~gi~~~~~~~~~~~~~~~~~~~~Ln~~gI~t~~~--~-~G~~~wG~rT~~~d~-~~~~ 284 (392) T protein:vir:18 209 LGLRAYIDQTIGWHKTLSNVGVQGVTGISASVFWDLQASGTDADLLNEAGVTTLVR--K-DGFRFWGNRTCSDDP-LFLF 284 (392) T ss_pred HHHHHhhhccCCceEccCCceeeceeecceecccccCCCcchhhhhhhcCceEEEc--C-CCEEEEcccccCCCc-ccce Confidence 999999999999999999998776666432 23445678999999999999853 4 799999999998765 8999 Q ss_pred eehhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhccceeeeEEEEccCCCCHHHhhCCEEEEEEEEE Q lcl|NC_014792. 552 INVRRLTNMLKKNIGDASKYKLFELNDNFTRASFRMETSQYLDGIRALGGIYEGRVVCDTTNNTPSVIDRNEFVASIYYK 631 (659) Q Consensus 552 i~vrR~~~~i~~~i~~~~~~~v~epn~~~l~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~~i~~~ 631 (659) |++||++++|+++|+++++|+|||||++.+|++|+++++.||++||++|+|+||+|+||+++||+++|++|+|+++|+++ T Consensus 285 i~~rR~~~~i~~~i~~~~~~~v~e~n~~~~~~~i~~~i~~~L~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~~v~~~ 364 (392) T protein:vir:18 285 ENYTRTAQVLADTMAEAHMWAVDKPITASLIRDIVDGINAKFRELKSNGYIVDGECWFDEESNDKETLKAGKLYIDYDYT 364 (392) T ss_pred eehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhcCcccceEEEEecCCCCHHHhhCCeEEEEEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ecCCceEEEEEEEEeecCee--EEEecC Q lcl|NC_014792. 632 PARSINYIVLNFVATSTGAD--FDELIG 657 (659) Q Consensus 632 p~~p~e~i~~~~~~~~~~~~--~~e~~~ 657 (659) |++|+|||+|+++....+.+ +++|.. T Consensus 365 p~~p~e~I~~~~~~~~~~~~~~~~~~~~ 392 (392) T protein:vir:18 365 PVPPLESLTLRQRITDKYLVNLAESVNS 392 (392) T ss_pred ecCCcceEEEEEEEchHHHHHHHHHhcC Confidence 99999999999999999876 555555 No 30 >protein:vir:100323 Length: 393 # NCBI annotation: tail sheath protein FI # Family: family:all:115 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655492;genbank:gi:109289960;genbank:GeneID:4157366 Probab=100.00 E-value=2e-90 Score=512.36 Aligned_cols=377 Identities=14% Similarity=0.112 Sum_probs=300.0 Q ss_pred CceecCceEEEEecCCCccccc-CCcceEEEeecccC-----CCCccEEeCCHHHHHHHcCCcCCCchhHHHHHHHHHcC Q lcl|NC_014792. 1 MALLSPGIELKETTVQSTVVRN-ATGRAALVGKFQWG-----PAFQVTQITNEVELVDLFGGPNNITADYFMSGMNFLQY 74 (659) Q Consensus 1 ~~~~~PGVyveE~~~~~~~~~~-~ts~~afvG~~~~G-----p~~~p~~i~s~~~~~~~fG~~~~~~~~~~~~~~~f~ng 74 (659) |++..|||||+|++.+++++.. .|++.+|||+++++ |+|+|++|+|+.||.+.|| ...++.+++..+|.|+ T Consensus 4 ~~~~~~GV~v~e~~~g~~~i~~~~tav~~~vgta~~~~~~~~pln~pv~i~s~~~~~~~~g---~~g~L~~al~~~~~~~ 80 (393) T protein:vir:10 4 LDTYLHGVEVVEVNAGGVTISTAATSVIGVVCTGDQADAETFPLNTPVLITNPLNYLEKAG---STGTLRRTLNSIGSIV 80 (393) T ss_pred CCccCCCeEEEEcCCCcceecccCcceeEEEeeccCcCcccccCccceEecchHHHHHhhC---Cccchhhhhhhhhccc Confidence 4455699999999999976654 69999999999987 9999999999999999999 4567888999999999 Q ss_pred CCeEEEEeccCCcccccccccccccccccccccccccccceeeeeeccccccccceeeeeeccCcceeeeeccccccccc Q lcl|NC_014792. 75 GNDLRTVRVVNRDHAKNASPVAGNIESTIATAGSNYAVGDVIQVKHNQTVVETSGRITKVDVDGKILAVFIPSDKIIAFA 154 (659) Q Consensus 75 G~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~ 154 (659) |+.||+||+...+.... . .. T Consensus 81 ~~~~~vv~v~~~~~~~~----------------------------------------t---------~~----------- 100 (393) T protein:vir:10 81 KTPTVIVRVAESDDSDT----------------------------------------L---------TA----------- 100 (393) T ss_pred CceEEEeecccCccccc----------------------------------------c---------cc----------- Confidence 99999999843210000 0 00 Q ss_pred cccceeeeeccceeeEEEeecCCccccccccceeccccceeeecccccccccccceeecccccccceeeeccccccceeE Q lcl|NC_014792. 155 KSVNQYPDLGPAWTAEILTTSSGVSGTITLGKIVTDSGILLTEAENSEEAITSLEFQASLQKYAMPGVVALYPGEIGSTL 234 (659) Q Consensus 155 ~~~~~~~~~~~~~~~~v~~~~~g~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~g~~i 234 (659) . .. | .. T Consensus 101 --------------------------------------~----ii----------------------------g---~~- 106 (393) T protein:vir:10 101 --------------------------------------N----IV----------------------------G---TQ- 106 (393) T ss_pred --------------------------------------c----cc----------------------------c---cc- Confidence 0 00 0 00 Q ss_pred EEEEeecccccccceeeeeeeccccccccceeeeeeeccccccceeeeeccCCceeeeeeeeccccccccccchhhhhhh Q lcl|NC_014792. 235 EVEIVSKAAYDVGASKMLDIYPNGGSRASVARAVFNYGPQTDDQYAIIVRRDGAIVENVVLSTKEGDKDVYGNNIYLDDY 314 (659) Q Consensus 235 ~V~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~et~~~~~~~~~~~~~~~~~~~~~~ 314 (659) .+ . ..+ + + .. T Consensus 107 ----~~--~----------------------------------~~t------g-----------------------l-~a 116 (393) T protein:vir:10 107 ----EN--G----------------------------------KFT------G-----------------------I-KA 116 (393) T ss_pred ----cc--c----------------------------------hhh------H-----------------------H-HH Confidence 00 0 000 0 0 00 Q ss_pred hhcccccceEEeecccCCccceeEEeecccccccccchhhhhhhHhhhhhcccccceEEEeccccccchhhhHHHHHHHH Q lcl|NC_014792. 315 FAKGTSNYIYATSLNWPKGFAGIINLMGGISANDQVTAGDLMQGWDLFADREALHINLLIAGAVAGEGDATASTVQKHVV 394 (659) Q Consensus 315 ~~~~~s~~v~~~~~~~~~~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~v~~~l~ 394 (659) +... .......++|+++|++. ..+++.+|+ T Consensus 117 l~~~--------------------------------------------~~~~~~~p~li~apg~~------~~~~~~al~ 146 (393) T protein:vir:10 117 LLTA--------------------------------------------QSTVFVKPKLLCVPQHD------NQAVATELL 146 (393) T ss_pred HHhh--------------------------------------------hhhcceeeeeeeecccc------chHHHHHHH Confidence 0000 00000124667777753 346789999 Q ss_pred HHHHhhCCEEEEEecCccccccccccCCHHHHHHHhhccccccccccccccceEEEEcCceeEecccCCcceeecHHHHH Q lcl|NC_014792. 395 SIADERQDCLAFISPPKGLLVNVPLTRAVDNLIDWRTGGGSFDTDNMNISTTYAAIDGNYKYQYDKYNDVNRWVPLAADM 474 (659) Q Consensus 395 ~~~~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~ 474 (659) ++|+++++++++.|+|. .+.+++.+|++. +.|.|+++||||++++|+.+++.+++|||+++ T Consensus 147 ~~~~~~~~~~~v~d~~~---------~t~~~ai~~~~~----------~~s~~~~~~~P~~~~~d~~~~~~~~~p~s~~~ 207 (393) T protein:vir:10 147 SVAKKLNAFAFISDNGA---------TTKEQAYTYRQN----------FSQREGMMIFGDWKSYNTDKKAYDTDYAVARA 207 (393) T ss_pred HHhhccCcEEEEEcCCC---------CCHHHHHHHhhh----------cCCceEEEEecccccccccCCceeEeehhHHH Confidence 99999998888877663 467889999974 45789999999999999999999999999999 Q ss_pred HHHHHHhhhcCCceECcCCcchhheecccc---ceeecChhHHHhhhhCCceEEEEEeCCCeEEEEcccccCCCccccce Q lcl|NC_014792. 475 AGLCARTDDVSQPWMSPPGYNRGQILNVLK---LAIEPRQTQRDRMYQEAINPVVGFAGGDGFVLYGDKTATKVPSPMDH 551 (659) Q Consensus 475 Ag~~a~~d~~~g~~~span~~~~~i~g~~~---~~~~~~~~e~~~Ln~~gin~i~~~~~~~G~~~wG~rT~~~~~~~~~~ 551 (659) ||++||+|.++|+||||||+.+.++.|+.. ....+++.|++.||++|||+|.+ + +|+++||+||+++++ +|+| T Consensus 208 Ag~~a~~d~~~G~~~spaN~~l~gi~~~~~~~~~~~~~~~~~~~~Ln~~gI~t~~~--~-~G~~~wG~rT~s~d~-~~~~ 283 (393) T protein:vir:10 208 CALQAYIDKTVGWHKNISNVELDGVTGITKAVEFDINESSTEANYLNEKGITICLN--H-NGFRYWGSRTLATDT-RWAF 283 (393) T ss_pred HHHHHHhhcCCCcEEccCCceeeceeecceecccccCCCcchhHhHhhcCceEEEc--C-CCEEEEcccccCCCc-ccce Confidence 999999999999999999998777666532 23445688999999999999843 4 799999999998764 8999 Q ss_pred eehhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhcc--ceeeeEEEEccCCCCHHHhhCCEEEEEEE Q lcl|NC_014792. 552 INVRRLTNMLKKNIGDASKYKLFELNDNFTRASFRMETSQYLDGIRALG--GIYEGRVVCDTTNNTPSVIDRNEFVASIY 629 (659) Q Consensus 552 i~vrR~~~~i~~~i~~~~~~~v~epn~~~l~~~i~~~i~~~l~~l~~~g--al~g~~v~~d~~~nt~~~i~~G~~~~~i~ 629 (659) |++|||+++|+++|++.++|+|||||++.+|++|+++++.||++||+.| +|.||+|+||++ ||++||++|+|+++|+ T Consensus 284 i~vrR~~~~i~~~i~~~~~~~v~e~~~~~~~~~i~~~i~~~L~~l~~~g~~al~g~~v~~~~~-nt~~~i~~G~~~~~i~ 362 (393) T protein:vir:10 284 QQSVRTAQIIKETIGAGLAWAVDMPLTPLRVKTMLEAINNKLRSWASGDDPRILGARVWVAEE-ITADIIKSGKFVIKYD 362 (393) T ss_pred eehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhccccccccceEEecCC-CCHHHhhCCEEEEEEE Confidence 9999999999999999999999999999999999999999999999966 899999999875 8899999999999999 Q ss_pred EEecCCceEEEEEEEEeecCee--EEEecCC Q lcl|NC_014792. 630 YKPARSINYIVLNFVATSTGAD--FDELIGV 658 (659) Q Consensus 630 ~~p~~p~e~i~~~~~~~~~~~~--~~e~~~~ 658 (659) ++|++|+|||+|++..+..+.. |++|+-. T Consensus 363 ~~p~~p~e~I~~~~~~~~~~~~~l~~~v~a~ 393 (393) T protein:vir:10 363 YHWIPSLESLGLEQRVNDEYVVDLVNTLKAL 393 (393) T ss_pred EEecCCcceEEEEEEEchHHHHHHHHHHhcC Confidence 9999999999999999887633 4444333 No 31 >protein:vir:96740 Length: 388 # NCBI annotation: phage tail sheath protein FI-like # Family: family:all:115 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039839;genbank:gi:126010913;genbank:GeneID:5076250 Probab=100.00 E-value=1e-89 Score=508.52 Aligned_cols=375 Identities=16% Similarity=0.158 Sum_probs=309.2 Q ss_pred CceecCceEEEEecCCCccccc-CCcceEEEeecccC----CCCccEEeCCHHHHHHHcCCcCCCchhHHHHHHHHHcCC Q lcl|NC_014792. 1 MALLSPGIELKETTVQSTVVRN-ATGRAALVGKFQWG----PAFQVTQITNEVELVDLFGGPNNITADYFMSGMNFLQYG 75 (659) Q Consensus 1 ~~~~~PGVyveE~~~~~~~~~~-~ts~~afvG~~~~G----p~~~p~~i~s~~~~~~~fG~~~~~~~~~~~~~~~f~ngG 75 (659) |+..+|||||+|++.++++++. .|++.+|||.++.+ |.++|++|.++.++...||.......+..++..+|.++| T Consensus 4 ~~~~~hGv~v~ev~~g~~~i~~~~tavi~~Vgta~~ad~~~p~~~~~~i~~~~d~~~~~~~~~~~gtl~~al~~~~~~~~ 83 (388) T protein:vir:96 4 IDQFEHNGISIETHEPPPPMGPPGDNVVAWVVTAPDKHADVAFSVPFRVANTADAQYLDSTGNELGTGWHAASETLKKTS 83 (388) T ss_pred CCCCCCceEEEEcCCCcccccccCcceeEEEEecCCCccccccccceeeecchhhhhhhccccccccchhhhHhhhccCC Confidence 6666789999999999987755 69999999999764 899999999999999999988888888889999999999 Q ss_pred CeEEEEeccCCcccccccccccccccccccccccccccceeeeeeccccccccceeeeeeccCcceeeeecccccccccc Q lcl|NC_014792. 76 NDLRTVRVVNRDHAKNASPVAGNIESTIATAGSNYAVGDVIQVKHNQTVVETSGRITKVDVDGKILAVFIPSDKIIAFAK 155 (659) Q Consensus 76 ~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~ 155 (659) ..||++|+...+... . . T Consensus 84 ~~~~vv~v~~g~~~~----------------a-----------------------------------t------------ 100 (388) T protein:vir:96 84 VPQYFIVVPEGADDA----------------A-----------------------------------T------------ 100 (388) T ss_pred ceEEEEEeccccccc----------------c-----------------------------------c------------ Confidence 999999974321000 0 0 Q ss_pred ccceeeeeccceeeEEEeecCCccccccccceeccccceeeecccccccccccceeecccccccceeeeccccccceeEE Q lcl|NC_014792. 156 SVNQYPDLGPAWTAEILTTSSGVSGTITLGKIVTDSGILLTEAENSEEAITSLEFQASLQKYAMPGVVALYPGEIGSTLE 235 (659) Q Consensus 156 ~~~~~~~~~~~~~~~v~~~~~g~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~g~~i~ 235 (659) .. +. .| T Consensus 101 ----------------------------~a-----------~i----------------------------ig------- 106 (388) T protein:vir:96 101 ----------------------------MA-----------NI----------------------------IG------- 106 (388) T ss_pred ----------------------------cc-----------ee----------------------------ee------- Confidence 00 00 00 Q ss_pred EEEeecccccccceeeeeeeccccccccceeeeeeeccccccceeeeeccCCceeeeeeeeccccccccccchhhhhhhh Q lcl|NC_014792. 236 VEIVSKAAYDVGASKMLDIYPNGGSRASVARAVFNYGPQTDDQYAIIVRRDGAIVENVVLSTKEGDKDVYGNNIYLDDYF 315 (659) Q Consensus 236 V~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~et~~~~~~~~~~~~~~~~~~~~~~~ 315 (659) T Consensus 107 -------------------------------------------------------------------------------- 106 (388) T protein:vir:96 107 -------------------------------------------------------------------------------- 106 (388) T ss_pred -------------------------------------------------------------------------------- Confidence Q ss_pred hcccccceEEeecccCCccceeEEeecccccccccchhhhhhhHhhhhhcccccceEEEeccccccchhhhHHHHHHHHH Q lcl|NC_014792. 316 AKGTSNYIYATSLNWPKGFAGIINLMGGISANDQVTAGDLMQGWDLFADREALHINLLIAGAVAGEGDATASTVQKHVVS 395 (659) Q Consensus 316 ~~~~s~~v~~~~~~~~~~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~v~~~l~~ 395 (659) +.+..+ ...+++.++...+ ..++||++|++. +..+|+++|++ T Consensus 107 ---------------------------~~~~~t-----g~~~gl~al~~~~-~~p~il~aPg~s-----~~~~v~~al~~ 148 (388) T protein:vir:96 107 ---------------------------GIDPTT-----GRRTGIAALTECT-ERPTLIGAPGFS-----QNKAVIDALAS 148 (388) T ss_pred ---------------------------eccccc-----chhhHHHHhhhcc-cceeEEEeeccc-----cchHHHHHHHH Confidence 000000 0001111111111 236889999864 45689999999 Q ss_pred HHHhhCCEEEEEecCccccccccccCCHHHHHHHhhccccccccccccccceEEEEcCceeEecccCCcceeecHHHHHH Q lcl|NC_014792. 396 IADERQDCLAFISPPKGLLVNVPLTRAVDNLIDWRTGGGSFDTDNMNISTTYAAIDGNYKYQYDKYNDVNRWVPLAADMA 475 (659) Q Consensus 396 ~~~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~A 475 (659) +|++++ +|+++|+|.+ +.++..+|+...+ ..+++|+|+++||||++++|+.++..+++|||+++| T Consensus 149 ~~~~~~-~~~i~D~p~~---------~~~~~~~~~~~~~-----~~~~~s~~~~~~~P~~~~~d~~~~~~~~~p~s~~~A 213 (388) T protein:vir:96 149 MAKRLK-CRAVIDGPSG---------STQDAIDLSGLLG-----GEGTGHDRVYMVDPMPAIYSRKAQGNIYVPPSTIAM 213 (388) T ss_pred HHhhcC-cEEEEeccCC---------chhHHHHHHhhhh-----ccCcCcceEEEEeCceeeecccCCceeeechHHHHH Confidence 999886 8999999954 3344555554322 346789999999999999999999999999999999 Q ss_pred HHHHHhhhcCCceECcCCcchhheeccc---cceeecChhHHHhhhhCCceEEEEEeCCCeEEEEcccccCCCcccccee Q lcl|NC_014792. 476 GLCARTDDVSQPWMSPPGYNRGQILNVL---KLAIEPRQTQRDRMYQEAINPVVGFAGGDGFVLYGDKTATKVPSPMDHI 552 (659) Q Consensus 476 g~~a~~d~~~g~~~span~~~~~i~g~~---~~~~~~~~~e~~~Ln~~gin~i~~~~~~~G~~~wG~rT~~~~~~~~~~i 552 (659) |++||+| +||||||+.+ ++.|+. +....+++.|++.||++|||+|++|++ +|+++||+||++ |+|| T Consensus 214 G~~a~~D----~~~spaN~~i-~i~g~~~~~~~~~~~~~~~~~~Ln~~gI~~i~~~~~-~G~~~wG~rT~~-----~~~i 282 (388) T protein:vir:96 214 GAVAAVK----PWESPGNQGV-LIQDVARVIDYNILDKSTEGDLLNRNGVSYFARTSM-GGFSLIGNRTVT-----GKFI 282 (388) T ss_pred HHHHhhc----CcccccCeeE-EeeeecccccccccCChhhHHhhhhcCceEEEEecC-CcEEEEcccccC-----Ccce Confidence 9999999 5999999987 466653 445666788999999999999999986 799999999974 9999 Q ss_pred ehhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhccceeeeEEEEccCCCCHHHhhCCEEEEEEEEEe Q lcl|NC_014792. 553 NVRRLTNMLKKNIGDASKYKLFELNDNFTRASFRMETSQYLDGIRALGGIYEGRVVCDTTNNTPSVIDRNEFVASIYYKP 632 (659) Q Consensus 553 ~vrR~~~~i~~~i~~~~~~~v~epn~~~l~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~~i~~~p 632 (659) ++|||++||+++|+++++|+|||||++.||++|+++++.||++||++|+|+||+++||+++||+++|++|+|+++|+++| T Consensus 283 ~vrR~~~~i~~si~~~~~~~v~epn~~~~~~~i~~~i~~fL~~l~~~Gal~g~~~~~d~~~nt~~~i~~G~~~~~i~~~p 362 (388) T protein:vir:96 283 SFVGLEDAIARKLEAASQRAMSKQLTKSFMEQEIKKINLFMQDLVAAEIIPGGEVYLHPTLNTVERYKNGSWYIVIDYGR 362 (388) T ss_pred eehhhHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHhhCCEEEEEEEEEe Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cCCceEEEEEEEEeecCee--EEEec Q lcl|NC_014792. 633 ARSINYIVLNFVATSTGAD--FDELI 656 (659) Q Consensus 633 ~~p~e~i~~~~~~~~~~~~--~~e~~ 656 (659) ++|+|||+|+++.+....+ |+||+ T Consensus 363 ~~pae~I~~~~~~~~~~~~~~~~~~~ 388 (388) T protein:vir:96 363 YSPNEHMIFHLNAVDRIVEEFIEEVL 388 (388) T ss_pred cCCcceEEEEEEEchHHHHHHHHHhC Confidence 9999999999999999998 99999 No 32 >protein:vir:10336 Length: 386 # NCBI annotation: ORF39 # Family: family:all:115 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758931;genbank:gi:27311206;genbank:GeneID:956116 Probab=100.00 E-value=2.9e-87 Score=495.02 Aligned_cols=376 Identities=16% Similarity=0.189 Sum_probs=293.9 Q ss_pred Cc-eecCceEEEEecCCCccccc-CCcceEEEeecccC-----CCCccEEeCCHHHHHHHcCCcCCCchhHHHHHHHHHc Q lcl|NC_014792. 1 MA-LLSPGIELKETTVQSTVVRN-ATGRAALVGKFQWG-----PAFQVTQITNEVELVDLFGGPNNITADYFMSGMNFLQ 73 (659) Q Consensus 1 ~~-~~~PGVyveE~~~~~~~~~~-~ts~~afvG~~~~G-----p~~~p~~i~s~~~~~~~fG~~~~~~~~~~~~~~~f~n 73 (659) |. |.+|||||+|++.+++++.. .|++.+|||.++.+ |+++|++++|+.++...||+ ...+..++..+|.| T Consensus 1 M~~~~~~Gv~v~ev~~~~~~i~~v~tav~~~vg~a~~a~~~~~~~~~pv~i~s~~~~~~~~g~---~~tl~~a~~~~~~~ 77 (386) T protein:vir:10 1 MAEQYLHGAEVVEIDNGARPIRTAQSGVIGLVGTAPDADATAFPLNTPVLIAGSRREAAKLGA---GGTLPQAIDGIFDQ 77 (386) T ss_pred CccccCCCeEEEEcCCCcccccccCcceeEEEEecCCCCCcccccccceEecchHHHHhhcCC---CcchhHHHHHHhcc Confidence 66 88999999999999987665 69999999998765 89999999999999999994 45677889999999 Q ss_pred CCCeEEEEeccCCcccccccccccccccccccccccccccceeeeeeccccccccceeeeeeccCcceeeeecccccccc Q lcl|NC_014792. 74 YGNDLRTVRVVNRDHAKNASPVAGNIESTIATAGSNYAVGDVIQVKHNQTVVETSGRITKVDVDGKILAVFIPSDKIIAF 153 (659) Q Consensus 74 gG~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~ 153 (659) ||+.||++++......... . ... T Consensus 78 gg~~~~vv~~~~~~~~~~t----------------------------------------~--------~~~--------- 100 (386) T protein:vir:10 78 TGAVVVVIRVDEGVDSAAT----------------------------------------Q--------SNV--------- 100 (386) T ss_pred CceeEEEeecccccccccc----------------------------------------c--------hhh--------- Confidence 9999999987432100000 0 000 Q ss_pred ccccceeeeeccceeeEEEeecCCccccccccceeccccceeeecccccccccccceeecccccccceeeecccccccee Q lcl|NC_014792. 154 AKSVNQYPDLGPAWTAEILTTSSGVSGTITLGKIVTDSGILLTEAENSEEAITSLEFQASLQKYAMPGVVALYPGEIGST 233 (659) Q Consensus 154 ~~~~~~~~~~~~~~~~~v~~~~~g~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~g~~ 233 (659) .+ .. .... . ...+.. T Consensus 101 -------------------------------ig------~~--~~~t--------------~---------~~tgl~--- 115 (386) T protein:vir:10 101 -------------------------------IG------KV--DADT--------------E---------QYTGIL--- 115 (386) T ss_pred -------------------------------hc------cc--cccc--------------c---------hhhhhH--- Confidence 00 00 0000 0 000000 Q ss_pred EEEEEeecccccccceeeeeeeccccccccceeeeeeeccccccceeeeeccCCceeeeeeeeccccccccccchhhhhh Q lcl|NC_014792. 234 LEVEIVSKAAYDVGASKMLDIYPNGGSRASVARAVFNYGPQTDDQYAIIVRRDGAIVENVVLSTKEGDKDVYGNNIYLDD 313 (659) Q Consensus 234 i~V~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~et~~~~~~~~~~~~~~~~~~~~~ 313 (659) . T Consensus 116 ---------------------------------~---------------------------------------------- 116 (386) T protein:vir:10 116 ---------------------------------A---------------------------------------------- 116 (386) T ss_pred ---------------------------------H---------------------------------------------- Confidence 0 Q ss_pred hhhcccccceEEeecccCCccceeEEeecccccccccchhhhhhhHhhhhhcccccceEEEeccccccchhhhHHHHHHH Q lcl|NC_014792. 314 YFAKGTSNYIYATSLNWPKGFAGIINLMGGISANDQVTAGDLMQGWDLFADREALHINLLIAGAVAGEGDATASTVQKHV 393 (659) Q Consensus 314 ~~~~~~s~~v~~~~~~~~~~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~v~~~l 393 (659) +....... + ..++++.+|+. ++..+|..+| T Consensus 117 -l~~~~~~~--------------------~------------------------~~p~i~~ap~~-----~~~~~v~~~l 146 (386) T protein:vir:10 117 -LLSAENTV--------------------K------------------------VQPRILIAPGF-----SNQKAVADQL 146 (386) T ss_pred -hhhhcccc--------------------c------------------------ccccccccccc-----cchhHHHHHH Confidence 00000000 0 00112222222 2345678888 Q ss_pred HHHHHhhCCEEEEEecCccccccccccCCHHHHHHHhhccccccccccccccceEEEEcCceeEecccCCcceeecHHHH Q lcl|NC_014792. 394 VSIADERQDCLAFISPPKGLLVNVPLTRAVDNLIDWRTGGGSFDTDNMNISTTYAAIDGNYKYQYDKYNDVNRWVPLAAD 473 (659) Q Consensus 394 ~~~~~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~ 473 (659) ..+|++++ .+.+.|++. .+.+++.+|++. +.|+|+++||||++++|+.++..+++|||++ T Consensus 147 ~~~~~~~~-~~~~~~~~~---------~~~~~a~~~~~~----------~~s~~~~~~~p~~~v~~~~~~~~~~~p~s~~ 206 (386) T protein:vir:10 147 VSVADTAA-WLCHSGWSN---------TTDAAAITYREL----------FGSRRCEVVDPWYKVWDVETSAHIIQPPSAR 206 (386) T ss_pred HHhhcceE-EEEEeCCCC---------CchHHHHHhhhc----------ccccceEEecCceeeeccccccceeechHHH Confidence 88888776 556666552 455777888864 4589999999999999999999999999999 Q ss_pred HHHHHHHhhhcCCceECcCCcchhheeccc---cceeecChhHHHhhhhCCceEEEEEeCCCeEEEEcccccCCCccccc Q lcl|NC_014792. 474 MAGLCARTDDVSQPWMSPPGYNRGQILNVL---KLAIEPRQTQRDRMYQEAINPVVGFAGGDGFVLYGDKTATKVPSPMD 550 (659) Q Consensus 474 ~Ag~~a~~d~~~g~~~span~~~~~i~g~~---~~~~~~~~~e~~~Ln~~gin~i~~~~~~~G~~~wG~rT~~~~~~~~~ 550 (659) +||++||+|.++|+||||+|+++.++.|+. ..+...++.|++.||++||+++ |++ +|+++||+||++.+ +.|+ T Consensus 207 ~ag~~a~~D~~~G~~~spaN~~l~gv~~~~~~~~~~~~~~~~~~~~l~~~gi~~~--~~~-~G~~~wG~rT~~~d-~~~~ 282 (386) T protein:vir:10 207 HAGVMAKVHNTLGFWWSNSNQEILGIDGLCRPVDFKLDDPTCRANLLNAKEVTTT--IQQ-NGFRVWGDRTCSAD-SKWA 282 (386) T ss_pred HHHHHHHhhhcCCcEEccCCceeecccccceecccccccCcchhhhhhhcCcEEE--EcC-CCEEEEcccccCCC-cccc Confidence 999999999999999999999887777653 2334556889999999999876 444 89999999999866 5899 Q ss_pred eeehhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhccceeeeEEEEccCCCCHHHhhCCEEEEEEEE Q lcl|NC_014792. 551 HINVRRLTNMLKKNIGDASKYKLFELNDNFTRASFRMETSQYLDGIRALGGIYEGRVVCDTTNNTPSVIDRNEFVASIYY 630 (659) Q Consensus 551 ~i~vrR~~~~i~~~i~~~~~~~v~epn~~~l~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~~i~~ 630 (659) ||++|||+++|+++|+++++|+|||||++.+|++|++++++||++||++|+|+||+|+||+++||++++++|+|+++|++ T Consensus 283 ~i~vrR~~~~i~~~~~~~~~~~v~e~~~~~~~~~i~~~i~~~L~~l~~~g~l~g~~v~~d~~~nt~~~~~~G~~~~~i~~ 362 (386) T protein:vir:10 283 FKNVVITNDMIADSLVRNHLWAVDRNITKTYVEDVTEGVNNYLRHLKNIGAIAGGECWVDPELNSPDQIQQGKVYFDYDF 362 (386) T ss_pred eeehhhHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEcccCCCHHHhhCCeEEEEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EecCCceEEEEEEEEeecCeeEEEec Q lcl|NC_014792. 631 KPARSINYIVLNFVATSTGADFDELI 656 (659) Q Consensus 631 ~p~~p~e~i~~~~~~~~~~~~~~e~~ 656 (659) +|++|+|||+|+++++... |++++ T Consensus 363 ~p~~p~e~i~~~~~~~~~~--~~~~~ 386 (386) T protein:vir:10 363 SAYAPAEHITFRSHMVNGY--LTEVV 386 (386) T ss_pred EecCCceeEEEEEEEehhH--HHhhC Confidence 9999999999999987766 88888 No 33 >protein:vir:5833 Length: 742 # NCBI annotation: similar to tail sheath protein # Family: family:all:661 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835619;genbank:gi:30044022 Probab=100.00 E-value=1.1e-76 Score=436.91 Aligned_cols=590 Identities=18% Similarity=0.191 Sum_probs=292.4 Q ss_pred cCceEEEEecCCCcccccCCcceEEEeecccC-CCCccEEeCCHHHHHHHcCCcCCCchhHHHHHHHHHcCCCeEEEEec Q lcl|NC_014792. 5 SPGIELKETTVQSTVVRNATGRAALVGKFQWG-PAFQVTQITNEVELVDLFGGPNNITADYFMSGMNFLQYGNDLRTVRV 83 (659) Q Consensus 5 ~PGVyveE~~~~~~~~~~~ts~~afvG~~~~G-p~~~p~~i~s~~~~~~~fG~~~~~~~~~~~~~~~f~ngG~~~~vvRv 83 (659) .=-|-|+|+++...+.-+-.--.++||.+.-- |-.-|+.++ .+||.|. |.-.-. ....-+-||.+.-|+|- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~-~~~~~~------~~~~~~~~~~~~~~~~~ 72 (742) T protein:vir:58 1 MYRVNVKEVDLSITPEVGTPVQTALVGAFDLPIPSELPVSVT-PDEFRRV-GSTELS------LIADSLVGGQEVTVIRP 72 (742) T ss_pred CeeeeeeeeeeeeccccCCchhhheeeeecCCCCccccceec-hhHHhhc-ccceee------ehhhhhcCcceEEEEcc Confidence 23577899998876654433345688877642 446677775 5788654 422111 11112346666666664 Q ss_pred cCCcccccccc---------------------------ccccccccc---------------cccccccc-----cccee Q lcl|NC_014792. 84 VNRDHAKNASP---------------------------VAGNIESTI---------------ATAGSNYA-----VGDVI 116 (659) Q Consensus 84 ~~~~~~~~a~~---------------------------~~~~~~~~~---------------~~~~~~~~-----~~~~~ 116 (659) -......++.- ..++..... ...++... +-+.. T Consensus 73 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 152 (742) T protein:vir:58 73 RGETQSLNAAFVVVGGYNVTLGAFNVFYLMFLGYDPQKGYTDVSYVDVQLAGTPTDTILFSYSLDGSSTTHSLTINLNAP 152 (742) T ss_pred CCcccccceeEEEEecceeeehhhheeeeeeeecccCCCcccceeEEEEEccCCCeeEEEeeecCCCcceeEEEEEeece Confidence 33221111100 000000000 00000000 01111 Q ss_pred eeeecccc-cc------ccceeeeee--------ccCcceeee---eccccccccccccceeee--ec--------ccee Q lcl|NC_014792. 117 QVKHNQTV-VE------TSGRITKVD--------VDGKILAVF---IPSDKIIAFAKSVNQYPD--LG--------PAWT 168 (659) Q Consensus 117 ~v~~~~~~-~~------~~~~~~~~~--------~~g~~~~~~---~~~~~~~~~~~~~~~~~~--~~--------~~~~ 168 (659) ++....+- +. ..+.+.... .+-...... +..........+.+.+.. +. ..+. T Consensus 153 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 232 (742) T protein:vir:58 153 SVTLPSNIVPLFFYYEPYTGSITLQSSVNYSGLTLNYTVSKATTPWVYFAEYGTPTSSLTLYKGFYLEGIDLNSFNKQFV 232 (742) T ss_pred eEeeccccceeeeEeccccceEEEeeecccCCCcccceeeeeecCcccccccCCCccceeeeecccccccccCcccceee Confidence 11110000 00 001111000 000000000 110000011111111100 00 0000 Q ss_pred eEEEeecCC----ccccccccceeccccceeeeccccccccc----------------------ccceeeccccccccee Q lcl|NC_014792. 169 AEILTTSSG----VSGTITLGKIVTDSGILLTEAENSEEAIT----------------------SLEFQASLQKYAMPGV 222 (659) Q Consensus 169 ~~v~~~~~g----~~~~~~~~~~v~~~~~~~~~~~~~~~~~~----------------------~~~~~~~~~~~~~~~~ 222 (659) ..+.....+ ...-....+ ...+.+....+.+.. ..++.-.......+ T Consensus 233 ~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g-- 305 (742) T protein:vir:58 233 VSIENITVNREKGQVLYPSFDV-----VVHFRDIRGVSANTEYIRFRQVNLNPESPNYIERVIGNMTFEFDGERIVTG-- 305 (742) T ss_pred EEEeeeeecccCCceeccceeE-----EEEEeeccCCCCCccceeeeeeecCCCCcceeeecccceeeeeccceeeec-- Confidence 000000000 000000000 000111000000000 00000000000000 Q ss_pred eecccccccee---EEEEEeec-ccccccceeeeeee-------------------------ccccccccceeeeeeecc Q lcl|NC_014792. 223 VALYPGEIGST---LEVEIVSK-AAYDVGASKMLDIY-------------------------PNGGSRASVARAVFNYGP 273 (659) Q Consensus 223 ~a~~~g~~g~~---i~V~v~~~-~~~~~~~~~~~~~~-------------------------~~~~~~~~~~~~~~~~~~ 273 (659) |..-|. +++.+.-. .....+.+.+.++. .....++........... T Consensus 306 -----~~~~n~~~~~~~~~~~~~~~~~~~~s~~~~~~~~~~~~v~d~~~~~~~~~~v~~~~t~~~~~pp~~~~~~e~v~~ 380 (742) T protein:vir:58 306 -----GEYPNQVPFLRVVVSQDIKQNVAGVEKWVPVGFEGIYSVGDFTVIVNELTNVSIPVTDSAIIPPMRFTRIEQITL 380 (742) T ss_pred -----ccccccccceeeEeccccCcCccceeEEEeccccccccccceeeeccccccceeeccccccCCcccccccceeec Confidence 000000 01110000 00000011111100 000000000000000000 Q ss_pred ccccceeeeeccC-Ccee----eeeeeeccccccccccchhhhhhhhhcccccceEEeecccCCccceeEEeeccccccc Q lcl|NC_014792. 274 QTDDQYAIIVRRD-GAIV----ENVVLSTKEGDKDVYGNNIYLDDYFAKGTSNYIYATSLNWPKGFAGIINLMGGISAND 348 (659) Q Consensus 274 ~~~~~~~~~v~~~-g~~~----et~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~v~~~~~~~~~~~~~~~~~~gg~~~~~ 348 (659) .....+.+..... +..+ ..+.++.-.++....+...... .. .....+.................+.||.++.. T Consensus 381 ngG~~f~v~s~~~~g~~i~~~~as~~~s~ln~~~~V~Gt~aa~~-~~-d~~t~~~v~s~~~alp~~a~sv~laGG~dg~v 458 (742) T protein:vir:58 381 SGGASFSVISNQPYGFNIQDSRHSYWLSPFKDDELIIGTELVLP-AL-DVSTEFGVSSWEEALPEFSFLMPFQGGSDGYI 458 (742) T ss_pred ccCcceEEEEecccCcceeccCcceEEeccCCceEEEeehhhcc-cc-ccchheeccccccccceeeEEEeecCCccccc Confidence 1111111111000 0000 0011111111111111100000 00 00000000000011111122344455544321 Q ss_pred c--------c---ch----hhhhhhHhhhhhcccccceEEEeccccccchhhhHHHHHHHHHHHHhhCCEE-EEEecCcc Q lcl|NC_014792. 349 Q--------V---TA----GDLMQGWDLFADREALHINLLIAGAVAGEGDATASTVQKHVVSIADERQDCL-AFISPPKG 412 (659) Q Consensus 349 ~--------~---~~----~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~v~~~l~~~~~~~~~~~-ai~d~p~~ 412 (659) . + .. ....+++. ++.+..+++|+++||+.. ..++.++.++|+.+++|+ +++|+|.+ T Consensus 459 ~v~~~~~D~iG~~~~~d~~~adrTGL~--ALlev~eVtILiAPG~t~------~~v~aav~A~la~a~~Rl~vL~D~P~~ 530 (742) T protein:vir:58 459 RVDENEPDTIGRVKITPALLANYERLL--PLLTEDQFDLVLTPYLTF------ADHAGTVNAFINRAENRFLYLFDIAGD 530 (742) T ss_pred cccCCCcccccccccccccccchhHHH--HhhhcCCCcEEEEcCCCc------hHHHHHHHHHHHhhcCCeEEEEecCCC Confidence 0 0 00 01122333 444556789999999753 345677788888766665 55677643 Q ss_pred ccccccccCCHHHHHHHhhccccccccccccccceEEEEcCceeEecccCCcceeecHHHHHHHHHHHhhhcCCceECcC Q lcl|NC_014792. 413 LLVNVPLTRAVDNLIDWRTGGGSFDTDNMNISTTYAAIDGNYKYQYDKYNDVNRWVPLAADMAGLCARTDDVSQPWMSPP 492 (659) Q Consensus 413 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~Ag~~a~~d~~~g~~~spa 492 (659) .+..+++.+|++ +++|+|+++||||+++.+ ++..+++|||+++||++||+|.++|+|+||+ T Consensus 531 -------~tt~~~A~a~r~----------~~nSsraaly~PwVkv~d--~~~~r~vPpSgaIAGL~ARtD~erGvw~SPA 591 (742) T protein:vir:58 531 -------DDTENLAISLAG----------YINSSFATTFFPWVRRLT--NKGMRTVPASLAAYRSIRTTDPETGLAPVGA 591 (742) T ss_pred -------CchHHHHHHHHh----------ccCCceEEEEeceeeecc--CCcceeechHHHHHHHHHHhccCCceEecCC Confidence 234466777775 346899999999999876 4678999999999999999999999999999 Q ss_pred CcchhheeccccceeecChhHHHhhhhCCceEEEEEeCCCeEEEEcccccCCCccccceeehhhHHHHHHHHHHHHHHHH Q lcl|NC_014792. 493 GYNRGQILNVLKLAIEPRQTQRDRMYQEAINPVVGFAGGDGFVLYGDKTATKVPSPMDHINVRRLTNMLKKNIGDASKYK 572 (659) Q Consensus 493 n~~~~~i~g~~~~~~~~~~~e~~~Ln~~gin~i~~~~~~~G~~~wG~rT~~~~~~~~~~i~vrR~~~~i~~~i~~~~~~~ 572 (659) |+.+ +.+ ...++.|++.||++|||+|++| + +|+++||+||+++.|++|+||||||||+||+++|+++++|+ T Consensus 592 Nrgi--i~~-----~~~s~se~d~LN~~GINtIrsf-G-~G~rlWGnRTlassDs~wryInVRRlfd~Ie~SI~~a~q~~ 662 (742) T protein:vir:58 592 RRGV--VTG-----EPVRQVDWEDLYNNRINPIVRV-G-NDVLLFGQKTMLNVNSALNRINVRRLLIVMRNRISQILSSY 662 (742) T ss_pred ccee--eec-----cccchhhHHHHhhCCceEEEEC-C-CcEEEEcceecCCCCcccceEeehhhHHHHHHHHHHHHHHh Confidence 9853 222 3567889999999999999987 4 69999999999777789999999999999999999999999 Q ss_pred hcCCCCHHHHHHHHHHHHHHHHHHHhccceeeeEEEEccCCCCHHHhhCCEEEEEEEEEecCCceEEEEEEEEeecCeeE Q lcl|NC_014792. 573 LFELNDNFTRASFRMETSQYLDGIRALGGIYEGRVVCDTTNNTPSVIDRNEFVASIYYKPARSINYIVLNFVATSTGADF 652 (659) Q Consensus 573 v~epn~~~l~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~ 652 (659) ||||||+.||++|+++|++||++||++|+|+||+|+||+ +||++||++|+|+++|+++|++|||||+|+|.+++.|++| T Consensus 663 VfEPNd~~L~~sIk~sInafL~~L~aqGALlGfrV~lDe-tNTpeDI~~Gklvv~I~vAP~~PAEfI~lrf~it~tga~F 741 (742) T protein:vir:58 663 LFENNTSENRLRAEALVRQYLESLRLRGAVTDYEVAIDS-VTTPTDIDNNTLRARVTVQPARSIEYIDITFVITPTGVEI 741 (742) T ss_pred ccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEcC-CCCHHHhhCCEEEEEEEEEccCCcceEEEEEEEEeccccc Confidence 999999999999999999999999999999999999995 6899999999999999999999999999999999999999 Q ss_pred E Q lcl|NC_014792. 653 D 653 (659) Q Consensus 653 ~ 653 (659) + T Consensus 742 s 742 (742) T protein:vir:58 742 T 742 (742) T ss_pred C Confidence 9 No 34 >protein:vir:63742 Length: 562 # NCBI annotation: Tsh # Family: family:all:2449 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547629;genbank:GeneID:3783475 Probab=100.00 E-value=5.7e-64 Score=367.31 Aligned_cols=530 Identities=14% Similarity=0.072 Sum_probs=314.3 Q ss_pred Cc-eecCceEEEEecCCCccccc-CCcceEEEeecccCCCCccEEeCCHHHHHHHcCCcCCCchhHHHHHHHHHcCCCeE Q lcl|NC_014792. 1 MA-LLSPGIELKETTVQSTVVRN-ATGRAALVGKFQWGPAFQVTQITNEVELVDLFGGPNNITADYFMSGMNFLQYGNDL 78 (659) Q Consensus 1 ~~-~~~PGVyveE~~~~~~~~~~-~ts~~afvG~~~~Gp~~~p~~i~s~~~~~~~fG~~~~~~~~~~~~~~~f~ngG~~~ 78 (659) |- |+.|||||||.+++++++.+ +|++++|||.+++||+++|++|+||.||+++||+-.--.+..++...+|.|||++| T Consensus 8 ~~~~~~Pgv~~~~~~s~~~~~~~~~~~~~~~iG~a~~G~~~~~~~~~~~~~~~~~fg~g~l~~~i~~a~~~~~~~g~~~~ 87 (562) T protein:vir:63 8 RKPVSRPHTEISVDTSGIGGSSSGSEKILCLVGSATGGKPNAVYKVRNYSQAKSVFRSGELLDAIERAWNPGEGTGAGDI 87 (562) T ss_pred CCcccCCceEEEEecCCCcccCCCCCceEEEEEEeCCCCCceeEEEccHHHHHHHhcCCchHHHHHHhccccccCCceEE Confidence 44 78999999999999986655 69999999999999999999999999999999985533344445555668999999 Q ss_pred EEEeccCCcccccccccccccccccccccccccccceeeeeeccccccccceeeeeeccCcceeeeeccccccccccccc Q lcl|NC_014792. 79 RTVRVVNRDHAKNASPVAGNIESTIATAGSNYAVGDVIQVKHNQTVVETSGRITKVDVDGKILAVFIPSDKIIAFAKSVN 158 (659) Q Consensus 79 ~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~ 158 (659) |+|||.+. .+++...+++. .++...+.|+|.+++....+......+....-..++..+++...+ .+. T Consensus 88 ~~~rv~~a---~~a~~~~~~~~---~~a~~~g~~~n~i~v~~~~~~~~~~~~~~v~~~~~~~~ev~~~~g-------~V~ 154 (562) T protein:vir:63 88 LAMRVEEA---KEATFEAEGVK---VSSTIYGADANDIQVALEDNTITGTKRLSIVFAKERVNQVYDNLG-------SIF 154 (562) T ss_pred EEEEcCCC---ccceeEeccee---EEEeecccCCCeEEEEEecCCCCCCcceEEEecCCCcchhhhhcc-------cee Confidence 99999442 23333333333 355567789999998775443333333222111111111111000 000 Q ss_pred eeeeeccceeeEEEeecCCccccccccceeccccceeeecccccccccccceeecc--------ccc-ccceeeeccccc Q lcl|NC_014792. 159 QYPDLGPAWTAEILTTSSGVSGTITLGKIVTDSGILLTEAENSEEAITSLEFQASL--------QKY-AMPGVVALYPGE 229 (659) Q Consensus 159 ~~~~~~~~~~~~v~~~~~g~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~-~~~~~~a~~~g~ 229 (659) ...+.+.......+....+... .........+... +......... ... ......+.+++. T Consensus 155 ~i~y~g~~~~~~~~v~~~~~~~----------~a~~l~~~~g~~~-v~~~~L~~g~~~~~~~l~~~in~~~~~~aky~~~ 223 (562) T protein:vir:63 155 SIKYKGTEASATFTVAVDPVTF----------KATKLTLKAGDKT-VKEYDLGSGAYAETNVLISDINNLPDFEAKFFPI 223 (562) T ss_pred eeeeecccccceEEEEecCcce----------eEEEEEeecCCcc-eeEEEecCCccchhHHHHHhhccccceEEEeecc Confidence 0000000000000000000000 0000000000000 0000000000 000 000011122222 Q ss_pred cceeEEEEEeecccccccceeeeeeeccccccccceeeeeeeccccccceeeeeccCCceeeeeeeeccccccccccchh Q lcl|NC_014792. 230 IGSTLEVEIVSKAAYDVGASKMLDIYPNGGSRASVARAVFNYGPQTDDQYAIIVRRDGAIVENVVLSTKEGDKDVYGNNI 309 (659) Q Consensus 230 ~g~~i~V~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~et~~~~~~~~~~~~~~~~~ 309 (659) .++.+++.+. ..+. ...+.. ...+. ... T Consensus 224 ~gn~i~~~~~-----------------------------------------------d~~~-~~~vkt---~~~~v-~t~ 251 (562) T protein:vir:63 224 GDKNLTTDNF-----------------------------------------------DAQI-DVDIKT---KEAYV-KAV 251 (562) T ss_pred CCceeeeecc-----------------------------------------------cccc-ccchhh---hhhhh-hhh Confidence 2222221100 0000 000000 00000 000 Q ss_pred hhhhhhhcccccceEEeecccCC-ccceeEEeecccccccccchhhhhhhHhhhhhcccccceEEEeccccccchhhhHH Q lcl|NC_014792. 310 YLDDYFAKGTSNYIYATSLNWPK-GFAGIINLMGGISANDQVTAGDLMQGWDLFADREALHINLLIAGAVAGEGDATAST 388 (659) Q Consensus 310 ~~~~~~~~~~s~~v~~~~~~~~~-~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~ 388 (659) .......+....++......... .......|.||.++... .++.++++.++. .+.+++++. .+..+ T Consensus 252 ~~d~~~~~~~~~~v~~~~~~~~~la~~~~~~LtGG~dGt~~---~~~~~al~ale~---~~~~~i~~~-------t~d~a 318 (562) T protein:vir:63 252 GGDIEKQTAYNGYVDFEFDRSKEIANFPLTKLTGGDNGTIP---ESWADKFSYFAN---EGGYYLVPL-------TSKQA 318 (562) T ss_pred hhhhhhcccccceeeeeeccccceecccceeeecCCCCCch---hhHHHHHHHHHh---CCcEEEEec-------CCCHH Confidence 00000011122233222211111 11124578898887432 345666666653 344555432 24557 Q ss_pred HHHHHHHHHHhhCC----EEEEEecCccccccccccCCHHHHHHHhhccccccccccccccceEEEEcCceeEecccCCc Q lcl|NC_014792. 389 VQKHVVSIADERQD----CLAFISPPKGLLVNVPLTRAVDNLIDWRTGGGSFDTDNMNISTTYAAIDGNYKYQYDKYNDV 464 (659) Q Consensus 389 v~~~l~~~~~~~~~----~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~ 464 (659) ++.++.+||+++++ ++++++.+.+ .+.+++..... .+++.+.++++|+....+. .+. T Consensus 319 v~~~l~a~vkr~~~~g~~~~aVlg~~~~--------~~~~~~~~~a~----------~~n~ervv~v~~~~~~~~~-~~~ 379 (562) T protein:vir:63 319 VHAEALQFVRDCSYNGNPMRVFVGGGIG--------ESMEQLFTRAI----------GLQNERAGLIGFSGTVKMD-DGR 379 (562) T ss_pred HHHHHHHHHHHHHhCCCcEEEEecCCCC--------CCHHHHHHHhh----------hcCCCcEEEEecCeeEECC-CCc Confidence 88889999987765 8999987753 35566655443 4568899999998776554 456 Q ss_pred ceeecH---HHHHHHHHHHhhhcCCceECcCCcchhheeccccceeecChhHHHhhhhCCceEEEEEeCCCeEEEEcc-c Q lcl|NC_014792. 465 NRWVPL---AADMAGLCARTDDVSQPWMSPPGYNRGQILNVLKLAIEPRQTQRDRMYQEAINPVVGFAGGDGFVLYGD-K 540 (659) Q Consensus 465 ~~~~p~---s~~~Ag~~a~~d~~~g~~~span~~~~~i~g~~~~~~~~~~~e~~~Ln~~gin~i~~~~~~~G~~~wG~-r 540 (659) .+.+|+ ++++||++|+.| +++||.|+.+. ..++...+++.|++.|+++|+++++...+ ++.++|.. + T Consensus 380 ~~~~~~~~~aa~vAGl~A~~~----~~~SlT~~~i~----~~~v~~~~t~~e~~~li~~Gv~~l~~~~~-~~v~~~~iv~ 450 (562) T protein:vir:63 380 SLKMPGYMFAAQVAGLTCGLE----IGEAITFKNIA----IETLDTIYEGSQLDQLNESGIITAEFVRN-RAVTNFRIVD 450 (562) T ss_pred eeeechhHHHHHHHHHhhcCc----hhcCccceeec----cccccccCCHHHHHHHHhCCeEEEEEecC-CcEEEEEeec Confidence 666777 789999999987 77899998753 34667789999999999999999987665 56666643 3 Q ss_pred cc----CCCccccceeehhhHHHHHHHHHHHHH-HHHhcCCCCHHHHHHHHHHHHHHHHHHHhccceeeeEEEEccCCCC Q lcl|NC_014792. 541 TA----TKVPSPMDHINVRRLTNMLKKNIGDAS-KYKLFELNDNFTRASFRMETSQYLDGIRALGGIYEGRVVCDTTNNT 615 (659) Q Consensus 541 T~----~~~~~~~~~i~vrR~~~~i~~~i~~~~-~~~v~epn~~~l~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~nt 615 (659) ++ ...+..|++|+++|++|+|++.|++.+ +||+++||+...|..|+..|..||.+||+.|+|.+|... +- T Consensus 451 ~itT~t~~~~~~~~ki~viRv~D~i~~dir~~~~~~yiGk~Nn~~~r~~v~~~i~~~L~~l~~~gaI~~~~~~-----dv 525 (562) T protein:vir:63 451 DVTTFNDKTDPVKSEIGVGEANDFLVSELKISLDNEYIGTKIIDTSASLVKNFVQSFLDRKKLAKEIQDYSPE-----EV 525 (562) T ss_pred cceecCCCCCchhhhhhhhHHHHHHHHHHHHHHHhcCCccccChHHHHHHHHHHHHHHHHHHhCCcccCCCcc-----ce Confidence 32 123457999999999999999998775 589999999999999999999999999999999999632 12 Q ss_pred HHHhhCCEEEEEEEEEecCCceEEEEEEEEeecCeeE Q lcl|NC_014792. 616 PSVIDRNEFVASIYYKPARSINYIVLNFVATSTGADF 652 (659) Q Consensus 616 ~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~ 652 (659) +.++..+++++++.+.|+.|+|+|++++.....-.+- T Consensus 526 ~v~~~~d~~~v~~~v~pv~~mekIy~ti~~~~~~~~~ 562 (562) T protein:vir:63 526 QVVIEGDVARISLTVFPIRSMKKIEVSLVYRQQILTA 562 (562) T ss_pred EEEecCCEEEEEEEEEEcccceEEEEEEEEeeeeecC Confidence 3345678899999999999999999999977666665 No 35 >protein:vir:79798 Length: 717 # NCBI annotation: tail sheath subunit # Family: family:all:12069 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429630;genbank:gi:156564120;genbank:GeneID:5525563 Probab=100.00 E-value=2.4e-63 Score=363.87 Aligned_cols=606 Identities=13% Similarity=0.091 Sum_probs=301.6 Q ss_pred Cc----ee-cCceEEEEec----CCCcccccCCcceEEEeecccCCCCccEEeCCHHHHHHHcCCcCCCchhHH------ Q lcl|NC_014792. 1 MA----LL-SPGIELKETT----VQSTVVRNATGRAALVGKFQWGPAFQVTQITNEVELVDLFGGPNNITADYF------ 65 (659) Q Consensus 1 ~~----~~-~PGVyveE~~----~~~~~~~~~ts~~afvG~~~~Gp~~~p~~i~s~~~~~~~fG~~~~~~~~~~------ 65 (659) |. |- .||+-+.--| .+..+..-.|-..-+.|.+--|||.+||+|+-... +..||+....+..++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~ 79 (717) T protein:vir:79 1 MAGFDQYQAIPGHNARFKDGNLNLKSDPNPRETESVVLLGTATDGPVMQPVRVTPETA-YNIFGKVAHENGVYNGATLLP 79 (717) T ss_pred CCchhhhhcCCCceeeeecCceecCCCCCccccceEEEEeeccCCcccCceeeChhHH-HhhhhhhhhhcccccchhhhH Confidence 55 32 5999997665 23344455677778999999999999999995554 589998765554332 Q ss_pred HHHHHHHcCCCeEEEEeccCCcccccccccccccccccccccccccccceeeeeeccccccccceeeeeeccCcceeeee Q lcl|NC_014792. 66 MSGMNFLQYGNDLRTVRVVNRDHAKNASPVAGNIESTIATAGSNYAVGDVIQVKHNQTVVETSGRITKVDVDGKILAVFI 145 (659) Q Consensus 66 ~~~~~f~ngG~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~ 145 (659) +.-..+..|..++...|+.+..+- .+-++.+-.+...++...-..++.+.-+...+.....+.+.....--+.+.+.+ T Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (717) T protein:vir:79 80 KFEELWAAGNRDIRLMRTTGVNAV--SSLLGTSYSKNSKEVAEDKLGGAQARGNVAATFTLPNGGIVEATFLLKARGVII 157 (717) T ss_pred HHHHHHhcCCcceEEEEecchhHH--HHHhhcccccchhhHHHHhhcccccccceEEEEEcCCCceeeeeeeeeecceEe Confidence 334445678888999998653321 111122211111111111111111111111111111111111000001111111 Q ss_pred ccccccccccccce--------eeeeccce--------eeEE----EeecCCccccccccceeccccceeeecccccccc Q lcl|NC_014792. 146 PSDKIIAFAKSVNQ--------YPDLGPAW--------TAEI----LTTSSGVSGTITLGKIVTDSGILLTEAENSEEAI 205 (659) Q Consensus 146 ~~~~~~~~~~~~~~--------~~~~~~~~--------~~~v----~~~~~g~~~~~~~~~~v~~~~~~~~~~~~~~~~~ 205 (659) +..+. +..++. .+...... ..++ -..+...++......+ .+. -++-.+.+..+ T Consensus 158 ~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~---~~~~~~~~~~~ 230 (717) T protein:vir:79 158 PPNNY---TLDVGTEEDMKAGTQPTFAQVLLNENVADMESEITVSYEFTYKDAQGETKTSEV-LDN---NTDKDGKPMIA 230 (717) T ss_pred CCCcc---eEeccChhhhhcCCCchhhhhhhccchhhccceeEEEEEEEeecccCcchhhhh-hcC---CCCCCCceeEE Confidence 11110 000000 00000000 0000 0001111110000000 000 00000111111 Q ss_pred cccceeecccccccceeeeccccccceeEEEEEeecccccccceeeeeeeccc------------------cccccceee Q lcl|NC_014792. 206 TSLEFQASLQKYAMPGVVALYPGEIGSTLEVEIVSKAAYDVGASKMLDIYPNG------------------GSRASVARA 267 (659) Q Consensus 206 ~~~~~~~~~~~~~~~~~~a~~~g~~g~~i~V~v~~~~~~~~~~~~~~~~~~~~------------------~~~~~~~~~ 267 (659) .+. ....+..+....-...+.+.++|.-+ ..+ ..+...+..-.... ..+...+.+ T Consensus 231 ~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 303 (717) T protein:vir:79 231 KGA-----DVTIKLEHVALAGLKLYADGIEVVDA-KAF-TVAGDQLTIHSNSKMKLGASLEAQYAYNLVEVIQPVIELES 303 (717) T ss_pred ecc-----cceeehhhhhhhhhHHhhcchhhhhh-hhe-eeecceEEEEecCCcccchhhHHHHHhhHHHhhccceEEee Confidence 111 00111111110001111112222100 000 00000000000000 001111111 Q ss_pred eeeeccccccceeeeeccCC-ceeeeeeeec-cccccccccch-----hh-----hhhhhhcccccceEEeecccCCc-- Q lcl|NC_014792. 268 VFNYGPQTDDQYAIIVRRDG-AIVENVVLST-KEGDKDVYGNN-----IY-----LDDYFAKGTSNYIYATSLNWPKG-- 333 (659) Q Consensus 268 ~~~~~~~~~~~~~~~v~~~g-~~~et~~~~~-~~~~~~~~~~~-----~~-----~~~~~~~~~s~~v~~~~~~~~~~-- 333 (659) ... ...-+.+...+..++ ...-+++... +.+.. +.... ++ ..+-+.+.+-.-+.+. .+++.. T Consensus 304 ~~~--g~~~n~~~~~v~~~D~~~~~~~t~~~~~~g~~-~~~pl~~ts~dy~~~~~~vdgI~~~~~~~V~~~-g~~s~a~a 379 (717) T protein:vir:79 304 IFG--GGVYNDIMRKVESKDGAVTVTITKPESKRGMI-SEDPLVFKSGDYTNFKMLVDAINNHPFNNVVRA-RTKPEFEA 379 (717) T ss_pred ccc--CceeeeeeeEEecCCceEEEEEecccccCcce-eccccccccCceeeeeeeecccccCchhheeee-ecccccce Confidence 111 111122233333332 2222222111 11111 00000 00 0011110000000000 001100 Q ss_pred -------cceeEEeecccccccccchhhhh------------hhHhhhhhcccccceEEEeccccccch--hhhHHHHHH Q lcl|NC_014792. 334 -------FAGIINLMGGISANDQVTAGDLM------------QGWDLFADREALHINLLIAGAVAGEGD--ATASTVQKH 392 (659) Q Consensus 334 -------~~~~~~~~gg~~~~~~~~~~~~~------------~~~~~~~~~~~~~~~~~~~p~~~~~~~--~~~~~v~~~ 392 (659) ......+.||.++.......-+. +.-.++...+.+++++++.|+...... .....++.+ T Consensus 380 ~~~~g~~s~d~a~f~Gg~dgl~~~~ee~Y~~lGgk~~d~g~lt~~aays~LE~~dVDlVil~ga~adtt~ga~~d~va~a 459 (717) T protein:vir:79 380 TFTSTLQAAADAKFSGGKDELSLDKEEMYKRLGGEKNEEGFVTKQGAYQYLENYEVDYVIPLGVHADTKLIGKYDDFAYQ 459 (717) T ss_pred eeeecccCchhhccCCCccccccchhhhhccccccccccccccchhhhhhcCcceeEEEEecCccccccccchhhhHHHH Confidence 00111234444443322111110 001234444556789999998754321 234567888 Q ss_pred HHHHHHhh----CCEEEEEec--CccccccccccCCHHHHHHHhhcccc---------------c--cccccccccceEE Q lcl|NC_014792. 393 VVSIADER----QDCLAFISP--PKGLLVNVPLTRAVDNLIDWRTGGGS---------------F--DTDNMNISTTYAA 449 (659) Q Consensus 393 l~~~~~~~----~~~~ai~d~--p~~~~~~~~~~~~~~~~~~~~~~~~~---------------~--~~~~~~~~s~~~~ 449 (659) +++||+.+ +.++.+++. |.+. ..+.+.+|++.... . ...... .+.|.. T Consensus 460 lad~caalSal~r~ai~VI~l~sp~D~--------~~AtVe~~~~kLs~~Aaa~~~~d~~~a~a~~~~~~~id-is~y~~ 530 (717) T protein:vir:79 460 LALACAVMSHYNSVTIGIIPTTTPSDI--------SLAGVEEHVKKLENYANEFYMRDRFGNIIFDADRNKID-LGQFIE 530 (717) T ss_pred HHHHHHHhhhccccceeeecccccccc--------chhhHHHHHHHHHhhhhhhhhhcchhcccccccccccc-ccceee Confidence 99999754 234554442 2221 11222233221110 0 000111 234555 Q ss_pred EEcCceeEecccCCcceeecHHHHHHHHHHHhhhcCCceECcCCcchhheeccccceeecChhHHHhhhhCCceEEEEEe Q lcl|NC_014792. 450 IDGNYKYQYDKYNDVNRWVPLAADMAGLCARTDDVSQPWMSPPGYNRGQILNVLKLAIEPRQTQRDRMYQEAINPVVGFA 529 (659) Q Consensus 450 ~~~p~~~~~d~~~~~~~~~p~s~~~Ag~~a~~d~~~g~~~span~~~~~i~g~~~~~~~~~~~e~~~Ln~~gin~i~~~~ 529 (659) +++++..++.+..+..+..||+|++||+ |..+|+||||+|+. |.|+.+++..+++.|++.||++|||||+.++ T Consensus 531 vv~~~~~iv~~~~~~~~~~p~AG~vAGl----dA~rGVwkSPANk~---I~GVvgLa~~lT~sE~d~Ln~aGIntIr~~~ 603 (717) T protein:vir:79 531 VVAGPDFIVRNTRLGQMASTPDASYIGM----VSQLKTQSAPTNKP---LPSVTALRYTYSANQLNRLTKARFATFKYKQ 603 (717) T ss_pred eeecceeEEEcCCCceeecCHHHHHHHH----HhcCCcccccccce---ecccccCcccCCHHHHHHHhhCCeEEEEEeC Confidence 5555555555556667888887666665 55579999999996 5567788999999999999999999999998 Q ss_pred CCCeEEEEcccccCCCccccceeehhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhccceeeeEEEE Q lcl|NC_014792. 530 GGDGFVLYGDKTATKVPSPMDHINVRRLTNMLKKNIGDASKYKLFELNDNFTRASFRMETSQYLDGIRALGGIYEGRVVC 609 (659) Q Consensus 530 ~~~G~~~wG~rT~~~~~~~~~~i~vrR~~~~i~~~i~~~~~~~v~epn~~~l~~~i~~~i~~~l~~l~~~gal~g~~v~~ 609 (659) + +|+++||+||+++++++|+||++||++++|+++|+++++|+|||||++.+|.+|+.+|++||++||++|+|.||++++ T Consensus 604 G-rGirVWGaRTtasd~sdWryInVRRl~D~Ie~sIr~al~~yVgEPNd~~tr~~Ik~sI~afL~~L~r~GAI~Gykvdv 682 (717) T protein:vir:79 604 D-GSIGVVDAPTSAHAGSDYTRLSTARIVKEAVNAVREVADPFIGEPNDTGNRNALTAAVDKRLSKMIENKALLGFDFRL 682 (717) T ss_pred C-ceEEEEeeeecCCCCcccceeehhhhHHHHHHHHHHHHHHhccccCCHHHHHHHHHHHHHHHHHHHhcCceecceeeE Confidence 6 799999999999888899999999999999999999999999999999999999999999999999999999999866 Q ss_pred ccCCCCHHHhhCCEEEEEEEEEecCCceEEEEEEEEee Q lcl|NC_014792. 610 DTTNNTPSVIDRNEFVASIYYKPARSINYIVLNFVATS 647 (659) Q Consensus 610 d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~ 647 (659) +||++++++|+|+++|+++|++|+|||+|+++.+. T Consensus 683 ---tnT~~di~~G~l~V~I~vaPv~PaEfI~ititITA 717 (717) T protein:vir:79 683 ---VVTPQQELLGEGSIELSLEAPNELRRLTTIVSLSA 717 (717) T ss_pred ---ecChhHhhCCEEEEEEEEEecCcccEEEEEEEEeC Confidence 79999999999999999999999999999999887 No 36 >protein:vir:80488 Length: 562 # NCBI annotation: Tsh # Family: family:all:2449 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468473;genbank:gi:157325048;genbank:GeneID:5601446 Probab=100.00 E-value=1.9e-62 Score=358.96 Aligned_cols=537 Identities=15% Similarity=0.069 Sum_probs=318.7 Q ss_pred Cc--------eecCceEEEEecCCCcccc-cCCcceEEEeecccCCCCccEEeCCHHHHHHHcCCcCCCchhHHHHHHHH Q lcl|NC_014792. 1 MA--------LLSPGIELKETTVQSTVVR-NATGRAALVGKFQWGPAFQVTQITNEVELVDLFGGPNNITADYFMSGMNF 71 (659) Q Consensus 1 ~~--------~~~PGVyveE~~~~~~~~~-~~ts~~afvG~~~~Gp~~~p~~i~s~~~~~~~fG~~~~~~~~~~~~~~~f 71 (659) |+ +..|||||||.+++.+++. ++|++++|||.+++||+++|++|+||.||++.||+..--.+..+|...+| T Consensus 1 ~~~~~~~~~~~~~pgv~~~~~~s~~~~~~~~~~~~~~~ig~a~~G~~~~~~~~~~~~~~~~~f~~g~l~~~i~~a~~~~~ 80 (562) T protein:vir:80 1 MAIEIYPRKPVSRPHTEISVDTSGIGGSSSGSEKILCLVGSATGGKPNAVYKVRNYSQAKSVFRSGELLDAIERAWNPGE 80 (562) T ss_pred CeeeeeCCCcccCCceEEEEecCCcccCCCCCCceEEEEEEeCCCCcceeEEEccHHHHHHHhcCCChHHHHHHhccccc Confidence 43 5679999999999998665 56999999999999999999999999999999998554444556666677 Q ss_pred HcCCCeEEEEeccCCcccccccccccccccccccccccccccceeeeeeccccccccceeeeeeccCcceeeeecccccc Q lcl|NC_014792. 72 LQYGNDLRTVRVVNRDHAKNASPVAGNIESTIATAGSNYAVGDVIQVKHNQTVVETSGRITKVDVDGKILAVFIPSDKII 151 (659) Q Consensus 72 ~ngG~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~ 151 (659) .|||++||+|||.... +++...+++. .++...+.|+|.+++...........+....-..++..+++...+. T Consensus 81 ~~g~~~~~~~rv~~a~---~a~~~~~~~~---~~~~~~g~~~n~i~v~~~~~~~~~~~~~~v~~~~~~~~ev~~~~g~-- 152 (562) T protein:vir:80 81 GTGAGDILAMRVEEAK---EATFEAEGVK---VSSTIYGADANDIQVALEDNTITGTKRLSIVFAKERVNQVYDNLGS-- 152 (562) T ss_pred ccCceEEEEEEcCCCC---cceEEecceE---EEEeecccCCCceEEEEecCCCCCCcceEEEecCCcceEEeeccCc-- Confidence 8999999999995432 2333333333 3455567899999987754433333333222222222233211111 Q ss_pred ccccccceeeeeccceeeEEEeecCCccccccccceecccc-ceeeecccccccc-cccceeecccccccceeeeccccc Q lcl|NC_014792. 152 AFAKSVNQYPDLGPAWTAEILTTSSGVSGTITLGKIVTDSG-ILLTEAENSEEAI-TSLEFQASLQKYAMPGVVALYPGE 229 (659) Q Consensus 152 ~~~~~~~~~~~~~~~~~~~v~~~~~g~~~~~~~~~~v~~~~-~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~a~~~g~ 229 (659) +....+.........+......... ...+....+ ............. ....... .-.......+.+++. T Consensus 153 -----v~~i~y~g~~~~a~~~i~~~~~~~~--a~~l~~~~g~~~v~~~~l~~g~~~~~~~l~~--~i~~~~~~tAky~g~ 223 (562) T protein:vir:80 153 -----IFSIKYKGTEASATFTVAVDPVTFK--ATKLTLKAGDKTVKEYDLGSGAYAETNVLIS--DINNLPDFEAKFFPI 223 (562) T ss_pred -----eeeeeeccccccceeEEEecCccce--EEEEEEecCCcceeEEEeCCCccchhhhhhh--hhccccceEEEeccc Confidence 0000000000000000000000000 000000000 0000000000000 0000000 000001112223333 Q ss_pred cceeEEEEEeecccccccceeeeeeeccccccccceeeeeeeccccccceeeeeccCCceeeeeeeeccccccccccchh Q lcl|NC_014792. 230 IGSTLEVEIVSKAAYDVGASKMLDIYPNGGSRASVARAVFNYGPQTDDQYAIIVRRDGAIVENVVLSTKEGDKDVYGNNI 309 (659) Q Consensus 230 ~g~~i~V~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~et~~~~~~~~~~~~~~~~~ 309 (659) .++.+.+..... .... ..+.... .++...++. T Consensus 224 ~~n~i~~~~~d~--------------------------~~~~----------~~kt~~~-----~v~~~~~d~------- 255 (562) T protein:vir:80 224 GDKNLTTDNFDA--------------------------QIDV----------DIKTKEA-----YVKAVGGDI------- 255 (562) T ss_pred CCceeeeccccc--------------------------chhh----------hccccee-----eeeehhhhh------- Confidence 333332211000 0000 0000000 000000000 Q ss_pred hhhhhhhcccccceEEeeccc-CCccceeEEeecccccccccchhhhhhhHhhhhhcccccceEEEeccccccchhhhHH Q lcl|NC_014792. 310 YLDDYFAKGTSNYIYATSLNW-PKGFAGIINLMGGISANDQVTAGDLMQGWDLFADREALHINLLIAGAVAGEGDATAST 388 (659) Q Consensus 310 ~~~~~~~~~~s~~v~~~~~~~-~~~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~ 388 (659) ...+....++.+..... .........|.||.++... .++.++++.++. .+.++++++ .+..+ T Consensus 256 ----~~~~~~n~~v~~~~~~~~~la~~~~~~LtGG~dG~~~---~~~~dal~~Le~---~~~~~i~~~-------t~d~a 318 (562) T protein:vir:80 256 ----EKQTAYNGYVEFEFDRSKEIANFPLTKLTGGDNGTIP---ESWADKFSYFAN---EGGYYLVPL-------TSKQA 318 (562) T ss_pred ----hhcccccceEEEEeccCccccccceeeeeCCCCCCcc---ccHHHHHHHHHh---CCcEEEEec-------CCChH Confidence 00011122332221111 1111234578899887432 346666666654 344555443 23457 Q ss_pred HHHHHHHHHHhhCC----EEEEEecCccccccccccCCHHHHHHHhhccccccccccccccceEEEEcCceeEecccCCc Q lcl|NC_014792. 389 VQKHVVSIADERQD----CLAFISPPKGLLVNVPLTRAVDNLIDWRTGGGSFDTDNMNISTTYAAIDGNYKYQYDKYNDV 464 (659) Q Consensus 389 v~~~l~~~~~~~~~----~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~ 464 (659) ++.++.+||+++++ ++++++.+.+ .+.+++..... .+++.+.++++|+..+.+. ++. T Consensus 319 i~~~~~a~vkr~r~~g~~~~aVvg~~~~--------~~~~~~~~~a~----------~~n~e~vv~v~~~~~~~~~-~~~ 379 (562) T protein:vir:80 319 VHAEALQFVRDCSYNGNPMRVFVGGGIG--------ESMEQLFTRAI----------GLQNERAGLIGFSGTVKMD-DGR 379 (562) T ss_pred HHHHHHHHHHHHHhCCCeEEEEecCCCC--------CCHHHHHHHhh----------hcCCCeEEEEecCeeEECC-CCc Confidence 88999999988865 8999987753 35566665543 3567889999998776554 455 Q ss_pred ceeecH---HHHHHHHHHHhhhcCCceECcCCcchhheeccccceeecChhHHHhhhhCCceEEEEEeCCCeEEEEc--- Q lcl|NC_014792. 465 NRWVPL---AADMAGLCARTDDVSQPWMSPPGYNRGQILNVLKLAIEPRQTQRDRMYQEAINPVVGFAGGDGFVLYG--- 538 (659) Q Consensus 465 ~~~~p~---s~~~Ag~~a~~d~~~g~~~span~~~~~i~g~~~~~~~~~~~e~~~Ln~~gin~i~~~~~~~G~~~wG--- 538 (659) .+.+|+ ++++||++|+.| +++||.|+.+. + .++...+++.|++.|+++|+++++...+ ++.++|. T Consensus 380 ~~~~~~~~~aa~vAGl~Ag~~----~~~S~T~~~i~---~-~~v~~~lt~~e~~~li~~G~l~l~~~~~-~~v~~~riv~ 450 (562) T protein:vir:80 380 SLKMPGYMFAAQVAGLTCGLE----IGEAITFKNIA---I-ETLDTIYEGSQLDQLNESGIITAEFVRN-RAVTNFRIVD 450 (562) T ss_pred eeeechhHHHHHHHHHHhcCc----cccCccceeec---c-ccccccCCHHHHHHHHhCCeEEEEEecC-CcEEEEEeec Confidence 566666 889999999987 67799998753 3 3567789999999999999999987654 5556662 Q ss_pred -ccccC-CCccccceeehhhHHHHHHHHHHHHH-HHHhcCCCCHHHHHHHHHHHHHHHHHHHhccceeeeEEEEccCCCC Q lcl|NC_014792. 539 -DKTAT-KVPSPMDHINVRRLTNMLKKNIGDAS-KYKLFELNDNFTRASFRMETSQYLDGIRALGGIYEGRVVCDTTNNT 615 (659) Q Consensus 539 -~rT~~-~~~~~~~~i~vrR~~~~i~~~i~~~~-~~~v~epn~~~l~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~nt 615 (659) -.|.. ..+..|++|+++|++|+|++.|++.+ +||++|||+...|..|+..|..||.+||+.|+|.+|... +- T Consensus 451 ~itT~t~~~~~~~~ki~viRv~D~i~~dir~~~~~~yIGk~Nn~~~r~~v~~~i~~~L~~l~~~gaI~~~~~~-----dv 525 (562) T protein:vir:80 451 DVTTFNDKTDPVKSEIGVGEANDFLVSELKISLDNEYIGTKIIDTSASLVKNFVQSFLDRKKLAKEIQDYSPE-----EV 525 (562) T ss_pred cceeccCCCCchhhhhhhhHHHHHHHHHHHHHHHhcCCccccChHHHHHHHHHHHHHHHHHHhCCcccCCCcc-----ce Confidence 22322 33568999999999999999998876 689999999999999999999999999999999998632 12 Q ss_pred HHHhhCCEEEEEEEEEecCCceEEEEEEEEeecCeeE Q lcl|NC_014792. 616 PSVIDRNEFVASIYYKPARSINYIVLNFVATSTGADF 652 (659) Q Consensus 616 ~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~ 652 (659) +.+..+++++|++.+.|+.|+|||++++.....-.+- T Consensus 526 ~v~~~~d~~~v~~~v~Pv~~mekIy~ti~~~~~~~~~ 562 (562) T protein:vir:80 526 QVVIEGDIARISLTVFPIRSMKKIEVSLVYRQQILTA 562 (562) T ss_pred EEEecCCEEEEEEEEEEcccceEEEEEEEEEeeeecC Confidence 3345678899999999999999999999977766665 No 37 >protein:vir:95741 Length: 587 # NCBI annotation: ORF011 # Family: family:all:2449 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240910;genbank:gi:66394960;genbank:GeneID:5132682 Probab=100.00 E-value=8.2e-62 Score=355.48 Aligned_cols=565 Identities=15% Similarity=0.071 Sum_probs=318.3 Q ss_pred Cc--------eecCceEEEEecCCCccc-ccCCcceEEEeecccCCCCccEEeCCHHHHHHHcCCcCCCchhHHHHHHHH Q lcl|NC_014792. 1 MA--------LLSPGIELKETTVQSTVV-RNATGRAALVGKFQWGPAFQVTQITNEVELVDLFGGPNNITADYFMSGMNF 71 (659) Q Consensus 1 ~~--------~~~PGVyveE~~~~~~~~-~~~ts~~afvG~~~~Gp~~~p~~i~s~~~~~~~fG~~~~~~~~~~~~~~~f 71 (659) |+ +..|||||||.+++.++. ++++++++|||.+++||+++|++++||.||+++||+.+--..+.++...|| T Consensus 1 ~a~~~~~~~~~~~pgv~~~~~~~~~~~~~~~~~~~~~~~g~a~~G~~~~~~~~~~~~~~~~~~~~g~l~~~~~~a~~~~~ 80 (587) T protein:vir:95 1 MAVEPFPRRPITRPHASIEVDTSGIGGSAGSSEKVFCLIGQAEGGEPNTVYELRNYSQAKRLFRSGELLDAIELAWGSNP 80 (587) T ss_pred CcccccCCcccccCceEEEEecCCccccCCCCCceEEEEEEeCCCCCceeEEeccHHHHHHHhcCcchHHHHHHHhcccc Confidence 33 668999999999999865 557999999999999999999999999999999988553333334444555 Q ss_pred HcCCCeEEEEeccCCcccccccccccccccccccccccccccceeeeeeccccccccceeeeeeccCcceeeeecccccc Q lcl|NC_014792. 72 LQYGNDLRTVRVVNRDHAKNASPVAGNIESTIATAGSNYAVGDVIQVKHNQTVVETSGRITKVDVDGKILAVFIPSDKII 151 (659) Q Consensus 72 ~ngG~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~ 151 (659) .|||++||++||.+.. +|+....++. .++..++.|||.+++....+......++.......+..+++-..+. T Consensus 81 ~~g~~~~~~~rv~~~~---~a~~~~~~l~---~~a~~~G~~gN~i~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~-- 152 (587) T protein:vir:95 81 NYTAGRILAMRIEDAK---PASAEIGGLK---ITSKIYGNVANNIQVGLEKNTLSDSLRLRVIFQDDRFNEVYDNIGN-- 152 (587) T ss_pred CCCceEEEEEEcCCCc---eeEEEecCeE---EEEecccccccceEEEEecCCCCCceeEEEEEecccceeeeeeccc-- Confidence 7999999999995543 2333333333 4456788999999998775544444333322222221122211110 Q ss_pred ccccccceeeeeccceeeEEEeecCCccccccccceecccc-ceeeecccccccccccceeecccccccceeeecccccc Q lcl|NC_014792. 152 AFAKSVNQYPDLGPAWTAEILTTSSGVSGTITLGKIVTDSG-ILLTEAENSEEAITSLEFQASLQKYAMPGVVALYPGEI 230 (659) Q Consensus 152 ~~~~~~~~~~~~~~~~~~~v~~~~~g~~~~~~~~~~v~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~ 230 (659) +....+........+.......... ........+ ......... .+...............+...+.+++.. T Consensus 153 -----v~si~y~g~~~~~~~~v~~~~~t~~--a~~~~l~~g~~~v~~yrL~-~g~~~~~~~~~~~in~~~~~tAky~g~~ 224 (587) T protein:vir:95 153 -----IFTIKYKGEEANATFSVEHDEETQK--ASRLVLKVGDQEVKSYDLT-GGAYDYTNAIITDINQLPDFEAKLSPFG 224 (587) T ss_pred -----eeeeeeeccccccceeeeeccccee--eeeeeeecCCceEEEEEec-CCchHHHHHHHHhhccccceEEEEeccc Confidence 0000000000000000000000000 000000000 000000000 0000000000000011222344555555 Q ss_pred ceeEEEEEeecccccccceeeeeeeccccccccceeeeeeeccccccceeeeeccCCceeeeeeeeccccccccccchhh Q lcl|NC_014792. 231 GSTLEVEIVSKAAYDVGASKMLDIYPNGGSRASVARAVFNYGPQTDDQYAIIVRRDGAIVENVVLSTKEGDKDVYGNNIY 310 (659) Q Consensus 231 g~~i~V~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~et~~~~~~~~~~~~~~~~~~ 310 (659) ++.+.+.... .............. ... ..+..... .. ..+.......+...+..... . T Consensus 225 ~~~i~~~~~~-~~~~~~v~~~~~~v---~a~--~~d~~~~~--~~-~~~v~~~~~~g~~~~~~~~~-------------~ 282 (587) T protein:vir:95 225 DKNLESSKLD-KIENANIKDKAVYV---KAV--FGDLEKQT--AY-NGIVSFEQLNAEGEVPSNVE-------------V 282 (587) T ss_pred CceeEEeecC-cccccceehhhhhh---hhh--hcceeeee--ec-eeeeeeecccccceeccchh-------------h Confidence 5554432210 00000000000000 000 00000000 00 00000000001000000000 0 Q ss_pred hhhhhhcccccceEEeecccCCccceeEEeecccccccccchhhhhhhHhhhhhcccccceEEEeccccccchhhhHHHH Q lcl|NC_014792. 311 LDDYFAKGTSNYIYATSLNWPKGFAGIINLMGGISANDQVTAGDLMQGWDLFADREALHINLLIAGAVAGEGDATASTVQ 390 (659) Q Consensus 311 ~~~~~~~~~s~~v~~~~~~~~~~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~v~ 390 (659) .........................|.||.|+.. ..++..++++++. .+.++++++ .+..+++ T Consensus 283 ----~~~~~~a~~~~~~~~~~~a~~~~t~LtGG~dG~~---~~~y~~~l~ale~---~~~~~i~~~-------t~d~~v~ 345 (587) T protein:vir:95 283 ----EAGEESATVTATSPIKTIEPFELTKLKGGTNGEP---PATWADKLDKFAH---EGGYYIVPL-------SSKQSVH 345 (587) T ss_pred ----hhcccchheeccccccceeccceeeeecCCCCCC---cccHHHHHHHHHh---CCcEEEEec-------CCCHHHH Confidence 0000000000000000001112245889988643 2356777777654 345666543 2445788 Q ss_pred HHHHHHHHhhCC----EEEEEecCccccccccccCCHHHHHHHhhccccccccccccccceEEEEcCceeEecccCCcce Q lcl|NC_014792. 391 KHVVSIADERQD----CLAFISPPKGLLVNVPLTRAVDNLIDWRTGGGSFDTDNMNISTTYAAIDGNYKYQYDKYNDVNR 466 (659) Q Consensus 391 ~~l~~~~~~~~~----~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~ 466 (659) .++.+||+++++ ++++++.+.+ .+.+++..... .+++.+.++++|+..+. ..++... T Consensus 346 a~l~a~vk~~~~~g~~~~aVvg~~~~--------~~~~~~~~~a~----------~~n~ervi~v~~~~~~~-~~dg~~~ 406 (587) T protein:vir:95 346 AEVASFVKERSDAGEPMRAIVGGGFN--------ESKEQLFGRQE----------SLSNPRVSLVANSGTFV-MDDGRKN 406 (587) T ss_pred HHHHHHHHHHHhCCCcEEEEEcCCCC--------CCHHHHHHHHh----------hcCCCcEEEecccceEe-cCCCcee Confidence 999999988765 8999986643 35566665543 35678888888876543 2355667 Q ss_pred eecH---HHHHHHHHHHhhhcCCceECcCCcchhheeccccceeecChhHHHhhhhCCceEEEEEeCCCe--EE-EEccc Q lcl|NC_014792. 467 WVPL---AADMAGLCARTDDVSQPWMSPPGYNRGQILNVLKLAIEPRQTQRDRMYQEAINPVVGFAGGDG--FV-LYGDK 540 (659) Q Consensus 467 ~~p~---s~~~Ag~~a~~d~~~g~~~span~~~~~i~g~~~~~~~~~~~e~~~Ln~~gin~i~~~~~~~G--~~-~wG~r 540 (659) .+|+ ++++||++|..| +++||.|+.+. ..++...+++.|++.|+++|++++....+..+ ++ +.+-. T Consensus 407 ~~~~~~~aa~vAGl~Ag~~----~~~SlT~~~i~----~~~v~~~~t~~e~e~ai~~Gvl~l~~~~~~~~~~vriv~~it 478 (587) T protein:vir:95 407 HVPAYMVAVALGGLASGLE----IGESITFKPLR----VSSLDQIYESIDLDELNENGIISIEFVRNRTNTFFRIVDDVT 478 (587) T ss_pred eechHHHHHHHHHHHhcCc----hhcCccceeee----cccccccCCHHHHHHHHhCCeEEEEEecCCcceEEEEeecce Confidence 7777 688999999987 66799998753 34667789999999999999999976654322 33 24555 Q ss_pred ccC-CCccccceeehhhHHHHHHHHHHHHH-HHHhcCCCCHHHHHHHHHHHHHHHHHHHhccceeeeEEEEccCCCCHHH Q lcl|NC_014792. 541 TAT-KVPSPMDHINVRRLTNMLKKNIGDAS-KYKLFELNDNFTRASFRMETSQYLDGIRALGGIYEGRVVCDTTNNTPSV 618 (659) Q Consensus 541 T~~-~~~~~~~~i~vrR~~~~i~~~i~~~~-~~~v~epn~~~l~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~nt~~~ 618 (659) |.. .++..|++|+++|++|+|++.|++.+ +||++|||+...|..|+..|..||.+||+.|+|.+|... +.+-+ T Consensus 479 T~t~~~d~~~~~i~viRv~D~i~~dir~~~~~~~iGk~nn~~~r~~v~~~i~~~L~~l~~~gaI~~~~~~-----dv~v~ 553 (587) T protein:vir:95 479 TFNDKSDPVKAEMAVGEANDFLVSELKVQLEDQFIGTRTINTSASIIKDFIQSYLGRKKRDNEIQDFPAE-----DVQVI 553 (587) T ss_pred eccCCCCcchhhhhhhhhHHHHHHHHHHHHHhhCCccccchHHHHHHHHHHHHHHHHHHhCCcccCCCcc-----ceEEE Confidence 543 33457999999999999999999876 699999999999999999999999999999999998642 22233 Q ss_pred hhCCEEEEEEEEEecCCceEEEEEEEEeecCeeE Q lcl|NC_014792. 619 IDRNEFVASIYYKPARSINYIVLNFVATSTGADF 652 (659) Q Consensus 619 i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~ 652 (659) +...++++++.+.|+.|+|+|.++++....-++- T Consensus 554 ~~~d~~~v~~~v~Pv~~mekI~vt~~~~~~~~~~ 587 (587) T protein:vir:95 554 VEGNEARISMTVYPIRSFKKISVSLVYKQQTLQA 587 (587) T ss_pred ecCCEEEEEEEEEEcccceEEEEEEEEeeeeecC Confidence 4556899999999999999999999976555544 No 38 >protein:vir:102819 Length: 648 # NCBI annotation: tail sheath protein # Family: family:all:2449 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874082;genbank:gi:118197689;genbank:GeneID:4496011 Probab=100.00 E-value=1.4e-61 Score=354.18 Aligned_cols=584 Identities=13% Similarity=0.062 Sum_probs=288.6 Q ss_pred Ccee---------cCceEEEEecCCCcccc-cCCcceEEEeecccCCCCccEEeCCHHHHHHHcCCcCCCchhHHHHHHH Q lcl|NC_014792. 1 MALL---------SPGIELKETTVQSTVVR-NATGRAALVGKFQWGPAFQVTQITNEVELVDLFGGPNNITADYFMSGMN 70 (659) Q Consensus 1 ~~~~---------~PGVyveE~~~~~~~~~-~~ts~~afvG~~~~Gp~~~p~~i~s~~~~~~~fG~~~~~~~~~~~~~~~ 70 (659) |+.. +|||||||+|++.++++ ++|++++|||.++|||+|+|++|+||.||++.||+ .++.+++++| T Consensus 1 ma~~~yf~~~~~~~PGVyvee~~sg~~~i~gv~tsva~fvG~a~~Gp~~~p~~v~s~~~~~~~fgg----g~l~~av~~~ 76 (648) T protein:vir:10 1 MAISVYFDGKLIKQLGAYVKTDLSAVKQINGVGTGIVALLGLAEGGETYKPYRLTSFAEAVSIFKG----GPLLEHIKAA 76 (648) T ss_pred CeeeeeeCCCCccCCceEEEEeccccccccCCCCceEEEEEeeCCCCCceeEEecCHHHHHHHhcC----ccHHHHHHHH Confidence 6633 49999999999998665 56999999999999999999999999999999996 4699999999 Q ss_pred HHcCCCeEEEEeccCCcccccccccccccccccccccccccccceeeeeec--cccccccceeeeeeccCcceeeeeccc Q lcl|NC_014792. 71 FLQYGNDLRTVRVVNRDHAKNASPVAGNIESTIATAGSNYAVGDVIQVKHN--QTVVETSGRITKVDVDGKILAVFIPSD 148 (659) Q Consensus 71 f~ngG~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~v~~~--~~~~~~~~~~~~~~~~g~~~~~~~~~~ 148 (659) |+|||++||+|||.+...+ +.....+ ..++...+.||+.+++... ........++........ ..+ . T Consensus 77 F~nGg~~~~~vRv~~~~~a---~~~~~~~---~~~a~~~g~~gn~i~~~v~~~~~~~~~~~~l~v~~~~~~--~~~---d 145 (648) T protein:vir:10 77 FIGGAGEVVAVRIGNPTTA---SVSIPVA---QNTSDTSPANLNFVSYEASTRSNQIYVSFDLDENFTSAN--EAD---D 145 (648) T ss_pred HhCCCcEEEEEEcCCCccc---ceeccee---EEeecccCCCCCceEEEEEEcCCCcCceeEEEEEecCCC--ccc---c Confidence 9999999999999764332 2112222 2344556778888764443 222222222221111100 000 0 Q ss_pred ccc-ccccccceeeeeccceeeEEEeecCCccccccccceeccccceeeeccccccccccc-ceeecccccccce----- Q lcl|NC_014792. 149 KII-AFAKSVNQYPDLGPAWTAEILTTSSGVSGTITLGKIVTDSGILLTEAENSEEAITSL-EFQASLQKYAMPG----- 221 (659) Q Consensus 149 ~~~-~~~~~~~~~~~~~~~~~~~v~~~~~g~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~----- 221 (659) +.. ........+..........+....... ... .+................ ............. T Consensus 146 ~~v~~i~~~~~~y~gt~~~~t~~v~~~~~~~--~~~-------~~~~~~~~~~~~v~~~~~~~~~~~~~~v~~~~~~~~~ 216 (648) T protein:vir:10 146 TIIFTIYQKHPDFSVTRETFTFPRKFTTPTV--LVK-------RGSTLFFVDRSIVNAALAAGPAFQTALINLLKEQLQP 216 (648) T ss_pred eeEEEeccCCCcccccceecccccccccccc--ccc-------cccceeecCccchhhhhccCccchhhhhhchhhhhhh Confidence 000 000000000000000000000000000 000 000000000000000000 0000000000000 Q ss_pred --eeeccccccceeEEEEEeecccccccceeeeeeeccccccccceeeee-eecccccc--ceeeeeccCCce-eeeeee Q lcl|NC_014792. 222 --VVALYPGEIGSTLEVEIVSKAAYDVGASKMLDIYPNGGSRASVARAVF-NYGPQTDD--QYAIIVRRDGAI-VENVVL 295 (659) Q Consensus 222 --~~a~~~g~~g~~i~V~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~--~~~~~v~~~g~~-~et~~~ 295 (659) .....+....+..+.. ..............+....... ......+. .+...+...-++ .....+ T Consensus 217 ~~~~~~~~~s~~~~~d~~----------~~~~~~~a~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~~tp~~~~~~~ 286 (648) T protein:vir:10 217 TDVVQIFDASDTNPVDIP----------LGLFVYEVLYGGLFGFTKSRLVKTSFGTVDDLLSNPLLFNLSATPFFDGSDY 286 (648) T ss_pred hhhheecccccccccccc----------cccccccccchhhhcCCcchhhhhhhccccccccccceecccccccccccce Confidence 0000000000000000 0000000000000000000000 00000000 000000000000 000000 Q ss_pred eccccccccccchhhhhhhhhcccccceEEeecccCCccceeEEeeccccccccc---------chhhhhhhHhhhhhcc Q lcl|NC_014792. 296 STKEGDKDVYGNNIYLDDYFAKGTSNYIYATSLNWPKGFAGIINLMGGISANDQV---------TAGDLMQGWDLFADRE 366 (659) Q Consensus 296 ~~~~~~~~~~~~~~~~~~~~~~~~s~~v~~~~~~~~~~~~~~~~~~gg~~~~~~~---------~~~~~~~~~~~~~~~~ 366 (659) ... .................+.++......... ......|+||.++..+. +..|+..++++++..+ T Consensus 287 ~~~----~~~~~~~~~~~v~~~~~~~l~~~~~~p~~~-~~~~t~L~GGtdG~~p~s~~~~~~~~~~~d~~d~l~~~~~~~ 361 (648) T protein:vir:10 287 QDY----TSLSDPANWFAKDAYTINHLVDTTINPHIL-ATRIFSLSGGTNGDDGTGYYQTAVSNYINIWSQGLATLEEEE 361 (648) T ss_pred eee----eccccccceeeeeccchhhcccccccCccc-ccccceecccccCCCcccccccccccchhhHHHHhhhccCCC Confidence 000 000000000000000111122211211111 11123588998886652 4567777777776544 Q ss_pred cccceEEEeccccc-----cchhhhHHHHHHHHHHHHhhC---------CEEEEEecCccccccccccCCHHHHHHHhhc Q lcl|NC_014792. 367 ALHINLLIAGAVAG-----EGDATASTVQKHVVSIADERQ---------DCLAFISPPKGLLVNVPLTRAVDNLIDWRTG 432 (659) Q Consensus 367 ~~~~~~~~~p~~~~-----~~~~~~~~v~~~l~~~~~~~~---------~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~ 432 (659) .+. ++.+++... ..-...++++.++++||+.|. ..++++.++.+ .+..+...-+.. T Consensus 362 ~~~--ivp~~~~~~~~~~~~~lt~~q~i~a~a~shv~~~s~~~~~~~r~~~~~~vg~~~~--------es~~~se~~~~~ 431 (648) T protein:vir:10 362 VNF--VIPAYKFTNVTQLNDRLTIFKGIASTFLSHVQTMSQVNRRKARVGVFGLPAPSPN--------ESVTASEYLYNR 431 (648) T ss_pred ceE--EEeecccccccccccccCCccchHHHHHHHHHHhhhccccccccCeEEEeCCCCc--------hhHHHHHHHhhh Confidence 432 122221111 112345788889999987552 12555554432 222222222111 Q ss_pred ccccccccc---ccc-cceEEEEcCceeEecccCCcceeecH---HHHHHHHHHHhhhcCCceECcCCcchhheeccccc Q lcl|NC_014792. 433 GGSFDTDNM---NIS-TTYAAIDGNYKYQYDKYNDVNRWVPL---AADMAGLCARTDDVSQPWMSPPGYNRGQILNVLKL 505 (659) Q Consensus 433 ~~~~~~~~~---~~~-s~~~~~~~p~~~~~d~~~~~~~~~p~---s~~~Ag~~a~~d~~~g~~~span~~~~~i~g~~~~ 505 (659) ..... .+. ..+ .+.+...+.+... ..+++...+|| ++++||+++++. ++.||.||++.+ .+ +.+ T Consensus 432 ~~~~~-~~a~~~~~d~~~~~~~~~~~~~~--~~~G~~~~~p~~~~Aa~VAGl~a~l~----~~~s~T~k~i~~-~~-id~ 502 (648) T protein:vir:10 432 NILNT-ISAMFGGTDRAQAVVFPFYSNVF--NDEGKVELLGGEFFASYVAGMHANRE----PQDSITFLPISG-IG-AEP 502 (648) T ss_pred hcccc-cceeeeecCCceEEeecccceeE--CCCCcEEecchhhHHHHHHhhhhccc----cccCcccceeec-cc-ccc Confidence 11000 000 011 1222222233222 22567777898 678999999875 788999998652 22 344 Q ss_pred eeecChhHHHhhhhCCceEEEEEeCCC---eEEEEcccccC--CCccccceeehhhHHHHHHHHHHH-HHHHHhcCCCCH Q lcl|NC_014792. 506 AIEPRQTQRDRMYQEAINPVVGFAGGD---GFVLYGDKTAT--KVPSPMDHINVRRLTNMLKKNIGD-ASKYKLFELNDN 579 (659) Q Consensus 506 ~~~~~~~e~~~Ln~~gin~i~~~~~~~---G~~~wG~rT~~--~~~~~~~~i~vrR~~~~i~~~i~~-~~~~~v~epn~~ 579 (659) ...++++|++.|+++||+||....+.. ++++--+-|.. .++..|+.|+++|++|++...+++ ..++|+++||+. T Consensus 503 ~~~~t~~qld~L~~~Gv~~ie~~~~~~~~~~~rvv~gITT~~~~~~~~~~eisv~ri~D~l~~~vr~~l~~~fIG~~n~~ 582 (648) T protein:vir:10 503 LYNWTYTQKDDLISNRVLFVEKVKTSFGGIVYRIHHNPTTWLGPVTQGFQEFVLRRIDDFLQSYVYKNLQEQFIGRKSYG 582 (648) T ss_pred ccCCCHHHHHHHhcCCcEEEEEecCCcceeeEEEeccceeecCCCCcceeeeeeeehhhHHHHHHHHHHhhhcCcccccH Confidence 578999999999999999998765421 34443333322 234579999999999999999987 556999999999 Q ss_pred HHHHHHHHHHHHHHHHHHhccceeeeE---EEEccCCCCHHHhhCCEEEEEEEEEecCCceEEEEEEEEeecCe Q lcl|NC_014792. 580 FTRASFRMETSQYLDGIRALGGIYEGR---VVCDTTNNTPSVIDRNEFVASIYYKPARSINYIVLNFVATSTGA 650 (659) Q Consensus 580 ~l~~~i~~~i~~~l~~l~~~gal~g~~---v~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~ 650 (659) ..|.+|++.|.+||.++++.++|++|. |.++ ++++++++++.+.|++|++||.+++..+-.-. T Consensus 583 ~~~~~ik~~i~~~L~~~~~~~~I~~y~~~~v~~~--------~~~~vv~V~~~v~Pv~~i~~I~vti~it~~~~ 648 (648) T protein:vir:10 583 RKTENDIKVYTEALLSNLVGKQIVAYKDVKVTSN--------EDKTVYYVEFFYQPVTEIKFILVTMKVTFDLE 648 (648) T ss_pred HHHHHHHHHHHHHHhhHhhcCcccCcccceEEEE--------ecCCEEEEEEEEEecceeeEEEEEEEEEeccC Confidence 999999999999999999999999975 3333 35689999999999999999999877543222 No 39 >protein:vir:80779 Length: 569 # NCBI annotation: putative tail sheath protein # Family: family:all:2449 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504132;genbank:gi:158079319;genbank:GeneID:5666428 Probab=100.00 E-value=3.5e-60 Score=346.57 Aligned_cols=544 Identities=15% Similarity=0.079 Sum_probs=312.4 Q ss_pred Cc--------eecCceEEEEecCCCccccc-CCcceEEEeecccCCCCccEEeCCHHHHHHHcCC--cCCCchhHHHHHH Q lcl|NC_014792. 1 MA--------LLSPGIELKETTVQSTVVRN-ATGRAALVGKFQWGPAFQVTQITNEVELVDLFGG--PNNITADYFMSGM 69 (659) Q Consensus 1 ~~--------~~~PGVyveE~~~~~~~~~~-~ts~~afvG~~~~Gp~~~p~~i~s~~~~~~~fG~--~~~~~~~~~~~~~ 69 (659) |+ +..|||||||.+++++++++ ++++++|||.+++||.|+|++|+||.||++.||+ +.+..++.|.... T Consensus 1 ~~~~~~~~~~~~~Pgv~~~~~~~~~~~~~~~~~~~~~~ig~a~~G~~~~~~~~~~~~~~~~~f~~g~l~~a~~~a~~~~~ 80 (569) T protein:vir:80 1 MAVEQFPRKKVSRPHTEITVDTSGIGGSSSSSDKTLMLVGSAKGGKPDTVYRFRNYQQAKQVLRSGDLLDAIELAWNASD 80 (569) T ss_pred CeeeeecCCccccCceEEEEecCCCcCCCCCCceeEEEEEEeCCCCCceeEEecCHHHHHHHhcCCchhHHHHhhccCcc Confidence 43 45799999999999986655 6999999999999999999999999999999976 2233334444455 Q ss_pred HHHcCCCeEEEEeccCCcccccccccccccccccccccccccccceeeeeeccccccccceeeeeeccCcceeeeecccc Q lcl|NC_014792. 70 NFLQYGNDLRTVRVVNRDHAKNASPVAGNIESTIATAGSNYAVGDVIQVKHNQTVVETSGRITKVDVDGKILAVFIPSDK 149 (659) Q Consensus 70 ~f~ngG~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~ 149 (659) +|.|||+.||++|+.+.. +++....++. .++...+.|++.+++...........+.......++....+ T Consensus 81 ~~~~~~~~~~~~rv~~a~---~a~~~~~~~~---~~a~~~g~~~n~i~v~l~~~~~~~~~~~~v~~~~~~~~~~~----- 149 (569) T protein:vir:80 81 VNTASAGDILAVRVEDAK---NATLTKGGLT---FASTIYGVDANEIQVALEDNNLTHTKRLTVAFSKDGYKKVF----- 149 (569) T ss_pred ccccCceEEEEEEcCCCe---eeeeecccee---eeeeeccCCCceEEEEEecCcCCcceeeEEeeecCCCcccc----- Confidence 568999999999994432 2222222222 33444566888888765433222222211110000000000 Q ss_pred ccccccccceeeeeccceeeEEEeecCCccccccccceeccccceeeecccccccccccceeecccccccceeeeccccc Q lcl|NC_014792. 150 IIAFAKSVNQYPDLGPAWTAEILTTSSGVSGTITLGKIVTDSGILLTEAENSEEAITSLEFQASLQKYAMPGVVALYPGE 229 (659) Q Consensus 150 ~~~~~~~~~~~~~~~~~~~~~v~~~~~g~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~ 229 (659) ..+..... ..+.+....... .+..+.. . .+.... ....-++ T Consensus 150 -----------~~ig~v~s----i~ytg~~~~a~~-~~~~~~~-------------~--------~~a~~l--~~~~g~~ 190 (569) T protein:vir:80 150 -----------DNLGKIFS----IQYKGSEAQANF-TIAQDSI-------------S--------KKATTL--TLNVGSE 190 (569) T ss_pred -----------ccccceee----EEEeeccccceE-EeecCcC-------------c--------ceeEEE--EEEecCC Confidence 00000000 001110000000 0000000 0 000000 0000001 Q ss_pred cceeEEEEEeecccccccceeeeeeeccccccccceeeeeeeccccccceeeeeccCCceeeeeeeeccccccccccchh Q lcl|NC_014792. 230 IGSTLEVEIVSKAAYDVGASKMLDIYPNGGSRASVARAVFNYGPQTDDQYAIIVRRDGAIVENVVLSTKEGDKDVYGNNI 309 (659) Q Consensus 230 ~g~~i~V~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~et~~~~~~~~~~~~~~~~~ 309 (659) ......+.............. ..+......+....+. ... .+........... +.+.++....-. ..... T Consensus 191 ~~~~~~v~~~~~~~~~~~~~~-~lv~~~~~~~~f~a~~-~~~---~~~~~~~~~~d~~---~~~~~~t~~~~~--~~~~~ 260 (569) T protein:vir:80 191 PESTTEVMKYELGQGVYSETN-VLVSAINSLPDWEAKF-FPI---GDKNLPTDALEAV---TKVDVKTEAVFV--GALAG 260 (569) T ss_pred cceeEEEEeeccCCccchhhh-hhhhhcCCccCceEEE-Eec---CCCcceehhccch---hheeccccceee--ehhHH Confidence 111111111100000000000 0000001111111111 000 0000000000000 001111100000 00000 Q ss_pred hhhhhhhcccccceEEeecccCC-ccceeEEeecccccccccchhhhhhhHhhhhhcccccceEEEeccccccchhhhHH Q lcl|NC_014792. 310 YLDDYFAKGTSNYIYATSLNWPK-GFAGIINLMGGISANDQVTAGDLMQGWDLFADREALHINLLIAGAVAGEGDATAST 388 (659) Q Consensus 310 ~~~~~~~~~~s~~v~~~~~~~~~-~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~ 388 (659) .+... ...+.++.+....... .......|.||.|+.. ..++..+++.++. +++++++++ ++..+ T Consensus 261 di~~~--~~~~~~v~~~~~~~~~l~~~~~~~LtGG~dG~~---~~~~~~~l~~le~---~~~~~i~~~-------t~d~a 325 (569) T protein:vir:80 261 DIAKQ--LEYNDYVTVAVDATKPVEDFELTNLTGGSDGTA---PESWANKFPLLAN---EGGYYLVPL-------TDKQA 325 (569) T ss_pred HHHHh--hcCCceEEEEecCCcceeeecceeecCCCCCCc---cchHHHHHHHHhh---CCcEEEEec-------CCChH Confidence 01111 1123445443322111 1122356889987632 2346667776653 456666543 23457 Q ss_pred HHHHHHHHHHhhCC----EEEEEecCccccccccccCCHHHHHHHhhccccccccccccccceEEEEcCceeEecccCCc Q lcl|NC_014792. 389 VQKHVVSIADERQD----CLAFISPPKGLLVNVPLTRAVDNLIDWRTGGGSFDTDNMNISTTYAAIDGNYKYQYDKYNDV 464 (659) Q Consensus 389 v~~~l~~~~~~~~~----~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~ 464 (659) ++.++.+||+++++ ++++++.+.+ .+.+++..+.. ++++.+.++++||..+.+. +++ T Consensus 326 v~~~l~a~vkr~r~~g~~~~aVvg~~~~--------~~~~~~~~~a~----------~~n~e~vv~v~~~~~~~~~-~g~ 386 (569) T protein:vir:80 326 VHSEALAFVKDRTDNGDPMRIIVGGGTN--------ETVEESITRAT----------NLRDPRASLVGFSGTRKMD-DGR 386 (569) T ss_pred HHHHHHHHHHHHHhCCCcEEEEecCCCC--------CCHHHHHHHHh----------hcCCCeEEEEecCceeecC-CCc Confidence 99999999998865 8999987753 35667776654 4578999999999888764 455 Q ss_pred ceeecH---HHHHHHHHHHhhhcCCceECcCCcchhheeccccceeecChhHHHhhhhCCceEEEEEeCCCeEEEEcc-- Q lcl|NC_014792. 465 NRWVPL---AADMAGLCARTDDVSQPWMSPPGYNRGQILNVLKLAIEPRQTQRDRMYQEAINPVVGFAGGDGFVLYGD-- 539 (659) Q Consensus 465 ~~~~p~---s~~~Ag~~a~~d~~~g~~~span~~~~~i~g~~~~~~~~~~~e~~~Ln~~gin~i~~~~~~~G~~~wG~-- 539 (659) .+.+|+ ++++||++|..+ +++||.|+.+. ..++...+++.|++.|+++|+.+++...+ +..++|.. T Consensus 387 ~~~~~~~~~aa~vAG~~A~~~----~~~S~T~k~i~----~~~i~~~lt~~e~~~li~~G~~~l~~~~~-~~~~v~~~vn 457 (569) T protein:vir:80 387 LLKLPGYMMASQIAGIASGLE----VGEAITFKHFN----VTSVDRVFESSQLDMLNESGVISIEFVRN-RTLTAFRVVQ 457 (569) T ss_pred ceeechhhHHHHHHHHHhcCc----cccCccceeec----cccccccCCHHHHHHHHhCCeEEEEEecC-ceEEEEEEec Confidence 566665 577888888776 78899998753 34667789999999999999999987765 45556633 Q ss_pred --cccC-CCccccceeehhhHHHHHHHHHHHHH-HHHhcCCCCHHHHHHHHHHHHHHHHHHHhccceeeeEEEEccCCCC Q lcl|NC_014792. 540 --KTAT-KVPSPMDHINVRRLTNMLKKNIGDAS-KYKLFELNDNFTRASFRMETSQYLDGIRALGGIYEGRVVCDTTNNT 615 (659) Q Consensus 540 --rT~~-~~~~~~~~i~vrR~~~~i~~~i~~~~-~~~v~epn~~~l~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~nt 615 (659) .|.. .++..|++|+++|++|+|++.|++.. +||+++||+...|..|+..|+.||.+||++|+|.+|... +- T Consensus 458 ~itT~t~~~~~~~~~i~viRv~D~i~~dir~~~~~~yiGk~nn~~~r~~v~~~i~~~L~~l~~~gaI~~~~~~-----dv 532 (569) T protein:vir:80 458 DVTTYNDKSDPVKNEMSVGEANDFLVSELKIELDNNFIGTKVIDTSASLIKNFIQSFLDNKKRAREIQDYTPE-----EV 532 (569) T ss_pred cceecCCCCCchhhhhhhhHHHHHHHHHHHHHHHhhcCcccCChhHHHHHHHHHHHHHHHHHhCCcccCCCcc-----ce Confidence 2222 23457999999999999999998875 689999999999999999999999999999999998532 22 Q ss_pred HHHhhCCEEEEEEEEEecCCceEEEEEEEEeecCeeE Q lcl|NC_014792. 616 PSVIDRNEFVASIYYKPARSINYIVLNFVATSTGADF 652 (659) Q Consensus 616 ~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~ 652 (659) +.++..++++|++.++|+.|+|||+++++....-.+- T Consensus 533 ~v~~~~d~~~v~~~v~Pv~~~ekI~~ti~~~~~~~~~ 569 (569) T protein:vir:80 533 QVVLEGDVASISMTVMPIRSLNKITVQLVYKQQILTA 569 (569) T ss_pred EEEecCCEEEEEEEEEEcccccEEEEEEEEeeeeecC Confidence 3345678999999999999999999999977766665 No 40 >protein:vir:99306 Length: 587 # NCBI annotation: putative major tail sheath protein # Family: family:all:2449 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024479;genbank:gi:48696438;genbank:GeneID:2948034 Probab=100.00 E-value=1.2e-59 Score=343.58 Aligned_cols=560 Identities=15% Similarity=0.078 Sum_probs=316.2 Q ss_pred Cc--------eecCceEEEEecCCCccc-ccCCcceEEEeecccCCCCccEEeCCHHHHHHHcCCcCCCchhHHHHHHHH Q lcl|NC_014792. 1 MA--------LLSPGIELKETTVQSTVV-RNATGRAALVGKFQWGPAFQVTQITNEVELVDLFGGPNNITADYFMSGMNF 71 (659) Q Consensus 1 ~~--------~~~PGVyveE~~~~~~~~-~~~ts~~afvG~~~~Gp~~~p~~i~s~~~~~~~fG~~~~~~~~~~~~~~~f 71 (659) |+ +..|||||||.+++.++. ++++++++|||.+++||+++|++++||.||+++||+-+ +..++.++| T Consensus 1 ~a~~~~~~~~~~~pgv~~~~~~~~~~~~~~~~~~~~~~~g~a~~G~~~~~~~~~~~~~~~~~~~~g~----l~~~~~~a~ 76 (587) T protein:vir:99 1 MAVEPFPRRPITRPHASIEVDTSGIGGSAGSSEKVFCLIGQAEGGEPNTVYELRNYSQAKRLFRSGE----LLDAIELAW 76 (587) T ss_pred CcccccCCcccccCceEEEEecCCccccCCCCCceEEEEEEecCCccceeEEeccHHHHHHHhcCcc----hHHHHHHHh Confidence 33 678999999999999865 55799999999999999999999999999999998733 555666665 Q ss_pred ----HcCCCeEEEEeccCCcccccccccccccccccccccccccccceeeeeeccccccccceeeeeeccCcceeeeecc Q lcl|NC_014792. 72 ----LQYGNDLRTVRVVNRDHAKNASPVAGNIESTIATAGSNYAVGDVIQVKHNQTVVETSGRITKVDVDGKILAVFIPS 147 (659) Q Consensus 72 ----~ngG~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~ 147 (659) .|||++||++||.+.. .|+....++. .++..++.|||.++++...+.......+......++..+++-.. T Consensus 77 ~~~~~~g~~~~~~~rv~~~~---~a~~~~~~l~---~~a~~~G~~gN~i~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 150 (587) T protein:vir:99 77 GSNPNYTAGRILAMRIEDAK---PASAEIGGLK---ITSKIYGNVANNIQVGLEKNTLSDSLRLRVIFQDDRFNEVYDNI 150 (587) T ss_pred ccccCCCceEEEEEEcCCCc---eeEEEecCeE---EEEeeccccccceEEEEccCCCCcceeEEEEEecccceeeeeec Confidence 7999999999995443 2333333333 34567889999999987766554444333222222221222111 Q ss_pred ccccccccccceeeeeccceeeEEEeecCCccccccccceecccc-ceeeecccccccccccceeecccccccceeeecc Q lcl|NC_014792. 148 DKIIAFAKSVNQYPDLGPAWTAEILTTSSGVSGTITLGKIVTDSG-ILLTEAENSEEAITSLEFQASLQKYAMPGVVALY 226 (659) Q Consensus 148 ~~~~~~~~~~~~~~~~~~~~~~~v~~~~~g~~~~~~~~~~v~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~ 226 (659) +. +....+........+.......... ........+ .......... +...............+...+.+ T Consensus 151 g~-------v~~i~y~g~~~~a~~~v~~~~~t~~--a~~~~l~~g~~~v~~yrL~~-g~~~~~~~~~~~i~~~~~~tAky 220 (587) T protein:vir:99 151 GN-------IFTIKYKGEEANATFSVEHDEETQK--ASRLVLKVGDQEVKSYDLTG-GAYDYTNAIITDINQLPDFEAKL 220 (587) T ss_pred cc-------eeeEEeecccccceeeEeecCccee--eeeeeeecCCceeEEEEecC-CchHHHHHHHhhhccccceeEEe Confidence 10 0000000000000000000000000 000000000 0000000000 00000000000000111123334 Q ss_pred ccccceeEEEEEeeccc-ccccceeeeeeeccccccccceeeeeeeccccccceeeeeccCCceeeeeeeeccccccccc Q lcl|NC_014792. 227 PGEIGSTLEVEIVSKAA-YDVGASKMLDIYPNGGSRASVARAVFNYGPQTDDQYAIIVRRDGAIVENVVLSTKEGDKDVY 305 (659) Q Consensus 227 ~g~~g~~i~V~v~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~et~~~~~~~~~~~~~ 305 (659) ++..++.+.... ... ...... .... ...+. ..+ +...........+... .+.... T Consensus 221 ~~~~~~~i~~~~--~~~~~~~~v~-~~~~---------~v~a~-----~~D----~~~~~~~~~~~~~~~~--~g~~~~- 276 (587) T protein:vir:99 221 SPFGDKNLESSK--LDKIENANIK-DKAV---------YVKAV-----FGD----LEKQTAYNGIVSFEQL--NAEGEV- 276 (587) T ss_pred eccCCceeEeec--ccccccceee-eeee---------eeehh-----ccc----eeeecccceeeeeeec--ccccch- Confidence 443333332211 000 000000 0000 00000 000 0000000000000000 000000 Q ss_pred cchhhhhhhhhcccccceEEeecccCCccceeEEeecccccccccchhhhhhhHhhhhhcccccceEEEeccccccchhh Q lcl|NC_014792. 306 GNNIYLDDYFAKGTSNYIYATSLNWPKGFAGIINLMGGISANDQVTAGDLMQGWDLFADREALHINLLIAGAVAGEGDAT 385 (659) Q Consensus 306 ~~~~~~~~~~~~~~s~~v~~~~~~~~~~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~ 385 (659) ...............................|.||.|+... .++..++++++. .+.++++++ .+ T Consensus 277 ---~~~~~~~~~~~~~~~~~~~~~~~~a~~~~t~LtGG~dG~~~---~sy~~al~ale~---~~~~~i~~~-------t~ 340 (587) T protein:vir:99 277 ---PSNVEVEAGEESATVTATSPIKTIEPFELTKLKGGTNGEPP---ATWADKLDKFAH---EGGYYIVPL-------SS 340 (587) T ss_pred ---hhhhhhhhccccceeeeeccccceecccceeeecCCCCCcc---ccHHHHHHHHhh---CCcEEEEec-------CC Confidence 00000000000001111111111111123458899886432 356777777654 345666543 23 Q ss_pred hHHHHHHHHHHHHhhCC----EEEEEecCccccccccccCCHHHHHHHhhccccccccccccccceEEEEcCceeEeccc Q lcl|NC_014792. 386 ASTVQKHVVSIADERQD----CLAFISPPKGLLVNVPLTRAVDNLIDWRTGGGSFDTDNMNISTTYAAIDGNYKYQYDKY 461 (659) Q Consensus 386 ~~~v~~~l~~~~~~~~~----~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~ 461 (659) ..+++.++.+||+++++ ++++++.+.+ .+.+++..... .+++.+.++++|+..+. .. T Consensus 341 d~~i~a~l~a~vk~~r~~g~~~~aVlg~~~~--------~~~~~~~~~a~----------~~n~e~vi~v~~~~~~~-~~ 401 (587) T protein:vir:99 341 KQSVHAEVASFVKERSDAGEPMRAIVGGGFN--------ESKEQLFGRQA----------SLSNPRVSLVANSGTFV-MD 401 (587) T ss_pred CHHHHHHHHHHHHHHHhCCCcEEEEecCCCC--------CCHHHHHHHhh----------hcCCCcEEEEeccceEe-cC Confidence 45788999999988765 8999987643 35566666543 34678888888876543 23 Q ss_pred CCcceeecH---HHHHHHHHHHhhhcCCceECcCCcchhheeccccceeecChhHHHhhhhCCceEEEEEeCCC--eEEE Q lcl|NC_014792. 462 NDVNRWVPL---AADMAGLCARTDDVSQPWMSPPGYNRGQILNVLKLAIEPRQTQRDRMYQEAINPVVGFAGGD--GFVL 536 (659) Q Consensus 462 ~~~~~~~p~---s~~~Ag~~a~~d~~~g~~~span~~~~~i~g~~~~~~~~~~~e~~~Ln~~gin~i~~~~~~~--G~~~ 536 (659) ++....+|+ ++++||++|..| +++||.|+.+. ..++...+++.|++.|+++|++++....+.. ++++ T Consensus 402 dg~~~~~~~~~~aa~vAGl~Ag~~----~~~SlT~~~i~----~~~v~~~~t~~e~e~li~~Gvl~l~~~~~~~~~~vri 473 (587) T protein:vir:99 402 DGRKNHVPAYMVAVALGGLASGLE----IGESITFKPLR----VSSLDQIYESIDLDELNENGIISIEFVRNRTNTFFRI 473 (587) T ss_pred CCceeeechHHHHHHHHHHHhcCc----hhcCccceeee----cccccccCCHHHHHHHHhCCeEEEEEecCCcceEEEE Confidence 456667777 688999999987 67799998753 3466778999999999999999998665432 2332 Q ss_pred -EcccccC-CCccccceeehhhHHHHHHHHHHHHH-HHHhcCCCCHHHHHHHHHHHHHHHHHHHhccceeeeEEEEccCC Q lcl|NC_014792. 537 -YGDKTAT-KVPSPMDHINVRRLTNMLKKNIGDAS-KYKLFELNDNFTRASFRMETSQYLDGIRALGGIYEGRVVCDTTN 613 (659) Q Consensus 537 -wG~rT~~-~~~~~~~~i~vrR~~~~i~~~i~~~~-~~~v~epn~~~l~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~ 613 (659) .+-.|.. .++..|++|+++|++|+|++.|++.+ ++|+++||+...|..|+..|..||.+||+.|+|.+|... T Consensus 474 v~~ItT~t~~~~~~~~~i~viRv~D~i~~di~~~~~~~yiGk~Nn~~~r~~i~~~i~~~L~~l~~~gaI~~~~~~----- 548 (587) T protein:vir:99 474 VDDVTTFNDKSDPVKAEMAVGEANDFLVSELKVQLEDQFIGTRTINTSASIIKDFIQSYLGRKKRDNEIQDFPAE----- 548 (587) T ss_pred eeceeeccCCCCchhhhhhhhhhHHHHHHHHHHHHHhhCCccccchHHHHHHHHHHHHHHHHHHhCCcccCCCcc----- Confidence 4545543 33457999999999999999999876 689999999999999999999999999999999998642 Q ss_pred CCHHHhhCCEEEEEEEEEecCCceEEEEEEEEeecCeeE Q lcl|NC_014792. 614 NTPSVIDRNEFVASIYYKPARSINYIVLNFVATSTGADF 652 (659) Q Consensus 614 nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~ 652 (659) ..+-+....++++++.+.|+.|+|+|.+++.....-.+- T Consensus 549 dv~v~~~~d~~~v~~~v~Pv~~mekIy~tv~~~~~~~~~ 587 (587) T protein:vir:99 549 DVQVIVEGNEARISMTVYPIRSFKKISVSLVYKQQTLQA 587 (587) T ss_pred ceEEEecCCEEEEEEEEEEcccceEEEEEEEEEeeeecC Confidence 112223445799999999999999999999876665555 No 41 >protein:vir:96586 Length: 587 # NCBI annotation: ORF011 # Family: family:all:2449 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238553;genbank:gi:66391266;genbank:GeneID:5130368 Probab=100.00 E-value=6.5e-57 Score=328.61 Aligned_cols=560 Identities=14% Similarity=0.056 Sum_probs=315.0 Q ss_pred CceecCceEEEEecCCCccccc-CCcceEEEeecccCCCCccEEeCCHHHHHHHcCCcCCCchhHHHHHHHH----HcCC Q lcl|NC_014792. 1 MALLSPGIELKETTVQSTVVRN-ATGRAALVGKFQWGPAFQVTQITNEVELVDLFGGPNNITADYFMSGMNF----LQYG 75 (659) Q Consensus 1 ~~~~~PGVyveE~~~~~~~~~~-~ts~~afvG~~~~Gp~~~p~~i~s~~~~~~~fG~~~~~~~~~~~~~~~f----~ngG 75 (659) =-|..||||||+.+++..++.+ ++++.+|||.+++||+++|++|++|.||++.||+.. +..++.++| .||| T Consensus 9 ~~~~~Pgv~~~~~~~~~~~~~~~~~~~~~~iG~a~~g~~~~~~~~~~~~~~~~~~g~G~----l~~ai~~a~~~~~~~g~ 84 (587) T protein:vir:96 9 RPIQRPHASIEVDSSGIGGSASNSEKILCLIGKAEGGEPNTVYQVRNYAQAKSVFRSGE----LLDAIELAWGSNPQYTA 84 (587) T ss_pred CcccCCceEEEEecCCccCCCCCCCceEEEEEEecCCCCceeEEEcChHHHHHhhcCCc----HHHHHHHHhccCcCCCc Confidence 2367899999999999976655 699999999999999999999999999999999753 556677777 6999 Q ss_pred CeEEEEeccCCcccccccccccccccccccccccccccceeeeeeccccccccceeeeeeccCcceeeeecccccccccc Q lcl|NC_014792. 76 NDLRTVRVVNRDHAKNASPVAGNIESTIATAGSNYAVGDVIQVKHNQTVVETSGRITKVDVDGKILAVFIPSDKIIAFAK 155 (659) Q Consensus 76 ~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~ 155 (659) +.||.|||.+.. .++.....+. .++...+.|++.+++...........+....-..++...++-.-+. T Consensus 85 ~~~~a~rv~~~~---~a~~~~~~~~---~~~~~~g~~~n~i~v~~~~~~~~~~~~~~~~~~~~~~~~~~~n~G~------ 152 (587) T protein:vir:96 85 GKILAMRVEDAK---ASQLEKGGLR---VTSKIFGSVSNDIQVALEKNTITDSLRLRVVFQKDNYQEVFDNLGN------ 152 (587) T ss_pred eEEEEEecCCCc---cceeeccccc---ccccccCCCCceEEEEEEeccCCCccceEEEEecCCceeeccccCc------ Confidence 999999995533 2222222222 2344457799999987754332222222221112222222110000 Q ss_pred ccceeeeeccceeeEEEeecCCccccccccceecc-ccceeeecccccccccccceeecccccccceeeeccccccceeE Q lcl|NC_014792. 156 SVNQYPDLGPAWTAEILTTSSGVSGTITLGKIVTD-SGILLTEAENSEEAITSLEFQASLQKYAMPGVVALYPGEIGSTL 234 (659) Q Consensus 156 ~~~~~~~~~~~~~~~v~~~~~g~~~~~~~~~~v~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~g~~i 234 (659) +....+...................+ ..++.. .......... ..+...............+...+.|++..++.+ T Consensus 153 -v~~i~y~g~~~~a~~~~~~~~~~~~A--~~l~l~gg~~~v~~yrl-~~g~~~~~~~~~~~~~~~~~~tAky~g~~~n~~ 228 (587) T protein:vir:96 153 -IFSINYKGEGEKATFSVEKDKETQEA--KRLVLKVDEKEVKAYEL-NGGAYSFTNEIITDINELPDFEAKLSPFGDKNL 228 (587) T ss_pred -eEEEEecccccceeEeeccCccccee--eeeEEEecCceEEEEEe-CCCchhhhhhhhhhhccccceEEEeecccCcee Confidence 00000000000000000000000000 000000 0000000000 000000000000000112233455666666666 Q ss_pred EEEEeecccccccceeeeeeeccccccccceeeeeeeccccccceeeeeccCCceeeeeeeeccccccccccchhhhhhh Q lcl|NC_014792. 235 EVEIVSKAAYDVGASKMLDIYPNGGSRASVARAVFNYGPQTDDQYAIIVRRDGAIVENVVLSTKEGDKDVYGNNIYLDDY 314 (659) Q Consensus 235 ~V~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~et~~~~~~~~~~~~~~~~~~~~~~ 314 (659) .+.+... ...... ........+....+ .. ....++ .++ +......... ...... T Consensus 229 ~v~v~d~-~~~~~~---k~~~~y~~t~~~di------------~~--~~~~~~-~~~-~~~~~~~~~~------~~~~~v 282 (587) T protein:vir:96 229 ESRKLDE-ATDVDI---KGKAVYVKAVFGDI------------EN--QTQYNQ-YVK-FEQLPEQASE------PSDVEV 282 (587) T ss_pred EEEeecc-cccccc---ceEEEeehhhhhhh------------hh--hhcccc-cee-eccccchhhh------hhcccc Confidence 5543210 000000 00000000000000 00 000000 000 0000000000 000000 Q ss_pred hhcccccceEEeecccCCccceeEEeecccccccccchhhhhhhHhhhhhcccccceEEEeccccccchhhhHHHHHHHH Q lcl|NC_014792. 315 FAKGTSNYIYATSLNWPKGFAGIINLMGGISANDQVTAGDLMQGWDLFADREALHINLLIAGAVAGEGDATASTVQKHVV 394 (659) Q Consensus 315 ~~~~~s~~v~~~~~~~~~~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~v~~~l~ 394 (659) .........................|.||.++.. ..++...+++++. .++++++++ ++.++++.++. T Consensus 283 ~~~~~~~~~~~~~~~~~~~~~~~~aLtGG~dG~~---~~~y~~~l~ale~---~~~~~i~~~-------t~d~ai~~~l~ 349 (587) T protein:vir:96 283 HAETESATVTATSKPKAIEPFELTKLSGGTNGEP---PTSWSAKLEKFKN---EGGYYIVPL-------TDRQSVHSEVA 349 (587) T ss_pred cccccceeeeecccccccccccceeeecCCCCCC---cccHHHHHHHHhh---CCcEEEEec-------CCCHHHHHHHH Confidence 0000000000011000111112245889987643 2346666666653 456776654 23457889999 Q ss_pred HHHHhhCC----EEEEEecCccccccccccCCHHHHHHHhhccccccccccccccceEEEEcCceeEecccCCcceeecH Q lcl|NC_014792. 395 SIADERQD----CLAFISPPKGLLVNVPLTRAVDNLIDWRTGGGSFDTDNMNISTTYAAIDGNYKYQYDKYNDVNRWVPL 470 (659) Q Consensus 395 ~~~~~~~~----~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~ 470 (659) +||+++++ ++++++.+.+ .+.+++...+. .+++.+.++++++..+.+. ++....+|+ T Consensus 350 a~vk~~r~~gk~~~aVlg~~~~--------~~~~~~~~~a~----------~~n~e~vi~v~~~~~~~~~-~~~~~~~~~ 410 (587) T protein:vir:96 350 TFVKNRSDAGEPMRAIVGGGTS--------ETKEKLFGRQA----------ILNNPRVALVANSGKFVMG-NGRILQAPA 410 (587) T ss_pred HHHHHHHhCCCeEEEEecCCCC--------CCHHHHHHHHh----------hcCCCcEEEEecceEEecC-CCceeeech Confidence 99988865 8999986643 35555655443 4568889999998887765 344444543 Q ss_pred ---HHHHHHHHHHhhhcCCceECcCCcchhheeccccceeecChhHHHhhhhCCceEEEEEeCCCeEEEEcc-cccC--- Q lcl|NC_014792. 471 ---AADMAGLCARTDDVSQPWMSPPGYNRGQILNVLKLAIEPRQTQRDRMYQEAINPVVGFAGGDGFVLYGD-KTAT--- 543 (659) Q Consensus 471 ---s~~~Ag~~a~~d~~~g~~~span~~~~~i~g~~~~~~~~~~~e~~~Ln~~gin~i~~~~~~~G~~~wG~-rT~~--- 543 (659) ++++||++|..+ +++||.|+.+. + .++...+++.|++.|.++|+.+++...+ ++.++|.. +++. T Consensus 411 ~~~aa~vAG~~Ag~~----~~~S~T~~~~~---~-~~v~~~~t~~e~~~~i~~G~~~l~~~~~-~~~~v~~~vnsitT~t 481 (587) T protein:vir:96 411 YMVASAVAGLVSGLD----IGESITFKPLF---V-NSLDKVYESEELDELNENGIITIEFVRN-RMTTMFRIVDDVTTFP 481 (587) T ss_pred hhHHHHHHHHHhcCc----cccCccceeee---c-ccccccCCHHHHHHHHhCCeEEEEEecC-CcEEEEEeeccceecC Confidence 688999999887 67799998753 2 3567789999999999999999987665 45556633 4332 Q ss_pred -CCccccceeehhhHHHHHHHHHHHHH-HHHhcCCCCHHHHHHHHHHHHHHHHHHHhccceeeeEEEEccCCCCHHHhhC Q lcl|NC_014792. 544 -KVPSPMDHINVRRLTNMLKKNIGDAS-KYKLFELNDNFTRASFRMETSQYLDGIRALGGIYEGRVVCDTTNNTPSVIDR 621 (659) Q Consensus 544 -~~~~~~~~i~vrR~~~~i~~~i~~~~-~~~v~epn~~~l~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~nt~~~i~~ 621 (659) .++..|++|+++|++|+|.+.|++.+ ++|++|||+...|..|+..|..||.+|++.|+|.+|... +-+-++.. T Consensus 482 ~~~~~~~~~i~virv~D~i~~di~~~~~~~yiGk~nn~~~r~~v~~~i~~~L~~l~~~g~I~~~~~~-----dv~v~~~~ 556 (587) T protein:vir:96 482 DKNDPVKSEMALGEANDFLVSELKILLEEQYIGTRTINTSASQIKDFVQSYLGRKKRDNEIQDFPPE-----DVQVIIEG 556 (587) T ss_pred CCCCchhhhhhhHHHHHHHHHHHHHHHHhcCCccccCHHHHHHHHHHHHHHHHHHHhCCcccCCCcc-----ceEEEecC Confidence 23457999999999999999999886 689999999999999999999999999999999998642 12223345 Q ss_pred CEEEEEEEEEecCCceEEEEEEEEeecCeeE Q lcl|NC_014792. 622 NEFVASIYYKPARSINYIVLNFVATSTGADF 652 (659) Q Consensus 622 G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~ 652 (659) .+++|++.+.|+.|+|||++++.....-++- T Consensus 557 D~~~v~~~v~Pv~~mekIy~tv~~~~~~~~~ 587 (587) T protein:vir:96 557 NEARISLTIFPIRALKKISVSLVYRQQTLQA 587 (587) T ss_pred CEEEEEEEEEEcccceEEEEEEEEEeeeecC Confidence 5799999999999999999999865555444 No 42 >protein:vir:102957 Length: 437 # NCBI annotation: sheath tail protein # Family: family:all:632 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945292;genbank:gi:39653727;uniprot:Q708M0;genbank:GeneID:2672871 Probab=100.00 E-value=1.6e-50 Score=293.52 Aligned_cols=417 Identities=11% Similarity=0.107 Sum_probs=267.8 Q ss_pred CceecCceEEEEecCCCccc-ccCCcceEEEeecccCCCCccEEeCCHHHHHHHcCCcCCCchhHHHHHHHHHcCCCeEE Q lcl|NC_014792. 1 MALLSPGIELKETTVQSTVV-RNATGRAALVGKFQWGPAFQVTQITNEVELVDLFGGPNNITADYFMSGMNFLQYGNDLR 79 (659) Q Consensus 1 ~~~~~PGVyveE~~~~~~~~-~~~ts~~afvG~~~~Gp~~~p~~i~s~~~~~~~fG~~~~~~~~~~~~~~~f~ngG~~~~ 79 (659) +.-.-|||||||++.+.+.+ ++.|+++||+|.++|||+++|++|+||.||++.||+... +..+....+|++||++|| T Consensus 9 ~~k~~PGvYi~~~~~~~~~i~~~~~~~~a~~~~~~~Gp~~~~~~i~s~~d~~~~fG~~~~--~~~~~~~~~~~~g~~~~~ 86 (437) T protein:vir:10 9 QNKVRPGAYINVKSKDIAMTRLGGDGVVTVPLALSFGQSKKLMKIRRGEDLFKKLGYEQE--SPQLLLLNEAFKRVSEVL 86 (437) T ss_pred cceecCceeEEEecCCcceeeccCCcEEEEEEEecCCCCceeEEEecHHHHHHHcCCccc--hhHHHHHHHHhcCCCEEE Confidence 66789999999999998755 556999999999999999999999999999999997543 445556666779999999 Q ss_pred EEeccCCcccccccccccccccccccccccccccceeeeeeccccccccceeeeeeccCcceeeeeccccccccccccce Q lcl|NC_014792. 80 TVRVVNRDHAKNASPVAGNIESTIATAGSNYAVGDVIQVKHNQTVVETSGRITKVDVDGKILAVFIPSDKIIAFAKSVNQ 159 (659) Q Consensus 80 vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~ 159 (659) ++|+.++..+ ...+.+. T Consensus 87 ~~R~~~g~~a--~~tl~~~------------------------------------------------------------- 103 (437) T protein:vir:10 87 LYRLNTGEKA--NVSLSDN------------------------------------------------------------- 103 (437) T ss_pred EEECCCCcee--eEeeccc------------------------------------------------------------- Confidence 9998642110 0000000 Q ss_pred eeeeccceeeEEEeecCCccccccccceeccccceeeecccccccccccceeecccccccceeeeccccccceeEEEEEe Q lcl|NC_014792. 160 YPDLGPAWTAEILTTSSGVSGTITLGKIVTDSGILLTEAENSEEAITSLEFQASLQKYAMPGVVALYPGEIGSTLEVEIV 239 (659) Q Consensus 160 ~~~~~~~~~~~v~~~~~g~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~g~~i~V~v~ 239 (659) ..+.+.++|.|||.++|.+. T Consensus 104 ------------------------------------------------------------~~~~A~~~G~~gn~i~v~v~ 123 (437) T protein:vir:10 104 ------------------------------------------------------------VTAQAKYSGVRGNDITVTVK 123 (437) T ss_pred ------------------------------------------------------------eEEEeccCCcccceeEEEEe Confidence 00112345555555555443 Q ss_pred ecccccccceeeeeeeccccccccceeeeeeeccccccceeeeeccCCceeeeeeeeccccccccccchhhhhhhhhccc Q lcl|NC_014792. 240 SKAAYDVGASKMLDIYPNGGSRASVARAVFNYGPQTDDQYAIIVRRDGAIVENVVLSTKEGDKDVYGNNIYLDDYFAKGT 319 (659) Q Consensus 240 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~et~~~~~~~~~~~~~~~~~~~~~~~~~~~ 319 (659) ....... .+.+.+......++...+..... ... T Consensus 124 ~~~~d~~-------------------------------~~~v~~~~~~~~~d~~~v~~~~~-------------~~~--- 156 (437) T protein:vir:10 124 TNVDDPS-------------------------------SFDVVTFLDTVVMDLQTVKVLAD-------------LKN--- 156 (437) T ss_pred eccCCcc-------------------------------ceEEEEecCcceeeeeehhhhhh-------------hhh--- Confidence 2110000 00000000111111110000000 000 Q ss_pred ccceEEeecccCCccceeEEeecccccccccchhhhhhhHhhhhhcccccceEEEeccccccchhhhHHHHHHHHHHHHh Q lcl|NC_014792. 320 SNYIYATSLNWPKGFAGIINLMGGISANDQVTAGDLMQGWDLFADREALHINLLIAGAVAGEGDATASTVQKHVVSIADE 399 (659) Q Consensus 320 s~~v~~~~~~~~~~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~v~~~l~~~~~~ 399 (659) ..++.......+. .....++.||.++. ++..++.++++.++ .++++++|+|.. ..+++.++.+||++ T Consensus 157 n~~v~~~~~~~l~-~~a~~~LtGG~dg~--~t~~dy~~al~~le---~~~~n~l~~~~~-------d~~~~t~~~~~ik~ 223 (437) T protein:vir:10 157 NALVEFSGTGELQ-PVAGAKLTGGTDGA--ISTQDYLEYFKALE---TVEFNYMALPVE-------DASIKKAAINFIKR 223 (437) T ss_pred hcccccccccccc-cccceeeeccccCC--CChhHHHHHHHHhc---cCcceEEEecCC-------ChhHHHHHHHHHHH Confidence 0011111111111 11235788998873 45667777777664 456899999853 34577888899887 Q ss_pred hCC----EE-EEEecCccccccccccCCHHHHHHHhhccccccccccccccceEEEEcCceeEecccCCcceeecHHHHH Q lcl|NC_014792. 400 RQD----CL-AFISPPKGLLVNVPLTRAVDNLIDWRTGGGSFDTDNMNISTTYAAIDGNYKYQYDKYNDVNRWVPLAADM 474 (659) Q Consensus 400 ~~~----~~-ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~ 474 (659) +|+ .+ +++..+. . ++.....+.+-....|. ...--.-.++.+ T Consensus 224 ~r~~~g~~~~~V~~~~~---------~----------------------d~e~Iin~~n~~~~~~~--~~~~~~~~~a~v 270 (437) T protein:vir:10 224 MREDEGLGAQLVVADSD---------A----------------------DSEAVINVKNGVILSDK--TVIDKTKATVWV 270 (437) T ss_pred HHhccCceEEEEeCCCC---------C----------------------CCceEEEeecceeecCc--ceechhhHHHHH Confidence 754 33 4442210 0 01111112221111110 001112245788 Q ss_pred HHHHHHhhhcCCceECcCCcchhheeccccceeecChhHHHhhhhCCceEEEEEeCCCeEEEEcccccC----CCccccc Q lcl|NC_014792. 475 AGLCARTDDVSQPWMSPPGYNRGQILNVLKLAIEPRQTQRDRMYQEAINPVVGFAGGDGFVLYGDKTAT----KVPSPMD 550 (659) Q Consensus 475 Ag~~a~~d~~~g~~~span~~~~~i~g~~~~~~~~~~~e~~~Ln~~gin~i~~~~~~~G~~~wG~rT~~----~~~~~~~ 550 (659) ||++|..+ +++|+.|+.+ .++..+...+++.|++.|.++|+.++.+ .+++-+.++|-.|+. ..+.+|+ T Consensus 271 AG~~Ag~~----~~~S~t~~~~---~~~~~v~~~~t~~e~~~~i~~G~~vl~~-~~~~v~i~~gInTltt~~~~~~~~~~ 342 (437) T protein:vir:10 271 AAASANAG----VEKSLTYEKY---EDSVDVVGRLSHTETEDALLKGQFVFTA-RRGRAVVEQDINSHVSFTIEKNQDFR 342 (437) T ss_pred HHHhccCc----cccCcccccc---CCcccccccCCHHHHHHHHhCCcEEEEE-eCCeEEEEEccccccccCCCCCchhh Confidence 99999875 6679999864 4566677799999999999999988864 454444558877764 2346899 Q ss_pred eeehhhHHHHHHHHHHHHHHH-HhcC-CCCHHHHHHHHHHHHHHHHHHHhccceeeeEEEEccCCCCHHHhhCCEEEEEE Q lcl|NC_014792. 551 HINVRRLTNMLKKNIGDASKY-KLFE-LNDNFTRASFRMETSQYLDGIRALGGIYEGRVVCDTTNNTPSVIDRNEFVASI 628 (659) Q Consensus 551 ~i~vrR~~~~i~~~i~~~~~~-~v~e-pn~~~l~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~~i 628 (659) +|.++|++|+|.+.|++.+.. |+++ |||...|..++..|+.||++|+++|+|.+|.+...+..+.. ....+++.+ T Consensus 343 ki~vir~~D~i~~di~~~~~~~yiGk~~N~~~~r~~~~~~i~~yl~~l~~~g~I~~~~~~d~~v~~~~---~~~~v~v~~ 419 (437) T protein:vir:10 343 KNRILRTLDDIVNDTRYAFSEYFLGKVSNNEDGRQAFKANRIRYFKDLEARGAIEDFKVEDIEVLRGE---LKESVVVNV 419 (437) T ss_pred hhhHHHHHHHHHHHHHHHHHhccccccCCCHHHHHHHHHHHHHHHHHHHhCCCccCCCceeEEeecCC---CCCEEEEEE Confidence 999999999999999998774 9997 79999999999999999999999999999998766543221 346889999 Q ss_pred EEEecCCceEEEEEEEEe Q lcl|NC_014792. 629 YYKPARSINYIVLNFVAT 646 (659) Q Consensus 629 ~~~p~~p~e~i~~~~~~~ 646 (659) .++|+.+||+|.+++... T Consensus 420 ~v~~~dame~iy~ti~v~ 437 (437) T protein:vir:10 420 KVKPVDSMEKLYMTVTVE 437 (437) T ss_pred EEEEeeeeeeEEEEEEec Confidence 999999999999998865 No 43 >protein:vir:100829 Length: 607 # NCBI annotation: hypothetical protein # Family: family:all:2449 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164738;genbank:gi:56693151;genbank:GeneID:3197462 Probab=100.00 E-value=3.7e-50 Score=291.60 Aligned_cols=542 Identities=15% Similarity=0.132 Sum_probs=306.7 Q ss_pred Cc-eecCceEEEEecCCCcccc-cCCcceEEEeecccCCCCccEEeCCHHHHHHHcCCcC--CCchhHHHHHHHHHcCCC Q lcl|NC_014792. 1 MA-LLSPGIELKETTVQSTVVR-NATGRAALVGKFQWGPAFQVTQITNEVELVDLFGGPN--NITADYFMSGMNFLQYGN 76 (659) Q Consensus 1 ~~-~~~PGVyveE~~~~~~~~~-~~ts~~afvG~~~~Gp~~~p~~i~s~~~~~~~fG~~~--~~~~~~~~~~~~f~ngG~ 76 (659) |- +..|||||++.+++..++. +++++.+|||.+++||+|+|++++||.|+++.||+.. +.-.+.|....||.|||+ T Consensus 17 ~~~~~~pgv~~~~~~~~~~~~~~~~~~~~~~iG~a~~G~~~~~~~~~~~~~a~~~f~~g~l~~a~~~a~~~~~~~~~g~~ 96 (607) T protein:vir:10 17 LFYDSRPHVETNFDDSRLSNTASDSAKNIFMLGSATNGDPTKVYEIRTSQQATKIFGSGDLVDGIKLAFDPTGNSVTNGG 96 (607) T ss_pred CCCccCCceEEEEecCcCcCCCCCCcceEEEEEEeCCCCCceEEEEcchhHHHHhhcCcchHHHHHHhhccccCCccCCc Confidence 33 5579999999999998665 4699999999999999999999999999999997632 333455556667799999 Q ss_pred eEEEEeccCCcccccccccccccccccccccccccccceeeeeeccccccccceeee-eeccCcceeeeecccccccccc Q lcl|NC_014792. 77 DLRTVRVVNRDHAKNASPVAGNIESTIATAGSNYAVGDVIQVKHNQTVVETSGRITK-VDVDGKILAVFIPSDKIIAFAK 155 (659) Q Consensus 77 ~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~-~~~~g~~~~~~~~~~~~~~~~~ 155 (659) .||+|||.+...+ +....++ ..++...+.+++.+++... +......++.. ...+ +..+++-..+. T Consensus 97 ~~~~~rv~~~~~a---~~~~~~~---~~~~~~~~~~~~~i~~~l~-~~~~~~~~~~~~~~~d-~~~~~~~n~g~------ 162 (607) T protein:vir:10 97 TVYALRVDNAKQA---SLVKDGL---TFTSSIFGTNANQVSVALD-NDVFGVPRITVNYSPD-NYERTYTNIGQ------ 162 (607) T ss_pred eEEEEeCCCcccc---ceecccc---cccccccccCCCceEEEEE-ecCCCccceeEEeecc-cceeeeeeccc------ Confidence 9999999554322 2211111 1234556678888888662 22222222211 1111 11111110000 Q ss_pred ccceeeeeccceeeEEEeecCCccccccccceec---cccceeeecc-ccccc--------ccccceeecc--------- Q lcl|NC_014792. 156 SVNQYPDLGPAWTAEILTTSSGVSGTITLGKIVT---DSGILLTEAE-NSEEA--------ITSLEFQASL--------- 214 (659) Q Consensus 156 ~~~~~~~~~~~~~~~v~~~~~g~~~~~~~~~~v~---~~~~~~~~~~-~~~~~--------~~~~~~~~~~--------- 214 (659) .+. ..+.+........ ++. +.+..+.... ..... +....+.... T Consensus 163 ----------~~~----i~y~g~~~~a~~~-v~~~~~g~~~~lt~~~~~~~~~~~~V~~~~l~~~~~~t~~~l~~din~~ 227 (607) T protein:vir:10 163 ----------MFS----ITYSGKSASAGYT-VSHDTDGKAILLTLGSGDSIDKLTNVATFDLTMSKYDTIAKLMQAISAT 227 (607) T ss_pred ----------eee----cccCcccccccce-eeecCCCceeEEEecCCCccceeeeeecccccccccchHHHHHHHhhcC Confidence 000 0111111111000 000 0000000000 00000 0000000000 Q ss_pred -----cccccceeeeccccccceeEEEEEeecccccccceeeeeeeccccccccceeeeeeeccccccceeeeeccCCce Q lcl|NC_014792. 215 -----QKYAMPGVVALYPGEIGSTLEVEIVSKAAYDVGASKMLDIYPNGGSRASVARAVFNYGPQTDDQYAIIVRRDGAI 289 (659) Q Consensus 215 -----~~~~~~~~~a~~~g~~g~~i~V~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~ 289 (659) .......+...+.+..++.+.+.+. ..+.. ... ..+........ T Consensus 228 ~~~~A~~~g~~~i~tky~d~~~~~i~V~~~---------------------------~~iv~-a~~---~D~~~~~~~~~ 276 (607) T protein:vir:10 228 PNFSASVVGSPSVNTSYLDEVTSPVDVKTA---------------------------PAVVT-AKI---GDAISKLGYDP 276 (607) T ss_pred CceEEEEecccceeeeccccccceeEEEEe---------------------------eeeec-hhh---hhhhhcccccc Confidence 0000000111111111111111110 00000 000 00000000000 Q ss_pred eeeeeeeccccccccccchhhhhhhhhcccccceEEeecccCC---ccceeEEeecccccccccchhhhhhhHhhhhhcc Q lcl|NC_014792. 290 VENVVLSTKEGDKDVYGNNIYLDDYFAKGTSNYIYATSLNWPK---GFAGIINLMGGISANDQVTAGDLMQGWDLFADRE 366 (659) Q Consensus 290 ~et~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~v~~~~~~~~~---~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~~~ 366 (659) ...+... .+... +...... ...+..+.....+. .......|.||.|+.. ..++...++.++. T Consensus 277 ~~~~t~~--~~~~~-------~~~~~~~-~~~~~~~~~~~~~~~~~a~~a~~~LtGGtdG~~---~~ty~dal~aLe~-- 341 (607) T protein:vir:10 277 YVVVTQT--SNNKP-------IVNGVSA-GTGSATASVTTAPESFPANFDTAFLTGGSTGDV---PVSWADKFNGAIG-- 341 (607) T ss_pred eEEeeec--ccchh-------hhhhhhc-cccceeeeeeccccccccccceeeeeCCCCCCc---hhhHHHHHHHHhh-- Confidence 0000000 00000 0000000 00111111111111 1112345889988743 2345666666654 Q ss_pred cccceEEEeccccccchhhhHHHHHHHHHHHHhhCC----EEEEEecCccccccccccCCHHHHHHHhhccccccccccc Q lcl|NC_014792. 367 ALHINLLIAGAVAGEGDATASTVQKHVVSIADERQD----CLAFISPPKGLLVNVPLTRAVDNLIDWRTGGGSFDTDNMN 442 (659) Q Consensus 367 ~~~~~~~~~p~~~~~~~~~~~~v~~~l~~~~~~~~~----~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 442 (659) ++.++++++. ...+++.++.+||+++++ ++++++.+.+ .+.+++..+.. . T Consensus 342 -~e~~~i~~~t-------~d~ai~~~l~a~vkr~~~~g~~~~aVlg~~~~--------~t~~~~~t~a~----------~ 395 (607) T protein:vir:10 342 -NNVYYIIPLT-------SEENIHAELQAFIDEQHVLGYNYHAFVGGGFA--------EPLEQILSRQV----------N 395 (607) T ss_pred -cCceEEEecC-------CCHHHHHHHHHHHHHHHhCCCcEEEEecCCCC--------CCHHHHHHHHH----------h Confidence 3455655432 345789999999988765 8899887643 45667776654 3 Q ss_pred cccceEEEEcCceeEecccCCcceeecH---HHHHHHHHHHhhhcCCceECcCCcchhheeccccceeecChhHHHhhhh Q lcl|NC_014792. 443 ISTTYAAIDGNYKYQYDKYNDVNRWVPL---AADMAGLCARTDDVSQPWMSPPGYNRGQILNVLKLAIEPRQTQRDRMYQ 519 (659) Q Consensus 443 ~~s~~~~~~~p~~~~~d~~~~~~~~~p~---s~~~Ag~~a~~d~~~g~~~span~~~~~i~g~~~~~~~~~~~e~~~Ln~ 519 (659) +++.+..+++|+..+.| .+..+.+|+ ++++||++|..+ +.+||.|+.+. ..++...+++.|++.|.+ T Consensus 396 ~N~ervv~V~~~~~~~~--~G~~~~~~~~~~Aa~vAGl~Ag~~----~~~SlT~k~i~----~~~v~~~lt~~e~e~ai~ 465 (607) T protein:vir:10 396 INDSRFGLVGQSGHVQE--GGESVHVPAYLMAAYVGGLSSSLG----VAVPITNKKLA----LVDLDQNFSGDDLNTLNQ 465 (607) T ss_pred hCCCcEEEEecCeeEee--CCcceeccHHHHHHHHHHHHhcCc----cccCcccceec----cccccccCCHHHHHHHHh Confidence 46788999999887755 345556665 688999999887 66799998753 346777899999999999 Q ss_pred CCceEEEEEeC---CCeEEEEccccc--CCCccccceeehhhHHHHHHHHHHHHH-HHHhcCCCCHHHHHHHHHHHHHHH Q lcl|NC_014792. 520 EAINPVVGFAG---GDGFVLYGDKTA--TKVPSPMDHINVRRLTNMLKKNIGDAS-KYKLFELNDNFTRASFRMETSQYL 593 (659) Q Consensus 520 ~gin~i~~~~~---~~G~~~wG~rT~--~~~~~~~~~i~vrR~~~~i~~~i~~~~-~~~v~epn~~~l~~~i~~~i~~~l 593 (659) +|+.++....+ .++++++.+-|. ..++..|++|+++|++|+|.+.|++.. ++|++++|+...|..++..+..|| T Consensus 466 ~Gv~~l~~~~~~~~~~~vrIv~~ItT~t~~~~~~~~~i~viRv~D~i~~dir~~~~~~yIGk~nnd~~~~~vk~~i~~~L 545 (607) T protein:vir:10 466 NGVIGIEHLVNRNATGGYYIVQDVSTNTVSSSHVDGSLYLGELTDFLFDNLRFVLRDTYIGSNIRSTSADDIKSTVASYL 545 (607) T ss_pred CCeEEEEEccCccccceEEEeeeeeeccCCCCcchheeehhhhHHHHHHHHHHHHhhcCCcccCCcchHHHHHHHHHHHH Confidence 99988864332 136777666554 233468999999999999999999876 589999999999999999999999 Q ss_pred HHHHh--ccceeeeEEEEccCCCCHHHhhCCEEEEEEEEEecCCceEEEEEEEEeecCeeEEEecCC Q lcl|NC_014792. 594 DGIRA--LGGIYEGRVVCDTTNNTPSVIDRNEFVASIYYKPARSINYIVLNFVATSTGADFDELIGV 658 (659) Q Consensus 594 ~~l~~--~gal~g~~v~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~~~ 658 (659) ..+|+ .|+|.+|... +-+-.....++++++.+.|+.++|+|++++.....-.+-++---. T Consensus 546 ~~~~l~~~gaI~df~~e-----dv~v~~~~D~v~v~~~v~Pv~~iekIyvtv~v~~~~~~~~~~~~~ 607 (607) T protein:vir:10 546 YSEMNNDDGLIVDFSES-----DIVVTISGTVVYIQFAVAPTQEIKNIVVSGTYSNYSATSEDNTTK 607 (607) T ss_pred HHHHHHhcCceeCCCcc-----ccEEeeCCCEEEEEEEEEEcccceEEEEEEEEEEEEEeeccCCCC Confidence 76554 6899998521 222234456899999999999999999999988877776553333 No 44 >protein:vir:101326 Length: 529 # NCBI annotation: gp22 # Family: family:all:9453 # MgeID: mge:1512 # MgeName: P1 # Cross-refs: genbank:acc:YP_006525;genbank:gi:46401679;genbank:GeneID:2777386 Probab=100.00 E-value=6.9e-46 Score=268.18 Aligned_cols=494 Identities=14% Similarity=0.092 Sum_probs=326.7 Q ss_pred Cc-ee-------cCceEEEEecCCCc--c-cccCCcceEEEeecccCCCCccEEeC--CHHHHHHHcCCcCCCchhHHHH Q lcl|NC_014792. 1 MA-LL-------SPGIELKETTVQST--V-VRNATGRAALVGKFQWGPAFQVTQIT--NEVELVDLFGGPNNITADYFMS 67 (659) Q Consensus 1 ~~-~~-------~PGVyveE~~~~~~--~-~~~~ts~~afvG~~~~Gp~~~p~~i~--s~~~~~~~fG~~~~~~~~~~~~ 67 (659) |. |- .-||.|.+++.-.+ . ++.-+++.|+||.|+||++++|.+|+ +|.+|.-.++++....+..+++ T Consensus 1 ~~~ysi~q~ig~aSGvav~pi~~d~t~~~~~g~g~~v~a~Vgif~RG~i~k~~~Vt~~n~~~~LGep~~~~~ga~~E~~~ 80 (529) T protein:vir:10 1 MSQYSIQQSLGNASGVAVSPINADATLSTGVALNSSLWAGIGVFARGKPFTVLAVTESNYEDVLGEPLKPSSGSQFEPIR 80 (529) T ss_pred CCceehhhhhhhhcccccCCcCcccccchheecCceEEEEEEEeecCCCcceEEEchhHHHHHhccccCCCcchhhhhHh Confidence 43 32 47999999985443 2 33358999999999999999999999 7999999999999999999999 Q ss_pred HHHHHcCCCeEEEEeccCCcccccccccccccccccccccccccccceeeeeeccccccccceeeeeeccCcceeeeecc Q lcl|NC_014792. 68 GMNFLQYGNDLRTVRVVNRDHAKNASPVAGNIESTIATAGSNYAVGDVIQVKHNQTVVETSGRITKVDVDGKILAVFIPS 147 (659) Q Consensus 68 ~~~f~ngG~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~ 147 (659) +.|+.-+++.|||||++..+.... .+.. ..+.. .+.. .+ T Consensus 81 h~~eA~~~~s~yVVRvv~~dak~p------~i~~-~~~~~-----------------------------~~~s--~~--- 119 (529) T protein:vir:10 81 HVYEAIQQTSGYVVRAVPDDAKFP------IIMF-DESGE-----------------------------PAYS--AL--- 119 (529) T ss_pred hhhhhhcCCceEEEEEcccccCCc------eEEe-cCCcc-----------------------------chhh--cc--- Confidence 999987777899999987653211 0000 00000 0000 00 Q ss_pred ccccccccccceeeeeccceeeEEEeecCCccccccccceeccccceeeecccccccccccceeecccccccceeeeccc Q lcl|NC_014792. 148 DKIIAFAKSVNQYPDLGPAWTAEILTTSSGVSGTITLGKIVTDSGILLTEAENSEEAITSLEFQASLQKYAMPGVVALYP 227 (659) Q Consensus 148 ~~~~~~~~~~~~~~~~~~~~~~~v~~~~~g~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~ 227 (659) +. +. .+..+.+..+..-. .+.. . .. T Consensus 120 --------~~------------------s~--------~~~l~~G~~~~iy~-----------~Dgd-~--------~~- 144 (529) T protein:vir:10 120 --------PY------------------GS--------EIELDSGEAFAIYV-----------DDGD-P--------CI- 144 (529) T ss_pred --------cc------------------cc--------cccccccceEEEEE-----------ecCc-C--------cc- Confidence 00 00 00000000000000 0000 0 00 Q ss_pred cccceeEEEEEeecccccccceeeeeeeccccccccceeeeeeeccccccceeeeec-cCCceeeeeeeecccccccccc Q lcl|NC_014792. 228 GEIGSTLEVEIVSKAAYDVGASKMLDIYPNGGSRASVARAVFNYGPQTDDQYAIIVR-RDGAIVENVVLSTKEGDKDVYG 306 (659) Q Consensus 228 g~~g~~i~V~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~-~~g~~~et~~~~~~~~~~~~~~ 306 (659) ..+..+.+.... ....+... .. ..++.++. -.+..+|+|+++++..+.+..+ T Consensus 145 -s~~~~l~i~~~~--ads~g~e~-------------~~-----------l~~~~~~~~g~~~~let~~~sl~~~a~dd~G 197 (529) T protein:vir:10 145 -SPTRELTIETAT--ADSAGNER-------------FL-----------LKLTQTTSLGVVTTLETHTVSLAEEAKDDMG 197 (529) T ss_pred -CCceEEEEEeec--cccCCCcc-------------ce-----------eeEEEEeecCCceEEEEEEeeeeechhhhcC Confidence 001112221110 00000000 00 00111121 2467789999999999999999 Q ss_pred chhhhhhhhhcccccceEEeecccCC----ccceeEEeecccccccc-cchhhhhhhHhhhhhcccccceEEEecccccc Q lcl|NC_014792. 307 NNIYLDDYFAKGTSNYIYATSLNWPK----GFAGIINLMGGISANDQ-VTAGDLMQGWDLFADREALHINLLIAGAVAGE 381 (659) Q Consensus 307 ~~~~~~~~~~~~~s~~v~~~~~~~~~----~~~~~~~~~gg~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~ 381 (659) ...++.+.+++....++......... .......+.||+|+... +.+.++..++.+|...... .+.++..| T Consensus 198 ~~~yl~svle~~s~~l~ai~~~e~~~t~~~~t~~d~~f~~GtdG~~~~i~s~~y~~A~~~L~n~p~d-~~~il~~g---- 272 (529) T protein:vir:10 198 RLCYLPTALEARSKYLRAVVNEELISTAKVTNKKSLAFTGGTNGDQSKISTAAYLRAVKVLNNAPYM-YTAVLGLG---- 272 (529) T ss_pred CccchhHHHhhccCceeeeeeeccccccchhhhhhhhccCCccccccccchHHHHHHHHHhcCCcce-eeeeeccC---- Confidence 99999999988776665543322111 11123578899988653 5667888899888755443 34444333 Q ss_pred chhhhHHHHHHHHHHHHhhCCEEEEEecCccccccccccCCHHHHHHHhhccccccccccccccceEEEEcCceeEeccc Q lcl|NC_014792. 382 GDATASTVQKHVVSIADERQDCLAFISPPKGLLVNVPLTRAVDNLIDWRTGGGSFDTDNMNISTTYAAIDGNYKYQYDKY 461 (659) Q Consensus 382 ~~~~~~~v~~~l~~~~~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~ 461 (659) +...+++.+|+.+|++++ +..+.|+|.. .+++++.+|+++.+..+++..+ -+.+||||. .-||. T Consensus 273 --~y~~a~I~~L~~ic~~~~-~d~f~DV~~~--------LT~~aA~~~~e~~gl~~~~~~~----~s~y~~P~~-~~D~~ 336 (529) T protein:vir:10 273 --CYDNAAITALGKICADRL-IDGFFDVKPT--------LTYAEALPAVEDTGLLGTDYVS----CSVYHYPFS-CKDKW 336 (529) T ss_pred --CccHHHHHHHHHHHhhhh-hcEEEcCCCC--------cCHHHHHHHHHhcCccccCcee----eEEEEccee-ecccc Confidence 345678999999997765 4445598865 4789999999999876654322 145788987 88999 Q ss_pred CCcceeecHHHH--HHHH--HHHhhhcCCceECcCCcchhheecc-ccceeecChhHHHhhhhCCceEEEEEeCCCeEEE Q lcl|NC_014792. 462 NDVNRWVPLAAD--MAGL--CARTDDVSQPWMSPPGYNRGQILNV-LKLAIEPRQTQRDRMYQEAINPVVGFAGGDGFVL 536 (659) Q Consensus 462 ~~~~~~~p~s~~--~Ag~--~a~~d~~~g~~~span~~~~~i~g~-~~~~~~~~~~e~~~Ln~~gin~i~~~~~~~G~~~ 536 (659) ++....+++||. +|.. .++.....|+|++|||+.++.|.-. +.+-+..++-|...|-.++||+|.--.+ +++.+ T Consensus 337 tg~k~~~GlsG~A~~akargv~~na~v~g~hY~pAGe~r~~inr~~I~~ly~~d~~e~~~lv~~riNPV~~~~~-g~~~i 415 (529) T protein:vir:10 337 TQSRVVFGLSGVAYAAKARGVKKNSDVGGWHYSPAGEERAVIARASIQPLYPEDTPDEEAMVKGRLNKVSVGTS-GQMII 415 (529) T ss_pred ccCceeeCCCcceeeccccceeecccccccccccCCCccceeecccceeccCCCccCHHHHHhhccCeeeeecc-Cccee Confidence 999999999994 3332 1344444456999999988766542 2445666777888899999998865433 34444 Q ss_pred EcccccCCCccccceeehhhHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhccceee-----------e Q lcl|NC_014792. 537 YGDKTATKVPSPMDHINVRRLTNMLKKNIGDASKYKLFELNDNFTRASFRMETSQYLDGIRALGGIYE-----------G 605 (659) Q Consensus 537 wG~rT~~~~~~~~~~i~vrR~~~~i~~~i~~~~~~~v~epn~~~l~~~i~~~i~~~l~~l~~~gal~g-----------~ 605 (659) -.+-|++..++.|||+|+++|+++|++.+.+..+|.+|||++..+|. +++-++.+|+.+|+.|+|++ | T Consensus 416 dDsLt~~~knny~R~~hv~~lmn~I~~~~~k~a~~~~~~Pd~it~~g-l~~~l~~~L~r~~asgalv~prdp~~~G~epy 494 (529) T protein:vir:10 416 DDALTCCTQDNYLHFQHVPSLMNAISRFFVQLARQMKHSPDGITAAG-LTKGMTKLLDRFVASGALVAPRDPDADGTEPY 494 (529) T ss_pred eeeeceeeeCCchhhhhHHHHHHHHHHHHHHHHHHHhhCCChHHHHH-HHHhHHHHHHHHHhcCceecccCccCCCCCce Confidence 33445444567999999999999999999999999999999998877 99999999999999999975 6 Q ss_pred EEEEccCCCCHHHhhCCEEEEEEEEEecCCceEEEEEEEEee Q lcl|NC_014792. 606 RVVCDTTNNTPSVIDRNEFVASIYYKPARSINYIVLNFVATS 647 (659) Q Consensus 606 ~v~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~ 647 (659) ++.+ +|.| .++|.+++.++|...+++|...-.-.+ T Consensus 495 ~~~V-----~q~d--~D~~~v~~~~~ptGv~Rri~~~p~l~~ 529 (529) T protein:vir:10 495 VLKV-----TQAE--FDKWEVVWACCPTGVARRIQGVPLLIK 529 (529) T ss_pred EEEE-----eecc--cCeEEEEEEeecCCceeeEEeeeeecC Confidence 6666 2333 488999999999999999977544333 No 45 >protein:vir:105470 Length: 451 # NCBI annotation: putative phage sheath tail protein # Family: family:all:632 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529880;genbank:gi:90592620;genbank:GeneID:3974534 Probab=100.00 E-value=1.6e-39 Score=233.27 Aligned_cols=424 Identities=14% Similarity=0.098 Sum_probs=259.9 Q ss_pred CceecCceEEEEecCCCc-ccccCCcceEEEeec-ccCCCCccEEeCCHHHHHHHcCCcCCCchhHHHHHHHHHcCCCeE Q lcl|NC_014792. 1 MALLSPGIELKETTVQST-VVRNATGRAALVGKF-QWGPAFQVTQITNEVELVDLFGGPNNITADYFMSGMNFLQYGNDL 78 (659) Q Consensus 1 ~~~~~PGVyveE~~~~~~-~~~~~ts~~afvG~~-~~Gp~~~p~~i~s~~~~~~~fG~~~~~~~~~~~~~~~f~ngG~~~ 78 (659) +.=.-|||||||++++.+ +.++++++++|+|.+ .||| ++|+.|.|+.||++.||..... ..+....+|++||++| T Consensus 9 ~~K~~PGvYi~~~~~~~~~~~~~~~~~~~~i~~~~~~g~-~~~v~i~~~~d~~~~fG~~~~~--~~~~~~~~~~~g~~~v 85 (451) T protein:vir:10 9 QDKRRPGTYINVVGNGQREAASSLGRVLLIRDKGLGWGK-NGVIEVEANSDFTKKLGTTLDD--PSLTALKETLKGASKV 85 (451) T ss_pred ceeecCceEEEEeccCcceeeccCCcEEEEEeeecCCCC-cccEEeecHHHHHHHcCCcccc--hhHHHHHHHhcCCcEE Confidence 444579999999999876 456789999999975 5677 7899999999999999975543 3334445556899999 Q ss_pred EEEeccCCcccccccccccccccccccccccccccceeeeeeccccccccceeeeeeccCcceeeeeccccccccccccc Q lcl|NC_014792. 79 RTVRVVNRDHAKNASPVAGNIESTIATAGSNYAVGDVIQVKHNQTVVETSGRITKVDVDGKILAVFIPSDKIIAFAKSVN 158 (659) Q Consensus 79 ~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~ 158 (659) |+.|+.+.. ++.++...+. .. T Consensus 86 ~~yrl~~g~-~a~~t~~~~~---~~------------------------------------------------------- 106 (451) T protein:vir:10 86 LVLNPNEGT-AATLTKEGLP---WT------------------------------------------------------- 106 (451) T ss_pred EEEEcCCCc-eEEEEeecCc---eE------------------------------------------------------- Confidence 999985432 1100000000 00 Q ss_pred eeeeeccceeeEEEeecCCccccccccceeccccceeeecccccccccccceeecccccccceeeeccccccceeEEEEE Q lcl|NC_014792. 159 QYPDLGPAWTAEILTTSSGVSGTITLGKIVTDSGILLTEAENSEEAITSLEFQASLQKYAMPGVVALYPGEIGSTLEVEI 238 (659) Q Consensus 159 ~~~~~~~~~~~~v~~~~~g~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~g~~i~V~v 238 (659) +.+.++|.+||.++|.+ T Consensus 107 ---------------------------------------------------------------~~Aky~G~~Gn~i~v~v 123 (451) T protein:vir:10 107 ---------------------------------------------------------------VTANYPGEKGNQITVSV 123 (451) T ss_pred ---------------------------------------------------------------EEEeeCCcCCceEEEEE Confidence 11345566666665544 Q ss_pred eecccccccceeeeeeeccccccccceeeeeeeccccccceeeeeccCCceeeeeeeeccccccccccchhhhhhhhhcc Q lcl|NC_014792. 239 VSKAAYDVGASKMLDIYPNGGSRASVARAVFNYGPQTDDQYAIIVRRDGAIVENVVLSTKEGDKDVYGNNIYLDDYFAKG 318 (659) Q Consensus 239 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~et~~~~~~~~~~~~~~~~~~~~~~~~~~ 318 (659) ...... ...+.+.+..++..++...+.... ...-. T Consensus 124 ~~~~~d-------------------------------~~~~~v~t~~g~~~vd~qtv~~~~--------------~~el~ 158 (451) T protein:vir:10 124 EVSPAD-------------------------------QNAATVSTIFGTKLVDEQSIKFNE--------------LDKFK 158 (451) T ss_pred ecccCC-------------------------------cCceEEEEEECCeEEEEEEeeccc--------------hhhcc Confidence 211100 001111111222222222111000 00001 Q ss_pred cccceEEeecccCC-ccceeEEeecccccc-cccchhhhhhhHhhhhhcccccceEEEeccccccchhhhHHHHHHHHHH Q lcl|NC_014792. 319 TSNYIYATSLNWPK-GFAGIINLMGGISAN-DQVTAGDLMQGWDLFADREALHINLLIAGAVAGEGDATASTVQKHVVSI 396 (659) Q Consensus 319 ~s~~v~~~~~~~~~-~~~~~~~~~gg~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~v~~~l~~~ 396 (659) .+.++.+....... .......+.++.++. ...+..++... +...+.++++.+++|+.. ....++..+.+| T Consensus 159 ~nd~V~a~~~~~g~~~~~~~~~l~~~~~gg~~~~~~~~~~~~---l~~~e~~~~n~l~~~~~~-----~~~~i~~~~~a~ 230 (451) T protein:vir:10 159 GNDYITAKVVEEGSSKPVAFTNVSGTLTGGTTTESNKVESLL---NDALENEEYAVVTTAGFE-----PSSNMNKLVVEA 230 (451) T ss_pred CCceEEEEecccccccceeeeecccccccccccCCccchHHH---HHHhccceeeEEEEccCC-----CchHHHHHHHHH Confidence 12333333221111 112223344443322 22233344443 444466778999988653 224577788899 Q ss_pred HHhhCC-----EEEEEecCccccccccccCCHHHHHHHhhccccccccccccccceEEEEcCceeEecccCCcceeecH- Q lcl|NC_014792. 397 ADERQD-----CLAFISPPKGLLVNVPLTRAVDNLIDWRTGGGSFDTDNMNISTTYAAIDGNYKYQYDKYNDVNRWVPL- 470 (659) Q Consensus 397 ~~~~~~-----~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~- 470 (659) |+++|+ ..+++..+.... .++.....+.+.....| .+.+++ T Consensus 231 ik~~r~~~g~~~~aVl~~~~~~~----------------------------~d~egiinv~n~~~~~d-----g~~~~~~ 277 (451) T protein:vir:10 231 VKRLRENEGRKVRGVIPTDADTT----------------------------YNYEGISTVVNGYTLSD-----GTNVDVK 277 (451) T ss_pred HHHHHHhcCCeEEEEecCccCCC----------------------------CCCcceEEeecceEecC-----ceeechh Confidence 988753 357765332110 01112222222222111 122233 Q ss_pred --HHHHHHHHHHhhhcCCceECcCCcchhheeccccceeecChhHHHhhhhCCceEEEEEeCCCeEEE-EcccccC---- Q lcl|NC_014792. 471 --AADMAGLCARTDDVSQPWMSPPGYNRGQILNVLKLAIEPRQTQRDRMYQEAINPVVGFAGGDGFVL-YGDKTAT---- 543 (659) Q Consensus 471 --s~~~Ag~~a~~d~~~g~~~span~~~~~i~g~~~~~~~~~~~e~~~Ln~~gin~i~~~~~~~G~~~-wG~rT~~---- 543 (659) ++.+||++|..+ +.+|+.|+. +.|+..+...+++.|++.+.++|..++....+ +++++ +|-.|+. T Consensus 278 ~~~~~vAG~~Ag~~----~~~S~T~~~---~~~~~~v~~~~t~~e~~~~i~~G~lvl~~~~g-~~v~i~~~INTltt~~~ 349 (451) T protein:vir:10 278 DATGYFAGISASAD----VATSLTYFE---VEDAVSAYPKFDNEKTIKALDAGQIVFTTRPG-QRVVIEQDINSLHKFTA 349 (451) T ss_pred hhHHHHHHHHcccc----cccCcccee---cCCceeeeeeCCHHHHHHHHhCCeEEEEEEcC-CeEEEEEccccceecCC Confidence 478899999875 556999986 44566777899999999999999977654444 45654 7777763 Q ss_pred CCccccceeehhhHHHHHHHHHHHHHHH-HhcC-CCCHHHHHHHHHHHHHHHHHHHhccceeeeEEEEccCCCCHHHhhC Q lcl|NC_014792. 544 KVPSPMDHINVRRLTNMLKKNIGDASKY-KLFE-LNDNFTRASFRMETSQYLDGIRALGGIYEGRVVCDTTNNTPSVIDR 621 (659) Q Consensus 544 ~~~~~~~~i~vrR~~~~i~~~i~~~~~~-~v~e-pn~~~l~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~nt~~~i~~ 621 (659) ..+.+|+.|.++|++|+|.+.+++.... |+++ |||...|..++..|+.||++|+++|+|..|... |.+- ...-.. T Consensus 350 ~k~~~~~ki~vir~~D~i~~di~~~~~~~yiGk~~N~~~gr~~~~~~i~~yl~~l~~~g~i~~~~~~-d~~v--~~~~~~ 426 (451) T protein:vir:10 350 EKPQAFSKNRVIRTLDEIATNTENTFERTYLGNVGNNAAGRDLFKADRIAYLTSLQNRNMIQSFANT-DITV--EAGNDM 426 (451) T ss_pred CCCcchhhhhHHHHHHHHHHHHHHHhhhccceecCCCHHHHHHHHHHHHHHHHHHHhCCCccCCCcc-ceEE--eecCCC Confidence 2245899999999999999999999875 8885 699999999999999999999999999998732 2111 111135 Q ss_pred CEEEEEEEEEecCCceEEEEEEEEe Q lcl|NC_014792. 622 NEFVASIYYKPARSINYIVLNFVAT 646 (659) Q Consensus 622 G~~~~~i~~~p~~p~e~i~~~~~~~ 646 (659) ..+++.+.++|+..||+|.+++... T Consensus 427 ~~v~v~~~v~pvdame~iy~t~~v~ 451 (451) T protein:vir:10 427 DSIVVNLAVTPVDAMEKLYMTMVVR 451 (451) T ss_pred CEEEEEEEEEEEeeeeeEEEEEEEc Confidence 6799999999999999999998866 No 46 >protein:vir:7653 Length: 581 # NCBI annotation: gp124 # Family: family:all:1196 # MgeID: mge:148 # MgeName: Bxz1 # Cross-refs: genbank:acc:NP_818197;genbank:gi:29566631;genbank:GeneID:1259809 Probab=100.00 E-value=5.3e-36 Score=213.99 Aligned_cols=542 Identities=13% Similarity=0.066 Sum_probs=241.7 Q ss_pred ccEEe---CCHHHHHHHcCCcCCCchh--HHH--HHHHHH-cCCCeEEEEeccCCccccccccccccccccccccccccc Q lcl|NC_014792. 40 QVTQI---TNEVELVDLFGGPNNITAD--YFM--SGMNFL-QYGNDLRTVRVVNRDHAKNASPVAGNIESTIATAGSNYA 111 (659) Q Consensus 40 ~p~~i---~s~~~~~~~fG~~~~~~~~--~~~--~~~~f~-ngG~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~ 111 (659) ..+-+ +.-.+|...||.+.....- +.+ +...+. ..|-+|- +|........+.. T Consensus 1 ~~~~~~~~~~~~~~t~~~~~~~~g~~~~~~~~~~i~g~~~g~~g~~~s-~r~~p~~~~~~ev------------------ 61 (581) T protein:vir:76 1 MAIDFSQYQTPGVYTEAVGAPQLGIRSSVPTAVAIFGTAVGYQTYRES-IRINPDTGETITT------------------ 61 (581) T ss_pred CcccccccccchhhhhhccccccCcceeeeeeeeecccccccccccce-eeecCCCCCCCce------------------ Confidence 11111 1123344434433221110 000 011111 0122232 2322111111000 Q ss_pred ccceeeeeeccccccccceeeeeeccCcceeeeeccccccc------c---ccccceeeeeccceeeEEEeecCCccccc Q lcl|NC_014792. 112 VGDVIQVKHNQTVVETSGRITKVDVDGKILAVFIPSDKIIA------F---AKSVNQYPDLGPAWTAEILTTSSGVSGTI 182 (659) Q Consensus 112 ~~~~~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~------~---~~~~~~~~~~~~~~~~~v~~~~~g~~~~~ 182 (659) ..+.+.-......+...+.+... ....+-.+..... . ...+.........+..+.....+.... T Consensus 62 --q~v~~~~~~t~G~ftLt~~g~tT---~~I~~~asa~~v~~AL~~L~~i~~~~v~vtg~~~~~~~V~F~g~~~~~~~-- 134 (581) T protein:vir:76 62 --QILALVGEPTGGSFKLSLAGEPT---GNIPFNATQGQVQSALRALPNVEDDEVTVLGDPGGPWTVTFTKAVAALTK-- 134 (581) T ss_pred --EEEEEeecCCcceEEEEeCceec---cccccCCCHHHHHHHHhhccCCCCceEEEEcCCCceEEEEEcCCccceeE-- Confidence 00011000000011110000000 0000000000000 0 000000000000011110000000000 Q ss_pred cccceeccccceeeecccccccccccceeecccccccceeeeccccccceeEEEEEeecc-cccccceeeeeeecccccc Q lcl|NC_014792. 183 TLGKIVTDSGILLTEAENSEEAITSLEFQASLQKYAMPGVVALYPGEIGSTLEVEIVSKA-AYDVGASKMLDIYPNGGSR 261 (659) Q Consensus 183 ~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~g~~i~V~v~~~~-~~~~~~~~~~~~~~~~~~~ 261 (659) ....+....+... .+.....+.....+ .....|..+..+.+...... .+..+..... ...+.+.. T Consensus 135 ~~~~ltg~~~~~~-~V~~~~~G~~~~~~------------~l~~~g~~~~~~~~~~s~~~~~~~l~~~~~~-~~~~~~~~ 200 (581) T protein:vir:76 135 DVTGLTGGDNPDL-NIASEQTGVPAMNR------------ALAKKGIKTDTIRVVNPNSGQVYVLGTDYVV-TRVNAGED 200 (581) T ss_pred eeeeeecCCccee-EEEEEecCcCCcCc------------eeeeccccccccceeecCCcceeeecccccc-eeeccCcc Confidence 0000000000000 00000000000000 00000111111111000000 0000000000 00000000 Q ss_pred --ccceeeeeeecccccc-------ceeeeec-cCCceeeeeeeeccccccccccchhhhhhhhhcccccceEEeecccC Q lcl|NC_014792. 262 --ASVARAVFNYGPQTDD-------QYAIIVR-RDGAIVENVVLSTKEGDKDVYGNNIYLDDYFAKGTSNYIYATSLNWP 331 (659) Q Consensus 262 --~~~~~~~~~~~~~~~~-------~~~~~v~-~~g~~~et~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~v~~~~~~~~ 331 (659) ......+..+....++ .+.+... .+..--|.+............+ .......+..+........... T Consensus 201 ~~~~~~~~~~t~~~~~~g~~~~~~~i~~~~~~~~D~~~~~~v~~~~~~~~~~~~~---~~~~~~g~~~~e~~~~~~~~~t 277 (581) T protein:vir:76 201 GEANTRDDLYTIQRVVDGGHIDPGDIVQLSYRYTDPNYHEVIRFTDPDDIQDFYG---PAFDEAGNVQSEITLCAQLAIT 277 (581) T ss_pred cceeeeeeeeeeEeecccccccceeEEEEEEEeecCCccceEEEeccccccccee---eehhhcCccccchhhhhheeec Confidence 0000000000000000 0000000 0000001111100000000000 0000000001111111111111 Q ss_pred CccceeEEeeccccc-ccccchhhhhhhHhhhhhcccccceEEEeccccccchhhhHHHHHHHHHHHHhhC----CEEEE Q lcl|NC_014792. 332 KGFAGIINLMGGISA-NDQVTAGDLMQGWDLFADREALHINLLIAGAVAGEGDATASTVQKHVVSIADERQ----DCLAF 406 (659) Q Consensus 332 ~~~~~~~~~~gg~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~v~~~l~~~~~~~~----~~~ai 406 (659) ......+.+|.++ .+.++.+|+..++++++. .+...+++|+. ...++++++.+||++++ .+.++ T Consensus 278 --~~~~~~l~~gvd~~g~tvt~~dy~~aL~ale~---~~~~~ivvp~t------~~~~i~a~l~ahv~~~s~~~~~~ra~ 346 (581) T protein:vir:76 278 --NGASTILACAVDPEGDTVTMGDYQNALNKFRD---EDEIAIIVAGT------GAQPIQALVQQHVSAQSNNKYERRAI 346 (581) T ss_pred --cccceEEEeeecCCCCccchHHHHHHHHHHhc---CCeEEEEEecC------CChHHHHHHHHHHHHHHhccCCceEE Confidence 1123456677765 334678889888887764 33455566653 34568888888887663 34555 Q ss_pred EecCccccccccccCCHHHHHHHhhccccccccccccccceEEEEcCceeEecccCCcceeecHHHHHHHHHHHhhhcCC Q lcl|NC_014792. 407 ISPPKGLLVNVPLTRAVDNLIDWRTGGGSFDTDNMNISTTYAAIDGNYKYQYDKYNDVNRWVPLAADMAGLCARTDDVSQ 486 (659) Q Consensus 407 ~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~Ag~~a~~d~~~g 486 (659) ++.+... ...+.+.+... ...+++.|..++|||..+++..........|..++|+.+|.+....+ T Consensus 347 igv~g~~-----~~~~~~~~~~~----------a~~~ns~Rvvlv~p~~~~~~g~~~~~~~~lp~~~~AA~vAG~~a~~~ 411 (581) T protein:vir:76 347 LGMDGSV-----TPVPSATRIAN----------AQSIKDQRVALISPSSFVYYAPELNREVVLGGQFMAAAVAGKSVSAI 411 (581) T ss_pred EEeeCCC-----CCchHHHHHHh----------hcccCCCcEEEEEcCceEeccccCCcceecchhhhhhhHHhhhhccc Confidence 5544221 11122222221 12567899999999999988765544444455666666677777777 Q ss_pred ceECcCCcchhheeccccceeecChhHHHhhhhCCceEEEEEeCCCeEEE-EcccccCCCccccceeehhhHHHHHHHHH Q lcl|NC_014792. 487 PWMSPPGYNRGQILNVLKLAIEPRQTQRDRMYQEAINPVVGFAGGDGFVL-YGDKTATKVPSPMDHINVRRLTNMLKKNI 565 (659) Q Consensus 487 ~~~span~~~~~i~g~~~~~~~~~~~e~~~Ln~~gin~i~~~~~~~G~~~-wG~rT~~~~~~~~~~i~vrR~~~~i~~~i 565 (659) +++||.|+.+ .|+.++...+++.|++.|+++|+++++.+++ +++++ ||-+|+.+++ +|++|++||++|++++.+ T Consensus 412 ~~~slT~~~i---~g~~~~~~~~s~~e~e~ll~~Gv~~l~~~~~-~~v~Iv~gItT~~s~~-~~k~i~viR~~D~v~~~v 486 (581) T protein:vir:76 412 AAMPLTRKVI---RGFSGPAEVQRDGEKSRESSEGLMVIEKTPR-NLVHVRHGVTTDPTSL-HTREWNIIGQQDVMVYRI 486 (581) T ss_pred cccCcccccc---cccccccccCCHHHHHHHHhCCeEEEEEecC-CeEEEEEeeecCCCCC-ccceeeehhhhHHHHHHH Confidence 8999999875 4566788899999999999999999998776 68875 7878877654 799999999999999999 Q ss_pred HHHHH--HHhcCCCCHHHHHHHHHHHHHHHHHHHhccceeeeEEEEccCCCCHHHhhCCEEEEEEEEEecCCceEEEEEE Q lcl|NC_014792. 566 GDASK--YKLFELNDNFTRASFRMETSQYLDGIRALGGIYEGRVVCDTTNNTPSVIDRNEFVASIYYKPARSINYIVLNF 643 (659) Q Consensus 566 ~~~~~--~~v~epn~~~l~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~ 643 (659) ++.++ .|++|||++.+|.+|+..+..||.+||++|+|.||.. .+.++.+.+.+++++++.++|++|+|||++++ T Consensus 487 r~~~~~~~fiG~~n~~~~r~~ik~~i~~~L~~l~~~g~I~g~~~----~~~~~~~~~~d~v~V~i~v~Pv~~ie~I~vt~ 562 (581) T protein:vir:76 487 RDYLDADGLIGMPIYDTTIVQVKASAEAALVWLVDNNIIRGYRN----LKARQIERQPDVIEVRYEWRPAYPLNYIVVRY 562 (581) T ss_pred HHHHhhhcCCCcccChHHHHHHHHHHHHHHHHHHhcCcccCccc----ceeeEEecCCCEEEEEEEEEecccceEEEEEE Confidence 99986 5888999999999999999999999999999999973 23466777889999999999999999999998 Q ss_pred EEeecCeeEEE-ecCCC Q lcl|NC_014792. 644 VATSTGADFDE-LIGVQ 659 (659) Q Consensus 644 ~~~~~~~~~~e-~~~~~ 659 (659) ......=.|.- +.|-- T Consensus 563 ~~~p~~~~~~~~~~~~~ 579 (581) T protein:vir:76 563 SIAPETGDITSTIEGTT 579 (581) T ss_pred EEeeCCCceEEEEeccc Confidence 87654433322 22333 No 47 >protein:vir:107310 Length: 581 # NCBI annotation: gp123 # Family: family:all:1196 # MgeID: mge:1550 # MgeName: Catera # Cross-refs: genbank:acc:YP_656131;genbank:gi:109393333;genbank:GeneID:4156791 Probab=100.00 E-value=7.4e-35 Score=207.70 Aligned_cols=533 Identities=14% Similarity=0.069 Sum_probs=238.3 Q ss_pred ccEEeCC---HHHHHHHcCCcCCCc--hhHHHHHHH--HH-cCCCeEEEEeccCCccccccccccccccccccccccccc Q lcl|NC_014792. 40 QVTQITN---EVELVDLFGGPNNIT--ADYFMSGMN--FL-QYGNDLRTVRVVNRDHAKNASPVAGNIESTIATAGSNYA 111 (659) Q Consensus 40 ~p~~i~s---~~~~~~~fG~~~~~~--~~~~~~~~~--f~-ngG~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~ 111 (659) ..|.+.- -.||...|+.+...- ..++++..+ +- -.|-+|..-...+. ...+.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~i~g~~~g~~g~~~s~~~~p~~-~~~~e~------------------ 61 (581) T protein:vir:10 1 MAIDFSQYQTPGVYTEAVGAPQLGIRSSVPTAVAIFGTAVGYQTYRESIRINPDT-GETITT------------------ 61 (581) T ss_pred CeeeeccccccchhhhhccccccceeeeeccccccccccccccccccccccCCCC-CCccce------------------ Confidence 3333321 233444444332211 111111111 11 01223422111111 100000 Q ss_pred ccceeeeeeccccccccceeeeeeccCcceeeeeccccccc------c---ccccceeeeeccceeeEEEeecCCccccc Q lcl|NC_014792. 112 VGDVIQVKHNQTVVETSGRITKVDVDGKILAVFIPSDKIIA------F---AKSVNQYPDLGPAWTAEILTTSSGVSGTI 182 (659) Q Consensus 112 ~~~~~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~------~---~~~~~~~~~~~~~~~~~v~~~~~g~~~~~ 182 (659) ..+.+.-......+.....+.... ...+-.+..-.. . ...+.........+..+. .+..+.. T Consensus 62 --q~v~~~~~~t~GtFtLsf~G~tT~---~I~~~asa~~v~~AL~~L~~i~~~~v~v~g~~g~~~~VtF----~g~~~~l 132 (581) T protein:vir:10 62 --QILALVGEPTGGSFKLSLAGEPTG---NIPFNATQGQVQSALRALPNVEDDEVTVLGDPGGPWTVTF----TKAVAAL 132 (581) T ss_pred --EEEEEEecCCCceEEEEeCceecc---cccccCCHHHHHHHHhccCCCCcceEEEECCCCceEEEEE----cCCccce Confidence 001111000111111111000000 000000000000 0 000000000000111110 0000000 Q ss_pred ccccee-c-cccceeeecccccccccccceeecccccccceeeeccccccceeEEE-EEe-ecccccccceeeeeeeccc Q lcl|NC_014792. 183 TLGKIV-T-DSGILLTEAENSEEAITSLEFQASLQKYAMPGVVALYPGEIGSTLEV-EIV-SKAAYDVGASKMLDIYPNG 258 (659) Q Consensus 183 ~~~~~v-~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~g~~i~V-~v~-~~~~~~~~~~~~~~~~~~~ 258 (659) ...... . ........... ...... ....+. ..+.......+ ... .............. .+. T Consensus 133 ~~~~~~lt~g~~~~vtV~~~-~~g~~~--~~~~~s----------~~gi~~~~~~l~~~~~~~~~~~gsd~~~~~--~~~ 197 (581) T protein:vir:10 133 TKDVTGLTGGDDPDLNIASE-QTGVPA--MNRALA----------KKGIKTDTIRVVNPNSGQVYVLGTDYVVTR--VNA 197 (581) T ss_pred eeeeceecCCCceeEEEecc-ccCccc--cccccc----------ccccccccccccccccCcceeccccceeee--ccc Confidence 000000 0 00000000000 000000 000000 00000000000 000 00000000000000 000 Q ss_pred cccccceeeeeeeccccccceeeeeccCCceeee---eeeecccccccc-----ccchhhhhhhh----h---cccccce Q lcl|NC_014792. 259 GSRASVARAVFNYGPQTDDQYAIIVRRDGAIVEN---VVLSTKEGDKDV-----YGNNIYLDDYF----A---KGTSNYI 323 (659) Q Consensus 259 ~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~et---~~~~~~~~~~~~-----~~~~~~~~~~~----~---~~~s~~v 323 (659) .... .....++..+++....|...++ +.+.....+..+ .........++ . +..+... T Consensus 198 ~~~~--------~~~~~~D~~t~~~~~~g~~~~~~~v~~~~~~~~d~~~~~~v~~~~~~~~~~~~~~~~~~~g~~~~~~t 269 (581) T protein:vir:10 198 GEDG--------EANTRDDLYTIQRVVDGGHIDPGDIVQLSYRYTDPNYHEVIRFTDPDDIQDFYGPAFDEAGNVQSEIT 269 (581) T ss_pred Cccc--------cccccccceeeeeeecccccccceEEEEEEEeecCCcceeEEeecCcchhhhhhhhhhccCccccchh Confidence 0000 0000001111111111211111 111110001000 00000011110 0 1111111 Q ss_pred EEeecccCCccceeEEeecccccc-cccchhhhhhhHhhhhhcccccceEEEeccccccchhhhHHHHHHHHHHHHhhC- Q lcl|NC_014792. 324 YATSLNWPKGFAGIINLMGGISAN-DQVTAGDLMQGWDLFADREALHINLLIAGAVAGEGDATASTVQKHVVSIADERQ- 401 (659) Q Consensus 324 ~~~~~~~~~~~~~~~~~~gg~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~v~~~l~~~~~~~~- 401 (659) ....... .......+.+|.++. +.++.+|+..++++++. .+.+.+++|+. +..+++.+|.+||++++ T Consensus 270 ~~~~~~~--tn~~~~~l~~gvd~~g~tvt~~dy~~Al~ale~---~~~~~ivv~~t------~~~~v~a~l~ahv~~~s~ 338 (581) T protein:vir:10 270 LCAQLAI--TNGASTILACAVDPEGDTVTMGDYQNALNKFRD---EDEIAIIVAGT------GAQPIQALVQQHVSAQSN 338 (581) T ss_pred hhheeee--ecccceeEEeeccCCCCccchHHHHHHHHHHhc---CCceEEEEeCC------CCHHHHHHHHHHHHHHHh Confidence 1111000 011234456666653 34677888888877764 33455567653 44578888999997763 Q ss_pred ---CEEEEEecCccccccccccCCHHHHHHHhhccccccccccccccceEEEEcCceeEecccCC-cceeecHHHHHHHH Q lcl|NC_014792. 402 ---DCLAFISPPKGLLVNVPLTRAVDNLIDWRTGGGSFDTDNMNISTTYAAIDGNYKYQYDKYND-VNRWVPLAADMAGL 477 (659) Q Consensus 402 ---~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~-~~~~~p~s~~~Ag~ 477 (659) ++.++++.+... ...+.+....- ...++++|..++||+..+++...+ +...+|+ .++|+. T Consensus 339 ~~~~~ravigV~g~~-----~~~~~~~~~~~----------a~~~n~~Rvvlv~p~~~~~~g~~~~~~v~lp~-y~~AA~ 402 (581) T protein:vir:10 339 NKYERRAILGMDGSV-----TPVPSATRIAN----------AQSIKDQRVALISPSSFVYYAPELNREVVLGG-QFMAAA 402 (581) T ss_pred ccCCcEEEEEecCCC-----CCccHHHHHHh----------hccCCCceEEEEecCceeecCcccCceeccch-hhHHHH Confidence 355666544221 11122222221 125678999999999988877544 4444555 333444 Q ss_pred HHHhhhcCCceECcCCcchhheeccccceeecChhHHHhhhhCCceEEEEEeCCCeEEE-EcccccCCCccccceeehhh Q lcl|NC_014792. 478 CARTDDVSQPWMSPPGYNRGQILNVLKLAIEPRQTQRDRMYQEAINPVVGFAGGDGFVL-YGDKTATKVPSPMDHINVRR 556 (659) Q Consensus 478 ~a~~d~~~g~~~span~~~~~i~g~~~~~~~~~~~e~~~Ln~~gin~i~~~~~~~G~~~-wG~rT~~~~~~~~~~i~vrR 556 (659) +|.+....++++||.|+.+. |+.++...+++.|++.|+++|+++++.+++ +++++ ||-+|+.+++ +|++|++|| T Consensus 403 vAGl~a~~~~~~slT~~~i~---gi~~l~~~~s~~e~e~ll~~Gv~~l~~~~~-~~v~Iv~gItT~~s~~-~~~~i~~iR 477 (581) T protein:vir:10 403 VAGKSVSAIAAMPLTRKVIR---GFSGPAEVQRDGEKSRESSEGLMVIEKTPR-NLVHVRHGVTTDPTSL-HTREWNIIG 477 (581) T ss_pred HHHHhhccccccCccccccc---ccccccccCCHHHHHHHHhCCeEEEEEecC-CeEEEEeeeecCCCCC-cceeeeeeh Confidence 44444444578899998754 566778899999999999999999998776 68886 6667776654 799999999 Q ss_pred HHHHHHHHHHHHHH--HHhcCCCCHHHHHHHHHHHHHHHHHHHhccceeeeEEEEccCCCCHHHhhCCEEEEEEEEEecC Q lcl|NC_014792. 557 LTNMLKKNIGDASK--YKLFELNDNFTRASFRMETSQYLDGIRALGGIYEGRVVCDTTNNTPSVIDRNEFVASIYYKPAR 634 (659) Q Consensus 557 ~~~~i~~~i~~~~~--~~v~epn~~~l~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~~i~~~p~~ 634 (659) ++|++.+.+++.++ +|++|||++.+|.+|+..+..||.+||++|+|.||+.. +.++.+.+.++++++|.++|++ T Consensus 478 ~~D~v~~~ir~~~~~~~fIG~~n~~~~r~~ik~~i~~~L~~l~~~g~I~~~~~~----~~~~~~~~~d~v~V~i~v~Pv~ 553 (581) T protein:vir:10 478 QQDVMVYRIRDYLDADGLIGMPIYDTTIVQVKASAEAALVWLVDNNIIRGYRNL----KARQIERQPDVIEVRYEWRPAY 553 (581) T ss_pred hhhHHHHHHHHHhhhhcCCCcccCHHHHHHHHHHHHHHHHHHHhcCcccCCccc----eeeeeecCCCEEEEEEEEEecc Confidence 99999999999985 58889999999999999999999999999999999732 3466677889999999999999 Q ss_pred CceEEEEEEEEeecCeeEEE-ecCCC Q lcl|NC_014792. 635 SINYIVLNFVATSTGADFDE-LIGVQ 659 (659) Q Consensus 635 p~e~i~~~~~~~~~~~~~~e-~~~~~ 659 (659) |+|||.+|+.++...=.|.- +.|-- T Consensus 554 ~i~~I~vti~~~p~~~~~~~~~~~~~ 579 (581) T protein:vir:10 554 PLNYIVVRYSIAPETGDITSTIEGTT 579 (581) T ss_pred cceEEEEEEEEecCCCceEEEEeccc Confidence 99999999887765444332 22333 No 48 >protein:vir:78986 Length: 436 # NCBI annotation: putative sheath tail protein # Family: family:all:632 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110731;genbank:gi:134287348;genbank:GeneID:4955160 Probab=99.96 E-value=4e-28 Score=170.79 Aligned_cols=406 Identities=16% Similarity=0.121 Sum_probs=249.2 Q ss_pred Cc----------eecCceEEEEecCCC-cccccCCcceEEEeecccCCCCccEEeCC---HHHHHHHcCCcCCCchhHHH Q lcl|NC_014792. 1 MA----------LLSPGIELKETTVQS-TVVRNATGRAALVGKFQWGPAFQVTQITN---EVELVDLFGGPNNITADYFM 66 (659) Q Consensus 1 ~~----------~~~PGVyveE~~~~~-~~~~~~ts~~afvG~~~~Gp~~~p~~i~s---~~~~~~~fG~~~~~~~~~~~ 66 (659) |+ =.-||+|++-+.... ++..+...+.++...+.|||+++++.|++ ..++...||.... .+.... T Consensus 1 ~~magg~~~~~~K~~PG~Y~n~~~~~~~~~~~~~rGi~a~p~~~~wGp~~~v~~i~~~~~~~~~~~~~G~~~~-~~~~~~ 79 (436) T protein:vir:78 1 MALGGGTFVTQNKVLPGSYINFVSATRATSSLSDRGIVAMPLELDWGIDEEVFQVTSDDFEKYSTKYFGYDYT-HEKLKG 79 (436) T ss_pred CcccceeeccceeecCceEEEEEecCcceeeccCCeEEEEEEEecCCCCceeEEeecccchHHHHHHhcCccc-hHHHHH Confidence 33 246999999997665 46677799999999999999999999998 5689999995322 222234 Q ss_pred HHHHHHcCCCeEEEEeccCCcccccccccccccccccccccccccccceeeeeeccccccccceeeeeeccCcceeeeec Q lcl|NC_014792. 67 SGMNFLQYGNDLRTVRVVNRDHAKNASPVAGNIESTIATAGSNYAVGDVIQVKHNQTVVETSGRITKVDVDGKILAVFIP 146 (659) Q Consensus 67 ~~~~f~ngG~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~ 146 (659) ++..| .|.+.+|+.|+.++..+. ++ ..++ T Consensus 80 l~~~~-~~~~tv~~yrl~~G~~a~-~~---------v~~A---------------------------------------- 108 (436) T protein:vir:78 80 LRDLF-KNIRLGYFYKLNKGVKAS-CS---------IATA---------------------------------------- 108 (436) T ss_pred HHHHh-cCCCEEEEEECCCcceee-ee---------eeee---------------------------------------- Confidence 55544 667889999986432110 00 0011 Q ss_pred cccccccccccceeeeeccceeeEEEeecCCccccccccceeccccceeeecccccccccccceeecccccccceeeecc Q lcl|NC_014792. 147 SDKIIAFAKSVNQYPDLGPAWTAEILTTSSGVSGTITLGKIVTDSGILLTEAENSEEAITSLEFQASLQKYAMPGVVALY 226 (659) Q Consensus 147 ~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~g~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~ 226 (659) .| T Consensus 109 ------------------------------------------------------------------------------ky 110 (436) T protein:vir:78 109 ------------------------------------------------------------------------------RC 110 (436) T ss_pred ------------------------------------------------------------------------------ec Confidence 12 Q ss_pred ccccceeEEEEEeecccccccceeeeeeeccccccccceeeeeeeccccccceeeeeccCCceeeeeeeecccccccccc Q lcl|NC_014792. 227 PGEIGSTLEVEIVSKAAYDVGASKMLDIYPNGGSRASVARAVFNYGPQTDDQYAIIVRRDGAIVENVVLSTKEGDKDVYG 306 (659) Q Consensus 227 ~g~~g~~i~V~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~et~~~~~~~~~~~~~~ 306 (659) +|..||.++|.+...... . ..+.+....+...++...+.. T Consensus 111 ~g~~gn~i~v~v~~~~~d-----------------~--------------~~~dv~~~~g~~~~d~~~~~~--------- 150 (436) T protein:vir:78 111 SGIRGNDLKVIVTTNIDD-----------------N--------------AKFDVVTLLDNKKVDTQIAKV--------- 150 (436) T ss_pred CCCCCcEEEEEecccccc-----------------c--------------CceEEEEEecchhhhhhhHHH--------- Confidence 222233332222100000 0 000000000000000000000 Q ss_pred chhhhhhhhhcccccceEEeecccCCccceeEEeecccccccccchhhhhhhHhhhhhcccccceEEEeccccccchhhh Q lcl|NC_014792. 307 NNIYLDDYFAKGTSNYIYATSLNWPKGFAGIINLMGGISANDQVTAGDLMQGWDLFADREALHINLLIAGAVAGEGDATA 386 (659) Q Consensus 307 ~~~~~~~~~~~~~s~~v~~~~~~~~~~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~ 386 (659) + ..+ ..+.|+.......+... ....|.||.++. .++..++..+++.++ .++++.+++|+. . T Consensus 151 ----~-~~l--~~n~~V~~~~~g~la~~-a~~~LtGG~dG~-~~T~~dy~~al~~le---~~~fn~l~~~~~-------d 211 (436) T protein:vir:78 151 ----I-TEL--QDNDYVTWKKEATLEAT-AGLTFTNGTNGE-AVTGTEYQAFLDKIE---SYSFNALGCLAT-------T 211 (436) T ss_pred ----H-hhc--cCCceEEEEeccccccc-ceeeeecccccc-ccchHHHHHHHHHHc---ccceeEEEecCC-------C Confidence 0 000 01123322222222211 235688998875 356788888777664 457899999863 2 Q ss_pred HHHHHHHHHHHHhhCCE-----EEEEecCccccccccccCCHHHHHHHhhccccccccccccccceEEEEcCceeEeccc Q lcl|NC_014792. 387 STVQKHVVSIADERQDC-----LAFISPPKGLLVNVPLTRAVDNLIDWRTGGGSFDTDNMNISTTYAAIDGNYKYQYDKY 461 (659) Q Consensus 387 ~~v~~~l~~~~~~~~~~-----~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~ 461 (659) .+++..+.++++++|+. .+++..... .++....-+. +.+ T Consensus 212 ~~~~~~~~a~ikr~re~~g~~~~aV~~~~~~------------------------------~d~EgIInv~------n~v 255 (436) T protein:vir:78 212 AEIKSLFVEFTKRMRDKVGAKFQTVLYKKND------------------------------ADYEGVVSVE------NKI 255 (436) T ss_pred hHHHHHHHHHHHHHHhhcCCeEEEEecCCCC------------------------------CCCceEEEee------ccc Confidence 46788899999888742 233321000 0011111111 111 Q ss_pred CCcc-eeecHHHHHHHHHHHhhhcCCceECcCCcchhheeccccceeecChhHHHhhhhCCceEEEEEeCCCeEEEEcc- Q lcl|NC_014792. 462 NDVN-RWVPLAADMAGLCARTDDVSQPWMSPPGYNRGQILNVLKLAIEPRQTQRDRMYQEAINPVVGFAGGDGFVLYGD- 539 (659) Q Consensus 462 ~~~~-~~~p~s~~~Ag~~a~~d~~~g~~~span~~~~~i~g~~~~~~~~~~~e~~~Ln~~gin~i~~~~~~~G~~~wG~- 539 (659) .+.. --.-.++.+||++|..+. .+|+.|+.+ .++..+...++++|.+.+..+|.-++. +.+ +++++--+ T Consensus 256 ~g~~~~~~~~~a~vAG~~Ag~~~----~~S~T~~~~---~~~~~v~~~~t~~e~~~ai~~G~lvl~-~d~-~~v~I~~~V 326 (436) T protein:vir:78 256 KDTGLLESSLIYWTTGAIAGCDI----NKSNTNKRY---DGEFDVDVNYTQIHLEEALKTGKFIFH-KVG-DEVHVLEDI 326 (436) T ss_pred CCceechhHHHHHHHHHHhcCcc----ccCccceec---CccccccccCCHHHHHHHHhCCeEEEE-EeC-CeEEEEEcc Confidence 1111 112256889999998864 448888764 456677788999999999999987765 444 45555443 Q ss_pred cccC----CCccccceeehhhHHHHHHHHHHHHHH-HHhcC-CCCHHHHHHHHHHHHHHHHHHHhccceeeeE---EEEc Q lcl|NC_014792. 540 KTAT----KVPSPMDHINVRRLTNMLKKNIGDASK-YKLFE-LNDNFTRASFRMETSQYLDGIRALGGIYEGR---VVCD 610 (659) Q Consensus 540 rT~~----~~~~~~~~i~vrR~~~~i~~~i~~~~~-~~v~e-pn~~~l~~~i~~~i~~~l~~l~~~gal~g~~---v~~d 610 (659) .|+. ..+.+|+.|.++|++|+|.+.+++... .|+++ ||+..-|..++..|+.||++|.+.|+|..|. +.++ T Consensus 327 NTltt~~~~k~~~~~kI~vir~~D~i~~di~~~~~~~yiGKv~N~~dgr~~l~~~i~~yl~~L~~~g~I~~f~~~Dv~v~ 406 (436) T protein:vir:78 327 NTFVSFTDEKNDDFSSNQSVRVLDQIANDIATLFNTKYLGEVPNDKSGRISFWNDVVKHHEQLQNMRAIEDFKADDVSVE 406 (436) T ss_pred ccceecCCCCCcchhhhhHHHHHHHHHHHHHHHhhhccccccCCCHHHHHHHHHHHHHHHHHHHhCCcccCCCCcceEEe Confidence 3432 234689999999999999999998875 59996 6999999999999999999999999999887 3333 Q ss_pred cCCCCHHHhhCCEEEEEEEEEecCCceEEEEEEEEe Q lcl|NC_014792. 611 TTNNTPSVIDRNEFVASIYYKPARSINYIVLNFVAT 646 (659) Q Consensus 611 ~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~ 646 (659) +. + ....+++.+.+.|+..||+|.+++... T Consensus 407 ~~-~-----~~~~v~v~~~v~pvdamekiy~ti~v~ 436 (436) T protein:vir:78 407 PG-S-----DKKTVVVSDAVKVISAMSKLYMTVSVS 436 (436) T ss_pred ec-C-----CCCEEEEEEEEEEEEeeeeEEEEEEEC Confidence 21 1 355688999999999999999998876 No 49 >protein:vir:102359 Length: 356 # NCBI annotation: XkdK protein # Family: family:all:632 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529565;genbank:gi:90592650;genbank:GeneID:3974491 Probab=99.41 E-value=2.4e-13 Score=89.73 Aligned_cols=325 Identities=13% Similarity=0.053 Sum_probs=170.8 Q ss_pred cccceeeeeeeccccccccceeeeeeeccccccceeeeeccCCceeeeee-eeccccccccccchhhhhhhhhcccccce Q lcl|NC_014792. 245 DVGASKMLDIYPNGGSRASVARAVFNYGPQTDDQYAIIVRRDGAIVENVV-LSTKEGDKDVYGNNIYLDDYFAKGTSNYI 323 (659) Q Consensus 245 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~et~~-~~~~~~~~~~~~~~~~~~~~~~~~~s~~v 323 (659) ..+.....-.+.. +.........- +...++..........+. +..-. .........++...+..+ T Consensus 1 ~~glp~i~i~f~~-------~a~ta~~~g~r-Giv~~il~d~~~~~~~~~~~~~v~-~~~~~~n~~~i~~~~~g~----- 66 (356) T protein:vir:10 1 MAGLVNINIEFKE-------LATSFIQRSKA-GIVAIILKDTTKMYKELTSEDDIP-ISLSADNKKYIKYGFVGA----- 66 (356) T ss_pred CCCCCceeEEEee-------cceeeccCCcc-ceEEEEEecCCcceeEEeccccch-hHHHHHHHHHHHHHhhcc----- Confidence 1111000000000 00000000000 001111111111111110 00000 000000111111111110 Q ss_pred EEeecccCCccceeEEeecccccccccchhhhhhhHhhhhhcccccceEEEeccccccchhhhHHHHHHHHHHHHhhCC- Q lcl|NC_014792. 324 YATSLNWPKGFAGIINLMGGISANDQVTAGDLMQGWDLFADREALHINLLIAGAVAGEGDATASTVQKHVVSIADERQD- 402 (659) Q Consensus 324 ~~~~~~~~~~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~v~~~l~~~~~~~~~- 402 (659) ...........++... ..+.+++..+++.+ +.++++.+++|+. ..+++..+.++++++|+ T Consensus 67 -------~~~~~~~~p~~~~~~~--~~t~~~y~~aL~~l---e~~~fn~l~~~~~-------d~~~~~~~~a~ikr~r~~ 127 (356) T protein:vir:10 67 -------TDNEKVLRPSKVIIST--FTEDGKVEDILEEL---ESVEFNYLCMPEA-------IEAEKTKIVTWIKKIREE 127 (356) T ss_pred -------ccccccccceeeeeec--ccCchhHHHHHHHh---cCccceEEEecCC-------ChHHHHHHHHHHHHHHhc Confidence 0000000011111111 11345677766665 4578899999963 23577888888888764 Q ss_pred ---EEEEEecCccccccccccCCHHHHHHHhhccccccccccccccceEEEEcCceeEecccCCcceeecHHHHHHHHHH Q lcl|NC_014792. 403 ---CLAFISPPKGLLVNVPLTRAVDNLIDWRTGGGSFDTDNMNISTTYAAIDGNYKYQYDKYNDVNRWVPLAADMAGLCA 479 (659) Q Consensus 403 ---~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~Ag~~a 479 (659) .+..+-... ....+.++ |+.+ ..++... ..--.-.++.+||++| T Consensus 128 ~~~~~~~V~~~~--------~aD~EgII--------------nv~n--~~~~~g~---------~~t~~~~~~~vAG~~A 174 (356) T protein:vir:10 128 ESTEAKAVLANI--------KADNEAII--------------NFTE--NVVVDGE---------EITAEKYTTRVASLIA 174 (356) T ss_pred CCcEEEEEecCC--------CCCCceeE--------------Eeec--CeEecce---------eechhHHHHHHHHHHh Confidence 333332110 00011111 1111 1111111 0111224678999999 Q ss_pred HhhhcCCceECcCCcchhheeccccceeecChhHHHhhhhCCceEEEEEeCCCeEEE-EcccccC----CCccccceeeh Q lcl|NC_014792. 480 RTDDVSQPWMSPPGYNRGQILNVLKLAIEPRQTQRDRMYQEAINPVVGFAGGDGFVL-YGDKTAT----KVPSPMDHINV 554 (659) Q Consensus 480 ~~d~~~g~~~span~~~~~i~g~~~~~~~~~~~e~~~Ln~~gin~i~~~~~~~G~~~-wG~rT~~----~~~~~~~~i~v 554 (659) ....++ |+.|+.+. ++.. ...+++.|.+.+-.+|.-++. +.+ +.+++ .|-.|+. ..+.+|+.|.+ T Consensus 175 g~~~n~----S~T~~~~~---~~~~-~~~~t~~e~~~ai~~G~lvl~-~d~-~~V~I~~~VNSltt~t~~k~~~f~Kirv 244 (356) T protein:vir:10 175 STPNTQ----SITYAPLD---EVES-IVKIDKASADAKVQAGELILR-RLS-GKIRIARGINSLTTLTAEKGEIFQKIKL 244 (356) T ss_pred ccchhc----cccceecC---Cccc-cccCCHHHHHHHHhCCeEEEE-EEc-CeEEEEecCccceecCCCCCcchhhhHH Confidence 987544 88887654 3322 246889999999999987664 344 33444 3444541 23357999999 Q ss_pred hhHHHHHHHHHHHHHH-HHhcC-CCCHHHHHHHHHHHHHHHHHHHhcccee-eeEEEEccCC--------------CCHH Q lcl|NC_014792. 555 RRLTNMLKKNIGDASK-YKLFE-LNDNFTRASFRMETSQYLDGIRALGGIY-EGRVVCDTTN--------------NTPS 617 (659) Q Consensus 555 rR~~~~i~~~i~~~~~-~~v~e-pn~~~l~~~i~~~i~~~l~~l~~~gal~-g~~v~~d~~~--------------nt~~ 617 (659) .|++|.|.+.+++... .|+++ ||+..-|..++..++.||.+|.+.|+|. +|.+..|.+. ++.. T Consensus 245 vr~~D~i~~Di~~~f~~~yiGKv~N~~dgr~~l~~ai~~y~~~L~~~~~I~~~~~~eid~e~q~~~~~~~g~d~~~~~d~ 324 (356) T protein:vir:10 245 VDTKDLISKDIKNIYVEKYLRKCPNTYDNKCLFIVAVQSYLTELAKQELIDSNFTVEIDLEKQKEYLEGKKIAVSKMKEN 324 (356) T ss_pred HHHHHHHHHHHHHHHhhccccccCCCHHHHHHHHHHHHHHHHHHHhCCccccCceeEecccchHHHhhhccccccccccc Confidence 9999999999998876 69998 5999999999999999999999999996 6777777533 2222 Q ss_pred Hhh----CCEEEEEEEEEecCCceEEEEEEEE Q lcl|NC_014792. 618 VID----RNEFVASIYYKPARSINYIVLNFVA 645 (659) Q Consensus 618 ~i~----~G~~~~~i~~~p~~p~e~i~~~~~~ 645 (659) .+. .-.+.+.+.+.|+-.||.|.+++.. T Consensus 325 ~v~~~~~~~~v~~~~~v~~vdamE~iy~ti~v 356 (356) T protein:vir:10 325 EIKEANTGSNGFYLINLKLVDAMEDINIRVQM 356 (356) T ss_pred eeecccCCcEEEEEEEEEEEeeeeeEEeEEeC Confidence 222 2357899999999999999999887 No 50 >protein:vir:4463 Length: 498 # NCBI annotation: Tail Sheath protein # Family: family:all:369 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700386;genbank:gi:23505458;genbank:GeneID:955665 Probab=99.18 E-value=2.2e-10 Score=73.48 Aligned_cols=436 Identities=13% Similarity=0.091 Sum_probs=205.9 Q ss_pred Cc---------eecCceEEEEec-CCCcccccCCcceEEEeecc---cCCCCccEEeCCHHHHHHHcCCcCCCchhHHHH Q lcl|NC_014792. 1 MA---------LLSPGIELKETT-VQSTVVRNATGRAALVGKFQ---WGPAFQVTQITNEVELVDLFGGPNNITADYFMS 67 (659) Q Consensus 1 ~~---------~~~PGVyveE~~-~~~~~~~~~ts~~afvG~~~---~Gp~~~p~~i~s~~~~~~~fG~~~~~~~~~~~~ 67 (659) |+ ...||+|+| ++ +.... .....-.-+||..- ..|.++|++|+|-.|-...|| ..+.+..++ T Consensus 1 M~IsF~~IP~~iRvP~~y~E-~dns~A~~-~~~~q~vLiiGq~la~gs~~~~~~v~v~s~~~a~~~fG---~GSml~~M~ 75 (498) T protein:vir:44 1 MAISFNSIPSDTRVPLFYAE-MDNSAANT-ARDSGASLLIGHASNDASIAVNSLVLVSSVDYARQICG---AGSQLARMV 75 (498) T ss_pred CCCchhhcCcccccCeEEEE-EeCCCCCC-CcCCcceEEEEecCcccccccceeEeecCHHHHHHhcC---cccHHHHHH Confidence 43 447999995 55 33322 22233556777643 347799999999999999999 667777778 Q ss_pred HHHHHcCC-CeEEEEeccCCcccccccccccccccccccccccccccceeeeeeccccccccceeeeeeccCcceeeeec Q lcl|NC_014792. 68 GMNFLQYG-NDLRTVRVVNRDHAKNASPVAGNIESTIATAGSNYAVGDVIQVKHNQTVVETSGRITKVDVDGKILAVFIP 146 (659) Q Consensus 68 ~~~f~ngG-~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~ 146 (659) +.+..+.- .++|++-+.+.. ...|+. .+..+... . ..+.+. +.-.|......+ T Consensus 76 ~a~~~~n~~~~l~~i~~~D~a-G~aAtg---~it~tg~a-t-------------------~~G~l~-l~Igg~~v~v~V- 129 (498) T protein:vir:44 76 GAYRKTDPFGELYVIAVPEST-GAAATV---ALTVTGEA-T-------------------ETGTVN-VYTGRTRVQAPV- 129 (498) T ss_pred HHHHHhCCCceeEEEecCCcc-cceeEE---EEEeeccc-C-------------------CCcEEE-EEECCEEEEEEe- Confidence 88887655 789999886532 111111 11110000 0 000000 000000000000 Q ss_pred cccccccccccceeeeeccceeeEEEeecCCccccccccceeccccceeeecccccccccccceeecccccccceeeecc Q lcl|NC_014792. 147 SDKIIAFAKSVNQYPDLGPAWTAEILTTSSGVSGTITLGKIVTDSGILLTEAENSEEAITSLEFQASLQKYAMPGVVALY 226 (659) Q Consensus 147 ~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~g~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~ 226 (659) ..+.... .+.. .+..+-......+..+.. ........+.. T Consensus 130 ----------------------------~~gdTaa-~vA~----------al~aaina~~~lPVTA~~-~~~~vtlTAr~ 169 (498) T protein:vir:44 130 ----------------------------TSGDDAA-AVAV----------SIKDAVNANPDLPFTATS-EAGVVTLTARH 169 (498) T ss_pred ----------------------------cCCCCHH-HHHH----------HHHHHHhCCCCCceEEee-ccceEEEEEec Confidence 0000000 0000 000000000000000000 00112223334 Q ss_pred ccccceeEEEEEeecccccccceeeeeeeccccccccceeeeeeeccccccceeeeeccCCceeeeeeeecccccccccc Q lcl|NC_014792. 227 PGEIGSTLEVEIVSKAAYDVGASKMLDIYPNGGSRASVARAVFNYGPQTDDQYAIIVRRDGAIVENVVLSTKEGDKDVYG 306 (659) Q Consensus 227 ~g~~g~~i~V~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~et~~~~~~~~~~~~~~ 306 (659) .|..||.|++.+...+.. T Consensus 170 kG~~GN~I~l~~~~~~~~-------------------------------------------------------------- 187 (498) T protein:vir:44 170 KGLYGNEIPVTLNYYGFG-------------------------------------------------------------- 187 (498) T ss_pred cCcccCcceEEEeeccCc-------------------------------------------------------------- Confidence 444444444432110000 Q ss_pred chhhhhhhhhcccccceEEeecccCCccc-eeEEeecccccccccchhhhhhhHhhhhhcccccceEEEeccccccchhh Q lcl|NC_014792. 307 NNIYLDDYFAKGTSNYIYATSLNWPKGFA-GIINLMGGISANDQVTAGDLMQGWDLFADREALHINLLIAGAVAGEGDAT 385 (659) Q Consensus 307 ~~~~~~~~~~~~~s~~v~~~~~~~~~~~~-~~~~~~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~ 385 (659) .....|.+.. .....+||. ...|+..++.++.. ...+++++|=. + T Consensus 188 -------------------~ge~~p~Glt~titamsgGa------g~PDia~alaal~~---~~~~~i~~p~~------D 233 (498) T protein:vir:44 188 -------------------GGEVLPAGVNITVASGVKGA------GAPALNDAVAAMGD---EPFDYIGLPFN------D 233 (498) T ss_pred -------------------cccccccceeEEEEcccCCc------cCchhHHHHHhhcc---CCccEEEEeec------C Confidence 0000011111 011223332 22355566555542 34578888731 1 Q ss_pred hHHHHHHHHHHHHh---------hCCEEEEEecCccccccccccCCHHHHHHHhhccccccccccccccceEEEEcCcee Q lcl|NC_014792. 386 ASTVQKHVVSIADE---------RQDCLAFISPPKGLLVNVPLTRAVDNLIDWRTGGGSFDTDNMNISTTYAAIDGNYKY 456 (659) Q Consensus 386 ~~~v~~~l~~~~~~---------~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~ 456 (659) .+-..++.+|++. +++.+++... ..+..++..|-.. .++.+..+.|.... T Consensus 234 -~asl~al~~~L~~~sgRw~~~~q~~g~~~~a~----------~gT~a~l~t~g~~----------~N~~~it~~~~~~~ 292 (498) T protein:vir:44 234 -TASVNSMATEMNDSSGRWSYVRQLYGHVYTAK----------TGTLSELVAAGDQ----------FNLQHITLAGYEKD 292 (498) T ss_pred -HHHHHHHHHHHhhhhcchHHHhhcCeEEEEec----------cCCHHHHHHhhhc----------cCCceEEEEecCCC Confidence 2223445555432 3344444321 1245666666543 34666655432110 Q ss_pred EecccCCcceeecHH---HHHHHHHH---HhhhcCCceECcCCcchhheeccc--cceeecChhHHHhhhhCCceEEEEE Q lcl|NC_014792. 457 QYDKYNDVNRWVPLA---ADMAGLCA---RTDDVSQPWMSPPGYNRGQILNVL--KLAIEPRQTQRDRMYQEAINPVVGF 528 (659) Q Consensus 457 ~~d~~~~~~~~~p~s---~~~Ag~~a---~~d~~~g~~~span~~~~~i~g~~--~~~~~~~~~e~~~Ln~~gin~i~~~ 528 (659) ..-|+- +.+|++.| +.|..+ |-|. ..+.|+. .+...++..|++.|..+||.+... T Consensus 293 ---------~~sp~~~~AAa~a~~aA~~l~~DPAr-----PL~t--l~L~Gi~~p~~~~r~~~~ern~LL~~Gist~~V- 355 (498) T protein:vir:44 293 ---------TQTPADELAASRTARAAVFIRNDPAR-----PTQT--GELVDMLPAPKGKRFTTTEQQTLLSHGVATAYV- 355 (498) T ss_pred ---------CCCHHHHHHHHHHHHHHHHhhccccc-----ccCc--eeecccccCCchhcCChHHHHHHHhcCcceEEE- Confidence 011322 23334433 444433 2221 1244554 456778999999999999998854 Q ss_pred eCCCe-EEEEccccc-----C-CCccccceeehhhHHHHHHHHHHHHHHHHhc-CCCCH-----------HHHHHHHHHH Q lcl|NC_014792. 529 AGGDG-FVLYGDKTA-----T-KVPSPMDHINVRRLTNMLKKNIGDASKYKLF-ELNDN-----------FTRASFRMET 589 (659) Q Consensus 529 ~~~~G-~~~wG~rT~-----~-~~~~~~~~i~vrR~~~~i~~~i~~~~~~~v~-epn~~-----------~l~~~i~~~i 589 (659) + .| ..+--.-|. . ..|..|..|++.|+.+|+.+.++......-. +..-+ .+-..|+..+ T Consensus 356 -~-~G~V~I~R~ITTY~~n~~G~~D~syLDi~T~~tl~yvr~~~r~~i~~kfpR~KLa~d~~~~~~gq~IvTp~~ir~el 433 (498) T protein:vir:44 356 -E-SGVLRIQRDITTYRKNAYGVADNSYLDSETLHTSAYVLRRLKSVITSKYGRHKLANDGTRFGSGQAIVTPAVIRGEL 433 (498) T ss_pred -c-CCeEEEEeeeeeeeecCCCCcchhhhhhhhHHHHHHHHHHHHHHhhhhcCCcccccCCcccCCCcccccHHHHHHHH Confidence 2 34 333222232 1 2234699999999999999999987754322 22111 2667899999 Q ss_pred HHHHHHHHhccceeee---E----EEEccCCCCHHHhhCCEEEEEEEEEecCC----ceEEEEEEEEeecCe Q lcl|NC_014792. 590 SQYLDGIRALGGIYEG---R----VVCDTTNNTPSVIDRNEFVASIYYKPARS----INYIVLNFVATSTGA 650 (659) Q Consensus 590 ~~~l~~l~~~gal~g~---~----v~~d~~~nt~~~i~~G~~~~~i~~~p~~p----~e~i~~~~~~~~~~~ 650 (659) -.-+++|...|-+..+ + |+-|.++ ..|+++.+-...+-+ +-.|.|+++.....+ T Consensus 434 i~~y~~le~~givEn~~~~~~~LiVerd~~d-------pnRln~~~p~d~vn~L~V~A~~~~f~lq~~~~~~ 498 (498) T protein:vir:44 434 GSTYRQMEREGIVENFDLFQQHLIVERNAND-------SNRLDVLFPPDYVNQLRVFAVLNQFRLQYSEEAA 498 (498) T ss_pred HHHHHhhhhhccccChhhhcceeEEEECCCC-------CcEEEEEecccccCchhhhhhhhhhhhhhhhhcC Confidence 9999999999988763 2 3333322 235555553333333 233444555444444 No 51 >protein:vir:4517 Length: 498 # NCBI annotation: tail sheath protein # Family: family:all:369 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599043;genbank:gi:19549001;genbank:GeneID:935236 Probab=99.17 E-value=2.6e-10 Score=73.14 Aligned_cols=437 Identities=13% Similarity=0.077 Sum_probs=207.4 Q ss_pred Cc---------eecCceEEEEecCCCcccccCCcceEEEeec---ccCCCCccEEeCCHHHHHHHcCCcCCCchhHHHHH Q lcl|NC_014792. 1 MA---------LLSPGIELKETTVQSTVVRNATGRAALVGKF---QWGPAFQVTQITNEVELVDLFGGPNNITADYFMSG 68 (659) Q Consensus 1 ~~---------~~~PGVyveE~~~~~~~~~~~ts~~afvG~~---~~Gp~~~p~~i~s~~~~~~~fG~~~~~~~~~~~~~ 68 (659) |+ ...||+|+|-=.+... ......-.-+||.. ...+.++|++|+|-.|-...|| ..+.+..+++ T Consensus 1 M~IsF~~IP~~iRvP~~y~E~dns~A~-~~~~~q~vLiiGq~la~gs~~~~~~v~v~s~~~a~~lfG---~GSml~~M~~ 76 (498) T protein:vir:45 1 MTISFNTIPSNTLVPLFYAEMDNQAAN-TAQDSGASLLIGHANNGAEIVANSLVLMPSADYARQICG---AGSQLARMVE 76 (498) T ss_pred CCCchhhcCcccccCeEEEEEeCCCCC-CCCCCcceEEEEecCCccccccceeEEecCHHHHHHhcC---cCcHHHHHHH Confidence 43 4479999953334442 22223456678875 3447799999999999999999 6677777788 Q ss_pred HHHHcCC-CeEEEEeccCCcccccccccccccccccccccccccccceeeeeeccccccccceeeeeeccCcceeeeecc Q lcl|NC_014792. 69 MNFLQYG-NDLRTVRVVNRDHAKNASPVAGNIESTIATAGSNYAVGDVIQVKHNQTVVETSGRITKVDVDGKILAVFIPS 147 (659) Q Consensus 69 ~~f~ngG-~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~ 147 (659) .+..+.- .++|++-+.+.. ...|+. .+..+... ... ..+.+ .-.|......+ T Consensus 77 a~~~~n~~~~l~~i~~~d~a-G~aA~g---~it~tg~a-t~~----G~l~l----------------~Igg~~v~v~V-- 129 (498) T protein:vir:45 77 AYRQTDPFGELYVIAVPEAT-GAAATV---TLTVTGEA-TES----GTVNV----------------YVGRTRVQAPV-- 129 (498) T ss_pred HHHHhCCcceEEEEeeCCcc-cceeEE---EEEeeccc-CCC----cEEEE----------------EECCEEEEEEe-- Confidence 8876654 689999886532 111111 11110000 000 00000 00000000000 Q ss_pred ccccccccccceeeeeccceeeEEEeecCCccccccccceeccccceeeecccccccccccceeecccccccceeeeccc Q lcl|NC_014792. 148 DKIIAFAKSVNQYPDLGPAWTAEILTTSSGVSGTITLGKIVTDSGILLTEAENSEEAITSLEFQASLQKYAMPGVVALYP 227 (659) Q Consensus 148 ~~~~~~~~~~~~~~~~~~~~~~~v~~~~~g~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~ 227 (659) ..+.... .+.. .+..+-......+..+. ......+..+... T Consensus 130 ---------------------------~~gdTaa-~vA~----------al~aaina~~~lPVTA~-~~~~~VtlTAr~k 170 (498) T protein:vir:45 130 ---------------------------TNGDNVT-TIAS----------SIQDAINAVPTLPFTAS-SSAGVVTLTARHK 170 (498) T ss_pred ---------------------------cCCCCHH-HHHH----------HHHHHHhCCCCCceEEE-ecCceEEEEeecc Confidence 0000000 0000 00000000000000000 0001122333444 Q ss_pred cccceeEEEEEeecccccccceeeeeeeccccccccceeeeeeeccccccceeeeeccCCceeeeeeeeccccccccccc Q lcl|NC_014792. 228 GEIGSTLEVEIVSKAAYDVGASKMLDIYPNGGSRASVARAVFNYGPQTDDQYAIIVRRDGAIVENVVLSTKEGDKDVYGN 307 (659) Q Consensus 228 g~~g~~i~V~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~et~~~~~~~~~~~~~~~ 307 (659) |..||.|++.+...... .| T Consensus 171 G~~GN~I~l~~~~~~~~-----------------------------------------~g-------------------- 189 (498) T protein:vir:45 171 GLCGNEIPVSLNYYGFG-----------------------------------------GG-------------------- 189 (498) T ss_pred CccccceeEEEeecccc-----------------------------------------cc-------------------- Confidence 44455444433211000 00 Q ss_pred hhhhhhhhhcccccceEEeecccCCccc-eeEEeecccccccccchhhhhhhHhhhhhcccccceEEEeccccccchhhh Q lcl|NC_014792. 308 NIYLDDYFAKGTSNYIYATSLNWPKGFA-GIINLMGGISANDQVTAGDLMQGWDLFADREALHINLLIAGAVAGEGDATA 386 (659) Q Consensus 308 ~~~~~~~~~~~~s~~v~~~~~~~~~~~~-~~~~~~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~ 386 (659) ...|.+.. .....+||. ...|+..++.++. ....+++++|=. +. T Consensus 190 --------------------e~~p~Glt~~itamagGa------g~PD~a~alaal~---~~~~~~I~~p~~------D~ 234 (498) T protein:vir:45 190 --------------------EVLPAGVQIAVATGTAGT------GAPVLTGAVAAMA---DEPFDYIGLPFN------DT 234 (498) T ss_pred --------------------ccccceeeEEEEccCCCc------cCchhHHHHHHhc---cCCccEEEEeeC------CH Confidence 00011110 111223332 1224555555554 234578888732 21 Q ss_pred HHHHHHHHHHHH---------hhCCEEEEEecCccccccccccCCHHHHHHHhhccccccccccccccceEEEEcCceeE Q lcl|NC_014792. 387 STVQKHVVSIAD---------ERQDCLAFISPPKGLLVNVPLTRAVDNLIDWRTGGGSFDTDNMNISTTYAAIDGNYKYQ 457 (659) Q Consensus 387 ~~v~~~l~~~~~---------~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~ 457 (659) +-..++.+|++ ++++.+++.-- ..+..++..|-. ..++.+..+.|...- T Consensus 235 -asL~al~~~L~~~sgRw~~~~q~~g~~~~a~----------~gT~~~l~t~g~----------~~N~~~it~~~~~~~- 292 (498) T protein:vir:45 235 -ASVNTLVTEMNDTSGRWSYARQLYGHVYTAK----------TGTLSELVNAGD----------QFNQQHITLAGYEKE- 292 (498) T ss_pred -HHHHHHHHHHhhhhhhhhHHhhcCeEEEEec----------cCCHHHHHHhhh----------ccCCceEEEEecCCC- Confidence 22244444443 23344444321 124566666654 345667665432110 Q ss_pred ecccCCcceeecHH---HHHHHHHH---HhhhcCCceECcCCcchhheeccc--cceeecChhHHHhhhhCCceEEEEEe Q lcl|NC_014792. 458 YDKYNDVNRWVPLA---ADMAGLCA---RTDDVSQPWMSPPGYNRGQILNVL--KLAIEPRQTQRDRMYQEAINPVVGFA 529 (659) Q Consensus 458 ~d~~~~~~~~~p~s---~~~Ag~~a---~~d~~~g~~~span~~~~~i~g~~--~~~~~~~~~e~~~Ln~~gin~i~~~~ 529 (659) ..-|+- +.+|++.| +.|..+ .--... +.|+. .+...++..|++.|...||.+... T Consensus 293 --------~~sp~~~~AAa~aa~~A~~l~~DPAr----PL~tl~---L~Gi~~p~~~~r~~~~ern~LL~~Gist~~V-- 355 (498) T protein:vir:45 293 --------TQTPADELAASRTARAAVFIRNDPAR----PTQTGE---LVGMLPAPKGKRFTMTEQQTLLSHGVATAYV-- 355 (498) T ss_pred --------CCChHHHHHHHHHHHHHHHhhccccc----ccCcee---ecceecCCchhcCChHHHHHHHhCCcceEEE-- Confidence 111332 33333444 445433 112222 34544 556778999999999999998854 Q ss_pred CCCe-EEEEccccc-----C-CCccccceeehhhHHHHHHHHHHHHHHHHhc-CCCCHH-----------HHHHHHHHHH Q lcl|NC_014792. 530 GGDG-FVLYGDKTA-----T-KVPSPMDHINVRRLTNMLKKNIGDASKYKLF-ELNDNF-----------TRASFRMETS 590 (659) Q Consensus 530 ~~~G-~~~wG~rT~-----~-~~~~~~~~i~vrR~~~~i~~~i~~~~~~~v~-epn~~~-----------l~~~i~~~i~ 590 (659) + .| ..+--.-|. . ..|..|..|++.|+.+|+.+.++......-. +..-.. +-..|+..+- T Consensus 356 ~-~G~V~I~R~ITTY~~n~~G~~D~syLDi~T~~tl~yvr~~~r~~i~~kfpR~KLa~dg~~~~~gq~IvTp~~ir~ell 434 (498) T protein:vir:45 356 E-SGVLRIQRDVTTYRKNAYGVADNSYLDSETLHTSAYVLRKLKSVITSKYGRHKLASDGTRFGPGQAIVTPAVIKGELL 434 (498) T ss_pred c-CCeEEEEeeeeeeeecCCCCcchhhhhhhhHHHHHHHHHHHHHHhhhhcCCeeecccCcccCCCCcccchHHHHHHHH Confidence 2 34 333222232 1 2234699999999999999999987765422 111111 5678999999 Q ss_pred HHHHHHHhccceeee---E----EEEccCCCCHHHhhCCEEEEEEEEEecCC----ceEEEEEEEEeecCe Q lcl|NC_014792. 591 QYLDGIRALGGIYEG---R----VVCDTTNNTPSVIDRNEFVASIYYKPARS----INYIVLNFVATSTGA 650 (659) Q Consensus 591 ~~l~~l~~~gal~g~---~----v~~d~~~nt~~~i~~G~~~~~i~~~p~~p----~e~i~~~~~~~~~~~ 650 (659) .-+++|..+|-+..+ + |+-|.++ ..|+++.+-...+-+ +-.|.|+++.....+ T Consensus 435 ~~y~~le~~givEn~~~~~~~LiVerd~~d-------pnRln~~~p~d~vn~L~V~A~~~~f~lq~~~~~~ 498 (498) T protein:vir:45 435 ATYRQLERAGIVENYELFKQYLVVERDASV-------PNRLNTLFPPDYVNQLRVFAVVNQFRLQYSEESA 498 (498) T ss_pred HHHHhhhhhccccChhhhcceeEEEECCCC-------CcEEEEEecccccCchhhhhhhhhhheehhhcCC Confidence 999999999988763 2 4443322 235555553333333 333445555544444 No 52 >protein:vir:489 Length: 498 # NCBI annotation: putative sheath protein # Family: family:all:369 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543096;swissprot:trembl:q8w623;genbank:gi:18249908;uniprot:Q8W623;genbank:GeneID:929698 Probab=99.15 E-value=4.2e-10 Score=72.02 Aligned_cols=438 Identities=13% Similarity=0.071 Sum_probs=207.6 Q ss_pred Cc---------eecCceEEEEecCCCcccccCCcceEEEeecc---cCCCCccEEeCCHHHHHHHcCCcCCCchhHHHHH Q lcl|NC_014792. 1 MA---------LLSPGIELKETTVQSTVVRNATGRAALVGKFQ---WGPAFQVTQITNEVELVDLFGGPNNITADYFMSG 68 (659) Q Consensus 1 ~~---------~~~PGVyveE~~~~~~~~~~~ts~~afvG~~~---~Gp~~~p~~i~s~~~~~~~fG~~~~~~~~~~~~~ 68 (659) |+ +..||+|+|--.+.... ...+.-.-+||..- ..|.++|++|+|-.|-...|| ..+.+..+++ T Consensus 1 M~IsF~~IP~~iRvP~~y~E~dns~A~~-~~~~qrvLiiGq~la~gt~~~~~~v~v~s~~~a~~~fG---~GS~l~~M~~ 76 (498) T protein:vir:48 1 MTISFSAVPSDTLVPLFYAEMDNSAANT-AVTSAPALLIGHASNDAAIEVNSLVLMPSADYARQICG---AGSQLARMVD 76 (498) T ss_pred CCccccccCcccccceEEEEEecCCCcc-ccCCcceEEEeecCccccccccceEEecCHHHHHHhcC---cccHHHHHHH Confidence 33 45799999543444322 22234566778643 447799999999999999999 6667777777 Q ss_pred HHHHcCC-CeEEEEeccCCcccccccccccccccccccccccccccceeeeeeccccccccceeeeeeccCcceeeeecc Q lcl|NC_014792. 69 MNFLQYG-NDLRTVRVVNRDHAKNASPVAGNIESTIATAGSNYAVGDVIQVKHNQTVVETSGRITKVDVDGKILAVFIPS 147 (659) Q Consensus 69 ~~f~ngG-~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~ 147 (659) .|..+.- .++|++-+.+.. ...|+. .+..+... .. .+. ..+.-.|......+ T Consensus 77 a~~~~n~~~~l~~i~~~D~a-g~aA~g---~it~tg~a-t~-------------------~G~-l~l~Igg~~v~v~V-- 129 (498) T protein:vir:48 77 VYRQTDPFGELYVIAVPEAR-GAAATV---RVTVTGEA-EE-------------------SGT-LSLYVGRSSVQVPV-- 129 (498) T ss_pred HHHHhCCCceeEEEeeCCcc-cceeEE---EEEecccc-cC-------------------Cce-EEEEECCEEEEEee-- Confidence 7776655 789999986532 111111 11110000 00 000 00000000000000 Q ss_pred ccccccccccceeeeeccceeeEEEeecCCccccccccceeccccceeeecccccccccccceeecccccccceeeeccc Q lcl|NC_014792. 148 DKIIAFAKSVNQYPDLGPAWTAEILTTSSGVSGTITLGKIVTDSGILLTEAENSEEAITSLEFQASLQKYAMPGVVALYP 227 (659) Q Consensus 148 ~~~~~~~~~~~~~~~~~~~~~~~v~~~~~g~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~ 227 (659) ..+... ..+.. .+..+-......+..+.. .....+..+... T Consensus 130 ---------------------------~~gdTa-a~vA~----------al~aai~a~~~lPVTA~~-~~~~VtlTAr~k 170 (498) T protein:vir:48 130 ---------------------------VNGDDA-TAVAT----------AIKEAVNGVITLPFAASS-DAGVVTLTARHK 170 (498) T ss_pred ---------------------------cCCCCH-HHHHH----------HHHHHHhCCCCcceEEEe-cCcEEEEEeeec Confidence 000000 00000 000000000000000000 001112223333 Q ss_pred cccceeEEEEEeecccccccceeeeeeeccccccccceeeeeeeccccccceeeeeccCCceeeeeeeeccccccccccc Q lcl|NC_014792. 228 GEIGSTLEVEIVSKAAYDVGASKMLDIYPNGGSRASVARAVFNYGPQTDDQYAIIVRRDGAIVENVVLSTKEGDKDVYGN 307 (659) Q Consensus 228 g~~g~~i~V~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~et~~~~~~~~~~~~~~~ 307 (659) |..||.|++.+...+.. T Consensus 171 G~~GN~I~l~~~~~~~~--------------------------------------------------------------- 187 (498) T protein:vir:48 171 GLYGNELPVCLNYYGSG--------------------------------------------------------------- 187 (498) T ss_pred ccccccceeeeeeccCc--------------------------------------------------------------- Confidence 44444433332110000 Q ss_pred hhhhhhhhhcccccceEEeecccCCccce-eEEeecccccccccchhhhhhhHhhhhhcccccceEEEeccccccchhhh Q lcl|NC_014792. 308 NIYLDDYFAKGTSNYIYATSLNWPKGFAG-IINLMGGISANDQVTAGDLMQGWDLFADREALHINLLIAGAVAGEGDATA 386 (659) Q Consensus 308 ~~~~~~~~~~~~s~~v~~~~~~~~~~~~~-~~~~~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~ 386 (659) .....|.+..- ....+||.. ..|+..++.++. ....+++++|=. +. T Consensus 188 ------------------~ge~~p~Glt~~itamsgGag------~PDia~aLaal~---~~~~~~I~~p~~------D~ 234 (498) T protein:vir:48 188 ------------------GGEILPAGLQVVTEAGTAGSG------APDLTAAVAAMG---DEAFDFIGLPFN------DA 234 (498) T ss_pred ------------------ccccccceeeEEEEcccCCcc------CcchHHHHHhhc---cCCccEEEEeec------CH Confidence 00000111111 112334422 224555555543 234578888732 22 Q ss_pred HHHHHHHHHHHHh---------hCCEEEEEecCccccccccccCCHHHHHHHhhccccccccccccccceEEEEcCceeE Q lcl|NC_014792. 387 STVQKHVVSIADE---------RQDCLAFISPPKGLLVNVPLTRAVDNLIDWRTGGGSFDTDNMNISTTYAAIDGNYKYQ 457 (659) Q Consensus 387 ~~v~~~l~~~~~~---------~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~ 457 (659) +-..++.+|++. +++.+++.-- ..+..++..|-. ..++.+..+.+- T Consensus 235 -asl~al~~~L~~~sgRw~~~~q~~g~~~~a~----------~gT~~~l~t~g~----------~~N~~~it~~~~---- 289 (498) T protein:vir:48 235 -ASINMMMTEMNDSSGRWSYARQLYGHVYTAK----------LGTLSELVNAGD----------MHNQQHITLAGY---- 289 (498) T ss_pred -HHHHHHHHHHhhhhhhhhHHhhcCeEEEEec----------cCCHHHHHHhhh----------ccCCceEEEEec---- Confidence 223445555532 3344444321 124566666654 345666665431 Q ss_pred ecccCCcceeecHH---HHHHHHHH---HhhhcCCceECcCCcchhheeccc--cceeecChhHHHhhhhCCceEEEEEe Q lcl|NC_014792. 458 YDKYNDVNRWVPLA---ADMAGLCA---RTDDVSQPWMSPPGYNRGQILNVL--KLAIEPRQTQRDRMYQEAINPVVGFA 529 (659) Q Consensus 458 ~d~~~~~~~~~p~s---~~~Ag~~a---~~d~~~g~~~span~~~~~i~g~~--~~~~~~~~~e~~~Ln~~gin~i~~~~ 529 (659) ++.. .-|+. +.+|++.| +.|..+ |-|. ..+.|+. .+...++..|++.|..+||.+... . T Consensus 290 ----~~~~-~~p~~~~AAa~a~~aA~~l~~DPAr-----PLqt--l~L~Gi~~p~~~~r~~~~ern~LL~~Gist~~V-~ 356 (498) T protein:vir:48 290 ----EKET-QSPVDELVASRLAREAVFIRNDPAR-----PTQT--GELVGMLPAPKGKRFIMTEQQTLLSHGVATAYV-E 356 (498) T ss_pred ----CCCC-CChHHHHHHHHHHHHHHhhhccccc-----cccc--eeeeccccCCchhcCChHHHHHHHhcCcceEEE-c Confidence 1111 11332 23333333 455433 2221 1244554 556778999999999999998865 4 Q ss_pred CCCeEEEEccccc-----C-CCccccceeehhhHHHHHHHHHHHHHHHHhc-CCCCHH-----------HHHHHHHHHHH Q lcl|NC_014792. 530 GGDGFVLYGDKTA-----T-KVPSPMDHINVRRLTNMLKKNIGDASKYKLF-ELNDNF-----------TRASFRMETSQ 591 (659) Q Consensus 530 ~~~G~~~wG~rT~-----~-~~~~~~~~i~vrR~~~~i~~~i~~~~~~~v~-epn~~~-----------l~~~i~~~i~~ 591 (659) + +-..+--..|. . ..|..|..|++.|+.+|+.+.++......-. +..-.. +-..|+..+-. T Consensus 357 ~-G~V~I~R~ITTY~~n~~G~~D~syLDi~T~~tl~yvr~~~r~~i~~kfpR~KLa~dg~~~~~gq~IvTp~~ir~eli~ 435 (498) T protein:vir:48 357 G-GTLRIQRSVTTYKKNAYGVADNSYLDSETLHTSAYVLRKLKSVITSKYGRHKLANDGTRFGPGQAIVTPAVIKGELLA 435 (498) T ss_pred C-CeEEEEeeeeeeeecCCCCcchhhhhhhhHHHHHHHHHHHHHHhhhhcCCceecccCcccCCCCcccchHHHHHHHHH Confidence 4 23444443333 1 2234699999999999999999987765332 122111 56789999999 Q ss_pred HHHHHHhccceeee---E----EEEccCCCCHHHhhCCEEEEEEEEEecCC----ceEEEEEEEEeecCe Q lcl|NC_014792. 592 YLDGIRALGGIYEG---R----VVCDTTNNTPSVIDRNEFVASIYYKPARS----INYIVLNFVATSTGA 650 (659) Q Consensus 592 ~l~~l~~~gal~g~---~----v~~d~~~nt~~~i~~G~~~~~i~~~p~~p----~e~i~~~~~~~~~~~ 650 (659) -+++|..+|-+..+ + |+-|.++ ..|+++.+-...+-+ +-.|.|+++...+.+ T Consensus 436 ~y~~le~~given~~~~~~~LiVerd~~d-------pnRln~~~p~d~vn~L~V~A~~~~f~lq~~~~~~ 498 (498) T protein:vir:48 436 TYRQMERAGIVENYDLFKQYLIVERDADN-------PNRLNTLFPPDYVNQLRVFAVVNQFRLQYSEESA 498 (498) T ss_pred HHHhhhhhccccChhhhcceeEEEECCCC-------CcEEEEEecccccCchhhhhhhhhhhhhhhhcCC Confidence 99999999988763 2 3333322 235555553333333 233445555444444 No 53 >protein:vir:3788 Length: 376 # NCBI annotation: tail sheath # Family: family:all:669 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536828;genbank:gi:17981837;genbank:GeneID:929216 Probab=99.03 E-value=2.6e-09 Score=67.61 Aligned_cols=358 Identities=10% Similarity=0.032 Sum_probs=178.9 Q ss_pred ccceeEEEEEeecccccccceeeeeeeccccccccceeeeeeeccccccceeeeeccCCceeeeeeeeccccccccccch Q lcl|NC_014792. 229 EIGSTLEVEIVSKAAYDVGASKMLDIYPNGGSRASVARAVFNYGPQTDDQYAIIVRRDGAIVENVVLSTKEGDKDVYGNN 308 (659) Q Consensus 229 ~~g~~i~V~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~et~~~~~~~~~~~~~~~~ 308 (659) -|+ +|.+............+...+.. +..+..+.+. .+.+... .-++... +..+ .... T Consensus 1 ~~~---~v~vn~~n~~~g~~~~~er~~Lf-----------ig~~~~~~~~-~~~~~~~----sdld~~l--g~~~-~~lk 58 (376) T protein:vir:37 1 MFP---SVQINALNQLSGETKEIERHALF-----------VGVGTTNQGK-LLALTPD----SDFDKVF--GETD-TDLK 58 (376) T ss_pred CCC---eEEEecccccCCCcccccceEEe-----------eccccccccc-eeeecCc----cchHhhh--CCCc-hHHH Confidence 111 11111110000000000000000 0000000000 0000000 0000001 1111 1112 Q ss_pred hhhhhhhhcccccceEEeecccCCccceeEEeecccccccccchhhhhhhHhhhhhcccccceEEEeccccccchhhhHH Q lcl|NC_014792. 309 IYLDDYFAKGTSNYIYATSLNWPKGFAGIINLMGGISANDQVTAGDLMQGWDLFADREALHINLLIAGAVAGEGDATAST 388 (659) Q Consensus 309 ~~~~~~~~~~~s~~v~~~~~~~~~~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~ 388 (659) ..+.....|++..|....-. +. ....++.++++... +.+.+..+.+-+........-.+ T Consensus 59 ~~v~aa~~naG~~~~~~~~~-----------~~--------~~~~~~~~Av~~a~--~~~s~E~V~v~~pv~t~~a~i~a 117 (376) T protein:vir:37 59 KQVRAAMLNAGQNWFAHVYI-----------AQ--------EDGYDFVECVKKAN--QTASFEYCVNTRYLGVDKASIGK 117 (376) T ss_pred HHHHHHHhCCCCcEEEEEEe-----------ec--------CCchHHHHHHHHhh--hhcCceEEEEeccccccHHHHHH Confidence 23344445555554211110 00 01123445544321 22333343333322111122233 Q ss_pred HHHHHHHHHHh-hCCEEEEEecCccccccccccCCHHHHHHHhhccccccccccccccceEEEEcCceeEecccCCccee Q lcl|NC_014792. 389 VQKHVVSIADE-RQDCLAFISPPKGLLVNVPLTRAVDNLIDWRTGGGSFDTDNMNISTTYAAIDGNYKYQYDKYNDVNRW 467 (659) Q Consensus 389 v~~~l~~~~~~-~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~ 467 (659) .++....+..+ +|-.|.++..+.- ..+.... ++..+|..... ..+..+.+.+..++. ..| + T Consensus 118 a~~~a~el~~~~~Rpv~file~r~~-~~~~~~~---e~w~~y~~~~~---al~~gia~~~V~~V~---~~~----g---- 179 (376) T protein:vir:37 118 LQECYAELLAKFGRRTFFIQAVQGI-NHDQSDG---ETWDQYVQKLT---TLQQTIVADHVCLVP---LLF----G---- 179 (376) T ss_pred HHHHHHHHHHhcCCeEEEEEeccCc-Ccccccc---cCHHHHHHHHH---Hhhcccccccceeee---eeh----h---- Confidence 33333344344 4667788876521 1111112 23334433222 122233444443321 011 0 Q ss_pred ecHHHHHHHHHHHhhhcCCceECcCCcchhheeccccce-------eecChhHHHhhhhCCceEEEEEeCCCeEEEEccc Q lcl|NC_014792. 468 VPLAADMAGLCARTDDVSQPWMSPPGYNRGQILNVLKLA-------IEPRQTQRDRMYQEAINPVVGFAGGDGFVLYGDK 540 (659) Q Consensus 468 ~p~s~~~Ag~~a~~d~~~g~~~span~~~~~i~g~~~~~-------~~~~~~e~~~Ln~~gin~i~~~~~~~G~~~wG~r 540 (659) ...|.+||++++- ..-++.||.-..-+.+.|....+ ..++...++.|..+|..+.+.|++-.|+.+-..| T Consensus 180 -n~~G~~aGRl~~a--aVsVadspgRV~tG~l~gl~~~~lp~d~~~~~l~~a~l~aLd~agy~vp~~Y~gy~G~Y~~d~~ 256 (376) T protein:vir:37 180 -NETGVLAGRLANR--AVTVADSPARVQTGALVSLGSANKPLDKDRNELTLAHLKSLETARYSVPMWYPDYDGYYWADGR 256 (376) T ss_pred -hhHHHHHHHHhhc--ccchhhCccceeccccccccccccccCcCcccCCHHHHHHHHhCCCeEEEeeCCCCceEEeCce Confidence 2357888887643 22357788877766666543222 3567888899999999999999998999888889 Q ss_pred ccCCCccccceeehhhHHHHHHHHHHHHHHHHhcCCC---CHHHHHHHHHHHHHHHHHHHhccceeee----EEEEccC- Q lcl|NC_014792. 541 TATKVPSPMDHINVRRLTNMLKKNIGDASKYKLFELN---DNFTRASFRMETSQYLDGIRALGGIYEG----RVVCDTT- 612 (659) Q Consensus 541 T~~~~~~~~~~i~vrR~~~~i~~~i~~~~~~~v~epn---~~~l~~~i~~~i~~~l~~l~~~gal~g~----~v~~d~~- 612 (659) |+....++|++|..+|..+-+.+.++..+-.++...- .+.-.+..+.-+..-|++|.+..-+.|. +|...++ T Consensus 257 tl~~~gsDY~~ie~~RVvdKa~R~vR~~ai~~i~D~~lnst~~sia~~~~yi~~pLr~M~~s~~i~g~~fpGeI~~p~d~ 336 (376) T protein:vir:37 257 TLDVEGGDYQVIENLRVVDKVARKVRLLAIGKIADRSFNSTTSSTEYHKNYFAKPLRDMSKSATINGKDFPGECMPPKDD 336 (376) T ss_pred EeccCCCChhhhhhhhHHHHHHHHHHHHHHHHhCCcccCcchhhHHHHHHHHHHHHHHHHhcchhccccccceeecCCCC Confidence 9998888999999999999999988887777765432 3334556666677789999998888883 4555432 Q ss_pred CCCHHHhhCCEEEEEEEEEecCCceEEEEEEE--EeecCe Q lcl|NC_014792. 613 NNTPSVIDRNEFVASIYYKPARSINYIVLNFV--ATSTGA 650 (659) Q Consensus 613 ~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~--~~~~~~ 650 (659) .-+..-+...++.|.+.+.|.--..+|+..|. -+..+. T Consensus 337 Di~i~w~s~~~V~I~~~v~P~~~pk~Itv~I~Ldlsn~~~ 376 (376) T protein:vir:37 337 AITIVWQSKTKVTIYIKVRPYDCPKEITANIFLDLDSLGE 376 (376) T ss_pred CceEEeeccceEEEEEEEEeccCCceEEEEEEeecCCCCC Confidence 22333347788999999999999999986644 332232 No 54 >protein:vir:78782 Length: 370 # NCBI annotation: putative tail sheath protein # Family: family:all:669 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285652;genbank:gi:148727158;genbank:GeneID:5220132 Probab=98.87 E-value=4.3e-09 Score=66.45 Aligned_cols=352 Identities=9% Similarity=-0.028 Sum_probs=173.5 Q ss_pred ccceeEEEEEeecccccccceeeeeeeccccccccceeeeeeeccccccceeeeeccCCceeeeeeeeccccccccccch Q lcl|NC_014792. 229 EIGSTLEVEIVSKAAYDVGASKMLDIYPNGGSRASVARAVFNYGPQTDDQYAIIVRRDGAIVENVVLSTKEGDKDVYGNN 308 (659) Q Consensus 229 ~~g~~i~V~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~et~~~~~~~~~~~~~~~~ 308 (659) -|+ .|.+............+...+. . +..+....+. .+.+... .-++..+ ++.+. ... T Consensus 1 ~~~---~v~vn~~n~~~g~~~~~er~~l----------f-ig~~~~~~g~-~~~~~~~----sdld~~l--~~~ds-~lk 58 (370) T protein:vir:78 1 MWP---YVQIYNLNQMQGPVTEVERHLL----------F-IGSAASNTGK-LLSLNAQ----SDFDQLL--GAADS-ELK 58 (370) T ss_pred CCc---eEEEeeccccCCCcCccceeEE----------E-Eecccccccc-eEeecCc----cCHHHhc--CCcCh-hHH Confidence 111 1111111000000000000000 0 0000000000 0000000 0000001 11111 112 Q ss_pred hhhhhhhhcccccceEEeecccCCccceeEEeecccccccccchhhhhhhHhhhhhcccccceEEEeccccccchhhhHH Q lcl|NC_014792. 309 IYLDDYFAKGTSNYIYATSLNWPKGFAGIINLMGGISANDQVTAGDLMQGWDLFADREALHINLLIAGAVAGEGDATAST 388 (659) Q Consensus 309 ~~~~~~~~~~~s~~v~~~~~~~~~~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~ 388 (659) ..+.....|++..|-. +. ..+. +..++.++++... +...+-.+.+-+-. +..+ T Consensus 59 ~~v~aa~~naG~~~~~-~~----------~p~~---------~~~d~~~Av~~a~--~~~s~E~V~v~~~~-----s~~a 111 (370) T protein:vir:78 59 ANLLAARDNAGQNWSA-AA----------YVLP---------TDKPWLDAARDAQ--QTQSFEGVVVLGQE-----WHQA 111 (370) T ss_pred HHHHHHHhCCCCceEE-EE----------EEec---------CchhHHHHHHHHH--hhCCccEEEEecCc-----chHH Confidence 2334445555555421 11 1111 2234555554432 23333344433321 1223 Q ss_pred HHHHHHHHHH----hh-CCEEEEEecCccccccccccCCHHHHHHHhhccccccccccccccceEEEEcCceeEecccCC Q lcl|NC_014792. 389 VQKHVVSIAD----ER-QDCLAFISPPKGLLVNVPLTRAVDNLIDWRTGGGSFDTDNMNISTTYAAIDGNYKYQYDKYND 463 (659) Q Consensus 389 v~~~l~~~~~----~~-~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~ 463 (659) .+.+|.++++ ++ |-.|.++..+.-. .+ ++..+|..... ..+..+.+.+..++--|.. T Consensus 112 ~~~a~~~~a~el~n~~~Rpv~file~~~~~-----~~---e~w~~y~~~l~---al~~gia~~~V~vvp~~~g------- 173 (370) T protein:vir:78 112 AINAAHALNQELIAKWGRWQFMLLAVPAIA-----DE---QDWATYEAELA---TLQDGIAASSVSLIPQLWP------- 173 (370) T ss_pred HHHHHHHHHHHHHHhcCCeEEEEEeecCCC-----Cc---CCHHHHHHHHH---HhhhccccccceEEeeecc------- Confidence 3344444443 33 5577777665321 11 23334433221 1223444556555522211 Q ss_pred cceeecHHHHHHHHHHHhhhcCCceECcCCcchhheeccc-----cceeecChhHHHhhhhCCceEEEEEeCCCeEEEEc Q lcl|NC_014792. 464 VNRWVPLAADMAGLCARTDDVSQPWMSPPGYNRGQILNVL-----KLAIEPRQTQRDRMYQEAINPVVGFAGGDGFVLYG 538 (659) Q Consensus 464 ~~~~~p~s~~~Ag~~a~~d~~~g~~~span~~~~~i~g~~-----~~~~~~~~~e~~~Ln~~gin~i~~~~~~~G~~~wG 538 (659) -.-|.+||+++.. .--+..+|.-...+.+.|.. +....++.+.++.|..+|-.+.+.|++-.|+.+-. T Consensus 174 -----~~~G~~aGRL~na--avsVadsP~Rv~tG~l~gl~~~p~d~~~~~l~~a~l~aLd~agy~vp~~Y~gy~G~Y~~d 246 (370) T protein:vir:78 174 -----TLAGAYAGRLCNR--AVSIADSPCRVKTGALVGLGNKPVGKDGIPLPLATLQTLEANRYSVPMWYPDYDGIYWAD 246 (370) T ss_pred -----ccHHHHHHHHhcC--eeeecccceeeeccccccccccccccCCcccCHHHHHHHHhCCCeEEEeeCCCCceEEeC Confidence 1136777865432 11266788766666665532 12345677889999999999999999989998888 Q ss_pred ccccCCCccccceeehhhHHHHHHHHHHHH-HHHHhcCCCCH--HHHHHHHHHHHHHHHHHHhccceee--eEEEEccC- Q lcl|NC_014792. 539 DKTATKVPSPMDHINVRRLTNMLKKNIGDA-SKYKLFELNDN--FTRASFRMETSQYLDGIRALGGIYE--GRVVCDTT- 612 (659) Q Consensus 539 ~rT~~~~~~~~~~i~vrR~~~~i~~~i~~~-~~~~v~epn~~--~l~~~i~~~i~~~l~~l~~~gal~g--~~v~~d~~- 612 (659) .||+....++|++|..+|..+-+.+.++.. ++...++-.|+ .-....+.-...=|+++...+.+.| |.-++... T Consensus 247 ~~tl~~~gsDYq~ie~~RVvdKa~R~vR~~ai~~i~D~~lnst~gsia~~~~~~~~~L~ema~s~~i~~~~fpgeI~~p~ 326 (370) T protein:vir:78 247 GRTLDAEGGDYQVIENLRIAYKVARRMRLRAIARIGDRSFNSTPGSTAAAITYFGKDLREMAKSTTINGQPFPGDIASPQ 326 (370) T ss_pred ceEeccCCCChhhhhhhhHHHHHHHHHHHHHHHHhCCcccCCCCcchhHHHHHHHhhHHHHHhhhhhcccccceeEeccC Confidence 899988888999999999999999999944 44444432222 2223344445555666677887776 44444321 Q ss_pred --CCCHHHhhCCEEEEEEEEEecCCceEEEEEEEEeecCeeEEEecC Q lcl|NC_014792. 613 --NNTPSVIDRNEFVASIYYKPARSINYIVLNFVATSTGADFDELIG 657 (659) Q Consensus 613 --~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~~ 657 (659) .-+..-+...++.|.+.+.|.--...|+..|.- +.++++=.| T Consensus 327 d~Di~i~w~s~~~v~I~~~v~P~~~pk~Itv~I~L---Dls~e~~~~ 370 (370) T protein:vir:78 327 DGDIRIQWVAKNLVSVFVVVRTVDCPKGITVNIML---DLSLNNGEG 370 (370) T ss_pred CCcceEEeeccceEEEEEEEEeccCCceEEEEEEE---eeccccCCC Confidence 123333467889999999999999999998873 344555555 No 55 >protein:vir:276 Length: 369 # NCBI annotation: putative tail sheath protein # Family: family:all:669 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536656;genbank:gi:17975134;genbank:GeneID:929090 Probab=98.81 E-value=3.1e-08 Score=61.75 Aligned_cols=358 Identities=9% Similarity=0.033 Sum_probs=175.5 Q ss_pred cccceeEEEEEeecccccccceeeeeeeccccccccceeeeeeecccc-ccceeeeeccCCceeeeeeeecccccccccc Q lcl|NC_014792. 228 GEIGSTLEVEIVSKAAYDVGASKMLDIYPNGGSRASVARAVFNYGPQT-DDQYAIIVRRDGAIVENVVLSTKEGDKDVYG 306 (659) Q Consensus 228 g~~g~~i~V~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~v~~~g~~~et~~~~~~~~~~~~~~ 306 (659) =.|+ .|.+............+.. ...+....+. .+.-.+..-+.. .-++... ++.+. . T Consensus 1 m~~~---~V~in~~n~~qg~~~~ver------------~~lfig~g~~~~~~g~~~~~~~~---sdld~~l--g~~ds-~ 59 (369) T protein:vir:27 1 MAWP---TVIIKILNLMNGPIADIEC------------HFLFVIRGTVSGEVRNLIMVDST---SDLDDVL--AEASA-E 59 (369) T ss_pred CCCC---ceEEecccccCCCcccccc------------eEEEEEeccccccccceEEecCc---cchHhhc--CCcCh-h Confidence 0011 0111100000000000000 0000000000 000000000000 0000001 11111 1 Q ss_pred chhhhhhhhhcccccceEEeecccCCccceeEEeecccccccccchhhhhhhHhhhhhcccccceEEEeccccccchhhh Q lcl|NC_014792. 307 NNIYLDDYFAKGTSNYIYATSLNWPKGFAGIINLMGGISANDQVTAGDLMQGWDLFADREALHINLLIAGAVAGEGDATA 386 (659) Q Consensus 307 ~~~~~~~~~~~~~s~~v~~~~~~~~~~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~ 386 (659) ....+..+..|++..|-..+ ..+. +..++.++.+... +.+.+-++.+-+-.. ....- T Consensus 60 lk~~v~aa~~naG~~w~a~~-----------~p~~---------~~~~~~~Av~~a~--~~~s~E~V~v~~p~t-~~a~i 116 (369) T protein:vir:27 60 GLAIVKAAQLNGKQAWTAGV-----------MILS---------EEDNWQDAVKKAN--EVSSFEFVVLGFDAE-TKAMI 116 (369) T ss_pred HHHHHHHHHhCCCCceEEEE-----------EEeC---------CchhHHHHHHhhh--hhCCccEEEEecCcc-cHHHH Confidence 23334455556655542211 1111 1234455554332 223333444333111 00111 Q ss_pred HHHHHHHHHHHHh-hCCEEEEEecCccccccccccCCHHHHHHHhhccccccccccccccceEEEEcCceeEecccCCcc Q lcl|NC_014792. 387 STVQKHVVSIADE-RQDCLAFISPPKGLLVNVPLTRAVDNLIDWRTGGGSFDTDNMNISTTYAAIDGNYKYQYDKYNDVN 465 (659) Q Consensus 387 ~~v~~~l~~~~~~-~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~ 465 (659) .+.++....+-.+ +|-.|.++..+.-. .+. ...++..+|..... ..+..+.+.+..++--+.... T Consensus 117 ~aaq~~a~el~~~~~R~vffi~e~~~~~-~~~---~~~e~w~dy~a~l~---al~~g~a~~~V~vv~~~~~~g------- 182 (369) T protein:vir:27 117 EDAITLRTELKNSLGREVGVLCQLPAIN-NDP---TNGQTWSEWLADTV---DIPKDVASEYISVVPNVHAAG------- 182 (369) T ss_pred HHHHHHHHHHHHhcCCeEEEEEeccccC-CCc---cccCCHHHHHHHHH---HHhhccCcccceeeeeecccc------- Confidence 2233333333333 35677777654210 111 12233444443322 223345667777662222211 Q ss_pred eeecHHHHHHHHHHHhhhcCCceECcCCcchhheeccccc-----eeecChhHHHhhhhCCceEEEEEeCCCeEEEEccc Q lcl|NC_014792. 466 RWVPLAADMAGLCARTDDVSQPWMSPPGYNRGQILNVLKL-----AIEPRQTQRDRMYQEAINPVVGFAGGDGFVLYGDK 540 (659) Q Consensus 466 ~~~p~s~~~Ag~~a~~d~~~g~~~span~~~~~i~g~~~~-----~~~~~~~e~~~Ln~~gin~i~~~~~~~G~~~wG~r 540 (659) .-.|.+||+++.- ..-++.||.-..-+.+.|...+ -..++.+.+..|..+|..+.+.|++-.|+.+-.+| T Consensus 183 ---n~~G~~aGRl~n~--aVsIadsp~RVktG~l~g~~~~p~d~~g~~l~~a~l~aLd~agysvp~~Y~gy~G~Yw~d~~ 257 (369) T protein:vir:27 183 ---DTLGKYAGRLANK--EVSIADSPARVQTGSVLGNTELMKDKAGKALDLATLKALESNRIAVPMWYPDYPGQYWTTGR 257 (369) T ss_pred ---chHHHHHHHHHhc--ccchhcCcceeeecccccccccccCCCCcccCHHHHHHHHhCCCeEEEeeCCCCceEEeCce Confidence 2357788887652 2235778887766666664322 13356678889999999999999998999888889 Q ss_pred ccCCCccccceeehhhHHHHHHHHHHHHHHHHhcCC---CCHHHHHHHHHHHHHHHHHHHhccceeeeEEEEccC-CCCH Q lcl|NC_014792. 541 TATKVPSPMDHINVRRLTNMLKKNIGDASKYKLFEL---NDNFTRASFRMETSQYLDGIRALGGIYEGRVVCDTT-NNTP 616 (659) Q Consensus 541 T~~~~~~~~~~i~vrR~~~~i~~~i~~~~~~~v~ep---n~~~l~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~-~nt~ 616 (659) |+....++|++|..+|..+-+.+.++...-..+..| .++.-++..+..+..=|++|.+.+ ..++|...++ .-+- T Consensus 258 tl~~~gsDYq~iE~~RVvdKa~R~vR~~Ai~~i~Dr~lnstp~sia~~~~~~~~pLr~M~ks~--fpgei~~P~d~dI~i 335 (369) T protein:vir:27 258 TLDVPGGDYQDIRHIRVAMKAARKVRIRAIARIADRTLNSTPQSIAAAKLYFTQDLRTMALTG--VPGEIYPPEDEDIQI 335 (369) T ss_pred EeccCCCCeehhhhhhHHHHHHHHHHHHHHHHhcCcccccChhHHHHHHHHHhhHHHHHHhhc--CCeEEecCCCCceEE Confidence 999888999999999999999998887776666544 244556666777778888887653 2333333211 1111 Q ss_pred HHhhCCEEEEEEEEEecCCceEEEEEEEEeecCe Q lcl|NC_014792. 617 SVIDRNEFVASIYYKPARSINYIVLNFVATSTGA 650 (659) Q Consensus 617 ~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~ 650 (659) .-....++.|.+.+.|.--...|+.+|.-+-..- T Consensus 336 ~w~~k~~V~I~~~vrP~~~pk~it~~I~ldl~~~ 369 (369) T protein:vir:27 336 KWVNSTDVEIYMSVQPYECPVKITIAISVKQGDY 369 (369) T ss_pred EeeccceEEEEEEEeeccCCceEEEEEEEeccCC Confidence 1114456778888888888888888887665554 No 56 >protein:vir:3751 Length: 376 # NCBI annotation: orf23 # Family: family:all:669 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043492;genbank:gi:9628627;genbank:GeneID:1261129 Probab=98.79 E-value=4.7e-08 Score=60.77 Aligned_cols=360 Identities=9% Similarity=0.014 Sum_probs=181.8 Q ss_pred ccceeEEEEEeecccccccceeeeeeeccccccccceeeeeeeccccccceeeeeccCCceeeeeeeeccccccccccch Q lcl|NC_014792. 229 EIGSTLEVEIVSKAAYDVGASKMLDIYPNGGSRASVARAVFNYGPQTDDQYAIIVRRDGAIVENVVLSTKEGDKDVYGNN 308 (659) Q Consensus 229 ~~g~~i~V~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~et~~~~~~~~~~~~~~~~ 308 (659) -|+ +|.+............+...+.. +..+..+.+. .+.+... .-++..+. +. ..... T Consensus 1 ~~~---~v~vn~ln~~qg~~~~ver~~lf-----------ig~~~~~~~~-~~~~~~~----sdld~~lg--~~-ds~lk 58 (376) T protein:vir:37 1 MFP---SVQINALNQLSGETKEIERHALF-----------VGVGTTNQGK-LLALTPD----SDFDKVFG--ET-DTDLK 58 (376) T ss_pred CCC---eEEEeeeeccCCCcccccceEEE-----------eeccccccCc-eEEecCC----CChHHhhC--CC-chhHH Confidence 111 11111111110000000000000 0000000000 0000000 00111111 10 11122 Q ss_pred hhhhhhhhcccccceEEeecccCCccceeEEeecccccccccchhhhhhhHhhhhhcccccceEEEeccccccchhhhHH Q lcl|NC_014792. 309 IYLDDYFAKGTSNYIYATSLNWPKGFAGIINLMGGISANDQVTAGDLMQGWDLFADREALHINLLIAGAVAGEGDATAST 388 (659) Q Consensus 309 ~~~~~~~~~~~s~~v~~~~~~~~~~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~ 388 (659) ..+.....|++..|...+-. + ..+..++..+++... +.+.+-++.+-+-.......-.+ T Consensus 59 ~~v~aa~~naG~~w~a~~~~-----------p--------~~~~~~~~~Av~~a~--~~~s~E~V~v~~p~~t~~a~i~a 117 (376) T protein:vir:37 59 KQVRAAMLNAGQNWFAHVYI-----------A--------QEDGYDFVECVKKAN--QTASFEYCVNTRYLGVDKASIGK 117 (376) T ss_pred HHHHHHHhCCCCceEEEEEe-----------c--------CCChhhHHHHHHHHH--hhCCeeEEEEecCcchhHHHHHH Confidence 23444455555554211110 0 012234555655442 33444444444321111111122 Q ss_pred HHHHHHHHHHh-hCCEEEEEecCccccccccccCCHHHHHHHhhccccccccccccccceEEEEcCceeEecccCCccee Q lcl|NC_014792. 389 VQKHVVSIADE-RQDCLAFISPPKGLLVNVPLTRAVDNLIDWRTGGGSFDTDNMNISTTYAAIDGNYKYQYDKYNDVNRW 467 (659) Q Consensus 389 v~~~l~~~~~~-~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~ 467 (659) .+.....+-.+ +|-.|.++..+.- ..+. ...++..+|..... ..+..+.+.+..++-. .+ + T Consensus 118 ~qa~a~el~~~~~R~vffile~~g~-d~~~---~~ge~w~~y~~~l~---a~~~gia~~~V~vV~~---~~----g---- 179 (376) T protein:vir:37 118 LQECYAELLAKFGRRTFFIQAVQGI-NHDQ---SDGETWDQYVQKLT---TLQQTIVADHVCLVPL---LF----G---- 179 (376) T ss_pred HHHHHHHHHHhcCCeEEEEEeccCC-CCcc---cccCCHHHHHHHHH---HHhccccccceeeeee---ec----c---- Confidence 22222333333 3567788876521 1111 11233444443322 2233455666665522 11 0 Q ss_pred ecHHHHHHHHHHHhhhcCCceECcCCcchhheeccccce-------eecChhHHHhhhhCCceEEEEEeCCCeEEEEccc Q lcl|NC_014792. 468 VPLAADMAGLCARTDDVSQPWMSPPGYNRGQILNVLKLA-------IEPRQTQRDRMYQEAINPVVGFAGGDGFVLYGDK 540 (659) Q Consensus 468 ~p~s~~~Ag~~a~~d~~~g~~~span~~~~~i~g~~~~~-------~~~~~~e~~~Ln~~gin~i~~~~~~~G~~~wG~r 540 (659) ...|.+||+++.- ..-++.||.-..-+.+.|+...+ ..++.+....|..+|.-+.+.+++-.|+.+-.+| T Consensus 180 -n~~G~~aGRl~na--aVsVadspgRV~tGai~gl~~~~~p~d~~g~el~~a~l~aLd~arysvpr~Y~gydG~Yw~dg~ 256 (376) T protein:vir:37 180 -NETGVLAGRLANR--AVTVADSPARVQTGALVSLGSANKPLDKDGNELTLAHLKSLETARYSVPMWYPDYDGYYRADGR 256 (376) T ss_pred -chHHHHHHHHHhC--CcchhcCccceeecccccccccccccccCCcccchHHHHHHHhCCCeEEEeeCCCCceEEeCCe Confidence 2358888887652 33367899887767666654322 3456678889999999999999998999888899 Q ss_pred ccCCCccccceeehhhHHHHHHHHHHHHHHHHhcCC---CCHHHHHHHHHHHHHHHHHHHhccceeeeE----EEEccC- Q lcl|NC_014792. 541 TATKVPSPMDHINVRRLTNMLKKNIGDASKYKLFEL---NDNFTRASFRMETSQYLDGIRALGGIYEGR----VVCDTT- 612 (659) Q Consensus 541 T~~~~~~~~~~i~vrR~~~~i~~~i~~~~~~~v~ep---n~~~l~~~i~~~i~~~l~~l~~~gal~g~~----v~~d~~- 612 (659) |+....++|++|..+|.++-+.+.++...-..+..+ .++.-++..+..++.-|+.|.+.+.|.|.. |...++ T Consensus 257 tl~~~gsDYq~ie~~RVvdKa~R~vR~~Ai~~i~Dr~lnstp~sia~~~~~~~~pLr~M~ks~ei~g~~fpgei~~P~d~ 336 (376) T protein:vir:37 257 TLDVEGGDYQVIENLRVVDKVARKVRLLAIGKIADRSFNSTTSSTEYHKNYFAKPLRDMSKSATINGKDFPGECMPPKDD 336 (376) T ss_pred EeccCCCCeeeehhchHHHHHHHHHHHHHHHHhcCccccCChhHHHHHHHHHhHHHHHHHhhhhhccccccceeecCCCC Confidence 999888999999999999999998887666555543 356677888888999999999999999943 443211 Q ss_pred CCCHHHhhCCEEEEEEEEEecCCceEEEEEEEEeecCeeEEE Q lcl|NC_014792. 613 NNTPSVIDRNEFVASIYYKPARSINYIVLNFVATSTGADFDE 654 (659) Q Consensus 613 ~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~e 654 (659) .-+-.-....++.|.+.++|.---+.|+..|--+.. ...| T Consensus 337 dI~i~w~sk~~V~I~~~vrPy~cpk~i~~~I~LDls--~~~~ 376 (376) T protein:vir:37 337 AITIVWQSKTKVTIYIKVRPYDCPKEITANIFLDLD--SLGE 376 (376) T ss_pred ceEEEeccCceEEEEEEEeeecCcceeEEEEEEecC--CCCC Confidence 111111124567777888888777777776654332 2223 No 57 >protein:vir:95263 Length: 450 # NCBI annotation: Phage conserved protein # Family: family:all:396 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944896;genbank:gi:38707836;genbank:GeneID:2744049 Probab=98.74 E-value=5.4e-08 Score=60.41 Aligned_cols=412 Identities=11% Similarity=-0.013 Sum_probs=164.0 Q ss_pred ccceeeeeeccC----cceeeeeccccccccccccceeeeeccce--eeEEEeecCCccccccccceeccccceeeeccc Q lcl|NC_014792. 127 TSGRITKVDVDG----KILAVFIPSDKIIAFAKSVNQYPDLGPAW--TAEILTTSSGVSGTITLGKIVTDSGILLTEAEN 200 (659) Q Consensus 127 ~~~~~~~~~~~g----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~v~~~~~g~~~~~~~~~~v~~~~~~~~~~~~ 200 (659) .+.++..+..+- ...+.| +... +.+.. ..+..+.++... . +.. T Consensus 1 ~~s~iVnV~i~~~~~a~~~~~f-------------~~~l-~~~~~~~~~~r~~~yss~~-------------~----V~~ 49 (450) T protein:vir:95 1 MWNPIVNVDITLNTAGTTREGF-------------GLPL-FLASTDNFEERVRGYTSLT-------------E----VAE 49 (450) T ss_pred CCCceEEEeecccccccccccc-------------eeEE-EEcCCCCCccceeeecCHH-------------H----HHH Confidence 112222111100 000000 0000 00000 000000000000 0 000 Q ss_pred ccccccccceeecc-cccccceeeeccccccceeEEEEEeecccccccceeeeeeeccccccccceeeeeeeccccccce Q lcl|NC_014792. 201 SEEAITSLEFQASL-QKYAMPGVVALYPGEIGSTLEVEIVSKAAYDVGASKMLDIYPNGGSRASVARAVFNYGPQTDDQY 279 (659) Q Consensus 201 ~~~~~~~~~~~~~~-~~~~~~~~~a~~~g~~g~~i~V~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 279 (659) .. +..+.++.... ..-+.+.....+.|.|...-.... ..... ...+.. T Consensus 50 ~F-G~~S~ey~aA~~yF~q~p~p~~l~igr~~~~~t~~~-~~~~~----------------------------~~~~g~- 98 (450) T protein:vir:95 50 DF-DENTAAYKAAKQLWSQTPKVTQLYIGRRAMQYTVSI-PDAVT----------------------------ESTDYS- 98 (450) T ss_pred hc-CCCcHHHHHHHHHHhCCCcccEEEEEeeccchhhhh-hhhhc----------------------------ccccee- Confidence 00 00000000000 000011111112222221100000 00000 000000 Q ss_pred eeeeccCCceeee--eeeeccccccccccchhhhhhhhhcccccceEEeecccCCcc-------------------ceeE Q lcl|NC_014792. 280 AIIVRRDGAIVEN--VVLSTKEGDKDVYGNNIYLDDYFAKGTSNYIYATSLNWPKGF-------------------AGII 338 (659) Q Consensus 280 ~~~v~~~g~~~et--~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~v~~~~~~~~~~~-------------------~~~~ 338 (659) +++..+|..... +.++... +...-...+...+................... .... T Consensus 99 -lt~tv~G~~~~~~~i~~s~a~---s~~~va~~~~tai~~~~~~~~~~~~~s~g~~~~~t~~~~~~~~~~~~~l~~~~~~ 174 (450) T protein:vir:95 99 -ITVAAGGGISQPYQYTAQSSD---TAENVLQQFKTQIEADPTIKDKVSVNVTGSNGSATMIIAKAGDNDFVKVTTTAQT 174 (450) T ss_pred -EEEEecceeeeeeEEEEEecC---ChhhHHHHhhhhhcccceeeeeeeeeeecccceeeeeeeccccchhhccccccce Confidence 111111111110 0100000 00000000011110000000000000000000 0001 Q ss_pred EeecccccccccchhhhhhhHhhhhhcccccceEEEeccccccchhhhHHHHHHHHHHHHhhCCEEEEEecCcccccccc Q lcl|NC_014792. 339 NLMGGISANDQVTAGDLMQGWDLFADREALHINLLIAGAVAGEGDATASTVQKHVVSIADERQDCLAFISPPKGLLVNVP 418 (659) Q Consensus 339 ~~~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~v~~~l~~~~~~~~~~~ai~d~p~~~~~~~~ 418 (659) ....|.. ...+..++..+......-. .++.+.. + ..-+.+|..+++....+|.....-....... T Consensus 175 ~~~~g~~------aet~~~a~~a~~~~~~~w~-~~~~~~~------~-~~~i~a~a~w~~a~~~~f~~~~~~~~~~~~~- 239 (450) T protein:vir:95 175 VYIASTT------ADTASTALAAIEAYSTDWY-FIAAEDR------T-QQFVLAMASEIQARKKIFFTANSDVTALQGT- 239 (450) T ss_pred eEecccc------cccHHHHHHHHHHhhCCeE-EEEecCC------C-HHHHHHHHHHHhhcCcEEEEEcCCchhhhhh- Confidence 1111111 1112222332222111111 2233221 1 1222445555655554554432211100000 Q ss_pred ccCCHHHHHHHhhccccccccccccccceEEEEcCceeEecccCCcceeecHHHHHHHHHHHhhhcCCceECcCCcchhh Q lcl|NC_014792. 419 LTRAVDNLIDWRTGGGSFDTDNMNISTTYAAIDGNYKYQYDKYNDVNRWVPLAADMAGLCARTDDVSQPWMSPPGYNRGQ 498 (659) Q Consensus 419 ~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~Ag~~a~~d~~~g~~~span~~~~~ 498 (659) ......++....... +..+.. .+|++.+.. -.+.+.++|.....+.-+--| .+|.+.+ T Consensus 240 ~~~~~~~i~~~l~~~----------~~~~t~------~~y~~~~~~---~~~~aa~~g~~~~~~~g~~T~---~fk~l~G 297 (450) T protein:vir:95 240 ELASANDVPAQLAKN----------MYTRTV------CLWHHAAAE---DYPEMAYIAYGAPYDAGSIAW---GNAQLTG 297 (450) T ss_pred hhhcccchHHHHHhc----------cCCeeE------EEeeCCCch---hHHHHHHHHHhhhcccceeee---ccccccc Confidence 000001111110000 011111 122211111 124566666655544322233 2455444 Q ss_pred eec-cc-cceeecChhHHHhhhhCCceEEEEEeCCCeEEEEcccccCCCccccceeehhhHHHHHHHHHHHHHHHHhc-- Q lcl|NC_014792. 499 ILN-VL-KLAIEPRQTQRDRMYQEAINPVVGFAGGDGFVLYGDKTATKVPSPMDHINVRRLTNMLKKNIGDASKYKLF-- 574 (659) Q Consensus 499 i~g-~~-~~~~~~~~~e~~~Ln~~gin~i~~~~~~~G~~~wG~rT~~~~~~~~~~i~vrR~~~~i~~~i~~~~~~~v~-- 574 (659) |.. +. +....++..|.+.|..+|+|++..+-+ .+ .++.++|+++ + ||-++|-.+|++..|++.+....- T Consensus 298 v~~~v~~~~~~~lt~~~~~al~~~~~n~y~~~~~-~~-~~~~G~~~~G---~--~iD~~~~~~wl~~~iq~~l~~ll~~~ 370 (450) T protein:vir:95 298 VAASLQPSNQRPLTSIQKSALDVRHCNFIDLDGG-VP-VVRRGITSGG---E--WIDIIRGVDWLESDLKTSLRDLLINQ 370 (450) T ss_pred eeeeccCccccccchHHHHHHHhCCcEEEEEecC-ce-eeeCCeeeCc---c--hhHHHHHHHHHHHHHHHHHHHHHHhc Confidence 432 11 223568899999999999999887754 45 4778888775 2 588999999999999999988651 Q ss_pred ----CCCCHHHHHHHHHHHHHHHHHHHhccceeeeEEEEc-cCCCCHHHhhCCEEE-EEEEEEecCCceEEEEEEEEeec Q lcl|NC_014792. 575 ----ELNDNFTRASFRMETSQYLDGIRALGGIYEGRVVCD-TTNNTPSVIDRNEFV-ASIYYKPARSINYIVLNFVATST 648 (659) Q Consensus 575 ----epn~~~l~~~i~~~i~~~l~~l~~~gal~g~~v~~d-~~~nt~~~i~~G~~~-~~i~~~p~~p~e~i~~~~~~~~~ 648 (659) =|-|..=...|+..|+.-|++..++|.|.||+|.+. .+..++.|+.++++. +.+.++....++++.++...+=. T Consensus 371 ~~~KiPy~~~G~~~i~a~i~~~l~~a~~~G~Ia~~~V~~~~~~~~~~~dr~~R~~~~i~~~~~laGAIh~~~i~~~v~~~ 450 (450) T protein:vir:95 371 KGGKITYDDTGITRIRQVIETSLQRAVNRNFLSSYTVNVPKASQVALADKKARILKDVTFAGILAGAILDVDLKGTVAYE 450 (450) T ss_pred CCCCCccChhhHHHHHHHHHHHHHHHHhcCcccceeEecCChHhcCHHHHhccCCCCeeEEEEEccceEEEEEEEEEEeC Confidence 267788888999999999999999999999999988 578899999998866 88888999999999998775544 No 58 >protein:vir:1996 Length: 495 # NCBI annotation: major tail subunit # Family: family:all:369 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050643;genbank:gi:9633530;genbank:GeneID:2636269 Probab=98.50 E-value=4.3e-07 Score=55.50 Aligned_cols=439 Identities=13% Similarity=0.097 Sum_probs=204.2 Q ss_pred CceecCceEEEEecCCCcccc--cCCcceEEEeec---ccCCCCccEEeCCHHHHHHHcCCcCCCchhHHHHHHHHHcC- Q lcl|NC_014792. 1 MALLSPGIELKETTVQSTVVR--NATGRAALVGKF---QWGPAFQVTQITNEVELVDLFGGPNNITADYFMSGMNFLQY- 74 (659) Q Consensus 1 ~~~~~PGVyveE~~~~~~~~~--~~ts~~afvG~~---~~Gp~~~p~~i~s~~~~~~~fG~~~~~~~~~~~~~~~f~ng- 74 (659) =....||+|+ |++.+....+ .-..-.-+||.. -..|.++|++|+|-.|-...|| ..+.+..+++.|..+. T Consensus 11 ~~iRvP~~y~-E~dns~A~~g~~~~~q~vLiiGq~la~gs~~~~~pv~v~s~~~a~~~fG---~GS~la~M~~a~~~~n~ 86 (495) T protein:vir:19 11 SDVRVPLTYI-EFDNSNAVSGTPAPRQRVLMFGQSGSKASAAPNVPVRIRSGSQASAAFG---QGSMLALMADAFLNANR 86 (495) T ss_pred cccccCeEEE-EEccCCCCcCCcCCCceEEEEEecCcccccccceeEEecCHHHHHHhcC---cCcHHHHHHHHHHHhCC Confidence 2355799999 5654433222 223445677763 3457799999999999999999 5666766777777654 Q ss_pred CCeEEEEeccCCcccccccccccccccccccccccccccceeeeeeccccccccceeeeeeccCcceeeeeccccccccc Q lcl|NC_014792. 75 GNDLRTVRVVNRDHAKNASPVAGNIESTIATAGSNYAVGDVIQVKHNQTVVETSGRITKVDVDGKILAVFIPSDKIIAFA 154 (659) Q Consensus 75 G~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~ 154 (659) -.++|++-+.+.. ...|+ +.+..+... . ..+. ..+.-.|...... T Consensus 87 ~~~l~~i~~~D~a-G~aA~---g~it~tg~a-t-------------------~~G~-l~l~I~g~~v~v~---------- 131 (495) T protein:vir:19 87 VAELWCIPQGNGT-GNAAV---GEISLSGTA-G-------------------ENGS-LVTYIAGQRLAVS---------- 131 (495) T ss_pred cceEEEEeeCChh-hceeE---EEEEEeecC-C-------------------CCcE-EEEEECCEEEEEE---------- Confidence 4789999886532 11111 111110000 0 0000 0000000000000 Q ss_pred cccceeeeeccceeeEEEeecCCccccccccceeccccceeeecccccccccccceeecccccccceeeeccccccceeE Q lcl|NC_014792. 155 KSVNQYPDLGPAWTAEILTTSSGVSGTITLGKIVTDSGILLTEAENSEEAITSLEFQASLQKYAMPGVVALYPGEIGSTL 234 (659) Q Consensus 155 ~~~~~~~~~~~~~~~~v~~~~~g~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~g~~i 234 (659) ...+... ..+. ..+..+-... ..+ T Consensus 132 -------------------V~~gdTa-a~vA----------~al~aaina~--------------------------~~l 155 (495) T protein:vir:19 132 -------------------VAAGATG-AALA----------DLLVARIKGQ--------------------------PDL 155 (495) T ss_pred -------------------ecCCCCH-HHHH----------HHHHHHhcCC--------------------------ccC Confidence 0000000 0000 0000000000 001 Q ss_pred EEEEeecccccccceeeeeeeccccccccceeeeeeeccccccceeeeeccCCceeeeeeeeccccccccccchhhhhhh Q lcl|NC_014792. 235 EVEIVSKAAYDVGASKMLDIYPNGGSRASVARAVFNYGPQTDDQYAIIVRRDGAIVENVVLSTKEGDKDVYGNNIYLDDY 314 (659) Q Consensus 235 ~V~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~et~~~~~~~~~~~~~~~~~~~~~~ 314 (659) .|.......... .......+++.+..|+. ....+... ++ T Consensus 156 PvTA~~~~~~~~--------------------------~~a~~~VtlTAr~kG~~-n~idi~~~-----------~~--- 194 (495) T protein:vir:19 156 PVTAEVRADSGD--------------------------DDTHADVVLSAKFTGAL-SAVDVRWN-----------YY--- 194 (495) T ss_pred ceEEEeeccCCC--------------------------CcCceeEEEEEeecccc-ccceeEEE-----------ee--- Confidence 111000000000 00000111222222211 00111000 00 Q ss_pred hhcccccceEEeecccCCccceeE-EeecccccccccchhhhhhhHhhhhhcccccceEEEeccccccchhhhHHHHHHH Q lcl|NC_014792. 315 FAKGTSNYIYATSLNWPKGFAGII-NLMGGISANDQVTAGDLMQGWDLFADREALHINLLIAGAVAGEGDATASTVQKHV 393 (659) Q Consensus 315 ~~~~~s~~v~~~~~~~~~~~~~~~-~~~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~v~~~l 393 (659) .....|.+..-++ ..+||.. ..|+..++.++. ....+++++|= .+. +-..+| T Consensus 195 -----------~ge~~p~Glt~titamsgGag------~PDia~alaal~---~~~~~~I~~P~------tD~-asL~al 247 (495) T protein:vir:19 195 -----------AGETTPYGIITAFKAASGKNG------NPDISASIAGMG---DLQYKYIVMPY------TDE-PNLNLL 247 (495) T ss_pred -----------cccccccceeEEEEecCCCCC------CcchHHHHHHhc---cCCCcEEEEec------CcH-HHHHHH Confidence 0011122222222 2344432 234556655554 34567888872 222 233567 Q ss_pred HHHHHhh------CCEEEEEecCccccccccccCCHHHHHHHhhccccccccccccccceEEEEcCceeEecccCCccee Q lcl|NC_014792. 394 VSIADER------QDCLAFISPPKGLLVNVPLTRAVDNLIDWRTGGGSFDTDNMNISTTYAAIDGNYKYQYDKYNDVNRW 467 (659) Q Consensus 394 ~~~~~~~------~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~ 467 (659) .+|++.+ ++.+++.-- ..+..++..|-.. .++.+..+.+ + ++. . T Consensus 248 ~~~l~~rw~~~~q~~g~~~~a~----------~gT~~~l~t~g~~----------~N~~~it~~~--~------~gs--p 297 (495) T protein:vir:19 248 RTELQERWGPVNQADGFAVTVL----------SGTYGDISTFGVS----------RNDHLISCMG--I------AGA--P 297 (495) T ss_pred HHHHHHhhhHHHhcCeEEEEee----------cCCHHHHHHhhhc----------cCCceEEEEe--c------CCC--C Confidence 7777653 334444321 1245666666543 3466665542 1 111 1 Q ss_pred ecHHHHHHHHHH------HhhhcCCceECcCCcchhheeccc--cceeecChhHHHhhhhCCceEEEEEeCCCeEEEEcc Q lcl|NC_014792. 468 VPLAADMAGLCA------RTDDVSQPWMSPPGYNRGQILNVL--KLAIEPRQTQRDRMYQEAINPVVGFAGGDGFVLYGD 539 (659) Q Consensus 468 ~p~s~~~Ag~~a------~~d~~~g~~~span~~~~~i~g~~--~~~~~~~~~e~~~Ln~~gin~i~~~~~~~G~~~wG~ 539 (659) -||....|++.+ +.|..+ .--... +.|+. .+...++..|++.|..+||.++.--.++ -..+--. T Consensus 298 ~~~~~~AAA~aa~~A~~l~~DPAr----PL~tl~---L~Gi~~p~~~~r~~~~ern~LL~~Gist~~V~~~G-~V~I~R~ 369 (495) T protein:vir:19 298 EPSYLYAATLCAVASQALSIDPAR----PLQTLT---LPGRMPPAVGDRFTWSERNALLFDGISTFNVNDGG-EMQIERM 369 (495) T ss_pred CcHHHHHHHHHHHHHHHhhccccc----ccCcee---ecceecCCccccCChHHHHHHHhCCcceEEECCCC-eEEEEee Confidence 244433333333 344433 222223 34544 5567789999999999999887543332 2333333 Q ss_pred ccc-----C-CCccccceeehhhHHHHHHHHHHHHHHHHhcC-CCCHH-----------HHHHHHHHHHHHHHHHHhccc Q lcl|NC_014792. 540 KTA-----T-KVPSPMDHINVRRLTNMLKKNIGDASKYKLFE-LNDNF-----------TRASFRMETSQYLDGIRALGG 601 (659) Q Consensus 540 rT~-----~-~~~~~~~~i~vrR~~~~i~~~i~~~~~~~v~e-pn~~~-----------l~~~i~~~i~~~l~~l~~~ga 601 (659) -|. . ..|..|..|++-|+.+|+.+.++......-.+ ..-+. +-..|+..+-.-+++|...|- T Consensus 370 ITTY~~n~~G~~D~syLDi~T~~tl~yvr~~~r~~i~~kfpR~KLa~d~~~~~~gq~IvTp~~ir~ell~~~~~le~~gi 449 (495) T protein:vir:19 370 ITMYRTNKYGDSDPSYLNVNTIATLSYLRYSLRTRITQKFPNYKLASDGTRFATGQAVVTPSVIKTELLALFEEWENAGL 449 (495) T ss_pred eeeeeecCCCCcchhhhhhHHHHHHHHHHHHHHHHHhhhcCCcccccCCCCCCCcccccChHHHHHHHHHHHHhhhhhcc Confidence 332 1 12346899999999999999999877653332 22211 556899999999999999998 Q ss_pred eeee---E----EEEccCCCCHHHhhCCEEEEEEEEEecCCceEEEEEEEEee Q lcl|NC_014792. 602 IYEG---R----VVCDTTNNTPSVIDRNEFVASIYYKPARSINYIVLNFVATS 647 (659) Q Consensus 602 l~g~---~----v~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~ 647 (659) +..+ + |+-|.++ .+|+++.+-...+-+++-+-.+++-.- T Consensus 450 ven~~~~~~~LiVerd~~d-------pnRln~~~p~d~vn~L~V~A~~i~f~L 495 (495) T protein:vir:19 450 VEDFDTFKEELYVARNKDD-------KDRLDVLCGPNLINQFRIFAAQVQFIL 495 (495) T ss_pred ccChhhhcceeEEEECCCC-------CcEEEEEecceeeCceeeeeeeeeeeC Confidence 8763 2 3333322 246666665555555443333332211 No 59 >protein:vir:80052 Length: 331 # NCBI annotation: gp14 # Family: family:all:396 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468718;genbank:gi:157325298;genbank:GeneID:5601743 Probab=97.97 E-value=8.7e-06 Score=48.32 Aligned_cols=314 Identities=15% Similarity=0.074 Sum_probs=158.1 Q ss_pred ccceeEEEEEeecccccccceeeeeeeccccccccceeeeeeeccccccceeeeeccCCceeeeeeeeccccccccccch Q lcl|NC_014792. 229 EIGSTLEVEIVSKAAYDVGASKMLDIYPNGGSRASVARAVFNYGPQTDDQYAIIVRRDGAIVENVVLSTKEGDKDVYGNN 308 (659) Q Consensus 229 ~~g~~i~V~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~et~~~~~~~~~~~~~~~~ 308 (659) -..+.+.|.+.-......+. ..-+.-.+.+.....-...+. +...-..+..... T Consensus 1 ~~~~iv~V~v~~~~~~~~~~-------------------------~~~~~~~~~~~~t~~~~~~y~-s~~~v~~d~~~~~ 54 (331) T protein:vir:80 1 MVETITDVRVHISVLYPSPR-------------------------IGLGRPAIFVKGTAMGYKEYT-TLEELKDTFADNT 54 (331) T ss_pred Cccceecceeeecccccccc-------------------------cccCcceeEEeccccceEEEe-chhhhccCCCCCc Confidence 00111111110000000000 000000000000000000000 0000001111111 Q ss_pred hh---hhhhhhcccccceEEeecccCCccceeEEeecccccccccchhhhhhhHhhhhhcccccceEEEeccccccchhh Q lcl|NC_014792. 309 IY---LDDYFAKGTSNYIYATSLNWPKGFAGIINLMGGISANDQVTAGDLMQGWDLFADREALHINLLIAGAVAGEGDAT 385 (659) Q Consensus 309 ~~---~~~~~~~~~s~~v~~~~~~~~~~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~ 385 (659) .. ....+..+. .. ..+..+... + . ....+.. +.....-. .+++.+. + T Consensus 55 ~~Ykaa~~~f~Q~~------------~~----~~i~v~~~~-~---~-~~~~a~~--a~~~~~w~-~~~~~~~------~ 104 (331) T protein:vir:80 55 EVYAKAKAVFLQKD------------RP----DTVAVITYE-D---T-KLLEAAE--AYFLKSWH-FALLAEF------K 104 (331) T ss_pred HHHHHHHHHHhccC------------cc----ceEEEeccc-h---H-HHHHHHH--HhccCcee-EEEeecC------C Confidence 11 111121111 00 011111110 0 0 1111111 11111111 2232221 1 Q ss_pred hHHHHHHHHHHHHhhCCEEEEEecCccccccccccCCHHHHHHHhhccccccccccccccceEEEEcCceeEecccCCcc Q lcl|NC_014792. 386 ASTVQKHVVSIADERQDCLAFISPPKGLLVNVPLTRAVDNLIDWRTGGGSFDTDNMNISTTYAAIDGNYKYQYDKYNDVN 465 (659) Q Consensus 386 ~~~v~~~l~~~~~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~ 465 (659) .+-..++..+++..+.+|.+++.. ...++.+... .+....++++. .+ T Consensus 105 -~~~~~a~a~~~~a~~~~f~~~~~~-----------~~~~~~~~~~------------~~~t~~~~~~~-------~~-- 151 (331) T protein:vir:80 105 -AADALALSNLIEEQKFKFAVFQVT-----------AVADITPLAK------------NTRTIAIVHSK-------TG-- 151 (331) T ss_pred -HHHHHHHHHHHhhCCcEEEEEecC-----------chHHHHHhhc------------cccEEEEEcCC-------cc-- Confidence 122346667777777777766431 1122222111 12223333321 11 Q ss_pred eeecHHHHHHHHHHHhhhcCCceECcCCcchhheeccccceeecChhHHHhhhhCCceEEEEEeCCCeEEEEcccccCCC Q lcl|NC_014792. 466 RWVPLAADMAGLCARTDDVSQPWMSPPGYNRGQILNVLKLAIEPRQTQRDRMYQEAINPVVGFAGGDGFVLYGDKTATKV 545 (659) Q Consensus 466 ~~~p~s~~~Ag~~a~~d~~~g~~~span~~~~~i~g~~~~~~~~~~~e~~~Ln~~gin~i~~~~~~~G~~~wG~rT~~~~ 545 (659) - -+.+.+.|.++.+|.-+--| .+|. .+.|+.. ..++..|.+.|..+|+|++.++-+ .. .++.+.|+++ T Consensus 152 -~-~~~aa~~g~~~~~~~g~~t~---~fk~--~l~GV~~--~~lt~t~~~al~~~~~N~y~~~~~-~~-~~~~G~~~~G- 219 (331) T protein:vir:80 152 -E-KLDAALIGNVASLPVGSATW---KGRH--GLAGITS--EELKVSEIDAIQKAGGMCYIEKAG-IA-QTSEGKTVSG- 219 (331) T ss_pred -c-hhHHHHHHHHHhcCccceee---eeec--ccCCCCC--CCCCHHHHHHHHhcCceEEEEecC-ee-EEecceEeCc- Confidence 1 13566667777776533222 2231 2344432 357899999999999999988754 44 4667777765 Q ss_pred ccccceeehhhHHHHHHHHHHHHHHHHhcC----CCCHHHHHHHHHHHHHHHHHHHhcccee--------eeEEEEc-cC Q lcl|NC_014792. 546 PSPMDHINVRRLTNMLKKNIGDASKYKLFE----LNDNFTRASFRMETSQYLDGIRALGGIY--------EGRVVCD-TT 612 (659) Q Consensus 546 ~~~~~~i~vrR~~~~i~~~i~~~~~~~v~e----pn~~~l~~~i~~~i~~~l~~l~~~gal~--------g~~v~~d-~~ 612 (659) + ||.+.+-.+|++..|++.+...+-. |-|..=...|+..++.-|++-+++|.|. ||+|.+. .+ T Consensus 220 --~--~iD~~~~~dWl~~~lq~~l~~ll~~~~kiPy~~~G~~~l~a~i~~~~~~av~~G~I~~g~~~~~~~~~v~~~~~~ 295 (331) T protein:vir:80 220 --E--FIDSIHGDDWIKATIETRLQKLLTETDKLTFDARGIALLQSELTTVLNEGFANGIIDSNDETGEPNFSITALQRS 295 (331) T ss_pred --h--hHHHHHHHHHHHHHHHHHHHHHHHhCCCCccChhhHHHHHHHHHHHHHHHHhCCceecCccCCCcceEEEeCchh Confidence 2 6999999999999999988886543 5566777899999999999999999996 6889887 56 Q ss_pred CCCHHHhhCCEEE-EEEEEEecCCceEEEEEEEEee Q lcl|NC_014792. 613 NNTPSVIDRNEFV-ASIYYKPARSINYIVLNFVATS 647 (659) Q Consensus 613 ~nt~~~i~~G~~~-~~i~~~p~~p~e~i~~~~~~~~ 647 (659) +.+++|+.++++. +.+.+++..-+++|.+++..+- T Consensus 296 ~~s~~dr~~R~~~~~~~~~~~~gaI~~v~i~~~v~~ 331 (331) T protein:vir:80 296 DLNDDDIAKRNYKGLSFRYKRSGAIHSVDVYGEVEV 331 (331) T ss_pred cCCHHHHhccCCCCeEEEEEEcceEEEEEEEEEEeC Confidence 7899999998887 8888999999999999877655 No 60 >protein:vir:5260 Length: 502 # NCBI annotation: hypothetical protein # Family: family:all:396 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852765;genbank:gi:31544040;uniprot:Q7Y5T5;genbank:GeneID:2753559 Probab=97.76 E-value=2.2e-05 Score=46.16 Aligned_cols=464 Identities=12% Similarity=0.041 Sum_probs=212.5 Q ss_pred Cce-ecCceEEEEecCCCcccccCCcceEEEeec-ccCCCC---ccEEeCCHHHHHHHcCCcCCCchhHHHHHHHHHcCC Q lcl|NC_014792. 1 MAL-LSPGIELKETTVQSTVVRNATGRAALVGKF-QWGPAF---QVTQITNEVELVDLFGGPNNITADYFMSGMNFLQYG 75 (659) Q Consensus 1 ~~~-~~PGVyveE~~~~~~~~~~~ts~~afvG~~-~~Gp~~---~p~~i~s~~~~~~~fG~~~~~~~~~~~~~~~f~ngG 75 (659) |.+ ++.=|.|.--.......+..-+...|+|.. ..-|.. +-...+|..+-...|| ..+.++.+++.+|-+-= T Consensus 1 msip~s~ivnV~i~~~~~a~~~~~f~~~l~l~~~~~~~~~~~~~r~~~~~s~~~V~~~FG---~~s~ey~aA~~yF~q~p 77 (502) T protein:vir:52 1 MALSISHIVNVQLNTVPKSAARKSFGIVALFTPEAGQAFADEKTRYVYVENQRDVEQLFG---TNSETAKAAQPFFAQSP 77 (502) T ss_pred CCCCccceeEEeeccccccccccccCceEEEeeccCccccCCccceEEecCHHHHHHhcC---CChHHHHHHHHHhcCCC Confidence 774 344444432222222222234677888873 343433 2333478899999999 55677777777774321 Q ss_pred --CeEEEEeccCCcccccccccccccccccccccccccccceeeeeeccccccccceeeeeeccCcceeeeecccccccc Q lcl|NC_014792. 76 --NDLRTVRVVNRDHAKNASPVAGNIESTIATAGSNYAVGDVIQVKHNQTVVETSGRITKVDVDGKILAVFIPSDKIIAF 153 (659) Q Consensus 76 --~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~ 153 (659) +.+||-|-........ ...+.+.. ...... .... ..+ .+|.. ...++ T Consensus 78 ~P~~l~igR~~~~~~~~~--~~~~~~~~--~~~~~~---------------~~~~---~~~-~~G~l-~i~i~------- 126 (502) T protein:vir:52 78 RAKQLIVARWQKSASTIE--ATKNTLSG--ATLSDD---------------LERF---KSV-VNGRF-SLTIG------- 126 (502) T ss_pred ccceEEEEecccccccee--echhhhhh--hhhHHh---------------HHHh---hhh-cCcee-EEEec------- Confidence 3588888654321110 00000000 000000 0000 000 00000 00000 Q ss_pred ccccceeeeeccceeeEEEeecCCccccccccceeccccceeeecccccccccccceeecccccccceeeecccccccee Q lcl|NC_014792. 154 AKSVNQYPDLGPAWTAEILTTSSGVSGTITLGKIVTDSGILLTEAENSEEAITSLEFQASLQKYAMPGVVALYPGEIGST 233 (659) Q Consensus 154 ~~~~~~~~~~~~~~~~~v~~~~~g~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~g~~ 233 (659) +. +.+...-..+..... .. .+..+. ...+..+.. T Consensus 127 -----------g~---~~t~~~i~lS~~ts~--------------~~------------vA~~i~------~~l~~~~~~ 160 (502) T protein:vir:52 127 -----------GD---VKKVDGLSFARLADF--------------NA------------VATKIQ------EKLTTLSVA 160 (502) T ss_pred -----------ce---eeeeeccccccccch--------------hH------------HHHHHH------hhhcccccc Confidence 00 000000000000000 00 000000 000000000 Q ss_pred EEEEEeecccccccceeeeeeeccccccccceeeeeeeccccccceeeeeccCCcee-eeeeeeccccccccccchhhhh Q lcl|NC_014792. 234 LEVEIVSKAAYDVGASKMLDIYPNGGSRASVARAVFNYGPQTDDQYAIIVRRDGAIV-ENVVLSTKEGDKDVYGNNIYLD 312 (659) Q Consensus 234 i~V~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~-et~~~~~~~~~~~~~~~~~~~~ 312 (659) ..|..... ...|.+.....|... -++.....+. ....++. T Consensus 161 ~tv~~d~~----------------------------------~~~F~i~s~ttg~~~~~~~~~a~~~~-----~~gt~~a 201 (502) T protein:vir:52 161 VSIAYDET----------------------------------GNRFIVSANVAGEDKKTEIDYAIDEG-----GEGEYIG 201 (502) T ss_pred eEEEEecC----------------------------------CceEEEEeccCCCcceeEEEEeecCC-----cchhHHH Confidence 01100000 000001000000000 0000000000 0000000 Q ss_pred hhhh-cccccceEEeecccCCccceeEEeecccccccccchhhhhhhHhhhhhcccccceEEEeccccccchhhhHHHHH Q lcl|NC_014792. 313 DYFA-KGTSNYIYATSLNWPKGFAGIINLMGGISANDQVTAGDLMQGWDLFADREALHINLLIAGAVAGEGDATASTVQK 391 (659) Q Consensus 313 ~~~~-~~~s~~v~~~~~~~~~~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~v~~ 391 (659) ..+. .....-+... ....|. ....+..++..+......-.-+++ +.. ...+-+. T Consensus 202 ~~l~l~~~~~av~v~------------~~~~g~------~aet~~~al~a~~~~~~~w~~~~~-a~~------~~~~~~l 256 (502) T protein:vir:52 202 ALLKLENGQASRKVG------------KNSVSL------KKETLGEALFNVAEVNNTWYGFTV-AAQ------LTDSEVE 256 (502) T ss_pred HHhccccccceeeee------------eecccc------cccCHHHHHHHHHhccCceEEEEE-eec------CChhHHH Confidence 0000 0000000000 001111 122233344433332222222333 211 1122345 Q ss_pred HHHHHHHhhCCEEEEEecCccccccccccCCHHHHH-HHhhccccccccccccccceEEEEcCceeEecccCCcceeecH Q lcl|NC_014792. 392 HVVSIADERQDCLAFISPPKGLLVNVPLTRAVDNLI-DWRTGGGSFDTDNMNISTTYAAIDGNYKYQYDKYNDVNRWVPL 470 (659) Q Consensus 392 ~l~~~~~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~ 470 (659) ++..+++..+.+|.+....... . .... .++. ..+. . ...+..+ +|++.+ -.+ T Consensus 257 a~a~~iea~~~~f~~~~~d~~~-~---~~~~-~~i~~~l~a-~----------~~~~t~~------~y~~~~-----~~~ 309 (502) T protein:vir:52 257 AAAKYAQANTKLFGANVIRAEQ-I---EWSA-DNIYKKLYD-A----------GLDHTLA------MFDKND-----MYP 309 (502) T ss_pred HHHHHHhhcCcEEEEEecCcce-e---cccc-chHHHHHHh-c----------cCceeEE------EecCCc-----chh Confidence 6777777776677653221111 1 1111 1121 1211 1 1112222 222211 135 Q ss_pred HHHHHHHHHHhhhcCC-ceECcCCcchhheeccccceeecChhHHHhhhhCCceEEEEEeCCCeEEEEcccccCCCcccc Q lcl|NC_014792. 471 AADMAGLCARTDDVSQ-PWMSPPGYNRGQILNVLKLAIEPRQTQRDRMYQEAINPVVGFAGGDGFVLYGDKTATKVPSPM 549 (659) Q Consensus 471 s~~~Ag~~a~~d~~~g-~~~span~~~~~i~g~~~~~~~~~~~e~~~Ln~~gin~i~~~~~~~G~~~wG~rT~~~~~~~~ 549 (659) .+.+.|.++.+|-.+- -...-.+|. +.|+.. ..++..|.+.|..+++|++..+-+ .+ .+..++++++ + T Consensus 310 ~aa~~g~~as~~f~~~~g~iT~~fk~---l~GV~~--~~lt~t~~~al~~~~~N~y~~~~~-~~-~~~~G~~~~G---~- 378 (502) T protein:vir:52 310 VSSALARLLSTNFAANNSTLTLKFKQ---QPTITA--DEITATEFAKAKRLGINVYTYFDD-VA-MIAEGTVIGG---K- 378 (502) T ss_pred HHHHHHHHHhcCCCcCcceeeecccc---cCCccc--CcCCHHHHHHHHhcCceEEEEecC-ee-EEecCeeeCC---c- Confidence 6777788888874331 112223343 344432 357899999999999999988743 44 4677777776 2 Q ss_pred ceeehhhHHHHHHHHHHHHHHHHhcC-----CCCHHHHHHHHHHHHHHHHHHHhcccee--------------------e Q lcl|NC_014792. 550 DHINVRRLTNMLKKNIGDASKYKLFE-----LNDNFTRASFRMETSQYLDGIRALGGIY--------------------E 604 (659) Q Consensus 550 ~~i~vrR~~~~i~~~i~~~~~~~v~e-----pn~~~l~~~i~~~i~~~l~~l~~~gal~--------------------g 604 (659) ||-+.+-.+|++..|++.+...++. |-|+.=...|+..|+.-|++-+++|.|. | T Consensus 379 -~iD~~~~~~Wl~~~lq~~l~~~L~~s~~kIPy~~~G~~~l~a~i~~~l~~a~~~G~I~~G~~~~~~~g~~~~~d~~~~g 457 (502) T protein:vir:52 379 -FADEIVILDWFVDAVQKEVFARLYKSPTKIPLTDKGQAILIAAVEKVCLEGINNGAFAPGKWTGAGFGNLSTGDYLDKG 457 (502) T ss_pred -hhhHHHHHHHHHHHHHHHHHHHHHhcCCCcccChhHHHHHHHHHHHHHHHHHhcCccccccccCcccceeeecccccCc Confidence 5778899999999999998776542 5677778999999999999999999984 6 Q ss_pred eEEEEc-cCCCCHHHhhCCEE-EEEEEEEecCCceEEEEEEEEee Q lcl|NC_014792. 605 GRVVCD-TTNNTPSVIDRNEF-VASIYYKPARSINYIVLNFVATS 647 (659) Q Consensus 605 ~~v~~d-~~~nt~~~i~~G~~-~~~i~~~p~~p~e~i~~~~~~~~ 647 (659) |.|.+. .++.++.|+.++++ -+.+.+++...+++|.|.+...+ T Consensus 458 y~v~~~~~~~~s~~dr~~R~~~~~~~~~~~aGaIh~v~i~~nv~~ 502 (502) T protein:vir:52 458 FYVWAAPMDTLSDSDRQARRATPIQTAVKLAGAIHSSDVIVNYNR 502 (502) T ss_pred eEEEeCchhhCCHHHHHcccCCCeEEEEEECceEEEEEEEEEEeC Confidence 888887 56889999999998 89999999999999999877666 No 61 >protein:vir:3165 Length: 426 # NCBI annotation: capsid protein CP67 # Family: family:all:28419 # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665936;genbank:gi:22091122;genbank:GeneID:951267 Probab=96.49 E-value=0.00057 Score=38.35 Aligned_cols=355 Identities=13% Similarity=0.015 Sum_probs=142.5 Q ss_pred cccceeeeeeeccccccccceeeeeeeccccccceeee--eccC-Cc-----eeeeeeeecccc-ccccccchh---hhh Q lcl|NC_014792. 245 DVGASKMLDIYPNGGSRASVARAVFNYGPQTDDQYAII--VRRD-GA-----IVENVVLSTKEG-DKDVYGNNI---YLD 312 (659) Q Consensus 245 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--v~~~-g~-----~~et~~~~~~~~-~~~~~~~~~---~~~ 312 (659) .. ..+.++- ......+.....|... +... .. ..|.-..+.... ..+...... ... T Consensus 1 m~--~~iVnV~-----------Is~~t~A~~~~~Fg~~liigs~~~~~p~~~f~~~~~Yss~~~V~~Dfg~~s~~Y~AA~ 67 (426) T protein:vir:31 1 MP--KQIVEIE-----------LTAEIADRPQETFTDAAIVGTAEEEPPDAEFGEVNQYSTSTSVGDDYGEDSDVYTASE 67 (426) T ss_pred CC--cceEEEE-----------eecccccccccccceeeeeeeccccccccccchhhhhhhHHHHHhcCCCChHHHHHHH Confidence 00 0111110 0001111111222211 1100 00 001001111000 001111111 111 Q ss_pred hhhhcccccceEEeecc------cCCcccee---EEeecccccccccchhhhhhhHhhhhhcccccceEEEeccc----- Q lcl|NC_014792. 313 DYFAKGTSNYIYATSLN------WPKGFAGI---INLMGGISANDQVTAGDLMQGWDLFADREALHINLLIAGAV----- 378 (659) Q Consensus 313 ~~~~~~~s~~v~~~~~~------~~~~~~~~---~~~~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~----- 378 (659) ..+..+.- ..+..... .......+ ++..+. .+.. .++.++..++.... +....+...+... T Consensus 68 ~~f~Q~~~-~~r~~v~~at~~~~~~~t~~~tv~g~~~s~~-a~~~-~~a~~i~~~~~~~~--~~~~~~~~~~~~t~~g~~ 142 (426) T protein:vir:31 68 AIEEMGAE-QWRVMVLEATEVTEEELSDGDTIDKVPILGN-HEVE-SPDGDIEFTTDDDP--DVEDFDAEIVINSATGDV 142 (426) T ss_pred HHHhCCce-eEEeeccccceeeeccCCcceeecceeeeec-ccCc-chHHHHHHhhcccc--ccccceeeeEecccccee Confidence 22222110 00100000 00000000 111111 1111 11222222221111 1111111111000 Q ss_pred -----------cccc---------------------hhhhHHHHHHHHHHHHhhCCEEEEEecCccccccccccCCHHHH Q lcl|NC_014792. 379 -----------AGEG---------------------DATASTVQKHVVSIADERQDCLAFISPPKGLLVNVPLTRAVDNL 426 (659) Q Consensus 379 -----------~~~~---------------------~~~~~~v~~~l~~~~~~~~~~~ai~d~p~~~~~~~~~~~~~~~~ 426 (659) ...+ +.....+...+...++..+ -+.+... .....-...+.. T Consensus 143 t~~~~~~~~~~s~~dw~~~~~~~s~~~~~~ia~~~~~~~~~~~~~~~~~wa~~~~-i~~va~~-----~e~~~~~~~~~~ 216 (426) T protein:vir:31 143 ATSEDSIELTYFHADWSQLDEFPSDVNNFAVADRRFDLKGVGVLDETHSWASDED-MGMIANG-----VNVDDYDSVDEA 216 (426) T ss_pred eccccceeeeeccCcchhhhcccccchhhhhhccccchhhhhhhHhhhhhhhhcc-eeeeeec-----cchhhhcchhhh Confidence 0000 0011111111111111110 1111000 000000001111 Q ss_pred HHHhhccccccccccccccceEEEEcCceeEecccCCcceeecHHHHHHHHHHHhhhcCCceECcCCcchhheecc---- Q lcl|NC_014792. 427 IDWRTGGGSFDTDNMNISTTYAAIDGNYKYQYDKYNDVNRWVPLAADMAGLCARTDDVSQPWMSPPGYNRGQILNV---- 502 (659) Q Consensus 427 ~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~Ag~~a~~d~~~g~~~span~~~~~i~g~---- 502 (659) ..++.. ..-|.|-......... ..--..+++++.++..+ ||..|.-+...+-..+ T Consensus 217 ~a~~~~---------------~~~y~p~~~~~~~~~~--~~~~~~~~~~~~~aa~~----~~~~~~~~~~~~~~~~~~~~ 275 (426) T protein:vir:31 217 MDVAHE---------------VAGYVPSGDLMMIVDA--SDDDLAAYQLGKFAVSE----PWYNPLWNELPAGETVSKNV 275 (426) T ss_pred hhhhhc---------------ccccccchhheeehhc--cccchhhHHhhhhhhhc----cccchhhhhccccccceeec Confidence 111111 1112222111100000 00012467788888777 4555532221111111 Q ss_pred --ccceeecChhHHHhhhhCCceEEEEEeCCCeEEEEcccccCCCccccceeehhhHHHHHHHHHHHHHHHHhc---C-C Q lcl|NC_014792. 503 --LKLAIEPRQTQRDRMYQEAINPVVGFAGGDGFVLYGDKTATKVPSPMDHINVRRLTNMLKKNIGDASKYKLF---E-L 576 (659) Q Consensus 503 --~~~~~~~~~~e~~~Ln~~gin~i~~~~~~~G~~~wG~rT~~~~~~~~~~i~vrR~~~~i~~~i~~~~~~~v~---e-p 576 (659) .+....+...++..++ +..|.+..+-+ +..+|-.-|..+......||-++|..+|+++.++..++..+= + | T Consensus 276 ~~~gv~~t~~~~~~A~~~-~~~n~~~~~~~--~~~i~~~~~~~G~~~~G~~iD~~~g~dwl~~~iq~~l~~ll~~~~KIp 352 (426) T protein:vir:31 276 GDPEEQGTFEGGDEAEGE-GPVNVLIDVSD--ANRVSNAVTTAGADSDTSFFDIRRTKVYTAEMLELDLESLQVSDDDVP 352 (426) T ss_pred cccccccccchhhhhhhc-CCceEEEEecC--ceeeecceeecccccchhhhhhHHHHHHHHHHHHHHHHHHhhcCCCCc Confidence 1222233333444555 66788887753 566666666666655667899999999999999999988663 2 7 Q ss_pred CCHHHHHHHHHHHHHHHHHHHhccc--eeeeEEEEccCCCCHHHhhCCEEE-EEEEEEecCCceEEEEEEEEee Q lcl|NC_014792. 577 NDNFTRASFRMETSQYLDGIRALGG--IYEGRVVCDTTNNTPSVIDRNEFV-ASIYYKPARSINYIVLNFVATS 647 (659) Q Consensus 577 n~~~l~~~i~~~i~~~l~~l~~~ga--l~g~~v~~d~~~nt~~~i~~G~~~-~~i~~~p~~p~e~i~~~~~~~~ 647 (659) -+..=+..|+..|+.-|++.++.|. +.+|.|...+...++.|..+.++. +++.......++++.++...+- T Consensus 353 yt~~Gi~~I~~~i~~~L~~~v~~~g~~~~~y~v~~P~~~~~~~dra~R~~~~i~~~~~laGAIh~v~I~g~v~v 426 (426) T protein:vir:31 353 FTEDGQAMIEDAIKGTMSGLTGSVGQPLAEYEVDVPEWDDDDVDRVNRNWGGIDLDARLAQRAHTFSLGLNVSV 426 (426) T ss_pred cchhHHHHHHHHHHHHHHHHhcCCCccccceeecCCCccccchhhhhhccCCceEEEEEeCcEEEEEEEEEEeC Confidence 7788888999999999999998653 457998877555566788887777 8888899999999999877655 No 62 >protein:vir:96104 Length: 504 # NCBI annotation: hypothetical protein ORF029 # Family: family:all:396 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294446;genbank:gi:149408343;genbank:GeneID:5237223 Probab=96.06 E-value=0.0011 Score=36.90 Aligned_cols=426 Identities=13% Similarity=0.012 Sum_probs=143.3 Q ss_pred eeccccccccccccceeeeeccceee-EEEeecCCccccccccceeccccceeeecccccccccccceeeccccc-cc-- Q lcl|NC_014792. 144 FIPSDKIIAFAKSVNQYPDLGPAWTA-EILTTSSGVSGTITLGKIVTDSGILLTEAENSEEAITSLEFQASLQKY-AM-- 219 (659) Q Consensus 144 ~~~~~~~~~~~~~~~~~~~~~~~~~~-~v~~~~~g~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~-- 219 (659) .++-+.+....+.+.........+.. -+.......+ ...+. ...... ++.... +..+.++....... +. T Consensus 1 mip~s~iV~V~~~v~~~~~~~~~~~~~l~l~~~~~~~-~~r~~-~y~s~~----~V~~~F-G~~S~ey~aA~~yF~~~~~ 73 (504) T protein:vir:96 1 MISQSRYIRIISGVGAGAPVAGRKLILRVMTTNNVIP-PGIVI-EFDNAN----AVLSYF-GAQSEEYQRAAAYFKFISK 73 (504) T ss_pred CCCccceeEeeecccccccccccccceeEeecccCCC-ccceE-EecCHH----HHHHhc-CCChHHHHHHHHHhhcCCC Confidence 11111111111111000000000000 0000000000 00000 000000 000000 00000000000000 00 Q ss_pred --ceeeeccccccceeEEEEEeecccccccceeeeeeeccccccccceeeeeeeccccccceeeeeccCCcee--eeeee Q lcl|NC_014792. 220 --PGVVALYPGEIGSTLEVEIVSKAAYDVGASKMLDIYPNGGSRASVARAVFNYGPQTDDQYAIIVRRDGAIV--ENVVL 295 (659) Q Consensus 220 --~~~~a~~~g~~g~~i~V~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~--et~~~ 295 (659) ......+.+.|.+.- ....+.......... .+. ...++.+++++ +|... ..+.+ T Consensus 74 ~~~~P~~l~igR~~~~a-------------~~~~l~g~~~~~~~~-----~~~--~i~~G~lsitv--~G~~~~~~~i~~ 131 (504) T protein:vir:96 74 SVNSPSSISFARWVNTA-------------IAPMVVGDNLPKTIA-----DFA--GFSAGVLTIMV--GAAEKNITAIDT 131 (504) T ss_pred CCccccEEEEEeecCcC-------------ccceEEechhHHHHH-----HHh--hhhceEEEEEE--cceeeeeccccc Confidence 000111112221110 000000000000000 000 00011112222 12111 11111 Q ss_pred eccccccccccchhhhhhhhhccccc---ceEEeecccCCccceeEEeeccccccccc---------------------- Q lcl|NC_014792. 296 STKEGDKDVYGNNIYLDDYFAKGTSN---YIYATSLNWPKGFAGIINLMGGISANDQV---------------------- 350 (659) Q Consensus 296 ~~~~~~~~~~~~~~~~~~~~~~~~s~---~v~~~~~~~~~~~~~~~~~~gg~~~~~~~---------------------- 350 (659) +. +++.......+...+...... -+.++-. .....+++.++..+.... T Consensus 132 S~---~ts~~~vA~~i~~al~~~~~~~~~~~tv~~d----~~~~~f~its~~tg~~~~~~~~~a~~~~~~~~lgl~~~~~ 204 (504) T protein:vir:96 132 SA---ATSMDNVASIIQTEIRKNTDPQLAQATVTWN----PNTNQFTLVGATIGTGVLAVAKSADPQDMSTALGWSTSNV 204 (504) T ss_pred cc---ccchHHHHHHHHhhhhcccccccccceEEEe----ccCCeEEEEeeccccceeEEEeeccccchhhhhhcccccc Confidence 10 000000000111111000000 0000000 000011111111110000 Q ss_pred ------chhhhhhhHhhhhhcccccceEEEeccccccchhhhHHHHHHHHHHHHhhCCEEEEEecCccccccccccCCHH Q lcl|NC_014792. 351 ------TAGDLMQGWDLFADREALHINLLIAGAVAGEGDATASTVQKHVVSIADERQDCLAFISPPKGLLVNVPLTRAVD 424 (659) Q Consensus 351 ------~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~v~~~l~~~~~~~~~~~ai~d~p~~~~~~~~~~~~~~ 424 (659) .......++..+......--.+..+... .....+.++..+++....++.+... +. .... . T Consensus 205 ~~v~g~~aet~~~al~al~~~~~~Wy~f~~a~~~------~~dd~ilalA~w~ea~~~~~~~~~~--~~-----~~~~-~ 270 (504) T protein:vir:96 205 VNVAGQAADLPDAAVAKSTNVSNNFGSFLFAGAT------LDNDQIKAVSAWNAAQNNQFIYTVA--TS-----LANL-G 270 (504) T ss_pred eEEeecccccHHHHHHHHHhhcCCeEEEEEEecc------CCHHHHHHHHHHHhhcCceEEEEEe--ec-----ccch-h Confidence 0000111111111111110111111100 0011122344444443333322110 00 0000 0 Q ss_pred HHHHHhhccccccccccccccceEEEEcCceeEecccCCcceeecHHHHHHHHHHHhhhcC--CceECcCCcchhheecc Q lcl|NC_014792. 425 NLIDWRTGGGSFDTDNMNISTTYAAIDGNYKYQYDKYNDVNRWVPLAADMAGLCARTDDVS--QPWMSPPGYNRGQILNV 502 (659) Q Consensus 425 ~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~Ag~~a~~d~~~--g~~~span~~~~~i~g~ 502 (659) ........ .......+++. .. ... -++.+..+.++.+|-++ | -.+-..| .+.|+ T Consensus 271 ~~~~~~~~----------~~~~~~~~~~~-------~~--~~~-~~~~~~~~~~as~~f~~~ng-~~T~~fk---~l~GV 326 (504) T protein:vir:96 271 ALFDLVKG----------NSGTALNVLSA-------TA--SND-FVEQCPSEILAATNYDEPGA-SQNYMYY---QFPGR 326 (504) T ss_pred hHHHhhhh----------cceeEEEEeec-------Cc--cch-hHHHHHHHHHHhcCcCcccc-ccccccc---ccCCc Confidence 00000000 00000111110 00 011 13455667777777333 2 0011223 33454 Q ss_pred ccceeecChhHHHhhhhCCceEEEEEeC-CCeEEEE-cccccCCCccccceeehhhHHHHHHHHHHHHHHHHhcC----C Q lcl|NC_014792. 503 LKLAIEPRQTQRDRMYQEAINPVVGFAG-GDGFVLY-GDKTATKVPSPMDHINVRRLTNMLKKNIGDASKYKLFE----L 576 (659) Q Consensus 503 ~~~~~~~~~~e~~~Ln~~gin~i~~~~~-~~G~~~w-G~rT~~~~~~~~~~i~vrR~~~~i~~~i~~~~~~~v~e----p 576 (659) . ...++..|.+.|..+|+|++..|-+ +..+.+| .+.++++. .+|.+|.+-+-.+|++..|+..+....-. | T Consensus 327 t--a~~lt~t~~~aL~~~~~N~y~~~a~~~~~~~~~~~G~~~gG~-~~~~wiDv~~~~~WL~~~lq~~l~~l~~~~~kIP 403 (504) T protein:vir:96 327 N--ITVSDDTAANTVDKSRGNYIGVTQANGQQLAFYQRGILCGGP-TDAVDMNVYANEIWLKSAIAQALLDLFLNVNAVP 403 (504) T ss_pred C--cccCCHHHHHHHHhcCCeEEEEeecccceeeEEecCeeeCCc-cccchhhhhhhHHHHHHHHHHHHHHHHhcCCCcc Confidence 3 2467899999999999999988753 1234555 44555543 24677999999999999999998875433 5 Q ss_pred CCHHHHHHHHHHHHHHHHHHHhcccee-----------------------------eeEEEEc-cCCCCHHHhh-CCEEE Q lcl|NC_014792. 577 NDNFTRASFRMETSQYLDGIRALGGIY-----------------------------EGRVVCD-TTNNTPSVID-RNEFV 625 (659) Q Consensus 577 n~~~l~~~i~~~i~~~l~~l~~~gal~-----------------------------g~~v~~d-~~~nt~~~i~-~G~~~ 625 (659) -|..=...|+..++.-|++-+++|.|. ||.|.++ .++-++++.. ++... T Consensus 404 yt~~Gi~~l~a~i~~vl~~av~~G~I~~Gv~~~~~q~~~I~~~~~~d~~~~~~~~~GYyv~~~~~s~~s~~~r~~R~~~~ 483 (504) T protein:vir:96 404 ASMVGEAMTLAVLQPVLDKATSNGTFTYGKDISAVQQQYITQITGDRRAWRQVQTLGYWINITFSSYTNSNTGLTEWKAN 483 (504) T ss_pred cCHhhHHHHHHHHHHHHHHHHhcceeccCccCCccchheecccccccccccceeccceEEEecChhccChhHhhhccccc Confidence 577778899999999999999999872 4888876 3444555444 45566 Q ss_pred EEEEEEecCCceEEEEEEEEe Q lcl|NC_014792. 626 ASIYYKPARSINYIVLNFVAT 646 (659) Q Consensus 626 ~~i~~~p~~p~e~i~~~~~~~ 646 (659) +.+.++--..+++|++.-.-. T Consensus 484 ~~~~y~~~gaI~~v~~~~~~v 504 (504) T protein:vir:96 484 YTLIYSKGDAIRFVEGSDVMI 504 (504) T ss_pred eEEEEEECCeEEEEEeccccC Confidence 777777788888887743322 No 63 >protein:vir:99586 Length: 507 # NCBI annotation: hypothetical protein # Family: family:all:396 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039795;genbank:gi:126011045;genbank:GeneID:4818281 Probab=95.76 E-value=0.0015 Score=36.06 Aligned_cols=423 Identities=10% Similarity=0.003 Sum_probs=148.1 Q ss_pred eeccccccccccccceeeeeccceeeEEEeecCCccccccccceeccccceeee-----ccccc-----ccccccceeec Q lcl|NC_014792. 144 FIPSDKIIAFAKSVNQYPDLGPAWTAEILTTSSGVSGTITLGKIVTDSGILLTE-----AENSE-----EAITSLEFQAS 213 (659) Q Consensus 144 ~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~g~~~~~~~~~~v~~~~~~~~~-----~~~~~-----~~~~~~~~~~~ 213 (659) .++-+.+. +..+.+.... +..... ...++......++. ..... -+..+.++... T Consensus 1 mip~s~iV------nV~~~v~~~a---------~~~~~~-~~~lilt~~~~~~~~r~~~y~s~~~V~~~FG~~S~ey~aA 64 (507) T protein:vir:99 1 MISQSRYV------RIVSGVGAGA---------PVAQRR-LIMRVMTTNAVLPPGVVFESSSADAVGAYFGMASEEYKRA 64 (507) T ss_pred CCCcccee------EEeeeccccC---------cccccc-cceeeeccccCCCccceEeecCHHHHHHhcCCChHHHHHH Confidence 22222222 2222111110 000000 00010000000000 00000 00000000000 Q ss_pred ccc-cccc----eeeeccccccceeEEEEEeecccccccceeeeeeeccccccccceeeeeeeccccccceeeeeccCCc Q lcl|NC_014792. 214 LQK-YAMP----GVVALYPGEIGSTLEVEIVSKAAYDVGASKMLDIYPNGGSRASVARAVFNYGPQTDDQYAIIVRRDGA 288 (659) Q Consensus 214 ~~~-~~~~----~~~a~~~g~~g~~i~V~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~ 288 (659) ... -..+ .....+.+.|.+. .....+...... ..+. ..-...+. ++++..+|. T Consensus 65 ~~yFsq~p~~~~~P~~L~igR~~~~-------------~~~a~l~g~~~~----~~l~---~~~~~~~G--~lti~v~G~ 122 (507) T protein:vir:99 65 KAYMSFISKSINSPSYISFARWVNA-------------AIASMIVGDSLV----KNLP---ALKAVATP--TLSLSIGGT 122 (507) T ss_pred HHHhccCCCCCcccceEEEEeecCc-------------cccceeecchhh----hhHH---HHhhhcce--eEEEEEcCc Confidence 000 0000 0001111111110 000000000000 0000 00000111 122222232 Q ss_pred eee--eeeeeccccccccccchhhhhhhhhcc---------------cccceEEeecccCCccceeEE-eeccccc---- Q lcl|NC_014792. 289 IVE--NVVLSTKEGDKDVYGNNIYLDDYFAKG---------------TSNYIYATSLNWPKGFAGIIN-LMGGISA---- 346 (659) Q Consensus 289 ~~e--t~~~~~~~~~~~~~~~~~~~~~~~~~~---------------~s~~v~~~~~~~~~~~~~~~~-~~gg~~~---- 346 (659) ... .+.++. .++...-...+...+... .+.++......+........+ ...|.+- T Consensus 123 ~~t~~~i~lS~---~ts~~~vAs~i~~~l~a~~~~~~~~~tv~~d~~~~~F~v~s~~tG~~s~i~~at~~~~gt~~s~l~ 199 (507) T protein:vir:99 123 VVPIAGIDLTA---ALTLTDVAATLQTKIRASANAELATATVTFNTTTNQFVLNGTTTGALAPTITAVRTDPATDISSLL 199 (507) T ss_pred eeEeccccccc---cCCHHHHHHHHHHhhhccccccccceEEEEecCCceEEEEeeeccccceeEEEEcCCchhhHHHHh Confidence 111 111111 011111111111111100 000000000000000000000 0000000 Q ss_pred ---------ccccchhhhhhhHhhhhhcccccceEEEeccccccchhhhHHHHHHHHHHHHhhCCEEEEEecCccccccc Q lcl|NC_014792. 347 ---------NDQVTAGDLMQGWDLFADREALHINLLIAGAVAGEGDATASTVQKHVVSIADERQDCLAFISPPKGLLVNV 417 (659) Q Consensus 347 ---------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~v~~~l~~~~~~~~~~~ai~d~p~~~~~~~ 417 (659) ........+..++..+......-.-++.+-. + +....-+.+|.+++|....+|.+.-.-.. T Consensus 200 ~~~~~~a~~~~g~~aet~~~a~~a~~~~~~nW~~~~~a~~----~-~~td~~~lalA~wiea~~~~f~~~~~~~~----- 269 (507) T protein:vir:99 200 GWTNTGTVFVKGQAAETPDTSISKSAAISTNFGSFIYTST----P-ALTNDQITAVASWNASQNNMYMYSVPTTI----- 269 (507) T ss_pred ccccccceEeecccccCHHHHHHHHHhhcCCeEEEEEEec----c-ccChHHHHHHHHHHhhcCcEEEEEEecCc----- Confidence 0000011111122221111110011111000 0 00111223444445544444433211000 Q ss_pred cccCCHHHHHHHhhccccccccccccccceEEEEcCceeEecccCCcceeecHHHHHHHHHHHhhhcC--CceECcCCcc Q lcl|NC_014792. 418 PLTRAVDNLIDWRTGGGSFDTDNMNISTTYAAIDGNYKYQYDKYNDVNRWVPLAADMAGLCARTDDVS--QPWMSPPGYN 495 (659) Q Consensus 418 ~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~Ag~~a~~d~~~--g~~~span~~ 495 (659) . ....+... ......+...++. .......+.+.+.|.++.+|-++ | -.+-..|. T Consensus 270 --a----~~~~~~~~--------~~~~~~~~~~~~~---------~~~~~~~~~aa~~g~~as~nf~~~ng-~~T~~fk~ 325 (507) T protein:vir:99 270 --A----NIGTLYAA--------VKGFSGCALNITS---------DSLPVDYIEQSPCEILAATDYTRVNA-TQNYMYYQ 325 (507) T ss_pred --h----hhhhhhhh--------hhhcceeEEEeec---------ccccchhHHHHHHHHHHhhccCcCcc-ceeecccc Confidence 0 00000000 0000001111111 11111235667778888877433 2 00111222 Q ss_pred hhheeccccceeecChhHHHhhhhCCceEEEEEeC-CCeEEEEcccccCCCccccceeehhhHHHHHHHHHHHHHHHHhc Q lcl|NC_014792. 496 RGQILNVLKLAIEPRQTQRDRMYQEAINPVVGFAG-GDGFVLYGDKTATKVPSPMDHINVRRLTNMLKKNIGDASKYKLF 574 (659) Q Consensus 496 ~~~i~g~~~~~~~~~~~e~~~Ln~~gin~i~~~~~-~~G~~~wG~rT~~~~~~~~~~i~vrR~~~~i~~~i~~~~~~~v~ 574 (659) +.|+. ...++..|.+.|..+|+|+...+-+ ++.+.+|-.-.+++-..+|.++.+-+=.+|++..++..+....- T Consensus 326 ---l~GV~--a~~lt~t~a~al~~~n~N~y~~~a~~~~~~~~~~~G~~~gG~~~fid~d~~~~~~WL~~~iq~~l~~l~~ 400 (507) T protein:vir:99 326 ---FPSRN--ITVSDDTTANLVDANRGNYIGQTQSAGQSLAFYQRGILCGGPNDAVDMNIYANEIWLKSAISAQILSLFL 400 (507) T ss_pred ---cCCcc--cccCCHHHHHHHHhcCCeEEEEeccccceeeEEecCeeeCCcccceeeeeecchHHHHHHHHHHHHHHHh Confidence 33443 2468899999999999999988754 23466665544443333577777777777888888888876433 Q ss_pred C----CCCHHHHHHHHHHHHHHHHHHHhcccee-----------------------------eeEEEEc-cCCCCHHHhh Q lcl|NC_014792. 575 E----LNDNFTRASFRMETSQYLDGIRALGGIY-----------------------------EGRVVCD-TTNNTPSVID 620 (659) Q Consensus 575 e----pn~~~l~~~i~~~i~~~l~~l~~~gal~-----------------------------g~~v~~d-~~~nt~~~i~ 620 (659) . |-|..=...|+..++.-|++-+++|.|. ||.+.++ .++.++++.. T Consensus 401 ~~~kIPyt~~G~~~l~a~i~~~l~~av~nG~I~~Gvtl~~~q~~~in~~~~~~~~~~~~~~~Gyy~~~~~~s~~~~~~r~ 480 (507) T protein:vir:99 401 NVPRVPANETGESMLLSVIQSVVNTAKNNGTISAGKNLNVIQQQYITQISGDANAWRQVANIGYWLNITFSSYTNPNTQL 480 (507) T ss_pred cCCCCccChhhHHHHHHHHHHHHHHHHhccccccCCcccccchheecccccccccccceeccceEEEeCChHhcChhhhh Confidence 2 5677778899999999999999999884 3667765 3444544444 Q ss_pred -CCEEEEEEEEEecCCceEEEEEEEEe Q lcl|NC_014792. 621 -RNEFVASIYYKPARSINYIVLNFVAT 646 (659) Q Consensus 621 -~G~~~~~i~~~p~~p~e~i~~~~~~~ 646 (659) ++...+.+.+.--..+++|++.-.-. T Consensus 481 ~r~~~~~~~~y~~~gaI~~v~~~~~~v 507 (507) T protein:vir:99 481 TEWKASYQLIYSKDDAIRFVEGTDTLI 507 (507) T ss_pred ccccceEEEEEEeCCeEEEEEeeeecC Confidence 66777778888888888887754432 No 64 >protein:vir:101576 Length: 501 # NCBI annotation: gp04 # Family: family:all:396 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958108;genbank:gi:41057654;genbank:GeneID:2716834 Probab=86.46 E-value=0.045 Score=27.95 Aligned_cols=455 Identities=11% Similarity=0.051 Sum_probs=191.4 Q ss_pred Cce--ecCceEEEEecCCCcccccC-CcceEEEeecccCCCCccEEeCCHHHHHHHcCCcCCCchhHHHHHHHHH---cC Q lcl|NC_014792. 1 MAL--LSPGIELKETTVQSTVVRNA-TGRAALVGKFQWGPAFQVTQITNEVELVDLFGGPNNITADYFMSGMNFL---QY 74 (659) Q Consensus 1 ~~~--~~PGVyveE~~~~~~~~~~~-ts~~afvG~~~~Gp~~~p~~i~s~~~~~~~fG~~~~~~~~~~~~~~~f~---ng 74 (659) |-+ +.=--+|+-.+.-....... .-.+-|++....=|+++..+.+|..|-...|| ..+.++.+.+.+|- |. T Consensus 1 m~~~~ip~s~iV~V~~~v~~~~~~~~~~~~l~l~~~~~~~~~~~~~~~s~~~V~~~FG---~~S~ey~aA~~yFsg~~~q 77 (501) T protein:vir:10 1 MPTTTIPIDQIVQMLPGVIGAGGAPGRLTGLVLTQDTSVQPGQLADFFQETDVENWFG---ALSNEAKIADAYFPGIVNG 77 (501) T ss_pred CCCCCcccceEEEEeeecccCCCccccceeEEEeccCCCCccceEEecCHHHHHHhcC---CChHHHHHHHHHhhhhcCC Confidence 876 43455665444322222222 22344666666678899999999999999999 55666677777775 32 Q ss_pred C---CeEEEEeccCCcccccccccccccccccccccccccccceeeeeeccccccccceeeeeeccCcceeeeecccccc Q lcl|NC_014792. 75 G---NDLRTVRVVNRDHAKNASPVAGNIESTIATAGSNYAVGDVIQVKHNQTVVETSGRITKVDVDGKILAVFIPSDKII 151 (659) Q Consensus 75 G---~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~ 151 (659) . ++|||-|-...... +...+..+.. .+...-...-..+.+.. +|.....-++.+... T Consensus 78 ~p~P~~l~igR~~~~~~~--~~l~g~~l~~--~~la~~~~~sg~l~vti----------------~g~~~~~~i~ls~at 137 (501) T protein:vir:10 78 GQLPYDLKFARYVAADAP--ASVYGIPLTG--VTLAQLQGYSGTLTVTT----------------AAQHVSANISLAAAT 137 (501) T ss_pred CccccEEEEEeecCCCcc--ceEeccchhh--hhhhhcceeeeEEEEee----------------ccceeeccccccccc Confidence 2 57999997643211 1111111110 00000000000111111 110000000000000 Q ss_pred ccccccceeeeeccceeeEEEeecCCccccccccceecccc-ceeeecccccccccccceeecccccccceeeecccccc Q lcl|NC_014792. 152 AFAKSVNQYPDLGPAWTAEILTTSSGVSGTITLGKIVTDSG-ILLTEAENSEEAITSLEFQASLQKYAMPGVVALYPGEI 230 (659) Q Consensus 152 ~~~~~~~~~~~~~~~~~~~v~~~~~g~~~~~~~~~~v~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~ 230 (659) +............ . ...+ .+..++. ..+..... .. |. T Consensus 138 ----s~~~vAs~i~~al-------~----~~~~-tv~~d~~~~~f~its~------------tt-------------G~- 175 (501) T protein:vir:10 138 ----SFANAATLIEAAF-------T----SPDF-VVAYDALRNRFTVVTN------------AT-------------GT- 175 (501) T ss_pred ----CHHHHHHHHhhhc-------c----CCce-EEEEcccCceEEEEee------------cc-------------CC- Confidence 0000000000000 0 0000 0000000 01100000 00 00 Q ss_pred ceeEEEEEeecccccccceeeeeeeccccccccceeeeeeeccccccceeeeeccCCceeeeeeeeccccccccccchhh Q lcl|NC_014792. 231 GSTLEVEIVSKAAYDVGASKMLDIYPNGGSRASVARAVFNYGPQTDDQYAIIVRRDGAIVENVVLSTKEGDKDVYGNNIY 310 (659) Q Consensus 231 g~~i~V~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~et~~~~~~~~~~~~~~~~~~ 310 (659) +..+.. .+.. ...+ ..+ ..+... ...+...|...|+ T Consensus 176 ~~~i~~--~~~~-~~la--~~l-----~Lt~~~----------------~a~v~~~g~~aet------------------ 211 (501) T protein:vir:10 176 AAAISA--VTGT-NNLA--DEL-----GLSAAA----------------GATLQAAGVAADT------------------ 211 (501) T ss_pred ceeEEE--eeCc-hhhh--hhc-----Cccccc----------------cceEEecCccccc------------------ Confidence 000110 0000 0000 000 000000 0000000000000 Q ss_pred hhhhhhcccccceEEeecccCCccceeEEeecccccccccchhhhhhhHhhhhhcccccceEEEeccccccchhhhHHHH Q lcl|NC_014792. 311 LDDYFAKGTSNYIYATSLNWPKGFAGIINLMGGISANDQVTAGDLMQGWDLFADREALHINLLIAGAVAGEGDATASTVQ 390 (659) Q Consensus 311 ~~~~~~~~~s~~v~~~~~~~~~~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~v~ 390 (659) +..++..+......-..+..+.. ...+-+ T Consensus 212 --------------------------------------------~~~a~~a~~~~~~~Wy~f~~a~~-------~~~~~~ 240 (501) T protein:vir:10 212 --------------------------------------------PASAMNRAVGLSRNWATFTTAWT-------AVIADR 240 (501) T ss_pred --------------------------------------------HHHHHHHHHhccCceEEEEEecC-------CChHHH Confidence 00111111110000001111100 011122 Q ss_pred HHHHHHHHhhCCEEEEEecCccccccccccCCHHHHHHHhhccccccccccccccceEEEEcCceeEecccCCcceeecH Q lcl|NC_014792. 391 KHVVSIADERQDCLAFISPPKGLLVNVPLTRAVDNLIDWRTGGGSFDTDNMNISTTYAAIDGNYKYQYDKYNDVNRWVPL 470 (659) Q Consensus 391 ~~l~~~~~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~ 470 (659) .++.++++....+|.+.-.- ............++....... ...+...+|+. ..+ T Consensus 241 la~A~wiea~~~~f~~~~~~--~~~~~~~~~~~~~i~~~l~~~----------~y~~t~~~y~~-------------~~~ 295 (501) T protein:vir:10 241 LAFAAWNSGQAYKYMYVAPD--LEAASIVTNNAASFGAQVFAA----------PYQGTLPLYGD-------------QAT 295 (501) T ss_pred HHHHHHHHhcCceEEEEEec--CchhhhhhhhhhhHHHHHHhc----------CCCceEEECCC-------------CcH Confidence 34555666554444332110 000000011111221111110 12233343321 124 Q ss_pred HHHHHHHHHHhhhcCCc-eECcCCcchhheeccccceeecChhHHHhhhhCCceEEEEEeC-CCeEEEEcccccCCCccc Q lcl|NC_014792. 471 AADMAGLCARTDDVSQP-WMSPPGYNRGQILNVLKLAIEPRQTQRDRMYQEAINPVVGFAG-GDGFVLYGDKTATKVPSP 548 (659) Q Consensus 471 s~~~Ag~~a~~d~~~g~-~~span~~~~~i~g~~~~~~~~~~~e~~~Ln~~gin~i~~~~~-~~G~~~wG~rT~~~~~~~ 548 (659) .+.+.|..+.+|-++-. -.+-..|.+. .|+ ....++..|.+.|..+|+|+...+-+ ++.+.+|-.-++++ . T Consensus 296 ~aa~~g~~as~nf~~~~g~~T~~fkq~~--~Gi--~a~~lt~t~a~al~~~~~N~y~~~~~~~~~~~~~~~G~~sG---~ 368 (501) T protein:vir:10 296 AGAVMGYAASINFQLRNGRTVLAFRQFN--AGV--PATAHDLPTANALRSNNYTYIGAYANAANNYTIAYDGKLSG---K 368 (501) T ss_pred HHHHHHHHHhhCcccCccceeeeccccC--CCc--CcccCCHHHHHHHHhcCCeEEEEeccccceeeEEecCeeec---c Confidence 56778888888754311 0011112110 011 12457899999999999999988853 24577886556665 3 Q ss_pred cceeehhhHHHHHHHHHHHHHHHHhcC----CCCHHHHHHHHHHHHHHHHHHHhcccee--------------------- Q lcl|NC_014792. 549 MDHINVRRLTNMLKKNIGDASKYKLFE----LNDNFTRASFRMETSQYLDGIRALGGIY--------------------- 603 (659) Q Consensus 549 ~~~i~vrR~~~~i~~~i~~~~~~~v~e----pn~~~l~~~i~~~i~~~l~~l~~~gal~--------------------- 603 (659) |.+|.+-+-.+|+++.++..+....-. |-|..=...|+..++.-|++-+++|.|. T Consensus 369 ~~wiD~~~~~~Wl~~~iq~~l~~ll~~~~kIPyt~~G~~~l~a~v~~~l~~av~nG~I~~Gv~l~~~q~~~i~~~~g~~~ 448 (501) T protein:vir:10 369 FLWVDTYLDQIYLNAELQRAEFEAMLAYNSLPYNEDGYTALYRAGVDVIDAAVTSGIIRAGVTLTNSQLQQIDAAAGVAG 448 (501) T ss_pred ceeehhhhhHHHHHHHHHHHHHHHHHhcCCcccCHHHHHHHHHHHHHHHHHHHhCceeecCCCCCcccceeeccccCccc Confidence 566888887888888888887764432 6677788889999999999999999883 Q ss_pred --------eeEEEEccCCCCHHHh-hCCEEEEEEEEEecCCceEEEEEEEEeecCeeEEEec Q lcl|NC_014792. 604 --------EGRVVCDTTNNTPSVI-DRNEFVASIYYKPARSINYIVLNFVATSTGADFDELI 656 (659) Q Consensus 604 --------g~~v~~d~~~nt~~~i-~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~ 656 (659) ||.+.++...+++++. .+....+.+.++--..+++|++-. .||+ T Consensus 449 ~~~~v~~~Gyy~~~~~~~~~~~~R~~R~~p~~~~~y~~~gaIh~v~i~s---------~~v~ 501 (501) T protein:vir:10 449 AGQLVQTRGWYFLIGDPANPGQARQNRTTPACTLWYSDGGSIQQLTIGS---------NAVI 501 (501) T ss_pred cccceeccceeEeeccccCChhhhhhccccceEEEEEeCCceeEEEeee---------eecC Confidence 3666666433333333 334455666666666667666532 2344 No 65 >protein:vir:106730 Length: 501 # NCBI annotation: gp04 # Family: family:all:396 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944312;genbank:gi:38638611;genbank:GeneID:2657359 Probab=85.84 E-value=0.05 Score=27.72 Aligned_cols=452 Identities=10% Similarity=0.030 Sum_probs=192.5 Q ss_pred Cce--ecCceEEEEecCCCcccccC-CcceEEEeecccCCCCccEEeCCHHHHHHHcCCcCCCchhHHHHHHHHH---cC Q lcl|NC_014792. 1 MAL--LSPGIELKETTVQSTVVRNA-TGRAALVGKFQWGPAFQVTQITNEVELVDLFGGPNNITADYFMSGMNFL---QY 74 (659) Q Consensus 1 ~~~--~~PGVyveE~~~~~~~~~~~-ts~~afvG~~~~Gp~~~p~~i~s~~~~~~~fG~~~~~~~~~~~~~~~f~---ng 74 (659) |-+ +.=--+|+-.+.-....... .-.+-+++....=|+++....+|..|-...|| ..+.++.+.+.+|- |. T Consensus 1 m~~~~ip~s~iV~V~~~v~~~~~~~~~f~~lll~~~~~~~~~r~~~y~s~~~V~~~FG---~~S~ey~aA~~yFsg~~~q 77 (501) T protein:vir:10 1 MPTTTIPIDQIVQMLPGVIGAGGAPGRLTGLVLTQDTSVQPGQLADFFQKTDVENWFG---ALSNEAKIADAYFPGIVNG 77 (501) T ss_pred CCcCccccceEEEEeeecccCCCcccccceEEEecccCCCccceeeecCHHHHHHhcC---CChHHHHHHHHHhhhhcCC Confidence 887 43455565444222222222 22234555555568888888899999999999 55666677777774 32 Q ss_pred C---CeEEEEeccCCcccccccccccccccccccccccccccceeeeeeccccccccceeeeeeccCcceeeeecccccc Q lcl|NC_014792. 75 G---NDLRTVRVVNRDHAKNASPVAGNIESTIATAGSNYAVGDVIQVKHNQTVVETSGRITKVDVDGKILAVFIPSDKII 151 (659) Q Consensus 75 G---~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~ 151 (659) . ++|||-|-...... +...+..+.......-. ..--.+.+. .+|.....-++.+... T Consensus 78 ~p~P~~l~igR~~~~~~~--~~l~g~~l~~~~la~~~--~~~g~l~i~----------------i~g~~~~~~i~~s~at 137 (501) T protein:vir:10 78 GQLPYDLKFARYVAADAP--ASVYGIPLTGITLAQLQ--GYSGTLTVT----------------TAAQHVSANISLAAAT 137 (501) T ss_pred CccccEEEEEeecccCcc--ceeeeceehhhhhhhhh--heeeEEEEe----------------eccceeeecccccccc Confidence 2 57999997653211 11111111110000000 000011111 1111000000000000 Q ss_pred ccccccceeeeeccceeeEEEeecCCccccccccceecccc-ceeeecccccccccccceeecccccccceeeecccccc Q lcl|NC_014792. 152 AFAKSVNQYPDLGPAWTAEILTTSSGVSGTITLGKIVTDSG-ILLTEAENSEEAITSLEFQASLQKYAMPGVVALYPGEI 230 (659) Q Consensus 152 ~~~~~~~~~~~~~~~~~~~v~~~~~g~~~~~~~~~~v~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~ 230 (659) +....+....... . ...+ .+..+.. ..+..... ..+ T Consensus 138 ----s~~~vA~~i~~al-------~----~~~~-tv~~d~~~~~f~i~~~------------t~G--------------- 174 (501) T protein:vir:10 138 ----SFANAATLIEAAF-------T----SPDF-VVAYDALRNRFTVVTN------------TTG--------------- 174 (501) T ss_pred ----CHHHHHHHHHHhh-------c----CCce-EEEEecccceEEEEec------------ccC--------------- Confidence 0000000000000 0 0000 0000000 01100000 000 Q ss_pred ceeEEEEEeecccccccceeeeeeeccccccccceeeeeeeccccccceeeeeccCCceeeeeeeeccccccccccchhh Q lcl|NC_014792. 231 GSTLEVEIVSKAAYDVGASKMLDIYPNGGSRASVARAVFNYGPQTDDQYAIIVRRDGAIVENVVLSTKEGDKDVYGNNIY 310 (659) Q Consensus 231 g~~i~V~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~et~~~~~~~~~~~~~~~~~~ 310 (659) ....+...+... .. ...+ ..+... ...+...|...|+ T Consensus 175 -~~~~i~~~t~~~-d~--a~~l-----~Lt~~~----------------~a~v~~~g~~aet------------------ 211 (501) T protein:vir:10 175 -TAAAISAVTGTN-NL--ADEL-----GLSAAA----------------GATLQAAGVAADT------------------ 211 (501) T ss_pred -cceeEEEeeccc-cc--hhhh-----cccccC----------------ceeEEecCccccc------------------ Confidence 000000000000 00 0000 000000 0000000000000 Q ss_pred hhhhhhcccccceEEeecccCCccceeEEeecccccccccchhhhhhhHhhhhhcccccceEEEeccccccchhhhHHHH Q lcl|NC_014792. 311 LDDYFAKGTSNYIYATSLNWPKGFAGIINLMGGISANDQVTAGDLMQGWDLFADREALHINLLIAGAVAGEGDATASTVQ 390 (659) Q Consensus 311 ~~~~~~~~~s~~v~~~~~~~~~~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~v~ 390 (659) +..++..+......--.+..+.. ....-+ T Consensus 212 --------------------------------------------~~~Al~a~~~~~~~Wy~f~~a~~-------~~~~~~ 240 (501) T protein:vir:10 212 --------------------------------------------PASAMNRAVGLSRNWATFTTAWT-------AVIADR 240 (501) T ss_pred --------------------------------------------HHHHHHHHHhcccceEEEEEEec-------CChHHH Confidence 00111111111100001111100 001122 Q ss_pred HHHHHHHHhhCCEEEEE--ecCccccccccccCCHHHHHHHhhccccccccccccccceEEEEcCceeEecccCCcceee Q lcl|NC_014792. 391 KHVVSIADERQDCLAFI--SPPKGLLVNVPLTRAVDNLIDWRTGGGSFDTDNMNISTTYAAIDGNYKYQYDKYNDVNRWV 468 (659) Q Consensus 391 ~~l~~~~~~~~~~~ai~--d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~ 468 (659) .++.++++....+|.+. |..... .......++....... +..+....|+ . - T Consensus 241 la~A~wi~a~~~~f~~~~~~~~~~~----~~~~~~~~i~~~l~~~----------~y~~t~~~y~------~-------~ 293 (501) T protein:vir:10 241 LAFAAWNSGQAYKYMYVAPDLEAAS----IVTNNAASFGAQVFAA----------PYQGTLPLYG------D-------Q 293 (501) T ss_pred HHHHHHHHhcCceEEEEEecCccee----eecccchhHHHHHHhc----------CCCceEEECC------C-------C Confidence 34556666555444332 221110 0111112222111111 1223333332 1 2 Q ss_pred cHHHHHHHHHHHhhhcC--CceECcCCcchhheeccccceeecChhHHHhhhhCCceEEEEEeC-CCeEEEEcccccCCC Q lcl|NC_014792. 469 PLAADMAGLCARTDDVS--QPWMSPPGYNRGQILNVLKLAIEPRQTQRDRMYQEAINPVVGFAG-GDGFVLYGDKTATKV 545 (659) Q Consensus 469 p~s~~~Ag~~a~~d~~~--g~~~span~~~~~i~g~~~~~~~~~~~e~~~Ln~~gin~i~~~~~-~~G~~~wG~rT~~~~ 545 (659) +|.+.+.|..+.+|-++ | -.+-..|.+. .|+ ....++..|.+.|..+|+|++..|-+ ++.+.+|-.-+++++ T Consensus 294 ~~~aa~~g~~as~nf~~~~g-~~T~~fkql~--~Gv--~a~~l~~t~a~al~~~~~N~y~~~~~~~~~~~~~~~G~~sG~ 368 (501) T protein:vir:10 294 ATAGAVMGYAASINFQLRNG-RTVLAFRQFN--AGV--PATAHDLPTANALRSNNYTYIGAYANAANNYTIAYDGKLSGK 368 (501) T ss_pred CHHHHHHHHHHhcCcccCcc-eeeeeecccC--CCc--CcccCCHHHHHHHHhcCCeEEEEEecccceeeEEEcceeecc Confidence 36678888888887543 2 0011122210 111 12467889999999999999988753 234778755556653 Q ss_pred ccccceeehhhHHHHHHHHHHHHHHHHhcC----CCCHHHHHHHHHHHHHHHHHHHhcccee------------------ Q lcl|NC_014792. 546 PSPMDHINVRRLTNMLKKNIGDASKYKLFE----LNDNFTRASFRMETSQYLDGIRALGGIY------------------ 603 (659) Q Consensus 546 ~~~~~~i~vrR~~~~i~~~i~~~~~~~v~e----pn~~~l~~~i~~~i~~~l~~l~~~gal~------------------ 603 (659) |.+|.+.+-.+|+++.|+..+....-. |-|..=...|+..++.-|++-+++|.|. T Consensus 369 ---~~wiD~~~g~dWl~~~iq~~l~~ll~~~~kIPyt~~G~~~l~a~i~~~l~~av~nG~Ia~Gv~l~~~q~~~i~~~~g 445 (501) T protein:vir:10 369 ---FLWVDTYLDQIYLNAELQRAEFEAMLAYNSLPYNEDGYTANYRAGVDVIDAAVTSGIIRAGVTLTNSQLQQIDAVAG 445 (501) T ss_pred ---ceehhhHhhHHHHHHHHHHHHHHHHhcCCCcccCHHHHHHHHHHHHHHHHHHHhCcceecCcccCcccceeeccccc Confidence 567888898999999999988875433 5567778889999999999999999883 Q ss_pred -----------eeEEEEccCCCCHHH-hhCCEEEEEEEEEecCCceEEEEEEEEeecCeeEEEec Q lcl|NC_014792. 604 -----------EGRVVCDTTNNTPSV-IDRNEFVASIYYKPARSINYIVLNFVATSTGADFDELI 656 (659) Q Consensus 604 -----------g~~v~~d~~~nt~~~-i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~ 656 (659) ||.+.++....++++ ..+....+.+.++--..+++|++-. .||+ T Consensus 446 ~~~~~~~v~~~Gyy~~~~~~~~~~~~R~~R~~p~~~~~y~~~gaIh~v~i~s---------~~v~ 501 (501) T protein:vir:10 446 VAGAGALVQTRGWYFLIGNPANPGQARQNRTSPACTLWYSDGGSIQELTIGS---------NAVI 501 (501) T ss_pred ccccccceeccceEEeeCcccCChhhhhhcccCceEEEEEeCCceeEEEeee---------eecC Confidence 366666643333333 3334455666666666677766532 2334 No 66 >protein:vir:3636 Length: 501 # NCBI annotation: gp04 # Family: family:all:396 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705631;genbank:gi:23752316;genbank:GeneID:955753 Probab=84.28 E-value=0.062 Score=27.22 Aligned_cols=452 Identities=12% Similarity=0.041 Sum_probs=194.0 Q ss_pred Cce--ecCceEEEEecCCCcccccC-CcceEEEeecccCCCCccEEeCCHHHHHHHcCCcCCCchhHHHHHHHHH---cC Q lcl|NC_014792. 1 MAL--LSPGIELKETTVQSTVVRNA-TGRAALVGKFQWGPAFQVTQITNEVELVDLFGGPNNITADYFMSGMNFL---QY 74 (659) Q Consensus 1 ~~~--~~PGVyveE~~~~~~~~~~~-ts~~afvG~~~~Gp~~~p~~i~s~~~~~~~fG~~~~~~~~~~~~~~~f~---ng 74 (659) |-+ +.=--+|+-.+.-....+.. .-.+-+++....=|+++...-+|..|-...|| ..+.++.+++.+|- |. T Consensus 1 m~~~~ip~s~iV~V~~~v~~~~~~~~~~~~lllt~~~~~~~~r~~~y~s~~~V~~~FG---~~S~ey~aA~~yFs~~~~q 77 (501) T protein:vir:36 1 MPTTTIPIDQIVQMLPGVIGAGGAPGRLTGLVLTQDTSVQPGQLADFFQETDVENWFG---ALSNEAKIADAYFPGIVNG 77 (501) T ss_pred CCcCCcccceEEEEeeeeccCCCcceeeeeEEEeccCCCCCcceeeecCHHHHHHhcC---CChHHHHHHHHHhhcccCC Confidence 887 43455555444221222222 22333444444457788888889999999999 56667777888875 32 Q ss_pred C---CeEEEEeccCCcccccccccccccccccccccccccccceeeeeeccccccccceeeeeeccCcceeeeecccccc Q lcl|NC_014792. 75 G---NDLRTVRVVNRDHAKNASPVAGNIESTIATAGSNYAVGDVIQVKHNQTVVETSGRITKVDVDGKILAVFIPSDKII 151 (659) Q Consensus 75 G---~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~ 151 (659) . ++|||-|-...... +...+..+... +.+.-...-..+.++. +|.....-++.+.. T Consensus 78 ~~~P~~l~igR~~~~a~~--~~l~g~~l~~~--~~a~~~~~sg~l~vti----------------~g~~~~~~i~lS~~- 136 (501) T protein:vir:36 78 GQLPYDLKFARYVAADAP--ASVYGIPLTGV--TLAQLQGYSGTLTVTT----------------AAQHVSANISLAAA- 136 (501) T ss_pred CccccEEEEEeecCcCcc--eeEeccchhhh--hhhhccceeEEEEEEe----------------cceeeeeecccccc- Confidence 2 46999997643211 11111111110 0000000000111111 11100000000000 Q ss_pred ccccccceeeeeccceeeEEEeecCCcccccccc-ceeccc-c-ceeeecccccccccccceeecccccccceeeecccc Q lcl|NC_014792. 152 AFAKSVNQYPDLGPAWTAEILTTSSGVSGTITLG-KIVTDS-G-ILLTEAENSEEAITSLEFQASLQKYAMPGVVALYPG 228 (659) Q Consensus 152 ~~~~~~~~~~~~~~~~~~~v~~~~~g~~~~~~~~-~~v~~~-~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g 228 (659) .+.......... ..+.. ..+... . ..+..... .. | T Consensus 137 ---ts~~~vA~~i~~--------------al~~~~~tv~~d~~~~~f~i~s~------------t~-------------G 174 (501) T protein:vir:36 137 ---TSFANAATLIEA--------------AFTSPDFVVAYDALRNRFTVVTN------------AT-------------G 174 (501) T ss_pred ---cCHHHHHHHHhh--------------hhcCcceEEEEcCcceeEEEEec------------cC-------------C Confidence 000000000000 00000 001000 0 00000000 00 0 Q ss_pred ccceeEEEEEeecccccccceeeeeeeccccccccceeeeeeeccccccceeeeeccCCceeeeeeeeccccccccccch Q lcl|NC_014792. 229 EIGSTLEVEIVSKAAYDVGASKMLDIYPNGGSRASVARAVFNYGPQTDDQYAIIVRRDGAIVENVVLSTKEGDKDVYGNN 308 (659) Q Consensus 229 ~~g~~i~V~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~et~~~~~~~~~~~~~~~~ 308 (659) ....+...+. ....+. . ....... ...+...+...|+ T Consensus 175 ---~~~~i~~~t~-~~~ia~--~-----l~Lt~~~----------------~a~v~~~g~~~et---------------- 211 (501) T protein:vir:36 175 ---TAAAISAVTG-TNNFAD--E-----IGLSAAA----------------GATLQAAGVAADT---------------- 211 (501) T ss_pred ---cceeeEeeec-ccchhh--h-----hcccccC----------------cceEEeccccccc---------------- Confidence 0000000000 000000 0 0000000 0000000000000 Q ss_pred hhhhhhhhcccccceEEeecccCCccceeEEeecccccccccchhhhhhhHhhhhhcccccceEEEeccccccchhhhHH Q lcl|NC_014792. 309 IYLDDYFAKGTSNYIYATSLNWPKGFAGIINLMGGISANDQVTAGDLMQGWDLFADREALHINLLIAGAVAGEGDATAST 388 (659) Q Consensus 309 ~~~~~~~~~~~s~~v~~~~~~~~~~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~ 388 (659) +..++..+......-..+..+... ... T Consensus 212 ----------------------------------------------~~~al~a~~~~s~~Wy~f~~a~~~-------~~~ 238 (501) T protein:vir:36 212 ----------------------------------------------PASAMNRAVGLSRNWATFTTAWTA-------VIA 238 (501) T ss_pred ----------------------------------------------HHHHHHHHHhccCceEEEEEecCC-------ChH Confidence 001111111111111111111111 011 Q ss_pred HHHHHHHHHHhhCCEEEEEecCccccccccccCCHHHHHHHhhccccccccccccccceEEEEcCceeEecccCCcceee Q lcl|NC_014792. 389 VQKHVVSIADERQDCLAFISPPKGLLVNVPLTRAVDNLIDWRTGGGSFDTDNMNISTTYAAIDGNYKYQYDKYNDVNRWV 468 (659) Q Consensus 389 v~~~l~~~~~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~ 468 (659) -..++..+++....+|.+.-.- ............++....... ...+....| ++ . T Consensus 239 ~~la~A~wiea~~~~f~~~~~~--~~~~~~~~~~~~~i~~~l~~~----------~y~~t~~~y------~~-------~ 293 (501) T protein:vir:36 239 DRLAFASWNSGQAYKYMYVAPD--LEAASIVSNNAASFGAQVFAA----------PYQGTLPLY------GD-------Q 293 (501) T ss_pred HHHHHHHHHhhcCceEEEEEec--CchhhhhccchhhHHHHHHhc----------CCCcEEEEc------CC-------C Confidence 2235666666665555433110 000111111112222222111 122333322 11 2 Q ss_pred cHHHHHHHHHHHhhhcC--CceECcCCcchhheeccccceeecChhHHHhhhhCCceEEEEEeC-CCeEEEEcccccCCC Q lcl|NC_014792. 469 PLAADMAGLCARTDDVS--QPWMSPPGYNRGQILNVLKLAIEPRQTQRDRMYQEAINPVVGFAG-GDGFVLYGDKTATKV 545 (659) Q Consensus 469 p~s~~~Ag~~a~~d~~~--g~~~span~~~~~i~g~~~~~~~~~~~e~~~Ln~~gin~i~~~~~-~~G~~~wG~rT~~~~ 545 (659) .+.+++.|..+.+|-++ | -..-.+|.+. .|+ ....++..|.+.|..+|+|++..|-+ ++.+.+|-.-+++++ T Consensus 294 ~~~aa~~g~~as~nf~~~~g-~~T~~fkq~~--~Gi--~a~~l~~t~a~al~~~~~N~y~~~~~~~~~~~~~~~G~~sG~ 368 (501) T protein:vir:36 294 ATAGAVMGYAASINFQLRNG-RTVLAFRQFN--AGV--PATVHDLPTANALRSNNYTYIGAYANAANNYTIAYDGKLSGK 368 (501) T ss_pred CHHHHHHHHHHhcCcccCcc-eeeeeccccC--CCc--CcCcCCHHHHHHHHhcCCcEEEEEecccceeeEEEcCeeecc Confidence 35567788888887443 2 0011122210 111 12457889999999999999877753 245777766566663 Q ss_pred ccccceeehhhHHHHHHHHHHHHHHHHhcC----CCCHHHHHHHHHHHHHHHHHHHhcccee------------------ Q lcl|NC_014792. 546 PSPMDHINVRRLTNMLKKNIGDASKYKLFE----LNDNFTRASFRMETSQYLDGIRALGGIY------------------ 603 (659) Q Consensus 546 ~~~~~~i~vrR~~~~i~~~i~~~~~~~v~e----pn~~~l~~~i~~~i~~~l~~l~~~gal~------------------ 603 (659) |.+|.+.+-.+|++..|+..+....-. |-|..=...|+..++.-|++-+++|.|. T Consensus 369 ---~~wiD~~~g~dWL~~~iq~~l~~ll~~~~KIPytd~G~~~l~a~i~~~l~~av~nG~I~~Gv~~~~~q~~~I~~~~g 445 (501) T protein:vir:36 369 ---FLWVDTYLDQIYLNAELQRAEFEAMLAYNSLPYNEDGYTGLYRAGVDVIDAAVTSGIIRAGVTLTNSQLQQIDKAAR 445 (501) T ss_pred ---chhhhHHHhHHHHHHHHHHHHHHHHhcCCCCccChhhHHHHHHHHHHHHHHHHhCceeecCCCCCcccceeeccccc Confidence 567999999999999999998876543 5677778889999999999999999883 Q ss_pred -----------eeEEEEccCCCCHHHhh-CCEEEEEEEEEecCCceEEEEEEEEeecCeeEEEec Q lcl|NC_014792. 604 -----------EGRVVCDTTNNTPSVID-RNEFVASIYYKPARSINYIVLNFVATSTGADFDELI 656 (659) Q Consensus 604 -----------g~~v~~d~~~nt~~~i~-~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~ 656 (659) ||.+.++....++++.. +....+.+.++--..+++|++-. .||+ T Consensus 446 ~~~~~~~v~~~Gyy~~~~~~~~~~~~R~~R~~p~~~~~y~~~gaIh~v~i~s---------~~v~ 501 (501) T protein:vir:36 446 VAGAGQLVQTRGWYFLIGDPANPGQARQNRTTPACTLWYSDGGSIQSLTIGS---------NAVI 501 (501) T ss_pred ccccccceeccceEEeeCcccCChhhhhhcccCcEEEEEEeCCceeEEEeee---------eeeC Confidence 36666664434444433 34455666666667777766532 2334 No 67 >protein:vir:78611 Length: 501 # NCBI annotation: BcepNY3gp03 # Family: family:all:396 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294840;genbank:gi:149882903;genbank:GeneID:5291082 Probab=69.05 E-value=0.23 Score=24.10 Aligned_cols=453 Identities=10% Similarity=0.022 Sum_probs=189.9 Q ss_pred Cce--ecCceEEEEecCCCcccccC-CcceEEEeecccCCCCccEEeCCHHHHHHHcCCcCCCchhHHHHHHHHH---cC Q lcl|NC_014792. 1 MAL--LSPGIELKETTVQSTVVRNA-TGRAALVGKFQWGPAFQVTQITNEVELVDLFGGPNNITADYFMSGMNFL---QY 74 (659) Q Consensus 1 ~~~--~~PGVyveE~~~~~~~~~~~-ts~~afvG~~~~Gp~~~p~~i~s~~~~~~~fG~~~~~~~~~~~~~~~f~---ng 74 (659) |-+ +.=--+|+-.+.-....... .-.+-+++....=|+++....+|..|-...|| ..+.++.+++.+|- |. T Consensus 1 m~~~~ip~s~iV~V~~~v~~~~~~~~~~~~lll~~~~~~~~~r~~~y~s~~~V~~~FG---~~S~ey~aA~~yFs~~~~q 77 (501) T protein:vir:78 1 MPTTTIPIDQIVQMLPGVIGAGGAPGRLTGLVLTQDTSIQPGQLADFFQKTDVENWFG---GLSNEAVIADAYFPGIVNG 77 (501) T ss_pred CCcCccccceEEEEeeecccCCCcceeeeeEEEecCCCCCccceeeecCHHHHHHhcC---CChHHHHHHHHHhhcCCCC Confidence 887 33455555444222222222 22344555555557888888889999999999 55667777888875 22 Q ss_pred C---CeEEEEeccCCcccccccccccccccccccccccccccceeeeeeccccccccceeeeeeccCcceeeeecccccc Q lcl|NC_014792. 75 G---NDLRTVRVVNRDHAKNASPVAGNIESTIATAGSNYAVGDVIQVKHNQTVVETSGRITKVDVDGKILAVFIPSDKII 151 (659) Q Consensus 75 G---~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~ 151 (659) . +++||-|-...... +...+..+.......-.. .--.++++. +|......++.+... T Consensus 78 ~~~P~~l~igR~~~~a~~--~~l~g~~l~~~~la~~~~--~~G~l~iti----------------~g~~~~~~i~~S~~t 137 (501) T protein:vir:78 78 GQLPYDLKFARYVAADAP--ASVYGIPLTGVTLTQLQG--YSGTLTVTT----------------AAQHVSSNISLAAAT 137 (501) T ss_pred CcccceEEEEeecccCcc--eeEeccceeccchhhhce--eeeEEEEEe----------------ccceeeecccccccc Confidence 2 46899997653211 111111111100000000 000111111 111000000000000 Q ss_pred ccccccceeeeeccceeeEEEeecCCccccccccceecccc-ceeeecccccccccccceeecccccccceeeecccccc Q lcl|NC_014792. 152 AFAKSVNQYPDLGPAWTAEILTTSSGVSGTITLGKIVTDSG-ILLTEAENSEEAITSLEFQASLQKYAMPGVVALYPGEI 230 (659) Q Consensus 152 ~~~~~~~~~~~~~~~~~~~v~~~~~g~~~~~~~~~~v~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~ 230 (659) +............ . .+.+ .+..++. ..+.... . ..+. T Consensus 138 ----s~~~vA~~i~~al-------~----a~~~-tv~~ds~~~~f~its--------~----t~G~-------------- 175 (501) T protein:vir:78 138 ----SFANAATLIEAAF-------T----SPDF-VVSYDALRNRFVVNT--------N----ATGT-------------- 175 (501) T ss_pred ----CHHHHHHHHHhhh-------c----Ccce-EEEEccccceEEEEe--------e----ecCC-------------- Confidence 0000000000000 0 0000 0000000 0000000 0 0000 Q ss_pred ceeEEEEEeecccccccceeeeeeeccccccccceeeeeeeccccccceeeeeccCCceeeeeeeeccccccccccchhh Q lcl|NC_014792. 231 GSTLEVEIVSKAAYDVGASKMLDIYPNGGSRASVARAVFNYGPQTDDQYAIIVRRDGAIVENVVLSTKEGDKDVYGNNIY 310 (659) Q Consensus 231 g~~i~V~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~et~~~~~~~~~~~~~~~~~~ 310 (659) ...+...+. ....+. .+ ..+. .. ...+...|...|+ T Consensus 176 --~~~i~~~t~-~~~~a~--~l-----~Lt~--~~--------------~a~v~~~g~~aet------------------ 211 (501) T protein:vir:78 176 --AAAISAVTG-TNNLAD--EL-----GLSA--AA--------------GASLQAAGVAADT------------------ 211 (501) T ss_pred --ceeEEEEec-ccchhh--hh-----cccc--cC--------------ceeeEeccccccC------------------ Confidence 000000000 000000 00 0000 00 0000000000000 Q ss_pred hhhhhhcccccceEEeecccCCccceeEEeecccccccccchhhhhhhHhhhhhcccccceEEEeccccccchhhhHHHH Q lcl|NC_014792. 311 LDDYFAKGTSNYIYATSLNWPKGFAGIINLMGGISANDQVTAGDLMQGWDLFADREALHINLLIAGAVAGEGDATASTVQ 390 (659) Q Consensus 311 ~~~~~~~~~s~~v~~~~~~~~~~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~v~ 390 (659) +..++..+......--.+..+... ..+-+ T Consensus 212 --------------------------------------------~~~a~~a~~~~~~~Wy~f~~a~~~-------~~~~~ 240 (501) T protein:vir:78 212 --------------------------------------------PASAMNRAVGLSRNWATFTTAWTA-------VIADR 240 (501) T ss_pred --------------------------------------------HHHHHHHHHhccCceEEEEEecCC-------CHHHH Confidence 001111111111101111111110 11123 Q ss_pred HHHHHHHHhhCCEEEEE--ecCccccccccccCCHHHHHHHhhccccccccccccccceEEEEcCceeEecccCCcceee Q lcl|NC_014792. 391 KHVVSIADERQDCLAFI--SPPKGLLVNVPLTRAVDNLIDWRTGGGSFDTDNMNISTTYAAIDGNYKYQYDKYNDVNRWV 468 (659) Q Consensus 391 ~~l~~~~~~~~~~~ai~--d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~ 468 (659) .++.++++....+|.+. |..... .......++....... +..+...+|+. - T Consensus 241 lalA~wiea~~~~f~~~~~~~~~~~----~~~~~~~~i~~~l~a~----------~y~~t~~~y~~-------------~ 293 (501) T protein:vir:78 241 LALASWNSGQAYKYMYVAPDLEPAS----IVTNNSASFGAQVFAA----------PYQGTLPLYGD-------------Q 293 (501) T ss_pred HHHHHHHHhcCceEEEEEecCCcce----eecccchhHHHHHhhc----------CCCceEEEcCC-------------c Confidence 35666666655554332 221111 0111111221111110 12233333321 1 Q ss_pred cHHHHHHHHHHHhhhcCCc-eECcCCcchhheeccccceeecChhHHHhhhhCCceEEEEEeC-CCeEEEEcccccCCCc Q lcl|NC_014792. 469 PLAADMAGLCARTDDVSQP-WMSPPGYNRGQILNVLKLAIEPRQTQRDRMYQEAINPVVGFAG-GDGFVLYGDKTATKVP 546 (659) Q Consensus 469 p~s~~~Ag~~a~~d~~~g~-~~span~~~~~i~g~~~~~~~~~~~e~~~Ln~~gin~i~~~~~-~~G~~~wG~rT~~~~~ 546 (659) .+.+.+.|..+.+|-++-. -.+-..|.+. .|+ ....++..|.+.|..+|+|++..|-+ ++.+.+|-.-++++ T Consensus 294 ~~~aa~~g~~as~nf~~~~g~~T~~fkq~~--~Gv--~a~~l~~t~a~al~~~~~N~y~~~~~~~~~~~~~~~G~~sG-- 367 (501) T protein:vir:78 294 ATAGAVMGYAASINFQLRNGRTVLAFRQFN--AGV--PATAHDLGTANALRSNNYTYIGAYANAANNYTIAYDGKLSG-- 367 (501) T ss_pred chHHHHHHHHHhcCcccCcceeeeeccccC--CCc--CcccCCHHHHHHHHhcCCeEEEEEecccceeeEEEcCeeec-- Confidence 2456777888888754311 0011122110 111 12457889999999999999988753 24577885555665 Q ss_pred cccceeehhhHHHHHHHHHHHHHHHHhcC----CCCHHHHHHHHHHHHHHHHHHHhcccee------------------- Q lcl|NC_014792. 547 SPMDHINVRRLTNMLKKNIGDASKYKLFE----LNDNFTRASFRMETSQYLDGIRALGGIY------------------- 603 (659) Q Consensus 547 ~~~~~i~vrR~~~~i~~~i~~~~~~~v~e----pn~~~l~~~i~~~i~~~l~~l~~~gal~------------------- 603 (659) +|.+|.+-+-.+|+++.++..+....-. |-|..=...|+..++.-|++-+++|.|. T Consensus 368 -~~~wiD~~~~~~Wl~~~iq~~l~~ll~~~~kIPyt~~G~~~l~a~v~~~l~~av~nG~I~~Gv~l~~~q~~~I~~~~g~ 446 (501) T protein:vir:78 368 -KFLWVDTYLDQIYLNAELQRAEFEAMLAYNSLPYNEDGYTALYRAGVDVIDAAVTSGIIRAGVTLTNSQLQQIDAAAGV 446 (501) T ss_pred -cceeehhhhhHHHHHHHHHHHHHHHHHhCCCcccCHHHHHHHHHHHHHHHHHHHhCceeecCCCCCCccceeeccccCc Confidence 3566888888888888888888765432 6677778889999999999999999883 Q ss_pred ----------eeEEEEccCCCCHHH-hhCCEEEEEEEEEecCCceEEEEEEEEeecCeeEEEec Q lcl|NC_014792. 604 ----------EGRVVCDTTNNTPSV-IDRNEFVASIYYKPARSINYIVLNFVATSTGADFDELI 656 (659) Q Consensus 604 ----------g~~v~~d~~~nt~~~-i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~e~~ 656 (659) ||.+.++...+++++ ..+....+.+.++--..+++|++-. .||+ T Consensus 447 ~~~~~~~~~~Gyy~~~~~~~~~~~~R~~R~~p~~~~~y~~~gaIh~v~i~s---------~~v~ 501 (501) T protein:vir:78 447 AGAGQLVQMRGWYFLIGDPANPGQARQNRTTPTCTLWYSDGGSIQELTIGS---------NAVI 501 (501) T ss_pred cccccceeccceEEeeccccCChhhhhhcccCcEEEEEEeCCceeEEEeee---------eecC Confidence 366666643333333 3334455666666666666666532 2344 No 68 >protein:vir:107720 Length: 515 # NCBI annotation: gp17 # Family: family:all:396 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024865;genbank:gi:48697507;genbank:GeneID:2948336 Probab=50.28 E-value=0.61 Score=21.74 Aligned_cols=469 Identities=11% Similarity=0.025 Sum_probs=188.0 Q ss_pred CceecCceEEEEecCCCcccccCC-c-ceEEEeecccCCCCccEEeCCHHHHHHHcCCcCCCchhHHHHHHHHH---cCC Q lcl|NC_014792. 1 MALLSPGIELKETTVQSTVVRNAT-G-RAALVGKFQWGPAFQVTQITNEVELVDLFGGPNNITADYFMSGMNFL---QYG 75 (659) Q Consensus 1 ~~~~~PGVyveE~~~~~~~~~~~t-s-~~afvG~~~~Gp~~~p~~i~s~~~~~~~fG~~~~~~~~~~~~~~~f~---ngG 75 (659) |- ++.-|+|+..+......+.+. . ..-+.+.-..=|+++...-+|..+-...|| ..+.++.+.+.+|- |.. T Consensus 1 m~-I~~~~~V~i~~~v~aa~~~~~~~f~~li~t~~~~~p~~r~~~y~s~~~V~~~FG---~~S~ey~aA~~yFsg~~~q~ 76 (515) T protein:vir:10 1 MP-ISFDKYVAITSGVAAQQQIAARSFAIRVYTPNPMVSVDRLITATSAADVGAYFG---TASEEYKRAVKNFGFISKKT 76 (515) T ss_pred CC-CCceeEEEeecccccCCccccccceeeeeecccCCCccceeeecCHHHHHHhcC---CChHHHHHHHHHhhhccCCc Confidence 76 667788865554333333332 2 233345555567788888899999999999 55666667777764 222 Q ss_pred ---CeEEEEeccCCcccccccccccccccccccccccccccceeeeeeccccccccceeeeeeccCcceeeeeccccccc Q lcl|NC_014792. 76 ---NDLRTVRVVNRDHAKNASPVAGNIESTIATAGSNYAVGDVIQVKHNQTVVETSGRITKVDVDGKILAVFIPSDKIIA 152 (659) Q Consensus 76 ---~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~ 152 (659) ++|||-|=..... .+........ + ........+. +|. .. T Consensus 77 p~P~~L~igR~~~~a~--~~~l~g~~~~------~---------------~~l~~~~~is----~G~-lt---------- 118 (515) T protein:vir:10 77 RRPTSIQFARWQREAG--PVAIYGGAKK------A---------------AALATLQAVT----AGA-IS---------- 118 (515) T ss_pred ccccEEEEEeccCccc--ceEEEeccch------h---------------hhHHhhhccc----cee-EE---------- Confidence 5699988543211 0000000000 0 0000000000 000 00 Q ss_pred cccccceeeeeccceeeEEEeecCCccccccccceeccccceeeecccccccccccceeecccccccceeeeccccccce Q lcl|NC_014792. 153 FAKSVNQYPDLGPAWTAEILTTSSGVSGTITLGKIVTDSGILLTEAENSEEAITSLEFQASLQKYAMPGVVALYPGEIGS 232 (659) Q Consensus 153 ~~~~~~~~~~~~~~~~~~v~~~~~g~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~g~ 232 (659) ++ +.+.. ..+...-..+..... .+.+..+....-.... .+ T Consensus 119 ----it----idG~~--~~t~s~i~~S~ats~--------------------------~~vAs~i~tal~~~~~----~~ 158 (515) T protein:vir:10 119 ----FL----FGGAT--TVTVSGISFSAATSL--------------------------ADVASELQTALRANAD----AN 158 (515) T ss_pred ----EE----EcceE--EEEeeccccccccCH--------------------------HHHHHHHHhhhccccc----cc Confidence 00 00000 000000000000000 0000000000000000 00 Q ss_pred eEEEEEeecccccccceeeeeeeccccccccceeeeeeeccccccceeeeeccCCce-eeeeeeeccccccccccchhhh Q lcl|NC_014792. 233 TLEVEIVSKAAYDVGASKMLDIYPNGGSRASVARAVFNYGPQTDDQYAIIVRRDGAI-VENVVLSTKEGDKDVYGNNIYL 311 (659) Q Consensus 233 ~i~V~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~-~et~~~~~~~~~~~~~~~~~~~ 311 (659) ...+ .+.+... ...|.+.....|.. ..++......... .++ T Consensus 159 ~~~~-------------------------------tv~~d~~-~~~F~v~s~~tG~~~~is~~~~t~~~~~------t~~ 200 (515) T protein:vir:10 159 LATC-------------------------------TVSYDPV-GARFNFAGSPSDDTVQESISIVPQSNPA------IDV 200 (515) T ss_pred ccee-------------------------------EEEEecC-CCeEEEEEeecCCceeEEEEEecCCCch------hhH Confidence 0000 0111000 01122222111111 1111111110000 000 Q ss_pred hhhhhcccccceEEeecccCCccceeEEeecccccccccchhhhhhhHhhhhhcccccceEEEeccccccchhhhHHHHH Q lcl|NC_014792. 312 DDYFAKGTSNYIYATSLNWPKGFAGIINLMGGISANDQVTAGDLMQGWDLFADREALHINLLIAGAVAGEGDATASTVQK 391 (659) Q Consensus 312 ~~~~~~~~s~~v~~~~~~~~~~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~v~~ 391 (659) ...+ .+.. ..+ ...+.|... ..+..++..+......-..+....... ......... T Consensus 201 a~~l-------------glt~-~~~-av~~~g~aa------et~~~a~~a~~~~s~nWy~f~~a~~~~---~~~~~a~~~ 256 (515) T protein:vir:10 201 AQLL-------------GWNS-AQG-ASYIAASPV------VSPVDTLIASVAGNNNFGSILFTKNGG---TGITLSDAE 256 (515) T ss_pred HHHh-------------cccc-ccc-eEEeccccc------ccHHHHHHHHHhccCCeEEEEEeecCc---cccchhHHH Confidence 0000 0000 001 122233221 112233333332222222333332211 111123334 Q ss_pred HHHHHHHhhCCEEEEEecCccccccccccCCHHHHHHHhhccccccccccccccceEEEEcCceeEecccCCcceeecHH Q lcl|NC_014792. 392 HVVSIADERQDCLAFISPPKGLLVNVPLTRAVDNLIDWRTGGGSFDTDNMNISTTYAAIDGNYKYQYDKYNDVNRWVPLA 471 (659) Q Consensus 392 ~l~~~~~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s 471 (659) ++..+.+.....+.....-.. ....+......... ...+...+++- +++ . +. T Consensus 257 a~a~~~e~~~~~~~~~~~~~~-----~~~~~~~a~~~~~~------------~~~~~~~~~~~---~~~-------~-~~ 308 (515) T protein:vir:10 257 AIALQNQSYNVAYKFQVGVDD-----TTYSSWQAALAAIG------------GVNMIYSPVAL---AAE-------Y-HD 308 (515) T ss_pred HHHHHHhhcCceEEEEeccCc-----cceechhhhhhhhh------------hcCceEEEEec---cCc-------c-hH Confidence 455556655444433221110 00111111111000 00111111111 000 1 23 Q ss_pred HHHHHHHHHhhhcCC-ceECcCCcchhheeccccceeecChhHHHhhhhCCceEEEEEeC-CCeEEEEcccccCCCcccc Q lcl|NC_014792. 472 ADMAGLCARTDDVSQ-PWMSPPGYNRGQILNVLKLAIEPRQTQRDRMYQEAINPVVGFAG-GDGFVLYGDKTATKVPSPM 549 (659) Q Consensus 472 ~~~Ag~~a~~d~~~g-~~~span~~~~~i~g~~~~~~~~~~~e~~~Ln~~gin~i~~~~~-~~G~~~wG~rT~~~~~~~~ 549 (659) ....|..+.+|-++- =...-..|. ..|+.. ..+++.|.+.|..+|+|....|.+ ++.+.+|-.-++++-..+| T Consensus 309 a~~~g~~asvnf~~~ng~iT~kfKq---~~Gita--~~lt~t~a~al~~~~~N~Y~~~~~~~~~~~~~~~G~~~gG~~~~ 383 (515) T protein:vir:10 309 MQDGIIEAATDFTQQGGATGYMYVQ---FNNQTP--AVNDDTLSGILDDLNINYYGQTQVNGTNLSFYQDGVMMGGPTDP 383 (515) T ss_pred HHHHHHHHhcCCCccchhheecccc---CCCCcc--ccCCHHHHHHHHhcCCeEEEEEeccCceEEEEeCCeeeCCccch Confidence 456677777763321 111122232 233322 358899999999999999988854 2468888665555544467 Q ss_pred ceeehhhHHHHHHHHHHHHHHHHhcC----CCCHHHHHHHHHHHH-HHHHHHHhccceee-------------------- Q lcl|NC_014792. 550 DHINVRRLTNMLKKNIGDASKYKLFE----LNDNFTRASFRMETS-QYLDGIRALGGIYE-------------------- 604 (659) Q Consensus 550 ~~i~vrR~~~~i~~~i~~~~~~~v~e----pn~~~l~~~i~~~i~-~~l~~l~~~gal~g-------------------- 604 (659) ++|.+.|-.+|++..|+..+....-. |-+..=...|+..+. +-|++-+++|.|.- T Consensus 384 ~WiD~~~g~~WL~~~iq~~l~~L~~s~~KIPytd~G~a~i~a~v~q~vl~~av~nG~I~~Gv~ls~~Q~~~i~~~~g~d~ 463 (515) T protein:vir:10 384 RDSNVYANEQWLKSYAGASFMSLQLAQGKIPANIEGRGLLLGKMTKDIIPAAKLNGTFSIGKTLTVDQQLFVTELTGDDT 463 (515) T ss_pred hHHHHHhhHHHHHHHHHHHHHHHHhcCCCCccChhhHHHHHHHHHHHHHHHHHhCCeeecCcccchhHHHHHHhhhcCcc Confidence 88999999999999999999874322 345555566776664 67888888888753 Q ss_pred ---------eEEEEc-cCCCCHHHhhCCEEEEEEEEEecCCceEEEEEEEEe Q lcl|NC_014792. 605 ---------GRVVCD-TTNNTPSVIDRNEFVASIYYKPARSINYIVLNFVAT 646 (659) Q Consensus 605 ---------~~v~~d-~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~ 646 (659) |.+... .+..++.+...+.+.+..-+.--..+++|+....-. T Consensus 464 ~~~~~~~~Gyy~~~~~~~~~~~~~r~~~~~~~~~~y~~g~~i~~i~~~~~~v 515 (515) T protein:vir:10 464 AWQKVQNLGYWYDVQISSFVDTGGTTKYQAVYSLVYSKDDLIRKVVGTHTLI 515 (515) T ss_pred cccchhhcceeEecCcCCCCCcccccccCceeEEEEEcCceEEEEEeeeecC Confidence 333322 222233333334443333333344455544432211 No 69 >protein:vir:94073 Length: 494 # NCBI annotation: hypothetical protein # Family: family:all:396 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453619;genbank:gi:84662655;genbank:GeneID:5142583 Probab=37.07 E-value=1.1 Score=20.27 Aligned_cols=447 Identities=11% Similarity=0.007 Sum_probs=181.2 Q ss_pred CceecCceEEEEecCCCccccc--CCcceEEEeecccCCCCccEEeCCHHHHHHHcCCcCCCchhHHHHHHHHH---cC- Q lcl|NC_014792. 1 MALLSPGIELKETTVQSTVVRN--ATGRAALVGKFQWGPAFQVTQITNEVELVDLFGGPNNITADYFMSGMNFL---QY- 74 (659) Q Consensus 1 ~~~~~PGVyveE~~~~~~~~~~--~ts~~afvG~~~~Gp~~~p~~i~s~~~~~~~fG~~~~~~~~~~~~~~~f~---ng- 74 (659) |-.|.=--+|+-.+ +..+.++ ..-.+-++.....=|+++....+|..|-...|| ..+.++.+++.+|- |. T Consensus 1 m~~ip~s~iV~V~~-~v~~~~~~~~~f~~~l~~~~~~~~~~r~~~y~s~~~V~~~FG---~~S~ey~aA~~yFs~~~~q~ 76 (494) T protein:vir:94 1 MPNIPISQIVSINP-QVVSAGGTQGTLDGLLLTQATGFPVTQPQVYFSAADVGTAFG---LTSDEYNAALVYFAGILGGG 76 (494) T ss_pred CCCCCcccEEEeee-eccccCCcccccceeEeecCccCCccceeeecCHHHHHHhcC---CChHHHHHHHHHhhhccCCC Confidence 66555555665333 2222222 122333444444557777777889999999999 55667777777775 22 Q ss_pred --CCeEEEEeccCCcccccccccccccccccccccccccccceeeeeeccccccccceeeeeeccCcceeeeeccccccc Q lcl|NC_014792. 75 --GNDLRTVRVVNRDHAKNASPVAGNIESTIATAGSNYAVGDVIQVKHNQTVVETSGRITKVDVDGKILAVFIPSDKIIA 152 (659) Q Consensus 75 --G~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~ 152 (659) =++|||-|-.....+ +...+..+.. ....-... --.+.+ ..+|.....-++.+.. T Consensus 77 p~P~~l~igR~~~~a~~--~~l~g~~~~~-tl~~~~~~--~g~l~i----------------ti~g~~~~~~i~lS~~-- 133 (494) T protein:vir:94 77 QQPASLTIGRYASAATS--AAVFGAPLTL-SLAQLQTL--SGTLIV----------------TTDTQRTSAAINLSGA-- 133 (494) T ss_pred ccccEEEEEeecCcccc--ceeeccchhh-hHHhhhhc--ceEEEE----------------EEcceEEEeeeccccc-- Confidence 257999997543211 1111111100 00000000 001111 1111100000100000 Q ss_pred cccccceeeeeccceeeEEEeecCCccccccccceeccccceeeecccccccccccceeecccccccceeeeccccccce Q lcl|NC_014792. 153 FAKSVNQYPDLGPAWTAEILTTSSGVSGTITLGKIVTDSGILLTEAENSEEAITSLEFQASLQKYAMPGVVALYPGEIGS 232 (659) Q Consensus 153 ~~~~~~~~~~~~~~~~~~v~~~~~g~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~g~ 232 (659) .+....+........ ... ..+..+... ... .+... . .| . T Consensus 134 --ts~~~vA~~i~~ai~-----~a~-------~~v~~d~~~------~~f-~v~s~----t-------------tG---~ 172 (494) T protein:vir:94 134 --TSFANAASLMTSGFT-----TPN-------FAITYDAQR------RRF-VLSTT----A-------------TG---T 172 (494) T ss_pred --CChhhHHHHHhhhhc-----ccc-------ceEEEcccC------cEE-EEEEc----c-------------CC---c Confidence 000000000000000 000 000001000 000 00000 0 00 0 Q ss_pred eEEEEEeecccccccceeeeeeeccccccccceeeeeeeccccccceeeeeccCCceeeeeeeeccccccccccchhhhh Q lcl|NC_014792. 233 TLEVEIVSKAAYDVGASKMLDIYPNGGSRASVARAVFNYGPQTDDQYAIIVRRDGAIVENVVLSTKEGDKDVYGNNIYLD 312 (659) Q Consensus 233 ~i~V~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~g~~~et~~~~~~~~~~~~~~~~~~~~ 312 (659) ...+...+ ...+ ..+ .+ + T Consensus 173 ~s~is~~t---~~~a---------------~~l--------------~l--------------t---------------- 190 (494) T protein:vir:94 173 TASVSAVT---GTLA---------------DGV--------------GL--------------S---------------- 190 (494) T ss_pred eeEEEEec---cchh---------------hhh--------------hh--------------h---------------- Confidence 00010000 0000 000 00 0 Q ss_pred hhhhcccccceEEeecccCCccceeEEeecccccccccchhhhhhhHhhhhhcccccceEEEeccccccchhhhHHHHHH Q lcl|NC_014792. 313 DYFAKGTSNYIYATSLNWPKGFAGIINLMGGISANDQVTAGDLMQGWDLFADREALHINLLIAGAVAGEGDATASTVQKH 392 (659) Q Consensus 313 ~~~~~~~s~~v~~~~~~~~~~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~v~~~ 392 (659) ......+ ...|.+ ...+..++..+......-..+.+..+. . ..-+.+ T Consensus 191 ----~~~~a~v----------------~~~g~~------aet~~~a~~a~~~~~~~Wy~f~~~~~~---~----~~~ila 237 (494) T protein:vir:94 191 ----TASGAYV----------------EGSGLA------ADTAASALDRLAASSSTWAIFTTAWAA---S----LSDRTA 237 (494) T ss_pred ----ccccceE----------------eecCcc------cccHHHHHHHHHhccCceEEEEEecCC---C----HHHHHH Confidence 0000000 000000 000111111111111111111221110 1 112234 Q ss_pred HHHHHHhhCCEEEEE--ecCccccccccccCCHHHHHHHhhccccccccccccccceEEEEcCceeEecccCCcceeecH Q lcl|NC_014792. 393 VVSIADERQDCLAFI--SPPKGLLVNVPLTRAVDNLIDWRTGGGSFDTDNMNISTTYAAIDGNYKYQYDKYNDVNRWVPL 470 (659) Q Consensus 393 l~~~~~~~~~~~ai~--d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~ 470 (659) |.++.+....++.+. +.-.. ........++....... ...+....|+. ..| T Consensus 238 lA~wiea~~~~~~~~~~~~d~~----~~~~~~~~~i~~~l~~~----------~y~~t~~~y~~-------------~~~ 290 (494) T protein:vir:94 238 LAQWTSDQVFRRIYAAWDQDAA----GLSVNNVSSFGNIVKTT----------PFSNTIPVYGL-------------LAN 290 (494) T ss_pred HHHHHhhcCccEEEEEecCCcc----eeecccchhHHHHHHhh----------cCCceEEEcCC-------------CCh Confidence 555555544433332 21111 00111112222222111 12334333321 124 Q ss_pred HHHHHHHHHHhhhcCCceECcCCcchhheec-cccc-eeecChhHHHhhhhCCceEEEEEeC-CCeEEEEcccccCCCcc Q lcl|NC_014792. 471 AADMAGLCARTDDVSQPWMSPPGYNRGQILN-VLKL-AIEPRQTQRDRMYQEAINPVVGFAG-GDGFVLYGDKTATKVPS 547 (659) Q Consensus 471 s~~~Ag~~a~~d~~~g~~~span~~~~~i~g-~~~~-~~~~~~~e~~~Ln~~gin~i~~~~~-~~G~~~wG~rT~~~~~~ 547 (659) .+.+.|..+.+|-+. .+.+..+.. +. ..++ ...++..|.+.|..+|+|+...+-+ +.-+.+|.+.+++++ T Consensus 291 ~aa~~g~~aa~~~~~----~~g~~T~~~-k~q~~gi~~~~l~~t~a~al~~~~~N~y~~~~~~~~~~~~~~gg~~sG~-- 363 (494) T protein:vir:94 291 AMIVLAWGASTNLQI----AEGRTTLAL-RSPVSSAGVRVDNLANANALLSNGYTYLGKYASATNTYTVTYNGAIGGQ-- 363 (494) T ss_pred HHHHHHHHHhccccc----cCcceeEEe-eccCCCCCCccCCHHHHHHHHhcCCeEEEEecccCceEEEecCceeccc-- Confidence 466677777777433 233333221 10 1122 2346788999999999999988753 234678877777754 Q ss_pred ccceeehhhHHHHHHHHHHHHHHHHhc----CCCCHHHHHHHHHHHHHHHHHHHhcccee-------------------- Q lcl|NC_014792. 548 PMDHINVRRLTNMLKKNIGDASKYKLF----ELNDNFTRASFRMETSQYLDGIRALGGIY-------------------- 603 (659) Q Consensus 548 ~~~~i~vrR~~~~i~~~i~~~~~~~v~----epn~~~l~~~i~~~i~~~l~~l~~~gal~-------------------- 603 (659) |.+|-+-+-.+|+++.|+..+....- =|-|..=...|+..++.-|++-+++|.|. T Consensus 364 -~~~id~~~~~~WL~~~iq~~l~~ll~~~~KIPytd~G~~~l~a~i~~~l~~av~nG~I~~Gv~~~~~q~~~i~~~~G~~ 442 (494) T protein:vir:94 364 -FLWADTALGWIALRRNLQQALFETLLAYRSLPYNADGYNALYQGAQDVVSQFVAAGVIRAGVALSASQRAQIDQAAGVP 442 (494) T ss_pred -cceeeeeccHHHHHHHHHHHHHHHHHhCCCcccChhhHHHHHHHHHHHHHHHHhCceeecccccCcchhhhhhhhhcCc Confidence 33343333445777777777765433 36677778899999999999999999984 Q ss_pred --------eeEEEE-c-cCCCCHHHhhCCEEEEEEEEEecCCceEEEEEEEEee Q lcl|NC_014792. 604 --------EGRVVC-D-TTNNTPSVIDRNEFVASIYYKPARSINYIVLNFVATS 647 (659) Q Consensus 604 --------g~~v~~-d-~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~ 647 (659) ||++.. + .+.++..+....++.+.+ .--..+++|++...-.. T Consensus 443 ~~~~~~~kGyy~~~~~~~s~~~ra~R~~~~~~~~y--~~~GAIh~v~i~~~~v~ 494 (494) T protein:vir:94 443 ISGDVVDKGWYLQVIDPITTTVRTDRGSPTVNFWY--CDGGSIQRVVVSATTVI 494 (494) T ss_pred cccceeccceeeeccCCCChhhhhccccCCceEEE--EecCcEEEEEEeeEEeC Confidence 355543 3 334444444333333333 34666777776554333 Done!