Query lcl|NC_010325.1_cdsid_YP_001671929.1 [gene=gp56] [protein=hypothetical protein] [protein_id=YP_001671929.1] [location=complement(34296..35837)] Match_columns 513 No_of_seqs 48 out of 54 Neff 6.4 Searched_HMMs 1612 Date Thu Nov 7 13:22:29 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_56 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_56_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:8837 Length: 513 # 100.0 2E-174 1E-177 973.4 52.6 513 1-513 1-513 (513) 2 protein:vir:2625 Length: 715 # 100.0 9.1E-37 5.7E-40 218.2 35.3 500 1-512 1-715 (715) 3 protein:vir:95475 Length: 771 100.0 1.1E-29 6.7E-33 179.4 39.5 499 1-512 1-771 (771) 4 protein:vir:3529 Length: 477 # 99.9 6.1E-24 3.8E-27 147.8 34.4 433 1-505 7-477 (477) 5 protein:vir:3133 Length: 911 # 99.9 5E-25 3.1E-28 153.8 26.9 490 1-513 152-785 (911) 6 protein:vir:108312 Length: 458 99.9 3.9E-23 2.4E-26 143.4 35.9 428 1-507 1-458 (458) 7 protein:vir:100960 Length: 472 99.9 4.7E-22 2.9E-25 137.5 37.4 438 1-507 1-472 (472) 8 protein:vir:9268 Length: 472 # 99.9 5E-22 3.1E-25 137.4 37.4 434 1-507 1-472 (472) 9 protein:vir:105428 Length: 472 99.9 1.9E-21 1.2E-24 134.2 35.6 433 1-507 1-472 (472) 10 protein:vir:177 Length: 472 # 99.9 2E-21 1.2E-24 134.1 35.4 433 1-507 1-472 (472) 11 protein:vir:105525 Length: 472 99.9 2.7E-21 1.7E-24 133.3 34.0 423 1-505 1-472 (472) 12 protein:vir:2109 Length: 472 # 99.9 2.1E-20 1.3E-23 128.5 37.0 430 1-507 1-472 (472) 13 protein:vir:352 Length: 536 # 99.8 9.9E-20 6.1E-23 124.8 32.4 476 1-512 2-536 (536) 14 protein:vir:107423 Length: 681 99.5 7.5E-12 4.6E-15 81.6 43.5 481 1-510 1-681 (681) 15 protein:vir:98487 Length: 681 99.5 7.5E-12 4.6E-15 81.6 43.5 481 1-510 1-681 (681) 16 protein:vir:107802 Length: 681 99.5 7.5E-12 4.6E-15 81.6 43.5 481 1-510 1-681 (681) 17 protein:vir:102644 Length: 594 99.3 1.3E-10 8E-14 74.8 42.1 484 1-511 1-594 (594) 18 protein:vir:103790 Length: 768 99.0 5.5E-09 3.4E-12 65.9 42.8 484 1-513 1-768 (768) 19 protein:vir:8887 Length: 808 # 99.0 6.7E-09 4.1E-12 65.4 43.2 483 1-513 1-807 (808) 20 protein:vir:10452 Length: 794 98.8 4.8E-08 3E-11 60.7 44.0 483 1-513 1-793 (794) 21 protein:vir:2203 Length: 794 # 98.7 6.6E-08 4.1E-11 59.9 42.6 482 1-513 1-793 (794) 22 protein:vir:93631 Length: 580 98.7 8.2E-08 5.1E-11 59.4 30.9 462 1-513 1-579 (580) 23 protein:vir:80253 Length: 777 98.7 9E-08 5.6E-11 59.2 43.2 483 1-513 1-777 (777) 24 protein:vir:80177 Length: 1027 98.7 3.6E-08 2.3E-11 61.4 21.5 467 1-513 338-1022(1027) 25 protein:vir:94602 Length: 1012 98.6 1.8E-07 1.1E-10 57.6 24.1 479 1-513 352-1008(1012) 26 protein:vir:94583 Length: 792 98.6 1.8E-07 1.1E-10 57.6 44.6 481 1-513 1-791 (792) 27 protein:vir:94713 Length: 785 98.5 3.1E-07 1.9E-10 56.2 44.4 481 1-513 1-784 (785) 28 protein:vir:3366 Length: 801 # 98.5 5.2E-07 3.2E-10 55.1 40.6 483 1-513 1-800 (801) 29 protein:vir:105563 Length: 396 98.4 3.2E-07 2E-10 56.2 19.5 273 1-316 1-396 (396) 30 protein:vir:105647 Length: 800 98.3 1.1E-06 7.1E-10 53.2 41.9 482 1-513 1-799 (800) 31 protein:vir:1543 Length: 801 # 98.3 1.1E-06 7.1E-10 53.2 43.3 483 1-513 1-800 (801) 32 protein:vir:78957 Length: 826 98.2 2.7E-06 1.7E-09 51.1 42.7 483 1-513 1-825 (826) 33 protein:vir:95324 Length: 823 98.2 2.8E-06 1.7E-09 51.0 36.5 476 1-513 1-692 (823) 34 protein:vir:99677 Length: 794 98.2 2.8E-06 1.7E-09 51.0 43.8 483 1-513 1-793 (794) 35 protein:vir:7329 Length: 825 # 98.1 3.9E-06 2.4E-09 50.2 36.2 465 1-513 1-694 (825) 36 protein:vir:7021 Length: 803 # 98.0 7.4E-06 4.6E-09 48.7 41.4 473 1-513 165-802 (803) 37 protein:vir:5120 Length: 615 # 97.8 1.7E-05 1E-08 46.8 30.5 471 1-513 28-613 (615) 38 protein:vir:3306 Length: 567 # 97.6 3.8E-05 2.4E-08 44.8 33.0 424 1-513 25-563 (567) 39 protein:vir:2792 Length: 567 # 97.6 3.8E-05 2.4E-08 44.8 33.0 424 1-513 25-563 (567) 40 protein:vir:10145 Length: 567 97.6 3.8E-05 2.4E-08 44.8 33.0 424 1-513 25-563 (567) 41 protein:vir:9979 Length: 567 # 97.6 3.8E-05 2.4E-08 44.8 33.0 424 1-513 25-563 (567) 42 protein:vir:78703 Length: 905 97.6 3.8E-05 2.4E-08 44.8 38.1 466 1-513 317-904 (905) 43 protein:vir:97014 Length: 800 97.5 4.8E-05 3E-08 44.3 38.8 473 1-513 180-799 (800) 44 protein:vir:827 Length: 567 # 97.5 5.2E-05 3.2E-08 44.1 32.9 429 1-513 25-563 (567) 45 protein:vir:6326 Length: 826 # 97.3 0.0001 6.5E-08 42.4 43.1 482 1-513 1-825 (826) 46 protein:vir:103364 Length: 103 97.2 0.00012 7.2E-08 42.2 15.6 445 1-513 1-521 (1031) 47 protein:vir:96439 Length: 1031 97.1 0.00017 1E-07 41.3 16.0 445 1-513 1-521 (1031) 48 protein:vir:104388 Length: 566 97.0 0.0002 1.3E-07 40.8 34.8 430 1-513 24-562 (566) 49 protein:vir:103341 Length: 806 96.2 0.00093 5.8E-07 37.2 41.1 474 1-513 177-805 (806) 50 protein:vir:1778 Length: 680 # 96.0 0.0011 7.1E-07 36.7 25.6 298 1-349 331-680 (680) 51 protein:vir:100022 Length: 976 95.4 0.0022 1.4E-06 35.2 42.4 470 1-513 328-975 (976) No 1 >protein:vir:8837 Length: 513 # NCBI annotation: constituent protein # Family: family:all:4957 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775245;genbank:gi:27476043;genbank:GeneID:2700591 Probab=100.00 E-value=1.6e-174 Score=973.41 Aligned_cols=513 Identities=95% Similarity=1.562 Sum_probs=504.9 Q ss_pred CcccchhhcCccccccccCcccCCCCcEEEeEEEEEeCCeeEECCCcceeeecCCCcceeeeeeeeCCceEEEEEcCceE Q lcl|NC_010325. 1 MALERQEVKNPTGIVTDIAPADLPLEKWSFGNNVRFKNGKAQKTLGHTPIFDTAQAPILDMFPFIRNNIPYWLLCSEQRL 80 (513) Q Consensus 1 m~~~~~~~~~~~G~~~~~~P~~lp~~a~~~~~Nv~~~~g~~~~~~g~~~~~~~~~~~~~~~~~~~~~g~~~~~v~~~~kl 80 (513) |||||+++||++|+|||++|+|||++|||+|+||+|++|+++|++|++|++++++++|+++++|+++|.++++++++++| T Consensus 1 ~~~~~~~~~~~~g~~~d~~p~~lp~~a~s~~~N~~~~~~~~~~~~g~~pv~a~~~~~~~g~~~~~~~g~~~~~~~~~~~~ 80 (513) T protein:vir:88 1 MALERQEVKNPTGIVTDIAPADLPLDKWSFGNNVRFKNGKAQKALGHSPIFDTAQAPILDMFPFIRNNIPYWLLCSEKRL 80 (513) T ss_pred CCcCChhhcccccceeccChhhcCCCcceeeeeeeEecceeeecCccceeeecCCCCceeeeeeecCCCeEEEEeeceEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEecCceEEeccccceeeCCCCceeEEeeCCEEEEEeCCCceEEEcCCCceecccCCCcccceeeEEEEEcCEEEEEECC Q lcl|NC_010325. 81 YLADGTTIIDVSPGPYSASITNRWSVGSFNGVIFANDGVNPPHHLPPSESTFRVLPNFPANTTFKRLKSFKNFLVGLNAT 160 (513) Q Consensus 81 y~~~~~t~~dis~~~~~~~~~~~w~f~~~~~~~ia~ng~d~~q~~~~~s~~f~~L~g~p~~~ka~~v~~~~~~l~~~g~t 160 (513) |+++++||+|||+++|+++.+++|+|++|+|++||+||+++|||+++++++|++|+|+|+++||++|++|++|||++|++ T Consensus 81 ~~~~~~t~~dvs~~~~~~~~~~~w~~~~f~~~i~a~ng~~~~q~~~~~s~~f~dl~g~p~~~~a~~i~v~~~flv~~~~t 160 (513) T protein:vir:88 81 YLADGTTIIDVSPGPYSASVTNRWSVGSFNGVIFANDGVNPPHHLPPTESVFRVLPNFPANTTFRRLKSFKNFLIGLNVT 160 (513) T ss_pred EEecCceeeeccccceeecccCceeeeeecCEEEEEcCCCcceEEcCCCceeeeccCCCcccceEEEEEEeeEEEEeecc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cCcccCCceEEEeccCCcccccccccccccccCcceecccCCCCceeEEEecCcceEEEecCcEEEEEecCCCceeEeEE Q lcl|NC_010325. 161 SNSVEMPQMVWWSTSADAGGVPASWDPTDPTKDAGQNTLADTNGAIVDGVKLRDSFIIYKEDSVYSMRYIGGLFIFQFQQ 240 (513) Q Consensus 161 ~~~~~~p~rv~wS~~~d~~~~P~~Wd~t~~t~~a~~~dl~d~~G~iv~g~~l~~~~vIf~en~i~~m~y~g~~~~f~~~~ 240 (513) ++++++|||||||+++|++++|++|++++++++|+||+|++++|++|+|++++++++||+|++||+|+|+|++.+|++++ T Consensus 161 ~~~~~~PnrV~wS~~~D~~~~P~~W~~t~~t~~a~~~~l~d~~g~~v~g~~~g~~liif~e~~i~~m~y~g~~~if~~~~ 240 (513) T protein:vir:88 161 SNSIEMPQMVWWSTSADAGGVPASWDPTDPTKDAGQNTLADTNGAIVDGVKLRDSFIIYKEDSVYSMRYIGGLYIFQFQQ 240 (513) T ss_pred cCcCCCCceEEEecccCCcccccccccccccCcccccccCCCccceeeeeecccceEEEecccEEEEEecCCCceEEEEe Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ecCccccccCceeEEECCeEEEEeCCCeEEECCcccccCCchhHHHHHHhhcCcchhCCEEEEEecCCCEEEEEEccCCC Q lcl|NC_010325. 241 LFNDVGILGPNCAVEFDGNHFVVGHGDVYVHNGVQKQSVIDAQVRKFFFSDINPDNYQRTFVLADHVNTEMWVCYSSTRS 320 (513) Q Consensus 241 i~~~~G~~~~~siv~~~~~~ffls~~G~y~~~G~~~~~Ig~~~V~~~~~~~i~~~~~~~~~~~~d~~~~~v~~~~~s~~~ 320 (513) |+.++||++|+||++++++|||++++|||+|+|+++++|++|||+|+|+.++|+.++++|++++||++++|||+||+.++ T Consensus 241 i~~~~G~~~p~SI~~~~~~~ffls~~Gf~~~~G~~~~~Ig~ekVdk~f~~~~n~~~~~~~~~~~d~~~~~v~~~y~s~~~ 320 (513) T protein:vir:88 241 LFNDVGILGPNCAIEFDGNHFVVGHGDVYVHNGVQKQSVIDAQVRKFFFSDINPDNYQRTFVLADHVNTEMWVCYSSTRS 320 (513) T ss_pred ecccccccCCceeEEECCeEEEEeCCceEEecCceeeecccchhhhhhhccCCcccceEEEEEEcCcccEEEEEecCCCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCcccceEEEEecccCeEEEEeccceeeeeecccccccceeecccCcccCccceeccccccccCccceEEEEeecCcee Q lcl|NC_010325. 321 KPGKHCDRAIIWNWKENTWSIRDLPNVLSGAYGIIDPKVSNLWDDDPNPWDTDTSVWGEGSYNPAKSSMIFSSFQDKKLF 400 (513) Q Consensus 321 ~~~~~~d~~lvyd~~~~~Ws~~d~~~~~~~~~g~~~~~~~~~~~~~~~~~d~d~~~~~~ds~~~~~~~~~~~~~~~~~~~ 400 (513) .++++|||+|||||++++||++++|+++++++|+++++.+++|..+..+||.+.++|..|.+...+.+++++.+++++++ T Consensus 321 ~~~~~~~~~lVYd~~~~~Ws~~~~p~~~~g~~g~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~sl~~~~~~~~~~~ 400 (513) T protein:vir:88 321 EPGKHCDRAIIWNWKENTWSIRDLPNVLSGAYGIIDPKTSNLWDDDSNPWDTDTSVWGEGSYNPAKSSMIFTSFQDAKLF 400 (513) T ss_pred CCCcccceEEEEEccCCeEEEEeccchhhcccccccccccceecccccccccchhhhhccccccccceeEeeeccCCcee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eecccceeecCccEEEEeecccccCCCcceEEEeeeeeccCCCeeEEEEeeeeecCCCCceEcCceeeecCCceEEEeec Q lcl|NC_010325. 401 LFGNNSTFSGQNFVSTLERSDIYLGDDRMMKTVSAIIPHITGNGTCNIWVGNAQVQGSGIRWKGPYPYRIGQDYKIDTKH 480 (513) Q Consensus 401 ~~~~~~~~~g~~l~a~~~~~~~~~~~~~~~~~i~~~~~~~t~~~~~~~~~g~~~~~~~~~~w~~~~~~~~~~~~~~~~R~ 480 (513) .++.+++|+|++||++++++++++++++++++|++++|+++.++.+++.+|..++++++++|+++.++++.++|+|++|+ T Consensus 401 ~fd~~~~f~G~~lea~~~t~~~~~~~~~~~~~i~~v~~~~t~~g~~t~~vg~~~~~~~~~~~s~~~~~~~~~~~~~~~r~ 480 (513) T protein:vir:88 401 LFGETSTFSGQSFTSTLERSDIYLGDDRMMKTVSAVIPHITGNGVCNIWVGNAQVQGSGIRWKGPYPYRIGQDYKIDTKH 480 (513) T ss_pred eecccccccCCceEEEEEecCccccCchhheeeeeeeeeeecceEEEEEEeeeccCccccccccceeeecccCceEEecc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCeEEEEEEccCCCcEEEEEEeeEEeccccCC Q lcl|NC_010325. 481 VGRYIALKFDFSSEGDWYFNGYTIEMAPKAGMR 513 (513) Q Consensus 481 ~~Ry~~~rl~~~~g~~w~~~G~~~~~~~~g~rr 513 (513) ++|||+|||+++++++|+|+|||+|++|.-||| T Consensus 481 ~gRy~~~ri~i~~~~~w~~~G~~ve~~~~~g~R 513 (513) T protein:vir:88 481 VGRYIALKFDFASAGDWYFNGYTLEMAPKAGMR 513 (513) T ss_pred CCceEEEEEEccCCCceEEeeEEEEEecCCCCC Confidence 999999999999999999999999999844444 No 2 >protein:vir:2625 Length: 715 # NCBI annotation: gp27 # Family: family:all:5234 # MgeID: mge:55 # MgeName: SIO1 # Cross-refs: genbank:acc:NP_064766;genbank:gi:9964636;genbank:GeneID:1263056 Probab=100.00 E-value=9.1e-37 Score=218.16 Aligned_cols=500 Identities=16% Similarity=0.182 Sum_probs=315.8 Q ss_pred Cccc---chhhcCccccccccCcccCCCCcEEEeEEEEEeCCe-eEECCCcc----eeeec---CCCcceeeeeeee-CC Q lcl|NC_010325. 1 MALE---RQEVKNPTGIVTDIAPADLPLEKWSFGNNVRFKNGK-AQKTLGHT----PIFDT---AQAPILDMFPFIR-NN 68 (513) Q Consensus 1 m~~~---~~~~~~~~G~~~~~~P~~lp~~a~~~~~Nv~~~~g~-~~~~~g~~----~~~~~---~~~~~~~~~~~~~-~g 68 (513) |+-. |-=+-=.-|.||.-+|.-.|-||..|-.|+-....+ -+++.|.- .+..+ +...++.-..|+. .| T Consensus 1 m~~~~~~~~vNtFv~GliTEas~ltfpqnasiDe~N~~l~rdG~r~RR~g~~~E~~~vls~~~vp~galv~~~~W~na~G 80 (715) T protein:vir:26 1 MPQSLTQRTVNTFIKGLITEASELTFPENASVDELNCSLGRDGTRRRRKAVTLEDNHVLSDVVVPEGALVQTLDWYNVAG 80 (715) T ss_pred CCcccchhHHhhhhhheeeccccccCCccceeeeeeeeecCCCcchhhccceeecceEEEEEeecCceeeeeechhhccc Confidence 3211 111111469999999999999999999999887544 44443321 11111 2333444445553 22 Q ss_pred --ceEE-EEEcCceEEEecCceEEeccccce-------------eeCCC-CceeEEeeCCEEEEEeCCCceEEEc--CCC Q lcl|NC_010325. 69 --IPYW-LLCSEQRLYLADGTTIIDVSPGPY-------------SASIT-NRWSVGSFNGVIFANDGVNPPHHLP--PSE 129 (513) Q Consensus 69 --~~~~-~v~~~~kly~~~~~t~~dis~~~~-------------~~~~~-~~w~f~~~~~~~ia~ng~d~~q~~~--~~s 129 (513) ++.. ++--..+||-+..++- .++-+.+ +-++. ++-+.+...+.+|.+|+.-.|-++. ..+ T Consensus 81 ~v~~~~livqvg~~l~f~q~t~~-pLs~~n~~~svdl~~~~~~vn~SPsh~~v~v~~~~G~livanp~i~~~~~~~d~~t 159 (715) T protein:vir:26 81 QVNLEFLVVQVNNILYFYEKSTD-PLSANKYSGSVDLNTHSASNNLSPSEERVQVTSLNGYLIVASPAINTFYLGFNTST 159 (715) T ss_pred ccCcEEEEEEeccEEEEEeccCC-ccccCceeEEeeecceecccccccceeEEEEEEeeeEEEEecCCccEEEEEecCCc Confidence 3333 3333457887776641 2222222 12222 3677888999999999877766532 111 Q ss_pred ceec-------cc--C------------------------------CC-------------------------------- Q lcl|NC_010325. 130 STFR-------VL--P------------------------------NF-------------------------------- 138 (513) Q Consensus 130 ~~f~-------~L--~------------------------------g~-------------------------------- 138 (513) ..|+ ++ . || T Consensus 160 ~s~t~~~ll~r~r~f~~qg~d~~~g~~y~~~gt~~tn~~iynlyN~gw~~p~gt~~~N~~~~yiVypa~s~~~~S~kd~n 239 (715) T protein:vir:26 160 EAFTATSISFKERDFEWQGSDVDVTSLYFGEGTSVSNQRIYDTYNVGWVGPKGSAALNTYGSYIVYPALTHPWYSGKDAN 239 (715) T ss_pred ceeEeeEEEEEeeeheeeccccccccccccCCcccCchhheecccceeecceeEEEEcCCCCceEecccccccCCCcccc Confidence 1111 11 0 00 Q ss_pred -----------------cc---------------------cceeeEEEEEcCEEEEEECCcCcccCCceEEEecc----C Q lcl|NC_010325. 139 -----------------PA---------------------NTTFKRLKSFKNFLVGLNATSNSVEMPQMVWWSTS----A 176 (513) Q Consensus 139 -----------------p~---------------------~~ka~~v~~~~~~l~~~g~t~~~~~~p~rv~wS~~----~ 176 (513) |+ --+.++++-|++||++.|+..+ +...||.+|.+ . T Consensus 240 ~afsk~ad~ei~tGt~~~~~G~yi~D~~~~g~~~leeev~k~R~rsv~~yaGrV~yagiD~d--kng~rilfSqLv~s~~ 317 (715) T protein:vir:26 240 GAFNKADWLEIYTGSSLASNGHYVLDVFNKARTGLTTEVETGRFRSVAAYAGRVFYAGIDSA--KNGGKVYFSRLTERMS 317 (715) T ss_pred cccChhhccccccccccccCceEEEeeeecCCccchhhhhcCCCcceeeecceEEEeecccc--cCCCeEEEehhhcchh Confidence 00 0156678999999999987433 33448999987 5 Q ss_pred CcccccccccccccccCcceecccCCCCc---------eeEEEecCcceEEEecCcEEEEEe---cCCCceeEeEEecCc Q lcl|NC_010325. 177 DAGGVPASWDPTDPTKDAGQNTLADTNGA---------IVDGVKLRDSFIIYKEDSVYSMRY---IGGLFIFQFQQLFND 244 (513) Q Consensus 177 d~~~~P~~Wd~t~~t~~a~~~dl~d~~G~---------iv~g~~l~~~~vIf~en~i~~m~y---~g~~~~f~~~~i~~~ 244 (513) |+.+|+|.-|+|+..- .||.|++|. |+.++.++..++||++|+||++.- +-..+.|.+.||++ T Consensus 318 di~nCyQd~DPTsee~----~dLidTDGg~iri~gah~ii~Lv~f~~sLlvf~~NGVWAi~G~d~g~tATdY~ltKIs~- 392 (715) T protein:vir:26 318 DVGNCYQVNDPTSEVL----SDLLDTDGGVVRIPDAHNIRKLHVLGASLLVFAENGVWAVAGVDNVFRATEYAITRISD- 392 (715) T ss_pred hcccccccCCCchhhh----hhhhhcCCCEEEecCCCCceeEEEecceEEEEEecceEEEeccCCceeeeeeEEEEeee- Confidence 7999999999887764 356666665 778889999999999999999952 23466899999997 Q ss_pred cccccCceeEEECCeEEEEeCCCeEEEC----Cccccc-CCchhHHHHHHhhcCcchhCCEEEEEecCCCEEEEEEccCC Q lcl|NC_010325. 245 VGILGPNCAVEFDGNHFVVGHGDVYVHN----GVQKQS-VIDAQVRKFFFSDINPDNYQRTFVLADHVNTEMWVCYSSTR 319 (513) Q Consensus 245 ~G~~~~~siv~~~~~~ffls~~G~y~~~----G~~~~~-Ig~~~V~~~~~~~i~~~~~~~~~~~~d~~~~~v~~~~~s~~ 319 (513) +||.+|+||+.+|+.++||+++|||.+. -+.+.+ +.+|+--..++..|+.+..-...+.||..++||||+||.++ T Consensus 393 vg~sspnSvVvv~~~i~~WsdtGIyal~~Nd~fn~~tAqNLTekTIq~~~~~I~~dk~knVtg~fd~~e~rVyW~yPn~d 472 (715) T protein:vir:26 393 VGLSNENSFVVADGIPIWWGKTGIYAVQQSENLNTPTAQNLSLSTIQTLWNNISNAKKAQVTVEYDKINQRVFWFYPDND 472 (715) T ss_pred eccCCCccEEEecceEEEeeCCcEEEEEeccccCcchhhccchHHHHHHHhhcchhhhcceEEEEEccCCEEEEEEcCCc Confidence 9999999999999999999999999653 233332 44544444577999999999999999999999999999987 Q ss_pred CCCCcccceEEEEecccCeEE---EEeccce--------eeeeecccccccceeecccCcccCccceeccccccccCcc- Q lcl|NC_010325. 320 SKPGKHCDRAIIWNWKENTWS---IRDLPNV--------LSGAYGIIDPKVSNLWDDDPNPWDTDTSVWGEGSYNPAKS- 387 (513) Q Consensus 320 ~~~~~~~d~~lvyd~~~~~Ws---~~d~~~~--------~~~~~g~~~~~~~~~~~~~~~~~d~d~~~~~~ds~~~~~~- 387 (513) ..-.+++|.+||+|.++|+++ +.+-... +++.++.......+.-..+-..-+.+..+-.+.+...-+. T Consensus 473 t~vdykyd~vLV~dLalgaFYp~~v~~~a~~~~~~ig~~~~~~~~~~~t~~~vv~~~~v~~~g~~~~v~~~~r~~~~~~~ 552 (715) T protein:vir:26 473 ESVDYKYNNILVMDLALQAFYPWRVEDEASSTSYIIGTSYYGGLGSTSTETQVVNGADVVVNGSDNVVATLYRDYLEGDS 552 (715) T ss_pred eeeceeecCeEEEEecccccccccccccccccceeeeeeeeCCcccccchhheeccceEEEeccceEEEEeecccccccc Confidence 766678899999999999965 4442221 2223333333333222222222222222211111111111 Q ss_pred ------------ceEEEEe----------------ecCceeeecccceeecCccEEEEeeccccc---C----CCc---- Q lcl|NC_010325. 388 ------------SMIFSSF----------------QDKKLFLFGNNSTFSGQNFVSTLERSDIYL---G----DDR---- 428 (513) Q Consensus 388 ------------~~~~~~~----------------~~~~~~~~~~~~~~~g~~l~a~~~~~~~~~---~----~~~---- 428 (513) ++-++.| ...+++.+++-.+|+-.|+..+.++...+. + ++. T Consensus 553 ~~~~~~~~~~~~~~~f~~~~~~~~~dw~s~d~~~~~~~gy~~~gd~~~~k~~pyvt~~~~~tedg~v~~~~g~~p~n~sS 632 (715) T protein:vir:26 553 EIKLLVRDGTTGKMTFATFRGDTYLDWGSADYKSFAEAGYDFMGDITTFKNAPYVTTYMRVTEDGYVASGAGYEFINPSS 632 (715) T ss_pred eEEEEEEcCCceeEEEecccCceeeeccccchhhHHHhhhhhcccceeeecCceEEEEEEEecccceeccCCccccCCcc Confidence 1223333 333555677788899899887776543332 1 111 Q ss_pred --ceEEEeeeeeccCCCeeEEEEeeeeecCCCCceEcCceeeecC-CceEEEeecCCCeEEEEEEccCCCcEEEEEEeeE Q lcl|NC_010325. 429 --MMKTVSAIIPHITGNGTCNIWVGNAQVQGSGIRWKGPYPYRIG-QDYKIDTKHVGRYIALKFDFSSEGDWYFNGYTIE 505 (513) Q Consensus 429 --~~~~i~~~~~~~t~~~~~~~~~g~~~~~~~~~~w~~~~~~~~~-~~~~~~~R~~~Ry~~~rl~~~~g~~w~~~G~~~~ 505 (513) |+.+++--+...+.++++++.-..-..++ -.+++.+... ++.|..+|.+||-++||++...|++.++.||.+. T Consensus 633 clm~~sw~ws~s~st~~eaYk~~~~~~~~p~----~~s~~~yp~~~VvTKsriRG~Gr~~~~rf~s~~gKdlhl~Gysil 708 (715) T protein:vir:26 633 CLMSVSWNLSKSGSTPREIYKLKDVPVVNPN----DLSSINYPTDTVVTKSKVRGRGRSMKFRFESVAGKDFHLVGYEVI 708 (715) T ss_pred eEEEEEeeeccCCCChhhhheecceeeeCCC----ccccccCCcceeEeeeeeeccceEEEEEEEecCCcceEEEeEEEE Confidence 44455544555666777766532122222 1223222211 2578899999999999999999999999999999 Q ss_pred EeccccC Q lcl|NC_010325. 506 MAPKAGM 512 (513) Q Consensus 506 ~~~~g~r 512 (513) .++--.- T Consensus 709 g~~~~~~ 715 (715) T protein:vir:26 709 GAKNNSY 715 (715) T ss_pred ecccCCC Confidence 7765544 No 3 >protein:vir:95475 Length: 771 # NCBI annotation: hypothetical protein ORF038 # Family: family:all:5234 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294631;genbank:gi:149408197;genbank:GeneID:5237042 Probab=100.00 E-value=1.1e-29 Score=179.38 Aligned_cols=499 Identities=11% Similarity=0.081 Sum_probs=287.7 Q ss_pred Cccc---chhhcCccccccccCcccCCCCcEEEeEEEEEeCCe-eEECCCc------ceeeec--CCCc--c-eeeeeee Q lcl|NC_010325. 1 MALE---RQEVKNPTGIVTDIAPADLPLEKWSFGNNVRFKNGK-AQKTLGH------TPIFDT--AQAP--I-LDMFPFI 65 (513) Q Consensus 1 m~~~---~~~~~~~~G~~~~~~P~~lp~~a~~~~~Nv~~~~g~-~~~~~g~------~~~~~~--~~~~--~-~~~~~~~ 65 (513) |+-. |-=+-=.-|.||.-+|.-.|-||..|-.|+-....+ -+++.|. ..+..+ +|+. + +.-..|+ T Consensus 1 m~~~~~~~~vNtFv~GliTEas~ltfpqnasiDe~N~~l~rdG~r~RR~g~~~E~~~~~vls~~~vpa~g~~~v~~~~W~ 80 (771) T protein:vir:95 1 MAKTTNAAEFNTFVGGLITEASPLTFPQNASIDEVNFILNRDGSRNRRNGMDFENGATKVVCNTLVPADGTIAVTSHNWE 80 (771) T ss_pred CCcccchhHHhhhhhheeeccccccCCccceeeeeeeeecCCCcchhhceeeeecCCceEEEEEEecccceEEeeeechh Confidence 3211 111111469999999999999999999999887544 4444332 222222 1221 1 2223444 Q ss_pred e-CC--ceEE-EEEcCceEEEecCceEEecccc------ceeeCCCCceeEEeeCCEEEEEeCCCceEEEc--CCCceec Q lcl|NC_010325. 66 R-NN--IPYW-LLCSEQRLYLADGTTIIDVSPG------PYSASITNRWSVGSFNGVIFANDGVNPPHHLP--PSESTFR 133 (513) Q Consensus 66 ~-~g--~~~~-~v~~~~kly~~~~~t~~dis~~------~~~~~~~~~w~f~~~~~~~ia~ng~d~~q~~~--~~s~~f~ 133 (513) . .| ++.. ++--..+||-+..++- .+|-+ ..+.++..+-+.+...+.+|.+|+.-.|-++. ..+-.|+ T Consensus 81 na~G~v~~~~livqvg~~l~f~q~t~~-pLs~~n~~~~a~~nlSPsh~isv~v~~G~livanp~i~~~~~~~d~~t~s~t 159 (771) T protein:vir:95 81 NAGGEVGRWISLVQVGTELKFFQTTGE-TLSEGNFYNYQFVNMSPSHKLSYAVVDGLLVVANGSRDIYVFEYDSGSVSVT 159 (771) T ss_pred hcccccCcEEEEEEeccEEEEEecCCC-cccccceeeeecceeccceeEEEEEeeeEEEEecCCccEEEEEecCCcceeE Confidence 2 22 3333 3333457777776641 12211 22345556788888888888888655444321 1110111 Q ss_pred ccC-------CC--------------------------------------------------------------cc---- Q lcl|NC_010325. 134 VLP-------NF--------------------------------------------------------------PA---- 140 (513) Q Consensus 134 ~L~-------g~--------------------------------------------------------------p~---- 140 (513) .+. ++ |+ T Consensus 160 ~~~ll~r~rf~~q~~~~G~d~~~~~~~~~~gt~~tn~~iynlyN~gw~~pk~~~~snt~~~~iV~~y~a~~g~~pS~sd~ 239 (771) T protein:vir:95 160 TKRLLVRDLFGVQDIVNGVDLRQGNDIATRPTVQTNAHIYNLRNQTFGVPRVTWHSNEPSDPIVTFRSAASGKFPSNSDS 239 (771) T ss_pred eeeeeeeehhhccccccccceecccccccCCcccCchhheeccccceeccccccccCCccccceEeeeccCCCCcCCcee Confidence 000 00 00 Q ss_pred ----------------------------------------------------------------------------ccee Q lcl|NC_010325. 141 ----------------------------------------------------------------------------NTTF 144 (513) Q Consensus 141 ----------------------------------------------------------------------------~~ka 144 (513) .-.+ T Consensus 240 ~N~a~~k~~~~Ei~t~~~f~~~~~~~~~~Gt~~~~~G~yi~da~~~g~~~Lt~~ve~~gr~~s~~~~~~~l~~~~t~~~~ 319 (771) T protein:vir:95 240 VNLALSKRADVEPSTTDRFRAEDIVLNPIGTYETARGFFIIDAMARGKSRLEEIVKLKQRYPSLSFGVSSLPQDETPGGA 319 (771) T ss_pred eccccchhhccceeeecccchhhhhhcccCcccccCcceeeehhhhcccccceeeeccccchhhhccccccccccCCCCc Confidence 0034 Q ss_pred eEEEEEcCEEEEEEC-------CcCcccCCceEEEecc----CCcccccccccccccccCcceecccCCCCc-------- Q lcl|NC_010325. 145 KRLKSFKNFLVGLNA-------TSNSVEMPQMVWWSTS----ADAGGVPASWDPTDPTKDAGQNTLADTNGA-------- 205 (513) Q Consensus 145 ~~v~~~~~~l~~~g~-------t~~~~~~p~rv~wS~~----~d~~~~P~~Wd~t~~t~~a~~~dl~d~~G~-------- 205 (513) +.|+-+.+||++.|- .+++.+...||.+|.+ .|+.+|+|.-|+|+..- .||.|++|. T Consensus 320 ~~vaeyagRvwYag~~~~~iD~dkng~~~~~~ilfSqLv~s~~di~nCyQd~DPTsee~----~dLidTDGg~iri~gah 395 (771) T protein:vir:95 320 SVVCEYAGRVWYAGFSGQIIDGDDQSPRLVSYILFSQLVDSPADIVNCYQDGDPTSTEE----PELVDTDGGFIRIEGAH 395 (771) T ss_pred eeEEeeeeeEEEecceeEEeeccccCCceeeeEeeehhhcchhhcccccccCCCchhhh----hhhhhcCCCEEEecCCC Confidence 558889999999882 3555555678999977 57999999999887764 356666665 Q ss_pred -eeEEEecCcceEEEecCcEEEEEec----CCCceeEeEEecCccccccCceeEEECCeEEEEeCCCeEEEC---Ccccc Q lcl|NC_010325. 206 -IVDGVKLRDSFIIYKEDSVYSMRYI----GGLFIFQFQQLFNDVGILGPNCAVEFDGNHFVVGHGDVYVHN---GVQKQ 277 (513) Q Consensus 206 -iv~g~~l~~~~vIf~en~i~~m~y~----g~~~~f~~~~i~~~~G~~~~~siv~~~~~~ffls~~G~y~~~---G~~~~ 277 (513) |+.++.++..++||++|+||++.-+ -..+.|.+.||++ +||.+|+||+.+|+.++||+++|||.+. -+.+. T Consensus 396 ~ii~Lv~f~~sLlvfc~NGVWAi~ggsd~g~tAtdY~ltKIs~-vg~sspnSvVvvg~~i~ywsdtgIyal~~Ndfn~~t 474 (771) T protein:vir:95 396 DIINLVNVGSAVMVVAANGIWMIQGGSDYGFTATNYLVTKISE-HGCSSPNSVVVVDNSFMYWGDDGIYHLTRNQYGDYV 474 (771) T ss_pred CceeEEEecceEEEEEecceEEEEeccCCceeeeeeEEEEeee-eccCCCccEEEecceEEEeeCCceEEEeecccCcch Confidence 7788899999999999999999533 2366799999997 9999999999999999999999999653 33333 Q ss_pred c-CCchhHHHHHHhhcCcchhCCEEEEEecCCCEEEEEEccCCCCCCcccc--eEEEEecccCeEEE---Eec---cc-- Q lcl|NC_010325. 278 S-VIDAQVRKFFFSDINPDNYQRTFVLADHVNTEMWVCYSSTRSKPGKHCD--RAIIWNWKENTWSI---RDL---PN-- 346 (513) Q Consensus 278 ~-Ig~~~V~~~~~~~i~~~~~~~~~~~~d~~~~~v~~~~~s~~~~~~~~~d--~~lvyd~~~~~Ws~---~d~---~~-- 346 (513) + +.+|+--..++..|+.+..-...+.||..++||||+||.. ..++.+ .-||+|.++|+++- .++ +. T Consensus 475 AqnLTekTIq~~~~~I~~dk~knVtg~fd~~e~rvyw~yPn~---~D~~~e~~t~LV~dLalgaFYp~~i~~~~ag~l~~ 551 (771) T protein:vir:95 475 ANNLTEKTIQKYYEKIPSDAILNATGFYDSYDKKVKWLYNTV---LDGRTEPVTELVFDLALGAFYPSKIGSLTAGRLPI 551 (771) T ss_pred hhccchHHHHHHHhhcchhhhcceEEEEEccCCEEEEEecce---ecCCCcceeeeeeeecccccccccccccccCccce Confidence 3 4454444457799999999999999999999999999943 111222 23999999999653 332 11 Q ss_pred ----eeeeeecccccccceeecc-cCcccCccceeccccccccCccceEEEEeecCcee--------------------- Q lcl|NC_010325. 347 ----VLSGAYGIIDPKVSNLWDD-DPNPWDTDTSVWGEGSYNPAKSSMIFSSFQDKKLF--------------------- 400 (513) Q Consensus 347 ----~~~~~~g~~~~~~~~~~~~-~~~~~d~d~~~~~~ds~~~~~~~~~~~~~~~~~~~--------------------- 400 (513) .++.++++.....+++... +-..-+.+..+-..-+...-++..+...+.++-.. T Consensus 552 ~vg~~~~p~~~lv~T~~eV~v~~~~v~~tG~~vtV~~~~r~~~~~~~~y~~~~~dg~~g~~~Fa~~~~~~f~DW~sv~~~ 631 (771) T protein:vir:95 552 PVGSVKIPPYKLVETGEEVTVASEQVTATGELVTVKVSTRSPVIRETKYIIVEKLSSPMRISFGGYTDEEFVDWKSVDGI 631 (771) T ss_pred eeeeeecCccccccccceEEecceeeEecCCceEEEEEEeeccccceEEEEEEecCCCeeEEeccccCcceeecccCCCc Confidence 1222233322222221111 11111222222222222222333333333333221 Q ss_pred -------------eecccceeecCccEEEEeecccc---cCCCc-----------ceEEEeeeeeccCCC-----eeEEE Q lcl|NC_010325. 401 -------------LFGNNSTFSGQNFVSTLERSDIY---LGDDR-----------MMKTVSAIIPHITGN-----GTCNI 448 (513) Q Consensus 401 -------------~~~~~~~~~g~~l~a~~~~~~~~---~~~~~-----------~~~~i~~~~~~~t~~-----~~~~~ 448 (513) .+++-..++-.|+.....+...+ .+-.+ |+.+++--...-+.. +++++ T Consensus 632 ~vdy~sy~~~gY~~~gd~~~~k~~PYit~y~~~tedg~v~~~~g~~~p~n~sSclm~~sw~ws~s~~t~k~~~~~eaYk~ 711 (771) T protein:vir:95 632 GVDAPAYLLTGYLAGGDYQREKFVPYITFHFKKTEDGFVEDAEGDWTPTNQSSCMVQSQWSWTNSPASNKWGRTWQAYRF 711 (771) T ss_pred ccchHHHHHhhhhccchheeeeccceEEEEEEeecccceecccccccccCCcceEEEEEeeeecCCCCCccccchheeee Confidence 12222233333332222221111 11000 222221111111111 23332 Q ss_pred EeeeeecCCCCceEcCceeeecCCceEEEeecCCCeEEEEEEccCCCcEEEEEEeeEEeccccC Q lcl|NC_010325. 449 WVGNAQVQGSGIRWKGPYPYRIGQDYKIDTKHVGRYIALKFDFSSEGDWYFNGYTIEMAPKAGM 512 (513) Q Consensus 449 ~~g~~~~~~~~~~w~~~~~~~~~~~~~~~~R~~~Ry~~~rl~~~~g~~w~~~G~~~~~~~~g~r 512 (513) . ...+++.-- ....++++=++.|..+|.+||-++||++...|++.++.||.+..-.-|.- T Consensus 712 ~--~~~~p~~~~--~~~yp~~~VV~TKsriRG~Gr~~~~rf~s~~gKdlhl~Gysil~~~~~~~ 771 (771) T protein:vir:95 712 R--RHFFPDNID--NQFDDGNSVVETKSRLRGSGKVLSLYITTEPKKNLHIYGWSMLVDVNGTV 771 (771) T ss_pred c--ceeccCCcc--hhcCCccceeeeeheeeecceEEEEEEEecCCcceEEEeEEEEEeecCcC Confidence 2 223333211 12223343345788999999999999999999999999999988766666 No 4 >protein:vir:3529 Length: 477 # NCBI annotation: P28 # Family: family:all:1540 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050989;genbank:gi:9633575;genbank:GeneID:1262322 Probab=99.91 E-value=6.1e-24 Score=147.85 Aligned_cols=433 Identities=14% Similarity=0.126 Sum_probs=261.9 Q ss_pred CcccchhhcCccccccccCccc----CCCCcEEEeEEEEEeCCeeEECCCcceeeecCCCcceeeeeeeeCCceEEEEEc Q lcl|NC_010325. 1 MALERQEVKNPTGIVTDIAPAD----LPLEKWSFGNNVRFKNGKAQKTLGHTPIFDTAQAPILDMFPFIRNNIPYWLLCS 76 (513) Q Consensus 1 m~~~~~~~~~~~G~~~~~~P~~----lp~~a~~~~~Nv~~~~g~~~~~~g~~~~~~~~~~~~~~~~~~~~~g~~~~~v~~ 76 (513) |.++++-. +.|..++..+.| ||-|..-.-.++....++++.+||..+..+ .++++.|++....+|..+ ++. T Consensus 7 m~~~~ipl--~~g~~~~~~~~d~~~~~PVN~~a~p~~~~~s~~~L~~~pG~~~~~~-~~G~~RG~~~~~~~g~lY--~V~ 81 (477) T protein:vir:35 7 MPKIQIPL--AKGLVKDIKTADYIDALPVNMLATPKEVLNASGYLRSFPGIEKKQD-AKGVSRGVHFNTKNNALY--RVC 81 (477) T ss_pred eeeecccc--ccccccccccccceeeeeeccceeeccccccccccccCCcceeecc-CCccccceeEeecCCeEE--EEe Confidence 88877666 577777665553 344433333444455688888899988544 688888886544555544 445 Q ss_pred CceEEEecCceEEeccccceeeCCCCceeEEeeCCE-EEEEeCCCceEEEcCCCceecccCC--Ccc--cceeeEEEEEc Q lcl|NC_010325. 77 EQRLYLADGTTIIDVSPGPYSASITNRWSVGSFNGV-IFANDGVNPPHHLPPSESTFRVLPN--FPA--NTTFKRLKSFK 151 (513) Q Consensus 77 ~~kly~~~~~t~~dis~~~~~~~~~~~w~f~~~~~~-~ia~ng~d~~q~~~~~s~~f~~L~g--~p~--~~ka~~v~~~~ 151 (513) .++||+-+ ++..+|+++ .+-+++-=+.. .|.++|...-.+|+++..+++.++. .|. ...++.|+... T Consensus 82 G~~LY~v~-~~vG~I~gs-------g~VsMa~n~~~~aIv~~g~~~gy~y~~t~~~~~~~~~~~~p~~~l~~~~~v~f~d 153 (477) T protein:vir:35 82 GNTLYRND-KEVADIAGM-------SRVSMSHSSHSQAICFEGKVKLYRYDGTEKALSNWPKDKYPQYDLGEVIDVCRNR 153 (477) T ss_pred cCeeEeee-eeeeeeccc-------ccEEEeeCCcEEEEEECCcceeEEEecccceeeecCccccCCccccceeEEEeeC Confidence 56899864 333444322 34566554444 4666777666788988888777762 221 22366778888 Q ss_pred CEEEEEECCcCcccCCceEEEeccCCcccccccccccccccCcceecccCCCCceeEEEecCcceEEEecCcE--EEEEe Q lcl|NC_010325. 152 NFLVGLNATSNSVEMPQMVWWSTSADAGGVPASWDPTDPTKDAGQNTLADTNGAIVDGVKLRDSFIIYKEDSV--YSMRY 229 (513) Q Consensus 152 ~~l~~~g~t~~~~~~p~rv~wS~~~d~~~~P~~Wd~t~~t~~a~~~dl~d~~G~iv~g~~l~~~~vIf~en~i--~~m~y 229 (513) +|. ++...+ .+++.+|++.|+ ..++..+. |---...+.+||+.....+.+++|-+++| |..+ T Consensus 154 Gyf-V~~~~g-----t~~~~iS~L~d~----s~~d~~~~-----FasAE~~pD~Ivgi~~~~~~i~lfG~~TiEvw~nt- 217 (477) T protein:vir:35 154 GRY-IWLQKG-----GERFGVTDLEDE----SKPDRYQP-----FYRAESQPDGIVSVDAWRDLIVCFGSSSIEYFTLT- 217 (477) T ss_pred ceE-EEeecC-----CCeEEEeecCCc----cccccccc-----cccccCCCCceEEEEeeccEEEEEeccceEEEEec- Confidence 884 344433 366888999985 34443321 11111334679999999999999999998 7763 Q ss_pred cCCCceeEeEEec----CccccccCceeEEECCeEEEEeCC-----CeEEECCcccccCCchhHHHHHHhhcCcchhCCE Q lcl|NC_010325. 230 IGGLFIFQFQQLF----NDVGILGPNCAVEFDGNHFVVGHG-----DVYVHNGVQKQSVIDAQVRKFFFSDINPDNYQRT 300 (513) Q Consensus 230 ~g~~~~f~~~~i~----~~~G~~~~~siv~~~~~~ffls~~-----G~y~~~G~~~~~Ig~~~V~~~~~~~i~~~~~~~~ 300 (513) ++.+.-|.++.+. -..||.++.|++.+++.+|||+++ -+|+++|.+++.|.+..|++.+ ..+....+... T Consensus 218 G~a~f~~p~~r~~~~~mIq~Gcaa~~sv~~~~~t~~~l~~d~~g~~~V~~~~g~q~~rIST~aIE~~i-~ay~~~e~a~a 296 (477) T protein:vir:35 218 GSADTSQPLYIHQAAYMIQAGIAGRDCKCRYQDKYAILSHQSTGQPAVYLIGAGEKNKISTATIDKII-RYYSADELAAS 296 (477) T ss_pred CCCCCCcceeecCCceeeeecccCchhhhhhCceEEEEecCCCcccEEEEccCceeEEecCHHHHHHH-HhcCCcchhce Confidence 3445444555543 488999999999999999999997 2678899999999999999865 56665555565 Q ss_pred EEEE--ecCCCEEEEEEccCCCCCCcccceEEEEecccCeEE----EEeccceeeeeecccccccceeecccCcccCccc Q lcl|NC_010325. 301 FVLA--DHVNTEMWVCYSSTRSKPGKHCDRAIIWNWKENTWS----IRDLPNVLSGAYGIIDPKVSNLWDDDPNPWDTDT 374 (513) Q Consensus 301 ~~~~--d~~~~~v~~~~~s~~~~~~~~~d~~lvyd~~~~~Ws----~~d~~~~~~~~~g~~~~~~~~~~~~~~~~~d~d~ 374 (513) +++. ..-...|++.|| ++.++||..++-|. ...--. . - T Consensus 297 f~~t~~~eGH~fy~LtfP----------~~Tw~yD~at~~w~e~W~~~~~g~------------~--------------~ 340 (477) T protein:vir:35 297 FMESIRFDNHELLLLHLP----------KHTLCFDGSASHQYSQWSLLKSGF------------Y--------------D 340 (477) T ss_pred eEEEEEeCCeeEEEEEcC----------CceEEEecccccccceeeeeccCC------------c--------------c Confidence 5434 333334556665 47899999887654 321100 0 0 Q ss_pred eeccccccccCccceEEEEeecCceeeec-ccceeecCccEEEEeecccccCCCcceEEEeeeeeccCCCeeE--EEEee Q lcl|NC_010325. 375 SVWGEGSYNPAKSSMIFSSFQDKKLFLFG-NNSTFSGQNFVSTLERSDIYLGDDRMMKTVSAIIPHITGNGTC--NIWVG 451 (513) Q Consensus 375 ~~~~~ds~~~~~~~~~~~~~~~~~~~~~~-~~~~~~g~~l~a~~~~~~~~~~~~~~~~~i~~~~~~~t~~~~~--~~~~g 451 (513) ..|......+..+..+.|...++.++.++ ..-+-.|.+++..+.++.+..++ .|..... +...++.++. .+.+. T Consensus 341 ~~~Ra~~~~~~~g~~~vGD~~ng~l~~ld~~~~~d~g~~i~~~~~~p~~~~d~-~Rv~~~e--l~~~tGvgq~~d~v~L~ 417 (477) T protein:vir:35 341 EPYRAIDFMFFDNQITVGDKKEGVLGHLIFNASNQYEQQTEHLLYTPMIKADN-ARLFDFE--LEASTGVAQIADKLFLS 417 (477) T ss_pred CceEEEEEEEeCCeEEEEEcCCCeEEEECCCCcccCCCccceEEecceeeCCC-CeEEEEE--EEEecCcCccCceEEEE Confidence 11333334444556677777777777653 23344678888888877666554 3443222 2222222211 13332 Q ss_pred eeecCCCCceEcCceeeecCCceEEEeecCC--------Ce-EEEEEEccCCCcEEEEEEeeE Q lcl|NC_010325. 452 NAQVQGSGIRWKGPYPYRIGQDYKIDTKHVG--------RY-IALKFDFSSEGDWYFNGYTIE 505 (513) Q Consensus 452 ~~~~~~~~~~w~~~~~~~~~~~~~~~~R~~~--------Ry-~~~rl~~~~g~~w~~~G~~~~ 505 (513) -++ +-.+|+.++...-|.-+..+.|+.. |- .+||+..++=-.=+.-+.-+| T Consensus 418 ~sd---dG~~~~~~~~~~~g~~g~~~~r~~~~RlG~~r~~vgf~~r~~~~~pv~l~~~~~~~e 477 (477) T protein:vir:35 418 VTT---DGINYSREQLIEQNSPFQYDKRILWRRIGRVRKNIGFKIRIITKSPVTLSDLSIRME 477 (477) T ss_pred Eec---cccccccceeecCCCccccccceeeeeeeeceeccceEEEEEecCCceeccceeEeC Confidence 232 3667888876665554444444322 22 345554443222222334444 No 5 >protein:vir:3133 Length: 911 # NCBI annotation: hypothetical protein # Family: family:all:5234 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640315;genbank:gi:21234408;genbank:GeneID:956056 Probab=99.91 E-value=5e-25 Score=153.83 Aligned_cols=490 Identities=13% Similarity=0.099 Sum_probs=219.9 Q ss_pred CcccchhhcCcccccc-ccCcccCCC------CcEEEeEEE----EEeCCeeEECCCcceeeecCCCcceeeeeeeeCCc Q lcl|NC_010325. 1 MALERQEVKNPTGIVT-DIAPADLPL------EKWSFGNNV----RFKNGKAQKTLGHTPIFDTAQAPILDMFPFIRNNI 69 (513) Q Consensus 1 m~~~~~~~~~~~G~~~-~~~P~~lp~------~a~~~~~Nv----~~~~g~~~~~~g~~~~~~~~~~~~~~~~~~~~~g~ 69 (513) --+.+++++|--|+-+ .+.|.-|-. .-++++.|- .|+.------.|++.+..+...+...-..|. + T Consensus 152 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~ 228 (911) T protein:vir:31 152 PVLIKLDDVDDEGVPTLSYEPLTLLIRTRELLTPYTTGTNYGDTLTPEEEWNLYNSGWATITRATKDKSGSGTVYV---N 228 (911) T ss_pred eEEEEeeccCccCcccccccceeeEeeehhhccccccccccCcccCchhhcccccccceeeeeecccCCccceEEE---c Confidence 2234555666666655 444442211 112222221 1110000001122222222211111100000 0 Q ss_pred eEEEEEcCceEEEecCc---eEEecccc------ceeeCCCCceeEEe--------------eCCEEEEEe--CCCceEE Q lcl|NC_010325. 70 PYWLLCSEQRLYLADGT---TIIDVSPG------PYSASITNRWSVGS--------------FNGVIFAND--GVNPPHH 124 (513) Q Consensus 70 ~~~~v~~~~kly~~~~~---t~~dis~~------~~~~~~~~~w~f~~--------------~~~~~ia~n--g~d~~q~ 124 (513) ...+-.++..+|---.. +-++-|.. -.+-.++++..|+. |....|..= |+..|-- T Consensus 229 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 308 (911) T protein:vir:31 229 PVQYYFDKRGVYPSHSVLYNSMKQESAKEIVALNVFSPWADEKINFGTTTPPLGRYIHSAYYFDSAAILSLGIGNLTPPT 308 (911) T ss_pred hhheeecccCcCcchhhhhhhhhhhccceeEEEeeeccccccccccccCCCchhhhhhhheeeccceeeeecccccCCCC Confidence 00011111111100000 00000000 01122233333221 111111111 1222221 Q ss_pred EcCC---Cceec-------ccCCCcccceeeEEE----------------EEcCEEEEEECCcCcccCCceEEEecc--- Q lcl|NC_010325. 125 LPPS---ESTFR-------VLPNFPANTTFKRLK----------------SFKNFLVGLNATSNSVEMPQMVWWSTS--- 175 (513) Q Consensus 125 ~~~~---s~~f~-------~L~g~p~~~ka~~v~----------------~~~~~l~~~g~t~~~~~~p~rv~wS~~--- 175 (513) -|++ |+.-+ .|........||+|. .|++|||+.+...++ .+||.||.+ T Consensus 309 ~~~~~~~~~p~~~e~~np~gl~~igt~~n~k~~a~~~~~~~~~~r~r~~~~yaGRVfyaD~dkng---k~rIlFSqLv~s 385 (911) T protein:vir:31 309 SDGTTEGSGPAEEEISNPIGLDNIGTVNNLKLIAEGTVRWTVKDRPRCSGYHNGHVYFGDRDKNG---KTRILVSQLVNS 385 (911) T ss_pred CCCccCCCCCchhhhcCCCCcccccchhceeeeeccceeeeecccccceeeeccEEEEeeeccCc---ceeEEEEeeccc Confidence 1110 11101 111112222455555 999999998765544 789999977 Q ss_pred -CCcccccccccccccccCcceecccCCCCc---------eeEEEecCcceEEEecCcEEEEE----ecCCCceeEeEEe Q lcl|NC_010325. 176 -ADAGGVPASWDPTDPTKDAGQNTLADTNGA---------IVDGVKLRDSFIIYKEDSVYSMR----YIGGLFIFQFQQL 241 (513) Q Consensus 176 -~d~~~~P~~Wd~t~~t~~a~~~dl~d~~G~---------iv~g~~l~~~~vIf~en~i~~m~----y~g~~~~f~~~~i 241 (513) .|+.+|+|.-|+++...+ ||.+++|. |+..+..+..++||++|+||.+. |++..+.|.+.|| T Consensus 386 l~di~nCYQdaDPTSeee~----DLIdTDGg~vri~gah~Ii~LV~~G~sLlVFcaNGVWAI~G~d~~g~TATdy~ItKI 461 (911) T protein:vir:31 386 LDNIPKCFQDADPTAEEIN----DLIATDGFTMYPVGMGAPITMVEFNKRLLLLCTNGVWAIRGTSGGGATATDFTLDKV 461 (911) T ss_pred cccccccccCCCccccccc----hhhhcCCcEEecCCCCCceEEEEecCeEEEEEeCcEEEEeccCCCceeeeeeEEEEE Confidence 678999999887766643 55555554 88888999999999999999995 3334567999999 Q ss_pred cCccccccCceeEEECCeEEEEeCCCeEEEC---Ccccc-cCCchhHHHHHHhhcCcchhCCEEEEEecCCCEEEEEEcc Q lcl|NC_010325. 242 FNDVGILGPNCAVEFDGNHFVVGHGDVYVHN---GVQKQ-SVIDAQVRKFFFSDINPDNYQRTFVLADHVNTEMWVCYSS 317 (513) Q Consensus 242 ~~~~G~~~~~siv~~~~~~ffls~~G~y~~~---G~~~~-~Ig~~~V~~~~~~~i~~~~~~~~~~~~d~~~~~v~~~~~s 317 (513) ++ +||.+|+|||.+|+.+||||+.|||.+. -+.+. .+.+++--..++..|+.+..-...+.+|..+++|||+||. T Consensus 462 sd-vGcsspNSVVvVgn~i~fWSd~GIyaLganqfnD~tAnNLTesTIQ~y~d~I~~dkIkNVtgtyd~de~rVyW~yPn 540 (911) T protein:vir:31 462 AS-VEFNSPQSVVDIGTAIVFWSERGIIAIGVNDFGDLTSNNLTENTIDEYYDSLDRDIIKNVKGTFINDENRVYWVVPN 540 (911) T ss_pred ee-eeeCCCCeEEEecCceEEeeCCcEEEEeecccCccccccccHHHHHHHHhhcChhhhceEEEEEEccCCEEEEEecC Confidence 97 7999999999999999999999999653 23322 3445444445789999999999999999999999999996 Q ss_pred -CCCCCCcccc--eEEEEecccCeEEEEec---cceeeeeecc--cccccceeecc-cC--ccc-Cccceecccc-cccc Q lcl|NC_010325. 318 -TRSKPGKHCD--RAIIWNWKENTWSIRDL---PNVLSGAYGI--IDPKVSNLWDD-DP--NPW-DTDTSVWGEG-SYNP 384 (513) Q Consensus 318 -~~~~~~~~~d--~~lvyd~~~~~Ws~~d~---~~~~~~~~g~--~~~~~~~~~~~-~~--~~~-d~d~~~~~~d-s~~~ 384 (513) .++.--+..+ ++||+|.++++|.--.+ |.+-.-++++ +..++...... .. +.- ..+...-++. .+.. T Consensus 541 ~lDe~teykt~~~~ILVfdLatgaFYPwtvs~gpLl~~p~y~Lv~TreEvtvPi~~etgaiIve~gsdPV~~tl~vdttG 620 (911) T protein:vir:31 541 KQDSNGEYKTDGELVLVLNLDTGGFYKHTVSGGPLLHAPFRRLVNTRAEVSIPITETDGTVITDTLGDPVTVTRTVTTTG 620 (911) T ss_pred ccCCccceeecCceEEEEEeccCcccceeeecceeecccccccccccccceeeEEeecceEEEecCCCCeEEEEeeeccc Confidence 3332222333 79999999999862221 1111111221 11222210000 00 000 0111111111 2222 Q ss_pred CccceEEEEeecCceeeec------------------------------------ccceeecCccEEEEe------eccc Q lcl|NC_010325. 385 AKSSMIFSSFQDKKLFLFG------------------------------------NNSTFSGQNFVSTLE------RSDI 422 (513) Q Consensus 385 ~~~~~~~~~~~~~~~~~~~------------------------------------~~~~~~g~~l~a~~~------~~~~ 422 (513) ..+...+..|+|+....|. .-..|+..|+..+.- ..+. T Consensus 621 vDg~ayLl~frdg~~g~~~f~a~~~~~~~~dw~~~~~~~~~~y~s~~~~~y~~~~~~~~~~~~pyi~sy~~~~~rv~~~~ 700 (911) T protein:vir:31 621 VDGLAYFASFDDGVNGQFNFIAEHQPWGFADWANVPNMTRVNYSSYVDFAYEYPEVMIGNISLPYIHSYYLTGIRVQTEQ 700 (911) T ss_pred ccceeEEEeeccCCcceEEEEEeecCCeeeccccCccccccchhHHHHhhhhhhhhhhhcccCceeeeeeeeeeEEeccc Confidence 2333445556555333221 111222223222221 1111 Q ss_pred ccCCCcceEEEeeeeeccCCCeeEEEEeeeeecCCCCceEcCceeeecCCceEEEeecCCCeEEEEEEccCCCcEEEEEE Q lcl|NC_010325. 423 YLGDDRMMKTVSAIIPHITGNGTCNIWVGNAQVQGSGIRWKGPYPYRIGQDYKIDTKHVGRYIALKFDFSSEGDWYFNGY 502 (513) Q Consensus 423 ~~~~~~~~~~i~~~~~~~t~~~~~~~~~g~~~~~~~~~~w~~~~~~~~~~~~~~~~R~~~Ry~~~rl~~~~g~~w~~~G~ 502 (513) ....++....-+-..-+.+.-+++++--=.++.+ .+.+...- ||-++ -|-.++.|..+++.-=+++|- T Consensus 701 y~~~~a~~~f~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~---~~~~~---~~~~~~~vVNGDAE~GtmTGW 768 (911) T protein:vir:31 701 YTTETAHLSFHRVQAHQTTALGTVTFHKVDMMVS------TGMQVISF---HKDDL---LRTEAVTLVNPDAETGDATGW 768 (911) T ss_pred eeeecccceeEeeecccceeeeeeeeeeeeehhh------ccceeeee---ccccc---eeeeeeEEEcCCCCCCCCCcc Confidence 1112222222222222233333333210001100 01111100 11000 134455555555443333444 Q ss_pred ee-----EEeccc-cCC Q lcl|NC_010325. 503 TI-----EMAPKA-GMR 513 (513) Q Consensus 503 ~~-----~~~~~g-~rr 513 (513) ++ ....++ -+| T Consensus 769 tvtaG~~d~~Ta~p~~r 785 (911) T protein:vir:31 769 TVTAGTLDVRTAAPLYQ 785 (911) T ss_pred eeeccchhhccCCchhc Confidence 43 221111 222 No 6 >protein:vir:108312 Length: 458 # NCBI annotation: hypothetical protein # Family: family:all:1540 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552278;genbank:gi:160700603;genbank:GeneID:5758828 Probab=99.91 E-value=3.9e-23 Score=143.42 Aligned_cols=428 Identities=13% Similarity=0.153 Sum_probs=268.0 Q ss_pred CcccchhhcCccccccccCcccCCCCcEEEeEEEEEeC-------CeeEECCCcceeeecCCCcceeeeeeeeCCceEEE Q lcl|NC_010325. 1 MALERQEVKNPTGIVTDIAPADLPLEKWSFGNNVRFKN-------GKAQKTLGHTPIFDTAQAPILDMFPFIRNNIPYWL 73 (513) Q Consensus 1 m~~~~~~~~~~~G~~~~~~P~~lp~~a~~~~~Nv~~~~-------g~~~~~~g~~~~~~~~~~~~~~~~~~~~~g~~~~~ 73 (513) |..+.+-. |- +.-+++.-++ ..+.|+.++. +.+.+.||.+.+....+.++.+++. . .-.++ T Consensus 1 m~~~~ip~----gs---y~a~~~~~da-q~~VN~yp~~~e~g~ss~~l~~tPGl~~f~~~~~~~~~g~~~---~-~g~ly 68 (458) T protein:vir:10 1 MVQRQIPL----VA---TTAEGDVSGQ-EILVNVYPRKSDGGKYPFTLRHTPGLAFFCELPTFPVMAMHQ---N-GSRAF 68 (458) T ss_pred Cceeeece----ee---eecccccccc-eeeeeeeeecccccccccceEecCCceeeecCCCCceeeEEe---c-CCEEE Confidence 66665553 22 2223444443 5788999983 5578888888886555666666663 2 34567 Q ss_pred EEcCceEEEecCce-EEeccccceeeCCCCceeEEeeCCEEEEEeCCCceEEEcCCCceecccCCCcccceeeEEEEEcC Q lcl|NC_010325. 74 LCSEQRLYLADGTT-IIDVSPGPYSASITNRWSVGSFNGVIFANDGVNPPHHLPPSESTFRVLPNFPANTTFKRLKSFKN 152 (513) Q Consensus 74 v~~~~kly~~~~~t-~~dis~~~~~~~~~~~w~f~~~~~~~ia~ng~d~~q~~~~~s~~f~~L~g~p~~~ka~~v~~~~~ 152 (513) +++...||+-.+.. .+.| |...+ +.+-+++--++.+++++|.. =.+||...+.++... ++...-++.|....+ T Consensus 69 ~v~g~~LY~V~~~~~~~~i--G~i~g--sg~VsMa~ng~q~vi~~G~~-gY~yd~at~~~~~i~-d~~~~~~~~v~~~dG 142 (458) T protein:vir:10 69 AVTPRDMYEISKDGTYKRL--GSVDF--KGRVVMEDNGKQIVMVDGEK-GYYYDSETEIVQEIK-AEGFYPASTVTYQDG 142 (458) T ss_pred EeeCceEEEEeCCceEEEE--ecccC--ceeEEEeeCCcEEEEEECCe-EEEEeecccEEEecc-CccccCcceEEEeCc Confidence 88889999988773 2332 33333 36788988899999999973 455787666565333 222234888999999 Q ss_pred EEEEEECCcCcccCCceEEEeccCCcccccccccccccccCcceecccCCCCceeEEEecCcceEEEecCcE--EEEEec Q lcl|NC_010325. 153 FLVGLNATSNSVEMPQMVWWSTSADAGGVPASWDPTDPTKDAGQNTLADTNGAIVDGVKLRDSFIIYKEDSV--YSMRYI 230 (513) Q Consensus 153 ~l~~~g~t~~~~~~p~rv~wS~~~d~~~~P~~Wd~t~~t~~a~~~dl~d~~G~iv~g~~l~~~~vIf~en~i--~~m~y~ 230 (513) |++. ..++ -.++..|++.|. ++|+. +|-.-...+.+||+.....+.+++|-+++| |..+ + T Consensus 143 y~V~-~~~g-----~~~~~is~L~d~-----s~d~l------~fa~Ae~~pD~iv~i~~~~~~i~~fG~~TiEvw~nt-G 204 (458) T protein:vir:10 143 YFIF-DRKG-----TGQFFISELLDV-----AFDPL------DFATAEGQPDPLLAVLSDHREVFMFGQETIEVWYNS-G 204 (458) T ss_pred EEEE-EeeC-----CCEEEEEecCcc-----eeCcc------eeeeecCCCCceEEEEeeccEEEEEeccceEEEEec-C Confidence 9853 2222 246888998883 45432 122233445779999999999999999998 7763 3 Q ss_pred CCCceeEeEEec-CccccccCceeEEECCeEEEEeCCCe-EEECCcccccCCchhHHHHHHhhcCcchhCCEEEEEe--c Q lcl|NC_010325. 231 GGLFIFQFQQLF-NDVGILGPNCAVEFDGNHFVVGHGDV-YVHNGVQKQSVIDAQVRKFFFSDINPDNYQRTFVLAD--H 306 (513) Q Consensus 231 g~~~~f~~~~i~-~~~G~~~~~siv~~~~~~ffls~~G~-y~~~G~~~~~Ig~~~V~~~~~~~i~~~~~~~~~~~~d--~ 306 (513) +.++.|+..+-. =..||.++.|++.+++.+|||++++. |+++|.+++.|.+..|++.+ ..++ ....+++.. + T Consensus 205 ~a~fpy~r~~ga~i~~Gcaa~~sv~~~~~t~~~l~~d~~Vy~l~g~~~~rIST~aIE~~i-~sy~---~~da~a~t~~~e 280 (458) T protein:vir:10 205 AADFPFERNQGAFIEKGIGAPYSVAKTNNTVYFIGSDLMIYQITGYTPVRISTHAVEQTL-KGVN---LSDAFAYTYQSE 280 (458) T ss_pred CCCcceeecccceeeecccCcchhhhhCceEEEEcCCeEEEEecCceeEEeeCHHHHHHH-hcCC---hhheEEEEEEec Confidence 445445555533 38899999999999999999998864 47899999999999999976 5553 333444433 4 Q ss_pred CCCEEEEEEccCCCCCCcccceEEEEecccCeEEEEeccceeeeeecccccccceeecccCcccCccceeccccccccCc Q lcl|NC_010325. 307 VNTEMWVCYSSTRSKPGKHCDRAIIWNWKENTWSIRDLPNVLSGAYGIIDPKVSNLWDDDPNPWDTDTSVWGEGSYNPAK 386 (513) Q Consensus 307 ~~~~v~~~~~s~~~~~~~~~d~~lvyd~~~~~Ws~~d~~~~~~~~~g~~~~~~~~~~~~~~~~~d~d~~~~~~ds~~~~~ 386 (513) -...|++.||+. ++..+||-.++.|+.+.- +..+ .|-.....+.. T Consensus 281 GH~fy~LtfP~a--------~~Tw~yD~~t~~Wher~S-------------------------g~~~--~~Ra~~~v~~~ 325 (458) T protein:vir:10 281 GHLFYVLTIPGK--------NLTWCYDISSGSWHVRQS-------------------------YQFD--RHVSNNSIYFD 325 (458) T ss_pred CeEEEEEECCCC--------CceeEEecccccceeecc-------------------------CCCC--ceEEEEEEEeC Confidence 444466888764 578899999999998731 1111 24444445556 Q ss_pred cceEEEEeecCceeeeccc-ceeecCccEEEEeecccccCCCcceEEEeeeee-ccCCCe-------eEEEEeeeeecCC Q lcl|NC_010325. 387 SSMIFSSFQDKKLFLFGNN-STFSGQNFVSTLERSDIYLGDDRMMKTVSAIIP-HITGNG-------TCNIWVGNAQVQG 457 (513) Q Consensus 387 ~~~~~~~~~~~~~~~~~~~-~~~~g~~l~a~~~~~~~~~~~~~~~~~i~~~~~-~~t~~~-------~~~~~~g~~~~~~ 457 (513) +..+.|.+.++..+.++.+ -+-.|.+++-.+.+..++ +...++ ..+.+-. ..++-+ ...+.+. -+-+ T Consensus 326 g~~~vGD~~ng~ly~ld~~~~td~g~~i~~~~~~p~~~-~~~~rl-~~~~~el~~~tGvg~~~~~~~~p~~~l~--~S~d 401 (458) T protein:vir:10 326 QKTLVGDFQNGRIYIMADNYYTDDGDPVVREFILPVVN-NGREFL-TVDSLELDLSSGVGLTVGQGSDPELRVY--FSKD 401 (458) T ss_pred CeEEEEEcCCCeEEEEcccCcCCCCceeeeeeecccee-CCCCeE-EEEEEEEEEecceeeeeCCCCCceEEEE--EeeC Confidence 6777788888866666544 355688888888776653 222332 2222222 112211 1111121 1233 Q ss_pred CCceEcCceee-ecCCceEEEeecCC------CeEEEEEEccCCCcEEEEEEeeEEe Q lcl|NC_010325. 458 SGIRWKGPYPY-RIGQDYKIDTKHVG------RYIALKFDFSSEGDWYFNGYTIEMA 507 (513) Q Consensus 458 ~~~~w~~~~~~-~~~~~~~~~~R~~~------Ry~~~rl~~~~g~~w~~~G~~~~~~ 507 (513) ...+|+..+.. ..|.-+....|... |=--|||++.+=.+=.+.|.-++.. T Consensus 402 ~g~~~s~~~~~~~lg~~gey~tr~~~~rlG~ar~rvf~v~~s~p~~~~l~ga~~~~r 458 (458) T protein:vir:10 402 NGNEYSQNFKVGKIGRKGEFLTRAKVNRFGCARQFTFKVEISDPIPVDIGGAWVEVR 458 (458) T ss_pred CCcccchhHHHhhcCCcchhhhhhhhhhhccCcceEEEEEEecchhhcceeeeEEeC Confidence 45677765544 33443333333221 2223666665556667888888866 No 7 >protein:vir:100960 Length: 472 # NCBI annotation: gp10 # Family: family:all:1540 # MgeID: mge:1509 # MgeName: ST104 # Cross-refs: genbank:acc:YP_006412;genbank:gi:46358704;genbank:GeneID:2777110 Probab=99.89 E-value=4.7e-22 Score=137.53 Aligned_cols=438 Identities=12% Similarity=0.126 Sum_probs=256.6 Q ss_pred CcccchhhcCccccccccCcccC----CCCcEEEeEEEEEeCCeeEECCCcceeeecCCCcceeeeeeeeCCceEEEEEc Q lcl|NC_010325. 1 MALERQEVKNPTGIVTDIAPADL----PLEKWSFGNNVRFKNGKAQKTLGHTPIFDTAQAPILDMFPFIRNNIPYWLLCS 76 (513) Q Consensus 1 m~~~~~~~~~~~G~~~~~~P~~l----p~~a~~~~~Nv~~~~g~~~~~~g~~~~~~~~~~~~~~~~~~~~~g~~~~~v~~ 76 (513) |.++.+-. +.|...+.++.|. |.|-.-...++.-..++++..||.+.. +.+++++.|+.--..++. ++++. T Consensus 1 m~~~~ipl--~~g~~~~~~~a~~~~~~pvn~y~~~~~~~~ss~~Lr~~pG~~~~-a~~~G~~RG~~~~~~~~~--ly~V~ 75 (472) T protein:vir:10 1 MPIQQLPM--MKGMGKDFKNADYIDYLPINMLATPKEVLNSSGYLRSFPGIAKR-NDVNGVSRGVEYNTAQNA--VYRVC 75 (472) T ss_pred Cceeeccc--ccccccCCCcCcceeeeeeccccccccccccccceeecccceee-cCCCCcccceeeeeeCCe--EEEEe Confidence 88877666 4677776655544 433222233333335778889998887 457888888743223333 56677 Q ss_pred CceEEEecCceEEeccccceeeCCCCceeEEeeCCEEEEEeCCCceE-EEcCCCceecccCCCcccc-----eeeEEEEE Q lcl|NC_010325. 77 EQRLYLADGTTIIDVSPGPYSASITNRWSVGSFNGVIFANDGVNPPH-HLPPSESTFRVLPNFPANT-----TFKRLKSF 150 (513) Q Consensus 77 ~~kly~~~~~t~~dis~~~~~~~~~~~w~f~~~~~~~ia~ng~d~~q-~~~~~s~~f~~L~g~p~~~-----ka~~v~~~ 150 (513) .++||+-+.. .=+|+ .+.+-+++-=+..+.++-+.+..- ++++...++...+-+.... .++.|+.. T Consensus 76 G~~Ly~v~~~-iG~i~-------gsgrVsMa~n~~~~~v~~~~~~~~Y~~~~~~~t~~~~~~d~~f~~~dl~~~~dv~f~ 147 (472) T protein:vir:10 76 GGKLYKGEAV-VGDVA-------GSGRVSMAHGRTSQAVGVNGQLIEYRYDGAVKTVSNWPADSGFTQYELGSVRDITRL 147 (472) T ss_pred CcceEEEEee-Eeecc-------CcccEEEeeCCeEEEEEECCceeEEEEecchhhhhcccCccccccccccceeEEEEe Confidence 7889986542 11121 224556654444344444555543 4677666655555332222 23356677 Q ss_pred cCEEEEEECCcCcccCCceEEEeccCCcccccccccccccccCcceecccCCCCceeEEEecCcceEEEecCcE--EEEE Q lcl|NC_010325. 151 KNFLVGLNATSNSVEMPQMVWWSTSADAGGVPASWDPTDPTKDAGQNTLADTNGAIVDGVKLRDSFIIYKEDSV--YSMR 228 (513) Q Consensus 151 ~~~l~~~g~t~~~~~~p~rv~wS~~~d~~~~P~~Wd~t~~t~~a~~~dl~d~~G~iv~g~~l~~~~vIf~en~i--~~m~ 228 (513) .+|.+ +... + .++..-|.+.|.. .++.... |---...+.+||+.....+.+++|-+++| |..+ T Consensus 148 dGyfV-~~~~-g----t~~~~iS~l~d~~----~~~~y~~-----fa~AE~~pD~Ivgi~~~~~~l~lfG~~TiEvw~nt 212 (472) T protein:vir:10 148 RGRYA-WSKD-G----TDSWFITDLEDES----HPDRYSA-----EYRAESQPDGIIGIGSWRDFIVCFGSSTIEYFSLT 212 (472) T ss_pred cceEE-EccC-C----CceEEEeccCCcc----ccccccc-----cccccCCCCceEEEEeeccEEEEEeccceEEEEec Confidence 77743 3322 1 2445555666642 2322110 11111334679999999999999999998 7663 Q ss_pred ecCCC----ceeEeEE-ecCccccccCceeEEECCeEEEEeCCC-----eEEECCcccccCCchhHHHHHHhhcCcchhC Q lcl|NC_010325. 229 YIGGL----FIFQFQQ-LFNDVGILGPNCAVEFDGNHFVVGHGD-----VYVHNGVQKQSVIDAQVRKFFFSDINPDNYQ 298 (513) Q Consensus 229 y~g~~----~~f~~~~-i~~~~G~~~~~siv~~~~~~ffls~~G-----~y~~~G~~~~~Ig~~~V~~~~~~~i~~~~~~ 298 (513) |+. ..|+.++ ..=+.||.++.|++.+++.+|||++++ +|+++|.+++.|.+..|++.+ .......+. T Consensus 213 --G~ad~~~fpy~r~~g~~i~~Gcaa~~sv~~~~~s~~~l~~~~~g~~~V~~~~g~qa~rIST~aIE~~i-~~y~~~e~~ 289 (472) T protein:vir:10 213 --GATTVGAALYVAQPSLMVQKGIAGTYCKTPFADSYAFISHPATGAPSVYIIGSGQASPIATASIEKII-RSYTAEELA 289 (472) T ss_pred --CCCCcCcCceEEcCcceeeecccCcchhhecCceEEEEecCCCcccEEEEccCceeEEecCHHHHHHH-HhcCCcccc Confidence 554 2344443 235789999999999999999999998 889999999999999999976 666555555 Q ss_pred CEEEEEec-CCC-EEEEEEccCCCCCCcccceEEEEecccCeEEEEeccceeeeeecccccccceeecccCcccCcccee Q lcl|NC_010325. 299 RTFVLADH-VNT-EMWVCYSSTRSKPGKHCDRAIIWNWKENTWSIRDLPNVLSGAYGIIDPKVSNLWDDDPNPWDTDTSV 376 (513) Q Consensus 299 ~~~~~~d~-~~~-~v~~~~~s~~~~~~~~~d~~lvyd~~~~~Ws~~d~~~~~~~~~g~~~~~~~~~~~~~~~~~d~d~~~ 376 (513) ...++... .++ .|++.|| ++.++||..++.|..+-. ....+ .--.. T Consensus 290 ~A~~~t~~~~GH~fy~LtfP----------~~Tw~yD~at~~w~erw~-------~~~~g---------------~~~~~ 337 (472) T protein:vir:10 290 TGVMETLRFDSHELLIIHLP----------RHVLVYDASSSQNGPQWC-------VLKTG---------------LYDDV 337 (472) T ss_pred ceEEEEEEeCCeEEEEEEcC----------CeeEEEEcccCcccceee-------eecCC---------------Ccccc Confidence 54444433 333 3556665 578999999997754310 00000 00112 Q ss_pred ccccccccCccceEEEEeecCceeeeccc-ceeecCccEEEEeecccccCCCcceEEEeeeeeccCC--CeeEEEEeeee Q lcl|NC_010325. 377 WGEGSYNPAKSSMIFSSFQDKKLFLFGNN-STFSGQNFVSTLERSDIYLGDDRMMKTVSAIIPHITG--NGTCNIWVGNA 453 (513) Q Consensus 377 ~~~ds~~~~~~~~~~~~~~~~~~~~~~~~-~~~~g~~l~a~~~~~~~~~~~~~~~~~i~~~~~~~t~--~~~~~~~~g~~ 453 (513) |......+..+..+.|.+.++.++.++.+ .+-.|.+++..+.++.+..+ .+|..... +...++ +..-.+.+.-+ T Consensus 338 ~R~~~~~~~~g~~ivGD~~nG~ly~ld~~~~t~~g~~~~~~~~~p~l~~d-n~R~~d~e--ve~~~Gv~~~~d~v~L~wS 414 (472) T protein:vir:10 338 YRAVDFMYEGNQITCGDKSEALTGQLQFDISSQYGLQQEHLLFTPLFKAD-NARCFDLE--VESSTGVAQYADRLFLSAT 414 (472) T ss_pred eeEEEEEeeCCeEEEEEcCCCeEEEEecccCCCCCCcccceEEcccccCC-CCEEEEEe--eeccCCCCCcCcEEEEEee Confidence 44444555566677787777777666544 45568888888877766643 34443211 222222 11124444333 Q ss_pred ecCCCCceEcCceeeecCCceEEEeecCCC-------eEEEEEEccCCCcEEEEEEeeEEe Q lcl|NC_010325. 454 QVQGSGIRWKGPYPYRIGQDYKIDTKHVGR-------YIALKFDFSSEGDWYFNGYTIEMA 507 (513) Q Consensus 454 ~~~~~~~~w~~~~~~~~~~~~~~~~R~~~R-------y~~~rl~~~~g~~w~~~G~~~~~~ 507 (513) + +-.+|+.++...-|.-+..+.|+..| =..|||+...-.+-.+.|..+..- T Consensus 415 d---dG~~~~~~~~~~~g~~g~~~tr~~~~RlG~~r~~v~f~~r~~~~~~~~l~g~~~~~E 472 (472) T protein:vir:10 415 T---DGINYGREQMIEQNEPFVYDKRVIWKRVGRIRRLIGFKLRVITKSPVTLSGCQIRLE 472 (472) T ss_pred c---cccccccceeeccCCccchhcceeeeeeeecccceeEEEEEEecCcceeeeeEEeeC Confidence 3 35588888766655555444443321 123666655546667777766544 No 8 >protein:vir:9268 Length: 472 # NCBI annotation: 10 # Family: family:all:1540 # MgeID: mge:164 # MgeName: ST64T # Cross-refs: genbank:acc:NP_720332;genbank:gi:24371590;genbank:GeneID:955815 Probab=99.89 E-value=5e-22 Score=137.35 Aligned_cols=434 Identities=13% Similarity=0.162 Sum_probs=254.8 Q ss_pred CcccchhhcCccccccccCcccC----CCCcEEEeEEEEEeCCeeEECCCcceeeecCCCcceeeeeeeeCCceEEEEEc Q lcl|NC_010325. 1 MALERQEVKNPTGIVTDIAPADL----PLEKWSFGNNVRFKNGKAQKTLGHTPIFDTAQAPILDMFPFIRNNIPYWLLCS 76 (513) Q Consensus 1 m~~~~~~~~~~~G~~~~~~P~~l----p~~a~~~~~Nv~~~~g~~~~~~g~~~~~~~~~~~~~~~~~~~~~g~~~~~v~~ 76 (513) |.++.+-. +.|...+..+.|. |.|-.-...++.-..++++..||.+.. +.+++++.|++--..++. ++++. T Consensus 1 m~~~~ipl--~~g~~~~~~~a~~~~~~pvn~y~~~~~~~~ss~~Lr~~pG~~~~-a~~~G~~RG~~~~~~~~~--ly~V~ 75 (472) T protein:vir:92 1 MPIQQLPM--MKGMGKDFKNADYIDYLPINMLATPKEVLDSSGYLRSFPGIAKR-NDVNGVSRGVEYNTAQNA--VYRVC 75 (472) T ss_pred Cceeeccc--cccccccCccCcceeeeecccccccccccccccceeecccceee-cCCCCcccceeeeeeCCe--EEEEe Confidence 88877666 4677776555544 433222233333335778889998887 457888888743223333 56677 Q ss_pred CceEEEecCceEEeccccceeeCCCCceeEEeeCCEEEEEeCCCceE-EEcCCCceecccCCCcccc-----eeeEEEEE Q lcl|NC_010325. 77 EQRLYLADGTTIIDVSPGPYSASITNRWSVGSFNGVIFANDGVNPPH-HLPPSESTFRVLPNFPANT-----TFKRLKSF 150 (513) Q Consensus 77 ~~kly~~~~~t~~dis~~~~~~~~~~~w~f~~~~~~~ia~ng~d~~q-~~~~~s~~f~~L~g~p~~~-----ka~~v~~~ 150 (513) .++||+-+.. . |... .+.+-+++-=+..+.++.+.+..- ++++...++...+-+.... .++.|+.. T Consensus 76 G~~Ly~v~~~-i-----G~i~--gsgrVsMa~n~~~~av~~~~~~~~Y~~~~~~~t~~~~~~d~~f~~~dl~~~~dv~f~ 147 (472) T protein:vir:92 76 GGKLYKGEAV-V-----GDVA--GSGRVSMAHGRTSQAVGVNGQLIEYRYDGAVKTVSNWPADSGFTQYELGSVRDITRL 147 (472) T ss_pred CcceEEEEee-E-----eecc--CcccEEEecCCeEEEEEECCceeEEEEecchhhhhcccCccccccccccceeEEEEe Confidence 7889986542 1 1111 225566755444455556665554 4677666655555332222 23356677 Q ss_pred cCEEEEEECCcCcccCCceEEEeccCCcccccccccccccccCcceecccCCCCceeEEEecCcceEEEecCcE--EEEE Q lcl|NC_010325. 151 KNFLVGLNATSNSVEMPQMVWWSTSADAGGVPASWDPTDPTKDAGQNTLADTNGAIVDGVKLRDSFIIYKEDSV--YSMR 228 (513) Q Consensus 151 ~~~l~~~g~t~~~~~~p~rv~wS~~~d~~~~P~~Wd~t~~t~~a~~~dl~d~~G~iv~g~~l~~~~vIf~en~i--~~m~ 228 (513) .+|.+ +... + .++..-|.+.|.. .++.... |---...+.+||+.....+.+++|-+++| |..+ T Consensus 148 dGyfV-~~~~-g----t~~~~iS~l~d~~----~~~~y~~-----fa~AE~~pD~Ivgi~~~~~~l~lfG~~TiEvw~nt 212 (472) T protein:vir:92 148 RGRYA-WSKD-G----TDSWFITDLEDES----HPDRYSA-----EYRAESQPDGIIGIGSWRDFIVCFGSSTIEYFSLT 212 (472) T ss_pred cceEE-EccC-C----CceEEEeccCCcc----ccccccc-----cccccCCCCceEEEEeeccEEEEEeccceEEEEec Confidence 77743 3322 1 2445555666642 2322110 11111334679999999999999999998 7663 Q ss_pred ecCCC----ceeEeEE-ecCccccccCceeEEECCeEEEEeCCC-----eEEECCcccccCCchhHHHHHHhhcCcchhC Q lcl|NC_010325. 229 YIGGL----FIFQFQQ-LFNDVGILGPNCAVEFDGNHFVVGHGD-----VYVHNGVQKQSVIDAQVRKFFFSDINPDNYQ 298 (513) Q Consensus 229 y~g~~----~~f~~~~-i~~~~G~~~~~siv~~~~~~ffls~~G-----~y~~~G~~~~~Ig~~~V~~~~~~~i~~~~~~ 298 (513) |+. ..|+.++ ..=+.||.++.|++.+++.+|||++++ +|+++|.+++.|.+..|++.+ ..+....+. T Consensus 213 --G~ad~~~fpy~r~~g~~i~~Gcaa~~sv~~~~~s~~~l~~~~~g~~~V~~~~g~qa~rIST~aIE~~i-~~y~~~e~~ 289 (472) T protein:vir:92 213 --GATTVGAALYVAQPSLMVQKGIAGTYCKTPFADSYAFISHPATGAPSVYIIGSGQASPIATASIEKII-RSYTADELA 289 (472) T ss_pred --CCCCcCcCceEEcCcceeeecccCcchhhecCceEEEEecCCCcccEEEEccCceeEEecCHHHHHHH-HhcCcchhc Confidence 554 2444443 235789999999999999999999998 889999999999999999854 677766666 Q ss_pred CEEEEEecC-CCE-EEEEEccCCCCCCcccceEEEEecccCe----EEEEeccceeeeeecccccccceeecccCcccCc Q lcl|NC_010325. 299 RTFVLADHV-NTE-MWVCYSSTRSKPGKHCDRAIIWNWKENT----WSIRDLPNVLSGAYGIIDPKVSNLWDDDPNPWDT 372 (513) Q Consensus 299 ~~~~~~d~~-~~~-v~~~~~s~~~~~~~~~d~~lvyd~~~~~----Ws~~d~~~~~~~~~g~~~~~~~~~~~~~~~~~d~ 372 (513) ...++.... +++ |++.|| ++.++||..++. |+.++--. T Consensus 290 ~a~~~s~~~eGH~fy~LtfP----------~~Tw~yD~at~~~~e~W~~~~sg~-------------------------- 333 (472) T protein:vir:92 290 TGVMEALRFDSHELLIIHLP----------RHVLVYDASSSQNGPQWCVLKTGL-------------------------- 333 (472) T ss_pred eeeEEEEEecCeeEEEEEcC----------CceEEEEcccCcCCceeeeecCCC-------------------------- Confidence 666666543 333 456665 579999999885 65543210 Q ss_pred cceeccccccccCccceEEEEeecCceeeec-ccceeecCccEEEEeecccccCCCcceEEEeeeeeccCC--CeeEEEE Q lcl|NC_010325. 373 DTSVWGEGSYNPAKSSMIFSSFQDKKLFLFG-NNSTFSGQNFVSTLERSDIYLGDDRMMKTVSAIIPHITG--NGTCNIW 449 (513) Q Consensus 373 d~~~~~~ds~~~~~~~~~~~~~~~~~~~~~~-~~~~~~g~~l~a~~~~~~~~~~~~~~~~~i~~~~~~~t~--~~~~~~~ 449 (513) --..|......+..+..+.|.+.++.++.++ +.-++.|.+.+..+..+.+. ++.+|..... +...++ +..-.+. T Consensus 334 ~~~~~R~~~~~~~~g~~ivGD~~nG~ly~l~~~~~t~~~~~~~~~~~~P~~~-~dn~R~~d~e--ve~~~Gv~q~~d~v~ 410 (472) T protein:vir:92 334 YDDVYRAIDFMYEGNQITCGDKSEAVTGQLQFDISSQYDKQQEHLLFTPIFK-ADNARCFDLE--VESSTGVAQYADRLF 410 (472) T ss_pred cccceeEEEEEeeCCeEEEEEcCCCeEEEEeccccccCCCcceEEEEeceEe-cCCCEEEEEe--eeccCCCCCcCceEE Confidence 0012333444445555666777666665552 33455677777766544444 3444443211 221222 1112344 Q ss_pred eeeeecCCCCceEcCceeeecCCceEEEeecCCC-------eEEEEEEccCCCcEEEEEEeeEEe Q lcl|NC_010325. 450 VGNAQVQGSGIRWKGPYPYRIGQDYKIDTKHVGR-------YIALKFDFSSEGDWYFNGYTIEMA 507 (513) Q Consensus 450 ~g~~~~~~~~~~w~~~~~~~~~~~~~~~~R~~~R-------y~~~rl~~~~g~~w~~~G~~~~~~ 507 (513) +.-++ +-.+|+.++...-|.-+..+.|+..| =..|||+...-.+-.+.|..+..- T Consensus 411 L~wSd---dG~~~~~~~~~~~g~~g~~~tr~~~~RlG~~r~~v~f~~r~~~~~~~~l~g~~~~~E 472 (472) T protein:vir:92 411 LSATT---DGINYGREQMIEQNEPFVYDKRVLWKRVGRIRRLIGFKLRVITKSPVTLSGCQIRLE 472 (472) T ss_pred EEeec---cccccccceeeccCCccchhcceeeeeeeecccceeEEEEEEecCcceeeeeEEeeC Confidence 43333 35578888766655545444443221 123566655446667777766544 No 9 >protein:vir:105428 Length: 472 # NCBI annotation: gene 8 protein # Family: family:all:1540 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958184;genbank:gi:41057286;genbank:GeneID:2716675 Probab=99.88 E-value=1.9e-21 Score=134.16 Aligned_cols=433 Identities=11% Similarity=0.103 Sum_probs=245.8 Q ss_pred CcccchhhcCccccccccCcccCCCCcEEEeEEEEEe-------CCeeEECCCcceeeecCCCcceeeeeeeeCCceEEE Q lcl|NC_010325. 1 MALERQEVKNPTGIVTDIAPADLPLEKWSFGNNVRFK-------NGKAQKTLGHTPIFDTAQAPILDMFPFIRNNIPYWL 73 (513) Q Consensus 1 m~~~~~~~~~~~G~~~~~~P~~lp~~a~~~~~Nv~~~-------~g~~~~~~g~~~~~~~~~~~~~~~~~~~~~g~~~~~ 73 (513) |.++++-+ ..|...+..-.|-- ++ . -.||++. .+.+.+.||.... +.+++++.|+.-...++. +. T Consensus 1 m~~~~~pl--~~G~~~~~~~~d~~-~~-~-pVN~~a~~~~~~~s~~~l~~tPGl~~~-a~v~G~~RG~~~~~~~g~--lY 72 (472) T protein:vir:10 1 MPIQQLPL--MKGVGKDFRNADYI-DY-L-PVNMLATPKEILNSSGYLRSFPGIAKR-SDVNGVSRGVEYNMAQNA--VY 72 (472) T ss_pred CCeeeeee--ccCceeeccccchh-he-e-eeeeeeeccCCCcccceeecCCCceee-ccCCccccceEEEeeCCe--EE Confidence 77766555 34544432211111 11 1 1566665 3668888998776 456788877532234444 44 Q ss_pred EEcCceEEEecCceEEeccccceeeCCCCceeEEeeCCEEEEEeCCC-ceEEEcCCCceecccCCCccc-----ceeeEE Q lcl|NC_010325. 74 LCSEQRLYLADGTTIIDVSPGPYSASITNRWSVGSFNGVIFANDGVN-PPHHLPPSESTFRVLPNFPAN-----TTFKRL 147 (513) Q Consensus 74 v~~~~kly~~~~~t~~dis~~~~~~~~~~~w~f~~~~~~~ia~ng~d-~~q~~~~~s~~f~~L~g~p~~-----~ka~~v 147 (513) ++..++||+.+. .|-+|.++ .+-+++-=+..++++.+.+ ...++++...+...++..... .-++.| T Consensus 73 ~V~G~~LY~v~~-~iGsiag~-------grVsMa~n~~~~av~~~g~~~~Y~yd~~v~t~~~~~~d~~~p~~dlg~~~dv 144 (472) T protein:vir:10 73 RVCGGKLYKGES-EVGDVAGS-------GRVSMAHGRTSQAVGVNGQLVEYRYDGTVKTVSNWPTDSGFTQYELGSVRDI 144 (472) T ss_pred EEecceEeeeec-ceecccCc-------ccEEEecCCcEEEEEECCceeEEEeeccchhhhccccccccccccccceeee Confidence 446679999774 35444422 4566655444554444444 445577665554544432221 235567 Q ss_pred EEEcCEEEEEECCcCcccCCceEEEeccCCcccccccccccccccCcceecccCCCCceeEEEecCcceEEEecCcE--E Q lcl|NC_010325. 148 KSFKNFLVGLNATSNSVEMPQMVWWSTSADAGGVPASWDPTDPTKDAGQNTLADTNGAIVDGVKLRDSFIIYKEDSV--Y 225 (513) Q Consensus 148 ~~~~~~l~~~g~t~~~~~~p~rv~wS~~~d~~~~P~~Wd~t~~t~~a~~~dl~d~~G~iv~g~~l~~~~vIf~en~i--~ 225 (513) +...+|.+ +...+ .++..-|.+.|.. -|.+.+. |---...+.+||+.....+.+++|-+++| | T Consensus 145 ~f~dGyfV-~~~~G-----t~~~~is~l~d~~-~~~~y~~--------fa~AE~~pD~Ivgi~~~~~~i~lfG~~TiEvw 209 (472) T protein:vir:10 145 TRLRGRYA-WSKDG-----TDSWFITDLEDES-HPDRYSA--------QYRAESQPDGIIGIGTWRDFIVCFGSSTIEYF 209 (472) T ss_pred eeecceEE-EeccC-----cceEEEeccCCcc-ccccccc--------cccccCCCCceEEEEeeccEEEEEeccceEEE Confidence 77777743 33321 2334456676642 1222210 00112334679999999999999999998 7 Q ss_pred EEEecCCCc--eeEeEE---ecCccccccCceeEEECCeEEEEeCC-----CeEEECCcccccCCchhHHHHHHhhcCcc Q lcl|NC_010325. 226 SMRYIGGLF--IFQFQQ---LFNDVGILGPNCAVEFDGNHFVVGHG-----DVYVHNGVQKQSVIDAQVRKFFFSDINPD 295 (513) Q Consensus 226 ~m~y~g~~~--~f~~~~---i~~~~G~~~~~siv~~~~~~ffls~~-----G~y~~~G~~~~~Ig~~~V~~~~~~~i~~~ 295 (513) ..+ |+.. -|.++. ..=..||.++.|++.+++.+|||+++ -+|+++|.+++.|.+..|++.+ ..+... T Consensus 210 ~nt--G~a~~~~fpy~r~~g~~iq~Gcaa~~sv~~~~~t~~~l~~d~~g~~~V~~~~g~~~~rIST~aIE~~i-~~y~~~ 286 (472) T protein:vir:10 210 SLT--GATTAGAALYVAQPSLMVQKGIAGTYCKTPFADSYAFISNPATGAPSVYIIGSGQVSPIASASIEKIL-RSYTAD 286 (472) T ss_pred Eec--CCCCcccCceeecccceeeecccCcchhhecCceEEEEecCCccccEEEEccCceeEEecCHHHHHHH-HhcCCc Confidence 764 5432 244444 44578999999999999999999996 3568899999999999999855 677665 Q ss_pred hhCCEEEEEecC--CCEEEEEEccCCCCCCcccceEEEEecccCeEEEEeccceeeeeecccccccceeecccCcccCcc Q lcl|NC_010325. 296 NYQRTFVLADHV--NTEMWVCYSSTRSKPGKHCDRAIIWNWKENTWSIRDLPNVLSGAYGIIDPKVSNLWDDDPNPWDTD 373 (513) Q Consensus 296 ~~~~~~~~~d~~--~~~v~~~~~s~~~~~~~~~d~~lvyd~~~~~Ws~~d~~~~~~~~~g~~~~~~~~~~~~~~~~~d~d 373 (513) .+...+++.... ...|++.|| ++.++||..++.|+.+-. ....+ .- T Consensus 287 e~~dA~~~t~~~~GH~fy~LtfP----------~~Tw~yD~~t~~Wherw~-------~~~~g---------------~~ 334 (472) T protein:vir:10 287 ELADGVMESLRFDAHELLIIHLP----------RHVLVYDASSSANGPQWC-------VLKTG---------------LY 334 (472) T ss_pred cccceeEEEEEeCCeEEEEEEcC----------CceeEeecccccCceeee-------eecCC---------------Cc Confidence 666656665443 333556665 579999999999987511 10000 00 Q ss_pred ceeccccccccCccceEEEEeecCceeeeccc-ceeecCccEEEEeecccccCCCcceEEEeeeeeccCCCe---eEEEE Q lcl|NC_010325. 374 TSVWGEGSYNPAKSSMIFSSFQDKKLFLFGNN-STFSGQNFVSTLERSDIYLGDDRMMKTVSAIIPHITGNG---TCNIW 449 (513) Q Consensus 374 ~~~~~~ds~~~~~~~~~~~~~~~~~~~~~~~~-~~~~g~~l~a~~~~~~~~~~~~~~~~~i~~~~~~~t~~~---~~~~~ 449 (513) -..|......+..+..+.|...++.++.++.+ -+-.|.+++..+..+.+..+ +.|.. +..+...++.. .-.+. T Consensus 335 ~~~~Ra~~~~~~~g~~~vGD~~ng~ly~l~~~~~td~G~~i~~~~~~p~~~~d-~~Rv~--d~~ve~~~G~~~~adp~~~ 411 (472) T protein:vir:10 335 DDVYRAIDFIYEGNQITCGDKLESVTGKLQFDISSQYGLQQEHLLFTPLFKAD-NARCF--DLEVESSTGVAQYADRLFL 411 (472) T ss_pred cCceEEEEEEEeCCeEEEEEcCCCeEEEEcccCcCcCCCcceEEEeccceeCC-CCeEE--EEEEEeecCCCcccCceEE Confidence 11234444455566677788888877766555 45678888888886665544 44442 22222122211 11111 Q ss_pred eeeeecCCCCceEcCceeeecCCceEEEeecCC--------CeEEEEEEccCCCcEEEEEEeeEEe Q lcl|NC_010325. 450 VGNAQVQGSGIRWKGPYPYRIGQDYKIDTKHVG--------RYIALKFDFSSEGDWYFNGYTIEMA 507 (513) Q Consensus 450 ~g~~~~~~~~~~w~~~~~~~~~~~~~~~~R~~~--------Ry~~~rl~~~~g~~w~~~G~~~~~~ 507 (513) +-.+| ...|+.+..-.-+..++.+.|+.. |. .|||++..=.+-.|.|.-++.. T Consensus 412 ~~~sD----g~~~g~~~~~~~~~~g~~~~R~~~~RlG~~r~~v-gf~~r~~~~~~v~l~ga~~~~e 472 (472) T protein:vir:10 412 SATTD----GINYGREQMIEQNEPFVYDKRVLWKRVGRIRKNV-GFKLRVITKSPVTLSGAQIRIE 472 (472) T ss_pred EeccC----CcccchhhhhhhccCcccccceeeeeeeeccccc-eEEEEEEeccccceeeeeEEeC Confidence 21222 344444432222223333333222 22 2455444335555777655544 No 10 >protein:vir:177 Length: 472 # NCBI annotation: DNA stabilization protein # Family: family:all:1540 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112082;genbank:gi:13559872;genbank:GeneID:920982 Probab=99.88 E-value=2e-21 Score=134.07 Aligned_cols=433 Identities=11% Similarity=0.099 Sum_probs=248.0 Q ss_pred CcccchhhcCccccccccCcccCCCCcEEEeEEEEEe-------CCeeEECCCcceeeecCCCcceeeeeeeeCCceEEE Q lcl|NC_010325. 1 MALERQEVKNPTGIVTDIAPADLPLEKWSFGNNVRFK-------NGKAQKTLGHTPIFDTAQAPILDMFPFIRNNIPYWL 73 (513) Q Consensus 1 m~~~~~~~~~~~G~~~~~~P~~lp~~a~~~~~Nv~~~-------~g~~~~~~g~~~~~~~~~~~~~~~~~~~~~g~~~~~ 73 (513) |.++++-+ ..|...+..-.|-- ++ - -.||++. .+.+.+.||.... +.+++++.|+.-...++. +. T Consensus 1 m~~~~~Pl--~~G~~~~~~~~d~~-~~-~-pVN~~a~~~~~~~s~~~l~~tPGl~~~-a~v~G~~RG~~~~~~~g~--lY 72 (472) T protein:vir:17 1 MPIQQLPL--MKGVGKDFRNADYI-DY-L-PVNMLATPKEILNSSGYLRSFPGIAKR-SDVNGVSRGVEYNMAQNA--VY 72 (472) T ss_pred CCeeeeee--ccCceeeccccchh-he-e-eeeeeeeccCCCcccceeecCCCceee-ccCCccccceEEEeeCCe--EE Confidence 77766555 34544432211111 11 1 1566665 3668888998776 456788877532234444 44 Q ss_pred EEcCceEEEecCceEEeccccceeeCCCCceeEEeeCCEE-EEEeCCCceEEEcCCCceecccCCCccc-----ceeeEE Q lcl|NC_010325. 74 LCSEQRLYLADGTTIIDVSPGPYSASITNRWSVGSFNGVI-FANDGVNPPHHLPPSESTFRVLPNFPAN-----TTFKRL 147 (513) Q Consensus 74 v~~~~kly~~~~~t~~dis~~~~~~~~~~~w~f~~~~~~~-ia~ng~d~~q~~~~~s~~f~~L~g~p~~-----~ka~~v 147 (513) ++..++||+.+. .|-+|.++ .+-+++-=+..+ ++.++.-...++++...+...++..... .-++.| T Consensus 73 ~V~G~~LY~v~~-~iGsiag~-------grVsMa~n~~~~av~~~g~~~~Y~y~~~v~t~~~~~~d~~~~~~dlg~~~dv 144 (472) T protein:vir:17 73 RVCGGKLYKGES-EVGDVAGS-------GRVSMAHGRTSQAVGVNGQLVEYRYDGTVKTVSNWPTDSGFTQYELGSVRDI 144 (472) T ss_pred EEecceEeeeec-ceecccCc-------ccEEEecCCcEEEEEECCceeEEEeeccchhhhccccccccccccccceeee Confidence 446679999774 35555432 445555433344 4444444455677765554544532221 235567 Q ss_pred EEEcCEEEEEECCcCcccCCceEEEeccCCcccccccccccccccCcceecccCCCCceeEEEecCcceEEEecCcE--E Q lcl|NC_010325. 148 KSFKNFLVGLNATSNSVEMPQMVWWSTSADAGGVPASWDPTDPTKDAGQNTLADTNGAIVDGVKLRDSFIIYKEDSV--Y 225 (513) Q Consensus 148 ~~~~~~l~~~g~t~~~~~~p~rv~wS~~~d~~~~P~~Wd~t~~t~~a~~~dl~d~~G~iv~g~~l~~~~vIf~en~i--~ 225 (513) +...+|.+ +...+ .++..-|.+.|.. .++.... |---...+.+||+.....+.+++|-+++| | T Consensus 145 ~f~dGyfV-~~~~G-----t~~~~is~l~d~~----~~~~y~~-----fa~AE~~pD~Ivgi~~~~~~i~lfG~~TiEvw 209 (472) T protein:vir:17 145 TRLRGRYA-WSKDG-----TDSWFITDLEDES----HPDRYSA-----QYRAESQPDGIIGIGTWRDFIVCFGSSTIEYF 209 (472) T ss_pred eeecceEE-EeccC-----cceEEEeccCCcc----ccccccc-----cccccCCCCceEEEEeeccEEEEEeccceEEE Confidence 77777743 33321 2334456676642 2221110 00112334679999999999999999998 7 Q ss_pred EEEecCCCc--eeEeEEe---cCccccccCceeEEECCeEEEEeCC-----CeEEECCcccccCCchhHHHHHHhhcCcc Q lcl|NC_010325. 226 SMRYIGGLF--IFQFQQL---FNDVGILGPNCAVEFDGNHFVVGHG-----DVYVHNGVQKQSVIDAQVRKFFFSDINPD 295 (513) Q Consensus 226 ~m~y~g~~~--~f~~~~i---~~~~G~~~~~siv~~~~~~ffls~~-----G~y~~~G~~~~~Ig~~~V~~~~~~~i~~~ 295 (513) ..+ |+.. .|.++.. .=..||.++.|++.+++.+|||+++ -+|+++|.+++.|.+..|++.+ ..+... T Consensus 210 ~nt--G~a~~~~fpy~r~~g~~iq~Gcaa~~sv~~~~~t~~~l~~d~~g~~~V~~~~g~~~~rIST~aIE~~i-~~y~~~ 286 (472) T protein:vir:17 210 SLT--GATTVGAALYVAQPSLMVQKGIAGTYCKTPFADSYAFISNPATGAPSVYIIGSGQVSPISSASIEKIL-RSYTAD 286 (472) T ss_pred Eee--CCCCCCcCceeecCcceeeecccCcchhhecCceEEEEecCCccccEEEEccCceeEEecCHHHHHHH-HhcCCc Confidence 764 4432 2444443 2578999999999999999999996 3568899999999999999865 677666 Q ss_pred hhCCEEEEEecC--CCEEEEEEccCCCCCCcccceEEEEecccCeEEEEeccceeeeeecccccccceeecccCcccCcc Q lcl|NC_010325. 296 NYQRTFVLADHV--NTEMWVCYSSTRSKPGKHCDRAIIWNWKENTWSIRDLPNVLSGAYGIIDPKVSNLWDDDPNPWDTD 373 (513) Q Consensus 296 ~~~~~~~~~d~~--~~~v~~~~~s~~~~~~~~~d~~lvyd~~~~~Ws~~d~~~~~~~~~g~~~~~~~~~~~~~~~~~d~d 373 (513) .+...+++.... ...|++.|| ++.++||..++.|+.+-. ....+ .- T Consensus 287 e~~dA~~~t~~~~GH~fy~LtfP----------~~Tw~yD~~t~~Wherw~-------~~~~g---------------~~ 334 (472) T protein:vir:17 287 ELADGVMESLRFDAHELLIIHLP----------RHVLVYDASSSANGPQWC-------VLKTG---------------LY 334 (472) T ss_pred cccceeEEEEEeCCeEEEEEEcC----------CceeEeecccccCceeee-------eecCC---------------Cc Confidence 666666665443 333556665 579999999999987511 10000 00 Q ss_pred ceeccccccccCccceEEEEeecCceeeeccc-ceeecCccEEEEeecccccCCCcceEEEeeeeeccCCCe--e-EEEE Q lcl|NC_010325. 374 TSVWGEGSYNPAKSSMIFSSFQDKKLFLFGNN-STFSGQNFVSTLERSDIYLGDDRMMKTVSAIIPHITGNG--T-CNIW 449 (513) Q Consensus 374 ~~~~~~ds~~~~~~~~~~~~~~~~~~~~~~~~-~~~~g~~l~a~~~~~~~~~~~~~~~~~i~~~~~~~t~~~--~-~~~~ 449 (513) -..|......+..+..+.|...++.++.++.+ -+..|.+++..+.++.+..+ ..|........ .++.+ + -.+. T Consensus 335 ~~~~Ra~~~~~~~g~~~vGD~~ng~ly~ld~~~~td~g~pi~~~~~~p~~~~~-~~RV~d~el~~--~tG~~~~adp~~l 411 (472) T protein:vir:17 335 DDVYRAIDFIYEGNQITCGDKLESVTGKLQFDISSQYDKQQEHLLFTPLFKAD-NARVFDLEVES--STGVAQYADRLFL 411 (472) T ss_pred cCceEEEEEEEeCCeEEEEEcCCCeEEEEcccCcCCCCceeEEEEecceeeCC-CceEEEEEEee--eCCcccCCCceEE Confidence 11234444455566777888888877766554 56789999999987666644 44443222111 22211 1 0122 Q ss_pred eeeeecCCCCceEcCceeeecCCceEEEeecCC--------CeEEEEEEccCCCcEEEEEEeeEEe Q lcl|NC_010325. 450 VGNAQVQGSGIRWKGPYPYRIGQDYKIDTKHVG--------RYIALKFDFSSEGDWYFNGYTIEMA 507 (513) Q Consensus 450 ~g~~~~~~~~~~w~~~~~~~~~~~~~~~~R~~~--------Ry~~~rl~~~~g~~w~~~G~~~~~~ 507 (513) +-.+| ...|+.+..-.-+..++.+.|+.. | ..|||+..+-.+-.|.|--++.. T Consensus 412 ~~~sD----g~~~g~~~~~~~~~~g~~~~R~~~~RlG~~r~~-v~f~~~~~~~~~~~l~~a~~~~e 472 (472) T protein:vir:17 412 SATTD----GINYGREQMIEQNEPFVYDKRVLWKRVGRIRKN-VGFKLRVITKSPVTLSGCQIRIE 472 (472) T ss_pred EcccC----CcccchhhhhhhccCcccccceeeeeeeecccc-ceEEEEEeecccceeeeeEEEeC Confidence 22222 344444432222223433333222 2 12666655446666777766655 No 11 >protein:vir:105525 Length: 472 # NCBI annotation: phage DNA stabilization protein # Family: family:all:1540 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516194;genbank:gi:89885997;genbank:GeneID:3964385 Probab=99.87 E-value=2.7e-21 Score=133.35 Aligned_cols=423 Identities=13% Similarity=0.083 Sum_probs=240.8 Q ss_pred CcccchhhcCccccccccCcc----cCCCCcEEEeEEEEEe-------CCeeEECCCcceeeecCCCcceeeeeeeeCCc Q lcl|NC_010325. 1 MALERQEVKNPTGIVTDIAPA----DLPLEKWSFGNNVRFK-------NGKAQKTLGHTPIFDTAQAPILDMFPFIRNNI 69 (513) Q Consensus 1 m~~~~~~~~~~~G~~~~~~P~----~lp~~a~~~~~Nv~~~-------~g~~~~~~g~~~~~~~~~~~~~~~~~~~~~g~ 69 (513) |.+.++-++ .|+.++..-. .|| .||++. .+..+-.||...+ +.+++++.|+.-...++ T Consensus 1 m~~~q~pl~--~g~~~~~~~~~~~~~lp-------vN~y~~p~~~~~ss~~lr~~PG~~~~-~~~~g~~RG~~~~~~~~- 69 (472) T protein:vir:10 1 MAIMQLPLL--RGLGKARDDADYIDALP-------VNMLATPKPVLNASGYLRSFPGITHK-AEVAGVSRGVQYNTHEK- 69 (472) T ss_pred CCceeeecc--cccccCccccCceeeee-------eeeeeccccccccceeecccCCceee-cCCCcccceeEeeeeCC- Confidence 777766553 4444431110 112 355433 3556666777444 44677888863222333 Q ss_pred eEEEEEcCceEEEecCceEEeccccceeeCCCCceeEEe-eCCEEEEEeCCCceEEEcCCCceecccCCC-----cccce Q lcl|NC_010325. 70 PYWLLCSEQRLYLADGTTIIDVSPGPYSASITNRWSVGS-FNGVIFANDGVNPPHHLPPSESTFRVLPNF-----PANTT 143 (513) Q Consensus 70 ~~~~v~~~~kly~~~~~t~~dis~~~~~~~~~~~w~f~~-~~~~~ia~ng~d~~q~~~~~s~~f~~L~g~-----p~~~k 143 (513) .++++..++||+-+ +.|-+|++ ..+-+++- -+..++++++...-.+++++..++++.+-. +...- T Consensus 70 -~lY~V~G~~Ly~v~-~~vG~iag-------sg~VsMa~~~~~q~v~v~g~~~~y~y~g~~~t~~~~~~~~~it~~dl~~ 140 (472) T protein:vir:10 70 -TVYRGLGNQLYKGH-KPIADLAG-------KGRISMAFSRNSQAVVAAGKMTLYRYDGTVKTLENWPKEKKYTQYDIGN 140 (472) T ss_pred -eEEEEecceEEEEE-eeeeeecc-------cccEEEEecCCceEEEEecceeEEEeccchhhhhhccccccCCccccCC Confidence 34555667788853 33434332 24566653 456788888876667787665555532211 11124 Q ss_pred eeEEEEEcCEEEEEECCcCcccCCceEEEeccCCcccccccccccccccCcceecccCCCCceeEEEecCcceEEEecCc Q lcl|NC_010325. 144 FKRLKSFKNFLVGLNATSNSVEMPQMVWWSTSADAGGVPASWDPTDPTKDAGQNTLADTNGAIVDGVKLRDSFIIYKEDS 223 (513) Q Consensus 144 a~~v~~~~~~l~~~g~t~~~~~~p~rv~wS~~~d~~~~P~~Wd~t~~t~~a~~~dl~d~~G~iv~g~~l~~~~vIf~en~ 223 (513) ++.|....+|.+ +...+ .+++.+|++.|+. .++..+. |---...+.+||+.....+.+++|-+++ T Consensus 141 ~~~v~~~dGyfV-~~~~g-----t~~~~iS~L~d~s----~~~~~~~-----FatAE~~pD~Ivgi~~~~~~i~lfG~~T 205 (472) T protein:vir:10 141 VRDMCHLRGRYV-WCKDG-----SDIFGVTDLEDES----HPDRYRA-----LYRAESQPDGIIGIDSWRDFIVCFGAST 205 (472) T ss_pred ceeEEEeCceEE-EeecC-----CceEEEeecCCcc----cCCcccc-----eeeecCCCCceEEEEeeccEEEEEeccc Confidence 777888888843 34432 3567799998853 3432221 1111234467999999999999999999 Q ss_pred E--EEEEecCCCceeEeEEec------CccccccCceeEEECCeEEEEeCCC-----eEEECCcccccCCchhHHHHHHh Q lcl|NC_010325. 224 V--YSMRYIGGLFIFQFQQLF------NDVGILGPNCAVEFDGNHFVVGHGD-----VYVHNGVQKQSVIDAQVRKFFFS 290 (513) Q Consensus 224 i--~~m~y~g~~~~f~~~~i~------~~~G~~~~~siv~~~~~~ffls~~G-----~y~~~G~~~~~Ig~~~V~~~~~~ 290 (513) | |..+ |+. .|.++.+. =+.||.++.|++.+++.+|||+++. +|+++|.+++.|.+..|++.+ . T Consensus 206 iEvw~nt--G~a-~fpf~r~~~~pg~~iq~Gcaa~~sv~~~~~s~~~l~~d~~g~~~V~~~~g~q~~rIST~aIE~~i-~ 281 (472) T protein:vir:10 206 IEYFSLT--GAA-DGQSAIYAAQPALMVEKGIAGTHCKTRLGDAHVIISHQATGAPSVFLINQAQATSIATATIEKIL-R 281 (472) T ss_pred eEEEEec--CCC-CcceeeeccCccceeeecccCchhhhhhCceEEEEecCCCcceEEEEccCceEEEecCHHHHHHH-H Confidence 8 7663 443 26666544 3489999999999999999999995 778899999999999999855 7 Q ss_pred hcCcchhCCEEEEEecC--CCEEEEEEccCCCCCCcccceEEEEecccCeE----EEEeccceeeeeecccccccceeec Q lcl|NC_010325. 291 DINPDNYQRTFVLADHV--NTEMWVCYSSTRSKPGKHCDRAIIWNWKENTW----SIRDLPNVLSGAYGIIDPKVSNLWD 364 (513) Q Consensus 291 ~i~~~~~~~~~~~~d~~--~~~v~~~~~s~~~~~~~~~d~~lvyd~~~~~W----s~~d~~~~~~~~~g~~~~~~~~~~~ 364 (513) .+....+...+++.... ...|++.|| ++.++||..++.| +.+.--. T Consensus 282 ~y~~~e~~dA~~~s~~~eGH~fy~LtfP----------~~Tw~yD~at~~~~~~w~~~~~g~------------------ 333 (472) T protein:vir:10 282 SYTHDELASAVMETVRFDSHELVLIHLS----------RQVLCYDAAANQNGLQWSLLKTGF------------------ 333 (472) T ss_pred hCCcccccceeEEEEEeCCeEEEEEEcC----------CeeEEEeccCCccceeeeeeecCC------------------ Confidence 77776777766666443 333566765 4689999777765 3222100 Q ss_pred ccCcccCccceeccccccccCccceEEEEeecCceeeecc-cceeecCccEEEEeecccccCCCcceEEEeeeeeccCCC Q lcl|NC_010325. 365 DDPNPWDTDTSVWGEGSYNPAKSSMIFSSFQDKKLFLFGN-NSTFSGQNFVSTLERSDIYLGDDRMMKTVSAIIPHITGN 443 (513) Q Consensus 365 ~~~~~~d~d~~~~~~ds~~~~~~~~~~~~~~~~~~~~~~~-~~~~~g~~l~a~~~~~~~~~~~~~~~~~i~~~~~~~t~~ 443 (513) .-..|-.....+..+..+.|...++.++.++. .-+..|.+++..+.++.+.. +.+|........ .++. T Consensus 334 --------~~~~~Ra~~~~~~~g~~~vGD~~ng~l~~ld~~~~td~g~pi~~~~~tp~~~~-~n~Rvfd~el~~--~tGv 402 (472) T protein:vir:10 334 --------YHAPYRGIDFMFADHHLTCGDKNDSLLGQLDFASSAQYEKPQEHVLYTPLFKA-DNARVFDFELEA--STGV 402 (472) T ss_pred --------ccCceEEEEEEEeCCeEEEEEcCCCeEEEEcCcCcCCCCceeEEEeeccceec-CCCeEEEEEEEe--eCCc Confidence 00123333444455666777777776666532 33567778888887665553 344443322222 2332 Q ss_pred eeE--EEEeeeeecCCCCceEcCceeeecCC---ceEEEee----cCCCe-EEEEEEccCCCcEE--EEEEeeE Q lcl|NC_010325. 444 GTC--NIWVGNAQVQGSGIRWKGPYPYRIGQ---DYKIDTK----HVGRY-IALKFDFSSEGDWY--FNGYTIE 505 (513) Q Consensus 444 ~~~--~~~~g~~~~~~~~~~w~~~~~~~~~~---~~~~~~R----~~~Ry-~~~rl~~~~g~~w~--~~G~~~~ 505 (513) +.. ++.+.- +-+....|....-...|. +.++-.| ++-|- .+|||... .+.. +.|.-+| T Consensus 403 g~~~~~v~L~w--Sddg~~~~~~~~~~~~g~~~~~~r~~w~RlG~ar~~vgf~~rv~~s--~pv~~~~~~a~~e 472 (472) T protein:vir:10 403 AHIADRLFLSA--TADGLHFGREQMINQNAPFAYDRRILWRRMGRVRKNLGFKVRVITS--SPVTLSGCQIRME 472 (472) T ss_pred CccCceEEEEE--eccccccchhHHHhhcCccchhheeeeheeeccccccceEEEEEEe--cccccccceeeeC Confidence 222 233322 224444443322111111 1111111 11121 23444333 3333 3455555 No 12 >protein:vir:2109 Length: 472 # NCBI annotation: head completion protein # Family: family:all:1540 # MgeID: mge:46 # MgeName: P22 # Cross-refs: genbank:acc:NP_059633;genbank:gi:9635541;genbank:GeneID:1262840 Probab=99.86 E-value=2.1e-20 Score=128.45 Aligned_cols=430 Identities=13% Similarity=0.172 Sum_probs=244.7 Q ss_pred CcccchhhcCccccccccCcccCCCCcEEEeEEEEEeC-------CeeEECCCcceeeecCCCcceeeeeeeeCCceEEE Q lcl|NC_010325. 1 MALERQEVKNPTGIVTDIAPADLPLEKWSFGNNVRFKN-------GKAQKTLGHTPIFDTAQAPILDMFPFIRNNIPYWL 73 (513) Q Consensus 1 m~~~~~~~~~~~G~~~~~~P~~lp~~a~~~~~Nv~~~~-------g~~~~~~g~~~~~~~~~~~~~~~~~~~~~g~~~~~ 73 (513) |.++.+-+ ..|...|..--|-- + |.- .||++.. +.++++||..+..+ ++++..|++....++. ++ T Consensus 1 m~~~q~Pl--~~g~~~~~~~~d~~-~-~~p-VN~~a~~~~~~~s~~~lr~tPG~~~~~~-~~g~~RG~~~~t~~~~--ly 72 (472) T protein:vir:21 1 MPIQQLPM--MKGMGKDFKNADYI-D-YLP-VNMLATPKEILNSSGYLRSFPGITKRYD-MNGVSRGVEYNTAQNA--VY 72 (472) T ss_pred CceEEeec--ccccccccccccee-e-eee-eeeeeeccCCcccceeeeecCCcceecc-CCCceeeeeecccCCe--EE Confidence 66555444 24444432222111 1 111 5666652 56888999888866 5778877764322333 36 Q ss_pred EEcCceEEEecCceEEeccccceeeCCCCceeEEeeCCEEEEEeCCCceE-EEcCCCceecccCCCcccc-----eeeEE Q lcl|NC_010325. 74 LCSEQRLYLADGTTIIDVSPGPYSASITNRWSVGSFNGVIFANDGVNPPH-HLPPSESTFRVLPNFPANT-----TFKRL 147 (513) Q Consensus 74 v~~~~kly~~~~~t~~dis~~~~~~~~~~~w~f~~~~~~~ia~ng~d~~q-~~~~~s~~f~~L~g~p~~~-----ka~~v 147 (513) ++..++||+-+.. .-+|. .+.+-+++-=+..+.++-+.+..- +++++..++...+-+.... .++.| T Consensus 73 ~V~G~~LY~v~~~-~G~i~-------gsgrVsMa~n~~~~~v~~~~~~~~Y~~~~~~~t~~~~~~d~~f~~~dl~~~~dv 144 (472) T protein:vir:21 73 RVCGGKLYKGESE-VGDVA-------GSGRVSMAHGRTSQAVGVNGQLVEYRYDGTVKTVSNWPADSGFTQYELGSVRDI 144 (472) T ss_pred EEeCCceEEEeee-eeeec-------ccccEEEeeCCeEEEEEECCceeEEEEecchhhhhcccCccccccccccceeEE Confidence 6778899999853 22222 224556644444344444544443 4677666655555332222 23356 Q ss_pred EEEcCEEEEEECCcCcccCCceEEEec-cCCcccccccccccccccCcceecccCCCCceeEEEecCcceEEEecCcE-- Q lcl|NC_010325. 148 KSFKNFLVGLNATSNSVEMPQMVWWST-SADAGGVPASWDPTDPTKDAGQNTLADTNGAIVDGVKLRDSFIIYKEDSV-- 224 (513) Q Consensus 148 ~~~~~~l~~~g~t~~~~~~p~rv~wS~-~~d~~~~P~~Wd~t~~t~~a~~~dl~d~~G~iv~g~~l~~~~vIf~en~i-- 224 (513) +...+|.+ +... + .+.+|.. +.|. .+++.... |---...+.+||+.....+.+++|-+++| T Consensus 145 ~f~dGyfV-~~~~-g-----t~~f~is~l~d~----~~~~~y~~-----FatAE~~pD~Iv~i~~~~~~l~lfG~~TiEv 208 (472) T protein:vir:21 145 TRLRGRYA-WSKD-G-----TDSWFITDLEDE----SHPDRYSA-----QYRAESQPDGIIGIGTWRDFIVCFGSSTIEY 208 (472) T ss_pred EEecceEE-EccC-C-----cceeEEecCCCC----ccccCCcc-----ceeeccCCCceEEEEeeccEEEEEeccceEE Confidence 67777743 3321 1 2245544 4553 22321100 11112344679999999999999999998 Q ss_pred EEEEecCCC----ceeEeEE-ecCccccccCceeEEECCeEEEEeCCC-----eEEECCcccccCCchhHHHHHHhhcCc Q lcl|NC_010325. 225 YSMRYIGGL----FIFQFQQ-LFNDVGILGPNCAVEFDGNHFVVGHGD-----VYVHNGVQKQSVIDAQVRKFFFSDINP 294 (513) Q Consensus 225 ~~m~y~g~~----~~f~~~~-i~~~~G~~~~~siv~~~~~~ffls~~G-----~y~~~G~~~~~Ig~~~V~~~~~~~i~~ 294 (513) |..+ |+. ..|+.++ ..=+.||.++.|++.+++.+|||++++ +|+++|.+++.|.+..|++.+ ..... T Consensus 209 w~nt--G~ad~~~fpy~r~~g~~iq~Gcaa~~sv~~~~~s~~~l~~~~~g~~~V~~~~g~qa~rIST~aIE~~i-~~y~~ 285 (472) T protein:vir:21 209 FSLT--GATTAGAALYVAQPSLMVQKGIAGTYCKTPFADSYAFISHPATGAPSVYIIGSGQASPIATASIEKII-RSYTA 285 (472) T ss_pred EEec--CCCCcCcCceEEcCcceeeecccCcchhhecCceEEEEecCCCcccEEEEccCceeEEecCHHHHHHH-HhcCC Confidence 7663 554 3444443 235789999999999999999999998 889999999999999999976 66654 Q ss_pred chhCCEEEEEec-CCCE-EEEEEccCCCCCCcccceEEEEecccCe----EEEEeccceeeeeecccccccceeecccCc Q lcl|NC_010325. 295 DNYQRTFVLADH-VNTE-MWVCYSSTRSKPGKHCDRAIIWNWKENT----WSIRDLPNVLSGAYGIIDPKVSNLWDDDPN 368 (513) Q Consensus 295 ~~~~~~~~~~d~-~~~~-v~~~~~s~~~~~~~~~d~~lvyd~~~~~----Ws~~d~~~~~~~~~g~~~~~~~~~~~~~~~ 368 (513) ..+....++... .+++ |++.|| ++.++||..++. |+.++--. T Consensus 286 ~e~~~A~~~t~~~eGH~fy~LtfP----------~~Tw~yD~at~~~~e~W~~~~sg~---------------------- 333 (472) T protein:vir:21 286 EEMATGVMETLRFDSHELLIIHLP----------RHVLVYDASSSQNGPQWCVLKTGL---------------------- 333 (472) T ss_pred ccccceEEEEEEeCCeEEEEEEcC----------CeeEEEEcccCccCceeeeeccCC---------------------- Confidence 444454444433 3333 556665 578999999886 55543210 Q ss_pred ccCccceeccccccccCccceEEEEeecCceeeec-ccceeecCccEEEEeecccccCCCcceEEEeeeeeccCC--Cee Q lcl|NC_010325. 369 PWDTDTSVWGEGSYNPAKSSMIFSSFQDKKLFLFG-NNSTFSGQNFVSTLERSDIYLGDDRMMKTVSAIIPHITG--NGT 445 (513) Q Consensus 369 ~~d~d~~~~~~ds~~~~~~~~~~~~~~~~~~~~~~-~~~~~~g~~l~a~~~~~~~~~~~~~~~~~i~~~~~~~t~--~~~ 445 (513) --..|......+..+..+.|.+.++.++.+. ++..+.+...+..+..+.+. .+.+|..... +...++ +.. T Consensus 334 ----~~~~~R~~~~~~~~g~~ivGD~~nG~ly~L~fd~~~~~d~~~~~~r~~p~~~-~dn~R~fd~e--ve~~~Gv~q~~ 406 (472) T protein:vir:21 334 ----YDDVYRGVDFMYEGNQITCGDKSEAVVGQLQFDISSQYDKQQEHLLFTPLFK-ADNARCFDLE--VESSTGVAQYA 406 (472) T ss_pred ----CcCceeEEEEEeeCCeEEEEEcCCCeEEEEEecccccCCCcCcEEEEcccee-CCCCEEEEEe--eeccCCCCCcC Confidence 0012333444445555666777666666542 34455566667766655444 3444443221 221222 111 Q ss_pred EEEEeeeeecCCCCceEcCceeeecCCceEEEeecCCC-------eEEEEEEccCCCcEEEEEEeeEEe Q lcl|NC_010325. 446 CNIWVGNAQVQGSGIRWKGPYPYRIGQDYKIDTKHVGR-------YIALKFDFSSEGDWYFNGYTIEMA 507 (513) Q Consensus 446 ~~~~~g~~~~~~~~~~w~~~~~~~~~~~~~~~~R~~~R-------y~~~rl~~~~g~~w~~~G~~~~~~ 507 (513) -.+.+.-++ +-.+|+.++...-|.-+..+.|+..| =..|||+...-.+-.+.|..+..- T Consensus 407 d~v~L~wSd---dG~~~~~~~~~~~g~~g~~~tr~~~~RlG~~r~~v~f~~r~~~~~~~~l~g~~~~~E 472 (472) T protein:vir:21 407 DRLFLSATT---DGINYGREQMIEQNEPFVYDKRVLWKRVGRIRRLIGFKLRVITKSPVTLSGCQIRLE 472 (472) T ss_pred cEEEEEeec---cccccccceeeccCCccchhcceeeeeeeecccceeEEEEEEecCcceeeeeEEeeC Confidence 234443333 35578888766655545444443221 123666655556667777766544 No 13 >protein:vir:352 Length: 536 # NCBI annotation: hypothetical protein # Family: family:all:3197 # MgeID: mge:9 # MgeName: Mx8 # Cross-refs: genbank:acc:NP_203466;genbank:gi:15320622;genbank:GeneID:921729 Probab=99.83 E-value=9.9e-20 Score=124.77 Aligned_cols=476 Identities=14% Similarity=0.124 Sum_probs=247.9 Q ss_pred Ccc-----cchh------hcCccc-cccccCcccCCCCcEEEeEEEEEeCCeeEECCCcceeeecCCCcceeeeee---- Q lcl|NC_010325. 1 MAL-----ERQE------VKNPTG-IVTDIAPADLPLEKWSFGNNVRFKNGKAQKTLGHTPIFDTAQAPILDMFPF---- 64 (513) Q Consensus 1 m~~-----~~~~------~~~~~G-~~~~~~P~~lp~~a~~~~~Nv~~~~g~~~~~~g~~~~~~~~~~~~~~~~~~---- 64 (513) |-+ -||. +--++| .+++.+=-..||+...-+.|-.|+..+++-|.|++...+.++.++.-++.| T Consensus 2 ~~~~a~r~~~~~~~~~~~~pAPv~G~~t~~~~A~m~~~~A~vldN~fpt~~g~r~R~G~~~~at~~~~~v~s~~~~~~~~ 81 (536) T protein:vir:35 2 MPLRARRVPPPPSIQEAHLPAPVGGLNTVSAGSAMPVSDCLQGFNLIASELGLRSRLGYREWCTGLGVPARSTLPFAGSA 81 (536) T ss_pred CccccccCCCCccceeeeeCccccceeccchhhcCCCCceEEEeecCCChhhhhhhccchhHhcCCccceEEeeeeeecc Confidence 211 1111 112333 444444445588888999999999999999999998877789999888887 Q ss_pred eeCCceEEEEEcCceEEEecCceE--EeccccceeeCCCCceeEEe---eCCE-EEEEeCCCceEEEcCCC--c-eeccc Q lcl|NC_010325. 65 IRNNIPYWLLCSEQRLYLADGTTI--IDVSPGPYSASITNRWSVGS---FNGV-IFANDGVNPPHHLPPSE--S-TFRVL 135 (513) Q Consensus 65 ~~~g~~~~~v~~~~kly~~~~~t~--~dis~~~~~~~~~~~w~f~~---~~~~-~ia~ng~d~~q~~~~~s--~-~f~~L 135 (513) +.+++.++|..+.+.||.-.+..- +.+...+.+....-.|++.+ +++. ++..||.+.+|.++++. . +...+ T Consensus 82 ~~Ga~~klf~at~~~i~dvT~pa~p~~~~~~~g~~~g~~~~w~~v~~~~~gG~~l~~~nG~~~~~~~~gt~~~w~~v~~~ 161 (536) T protein:vir:35 82 KSGAANRLFQTTSEGIWDVSASSQTPTQVLTFGDQTGDAGFGVSHAFVTQRGHFLFYADETNGLFRYSESTDTWTAVAQG 161 (536) T ss_pred ccCcceeEEEecccceeeeecCCCCcceEEEeccCCCceeeEEEEEecCCCceEEEEEEcCCCceEeecccCchhhcccC Confidence 467888999999998887765421 11111111222234576555 5555 99999999999998653 2 22332 Q ss_pred C-------CCcccceeeEEEEEcCEEEEEECCcCcccCCceEEEeccCCcccccccccccccccCcceecccCCCCceeE Q lcl|NC_010325. 136 P-------NFPANTTFKRLKSFKNFLVGLNATSNSVEMPQMVWWSTSADAGGVPASWDPTDPTKDAGQNTLADTNGAIVD 208 (513) Q Consensus 136 ~-------g~p~~~ka~~v~~~~~~l~~~g~t~~~~~~p~rv~wS~~~d~~~~P~~Wd~t~~t~~a~~~dl~d~~G~iv~ 208 (513) + -+++ +..+|+.|++|||+.... --++|+-..+-+...-+..+.-..-+..++ |.-+...-+. T Consensus 162 t~~~~i~Gv~~~--~l~~i~~~knRLffvq~~------s~~awYLp~~av~G~A~~f~lg~~~~~GGs--L~~~~sWS~~ 231 (536) T protein:vir:35 162 TGVGEIDGVNPA--NIVFVAVFKQRVWLVERD------TARAWYLPAGAIAGTAQPFEMGAQFRAGGH--LVGLWNWTYD 231 (536) T ss_pred CcccccCCCCcc--cceeeeeEeeeEEEEEeC------CceEEEeecccccceeeeeeccCccccCce--Eccceeeccc Confidence 2 2455 566799999999986552 234677655544332122211111111111 1111111112 Q ss_pred EEecCcceEEEecCc----EEEEEecCCCceeEeEEecCccc--cccCceeEEECCeEEEEeCCCeEEECCcc-----cc Q lcl|NC_010325. 209 GVKLRDSFIIYKEDS----VYSMRYIGGLFIFQFQQLFNDVG--ILGPNCAVEFDGNHFVVGHGDVYVHNGVQ-----KQ 277 (513) Q Consensus 209 g~~l~~~~vIf~en~----i~~m~y~g~~~~f~~~~i~~~~G--~~~~~siv~~~~~~ffls~~G~y~~~G~~-----~~ 277 (513) +..+-+..++|...+ ||+.++..+...|.+..|..-.+ .+.++|++.+++++.+++++|+.-++-.. .. T Consensus 232 ~G~Gl~d~~VfvSs~GeVaVyqGsdPs~s~~Wsl~giy~IG~~pp~G~r~~i~~G~Dl~iit~dGivplsq~~q~d~~a~ 311 (536) T protein:vir:35 232 GGAGMDDSLVAISGGGDVAIWQGTDPASSATFGLRGVWSLGGSPPAGRRIATDYGGDVLVLSRLGVRPLSRLVAGEVDKD 311 (536) T ss_pred cCCCcceeEEEEecCCcEEEEecCCCCcccceeEEEEEEeccCCCCCceEEEeecCeeEeeecCCccchhhhhhhhhhcc Confidence 222234445565554 56666655666799999986644 78999999999999999999999553211 11 Q ss_pred cCCchhHHHHHHhhc-CcchhCCEEEEEecCCCEEEEEEccCCCCCCcccceEEEEecccCeEEEEeccc-eeeeeec-- Q lcl|NC_010325. 278 SVIDAQVRKFFFSDI-NPDNYQRTFVLADHVNTEMWVCYSSTRSKPGKHCDRAIIWNWKENTWSIRDLPN-VLSGAYG-- 353 (513) Q Consensus 278 ~Ig~~~V~~~~~~~i-~~~~~~~~~~~~d~~~~~v~~~~~s~~~~~~~~~d~~lvyd~~~~~Ws~~d~~~-~~~~~~g-- 353 (513) ...+.+|+..+...+ ..+....+++..-|.++.+....|...... .+.+|+|..+++|+.+.-++ .|++.+. T Consensus 312 ~~it~~I~~~~~~~v~~~a~~~gWq~~~~P~~n~liV~~P~~~g~~----~~~fV~N~~tgaW~~ftgw~a~C~~v~~~~ 387 (536) T protein:vir:35 312 TYVTAKVSNLFSALMLTRASLPGWSMQLHPEDNALLVTVPTYPGQP----TEQLVMALAGRAWFRYRDLPIYSSAVWGGK 387 (536) T ss_pred cCCCccchhhHHHHHhhccCCCccEEEEccCCCeEEEEccCCCCCC----ceEEEeecccCceeeecCCcceEEEEecCe Confidence 123556776554444 333334578999999999999998875543 47999999999999776544 4555443 Q ss_pred -ccccccceeecccCcccCcccee-ccccccccCccceE---EEEeec-----CceeeecccceeecCccEEEEe--ecc Q lcl|NC_010325. 354 -IIDPKVSNLWDDDPNPWDTDTSV-WGEGSYNPAKSSMI---FSSFQD-----KKLFLFGNNSTFSGQNFVSTLE--RSD 421 (513) Q Consensus 354 -~~~~~~~~~~~~~~~~~d~d~~~-~~~ds~~~~~~~~~---~~~~~~-----~~~~~~~~~~~~~g~~l~a~~~--~~~ 421 (513) +.+...+..|..+. .+ =+++...++|.++. +.+|+. .+...+.+..+++-...+.... +-+ T Consensus 388 LyFG~~dG~v~~~da-------~v~g~D~~~~~ag~~I~~~~~~af~~~G~~~~K~~~~~r~~~~s~~~~p~l~l~~~~d 460 (536) T protein:vir:35 388 LYFGTVDGRVCVNDG-------YVDGVLLSEPSAFTPVQWSLLSAFTNLGSARQKQVQLLRPTLLSESATPSYEVQARYR 460 (536) T ss_pred EEEeecCCEEEeccc-------ccCccccccCcCcceeeeccccchhhcCchHHHHHHHhhhhhhhccCCceEEEEEEEE Confidence 22222222211110 00 01122223333221 112210 0111111111111111111111 111 Q ss_pred cccCCCcceEEEeeeeeccCCCeeEEEEeeeeecCCCCceEcCceeeecCCceEEEeecCCCeEEEEEEccCCCcEEEEE Q lcl|NC_010325. 422 IYLGDDRMMKTVSAIIPHITGNGTCNIWVGNAQVQGSGIRWKGPYPYRIGQDYKIDTKHVGRYIALKFDFSSEGDWYFNG 501 (513) Q Consensus 422 ~~~~~~~~~~~i~~~~~~~t~~~~~~~~~g~~~~~~~~~~w~~~~~~~~~~~~~~~~R~~~Ry~~~rl~~~~g~~w~~~G 501 (513) +++..+.. +. +.......+- ...=+...|++..-+.. ....+-..|+=.+++++=.+-.+-.+.+ T Consensus 461 ~D~~~p~~---~~------~~~~~~~~Wd---~s~Wd~~~Ws~~~~v~~---~~~s~~g~G~~is~~~~g~a~~~~~~~~ 525 (536) T protein:vir:35 461 YDFAELAP---VS------AMGGGSGTWD---GSTWDVDVWSGEYQASQ---QVRGGTGVGVDLAIAIRGTAVARTVLVG 525 (536) T ss_pred eccCCCCC---cC------CCCCCcccCC---cccCCceecCCcceeEe---eeeEeccceEEEEEEEeeccccceEEEE Confidence 11111100 00 0000000000 00002234443321110 0011223344445555534444555677 Q ss_pred EeeEEeccccC Q lcl|NC_010325. 502 YTIEMAPKAGM 512 (513) Q Consensus 502 ~~~~~~~~g~r 512 (513) +|+.+.+-|-= T Consensus 526 ~d~~~e~G~v~ 536 (536) T protein:vir:35 526 IDILFTAGGLL 536 (536) T ss_pred EEEEEeecccC Confidence 77776543333 No 14 >protein:vir:107423 Length: 681 # NCBI annotation: Bbp13 # Family: family:all:780 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958682;genbank:gi:41179374;genbank:GeneID:2717217 Probab=99.48 E-value=7.5e-12 Score=81.58 Aligned_cols=481 Identities=12% Similarity=0.090 Sum_probs=248.8 Q ss_pred CcccchhhcCccccccccCcc-----cCC--CCcEEEeEEEEEe-CCeeEECCCcceeeecC-CCcceeeeeee-eCCce Q lcl|NC_010325. 1 MALERQEVKNPTGIVTDIAPA-----DLP--LEKWSFGNNVRFK-NGKAQKTLGHTPIFDTA-QAPILDMFPFI-RNNIP 70 (513) Q Consensus 1 m~~~~~~~~~~~G~~~~~~P~-----~lp--~~a~~~~~Nv~~~-~g~~~~~~g~~~~~~~~-~~~~~~~~~~~-~~g~~ 70 (513) |+..++--..-+|.- +.|. ||. .++...|.|+++. .|++++|+|..=+.... ++....+.+|. ..+.. T Consensus 1 m~~~~~~~~~f~~Ge--~~p~l~~r~D~~~y~~~~~~~~N~~~~~~G~~~~R~g~~~~~~~~~~~~~~rlipf~~~~~~~ 78 (681) T protein:vir:10 1 MSNVRVLQRSFGGGE--ISPEMFGRIDDVKYQSGLAICRNFVVKPQGPAENRAGFAFVREVKDSAKKVRLIPFTYSVTQT 78 (681) T ss_pred CcceeEeeeecCCce--eeeeeccchhHHHHHHHHHHhcCcEEEecCCceecChhHhhhhcCCCCCcEEEEEEEeCCCce Confidence 776555433333322 3333 121 1567789999998 58899999987665554 33445677776 45788 Q ss_pred EEEEEcCceEEEecCce-EEec-----cccceeeCCCCceeEEeeCCEEEEEeCCCceEEEc---CCCceecccC----- Q lcl|NC_010325. 71 YWLLCSEQRLYLADGTT-IIDV-----SPGPYSASITNRWSVGSFNGVIFANDGVNPPHHLP---PSESTFRVLP----- 136 (513) Q Consensus 71 ~~~v~~~~kly~~~~~t-~~di-----s~~~~~~~~~~~w~f~~~~~~~ia~ng~d~~q~~~---~~s~~f~~L~----- 136 (513) +++..+.++|+-|.++. ..+- ..+||....-..-+|+|-.|++++++..-+||.+. .++..|+.+. T Consensus 79 ~~l~~g~~~~r~~~~~~~~~~~~~~~~~~tpy~~~~l~~l~~~q~aD~~~i~h~~~~p~~L~r~~~~~W~l~~~~f~~~p 158 (681) T protein:vir:10 79 MVIELGAGYFRFHTNGGTLLDGAVPYEIANPYAEADLFNIHYVQSADVLTLVHPNYAPRELRRLGATNWQLATIAFTSPV 158 (681) T ss_pred EEEEEeCCeEEEEeCCcEEeeCcEeEEecCCCChhhhcCceEEEEcCEEEEECCCCcceEEEEccCCceEEEEEEecccc Confidence 89999999998886663 2111 02345443333456777777777777666666320 0111111000 Q ss_pred -------------------------------------------------------------------------------- Q lcl|NC_010325. 137 -------------------------------------------------------------------------------- 136 (513) Q Consensus 137 -------------------------------------------------------------------------------- 136 (513) T Consensus 159 ~~p~~~~at~~~~~~~~t~~~~v~avda~t~~~s~~~~~~tvt~~~~~~~~~~t~~w~a~~g~~~~~V~~~~~gi~g~ig 238 (681) T protein:vir:10 159 ATPTSVTATSNNKGTDYTYRYVVTALDAEGKTESAPSSAGTCTNNLFTNGGANTIAWSASSGASRYNVYKEQGGLYGYIG 238 (681) T ss_pred ccceeeeeeccCCccceeEeEEEEEeecccceeecCCcceEEeeeeecCCcceeEEEEecCCceeeeecccceeEEEEee Confidence Q ss_pred -----------------CCcc---------cceeeEEEEEcCEEEEEECCcCcccCCceEEEeccCCccccccccccccc Q lcl|NC_010325. 137 -----------------NFPA---------NTTFKRLKSFKNFLVGLNATSNSVEMPQMVWWSTSADAGGVPASWDPTDP 190 (513) Q Consensus 137 -----------------g~p~---------~~ka~~v~~~~~~l~~~g~t~~~~~~p~rv~wS~~~d~~~~P~~Wd~t~~ 190 (513) ..|+ +--...|..|+|||++++... .|++|+.|..+|. .+++.... T Consensus 239 ~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~gyP~~v~f~q~RL~f~~~~~----~p~~v~~Srsgdy----~nF~~~~~ 310 (681) T protein:vir:10 239 QTTGTSLVDDNIAPDLSVTPPIYDAVFNAAGDYPAAVSYFEQRRCFAGTTN----KPQNIWMTRSGTE----SAMSYSLP 310 (681) T ss_pred ccceeeeeecccccCccccccccccccccCCCceEEEEEEcceEEEeeCCC----CCcEEEEEcccCc----ccccccCC Confidence 0000 001234889999999988753 5899999999996 45554444 Q ss_pred ccCcceeccc---CCCCceeEEEecCcceEEEecCcEEEEEecC-C---CceeEeEEecCccccccCceeEEECCeEEEE Q lcl|NC_010325. 191 TKDAGQNTLA---DTNGAIVDGVKLRDSFIIYKEDSVYSMRYIG-G---LFIFQFQQLFNDVGILGPNCAVEFDGNHFVV 263 (513) Q Consensus 191 t~~a~~~dl~---d~~G~iv~g~~l~~~~vIf~en~i~~m~y~g-~---~~~f~~~~i~~~~G~~~~~siv~~~~~~ffl 263 (513) ..+++=.++. +....|..+++.+ .++|+...+-|.++-.+ + |.--++++.+. .||- .=.-+.+|+.++|+ T Consensus 311 ~~ddD~i~~~~~~~~~~~i~~~v~~~-~lli~t~~~e~~l~~~~~~~lTP~~~~~~~~s~-~g~~-~~~Pv~vg~~v~fv 387 (681) T protein:vir:10 311 VRDDDRVAFRVAAREANAIRHIVPLT-ELLLLTSSGEWRVASVNSDAVTPTTISVRPQSY-VGAT-DVQPVVVNNTTIYG 387 (681) T ss_pred CCCCccEEEEEcCCcceeEEEEEecC-cEEEEEcCcEEEEecCCCccccceeEEEEEeee-eccc-cccceeeCCeEEEE Confidence 4444444443 3333466667765 69999999999997442 2 44578888875 6774 45678899999999 Q ss_pred eCCCeE----EECC--cccccCCchhHHHHHHhhcCcchhCCEEEEEecCCCEEEEEEccCCCCCCcccceEEEEecccC Q lcl|NC_010325. 264 GHGDVY----VHNG--VQKQSVIDAQVRKFFFSDINPDNYQRTFVLADHVNTEMWVCYSSTRSKPGKHCDRAIIWNWKEN 337 (513) Q Consensus 264 s~~G~y----~~~G--~~~~~Ig~~~V~~~~~~~i~~~~~~~~~~~~d~~~~~v~~~~~s~~~~~~~~~d~~lvyd~~~~ 337 (513) ++.|=. .++- ..+++..--.+-+-++..+ .-+...+......+.|+-.+.+. --.+.|+-+.+ T Consensus 388 ~~~g~~vre~~y~~~~d~~~~~dlt~~a~Hl~~~~-----~i~~~a~~~~p~~~~~~v~~dg~------l~~~ty~~eq~ 456 (681) T protein:vir:10 388 AARGGHVRELAYNWQANGFVTGDLSLRAAHLFDNL-----DILDMAYAKAPQPIVWFISSSGK------LLGLTYVPEQQ 456 (681) T ss_pred ecCCCEEEEEEEeeecCceeccchhhhhhhhcCCC-----CeEEEEEecCCCEEEEEEecCCc------EEEEEEecccc Confidence 999822 3331 1222221111222233222 11223344444566666655432 23456665555 Q ss_pred --eEEEEecccee-ee-ee------------------------cccccccce-----eecccCcccCccce-ec-ccccc Q lcl|NC_010325. 338 --TWSIRDLPNVL-SG-AY------------------------GIIDPKVSN-----LWDDDPNPWDTDTS-VW-GEGSY 382 (513) Q Consensus 338 --~Ws~~d~~~~~-~~-~~------------------------g~~~~~~~~-----~~~~~~~~~d~d~~-~~-~~ds~ 382 (513) .|+.-+.+..+ .. ++ ......... ...+-..++..+.. .+ .++ T Consensus 457 v~aW~~~~~~g~v~~v~~i~~~~~d~l~~vv~r~~~g~~~~yie~~~~~~~~~~~~~~~vD~~~t~~~~~~~~~sgl~-- 534 (681) T protein:vir:10 457 IGAWHQHDTDGVFESCAVVAEGNEDRLYAVVRRTIGGNEVRYVERMASRQFDAQADAFFVDSGLTYSGEPVSHISGLE-- 534 (681) T ss_pred eeeEEEEecCCcEEEEEEecCCCCcEEEEEEEecCCCCeEEEEEecCCccccccccceEeeccccccCcceeeecccc-- Confidence 68766544321 11 00 000000000 00000001110100 01 111 Q ss_pred ccCccceEEEEeecCceee----------e--cccceeecCccEEEEeecccccC-----CCcceEEEeeeeeccCCCee Q lcl|NC_010325. 383 NPAKSSMIFSSFQDKKLFL----------F--GNNSTFSGQNFVSTLERSDIYLG-----DDRMMKTVSAIIPHITGNGT 445 (513) Q Consensus 383 ~~~~~~~~~~~~~~~~~~~----------~--~~~~~~~g~~l~a~~~~~~~~~~-----~~~~~~~i~~~~~~~t~~~~ 445 (513) -..|..+.+ ..|+.... . ......-|-++++.++...+.+. ...+.+++.++..++....- T Consensus 535 ~leG~tv~i--~aDG~~~~~~~V~~G~itl~~~~~~v~VGl~Y~s~i~~lp~~~~~~~g~~~g~~~ri~rv~lr~~~S~g 612 (681) T protein:vir:10 535 HLEGKTVSI--LADGAVHPQRVVTDGAIDLDVEAGTVHIGLPITAELQTLPVAMQLDGSFGQGRVKNINKLWLRVHRSSG 612 (681) T ss_pred CCCCcEEEE--EeCCeecCcEeecCcEEEeCcCCceEEEeeeceeEEEecceeeecCCcccCCceEEEEEEEEEEEcccc Confidence 112333322 22332221 1 11225667788888886665542 22456677666655433332 Q ss_pred EEEEeeeeecCC----CCceEcCceeeecCCceEEEeecC-CCeEEEEEEccCCCcEEEEEEeeEEeccc Q lcl|NC_010325. 446 CNIWVGNAQVQG----SGIRWKGPYPYRIGQDYKIDTKHV-GRYIALKFDFSSEGDWYFNGYTIEMAPKA 510 (513) Q Consensus 446 ~~~~~g~~~~~~----~~~~w~~~~~~~~~~~~~~~~R~~-~Ry~~~rl~~~~g~~w~~~G~~~~~~~~g 510 (513) +.+.....+.-. .+-.+.++.+.-+| +..++++.. .+=..++|+.+.-.+.++.++.+|....| T Consensus 613 ~~~~~~~~~l~~~~~~~~~~~g~~~~l~TG-~~~v~v~~~~~~~~~v~I~qd~PlP~tvlsi~~ev~vgg 681 (681) T protein:vir:10 613 IFAGPHADALTEVKQRTSEPYGSPPALKSE-EIPLVLSPKWGDSGQLFVRQADPLPLMIVSMSAEIAIGA 681 (681) T ss_pred eEEeeCCCceEEEEEeccccccccCCccCC-eEEEEeCCCcCcceEEEEEECCCcCEEEEEeeEEEEeeC Confidence 222211110000 00011222222233 233444432 34456788888889999999999988666 No 15 >protein:vir:98487 Length: 681 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:780 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996575;genbank:gi:45569506;genbank:GeneID:2767815 Probab=99.48 E-value=7.5e-12 Score=81.58 Aligned_cols=481 Identities=12% Similarity=0.090 Sum_probs=248.8 Q ss_pred CcccchhhcCccccccccCcc-----cCC--CCcEEEeEEEEEe-CCeeEECCCcceeeecC-CCcceeeeeee-eCCce Q lcl|NC_010325. 1 MALERQEVKNPTGIVTDIAPA-----DLP--LEKWSFGNNVRFK-NGKAQKTLGHTPIFDTA-QAPILDMFPFI-RNNIP 70 (513) Q Consensus 1 m~~~~~~~~~~~G~~~~~~P~-----~lp--~~a~~~~~Nv~~~-~g~~~~~~g~~~~~~~~-~~~~~~~~~~~-~~g~~ 70 (513) |+..++--..-+|.- +.|. ||. .++...|.|+++. .|++++|+|..=+.... ++....+.+|. ..+.. T Consensus 1 m~~~~~~~~~f~~Ge--~~p~l~~r~D~~~y~~~~~~~~N~~~~~~G~~~~R~g~~~~~~~~~~~~~~rlipf~~~~~~~ 78 (681) T protein:vir:98 1 MSNVRVLQRSFGGGE--ISPEMFGRIDDVKYQSGLAICRNFVVKPQGPAENRAGFAFVREVKDSAKKVRLIPFTYSVTQT 78 (681) T ss_pred CcceeEeeeecCCce--eeeeeccchhHHHHHHHHHHhcCcEEEecCCceecChhHhhhhcCCCCCcEEEEEEEeCCCce Confidence 776555433333322 3333 121 1567789999998 58899999987665554 33445677776 45788 Q ss_pred EEEEEcCceEEEecCce-EEec-----cccceeeCCCCceeEEeeCCEEEEEeCCCceEEEc---CCCceecccC----- Q lcl|NC_010325. 71 YWLLCSEQRLYLADGTT-IIDV-----SPGPYSASITNRWSVGSFNGVIFANDGVNPPHHLP---PSESTFRVLP----- 136 (513) Q Consensus 71 ~~~v~~~~kly~~~~~t-~~di-----s~~~~~~~~~~~w~f~~~~~~~ia~ng~d~~q~~~---~~s~~f~~L~----- 136 (513) +++..+.++|+-|.++. ..+- ..+||....-..-+|+|-.|++++++..-+||.+. .++..|+.+. T Consensus 79 ~~l~~g~~~~r~~~~~~~~~~~~~~~~~~tpy~~~~l~~l~~~q~aD~~~i~h~~~~p~~L~r~~~~~W~l~~~~f~~~p 158 (681) T protein:vir:98 79 MVIELGAGYFRFHTNGGTLLDGAVPYEIANPYAEADLFNIHYVQSADVLTLVHPNYAPRELRRLGATNWQLATIAFTSPV 158 (681) T ss_pred EEEEEeCCeEEEEeCCcEEeeCcEeEEecCCCChhhhcCceEEEEcCEEEEECCCCcceEEEEccCCceEEEEEEecccc Confidence 89999999998886663 2111 02345443333456777777777777666666320 0111111000 Q ss_pred -------------------------------------------------------------------------------- Q lcl|NC_010325. 137 -------------------------------------------------------------------------------- 136 (513) Q Consensus 137 -------------------------------------------------------------------------------- 136 (513) T Consensus 159 ~~p~~~~at~~~~~~~~t~~~~v~avda~t~~~s~~~~~~tvt~~~~~~~~~~t~~w~a~~g~~~~~V~~~~~gi~g~ig 238 (681) T protein:vir:98 159 ATPTSVTATSNNKGTDYTYRYVVTALDAEGKTESAPSSAGTCTNNLFTNGGANTIAWSASSGASRYNVYKEQGGLYGYIG 238 (681) T ss_pred ccceeeeeeccCCccceeEeEEEEEeecccceeecCCcceEEeeeeecCCcceeEEEEecCCceeeeecccceeEEEEee Confidence Q ss_pred -----------------CCcc---------cceeeEEEEEcCEEEEEECCcCcccCCceEEEeccCCccccccccccccc Q lcl|NC_010325. 137 -----------------NFPA---------NTTFKRLKSFKNFLVGLNATSNSVEMPQMVWWSTSADAGGVPASWDPTDP 190 (513) Q Consensus 137 -----------------g~p~---------~~ka~~v~~~~~~l~~~g~t~~~~~~p~rv~wS~~~d~~~~P~~Wd~t~~ 190 (513) ..|+ +--...|..|+|||++++... .|++|+.|..+|. .+++.... T Consensus 239 ~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~gyP~~v~f~q~RL~f~~~~~----~p~~v~~Srsgdy----~nF~~~~~ 310 (681) T protein:vir:98 239 QTTGTSLVDDNIAPDLSVTPPIYDAVFNAAGDYPAAVSYFEQRRCFAGTTN----KPQNIWMTRSGTE----SAMSYSLP 310 (681) T ss_pred ccceeeeeecccccCccccccccccccccCCCceEEEEEEcceEEEeeCCC----CCcEEEEEcccCc----ccccccCC Confidence 0000 001234889999999988753 5899999999996 45554444 Q ss_pred ccCcceeccc---CCCCceeEEEecCcceEEEecCcEEEEEecC-C---CceeEeEEecCccccccCceeEEECCeEEEE Q lcl|NC_010325. 191 TKDAGQNTLA---DTNGAIVDGVKLRDSFIIYKEDSVYSMRYIG-G---LFIFQFQQLFNDVGILGPNCAVEFDGNHFVV 263 (513) Q Consensus 191 t~~a~~~dl~---d~~G~iv~g~~l~~~~vIf~en~i~~m~y~g-~---~~~f~~~~i~~~~G~~~~~siv~~~~~~ffl 263 (513) ..+++=.++. +....|..+++.+ .++|+...+-|.++-.+ + |.--++++.+. .||- .=.-+.+|+.++|+ T Consensus 311 ~~ddD~i~~~~~~~~~~~i~~~v~~~-~lli~t~~~e~~l~~~~~~~lTP~~~~~~~~s~-~g~~-~~~Pv~vg~~v~fv 387 (681) T protein:vir:98 311 VRDDDRVAFRVAAREANAIRHIVPLT-ELLLLTSSGEWRVASVNSDAVTPTTISVRPQSY-VGAT-DVQPVVVNNTTIYG 387 (681) T ss_pred CCCCccEEEEEcCCcceeEEEEEecC-cEEEEEcCcEEEEecCCCccccceeEEEEEeee-eccc-cccceeeCCeEEEE Confidence 4444444443 3333466667765 69999999999997442 2 44578888875 6774 45678899999999 Q ss_pred eCCCeE----EECC--cccccCCchhHHHHHHhhcCcchhCCEEEEEecCCCEEEEEEccCCCCCCcccceEEEEecccC Q lcl|NC_010325. 264 GHGDVY----VHNG--VQKQSVIDAQVRKFFFSDINPDNYQRTFVLADHVNTEMWVCYSSTRSKPGKHCDRAIIWNWKEN 337 (513) Q Consensus 264 s~~G~y----~~~G--~~~~~Ig~~~V~~~~~~~i~~~~~~~~~~~~d~~~~~v~~~~~s~~~~~~~~~d~~lvyd~~~~ 337 (513) ++.|=. .++- ..+++..--.+-+-++..+ .-+...+......+.|+-.+.+. --.+.|+-+.+ T Consensus 388 ~~~g~~vre~~y~~~~d~~~~~dlt~~a~Hl~~~~-----~i~~~a~~~~p~~~~~~v~~dg~------l~~~ty~~eq~ 456 (681) T protein:vir:98 388 AARGGHVRELAYNWQANGFVTGDLSLRAAHLFDNL-----DILDMAYAKAPQPIVWFISSSGK------LLGLTYVPEQQ 456 (681) T ss_pred ecCCCEEEEEEEeeecCceeccchhhhhhhhcCCC-----CeEEEEEecCCCEEEEEEecCCc------EEEEEEecccc Confidence 999822 3331 1222221111222233222 11223344444566666655432 23456665555 Q ss_pred --eEEEEecccee-ee-ee------------------------cccccccce-----eecccCcccCccce-ec-ccccc Q lcl|NC_010325. 338 --TWSIRDLPNVL-SG-AY------------------------GIIDPKVSN-----LWDDDPNPWDTDTS-VW-GEGSY 382 (513) Q Consensus 338 --~Ws~~d~~~~~-~~-~~------------------------g~~~~~~~~-----~~~~~~~~~d~d~~-~~-~~ds~ 382 (513) .|+.-+.+..+ .. ++ ......... ...+-..++..+.. .+ .++ T Consensus 457 v~aW~~~~~~g~v~~v~~i~~~~~d~l~~vv~r~~~g~~~~yie~~~~~~~~~~~~~~~vD~~~t~~~~~~~~~sgl~-- 534 (681) T protein:vir:98 457 IGAWHQHDTDGVFESCAVVAEGNEDRLYAVVRRTIGGNEVRYVERMASRQFDAQADAFFVDSGLTYSGEPVSHISGLE-- 534 (681) T ss_pred eeeEEEEecCCcEEEEEEecCCCCcEEEEEEEecCCCCeEEEEEecCCccccccccceEeeccccccCcceeeecccc-- Confidence 68766544321 11 00 000000000 00000001110100 01 111 Q ss_pred ccCccceEEEEeecCceee----------e--cccceeecCccEEEEeecccccC-----CCcceEEEeeeeeccCCCee Q lcl|NC_010325. 383 NPAKSSMIFSSFQDKKLFL----------F--GNNSTFSGQNFVSTLERSDIYLG-----DDRMMKTVSAIIPHITGNGT 445 (513) Q Consensus 383 ~~~~~~~~~~~~~~~~~~~----------~--~~~~~~~g~~l~a~~~~~~~~~~-----~~~~~~~i~~~~~~~t~~~~ 445 (513) -..|..+.+ ..|+.... . ......-|-++++.++...+.+. ...+.+++.++..++....- T Consensus 535 ~leG~tv~i--~aDG~~~~~~~V~~G~itl~~~~~~v~VGl~Y~s~i~~lp~~~~~~~g~~~g~~~ri~rv~lr~~~S~g 612 (681) T protein:vir:98 535 HLEGKTVSI--LADGAVHPQRVVTDGAIDLDVEAGTVHIGLPITAELQTLPVAMQLDGSFGQGRVKNINKLWLRVHRSSG 612 (681) T ss_pred CCCCcEEEE--EeCCeecCcEeecCcEEEeCcCCceEEEeeeceeEEEecceeeecCCcccCCceEEEEEEEEEEEcccc Confidence 112333322 22332221 1 11225667788888886665542 22456677666655433332 Q ss_pred EEEEeeeeecCC----CCceEcCceeeecCCceEEEeecC-CCeEEEEEEccCCCcEEEEEEeeEEeccc Q lcl|NC_010325. 446 CNIWVGNAQVQG----SGIRWKGPYPYRIGQDYKIDTKHV-GRYIALKFDFSSEGDWYFNGYTIEMAPKA 510 (513) Q Consensus 446 ~~~~~g~~~~~~----~~~~w~~~~~~~~~~~~~~~~R~~-~Ry~~~rl~~~~g~~w~~~G~~~~~~~~g 510 (513) +.+.....+.-. .+-.+.++.+.-+| +..++++.. .+=..++|+.+.-.+.++.++.+|....| T Consensus 613 ~~~~~~~~~l~~~~~~~~~~~g~~~~l~TG-~~~v~v~~~~~~~~~v~I~qd~PlP~tvlsi~~ev~vgg 681 (681) T protein:vir:98 613 IFAGPHADALTEVKQRTSEPYGSPPALKSE-EIPLVLSPKWGDSGQLFVRQADPLPLMIVSMSAEIAIGA 681 (681) T ss_pred eEEeeCCCceEEEEEeccccccccCCccCC-eEEEEeCCCcCcceEEEEEECCCcCEEEEEeeEEEEeeC Confidence 222211110000 00011222222233 233444432 34456788888889999999999988666 No 16 >protein:vir:107802 Length: 681 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:780 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996623;genbank:gi:45580757;genbank:GeneID:2767878 Probab=99.48 E-value=7.5e-12 Score=81.58 Aligned_cols=481 Identities=12% Similarity=0.090 Sum_probs=248.8 Q ss_pred CcccchhhcCccccccccCcc-----cCC--CCcEEEeEEEEEe-CCeeEECCCcceeeecC-CCcceeeeeee-eCCce Q lcl|NC_010325. 1 MALERQEVKNPTGIVTDIAPA-----DLP--LEKWSFGNNVRFK-NGKAQKTLGHTPIFDTA-QAPILDMFPFI-RNNIP 70 (513) Q Consensus 1 m~~~~~~~~~~~G~~~~~~P~-----~lp--~~a~~~~~Nv~~~-~g~~~~~~g~~~~~~~~-~~~~~~~~~~~-~~g~~ 70 (513) |+..++--..-+|.- +.|. ||. .++...|.|+++. .|++++|+|..=+.... ++....+.+|. ..+.. T Consensus 1 m~~~~~~~~~f~~Ge--~~p~l~~r~D~~~y~~~~~~~~N~~~~~~G~~~~R~g~~~~~~~~~~~~~~rlipf~~~~~~~ 78 (681) T protein:vir:10 1 MSNVRVLQRSFGGGE--ISPEMFGRIDDVKYQSGLAICRNFVVKPQGPAENRAGFAFVREVKDSAKKVRLIPFTYSVTQT 78 (681) T ss_pred CcceeEeeeecCCce--eeeeeccchhHHHHHHHHHHhcCcEEEecCCceecChhHhhhhcCCCCCcEEEEEEEeCCCce Confidence 776555433333322 3333 121 1567789999998 58899999987665554 33445677776 45788 Q ss_pred EEEEEcCceEEEecCce-EEec-----cccceeeCCCCceeEEeeCCEEEEEeCCCceEEEc---CCCceecccC----- Q lcl|NC_010325. 71 YWLLCSEQRLYLADGTT-IIDV-----SPGPYSASITNRWSVGSFNGVIFANDGVNPPHHLP---PSESTFRVLP----- 136 (513) Q Consensus 71 ~~~v~~~~kly~~~~~t-~~di-----s~~~~~~~~~~~w~f~~~~~~~ia~ng~d~~q~~~---~~s~~f~~L~----- 136 (513) +++..+.++|+-|.++. ..+- ..+||....-..-+|+|-.|++++++..-+||.+. .++..|+.+. T Consensus 79 ~~l~~g~~~~r~~~~~~~~~~~~~~~~~~tpy~~~~l~~l~~~q~aD~~~i~h~~~~p~~L~r~~~~~W~l~~~~f~~~p 158 (681) T protein:vir:10 79 MVIELGAGYFRFHTNGGTLLDGAVPYEIANPYAEADLFNIHYVQSADVLTLVHPNYAPRELRRLGATNWQLATIAFTSPV 158 (681) T ss_pred EEEEEeCCeEEEEeCCcEEeeCcEeEEecCCCChhhhcCceEEEEcCEEEEECCCCcceEEEEccCCceEEEEEEecccc Confidence 89999999998886663 2111 02345443333456777777777777666666320 0111111000 Q ss_pred -------------------------------------------------------------------------------- Q lcl|NC_010325. 137 -------------------------------------------------------------------------------- 136 (513) Q Consensus 137 -------------------------------------------------------------------------------- 136 (513) T Consensus 159 ~~p~~~~at~~~~~~~~t~~~~v~avda~t~~~s~~~~~~tvt~~~~~~~~~~t~~w~a~~g~~~~~V~~~~~gi~g~ig 238 (681) T protein:vir:10 159 ATPTSVTATSNNKGTDYTYRYVVTALDAEGKTESAPSSAGTCTNNLFTNGGANTIAWSASSGASRYNVYKEQGGLYGYIG 238 (681) T ss_pred ccceeeeeeccCCccceeEeEEEEEeecccceeecCCcceEEeeeeecCCcceeEEEEecCCceeeeecccceeEEEEee Confidence Q ss_pred -----------------CCcc---------cceeeEEEEEcCEEEEEECCcCcccCCceEEEeccCCccccccccccccc Q lcl|NC_010325. 137 -----------------NFPA---------NTTFKRLKSFKNFLVGLNATSNSVEMPQMVWWSTSADAGGVPASWDPTDP 190 (513) Q Consensus 137 -----------------g~p~---------~~ka~~v~~~~~~l~~~g~t~~~~~~p~rv~wS~~~d~~~~P~~Wd~t~~ 190 (513) ..|+ +--...|..|+|||++++... .|++|+.|..+|. .+++.... T Consensus 239 ~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~gyP~~v~f~q~RL~f~~~~~----~p~~v~~Srsgdy----~nF~~~~~ 310 (681) T protein:vir:10 239 QTTGTSLVDDNIAPDLSVTPPIYDAVFNAAGDYPAAVSYFEQRRCFAGTTN----KPQNIWMTRSGTE----SAMSYSLP 310 (681) T ss_pred ccceeeeeecccccCccccccccccccccCCCceEEEEEEcceEEEeeCCC----CCcEEEEEcccCc----ccccccCC Confidence 0000 001234889999999988753 5899999999996 45554444 Q ss_pred ccCcceeccc---CCCCceeEEEecCcceEEEecCcEEEEEecC-C---CceeEeEEecCccccccCceeEEECCeEEEE Q lcl|NC_010325. 191 TKDAGQNTLA---DTNGAIVDGVKLRDSFIIYKEDSVYSMRYIG-G---LFIFQFQQLFNDVGILGPNCAVEFDGNHFVV 263 (513) Q Consensus 191 t~~a~~~dl~---d~~G~iv~g~~l~~~~vIf~en~i~~m~y~g-~---~~~f~~~~i~~~~G~~~~~siv~~~~~~ffl 263 (513) ..+++=.++. +....|..+++.+ .++|+...+-|.++-.+ + |.--++++.+. .||- .=.-+.+|+.++|+ T Consensus 311 ~~ddD~i~~~~~~~~~~~i~~~v~~~-~lli~t~~~e~~l~~~~~~~lTP~~~~~~~~s~-~g~~-~~~Pv~vg~~v~fv 387 (681) T protein:vir:10 311 VRDDDRVAFRVAAREANAIRHIVPLT-ELLLLTSSGEWRVASVNSDAVTPTTISVRPQSY-VGAT-DVQPVVVNNTTIYG 387 (681) T ss_pred CCCCccEEEEEcCCcceeEEEEEecC-cEEEEEcCcEEEEecCCCccccceeEEEEEeee-eccc-cccceeeCCeEEEE Confidence 4444444443 3333466667765 69999999999997442 2 44578888875 6774 45678899999999 Q ss_pred eCCCeE----EECC--cccccCCchhHHHHHHhhcCcchhCCEEEEEecCCCEEEEEEccCCCCCCcccceEEEEecccC Q lcl|NC_010325. 264 GHGDVY----VHNG--VQKQSVIDAQVRKFFFSDINPDNYQRTFVLADHVNTEMWVCYSSTRSKPGKHCDRAIIWNWKEN 337 (513) Q Consensus 264 s~~G~y----~~~G--~~~~~Ig~~~V~~~~~~~i~~~~~~~~~~~~d~~~~~v~~~~~s~~~~~~~~~d~~lvyd~~~~ 337 (513) ++.|=. .++- ..+++..--.+-+-++..+ .-+...+......+.|+-.+.+. --.+.|+-+.+ T Consensus 388 ~~~g~~vre~~y~~~~d~~~~~dlt~~a~Hl~~~~-----~i~~~a~~~~p~~~~~~v~~dg~------l~~~ty~~eq~ 456 (681) T protein:vir:10 388 AARGGHVRELAYNWQANGFVTGDLSLRAAHLFDNL-----DILDMAYAKAPQPIVWFISSSGK------LLGLTYVPEQQ 456 (681) T ss_pred ecCCCEEEEEEEeeecCceeccchhhhhhhhcCCC-----CeEEEEEecCCCEEEEEEecCCc------EEEEEEecccc Confidence 999822 3331 1222221111222233222 11223344444566666655432 23456665555 Q ss_pred --eEEEEecccee-ee-ee------------------------cccccccce-----eecccCcccCccce-ec-ccccc Q lcl|NC_010325. 338 --TWSIRDLPNVL-SG-AY------------------------GIIDPKVSN-----LWDDDPNPWDTDTS-VW-GEGSY 382 (513) Q Consensus 338 --~Ws~~d~~~~~-~~-~~------------------------g~~~~~~~~-----~~~~~~~~~d~d~~-~~-~~ds~ 382 (513) .|+.-+.+..+ .. ++ ......... ...+-..++..+.. .+ .++ T Consensus 457 v~aW~~~~~~g~v~~v~~i~~~~~d~l~~vv~r~~~g~~~~yie~~~~~~~~~~~~~~~vD~~~t~~~~~~~~~sgl~-- 534 (681) T protein:vir:10 457 IGAWHQHDTDGVFESCAVVAEGNEDRLYAVVRRTIGGNEVRYVERMASRQFDAQADAFFVDSGLTYSGEPVSHISGLE-- 534 (681) T ss_pred eeeEEEEecCCcEEEEEEecCCCCcEEEEEEEecCCCCeEEEEEecCCccccccccceEeeccccccCcceeeecccc-- Confidence 68766544321 11 00 000000000 00000001110100 01 111 Q ss_pred ccCccceEEEEeecCceee----------e--cccceeecCccEEEEeecccccC-----CCcceEEEeeeeeccCCCee Q lcl|NC_010325. 383 NPAKSSMIFSSFQDKKLFL----------F--GNNSTFSGQNFVSTLERSDIYLG-----DDRMMKTVSAIIPHITGNGT 445 (513) Q Consensus 383 ~~~~~~~~~~~~~~~~~~~----------~--~~~~~~~g~~l~a~~~~~~~~~~-----~~~~~~~i~~~~~~~t~~~~ 445 (513) -..|..+.+ ..|+.... . ......-|-++++.++...+.+. ...+.+++.++..++....- T Consensus 535 ~leG~tv~i--~aDG~~~~~~~V~~G~itl~~~~~~v~VGl~Y~s~i~~lp~~~~~~~g~~~g~~~ri~rv~lr~~~S~g 612 (681) T protein:vir:10 535 HLEGKTVSI--LADGAVHPQRVVTDGAIDLDVEAGTVHIGLPITAELQTLPVAMQLDGSFGQGRVKNINKLWLRVHRSSG 612 (681) T ss_pred CCCCcEEEE--EeCCeecCcEeecCcEEEeCcCCceEEEeeeceeEEEecceeeecCCcccCCceEEEEEEEEEEEcccc Confidence 112333322 22332221 1 11225667788888886665542 22456677666655433332 Q ss_pred EEEEeeeeecCC----CCceEcCceeeecCCceEEEeecC-CCeEEEEEEccCCCcEEEEEEeeEEeccc Q lcl|NC_010325. 446 CNIWVGNAQVQG----SGIRWKGPYPYRIGQDYKIDTKHV-GRYIALKFDFSSEGDWYFNGYTIEMAPKA 510 (513) Q Consensus 446 ~~~~~g~~~~~~----~~~~w~~~~~~~~~~~~~~~~R~~-~Ry~~~rl~~~~g~~w~~~G~~~~~~~~g 510 (513) +.+.....+.-. .+-.+.++.+.-+| +..++++.. .+=..++|+.+.-.+.++.++.+|....| T Consensus 613 ~~~~~~~~~l~~~~~~~~~~~g~~~~l~TG-~~~v~v~~~~~~~~~v~I~qd~PlP~tvlsi~~ev~vgg 681 (681) T protein:vir:10 613 IFAGPHADALTEVKQRTSEPYGSPPALKSE-EIPLVLSPKWGDSGQLFVRQADPLPLMIVSMSAEIAIGA 681 (681) T ss_pred eEEeeCCCceEEEEEeccccccccCCccCC-eEEEEeCCCcCcceEEEEEECCCcCEEEEEeeEEEEeeC Confidence 222211110000 00011222222233 233444432 34456788888889999999999988666 No 17 >protein:vir:102644 Length: 594 # NCBI annotation: Hypothetical protein # Family: family:all:780 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_024422;genbank:gi:48696643;genbank:GeneID:2948111 Probab=99.31 E-value=1.3e-10 Score=74.79 Aligned_cols=484 Identities=11% Similarity=0.020 Sum_probs=249.3 Q ss_pred CcccchhhcCccccccccCcc-----cCCC--CcEEEeEEEEEe-CCeeEECCCcceeeecC-CCcceeeeeee-eCCce Q lcl|NC_010325. 1 MALERQEVKNPTGIVTDIAPA-----DLPL--EKWSFGNNVRFK-NGKAQKTLGHTPIFDTA-QAPILDMFPFI-RNNIP 70 (513) Q Consensus 1 m~~~~~~~~~~~G~~~~~~P~-----~lp~--~a~~~~~Nv~~~-~g~~~~~~g~~~~~~~~-~~~~~~~~~~~-~~g~~ 70 (513) |+-..|..-.. | -++|. ||.. +++..|.||++. .|++.+|+|..=+..+. ++.-..+.+|. +.... T Consensus 1 m~~~~~~~F~~-G---elsP~l~~r~Dl~~y~~~~~~~~n~~~~~~G~~~rR~G~~~~~~~~~~~~~~~lipF~~s~~~~ 76 (594) T protein:vir:10 1 MADFSQTSFKG-G---VIAPRLQFNEYESAYHHSIEDAVNFVVTEQGSLITRCGSEEVGLCQDGEVRLFRLPAVDAPSND 76 (594) T ss_pred CceeeccccCc-c---eecceeccchhHHHHHHHHhhhhceEEEecCCeecCChhHhhhhccCCCCCEEEEEEEeCCCCe Confidence 76655444221 1 12222 2222 677889999998 68899998877665554 34446777776 45677 Q ss_pred EEEEEcCceEEEecCc-eEEeccc-ccee------eCC---CCceeEEeeCCEEEEEeCCCceEEEc---CCCceecccC Q lcl|NC_010325. 71 YWLLCSEQRLYLADGT-TIIDVSP-GPYS------ASI---TNRWSVGSFNGVIFANDGVNPPHHLP---PSESTFRVLP 136 (513) Q Consensus 71 ~~~v~~~~kly~~~~~-t~~dis~-~~~~------~~~---~~~w~f~~~~~~~ia~ng~d~~q~~~---~~s~~f~~L~ 136 (513) +++.++...++-|.+. +++...+ .+|. ... -..-++++-++.++++-..-+|+.+- .+...+..+. T Consensus 77 ~~le~g~~~~r~~~~~~~~v~~~~~~~~~~~tp~~~t~~~~l~~i~~tqsad~~~~~~~~~~p~~L~R~~~~~w~~~~~~ 156 (594) T protein:vir:10 77 VIVEVGNTNIAVWVNDVRQVVANTPSEWRNTIDRIQTAYDTIGDDAGAANTGRLIMVHPALQPKRLYRDNNNAWQFVNMH 156 (594) T ss_pred EEEEEcCCeEEEEecCcEEEEccCCCcccccccceeeccCCccceEEEEEeeEEEEEcCCCCceEEEEccCCCceEEecc Confidence 8888889888877555 3444332 2332 211 12347888899998888877777532 1233333332 Q ss_pred --CCcc----cceeeEEEEEcCEEEEEECCcCcccCCceEEEeccCCcccccccccccccccCcceeccc-CCCCceeEE Q lcl|NC_010325. 137 --NFPA----NTTFKRLKSFKNFLVGLNATSNSVEMPQMVWWSTSADAGGVPASWDPTDPTKDAGQNTLA-DTNGAIVDG 209 (513) Q Consensus 137 --g~p~----~~ka~~v~~~~~~l~~~g~t~~~~~~p~rv~wS~~~d~~~~P~~Wd~t~~t~~a~~~dl~-d~~G~iv~g 209 (513) +.|. .--...|..|++||++++.. +.||.|+.|..+|.+ +++......+++-.++. .....++.. T Consensus 157 ~~~~p~~~~~~~~p~~v~f~q~RL~f~~~~----~~p~~v~~Srtgd~~----nF~~~~~~~ddd~i~~~~s~~~~~~~~ 228 (594) T protein:vir:10 157 TGAVPAEWSPSNYPQTVGIFQNRVWYVGSP----VHRTYFWATRAGKLE----DIAPSTANNPNDPISFVGIMEGTPCWI 228 (594) T ss_pred cCcccccccCCccceEEEEEeeeEEEEeCC----CCCceEEEEeccccc----ccccCCCCCCCccEEEEEecccceEEE Confidence 1111 00223589999999987764 469999999999964 34444444455544544 223556777 Q ss_pred EecCcceEEEecCcEEEEEec-C---CCceeEeEEecCccccccCceeEEECCeEEEEeCCCe----EEECCcc--cccC Q lcl|NC_010325. 210 VKLRDSFIIYKEDSVYSMRYI-G---GLFIFQFQQLFNDVGILGPNCAVEFDGNHFVVGHGDV----YVHNGVQ--KQSV 279 (513) Q Consensus 210 ~~l~~~~vIf~en~i~~m~y~-g---~~~~f~~~~i~~~~G~~~~~siv~~~~~~ffls~~G~----y~~~G~~--~~~I 279 (513) ++-.+.++||...+-|.++-. + .|..-++++.+. .|| +.---+.+|+.++|+++.|= +.++..+ +++. T Consensus 229 v~~~~~L~i~t~~~e~~l~~~~~~~lTp~~~~~~~~s~-~g~-~~~~P~~vg~~~~fv~~~g~~vre~~y~~~~d~y~~~ 306 (594) T protein:vir:10 229 IASSDVLTIGTTINDYQLAASTGVSVTAATAILRRSSV-QGT-AAVQGIPAEEQVIFCSRNKSKVYAMNYVREQDNWIPD 306 (594) T ss_pred EecCCceEEEecCceEEEecCCCcccccceEEEEEeee-ecc-CCCcceeeCCeEEEEcCCCCEEEEEEEeeccCceecc Confidence 778888999999999999743 2 244577887775 477 33345788999999999982 2333222 2222 Q ss_pred CchhHHHHHHhhcCcchhCCEE-EEEecCCCEEEEEEccCCCCCCcccceEEEEecccC--eEEEEeccce-----e--- Q lcl|NC_010325. 280 IDAQVRKFFFSDINPDNYQRTF-VLADHVNTEMWVCYSSTRSKPGKHCDRAIIWNWKEN--TWSIRDLPNV-----L--- 348 (513) Q Consensus 280 g~~~V~~~~~~~i~~~~~~~~~-~~~d~~~~~v~~~~~s~~~~~~~~~d~~lvyd~~~~--~Ws~~d~~~~-----~--- 348 (513) .--++-.-++..+.......+. -++...-..+.|+..+.+.- ..+-|+-+.+ .|+.-+++.- | T Consensus 307 dlt~~a~hl~~~~~~~~~~~i~~~a~~~~p~~~~~~v~~dG~l------~~~ty~~eq~v~aWs~~~~t~G~v~~va~i~ 380 (594) T protein:vir:10 307 EMSSQAQHLFTPISSAKGASVRRVAYISDAAKSLWVVLENGQI------NYCCFDRTTDTKAWTQLELSGGKVIDIAAAF 380 (594) T ss_pred chhhhhhhhcCccccccCceEEEEEEecCCceEEEEEeCCCeE------EEEEEecccceeeeEeeccCCCcEEEEEEee Confidence 1111222333333333333332 22344435666776554332 2345554443 5765443221 1 Q ss_pred -------eeeeccc---ccc------cceeecc----cCc-cc-C----ccceeccccccccCccceEEEEeecCcee-- Q lcl|NC_010325. 349 -------SGAYGII---DPK------VSNLWDD----DPN-PW-D----TDTSVWGEGSYNPAKSSMIFSSFQDKKLF-- 400 (513) Q Consensus 349 -------~~~~g~~---~~~------~~~~~~~----~~~-~~-d----~d~~~~~~ds~~~~~~~~~~~~~~~~~~~-- 400 (513) +...... ..+ .+..... .+. .+ + .+..+.+++.. .|..+. -..|+.+. T Consensus 381 ~~~~d~l~~~V~R~~ti~g~~~~y~~lE~~~~~~~~~~~~~~~~d~~~~~~~~vsgl~hL--eg~tv~--v~aDG~~~~~ 456 (594) T protein:vir:10 381 NPDSDYAYVAVVRSKAINGVQKNYTVLEKISSPRTDWKRADGWVVAQVNQNGDVLNLDRY--IGRTAV--IFSKYGLEAE 456 (594) T ss_pred cCCCCEEEEEEEECCccccceeeEEEeecCCCccccccccceeeeecccccceeeccccc--CCceEE--EEeCCeecCC Confidence 0000000 000 0000000 000 00 0 00001111111 122221 12233221 Q ss_pred -eec-------------ccceeecCccEEEEeecccccCC-----CcceEEEeeeeeccCCCeeEEEEeeeeecCC--CC Q lcl|NC_010325. 401 -LFG-------------NNSTFSGQNFVSTLERSDIYLGD-----DRMMKTVSAIIPHITGNGTCNIWVGNAQVQG--SG 459 (513) Q Consensus 401 -~~~-------------~~~~~~g~~l~a~~~~~~~~~~~-----~~~~~~i~~~~~~~t~~~~~~~~~g~~~~~~--~~ 459 (513) .+. .....-|-++++.++...+.... -.+.+++.++....-.... +.+|....+. .. T Consensus 457 ~~V~~g~itL~~~~~~~~~~v~VGl~Y~s~i~~lp~~~~~~~gs~~g~r~ri~r~~v~~~~S~g--~~vg~~~~~~r~~~ 534 (594) T protein:vir:10 457 VEVNNIGLTHRINGYDPNTVYYVGYKMDSYFRTLTPSNGDMKKSMFGSKIRISKVQLALFDSIE--PTVNGEPADDRSTD 534 (594) T ss_pred eEEcCCeeEeeccCCCCcceEEEeeeeeEEEEeecccccCCcccccCccEEEEEEEEEEEccee--eEECCcccccccch Confidence 110 11133477777787755554432 2356666665544333322 2334321111 00 Q ss_pred -------ceEcCceeeecCCceEEEeecCC--CeEEEEEEccCCCcEEEEEEeeEEecccc Q lcl|NC_010325. 460 -------IRWKGPYPYRIGQDYKIDTKHVG--RYIALKFDFSSEGDWYFNGYTIEMAPKAG 511 (513) Q Consensus 460 -------~~w~~~~~~~~~~~~~~~~R~~~--Ry~~~rl~~~~g~~w~~~G~~~~~~~~g~ 511 (513) ....+.....+| ++.+.+...| +=..++|+-+.-.+.++.|+.+|...--- T Consensus 535 ~~~~~~~~~~~g~~~~~tg-~~~v~~~~~G~~~~~~i~I~qd~PlPltvlai~~ev~~~~~ 594 (594) T protein:vir:10 535 DIMDARLLDFSSNSGSSNG-TRLVDYNPLGWENDGKMVIAVEQPFLCEVVGVFSVVQSNKV 594 (594) T ss_pred hhccccCCcccCcccccCC-ceEEEEccCCcCcccEEEEEECCCcCEEEEEEEEEEEeccC Confidence 111122222333 2334443333 56677788888899999998888753333 No 18 >protein:vir:103790 Length: 768 # NCBI annotation: hypothetical protein # Family: family:all:780 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024932;genbank:gi:48697202;genbank:GeneID:2846114 Probab=99.01 E-value=5.5e-09 Score=65.87 Aligned_cols=484 Identities=11% Similarity=0.033 Sum_probs=236.1 Q ss_pred CcccchhhcCccccccccCccc-----C--CCCcEEEeEEEEEe-CCeeEECCCcceeeecCCC-cceeeeeee-eCCce Q lcl|NC_010325. 1 MALERQEVKNPTGIVTDIAPAD-----L--PLEKWSFGNNVRFK-NGKAQKTLGHTPIFDTAQA-PILDMFPFI-RNNIP 70 (513) Q Consensus 1 m~~~~~~~~~~~G~~~~~~P~~-----l--p~~a~~~~~Nv~~~-~g~~~~~~g~~~~~~~~~~-~~~~~~~~~-~~g~~ 70 (513) |++.++-..+-+|.- ++|.- | =++....|.||.+. .|++++|+|..=+...... .-..+.++. ..+.. T Consensus 1 M~~~~~~~~~F~~Ge--lsP~l~~r~Dl~ry~~~~~~~~N~~~~~~gGl~rRpGt~fv~~~~~~~~~~~lipf~~~~~~~ 78 (768) T protein:vir:10 1 MPKAAPQQVSFDAGE--LSPLLGARVDLAKYPNGCQVMENFIATVQGPAIRRGGKRFVAATKDSTKQSWLLPFIVADGIA 78 (768) T ss_pred CCcceeeeeeccCce--echhhcccchHHHHHHHHhhhhcceeeecCCceecCchhhhhhhcCCCCCeeEEEEEecCccE Confidence 998887777777762 12211 1 13677889999998 5899999997666544321 222344444 45778 Q ss_pred EEEEEcCceEEEecCceEEeccccce------eeCC---C---CceeEEeeCCEEEEEeCCCceEEEcC---CCc----- Q lcl|NC_010325. 71 YWLLCSEQRLYLADGTTIIDVSPGPY------SASI---T---NRWSVGSFNGVIFANDGVNPPHHLPP---SES----- 130 (513) Q Consensus 71 ~~~v~~~~kly~~~~~t~~dis~~~~------~~~~---~---~~w~f~~~~~~~ia~ng~d~~q~~~~---~s~----- 130 (513) ++++.+.+.|+-|+++........++ +... + ..-++++.+|+++++|..-+||.+.- ++. T Consensus 79 y~l~fg~~~irv~~~~g~v~~~~~~~e~~tp~~~~~l~~~~~~~~L~~~q~aD~~~i~~~~~~p~~l~r~~~~~w~l~~~ 158 (768) T protein:vir:10 79 YMLEFGDHYIRFFVNRGQLVNAGAPVEIATPYALADLTTEDGTFAIRATQSADTMYLFHGGYPTQKLLRTSATTFSLQPV 158 (768) T ss_pred EEEEEcCCEEEEEECCcEEEecCeeEEEEcCCCcceeecccccceeEEEeecCEEEEEcCCcceeEEEEecCCCceeEEe Confidence 88888888888887664221122111 1100 0 01356666666666665544432000 000 Q ss_pred -------------------------e------------------------------------------------------ Q lcl|NC_010325. 131 -------------------------T------------------------------------------------------ 131 (513) Q Consensus 131 -------------------------~------------------------------------------------------ 131 (513) . T Consensus 159 ~~~~gp~~~~n~~~~vti~~s~~~~~~T~tasa~~~~~~~v~~~~~l~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~ 238 (768) T protein:vir:10 159 TFVGGPFAAVNSDNNVRVHASAGTGAVTLVASASVFRPSDVGTLFYLEQEDNSFVKPWVVHQKIGPSELRRVGDRVYLCT 238 (768) T ss_pred eecCccccccccceeEEEEecccceeEEEeecCCccchhhcceeeeeeeeccccccccEEEEeeeeEEEEecCCceEEee Confidence 0 Q ss_pred -----eccc-----C----CC----------------------------------------------------------- Q lcl|NC_010325. 132 -----FRVL-----P----NF----------------------------------------------------------- 138 (513) Q Consensus 132 -----f~~L-----~----g~----------------------------------------------------------- 138 (513) ..+. + |. T Consensus 239 ~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~t~~~~~~~~~~~~~~~~~~ 318 (768) T protein:vir:10 239 AVGTATPQVTGTETPTHTSGSRWDGTGQDESATDEYGSIGAEWEYQHSGYGTVLITGYTNDQVVTGTVATNDPADPGMLP 318 (768) T ss_pred eeccccccccceeccccccCceEEEecCcccccccccccceEEEEEEcCCceEEEEEecCCeeEEeeeeeecCccccccc Confidence 0000 0 00 Q ss_pred ------------------cccceeeEEEEEcCEEEEEECCcCcccCCceEEEeccCCcccccccccccccc-cCcceecc Q lcl|NC_010325. 139 ------------------PANTTFKRLKSFKNFLVGLNATSNSVEMPQMVWWSTSADAGGVPASWDPTDPT-KDAGQNTL 199 (513) Q Consensus 139 ------------------p~~~ka~~v~~~~~~l~~~g~t~~~~~~p~rv~wS~~~d~~~~P~~Wd~t~~t-~~a~~~dl 199 (513) +.+.-...|..|++||++++ |+.|+.|..+|.+++ |-.+..+ .+++=.++ T Consensus 319 ~~~~~~~~t~~~~~~~~~~~~g~Ps~v~f~q~RL~f~~--------~~~v~~Srtgd~~nF---~~~s~~~~~DdD~I~~ 387 (768) T protein:vir:10 319 NTVVTLTGTYKWARSLFNSTDGFPQMGTFWRNRLCLMR--------DRWLAMSVSADFETF---KTKDADQQTDDSAIVQ 387 (768) T ss_pred ccccccCCCcccccCCCcCCCCCceEEEEEeeeEEEee--------CCEEEEEcccccccc---cccccccccCCccEEE Confidence 00001145778888888765 688999999996532 3222211 12222333 Q ss_pred c---CCCCceeEEEecCcceEEEecCcEEEEEecC-----CCceeEeEEecCccccccCceeEEECCeEEEEeCCC--eE Q lcl|NC_010325. 200 A---DTNGAIVDGVKLRDSFIIYKEDSVYSMRYIG-----GLFIFQFQQLFNDVGILGPNCAVEFDGNHFVVGHGD--VY 269 (513) Q Consensus 200 ~---d~~G~iv~g~~l~~~~vIf~en~i~~m~y~g-----~~~~f~~~~i~~~~G~~~~~siv~~~~~~ffls~~G--~y 269 (513) . +....|..+++.+ .++||...+-|.++-.+ .|.-.++++++. .||-+ =.-+.+|+.++|+++.| +. T Consensus 388 ~~ss~~~~~i~~~v~~~-~L~i~T~~~q~~l~~~~~~~~lTP~~~~i~~~s~-~g~~~-~~Pv~vG~~v~fv~~~g~~vr 464 (768) T protein:vir:10 388 QLNARQLNKLAWMVESD-SLLIGMTGDEWVIGPANASQPVSAANLNAARRTS-YGSKR-IQPVQVGGTIMFVQKAGRKLR 464 (768) T ss_pred EecCCcceeEEEEeecC-cEEEEecCceEEEecCCCCcccccceEEEEEeeh-hcccc-cccEEeCCeEEEEcCCCCEEE Confidence 3 2233467777775 69999999999997432 466688888875 58843 34478999999999998 42 Q ss_pred --EECC--cccccCCchhHHHHHHhhcCcchhCCEEEEEecCCCEEEEEEccCCCCCCcccceEEEEeccc--C---eEE Q lcl|NC_010325. 270 --VHNG--VQKQSVIDAQVRKFFFSDINPDNYQRTFVLADHVNTEMWVCYSSTRSKPGKHCDRAIIWNWKE--N---TWS 340 (513) Q Consensus 270 --~~~G--~~~~~Ig~~~V~~~~~~~i~~~~~~~~~~~~d~~~~~v~~~~~s~~~~~~~~~d~~lvyd~~~--~---~Ws 340 (513) .++- ..+++..--.+-+-|+..+.....+-..-++-.....+.|+-.+.+.- -.+-|+.+. | .|+ T Consensus 465 e~~y~~~~d~y~a~DlT~~a~hl~~~~~~~~~~i~~~a~~~~p~~v~~~v~~dg~l------~~~ty~~e~~~q~v~aW~ 538 (768) T protein:vir:10 465 DFKYDFSSDNYVSTDVTKIADHITRGRAGTNSGIMSLCFQQEPHSVVWAARADGQL------IGCTYDEEAGRSDVYGWH 538 (768) T ss_pred EEEeeeecCceecchhhhhhhhhccccCccccceeeEEEeecCCeEEEEEecCCeE------EEEEEecCCCceeEEeEE Confidence 2332 222221111121223333211111112222333445567776654321 134555542 2 688 Q ss_pred EEecccee-----------------eeeecc--------cccccceeeccc---Cc--------ccCc-cceeccccccc Q lcl|NC_010325. 341 IRDLPNVL-----------------SGAYGI--------IDPKVSNLWDDD---PN--------PWDT-DTSVWGEGSYN 383 (513) Q Consensus 341 ~~d~~~~~-----------------~~~~g~--------~~~~~~~~~~~~---~~--------~~d~-d~~~~~~ds~~ 383 (513) .-+++.-. +..... ........+..+ +. +++. ....++ .... T Consensus 539 ~~~~~~g~v~~v~~i~~~~g~~d~l~~~v~r~~~g~~~~~ie~l~~~~~~~~~~~~~~~~D~~~~~~~~~~~~~~-gl~~ 617 (768) T protein:vir:10 539 RHPDANGFVECVASMPAPDGASDDLWVIVRRQVNGQTVRYVEYLNPALQDDEPQSSAFYVDAGITYNGVPTSTIA-GLGH 617 (768) T ss_pred EEEcCCCEEEEEEEEecCCCCccEEEEEEEecCCCeEEEEEEecCcccccccccccceEeccccccCCcceeeec-CCCC Confidence 66543311 000000 000000000000 00 0000 000000 0011 Q ss_pred cCccceEEEEeecCcee------------eecccceeecCccEEEEeecccccC--C---CcceEEEeeeeeccCCCeeE Q lcl|NC_010325. 384 PAKSSMIFSSFQDKKLF------------LFGNNSTFSGQNFVSTLERSDIYLG--D---DRMMKTVSAIIPHITGNGTC 446 (513) Q Consensus 384 ~~~~~~~~~~~~~~~~~------------~~~~~~~~~g~~l~a~~~~~~~~~~--~---~~~~~~i~~~~~~~t~~~~~ 446 (513) ..|....+ ..|+... ........-|-++++.++...+... + ....+++.++.-.+.....+ T Consensus 618 leg~~v~v--~~dG~~~~~~~v~~g~itl~~~~~~v~vG~~y~s~~~~~p~~~~~~~gs~~~~~~ri~r~~v~~~~S~~~ 695 (768) T protein:vir:10 618 LEGVTVAV--LTDGAVHPSRTVTAGAITLDWSASIVHIGVPTTCRIQTMQLNAGAANGTAQGKTKRVTNIATRFSRSLGG 695 (768) T ss_pred cccceEEE--EECCEeccCceecCCEEEeCCCCceEEEeEeeeEEEEecceEeecCCccccccceEEEEEEEEEecccce Confidence 12332222 1222211 1122335667788888886665542 2 23566666655543333222 Q ss_pred EEEeeeeecCCCCce-------EcCceeeecCCceEEEeecCCCe---EEEEEEccCCCcEEEEEEeeEEeccccCC Q lcl|NC_010325. 447 NIWVGNAQVQGSGIR-------WKGPYPYRIGQDYKIDTKHVGRY---IALKFDFSSEGDWYFNGYTIEMAPKAGMR 513 (513) Q Consensus 447 ~~~~g~~~~~~~~~~-------w~~~~~~~~~~~~~~~~R~~~Ry---~~~rl~~~~g~~w~~~G~~~~~~~~g~rr 513 (513) .+.....+....... ...+.+.- ++.+++...+++ ..++|+.+.-.+.++.++++|.. -.-|| T Consensus 696 ~~~~~~~~~~~~~~~~r~~~~~~~~~~~l~---TG~~~v~~~~~~~~~~~i~i~~d~P~P~tvlsi~~~~~-~nd~~ 768 (768) T protein:vir:10 696 VVGPTFDDNDLEQLSFRKPSNAMDRAVPLF---DGDMESDWRGGYEGQSWICYQNDQPLPVTLLGFFPILD-TQDDR 768 (768) T ss_pred EEEecCCCCCceeeeeEecCcccCccCCcc---cCEEEEEecCCCCcceEEEEEECCCCCEEEEEEEEEEE-EeecC Confidence 222111111000111 11122222 244445545544 56788888889999999999976 66777 No 19 >protein:vir:8887 Length: 808 # NCBI annotation: tail tubular protein B # Family: family:all:825 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813776;genbank:gi:29366731;genbank:GeneID:1258831 Probab=99.00 E-value=6.7e-09 Score=65.41 Aligned_cols=483 Identities=13% Similarity=0.095 Sum_probs=225.7 Q ss_pred CcccchhhcCccccccccCcccCCCCcEEEeEEEEEe-CCeeEECCCcceeee---c--CCCcc-eeeeeeeeCCceEEE Q lcl|NC_010325. 1 MALERQEVKNPTGIVTDIAPADLPLEKWSFGNNVRFK-NGKAQKTLGHTPIFD---T--AQAPI-LDMFPFIRNNIPYWL 73 (513) Q Consensus 1 m~~~~~~~~~~~G~~~~~~P~~lp~~a~~~~~Nv~~~-~g~~~~~~g~~~~~~---~--~~~~~-~~~~~~~~~g~~~~~ 73 (513) |+..-+.+.+..|.|.-..+..==++....|.||+++ .++++||||..=+.. + .+..+ +..+.. .....+++ T Consensus 1 M~~v~~s~~n~~~GvSqq~d~~R~~~q~~~~~N~~~~~~gG~~rRpgt~~v~~l~~~~~~~~~~~~~~~~~-~~~~~y~v 79 (808) T protein:vir:88 1 MGLVSQSVKNLKGGISQQPDILRFSNQGALQINGWSSETQGLQKRPPTTFTKRLQNKGFLGTKPLVHLINR-DAQEQYFV 79 (808) T ss_pred CcceeeecchhccceeccchhHhhhhhhhhhhcceeeeccccccCCchheeeeeeccCCCCCCcEEEEEEe-CcCceEEE Confidence 9999999999999998665554456888999999999 588999998655532 1 22222 233322 22233333 Q ss_pred EEcCc--eEEEecCceEEeccc-cce--eeCCCCceeEEeeCCEEEEEeCCCceEE------------------------ Q lcl|NC_010325. 74 LCSEQ--RLYLADGTTIIDVSP-GPY--SASITNRWSVGSFNGVIFANDGVNPPHH------------------------ 124 (513) Q Consensus 74 v~~~~--kly~~~~~t~~dis~-~~~--~~~~~~~w~f~~~~~~~ia~ng~d~~q~------------------------ 124 (513) +.+.+ ++|..++. ...++. .+| .+.+-..-++++.+|+++++|..-+|+. T Consensus 80 ~~~~~~i~v~~~~G~-~~~v~~~~~y~~~~~~~~~l~~~tvaD~~fi~n~~~~~~~~~~~~~~~~~~~~~~~~~~vr~g~ 158 (808) T protein:vir:88 80 GFSGTGLAVWDLKGN-NYTVRGYNGYANCANPRTDLRLITVADYTFVVNRNTVCQMGSTLTHAAYPRLDGRAIINVRGGQ 158 (808) T ss_pred EEeCCeEEEEEcCCc-eEEEeecCcceEecCChhheeEEEEcCEEEEEcCCcceeecccccccCCCCCCccEEEEEcccc Confidence 44444 33333332 111111 111 0000011222222222222221111110 Q ss_pred -------------------------------------------------------------------------------- Q lcl|NC_010325. 125 -------------------------------------------------------------------------------- 124 (513) Q Consensus 125 -------------------------------------------------------------------------------- 124 (513) T Consensus 159 y~~~y~i~i~g~~s~~~~t~t~~~~~~s~~~v~~~~~~~~~~~~~~~~~~~ia~~l~~~~~~~~~~~~~~~~~~~~~~~i 238 (808) T protein:vir:88 159 YGRTLSITINGDGTGSSPQASIKMPNGSAEKVPAGDPYAGMNQVDMTDASWIAAELARQLTVSLGGSGWSFQAGTGWILI 238 (808) T ss_pred cCceEEEEEecCCcceeeeEeEEEccCcccceeeccceeecccCCccccccchhhheeeeeecccccceEEEeccceEEE Confidence Q ss_pred ----------Ec---C-----------CCceecccCC------------------------------------------- Q lcl|NC_010325. 125 ----------LP---P-----------SESTFRVLPN------------------------------------------- 137 (513) Q Consensus 125 ----------~~---~-----------~s~~f~~L~g------------------------------------------- 137 (513) .. + ....+++|+. T Consensus 239 ~~~a~~~~~~~~t~~g~~~~~~~~~~~~v~~~~~lp~~~p~g~~v~i~~~~~~~~~~~yv~~~~~~~~w~e~~~~~~~~~ 318 (808) T protein:vir:88 239 NAPANDNVRQIATKDGYADTLLSGFIYQVQTFTKLPANAPPGYLVEITGESARSGDNYWVQYDASGKVWKETAKPKIIAG 318 (808) T ss_pred EeccCceeEEEcccCCcCcceeeeeeeeccceeeccccCCCCcEEEEEecCCCCCceeEEEEEcCCeEEEEeeeccceee Confidence 00 0 0000111110 Q ss_pred -------------------------------------Ccccc--eeeEEEEEcCEEEEEECCcCcccCCceEEEeccCCc Q lcl|NC_010325. 138 -------------------------------------FPANT--TFKRLKSFKNFLVGLNATSNSVEMPQMVWWSTSADA 178 (513) Q Consensus 138 -------------------------------------~p~~~--ka~~v~~~~~~l~~~g~t~~~~~~p~rv~wS~~~d~ 178 (513) .|+.. ....|..|++||++++ |++|+.|..+|. T Consensus 319 ~~~~tmp~~lv~~~~~~~~~~~~~w~~r~~Gd~~tnp~psf~g~~~~~v~f~q~RL~f~~--------~~~v~~Srtgd~ 390 (808) T protein:vir:88 319 FNNATLPHALVRAADGQFDWTPLTWDGRNAGDDDTNPMPSFVGATINDVFFFRNRLGFLS--------GENVVMSRTSKY 390 (808) T ss_pred ecccceeEEEEecCCceEEEEecccccccccccccCccceecCCceeEEEEEcceEEEee--------CCeEEEEeccCc Confidence 00000 0113679999998865 678999999996 Q ss_pred cccccccccccc--ccCcceecc---cCCCCceeEEEecCcceEEEecCcEEEEEecC--CCceeEeEEecCccccccCc Q lcl|NC_010325. 179 GGVPASWDPTDP--TKDAGQNTL---ADTNGAIVDGVKLRDSFIIYKEDSVYSMRYIG--GLFIFQFQQLFNDVGILGPN 251 (513) Q Consensus 179 ~~~P~~Wd~t~~--t~~a~~~dl---~d~~G~iv~g~~l~~~~vIf~en~i~~m~y~g--~~~~f~~~~i~~~~G~~~~~ 251 (513) + ++..... ..+++=.++ .+....|..+++.+..|+||...+-|.++-.+ .|...++.+.+ ..||.+.= T Consensus 391 ~----nF~~~t~~~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~i~T~~~e~~l~~~~~lTP~~~~~~~~s-~~~~~~~~ 465 (808) T protein:vir:88 391 F----NFFPSSVATLSDDDPIDVAISHNRISILKYAVPFSEQLLLWSDQAQFVLSSKTILSSKTIELDLTT-EFDVSDGA 465 (808) T ss_pred c----cccCCcccCCCCCccEEEEecCCccceeeEEeecCCcEEEEecCcEEEEeCCCcccceeEEEEEEE-EecccCCC Confidence 4 4433322 122333333 23334466688999999999999999996322 24445666665 34676767 Q ss_pred eeEEECCeEEEEeCCCeE-------EECC--cccccCC-chhHHHHHHhhcCcchhCCEEEEEecCCCEEEEEEccCCCC Q lcl|NC_010325. 252 CAVEFDGNHFVVGHGDVY-------VHNG--VQKQSVI-DAQVRKFFFSDINPDNYQRTFVLADHVNTEMWVCYSSTRSK 321 (513) Q Consensus 252 siv~~~~~~ffls~~G~y-------~~~G--~~~~~Ig-~~~V~~~~~~~i~~~~~~~~~~~~d~~~~~v~~~~~s~~~~ 321 (513) .-+.+|+.++|+++.|=| .++- ..+.+.. +.-+.+.|- .....+ +.+-..+..+.|+-... T Consensus 466 ~Pv~vG~~v~f~~~~g~~~~v~r~~~~~~~~d~y~~~dlt~~~~h~~~-----~~~~~~-~~~~~~~~~~v~~~~~~--- 536 (808) T protein:vir:88 466 RPYGIGRGVYFAAPRASFTSLKRYYAIQDVSDVKSAEDVSAHVPSYIT-----NTVHAI-HGSGTENFVSILSDGSP--- 536 (808) T ss_pred CceEeCCeEEEEecCCCeeEEEEEEEeeeccCceehhhHHHHHHHhcC-----CCeEEE-EEeCCCCeEEEEEEcCC--- Confidence 789999999999999832 2332 2223221 223333321 111111 12222333344554322 Q ss_pred CCcccceEEEEecc----cC---eEEEEeccce----eeeee------------------cccccccce----------- Q lcl|NC_010325. 322 PGKHCDRAIIWNWK----EN---TWSIRDLPNV----LSGAY------------------GIIDPKVSN----------- 361 (513) Q Consensus 322 ~~~~~d~~lvyd~~----~~---~Ws~~d~~~~----~~~~~------------------g~~~~~~~~----------- 361 (513) +++++|-|. .+ .|+.-+++.. +..+- +........ T Consensus 537 -----g~l~~~~y~~~~~e~~v~aW~r~~~~g~~~~~~~~~~~~~d~l~~vV~r~~~~~ler~~~~~~~~~~~~~~~~~~ 611 (808) T protein:vir:88 537 -----NKVFIYKFLYLDEILQQQSFSHWEFGDAATTRVLAASCIGSYCYLMIDRPEGLCLERMEFTQHTIDYSIEPYRTY 611 (808) T ss_pred -----CEEEEEEEeccCCceeEEeeEEEecCCCeeEEEEEEeccCCEEEEEEEcCCcEEEEEEeeccCCCCCccccceee Confidence 356666652 22 5776554421 10100 000000000 Q ss_pred eeccc---CcccCccceeccccccc-cCc----cceEEEEeecCce-------------eeec----ccceeecCccEEE Q lcl|NC_010325. 362 LWDDD---PNPWDTDTSVWGEGSYN-PAK----SSMIFSSFQDKKL-------------FLFG----NNSTFSGQNFVST 416 (513) Q Consensus 362 ~~~~~---~~~~d~d~~~~~~ds~~-~~~----~~~~~~~~~~~~~-------------~~~~----~~~~~~g~~l~a~ 416 (513) +++.. ...++.++..+.++-.. ..+ ....+..-.++.+ +.+. .....-|-++++. T Consensus 612 lD~~~~~~~g~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~~v~vGl~y~s~ 691 (808) T protein:vir:88 612 MDMKKTIVLGAYNIDTNLTSFDVRTAYGGTPGPESTFYTIDQQGVLIEHEARDWATNPYISFVGNRAGEQMVIGKQYTFQ 691 (808) T ss_pred eeeeeeeccccccCccccceeecccccccccccceeEEEEcCCceEEeeecccccCcceEEeCCCccCceEEEeeeeeEE Confidence 00000 00112222111111000 000 0000111111111 1111 1235668888888 Q ss_pred EeecccccC---CCcc-------eEEEeeeeeccCCCeeEEEEeeeeecCC----CCceE-----cCceeeecCCceEEE Q lcl|NC_010325. 417 LERSDIYLG---DDRM-------MKTVSAIIPHITGNGTCNIWVGNAQVQG----SGIRW-----KGPYPYRIGQDYKID 477 (513) Q Consensus 417 ~~~~~~~~~---~~~~-------~~~i~~~~~~~t~~~~~~~~~g~~~~~~----~~~~w-----~~~~~~~~~~~~~~~ 477 (513) ++...+.+. ...+ +.++++........+.+.+.+....... .+-.+ .+..+.-+| +..++ T Consensus 692 ~~~~p~~~~~~~g~~~~~~~~~gr~~l~r~~~~~~~tg~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~tg-~~~vp 770 (808) T protein:vir:88 692 YEFSKFLIKQTADDGSTSTEDIGRLQLRRAWLNYEESGAFEINVNNGSSEFVYVMTGGRLGIQRVLGELSVGTG-QFKFP 770 (808) T ss_pred EEecceEEecCCCCcceeecccceEEEEEEEEEeecccceEEEeCCCcccceeeccCcccCcccccCccccccc-eEEEE Confidence 886665442 1111 1234433333333333333332211110 01111 111122233 36688 Q ss_pred eecCCCeEEEEEEccCCCcEEEEEEeeEEe-ccccCC Q lcl|NC_010325. 478 TKHVGRYIALKFDFSSEGDWYFNGYTIEMA-PKAGMR 513 (513) Q Consensus 478 ~R~~~Ry~~~rl~~~~g~~w~~~G~~~~~~-~~g~rr 513 (513) ++..++-..++|+.+.-.+.++.++++|.. -.=.|| T Consensus 771 ~~~~~~~~~v~i~~d~P~P~tilsi~~eg~y~~r~~~ 807 (808) T protein:vir:88 771 VTGNAVNQRVTITSSNPNPLNVIGCGWEGNYIRRSSG 807 (808) T ss_pred ecccCceeEEEEEECCCCceEEEEEEEEEEEeccccC Confidence 888888888889888899999999998872 233555 No 20 >protein:vir:10452 Length: 794 # NCBI annotation: tail protein # Family: family:all:825 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848299;genbank:gi:30387490;genbank:GeneID:1733952 Probab=98.79 E-value=4.8e-08 Score=60.69 Aligned_cols=483 Identities=14% Similarity=0.128 Sum_probs=237.8 Q ss_pred CcccchhhcCccccccccCcccCCCCcEEEeEEEEEe-CCeeEECCCcceeeecCCCccee---eeeeee--CCceEEEE Q lcl|NC_010325. 1 MALERQEVKNPTGIVTDIAPADLPLEKWSFGNNVRFK-NGKAQKTLGHTPIFDTAQAPILD---MFPFIR--NNIPYWLL 74 (513) Q Consensus 1 m~~~~~~~~~~~G~~~~~~P~~lp~~a~~~~~Nv~~~-~g~~~~~~g~~~~~~~~~~~~~~---~~~~~~--~g~~~~~v 74 (513) |+..-+.+.+..|.|.-..+..-=++....|.||+++ .|+++||||..=+.........+ +..++. ....+.++ T Consensus 1 M~~i~~s~~n~~~GvSqq~D~~Ry~~q~~~~~N~~~~~~gG~~rRpGt~fv~~l~~~~~~~~~~~~~~~~rd~~e~~~v~ 80 (794) T protein:vir:10 1 MALISQSIKNLKGGISQQPDILRYPDQGSRQVNGWSSETEGLQKRPPLVFLKTLGDNGALGQAPYIHLINRDENEQYYAV 80 (794) T ss_pred CcceeeecchhhcccccCCchHHhhhhHhhhhcceeeeccCcccCcchhhheeccCCCccccceeeeEEecCCCceEEEE Confidence 9999999999999988665555556888999999999 58899999976664432222111 222221 23333334 Q ss_pred EcCc--eEEEecCceEEe--cccccee--eCCCCceeEEeeCCEEEEEeCCCceEE------------------------ Q lcl|NC_010325. 75 CSEQ--RLYLADGTTIID--VSPGPYS--ASITNRWSVGSFNGVIFANDGVNPPHH------------------------ 124 (513) Q Consensus 75 ~~~~--kly~~~~~t~~d--is~~~~~--~~~~~~w~f~~~~~~~ia~ng~d~~q~------------------------ 124 (513) .+.+ ++|.+++..-+- -...+|- +.....-++++.+|+++++|..-+||. T Consensus 81 ~~~~~irv~~~~G~~~~v~~~~~~~Y~~aa~~~~~l~~~q~aD~~fivn~~~~~~~~~~~~~~~~~~~~~~~~~~v~~g~ 160 (794) T protein:vir:10 81 FTGTGIRVFDLAGNEKQVRYPNGSNYIKTANPRSDLRMVTVADYTFIVNRNVVVQKDPNSVNLANYNPKQDGLINIRGGQ 160 (794) T ss_pred EeCCeEEEEEcCCcEEEEEcCCCCcceecCCCcceEEEEEEcCEEEEEcCCeeeeeeccccccCCCCCCccEEEEecccc Confidence 4444 555555432211 1123452 212223567777777777665443331 Q ss_pred ----------------E---cCC--------------------------------------------------------- Q lcl|NC_010325. 125 ----------------L---PPS--------------------------------------------------------- 128 (513) Q Consensus 125 ----------------~---~~~--------------------------------------------------------- 128 (513) + +++ T Consensus 161 y~r~y~i~i~~~~~at~~tpdgt~~~~~~~~s~~~ia~~L~~~l~a~~~g~t~~~~g~~i~i~a~s~~~~~t~s~~~~~~ 240 (794) T protein:vir:10 161 YGRELIVHINGKDVATYKIPDGSKPEHVNNTDAQWLAERLAKQMRINLSGWTVNVGQGFIHVTAPSGQQIDSFTTKDGYA 240 (794) T ss_pred cceEEEeccCCcceeEEEecCCCCcccceecchhhhhhhhhhhhhcccCCceEEeCCeEEEEEeccCceeccccccCCcC Confidence 0 000 Q ss_pred ----------CceecccCCCcc---------------------------------------------------------- Q lcl|NC_010325. 129 ----------ESTFRVLPNFPA---------------------------------------------------------- 140 (513) Q Consensus 129 ----------s~~f~~L~g~p~---------------------------------------------------------- 140 (513) .+.+++|+...+ T Consensus 241 ~~~~~~v~~~~~~~~~lp~~~~~G~~v~i~~~~~~~~~~yyv~~~~~~~~w~E~~~~g~~~~~~~~tmP~~l~r~~~~t~ 320 (794) T protein:vir:10 241 DQLINPVTHYAQSFSKLPPNAPNGYMVKIVGDASKSADQYYVRYDAERKVWTETLGWNTENQVLLETMPHALVRAADGNF 320 (794) T ss_pred cceeEEEEeccCcceecccCCCCCcEEEEEeCCCCCcceeEEEEEcCCcEEEEecccceeEEEecccceeEEEEeccceE Confidence 000011110000 Q ss_pred -----cce-------------------eeEEEEEcCEEEEEECCcCcccCCceEEEeccCCccccccccccccc--ccCc Q lcl|NC_010325. 141 -----NTT-------------------FKRLKSFKNFLVGLNATSNSVEMPQMVWWSTSADAGGVPASWDPTDP--TKDA 194 (513) Q Consensus 141 -----~~k-------------------a~~v~~~~~~l~~~g~t~~~~~~p~rv~wS~~~d~~~~P~~Wd~t~~--t~~a 194 (513) .|. ...|..|++||++++ |+.|+.|..+|.+ +|..... ..++ T Consensus 321 ~~~~~~w~~r~~Gd~~tnp~psf~g~~~~~v~f~q~RL~f~~--------~~~v~~Srtgd~~----nF~~~t~~~~~Dd 388 (794) T protein:vir:10 321 DFKWLEWSPKSCGDVDTNPWPSFVGSSINDVFFFRNRLGFLS--------GENIILSRTAKYF----NFYPASIANLSND 388 (794) T ss_pred EeeecccccccccccccCccCcccCCCccEEEEEcceEEEee--------CCeEEEEecCCcc----cccccccccCCCC Confidence 000 023678888888754 6789999999964 4444432 2233 Q ss_pred ceecc---cCCCCceeEEEecCcceEEEecCcEEEEEecC--CCceeEeEEecCccccccCceeEEECCeEEEEeCCC-- Q lcl|NC_010325. 195 GQNTL---ADTNGAIVDGVKLRDSFIIYKEDSVYSMRYIG--GLFIFQFQQLFNDVGILGPNCAVEFDGNHFVVGHGD-- 267 (513) Q Consensus 195 ~~~dl---~d~~G~iv~g~~l~~~~vIf~en~i~~m~y~g--~~~~f~~~~i~~~~G~~~~~siv~~~~~~ffls~~G-- 267 (513) +=.++ .+....|..+++.+..|+||...+-|.++-.+ .|.-.++.+.+ ..||-+.=.-+.+|+.++|+++.| T Consensus 389 D~I~~~~ss~~~~~i~~~v~~~~~L~i~T~~~q~~l~~~~~lTP~~~~~~~~s-~~~~~~~~~Pv~vg~~v~f~~~~g~~ 467 (794) T protein:vir:10 389 DPIDVAVSTNRIAILKYAVPFSEELLIWSDEAQFVLTASGTLTSRSVELNLTT-QFDVQDRARPYGIGRNVYFASPRSSY 467 (794) T ss_pred ccEEEEecCCcceeeEEEeecCCcEEEEecCcEEEEeCCCcccceeEEEEEEE-eecccCCCCceEeCCeEEEEecCCCe Confidence 43333 23334466688899999999999999996222 24345666666 345666556788999999999987 Q ss_pred --eEE---ECCcc--cccCC-chhHHHHHHhhcCcchhCCEEEEEecCCCEEEEEEccCCCCCCcccceEEEEecc---- Q lcl|NC_010325. 268 --VYV---HNGVQ--KQSVI-DAQVRKFFFSDINPDNYQRTFVLADHVNTEMWVCYSSTRSKPGKHCDRAIIWNWK---- 335 (513) Q Consensus 268 --~y~---~~G~~--~~~Ig-~~~V~~~~~~~i~~~~~~~~~~~~d~~~~~v~~~~~s~~~~~~~~~d~~lvyd~~---- 335 (513) ++. ++-.+ +.+.. +.-+.+ ++ +...-. .+.+-..+..++|+-... +++++|-|. T Consensus 468 ~~~~r~~~~~~~~d~y~a~Dlt~~~~h-l~----~~~v~~-~~~~~~~~~~~~~~~~~~--------~~l~~~~y~~~~~ 533 (794) T protein:vir:10 468 TSIHRYYAVQDVSSVKNSEDITSHVPN-YI----PNGVFS-ICGSGTENFCSVLSHGDP--------SKIFMYKFLYLNE 533 (794) T ss_pred eEEEEEeeeccccCceehhhHHHHHHH-hc----CCceEE-EEEeCCCCcEEEEEEcCC--------CEEEEEEEeecCC Confidence 332 22212 22221 112222 21 211111 122233344555654321 357776652 Q ss_pred c---CeEEEEeccce----eeeeec----------------ccccccce-----------eeccc-----CcccCccce- Q lcl|NC_010325. 336 E---NTWSIRDLPNV----LSGAYG----------------IIDPKVSN-----------LWDDD-----PNPWDTDTS- 375 (513) Q Consensus 336 ~---~~Ws~~d~~~~----~~~~~g----------------~~~~~~~~-----------~~~~~-----~~~~d~d~~- 375 (513) . ..|+.-+.+.. |..+.+ ........ +++.. ..+|+.+.. T Consensus 534 e~~v~aW~~~~~~g~~~~~~~~~~~d~l~~iv~r~~~~~~~r~~~~~~~~~~~~~~~~~~lD~~~~~~~~~~~~~~~~~~ 613 (794) T protein:vir:10 534 ELRQQSWSHWDFGENVQVLACQSISSDMYVILRNEFNTFLARISFTKNAIDLQGEPYRAFMDMKIRYTIPSGTYNDDTFT 613 (794) T ss_pred ceEEEeEEEEEcCCcEEEEEEEecCCeEEEEEEeCCCEEEEEEEEeecCCCCCCccceeeeecceEEEecCccccccccc Confidence 2 26886665431 111111 00000000 00000 001111100 Q ss_pred -----eccccccccCccceEEEEeecCce---------------eee----cccceeecCccEEEEeecccccC---CCc Q lcl|NC_010325. 376 -----VWGEGSYNPAKSSMIFSSFQDKKL---------------FLF----GNNSTFSGQNFVSTLERSDIYLG---DDR 428 (513) Q Consensus 376 -----~~~~ds~~~~~~~~~~~~~~~~~~---------------~~~----~~~~~~~g~~l~a~~~~~~~~~~---~~~ 428 (513) ++........|....+ ..|+.. +.+ ......-|-++++.++-..+.+- +.. T Consensus 614 t~~~~~~~~g~~~~eg~~v~~--~adg~~~~~~~~~~~~~g~~~l~i~~~~~a~~v~vGl~y~s~~~~~~~~i~~~~~~~ 691 (794) T protein:vir:10 614 TSIHIPTIYGANFGRGKITVL--EPDGKITVFEQPTSGWQSDPWLRLSGNLEGREVFIGFNINFVYEFSKFLIKQTTDDG 691 (794) T ss_pred ceEEcccccCcccccccEEEE--ecCCceeeeeeeeeeeecceEEEecCCCCCceEEEeeeeeEEEEecceEEEccCCCc Confidence 0000111111222211 122211 111 12336668888888886665432 111 Q ss_pred ce-------EEEeeeeeccCCCeeEEEEeee--eecCC--CCceE------cCceeeecCCceEEEeecCCCeEEEEEEc Q lcl|NC_010325. 429 MM-------KTVSAIIPHITGNGTCNIWVGN--AQVQG--SGIRW------KGPYPYRIGQDYKIDTKHVGRYIALKFDF 491 (513) Q Consensus 429 ~~-------~~i~~~~~~~t~~~~~~~~~g~--~~~~~--~~~~w------~~~~~~~~~~~~~~~~R~~~Ry~~~rl~~ 491 (513) +. .++.++.......+.+.+.+.. ++... .+... .+....-+| ...++++..++-..++|+. T Consensus 692 ~~~~~~~gr~~l~r~~~~~~~tg~~~v~v~~~~~~~~~~~~~~~~~~~~~~~g~~~~~tg-~~~vp~~g~~~~~~v~i~~ 770 (794) T protein:vir:10 692 STSTEDIGRLQLRRAWVNYEDSGTFDIYVENQSSNWKYTMAGARLGSNTLRAGRLNLGTG-QYRFPVVGNAKFNTVYILS 770 (794) T ss_pred ceeeeccccEEEEEEEEEeeccccEEEEEcCCccccceeeccceeccccccccccccccc-eEEEEecccCceEEEEEEE Confidence 11 2233333222223333333211 11100 11111 111112223 3678899999999999999 Q ss_pred cCCCcEEEEEEeeEEe-ccccCC Q lcl|NC_010325. 492 SSEGDWYFNGYTIEMA-PKAGMR 513 (513) Q Consensus 492 ~~g~~w~~~G~~~~~~-~~g~rr 513 (513) +.-.+.++.++++|.. -.=.|| T Consensus 771 d~P~P~tvlsi~~eg~y~~r~~~ 793 (794) T protein:vir:10 771 DETTPLNIIGCGWEGNYLRRSSG 793 (794) T ss_pred CCCCceEEEEEEEEEEEeccccC Confidence 9999999999999983 333444 No 21 >protein:vir:2203 Length: 794 # NCBI annotation: tail tubular protein B # Family: family:all:825 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_042000;swissprot:sw:p03747;genbank:gi:9627472;uniprot:P03747;genbank:GeneID:1261024 Probab=98.75 E-value=6.6e-08 Score=59.94 Aligned_cols=482 Identities=14% Similarity=0.122 Sum_probs=238.8 Q ss_pred CcccchhhcCccccccccCcccCCCCcEEEeEEEEEe-CCeeEECCCcceeeecC------CCcceeeeeeeeCCceEEE Q lcl|NC_010325. 1 MALERQEVKNPTGIVTDIAPADLPLEKWSFGNNVRFK-NGKAQKTLGHTPIFDTA------QAPILDMFPFIRNNIPYWL 73 (513) Q Consensus 1 m~~~~~~~~~~~G~~~~~~P~~lp~~a~~~~~Nv~~~-~g~~~~~~g~~~~~~~~------~~~~~~~~~~~~~g~~~~~ 73 (513) |+...+.+.+..|.|.-..+..-=++....|.||.++ -|+++||+|..=+..-. .+..+..+.+ .....+++ T Consensus 1 M~~i~~s~~n~~~GvSqq~D~~ry~~q~~~~~N~~~~~~gG~~rRpG~~fv~~l~~~~~~~~~~~l~~~~~-~~~~~y~l 79 (794) T protein:vir:22 1 MALISQSIKNLKGGISQQPDILRYPDQGSRQVNGWSSETEGLQKRPPLVFLNTLGDNGALGQAPYIHLINR-DEHEQYYA 79 (794) T ss_pred CceeeeecchhhcccccCCchHHhhhHHhhhhcceeeccCCceeCCchHhhhhhcccCCCCCccEEEEEEe-CCCcEEEE Confidence 9999999999999988654444456888999999999 58899999976553211 1122333222 33344555 Q ss_pred EEcCceEEEecCc-eEEecc---cccee--eCCCCceeEEeeCCEEEEEeCCCceEEEcC-------------------- Q lcl|NC_010325. 74 LCSEQRLYLADGT-TIIDVS---PGPYS--ASITNRWSVGSFNGVIFANDGVNPPHHLPP-------------------- 127 (513) Q Consensus 74 v~~~~kly~~~~~-t~~dis---~~~~~--~~~~~~w~f~~~~~~~ia~ng~d~~q~~~~-------------------- 127 (513) +.+.+.|+-|+.. ....+. ..+|. ......-++++.+|+++++|..-+||...- T Consensus 80 ~~~~~~irv~~~~G~~~~v~~~~~~~y~~~~~~~~~l~~~q~aD~~fi~~~~~~p~~~~~~~~~~~~~~~~~g~v~v~~g 159 (794) T protein:vir:22 80 VFTGSGIRVFDLSGNEKQVRYPNGSNYIKTANPRNDLRMVTVADYTFIVNRNVVAQKNTKSVNLPNYNPNQDGLINVRGG 159 (794) T ss_pred EEcCCeEEEEecCCcEEEeecCCCccceecCCCcccEEEEEEcCEEEEEcCCeeeeEeeccccCCCCCCCceEEEEccCC Confidence 5555544444322 122332 12342 222233577777777777776555532100 Q ss_pred -----------------------C-------------------------------------------------------- Q lcl|NC_010325. 128 -----------------------S-------------------------------------------------------- 128 (513) Q Consensus 128 -----------------------~-------------------------------------------------------- 128 (513) + T Consensus 160 ~y~~ty~v~I~~~~~a~~~~p~gt~~~~~~~~~~~~ia~~L~~~l~~~~~~~t~~~~~~~~~i~a~~~~~~~~~t~~~g~ 239 (794) T protein:vir:22 160 QYGRELIVHINGKDVAKYKIPDGSQPEHVNNTDAQWLAEELAKQMRTNLSDWTVNVGQGFIHVTAPSGQQIDSFTTKDGY 239 (794) T ss_pred ccceeEEEEeccCcceEEEEcCCCccccceeechhhhhhhhhhhheeccccceEEeCCceEEEEEcCCceEEEEeeeccc Confidence 0 Q ss_pred -----------CceecccCCCcc--------------------------------------------------------- Q lcl|NC_010325. 129 -----------ESTFRVLPNFPA--------------------------------------------------------- 140 (513) Q Consensus 129 -----------s~~f~~L~g~p~--------------------------------------------------------- 140 (513) ...+++|+...+ T Consensus 240 ~~t~~~~~~~~~~~~~~lp~~~~~G~~v~i~~~~~~~~~~Y~v~~~~~~~~w~e~~~~~~~~~~~~~t~p~~lv~~~~~~ 319 (794) T protein:vir:22 240 ADQLINPVTHYAQSFSKLPPNAPNGYMVKIVGDASKSADQYYVRYDAERKVWTETLGWNTEDQVLWETMPHALVRAADGN 319 (794) T ss_pred CcceeEEEEeccccceeccccCCCCeEEEEEeCCCCCcceeEEEEeccceEEEEeeeccceeeecccceeeEeeeccCCc Confidence 000112221100 Q ss_pred ------ccee-------------------eEEEEEcCEEEEEECCcCcccCCceEEEeccCCccccccccccccc--ccC Q lcl|NC_010325. 141 ------NTTF-------------------KRLKSFKNFLVGLNATSNSVEMPQMVWWSTSADAGGVPASWDPTDP--TKD 193 (513) Q Consensus 141 ------~~ka-------------------~~v~~~~~~l~~~g~t~~~~~~p~rv~wS~~~d~~~~P~~Wd~t~~--t~~ 193 (513) .|.. ..|..|++||++++ |+.|+.|..+|.+ +++.... ..+ T Consensus 320 ~~~~~~~w~~r~~Gd~~tnp~psf~g~~i~~v~f~q~RL~f~~--------~~~v~~Srtgd~~----nF~~~t~~~~~D 387 (794) T protein:vir:22 320 FDFKWLEWSPKSCGDVDTNPWPSFVGSSINDVFFFRNRLGFLS--------GENIILSRTAKYF----NFYPASIANLSD 387 (794) T ss_pred EEEeeccccccccCccccCCcceecCCCcceEEEEcceEEEec--------CCeEEEEccCCcc----ccccccCcCCCC Confidence 0000 23678888888753 6779999999964 4443332 223 Q ss_pred cceecc---cCCCCceeEEEecCcceEEEecCcEEEEEecC--CCceeEeEEecCccccccCceeEEECCeEEEEeCCCe Q lcl|NC_010325. 194 AGQNTL---ADTNGAIVDGVKLRDSFIIYKEDSVYSMRYIG--GLFIFQFQQLFNDVGILGPNCAVEFDGNHFVVGHGDV 268 (513) Q Consensus 194 a~~~dl---~d~~G~iv~g~~l~~~~vIf~en~i~~m~y~g--~~~~f~~~~i~~~~G~~~~~siv~~~~~~ffls~~G~ 268 (513) ++=.++ .+....|..+++....|+||...+-|.++-.+ .|...++.+.+. .+|-+.=.=+.+|+.++|+++.|= T Consensus 388 dD~i~~~~ss~~~~~i~~~v~~~~~L~i~t~~~e~~l~~~~~lTP~~~~~~~~s~-~~~~~~~~Pv~vg~~v~f~~~~g~ 466 (794) T protein:vir:22 388 DDPIDVAVSTNRIAILKYAVPFSEELLIWSDEAQFVLTASGTLTSKSVELNLTTQ-FDVQDRARPFGIGRNVYFASPRSS 466 (794) T ss_pred CccEEEEecCCcceeeEEEeecCCcEEEEecCcEEEEeCCCcccceeEEEEEEEE-eeccCCCCceEeCCeEEEEecCCC Confidence 343333 23334466688899999999999999996222 244456666653 346555667889999999999883 Q ss_pred E-------EECC--cccccCC-chhHHHHHHhhcCcchhCCEEEEEecCCCEEEEEEccCCCCCCcccceEEEEecc--- Q lcl|NC_010325. 269 Y-------VHNG--VQKQSVI-DAQVRKFFFSDINPDNYQRTFVLADHVNTEMWVCYSSTRSKPGKHCDRAIIWNWK--- 335 (513) Q Consensus 269 y-------~~~G--~~~~~Ig-~~~V~~~~~~~i~~~~~~~~~~~~d~~~~~v~~~~~s~~~~~~~~~d~~lvyd~~--- 335 (513) | .++- ..+.+.. +.-+.+.| +.....+ +..-..+..++|+-... +++++|-|. T Consensus 467 ~~~~~r~~~~~~~~d~y~~~Dlt~~~~~~~-----~~~~~~~-~~~~~~~~~v~~~~~~~--------~~l~~~~y~~~~ 532 (794) T protein:vir:22 467 FTSIHRYYAVQDVSSVKNAEDITSHVPNYI-----PNGVFSI-CGSGTENFCSVLSHGDP--------SKIFMYKFLYLN 532 (794) T ss_pred eeEEEEeEeeecccCceehhhHHHHHHHhc-----CCceEEE-EEeCCCCcEEEEEEcCC--------CEEEEEEEeecC Confidence 2 2222 1222211 11222221 1111111 22233344566654322 357777763 Q ss_pred ----cCeEEEEeccce----eeeee----------------cccccccce-----------eecc-----cCcccCccce Q lcl|NC_010325. 336 ----ENTWSIRDLPNV----LSGAY----------------GIIDPKVSN-----------LWDD-----DPNPWDTDTS 375 (513) Q Consensus 336 ----~~~Ws~~d~~~~----~~~~~----------------g~~~~~~~~-----------~~~~-----~~~~~d~d~~ 375 (513) -..|+.-+.+.. |..+. +........ +++. ....++.+.. T Consensus 533 ~e~~v~aW~~~~~~g~~~~~~~~~~~d~l~~iv~r~~~~~~~r~~~~~~~~~~~~~~~~~~lD~~~~~~~~~g~~~~~~~ 612 (794) T protein:vir:22 533 EELRQQSWSHWDFGENVQVLACQSISSDMYVILRNEFNTFLARISFTKNAIDLQGEPYRAFMDMKIRYTIPSGTYNDDTF 612 (794) T ss_pred CceeEEeeEEEEcCCCEEEEEEEecCCEEEEEEEeCCCEEEEEEEEeeccccCCCccceeeeeeeEEEeeccceeecCCc Confidence 246886665442 11111 000000000 0000 0011111100 Q ss_pred e--cc----ccccccCccceEEEEeecCcee---------------ee----cccceeecCccEEEEeecccccC---CC Q lcl|NC_010325. 376 V--WG----EGSYNPAKSSMIFSSFQDKKLF---------------LF----GNNSTFSGQNFVSTLERSDIYLG---DD 427 (513) Q Consensus 376 ~--~~----~ds~~~~~~~~~~~~~~~~~~~---------------~~----~~~~~~~g~~l~a~~~~~~~~~~---~~ 427 (513) + .. -...-..|... ....|+... .+ ......-|.++++.++-..+.+. +. T Consensus 613 ~t~~~~~~~~g~~~~~g~~v--~~~~dg~~~~~~~~~~~~~~~~~~~v~~~~~~~~v~VGl~y~s~~~~~~~~~~~~~~~ 690 (794) T protein:vir:22 613 TTSIHIPTIYGANFGRGKIT--VLEPDGKITVFEQPTAGWNSDPWLRLSGNLEGRMVYIGFNINFVYEFSKFLIKQTADD 690 (794) T ss_pred ceEEEcccccCcccccceEE--EEEcCCceeeceeeeeeeeccceEEeCCCCCCcEEEEeeeeeEEEEecceEEEecCCC Confidence 0 00 00001111111 222233211 11 11235668888998886665432 11 Q ss_pred cce-------EEEeeeeeccCCCeeEEEEeee--eecC--CCCceE------cCceeeecCCceEEEeecCCCeEEEEEE Q lcl|NC_010325. 428 RMM-------KTVSAIIPHITGNGTCNIWVGN--AQVQ--GSGIRW------KGPYPYRIGQDYKIDTKHVGRYIALKFD 490 (513) Q Consensus 428 ~~~-------~~i~~~~~~~t~~~~~~~~~g~--~~~~--~~~~~w------~~~~~~~~~~~~~~~~R~~~Ry~~~rl~ 490 (513) .+. .++.++...+...+.+.+.+-. ++.. ..+... .+......+ ...++++...+-..++|+ T Consensus 691 ~~~~~~~~grl~l~r~~~~~~~tg~~~v~v~~~~~~~~~~~~~~~~g~~~~~~g~~~~~tg-~~~vp~~~~~~~~~v~i~ 769 (794) T protein:vir:22 691 GSTSTEDIGRLQLRRAWVNYENSGTFDIYVENQSSNWKYTMAGARLGSNTLRAGRLNLGTG-QYRFPVVGNAKFNTVYIL 769 (794) T ss_pred ccceeeecceEEEEEEEEEeccccceEEEEcCCCcccceeecCceecccccccCcccccCc-eEEEEecccCceEEEEEE Confidence 111 2333333322222333332211 1100 000111 011111122 256788999999999999 Q ss_pred ccCCCcEEEEEEeeEEe-ccccCC Q lcl|NC_010325. 491 FSSEGDWYFNGYTIEMA-PKAGMR 513 (513) Q Consensus 491 ~~~g~~w~~~G~~~~~~-~~g~rr 513 (513) .+.-.+.++.++++|.. -.=.|| T Consensus 770 ~d~p~P~tvlsi~~eg~y~~r~~~ 793 (794) T protein:vir:22 770 SDETTPLNIIGCGWEGNYLRRSSG 793 (794) T ss_pred ECCCCCEEEEEEeEEEEEeccccC Confidence 99999999999999983 334444 No 22 >protein:vir:93631 Length: 580 # NCBI annotation: Bcep22gp67 # Family: family:all:1544 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944296;genbank:gi:38640373;genbank:GeneID:2658280 Probab=98.72 E-value=8.2e-08 Score=59.42 Aligned_cols=462 Identities=11% Similarity=0.117 Sum_probs=206.1 Q ss_pred CcccchhhcCccccccccCcccCCCCcEEEeEEEEEeCCeeEECCCcc---eeeecCCCcceeeeeeeeCCceEEEEEcC Q lcl|NC_010325. 1 MALERQEVKNPTGIVTDIAPADLPLEKWSFGNNVRFKNGKAQKTLGHT---PIFDTAQAPILDMFPFIRNNIPYWLLCSE 77 (513) Q Consensus 1 m~~~~~~~~~~~G~~~~~~P~~lp~~a~~~~~Nv~~~~g~~~~~~g~~---~~~~~~~~~~~~~~~~~~~g~~~~~v~~~ 77 (513) |.+. ++-+-.|=++-+.|--||.+.-+-+.|.+|+.|.+.|.+.-+ -+...+.+...-++ +. +.+||.-++ T Consensus 1 M~~i--~i~~f~Ge~Prl~p~lLP~~~a~~a~n~~~~~G~i~P~~~~~~~~~~~~i~~~~~~t~~--~~--~~~W~~w~~ 74 (580) T protein:vir:93 1 MTII--KITGFSGEIPRLVPRLLPDTAAQNATNARLESGGLTPYRKPKFITRISTIPAGQIETIY--RN--GETWMAWDK 74 (580) T ss_pred CeeE--eecccccccccchhhhccccccceEEeeeccCCeeeeeeCchhhccccccCcCcceEEE--ec--CceeEEeCC Confidence 6554 555678999999999999999999999999999998886642 12222222211111 11 113332211 Q ss_pred -----------ceEEEecCce-EEeccccceeeCC---CCceeEEeeCC-------EEEEEe-----C-----CCceEE- Q lcl|NC_010325. 78 -----------QRLYLADGTT-IIDVSPGPYSASI---TNRWSVGSFNG-------VIFAND-----G-----VNPPHH- 124 (513) Q Consensus 78 -----------~kly~~~~~t-~~dis~~~~~~~~---~~~w~f~~~~~-------~~ia~n-----g-----~d~~q~- 124 (513) .++|--+++- -.+.+.+.|.... +.....++-++ +.+... | +.+-+. T Consensus 75 ~V~~i~~PvA~DRvy~Td~g~Pkvt~~g~sy~lgVpaPs~Apt~~~~g~g~l~~~~y~Yv~TfVt~~GeES~PS~~S~~v 154 (580) T protein:vir:93 75 PVYAAPGPVAADRLYVMGDGAPKMIVGGTTYPLAVPMPSAALTAATSGTGTGDVFSRVYVYTFVTGFGEESEPSAISNEV 154 (580) T ss_pred ceeeecCccccceeEEcCCcccceecCCccccccCCCcccCceeeecCCCCcCccceEEEEEEEcCCCCcCCCcccccce Confidence 2666655542 2333444443111 11112222111 111100 0 000000 Q ss_pred -EcCCCc-eecccCCCcccceeeEEEEEc--------CEEEEEECC-------c-------------------------- Q lcl|NC_010325. 125 -LPPSES-TFRVLPNFPANTTFKRLKSFK--------NFLVGLNAT-------S-------------------------- 161 (513) Q Consensus 125 -~~~~s~-~f~~L~g~p~~~ka~~v~~~~--------~~l~~~g~t-------~-------------------------- 161 (513) +..++. +++.++-.|.+-....+++|. +|.++..+. + T Consensus 155 tv~~g~tVtLs~~p~p~~~~~i~~~RIYRS~tG~~gtdy~lVAel~Ag~~sF~Dd~s~a~Lge~Lps~~~~~PP~~m~gL 234 (580) T protein:vir:93 155 NWQAGQTVTLSGFQAAPAGRNITKQRIYRSQTSLSGTDLYFIAERDASAANFVDNVPLSDQNEPLPSLEWNAPPDDLTGL 234 (580) T ss_pred eeCCCCeEEEEecCCCCCCCccceEEEEEeccCCCceeEEEEeeeccceeeeeecccccccccccchhhccCcCCCcceE Confidence 111111 122222222221222245552 454444321 0 Q ss_pred ----Ccc---cCCceEEEeccCCcccccccccccccccCcceecccCCCCceeEEEecCcceEEEecCcEEEEEecCCCc Q lcl|NC_010325. 162 ----NSV---EMPQMVWWSTSADAGGVPASWDPTDPTKDAGQNTLADTNGAIVDGVKLRDSFIIYKEDSVYSMRYIGGLF 234 (513) Q Consensus 162 ----~~~---~~p~rv~wS~~~d~~~~P~~Wd~t~~t~~a~~~dl~d~~G~iv~g~~l~~~~vIf~en~i~~m~y~g~~~ 234 (513) |+. =..|-|+||-..-|..+|... +.-.| -.||++++.++.++|-..-.-|..+ +-+|. T Consensus 235 ~~m~nGi~agF~Gnev~fsEpy~P~AWP~~y-----------r~t~~--~~Ivaia~~g~~LvV~T~g~pyl~~-G~~P~ 300 (580) T protein:vir:93 235 ISLPNGMMAAFRGKELWLCEPWRPHAWPQKY-----------VLTMD--YNIVALGAYGTTIVVATDGQPYIVS-GASPD 300 (580) T ss_pred EeeccceEEEEeCCEEEEecCCCCccchhhc-----------CCCCC--CCceeEeeeCceEEEEEcCceEEEE-ccChh Confidence 000 023668888887765544433 22223 4599999999999999999999973 55688 Q ss_pred eeEeEEecCccccccCceeEEECCeEEEEeCCCeEEECCcccccCCchhH---HHHHHhhcCcchhCCEEEEEecCCCEE Q lcl|NC_010325. 235 IFQFQQLFNDVGILGPNCAVEFDGNHFVVGHGDVYVHNGVQKQSVIDAQV---RKFFFSDINPDNYQRTFVLADHVNTEM 311 (513) Q Consensus 235 ~f~~~~i~~~~G~~~~~siv~~~~~~ffls~~G~y~~~G~~~~~Ig~~~V---~~~~~~~i~~~~~~~~~~~~d~~~~~v 311 (513) -.+.+|+..+--|++++|||..++-+.|-|++|.-++++....-+ ++.+ +.| ..++ ++.|.+.. .+.+| T Consensus 301 ~ms~~kL~~~q~CvS~rsiV~~~~~v~Yas~dGLv~i~~~ga~vv-T~~l~t~~qW--~~~~---P~ti~a~~--~eG~Y 372 (580) T protein:vir:93 301 AMSQEKLELNLPCINARGLVDLGYAIAYPSHDGLVVASSSGARVV-TDQLMTRNDW--LKTA---PGRFVSGQ--FFGRY 372 (580) T ss_pred hccccccccccccccccceeecCceEEeecCCcEEEEeCChHHHH-HhhccChhHH--HhcC---CceEEEEe--ecCeE Confidence 899999999999999999999999999999999999987653222 2211 223 2344 44555444 34567 Q ss_pred EEEEccCCCCCCcccceEEEEecccCeEEEEeccceeeeeecccccccceeecccCcccCccceeccccccccCccceEE Q lcl|NC_010325. 312 WVCYSSTRSKPGKHCDRAIIWNWKENTWSIRDLPNVLSGAYGIIDPKVSNLWDDDPNPWDTDTSVWGEGSYNPAKSSMIF 391 (513) Q Consensus 312 ~~~~~s~~~~~~~~~d~~lvyd~~~~~Ws~~d~~~~~~~~~g~~~~~~~~~~~~~~~~~d~d~~~~~~ds~~~~~~~~~~ 391 (513) +-.|..... .+..-...+|+|..-+.=.+..+...+. .+..+.....++-.. +.+-..|. +++.... T Consensus 373 ~a~Y~~~~~-~~~~~~g~fi~d~~~~~~~~~~~~~~~d--~~~~d~~~d~Ly~~~----~~~i~~~~------~~~~~~~ 439 (580) T protein:vir:93 373 LASYEYIDP-AGTARRGSFIIDLTGQEAFLHRTNYKAD--ATFYDITEGKLYLCI----GQDIYEWD------ALDSENE 439 (580) T ss_pred EEEEccccc-ccccccceEEEecCCCcceeEEeccccc--eeeeeccCCeEEEEe----CCEEEEEc------CCCCCcc Confidence 666654321 1112234688887444311222211111 111111111111000 11111122 1222222 Q ss_pred EEeecCceeeecccceeecCccEEEEeecccccC-------CCcceEE-------E--eeeeeccCCCeeEE-EEeeeee Q lcl|NC_010325. 392 SSFQDKKLFLFGNNSTFSGQNFVSTLERSDIYLG-------DDRMMKT-------V--SAIIPHITGNGTCN-IWVGNAQ 454 (513) Q Consensus 392 ~~~~~~~~~~~~~~~~~~g~~l~a~~~~~~~~~~-------~~~~~~~-------i--~~~~~~~t~~~~~~-~~~g~~~ 454 (513) ...=.++.|.+.....|. |.....+.... +..+... + .++..... ..+++ +.+...+ T Consensus 440 ~~~WrSK~f~~~~~~sf~-----~~rV~s~~~~~~~~~~a~~~~~~~~~a~n~~~~~~~~~~~~~~-~~~v~~~~i~gd~ 513 (580) T protein:vir:93 440 ILVWRSKQYVVQKPTNFG-----VILIEGSVLMTPEEEAAEQAAIDAAKAHNDSIFGDASIGGELN-GAALNVYPIDGDA 513 (580) T ss_pred eEEEecceEEecCCcCce-----EEEEeeccccchhhhhhhhhhhhhhhhhhhhcccccccccccc-cccceeeeecccc Confidence 223344555554333332 22221111000 0000000 0 00000011 11222 3333443 Q ss_pred cCCCCceEcCceeeecCCceEE--EeecCCCeEEEEEEcc-CCCcEEE--EE------EeeEEeccccCC Q lcl|NC_010325. 455 VQGSGIRWKGPYPYRIGQDYKI--DTKHVGRYIALKFDFS-SEGDWYF--NG------YTIEMAPKAGMR 513 (513) Q Consensus 455 ~~~~~~~w~~~~~~~~~~~~~~--~~R~~~Ry~~~rl~~~-~g~~w~~--~G------~~~~~~~~g~rr 513 (513) .+.-+- ..+....+-.+|+. .+-..+| .+||.-. .++.|.+ +| +.+-.....-|+ T Consensus 514 ~~~~~~--~~~~~~~~~adG~~~~t~~~~~~--~~RLPag~~a~~Wev~vsg~~~V~~v~la~s~~EL~~ 579 (580) T protein:vir:93 514 LVRIES--SRFVAATVYADGKAVATVSKLNR--MCRLPSGFLAQTWEVEVSANADIAQVTLAGTGAELAG 579 (580) T ss_pred cccccc--ccceEEEEeeCCeEEEEEecCCc--eEEccCCccccEEEEEEEeccceeEEEEecChHHHhc Confidence 333210 00111111112221 1111121 2222211 3455653 33 333333344444 No 23 >protein:vir:80253 Length: 777 # NCBI annotation: putative tail tubular protein B # Family: family:all:825 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522886;genbank:gi:158345179;genbank:GeneID:5687516 Probab=98.71 E-value=9e-08 Score=59.21 Aligned_cols=483 Identities=11% Similarity=0.081 Sum_probs=237.4 Q ss_pred CcccchhhcCccccccccCcccCCCCcEEEeEEEEEe-CCeeEECCCcceeeecCCCcc----eeeeeeee-CCceEEEE Q lcl|NC_010325. 1 MALERQEVKNPTGIVTDIAPADLPLEKWSFGNNVRFK-NGKAQKTLGHTPIFDTAQAPI----LDMFPFIR-NNIPYWLL 74 (513) Q Consensus 1 m~~~~~~~~~~~G~~~~~~P~~lp~~a~~~~~Nv~~~-~g~~~~~~g~~~~~~~~~~~~----~~~~~~~~-~g~~~~~v 74 (513) |+..-+...+-.|.|.-..+..-=++....|.||+++ .|+++||||..=+....+++- ..+..... ....+++. T Consensus 1 M~~i~~~~~nf~~GvS~q~D~~ry~~q~~~~~N~~~~~~gG~~rRpGt~fv~~l~~~~~~~~~~~~~~~~~~~e~~~~l~ 80 (777) T protein:vir:80 1 MSYFAGSYRQLLFGVSQQTAKDRLEGQVESQLNMQSDLVTGPRRRSPVHLIADAMAATDANRLAYSLATFSGREVLLLVD 80 (777) T ss_pred CceeeeecchhhcccccCCchHHhhhHHhhhhcceeeeccCceeCcchHhhhhhcCCCcccceeEEEEecCCCeeEEEEE Confidence 9999999999999988655554456888999999999 588999999655533222221 22233222 23456666 Q ss_pred EcCceEEEe--cCceEEeccccce-eeCCCCceeEEeeCCEEEEEeCCCceEEEc---------CCC------------- Q lcl|NC_010325. 75 CSEQRLYLA--DGTTIIDVSPGPY-SASITNRWSVGSFNGVIFANDGVNPPHHLP---------PSE------------- 129 (513) Q Consensus 75 ~~~~kly~~--~~~t~~dis~~~~-~~~~~~~w~f~~~~~~~ia~ng~d~~q~~~---------~~s------------- 129 (513) .+.+.|+-+ +++........+| ++..-..-++++.+|+++++|..-+||... +.. T Consensus 81 ~g~g~irv~~~~~g~~~~~~~~~Yl~a~~~~~l~~~q~aD~~fi~n~~~~p~~~~~~~~~~~~~~~~~~~~~v~~~~~g~ 160 (777) T protein:vir:80 81 TLDGTLTILDDATGEVLFTGTNSYLTAGTGRSIRFAALDDSVFVANTEVIPQTQLWSGASAYPDPTRAGYLYVVAGAFSK 160 (777) T ss_pred ecCCeEEEEECCCCeEEEecCCCceeeccccceeEEEEcCEEEEEeCCccceeeecccCCCccCcccceEEEeeccCCCc Confidence 777756644 4555444444566 343333467888888888888666664210 000 Q ss_pred -----------------------------c-------------------------------------------------- Q lcl|NC_010325. 130 -----------------------------S-------------------------------------------------- 130 (513) Q Consensus 130 -----------------------------~-------------------------------------------------- 130 (513) + T Consensus 161 ~y~i~i~~~~~~~~~t~~~~t~~~~~~~~~~~~ia~~L~~~~~~~~~~~s~~~~~~~~~g~~~~i~~~~~~~~t~~~g~~ 240 (777) T protein:vir:80 161 QYRLSITNQVTGVTTSVDVTTSATEASQATGEYVITQLRTAAEADATIGTAAGFAYYQDGAYLYVTAPEAIAVSTDSGSN 240 (777) T ss_pred eeeEeecCCcCceeEEEecCCcccccccccchhhhhhhhhhhccccceeecCceEEEeCCcEEEEEecCceeEecCCcCc Confidence 0 Q ss_pred --------e---ecccCCC-c--------------------------------------------------c-------- Q lcl|NC_010325. 131 --------T---FRVLPNF-P--------------------------------------------------A-------- 140 (513) Q Consensus 131 --------~---f~~L~g~-p--------------------------------------------------~-------- 140 (513) + +.+|+.. | + T Consensus 241 ~~~~~~~~~v~~~~~lp~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~w~e~~~~~~~~~~~t~p~~l~~~~~~~~~~~~ 320 (777) T protein:vir:80 241 FLRASNAASIRDAAELPAKLPADADGFIIATGAAKNKTYFRWVDLERKWDEDASRGAQAELIDMPLRITYSAPNFSLTAL 320 (777) T ss_pred cceeeeeEEEeeccccccccccccceEEEeCCCCCCceEEEEEccCcEEEEeecccccccccccceEEEecCCceEeecc Confidence 0 0000000 0 0 Q ss_pred ccee-------------------eEEEEEcCEEEEEECCcCcccCCceEEEeccCCccccccccccccc--ccCcceecc Q lcl|NC_010325. 141 NTTF-------------------KRLKSFKNFLVGLNATSNSVEMPQMVWWSTSADAGGVPASWDPTDP--TKDAGQNTL 199 (513) Q Consensus 141 ~~ka-------------------~~v~~~~~~l~~~g~t~~~~~~p~rv~wS~~~d~~~~P~~Wd~t~~--t~~a~~~dl 199 (513) .|.. .-|..|++||++++ |+.|+.|..+|.+ +|..... ..+++=.++ T Consensus 321 ~w~~r~~gd~~tn~~Psf~g~~i~~v~f~q~RL~f~~--------~~~v~~Srtgd~~----nF~~~s~~~~~DdDpI~~ 388 (777) T protein:vir:80 321 NYERRASGDATSNPALKFTEQGISGMTTMQGRLVLLA--------GEYVCMSASGNPL----RWFRASVSTQSDDDPIEV 388 (777) T ss_pred CCccccccccccCCCceecCCceeEEEEEcceeeeec--------CCeEEEEeccCcc----ccccccccCCCCCccEEE Confidence 0000 12577888887653 5779999999964 4443321 223333333 Q ss_pred c---CCCCceeEEEecCcceEEEecCcEEEEEecC--CCceeEeEEecCccccccCceeEEECCeEEEEeCC-C----eE Q lcl|NC_010325. 200 A---DTNGAIVDGVKLRDSFIIYKEDSVYSMRYIG--GLFIFQFQQLFNDVGILGPNCAVEFDGNHFVVGHG-D----VY 269 (513) Q Consensus 200 ~---d~~G~iv~g~~l~~~~vIf~en~i~~m~y~g--~~~~f~~~~i~~~~G~~~~~siv~~~~~~ffls~~-G----~y 269 (513) . +....|..+++....|+||...+-|.++-.+ .|.--++.+.+. .+|-+.=.=+.+|+.++|+++. | ++ T Consensus 389 ~~ss~~~~~i~~~v~~~~~L~i~T~~~e~~l~~~~~lTP~~~~~~~~s~-~~~~~~~~Pv~vG~~v~Fv~~r~g~~s~v~ 467 (777) T protein:vir:80 389 AATAPVASPYEYAVAFNKDLVLFAKTHQGLVPGANLLTSRNATAAVVTE-YSFQNSCSPVVAGRTVFFASPRSGPWSAVW 467 (777) T ss_pred EEcCCcceeeeeeeecCCcEEEEecCceEEEeCCCcccceeEEEEEEEe-eccCCCCCceEeCCeEEEEecCCCceeEEe Confidence 2 3334466688889999999999999996322 244466666653 3455544558999999999864 4 54 Q ss_pred EEC--C---cccccCC-chhHHHHHHhhcCcchhCCEEEEEecCCCEEEEEEccCCCCCCcccceEEEEec----ccC-- Q lcl|NC_010325. 270 VHN--G---VQKQSVI-DAQVRKFFFSDINPDNYQRTFVLADHVNTEMWVCYSSTRSKPGKHCDRAIIWNW----KEN-- 337 (513) Q Consensus 270 ~~~--G---~~~~~Ig-~~~V~~~~~~~i~~~~~~~~~~~~d~~~~~v~~~~~s~~~~~~~~~d~~lvyd~----~~~-- 337 (513) .+. . ..+++.. +.-+.+.|- ....++ .+=.....+.|+-... +++++|-| ..+ T Consensus 468 e~~~~~~~~d~y~a~Dlt~~~~hl~~-----~~v~~~--a~s~~p~~v~~~~~~d--------g~l~~~ty~~~~~e~~v 532 (777) T protein:vir:80 468 EMLPSQYTDAQVEASDSTSHLPKYIA-----GPVRFL--ATSSTTSIVVVGTSNL--------RELVVHEYLWQGGEKVH 532 (777) T ss_pred eeeecccccCceehhHHHHHHHHhcC-----CceEEE--EEcCCCceEEEEEcCC--------CeEEEEEEeecCCceEE Confidence 332 1 1122211 222333221 111122 1222333444554322 35666655 222 Q ss_pred -eEEEEeccceee-eee-----------------cccccccce---------eec--ccCcccCccc--eeccccccccC Q lcl|NC_010325. 338 -TWSIRDLPNVLS-GAY-----------------GIIDPKVSN---------LWD--DDPNPWDTDT--SVWGEGSYNPA 385 (513) Q Consensus 338 -~Ws~~d~~~~~~-~~~-----------------g~~~~~~~~---------~~~--~~~~~~d~d~--~~~~~ds~~~~ 385 (513) .|+.-+.+..+- .++ +........ .++ .-...++.+. ....++.. .. T Consensus 533 ~aW~r~~~~g~v~~v~~i~d~l~~iv~r~~~~~le~~~~~~~~d~~~~~~~~~D~~~~~~~~~~~~~~~~~~~~~~~-~~ 611 (777) T protein:vir:80 533 AAWHKWSFPQDITGAYFRGDRLILLFHVAGRVILGELFMQRLGDAQSIPGGFLDLYRVGAANADEEVAIPAFAADLY-PE 611 (777) T ss_pred EeeEEeccCCcEEEEEEECCEEEEEEEcCCeEEEEEEeeccCCCCcccceeeeeeeeeeeeeeCCccceeEeecccc-CC Confidence 577555443211 110 000000000 000 0000111100 00111100 00 Q ss_pred ccc--eEEEEe------------ecCce--e----eecccceeecCccEEEEeecccccCCCc------ceEEEeeeeec Q lcl|NC_010325. 386 KSS--MIFSSF------------QDKKL--F----LFGNNSTFSGQNFVSTLERSDIYLGDDR------MMKTVSAIIPH 439 (513) Q Consensus 386 ~~~--~~~~~~------------~~~~~--~----~~~~~~~~~g~~l~a~~~~~~~~~~~~~------~~~~i~~~~~~ 439 (513) +.. ..+.+. .+... . ........-|-++++.++-..+.+.+.. -+.++.++... T Consensus 612 ~~~~~v~~~~~~~~~~~~~~~v~~~~~~~~~~v~~~~~~~~v~VGl~y~s~~~~~~~~~~~~~g~~~~~~r~~i~r~~~~ 691 (777) T protein:vir:80 612 DSTFAYKLSGEFQSLGQRCGDRRVDGATVYIKVVGAQAGDQYRIGLRYLSKLGPTRPILRDPNGVPITTERTQLHRLTWS 691 (777) T ss_pred cceeEEEecCcccccceeeeeEEeCCceeeEEEcCCCCCCEEEEeeeeEEEEEeCceEEeCCCCceeeecCeEEEEEEEE Confidence 000 001000 00000 0 0112235668888888885554432111 11244444443 Q ss_pred cCCCeeEEEEeeeeecC-----CCCceEcCc------eeeecCCceEEEeecCCCeEEEEEEccCCCcEEEEEEeeEE-- Q lcl|NC_010325. 440 ITGNGTCNIWVGNAQVQ-----GSGIRWKGP------YPYRIGQDYKIDTKHVGRYIALKFDFSSEGDWYFNGYTIEM-- 506 (513) Q Consensus 440 ~t~~~~~~~~~g~~~~~-----~~~~~w~~~------~~~~~~~~~~~~~R~~~Ry~~~rl~~~~g~~w~~~G~~~~~-- 506 (513) +...+.+.+.+...... ..+....++ ...-+| +..++++....-..++|+.+.-.+.++.++++|. T Consensus 692 ~~~sg~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~tg-~~~vp~~~~~~~~~v~i~~d~P~P~tilsi~~e~~y 770 (777) T protein:vir:80 692 LDSTGEVTFRVADQARGESAYTTTPLRLYSRDLGAGLPLAATA-TLDTPARVDMQTAQFSLETDDYYDMNITSLEYGFRY 770 (777) T ss_pred eeccccEEEEEcCCCCcceeeeecCceecccccccccccccce-EEEEEEeecCcceEEEEEECCCCceEEEEEEEEEEe Confidence 33333333333221100 011111111 111122 3557777777777888888889999998888877 Q ss_pred eccccCC Q lcl|NC_010325. 507 APKAGMR 513 (513) Q Consensus 507 ~~~g~rr 513 (513) .+-.-|| T Consensus 771 ~~r~~r~ 777 (777) T protein:vir:80 771 NQRYRRQ 777 (777) T ss_pred ecccccC Confidence 5555555 No 24 >protein:vir:80177 Length: 1027 # NCBI annotation: tail tubular protein B # Family: family:all:12083 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285795;genbank:gi:148747829;genbank:GeneID:5220453 Probab=98.68 E-value=3.6e-08 Score=61.37 Aligned_cols=467 Identities=16% Similarity=0.202 Sum_probs=211.5 Q ss_pred CcccchhhcCccccccccCcccCCCCcEEEeEEEEEeCCeeEECCCcceeeecCCCcceeeeeeeeCCceEEEEEcCceE Q lcl|NC_010325. 1 MALERQEVKNPTGIVTDIAPADLPLEKWSFGNNVRFKNGKAQKTLGHTPIFDTAQAPILDMFPFIRNNIPYWLLCSEQRL 80 (513) Q Consensus 1 m~~~~~~~~~~~G~~~~~~P~~lp~~a~~~~~Nv~~~~g~~~~~~g~~~~~~~~~~~~~~~~~~~~~g~~~~~v~~~~kl 80 (513) ..|-|+|-|--+|.--.. .-|.+.-+...+..-+++. +..+.-....-+++.|.+|--+.+-... T Consensus 338 V~lLR~RELRFN~G~GA~------------~~~L~V~~D~~~~s~N~ss---T~~~T~R~~~L~~A~G~~~~~A~dlayY 402 (1027) T protein:vir:80 338 VHLLRQRELRFNYGNGAT------------GANLRVTVDGTALSANYSS---TVAGTNRAYALYKADGTLCTSASDLAYY 402 (1027) T ss_pred eeeeeeeeeeeccCCCCC------------CcceEEEEcceeeeeeeee---eeeecceeEEEeeeccccccccccceee Confidence 444444444444332221 1233333333333222211 1111111111122334433311111111 Q ss_pred EEecCceEEeccccceeeCCCCceeEEeeCCEEEEEeCCCc-eE---------EEcCCCceecc--cC-----CCcccce Q lcl|NC_010325. 81 YLADGTTIIDVSPGPYSASITNRWSVGSFNGVIFANDGVNP-PH---------HLPPSESTFRV--LP-----NFPANTT 143 (513) Q Consensus 81 y~~~~~t~~dis~~~~~~~~~~~w~f~~~~~~~ia~ng~d~-~q---------~~~~~s~~f~~--L~-----g~p~~~k 143 (513) +...+-| .+..++.++ +-..|++.. +. +.++. -|+. |+ |-.+ - T Consensus 403 ~A~~GAT--PL~IS~~aA--------------~t~~~~~R~yi~~~~~~T~~~~~~G~--Y~k~YGlG~~~~Y~~~~--F 462 (1027) T protein:vir:80 403 IAFTGAT--PLGISPTAA--------------VTITNVDRTYIGSAATQTDNAYVQGG--YFKVYGLGLWANYGTGQ--F 462 (1027) T ss_pred eeeeccc--cccccccce--------------eeeecCceeeeeeeccccCCceEeee--EEEEEEeeeeeecCCcc--c Confidence 2222222 111111111 111111111 11 11111 1111 11 1122 3 Q ss_pred eeEEEEEcCEEEEEECCcCcccCCceEEEeccCC---ccccccccccc---ccccCcceeccc--C-CCCc-eeEEEecC Q lcl|NC_010325. 144 FKRLKSFKNFLVGLNATSNSVEMPQMVWWSTSAD---AGGVPASWDPT---DPTKDAGQNTLA--D-TNGA-IVDGVKLR 213 (513) Q Consensus 144 a~~v~~~~~~l~~~g~t~~~~~~p~rv~wS~~~d---~~~~P~~Wd~t---~~t~~a~~~dl~--d-~~G~-iv~g~~l~ 213 (513) .++-.+|++||++.|-+. .|.|+-+|.++| ++..+++..-+ ++-..+-| +|. . ...+ |++...-. T Consensus 463 ~~I~TvY~~RLvL~~~t~----~~~~~~~S~~GD~~~~G~~Y~F~QvTD~L~G~~sDPF-~L~VsSsq~~d~vT~~~~WQ 537 (1027) T protein:vir:80 463 PRIATVYQSRLVLGGFTN----DPTRVVFSATGDTVEGGVKYNFFQVTDDLDGLDSDPF-DLVVSSSQADDYVTGLVEWQ 537 (1027) T ss_pred cceeeeeeeeeEEeccCC----CcceEEEeecCCcccCceeeeeeeeehhhccCcCCce-eEEEecccccceeeeeeeec Confidence 457789999999766543 488999999998 33333443222 22222222 222 1 2233 44444457 Q ss_pred cceEEEecCcEEEEEecCCCce---eEeEEecCccccccCceeEEECCeEEEEeCCCeEEE----CCcccccCC-chhHH Q lcl|NC_010325. 214 DSFIIYKEDSVYSMRYIGGLFI---FQFQQLFNDVGILGPNCAVEFDGNHFVVGHGDVYVH----NGVQKQSVI-DAQVR 285 (513) Q Consensus 214 ~~~vIf~en~i~~m~y~g~~~~---f~~~~i~~~~G~~~~~siv~~~~~~ffls~~G~y~~----~G~~~~~Ig-~~~V~ 285 (513) ..++||..+..|+.. +|+..+ =.+.......|+++|+|||.-+-.+|||++-|++-+ +.+..+.|- +-|++ T Consensus 538 ~~LFV~T~~~T~~~~-GGd~t~~~a~~~VN~iSs~G~~N~~~VV~T~~~V~Yl~~~G~F~L~~r~~~~~Y~A~EkSiKIR 616 (1027) T protein:vir:80 538 SSLFVLTRRATFRAN-GGDATISPARRFVNYISSLGLVNPFSVVRTDTAVFYLSDSGVFNLTPRVEDGEYQAIEKSIKIR 616 (1027) T ss_pred eeEEEEecceeEEee-cCccccchhHHHHHHHHhhcccCcceEEEeeeEEEEeeccceeeccCCccCCcchhhhhhhhhh Confidence 899999999999985 444322 112223346699999999999999999999999933 334455443 34676 Q ss_pred HHHHhhcCcchhCCEEEEEecCCCEEEEEEccCCCCCCcccceEEEEecccCeEEEEeccc------------------- Q lcl|NC_010325. 286 KFFFSDINPDNYQRTFVLADHVNTEMWVCYSSTRSKPGKHCDRAIIWNWKENTWSIRDLPN------------------- 346 (513) Q Consensus 286 ~~~~~~i~~~~~~~~~~~~d~~~~~v~~~~~s~~~~~~~~~d~~lvyd~~~~~Ws~~d~~~------------------- 346 (513) +-|-+.-.++-.+..--.+|+..|+.|...+-.+...- -.+.++||..-+.|+..+-.. T Consensus 617 ~~F~~~~~ta~~~~~Wm~~~q~~~~LYv~L~~~~eT~~--~S~~~~~N~~~DSWt~~~t~~~Fk~YtghP~V~~~~~~s~ 694 (1027) T protein:vir:80 617 KVFGKTTSTAVSSAAWMSFDQNRKVLYVALPRGSETTV--ASALYVYNTFRDSWTQYDTLGGFKTYTGHPYVDTVLGDSF 694 (1027) T ss_pred hhhhhhccccccceeeeeeccCCceEEEEecCCCcchh--hhhhhhhhhhhcchhhhhcccCcccccCCchhhhhhhhhh Confidence 64433323333344455689999999998875533211 246899999999999665111 Q ss_pred ---e------e------------eeeeccccccc-----------ceeecc-----cCc-----------ccCccce--- Q lcl|NC_010325. 347 ---V------L------------SGAYGIIDPKV-----------SNLWDD-----DPN-----------PWDTDTS--- 375 (513) Q Consensus 347 ---~------~------------~~~~g~~~~~~-----------~~~~~~-----~~~-----------~~d~d~~--- 375 (513) + | +..+|..+... .-.|.. +.+ -...+.+ T Consensus 695 L~~v~~~~TV~ML~~~~~~YvDFF~~CG~~~~~Vlt~~~GIY~~~~P~wnsP~I~~~svs~tt~~~~q~Ye~~T~~~vvp 774 (1027) T protein:vir:80 695 LLMVAYGGTVCMLKLYGSRYVDFFNKCGSFTGNVLTANSGIYTWTAPFWNSPVISNISVSGTTTLAVQRYELPTDLQVVP 774 (1027) T ss_pred hhhhcCchhhhhhhhhcchhhhhhhhcccceeeEEecCCceeEeecccccCCeeeEEEeeccchhhhheecccccccccc Confidence 0 0 00111100000 001110 000 0000111 Q ss_pred ----------------eccccccccCccceEEEEeecCceeee---------------------------------cccc Q lcl|NC_010325. 376 ----------------VWGEGSYNPAKSSMIFSSFQDKKLFLF---------------------------------GNNS 406 (513) Q Consensus 376 ----------------~~~~ds~~~~~~~~~~~~~~~~~~~~~---------------------------------~~~~ 406 (513) ....+=-..+..-.+++.--+++.+.+ ..+- T Consensus 775 ydnvedlsiyvnGT~Ls~~~~~~~~~~~i~LL~~~~~~~~~s~Vprcpvnvsy~~~~~~~~TT~~TV~~N~~~~iQ~Tdy 854 (1027) T protein:vir:80 775 YDNVEDLSIYVNGTRLSFGTDWVKQGKAIYLLSDPGDGKTVSIVPRCPVNVSYQGDVTFDETTAQTVWVNNLLQIQGTDY 854 (1027) T ss_pred ccccccceeeecceeEeecCchhhcCCEEEEecCCCCcceEEEEecccccccccccccccccccceEEecceeeecccee Confidence 000000001111111222111111111 0000 Q ss_pred eeecC-----------------ccEEEEeecccccCCCcceEEEeeeeeccCCCeeEEE-Eeee----ee---------- Q lcl|NC_010325. 407 TFSGQ-----------------NFVSTLERSDIYLGDDRMMKTVSAIIPHITGNGTCNI-WVGN----AQ---------- 454 (513) Q Consensus 407 ~~~g~-----------------~l~a~~~~~~~~~~~~~~~~~i~~~~~~~t~~~~~~~-~~g~----~~---------- 454 (513) +..|. .+.+...++.|+|.--+++|+++.+..++..+..+-+ .+|. ++ T Consensus 855 ~~~GS~L~~~~~LtN~~~~~G~~Y~S~Y~SP~F~L~SL~~LKk~K~~~L~~Dnedvlpvytigdlasgqdvddlvgkwkt 934 (1027) T protein:vir:80 855 TLSGSTLTFTDTLTNAVVEVGNAYISYYQSPMFLLGSLSNLKKVKHVYLYFDNEDVLPVYTIGDLASGQDVDDLVGKWKT 934 (1027) T ss_pred eeccCccccccccccceEEEeecchhhhcchhhhhhhhhhhhheeeeEEEEcCCcceeeeeeccccCCCchhHhhhhhcc Confidence 11222 2234455888888888899999888877665553322 1221 11 Q ss_pred --cCCCCceEcCceeeecCCceE-------------EEee----cCCCeEEEE-------------EEccCCCcEEEEEE Q lcl|NC_010325. 455 --VQGSGIRWKGPYPYRIGQDYK-------------IDTK----HVGRYIALK-------------FDFSSEGDWYFNGY 502 (513) Q Consensus 455 --~~~~~~~w~~~~~~~~~~~~~-------------~~~R----~~~Ry~~~r-------------l~~~~g~~w~~~G~ 502 (513) ..+..+++.++.+-. +.+. .++. ++.||.-|| +-.-+.++|.+-|| T Consensus 935 rananisvtydsentse--tsydiysfsdlvwdnaffdvdptnlqstryalfkeallgvgynyqigvwsfdeaswklcgy 1012 (1027) T protein:vir:80 935 RANANISVTYDSENTSE--TSYDIYSFSDLVWDNAFFDVDPTNLQSTRYALFKEALLGVGYNYQIGVWSFDEASWKLCGY 1012 (1027) T ss_pred cccceeEEEecCcCccc--ceeeeeehhhhhcccceecccccccchhhHHhhhhhHhhcccceeeeeeeecccceeeeee Confidence 122223332222111 1111 1222 566886554 22346789999999 Q ss_pred eeEEeccccCC Q lcl|NC_010325. 503 TIEMAPKAGMR 513 (513) Q Consensus 503 ~~~~~~~g~rr 513 (513) .++.. ..++| T Consensus 1013 qvdar-lsgkr 1022 (1027) T protein:vir:80 1013 QVDAR-LSGKR 1022 (1027) T ss_pred eeeee-ecccc Confidence 99965 55555 No 25 >protein:vir:94602 Length: 1012 # NCBI annotation: PfWMP4_35 # Family: family:all:12083 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762665;genbank:gi:115304373;genbank:GeneID:5142302 Probab=98.62 E-value=1.8e-07 Score=57.57 Aligned_cols=479 Identities=14% Similarity=0.150 Sum_probs=214.0 Q ss_pred CcccchhhcCccccccccCcccC---CCCcEEEeEEEEEeCCeeEECCCcceeeecCCCcceeeeeeeeCCceEEEEEcC Q lcl|NC_010325. 1 MALERQEVKNPTGIVTDIAPADL---PLEKWSFGNNVRFKNGKAQKTLGHTPIFDTAQAPILDMFPFIRNNIPYWLLCSE 77 (513) Q Consensus 1 m~~~~~~~~~~~G~~~~~~P~~l---p~~a~~~~~Nv~~~~g~~~~~~g~~~~~~~~~~~~~~~~~~~~~g~~~~~v~~~ 77 (513) ..|-|+|-|--+|.--.. |.+| -|+---..+||-|... .++..+.+..+.-....-+++.|++|- . T Consensus 352 V~iLR~RELRFN~G~GA~-~~~L~V~~D~~~~t~Nnvpfsps------nfqt~atT~~~T~R~~~L~~A~G~~~~----~ 420 (1012) T protein:vir:94 352 VNILRLRELRFNGGTGAK-PDDLQVYNDTVEHTWNNVPFSPS------NFQTWATTYTATDRVITLMSAVGDRFN----N 420 (1012) T ss_pred eeeeeeeeeeeccCCCCC-CcceEEEEcceeeeccccccCcc------cccceeeeeeecceeEEEeeecccccc----C Confidence 666676666655554331 1222 1111122233322211 111122222111111111222333221 0 Q ss_pred ceEEEecCceEEeccc--cceeeCCCCceeEEeeCCEEEEEeCCCceEEEc-CC-C-----ceecc--cC-----CCccc Q lcl|NC_010325. 78 QRLYLADGTTIIDVSP--GPYSASITNRWSVGSFNGVIFANDGVNPPHHLP-PS-E-----STFRV--LP-----NFPAN 141 (513) Q Consensus 78 ~kly~~~~~t~~dis~--~~~~~~~~~~w~f~~~~~~~ia~ng~d~~q~~~-~~-s-----~~f~~--L~-----g~p~~ 141 (513) ...+...+-| +..+ .|+.- +.. ..+-..|++..+++.. ++ . .-|+. |+ +-.+ T Consensus 421 A~Y~A~~GAT--nnlpanaPL~I------S~~---sA~s~~~~~R~v~~~~~~T~~~~~~G~Y~r~YGiG~~~~Y~~~~- 488 (1012) T protein:vir:94 421 ANYFAILGAT--NNLPANAPLHI------SCL---SASSYLGGSRRVWYRNLPTTGGTLDGCYVRAYGIGKYVDYSKRS- 488 (1012) T ss_pred cceEEEeecc--cccccCCcccc------ccc---cceeeeccceeeeeeccccCCceEeeeEEEEEEeeeeeecCCcc- Confidence 0111111111 0000 11111 110 1122233444433311 10 0 11111 11 1112 Q ss_pred ceeeEEEEEcCEEEEEECCcCcccCCceEEEeccCC---ccccccccccc---ccccCcceeccc-CCCCc-eeEEEecC Q lcl|NC_010325. 142 TTFKRLKSFKNFLVGLNATSNSVEMPQMVWWSTSAD---AGGVPASWDPT---DPTKDAGQNTLA-DTNGA-IVDGVKLR 213 (513) Q Consensus 142 ~ka~~v~~~~~~l~~~g~t~~~~~~p~rv~wS~~~d---~~~~P~~Wd~t---~~t~~a~~~dl~-d~~G~-iv~g~~l~ 213 (513) -.++-.+|++||++.|-+. .|.++-+|.++| ++..+++..-+ ++-..+-|-... ....+ |++...-. T Consensus 489 -F~~I~TiY~~RLiL~~~s~----~~~~~~~S~~GD~~~~G~~Y~F~QiTD~L~G~~tDPF~L~VtSe~~e~iT~~~~WQ 563 (1012) T protein:vir:94 489 -FHAIGTIYRDRLILVNPST----ATDQLLISEIGDATVPGEFYQFMQITDMLQGVTTDPFTLNVTSEGRERITAVTGWQ 563 (1012) T ss_pred -ccceeeeeeeeeEEeccCC----CcceEEEeecCCcccCceeeeeeeeehhhccCcCCceeEEEcccccceeeeeeeec Confidence 3457789999999766543 478999999998 33334443222 222222232111 22233 44444457 Q ss_pred cceEEEecCcEEEEEecCCC---ceeEeEEecCccccccCceeEEECCeEEEEeCCCeEEE----CCcccccCC-chhHH Q lcl|NC_010325. 214 DSFIIYKEDSVYSMRYIGGL---FIFQFQQLFNDVGILGPNCAVEFDGNHFVVGHGDVYVH----NGVQKQSVI-DAQVR 285 (513) Q Consensus 214 ~~~vIf~en~i~~m~y~g~~---~~f~~~~i~~~~G~~~~~siv~~~~~~ffls~~G~y~~----~G~~~~~Ig-~~~V~ 285 (513) ..++||..+..|+.+ +|+. .-+-+. .....|+++++|||.-+-.+|||++-|++-+ +.+..+.|- +-|++ T Consensus 564 ~~LFV~T~~~T~~~~-GGe~~~~s~~~VN-~vSt~G~~N~~~VV~T~~~V~Ym~~~G~F~L~~k~~~~~Y~A~ErSvKIR 641 (1012) T protein:vir:94 564 KRLFVFTGSNTYSIE-GGEQFGESSYAVN-LVSTYGAFNQNCVVVTNLTVLYMNKFGLFDLMNKPNTDSYGAFERSVKIR 641 (1012) T ss_pred eeEEEEeccceEeec-cccccchhHHHHH-hHHhhcccCcceEEEeeeEEEEeeccceeeccCCccCCcchhhhhhhhhh Confidence 899999999999985 3333 112222 3346699999999999999999999999933 334444442 33676 Q ss_pred HHHHhhc-CcchhCCEEEEEecCCCEEEEEEccCCCCCCcccceEEEEecccCeEEEEeccc------------------ Q lcl|NC_010325. 286 KFFFSDI-NPDNYQRTFVLADHVNTEMWVCYSSTRSKPGKHCDRAIIWNWKENTWSIRDLPN------------------ 346 (513) Q Consensus 286 ~~~~~~i-~~~~~~~~~~~~d~~~~~v~~~~~s~~~~~~~~~d~~lvyd~~~~~Ws~~d~~~------------------ 346 (513) +- |..+ .++-.+..--.+|+..|+.|...+-.+..-- -.+.++||..-+.|+..+-.. T Consensus 642 ~~-F~~~~~ss~~~~~Wl~~~e~~~~LYi~L~~~~dT~~--~S~~~~~N~~~DSWs~~~s~~~Fq~YP~V~~~~~~t~L~ 718 (1012) T protein:vir:94 642 GL-FQNLAGSSGDNLHWLRYNESSNKLYIGLAAEGDTRT--TSRNLMLNFTWDSWSTLSSAAPFQMYPAVQLFKYMTWLT 718 (1012) T ss_pred hh-hhhhccccccceeeeeeccCCceEEEEecCCCcchh--hhhhhhhhhhhcchhhhhccCCcccchhhhhhhhhhhhh Confidence 64 4444 3333344455689999999998875533211 246899999999999665110 Q ss_pred -e------e--------eeee--------------cccccccc---e---ee--------cccCcccC--ccceeccccc Q lcl|NC_010325. 347 -V------L--------SGAY--------------GIIDPKVS---N---LW--------DDDPNPWD--TDTSVWGEGS 381 (513) Q Consensus 347 -~------~--------~~~~--------------g~~~~~~~---~---~~--------~~~~~~~d--~d~~~~~~ds 381 (513) + | ++-| |..+.... + .| .++.++-. ...+.+.+.. T Consensus 719 ~i~~~~TV~ML~~~~~~YiDFatirthiypF~~CaG~~~~~Vms~~~GIY~~~~P~tP~I~~~tit~ss~~~~k~Yq~~T 798 (1012) T protein:vir:94 719 NINAPLTVAMLATEMPFYIDFATIRTHIYPFTFCAGQRDVSVMSDSRGIYNLPLPVTPGILDYTITASSKAGAKTYQRNT 798 (1012) T ss_pred hhcCchhhhhhhhccceeeeeehhcccccceeeeccceeeEEEecCCceEEecccccceeeeeEeeccchhhhheecccc Confidence 0 0 0000 10000000 0 00 00111000 1111122111 Q ss_pred cccCccc---------------eEEEEeecCceeee-cccc---------------------eee--------------- Q lcl|NC_010325. 382 YNPAKSS---------------MIFSSFQDKKLFLF-GNNS---------------------TFS--------------- 409 (513) Q Consensus 382 ~~~~~~~---------------~~~~~~~~~~~~~~-~~~~---------------------~~~--------------- 409 (513) . -+|.. .++++--+++.+.+ ..++ +.. T Consensus 799 ~-~~GT~tLt~~~~~~~~~~~l~LL~~~~~~~~~a~V~~~~~~~~TT~~TV~~N~~~~lQ~T~~~GS~L~~~~~LsqN~~ 877 (1012) T protein:vir:94 799 A-SAGTETLTLRNPMMDYADTLELLGGNVNASQFAMVMSNGFEPYTTYPTVTYNGVAPLQWTVTGGSGLNNRPILSQNNN 877 (1012) T ss_pred c-cccceeeeecChhhhcCcEEEEecCCCCccEEEEEeecccccccccceEEecceeeeeEEEecCCccccccccccCce Confidence 1 11111 11222112211111 1111 111 Q ss_pred ---cCccEEEEeecccccCCCcceEEEeeeeecc----CCCeeEEEEeeeeecCCCCceEcCc------------eeeec Q lcl|NC_010325. 410 ---GQNFVSTLERSDIYLGDDRMMKTVSAIIPHI----TGNGTCNIWVGNAQVQGSGIRWKGP------------YPYRI 470 (513) Q Consensus 410 ---g~~l~a~~~~~~~~~~~~~~~~~i~~~~~~~----t~~~~~~~~~g~~~~~~~~~~w~~~------------~~~~~ 470 (513) |-.+.+...++.|+|.--+++|+++.+..++ +++-.+++.-|=++..--...|-.- .+|.. T Consensus 878 ~~~G~~Y~S~Y~SP~F~L~SL~~LKr~K~~~L~~Dttvtsqlkynltsgfsqvsvlntawvavvsnynenivpavvsyqv 957 (1012) T protein:vir:94 878 CIMGMIYPSVYASPIFDLESLGRLKRLKKLHLQMDTTVTSQLKYNLTSGFSQVSVLNTAWVAVVSNYNENIVPAVVSYQV 957 (1012) T ss_pred EEEeecchhhhcchhhhhhhhhhhhheeeeeEEeeeeeeeeeeeehhcccceeeeecceeeeeeeccCccccceeeeeec Confidence 2222344557888888888888888877753 3444455555555554444445221 12223 Q ss_pred CCceEE--------EeecCCCeEEEEEEccCCCcEEEEEEeeEEeccccCC Q lcl|NC_010325. 471 GQDYKI--------DTKHVGRYIALKFDFSSEGDWYFNGYTIEMAPKAGMR 513 (513) Q Consensus 471 ~~~~~~--------~~R~~~Ry~~~rl~~~~g~~w~~~G~~~~~~~~g~rr 513 (513) |..+.| ++..-|-=..|-+...-.+-+.+-.|+.+++|.--+| T Consensus 958 gnsyeirrvvelsiplqgygcdyqfyiasvgaeafklaayefdiqpqrdkr 1008 (1012) T protein:vir:94 958 GNSYEIRRVVELSIPLQGYGCDYQFYIASVGAEAFKLAAYEFDIQPQRDKR 1008 (1012) T ss_pred CCceeeeEEEEEeecccccccceeEeeeeccccceeeeeeeeccccchhhh Confidence 322211 0000111111122222234456789999999888777 No 26 >protein:vir:94583 Length: 792 # NCBI annotation: Tubular tail protein B # Family: family:all:825 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919014;genbank:gi:119637778;genbank:GeneID:5179343 Probab=98.62 E-value=1.8e-07 Score=57.56 Aligned_cols=481 Identities=15% Similarity=0.100 Sum_probs=231.8 Q ss_pred CcccchhhcCccccccccCcccCCCCcEEEeEEEEEe-CCeeEECCCcceeee-----cCCCcceeeeeee-eCCceEEE Q lcl|NC_010325. 1 MALERQEVKNPTGIVTDIAPADLPLEKWSFGNNVRFK-NGKAQKTLGHTPIFD-----TAQAPILDMFPFI-RNNIPYWL 73 (513) Q Consensus 1 m~~~~~~~~~~~G~~~~~~P~~lp~~a~~~~~Nv~~~-~g~~~~~~g~~~~~~-----~~~~~~~~~~~~~-~~g~~~~~ 73 (513) |+...+.+.+..|.|.-.-+..-=++....|.||+++ .|+++||+|..=+.. .++..+. +.++. .....+++ T Consensus 1 M~~i~~s~~n~~~GiSqq~D~~ry~~q~~~~~N~~~~~~gG~~rRpG~~fv~~l~~~~~~~~~~~-l~~~~~~~~q~y~l 79 (792) T protein:vir:94 1 MALISQSVKNLKGGISQQPNILRFPEQGSEQINGWSSETEGLQKRPPFVFTKTIGDQNALGAKPL-VHLINRDSAEQYYV 79 (792) T ss_pred CcceeeecchhhcceecCcchHHhhhhhhhhhcceeeeccccccCChhHHHHhhhcCCCCCcccE-EEEEEeCCCceEEE Confidence 9999999999999988665554456888999999999 588999999654421 1222222 22322 23345555 Q ss_pred EEcCceEEEec-CceEEeccc-cceeeCCC--CceeEEeeCCEEEEEeCCCceEEEc----------------------- Q lcl|NC_010325. 74 LCSEQRLYLAD-GTTIIDVSP-GPYSASIT--NRWSVGSFNGVIFANDGVNPPHHLP----------------------- 126 (513) Q Consensus 74 v~~~~kly~~~-~~t~~dis~-~~~~~~~~--~~w~f~~~~~~~ia~ng~d~~q~~~----------------------- 126 (513) +.+.+.|+-++ ++....++. .+|-.... ..-++++.+|+++++|..-+|+... T Consensus 80 ~f~~~~~rv~~~~g~~~~~~~~~~y~~~~~~~~~l~~~q~aD~~fi~n~~~~~~~~~~~~~~~~~~~~~~v~i~~g~y~~ 159 (792) T protein:vir:94 80 VFTGQGVRVFDLNGKEYDVKGDLSYVKVENPRDDLRMVTVADYTFIVNRNMVVRPDTTPLYTLKENGDCLINIRGGMYGR 159 (792) T ss_pred EEcCCeEEEEecCCceEEecccCceeeecCCcceeEEEEEcCEEEEEeCCccceeEecCcCCCCCCceEEEEccCCCcce Confidence 65665444342 111222221 23311111 1234444444444444332222100 Q ss_pred --------------------------------------------------------------CC---------------- Q lcl|NC_010325. 127 --------------------------------------------------------------PS---------------- 128 (513) Q Consensus 127 --------------------------------------------------------------~~---------------- 128 (513) ++ T Consensus 160 ~y~i~i~~~~~~~~~~~~t~~~~~~~~~~~~i~~~l~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~g~~~~ 239 (792) T protein:vir:94 160 TLAFTINNTKIAYEIAHGDAPEHSKQTDAQWLVKKLAGLARLNVAFKGWTFTEGPGYIHVIAPSNSQINSLSTEDGYADQ 239 (792) T ss_pred eEEEEecCceeeeeeecCcccceecccchhhhhhhhhhhccccccccccEEEECCeEEEEEecCCceeeeeecccCcCcc Confidence 00 Q ss_pred --------CceecccCCC------------------------------------------------cc------------ Q lcl|NC_010325. 129 --------ESTFRVLPNF------------------------------------------------PA------------ 140 (513) Q Consensus 129 --------s~~f~~L~g~------------------------------------------------p~------------ 140 (513) .+.+..|+.. |. T Consensus 240 ~~~~~~~~v~~~~~lp~~~~~G~~v~i~~~~~~~~d~y~v~~~~~~~~w~E~~~~~~~~~~~~~tmp~~lv~~~~~~~~~ 319 (792) T protein:vir:94 240 LMNAVMHTSQSFSRLPVEAPNGYTVKIVGDTSKTSDMFYVQYDNMKKVWKEVAGWGVQKGLNGGTMPHALVRQADGSFQM 319 (792) T ss_pred eeeeeeecccccccccccCCCCcEEEEEccCCCCccceEEEEEcCCceEEEecccceeeeecccccCeeEEEcCCCcEEE Confidence 0001111100 00 Q ss_pred ---ccee-------------------eEEEEEcCEEEEEECCcCcccCCceEEEeccCCccccccccccccc--ccCcce Q lcl|NC_010325. 141 ---NTTF-------------------KRLKSFKNFLVGLNATSNSVEMPQMVWWSTSADAGGVPASWDPTDP--TKDAGQ 196 (513) Q Consensus 141 ---~~ka-------------------~~v~~~~~~l~~~g~t~~~~~~p~rv~wS~~~d~~~~P~~Wd~t~~--t~~a~~ 196 (513) .|.. .-|..|+|||++++ |+.|+.|..+|.+ +|..... ..+++= T Consensus 320 ~~~~w~~r~~gd~~tnp~psf~g~~i~~v~f~q~RL~f~~--------~~~v~~Srtgd~~----nF~~~t~~~~~DdD~ 387 (792) T protein:vir:94 320 QVLPWTQRTCGDMDTNPTPSIVDQKINDVFFFRNRLGFLA--------GENIVMSRTSKYF----SLFPASVANLSDDDP 387 (792) T ss_pred EeccccccccCccccCccceeccCCcceEEEEcceEEEec--------CCeEEEEccCCcc----cCccccccCCCCCcc Confidence 0000 23788999998764 6789999999964 4444332 223344 Q ss_pred eccc---CCCCceeEEEecCcceEEEecCcEEEEEecC--CCceeEeEEecCccccccCceeEEECCeEEEEeCCCe--- Q lcl|NC_010325. 197 NTLA---DTNGAIVDGVKLRDSFIIYKEDSVYSMRYIG--GLFIFQFQQLFNDVGILGPNCAVEFDGNHFVVGHGDV--- 268 (513) Q Consensus 197 ~dl~---d~~G~iv~g~~l~~~~vIf~en~i~~m~y~g--~~~~f~~~~i~~~~G~~~~~siv~~~~~~ffls~~G~--- 268 (513) .++. +....|..+++....|+||...+-|.++-.+ +|.-.++.+.+. .+|-+.=.=+.+|+.++|+++.|= T Consensus 388 I~~~~ss~~~~~i~~~v~~~~~L~l~T~~~q~~l~~~~~lTP~~~~i~~~s~-~~~~~~~~Pv~vG~~v~Fv~~~g~~~~ 466 (792) T protein:vir:94 388 IDVAVSHNRISILKYAVPFSEELLLWSDQAQFVLSAQGILSPKSVELNLTTE-FDVSDRARPFGVGRGVYFASPRASYTS 466 (792) T ss_pred EEEEecCCcceeeeEEeecCCcEEEEecCcEEEEeCCCcccceeEEEEEEEE-eeccCCCCceEeCCeEEEeecCCCeeE Confidence 4443 3334466688899999999999999996322 244466666664 345444456789999999999873 Q ss_pred -EE---ECCc--ccccCC-chhHHHHHHhhcCcchhCCEEEEEe-cCCCEEEEEEccCCCCCCcccceEEEEec----cc Q lcl|NC_010325. 269 -YV---HNGV--QKQSVI-DAQVRKFFFSDINPDNYQRTFVLAD-HVNTEMWVCYSSTRSKPGKHCDRAIIWNW----KE 336 (513) Q Consensus 269 -y~---~~G~--~~~~Ig-~~~V~~~~~~~i~~~~~~~~~~~~d-~~~~~v~~~~~s~~~~~~~~~d~~lvyd~----~~ 336 (513) +. ++-. .+.+.. +.-+.+.| +... +.-++. ..+..+.|+-... +++++|-| .. T Consensus 467 v~r~~~~~~~~d~y~a~DlT~~~~hl~-----~~~v--~~~~a~~~~~~~vv~~~~~~--------g~l~~~ty~~~~~e 531 (792) T protein:vir:94 467 LNRYYAVQDVSSVKSAEDMSAHVPNYI-----PNGV--FSIRGSSTENFISVLSSNAP--------SRIFLYKFLYLNEE 531 (792) T ss_pred EEeeeeeccccCceehhhHHHHHHHhc-----CCce--EEEEEeCCCCcEEEEEEcCC--------CeEEEEEEeecCCc Confidence 22 2321 222221 11222221 1111 112222 2233455553322 36777765 23 Q ss_pred C---eEEEEeccce----eeeeec-----cc----cccccee---ec-----ccCcccCcccee---------------- Q lcl|NC_010325. 337 N---TWSIRDLPNV----LSGAYG-----II----DPKVSNL---WD-----DDPNPWDTDTSV---------------- 376 (513) Q Consensus 337 ~---~Ws~~d~~~~----~~~~~g-----~~----~~~~~~~---~~-----~~~~~~d~d~~~---------------- 376 (513) + .|+.-+.+.. |..+.+ +. ....... .. ..++..-+|... T Consensus 532 ~~v~aW~~~~~~g~~~~~~~~~~~D~l~~~v~r~~~~~~~r~~~~~~~~d~~~~~~~~~lD~~~~~~~~~~~~~~~~~~T 611 (792) T protein:vir:94 532 IAQQSWSHWELGSNVTVLACDSIGSTMYLVLRNQSHTWMCRAHFTKNSIDFPDEPYRLYIDNKVKYVIPEGSYNDDTYAT 611 (792) T ss_pred eEEEeEEEEEcCCcEEEEEEeecCCEEEEEEEeCCCEEEEEEEEeecccccCCCcceeeeeeeeeEEecCcceecCceee Confidence 3 6887665431 111111 00 0000000 00 000000011100 Q ss_pred -----ccccccccCccceEEEEeecCce----------------eee----cccceeecCccEEEEeecccccC----CC Q lcl|NC_010325. 377 -----WGEGSYNPAKSSMIFSSFQDKKL----------------FLF----GNNSTFSGQNFVSTLERSDIYLG----DD 427 (513) Q Consensus 377 -----~~~ds~~~~~~~~~~~~~~~~~~----------------~~~----~~~~~~~g~~l~a~~~~~~~~~~----~~ 427 (513) +.-......|+..... .|+.. +.+ ......-|-++++.++...+.+. +. T Consensus 612 ~~~~~~~~gl~~l~G~~v~v~--~dG~~~~~~~~~~~~~~~~~~i~~~g~~~a~~v~VGl~y~~~~~~~~~~~~~~~g~~ 689 (792) T protein:vir:94 612 TVKPVDVYGMKYWTGKFYIVA--SDGLVSWFEPPRGGWPNGVPMLTMSGNREGETIYVGLAISFRYVFSKFLIKKTADDG 689 (792) T ss_pred eeccccccCcccccCcEEEEE--ecCceeEeecccceecCCccEEEecCCccCCeEEEeeeeeEEEEeccceeeccCCCc Confidence 0001111122222211 22221 111 11234568888888886554331 11 Q ss_pred c------ceEEEeeeeeccCCCeeEEEEeee--eecCC--------CCceEcCceeeecCCceEEEeecCCCeEEEEEEc Q lcl|NC_010325. 428 R------MMKTVSAIIPHITGNGTCNIWVGN--AQVQG--------SGIRWKGPYPYRIGQDYKIDTKHVGRYIALKFDF 491 (513) Q Consensus 428 ~------~~~~i~~~~~~~t~~~~~~~~~g~--~~~~~--------~~~~w~~~~~~~~~~~~~~~~R~~~Ry~~~rl~~ 491 (513) . -+.++.++.......+.+.+.+.. ++... .+..-.+.....++ ...++++..++-..++|+. T Consensus 690 ~~~~~~~gr~rl~r~~~~~~~tg~~~v~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~tg-~~~vp~~g~~~~~~v~i~~ 768 (792) T protein:vir:94 690 SIATEDIGRLQLRRAWVNYEDSGAFTVEVENTSRLFSYDMAGARLGSNVLRAGGLNVGTG-QFRFPVTGNAQLNEVRIIS 768 (792) T ss_pred CccccceeeEEEEEEEEeeeccceeEEEEcCCCcceeeeeccceeccccccccccccccc-eEEEEeeccCceEEEEEEE Confidence 1 012444433333333444433321 11100 00000111122223 3678899999999999999 Q ss_pred cCCCcEEEEEEeeEEe-ccccCC Q lcl|NC_010325. 492 SSEGDWYFNGYTIEMA-PKAGMR 513 (513) Q Consensus 492 ~~g~~w~~~G~~~~~~-~~g~rr 513 (513) +.-.+.++.++++|.. -.=.|| T Consensus 769 d~P~P~tvlai~~eg~y~~r~~~ 791 (792) T protein:vir:94 769 EHTTPLNVIGCGWEGNYLRRSSG 791 (792) T ss_pred CCCCCEEEEEEEEEEEEeccccC Confidence 9999999999999983 334444 No 27 >protein:vir:94713 Length: 785 # NCBI annotation: tail tube # Family: family:all:825 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338122;genbank:gi:77118200;genbank:GeneID:3707736 Probab=98.55 E-value=3.1e-07 Score=56.23 Aligned_cols=481 Identities=14% Similarity=0.107 Sum_probs=243.7 Q ss_pred CcccchhhcCccccccccCcccCCCCcEEEeEEEEEe-CCeeEECCCcceeeecC-CCcc-eeeeee-eeCCceEEEEEc Q lcl|NC_010325. 1 MALERQEVKNPTGIVTDIAPADLPLEKWSFGNNVRFK-NGKAQKTLGHTPIFDTA-QAPI-LDMFPF-IRNNIPYWLLCS 76 (513) Q Consensus 1 m~~~~~~~~~~~G~~~~~~P~~lp~~a~~~~~Nv~~~-~g~~~~~~g~~~~~~~~-~~~~-~~~~~~-~~~g~~~~~v~~ 76 (513) |+..-+.+.+..|.|.-..+..-=++....|.||+++ .|+++||||..=+.... ..+. .....+ ......++++.+ T Consensus 1 M~~~~~s~~n~~~GvSqq~D~~ry~~q~~~~~N~~~~~~gG~~rRpG~~~v~~l~~~~~~~~~~~~f~~~~~~~y~l~~~ 80 (785) T protein:vir:94 1 MPLITQSIKNLKGGISQQPDILRFSDQGEAQVNCWSSESDGLQKRPPTVFKRRLNIDVGSNPKFHLINRDEQEQYYIVFN 80 (785) T ss_pred CcceeeecchhhcceecCCchHHhhhHHhhhhcceeeeccCcccCChhHhhhcccCCCCcCcEEEEEEeCCCceEEEEEc Confidence 9999999999999998665554456888999999999 58899999976664321 2221 222333 345566777777 Q ss_pred CceEEEecC-ceEEeccc-cceeeCC--CCceeEEeeCCEEEEEeCCCceEEEcCC------------------------ Q lcl|NC_010325. 77 EQRLYLADG-TTIIDVSP-GPYSASI--TNRWSVGSFNGVIFANDGVNPPHHLPPS------------------------ 128 (513) Q Consensus 77 ~~kly~~~~-~t~~dis~-~~~~~~~--~~~w~f~~~~~~~ia~ng~d~~q~~~~~------------------------ 128 (513) .+.|+-|+. +....++. .+|-... -+.-++++.+|+++++|..-+||...-. T Consensus 81 ~~~irv~~~~G~~~~v~~~~~y~~~~~~~~~l~~~q~aD~~fi~n~~~~~~~~~~~~~~~~~~~~~~~~~i~~g~y~~~y 160 (785) T protein:vir:94 81 GSNIQIVDLSGNQYSVSGSVDYVKSSNPRDDIRVVTVADYTFVVNRKVVVKGGSEKSHSGYNRKARALINLRGGQYGRTL 160 (785) T ss_pred CCeEEEEecCCcEEEEecCCCceeecCchhheeeEeeCCEEEEEcCCcceeeeeccCCcCCCCCCceEEEecccccceeE Confidence 887776653 33334443 3552221 1236788888888888755555421000 Q ss_pred -------------------------------------------------------------------------------- Q lcl|NC_010325. 129 -------------------------------------------------------------------------------- 128 (513) Q Consensus 129 -------------------------------------------------------------------------------- 128 (513) T Consensus 161 ~i~i~g~~~at~~t~~~s~a~~s~~~~s~~~i~~~l~~~l~a~~t~~t~~~~g~~i~i~a~s~t~~~~~s~~~~~~~t~~ 240 (785) T protein:vir:94 161 KVGINGGVKVSHKLPAGNDAENDPPKVDAQAIGAALRDLLVTAYPTFTFDLGSGFLLITAPSGTDINSVETEDGYANQLI 240 (785) T ss_pred EEeeCCcceeEEEEccCccccccccccchHHHHHHHHHHhhccccceeEEecCcEEEEEecCCccccceeeecccCCeEE Confidence Q ss_pred ------CceecccCC---------------Ccc--------------------c-------------------------- Q lcl|NC_010325. 129 ------ESTFRVLPN---------------FPA--------------------N-------------------------- 141 (513) Q Consensus 129 ------s~~f~~L~g---------------~p~--------------------~-------------------------- 141 (513) .+.+++||. .++ + T Consensus 241 ~~~~~~~~~~~~Lp~~~~~G~~v~v~~~~~~~~~~y~v~~~~~~g~w~e~~~~g~~~~~~~~tmp~~l~~~~~~~~~~~~ 320 (785) T protein:vir:94 241 SPVLDTVQTISKLPLAAPNGYIIKIQGETNSSADEYYVMYDSNTKTWKETVEPGVVTGFDNTTMPHALVRQSDGSFEFKA 320 (785) T ss_pred EEEEeeccceeccccccCCCCEEEEEccCCCCccceEEEEEcCCceEEEecccceeeeeeccccceEEEeccCCceEEec Confidence 000111210 000 0 Q ss_pred --ce-------------------eeEEEEEcCEEEEEECCcCcccCCceEEEeccCCcccccccccccc--cccCcceec Q lcl|NC_010325. 142 --TT-------------------FKRLKSFKNFLVGLNATSNSVEMPQMVWWSTSADAGGVPASWDPTD--PTKDAGQNT 198 (513) Q Consensus 142 --~k-------------------a~~v~~~~~~l~~~g~t~~~~~~p~rv~wS~~~d~~~~P~~Wd~t~--~t~~a~~~d 198 (513) |. ...|..|++||++++ |+.|+.|..+|.+ ++.... ...+++=.+ T Consensus 321 ~~w~~r~~Gd~~tnp~psf~g~~~~~v~f~q~RL~f~~--------~~~v~~Srtgd~~----nF~~~t~~~~~DdD~i~ 388 (785) T protein:vir:94 321 LDWSKRGAGNDDTNPMPSFVDATINDVFFYRNRLGFLS--------GENVIMSRSASYF----AFFPKSVATLSDDDPID 388 (785) T ss_pred cccccccCCCcccCCcceecccccceEEEEeceEEEec--------CCeEEEEccCCcc----cCccccccCCCCCccEE Confidence 00 022678888887753 6789999999964 444332 222344444 Q ss_pred cc---CCCCceeEEEecCcceEEEecCcEEEEEecC--CCceeEeEEecCccccccCceeEEECCeEEEEeCCCeE---- Q lcl|NC_010325. 199 LA---DTNGAIVDGVKLRDSFIIYKEDSVYSMRYIG--GLFIFQFQQLFNDVGILGPNCAVEFDGNHFVVGHGDVY---- 269 (513) Q Consensus 199 l~---d~~G~iv~g~~l~~~~vIf~en~i~~m~y~g--~~~~f~~~~i~~~~G~~~~~siv~~~~~~ffls~~G~y---- 269 (513) +. +....|..+++.+..|+||...+-|.++-.+ +|...++.+.+. .+|-+.=.-+.+|+.++|+++.|=+ T Consensus 389 ~~~~~~~~~~i~~~v~~~~~L~l~T~~~e~~l~~~~~lTP~~~~~~~~s~-~~~~~~~~Pv~vg~~v~f~~~~g~~~~v~ 467 (785) T protein:vir:94 389 VAVSHPRISILKYAVPFSEQLLLWSDEVQFVMTSSGVLTSKSIQLDVGSE-FALGDNARPFAVGRSVFFSAPRGSFTSIK 467 (785) T ss_pred EEecCCcceeeEEEeecCCcEEEEecCcEEEEcCCCcccceeEEEEEEEe-eeccCCCCceEeCCeEEEEecCCCeeEEE Confidence 43 3334577788999999999999999995222 244466666663 3455555577899999999997732 Q ss_pred EE---CC--cccccCC-chhHHHHHHhhcCcchhCCEEEEEecCCCEEEEEEccCCCCCCcccceEEEEec----ccC-- Q lcl|NC_010325. 270 VH---NG--VQKQSVI-DAQVRKFFFSDINPDNYQRTFVLADHVNTEMWVCYSSTRSKPGKHCDRAIIWNW----KEN-- 337 (513) Q Consensus 270 ~~---~G--~~~~~Ig-~~~V~~~~~~~i~~~~~~~~~~~~d~~~~~v~~~~~s~~~~~~~~~d~~lvyd~----~~~-- 337 (513) .+ +- ..+.+.. +..+.+.| +...-.+ ..+=.....+.|+-... +++++|-| ..+ T Consensus 468 r~~~~~~~~d~y~~~dlt~~~~~~~-----~g~~~~~-~a~~~~~~~~~~~~~~~--------g~l~~~~y~~~~~e~~v 533 (785) T protein:vir:94 468 RYFAVADVSDVKDADDTTGHVLSYI-----PNGVFDI-QGTGTENYICVNSTGAY--------NRIYIYKFLFKDSVQLQ 533 (785) T ss_pred eeeeecccccceehhhHHHHHHHhc-----CCCcEEE-EEecCCCcEEEEEEcCC--------CEEEEEEEeecCCceEE Confidence 22 21 1122211 22233332 1111111 22222333455654322 35666665 222 Q ss_pred -eEEEEeccc----eeeeeec----------------ccccccceeec-ccC---------------cccCccc------ Q lcl|NC_010325. 338 -TWSIRDLPN----VLSGAYG----------------IIDPKVSNLWD-DDP---------------NPWDTDT------ 374 (513) Q Consensus 338 -~Ws~~d~~~----~~~~~~g----------------~~~~~~~~~~~-~~~---------------~~~d~d~------ 374 (513) .|+.-+++. .+..+++ ........... ..+ ..++.+. T Consensus 534 ~aW~r~~~~~~~~~~~~~~~~d~~~~vv~r~~g~~~~~ie~~~~~~d~~~~~~~~~lD~~~~~~~~~~~~~~~~~~t~~~ 613 (785) T protein:vir:94 534 ASWSHWEFPKDDKILASASIGSTMFIVRQHQGGVDIEHLKFIKEATDFPSEPYRLHVDSKVSMVIPIGSFNADTYKTTVD 613 (785) T ss_pred EEEEEEEeCCCeEEEEEEEeCCEEEEEEEcCCCEEEEEEEeecccCCCCCcceeEEeeeeeEEEecCcceeccccccccc Confidence 577655432 1111111 00000000000 000 0111110 Q ss_pred --eeccccccccCccceEEEEeecCce------------eeec----ccceeecCccEEEEeecccccC--CC------- Q lcl|NC_010325. 375 --SVWGEGSYNPAKSSMIFSSFQDKKL------------FLFG----NNSTFSGQNFVSTLERSDIYLG--DD------- 427 (513) Q Consensus 375 --~~~~~ds~~~~~~~~~~~~~~~~~~------------~~~~----~~~~~~g~~l~a~~~~~~~~~~--~~------- 427 (513) ..|.. ..-..|+.. ....|+.+ +.+. .....-|.++++.++-..+.+- ++ T Consensus 614 ~~~~~~g-~~~leg~~v--~v~adG~~~~~~~v~~~~~tl~~~g~~~~~~v~vGl~y~~~~~~~~~~~~~~~~~~~~~~~ 690 (785) T protein:vir:94 614 IGAAYGG-NAPSPGRYY--LIDSQGAYLDLGELTSISTVITLNGDWSGRTVFIGRSYLMSYKFSRFLIKIEDDSGTQSED 690 (785) T ss_pred ccccccc-CCccCCeEE--EEeeCCcCccCceEcCCCcEEEecCCCCCceEEEeeeeeEEEeecceeEEecCCCcccccc Confidence 00100 000111111 11222222 1111 1224568888888874444321 11 Q ss_pred -cceEEEeeeeeccCCCeeEEEEe--eeeecC--CCCceEcC----ceeeecCCceEEEeecCCCeEEEEEEccCCCcEE Q lcl|NC_010325. 428 -RMMKTVSAIIPHITGNGTCNIWV--GNAQVQ--GSGIRWKG----PYPYRIGQDYKIDTKHVGRYIALKFDFSSEGDWY 498 (513) Q Consensus 428 -~~~~~i~~~~~~~t~~~~~~~~~--g~~~~~--~~~~~w~~----~~~~~~~~~~~~~~R~~~Ry~~~rl~~~~g~~w~ 498 (513) +| .++.++...+...+.+.+.+ +..+.. -.+-++.. ..+.-+| ...++++...+-..++|+.+.-.+.+ T Consensus 691 ~gr-~~l~r~~~~~~~sg~~~v~v~~~~~~~~~~~~~~~~g~~~~~~~~~~tg-~~~vp~~g~~~~~~v~i~~~~P~P~t 768 (785) T protein:vir:94 691 TGR-LQLRRAWVNYRDTGALRLIVRNGEREFVNTFNGYTLGQQTIGTTNIGDG-QYRFAMNGNALTTSLTLESDYPTPVS 768 (785) T ss_pred ccc-EEEEEEEEEeecccceEEEecCCCccceeeecCcccCcccccccccccc-eEEEEeecccceEEEEEEECCCCceE Confidence 22 24444443333333333332 222110 00111111 1112233 46688888888888999999999999 Q ss_pred EEEEeeEEe-ccccCC Q lcl|NC_010325. 499 FNGYTIEMA-PKAGMR 513 (513) Q Consensus 499 ~~G~~~~~~-~~g~rr 513 (513) +.++++|.. -.=.|| T Consensus 769 vlsi~~eg~y~~r~~~ 784 (785) T protein:vir:94 769 IVGCGWEASYAKKARS 784 (785) T ss_pred EEEEEEEEEEeccccC Confidence 999999882 223555 No 28 >protein:vir:3366 Length: 801 # NCBI annotation: tail tubular protein B # Family: family:all:825 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523337;swissprot:trembl:q8w5u3;genbank:gi:17570828;goa:Q8W5U3;uniprot:Q8W5U3;genbank:GeneID:927453 Probab=98.47 E-value=5.2e-07 Score=55.05 Aligned_cols=483 Identities=15% Similarity=0.138 Sum_probs=229.6 Q ss_pred CcccchhhcCccccccccCcccCCCCcEEEeEEEEEe-CCeeEECCCcceeeecCCCcceeeeee-----eeCCceEEEE Q lcl|NC_010325. 1 MALERQEVKNPTGIVTDIAPADLPLEKWSFGNNVRFK-NGKAQKTLGHTPIFDTAQAPILDMFPF-----IRNNIPYWLL 74 (513) Q Consensus 1 m~~~~~~~~~~~G~~~~~~P~~lp~~a~~~~~Nv~~~-~g~~~~~~g~~~~~~~~~~~~~~~~~~-----~~~g~~~~~v 74 (513) |+..-+.+.+..|.|.-..+.+==++....|.||++. .++++||||..=+.....++-.+..++ ......++++ T Consensus 1 M~~i~~~~~nl~~GvSqq~d~~r~~~q~~~~~N~~~~~~gG~~rRpGt~~va~~~~~~~~~~~~~~~~~~r~~~~~y~l~ 80 (801) T protein:vir:33 1 MALISQSIKNLKGGISQQPDILRFTEQGSVQINGWSSESEGIQKRPPMIHLKTLGTAGYVGAQPYVHLINRDEFEQYFVV 80 (801) T ss_pred CceeEeeccceecceeccchhHhhhhhHhhhhcceeecccCcccCchhHhhhhhcCCCccccceEEEEEEeCCceEEEEE Confidence 9999999999999998655554456888999999999 588999988755533222211121111 1233445555 Q ss_pred EcCceEEEec-CceEEeccc-ccee--eCCCCceeEEeeCCEEEEEeCCCceEE-------------------------- Q lcl|NC_010325. 75 CSEQRLYLAD-GTTIIDVSP-GPYS--ASITNRWSVGSFNGVIFANDGVNPPHH-------------------------- 124 (513) Q Consensus 75 ~~~~kly~~~-~~t~~dis~-~~~~--~~~~~~w~f~~~~~~~ia~ng~d~~q~-------------------------- 124 (513) .+.+.|+-|+ .+....++. .+|- ...-+.-++++.+|+++++|..-+|+- T Consensus 81 ~~~~~irv~~~~G~~~~v~~~~~y~~~~~~~~~l~~~t~aD~~fi~nr~~~p~~~~~~~~~~~~~~~~~~li~v~~~~yg 160 (801) T protein:vir:33 81 FTGEDIKVFDLDGKEYQVRGDRSYVRTANPREDLRMVTVADYTFVTNRKVVVQSNDQSVNLPGFKDQGDALINVRGGQYG 160 (801) T ss_pred EcCCeEEEEccCCcEEEEecCCcceeecCcchheEEEEEcCEEEEeeCCeeecccCCcccccccCCCcceEEEEeecccc Confidence 5555444443 121222221 2221 110111234444444333332111110 Q ss_pred ----E--c---------------------------------------------------------------CC------- Q lcl|NC_010325. 125 ----L--P---------------------------------------------------------------PS------- 128 (513) Q Consensus 125 ----~--~---------------------------------------------------------------~~------- 128 (513) + . ++ T Consensus 161 ~t~~I~i~gs~~~~~~~~~gs~~~~v~~~s~~~~A~~l~~~~~~~~~~~~~~~~~~~w~~~~~~g~~~i~~p~~~~~~~i 240 (801) T protein:vir:33 161 RRLSIEFNGAERAAVQLPDGSQPAHVNEVDGQAIAEKLAAQLRNNLGNPNNDQDPNKWRFNVGPGFIHILAPNNDNVWGL 240 (801) T ss_pred eEEEEEECCcceEEEEeeccccccccccccchhhhhhhhhhhhccCccceeeecCceEEEEecCeEEEEecCCCcccccc Confidence 0 0 00 Q ss_pred -----------------CceecccCCC----------------------------------------------------- Q lcl|NC_010325. 129 -----------------ESTFRVLPNF----------------------------------------------------- 138 (513) Q Consensus 129 -----------------s~~f~~L~g~----------------------------------------------------- 138 (513) ...+.+|+.. T Consensus 241 tt~~g~~~~~~~~~~~~v~~~~~lp~~~~~g~~v~v~~~~~~~~~~y~v~~~~~~~~w~e~~~~g~~~~~~~~tmp~~l~ 320 (801) T protein:vir:33 241 QTKDGYADQLINPVTHYTQSFQKLPINAPDGYIVKIVGDTSKTADQYYVRFDLNRKVWVETIGWNTRTHLHYHTMPWALV 320 (801) T ss_pred cccCCccceeEEEEeecccceeeeeeecCCCcEEEEEecCCCcccceEEEEEcCCcEEEEeeccccceeeeecccceEEE Confidence 0001111000 Q ss_pred ---------------------------cc--cceeeEEEEEcCEEEEEECCcCcccCCceEEEeccCCcccccccccccc Q lcl|NC_010325. 139 ---------------------------PA--NTTFKRLKSFKNFLVGLNATSNSVEMPQMVWWSTSADAGGVPASWDPTD 189 (513) Q Consensus 139 ---------------------------p~--~~ka~~v~~~~~~l~~~g~t~~~~~~p~rv~wS~~~d~~~~P~~Wd~t~ 189 (513) |+ +.....|..|++||++++ |++|+.|..+|.+ +|.... T Consensus 321 ~~~~~tf~~~~~~w~~r~~gd~~tnp~psf~g~~~~~v~f~q~RL~f~~--------~~~v~~Srtgd~~----nF~~~t 388 (801) T protein:vir:33 321 RASDGNFDFKYLEWGARTVGDDTTNPYPSFTGQTINDIFFFRNRLGFLS--------GENIILSRTSKYF----NFFPAS 388 (801) T ss_pred EccCceEEecccCccccccCCccccCcccccCCCceEEEEEcceEEEee--------CCeEEEEecCCcc----cccccc Confidence 00 000123688999998764 6789999999964 444333 Q ss_pred c--ccCcceeccc---CCCCceeEEEecCcceEEEecCcEEEEEecC--CCceeEeEEecCccccccCceeEEECCeEEE Q lcl|NC_010325. 190 P--TKDAGQNTLA---DTNGAIVDGVKLRDSFIIYKEDSVYSMRYIG--GLFIFQFQQLFNDVGILGPNCAVEFDGNHFV 262 (513) Q Consensus 190 ~--t~~a~~~dl~---d~~G~iv~g~~l~~~~vIf~en~i~~m~y~g--~~~~f~~~~i~~~~G~~~~~siv~~~~~~ff 262 (513) . ..+++=.++. +....|..+++....|+||...+-|.++-.+ .|...++.+.+. .||-+.=.=+.+|+.++| T Consensus 389 ~~~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~l~~~~~lTP~~~~~~~~s~-~~~~~~~~Pv~vg~~v~f 467 (801) T protein:vir:33 389 VSNYSDDDPIDVAVSHDRVSTLKYAVPFSEELLLWSDQAQFVLTASDILSSRSVGLNLTTQ-FDVQDRARPHGVGRNVYF 467 (801) T ss_pred ccCCCCCccEEEEecCCcceeeeEEeecCCcEEEEecCcEEEEeCCCcccceeEEEEEEEe-ecccCCCCceEecCeEEE Confidence 2 2233333332 3334466688899999999999999996222 244466666663 356655566889999999 Q ss_pred EeCCCe----E---EECCc--ccccCC-chhHHHHHHhhcCcchhCCEEEEEecCCCEEEEEEccCCCCCCcccceEEEE Q lcl|NC_010325. 263 VGHGDV----Y---VHNGV--QKQSVI-DAQVRKFFFSDINPDNYQRTFVLADHVNTEMWVCYSSTRSKPGKHCDRAIIW 332 (513) Q Consensus 263 ls~~G~----y---~~~G~--~~~~Ig-~~~V~~~~~~~i~~~~~~~~~~~~d~~~~~v~~~~~s~~~~~~~~~d~~lvy 332 (513) +++.|= + .++-. .+.+.. +.-+.+.| +. ..+.-++-+.... ++.+...+. +++++| T Consensus 468 ~~~~g~~~~v~r~~~~~~~~d~y~~~Dlt~~~~~~~-----~~--~~~~~~~~~~~~~-~~~~~~~~~------~~l~~~ 533 (801) T protein:vir:33 468 SSPRASFTSINRYYAVQDVSSVKNAEDMTAHVPNYI-----PN--GVFSISGTTAENF-VAILTSGAP------NRVYIY 533 (801) T ss_pred EecCCCeeEEEEEEeecccccceehhhHHHHHHHhc-----CC--ceEEEEEcCCCCe-EEEEEecCC------CEEEEE Confidence 999983 2 22321 222211 11222221 11 1122233333222 112222111 356776 Q ss_pred ecc----cC---eEEEEecccee--eeee------------------cccccccce-----------eeccc-----Ccc Q lcl|NC_010325. 333 NWK----EN---TWSIRDLPNVL--SGAY------------------GIIDPKVSN-----------LWDDD-----PNP 369 (513) Q Consensus 333 d~~----~~---~Ws~~d~~~~~--~~~~------------------g~~~~~~~~-----------~~~~~-----~~~ 369 (513) -|. .+ .|+.-+.+..+ ..+. +........ ++... ..+ T Consensus 534 ~y~~~~~e~~v~aW~~~~~~g~~~~~~~~~~~d~l~~vv~r~~~~~le~~~~~~~~~d~~~~~~~~~lD~~~~~~~~~~~ 613 (801) T protein:vir:33 534 KFLYIDEEIRQQSWSHWDFGDNVTVFAAQVINSTMTVLMSNEHAVWMGRLHFTKDSIDLPGEPYRLYIDAKRKYTIPAGT 613 (801) T ss_pred EEecCCCceEEEeeEEEEcCCCEEEEEEecCCCEEEEEEEcCCcEEEEEEEEeeccccCCCccceEEeecceEEEecccc Confidence 542 22 78876654321 1110 000000000 00000 011 Q ss_pred cCcccee--cccc----ccccCccceEEEEeecCcee---------------eec----ccceeecCccEEEEeeccccc Q lcl|NC_010325. 370 WDTDTSV--WGEG----SYNPAKSSMIFSSFQDKKLF---------------LFG----NNSTFSGQNFVSTLERSDIYL 424 (513) Q Consensus 370 ~d~d~~~--~~~d----s~~~~~~~~~~~~~~~~~~~---------------~~~----~~~~~~g~~l~a~~~~~~~~~ 424 (513) ++.+... |++. .....|.. +....|+.++ .+. .....-|-++++.++-..+.+ T Consensus 614 ~~~~~~~t~~~~~~~~gl~~~eg~~--v~~~~dG~v~~~~~~~~~~~~~~~l~i~~~~~~~~v~vGl~y~s~~~~~~~~~ 691 (801) T protein:vir:33 614 YNDDTYQTSISLSTIYGMNFTKGRV--SVVFPDGKIVEIDQPINGWSSDPMLRLDGNQEGQVVYIGFNIPFTYTFSKFLI 691 (801) T ss_pred eecCccccccccccccCCccccceE--EEEEeCCceEeeeeccccccCceeEEecCCCCCCEEEEeeeeeEEEEeCceEE Confidence 1111111 1110 00011111 2223333332 111 112456888888888555543 Q ss_pred C----CCcc------eEEEeeeeeccCCCeeEEEEeeee--ec--CCCCceEcCc------eeeecCCceEEEeecCCCe Q lcl|NC_010325. 425 G----DDRM------MKTVSAIIPHITGNGTCNIWVGNA--QV--QGSGIRWKGP------YPYRIGQDYKIDTKHVGRY 484 (513) Q Consensus 425 ~----~~~~------~~~i~~~~~~~t~~~~~~~~~g~~--~~--~~~~~~w~~~------~~~~~~~~~~~~~R~~~Ry 484 (513) - +... +.++.++...+...+.+.+.+... +. ...+..+.++ ...-+| ...++++...+- T Consensus 692 ~~~~~~~~~~~~~~~r~~l~r~~~~~~~tg~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tg-~~~vp~~g~~~~ 770 (801) T protein:vir:33 692 KKTAEDGSTATEDIGRLQLRRAWVNYEDSGAFIIRVNNLSREFIYTMAGARLGSDNLRVGGSNIGTG-QYRFPVVGNAQT 770 (801) T ss_pred eccCCCCceeeeeeccEEEEEEEEEeecCcceEEEECCcccceeeeecccccccccccccccccccc-eEEEEeeccCce Confidence 1 1111 112333333333334444444221 11 1111222111 111122 467888888899 Q ss_pred EEEEEEccCCCcEEEEEEeeEEe-ccccCC Q lcl|NC_010325. 485 IALKFDFSSEGDWYFNGYTIEMA-PKAGMR 513 (513) Q Consensus 485 ~~~rl~~~~g~~w~~~G~~~~~~-~~g~rr 513 (513) ..++|+.+.-.+.++.++++|+. -.=.|| T Consensus 771 ~~v~i~~d~P~P~tvl~i~~eg~y~~r~~~ 800 (801) T protein:vir:33 771 NTVTIESDASTPLNIIGCGWEGNYLRRSSG 800 (801) T ss_pred EEEEEEeCCCCCEEEEEEEEEEEEeccccC Confidence 99999999999999999999983 334444 No 29 >protein:vir:105563 Length: 396 # NCBI annotation: hypothetical protein # Family: family:all:27455 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164316;genbank:gi:56692963;genbank:GeneID:3197174 Probab=98.40 E-value=3.2e-07 Score=56.17 Aligned_cols=273 Identities=11% Similarity=-0.014 Sum_probs=150.6 Q ss_pred CcccchhhcCccccccccCccc-------CCCCcEEEeEEEEEe-CCeeEECCCcceeeecCCCcceeeeeeeeCCceEE Q lcl|NC_010325. 1 MALERQEVKNPTGIVTDIAPAD-------LPLEKWSFGNNVRFK-NGKAQKTLGHTPIFDTAQAPILDMFPFIRNNIPYW 72 (513) Q Consensus 1 m~~~~~~~~~~~G~~~~~~P~~-------lp~~a~~~~~Nv~~~-~g~~~~~~g~~~~~~~~~~~~~~~~~~~~~g~~~~ 72 (513) |+.-- +-+.=.|+.++|+. -|.-+--++.||-++ +|+.+++.++..+..-.=.+ +. .+--.... T Consensus 1 ~~~~~---~~~~~ginnv~~e~~l~~~~~~~~~~~r~a~nvdi~~~G~~~~r~~~tr~~~g~l~~---~~--~~~~~~~~ 72 (396) T protein:vir:10 1 MATTS---LVPLAGINNVAEDAALQRGGESPRLYVRDAVNIDLSPAGKAQLRASVRQVTDQPFRQ---LW--QSPLHGDA 72 (396) T ss_pred Cccee---eeeeecccccccccccccCCCcccceeeeeeeecccCCCchhhhccCcccCCceecc---cc--cCccccce Confidence 55432 22344456666665 566778889999988 68888898988875432111 11 11111233 Q ss_pred EEEcCceEEEecCceEEeccccceeeCCCCceeEEeeCCEEEEEeCCCceEEEcCCCce--------------------- Q lcl|NC_010325. 73 LLCSEQRLYLADGTTIIDVSPGPYSASITNRWSVGSFNGVIFANDGVNPPHHLPPSEST--------------------- 131 (513) Q Consensus 73 ~v~~~~kly~~~~~t~~dis~~~~~~~~~~~w~f~~~~~~~ia~ng~d~~q~~~~~s~~--------------------- 131 (513) |..+.+.|++++..+|+-+... ....+.-|+ .+++|.++.++ ...|++.++.++. T Consensus 73 ~~~~~~tl~~~~~~~w~~~~~v--~v~~~pva~-d~~~~Rvy~t~-~~~p~~~~~~~~y~L~vp~P~~a~~~a~~Gsl~~ 148 (396) T protein:vir:10 73 FGALGDQWGKVDPHSWTFEPLA--QIGEGDLSH-EVLNNRVCVAG-TAGIFTYDGAQAERLTLDTPAPPLLVAGAGSLSQ 148 (396) T ss_pred eeeCCceEEEEeCCeEEEEeee--eeccCchhc-cccCCeEEEEc-CCCceeeeCCcceecCcCCCcccccccccCccCC Confidence 4456888888888877554321 222222233 55665555554 2222332221100 Q ss_pred --------ecccCCC----------------------c----ccc----------------------------------- Q lcl|NC_010325. 132 --------FRVLPNF----------------------P----ANT----------------------------------- 142 (513) Q Consensus 132 --------f~~L~g~----------------------p----~~~----------------------------------- 142 (513) |-+..|. | ... T Consensus 149 ~~~~Y~~t~V~~~gEEs~p~~~S~~v~~~gg~~vtl~~~~~~~i~~~RiYrS~~~G~~~~l~aE~~a~~~s~vlPs~~w~ 228 (396) T protein:vir:10 149 GTYGAAVAWLRGPQESAPSLIAFAEVTDAGALEVTFPLCLDASVTGARLYLTRANGGELLLAGDYPLGAATVILPTLPEL 228 (396) T ss_pred ceEEEEEEEEecCCCcCcccccccccCCCCCcEEEEEcccCCCcceEEEEEeCCChhhhhheehhccceeeeeeecCCCC Confidence 0000000 0 000 Q ss_pred -------------eeeEEEEEcCEEEEEECCcCcccCCceEEEeccCCcccccccccccccccCcceecccCCCCceeEE Q lcl|NC_010325. 143 -------------TFKRLKSFKNFLVGLNATSNSVEMPQMVWWSTSADAGGVPASWDPTDPTKDAGQNTLADTNGAIVDG 209 (513) Q Consensus 143 -------------ka~~v~~~~~~l~~~g~t~~~~~~p~rv~wS~~~d~~~~P~~Wd~t~~t~~a~~~dl~d~~G~iv~g 209 (513) ...++..|.+||+++ ..|-|+||-..- |--|++- ++|.. +.+.|+.. T Consensus 229 gpP~~~~gL~pmP~G~~~A~faGRi~~A--------~Gn~V~FSEp~~----Ph~~~~~-----~~~~~---~~~~Iv~l 288 (396) T protein:vir:10 229 GRPAQFRHLSPMPTGKHLAYWRGRLLIA--------RANVLRFSEALA----YHLHDER-----YGFVQ---MPQRITFV 288 (396) T ss_pred CCCccccccccCchhHhhhhhcceEEEE--------eCCEEEEecCCC----Cceecch-----hccCC---CCCceEEE Confidence 111222222222211 235678877654 3333321 23432 44679999 Q ss_pred EecCcceEEEecCcEEEEEecCCCceeEeEEecCc-----------cccccCceeEEECCeEEEEeCCCeEEECC-cccc Q lcl|NC_010325. 210 VKLRDSFIIYKEDSVYSMRYIGGLFIFQFQQLFND-----------VGILGPNCAVEFDGNHFVVGHGDVYVHNG-VQKQ 277 (513) Q Consensus 210 ~~l~~~~vIf~en~i~~m~y~g~~~~f~~~~i~~~-----------~G~~~~~siv~~~~~~ffls~~G~y~~~G-~~~~ 277 (513) ++.++.++|-.+..+|.+ .+.+|.-++++++... -+|++.+|++..+.-+.|.+++|++.-.+ +... T Consensus 289 apv~~gL~Vgt~~~~y~~-~G~dP~sms~~~l~~~~pvp~S~v~~p~~~~s~rs~~~~~~~~lwas~dGl~~g~~~G~v~ 367 (396) T protein:vir:10 289 QPVDGGIWVGQVDHVAFL-DGADPASLSVSRRASRAPVPGSAVLVPAEVVGTNASPDGSPVAVWLAENGYVMGTSSGAIA 367 (396) T ss_pred EEecCeEEEEEcCcEEEE-EcCChhHcceeecccCCCcccchhcccchhhhcccccccCcEEEEccCCcEEEEcCCceee Confidence 999999999999999998 4667888999998532 12578899999999999999999997643 3433 Q ss_pred cCCchhHHHHHHhhcCcchhCCEEEEEecCCCEEEEEEc Q lcl|NC_010325. 278 SVIDAQVRKFFFSDINPDNYQRTFVLADHVNTEMWVCYS 316 (513) Q Consensus 278 ~Ig~~~V~~~~~~~i~~~~~~~~~~~~d~~~~~v~~~~~ 316 (513) .. .+++ |.+.. ...+.+-....||+..|. T Consensus 368 ~l-~~~~-------i~p~~--~~A~~~~~~drRy~~~~~ 396 (396) T protein:vir:10 368 EV-HAGV-------LAGIT--GRAGTSVVFDRRLLTAVS 396 (396) T ss_pred ee-cccc-------cCCCc--ccceEEEeecCeEEEEeC Confidence 33 3333 22221 111233334456666654 No 30 >protein:vir:105647 Length: 800 # NCBI annotation: putative tail tubular B protein # Family: family:all:825 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425011;genbank:gi:83571759;uniprot:Q2WC41;genbank:GeneID:3837288 Probab=98.35 E-value=1.1e-06 Score=53.16 Aligned_cols=482 Identities=13% Similarity=0.080 Sum_probs=223.9 Q ss_pred CcccchhhcCccccccccCcccCCCCcEEEeEEEEEe-CCeeEECCCcceeeecCCCcceeeeeee--e-CCce--EEEE Q lcl|NC_010325. 1 MALERQEVKNPTGIVTDIAPADLPLEKWSFGNNVRFK-NGKAQKTLGHTPIFDTAQAPILDMFPFI--R-NNIP--YWLL 74 (513) Q Consensus 1 m~~~~~~~~~~~G~~~~~~P~~lp~~a~~~~~Nv~~~-~g~~~~~~g~~~~~~~~~~~~~~~~~~~--~-~g~~--~~~v 74 (513) |+++ +.+.+..|.|.-..|..-=++....|.||+++ .++++||||..=+....++...++..+. . ++.. ++++ T Consensus 1 ~~v~-~s~~nl~~GvSqQ~d~~R~~~q~~~~~N~~~~~~gGl~rRpGt~fva~l~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (800) T protein:vir:10 1 MEVQ-GSLGRQIQGISQQPPAVRLDGQCTTMVNMVPDVVNGTQSRMGTTHIAKLLDEGTDNMATHHYRRGEGDEEYFFTL 79 (800) T ss_pred CeEE-eecchhcccccccchhHhhhhhhhhhhcceeeeccCcccCCcceEEEeecCCCCCccEEEEEecCCccceEEEEE Confidence 8876 67888899998666665567889999999999 5889999997555332233322222221 1 1111 1222 Q ss_pred EcCc--eEEEecCceEEecc-ccc----eeeCCC--CceeEEeeCCEEEEEeCCCceEEE-------------------- Q lcl|NC_010325. 75 CSEQ--RLYLADGTTIIDVS-PGP----YSASIT--NRWSVGSFNGVIFANDGVNPPHHL-------------------- 125 (513) Q Consensus 75 ~~~~--kly~~~~~t~~dis-~~~----~~~~~~--~~w~f~~~~~~~ia~ng~d~~q~~-------------------- 125 (513) .+.+ +++..++.. ..+. ..+ +...++ +.-++++.+|+++++|..-+|+.. T Consensus 80 ~~g~~~rv~~~~G~~-~~v~~~~~~~~~~~~~~~~~~~l~~~tvaD~tfi~n~~~~~~~~~~~~~~~~~~~~~~vr~g~y 158 (800) T protein:vir:10 80 KKGQVPEIFDKHGRK-CNVISQDAPMTYLSEVVNPREDVQFMTIADVTFMLNRRKVVKVSNRKSPKVGDKAIVFCAYGQY 158 (800) T ss_pred EcCCeEEEEecCCcE-EEeecCCcceeeeeccCCchhhEEEEEEcCEEEEecCcccccccccCCCCCCceEEEEEecccc Confidence 2333 333333221 1111 111 000111 113344444444444322222110 Q ss_pred ----------------------------------------------------------------------------cC-- Q lcl|NC_010325. 126 ----------------------------------------------------------------------------PP-- 127 (513) Q Consensus 126 ----------------------------------------------------------------------------~~-- 127 (513) ++ T Consensus 159 ~~~y~i~i~g~~~~~~~t~~~~~~~~~~~~s~~~i~~~L~~~l~~~~~~~~~t~~~~g~~i~i~~~~~~~~~~~~~~~~~ 238 (800) T protein:vir:10 159 GTSYSIIINGTTAASFKTPDGGSAEHVEQIRTERITSELYSKLQQWSGVNDYEIQRDGTSIFIERRDGKSFTVTTTDGAK 238 (800) T ss_pred ccceeEEeccceEEEEEecCCCcccccccccHHHHHHHHHhhhhhcCcccceEEEEcCcEEEEEEecCCceEEEEeecCC Confidence 00 Q ss_pred ---------CCceecccCC------------------------------------------------------------- Q lcl|NC_010325. 128 ---------SESTFRVLPN------------------------------------------------------------- 137 (513) Q Consensus 128 ---------~s~~f~~L~g------------------------------------------------------------- 137 (513) ..+.+.+|+. T Consensus 239 ~~~~~~~~~~v~~~~~Lp~~~~~g~~~~i~~~~~~~~~~y~~~~~~~~~~~~~w~e~~~~~~~~~~~~~tmp~~lv~~~~ 318 (800) T protein:vir:10 239 GKDLVAIKNKVSSTDLLPSRAPAGYKVQVWPTGSKPESRYWLQAEPKEGNLVSWKETIAADVLLGFDKGTMPYIIERTGI 318 (800) T ss_pred cceEEEEEeeccceeeccccCCCCceEEEEcCCCCCCceeEEEEEeccccceEEEeecccCceeeeecccccEEEEEeee Confidence 0000111110 Q ss_pred --------------------------Ccccc------eeeEEEEEcCEEEEEECCcCcccCCceEEEeccCCcccccccc Q lcl|NC_010325. 138 --------------------------FPANT------TFKRLKSFKNFLVGLNATSNSVEMPQMVWWSTSADAGGVPASW 185 (513) Q Consensus 138 --------------------------~p~~~------ka~~v~~~~~~l~~~g~t~~~~~~p~rv~wS~~~d~~~~P~~W 185 (513) +|+.+ .-..|..|++||++++ |+.|+.|..+|.+ ++ T Consensus 319 ~~~~~~~~~~~~~w~~r~~gd~~tnp~psf~~~~~~~~i~~v~f~q~RL~f~~--------~~~v~~Srtgd~~----nF 386 (800) T protein:vir:10 319 IDGIAQFKIRQGDWEDRKVGDDLTNPMPSFIDEEVPQTIGGMFMVQNRLCFTA--------GEAVIASRTSYFF----DF 386 (800) T ss_pred eecceeEEEEeccccccccCCCCCCCCchhcCCCCCCCceeEEEEeeeEEEee--------CCeEEEEccCCcc----cc Confidence 00000 0112678999998754 6789999999964 44 Q ss_pred cccc--cccCcceeccc---CCCCceeEEEecCcceEEEecCcEEEEEecC--CCceeEeEEecCccccccCceeEEECC Q lcl|NC_010325. 186 DPTD--PTKDAGQNTLA---DTNGAIVDGVKLRDSFIIYKEDSVYSMRYIG--GLFIFQFQQLFNDVGILGPNCAVEFDG 258 (513) Q Consensus 186 d~t~--~t~~a~~~dl~---d~~G~iv~g~~l~~~~vIf~en~i~~m~y~g--~~~~f~~~~i~~~~G~~~~~siv~~~~ 258 (513) .... ...+++=.++. +....|..+++....|+||...+-|.++-.+ .|...++++.+. .+|-+.=.=+.+|+ T Consensus 387 ~~~t~~~~~DdD~I~~~~ss~~~~~i~~~v~~~~~L~l~T~g~q~~l~g~~~lTP~~~~i~~~s~-~~~~~~~~Pv~vG~ 465 (800) T protein:vir:10 387 FRYTVISALATDPFDIFSDASEVYQLKHAVTLDGATVLFSDKSQFILPGDKPLEKSNALLKPVTT-FEVNNKVKPVVTGE 465 (800) T ss_pred ccccccCCCCCccEEEEEcCCcceeeeeEeecCCcEEEEecCcEEEEeCCCcccceeEEEEEEEe-eeccCCCCceEeCC Confidence 3332 22233333442 3345577788899999999999999996322 244466666663 34555666688999 Q ss_pred eEEEEeCCC----eE--EECCc--ccccCC-chhHHHHHHhhcCcchhCCEEEEEecCCCEEEEEEccCCCCCCcccceE Q lcl|NC_010325. 259 NHFVVGHGD----VY--VHNGV--QKQSVI-DAQVRKFFFSDINPDNYQRTFVLADHVNTEMWVCYSSTRSKPGKHCDRA 329 (513) Q Consensus 259 ~~ffls~~G----~y--~~~G~--~~~~Ig-~~~V~~~~~~~i~~~~~~~~~~~~d~~~~~v~~~~~s~~~~~~~~~d~~ 329 (513) .++|+++.| ++ .++-. .+.... +..+.+.| +.....+ +..-..+..+||+-... +++ T Consensus 466 ~v~Fv~~~g~~s~vre~~~~~~~d~~~a~DlT~~~~hl~-----~~~v~~~-~~~~~~~~~v~~~~~~~--------~~l 531 (800) T protein:vir:10 466 SVMFATNDGSYSGVREFYTDSYSDTKKAQAITSHVNKLI-----EGNITNM-AASTNVNRLLVTTDKYR--------NII 531 (800) T ss_pred eEEEecCCCCeeEEEEEeeeecccceehhhHHhHHHHhc-----CCceEEE-EEeCCCCeEEEEEEcCC--------CeE Confidence 999999987 43 23211 121110 11222222 1111111 12222333455553221 356 Q ss_pred EEEecc-----c--CeEEEEeccc--e--eeeee----------------cccccccceeecccCcccCcc--------- Q lcl|NC_010325. 330 IIWNWK-----E--NTWSIRDLPN--V--LSGAY----------------GIIDPKVSNLWDDDPNPWDTD--------- 373 (513) Q Consensus 330 lvyd~~-----~--~~Ws~~d~~~--~--~~~~~----------------g~~~~~~~~~~~~~~~~~d~d--------- 373 (513) ++|-|. . +.|+.-+++. . +..+. +....... .....+.-.-+| T Consensus 532 ~~~~yl~~~~e~~~~aW~~w~~~~~~~~~~~~~~~d~l~~iv~r~~~~~ier~~~~~~-~~~~~~~~~~lD~~~~~~~~~ 610 (800) T protein:vir:10 532 YCYDWLWQGTDRVQSAWHVWEWPMGTKVRGMFYSGELLYLLLERGDGVYLEKMDMGDA-LTYGLNDRIRMDRQAELIFKH 610 (800) T ss_pred EEEEEeecCCceEEEEEEEEEcCCCcEEEEEEEeCCeEEEEEECCCcEEEEEEecccC-ccccccceeeeecceeecccc Confidence 666652 1 2677555432 1 10000 11100000 000000000000 Q ss_pred ---ce-------ecc----------ccc--cccCccceEE--EEeecCceeee-------cccceeecCccEEEEeeccc Q lcl|NC_010325. 374 ---TS-------VWG----------EGS--YNPAKSSMIF--SSFQDKKLFLF-------GNNSTFSGQNFVSTLERSDI 422 (513) Q Consensus 374 ---~~-------~~~----------~ds--~~~~~~~~~~--~~~~~~~~~~~-------~~~~~~~g~~l~a~~~~~~~ 422 (513) +. .|. .++ ..+.++.... ......-.+.+ ......-|-++++.++...+ T Consensus 611 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~v~~~~~~~~g~~~~~~~~~~g~~~~~~v~VGl~Y~s~~~~~~~ 690 (800) T protein:vir:10 611 FKAEDEWISEPLPWTPTNPELLDCILIEGWDSYIGGSFLFKYKPSDNTLSTTFDMHDDNHVKAKVIVGQIYPQEFEPTPV 690 (800) T ss_pred cccCcceEEEeccccccCCcceEEeeeccceeecCceeEEEEEecCCceEeeeeecCCCcccceEEEeeeeeEEEeecce Confidence 00 000 000 0001111100 00000000100 11235668888888886665 Q ss_pred ccCC---C---cceEEEeeeeeccCCCeeEEEEeeee------------ecCCCCceEcCceeeecCCceEEEeecCCCe Q lcl|NC_010325. 423 YLGD---D---RMMKTVSAIIPHITGNGTCNIWVGNA------------QVQGSGIRWKGPYPYRIGQDYKIDTKHVGRY 484 (513) Q Consensus 423 ~~~~---~---~~~~~i~~~~~~~t~~~~~~~~~g~~------------~~~~~~~~w~~~~~~~~~~~~~~~~R~~~Ry 484 (513) .+.+ . .-+.++.++.......+.+.+.+... +..+.+..-.+..+..+| ...++++..++- T Consensus 691 ~i~~~~g~~~~~~r~~i~r~~~~~~~sg~~~~~v~~~~~~~~~~~~~~~~~~g~~~~~~g~~~~~tg-~~~vp~~g~~~~ 769 (800) T protein:vir:10 691 VIRDRQDRVSYIDVPVVGLVHLNLDMYPDFSVEVKNVKSGKVRRVLASNRIGGALNNTVGYVEPREG-VFRFPLRAKSTD 769 (800) T ss_pred EEEcCCCcccccCCeEEEEEEEEeecCceEEEEeccCcccceeEEccCCeeccccccccCcccccCc-eEEEEEeccCce Confidence 4421 1 11234555544444444444432211 111111100111122233 467899999999 Q ss_pred EEEEEEccCCCcEEEEEEeeEE-eccccCC Q lcl|NC_010325. 485 IALKFDFSSEGDWYFNGYTIEM-APKAGMR 513 (513) Q Consensus 485 ~~~rl~~~~g~~w~~~G~~~~~-~~~g~rr 513 (513) ..++|+.+.-.+.++.++++|. .-.=.|| T Consensus 770 ~~v~i~~d~P~P~tvlai~~eg~y~~r~~r 799 (800) T protein:vir:10 770 AVYRIIVESPHTFQLRDIEWEGSYNPTKRR 799 (800) T ss_pred eEEEEEECCCCcEEEEEEEEEEEeeccccc Confidence 9999999999999999999998 3344555 No 31 >protein:vir:1543 Length: 801 # NCBI annotation: tail tubular protein B # Family: family:all:825 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052111;swissprot:trembl:q9t105;genbank:gi:9634037;uniprot:Q9T105;genbank:GeneID:1262408 Probab=98.35 E-value=1.1e-06 Score=53.15 Aligned_cols=483 Identities=13% Similarity=0.127 Sum_probs=226.0 Q ss_pred CcccchhhcCccccccccCcccCCCCcEEEeEEEEEe-CCeeEECCCcceeeecCCCcceee--ee--e-eeCCceEEEE Q lcl|NC_010325. 1 MALERQEVKNPTGIVTDIAPADLPLEKWSFGNNVRFK-NGKAQKTLGHTPIFDTAQAPILDM--FP--F-IRNNIPYWLL 74 (513) Q Consensus 1 m~~~~~~~~~~~G~~~~~~P~~lp~~a~~~~~Nv~~~-~g~~~~~~g~~~~~~~~~~~~~~~--~~--~-~~~g~~~~~v 74 (513) |+...+.+.+..|.|.-..+.+-=++....|.||++. .|+++||+|..=+...-..+..+. +. + ......++++ T Consensus 1 M~~i~~s~~n~~~GvSqq~d~~r~~~q~~~~~N~~~~~~gGl~rRpGt~~va~~~~~~~~~~~~~~~~~~~~~~e~y~l~ 80 (801) T protein:vir:15 1 MALISQSIKNLKGGISQQPDILRFAEQGSVQINGWSSESEGLQKRPPMIHLKTLGPAGYVGAQPYVHLINRDEFEQYFVV 80 (801) T ss_pred CceeeeecchhhcceecCcchHhhhhhHhhhhcceeccccCcccCCchheeeeecCCCCcccceeEEEEEeCCceEEEEE Confidence 9999999999999998765555567888999999999 588999998755543322222221 11 1 1334455555 Q ss_pred EcCceEEEec-CceEEeccc-cceeeCC--CCceeEEeeCCEEEEEeCCCceEE-------------------------- Q lcl|NC_010325. 75 CSEQRLYLAD-GTTIIDVSP-GPYSASI--TNRWSVGSFNGVIFANDGVNPPHH-------------------------- 124 (513) Q Consensus 75 ~~~~kly~~~-~~t~~dis~-~~~~~~~--~~~w~f~~~~~~~ia~ng~d~~q~-------------------------- 124 (513) .+.+.|+-|+ ++....++. .+|-... -+.-++++..|+++++|..-+|+. T Consensus 81 ~~~~~irv~~~~G~~~~v~~~~~y~~~~~~~~~l~~~~~aD~~fi~nr~~~~~~~~~~~~~~~~~~~~~alv~v~~~~yg 160 (801) T protein:vir:15 81 FTGEDIKVFDLDGKEYQVRGDRSYVRTANPREDLRMITVADYTFVTNRKVVVQSNDQSVNLPGFKDQGDALINVRGGQYG 160 (801) T ss_pred EcCCeEEEEccCCcEEEEecCCccccccCchhheeEEEEcCEEEEeeCCeeeecccCccccCccCCCCceEEEeeeccCc Confidence 5555444443 121111111 1221000 001223333333222221111110 Q ss_pred ---------------------------------------------------------E---------------------- Q lcl|NC_010325. 125 ---------------------------------------------------------L---------------------- 125 (513) Q Consensus 125 ---------------------------------------------------------~---------------------- 125 (513) | T Consensus 161 ~t~~I~i~gs~~~~~t~~~gs~~~~~~~~s~~~ia~~l~~~~~~~~p~~~~~~~~~~w~~~~~~g~~~i~a~~~~~~~~~ 240 (801) T protein:vir:15 161 RRLSIEFNGAERAAVQLPDGSQPAHVNEVDGQAIAEKLAAQLRNNLGNPNNDQDPNKWRFNVGPGFIHILAPNNDNVWGL 240 (801) T ss_pred eeEEEEeCCcceEEEEeccCcccchhhhcceeechHHHhhhhhhccCccceeccCccEEEEecCcEEEEeCCCCccccee Confidence 0 Q ss_pred --------------cCCCceecccCC---------------C-------------------------------------- Q lcl|NC_010325. 126 --------------PPSESTFRVLPN---------------F-------------------------------------- 138 (513) Q Consensus 126 --------------~~~s~~f~~L~g---------------~-------------------------------------- 138 (513) ......+++|+. . T Consensus 241 ~t~dg~~~~~~~~~~~~v~~~~~lp~~~~~G~~v~v~~~~~~~~~~y~v~~~~~~~~w~E~a~~g~~~~~~~~tmp~~lv 320 (801) T protein:vir:15 241 QTKDGYADQLINPVTHYTQSFQKLPINAPDGYIVKIVGDTSKTADQYYVRFDLNRKVWVETIGWNTRTHLYYHTMPWALV 320 (801) T ss_pred eeccccCceeeeEEeecccceeeeeeecCCCcEEEEEecCCCccceEEEEEEcCCeeEEeecccccceeeeccccceEEE Confidence 000000111110 0 Q ss_pred ---------------------------cc--cceeeEEEEEcCEEEEEECCcCcccCCceEEEeccCCcccccccccccc Q lcl|NC_010325. 139 ---------------------------PA--NTTFKRLKSFKNFLVGLNATSNSVEMPQMVWWSTSADAGGVPASWDPTD 189 (513) Q Consensus 139 ---------------------------p~--~~ka~~v~~~~~~l~~~g~t~~~~~~p~rv~wS~~~d~~~~P~~Wd~t~ 189 (513) |+ +..-..|..|++||++++ |++|+.|..+|.+ ++.... T Consensus 321 ~~~~~~~~~~~~~w~~r~~gd~~tnp~psf~g~~~~~v~f~q~RL~f~~--------~~~v~~Srtgd~~----nF~~~t 388 (801) T protein:vir:15 321 RASDGNFDFKVLEWGARTVGDDTTNPYPSFTGQTINDIFFFRNRLGFLS--------GENIILSRTSKYF----NFFPAS 388 (801) T ss_pred eeccceEEEeccccccccCCccccCCcccccCCCceEEEEEcceEEEee--------CCeEEEEecCCcc----cccccc Confidence 00 000123789999998864 6889999999964 444433 Q ss_pred c--ccCcceeccc---CCCCceeEEEecCcceEEEecCcEEEEEecC--CCceeEeEEecCccccccCceeEEECCeEEE Q lcl|NC_010325. 190 P--TKDAGQNTLA---DTNGAIVDGVKLRDSFIIYKEDSVYSMRYIG--GLFIFQFQQLFNDVGILGPNCAVEFDGNHFV 262 (513) Q Consensus 190 ~--t~~a~~~dl~---d~~G~iv~g~~l~~~~vIf~en~i~~m~y~g--~~~~f~~~~i~~~~G~~~~~siv~~~~~~ff 262 (513) . ..+++=.++. +....|..+++....|+||...+-|.++-.+ +|.-.++.+.+. .+|-+.=.=+.+|+.++| T Consensus 389 ~~~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~i~t~~~q~~ls~~~~lTP~~~~~~~~s~-~~~~~~~~Pv~vg~~v~f 467 (801) T protein:vir:15 389 VSNYSDDDPIDVAVSHNRVSTLKYAVPFSEELLLWSDQAQFVLTASGILSSRSVELNLTTQ-FDVQDRARPHGVGRNVYF 467 (801) T ss_pred ccCCCCCccEEEEecCCcceeeEEEeecCCcEEEEecCcEEEEcCCCcccceeEEEEEEEe-eeccCCCCceEeCCeEEE Confidence 2 1233333332 3334466688899999999999999996322 244466666663 345555566789999999 Q ss_pred EeCCCeE-------EECC--cccccCC-chhHHHHHHhhcCcchhCCEEEEEecCCCEEEEEEccCCCCCCcccceEEEE Q lcl|NC_010325. 263 VGHGDVY-------VHNG--VQKQSVI-DAQVRKFFFSDINPDNYQRTFVLADHVNTEMWVCYSSTRSKPGKHCDRAIIW 332 (513) Q Consensus 263 ls~~G~y-------~~~G--~~~~~Ig-~~~V~~~~~~~i~~~~~~~~~~~~d~~~~~v~~~~~s~~~~~~~~~d~~lvy 332 (513) +++.|=| .++- ..+.+.. +.-+.+.| +.....+ +..-..+..++|+-... +++++| T Consensus 468 ~~~~g~~~~~~r~~~~~~~~d~y~a~Dlt~~~~hl~-----~~~v~~~-~~~~~~~~~~~~~~~~~--------~~l~~~ 533 (801) T protein:vir:15 468 ASPRASFTSINRYYAVQDVSSVKNAEDMTAHVPNYI-----PNGVFSI-SGTTAENFAAILTSGAP--------NRVYIY 533 (801) T ss_pred EecCCCeeEEEEEEeecccccceehhhHHHHHHHhc-----CCceEEE-EEeCCCCcEEEEEEcCC--------CEEEEE Confidence 9998732 2232 1222211 11222221 1111111 11112233344443221 357776 Q ss_pred ec-----cc--CeEEEEeccce----eeeeec-----ccccccce----------------------eecccC-----cc Q lcl|NC_010325. 333 NW-----KE--NTWSIRDLPNV----LSGAYG-----IIDPKVSN----------------------LWDDDP-----NP 369 (513) Q Consensus 333 d~-----~~--~~Ws~~d~~~~----~~~~~g-----~~~~~~~~----------------------~~~~~~-----~~ 369 (513) -| +. ..|+.-+.+.. |..+.+ +..-.... +++... .+ T Consensus 534 ~y~~~~~e~~v~aW~~~~~~g~~~~~~~~~~~d~l~~~v~r~~~~~~~r~~~~~~~~~~~~~~~~~~lD~~~~~~~~~~t 613 (801) T protein:vir:15 534 KFLYIDEEIRQQSWSHWDFGDNVTVFAAQVINSTMTVLMGNEHAVWMGRLHFTKNSIDIPGEPYRLYIDAKRKYTIPAGT 613 (801) T ss_pred EEecCCCceEEEeeEEEEcCCCEEEEEEEecCCEEEEEEEecCcEEEEEEEEccccccCCCcceeeeeeeeeeEeeccce Confidence 65 22 27887666442 111110 00000000 000000 00 Q ss_pred cCccceecccc------ccccCccceEEEEeecCcee---------------ee----cccceeecCccEEEEeeccccc Q lcl|NC_010325. 370 WDTDTSVWGEG------SYNPAKSSMIFSSFQDKKLF---------------LF----GNNSTFSGQNFVSTLERSDIYL 424 (513) Q Consensus 370 ~d~d~~~~~~d------s~~~~~~~~~~~~~~~~~~~---------------~~----~~~~~~~g~~l~a~~~~~~~~~ 424 (513) +..++....+. .....|.. .....|+.+. .+ ......-|-++++.++...+.+ T Consensus 614 ~~~~~~~~~~~~~~~~gl~~l~g~~--v~v~~dG~~~~~~~~~~g~~~~~~~~i~~~~~~~~v~vGl~y~~~~~~~~~~~ 691 (801) T protein:vir:15 614 YNDDTYQTSISLATIYGMNFTKGRV--SVVFPDGKIIEVDQPINGWSSDPVLRLDGNQEGQVVYIGFNIPFTYTFSKFLI 691 (801) T ss_pred eccCceecccccccccccccccceE--EEEEeCCceeeeeeecCcccCcceEEEcCCCCCcEEEEeeeeeEEEEecceEE Confidence 00111000000 00001111 1112222211 11 1122455778888888655543 Q ss_pred C----CCcc------eEEEeeeeeccCCCeeEEEEeee--eec--CCCCceEcCc------eeeecCCceEEEeecCCCe Q lcl|NC_010325. 425 G----DDRM------MKTVSAIIPHITGNGTCNIWVGN--AQV--QGSGIRWKGP------YPYRIGQDYKIDTKHVGRY 484 (513) Q Consensus 425 ~----~~~~------~~~i~~~~~~~t~~~~~~~~~g~--~~~--~~~~~~w~~~------~~~~~~~~~~~~~R~~~Ry 484 (513) - +... +.++.++.......+.+.+.+.. ++. ...+..+.+. ...-+| ...++++...+- T Consensus 692 ~~~~~~~~~~~~~~~rl~l~r~~~~~~~tg~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tg-~~~vp~~g~~~~ 770 (801) T protein:vir:15 692 KKTAEDGSTATEDIGRLQLRRAWVNYEDSGAFTIRVNNLSREFIYTMAGARLGSDNLRVGRSNIGTG-QYRFPVVGNAQT 770 (801) T ss_pred eccCCCCCceeeeeccEEEEEEEEEeccCcceEEEECCcccccceeecCcccccccccccccccccc-eEEEEEeecCce Confidence 2 1111 11233333323333444443322 111 1111222111 111122 356788888888 Q ss_pred EEEEEEccCCCcEEEEEEeeEEe-ccccCC Q lcl|NC_010325. 485 IALKFDFSSEGDWYFNGYTIEMA-PKAGMR 513 (513) Q Consensus 485 ~~~rl~~~~g~~w~~~G~~~~~~-~~g~rr 513 (513) ..++|+.+.-.+.++.++++|+. -.=.|| T Consensus 771 ~~v~i~~d~P~P~tvlsi~~e~~y~~r~~~ 800 (801) T protein:vir:15 771 NLVTIESDASTPLNIIGCGWEGNYLRRSSG 800 (801) T ss_pred EEEEEEECCCCcEEEEEEEEEEEEeccccC Confidence 89999999999999999999983 334444 No 32 >protein:vir:78957 Length: 826 # NCBI annotation: putative tail tubular protein B # Family: family:all:825 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522826;genbank:gi:158345061;genbank:GeneID:5687447 Probab=98.20 E-value=2.7e-06 Score=51.11 Aligned_cols=483 Identities=11% Similarity=0.061 Sum_probs=229.8 Q ss_pred CcccchhhcCccccccccCcccCCCCcEEEeEEEEEe-CCeeEECCCcceeeecCCC--cceeeeeee----eCCceEEE Q lcl|NC_010325. 1 MALERQEVKNPTGIVTDIAPADLPLEKWSFGNNVRFK-NGKAQKTLGHTPIFDTAQA--PILDMFPFI----RNNIPYWL 73 (513) Q Consensus 1 m~~~~~~~~~~~G~~~~~~P~~lp~~a~~~~~Nv~~~-~g~~~~~~g~~~~~~~~~~--~~~~~~~~~----~~g~~~~~ 73 (513) |+..-+.+.+..|.|.-..+..==++....|.||.+. .+++.||+|..=+.....+ +....+.+. +....+++ T Consensus 1 M~~i~~~~~nl~gGvSqq~d~~r~~~q~~~~~N~~~~~~gG~~rRpgt~~va~~~~~~~~~~~~f~~~~~r~s~e~~~~l 80 (826) T protein:vir:78 1 MSYKQSAYPNLLMGVSQQVAFERLPGQLSEQINMVSDPVSGLRRRSGIELMASLLHTDQPWPRPYLYHTNLGGRSIAMLV 80 (826) T ss_pred CcceeeecchhccceecccchHhhhhhhhhhhcceeccccccccCCchHhhhhhccCCcCCceeEEEEeccCCcceEEEE Confidence 9999999999999998655544456788999999999 5889999887666433221 112222322 22334455 Q ss_pred EEcCceEEEe--cCceEEecc---ccceeeCCCCceeEEeeCCEEEEEeCCCceEEE----------------------- Q lcl|NC_010325. 74 LCSEQRLYLA--DGTTIIDVS---PGPYSASITNRWSVGSFNGVIFANDGVNPPHHL----------------------- 125 (513) Q Consensus 74 v~~~~kly~~--~~~t~~dis---~~~~~~~~~~~w~f~~~~~~~ia~ng~d~~q~~----------------------- 125 (513) ..+.+.|+-| .++...... ..+|++.....-++++.+|+++++|..-+|+.- T Consensus 81 ~~~~g~irv~~~~~g~~~~~~~~~~~y~~~~~~~~l~~~t~aD~~fi~n~~~~p~~~~~~~~~~~~~~~~~~~v~~g~y~ 160 (826) T protein:vir:78 81 AQHRGELYLFDEKDGRLLMGQPLVHDYLKASDYRQLRAATVADDLFIANLEVRPEADKADVLGVDPSKTGWLYIKAGQYS 160 (826) T ss_pred EEcCCcEEEEECCCCEEEEecCcccceeecCCcceeEEEEEcCEEEEEcCcEeeeeccccccCCCCCceEEEEecccccC Confidence 5555545544 333322211 122333222224555555555555543333210 Q ss_pred -------cC--------------------C-------------------------------------------------- Q lcl|NC_010325. 126 -------PP--------------------S-------------------------------------------------- 128 (513) Q Consensus 126 -------~~--------------------~-------------------------------------------------- 128 (513) .+ + T Consensus 161 ~~y~v~i~~~~~~~~~~~s~t~~y~t~~~~~~~~~~~~~~~~~~~~~~a~~l~~~~~~~~~~~~~~~t~~~~~~~~~~~~ 240 (826) T protein:vir:78 161 KAFSLTIKVKDNATGTTYSHTATYVTPDNASTNPNLAEAPFQTSVGYIAWQLFGKFFGAPEYTLPNSTKKYPKVDPDPAA 240 (826) T ss_pred ceeEEEeccceeecccccceeEEEEeccCCccccccccccceecchhhheecceeeccccceeeeccceeEeeccccccc Confidence 00 0 Q ss_pred -----------------------C----------------------ceeccc----CCC--------------------- Q lcl|NC_010325. 129 -----------------------E----------------------STFRVL----PNF--------------------- 138 (513) Q Consensus 129 -----------------------s----------------------~~f~~L----~g~--------------------- 138 (513) + ..+++| +.. T Consensus 241 ~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~v~~~~~l~a~~p~~~~~~~~~~~~~~~~~~~g~~~ 320 (826) T protein:vir:78 241 ATVAGYLNQRGVQDGYIAFRGDGDIVVEVSTDMGNNYGIASGGMSLNATADLPALLPGAGTPGTGVQFMDGAIMATGSTK 320 (826) T ss_pred eeeccceeecccccceEEEecCCCeEEEeccCCCccceEEEeeEEEecccceeeeecccccceEEEEEEeeeEecCCCcc Confidence 0 000010 000 Q ss_pred -------------------------------------cc--------cce-------------------eeEEEEEcCEE Q lcl|NC_010325. 139 -------------------------------------PA--------NTT-------------------FKRLKSFKNFL 154 (513) Q Consensus 139 -------------------------------------p~--------~~k-------------------a~~v~~~~~~l 154 (513) .. .|. -..|..|++|| T Consensus 321 ~~~y~~~~~~~~~w~e~a~~g~~~~~~tmp~~l~~~~~~~~f~~~~~~w~~r~~gd~~tnp~psf~g~~i~~v~f~q~RL 400 (826) T protein:vir:78 321 APVYFAWDAANRRWAERAAYGTDWVLKKMPLALRWDESTDTYSLNELEYDRRGSGDEETNPTFNFVKRGITGMTTFQGRL 400 (826) T ss_pred cceeEEEEcCCceEEEeeccCcccccccccEEEEEecCCCeEEEeeccccccccCcccccCcccccCCCceEEEEEeceE Confidence 00 010 02367889999 Q ss_pred EEEECCcCcccCCceEEEeccCCcccccccccccc--cccCcceeccc---CCCCceeEEEecCcceEEEecCcEEEEEe Q lcl|NC_010325. 155 VGLNATSNSVEMPQMVWWSTSADAGGVPASWDPTD--PTKDAGQNTLA---DTNGAIVDGVKLRDSFIIYKEDSVYSMRY 229 (513) Q Consensus 155 ~~~g~t~~~~~~p~rv~wS~~~d~~~~P~~Wd~t~--~t~~a~~~dl~---d~~G~iv~g~~l~~~~vIf~en~i~~m~y 229 (513) ++++ |+.|+.|..+|.+ +|.... ...+++=.++. +..-.|..+++....|+||...+-|.++- T Consensus 401 ~f~~--------~~~v~~Srtgd~~----nF~~~t~~~~~DdD~I~~~~~s~~~~~i~~~v~~~~~L~l~T~~~e~~l~~ 468 (826) T protein:vir:78 401 VLLS--------QEYVCMSASNNPH----RWFKKSAAALNDDDPIEIAAQGSLTEPYEHAVTFNKDLIVFAKKYQAVVPG 468 (826) T ss_pred EEee--------CCeEEEEeccCcc----ccccccccCCCCCCcEEEEEccCcceeEEEEEecCCcEEEEecCcEEEEeC Confidence 8753 6789999999964 443332 12233333332 33344667888899999999999999963 Q ss_pred cC--CCceeEeEEecCccccccCceeEEECCeEEEEeCCC-----eEE--ECC--cc-cccCC-chhHHHHHHhhcCcch Q lcl|NC_010325. 230 IG--GLFIFQFQQLFNDVGILGPNCAVEFDGNHFVVGHGD-----VYV--HNG--VQ-KQSVI-DAQVRKFFFSDINPDN 296 (513) Q Consensus 230 ~g--~~~~f~~~~i~~~~G~~~~~siv~~~~~~ffls~~G-----~y~--~~G--~~-~~~Ig-~~~V~~~~~~~i~~~~ 296 (513) .+ .|.--++.+.+. .||-+.=.=+.+|+.++|+++.| ++. ++. .+ +.... +.-+.+.| . .. T Consensus 469 ~~~lTP~~~~~~~~s~-~~~~~~~~Pv~vG~~v~F~~~r~~~~s~v~e~~~~~~~~~~y~~~dlt~~~~~l~-~----~~ 542 (826) T protein:vir:78 469 GGIVTPRTAVISITTQ-YDVDTRAAPAVTGRSVYFAAERALGFMGLHEMAPSPSTDSHYVAEDVTSHIPSYM-P----GP 542 (826) T ss_pred CCcccceeEEEEEEEe-ecccCCCCceEeCCeEEEEecCCCceeEEEEEEeeecccCccchHHHHHHHHHhc-C----CC Confidence 22 233455666653 35555555589999999998865 553 222 11 22211 22333322 1 11 Q ss_pred hCCEEEEEecCCCEEEEEEccCCCCCCcccceEEEEecc----c---CeEEEEeccceeee--eec--------cc-ccc Q lcl|NC_010325. 297 YQRTFVLADHVNTEMWVCYSSTRSKPGKHCDRAIIWNWK----E---NTWSIRDLPNVLSG--AYG--------II-DPK 358 (513) Q Consensus 297 ~~~~~~~~d~~~~~v~~~~~s~~~~~~~~~d~~lvyd~~----~---~~Ws~~d~~~~~~~--~~g--------~~-~~~ 358 (513) ...+ .+-.....+.|+-... +++++|-|. . ..|+.-+.+..+-. +++ .. ... T Consensus 543 v~~~--a~s~~~~~~v~~~~~~--------g~l~~~ty~~~~~e~~v~aW~~~~~~g~v~~v~~i~d~l~~vv~r~~~~~ 612 (826) T protein:vir:78 543 AEYI--QAAASSGYLVFGTSAA--------DEMICHQYLWQGNEKVQNAYHRWTLRHQIIGAYFTGDNLMVLIQKGQEIA 612 (826) T ss_pred eEEE--EEeCCCCeEEEEEcCC--------CeEEEEEEEecCCcEEEEeEEEEccCCcEEEEEEECCeEEEEEEeCCCEE Confidence 1111 2222223344554322 357777662 2 26876554432111 110 00 000 Q ss_pred cceee-cccCcc--cCccce--------eccccc--------cccCccce-EE----EEeecC------------ceeee Q lcl|NC_010325. 359 VSNLW-DDDPNP--WDTDTS--------VWGEGS--------YNPAKSSM-IF----SSFQDK------------KLFLF 402 (513) Q Consensus 359 ~~~~~-~~~~~~--~d~d~~--------~~~~ds--------~~~~~~~~-~~----~~~~~~------------~~~~~ 402 (513) ..... ...+.. .+.... .|.-.. ....+..+ .+ ++..+. -.+.+ T Consensus 613 ~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~l~~ 692 (826) T protein:vir:78 613 LGRMHLNSLPAREGLQYPKYDYWRRIEATVDGELELTKQHWDLIKDGAAVYQLQPQVGAYMERYQLGVKRETSTKVFLDV 692 (826) T ss_pred EEEEEEEecCCCccccccccceeEEEEEEEcceeccccceeEEecCCceeeeeccceeeeccccceeccccCCCceEEEe Confidence 00000 000000 000000 000000 00000000 00 000000 01111 Q ss_pred c----ccceeecCccEEEEeecccccCCCc---c---eEEEeeeeeccCCCeeEEEEeeeeecCC-------------CC Q lcl|NC_010325. 403 G----NNSTFSGQNFVSTLERSDIYLGDDR---M---MKTVSAIIPHITGNGTCNIWVGNAQVQG-------------SG 459 (513) Q Consensus 403 ~----~~~~~~g~~l~a~~~~~~~~~~~~~---~---~~~i~~~~~~~t~~~~~~~~~g~~~~~~-------------~~ 459 (513) . .....-|-++++.++...+.+.+.. + +.+++++.......+.+.+.++...-.. .. T Consensus 693 ~~~~~~~~v~VGl~y~s~~~~~~~~~~~~~g~~~~~~r~~l~r~~~~~~~tg~~~v~v~~~~~~~~~~~~~~~~~~~~~~ 772 (826) T protein:vir:78 693 PEAVVGSVYVVGCEFWSKVEFTPPVLRDHNGLPMTSTRAVLHRYNVNFGWTGEFLWRISDTARPNQPWYDTTPLRLSSRQ 772 (826) T ss_pred CCCccccEEEEeeceeEEEEeCceEEecCCCcceeecceEEEEEEEEeeccccEEEEeCCCccCcceeeeeccccccccc Confidence 1 1236678888888886555332111 1 1234443333333344444433211000 00 Q ss_pred ceEcCceeeecCCceEEEeecCCCeEEEEEEccCCCcEEEEEEeeEE-eccccCC Q lcl|NC_010325. 460 IRWKGPYPYRIGQDYKIDTKHVGRYIALKFDFSSEGDWYFNGYTIEM-APKAGMR 513 (513) Q Consensus 460 ~~w~~~~~~~~~~~~~~~~R~~~Ry~~~rl~~~~g~~w~~~G~~~~~-~~~g~rr 513 (513) +....+. ..++ .-.++++....-..++|+.+.-.+.++.+++.|+ ...=.|| T Consensus 773 l~~g~~~-~~t~-~v~vp~~~~~~~~~i~i~~d~P~P~tvlai~~~~~y~~r~rr 825 (826) T protein:vir:78 773 LNAGEPL-VDSA-VVPLPARVDMATSKFELSCHSPYDMNVRAVEYNFKSNQTYRR 825 (826) T ss_pred ccCCccc-ccce-EEEEeeeccCceEEEEEEeCCCCcEEEEEEeEEEEecceeec Confidence 0000010 1122 2446777778888899999999999999999999 5666666 No 33 >protein:vir:95324 Length: 823 # NCBI annotation: hypothetical protein # Family: family:all:780 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512269;genbank:gi:89152436;genbank:GeneID:3952993 Probab=98.20 E-value=2.8e-06 Score=51.04 Aligned_cols=476 Identities=14% Similarity=0.098 Sum_probs=198.5 Q ss_pred CcccchhhcCcccc-----cc---ccCcccCCCCcEEEeEEEEEe-CCeeEECCCcceeeecC-CCcceeeeeeee-CCc Q lcl|NC_010325. 1 MALERQEVKNPTGI-----VT---DIAPADLPLEKWSFGNNVRFK-NGKAQKTLGHTPIFDTA-QAPILDMFPFIR-NNI 69 (513) Q Consensus 1 m~~~~~~~~~~~G~-----~~---~~~P~~lp~~a~~~~~Nv~~~-~g~~~~~~g~~~~~~~~-~~~~~~~~~~~~-~g~ 69 (513) |.|..++. .-+|. +. |++- =++....|.||.+. .|++++|+|..=+.... ..+-..+.++.. .+. T Consensus 1 m~i~~~q~-sF~~GElsP~l~gR~Dl~r---y~~q~~~~~N~~~~~~GGl~rRpGt~fva~~~~~~g~~rLipf~~s~~q 76 (823) T protein:vir:95 1 MAISWIQP-SFAGGEIGPSLYGRIDMAK---YQVALRKCDNFIVRQYGGVENRPGTRFVGAAKYPNRKCRLIPFQFSTVQ 76 (823) T ss_pred Ccceeech-hccCceechheeccchHHH---HHHHHhhhhCcEeeecCCceecCchhhhhhhcCCCCCeeEEEEEeCCCc Confidence 77666554 33333 11 2211 13677889999998 58999999976664432 123334555553 467 Q ss_pred eEEEEEcCceEEEecCce-EEeccc------cceeeCCCCceeEEeeCCEEEEEeCCCceEEEc---CCCceec------ Q lcl|NC_010325. 70 PYWLLCSEQRLYLADGTT-IIDVSP------GPYSASITNRWSVGSFNGVIFANDGVNPPHHLP---PSESTFR------ 133 (513) Q Consensus 70 ~~~~v~~~~kly~~~~~t-~~dis~------~~~~~~~~~~w~f~~~~~~~ia~ng~d~~q~~~---~~s~~f~------ 133 (513) .++++.+.+.|+-|.... ..+-+. .||+...-..-+|++.+|+++++|..-+||.+. ..+..+. T Consensus 77 ~y~Lefg~~~irV~~~~g~vv~~~~~~~ev~tPy~~~~l~~Lr~~qsaD~~fivh~~~~p~~L~r~~~~~w~l~~~~~~~ 156 (823) T protein:vir:95 77 TYALEFGHQYMRVIKDGALVLNSSNVIYEIATPYTEADLFRIKFTQSADVLTLVHPAYPPKELRRYAHDNWQLVDVVTKN 156 (823) T ss_pred EEEEEEcCCeEEEEeCCcEEEecCCceeEEecccccccccceeEEEeccEEEEEcCCccceEEEecCCCCceEEEEEEec Confidence 788888888887776653 222111 234333223456777777777776655554210 0000000 Q ss_pred --------------------------------------------------------------------------cc---- Q lcl|NC_010325. 134 --------------------------------------------------------------------------VL---- 135 (513) Q Consensus 134 --------------------------------------------------------------------------~L---- 135 (513) ++ T Consensus 157 gp~~~~~~~~t~~v~~~~~~~~~t~ta~~~~~~~d~vg~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 236 (823) T protein:vir:95 157 GPFEDINIDESLTVYASASTGTITLTASASIFGAEQVGKLFYLEQPAVDSVPVWETSKSTSIGDIRRADSNYYRAVTAGK 236 (823) T ss_pred cccccccccceeEEeccccCceeEEeecccccchhhccceEEEeccccceeeecceeeeecccceEEecccceeeeeccc Confidence 00 Q ss_pred -----------------------------------CCC-------------------cc--------------------c Q lcl|NC_010325. 136 -----------------------------------PNF-------------------PA--------------------N 141 (513) Q Consensus 136 -----------------------------------~g~-------------------p~--------------------~ 141 (513) .|. |. + T Consensus 237 ~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~t~v~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~ 316 (823) T protein:vir:95 237 TGTLRPSHTEGTSWDGWGGSGDDDTGIEWEYLHSGFGIARITAVNGTTATAEVISYIPSQVVGEDNASYKWAKYAWNSVN 316 (823) T ss_pred cceeecccCCcceEEeceecccccceeEEEEEeCCcceEEEEeecceeeeceEeeeeccccccCCcCCccccccccCcCC Confidence 000 00 0 Q ss_pred ceeeEEEEEcCEEEEEECCcCcccCCceEEEeccCCcccccccccccccccCcceeccc---CCCCceeEEEecCcceEE Q lcl|NC_010325. 142 TTFKRLKSFKNFLVGLNATSNSVEMPQMVWWSTSADAGGVPASWDPTDPTKDAGQNTLA---DTNGAIVDGVKLRDSFII 218 (513) Q Consensus 142 ~ka~~v~~~~~~l~~~g~t~~~~~~p~rv~wS~~~d~~~~P~~Wd~t~~t~~a~~~dl~---d~~G~iv~g~~l~~~~vI 218 (513) -.-..+..|+|||++++..+ .|+.|+.|..+|. .+|.......+++=.++. +....|..+++.+ .++| T Consensus 317 g~Ps~v~f~q~RL~f~g~~~----~p~~v~~Srtgd~----~nF~~~~~~~DdD~I~~~~s~~~~~~i~~~v~~~-~Lli 387 (823) T protein:vir:95 317 GYPGTVVYYQQRLYFAASTA----FPQTIWASRTGDY----KDFGKSNPTQDDDRIIYTYAGRQVNEIRHLIDVG-SLVA 387 (823) T ss_pred CCccEEEEEeceEEEEEcCC----CCcEEEEeccCCc----cccccccCCCCCCcEEEEEcCCcceEEEEEeecC-cEEE Confidence 01234788999999877653 5899999999996 455544444343434333 2223466777775 6999 Q ss_pred EecCcEEEEEecC----CCceeEeEEecCccccccCceeEEECCeEEEEeCCC--eE--EECC--cccccCCchhHHHHH Q lcl|NC_010325. 219 YKEDSVYSMRYIG----GLFIFQFQQLFNDVGILGPNCAVEFDGNHFVVGHGD--VY--VHNG--VQKQSVIDAQVRKFF 288 (513) Q Consensus 219 f~en~i~~m~y~g----~~~~f~~~~i~~~~G~~~~~siv~~~~~~ffls~~G--~y--~~~G--~~~~~Ig~~~V~~~~ 288 (513) |..++-|.++-.+ .|..-++++.+ ..||- .=.=+.+|+.++|+++.| ++ .++- ..+.+..--.+-+-+ T Consensus 388 ~t~~~e~~l~~~~~~~lTP~~~~~~~~s-~~g~~-~~~Pv~vg~~~~Fv~~~g~~vre~~~~~~~d~~~~~dlT~~a~hl 465 (823) T protein:vir:95 388 LTSGGEYVITGDQNKVLTPSSFAFSSQG-SNGSS-NVPPIAVANIALFVQEKGSVVRDLAYSFDVDGYQGNDLTILANHL 465 (823) T ss_pred EecCcEEEEEcCCCcccceeeEEEEEee-ccccc-cccceEeCCeEEEEecCCCEEEEEEEeeecCceecchhhhhhhhh Confidence 9999999997432 24457777776 55874 456678999999999988 33 2221 112221111111222 Q ss_pred HhhcCcchhCCEEEEEecCCCEEEEEEccCCCCCCcccceEEEEecccC--eEEEEeccceeeeeecccccccceeeccc Q lcl|NC_010325. 289 FSDINPDNYQRTFVLADHVNTEMWVCYSSTRSKPGKHCDRAIIWNWKEN--TWSIRDLPNVLSGAYGIIDPKVSNLWDDD 366 (513) Q Consensus 289 ~~~i~~~~~~~~~~~~d~~~~~v~~~~~s~~~~~~~~~d~~lvyd~~~~--~Ws~~d~~~~~~~~~g~~~~~~~~~~~~~ 366 (513) +.. + .-+.-++-.....+.|+..+.+.- -.+-|+-+.+ .|+.-+.+..+-....+.......+|.-. T Consensus 466 ~~~----~-~i~~~a~~~~p~~~~~~v~~dG~l------~~~ty~~~q~v~aW~~~~~~g~~~~~~~i~~~~~d~l~~~v 534 (823) T protein:vir:95 466 FQK----H-SIVDWCFSIVPYSSAFCIRDDGKL------LVMTYLRDQQVFAWAPQSSTGKYESTCSISEGNEDAVYFVV 534 (823) T ss_pred cCC----C-ceEEEEEecCCCeEEEEEecCCcE------EEEEEecccceeeeEEEecCCcEEEEEEecCCCCCEEEEEE Confidence 211 0 111222333333455665544321 2345555544 68877665543222222221122222221 Q ss_pred CcccCccceec--cccccccCccceEEEEeecCceeeecccceeecCccEEEEeecccccCCCcce--EEE----eeeee Q lcl|NC_010325. 367 PNPWDTDTSVW--GEGSYNPAKSSMIFSSFQDKKLFLFGNNSTFSGQNFVSTLERSDIYLGDDRMM--KTV----SAIIP 438 (513) Q Consensus 367 ~~~~d~d~~~~--~~ds~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~l~a~~~~~~~~~~~~~~~--~~i----~~~~~ 438 (513) .-..+....-. -.++..+. ...+ .+.++-..+++|.+...+...-+-..++-.+. ..+ .++.+ T Consensus 535 ~R~i~g~~~~yiE~~~~~~~~-------~~~~--~~~lD~~~s~~g~~~~~~~~~l~~g~~~l~~l~g~~v~~adg~~~~ 605 (823) T protein:vir:95 535 NRTVNGQTVRYIERLSSRLFT-------SDED--AFFVDSGLSYDGRNTSDRTMTITGGSGEWDYLAEYTISVSGGAYFT 605 (823) T ss_pred EeccCCeEEEEEEeeccccCC-------Cccc--eeEEEEEEEeecCcccceeeEecCCCCcccccCceEEEecCcceEC Confidence 11101000000 00000000 0011 11111122344444432222111110100000 000 00111 Q ss_pred -ccCCCeeEEEEeeeeecCCCCceEcCceeeecC--CceEEEeecCCCe--EEEEE--Ecc---CCCcEE----EEEEee Q lcl|NC_010325. 439 -HITGNGTCNIWVGNAQVQGSGIRWKGPYPYRIG--QDYKIDTKHVGRY--IALKF--DFS---SEGDWY----FNGYTI 504 (513) Q Consensus 439 -~~t~~~~~~~~~g~~~~~~~~~~w~~~~~~~~~--~~~~~~~R~~~Ry--~~~rl--~~~---~g~~w~----~~G~~~ 504 (513) +.. .+.+++-.-..........++.-....+. ..+-..+|. +|- .+|+. ... +...|+ +.|=+| T Consensus 606 ~~~v-~g~i~l~~~~~~~~vGl~~~~~i~~~~~~v~~~~a~~~~~-~r~v~a~l~~~~t~~~~~~~~~~~gL~hleg~tv 683 (823) T protein:vir:95 606 SSDV-GAQLQFPYTGADPDTGYEVSKELRCDIISVTSNTAVVVRA-NRNVPPSLRNVATTNWQMARRTFGGLSHLEGQTV 683 (823) T ss_pred Cccc-eeEEEeCcCCCccccccceEEEEEEeeceeeCCceEEEcc-CCcccceeeeeeccccccccceeeeccccccceE Confidence 000 01111100000000000000000000000 000000110 011 11111 111 111111 222223 Q ss_pred EEeccccCC Q lcl|NC_010325. 505 EMAPKAGMR 513 (513) Q Consensus 505 ~~~~~g~rr 513 (513) .....|.=. T Consensus 684 ~v~~dg~~~ 692 (823) T protein:vir:95 684 NILSDANVE 692 (823) T ss_pred EEEEcCeee Confidence 222222211 No 34 >protein:vir:99677 Length: 794 # NCBI annotation: Tail tubular protein B # Family: family:all:825 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249591;genbank:gi:68299742;genbank:GeneID:3799992 Probab=98.19 E-value=2.8e-06 Score=51.03 Aligned_cols=483 Identities=13% Similarity=0.089 Sum_probs=238.8 Q ss_pred CcccchhhcCccccccccCcccCCCCcEEEeEEEEEe-CCeeEECCCcceeeecCCCcc----eeeeeee-eCCceEEEE Q lcl|NC_010325. 1 MALERQEVKNPTGIVTDIAPADLPLEKWSFGNNVRFK-NGKAQKTLGHTPIFDTAQAPI----LDMFPFI-RNNIPYWLL 74 (513) Q Consensus 1 m~~~~~~~~~~~G~~~~~~P~~lp~~a~~~~~Nv~~~-~g~~~~~~g~~~~~~~~~~~~----~~~~~~~-~~g~~~~~v 74 (513) |+...+.+.+..|.|.-..+..-=++....|.||+++ .|+++||+|..=+....++.- ..+.++. .....++++ T Consensus 1 M~~i~~s~~n~~~GvS~q~D~~ry~~q~~~~~N~~~~~~gG~~rRpG~~fv~~l~~~~~~~~~~~l~~f~~~~~~~y~l~ 80 (794) T protein:vir:99 1 MALISQSIKNLKGGISQQPDILRYSDQGSKQINGFSSEVEGLQKRPPSVHIKRLTDQFGLGQKPYCHIINRDEVERYAVF 80 (794) T ss_pred CceeeeecchhhcceecCCchHHhhhhHhhhhcceeeeccCcccCCccceeeeecCCCCCccccEEEEEEeCCCceEEEE Confidence 9999999999999988655554456888999999999 588999999876654322211 1233443 334566667 Q ss_pred EcCceEEEe--cCceEEeccc---cceeeCCC--CceeEEeeCCEEEEEeCCCceEEEc---------CC---------- Q lcl|NC_010325. 75 CSEQRLYLA--DGTTIIDVSP---GPYSASIT--NRWSVGSFNGVIFANDGVNPPHHLP---------PS---------- 128 (513) Q Consensus 75 ~~~~kly~~--~~~t~~dis~---~~~~~~~~--~~w~f~~~~~~~ia~ng~d~~q~~~---------~~---------- 128 (513) .+.+.|+-| .++.-..+.. .+|-..++ ..-++++-+|+++++|..-+||... .. T Consensus 81 f~~~~irv~~~~~g~~~~v~~~~~~~y~~~~~~~~~l~~~q~aD~~fi~n~~~~p~~~~~~~~~~~~~~~~~~~~~v~~g 160 (794) T protein:vir:99 81 FTGSNIRVFDLFTGDEKTVNAPNGLSYVSSSNPRKDLRMVTVADYTFILNRNVATAQGTTNTPSGLAPFGHFGLVVIRGG 160 (794) T ss_pred EcCCeEEEEECCCCeEEEeeccccccccccCCccceeeEEEEccEEEEEcCCeeeeEeeeeccccCcCCCceEEEEeccC Confidence 777776645 3343333221 11211111 2356777777777777655554210 00 Q ss_pred -------------------------------------------------------------------------------- Q lcl|NC_010325. 129 -------------------------------------------------------------------------------- 128 (513) Q Consensus 129 -------------------------------------------------------------------------------- 128 (513) T Consensus 161 ~y~~~y~v~i~gs~ta~~~tp~~~~~~~~~~~s~~~ia~~l~~~l~~~g~~v~~~~g~~~i~~~~~~~v~t~s~~~g~~~ 240 (794) T protein:vir:99 161 QYGRTYRIKVNGSVEASFETPLGDQVAHAKQIDIAYIIDQLAAGLINKGWAVTKGSGYFYFSKSGSVIINSLEVEDGYNG 240 (794) T ss_pred CCCceEEEEecCCcccceeeccCcccccccccchhhhhhhhHhhhhcccceEEeCCeEEEEEecCCceeEEEEeecCCCC Confidence Q ss_pred ---------CceecccCCCc--------------------------------------------c--------------- Q lcl|NC_010325. 129 ---------ESTFRVLPNFP--------------------------------------------A--------------- 140 (513) Q Consensus 129 ---------s~~f~~L~g~p--------------------------------------------~--------------- 140 (513) .+.+.+||... + T Consensus 241 t~~~~~~~~v~~~~~Lp~~~~~G~~v~v~~~~~~~~~~y~v~~~~~~~~w~e~~~~~~~~~~~~~t~p~~~v~~~~~~~~ 320 (794) T protein:vir:99 241 QLAWGIINDVQKTTQLPVYAPNNYIIRVSGDPTLNQDDYYVRFDASRNVWTECPAPNIKADYNKATMPHVLIREADGTFT 320 (794) T ss_pred ceeeEEeeeccceeecccCCCCCeEEEEeccCCCCCCceEEEEEcCCceEEeeccceeecceeccceEEEEeccCCCcee Confidence 00001111000 0 Q ss_pred ----ccee-------------------eEEEEEcCEEEEEECCcCcccCCceEEEeccCCccccccccccccc--ccCcc Q lcl|NC_010325. 141 ----NTTF-------------------KRLKSFKNFLVGLNATSNSVEMPQMVWWSTSADAGGVPASWDPTDP--TKDAG 195 (513) Q Consensus 141 ----~~ka-------------------~~v~~~~~~l~~~g~t~~~~~~p~rv~wS~~~d~~~~P~~Wd~t~~--t~~a~ 195 (513) .|.. .-|..|++||++++ ++.|+.|..+|.+ +|..... ..+++ T Consensus 321 ~~~~~w~~r~~Gd~~tnp~psf~g~~is~v~f~q~RL~f~~--------~~~v~~Srtgd~~----nF~~~t~~~~~DdD 388 (794) T protein:vir:99 321 FKQADWTHRAAGDDETNPYPSFIGNSINDIFFFRNRLGFLS--------GENVILSGSGNYF----NFFPESVAVLTDTD 388 (794) T ss_pred EeeccccccccCCcccCCCccccCcceeEEEEEeeeEEEec--------CCeEEEEecCCcc----ccccccccCCCCCc Confidence 0100 12667788887642 3679999999964 4433322 12333 Q ss_pred eeccc---CCCCceeEEEecCcceEEEecCcEEEEEecC--CCceeEeEEecCccccccCceeEEECCeEEEEeCCCeE- Q lcl|NC_010325. 196 QNTLA---DTNGAIVDGVKLRDSFIIYKEDSVYSMRYIG--GLFIFQFQQLFNDVGILGPNCAVEFDGNHFVVGHGDVY- 269 (513) Q Consensus 196 ~~dl~---d~~G~iv~g~~l~~~~vIf~en~i~~m~y~g--~~~~f~~~~i~~~~G~~~~~siv~~~~~~ffls~~G~y- 269 (513) =.++. +....|..+++....|+||...+-|.++-.+ .|.-.++.+.+. .+|-+.=.=+.+|+.++|+++.|=| T Consensus 389 ~I~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~l~~~~~lTP~~~~~~~~s~-~~~~~~~~Pv~vg~~v~f~~~~g~~~ 467 (794) T protein:vir:99 389 PIDVAVSTNRISILKYAVPFSEELILWSDQAQFVLSSDGGLTPTTIRLDLTTE-FEVTEQARPYGIGRGVYFVSPRAKFS 467 (794) T ss_pred cEEEEecCCcceeeEEEeecCCcEEEEecCcEEEEeCCCcccceeEEEEEEEE-eeccCCCCceEeCCeEEEEecCCCee Confidence 33332 3334466678889999999999999996322 244466666663 3466656678899999999999832 Q ss_pred ------EECCcc--cccCC-chhHHHHHHhhcCcchhCCEEEEEecCCCEEEEEEccCCCCCCcccceEEEEecc----c Q lcl|NC_010325. 270 ------VHNGVQ--KQSVI-DAQVRKFFFSDINPDNYQRTFVLADHVNTEMWVCYSSTRSKPGKHCDRAIIWNWK----E 336 (513) Q Consensus 270 ------~~~G~~--~~~Ig-~~~V~~~~~~~i~~~~~~~~~~~~d~~~~~v~~~~~s~~~~~~~~~d~~lvyd~~----~ 336 (513) .++-.+ +.+.. +.-+.+.| +...-.+ ..+=.....+.|+-... +++++|-|. . T Consensus 468 ~v~r~~~~~~~~d~y~a~Dlt~~~~hl~-----~~~~~~~-~a~~~~~~~~v~~~~~~--------g~l~~~~y~~~~~e 533 (794) T protein:vir:99 468 SVRRFYAVQDVTQVKNAEDISAHVPYYV-----ENGVFKM-SGSSTENFLTILTEGNE--------QRVYFYKFLYLQEQ 533 (794) T ss_pred EEEEeeeeccccCceehhhHHHHHHHhc-----CCCeEEE-EEeCCCCcEEEEEEcCC--------CEEEEEEEeecCCc Confidence 233222 22221 11222222 1111111 12222233444554322 356676652 2 Q ss_pred C---eEEEEeccce----eeeeec-----ccccc----ccee--------ecccCcccCcccee--------cccc---- Q lcl|NC_010325. 337 N---TWSIRDLPNV----LSGAYG-----IIDPK----VSNL--------WDDDPNPWDTDTSV--------WGEG---- 380 (513) Q Consensus 337 ~---~Ws~~d~~~~----~~~~~g-----~~~~~----~~~~--------~~~~~~~~d~d~~~--------~~~d---- 380 (513) + .|+.-+.+.. |..+.+ +..-. .... +...+.-..+|... ++.+ T Consensus 534 q~v~aW~~~~~~g~~~~~~~~~~~d~l~~~v~r~~~~~ler~~~~~~~~~~~~~~~~~~lD~~~~~~~~~~~~~~~~~~~ 613 (794) T protein:vir:99 534 LVQQSWSHWDFGVNCRVLCCDMIGAVMHLIIDSPSGVLMEKIEFTQNTKDYPDEPYRLYVDRKIEYTFPEGSYNDDDFKT 613 (794) T ss_pred eEEEeEEEEEcCCCeEEEEEEEcCCEEEEEEEeCCCEEEEEEEeeeCCCCCCCcccceeeeeeeeeeecccccccCccee Confidence 2 6887665431 111110 00000 0000 00001101111100 0000 Q ss_pred ---------ccccCccceEEEEeecCceee----------------e----cccceeecCccEEEEeecccccC---CCc Q lcl|NC_010325. 381 ---------SYNPAKSSMIFSSFQDKKLFL----------------F----GNNSTFSGQNFVSTLERSDIYLG---DDR 428 (513) Q Consensus 381 ---------s~~~~~~~~~~~~~~~~~~~~----------------~----~~~~~~~g~~l~a~~~~~~~~~~---~~~ 428 (513) ..-..|... ....++..+. + ......-|-++++.++...+.+- ... T Consensus 614 ~~~~~~~~g~~~l~g~~v--~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~vGl~y~s~~~~~~~~~~~~~~~g 691 (794) T protein:vir:99 614 RVKLKDIYGSTPANGQYV--FISLGGVTFTFDPPAGGWQANDGLIEFDGDLRGTKFFVGEAYTFLYEFSKFLIKTTDTAD 691 (794) T ss_pred EEeccccccccccCCceE--EEEeCCceeeeecccceEecCccEEEecCCCCCcEEEEeeeeeEEEeecceEEeecCCCC Confidence 000111121 2223332221 1 11225568888888876655432 111 Q ss_pred c-------eEEEeeeeeccCCCeeEEEEee--eeec--CCCCceEcCc------eeeecCCceEEEeecCCCeEEEEEEc Q lcl|NC_010325. 429 M-------MKTVSAIIPHITGNGTCNIWVG--NAQV--QGSGIRWKGP------YPYRIGQDYKIDTKHVGRYIALKFDF 491 (513) Q Consensus 429 ~-------~~~i~~~~~~~t~~~~~~~~~g--~~~~--~~~~~~w~~~------~~~~~~~~~~~~~R~~~Ry~~~rl~~ 491 (513) + +.++.++.......+.+.+.+. .++. ...+..|.++ ...-+| +..++++...+-..++|+. T Consensus 692 ~~~~~~~gr~~l~r~~~~~~~tg~~~v~v~~~~~~~~~~~~~~~~~~~~~~~g~~~~~tg-~~~vp~~g~~~~~~v~i~~ 770 (794) T protein:vir:99 692 GVATEDIGRLQLRRAWVNYDKSGNFRVEVNNQGRTFTYNMTGNRLSTNELILGDESLDTG-QFRYAVSGNATQVTVSLIS 770 (794) T ss_pred ceeeeccceEEEEEEEEEeecccceEEEECCCccceeeeccccccccccccccccccccc-eEEEEecccccceEEEEEE Confidence 1 1234443333322222222111 1111 0112223221 111222 4567787777888999999 Q ss_pred cCCCcEEEEEEeeEEe-ccccCC Q lcl|NC_010325. 492 SSEGDWYFNGYTIEMA-PKAGMR 513 (513) Q Consensus 492 ~~g~~w~~~G~~~~~~-~~g~rr 513 (513) +.-.+.++.++++|.. -.=.|| T Consensus 771 d~P~P~tvlsi~~e~~y~~r~~~ 793 (794) T protein:vir:99 771 DTPNPLSIIGGGWEGYYVRRSSG 793 (794) T ss_pred CCCCCEEEEEEEEEEEEeccccC Confidence 9999999999999983 333444 No 35 >protein:vir:7329 Length: 825 # NCBI annotation: hypothetical protein # Family: family:all:780 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848220;genbank:gi:30387391;genbank:GeneID:2641863 Probab=98.13 E-value=3.9e-06 Score=50.21 Aligned_cols=465 Identities=13% Similarity=0.106 Sum_probs=198.9 Q ss_pred CcccchhhcCccccccccCcc-----cCC--CCcEEEeEEEEEe-CCeeEECCCcceeeecC-CCcceeeeeee-eCCce Q lcl|NC_010325. 1 MALERQEVKNPTGIVTDIAPA-----DLP--LEKWSFGNNVRFK-NGKAQKTLGHTPIFDTA-QAPILDMFPFI-RNNIP 70 (513) Q Consensus 1 m~~~~~~~~~~~G~~~~~~P~-----~lp--~~a~~~~~Nv~~~-~g~~~~~~g~~~~~~~~-~~~~~~~~~~~-~~g~~ 70 (513) |.+..++.-=.-|-+ +|. ||. .++...|.||++. -|++++|+|..=+..+. +..-..+.+|. +.+.. T Consensus 1 m~~~~~q~sF~~GEl---sP~l~gR~Dl~~y~~g~~~~~N~~~~p~Gg~~rRpGt~fva~~~~~~~~~rLipF~fs~~q~ 77 (825) T protein:vir:73 1 MAFSWIQPSFAGGEI---GPSLYGRIDMSKYQVALRKCDNFIVRQYGGVENRPGTRFVGPAKYPDRKCRLIPFQFSTVQT 77 (825) T ss_pred Cccceecccccccee---chhhcccchHHHHHHHHHHhcCcEEEecCCceecCchHHhHhhcCCCCCEEEEEEEeCCCcE Confidence 877776653222322 222 121 2677899999999 58899999976654332 22334555665 45678 Q ss_pred EEEEEcCceEEEecCce-EEecc------ccceeeCCCCceeEEeeCCEEEEEeCCCceEEEc----------------- Q lcl|NC_010325. 71 YWLLCSEQRLYLADGTT-IIDVS------PGPYSASITNRWSVGSFNGVIFANDGVNPPHHLP----------------- 126 (513) Q Consensus 71 ~~~v~~~~kly~~~~~t-~~dis------~~~~~~~~~~~w~f~~~~~~~ia~ng~d~~q~~~----------------- 126 (513) ++++.+.+.|+-|.++. ..+-. .+||+...-..-++++-+|++++++...+||.+. T Consensus 78 y~Lefg~~~lrv~~~gg~v~~~~~~~~e~~TPy~~~~l~~l~~~QsaD~~~i~h~~~pp~~L~r~~~~~W~l~~~~f~~g 157 (825) T protein:vir:73 78 YALEFGHNYMRVIKDGAYVLTTSNVIYELAMPYADTDLFRIKFTQSADVLTLVHPAYPPKELRRYAHDNWQIVDVTTKNG 157 (825) T ss_pred EEEEEeCCeEEEEeCCceEeccCCceEEEecccchhhhhhheeeeecCEEEEEcCCCceeEEEEecCCCcEEEEEeccCC Confidence 88888898887775542 21111 1344332222334555555555555444443210 Q ss_pred -------------------------------------------------------------------------------- Q lcl|NC_010325. 127 -------------------------------------------------------------------------------- 126 (513) Q Consensus 127 -------------------------------------------------------------------------------- 126 (513) T Consensus 158 p~~~in~~~sv~v~asg~tg~~TiTaS~a~~~~~~vG~~i~~~~~~v~si~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~ 237 (825) T protein:vir:73 158 PFEDINVDETVKVYASASTGTITLTASSAIFGAEQVGKLFYLEQPAVDSVPVWETSKTTAINDVRRADSNYYRANTSGKT 237 (825) T ss_pred ccccccccccceeeecccCceeEEEeeccccCchhcCeEEEEecccccccceeeeeeEEEeeeEEECCCceeeeeccccc Confidence Q ss_pred --------------------------------CCCcee--cccCCC------------cc-------------------- Q lcl|NC_010325. 127 --------------------------------PSESTF--RVLPNF------------PA-------------------- 140 (513) Q Consensus 127 --------------------------------~~s~~f--~~L~g~------------p~-------------------- 140 (513) ...+.+ ..+.+. +. T Consensus 238 ~t~~~~a~~g~~~~~~~g~~~~~~~~~~~~~~~~~g~~~it~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~ 317 (825) T protein:vir:73 238 GTLRPSHTEGMSWDGWGGTGSDDTGIQWEYLHSGFGIAKITAVAGDGLTATADVVSFIPSQVVGSANASYKWAKYAWNSV 317 (825) T ss_pred ceeeccccCCceeEeeeeecccCCceEEEEEecCCceEEEeeccccceeeccccceecccccccCCCCCcccccCCcccC Confidence 000000 000000 00 Q ss_pred cceeeEEEEEcCEEEEEECCcCcccCCceEEEeccCCcccccccccccccccCcceeccc---CCCCceeEEEecCcceE Q lcl|NC_010325. 141 NTTFKRLKSFKNFLVGLNATSNSVEMPQMVWWSTSADAGGVPASWDPTDPTKDAGQNTLA---DTNGAIVDGVKLRDSFI 217 (513) Q Consensus 141 ~~ka~~v~~~~~~l~~~g~t~~~~~~p~rv~wS~~~d~~~~P~~Wd~t~~t~~a~~~dl~---d~~G~iv~g~~l~~~~v 217 (513) +.-..+|..|+|||++++.. ..|++|+.|..+|.+ ++.......+++-.++. +....|..+++.+ .++ T Consensus 318 ~gyPs~v~f~q~RL~f~g~~----~~p~~v~~Srtgd~~----nF~~~~~~~DdD~I~~~~s~~~~~~i~~~~~~~-~L~ 388 (825) T protein:vir:73 318 NGYPSTVVYYQQRLYFAAST----AYPQTIWASRTGDYK----DFGKNNPIQDDDRIIYTYAGRQVNEIRHLIDVG-NLV 388 (825) T ss_pred CCCccEEEEEcceEEEeecC----CCCCEEEEEccCCcc----ccccCCCCCCCccEEEEEcCCcceeEEEEeecC-cEE Confidence 00124588899999987764 358999999999964 44444443344444443 3333465666765 799 Q ss_pred EEecCcEEEEEecC----CCceeEeEEecCccccccCceeEEECCeEEEEeCCC--eE--EECCc--ccccCCchhHHHH Q lcl|NC_010325. 218 IYKEDSVYSMRYIG----GLFIFQFQQLFNDVGILGPNCAVEFDGNHFVVGHGD--VY--VHNGV--QKQSVIDAQVRKF 287 (513) Q Consensus 218 If~en~i~~m~y~g----~~~~f~~~~i~~~~G~~~~~siv~~~~~~ffls~~G--~y--~~~G~--~~~~Ig~~~V~~~ 287 (513) ||...+-|.++-.. .|..-++++.+. .||-+ =.=+.+|+.++|+++.| ++ .++.. .+.+..--.+-+- T Consensus 389 ~~t~~~e~~l~~~~~~~lTP~~~~~~~~s~-~g~~~-~~Pv~vg~~~~Fv~~~g~~vre~~~~~~~d~~~~~dlt~~a~h 466 (825) T protein:vir:73 389 ALTSGGEYTISGDQNKVLTPSAFSFSSQGN-NGSSN-VPPIAVANIALFIQEKGSVVRDLAYSFDVDGYQGTDLTILANH 466 (825) T ss_pred EEecCceEEEecCCCcccceeeEEEEeeee-ecccc-ccceEeCCeEEEEeCCCCeEEEEEEeeecCceeccchhhhhHh Confidence 99999999996331 244567777774 57743 45678899999999988 33 22221 1222111111122 Q ss_pred HHhhcCcchhCCEEEEEecCCCEEEEEEccCCCCCCcccceEEEEecccC--eEEEEeccceeeeeecccccccceeecc Q lcl|NC_010325. 288 FFSDINPDNYQRTFVLADHVNTEMWVCYSSTRSKPGKHCDRAIIWNWKEN--TWSIRDLPNVLSGAYGIIDPKVSNLWDD 365 (513) Q Consensus 288 ~~~~i~~~~~~~~~~~~d~~~~~v~~~~~s~~~~~~~~~d~~lvyd~~~~--~Ws~~d~~~~~~~~~g~~~~~~~~~~~~ 365 (513) ++.. ..-+.-.+...-..+.|+-...+.- -.+-|+.+.+ .|+.-+....+-....+.......+|.- T Consensus 467 l~~~-----~~~~~~a~~~~p~~~~~~v~~dg~l------~~~ty~~~q~v~aW~~~~~~g~v~~~~~i~~~~~D~l~~i 535 (825) T protein:vir:73 467 LFQK-----HSIVDWSFCIVPYSSAFCIRDDGKL------LVLTYLRDQQVFAWAPQSSAGKYESTCSISEGSEDAVYFV 535 (825) T ss_pred hccC-----CceEEEEEcCCCceEEEEEecCCeE------EEEEEeccccceeeEEEecCCcEEEEEEecCCCccEEEEE Confidence 3221 1112233444445666776554321 2455665555 6887665543322222222111112211 Q ss_pred cCcccCccceec--cccccccCccceEEEEeecCceeeecccceeecCccEEEEeecccccCCCcc-eEEEeeeeeccCC Q lcl|NC_010325. 366 DPNPWDTDTSVW--GEGSYNPAKSSMIFSSFQDKKLFLFGNNSTFSGQNFVSTLERSDIYLGDDRM-MKTVSAIIPHITG 442 (513) Q Consensus 366 ~~~~~d~d~~~~--~~ds~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~l~a~~~~~~~~~~~~~~-~~~i~~~~~~~t~ 442 (513) ..-..+...... -.++..+ ....+..++ +-..+++|.+....+..-+ +.- ....+...+..-. T Consensus 536 V~R~~~g~~~~yiE~~~~~~~-------~~~~~~~~v--D~g~~~~g~~~~~~l~~l~-----g~tv~~~~~g~~~~~v~ 601 (825) T protein:vir:73 536 VNRTINGQTVRYIERLSSRLF-------TNDEDAFFV--DCGLSYDGRNTSSRTMTIS-----GGTGDWSYQVDYPVTVS 601 (825) T ss_pred EEEeeCCceEEEEEEeccccc-------CCCcceeEE--EEEeeecccceeeceeeeC-----CceEEEEeCCeEEEEEc Confidence 110000000000 0000000 000111111 1112233433322222100 000 0000000000000 Q ss_pred CeeEEE------Eeeeeec----CCCCceEcCceeeecCCceEEEeecCCCeEEEEEEcc-----------------CCC Q lcl|NC_010325. 443 NGTCNI------WVGNAQV----QGSGIRWKGPYPYRIGQDYKIDTKHVGRYIALKFDFS-----------------SEG 495 (513) Q Consensus 443 ~~~~~~------~~g~~~~----~~~~~~w~~~~~~~~~~~~~~~~R~~~Ry~~~rl~~~-----------------~g~ 495 (513) .+.+++ .+|-.-. ...++.++ +.+..+..-.+=..++++.. ++. T Consensus 602 ~g~itl~~~~~~~i~l~~~~~~~~~~~~~~~---------~~~~~i~~~~~~~~v~v~~~~~~~a~~~~~~~t~~~~a~~ 672 (825) T protein:vir:73 602 GGAYFVNTDVGAQIQFPYTGTDPDTNEPVAK---------ELRGDIISVTSNTAVVVRFNRNVPPVLRNVATTNWQMARQ 672 (825) T ss_pred CCeEEecccceEEEEecccCcccccccceec---------eeeEEEccccCceEEEEEecccccceeeeecccCCCcchh Confidence 111111 1110000 00011110 11111111111112222110 011 Q ss_pred c----EEEEEEeeEEeccccCC Q lcl|NC_010325. 496 D----WYFNGYTIEMAPKAGMR 513 (513) Q Consensus 496 ~----w~~~G~~~~~~~~g~rr 513 (513) . +++-|=+|.....|.=+ T Consensus 673 ~~~gL~hLeG~~v~v~~Dg~~~ 694 (825) T protein:vir:73 673 TFSGLAHLEGQTVNILSDASVE 694 (825) T ss_pred eeccccccCCceEEEEECCeee Confidence 1 12334444444444333 No 36 >protein:vir:7021 Length: 803 # NCBI annotation: tail protein # Family: family:all:825 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853594;genbank:gi:31711676;genbank:GeneID:1481802 Probab=98.00 E-value=7.4e-06 Score=48.71 Aligned_cols=473 Identities=9% Similarity=0.038 Sum_probs=183.3 Q ss_pred Ccccchhh-----------------------------cCcccccc----ccCcccCCCCcEEEe---EEEEEeCCe---- Q lcl|NC_010325. 1 MALERQEV-----------------------------KNPTGIVT----DIAPADLPLEKWSFG---NNVRFKNGK---- 40 (513) Q Consensus 1 m~~~~~~~-----------------------------~~~~G~~~----~~~P~~lp~~a~~~~---~Nv~~~~g~---- 40 (513) +.|..... +...|-+. ...+..+...+ .++ ..+...++. T Consensus 165 itIng~~~a~~~t~~~~~~~~~~~~~~~~ia~~l~~~~~~~~s~a~~~~~~~g~~~~i~~-~~~~~~~~~~t~~g~~~~~ 243 (803) T protein:vir:70 165 IIIDGVVAAGYKTRDGAEAHHIEDIRTESIAYNLYQSLQSWDKIADYEIQLDGTSIYITR-RDGSTTFDITTEDGAKGKD 243 (803) T ss_pred EEeCCcceEEEEeCCCcccccccccchhhhhhhhhhheeccccccceEEEECCcEEEEEE-cCCCCeeEEEeecCcCCcE Confidence 11110000 00000000 00000000000 000 001111110 Q ss_pred eEECCCcceeeecCCCcceeeeeeeeCCceEEEEEcCc-----eEEEecC-----ceEEeccccceeeCCCCceeEEeeC Q lcl|NC_010325. 41 AQKTLGHTPIFDTAQAPILDMFPFIRNNIPYWLLCSEQ-----RLYLADG-----TTIIDVSPGPYSASITNRWSVGSFN 110 (513) Q Consensus 41 ~~~~~g~~~~~~~~~~~~~~~~~~~~~g~~~~~v~~~~-----kly~~~~-----~t~~dis~~~~~~~~~~~w~f~~~~ 110 (513) +....+...-...+|..+ .+|....+..+.+ .+.+|+. .+|.....-+. . ..+..++.. T Consensus 244 ~~~~~~~v~~~~~Lp~~~-------~~g~~v~v~~~g~~~~d~y~v~~~~~~~~~~~w~e~a~~g~--~--~~~~~~t~p 312 (803) T protein:vir:70 244 LVAIKYKVASTDLLPSRA-------PEGYKVQVWPTGSKPESRYWLQAEKQNGNIVSWKETLAADV--L--IGFDKSTMP 312 (803) T ss_pred EEEEEecccceeeccccC-------CCCceEEEEcCCCCCCceeeEEEEeccCCccceEeeeccce--e--eeeeccccc Confidence 000000000011111111 1111111111111 1333332 13444321100 0 011111111 Q ss_pred CEEEE---EeCC--Cc--------eEEEcCCCceecccCCCcccceeeEEEEEcCEEEEEECCcCcccCCceEEEeccCC Q lcl|NC_010325. 111 GVIFA---NDGV--NP--------PHHLPPSESTFRVLPNFPANTTFKRLKSFKNFLVGLNATSNSVEMPQMVWWSTSAD 177 (513) Q Consensus 111 ~~~ia---~ng~--d~--------~q~~~~~s~~f~~L~g~p~~~ka~~v~~~~~~l~~~g~t~~~~~~p~rv~wS~~~d 177 (513) -.++- .|+. .. ..+.|-.+..+..+.+++..=.-..|..|+|||++++ |++|+.|..+| T Consensus 313 ~~~v~~~~~~~~~~~~~~~~~~~~r~~gdd~tnp~psf~~~~~~~~~~~v~f~q~RL~f~~--------~~~v~~Srtgd 384 (803) T protein:vir:70 313 YIIERTGFVNGIAQFKIRQGDWEDRKVGDDLTNPMPSFIDEEVPQTLGGMFMVQNRLCVTA--------GEAVIATRTSY 384 (803) T ss_pred EEEEEEEEeecceeEEEEeeccccccccccccCccccccCccCCCCceeEEEEeceEEEee--------CCeEEEEccCC Confidence 11110 1111 01 1111211122223333321101223899999999864 68899999999 Q ss_pred cccccccccccc--cccCcceeccc---CCCCceeEEEecCcceEEEecCcEEEEEecC--CCceeEeEEecCccccccC Q lcl|NC_010325. 178 AGGVPASWDPTD--PTKDAGQNTLA---DTNGAIVDGVKLRDSFIIYKEDSVYSMRYIG--GLFIFQFQQLFNDVGILGP 250 (513) Q Consensus 178 ~~~~P~~Wd~t~--~t~~a~~~dl~---d~~G~iv~g~~l~~~~vIf~en~i~~m~y~g--~~~~f~~~~i~~~~G~~~~ 250 (513) .+ ++.... ...+++=.++. +....|..+++....|+||...+-|.++-.+ +|...++.+.+. .+|-+. T Consensus 385 ~~----nF~~~t~~~~~DdD~I~~~~ss~~~~~i~~~v~~~~~L~i~T~~~q~~l~g~~~lTP~~~~i~~~s~-~~~~~~ 459 (803) T protein:vir:70 385 FF----DFFRYTAVSAVATDPFDVFSDASEVYQLKHAVTLDGSTVLFADKSQFILPGDKPLEKSNVLLKPVTT-FEVNNN 459 (803) T ss_pred cc----ccccccccCCCCCccEEEEecCCcceeeEEEeecCCcEEEEecCcEEEEeCCCcccceeEEEEEEEE-eeccCC Confidence 64 444332 22233333332 3334466688899999999999999996322 244466666663 356666 Q ss_pred ceeEEECCeEEEEeCCC----eEE--ECC--cccccCC-chhHHHHHHhhcCcchhCCEEEEEecCCCEEEEEEccCCCC Q lcl|NC_010325. 251 NCAVEFDGNHFVVGHGD----VYV--HNG--VQKQSVI-DAQVRKFFFSDINPDNYQRTFVLADHVNTEMWVCYSSTRSK 321 (513) Q Consensus 251 ~siv~~~~~~ffls~~G----~y~--~~G--~~~~~Ig-~~~V~~~~~~~i~~~~~~~~~~~~d~~~~~v~~~~~s~~~~ 321 (513) -.-+.+|+.++|+++.| ++. ++- ..+.+.. +..+.+ ++. .....+ +..-..+..+||+-... T Consensus 460 ~~Pv~vg~~v~fv~~~g~~s~vre~~~~~~~d~y~a~Dlt~~a~h-l~~----~~v~~~-~~~~~~~~~v~~~~~~~--- 530 (803) T protein:vir:70 460 VKPVATGESVMFATSEGAYSGIREFYTDSYSDTKKAQAITSHVNK-LLE----GNVIMM-SASTNVNRLLVLTDKYR--- 530 (803) T ss_pred CccEEeCCeEEEeccCCCeeEEEEEeccccccceehhhhhhhhHh-hcC----CceEEE-EEeCCCCeEEEEEEcCC--- Confidence 77889999999999987 432 221 1122111 112222 221 111111 11222333444442211 Q ss_pred CCcccceEEEEecc-------cCeEEEEeccce----eeeeec----------ccccccceeecc------cCcccCccc Q lcl|NC_010325. 322 PGKHCDRAIIWNWK-------ENTWSIRDLPNV----LSGAYG----------IIDPKVSNLWDD------DPNPWDTDT 374 (513) Q Consensus 322 ~~~~~d~~lvyd~~-------~~~Ws~~d~~~~----~~~~~g----------~~~~~~~~~~~~------~~~~~d~d~ 374 (513) +++++|-|. -..|+.-+.+.. |+.+.+ ..+.-.+..... .++..-+|- T Consensus 531 -----~~l~~~~yl~~~~e~~v~aW~r~~~~g~~~~~~~~~~~d~l~~vv~r~~~g~~ier~~~~~~~~~~~~~~~~lD~ 605 (803) T protein:vir:70 531 -----NIIYCYDWLWQGTERVQAAWHKWEWPLGTFIRGMFYSGEHLYLLIERGSTGVYLERMDMGDALVYNLNDRIRMDR 605 (803) T ss_pred -----CeEEEEEEEecCCcEEEEeEEEEEcCCCEEEEEEEecCCEEEEEEEECCCeEEEEEEecccccccCCcceeEecc Confidence 345565542 125776555431 111101 000000000000 000000110 Q ss_pred ------------eeccccccccC-------ccceE--EEEeecCceee-------------------ecccceeecCccE Q lcl|NC_010325. 375 ------------SVWGEGSYNPA-------KSSMI--FSSFQDKKLFL-------------------FGNNSTFSGQNFV 414 (513) Q Consensus 375 ------------~~~~~ds~~~~-------~~~~~--~~~~~~~~~~~-------------------~~~~~~~~g~~l~ 414 (513) ..|..-..... ...++ ...+....+.. .......-|-+++ T Consensus 606 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~g~~t~~~~~~~~~~~~~a~~v~VGl~Y~ 685 (803) T protein:vir:70 606 QAELIFRHIKAEDVWVSEPLPWQPTDVTLLDCVLIDGWDSYIGGSFLFSYNPGDNTLTTTFDMHDDDHVKAKVVVGQLYP 685 (803) T ss_pred ceeEeeccccCCceeeeecccccCcccceeeEEEeeeeeeecCCeEEEEEcCCCccceeeeeEECCCCcccEEEEeeeee Confidence 01110000000 00000 00111111110 1112355677888 Q ss_pred EEEeecccccC---C---CcceEEEeeeeeccCCCeeEEEEeeeeecCCCCceEcC------------ceeeecCCceEE Q lcl|NC_010325. 415 STLERSDIYLG---D---DRMMKTVSAIIPHITGNGTCNIWVGNAQVQGSGIRWKG------------PYPYRIGQDYKI 476 (513) Q Consensus 415 a~~~~~~~~~~---~---~~~~~~i~~~~~~~t~~~~~~~~~g~~~~~~~~~~w~~------------~~~~~~~~~~~~ 476 (513) +.++...+.+. + ..-.+++.++.......+.+.+.++...-...+....+ ..+.-+| ...+ T Consensus 686 ~~~~~~~~~i~~~~~~~~~~~~~rl~r~~~~~~~sg~~~v~v~~~~~~~~~~~~~s~~~~g~~~~~~g~~~~~tg-~~~v 764 (803) T protein:vir:70 686 QEFEPTQVVIRDNQERVSYIDVPTVGLVHLNLDKYPDFKVEVKNLKSGKVRNVLASNRVGGAINNIVGYVEPREG-VFKF 764 (803) T ss_pred EEEeecceEEEcCCCccccccccEEEEEEEEeecccceEEEEecCCccccceeeccchhccccccccCccccccc-eEEE Confidence 87776555432 1 11123455444444444444444432111111111111 1112223 3567 Q ss_pred EeecCCCeEEEEEEccCCCcEEEEEEeeEE-eccccCC Q lcl|NC_010325. 477 DTKHVGRYIALKFDFSSEGDWYFNGYTIEM-APKAGMR 513 (513) Q Consensus 477 ~~R~~~Ry~~~rl~~~~g~~w~~~G~~~~~-~~~g~rr 513 (513) +++....-..++|+.+.-.+.++.++++|+ .-.=.|| T Consensus 765 P~~~~~~~~~v~i~~d~P~P~tvlsi~weg~y~~r~rr 802 (803) T protein:vir:70 765 PLRSLSTDTVYRVMVESPHTFQLRDIEWEGSYNPTKRR 802 (803) T ss_pred EeeccCcceEEEEEECCCCCeEEEEEEEEEEEeccccc Confidence 888888888899999999999999999998 3233566 No 37 >protein:vir:5120 Length: 615 # NCBI annotation: unknown # Family: family:all:1544 # MgeID: mge:114 # MgeName: PBC5 # Cross-refs: genbank:acc:NP_542277;genbank:gi:18071220;genbank:GeneID:929342 Probab=97.82 E-value=1.7e-05 Score=46.76 Aligned_cols=471 Identities=12% Similarity=0.140 Sum_probs=194.0 Q ss_pred CcccchhhcCccccccccCcccCCCCcEEEeEEEEEeCCeeEECCCcceeeecC--CC-cce-eeeeeee-CCceEE--- Q lcl|NC_010325. 1 MALERQEVKNPTGIVTDIAPADLPLEKWSFGNNVRFKNGKAQKTLGHTPIFDTA--QA-PIL-DMFPFIR-NNIPYW--- 72 (513) Q Consensus 1 m~~~~~~~~~~~G~~~~~~P~~lp~~a~~~~~Nv~~~~g~~~~~~g~~~~~~~~--~~-~~~-~~~~~~~-~g~~~~--- 72 (513) +.|..+++-.-.|-++-+.|..||.+.-+-+.|.+|++|.+.|.+.-.-+.... +. .+. .-..|.. .+.+-+ T Consensus 28 ~~M~~I~i~~f~Ge~Prl~P~lLP~~~A~~A~N~~~~~G~ltP~~~~~~~~~~~~~~~~Tif~~~~~W~~w~~~V~av~s 107 (615) T protein:vir:51 28 LGMVAIKISAFAGEQPMLLPRLLPETGATAAMNVRLNDGGLTPINKPIEVATIATASQKTIYRHQGSWLSWPNVVNAVPG 107 (615) T ss_pred eeeEEEeecccccccccchhhhccCcccceEEeeeecCCeeeeecCcccccccccccceeeeeecCceeccCCceeEccC Confidence 788888898999999999999999999999999999999998876644332221 11 111 1111211 111111 Q ss_pred EEEcCceEEEecCce-EEeccccceeeC---CCCceeEEe------------eCCEEEEEeCC-----CceEE--EcCCC Q lcl|NC_010325. 73 LLCSEQRLYLADGTT-IIDVSPGPYSAS---ITNRWSVGS------------FNGVIFANDGV-----NPPHH--LPPSE 129 (513) Q Consensus 73 ~v~~~~kly~~~~~t-~~dis~~~~~~~---~~~~w~f~~------------~~~~~ia~ng~-----d~~q~--~~~~s 129 (513) -|+.+ ++|-.+.+. ....+.+.|... ++...+... |--..+-..|- ..-+. +.+++ T Consensus 108 PvA~D-Rvy~tgdg~Pkv~~~~~sY~LgVpaPs~ap~~~~~g~g~~d~etr~Yv~TfVt~~GeES~PSp~S~~v~v~~g~ 186 (615) T protein:vir:51 108 PVAQD-RLYFTGDGAPKVKIGGVDYALKVPRPTGALTAALSGTGSGDIQSRTYVYTWVTSFGEESAPCPASIIVDWKPGQ 186 (615) T ss_pred Ccccc-eeEEcCCCcceEeecccCccccccCCCccceEEecCCCCccccceEEEEEEEcCCCCcCCCCccceeeEecCCC Confidence 11112 444443331 111111111100 001111111 11111111111 11111 11221 Q ss_pred ceecccCCCcccc---eeeEEEEEc--------CEEEEEECC-------------------------------------c Q lcl|NC_010325. 130 STFRVLPNFPANT---TFKRLKSFK--------NFLVGLNAT-------------------------------------S 161 (513) Q Consensus 130 ~~f~~L~g~p~~~---ka~~v~~~~--------~~l~~~g~t-------------------------------------~ 161 (513) + -+|.+.|+.. ....+++|. +|+++..+. - T Consensus 187 -t-VtLs~~pa~~~~~~i~~rRIYRS~tg~~gtdy~lVAel~as~~sf~D~~~~~~Lg~~Lps~~w~~PP~~l~GL~~m~ 264 (615) T protein:vir:51 187 -T-VTLSGFAATPGGRSITTQRIYRSQTGKTGTGLYLIAERAASAGNFTDNIAVDQFQEPLPSADWNEPPDGLAGLAEMP 264 (615) T ss_pred -e-EEEeeccCCcCCCceeeEEEEEeccCCCceeeEEEeeecccceeeeeccchhhcCcccccccccCcCcchhhhhccc Confidence 1 1233333311 111234443 477666421 0 Q ss_pred Ccc---cCCceEEEeccCCcccccccccccccccCcceecccCCCCceeEEEecCcceEEEecCcEEEEEecCCCceeEe Q lcl|NC_010325. 162 NSV---EMPQMVWWSTSADAGGVPASWDPTDPTKDAGQNTLADTNGAIVDGVKLRDSFIIYKEDSVYSMRYIGGLFIFQF 238 (513) Q Consensus 162 ~~~---~~p~rv~wS~~~d~~~~P~~Wd~t~~t~~a~~~dl~d~~G~iv~g~~l~~~~vIf~en~i~~m~y~g~~~~f~~ 238 (513) |+. =..|.|+||-..-|..+|+.. +.-.| -+||++++.++.++|-..-.-|..+ +-+|.--+. T Consensus 265 NGimAgF~GneV~FsEpy~PyAWP~~Y-----------r~t~d--~dIVaiA~~gt~LVV~TkG~PYl~s-G~sP~sms~ 330 (615) T protein:vir:51 265 NGMMAAFVGRSIYFCEPYRPHAWPEKY-----------SRNVG--SDIVGIAALGSILVVVTKGKPYLLA-GTHPDSMQQ 330 (615) T ss_pred cceEEeecCCEEEEecCCCCcccchhc-----------ccCcC--CCeeEEEecccEEEEEEcCceEEEE-cCChhhccc Confidence 000 024567777776665555543 22223 4599999999999999999999984 445777799 Q ss_pred EEecCccccccCceeEEECCeEEEEeCCCeEEECCcccccCCchh---HHHHHHhhcCcchhCCEEEEEecCCCEEEEEE Q lcl|NC_010325. 239 QQLFNDVGILGPNCAVEFDGNHFVVGHGDVYVHNGVQKQSVIDAQ---VRKFFFSDINPDNYQRTFVLADHVNTEMWVCY 315 (513) Q Consensus 239 ~~i~~~~G~~~~~siv~~~~~~ffls~~G~y~~~G~~~~~Ig~~~---V~~~~~~~i~~~~~~~~~~~~d~~~~~v~~~~ 315 (513) +|+..+--|++++|||.+++-+.|-|++|.-++++...-.+-++. -+.|. .++ ++.|.+.. .+.+|+-.| T Consensus 331 ~kL~~~qpCvS~rsiV~~~~~v~Yas~dGLV~v~~~G~a~vvT~~l~t~~qW~--~l~---P~ti~a~~--~eG~Y~~~Y 403 (615) T protein:vir:51 331 QQLEENLPCINARSIVDLGHAVCYASNDGLVAVRGDGSIRLVTEQLLSREKWL--DLS---PFTIIGGQ--INGAYLLFY 403 (615) T ss_pred cccccccccccccceeEecceEEeecCCceEEEecCCchhhhhhhccChhHHH--hcC---CceEEEEe--ecCeEEEEe Confidence 999999999999999999999999999999988765532222221 22332 244 45565554 345666556 Q ss_pred ccCCCCCCcccceEEEEecccCeEEEEeccceeeeeecccccccceeecccCcccCccceeccccccccCccceEEEEee Q lcl|NC_010325. 316 SSTRSKPGKHCDRAIIWNWKENTWSIRDLPNVLSGAYGIIDPKVSNLWDDDPNPWDTDTSVWGEGSYNPAKSSMIFSSFQ 395 (513) Q Consensus 316 ~s~~~~~~~~~d~~lvyd~~~~~Ws~~d~~~~~~~~~g~~~~~~~~~~~~~~~~~d~d~~~~~~ds~~~~~~~~~~~~~~ 395 (513) ...++ .+.+.--.+..++.-+.|-++..-..-...+. .+...++--.... .+-.-|.. ++|.++ ...= T Consensus 404 ~~~~~-~g~~~~g~~~~~~~~~~f~ir~~~~~~~~~~d---~~~~~Ly~l~~g~--~~i~~~~a----~~g~~~--~~~W 471 (615) T protein:vir:51 404 DNLSA-SGERIAGSISIYVDGQPFLVRSSEIASSSFFD---IGDTALYFMAPGS--KTIQRFDA----PQGAPQ--TLYW 471 (615) T ss_pred ccCCC-CcceeeeeeEEecCCceeEEEeecccceeeeE---ecCceEEEEEcCC--ceEEEEec----CCCCcc--eEEe Confidence 43222 22110012333334445433321110001111 0000000000000 00000110 011111 1111 Q ss_pred cCceeeecccceeecCccEEEEeecccccCCCcceEEEeeeeeccCCCeeEEEEeee------------eecCCCCceEc Q lcl|NC_010325. 396 DKKLFLFGNNSTFSGQNFVSTLERSDIYLGDDRMMKTVSAIIPHITGNGTCNIWVGN------------AQVQGSGIRWK 463 (513) Q Consensus 396 ~~~~~~~~~~~~~~g~~l~a~~~~~~~~~~~~~~~~~i~~~~~~~t~~~~~~~~~g~------------~~~~~~~~~w~ 463 (513) .++.|.+. ...++.|.+.-++...........=...++.+ .++.-.+.-|. ..+-++..+-- T Consensus 472 rSK~F~~~-----~p~sf~~~~V~~~~~~~~~e~~~~~~~~~~~~-aa~~ti~a~g~~~~~l~~~~l~~~~i~gd~~~~i 545 (615) T protein:vir:51 472 RSKEFITT-----SPSSMGAVLVDSGSAISLKALEALQEERNQII-AANAALFAAGDLQGGINARPLNDRSINGDDLQPV 545 (615) T ss_pred cCceEEcc-----CCCcceEEEEcCCcccchhhhhhhhhhhhhcc-ccceeEEeccccccccccccccccccCccccccc Confidence 22233221 11222222222221111100000000000000 00000000000 00000000000 Q ss_pred -CceeeecCCceEEEeecCCCeEEEE------EEccC---CCcEEE--EE------EeeEEeccccCC Q lcl|NC_010325. 464 -GPYPYRIGQDYKIDTKHVGRYIALK------FDFSS---EGDWYF--NG------YTIEMAPKAGMR 513 (513) Q Consensus 464 -~~~~~~~~~~~~~~~R~~~Ry~~~r------l~~~~---g~~w~~--~G------~~~~~~~~g~rr 513 (513) ++.+...-.+=.+.+.+.|+..+=. ++.|+ +..|.+ +| +.+-..-..-|+ T Consensus 546 p~~~t~~~~~~v~~~l~a~G~~~~t~~k~~~~~RLP~g~~ar~Wevevsg~~~V~~v~LA~S~~EL~~ 613 (615) T protein:vir:51 546 PPPPTAADAASLTVSIFADGKLIQTIDKVDRIARVRAGLKARKWEVAISTNMQIAQVIMAASVEELKQ 613 (615) T ss_pred ccccccccccceeEEEecCCceeeeeccCCceeEcccCcccceEEEEEEecccEEEEEEecChHHHHh Confidence 0000000001111222333222111 23343 456653 33 222223333344 No 38 >protein:vir:3306 Length: 567 # NCBI annotation: hypothetical protein # Family: family:all:1544 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049522;genbank:gi:9632528;genbank:GeneID:1262016 Probab=97.61 E-value=3.8e-05 Score=44.82 Aligned_cols=424 Identities=15% Similarity=0.199 Sum_probs=187.5 Q ss_pred CcccchhhcCccccccccCcccCCCCcEEEeEEEEEeCCeeEECCCcceeeecC--CCcceeeeeeeeCCceEEEEEcC- Q lcl|NC_010325. 1 MALERQEVKNPTGIVTDIAPADLPLEKWSFGNNVRFKNGKAQKTLGHTPIFDTA--QAPILDMFPFIRNNIPYWLLCSE- 77 (513) Q Consensus 1 m~~~~~~~~~~~G~~~~~~P~~lp~~a~~~~~Nv~~~~g~~~~~~g~~~~~~~~--~~~~~~~~~~~~~g~~~~~v~~~- 77 (513) +.|--+++-+-.|=++-+.|--||.+.-+-+.|.+|+.|.+.|.+.-.-+...+ +.+.+ +-|. +.+||.-++ T Consensus 25 ~~M~~i~i~~f~Ge~Prl~p~lLP~~~a~~A~n~~~~~G~itP~~~~~~~~~~~~~~~~Ti--f~y~---~~~W~~w~~~ 99 (567) T protein:vir:33 25 ISMPYIDITTMRGMMPRVVTSMLPEHSAVLAEDCHFRFGVITPERQISGVEKTFTIKPKTI--FHYR---DDFWFAWPDV 99 (567) T ss_pred ceeeEEeecccccccccchhhhccccccceEEeeeccCCeeeeeecccccccccccCceee--EEEc---CcEEEEeCCc Confidence 455556666778999999999999999999999999999987765532221111 11111 1111 112222111 Q ss_pred -------------ceEEEecCc----eEEeccc---cc-----eeeC---CCCceeEEe--eCC---------------- Q lcl|NC_010325. 78 -------------QRLYLADGT----TIIDVSP---GP-----YSAS---ITNRWSVGS--FNG---------------- 111 (513) Q Consensus 78 -------------~kly~~~~~----t~~dis~---~~-----~~~~---~~~~w~f~~--~~~---------------- 111 (513) .++|--+.+ |-.+|.. ++ |... ++.....+. -++ T Consensus 100 V~~ir~PvAqD~~~rvY~tgdg~Pk~t~~~iat~G~~~~P~~~y~LgVpaps~aP~~a~~~~~~~~~~~~~d~etr~Yv~ 179 (567) T protein:vir:33 100 VDVIRSPIAQDPHGRIYYTDGRFPKVTDATIATKGDGNHPTSSYRLGIPAPTTAPVCTVQQGGDVSDDNPNDDETRFYTE 179 (567) T ss_pred eeeccCccccCCcceEEEecCCcceeeeeeeeecCCCCCCcchhhcccCCccccceeeecCCCCCCCCCCcccceeEEEE Confidence 244443333 1222211 00 0000 000001000 000 Q ss_pred EEEEEeCCC--------ceEEEcCCCceecccCCCcc---cceeeEEEEEc--------CEEEEEECC-------c---- Q lcl|NC_010325. 112 VIFANDGVN--------PPHHLPPSESTFRVLPNFPA---NTTFKRLKSFK--------NFLVGLNAT-------S---- 161 (513) Q Consensus 112 ~~ia~ng~d--------~~q~~~~~s~~f~~L~g~p~---~~ka~~v~~~~--------~~l~~~g~t-------~---- 161 (513) ..+-..|-. ..-+..+++.+ +|.+.|+ +-....+++|. +|+++..+. + T Consensus 180 TfVt~~GeES~PS~~S~~~~v~~pg~~V--~ls~~p~~~~~~~i~~~RIYRS~tg~~gtdy~lVael~as~~sf~D~~~~ 257 (567) T protein:vir:33 180 TFVSDYGEEGPPGPASLEVTLRTPGTAV--QLTLAPVPLQNASIKRRRIYRSASGGGEADFLLVAELDASVLSYTDKIPA 257 (567) T ss_pred EEEcCCCCcCCCcccccceeeecCCceE--EEeeccCCccccccceEEEEEecCCCCceeeEEEEeeccceeeeeeccch Confidence 000001100 00011111111 1222211 11122344443 366665321 0 Q ss_pred --------------------------Ccc---cCCceEEEeccCCcccccccccccccccCcceecccCCCCceeEEEec Q lcl|NC_010325. 162 --------------------------NSV---EMPQMVWWSTSADAGGVPASWDPTDPTKDAGQNTLADTNGAIVDGVKL 212 (513) Q Consensus 162 --------------------------~~~---~~p~rv~wS~~~d~~~~P~~Wd~t~~t~~a~~~dl~d~~G~iv~g~~l 212 (513) |+. =..|.|+||-..-|..+|+.. +.-.| -.||++++. T Consensus 258 ~~lg~~Lps~~w~~PP~~m~GL~~m~NGimAgF~GneV~FsEpylPyAWP~~Y-----------r~t~~--~dIVaiA~~ 324 (567) T protein:vir:33 258 KNLGPSLATWDYLPPPENMTGLCLMANGIAAGFAGNEVMFSEAYLPYAWPEVN-----------RHTTA--EDIVAICPL 324 (567) T ss_pred hhcccccccccccCcCcccceeeecccceEEeecCCEEEEecCCCCcccchhh-----------ccCCC--CCeEEEeec Confidence 000 023567777766655544443 22222 459999999 Q ss_pred CcceEEEecCcEEEEEecCCCceeEeEEecCccccccCceeEEECCeEEEEeCCCeEEECCcccccCCchh---HHHHHH Q lcl|NC_010325. 213 RDSFIIYKEDSVYSMRYIGGLFIFQFQQLFNDVGILGPNCAVEFDGNHFVVGHGDVYVHNGVQKQSVIDAQ---VRKFFF 289 (513) Q Consensus 213 ~~~~vIf~en~i~~m~y~g~~~~f~~~~i~~~~G~~~~~siv~~~~~~ffls~~G~y~~~G~~~~~Ig~~~---V~~~~~ 289 (513) ++.++|-..-.-|..+ +-+|.--+.+|+..+--|++++|||.+++-+.|-|++|.-++++..--.|-++. -+.|. T Consensus 325 gt~LVV~TkG~PYl~s-G~sP~sms~~kL~~~qpCvS~rsiV~~~g~v~Yas~dGLv~i~a~G~a~vvT~~l~t~~qW~- 402 (567) T protein:vir:33 325 GTSLVVATKGEPYLFS-GVSPSTISGSKIPSMQACLSRRSMVAMEGFVLYAGTNGLVSVDANGNVALATEQIVSPEQWQ- 402 (567) T ss_pred ccEEEEEEcCceEEEE-cCChhhccccccccccccccccceeEeccEEEeecCCcEEEEecCCchhhhhhhccChHHHH- Confidence 9999999999999984 445777799999999999999999999999999999999999754311111111 12221 Q ss_pred hhcCcchhCCEEEEEecCCCEEEEEEccCCCCCCcccceEEEEecccCeEEEEeccceeeeeecccccccceeecccCcc Q lcl|NC_010325. 290 SDINPDNYQRTFVLADHVNTEMWVCYSSTRSKPGKHCDRAIIWNWKENTWSIRDLPNVLSGAYGIIDPKVSNLWDDDPNP 369 (513) Q Consensus 290 ~~i~~~~~~~~~~~~d~~~~~v~~~~~s~~~~~~~~~d~~lvyd~~~~~Ws~~d~~~~~~~~~g~~~~~~~~~~~~~~~~ 369 (513) ..+ +.+.+.+.. ...+|+-.|...++.. ..+++|...+..+..+.+..| .+ .+.....+ |.. T Consensus 403 a~~---~P~ti~A~~--~eG~Y~a~Y~~~~g~~-----~~fifdp~~~~~~~i~~~~~~--~~--~d~~~d~L---y~~- 464 (567) T protein:vir:33 403 SQF---NPASIVAYP--WRGEYIACYTKPDGKQ-----DVFVFSPVNMDIRYLSTPFDC--AW--VDLAKDMM---RVV- 464 (567) T ss_pred hcC---CcceEEEEe--ecCeEEEEEecCCCCc-----ceEEEcccccEEEEEecCcee--EE--EEeecCeE---EEe- Confidence 113 334555444 3456777766554443 478999887776554443221 11 11111100 000 Q ss_pred cCccceeccccccccCccceEEEEeecCceeeecccceeecCccEEEEeecccccCCCcceEEEeeeeeccCCCeeEEEE Q lcl|NC_010325. 370 WDTDTSVWGEGSYNPAKSSMIFSSFQDKKLFLFGNNSTFSGQNFVSTLERSDIYLGDDRMMKTVSAIIPHITGNGTCNIW 449 (513) Q Consensus 370 ~d~d~~~~~~ds~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~l~a~~~~~~~~~~~~~~~~~i~~~~~~~t~~~~~~~~ 449 (513) -+..-..|..+ ...+...=.++.|.+..... +.|.+.... -.....+++. T Consensus 465 ~~~~l~~~~~g-------~~~~~~~WrSK~f~~p~~~s-----f~~~rV~s~------------------~~~~v~i~~~ 514 (567) T protein:vir:33 465 TGDKMSVLAGG-------ALPSTIRWHSKIFSLPERTS-----FSCIRVKSP------------------APERVGITIM 514 (567) T ss_pred eCCEEeeecCC-------CCceeEEEecceEEecCccc-----eeEEEEecc------------------CCcceeEEEE Confidence 01111122211 11112223344554432222 222222111 0001112211 Q ss_pred eeeeecCCCCceEcCceeeecCCceEEEeecCCCeEEEEEEccCCCcEEE--EEEe-eEEec-cccCC Q lcl|NC_010325. 450 VGNAQVQGSGIRWKGPYPYRIGQDYKIDTKHVGRYIALKFDFSSEGDWYF--NGYT-IEMAP-KAGMR 513 (513) Q Consensus 450 ~g~~~~~~~~~~w~~~~~~~~~~~~~~~~R~~~Ry~~~rl~~~~g~~w~~--~G~~-~~~~~-~g~rr 513 (513) . -++.+.-. ..++ .. .| .+||---.+..|.+ +|.- |+-.. ++..- T Consensus 515 ~-----dg~~v~~~-----~~g~---~~----~~--~~rlp~~~ar~Weveisg~~~V~~v~LA~S~~ 563 (567) T protein:vir:33 515 A-----DDVPVIHF-----APGT---FK----GS--VVRLPAATGQNWQVMVSGFGQVERITLSTSMS 563 (567) T ss_pred E-----cCCceeec-----CCcc---cc----Cc--eeecCCcccceEEEEEEecccEEEEEEecchh Confidence 1 01111100 1110 00 00 12221112444442 2210 11000 00000 No 39 >protein:vir:2792 Length: 567 # NCBI annotation: hypothetical protein # Family: family:all:1544 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612909;genbank:gi:20065826;genbank:GeneID:935648 Probab=97.61 E-value=3.8e-05 Score=44.82 Aligned_cols=424 Identities=15% Similarity=0.199 Sum_probs=187.5 Q ss_pred CcccchhhcCccccccccCcccCCCCcEEEeEEEEEeCCeeEECCCcceeeecC--CCcceeeeeeeeCCceEEEEEcC- Q lcl|NC_010325. 1 MALERQEVKNPTGIVTDIAPADLPLEKWSFGNNVRFKNGKAQKTLGHTPIFDTA--QAPILDMFPFIRNNIPYWLLCSE- 77 (513) Q Consensus 1 m~~~~~~~~~~~G~~~~~~P~~lp~~a~~~~~Nv~~~~g~~~~~~g~~~~~~~~--~~~~~~~~~~~~~g~~~~~v~~~- 77 (513) +.|--+++-+-.|=++-+.|--||.+.-+-+.|.+|+.|.+.|.+.-.-+...+ +.+.+ +-|. +.+||.-++ T Consensus 25 ~~M~~i~i~~f~Ge~Prl~p~lLP~~~a~~A~n~~~~~G~itP~~~~~~~~~~~~~~~~Ti--f~y~---~~~W~~w~~~ 99 (567) T protein:vir:27 25 ISMPYIDITTMRGMMPRVVTSMLPEHSAVLAEDCHFRFGVITPERQISGVEKTFTIKPKTI--FHYR---DDFWFAWPDV 99 (567) T ss_pred ceeeEEeecccccccccchhhhccccccceEEeeeccCCeeeeeecccccccccccCceee--EEEc---CcEEEEeCCc Confidence 455556666778999999999999999999999999999987765532221111 11111 1111 112222111 Q ss_pred -------------ceEEEecCc----eEEeccc---cc-----eeeC---CCCceeEEe--eCC---------------- Q lcl|NC_010325. 78 -------------QRLYLADGT----TIIDVSP---GP-----YSAS---ITNRWSVGS--FNG---------------- 111 (513) Q Consensus 78 -------------~kly~~~~~----t~~dis~---~~-----~~~~---~~~~w~f~~--~~~---------------- 111 (513) .++|--+.+ |-.+|.. ++ |... ++.....+. -++ T Consensus 100 V~~ir~PvAqD~~~rvY~tgdg~Pk~t~~~iat~G~~~~P~~~y~LgVpaps~aP~~a~~~~~~~~~~~~~d~etr~Yv~ 179 (567) T protein:vir:27 100 VDVIRSPIAQDPHGRIYYTDGRFPKVTDATIATKGDGNHPTSSYRLGIPAPTTAPVCTVQQGGDVSDDNPNDDETRFYTE 179 (567) T ss_pred eeeccCccccCCcceEEEecCCcceeeeeeeeecCCCCCCcchhhcccCCccccceeeecCCCCCCCCCCcccceeEEEE Confidence 244443333 1222211 00 0000 000001000 000 Q ss_pred EEEEEeCCC--------ceEEEcCCCceecccCCCcc---cceeeEEEEEc--------CEEEEEECC-------c---- Q lcl|NC_010325. 112 VIFANDGVN--------PPHHLPPSESTFRVLPNFPA---NTTFKRLKSFK--------NFLVGLNAT-------S---- 161 (513) Q Consensus 112 ~~ia~ng~d--------~~q~~~~~s~~f~~L~g~p~---~~ka~~v~~~~--------~~l~~~g~t-------~---- 161 (513) ..+-..|-. ..-+..+++.+ +|.+.|+ +-....+++|. +|+++..+. + T Consensus 180 TfVt~~GeES~PS~~S~~~~v~~pg~~V--~ls~~p~~~~~~~i~~~RIYRS~tg~~gtdy~lVael~as~~sf~D~~~~ 257 (567) T protein:vir:27 180 TFVSDYGEEGPPGPASLEVTLRTPGTAV--QLTLAPVPLQNASIKRRRIYRSASGGGEADFLLVAELDASVLSYTDKIPA 257 (567) T ss_pred EEEcCCCCcCCCcccccceeeecCCceE--EEeeccCCccccccceEEEEEecCCCCceeeEEEEeeccceeeeeeccch Confidence 000001100 00011111111 1222211 11122344443 366665321 0 Q ss_pred --------------------------Ccc---cCCceEEEeccCCcccccccccccccccCcceecccCCCCceeEEEec Q lcl|NC_010325. 162 --------------------------NSV---EMPQMVWWSTSADAGGVPASWDPTDPTKDAGQNTLADTNGAIVDGVKL 212 (513) Q Consensus 162 --------------------------~~~---~~p~rv~wS~~~d~~~~P~~Wd~t~~t~~a~~~dl~d~~G~iv~g~~l 212 (513) |+. =..|.|+||-..-|..+|+.. +.-.| -.||++++. T Consensus 258 ~~lg~~Lps~~w~~PP~~m~GL~~m~NGimAgF~GneV~FsEpylPyAWP~~Y-----------r~t~~--~dIVaiA~~ 324 (567) T protein:vir:27 258 KNLGPSLATWDYLPPPENMTGLCLMANGIAAGFAGNEVMFSEAYLPYAWPEVN-----------RHTTA--EDIVAICPL 324 (567) T ss_pred hhcccccccccccCcCcccceeeecccceEEeecCCEEEEecCCCCcccchhh-----------ccCCC--CCeEEEeec Confidence 000 023567777766655544443 22222 459999999 Q ss_pred CcceEEEecCcEEEEEecCCCceeEeEEecCccccccCceeEEECCeEEEEeCCCeEEECCcccccCCchh---HHHHHH Q lcl|NC_010325. 213 RDSFIIYKEDSVYSMRYIGGLFIFQFQQLFNDVGILGPNCAVEFDGNHFVVGHGDVYVHNGVQKQSVIDAQ---VRKFFF 289 (513) Q Consensus 213 ~~~~vIf~en~i~~m~y~g~~~~f~~~~i~~~~G~~~~~siv~~~~~~ffls~~G~y~~~G~~~~~Ig~~~---V~~~~~ 289 (513) ++.++|-..-.-|..+ +-+|.--+.+|+..+--|++++|||.+++-+.|-|++|.-++++..--.|-++. -+.|. T Consensus 325 gt~LVV~TkG~PYl~s-G~sP~sms~~kL~~~qpCvS~rsiV~~~g~v~Yas~dGLv~i~a~G~a~vvT~~l~t~~qW~- 402 (567) T protein:vir:27 325 GTSLVVATKGEPYLFS-GVSPSTISGSKIPSMQACLSRRSMVAMEGFVLYAGTNGLVSVDANGNVALATEQIVSPEQWQ- 402 (567) T ss_pred ccEEEEEEcCceEEEE-cCChhhccccccccccccccccceeEeccEEEeecCCcEEEEecCCchhhhhhhccChHHHH- Confidence 9999999999999984 445777799999999999999999999999999999999999754311111111 12221 Q ss_pred hhcCcchhCCEEEEEecCCCEEEEEEccCCCCCCcccceEEEEecccCeEEEEeccceeeeeecccccccceeecccCcc Q lcl|NC_010325. 290 SDINPDNYQRTFVLADHVNTEMWVCYSSTRSKPGKHCDRAIIWNWKENTWSIRDLPNVLSGAYGIIDPKVSNLWDDDPNP 369 (513) Q Consensus 290 ~~i~~~~~~~~~~~~d~~~~~v~~~~~s~~~~~~~~~d~~lvyd~~~~~Ws~~d~~~~~~~~~g~~~~~~~~~~~~~~~~ 369 (513) ..+ +.+.+.+.. ...+|+-.|...++.. ..+++|...+..+..+.+..| .+ .+.....+ |.. T Consensus 403 a~~---~P~ti~A~~--~eG~Y~a~Y~~~~g~~-----~~fifdp~~~~~~~i~~~~~~--~~--~d~~~d~L---y~~- 464 (567) T protein:vir:27 403 SQF---NPASIVAYP--WRGEYIACYTKPDGKQ-----DVFVFSPVNMDIRYLSTPFDC--AW--VDLAKDMM---RVV- 464 (567) T ss_pred hcC---CcceEEEEe--ecCeEEEEEecCCCCc-----ceEEEcccccEEEEEecCcee--EE--EEeecCeE---EEe- Confidence 113 334555444 3456777766554443 478999887776554443221 11 11111100 000 Q ss_pred cCccceeccccccccCccceEEEEeecCceeeecccceeecCccEEEEeecccccCCCcceEEEeeeeeccCCCeeEEEE Q lcl|NC_010325. 370 WDTDTSVWGEGSYNPAKSSMIFSSFQDKKLFLFGNNSTFSGQNFVSTLERSDIYLGDDRMMKTVSAIIPHITGNGTCNIW 449 (513) Q Consensus 370 ~d~d~~~~~~ds~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~l~a~~~~~~~~~~~~~~~~~i~~~~~~~t~~~~~~~~ 449 (513) -+..-..|..+ ...+...=.++.|.+..... +.|.+.... -.....+++. T Consensus 465 ~~~~l~~~~~g-------~~~~~~~WrSK~f~~p~~~s-----f~~~rV~s~------------------~~~~v~i~~~ 514 (567) T protein:vir:27 465 TGDKMSVLAGG-------ALPSTIRWHSKIFSLPERTS-----FSCIRVKSP------------------APERVGITIM 514 (567) T ss_pred eCCEEeeecCC-------CCceeEEEecceEEecCccc-----eeEEEEecc------------------CCcceeEEEE Confidence 01111122211 11112223344554432222 222222111 0001112211 Q ss_pred eeeeecCCCCceEcCceeeecCCceEEEeecCCCeEEEEEEccCCCcEEE--EEEe-eEEec-cccCC Q lcl|NC_010325. 450 VGNAQVQGSGIRWKGPYPYRIGQDYKIDTKHVGRYIALKFDFSSEGDWYF--NGYT-IEMAP-KAGMR 513 (513) Q Consensus 450 ~g~~~~~~~~~~w~~~~~~~~~~~~~~~~R~~~Ry~~~rl~~~~g~~w~~--~G~~-~~~~~-~g~rr 513 (513) . -++.+.-. ..++ .. .| .+||---.+..|.+ +|.- |+-.. ++..- T Consensus 515 ~-----dg~~v~~~-----~~g~---~~----~~--~~rlp~~~ar~Weveisg~~~V~~v~LA~S~~ 563 (567) T protein:vir:27 515 A-----DDVPVIHF-----APGT---FK----GS--VVRLPAATGQNWQVMVSGFGQVERITLSTSMS 563 (567) T ss_pred E-----cCCceeec-----CCcc---cc----Cc--eeecCCcccceEEEEEEecccEEEEEEecchh Confidence 1 01111100 1110 00 00 12221112444442 2210 11000 00000 No 40 >protein:vir:10145 Length: 567 # NCBI annotation: hypothetical protein # Family: family:all:1544 # MgeID: mge:180 # MgeName: Stx2 converting bacteriophage II # Cross-refs: genbank:acc:NP_859275;genbank:gi:32171031;genbank:GeneID:2653447 Probab=97.61 E-value=3.8e-05 Score=44.82 Aligned_cols=424 Identities=15% Similarity=0.199 Sum_probs=187.5 Q ss_pred CcccchhhcCccccccccCcccCCCCcEEEeEEEEEeCCeeEECCCcceeeecC--CCcceeeeeeeeCCceEEEEEcC- Q lcl|NC_010325. 1 MALERQEVKNPTGIVTDIAPADLPLEKWSFGNNVRFKNGKAQKTLGHTPIFDTA--QAPILDMFPFIRNNIPYWLLCSE- 77 (513) Q Consensus 1 m~~~~~~~~~~~G~~~~~~P~~lp~~a~~~~~Nv~~~~g~~~~~~g~~~~~~~~--~~~~~~~~~~~~~g~~~~~v~~~- 77 (513) +.|--+++-+-.|=++-+.|--||.+.-+-+.|.+|+.|.+.|.+.-.-+...+ +.+.+ +-|. +.+||.-++ T Consensus 25 ~~M~~i~i~~f~Ge~Prl~p~lLP~~~a~~A~n~~~~~G~itP~~~~~~~~~~~~~~~~Ti--f~y~---~~~W~~w~~~ 99 (567) T protein:vir:10 25 ISMPYIDITTMRGMMPRVVTSMLPEHSAVLAEDCHFRFGVITPERQISGVEKTFTIKPKTI--FHYR---DDFWFAWPDV 99 (567) T ss_pred ceeeEEeecccccccccchhhhccccccceEEeeeccCCeeeeeecccccccccccCceee--EEEc---CcEEEEeCCc Confidence 455556666778999999999999999999999999999987765532221111 11111 1111 112222111 Q ss_pred -------------ceEEEecCc----eEEeccc---cc-----eeeC---CCCceeEEe--eCC---------------- Q lcl|NC_010325. 78 -------------QRLYLADGT----TIIDVSP---GP-----YSAS---ITNRWSVGS--FNG---------------- 111 (513) Q Consensus 78 -------------~kly~~~~~----t~~dis~---~~-----~~~~---~~~~w~f~~--~~~---------------- 111 (513) .++|--+.+ |-.+|.. ++ |... ++.....+. -++ T Consensus 100 V~~ir~PvAqD~~~rvY~tgdg~Pk~t~~~iat~G~~~~P~~~y~LgVpaps~aP~~a~~~~~~~~~~~~~d~etr~Yv~ 179 (567) T protein:vir:10 100 VDVIRSPIAQDPHGRIYYTDGRFPKVTDATIATKGDGNHPTSSYRLGIPAPTTAPVCTVQQGGDVSDDNPNDDETRFYTE 179 (567) T ss_pred eeeccCccccCCcceEEEecCCcceeeeeeeeecCCCCCCcchhhcccCCccccceeeecCCCCCCCCCCcccceeEEEE Confidence 244443333 1222211 00 0000 000001000 000 Q ss_pred EEEEEeCCC--------ceEEEcCCCceecccCCCcc---cceeeEEEEEc--------CEEEEEECC-------c---- Q lcl|NC_010325. 112 VIFANDGVN--------PPHHLPPSESTFRVLPNFPA---NTTFKRLKSFK--------NFLVGLNAT-------S---- 161 (513) Q Consensus 112 ~~ia~ng~d--------~~q~~~~~s~~f~~L~g~p~---~~ka~~v~~~~--------~~l~~~g~t-------~---- 161 (513) ..+-..|-. ..-+..+++.+ +|.+.|+ +-....+++|. +|+++..+. + T Consensus 180 TfVt~~GeES~PS~~S~~~~v~~pg~~V--~ls~~p~~~~~~~i~~~RIYRS~tg~~gtdy~lVael~as~~sf~D~~~~ 257 (567) T protein:vir:10 180 TFVSDYGEEGPPGPASLEVTLRTPGTAV--QLTLAPVPLQNASIKRRRIYRSASGGGEADFLLVAELDASVLSYTDKIPA 257 (567) T ss_pred EEEcCCCCcCCCcccccceeeecCCceE--EEeeccCCccccccceEEEEEecCCCCceeeEEEEeeccceeeeeeccch Confidence 000001100 00011111111 1222211 11122344443 366665321 0 Q ss_pred --------------------------Ccc---cCCceEEEeccCCcccccccccccccccCcceecccCCCCceeEEEec Q lcl|NC_010325. 162 --------------------------NSV---EMPQMVWWSTSADAGGVPASWDPTDPTKDAGQNTLADTNGAIVDGVKL 212 (513) Q Consensus 162 --------------------------~~~---~~p~rv~wS~~~d~~~~P~~Wd~t~~t~~a~~~dl~d~~G~iv~g~~l 212 (513) |+. =..|.|+||-..-|..+|+.. +.-.| -.||++++. T Consensus 258 ~~lg~~Lps~~w~~PP~~m~GL~~m~NGimAgF~GneV~FsEpylPyAWP~~Y-----------r~t~~--~dIVaiA~~ 324 (567) T protein:vir:10 258 KNLGPSLATWDYLPPPENMTGLCLMANGIAAGFAGNEVMFSEAYLPYAWPEVN-----------RHTTA--EDIVAICPL 324 (567) T ss_pred hhcccccccccccCcCcccceeeecccceEEeecCCEEEEecCCCCcccchhh-----------ccCCC--CCeEEEeec Confidence 000 023567777766655544443 22222 459999999 Q ss_pred CcceEEEecCcEEEEEecCCCceeEeEEecCccccccCceeEEECCeEEEEeCCCeEEECCcccccCCchh---HHHHHH Q lcl|NC_010325. 213 RDSFIIYKEDSVYSMRYIGGLFIFQFQQLFNDVGILGPNCAVEFDGNHFVVGHGDVYVHNGVQKQSVIDAQ---VRKFFF 289 (513) Q Consensus 213 ~~~~vIf~en~i~~m~y~g~~~~f~~~~i~~~~G~~~~~siv~~~~~~ffls~~G~y~~~G~~~~~Ig~~~---V~~~~~ 289 (513) ++.++|-..-.-|..+ +-+|.--+.+|+..+--|++++|||.+++-+.|-|++|.-++++..--.|-++. -+.|. T Consensus 325 gt~LVV~TkG~PYl~s-G~sP~sms~~kL~~~qpCvS~rsiV~~~g~v~Yas~dGLv~i~a~G~a~vvT~~l~t~~qW~- 402 (567) T protein:vir:10 325 GTSLVVATKGEPYLFS-GVSPSTISGSKIPSMQACLSRRSMVAMEGFVLYAGTNGLVSVDANGNVALATEQIVSPEQWQ- 402 (567) T ss_pred ccEEEEEEcCceEEEE-cCChhhccccccccccccccccceeEeccEEEeecCCcEEEEecCCchhhhhhhccChHHHH- Confidence 9999999999999984 445777799999999999999999999999999999999999754311111111 12221 Q ss_pred hhcCcchhCCEEEEEecCCCEEEEEEccCCCCCCcccceEEEEecccCeEEEEeccceeeeeecccccccceeecccCcc Q lcl|NC_010325. 290 SDINPDNYQRTFVLADHVNTEMWVCYSSTRSKPGKHCDRAIIWNWKENTWSIRDLPNVLSGAYGIIDPKVSNLWDDDPNP 369 (513) Q Consensus 290 ~~i~~~~~~~~~~~~d~~~~~v~~~~~s~~~~~~~~~d~~lvyd~~~~~Ws~~d~~~~~~~~~g~~~~~~~~~~~~~~~~ 369 (513) ..+ +.+.+.+.. ...+|+-.|...++.. ..+++|...+..+..+.+..| .+ .+.....+ |.. T Consensus 403 a~~---~P~ti~A~~--~eG~Y~a~Y~~~~g~~-----~~fifdp~~~~~~~i~~~~~~--~~--~d~~~d~L---y~~- 464 (567) T protein:vir:10 403 SQF---NPASIVAYP--WRGEYIACYTKPDGKQ-----DVFVFSPVNMDIRYLSTPFDC--AW--VDLAKDMM---RVV- 464 (567) T ss_pred hcC---CcceEEEEe--ecCeEEEEEecCCCCc-----ceEEEcccccEEEEEecCcee--EE--EEeecCeE---EEe- Confidence 113 334555444 3456777766554443 478999887776554443221 11 11111100 000 Q ss_pred cCccceeccccccccCccceEEEEeecCceeeecccceeecCccEEEEeecccccCCCcceEEEeeeeeccCCCeeEEEE Q lcl|NC_010325. 370 WDTDTSVWGEGSYNPAKSSMIFSSFQDKKLFLFGNNSTFSGQNFVSTLERSDIYLGDDRMMKTVSAIIPHITGNGTCNIW 449 (513) Q Consensus 370 ~d~d~~~~~~ds~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~l~a~~~~~~~~~~~~~~~~~i~~~~~~~t~~~~~~~~ 449 (513) -+..-..|..+ ...+...=.++.|.+..... +.|.+.... -.....+++. T Consensus 465 ~~~~l~~~~~g-------~~~~~~~WrSK~f~~p~~~s-----f~~~rV~s~------------------~~~~v~i~~~ 514 (567) T protein:vir:10 465 TGDKMSVLAGG-------ALPSTIRWHSKIFSLPERTS-----FSCIRVKSP------------------APERVGITIM 514 (567) T ss_pred eCCEEeeecCC-------CCceeEEEecceEEecCccc-----eeEEEEecc------------------CCcceeEEEE Confidence 01111122211 11112223344554432222 222222111 0001112211 Q ss_pred eeeeecCCCCceEcCceeeecCCceEEEeecCCCeEEEEEEccCCCcEEE--EEEe-eEEec-cccCC Q lcl|NC_010325. 450 VGNAQVQGSGIRWKGPYPYRIGQDYKIDTKHVGRYIALKFDFSSEGDWYF--NGYT-IEMAP-KAGMR 513 (513) Q Consensus 450 ~g~~~~~~~~~~w~~~~~~~~~~~~~~~~R~~~Ry~~~rl~~~~g~~w~~--~G~~-~~~~~-~g~rr 513 (513) . -++.+.-. ..++ .. .| .+||---.+..|.+ +|.- |+-.. ++..- T Consensus 515 ~-----dg~~v~~~-----~~g~---~~----~~--~~rlp~~~ar~Weveisg~~~V~~v~LA~S~~ 563 (567) T protein:vir:10 515 A-----DDVPVIHF-----APGT---FK----GS--VVRLPAATGQNWQVMVSGFGQVERITLSTSMS 563 (567) T ss_pred E-----cCCceeec-----CCcc---cc----Cc--eeecCCcccceEEEEEEecccEEEEEEecchh Confidence 1 01111100 1110 00 00 12221112444442 2210 11000 00000 No 41 >protein:vir:9979 Length: 567 # NCBI annotation: hypothetical protein # Family: family:all:1544 # MgeID: mge:179 # MgeName: Stx1 converting bacteriophage # Cross-refs: genbank:acc:NP_859109;genbank:gi:32170864;genbank:GeneID:2653256 Probab=97.61 E-value=3.8e-05 Score=44.82 Aligned_cols=424 Identities=15% Similarity=0.199 Sum_probs=187.5 Q ss_pred CcccchhhcCccccccccCcccCCCCcEEEeEEEEEeCCeeEECCCcceeeecC--CCcceeeeeeeeCCceEEEEEcC- Q lcl|NC_010325. 1 MALERQEVKNPTGIVTDIAPADLPLEKWSFGNNVRFKNGKAQKTLGHTPIFDTA--QAPILDMFPFIRNNIPYWLLCSE- 77 (513) Q Consensus 1 m~~~~~~~~~~~G~~~~~~P~~lp~~a~~~~~Nv~~~~g~~~~~~g~~~~~~~~--~~~~~~~~~~~~~g~~~~~v~~~- 77 (513) +.|--+++-+-.|=++-+.|--||.+.-+-+.|.+|+.|.+.|.+.-.-+...+ +.+.+ +-|. +.+||.-++ T Consensus 25 ~~M~~i~i~~f~Ge~Prl~p~lLP~~~a~~A~n~~~~~G~itP~~~~~~~~~~~~~~~~Ti--f~y~---~~~W~~w~~~ 99 (567) T protein:vir:99 25 ISMPYIDITTMRGMMPRVVTSMLPEHSAVLAEDCHFRFGVITPERQISGVEKTFTIKPKTI--FHYR---DDFWFAWPDV 99 (567) T ss_pred ceeeEEeecccccccccchhhhccccccceEEeeeccCCeeeeeecccccccccccCceee--EEEc---CcEEEEeCCc Confidence 455556666778999999999999999999999999999987765532221111 11111 1111 112222111 Q ss_pred -------------ceEEEecCc----eEEeccc---cc-----eeeC---CCCceeEEe--eCC---------------- Q lcl|NC_010325. 78 -------------QRLYLADGT----TIIDVSP---GP-----YSAS---ITNRWSVGS--FNG---------------- 111 (513) Q Consensus 78 -------------~kly~~~~~----t~~dis~---~~-----~~~~---~~~~w~f~~--~~~---------------- 111 (513) .++|--+.+ |-.+|.. ++ |... ++.....+. -++ T Consensus 100 V~~ir~PvAqD~~~rvY~tgdg~Pk~t~~~iat~G~~~~P~~~y~LgVpaps~aP~~a~~~~~~~~~~~~~d~etr~Yv~ 179 (567) T protein:vir:99 100 VDVIRSPIAQDPHGRIYYTDGRFPKVTDATIATKGDGNHPTSSYRLGIPAPTTAPVCTVQQGGDVSDDNPNDDETRFYTE 179 (567) T ss_pred eeeccCccccCCcceEEEecCCcceeeeeeeeecCCCCCCcchhhcccCCccccceeeecCCCCCCCCCCcccceeEEEE Confidence 244443333 1222211 00 0000 000001000 000 Q ss_pred EEEEEeCCC--------ceEEEcCCCceecccCCCcc---cceeeEEEEEc--------CEEEEEECC-------c---- Q lcl|NC_010325. 112 VIFANDGVN--------PPHHLPPSESTFRVLPNFPA---NTTFKRLKSFK--------NFLVGLNAT-------S---- 161 (513) Q Consensus 112 ~~ia~ng~d--------~~q~~~~~s~~f~~L~g~p~---~~ka~~v~~~~--------~~l~~~g~t-------~---- 161 (513) ..+-..|-. ..-+..+++.+ +|.+.|+ +-....+++|. +|+++..+. + T Consensus 180 TfVt~~GeES~PS~~S~~~~v~~pg~~V--~ls~~p~~~~~~~i~~~RIYRS~tg~~gtdy~lVael~as~~sf~D~~~~ 257 (567) T protein:vir:99 180 TFVSDYGEEGPPGPASLEVTLRTPGTAV--QLTLAPVPLQNASIKRRRIYRSASGGGEADFLLVAELDASVLSYTDKIPA 257 (567) T ss_pred EEEcCCCCcCCCcccccceeeecCCceE--EEeeccCCccccccceEEEEEecCCCCceeeEEEEeeccceeeeeeccch Confidence 000001100 00011111111 1222211 11122344443 366665321 0 Q ss_pred --------------------------Ccc---cCCceEEEeccCCcccccccccccccccCcceecccCCCCceeEEEec Q lcl|NC_010325. 162 --------------------------NSV---EMPQMVWWSTSADAGGVPASWDPTDPTKDAGQNTLADTNGAIVDGVKL 212 (513) Q Consensus 162 --------------------------~~~---~~p~rv~wS~~~d~~~~P~~Wd~t~~t~~a~~~dl~d~~G~iv~g~~l 212 (513) |+. =..|.|+||-..-|..+|+.. +.-.| -.||++++. T Consensus 258 ~~lg~~Lps~~w~~PP~~m~GL~~m~NGimAgF~GneV~FsEpylPyAWP~~Y-----------r~t~~--~dIVaiA~~ 324 (567) T protein:vir:99 258 KNLGPSLATWDYLPPPENMTGLCLMANGIAAGFAGNEVMFSEAYLPYAWPEVN-----------RHTTA--EDIVAICPL 324 (567) T ss_pred hhcccccccccccCcCcccceeeecccceEEeecCCEEEEecCCCCcccchhh-----------ccCCC--CCeEEEeec Confidence 000 023567777766655544443 22222 459999999 Q ss_pred CcceEEEecCcEEEEEecCCCceeEeEEecCccccccCceeEEECCeEEEEeCCCeEEECCcccccCCchh---HHHHHH Q lcl|NC_010325. 213 RDSFIIYKEDSVYSMRYIGGLFIFQFQQLFNDVGILGPNCAVEFDGNHFVVGHGDVYVHNGVQKQSVIDAQ---VRKFFF 289 (513) Q Consensus 213 ~~~~vIf~en~i~~m~y~g~~~~f~~~~i~~~~G~~~~~siv~~~~~~ffls~~G~y~~~G~~~~~Ig~~~---V~~~~~ 289 (513) ++.++|-..-.-|..+ +-+|.--+.+|+..+--|++++|||.+++-+.|-|++|.-++++..--.|-++. -+.|. T Consensus 325 gt~LVV~TkG~PYl~s-G~sP~sms~~kL~~~qpCvS~rsiV~~~g~v~Yas~dGLv~i~a~G~a~vvT~~l~t~~qW~- 402 (567) T protein:vir:99 325 GTSLVVATKGEPYLFS-GVSPSTISGSKIPSMQACLSRRSMVAMEGFVLYAGTNGLVSVDANGNVALATEQIVSPEQWQ- 402 (567) T ss_pred ccEEEEEEcCceEEEE-cCChhhccccccccccccccccceeEeccEEEeecCCcEEEEecCCchhhhhhhccChHHHH- Confidence 9999999999999984 445777799999999999999999999999999999999999754311111111 12221 Q ss_pred hhcCcchhCCEEEEEecCCCEEEEEEccCCCCCCcccceEEEEecccCeEEEEeccceeeeeecccccccceeecccCcc Q lcl|NC_010325. 290 SDINPDNYQRTFVLADHVNTEMWVCYSSTRSKPGKHCDRAIIWNWKENTWSIRDLPNVLSGAYGIIDPKVSNLWDDDPNP 369 (513) Q Consensus 290 ~~i~~~~~~~~~~~~d~~~~~v~~~~~s~~~~~~~~~d~~lvyd~~~~~Ws~~d~~~~~~~~~g~~~~~~~~~~~~~~~~ 369 (513) ..+ +.+.+.+.. ...+|+-.|...++.. ..+++|...+..+..+.+..| .+ .+.....+ |.. T Consensus 403 a~~---~P~ti~A~~--~eG~Y~a~Y~~~~g~~-----~~fifdp~~~~~~~i~~~~~~--~~--~d~~~d~L---y~~- 464 (567) T protein:vir:99 403 SQF---NPASIVAYP--WRGEYIACYTKPDGKQ-----DVFVFSPVNMDIRYLSTPFDC--AW--VDLAKDMM---RVV- 464 (567) T ss_pred hcC---CcceEEEEe--ecCeEEEEEecCCCCc-----ceEEEcccccEEEEEecCcee--EE--EEeecCeE---EEe- Confidence 113 334555444 3456777766554443 478999887776554443221 11 11111100 000 Q ss_pred cCccceeccccccccCccceEEEEeecCceeeecccceeecCccEEEEeecccccCCCcceEEEeeeeeccCCCeeEEEE Q lcl|NC_010325. 370 WDTDTSVWGEGSYNPAKSSMIFSSFQDKKLFLFGNNSTFSGQNFVSTLERSDIYLGDDRMMKTVSAIIPHITGNGTCNIW 449 (513) Q Consensus 370 ~d~d~~~~~~ds~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~l~a~~~~~~~~~~~~~~~~~i~~~~~~~t~~~~~~~~ 449 (513) -+..-..|..+ ...+...=.++.|.+..... +.|.+.... -.....+++. T Consensus 465 ~~~~l~~~~~g-------~~~~~~~WrSK~f~~p~~~s-----f~~~rV~s~------------------~~~~v~i~~~ 514 (567) T protein:vir:99 465 TGDKMSVLAGG-------ALPSTIRWHSKIFSLPERTS-----FSCIRVKSP------------------APERVGITIM 514 (567) T ss_pred eCCEEeeecCC-------CCceeEEEecceEEecCccc-----eeEEEEecc------------------CCcceeEEEE Confidence 01111122211 11112223344554432222 222222111 0001112211 Q ss_pred eeeeecCCCCceEcCceeeecCCceEEEeecCCCeEEEEEEccCCCcEEE--EEEe-eEEec-cccCC Q lcl|NC_010325. 450 VGNAQVQGSGIRWKGPYPYRIGQDYKIDTKHVGRYIALKFDFSSEGDWYF--NGYT-IEMAP-KAGMR 513 (513) Q Consensus 450 ~g~~~~~~~~~~w~~~~~~~~~~~~~~~~R~~~Ry~~~rl~~~~g~~w~~--~G~~-~~~~~-~g~rr 513 (513) . -++.+.-. ..++ .. .| .+||---.+..|.+ +|.- |+-.. ++..- T Consensus 515 ~-----dg~~v~~~-----~~g~---~~----~~--~~rlp~~~ar~Weveisg~~~V~~v~LA~S~~ 563 (567) T protein:vir:99 515 A-----DDVPVIHF-----APGT---FK----GS--VVRLPAATGQNWQVMVSGFGQVERITLSTSMS 563 (567) T ss_pred E-----cCCceeec-----CCcc---cc----Cc--eeecCCcccceEEEEEEecccEEEEEEecchh Confidence 1 01111100 1110 00 00 12221112444442 2210 11000 00000 No 42 >protein:vir:78703 Length: 905 # NCBI annotation: tail tube B # Family: family:all:825 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285450;genbank:gi:148724484;genbank:GeneID:5220174 Probab=97.61 E-value=3.8e-05 Score=44.79 Aligned_cols=466 Identities=11% Similarity=0.031 Sum_probs=184.1 Q ss_pred CcccchhhcCccccccccCcccCCCCcEEEeEEEEEeCCeeE----ECCCcceeeecCCCcceee-eeeee--C---Cce Q lcl|NC_010325. 1 MALERQEVKNPTGIVTDIAPADLPLEKWSFGNNVRFKNGKAQ----KTLGHTPIFDTAQAPILDM-FPFIR--N---NIP 70 (513) Q Consensus 1 m~~~~~~~~~~~G~~~~~~P~~lp~~a~~~~~Nv~~~~g~~~----~~~g~~~~~~~~~~~~~~~-~~~~~--~---g~~ 70 (513) .+-+.....+..=.|...+. |. .|+...+|..- ...+...-.+.+|+.+-.- ..... . .+. T Consensus 317 ~~~~~~~~~g~~i~v~~~~~---~~------~~~~~~~g~~~~~~~~~~~~v~~~~~Lp~~~~~g~~v~v~~~~~~~~d~ 387 (905) T protein:vir:78 317 ISNYSAQAVGNVIEIERTDG---RD------FNLGVRGGATNRAMTAIKGTANSIVDLPGQCFDGFELKVINTENAESDD 387 (905) T ss_pred cccEEEEecCcEEEEEecCC---Cc------cEEEEeccCCcceEEEEeccccccccCccccCCCcEEEEEeCCCCCcce Confidence 22222222221111111110 00 12233332211 1111111112233222111 11100 0 011 Q ss_pred E-E-EEEcCceEEEecCceEEeccccce-eeCC--CCceeEEe--eCCE-EEEEeCCCceEEEcC---CC---ceecccC Q lcl|NC_010325. 71 Y-W-LLCSEQRLYLADGTTIIDVSPGPY-SASI--TNRWSVGS--FNGV-IFANDGVNPPHHLPP---SE---STFRVLP 136 (513) Q Consensus 71 ~-~-~v~~~~kly~~~~~t~~dis~~~~-~~~~--~~~w~f~~--~~~~-~ia~ng~d~~q~~~~---~s---~~f~~L~ 136 (513) + + |....... ...++|.....-+. .+.. .-.|.-.. .+.. +.+.++.-..+.|.. ++ ......- T Consensus 388 yyv~~~~~~~~~--~~~~~W~E~~~~~~~~~~~~~tmp~~l~r~~~g~f~~~~~~~~~~~~~~~~r~~Gd~~Tnp~psf~ 465 (905) T protein:vir:78 388 YYVVFRSAAEGI--PGSGSWEETVAPGIERGFNTSTMPHALIRQADGNFTLEALNDEGTITGWAQREVGDDDTNPKPSFV 465 (905) T ss_pred EEEEEEecccCC--cCceeEEEecccccccccccccccEEEEEecCceEEEEEeccccccccccccccCCcccCCCCccc Confidence 1 1 11110000 01123543221000 0000 00111111 1111 122222222222211 00 0111111 Q ss_pred CCcccceeeEEEEEcCEEEEEECCcCcccCCceEEEeccCCcccccccccccccc--cCcceeccc---CCCCceeEEEe Q lcl|NC_010325. 137 NFPANTTFKRLKSFKNFLVGLNATSNSVEMPQMVWWSTSADAGGVPASWDPTDPT--KDAGQNTLA---DTNGAIVDGVK 211 (513) Q Consensus 137 g~p~~~ka~~v~~~~~~l~~~g~t~~~~~~p~rv~wS~~~d~~~~P~~Wd~t~~t--~~a~~~dl~---d~~G~iv~g~~ 211 (513) |.++ ..|..|++||++++ |+.|+.|..+|.+ ++...... .+++=.++. +....|..+++ T Consensus 466 g~~i----s~v~f~q~RL~f~s--------~~~v~~Srtgd~~----nF~~~t~~~~~DdDpI~~~~ss~~~~~i~~~v~ 529 (905) T protein:vir:78 466 GRGI----SDMFFYNNRLGFLS--------EDAVIMSQPGDYF----NFFVTSAITISDSDPIDVTASSTKPAILRAAIG 529 (905) T ss_pred CCCc----ceEEEEcceEEEec--------CCeEEEEccCCcc----ccccccccCCCCCccEEEEEcCCcceeeEEEee Confidence 2222 23999999998864 6789999999964 44333221 233333332 43444666888 Q ss_pred cCcceEEEecCcEEEEEecC---CCceeEeEEecCccccccCceeEEECCeEEEEeCCC----eE--EECCcc--cccCC Q lcl|NC_010325. 212 LRDSFIIYKEDSVYSMRYIG---GLFIFQFQQLFNDVGILGPNCAVEFDGNHFVVGHGD----VY--VHNGVQ--KQSVI 280 (513) Q Consensus 212 l~~~~vIf~en~i~~m~y~g---~~~~f~~~~i~~~~G~~~~~siv~~~~~~ffls~~G----~y--~~~G~~--~~~Ig 280 (513) ....|+||...+-|.++-.+ .|.-.++.+.+. .||-+.=.=+.+|+.++|+++.| ++ .++-.+ +.... T Consensus 530 ~~~~L~ifT~g~ef~lsg~~~~lTP~s~~i~~~S~-~~~~~~v~Pv~vG~~vlFv~~~g~~s~vre~~y~~~~d~y~a~D 608 (905) T protein:vir:78 530 APKGLILFAENSQFLLASQEVVFSTATIKLTEISD-YFYRSLAKPVSTGVSIAFVSEADTYSKIFEMSIDSVDNRPQVAD 608 (905) T ss_pred cCCcEEEEecCceEEEecCCccccceeEEEEeEEe-ecccCCCCcEEeCCeEEEeecCCCeeEEEEEEeeecccceehhH Confidence 89999999999999997433 244466666653 34433222378999999999987 53 333211 11110 Q ss_pred -chhHHHHHHhhcCcchhCCEEEEEecCCCEEEEEEccCCCCCCcccceEEEEecc-------cCeEEEEeccceee--- Q lcl|NC_010325. 281 -DAQVRKFFFSDINPDNYQRTFVLADHVNTEMWVCYSSTRSKPGKHCDRAIIWNWK-------ENTWSIRDLPNVLS--- 349 (513) Q Consensus 281 -~~~V~~~~~~~i~~~~~~~~~~~~d~~~~~v~~~~~s~~~~~~~~~d~~lvyd~~-------~~~Ws~~d~~~~~~--- 349 (513) +..+.+.|-. ....+ +.....-+.|+-.+. +++++|-|. -..|+.-+.+..+- T Consensus 609 lT~~a~hl~~g-----~v~~~---~~s~~~~~v~~~~~~--------~~l~~ytyl~~~~eq~v~AWsrw~~~G~~~~~a 672 (905) T protein:vir:78 609 ITRIVPEYVPT-----GLTWS---VSTPNNSMMLFGDNS--------NTAYIFKFFNQGNERQVAGWSKWILPGEQRMCG 672 (905) T ss_pred HHHHHHHhcCC-----ceEEE---EecCCCcEEEEEcCC--------CeEEEEEeecCCCceeEEeEEEEecCCCeEEEE Confidence 2223332211 11111 122222233332221 457776552 12688766543210 Q ss_pred eeecccccc-----cc-----eeec-ccCcccCccc---------eecccc--------------ccccCccce----EE Q lcl|NC_010325. 350 GAYGIIDPK-----VS-----NLWD-DDPNPWDTDT---------SVWGEG--------------SYNPAKSSM----IF 391 (513) Q Consensus 350 ~~~g~~~~~-----~~-----~~~~-~~~~~~d~d~---------~~~~~d--------------s~~~~~~~~----~~ 391 (513) ...+..... .+ .... ..+....++. ..+..+ ...+.+..+ .. T Consensus 673 ~i~d~~~~vV~r~~~G~~~~~~~~l~~~~~~~~~d~~~~~~~~~~d~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~ 752 (905) T protein:vir:78 673 FFADTGYFVLYDSTTGSYVLSAMELLDDPDSASIDTAFSSFLPRLDNYVVKSDLTVVDNGDGTLTVDLEAGQAMTGATPV 752 (905) T ss_pred EEcCCEEEEEEEccCCeEEEEEEeeccccCccccccceeeeeeccceeeecccceecccCcceEeeeccCccccccceeE Confidence 000100000 00 0000 0000000000 000000 000011100 11 Q ss_pred EEeecCcee-------------e-ecccceeecCccEEEEeecccccC-----CCcceEEEeeeeeccCCCeeEEEEeee Q lcl|NC_010325. 392 SSFQDKKLF-------------L-FGNNSTFSGQNFVSTLERSDIYLG-----DDRMMKTVSAIIPHITGNGTCNIWVGN 452 (513) Q Consensus 392 ~~~~~~~~~-------------~-~~~~~~~~g~~l~a~~~~~~~~~~-----~~~~~~~i~~~~~~~t~~~~~~~~~g~ 452 (513) ....++.+. . .......-|-++++.++...+... ......+|.++...+...+.+.+.+.. T Consensus 753 ~~~~dG~~~~~~~~~~~~~~~~t~~~a~~v~VGl~Y~s~v~~~p~~~~~~~~s~~~~~~rI~rv~lr~~~Sg~~~v~v~~ 832 (905) T protein:vir:78 753 IMFTDGPSEFAFSQPTITAGQFTVDTTDDFVVGFKYETKITLPGFFTSEENKADRVYAPIVEFLYLDLYYSGRYQIEVDR 832 (905) T ss_pred EEeeCCceeeeEEEEEeeceeeccccCCeEEEeeeeeEEEeecceEeccCCCcccccceEEEEEEEEeecceeEEEEEcC Confidence 111222110 0 011234678888888886555432 123444566655543333333322111 Q ss_pred ------------eecCCCCceEcCceeeecCCceEEEeecCCCeEEEEEEccCCCcEEEEEEeeEEe--ccccCC Q lcl|NC_010325. 453 ------------AQVQGSGIRWKGPYPYRIGQDYKIDTKHVGRYIALKFDFSSEGDWYFNGYTIEMA--PKAGMR 513 (513) Q Consensus 453 ------------~~~~~~~~~w~~~~~~~~~~~~~~~~R~~~Ry~~~rl~~~~g~~w~~~G~~~~~~--~~g~rr 513 (513) ........ .++....+| ...++++...+-..++|+.+.-.+.++.++++|+. +-.-|| T Consensus 833 ~~~~~~~~~~~~~~~~~~~~--~~~p~~~tg-~~~vP~~g~~~~~~v~I~sd~PlP~tvlsi~weg~Yn~r~~~~ 904 (905) T protein:vir:78 833 IGYDTINIDAGSIDANIYLA--DGAPLKEIA-TENVPLFTPGDQVTVTIKAPDPFPSAITGYSWQGHYNRRGIAF 904 (905) T ss_pred CCcceecccccceecCcccC--ccccccccc-EEEEEeeccCceeEEEEEECCCCcEEEEEEEEEEEeccceeec Confidence 11111111 111122223 46789999999999999999999999999998873 333333 No 43 >protein:vir:97014 Length: 800 # NCBI annotation: 33 # Family: family:all:825 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654134;genbank:gi:108862018;genbank:GeneID:5075963 Probab=97.54 E-value=4.8e-05 Score=44.28 Aligned_cols=473 Identities=10% Similarity=0.025 Sum_probs=186.2 Q ss_pred CcccchhhcCccccccccCcccC---CCCcEEE---------------eEEEEEeCCee----EECCCcceeeecCCCcc Q lcl|NC_010325. 1 MALERQEVKNPTGIVTDIAPADL---PLEKWSF---------------GNNVRFKNGKA----QKTLGHTPIFDTAQAPI 58 (513) Q Consensus 1 m~~~~~~~~~~~G~~~~~~P~~l---p~~a~~~---------------~~Nv~~~~g~~----~~~~g~~~~~~~~~~~~ 58 (513) =+-......+...+..++...-. ....|+- ..||...++.- ....+..+-.+.+|+.+ T Consensus 180 t~~~~~~~~~~~~ia~ql~~~~~~~~~~~~~t~~~~G~~~~i~~~~~~~~~v~t~~g~~~~~~~~~~~~v~~~~~lp~~~ 259 (800) T protein:vir:97 180 GSADHVEQIRTERITSELYSKLQQWSGVSDYEIQRDGTSIFIERRDGASFTITTTDGAKGKDLVAIKNKVSSTDLLPSRA 259 (800) T ss_pred CCcccceeccHHHHHHHHHHhhhccccccceEEEeCCcEEEEEEcCCceEEEEecCCcCceeeeEEeeeccchhhchhhC Confidence 00000000000111100000000 0000000 01222222211 11111111111122211 Q ss_pred eeeeeeeeCCceEEEEEcCc-----eEEEecCc-----eEEeccccceeeCCCCceeEEeeCCEEEEE-----eCCCceE Q lcl|NC_010325. 59 LDMFPFIRNNIPYWLLCSEQ-----RLYLADGT-----TIIDVSPGPYSASITNRWSVGSFNGVIFAN-----DGVNPPH 123 (513) Q Consensus 59 ~~~~~~~~~g~~~~~v~~~~-----kly~~~~~-----t~~dis~~~~~~~~~~~w~f~~~~~~~ia~-----ng~d~~q 123 (513) .+|...-+.++.. .+++|+.. +|..-..- .. ...|..+.+.-.++-. ++.-..+ T Consensus 260 -------~~g~~v~i~~~~~~~~~~y~~~~~~~~~~~~~w~e~~~~--~~--~~~~~~~tmp~~~~~~~~~~~~g~~~~~ 328 (800) T protein:vir:97 260 -------PAGYKVQVWPTGSKPESRYWLQAEPKEGNLVSWKETIAA--DV--LLGFDKGTMPYIIERTDIINGIAQFKIR 328 (800) T ss_pred -------CCCcEEEEEccCCCCCceEEEEEEecccCcceEEEeecc--cc--ccceecccceEEEEEeecccccceeEEE Confidence 1111111111111 13334322 35432110 00 0112111111111111 1111111 Q ss_pred --EEcC--CCc----eecccCCCcccceeeEEEEEcCEEEEEECCcCcccCCceEEEeccCCccccccccccccc--ccC Q lcl|NC_010325. 124 --HLPP--SES----TFRVLPNFPANTTFKRLKSFKNFLVGLNATSNSVEMPQMVWWSTSADAGGVPASWDPTDP--TKD 193 (513) Q Consensus 124 --~~~~--~s~----~f~~L~g~p~~~ka~~v~~~~~~l~~~g~t~~~~~~p~rv~wS~~~d~~~~P~~Wd~t~~--t~~ 193 (513) .|+. ..+ ....+.|++..=.-..|..|++||++++ |++|+.|..+|.+ ++..... ..+ T Consensus 329 ~~~w~~r~~gd~~tnp~p~f~~~~~~~~~~~v~f~q~RL~f~~--------~~~v~~Srtgd~~----nF~~~t~~~~~D 396 (800) T protein:vir:97 329 QGDWEDRKVGDDLTNPMPSFIDEEVPQTIGGMFMVQNRLCFTA--------GEAVIASRTSYFF----DFFRYTVISALA 396 (800) T ss_pred eccccccccCccccCccccccCCcCCCCceeEEEEeeeEEEec--------CCeEEEEecCCcc----ccccccccCCCC Confidence 1211 000 0111111110001123899999999864 6789999999964 4433322 223 Q ss_pred cceeccc---CCCCceeEEEecCcceEEEecCcEEEEEecC--CCceeEeEEecCccccccCceeEEECCeEEEEeCCC- Q lcl|NC_010325. 194 AGQNTLA---DTNGAIVDGVKLRDSFIIYKEDSVYSMRYIG--GLFIFQFQQLFNDVGILGPNCAVEFDGNHFVVGHGD- 267 (513) Q Consensus 194 a~~~dl~---d~~G~iv~g~~l~~~~vIf~en~i~~m~y~g--~~~~f~~~~i~~~~G~~~~~siv~~~~~~ffls~~G- 267 (513) ++=.++. +....|..+++.+..|+||...+-|.++-.+ .|.-.++.+.+. .+|-+.=.-+.+|+.++|+++.| T Consensus 397 dD~I~~~~ss~~v~~i~~~v~~~~~L~i~T~~~q~~ls~~~~lTP~~~~~~~~s~-~~~~~~~~Pv~vG~~v~fv~~~g~ 475 (800) T protein:vir:97 397 TDPFDIFSDASEVYQLKHAVTLDGATVLFSDKSQFILPGDKPLEKSNALLKPVTT-FEVNNKVKPVVTGESVMFATNDGS 475 (800) T ss_pred CccEEEEecCCcceeeeEEeecCCcEEEEecCcEEEEeCCCcccceeEEEEEEEe-eeccCCCCcEEeCCeEEEeeCCCC Confidence 4444443 3334577788899999999999999996322 244466666663 34555566789999999999987 Q ss_pred ---eEEE--CC--cccccCC-chhHHHHHHhhcCcchhCCEEEEEecCCCEEEEEEccCCCCCCcccceEEEEecc---- Q lcl|NC_010325. 268 ---VYVH--NG--VQKQSVI-DAQVRKFFFSDINPDNYQRTFVLADHVNTEMWVCYSSTRSKPGKHCDRAIIWNWK---- 335 (513) Q Consensus 268 ---~y~~--~G--~~~~~Ig-~~~V~~~~~~~i~~~~~~~~~~~~d~~~~~v~~~~~s~~~~~~~~~d~~lvyd~~---- 335 (513) ++.+ +- ..+.... +..+.+.| +.....+ +..-..+..++|+-... +++++|-|. T Consensus 476 ~s~vre~~~~~~~d~~~a~DlT~~~~hl~-----~~~v~~~-~~~~~~~~~v~~~~~~~--------~~l~~~~y~~~~~ 541 (800) T protein:vir:97 476 YSGVREFYTDSYSDTKKAQAITSHVNKLI-----EGNITNM-AASTNVNRLLVTTDKYR--------NIIYCYDWLWQGT 541 (800) T ss_pred eeEEEEEeeeecccceehhhHHHHHHHhc-----CCceEEE-EEeCCCCeEEEEEEcCC--------CEEEEEEEeecCC Confidence 4433 21 1122111 11222222 1111111 22222333455553221 356777763 Q ss_pred -c--CeEEEEeccc--e--eeeee----------------cccccccceeecccCcccCcc------------ceecccc Q lcl|NC_010325. 336 -E--NTWSIRDLPN--V--LSGAY----------------GIIDPKVSNLWDDDPNPWDTD------------TSVWGEG 380 (513) Q Consensus 336 -~--~~Ws~~d~~~--~--~~~~~----------------g~~~~~~~~~~~~~~~~~d~d------------~~~~~~d 380 (513) . +.|+.-+++. . +..+. +........ ....+...-+| ...|... T Consensus 542 e~~~~aW~~~~~~~~~~~~~~~~~~d~l~~vv~r~~~~~ler~~~~~~~-~~~~~~~~~lD~~~~~~~~~~~~~~~~v~~ 620 (800) T protein:vir:97 542 DRVQSAWHVWKWPIGTKVRGMFYSGELLYLLLERGDGVYLEKMDMGDAL-TYGLNDRIRMDRQAELVFKHFKAEDEWVSE 620 (800) T ss_pred ceEEEeEEEEecCCCeEEEEEEEcCCeEEEEEEcCCcEEEEEEecccCc-CcccccceeccccceeeeeeeecccceEec Confidence 1 3688665543 1 11111 111100000 00000000000 0001000 Q ss_pred -------------------ccccCccceEEEEeecC-----ceee----ecccceeecCccEEEEeecccccC---C--- Q lcl|NC_010325. 381 -------------------SYNPAKSSMIFSSFQDK-----KLFL----FGNNSTFSGQNFVSTLERSDIYLG---D--- 426 (513) Q Consensus 381 -------------------s~~~~~~~~~~~~~~~~-----~~~~----~~~~~~~~g~~l~a~~~~~~~~~~---~--- 426 (513) -....|+.+++...... .... .......-|.++++.++-..+.+- + T Consensus 621 ~~~~~~~~~~~~~~~~v~g~~~~~G~~v~~~~~~~~~~~~~~~~~~~~~~~~~~v~vGl~Y~~~~~~~p~~i~~~~g~~~ 700 (800) T protein:vir:97 621 PLPWVPTNPELLDCILIEGWDSYIGGSFLFKYNPSDNTLSTTFDMYDDSHVKAKVIVGQIYPQEFEPTPVVIRDNQDRVS 700 (800) T ss_pred cccccCCCcceeEEEEecccccccCceEEEEecCccCcccccceEEeCCCCCcEEEEeeeeeEEEEecceEEEecCCCce Confidence 00112222211110000 0000 011234567778888875444331 1 Q ss_pred CcceEEEeeeeeccCCCeeEEEEeeeeecCCC-----CceEcC-------ceeeecCCceEEEeecCCCeEEEEEEccCC Q lcl|NC_010325. 427 DRMMKTVSAIIPHITGNGTCNIWVGNAQVQGS-----GIRWKG-------PYPYRIGQDYKIDTKHVGRYIALKFDFSSE 494 (513) Q Consensus 427 ~~~~~~i~~~~~~~t~~~~~~~~~g~~~~~~~-----~~~w~~-------~~~~~~~~~~~~~~R~~~Ry~~~rl~~~~g 494 (513) ...+.++.++.......+.+.+.+........ ...... ..+..+| +..++++...+-..++|+.+.- T Consensus 701 ~~~r~~i~r~~~~~~~sg~~~~~v~~~~~~~~~~~~~~~~~~g~~~~~~g~~~~~tg-~~~vp~~g~~~~~~v~i~~d~P 779 (800) T protein:vir:97 701 YIDVPVVGLVHLNLDMYPDFSVEVKNVKSGKVRRVLASNRIGGALNNTVGYVEPREG-VFRFPLRAKSTDVVYRIIVESP 779 (800) T ss_pred eecceEEEEEEEeecccccEEEEEccccCCceeeeecCccccccccccCCccccccc-eEEEEeecccceeEEEEEECCC Confidence 11223455555444444444443322111000 000000 0112233 4778899899999999999999 Q ss_pred CcEEEEEEeeEE-eccccCC Q lcl|NC_010325. 495 GDWYFNGYTIEM-APKAGMR 513 (513) Q Consensus 495 ~~w~~~G~~~~~-~~~g~rr 513 (513) -+.++.++++|. .-.=.|| T Consensus 780 lP~tvlsi~~eg~y~~r~~r 799 (800) T protein:vir:97 780 HTFQLRDIEWEGSYNPTKRR 799 (800) T ss_pred CcEEEEEEEEEEEeeccccc Confidence 999999999998 3344555 No 44 >protein:vir:827 Length: 567 # NCBI annotation: hypothetical protein # Family: family:all:1544 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050560;genbank:gi:9633457;genbank:GeneID:1262210 Probab=97.52 E-value=5.2e-05 Score=44.08 Aligned_cols=429 Identities=14% Similarity=0.183 Sum_probs=187.2 Q ss_pred CcccchhhcCccccccccCcccCCCCcEEEeEEEEEeCCeeEECCCcceeeecC---CCcceeee-e-eee-CCceEE-- Q lcl|NC_010325. 1 MALERQEVKNPTGIVTDIAPADLPLEKWSFGNNVRFKNGKAQKTLGHTPIFDTA---QAPILDMF-P-FIR-NNIPYW-- 72 (513) Q Consensus 1 m~~~~~~~~~~~G~~~~~~P~~lp~~a~~~~~Nv~~~~g~~~~~~g~~~~~~~~---~~~~~~~~-~-~~~-~g~~~~-- 72 (513) +.|--+++-+-.|=++-+.|--||.+.-+-+.|.+|+.|.+.|.+.-.-+...+ +..++..- . |.+ .+.+-+ T Consensus 25 ~~M~~i~i~~f~Ge~Prl~p~lLP~~~a~~A~n~~~~~G~itP~~~~~~~~~~~~~~~~Tif~y~~~~W~~w~~~V~~ir 104 (567) T protein:vir:82 25 ISMPYIDITTMRGMMPRVVTSMLPEHSAVLAEDCHFRFGVITPERQISGVEKTFTIKPKTIFHYRDDFWFAWPDVVDVIR 104 (567) T ss_pred ceeeEEeecccccccccchhhhccccccceEEeeeecCCeeeeeecccccccccccCceeeeeecCcEeEEeCCceeecc Confidence 455556666778999999999999999999999999999987765533221111 11111100 0 111 011111 Q ss_pred -EEEcC--ceEEEecCc----eEEeccc---cc-----eeeC---CCCceeEEe--eCC----------------EEEEE Q lcl|NC_010325. 73 -LLCSE--QRLYLADGT----TIIDVSP---GP-----YSAS---ITNRWSVGS--FNG----------------VIFAN 116 (513) Q Consensus 73 -~v~~~--~kly~~~~~----t~~dis~---~~-----~~~~---~~~~w~f~~--~~~----------------~~ia~ 116 (513) -|+.+ .++|--+.+ |-.+|.. ++ |... ++.....+. -++ ..+-. T Consensus 105 ~PvAqD~~~rvY~tgdg~Pk~t~~~iat~G~~~~P~~~y~LgVpaps~aP~~a~~~~~~~~~~~p~d~etr~Yv~TfVt~ 184 (567) T protein:vir:82 105 SPIAQDPHGRIYYTDGRFPKVTDATIATKGDGNHPTSSYRLGIPAPTTAPVCTVQQGGDVSDDNPNDDETRFYTETFVSD 184 (567) T ss_pred CccccCCcccEEEecCCcceeeeeeeeecCCCCCCcchhhcccCCccccceeeecCCCCCCCCCCccccceEEEEEEEcC Confidence 01111 244443333 1222211 00 0000 000001000 000 00000 Q ss_pred eCCC--------ceEEEcCCCceecccCCCcc---cceeeEEEEEc--------CEEEEEECC-------c--------- Q lcl|NC_010325. 117 DGVN--------PPHHLPPSESTFRVLPNFPA---NTTFKRLKSFK--------NFLVGLNAT-------S--------- 161 (513) Q Consensus 117 ng~d--------~~q~~~~~s~~f~~L~g~p~---~~ka~~v~~~~--------~~l~~~g~t-------~--------- 161 (513) .|-. ..-+..+++.+ +|.+.|+ +-....+++|. +|+++..+. + T Consensus 185 ~GeES~PS~~S~~~~v~~pg~~V--~ls~~p~~~~~~~i~~~RIYRS~tg~~gtdy~lVael~as~~sf~D~~~~~~lg~ 262 (567) T protein:vir:82 185 YGEEGPPGPASLEVTLRTPGTAV--QLTLAPVPLQNASIKRRRIYRSASGGGEADFLLVAELDASVLSYTDKIPAKNLGP 262 (567) T ss_pred CCCcCCCcccccceeeecCCceE--EEeeccCCccccccceEEEEEecCCCCceeeEEEEeeccceeeeeeccchhhccc Confidence 1100 00011111111 1222211 11122344443 366665321 0 Q ss_pred ---------------------Ccc---cCCceEEEeccCCcccccccccccccccCcceecccCCCCceeEEEecCcceE Q lcl|NC_010325. 162 ---------------------NSV---EMPQMVWWSTSADAGGVPASWDPTDPTKDAGQNTLADTNGAIVDGVKLRDSFI 217 (513) Q Consensus 162 ---------------------~~~---~~p~rv~wS~~~d~~~~P~~Wd~t~~t~~a~~~dl~d~~G~iv~g~~l~~~~v 217 (513) |+. =..|.|+||-..-|..+|+.. +.-.| -.||++++.++.++ T Consensus 263 ~Lps~~w~~PP~~m~GL~~m~NGimAgF~GneV~FsEpylPyAWP~~Y-----------r~t~~--~dIVaiA~~gt~LV 329 (567) T protein:vir:82 263 SLATWDYLPPPENMTGLCLMANGIAAGFAGNEVMFSEAYLPYAWPEVN-----------RHTTA--EDIVAICPLRTSLV 329 (567) T ss_pred ccccccccCcCcccceeeecccceEEeecCCEEEEecCCCCcccchhh-----------ccCCC--CCeEEEEecccEEE Confidence 000 023567777766655544443 22222 45999999999999 Q ss_pred EEecCcEEEEEecCCCceeEeEEecCccccccCceeEEECCeEEEEeCCCeEEECCcccccCCchh---HHHHHHhhcCc Q lcl|NC_010325. 218 IYKEDSVYSMRYIGGLFIFQFQQLFNDVGILGPNCAVEFDGNHFVVGHGDVYVHNGVQKQSVIDAQ---VRKFFFSDINP 294 (513) Q Consensus 218 If~en~i~~m~y~g~~~~f~~~~i~~~~G~~~~~siv~~~~~~ffls~~G~y~~~G~~~~~Ig~~~---V~~~~~~~i~~ 294 (513) |-..-.-|..+ +-+|.--+.+|+..+--|++++|||.+++-+.|-|++|.-++++..--.|-++. -+.|. ..+ T Consensus 330 V~TkG~PYl~s-G~sP~sms~~kL~~~qpCvS~rsiV~~~g~v~Yas~dGLv~i~a~G~a~vvT~~l~t~~qW~-a~~-- 405 (567) T protein:vir:82 330 VATKGEPYLFS-GVSPSTISGSKIPSMQACLSRRSMVAMEGFVLYAGTNGLVSVDANGNVALATEQIVSPEQWQ-SQF-- 405 (567) T ss_pred EEEcCceEEEE-cCChhhccccccccccccccccceeeecceEEeecCCcEEEEecCCchhhhhhhccChHHHH-hcC-- Confidence 99999999984 456777888999999999999999999999999999999999754311111111 12221 113 Q ss_pred chhCCEEEEEecCCCEEEEEEccCCCCCCcccceEEEEecccCeEEEEeccceeeeeecccccccceeecccCcccCccc Q lcl|NC_010325. 295 DNYQRTFVLADHVNTEMWVCYSSTRSKPGKHCDRAIIWNWKENTWSIRDLPNVLSGAYGIIDPKVSNLWDDDPNPWDTDT 374 (513) Q Consensus 295 ~~~~~~~~~~d~~~~~v~~~~~s~~~~~~~~~d~~lvyd~~~~~Ws~~d~~~~~~~~~g~~~~~~~~~~~~~~~~~d~d~ 374 (513) +.+.+.+.. ...+|+-.|...++.. ..+++|...+..+..+.+..| .+ .+.....+ |.. -+..- T Consensus 406 -~P~ti~A~~--~eG~Y~a~Y~~~~g~~-----~~fifdp~~~~~~~i~~~~~~--~~--~d~~~d~L---y~~-~~~~l 469 (567) T protein:vir:82 406 -NPASIVAYP--WRGEYIACYTKPDGKQ-----DVFVFSPVNMDIRYLSTPFDC--AW--VDLAKDMM---RVV-TGDKM 469 (567) T ss_pred -CcceEEEEe--ecCeEEEEEeCCCCCc-----ceEEEcccccEEEEEecCcee--EE--EEeecCeE---EEe-eCCEE Confidence 334555444 3456777776555443 478999887776554443221 11 11111100 000 01111 Q ss_pred eeccccccccCccceEEEEeecCceeeecccceeecCccEEEEeecccccCCCcceEEEeeeeeccCCCeeEEEEeeeee Q lcl|NC_010325. 375 SVWGEGSYNPAKSSMIFSSFQDKKLFLFGNNSTFSGQNFVSTLERSDIYLGDDRMMKTVSAIIPHITGNGTCNIWVGNAQ 454 (513) Q Consensus 375 ~~~~~ds~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~l~a~~~~~~~~~~~~~~~~~i~~~~~~~t~~~~~~~~~g~~~ 454 (513) ..|..+ ...+...=.++.|.+..... +.|.+.... -.....+++.. T Consensus 470 ~~~~~g-------~~~~~~~WrSK~f~~p~~~s-----f~~~rV~s~------------------~~~~v~i~~~~---- 515 (567) T protein:vir:82 470 SVLAGG-------ALPSTIRWHSKIFSLPERTS-----FSCIRVKSP------------------APERVGITIMA---- 515 (567) T ss_pred eeecCC-------CCceeEEEecceEEecCccc-----eeEEEEecc------------------CCCceeEEEEE---- Confidence 122211 11112223344554432222 222222111 00111122111 Q ss_pred cCCCCceEcCceeeecCCceEEEeecCCCeEEEEEEccCCCcEEE--EEEe-eEEec-cccCC Q lcl|NC_010325. 455 VQGSGIRWKGPYPYRIGQDYKIDTKHVGRYIALKFDFSSEGDWYF--NGYT-IEMAP-KAGMR 513 (513) Q Consensus 455 ~~~~~~~w~~~~~~~~~~~~~~~~R~~~Ry~~~rl~~~~g~~w~~--~G~~-~~~~~-~g~rr 513 (513) -++.+.-. ..++ .. .| .+||---.+..|.+ +|.- |+-.. ++..- T Consensus 516 -dg~~v~~~-----~~g~---~~----~~--~~rlp~~~ar~Weveisg~~~V~~v~LA~S~~ 563 (567) T protein:vir:82 516 -DDVPVIHF-----APGT---FK----GS--VVRLPAATGQNWQVMVSGFGQVERITLSTSMS 563 (567) T ss_pred -cCCceeec-----CCcc---cC----Cc--eeeccCcccceEEEEEEecccEEEEEEecchh Confidence 01111100 1110 00 00 12221112444442 2210 11000 00000 No 45 >protein:vir:6326 Length: 826 # NCBI annotation: tail tubular protein B # Family: family:all:825 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877473;genbank:gi:33300845;uniprot:Q7Y2D1;genbank:GeneID:1482615 Probab=97.29 E-value=0.0001 Score=42.42 Aligned_cols=482 Identities=11% Similarity=0.056 Sum_probs=228.8 Q ss_pred CcccchhhcCccccccccCcccCCCCcEEEeEEEEEe-CCeeEECCCcceeeecCCC--c-ceeeeeee---eCCceEEE Q lcl|NC_010325. 1 MALERQEVKNPTGIVTDIAPADLPLEKWSFGNNVRFK-NGKAQKTLGHTPIFDTAQA--P-ILDMFPFI---RNNIPYWL 73 (513) Q Consensus 1 m~~~~~~~~~~~G~~~~~~P~~lp~~a~~~~~Nv~~~-~g~~~~~~g~~~~~~~~~~--~-~~~~~~~~---~~g~~~~~ 73 (513) |+...+.+.+..|.|.-..+.+-=++....|.||++. .++++||+|..=+.....+ + ..++.-+. +.+-.+++ T Consensus 1 M~~i~~s~~n~~~GvSqq~d~~r~~~q~~~~~N~~~~~~~G~~rRpg~~~v~~~~~~~~~~~~~~~~~~~r~~~~~~~~~ 80 (826) T protein:vir:63 1 MSYKQSAYPNLLMGVSQQVPFERLPGQLSEQINMVSDPVSGLRRRSGIELMAHLLHTDQPWPRPFLYHTNLGGRSIAMLV 80 (826) T ss_pred CceeeeecchhhcceeccCchHhhhhhhhhhhcceeeccCCcccCchhHhhhhhccCCccccccEEEEEecCCCceEEEE Confidence 9999999999999998766555566888999999999 5889999887665443221 1 12221111 12333444 Q ss_pred EEcCceEEEe--cCceEEeccc--cce-eeCCCCceeEEeeCCEEEEEeCCCceEE-------EcCCC------------ Q lcl|NC_010325. 74 LCSEQRLYLA--DGTTIIDVSP--GPY-SASITNRWSVGSFNGVIFANDGVNPPHH-------LPPSE------------ 129 (513) Q Consensus 74 v~~~~kly~~--~~~t~~dis~--~~~-~~~~~~~w~f~~~~~~~ia~ng~d~~q~-------~~~~s------------ 129 (513) ..+.+.|+-| +++....... .+| ++..-..-++++.+|+++++|..-+||. +++.. T Consensus 81 ~~~~g~irv~~~~~g~~~~~~~~~~~y~~~~~~~~l~~~t~aD~~fi~n~~~~p~~~~~~~~~~~~~~~~~~~v~~g~Y~ 160 (826) T protein:vir:63 81 AQHRGELYLFDERDGRLLMGQPLVHDYLKANDYRQLRAATVADDLFIANLSVKPEADRTDIKGVDPNKAGWLYIKAGQYS 160 (826) T ss_pred EecCCcEEEEEcCCCeEEEcCCCCCceeeecCccceEEEEeCCEEEEEeCCeeeeeccccccccCCCCcEEEEeeccccC Confidence 5555545433 4443332221 122 2322223455566666555555444431 00000 Q ss_pred -------------------------------------------------------------------------------- Q lcl|NC_010325. 130 -------------------------------------------------------------------------------- 129 (513) Q Consensus 130 -------------------------------------------------------------------------------- 129 (513) T Consensus 161 ~~y~vti~~~~~~~gt~~s~t~t~~t~~~~~a~~~~~~~~~~~s~~yia~~l~~~~~a~~~~~~~~~t~~~~~~~~~~~a 240 (826) T protein:vir:63 161 KAFSMTIKVKDNATGTTYSHTATYVTPDNASTNPNLAEAPFQTSVGYIAWQLYGKFFGAPEYTLPNSTKKYPKVDPDANA 240 (826) T ss_pred ceEEEEEEeccccCCccccceEEEEeccCCcccccccccceeeeeeeeeeeceeeeeeccccccCCCccccceecCCccc Confidence Q ss_pred ----------------------------------------------ceecccCCCcc----------------------- Q lcl|NC_010325. 130 ----------------------------------------------STFRVLPNFPA----------------------- 140 (513) Q Consensus 130 ----------------------------------------------~~f~~L~g~p~----------------------- 140 (513) ..+.+|+...+ T Consensus 241 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~l~~~~p~~~~~~~~~~~~~~~~~~~g~~~ 320 (826) T protein:vir:63 241 ATIAGYLNQRGVQDGYIAFRGDADIHVEVSTDMGNNYGIASGGMSLNATADLPALLPGVGAPGVGVQFMDGAVMATGSTK 320 (826) T ss_pred ceeecceeEecccccEEEEeeCCcccEEEccCCCCcceEEEEEeeccceeeccccCCCcccceEEEeeEEeEEecCCCcc Confidence 00001100000 Q ss_pred -----------------------------------------------cc-------------------eeeEEEEEcCEE Q lcl|NC_010325. 141 -----------------------------------------------NT-------------------TFKRLKSFKNFL 154 (513) Q Consensus 141 -----------------------------------------------~~-------------------ka~~v~~~~~~l 154 (513) .| .-..|..|++|| T Consensus 321 d~~y~~~~~~~~~w~e~~~~~~~~~~~tmp~~l~~~~~~~~f~~~~~~w~~r~~Gd~~tnp~psf~g~~~~~v~f~q~RL 400 (826) T protein:vir:63 321 APVYFEWDSANRRWAERAAYGTDWVLKKMPLALRWDEATDTYSLNELEYDRRGSGDEDTNPTFNFVTRGITGMTTFQGRL 400 (826) T ss_pred cceEEEEEcCCceEEEEeecCcccccccceEEEEEeccCCeEEEeccccccccccccccCCCccccCCCceEEEEEeceE Confidence 00 001246778888 Q ss_pred EEEECCcCcccCCceEEEeccCCcccccccccccc--cccCcceeccc---CCCCceeEEEecCcceEEEecCcEEEEEe Q lcl|NC_010325. 155 VGLNATSNSVEMPQMVWWSTSADAGGVPASWDPTD--PTKDAGQNTLA---DTNGAIVDGVKLRDSFIIYKEDSVYSMRY 229 (513) Q Consensus 155 ~~~g~t~~~~~~p~rv~wS~~~d~~~~P~~Wd~t~--~t~~a~~~dl~---d~~G~iv~g~~l~~~~vIf~en~i~~m~y 229 (513) ++++ |+.|+.|..+|.+ +|.... ...+++=.++. +..-.|..+++....|+||...+-|.++- T Consensus 401 ~f~~--------~~~v~~Srtgd~~----nF~~~s~~~~~DdD~I~~~~ss~~~~~i~~~v~~~~~L~l~T~~~q~~ls~ 468 (826) T protein:vir:63 401 VLLS--------QEYVCMSASNNPH----RWFKKSAAALNDDDPIEIAAQGSLTEPYEHAVTFNKDLIVFAKKYQAVVPG 468 (826) T ss_pred EEee--------CCeEEEEccCCcc----ccccccccCCCCCccEEEEEcCCcceeeEEEeecCCcEEEEecCcEEEEeC Confidence 7653 5779999999964 443332 22233333442 33344566888999999999999999963 Q ss_pred cC--CCceeEeEEecCccccccCceeEEECCeEEEEeCCC-----eEEE--C--Ccc-cccCC-chhHHHHHHhhcCcch Q lcl|NC_010325. 230 IG--GLFIFQFQQLFNDVGILGPNCAVEFDGNHFVVGHGD-----VYVH--N--GVQ-KQSVI-DAQVRKFFFSDINPDN 296 (513) Q Consensus 230 ~g--~~~~f~~~~i~~~~G~~~~~siv~~~~~~ffls~~G-----~y~~--~--G~~-~~~Ig-~~~V~~~~~~~i~~~~ 296 (513) .+ .|.--++++.+. .+|-+.=.=+.+|+.++|+++.| ++.+ + ..+ +.... +.-+.+.|-. . T Consensus 469 ~~~lTP~~~~i~~~s~-~~~~~~~~Pv~vG~~v~Fv~~~g~~~s~v~e~~~~~d~~~~y~~~dlt~~~~~l~~~-----~ 542 (826) T protein:vir:63 469 GGIVTPRTAVISITTQ-YDLDTRAAPAVTGRSVYFAAERALGFMGLHEMAPSPSTDSHYVAEDVTSHIPSYMPG-----P 542 (826) T ss_pred CCcccceeEEEEEEEe-ecccCCCCceEeCCeEEEEecCCCceeEEEEEEeeeccccceehhHHHHHHHHhcCC-----C Confidence 22 244456666653 34555555678999999999865 5533 2 111 22211 2233332211 1 Q ss_pred hCCEEEEEecCCCEEEEEEccCCCCCCcccceEEEEecc----c---CeEEEEecccee----------eeee------- Q lcl|NC_010325. 297 YQRTFVLADHVNTEMWVCYSSTRSKPGKHCDRAIIWNWK----E---NTWSIRDLPNVL----------SGAY------- 352 (513) Q Consensus 297 ~~~~~~~~d~~~~~v~~~~~s~~~~~~~~~d~~lvyd~~----~---~~Ws~~d~~~~~----------~~~~------- 352 (513) ...+ .+=.....+.|+-... +++++|-|. . ..|+.-+.+..+ +..+ T Consensus 543 v~~~--a~s~~~~~v~~~~~~d--------g~l~~~~y~~~~~e~~v~aW~~~~~~g~v~~~~~i~d~l~~iv~r~~~~~ 612 (826) T protein:vir:63 543 AEYI--QAAASSGYLVFGTSTA--------DEMICHQYLWQGNEKVQNAFHRWTLRHQIIGAYFTGDNLMVLIQKGQEIA 612 (826) T ss_pred eEEE--EEcCCCCEEEEEEcCC--------CEEEEEEEeeCCCcEEEEeEEEEecCCcEEEEEEECCeEEEEEEeCCCEE Confidence 1111 1112223344443221 357777762 2 357765544321 1110 Q ss_pred -cccccccce---------------eecccCcccCccceeccccccccCccceEEEEeecCc----------------ee Q lcl|NC_010325. 353 -GIIDPKVSN---------------LWDDDPNPWDTDTSVWGEGSYNPAKSSMIFSSFQDKK----------------LF 400 (513) Q Consensus 353 -g~~~~~~~~---------------~~~~~~~~~d~d~~~~~~ds~~~~~~~~~~~~~~~~~----------------~~ 400 (513) .....+... ..+............|..-.....+.. +....++. .+ T Consensus 613 ~~r~~~e~~~~~~~~~~~~~d~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~g~v~l 690 (826) T protein:vir:63 613 LGRMHLNSLPAREGLQYPKYDYWRRIEATVAGELELTKQHWDLIKDASAVYQ--LQPVAGAYMERTHLGVKRETNTKVFL 690 (826) T ss_pred EEEEEEEecCCccccccCCccceEEEEEeeeeeeccCcceeecccCcccccE--EEEeeCccccCCccceEEecCCEEEE Confidence 000000000 000000000011111110000001110 11111111 11 Q ss_pred eec----ccceeecCccEEEEeecccccC---CCcc---eEEEeeeeeccCCCeeEEEEeee--ee----cCCCCceEcC Q lcl|NC_010325. 401 LFG----NNSTFSGQNFVSTLERSDIYLG---DDRM---MKTVSAIIPHITGNGTCNIWVGN--AQ----VQGSGIRWKG 464 (513) Q Consensus 401 ~~~----~~~~~~g~~l~a~~~~~~~~~~---~~~~---~~~i~~~~~~~t~~~~~~~~~g~--~~----~~~~~~~w~~ 464 (513) .+. .....-|-++++.++-..+.+- ...+ +.++.++......-+.+.+.+.. ++ ....+....+ T Consensus 691 ~~~~~~~~~~v~VGl~y~s~~~~~~~~~~~~~g~~~~~gr~~l~r~~~~~~~tg~~~v~v~~~~~~~~~~~~~~~~~~~~ 770 (826) T protein:vir:63 691 DVPEAVVGAVYVVGCEFWSKVEFTPPVLRDHNGLPMTSTRAVLHRYNVNFGWTGEFLWRISDTARPNQPWYDTTPLRLFS 770 (826) T ss_pred ecCCCccccEEEEeeeeeEEEEecceEEEccCCCcceeccEEEEEEEEEeeccccEEEEecCccccceeEeecCCceecc Confidence 111 1124568889998886555431 1111 12333333323222223222221 10 0001111111 Q ss_pred c------eeeecCCceEEEeecCCCeEEEEEEccCCCcEEEEEEeeEEe-ccccCC Q lcl|NC_010325. 465 P------YPYRIGQDYKIDTKHVGRYIALKFDFSSEGDWYFNGYTIEMA-PKAGMR 513 (513) Q Consensus 465 ~------~~~~~~~~~~~~~R~~~Ry~~~rl~~~~g~~w~~~G~~~~~~-~~g~rr 513 (513) + ....++ ...+++|...--..++|+.+...++++.+++.|+. -.=.|| T Consensus 771 ~~~~~g~p~~~t~-~~~vP~~~~~~~~~i~i~~d~P~p~~il~i~~~~~yn~r~rr 825 (826) T protein:vir:63 771 RQLNAGEPLVDSA-VVPLPARVDMATSKFELSCHSPYDMNVRAVEYNFKSNQTYRR 825 (826) T ss_pred cccccccccccce-EEEEEEeeccceEEEEEEeCCCCcEEEEEEEEEEEEeceeec Confidence 1 111122 35567888888888888889999999999999983 444555 No 46 >protein:vir:103364 Length: 1031 # NCBI annotation: hypothetical protein # Family: family:all:11264 # MgeID: mge:1621 # MgeName: PaP2 # Cross-refs: genbank:acc:YP_024749;genbank:gi:48697091;genbank:GeneID:2846032 Probab=97.17 E-value=0.00012 Score=42.15 Aligned_cols=445 Identities=16% Similarity=0.160 Sum_probs=206.2 Q ss_pred CcccchhhcCccccccccCcccCCCC----------------cEEEeEEEEEeCCeeEECCCcceeeec--CC------- Q lcl|NC_010325. 1 MALERQEVKNPTGIVTDIAPADLPLE----------------KWSFGNNVRFKNGKAQKTLGHTPIFDT--AQ------- 55 (513) Q Consensus 1 m~~~~~~~~~~~G~~~~~~P~~lp~~----------------a~~~~~Nv~~~~g~~~~~~g~~~~~~~--~~------- 55 (513) ||-+|-+-+|.+--+--++|.+.|.+ .--.+.|++|..-+|...=|..+...+ +| T Consensus 1 MAds~~~ylD~sRs~l~iDP~k~Pnnl~~aTaEDt~dss~PViAY~~~NILPt~~GY~S~FGl~~~L~~D~~Pddgnplk 80 (1031) T protein:vir:10 1 MADSRGMYLDTSRSMLVIDPAKMPNNLGKATAEDTTDSSFPVIAYHGWNILPTPMGYKSAFGLMPYLRADPQPDDGNPLK 80 (1031) T ss_pred CcCCCcceEeeccceEEechhhhhhhhcccccccCcccccceEEeeccccccCcccchhhhccchhhccccCCCCCchhh Confidence 77777777776666666666655542 112456888877666655566666443 22 Q ss_pred -CcceeeeeeeeCC--ceEEEEEcCceEEEec------------CceEEeccccceeeCCCCceeEEeeCCEEEEEeCCC Q lcl|NC_010325. 56 -APILDMFPFIRNN--IPYWLLCSEQRLYLAD------------GTTIIDVSPGPYSASITNRWSVGSFNGVIFANDGVN 120 (513) Q Consensus 56 -~~~~~~~~~~~~g--~~~~~v~~~~kly~~~------------~~t~~dis~~~~~~~~~~~w~f~~~~~~~ia~ng~d 120 (513) .-|--++.+-... +.-+ .-+.+-+|-+. =..|+++..-+-...+=--|.-|.+.+.+.+-.-.| T Consensus 81 q~R~q~LF~~Qt~~f~Nl~V-~LTE~G~~m~s~~~nyksgidldl~eWvQ~~~v~~~~D~LY~WT~~VI~ntiYiY~QGd 159 (1031) T protein:vir:10 81 QLRKQDLFTYQTINFYNLGV-MLTEGGFWMYSSIGNYKSGIDLDLEEWVQIFPVPTAQDTLYLWTRCVIDNTIYIYHQGD 159 (1031) T ss_pred hhhhcceeeEeehhhhhhhh-hhhcCceeEeecccccccccccchhhhheeeccCCCcchhHHhhhhhhcceeEEEEcCC Confidence 1122233332211 1111 11233333222 234777664321111114699999999876655444 Q ss_pred ceEEEcCCCcee--------cccCCCcccceeeEEEEEcCEEEEEEC-----CcCcc---cCCceEEEeccCCccccccc Q lcl|NC_010325. 121 PPHHLPPSESTF--------RVLPNFPANTTFKRLKSFKNFLVGLNA-----TSNSV---EMPQMVWWSTSADAGGVPAS 184 (513) Q Consensus 121 ~~q~~~~~s~~f--------~~L~g~p~~~ka~~v~~~~~~l~~~g~-----t~~~~---~~p~rv~wS~~~d~~~~P~~ 184 (513) +-.|.-..-... +.|......|++.+|...--||=..|- .++.. +..|.+-||.+.|. +. T Consensus 160 peiyvladY~k~Qq~ka~~~t~lvV~SVdwqfgivkfiPTFLNMeGQVGlFkA~~RLG~WDtdNA~~WSs~~D~----qD 235 (1031) T protein:vir:10 160 PEIYVLADYGKIQQWKAKPNTTLVVESVDWQFGIVKFIPTFLNMEGQVGLFKADNRLGMWDTDNAIAWSSAVDK----QD 235 (1031) T ss_pred cceeeeccccceeeeecccccceEEEeecccccchhhcchhhccccceeEEeeCCeeeeeecCccceeccchhh----hc Confidence 332211111111 112222223444444433333321110 01100 22455789998875 33 Q ss_pred ccccccccCcceecccCCCCceeEEEecCcceEEEecCcEEEEEec-CCCceeEeEEecCccccccCceeE--EECCeEE Q lcl|NC_010325. 185 WDPTDPTKDAGQNTLADTNGAIVDGVKLRDSFIIYKEDSVYSMRYI-GGLFIFQFQQLFNDVGILGPNCAV--EFDGNHF 261 (513) Q Consensus 185 Wd~t~~t~~a~~~dl~d~~G~iv~g~~l~~~~vIf~en~i~~m~y~-g~~~~f~~~~i~~~~G~~~~~siv--~~~~~~f 261 (513) +.+ +.+.-|+.--+++-+|.|+.+.+-|.-++||..+||...+.. |.++-|+-+.|.++.|...++-+. +-+--|| T Consensus 236 FkP-d~tT~A~V~kFAsvdGqI~~I~~HG~GfiIY~srSi~~~~P~~~t~~~~s~~ai~n~tGv~Y~~QaTm~qPDTvHF 314 (1031) T protein:vir:10 236 FKP-DATTFAGVTKFASVDGQISMILQHGPGFIIYASRSISICTPITGTPEKFSGRAILNKTGVPYYFQATMGQPDTVHF 314 (1031) T ss_pred cCc-chHhhhhhheeeecCceEEEEeecCCcEEEEeeceEEEEecCCcCcccccceEEEccCCCcceeEEeccCCceeEE Confidence 332 334456666778888999999999999999999999988876 889999999999999998887544 3346699 Q ss_pred EEeCCCeEEECCcccccCCchhHHHHHHhhcCcchhCCEEEEEecCCCEEEEEEccCCCCCCcccceEEEEecccCeEEE Q lcl|NC_010325. 262 VVGHGDVYVHNGVQKQSVIDAQVRKFFFSDINPDNYQRTFVLADHVNTEMWVCYSSTRSKPGKHCDRAIIWNWKENTWSI 341 (513) Q Consensus 262 fls~~G~y~~~G~~~~~Ig~~~V~~~~~~~i~~~~~~~~~~~~d~~~~~v~~~~~s~~~~~~~~~d~~lvyd~~~~~Ws~ 341 (513) .++.-|.|.+--+-++-| ...|-+.|.+....-++-+|-+.| .++....+.-.+++....+||||-.-.-=.| T Consensus 315 ~~Tn~GL~rI~N~n~e~I-~PdVsn~L~n~~q~i~~~vinG~~------LFisvannsrdsansryavlvydgkgkpgyf 387 (1031) T protein:vir:10 315 SWTNSGLLRITNGNPEFI-EPDVSNFLMNNFQVIRPMVINGSH------LFISVANNSRDSANSRYAVLVYDGKGKPGYF 387 (1031) T ss_pred EEeeeeeEEEecCCccee-ccchHHHHhhcCceeeEEEecccE------EEEEeecCCcccccceeEEEEEcCCCCCCCC Confidence 999999998765555655 445878776665655555554433 3445443332223333578999965433222 Q ss_pred EeccceeeeeecccccccceeecccCc----------ccCccceeccccccccCccceEEEEeecCceeeecccceeecC Q lcl|NC_010325. 342 RDLPNVLSGAYGIIDPKVSNLWDDDPN----------PWDTDTSVWGEGSYNPAKSSMIFSSFQDKKLFLFGNNSTFSGQ 411 (513) Q Consensus 342 ~d~~~~~~~~~g~~~~~~~~~~~~~~~----------~~d~d~~~~~~ds~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 411 (513) .. .. +-......-|.-+-+ .-...+.+-.+|....+.++++.--+ +++.+.- -. T Consensus 388 np--pr------yeypdqaidwlhdfiwgqlpeyqeqlpeyetippdndppplaeqrplipcy-egytfip-------ps 451 (1031) T protein:vir:10 388 NP--PR------YEYPDQAIDWLHDFIWGQLPEYQEQLPEYETIPPDNDPPPLAEQRPLIPCY-EGYTFIP-------PS 451 (1031) T ss_pred CC--cc------ccCchhHHHHHHHhhhccCchHHHhCCccccCCCCCCCCCccccCccceec-cCeeecC-------CC Confidence 11 00 000111111111110 00001111122222223333322111 1111100 00 Q ss_pred ccEEEEeecccccCCCcceEEEeeeeeccCCCeeEEEEeee---eecCCCCceEcCceeeecCCceEE-Eee---cCCCe Q lcl|NC_010325. 412 NFVSTLERSDIYLGDDRMMKTVSAIIPHITGNGTCNIWVGN---AQVQGSGIRWKGPYPYRIGQDYKI-DTK---HVGRY 484 (513) Q Consensus 412 ~l~a~~~~~~~~~~~~~~~~~i~~~~~~~t~~~~~~~~~g~---~~~~~~~~~w~~~~~~~~~~~~~~-~~R---~~~Ry 484 (513) -+|..-..++ -+ .++.+-+ -++++. .|+. .+-+| -+- +-.|| T Consensus 452 gfetidwntg---------------------tg-ttisvpsipsfrfpgg--------vynm-idpqihglnqdtalwry 500 (1031) T protein:vir:10 452 GFETIDWNTG---------------------TG-TTISVPSIPSFRFPGG--------VYNM-IDPQIHGLNQDTALWRY 500 (1031) T ss_pred CcceeecccC---------------------cc-ceeecccccccccCcc--------chhc-ccchhccccccceeeee Confidence 0111111110 00 1111110 011110 0100 00000 011 12344 Q ss_pred EEEEEEccCCCcEEEEEEeeEEeccccCC Q lcl|NC_010325. 485 IALKFDFSSEGDWYFNGYTIEMAPKAGMR 513 (513) Q Consensus 485 ~~~rl~~~~g~~w~~~G~~~~~~~~g~rr 513 (513) .+=-+..|-|+.+- -++|..- T Consensus 501 eagpivsppgahfi--------dkageef 521 (1031) T protein:vir:10 501 EAGPIVSPPGAHFI--------DKAGEEF 521 (1031) T ss_pred ccCcccCCCcchhh--------hhhhhHH Confidence 44444554443321 1222211 No 47 >protein:vir:96439 Length: 1031 # NCBI annotation: hypothetical protein # Family: family:all:11264 # MgeID: mge:1616 # MgeName: 119X # Cross-refs: genbank:acc:YP_001218823;genbank:gi:147917340;genbank:GeneID:5142642 Probab=97.11 E-value=0.00017 Score=41.29 Aligned_cols=445 Identities=16% Similarity=0.165 Sum_probs=205.9 Q ss_pred CcccchhhcCccccccccCcccCCCC----------------cEEEeEEEEEeCCeeEECCCcceeeec--CC------- Q lcl|NC_010325. 1 MALERQEVKNPTGIVTDIAPADLPLE----------------KWSFGNNVRFKNGKAQKTLGHTPIFDT--AQ------- 55 (513) Q Consensus 1 m~~~~~~~~~~~G~~~~~~P~~lp~~----------------a~~~~~Nv~~~~g~~~~~~g~~~~~~~--~~------- 55 (513) ||-+|-+-+|.+--+--++|.+.|.+ .--.+.|++|..-+|...=|..+...+ +| T Consensus 1 MAds~~~ylD~sRs~l~iDP~k~Pnnl~~aTaEDt~dss~PViAY~~~NILPt~~GY~S~FGl~~~L~~D~~Pddgnplk 80 (1031) T protein:vir:96 1 MADSRGMYLDTSRSMLVIDPAKMPNNLGKATAEDTTDSSFPVIAYHGWNILPTPMGYKSAFGLMPYLRADPQPDDGNPLK 80 (1031) T ss_pred CcCCCcceEeeccceEEechhhhhhhhcccccccCcccccceEEeeccccccCcccchhhhccchhhccCCCCCCCchhh Confidence 77777777777666666666655542 112456888877666655566666443 22 Q ss_pred -CcceeeeeeeeCC--ceEEEEEcCceEEEec------------CceEEeccccceeeCCCCceeEEeeCCEEEEEeCCC Q lcl|NC_010325. 56 -APILDMFPFIRNN--IPYWLLCSEQRLYLAD------------GTTIIDVSPGPYSASITNRWSVGSFNGVIFANDGVN 120 (513) Q Consensus 56 -~~~~~~~~~~~~g--~~~~~v~~~~kly~~~------------~~t~~dis~~~~~~~~~~~w~f~~~~~~~ia~ng~d 120 (513) .-|--++.+-... +.-+ .-+.+-+|-+. =..|+++..-+-...+=--|.-|.+.+.+.+-.-.| T Consensus 81 q~R~q~LF~~Qt~~f~Nl~V-~LTE~G~~m~s~~~nyksgidldl~eWvQ~~~v~~~~D~LY~WT~~VI~ntiYiY~QGd 159 (1031) T protein:vir:96 81 QLRKQDLFTYQTINFYNLGV-MLTEGGFWMYSSIGNYKSGIDLDLEEWVQIFPVPTAQDTLYLWTRCVIDNTIYIYHQGD 159 (1031) T ss_pred hhhhcceeeeeechhhhhhh-hhhcCceeEeecccccccccccchhhhheeeccCCCcchhhhhhhhhhcceEEEEEcCC Confidence 1122233332211 1111 11233333222 234777664321111114699999999876655444 Q ss_pred ceEEEcCCCcee--------cccCCCcccceeeEEEEEcCEEEEEEC-----CcCcc---cCCceEEEeccCCccccccc Q lcl|NC_010325. 121 PPHHLPPSESTF--------RVLPNFPANTTFKRLKSFKNFLVGLNA-----TSNSV---EMPQMVWWSTSADAGGVPAS 184 (513) Q Consensus 121 ~~q~~~~~s~~f--------~~L~g~p~~~ka~~v~~~~~~l~~~g~-----t~~~~---~~p~rv~wS~~~d~~~~P~~ 184 (513) +-.|.-..-... +.|......|++.+|...--||=..|- .++.. +..|.+-||.+.|. +. T Consensus 160 peiyvladY~k~Qq~ka~~~t~lvV~SVdwqfgivkfiPTFLNMeGQVGvFkA~~RLG~WDtdNA~~WSs~~D~----qD 235 (1031) T protein:vir:96 160 PEIYVLADYGKIQQWKAKPNTTLVVESVDWQFGIVKFIPTFLNMEGQVGVFKADNRLGMWDTDNAIAWSSAVDK----QD 235 (1031) T ss_pred cceeeeecccceeeeecccccceEEEeecccccchhhcchhhccccceeEEeeCCeeeeeeccccceeccchhh----hc Confidence 332211111111 112222223444444333333211110 00000 22455789998875 33 Q ss_pred ccccccccCcceecccCCCCceeEEEecCcceEEEecCcEEEEEec-CCCceeEeEEecCccccccCceeE--EECCeEE Q lcl|NC_010325. 185 WDPTDPTKDAGQNTLADTNGAIVDGVKLRDSFIIYKEDSVYSMRYI-GGLFIFQFQQLFNDVGILGPNCAV--EFDGNHF 261 (513) Q Consensus 185 Wd~t~~t~~a~~~dl~d~~G~iv~g~~l~~~~vIf~en~i~~m~y~-g~~~~f~~~~i~~~~G~~~~~siv--~~~~~~f 261 (513) +.+ +.+.-|+.--+++-+|.|+-+.+-|.-++||..+||...+.. |.++-|+-+.|.++.|...++-+. +-+--|| T Consensus 236 FkP-d~tT~A~V~kFAsvdGqI~~I~~HG~GfiIY~srSi~~~~P~~~t~~~~s~~ai~n~tGv~Y~~QaTm~qPDTvHF 314 (1031) T protein:vir:96 236 FKP-DATTFAGVTKFASVDGQISIILQHGPGFIIYASRSISICTPITGTPEKFSGRAILNKTGVPYYFQATMGQPDTVHF 314 (1031) T ss_pred cCc-chHhhhhhheeeecCceEEEEeecCCcEEEEeeceEEEEecCccCcccccceEEEccCCCcceeEEeccCCceeEE Confidence 332 334456666778888999999999999999999999888876 889999999999999998887544 3346699 Q ss_pred EEeCCCeEEECCcccccCCchhHHHHHHhhcCcchhCCEEEEEecCCCEEEEEEccCCCCCCcccceEEEEecccCeEEE Q lcl|NC_010325. 262 VVGHGDVYVHNGVQKQSVIDAQVRKFFFSDINPDNYQRTFVLADHVNTEMWVCYSSTRSKPGKHCDRAIIWNWKENTWSI 341 (513) Q Consensus 262 fls~~G~y~~~G~~~~~Ig~~~V~~~~~~~i~~~~~~~~~~~~d~~~~~v~~~~~s~~~~~~~~~d~~lvyd~~~~~Ws~ 341 (513) .++.-|.|.+--+-++-| ...|-+.|.+....-++-+|-+.| .++....+.-.+++....+||||-.-.-=.| T Consensus 315 ~~Tn~GL~rI~N~n~e~I-~PdVsn~L~n~~q~i~~~vinG~~------LFisvannsrdsansryavlvydgkgkpgyf 387 (1031) T protein:vir:96 315 SWTNSGLLRITNGNPEFI-EPDVSNFLMNNFQVIRPMVINGSH------LFISVANNSRDSANSRYAVLVYDGKGKPGYF 387 (1031) T ss_pred EEeeeeeEEEecCCccee-ccchHHHHhhcCceeeeEEecccE------EEEEeecCCcccccceeEEEEEcCCCCCCCC Confidence 999999998765555655 445878776665555555554433 3445443332223333578999965433222 Q ss_pred EeccceeeeeecccccccceeecccCc----------ccCccceeccccccccCccceEEEEeecCceeeecccceeecC Q lcl|NC_010325. 342 RDLPNVLSGAYGIIDPKVSNLWDDDPN----------PWDTDTSVWGEGSYNPAKSSMIFSSFQDKKLFLFGNNSTFSGQ 411 (513) Q Consensus 342 ~d~~~~~~~~~g~~~~~~~~~~~~~~~----------~~d~d~~~~~~ds~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 411 (513) .. .. +-......-|.-+-+ .-...+.+-.+|....+.++++.--+ +++.+.- -. T Consensus 388 np--pr------yeypdqaidwlhdfiwgqlpeyqeqlpeyetippdndppplaeqrplipcy-egytfip-------ps 451 (1031) T protein:vir:96 388 NP--PR------YEYPDQAIDWLHDFIWGQLPEYQEQLPEYETIPPDNDPPPLAEQRPLIPCY-EGYTFIP-------PS 451 (1031) T ss_pred CC--cc------ccCchhHHHHHHHhhhccCchHHHhCCccccCCCCCcCCCccccCccceec-cCeeecC-------CC Confidence 11 00 000111111111110 00001111122222223333322111 1111100 00 Q ss_pred ccEEEEeecccccCCCcceEEEeeeeeccCCCeeEEEEeee---eecCCCCceEcCceeeecCCceEE-Eee---cCCCe Q lcl|NC_010325. 412 NFVSTLERSDIYLGDDRMMKTVSAIIPHITGNGTCNIWVGN---AQVQGSGIRWKGPYPYRIGQDYKI-DTK---HVGRY 484 (513) Q Consensus 412 ~l~a~~~~~~~~~~~~~~~~~i~~~~~~~t~~~~~~~~~g~---~~~~~~~~~w~~~~~~~~~~~~~~-~~R---~~~Ry 484 (513) -+|..-..+ +-+ .++.+-+ -++++. .|+. .+-+| -+- +-.|| T Consensus 452 gfetidwnt---------------------gtg-ttisvpsipsfrfpgg--------vynm-idpqihglnqdtalwry 500 (1031) T protein:vir:96 452 GFETIDWNT---------------------GTG-TTISVPSIPSFRFPGG--------VYNM-IDPQIHGLNQDTALWRY 500 (1031) T ss_pred Ccceeeccc---------------------Ccc-ceEecCcccccccCcc--------chhc-ccchhccccccceeeee Confidence 011111111 000 1111110 011110 0100 00000 011 12344 Q ss_pred EEEEEEccCCCcEEEEEEeeEEeccccCC Q lcl|NC_010325. 485 IALKFDFSSEGDWYFNGYTIEMAPKAGMR 513 (513) Q Consensus 485 ~~~rl~~~~g~~w~~~G~~~~~~~~g~rr 513 (513) .+=-+..|-|+.+- -++|..- T Consensus 501 eagpivsppgahfi--------dkageef 521 (1031) T protein:vir:96 501 EAGPIVSPPGAHFI--------DKAGEEF 521 (1031) T ss_pred ccCcccCCCcchhh--------hhhhhHH Confidence 44444554443321 1222211 No 48 >protein:vir:104388 Length: 566 # NCBI annotation: hypothetical protein # Family: family:all:1544 # MgeID: mge:1471 # MgeName: 86 # Cross-refs: genbank:acc:YP_794072;genbank:gi:116222017;genbank:GeneID:4397450 Probab=97.03 E-value=0.0002 Score=40.84 Aligned_cols=430 Identities=12% Similarity=0.135 Sum_probs=189.8 Q ss_pred CcccchhhcCccccccccCcccCCCCcEEEeEEEEEeCCeeEECCCcceeeecC---CCcceeee-e-eee-CCceEE-- Q lcl|NC_010325. 1 MALERQEVKNPTGIVTDIAPADLPLEKWSFGNNVRFKNGKAQKTLGHTPIFDTA---QAPILDMF-P-FIR-NNIPYW-- 72 (513) Q Consensus 1 m~~~~~~~~~~~G~~~~~~P~~lp~~a~~~~~Nv~~~~g~~~~~~g~~~~~~~~---~~~~~~~~-~-~~~-~g~~~~-- 72 (513) +.|--+++-+-.|-++-+.|--||.+.-+-+.|.+|+.|.+.|.+.-.-+...+ +..++..- . |.+ .+.+-+ T Consensus 24 ~~M~~i~i~~f~Ge~Pr~~p~lLP~~~a~~A~n~~~~~G~itP~~~~~~~~~~~~~~~kTif~y~~~~W~~w~~~V~~ir 103 (566) T protein:vir:10 24 ISMPYIDITTMRGMMPRVVTSMLPDHSAVLAEDCHFRFGVITPERQISGVEKTFTIKPKTIFHYRDDFWFAWPDVVDVIR 103 (566) T ss_pred ceeeEEeecccccccccchhhhccccccceEEeeeecCCeeeeeecccccccccccCceeeeeecCcEeEEeCCceeecc Confidence 555556666778999999999999999999999999999988776543332221 12222110 0 111 011111 Q ss_pred -EEEcC--ceEEEecCce--E--Eecccc---cee-----eCCCCceeEE-----------------eeCCE-----EEE Q lcl|NC_010325. 73 -LLCSE--QRLYLADGTT--I--IDVSPG---PYS-----ASITNRWSVG-----------------SFNGV-----IFA 115 (513) Q Consensus 73 -~v~~~--~kly~~~~~t--~--~dis~~---~~~-----~~~~~~w~f~-----------------~~~~~-----~ia 115 (513) -|+.+ .++|--+... + .||... +|. ... .+.+-+ .-+.. .+- T Consensus 104 ~PvAqD~~~rvY~tg~~~Pk~t~~diAt~g~~~~pa~~y~LgV-PaPs~apv~~~~~~sg~~~~~~~d~~tr~Yv~TfVt 182 (566) T protein:vir:10 104 SPVAQDNYGRIYYTDGKFPKVTAAEIATKGEGNFPAASYRLGI-PAPTTAPVCTVQKGEGATDENPNDDETRFYTETFVS 182 (566) T ss_pred CccccCCcceEEEeeCCcceeeecceeeccccccccccccccC-CCCcccceeeccCCCcccCCCCcccceeEEEEEEEc Confidence 01111 2444443331 1 111110 100 000 000000 00000 011 Q ss_pred EeCCC--------ceEEEcCCCceecccCCCcccc-eeeEEEEEc--------CEEEEEECC-------c---------- Q lcl|NC_010325. 116 NDGVN--------PPHHLPPSESTFRVLPNFPANT-TFKRLKSFK--------NFLVGLNAT-------S---------- 161 (513) Q Consensus 116 ~ng~d--------~~q~~~~~s~~f~~L~g~p~~~-ka~~v~~~~--------~~l~~~g~t-------~---------- 161 (513) ..|-. ...+..+++.+--.|...|++- ....++.|. +|+++..+. + T Consensus 183 ~~GeES~PS~~S~~v~v~~~gs~V~ltl~~~p~~~~~i~~~RIYRS~tg~~gtdy~lVael~as~~sf~Dd~~~~~lg~~ 262 (566) T protein:vir:10 183 AYGEEGPPGPESLEVTVGIPDTPVQLTLSPVPLQDANINRRRIYRSVSGGGEADFLLVAELEASVLSYTDNIPAKNLGPS 262 (566) T ss_pred CCCCcCCCccccceeEecCCCceEEEEecCCCcCcCCceeEEEEEecCCCCceeEEEEeeecccceeeeccccccccCcc Confidence 11110 0001111221211222222210 111233332 366666421 0 Q ss_pred --------------------Ccc---cCCceEEEeccCCcccccccccccccccCcceecccCCCCceeEEEecCcceEE Q lcl|NC_010325. 162 --------------------NSV---EMPQMVWWSTSADAGGVPASWDPTDPTKDAGQNTLADTNGAIVDGVKLRDSFII 218 (513) Q Consensus 162 --------------------~~~---~~p~rv~wS~~~d~~~~P~~Wd~t~~t~~a~~~dl~d~~G~iv~g~~l~~~~vI 218 (513) |+. =..|-|+||-..-|..+|+.. +.-.| -.||++++.++.++| T Consensus 263 Lps~~w~~PP~~m~GL~~m~NGimAgF~GneV~FsEpylPyAWP~~Y-----------r~t~~--~dIVaiA~~gt~LVV 329 (566) T protein:vir:10 263 LATWDYLPPPENMTGLCLMANGIAAGFAGNEVMFSEAYLPYAWPEVN-----------RHTTA--EDIVAVCPLGTSLVV 329 (566) T ss_pred cccccccCcCcccceeeecccceEEeecCCEEEEecCCCCcccchhh-----------ccCCC--CCeEEEEeccceEEE Confidence 000 023556777666655544443 22222 459999999999999 Q ss_pred EecCcEEEEEecCCCceeEeEEecCccccccCceeEEECCeEEEEeCCCeEEECCcccccCCchhHHHHHHhhcCc---c Q lcl|NC_010325. 219 YKEDSVYSMRYIGGLFIFQFQQLFNDVGILGPNCAVEFDGNHFVVGHGDVYVHNGVQKQSVIDAQVRKFFFSDINP---D 295 (513) Q Consensus 219 f~en~i~~m~y~g~~~~f~~~~i~~~~G~~~~~siv~~~~~~ffls~~G~y~~~G~~~~~Ig~~~V~~~~~~~i~~---~ 295 (513) -..-.-|..+ +-+|.--+.+|+..+--|++++|||.+++-+.|-|.+|.-++++..--.+-++ -++..=+. . T Consensus 330 ~TkG~PYl~s-G~sP~sms~~kL~~~qaCvS~rsiV~~~g~v~Yas~dGLv~v~a~g~a~vvT~----~l~t~~qW~~~~ 404 (566) T protein:vir:10 330 ATKGEPYLFS-GVSPSTISGSKIPSMQACLSRQSMVAMEGFVLYAGTNGLVSVDANGNAALATE----QIISPEQWQTQF 404 (566) T ss_pred EEcCceEEEE-cCChhhccccccccccccccccceeeecceEEeecCCceEEEecCCChhhhhh----hhcChhHHHhcC Confidence 9999999984 45677788899999999999999999999999999999999976431111121 12211111 2 Q ss_pred hhCCEEEEEecCCCEEEEEEccCCCCCCcccceEEEEecccCeEEEEeccceeeeeecccccccceeecccCcccCccce Q lcl|NC_010325. 296 NYQRTFVLADHVNTEMWVCYSSTRSKPGKHCDRAIIWNWKENTWSIRDLPNVLSGAYGIIDPKVSNLWDDDPNPWDTDTS 375 (513) Q Consensus 296 ~~~~~~~~~d~~~~~v~~~~~s~~~~~~~~~d~~lvyd~~~~~Ws~~d~~~~~~~~~g~~~~~~~~~~~~~~~~~d~d~~ 375 (513) +.+.+.+.. ...+|+-+|...++.+ .+.++|..-...+..+.+.. ..+. +.....+ +-. -+.+-. T Consensus 405 ~P~ti~A~~--~eG~Y~a~Y~~~~g~~-----~~fi~dp~g~~i~~l~~~~d--~~~~--d~~~d~l---y~~-~g~~i~ 469 (566) T protein:vir:10 405 NPASIVAYP--WRGEYIACYTKPDGEK-----DVFVFNPAGMDIRHLSTPFD--CACV--DLVNDVM---RVV-SGQNMS 469 (566) T ss_pred CcceEEEEe--ecCeEEEEEeCCCCCc-----cEEEEcccCceEEEeccccc--eeEE--eeccCee---eee-eCCeee Confidence 334454443 4466777776555443 57889976665444433322 1111 1111111 000 011112 Q ss_pred eccccccccCccceEEEEeecCceeeecccceeecCccEEEEeecccccCCCcceEEEeeeeeccCCCeeEEEEeeeeec Q lcl|NC_010325. 376 VWGEGSYNPAKSSMIFSSFQDKKLFLFGNNSTFSGQNFVSTLERSDIYLGDDRMMKTVSAIIPHITGNGTCNIWVGNAQV 455 (513) Q Consensus 376 ~~~~ds~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~l~a~~~~~~~~~~~~~~~~~i~~~~~~~t~~~~~~~~~g~~~~ 455 (513) .|..+ ..+ +...=.+++|.+.... ++.|.+...+ .+.++ .+++..... T Consensus 470 ~~~~g------~~~-~~~~WrSK~f~~p~~~-----sf~~~rV~s~----~~~~v--------------~i~i~ad~~-- 517 (566) T protein:vir:10 470 AMAGG------RLP-SLIRWHSKVFSLPERT-----SFSCLRVKSP----TPERV--------------GITVLADDV-- 517 (566) T ss_pred eecCC------CCC-ceEEEecceEEecCCc-----ceeEEEeecC----Cccce--------------eEEEEECCE-- Confidence 23221 111 1123344455553322 2222222111 11111 112211110 Q ss_pred CCCCceEcCceeeecCCceEEEeecCCCeEEEEEEccCCCcEEE--EEEe-eEEec-cccCC Q lcl|NC_010325. 456 QGSGIRWKGPYPYRIGQDYKIDTKHVGRYIALKFDFSSEGDWYF--NGYT-IEMAP-KAGMR 513 (513) Q Consensus 456 ~~~~~~w~~~~~~~~~~~~~~~~R~~~Ry~~~rl~~~~g~~w~~--~G~~-~~~~~-~g~rr 513 (513) ++ ..+. +++++ ++ .+||---.+..|.+ +|+- |+-.. ++..- T Consensus 518 ---~v-----~~~a---~G~~~----~~--~~rLp~~~~~~Wevevsg~~~V~~v~La~S~~ 562 (566) T protein:vir:10 518 ---PV-----IHLA---PGSLS----GS--VVRLPAATGQNWQVLVSGFGQVERITLSTSMS 562 (566) T ss_pred ---EE-----EEeC---CCccc----cc--eeecCCCccceEEEEEEecccEEEEEEecchh Confidence 00 1111 11111 00 22221113444542 2221 11000 00000 No 49 >protein:vir:103341 Length: 806 # NCBI annotation: tail tubular protein B-like protein # Family: family:all:825 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039670;genbank:gi:125999999;genbank:GeneID:4818417 Probab=96.16 E-value=0.00093 Score=37.20 Aligned_cols=474 Identities=11% Similarity=0.045 Sum_probs=185.4 Q ss_pred Cc------------------ccchhhcC----------ccccccccCcccCCCCcEEEeEEEEEeCCe----eEECCCcc Q lcl|NC_010325. 1 MA------------------LERQEVKN----------PTGIVTDIAPADLPLEKWSFGNNVRFKNGK----AQKTLGHT 48 (513) Q Consensus 1 m~------------------~~~~~~~~----------~~G~~~~~~P~~lp~~a~~~~~Nv~~~~g~----~~~~~g~~ 48 (513) ++ +..+-... ..|-+.-+.+.+.|+.. +...+|+ +....+.. T Consensus 177 ~~~~~~~~~~~~~~~~~a~~l~~~l~~~~~~~~~~~~~~~g~~~~i~~~~~~~~~------~~~~~g~~~~~~~~~~~~v 250 (806) T protein:vir:10 177 ASSEDVKNEDLVRTDYVAGKLLENFNSRTASFPGFSMYQDGNVLVVDNSNGANYA------LTTVDGADGQDLVAIRHKV 250 (806) T ss_pred ccCCCcccccccchhHHHHHHHhhhcccccccceeEEEEcccEEEEecCCCCccE------EEEeeCCCCceeEEeeccc Confidence 00 00000000 01111111111111100 1111111 01111111 Q ss_pred eeeecCCCcc-eeeeeeeeCCceEEEEEcCceEEEec---C--ceEEeccc-cceeeCCCCceeEEeeCCEEEEEeCCCc Q lcl|NC_010325. 49 PIFDTAQAPI-LDMFPFIRNNIPYWLLCSEQRLYLAD---G--TTIIDVSP-GPYSASITNRWSVGSFNGVIFANDGVNP 121 (513) Q Consensus 49 ~~~~~~~~~~-~~~~~~~~~g~~~~~v~~~~kly~~~---~--~t~~dis~-~~~~~~~~~~w~f~~~~~~~ia~ng~d~ 121 (513) .-.+.+|.-+ -+......+... ...+...+++. + ..|...-+ +-..+......-++...+.+. .++... T Consensus 251 ~~~~~lp~~~~~g~~v~i~~~~~---~~~~~y~v~~~~~~~~~~~w~e~~~~~~~~~~~~~t~p~~~v~~~~~-~~~~~~ 326 (806) T protein:vir:10 251 TNLDTLPNRAPVGYKVQVWPTGS---KPESRYWLQAESQDGSKVTWVETIAPGVRKGWNAATMPHVLVRESLN-ANGSAN 326 (806) T ss_pred CccccCccccCCCcEEEEeccCC---CCCCceEEEEEeeccCceEEEeecccccccceeccccceEEEeeeee-ecccce Confidence 1111122111 011111111000 00001112221 1 12543221 111111111111111112111 112211 Q ss_pred e--EE--EcC---CCce---ecccCCCcccceeeEEEEEcCEEEEEECCcCcccCCceEEEeccCCccccccccccccc- Q lcl|NC_010325. 122 P--HH--LPP---SEST---FRVLPNFPANTTFKRLKSFKNFLVGLNATSNSVEMPQMVWWSTSADAGGVPASWDPTDP- 190 (513) Q Consensus 122 ~--q~--~~~---~s~~---f~~L~g~p~~~ka~~v~~~~~~l~~~g~t~~~~~~p~rv~wS~~~d~~~~P~~Wd~t~~- 190 (513) - +. |.. +.+. .-.+-|+..+-...-|..|+|||+++. |+.|+.|..+|.+ +|..... T Consensus 327 ~~~~~~~w~~r~~Gd~~tn~~psF~~~~~~~~it~v~f~q~RL~f~s--------~~~v~~Srsgd~~----nF~~~t~~ 394 (806) T protein:vir:10 327 FTYRPGEWEDRDVGDDLTNDFPSLLNDSSPQPISSMLMVQNRLMLTS--------GEAVVASRTSRFF----DFFRYTVL 394 (806) T ss_pred eEEEecccccccccccccCccCcccCCCCCccceEEEEEeeeEEEec--------CCeEEEEccCCcc----cCcccccc Confidence 1 11 111 0000 001111111012234899999999862 7889999999964 4443332 Q ss_pred -ccCcceeccc---CCCCceeEEEecCcceEEEecCcEEEEEecC--CCceeEeEEecCccccccCceeEEECCeEEEEe Q lcl|NC_010325. 191 -TKDAGQNTLA---DTNGAIVDGVKLRDSFIIYKEDSVYSMRYIG--GLFIFQFQQLFNDVGILGPNCAVEFDGNHFVVG 264 (513) Q Consensus 191 -t~~a~~~dl~---d~~G~iv~g~~l~~~~vIf~en~i~~m~y~g--~~~~f~~~~i~~~~G~~~~~siv~~~~~~ffls 264 (513) ..+++=.++. +....|..+++....|+||...+-|.++-.+ .|.--++.+.+. .||-+.=.=+.+|+.++|.+ T Consensus 395 ~~~DdD~I~~~~ss~~~~~i~~~v~~~~~L~l~t~~~q~~l~~~~~lTP~~~~~~~~s~-~~~~~~~~Pv~vG~~v~Fv~ 473 (806) T protein:vir:10 395 ATVDTDPFDVFADIEEVYNIRWSAQMDGDVVLFTSDQQFTLPGDKPLTPTSAVIRPVTQ-FKMTPGVKPAPSGDSILFAF 473 (806) T ss_pred CCCCCccEEEEEcCCcceeeeeeeecCCcEEEEecCcEEEEeCCCcccceeEEEEEEEe-ecccCCCCceEeCCeEEEee Confidence 2233333442 3334577788899999999999999996322 244466666653 34544344567999999999 Q ss_pred CCC----eEEE--CCcc--cccCC-chhHHHHHHhhcCcchhCCEEEEEec-CCCEEEEEEccCCCCCCcccceEEEEec Q lcl|NC_010325. 265 HGD----VYVH--NGVQ--KQSVI-DAQVRKFFFSDINPDNYQRTFVLADH-VNTEMWVCYSSTRSKPGKHCDRAIIWNW 334 (513) Q Consensus 265 ~~G----~y~~--~G~~--~~~Ig-~~~V~~~~~~~i~~~~~~~~~~~~d~-~~~~v~~~~~s~~~~~~~~~d~~lvyd~ 334 (513) +.| ++.+ +-.+ +.+.. +.-+.+ ++ +.. .+.-++.. .+..+.|+-... +++++|-| T Consensus 474 ~~g~~s~vre~~y~~~~d~~~~~DlT~~~~h-l~----~g~--~~~~~~~~~~~~~~~~~~~~d--------g~l~~~ty 538 (806) T protein:vir:10 474 DQGSYSGIREFFTDSYSDTKKAQPATSHVDK-YI----RGK--VLELSASSSFNRAFIITSPDR--------NILYVYDW 538 (806) T ss_pred CCCCeeEEEEEEeeeeccceehhhHHHHHHH-hc----CCC--eEEEEEeCCCCcEEEEEEcCC--------CEEEEEEE Confidence 987 5433 2111 11110 111222 11 111 12222222 333455654322 35677665 Q ss_pred c----c---CeEEEEeccc--e--eeeeec--------ccc-------cccceeecc------------------cCccc Q lcl|NC_010325. 335 K----E---NTWSIRDLPN--V--LSGAYG--------IID-------PKVSNLWDD------------------DPNPW 370 (513) Q Consensus 335 ~----~---~~Ws~~d~~~--~--~~~~~g--------~~~-------~~~~~~~~~------------------~~~~~ 370 (513) . . +.|+.-+.+. + |..+.+ ... .-.+.+... +..++ T Consensus 539 ~~~~~e~~v~aW~rw~~~~~~~~~~~~~~~d~l~~vv~R~~~~~g~~~~~iE~~~~~~~~~~~~~~~~~lD~~~~~~~~~ 618 (806) T protein:vir:10 539 LYEGTEKVQNAWHKWSFPAGTVLHAVSYSNEKLYLVLTRTNTSGGVAGVYIEVMDMGDELEYGLQDRVRMDRRATLSMTY 618 (806) T ss_pred eecCCceEEEeEEeeeeCCCeEEEEEEEecCeEEEEEEEcCCcccEEEEEEEeecCCCCCCcccceeeeccccceEEEec Confidence 2 2 2677554432 1 111111 000 000000000 00111 Q ss_pred Cccceecccccccc----CccceEEEEee----cCcee--ee--------------c--ccceeecCccEEEEeeccccc Q lcl|NC_010325. 371 DTDTSVWGEGSYNP----AKSSMIFSSFQ----DKKLF--LF--------------G--NNSTFSGQNFVSTLERSDIYL 424 (513) Q Consensus 371 d~d~~~~~~ds~~~----~~~~~~~~~~~----~~~~~--~~--------------~--~~~~~~g~~l~a~~~~~~~~~ 424 (513) +....+|.+..... .+........- ++... .. + .....-|-++++.++-....+ T Consensus 619 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~g~~~~~g~~~~~~~~~~~~~v~~~~~~~~~~~~~v~vGl~Y~s~~~~t~p~~ 698 (806) T protein:vir:10 619 NATTRVWTSSALPWLPQDLSSLDAVLVSGWAGYVGGAFQFSYNASNNTISTNFDLAEGNTATIVVGETYWYEVEPTPPLI 698 (806) T ss_pred cccccceeeeeeccccccccceeEEEEeeccccCCceEEEEEcCccceEeeeeeecCCCCcEEEEeeeeeEEEEECCeeE Confidence 11111121100000 00000000000 00000 00 0 112567888888887433322 Q ss_pred ---CC---CcceEEEeeeeeccCCCeeEEEEeeeeecCCCCce---------E---cCceeeecCCceEEEeecCCCeEE Q lcl|NC_010325. 425 ---GD---DRMMKTVSAIIPHITGNGTCNIWVGNAQVQGSGIR---------W---KGPYPYRIGQDYKIDTKHVGRYIA 486 (513) Q Consensus 425 ---~~---~~~~~~i~~~~~~~t~~~~~~~~~g~~~~~~~~~~---------w---~~~~~~~~~~~~~~~~R~~~Ry~~ 486 (513) .+ ....+++.++.......+.+.+.++........+. + .+..+..+| ...++++..++-.. T Consensus 699 ~~~~~~~~~~~r~~l~r~~~~~~~s~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~tg-~~~vp~~~~~~~~~ 777 (806) T protein:vir:10 699 KDSKDRVSYLDTPTVGNVYLNLDMYPDFSVVVTDKETLQERTVYLANKTAGSITNVIGYIAPHEG-TLRIPLRRKSTDVS 777 (806) T ss_pred eccCCCccccccEEEEEEEEEeecceeeEEEEcccCCCcceeeeccCcccccccccccccccccc-eEEEEeeecCceeE Confidence 11 12234555555544444445444433222111111 1 011223333 46778888899999 Q ss_pred EEEEccCCCcEEEEEEeeEE-eccccCC Q lcl|NC_010325. 487 LKFDFSSEGDWYFNGYTIEM-APKAGMR 513 (513) Q Consensus 487 ~rl~~~~g~~w~~~G~~~~~-~~~g~rr 513 (513) ++|+.+.-.+.++.++++|. .-.=.|| T Consensus 778 v~i~~d~P~P~tvlai~~eg~y~~r~~r 805 (806) T protein:vir:10 778 FKIRSKSPATFQLRDIEWTGSYNPRKRR 805 (806) T ss_pred EEEEECCCCceEEEEEEEEEEeeccccc Confidence 99999999999999999998 3344555 No 50 >protein:vir:1778 Length: 680 # NCBI annotation: tail protein A # Family: family:all:825 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570344;genbank:gi:18640503;genbank:GeneID:932716 Probab=95.99 E-value=0.0011 Score=36.70 Aligned_cols=298 Identities=9% Similarity=0.018 Sum_probs=121.4 Q ss_pred CcccchhhcCccccccccCcccCCCCcEEEeEEEEEeCCeeEECCCcceeeecCCCcce-eeeeee--eCCceEEEEEcC Q lcl|NC_010325. 1 MALERQEVKNPTGIVTDIAPADLPLEKWSFGNNVRFKNGKAQKTLGHTPIFDTAQAPIL-DMFPFI--RNNIPYWLLCSE 77 (513) Q Consensus 1 m~~~~~~~~~~~G~~~~~~P~~lp~~a~~~~~Nv~~~~g~~~~~~g~~~~~~~~~~~~~-~~~~~~--~~g~~~~~v~~~ 77 (513) +-+++.+-.+.+-+. +.++ ....| ++.++. -+.+... +.+|+.+- |...-. +++.. .+ T Consensus 331 i~i~~~~~~~~~~~~--~~~~--~g~~~-~~~~~~--~~~v~~~-------~~Lp~~a~~g~~v~v~~~~~~~-----~~ 391 (680) T protein:vir:17 331 IRVRYSDPTRTDEFT--MSAR--GGTSG-TGLESI--KYSVDTL-------AELPTKCWNDYQVAVRNTQDTE-----VD 391 (680) T ss_pred EEEEeccCCCceEEE--eecc--CCCCc-eeeeee--eeeeccc-------cccccccCCCcEEEEEeCCCCc-----cc Confidence 233222211111110 0000 00111 111111 1112111 11222111 000000 01100 01 Q ss_pred ceEEEecC----------ceEEeccccc-eeeCCC--CceeEEe--eCC-EEEEEeCCCceEEEcC--CCc-------ee Q lcl|NC_010325. 78 QRLYLADG----------TTIIDVSPGP-YSASIT--NRWSVGS--FNG-VIFANDGVNPPHHLPP--SES-------TF 132 (513) Q Consensus 78 ~kly~~~~----------~t~~dis~~~-~~~~~~--~~w~f~~--~~~-~~ia~ng~d~~q~~~~--~s~-------~f 132 (513) ..+++|+. ++|.....-. ..+... -.|.... .+. .+.+.+.......|.. .++ .| T Consensus 392 ~Yyv~~~~~~~~~~~~~~~~W~E~~~~~~~~~~~~~tmp~~l~r~~~g~f~~~~~~~~~~~~~~~~r~~Gdd~tnp~psF 471 (680) T protein:vir:17 392 DYYVKFETDVEDADVPGSGYWVETVKNGDDGGLVDDTMPHVLVRNALGDFTFSSLNNSSYGKTWADRSVGSEDTNPHPTF 471 (680) T ss_pred ceEEEEeccCcccCcccccceeecccCcccceeccCcceEEEEEccCceeEEEeeccccccccccccccCCcccCCCccc Confidence 12223322 1343322100 000000 1121111 111 1122222222222211 111 12 Q ss_pred cccCCCcccceeeEEEEEcCEEEEEECCcCcccCCceEEEeccCCccccccccccccc--ccCcceeccc---CCCCcee Q lcl|NC_010325. 133 RVLPNFPANTTFKRLKSFKNFLVGLNATSNSVEMPQMVWWSTSADAGGVPASWDPTDP--TKDAGQNTLA---DTNGAIV 207 (513) Q Consensus 133 ~~L~g~p~~~ka~~v~~~~~~l~~~g~t~~~~~~p~rv~wS~~~d~~~~P~~Wd~t~~--t~~a~~~dl~---d~~G~iv 207 (513) .+.+.. ...|..|++||++++ |+.|+.|..+|.+ ++....+ ..+++=.++. +....|. T Consensus 472 ~~~G~~-----p~~v~f~q~RL~f~s--------~~~v~~Srtgd~~----nF~~~t~~~~~DdD~I~~~~ss~~~~~i~ 534 (680) T protein:vir:17 472 TESGNG-----IYGMFMYKNRLGFLT--------QDAVIMSQVGDYF----NFYATSGVTISDADPIDMATSDTKPVKLE 534 (680) T ss_pred ccCCCC-----ceEEEEEcceEEEee--------CCeEEEEccCCcc----cccccccccCCCCccEEEEEcCCcceeee Confidence 222222 335899999998763 6789999999964 4443332 1233333332 3345577 Q ss_pred EEEecCcceEEEecCcEEEEEecCC---CceeEeEEecCccccccCceeEEECCeEEEEeCCC----eE--EECC--ccc Q lcl|NC_010325. 208 DGVKLRDSFIIYKEDSVYSMRYIGG---LFIFQFQQLFNDVGILGPNCAVEFDGNHFVVGHGD----VY--VHNG--VQK 276 (513) Q Consensus 208 ~g~~l~~~~vIf~en~i~~m~y~g~---~~~f~~~~i~~~~G~~~~~siv~~~~~~ffls~~G----~y--~~~G--~~~ 276 (513) .+++....|++|...+-|.++-.++ |.--++++.+. .+|-+.=.=+.+|+.++|+++.| +. .++- ..+ T Consensus 535 ~~v~~~~~L~l~t~g~q~~ls~~~~~lTP~~~~i~~~s~-~~~~~~~~Pv~vG~~v~Fv~~~g~~s~vre~~y~~~~d~y 613 (680) T protein:vir:17 535 AAISSTSGAILFGNQAQFRLSSPDESFGPKTATLDKISN-YTYESKADPVQTGVSMIFPTNMGTYSSVYELSTESAKGTP 613 (680) T ss_pred EEeecCCcEEEEecCeEEEEecCCceecceeEEEEEEEe-ecccCCCCceEeCCeEEEeecCCCcceEEEEeeeeccCce Confidence 7888999999999999999973232 33456666663 34544455678999999999987 43 2322 222 Q ss_pred ccCC-chhHHHHHHhhcCcchhCCEEEEEecCCCEEEEEEccCCCCCCcccceEEEEecc----cC---eEEEEecccee Q lcl|NC_010325. 277 QSVI-DAQVRKFFFSDINPDNYQRTFVLADHVNTEMWVCYSSTRSKPGKHCDRAIIWNWK----EN---TWSIRDLPNVL 348 (513) Q Consensus 277 ~~Ig-~~~V~~~~~~~i~~~~~~~~~~~~d~~~~~v~~~~~s~~~~~~~~~d~~lvyd~~----~~---~Ws~~d~~~~~ 348 (513) .... +..+.+.| +.....+. .+-+..-.+.|+-... +.+++|-|. .+ .|+.-+++.+= T Consensus 614 ~a~DlT~~a~hl~-----~g~v~~~~-~~~~~~~~~~~~~~~~--------~~l~~~~yl~~~~e~~v~aW~rw~~~~~d 679 (680) T protein:vir:17 614 VIEDSSRVIPRLI-----PSGLTWST-ASMNNDTVFFGNAKKG--------RNVYVFRFFNEGQERKVAGWTTWYYEDQD 679 (680) T ss_pred ehhhHHHHHHHhc-----CCceEEEE-eeCCCCeEEEEEEcCC--------CEEEEEEEeeCCCceEEEEEEEEecCCCC Confidence 2211 11222221 22222221 2222222222332111 357777652 22 57766655442 Q ss_pred e Q lcl|NC_010325. 349 S 349 (513) Q Consensus 349 ~ 349 (513) - T Consensus 680 ~ 680 (680) T protein:vir:17 680 H 680 (680) T ss_pred C Confidence 1 No 51 >protein:vir:100022 Length: 976 # NCBI annotation: T7-like tail tubular protein B # Family: family:all:825 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214208;genbank:gi:61806431;genbank:GeneID:3294702 Probab=95.38 E-value=0.0022 Score=35.17 Aligned_cols=470 Identities=10% Similarity=0.080 Sum_probs=177.2 Q ss_pred Cc----ccchh--------hcCccccc----cc------cCcccCC----C--Cc-EE-EeEEEEEeCCeeE-ECCCcce Q lcl|NC_010325. 1 MA----LERQE--------VKNPTGIV----TD------IAPADLP----L--EK-WS-FGNNVRFKNGKAQ-KTLGHTP 49 (513) Q Consensus 1 m~----~~~~~--------~~~~~G~~----~~------~~P~~lp----~--~a-~~-~~~Nv~~~~g~~~-~~~g~~~ 49 (513) .. .|+.. ..-..+.+ +. ..|+.+. - ++ |+ .+-|+....+++- ..+.... T Consensus 328 V~v~g~~Y~it~~~~~~~~~~a~~~~~~~~~t~~d~~~~~~~~~ia~~L~~~l~a~~~~~g~tv~~~g~~~~i~~~~~~~ 407 (976) T protein:vir:10 328 VWMKDGYYKITVEAISTANVQANLGLIRPNPTPFDTETAVTAESIIGDIRTAIIATGNFTSANVQQIGTGLYVTRPSGTF 407 (976) T ss_pred EEccccceeeEEEEeeceeEEeccccccCcCCcCcccccccHHHHHHHHHHhhcccccccceEEEEcCcEEEEEecCcce Confidence 00 00000 00000000 00 0111110 0 00 11 1112222222211 1111110 Q ss_pred eeecCCCcceeeeee----------eeCCceEEEEEcCce-----EEEecCc-------eEEeccccceeeCCCCceeEE Q lcl|NC_010325. 50 IFDTAQAPILDMFPF----------IRNNIPYWLLCSEQR-----LYLADGT-------TIIDVSPGPYSASITNRWSVG 107 (513) Q Consensus 50 ~~~~~~~~~~~~~~~----------~~~g~~~~~v~~~~k-----ly~~~~~-------t~~dis~~~~~~~~~~~w~f~ 107 (513) ...+..+..+..+.. ..-.+..+-|+.+.+ ..+|+.. .|.....- ... ..+..+ T Consensus 408 ~~s~~~~~~~~~~~~~V~~~~~LP~~~~~g~~v~V~~~~~~~d~yyv~~~~~~~~~~~~~w~E~~~~--g~~--~g~~~~ 483 (976) T protein:vir:10 408 NVTAPSSDLLRVMSGEVANVDDLPSQCKHGYVVKVANSEADADDYYVKFFGHNNRDGDGVWEECAKP--SRN--IEFDKG 483 (976) T ss_pred EecCCCceeEEEEEeeecchhhhhhhccCCcEEEEecCCCCceeEEEEeeccccccccceEEEeecc--ccc--cccccc Confidence 011111111111100 000011111222211 1222221 13221110 000 001111 Q ss_pred eeCCEEEE--EeCCCceEE--EcC--CCc----eecccCCCcccceeeEEEEEcCEEEEEECCcCcccCCceEEEeccCC Q lcl|NC_010325. 108 SFNGVIFA--NDGVNPPHH--LPP--SES----TFRVLPNFPANTTFKRLKSFKNFLVGLNATSNSVEMPQMVWWSTSAD 177 (513) Q Consensus 108 ~~~~~~ia--~ng~d~~q~--~~~--~s~----~f~~L~g~p~~~ka~~v~~~~~~l~~~g~t~~~~~~p~rv~wS~~~d 177 (513) .+. +.+. .++.-.++. |+. .++ .+...-|+++ .-|..|++||++++ |++|+.|..+| T Consensus 484 tmP-~~l~~~~~g~f~~~~~~w~~r~vGd~~tnp~psf~g~~i----s~v~f~q~RL~f~s--------~~~v~~Srtgd 550 (976) T protein:vir:10 484 TMP-IQLVRQANGTFTVSQATWQNAEVGDELTNPNPSFVGKTI----NQLVFFRNRLVFLS--------DENVIMSRPGE 550 (976) T ss_pred ccc-EEEEecccCeEEeeeccccccccCCcccCcCceeccccc----ceEEEEcceEEEec--------CCeEEEEecCC Confidence 111 0111 111111111 211 000 0111112222 23899999999864 68899999999 Q ss_pred ccccccccccccc--ccCcceeccc---CCCCceeEEEecCcceEEEecCcEEEEEecCC---CceeEeEEecCcccccc Q lcl|NC_010325. 178 AGGVPASWDPTDP--TKDAGQNTLA---DTNGAIVDGVKLRDSFIIYKEDSVYSMRYIGG---LFIFQFQQLFNDVGILG 249 (513) Q Consensus 178 ~~~~P~~Wd~t~~--t~~a~~~dl~---d~~G~iv~g~~l~~~~vIf~en~i~~m~y~g~---~~~f~~~~i~~~~G~~~ 249 (513) .+ +|..... ..+++-.++. +....|..+++....|+||...+-|.++-.+. |.-.++.+.+ ..+|-+ T Consensus 551 ~~----nF~~~t~~~~~DdD~I~~~~ss~~~~~i~~~v~~~~~L~l~T~g~e~~lsg~~~~lTP~t~~i~~~s-~~~~~~ 625 (976) T protein:vir:10 551 FF----NFWSKTATTFTPQDVIDLSCSSTYPAIVYDGIQVNAGLLLFTKNQQFMLTTDSDILSPETAKINAVS-SYNFNE 625 (976) T ss_pred cc----ccccccccCCCCCccEEEEecCCcceeeEEEEecCCcEEEEecCceEEEecCCceecceeEEEEEEE-eeeccC Confidence 64 4433322 1233444442 33455777788899999999999999973332 3345566555 335666 Q ss_pred CceeEEECCeEEEEeCCC----eEEECCcc--cccC---CchhHHHHHHhhcCcchhCCEEEEEecCCC-EEEEEEccCC Q lcl|NC_010325. 250 PNCAVEFDGNHFVVGHGD----VYVHNGVQ--KQSV---IDAQVRKFFFSDINPDNYQRTFVLADHVNT-EMWVCYSSTR 319 (513) Q Consensus 250 ~~siv~~~~~~ffls~~G----~y~~~G~~--~~~I---g~~~V~~~~~~~i~~~~~~~~~~~~d~~~~-~v~~~~~s~~ 319 (513) .=.-+.+|+.++|+++.| ++.+.-.+ .... .+.-+.+.+ . ..+.-+....++ .+.|+-... T Consensus 626 ~v~Pv~vG~~v~Fv~~~g~~~r~~~~~~~~~~~~~~~~dlt~~~~~l~----~----g~~~~~a~~~~~~~vv~~~~~~- 696 (976) T protein:vir:10 626 KTHPVSLGTTVAFIDNANQFTRFFEMSNVVRQGEPDVVDQSKVISRLL----D----KNISLVSVSRENSVVFFSQKDT- 696 (976) T ss_pred CCccEEeCCeEEEEecCCCeEEEEEEeecccccccchhHHHHHhhhhc----C----CceEEEEEcCCCcEEEEEEcCC- Confidence 666789999999999987 44442211 1111 011122211 1 112222233333 233443221 Q ss_pred CCCCcccceEEEEecc----c---CeEEEEeccceeee--eec-c---------------cccccc---eeec------- Q lcl|NC_010325. 320 SKPGKHCDRAIIWNWK----E---NTWSIRDLPNVLSG--AYG-I---------------IDPKVS---NLWD------- 364 (513) Q Consensus 320 ~~~~~~~d~~lvyd~~----~---~~Ws~~d~~~~~~~--~~g-~---------------~~~~~~---~~~~------- 364 (513) +++++|-|. . ..|+.-+.+..+-. +++ . ...... ..+. T Consensus 697 -------g~l~~~ty~~~~~eq~v~aWsr~~~~G~v~sv~~i~D~ly~vV~r~~~g~~~r~~e~~~~~~~~~~~~~~~~~ 769 (976) T protein:vir:10 697 -------DKIYCFRYFTSGEKRLLQAWTTWTITGNIQYHCMLDDALYVVTRNNNKDQIVKYSLKLDDAGHFVTDTQGTTS 769 (976) T ss_pred -------CEEEEEEEeecCCceeEEeeEEEecCCcEEEEEEeCCeEEEEEEecCCeEEEEEEEEECCccceeeeccCccc Confidence 467777652 2 26886555432111 110 0 000000 0000 Q ss_pred ---ccC---------------cccCcc------ceeccccccccCccceEEEEeec--------------Cceeee---- Q lcl|NC_010325. 365 ---DDP---------------NPWDTD------TSVWGEGSYNPAKSSMIFSSFQD--------------KKLFLF---- 402 (513) Q Consensus 365 ---~~~---------------~~~d~d------~~~~~~ds~~~~~~~~~~~~~~~--------------~~~~~~---- 402 (513) ..+ .+++.+ ..+|... ....+.+...+ +....+ T Consensus 770 ~~~~~~~~~~lD~~~~~~~~~~t~~~~t~~t~~~~~~~~~-----~~~~~~~~~~d~~~~~~~~~~~~v~g~~i~l~g~~ 844 (976) T protein:vir:10 770 TDDDSIYRVHLDHSSSVTAASNTYNTTTIKTTIPKPNGYE-----STKQLVAYDTDAGNDLGRYALVTVSGSNLEIPGNW 844 (976) T ss_pred cccCCcceeeeccceEEEeccccccCCceeEEeecCcccc-----CceeEEEEecccCcccccceeeeecCCeeEecCCC Confidence 000 000000 0001000 00111111111 111112 Q ss_pred cccceeecCccEEEEeeccccc----CCCcce-----EEEeeeeeccCCCeeEEEEeee---eecCC-CCce-E----cC Q lcl|NC_010325. 403 GNNSTFSGQNFVSTLERSDIYL----GDDRMM-----KTVSAIIPHITGNGTCNIWVGN---AQVQG-SGIR-W----KG 464 (513) Q Consensus 403 ~~~~~~~g~~l~a~~~~~~~~~----~~~~~~-----~~i~~~~~~~t~~~~~~~~~g~---~~~~~-~~~~-w----~~ 464 (513) ......-|-++++.++-..+.. +++... ..+.++.......+.+.+.+-. ..... .... + .. T Consensus 845 ~~~~v~VGl~Y~s~~~~~~~~i~~~~g~~~~~~~~gRl~i~r~~~~~~~tg~~~v~v~~~~~~~~~~~~~~~~~~~~~~~ 924 (976) T protein:vir:10 845 SNNSFIIGYLYEMDVQLPTLYVTQQVGDKYRSDAKSSLIVHRIKFSFGPLGVYSTTIQRDGKPDFTETKELGLAGVVGAS 924 (976) T ss_pred CCCeEEEeeeeEEEEeecceeEEeCCCCcccccceeeEEEEEEEEEeecccceEEEEcCCCCccccccccccccCccccc Confidence 1123555778888887544433 121111 1222322222222222222211 10000 0000 0 00 Q ss_pred ceeeecCCceEEEeecCCCeEEEEEEccCCCcEEEEEEeeEEe--ccccCC Q lcl|NC_010325. 465 PYPYRIGQDYKIDTKHVGRYIALKFDFSSEGDWYFNGYTIEMA--PKAGMR 513 (513) Q Consensus 465 ~~~~~~~~~~~~~~R~~~Ry~~~rl~~~~g~~w~~~G~~~~~~--~~g~rr 513 (513) ..+.-.+....++++..++-..++|+.+.-.|.++.++++|.. +---|| T Consensus 925 ~~pl~~~~~~~vP~~~~~~~~~v~i~~d~PlP~tilsi~~eg~yn~r~~r~ 975 (976) T protein:vir:10 925 RLPIVPEVIETVPCYERNTNLKVNVKSEHPAPATLYSLAWEGDFTNRFYKR 975 (976) T ss_pred ccceecCcEEEEEeccCCceeEEEEEECCCCceEEEEEEEEEEeccceeec Confidence 1111222335688999999999999999999999999998874 222233 Done!