Query         lcl|NC_010811.2_cdsid_YP_001950077.1 [gene=RSL1_ORF198] [protein=hypothetical protein] [protein_id=YP_001950077.1] [location=complement(136586..138370)]
Match_columns 594
No_of_seqs    281 out of 698
Neff          7.6
Searched_HMMs 1612
Date          Thu Nov 7 15:41:18 2013
Command       /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_198 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_198_vs_rec_db.hhr

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 protein:vir:107802 Length: 681  98.8 4.9E-10   3E-13   71.6  13.6  223  361-594     1-298 (681)
  2 protein:vir:98487 Length: 681   98.8 4.9E-10   3E-13   71.6  13.6  223  361-594     1-298 (681)
  3 protein:vir:107423 Length: 681  98.8 4.9E-10   3E-13   71.6  13.6  223  361-594     1-298 (681)
  4 protein:vir:96666 Length: 462   97.8 2.2E-06 1.3E-09   51.7  12.6  327  226-594     1-439 (462)
  5 protein:vir:102644 Length: 594  96.6 0.00011 6.7E-08   42.3  11.3  180  361-594     1-196 (594)
  6 protein:vir:7329 Length: 825 #  93.6  0.0041 2.5E-06   33.7  10.4  279  294-594     1-347 (825)
  7 protein:vir:95324 Length: 823   89.6   0.026 1.6E-05   29.3  12.8  222  361-594     1-345 (823)
  8 protein:vir:1778 Length: 680 #  85.1   0.055 3.4E-05   27.5  10.0  425  101-594     1-500 (680)
  9 protein:vir:93631 Length: 580   38.0     1.1 0.00068   20.4   8.2  230  353-594     1-277 (580)
 10 protein:vir:7021 Length: 803 #   7.3     9.1  0.0056   15.3  10.1  312  230-594     1-381 (803)

No 1
>protein:vir:107802 Length: 681 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:780 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996623;genbank:gi:45580757;genbank:GeneID:2767878
Probab=98.77 E-value=4.9e-10 Score=71.62 Aligned_cols=223 Identities=17% Similarity=0.183 Sum_probs=101.6

Q ss_pred CcccEEEEecceEEEEEEEEEEEEecCCCHHHHHHHHHHHHHHHhhhcccCCCCcccHHHHHHHHHhcCCceEEE-----
Q lcl|NC_010811.
361 MYAAYFLWQDPIAIPRDVNVDVFVFNSAIPSQVQQNSVTALTNLFAPRPGLLMTNFYTSDLIETVVNASPGQVSY----- 435 (594) Q Consensus 361 ~~~~~v~V~~P~yv~v~V~~~v~~~~~~~~~~v~~~v~~aL~~~~~p~~~~fG~~v~~S~l~~avi~~V~GVvd~----- 435 (594) +..+ ...-+.+--= ++.=.+....+.+..+..+++ +++++--..+++-+.--. +.++++ ..-++-++. T Consensus 1 m~~~--~~~~~~f~~G--e~~p~l~~r~D~~~y~~~~~~-~~N~~~~~~G~~~~R~g~-~~~~~~-~~~~~~~rlipf~~ 73 (681) T protein:vir:10 1 MSNV--RVLQRSFGGG--EISPEMFGRIDDVKYQSGLAI-CRNFVVKPQGPAENRAGF-AFVREV-KDSAKKVRLIPFTY 73 (681) T ss_pred Ccce--eEeeeecCCc--eeeeeeccchhHHHHHHHHHH-hcCcEEEecCCceecChh-Hhhhhc-CCCCCcEEEEEEEe Confidence 1111 1111111100 011111223344555555543 333331111111110000 111111 000000000 Q ss_pred --------------EEEe----------------ecc--------------cccccccccccccc-------------cc Q lcl|NC_010811. 436 --------------VRVN----------------SPV--------------GPMIVTAPESPLPT-------------FT 458 (594) Q Consensus 436 --------------v~l~----------------~~~--------------~~~~~~~~~~~v~~-------------~~ 458 (594) +++. .|- +-+.+-+...+... +. T Consensus 74 ~~~~~~~l~~g~~~~r~~~~~~~~~~~~~~~~~~tpy~~~~l~~l~~~q~aD~~~i~h~~~~p~~L~r~~~~~W~l~~~~ 153 (681) T protein:vir:10 74 SVTQTMVIELGAGYFRFHTNGGTLLDGAVPYEIANPYAEADLFNIHYVQSADVLTLVHPNYAPRELRRLGATNWQLATIA 153 (681) T ss_pred CCCceEEEEEeCCeEEEEeCCcEEeeCcEeEEecCCCChhhhcCceEEEEcCEEEEECCCCcceEEEEccCCceEEEEEE Confidence 0000 000 00000000000000 00 Q ss_pred c--c---cc------ccccccccccceEeeecCccceec-cccccccCcccccccceEEEEecccCCcccceeeeecCCc Q lcl|NC_010811. 459 I--L---PS------LGTLGPLVYAYAVSTTLTNGDVGY-PTKWIFPQITDPGNTHAITLTWPAVAGAASYQVWGRQAGG 526 (594) Q Consensus 459 ~--~---ps------~~s~~~~t~~~~v~av~t~g~~~~-~~~~~~~~~t~~~~~~~~~~~w~~v~ga~~y~iy~~~~g~ 526 (594) + . |. ..+....++.+.+.++++.+...+ +.............++..++.|.++.|+..|+||+...|. 
T Consensus 154 f~~~p~~p~~~~at~~~~~~~~t~~~~v~avda~t~~~s~~~~~~tvt~~~~~~~~~~t~~w~a~~g~~~~~V~~~~~gi 233 (681) T protein:vir:10 154 FTSPVATPTSVTATSNNKGTDYTYRYVVTALDAEGKTESAPSSAGTCTNNLFTNGGANTIAWSASSGASRYNVYKEQGGL 233 (681) T ss_pred eccccccceeeeeeccCCccceeEeEEEEEeecccceeecCCcceEEeeeeecCCcceeEEEEecCCceeeeecccceeE Confidence 0 0 00 001122456677777766554322 2222222233334445669999999999999999999999 Q ss_pred eeeEEEeccceeEeecCCccCccccCCCCCccccccCCcc-CccceeeeeeeeecccccCCCceeeccC Q lcl|NC_010811. 527 LGLLANVPATSTTFTDNGSITPSGGLPSSGEFPIRYNSLS-SLTVNAYYSERQQPTITDADPTRLLGGQ 594 (594) Q Consensus 527 ~~~~~~~~~~~~~~~d~~~~~~~~~~p~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~p~~~~~~~ 594 (594) +|++|.. ..+++.|+...++...+|+-.+.| +...+ +|+.++||||||.|+++...|+.+...+ T Consensus 234 ~g~ig~~--~~~~~~~~~~~~~~~~t~~~~~~~--~~~~~gyP~~v~f~q~RL~f~~~~~~p~~v~~Sr 298 (681) T protein:vir:10 234 YGYIGQT--TGTSLVDDNIAPDLSVTPPIYDAV--FNAAGDYPAAVSYFEQRRCFAGTTNKPQNIWMTR 298 (681) T ss_pred EEEeecc--ceeeeeecccccCccccccccccc--cccCCCceEEEEEEcceEEEeeCCCCCcEEEEEc Confidence 9998844 567777774444444455433333 22222 5999999999999999999999999888 No 2 >protein:vir:98487 Length: 681 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:780 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996575;genbank:gi:45569506;genbank:GeneID:2767815 Probab=98.77 E-value=4.9e-10 Score=71.62 Aligned_cols=223 Identities=17% Similarity=0.183 Sum_probs=101.6 Q ss_pred CcccEEEEecceEEEEEEEEEEEEecCCCHHHHHHHHHHHHHHHhhhcccCCCCcccHHHHHHHHHhcCCceEEE----- Q lcl|NC_010811. 361 MYAAYFLWQDPIAIPRDVNVDVFVFNSAIPSQVQQNSVTALTNLFAPRPGLLMTNFYTSDLIETVVNASPGQVSY----- 435 (594) Q Consensus 361 ~~~~~v~V~~P~yv~v~V~~~v~~~~~~~~~~v~~~v~~aL~~~~~p~~~~fG~~v~~S~l~~avi~~V~GVvd~----- 435 (594) +..+ ...-+.+--= ++.=.+....+.+..+..+++ +++++--..+++-+.--. +.++++ ..-++-++. 
T Consensus 1 m~~~--~~~~~~f~~G--e~~p~l~~r~D~~~y~~~~~~-~~N~~~~~~G~~~~R~g~-~~~~~~-~~~~~~~rlipf~~ 73 (681) T protein:vir:98 1 MSNV--RVLQRSFGGG--EISPEMFGRIDDVKYQSGLAI-CRNFVVKPQGPAENRAGF-AFVREV-KDSAKKVRLIPFTY 73 (681) T ss_pred Ccce--eEeeeecCCc--eeeeeeccchhHHHHHHHHHH-hcCcEEEecCCceecChh-Hhhhhc-CCCCCcEEEEEEEe Confidence 1111 1111111100 011111223344555555543 333331111111110000 111111 000000000 Q ss_pred --------------EEEe----------------ecc--------------cccccccccccccc-------------cc Q lcl|NC_010811. 436 --------------VRVN----------------SPV--------------GPMIVTAPESPLPT-------------FT 458 (594) Q Consensus 436 --------------v~l~----------------~~~--------------~~~~~~~~~~~v~~-------------~~ 458 (594) +++. .|- +-+.+-+...+... +. T Consensus 74 ~~~~~~~l~~g~~~~r~~~~~~~~~~~~~~~~~~tpy~~~~l~~l~~~q~aD~~~i~h~~~~p~~L~r~~~~~W~l~~~~ 153 (681) T protein:vir:98 74 SVTQTMVIELGAGYFRFHTNGGTLLDGAVPYEIANPYAEADLFNIHYVQSADVLTLVHPNYAPRELRRLGATNWQLATIA 153 (681) T ss_pred CCCceEEEEEeCCeEEEEeCCcEEeeCcEeEEecCCCChhhhcCceEEEEcCEEEEECCCCcceEEEEccCCceEEEEEE Confidence 0000 000 00000000000000 00 Q ss_pred c--c---cc------ccccccccccceEeeecCccceec-cccccccCcccccccceEEEEecccCCcccceeeeecCCc Q lcl|NC_010811. 459 I--L---PS------LGTLGPLVYAYAVSTTLTNGDVGY-PTKWIFPQITDPGNTHAITLTWPAVAGAASYQVWGRQAGG 526 (594) Q Consensus 459 ~--~---ps------~~s~~~~t~~~~v~av~t~g~~~~-~~~~~~~~~t~~~~~~~~~~~w~~v~ga~~y~iy~~~~g~ 526 (594) + . |. ..+....++.+.+.++++.+...+ +.............++..++.|.++.|+..|+||+...|. T Consensus 154 f~~~p~~p~~~~at~~~~~~~~t~~~~v~avda~t~~~s~~~~~~tvt~~~~~~~~~~t~~w~a~~g~~~~~V~~~~~gi 233 (681) T protein:vir:98 154 FTSPVATPTSVTATSNNKGTDYTYRYVVTALDAEGKTESAPSSAGTCTNNLFTNGGANTIAWSASSGASRYNVYKEQGGL 233 (681) T ss_pred eccccccceeeeeeccCCccceeEeEEEEEeecccceeecCCcceEEeeeeecCCcceeEEEEecCCceeeeecccceeE Confidence 0 0 00 001122456677777766554322 2222222233334445669999999999999999999999 Q ss_pred eeeEEEeccceeEeecCCccCccccCCCCCccccccCCcc-CccceeeeeeeeecccccCCCceeeccC Q lcl|NC_010811. 
527 LGLLANVPATSTTFTDNGSITPSGGLPSSGEFPIRYNSLS-SLTVNAYYSERQQPTITDADPTRLLGGQ 594 (594) Q Consensus 527 ~~~~~~~~~~~~~~~d~~~~~~~~~~p~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~p~~~~~~~ 594 (594) +|++|.. ..+++.|+...++...+|+-.+.| +...+ +|+.++||||||.|+++...|+.+...+ T Consensus 234 ~g~ig~~--~~~~~~~~~~~~~~~~t~~~~~~~--~~~~~gyP~~v~f~q~RL~f~~~~~~p~~v~~Sr 298 (681) T protein:vir:98 234 YGYIGQT--TGTSLVDDNIAPDLSVTPPIYDAV--FNAAGDYPAAVSYFEQRRCFAGTTNKPQNIWMTR 298 (681) T ss_pred EEEeecc--ceeeeeecccccCccccccccccc--cccCCCceEEEEEEcceEEEeeCCCCCcEEEEEc Confidence 9998844 567777774444444455433333 22222 5999999999999999999999999888 No 3 >protein:vir:107423 Length: 681 # NCBI annotation: Bbp13 # Family: family:all:780 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958682;genbank:gi:41179374;genbank:GeneID:2717217 Probab=98.77 E-value=4.9e-10 Score=71.62 Aligned_cols=223 Identities=17% Similarity=0.183 Sum_probs=101.6 Q ss_pred CcccEEEEecceEEEEEEEEEEEEecCCCHHHHHHHHHHHHHHHhhhcccCCCCcccHHHHHHHHHhcCCceEEE----- Q lcl|NC_010811. 361 MYAAYFLWQDPIAIPRDVNVDVFVFNSAIPSQVQQNSVTALTNLFAPRPGLLMTNFYTSDLIETVVNASPGQVSY----- 435 (594) Q Consensus 361 ~~~~~v~V~~P~yv~v~V~~~v~~~~~~~~~~v~~~v~~aL~~~~~p~~~~fG~~v~~S~l~~avi~~V~GVvd~----- 435 (594) +..+ ...-+.+--= ++.=.+....+.+..+..+++ +++++--..+++-+.--. +.++++ ..-++-++. T Consensus 1 m~~~--~~~~~~f~~G--e~~p~l~~r~D~~~y~~~~~~-~~N~~~~~~G~~~~R~g~-~~~~~~-~~~~~~~rlipf~~ 73 (681) T protein:vir:10 1 MSNV--RVLQRSFGGG--EISPEMFGRIDDVKYQSGLAI-CRNFVVKPQGPAENRAGF-AFVREV-KDSAKKVRLIPFTY 73 (681) T ss_pred Ccce--eEeeeecCCc--eeeeeeccchhHHHHHHHHHH-hcCcEEEecCCceecChh-Hhhhhc-CCCCCcEEEEEEEe Confidence 1111 1111111100 011111223344555555543 333331111111110000 111111 000000000 Q ss_pred --------------EEEe----------------ecc--------------cccccccccccccc-------------cc Q lcl|NC_010811. 436 --------------VRVN----------------SPV--------------GPMIVTAPESPLPT-------------FT 458 (594) Q Consensus 436 --------------v~l~----------------~~~--------------~~~~~~~~~~~v~~-------------~~ 458 (594) +++. 
.|- +-+.+-+...+... +. T Consensus 74 ~~~~~~~l~~g~~~~r~~~~~~~~~~~~~~~~~~tpy~~~~l~~l~~~q~aD~~~i~h~~~~p~~L~r~~~~~W~l~~~~ 153 (681) T protein:vir:10 74 SVTQTMVIELGAGYFRFHTNGGTLLDGAVPYEIANPYAEADLFNIHYVQSADVLTLVHPNYAPRELRRLGATNWQLATIA 153 (681) T ss_pred CCCceEEEEEeCCeEEEEeCCcEEeeCcEeEEecCCCChhhhcCceEEEEcCEEEEECCCCcceEEEEccCCceEEEEEE Confidence 0000 000 00000000000000 00 Q ss_pred c--c---cc------ccccccccccceEeeecCccceec-cccccccCcccccccceEEEEecccCCcccceeeeecCCc Q lcl|NC_010811. 459 I--L---PS------LGTLGPLVYAYAVSTTLTNGDVGY-PTKWIFPQITDPGNTHAITLTWPAVAGAASYQVWGRQAGG 526 (594) Q Consensus 459 ~--~---ps------~~s~~~~t~~~~v~av~t~g~~~~-~~~~~~~~~t~~~~~~~~~~~w~~v~ga~~y~iy~~~~g~ 526 (594) + . |. ..+....++.+.+.++++.+...+ +.............++..++.|.++.|+..|+||+...|. T Consensus 154 f~~~p~~p~~~~at~~~~~~~~t~~~~v~avda~t~~~s~~~~~~tvt~~~~~~~~~~t~~w~a~~g~~~~~V~~~~~gi 233 (681) T protein:vir:10 154 FTSPVATPTSVTATSNNKGTDYTYRYVVTALDAEGKTESAPSSAGTCTNNLFTNGGANTIAWSASSGASRYNVYKEQGGL 233 (681) T ss_pred eccccccceeeeeeccCCccceeEeEEEEEeecccceeecCCcceEEeeeeecCCcceeEEEEecCCceeeeecccceeE Confidence 0 0 00 001122456677777766554322 2222222233334445669999999999999999999999 Q ss_pred eeeEEEeccceeEeecCCccCccccCCCCCccccccCCcc-CccceeeeeeeeecccccCCCceeeccC Q lcl|NC_010811. 527 LGLLANVPATSTTFTDNGSITPSGGLPSSGEFPIRYNSLS-SLTVNAYYSERQQPTITDADPTRLLGGQ 594 (594) Q Consensus 527 ~~~~~~~~~~~~~~~d~~~~~~~~~~p~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~p~~~~~~~ 594 (594) +|++|.. 
..+++.|+...++...+|+-.+.| +...+ +|+.++||||||.|+++...|+.+...+ T Consensus 234 ~g~ig~~--~~~~~~~~~~~~~~~~t~~~~~~~--~~~~~gyP~~v~f~q~RL~f~~~~~~p~~v~~Sr 298 (681) T protein:vir:10 234 YGYIGQT--TGTSLVDDNIAPDLSVTPPIYDAV--FNAAGDYPAAVSYFEQRRCFAGTTNKPQNIWMTR 298 (681) T ss_pred EEEeecc--ceeeeeecccccCccccccccccc--cccCCCceEEEEEEcceEEEeeCCCCCcEEEEEc Confidence 9998844 567777774444444455433333 22222 5999999999999999999999999888 No 4 >protein:vir:96666 Length: 462 # NCBI annotation: ORF016 # Family: family:all:2450 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238545;genbank:gi:66391271;genbank:GeneID:5130448 Probab=97.75 E-value=2.2e-06 Score=51.65 Aligned_cols=327 Identities=10% Similarity=0.095 Sum_probs=119.6 Q ss_pred cCCCCEEEEEEeecCCcccc-ccCccee-eeeccCcccceeeeccccC-CCC-CCCCHHHHHHHHHHHHhcCCCCcCHHH Q lcl|NC_010811. 226 PQVNDIVTISYPVTQGQAGD-SIITLNK-AVTVVGFSSITGVATSNPT-GGS-SEKSIVAYKNVASGSFGTYSSAVTKSQ 301 (594) Q Consensus 226 pp~G~~V~~~Y~~g~G~~GN-~v~ag~i-~~l~~~~~gV~~vn~~~a~-GGa-D~Es~e~~r~Rap~~lr~~~RaVT~~D 301 (594) .+.-.++... . ...| ..+ ..+ +.+.++. | +++.... ||+ -.|++++-....- .+-+| T Consensus 1 ~~~~~~~~~~----~-~~~~~~~~-e~~~KS~~tg~-g---~~p~~q~~~gAlR~esL~~~i~~Lt---------~~~~~ 61 (462) T protein:vir:96 1 MHKDTNLTAE----Q-NKYADKFQ-EEVMKSYQTGY-G---ITPDTQVDAGALRREILDDQITMLT---------WTQDD 61 (462) T ss_pred Cccccccchh----h-hhhhchhh-HHHHHHHhcCC-C---cCCccccccchhhhhhhhhhhheee---------ecccc Confidence 1111111110 0 0000 000 000 0111111 1 1222222 121 2333333221110 11122 Q ss_pred HHHHH---H--------------hCCCceEEEEecCCC---CCCce-------EEEEEEecCC----Cc---CCC---HH Q lcl|NC_010811. 302 YQALI---A--------------TYPGVKDAVTQAQRE---IDPGD-------VRWMNVIRVS----AL---TSS---PW 344 (594) Q Consensus 302 Ye~la---~--------------~~pgV~~a~~~~~~~---~~~~~-------vv~i~v~~~~----~~---~~~---~~ 344 (594) |..|- + ++.++......+.-. ...+. +.++.-.... 
.+ ..+ -+ T Consensus 62 ~~~~~~i~k~~a~sTv~~y~~~~~~G~~g~~~f~~E~g~~~~~d~~~~R~~~~~k~l~~t~~vsi~~tl~n~~~d~~~~~ 141 (462) T protein:vir:96 62 LIFYREISRRPAQSTVQKYDVYLRHGNVGHSRFVREVGVAPVSDPNIRQKTVEMKYVSDTKNLSIASTLVNNIQDPMQIL 141 (462) T ss_pred hhhhhhcCCchhhhhhhhheeeeccCccccccccccccccccCCCceEEEEEEEEEEeeeeeechhhhhccchhhHHHHH Confidence 32221 0 122222222211100 00111 1111111000 00 001 11 Q ss_pred HHHHHHHHHHHhhhc---CCcc------------cE-EEEecceEEEEEEEEEEEEecCCCHHHHHHHHHHHHHHHhhhc Q lcl|NC_010811. 345 TQDQIQSFLNYCQSV---TMYA------------AY-FLWQDPIAIPRDVNVDVFVFNSAIPSQVQQNSVTALTNLFAPR 408 (594) Q Consensus 345 ~~~~v~~~l~~~~~~---~~~~------------~~-v~V~~P~yv~v~V~~~v~~~~~~~~~~v~~~v~~aL~~~~~p~ 408 (594) .++.|...++.++-. +... .- ...++|.-+ +..+-+.-.+.+.......+ T Consensus 142 ~~dai~~~a~tiE~a~Fygds~l~~~~~~~gleFDGl~~lI~~~NV-------iDarG~~Ls~~~ln~aa~~i------- 207 (462) T protein:vir:96 142 TEDAIAVVAKTIEWASFYGDASLTADPTGQGLEFDGLAKLIDKDNV-------IDAKGESLTETLLNRSAVLI------- 207 (462) T ss_pred HHHHHHHHHHHHHHHHhhhhcccCCCccccccchhhhhhhcCCCce-------eecCCCCccHHHHhhhhhhc------- Confidence 223333333222111 1000 00 001122211 11111111111111111111 Q ss_pred ccCCCCc--ccHHHHHHHH------------Hhc----------CCceEE---EEEEee---ccccccccccccccccc- Q lcl|NC_010811. 409 PGLLMTN--FYTSDLIETV------------VNA----------SPGQVS---YVRVNS---PVGPMIVTAPESPLPTF- 457 (594) Q Consensus 409 ~~~fG~~--v~~S~l~~av------------i~~----------V~GVvd---~v~l~~---~~~~~~~~~~~~~v~~~- 457 (594) ...||-. ++.+.-..+- ++. ++|... .++|.. -..+..+.......+.. T Consensus 208 ~~~fGt~TD~~~p~~v~a~f~~~~l~~qrv~~~~n~g~~~~G~~v~~f~s~~G~I~L~~s~~m~~~~i~~~~~~~~p~ap 287 (462) T protein:vir:96 208 GKSFGTATDAYMPIGVHADFVNSVLGRQMQLMQDNSGNVNAGYNVQGFYSSRGFIKLHGSTVMENELILDESLQPLPNAP 287 (462) T ss_pred ccccCChhheecchHHHHHHHHhhcCceEEEEcCCCCceeeeeeccceeeeeeeeeeCCceecCcccccccccccCCCCC Confidence 1122221 1111101110 000 111111 111110 00011111111011110 Q ss_pred ---ccccc--cc-------ccccccccceEeeecCccceeccccccccCcccccccceEEEEecccCCccc--ceeee-- Q lcl|NC_010811. 
458 ---TILPS--LG-------TLGPLVYAYAVSTTLTNGDVGYPTKWIFPQITDPGNTHAITLTWPAVAGAAS--YQVWG-- 521 (594) Q Consensus 458 ---~~~ps--~~-------s~~~~t~~~~v~av~t~g~~~~~~~~~~~~~t~~~~~~~~~~~w~~v~ga~~--y~iy~-- 521 (594) .+.-. .+ ..+...|.|+|++++..|.+...+. +....+....+-+.+|+|+++.|+.. |+||| T Consensus 288 ~~~~vsaTv~t~~~g~f~~~~d~~~y~Y~V~avs~dgeS~PS~~-VtaTva~~~~gv~ltIt~~a~~~~~~~~~~IYRk~ 366 (462) T protein:vir:96 288 QPATVKATVETGKKGLFTDEHDRAELTYKVVVNSDDAQSAPSEA-VTATVNNATDGVKLEISVNAMYQQQPQFVSIYRQG 366 (462) T ss_pred CCCceeEEEEeCCCCCCCCccCceeEEEEEEEECCCCcccccee-eEeeeecccccceEEEEEcCCccccceEEEEEeec Confidence 00000 00 0135779999999999998865443 22234445566788999999999966 99997 Q ss_pred ecCCceeeEEEec------cceeEeecCCccCccccCCCCCccccccCCccCccceeeee--eeeecccccCCCceeecc Q lcl|NC_010811. 522 RQAGGLGLLANVP------ATSTTFTDNGSITPSGGLPSSGEFPIRYNSLSSLTVNAYYS--ERQQPTITDADPTRLLGG 593 (594) Q Consensus 522 ~~~g~~~~~~~~~------~~~~~~~d~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~p~~~~~~ 593 (594) +..|.|++||+++ ++.++|+|-+...|.+..|. +... +|.++.++| |+|.+..+...|+...+= T Consensus 367 ~~sg~y~li~rv~~~~~n~~gt~tf~D~n~~iPgt~~~f-----Vge~---~p~vi~~~qllpm~~~plA~~n~~~~waV 438 (462) T protein:vir:96 367 RKTGDFYLIKRLGMKEVNDEGKLVFYDLNETIPETTDVF-----VGEM---SPQVLHLFELLPMMKLPLAQINASVTFAV 438 (462) T ss_pred CCccccceeeeeeceeecCCcceeEeeccCCCCCcccce-----eecC---CchhhhhhhhhhhhhcCcccccchhhhhh Confidence 4567899999884 55679999766655333222 2222 389999999 999999999998654432 Q ss_pred C Q lcl|NC_010811. 
594 Q 594 (594) Q Consensus 594 ~ 594 (594) - T Consensus 439 l 439 (462) T protein:vir:96 439 L 439 (462) T ss_pred h Confidence 2 No 5 >protein:vir:102644 Length: 594 # NCBI annotation: Hypothetical protein # Family: family:all:780 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_024422;genbank:gi:48696643;genbank:GeneID:2948111 Probab=96.63 E-value=0.00011 Score=42.32 Aligned_cols=180 Identities=10% Similarity=0.053 Sum_probs=70.3 Q ss_pred CcccEEEEecceEEEEEEEEEE--EEecCCCHHHHHHHHHHHHHHHhhhcccCCCCcccH--HHHHHHHHhcCCceEEEE Q lcl|NC_010811. 361 MYAAYFLWQDPIAIPRDVNVDV--FVFNSAIPSQVQQNSVTALTNLFAPRPGLLMTNFYT--SDLIETVVNASPGQVSYV 436 (594) Q Consensus 361 ~~~~~v~V~~P~yv~v~V~~~v--~~~~~~~~~~v~~~v~~aL~~~~~p~~~~fG~~v~~--S~l~~avi~~V~GVvd~v 436 (594) +.. + .-+.+- .-+| .+....+.+..+..++ .+++++- .+.|.-.++ .+.++.+ +.-.+ .+ T Consensus 1 m~~--~--~~~~F~----~GelsP~l~~r~Dl~~y~~~~~-~~~n~~~---~~~G~~~rR~G~~~~~~~-~~~~~---~~ 64 (594) T protein:vir:10 1 MAD--F--SQTSFK----GGVIAPRLQFNEYESAYHHSIE-DAVNFVV---TEQGSLITRCGSEEVGLC-QDGEV---RL 64 (594) T ss_pred Cce--e--eccccC----cceecceeccchhHHHHHHHHh-hhhceEE---EecCCeecCChhHhhhhc-cCCCC---CE Confidence 100 0 000000 0000 0111223344444443 3333331 122332222 1222221 11111 11 Q ss_pred EEeecc----ccccccccccccccccccccccccccccccceEeeec-Cccce-ecccccc------ccCcccccccceE Q lcl|NC_010811. 437 RVNSPV----GPMIVTAPESPLPTFTILPSLGTLGPLVYAYAVSTTL-TNGDV-GYPTKWI------FPQITDPGNTHAI 504 (594) Q Consensus 437 ~l~~~~----~~~~~~~~~~~v~~~~~~ps~~s~~~~t~~~~v~av~-t~g~~-~~~~~~~------~~~~t~~~~~~~~ 504 (594) +|..+. +...++-...-+ -.|.+....+. ..+.- ...++.. 
...+.-......+ T Consensus 65 ~lipF~~s~~~~~~le~g~~~~--------------r~~~~~~~~v~~~~~~~~~~~tp~~~t~~~~l~~i~~tqsad~~ 130 (594) T protein:vir:10 65 FRLPAVDAPSNDVIVEVGNTNI--------------AVWVNDVRQVVANTPSEWRNTIDRIQTAYDTIGDDAGAANTGRL 130 (594) T ss_pred EEEEEEeCCCCeEEEEEcCCeE--------------EEEecCcEEEEccCCCcccccccceeeccCCccceEEEEEeeEE Confidence 222211 111111100000 00111111111 11111 1111111 1111222223334 Q ss_pred EEEecccCCcccceeeeecCCceeeEEEeccceeEeecCCccCccccCCCCCccccccCCccCccceeeeeeeeeccccc Q lcl|NC_010811. 505 TLTWPAVAGAASYQVWGRQAGGLGLLANVPATSTTFTDNGSITPSGGLPSSGEFPIRYNSLSSLTVNAYYSERQQPTITD 584 (594) Q Consensus 505 ~~~w~~v~ga~~y~iy~~~~g~~~~~~~~~~~~~~~~d~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 584 (594) .+.|.+. ..|++||...|.+++.+.. |++ .|..+....+|+.++||||||.|+++. T Consensus 131 ~~~~~~~---~p~~L~R~~~~~w~~~~~~------~~~---------------~p~~~~~~~~p~~v~f~q~RL~f~~~~ 186 (594) T protein:vir:10 131 IMVHPAL---QPKRLYRDNNNAWQFVNMH------TGA---------------VPAEWSPSNYPQTVGIFQNRVWYVGSP 186 (594) T ss_pred EEEcCCC---CceEEEEccCCCceEEecc------cCc---------------ccccccCCccceEEEEEeeeEEEEeCC Confidence 4555443 3799999888888885422 111 122344455799999999999999999 Q ss_pred CCCceeeccC Q lcl|NC_010811. 585 ADPTRLLGGQ 594 (594) Q Consensus 585 ~~p~~~~~~~ 594 (594) +.|+++...+ T Consensus 187 ~~p~~v~~Sr 196 (594) T protein:vir:10 187 VHRTYFWATR 196 (594) T ss_pred CCCceEEEEe Confidence 9999999887 No 6 >protein:vir:7329 Length: 825 # NCBI annotation: hypothetical protein # Family: family:all:780 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848220;genbank:gi:30387391;genbank:GeneID:2641863 Probab=93.63 E-value=0.0041 Score=33.67 Aligned_cols=279 Identities=11% Similarity=0.074 Sum_probs=72.2 Q ss_pred CCCcCHHHHHHHHH-hCC-CceEEEEecCCCCCC---c--eEEEEEEecCCCcCCCHHHHHHHHHHHHHhhhcCC----- Q lcl|NC_010811. 
294 SSAVTKSQYQALIA-TYP-GVKDAVTQAQREIDP---G--DVRWMNVIRVSALTSSPWTQDQIQSFLNYCQSVTM----- 361 (594) Q Consensus 294 ~RaVT~~DYe~la~-~~p-gV~~a~~~~~~~~~~---~--~vv~i~v~~~~~~~~~~~~~~~v~~~l~~~~~~~~----- 361 (594) .|. + +++ +|. |--.=......+... + ...-+.+.+.++..--+.+ +.|......-++.++ T Consensus 1 m~~-~------~~q~sF~~GElsP~l~gR~Dl~~y~~g~~~~~N~~~~p~Gg~~rRpGt-~fva~~~~~~~~~rLipF~f 72 (825) T protein:vir:73 1 MAF-S------WIQPSFAGGEIGPSLYGRIDMSKYQVALRKCDNFIVRQYGGVENRPGT-RFVGPAKYPDRKCRLIPFQF 72 (825) T ss_pred Ccc-c------eeccccccceechhhcccchHHHHHHHHHHhcCcEEEecCCceecCch-HHhHhhcCCCCCEEEEEEEe Confidence 000 0 000 010 000000000000000 0 0011111122221111111 112111100011111 Q ss_pred --cccEEEEecceEEEEEEE---------EEEEEecCCCHHHHHHHHHH-HHHH--H-hhhcc------------c---- Q lcl|NC_010811. 362 --YAAYFLWQDPIAIPRDVN---------VDVFVFNSAIPSQVQQNSVT-ALTN--L-FAPRP------------G---- 410 (594) Q Consensus 362 --~~~~v~V~~P~yv~v~V~---------~~v~~~~~~~~~~v~~~v~~-aL~~--~-~~p~~------------~---- 410 (594) ....+-+..+.|+.|--. ....+...+...++. +|+- ...+ | .|+.. | T Consensus 73 s~~q~y~Lefg~~~lrv~~~gg~v~~~~~~~~e~~TPy~~~~l~-~l~~~QsaD~~~i~h~~~pp~~L~r~~~~~W~l~~ 151 (825) T protein:vir:73 73 STVQTYALEFGHNYMRVIKDGAYVLTTSNVIYELAMPYADTDLF-RIKFTQSADVLTLVHPAYPPKELRRYAHDNWQIVD 151 (825) T ss_pred CCCcEEEEEEeCCeEEEEeCCceEeccCCceEEEecccchhhhh-hheeeeecCEEEEEcCCCceeEEEEecCCCcEEEE Confidence 011222334555544211 111222233322221 1110 0000 0 11110 0 Q ss_pred -CCCCcccHHH-HHHHHHhcCCceEEEEEEeeccccccccccccccccccccccc---cccccccccceEee-ecCccce Q lcl|NC_010811. 411 -LLMTNFYTSD-LIETVVNASPGQVSYVRVNSPVGPMIVTAPESPLPTFTILPSL---GTLGPLVYAYAVST-TLTNGDV 484 (594) Q Consensus 411 -~fG~~v~~S~-l~~avi~~V~GVvd~v~l~~~~~~~~~~~~~~~v~~~~~~ps~---~s~~~~t~~~~v~a-v~t~g~~ 484 (594) .|+.....+. +-..+.-..+|....+.+....... ...+.+. .+.+.... .........+.+.. +...+.. 
T Consensus 152 ~~f~~gp~~~in~~~sv~v~asg~tg~~TiTaS~a~~--~~~~vG~-~i~~~~~~v~si~~~~~~~~~~~~~v~~~~~~~ 228 (825) T protein:vir:73 152 VTTKNGPFEDINVDETVKVYASASTGTITLTASSAIF--GAEQVGK-LFYLEQPAVDSVPVWETSKTTAINDVRRADSNY 228 (825) T ss_pred EeccCCccccccccccceeeecccCceeEEEeecccc--CchhcCe-EEEEecccccccceeeeeeEEEeeeEEECCCce Confidence 1111100000 0000000000000000011100000 0000000 00000000 00000000000000 0000000 Q ss_pred eccccccccCccccccc------ceEEEEecccCCcccce------eeeecCCceeeEEEec------cceeEeecCCcc Q lcl|NC_010811. 485 GYPTKWIFPQITDPGNT------HAITLTWPAVAGAASYQ------VWGRQAGGLGLLANVP------ATSTTFTDNGSI 546 (594) Q Consensus 485 ~~~~~~~~~~~t~~~~~------~~~~~~w~~v~ga~~y~------iy~~~~g~~~~~~~~~------~~~~~~~d~~~~ 546 (594) . ..+..+.+ ...-.+|.++.|+..++ .++...|..-+.+... ++.+.|.++... T Consensus 229 ~--------~~~~~~~~~t~~~~a~~g~~~~~~~g~~~~~~~~~~~~~~~~~g~~~it~~~~~~~~~~~~~~~~~~~~~~ 300 (825) T protein:vir:73 229 Y--------RANTSGKTGTLRPSHTEGMSWDGWGGTGSDDTGIQWEYLHSGFGIAKITAVAGDGLTATADVVSFIPSQVV 300 (825) T ss_pred e--------eeecccccceeeccccCCceeEeeeeecccCCceEEEEEecCCceEEEeeccccceeeccccceecccccc Confidence 0 00011111 11125677777765444 3333444433332111 112334433222 Q ss_pred CccccCCCCCccccccCCcc-CccceeeeeeeeecccccCCCceeeccC Q lcl|NC_010811. 
547 TPSGGLPSSGEFPIRYNSLS-SLTVNAYYSERQQPTITDADPTRLLGGQ 594 (594) Q Consensus 547 ~~~~~~p~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~p~~~~~~~ 594 (594) ..+..++.-.+.+ |...+ +|+.++||||||.|+++...|+.+...+ T Consensus 301 ~~~~~t~~~~~~~--~~~~~gyPs~v~f~q~RL~f~g~~~~p~~v~~Sr 347 (825) T protein:vir:73 301 GSANASYKWAKYA--WNSVNGYPSTVVYYQQRLYFAASTAYPQTIWASR 347 (825) T ss_pred cCCCCCcccccCC--cccCCCCccEEEEEcceEEEeecCCCCCEEEEEc Confidence 2222222222221 22222 6999999999999999999999998888 No 7 >protein:vir:95324 Length: 823 # NCBI annotation: hypothetical protein # Family: family:all:780 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512269;genbank:gi:89152436;genbank:GeneID:3952993 Probab=89.58 E-value=0.026 Score=29.32 Aligned_cols=222 Identities=11% Similarity=0.101 Sum_probs=68.2 Q ss_pred CcccEEEEecceEEEEEEEEEEE--EecCCCHHHHHHHHHHHHHHHhhhcccCCCCcccHHHHHHHHHhcCCceEEEE-- Q lcl|NC_010811. 361 MYAAYFLWQDPIAIPRDVNVDVF--VFNSAIPSQVQQNSVTALTNLFAPRPGLLMTNFYTSDLIETVVNASPGQVSYV-- 436 (594) Q Consensus 361 ~~~~~v~V~~P~yv~v~V~~~v~--~~~~~~~~~v~~~v~~aL~~~~~p~~~~fG~~v~~S~l~~avi~~V~GVvd~v-- 436 (594) +. + ...-+.+- .-+|. +....+.+....++++.++-+.+|..+ +-+..- .+.++.+ ..-.|-..++ T Consensus 1 m~-i--~~~q~sF~----~GElsP~l~gR~Dl~ry~~q~~~~~N~~~~~~GG-l~rRpG-t~fva~~-~~~~g~~rLipf 70 (823) T protein:vir:95 1 MA-I--SWIQPSFA----GGEIGPSLYGRIDMAKYQVALRKCDNFIVRQYGG-VENRPG-TRFVGAA-KYPNRKCRLIPF 70 (823) T ss_pred Cc-c--eeechhcc----CceechheeccchHHHHHHHHhhhhCcEeeecCC-ceecCc-hhhhhhh-cCCCCCeeEEEE Confidence 00 0 00000000 00000 111234455555555444433333211 100000 0001111 0000100000 Q ss_pred -----------------EEeecccccc--------c-----------------------ccccccc---cccc-cccccc Q lcl|NC_010811. 437 -----------------RVNSPVGPMI--------V-----------------------TAPESPL---PTFT-ILPSLG 464 (594) Q Consensus 437 -----------------~l~~~~~~~~--------~-----------------------~~~~~~v---~~~~-~~ps~~ 464 (594) ++....+.+. + -+...+. .++. ..|... 
T Consensus 71 ~~s~~q~y~Lefg~~~irV~~~~g~vv~~~~~~~ev~tPy~~~~l~~Lr~~qsaD~~fivh~~~~p~~L~r~~~~~w~l~ 150 (823) T protein:vir:95 71 QFSTVQTYALEFGHQYMRVIKDGALVLNSSNVIYEIATPYTEADLFRIKFTQSADVLTLVHPAYPPKELRRYAHDNWQLV 150 (823) T ss_pred EeCCCcEEEEEEcCCeEEEEeCCcEEEecCCceeEEecccccccccceeEEEeccEEEEEcCCccceEEEecCCCCceEE Confidence 0000000000 0 0000000 0000 000000 Q ss_pred c----ccc----------------ccccceEee-----------------------ecCcc-ceecccc----ccccCcc Q lcl|NC_010811. 465 T----LGP----------------LVYAYAVST-----------------------TLTNG-DVGYPTK----WIFPQIT 496 (594) Q Consensus 465 s----~~~----------------~t~~~~v~a-----------------------v~t~g-~~~~~~~----~~~~~~t 496 (594) . .++ ..-.+.+++ +...+ ....+.. ....... T Consensus 151 ~~~~~~gp~~~~~~~~t~~v~~~~~~~~~t~ta~~~~~~~d~vg~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 230 (823) T protein:vir:95 151 DVVTKNGPFEDINIDESLTVYASASTGTITLTASASIFGAEQVGKLFYLEQPAVDSVPVWETSKSTSIGDIRRADSNYYR 230 (823) T ss_pred EEEEeccccccccccceeEEeccccCceeEEeecccccchhhccceEEEeccccceeeecceeeeecccceEEeccccee Confidence 0 000 000000010 00000 0000000 0000000 Q ss_pred cccccceEEE--------EecccCCccc------ceeeeecCCceeeE---EEecc-ceeEeecCCccCccccCCCCCcc Q lcl|NC_010811. 497 DPGNTHAITL--------TWPAVAGAAS------YQVWGRQAGGLGLL---ANVPA-TSTTFTDNGSITPSGGLPSSGEF 558 (594) Q Consensus 497 ~~~~~~~~~~--------~w~~v~ga~~------y~iy~~~~g~~~~~---~~~~~-~~~~~~d~~~~~~~~~~p~~~~~ 558 (594) +...++..++ .|..+.|+.. |++|+...|..-.. +.+.+ ..+++.+.+......+++.-.+- T Consensus 231 ~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~t~v~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~ 310 (823) T protein:vir:95 231 AVTAGKTGTLRPSHTEGTSWDGWGGSGDDDTGIEWEYLHSGFGIARITAVNGTTATAEVISYIPSQVVGEDNASYKWAKY 310 (823) T ss_pred eeeccccceeecccCCcceEEeceecccccceeEEEEEeCCcceEEEEeecceeeeceEeeeeccccccCCcCCcccccc Confidence 0001111122 2444445553 33333333432221 22222 22455555445555544443333 Q ss_pred ccccCCcc-CccceeeeeeeeecccccCCCceeeccC Q lcl|NC_010811. 
559 PIRYNSLS-SLTVNAYYSERQQPTITDADPTRLLGGQ 594 (594) Q Consensus 559 ~~~~~~~~-~~~~~~~~~~~~~~~~~~~~p~~~~~~~ 594 (594) | |..++ +|+.++||||||.|+++...|+.+...+ T Consensus 311 ~--~~~~~g~Ps~v~f~q~RL~f~g~~~~p~~v~~Sr 345 (823) T protein:vir:95 311 A--WNSVNGYPGTVVYYQQRLYFAASTAFPQTIWASR 345 (823) T ss_pred c--cCcCCCCccEEEEEeceEEEEEcCCCCcEEEEec Confidence 3 33333 5999999999999999999999999888 No 8 >protein:vir:1778 Length: 680 # NCBI annotation: tail protein A # Family: family:all:825 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570344;genbank:gi:18640503;genbank:GeneID:932716 Probab=85.15 E-value=0.055 Score=27.49 Aligned_cols=425 Identities=12% Similarity=-0.034 Sum_probs=105.0 Q ss_pred Eeeeeeeecccceeeecccccc-cCcceeEeecccceeeeecceeeEEEecccCCccceeEecccccccccceEEEecCc Q lcl|NC_010811. 101 TSSSVVSIPPLTQFSCAGNYFF-NRQQLTLAANSPTTVTLYEGQIYSYGMAGLGTERQTWVSTQDSFVISDQDVWVQVNG 179 (594) Q Consensus 101 ~~~~~~~i~~~~~~~~~~~~~~-~~~~~~~~~~~~~~v~~~~g~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~w~~V~~ 179 (594) .+.-.=+||....+++-+.+.. .+.|+....+..- .+.+|-. .|....... ...+ T Consensus 1 M~~v~~si~nl~~GvSqQp~~~r~pgQ~~~q~N~~~--d~v~Gl~----------kRpg~~~i~------------~l~~ 56 (680) T protein:vir:17 1 MAAVEQMVPNLLGGISQQPDPLKLPGQVKQARNVQL--DPTFGAL----------KRPGTELIM------------QVTG 56 (680) T ss_pred CccceecchhhhCcceecchhhcCcchhhhhhcccc--CcCcccc----------cCccceeee------------eccC Confidence 0000112232222222211100 0111110000000 0011110 000000000 0001 Q ss_pred cccCCccceeEEcccccceEEEeecCCe-------EEEEe---cccccccccccc---ccCC---C-CEEEE----EEe- Q lcl|NC_010811. 180 TFIPKSFGTLWNYDGLPAFADLTLSDGR-------LITVF---GNLGGVSGQFGT---IPQV---N-DIVTI----SYP- 237 (594) Q Consensus 180 ~~~~~~~dr~y~~d~~~~~~~~~~~dg~-------~~i~F---GD~~~~~G~~G~---~pp~---G-~~V~~----~Y~- 237 (594) ....+...+|..|....+++....+|. ..|-. |......+..|- -.+. + +.++. .|. 
T Consensus 57 -~~~~~~~~~~~rd~~e~~~~~~~~~g~~~~~~~~i~v~d~~~G~~~~v~~~~~~~~~~~~~~~~~~~~lr~~tv~d~tf 135 (680) T protein:vir:17 57 -IPKRAKWIPIMRDAREHYYVAIYREGANESGDLRIRVFDLKAGVERAVSFVGGEVEEYFPGDETDWEAIRSLTIGDYTF 135 (680) T ss_pred -CCCCceeEEEecCCCCeEEEEEEcCCCcccccceeEEEEccCCeEEEEEcCCCceEEEeecCCCCccceEEEEEcCEEE Confidence 111222223333333222332222221 11100 100000000010 0000 0 11110 000 Q ss_pred -e-------cCCccccccCcceeeeeccCcc----cceeeeccc-c-CCCCCCCCHHHHHHHHHHHHhc-------CCCC Q lcl|NC_010811. 238 -V-------TQGQAGDSIITLNKAVTVVGFS----SITGVATSN-P-TGGSSEKSIVAYKNVASGSFGT-------YSSA 296 (594) Q Consensus 238 -~-------g~G~~GN~v~ag~i~~l~~~~~----gV~~vn~~~-a-~GGaD~Es~e~~r~Rap~~lr~-------~~Ra 296 (594) + ..+..-+....+ +..+...-- .|.+ |... . ..+.+......-..-++..... -.+. T Consensus 136 i~N~~v~~~~~~~~~~~~~~g-~~~v~~~ayg~ty~v~i-ng~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~Ag~~t~~~ 213 (680) T protein:vir:17 136 LSNPNVQPTTWSRSFSRRPEG-LVTIGAAGYGTSYIVDF-ATEDSGQQRRWAVQEMQAPKTKRKKGDGSPDEAGETTVNN 213 (680) T ss_pred EECCeEEEeccCCCCCCCCee-EEEEEEeeeeeEEEEEE-eccccceeeeeeeeeeeccccccccccccCCCCcceeeee Confidence 0 000000100001 110000000 0000 0000 0 0000000000000000000000 0122 Q ss_pred cCHHHHHHHHHhCCCceEEEEecCCCCCCceEEEEEEecCCCcCCCHHHHHHHHHHHHHhhhcCCcccEEEEecceEEEE Q lcl|NC_010811. 297 VTKSQYQALIATYPGVKDAVTQAQREIDPGDVRWMNVIRVSALTSSPWTQDQIQSFLNYCQSVTMYAAYFLWQDPIAIPR 376 (594) Q Consensus 297 VT~~DYe~la~~~pgV~~a~~~~~~~~~~~~vv~i~v~~~~~~~~~~~~~~~v~~~l~~~~~~~~~~~~v~V~~P~yv~v 376 (594) ++..+|+...+. .. +.+... .++ .......+......... 
..+.+.+ ...+.+..+.| .| T Consensus 214 ~~~a~la~~l~~---~~--~~~~~~-~g~-~~~~~y~~~~~l~~tg~----~~~~~~~--------t~~v~~~G~~y-~I 273 (680) T protein:vir:17 214 WNGTGLSFRVKV---EA--RAFLVD-DGE-EYGHNYIPYVTLLTPGN----NTSPFPD--------TIRVDVSGEGW-DI 273 (680) T ss_pred eeeeeeeeeeee---cc--ceeeec-CCC-ceEEEEeeEEEEecCCc----cccccCc--------eEEEeccccee-EE Confidence 222222111110 00 001100 011 11111111000000000 0001110 01111111122 12 Q ss_pred EEEEEE-------------E--Ee---cCCCHHHHHHHHHHHHHHHhhhcccCCCCcccHHHHHHHHHhcCCceEEEEEE Q lcl|NC_010811. 377 DVNVDV-------------F--VF---NSAIPSQVQQNSVTALTNLFAPRPGLLMTNFYTSDLIETVVNASPGQVSYVRV 438 (594) Q Consensus 377 ~V~~~v-------------~--~~---~~~~~~~v~~~v~~aL~~~~~p~~~~fG~~v~~S~l~~avi~~V~GVvd~v~l 438 (594) .|.... . .. .....+.+.+++.++|..+=...-..-|..++.-.. +-++ .....+ T Consensus 274 sI~~~~~~~~~~~~~s~~~~t~~~~~a~~at~~~Ia~~L~~~i~~~~~~~~~~~g~~i~i~~~------~~~~-~~~~~~ 346 (680) T protein:vir:17 274 KVTKQIQSKVYANLGTAQFTTPVDQSGGGASTSDIVTGLSAAINGLGTFTAESIGNVIRVRYS------DPTR-TDEFTM 346 (680) T ss_pred EEccceeeEeccCccceeeeeccCCcccceeHHHHHHHHHHhhcccCcEEEEECCCEEEEEec------cCCC-ceEEEe Confidence 211111 0 11 112345555555555533200000001222222000 0001 000111 Q ss_pred ee--ccccccccccccccccccccccccccccccccceEeeecCccceecccc----ccccCcccccccce-E----EEE Q lcl|NC_010811. 439 NS--PVGPMIVTAPESPLPTFTILPSLGTLGPLVYAYAVSTTLTNGDVGYPTK----WIFPQITDPGNTHA-I----TLT 507 (594) Q Consensus 439 ~~--~~~~~~~~~~~~~v~~~~~~ps~~s~~~~t~~~~v~av~t~g~~~~~~~----~~~~~~t~~~~~~~-~----~~~ 507 (594) .. ...+......-..+. .....|..-+ .+|.|..+.+.+....+.- .........+.+.| . -+. 
T Consensus 347 ~~~~g~~~~~~~~~~~~v~---~~~~Lp~~a~--~g~~v~v~~~~~~~~~~Yyv~~~~~~~~~~~~~~~~W~E~~~~~~~ 421 (680) T protein:vir:17 347 SARGGTSGTGLESIKYSVD---TLAELPTKCW--NDYQVAVRNTQDTEVDDYYVKFETDVEDADVPGSGYWVETVKNGDD 421 (680) T ss_pred eccCCCCceeeeeeeeeec---cccccccccC--CCcEEEEEeCCCCcccceEEEEeccCcccCcccccceeecccCccc Confidence 10 000000000000111 1111222211 2344544443332221100 00011111122222 1 144 Q ss_pred ecccCCcccceeeeecCCceeeEEEec--cceeEeecCCccCccccCCCCCccccccCCccCccceeeeeeeeecccccC Q lcl|NC_010811. 508 WPAVAGAASYQVWGRQAGGLGLLANVP--ATSTTFTDNGSITPSGGLPSSGEFPIRYNSLSSLTVNAYYSERQQPTITDA 585 (594) Q Consensus 508 w~~v~ga~~y~iy~~~~g~~~~~~~~~--~~~~~~~d~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 585 (594) |..+.++..|++||...|.|.|.+ +. ...-.|.|. ..-|+...|. |=+......|+.++||||||+|++ T Consensus 422 ~~~~~~tmp~~l~r~~~g~f~~~~-~~~~~~~~~~~~r-~~Gdd~tnp~----psF~~~G~~p~~v~f~q~RL~f~s--- 492 (680) T protein:vir:17 422 GGLVDDTMPHVLVRNALGDFTFSS-LNNSSYGKTWADR-SVGSEDTNPH----PTFTESGNGIYGMFMYKNRLGFLT--- 492 (680) T ss_pred ceeccCcceEEEEEccCceeEEEe-ecccccccccccc-ccCCcccCCC----cccccCCCCceEEEEEcceEEEee--- Confidence 556778899999999999988864 21 111234333 2333333332 212223446999999999999985 Q ss_pred CCceeeccC Q lcl|NC_010811. 586 DPTRLLGGQ 594 (594) Q Consensus 586 ~p~~~~~~~ 594 (594) |+.+...+ T Consensus 493 -~~~v~~Sr 500 (680) T protein:vir:17 493 -QDAVIMSQ 500 (680) T ss_pred -CCeEEEEc Confidence 77777776 No 9 >protein:vir:93631 Length: 580 # NCBI annotation: Bcep22gp67 # Family: family:all:1544 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944296;genbank:gi:38640373;genbank:GeneID:2658280 Probab=37.97 E-value=1.1 Score=20.38 Aligned_cols=230 Identities=12% Similarity=0.050 Sum_probs=83.0 Q ss_pred HHHhhhcCCccc----EEEEecceEEEEEEEEEEEE---ecCCCHHHHH--HHHHHH-HHHHh-hhcccCCCCcccHHHH Q lcl|NC_010811. 
353 LNYCQSVTMYAA----YFLWQDPIAIPRDVNVDVFV---FNSAIPSQVQ--QNSVTA-LTNLF-APRPGLLMTNFYTSDL 421 (594) Q Consensus 353 l~~~~~~~~~~~----~v~V~~P~yv~v~V~~~v~~---~~~~~~~~v~--~~v~~a-L~~~~-~p~~~~fG~~v~~S~l 421 (594) +..++.-...+. ..+..+-.+-.+..++++.- .|-.....+. ..+..+ .+.+| +.. --+..+++ T Consensus 1 M~~i~i~~f~Ge~Prl~p~lLP~~~a~~a~n~~~~~G~i~P~~~~~~~~~~~~i~~~~~~t~~~~~~-----~W~~w~~~ 75 (580) T protein:vir:93 1 MTIIKITGFSGEIPRLVPRLLPDTAAQNATNARLESGGLTPYRKPKFITRISTIPAGQIETIYRNGE-----TWMAWDKP 75 (580) T ss_pred CeeEeecccccccccchhhhccccccceEEeeeccCCeeeeeeCchhhccccccCcCcceEEEecCc-----eeEEeCCc Confidence 111010000000 01111222333334444321 1111111110 000000 01111 000 00222233 Q ss_pred HHHHHhcCCceEEEEEEeeccc-ccccccc--cccccccccccccc-----ccccccccceEeeecCccceecccccccc Q lcl|NC_010811. 422 IETVVNASPGQVSYVRVNSPVG-PMIVTAP--ESPLPTFTILPSLG-----TLGPLVYAYAVSTTLTNGDVGYPTKWIFP 493 (594) Q Consensus 422 ~~avi~~V~GVvd~v~l~~~~~-~~~~~~~--~~~v~~~~~~ps~~-----s~~~~t~~~~v~av~t~g~~~~~~~~~~~ 493 (594) +.++ . -|-.-|.+-+..... .+..... ...++.-...|+.. .....+|.|.++-++..|....|.+... T Consensus 76 V~~i-~-~PvA~DRvy~Td~g~Pkvt~~g~sy~lgVpaPs~Apt~~~~g~g~l~~~~y~Yv~TfVt~~GeES~PS~~S~- 152 (580) T protein:vir:93 76 VYAA-P-GPVAADRLYVMGDGAPKMIVGGTTYPLAVPMPSAALTAATSGTGTGDVFSRVYVYTFVTGFGEESEPSAISN- 152 (580) T ss_pred eeee-c-CccccceeEEcCCcccceecCCccccccCCCcccCceeeecCCCCcCccceEEEEEEEcCCCCcCCCccccc- Confidence 2222 0 111123221111000 0111111 11222223333322 2234678999999999887766654221 Q ss_pred CcccccccceEEEEeccc-CCcccc---eeeeecCCc----eeeEEEeccceeEeecCCccCc-cccCCCCCccccccCC Q lcl|NC_010811. 494 QITDPGNTHAITLTWPAV-AGAASY---QVWGRQAGG----LGLLANVPATSTTFTDNGSITP-SGGLPSSGEFPIRYNS 564 (594) Q Consensus 494 ~~t~~~~~~~~~~~w~~v-~ga~~y---~iy~~~~g~----~~~~~~~~~~~~~~~d~~~~~~-~~~~p~~~~~~~~~~~ 564 (594) .++ ...+..++|+-.+. .|-.++ ||||-..|. |=+++++.++.++|+||+...+ ...+| +-+.. 
..| T Consensus 153 ~vt-v~~g~tVtLs~~p~p~~~~~i~~~RIYRS~tG~~gtdy~lVAel~Ag~~sF~Dd~s~a~Lge~Lp-s~~~~--~PP 228 (580) T protein:vir:93 153 EVN-WQAGQTVTLSGFQAAPAGRNITKQRIYRSQTSLSGTDLYFIAERDASAANFVDNVPLSDQNEPLP-SLEWN--APP 228 (580) T ss_pred cee-eCCCCeEEEEecCCCCCCCccceEEEEEeccCCCceeEEEEeeeccceeeeeecccccccccccc-hhhcc--CcC Confidence 222 23456678776554 333356 999877664 3388999999999999976655 22333 22221 111 Q ss_pred ccCccceeeeeeeeec-cc------ccCC------------CceeeccC Q lcl|NC_010811. 565 LSSLTVNAYYSERQQP-TI------TDAD------------PTRLLGGQ 594 (594) Q Consensus 565 ~~~~~~~~~~~~~~~~-~~------~~~~------------p~~~~~~~ 594 (594) .+.-.+.++.-=+|.. .+ -... +..+.+.+ T Consensus 229 ~~m~gL~~m~nGi~agF~Gnev~fsEpy~P~AWP~~yr~t~~~~Ivaia 277 (580) T protein:vir:93 229 DDLTGLISLPNGMMAAFRGKELWLCEPWRPHAWPQKYVLTMDYNIVALG 277 (580) T ss_pred CCcceEEeeccceEEEEeCCEEEEecCCCCccchhhcCCCCCCCceeEe Confidence 1111222222111111 00 0000 01122222 No 10 >protein:vir:7021 Length: 803 # NCBI annotation: tail protein # Family: family:all:825 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853594;genbank:gi:31711676;genbank:GeneID:1481802 Probab=7.32 E-value=9.1 Score=15.32 Aligned_cols=312 Identities=13% Similarity=0.058 Sum_probs=70.6 Q ss_pred CEEEEEEeecCCccccccCcceeee---eccCcccceee-ecc-ccCCCCCCCCHHHHHHHHHHHHhc-CCCCcCHH--- Q lcl|NC_010811. 230 DIVTISYPVTQGQAGDSIITLNKAV---TVVGFSSITGV-ATS-NPTGGSSEKSIVAYKNVASGSFGT-YSSAVTKS--- 300 (594) Q Consensus 230 ~~V~~~Y~~g~G~~GN~v~ag~i~~---l~~~~~gV~~v-n~~-~a~GGaD~Es~e~~r~Rap~~lr~-~~RaVT~~--- 300 (594) ..|. |+-.|.+ .| +++ +..-..+++.. |-. ...||. ++|-+-.+-+ ...+-... 
T Consensus 1 ~~v~-------~s~~nl~-~G-vSqQ~d~~R~~~q~~~~~N~~~~~~gGl--------~rRpGt~~va~l~~~~~~~~~~ 63 (803) T protein:vir:70 1 MEVQ-------GSLGRQI-QG-ISQQPPAVRLDGQCSEMVNMVPDVVEGT--------KSRMGTTHIAKLLEYGEDDMAV 63 (803) T ss_pred CeEE-------eecchhc-cc-cccCchHHhhhhhhhhhhcceeeecccc--------ccCChhhhhhhhcCCCccccee Confidence 1111 3333321 11 110 00111122222 211 223553 2332222211 00000000 Q ss_pred ------HH---HHHHHhCCCceEEEEecCCCCCCceEEEEEEecCCC----cCCCHHHHHHHHH-------HH------- Q lcl|NC_010811. 301 ------QY---QALIATYPGVKDAVTQAQREIDPGDVRWMNVIRVSA----LTSSPWTQDQIQS-------FL------- 353 (594) Q Consensus 301 ------DY---e~la~~~pgV~~a~~~~~~~~~~~~vv~i~v~~~~~----~~~~~~~~~~v~~-------~l------- 353 (594) ++ +++.+++.+ ...+++.. + |... .+..... ++.....++.|+- +| T Consensus 64 ~~~~~~~~~~e~~~~~~~~~-~~irv~~~---~-G~~~--~v~~~~~~~~~l~~~~~~~~~l~~~tvaD~~fi~n~~~~~ 136 (803) T protein:vir:70 64 HHYRRGGEGEEEYFFIMKKG-QVPEIFDK---Q-GRKC--MVQSQDAPMTYLSEVTNPREDVQFMTIADVTFMLNRKKIV 136 (803) T ss_pred eEEEecCCCceEEEEEEecC-CeEEEEEc---C-CcEE--EEecCCceeEEEeecCCChhheeEEEEcCEEEEecCceee Confidence 00 011111110 01111110 0 0000 0000000 0000000011100 00 Q ss_pred ---HHhhhcCCcccEEEEecceE-E--EEEEE----EEEEEecCCCHHHHHHHHHHHHHHHhhhcccCCCCcccHHHHHH Q lcl|NC_010811. 354 ---NYCQSVTMYAAYFLWQDPIA-I--PRDVN----VDVFVFNSAIPSQVQQNSVTALTNLFAPRPGLLMTNFYTSDLIE 423 (594) Q Consensus 354 ---~~~~~~~~~~~~v~V~~P~y-v--~v~V~----~~v~~~~~~~~~~v~~~v~~aL~~~~~p~~~~fG~~v~~S~l~~ 423 (594) ....+.......+.|..-.| . .|+|+ +.....++......... +-.-..+++.+ T Consensus 137 ~~~~~~~~~~~~~~~~~vr~g~y~~~y~itIng~~~a~~~t~~~~~~~~~~~~----------------~~~~ia~~l~~ 200 (803) T protein:vir:70 137 KARPERSPQVGSTAIVFMAYGQYGTHYKIIIDGVVAAGYKTRDGAEAHHIEDI----------------RTESIAYNLYQ 200 (803) T ss_pred eeccccCCCCCCceEEEEeecCCcceEEEEeCCcceEEEEeCCCccccccccc----------------chhhhhhhhhh Confidence 00000000000111211111 1 12221 11222222111100000 00001112221 Q ss_pred HHHhcCCceEEEEEEeecccccccccccccccccccccccccccc--ccccceEeeecCccceeccccccccCccccc-- Q lcl|NC_010811. 
424 TVVNASPGQVSYVRVNSPVGPMIVTAPESPLPTFTILPSLGTLGP--LVYAYAVSTTLTNGDVGYPTKWIFPQITDPG-- 499 (594) Q Consensus 424 avi~~V~GVvd~v~l~~~~~~~~~~~~~~~v~~~~~~ps~~s~~~--~t~~~~v~av~t~g~~~~~~~~~~~~~t~~~-- 499 (594) ++ .+.-++.++ ........+.+..... ...+.+..+.+..+. ....+.+....+-. ..+++. ...+++..+ T Consensus 201 ~~-~~~~s~a~~-~~~~~g~~~~i~~~~~-~~~~~~~t~~g~~~~~~~~~~~~v~~~~~Lp-~~~~~g-~~v~v~~~g~~ 275 (803) T protein:vir:70 201 SL-QSWDKIADY-EIQLDGTSIYITRRDG-STTFDITTEDGAKGKDLVAIKYKVASTDLLP-SRAPEG-YKVQVWPTGSK 275 (803) T ss_pred he-eccccccce-EEEECCcEEEEEEcCC-CCeeEEEeecCcCCcEEEEEEecccceeecc-ccCCCC-ceEEEEcCCCC Confidence 11 111110000 0000000000000000 000000000000000 00000000000000 000100 000000000 Q ss_pred --ccceE--------EEEecccCCcccceeeeecCCceeeEEEec---cceeEeecC----CccCccccCCCCCccc--c Q lcl|NC_010811. 500 --NTHAI--------TLTWPAVAGAASYQVWGRQAGGLGLLANVP---ATSTTFTDN----GSITPSGGLPSSGEFP--I 560 (594) Q Consensus 500 --~~~~~--------~~~w~~v~ga~~y~iy~~~~g~~~~~~~~~---~~~~~~~d~----~~~~~~~~~p~~~~~~--~ 560 (594) ...++ ...|...++.-.+..+....++..++.... .+..+|+.- -..-++...| .| + T Consensus 276 ~~d~y~v~~~~~~~~~~~w~e~a~~g~~~~~~~~t~p~~~v~~~~~~~~~~~~~~~~~~~~r~~gdd~tnp----~psf~ 351 (803) T protein:vir:70 276 PESRYWLQAEKQNGNIVSWKETLAADVLIGFDKSTMPYIIERTGFVNGIAQFKIRQGDWEDRKVGDDLTNP----MPSFI 351 (803) T ss_pred CCceeeEEEEeccCCccceEeeeccceeeeeecccccEEEEEEEEeecceeEEEEeeccccccccccccCc----ccccc Confidence 00111 124655544444444445556766664221 122222211 0011111122 23 2 Q ss_pred ccCCccCccceeeeeeeeecccccCCCceeeccC Q lcl|NC_010811. 561 RYNSLSSLTVNAYYSERQQPTITDADPTRLLGGQ 594 (594) Q Consensus 561 ~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~ 594 (594) .+.....|+.++||||||+|++ |+.+...+ T Consensus 352 ~~~~~~~~~~v~f~q~RL~f~~----~~~v~~Sr 381 (803) T protein:vir:70 352 DEEVPQTLGGMFMVQNRLCVTA----GEAVIATR 381 (803) T ss_pred CccCCCCceeEEEEeceEEEee----CCeEEEEc Confidence 2222245999999999999986 88888777 Done!