Query lcl|NC_016567.1_cdsid_YP_004957595.1 [gene=VPEG_00103] [protein=hypothetical protein] [protein_id=YP_004957595.1] [location=complement(72008..72631)] Match_columns 207 No_of_seqs 2 out of 5 Neff 1.5 Searched_HMMs 1612 Date Thu Nov 7 14:05:31 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_102 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_102_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:80320 Length: 188 97.9 4.7E-07 2.9E-10 55.3 11.8 167 1-202 4-188 (188) 2 protein:vir:1435 Length: 188 # 97.9 1.2E-06 7.3E-10 53.1 12.6 167 1-202 4-188 (188) 3 protein:vir:93592 Length: 108 95.9 9E-05 5.6E-08 42.8 7.3 107 1-201 2-108 (108) 4 protein:vir:100245 Length: 113 95.1 0.00032 2E-07 39.7 7.4 113 1-202 1-113 (113) 5 protein:vir:100103 Length: 120 94.4 0.00039 2.4E-07 39.3 6.3 115 1-202 5-120 (120) 6 protein:vir:486 Length: 107 # 92.7 0.0022 1.4E-06 35.1 7.6 106 1-200 1-107 (107) 7 protein:vir:192 Length: 108 # 91.7 0.0025 1.6E-06 34.8 6.6 101 1-207 6-106 (108) 8 protein:vir:1887 Length: 108 # 91.7 0.0025 1.6E-06 34.8 6.6 101 1-207 6-106 (108) 9 protein:vir:10365 Length: 115 90.7 0.0049 3E-06 33.3 7.1 113 3-203 1-115 (115) 10 protein:vir:4512 Length: 107 # 89.7 0.0078 4.9E-06 32.1 7.4 106 1-200 1-107 (107) 11 protein:vir:81069 Length: 115 88.3 0.011 6.9E-06 31.3 7.3 113 3-203 1-115 (115) 12 protein:vir:4458 Length: 107 # 88.3 0.012 7.5E-06 31.1 7.4 106 2-200 1-107 (107) 13 protein:vir:97069 Length: 115 83.1 0.031 1.9E-05 28.8 7.0 113 3-204 1-115 (115) 14 protein:vir:81159 Length: 95 # 64.4 0.23 0.00014 24.1 6.7 95 1-205 1-95 (95) 15 protein:vir:5742 Length: 110 # 51.0 0.59 0.00037 21.8 7.2 109 1-200 1-110 (110) 16 protein:vir:4831 Length: 105 # 50.0 0.6 0.00037 21.8 6.4 99 1-207 1-99 (105) 17 protein:vir:4857 Length: 104 # 49.1 0.54 0.00034 22.0 6.1 99 1-207 1-99 (104) 18 protein:vir:102158 Length: 99 40.1 0.98 0.00061 20.6 6.2 99 2-204 1-99 (99) No 1 >protein:vir:80320 Length: 188 # NCBI annotation: gp8, conserved hypothetical protein # Family: family:all:501 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111087;genbank:gi:134288682;genbank:GeneID:4960567 Probab=97.94 E-value=4.7e-07 Score=55.29 Aligned_cols=167 Identities=15% Similarity=0.201 Sum_probs=101.4 Q ss_pred Cccc--------chhHHHhhcccccchHHHHHHHHHHhhhhhhhhhhhhcccchhhhceeeeecCccceeecccCCCCCc Q lcl|NC_016567. 1 MILA--------SISELRQRLNVKDSEQYNRILASVLTTVTMRIGEHLKTGFDKETKENVFLATPANHLVSGALGRSNTS 72 (207) Q Consensus 1 m~la--------si~elr~r~nv~ds~~ynril~~~l~~vt~~i~~hl~t~f~~~~~~~~~~a~~~n~lvsga~gr~n~s 72 (207) ||+. +++|+|+.++|.++..=..++..++....-.+|.+....|-+.+.+-.+-..+... T Consensus 4 ~~~~~ppa~ePVtL~e~K~hLRid~~~eD~~l~~~lI~aA~~~~E~~~gr~l~~qt~~~~~~~~~~~~------------ 71 (188) T protein:vir:80 4 VLVEYLDDAEPLTFEEVAFQCRIDDDDERDFVERIVIPGARQAAESKSGAAIRKARYVERLSGFPLAE------------ 71 (188) T ss_pred eeeccCCCCcccCHHHHHHHcCCCCchhhHHHHHHHHHHHHHHHHHHhCCeeeeeeEEEEecCCCCCc------------ Confidence 4442 79999999999988764444455777777778887777777777665543322211 Q ss_pred chhhhhhhhccccccc-eEEecccccccccee----eeeeecccc---eEEeecc--cCCcceEEEEEcCcceeeeecce Q lcl|NC_016567. 73 SGVLSLWLDGFNVSNV-ELRICDDINASGVLT----TSIKVDSKG---IVSIYDG--WVDYDYFKVTFNSGFDIETVDAY 142 (207) Q Consensus 73 ~gvl~l~l~g~n~s~~-~~~~~~~~~~sg~~t----s~~~~~~~g---iv~~~~g--w~~~~y~~vt~~~gf~ietvd~~ 142 (207) +.|.---++.| +|+. +.++|..+ +.|.++..| .+-.-.| |-.-+-++|+|+.||. | T Consensus 72 -----i~Lp~~PV~sV~sV~~---~d~~G~~~~l~~~~y~l~~~~~~~~l~~~~~~~~p~~~~V~V~~~AG~~----~-- 137 (188) T protein:vir:80 72 -----ISLSVGQVIRVDSIEI---RDASGATTTLDADAFELVQLGREALLVPEGQARWPFARAVTITYQAGVD----L-- 137 (188) T ss_pred -----eEecccccceeeEEEE---EcCCCcEEeecccceEEeecCCCcEEEEecCCCCCCCceEEEEEEeccc----c-- Confidence 11211122222 1222 13456554 677776543 3433333 4445789999999994 2 Q ss_pred eEeecchhHHHHHHHHHhhhhhhhcccCcccccccccccCCCchhHHHHHHHhcCCcccc Q lcl|NC_016567. 143 PTYVGVPDKLKQACLMYAEHFFMAKYANGEVNEKDDDDYTDAPNGVSALLYSLSRSPIGC 202 (207) Q Consensus 143 ~~y~gvp~~~kqa~l~~aeh~fm~ky~~~~i~e~~~~dytd~pn~vs~~ly~~~r~~~~~ 202 (207) -||..||||.+|.+.|.|-.+ +-. .....-++-|.++..-|..=-|=|||- T Consensus 138 ----~vP~~ik~aill~va~~Ye~R----e~~-~~g~~~~~~P~~~v~~Ll~pyRvp~~~ 188 (188) T protein:vir:80 138 ----ARYPSVRTWMLLAAAWAYDHR----ELF-SEGQPIGEMPGGYADVLLNPITVPPRF 188 (188) T ss_pred ----cChHHHHHHHHHHHHHHHhcc----ccc-ccccccccccHHHHHHHhhccCCCCCC Confidence 279999999999999997533 100 012233567888654555556888888 No 2 >protein:vir:1435 Length: 188 # NCBI annotation: hypothetical protein # Family: family:all:501 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536364;genbank:gi:17975169;genbank:GeneID:929149 Probab=97.85 E-value=1.2e-06 Score=53.08 Aligned_cols=167 Identities=14% Similarity=0.156 Sum_probs=105.2 Q ss_pred Ccc----c----chhHHHhhcccccchHHHHHHHHHHhhhhhhhhhhhhcccchhhhceeeeecCccceeecccCCCCCc Q lcl|NC_016567. 1 MIL----A----SISELRQRLNVKDSEQYNRILASVLTTVTMRIGEHLKTGFDKETKENVFLATPANHLVSGALGRSNTS 72 (207) Q Consensus 1 m~l----a----si~elr~r~nv~ds~~ynril~~~l~~vt~~i~~hl~t~f~~~~~~~~~~a~~~n~lvsga~gr~n~s 72 (207) ||+ + +++|+|++++|.++..=..++..++....-.+|.++...|-+.+.+-.+=+-+... T Consensus 4 ~~~~~ppa~epVtLae~K~~lrid~~~eD~~l~~~li~aA~~~~E~~tgr~l~~qt~~~~~~~~~~~~------------ 71 (188) T protein:vir:14 4 VLVEYLDDAEPLTFEEVAFQCRIDDDDERDFVERVVIPGARQAAESKAGAAIRKARYVEHLSGFPPAE------------ 71 (188) T ss_pred eeeecCCCCCccCHHHHHHHcCCCCchhHHHHHHHHHHHHHHHHHHHhCCeeeeeeEEEEecCcCCCc------------ Confidence 333 1 79999999999988763444455777777788888887787777776653332211 Q ss_pred chhhhhhhhccccccc-eEEecccccccccee----eeeeecc---cceEEeeccc--CCcceEEEEEcCcceeeeecce Q lcl|NC_016567. 73 SGVLSLWLDGFNVSNV-ELRICDDINASGVLT----TSIKVDS---KGIVSIYDGW--VDYDYFKVTFNSGFDIETVDAY 142 (207) Q Consensus 73 ~gvl~l~l~g~n~s~~-~~~~~~~~~~sg~~t----s~~~~~~---~giv~~~~gw--~~~~y~~vt~~~gf~ietvd~~ 142 (207) +.|.---++.| +|+. ...+|... ++|.++. ++.+..-.|+ -.-.-++|+|+.||. | T Consensus 72 -----~~Lp~~Pv~sV~sV~~---~d~~g~~~~l~~~~y~l~~~~~~~~l~~~~~~~~p~~~~V~V~~~AG~~----~-- 137 (188) T protein:vir:14 72 -----VPLSVGQVISVDSIEI---RDASGATTTLDAGAFELVQLGRETLLVPAGQARWPYARAVTIKYQAGID----L-- 137 (188) T ss_pred -----eEecccCcceeeEEEE---EcCCCceEeecccceEEeecCCCcEEEEecCCCCCCCceEEEEEEecCc----c-- Confidence 22222223222 1222 12345433 5777764 3555555553 234679999999995 2 Q ss_pred eEeecchhHHHHHHHHHhhhhhhhcccCcccccccccccCCCchhHHHHHHHhcCCcccc Q lcl|NC_016567. 143 PTYVGVPDKLKQACLMYAEHFFMAKYANGEVNEKDDDDYTDAPNGVSALLYSLSRSPIGC 202 (207) Q Consensus 143 ~~y~gvp~~~kqa~l~~aeh~fm~ky~~~~i~e~~~~dytd~pn~vs~~ly~~~r~~~~~ 202 (207) -||..||||++|.+.|.|-.+ | +......-++-|-++.+-|.+=-|=||+- T Consensus 138 ----~vP~~ik~Aill~va~~Y~~R----e-~~~~g~~~~~lP~~~v~~Ll~pyRvP~~~ 188 (188) T protein:vir:14 138 ----ARYPSVRSWMLLAAAWAYDHR----E-LYSDGQPMGEMPGGYSDVLLNPITVPPRF 188 (188) T ss_pred ----CchHHHHHHHHHHHHHHHhcc----c-ccccccccccccHHHHHHHhhccCCCCCC Confidence 279999999999999997644 1 11122344567888655566666889998 No 3 >protein:vir:93592 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:1879 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449297;genbank:gi:157166045;uniprot:Q6H9U4;genbank:GeneID:5580414 Probab=95.92 E-value=9e-05 Score=42.77 Aligned_cols=107 Identities=17% Similarity=0.167 Sum_probs=72.0 Q ss_pred CcccchhHHHhhcccccchHHHHHHHHHHhhhhhhhhhhhhcccchhhhceeeeecCccceeecccCCCCCcchhhhhhh Q lcl|NC_016567. 1 MILASISELRQRLNVKDSEQYNRILASVLTTVTMRIGEHLKTGFDKETKENVFLATPANHLVSGALGRSNTSSGVLSLWL 80 (207) Q Consensus 1 m~lasi~elr~r~nv~ds~~ynril~~~l~~vt~~i~~hl~t~f~~~~~~~~~~a~~~n~lvsga~gr~n~s~gvl~l~l 80 (207) |-+-+++|.++.++|..+. =...|...+..-+-.|..|++.+.++. T Consensus 2 m~~vtLeevK~hLRId~d~-dD~li~~~i~aA~~~v~~~l~~~~~~~--------------------------------- 47 (108) T protein:vir:93 2 TALLTLEEIKAHLRVDHDA-DDDMLMDKVRQATAVLLAYIQGSRDKV--------------------------------- 47 (108) T ss_pred CcCCCHHHHHHHcCCCCCc-ChHHHHHHHHHHHHHHHHHhccccccc--------------------------------- Confidence 8899999999999996665 356677777777777777776443211 Q ss_pred hccccccceEEeccccccccceeeeeeecccceEEeecccCCcceEEEEEcCcceeeeecceeEeecchhHHHHHHHHHh Q lcl|NC_016567. 81 DGFNVSNVELRICDDINASGVLTTSIKVDSKGIVSIYDGWVDYDYFKVTFNSGFDIETVDAYPTYVGVPDKLKQACLMYA 160 (207) Q Consensus 81 ~g~n~s~~~~~~~~~~~~sg~~ts~~~~~~~giv~~~~gw~~~~y~~vt~~~gf~ietvd~~~~y~gvp~~~kqa~l~~a 160 (207) . -+ ++.+....+|..+|||+|+.+ T Consensus 48 ----------------------------------------~--------------~~--~~~~~~~~~~~~i~~AvLlLv 71 (108) T protein:vir:93 48 ----------------------------------------I--------------RE--DGELIPGEALTRMKGAAMRLT 71 (108) T ss_pred ----------------------------------------c--------------cc--ccccccccCChHHHHHHHHHH Confidence 0 00 112333456889999999999 Q ss_pred hhhhhhcccCcccccccccccCCCchhHHHHHHHhcCCccc Q lcl|NC_016567. 161 EHFFMAKYANGEVNEKDDDDYTDAPNGVSALLYSLSRSPIG 201 (207) Q Consensus 161 eh~fm~ky~~~~i~e~~~~dytd~pn~vs~~ly~~~r~~~~ 201 (207) .|++-.+=+.++.. -.=...|.+|+.||++|-|.++- T Consensus 72 ~~~YenRe~~~~~~----~~~~elP~~v~~Ll~~~R~p~~~ 108 (108) T protein:vir:93 72 GMLYRNPDLAEREE----LLQGELPFSVSVLIYDLRCPTVL 108 (108) T ss_pred HHHHhccccccccc----cccccCCHHHHHHHHHccccccC Confidence 99965443222111 01124799999999999887665 No 4 >protein:vir:100245 Length: 113 # NCBI annotation: gp74 # Family: family:all:363 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355410;genbank:gi:77864700;genbank:GeneID:3725967 Probab=95.06 E-value=0.00032 Score=39.73 Aligned_cols=113 Identities=17% Similarity=0.144 Sum_probs=74.2 Q ss_pred CcccchhHHHhhcccccchHHHHHHHHHHhhhhhhhhhhhhcccchhhhceeeeecCccceeecccCCCCCcchhhhhhh Q lcl|NC_016567. 1 MILASISELRQRLNVKDSEQYNRILASVLTTVTMRIGEHLKTGFDKETKENVFLATPANHLVSGALGRSNTSSGVLSLWL 80 (207) Q Consensus 1 m~lasi~elr~r~nv~ds~~ynril~~~l~~vt~~i~~hl~t~f~~~~~~~~~~a~~~n~lvsga~gr~n~s~gvl~l~l 80 (207) |.+-+++|.++-+.|.+++. +-.|..++....-.|+.+++-+|....-+.. T Consensus 1 M~~vtLee~K~hLRvd~d~d-D~lI~~li~AA~~~ve~~l~r~l~~~~~~~~---------------------------- 51 (113) T protein:vir:10 1 MALVELKLALGFVRANAGVE-DDVVQMLLDAATQSAVDYLNRQVFETEDAMT---------------------------- 51 (113) T ss_pred CCCCCHHHHHHHcCCCCCcc-hHHHHHHHHHHHHHHHHHhCccccccccccc---------------------------- Confidence 99999999999999987753 6778888888888888888766532110000 Q ss_pred hccccccceEEeccccccccceeeeeeecccceEEeecccCCcceEEEEEcCcceeeeecceeEeecchhHHHHHHHHHh Q lcl|NC_016567. 81 DGFNVSNVELRICDDINASGVLTTSIKVDSKGIVSIYDGWVDYDYFKVTFNSGFDIETVDAYPTYVGVPDKLKQACLMYA 160 (207) Q Consensus 81 ~g~n~s~~~~~~~~~~~~sg~~ts~~~~~~~giv~~~~gw~~~~y~~vt~~~gf~ietvd~~~~y~gvp~~~kqa~l~~a 160 (207) +.+|..+..+.-.=+|..+|||+|+.. T Consensus 52 -----------------------------------------------------~~~~~~~~~~~~~~~p~~i~~AvLllv 78 (113) T protein:vir:10 52 -----------------------------------------------------TAIEAGTAGQNPMVVNAAIRAAILKIT 78 (113) T ss_pred -----------------------------------------------------cccccccccccccccChHHHHHHHHHH Confidence 000000000011118999999999999 Q ss_pred hhhhhhcccCcccccccccccCCCchhHHHHHHHhcCCcccc Q lcl|NC_016567. 161 EHFFMAKYANGEVNEKDDDDYTDAPNGVSALLYSLSRSPIGC 202 (207) Q Consensus 161 eh~fm~ky~~~~i~e~~~~dytd~pn~vs~~ly~~~r~~~~~ 202 (207) .|.+--+ | +..+..-+.-|-+|..||..| |.-+|. T Consensus 79 ~~~Y~nR----e--~~~~~~~~~lP~~v~~Ll~~y-R~~~g~ 113 (113) T protein:vir:10 79 AELYANR----E--DTAFGPITELPLNARALLRPH-RIIPGV 113 (113) T ss_pred HHHHhhh----h--hhchhhhhccCHHHHHHHHHh-hhhcCC Confidence 9996433 2 111222346799999999887 666676 No 5 >protein:vir:100103 Length: 120 # NCBI annotation: gp7 # Family: family:all:363 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945037;genbank:gi:38707897;genbank:GeneID:2744150 Probab=94.39 E-value=0.00039 Score=39.25 Aligned_cols=115 Identities=14% Similarity=0.203 Sum_probs=75.7 Q ss_pred CcccchhHHHhhcccccchHHHHHHHHHHhhhhhhhhhhhhcccchhhhceeeeecCccceeecccCCCCCcchhhhhhh Q lcl|NC_016567. 1 MILASISELRQRLNVKDSEQYNRILASVLTTVTMRIGEHLKTGFDKETKENVFLATPANHLVSGALGRSNTSSGVLSLWL 80 (207) Q Consensus 1 m~lasi~elr~r~nv~ds~~ynril~~~l~~vt~~i~~hl~t~f~~~~~~~~~~a~~~n~lvsga~gr~n~s~gvl~l~l 80 (207) |-+-+++|.++-++|... .=...|.+++....-.|+.|++.+|.....+.. T Consensus 5 m~~vtL~e~K~hLRvd~d-~DD~lI~~~i~AA~~~v~~~~~r~l~~~~~~~~---------------------------- 55 (120) T protein:vir:10 5 TPIVSLEVALAHLREDAG-VADDLIKIYIGAATQSASDYVDRKLYANDAEMQ---------------------------- 55 (120) T ss_pred CCccCHHHHHHHcCCCCC-cchHHHHHHHHHHHHHHHHHhCCcccccccccc---------------------------- Confidence 999999999999999644 447788888888888888888777742211100 Q ss_pred hccccccceEEeccccccccceeeeeeecccceEEeecccCCcceEEEEEcCcceeeeecceeEeecchhHHHHHHHHHh Q lcl|NC_016567. 81 DGFNVSNVELRICDDINASGVLTTSIKVDSKGIVSIYDGWVDYDYFKVTFNSGFDIETVDAYPTYVGVPDKLKQACLMYA 160 (207) Q Consensus 81 ~g~n~s~~~~~~~~~~~~sg~~ts~~~~~~~giv~~~~gw~~~~y~~vt~~~gf~ietvd~~~~y~gvp~~~kqa~l~~a 160 (207) ++ . +.+..| +...-+|..+|+|+||.+ T Consensus 56 ----------------~~--------------~--------------~~~~~~---------~~~~~~~~~i~~AvLllv 82 (120) T protein:vir:10 56 ----------------AA--------------V--------------ADATAG---------ADPIVANDAIRAAILLTI 82 (120) T ss_pred ----------------hh--------------h--------------hccccc---------cccccCCHHHHHHHHHHH Confidence 00 0 000000 011227999999999999 Q ss_pred hhhhhhcccCccc-ccccccccCCCchhHHHHHHHhcCCcccc Q lcl|NC_016567. 161 EHFFMAKYANGEV-NEKDDDDYTDAPNGVSALLYSLSRSPIGC 202 (207) Q Consensus 161 eh~fm~ky~~~~i-~e~~~~dytd~pn~vs~~ly~~~r~~~~~ 202 (207) .|++- |-|- ....+..-+.-|-+|..||.+|. .-+|. T Consensus 83 g~~Ye----nRe~~~~~~~~~~~~lP~~v~~Ll~~yR-~~~gv 120 (120) T protein:vir:10 83 GKLYA----FREDVVSGASASVTELPSGAKSLLFPYR-VGLGV 120 (120) T ss_pred HHHHh----chhhhhhcccccccccCHHHHHHHHHhh-hccCC Confidence 99864 4332 22233445678999999999884 44555 No 6 >protein:vir:486 Length: 107 # NCBI annotation: hypothetical protein # Family: family:all:363 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543093;swissprot:trembl:q8w626;genbank:gi:18249905;uniprot:Q8W626;genbank:GeneID:929692 Probab=92.74 E-value=0.0022 Score=35.14 Aligned_cols=106 Identities=17% Similarity=0.252 Sum_probs=70.9 Q ss_pred CcccchhHHHhhcccccc-hHHHHHHHHHHhhhhhhhhhhhhcccchhhhceeeeecCccceeecccCCCCCcchhhhhh Q lcl|NC_016567. 1 MILASISELRQRLNVKDS-EQYNRILASVLTTVTMRIGEHLKTGFDKETKENVFLATPANHLVSGALGRSNTSSGVLSLW 79 (207) Q Consensus 1 m~lasi~elr~r~nv~ds-~~ynril~~~l~~vt~~i~~hl~t~f~~~~~~~~~~a~~~n~lvsga~gr~n~s~gvl~l~ 79 (207) |+ -+++|+++-++|... ..=.-.|..++....-.|+.+++-++..-+.+. T Consensus 1 M~-vtL~e~K~hLRid~D~~ddD~li~~~i~aA~~~i~~~~~r~l~~~~~~~---------------------------- 51 (107) T protein:vir:48 1 ML-LKEEEIKSHLRLDDGLYSDGDFLKLLAQAVQKRTETYLNRKLYAPEETI---------------------------- 51 (107) T ss_pred CC-CCHHHHHHHcCCCCCCchhHHHHHHHHHHHHHHHHHHhccccccccccc---------------------------- Confidence 76 689999999999632 122567777887777778887776654311110 Q ss_pred hhccccccceEEeccccccccceeeeeeecccceEEeecccCCcceEEEEEcCcceeeeecceeEeecchhHHHHHHHHH Q lcl|NC_016567. 80 LDGFNVSNVELRICDDINASGVLTTSIKVDSKGIVSIYDGWVDYDYFKVTFNSGFDIETVDAYPTYVGVPDKLKQACLMY 159 (207) Q Consensus 80 l~g~n~s~~~~~~~~~~~~sg~~ts~~~~~~~giv~~~~gw~~~~y~~vt~~~gf~ietvd~~~~y~gvp~~~kqa~l~~ 159 (207) -...|....+|.-+|||+||. T Consensus 52 -----------------------------------------------------------~~~~~~~~~~~~~ik~Avlll 72 (107) T protein:vir:48 52 -----------------------------------------------------------PEDDPDGMHLTDDVRLAMLML 72 (107) T ss_pred -----------------------------------------------------------cccCccccccchhHHHHHHHH Confidence 011111223799999999999 Q ss_pred hhhhhhhcccCcccccccccccCCCchhHHHHHHHhcCCcc Q lcl|NC_016567. 160 AEHFFMAKYANGEVNEKDDDDYTDAPNGVSALLYSLSRSPI 200 (207) Q Consensus 160 aeh~fm~ky~~~~i~e~~~~dytd~pn~vs~~ly~~~r~~~ 200 (207) +.|.+--+=+ +.+ ..=+.-|.||..||.+|-|-+. T Consensus 73 v~~~Y~NRe~---v~~---~~~~~iP~~v~~LL~~yR~~~l 107 (107) T protein:vir:48 73 VSHFYENRST---ITD---VEKLETPMSFRWLAGPYRIVPL 107 (107) T ss_pred HHHHHhhhhh---hcc---ccccccCHHHHHHHHHhhccCC Confidence 9999764421 111 1224579999999999988777 No 7 >protein:vir:192 Length: 108 # NCBI annotation: Gp6 # Family: family:all:6764 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037702;genbank:gi:9634167;genbank:GeneID:1262532 Probab=91.70 E-value=0.0025 Score=34.84 Aligned_cols=101 Identities=12% Similarity=0.143 Sum_probs=66.3 Q ss_pred CcccchhHHHhhcccccchHHHHHHHHHHhhhhhhhhhhhhcccchhhhceeeeecCccceeecccCCCCCcchhhhhhh Q lcl|NC_016567. 1 MILASISELRQRLNVKDSEQYNRILASVLTTVTMRIGEHLKTGFDKETKENVFLATPANHLVSGALGRSNTSSGVLSLWL 80 (207) Q Consensus 1 m~lasi~elr~r~nv~ds~~ynril~~~l~~vt~~i~~hl~t~f~~~~~~~~~~a~~~n~lvsga~gr~n~s~gvl~l~l 80 (207) |-+-+++|.++.++|... .=...+..++....-.|+.+++.+|... +. T Consensus 6 M~~vtLee~K~hLRid~d-ddD~lI~~~i~AA~~~v~~~~~~~~~~~---------~~---------------------- 53 (108) T protein:vir:19 6 LDVISLSLFKQQIEFEED-DRDELITLYAQAAFDYCMRWCDEPAWKV---------AA---------------------- 53 (108) T ss_pred ccccCHHHHHHHcCCCCC-cchHHHHHHHHHHHHHHHHHhCCccccc---------cc---------------------- Confidence 889999999999999544 4566777888888878887776443210 00 Q ss_pred hccccccceEEeccccccccceeeeeeecccceEEeecccCCcceEEEEEcCcceeeeecceeEeecchhHHHHHHHHHh Q lcl|NC_016567. 81 DGFNVSNVELRICDDINASGVLTTSIKVDSKGIVSIYDGWVDYDYFKVTFNSGFDIETVDAYPTYVGVPDKLKQACLMYA 160 (207) Q Consensus 81 ~g~n~s~~~~~~~~~~~~sg~~ts~~~~~~~giv~~~~gw~~~~y~~vt~~~gf~ietvd~~~~y~gvp~~~kqa~l~~a 160 (207) .+|.-+|+|+|+.+ T Consensus 54 ------------------------------------------------------------------~~p~~ik~AiLllv 67 (108) T protein:vir:19 54 ------------------------------------------------------------------DIPAAVKGAVLLVF 67 (108) T ss_pred ------------------------------------------------------------------ccchHHHHHHHHHH Confidence 05778999999999 Q ss_pred hhhhhhcccCcccccccccccCCCchhHHHHHHHhcCCccccccccC Q lcl|NC_016567. 161 EHFFMAKYANGEVNEKDDDDYTDAPNGVSALLYSLSRSPIGCLTPII 207 (207) Q Consensus 161 eh~fm~ky~~~~i~e~~~~dytd~pn~vs~~ly~~~r~~~~~l~~ii 207 (207) .|++-.+ | ...+..-+-|| +|..||.-| |+-+|--.+=- T Consensus 68 ~~~YenR----E--~~~~~~~~~~~-~~~~LL~pY-R~~~g~~~~~~ 106 (108) T protein:vir:19 68 ADMFEHR----T--AQSEVQLYENA-AAERMMFIH-RNWRGKAESEE 106 (108) T ss_pred HHHHhcc----c--ccccchhhhhH-HHHHHHHHH-HhcCCCCCccc Confidence 9997444 2 12233344444 999999776 44333222111 No 8 >protein:vir:1887 Length: 108 # NCBI annotation: gp6 # Family: family:all:6764 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037667;genbank:gi:9634125;genbank:GeneID:1262472 Probab=91.70 E-value=0.0025 Score=34.84 Aligned_cols=101 Identities=12% Similarity=0.143 Sum_probs=66.3 Q ss_pred CcccchhHHHhhcccccchHHHHHHHHHHhhhhhhhhhhhhcccchhhhceeeeecCccceeecccCCCCCcchhhhhhh Q lcl|NC_016567. 1 MILASISELRQRLNVKDSEQYNRILASVLTTVTMRIGEHLKTGFDKETKENVFLATPANHLVSGALGRSNTSSGVLSLWL 80 (207) Q Consensus 1 m~lasi~elr~r~nv~ds~~ynril~~~l~~vt~~i~~hl~t~f~~~~~~~~~~a~~~n~lvsga~gr~n~s~gvl~l~l 80 (207) |-+-+++|.++.++|... .=...+..++....-.|+.+++.+|... +. T Consensus 6 M~~vtLee~K~hLRid~d-ddD~lI~~~i~AA~~~v~~~~~~~~~~~---------~~---------------------- 53 (108) T protein:vir:18 6 LDVISLSLFKQQIEFEED-DRDELITLYAQAAFDYCMRWCDEPAWKV---------AA---------------------- 53 (108) T ss_pred ccccCHHHHHHHcCCCCC-cchHHHHHHHHHHHHHHHHHhCCccccc---------cc---------------------- Confidence 889999999999999544 4566777888888878887776443210 00 Q ss_pred hccccccceEEeccccccccceeeeeeecccceEEeecccCCcceEEEEEcCcceeeeecceeEeecchhHHHHHHHHHh Q lcl|NC_016567. 81 DGFNVSNVELRICDDINASGVLTTSIKVDSKGIVSIYDGWVDYDYFKVTFNSGFDIETVDAYPTYVGVPDKLKQACLMYA 160 (207) Q Consensus 81 ~g~n~s~~~~~~~~~~~~sg~~ts~~~~~~~giv~~~~gw~~~~y~~vt~~~gf~ietvd~~~~y~gvp~~~kqa~l~~a 160 (207) .+|.-+|+|+|+.+ T Consensus 54 ------------------------------------------------------------------~~p~~ik~AiLllv 67 (108) T protein:vir:18 54 ------------------------------------------------------------------DIPAAVKGAVLLVF 67 (108) T ss_pred ------------------------------------------------------------------ccchHHHHHHHHHH Confidence 05778999999999 Q ss_pred hhhhhhcccCcccccccccccCCCchhHHHHHHHhcCCccccccccC Q lcl|NC_016567. 161 EHFFMAKYANGEVNEKDDDDYTDAPNGVSALLYSLSRSPIGCLTPII 207 (207) Q Consensus 161 eh~fm~ky~~~~i~e~~~~dytd~pn~vs~~ly~~~r~~~~~l~~ii 207 (207) .|++-.+ | ...+..-+-|| +|..||.-| |+-+|--.+=- T Consensus 68 ~~~YenR----E--~~~~~~~~~~~-~~~~LL~pY-R~~~g~~~~~~ 106 (108) T protein:vir:18 68 ADMFEHR----T--AQSEVQLYENA-AAERMMFIH-RNWRGKAESEE 106 (108) T ss_pred HHHHhcc----c--ccccchhhhhH-HHHHHHHHH-HhcCCCCCccc Confidence 9997444 2 12233344444 999999776 44333222111 No 9 >protein:vir:10365 Length: 115 # NCBI annotation: conserved phage protein # Family: family:all:363 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858957;genbank:gi:32128422;genbank:GeneID:2648387 Probab=90.67 E-value=0.0049 Score=33.26 Aligned_cols=113 Identities=7% Similarity=0.080 Sum_probs=69.6 Q ss_pred ccchhHHHhhcccc--cchHHHHHHHHHHhhhhhhhhhhhhcccchhhhceeeeecCccceeecccCCCCCcchhhhhhh Q lcl|NC_016567. 3 LASISELRQRLNVK--DSEQYNRILASVLTTVTMRIGEHLKTGFDKETKENVFLATPANHLVSGALGRSNTSSGVLSLWL 80 (207) Q Consensus 3 lasi~elr~r~nv~--ds~~ynril~~~l~~vt~~i~~hl~t~f~~~~~~~~~~a~~~n~lvsga~gr~n~s~gvl~l~l 80 (207) +-|++|+++-++|. |...=.-.|..++....-.|+.+++-+|-..+.+...- +.+ T Consensus 1 mvtLe~~K~hLRid~~d~d~dD~li~~~i~AA~~~v~~~~~r~l~~~~~~~~~~-------~~~---------------- 57 (115) T protein:vir:10 1 MITLAMVQRHLQAELYEDDERDYVMQQLLPAARESAELFINRKLYDTQADMLAD-------QAA---------------- 57 (115) T ss_pred CCCHHHHHHHcCCCCCCCchhhHHHHHHHHHHHHHHHHHhCCcccccccccccc-------ccc---------------- Confidence 34899999999984 54445667778888888788888876653221111000 000 Q ss_pred hccccccceEEeccccccccceeeeeeecccceEEeecccCCcceEEEEEcCcceeeeecceeEeecchhHHHHHHHHHh Q lcl|NC_016567. 81 DGFNVSNVELRICDDINASGVLTTSIKVDSKGIVSIYDGWVDYDYFKVTFNSGFDIETVDAYPTYVGVPDKLKQACLMYA 160 (207) Q Consensus 81 ~g~n~s~~~~~~~~~~~~sg~~ts~~~~~~~giv~~~~gw~~~~y~~vt~~~gf~ietvd~~~~y~gvp~~~kqa~l~~a 160 (207) .++.+.-.-+|.-+|||+||.. T Consensus 58 ----------------------------------------------------------~~~~~~~~~~p~~i~~AiLLlv 79 (115) T protein:vir:10 58 ----------------------------------------------------------GVDPAGQLLITRTVEQAILLTV 79 (115) T ss_pred ----------------------------------------------------------ccCCcccccCChHHHHHHHHHH Confidence 0000111128999999999999 Q ss_pred hhhhhhcccCcccccccccccCCCchhHHHHHHHhcCCccccc Q lcl|NC_016567. 161 EHFFMAKYANGEVNEKDDDDYTDAPNGVSALLYSLSRSPIGCL 203 (207) Q Consensus 161 eh~fm~ky~~~~i~e~~~~dytd~pn~vs~~ly~~~r~~~~~l 203 (207) .|.|--+=+.+ +..-+.-|.+|..||..|- .-+|.- T Consensus 80 g~~Y~nRe~~~------~~~~~elP~~v~~LL~pyR-~~~gv~ 115 (115) T protein:vir:10 80 GEWYANREQVW------VKGVGLVTSSAQNLLHPYR-KFAGVR 115 (115) T ss_pred HHHHhcchhcc------cchhhhcCHHHHHHHHHHH-hcCCCC Confidence 99976542211 1122467999999999994 444433 No 10 >protein:vir:4512 Length: 107 # NCBI annotation: unknown # Family: family:all:363 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599038;genbank:gi:19548996;genbank:GeneID:935218 Probab=89.69 E-value=0.0078 Score=32.13 Aligned_cols=106 Identities=17% Similarity=0.274 Sum_probs=73.7 Q ss_pred CcccchhHHHhhcccc-cchHHHHHHHHHHhhhhhhhhhhhhcccchhhhceeeeecCccceeecccCCCCCcchhhhhh Q lcl|NC_016567. 1 MILASISELRQRLNVK-DSEQYNRILASVLTTVTMRIGEHLKTGFDKETKENVFLATPANHLVSGALGRSNTSSGVLSLW 79 (207) Q Consensus 1 m~lasi~elr~r~nv~-ds~~ynril~~~l~~vt~~i~~hl~t~f~~~~~~~~~~a~~~n~lvsga~gr~n~s~gvl~l~ 79 (207) |+ -+++|+++-++|. |...=+.+|..++....-.|+.+++-+|...+.+. +...+ T Consensus 1 M~-vtL~e~K~hLRId~D~~ddD~lI~~~i~AA~~~i~~~~~r~~~~~~~~~-~~~~~---------------------- 56 (107) T protein:vir:45 1 ML-LKMEEIKLQLRLDDDFSDEDELLELLGKAAQSRTENFLNRKLYATADDR-PADDP---------------------- 56 (107) T ss_pred CC-CCHHHHHHHcCCCCCCchhHHHHHHHHHHHHHHHHHHhccccccccccc-ccccc---------------------- Confidence 55 5899999999995 43233788888998888899888877776544321 11100 Q ss_pred hhccccccceEEeccccccccceeeeeeecccceEEeecccCCcceEEEEEcCcceeeeecceeEeecchhHHHHHHHHH Q lcl|NC_016567. 80 LDGFNVSNVELRICDDINASGVLTTSIKVDSKGIVSIYDGWVDYDYFKVTFNSGFDIETVDAYPTYVGVPDKLKQACLMY 159 (207) Q Consensus 80 l~g~n~s~~~~~~~~~~~~sg~~ts~~~~~~~giv~~~~gw~~~~y~~vt~~~gf~ietvd~~~~y~gvp~~~kqa~l~~ 159 (207) .-.-+|.-+|+|+||. T Consensus 57 ----------------------------------------------------------------~~~~~~~~~~~AvLll 72 (107) T protein:vir:45 57 ----------------------------------------------------------------DGLVISDDVKLALLLL 72 (107) T ss_pred ----------------------------------------------------------------ccccCChhHHHHHHHH Confidence 0112699999999999 Q ss_pred hhhhhhhcccCcccccccccccCCCchhHHHHHHHhcCCcc Q lcl|NC_016567. 160 AEHFFMAKYANGEVNEKDDDDYTDAPNGVSALLYSLSRSPI 200 (207) Q Consensus 160 aeh~fm~ky~~~~i~e~~~~dytd~pn~vs~~ly~~~r~~~ 200 (207) .-|.+-.+-+-+ +..-..-|.||..||.+|-|-|- T Consensus 73 v~~~Y~NRe~~~------~~~~~~lp~~v~~Ll~~~R~~~~ 107 (107) T protein:vir:45 73 VSHFYENRSTVT------DVEKMELPMSFNWLVAPYRLIPL 107 (107) T ss_pred HHHHHhhhhhcc------ccchhccchHHHHHHHHHhhcCC Confidence 999875543222 12223469999999999988776 No 11 >protein:vir:81069 Length: 115 # NCBI annotation: p10 # Family: family:all:363 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285680;genbank:gi:148727188;genbank:GeneID:5247114 Probab=88.28 E-value=0.011 Score=31.28 Aligned_cols=113 Identities=8% Similarity=0.107 Sum_probs=67.0 Q ss_pred ccchhHHHhhcccc--cchHHHHHHHHHHhhhhhhhhhhhhcccchhhhceeeeecCccceeecccCCCCCcchhhhhhh Q lcl|NC_016567. 3 LASISELRQRLNVK--DSEQYNRILASVLTTVTMRIGEHLKTGFDKETKENVFLATPANHLVSGALGRSNTSSGVLSLWL 80 (207) Q Consensus 3 lasi~elr~r~nv~--ds~~ynril~~~l~~vt~~i~~hl~t~f~~~~~~~~~~a~~~n~lvsga~gr~n~s~gvl~l~l 80 (207) +-|++|.++-++|. |.+.=.-.|..++-...-.++.|++-+|.....+...- + T Consensus 1 ivtLee~K~HlRid~dd~deDD~li~~~i~AA~~~v~~~l~r~l~~~~~~~~~~-------~------------------ 55 (115) T protein:vir:81 1 MITLAMVQRHLQAELYEDDERDYVMQQLLPAARESAELFINRKLYDTQADMLAD-------Q------------------ 55 (115) T ss_pred CCCHHHHHHHcCCCCCCCccchHHHHHHHHHHHHHHHHHhCCcccccccccccc-------c------------------ Confidence 44899999999984 23334556777777777788888865552111000000 0 Q ss_pred hccccccceEEeccccccccceeeeeeecccceEEeecccCCcceEEEEEcCcceeeeecceeEeecchhHHHHHHHHHh Q lcl|NC_016567. 81 DGFNVSNVELRICDDINASGVLTTSIKVDSKGIVSIYDGWVDYDYFKVTFNSGFDIETVDAYPTYVGVPDKLKQACLMYA 160 (207) Q Consensus 81 ~g~n~s~~~~~~~~~~~~sg~~ts~~~~~~~giv~~~~gw~~~~y~~vt~~~gf~ietvd~~~~y~gvp~~~kqa~l~~a 160 (207) ..++.|..+ .-+|.-||||+||.. T Consensus 56 ------------------------------------------------------~~~~~~~~~--~~~p~~i~~AiLllv 79 (115) T protein:vir:81 56 ------------------------------------------------------AAGVDPAGQ--LLITRTVEQAILLTL 79 (115) T ss_pred ------------------------------------------------------cccCCCCcc--cccCHHHHHHHHHHH Confidence 000001110 117999999999999 Q ss_pred hhhhhhcccCcccccccccccCCCchhHHHHHHHhcCCccccc Q lcl|NC_016567. 161 EHFFMAKYANGEVNEKDDDDYTDAPNGVSALLYSLSRSPIGCL 203 (207) Q Consensus 161 eh~fm~ky~~~~i~e~~~~dytd~pn~vs~~ly~~~r~~~~~l 203 (207) -|+|-.+=+-+ +.--+.-|.++..||..|-+-| |.- T Consensus 80 g~~Y~NRE~v~------~~~~~elP~~~~~LL~pyR~~~-g~~ 115 (115) T protein:vir:81 80 GEWYSSREQVW------TKGAGLVTSSAQNLLHPYRKFA-GVR 115 (115) T ss_pred HHHHhccchhc------chhhhhcCHHHHHHHHHHHhhc-CCC Confidence 99975432211 1123457999999999985544 322 No 12 >protein:vir:4458 Length: 107 # NCBI annotation: hypothetical protein # Family: family:all:363 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700381;genbank:gi:23505453;genbank:GeneID:955660 Probab=88.26 E-value=0.012 Score=31.10 Aligned_cols=106 Identities=21% Similarity=0.270 Sum_probs=69.9 Q ss_pred cccchhHHHhhcccc-cchHHHHHHHHHHhhhhhhhhhhhhcccchhhhceeeeecCccceeecccCCCCCcchhhhhhh Q lcl|NC_016567. 2 ILASISELRQRLNVK-DSEQYNRILASVLTTVTMRIGEHLKTGFDKETKENVFLATPANHLVSGALGRSNTSSGVLSLWL 80 (207) Q Consensus 2 ~lasi~elr~r~nv~-ds~~ynril~~~l~~vt~~i~~hl~t~f~~~~~~~~~~a~~~n~lvsga~gr~n~s~gvl~l~l 80 (207) .+-+++|+++-++|. |-.-=...|..++....-.|+.++..+|.....+. . T Consensus 1 M~vtLee~K~hLRId~D~~dDD~lI~~~i~AA~~~i~~~~~r~l~~~~~~~-~--------------------------- 52 (107) T protein:vir:44 1 MLLSVEEIKAQLRLDEDFEADERYLQLLARAVQKRTETYLNRKLYAPDETI-P--------------------------- 52 (107) T ss_pred CCCCHHHHHHHcCCCCCCchhHHHHHHHHHHHHHHHHHhhcCccccccccc-c--------------------------- Confidence 445899999999995 32222677888888888888888776664322110 0 Q ss_pred hccccccceEEeccccccccceeeeeeecccceEEeecccCCcceEEEEEcCcceeeeecceeEeecchhHHHHHHHHHh Q lcl|NC_016567. 81 DGFNVSNVELRICDDINASGVLTTSIKVDSKGIVSIYDGWVDYDYFKVTFNSGFDIETVDAYPTYVGVPDKLKQACLMYA 160 (207) Q Consensus 81 ~g~n~s~~~~~~~~~~~~sg~~ts~~~~~~~giv~~~~gw~~~~y~~vt~~~gf~ietvd~~~~y~gvp~~~kqa~l~~a 160 (207) +..+.-..+|.-+|+|+||.+ T Consensus 53 -----------------------------------------------------------~~~~~~~~~~~~~~~AiLllv 73 (107) T protein:vir:44 53 -----------------------------------------------------------DSDPDGLLLQDDIRLGMLMLI 73 (107) T ss_pred -----------------------------------------------------------ccccccccchhhHHHHHHHHH Confidence 000112336888999999999 Q ss_pred hhhhhhcccCcccccccccccCCCchhHHHHHHHhcCCcc Q lcl|NC_016567. 161 EHFFMAKYANGEVNEKDDDDYTDAPNGVSALLYSLSRSPI 200 (207) Q Consensus 161 eh~fm~ky~~~~i~e~~~~dytd~pn~vs~~ly~~~r~~~ 200 (207) .|.+--+-+-+ +..-+.-|.||..||.+|-+-|- T Consensus 74 ~~~Y~NRe~~~------~~~~~~lP~~v~~Ll~~yR~~p~ 107 (107) T protein:vir:44 74 SHFYENRSSVT------EVEKLDMPQSFGWLVGPYRYFPQ 107 (107) T ss_pred HHHHhhhhhhc------cccccccCHHHHHHHHHhhhcCC Confidence 99975442211 22235579999999999966555 No 13 >protein:vir:97069 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:363 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453566;genbank:gi:84662601;genbank:GeneID:5142484 Probab=83.08 E-value=0.031 Score=28.82 Aligned_cols=113 Identities=9% Similarity=0.107 Sum_probs=69.2 Q ss_pred ccchhHHHhhccccc--chHHHHHHHHHHhhhhhhhhhhhhcccchhhhceeeeecCccceeecccCCCCCcchhhhhhh Q lcl|NC_016567. 3 LASISELRQRLNVKD--SEQYNRILASVLTTVTMRIGEHLKTGFDKETKENVFLATPANHLVSGALGRSNTSSGVLSLWL 80 (207) Q Consensus 3 lasi~elr~r~nv~d--s~~ynril~~~l~~vt~~i~~hl~t~f~~~~~~~~~~a~~~n~lvsga~gr~n~s~gvl~l~l 80 (207) +-+++|.++-++|.. .+.-.-.|..++....-.++.|++-++...+.+... -...+. T Consensus 1 mvtLee~K~hLRid~d~~d~DDali~~~i~AA~~~v~~~l~r~l~~~~~~~~~---------~~~~~~------------ 59 (115) T protein:vir:97 1 MITLAMMQRHLQAELYEDDERDYVMQQLLPAARESAELFLNRKLYDVQADMLA---------DQVLGV------------ 59 (115) T ss_pred CCCHHHHHHHcCCCCCCCchhhHHHHHHHHHHHHHHHHHhCCcccchhhcccc---------cccccC------------ Confidence 448999999999852 233466788888888888888886555321111000 000000 Q ss_pred hccccccceEEeccccccccceeeeeeecccceEEeecccCCcceEEEEEcCcceeeeecceeEeecchhHHHHHHHHHh Q lcl|NC_016567. 81 DGFNVSNVELRICDDINASGVLTTSIKVDSKGIVSIYDGWVDYDYFKVTFNSGFDIETVDAYPTYVGVPDKLKQACLMYA 160 (207) Q Consensus 81 ~g~n~s~~~~~~~~~~~~sg~~ts~~~~~~~giv~~~~gw~~~~y~~vt~~~gf~ietvd~~~~y~gvp~~~kqa~l~~a 160 (207) .+ +.=.-+|.-||||+||.. T Consensus 60 ----------------------------------------------------------~~--~~~~~~p~~i~~AiLllv 79 (115) T protein:vir:97 60 ----------------------------------------------------------DP--SDQLLITRTVEQAILLTV 79 (115) T ss_pred ----------------------------------------------------------CC--cccccCCHHHHHHHHHHH Confidence 00 001126999999999999 Q ss_pred hhhhhhcccCcccccccccccCCCchhHHHHHHHhcCCcccccc Q lcl|NC_016567. 161 EHFFMAKYANGEVNEKDDDDYTDAPNGVSALLYSLSRSPIGCLT 204 (207) Q Consensus 161 eh~fm~ky~~~~i~e~~~~dytd~pn~vs~~ly~~~r~~~~~l~ 204 (207) -|++-.+=+-+ +..-+.-|.+|..||+.|-|-|- +| T Consensus 80 g~~Y~NRE~v~------~~~~~elP~~~~~LL~pyR~~~G--v~ 115 (115) T protein:vir:97 80 GEWYSSREQVW------IKGAGLVTSSAQNLLHPYRKFAG--VR 115 (115) T ss_pred HHHHhcccccc------cccccccCHHHHHHHHHHHhhcC--CC Confidence 99976552211 11235579999999999865442 22 No 14 >protein:vir:81159 Length: 95 # NCBI annotation: putative DNA packaging protein # Family: family:all:316 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285813;genbank:gi:148747734;genbank:GeneID:5247202 Probab=64.44 E-value=0.23 Score=24.10 Aligned_cols=95 Identities=16% Similarity=0.275 Sum_probs=59.6 Q ss_pred CcccchhHHHhhcccccchHHHHHHHHHHhhhhhhhhhhhhcccchhhhceeeeecCccceeecccCCCCCcchhhhhhh Q lcl|NC_016567. 1 MILASISELRQRLNVKDSEQYNRILASVLTTVTMRIGEHLKTGFDKETKENVFLATPANHLVSGALGRSNTSSGVLSLWL 80 (207) Q Consensus 1 m~lasi~elr~r~nv~ds~~ynril~~~l~~vt~~i~~hl~t~f~~~~~~~~~~a~~~n~lvsga~gr~n~s~gvl~l~l 80 (207) |.+-+++|+++-++|... .=..+|..++-...-.|..+++.+|+. T Consensus 1 Mm~vtLee~K~~LRID~d-~dD~lI~~li~aA~~~i~~~~g~~~~~---------------------------------- 45 (95) T protein:vir:81 1 MMIVTLEEVKNWLRVDFS-DDDALITTLINAAEEYLKNATGTTFDA---------------------------------- 45 (95) T ss_pred CCcCCHHHHHHHcCCCCC-cchHHHHHHHHHHHHHHHHhhcccccc---------------------------------- Confidence 999999999999999533 346677777766655555554322210 Q ss_pred hccccccceEEeccccccccceeeeeeecccceEEeecccCCcceEEEEEcCcceeeeecceeEeecchhHHHHHHHHHh Q lcl|NC_016567. 81 DGFNVSNVELRICDDINASGVLTTSIKVDSKGIVSIYDGWVDYDYFKVTFNSGFDIETVDAYPTYVGVPDKLKQACLMYA 160 (207) Q Consensus 81 ~g~n~s~~~~~~~~~~~~sg~~ts~~~~~~~giv~~~~gw~~~~y~~vt~~~gf~ietvd~~~~y~gvp~~~kqa~l~~a 160 (207) .|.-+|+|+++.+ T Consensus 46 -------------------------------------------------------------------~~~~~~~Avl~lv 58 (95) T protein:vir:81 46 -------------------------------------------------------------------TNHLAKIFCMTLI 58 (95) T ss_pred -------------------------------------------------------------------CchHHHHHHHHHH Confidence 1234899999999 Q ss_pred hhhhhhcccCcccccccccccCCCchhHHHHHHHhcCCccccccc Q lcl|NC_016567. 161 EHFFMAKYANGEVNEKDDDDYTDAPNGVSALLYSLSRSPIGCLTP 205 (207) Q Consensus 161 eh~fm~ky~~~~i~e~~~~dytd~pn~vs~~ly~~~r~~~~~l~~ 205 (207) .|.|-.+=+-++ .. ..-|.+|..||.||. -.-.-=+. T Consensus 59 ~~~YeNRe~~~~------~~-~~~p~~v~sll~~lr-~~~~~~~~ 95 (95) T protein:vir:81 59 ADWYENRELVGR------AS-DQVRPILQSILAQLT-YAYGGETA 95 (95) T ss_pred HHHHhhcccccc------cc-ccccHHHHHHHHHhh-hccccccC Confidence 999754422111 11 246999999999984 11111111 No 15 >protein:vir:5742 Length: 110 # NCBI annotation: hypothetical protein # Family: family:all:363 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892053;genbank:gi:33770516;uniprot:Q7Y407;genbank:GeneID:2637465 Probab=50.99 E-value=0.59 Score=21.82 Aligned_cols=109 Identities=17% Similarity=0.238 Sum_probs=65.3 Q ss_pred CcccchhHHHhhcccc-cchHHHHHHHHHHhhhhhhhhhhhhcccchhhhceeeeecCccceeecccCCCCCcchhhhhh Q lcl|NC_016567. 1 MILASISELRQRLNVK-DSEQYNRILASVLTTVTMRIGEHLKTGFDKETKENVFLATPANHLVSGALGRSNTSSGVLSLW 79 (207) Q Consensus 1 m~lasi~elr~r~nv~-ds~~ynril~~~l~~vt~~i~~hl~t~f~~~~~~~~~~a~~~n~lvsga~gr~n~s~gvl~l~ 79 (207) |-+-+++|+|+.+++. |...=...|.-.+--..-.++.|+..|+-. +.+ -+ ++ T Consensus 1 m~mitLeeiK~hlRid~D~~~eD~lL~~y~~AA~~~~e~~~~rkLy~---------~~~-~~----~~------------ 54 (110) T protein:vir:57 1 MGMTSLSNVKTQLRLEEDFTEHDDFIESLIDAAQRSIERTYYCVLVD---------SQE-AL----EK------------ 54 (110) T ss_pred CCCCCHHHHHHHcCCCCCCChhHHHHHHHHHHHHHHHHHHhCCcccC---------Ccc-cc----cc------------ Confidence 9999999999999996 444455666666666666666666544321 100 00 00 Q ss_pred hhccccccceEEeccccccccceeeeeeecccceEEeecccCCcceEEEEEcCcceeeeecceeEeecchhHHHHHHHHH Q lcl|NC_016567. 80 LDGFNVSNVELRICDDINASGVLTTSIKVDSKGIVSIYDGWVDYDYFKVTFNSGFDIETVDAYPTYVGVPDKLKQACLMY 159 (207) Q Consensus 80 l~g~n~s~~~~~~~~~~~~sg~~ts~~~~~~~giv~~~~gw~~~~y~~vt~~~gf~ietvd~~~~y~gvp~~~kqa~l~~ 159 (207) .| +..|| ...++-|||||||+ T Consensus 55 -----------------------------------------~p--------------~~~~g----l~~~~di~~A~Lll 75 (110) T protein:vir:57 55 -----------------------------------------LP--------------EGVRG----FLIEPDTQLAARMM 75 (110) T ss_pred -----------------------------------------CC--------------CCCCc----cccCHHHHHHHHHH Confidence 00 00111 23578899999999 Q ss_pred hhhhhhhcccCcccccccccccCCCchhHHHHHHHhcCCcc Q lcl|NC_016567. 160 AEHFFMAKYANGEVNEKDDDDYTDAPNGVSALLYSLSRSPI 200 (207) Q Consensus 160 aeh~fm~ky~~~~i~e~~~~dytd~pn~vs~~ly~~~r~~~ 200 (207) +-|++-.+=+-+..+ -+.-|-||.+||.=|-.... T Consensus 76 v~hwYeNREav~~~~------~~~~P~~v~~Ll~P~~~~~~ 110 (110) T protein:vir:57 76 VAQWYLNPKGTSPDG------DTPAQLGVEYLLFPLMEHTV 110 (110) T ss_pred HHHHHhccccccccc------ccchhHHHHHHHHHHHhhcC Confidence 999965443323221 23449999999876544333 No 16 >protein:vir:4831 Length: 105 # NCBI annotation: ORF27 # Family: family:all:734 # ACLAME annotation(s): phi:0000018 - phage genome packaging # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038328;genbank:gi:9634654;genbank:GeneID:1262588 Probab=49.96 E-value=0.6 Score=21.80 Aligned_cols=99 Identities=14% Similarity=0.144 Sum_probs=56.3 Q ss_pred CcccchhHHHhhcccccchHHHHHHHHHHhhhhhhhhhhhhcccchhhhceeeeecCccceeecccCCCCCcchhhhhhh Q lcl|NC_016567. 1 MILASISELRQRLNVKDSEQYNRILASVLTTVTMRIGEHLKTGFDKETKENVFLATPANHLVSGALGRSNTSSGVLSLWL 80 (207) Q Consensus 1 m~lasi~elr~r~nv~ds~~ynril~~~l~~vt~~i~~hl~t~f~~~~~~~~~~a~~~n~lvsga~gr~n~s~gvl~l~l 80 (207) |++ +++|+++-++|. ...=...|..++-... .+++. T Consensus 1 M~v-tLee~K~~LRID-~dddD~lI~~~i~aA~----~yi~~-------------------------------------- 36 (105) T protein:vir:48 1 MSV-SKTSIMQTLNLD-ETDDTALIPAYIESAK----QYIIN-------------------------------------- 36 (105) T ss_pred Ccc-cHHHHHHHcCCC-CccchHHHHHHHHHHH----HHHHH-------------------------------------- Confidence 776 899999999994 3334455555554332 22221 Q ss_pred hccccccceEEeccccccccceeeeeeecccceEEeecccCCcceEEEEEcCcceeeeecceeEeecchhHHHHHHHHHh Q lcl|NC_016567. 81 DGFNVSNVELRICDDINASGVLTTSIKVDSKGIVSIYDGWVDYDYFKVTFNSGFDIETVDAYPTYVGVPDKLKQACLMYA 160 (207) Q Consensus 81 ~g~n~s~~~~~~~~~~~~sg~~ts~~~~~~~giv~~~~gw~~~~y~~vt~~~gf~ietvd~~~~y~gvp~~~kqa~l~~a 160 (207) |+.. +.+..| --++|..+|+|+|+++ T Consensus 37 ---------------------------------------~ig~-----------~~~~~~----~~~~~~~~~~Avl~lv 62 (105) T protein:vir:48 37 ---------------------------------------AVGS-----------DSKFYD----LENVQPLFDTAVIALT 62 (105) T ss_pred ---------------------------------------hhCC-----------CCcccc----ccCCchHHHHHHHHHH Confidence 0000 000001 1125677899999999 Q ss_pred hhhhhhcccCcccccccccccCCCchhHHHHHHHhcCCccccccccC Q lcl|NC_016567. 161 EHFFMAKYANGEVNEKDDDDYTDAPNGVSALLYSLSRSPIGCLTPII 207 (207) Q Consensus 161 eh~fm~ky~~~~i~e~~~~dytd~pn~vs~~ly~~~r~~~~~l~~ii 207 (207) .|.|-.+=+- .+......|.+|..||.||-..- ...- T Consensus 63 ~~~YeNR~~~------~~~~~~~ip~~v~sli~~lR~~y----~~~~ 99 (105) T protein:vir:48 63 SSYFTYRVAL------TDTVTYPINLTLNSIIGQLRGLY----ATYS 99 (105) T ss_pred HHHHhhhhhc------cCcccchhhHHHHHHHHHHhhhh----hhhh Confidence 9998665211 12333458999999999985321 1111 No 17 >protein:vir:4857 Length: 104 # NCBI annotation: putative DNA packaging protein # Family: family:all:734 # ACLAME annotation(s): phi:0000018 - phage genome packaging # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049397;genbank:gi:9632425;genbank:GeneID:1258493 Probab=49.13 E-value=0.54 Score=22.03 Aligned_cols=99 Identities=14% Similarity=0.178 Sum_probs=56.0 Q ss_pred CcccchhHHHhhcccccchHHHHHHHHHHhhhhhhhhhhhhcccchhhhceeeeecCccceeecccCCCCCcchhhhhhh Q lcl|NC_016567. 1 MILASISELRQRLNVKDSEQYNRILASVLTTVTMRIGEHLKTGFDKETKENVFLATPANHLVSGALGRSNTSSGVLSLWL 80 (207) Q Consensus 1 m~lasi~elr~r~nv~ds~~ynril~~~l~~vt~~i~~hl~t~f~~~~~~~~~~a~~~n~lvsga~gr~n~s~gvl~l~l 80 (207) |++ +++|+++-++|...+ =...|..++-.-.--|.. T Consensus 1 M~v-tLeevK~~LRID~d~-dD~li~~~i~aA~~~i~~------------------------------------------ 36 (104) T protein:vir:48 1 MSV-SKETIMQTLNLDETD-DTALIPAYIESARQYVVN------------------------------------------ 36 (104) T ss_pred Ccc-cHHHHHHHcCCCCcc-chHHHHHHHHHHHHHHHH------------------------------------------ Confidence 776 689999988885333 355555554443222211 Q ss_pred hccccccceEEeccccccccceeeeeeecccceEEeecccCCcceEEEEEcCcceeeeecceeEeecchhHHHHHHHHHh Q lcl|NC_016567. 81 DGFNVSNVELRICDDINASGVLTTSIKVDSKGIVSIYDGWVDYDYFKVTFNSGFDIETVDAYPTYVGVPDKLKQACLMYA 160 (207) Q Consensus 81 ~g~n~s~~~~~~~~~~~~sg~~ts~~~~~~~giv~~~~gw~~~~y~~vt~~~gf~ietvd~~~~y~gvp~~~kqa~l~~a 160 (207) ++ |-+. ..+..-++|..+|+|++|++ T Consensus 37 ---------------------------------------~i-----------g~~~----~~~~~~~~~~~~~~Avl~lv 62 (104) T protein:vir:48 37 ---------------------------------------SV-----------GDDP----KFYNLDSVRALFDTAVIALT 62 (104) T ss_pred ---------------------------------------hh-----------CCCC----CcccccCCChhHHHHHHHHH Confidence 00 0000 01112357888999999999 Q ss_pred hhhhhhcccCcccccccccccCCCchhHHHHHHHhcCCccccccccC Q lcl|NC_016567. 161 EHFFMAKYANGEVNEKDDDDYTDAPNGVSALLYSLSRSPIGCLTPII 207 (207) Q Consensus 161 eh~fm~ky~~~~i~e~~~~dytd~pn~vs~~ly~~~r~~~~~l~~ii 207 (207) .|.|-.+=+ -.+..-...|.+|..||.||-.. ....- T Consensus 63 ~~~Y~NR~~------~~~~~~~~ip~~v~sli~~lR~~----y~~~~ 99 (104) T protein:vir:48 63 SSYFTYRVA------LTDTATYPVNLTLNSIIGQLRGL----YATYS 99 (104) T ss_pred HHHHhhhhh------hcccccchhhHHHHHHHHHHHHh----hhhhc Confidence 999865511 12333445799999999887421 11111 No 18 >protein:vir:102158 Length: 99 # NCBI annotation: uncharacterized phage protein (possible DNA packaging) # Family: family:all:316 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699940;genbank:gi:110804046;genbank:GeneID:4206702 Probab=40.15 E-value=0.98 Score=20.62 Aligned_cols=99 Identities=14% Similarity=0.191 Sum_probs=58.1 Q ss_pred cccchhHHHhhcccccchHHHHHHHHHHhhhhhhhhhhhhcccchhhhceeeeecCccceeecccCCCCCcchhhhhhhh Q lcl|NC_016567. 2 ILASISELRQRLNVKDSEQYNRILASVLTTVTMRIGEHLKTGFDKETKENVFLATPANHLVSGALGRSNTSSGVLSLWLD 81 (207) Q Consensus 2 ~lasi~elr~r~nv~ds~~ynril~~~l~~vt~~i~~hl~t~f~~~~~~~~~~a~~~n~lvsga~gr~n~s~gvl~l~l~ 81 (207) .+-+++|+++-++| |.+.=..+|..++....-.|...++.+|... T Consensus 1 M~vtLee~K~~LRI-D~d~dD~lI~~~i~aA~~~i~~~~~~~~~~~---------------------------------- 45 (99) T protein:vir:10 1 MILSVDEVKNYLRV-DYDEDDILIQDLIESAEDYLYNATGKKFTEK---------------------------------- 45 (99) T ss_pred CcCCHHHHHHHcCC-CCCcchHHHHHHHHHHHHHHHHhhCCCCCCC---------------------------------- Confidence 45589999999999 4444566777777666555554432222100 Q ss_pred ccccccceEEeccccccccceeeeeeecccceEEeecccCCcceEEEEEcCcceeeeecceeEeecchhHHHHHHHHHhh Q lcl|NC_016567. 82 GFNVSNVELRICDDINASGVLTTSIKVDSKGIVSIYDGWVDYDYFKVTFNSGFDIETVDAYPTYVGVPDKLKQACLMYAE 161 (207) Q Consensus 82 g~n~s~~~~~~~~~~~~sg~~ts~~~~~~~giv~~~~gw~~~~y~~vt~~~gf~ietvd~~~~y~gvp~~~kqa~l~~ae 161 (207) |..+|+|++|.+. T Consensus 46 -------------------------------------------------------------------~~~~k~Avl~lv~ 58 (99) T protein:vir:10 46 -------------------------------------------------------------------NKLAKRYCLALVY 58 (99) T ss_pred -------------------------------------------------------------------ChHHHHHHHHHHH Confidence 1246999999999 Q ss_pred hhhhhcccCcccccccccccCCCchhHHHHHHHhcCCcccccc Q lcl|NC_016567. 162 HFFMAKYANGEVNEKDDDDYTDAPNGVSALLYSLSRSPIGCLT 204 (207) Q Consensus 162 h~fm~ky~~~~i~e~~~~dytd~pn~vs~~ly~~~r~~~~~l~ 204 (207) |.+=.+=+.++.... ......-|.+|..||.||.-. -.-=+ T Consensus 59 ~~YenR~~~~~~~~~-~~~~~~lp~~v~sli~qlr~~-~~~~~ 99 (99) T protein:vir:10 59 DWYKDKGMNIRATKN-TTVSEKVKYTLQSILLQLKFC-KEEDT 99 (99) T ss_pred HhHhcchhhhhhhhc-cchhhhhhHHHHHHHHHHhhc-cCCCC Confidence 997544322221111 122234699999999998321 11111 Done!