Query lcl|NC_021296.1_cdsid_YP_008050644.1 [gene=11] [protein=hypothetical protein] [protein_id=YP_008050644.1] [location=8030..8461] Match_columns 143 No_of_seqs 6 out of 9 Neff 2.1 Searched_HMMs 1612 Date Thu Nov 7 16:36:24 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_11 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_11_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:99004 Length: 131 94.7 0.0002 1.2E-07 40.9 5.4 114 1-119 1-131 (131) 2 protein:vir:95062 Length: 116 93.1 0.0025 1.5E-06 34.9 8.4 106 29-139 1-116 (116) 3 protein:vir:97327 Length: 116 91.5 0.0056 3.5E-06 32.9 8.3 106 29-139 1-116 (116) 4 protein:vir:1243 Length: 116 # 91.5 0.0056 3.5E-06 32.9 8.3 106 29-139 1-116 (116) 5 protein:vir:102441 Length: 137 87.3 0.02 1.3E-05 29.9 8.0 127 6-143 1-135 (137) 6 protein:vir:743 Length: 108 # 86.8 0.018 1.1E-05 30.2 7.4 97 15-135 1-108 (108) 7 protein:vir:98409 Length: 108 86.7 0.013 8.3E-06 30.8 6.7 97 15-135 1-108 (108) 8 protein:vir:106506 Length: 137 86.7 0.027 1.7E-05 29.2 8.3 125 7-143 1-132 (137) 9 protein:vir:105330 Length: 137 86.6 0.042 2.6E-05 28.1 9.4 123 12-139 1-137 (137) 10 protein:vir:3617 Length: 112 # 80.7 0.035 2.2E-05 28.6 6.4 107 6-138 1-112 (112) 11 protein:vir:9578 Length: 78 # 80.3 0.011 6.7E-06 31.4 3.5 78 12-135 1-78 (78) 12 protein:vir:107099 Length: 137 80.0 0.099 6.2E-05 26.1 8.8 118 12-139 1-137 (137) 13 protein:vir:101594 Length: 173 79.1 0.063 3.9E-05 27.1 7.3 125 15-143 1-168 (173) 14 protein:vir:100243 Length: 140 77.9 0.069 4.3E-05 27.0 7.1 123 12-143 1-139 (140) 15 protein:vir:106570 Length: 182 77.4 0.13 7.8E-05 25.5 9.7 129 12-143 1-182 (182) 16 protein:vir:100075 Length: 140 77.3 0.067 4.1E-05 27.0 6.8 123 12-143 1-139 (140) 17 protein:vir:96225 Length: 115 77.0 0.1 6.4E-05 26.0 7.8 96 15-135 1-115 (115) 18 protein:vir:78858 Length: 115 77.0 0.1 6.4E-05 26.0 7.8 96 15-135 1-115 (115) 19 protein:vir:103917 Length: 115 77.0 0.1 6.4E-05 26.0 7.8 96 15-135 1-115 (115) 20 protein:vir:97144 Length: 115 77.0 0.1 6.4E-05 26.0 7.8 96 15-135 1-115 (115) 21 protein:vir:9312 Length: 115 # 77.0 0.1 6.4E-05 26.0 7.8 96 15-135 1-115 (115) 22 protein:vir:96358 Length: 115 77.0 0.1 6.4E-05 26.0 7.8 96 15-135 1-115 (115) 23 protein:vir:1642 Length: 78 # 77.0 0.017 1.1E-05 30.3 3.5 78 12-135 1-78 (78) 24 protein:vir:94763 Length: 78 # 77.0 0.017 1.1E-05 30.3 3.5 78 12-135 1-78 (78) 25 protein:vir:80362 Length: 140 76.9 0.09 5.6E-05 26.3 7.4 119 12-143 1-139 (140) 26 protein:vir:5978 Length: 144 # 76.7 0.13 8.2E-05 25.4 9.9 132 1-142 1-144 (144) 27 protein:vir:96829 Length: 135 75.6 0.14 9E-05 25.2 9.1 116 12-139 1-135 (135) 28 protein:vir:99101 Length: 142 75.2 0.092 5.7E-05 26.3 7.0 134 5-143 1-138 (142) 29 protein:vir:8669 Length: 142 # 75.2 0.092 5.7E-05 26.3 7.0 134 5-143 1-138 (142) 30 protein:vir:78077 Length: 141 74.1 0.13 8.2E-05 25.4 7.6 124 12-141 1-141 (141) 31 protein:vir:95894 Length: 137 73.6 0.17 0.0001 24.8 8.6 123 12-139 1-137 (137) 32 protein:vir:94490 Length: 137 72.4 0.18 0.00011 24.6 8.7 123 12-139 1-137 (137) 33 protein:vir:93738 Length: 137 72.4 0.18 0.00011 24.6 8.7 123 12-139 1-137 (137) 34 protein:vir:97427 Length: 137 72.4 0.18 0.00011 24.6 8.7 123 12-139 1-137 (137) 35 protein:vir:94796 Length: 137 72.0 0.19 0.00012 24.6 8.8 123 12-139 1-137 (137) 36 protein:vir:94654 Length: 142 67.5 0.25 0.00016 23.9 8.6 128 3-138 1-142 (142) 37 protein:vir:106623 Length: 115 66.5 0.26 0.00016 23.8 7.4 95 15-135 1-115 (115) 38 protein:vir:4906 Length: 114 # 64.1 0.21 0.00013 24.3 6.5 106 12-143 1-114 (114) 39 protein:vir:2740 Length: 114 # 64.1 0.21 0.00013 24.3 6.5 106 12-143 1-114 (114) 40 protein:vir:97088 Length: 157 63.0 0.33 0.0002 23.3 7.8 135 8-143 1-156 (157) 41 protein:vir:9763 Length: 89 # 62.1 0.042 2.6E-05 28.2 2.2 85 5-135 1-89 (89) 42 protein:vir:4347 Length: 164 # 61.3 0.36 0.00022 23.0 7.9 126 12-142 1-164 (164) 43 protein:vir:194 Length: 149 # 58.1 0.42 0.00026 22.6 7.6 124 12-140 1-149 (149) 44 protein:vir:93617 Length: 148 57.0 0.44 0.00028 22.5 7.7 114 12-140 1-148 (148) 45 protein:vir:1437 Length: 140 # 56.4 0.46 0.00028 22.4 8.2 119 12-143 1-139 (140) 46 protein:vir:96121 Length: 137 56.3 0.46 0.00029 22.4 8.7 123 12-139 1-137 (137) 47 protein:vir:105916 Length: 149 54.7 0.5 0.00031 22.2 9.3 125 1-139 1-149 (149) 48 protein:vir:99744 Length: 115 52.0 0.57 0.00035 21.9 7.6 92 15-135 1-115 (115) 49 protein:vir:80665 Length: 96 # 48.8 0.12 7.7E-05 25.6 2.5 74 11-87 1-96 (96) 50 protein:vir:1273 Length: 127 # 45.8 0.52 0.00032 22.2 5.4 110 12-143 1-126 (127) 51 protein:vir:94108 Length: 149 41.9 0.91 0.00056 20.8 9.2 125 1-141 1-149 (149) 52 protein:vir:3873 Length: 128 # 41.8 0.78 0.00048 21.2 5.7 113 15-143 1-127 (128) 53 protein:vir:105007 Length: 146 41.8 0.87 0.00054 20.9 6.0 117 12-143 1-144 (146) 54 protein:vir:102085 Length: 146 41.8 0.87 0.00054 20.9 6.0 117 12-143 1-144 (146) 55 protein:vir:102875 Length: 146 41.8 0.87 0.00054 20.9 6.0 117 12-143 1-144 (146) 56 protein:vir:107568 Length: 146 41.8 0.87 0.00054 20.9 6.0 117 12-143 1-144 (146) 57 protein:vir:97982 Length: 140 38.3 1.1 0.00067 20.4 8.3 132 8-143 1-136 (140) 58 protein:vir:107545 Length: 140 38.3 1.1 0.00067 20.4 8.3 132 8-143 1-136 (140) 59 protein:vir:105089 Length: 133 38.3 1.1 0.00067 20.4 6.1 116 12-140 1-133 (133) 60 protein:vir:9930 Length: 108 # 37.3 1.1 0.0007 20.3 7.7 94 21-139 1-108 (108) 61 protein:vir:106041 Length: 137 33.7 1.3 0.00083 19.9 8.3 129 1-143 1-133 (137) 62 protein:vir:5745 Length: 135 # 33.6 1.3 0.00083 19.9 6.1 115 11-143 1-133 (135) 63 protein:vir:78163 Length: 92 # 33.0 0.23 0.00014 24.1 1.4 79 12-97 1-92 (92) 64 protein:vir:81147 Length: 126 31.9 1.5 0.00091 19.7 8.8 112 12-143 1-126 (126) 65 protein:vir:94538 Length: 125 31.2 1.5 0.00094 19.6 7.9 108 1-141 1-125 (125) 66 protein:vir:1891 Length: 179 # 29.1 1.7 0.001 19.3 7.1 129 12-143 1-174 (179) 67 protein:vir:79091 Length: 175 28.2 1.8 0.0011 19.2 5.6 109 8-143 1-174 (175) 68 protein:vir:107851 Length: 175 22.9 2.4 0.0015 18.5 7.7 106 8-140 1-175 (175) No 1 >protein:vir:99004 Length: 131 # NCBI annotation: gp33 # Family: family:all:32654 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655898;genbank:gi:109521470;genbank:GeneID:4157969 Probab=94.69 E-value=0.0002 Score=40.85 Aligned_cols=114 Identities=24% Similarity=0.303 Sum_probs=60.8 Q ss_pred CCCCcccceeeeecccc-CCchhhHHHHHhhhhHHHHHHHHHHHHHHHHhhhhcc-----cc-----ccccccceEEEe- Q lcl|NC_021296. 1 MPAAGTTIGHRLTDIQV-PNPNRGLAQILLSPNMELLMGIIGQEVVLAYRAGVAK-----RT-----GKLMSSASSETM- 68 (143) Q Consensus 1 ~pa~g~~i~~~~~d~k~-~np~rgl~eiL~S~~m~~Lva~~~e~v~~~Y~a~VAk-----RT-----g~LArsvrvetf- 68 (143) |.---.|.+|. .|+|. +-||.-|+|||+-..|-.||+++--.|+--+-.+-|. || |--.-+-+|+-| T Consensus 1 madlnsgyayi-edlklygppnkvlaqilvgnqmynlvaeyvlkvaihfttkearspyrdrtrqyrrgghtpgqqvrnmd 79 (131) T protein:vir:99 1 MADLNSGYAYI-EDLKLYGPPNKVLAQILVGNQMYNLVAEYVLKVAIHFTTKEARSPYRDRTRQYRRGGHTPGQQVRNMD 79 (131) T ss_pred CCccccchhhh-hhhhccCChHHHHHHHHhhhhHHHHHHHHHHHhhhhccchhhcCchhhhhHHhhhcCCCchhhhhccc Confidence 66556666654 46665 5567899999999999999999987776555433331 11 111112233333 Q ss_pred -ecCCCCCeeEEEEEecccccccCccc----CCCcCCcchhhhhhhhhhcCCCCCC Q lcl|NC_021296. 69 -IGGKKNDRWVSHVTIGGETAVSTWHS----PRNPNPGDLFFYGVLHEHGDGGNPP 119 (143) Q Consensus 69 -IGG~K~DRwVg~VtVG~e~aa~~~Hs----pr~g~pgd~f~ygvlh~~g~~~~~p 119 (143) .=-.-||||+|.||.-+.|.-++--. .|- -|.--.---||..=. ..| T Consensus 80 ydvamgndrwigqitlredysgadqygrkkyary--rgsqslreslhavlp--hqp 131 (131) T protein:vir:99 80 YDVAMGNDRWIGQITLREDYSGADQYGRKKYARY--RGSQSLRESLHAVLP--HQP 131 (131) T ss_pred cceeccCcceeeeeeeeccccchhhhhhhhHhhh--ccchHHHHHHHhhcC--CCC Confidence 22335899999999988776554300 000 000000001111100 001 No 2 >protein:vir:95062 Length: 116 # NCBI annotation: ORF044 # Family: family:all:180 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240827;genbank:gi:66394711;genbank:GeneID:5133856 Probab=93.12 E-value=0.0025 Score=34.87 Aligned_cols=106 Identities=12% Similarity=0.030 Sum_probs=72.2 Q ss_pred hhhhHHHHHHHHHHHHHHHHhhhhccccccccccceEEEeecCCCCCeeEEEEEecccccccCc-----ccCCCcCC--- Q lcl|NC_021296. 29 LSPNMELLMGIIGQEVVLAYRAGVAKRTGKLMSSASSETMIGGKKNDRWVSHVTIGGETAVSTW-----HSPRNPNP--- 100 (143) Q Consensus 29 ~S~~m~~Lva~~~e~v~~~Y~a~VAkRTg~LArsvrvetfIGG~K~DRwVg~VtVG~e~aa~~~-----Hspr~g~p--- 100 (143) +...++..|...++.++..=+..+..+||.|.+|...++-. |...+.|..+.+||.-.+ |..+.... T Consensus 1 v~~~v~~~~~~~~~~i~~~ak~~apv~TG~Lr~SI~~~~~~-----~~~~~~V~~~~~Ya~yvE~GTg~~~~~~~~~~~~ 75 (116) T protein:vir:95 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKD-----GGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAK 75 (116) T ss_pred ChHHHHHHHHHHHHHHHHHHHhhCCccccccccceeEEeec-----CcEEEEEecCCCccceeecCccccccCCCccccc Confidence 78888888999999999999999999999999999887744 446677888888888655 44222111 Q ss_pred cchhhh-hhh-hhhcCCCCCCcccccchhhHHHHHHHHhhh Q lcl|NC_021296. 101 GDLFFY-GVL-HEHGDGGNPPSGWDFPAHKDLKKALAVVKA 139 (143) Q Consensus 101 gd~f~y-gvl-h~~g~~~~~p~~~~f~ah~dl~~a~~~~~~ 139 (143) ..++|| .+. ..|.--|.||...-+||-++-++-+.-.-+ T Consensus 76 ~~~~~~~~~~g~~~~t~g~~a~Pfl~pA~~~~~~~i~k~is 116 (116) T protein:vir:95 76 NIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) T ss_pred cccceeecCccceeeCCCCCCCcchHHHHHHHHHHHHHhhC Confidence 122232 111 122345788888888888777665443333 No 3 >protein:vir:97327 Length: 116 # NCBI annotation: ORF041 # Family: family:all:180 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240615;genbank:gi:66396305;genbank:GeneID:5133683 Probab=91.53 E-value=0.0056 Score=32.94 Aligned_cols=106 Identities=14% Similarity=0.059 Sum_probs=71.0 Q ss_pred hhhhHHHHHHHHHHHHHHHHhhhhccccccccccceEEEeecCCCCCeeEEEEEecccccccCc-----ccCCCcCC--- Q lcl|NC_021296. 29 LSPNMELLMGIIGQEVVLAYRAGVAKRTGKLMSSASSETMIGGKKNDRWVSHVTIGGETAVSTW-----HSPRNPNP--- 100 (143) Q Consensus 29 ~S~~m~~Lva~~~e~v~~~Y~a~VAkRTg~LArsvrvetfIGG~K~DRwVg~VtVG~e~aa~~~-----Hspr~g~p--- 100 (143) +...++..|..-+++++..=+..+..+||+|.+|..+++-.+| ..+.|..+.+||.-.+ |..++... T Consensus 1 v~~~v~~~~~~~~~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~-----~~~~V~~~~~YA~yvE~GTg~~~~~~~~~~~~ 75 (116) T protein:vir:97 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDGG-----FTGVINIGSEYAIYVNYGTGIYATGAGGSRAK 75 (116) T ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcCcccccccceEEeecCc-----EEEEEecCCCcccccccCCcccccCCCccccc Confidence 7778888888999999988899999999999999998875544 5577777788887655 44332210 Q ss_pred cchhhh--hhhhhhcCCCCCCcccccchhhHHHHHHHHhhh Q lcl|NC_021296. 101 GDLFFY--GVLHEHGDGGNPPSGWDFPAHKDLKKALAVVKA 139 (143) Q Consensus 101 gd~f~y--gvlh~~g~~~~~p~~~~f~ah~dl~~a~~~~~~ 139 (143) -.+++| ..-+.|.--|.||...-+||-++-++-+.-.-+ T Consensus 76 ~~~~~~~~~~g~~~~t~g~~a~Pfl~pA~~~~~~~i~k~i~ 116 (116) T protein:vir:97 76 KIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) T ss_pred ccceeeecCCceeeecCCcCCCcchHHHHHHHHHHHHHhhC Confidence 111222 111233345778888888887776665433223 No 4 >protein:vir:1243 Length: 116 # NCBI annotation: similar to phage Spp1 gp16.1 # Family: family:all:180 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510942;genbank:gi:17426276;genbank:GeneID:927389 Probab=91.53 E-value=0.0056 Score=32.94 Aligned_cols=106 Identities=14% Similarity=0.059 Sum_probs=71.0 Q ss_pred hhhhHHHHHHHHHHHHHHHHhhhhccccccccccceEEEeecCCCCCeeEEEEEecccccccCc-----ccCCCcCC--- Q lcl|NC_021296. 29 LSPNMELLMGIIGQEVVLAYRAGVAKRTGKLMSSASSETMIGGKKNDRWVSHVTIGGETAVSTW-----HSPRNPNP--- 100 (143) Q Consensus 29 ~S~~m~~Lva~~~e~v~~~Y~a~VAkRTg~LArsvrvetfIGG~K~DRwVg~VtVG~e~aa~~~-----Hspr~g~p--- 100 (143) +...++..|..-+++++..=+..+..+||+|.+|..+++-.+| ..+.|..+.+||.-.+ |..++... T Consensus 1 v~~~v~~~~~~~~~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~-----~~~~V~~~~~YA~yvE~GTg~~~~~~~~~~~~ 75 (116) T protein:vir:12 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDGG-----FTGVINIGSEYAIYVNYGTGIYATGAGGSRAK 75 (116) T ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcCcccccccceEEeecCc-----EEEEEecCCCcccccccCCcccccCCCccccc Confidence 7778888888999999988899999999999999998875544 5577777788887655 44332210 Q ss_pred cchhhh--hhhhhhcCCCCCCcccccchhhHHHHHHHHhhh Q lcl|NC_021296. 101 GDLFFY--GVLHEHGDGGNPPSGWDFPAHKDLKKALAVVKA 139 (143) Q Consensus 101 gd~f~y--gvlh~~g~~~~~p~~~~f~ah~dl~~a~~~~~~ 139 (143) -.+++| ..-+.|.--|.||...-+||-++-++-+.-.-+ T Consensus 76 ~~~~~~~~~~g~~~~t~g~~a~Pfl~pA~~~~~~~i~k~i~ 116 (116) T protein:vir:12 76 KIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) T ss_pred ccceeeecCCceeeecCCcCCCcchHHHHHHHHHHHHHhhC Confidence 111222 111233345778888888887776665433223 No 5 >protein:vir:102441 Length: 137 # NCBI annotation: gp26 # Family: family:all:1084 # MgeID: mge:1618 # MgeName: Pipefish # Cross-refs: genbank:acc:YP_655303;genbank:gi:109521866;genbank:GeneID:4157756 Probab=87.29 E-value=0.02 Score=29.87 Aligned_cols=127 Identities=26% Similarity=0.165 Sum_probs=77.1 Q ss_pred ccceeeeeccccCCchhhHHHHHhhhhHHHHHHHHHHHHHHHHhhhhccccccccccceEEEeecCCCCCeeEEEEEecc Q lcl|NC_021296. 6 TTIGHRLTDIQVPNPNRGLAQILLSPNMELLMGIIGQEVVLAYRAGVAKRTGKLMSSASSETMIGGKKNDRWVSHVTIGG 85 (143) Q Consensus 6 ~~i~~~~~d~k~~np~rgl~eiL~S~~m~~Lva~~~e~v~~~Y~a~VAkRTg~LArsvrvetfIGG~K~DRwVg~VtVG~ 85 (143) .-..-++ .+|| ...+-.+...|+..|.+.+++++..=++.+-.|||.|-+|.+.++.+-+.. -+..+.|.-+. T Consensus 1 ~~~~~~~----~~~~--~~~~~~~~~v~r~~l~~~a~~v~~~Ak~~aPv~tG~Lr~SI~~~~~~~~~~-~~~~~~V~~~~ 73 (137) T protein:vir:10 1 MTVTARY----ERNP--VGEARQFQVIARRRLSRITRGTANQARADVPVKTGNLGRSIREDPIVVAGP-LRLDSGVTAHA 73 (137) T ss_pred CeeEEEe----ccCc--hhHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccchhhhcCceeeeeecccc-ceEEEEecCCC Confidence 1122233 2566 444446888999999999999999999999999999999999887532211 11122333345 Q ss_pred cccccCc-----cc--CCCcCCcchhhhhhhhh-hcCCCCCCcccccchhhHHHHHHHHhhhccCC Q lcl|NC_021296. 86 ETAVSTW-----HS--PRNPNPGDLFFYGVLHE-HGDGGNPPSGWDFPAHKDLKKALAVVKARNGA 143 (143) Q Consensus 86 e~aa~~~-----Hs--pr~g~pgd~f~ygvlh~-~g~~~~~p~~~~f~ah~dl~~a~~~~~~~~~~ 143 (143) +|+.-.+ |. |+++ ++.++|++==|. |+---|-| -.+++.=|+.|+...+.|.-+ T Consensus 74 ~YA~~ve~GT~ph~I~Pk~~-k~~l~~~~~g~~vf~k~V~hP---G~~a~PfL~~A~~~~~~~~~~ 135 (137) T protein:vir:10 74 DYARYVHDGTRAHVIRPRRP-GGVLRFTVGGRVVYARRVNHP---GTRARPFLRNAAERVVARETA 135 (137) T ss_pred ccceeeecCCCCceeecccc-ceeeeEeeCCeeEecceeecC---CCCCCchHHHHHHHhhhhhcc Confidence 5665544 43 2221 446666531111 12111111 134667799999999999888 No 6 >protein:vir:743 Length: 108 # NCBI annotation: unknown # Family: family:all:180 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108720;genbank:gi:13487842;genbank:GeneID:920877 Probab=86.77 E-value=0.018 Score=30.20 Aligned_cols=97 Identities=19% Similarity=0.160 Sum_probs=64.6 Q ss_pred cccCCch---hhHHHHHhhhhHHHHHHHHHHHHHHHHhhhhccccccccccceEEEeecCCCCCeeEEEEEecccccccC Q lcl|NC_021296. 15 IQVPNPN---RGLAQILLSPNMELLMGIIGQEVVLAYRAGVAKRTGKLMSSASSETMIGGKKNDRWVSHVTIGGETAVST 91 (143) Q Consensus 15 ~k~~np~---rgl~eiL~S~~m~~Lva~~~e~v~~~Y~a~VAkRTg~LArsvrvetfIGG~K~DRwVg~VtVG~e~aa~~ 91 (143) |++..-. +.|.+.-....++.-+...++.++..=+.++-.+||.|.+|..+++-.||. .++|+... T Consensus 1 i~i~Gld~l~~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aPv~TG~Lr~si~~~~~~~~~-------~~~V~~~~---- 69 (108) T protein:vir:74 1 MKITGIDALQKKLRKNATLDDVKHVVKSNTASMNKNMQNLAPVDTGNMKRSITSEFTDGGL-------SGTTGPHT---- 69 (108) T ss_pred CcchhHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHhCCCCchhhhccceeeeecCce-------EEEeecCC---- Confidence 4544322 233332233456777888888888888888889999999999998766653 24443211 Q ss_pred cccCCCcCCcchhhhhhhhhhcCCCCCCcccccchhh--------HHHHHHH Q lcl|NC_021296. 92 WHSPRNPNPGDLFFYGVLHEHGDGGNPPSGWDFPAHK--------DLKKALA 135 (143) Q Consensus 92 ~Hspr~g~pgd~f~ygvlh~~g~~~~~p~~~~f~ah~--------dl~~a~~ 135 (143) .|+.+=|||....||..+=+||-+ +|++.+. T Consensus 70 -------------~Ya~~vE~GT~km~aqpf~~pa~~~~~~~~~~~i~~~~k 108 (108) T protein:vir:74 70 -------------DYAGYVEYGTRFQSAQPFVKPAFNIQKKVFTNDLERLTK 108 (108) T ss_pred -------------CcccceeccccccCCCcchhhHHHHHHHHHHHHHHHHcC Confidence 278888999999999888888854 3433333 No 7 >protein:vir:98409 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:YP_001210363;genbank:gi:146334932;genbank:GeneID:5114801 Probab=86.70 E-value=0.013 Score=30.85 Aligned_cols=97 Identities=18% Similarity=0.113 Sum_probs=64.5 Q ss_pred cccCCch---hhHHHHHhhhhHHHHHHHHHHHHHHHHhhhhccccccccccceEEEeecCCCCCeeEEEEEecccccccC Q lcl|NC_021296. 15 IQVPNPN---RGLAQILLSPNMELLMGIIGQEVVLAYRAGVAKRTGKLMSSASSETMIGGKKNDRWVSHVTIGGETAVST 91 (143) Q Consensus 15 ~k~~np~---rgl~eiL~S~~m~~Lva~~~e~v~~~Y~a~VAkRTg~LArsvrvetfIGG~K~DRwVg~VtVG~e~aa~~ 91 (143) |++..=. +.|.+.-.-..++..+...+++++..=++++-.+||.|.+|..+++-.||. +++|+... T Consensus 1 i~i~Gld~l~~~l~~~~~~~~~~~al~~~a~~i~~~ak~~apvdTG~Lr~si~~~~~~~~~-------~~~V~~~~---- 69 (108) T protein:vir:98 1 MKITGIDALQKKLRKNATLNDVKHVVKRNTVSMNKNMQNLAPVDTGNMKRSITSEFTDGGL-------TGTTIPHT---- 69 (108) T ss_pred CcchhHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHhCCCCchhhHhhceeeeecCce-------EEEeecCC---- Confidence 4444321 233332223346677888888888888888888999999999988766664 34553211 Q ss_pred cccCCCcCCcchhhhhhhhhhcCCCCCCcccccchhh--------HHHHHHH Q lcl|NC_021296. 92 WHSPRNPNPGDLFFYGVLHEHGDGGNPPSGWDFPAHK--------DLKKALA 135 (143) Q Consensus 92 ~Hspr~g~pgd~f~ygvlh~~g~~~~~p~~~~f~ah~--------dl~~a~~ 135 (143) .|+.+=|||....||...=+||-. +|++++. T Consensus 70 -------------~Ya~~vE~GT~~m~aqPFl~pa~~~~~~~~~~~i~~~lr 108 (108) T protein:vir:98 70 -------------DYAGYVEYGTRFQAAQPFVKPAFDVQKKIFTNDLERLTK 108 (108) T ss_pred -------------CccceeeccccccCCCcchhhHHHHHHHHHHHHHHHHcC Confidence 278888999988888888777754 4444443 No 8 >protein:vir:106506 Length: 137 # NCBI annotation: Pas21 # Family: family:all:1084 # MgeID: mge:1680 # MgeName: phiAsp2 # Cross-refs: genbank:acc:YP_024807;genbank:gi:48697422;genbank:GeneID:2846163 Probab=86.66 E-value=0.027 Score=29.22 Aligned_cols=125 Identities=17% Similarity=0.090 Sum_probs=73.1 Q ss_pred cceeeeeccccCCchhhHHHHHhhhhHHHHHHHHHHHHHHHHhhhhccccccccccceEEEeecCCCCCeeEEEEEeccc Q lcl|NC_021296. 7 TIGHRLTDIQVPNPNRGLAQILLSPNMELLMGIIGQEVVLAYRAGVAKRTGKLMSSASSETMIGGKKNDRWVSHVTIGGE 86 (143) Q Consensus 7 ~i~~~~~d~k~~np~rgl~eiL~S~~m~~Lva~~~e~v~~~Y~a~VAkRTg~LArsvrvetfIGG~K~DRwVg~VtVG~e 86 (143) -|...+ +.- |+.-|=+..+.|+..+..++..++..=+...-.|||.|.+|.+.+.-.++ ...-++.|.-..+ T Consensus 1 ~~~~~~---~l~---~~~l~~~~~~~~~~~~~~~a~~ve~~ak~~aPv~TG~Lr~SI~~~~~~~~--g~~v~~~V~~~~~ 72 (137) T protein:vir:10 1 MVAHTL---RIE---RAQLHGLGMDEARKAVNRVVRRTFTRSQILAPVDTGYLRASGRLVLGRER--GAVVIGSVEYTAR 72 (137) T ss_pred Cccccc---ccC---hhhHhhHHHHHHHHHHHHHHHHHHHHHHhcCCcCchhhhccceeeeeecc--ccEEEEEecCCcc Confidence 233333 222 33444456788999999999999988888888899999999988765322 2234444444445 Q ss_pred ccccCc-----ccCCCcCCcchhhh--hhhhhhcCCCCCCcccccchhhHHHHHHHHhhhccCC Q lcl|NC_021296. 87 TAVSTW-----HSPRNPNPGDLFFY--GVLHEHGDGGNPPSGWDFPAHKDLKKALAVVKARNGA 143 (143) Q Consensus 87 ~aa~~~-----Hspr~g~pgd~f~y--gvlh~~g~~~~~p~~~~f~ah~dl~~a~~~~~~~~~~ 143 (143) ||.-.+ |.-+.-+...|.|| |=.|--..--.|- -++..=|+.|+.-++.+.|- T Consensus 73 YA~~ve~GT~ph~I~pk~~kaL~f~~~G~~vf~k~V~hPG----~k~~PfL~~Al~~~~~~~~~ 132 (137) T protein:vir:10 73 YAAAVHNGRRALTIRAKGNGRLKFTVEGRTVYARSVHQPA----RAGRPYLSQALREVAPQEGF 132 (137) T ss_pred cceeeecCCCCceeecCCCccceeecCCeeEeccceecCC----CCCChhhHHHHHHhhcccce Confidence 655444 43222223344443 1111001110111 22566699999999999998 No 9 >protein:vir:105330 Length: 137 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950673;genbank:gi:119967843;genbank:GeneID:4643209 Probab=86.57 E-value=0.042 Score=28.14 Aligned_cols=123 Identities=15% Similarity=0.054 Sum_probs=72.1 Q ss_pred eeccccCCc--hhhHHHH--HhhhhHHHHHHHHHHHHHHHHhhhhccccccccccceEEEeecCCCCCeeEEEEEecccc Q lcl|NC_021296. 12 LTDIQVPNP--NRGLAQI--LLSPNMELLMGIIGQEVVLAYRAGVAKRTGKLMSSASSETMIGGKKNDRWVSHVTIGGET 87 (143) Q Consensus 12 ~~d~k~~np--~rgl~ei--L~S~~m~~Lva~~~e~v~~~Y~a~VAkRTg~LArsvrvetfIGG~K~DRwVg~VtVG~e~ 87 (143) |..+..+.- .+.|.++ .+...++.-|...+++++..=+..+-.+||.|.+|.++++-.+| -.+.|..+.+| T Consensus 1 Ma~~~~G~~~l~~~l~~~~~~~~~~~~~al~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~-----~~~~V~~~~~Y 75 (137) T protein:vir:10 1 MAKVKYGNWDLVKELEEFEKETIRWAKKGIAKTTTIIHNSIVSNMPVDTGYLRESVSMDFKKGG-----LTGVINIGSEY 75 (137) T ss_pred CccchhCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCcchhhcCeeeEecCCc-----EEEEEecCCcc Confidence 544432211 0122111 12345667778888888888888888999999999998875544 55777777788 Q ss_pred cccCc-----ccCCC--cC--Ccchhhhhh-hhhhcCCCCCCcccccchhhHHHHHHHHhhh Q lcl|NC_021296. 88 AVSTW-----HSPRN--PN--PGDLFFYGV-LHEHGDGGNPPSGWDFPAHKDLKKALAVVKA 139 (143) Q Consensus 88 aa~~~-----Hspr~--g~--pgd~f~ygv-lh~~g~~~~~p~~~~f~ah~dl~~a~~~~~~ 139 (143) |.-.+ |..+. .. ....||+.+ -..+...|.||...=+||-++-++-+.-.-+ T Consensus 76 A~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~Pfl~pA~~~~~~~i~k~i~ 137 (137) T protein:vir:10 76 AVYVNYGTGIYAVGPGGSRAKNIPWRYKDADGHWHTTKGQHAQPFWEPAIDEGRAFFNKYFS 137 (137) T ss_pred ccccccCccccccCCCcccccccceeeeccccccccCCCCCCCcchhHHHHHHHHHHHHhhC Confidence 77666 33111 11 111222222 1234556778887878887665554443333 No 10 >protein:vir:3617 Length: 112 # NCBI annotation: ORF40 # Family: family:all:180 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112703;genbank:gi:13786571;genbank:GeneID:921069 Probab=80.67 E-value=0.035 Score=28.58 Aligned_cols=107 Identities=16% Similarity=0.116 Sum_probs=65.8 Q ss_pred ccceeeeeccccCCchhhHHHHHhhhhHHHHHHHHHHHHHHHHhhhhccccccccccceEEEeecCCCCCeeEEEEEecc Q lcl|NC_021296. 6 TTIGHRLTDIQVPNPNRGLAQILLSPNMELLMGIIGQEVVLAYRAGVAKRTGKLMSSASSETMIGGKKNDRWVSHVTIGG 85 (143) Q Consensus 6 ~~i~~~~~d~k~~np~rgl~eiL~S~~m~~Lva~~~e~v~~~Y~a~VAkRTg~LArsvrvetfIGG~K~DRwVg~VtVG~ 85 (143) -.+.+.+.-++ .=-+.|.+.=.-..++..+...++.++..=+..+-.+||.|.+|.++++-.||. +++||. T Consensus 1 M~~~i~i~Gld--~l~~~L~~~~~~~~~~~al~~~~~~i~~~ak~~aPvdTG~Lr~si~~~~~~~~~-------~~~V~~ 71 (112) T protein:vir:36 1 MKSSLSFKGID--QLVKHLDKAASLKGVQQVVKSNTSNMTANMQKLVPVDTGYMKRSIKMELTEGGF-------SGQAGP 71 (112) T ss_pred CceeeeehhHH--HHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhCCCCchhhhhceeeeecCCce-------EEEeec Confidence 22222221010 001233322222457778888889999888999999999999999988766653 345542 Q ss_pred cccccCcccCCCcCCcchhhhhhhhhhcCCCCCCcccccchhhHHHHHH-----HHhh Q lcl|NC_021296. 86 ETAVSTWHSPRNPNPGDLFFYGVLHEHGDGGNPPSGWDFPAHKDLKKAL-----AVVK 138 (143) Q Consensus 86 e~aa~~~Hspr~g~pgd~f~ygvlh~~g~~~~~p~~~~f~ah~dl~~a~-----~~~~ 138 (143) . . .|+.+-|||-..-||..+=+||-...++.+ .+|| T Consensus 72 ~---------------~--~Ya~~vE~GT~k~~a~Pfl~pa~~~~~~~~~~~i~~~lr 112 (112) T protein:vir:36 72 H---------------T--DYSAYVEYGTRFQSAQPFVKPAYNEQKGVFIKDLERLLK 112 (112) T ss_pred C---------------C--CccceeeccccccCCCcchhhhHHHHHHHHHHHHHHHcC Confidence 1 1 288899999998888888777764433321 1223 No 11 >protein:vir:9578 Length: 78 # NCBI annotation: gp44 # Family: family:all:1171 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862883;genbank:gi:32469475;genbank:GeneID:1461320 Probab=80.33 E-value=0.011 Score=31.36 Aligned_cols=78 Identities=17% Similarity=0.200 Sum_probs=47.6 Q ss_pred eeccccCCchhhHHHHHhhhhHHHHHHHHHHHHHHHHhhhhccccccccccceEEEeecCCCCCeeEEEEEecccccccC Q lcl|NC_021296. 12 LTDIQVPNPNRGLAQILLSPNMELLMGIIGQEVVLAYRAGVAKRTGKLMSSASSETMIGGKKNDRWVSHVTIGGETAVST 91 (143) Q Consensus 12 ~~d~k~~np~rgl~eiL~S~~m~~Lva~~~e~v~~~Y~a~VAkRTg~LArsvrvetfIGG~K~DRwVg~VtVG~e~aa~~ 91 (143) |..+|+-=-..|+.|||.|+.||.++...++.+. .|-|+. -++.+.+| .+|-.+-|.-. + T Consensus 1 ms~~k~klN~aGvr~ll~s~~~Qa~l~~~A~~i~--------~rag~g---Y~~d~~~g---k~Ra~a~V~~~------t 60 (78) T protein:vir:95 1 MSNTKIKLIGAGVGALLKSKEIQDILNKEATVIK--------KRCGPG---YEQDSHVG---KTRANAMIYPT------T 60 (78) T ss_pred CCcceeeeCHHHHHHHhcChhHHHHHHHHHHHHH--------Hhhccc---cccccccC---CcccceEeecC------C Confidence 5555543333699999999999999999988876 344432 33334444 45544444321 1 Q ss_pred cccCCCcCCcchhhhhhhhhhcCCCCCCcccccchhhHHHHHHH Q lcl|NC_021296. 92 WHSPRNPNPGDLFFYGVLHEHGDGGNPPSGWDFPAHKDLKKALA 135 (143) Q Consensus 92 ~Hspr~g~pgd~f~ygvlh~~g~~~~~p~~~~f~ah~dl~~a~~ 135 (143) . -...+=..||-|-|||. T Consensus 61 ~--------------------------~A~~~N~khNTLLKAv~ 78 (78) T protein:vir:95 61 R--------------------------KAKRDNLKNNTLLKAVH 78 (78) T ss_pred h--------------------------HHHHhhhhhhhhhhhcC Confidence 1 12233467888888877 No 12 >protein:vir:107099 Length: 137 # NCBI annotation: conserved phage protein # Family: family:all:180 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950610;genbank:gi:119953690;genbank:GeneID:4643108 Probab=79.97 E-value=0.099 Score=26.08 Aligned_cols=118 Identities=17% Similarity=0.105 Sum_probs=68.1 Q ss_pred eeccccCCchhhHHHHH---------hhhhHHHHHHHHHHHHHHHHhhhhccccccccccceEEEeecCCCCCeeEEEEE Q lcl|NC_021296. 12 LTDIQVPNPNRGLAQIL---------LSPNMELLMGIIGQEVVLAYRAGVAKRTGKLMSSASSETMIGGKKNDRWVSHVT 82 (143) Q Consensus 12 ~~d~k~~np~rgl~eiL---------~S~~m~~Lva~~~e~v~~~Y~a~VAkRTg~LArsvrvetfIGG~K~DRwVg~Vt 82 (143) |..+. .||-++. +...++..+.+.+++++..=+..+-.+||.|.+|..+++..+| -.+.|. T Consensus 1 Ma~~~-----~Gl~~l~~~l~~~~~~~~~~~~~al~~~a~~i~~~ak~~aPvdTG~Lr~SI~~~~~~~~-----~~~~V~ 70 (137) T protein:vir:10 1 MAKVK-----YGNWELVKELEDFEKETIRWAKKGIAKTTTIIHNSIVSNMPVDTGYLRESVSMDFKKGG-----LTGVIN 70 (137) T ss_pred CchhH-----hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCcchhhcCeeEEeeCCc-----EEEEEe Confidence 43321 1333321 1345667788888899998899999999999999998875554 446666 Q ss_pred ecccccccCcc--cCCCcCCc-------ch-hhhhhhhhhcCCCCCCcccccchhhHHHHHHHHhhh Q lcl|NC_021296. 83 IGGETAVSTWH--SPRNPNPG-------DL-FFYGVLHEHGDGGNPPSGWDFPAHKDLKKALAVVKA 139 (143) Q Consensus 83 VG~e~aa~~~H--spr~g~pg-------d~-f~ygvlh~~g~~~~~p~~~~f~ah~dl~~a~~~~~~ 139 (143) .+.+||.-.+. .+..+.|. .. +++.-...+..-|-||..+=+||-++=|+=+.-.-+ T Consensus 71 ~~~~Ya~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~i~k~i~ 137 (137) T protein:vir:10 71 IGSEYAVYVNYGTGIYAVGPGGSRAKNIPWCYKDADGHWHTTKGQHAQPFWEPAIDEGRAFFNKYFS 137 (137) T ss_pred cCCCcccccccCccccccCCCccccccccceeeccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Confidence 67777765551 12211111 11 222223344555677777777775554443332222 No 13 >protein:vir:101594 Length: 173 # NCBI annotation: hypothetical protein # Family: family:all:26502 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112510;genbank:gi:53793610;interpro:IPR010064;uniprot:Q5ZGE3;genbank:GeneID:3101702 Probab=79.09 E-value=0.063 Score=27.15 Aligned_cols=125 Identities=15% Similarity=0.113 Sum_probs=60.7 Q ss_pred cccCCch------hhHHHHHhhhhHHHHHHHHHHHHHHHHhhhhccccccccccceEEEeecCCCCCeeEEEEEeccccc Q lcl|NC_021296. 15 IQVPNPN------RGLAQILLSPNMELLMGIIGQEVVLAYRAGVAKRTGKLMSSASSETMIGGKKNDRWVSHVTIGGETA 88 (143) Q Consensus 15 ~k~~np~------rgl~eiL~S~~m~~Lva~~~e~v~~~Y~a~VAkRTg~LArsvrvetfIGG~K~DRwVg~VtVG~e~a 88 (143) |++.--. +-|++.+ ...++.-+..-++.++..=+.++-.+||.|.+|..++..- +.+..++.|.-...|+ T Consensus 1 i~i~Gld~L~~~L~~l~~~~-~~~~~~a~~~~a~~i~~~ak~~aPv~TG~Lr~sI~~~~~~---~~~~~~~~v~~~~~Ya 76 (173) T protein:vir:10 1 MAVKGVAEVIAELRKIGKDI-DKNINATTEEAANFIEDRAKTLAPKNFGKLAQSISTSDLK---AKDLISKKITVNELYG 76 (173) T ss_pred CcchhHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhCCcCchhhhhcceeeeec---cCceeEEeeCCCcccc Confidence 4554322 1122222 3356667778888888888999999999999999887642 2223344443333333 Q ss_pred ccCc-----cc--CC-------CcC--------------Ccch--------hhhhhhhhhcCCCCCCcccccchhhHHHH Q lcl|NC_021296. 89 VSTW-----HS--PR-------NPN--------------PGDL--------FFYGVLHEHGDGGNPPSGWDFPAHKDLKK 132 (143) Q Consensus 89 a~~~-----Hs--pr-------~g~--------------pgd~--------f~ygvlh~~g~~~~~p~~~~f~ah~dl~~ 132 (143) .-.+ |. |. ++. +.+. +++++.-....-|.||-.+-|||-++-++ T Consensus 77 ~fvEfGT~~m~a~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~G~~aqPFl~PA~~~~~~ 156 (173) T protein:vir:10 77 AYMEFGTGAKVSVPKEFADMAASFKGQKTGSFKDGLESIKAWCRAKGIDEKAAYPIFAKILGAGINPQPFLYPAWIEGKK 156 (173) T ss_pred hhhhcccccccCCCchhhhhhcccccccccccccccccccccccccccchhcccceeeEeecCCCCCCccchhHHHHhHH Confidence 2222 11 00 000 0000 00111001111367888888888766554 Q ss_pred HH-HHhhhccCC Q lcl|NC_021296. 133 AL-AVVKARNGA 143 (143) Q Consensus 133 a~-~~~~~~~~~ 143 (143) .+ ..++.+--. T Consensus 157 ~~~~~i~~~i~~ 168 (173) T protein:vir:10 157 QYLKDLENLLKT 168 (173) T ss_pred HHHHHHHHHHHH Confidence 32 122111110 No 14 >protein:vir:100243 Length: 140 # NCBI annotation: gp72 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355408;genbank:gi:77864698;genbank:GeneID:3725965 Probab=77.90 E-value=0.069 Score=26.96 Aligned_cols=123 Identities=15% Similarity=0.142 Sum_probs=69.0 Q ss_pred eeccccCCch---hhHHHHHh--h-hhHHHHHHHHHHHHHHHHhhhhccccccccccceEEEeecCCCCCeeEEEEEecc Q lcl|NC_021296. 12 LTDIQVPNPN---RGLAQILL--S-PNMELLMGIIGQEVVLAYRAGVAKRTGKLMSSASSETMIGGKKNDRWVSHVTIGG 85 (143) Q Consensus 12 ~~d~k~~np~---rgl~eiL~--S-~~m~~Lva~~~e~v~~~Y~a~VAkRTg~LArsvrvetfIGG~K~DRwVg~VtVG~ 85 (143) |-++++.... +.|.++-. + ..++..+..-++.++..-+..+-.+||.|..+..+..---..+. .+..+.|+. T Consensus 1 Ma~~~i~Gld~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~--~~~~~~~~~ 78 (140) T protein:vir:10 1 MSSVQILGLADLQADFLKLAKAQSTKALRRATVAGANVIRDEARARAPKKTGKLKRNIVTAALKQKDSP--GIATAGVRV 78 (140) T ss_pred CceeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCChhhHHHhceeccccccccc--ceeEEeecc Confidence 7777876443 22222211 1 12355677778888888888888889999999887653221111 111222211 Q ss_pred cccccCcccCCCcCCcchhhhhhhhhhcCCCCCCcccccchhhHHHHHH-HHh---------hhccCC Q lcl|NC_021296. 86 ETAVSTWHSPRNPNPGDLFFYGVLHEHGDGGNPPSGWDFPAHKDLKKAL-AVV---------KARNGA 143 (143) Q Consensus 86 e~aa~~~Hspr~g~pgd~f~ygvlh~~g~~~~~p~~~~f~ah~dl~~a~-~~~---------~~~~~~ 143 (143) ... .+-..+++ .||+-+-|||.-..||..+=+||-..-++.+ ..+ |+-+|- T Consensus 79 ~~~------~~~~~~~~-~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~l~k~~~~~ 139 (140) T protein:vir:10 79 RTK------GKADSPNN-AFYWRFVELGTQFMKAEPFMRPAFDASIAQAEGAIRTEIARAIDQVVGGG 139 (140) T ss_pred ccc------cccCCCCc-ccccceeccCcCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHHhhcC Confidence 100 01112333 4678889999999999999999875443322 111 222333 No 15 >protein:vir:106570 Length: 182 # NCBI annotation: putative protein # Family: family:all:6475 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958588;genbank:gi:41179258;genbank:GeneID:2717106 Probab=77.41 E-value=0.13 Score=25.53 Aligned_cols=129 Identities=17% Similarity=0.203 Sum_probs=74.9 Q ss_pred eeccccCCch---h---hHHHHH---hhhhHHHHHHHHHHHHHHHHhhhhccccccccccceEEEeecCCCCCeeEEEEE Q lcl|NC_021296. 12 LTDIQVPNPN---R---GLAQIL---LSPNMELLMGIIGQEVVLAYRAGVAKRTGKLMSSASSETMIGGKKNDRWVSHVT 82 (143) Q Consensus 12 ~~d~k~~np~---r---gl~eiL---~S~~m~~Lva~~~e~v~~~Y~a~VAkRTg~LArsvrvetfIGG~K~DRwVg~Vt 82 (143) |--|++..-. + .+.+.+ +...|+..+++.+++++..=+..+--+||.|.+|.+.++.. +.+..+|.|. T Consensus 1 m~~v~i~Gld~L~~kl~~~~~~~~~~v~~a~~~~~~~~a~~v~~~ak~~~PvdtG~Lr~SI~~~~~~---~~~~~~g~V~ 77 (182) T protein:vir:10 1 MIEVELKGVNELRAKLKKLPDIMAKATANAQENAIEQAEAYAVDELQSSIKYSTGELTRSFKHEVKV---DGDEVIGRWW 77 (182) T ss_pred CeEEEEecHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCchhhhhceeeeeee---cCCeEEEEee Confidence 5444554332 1 111111 12346666677777888777888889999999999988864 3466778887 Q ss_pred ecccccccCc-----cc---CCC----cCC----cchh------------hhhhhhh-------hcCCCCCCcccccchh Q lcl|NC_021296. 83 IGGETAVSTW-----HS---PRN----PNP----GDLF------------FYGVLHE-------HGDGGNPPSGWDFPAH 127 (143) Q Consensus 83 VG~e~aa~~~-----Hs---pr~----g~p----gd~f------------~ygvlh~-------~g~~~~~p~~~~f~ah 127 (143) ...+||.--+ |- .+. ..| ..-| .|+.-.- ++.-|.||-.+-|||- T Consensus 78 ~~~~ya~yvE~GTG~~~~~~~~~~~p~~~~~~~~~~w~~~~~~v~~~~a~~~~~~~~~~~~~~~~~t~G~~aqPFl~pA~ 157 (182) T protein:vir:10 78 NSSMVAVFREFGTGLVGERSHKQLPKNVAIIYRQTPWFFPVDSVDLDLTKIYGIPKIKINGKYFYRTTGQPARQFMTPAA 157 (182) T ss_pred cCCCccceeecCcccccccCccccCccceeeeecCCceeeccccccccccccccceeeecCceEeecCCCCCCcchHHHH Confidence 7777766443 21 110 111 0111 1222111 2334789999999998 Q ss_pred hHHHHHH---------HHhhhccCC Q lcl|NC_021296. 128 KDLKKAL---------AVVKARNGA 143 (143) Q Consensus 128 ~dl~~a~---------~~~~~~~~~ 143 (143) ++.++-+ .+||...|. T Consensus 158 ~~~~~~i~~~i~~~i~~~l~~~~g~ 182 (182) T protein:vir:10 158 NKMAKEAPEIIKRSIDQELHDKLGG 182 (182) T ss_pred HHhHHHHHHHHHHHHHHHHHHhhcC Confidence 7755433 245666666 No 16 >protein:vir:100075 Length: 140 # NCBI annotation: gp9 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945039;genbank:gi:38707899;genbank:GeneID:2744122 Probab=77.27 E-value=0.067 Score=27.03 Aligned_cols=123 Identities=15% Similarity=0.122 Sum_probs=70.2 Q ss_pred eeccccCCch---hhHHHHH--hhh-hHHHHHHHHHHHHHHHHhhhhccccccccccceEEEeecCCCCCeeEEEEEecc Q lcl|NC_021296. 12 LTDIQVPNPN---RGLAQIL--LSP-NMELLMGIIGQEVVLAYRAGVAKRTGKLMSSASSETMIGGKKNDRWVSHVTIGG 85 (143) Q Consensus 12 ~~d~k~~np~---rgl~eiL--~S~-~m~~Lva~~~e~v~~~Y~a~VAkRTg~LArsvrvetfIGG~K~DRwVg~VtVG~ 85 (143) |.+++.-.-. +.|.++- .+. .++..+..-++.+...-+.++-..||.|..|..+..--. .+.+. +..+.+ T Consensus 1 Ma~~~i~Gld~l~~~l~~L~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~tG~l~~sI~~~~~~~-~~~~~-~~~~g~-- 76 (140) T protein:vir:10 1 MSSIQIIGLADLRADFEKLAKSQSTKALRRATVAGAKVIRDEARKRAPKKTGKLRRNIVSAALRQ-KDAPG-LATAGV-- 76 (140) T ss_pred CceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCChhhHHHhcccccccc-ccccc-eEEeee-- Confidence 7778776443 2222221 122 245677788888888888888889999999887654211 11111 112211 Q ss_pred cccccCcccCCCcCCcchhhhhhhhhhcCCCCCCcccccchhhHHHHHHH-H---------hhhccCC Q lcl|NC_021296. 86 ETAVSTWHSPRNPNPGDLFFYGVLHEHGDGGNPPSGWDFPAHKDLKKALA-V---------VKARNGA 143 (143) Q Consensus 86 e~aa~~~Hspr~g~pgd~f~ygvlh~~g~~~~~p~~~~f~ah~dl~~a~~-~---------~~~~~~~ 143 (143) ..-. . +.-. .+...||..+.|||.-..||..+=.||-..-++.+. . =|+-+|. T Consensus 77 ~~~~--~--~~~~-~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~l~k~~~~~ 139 (140) T protein:vir:10 77 RVRT--K--GKAD-SPNNAFYWRFDEFGTQHMKAQPFMRPAFDASIGEAEGAIRTELARAIDRVLGGR 139 (140) T ss_pred eecc--c--cccC-CCCccceeeeeccCCCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHHhhcc Confidence 1100 0 1111 234568999999999999999998888665444221 1 1223344 No 17 >protein:vir:96225 Length: 115 # NCBI annotation: ORF040 # Family: family:all:180 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239574;genbank:gi:66395330;genbank:GeneID:5132773 Probab=77.04 E-value=0.1 Score=25.98 Aligned_cols=96 Identities=15% Similarity=0.191 Sum_probs=56.6 Q ss_pred cccCCch---hhHHHH--HhhhhHHHHHHHHHHHHHHHHhhhh------ccccccccccceEEEeecCCCCCeeEEEEEe Q lcl|NC_021296. 15 IQVPNPN---RGLAQI--LLSPNMELLMGIIGQEVVLAYRAGV------AKRTGKLMSSASSETMIGGKKNDRWVSHVTI 83 (143) Q Consensus 15 ~k~~np~---rgl~ei--L~S~~m~~Lva~~~e~v~~~Y~a~V------AkRTg~LArsvrvetfIGG~K~DRwVg~VtV 83 (143) |+..--. +.|.++ -....++..+...+++++..=++.. ..+||.|.+|..++. .|| .+++| T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~-~g~-------~~~~v 72 (115) T protein:vir:96 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKK-TGD-------LQYTI 72 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeee-cCc-------eEEEe Confidence 4443222 122111 1112345566666666666544442 458999999998873 343 22345 Q ss_pred cccccccCcccCCCcCCcchhhhhhhhhhcCCCCCCcccccchhhH--------HHHHHH Q lcl|NC_021296. 84 GGETAVSTWHSPRNPNPGDLFFYGVLHEHGDGGNPPSGWDFPAHKD--------LKKALA 135 (143) Q Consensus 84 G~e~aa~~~Hspr~g~pgd~f~ygvlh~~g~~~~~p~~~~f~ah~d--------l~~a~~ 135 (143) +.. =+|+..-|||-.--||...=+||-.. |++++. T Consensus 73 ~~~-----------------~~Ya~~vE~GT~km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:96 73 TSH-----------------AAYSGFLEFGTRYMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred ecC-----------------ccchhhhcccccccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 332 14899999999999999998888654 333333 No 18 >protein:vir:78858 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285365;genbank:gi:148717893;genbank:GeneID:5246989 Probab=77.04 E-value=0.1 Score=25.98 Aligned_cols=96 Identities=15% Similarity=0.191 Sum_probs=56.6 Q ss_pred cccCCch---hhHHHH--HhhhhHHHHHHHHHHHHHHHHhhhh------ccccccccccceEEEeecCCCCCeeEEEEEe Q lcl|NC_021296. 15 IQVPNPN---RGLAQI--LLSPNMELLMGIIGQEVVLAYRAGV------AKRTGKLMSSASSETMIGGKKNDRWVSHVTI 83 (143) Q Consensus 15 ~k~~np~---rgl~ei--L~S~~m~~Lva~~~e~v~~~Y~a~V------AkRTg~LArsvrvetfIGG~K~DRwVg~VtV 83 (143) |+..--. +.|.++ -....++..+...+++++..=++.. ..+||.|.+|..++. .|| .+++| T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~-~g~-------~~~~v 72 (115) T protein:vir:78 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKK-TGD-------LQYTI 72 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeee-cCc-------eEEEe Confidence 4443222 122111 1112345566666666666544442 458999999998873 343 22345 Q ss_pred cccccccCcccCCCcCCcchhhhhhhhhhcCCCCCCcccccchhhH--------HHHHHH Q lcl|NC_021296. 84 GGETAVSTWHSPRNPNPGDLFFYGVLHEHGDGGNPPSGWDFPAHKD--------LKKALA 135 (143) Q Consensus 84 G~e~aa~~~Hspr~g~pgd~f~ygvlh~~g~~~~~p~~~~f~ah~d--------l~~a~~ 135 (143) +.. =+|+..-|||-.--||...=+||-.. |++++. T Consensus 73 ~~~-----------------~~Ya~~vE~GT~km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:78 73 TSH-----------------AAYSGFLEFGTRYMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred ecC-----------------ccchhhhcccccccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 332 14899999999999999998888654 333333 No 19 >protein:vir:103917 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873996;genbank:gi:118430771;genbank:GeneID:4525409 Probab=77.04 E-value=0.1 Score=25.98 Aligned_cols=96 Identities=15% Similarity=0.191 Sum_probs=56.6 Q ss_pred cccCCch---hhHHHH--HhhhhHHHHHHHHHHHHHHHHhhhh------ccccccccccceEEEeecCCCCCeeEEEEEe Q lcl|NC_021296. 15 IQVPNPN---RGLAQI--LLSPNMELLMGIIGQEVVLAYRAGV------AKRTGKLMSSASSETMIGGKKNDRWVSHVTI 83 (143) Q Consensus 15 ~k~~np~---rgl~ei--L~S~~m~~Lva~~~e~v~~~Y~a~V------AkRTg~LArsvrvetfIGG~K~DRwVg~VtV 83 (143) |+..--. +.|.++ -....++..+...+++++..=++.. ..+||.|.+|..++. .|| .+++| T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~-~g~-------~~~~v 72 (115) T protein:vir:10 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKK-TGD-------LQYTI 72 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeee-cCc-------eEEEe Confidence 4443222 122111 1112345566666666666544442 458999999998873 343 22345 Q ss_pred cccccccCcccCCCcCCcchhhhhhhhhhcCCCCCCcccccchhhH--------HHHHHH Q lcl|NC_021296. 84 GGETAVSTWHSPRNPNPGDLFFYGVLHEHGDGGNPPSGWDFPAHKD--------LKKALA 135 (143) Q Consensus 84 G~e~aa~~~Hspr~g~pgd~f~ygvlh~~g~~~~~p~~~~f~ah~d--------l~~a~~ 135 (143) +.. =+|+..-|||-.--||...=+||-.. |++++. T Consensus 73 ~~~-----------------~~Ya~~vE~GT~km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:10 73 TSH-----------------AAYSGFLEFGTRYMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred ecC-----------------ccchhhhcccccccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 332 14899999999999999998888654 333333 No 20 >protein:vir:97144 Length: 115 # NCBI annotation: ORF047 # Family: family:all:180 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239729;genbank:gi:66394911;genbank:GeneID:5130877 Probab=77.04 E-value=0.1 Score=25.98 Aligned_cols=96 Identities=15% Similarity=0.191 Sum_probs=56.6 Q ss_pred cccCCch---hhHHHH--HhhhhHHHHHHHHHHHHHHHHhhhh------ccccccccccceEEEeecCCCCCeeEEEEEe Q lcl|NC_021296. 15 IQVPNPN---RGLAQI--LLSPNMELLMGIIGQEVVLAYRAGV------AKRTGKLMSSASSETMIGGKKNDRWVSHVTI 83 (143) Q Consensus 15 ~k~~np~---rgl~ei--L~S~~m~~Lva~~~e~v~~~Y~a~V------AkRTg~LArsvrvetfIGG~K~DRwVg~VtV 83 (143) |+..--. +.|.++ -....++..+...+++++..=++.. ..+||.|.+|..++. .|| .+++| T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~-~g~-------~~~~v 72 (115) T protein:vir:97 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKK-TGD-------LQYTI 72 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeee-cCc-------eEEEe Confidence 4443222 122111 1112345566666666666544442 458999999998873 343 22345 Q ss_pred cccccccCcccCCCcCCcchhhhhhhhhhcCCCCCCcccccchhhH--------HHHHHH Q lcl|NC_021296. 84 GGETAVSTWHSPRNPNPGDLFFYGVLHEHGDGGNPPSGWDFPAHKD--------LKKALA 135 (143) Q Consensus 84 G~e~aa~~~Hspr~g~pgd~f~ygvlh~~g~~~~~p~~~~f~ah~d--------l~~a~~ 135 (143) +.. =+|+..-|||-.--||...=+||-.. |++++. T Consensus 73 ~~~-----------------~~Ya~~vE~GT~km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:97 73 TSH-----------------AAYSGFLEFGTRYMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred ecC-----------------ccchhhhcccccccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 332 14899999999999999998888654 333333 No 21 >protein:vir:9312 Length: 115 # NCBI annotation: phi Mu50B-like protein # Family: family:all:180 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803290;genbank:gi:29028600;genbank:GeneID:1258048 Probab=77.04 E-value=0.1 Score=25.98 Aligned_cols=96 Identities=15% Similarity=0.191 Sum_probs=56.6 Q ss_pred cccCCch---hhHHHH--HhhhhHHHHHHHHHHHHHHHHhhhh------ccccccccccceEEEeecCCCCCeeEEEEEe Q lcl|NC_021296. 15 IQVPNPN---RGLAQI--LLSPNMELLMGIIGQEVVLAYRAGV------AKRTGKLMSSASSETMIGGKKNDRWVSHVTI 83 (143) Q Consensus 15 ~k~~np~---rgl~ei--L~S~~m~~Lva~~~e~v~~~Y~a~V------AkRTg~LArsvrvetfIGG~K~DRwVg~VtV 83 (143) |+..--. +.|.++ -....++..+...+++++..=++.. ..+||.|.+|..++. .|| .+++| T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~-~g~-------~~~~v 72 (115) T protein:vir:93 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKK-TGD-------LQYTI 72 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeee-cCc-------eEEEe Confidence 4443222 122111 1112345566666666666544442 458999999998873 343 22345 Q ss_pred cccccccCcccCCCcCCcchhhhhhhhhhcCCCCCCcccccchhhH--------HHHHHH Q lcl|NC_021296. 84 GGETAVSTWHSPRNPNPGDLFFYGVLHEHGDGGNPPSGWDFPAHKD--------LKKALA 135 (143) Q Consensus 84 G~e~aa~~~Hspr~g~pgd~f~ygvlh~~g~~~~~p~~~~f~ah~d--------l~~a~~ 135 (143) +.. =+|+..-|||-.--||...=+||-.. |++++. T Consensus 73 ~~~-----------------~~Ya~~vE~GT~km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:93 73 TSH-----------------AAYSGFLEFGTRYMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred ecC-----------------ccchhhhcccccccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 332 14899999999999999998888654 333333 No 22 >protein:vir:96358 Length: 115 # NCBI annotation: ORF045 # Family: family:all:180 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239651;genbank:gi:66395408;genbank:GeneID:5132834 Probab=77.04 E-value=0.1 Score=25.98 Aligned_cols=96 Identities=15% Similarity=0.191 Sum_probs=56.6 Q ss_pred cccCCch---hhHHHH--HhhhhHHHHHHHHHHHHHHHHhhhh------ccccccccccceEEEeecCCCCCeeEEEEEe Q lcl|NC_021296. 15 IQVPNPN---RGLAQI--LLSPNMELLMGIIGQEVVLAYRAGV------AKRTGKLMSSASSETMIGGKKNDRWVSHVTI 83 (143) Q Consensus 15 ~k~~np~---rgl~ei--L~S~~m~~Lva~~~e~v~~~Y~a~V------AkRTg~LArsvrvetfIGG~K~DRwVg~VtV 83 (143) |+..--. +.|.++ -....++..+...+++++..=++.. ..+||.|.+|..++. .|| .+++| T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~-~g~-------~~~~v 72 (115) T protein:vir:96 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKK-TGD-------LQYTI 72 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeee-cCc-------eEEEe Confidence 4443222 122111 1112345566666666666544442 458999999998873 343 22345 Q ss_pred cccccccCcccCCCcCCcchhhhhhhhhhcCCCCCCcccccchhhH--------HHHHHH Q lcl|NC_021296. 84 GGETAVSTWHSPRNPNPGDLFFYGVLHEHGDGGNPPSGWDFPAHKD--------LKKALA 135 (143) Q Consensus 84 G~e~aa~~~Hspr~g~pgd~f~ygvlh~~g~~~~~p~~~~f~ah~d--------l~~a~~ 135 (143) +.. =+|+..-|||-.--||...=+||-.. |++++. T Consensus 73 ~~~-----------------~~Ya~~vE~GT~km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:96 73 TSH-----------------AAYSGFLEFGTRYMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred ecC-----------------ccchhhhcccccccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 332 14899999999999999998888654 333333 No 23 >protein:vir:1642 Length: 78 # NCBI annotation: hypothetical protein # Family: family:all:1171 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695063;genbank:gi:23455754;genbank:GeneID:955475 Probab=76.97 E-value=0.017 Score=30.26 Aligned_cols=78 Identities=18% Similarity=0.229 Sum_probs=45.9 Q ss_pred eeccccCCchhhHHHHHhhhhHHHHHHHHHHHHHHHHhhhhccccccccccceEEEeecCCCCCeeEEEEEecccccccC Q lcl|NC_021296. 12 LTDIQVPNPNRGLAQILLSPNMELLMGIIGQEVVLAYRAGVAKRTGKLMSSASSETMIGGKKNDRWVSHVTIGGETAVST 91 (143) Q Consensus 12 ~~d~k~~np~rgl~eiL~S~~m~~Lva~~~e~v~~~Y~a~VAkRTg~LArsvrvetfIGG~K~DRwVg~VtVG~e~aa~~ 91 (143) |..+|+-=-..|+.|+|.|++||..+..+++.+. .|-|+ .-+..|.+| .+|-.+.|.-. T Consensus 1 Ms~~kfklN~aGv~~llks~~iQa~l~~~a~~i~--------~raG~---gy~~dv~vg---k~Ra~a~V~~~------- 59 (78) T protein:vir:16 1 MAKNLFKLNRSGVASMMKSPEMQAILKEKASAVK--------QRCGP---GYGQDMHVG---KNRANAMVFAE------- 59 (78) T ss_pred CCcceeEeCHHHHHHHhcCchhHHHHHHHHHHHH--------HhhcC---cceeccccC---CcccceEeccC------- Confidence 5444443333599999999999999999988876 33332 123333333 22322222211 Q ss_pred cccCCCcCCcchhhhhhhhhhcCCCCCCcccccchhhHHHHHHH Q lcl|NC_021296. 92 WHSPRNPNPGDLFFYGVLHEHGDGGNPPSGWDFPAHKDLKKALA 135 (143) Q Consensus 92 ~Hspr~g~pgd~f~ygvlh~~g~~~~~p~~~~f~ah~dl~~a~~ 135 (143) ++-+..+=..||-|-|||. T Consensus 60 -------------------------t~~A~~~N~KhNTLLKAv~ 78 (78) T protein:vir:16 60 -------------------------TYQAKRDNMKNNTILKAVR 78 (78) T ss_pred -------------------------ChhhHHhhhhcchhhhhcC Confidence 1223444568999988887 No 24 >protein:vir:94763 Length: 78 # NCBI annotation: unknown # Family: family:all:1171 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996710;genbank:gi:45597425;genbank:GeneID:2769032 Probab=76.97 E-value=0.017 Score=30.26 Aligned_cols=78 Identities=18% Similarity=0.229 Sum_probs=45.9 Q ss_pred eeccccCCchhhHHHHHhhhhHHHHHHHHHHHHHHHHhhhhccccccccccceEEEeecCCCCCeeEEEEEecccccccC Q lcl|NC_021296. 12 LTDIQVPNPNRGLAQILLSPNMELLMGIIGQEVVLAYRAGVAKRTGKLMSSASSETMIGGKKNDRWVSHVTIGGETAVST 91 (143) Q Consensus 12 ~~d~k~~np~rgl~eiL~S~~m~~Lva~~~e~v~~~Y~a~VAkRTg~LArsvrvetfIGG~K~DRwVg~VtVG~e~aa~~ 91 (143) |..+|+-=-..|+.|+|.|++||..+..+++.+. .|-|+ .-+..|.+| .+|-.+.|.-. T Consensus 1 Ms~~kfklN~aGv~~llks~~iQa~l~~~a~~i~--------~raG~---gy~~dv~vg---k~Ra~a~V~~~------- 59 (78) T protein:vir:94 1 MAKNLFKLNRSGVASMMKSPEMQAILKEKASAVK--------QRCGP---GYGQDMHVG---KNRANAMVFAE------- 59 (78) T ss_pred CCcceeEeCHHHHHHHhcCchhHHHHHHHHHHHH--------HhhcC---cceeccccC---CcccceEeccC------- Confidence 5444443333599999999999999999988876 33332 123333333 22322222211 Q ss_pred cccCCCcCCcchhhhhhhhhhcCCCCCCcccccchhhHHHHHHH Q lcl|NC_021296. 92 WHSPRNPNPGDLFFYGVLHEHGDGGNPPSGWDFPAHKDLKKALA 135 (143) Q Consensus 92 ~Hspr~g~pgd~f~ygvlh~~g~~~~~p~~~~f~ah~dl~~a~~ 135 (143) ++-+..+=..||-|-|||. T Consensus 60 -------------------------t~~A~~~N~KhNTLLKAv~ 78 (78) T protein:vir:94 60 -------------------------TYQAKRDNMKNNTILKAVR 78 (78) T ss_pred -------------------------ChhhHHhhhhcchhhhhcC Confidence 1223444568999988887 No 25 >protein:vir:80362 Length: 140 # NCBI annotation: gp10, phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111089;genbank:gi:134288660;genbank:GeneID:4960609 Probab=76.92 E-value=0.09 Score=26.31 Aligned_cols=119 Identities=21% Similarity=0.187 Sum_probs=69.5 Q ss_pred eeccccCCchhhHHHHH-----hhhh-----HHHHHHHHHHHHHHHHhhhhccccccccccceEEEeecCCCCCeeEEEE Q lcl|NC_021296. 12 LTDIQVPNPNRGLAQIL-----LSPN-----MELLMGIIGQEVVLAYRAGVAKRTGKLMSSASSETMIGGKKNDRWVSHV 81 (143) Q Consensus 12 ~~d~k~~np~rgl~eiL-----~S~~-----m~~Lva~~~e~v~~~Y~a~VAkRTg~LArsvrvetfIGG~K~DRwVg~V 81 (143) |-.|++- ||-+++ ++.+ ++..+..-++.+...-+..+-+.||.|..+..+...-. .+.+ +.+ T Consensus 1 Ma~~~i~----Gld~l~~~l~~l~~~~~~k~~~~a~~~~a~~v~~~ak~~aP~~tG~l~~~i~~~~~~~-~~~~---~~~ 72 (140) T protein:vir:80 1 MSSIQIV----GLADLLADFERLAKSQSTKALRRATVAGAKVIRDEARKRAPKKTGKLRRNIVSAALRQ-KDAP---GLA 72 (140) T ss_pred Cceeeeh----hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhceeeecccc-cccc---cee Confidence 6666665 444433 2222 35577888999999999998899999999987654321 1111 112 Q ss_pred EecccccccCcccCCCcCCcchhhhhhhhhhcCCCCCCcccccchhhHHHHHH-HH---------hhhccCC Q lcl|NC_021296. 82 TIGGETAVSTWHSPRNPNPGDLFFYGVLHEHGDGGNPPSGWDFPAHKDLKKAL-AV---------VKARNGA 143 (143) Q Consensus 82 tVG~e~aa~~~Hspr~g~pgd~f~ygvlh~~g~~~~~p~~~~f~ah~dl~~a~-~~---------~~~~~~~ 143 (143) .|+...-. -+. -+ ..+.+||+.+-|||.-..||..+=+||-..-++.+ .. =|+-+|+ T Consensus 73 ~~~~~~~~--~~~--~~-~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~l~k~~~~~ 139 (140) T protein:vir:80 73 TAGVRVRT--KGK--AD-SPSNAFYWRFDEFGTQHMKAQPFMRPAFDASIGEAEGAIRTELARAIDQALGGR 139 (140) T ss_pred eeeeeccc--ccc--cC-CCCCcceeeeeccCCCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHHhhcc Confidence 22111110 011 11 23456788888999999999999888865543222 11 1333444 No 26 >protein:vir:5978 Length: 144 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690678;genbank:geneid:6329146;genbank:gi:22855072;interpro:IPR011693;uniprot:O48447;genbank:GeneID:955318 Probab=76.73 E-value=0.13 Score=25.40 Aligned_cols=132 Identities=13% Similarity=0.095 Sum_probs=69.7 Q ss_pred CCCCcccceeeeeccc-cCCchhhHHHHH--hhhhHHHHHHHHHHHHHHHHhhhhccccccccccceEEEeecCCCCCee Q lcl|NC_021296. 1 MPAAGTTIGHRLTDIQ-VPNPNRGLAQIL--LSPNMELLMGIIGQEVVLAYRAGVAKRTGKLMSSASSETMIGGKKNDRW 77 (143) Q Consensus 1 ~pa~g~~i~~~~~d~k-~~np~rgl~eiL--~S~~m~~Lva~~~e~v~~~Y~a~VAkRTg~LArsvrvetfIGG~K~DRw 77 (143) |+-.--.| -. + .-.-.+.|.++- +...++..+.+.+++++..=+..+-.+||+|.+|...++-.+ .. T Consensus 1 m~~ms~~i--~~---~g~~~l~~~l~~~~~~~~~~v~~~l~~~a~~i~~~ak~~apv~TG~Lr~SI~~~~~~~-----g~ 70 (144) T protein:vir:59 1 MALMSVRI--DP---SWRRIMSRNVRTFSGHVLTQVEQVIIKTAEKIAGLAASLAPVDEGNLKNSIQIDYKNN-----GL 70 (144) T ss_pred CCcceeee--hh---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcCeeEEeecC-----cE Confidence 43221111 00 0 000001111111 245677888888999988888888889999999999887443 45 Q ss_pred EEEEEecccccccCc--ccCCCcCC---cchhhhhhhh---hhcCCCCCCcccccchhhHHHHHH-HHhhhccC Q lcl|NC_021296. 78 VSHVTIGGETAVSTW--HSPRNPNP---GDLFFYGVLH---EHGDGGNPPSGWDFPAHKDLKKAL-AVVKARNG 142 (143) Q Consensus 78 Vg~VtVG~e~aa~~~--Hspr~g~p---gd~f~ygvlh---~~g~~~~~p~~~~f~ah~dl~~a~-~~~~~~~~ 142 (143) .++|....+||.-.+ |.+..+.| ..+.+|...- .+..-|.||...=+||-++-++-+ ..++.--| T Consensus 71 ~~~V~~~~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~Pfl~pA~~~~~~~~~~~i~~~~g 144 (144) T protein:vir:59 71 TAEITVGAEYAIYVEYGTGIYAVDGNGRKTPWTYYSPKLGRYVRTQGAPAQPFFWPAVEEGGEYFEREMRRLRG 144 (144) T ss_pred EEEEecCCCccchhhcCccccccCCCccccccccccccccceecCCCCCCCcchhHHHHHHHHHHHHHHHHhcC Confidence 677777777776655 22222222 2222222100 011235677777777766554433 23444444 No 27 >protein:vir:96829 Length: 135 # NCBI annotation: ORF033 # Family: family:all:180 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240161;genbank:gi:66395838;genbank:GeneID:5133170 Probab=75.65 E-value=0.14 Score=25.19 Aligned_cols=116 Identities=16% Similarity=0.117 Sum_probs=63.4 Q ss_pred eeccccCCchhhHHHHH---------hhhhHHHHHHHHHHHHHHHHhhhhccccccccccceEEEeecCCCCCeeEEEEE Q lcl|NC_021296. 12 LTDIQVPNPNRGLAQIL---------LSPNMELLMGIIGQEVVLAYRAGVAKRTGKLMSSASSETMIGGKKNDRWVSHVT 82 (143) Q Consensus 12 ~~d~k~~np~rgl~eiL---------~S~~m~~Lva~~~e~v~~~Y~a~VAkRTg~LArsvrvetfIGG~K~DRwVg~Vt 82 (143) |-.++ +||-++. +...++.-+...+++++..=+..+-.+||.|.+|..+++-.+| ..++|. T Consensus 1 Ma~~~-----~Gl~~l~~~l~~~~~~~~~~~~~al~~~a~~v~~~ak~~apvdTG~Lr~SI~~~~~~~g-----~~~~V~ 70 (135) T protein:vir:96 1 MAKVK-----YGADSIVVDLEKYSKDMEKWVKKGITKTTLKIYNTAIHLMPVDTGFLRQSTTVDFENGG-----FTGVVK 70 (135) T ss_pred Cchhh-----hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcceeEEeecCc-----EEEEEe Confidence 21111 2333322 2345666777778888877778888899999999998875444 566776 Q ss_pred ecccccccCc-----cc--CCCcCCcchhhhhh---hhhhcCCCCCCcccccchhhHHHHHHHHhhh Q lcl|NC_021296. 83 IGGETAVSTW-----HS--PRNPNPGDLFFYGV---LHEHGDGGNPPSGWDFPAHKDLKKALAVVKA 139 (143) Q Consensus 83 VG~e~aa~~~-----Hs--pr~g~pgd~f~ygv---lh~~g~~~~~p~~~~f~ah~dl~~a~~~~~~ 139 (143) -..+||.--+ |. +..+.+.--||++. .+. .-+-||..+=+||-.+-++-+...=+ T Consensus 71 ~~~~YA~~ve~GT~~~~~~~~~~~~~~~~~~~~~g~~~~--~~~~~a~pfl~~A~~~~~~~~~~~i~ 135 (135) T protein:vir:96 71 IGSNYAVYVNYGTGIYATKGSRAHKIPWTYKDPNGKWHT--TYGQMPQPFWEPAIDAGRQTFEQYFS 135 (135) T ss_pred cCCCccchhhcccccccCCCccccccccccccCCcceee--cCCcCCCcchhHHHHHHHHHHHHhcC Confidence 6667776555 22 22233333333322 111 13455655555555544443332222 No 28 >protein:vir:99101 Length: 142 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1608 # MgeName: Qyrzula # Cross-refs: genbank:acc:YP_655705;genbank:gi:109521783;genbank:GeneID:4157823 Probab=75.20 E-value=0.092 Score=26.27 Aligned_cols=134 Identities=19% Similarity=0.104 Sum_probs=73.1 Q ss_pred cccceeeeeccccCCchhhHHHHHhhhhHHHHHHHHHHHHHHHHhhhhccccccccccceEEEeecCCCCCeeEEEEEec Q lcl|NC_021296. 5 GTTIGHRLTDIQVPNPNRGLAQILLSPNMELLMGIIGQEVVLAYRAGVAKRTGKLMSSASSETMIGGKKNDRWVSHVTIG 84 (143) Q Consensus 5 g~~i~~~~~d~k~~np~rgl~eiL~S~~m~~Lva~~~e~v~~~Y~a~VAkRTg~LArsvrvetfIGG~K~DRwVg~VtVG 84 (143) =..+-|.+... ..++ +.+.+ .....++.-+...++.++..=++.+-.+||.|.+|...++-.. ....+..+.|... T Consensus 1 m~~~~~~~~gl-~~~l-~~~~~-~~~~~~~~~i~~~a~~v~~~Ak~~aPv~tG~Lr~SI~~~~~~~-~~~~~~~~~v~~~ 76 (142) T protein:vir:99 1 MVQVSVRYEGF-DYNP-VGAAA-QVGPILRRTHSSLTRQIANETRARVPVLTGHLGRSVREDPQVM-VTPFHVSGGVTAH 76 (142) T ss_pred CceeEEEeeec-chhH-HHHHH-HHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcceeeeeccc-cccceEEEEeccC Confidence 11223333211 1233 22222 2345677788888888888888888889999999998765322 1222333444444 Q ss_pred ccccccCcccCCCcCCcchhhhhhhhhhcCCCCCCccc----ccchhhHHHHHHHHhhhccCC Q lcl|NC_021296. 85 GETAVSTWHSPRNPNPGDLFFYGVLHEHGDGGNPPSGW----DFPAHKDLKKALAVVKARNGA 143 (143) Q Consensus 85 ~e~aa~~~Hspr~g~pgd~f~ygvlh~~g~~~~~p~~~----~f~ah~dl~~a~~~~~~~~~~ 143 (143) .+|+.-. |.+-.+|...|+....|+..-+|+..+.+. -.+++.=|+.|+...+.+.-+ T Consensus 77 a~YA~~v-e~GT~ph~i~pk~~~al~f~~~g~~~~~k~v~hpG~~a~Pfl~~A~~~~~~~~~~ 138 (142) T protein:vir:99 77 AKYAAAV-HEGTRPHVIRAKHAQALHFWWRGREVFVRQVNHPGTRARPYLRNAGEAVVRRDRR 138 (142) T ss_pred cccccee-ccCCccceeccccCceeeEecCCceeeeeeeecCCCCCCchhHHHHHHHHhhhhh Confidence 4554433 334445556666666666555554444442 123666677777665544333 No 29 >protein:vir:8669 Length: 142 # NCBI annotation: gp27 # Family: family:all:1084 # MgeID: mge:156 # MgeName: Rosebush # Cross-refs: genbank:acc:NP_817788;genbank:gi:29566220;genbank:GeneID:1259476 Probab=75.20 E-value=0.092 Score=26.27 Aligned_cols=134 Identities=19% Similarity=0.104 Sum_probs=73.1 Q ss_pred cccceeeeeccccCCchhhHHHHHhhhhHHHHHHHHHHHHHHHHhhhhccccccccccceEEEeecCCCCCeeEEEEEec Q lcl|NC_021296. 5 GTTIGHRLTDIQVPNPNRGLAQILLSPNMELLMGIIGQEVVLAYRAGVAKRTGKLMSSASSETMIGGKKNDRWVSHVTIG 84 (143) Q Consensus 5 g~~i~~~~~d~k~~np~rgl~eiL~S~~m~~Lva~~~e~v~~~Y~a~VAkRTg~LArsvrvetfIGG~K~DRwVg~VtVG 84 (143) =..+-|.+... ..++ +.+.+ .....++.-+...++.++..=++.+-.+||.|.+|...++-.. ....+..+.|... T Consensus 1 m~~~~~~~~gl-~~~l-~~~~~-~~~~~~~~~i~~~a~~v~~~Ak~~aPv~tG~Lr~SI~~~~~~~-~~~~~~~~~v~~~ 76 (142) T protein:vir:86 1 MVQVSVRYEGF-DYNP-VGAAA-QVGPILRRTHSSLTRQIANETRARVPVLTGHLGRSVREDPQVM-VTPFHVSGGVTAH 76 (142) T ss_pred CceeEEEeeec-chhH-HHHHH-HHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcceeeeeccc-cccceEEEEeccC Confidence 11223333211 1233 22222 2345677788888888888888888889999999998765322 1222333444444 Q ss_pred ccccccCcccCCCcCCcchhhhhhhhhhcCCCCCCccc----ccchhhHHHHHHHHhhhccCC Q lcl|NC_021296. 85 GETAVSTWHSPRNPNPGDLFFYGVLHEHGDGGNPPSGW----DFPAHKDLKKALAVVKARNGA 143 (143) Q Consensus 85 ~e~aa~~~Hspr~g~pgd~f~ygvlh~~g~~~~~p~~~----~f~ah~dl~~a~~~~~~~~~~ 143 (143) .+|+.-. |.+-.+|...|+....|+..-+|+..+.+. -.+++.=|+.|+...+.+.-+ T Consensus 77 a~YA~~v-e~GT~ph~i~pk~~~al~f~~~g~~~~~k~v~hpG~~a~Pfl~~A~~~~~~~~~~ 138 (142) T protein:vir:86 77 AKYAAAV-HEGTRPHVIRAKHAQALHFWWRGREVFVRQVNHPGTRARPYLRNAGEAVVRRDRR 138 (142) T ss_pred cccccee-ccCCccceeccccCceeeEecCCceeeeeeeecCCCCCCchhHHHHHHHHhhhhh Confidence 4554433 334445556666666666555554444442 123666677777665544333 No 30 >protein:vir:78077 Length: 141 # NCBI annotation: gp9 # Family: family:all:180 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468793;genbank:gi:157325374;genbank:GeneID:5601839 Probab=74.09 E-value=0.13 Score=25.41 Aligned_cols=124 Identities=13% Similarity=0.125 Sum_probs=55.1 Q ss_pred eeccccCCchhhHHHHHhhhhHHHHHHHHHHHHHHHHhh-----hhccccccccccceEEEeecCCCCCeeEEEEEeccc Q lcl|NC_021296. 12 LTDIQVPNPNRGLAQILLSPNMELLMGIIGQEVVLAYRA-----GVAKRTGKLMSSASSETMIGGKKNDRWVSHVTIGGE 86 (143) Q Consensus 12 ~~d~k~~np~rgl~eiL~S~~m~~Lva~~~e~v~~~Y~a-----~VAkRTg~LArsvrvetfIGG~K~DRwVg~VtVG~e 86 (143) |.|+|..+--..+-.-+ ....+.-+..+++..+..-+. ....+||.|.+|...++..+|.. +.|-...+ T Consensus 1 ~~~~~f~~~~~~~~~~~-~k~~~~~~~~~a~~~~~~~ie~~ak~~~pvdtG~L~~SI~~~v~~~g~~-----~~V~~~~~ 74 (141) T protein:vir:78 1 MNEFEFDSNIPKARKLI-EKKVLQALEDIGEHMTTELAEGGHGVTSNNDTGEYAQKSGYKVRKSSKE-----VIVGNSSD 74 (141) T ss_pred CcchhHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhhhhccccccchhhcceeeeeecCCcE-----EEEecCCC Confidence 88888775533332222 233333344444443333322 33458999999999999888752 23334444 Q ss_pred ccccCc-----ccCCCcCCcchhhhhhhh-h-hcCCCCCCcccccchhhHHHHHHH-----Hhhhcc Q lcl|NC_021296. 87 TAVSTW-----HSPRNPNPGDLFFYGVLH-E-HGDGGNPPSGWDFPAHKDLKKALA-----VVKARN 141 (143) Q Consensus 87 ~aa~~~-----Hspr~g~pgd~f~ygvlh-~-~g~~~~~p~~~~f~ah~dl~~a~~-----~~~~~~ 141 (143) ||.--+ |.......-.+.||-.-. + |=--|.||...-+||-++-++=+. +++--| T Consensus 75 YA~yVE~GTG~~~~~~~grk~~w~y~~~~g~~~~t~G~~aqpFl~~A~~~~~~~i~~~i~~~~~~l~ 141 (141) T protein:vir:78 75 YAIYYEFGTGEKSERGGGKAGGWFYMDKKGHWHFTRGSQASKRMRYTFRDEQDKVRVFTERALRGIN 141 (141) T ss_pred ccceeecCCcccccCCCCCcCcceeecCCCeeEeccCCCCchhhhhhHHhhHHHHHHHHHHHhhccC Confidence 443332 221111122333331100 0 000134555555555443332211 111112 No 31 >protein:vir:95894 Length: 137 # NCBI annotation: ORF046 # Family: family:all:180 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240389;genbank:gi:66396083;genbank:GeneID:5133405 Probab=73.55 E-value=0.17 Score=24.82 Aligned_cols=123 Identities=13% Similarity=0.047 Sum_probs=66.8 Q ss_pred eeccccCCc--hhhHHHH--HhhhhHHHHHHHHHHHHHHHHhhhhccccccccccceEEEeecCCCCCeeEEEEEecccc Q lcl|NC_021296. 12 LTDIQVPNP--NRGLAQI--LLSPNMELLMGIIGQEVVLAYRAGVAKRTGKLMSSASSETMIGGKKNDRWVSHVTIGGET 87 (143) Q Consensus 12 ~~d~k~~np--~rgl~ei--L~S~~m~~Lva~~~e~v~~~Y~a~VAkRTg~LArsvrvetfIGG~K~DRwVg~VtVG~e~ 87 (143) |-.+..+-- .+.|-++ .+...++..+...+++++..=+..+-.+||.|.+|.+.++-.+| -.++|.-..+| T Consensus 1 Ma~~~~G~~~l~~~l~~~~~~~~~~~~~~~~~~a~~v~~~ak~~aPv~TG~L~~Si~~~~~~~~-----~~~~V~~~~~Y 75 (137) T protein:vir:95 1 MAKVKYGNWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDGG-----FTGVINIGSEY 75 (137) T ss_pred CchhHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcCeeeEeeCCc-----eEEEEecCCCc Confidence 433321110 0111111 12345566677778888888888888999999999998876554 44677777777 Q ss_pred cccCc--ccCCCcCC-------cchhhhhh-hhhhcCCCCCCcccccchhhHHHHHHHHhhh Q lcl|NC_021296. 88 AVSTW--HSPRNPNP-------GDLFFYGV-LHEHGDGGNPPSGWDFPAHKDLKKALAVVKA 139 (143) Q Consensus 88 aa~~~--Hspr~g~p-------gd~f~ygv-lh~~g~~~~~p~~~~f~ah~dl~~a~~~~~~ 139 (143) |.-.+ |.+..+.+ +.-||+.+ -..+-.-|.||..+=+||-.+.++-+.-.-+ T Consensus 76 A~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~i~k~l~ 137 (137) T protein:vir:95 76 AIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 137 (137) T ss_pred ccccccCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHHHHHHHHHHHHhhC Confidence 76555 22211111 11222211 1111123567777777776665554443333 No 32 >protein:vir:94490 Length: 137 # NCBI annotation: ORF043 # Family: family:all:180 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240680;genbank:gi:66396374;genbank:GeneID:5133754 Probab=72.37 E-value=0.18 Score=24.62 Aligned_cols=123 Identities=12% Similarity=0.029 Sum_probs=66.3 Q ss_pred eeccccCCc--hhhHHHH--HhhhhHHHHHHHHHHHHHHHHhhhhccccccccccceEEEeecCCCCCeeEEEEEecccc Q lcl|NC_021296. 12 LTDIQVPNP--NRGLAQI--LLSPNMELLMGIIGQEVVLAYRAGVAKRTGKLMSSASSETMIGGKKNDRWVSHVTIGGET 87 (143) Q Consensus 12 ~~d~k~~np--~rgl~ei--L~S~~m~~Lva~~~e~v~~~Y~a~VAkRTg~LArsvrvetfIGG~K~DRwVg~VtVG~e~ 87 (143) |-.+..+-- .+.|-++ .+...++..+++.+++++..=+..+-.+||.|.+|..+++-.+| .-+.|....+| T Consensus 1 Ma~~~~g~~~l~~~l~~~~~~~~~~~~~~~~~~a~~i~~~ak~~aPvdTG~Lr~SI~~~~~~~~-----~~~~V~~~~~Y 75 (137) T protein:vir:94 1 MAKVKYGNWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDSG-----FTGVINIGSEY 75 (137) T ss_pred CchhHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccccchhccceeEeecCc-----eEEEEecCCCc Confidence 433221111 0111111 11244556677778888888888999999999999998876554 45677777777 Q ss_pred cccCc--ccCCCcCC-------cchhhhhhhh-hhcCCCCCCcccccchhhHHHHHHHHhhh Q lcl|NC_021296. 88 AVSTW--HSPRNPNP-------GDLFFYGVLH-EHGDGGNPPSGWDFPAHKDLKKALAVVKA 139 (143) Q Consensus 88 aa~~~--Hspr~g~p-------gd~f~ygvlh-~~g~~~~~p~~~~f~ah~dl~~a~~~~~~ 139 (143) |.-.+ |.+....+ ..-||+.+-. .+-.-|.||..+=+||-.+.++-+.-.-+ T Consensus 76 A~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~~~~~l~ 137 (137) T protein:vir:94 76 AIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 137 (137) T ss_pred ccccccCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHHHHHHHHHHHHhhC Confidence 76555 22221111 2222222110 11123556666667776665555443333 No 33 >protein:vir:93738 Length: 137 # NCBI annotation: ORF041 # Family: family:all:180 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240463;genbank:gi:66396153;genbank:GeneID:5133507 Probab=72.37 E-value=0.18 Score=24.62 Aligned_cols=123 Identities=12% Similarity=0.029 Sum_probs=66.3 Q ss_pred eeccccCCc--hhhHHHH--HhhhhHHHHHHHHHHHHHHHHhhhhccccccccccceEEEeecCCCCCeeEEEEEecccc Q lcl|NC_021296. 12 LTDIQVPNP--NRGLAQI--LLSPNMELLMGIIGQEVVLAYRAGVAKRTGKLMSSASSETMIGGKKNDRWVSHVTIGGET 87 (143) Q Consensus 12 ~~d~k~~np--~rgl~ei--L~S~~m~~Lva~~~e~v~~~Y~a~VAkRTg~LArsvrvetfIGG~K~DRwVg~VtVG~e~ 87 (143) |-.+..+-- .+.|-++ .+...++..+++.+++++..=+..+-.+||.|.+|..+++-.+| .-+.|....+| T Consensus 1 Ma~~~~g~~~l~~~l~~~~~~~~~~~~~~~~~~a~~i~~~ak~~aPvdTG~Lr~SI~~~~~~~~-----~~~~V~~~~~Y 75 (137) T protein:vir:93 1 MAKVKYGNWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDSG-----FTGVINIGSEY 75 (137) T ss_pred CchhHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccccchhccceeEeecCc-----eEEEEecCCCc Confidence 433221111 0111111 11244556677778888888888999999999999998876554 45677777777 Q ss_pred cccCc--ccCCCcCC-------cchhhhhhhh-hhcCCCCCCcccccchhhHHHHHHHHhhh Q lcl|NC_021296. 88 AVSTW--HSPRNPNP-------GDLFFYGVLH-EHGDGGNPPSGWDFPAHKDLKKALAVVKA 139 (143) Q Consensus 88 aa~~~--Hspr~g~p-------gd~f~ygvlh-~~g~~~~~p~~~~f~ah~dl~~a~~~~~~ 139 (143) |.-.+ |.+....+ ..-||+.+-. .+-.-|.||..+=+||-.+.++-+.-.-+ T Consensus 76 A~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~~~~~l~ 137 (137) T protein:vir:93 76 AIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 137 (137) T ss_pred ccccccCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHHHHHHHHHHHHhhC Confidence 76555 22221111 2222222110 11123556666667776665555443333 No 34 >protein:vir:97427 Length: 137 # NCBI annotation: ORF043 # Family: family:all:180 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240753;genbank:gi:66396447;genbank:GeneID:5133783 Probab=72.37 E-value=0.18 Score=24.62 Aligned_cols=123 Identities=12% Similarity=0.029 Sum_probs=66.3 Q ss_pred eeccccCCc--hhhHHHH--HhhhhHHHHHHHHHHHHHHHHhhhhccccccccccceEEEeecCCCCCeeEEEEEecccc Q lcl|NC_021296. 12 LTDIQVPNP--NRGLAQI--LLSPNMELLMGIIGQEVVLAYRAGVAKRTGKLMSSASSETMIGGKKNDRWVSHVTIGGET 87 (143) Q Consensus 12 ~~d~k~~np--~rgl~ei--L~S~~m~~Lva~~~e~v~~~Y~a~VAkRTg~LArsvrvetfIGG~K~DRwVg~VtVG~e~ 87 (143) |-.+..+-- .+.|-++ .+...++..+++.+++++..=+..+-.+||.|.+|..+++-.+| .-+.|....+| T Consensus 1 Ma~~~~g~~~l~~~l~~~~~~~~~~~~~~~~~~a~~i~~~ak~~aPvdTG~Lr~SI~~~~~~~~-----~~~~V~~~~~Y 75 (137) T protein:vir:97 1 MAKVKYGNWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDSG-----FTGVINIGSEY 75 (137) T ss_pred CchhHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccccchhccceeEeecCc-----eEEEEecCCCc Confidence 433221111 0111111 11244556677778888888888999999999999998876554 45677777777 Q ss_pred cccCc--ccCCCcCC-------cchhhhhhhh-hhcCCCCCCcccccchhhHHHHHHHHhhh Q lcl|NC_021296. 88 AVSTW--HSPRNPNP-------GDLFFYGVLH-EHGDGGNPPSGWDFPAHKDLKKALAVVKA 139 (143) Q Consensus 88 aa~~~--Hspr~g~p-------gd~f~ygvlh-~~g~~~~~p~~~~f~ah~dl~~a~~~~~~ 139 (143) |.-.+ |.+....+ ..-||+.+-. .+-.-|.||..+=+||-.+.++-+.-.-+ T Consensus 76 A~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~~~~~l~ 137 (137) T protein:vir:97 76 AIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 137 (137) T ss_pred ccccccCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHHHHHHHHHHHHhhC Confidence 76555 22221111 2222222110 11123556666667776665555443333 No 35 >protein:vir:94796 Length: 137 # NCBI annotation: ORF050 # Family: family:all:180 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240540;genbank:gi:66396237;genbank:GeneID:5133576 Probab=72.02 E-value=0.19 Score=24.56 Aligned_cols=123 Identities=13% Similarity=0.049 Sum_probs=68.0 Q ss_pred eeccccCCch--hhHHHH--HhhhhHHHHHHHHHHHHHHHHhhhhccccccccccceEEEeecCCCCCeeEEEEEecccc Q lcl|NC_021296. 12 LTDIQVPNPN--RGLAQI--LLSPNMELLMGIIGQEVVLAYRAGVAKRTGKLMSSASSETMIGGKKNDRWVSHVTIGGET 87 (143) Q Consensus 12 ~~d~k~~np~--rgl~ei--L~S~~m~~Lva~~~e~v~~~Y~a~VAkRTg~LArsvrvetfIGG~K~DRwVg~VtVG~e~ 87 (143) |-.|+-+--. +.|-++ .+...++..+...+++++..=+..+-.+||.|.+|.++++-.+| -.+.|..+.+| T Consensus 1 Ma~~~~G~~~l~~~L~~~~~~~~~~~~~al~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~~~-----~~~~V~~~~~Y 75 (137) T protein:vir:94 1 MAKVKYGNWDLVKELENYERDIERWVKRGIAKTTVKIHNTIISLMPVDTGYLRESVTMDFKDGG-----FTGVINIGSEY 75 (137) T ss_pred CchhHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCcchhhcCceeEeecCc-----EEEEEecCCCc Confidence 5444321110 111111 12344566677778888888889999999999999998875544 44677777777 Q ss_pred cccCc--ccCCC-------cCCcchhhhhhhh-hhcCCCCCCcccccchhhHHHHHHHHhhh Q lcl|NC_021296. 88 AVSTW--HSPRN-------PNPGDLFFYGVLH-EHGDGGNPPSGWDFPAHKDLKKALAVVKA 139 (143) Q Consensus 88 aa~~~--Hspr~-------g~pgd~f~ygvlh-~~g~~~~~p~~~~f~ah~dl~~a~~~~~~ 139 (143) |.-.+ |.+.. ..++.-||+-+.. .+-.-|.||..+=+||-.+.++-+.-.-+ T Consensus 76 A~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~~~~~l~ 137 (137) T protein:vir:94 76 AIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRVFFNKYFS 137 (137) T ss_pred ccccccCccccccCCCcccccccccceeccCCceeecCCcCCCcchHHHHHHHHHHHHHhhC Confidence 76544 22211 1223333332211 11223566766667776666655444333 No 36 >protein:vir:94654 Length: 142 # NCBI annotation: tail component protein # Family: family:all:1084 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579211;genbank:gi:93007447;genbank:GeneID:5076773 Probab=67.52 E-value=0.25 Score=23.88 Aligned_cols=128 Identities=16% Similarity=0.161 Sum_probs=64.3 Q ss_pred CCcccceeeeeccccCCchhhHHHHH--hhhhHHHHHHHHHHHHHHHHhhhhccccccccccceEEEeecCCCCCeeEEE Q lcl|NC_021296. 3 AAGTTIGHRLTDIQVPNPNRGLAQIL--LSPNMELLMGIIGQEVVLAYRAGVAKRTGKLMSSASSETMIGGKKNDRWVSH 80 (143) Q Consensus 3 a~g~~i~~~~~d~k~~np~rgl~eiL--~S~~m~~Lva~~~e~v~~~Y~a~VAkRTg~LArsvrvetfIGG~K~DRwVg~ 80 (143) -+ .+-++ |....-++.|-.++ +...++..+.+.++.++..=+..+-.+||.|.+|..+++-..| +...++ T Consensus 1 Ma----~~~~~-~~~~~l~~~l~~~~~~~~~~~~~~l~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~g---~~~~~~ 72 (142) T protein:vir:94 1 MA----GLNYR-VNSTEFQGALRAALDRLTGAAREATEAAANDMVNMAKGLCPVDTGRLRSSIQAVPSGGR---FSFSVT 72 (142) T ss_pred Cc----eeEEE-ecHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeccCC---ceEEEE Confidence 11 12221 11111222332222 2467888889999999888888888899999999988775443 334455 Q ss_pred EEecccccccCc--ccCC---CcCCcchhhhhhhhhhcC---CCCCCcccccchhhH----HHHHHHHhh Q lcl|NC_021296. 81 VTIGGETAVSTW--HSPR---NPNPGDLFFYGVLHEHGD---GGNPPSGWDFPAHKD----LKKALAVVK 138 (143) Q Consensus 81 VtVG~e~aa~~~--Hspr---~g~pgd~f~ygvlh~~g~---~~~~p~~~~f~ah~d----l~~a~~~~~ 138 (143) |.-+.+||.--+ |.|. .-....++|.+--|-... -|-||..+=.||-.+ +++=+..|| T Consensus 73 v~~~~~YA~~vE~Gt~~~~i~pk~~k~l~~~~~~~~~~~v~~pG~~~~pfl~~A~~~~~~~i~~~~~~~~ 142 (142) T protein:vir:94 73 IGTNVTYAADVEYGTAPHVIVPKDKKALYWPGAAHPVAKVNHPGTRAQPFMRPAIAAASTFLRNHAKGIR 142 (142) T ss_pred EecCcccchhhhccCCCceeccCCCccceecccceeeeeeeecCCCCCcchhHHHHHHHHHHHHHHHhcC Confidence 555556665444 2221 111222333222111110 134566665565433 222233344 No 37 >protein:vir:106623 Length: 115 # NCBI annotation: ORF049 # Family: family:all:180 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239497;genbank:gi:66395260;genbank:GeneID:4555777 Probab=66.49 E-value=0.26 Score=23.79 Aligned_cols=95 Identities=22% Similarity=0.196 Sum_probs=53.8 Q ss_pred cccCCchhhHHHHHh------hhhHHHHHHHHHHHHHHHHhhhh------ccccccccccceEEEeecCCCCCeeEEEEE Q lcl|NC_021296. 15 IQVPNPNRGLAQILL------SPNMELLMGIIGQEVVLAYRAGV------AKRTGKLMSSASSETMIGGKKNDRWVSHVT 82 (143) Q Consensus 15 ~k~~np~rgl~eiL~------S~~m~~Lva~~~e~v~~~Y~a~V------AkRTg~LArsvrvetfIGG~K~DRwVg~Vt 82 (143) |++--=. .|-+-|. ...++..|..-+++++..=+... .-+||.|.+|..++ +.+.--++|. T Consensus 1 i~i~Gld-~L~~~l~~~~~~~~~~~~~al~~~~~~i~~~a~~~a~~~~~~pv~TG~Lr~sI~~~------~~g~~~~~v~ 73 (115) T protein:vir:10 1 MQSKGLK-KLMNHLKVMHDDIEDDVDDILKNNAKEGVGIAVSNAKEVMNKGYWTGNLASLIEVK------KIGDLHYRVI 73 (115) T ss_pred CeehhHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCCcchhhhhceeee------ecCcEEEEee Confidence 4443221 1222221 12345566666666655544433 34799999998765 2222223332 Q ss_pred ecccccccCcccCCCcCCcchhhhhhhhhhcCCCCCCcccccchhh--------HHHHHHH Q lcl|NC_021296. 83 IGGETAVSTWHSPRNPNPGDLFFYGVLHEHGDGGNPPSGWDFPAHK--------DLKKALA 135 (143) Q Consensus 83 VG~e~aa~~~Hspr~g~pgd~f~ygvlh~~g~~~~~p~~~~f~ah~--------dl~~a~~ 135 (143) .+ =+|+.+.|||---.||...=+||-+ +|++++. T Consensus 74 ~~-------------------~~Ya~~vEfGT~km~a~PFl~PA~~~~k~~~~~~i~~~i~ 115 (115) T protein:vir:10 74 ST-------------------AHYSGFLEFGTRYMEPAPFMFPTYQTLKKSTINDLKRLLS 115 (115) T ss_pred CC-------------------CccchheecccccCCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 21 1499999999999999888888875 4444444 No 38 >protein:vir:4906 Length: 114 # NCBI annotation: gp114 # Family: family:all:180 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056684;genbank:gi:9635019;genbank:GeneID:1262668 Probab=64.13 E-value=0.21 Score=24.29 Aligned_cols=106 Identities=12% Similarity=0.062 Sum_probs=59.8 Q ss_pred eeccccCCch---hhHHHHH----hhhhHHHHHHHHHHHHHHHHhhhhccccccccccceEEEeecCCCCCeeEEEEEec Q lcl|NC_021296. 12 LTDIQVPNPN---RGLAQIL----LSPNMELLMGIIGQEVVLAYRAGVAKRTGKLMSSASSETMIGGKKNDRWVSHVTIG 84 (143) Q Consensus 12 ~~d~k~~np~---rgl~eiL----~S~~m~~Lva~~~e~v~~~Y~a~VAkRTg~LArsvrvetfIGG~K~DRwVg~VtVG 84 (143) |.+|+..-=. +.|.++. +...++.-.++++++++..-....-.+||.|.+|..+++--|| .+|| T Consensus 1 Ma~i~~~Gld~l~~~L~~~~~~~~v~~~~~~~~~~~~~~~~~~a~~~~p~~TG~Lr~sI~~~~~~~~---------~~V~ 71 (114) T protein:vir:49 1 MATIEFEGLDEMAQSLLKNASPEKRSKVLRKYGSKLKEAAVNRAQFNKGYSTGATRRSITLQVESDK---------ATVE 71 (114) T ss_pred CeeeeeehHHHHHHHHHHhcCHHHHHHHHHHHHHHHHHHHHHhcccCCCCCchhhhhceeeeecCCe---------eEec Confidence 7777764221 2232221 1222334444555555554444455699999999988754333 3453 Q ss_pred ccccccCcccCCCcCCcchhhhhhhhhhcCCCCCCcccccchhhHHHHHHH-HhhhccCC Q lcl|NC_021296. 85 GETAVSTWHSPRNPNPGDLFFYGVLHEHGDGGNPPSGWDFPAHKDLKKALA-VVKARNGA 143 (143) Q Consensus 85 ~e~aa~~~Hspr~g~pgd~f~ygvlh~~g~~~~~p~~~~f~ah~dl~~a~~-~~~~~~~~ 143 (143) .. -+|+.+.|||-..-||..+=+||-..-++.+. .++..--. T Consensus 72 ~~-----------------~~Ya~~vEfGT~km~a~Pfl~PA~~~~~~~~~~~l~~l~k~ 114 (114) T protein:vir:49 72 AL-----------------TSYSGYLEVGTRKMEAQPFMKPALDEVAPKMVEELAKWDET 114 (114) T ss_pred CC-----------------CCccceecccccccCCCCchhhhHHHHHHHHHHHHHHHhcC Confidence 21 24889999999999999988888866554322 11111111 No 39 >protein:vir:2740 Length: 114 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695113;genbank:gi:23455882;genbank:GeneID:955595 Probab=64.13 E-value=0.21 Score=24.29 Aligned_cols=106 Identities=12% Similarity=0.062 Sum_probs=59.8 Q ss_pred eeccccCCch---hhHHHHH----hhhhHHHHHHHHHHHHHHHHhhhhccccccccccceEEEeecCCCCCeeEEEEEec Q lcl|NC_021296. 12 LTDIQVPNPN---RGLAQIL----LSPNMELLMGIIGQEVVLAYRAGVAKRTGKLMSSASSETMIGGKKNDRWVSHVTIG 84 (143) Q Consensus 12 ~~d~k~~np~---rgl~eiL----~S~~m~~Lva~~~e~v~~~Y~a~VAkRTg~LArsvrvetfIGG~K~DRwVg~VtVG 84 (143) |.+|+..-=. +.|.++. +...++.-.++++++++..-....-.+||.|.+|..+++--|| .+|| T Consensus 1 Ma~i~~~Gld~l~~~L~~~~~~~~v~~~~~~~~~~~~~~~~~~a~~~~p~~TG~Lr~sI~~~~~~~~---------~~V~ 71 (114) T protein:vir:27 1 MATIEFEGLDEMAQSLLKNASPEKRSKVLRKYGSKLKEAAVNRAQFNKGYSTGATRRSITLQVESDK---------ATVE 71 (114) T ss_pred CeeeeeehHHHHHHHHHHhcCHHHHHHHHHHHHHHHHHHHHHhcccCCCCCchhhhhceeeeecCCe---------eEec Confidence 7777764221 2232221 1222334444555555554444455699999999988754333 3453 Q ss_pred ccccccCcccCCCcCCcchhhhhhhhhhcCCCCCCcccccchhhHHHHHHH-HhhhccCC Q lcl|NC_021296. 85 GETAVSTWHSPRNPNPGDLFFYGVLHEHGDGGNPPSGWDFPAHKDLKKALA-VVKARNGA 143 (143) Q Consensus 85 ~e~aa~~~Hspr~g~pgd~f~ygvlh~~g~~~~~p~~~~f~ah~dl~~a~~-~~~~~~~~ 143 (143) .. -+|+.+.|||-..-||..+=+||-..-++.+. .++..--. T Consensus 72 ~~-----------------~~Ya~~vEfGT~km~a~Pfl~PA~~~~~~~~~~~l~~l~k~ 114 (114) T protein:vir:27 72 AL-----------------TSYSGYLEVGTRKMEAQPFMKPALDEVAPKMVEELAKWDET 114 (114) T ss_pred CC-----------------CCccceecccccccCCCCchhhhHHHHHHHHHHHHHHHhcC Confidence 21 24889999999999999988888866554322 11111111 No 40 >protein:vir:97088 Length: 157 # NCBI annotation: hypothetical protein # Family: family:all:2714 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453568;genbank:gi:84662603;genbank:GeneID:5142503 Probab=62.97 E-value=0.33 Score=23.26 Aligned_cols=135 Identities=12% Similarity=0.068 Sum_probs=64.4 Q ss_pred ceeeeeccccCCchhhHHHHHh--hhhHHHHHHHHHHHHHHHHhhhhccccccccccceEEEeecCCCCCeeEEEEEecc Q lcl|NC_021296. 8 IGHRLTDIQVPNPNRGLAQILL--SPNMELLMGIIGQEVVLAYRAGVAKRTGKLMSSASSETMIGGKKNDRWVSHVTIGG 85 (143) Q Consensus 8 i~~~~~d~k~~np~rgl~eiL~--S~~m~~Lva~~~e~v~~~Y~a~VAkRTg~LArsvrvetfIGG~K~DRwVg~VtVG~ 85 (143) ..+-|+++..+.-...|-++-. .-.++.-+..=++.+...=++++=++||+|..+..+.+---=..+.+.+-.|.|.. T Consensus 1 m~~~~~~~d~s~l~~~l~~l~~~~~~v~R~A~~~ga~vv~dear~~aP~~tG~LkksI~~~~~~~~s~~g~~~~~Vg~~~ 80 (157) T protein:vir:97 1 MKFSIRSVDITGILAGLETVVEHSSDVVRTMTYESAVAVRESAKAFVNDETGKLRNNLYVAYSPEESVEGIQTYAVSWRK 80 (157) T ss_pred CeeEeecccHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhheeeeeccccCCCceEEEEEeecC Confidence 6666766666554444433311 11234445555666677778899999999999987755321001223333344433 Q ss_pred ccccc---Cc--ccC---CCcCCcchhhhhhhhhhcCC-CCCCcccccchhhHHHHHHH-Hh---------hhccCC Q lcl|NC_021296. 86 ETAVS---TW--HSP---RNPNPGDLFFYGVLHEHGDG-GNPPSGWDFPAHKDLKKALA-VV---------KARNGA 143 (143) Q Consensus 86 e~aa~---~~--Hsp---r~g~pgd~f~ygvlh~~g~~-~~~p~~~~f~ah~dl~~a~~-~~---------~~~~~~ 143 (143) ..+-- .+ |+. -..+|-+.||+..+. +|-. .-||...-.||-..-++++. ++ ...+|- T Consensus 81 ~~a~~g~~vEfG~~~~~~~~~~~~~~~~~~~~~-~~t~~~~Pa~PFlRPA~d~~k~~a~~~~~~~l~k~I~e~l~g~ 156 (157) T protein:vir:97 81 KAAPHGHLLEFGHWQTHAAYRDKDGQWYSSKVK-LVNPKWIPAKPFLRPGYDSVAMQIPDIARAAGAKKYAELQRGD 156 (157) T ss_pred CccceeeeeecCcccccccccCCcccccccccc-cCCCCcCCCCcccchHHHHhHHHHHHHHHHHHHHHHHHHhcCC Confidence 21100 00 221 122444555555442 3321 23566666666544433322 22 223344 No 41 >protein:vir:9763 Length: 89 # NCBI annotation: hypothetical protein # Family: family:all:1171 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795525;genbank:gi:28876279;genbank:GeneID:1257820 Probab=62.09 E-value=0.042 Score=28.16 Aligned_cols=85 Identities=18% Similarity=0.237 Sum_probs=48.1 Q ss_pred cccce----eeeeccccCCchhhHHHHHhhhhHHHHHHHHHHHHHHHHhhhhccccccccccceEEEeecCCCCCeeEEE Q lcl|NC_021296. 5 GTTIG----HRLTDIQVPNPNRGLAQILLSPNMELLMGIIGQEVVLAYRAGVAKRTGKLMSSASSETMIGGKKNDRWVSH 80 (143) Q Consensus 5 g~~i~----~~~~d~k~~np~rgl~eiL~S~~m~~Lva~~~e~v~~~Y~a~VAkRTg~LArsvrvetfIGG~K~DRwVg~ 80 (143) =.||- -||..+|+-=-..|+.|||.|+.||.++...++.+. .|-|+. -++.+.+| .+|-.+- T Consensus 1 ~~~~~~~~~~~MskvkfklN~aGvr~llks~~iQa~l~~~A~~I~--------~rAg~g---Y~adv~~G---k~Ra~a~ 66 (89) T protein:vir:97 1 MNGIRKLWWKDMSKFKFKLNKAGVAELMKSSEMQQVLTTKATAIR--------ERCGDG---YAQDIHVG---KNRANAM 66 (89) T ss_pred CchhHHHHHHHhhcceeeeCHHHHHHHhcChhHHHHHHHHHHHHH--------Hhhccc---cccccccC---CcccceE Confidence 22332 133333433223599999999999999999988876 344432 33334443 4444443 Q ss_pred EEecccccccCcccCCCcCCcchhhhhhhhhhcCCCCCCcccccchhhHHHHHHH Q lcl|NC_021296. 81 VTIGGETAVSTWHSPRNPNPGDLFFYGVLHEHGDGGNPPSGWDFPAHKDLKKALA 135 (143) Q Consensus 81 VtVG~e~aa~~~Hspr~g~pgd~f~ygvlh~~g~~~~~p~~~~f~ah~dl~~a~~ 135 (143) |.-. +. -...+=..||-|-|||. T Consensus 67 V~t~------t~--------------------------~A~~~N~KHNTLLKAv~ 89 (89) T protein:vir:97 67 VSAK------TI--------------------------KAKKDNSKNNTLLKAVR 89 (89) T ss_pred eccC------Ch--------------------------HHHHhhhhhhhhhhhcC Confidence 3321 11 12233467888888877 No 42 >protein:vir:4347 Length: 164 # NCBI annotation: Orf14 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061510;genbank:gi:9635606;genbank:GeneID:1262873 Probab=61.34 E-value=0.36 Score=23.05 Aligned_cols=126 Identities=16% Similarity=0.041 Sum_probs=62.1 Q ss_pred eec---cccCCchhhHHHHHh-----hhh-----HHHHHHHHHHHHHHHHhhhhc-----cccccccccceEEEeecCCC Q lcl|NC_021296. 12 LTD---IQVPNPNRGLAQILL-----SPN-----MELLMGIIGQEVVLAYRAGVA-----KRTGKLMSSASSETMIGGKK 73 (143) Q Consensus 12 ~~d---~k~~np~rgl~eiL~-----S~~-----m~~Lva~~~e~v~~~Y~a~VA-----kRTg~LArsvrvetfIGG~K 73 (143) |.| +++- ||.|++. +.+ ++.-+..-++.|...=+.++- ..+++|..+..+..--...+ T Consensus 1 Ma~~~~~~i~----Gl~eL~~~l~~L~~~~~~k~~r~Al~~aa~~v~~~ak~~ap~~~~~~~~~~l~~~i~~~~~~~~~~ 76 (164) T protein:vir:43 1 MADTVEFSIT----GLDSLLGKLDSVTDDVKRRGGRAALRKAAMIVVQAAKQGAEKVDDPGTGRSISDNIALRWNGRLFK 76 (164) T ss_pred CCcceEEeee----cHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccCCCccchhhhhhhhhcccCccc Confidence 444 2222 4444432 122 233344444444444444332 13456776666544333333 Q ss_pred CCeeEEEEEecccccccCc-cc-CCCcCCcchhhhhhhhhhcCCCCCCcccccchhhHHHHHH----------------- Q lcl|NC_021296. 74 NDRWVSHVTIGGETAVSTW-HS-PRNPNPGDLFFYGVLHEHGDGGNPPSGWDFPAHKDLKKAL----------------- 134 (143) Q Consensus 74 ~DRwVg~VtVG~e~aa~~~-Hs-pr~g~pgd~f~ygvlh~~g~~~~~p~~~~f~ah~dl~~a~----------------- 134 (143) ..-++. ..||........ ++ .....++...||.-+.|||.-..||..+-.||-..=++.+ T Consensus 77 ~~~~~~-~~vg~~~~~~~~~~~~~~~~~~~~~~~y~~f~EfGT~km~a~PFlrPA~~~~k~~~~~~~~~~l~~~i~ka~~ 155 (164) T protein:vir:43 77 RTGDLG-FRIGVLHGAVLPKKGERSDKTANAPTPHWRLLEFGTEDMRAQPFMRSALADNIAEVTSTFVSEYEKGIDRAIK 155 (164) T ss_pred ccccee-EEecccccccccccccccccCCCCCcceEEEeecCCCCCCCCcchhhhHHHhHHHHHHHHHHHHHHHHHHHHH Confidence 333332 234333333322 33 2334455667899999999999999999888865433322 Q ss_pred -HHhhhccC Q lcl|NC_021296. 135 -AVVKARNG 142 (143) Q Consensus 135 -~~~~~~~~ 142 (143) ++-|++.| T Consensus 156 k~~~~~~~~ 164 (164) T protein:vir:43 156 RAAKKAAQG 164 (164) T ss_pred HHHhhhccC Confidence 12233333 No 43 >protein:vir:194 Length: 149 # NCBI annotation: Gp10 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037704;genbank:gi:9634169;genbank:GeneID:1262536 Probab=58.08 E-value=0.42 Score=22.65 Aligned_cols=124 Identities=19% Similarity=0.155 Sum_probs=65.4 Q ss_pred eeccccCCchhhHHHHH-----hhhh-----HHHHHHHHHHHHHHHHhhhhccccccccccceEEEeecCCCCCeeEEEE Q lcl|NC_021296. 12 LTDIQVPNPNRGLAQIL-----LSPN-----MELLMGIIGQEVVLAYRAGVAKRTGKLMSSASSETMIGGKKNDRWVSHV 81 (143) Q Consensus 12 ~~d~k~~np~rgl~eiL-----~S~~-----m~~Lva~~~e~v~~~Y~a~VAkRTg~LArsvrvetfIGG~K~DRwVg~V 81 (143) |-|+++.=- ||-+++ ++.+ ++.-+..-++.+....+..+-..||.|..+..++..---...+ ....| T Consensus 1 mm~~~~~i~--Gl~~l~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~si~~~~~~~~~~~~-~~~~v 77 (149) T protein:vir:19 1 MIETSLDFS--GLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIDRAPVRTGKLKKNVVVVTQKSRRRGE-ISSGV 77 (149) T ss_pred Ccceeeehh--hHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCchhhhhhccccccccccccc-eeecc Confidence 656655544 655553 1222 2445556677888888888888899999888764331000000 00111 Q ss_pred EecccccccCcc---cCCCcCCcchhhhhhhhhhcCCCCCCcccccchhhHH------------HHHHHHhhhc Q lcl|NC_021296. 82 TIGGETAVSTWH---SPRNPNPGDLFFYGVLHEHGDGGNPPSGWDFPAHKDL------------KKALAVVKAR 140 (143) Q Consensus 82 tVG~e~aa~~~H---spr~g~pgd~f~ygvlh~~g~~~~~p~~~~f~ah~dl------------~~a~~~~~~~ 140 (143) .|. ....... .-.........||.-+.|||.-.-||..|=+||-..= +++|.-+-.+ T Consensus 78 ~~~--~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PF~~pA~~~~k~~~~~~~~~~l~~~l~k~~~k 149 (149) T protein:vir:19 78 HIR--GVNPRTGNSDNTMKANNPRNAFYWRFVELGTANMPAHPFVRPAYDTREEEAASVAIARMNQAIDEVLSK 149 (149) T ss_pred ccc--ccccccccccceeecCCCCccceeeeeccCCCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHhcC Confidence 110 0001110 0011113345688889999999999999988886533 2222222222 No 44 >protein:vir:93617 Length: 148 # NCBI annotation: putative structural component # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449299;genbank:gi:157166047;interpro:IPR010064;interpro:IPR011693;uniprot:Q6H9U2;genbank:GeneID:5580439 Probab=57.02 E-value=0.44 Score=22.52 Aligned_cols=114 Identities=21% Similarity=0.157 Sum_probs=62.4 Q ss_pred eeccccCCchhhHHHHHh-----hhh-----HHHHHHHHHHHHHHHHhhhhccccccccccceEEEe------------e Q lcl|NC_021296. 12 LTDIQVPNPNRGLAQILL-----SPN-----MELLMGIIGQEVVLAYRAGVAKRTGKLMSSASSETM------------I 69 (143) Q Consensus 12 ~~d~k~~np~rgl~eiL~-----S~~-----m~~Lva~~~e~v~~~Y~a~VAkRTg~LArsvrvetf------------I 69 (143) |-|+.+.=- ||-+++. +.+ .+.-+..-++.|...=+.++-.+||.|..+..++.. + T Consensus 1 mm~~~~~i~--Gldel~~~l~~L~~~~~~~~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~g~~~~~v~~ 78 (148) T protein:vir:93 1 MIETLLDFS--GLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRRSRDGGMESGVHI 78 (148) T ss_pred Ccceeeeeh--hHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcchhhhhceeccccccCCceeeeeee Confidence 555555433 4444431 222 233444456666666666777788887777655432 2 Q ss_pred cCCCCCeeEEEEEecccccccCcccCCCcCCcchhhhhhhhhhcCCCCCCcccccchhhHHHH------------HHHHh Q lcl|NC_021296. 70 GGKKNDRWVSHVTIGGETAVSTWHSPRNPNPGDLFFYGVLHEHGDGGNPPSGWDFPAHKDLKK------------ALAVV 137 (143) Q Consensus 70 GG~K~DRwVg~VtVG~e~aa~~~Hspr~g~pgd~f~ygvlh~~g~~~~~p~~~~f~ah~dl~~------------a~~~~ 137 (143) .+.+.+..-....|+ .......||+.+.|||.-..||..+=+||-..-++ +|.-+ T Consensus 79 ~~~~~~~~~~~~~~~-------------~~~~~~~~y~~f~E~GT~~~pa~PFl~pA~~~~k~~~~~~~~~~~~~~i~k~ 145 (148) T protein:vir:93 79 RGVNPDTGNSDNTMK-------------ADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMNRAIDEV 145 (148) T ss_pred cccccccccccceee-------------cCCCCCcceeeeeccCCCCCCCCcchhHHHHHhHHHHHHHHHHHHHHHHHHH Confidence 222222222222221 11334567899999999999999999888755433 33322 Q ss_pred hhc Q lcl|NC_021296. 138 KAR 140 (143) Q Consensus 138 ~~~ 140 (143) -++ T Consensus 146 ~~k 148 (148) T protein:vir:93 146 LRR 148 (148) T ss_pred hcC Confidence 233 No 45 >protein:vir:1437 Length: 140 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536366;genbank:gi:17975171;genbank:GeneID:929147 Probab=56.37 E-value=0.46 Score=22.44 Aligned_cols=119 Identities=20% Similarity=0.211 Sum_probs=67.4 Q ss_pred eeccccCCchhhHHHHH---------hhhh-HHHHHHHHHHHHHHHHhhhhccccccccccceEEEeecCCCCCeeEEEE Q lcl|NC_021296. 12 LTDIQVPNPNRGLAQIL---------LSPN-MELLMGIIGQEVVLAYRAGVAKRTGKLMSSASSETMIGGKKNDRWVSHV 81 (143) Q Consensus 12 ~~d~k~~np~rgl~eiL---------~S~~-m~~Lva~~~e~v~~~Y~a~VAkRTg~LArsvrvetfIGG~K~DRwVg~V 81 (143) |-+++.- ||.+++ .+.+ ++.-+..-++.+...-+..+-..||.|..|..+...- ..+. +..+ T Consensus 1 M~~~~i~----Gld~l~~~l~~l~~~~~~~~~~~al~~~a~~v~~~ak~~aP~~tG~l~~sI~~~~~~-~~~~---~~~~ 72 (140) T protein:vir:14 1 MSSIQII----GLADLRADFEKLAKSQSAKALRRATLAGAKVIRDEARKRAPKKTGKLRRNIVSAALR-QKDA---PGLA 72 (140) T ss_pred Cceeeeh----hHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCChhhHHhhccccccc-cccc---ceeE Confidence 6666665 444433 1222 3556778888888888888888899999997764421 1111 1222 Q ss_pred EecccccccCcccCCCcCCcchhhhhhhhhhcCCCCCCcccccchhhHHHHHHH-Hh---------hhccCC Q lcl|NC_021296. 82 TIGGETAVSTWHSPRNPNPGDLFFYGVLHEHGDGGNPPSGWDFPAHKDLKKALA-VV---------KARNGA 143 (143) Q Consensus 82 tVG~e~aa~~~Hspr~g~pgd~f~ygvlh~~g~~~~~p~~~~f~ah~dl~~a~~-~~---------~~~~~~ 143 (143) .||...-.. ..-.++ ..+||.-+.|||.-.-||..+=+||-..-++.+. .+ |+-+|. T Consensus 73 ~vg~~~~~~----~~~~~~-~~~~y~~f~E~GT~~~~a~pFl~pa~~~~~~~~~~~~~~~~~~~l~k~~~~~ 139 (140) T protein:vir:14 73 TAGVRVRTK----GKADSP-NNAFYWRFDEFGTQHMKAQPFMRPAFDASIGEAEGAIRTELARAIDRVLGGR 139 (140) T ss_pred Eeeeeeccc----cccCCC-CccceeeeeccccCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHhhcc Confidence 222111100 111223 3466677789999999999998888755433221 11 233344 No 46 >protein:vir:96121 Length: 137 # NCBI annotation: ORF040 # Family: family:all:180 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240082;genbank:gi:66395767;genbank:GeneID:5133101 Probab=56.28 E-value=0.46 Score=22.43 Aligned_cols=123 Identities=16% Similarity=0.062 Sum_probs=66.3 Q ss_pred eeccccCCc--hhhHHH--HHhhhhHHHHHHHHHHHHHHHHhhhhccccccccccceEEEeecCCCCCeeEEEEEecccc Q lcl|NC_021296. 12 LTDIQVPNP--NRGLAQ--ILLSPNMELLMGIIGQEVVLAYRAGVAKRTGKLMSSASSETMIGGKKNDRWVSHVTIGGET 87 (143) Q Consensus 12 ~~d~k~~np--~rgl~e--iL~S~~m~~Lva~~~e~v~~~Y~a~VAkRTg~LArsvrvetfIGG~K~DRwVg~VtVG~e~ 87 (143) |--+.-+.- .+.|.+ =.+...++.-+.+.+++++..=+..+-.+||.|.+|..+++-.+|. .+.|..+.+| T Consensus 1 Ma~~~~G~~~l~~~l~~~~~~~~~~~~~~l~~~a~~~~~~ak~~~pvdTG~L~~Si~~~~~~~g~-----~~~V~~~~~Y 75 (137) T protein:vir:96 1 MAKVKYGNWDLVAELEDYRDEMEEWVKKGILKTTLAIYNTAVALAPVDLGFLKESIDFKVTDGGF-----SSVISVGAEY 75 (137) T ss_pred CchhHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCccchhcCceeEeecCce-----EEEEecCCCc Confidence 322110000 011211 1223455666777888888888888999999999999988866653 4566666677 Q ss_pred cccCc--ccCCCcCCcc-------hhhhhhh-hhhcCCCCCCcccccchhhHHHHHHHHhhh Q lcl|NC_021296. 88 AVSTW--HSPRNPNPGD-------LFFYGVL-HEHGDGGNPPSGWDFPAHKDLKKALAVVKA 139 (143) Q Consensus 88 aa~~~--Hspr~g~pgd-------~f~ygvl-h~~g~~~~~p~~~~f~ah~dl~~a~~~~~~ 139 (143) |.-.+ |.+-...|.. .||+.+. ..+-.-|.||..+=+||-.+-|+-+.-.-+ T Consensus 76 A~yvE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~pFl~pA~~~~~~~i~k~i~ 137 (137) T protein:vir:96 76 AIYVEFGTGIYATGPGGSRARKLPWTYKGDDGEWHTTYGQQAQPFWNPAIDEGRKVFNRYFS 137 (137) T ss_pred ccccccCccccccCCCccccccccceeeccCcceeecCCCCCCcchhHHHHHHHHHHHHhhC Confidence 76555 2222222211 2222211 111224567777777776665554443333 No 47 >protein:vir:105916 Length: 149 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004379;genbank:gi:122891834;genbank:GeneID:4712387 Probab=54.66 E-value=0.5 Score=22.24 Aligned_cols=125 Identities=13% Similarity=-0.003 Sum_probs=68.0 Q ss_pred CCCCcccceeeeeccccCCch-----hhHHHH---------HhhhhHHHHHHHHHHHHHHHHhhhhccccccccccceEE Q lcl|NC_021296. 1 MPAAGTTIGHRLTDIQVPNPN-----RGLAQI---------LLSPNMELLMGIIGQEVVLAYRAGVAKRTGKLMSSASSE 66 (143) Q Consensus 1 ~pa~g~~i~~~~~d~k~~np~-----rgl~ei---------L~S~~m~~Lva~~~e~v~~~Y~a~VAkRTg~LArsvrve 66 (143) |- --+||+ ++-| .||-++ .+...++.-+.+.+++++..=++.+-.+||.|.+|..++ T Consensus 1 ~~----~~~~~~-----~~~~Ma~v~~Gld~l~~~l~~~~~~~~~~~~~~l~~~a~~v~~~ak~~aPvdTG~L~~SI~~~ 71 (149) T protein:vir:10 1 MK----LNYYDL-----SRCHMAKVKYGADSMVVELDKFDKKIEEWVKKGIAKTTTKIYNTAVALAPVDLGFLEESIDFK 71 (149) T ss_pred Ce----eeeecc-----chhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhhccceEE Confidence 10 001221 1111 122221 223455666677788888888889999999999999988 Q ss_pred EeecCCCCCeeEEEEEecccccccCc--ccCCCcC------Ccchhhh-hhhh-hhcCCCCCCcccccchhhHHHHHHHH Q lcl|NC_021296. 67 TMIGGKKNDRWVSHVTIGGETAVSTW--HSPRNPN------PGDLFFY-GVLH-EHGDGGNPPSGWDFPAHKDLKKALAV 136 (143) Q Consensus 67 tfIGG~K~DRwVg~VtVG~e~aa~~~--Hspr~g~------pgd~f~y-gvlh-~~g~~~~~p~~~~f~ah~dl~~a~~~ 136 (143) +.- |...++|....+||.-.+ |.+-.+. ...+++| +..+ .+..-|.||..+=+||-++-++-+.- T Consensus 72 ~~~-----~g~~~~V~~~~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~k~~i~~ 146 (149) T protein:vir:10 72 YFD-----GGLSSVISVGADYAIYVEYGTGIYATGPGGSRATKIPWSFKGDDGEWYTTYGQAPQPFWNPAIDAGRKTFEQ 146 (149) T ss_pred ecC-----CcEEEEEecCCCcccccccCccccccCCcccccccccceeeccccceecCCCCCCCcchhHHHHHHHHHHHH Confidence 643 335577777777776665 2211111 1223333 2221 23345677777777776665554332 Q ss_pred hhh Q lcl|NC_021296. 137 VKA 139 (143) Q Consensus 137 ~~~ 139 (143) .-+ T Consensus 147 ~i~ 149 (149) T protein:vir:10 147 YFS 149 (149) T ss_pred hhC Confidence 222 No 48 >protein:vir:99744 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004311;genbank:gi:122891765;genbank:GeneID:4712299 Probab=51.99 E-value=0.57 Score=21.94 Aligned_cols=92 Identities=17% Similarity=0.201 Sum_probs=56.9 Q ss_pred cccCCchhhHHHHHh---------hhhHHHHHHHHHHHHHHHHhhhh------ccccccccccceEEEeecCCCCCeeEE Q lcl|NC_021296. 15 IQVPNPNRGLAQILL---------SPNMELLMGIIGQEVVLAYRAGV------AKRTGKLMSSASSETMIGGKKNDRWVS 79 (143) Q Consensus 15 ~k~~np~rgl~eiL~---------S~~m~~Lva~~~e~v~~~Y~a~V------AkRTg~LArsvrvetfIGG~K~DRwVg 79 (143) |++. ||-+++. ...+...|...+++++..-+... -.+||.|-+|..++. .|| . T Consensus 1 i~i~----Gld~L~~~l~~~~~~~~~~v~~av~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~SI~~~~-~g~-------~ 68 (115) T protein:vir:99 1 MNID----GLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKK-TVD-------L 68 (115) T ss_pred Ccch----hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCCcchhhhhceeeee-cCc-------E Confidence 4544 3333221 12356667777777776665543 568999999988763 222 1 Q ss_pred EEEecccccccCcccCCCcCCcchhhhhhhhhhcCCCCCCcccccchhhH--------HHHHHH Q lcl|NC_021296. 80 HVTIGGETAVSTWHSPRNPNPGDLFFYGVLHEHGDGGNPPSGWDFPAHKD--------LKKALA 135 (143) Q Consensus 80 ~VtVG~e~aa~~~Hspr~g~pgd~f~ygvlh~~g~~~~~p~~~~f~ah~d--------l~~a~~ 135 (143) +++|+.. =+|+..-|||--.-||...=+||-.. |++++. T Consensus 69 ~~~V~~~-----------------~~Ya~~vE~GT~~m~a~PFl~PA~~~~k~~~~~~l~~~~k 115 (115) T protein:vir:99 69 QYTITSH-----------------AAYSGFLEFGTRYMEAEPFMWPVYEVIRKSTVEELKTLFE 115 (115) T ss_pred EEEecCC-----------------ccccccccccccccCCCCcchhhHHHHHHHHHHHHHHHhC Confidence 2344321 14888999999999999888888754 443333 No 49 >protein:vir:80665 Length: 96 # NCBI annotation: gp9 # Family: family:all:30540 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285585;genbank:gi:148727091;genbank:GeneID:5247036 Probab=48.76 E-value=0.12 Score=25.56 Aligned_cols=74 Identities=27% Similarity=0.417 Sum_probs=43.1 Q ss_pred eeec--cccCCchhhHHHHHhhhhHHHHHHHHHHHHHHHHhhhhcc--------ccccccccceE----------EEeec Q lcl|NC_021296. 11 RLTD--IQVPNPNRGLAQILLSPNMELLMGIIGQEVVLAYRAGVAK--------RTGKLMSSASS----------ETMIG 70 (143) Q Consensus 11 ~~~d--~k~~np~rgl~eiL~S~~m~~Lva~~~e~v~~~Y~a~VAk--------RTg~LArsvrv----------etfIG 70 (143) .-.| ||..-| |+-|+|+|.-+++++++++|+|.++-.+-|-. |+| |++.+++ .|+-| T Consensus 1 maqdvnvklnlp--girevlkssgvqsmlaergervrraasanvggnafdraqyrsg-lssevqvhrveavarigttykg 77 (96) T protein:vir:80 1 MAQDVNVKLNLP--GIREVLKSSGVQSMLAERGERVRRAASANVGGNAFDRAQYRSG-LSSEVQVHRVEAVARIGTTYKG 77 (96) T ss_pred CCccceeeecch--hHHHHHhhcchhHHHHhhhhHhhhhhccccCcchhhhhhhhcc-ccchhhhhhhhhhhhhcccccc Confidence 2234 455567 99999999999999999999998876665543 333 3333332 12345 Q ss_pred CCCCCeeEEEE--Eecccc Q lcl|NC_021296. 71 GKKNDRWVSHV--TIGGET 87 (143) Q Consensus 71 G~K~DRwVg~V--tVG~e~ 87 (143) |++-.---|.+ ++|..+ T Consensus 78 gkrieakhgtlarsigaas 96 (96) T protein:vir:80 78 GKRIEAKHGTLARSIGAAS 96 (96) T ss_pred cceeccccchhhhhccccC Confidence 55433322322 111111 No 50 >protein:vir:1273 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690765;genbank:gi:22855005;genbank:GeneID:955232 Probab=45.77 E-value=0.52 Score=22.16 Aligned_cols=110 Identities=15% Similarity=0.218 Sum_probs=59.1 Q ss_pred eeccccCCchhhHHHHHh-----hhhHHHHHHH----HHHHHHHHHhhhhc---cccccccccceEEEeecCCCCCe-eE Q lcl|NC_021296. 12 LTDIQVPNPNRGLAQILL-----SPNMELLMGI----IGQEVVLAYRAGVA---KRTGKLMSSASSETMIGGKKNDR-WV 78 (143) Q Consensus 12 ~~d~k~~np~rgl~eiL~-----S~~m~~Lva~----~~e~v~~~Y~a~VA---kRTg~LArsvrvetfIGG~K~DR-wV 78 (143) |.++++. ||-|++. +.+++..+.. -++.+...=+..+- ++||.|..+..+ ...|.|+ =+ T Consensus 1 M~~~~i~----Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~~k~~ap~~~~~tg~l~~~I~~----~~~k~~~~g~ 72 (127) T protein:vir:12 1 MADMSFD----GIDDLTQYFEKIGGDIEKVEPVALKAGGEIIAERQRSHVNRSDKKQPHMQDNITV----SNVRESKDGV 72 (127) T ss_pred Ceeeeeh----hHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCChhHHHHhhhc----cccccccCce Confidence 8888876 4444432 3334433333 33333333333322 246788877754 2333332 24 Q ss_pred EEEEecccccccCcccCCCcCCcchhhhhhhhhhcCCCCCCcccccchhhHHHHHHHH-hhhc--cCC Q lcl|NC_021296. 79 SHVTIGGETAVSTWHSPRNPNPGDLFFYGVLHEHGDGGNPPSGWDFPAHKDLKKALAV-VKAR--NGA 143 (143) Q Consensus 79 g~VtVG~e~aa~~~Hspr~g~pgd~f~ygvlh~~g~~~~~p~~~~f~ah~dl~~a~~~-~~~~--~~~ 143 (143) ..|+|| | -.+..||.-+-|||.--.||..+=+||-+.-++.+.- ++.. ... T Consensus 73 ~~v~Vg-------~-------~~~~~~y~~f~E~GT~~~~a~Pf~~pa~~~~~~~~~~~~~~~~~~~l 126 (127) T protein:vir:12 73 RFVAVG-------P-------NKKVAYRGRFLEWGTSKMPPQPFIEKGGKEGEGPAVELMERILTAPI 126 (127) T ss_pred eEEEEe-------e-------CCCCcceeeeeccCccCCCCCccchHhHHHHHHHHHHHHHHHHHHhc Confidence 456665 2 1234688889999999999999999887655543321 1110 000 No 51 >protein:vir:94108 Length: 149 # NCBI annotation: ORF029 # Family: family:all:180 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240238;genbank:gi:66395914;genbank:GeneID:5133277 Probab=41.87 E-value=0.91 Score=20.81 Aligned_cols=125 Identities=14% Similarity=0.024 Sum_probs=68.7 Q ss_pred CCCCcccceeeeeccccCCch-----hhHHHH---------HhhhhHHHHHHHHHHHHHHHHhhhhccccccccccceEE Q lcl|NC_021296. 1 MPAAGTTIGHRLTDIQVPNPN-----RGLAQI---------LLSPNMELLMGIIGQEVVLAYRAGVAKRTGKLMSSASSE 66 (143) Q Consensus 1 ~pa~g~~i~~~~~d~k~~np~-----rgl~ei---------L~S~~m~~Lva~~~e~v~~~Y~a~VAkRTg~LArsvrve 66 (143) |- --+||+ ++-| .||-++ .+...++.-+.+.+++++..=+..+..+||.|.+|..++ T Consensus 1 ~~----~~~~~~-----~~~~Ma~~~~Gld~l~~~L~~~~~~~~~~~~~al~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~ 71 (149) T protein:vir:94 1 MK----LSYYDL-----SRCHMAKVKYGADSMVVELDKFDKKIEEWVKKGIAKTTTKIYNTAVALAPVDLGFLEESIDFK 71 (149) T ss_pred Ce----eeeeec-----chhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhhcCeeEE Confidence 10 001221 1221 122221 123456666778888888888999999999999999988 Q ss_pred EeecCCCCCeeEEEEEecccccccCc--ccCCCcCC------cchhhh-hhh-hhhcCCCCCCcccccchhhHHHHHHHH Q lcl|NC_021296. 67 TMIGGKKNDRWVSHVTIGGETAVSTW--HSPRNPNP------GDLFFY-GVL-HEHGDGGNPPSGWDFPAHKDLKKALAV 136 (143) Q Consensus 67 tfIGG~K~DRwVg~VtVG~e~aa~~~--Hspr~g~p------gd~f~y-gvl-h~~g~~~~~p~~~~f~ah~dl~~a~~~ 136 (143) +.- |...+.|.-+.+||.-.+ |.+-.+.| -.++|| +.. -.+..-|.||..+=+||-++-++-+.- T Consensus 72 ~~~-----~g~~~~V~~~~~YA~~VE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~~~~i~~ 146 (149) T protein:vir:94 72 YFD-----GGLSSVISVGADYAIYVEYGTGIYATGPGGSRATKIPWSFKGDDGEWYTTYGQAPQPFWNPAIDAGRKTFEQ 146 (149) T ss_pred eeC-----CcEEEEEecCCCcccccccCccccccCCCccccccccceeecCccceecCCCCCCCcchHHHHHHHHHHHHH Confidence 743 335577777777777666 22211111 122333 222 123345677777777776665554332 Q ss_pred hhhcc Q lcl|NC_021296. 137 VKARN 141 (143) Q Consensus 137 ~~~~~ 141 (143) .- | T Consensus 147 ~i--~ 149 (149) T protein:vir:94 147 YF--S 149 (149) T ss_pred hh--C Confidence 22 2 No 52 >protein:vir:3873 Length: 128 # NCBI annotation: putative head-tail joining protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680490;swissprot:trembl:p94214;genbank:gi:22296530;interpro:IPR010064;uniprot:P94214;genbank:GeneID:951688 Probab=41.81 E-value=0.78 Score=21.18 Aligned_cols=113 Identities=12% Similarity=0.138 Sum_probs=57.8 Q ss_pred cccCCchhhHHHHHh-----hhhHHHHHHHHHHHHHHHHhhhhcccc----cccccc--ceEEEeecCCCCCeeEEEEEe Q lcl|NC_021296. 15 IQVPNPNRGLAQILL-----SPNMELLMGIIGQEVVLAYRAGVAKRT----GKLMSS--ASSETMIGGKKNDRWVSHVTI 83 (143) Q Consensus 15 ~k~~np~rgl~eiL~-----S~~m~~Lva~~~e~v~~~Y~a~VAkRT----g~LArs--vrvetfIGG~K~DRwVg~VtV 83 (143) +.+.- -||.|++. ..++..........++..++..+.+++ |++..+ .+-.+-|++.+.++-++.++| T Consensus 1 m~v~i--~Gl~el~~~l~~l~~~~~k~~~~al~~ga~~~~~~~k~~ap~~~~~~~~~~h~~d~I~~~~~k~~~g~~~~~V 78 (128) T protein:vir:38 1 MGVKV--TGDAELLANLNKLQFGVAKEARAAVRDGAQKFADKLKSNTPEWDGETDMSGHLRDDIKLSSVRETSGLTEVDV 78 (128) T ss_pred Cccch--hhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCCCcccchhhhhhccccccccCceeEEEe Confidence 12221 26666542 223333333333333333433333332 332221 223344577777777888888 Q ss_pred cccccccCcccCCCcCCcchhhhhhhhhhcCCCCCCcccccchhhHHHHHHH-Hhhh--ccCC Q lcl|NC_021296. 84 GGETAVSTWHSPRNPNPGDLFFYGVLHEHGDGGNPPSGWDFPAHKDLKKALA-VVKA--RNGA 143 (143) Q Consensus 84 G~e~aa~~~Hspr~g~pgd~f~ygvlh~~g~~~~~p~~~~f~ah~dl~~a~~-~~~~--~~~~ 143 (143) |- ++ +..||.-+-|||.--.||..+-.||-+.-++.+. +++. +-+- T Consensus 79 G~-------------~k-~~~~y~~f~E~GT~k~~a~pF~~pa~~~~~~~~~~~~~~~l~k~i 127 (128) T protein:vir:38 79 GY-------------GK-DTGWRAHFPNSGTSMQDPQHFIEETQEIMRPVVIAAFLSHLKEGG 127 (128) T ss_pred ee-------------cC-CCceEEeeeccCccCCCCCcchhHHHHHhHHHHHHHHHHHHHhhc Confidence 73 12 2347899999999999999888887765543322 1111 1111 No 53 >protein:vir:105007 Length: 146 # NCBI annotation: conserved phage protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459972;genbank:gi:85701387;genbank:GeneID:3882148 Probab=41.79 E-value=0.87 Score=20.91 Aligned_cols=117 Identities=17% Similarity=0.226 Sum_probs=61.3 Q ss_pred eec---cccCCchhhHHHHH-----hhhh----HHHHHHHHHHHHHHHHhhhhccccccccccceEEEeecCCCCC---- Q lcl|NC_021296. 12 LTD---IQVPNPNRGLAQIL-----LSPN----MELLMGIIGQEVVLAYRAGVAKRTGKLMSSASSETMIGGKKND---- 75 (143) Q Consensus 12 ~~d---~k~~np~rgl~eiL-----~S~~----m~~Lva~~~e~v~~~Y~a~VAkRTg~LArsvrvetfIGG~K~D---- 75 (143) |-| +++- ||-+++ ++.+ ++..+..-++.++..=+..+=.++|.+.++.......||+-.| T Consensus 1 Ma~~~~~~i~----Gl~el~~~l~~L~~~~~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~~~~~~~~~i~~ 76 (146) T protein:vir:10 1 MADGIDLDLL----GFDRLVTELDQMGLRGEKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSEPWRTGQHGADQIKV 76 (146) T ss_pred CCCceeeeeh----hHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcccccccccccccccccccccccee Confidence 433 2333 444433 2223 4445555566666666666666677776654443333333222 Q ss_pred ------eeEEEEEecccccccCcccCCCcCCcchhhhhhhhhhcCCCCCCcccccchhhHHHHHHH-Hh----hhccCC Q lcl|NC_021296. 76 ------RWVSHVTIGGETAVSTWHSPRNPNPGDLFFYGVLHEHGDGGNPPSGWDFPAHKDLKKALA-VV----KARNGA 143 (143) Q Consensus 76 ------RwVg~VtVG~e~aa~~~Hspr~g~pgd~f~ygvlh~~g~~~~~p~~~~f~ah~dl~~a~~-~~----~~~~~~ 143 (143) .=.-.+.||. ... .++..||.-+-|||.-..||..+-+||-+.-++.+. ++ +..-.- T Consensus 77 ~~~~~~~g~~~~~vg~----------~~~-~~~~~~y~~f~E~GT~~~~a~PFl~pa~~~~k~~~~~~~~~~l~~~l~k 144 (146) T protein:vir:10 77 TKAKLEGGIKTVKIGL----------NKA-DRSPWFYLKFHEWGTSKMPAHPFIEPGFNASKAEAVRAMTDILKNEMRL 144 (146) T ss_pred ccccccccceeEEeee----------ccC-CCCCcceeeeeccCCCCCCCCcchhHHHHHhHHHHHHHHHHHHHHHHhh Confidence 1111222321 111 234567888889999999999999998877664432 11 111111 No 54 >protein:vir:102085 Length: 146 # NCBI annotation: head-tail joining protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512318;genbank:gi:89152487;genbank:GeneID:3953078 Probab=41.79 E-value=0.87 Score=20.91 Aligned_cols=117 Identities=17% Similarity=0.226 Sum_probs=61.3 Q ss_pred eec---cccCCchhhHHHHH-----hhhh----HHHHHHHHHHHHHHHHhhhhccccccccccceEEEeecCCCCC---- Q lcl|NC_021296. 12 LTD---IQVPNPNRGLAQIL-----LSPN----MELLMGIIGQEVVLAYRAGVAKRTGKLMSSASSETMIGGKKND---- 75 (143) Q Consensus 12 ~~d---~k~~np~rgl~eiL-----~S~~----m~~Lva~~~e~v~~~Y~a~VAkRTg~LArsvrvetfIGG~K~D---- 75 (143) |-| +++- ||-+++ ++.+ ++..+..-++.++..=+..+=.++|.+.++.......||+-.| T Consensus 1 Ma~~~~~~i~----Gl~el~~~l~~L~~~~~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~~~~~~~~~i~~ 76 (146) T protein:vir:10 1 MADGIDLDLL----GFDRLVTELDQMGLRGEKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSEPWRTGQHGADQIKV 76 (146) T ss_pred CCCceeeeeh----hHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcccccccccccccccccccccccee Confidence 433 2333 444433 2223 4445555566666666666666677776654443333333222 Q ss_pred ------eeEEEEEecccccccCcccCCCcCCcchhhhhhhhhhcCCCCCCcccccchhhHHHHHHH-Hh----hhccCC Q lcl|NC_021296. 76 ------RWVSHVTIGGETAVSTWHSPRNPNPGDLFFYGVLHEHGDGGNPPSGWDFPAHKDLKKALA-VV----KARNGA 143 (143) Q Consensus 76 ------RwVg~VtVG~e~aa~~~Hspr~g~pgd~f~ygvlh~~g~~~~~p~~~~f~ah~dl~~a~~-~~----~~~~~~ 143 (143) .=.-.+.||. ... .++..||.-+-|||.-..||..+-+||-+.-++.+. ++ +..-.- T Consensus 77 ~~~~~~~g~~~~~vg~----------~~~-~~~~~~y~~f~E~GT~~~~a~PFl~pa~~~~k~~~~~~~~~~l~~~l~k 144 (146) T protein:vir:10 77 TKAKLEGGIKTVKIGL----------NKA-DRSPWFYLKFHEWGTSKMPAHPFIEPGFNASKAEAVRAMTDILKNEMRL 144 (146) T ss_pred ccccccccceeEEeee----------ccC-CCCCcceeeeeccCCCCCCCCcchhHHHHHhHHHHHHHHHHHHHHHHhh Confidence 1111222321 111 234567888889999999999999998877664432 11 111111 No 55 >protein:vir:102875 Length: 146 # NCBI annotation: conserved phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338140;genbank:gi:77020200;genbank:GeneID:3703784 Probab=41.79 E-value=0.87 Score=20.91 Aligned_cols=117 Identities=17% Similarity=0.226 Sum_probs=61.3 Q ss_pred eec---cccCCchhhHHHHH-----hhhh----HHHHHHHHHHHHHHHHhhhhccccccccccceEEEeecCCCCC---- Q lcl|NC_021296. 12 LTD---IQVPNPNRGLAQIL-----LSPN----MELLMGIIGQEVVLAYRAGVAKRTGKLMSSASSETMIGGKKND---- 75 (143) Q Consensus 12 ~~d---~k~~np~rgl~eiL-----~S~~----m~~Lva~~~e~v~~~Y~a~VAkRTg~LArsvrvetfIGG~K~D---- 75 (143) |-| +++- ||-+++ ++.+ ++..+..-++.++..=+..+=.++|.+.++.......||+-.| T Consensus 1 Ma~~~~~~i~----Gl~el~~~l~~L~~~~~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~~~~~~~~~i~~ 76 (146) T protein:vir:10 1 MADGIDLDLL----GFDRLVTELDQMGLRGEKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSEPWRTGQHGADQIKV 76 (146) T ss_pred CCCceeeeeh----hHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcccccccccccccccccccccccee Confidence 433 2333 444433 2223 4445555566666666666666677776654443333333222 Q ss_pred ------eeEEEEEecccccccCcccCCCcCCcchhhhhhhhhhcCCCCCCcccccchhhHHHHHHH-Hh----hhccCC Q lcl|NC_021296. 76 ------RWVSHVTIGGETAVSTWHSPRNPNPGDLFFYGVLHEHGDGGNPPSGWDFPAHKDLKKALA-VV----KARNGA 143 (143) Q Consensus 76 ------RwVg~VtVG~e~aa~~~Hspr~g~pgd~f~ygvlh~~g~~~~~p~~~~f~ah~dl~~a~~-~~----~~~~~~ 143 (143) .=.-.+.||. ... .++..||.-+-|||.-..||..+-+||-+.-++.+. ++ +..-.- T Consensus 77 ~~~~~~~g~~~~~vg~----------~~~-~~~~~~y~~f~E~GT~~~~a~PFl~pa~~~~k~~~~~~~~~~l~~~l~k 144 (146) T protein:vir:10 77 TKAKLEGGIKTVKIGL----------NKA-DRSPWFYLKFHEWGTSKMPAHPFIEPGFNASKAEAVRAMTDILKNEMRL 144 (146) T ss_pred ccccccccceeEEeee----------ccC-CCCCcceeeeeccCCCCCCCCcchhHHHHHhHHHHHHHHHHHHHHHHhh Confidence 1111222321 111 234567888889999999999999998877664432 11 111111 No 56 >protein:vir:107568 Length: 146 # NCBI annotation: conserved phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338191;genbank:gi:77020147;genbank:GeneID:3703699 Probab=41.79 E-value=0.87 Score=20.91 Aligned_cols=117 Identities=17% Similarity=0.226 Sum_probs=61.3 Q ss_pred eec---cccCCchhhHHHHH-----hhhh----HHHHHHHHHHHHHHHHhhhhccccccccccceEEEeecCCCCC---- Q lcl|NC_021296. 12 LTD---IQVPNPNRGLAQIL-----LSPN----MELLMGIIGQEVVLAYRAGVAKRTGKLMSSASSETMIGGKKND---- 75 (143) Q Consensus 12 ~~d---~k~~np~rgl~eiL-----~S~~----m~~Lva~~~e~v~~~Y~a~VAkRTg~LArsvrvetfIGG~K~D---- 75 (143) |-| +++- ||-+++ ++.+ ++..+..-++.++..=+..+=.++|.+.++.......||+-.| T Consensus 1 Ma~~~~~~i~----Gl~el~~~l~~L~~~~~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~~~~~~~~~i~~ 76 (146) T protein:vir:10 1 MADGIDLDLL----GFDRLVTELDQMGLRGEKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSEPWRTGQHGADQIKV 76 (146) T ss_pred CCCceeeeeh----hHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcccccccccccccccccccccccee Confidence 433 2333 444433 2223 4445555566666666666666677776654443333333222 Q ss_pred ------eeEEEEEecccccccCcccCCCcCCcchhhhhhhhhhcCCCCCCcccccchhhHHHHHHH-Hh----hhccCC Q lcl|NC_021296. 76 ------RWVSHVTIGGETAVSTWHSPRNPNPGDLFFYGVLHEHGDGGNPPSGWDFPAHKDLKKALA-VV----KARNGA 143 (143) Q Consensus 76 ------RwVg~VtVG~e~aa~~~Hspr~g~pgd~f~ygvlh~~g~~~~~p~~~~f~ah~dl~~a~~-~~----~~~~~~ 143 (143) .=.-.+.||. ... .++..||.-+-|||.-..||..+-+||-+.-++.+. ++ +..-.- T Consensus 77 ~~~~~~~g~~~~~vg~----------~~~-~~~~~~y~~f~E~GT~~~~a~PFl~pa~~~~k~~~~~~~~~~l~~~l~k 144 (146) T protein:vir:10 77 TKAKLEGGIKTVKIGL----------NKA-DRSPWFYLKFHEWGTSKMPAHPFIEPGFNASKAEAVRAMTDILKNEMRL 144 (146) T ss_pred ccccccccceeEEeee----------ccC-CCCCcceeeeeccCCCCCCCCcchhHHHHHhHHHHHHHHHHHHHHHHhh Confidence 1111222321 111 234567888889999999999999998877664432 11 111111 No 57 >protein:vir:97982 Length: 140 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1482 # MgeName: Orion # Cross-refs: genbank:acc:YP_655121;genbank:gi:109391871;genbank:GeneID:4157345 Probab=38.29 E-value=1.1 Score=20.41 Aligned_cols=132 Identities=11% Similarity=0.020 Sum_probs=72.6 Q ss_pred ceeeeeccccCCchhhHHHHHhhhhHHHHHHHHHHHHHHHHhhhhccccccccccceEEEeecCCCCCeeEEEEEecccc Q lcl|NC_021296. 8 IGHRLTDIQVPNPNRGLAQILLSPNMELLMGIIGQEVVLAYRAGVAKRTGKLMSSASSETMIGGKKNDRWVSHVTIGGET 87 (143) Q Consensus 8 i~~~~~d~k~~np~rgl~eiL~S~~m~~Lva~~~e~v~~~Y~a~VAkRTg~LArsvrvetfIGG~K~DRwVg~VtVG~e~ 87 (143) ..-|---++..- ++...+=+.-+.++..+.+.+.+++..=++.+--+||.|-+|.+.+...+|.. ..++.|....+| T Consensus 1 ~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~v~~~ak~~aPvdtG~Lr~SI~~~~~~~~~~--~~~~~v~~~a~Y 77 (140) T protein:vir:97 1 MATIRARARIEI-DEAALERESGEHLRAFHRSLTRRIANQSRVAVPVRTGNLGRTIGELPQVYTPF--RVRGGVEATADY 77 (140) T ss_pred Ceeeeeeeeeee-CHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccchhhhccceeeeeeCCCc--eEEEEecCCccc Confidence 222221222221 12344445668999999999999988888888889999999999887765432 356666666677 Q ss_pred cccCcccCCCcCCcchhhhhhhhhhcCCCCCCccc----ccchhhHHHHHHHHhhhccCC Q lcl|NC_021296. 88 AVSTWHSPRNPNPGDLFFYGVLHEHGDGGNPPSGW----DFPAHKDLKKALAVVKARNGA 143 (143) Q Consensus 88 aa~~~Hspr~g~pgd~f~ygvlh~~g~~~~~p~~~----~f~ah~dl~~a~~~~~~~~~~ 143 (143) |.-.++- -.++.-.|.....|+-.-+|..-+.++ --+++.=|+.|+..+.++.-- T Consensus 78 A~~Ve~G-T~ph~I~pk~~k~L~~~~~G~~~~~k~V~hpG~~a~Pfl~~A~~~~~~~~~~ 136 (140) T protein:vir:97 78 AAPVHEG-SRPHAIRARNAQYLHFWWHGREMFRKSVWHPGTRARPFMRNSAQRVVTNDPR 136 (140) T ss_pred hhhhccC-CCCceeecCCCccceeecCCCEEEeeeeecCCCCCChhHHHHHHHHhhhhhh Confidence 7665522 222222222233332222222111110 113566688888765332222 No 58 >protein:vir:107545 Length: 140 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1481 # MgeName: PG1 # Cross-refs: genbank:acc:NP_943803;genbank:gi:38638428;genbank:GeneID:2657225 Probab=38.29 E-value=1.1 Score=20.41 Aligned_cols=132 Identities=11% Similarity=0.020 Sum_probs=72.6 Q ss_pred ceeeeeccccCCchhhHHHHHhhhhHHHHHHHHHHHHHHHHhhhhccccccccccceEEEeecCCCCCeeEEEEEecccc Q lcl|NC_021296. 8 IGHRLTDIQVPNPNRGLAQILLSPNMELLMGIIGQEVVLAYRAGVAKRTGKLMSSASSETMIGGKKNDRWVSHVTIGGET 87 (143) Q Consensus 8 i~~~~~d~k~~np~rgl~eiL~S~~m~~Lva~~~e~v~~~Y~a~VAkRTg~LArsvrvetfIGG~K~DRwVg~VtVG~e~ 87 (143) ..-|---++..- ++...+=+.-+.++..+.+.+.+++..=++.+--+||.|-+|.+.+...+|.. ..++.|....+| T Consensus 1 ~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~v~~~ak~~aPvdtG~Lr~SI~~~~~~~~~~--~~~~~v~~~a~Y 77 (140) T protein:vir:10 1 MATIRARARIEI-DEAALERESGEHLRAFHRSLTRRIANQSRVAVPVRTGNLGRTIGELPQVYTPF--RVRGGVEATADY 77 (140) T ss_pred Ceeeeeeeeeee-CHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccchhhhccceeeeeeCCCc--eEEEEecCCccc Confidence 222221222221 12344445668999999999999988888888889999999999887765432 356666666677 Q ss_pred cccCcccCCCcCCcchhhhhhhhhhcCCCCCCccc----ccchhhHHHHHHHHhhhccCC Q lcl|NC_021296. 88 AVSTWHSPRNPNPGDLFFYGVLHEHGDGGNPPSGW----DFPAHKDLKKALAVVKARNGA 143 (143) Q Consensus 88 aa~~~Hspr~g~pgd~f~ygvlh~~g~~~~~p~~~----~f~ah~dl~~a~~~~~~~~~~ 143 (143) |.-.++- -.++.-.|.....|+-.-+|..-+.++ --+++.=|+.|+..+.++.-- T Consensus 78 A~~Ve~G-T~ph~I~pk~~k~L~~~~~G~~~~~k~V~hpG~~a~Pfl~~A~~~~~~~~~~ 136 (140) T protein:vir:10 78 AAPVHEG-SRPHAIRARNAQYLHFWWHGREMFRKSVWHPGTRARPFMRNSAQRVVTNDPR 136 (140) T ss_pred hhhhccC-CCCceeecCCCccceeecCCCEEEeeeeecCCCCCChhHHHHHHHHhhhhhh Confidence 7665522 222222222233332222222111110 113566688888765332222 No 59 >protein:vir:105089 Length: 133 # NCBI annotation: Gp11 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006591;genbank:gi:46402097;genbank:GeneID:2777955 Probab=38.25 E-value=1.1 Score=20.41 Aligned_cols=116 Identities=12% Similarity=0.027 Sum_probs=61.2 Q ss_pred eeccccCCch---hhHHHHHhhh---hHHHHHHHHHHHHHHHHhhhhccccccccccceEEEeec-CCCCCeeEEEEEec Q lcl|NC_021296. 12 LTDIQVPNPN---RGLAQILLSP---NMELLMGIIGQEVVLAYRAGVAKRTGKLMSSASSETMIG-GKKNDRWVSHVTIG 84 (143) Q Consensus 12 ~~d~k~~np~---rgl~eiL~S~---~m~~Lva~~~e~v~~~Y~a~VAkRTg~LArsvrvetfIG-G~K~DRwVg~VtVG 84 (143) |-.+++..-. +.|.++-... .++.-+..-++.+...=+..+-..+|.+-+..+..+-+. ..+.++--|.|.|. T Consensus 1 M~~~~i~Gl~el~~~l~~L~~~~~~k~~~~Al~~~a~~i~~~ak~~ap~~~~~~~~~~~~~I~v~~~~~~~~~~~~~~v~ 80 (133) T protein:vir:10 1 MIRMEVKGLDELERQLTALGEKVATKVLRDAGREALKVVEEDMKQHAGFDETSTGQHMRDSIKIRSSTRKAQGNAVVTLR 80 (133) T ss_pred CeeEeeehHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCcchhhhhhcccccccccccCccceEEEE Confidence 7677766332 2233221111 113345555565666666666555566544444444332 23333334444432 Q ss_pred ccccccCcccCCCcCCc-chhhhhhhhhhcCCCCCCcccccchhhHHHHHHH-H--------hhhc Q lcl|NC_021296. 85 GETAVSTWHSPRNPNPG-DLFFYGVLHEHGDGGNPPSGWDFPAHKDLKKALA-V--------VKAR 140 (143) Q Consensus 85 ~e~aa~~~Hspr~g~pg-d~f~ygvlh~~g~~~~~p~~~~f~ah~dl~~a~~-~--------~~~~ 140 (143) .- |+ +.+||..+-|||.--.||..+-.||-+.-++.+. + |+.+ T Consensus 81 vg-------------~~~~~~~y~~f~E~GT~k~~a~PF~~pA~~~~~~~~~~~~~~~~~~~l~K~ 133 (133) T protein:vir:10 81 VG-------------PSKQHHMKVLAQEFGTVKQVADPFIRPALDYNVQTVLRVLTVEIRNGIQNR 133 (133) T ss_pred ec-------------CCCCccceEeeeccCCCCCCCCccchHHHHHhHHHHHHHHHHHHHHHhhcC Confidence 11 22 2357888899999999999999898775554332 2 2222 No 60 >protein:vir:9930 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795692;genbank:gi:28876456;genbank:GeneID:1257995 Probab=37.26 E-value=1.1 Score=20.30 Aligned_cols=94 Identities=18% Similarity=0.162 Sum_probs=59.7 Q ss_pred hhhHHHHH---------hhhhHHHHHHHHHHHHHHHHhhhhccccccccccceEEEeecCCCCCeeEEEEEecccccccC Q lcl|NC_021296. 21 NRGLAQIL---------LSPNMELLMGIIGQEVVLAYRAGVAKRTGKLMSSASSETMIGGKKNDRWVSHVTIGGETAVST 91 (143) Q Consensus 21 ~rgl~eiL---------~S~~m~~Lva~~~e~v~~~Y~a~VAkRTg~LArsvrvetfIGG~K~DRwVg~VtVG~e~aa~~ 91 (143) =.||-++. ++..++.-+...++.++..=++.+=.+||.|.+|..++. .|+ ..++|+. T Consensus 1 i~Gld~l~~~l~~~~~~~~~~v~~al~~~a~~i~~~ak~~aPv~TG~Lr~sI~~~~-~~~-------~~~~v~~------ 66 (108) T protein:vir:99 1 MRGLDRFLRSVERKQKSVRIAVDKELSKSAARIERQAKILAPVDTGWLRAQIYSEQ-QRL-------LHYRVVS------ 66 (108) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCchhhhcceeeee-cCc-------EEEEeec------ Confidence 22333322 234566777888888888778888889999999988764 121 2344422 Q ss_pred cccCCCcCCcchhhhhhhhhhcCCCCCCcccccchhhHHHHH----HH-Hhhh Q lcl|NC_021296. 92 WHSPRNPNPGDLFFYGVLHEHGDGGNPPSGWDFPAHKDLKKA----LA-VVKA 139 (143) Q Consensus 92 ~Hspr~g~pgd~f~ygvlh~~g~~~~~p~~~~f~ah~dl~~a----~~-~~~~ 139 (143) + -.|+.+-|||-..-|+..+=+||-..-++. +. ++|. T Consensus 67 --------~---~~Ya~~vE~GT~~m~a~Pf~~pa~~~~~~~~~~~i~~~lrk 108 (108) T protein:vir:99 67 --------P---ALYSIYLELGTRKMEAQSFLDPALRKEWPVLMANIKKMFKR 108 (108) T ss_pred --------C---cccchhcccCccccCCCcchhhhHHHHHHHHHHHHHHHhcC Confidence 1 148899999998888888888876544322 11 2222 No 61 >protein:vir:106041 Length: 137 # NCBI annotation: gp23 # Family: family:all:1084 # MgeID: mge:1505 # MgeName: Cooper # Cross-refs: genbank:acc:YP_654920;genbank:gi:109392376;genbank:GeneID:4157069 Probab=33.71 E-value=1.3 Score=19.89 Aligned_cols=129 Identities=19% Similarity=0.098 Sum_probs=63.6 Q ss_pred CCCCcccceeeeeccccCCchhhHHHHHhhhhHHHHHHHHHHHHHHHHhhhhccccccccccceEEEeecCCCCCeeEEE Q lcl|NC_021296. 1 MPAAGTTIGHRLTDIQVPNPNRGLAQILLSPNMELLMGIIGQEVVLAYRAGVAKRTGKLMSSASSETMIGGKKNDRWVSH 80 (143) Q Consensus 1 ~pa~g~~i~~~~~d~k~~np~rgl~eiL~S~~m~~Lva~~~e~v~~~Y~a~VAkRTg~LArsvrvetfIGG~K~DRwVg~ 80 (143) || ....+ + -|+ ++| +=.+.+.++..+...++.++..=++.+-.+||.|.+|.+.+.-.+|.. ...+. T Consensus 1 m~-----~s~~i---~-i~~-~~l-~~~v~~~~k~~l~~~a~~i~~~ak~~aPv~tG~Lr~SI~~~~~~~~~~--~~~~~ 67 (137) T protein:vir:10 1 MP-----VTARI---H-INE-PEL-ERQTGAIFRGKHRSITRRIATQARADVPVRTGNLGRGIQEMPQTYRPF--HVGGG 67 (137) T ss_pred CC-----eeEEE---e-eCH-HHH-HHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhhcCceeeeeccccc--eEEEE Confidence 43 23333 1 133 122 223467788888899988888888888889999999999887655432 34455 Q ss_pred EEecccccccCcccCCCcCCcchhhhhhhhhhcCCCCCCcc--c--ccchhhHHHHHHHHhhhccCC Q lcl|NC_021296. 81 VTIGGETAVSTWHSPRNPNPGDLFFYGVLHEHGDGGNPPSG--W--DFPAHKDLKKALAVVKARNGA 143 (143) Q Consensus 81 VtVG~e~aa~~~Hspr~g~pgd~f~ygvlh~~g~~~~~p~~--~--~f~ah~dl~~a~~~~~~~~~~ 143 (143) |.-..+||.-.++ +-.+|.--|..-..|.-...|.--..+ | -.+++.=|+.|+.-+.++--- T Consensus 68 v~~~~~YA~~ve~-GT~ph~I~pk~~k~l~f~~~G~~v~~k~v~hpG~~a~Pfl~~A~~~~~~~~~r 133 (137) T protein:vir:10 68 VEDNVDYAAPVHE-GSRPHRITARHANALHFFWHGREVFRKSVWHPGVRPRPFLRNAARRVVAADPD 133 (137) T ss_pred EecCCCceeeeee-cCCCceeecccCceeeeeeCCceEEeeeeecCCCCCCchHHHHHHHHhhcccc Confidence 5444455443331 111111111111111100011100000 0 112566677777765433322 No 62 >protein:vir:5745 Length: 135 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892056;genbank:gi:33770519;interpro:IPR010064;interpro:IPR011693;uniprot:Q7Y404;genbank:GeneID:2637451 Probab=33.61 E-value=1.3 Score=19.88 Aligned_cols=115 Identities=10% Similarity=0.067 Sum_probs=59.5 Q ss_pred eeeccccCCchhhHHHHHh-----hhhH-----HHHHHHHHHHHHHHHhhhhccccc-cccccceEEEeecCCCCCeeEE Q lcl|NC_021296. 11 RLTDIQVPNPNRGLAQILL-----SPNM-----ELLMGIIGQEVVLAYRAGVAKRTG-KLMSSASSETMIGGKKNDRWVS 79 (143) Q Consensus 11 ~~~d~k~~np~rgl~eiL~-----S~~m-----~~Lva~~~e~v~~~Y~a~VAkRTg-~LArsvrvetfIGG~K~DRwVg 79 (143) -+.++++- ||.|++. +.++ +.-+..-++.+...-+..+ ++.+ +..+..+-.+-|...|.++-.+ T Consensus 1 M~~~~~i~----Gl~el~~~l~~L~~~~~~k~~~~Al~~~a~~v~~~~k~~a-p~~~~~~~g~l~~~I~i~~~k~~~~~~ 75 (135) T protein:vir:57 1 MIPEIEIS----GLQELERRLIAVGEEVGTKILRDAGRAAMAVVEADMKQNA-GYDNSSTNAHMRDSIKIRSSRGKAGST 75 (135) T ss_pred Cceeeeeh----hHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhC-CCCCCCchhhHHhhcccccccccccce Confidence 33334443 4554432 1222 3445555666666655554 3322 2333444445566667777666 Q ss_pred EEEecccccccCcccCCCcCCcch-hhhhhhhhhcCCCCCCcccccchhhHHHHHH-HHhhhcc-----CC Q lcl|NC_021296. 80 HVTIGGETAVSTWHSPRNPNPGDL-FFYGVLHEHGDGGNPPSGWDFPAHKDLKKAL-AVVKARN-----GA 143 (143) Q Consensus 80 ~VtVG~e~aa~~~Hspr~g~pgd~-f~ygvlh~~g~~~~~p~~~~f~ah~dl~~a~-~~~~~~~-----~~ 143 (143) .|+|+.- |... +||+++=|||.-..||..+-.||-+.-++.+ ..+...- -+ T Consensus 76 ~v~v~vg-------------~~~~~~~~~~f~E~GT~~~~a~PF~~pa~~~~~~~~~~~~~~~~~~~l~ka 133 (135) T protein:vir:57 76 VVVLRVG-------------PTRSHYMKALAQEFGTIKQVAKPFIRPALDYNKMQVLRILTVEIRDGLSTL 133 (135) T ss_pred eEEEEec-------------CCCCcceeEeecccCCCCCCCCcchhHhHHHhHHHHHHHHHHHHHHHHHHh Confidence 6666331 2223 3567777999999999999888866543322 1111100 00 No 63 >protein:vir:78163 Length: 92 # NCBI annotation: hypothetical protein # Family: family:all:29889 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294805;genbank:gi:149882826;genbank:GeneID:5309191 Probab=33.01 E-value=0.23 Score=24.07 Aligned_cols=79 Identities=24% Similarity=0.332 Sum_probs=35.8 Q ss_pred eeccccCCchhhHHHHHhhhhHHHHHHHHHHHHHHHHhhhhcc-ccc---------cccccceEEEeecCCCCCeeEEEE Q lcl|NC_021296. 12 LTDIQVPNPNRGLAQILLSPNMELLMGIIGQEVVLAYRAGVAK-RTG---------KLMSSASSETMIGGKKNDRWVSHV 81 (143) Q Consensus 12 ~~d~k~~np~rgl~eiL~S~~m~~Lva~~~e~v~~~Y~a~VAk-RTg---------~LArsvrvetfIGG~K~DRwVg~V 81 (143) |.|.-.||| .=|-|||.|+.++-|+.--+|+.. .|..--|+ -|| ..-|.-|-..|+=|..--- + T Consensus 1 madaftpNp-~~FDqIl~s~~VrALt~gaAe~aL-a~AKAsAPVDTGAYRDGL~iE~~q~~~RtT~MVVG~D~KT----l 74 (92) T protein:vir:78 1 MADAFTPNP-TWFDQIMRTPKVRALVDGVAEETL-ADAKASAPVDTGAYRDGLHIEHRQGRSRETAMVVGSDEKT----L 74 (92) T ss_pred CCCccCCCh-hHHHHhhcccchhhhhhhhhhhhh-hhhcccCcccccccccccchhhhhccccceeEEeecCcce----e Confidence 888889999 568899999999999988777653 33221111 111 1111112122222211111 1 Q ss_pred Eeccc---ccccCcccCCC Q lcl|NC_021296. 82 TIGGE---TAVSTWHSPRN 97 (143) Q Consensus 82 tVG~e---~aa~~~Hspr~ 97 (143) -|++. .+-++- ..|. T Consensus 75 LvESrTGNLakalk-~~rs 92 (92) T protein:vir:78 75 LIESRTGNLARSVK-RRRS 92 (92) T ss_pred eeecccchHHHHHh-hhcC Confidence 11111 011110 0111 No 64 >protein:vir:81147 Length: 126 # NCBI annotation: hypothetical protein # Family: family:all:970 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285816;genbank:gi:148747737;genbank:GeneID:5247190 Probab=31.92 E-value=1.5 Score=19.68 Aligned_cols=112 Identities=19% Similarity=0.175 Sum_probs=59.2 Q ss_pred eeccccCCchhhHHHHH------hhhhHHHHHHHHHHHHHHHHhhhhccccccccccceEEEeecCCCCCeeEEEEEecc Q lcl|NC_021296. 12 LTDIQVPNPNRGLAQIL------LSPNMELLMGIIGQEVVLAYRAGVAKRTGKLMSSASSETMIGGKKNDRWVSHVTIGG 85 (143) Q Consensus 12 ~~d~k~~np~rgl~eiL------~S~~m~~Lva~~~e~v~~~Y~a~VAkRTg~LArsvrvetfIGG~K~DRwVg~VtVG~ 85 (143) |..|.+...-..|.+-| ....|+..+.+.+++++..-++..-+|||+|+.|-++..-.- +--...|++.. T Consensus 1 Ma~i~id~la~~I~~~L~~y~~~v~~~v~~~v~~~a~~~~~~ik~~aP~rTG~y~ksw~vk~~~~----~g~~~~vv~~~ 76 (126) T protein:vir:81 1 MANITIDRLADELLQAVKEYTDDVAEGVRKKVDETARKVLKEAQALAPKRTGEYARTFTITKEDG----YGTTKRIIWNK 76 (126) T ss_pred CcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcccchhhcccccccccc----CCcceEEEecc Confidence 98999988765554433 467889999999999999999999999999999987664311 00011222222 Q ss_pred cccccCcccCCCcCCcchhhhhhhhhhcC----CC-CCCcccccchhhHHHHHHH-Hhh--hccCC Q lcl|NC_021296. 86 ETAVSTWHSPRNPNPGDLFFYGVLHEHGD----GG-NPPSGWDFPAHKDLKKALA-VVK--ARNGA 143 (143) Q Consensus 86 e~aa~~~Hspr~g~pgd~f~ygvlh~~g~----~~-~~p~~~~f~ah~dl~~a~~-~~~--~~~~~ 143 (143) ....- .| |=|||- || -|+-..--||-.-+++.+. .++ .+||- T Consensus 77 ~~~~l-------~H---------LLEfGha~r~gGrV~a~Phi~Pa~e~~~~~~~~~i~~~l~~gg 126 (126) T protein:vir:81 77 KHYRR-------VH---------LLEFGHAKVNGGRVKEYPHLRPAYDKHGARLPDELKRVIENGG 126 (126) T ss_pred CCCCc-------ee---------eeecceecCCCCccCCCcchHHHHHHHHHHHHHHHHHHhhcCC Confidence 11111 00 122221 11 2222222233222222211 111 13444 No 65 >protein:vir:94538 Length: 125 # NCBI annotation: putative head to tail joining # Family: family:all:180 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223893;genbank:gi:62327105;genbank:GeneID:5075554 Probab=31.22 E-value=1.5 Score=19.59 Aligned_cols=108 Identities=15% Similarity=0.123 Sum_probs=60.4 Q ss_pred CCCCcccceeeeeccccCCchhhHHHHHh---------hhhHHHHHHHHHHHHHHHHhhhhccccccccccceEEEeecC Q lcl|NC_021296. 1 MPAAGTTIGHRLTDIQVPNPNRGLAQILL---------SPNMELLMGIIGQEVVLAYRAGVAKRTGKLMSSASSETMIGG 71 (143) Q Consensus 1 ~pa~g~~i~~~~~d~k~~np~rgl~eiL~---------S~~m~~Lva~~~e~v~~~Y~a~VAkRTg~LArsvrvetfIGG 71 (143) |... + +|++- ||-+++. ...+..-+..-++.++..=+...-.+||.|.+|.+++-. - T Consensus 1 Ma~~---~-----~i~~~----Gld~l~~~L~~~~~~~~~~v~~al~~~a~~i~~~ak~~ap~~tG~L~~sI~~~~~--~ 66 (125) T protein:vir:94 1 MAND---F-----NIKFK----GVDKLLDEFDISRKELVPYSVEAMKTSLSRAVEKSKGLARVDTGYMRNNIQQDEV--K 66 (125) T ss_pred CCCc---e-----eeeeh----hHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCChhhhhhceecce--e Confidence 4332 1 23332 4443321 123333444556666666666677899999999876521 1 Q ss_pred CCCCeeEEEEEecccccccCcccCCCcCCcchhhhhhhhhhcCCCCCCcccccchh--------hHHHHHHHHhhhcc Q lcl|NC_021296. 72 KKNDRWVSHVTIGGETAVSTWHSPRNPNPGDLFFYGVLHEHGDGGNPPSGWDFPAH--------KDLKKALAVVKARN 141 (143) Q Consensus 72 ~K~DRwVg~VtVG~e~aa~~~Hspr~g~pgd~f~ygvlh~~g~~~~~p~~~~f~ah--------~dl~~a~~~~~~~~ 141 (143) .+++...+ +||.. =.|+.+.|||-...||..+=+||- ++|+++|.-.-.|. T Consensus 67 ~~~~~~~~--~v~~~-----------------~~Ya~~vEfGT~~~~a~Pfl~pa~~~~~~~~~~~l~~~l~~a~k~~ 125 (125) T protein:vir:94 67 EEHGVVTG--RYVAR-----------------ADYSSYNEYGTYRMSAQPFMAPSVAAMTPFFYKAVRDALNKAAKFS 125 (125) T ss_pred ccCCcEEE--EeeCC-----------------CCccceeecccccCCCCcccchhHHHHHHHHHHHHHHHHHHHhccC Confidence 12222222 33221 138899999999999999999984 44555554333333 No 66 >protein:vir:1891 Length: 179 # NCBI annotation: gp10 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037671;genbank:gi:9634129;genbank:GeneID:1262520 Probab=29.10 E-value=1.7 Score=19.34 Aligned_cols=129 Identities=17% Similarity=0.129 Sum_probs=55.6 Q ss_pred eec-cccCCchhhHHHHHh-----hhhH-----HHHHHHHHHHHHHHHhhhhccc------cccccccceE--------- Q lcl|NC_021296. 12 LTD-IQVPNPNRGLAQILL-----SPNM-----ELLMGIIGQEVVLAYRAGVAKR------TGKLMSSASS--------- 65 (143) Q Consensus 12 ~~d-~k~~np~rgl~eiL~-----S~~m-----~~Lva~~~e~v~~~Y~a~VAkR------Tg~LArsvrv--------- 65 (143) |-| |++.= .||.|++. +.++ +.-+..=++.|+..=+. -|++ ++.|..++-+ T Consensus 1 Ma~~~~~~i--~Gl~eL~~~l~~L~~~~~~k~~r~Al~~aa~~v~~~ak~-~ap~~~~~~~~~~l~~~i~~~~~~~~~~~ 77 (179) T protein:vir:18 1 MADSVEVSL--TGLESLLGKMEAVSEVTRNKAGRFALRKAANIIRDRARS-NASRVDDPLTKEAIHKNIVASFSSKQFRR 77 (179) T ss_pred CCceEEEEe--ecHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHH-hCCccccccchhhhhhheeeccccccccc Confidence 443 22211 15555431 1222 22333333333333333 3332 3333332211 Q ss_pred ----EEeecCCCCCeeEEEEEecccccccCc---ccCCCcCCcchhhhhhhhhhcCCCCCCcccccchhhHHH------- Q lcl|NC_021296. 66 ----ETMIGGKKNDRWVSHVTIGGETAVSTW---HSPRNPNPGDLFFYGVLHEHGDGGNPPSGWDFPAHKDLK------- 131 (143) Q Consensus 66 ----etfIGG~K~DRwVg~VtVG~e~aa~~~---Hspr~g~pgd~f~ygvlh~~g~~~~~p~~~~f~ah~dl~------- 131 (143) ...+|-..+......+.+......... =+++.++++...||.-+.|||.-..||..+-+||-..=+ T Consensus 78 ~g~~~~~vgv~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~y~~fvEfGT~kmpa~PFlrPA~~~~~~~a~~~i 157 (179) T protein:vir:18 78 TGDLAFRVGVMGGARQYANTKANVRKGRAGKTYKTSGDKGNPGGDTWYWRFLEFGTEHTSARPILRPAMNGVDNDVINVF 157 (179) T ss_pred ccceeEeeecccccccccccccccccCcccccccccccccCCCCccceeEEeccCCCCCCCCccchhhHHhhHHHHHHHH Confidence 122222222222222111111110000 125677787778899999999999999999999875332 Q ss_pred -----HHHHHhhhccCC Q lcl|NC_021296. 132 -----KALAVVKARNGA 143 (143) Q Consensus 132 -----~a~~~~~~~~~~ 143 (143) ++|.-+-.+++. T Consensus 158 ~~~l~~~i~k~lk~~~~ 174 (179) T protein:vir:18 158 STEMGKAIDRAIRLAMK 174 (179) T ss_pred HHHHHHHHHHHHHhhcc Confidence 333222222222 No 67 >protein:vir:79091 Length: 175 # NCBI annotation: gp5, phage virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111205;genbank:gi:134288802;genbank:GeneID:4960765 Probab=28.22 E-value=1.8 Score=19.23 Aligned_cols=109 Identities=17% Similarity=0.266 Sum_probs=46.8 Q ss_pred ceeeeeccccCCch--hhHHHHHhh-hhHHHHHHHHHHHHHHHHhhhhccc----------------------------- Q lcl|NC_021296. 8 IGHRLTDIQVPNPN--RGLAQILLS-PNMELLMGIIGQEVVLAYRAGVAKR----------------------------- 55 (143) Q Consensus 8 i~~~~~d~k~~np~--rgl~eiL~S-~~m~~Lva~~~e~v~~~Y~a~VAkR----------------------------- 55 (143) .+. |-+|++.+-. +.|.++... .+++.|++.+++.+...-+.+.... T Consensus 1 Ms~-~i~i~~d~~~~~~~L~~l~~~~~d~~~lm~~Ig~~l~~~t~~rF~~~~~PdW~pls~~t~~~r~~~~~~~~~~~~~ 79 (175) T protein:vir:79 1 MSD-FVNFQIDDSALRTRLLQLEQAGHQKADAMRKITQALVLVTEDNFAAQGRPRWQALSEATIHMRVGGKKAYKKNGEL 79 (175) T ss_pred Cce-EEEEEechHHHHHHHHHHHHHhcCHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCChHHHHhhccccccccccccc Confidence 222 2345655542 455544322 3678888888888876555444333 Q ss_pred ----------------cccccccceEEEeecCCCCCeeEEEEEecccccccCcccCCCcCCcchhhhhhhhhhcCCCCCC Q lcl|NC_021296. 56 ----------------TGKLMSSASSETMIGGKKNDRWVSHVTIGGETAVSTWHSPRNPNPGDLFFYGVLHEHGDGGNPP 119 (143) Q Consensus 56 ----------------Tg~LArsvrvetfIGG~K~DRwVg~VtVG~e~aa~~~Hspr~g~pgd~f~ygvlh~~g~~~~~p 119 (143) ||.|++|...++ + .| .+.||.. --|+-+|+||.---.. T Consensus 80 ~~~~~~~~~~~~~L~~tG~L~~Si~~~~--~---~~----~v~vGtn-----------------~~YAaiHqfGg~~~~~ 133 (175) T protein:vir:79 80 TAAASRRKAGLMILQDSGQMAASTATDS--G---ED----YSVIGSN-----------------KEYAAIQHFGGQAGRG 133 (175) T ss_pred hhhHhhhccCCCcceechhhhhhhhhee--c---CC----EEEEecC-----------------cchhhHhhcccccCCC Confidence 334444444332 1 11 3444432 2489999999521111 Q ss_pred cccccch------------hhHHHHHHHH-----hhhccCC Q lcl|NC_021296. 120 SGWDFPA------------HKDLKKALAV-----VKARNGA 143 (143) Q Consensus 120 ~~~~f~a------------h~dl~~a~~~-----~~~~~~~ 143 (143) .+-..|| +.++++.+.. |+.--.. T Consensus 134 ~~v~IPARPfLG~s~~de~~~~~~~~I~~~i~~~l~~a~~~ 174 (175) T protein:vir:79 134 LKVTIPGRAWLPVTADGELQPEAVEPVLNTILRHLMDAANR 174 (175) T ss_pred cccccCcccccCCCcccchhHHHHHHHHHHHHHHHHHHhcc Confidence 1111111 1111122211 1111111 No 68 >protein:vir:107851 Length: 175 # NCBI annotation: gp31 # Family: family:all:274 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024704;genbank:gi:48696941;genbank:GeneID:2845939 Probab=22.91 E-value=2.4 Score=18.52 Aligned_cols=106 Identities=20% Similarity=0.312 Sum_probs=50.0 Q ss_pred ceeeeeccccCCch--hhHHHHHh-hhhHHHHHHHHHHHHHHHHhhhhccc----------------------------- Q lcl|NC_021296. 8 IGHRLTDIQVPNPN--RGLAQILL-SPNMELLMGIIGQEVVLAYRAGVAKR----------------------------- 55 (143) Q Consensus 8 i~~~~~d~k~~np~--rgl~eiL~-S~~m~~Lva~~~e~v~~~Y~a~VAkR----------------------------- 55 (143) .+. |-+|++.+.. +.|.++.. ..+++.|++.+++.+...-+.+.... T Consensus 1 Ms~-~i~i~~~~~~l~~~L~~l~~~~~d~~~l~~~Ig~~l~~~t~~rF~~e~~Pdw~p~~p~t~~~r~~~g~~~~k~~~~ 79 (175) T protein:vir:10 1 MSD-FVNFQIDDSALRTRLLQLEQAGHQKAGAMRKIAQALVLVTEDNFAAQGRPRWQALSEATIHMRVGGKKAYKKNGEL 79 (175) T ss_pred Cce-eEEEEecHHHHHHHHHHHHHHhccHHHHHHHHHHHHHHHHHHHHHhccCCCCCCCchhhhhhhhcccccchhhhhh Confidence 222 3466766653 44444432 23567888888888877666655443 Q ss_pred ----------------cccccccceEEEeecCCCCCeeEEEEEecccccccCcccCCCcCCcchhhhhhhhhhcCCCCCC Q lcl|NC_021296. 56 ----------------TGKLMSSASSETMIGGKKNDRWVSHVTIGGETAVSTWHSPRNPNPGDLFFYGVLHEHGDGGNPP 119 (143) Q Consensus 56 ----------------Tg~LArsvrvetfIGG~K~DRwVg~VtVG~e~aa~~~Hspr~g~pgd~f~ygvlh~~g~~~~~p 119 (143) ||.|++|....+ ++ | .+.||.. --|+-+|+||.--..+ T Consensus 80 ~~~~~~~~~~~~~L~~tG~L~~Si~~~~--~~---~----~v~vGtn-----------------~~YAaiHqfGg~~~~~ 133 (175) T protein:vir:10 80 TAAASRRKAGLMILQDSGQMAASVSTDH--DD---N----SAVIGSN-----------------KEYAAIHQFGGQAGRG 133 (175) T ss_pred hhhhhhhccCCCcceechhhhhhhheee--cC---C----EEEEecC-----------------hhhhhhhhcccccCCC Confidence 333444444332 11 0 3444332 2488899998532111 Q ss_pred cccccchh----------------hHHHHH-H----HHhhhc Q lcl|NC_021296. 120 SGWDFPAH----------------KDLKKA-L----AVVKAR 140 (143) Q Consensus 120 ~~~~f~ah----------------~dl~~a-~----~~~~~~ 140 (143) -+-..||- .++.+. + .+++.| T Consensus 134 ~~v~iPaRpfLG~s~~d~~~~e~~~~Il~~~~~~l~~~~~~~ 175 (175) T protein:vir:10 134 LKVTIPARPWLPVTADGELQPEAVEPVLNTILRHLMDAANRR 175 (175) T ss_pred CccccCCccccCCCcccccchHHHHHHHHHHHHHHHHHhccC Confidence 11122221 111111 1 245555 Done!