Query         lcl|NC_017976.1_cdsid_YP_006383461.1 [gene=PBC1_gp08] [protein=major capsid protein] [protein_id=YP_006383461.1] [location=6051..6941]
Match_columns 296
No_of_seqs    16 out of 21
Neff          2.6
Searched_HMMs 1612
Date          Thu Nov 7 12:52:41 2013
Command       /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_8 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_8_vs_rec_db.hhr

 No Hit                             Prob  E-value  P-value  Score    SS Cols Query HMM  Template HMM
  1 protein:vir:98871 Length: 314  100.0   1E-187   6E-191 1045.8  26.5  295   1-295     19-314 (314)
  2 protein:vir:94528 Length: 286  100.0   9E-185   6E-188 1029.6  24.9  286   1-293      1-286 (286)
  3 protein:vir:3969 Length: 287 # 100.0   4E-182   2E-185 1015.2  24.8  284   8-293      1-287 (287)
  4 protein:vir:4786 Length: 295 # 100.0   6E-168   4E-171  937.2  16.9  278   1-296      1-280 (295)
  5 protein:vir:79712 Length: 285   99.1  1.8E-11  1.1E-14   79.5  15.9  265   8-294      1-285 (285)
  6 protein:vir:94800 Length: 319   99.1  2.3E-11  1.4E-14   78.9  16.0  274   1-296      1-298 (319)
  7 protein:vir:97331 Length: 319   99.1  2.3E-11  1.4E-14   78.9  16.0  274   1-296      1-298 (319)
  8 protein:vir:78090 Length: 302   98.9  4.6E-10  2.9E-13   71.8  16.7  265   1-295      1-302 (302)
  9 protein:vir:107120 Length: 329  98.8  3.1E-10  1.9E-13   72.7  15.0  273   1-296     12-310 (329)
 10 protein:vir:105464 Length: 346  98.8  2.2E-09  1.3E-12   68.1  17.1  271   8-296      1-302 (346)
 11 protein:vir:102605 Length: 273  98.7  2.7E-09  1.7E-12   67.5  15.5  264   8-292      1-273 (273)
 12 protein:vir:105822 Length: 273  98.7  2.7E-09  1.7E-12   67.5  15.5  264   8-292      1-273 (273)
 13 protein:vir:79008 Length: 299   98.6    7E-09  4.4E-12   65.3  16.1  271   1-295      1-299 (299)
 14 protein:vir:99523 Length: 311   98.6  1.1E-08  6.8E-12   64.2  16.3  269   1-293      1-311 (311)
 15 protein:vir:7990 Length: 273 #  98.5  1.1E-08  6.6E-12   64.3  14.7  264   8-292      1-273 (273)
 16 protein:vir:102335 Length: 312  98.4  7.2E-08  4.5E-11   59.7  15.5  271   1-296      1-310 (312)
 17 protein:vir:78920 Length: 290   98.3  1.9E-07  1.2E-10   57.5  16.8  261   8-292      1-290 (290)
 18 protein:vir:78739 Length: 332   98.0  3.7E-07  2.3E-10   55.9  12.2  265   1-290      7-332 (332)
 19 protein:vir:3033 Length: 272 #  97.5  5.7E-05  3.5E-08   43.8  17.9  263   1-296      1-271 (272)
 20 protein:vir:9820 Length: 272 #  97.5  5.7E-05  3.5E-08   43.8  17.9  263   1-296      1-271 (272)
 21 protein:vir:80213 Length: 334   97.5  5.4E-05  3.3E-08   44.0  16.7  254   1-296      1-295 (334)
 22 protein:vir:10450 Length: 344   97.4    2E-05  1.2E-08   46.4  13.5  269   1-292      1-344 (344)
 23 protein:vir:2201 Length: 345 #  97.3    6E-05  3.7E-08   43.7  15.2  268   1-292      1-345 (345)
 24 protein:vir:93742 Length: 274   97.0  0.00022  1.4E-07   40.6  17.9  262   1-296      1-274 (274)
 25 protein:vir:6324 Length: 335 #  97.0  0.00022  1.4E-07   40.6  16.5  250   1-296      1-291 (335)
 26 protein:vir:8885 Length: 347 #  96.7  8.7E-05  5.4E-08   42.8  11.4  269   1-293      1-347 (347)
 27 protein:vir:100057 Length: 375  96.5  0.00019  1.2E-07   40.9  11.7  264   1-296      1-330 (375)
 28 protein:vir:80180 Length: 381   96.5   0.0004  2.5E-07   39.2  13.4  273   1-296     11-338 (381)
 29 protein:vir:94711 Length: 347   96.4   0.0007  4.4E-07   37.9  14.3  265   1-293      1-347 (347)
 30 protein:vir:80930 Length: 278   96.3  0.00073  4.5E-07   37.8  15.5  265   1-293      1-278 (278)
 31 protein:vir:96833 Length: 275   96.3  0.00081  5.1E-07   37.5  16.8  263   1-296      1-273 (275)
 32 protein:vir:94494 Length: 274   96.2  0.00094  5.8E-07   37.2  17.8  263   1-296      1-274 (274)
 33 protein:vir:97433 Length: 274   96.2  0.00094  5.8E-07   37.2  17.8  263   1-296      1-274 (274)
 34 protein:vir:1541 Length: 347 #  95.7   0.0015  9.5E-07   36.0  14.0  272   1-296      1-347 (347)
 35 protein:vir:78935 Length: 335   95.6   0.0018  1.1E-06   35.7  15.3  248   1-296      1-291 (335)
 36 protein:vir:96123 Length: 274   95.6   0.0018  1.1E-06   35.7  14.1  262   1-296      1-274 (274)
 37 protein:vir:95898 Length: 274   95.4   0.0021  1.3E-06   35.3  14.7  265   1-296      1-274 (274)
 38 protein:vir:96262 Length: 274   95.4   0.0021  1.3E-06   35.3  14.7  265   1-296      1-274 (274)
 39 protein:vir:94576 Length: 347   95.2   0.0016    1E-06   35.9  11.5  261   1-296      1-310 (347)
 40 protein:vir:105334 Length: 276  94.9   0.0032    2E-06   34.2  13.2  262   1-296      1-273 (276)
 41 protein:vir:94622 Length: 341   94.7   0.0032    2E-06   34.2  11.9  266   1-294     12-341 (341)
 42 protein:vir:3364 Length: 347 #  93.6   0.0069  4.3E-06   32.4  13.1  270   1-296      1-347 (347)
 43 protein:vir:99075 Length: 392   92.8     0.01  6.2E-06   31.6  13.8  259   8-296      1-278 (392)
 44 protein:vir:1239 Length: 274 #  91.3    0.017    1E-05   30.3  17.0  263   1-296      1-274 (274)
 45 protein:vir:97031 Length: 402   90.7     0.02  1.2E-05   30.0  14.8  247   1-296      1-296 (402)
 46 protein:vir:102655 Length: 322  90.2    0.022  1.4E-05   29.6  14.1  269   1-293     13-322 (322)
 47 protein:vir:3613 Length: 272 #  89.3    0.027  1.7E-05   29.1  14.3  258   1-294      1-272 (272)
 48 protein:vir:7019 Length: 401 #  80.2    0.097    6E-05   26.1  13.0  249   1-296      1-296 (401)
 49 protein:vir:99675 Length: 324   79.7      0.1  6.3E-05   26.0   8.6  234  38-296      1-303 (324)
 50 protein:vir:105645 Length: 400  79.1     0.11  6.7E-05   25.9  13.5  246   1-296      1-296 (400)
 51 protein:vir:95107 Length: 270   68.0     0.24  0.00015   23.9  13.4  260   1-296      1-269 (270)
 52 protein:vir:103323 Length: 364  55.6     0.48  0.00029   22.4  18.2  261   1-296      1-342 (364)
 53 protein:vir:1781 Length: 221 #  51.4     0.58  0.00036   21.9  10.3  182  94-296      1-206 (221)
 54 protein:vir:4830 Length: 397 #  33.4      1.4  0.00084   19.9  13.5  261   1-296    111-389 (397)
 55 protein:vir:4856 Length: 293 #  31.7      1.5  0.00092   19.6  16.5  262   1-296      5-285 (293)
 56 protein:vir:108303 Length: 418  31.5      1.5  0.00092   19.6  13.5  260   1-296      1-294 (418)

No 1
>protein:vir:98871 Length: 314 # NCBI annotation: major capsid protein # Family: family:all:3269 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164418;genbank:gi:56694908;genbank:GeneID:3197261
Probab=100.00 E-value=1e-187 Score=1045.75 Aligned_cols=295 Identities=66% Similarity=1.040 Sum_probs=291.8

Q ss_pred CCCCcccceeeeechhHHHHHHHHHhhhhhhhhhcccceeeecCCcccceeEEEeecccceEecccccCcccceeccccC
Q lcl|NC_017976. 
1 MGTKNQQLAAKTYQKQFKEMLQAVFSHQAYFADFFGGGIEALDGVQNNETAFYVKTSDLPVVVGTGYNTDANVGFGTGTS 80 (296) Q Consensus 1 m~t~Nnn~a~r~Y~kq~~~ll~~vf~~qa~F~~~fgg~lQ~lDGVqnN~tafsvKtnd~pVvvg~~Y~td~NvaFGtGTg 80 (296) -+|+|||+|+|.|+|||+||||+||++|++|+|+|||+||+|||||||+|||||||||+|||||+.|+||+|||||+||| T Consensus 19 ~~t~N~n~avr~Y~Kqf~glL~~vf~~qa~F~~~FGg~lQalDGV~~N~tafsvKtsD~pVVig~~Y~TdeNvaFGtGTg 98 (314) T protein:vir:98 19 SGTANQNKAARSYQKEFRQLLQAVFRSQAYFRDFFGGGIEALDGVQHNDTAFYVKTSDIPVVVGNEYNKDENVGFGEGTS 98 (314) T ss_pred eccccCccceeeecHHHHHHHHHHHhhHhhhhhhcccceeeccCCCccceEEEEeecccceeecCcccCCCCcccccCCc Confidence 78999999999999999999999999999999999999999999999999999999999999998899999999999999 Q ss_pred CcccccceeEEEEeccccccccCchhhhccccccccCCHhHHHHHHHhhHHHHHHHHHHHHHhHHHhhhhcchhhhhhcc Q lcl|NC_017976. 81 KSSRFGDRQEIIYIDTPVPYTWGYNWHEGIDRYTVNNSMESAMADRVELQAQAKTMLFDKKHAEFIVANAGKTEALTAYD 160 (296) Q Consensus 81 ~s~RFG~rkEIiy~DtdVpY~~~~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~~~gk~ls~~A~~t~~l~~~~ 160 (296) +||||||||||||.||||||+|+|+|||||||+|||||+|+||||||+||||||+|+||+++||+||++|++||+|++++ T Consensus 99 ~SsRFGprkEi~y~dtdVpY~~~~~iHEGiD~~TVNnd~~aaVAdRL~LQA~Akt~~~n~~~Gk~lS~~As~te~ltd~~ 178 (314) T protein:vir:98 99 RSTRFGPRREIIYQDTPVPYTWEWVYHEGIDKHTVNNDFQAAVADRLDLQANAKIKQFNAQHSKFISSIAEKTETLTDYS 178 (314) T ss_pred cccccCceeEEEeecccccccccchhhhccccccccCChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhcc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhHHHHHHHHhhhhhcceeeeeeEEEEECchhhhhhhccccccccccceeeeccCcceeecCeEEEEchHHHhc-CCeEE Q lcl|NC_017976. 
161 EAAVLKLFNNLSAYYINIEAIGTKVAKVGPELYNIIVDHRLTTKEKASGANVDSNEIVKFKGFLIEEVPQAKLG-ANAAL 239 (296) Q Consensus 161 ~~~V~klFn~~~~~yvn~ev~~~~~ayV~~evYNaIvD~~l~Ts~K~Ss~NiD~ngi~~fKgf~l~e~p~~y~q-g~~ai 239 (296) +|+|+||||+||++|||+|+++||+|||+||||||||||+|+||+||||+|||||||||||||+|+|+|++||| |++|| T Consensus 179 ~d~V~~LF~~as~~yvn~ev~~~~~AyV~~evYnaiiD~~l~TsaK~SsaNIDengi~~FkGf~i~e~P~~~~q~g~ia~ 258 (314) T protein:vir:98 179 ADNVLRLFNELSKYYVNIEAIGTKAAKVSPELYNAIVDHPLTTSAKSSSANIDQNGIVNFKGFAIQEIPESMLQSGDVAY 258 (314) T ss_pred hhhHHHHHHHHHhhhhcceeeEEEEEEEchhHHhHhhccccccccccceeeeccCCcceecceEEEecchhhcCCCcEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999 88999 Q ss_pred EeecceeeeccceEEEEEeeccCccceeeeecccccccCCCCCcceEEEEecccCC Q lcl|NC_017976. 240 VYIKGVGKAFTGITTARTIESEDFDGVAFQGAGKAGEFILDDNKPAVVKVTAPTVT 295 (296) Q Consensus 240 fs~dnIg~af~GI~taRtieSEDFdGVaLQgAgK~G~~IlddNKkAI~k~t~~~~~ 295 (296) |+||||||+||||+|||+||||||||||||||||||+||||||||||+|+|++... T Consensus 259 ~s~dnig~aftGIn~aR~IesEdF~GValQgAGK~G~~I~edNk~Ai~k~t~tp~~ 314 (314) T protein:vir:98 259 TYITNIGKAFTGINTSRIIESEDFDGVALQGAGKAGEFILDDNKKAVAKVTSTPEG 314 (314) T ss_pred EccccceeecccceeeeeeecccccceeeecccccccccccccceeeEEEecCCCC Confidence 99999999999999999999999999999999999999999999999999986655 No 2 >protein:vir:94528 Length: 286 # NCBI annotation: major head protein # Family: family:all:3269 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223889;genbank:gi:62327101;genbank:GeneID:5075544 Probab=100.00 E-value=9.1e-185 Score=1029.57 Aligned_cols=286 Identities=47% Similarity=0.730 Sum_probs=283.6 Q ss_pred CCCCcccceeeeechhHHHHHHHHHhhhhhhhhhcccceeeecCCcccceeEEEeecccceEecccccCcccceeccccC Q lcl|NC_017976. 
1 MGTKNQQLAAKTYQKQFKEMLQAVFSHQAYFADFFGGGIEALDGVQNNETAFYVKTSDLPVVVGTGYNTDANVGFGTGTS 80 (296) Q Consensus 1 m~t~Nnn~a~r~Y~kq~~~ll~~vf~~qa~F~~~fgg~lQ~lDGVqnN~tafsvKtnd~pVvvg~~Y~td~NvaFGtGTg 80 (296) |+|+|||||+|.|+|||+||||+||++|++|+|+||| ||+|||||||+|||||||||+|||||+ |+||+||+||+||| T Consensus 1 m~t~N~n~avr~Y~Kqf~glL~~vf~~qa~F~~~fgg-lQalDGV~~N~tafsvKt~D~pVVig~-Y~TdeNv~FGtgTg 78 (286) T protein:vir:94 1 MATTNNDLPVRVYSKEFLQLLSTVYQAQSVFTPTFGA-LQALDGVPNNATAFSVKTNDMAVVVGE-YSTDANTAFGTGTS 78 (286) T ss_pred CCCCccccceeehhHHHHHHHHHHHhhHHHhhhhhcc-hhhhhCCCccceEEEEeecCcceEEec-ccCCCccccccCCc Confidence 9999999999999999999999999999999999999 999999999999999999999999998 99999999999999 Q ss_pred CcccccceeEEEEeccccccccCchhhhccccccccCCHhHHHHHHHhhHHHHHHHHHHHHHhHHHhhhhcchhhhhhcc Q lcl|NC_017976. 81 KSSRFGDRQEIIYIDTPVPYTWGYNWHEGIDRYTVNNSMESAMADRVELQAQAKTMLFDKKHAEFIVANAGKTEALTAYD 160 (296) Q Consensus 81 ~s~RFG~rkEIiy~DtdVpY~~~~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~~~gk~ls~~A~~t~~l~~~~ 160 (296) +||||||||||||.||||||+|+|+|||||||+|||||+|+||||||+||||||+|+||+++||+||++|++|++| T Consensus 79 ~SsRFG~rkEi~y~dtdV~Y~~~~~iHEGiD~~TVNnd~~aaVAdRL~lQA~Akt~~~n~~~Gk~ls~~A~~t~~~---- 154 (286) T protein:vir:94 79 NSSRFGEMKEVIYADTDVPYTAGWAIHEGLDQMTVNNDLDAAVADRLNLQAQAKTRLFNVAMGEALATAGTDLGAV---- 154 (286) T ss_pred cccccCceeeEEeecccccccccchhhhccccccccCChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhh---- Confidence 9999999999999999999999999999999999999999999999999999999999999999999999999887 Q ss_pred hhHHHHHHHHhhhhhcceeeeeeEEEEECchhhhhhhccccccccccceeeeccCcceeecCeEEEEchHHHhcCCeEEE Q lcl|NC_017976. 
161 EAAVLKLFNNLSAYYINIEAIGTKVAKVGPELYNIIVDHRLTTKEKASGANVDSNEIVKFKGFLIEEVPQAKLGANAALV 240 (296) Q Consensus 161 ~~~V~klFn~~~~~yvn~ev~~~~~ayV~~evYNaIvD~~l~Ts~K~Ss~NiD~ngi~~fKgf~l~e~p~~y~qg~~aif 240 (296) |+|+||||+||++|||+||++|++|||+||||||||||+|+||+|+||+|||||||||||||+|+|+|++||+|+++|| T Consensus 155 -D~V~~LF~~as~~yvn~ev~~~~~ayV~~evYnaiiD~~l~TsaK~SsaNiDengi~~FkGf~i~e~P~~~~~g~~aif 233 (286) T protein:vir:94 155 -DDVNALFESAVEKYTDLEVIAPVRAYVTASVYNAIIDLANVTTAKNSAVNIDTNGMLSFRGIAITKVPTQYMGGKAVIF 233 (286) T ss_pred -hhHHHHHHHHHHHhhhhheeeeeEEEEchhHHHHHhccccccccccceeeeccCCcceecceEEeecchhhccCceEEE Confidence 5799999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eecceeeeccceEEEEEeeccCccceeeeecccccccCCCCCcceEEEEeccc Q lcl|NC_017976. 241 YIKGVGKAFTGITTARTIESEDFDGVAFQGAGKAGEFILDDNKPAVVKVTAPT 293 (296) Q Consensus 241 s~dnIg~af~GI~taRtieSEDFdGVaLQgAgK~G~~IlddNKkAI~k~t~~~ 293 (296) |||||||+||||+|||+|||||||||+||||||||+||||||||||+|++... T Consensus 234 s~dnig~aftGIn~aR~IesEdF~GValQgAGK~G~~I~edNk~Ai~~~~~k~ 286 (286) T protein:vir:94 234 APDNVARVFTGINIARTIQAIDFAGVELQGAGKYGTFILDDNKKAIFTATPKA 286 (286) T ss_pred ccccceeeeccceeeeeeeccccCceeeeccccccccccccCceeEEEeecCC Confidence 99999999999999999999999999999999999999999999999999888 No 3 >protein:vir:3969 Length: 287 # NCBI annotation: major capsid protein # Family: family:all:3269 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663677;genbank:gi:21716114;genbank:GeneID:951200 Probab=100.00 E-value=3.8e-182 Score=1015.18 Aligned_cols=284 Identities=37% Similarity=0.610 Sum_probs=280.9 Q ss_pred ceeeeechhHHHHHHHHHhhhhhhhhhcccceeeecCCcccceeEEEeecccceEecccccCcccceeccccCCcccccc Q lcl|NC_017976. 
8 LAAKTYQKQFKEMLQAVFSHQAYFADFFGGGIEALDGVQNNETAFYVKTSDLPVVVGTGYNTDANVGFGTGTSKSSRFGD 87 (296) Q Consensus 8 ~a~r~Y~kq~~~ll~~vf~~qa~F~~~fgg~lQ~lDGVqnN~tafsvKtnd~pVvvg~~Y~td~NvaFGtGTg~s~RFG~ 87 (296) ||+|.|+|||+||||+||++||+|+|+|||.||++||||||+|||||||||+|||||+ |+||+|||||+|||+|||||| T Consensus 1 ~avr~y~Kq~~glL~~vf~~qa~F~~~FGg~lQ~~DGV~~N~taf~vKtsD~pVVi~~-Y~Td~Nv~FGtGTg~ssRFG~ 79 (287) T protein:vir:39 1 MAIKYFTKQYAGMLPDLFAKKSAFLRAFGGVLQVKDGVTENDTFMELKVSDTDVVIQA-YSTDANVGFGSGTGNTSRFGQ 79 (287) T ss_pred CCcccccHHHHHHHHHHHHHHHhhhhhcccceeeecCCcccceEEEEEecCcceEEec-ccCCCCcccccCCCccccccc Confidence 9999999999999999999999999999999999999999999999999999999997 999999999999999999999 Q ss_pred eeEEEEeccccccccCchhhhccccccccCCHhHHHHHHHhhHHHHHHHHHHHHHhHHHhhhhcchhhhhhcchhHHHHH Q lcl|NC_017976. 88 RQEIIYIDTPVPYTWGYNWHEGIDRYTVNNSMESAMADRVELQAQAKTMLFDKKHAEFIVANAGKTEALTAYDEAAVLKL 167 (296) Q Consensus 88 rkEIiy~DtdVpY~~~~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~~~gk~ls~~A~~t~~l~~~~~~~V~kl 167 (296) ||||||.||||||+|||+|||||||+|||||+|+||||||+||||||+|+||+++||+||++|++|+++ .+|+|+|+|| T Consensus 80 rkEi~y~dt~V~Y~~~~~ihEGiD~~TVNnd~~aaVAdRL~Lqa~A~t~~~n~~~Gk~ls~~A~~t~~~-~~t~d~V~~L 158 (287) T protein:vir:39 80 RKEVKSVNKQVSYDAPLAINEGIDDFTVNDIKDQVVAERLALHGVAWAQHVDKLLGKLLSDSASETLTV-KLDEDSVTKL 158 (287) T ss_pred eeEEEEecccccceeccccccccccccccCChhHHHHHHHHhHHHHHHHHHHHHHHHHHHhhcchheee-eecccchHHH Confidence 999999999999999999999999999999999999999999999999999999999999999999998 5999999999 Q ss_pred HHHhhhhhcc--eeeeeeEEEEECchhhhhhhccccccccccceeeeccCcceeecCeEEEEchHHHhc-CCeEEEeecc Q lcl|NC_017976. 
168 FNNLSAYYIN--IEAIGTKVAKVGPELYNIIVDHRLTTKEKASGANVDSNEIVKFKGFLIEEVPQAKLG-ANAALVYIKG 244 (296) Q Consensus 168 Fn~~~~~yvn--~ev~~~~~ayV~~evYNaIvD~~l~Ts~K~Ss~NiD~ngi~~fKgf~l~e~p~~y~q-g~~aifs~dn 244 (296) ||+||++||| +|+++||+|||+||+||+||||+|+||+||||+|||||||||||||+|+|+|+++|| |++||||||| T Consensus 159 F~~a~~~yvNn~v~~~~~~~AyV~aevYnaiiD~~l~TsaK~SsaNiDen~i~kFkGf~l~e~P~~~~q~g~~a~fs~dn 238 (287) T protein:vir:39 159 FSDAHKKFVNNNVSIAVPWVAYVNADIYDLLIDSKLATTAKNSSANVDEQTLYKFKGFILSELPDEKFQLNEGAYFAADN 238 (287) T ss_pred HHHHHHHhhccceeeEEEEEEEEChhHHhHHhccccccccccceeeeccCCcceecceEEEecchHhhccCcEEEEcccc Confidence 9999999998 566899999999999999999999999999999999999999999999999999999 9999999999 Q ss_pred eeeeccceEEEEEeeccCccceeeeecccccccCCCCCcceEEEEeccc Q lcl|NC_017976. 245 VGKAFTGITTARTIESEDFDGVAFQGAGKAGEFILDDNKPAVVKVTAPT 293 (296) Q Consensus 245 Ig~af~GI~taRtieSEDFdGVaLQgAgK~G~~IlddNKkAI~k~t~~~ 293 (296) ||||||||+|+|+||||||||||||||||||+||||||||||+|+++++ T Consensus 239 ig~af~GI~vaR~i~sEdF~GvalQgAgK~G~~i~e~Nk~Ai~k~t~~k 287 (287) T protein:vir:39 239 VGVAGVGIQVTRAMDSEDFAGTALQAAAKYGKYLPEKNKKAILKATVTK 287 (287) T ss_pred ceeecccceeEEeeecccccceeeecccccccccccccceEEEEEecCC Confidence 9999999999999999999999999999999999999999999999999 No 4 >protein:vir:4786 Length: 295 # NCBI annotation: hypothetical protein # Family: family:all:3269 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150166;swissprot:trembl:q94m45;genbank:gi:15088777;uniprot:Q94M45;genbank:GeneID:955980 Probab=100.00 E-value=6.3e-168 Score=937.25 Aligned_cols=278 Identities=40% Similarity=0.627 Sum_probs=266.8 Q ss_pred CCCCcccceeeeechhHHHHHHHHHhhhhhhhhhcccceeeecCCcccceeEEEeecccceEecccccCcccce-ecccc Q lcl|NC_017976. 
1 MGTKNQQLAAKTYQKQFKEMLQAVFSHQAYFADFFGGGIEALDGVQNNETAFYVKTSDLPVVVGTGYNTDANVG-FGTGT 79 (296) Q Consensus 1 m~t~Nnn~a~r~Y~kq~~~ll~~vf~~qa~F~~~fgg~lQ~lDGVqnN~tafsvKtnd~pVvvg~~Y~td~Nva-FGtGT 79 (296) |++ |||+|+|.|+|||+||||+||++|++|+|+||| ||++||||||+|||||||||+|||||+ |+||+|+| ||+|| T Consensus 1 mp~-N~n~avr~Y~Kqf~glL~~vf~~qa~F~~~FGg-lQalDGV~~N~tafsvKt~D~pVVig~-Y~TdeNvagFGtGT 77 (295) T protein:vir:47 1 MPS-NQNNAVRRYEKQYAGILETVFGVRAAFSNALAP-IQILDGVQENSKAFSVKTNNTPVVIGE-YKTGENDGGFGDNS 77 (295) T ss_pred CCC-CCCccchhhhHHHHHHHHHHHhHHHHHhhhhcc-hhhhhCCCccceEEEEeecCcceEeec-ccCCCcccccccCC Confidence 999 999999999999999999999999999999999 999999999999999999999999998 99999996 99999 Q ss_pred CCcccccceeEEEEeccccccccCchhhhccccccccCCHhHHHHHHHhhHHHHHHHHHHHHHhHHHhhhhcchhhhhhc Q lcl|NC_017976. 80 SKSSRFGDRQEIIYIDTPVPYTWGYNWHEGIDRYTVNNSMESAMADRVELQAQAKTMLFDKKHAEFIVANAGKTEALTAY 159 (296) Q Consensus 80 g~s~RFG~rkEIiy~DtdVpY~~~~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~~~gk~ls~~A~~t~~l~~~ 159 (296) |+||||||||||||.||||||+|+|+|||||||+|||||+|+||||||+||||||+|+||+++||+||++|++||+++++ T Consensus 78 g~SsRFG~rkEi~y~dtdV~Y~~~~~iHEGiD~~TVNnd~~aaVAdRL~LQA~Akt~~~n~~~Gk~ls~~A~~te~~td~ 157 (295) T protein:vir:47 78 GAQSRFGGVTEVKYENTDVNYDYTLTIHEGLDRYTVNNDLNAAVADRLKLQSEAQTRTVNKRIGKYLSDTATKTEALADF 157 (295) T ss_pred ccccccCceeeEEeecccccccccchhhhccccccccCChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred chhHHHHHHHHhhhhhcceeeeeeEEEEECchhhhhhhccccccccccceeeeccCcceeecCeEEEEchHHHhc-CCeE Q lcl|NC_017976. 
160 DEAAVLKLFNNLSAYYINIEAIGTKVAKVGPELYNIIVDHRLTTKEKASGANVDSNEIVKFKGFLIEEVPQAKLG-ANAA 238 (296) Q Consensus 160 ~~~~V~klFn~~~~~yvn~ev~~~~~ayV~~evYNaIvD~~l~Ts~K~Ss~NiD~ngi~~fKgf~l~e~p~~y~q-g~~a 238 (296) ++|+|+||||++|++|||+||++|++|||+||||||||||+|+||+||||+|||||||+|||||+|+|+|++||| |++| T Consensus 158 t~d~V~~LF~~as~~yvn~ev~~~~~AyV~~evYnaiiD~~l~TsaK~SsaNiDengi~~FkGf~i~e~P~~~~q~G~~a 237 (295) T protein:vir:47 158 TDDKVKALFNKLSAFYTNNEVTAPITVYLRSEFYNAIVDMASVTSAKGATISLDENGLPKYKGFTLEETPAQYFETGVIA 237 (295) T ss_pred cchhHHHHHHHHHHHhhhhheeeeeEEEEchhHHHHHhccccccccccceeeeccCCcceecceEEEeccHhhccCCcEE Confidence 999999999999999999999999999999999999999999999999999999999999999999999999999 9999 Q ss_pred EEeecceeeeccceEEEEEeeccCccceeeeecccccccCCCCCcceEEEEecccCCC Q lcl|NC_017976. 239 LVYIKGVGKAFTGITTARTIESEDFDGVAFQGAGKAGEFILDDNKPAVVKVTAPTVTP 296 (296) Q Consensus 239 ifs~dnIg~af~GI~taRtieSEDFdGVaLQgAgK~G~~IlddNKkAI~k~t~~~~~p 296 (296) ||||||||||||||+|||+||||||||||||=--. +.++---. T Consensus 238 ifs~dnig~aftGIn~aR~IesEdF~GValQ~~~~---------------~~~~~~~~ 280 (295) T protein:vir:47 238 IFSPNGIIIPFVGISTARVIEAENFDGVNCKLLLR---------------VVLTLLMT 280 (295) T ss_pred EEccccceeecccceeeeeeecccccchHHHHHHH---------------HHHHHHHH Confidence 99999999999999999999999999999995321 11110000 No 5 >protein:vir:79712 Length: 285 # NCBI annotation: major capsid protein gp34 # Family: family:all:701 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285883;genbank:gi:148750840;genbank:GeneID:5220414 Probab=99.07 E-value=1.8e-11 Score=79.49 Aligned_cols=265 Identities=13% Similarity=0.173 Sum_probs=181.0 Q ss_pred ceeeeechhHHHHHHHHHhhhhhhhhhcccceeeecC-Cc-ccceeEEEeecccc-eE-ecccccCcccceeccccCCcc Q lcl|NC_017976. 8 LAAKTYQKQFKEMLQAVFSHQAYFADFFGGGIEALDG-VQ-NNETAFYVKTSDLP-VV-VGTGYNTDANVGFGTGTSKSS 83 (296) Q Consensus 8 ~a~r~Y~kq~~~ll~~vf~~qa~F~~~fgg~lQ~lDG-Vq-nN~tafsvKtnd~p-Vv-vg~~Y~td~NvaFGtGTg~s~ 83 (296) ||+- |.++|-.+|...|...+++....-+. -.+ ++ ++ +=+||+..+. ++ ++. |+. 
+.+|-.|+- T Consensus 1 Main-~~~k~~~~ld~~~~~~~~~~~l~~~~---n~~~~~~~g--ak~VkIp~ist~~gl~d-Y~R--~~g~~~g~v--- 68 (285) T protein:vir:79 1 MTVV-LDSKDLARIDEEYKADSQVWSYLTGG---NGVTQRFRG--HNEVRINKLSGFVDATA-YKR--GQDNARKTI--- 68 (285) T ss_pred Ccch-hhHHHHHHHHHHHHHhhhhhhhcccC---CcceeEecC--CCEEEEeeecccccccc-ccc--ccCcccccc--- Confidence 6655 67788899999999988887765541 000 11 11 1256766664 33 655 755 445554443 Q ss_pred cccceeEEEEeccccccccCchhhhccccccccCCHhHHHHHHHhhHHHHHH-HHHHHH-HhHHHhhhhcchhhhhhcch Q lcl|NC_017976. 84 RFGDRQEIIYIDTPVPYTWGYNWHEGIDRYTVNNSMESAMADRVELQAQAKT-MLFDKK-HAEFIVANAGKTEALTAYDE 161 (296) Q Consensus 84 RFG~rkEIiy~DtdVpY~~~~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~-~~~n~~-~gk~ls~~A~~t~~l~~~~~ 161 (296) ..-|+.....+|-.|.+. ||.+-|+......+|-=++.+..-++ -.++.. +.| |...|+...+ +.+|. T Consensus 69 ------~~~~et~tl~~DR~~~f~--iD~mDvdEn~~~~~~ni~~ef~~~~vvPEiDayrfsk-la~~a~~~~~-~~~T~ 138 (285) T protein:vir:79 69 ------SVGKETVKLTHEDWFGYD--LDQFDMDENGAYTVENVVREHNKMITIPHRDKVAVQK-LFDSAAKKAT-DSITK 138 (285) T ss_pred ------ceeeeEEEeeccccceec--ccccchhhhhhhhHHHHHHHHHhhhhcchhhHHHHHH-HHhhcccccc-cccCH Confidence 234555566677777765 89999976544444544554333332 344544 333 3344444444 57899 Q ss_pred hHHHHHHHHhhhhhcceeeeeeEEEEECchhhhhhhccccccccccceeeeccCc----ceeecC-eEEEEchHHHhcC- Q lcl|NC_017976. 
162 AAVLKLFNNLSAYYINIEAIGTKVAKVGPELYNIIVDHRLTTKEKASGANVDSNE----IVKFKG-FLIEEVPQAKLGA- 235 (296) Q Consensus 162 ~~V~klFn~~~~~yvn~ev~~~~~ayV~~evYNaIvD~~l~Ts~K~Ss~NiD~ng----i~~fKg-f~l~e~p~~y~qg- 235 (296) +++.+.+-.+.++--+.+|..+.++||+|++|.+|-..+.-+...+.+.+.-..| +..+.| +.|.+||..+|++ T Consensus 139 ~nv~~~i~~~~~~lde~~vp~~rvl~vTp~~~~~Lk~s~~~~r~~~~~~~~~~~~i~~~V~~lDg~v~ii~Vps~r~kt~ 218 (285) T protein:vir:79 139 DNALDAYDTAEAYMFDNEVPGGFVMFVSSAYYTALKQSAAVTRTFSTDGTMVINGIDRRVAQLDGGVPIVRVSSDRLKGL 218 (285) T ss_pred HHHHHHHHHHHHHHHHcCCCCceEEEEChHHHHHHHhhhhhheecccccceeccceeeeeccccceeEEEEcchhhccCc Confidence 9999999999999999999889999999999999998877766554433333334 468888 8999999999984 Q ss_pred ---C--eEEEeecceeeeccceEEEEEeecc---CccceeeeecccccccCCCCCcceEEEEecccC Q lcl|NC_017976. 236 ---N--AALVYIKGVGKAFTGITTARTIESE---DFDGVAFQGAGKAGEFILDDNKPAVVKVTAPTV 294 (296) Q Consensus 236 ---~--~aifs~dnIg~af~GI~taRtieSE---DFdGVaLQgAgK~G~~IlddNKkAI~k~t~~~~ 294 (296) + .-|..|...-++.+=.+..|.++.+ +=||=..|+--=++-||+|.=|++|.--..+.. T Consensus 219 ~~~k~Infiiv~~~a~i~~~K~~~~~~f~P~~~~~~d~~~~~~R~Y~d~fv~~nk~~~Iy~~~~a~~ 285 (285) T protein:vir:79 219 GITNHVNFILTPLSAIAPIVKYDSVSVIDPSTDRSGNRWTIKGLSYYDAIVLDNAKKGIYVAATAGV 285 (285) T ss_pred CcchhccEEEecCceeccceeeeeeEeECCCCCCCcceeeeeeeeeeeeeehhhccceeeeeecccC Confidence 1 2455566677888888999999877 777888888888899999877777766555555 No 6 >protein:vir:94800 Length: 319 # NCBI annotation: ORF012 # Family: family:all:701 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240536;genbank:gi:66396203;genbank:GeneID:5133580 Probab=99.05 E-value=2.3e-11 Score=78.89 Aligned_cols=274 Identities=14% Similarity=0.119 Sum_probs=176.1 Q ss_pred CC----------CC------cc--cceeeeechhHHHHHHHHHhhhhhhhhhcccceeeecCCcccceeEEEeecccceE Q lcl|NC_017976. 
1 MG----------TK------NQ--QLAAKTYQKQFKEMLQAVFSHQAYFADFFGGGIEALDGVQNNETAFYVKTSDLPVV 62 (296) Q Consensus 1 m~----------t~------Nn--n~a~r~Y~kq~~~ll~~vf~~qa~F~~~fgg~lQ~lDGVqnN~tafsvKtnd~pVv 62 (296) |- -- |+ .-=--.++++|..+|..++...++-.+.... +-..|. + +=+||++.+.++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~nt~~l~~k~~~~LD~~~~~~~~s~~~~~N--~~~e~~--g--g~tVkIp~i~~~ 74 (319) T protein:vir:94 1 MNKTIKNATGMLKLNLQHFANKSVEPGQTLLKNKHVGILERVTAVNAYSTPALIS--NDAIFM--E--GRSFTVMKGDTT 74 (319) T ss_pred CCcccccccceeEeehhhhhccCCCcchHHHHHHHHHHHHHHHHHhhhhhhcccC--cceEec--c--CcEEEEeeeccc Confidence 11 10 11 0111247889999999999888776554433 112232 2 225777776653 Q ss_pred -ecccccCcccceeccccCCcccccceeEEEEeccccccccCchhh-hccccccccCCHhH--HHHHHHhhHHHHHHHHH Q lcl|NC_017976. 63 -VGTGYNTDANVGFGTGTSKSSRFGDRQEIIYIDTPVPYTWGYNWH-EGIDRYTVNNSMES--AMADRVELQAQAKTMLF 138 (296) Q Consensus 63 -vg~~Y~td~NvaFGtGTg~s~RFG~rkEIiy~DtdVpY~~~~aiH-EGiDr~TVNndl~a--avAdRl~Lqa~Ak~~~~ 138 (296) ++. |+.+ .+|..|+-+ .-|+.-..-++-.|+|- ..+|.--.|.++.+ ++++... ...+..+ T Consensus 75 gl~D-Y~R~--~g~~~g~vt---------~~~~t~tidqdR~~~F~VD~~D~~Etn~~l~a~~i~~~~~~---~~v~PEi 139 (319) T protein:vir:94 75 ELKD-YKRN--ATNEFDHPK---------IEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGA---EVVAPYL 139 (319) T ss_pred cccc-ccCC--CCcccCCcc---------cceeEEEeecccccccccchhhHhhhhchhhHHHHHHHHHH---HHhhhhh Confidence 333 6553 344444322 12233334445555543 34555555655543 2333333 2334467 Q ss_pred HHHHhHHHhhhhcchhhhhhcchhHHHHHHHHhhhhhcceeeeeeEEEEECchhhhhhhccccccccccceeeeccCcc- Q lcl|NC_017976. 139 DKKHAEFIVANAGKTEALTAYDEAAVLKLFNNLSAYYINIEAIGTKVAKVGPELYNIIVDHRLTTKEKASGANVDSNEI- 217 (296) Q Consensus 139 n~~~gk~ls~~A~~t~~l~~~~~~~V~klFn~~~~~yvn~ev~~~~~ayV~~evYNaIvD~~l~Ts~K~Ss~NiD~ngi- 217 (296) |......|...|+...+ ..+|.+++...+-.+.++.-+.+|-...++||+|++|.+|-..+.-+...+-.-.+--||. 
T Consensus 140 Day~~skla~~a~~~~~-~~~t~~n~y~~i~~a~~~Lde~~VP~~Rvl~Vtp~~~~~L~~~~~f~~~~~~~~~~~~~g~V 218 (319) T protein:vir:94 140 DNLRFATLARNKAKHLT-VGTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGVQ 218 (319) T ss_pred hHHHHHHHHhhcccccc-cccCHHHHHHHHHHHHHHHHhcCCCCCcEEEeCHHHHHHHHhhhhhhccccccccceeeeec Confidence 87767777777776554 5689999999999999998888887779999999999999887766554432222223443 Q ss_pred eeecCeEEEEchHHHhcCCeEEEeecceeeeccceEEEEEee-ccCccceeeeecccccccCCCCCcceEEEEecccCCC Q lcl|NC_017976. 218 VKFKGFLIEEVPQAKLGANAALVYIKGVGKAFTGITTARTIE-SEDFDGVAFQGAGKAGEFILDDNKPAVVKVTAPTVTP 296 (296) Q Consensus 218 ~~fKgf~l~e~p~~y~qg~~aifs~dnIg~af~GI~taRtie-SEDFdGVaLQgAgK~G~~IlddNKkAI~k~t~~~~~p 296 (296) -++.||.|-++|...|.+-..|..+.+--.+.+=+...|... +++=.|=+.||--=||-||++.-+++|+....++++- T Consensus 219 g~idG~~Vi~vps~~~k~in~i~~h~~A~~~~~k~~~~~~~~p~~~~~a~~v~gr~y~d~~V~~~k~~~Iy~~~~~~~~~ 298 (319) T protein:vir:94 219 GELDGFVIVKVPTKLLQGLQAIAVVGEVLASPIQADLAKTNSNIPGMFGTLAEQLLYTGAFVPEHLQKYIFTIGGTEVAT 298 (319) T ss_pred eeecCeEEEEecccccccceEEEEcCCeeeeeeeeeeeeccCCCccccceeeeeeeeeeeEEeccccceEEEeecCCccc Confidence 468999999999999987776666666666667777788765 6776689999999999999999999998754443333 No 7 >protein:vir:97331 Length: 319 # NCBI annotation: ORF011 # Family: family:all:701 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240611;genbank:gi:66396278;genbank:GeneID:5133687 Probab=99.05 E-value=2.3e-11 Score=78.89 Aligned_cols=274 Identities=14% Similarity=0.119 Sum_probs=176.1 Q ss_pred CC----------CC------cc--cceeeeechhHHHHHHHHHhhhhhhhhhcccceeeecCCcccceeEEEeecccceE Q lcl|NC_017976. 1 MG----------TK------NQ--QLAAKTYQKQFKEMLQAVFSHQAYFADFFGGGIEALDGVQNNETAFYVKTSDLPVV 62 (296) Q Consensus 1 m~----------t~------Nn--n~a~r~Y~kq~~~ll~~vf~~qa~F~~~fgg~lQ~lDGVqnN~tafsvKtnd~pVv 62 (296) |- -- |+ .-=--.++++|..+|..++...++-.+.... +-..|. 
+ +=+||++.+.++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~nt~~l~~k~~~~LD~~~~~~~~s~~~~~N--~~~e~~--g--g~tVkIp~i~~~ 74 (319) T protein:vir:97 1 MNKTIKNATGMLKLNLQHFANKSVEPGQTLLKNKHVGILERVTAVNAYSTPALIS--NDAIFM--E--GRSFTVMKGDTT 74 (319) T ss_pred CCcccccccceeEeehhhhhccCCCcchHHHHHHHHHHHHHHHHHhhhhhhcccC--cceEec--c--CcEEEEeeeccc Confidence 11 10 11 0111247889999999999888776554433 112232 2 225777776653 Q ss_pred -ecccccCcccceeccccCCcccccceeEEEEeccccccccCchhh-hccccccccCCHhH--HHHHHHhhHHHHHHHHH Q lcl|NC_017976. 63 -VGTGYNTDANVGFGTGTSKSSRFGDRQEIIYIDTPVPYTWGYNWH-EGIDRYTVNNSMES--AMADRVELQAQAKTMLF 138 (296) Q Consensus 63 -vg~~Y~td~NvaFGtGTg~s~RFG~rkEIiy~DtdVpY~~~~aiH-EGiDr~TVNndl~a--avAdRl~Lqa~Ak~~~~ 138 (296) ++. |+.+ .+|..|+-+ .-|+.-..-++-.|+|- ..+|.--.|.++.+ ++++... ...+..+ T Consensus 75 gl~D-Y~R~--~g~~~g~vt---------~~~~t~tidqdR~~~F~VD~~D~~Etn~~l~a~~i~~~~~~---~~v~PEi 139 (319) T protein:vir:97 75 ELKD-YKRN--ATNEFDHPK---------IEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGA---EVVAPYL 139 (319) T ss_pred cccc-ccCC--CCcccCCcc---------cceeEEEeecccccccccchhhHhhhhchhhHHHHHHHHHH---HHhhhhh Confidence 333 6553 344444322 12233334445555543 34555555655543 2333333 2334467 Q ss_pred HHHHhHHHhhhhcchhhhhhcchhHHHHHHHHhhhhhcceeeeeeEEEEECchhhhhhhccccccccccceeeeccCcc- Q lcl|NC_017976. 139 DKKHAEFIVANAGKTEALTAYDEAAVLKLFNNLSAYYINIEAIGTKVAKVGPELYNIIVDHRLTTKEKASGANVDSNEI- 217 (296) Q Consensus 139 n~~~gk~ls~~A~~t~~l~~~~~~~V~klFn~~~~~yvn~ev~~~~~ayV~~evYNaIvD~~l~Ts~K~Ss~NiD~ngi- 217 (296) |......|...|+...+ ..+|.+++...+-.+.++.-+.+|-...++||+|++|.+|-..+.-+...+-.-.+--||. 
T Consensus 140 Day~~skla~~a~~~~~-~~~t~~n~y~~i~~a~~~Lde~~VP~~Rvl~Vtp~~~~~L~~~~~f~~~~~~~~~~~~~g~V 218 (319) T protein:vir:97 140 DNLRFATLARNKAKHLT-VGTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGVQ 218 (319) T ss_pred hHHHHHHHHhhcccccc-cccCHHHHHHHHHHHHHHHHhcCCCCCcEEEeCHHHHHHHHhhhhhhccccccccceeeeec Confidence 87767777777776554 5689999999999999998888887779999999999999887766554432222223443 Q ss_pred eeecCeEEEEchHHHhcCCeEEEeecceeeeccceEEEEEee-ccCccceeeeecccccccCCCCCcceEEEEecccCCC Q lcl|NC_017976. 218 VKFKGFLIEEVPQAKLGANAALVYIKGVGKAFTGITTARTIE-SEDFDGVAFQGAGKAGEFILDDNKPAVVKVTAPTVTP 296 (296) Q Consensus 218 ~~fKgf~l~e~p~~y~qg~~aifs~dnIg~af~GI~taRtie-SEDFdGVaLQgAgK~G~~IlddNKkAI~k~t~~~~~p 296 (296) -++.||.|-++|...|.+-..|..+.+--.+.+=+...|... +++=.|=+.||--=||-||++.-+++|+....++++- T Consensus 219 g~idG~~Vi~vps~~~k~in~i~~h~~A~~~~~k~~~~~~~~p~~~~~a~~v~gr~y~d~~V~~~k~~~Iy~~~~~~~~~ 298 (319) T protein:vir:97 219 GELDGFVIVKVPTKLLQGLQAIAVVGEVLASPIQADLAKTNSNIPGMFGTLAEQLLYTGAFVPEHLQKYIFTIGGTEVAT 298 (319) T ss_pred eeecCeEEEEecccccccceEEEEcCCeeeeeeeeeeeeccCCCccccceeeeeeeeeeeEEeccccceEEEeecCCccc Confidence 468999999999999987776666666666667777788765 6776689999999999999999999998754443333 No 8 >protein:vir:78090 Length: 302 # NCBI annotation: Cps # Family: family:all:701 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468790;genbank:gi:157325371;genbank:GeneID:5601852 Probab=98.87 E-value=4.6e-10 Score=71.76 Aligned_cols=265 Identities=14% Similarity=0.206 Sum_probs=174.6 Q ss_pred CCCCcccceeeeechhHHHHHHHHHhhhhhhhhhcccceeeecCCc-ccceeEEEeecccceE------ecccccCcccc Q lcl|NC_017976. 1 MGTKNQQLAAKTYQKQFKEMLQAVFSHQAYFADFFGGGIEALDGVQ-NNETAFYVKTSDLPVV------VGTGYNTDANV 73 (296) Q Consensus 1 m~t~Nnn~a~r~Y~kq~~~ll~~vf~~qa~F~~~fgg~lQ~lDGVq-nN~tafsvKtnd~pVv------vg~~Y~td~Nv 73 (296) ||. . + -|.++|.+.|..+|...+.+...-+..=+ |+ ++++ .||+..+-|- .+. |+-+ . 
T Consensus 1 Man---t--l-~ya~~~~~~Ld~~~~~~~~t~~l~~~~~~----v~~~Gak--~vkIp~is~~~~~TsGl~d-y~R~--~ 65 (302) T protein:vir:78 1 MAN---S--L-ALAQIYQDNIDKAIAVNSKSAFLEANPNN----VQYNGGN--TIKIADISFGSGTTGDLKA-YNRS--T 65 (302) T ss_pred CCc---h--h-HHHHHHHHHHHHHHHhhhceeecccCCce----EEEecCc--EEEEEEEEeeccccccccc-cccc--c Confidence 761 1 2 57899999999999988877654343111 11 1222 3555555542 223 6653 3 Q ss_pred eeccccCCcccccceeEEEEeccccccccCchhhhccccccccC-CHhHHHHHHHhhHHHH-HHHHHHHH-HhHHHhhhh Q lcl|NC_017976. 74 GFGTGTSKSSRFGDRQEIIYIDTPVPYTWGYNWHEGIDRYTVNN-SMESAMADRVELQAQA-KTMLFDKK-HAEFIVANA 150 (296) Q Consensus 74 aFGtGTg~s~RFG~rkEIiy~DtdVpY~~~~aiHEGiDr~TVNn-dl~aavAdRl~Lqa~A-k~~~~n~~-~gk~ls~~A 150 (296) +|-.|+ - +.-|+.....+|-.|.|. ||++-|+. .....+|.=++.|..- .+=.+|.. +.|-.+... T Consensus 66 g~~~g~--------v-~~~~et~tlt~DR~~~f~--vD~mDvdETn~~~~~ani~~ef~r~~vvPEiDayrfskla~~a~ 134 (302) T protein:vir:78 66 GFTQGS--------V-TLAWSDYTLDYDLAQSFQ--IDAMDVDETKNLATVGNVLSEYQRTKIVPAIDKYRFTKLANDGT 134 (302) T ss_pred Cccccc--------e-eeeeeeEEeeeccceeee--ccccchhhhhhhhHHHHHHHHHHHhhhcchhhHHHHHHHHHhhh Confidence 444333 2 356666777788888876 89988877 3344456655554333 33344544 333222221 Q ss_pred c----chhhhhhcchhHHHHHHHHhhhhhcceeeeeeEEEEECchhhhhhhccccccccccce----eeeccCcceeecC Q lcl|NC_017976. 151 G----KTEALTAYDEAAVLKLFNNLSAYYINIEAIGTKVAKVGPELYNIIVDHRLTTKEKASG----ANVDSNEIVKFKG 222 (296) Q Consensus 151 ~----~t~~l~~~~~~~V~klFn~~~~~yvn~ev~~~~~ayV~~evYNaIvD~~l~Ts~K~Ss----~NiD~ngi~~fKg 222 (296) + ...+...+|.++|.+-|-.+.++..+.+ +.++||+|.+|.+|-+....+...+.. 
-.||.+ +-.+.| T Consensus 135 ~~~~~~~~~~~~~t~~nvl~~i~~~~~~~~e~~---~~vl~vtp~~~~~Lk~a~~~~~~~~~~~~~~~~i~~~-V~~lDg 210 (302) T protein:vir:78 135 GVGGVIDLSKPDASAQALMGDIATAMELVDDSN---QLILVTSPTTLAGLLNTALIRESKNTQVLRRGEVDTK-ITFIQD 210 (302) T ss_pred ccCccccccccchhHHHHHHHHHHHHHHhhccC---CeEEEEChHHHHHHhcchhhccceeccccccccccce-eeeecc Confidence 1 1112235788999999999988888864 999999999999999887665443332 125444 888999 Q ss_pred eEEEEchHHHhcCC----------------eEEEeecceeeeccceEEEEEeeccC---ccceeeeecccccccCCCCCc Q lcl|NC_017976. 223 FLIEEVPQAKLGAN----------------AALVYIKGVGKAFTGITTARTIESED---FDGVAFQGAGKAGEFILDDNK 283 (296) Q Consensus 223 f~l~e~p~~y~qg~----------------~aifs~dnIg~af~GI~taRtieSED---FdGVaLQgAgK~G~~IlddNK 283 (296) +.|.+||+.+|+++ ..|..|...-++.+=.+..|.++.+- =||=..|+--=+.-||+|.=| T Consensus 211 v~Ii~VPs~r~~t~~~f~~G~~~~~~ak~INfiiv~~~a~ia~~K~~~~~if~P~~~~~gd~~l~~~R~Y~D~fV~~nk~ 290 (302) T protein:vir:78 211 VEVLQVPSEYLYDKVAPKVGVPDYTGAKKIPYMIFKRDAPTGIVKTDKVRVFEPDTNQSADAYKVDLRLYHDLIVPKNQR 290 (302) T ss_pred cEEEEchhhhcccceeccCCccccCCccceeEEEECCCeeeeeeeeeeeEeeCCCCCCCcceeeeeeeeEeeeeeecccc Confidence 99999999999864 36666777788888888888886532 223466777677889999888 Q ss_pred ceEEEEecccCC Q lcl|NC_017976. 284 PAVVKVTAPTVT 295 (296) Q Consensus 284 kAI~k~t~~~~~ 295 (296) ++|.-...++.+ T Consensus 291 ~gI~~~~~~~~~ 302 (302) T protein:vir:78 291 PGIIKASFGTIA 302 (302) T ss_pred CeEEEeeccccC Confidence 999988888888 No 9 >protein:vir:107120 Length: 329 # NCBI annotation: conserved phage protein # Family: family:all:701 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950606;genbank:gi:119953686;genbank:GeneID:4643129 Probab=98.85 E-value=3.1e-10 Score=72.71 Aligned_cols=273 Identities=15% Similarity=0.127 Sum_probs=169.9 Q ss_pred CCCC-----------cccce-------eeeechhHHHHHHHHHhhhhhhhhhcc-cceeeecCCcccceeEEEeecccce Q lcl|NC_017976. 
1 MGTK-----------NQQLA-------AKTYQKQFKEMLQAVFSHQAYFADFFG-GGIEALDGVQNNETAFYVKTSDLPV 61 (296) Q Consensus 1 m~t~-----------Nnn~a-------~r~Y~kq~~~ll~~vf~~qa~F~~~fg-g~lQ~lDGVqnN~tafsvKtnd~pV 61 (296) |--+ =|..| .=.|.++|..+|..+|..+++=.+... ...+ +. + +=+||++.+.+ T Consensus 12 ~~~~~~~~~~~~~~~~~~~~~~~~~~nt~~l~~k~~~~LD~~~~~~~~s~~~~~N~~~e---~~--~--g~tVkIp~i~~ 84 (329) T protein:vir:10 12 MNKEIKNATGKLKLNLQHFANKSVEPGDTLLKNKHVGILEKVTAANSYSAPAVISNDAI---FM--Q--GRSFTVIKGDV 84 (329) T ss_pred hhhhhhcccceeEEehhhhcCCccCCchhHHHHHHHHHHHHHHHhhceeeeeeccccee---ec--c--CcEEEEeeecc Confidence 1100 01111 114789999999999987764333211 1122 11 1 22567766655 Q ss_pred E-ecccccCcccceeccccCCcccccceeEEEEeccccccccCchhh-hccccccccCCHhH--HHHHHHhhHHHHHHHH Q lcl|NC_017976. 62 V-VGTGYNTDANVGFGTGTSKSSRFGDRQEIIYIDTPVPYTWGYNWH-EGIDRYTVNNSMES--AMADRVELQAQAKTML 137 (296) Q Consensus 62 v-vg~~Y~td~NvaFGtGTg~s~RFG~rkEIiy~DtdVpY~~~~aiH-EGiDr~TVNndl~a--avAdRl~Lqa~Ak~~~ 137 (296) + ++. |+.+. +|..|+-+ .-|+.-..-++-.|+|- ..+|.-..|.++.+ ++++. |....+.. T Consensus 85 ~gl~D-Y~R~~--g~~~g~vt---------~~~~t~tidqdR~~~F~VD~~D~dEtn~~l~a~~i~~~~---~~~~v~pE 149 (329) T protein:vir:10 85 TELKD-YKRNA--TNEFDHPQ---------IQETTYFLDQEKYWGRFVDALDRRDTEGNIDINYVVAKQ---ASEVVAPY 149 (329) T ss_pred ccccc-ccCCC--Cccccccc---------cceeEEEeecccceeeecchhhHhhhhhhhhHHHHHHHH---HHHHhhhH Confidence 2 333 65543 34333322 12333334445555543 34555555555532 23332 34444556 Q ss_pred HHHHHhHHHhhhhcchhhhhhcchhHHHHHHHHhhhhhcceeeeeeEEEEECchhhhhhhccccccccccceeeeccCcc Q lcl|NC_017976. 138 FDKKHAEFIVANAGKTEALTAYDEAAVLKLFNNLSAYYINIEAIGTKVAKVGPELYNIIVDHRLTTKEKASGANVDSNEI 217 (296) Q Consensus 138 ~n~~~gk~ls~~A~~t~~l~~~~~~~V~klFn~~~~~yvn~ev~~~~~ayV~~evYNaIvD~~l~Ts~K~Ss~NiD~ngi 217 (296) ++...-..|...|+...+ ..++.+++...+-.+..+--..++...+.+||+|++|.+|-+.+..+....-..++--||. 
T Consensus 150 iDay~~skla~~a~~~~~-~~~t~~nay~~i~~a~~~Lde~~vp~~Rvl~VtP~~~~~Lk~~~~f~~~~~~~~~~~~~g~ 228 (329) T protein:vir:10 150 LDNLRFATLARNKAKHLT-VGSGADAQYDAVLDVSVELDEIGAGASRILFVTPKFYKGIKKFVIELPQGDNRQQVLGKGV 228 (329) T ss_pred HHHHHHHHHHhhcccccc-cccCHHHHHHHHHHHHHHHHhcCCCCCcEEEeCHHHHHHHHhhhhhhccccccccceeeee Confidence 777766667777766544 5688999999999999888777777789999999999999988776543222222223454 Q ss_pred e-eecCeEEEEchHHHhcCCeEEEeecceeeeccceEEEEEee-ccCccceeeeecccccccCCCCCcceEEEEecc-cC Q lcl|NC_017976. 218 V-KFKGFLIEEVPQAKLGANAALVYIKGVGKAFTGITTARTIE-SEDFDGVAFQGAGKAGEFILDDNKPAVVKVTAP-TV 294 (296) Q Consensus 218 ~-~fKgf~l~e~p~~y~qg~~aifs~dnIg~af~GI~taRtie-SEDFdGVaLQgAgK~G~~IlddNKkAI~k~t~~-~~ 294 (296) + +..||.|-++|...|.+-..|..+.+--.+.+=++..|... +++=+|=+.||--=||-||++.-+++|+....+ ++ T Consensus 229 Vg~idG~~Ii~vps~~~k~in~ii~~~~A~~~~~K~~~~~~~~p~~~~~a~~v~gr~yyd~~V~~~k~~~I~~~~~~a~~ 308 (329) T protein:vir:10 229 QGELDGFTIVKVPSKMLQGVEAMAVIGEVMASPIQANEAKLNSNVPGMFGTLAEQMLYTGAFVPEHLQKYIFTIGGKEVE 308 (329) T ss_pred eeeecCeEEEEecCCcccceeEEEEcCCceeeeeeeeeeeeeCCCCccchheeeeeeeeeeEEEccccCEEEEecccCcc Confidence 4 68999999999999987655555555555666677777765 677789999999999999999988998774433 33 Q ss_pred CC Q lcl|NC_017976. 295 TP 296 (296) Q Consensus 295 ~p 296 (296) +. T Consensus 309 ~~ 310 (329) T protein:vir:10 309 TN 310 (329) T ss_pred cC Confidence 32 No 10 >protein:vir:105464 Length: 346 # NCBI annotation: putative phage major capsid protein # Family: family:all:701 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529874;genbank:gi:90592614;genbank:GeneID:3974528 Probab=98.77 E-value=2.2e-09 Score=68.08 Aligned_cols=271 Identities=11% Similarity=0.131 Sum_probs=158.7 Q ss_pred ceeeeechhHHHHHHHHHhhhhhhhhhcccceeeecCCcccceeEEEeecccceEecc-cccCcccceecc-ccCCcccc Q lcl|NC_017976. 
8 LAAKTYQKQFKEMLQAVFSHQAYFADFFGGGIEALDGVQNNETAFYVKTSDLPVVVGT-GYNTDANVGFGT-GTSKSSRF 85 (296) Q Consensus 8 ~a~r~Y~kq~~~ll~~vf~~qa~F~~~fgg~lQ~lDGVqnN~tafsvKtnd~pVvvg~-~Y~td~NvaFGt-GTg~s~RF 85 (296) ||+ -|.++|...|...|...+.......+..-. .-|+-| -+=+||+..+-|..|= .|+. +.+|+. |+- T Consensus 1 Mai-nya~~~~~~Ld~~~~~~~lts~~l~~~~~~-~~v~~~-ggktVkIp~is~tsGl~DY~R--~~g~~~~g~v----- 70 (346) T protein:vir:10 1 MTI-NYAEKYQAAVQQAFYDGHLYSAELWNSPSN-SIIKFD-GAKHIKVPRLEITSGRKDRQR--RTITTPVANY----- 70 (346) T ss_pred Ccc-hhHHHHHHHHHHHHHhhhccchhhcccccc-cceEec-CCCEEEEEEeeeecccccccc--cCCccccccc----- Confidence 665 457889999999997765443332220100 001111 1336777777654331 1553 223331 221 Q ss_pred cceeEEEEeccccccccCchhhhccccccccC-CHhHHHHHHHhhHHHHH-HHHHHHHHhHHHhhhh-----cchhhhhh Q lcl|NC_017976. 86 GDRQEIIYIDTPVPYTWGYNWHEGIDRYTVNN-SMESAMADRVELQAQAK-TMLFDKKHAEFIVANA-----GKTEALTA 158 (296) Q Consensus 86 G~rkEIiy~DtdVpY~~~~aiHEGiDr~TVNn-dl~aavAdRl~Lqa~Ak-~~~~n~~~gk~ls~~A-----~~t~~l~~ 158 (296) ..-|+.-...+|-.|.|. ||.+-|+. .....+|.=++.+..-+ +-.+++..=..|...| +...+ .. T Consensus 71 ----~~~~et~tl~qDR~~~F~--vD~mDvDETn~~~~~anv~~ef~r~~vvPEiDayrfskLa~~a~~~~~~~~~~-~a 143 (346) T protein:vir:10 71 ----SNDWDSYELKNERYWSTL--VDPSDIDETNMVVSLANITKQFNLDSKMPEKDRYMFSHLYSGKEAAHDGGITT-NT 143 (346) T ss_pred ----ccceeEEEeeccccceec--ccccchHHHHHHhHHHHHHHHHHHHhhcchhhHHHHHHHHHhhhhhccccccc-cc Confidence 233555556677777664 88887775 33444555554433333 2345654333333222 11111 25 Q ss_pred cchhHHHHHHHHhhhhhcceee-eeeEEEEECchhhhhhhccccccccc--cceeeeccCcceeecCeEEEEchHHHhc- Q lcl|NC_017976. 
159 YDEAAVLKLFNNLSAYYINIEA-IGTKVAKVGPELYNIIVDHRLTTKEK--ASGANVDSNEIVKFKGFLIEEVPQAKLG- 234 (296) Q Consensus 159 ~~~~~V~klFn~~~~~yvn~ev-~~~~~ayV~~evYNaIvD~~l~Ts~K--~Ss~NiD~ngi~~fKgf~l~e~p~~y~q- 234 (296) +|.+++.+.+-.+..+--+.+| ..+.++||+|++|..|=..+.-+..- ++.-+|+ --+-++-||.|.|||+..|+ T Consensus 144 ~T~~ni~~~i~~~~~~lde~~vp~~~rvl~vTp~~~~lLk~s~~f~k~~~v~~~~~i~-~~V~siDGv~Ii~VPs~r~~t 222 (346) T protein:vir:10 144 LDEKNILPAFDNMMLDFDEARIPSTNRILYVTPKTNAILKRAEAMNRALTLKDPNNIQ-RTVYSLDDVTIRVVPSDLMQT 222 (346) T ss_pred cCHHHHHHHHHHHHHHHHHccCCCCCeEEEECHHHHHHHhhchhheeccccccccccc-eeeeeecCeEEEEcchhhccc Confidence 7899999999999999999999 57799999999999887665433111 1222342 22567899999999999996 Q ss_pred ------CC---------eEEEeecceeeeccceEEEEEeecc-Cccce-eeeecccccccCCCCCcceEEEEecccC-CC Q lcl|NC_017976. 235 ------AN---------AALVYIKGVGKAFTGITTARTIESE-DFDGV-AFQGAGKAGEFILDDNKPAVVKVTAPTV-TP 296 (296) Q Consensus 235 ------g~---------~aifs~dnIg~af~GI~taRtieSE-DFdGV-aLQgAgK~G~~IlddNKkAI~k~t~~~~-~p 296 (296) |- ..|..|...-++.+=.+..|..+.. .-.|- ..|+--=+.-||+|.=+++|.-....+| ++ T Consensus 223 ~~~f~~G~~~~t~ak~INfiiv~~~A~ia~~K~~~~~if~P~~~~~g~~l~~~R~Y~D~fv~~nk~~~Iyv~~~~a~~~~ 302 (346) T protein:vir:10 223 AYDFSDGSKIIDTAKQIEMFLIYNGVQIAPEKYSFVGFDQPSAATSGNYLYYEQSYDDVLLLNTKTKGIQFVVSDKPKKD 302 (346) T ss_pred chhhccCccccCCccceeEEEECCceeeeeeeeeeeEeeCCCCCcccceeeeeeeeeeeeeeccccceEEEeeecccccC Confidence 21 2455566777778888888887653 22332 3444444678899777777754443333 33 No 11 >protein:vir:102605 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655002;genbank:gi:109392192;genbank:GeneID:4157227 Probab=98.69 E-value=2.7e-09 Score=67.55 Aligned_cols=264 Identities=15% Similarity=0.135 Sum_probs=157.3 Q ss_pred ceeeeec-hhHHHHHHHHHhhhhhhhhhcccceeeecCCcccceeEEEeecccceEecccccCcccceeccc-cCCcccc Q lcl|NC_017976. 
8 LAAKTYQ-KQFKEMLQAVFSHQAYFADFFGGGIEALDGVQNNETAFYVKTSDLPVVVGTGYNTDANVGFGTG-TSKSSRF 85 (296) Q Consensus 8 ~a~r~Y~-kq~~~ll~~vf~~qa~F~~~fgg~lQ~lDGVqnN~tafsvKtnd~pVvvg~~Y~td~NvaFGtG-Tg~s~RF 85 (296) ||+-.|. ++|.+.+...|++++.|.+..-- -...+|. +.+|.---|... +-+.+ |.. -|+. +...-.- T Consensus 1 MA~~~~~pe~~~~~v~~~~~~~lv~~~l~~~-~~~~~~~-~Gdtv~ip~~~~--~~~~d-~~~-----~~~~~~~~~~~~ 70 (273) T protein:vir:10 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNR-EYEGTAS-KGNVVHIAGVVA--PTVKD-YKA-----AGRQTSADAISD 70 (273) T ss_pred CcchhhhHHHHHHHHHHHHHhhhccchhhcc-ccccccc-cCceEEEeeccc--ccccc-ccc-----CCCccCcccccc Confidence 7777774 67999999999999999876432 1112232 222221112221 11222 321 1111 1111111 Q ss_pred cceeEEEEeccccccccCchhhhccccccccCCHhHHHHHHHhhHHHHHHHHHHHHHhHHHhhhhcchhhhhhcchhHHH Q lcl|NC_017976. 86 GDRQEIIYIDTPVPYTWGYNWHEGIDRYTVNNSMESAMADRVELQAQAKTMLFDKKHAEFIVANAGKTEALTAYDEAAVL 165 (296) Q Consensus 86 G~rkEIiy~DtdVpY~~~~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~~~gk~ls~~A~~t~~l~~~~~~~V~ 165 (296) ++..=.|-.+..+++..+ .+|+.....|+.+ +.+ -|++|-.+.++..+...+...+.+...-..++.+++. T Consensus 71 ~~~~~tid~~~~~~~~i~-----d~d~~~~~~~~~~-~~~---~~~~alA~~vD~~i~~~~~~a~~~~~~~~~~~~~~~~ 141 (273) T protein:vir:10 71 TGVDLLIDQEKSIDFLVD-----DIDRVQVAGSLEA-YTR---AGATALATDTDKFIADMLVDNGTALTGSAPTDADDAF 141 (273) T ss_pred ceEEEEEeeeeecceEee-----cHHHhhhhccHHH-HHH---HHHHHHHHHHHHHHHHHHhccccccccccccchhHHH Confidence 222212223333443322 6778888888754 444 4677778899988887777665554333456666666 Q ss_pred HHHHHhhhhhcceee-eeeEEEEECchhhhhhhcccc-ccc-cc-cceeeeccCcceeecCeEEEE---chHHHhcCCeE Q lcl|NC_017976. 166 KLFNNLSAYYINIEA-IGTKVAKVGPELYNIIVDHRL-TTK-EK-ASGANVDSNEIVKFKGFLIEE---VPQAKLGANAA 238 (296) Q Consensus 166 klFn~~~~~yvn~ev-~~~~~ayV~~evYNaIvD~~l-~Ts-~K-~Ss~NiD~ngi~~fKgf~l~e---~p~~y~qg~~a 238 (296) ..|-.+....-+..| .....++|+|++|..|.-.+. .+. 
.+ ++...+=+--|-++-||.+-+ +|..-- ...+ T Consensus 142 ~~i~~a~~~ld~~~vP~~~R~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~i~G~~v~~s~~lp~~~~-~~~~ 220 (273) T protein:vir:10 142 DLIAKALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDD-EQFV 220 (273) T ss_pred HHHHHHHHHhhhcCCCcCCCEEEECHHHHHHHhcchhhhhhhhccccccceeeeeeeEEeceEEEEecccccCCc-cEEE Confidence 667777666655555 345789999999999986552 332 22 222222233356899999998 564210 2246 Q ss_pred EEeecceeeeccceEEEEEeeccCccceeeeecccccccCCCCCcceEEEEecc Q lcl|NC_017976. 239 LVYIKGVGKAFTGITTARTIESEDFDGVAFQGAGKAGEFILDDNKPAVVKVTAP 292 (296) Q Consensus 239 ifs~dnIg~af~GI~taRtieSEDFdGVaLQgAgK~G~~IlddNKkAI~k~t~~ 292 (296) .|.+..++.+ ..|........++--|-.+.|---||..+++.-+.++++++-+ T Consensus 221 ~~~~~A~~~a-~q~~~~e~~r~~~~~~~~v~~~~~yg~~v~~~~~~~~l~~~g~ 273 (273) T protein:vir:10 221 AFHPSAAAYV-SQIDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred EEeccceeee-eeeehhhcccCCCcceeeeeeeeeeeeeEeccceEEEEeccCC Confidence 7788887765 4565555555555558889988889999999888888887766 No 12 >protein:vir:105822 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655767;genbank:gi:109522090;genbank:GeneID:4157630 Probab=98.69 E-value=2.7e-09 Score=67.55 Aligned_cols=264 Identities=15% Similarity=0.135 Sum_probs=157.3 Q ss_pred ceeeeec-hhHHHHHHHHHhhhhhhhhhcccceeeecCCcccceeEEEeecccceEecccccCcccceeccc-cCCcccc Q lcl|NC_017976. 8 LAAKTYQ-KQFKEMLQAVFSHQAYFADFFGGGIEALDGVQNNETAFYVKTSDLPVVVGTGYNTDANVGFGTG-TSKSSRF 85 (296) Q Consensus 8 ~a~r~Y~-kq~~~ll~~vf~~qa~F~~~fgg~lQ~lDGVqnN~tafsvKtnd~pVvvg~~Y~td~NvaFGtG-Tg~s~RF 85 (296) ||+-.|. ++|.+.+...|++++.|.+..-- -...+|. +.+|.---|... +-+.+ |.. -|+. 
+...-.- T Consensus 1 MA~~~~~pe~~~~~v~~~~~~~lv~~~l~~~-~~~~~~~-~Gdtv~ip~~~~--~~~~d-~~~-----~~~~~~~~~~~~ 70 (273) T protein:vir:10 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNR-EYEGTAS-KGNVVHIAGVVA--PTVKD-YKA-----AGRQTSADAISD 70 (273) T ss_pred CcchhhhHHHHHHHHHHHHHhhhccchhhcc-ccccccc-cCceEEEeeccc--ccccc-ccc-----CCCccCcccccc Confidence 7777774 67999999999999999876432 1112232 222221112221 11222 321 1111 1111111 Q ss_pred cceeEEEEeccccccccCchhhhccccccccCCHhHHHHHHHhhHHHHHHHHHHHHHhHHHhhhhcchhhhhhcchhHHH Q lcl|NC_017976. 86 GDRQEIIYIDTPVPYTWGYNWHEGIDRYTVNNSMESAMADRVELQAQAKTMLFDKKHAEFIVANAGKTEALTAYDEAAVL 165 (296) Q Consensus 86 G~rkEIiy~DtdVpY~~~~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~~~gk~ls~~A~~t~~l~~~~~~~V~ 165 (296) ++..=.|-.+..+++..+ .+|+.....|+.+ +.+ -|++|-.+.++..+...+...+.+...-..++.+++. T Consensus 71 ~~~~~tid~~~~~~~~i~-----d~d~~~~~~~~~~-~~~---~~~~alA~~vD~~i~~~~~~a~~~~~~~~~~~~~~~~ 141 (273) T protein:vir:10 71 TGVDLLIDQEKSIDFLVD-----DIDRVQVAGSLEA-YTR---AGATALATDTDKFIADMLVDNGTALTGSAPTDADDAF 141 (273) T ss_pred ceEEEEEeeeeecceEee-----cHHHhhhhccHHH-HHH---HHHHHHHHHHHHHHHHHHhccccccccccccchhHHH Confidence 222212223333443322 6778888888754 444 4677778899988887777665554333456666666 Q ss_pred HHHHHhhhhhcceee-eeeEEEEECchhhhhhhcccc-ccc-cc-cceeeeccCcceeecCeEEEE---chHHHhcCCeE Q lcl|NC_017976. 166 KLFNNLSAYYINIEA-IGTKVAKVGPELYNIIVDHRL-TTK-EK-ASGANVDSNEIVKFKGFLIEE---VPQAKLGANAA 238 (296) Q Consensus 166 klFn~~~~~yvn~ev-~~~~~ayV~~evYNaIvD~~l-~Ts-~K-~Ss~NiD~ngi~~fKgf~l~e---~p~~y~qg~~a 238 (296) ..|-.+....-+..| .....++|+|++|..|.-.+. .+. 
.+ ++...+=+--|-++-||.+-+ +|..-- ...+ T Consensus 142 ~~i~~a~~~ld~~~vP~~~R~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~i~G~~v~~s~~lp~~~~-~~~~ 220 (273) T protein:vir:10 142 DLIAKALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDD-EQFV 220 (273) T ss_pred HHHHHHHHHhhhcCCCcCCCEEEECHHHHHHHhcchhhhhhhhccccccceeeeeeeEEeceEEEEecccccCCc-cEEE Confidence 667777666655555 345789999999999986552 332 22 222222233356899999998 564210 2246 Q ss_pred EEeecceeeeccceEEEEEeeccCccceeeeecccccccCCCCCcceEEEEecc Q lcl|NC_017976. 239 LVYIKGVGKAFTGITTARTIESEDFDGVAFQGAGKAGEFILDDNKPAVVKVTAP 292 (296) Q Consensus 239 ifs~dnIg~af~GI~taRtieSEDFdGVaLQgAgK~G~~IlddNKkAI~k~t~~ 292 (296) .|.+..++.+ ..|........++--|-.+.|---||..+++.-+.++++++-+ T Consensus 221 ~~~~~A~~~a-~q~~~~e~~r~~~~~~~~v~~~~~yg~~v~~~~~~~~l~~~g~ 273 (273) T protein:vir:10 221 AFHPSAAAYV-SQIDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred EEeccceeee-eeeehhhcccCCCcceeeeeeeeeeeeeEeccceEEEEeccCC Confidence 7788887765 4565555555555558889988889999999888888887766 No 13 >protein:vir:79008 Length: 299 # NCBI annotation: putative main capsid protein # Family: family:all:701 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110725;genbank:gi:134287342;genbank:GeneID:4955182 Probab=98.63 E-value=7e-09 Score=65.29 Aligned_cols=271 Identities=14% Similarity=0.155 Sum_probs=156.5 Q ss_pred CCCCcccceeeeechhHHHHHHHHHhhhhhhhhhcccceeeecCCc-ccceeEEEeecccceE-ecccccCcccceeccc Q lcl|NC_017976. 1 MGTKNQQLAAKTYQKQFKEMLQAVFSHQAYFADFFGGGIEALDGVQ-NNETAFYVKTSDLPVV-VGTGYNTDANVGFGTG 78 (296) Q Consensus 1 m~t~Nnn~a~r~Y~kq~~~ll~~vf~~qa~F~~~fgg~lQ~lDGVq-nN~tafsvKtnd~pVv-vg~~Y~td~NvaFGtG 78 (296) ||+-| |.++|.+.|...|.+.++|....+..- -.-|+ ++ +=+||++.+.++ ++. 
|+.+ +.+|-.| T Consensus 1 MA~~n-------~a~~~~~~Ld~~~~~~l~~~~L~~~~~--~~~v~~~g--g~tVkI~~i~~~gl~D-Y~R~-~~g~~~g 67 (299) T protein:vir:79 1 MAALN-------YAKEYSNVLAQAYPYTLNFGDLYATPN--NGRYRWTG--SKTIEIPTISTTGRVD-SNRD-TIAVAQR 67 (299) T ss_pred Cccch-------hHHHHHHHHHHHHHhhceeeeeccCcc--cceeeecC--CCEEEEeccccccccc-cccC-CCccccc Confidence 66422 678999999999999998865444300 01111 12 225676666542 333 6542 2244333 Q ss_pred cCCcccccceeEEEEeccccccccCchhhhccccccccCC-HhHHHHHHHhh-HHHHHHHHHHHHHhHHHhhhh---cch Q lcl|NC_017976. 79 TSKSSRFGDRQEIIYIDTPVPYTWGYNWHEGIDRYTVNNS-MESAMADRVEL-QAQAKTMLFDKKHAEFIVANA---GKT 153 (296) Q Consensus 79 Tg~s~RFG~rkEIiy~DtdVpY~~~~aiHEGiDr~TVNnd-l~aavAdRl~L-qa~Ak~~~~n~~~gk~ls~~A---~~t 153 (296) +- ..-|+.-..-++-.|+|. ||.+-|+.- ....+|.-++. |....+..+|+.....|.+.| +.. T Consensus 68 ~~---------~~~~~t~~ldqdr~~~f~--vD~~Dvdet~~~~~~a~v~~~~~~~~v~pEiDay~~skl~~~a~~~g~~ 136 (299) T protein:vir:79 68 NY---------DNAWEPKVLTNQRKWSTL--VHPADINQTNYVASIGNITKVYNEEQKFPEMDAYCISKIYADWTALGNT 136 (299) T ss_pred cc---------CcceeEEEeeccccceec--cchhhHHHHhhhhHHHHHHHHHHHHHhhhHhhHHHHHHHHHhhhhcCCc Confidence 21 112333333444445443 565554431 11112222211 223334455766555554444 222 Q ss_pred hhhhhcchhHHHHHHHHhhhhhcceee-eeeEEEEECchhhhhhhccccccccccce-eeeccCcc-eeecCeEEEEchH Q lcl|NC_017976. 154 EALTAYDEAAVLKLFNNLSAYYINIEA-IGTKVAKVGPELYNIIVDHRLTTKEKASG-ANVDSNEI-VKFKGFLIEEVPQ 230 (296) Q Consensus 154 ~~l~~~~~~~V~klFn~~~~~yvn~ev-~~~~~ayV~~evYNaIvD~~l~Ts~K~Ss-~NiD~ngi-~~fKgf~l~e~p~ 230 (296) ..-..+|.+++.+.+-.+.++--+.+| ..++.+||+|++|.+|-.++.-+...... .++.-||. 
-++-||.|.|||+ T Consensus 137 ~~~~~~T~~n~y~~i~~~~~~lde~~vP~~~rvl~vtp~~~~~L~~~~~f~k~~~~~~~~~~~~g~Vg~idG~~Ii~Vps 216 (299) T protein:vir:79 137 ADTTVLTTTNVLEVFDKLMEKMTEARVPENGRILYVTPVVNTLIKNAKEIQRTVNIKDAGTSLNRQTTDIDTVKIIKVPS 216 (299) T ss_pred ccccccCHHHHHHHHHHHHHHHHhcCCCCCCeEEEeCHHHHHHHhhchhhhcccccccccceeeeeeeeecceEEEEech Confidence 222467899999999999999988898 56799999999999999887644333221 22233444 3689999999999 Q ss_pred HHhcCC----------------eEEEeecceeeeccceEEEEEeeccCc-cceeeeeccc-ccccCCCCCcceEEEEecc Q lcl|NC_017976. 231 AKLGAN----------------AALVYIKGVGKAFTGITTARTIESEDF-DGVAFQGAGK-AGEFILDDNKPAVVKVTAP 292 (296) Q Consensus 231 ~y~qg~----------------~aifs~dnIg~af~GI~taRtieSEDF-dGVaLQgAgK-~G~~IlddNKkAI~k~t~~ 292 (296) ..|.+. ..|..+...-++.+=++..|..+.+-. .|=+|..=-+ +.-||+|.=|++|.-...+ T Consensus 217 ~r~~t~~~~~~G~~~~~~ak~in~ii~~~~a~~~~~K~~~~~~~~P~~~~~~~~~~~~r~y~d~~v~~nk~~~i~~~~~~ 296 (299) T protein:vir:79 217 NLMKTAYDFTTGWKVGAGAKQIFMSLVHPSAIITPVSYQFSKLDEPTAVTEGKYFYFEESFEDVFILNKKADAIQFVVEG 296 (299) T ss_pred hhcCccceeccCccccCcccccceEEEcCCeeeeeEeeeeEEeecCCCCCccceeeeeeeeeeeeeeccccCeEEEEeee Confidence 999853 245556677778888888888765432 2223332233 4568887667777333332 Q ss_pred cCC Q lcl|NC_017976. 293 TVT 295 (296) Q Consensus 293 ~~~ 295 (296) +-+ T Consensus 297 a~~ 299 (299) T protein:vir:79 297 AGA 299 (299) T ss_pred cCC Confidence 223 No 14 >protein:vir:99523 Length: 311 # NCBI annotation: putative protein # Family: family:all:701 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958538;genbank:gi:41179320;genbank:GeneID:2717161 Probab=98.60 E-value=1.1e-08 Score=64.24 Aligned_cols=269 Identities=14% Similarity=0.121 Sum_probs=162.7 Q ss_pred CCCCcccceeeeechhHHHHHHHHHhhhhhhhhhcccceeeecCCc-ccceeEEEeecccceEecc-cccCcccceeccc Q lcl|NC_017976. 1 MGTKNQQLAAKTYQKQFKEMLQAVFSHQAYFADFFGGGIEALDGVQ-NNETAFYVKTSDLPVVVGT-GYNTDANVGFGTG 78 (296) Q Consensus 1 m~t~Nnn~a~r~Y~kq~~~ll~~vf~~qa~F~~~fgg~lQ~lDGVq-nN~tafsvKtnd~pVvvg~-~Y~td~NvaFGtG 78 (296) |+|.=|+||+. 
|.++|.+.|..+|...+.. +.|..-++.. ++++ +||+..+.+ .|= .|+. |.+|-.| T Consensus 1 ~~~~an~mAln-ya~~~~~~Ld~~~~~~~~t-----~~l~~~~~~~~~Gak--~VkIp~i~~-~gl~dY~R--~~g~~~g 69 (311) T protein:vir:99 1 MPTDAETRGFN-YVTKDGNLLDQKITAGLFT-----AALGTPEVDLVNGGR--SFTLKTIST-SGLKDHTR--GKGFNSG 69 (311) T ss_pred CCCcchhhHHH-HHHHHHHHHHHHHHhhhcc-----cceecCchheeecCC--EEEEEeeee-cccccccc--ccCcccc Confidence 99999999954 8999999999999887632 3243333322 2244 677777764 332 1655 3333322 Q ss_pred cCCcccccceeEEEEeccccccccCchhhhccccccccC-CHhHHHHHHHhhHHHHH-HHHHHHHHhHHHhhhhcc---- Q lcl|NC_017976. 79 TSKSSRFGDRQEIIYIDTPVPYTWGYNWHEGIDRYTVNN-SMESAMADRVELQAQAK-TMLFDKKHAEFIVANAGK---- 152 (296) Q Consensus 79 Tg~s~RFG~rkEIiy~DtdVpY~~~~aiHEGiDr~TVNn-dl~aavAdRl~Lqa~Ak-~~~~n~~~gk~ls~~A~~---- 152 (296) + . ..-|+-....+|-.|.|. ||++-|+. .....+|.=++.|...+ +=.+++..=..|...|+. T Consensus 70 ~--------v-~~~~et~tl~~DR~~~f~--vD~mDvdETn~~~~~ani~~~f~r~~vvPEiDayrfskla~~a~~~~~~ 138 (311) T protein:vir:99 70 T--------I-SDEKTIYTMGQDRDVEFY--LDRQDVDETDNELAMANISNVFITEHVQPELDSYRFSKIATSFDNLDGT 138 (311) T ss_pred c--------e-eeeeeEEEeeeccceeee--cchhchhhhhhhhHHHHHHHHHHHhhhcchhhHHHHHHHHhhhhccccc Confidence 2 1 345666677788888775 88888875 23333443333332222 223333322223222211 Q ss_pred ----------hhhhhhcchhHHHHHHHHhhhhhcceeeeeeEEEEECchhhhhhhcccccccccc----ceeeeccCcce Q lcl|NC_017976. 153 ----------TEALTAYDEAAVLKLFNNLSAYYINIEAIGTKVAKVGPELYNIIVDHRLTTKEKA----SGANVDSNEIV 218 (296) Q Consensus 153 ----------t~~l~~~~~~~V~klFn~~~~~yvn~ev~~~~~ayV~~evYNaIvD~~l~Ts~K~----Ss~NiD~ngi~ 218 (296) .+.-..+|+++|..-+-.+-.+--.. ...+.++||+|++|.+|=+.+.-+...+ +.-.||.. 
+- T Consensus 139 ~~~~~~~~~~~~~~~~lt~~nvl~~l~~~~~~~~~v-~~~~rvl~vTp~~~~lLk~~~~~~r~~~~~~~~~~~i~~~-V~ 216 (311) T protein:vir:99 139 DTEGTLLAKTHKTEETLDETNAYSQLKTGIGKVRKY-GTQNLVGYVSSEVMDALERSKEFTRNITNQNVGTTALESR-IT 216 (311) T ss_pred ccchhhhccccccccccCHHHHHHHHHHHHHHHHhc-CCCCeEEEEChHHHHHHhhchhhheeeecccccccccccc-cc Confidence 01113578888876665554433222 3456999999999998766543332111 22235554 78 Q ss_pred eecCeEEEEc-hHHHhc-------CC---------eEEEeecceeeeccceEEEEEee---ccCccceeeeecccccccC Q lcl|NC_017976. 219 KFKGFLIEEV-PQAKLG-------AN---------AALVYIKGVGKAFTGITTARTIE---SEDFDGVAFQGAGKAGEFI 278 (296) Q Consensus 219 ~fKgf~l~e~-p~~y~q-------g~---------~aifs~dnIg~af~GI~taRtie---SEDFdGVaLQgAgK~G~~I 278 (296) .+.|+.|.|+ |+..|+ |. .-|..|...-++.+=.+..|.++ ..+=||=..|+--=+.-|| T Consensus 217 ~lDgv~Ii~V~ps~r~~t~~~ft~G~~~~~~ak~INfiiv~~~a~i~~~K~~~v~~f~P~~~~~gd~~l~~~R~Y~D~fv 296 (311) T protein:vir:99 217 SIDGVQLIEVYESNRFMTKYDFTDGAKPTEDAKAINFLVVAKPAVISIVKENAVFLFAPGQHTDGDGYLYQNRLYHDLFI 296 (311) T ss_pred eecCeEEEEecCchhhcchhhhcCCccccCcccccceEEeCCCeeeeeeeeeeeeeeCCCCCCCcceeeeeeeeeeeeee Confidence 9999999999 999987 32 24555566667777777888885 4445688888888889999 Q ss_pred CCCCcceEEEEeccc Q lcl|NC_017976. 279 LDDNKPAVVKVTAPT 293 (296) Q Consensus 279 lddNKkAI~k~t~~~ 293 (296) +|.=|++|.-....+ T Consensus 297 ~~nk~~~Iyv~~k~A 311 (311) T protein:vir:99 297 KKHKRDGIFVSVKKA 311 (311) T ss_pred eccccCeEEEeeecC Confidence 987777774433333 No 15 >protein:vir:7990 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817344;genbank:gi:29565772;genbank:GeneID:1258978 Probab=98.54 E-value=1.1e-08 Score=64.29 Aligned_cols=264 Identities=16% Similarity=0.145 Sum_probs=154.2 Q ss_pred ceeeee-chhHHHHHHHHHhhhhhhhhhcccceeeecCCcccceeEEEeecccceEecccccCcccceecc-ccCCcccc Q lcl|NC_017976. 
8 LAAKTY-QKQFKEMLQAVFSHQAYFADFFGGGIEALDGVQNNETAFYVKTSDLPVVVGTGYNTDANVGFGT-GTSKSSRF 85 (296) Q Consensus 8 ~a~r~Y-~kq~~~ll~~vf~~qa~F~~~fgg~lQ~lDGVqnN~tafsvKtnd~pVvvg~~Y~td~NvaFGt-GTg~s~RF 85 (296) ||+-.+ .++|.+.+...|+++..|.+..--..+ ..|.+ .+ +|+..-.+.+--..|... |+ ++...-=. T Consensus 1 MA~~~~~pei~~~~v~~~~~~~lv~~~l~~~~~~-~~~~~-Gd---Tv~ip~~~~~~~~d~~~~-----~~~~~~~~~~~ 70 (273) T protein:vir:79 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYE-GIASK-GN---VVHIAGVVAPTVKDYKAA-----GRQTSADAISD 70 (273) T ss_pred CcchhhhHHHHHHHHHHHHHhhccchhhhhcccc-ccccC-Cc---EEEEeecCcccccccccC-----CCccCcccccc Confidence 555545 588999999999999998775422111 11211 22 233222221110113211 11 11111111 Q ss_pred cceeEEEEeccccccccCchhhhccccccccCCHhHHHHHHHhhHHHHHHHHHHHHHhHHHhhhhcchhhhhhcchhHHH Q lcl|NC_017976. 86 GDRQEIIYIDTPVPYTWGYNWHEGIDRYTVNNSMESAMADRVELQAQAKTMLFDKKHAEFIVANAGKTEALTAYDEAAVL 165 (296) Q Consensus 86 G~rkEIiy~DtdVpY~~~~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~~~gk~ls~~A~~t~~l~~~~~~~V~ 165 (296) ++..-.|-.+..+++..+ .+|+....-|+.+ +.+ -|++|-.+.++..+-..+...+++...-...+.+++. T Consensus 71 ~~~~~tid~~~~~~~~i~-----d~d~~~~~~~~~~-~~~---~~~~ala~~vD~~i~~~~~~a~~~~~~~~~~~~~~~~ 141 (273) T protein:vir:79 71 TGVDLLIDQEKSIDFLVD-----DIDRVQVAGSLEA-YTR---AGATALATDTDKFIADMLVDNGTALTGSAPSDADDAF 141 (273) T ss_pred ceEEEEEeeecccceeec-----cHHHHhhcccHHH-HHH---HHHHHHHHHHHHHHHHHHhhcccccccccccchhhHH Confidence 222222333444444432 6677777778764 444 3677778899998888887665554333455556666 Q ss_pred HHHHHhhhhhcceee-eeeEEEEECchhhhhhhcccc-ccccc-c-ceeeeccCcceeecCeEEEE---chHHHhcCCeE Q lcl|NC_017976. 166 KLFNNLSAYYINIEA-IGTKVAKVGPELYNIIVDHRL-TTKEK-A-SGANVDSNEIVKFKGFLIEE---VPQAKLGANAA 238 (296) Q Consensus 166 klFn~~~~~yvn~ev-~~~~~ayV~~evYNaIvD~~l-~Ts~K-~-Ss~NiD~ngi~~fKgf~l~e---~p~~y~qg~~a 238 (296) +.|-.+....=+..| .....++|+|+.|..|.-.+. .+.+. . 
+...+-+--+-++.||.|-+ +|..-- ...+ T Consensus 142 ~~i~~a~~~ld~~~vP~~~R~lvv~p~~~~~Ll~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~i~~s~~lp~~~~-~~~~ 220 (273) T protein:vir:79 142 DLIASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDD-EQFV 220 (273) T ss_pred HHHHHHHHHhhhccCCccCcEEEECHHHHHHHhhchhhhhhhhhcccccceeeeEeeEEeceEEEecccccccCc-eEEE Confidence 667777776655565 345789999999999975542 33222 2 22223333355899999988 464211 1246 Q ss_pred EEeecceeeeccceEEEEEeeccCccceeeeecccccccCCCCCcceEEEEecc Q lcl|NC_017976. 239 LVYIKGVGKAFTGITTARTIESEDFDGVAFQGAGKAGEFILDDNKPAVVKVTAP 292 (296) Q Consensus 239 ifs~dnIg~af~GI~taRtieSEDFdGVaLQgAgK~G~~IlddNKkAI~k~t~~ 292 (296) .|.+..++-+ ..|....+-..++--|-.+-|-=-||..+++..+.++++++-+ T Consensus 221 a~~~~A~~~a-~~~~~~e~~r~~~~~~~~v~~~~~yg~~v~~p~~vv~~~~~g~ 273 (273) T protein:vir:79 221 AFHPSAAAYV-SQIDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred EEeccceeee-eehhhhhcccCcccceeeeeeeeeeeeEEecCceEEEEeccCC Confidence 7788877653 4555444444455448888888889999999998888887776 No 16 >protein:vir:102335 Length: 312 # NCBI annotation: putative capsid protein # Family: family:all:701 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529560;genbank:gi:90592716;genbank:GeneID:3974467 Probab=98.38 E-value=7.2e-08 Score=59.74 Aligned_cols=271 Identities=13% Similarity=0.121 Sum_probs=163.4 Q ss_pred CCCCcccceeeeechhHHHHHHHHHhhhhhhhhhcccceeeecCCcccceeEEEeecccceE-ecccccCcccceecccc Q lcl|NC_017976. 1 MGTKNQQLAAKTYQKQFKEMLQAVFSHQAYFADFFGGGIEALDGVQNNETAFYVKTSDLPVV-VGTGYNTDANVGFGTGT 79 (296) Q Consensus 1 m~t~Nnn~a~r~Y~kq~~~ll~~vf~~qa~F~~~fgg~lQ~lDGVqnN~tafsvKtnd~pVv-vg~~Y~td~NvaFGtGT 79 (296) ||.+ + -|.++|.+.|..+|...+.+...=+.. +.|+-| -+=.||+..+.+. ++. 
|+.+...+|..|+ T Consensus 1 Mant-----l-~ya~~~~~~LD~~~~~~~~s~~l~~~~----~~v~~~-ggktVkIp~i~~~gl~D-Y~R~~g~~~~~g~ 68 (312) T protein:vir:10 1 MANT-----L-AYGQVLQQGLDKQATQELLTGWMDSNA----KQIKYE-GGKEVKIGKLSTDGLGD-YSRGSANAYVGGD 68 (312) T ss_pred CCcc-----h-hHHHHHHHHHHHHHHhhhccccccCCC----ceEEEe-cCcEEEEEeeecccccc-cccccCCcccccc Confidence 7721 2 477999999999999988765332221 112211 1224566555532 222 6655555565443 Q ss_pred CCcccccceeEEEEeccccccccCchhhhccccccccC-CHhHHHHHHHhh-HHHHHHHHHHHHHhHHHhhhhcc----- Q lcl|NC_017976. 80 SKSSRFGDRQEIIYIDTPVPYTWGYNWHEGIDRYTVNN-SMESAMADRVEL-QAQAKTMLFDKKHAEFIVANAGK----- 152 (296) Q Consensus 80 g~s~RFG~rkEIiy~DtdVpY~~~~aiHEGiDr~TVNn-dl~aavAdRl~L-qa~Ak~~~~n~~~gk~ls~~A~~----- 152 (296) =+ .-|+.....+|-.|.|. ||++-|+. .+...+|.=++. |....+-.+|+..=..|...|.. T Consensus 69 v~---------~~~et~tl~qDR~~~F~--vD~mDvDETn~~~s~anv~~ef~r~~vvPEiDayrfskla~~a~~~~~~~ 137 (312) T protein:vir:10 69 VK---------FEYETKTMTQDRGRKFT--LDAMDVDETNFLVTATTVMGEFQRLKVIPEIDAYRLSRLATIAIGIKGDT 137 (312) T ss_pred cc---------ccceeEEeeecccceee--ccccchhhHhhHHHHHHHHHHHHHhhhcchhhHHHHHHHHhhhhcccccc Confidence 21 13344445666677765 88888775 344445655555 34444456666522233322221 Q ss_pred -hhhhhhcchhHHHHHHHHhhhhhcceeeeeeEEEEECchhhhhhhcccc--ccccccceeeeccCcceeecCeEEEEch Q lcl|NC_017976. 153 -TEALTAYDEAAVLKLFNNLSAYYINIEAIGTKVAKVGPELYNIIVDHRL--TTKEKASGANVDSNEIVKFKGFLIEEVP 229 (296) Q Consensus 153 -t~~l~~~~~~~V~klFn~~~~~yvn~ev~~~~~ayV~~evYNaIvD~~l--~Ts~K~Ss~NiD~ngi~~fKgf~l~e~p 229 (296) .+.-..+|.++|.+.+-.+.++--+.++..+.++||+|++|.+|=+... 
.++...+..+|| --+-++.|+.|.||| T Consensus 138 ~~~~~~~~T~~ni~~~i~~~~~~lde~~vp~~rvl~vTp~~~~lLk~~~~~~~~~~~~~~~~i~-~~V~~iDgv~Ii~VP 216 (312) T protein:vir:10 138 NVEYSYSVNSSTIINKIKTGIKIIRENGYNGPLVCHLTYDSMFAIEEKVLEKLTAVTFAQGGIQ-TQVPSIDGCALIKTP 216 (312) T ss_pred ccccccccCHHHHHHHHHHHHHHHHHccCCCceEEEeChHHHHHHhhhhhceecccccccceee-eeeeeecccEEEEch Confidence 1112357899999999999999999999889999999999976655321 122222333342 224578899999999 Q ss_pred HHHhcCC-------------------------eEEEeecceeeeccceEEEEEeecc---CccceeeeecccccccCCCC Q lcl|NC_017976. 230 QAKLGAN-------------------------AALVYIKGVGKAFTGITTARTIESE---DFDGVAFQGAGKAGEFILDD 281 (296) Q Consensus 230 ~~y~qg~-------------------------~aifs~dnIg~af~GI~taRtieSE---DFdGVaLQgAgK~G~~Ildd 281 (296) +..|... ..|..|...-++.+=.+..|.++.+ +=||=..|+--=+.-||+|. T Consensus 217 s~r~~t~~~f~dG~t~~~~~gg~~~~~~ak~INfiiv~~~a~i~~~K~~~~~if~P~~~~~~d~~~~~~R~Y~D~fv~~n 296 (312) T protein:vir:10 217 QNRMYSSILLNDGTTSNQTAGGYLKGTKALDTNFIIAPVDVPLAITKQDKMRIFDPETNQTANAWSMDYRRYHDLWVTDN 296 (312) T ss_pred hhhccceeeeccCcccccccCceeecCcccccceEEeCCceeeceeeeeeeeeeCCCCCCCcceeeeeeeeeeeeeeecc Confidence 9999421 2455556667777777778877543 22344666666677888876 Q ss_pred CcceEEEEecccCCC Q lcl|NC_017976. 282 NKPAVVKVTAPTVTP 296 (296) Q Consensus 282 NKkAI~k~t~~~~~p 296 (296) =+++| .+.-+.+.| T Consensus 297 k~~~I-yv~~k~a~~ 310 (312) T protein:vir:10 297 KANSV-YANFKDAKP 310 (312) T ss_pred ccCeE-EEEeecccC Confidence 66666 555555556 No 17 >protein:vir:78920 Length: 290 # NCBI annotation: Cps # Family: family:all:701 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468846;genbank:gi:157325479;genbank:GeneID:5601917 Probab=98.33 E-value=1.9e-07 Score=57.48 Aligned_cols=261 Identities=13% Similarity=0.141 Sum_probs=162.2 Q ss_pred ceeeeechhHHHHHHHHHhhhhhhhhhcccceeeecCCcccceeEEEeecccceE-ecccccCcccceeccccCCccccc Q lcl|NC_017976. 
8 LAAKTYQKQFKEMLQAVFSHQAYFADFFGGGIEALDGVQNNETAFYVKTSDLPVV-VGTGYNTDANVGFGTGTSKSSRFG 86 (296) Q Consensus 8 ~a~r~Y~kq~~~ll~~vf~~qa~F~~~fgg~lQ~lDGVqnN~tafsvKtnd~pVv-vg~~Y~td~NvaFGtGTg~s~RFG 86 (296) ||+- |.++|.+.|...|.+.+++...--+.. .+ ++ +=+||++.+.++ ++. |+.+. +|..|+-+ T Consensus 1 Main-~a~~~~~~Ld~~~~~~~~t~~l~~~~~---~~--~g--gktVkI~~i~~~gl~D-Y~R~~--g~~~g~v~----- 64 (290) T protein:vir:78 1 MAIN-YVDKYGKELDQKLVFGTYTNELETPNL---LW--LD--AKTFKIQTITTTGLKA-HTRNK--GYNEGSAS----- 64 (290) T ss_pred Cchh-HHHHHHHHHHHHHHhhheeeeccccce---ee--cc--CCEEEEeeeccCcccc-cccCC--CcccCccc----- Confidence 6664 568899999999999988766432211 12 12 224676666542 333 77643 55444321 Q ss_pred ceeEEEEeccccccccCchhhhccccccccCC-HhHHHHHHHh-hHHHHHHHHHHHHHhHHHhhhhcchhh--hhhcchh Q lcl|NC_017976. 87 DRQEIIYIDTPVPYTWGYNWHEGIDRYTVNNS-MESAMADRVE-LQAQAKTMLFDKKHAEFIVANAGKTEA--LTAYDEA 162 (296) Q Consensus 87 ~rkEIiy~DtdVpY~~~~aiHEGiDr~TVNnd-l~aavAdRl~-Lqa~Ak~~~~n~~~gk~ls~~A~~t~~--l~~~~~~ 162 (296) .-|+.....++-.|+|. ||.+-|+.- ....+|.-++ .|+...+-.+|+..-..|...|..... ...+|++ T Consensus 65 ----~~~et~tl~qdR~~~F~--vD~~DvDEt~~~~~~~nv~~ef~~~~v~PEiDayr~skla~~a~~~~~~~~~t~t~~ 138 (290) T protein:vir:78 65 ----NTNKSYTIDFDRDVEFF--VDVMDVDETGQALSAANVTKEFNSRHAGPEMDAYRFSKLATAAKTNSNSVAEEITKD 138 (290) T ss_pred ----cceeeEEeeccccceee--ccccchhHHhhhhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhhhccCcccccccCHH Confidence 23444445566666653 777666431 2222333322 234444556676655555544432111 2456888 Q ss_pred HHHHHHHHhhhhhcceeeeeeEEEEECchhhhhhhcccccccccc----ceeeeccCcceeecCeEEEEchHH-Hhc--- Q lcl|NC_017976. 163 AVLKLFNNLSAYYINIEAIGTKVAKVGPELYNIIVDHRLTTKEKA----SGANVDSNEIVKFKGFLIEEVPQA-KLG--- 234 (296) Q Consensus 163 ~V~klFn~~~~~yvn~ev~~~~~ayV~~evYNaIvD~~l~Ts~K~----Ss~NiD~ngi~~fKgf~l~e~p~~-y~q--- 234 (296) ++.+.+-.+..+--+ -...+..+||+|++|.+|-.++.-+...+ +.-.|+. -+-++.||.|.|+|.+ .|. 
T Consensus 139 n~~~~i~~~~~~lde-vp~~~rvl~vtp~~~~lL~~~~~f~r~~~~~~~~~~~i~~-~V~~idG~~ii~vps~~r~~t~~ 216 (290) T protein:vir:78 139 NVFTKLKAAIRKVKK-YGTQNLVMYVSPDVMAALELSDDFVRAINVQNIGPSSIET-RITAIDGTRIVEVEAEDRFYDTF 216 (290) T ss_pred HHHHHHHHHHHHHHh-cCCCCeEEEECHHHHHHHhhChhhhccccccccccccccc-eeeeecCcEEEEecccchhhhhh Confidence 898888888777532 22556999999999999987765544222 1223322 3468999999999964 443 Q ss_pred -----------CC--eEEEeecceeeeccceEEEEEeeccCc---cceeeeecccccccCCCCCcceEEEEecc Q lcl|NC_017976. 235 -----------AN--AALVYIKGVGKAFTGITTARTIESEDF---DGVAFQGAGKAGEFILDDNKPAVVKVTAP 292 (296) Q Consensus 235 -----------g~--~aifs~dnIg~af~GI~taRtieSEDF---dGVaLQgAgK~G~~IlddNKkAI~k~t~~ 292 (296) ++ ..|..|.+.-++.+=.+..|..+.+-. ||=..|+--=+.-||+|.=|++|.....- T Consensus 217 ~f~~G~~~~~~ak~in~ii~~~~a~i~~~K~~~~~~~~P~~~~~~d~~~~~~r~y~d~~v~~nk~~~i~~~~~~ 290 (290) T protein:vir:78 217 DFTDGYKPAAGAKKLNFLLVNKGSVVGGAKHASIYLHAPGSVGQGDGWLYQYRVYHDIFVLDQQKDGVIASTEV 290 (290) T ss_pred hhcccccccCCccceeEEEEcCCceeeeeeeeEEEeeCCCCCcCcceeeeeeeeeeeeeeeccccCeeEEEeeC Confidence 22 256667777888888889999987766 77788888888999997777777754433 No 18 >protein:vir:78739 Length: 332 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285448;genbank:gi:148724482;genbank:GeneID:5220210 Probab=98.00 E-value=3.7e-07 Score=55.86 Aligned_cols=265 Identities=12% Similarity=0.106 Sum_probs=146.7 Q ss_pred CCCCccc--------ceee--eechhHHHHHHHHHhhhhhhhhhcccceeeecCCcccceeEEEeecccc-eEecccccC Q lcl|NC_017976. 1 MGTKNQQ--------LAAK--TYQKQFKEMLQAVFSHQAYFADFFGGGIEALDGVQNNETAFYVKTSDLP-VVVGTGYNT 69 (296) Q Consensus 1 m~t~Nnn--------~a~r--~Y~kq~~~ll~~vf~~qa~F~~~fgg~lQ~lDGVqnN~tafsvKtnd~p-Vvvg~~Y~t 69 (296) |...|+- -.++ +|-|+|.+.+.+-|++++.|++.+=- +.. .+ -.- |+.+-++ +-+ .+|.. 
T Consensus 7 ~~~~~~~~~~~~~~~~d~~~al~le~~~geV~~~f~~~s~~~~~~~~--r~i---~~-G~t--v~i~~ig~~~~-~~~~~ 77 (332) T protein:vir:78 7 FSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRS--YDL---RG-GKS--KQFMFTGKLSA-GYHTP 77 (332) T ss_pred ccCCccccCCccccccccchhhhhhhhhhhHHHHHHHHhhhhhcccc--ccc---cc-cce--EEEEeccceeE-eeecC Confidence 4443332 1166 89999999999999999999876542 211 11 111 2221111 112 12333 Q ss_pred cccceeccccCCccc-ccceeEEEEeccccccccCchhhhccccccccCCHhHHHHHHHhhHHHHHHHHHHHHHhHHHhh Q lcl|NC_017976. 70 DANVGFGTGTSKSSR-FGDRQEIIYIDTPVPYTWGYNWHEGIDRYTVNNSMESAMADRVELQAQAKTMLFDKKHAEFIVA 148 (296) Q Consensus 70 d~NvaFGtGTg~s~R-FG~rkEIiy~DtdVpY~~~~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~~~gk~ls~ 148 (296) +...- ++. -=.-+-++-.|+..=+ .+.| +.||+..++-|+ ++++..-++.|=.|.++..+...|.+ T Consensus 78 g~~l~-------~~~~~~~~~~~l~ID~~ky~--~~~V-ddiD~~q~~~dl---~~~~~~~~g~aLA~~~D~~i~~~l~~ 144 (332) T protein:vir:78 78 GTPIV-------GDAGIKANEKTLVMDDLLVS--SQFV-YSLDEIFSQYST---RAEVSKQIGEALATHYDERIARVLAK 144 (332) T ss_pred CCCCC-------CCCCCCCceEEEEEehhhhh--HHHH-HhHHHHhcCcch---HHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 22110 010 0001112444544322 2223 468888988876 66777788999999999999988877 Q ss_pred hhcchhh-----------h---hhcchhH----HHHHHHHhhhhhcceeeeeeEEEEECchhhhhhhc---cccccc-cc Q lcl|NC_017976. 149 NAGKTEA-----------L---TAYDEAA----VLKLFNNLSAYYINIEAIGTKVAKVGPELYNIIVD---HRLTTK-EK 206 (296) Q Consensus 149 ~A~~t~~-----------l---~~~~~~~----V~klFn~~~~~yvn~ev~~~~~ayV~~evYNaIvD---~~l~Ts-~K 206 (296) .|..... + +..+.++ +.++..+|.++.|- ...+.+.|+|+.|..|+. ..++.. .. 
T Consensus 145 aa~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP---~~gR~~vv~P~~y~~Ll~~~d~~~~n~~~~ 221 (332) T protein:vir:78 145 ASAEASPVTGEPGGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAP---QEGRVAVLSPRQYYSLISSVDTNILNREIG 221 (332) T ss_pred hhcccCcccccccccccccCCccccCHHHHHHHHHHHHHHHhhcCCC---ccCCEEEeCHHHHHHHHhhcCceeeeeecc Confidence 6543110 0 1223333 44555555555442 233778899999999985 333333 23 Q ss_pred cceeeeccC-cceeecCeEEEEchHHHhc------------------C---C--eEEEeecceeeec---cceEEEEEee Q lcl|NC_017976. 207 ASGANVDSN-EIVKFKGFLIEEVPQAKLG------------------A---N--AALVYIKGVGKAF---TGITTARTIE 259 (296) Q Consensus 207 ~Ss~NiD~n-gi~~fKgf~l~e~p~~y~q------------------g---~--~aifs~dnIg~af---~GI~taRtie 259 (296) +++-.+..- +|.+.-||.|-+.+.--.. + + ..+|.|+-+|.+= .=|+++|..- T Consensus 222 ~~~~~~~~g~~i~~i~G~~V~~Sn~lp~~~g~~~~~~~~~~~~n~~~~~~~~~~~~~~h~~a~~~v~~~~~~~~~t~~~~ 301 (332) T protein:vir:78 222 NSQGDMNSGKGLYSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDF 301 (332) T ss_pred ccccceecceeeeEEeeeEEEecCccccCcccccccccccccccccccccccceEEeecccceeeeeeeccchhhhhccc Confidence 444445443 4789999999886533211 0 1 2566776655431 1244455555 Q ss_pred ccCccceeeeecccccccCCCCCcceEEEEe Q lcl|NC_017976. 260 SEDFDGVAFQGAGKAGEFILDDNKPAVVKVT 290 (296) Q Consensus 260 SEDFdGVaLQgAgK~G~~IlddNKkAI~k~t 290 (296) .|+.-+-.+-|---||-=++.-...+.+++- T Consensus 302 ~~~~~~d~i~~~~~~G~~v~rPe~~v~l~~a 332 (332) T protein:vir:78 302 NVQYQGDLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) T ss_pred chhhhHhhhhhhhhhcCceecccceEEEeeC Confidence 5665555555555566656666666666554 No 19 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=97.49 E-value=5.7e-05 Score=43.84 Aligned_cols=263 Identities=12% Similarity=0.057 Sum_probs=146.2 Q ss_pred CCCCcccceeeeechhHHHHHHHHHhhhhhhhhhcccceeeecCCcccceeEEEeecccceEe--ccc--ccCcccceec Q lcl|NC_017976. 
1 MGTKNQQLAAKTYQKQFKEMLQAVFSHQAYFADFFGGGIEALDGVQNNETAFYVKTSDLPVVV--GTG--YNTDANVGFG 76 (296) Q Consensus 1 m~t~Nnn~a~r~Y~kq~~~ll~~vf~~qa~F~~~fgg~lQ~lDGVqnN~tafsvKtnd~pVvv--g~~--Y~td~NvaFG 76 (296) ||.++-..+-=+--.+|..++..-+..+..|.+.--- ...+.|..-+ .| .+|+.- |+. |.-++..+.. T Consensus 1 MA~~~T~~~~~~iPev~s~~v~~~~~~~~~~~~~~~~-~~~~~g~~G~----tv---~iP~~~~~~~a~~v~eg~~i~~~ 72 (272) T protein:vir:30 1 MAVGTTKMAQMLDPEVLADMIDAEVGKAIRFAPLAEV-DTTLEGQPGT----TL---TVPKWDYIGDAEDVAEGEAIPMT 72 (272) T ss_pred CCCccccchheechHHHHHHHHHHHHHHhhhhccccc-cccccCCCCC----EE---EEEEecCCCCcccccCCCccccc Confidence 9976666664333447788887777777776542111 1122332222 11 123321 221 2222333322 Q ss_pred cccCCcccccceeEEEEeccccccccCchhhhccccccccCCHhHHHHHHHhhHHHHHHHHHHHHHhHHHhhhhcchhhh Q lcl|NC_017976. 77 TGTSKSSRFGDRQEIIYIDTPVPYTWGYNWHEGIDRYTVNNSMESAMADRVELQAQAKTMLFDKKHAEFIVANAGKTEAL 156 (296) Q Consensus 77 tGTg~s~RFG~rkEIiy~DtdVpY~~~~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~~~gk~ls~~A~~t~~l 156 (296) .-+ |++.+-.+.. +-.-|. +++.......-..++..++-++.+|.|.+++.+-..+....... + T Consensus 73 ~~~-----~~~~~~~~~~-----~~~~~~----itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a~~~~-~- 136 (272) T protein:vir:30 73 QLG-----FKKTTMTIKK-----AGKGVE----ITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSKSTQTV-E- 136 (272) T ss_pred ccc-----cceEEEEeee-----eeeeee----ecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhccccccc-c- Confidence 222 2222111111 011121 33333333333356666667788999999998887776544333 2 Q ss_pred hhcchhHHHHHHHHhhhhhcceeeeeeEEEEECchhhhhhhcccccccccccee--eeccCcc-eeecCeEEEEchHHHh Q lcl|NC_017976. 157 TAYDEAAVLKLFNNLSAYYINIEAIGTKVAKVGPELYNIIVDHRLTTKEKASGA--NVDSNEI-VKFKGFLIEEVPQAKL 233 (296) Q Consensus 157 ~~~~~~~V~klFn~~~~~yvn~ev~~~~~ayV~~evYNaIvD~~l~Ts~K~Ss~--NiD~ngi-~~fKgf~l~e~p~~y~ 233 (296) +..+.+.|..+...+...+.. +....|+|++|..|.-..+....+.+.. ++=.+|. -++.|+.+-+.+. 
+ T Consensus 137 ~~~t~d~i~da~~~l~~~~~~-----~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~g~ig~i~G~~Vi~s~~--~ 209 (272) T protein:vir:30 137 ATATVDGVSKALDIFNDEDDA-----ETVIVMNPADASTLRLDAAKEWLGATEVGANRVVSGVYGEVLGVQIVRSRK--C 209 (272) T ss_pred cccCHHHHHHHHHHHhccCCC-----ccEEEEcHHHHHHHHHhccccccccccccccccccccchhhcCeeEEEcCC--C Confidence 466777788777777655432 4568899999999987665555554443 3333453 4899998877653 4 Q ss_pred c-CCeEEEeecceeeeccceEEEEEeeccCccceeeeecccccccCCCCCcceEEEEecccCCC Q lcl|NC_017976. 234 G-ANAALVYIKGVGKAFTGITTARTIESEDFDGVAFQGAGKAGEFILDDNKPAVVKVTAPTVTP 296 (296) Q Consensus 234 q-g~~aifs~dnIg~af~GI~taRtieSEDFdGVaLQgAgK~G~~IlddNKkAI~k~t~~~~~p 296 (296) - |...+|.+..++..--+-....+-.+++..-..+.+-.-||-.+++.. +|++.|.+...- T Consensus 210 p~~t~~~~~~~a~~~~~~~~~~ve~~r~~~~~~~~i~~~~~~~~~v~~~~--~vv~~t~~~a~~ 271 (272) T protein:vir:30 210 PKGTAYMVRKGALRIMLKRNTMVETDRDITKAINQIVANKHYGVYLYKAE--KAVKITLKDAAK 271 (272) T ss_pred CcceEEEEcCCeEEEEecCCceeeeccccccceeEEEEEEEEEEEEEcCC--ceEEEEeccccc Confidence 4 667788887776542222122333344555688889899998888654 666666654444 No 20 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=97.49 E-value=5.7e-05 Score=43.84 Aligned_cols=263 Identities=12% Similarity=0.057 Sum_probs=146.2 Q ss_pred CCCCcccceeeeechhHHHHHHHHHhhhhhhhhhcccceeeecCCcccceeEEEeecccceEe--ccc--ccCcccceec Q lcl|NC_017976. 1 MGTKNQQLAAKTYQKQFKEMLQAVFSHQAYFADFFGGGIEALDGVQNNETAFYVKTSDLPVVV--GTG--YNTDANVGFG 76 (296) Q Consensus 1 m~t~Nnn~a~r~Y~kq~~~ll~~vf~~qa~F~~~fgg~lQ~lDGVqnN~tafsvKtnd~pVvv--g~~--Y~td~NvaFG 76 (296) ||.++-..+-=+--.+|..++..-+..+..|.+.--- ...+.|..-+ .| .+|+.- |+. |.-++..+.. 
T Consensus 1 MA~~~T~~~~~~iPev~s~~v~~~~~~~~~~~~~~~~-~~~~~g~~G~----tv---~iP~~~~~~~a~~v~eg~~i~~~ 72 (272) T protein:vir:98 1 MAVGTTKMAQMLDPEVLADMIDAEVGKAIRFAPLAEV-DTTLEGQPGT----TL---TVPKWDYIGDAEDVAEGEAIPMT 72 (272) T ss_pred CCCccccchheechHHHHHHHHHHHHHHhhhhccccc-cccccCCCCC----EE---EEEEecCCCCcccccCCCccccc Confidence 9976666664333447788887777777776542111 1122332222 11 123321 221 2222333322 Q ss_pred cccCCcccccceeEEEEeccccccccCchhhhccccccccCCHhHHHHHHHhhHHHHHHHHHHHHHhHHHhhhhcchhhh Q lcl|NC_017976. 77 TGTSKSSRFGDRQEIIYIDTPVPYTWGYNWHEGIDRYTVNNSMESAMADRVELQAQAKTMLFDKKHAEFIVANAGKTEAL 156 (296) Q Consensus 77 tGTg~s~RFG~rkEIiy~DtdVpY~~~~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~~~gk~ls~~A~~t~~l 156 (296) .-+ |++.+-.+.. +-.-|. +++.......-..++..++-++.+|.|.+++.+-..+....... + T Consensus 73 ~~~-----~~~~~~~~~~-----~~~~~~----itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a~~~~-~- 136 (272) T protein:vir:98 73 QLG-----FKKTTMTIKK-----AGKGVE----ITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSKSTQTV-E- 136 (272) T ss_pred ccc-----cceEEEEeee-----eeeeee----ecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhccccccc-c- Confidence 222 2222111111 011121 33333333333356666667788999999998887776544333 2 Q ss_pred hhcchhHHHHHHHHhhhhhcceeeeeeEEEEECchhhhhhhcccccccccccee--eeccCcc-eeecCeEEEEchHHHh Q lcl|NC_017976. 157 TAYDEAAVLKLFNNLSAYYINIEAIGTKVAKVGPELYNIIVDHRLTTKEKASGA--NVDSNEI-VKFKGFLIEEVPQAKL 233 (296) Q Consensus 157 ~~~~~~~V~klFn~~~~~yvn~ev~~~~~ayV~~evYNaIvD~~l~Ts~K~Ss~--NiD~ngi-~~fKgf~l~e~p~~y~ 233 (296) +..+.+.|..+...+...+.. +....|+|++|..|.-..+....+.+.. ++=.+|. -++.|+.+-+.+. 
+ T Consensus 137 ~~~t~d~i~da~~~l~~~~~~-----~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~g~ig~i~G~~Vi~s~~--~ 209 (272) T protein:vir:98 137 ATATVDGVSKALDIFNDEDDA-----ETVIVMNPADASTLRLDAAKEWLGATEVGANRVVSGVYGEVLGVQIVRSRK--C 209 (272) T ss_pred cccCHHHHHHHHHHHhccCCC-----ccEEEEcHHHHHHHHHhccccccccccccccccccccchhhcCeeEEEcCC--C Confidence 466777788777777655432 4568899999999987665555554443 3333453 4899998877653 4 Q ss_pred c-CCeEEEeecceeeeccceEEEEEeeccCccceeeeecccccccCCCCCcceEEEEecccCCC Q lcl|NC_017976. 234 G-ANAALVYIKGVGKAFTGITTARTIESEDFDGVAFQGAGKAGEFILDDNKPAVVKVTAPTVTP 296 (296) Q Consensus 234 q-g~~aifs~dnIg~af~GI~taRtieSEDFdGVaLQgAgK~G~~IlddNKkAI~k~t~~~~~p 296 (296) - |...+|.+..++..--+-....+-.+++..-..+.+-.-||-.+++.. +|++.|.+...- T Consensus 210 p~~t~~~~~~~a~~~~~~~~~~ve~~r~~~~~~~~i~~~~~~~~~v~~~~--~vv~~t~~~a~~ 271 (272) T protein:vir:98 210 PKGTAYMVRKGALRIMLKRNTMVETDRDITKAINQIVANKHYGVYLYKAE--KAVKITLKDAAK 271 (272) T ss_pred CcceEEEEcCCeEEEEecCCceeeeccccccceeEEEEEEEEEEEEEcCC--ceEEEEeccccc Confidence 4 667788887776542222122333344555688889899998888654 666666654444 No 21 >protein:vir:80213 Length: 334 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522884;genbank:gi:158345177;genbank:GeneID:5687476 Probab=97.47 E-value=5.4e-05 Score=43.98 Aligned_cols=254 Identities=15% Similarity=0.178 Sum_probs=141.9 Q ss_pred CCCCcccce-----------eeeechhHHHHHHHHHhhhhhhhhhcccceeeecCCcccceeEEEeecccceEecccccC Q lcl|NC_017976. 1 MGTKNQQLA-----------AKTYQKQFKEMLQAVFSHQAYFADFFGGGIEALDGVQNNETAFYVKTSDLPVVVGTGYNT 69 (296) Q Consensus 1 m~t~Nnn~a-----------~r~Y~kq~~~ll~~vf~~qa~F~~~fgg~lQ~lDGVqnN~tafsvKtnd~pVvvg~~Y~t 69 (296) |+|-++|.= .-+|-|+|-+.+.+-|+.++.|++..=- +.+-| -|.--|. .+. .+.+ .+|.. 
T Consensus 1 m~~~~~~~~t~~~~~~~~~~~~l~le~~~geV~~af~~~s~~~~~~~~--r~i~~--G~s~~~~-~iG--~~~~-~~~~~ 72 (334) T protein:vir:80 1 MTYPAANTHTRPGWGGANSDVSLHIEEHLGLVDASFMYSSKFASWMNV--RSLRG--TNQLRVD-RVG--ASTI-AGRKA 72 (334) T ss_pred CCCCcCCCccccccccccchheehhhhhhhHHHHHHHHhhhhhcccee--eeccc--cceEEEe-eec--ceee-eeecC Confidence 777643321 3477899999999999999999976532 33322 1111111 111 2233 23555 Q ss_pred cccceeccccCCcccccceeEEEEeccccccccCchhhhccccccccCCHhHHHHHHHhhHHHHHHHHHHHHHhHHHhhh Q lcl|NC_017976. 70 DANVGFGTGTSKSSRFGDRQEIIYIDTPVPYTWGYNWHEGIDRYTVNNSMESAMADRVELQAQAKTMLFDKKHAEFIVAN 149 (296) Q Consensus 70 d~NvaFGtGTg~s~RFG~rkEIiy~DtdVpY~~~~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~~~gk~ls~~ 149 (296) ++.. ++ +|.-.-+-+|-.|+..-+... =+=||+.+.+=|+-+.++.. +++|-.|++|..+..-|... T Consensus 73 g~~l---~~----~~~~~~~~~l~ID~~l~~~~~---VddiD~~q~~~D~rse~~~~---~G~aLA~~~D~~~~~~l~ka 139 (334) T protein:vir:80 73 GEEL---VV----QKNVSDKLNLTVDTVLYARHF---FDKFDEWTSNLDVRKETARE---DGIALARQYDQACIIQLQKC 139 (334) T ss_pred CCCC---CC----CCcccCceEEEEeeeeehhhh---HhhHHHHhcCcchHHHHHHH---HHHHHHHHHHHHHHHHHHHh Confidence 5555 22 233334456777776544432 24578888888877777654 67899999999876665544 Q ss_pred hcchh----------------------hhhhcch----hHHHHHHHHhhhhhcceeeeeeEEEEECchhhhhhhcccccc Q lcl|NC_017976. 150 AGKTE----------------------ALTAYDE----AAVLKLFNNLSAYYINIEAIGTKVAKVGPELYNIIVDHRLTT 203 (296) Q Consensus 150 A~~t~----------------------~l~~~~~----~~V~klFn~~~~~yvn~ev~~~~~ayV~~evYNaIvD~~l~T 203 (296) |...- .-...+. +.+..++..+.++.+.-+...++.++|+|..|.+|+.++-.. 
T Consensus 140 a~~~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~l~~a~~~a~~~L~e~dvp~~~~~~R~~vv~P~~y~~Ll~~~r~~ 219 (334) T protein:vir:80 140 GDFLAPAHLKPAFHDGILLPSTISGLAADAAADADVLVAAHRQGVEAMVFRDLGDQLMSEGVTLLDPVIFSFLLEHDRLM 219 (334) T ss_pred hhhcccccccccccCCcceeecccccccchhhhHHHHHHHHHHHHHHHHhcCCCCCcCCceEEEeChHHHHHHhcccccc Confidence 42110 0000111 234567788888888877778899999999999999997654 Q ss_pred cc-cc---ceeeeccCcceeecCeEEEEchHHHhcCCeEEEeecceeeeccceEEEEEeeccCccceeeeecccccccCC Q lcl|NC_017976. 204 KE-KA---SGANVDSNEIVKFKGFLIEEVPQAKLGANAALVYIKGVGKAFTGITTARTIESEDFDGVAFQGAGKAGEFIL 279 (296) Q Consensus 204 s~-K~---Ss~NiD~ngi~~fKgf~l~e~p~~y~qg~~aifs~dnIg~af~GI~taRtieSEDFdGVaLQgAgK~G~~Il 279 (296) .. -+ +...+-.-.+.++-||.|-+.+.- -+. ++.... .|..+ ..-..||.- ..|.|. T Consensus 220 n~d~~~s~~~~~~~~g~i~~v~G~~V~~Sn~~--P~~-~~t~~~-~g~~~-------~~~agd~t~-------~~~~~~- 280 (334) T protein:vir:80 220 NVEFGAKEGGNSFVGGRIAMLNGVRVVETPRF--PQS-AITANA-LGADF-------NVTDAEVRR-------KMITFI- 280 (334) T ss_pred cceeccccccccccceeEEEEeceEEEeecCC--CCc-cccccc-ccccc-------ccccccccc-------eEEEEE- Confidence 32 12 222344444778888888876531 100 000000 00000 000111111 112222 Q ss_pred CCCcceEEEEecccCCC Q lcl|NC_017976. 280 DDNKPAVVKVTAPTVTP 296 (296) Q Consensus 280 ddNKkAI~k~t~~~~~p 296 (296) .+.|+..+.+...++ T Consensus 281 --~~~Al~t~~~~~~~~ 295 (334) T protein:vir:80 281 --PSMALISAQVHPVSA 295 (334) T ss_pred --eCceEEEEEEeecce Confidence 345777777776666 No 22 >protein:vir:10450 Length: 344 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848297;genbank:gi:30387487;genbank:GeneID:1733971 Probab=97.39 E-value=2e-05 Score=46.40 Aligned_cols=269 Identities=13% Similarity=0.115 Sum_probs=141.9 Q ss_pred CCCC------cccc---------eeeeechhHHHHHHHHHhhhhhhhhhcccceeeecCCcccceeEEEeecccceEecc Q lcl|NC_017976. 
1 MGTK------NQQL---------AAKTYQKQFKEMLQAVFSHQAYFADFFGGGIEALDGVQNNETAFYVKTSDLPVVVGT 65 (296) Q Consensus 1 m~t~------Nnn~---------a~r~Y~kq~~~ll~~vf~~qa~F~~~fgg~lQ~lDGVqnN~tafsvKtnd~pVvvg~ 65 (296) |+.. |++. +.-+|-|+|-+-+-+-|+.++.|++..=- +.+.| -| +--.|. ||. T Consensus 1 ma~~~~~~~~n~~~~~~~~~~~~~~al~ie~~~geV~~~f~~~s~~~~~~~~--r~i~~--g~-------s~~~~~-iG~ 68 (344) T protein:vir:10 1 MANMTGGQQLGTNQGKDVMAAGDKLALFLKVFGGEVLTAFARTSVTTSRHMV--RSISS--GK-------SAQFPV-LGR 68 (344) T ss_pred CccccccccCCcccCCccCCccchhHHHHHHHHHHHHHHHHHHhhhccccee--eeecc--cc-------eEEEEe-ece Confidence 6532 3222 34468899999999999999999987643 33222 11 111222 222 Q ss_pred ----cccCcccceeccccCCcccccceeEEEEeccccccccCchhhhccccccccCCHhHHHHHHHhhHHHHHHHHHHHH Q lcl|NC_017976. 66 ----GYNTDANVGFGTGTSKSSRFGDRQEIIYIDTPVPYTWGYNWHEGIDRYTVNNSMESAMADRVELQAQAKTMLFDKK 141 (296) Q Consensus 66 ----~Y~td~NvaFGtGTg~s~RFG~rkEIiy~DtdVpY~~~~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~~ 141 (296) +|..++... || .+..=.-+-+|-.|+..-+.+.. .-||+.+.+=|+-+..+++ +++|-.|.+|.. T Consensus 69 ~~~~~~~~G~~l~---~t--~~~~~~~e~~l~ID~~~y~~~~V---dDiD~~q~~~D~r~~~~~~---~G~aLA~~~D~~ 137 (344) T protein:vir:10 69 TQAAYLAPGENLD---DI--RKDIKHTEKVITIDGLLTADVLI---YDIEDAMNHYDVRSEYTSQ---LGESLAMAADGA 137 (344) T ss_pred eEEEeeecCCCCC---CC--CCCcccceEEEEEcchhhhhhhh---hhHHHHhcCcchHHHHHHH---HHHHHHHHHHHH Confidence 122222211 00 11222223357777766554332 3678888887877666555 568889999988 Q ss_pred HhHHHhhhhc------------------------chhhhhhcchhHHHHHHHHhhhhhcceee-eeeEEEEECchhhhhh Q lcl|NC_017976. 142 HAEFIVANAG------------------------KTEALTAYDEAAVLKLFNNLSAYYINIEA-IGTKVAKVGPELYNII 196 (296) Q Consensus 142 ~gk~ls~~A~------------------------~t~~l~~~~~~~V~klFn~~~~~yvn~ev-~~~~~ayV~~evYNaI 196 (296) +...|...+. 
.+.+=...+.+.+...+-.+.+.-....| .....++|+|+.|.+| T Consensus 138 i~~~la~~a~~~~~~~~~~~g~~~~~~~~~~~~~~~~t~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~L 217 (344) T protein:vir:10 138 VLAEIAGLCNVESQYNENITGLGTATVIETTQDKTTLTDQVALGKEIIAALTKARAALTKNYVPSSDRVFYCDPDSYSAI 217 (344) T ss_pred HHHHHHhhhccccccccccccccccceeecccccccccchhhhHHHHHHHHHHHHHHHhhcCCCccCCEEEeChHHHHHH Confidence 8766643221 11110111122333333333333333333 2347899999999999 Q ss_pred hccccccccccceeeeccCc-ceeecCeEEEEchHHHhc----------CCe--------------------EEEeecce Q lcl|NC_017976. 197 VDHRLTTKEKASGANVDSNE-IVKFKGFLIEEVPQAKLG----------ANA--------------------ALVYIKGV 245 (296) Q Consensus 197 vD~~l~Ts~K~Ss~NiD~ng-i~~fKgf~l~e~p~~y~q----------g~~--------------------aifs~dnI 245 (296) ++++..+.....+-+.-.+| +.+..||.|-+.|.--.. |.. .+|.|+-+ T Consensus 218 l~~~~~~~~~~~~~~~~~~G~V~~v~G~~V~~Sn~lp~~~~~~~~~~~tg~~~~~~~~~~~~~~~~~s~~~~l~~h~~A~ 297 (344) T protein:vir:10 218 LAALMPNAANYAALIDPEKGSIRNVMGFEVVEVPHLTAGGAGTSREGTTGQKHAFPATKSGNDKVAKDNVIGLFMHRSAV 297 (344) T ss_pred hhcccccccccccccceeeeEEEEEeceEEEeccccccccCCcccccccCccccccCCcccceeeecceeEEEeechhhh Confidence 99988776655555555678 568899999988853221 000 12222222 Q ss_pred eeeccceEEEEEeeccCccceeeeecccccccCCCCCcceEEEEecc Q lcl|NC_017976. 
246 GKAFTGITTARTIESEDFDGVAFQGAGKAGEFILDDNKPAVVKVTAP 292 (296) Q Consensus 246 g~af~GI~taRtieSEDFdGVaLQgAgK~G~~IlddNKkAI~k~t~~ 292 (296) |..=.=--+.+.-.+|..-|=.+-|---||-=++.-.-.+.++-+.+ T Consensus 298 ~~v~~~~~~~e~~r~~~~~~d~i~g~~~~G~~vlRPe~a~~v~~~~~ 344 (344) T protein:vir:10 298 GTVKLRDLALERARRANFQADQIIAKYAMGHGGLRPEAAGAVVFKTK 344 (344) T ss_pred hhhhhccceeecccchhHHHHHHHHHhhcccceecccceEEEEeecC Confidence 21110000112222344333344444445555666555566666665 No 23 >protein:vir:2201 Length: 345 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041998;swissprot:sw:p19726;genbank:gi:9627469;goa:P19726;uniprot:P19726;genbank:GeneID:1261026 Probab=97.31 E-value=6e-05 Score=43.74 Aligned_cols=268 Identities=14% Similarity=0.118 Sum_probs=145.9 Q ss_pred CC---------CCcc------cceeeeechhHHHHHHHHHhhhhhhhhhcccceeeecCCcccceeEEEeecccceEecc Q lcl|NC_017976. 1 MG---------TKNQ------QLAAKTYQKQFKEMLQAVFSHQAYFADFFGGGIEALDGVQNNETAFYVKTSDLPVVVGT 65 (296) Q Consensus 1 m~---------t~Nn------n~a~r~Y~kq~~~ll~~vf~~qa~F~~~fgg~lQ~lDGVqnN~tafsvKtnd~pVvvg~ 65 (296) |+ ++++ +-+.-+|-|+|-+.+-+-|+.++.|++..=- +.+.| -|.- -.| ++|. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~al~le~f~geV~~~f~~~s~~~~~~~~--r~i~~--gks~-------~~~-~iG~ 68 (345) T protein:vir:22 1 MASMTGGQQMGTNQGKGVVAAGDKLALFLKVFGGEVLTAFARTSVTTSRHMV--RSISS--GKSA-------QFP-VLGR 68 (345) T ss_pred CcccccchhcccccccccccCCchhHHHHHHHhHHHHHHHHHHhhhccccee--eeccc--cceE-------EEe-eecc Confidence 22 2221 2334688999999999999999999977642 33332 2211 122 2233 Q ss_pred ----cccCcccceeccccCCcccccceeEEEEeccccccccCchhhhccccccccCCHhHHHHHHHhhHHHHHHHHHHHH Q lcl|NC_017976. 66 ----GYNTDANVGFGTGTSKSSRFGDRQEIIYIDTPVPYTWGYNWHEGIDRYTVNNSMESAMADRVELQAQAKTMLFDKK 141 (296) Q Consensus 66 ----~Y~td~NvaFGtGTg~s~RFG~rkEIiy~DtdVpY~~~~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~~ 141 (296) +|..++... ++.++-. .-+-+|-.|+..-+.+.. .-||+.+.+=|+-+..+++ +++|-.|.+|+. 
T Consensus 69 ~~~~~~~~G~~l~---~~~~~~~--~~e~~ltID~~~y~~~~V---ddiD~~q~~~D~r~~~s~~---~G~aLA~~~D~~ 137 (345) T protein:vir:22 69 TQAAYLAPGENLD---DKRKDIK--HTEKVITIDGLLTADVLI---YDIEDAMNHYDVRSEYTSQ---LGESLAMAADGA 137 (345) T ss_pred eEEEeeecCCCCC---CCCCCcc--cceEEEEecchhhhhhhH---hhHHHHhcCchhHHHHHHH---HHHHHHHHHHHH Confidence 133222211 1111111 134567888877665433 3688888887876665554 678888999987 Q ss_pred HhHHHhhhhcch------------------------hhhhhcchhHHHHHHHHhhhhhcceeee-eeEEEEECchhhhhh Q lcl|NC_017976. 142 HAEFIVANAGKT------------------------EALTAYDEAAVLKLFNNLSAYYINIEAI-GTKVAKVGPELYNII 196 (296) Q Consensus 142 ~gk~ls~~A~~t------------------------~~l~~~~~~~V~klFn~~~~~yvn~ev~-~~~~ayV~~evYNaI 196 (296) +-..|...|... .+-...+.+.+-..|-.+.+..-..+|- ....++|+|+.|.+| T Consensus 138 i~~~l~k~a~~~~~~~~~~~~~~~~~~~~~~~~g~~~t~~~~~~~~~~~ai~~a~~~Lde~~VP~~~R~~vv~P~~y~~L 217 (345) T protein:vir:22 138 VLAEIAGLCNVESKYNENIEGLGTATVIETTQNKAALTDQVALGKEIIAALTKARAALTKNYVPAADRVFYCDPDSYSAI 217 (345) T ss_pred HHHHHHHhhcccccccccccccccccccccccccccccccccCHHHHHHHHHHHHHHhhhcCCCccCCEEEeChHHHHHH Confidence 765554322211 0001112233333343344333333332 348899999999999 Q ss_pred hccccccccccceeeeccCc-ceeecCeEEEEchHHHhc--C---------------------------C--eEEEeecc Q lcl|NC_017976. 197 VDHRLTTKEKASGANVDSNE-IVKFKGFLIEEVPQAKLG--A---------------------------N--AALVYIKG 244 (296) Q Consensus 197 vD~~l~Ts~K~Ss~NiD~ng-i~~fKgf~l~e~p~~y~q--g---------------------------~--~aifs~dn 244 (296) ++++..+..--...+...+| +.+.-||.|-|.|.--.. | . ..+|.|+- T Consensus 218 l~~~~~~~~~~~~~~~~~~G~V~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~l~~h~~A 297 (345) T protein:vir:22 218 LAALMPNAANYAALIDPEKGSIRNVMGFEVVEVPHLTAGGAGTAREGTTGQKHVFPANKGEGNVKVAKDNVIGLFMHRSA 297 (345) T ss_pred hccccccccccccccccccceEEEEeceEEEecccccccccCccccCcccccccccccccceeeeeccCceEEEEEehhh Confidence 99998766554555556788 789999999997632211 0 0 13444544 Q ss_pred eeeeccceE-EEEEeeccCccceeeeecccccccCCCCCcceEEEEecc Q lcl|NC_017976. 
245 VGKAFTGIT-TARTIESEDFDGVAFQGAGKAGEFILDDNKPAVVKVTAP 292 (296) Q Consensus 245 Ig~af~GI~-taRtieSEDFdGVaLQgAgK~G~~IlddNKkAI~k~t~~ 292 (296) +|..= =|. +.+.-.+|+.-+=.+-|---||-=++.-.-.+.++-+.. T Consensus 298 ~~~v~-~~~~~~e~~r~~~~~~d~I~~~~a~G~~vlRPeaa~~i~~~~~ 345 (345) T protein:vir:22 298 VGTVK-LRDLALERARRANFQADQIIAKYAMGHGGLRPEAAGAVVFKVE 345 (345) T ss_pred eeeee-eecceeeeeechhHHHHHHHHHHhcCCcccccceeEEEEEeeC Confidence 43111 010 111222444444444444455655666666666666665 No 24 >protein:vir:93742 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240459;genbank:gi:66396126;genbank:GeneID:5133511 Probab=96.99 E-value=0.00022 Score=40.64 Aligned_cols=262 Identities=14% Similarity=0.104 Sum_probs=150.2 Q ss_pred CCCCcccceeeeech-hHHHHHHHHHhhhhhhhhhcccceeeecCCcccceeEEEeecccceE--eccc--ccCccccee Q lcl|NC_017976. 1 MGTKNQQLAAKTYQK-QFKEMLQAVFSHQAYFADFFGGGIEALDGVQNNETAFYVKTSDLPVV--VGTG--YNTDANVGF 75 (296) Q Consensus 1 m~t~Nnn~a~r~Y~k-q~~~ll~~vf~~qa~F~~~fgg~lQ~lDGVqnN~tafsvKtnd~pVv--vg~~--Y~td~NvaF 75 (296) ||.+ .-....+..| .|..++..=+.++..|.+..=- .-.+.|.. -.+. .+|.. +|+. |..++..+. T Consensus 1 ma~~-~T~~~~~iiPev~~~~v~~~~~~~~~~~~~~~~-~~~l~g~~-G~tv------~ip~~~~~g~~~~~~eg~~i~~ 71 (274) T protein:vir:93 1 MPQG-ITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEV-DSTLQGQP-GDTL------TFPAFVYSGDAQVVAEGEKIPT 71 (274) T ss_pred CCcc-ceehhheechHHHHHHHHHHHHhhhhhcccccc-cccccCCC-CCEE------EEEeeccCCCcccccCCCcccc Confidence 8873 3333333444 5777777767666665443211 22223321 1121 22332 2222 443344433 Q ss_pred ccccCCcccccceeEEEEeccccccccCchhhhccccccccCCHhHHHHHHHhhHHHHHHHHHHHHHhHHHhhhhcchhh Q lcl|NC_017976. 76 GTGTSKSSRFGDRQEIIYIDTPVPYTWGYNWHEGIDRYTVNNSMESAMADRVELQAQAKTMLFDKKHAEFIVANAGKTEA 155 (296) Q Consensus 76 GtGTg~s~RFG~rkEIiy~DtdVpY~~~~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~~~gk~ls~~A~~t~~ 155 (296) ..-| +++.+-.|. . +-+.|.+.. 
+++.....|+ +++..+-++.++.+.+++.+-..|..+.....+ T Consensus 72 ~~it-----~~~~~~~i~--~---~~~~~~i~D-~~~~~~~~d~---~~~~~~~~~~~~a~~~d~~~~~~~~~a~~~~~~ 137 (274) T protein:vir:93 72 DILE-----TKKREAKIR--K---IAKGTSITD-EALLSGYGDP---QGEQVRQHGLAHANKVDNDVLEALMGAKLTVNA 137 (274) T ss_pred cccc-----cceeEEEee--e---ecccccccH-HHHHhhccch---HHHHHHHHHHHHHHHHHHHHHHHHhcccccccc Confidence 3333 222222221 1 234566655 5666666665 455666678999999999998888776655433 Q ss_pred hhhcchhHHHHHHHHhhhhhcceeeeeeEEEEECchhhhhhhccccccccccce--eeeccCcce-eecCeEEEEchHHH Q lcl|NC_017976. 156 LTAYDEAAVLKLFNNLSAYYINIEAIGTKVAKVGPELYNIIVDHRLTTKEKASG--ANVDSNEIV-KFKGFLIEEVPQAK 232 (296) Q Consensus 156 l~~~~~~~V~klFn~~~~~yvn~ev~~~~~ayV~~evYNaIvD~~l~Ts~K~Ss--~NiD~ngi~-~fKgf~l~e~p~~y 232 (296) ..++.+.+.....++.. +-..+..+.|+|++|..|.-.++-...+.|. -++=.+|.+ +|.||.+-+.+. T Consensus 138 -~~~~~d~i~dA~~~l~d-----~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~-- 209 (274) T protein:vir:93 138 -DITKLNGLQSAIDKFND-----EDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNK-- 209 (274) T ss_pred -cccCHHHHHHHHHHhhh-----ccCCccEEEeCHHHHHHHHhhhhhcccccccccccceeecccceecCeeEEEcCC-- Confidence 45566666655555543 2345678999999999998554333222222 233334443 899999987653 Q ss_pred hc-CCeEEEeecceeee---ccceEEEEEeeccCccceeeeecccccccCCCCCcceEEEEecccCCC Q lcl|NC_017976. 233 LG-ANAALVYIKGVGKA---FTGITTARTIESEDFDGVAFQGAGKAGEFILDDNKPAVVKVTAPTVTP 296 (296) Q Consensus 233 ~q-g~~aifs~dnIg~a---f~GI~taRtieSEDFdGVaLQgAgK~G~~IlddNKkAI~k~t~~~~~p 296 (296) +. +...+|.+..||-. 
.+-+++.| .++...-.+-|---||..++++++-..++-.++..-- T Consensus 210 ~p~~t~~l~~~gai~~~~~~~~~vE~~R---d~~~~~d~i~~~~~y~~~~~~~~~~v~~t~~~~s~~~ 274 (274) T protein:vir:93 210 LEAGTAILAKKGAVKLILKRDFFLEVAR---DASTKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) T ss_pred CCcceEEEEeCCeEEEEecCCccccccc---chhhcccEEEEEEEEEEEEEcCCceEEEeeCccccCC Confidence 43 67788888888753 12344444 4455667889999999999999887666633322222 No 25 >protein:vir:6324 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877471;genbank:gi:33300843;uniprot:Q7Y2D3;genbank:GeneID:1482613 Probab=96.99 E-value=0.00022 Score=40.60 Aligned_cols=250 Identities=16% Similarity=0.245 Sum_probs=143.5 Q ss_pred CCCCcc---------cceeeeechhHHHHHHHHHhhhhhhhhhcccceeeecCCcccceeEEEeecccceEecc----cc Q lcl|NC_017976. 1 MGTKNQ---------QLAAKTYQKQFKEMLQAVFSHQAYFADFFGGGIEALDGVQNNETAFYVKTSDLPVVVGT----GY 67 (296) Q Consensus 1 m~t~Nn---------n~a~r~Y~kq~~~ll~~vf~~qa~F~~~fgg~lQ~lDGVqnN~tafsvKtnd~pVvvg~----~Y 67 (296) |.+-|+ +--+-+|-|+|-+.+.+-|+.++.|++..-- +++.| - |+--.|.+ |+ ++ T Consensus 1 ms~~~~~tr~~~~~s~~d~al~le~f~geV~~af~~~s~~~~~~~~--rti~~--g-------~s~~~~~i-G~~~~~~~ 68 (335) T protein:vir:63 1 MSFLNDLTRPNYAGKNADVDIHLEEHLGIVDKHFAYTSKFAPLMNI--RDLRG--S-------NVVRLDRL-GNVEAKGR 68 (335) T ss_pred CCCcccchhhhcccccchhheehhhhhhhHHHHHHhhhhhccccce--eeecc--c-------eeEEEeee-eeeeeecc Confidence 766541 1113467799999999999999999987643 44322 1 12222332 33 23 Q ss_pred cCcccceeccccCCcccccceeEEEEeccccccccCchhhhccccccccCCHhHHHHHHHhhHHHHHHHHHHHHHhHHHh Q lcl|NC_017976. 68 NTDANVGFGTGTSKSSRFGDRQEIIYIDTPVPYTWGYNWHEGIDRYTVNNSMESAMADRVELQAQAKTMLFDKKHAEFIV 147 (296) Q Consensus 68 ~td~NvaFGtGTg~s~RFG~rkEIiy~DtdVpY~~~~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~~~gk~ls 147 (296) ..++.. .++|.-.-|-+|-.|+-+ |.--|- .=||+.+-.=|+-+.++ .-+++|-.|++|..+-.-|. 
T Consensus 69 ~pG~~l-------~~~~~~~~k~~itVD~ll-~a~~~I--~dlDe~~~~yDvRse~s---~e~G~aLA~~~D~~~~~~i~ 135 (335) T protein:vir:63 69 RAGEEL-------ERSRVVNDKWNLTVDTLL-YLRHQF--DHQDEWTQSFDMRKEVA---ELDGQELARKFDQACLIQVI 135 (335) T ss_pred cCCcCc-------CCCCccccceEEEeccee-echhhh--hhHHHHhcCchhHHHHH---HHHHHHHHHHHHHHHHHHHH Confidence 333433 223444445588888876 333332 33677776667665554 45678899999998765555 Q ss_pred hhhcchh------------------------hhhhcchhHHHHHHHHhhhhhcceeeeeeEEEEECchhhhhhhcccccc Q lcl|NC_017976. 148 ANAGKTE------------------------ALTAYDEAAVLKLFNNLSAYYINIEAIGTKVAKVGPELYNIIVDHRLTT 203 (296) Q Consensus 148 ~~A~~t~------------------------~l~~~~~~~V~klFn~~~~~yvn~ev~~~~~ayV~~evYNaIvD~~l~T 203 (296) ..|...- +-.+.-.+.+..++.++.+++|.-+...++.++|+|+.|.+|++++-.- T Consensus 136 ~aa~~~a~~~~~~~~~~G~~~~~~~tg~~~~~~~~~l~~a~~~a~~~L~e~dVP~~~~~dr~~vv~P~~y~~Ll~~~~l~ 215 (335) T protein:vir:63 136 KAAAMDAPVDLEDAFSPGVLEKLDLTGLTAKQAADKIVRMHRRVVETFIDRDLGDAVYSEGLTPMSPRVFSLLLEHDKLM 215 (335) T ss_pred hhccccCccccCCCcCCCcceeeeeccCcccccHHHHHHHHHHHHHHHHhccCCCcccCceEEEeChHHHHHHhcccccc Confidence 4443210 0011111345566788888888888888899999999999999986432 Q ss_pred ccc--ccee--eeccCcceeecCeEEEEchHHHhcCCeEEEeecceeeeccceEEEEEeeccCccceeeeecccccccCC Q lcl|NC_017976. 204 KEK--ASGA--NVDSNEIVKFKGFLIEEVPQAKLGANAALVYIKGVGKAFTGITTARTIESEDFDGVAFQGAGKAGEFIL 279 (296) Q Consensus 204 s~K--~Ss~--NiD~ngi~~fKgf~l~e~p~~y~qg~~aifs~dnIg~af~GI~taRtieSEDFdGVaLQgAgK~G~~Il 279 (296) ..- +|.. ..=.-++++.-||.|-+.|. |-... -+..-+|.+|-++. .||. -..|.|. T Consensus 216 n~~~~~s~~~~~~~~g~v~~v~Gv~V~~sn~--lP~~~--~t~~~lg~a~n~~~-------~d~~-------~~~~~~~- 276 (335) T protein:vir:63 216 NVEYQATGATNDYVKSRVAILNGVKVLETPR--FATKA--IAAHPLGRHFNVSA-------EESE-------RQIALFL- 276 (335) T ss_pred ccccccccccccccCceeEEeeceEEEeecc--CCCCC--cccccccccCCccc-------cccc-------eeEEEEE- Confidence 211 1111 11122488899999888762 21110 01111233332211 2331 1123333 Q ss_pred CCCcceEEEEecccCCC Q lcl|NC_017976. 
280 DDNKPAVVKVTAPTVTP 296 (296) Q Consensus 280 ddNKkAI~k~t~~~~~p 296 (296) .++|+..+.....+| T Consensus 277 --~~~Al~t~~~~~vt~ 291 (335) T protein:vir:63 277 --PSKTLITAQVAPVQA 291 (335) T ss_pred --ecceEEEEEEeeccc Confidence 356888888888888 No 26 >protein:vir:8885 Length: 347 # NCBI annotation: major capsid protein A # Family: family:all:975 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813774;genbank:gi:29366729;genbank:GeneID:1258837 Probab=96.73 E-value=8.7e-05 Score=42.85 Aligned_cols=269 Identities=14% Similarity=0.141 Sum_probs=135.5 Q ss_pred CC---------CCcccc------eeeeechhHHHHHHHHHhhhhhhhhhcccceeeecCCcccceeEEEeecccceEecc Q lcl|NC_017976. 1 MG---------TKNQQL------AAKTYQKQFKEMLQAVFSHQAYFADFFGGGIEALDGVQNNETAFYVKTSDLPVVVGT 65 (296) Q Consensus 1 m~---------t~Nnn~------a~r~Y~kq~~~ll~~vf~~qa~F~~~fgg~lQ~lDGVqnN~tafsvKtnd~pVvvg~ 65 (296) || | |+.. ++-+|-|+|.+.+.+-|+.++.|++..=- + +...-|.-.|. ++..+.+ + T Consensus 1 ~a~~~~~~~~~~-~~g~~~~~~d~~al~ie~~~geV~~~f~~~s~~~~~~~~--r--~i~~G~sv~~~-~iG~~~~---~ 71 (347) T protein:vir:88 1 MANATGGQQIGA-NQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMV--R--TIQNGKSASFP-VMGRTKG---Y 71 (347) T ss_pred CCCcccchhhhc-cCCCCccccchHHHHHHHHHHHHHHHHHHHhhhhhcccc--c--cccCcceEEEe-eecceee---e Confidence 43 3 2222 36789999999999999999999987643 2 21111111111 2222221 2 Q ss_pred cccCcccceeccccCCcccccceeEEEEeccccccccCchhhhccccccccCCHhHHHHHHHhhHHHHHHHHHHHHHhHH Q lcl|NC_017976. 66 GYNTDANVGFGTGTSKSSRFGDRQEIIYIDTPVPYTWGYNWHEGIDRYTVNNSMESAMADRVELQAQAKTMLFDKKHAEF 145 (296) Q Consensus 66 ~Y~td~NvaFGtGTg~s~RFG~rkEIiy~DtdVpY~~~~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~~~gk~ 145 (296) +|..+... .++.++-...++ +|-.|+..-+.+ .=+-+|+...+-|+-+.+++ .+++|=.|.+|..+-.. 
T Consensus 72 ~~~~g~~l---~~~~~~~~~~~~--~i~ID~~~y~~~---~Vdd~D~~q~~~D~r~~~~~---~~g~aLA~~~D~~i~~~ 140 (347) T protein:vir:88 72 YLAPGENL---DDKRKDIKHSEK--VIQIDGLLTSDV---LIYDIEDAMNHYDVRAEYSA---QLGEALAIAADGAVLAE 140 (347) T ss_pred eeccccCC---CCCCCCCccceE--EEEEechhhhhh---hhhhHHHHhhcCCchHHHHH---HHHHHHHHHHHHHHHHH Confidence 12322221 122222233332 233333221111 11467788888786655554 56778888999877665 Q ss_pred Hhhhhcchhhh------------------hhcc---------hhHHHHHHHHhhhhhcceeeeeeEEEEECchhhhhhhc Q lcl|NC_017976. 146 IVANAGKTEAL------------------TAYD---------EAAVLKLFNNLSAYYINIEAIGTKVAKVGPELYNIIVD 198 (296) Q Consensus 146 ls~~A~~t~~l------------------~~~~---------~~~V~klFn~~~~~yvn~ev~~~~~ayV~~evYNaIvD 198 (296) |...|...... .+.+ -+.+.++...+.+..|. ...+++.|+|+.|..|++ T Consensus 141 l~~~a~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP---~~gR~~vv~P~~y~~Ll~ 217 (347) T protein:vir:88 141 MAKLCNLPAASNENIAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVP---AGDRRFYCAPEDYSAILS 217 (347) T ss_pred HHHhhccccccccccCCccccccccccccccccchhhhHHHHHHHHHHHHHHHhhcCCC---CCCCEEEeCHHHHHHHhc Confidence 54443221100 0111 12233344444444432 235899999999999999 Q ss_pred cccccccccce-eeeccCcceeecCeEEEEchHHHhc--CCe--------------------------------EEEeec Q lcl|NC_017976. 199 HRLTTKEKASG-ANVDSNEIVKFKGFLIEEVPQAKLG--ANA--------------------------------ALVYIK 243 (296) Q Consensus 199 ~~l~Ts~K~Ss-~NiD~ngi~~fKgf~l~e~p~~y~q--g~~--------------------------------aifs~d 243 (296) ++-.+++--.+ ..+..-++.++-||.|-+.|.-=+. |.. .+|.+. T Consensus 218 ~~~~~~~~~~~~~~~~~G~vg~i~G~~V~~s~nlp~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~d~~~~~~l~~~~~ 297 (347) T protein:vir:88 218 ALMPNAANYAALIDPETGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRS 297 (347) T ss_pred chhhhhhhhccccchhcceeeeeccceEEEeecccccccccccccccccccccccccccccccccccccCcEEEEEechh Confidence 87665544333 3344334678999998887644221 110 112222 Q ss_pred ceeee-ccceEEEEEeeccCccceeeeecccccccCCCCCcceEEEEeccc Q lcl|NC_017976. 
244 GVGKA-FTGITTARTIESEDFDGVAFQGAGKAGEFILDDNKPAVVKVTAPT 293 (296) Q Consensus 244 nIg~a-f~GI~taRtieSEDFdGVaLQgAgK~G~~IlddNKkAI~k~t~~~ 293 (296) -+|.. -..+ ..+.-..++.-+-.+-|---||-=++...-.+.++.+.++ T Consensus 298 a~g~v~~~d~-~~e~~r~~~~~~d~i~~~~~~G~~~~rPe~a~~~~~~~a~ 347 (347) T protein:vir:88 298 AVGTVKLKDM-ALERARRPEFQADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) T ss_pred hhhheecccc-eeeeeechhhHHHHhhhhhhhcCceeccceEEEEEeCCCC Confidence 22111 0000 0111122333344555555566667776666666666665 No 27 >protein:vir:100057 Length: 375 # NCBI annotation: T7-like capsid protein # Family: family:all:975 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214206;genbank:gi:61806429;genbank:GeneID:3294737 Probab=96.46 E-value=0.00019 Score=40.93 Aligned_cols=264 Identities=15% Similarity=0.118 Sum_probs=132.0 Q ss_pred CCCCccc-----------------ceeeeechhHHHHHHHHHhhhhhhhhhcccceeeecCCcccceeEEEeecccceEe Q lcl|NC_017976. 1 MGTKNQQ-----------------LAAKTYQKQFKEMLQAVFSHQAYFADFFGGGIEALDGVQNNETAFYVKTSDLPVVV 63 (296) Q Consensus 1 m~t~Nnn-----------------~a~r~Y~kq~~~ll~~vf~~qa~F~~~fgg~lQ~lDGVqnN~tafsvKtnd~pVvv 63 (296) |+..|+. -+.-+|-|+|.+.+.+-|++++.|++..=- +.+-| .+...+..+ . .+-+ T Consensus 1 ~~~~~~~~~~~~n~~t~~~~~~~~~~~al~le~f~geV~~~f~~~si~~~~~~~--rti~~-Gksv~f~~i--G--~~t~ 73 (375) T protein:vir:10 1 MANANQVALGRSNLSTGTGYGGATDKYALYLKLFSGEMFKGFQHETIARDLVTK--RTLKN-GKSLQFIYT--G--RMTS 73 (375) T ss_pred CccccccccCccccCCccccccccchHHHHHHHHhHHHHHHHHHHHhhhccccc--ccccc-CceEEEEee--e--eeEE Confidence 4444432 334678899999999999999999976542 22222 111111111 1 1112 Q ss_pred cccccCcccceeccccCCcccccc--eeE-EEEeccccccccCchhhhccccccccCCHhHHHHHHHhhHHHHHHHHHHH Q lcl|NC_017976. 64 GTGYNTDANVGFGTGTSKSSRFGD--RQE-IIYIDTPVPYTWGYNWHEGIDRYTVNNSMESAMADRVELQAQAKTMLFDK 140 (296) Q Consensus 64 g~~Y~td~NvaFGtGTg~s~RFG~--rkE-Iiy~DtdVpY~~~~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~ 140 (296) .+|..+... ..+..-. -.| +|-.|+..=+.+. =+-||+.+.+=|+-+.. ..-+++|-.|.+|. 
T Consensus 74 -~~~t~G~~i-------~~~~~~d~~~te~~l~ID~~~y~~~~---VdDiD~aqa~~Dlr~e~---s~~~G~aLA~~~D~ 139 (375) T protein:vir:10 74 -SFHTPGTPI-------LGNADKAPPVAEKTIVMDDLLISSAF---VYDLDETLAHYELRGEI---SKKIGYALAEKYDR 139 (375) T ss_pred -eeecCCcCc-------CCccccCCCCCceEEEecchhhhhhh---HhhHHHHhcCchhHHHH---HHHHHHHHHHHHHH Confidence 113221111 1111100 111 3667765544332 24688888887766554 45577899999999 Q ss_pred HHhHHHhhhhcch----------------------hhhhhcch----hHHHHHHHHhhhhhcceeeeeeEEEEECchhhh Q lcl|NC_017976. 141 KHAEFIVANAGKT----------------------EALTAYDE----AAVLKLFNNLSAYYINIEAIGTKVAKVGPELYN 194 (296) Q Consensus 141 ~~gk~ls~~A~~t----------------------~~l~~~~~----~~V~klFn~~~~~yvn~ev~~~~~ayV~~evYN 194 (296) .+-.-|...|... ..-...+. +.+..+..++.++.|- .....++|+|+.|. T Consensus 140 ~i~~~l~kaa~~~~p~~~~~~~~~Gg~~i~~~sg~~~~~~~ta~~~~~ai~~a~~~Lde~~VP---~~~R~~vv~P~~y~ 216 (375) T protein:vir:10 140 LIFRSITRGARSASPVSATNFVEPGGTQIRVGSGTNESDAFTASALVNAFYDAAAAMDEKGVS---SQGRCAVLNPRQYY 216 (375) T ss_pred HHHHHHHHhhhhccccccccccccCcceeeeccccccccccCHHHHHHHHHHHHHHHhhcCCC---CCCCEEEeChHHHH Confidence 8877665443111 00011222 3455566677776665 23578999999999 Q ss_pred hhhcc---c-cccccccceeeecc-CcceeecCeEEEEchHHHhc-C-CeEEEeecceeeec--------cceEEEEE-- Q lcl|NC_017976. 195 IIVDH---R-LTTKEKASGANVDS-NEIVKFKGFLIEEVPQAKLG-A-NAALVYIKGVGKAF--------TGITTART-- 257 (296) Q Consensus 195 aIvD~---~-l~Ts~K~Ss~NiD~-ngi~~fKgf~l~e~p~~y~q-g-~~aifs~dnIg~af--------~GI~taRt-- 257 (296) +|+.+ + +....- ..-.+.. .++.+..||.|-+...--.- + +..|-.+.++-.+. .++++.=+ T Consensus 217 ~Ll~~~d~~~~~n~d~-~~~~~~~~g~v~~i~Gv~V~~Sn~lP~~~~~~~~~g~~~~~~a~~~~~~~~~~~~~~~~~~~g 295 (375) T protein:vir:10 217 ALIQDIGSNGLVNRDV-QGSALQSGNGVIEIAGIHIYKSMNIPFLGKYGVKYGGTTGETSPGNLGSHIGPTPENANATGG 295 (375) T ss_pred HHHhcCCccceeeecc-cccceeccceEEEEeceEEEEeccccccccccccccccccccchhhhhccccccCCcceeecc Confidence 99866 2 332222 1222333 45678889888776553222 1 12222222221111 11211100 Q ss_pred ---eeccCccceeeeecccccccCCCCCcceEEEEecccCCC Q lcl|NC_017976. 
258 ---IESEDFDGVAFQGAGKAGEFILDDNKPAVVKVTAPTVTP 296 (296) Q Consensus 258 ---ieSEDFdGVaLQgAgK~G~~IlddNKkAI~k~t~~~~~p 296 (296) ---+||++ ..=.-|.|. +|.|...+.+.-+++ T Consensus 296 ~~~~y~~d~~~----~~~~~~~~~---~~~A~g~v~~~~~~~ 330 (375) T protein:vir:10 296 VNNDYGTNAEL----GAKSCGLIF---QKEAAGVVEAIGPQV 330 (375) T ss_pred ccccccccccc----cCceEEEEE---chhheeeeeeecccc Confidence 00012211 000112222 666777777777777 No 28 >protein:vir:80180 Length: 381 # NCBI annotation: capsid protein # Family: family:all:2203 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285797;genbank:gi:148747831;genbank:GeneID:5220456 Probab=96.46 E-value=0.0004 Score=39.21 Aligned_cols=273 Identities=7% Similarity=-0.000 Sum_probs=121.7 Q ss_pred CCCCcccceeeeec-hhHHHHHHHHHhhhhhhhhhcccceeeecCCcccceeEEEeecccce-EecccccCcccceeccc Q lcl|NC_017976. 1 MGTKNQQLAAKTYQ-KQFKEMLQAVFSHQAYFADFFGGGIEALDGVQNNETAFYVKTSDLPV-VVGTGYNTDANVGFGTG 78 (296) Q Consensus 1 m~t~Nnn~a~r~Y~-kq~~~ll~~vf~~qa~F~~~fgg~lQ~lDGVqnN~tafsvKtnd~pV-vvg~~Y~td~NvaFGtG 78 (296) |.+..+.-.+..|- ++|.+.|...|+++..|.+...- ....| ++.+| |++..++- -+ +.|+.+....+ T Consensus 11 ~~~~~~~t~~~~fiPev~s~~v~~~l~~~lv~~~l~~~--~~~~~-~~GdT---V~ip~~g~~~a-~d~~~g~~i~~--- 80 (381) T protein:vir:80 11 KGSAVDLSNVQVFIPEVWSSEVRMFRDQKFAALEATKK--IPFEG-KKGDL---IHIPNISRAAV-YDKQPQTPVNL--- 80 (381) T ss_pred cCcccchhhHHhhhhHHHHHHHHHHHHHhhhhhhcccc--cccee-ecCce---EEeeccCccee-eeecCCCcccc--- Confidence 55555544455555 69999999999999999764322 11111 12222 22221111 12 22443322212 Q ss_pred cCCcccccceeEEEEeccccccccCchhhhccccccccCCHhHHHHHHHhhHHHHHHHHHHHHHhHHHhhhhcc------ Q lcl|NC_017976. 79 TSKSSRFGDRQEIIYIDTPVPYTWGYNWHEGIDRYTVNNSMESAMADRVELQAQAKTMLFDKKHAEFIVANAGK------ 152 (296) Q Consensus 79 Tg~s~RFG~rkEIiy~DtdVpY~~~~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~~~gk~ls~~A~~------ 152 (296) .+-...+..+-.|+..=+... ..-+|+....-|+.+.+..++. +|-.|.+++.+=..+...+.. 
T Consensus 81 ----~~~~~~~~~itID~~~~~~~~---Idd~D~~~~~~D~~~~~~~~~~---~aLA~~~D~~i~~~~~~~~~~~~~~~~ 150 (381) T protein:vir:80 81 ----QARTDSEFTFTVTKYKESSFM---IEDIVNTQASYTLRQYYTKEAG---YALARDMDNFALAHRAVINAFPSQRIY 150 (381) T ss_pred ----cccCCceEEEEEeeeeeccee---echHHHHhhccChHHHHHHHHH---HHHHHHHHHHHHHHHhhcccccccccc Confidence 111222222334443322221 2356667777777776666654 555666666553333211100 Q ss_pred -----------hhhhhhcchhHHHHHHHHhhhhhcceee-eeeEEEEECchhhhhhhccccccccccceeeeccCc-cee Q lcl|NC_017976. 153 -----------TEALTAYDEAAVLKLFNNLSAYYINIEA-IGTKVAKVGPELYNIIVDHRLTTKEKASGANVDSNE-IVK 219 (296) Q Consensus 153 -----------t~~l~~~~~~~V~klFn~~~~~yvn~ev-~~~~~ayV~~evYNaIvD~~l~Ts~K~Ss~NiD~ng-i~~ 219 (296) ....+..+.+...+.|-.+.+..-..+| ...++++|+|+.|..|.-++--+.+...+.++-.+| |-+ T Consensus 151 t~~~~i~~~~~~~~~t~~~~~~t~~~i~~a~~~Lde~~VP~egR~lvv~P~~~~~Ll~~~~~~~ad~~~~~~l~~G~Ig~ 230 (381) T protein:vir:80 151 SYDTTLGDGTVNAHLTGTPAPLTYAALLLAKQKLDEADVPQEGRIVMVSPAQYIDLLSINQFISVDFSQVKPVTSGVVGT 230 (381) T ss_pred cccccccccccccccccchhhHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhhchhhhhhhhccchhhhceeeeE Confidence 0001111122222333334444444444 235789999999999998765555443333444555 678 Q ss_pred ecCeEEEE---chHHHhcCCe-EEEeecceeeeccceEEEEEeeccCccceeeeecccccc----------------c-- Q lcl|NC_017976. 220 FKGFLIEE---VPQAKLGANA-ALVYIKGVGKAFTGITTARTIESEDFDGVAFQGAGKAGE----------------F-- 277 (296) Q Consensus 220 fKgf~l~e---~p~~y~qg~~-aifs~dnIg~af~GI~taRtieSEDFdGVaLQgAgK~G~----------------~-- 277 (296) +-||.|-+ +|..--.+-. +-..|. .+.-+|+-.+.-.++.+.+.++-.--.|+- + T Consensus 231 i~G~~Vv~Sn~lp~~~~t~~~~~agap~---~~~~~~~~~~~~g~~s~~a~av~~~k~yd~~~~~~~~~~~~~~g~~~~~ 307 (381) T protein:vir:80 231 ILGMEVIVTTQIGINSLTGYVNGQGAPT---QPTPGVLGSPYLPDQAGTANVVNTGSASDLAVSLSYFGLPVFSGAGATA 307 (381) T ss_pred EcceEEEeecccccccccceeeeccccc---cccccccccccccccccceeeeeeeeeeceeeeeeeccceeeecceeee Confidence 99999998 4431110111 111221 122233444444444444333332211111 1 Q ss_pred --------CCCCCcceEEEEe----cccCCC Q lcl|NC_017976. 
278 --------ILDDNKPAVVKVT----APTVTP 296 (296) Q Consensus 278 --------IlddNKkAI~k~t----~~~~~p 296 (296) +....++.+..+. -.+.-| T Consensus 308 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 338 (381) T protein:vir:80 308 ADGGQTLGSFGGANRWATAVVCHPDWLAVGV 338 (381) T ss_pred cCCCceeeeehhhhhhhhhcccccccccccc Confidence 1112222221111 022222 No 29 >protein:vir:94711 Length: 347 # NCBI annotation: capsid # Family: family:all:975 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338120;genbank:gi:77118198;genbank:GeneID:3707734 Probab=96.36 E-value=0.0007 Score=37.86 Aligned_cols=265 Identities=14% Similarity=0.118 Sum_probs=138.4 Q ss_pred CCCCcccce-------------eeeechhHHHHHHHHHhhhhhhhhhcccceeeecCCcccceeEEEeeccc-ceEeccc Q lcl|NC_017976. 1 MGTKNQQLA-------------AKTYQKQFKEMLQAVFSHQAYFADFFGGGIEALDGVQNNETAFYVKTSDL-PVVVGTG 66 (296) Q Consensus 1 m~t~Nnn~a-------------~r~Y~kq~~~ll~~vf~~qa~F~~~fgg~lQ~lDGVqnN~tafsvKtnd~-pVvvg~~ 66 (296) |+-.|+... +-+|-|+|.+.+-+-|+.++.|++..- .+. |+. -+ +++..-+ ++.+ .+ T Consensus 1 m~~~~~~~~~t~~g~~~~~~d~~al~ik~f~~eV~~~f~~~s~~~~~~~--~r~---i~~-G~--sv~i~~iG~~tv-~~ 71 (347) T protein:vir:94 1 MANVPGQKIGTDQGKGKSSSDALALFLKVFAGEVLTAFTRRSVTADKHI--VRT---IQN-GK--SAQFPVMGRTSG-VY 71 (347) T ss_pred CCCCCccccccccccCCccccHHHHHHHHHhHHHHHHHHHHHhhhcccc--ccc---ccc-cc--eEEEecccceee-ee Confidence 544443322 567889999999999999999997652 222 111 11 1111111 1222 22 Q ss_pred ccCcccceeccccCCcccccceeEEEEeccccccccCchhhhccccccccCCHhHHHHHHHhhHHHHHHHHHHHHHhHHH Q lcl|NC_017976. 67 YNTDANVGFGTGTSKSSRFGDRQEIIYIDTPVPYTWGYNWHEGIDRYTVNNSMESAMADRVELQAQAKTMLFDKKHAEFI 146 (296) Q Consensus 67 Y~td~NvaFGtGTg~s~RFG~rkEIiy~DtdVpY~~~~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~~~gk~l 146 (296) |..++... |+-.+ .-.-+-+|-.|+.. 
|-+.+=.-||+...+=|+-+ +....+++|=.|.++..+...| T Consensus 72 ~t~G~~l~---~~~~~--~~~~e~~itID~~~---~~~~~VddiD~~q~~~D~~~---~~~~~~g~aLa~~~D~~i~~~~ 140 (347) T protein:vir:94 72 LAPGERLS---DKRKG--IKHTEKVITIDGLL---TADVMIFDIEDAMNHYDVAG---EYSNQLGEALAIAADGAVLAEM 140 (347) T ss_pred ecCCCCcC---CCCCC--CCcceEEEEecchh---hhhHHhhhHHHHhcCcchHH---HHHHHHHHHHHHHHHHHHHHHH Confidence 44322110 00001 11112245666543 33334457888888877655 5666788999999998876544 Q ss_pred hhhhcc------------------------hhhhhhcchhHHHHHHHHhhhhhcceeee-eeEEEEECchhhhhhhcccc Q lcl|NC_017976. 147 VANAGK------------------------TEALTAYDEAAVLKLFNNLSAYYINIEAI-GTKVAKVGPELYNIIVDHRL 201 (296) Q Consensus 147 s~~A~~------------------------t~~l~~~~~~~V~klFn~~~~~yvn~ev~-~~~~ayV~~evYNaIvD~~l 201 (296) ...+.. +.+ ...+.+.+.+.+-.+.+..-+..|- ..+.+.|+|+.|-.|++++. T Consensus 141 ~~~aa~~~~~~~~~~g~~~~s~~~~~~~~~~~~-~~~~~~~~~~~i~~a~~~Lde~~VP~~~R~~vv~P~~~~~Ll~~~~ 219 (347) T protein:vir:94 141 AILCNLPAASNENIAGLGTASVLEVGKKADLDT-PAKLGEAIIGQLTIARAKLTSNYVPAGDRYFYTTPDNYSAILAALM 219 (347) T ss_pred HHHhccccccccccCCCcccceeeccccccccc-hhhhHHHHHHHHHHHHHHHhhcCCCCCCcEEEeCHHHHHHHhccch Confidence 321110 000 0111233333333344444433442 35799999999999999988 Q ss_pred ccccccce-eeeccCcceeecCeEEEEchHHHh-------cCC-e---------------------------EEEeecce Q lcl|NC_017976. 202 TTKEKASG-ANVDSNEIVKFKGFLIEEVPQAKL-------GAN-A---------------------------ALVYIKGV 245 (296) Q Consensus 202 ~Ts~K~Ss-~NiD~ngi~~fKgf~l~e~p~~y~-------qg~-~---------------------------aifs~dnI 245 (296) .++.-.++ ..++.-.|.++-||.|-+.|.-=. +|+ . .+|.|+-+ T Consensus 220 ~~~~~~~~~~~~~~G~Vg~i~G~~V~~Sn~lp~~~~t~~~~~~~~~~~aG~~~~~~~~~~~~~~~~~~~~~~l~~h~~A~ 299 (347) T protein:vir:94 220 PNAANYAALIDPETGNIRNVMGFVVVEVPHLVQGGAGETRGDDGITIASGQKHAFPATASSDVKVTMDNVVGLFSHRSAV 299 (347) T ss_pred hhhhhccccccccccceEEEeceEEEecCcccccccccccccCcceecCcccccccccchhhhcccccceeEEEeehhhh Confidence 76654333 334434567999999988763311 111 1 22222222 Q ss_pred eeeccceEEEEEe--eccCcc-----ceeeeecccccccCCCCCcceEEEEeccc Q lcl|NC_017976. 
246 GKAFTGITTARTI--ESEDFD-----GVAFQGAGKAGEFILDDNKPAVVKVTAPT 293 (296) Q Consensus 246 g~af~GI~taRti--eSEDFd-----GVaLQgAgK~G~~IlddNKkAI~k~t~~~ 293 (296) | +++.| +.|-|. +=.+-|---||-=++.-.-.+.++++.+- T Consensus 300 ~-------~v~~~~~~~e~~r~~~~~~d~i~~~~~~G~~~~rP~~a~~~~~~~A~ 347 (347) T protein:vir:94 300 G-------TVKLRDLALERDRDVDAQGDLIVGKYAMGHGGLRPEAAGALVFSPAE 347 (347) T ss_pred h-------hhhcccccccchhchhhHHHHhhhhhhhcCcccccceeEEEEecCCC Confidence 2 22233 344444 33333444455556655556666666444 No 30 >protein:vir:80930 Length: 278 # NCBI annotation: Cps # Family: family:all:522 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468392;genbank:gi:157324966;genbank:GeneID:5601363 Probab=96.33 E-value=0.00073 Score=37.77 Aligned_cols=265 Identities=13% Similarity=0.081 Sum_probs=145.9 Q ss_pred CCCCcccceeeeechh-HHHHHHHHHhhhhhhhhhcccceeeecCCcccceeEEEeecccceE--ecc--cccCccccee Q lcl|NC_017976. 1 MGTKNQQLAAKTYQKQ-FKEMLQAVFSHQAYFADFFGGGIEALDGVQNNETAFYVKTSDLPVV--VGT--GYNTDANVGF 75 (296) Q Consensus 1 m~t~Nnn~a~r~Y~kq-~~~ll~~vf~~qa~F~~~fgg~lQ~lDGVqnN~tafsvKtnd~pVv--vg~--~Y~td~NvaF 75 (296) ||..=- ...-+..|| |..+++.=|.++..|.+..=- .-.+.|..- ++ |+ +|.. +|+ .|..+..... T Consensus 1 Ma~~~T-~~~~~iiPev~s~~v~~~~~~~~v~~~~~~~-~~~l~g~~G-~t---v~---ip~~~~~g~a~~~~~g~~i~~ 71 (278) T protein:vir:80 1 MADLTT-KLANLIDPEVMGPMISAKLPKAIKFGKIAPI-DNSLEGQPG-SE---IT---VPKYKYIGDAQDVAEGAAIDY 71 (278) T ss_pred CCCcce-ehhheecHHHHHHHHHHHHHHhhhhccccee-cccccCCCC-CE---EE---EeeeccCCcceeecCCCcCcc Confidence 663100 112235554 888888888887777653211 112222211 11 11 2221 222 2443333333 Q ss_pred ccccCCcccccceeEEEEeccccccccCchhhhccccccccCCHhHHHHHHHhhHHHHHHHHHHHHHhHHHhhhhcchhh Q lcl|NC_017976. 76 GTGTSKSSRFGDRQEIIYIDTPVPYTWGYNWHEGIDRYTVNNSMESAMADRVELQAQAKTMLFDKKHAEFIVANAGKTEA 155 (296) Q Consensus 76 GtGTg~s~RFG~rkEIiy~DtdVpY~~~~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~~~gk~ls~~A~~t~~ 155 (296) ..-| +++.+-.| +. +-+.|.+.. +|+.....|+ +++.++-++.+|.|.+++.+-..|..+..+... 
T Consensus 72 ~~lt-----~~~~~~~i--~~---~~~a~~v~D-~~~~~~~~d~---~~~~~~~~a~~~a~~~d~~l~~~l~~a~~~~~~ 137 (278) T protein:vir:80 72 SALE-----TESVKHGI--KK---AGKGVKLTD-ESVLSGYGDP---VEEAQKQIRMAIASKVDNDILEEALTTTLEVKG 137 (278) T ss_pred cccc-----cceeeEee--eh---hhccccccH-HHHhhccccH---HHHHHHHHHHHHHHHHHHHHHHHHhcccccccc Confidence 3322 22222222 11 223455544 5666666665 566777789999999999888877554322110 Q ss_pred -hhhcchhHHHHHHHHhhhhhcceeeeeeEEEEECchhhhhhhccccc--cccccceeeeccCcce-eecCeEEEEchHH Q lcl|NC_017976. 156 -LTAYDEAAVLKLFNNLSAYYINIEAIGTKVAKVGPELYNIIVDHRLT--TKEKASGANVDSNEIV-KFKGFLIEEVPQA 231 (296) Q Consensus 156 -l~~~~~~~V~klFn~~~~~yvn~ev~~~~~ayV~~evYNaIvD~~l~--Ts~K~Ss~NiD~ngi~-~fKgf~l~e~p~~ 231 (296) ....+.+....+|..+..+.-...+..+.++.|+|++|..|.-.++. ++.....-++=.||.+ +|.||.|-+.+. T Consensus 138 ~~t~~~~~~~~~~~~da~~~l~~~~~~~~~~ivv~p~~~~~L~k~~~~~~~~~~~~g~~~~~~G~ig~~~G~~Vi~s~~- 216 (278) T protein:vir:80 138 AINIGLIDKIENTFTDAPDAIEDESITTTGVLFLNYKDTAKLREEAAGSWTKASQLGDDLLVKGAFGELLGWEIVRTKK- 216 (278) T ss_pred ccccchhhhHHHHHHHHHHhhcccCCCcccEEEECHHHHHHHHhhhhhhccccccccccceeeccceeecceeEEEcCC- Confidence 11122233456676665554433344456789999999999755422 2222222244445554 899999987653 Q ss_pred Hhc-CCeEEEeecceee---eccceEEEEEeeccCccceeeeecccccccCCCCCcceEEEEeccc Q lcl|NC_017976. 232 KLG-ANAALVYIKGVGK---AFTGITTARTIESEDFDGVAFQGAGKAGEFILDDNKPAVVKVTAPT 293 (296) Q Consensus 232 y~q-g~~aifs~dnIg~---af~GI~taRtieSEDFdGVaLQgAgK~G~~IlddNKkAI~k~t~~~ 293 (296) +- |...+|.+..||- +.+-+++-| .++.-.-.|.+---||..+++..+...++..+.. 
T Consensus 217 -~p~~t~~l~~~gAi~~~~~~~~~vE~~R---d~~~~~d~i~~~~~yg~~v~~~~~~v~it~~a~~ 278 (278) T protein:vir:80 217 -LADGNALAVKAGALKTFLKRNLLAESGR---DMDHKLTKFNADQHYAVALVDETKAVKVVPVAGN 278 (278) T ss_pred -CCcceEEEEeccceeeeecCCccccccc---chhhccceeeeeeEEEEEEEcCcceEEEeeccCC Confidence 23 6678888887763 222344444 3444556788888899999988877666666665 No 31 >protein:vir:96833 Length: 275 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240157;genbank:gi:66395822;genbank:GeneID:5133174 Probab=96.26 E-value=0.00081 Score=37.52 Aligned_cols=263 Identities=17% Similarity=0.094 Sum_probs=145.2 Q ss_pred CCCCcccceeeeech-hHHHHHHHHHhhhhhhhhhcccceeeecCCcccceeEEEeecccceEecc--cccCcccceecc Q lcl|NC_017976. 1 MGTKNQQLAAKTYQK-QFKEMLQAVFSHQAYFADFFGGGIEALDGVQNNETAFYVKTSDLPVVVGT--GYNTDANVGFGT 77 (296) Q Consensus 1 m~t~Nnn~a~r~Y~k-q~~~ll~~vf~~qa~F~~~fgg~lQ~lDGVqnN~tafsvKtnd~pVvvg~--~Y~td~NvaFGt 77 (296) |+-.|.-.-.-+-.| -|..+++.-+.++..|.+..=- .-.+.|..- .+.=.-|.+.+ |+ .|..+...+.+. T Consensus 1 ~~~~~~T~l~d~i~PEv~~~~v~~~~~~~~~~~~~~~~-~~~l~g~~G-~tv~iP~~~~i----g~a~~~~~g~~i~~~~ 74 (275) T protein:vir:96 1 MALENMTKLANMVNPEVLAPMMQAELDKKLKFAQFADI-DNTLVGQPG-NTITFPAFVYS----GDAKVVPEGEEIPIDL 74 (275) T ss_pred CCCcccchhhhhhchHHHHHHHHHHHHHhhhhccccee-cccccCCCC-CEEEeeeeccC----CccccccCCCCcchhh Confidence 666665555555555 4777777778877777543111 112223211 11101111111 11 133333333222 Q ss_pred ccCCcccccceeEEEEeccccccccCchhhhccccccccCCHhHHHHHHHhhHHHHHHHHHHHHHhHHHhhhhcchhhhh Q lcl|NC_017976. 78 GTSKSSRFGDRQEIIYIDTPVPYTWGYNWHEGIDRYTVNNSMESAMADRVELQAQAKTMLFDKKHAEFIVANAGKTEALT 157 (296) Q Consensus 78 GTg~s~RFG~rkEIiy~DtdVpY~~~~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~~~gk~ls~~A~~t~~l~ 157 (296) -|.. +.+-.| .. +-+.|.+.. +++.....|+ ++++++.++.++.+.++..+-..|........+ . 
T Consensus 75 lt~~-----~~~~~i---~~--~~~~~~i~D-~~~~~~~~d~---~~~~~~~~a~~~a~~~d~~ll~~l~~a~~~~~~-~ 139 (275) T protein:vir:96 75 IETK-----KRQATI---RK--IGKGTVLTD-EALLSGYGDP---KGEAVRQHGLAIANKVDNDVLEALQGATLKVEA-D 139 (275) T ss_pred cccc-----eeeEEe---eh--hcccccccH-HHHHhhccch---HHHHHHHHHHHHHHHHHHHHHHHHhcccccccc-c Confidence 2211 111111 11 233454443 4555555555 777888899999999999988877665544432 4 Q ss_pred hcchhHHHHHHHHhhhhhcceeeeeeEEEEECchhhhhhhccccc--cccccceeeeccCc-ceeecCeEEEEchHHHhc Q lcl|NC_017976. 158 AYDEAAVLKLFNNLSAYYINIEAIGTKVAKVGPELYNIIVDHRLT--TKEKASGANVDSNE-IVKFKGFLIEEVPQAKLG 234 (296) Q Consensus 158 ~~~~~~V~klFn~~~~~yvn~ev~~~~~ayV~~evYNaIvD~~l~--Ts~K~Ss~NiD~ng-i~~fKgf~l~e~p~~y~q 234 (296) .++.+.+...-..+.. +-..+..+.|+|++|..|.-+++- +.+..+.-++=.|| |-+|.|+.+-+... +- T Consensus 140 ~~~~d~i~dA~~~lgd-----~~~~~~~ivv~p~~~~~L~k~~~~~f~~~~~~g~~~~~~G~ig~~~G~~Vi~s~~--~p 212 (275) T protein:vir:96 140 ITKLAGLQTAIDKFND-----EDLEPMVLFVNPLDAGKLRASATDNFTRATLLGDNVIVKGAFGEALGAIIVRSNK--IK 212 (275) T ss_pred ccCHHHHHHHHHHhcc-----ccCCccEEEeCHHHHHHHHhcccccccccccccccceeccccceecCeeEEEeCC--CC Confidence 5566666554444432 334577899999999999666432 22333333444466 56899999977652 33 Q ss_pred -CCeEEEeecceeee---ccceEEEEEeeccCccceeeeecccccccCCCCCcceEEEEecccCCC Q lcl|NC_017976. 235 -ANAALVYIKGVGKA---FTGITTARTIESEDFDGVAFQGAGKAGEFILDDNKPAVVKVTAPTVTP 296 (296) Q Consensus 235 -g~~aifs~dnIg~a---f~GI~taRtieSEDFdGVaLQgAgK~G~~IlddNKkAI~k~t~~~~~p 296 (296) +...+|.+..+|.. -+-+++-|-+++ ---.|.+---||..++++.|-+.++. 
+..+- T Consensus 213 ~~t~~i~~~gA~~~~~~~~~~vE~~Rd~~~---~~d~i~~~~~y~~~~~~~~~vv~~t~--~~~~~ 273 (275) T protein:vir:96 213 EGEAILAKRGAVKLITKRDFFLETERHASH---KSTALFSDKHYVAYLYDESKVVKITK--SASGL 273 (275) T ss_pred cceEEEEeccceeeeecCCcccccccchhh---cCcEEEEeEEEEEEEEcCccEEEEEe--ccccc Confidence 66677777766642 233566665444 34677888889999997766555544 22222 No 32 >protein:vir:94494 Length: 274 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240676;genbank:gi:66396348;genbank:GeneID:5133758 Probab=96.15 E-value=0.00094 Score=37.17 Aligned_cols=263 Identities=14% Similarity=0.104 Sum_probs=147.4 Q ss_pred CCCCcccceeeeechhHHHHHHHHHhhhhhhhhhcccceeeecCCcccceeEEEeecccceE--eccc--ccCcccceec Q lcl|NC_017976. 1 MGTKNQQLAAKTYQKQFKEMLQAVFSHQAYFADFFGGGIEALDGVQNNETAFYVKTSDLPVV--VGTG--YNTDANVGFG 76 (296) Q Consensus 1 m~t~Nnn~a~r~Y~kq~~~ll~~vf~~qa~F~~~fgg~lQ~lDGVqnN~tafsvKtnd~pVv--vg~~--Y~td~NvaFG 76 (296) |+..---++==+--..|..++..=+.++-.|.+..-- .-.+.|..-+ + | .+|.. +|+. |..++....+ T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~l~~~~~~~~-d~~l~g~~G~-t---v---~iP~~~~~g~a~~~~~g~~i~~~ 72 (274) T protein:vir:94 1 MPQGLTKTSDQIIPEVLAPMMQAQLEKKLRFASFAEV-DSTLQGQPGD-T---L---TFPAFVYSGDAQVVAEGEKIPTD 72 (274) T ss_pred CCccceehhheechHHHHHHHHHhhhhhhhhccccee-cccccCCCCC-E---E---EEeeecCCCccccccCCCccccc Confidence 7764322222222335677776666666555443211 1122232111 1 1 22321 1221 3333333332 Q ss_pred cccCCcccccceeEEEEeccccccccCchhhhccccccccCCHhHHHHHHHhhHHHHHHHHHHHHHhHHHhhhhcchhhh Q lcl|NC_017976. 77 TGTSKSSRFGDRQEIIYIDTPVPYTWGYNWHEGIDRYTVNNSMESAMADRVELQAQAKTMLFDKKHAEFIVANAGKTEAL 156 (296) Q Consensus 77 tGTg~s~RFG~rkEIiy~DtdVpY~~~~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~~~gk~ls~~A~~t~~l 156 (296) .-| .++.+-.|.. +-+.|.+.. +++..-..|+ +++.++.++.++.+.++..+-..|..+...... 
T Consensus 73 ~lt-----~~~~~~~i~~-----~~~~~~i~D-~~~~~~~~dp---~~~~~~~~a~a~a~~vd~~~~~~l~~a~~~~~~- 137 (274) T protein:vir:94 73 ILE-----TKKREAKIRK-----IAKGTSITD-EALLSGYGDP---QGEQVRQHGLAHANKVDNDVLEALMGAKLTVNA- 137 (274) T ss_pred ccc-----cceeEEEeee-----ecceecccH-HHHHhccchH---HHHHHHHHHHHHHHHHHHHHHHHHhccCccccc- Confidence 222 2222222211 234566665 5666666665 677788889999999999999998877665433 Q ss_pred hhcchhHHHHHHHHhhhhhcceeeeeeEEEEECchhhhhhhccccccccccce--eeeccCcce-eecCeEEEEchHHHh Q lcl|NC_017976. 157 TAYDEAAVLKLFNNLSAYYINIEAIGTKVAKVGPELYNIIVDHRLTTKEKASG--ANVDSNEIV-KFKGFLIEEVPQAKL 233 (296) Q Consensus 157 ~~~~~~~V~klFn~~~~~yvn~ev~~~~~ayV~~evYNaIvD~~l~Ts~K~Ss--~NiD~ngi~-~fKgf~l~e~p~~y~ 233 (296) ..++.+.+...-..+. -+-..+..+.|+|++|..|.-.++-...+.|. -++=.||.+ +|.||.|-+.+. + T Consensus 138 ~~~~~d~i~dA~~~l~-----d~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~--~ 210 (274) T protein:vir:94 138 DITKLNGLQSAIDKFN-----DEDLEPMVLFVNPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNK--L 210 (274) T ss_pred cccCHHHHHHHHHHhh-----ccCCCceEEEeCHHHHHHHHhhhhhhccccCcccccceeccccceecCeeEEEcCC--C Confidence 4445555544333333 23346688999999999998654333222222 233345544 899999987653 3 Q ss_pred c-CCeEEEeecceeee---ccceEEEEEeeccCccceeeeecccccccCCCCCcceEEEEecccCCC Q lcl|NC_017976. 234 G-ANAALVYIKGVGKA---FTGITTARTIESEDFDGVAFQGAGKAGEFILDDNKPAVVKVTAPTVTP 296 (296) Q Consensus 234 q-g~~aifs~dnIg~a---f~GI~taRtieSEDFdGVaLQgAgK~G~~IlddNKkAI~k~t~~~~~p 296 (296) . +...+|.+..+|.. 
-+-+++.|- ++.---.|-+---||..++++.|-+.++-+.+..-- T Consensus 211 p~~t~~l~~~gA~~~~~~~~~~vE~~Rd---~~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~~~~ 274 (274) T protein:vir:94 211 EAGTAILAKKGAVKLILKRDFFLEVARD---ASTKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) T ss_pred CcceEEEEeCcceEeeecCCceeccccc---hhhcccEEEEEEEEEEEEEcCCceEEEecCcccccC Confidence 3 66788888777742 223555554 334456788888999999999887777644333333 No 33 >protein:vir:97433 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240749;genbank:gi:66396420;genbank:GeneID:5133789 Probab=96.15 E-value=0.00094 Score=37.17 Aligned_cols=263 Identities=14% Similarity=0.104 Sum_probs=147.4 Q ss_pred CCCCcccceeeeechhHHHHHHHHHhhhhhhhhhcccceeeecCCcccceeEEEeecccceE--eccc--ccCcccceec Q lcl|NC_017976. 1 MGTKNQQLAAKTYQKQFKEMLQAVFSHQAYFADFFGGGIEALDGVQNNETAFYVKTSDLPVV--VGTG--YNTDANVGFG 76 (296) Q Consensus 1 m~t~Nnn~a~r~Y~kq~~~ll~~vf~~qa~F~~~fgg~lQ~lDGVqnN~tafsvKtnd~pVv--vg~~--Y~td~NvaFG 76 (296) |+..---++==+--..|..++..=+.++-.|.+..-- .-.+.|..-+ + | .+|.. +|+. |..++....+ T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~l~~~~~~~~-d~~l~g~~G~-t---v---~iP~~~~~g~a~~~~~g~~i~~~ 72 (274) T protein:vir:97 1 MPQGLTKTSDQIIPEVLAPMMQAQLEKKLRFASFAEV-DSTLQGQPGD-T---L---TFPAFVYSGDAQVVAEGEKIPTD 72 (274) T ss_pred CCccceehhheechHHHHHHHHHhhhhhhhhccccee-cccccCCCCC-E---E---EEeeecCCCccccccCCCccccc Confidence 7764322222222335677776666666555443211 1122232111 1 1 22321 1221 3333333332 Q ss_pred cccCCcccccceeEEEEeccccccccCchhhhccccccccCCHhHHHHHHHhhHHHHHHHHHHHHHhHHHhhhhcchhhh Q lcl|NC_017976. 77 TGTSKSSRFGDRQEIIYIDTPVPYTWGYNWHEGIDRYTVNNSMESAMADRVELQAQAKTMLFDKKHAEFIVANAGKTEAL 156 (296) Q Consensus 77 tGTg~s~RFG~rkEIiy~DtdVpY~~~~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~~~gk~ls~~A~~t~~l 156 (296) .-| .++.+-.|.. +-+.|.+.. +++..-..|+ +++.++.++.++.+.++..+-..|..+...... 
T Consensus 73 ~lt-----~~~~~~~i~~-----~~~~~~i~D-~~~~~~~~dp---~~~~~~~~a~a~a~~vd~~~~~~l~~a~~~~~~- 137 (274) T protein:vir:97 73 ILE-----TKKREAKIRK-----IAKGTSITD-EALLSGYGDP---QGEQVRQHGLAHANKVDNDVLEALMGAKLTVNA- 137 (274) T ss_pred ccc-----cceeEEEeee-----ecceecccH-HHHHhccchH---HHHHHHHHHHHHHHHHHHHHHHHHhccCccccc- Confidence 222 2222222211 234566665 5666666665 677788889999999999999998877665433 Q ss_pred hhcchhHHHHHHHHhhhhhcceeeeeeEEEEECchhhhhhhccccccccccce--eeeccCcce-eecCeEEEEchHHHh Q lcl|NC_017976. 157 TAYDEAAVLKLFNNLSAYYINIEAIGTKVAKVGPELYNIIVDHRLTTKEKASG--ANVDSNEIV-KFKGFLIEEVPQAKL 233 (296) Q Consensus 157 ~~~~~~~V~klFn~~~~~yvn~ev~~~~~ayV~~evYNaIvD~~l~Ts~K~Ss--~NiD~ngi~-~fKgf~l~e~p~~y~ 233 (296) ..++.+.+...-..+. -+-..+..+.|+|++|..|.-.++-...+.|. -++=.||.+ +|.||.|-+.+. + T Consensus 138 ~~~~~d~i~dA~~~l~-----d~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~--~ 210 (274) T protein:vir:97 138 DITKLNGLQSAIDKFN-----DEDLEPMVLFVNPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNK--L 210 (274) T ss_pred cccCHHHHHHHHHHhh-----ccCCCceEEEeCHHHHHHHHhhhhhhccccCcccccceeccccceecCeeEEEcCC--C Confidence 4445555544333333 23346688999999999998654333222222 233345544 899999987653 3 Q ss_pred c-CCeEEEeecceeee---ccceEEEEEeeccCccceeeeecccccccCCCCCcceEEEEecccCCC Q lcl|NC_017976. 234 G-ANAALVYIKGVGKA---FTGITTARTIESEDFDGVAFQGAGKAGEFILDDNKPAVVKVTAPTVTP 296 (296) Q Consensus 234 q-g~~aifs~dnIg~a---f~GI~taRtieSEDFdGVaLQgAgK~G~~IlddNKkAI~k~t~~~~~p 296 (296) . +...+|.+..+|.. 
-+-+++.|- ++.---.|-+---||..++++.|-+.++-+.+..-- T Consensus 211 p~~t~~l~~~gA~~~~~~~~~~vE~~Rd---~~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~~~~ 274 (274) T protein:vir:97 211 EAGTAILAKKGAVKLILKRDFFLEVARD---ASTKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) T ss_pred CcceEEEEeCcceEeeecCCceeccccc---hhhcccEEEEEEEEEEEEEcCCceEEEecCcccccC Confidence 3 66788888777742 223555554 334456788888999999999887777644333333 No 34 >protein:vir:1541 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052109;swissprot:trembl:q9t107;genbank:gi:9634035;uniprot:Q9T107;genbank:GeneID:1262383 Probab=95.74 E-value=0.0015 Score=36.02 Aligned_cols=272 Identities=13% Similarity=0.109 Sum_probs=130.2 Q ss_pred CCCC--------cccce-----ee-eechhHHHHHHHHHhhhhhhhhhcccceeeecCCcccceeEEEeecccc-eEecc Q lcl|NC_017976. 1 MGTK--------NQQLA-----AK-TYQKQFKEMLQAVFSHQAYFADFFGGGIEALDGVQNNETAFYVKTSDLP-VVVGT 65 (296) Q Consensus 1 m~t~--------Nnn~a-----~r-~Y~kq~~~ll~~vf~~qa~F~~~fgg~lQ~lDGVqnN~tafsvKtnd~p-Vvvg~ 65 (296) |+.. +..++ +. +|-|+|.+.+-+-|++++.|++.+=- +...+ -| +++..-+. +.+ + T Consensus 1 ma~~~~~~~~~t~~~~~~~~~~~~a~~ie~f~g~V~~~f~~~s~~~~~~~~--~~~~~--G~----sv~i~~ig~~t~-~ 71 (347) T protein:vir:15 1 MANIQGGQQIGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTMPRHML--RSIAS--GK----SAQFPVIGRTKA-A 71 (347) T ss_pred CCccccCCccccccccCCCcchHHHHHHHHHHHHHHHHHHHhhhhhhcccc--ccccc--cc----eeEeeeccceee-e Confidence 4321 11221 23 78899999999999999999987642 21111 11 22222221 223 3 Q ss_pred cccCcccceeccccCCcccc--cceeEEEEeccccccccCchhhhccccccccCCHhHHHHHHHhhHHHHHHHHHHHHHh Q lcl|NC_017976. 66 GYNTDANVGFGTGTSKSSRF--GDRQEIIYIDTPVPYTWGYNWHEGIDRYTVNNSMESAMADRVELQAQAKTMLFDKKHA 143 (296) Q Consensus 66 ~Y~td~NvaFGtGTg~s~RF--G~rkEIiy~DtdVpY~~~~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~~~g 143 (296) +|..+....- ++- -.-+-++-.|+..-+. 
+.| +-||+...+-|+- ++...-++.|=.|.+|..+- T Consensus 72 ~~~~g~~l~~-------~~~~~~~~e~~ltID~~~~~~--~~V-ddlD~~q~~~D~~---~~~~~~~g~aLA~~~D~~i~ 138 (347) T protein:vir:15 72 YLKPGENLDD-------KRKDIKHTEKVIHIDGLLTAD--VLI-YDIEDAMNHYDVR---AEYTAQLGESLAMAADGAVL 138 (347) T ss_pred eeccCCCCCC-------CCCCCccceEEEEechhhhhh--HHh-hhHHHHhcCCcch---HHHHHHHHHHHHHHHHHHHH Confidence 3444433311 111 1112235556554333 223 6888888887754 45566688899999998887 Q ss_pred HHHhhhhcc--------------------hhhhhhcc-----hhHHHHHHHHhhhhhcceee-eeeEEEEECchhhhhhh Q lcl|NC_017976. 144 EFIVANAGK--------------------TEALTAYD-----EAAVLKLFNNLSAYYINIEA-IGTKVAKVGPELYNIIV 197 (296) Q Consensus 144 k~ls~~A~~--------------------t~~l~~~~-----~~~V~klFn~~~~~yvn~ev-~~~~~ayV~~evYNaIv 197 (296) ..|...+.. +.+-++.+ .+.+..++-.+.+..-...| .....+.|+|+.|.+|. T Consensus 139 ~~l~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~d~~~~a~~~Lde~~VP~~gR~~vv~P~~y~~LL 218 (347) T protein:vir:15 139 AELAGLVNLPDASNENIEGLGKPTVLTLVKPTTGDLTDPVELGKAIIAQLTIARASLTKNYVPAADRTFYTTPDNYSAIL 218 (347) T ss_pred HHHHHHhhccccccccccccCccccccccccccccchhhhhHHHHHHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHh Confidence 666432110 00001112 23344444444444444444 24588999999999999 Q ss_pred ccccccccccceeeeccCcce-eecCeEEEEchHHHhc--CCeEEEeecceeeeccceEEEEEeeccCcccee------- Q lcl|NC_017976. 198 DHRLTTKEKASGANVDSNEIV-KFKGFLIEEVPQAKLG--ANAALVYIKGVGKAFTGITTARTIESEDFDGVA------- 267 (296) Q Consensus 198 D~~l~Ts~K~Ss~NiD~ngi~-~fKgf~l~e~p~~y~q--g~~aifs~dnIg~af~GI~taRtieSEDFdGVa------- 267 (296) .++-.++.-..+...-.+|.+ +.-||.|-+.+.--.. 
++.....+.|-+-++..=.+..+ .++|+..+ T Consensus 219 ~~~~~~~~d~~~~~~~~~G~Vg~i~G~~V~~Sn~lp~~~~t~~~~~~~~g~~~~~~~~~~~~~--~~~f~~~~~l~~h~~ 296 (347) T protein:vir:15 219 AALMPNAANYQALIDHERGTIRNVMGFEVVEVPHLTAGGAGDTREDAPADQKHAFPATSSTTV--KVALDNVVGLFQHRS 296 (347) T ss_pred cccccccccccccccccceEEEEEeceEEEeccccccccccccccccccccccccccccccee--eeccccceeeeeccc Confidence 886555433333344567754 8889988885542221 11111111111111110000000 11221111 Q ss_pred ------eee--------cccccccCCCC--------CcceEEEEecccCCC Q lcl|NC_017976. 268 ------FQG--------AGKAGEFILDD--------NKPAVVKVTAPTVTP 296 (296) Q Consensus 268 ------LQg--------AgK~G~~Ildd--------NKkAI~k~t~~~~~p 296 (296) +|. .-..+.+|... +-.+.+...+++..- T Consensus 297 A~g~v~~~~~~~e~~~~~~~~~d~i~~~~~~G~~vlrP~~av~~~~~~~~~ 347 (347) T protein:vir:15 297 AVGTVKLKDLALERARRANYQADQIIAKYAMGHGGLRPEAAGAIVLPKVSE 347 (347) T ss_pred eeeeeEeeceeeeecccchhhhhhhehhhhcCCceeccccEEEEecCCCCC Confidence 000 01122333322 223344444444444 No 35 >protein:vir:78935 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522824;genbank:gi:158345059;genbank:GeneID:5687425 Probab=95.60 E-value=0.0018 Score=35.67 Aligned_cols=248 Identities=16% Similarity=0.276 Sum_probs=136.6 Q ss_pred CCCCc-----------ccceeeeechhHHHHHHHHHhhhhhhhhhcccceeeecCCcccceeEEEeecccceEeccc--- Q lcl|NC_017976. 1 MGTKN-----------QQLAAKTYQKQFKEMLQAVFSHQAYFADFFGGGIEALDGVQNNETAFYVKTSDLPVVVGTG--- 66 (296) Q Consensus 1 m~t~N-----------nn~a~r~Y~kq~~~ll~~vf~~qa~F~~~fgg~lQ~lDGVqnN~tafsvKtnd~pVvvg~~--- 66 (296) |.+-| .+. -+|-|+|-+.+.+-|+.++.|++.+-- +++.| - |+--.| .+|+. 
T Consensus 1 ms~~~~~t~~~~~~s~~d~--al~le~f~geV~~af~~~s~~~~~~~~--rti~~--g-------~s~~~~-~iG~~~~~ 66 (335) T protein:vir:78 1 MSFLNDLTRPNYAGKNADV--DIHLEEHLGIVDKHFAYTSKFAPLMNI--RDLRG--S-------NVVRLD-RLGNVEAK 66 (335) T ss_pred CCccccccccccccccchh--hhhhhhhhhHHHHHHHHhhhhccccce--eeecc--c-------eeEEEe-eeeeeeec Confidence 66554 333 477799999999999999999987643 33322 1 223334 23441 Q ss_pred -ccCcccceeccccCCcccccceeEEEEeccccccccCchhhhccccccccCCHhHHHHHHHhhHHHHHHHHHHHHHhHH Q lcl|NC_017976. 67 -YNTDANVGFGTGTSKSSRFGDRQEIIYIDTPVPYTWGYNWHEGIDRYTVNNSMESAMADRVELQAQAKTMLFDKKHAEF 145 (296) Q Consensus 67 -Y~td~NvaFGtGTg~s~RFG~rkEIiy~DtdVpY~~~~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~~~gk~ 145 (296) ...++.. .++|.-.-|-+|-.|+-+ |.--| =.=||..+-+=|+-+.+++ -+++|-.|++|..+-.. T Consensus 67 ~~~pG~~l-------~~~~~~~~k~~itID~ll-~a~~~--VddlDe~~~~yDvR~e~s~---~~G~aLA~~~Dq~~~~~ 133 (335) T protein:vir:78 67 GRRAGEEL-------ERSRVVNDKWNLTVDTLL-YLRHQ--FDHQDEWTQSFDMRKEVAE---LDGQELARKFDQACLIQ 133 (335) T ss_pred ccccCccc-------CCCCcccCCeEEEeccee-echhh--HhhHHHhhcCchhHHHHHH---HHHHHHHHHHHHHHHHH Confidence 2222222 233454545588889877 33333 2347777777777776665 46788999999988766 Q ss_pred Hhhhhcchh----------------hhh----hcchhHHHHHHHHhhhhhcceee----eeeEEEEECchhhhhhhcccc Q lcl|NC_017976. 146 IVANAGKTE----------------ALT----AYDEAAVLKLFNNLSAYYINIEA----IGTKVAKVGPELYNIIVDHRL 201 (296) Q Consensus 146 ls~~A~~t~----------------~l~----~~~~~~V~klFn~~~~~yvn~ev----~~~~~ayV~~evYNaIvD~~l 201 (296) |...|...- .+. 
.-+.+.+...|-.+.+......| ....+++|+|+.|.+|++++- T Consensus 134 l~~aa~~~a~~~~~~~~~~G~~~~~~~tg~~~~~~~~~l~~a~~~a~~~l~ekdvP~~~~~~rv~vv~P~~y~~Ll~~~~ 213 (335) T protein:vir:78 134 VIKAAAMDAPVDLEDAFSPGVLEKLDLTGLTAKEAAEKIVRMHRRVVETFIERDLGDAVYSEGLTPMSPRVFSLLLEHDK 213 (335) T ss_pred HHhhcccccccccCCCcCCCcceeeeeccccccccHHHHHHHHHHHHHHHHhccCCCCCCCccEEEeChHHHHHHhcccc Confidence 665553210 000 01222333444444444443322 334889999999999999864 Q ss_pred ccccc--ccee--eeccCcceeecCeEEEEchHHHhcCCeEEEeecceeeeccceEEEEEeeccCccceeeeeccccccc Q lcl|NC_017976. 202 TTKEK--ASGA--NVDSNEIVKFKGFLIEEVPQAKLGANAALVYIKGVGKAFTGITTARTIESEDFDGVAFQGAGKAGEF 277 (296) Q Consensus 202 ~Ts~K--~Ss~--NiD~ngi~~fKgf~l~e~p~~y~qg~~aifs~dnIg~af~GI~taRtieSEDFdGVaLQgAgK~G~~ 277 (296) .-..- +|.. ..=.-++++.-||.|.+.|. |-... -+..-+|.+|.+. | .||. -..|.| T Consensus 214 l~n~~~~~s~~~~~~~~g~v~~v~Gv~V~~Sn~--lP~~~--~t~~~lg~a~n~~---~----~d~~-------~~~~~~ 275 (335) T protein:vir:78 214 LMSVEYQATGATNDYVKSRVAILNGVKVLETPR--FATKA--ISAHPLGRHFNVS---A----EEAE-------RQIALF 275 (335) T ss_pred cccccccccccccccccceeEEeeceEEEeecc--CCCCC--CccccccccCCcc---c----cccc-------ceEEEE Confidence 32211 1111 11122478889999888752 11100 0111112222221 1 1221 112333 Q ss_pred CCCCCcceEEEEecccCCC Q lcl|NC_017976. 278 ILDDNKPAVVKVTAPTVTP 296 (296) Q Consensus 278 IlddNKkAI~k~t~~~~~p 296 (296) =.++|+..+.....+| T Consensus 276 ---~~~~Al~t~~~~~~~~ 291 (335) T protein:vir:78 276 ---LPSKTLITAQVAPVQA 291 (335) T ss_pred ---EecceEEEEEEEeccc Confidence 4678999998888888 No 36 >protein:vir:96123 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240078;genbank:gi:66395742;genbank:GeneID:5133103 Probab=95.59 E-value=0.0018 Score=35.66 Aligned_cols=262 Identities=14% Similarity=0.091 Sum_probs=142.4 Q ss_pred CCCCcccceeeeech-hHHHHHHHHHhhhhhhhhhcccceeeecCCcccceeEEEeecccceE--eccc--ccCccccee Q lcl|NC_017976. 
1 MGTKNQQLAAKTYQK-QFKEMLQAVFSHQAYFADFFGGGIEALDGVQNNETAFYVKTSDLPVV--VGTG--YNTDANVGF 75 (296) Q Consensus 1 m~t~Nnn~a~r~Y~k-q~~~ll~~vf~~qa~F~~~fgg~lQ~lDGVqnN~tafsvKtnd~pVv--vg~~--Y~td~NvaF 75 (296) ||+.+--++ .++.| .|..++..-+.++..|....-- .-.+.|..- ++ | .+|.. +|+. |..+...+. T Consensus 1 ma~~~T~~~-d~i~Pev~s~~v~~~~~~~~~~~~~~~~-~~~l~g~~G-~t---v---~ip~~~~~g~~~~~~~g~~i~~ 71 (274) T protein:vir:96 1 MAQGTTKVS-NLIVPEVLAPMMQAELDKKLRFAQFADI-DSTLVGQPG-DT---L---TFPAFTYSGDAQVIAEGEKIPV 71 (274) T ss_pred CCccccchh-hhhhhHHHHHHHHHHHHhhhhhcccccc-cccccCCCC-CE---E---EEEeeccCCCccccCCCCcCch Confidence 997664444 56666 4677777777776666443211 111222211 11 1 12332 2221 433333333 Q ss_pred ccccCCcccccceeEEEEeccccccccCchhhhccccccccCCHhHHHHHHHhhHHHHHHHHHHHHHhHHHhhhhcchhh Q lcl|NC_017976. 76 GTGTSKSSRFGDRQEIIYIDTPVPYTWGYNWHEGIDRYTVNNSMESAMADRVELQAQAKTMLFDKKHAEFIVANAGKTEA 155 (296) Q Consensus 76 GtGTg~s~RFG~rkEIiy~DtdVpY~~~~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~~~gk~ls~~A~~t~~ 155 (296) ..-|. ++.+-.| +. +-+.|.+.. +++....-|+ +++.++-++.+|.|.+++.+-..|......... T Consensus 72 ~~it~-----~~~~~~i--~~---~~~~~~i~D-~~~~~~~~d~---~~~~~~~~~~~~a~~~d~~i~~~l~~a~~~~~~ 137 (274) T protein:vir:96 72 DQIGT-----SKREAKV--RK---IGKGTELTD-EAVLSGFGDP---QGEAVRQHGLAIANKVDNDVLEALKGATLTVEA 137 (274) T ss_pred hhccc-----ceeEEEE--Ee---eeceeeecH-HHHHhhcchH---HHHHHHHHHHHHHHHHHHHHHHHHhcCCCCcCc Confidence 33322 2222111 11 223454432 5555554444 566667788999999999988888654433322 Q ss_pred hhhcchhHHHHHHHHhhhhhcceeeeeeEEEEECchhhhhhhccccccccccce--eeeccCc-ceeecCeEEEEchHHH Q lcl|NC_017976. 156 LTAYDEAAVLKLFNNLSAYYINIEAIGTKVAKVGPELYNIIVDHRLTTKEKASG--ANVDSNE-IVKFKGFLIEEVPQAK 232 (296) Q Consensus 156 l~~~~~~~V~klFn~~~~~yvn~ev~~~~~ayV~~evYNaIvD~~l~Ts~K~Ss--~NiD~ng-i~~fKgf~l~e~p~~y 232 (296) ..++-+.+. .|...+ +-+-..+..+.|+|++|..|.-.++-...+.|. -++-.+| |-+|-||.|-+.+. 
T Consensus 138 -~~~~~d~i~----dA~~~l-~d~~~~~~~ivv~p~~~~~L~k~~~~~f~~~~~~g~~~~~~g~ig~~~G~~Vi~s~~-- 209 (274) T protein:vir:96 138 -DITKLDGLQ----TAIDKF-NDEDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNK-- 209 (274) T ss_pred -ccccHHHHH----HHHHHh-cccCCCceEEEeCHHHHHHHHhcccccccccccccccceeecccceecCeeEEEcCC-- Confidence 334444443 333333 223346778999999999997765432222222 2333445 66899999876543 Q ss_pred hc-CCeEEEeecceeeec---cceEEEEEeeccCccceeeeecccccccCCCCCcceEEEEecccCCC Q lcl|NC_017976. 233 LG-ANAALVYIKGVGKAF---TGITTARTIESEDFDGVAFQGAGKAGEFILDDNKPAVVKVTAPTVTP 296 (296) Q Consensus 233 ~q-g~~aifs~dnIg~af---~GI~taRtieSEDFdGVaLQgAgK~G~~IlddNKkAI~k~t~~~~~p 296 (296) +- +...+|.+..+|-.- +-+++-| .++...-.|-|---||..+++..|-.+++-.++..-- T Consensus 210 ~p~~t~~l~~~gA~~~~~~~~~~vE~~R---d~~~~~d~i~~~~~yg~~~~~~~~vv~~t~~~~~~~~ 274 (274) T protein:vir:96 210 LNKGEALLAKKGAVKLITKRDFFLEKDR---DASRKSTALYSDKHYVAYLYDESKVVKITKGAGDEVM 274 (274) T ss_pred CCcceEEEEeCcceeeeecCCccccccc---chhhcccEEEEeeEEEEEEEcCccEEEEEcCcccccC Confidence 33 666777777776521 1234444 3444566788888899999998887766554443333 No 37 >protein:vir:95898 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240385;genbank:gi:66396054;genbank:GeneID:5133409 Probab=95.42 E-value=0.0021 Score=35.27 Aligned_cols=265 Identities=15% Similarity=0.099 Sum_probs=146.5 Q ss_pred CCCCcccceeeeechhHHHHHHHHHhhhhhhhhhcccceeeecCCcccceeEEEeecccceEeccc--ccCcccceeccc Q lcl|NC_017976. 1 MGTKNQQLAAKTYQKQFKEMLQAVFSHQAYFADFFGGGIEALDGVQNNETAFYVKTSDLPVVVGTG--YNTDANVGFGTG 78 (296) Q Consensus 1 m~t~Nnn~a~r~Y~kq~~~ll~~vf~~qa~F~~~fgg~lQ~lDGVqnN~tafsvKtnd~pVvvg~~--Y~td~NvaFGtG 78 (296) |+..---++==+=-..|..+++.-+.+.-.|.+..-- .-.+.|..- +|.=.=|.+. +|+. 
|..+.....+.- T Consensus 1 m~~~~T~l~d~i~Pev~~~~v~~~~~~~l~~~~~~~~-~~~l~g~~G-~tv~iP~~~~----ig~a~~~~~g~~i~~~~l 74 (274) T protein:vir:95 1 MAQGMTKLTNQIVPEVLAPMMQAELEKKLRFASFAEI-DNTLVGQPG-DTLTFPAFIY----SGDAKVVAEGEKIPTDIL 74 (274) T ss_pred CCcceeehhheechHHHHHHHHHHHHhhhhcccccee-cccccCCCC-CEEEeeeecC----CCccccccCCCccchhhc Confidence 7764333332222335777777777777777554211 112223211 1110001121 1221 333333323222 Q ss_pred cCCcccccceeEEEEeccccccccCchhhhccccccccCCHhHHHHHHHhhHHHHHHHHHHHHHhHHHhhhhcchhhhhh Q lcl|NC_017976. 79 TSKSSRFGDRQEIIYIDTPVPYTWGYNWHEGIDRYTVNNSMESAMADRVELQAQAKTMLFDKKHAEFIVANAGKTEALTA 158 (296) Q Consensus 79 Tg~s~RFG~rkEIiy~DtdVpY~~~~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~~~gk~ls~~A~~t~~l~~ 158 (296) |.. +.+-.| +. +-+.|.+.. +|+.....|+ ++++++.++.+|.+.+++.+-..|..+.....+ .. T Consensus 75 t~~-----~~~~~i--~~---~~~a~~i~D-~~~~~~~~d~---~~~~~~~~~~~~a~~vd~~i~~~l~~a~~~~~~-~~ 139 (274) T protein:vir:95 75 ETK-----KREAKI--RK---IAKGTSISD-EALLSGYGDP---QGEQVRQHGLAHANKVDDDVLEALKSAKLTVEA-DI 139 (274) T ss_pred ccc-----eeEEEe--ee---eecceeehH-HHHhhccchH---HHHHHHHHHHHHHHHHHHHHHHHHhcccccccc-cc Confidence 211 111111 11 223455553 6777766665 667778889999999999998888665544332 45 Q ss_pred cchhHHHHHHHHhhhhhcceeeeeeEEEEECchhhhhhhccccccccccce--eeeccCcce-eecCeEEEEchHHHhc- Q lcl|NC_017976. 159 YDEAAVLKLFNNLSAYYINIEAIGTKVAKVGPELYNIIVDHRLTTKEKASG--ANVDSNEIV-KFKGFLIEEVPQAKLG- 234 (296) Q Consensus 159 ~~~~~V~klFn~~~~~yvn~ev~~~~~ayV~~evYNaIvD~~l~Ts~K~Ss--~NiD~ngi~-~fKgf~l~e~p~~y~q- 234 (296) ++.+.+...-..+. .|-..+..+.|+|++|..|.-.++-...+.|. .+|=-||.+ +|.||.+-+.+. +. 
T Consensus 140 ~~~d~i~~A~~~lg-----d~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~--~~~ 212 (274) T protein:vir:95 140 TKLTGLQTAIDKFN-----DEDLEPMVLFISPLDAGKLRGDATTNFTRATELGDDVIVKGAFGEALGAVIVRSNK--LEA 212 (274) T ss_pred cCHHHHHHHHHHhc-----cccccccEEEeCHHHHHHHHhhccccccccccccccceeccccceecCeEEEEeCC--CCC Confidence 55555554443333 33456778999999999998877655444443 344445544 899999887642 33 Q ss_pred CCeEEEeecceee---eccceEEEEEeeccCccceeeeecccccccCCCCCcceEEEEecccCCC Q lcl|NC_017976. 235 ANAALVYIKGVGK---AFTGITTARTIESEDFDGVAFQGAGKAGEFILDDNKPAVVKVTAPTVTP 296 (296) Q Consensus 235 g~~aifs~dnIg~---af~GI~taRtieSEDFdGVaLQgAgK~G~~IlddNKkAI~k~t~~~~~p 296 (296) +...+|-+..+|- +-+-+++-|-+.+ ---.|.+---||..+++..|...++-.+..--- T Consensus 213 ~t~~l~~~gA~~~~~~~~~~vE~~Rd~~~---~~d~i~~~~~y~~~~~~~~~~v~~tk~~~~~~~ 274 (274) T protein:vir:95 213 GTAILAKKGAVKLITKRDFFLETDRDPST---KTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) T ss_pred ceEEEEeccceeeeecCCccccccccccc---ccCEEEEeEEEEEEEEcCCcEEEEEcCCccccC Confidence 5567777766664 2233566664443 446788888999999998886665522221111 No 38 >protein:vir:96262 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240311;genbank:gi:66395978;genbank:GeneID:5133339 Probab=95.42 E-value=0.0021 Score=35.27 Aligned_cols=265 Identities=15% Similarity=0.099 Sum_probs=146.5 Q ss_pred CCCCcccceeeeechhHHHHHHHHHhhhhhhhhhcccceeeecCCcccceeEEEeecccceEeccc--ccCcccceeccc Q lcl|NC_017976. 1 MGTKNQQLAAKTYQKQFKEMLQAVFSHQAYFADFFGGGIEALDGVQNNETAFYVKTSDLPVVVGTG--YNTDANVGFGTG 78 (296) Q Consensus 1 m~t~Nnn~a~r~Y~kq~~~ll~~vf~~qa~F~~~fgg~lQ~lDGVqnN~tafsvKtnd~pVvvg~~--Y~td~NvaFGtG 78 (296) |+..---++==+=-..|..+++.-+.+.-.|.+..-- .-.+.|..- +|.=.=|.+. +|+. 
|..+.....+.- T Consensus 1 m~~~~T~l~d~i~Pev~~~~v~~~~~~~l~~~~~~~~-~~~l~g~~G-~tv~iP~~~~----ig~a~~~~~g~~i~~~~l 74 (274) T protein:vir:96 1 MAQGMTKLTNQIVPEVLAPMMQAELEKKLRFASFAEI-DNTLVGQPG-DTLTFPAFIY----SGDAKVVAEGEKIPTDIL 74 (274) T ss_pred CCcceeehhheechHHHHHHHHHHHHhhhhcccccee-cccccCCCC-CEEEeeeecC----CCccccccCCCccchhhc Confidence 7764333332222335777777777777777554211 112223211 1110001121 1221 333333323222 Q ss_pred cCCcccccceeEEEEeccccccccCchhhhccccccccCCHhHHHHHHHhhHHHHHHHHHHHHHhHHHhhhhcchhhhhh Q lcl|NC_017976. 79 TSKSSRFGDRQEIIYIDTPVPYTWGYNWHEGIDRYTVNNSMESAMADRVELQAQAKTMLFDKKHAEFIVANAGKTEALTA 158 (296) Q Consensus 79 Tg~s~RFG~rkEIiy~DtdVpY~~~~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~~~gk~ls~~A~~t~~l~~ 158 (296) |.. +.+-.| +. +-+.|.+.. +|+.....|+ ++++++.++.+|.+.+++.+-..|..+.....+ .. T Consensus 75 t~~-----~~~~~i--~~---~~~a~~i~D-~~~~~~~~d~---~~~~~~~~~~~~a~~vd~~i~~~l~~a~~~~~~-~~ 139 (274) T protein:vir:96 75 ETK-----KREAKI--RK---IAKGTSISD-EALLSGYGDP---QGEQVRQHGLAHANKVDDDVLEALKSAKLTVEA-DI 139 (274) T ss_pred ccc-----eeEEEe--ee---eecceeehH-HHHhhccchH---HHHHHHHHHHHHHHHHHHHHHHHHhcccccccc-cc Confidence 211 111111 11 223455553 6777766665 667778889999999999998888665544332 45 Q ss_pred cchhHHHHHHHHhhhhhcceeeeeeEEEEECchhhhhhhccccccccccce--eeeccCcce-eecCeEEEEchHHHhc- Q lcl|NC_017976. 159 YDEAAVLKLFNNLSAYYINIEAIGTKVAKVGPELYNIIVDHRLTTKEKASG--ANVDSNEIV-KFKGFLIEEVPQAKLG- 234 (296) Q Consensus 159 ~~~~~V~klFn~~~~~yvn~ev~~~~~ayV~~evYNaIvD~~l~Ts~K~Ss--~NiD~ngi~-~fKgf~l~e~p~~y~q- 234 (296) ++.+.+...-..+. .|-..+..+.|+|++|..|.-.++-...+.|. .+|=-||.+ +|.||.+-+.+. +. 
T Consensus 140 ~~~d~i~~A~~~lg-----d~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~--~~~ 212 (274) T protein:vir:96 140 TKLTGLQTAIDKFN-----DEDLEPMVLFISPLDAGKLRGDATTNFTRATELGDDVIVKGAFGEALGAVIVRSNK--LEA 212 (274) T ss_pred cCHHHHHHHHHHhc-----cccccccEEEeCHHHHHHHHhhccccccccccccccceeccccceecCeEEEEeCC--CCC Confidence 55555554443333 33456778999999999998877655444443 344445544 899999887642 33 Q ss_pred CCeEEEeecceee---eccceEEEEEeeccCccceeeeecccccccCCCCCcceEEEEecccCCC Q lcl|NC_017976. 235 ANAALVYIKGVGK---AFTGITTARTIESEDFDGVAFQGAGKAGEFILDDNKPAVVKVTAPTVTP 296 (296) Q Consensus 235 g~~aifs~dnIg~---af~GI~taRtieSEDFdGVaLQgAgK~G~~IlddNKkAI~k~t~~~~~p 296 (296) +...+|-+..+|- +-+-+++-|-+.+ ---.|.+---||..+++..|...++-.+..--- T Consensus 213 ~t~~l~~~gA~~~~~~~~~~vE~~Rd~~~---~~d~i~~~~~y~~~~~~~~~~v~~tk~~~~~~~ 274 (274) T protein:vir:96 213 GTAILAKKGAVKLITKRDFFLETDRDPST---KTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) T ss_pred ceEEEEeccceeeeecCCccccccccccc---ccCEEEEeEEEEEEEEcCCcEEEEEcCCccccC Confidence 5567777766664 2233566664443 446788888999999998886665522221111 No 39 >protein:vir:94576 Length: 347 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919012;genbank:gi:119637776;genbank:GeneID:5179336 Probab=95.16 E-value=0.0016 Score=35.90 Aligned_cols=261 Identities=18% Similarity=0.162 Sum_probs=132.3 Q ss_pred CC---C-----Ccccce------eeeechhHHHHHHHHHhhhhhhhhhcccceeeecCCcccceeEEEeecccceEecc- Q lcl|NC_017976. 1 MG---T-----KNQQLA------AKTYQKQFKEMLQAVFSHQAYFADFFGGGIEALDGVQNNETAFYVKTSDLPVVVGT- 65 (296) Q Consensus 1 m~---t-----~Nnn~a------~r~Y~kq~~~ll~~vf~~qa~F~~~fgg~lQ~lDGVqnN~tafsvKtnd~pVvvg~- 65 (296) || + +|+.++ .-+|-|+|-+-+.+-|+.++.|++..=- +..-| =|+--.| .||. 
T Consensus 1 ma~~~~~~~~~t~~g~~~~~~d~~al~ie~~~geV~~~f~~~s~~~~~~~~--rti~~---------G~sv~~~-~iG~~ 68 (347) T protein:vir:94 1 MANMNGGQQMGKDQGKGMSAGDKLALFLKVFGGEVLTAFTRTSVTMNKHLV--RSIQS---------GKSAQFP-VLGRT 68 (347) T ss_pred CCccccccccccccccCCcccchHHHHHHHHhHHHHHHHHHHHhhhhhhhh--eeccc---------cceEEee-eccce Confidence 43 1 122222 2279999999999999999999987753 22211 1222222 2232 Q ss_pred ---cccCcccceeccccCCcccccceeEEEEeccccccccCchhhhccccccccCCHhHHHHHHHhhHHHHHHHHHHHHH Q lcl|NC_017976. 66 ---GYNTDANVGFGTGTSKSSRFGDRQEIIYIDTPVPYTWGYNWHEGIDRYTVNNSMESAMADRVELQAQAKTMLFDKKH 142 (296) Q Consensus 66 ---~Y~td~NvaFGtGTg~s~RFG~rkEIiy~DtdVpY~~~~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~~~ 142 (296) +|..+.+.. ++.+.-...++ +|-.|+..-+.+. =+=||+.+.+=|+.+..+ ..+++|=.|.+|+.+ T Consensus 69 ~~~~~~~G~~l~---~~~~~~~~~e~--~ltID~~~y~~~~---VddiD~~q~~~D~rs~~~---~~~g~ALA~~~D~~i 137 (347) T protein:vir:94 69 KAAYLQPGENLD---DKRKDMKHTEK--TINIDGLLTADVL---IYDIEDAMNHYDVRSEYT---AQLGESLAMAADGAV 137 (347) T ss_pred eEeeeecCcCCC---CCcCCccccce--EEEEcchhhhhhh---hhhHHHHhcCcchHHHHH---HHHHHHHHHHHHHHH Confidence 122221110 11111122332 3555654433321 135777777777766555 567889999999766 Q ss_pred hHHHhhhhcc---------------------hhhh---hhcchhHHHHHHHHhhhhhcceee-eeeEEEEECchhhhhhh Q lcl|NC_017976. 143 AEFIVANAGK---------------------TEAL---TAYDEAAVLKLFNNLSAYYINIEA-IGTKVAKVGPELYNIIV 197 (296) Q Consensus 143 gk~ls~~A~~---------------------t~~l---~~~~~~~V~klFn~~~~~yvn~ev-~~~~~ayV~~evYNaIv 197 (296) =.-|...+.. 
..++ ...+...+...|-++.+......| ..+++++|+|+.|..|+ T Consensus 138 ~~~l~~~a~~~~~~~~~~~g~~~~~~v~i~~~~~~~~~~~~~~~~~~d~i~~a~~~Lde~dVP~~~R~~vv~P~~y~~LL 217 (347) T protein:vir:94 138 LAEMAKLCNLPTANNENIAGLGKAHVLEVGDQATLQGDQVKLGQAIIAQLTLARAKLTGNYVPSSDRVFYTTPDNYSAIL 217 (347) T ss_pred HHHHHHhhccccccccccccCCcceeEeeeccccccccccccHHHHHHHHHHHHHHhhhcCCCCCCCEEEeChHHHHHHH Confidence 4333221110 0000 011223333344444444444445 34799999999999999 Q ss_pred ccccccccccceeeeccCc-ceeecCeEEEEchHHHhcCCeEEEeecceeeeccceEEE-EEeeccC----ccceeeeec Q lcl|NC_017976. 198 DHRLTTKEKASGANVDSNE-IVKFKGFLIEEVPQAKLGANAALVYIKGVGKAFTGITTA-RTIESED----FDGVAFQGA 271 (296) Q Consensus 198 D~~l~Ts~K~Ss~NiD~ng-i~~fKgf~l~e~p~~y~qg~~aifs~dnIg~af~GI~ta-RtieSED----FdGVaLQgA 271 (296) ...-.++....+.+...+| +.+.-||.|-+.|.-=..+ ..-.+-+=|.++++.--+ +.-++++ |+++. T Consensus 218 k~~~~~~~~~~~~~~~~~G~V~~v~G~~V~~Sn~~p~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~y~~d~~~~~---- 291 (347) T protein:vir:94 218 AALMPNAANYQALIDPSTGSIRNVMGFEVIEVPHLTAGG--AGDNRAEEGVAPTNQKHAFPDTASGDTRVALDNVV---- 291 (347) T ss_pred HhhcccccccccccccccceeEEeeceEEEEcCcccccc--CcccccccccccccccccccccccccccccccceE---- Confidence 7666666655566665565 5688899988876532221 111222222333332111 1112222 22221 Q ss_pred ccccccCCCCCcceEEEEecccCCC Q lcl|NC_017976. 272 GKAGEFILDDNKPAVVKVTAPTVTP 296 (296) Q Consensus 272 gK~G~~IlddNKkAI~k~t~~~~~p 296 (296) |-+ =.+.|+..+.+..+++ T Consensus 292 ---~l~---~~~~A~~tv~~~~~~~ 310 (347) T protein:vir:94 292 ---GLF---NHRSAVGTVKLKDMAL 310 (347) T ss_pred ---EEE---echhhhhhhhhcccce Confidence 111 1455666666666666 No 40 >protein:vir:105334 Length: 276 # NCBI annotation: putative phage major capsid protein # Family: family:all:522 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950669;genbank:gi:119967839;genbank:GeneID:4643213 Probab=94.89 E-value=0.0032 Score=34.23 Aligned_cols=262 Identities=15% Similarity=0.107 Sum_probs=141.7 Q ss_pred CCCCcccceeeeechhHHHHHHHHHhhhhhhhhhcccceeeecCCcccceeEEEeecccceE--eccc--ccCcccceec Q lcl|NC_017976. 
1 MGTKNQQLAAKTYQKQFKEMLQAVFSHQAYFADFFGGGIEALDGVQNNETAFYVKTSDLPVV--VGTG--YNTDANVGFG 76 (296) Q Consensus 1 m~t~Nnn~a~r~Y~kq~~~ll~~vf~~qa~F~~~fgg~lQ~lDGVqnN~tafsvKtnd~pVv--vg~~--Y~td~NvaFG 76 (296) ||..---++-=+-=.-|..++..-+.+...|.+. +--.-.+.|..-+ + | .+|.. +|+. |..+...+.+ T Consensus 1 Ma~~~T~l~d~i~Pev~~~~v~~~~~~~~~~~~~-~~~~~~l~g~~G~-t---i---~iP~~~~igda~~~~eg~~i~~~ 72 (276) T protein:vir:10 1 MAQGTTTKSTQIVPEVLAPMMQAELDKKLRFAQF-ADIDSTLVGQPGD-T---L---TFPAFVYSGDATVVPEGQKIPVD 72 (276) T ss_pred CCcceeehhhhhchHHHHHHHHHHHHhhhhhccc-ceecccccCCCCC-E---E---EeeeecCCCccccccCCCccCcc Confidence 7733222322233334566666666666666332 1111122332111 1 1 12221 1110 1111111111 Q ss_pred cccCCcccccceeEEEEeccccccccCchhhhccccccccCCHhHHHHHHHhhHHHHHHHHHHHHHhHHHhhhhcchhhh Q lcl|NC_017976. 77 TGTSKSSRFGDRQEIIYIDTPVPYTWGYNWHEGIDRYTVNNSMESAMADRVELQAQAKTMLFDKKHAEFIVANAGKTEAL 156 (296) Q Consensus 77 tGTg~s~RFG~rkEIiy~DtdVpY~~~~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~~~gk~ls~~A~~t~~l 156 (296) .=|. ++.+-.|. -+-+.|.+. -++...-.-|+ +++.++.++.+|.|.+++.+=..|......... T Consensus 73 ~lt~-----~~~~a~i~-----~~~k~~~~t-D~a~~~~~~dp---~~~~~~~~~~~~a~~~d~~~~~~l~~~~~~~~~- 137 (276) T protein:vir:10 73 KIET-----NRREAKIH-----KIGKGTDIT-DEALLSGYGDP---QGEAVRQHGLAIANKVDNDVLEALRGTKLTVSA- 137 (276) T ss_pred cccc-----ceeeEEee-----hcccccccc-HHHHHhhccch---HHHHHHHHHHHHHHHHHHHHHHHHhcccccccc- Confidence 1111 11111110 012333332 23344444444 677888999999999999888887766555433 Q ss_pred hhcchhHHHHHHHHhhhhhcceeeeeeEEEEECchhhhhhhccc--cccccccceeeeccCc-ceeecCeEEEEchHHHh Q lcl|NC_017976. 157 TAYDEAAVLKLFNNLSAYYINIEAIGTKVAKVGPELYNIIVDHR--LTTKEKASGANVDSNE-IVKFKGFLIEEVPQAKL 233 (296) Q Consensus 157 ~~~~~~~V~klFn~~~~~yvn~ev~~~~~ayV~~evYNaIvD~~--l~Ts~K~Ss~NiD~ng-i~~fKgf~l~e~p~~y~ 233 (296) +.++.+.+.+....+... -..+.++.|+|++|..|.-.. -.+......-++-.|| |-+|.|+.+-..+. 
+ T Consensus 138 ~~~t~d~i~~A~~~lgd~-----~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~--~ 210 (276) T protein:vir:10 138 DIGTLAGLEAAIDTFDDE-----DLEPMVLFINPKDAGKLRSSASDNFTRATELGDNIIVKGAFGEALGAVIVRSKK--L 210 (276) T ss_pred cccCHHHHHHHHHHhccc-----cCcccEEEEcHHHHHHHHHhccccccccccccccceeccccceecceeEEEcCC--C Confidence 566777777666555442 234578899999999996432 2222233334555666 55899999887653 3 Q ss_pred c-CCeEEEeecceee---eccceEEEEEeeccCccceeeeecccccccCCCCCcceEEEEecccCCC Q lcl|NC_017976. 234 G-ANAALVYIKGVGK---AFTGITTARTIESEDFDGVAFQGAGKAGEFILDDNKPAVVKVTAPTVTP 296 (296) Q Consensus 234 q-g~~aifs~dnIg~---af~GI~taRtieSEDFdGVaLQgAgK~G~~IlddNKkAI~k~t~~~~~p 296 (296) - +...+|.+.-|+. .-+-+++-|-+.+ ---.+-+---||..+.++.|...++-.. ..+| T Consensus 211 p~~t~~l~~~gAi~~~~~~~~~vE~dRd~~~---~~d~i~~~~~y~~~~~~~~~vv~~t~~~-~~~~ 273 (276) T protein:vir:10 211 DEGEAILAKRGAVKLITKRDFFLETDRDPST---KTTALYSDKHYVAYLYDESKAVKVTKGA-GTTD 273 (276) T ss_pred CcceEEEEeccceeeeecCCceeecccchhh---cccEEEEeeEEEEEEEcCcceEEEecCC-cCCc Confidence 3 6677888877763 2233566664443 3456677777999999999988887444 5666 No 41 >protein:vir:94622 Length: 341 # NCBI annotation: PfWMP4_37 # Family: family:all:2203 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762667;genbank:gi:115304375;genbank:GeneID:5142322 Probab=94.67 E-value=0.0032 Score=34.23 Aligned_cols=266 Identities=15% Similarity=0.078 Sum_probs=115.6 Q ss_pred CCCCcccceeeeechhHHHHHHHHHhhhhhhhhhcccceeeecCCcccceeEEEeecccceEecccccCcccceeccccC Q lcl|NC_017976. 1 MGTKNQQLAAKTYQKQFKEMLQAVFSHQAYFADFFGGGIEALDGVQNNETAFYVKTSDLPVVVGTGYNTDANVGFGTGTS 80 (296) Q Consensus 1 m~t~Nnn~a~r~Y~kq~~~ll~~vf~~qa~F~~~fgg~lQ~lDGVqnN~tafsvKtnd~pVvvg~~Y~td~NvaFGtGTg 80 (296) +.|++=+.-+ .+.|.+.|...|+++..|++..-. ...++ .+.+| +.+..-..| .+. .|..+....+..-+. 
T Consensus 12 ~~t~~v~~fi---pei~s~~i~~~l~~~~v~~~~~~d--~~~~~-~~Gdt-v~ip~~g~~-~~~-d~~~~~~i~~~~~~~ 82 (341) T protein:vir:94 12 INTQRGQQFI---PEQWLSEVQMFRKAKMLDTSVVKT--WGAQV-KKGDT-FHVPRISEL-GVE-DKATDVPVGVQPVND 82 (341) T ss_pred ccchhHHHHH---HHHHHHHHHHHHHhhcchhhcccc--ccccc-cCCce-EEEeccCcc-eee-eecCCCccccccccC Confidence 4443333332 477889999999999888775321 11122 22222 222211111 132 255444333322211 Q ss_pred CcccccceeEEEEeccccccccCchhhhccccccccCCHhHHHHHHHhhHHHHHHHHHHHHHhHHHhhhhcchhh----- Q lcl|NC_017976. 81 KSSRFGDRQEIIYIDTPVPYTWGYNWHEGIDRYTVNNSMESAMADRVELQAQAKTMLFDKKHAEFIVANAGKTEA----- 155 (296) Q Consensus 81 ~s~RFG~rkEIiy~DtdVpY~~~~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~~~gk~ls~~A~~t~~----- 155 (296) .-++ +-.|+..=+.+.+ .-+|+...+-|+- ++.+..|++|=.|.++..+-..+...+..... T Consensus 83 ------~~~~-itiD~~~~~~~~i---~d~d~~~~~~d~~---~~~~~~~~~aLA~~~D~~i~~~~a~~~~~~~~~~~~~ 149 (341) T protein:vir:94 83 ------TDFV-ITVDTDRTTAVAL---DDLLEIQASYDLR---APYLEAMGYALAKDMTGSILGLRAAVQNTASQNVFSS 149 (341) T ss_pred ------ceEE-EEEeeeeecceee---chHHHHhhccchH---HHHHHHHHHHHHHHHHHHHHHHhhhccccccCccccC Confidence 1122 2233332222221 2466666666654 44455677788888888876666544322100 Q ss_pred ----h----hhcchhHHHHHHHHhhhhhcceeeeeeEEEEECchhhhhhhccccccc-cccceeeeccCcc-eeecCeEE Q lcl|NC_017976. 156 ----L----TAYDEAAVLKLFNNLSAYYINIEAIGTKVAKVGPELYNIIVDHRLTTK-EKASGANVDSNEI-VKFKGFLI 225 (296) Q Consensus 156 ----l----~~~~~~~V~klFn~~~~~yvn~ev~~~~~ayV~~evYNaIvD~~l~Ts-~K~Ss~NiD~ngi-~~fKgf~l 225 (296) . ..++-+.|.++...+.+..|. .....++|+|+.|..|.-.+.-+. ....+.-+ .+|. 
-++-||.| T Consensus 150 ~~~~~t~~~~~~~~~~i~~a~~~Lde~~VP---~~gR~lvv~P~~~~~Ll~~~~~~~~~~~g~~~l-~~G~ig~i~G~~V 225 (341) T protein:vir:94 150 SNGAITGNGQAFSFAVFLAARRLLLEADVP---EEKIVLLISPGQESALFTIPQFISKDFINNAPI-AQGQIGSLMGVRV 225 (341) T ss_pred ccccccCchhhhhHHHHHHHHHHHhhcCCC---ccCCEEEeCHHHHHHHhhchhhhhhhccccchh-heeeeeeEeceEE Confidence 0 012223344444444443332 244789999999999975543332 22222223 4564 48999999 Q ss_pred EEchHHHhcCCeEEEeecceee-----eccceEEEEEeeccCccceeeee------------------------------ Q lcl|NC_017976. 226 EEVPQAKLGANAALVYIKGVGK-----AFTGITTARTIESEDFDGVAFQG------------------------------ 270 (296) Q Consensus 226 ~e~p~~y~qg~~aifs~dnIg~-----af~GI~taRtieSEDFdGVaLQg------------------------------ 270 (296) -+.+.- -...+.-.+.+-+. +-.+|+-.++.-.++-+.-...| T Consensus 226 ~~Sn~l--p~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~gl~~~~~av~~~k~~~~~~~~~~~~~~~~~~ 303 (341) T protein:vir:94 226 IRTSLI--GNNSATGWRNGAPTIAPAEATPGFTGSRYLPKQDSFTSLPATFTGNSRPVHTAVMCHMDWAAAVVSKAPRVT 303 (341) T ss_pred EEeccc--cccccccccccccceecccccccccccccccccccccccEEEEEEecccccceeeecchhhhcccccccccc Confidence 884431 11111111111110 11122223333222222222222 Q ss_pred ---------c---cc--ccccCCCCCcceEEEEecccC Q lcl|NC_017976. 271 ---------A---GK--AGEFILDDNKPAVVKVTAPTV 294 (296) Q Consensus 271 ---------A---gK--~G~~IlddNKkAI~k~t~~~~ 294 (296) - || +|-=+|.-.-...++....++ T Consensus 304 ~~~~~~~~~~~i~~~~~~G~~~lrp~~~v~~~~~~~~~ 341 (341) T protein:vir:94 304 QSFENREQVWLMVGRQAYGARLYRPLHAVNIHTTGDTV 341 (341) T ss_pred ccchhhhhhhhhhhhhhhcccccCcceeEEEecCcCCC Confidence 1 22 222233332222233222222 No 42 >protein:vir:3364 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523335;genbank:gi:17570826;genbank:GeneID:927448 Probab=93.63 E-value=0.0069 Score=32.43 Aligned_cols=270 Identities=13% Similarity=0.092 Sum_probs=131.2 Q ss_pred CC---C-----Ccccce------eeeechhHHHHHHHHHhhhhhhhhhcccceeeecCCcccceeEEEeecccceEeccc Q lcl|NC_017976. 
1 MG---T-----KNQQLA------AKTYQKQFKEMLQAVFSHQAYFADFFGGGIEALDGVQNNETAFYVKTSDLPVVVGTG 66 (296) Q Consensus 1 m~---t-----~Nnn~a------~r~Y~kq~~~ll~~vf~~qa~F~~~fgg~lQ~lDGVqnN~tafsvKtnd~pVvvg~~ 66 (296) |+ + +++.++ .-+|-|+|.+.+.+-|++++.|++..=- +. +.. -+...+..-.. +-+ ++ T Consensus 1 ~~~~~~~~~~~t~~g~~~~~~~~~al~ie~~~g~V~~~f~~~s~~~~~v~~--r~---~~~-G~sv~i~~iG~-~t~-~~ 72 (347) T protein:vir:33 1 MANIQGGQQIGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTMPRHML--RS---IAS-GKSAQFPVIGR-TKA-AY 72 (347) T ss_pred CCCCccCcccccccccCCcccchHHHHHHHHHHHHHHHHHHHHhhhhhhcc--cc---ccc-cceeEeeeccc-eee-ee Confidence 44 1 122222 2378899999999999999999987643 11 111 11111111111 122 22 Q ss_pred ccCcccceeccccCCccccc--ceeEEEEeccccccccCchhhhccccccccCCHhHHHHHHHhhHHHHHHHHHHHHHhH Q lcl|NC_017976. 67 YNTDANVGFGTGTSKSSRFG--DRQEIIYIDTPVPYTWGYNWHEGIDRYTVNNSMESAMADRVELQAQAKTMLFDKKHAE 144 (296) Q Consensus 67 Y~td~NvaFGtGTg~s~RFG--~rkEIiy~DtdVpY~~~~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~~~gk 144 (296) |..+.... .++-. .-+-+|..|+..-+.+ +=+-||+...+=|+.+..+ .-++.|=.|.+|..+-. T Consensus 73 ~~~g~~l~-------~~~~~~~~~e~~ltiD~~~y~~~---~VddiD~~q~~~D~~~~~~---~~~g~aLA~~~D~~i~~ 139 (347) T protein:vir:33 73 LKPGENLD-------DKRKDIKHTEKVIHIDGLLTADV---LIYDIEDAMNHYDVRAEYT---AQLGESLAMAADGAVLA 139 (347) T ss_pred ecCCCCCC-------CCCCCCccceEEEEechhhhhhH---HHhhHHHHhcCCchhHHHH---HHHHHHHHHHHHHHHHH Confidence 43322211 11111 1122344454432221 2346788888877766554 45678888888887754 Q ss_pred HHhhh-------hc-------------chhhhh-----hcchhHHHHHHHHhhhhhcceee-eeeEEEEECchhhhhhhc Q lcl|NC_017976. 145 FIVAN-------AG-------------KTEALT-----AYDEAAVLKLFNNLSAYYINIEA-IGTKVAKVGPELYNIIVD 198 (296) Q Consensus 145 ~ls~~-------A~-------------~t~~l~-----~~~~~~V~klFn~~~~~yvn~ev-~~~~~ayV~~evYNaIvD 198 (296) .|... ++ ...+-+ ..+.+.+.+.+-.+.+.....+| .....+.|+|+.|.+|+. 
T Consensus 140 ~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tg~~~d~~~~a~~i~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~ 219 (347) T protein:vir:33 140 ELAGLVNLPDGSNENIEGLGKPTVLTLVKPTTGSLTDPVELGKAIIAQLTIARASLTKNYVPAADRTFYTTPDNYSAILA 219 (347) T ss_pred HHHHhhhhhcccccccccccccccccccccccccccchhhhHHHHHHHHHHHHHHHhhcCCCccCcEEEeCHHHHHHHhc Confidence 43211 00 000001 11123344444444444444455 245889999999999999 Q ss_pred cccccccccceeeeccCc-ceeecCeEEEEchHHHhcCC----e---------------------------EEEeeccee Q lcl|NC_017976. 199 HRLTTKEKASGANVDSNE-IVKFKGFLIEEVPQAKLGAN----A---------------------------ALVYIKGVG 246 (296) Q Consensus 199 ~~l~Ts~K~Ss~NiD~ng-i~~fKgf~l~e~p~~y~qg~----~---------------------------aifs~dnIg 246 (296) ++-.++..-.+...=.+| +.+.-||.|-+.+.--..+. . .+|.++-+| T Consensus 220 ~~~~~~~d~~~~~~~~~G~V~~i~G~~V~~Sn~lp~~~~~~~~~~~~ag~~~~~~~~~~~~~~~a~~~~~gl~~h~~A~g 299 (347) T protein:vir:33 220 ALMPNAANYQALLDPERGTIRNVMGFEVVEVPHLTAGGAGDTREDAPADQKHAFPATSSTTVKVALDNVVGLFQHRSAVG 299 (347) T ss_pred cccccccccccccccccceeEEEeceeEEEecccccCccccccccccccccccccCCcccceeccccceeeeeecchhhe Confidence 887665433322223456 45889999987664211100 0 133333333 Q ss_pred ee-ccc--eEEEEEeeccCccceeeeecccccccCCCCCcceEEEEecccCCC Q lcl|NC_017976. 247 KA-FTG--ITTARTIESEDFDGVAFQGAGKAGEFILDDNKPAVVKVTAPTVTP 296 (296) Q Consensus 247 ~a-f~G--I~taRtieSEDFdGVaLQgAgK~G~~IlddNKkAI~k~t~~~~~p 296 (296) .. ..+ ++..| .++.-|=.+-|-=-||-=+++-.. .+...++++.. 
T Consensus 300 ~v~~~~~~~e~~r---~~~~~~d~i~~~~~~G~~vlrP~~--av~i~~~~~~~ 347 (347) T protein:vir:33 300 TVKLKDLALERAR---RANYQADQIIAKYAMGHGGLRPEA--AGAIVLPKVSE 347 (347) T ss_pred eeeeeceeeeecc---chhhhhHhhhhhhhcCCceecccc--eEEEecCCCCC Confidence 11 111 22222 223323333333333444554443 44445666666 No 43 >protein:vir:99075 Length: 392 # NCBI annotation: gp30 # Family: family:all:10837 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655895;genbank:gi:109521467;genbank:GeneID:4158040 Probab=92.79 E-value=0.01 Score=31.56 Aligned_cols=259 Identities=12% Similarity=0.050 Sum_probs=114.7 Q ss_pred ceeeeechh-HHHHHHHHHhhhhhhhhhc----ccceeeecCCcccceeEEEeecccceEecccccCcccceeccccCCc Q lcl|NC_017976. 8 LAAKTYQKQ-FKEMLQAVFSHQAYFADFF----GGGIEALDGVQNNETAFYVKTSDLPVVVGTGYNTDANVGFGTGTSKS 82 (296) Q Consensus 8 ~a~r~Y~kq-~~~ll~~vf~~qa~F~~~f----gg~lQ~lDGVqnN~tafsvKtnd~pVvvg~~Y~td~NvaFGtGTg~s 82 (296) ||..++.+| ++..+-..|++...|.+.. -+++.. +.++| +.++.-. ++.+ ..|. ..+.++|....-. T Consensus 1 Ma~~~~~p~~~a~~~l~~l~~~lv~~~lv~~~~~~~~~~----~~Gdt-V~i~~~~-~~~~-~~~~-~~~~~~~~~~~~~ 72 (392) T protein:vir:99 1 MANAFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAH----KFNDT-ITVRVPA-PSRG-HTRK-LRGAGAERNLTVS 72 (392) T ss_pred CccccccHHHHHHHHHHHHHhhccchhhhcccccccccc----CCCCe-EEEeecc-cccc-eeee-ccccccCCccccc Confidence 555567776 7777777788887776542 111211 12222 2444322 2333 3243 2223333222111 Q ss_pred ccccceeEEE-EeccccccccCchhhhccccccccCCHhHHHHHHHhhHHHHHHHHHHHHHhHHHhhhhcchh-hhhhcc Q lcl|NC_017976. 83 SRFGDRQEII-YIDTPVPYTWGYNWHEGIDRYTVNNSMESAMADRVELQAQAKTMLFDKKHAEFIVANAGKTE-ALTAYD 160 (296) Q Consensus 83 ~RFG~rkEIi-y~DtdVpY~~~~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~~~gk~ls~~A~~t~-~l~~~~ 160 (296) ..--.-.++. =.+..+++.++ .+|+..... ...++.|+-|.+|=.+.++..+.+.+..+..... 
....++ T Consensus 73 ~~~~~~~~~~id~~k~~~~~i~-----d~e~~~~~~---~~~~~~~~~a~~ala~~vd~~i~~~~~~a~~~~~~~~~~~~ 144 (392) T protein:vir:99 73 DFTEDSFPVTLTDVAYHLGVLT-----DEELTFDLE---SFATQILPRQVRGVADILEEGVRDMIVGAPYEAAGAVHEVA 144 (392) T ss_pred ccccceEEEEEeeeeecceeec-----hHHHhhhhh---hhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccC Confidence 1111222232 23333333332 445444333 3456667778888888888888777654433221 123345 Q ss_pred hhHHHHHHHHhhhhhcceeeeeeEEEEECchhhhhhhccccccccccce---eeeccCcce-eecCeEEEEchHHHhcCC Q lcl|NC_017976. 161 EAAVLKLFNNLSAYYINIEAIGTKVAKVGPELYNIIVDHRLTTKEKASG---ANVDSNEIV-KFKGFLIEEVPQAKLGAN 236 (296) Q Consensus 161 ~~~V~klFn~~~~~yvn~ev~~~~~ayV~~evYNaIvD~~l~Ts~K~Ss---~NiD~ngi~-~fKgf~l~e~p~~y~qg~ 236 (296) .+...+-|-.+.+..-..++-..+++.|+|+.|.+|.-.+-.+....+. ..+=.+|.+ ++-||.+-+-+.--. +. T Consensus 145 ~~~~~~~i~~a~~~L~~~~vP~~R~~vv~p~~~~~l~~~~~~~~~~~~g~~~~~~l~~G~vg~i~G~~v~~s~~~~~-~t 223 (392) T protein:vir:99 145 PDEFFKGVNGARRALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYEIVESTLIPH-GD 223 (392) T ss_pred hhhhHHHHHHHHHHHhhcCCCCCCEEEEcHHHHHHHhcccceeecccccchhhhhhhcceeeeeeeeEEEeeccccc-cc Confidence 4444555555555544445544589999999999998665444332221 122234543 677887765443211 22 Q ss_pred eEEEeecceeeeccceEEEEEeeccCccceeeeecccccccCCCCCcce--EE--EEec----ccCCC Q lcl|NC_017976. 237 AALVYIKGVGKAFTGITTARTIESEDFDGVAFQGAGKAGEFILDDNKPA--VV--KVTA----PTVTP 296 (296) Q Consensus 237 ~aifs~dnIg~af~GI~taRtieSEDFdGVaLQgAgK~G~~IlddNKkA--I~--k~t~----~~~~p 296 (296) ...|.+..+.. ... ..+..+|+.....+..+..-.. .. ..+. 
....+ T Consensus 224 ~~a~~~~a~~~---------at~----a~v~~~~~~~~~s~s~~~~v~~~~~~~~~~t~~s~~~~v~~ 278 (392) T protein:vir:99 224 AYLYHPTAFIM---------ATR----APAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDT 278 (392) T ss_pred ceeeecccccc---------ccc----cccccccccceeEEecccceecceeecccceeeccccccce Confidence 22222222111 000 1122333332222211110000 00 0000 00111 No 44 >protein:vir:1239 Length: 274 # NCBI annotation: similar to phage B1 major head protein # Family: family:all:522 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510938;genbank:gi:17426272;genbank:GeneID:927376 Probab=91.26 E-value=0.017 Score=30.33 Aligned_cols=263 Identities=14% Similarity=0.119 Sum_probs=142.0 Q ss_pred CCCCcccceeeeechhHHHHHHHHHhhhhhhhhhcccceeeecCCcccceeEEEeecccceE--eccc--ccCcccceec Q lcl|NC_017976. 1 MGTKNQQLAAKTYQKQFKEMLQAVFSHQAYFADFFGGGIEALDGVQNNETAFYVKTSDLPVV--VGTG--YNTDANVGFG 76 (296) Q Consensus 1 m~t~Nnn~a~r~Y~kq~~~ll~~vf~~qa~F~~~fgg~lQ~lDGVqnN~tafsvKtnd~pVv--vg~~--Y~td~NvaFG 76 (296) |+..---++-=+--..|..++..=+.++-.|.+..-- .-.+.|..- + .|+ +|.. +|+. |..++..... T Consensus 1 ma~~~T~l~d~iiPev~~~~v~~~~~~~l~~~~~~~~-d~~l~g~~G-~---tv~---iP~~~~ig~a~~~~~g~~i~~~ 72 (274) T protein:vir:12 1 MAQGLTKTSNQIIPEVLAPMMQAQLEKKLRFASFAEV-DSTLQGQPG-D---TLT---FPAFVYSGDAQVVAEGEKIPTD 72 (274) T ss_pred CCcceeehhhhhchHHHHHHHHHHHHhhhhhccccee-cccccCCCC-C---EEE---EeeecCCCccccccCCCccchh Confidence 7764322322222334666666555555555432111 112222211 1 111 2211 1221 3333333222 Q ss_pred cccCCcccccceeEEEEeccccccccCchhhhccccccccCCHhHHHHHHHhhHHHHHHHHHHHHHhHHHhhhhcchhhh Q lcl|NC_017976. 77 TGTSKSSRFGDRQEIIYIDTPVPYTWGYNWHEGIDRYTVNNSMESAMADRVELQAQAKTMLFDKKHAEFIVANAGKTEAL 156 (296) Q Consensus 77 tGTg~s~RFG~rkEIiy~DtdVpY~~~~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~~~gk~ls~~A~~t~~l 156 (296) .-|. ++.+-.| +. +-+.|.+.. +++..-+-|+ +++.++.++.++.+.++..+-..|..+..+... 
T Consensus 73 ~lt~-----~~~~~~i--~~---~~~~~~i~D-~~~~~~~~d~---~~~~~~q~~~~~a~~vd~~~l~~~~~a~~~~~~- 137 (274) T protein:vir:12 73 ILET-----KKREAKI--RK---IAKGTSITD-EALLSGYGDP---QGEQVRQHGLAHANKVDNDVLEALMGAKLTVNA- 137 (274) T ss_pred hccc-----ceeeEEe--ee---ecceeeecH-HHHHhcccch---HHHHHHHHHHHHHHHHHHHHHHHHhcccccccc- Confidence 2221 1222222 11 134566655 6777777676 566778888999999999998888765555433 Q ss_pred hhcchhHHHHHHHHhhhhhcceeeeeeEEEEECchhhhhhhccccccccccce--eeeccCcce-eecCeEEEEchHHHh Q lcl|NC_017976. 157 TAYDEAAVLKLFNNLSAYYINIEAIGTKVAKVGPELYNIIVDHRLTTKEKASG--ANVDSNEIV-KFKGFLIEEVPQAKL 233 (296) Q Consensus 157 ~~~~~~~V~klFn~~~~~yvn~ev~~~~~ayV~~evYNaIvD~~l~Ts~K~Ss--~NiD~ngi~-~fKgf~l~e~p~~y~ 233 (296) ..++.+.+...-.++ +.+-..+..+.|+|++|..|.-.++-...+.|. .++=.||.+ +|.||.+-+.+ .+ T Consensus 138 ~a~~~d~i~dA~~~l-----gd~~~~~~~ivv~p~~~~~L~k~~~~~fv~~s~~g~~~~~~G~ig~~~G~~Vi~s~--~~ 210 (274) T protein:vir:12 138 DITKLNGLQSAIDKF-----NDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRSN--KL 210 (274) T ss_pred cccCHHHHHHHHHHh-----ccccccccEEEeCHHHHHHHHhhhhhhccccccccccceecccceeecCeeEEEeC--CC Confidence 455555554443333 334456778999999999998776543333332 233344443 79999988764 23 Q ss_pred c-CCeEEEeecceee---eccceEEEEEeeccCccceeeeecccccccCCCCCcceEEEEecccCCC Q lcl|NC_017976. 234 G-ANAALVYIKGVGK---AFTGITTARTIESEDFDGVAFQGAGKAGEFILDDNKPAVVKVTAPTVTP 296 (296) Q Consensus 234 q-g~~aifs~dnIg~---af~GI~taRtieSEDFdGVaLQgAgK~G~~IlddNKkAI~k~t~~~~~p 296 (296) . +...+|.+..+|. 
+-+-+|+-|-+.+ ---.|-+---||..+.+..|-.+++-.++..-- T Consensus 211 p~~t~~l~~~gA~~~~~~~~~~vE~~Rd~~~---~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~~~~ 274 (274) T protein:vir:12 211 EAGTAILAKKGAVKLILKRDFFLEVARDAST---KTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) T ss_pred CcceEEEEeccceeeeecCCceeccccchhh---cccEEEeeeEEEEEEEcCCceEEEEcCCccccC Confidence 3 5567777776664 2233666664443 334677777899999988887766633322222 No 45 >protein:vir:97031 Length: 402 # NCBI annotation: 31 # Family: family:all:2806 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654132;genbank:gi:108862016;genbank:GeneID:5075980 Probab=90.69 E-value=0.02 Score=29.95 Aligned_cols=247 Identities=14% Similarity=0.092 Sum_probs=129.6 Q ss_pred CCCCcc---------cceeeeechhHHHHHHHHHhhhhhhhhhcccceeeecCCcccceeEEEeecccceE----ecccc Q lcl|NC_017976. 1 MGTKNQ---------QLAAKTYQKQFKEMLQAVFSHQAYFADFFGGGIEALDGVQNNETAFYVKTSDLPVV----VGTGY 67 (296) Q Consensus 1 m~t~Nn---------n~a~r~Y~kq~~~ll~~vf~~qa~F~~~fgg~lQ~lDGVqnN~tafsvKtnd~pVv----vg~~Y 67 (296) |...|. .-..-+|-|+|-|.+.+-|+.++.|++.+=- +++.| - |+-..|.+ + .+| T Consensus 1 Ms~~n~~t~~~~~~s~~~~al~le~f~geV~taF~~~si~~~~~~v--rti~~--G-------kS~qf~~iG~~~a-~y~ 68 (402) T protein:vir:97 1 MSTPNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDV--QTVTG--T-------NTVSNKYLGETEL-QVL 68 (402) T ss_pred CCCcccccccccccccchhhhhhhhhhhhHHHHHHHHHhhcCccee--eeecc--c-------ceEEEEEEeeeEE-eee Confidence 655442 3456789999999999999999999977643 33221 1 22223333 2 223 Q ss_pred cCcccceeccccCCcccccceeEEEEeccccccccCchhhhccccccccCC-HhHHHHHHHhhHHHHHHHHHHHHHhHHH Q lcl|NC_017976. 68 NTDANVGFGTGTSKSSRFGDRQEIIYIDTPVPYTWGYNWHEGIDRYTVNNS-MESAMADRVELQAQAKTMLFDKKHAEFI 146 (296) Q Consensus 68 ~td~NvaFGtGTg~s~RFG~rkEIiy~DtdVpY~~~~aiHEGiDr~TVNnd-l~aavAdRl~Lqa~Ak~~~~n~~~gk~l 146 (296) ..++.. .|+ +.---|-+|-.|+-. |.--|.. 
=||.+.-+=| +.+.+++ .+++|-.|++|..+=..+ T Consensus 69 ~~G~~l---dg~----~~~~~k~~ItID~lL-~a~~~V~--diDeaq~~yD~vRse~s~---e~G~ALA~~~Dq~ii~~i 135 (402) T protein:vir:97 69 APGQSP---NAT----PTQADKNQLVIDTTV-IARNTVA--HIHDVQGDIDSLKPKLAM---NQAKQLKRLEDQMAIQQM 135 (402) T ss_pred cccccc---CCC----CcccccEEEEeCcee-echhhhh--hHHHHHhcccchhHHHHH---HHHHHHHHHHHHHHHHHH Confidence 333332 122 222235578899876 3333322 2555555555 5555544 467888888888663323 Q ss_pred hhhhc-chhh---------------h------hhcchh----HHHHHHHHhhhhhcceeeeeeEEEEECchhhhhhhccc Q lcl|NC_017976. 147 VANAG-KTEA---------------L------TAYDEA----AVLKLFNNLSAYYINIEAIGTKVAKVGPELYNIIVDHR 200 (296) Q Consensus 147 s~~A~-~t~~---------------l------~~~~~~----~V~klFn~~~~~yvn~ev~~~~~ayV~~evYNaIvD~~ 200 (296) ...+- .+.. + ...+.. .+..++.++.+++|.. ..++++|+|+.|++|+.++ T Consensus 136 ~~aa~a~t~~~~~~~~~~~~g~s~~~~~t~~~a~~~~~~l~~ai~~a~~~LdEkdVP~---~dRv~vv~P~~y~~Ll~~~ 212 (402) T protein:vir:97 136 LLGGIANTKAERNKPRVKGHGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDI---SDVAIMMPWKFFNALRDAD 212 (402) T ss_pred HHhhccccccccccCcccccccccccccccchhhcCHHHHHHHHHHHHHHHHhcCCCc---cccEEEeChHHHHHHhhcc Confidence 21110 0000 0 012222 2335666777777664 3389999999999999875 Q ss_pred ccc-cccc-ceee-eccCcceeecCeEEEEchHHHhcCCe---EEEeecceeeecc---ceEEEEEeeccCccceeeeec Q lcl|NC_017976. 201 LTT-KEKA-SGAN-VDSNEIVKFKGFLIEEVPQAKLGANA---ALVYIKGVGKAFT---GITTARTIESEDFDGVAFQGA 271 (296) Q Consensus 201 l~T-s~K~-Ss~N-iD~ngi~~fKgf~l~e~p~~y~qg~~---aifs~dnIg~af~---GI~taRtieSEDFdGVaLQgA 271 (296) --- ..=+ ++.+ +=.-++++--||.|.+.|.-=+.+.. .=.++.+-|.+|. +..+++ |. T Consensus 213 rl~n~d~~~~~~g~~~~G~v~~v~Gv~Vv~SnnlP~~a~~it~~~ls~a~~G~~y~~t~d~t~~~-------------~~ 279 (402) T protein:vir:97 213 RIVDKTYTISQSGATINGFVLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAV-------------AV 279 (402) T ss_pred cccchhhccccCCccccceeEEEeceEEEecCccccccccccccccccCCCCccCCcCcccceeE-------------EE Confidence 432 1110 1111 22334566777777665442221100 1112333344443 111111 11 Q ss_pred ccccccCCCCCcceEEEEecccCCC Q lcl|NC_017976. 
272 GKAGEFILDDNKPAVVKVTAPTVTP 296 (296) Q Consensus 272 gK~G~~IlddNKkAI~k~t~~~~~p 296 (296) .| .++|+..+.+...|+ T Consensus 280 ----~f----~~~Av~tvk~~~vT~ 296 (402) T protein:vir:97 280 ----LF----TSDALLVGRTIEVTG 296 (402) T ss_pred ----EE----ecceEEEEEeecccc Confidence 23 346999999988888 No 46 >protein:vir:102655 Length: 322 # NCBI annotation: Hypothetical protein # Family: family:all:6384 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_052979;genbank:gi:50282923;genbank:GeneID:2948122 Probab=90.15 E-value=0.022 Score=29.63 Aligned_cols=269 Identities=13% Similarity=0.049 Sum_probs=151.8 Q ss_pred CCCCcccceeeeechhHHHHHHHHHhhhh-hhhhhcccceeeecCCcccceeEEEeecccceEecccccCcccceecccc Q lcl|NC_017976. 1 MGTKNQQLAAKTYQKQFKEMLQAVFSHQA-YFADFFGGGIEALDGVQNNETAFYVKTSDLPVVVGTGYNTDANVGFGTGT 79 (296) Q Consensus 1 m~t~Nnn~a~r~Y~kq~~~ll~~vf~~qa-~F~~~fgg~lQ~lDGVqnN~tafsvKtnd~pVvvg~~Y~td~NvaFGtGT 79 (296) |+| .--..|.+||..-...+|+-+. -|+++ .+....+...+++=+...++.+++= . +....-|+.++ T Consensus 13 Ms~----~i~~~fv~qy~~~v~~~~qq~~s~L~~t----V~~~~~~~~~~~~~~~~~~~~~~~~-~---~~~~~~~~d~~ 80 (322) T protein:vir:10 13 IAG----DIDQAFVQTYETTLRILSQQKSAKLKQY----CQHKNESSESHNWETLASMDPDAVK-R---KRSRQQSADGT 80 (322) T ss_pred eec----hhhhHHHHHHHHHHHHHHHHhhhhhhcc----cccccccccccceeecccccccccc-c---ccccccccCcc Confidence 666 2356799999999999987654 33333 7777788888887788777777762 1 12222233322 Q ss_pred C---Cccc-ccceeEEEEeccccccccCch-hhhccccccccCCHhHHHHHHHhhHHHHHHHHHHHHHhHHHhhhhcchh Q lcl|NC_017976. 80 S---KSSR-FGDRQEIIYIDTPVPYTWGYN-WHEGIDRYTVNNSMESAMADRVELQAQAKTMLFDKKHAEFIVANAGKTE 154 (296) Q Consensus 80 g---~s~R-FG~rkEIiy~DtdVpY~~~~a-iHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~~~gk~ls~~A~~t~ 154 (296) - -.+. ++.|. +.-. +|.|+ .=+.+|+.-.+=|+.+..++ .|+-|..|-.+..+=.++...|.... 
T Consensus 81 ~dtp~~~~~~~~r~-~~~~------d~~~~~~VDd~D~~k~~~D~~~~~~~---~~a~AL~R~~D~~I~~a~~g~a~~~~ 150 (322) T protein:vir:10 81 YPTPVNNKPFAKRR-TNVD------TYDTGHVVEQEDISQMLLDPNSALIT---SQAYAMARKTDDLIIAGAWKPASIKG 150 (322) T ss_pred cCCCccccccceEE-Eeec------ccccceecchHHHHHhhcCchHHHHH---HHHHHhhhHHHHHHHhhhhccccccc Confidence 1 1111 34444 3322 23343 23678888899999999987 67888888888877666666664322 Q ss_pred h---------------hhhcchhHHHHHHHHhhhhhcceeeeeeEEEEECchhhhhhhccccccccccceee-eccCc-c Q lcl|NC_017976. 155 A---------------LTAYDEAAVLKLFNNLSAYYINIEAIGTKVAKVGPELYNIIVDHRLTTKEKASGAN-VDSNE-I 217 (296) Q Consensus 155 ~---------------l~~~~~~~V~klFn~~~~~yvn~ev~~~~~ayV~~evYNaIvD~~l~Ts~K~Ss~N-iD~ng-i 217 (296) . -..++-+.+.+++..+.+..++-+ ++..+=|+|+-|+.|.--+-.|++--.+++ +-.+| + T Consensus 151 ~gt~v~~~ss~~i~~g~~g~t~~kl~~a~~~l~~~dvp~d--~~R~~vv~p~~~~~LL~d~~~ts~D~~~~~~l~~~G~i 228 (322) T protein:vir:10 151 TGQPVEFLATQEIGDGTKPISFDYVTEITERFLENEIEPE--VSKVIVIGPTQARKLLQITEATSADYTSAMDLQSKGII 228 (322) T ss_pred cccccccCCCcccccCccchhHHHHHHHHHHHHhcCCCCC--CCeEEEeCHHHHHHHhcchhhhhhhcccchhhhhcCee Confidence 1 013445556666666666555332 345678899999998865555544333333 33567 4 Q ss_pred eeecCeEE---EEch----HHHhcCC----------eEEEeecceeeeccceEEEEEee-ccCccceeeeecccccccCC Q lcl|NC_017976. 218 VKFKGFLI---EEVP----QAKLGAN----------AALVYIKGVGKAFTGITTARTIE-SEDFDGVAFQGAGKAGEFIL 279 (296) Q Consensus 218 ~~fKgf~l---~e~p----~~y~qg~----------~aifs~dnIg~af~GI~taRtie-SEDFdGVaLQgAgK~G~~Il 279 (296) =+|=||.+ +.+| .++-+|. ...+....||.+--=.-+++.-+ .+-+....+-+...+|-=.+ T Consensus 229 g~~lGf~~i~s~~lp~~~~t~~~~~~~~~~~~~~~~~~a~~k~Av~~a~~~dv~~~i~~~~~~~~a~~I~~~~~~Ga~ri 308 (322) T protein:vir:10 229 TNWMGYTWIVSTRLDKFDPTQWGMAAEDGPQGDEIWCIAMTDMALGYHSCKDIWTKVAEDPSASFAWRIYSAFTADCVRV 308 (322) T ss_pred eeeeeEEEEEeccCCccccccccccccCCCCccceeEEEEecCceeEEEeeeeeEEeeccCCcchhhhhhhhhhhCceEe Confidence 45666665 4444 2222211 12444455554421111223222 11122333445556666677 Q ss_pred CCCcceEEEEeccc Q lcl|NC_017976. 
280 DDNKPAVVKVTAPT 293 (296) Q Consensus 280 ddNKkAI~k~t~~~ 293 (296) |+++-.-+...-.- T Consensus 309 ~~~gVv~i~~~e~~ 322 (322) T protein:vir:10 309 EDEHIFKLRLKNSL 322 (322) T ss_pred ccCcEEEEEEeccC Confidence 88777666664443 No 47 >protein:vir:3613 Length: 272 # NCBI annotation: MHP # Family: family:all:522 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112699;genbank:gi:13786567;genbank:GeneID:921035 Probab=89.25 E-value=0.027 Score=29.15 Aligned_cols=258 Identities=13% Similarity=0.076 Sum_probs=139.1 Q ss_pred CCCCcccceeeeechhHHHHHHHHHhhhhhhhhhcccceeeecCCcccceeEEEeecccceE--eccc--ccCcccceec Q lcl|NC_017976. 1 MGTKNQQLAAKTYQKQFKEMLQAVFSHQAYFADFFGGGIEALDGVQNNETAFYVKTSDLPVV--VGTG--YNTDANVGFG 76 (296) Q Consensus 1 m~t~Nnn~a~r~Y~kq~~~ll~~vf~~qa~F~~~fgg~lQ~lDGVqnN~tafsvKtnd~pVv--vg~~--Y~td~NvaFG 76 (296) |+..---++-=+--.-|..+++.=|.++..|.+. +--...+.|..-+ | -.+|.. +|+. |..+...+.+ T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~~~~~~~-~~~~~~l~g~~G~-t------i~iP~~~~~gda~~~~eg~~i~~~ 72 (272) T protein:vir:36 1 MSKQKTTLADLVNPEVLAPIVSYELNKALRFAPL-AQVDTTLQGQPGN-T------LKFPAFTYIGDAADVAEGGEISLD 72 (272) T ss_pred CCCcceehhhhhchHHHHHHHHHHHHhhhhhccc-cccccccccCCCC-E------EEEeeeccCccccccCCCCccChh Confidence 8753322222222334666776667777666553 2212333332211 1 122221 1321 2212222211 Q ss_pred cccCCcccccceeEEEEeccccccccCchhhhccccccccCCHhHHHHHHHhhHHHHHHHHHHHHHhHHHhhhhcchhhh Q lcl|NC_017976. 77 TGTSKSSRFGDRQEIIYIDTPVPYTWGYNWHEGIDRYTVNNSMESAMADRVELQAQAKTMLFDKKHAEFIVANAGKTEAL 156 (296) Q Consensus 77 tGTg~s~RFG~rkEIiy~DtdVpY~~~~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~~~gk~ls~~A~~t~~l 156 (296) .=| .++.+-.| + -+-+.|.+.. +|+..-.-|+ +.+..+.++.+|.|.+++.+-..|+....+. 
- T Consensus 73 ~lt-----~~~~~~~i--~---~~~k~~~vtD-~~~~~~~~d~---~~~~~~~~a~~~a~~~d~~i~~~l~~~~~~~--~ 136 (272) T protein:vir:36 73 KIG-----TTTKSVTI--K---KAAKGTEITD-EAALSGYGDP---IGESNKQLGLSLANKVDDDLLSAAKTTSQTV--S 136 (272) T ss_pred hcC-----CcceeEee--e---hhhccccccH-HHHhhccchH---HHHHHHHHHHHHHHHHHHHHHHHhccccccc--c Confidence 111 22222111 0 1233455533 6666666665 4555566778999999998877776554443 2 Q ss_pred hhcchhHHHHHHHHhhhhhcceeeeeeEEEEECchhhhhhhccccccccccce-eeeccCc-ceeecCeEEEEchHHHhc Q lcl|NC_017976. 157 TAYDEAAVLKLFNNLSAYYINIEAIGTKVAKVGPELYNIIVDHRLTTKEKASG-ANVDSNE-IVKFKGFLIEEVPQAKLG 234 (296) Q Consensus 157 ~~~~~~~V~klFn~~~~~yvn~ev~~~~~ayV~~evYNaIvD~~l~Ts~K~Ss-~NiD~ng-i~~fKgf~l~e~p~~y~q 234 (296) ...+.+.|.+....+..... .+..+.|+|.+|..|.-.+.......+. .++-.|| |-+|-|+.+-+... +- T Consensus 137 ~~~~~d~i~~A~~~lgd~~~-----~~~~ivv~p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~~--~p 209 (272) T protein:vir:36 137 TKANVDGVQAALDIFNDEDA-----QAYVLIVNPKDAAKIRKDANAKNIGSEVGANALINGTYADVLGAQIVRSKK--LA 209 (272) T ss_pred ccccHHHHHHHHHHhhhcCC-----CceEEEEcHHHHHHHhcccccccccccccccceeeeccceecCeeEEEeCC--CC Confidence 45666677777666654433 3567999999999997766655554332 3344455 45899998866442 33 Q ss_pred -CCe----EEEeecceee---eccceEEEEEeeccCccceeeeecccccccCCCCCcceEEEEecccC Q lcl|NC_017976. 235 -ANA----ALVYIKGVGK---AFTGITTARTIESEDFDGVAFQGAGKAGEFILDDNKPAVVKVTAPTV 294 (296) Q Consensus 235 -g~~----aifs~dnIg~---af~GI~taRtieSEDFdGVaLQgAgK~G~~IlddNKkAI~k~t~~~~ 294 (296) +.. .+|.+..+|. .-+-+++-|-+. 
.-.-.|-+---||..++++.|-+.+ |-+-+ T Consensus 210 ~~~~~~~~~~~~~gA~~~~~~~~~~vE~~R~~~---~~~d~i~~~~~y~~~v~~~~~vv~~--t~~g~ 272 (272) T protein:vir:36 210 EGSALMFKIVSNSPALKLVLKRGVQVETDRDIV---TKTTVITADEHYAAYLYDLTKVVNI--TFTGV 272 (272) T ss_pred CCceeEEEEEecccceeeeecCCcccccccchh---hcCcEEEEEEEEEEEEEcCccEEEE--eecCC Confidence 443 4566777762 233466666433 3345677777799999887765544 44444 No 48 >protein:vir:7019 Length: 401 # NCBI annotation: major capsid protein # Family: family:all:2806 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853592;genbank:gi:31711674;genbank:GeneID:1481800 Probab=80.21 E-value=0.097 Score=26.14 Aligned_cols=249 Identities=12% Similarity=0.111 Sum_probs=130.8 Q ss_pred CCCCccc---------ceeeeechhHHHHHHHHHhhhhhhhhhcccceeeecCCcccceeEEEeecccceEecc----cc Q lcl|NC_017976. 1 MGTKNQQ---------LAAKTYQKQFKEMLQAVFSHQAYFADFFGGGIEALDGVQNNETAFYVKTSDLPVVVGT----GY 67 (296) Q Consensus 1 m~t~Nnn---------~a~r~Y~kq~~~ll~~vf~~qa~F~~~fgg~lQ~lDGVqnN~tafsvKtnd~pVvvg~----~Y 67 (296) |.+.|+. -+.-+|-|+|-|.+.+-|+.++.|++.+= ++++.|= |+--.|.+ |+ ++ T Consensus 1 Ms~~n~~t~~~~~~sg~~~al~Le~f~GeV~taF~~~si~~~~~~--vRti~~g---------kS~qf~~~-G~s~~~~~ 68 (401) T protein:vir:70 1 MSTPNNLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENIMSYFD--VQTVTGT---------NTVSNKYL-GETELQVL 68 (401) T ss_pred CCCCccccccccccccchhHhHHhHhcchHHHHHHHHhhhcccce--eeeeccc---------ceEEEEEe-eeeEeeee Confidence 8776653 35678999999999999999999998764 3443321 22223332 22 23 Q ss_pred cCcccceeccccCCcccccceeEEEEeccccccccC-chhhhccccccccCCHhHHHHHHHhhHHHHHHHHHHHHHhHHH Q lcl|NC_017976. 68 NTDANVGFGTGTSKSSRFGDRQEIIYIDTPVPYTWG-YNWHEGIDRYTVNNSMESAMADRVELQAQAKTMLFDKKHAEFI 146 (296) Q Consensus 68 ~td~NvaFGtGTg~s~RFG~rkEIiy~DtdVpY~~~-~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~~~gk~l 146 (296) ..++.- .++|.---|-+|-.|+-+--... |-|||=-..|+ -+. 
++=...+.+|-.|+||..+...| T Consensus 69 ~pG~~l-------d~~~~~~dK~~ItID~lL~a~~~V~dlDe~q~~yD---~vR---se~s~e~G~ALA~~~Dq~iiq~i 135 (401) T protein:vir:70 69 APGQSP-------AATSTQADKNQLVIDATVIARNTVAHLHDVQGDID---SLK---PKLATNQAKQLKRMEDEMLIQQM 135 (401) T ss_pred cCCCCc-------CCCCcccccEEEEeCceeehhhhhhhHHHHHhccc---ccc---hHHHHHHHHHHHHHHHHHHHHHH Confidence 333332 12344344557888887644332 23344333332 023 33345577888899998775555 Q ss_pred hhhh-------cchh---------hh------hhcchh----HHHHHHHHhhhhhcceeeeeeEEEEECchhhhhhhccc Q lcl|NC_017976. 147 VANA-------GKTE---------AL------TAYDEA----AVLKLFNNLSAYYINIEAIGTKVAKVGPELYNIIVDHR 200 (296) Q Consensus 147 s~~A-------~~t~---------~l------~~~~~~----~V~klFn~~~~~yvn~ev~~~~~ayV~~evYNaIvD~~ 200 (296) -..+ +... ++ ...+.+ .+..++..+.+++|..+ -.+.+..|+.|+.|.+|+ T Consensus 136 ~~aa~ana~~~~~~p~~~~~G~~i~v~~~~~~~~~~~~~l~~ai~dA~~~LdEkdVP~~---r~vvl~pp~~Ys~Ll~~d 212 (401) T protein:vir:70 136 MLGGIANTQAKRTNPRVKGHGFSINVEVAEGEALVNPQYVMAAVEFALEQQLEQEVDIS---DVAILMPWRYFNVLRDAD 212 (401) T ss_pred HHhccccccccccCCCcCCCceEEeccccccccccCHHHHHHHHHHHHHHHHhcCCCcc---ceEEEcCHHHHHHHHhcC Confidence 2211 0000 01 111222 24566777777777632 467777999999999997 Q ss_pred -ccccccc-ce--eeeccCcceeecCeEEEEchHHHhcCCe---EEEeecceeeeccceEEEEEeeccCccceeeeeccc Q lcl|NC_017976. 201 -LTTKEKA-SG--ANVDSNEIVKFKGFLIEEVPQAKLGANA---ALVYIKGVGKAFTGITTARTIESEDFDGVAFQGAGK 273 (296) Q Consensus 201 -l~Ts~K~-Ss--~NiD~ngi~~fKgf~l~e~p~~y~qg~~---aifs~dnIg~af~GI~taRtieSEDFdGVaLQgAgK 273 (296) +....=+ |+ ..+. -.+++--||.|.|.|.-=+.+.. ---|+.+-|.+|. -.-||... +|. T Consensus 213 ~L~nrd~~~s~~g~~~~-G~v~~vaGv~Vv~SnnlP~~a~~it~~~ls~a~~G~~y~--------~~~d~s~~--~~v-- 279 (401) T protein:vir:70 213 RIVDKTYTISQSGATIQ-GFTLSSYNCPVIPSNRFPKYSQGQTHHLLSNEDNGYRYD--------PLPAMNGA--IAV-- 279 (401) T ss_pred cccchhhccccCCcccc-ceEEEEeceEEEeeccccccccccccccccccCCCccCC--------CCccccce--eEE-- Confidence 4432211 11 1122 22456778888777543221110 1113344455554 00111111 111 Q ss_pred ccccCCCCCcceEEEEecccCCC Q lcl|NC_017976. 
274 AGEFILDDNKPAVVKVTAPTVTP 296 (296) Q Consensus 274 ~G~~IlddNKkAI~k~t~~~~~p 296 (296) .|.+ .|+..+.+...|+ T Consensus 280 --~f~~----~Av~tvk~~~lt~ 296 (401) T protein:vir:70 280 --LFTA----DALLVGRSIDVTG 296 (401) T ss_pred --EEeh----hheEEEEeecccc Confidence 3332 2677787777777 No 49 >protein:vir:99675 Length: 324 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249589;genbank:gi:68299740;genbank:GeneID:3799990 Probab=79.65 E-value=0.1 Score=26.01 Aligned_cols=234 Identities=13% Similarity=0.118 Sum_probs=102.8 Q ss_pred ceeeecCCcccceeEEEeecccceEecccccCcccceeccccCCccc--ccceeEEEEeccccccccCchhhhccccccc Q lcl|NC_017976. 38 GIEALDGVQNNETAFYVKTSDLPVVVGTGYNTDANVGFGTGTSKSSR--FGDRQEIIYIDTPVPYTWGYNWHEGIDRYTV 115 (296) Q Consensus 38 ~lQ~lDGVqnN~tafsvKtnd~pVvvg~~Y~td~NvaFGtGTg~s~R--FG~rkEIiy~DtdVpY~~~~aiHEGiDr~TV 115 (296) -++.+.| .+...+..+= .+-+ .+|..+... ..++ ...-+-+|-.|+..-+.+ +=+-||+.+. T Consensus 1 ~vr~i~~-g~s~~~~~iG----~~~~-~~~~~G~~l-------~~~~~~~~~~e~~itID~~l~~~~---~VdDiD~~qa 64 (324) T protein:vir:99 1 MTRTITS-GKSAQFPVMG----RTKA-RYLKQGQSL-------DDGREDIKHTEKVITIDGLLTTDV---LIYDIEDAMN 64 (324) T ss_pred Ceeeeec-CceEEEeeee----eeEe-ccccCCCCc-------CCCcCCcCcccEEEEecchhhhhh---hhhhHHHHhc Confidence 2333333 2222111110 0112 112222221 0111 111223455555443332 2246777776 Q ss_pred cCCHhHHHHHHHhhHHHHHHHHHHHHHhHHHhhhh-----------------------cchhhhhhcchhHHHHHHHHhh Q lcl|NC_017976. 116 NNSMESAMADRVELQAQAKTMLFDKKHAEFIVANA-----------------------GKTEALTAYDEAAVLKLFNNLS 172 (296) Q Consensus 116 Nndl~aavAdRl~Lqa~Ak~~~~n~~~gk~ls~~A-----------------------~~t~~l~~~~~~~V~klFn~~~ 172 (296) +=|+ ..+...-+++|-.+.+|+.+-..|..-+ +..+. ...+.+.+.+.|-.+. 
T Consensus 65 ~~Dl---r~e~s~~~G~aLA~~~Dq~i~~~~a~~~~~~a~~~~~~~~~~g~~~~~~~~~~~~~-~~~~~~~~~dai~~a~ 140 (324) T protein:vir:99 65 HYDV---RSEYSTQMGEALAMAADVANYAEMAKLVNSRKETTNENIEGLGAASLVKITGKKED-PAKYGTQVIQALTYAR 140 (324) T ss_pred Cccc---hhHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccCCcccCCccceecccccccc-cccCHHHHHHHHHHHH Confidence 6564 4566677889999999976643332111 01111 1122233444443344 Q ss_pred hhhcceee-eeeEEEEECchhhhhhhccccccccccceee-eccCcceeecCeEEEEchHHHhc-C-------------- Q lcl|NC_017976. 173 AYYINIEA-IGTKVAKVGPELYNIIVDHRLTTKEKASGAN-VDSNEIVKFKGFLIEEVPQAKLG-A-------------- 235 (296) Q Consensus 173 ~~yvn~ev-~~~~~ayV~~evYNaIvD~~l~Ts~K~Ss~N-iD~ngi~~fKgf~l~e~p~~y~q-g-------------- 235 (296) +......| .....++|+|+.|.+|.|++..+...-.+.+ +=.-.|.+.-||.|-+.+.--.. + T Consensus 141 ~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~~~~~~~~~~~G~V~~i~Gf~V~~Sn~lp~~~~t~~~~a~~~~~~~~ 220 (324) T protein:vir:99 141 AAFAKKYIPAGDRTFYTDPDTYSAILAALMPNAANYAALIDPETGNIRNVMGFEVVETPHMTAQMVTNPTDAFDGTGHIF 220 (324) T ss_pred HHHhhcCCCCCCCEEEeChHHHHHHhhcccccccccccccceecceEEEEeceEEEecCCcccccccccccccccccccc Confidence 44433333 2458899999999999999887765333333 33335678899999876543222 1 Q ss_pred -----------------C-e-EEEeecceeeeccceEEEEEeeccCc-----cceeeeecccccccCCCCCcceEEEEec Q lcl|NC_017976. 236 -----------------N-A-ALVYIKGVGKAFTGITTARTIESEDF-----DGVAFQGAGKAGEFILDDNKPAVVKVTA 291 (296) Q Consensus 236 -----------------~-~-aifs~dnIg~af~GI~taRtieSEDF-----dGVaLQgAgK~G~~IlddNKkAI~k~t~ 291 (296) . + .+|.++-+| ..-+.-+..|-| -|=.+-|-=-||-=++.-.-.+.++... T Consensus 221 ~~~~~~~~~~ky~~d~~~~~gl~~~~~a~~-----tv~~~~~~~e~~~~~~~~~d~i~~~~a~G~~~lRPe~a~~v~l~~ 295 (324) T protein:vir:99 221 PATGDSTTTGKMTVGADNVVGLFVHRSAVA-----TLKLKDMALERARRPEYQADQIIAKYAMGHGGLRPEAVGAIIFED 295 (324) T ss_pred ccccccccccccccccCceeEEEEehhheE-----EEeeecceecceechhhHHHhhhhhhhhcCcccccceEEEEEEcc Confidence 0 0 223332221 111222222222 2222222222333344322233343322 Q ss_pred c---cCCC Q lcl|NC_017976. 292 P---TVTP 296 (296) Q Consensus 292 ~---~~~p 296 (296) . 
.++| T Consensus 296 ~~~~~~~~ 303 (324) T protein:vir:99 296 GETPAVAP 303 (324) T ss_pred Cccccccc Confidence 2 2445 No 50 >protein:vir:105645 Length: 400 # NCBI annotation: putative major capsid protein # Family: family:all:2806 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425009;genbank:gi:83571757;uniprot:Q2WC43;genbank:GeneID:3837286 Probab=79.14 E-value=0.11 Score=25.90 Aligned_cols=246 Identities=12% Similarity=0.102 Sum_probs=136.1 Q ss_pred CCCCcc---------cceeeeechhHHHHHHHHHhhhhhhhhhcccceeeecCCcccceeEEEeecccceEecc----cc Q lcl|NC_017976. 1 MGTKNQ---------QLAAKTYQKQFKEMLQAVFSHQAYFADFFGGGIEALDGVQNNETAFYVKTSDLPVVVGT----GY 67 (296) Q Consensus 1 m~t~Nn---------n~a~r~Y~kq~~~ll~~vf~~qa~F~~~fgg~lQ~lDGVqnN~tafsvKtnd~pVvvg~----~Y 67 (296) |.+.|+ .-+.-+|-|+|-|.+.+-|+.++.|++.+= ++++.|= |+--.|.+ |+ ++ T Consensus 1 Ms~~n~~t~p~~~gsg~~~aL~Le~f~GeV~taF~~~si~~~~~~--vRtI~~g---------kS~qf~~l-G~s~a~y~ 68 (400) T protein:vir:10 1 MSTPNNLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENIMSYFD--VQTVTGT---------NTVSNKYL-GETELQVL 68 (400) T ss_pred CCCCccccccccccccchhhhHHhHhcchHHHHHHHHhhhcccce--eeeeccc---------ceEEEEEe-eeeEEeee Confidence 877665 345678999999999999999999998764 3443321 22222332 22 23 Q ss_pred cCcccceeccccCCcccccceeEEEEeccccccccC-chhhhcccccc-ccCCHhHHHHHHHhhHHHHHHHHHHHHHhHH Q lcl|NC_017976. 68 NTDANVGFGTGTSKSSRFGDRQEIIYIDTPVPYTWG-YNWHEGIDRYT-VNNSMESAMADRVELQAQAKTMLFDKKHAEF 145 (296) Q Consensus 68 ~td~NvaFGtGTg~s~RFG~rkEIiy~DtdVpY~~~-~aiHEGiDr~T-VNndl~aavAdRl~Lqa~Ak~~~~n~~~gk~ 145 (296) ..++.. .| + +.---|-+|-.|+-+--... |-|||=.+.|+ | . ++-...+.+|-.++||..+=.. 
T Consensus 69 ~pG~~l---dg--~--~~~~dk~~ItIDtLL~a~~~V~dlDd~q~~yD~v----R---se~s~e~G~ALA~~~Dq~iiq~ 134 (400) T protein:vir:10 69 APGQSP---AA--T--STQADKNQLVIDATVIARNTVAHLHDVQGDIDSL----K---PKLATNQAKQLKKMEDEMLIQQ 134 (400) T ss_pred cCCCCc---CC--C--CcccCcEEEEeCceeeecchhhhHHHHhhccccc----c---HHHHHHHHHHHHHHHHHHHHHH Confidence 333332 12 2 33333667888887654433 45566555554 4 2 4444567788888888754222 Q ss_pred Hhhh--h---cc-------hh----hh---h---hcchh----HHHHHHHHhhhhhcceeeeeeEEEEECchhhhhhhcc Q lcl|NC_017976. 146 IVAN--A---GK-------TE----AL---T---AYDEA----AVLKLFNNLSAYYINIEAIGTKVAKVGPELYNIIVDH 199 (296) Q Consensus 146 ls~~--A---~~-------t~----~l---~---~~~~~----~V~klFn~~~~~yvn~ev~~~~~ayV~~evYNaIvD~ 199 (296) +-.. | .. .. ++ + ..+.+ .+..++..+.+++|..+ -.+.++.|+.|++|.+| T Consensus 135 i~~a~~a~t~~~~~~~~g~~~g~s~~v~~~~~~~~~~~~~l~~A~~~A~~~LdEkdVP~~---d~vvl~pp~~Ys~Ll~~ 211 (400) T protein:vir:10 135 MLLGGIANTQAKRTNPRVKGHGFSVNVEVNEGEALVNPQYVMAAVEFALEQQLEQEVDIS---DVAILMPWRYFNVLRDA 211 (400) T ss_pred HHHhcccccccccccCCccccccceeecccccccccCHHHHHHHHHHHHHHHHhcCCCcc---ceEEEcCHHHHHHHHhC Confidence 2111 0 00 00 00 0 01212 24456677777777733 46788899999999998 Q ss_pred c-cccccccceee--eccCcceeecCeEEEEchHHHhc---CCeEEEeecceeeecc---ceEEEEEeeccCccceeeee Q lcl|NC_017976. 200 R-LTTKEKASGAN--VDSNEIVKFKGFLIEEVPQAKLG---ANAALVYIKGVGKAFT---GITTARTIESEDFDGVAFQG 270 (296) Q Consensus 200 ~-l~Ts~K~Ss~N--iD~ngi~~fKgf~l~e~p~~y~q---g~~aifs~dnIg~af~---GI~taRtieSEDFdGVaLQg 270 (296) + |....=+-+.+ .=.-.+++.-|+.|.|.|.-=+. ..-.-.|+.+-|.+|- +...++.+ T Consensus 212 dkLvnrdf~~s~~g~~~~g~v~~v~Gv~Iv~Sn~lP~~a~~~~~~~lS~a~~G~~y~~t~d~s~~~av------------ 279 (400) T protein:vir:10 212 DRIVDKSYTISQSGATIQGFVLSSYNCPVIPSNRFPKYSQGQKHHLLSNEDNGYRYDPIAEMNGAIAV------------ 279 (400) T ss_pred CcccchhccccCCCccccceEEEEeceEEEeeCcCCcccCcccccccccCCCCccCCccccccceeEE------------ Confidence 6 55444321111 12223567888888887654221 1123456666666664 22222221 Q ss_pred cccccccCCCCCcceEEEEecccCCC Q lcl|NC_017976. 
271 AGKAGEFILDDNKPAVVKVTAPTVTP 296 (296) Q Consensus 271 AgK~G~~IlddNKkAI~k~t~~~~~p 296 (296) .|.++ |+..+.+...|+ T Consensus 280 -----~F~~s----Av~tvk~~~lt~ 296 (400) T protein:vir:10 280 -----LFTAD----ALLVGRSIDVIG 296 (400) T ss_pred -----EEehh----heEEEEeecccc Confidence 33332 777788877777 No 51 >protein:vir:95107 Length: 270 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240822;genbank:gi:66394683;genbank:GeneID:5133901 Probab=67.97 E-value=0.24 Score=23.94 Aligned_cols=260 Identities=13% Similarity=0.063 Sum_probs=128.5 Q ss_pred CCCCc-ccceeeeechhHHHHHHHHHhhhhhhhhhcccceeeecCCcccceeEEEeecccceEecccccCcccceecccc Q lcl|NC_017976. 1 MGTKN-QQLAAKTYQKQFKEMLQAVFSHQAYFADFFGGGIEALDGVQNNETAFYVKTSDLPVVVGTGYNTDANVGFGTGT 79 (296) Q Consensus 1 m~t~N-nn~a~r~Y~kq~~~ll~~vf~~qa~F~~~fgg~lQ~lDGVqnN~tafsvKtnd~pVvvg~~Y~td~NvaFGtGT 79 (296) |+.+. .++ +--.-|..+++.=+.++..|.+.- .-.-.|.|..-+. -.+|.- + |..|+.+ -..|+ T Consensus 1 Ma~T~~~d~---I~Pev~~~~V~e~~~~~~~~~~~~-~~d~~L~g~~G~t-------i~~P~~--~-~igdae~-~~eg~ 65 (270) T protein:vir:95 1 MTQTKKANL---INPEVLANVVSAQMQNAIRFTPYA-VTDDTLVGQPGDT-------ITRPKY--A-YIGAAED-LQEGV 65 (270) T ss_pred CCceehhhh---cchHHHHHHHHHHHHhHHhhcccc-ccccccCCCCCCE-------EEeeee--c-CCCcccc-ccCCC Confidence 55321 111 234468888888888888886632 2122222221111 111211 0 1111110 11111 Q ss_pred CCcccccceeEEEEecccc---ccccCchhhhccccccccCCHhHHHHHHHhhHHHHHHHHHHHHHhHHHhhhhcchhhh Q lcl|NC_017976. 80 SKSSRFGDRQEIIYIDTPV---PYTWGYNWHEGIDRYTVNNSMESAMADRVELQAQAKTMLFDKKHAEFIVANAGKTEAL 156 (296) Q Consensus 80 g~s~RFG~rkEIiy~DtdV---pY~~~~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~~~gk~ls~~A~~t~~l 156 (296) .- +..++-.....+ -+-..|.+.. +++++--.|+ +.+..+.++.+|.|.+++.+=..|...... 
.+ T Consensus 66 ~i-----~~~~lt~~~~~a~i~~~gk~~~itD-~a~~~~~~dp---~~~~~~q~a~~~a~~~d~~li~~l~~a~~~-~~- 134 (270) T protein:vir:95 66 AM-----DTTQMSMTTTKVTVKETGKAVEVTQ-TAIITNVNGT---LQEASRQLAMSLADKVEIDYIAELNKSKQT-AT- 134 (270) T ss_pred cc-----chhhcccchheeeeehhhCcceecH-HHHhhhccch---HHHHHHHHHHHHHHHHHHHHHHHhcccccc-cc- Confidence 11 011111111110 0122343332 2444444455 566667788999999887653333221111 01 Q ss_pred hhcchhHHHHHHHHhhhhhcceeeeeeEEEEECchhhhhhhccccccccccceeeeccC-cceeecCeEEEEchHHHhc- Q lcl|NC_017976. 157 TAYDEAAVLKLFNNLSAYYINIEAIGTKVAKVGPELYNIIVDHRLTTKEKASGANVDSN-EIVKFKGFLIEEVPQAKLG- 234 (296) Q Consensus 157 ~~~~~~~V~klFn~~~~~yvn~ev~~~~~ayV~~evYNaIvD~~l~Ts~K~Ss~NiD~n-gi~~fKgf~l~e~p~~y~q- 234 (296) ...+.+ .|..+...+ +-|...+-.++|+|.+|..+--.++.+..+.+. |+=.| .|-.|.|+.+-. .+..-- T Consensus 135 ~~~t~~----~~~dA~~~l-gd~~~~~~~i~vhs~~~~~Lrk~~~~~~~~~~~-~~~~~G~ig~~~G~~Viv-~s~~~~~ 207 (270) T protein:vir:95 135 VSADAT----GILDAIEVF-NSENDEDYVLYVNPKDYNKLVKSLFKVGGNVQD-RAISKGDLVEIVGVSDIV-KSKRVSE 207 (270) T ss_pred cccCHH----HHHHHHHHh-ccccCCCcEEEEcHHHHHHHHhhhccccccccc-chhcccccceecceeEEE-eCCCCCc Confidence 122333 334444443 335566678999999999987666666555543 33333 366789986421 121112 Q ss_pred CCeEEEeecceee---eccceEEEEEeeccCccceeeeecccccccCCCCCcceEEEEecccCCC Q lcl|NC_017976. 235 ANAALVYIKGVGK---AFTGITTARTIESEDFDGVAFQGAGKAGEFILDDNKPAVVKVTAPTVTP 296 (296) Q Consensus 235 g~~aifs~dnIg~---af~GI~taRtieSEDFdGVaLQgAgK~G~~IlddNKkAI~k~t~~~~~p 296 (296) +...+|.+..||. 
.-+-||+-|-+..- =-.|=+-=-||.++.++-|...++...+-+|- T Consensus 208 ~~~~l~~~gAi~~~~~~~~~vEtdRd~~~~---~d~i~~~~~y~v~~~~~skvv~~t~~~a~~~~ 269 (270) T protein:vir:95 208 NTAFLQRYGAMEIVNKKKPEAYTDFDILKR---THLLSTNYHYSVNLKDETGVVKVTFKPSGSLE 269 (270) T ss_pred eeEEEEeccceeeeecCCceeeeccchhhc---ccEEEeeeEEEEEEEccceEEEEEecCCCCcC Confidence 5678888887772 12347777765541 22555666689999998887776664333333 No 52 >protein:vir:103323 Length: 364 # NCBI annotation: major capsid-like protein # Family: family:all:2806 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039668;genbank:gi:125999997;genbank:GeneID:4818399 Probab=55.63 E-value=0.48 Score=22.36 Aligned_cols=261 Identities=13% Similarity=0.063 Sum_probs=126.1 Q ss_pred CCCCcc---------cceeeeechhHHHHHHHHHhhhhhhhhhcccceeeecCCcccceeEEEeecccceE----ecccc Q lcl|NC_017976. 1 MGTKNQ---------QLAAKTYQKQFKEMLQAVFSHQAYFADFFGGGIEALDGVQNNETAFYVKTSDLPVV----VGTGY 67 (296) Q Consensus 1 m~t~Nn---------n~a~r~Y~kq~~~ll~~vf~~qa~F~~~fgg~lQ~lDGVqnN~tafsvKtnd~pVv----vg~~Y 67 (296) |...|. .-+.-+|-|+|-|.+.+-|+.++.|++.+=- +++.| - |+--.|.+ + .+| T Consensus 1 ms~~n~~t~~~~~~~~~~~al~le~f~geV~taf~~~s~~~~~~~~--rti~~----g-----kS~q~~~iG~~~~-~~~ 68 (364) T protein:vir:10 1 MSNPNVLTQPAVSASGEVDSLLIEKFNNRVHEQYLKGENLLQWFDV--QEVVG----T-----NSVSNKYIGETEL-QVL 68 (364) T ss_pred CCCcccccccccccccchhhhhhhhhhhhHHHHHHHHHhhcCccee--eeecc----c-----ceEEeeeeeeeEE-eee Confidence 655442 3456789999999999999999999977643 33221 1 22223332 3 223 Q ss_pred cCcccceeccccCCcccccceeEEEEeccccccccCchhhhccccccccCC-HhHHHHHHHhhHHHHHHHHHHHHHhHHH Q lcl|NC_017976. 68 NTDANVGFGTGTSKSSRFGDRQEIIYIDTPVPYTWGYNWHEGIDRYTVNNS-MESAMADRVELQAQAKTMLFDKKHAEFI 146 (296) Q Consensus 68 ~td~NvaFGtGTg~s~RFG~rkEIiy~DtdVpY~~~~aiHEGiDr~TVNnd-l~aavAdRl~Lqa~Ak~~~~n~~~gk~l 146 (296) ..++.. .|+ +.-.-|=+|-.|+-. |.--+. .=||.+.-+=| +.+.++. 
.+++|-.|++|..+-..+ T Consensus 69 ~~G~~l---d~~----~~~~~k~~itID~ll-~a~~~V--~diDe~q~~~D~vR~e~s~---e~G~ALA~~~Dq~i~~~v 135 (364) T protein:vir:10 69 SPGKSP---DAS----PTEFDKNRLVVDTTV-IARNTV--AHFHDVQNDIDGLKSKLSV---NQAKKLKKMEDSMVIQQL 135 (364) T ss_pred ccCccc---CCC----CcccCcEEEEeccee-eechhh--hhHHHHhcCccchhHHHHH---HHHHHHHHHHHHHHHHHH Confidence 334443 222 222235588888866 322222 22455555555 4555544 457888888888774433 Q ss_pred hhhh-cchh---------------hh------hhcchhH----HHHHHHHhhhhhcceeeeeeEEEEECchhhhhhhccc Q lcl|NC_017976. 147 VANA-GKTE---------------AL------TAYDEAA----VLKLFNNLSAYYINIEAIGTKVAKVGPELYNIIVDHR 200 (296) Q Consensus 147 s~~A-~~t~---------------~l------~~~~~~~----V~klFn~~~~~yvn~ev~~~~~ayV~~evYNaIvD~~ 200 (296) ...| +..+ .+ .+.+.+. +..++..+.+++|.. ..++++|+|+.|.+|+.++ T Consensus 136 ~~aa~a~~~~~~~~~~~~~~g~~i~~~~~a~~~~~~~~~l~~ai~~a~~~LdEkdVP~---~~R~~vv~P~~y~~Ll~~~ 212 (364) T protein:vir:10 136 VLGGISNTEAIRKNPRVAGHGFSIHIVGLASSFLTSPQYMMAAIEMAMEQQTEQEVDT---SELCGLMPWTAFNCLRDAD 212 (364) T ss_pred HhhhhhcccccccCCcccCCcceeeecccCcchhhhHHHHHHHHHHHHHHHhhcCCCc---cccEEEeChHHHHHHhcCC Confidence 2221 1000 00 0111122 334556666666554 3399999999999999975 Q ss_pred c-cccc---ccceeeeccCcceeecCeEEEEchHHHhc---------------------------CC-----eEEEeecc Q lcl|NC_017976. 201 L-TTKE---KASGANVDSNEIVKFKGFLIEEVPQAKLG---------------------------AN-----AALVYIKG 244 (296) Q Consensus 201 l-~Ts~---K~Ss~NiD~ngi~~fKgf~l~e~p~~y~q---------------------------g~-----~aifs~dn 244 (296) - .... .++...++. .+.+.-||.|.|.|.-=+. ++ .++|.|+- T Consensus 213 ~lvn~d~~~~~~~~~~~G-~v~~v~Gv~Vv~Sn~lP~~~~~~~~t~~~t~h~ls~~~~g~~y~v~~d~~~~~~~~f~~~A 291 (364) T protein:vir:10 213 RIVDKSYTIAASDNTVDG-FVLKSWNTPIVPSNRFPKLSDNTEGTGNTKHHKLSNAGNGNRYDVTAGQTSAQAVLFTQDA 291 (364) T ss_pred ccccccccccCCCccccc-eeEEEeceEEEeccccccccccccccccccccccccccCCcccccccccceeEEEEEecce Confidence 3 2211 122222222 2456788888775433111 11 24555543 Q ss_pred eeeeccceEEEEEeeccCccce---eeeeccc--ccccCCCCCcceEEEEecccCCC Q lcl|NC_017976. 
245 VGKAFTGITTARTIESEDFDGV---AFQGAGK--AGEFILDDNKPAVVKVTAPTVTP 296 (296) Q Consensus 245 Ig~af~GI~taRtieSEDFdGV---aLQgAgK--~G~~IlddNKkAI~k~t~~~~~p 296 (296) +|. -.+.-+..|=|.+. .=|.-+| ||-=++.---.+++ .+..+.+| T Consensus 292 l~t-----v~~~~~t~e~~~~~~~~~~~ida~~a~G~g~lRPeaa~~i-~~~~~~~~ 342 (364) T protein:vir:10 292 LLV-----GRTISITGDIFYEKKEKTWYIDTFLAEGAIPDRWEAVAVV-TAADTAEL 342 (364) T ss_pred EEE-----EEEecceeeeeeccceeeeeeeeehcccCcccCccceEEE-EecCCCCC Confidence 331 11111122212111 1111111 23223333222222 23333344 No 53 >protein:vir:1781 Length: 221 # NCBI annotation: minor capsid protein # Family: family:all:975 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570347;genbank:gi:18640506;genbank:GeneID:932719 Probab=51.38 E-value=0.58 Score=21.87 Aligned_cols=182 Identities=13% Similarity=0.187 Sum_probs=86.0 Q ss_pred eccccccccCchhhhccccccccCCHhHHHHHHHhhHHHHHHHHHHHHHhHHHhhhhcchhh-----------hh---hc Q lcl|NC_017976. 94 IDTPVPYTWGYNWHEGIDRYTVNNSMESAMADRVELQAQAKTMLFDKKHAEFIVANAGKTEA-----------LT---AY 159 (296) Q Consensus 94 ~DtdVpY~~~~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~~~gk~ls~~A~~t~~-----------l~---~~ 159 (296) .|+...-+ .+=+=||+...+=|+-+...++ +++|-.+.+|+.+...|.+.|..... ++ .. T Consensus 1 iD~lL~a~---~~VdDiD~aqa~~dvr~e~t~e---~G~ALA~~~D~~i~~~~~~aA~~~~p~~~~~~g~~~~~~a~~t~ 74 (221) T protein:vir:17 1 MDDLLVAS---QFVYDLDEILAQWNTRSEISKQ---IGEALAIHYDERIARVLASASIAAAPVTGQDGGFSVNIGAGNTN 74 (221) T ss_pred CCcchhHH---HHHHhHHHHHhhhHHHHHHHHH---HHHHHHHHHHHHHHHHHHhhhhhcCcccccccCcceeccccccC Confidence 33322211 1123344444454544444444 56788899999998888766543110 10 01 Q ss_pred ch----hHHHHHHHHhhhhhcceeeeeeEEEEECchhhhhhhc--ccccccc-c-cceeeeccC-cceeecCeEEEEchH Q lcl|NC_017976. 160 DE----AAVLKLFNNLSAYYINIEAIGTKVAKVGPELYNIIVD--HRLTTKE-K-ASGANVDSN-EIVKFKGFLIEEVPQ 230 (296) Q Consensus 160 ~~----~~V~klFn~~~~~yvn~ev~~~~~ayV~~evYNaIvD--~~l~Ts~-K-~Ss~NiD~n-gi~~fKgf~l~e~p~ 230 (296) +. +.+.++..++.++.|-. ...+++|+|+.|-+|+- ++..++. . +|...++.- ++.+.-||.|-+.+. 
T Consensus 75 ~~~~l~dai~~a~~~LdekdVP~---~gR~~vv~P~~y~~LL~~~d~~~~n~d~~~s~g~~~~g~~i~~v~G~~V~~Snn 151 (221) T protein:vir:17 75 NAQAIVDGFFEAAAVLDERSAPM---DGRVAVLSPRQYYSLISSVDTNILNREIGNTQGDMNTGKGLYVNAGIRIYKSNV 151 (221) T ss_pred CHHHHHHHHHHHHHHHhhcCCCC---CCCEEEeCcHHHHHHHHhcCcceeeeecccccccccccceeeeecCcEEEEecc Confidence 11 33444445555555442 44789999999999983 3433332 2 244445544 488899999988764 Q ss_pred HHhc-CCeEEEeecceeeeccceEEEEEeeccCccceeeeecccccccCCCCCcceEEEEecccCCC Q lcl|NC_017976. 231 AKLG-ANAALVYIKGVGKAFTGITTARTIESEDFDGVAFQGAGKAGEFILDDNKPAVVKVTAPTVTP 296 (296) Q Consensus 231 ~y~q-g~~aifs~dnIg~af~GI~taRtieSEDFdGVaLQgAgK~G~~IlddNKkAI~k~t~~~~~p 296 (296) -=.. |.. +. +..|-.+.-.-++|.+.|-.=-=. | -+.-.+..+.+|.-.+..-| T Consensus 152 lP~~~gt~-~~-------~~ag~~~~~~~~~~~yr~~fs~~~---g-lv~~~~Avgtvkl~~~~~~~ 206 (221) T protein:vir:17 152 LASLYGTN-LV-------TDPGDATTSGENNGSYRPAITDRA---G-LVFHKEAADTVEVLLPPSRP 206 (221) T ss_pred CCcccccc-cc-------cCCccccccccccccccccccceE---E-EEEcchheeeeeeecCCCCC Confidence 3222 211 10 111211111112222222100000 1 12223344455555555555 No 54 >protein:vir:4830 Length: 397 # NCBI annotation: MPL-7201 # Family: family:all:21 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038327;genbank:gi:9634653;genbank:GeneID:1262632 Probab=33.44 E-value=1.4 Score=19.86 Aligned_cols=261 Identities=10% Similarity=0.019 Sum_probs=108.3 Q ss_pred CCCCcccceeeeechhHHHHHHHHHhhhhhhhhhcccceeeecCCcccceeEEEee-cccc--eEecccccCcccceecc Q lcl|NC_017976. 1 MGTKNQQLAAKTYQKQFKEMLQAVFSHQAYFADFFGGGIEALDGVQNNETAFYVKT-SDLP--VVVGTGYNTDANVGFGT 77 (296) Q Consensus 1 m~t~Nnn~a~r~Y~kq~~~ll~~vf~~qa~F~~~fgg~lQ~lDGVqnN~tafsvKt-nd~p--Vvvg~~Y~td~NvaFGt 77 (296) +.+..+- --+--+++..-+-......+..++. +. .+..++-. -+....+. 
+..| -.+++ +....+ T Consensus 111 ~~t~~~g--g~~iP~~~~~~ii~~~~~~~~l~~~-~~-~~~~~~~~--~~~~~~~~~~~~~~a~~v~E------~~~~~~ 178 (397) T protein:vir:48 111 DASGSDA--GLTIPQDIQTAIHTLVRQYDSLQEY-VN-VENVTTLT--GSRVYEKWADITGLAKLDDE------AGSIGT 178 (397) T ss_pred ccCCccc--cccccHHHHHHHHHHHHHHHHHHhh-hc-eeeccCCc--ceEEEEeecCCCcceeeecc------cccccc Confidence 2221111 0011223322222222333333332 22 12222211 12222221 1111 12222 211111 Q ss_pred ccCCcccccceeEEE-EeccccccccCchhhhccccccccCCHhHHHHHHHhhHHHHHHHHHHHHHhHHHhhhhcchhhh Q lcl|NC_017976. 78 GTSKSSRFGDRQEII-YIDTPVPYTWGYNWHEGIDRYTVNNSMESAMADRVELQAQAKTMLFDKKHAEFIVANAGKTEAL 156 (296) Q Consensus 78 GTg~s~RFG~rkEIi-y~DtdVpY~~~~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~~~gk~ls~~A~~t~~l 156 (296) . ..--|++.+--. -.-.-+|++..+--- + +-|+.+-|.++|. +|-.+..|..+ |.-.-+..... T Consensus 179 ~--~~~~~~~v~~~~~k~~~~~~iS~ell~d------s-~~~l~~~v~~~l~---~~~~~~~d~~i---l~G~g~~~~~~ 243 (397) T protein:vir:48 179 N--DDPKLYPIRYAIKRYAGISTVTNSLLAD------S-AENILAWLSGWIA---KKVVVTRNKAI---LEAIATLPTKP 243 (397) T ss_pred c--cccceeeEEeeheeeeeehhhHHHHHhh------c-hHHHHHHHHHHHH---HHHHHHHHHHH---hhccccccccc Confidence 1 011244322110 011123444333110 0 1134555555543 22222233322 11111111112 Q ss_pred hhcchhHHHHHHHHhhhhhcceeeeeeEEEEECchhhhhhhcccccccccccee---eeccCcceeecCeEEEEchHHHh Q lcl|NC_017976. 157 TAYDEAAVLKLFNNLSAYYINIEAIGTKVAKVGPELYNIIVDHRLTTKEKASGA---NVDSNEIVKFKGFLIEEVPQAKL 233 (296) Q Consensus 157 ~~~~~~~V~klFn~~~~~yvn~ev~~~~~ayV~~evYNaIvD~~l~Ts~K~Ss~---NiD~ngi~~fKgf~l~e~p~~y~ 233 (296) +..+-+++.++..++...|.++ -..+++|..|++|-=..-. 
-|.-+ ++-..+-..+.|+-+..++...+ T Consensus 244 ~~~~~d~i~~~~~~l~~~~~~~-----a~~v~n~~~~~~L~~lkd~---~G~~i~~~~~~~~~~~~l~G~PV~~~~~~~~ 315 (397) T protein:vir:48 244 TLTKWDDIIDLQAKVDPAIKQT-----SFFLTNTSGFTALKKVKNA---FGDYLMERDVKSPTGYSIDGFAVKEVADRWL 315 (397) T ss_pred ccccHHHHHHHHHHhhhhhcCC-----CEEEECHHHHHHHHHhhcC---CCceeeccCcCCCCCceeccceeEEeccccc Confidence 4456667999999999888765 3457899999988643211 12222 22234445778888887777676 Q ss_pred c----CCe-EEEee-c-ce-eeeccceEEEEEeecc---CccceeeeecccccccCCCCCcceEEEEecccCCC Q lcl|NC_017976. 234 G----ANA-ALVYI-K-GV-GKAFTGITTARTIESE---DFDGVAFQGAGKAGEFILDDNKPAVVKVTAPTVTP 296 (296) Q Consensus 234 q----g~~-aifs~-d-nI-g~af~GI~taRtieSE---DFdGVaLQgAgK~G~~IlddNKkAI~k~t~~~~~p 296 (296) . +.. ++|-. . .+ ....-|+.+...-+.+ +.+-+++.+....+--+.+...-++++.+.++..| T Consensus 316 ~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~~~~ 389 (397) T protein:vir:48 316 ANASSGAMPLYFGDLKQAVTLFDRQQMSLLSTNIGGGAFETDTTKIRVIDRFDVVATDTESFVPASFKAIADQK 389 (397) T ss_pred CCcCCCceEEEEEeccceEEEEeecceEEEEeccchhhhhcCceeEEEEeeeccEEecccceEEEEecccccCC Confidence 5 232 44431 2 12 1223344444332222 34568888888888878776544444444443333 No 55 >protein:vir:4856 Length: 293 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049396;genbank:gi:9632424;genbank:GeneID:1258532 Probab=31.66 E-value=1.5 Score=19.65 Aligned_cols=262 Identities=9% Similarity=-0.018 Sum_probs=122.8 Q ss_pred CCCCcccceeeeechhHHHHHHHHHhhhhhhhhhcccceeeecCCcccceeEEEeecc-cc--eEecccccCcccceecc Q lcl|NC_017976. 1 MGTKNQQLAAKTYQKQFKEMLQAVFSHQAYFADFFGGGIEALDGVQNNETAFYVKTSD-LP--VVVGTGYNTDANVGFGT 77 (296) Q Consensus 1 m~t~Nnn~a~r~Y~kq~~~ll~~vf~~qa~F~~~fgg~lQ~lDGVqnN~tafsvKtnd-~p--Vvvg~~Y~td~NvaFGt 77 (296) |++..-.-.--.--+++..-+-..-+..+.+++...- ++..- .+.+.-..|.++ .+ -.++|+ .... 
T Consensus 5 ~~~~t~~~gg~liP~~~~~~Ii~~~~~~~~l~~~~~~-~~~~~---~~g~~~~~~~~~~~~~a~~v~Eg------~~~~- 73 (293) T protein:vir:48 5 KTDHSGSDAGLTIPQDIRTAINTLVRQYDSLQEYVNV-ENVTT---LTGSRVYEKWTDITGLANIDDEA------GKIA- 73 (293) T ss_pred ecccccCcCceEechhHHHHHHHHHHhhhhhhhhcee-eeccC---CcceEEEEeecCCCcceeeecCC------cccc- Confidence 6554444333333566665444444555555554322 32211 111222222221 11 234331 1111 Q ss_pred ccCCc-ccccceeEEE-EeccccccccCchhhhccccccccCCHhHHHHHHHhhHHHHHHHHHHHHHhHHHhhhhcchhh Q lcl|NC_017976. 78 GTSKS-SRFGDRQEII-YIDTPVPYTWGYNWHEGIDRYTVNNSMESAMADRVELQAQAKTMLFDKKHAEFIVANAGKTEA 155 (296) Q Consensus 78 GTg~s-~RFG~rkEIi-y~DtdVpY~~~~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~~~gk~ls~~A~~t~~ 155 (296) ..| --|++.+--. -.-.-+|.+..+.- -. .-|+++-|.+||. ++-.+..|+.+=.-+...+... T Consensus 74 --~~~~~~~~~i~l~~~k~~~~~~iS~ell~------ds-~~~l~~~i~~~la---~~~~~~~~~~i~~g~~~~~~~~-- 139 (293) T protein:vir:48 74 --DIDDPKLSLIKYTIKRYAGISTVTNSLLA------DS-AENILAWLSGWIA---KKVVVTRNKAILGVVDKLPTKP-- 139 (293) T ss_pred --cccccceeEEEEeeeEEEEeehhhHHHHh------hh-hHHHHHHHHHHHH---HHHHHHHHhHHhhccccccccc-- Confidence 011 1233221100 00011222222211 11 1136666777764 3333344544333333334333 Q ss_pred hhhcchhHHHHHHHHhhhhhcceeeeeeEEEEECchhhhhhhcccccccccccee---eeccCcceeecCeEEEEchHHH Q lcl|NC_017976. 
156 LTAYDEAAVLKLFNNLSAYYINIEAIGTKVAKVGPELYNIIVDHRLTTKEKASGA---NVDSNEIVKFKGFLIEEVPQAK 232 (296) Q Consensus 156 l~~~~~~~V~klFn~~~~~yvn~ev~~~~~ayV~~evYNaIvD~~l~Ts~K~Ss~---NiD~ngi~~fKgf~l~e~p~~y 232 (296) +..+-+++.+|++++...|-.+ -+-+++|..|..|--+.- + -|.-+ ++-+.+-.++.|+-+..+++.+ T Consensus 140 -~~~~~d~i~~~~~~l~~~~~~~-----a~~vmn~~~~~~L~~lkd--~-~g~~l~~~~~~~~~~~~l~G~Pv~~~~~~~ 210 (293) T protein:vir:48 140 -TLTKWDDIIDLEAKVDPAIKQT-----SFFLTNTSGFTALKKVKN--A-LGDYLMERDVKSPTGYSIAGFAVKEISDRW 210 (293) T ss_pred -cccCHHHHHHHHHhhhhhhcCC-----CEEEEcHHHHHHHHHhhc--c-CCceEeecCcCCCCCceecceeeEEecccc Confidence 4556677999999998887654 244679999988754331 1 12222 2223444578888887777766 Q ss_pred hc----CCe-EEEee-c-ce-eeeccceEEEEEeeccC---ccceeeeecccccccCCCCCcceEEEEecccCCC Q lcl|NC_017976. 233 LG----ANA-ALVYI-K-GV-GKAFTGITTARTIESED---FDGVAFQGAGKAGEFILDDNKPAVVKVTAPTVTP 296 (296) Q Consensus 233 ~q----g~~-aifs~-d-nI-g~af~GI~taRtieSED---FdGVaLQgAgK~G~~IlddNKkAI~k~t~~~~~p 296 (296) +. |.. ++|.. . .+ ..-.-|+++.+.-+.++ -+-+++.+...+|--+.+.+.-+.+|.+.++.+| T Consensus 211 ~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~~~~~~ 285 (293) T protein:vir:48 211 LPNASSGVMPLYFGDLKQAVTLFDRQQMSLLSTNIGGGAFETDTTKVRVIDRFDVVATDTEAFVPASFKAIADQK 285 (293) T ss_pred cCCccCCceEEEEEeccceEEEEEecceEEEEecccchhhhcCeEEEEEEEeeCcEEecccceEEEEeeccccCC Confidence 65 222 44432 1 22 11123555554434332 3468888888888877776555555655555555 No 56 >protein:vir:108303 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:1412 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552282;genbank:gi:160700607;genbank:GeneID:5758819 Probab=31.55 E-value=1.5 Score=19.63 Aligned_cols=260 Identities=12% Similarity=-0.002 Sum_probs=114.8 Q ss_pred CCCCcccceeeeechhHHHHHHHHHhhhhhhhhhcccceeeecCCcc-cceeEEEeecccceEecccccCcccceecccc Q lcl|NC_017976. 
1 MGTKNQQLAAKTYQKQFKEMLQAVFSHQAYFADFFGGGIEALDGVQN-NETAFYVKTSDLPVVVGTGYNTDANVGFGTGT 79 (296) Q Consensus 1 m~t~Nnn~a~r~Y~kq~~~ll~~vf~~qa~F~~~fgg~lQ~lDGVqn-N~tafsvKtnd~pVvvg~~Y~td~NvaFGtGT 79 (296) |++-+|+.- + -+..+..+-.+|+++.+|.+..-- +-.+-.+. ++ -+.+.. .-++.+.+ |.+ ..+ . T Consensus 1 m~~~~N~~l-t--p~iia~~~l~~l~~~lV~~~lv~r--~y~~e~~~~GD-TV~I~v-p~~~~v~d-g~~--~~~----~ 66 (418) T protein:vir:10 1 MAVQDNNLL-T--DDVIAKEALRLLKNNLVMAKCVYR--NYEKTFGKVGD-TIRLKL-PYRVKSAS-GRT--LVK----Q 66 (418) T ss_pred CCccccccc-c--HHHHHHHHHHHHHHhccchhhhcC--CCchHHhhCCC-EEEEee-CCceeecc-cCC--ccc----c Confidence 999776651 1 123334444556776666543211 00111111 22 234443 22333433 211 110 0 Q ss_pred CCcccccceeEEEEeccccccccCchhhhccccccccCCHhHHHHHHHhhHHHHHHHHHHHHHhHHHhhhhcchhhhhhc Q lcl|NC_017976. 80 SKSSRFGDRQEIIYIDTPVPYTWGYNWHEGIDRYTVNNSMESAMADRVELQAQAKTMLFDKKHAEFIVANAGKTEALTAY 159 (296) Q Consensus 80 g~s~RFG~rkEIiy~DtdVpY~~~~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~~~gk~ls~~A~~t~~l~~~ 159 (296) .- -+.+--+-.|++.-+.+.|. .+|+--.+. .-..++|.-+..|=.+.+|..+..-+...+....+- - T Consensus 67 ~~----te~~v~l~id~~k~~~~~it---D~e~a~~~~---d~~~~~l~~A~~aLA~~vD~~ia~l~~~a~~~~gt~--g 134 (418) T protein:vir:10 67 PM----VDQTIPFKIAYQEHVGLEYT---VKDKTLDIM---QFSERYLKSGMVQIANQIDRSLALTLKKAFHSSGTP--G 134 (418) T ss_pred cc----ccceEEEEEecccccceeec---hHHHhhhhh---HHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccC--C Confidence 00 01111122233322222222 445433333 345677788889999999988887665554433321 1 Q ss_pred chhHHHHHHHHhhhhhcceeee--eeEEEEECchhhhhhhccccccccccceeeeccCcce-eecCeEEEEchHH----- Q lcl|NC_017976. 
160 DEAAVLKLFNNLSAYYINIEAI--GTKVAKVGPELYNIIVDHRLTTKEKASGANVDSNEIV-KFKGFLIEEVPQA----- 231 (296) Q Consensus 160 ~~~~V~klFn~~~~~yvn~ev~--~~~~ayV~~evYNaIvD~~l~Ts~K~Ss~NiD~ngi~-~fKgf~l~e~p~~----- 231 (296) +..+--.-|-.+.++.-+..|- +.+.+-|+|+.|..|.+.....-.+..+-..=.||.+ +.-||.+-+...- T Consensus 135 t~~~~~~~i~~a~~~Ld~~~VP~~G~R~lVv~P~~~~~L~~~~~~~~~~~~~~~~lr~G~IG~i~GF~V~~S~nip~~ta 214 (418) T protein:vir:10 135 VRPGAFIDFANAGAKQTTYAVPQDGMRHAVLDPFTCASLSDEVTKLFKESMVEQAYKMGYRGNVAAYEVYESQNLPKHTV 214 (418) T ss_pred cCcchHHHHHHHHHHHHhcCCCCCCceEEEeCHHHHHHHhhhccccccccccchhhheeeeeeeeceEEEEecCCCcccc Confidence 2222234455666666666774 3478889999999999866543222211111235544 7889988764321 Q ss_pred HhcCC-eEEEeecceeeeccceEEEEEeeccCccceeeeecccccccCCCCC---------------cceEEEEecccCC Q lcl|NC_017976. 232 KLGAN-AALVYIKGVGKAFTGITTARTIESEDFDGVAFQGAGKAGEFILDDN---------------KPAVVKVTAPTVT 295 (296) Q Consensus 232 y~qg~-~aifs~dnIg~af~GI~taRtieSEDFdGVaLQgAgK~G~~IlddN---------------KkAI~k~t~~~~~ 295 (296) ...+. ..+. +-+.....+ +-+.+-+..-|--+.|.++-=.. +.-.+++...+.+ T Consensus 215 g~~~~t~~v~---ga~~~~~~~-------~~~~~t~s~~g~l~~Gd~~ti~gv~~v~~~t~~~~~~~~~f~V~~~~~~~~ 284 (418) T protein:vir:10 215 GDHGGTPLVN---GTVVNGDTV-------GFDGGTASTTGFLKAGDVITFGGVFGVNPQNYETTGLLQEFVVLEDVDTDA 284 (418) T ss_pred cccccceeee---cccccceeE-------EEeecceeeccceeeccEEEECceeecccccccccccceEEEEEeeccccc Confidence 11111 1111 111222222 11233333345556666422111 2223332211111 Q ss_pred ---------C Q lcl|NC_017976. 296 ---------P 296 (296) Q Consensus 296 ---------p 296 (296) | T Consensus 285 ~~~~tv~i~p 294 (418) T protein:vir:10 285 GGAGSIKISP 294 (418) T ss_pred cCcceeEecc Confidence 1 Done!