Query lcl|NC_019916.1_cdsid_YP_007236692.1 [gene=G168_gp06] [protein=major capsid protein] [protein_id=YP_007236692.1] [location=4780..5640] Match_columns 286 No_of_seqs 16 out of 21 Neff 2.6 Searched_HMMs 1612 Date Thu Nov 7 15:55:35 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_6 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_6_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:94528 Length: 286 100.0 1E-187 7E-191 1045.5 25.8 286 1-286 1-286 (286) 2 protein:vir:98871 Length: 314 100.0 3E-182 2E-185 1016.0 25.7 286 1-286 19-314 (314) 3 protein:vir:3969 Length: 287 # 100.0 1E-179 8E-183 1001.5 24.6 279 8-286 1-287 (287) 4 protein:vir:4786 Length: 295 # 100.0 3E-166 2E-169 928.0 18.7 270 1-286 1-277 (295) 5 protein:vir:79712 Length: 285 98.9 1.3E-10 7.8E-14 74.9 16.1 263 8-286 1-284 (285) 6 protein:vir:97331 Length: 319 98.9 2.4E-10 1.5E-13 73.3 15.3 267 1-286 1-295 (319) 7 protein:vir:94800 Length: 319 98.9 2.4E-10 1.5E-13 73.3 15.3 267 1-286 1-295 (319) 8 protein:vir:107120 Length: 329 98.7 2.5E-09 1.6E-12 67.7 14.8 265 1-286 12-306 (329) 9 protein:vir:78090 Length: 302 98.6 4.5E-09 2.8E-12 66.4 15.1 259 1-286 1-300 (302) 10 protein:vir:99523 Length: 311 98.5 2.9E-08 1.8E-11 61.9 15.8 263 1-286 1-311 (311) 11 protein:vir:105464 Length: 346 98.4 3.5E-08 2.2E-11 61.5 15.0 262 8-286 1-298 (346) 12 protein:vir:79008 Length: 299 98.4 3.9E-08 2.4E-11 61.2 15.1 261 1-286 1-297 (299) 13 protein:vir:102605 Length: 273 98.4 8.8E-08 5.5E-11 59.3 16.3 259 8-285 1-273 (273) 14 protein:vir:105822 Length: 273 98.4 8.8E-08 5.5E-11 59.3 16.3 259 8-285 1-273 (273) 15 protein:vir:78920 Length: 290 98.2 3.9E-07 2.4E-10 55.7 16.0 256 8-285 1-290 (290) 16 protein:vir:7990 Length: 273 # 98.2 5.2E-07 3.2E-10 55.1 15.7 258 8-285 1-273 (273) 17 protein:vir:102335 Length: 312 98.1 8.8E-07 5.4E-10 53.8 16.0 263 1-286 1-308 (312) 18 protein:vir:3033 Length: 272 # 97.9 9.2E-06 5.7E-09 48.2 17.9 256 1-286 1-268 (272) 19 protein:vir:9820 Length: 272 # 97.9 9.2E-06 5.7E-09 48.2 17.9 256 1-286 1-268 (272) 20 protein:vir:78739 Length: 332 97.6 7.9E-06 4.9E-09 48.6 13.9 261 1-283 7-332 (332) 21 protein:vir:93742 Length: 274 97.4 7.7E-05 4.8E-08 43.1 16.7 258 1-286 1-269 (274) 22 protein:vir:10450 Length: 344 97.4 2.1E-05 1.3E-08 46.2 13.2 261 1-285 1-344 (344) 23 protein:vir:80213 Length: 334 97.2 0.00014 8.5E-08 41.8 16.3 238 1-286 1-292 (334) 24 protein:vir:80930 Length: 278 97.2 0.00011 6.6E-08 42.4 15.3 262 1-286 1-276 (278) 25 protein:vir:8885 Length: 347 # 97.1 2E-05 1.2E-08 46.4 10.8 260 1-286 1-347 (347) 26 protein:vir:2201 Length: 345 # 96.8 0.00019 1.2E-07 41.0 13.9 264 1-285 1-345 (345) 27 protein:vir:96833 Length: 275 96.8 0.00035 2.1E-07 39.6 15.6 260 1-286 1-272 (275) 28 protein:vir:6324 Length: 335 # 96.8 0.00029 1.8E-07 40.0 14.6 241 1-286 1-288 (335) 29 protein:vir:97433 Length: 274 96.4 0.00061 3.8E-07 38.2 16.6 260 1-286 1-271 (274) 30 protein:vir:94494 Length: 274 96.4 0.00061 3.8E-07 38.2 16.6 260 1-286 1-271 (274) 31 protein:vir:96123 Length: 274 96.2 0.00059 3.6E-07 38.3 13.1 254 1-286 1-269 (274) 32 protein:vir:100057 Length: 375 95.7 0.00095 5.9E-07 37.2 11.9 253 1-286 1-327 (375) 33 protein:vir:95898 Length: 274 95.4 0.0022 1.3E-06 35.2 14.3 257 1-286 1-269 (274) 34 protein:vir:96262 Length: 274 95.4 0.0022 1.3E-06 35.2 14.3 257 1-286 1-269 (274) 35 protein:vir:78935 Length: 335 94.8 0.0034 2.1E-06 34.1 13.8 241 1-286 1-288 (335) 36 protein:vir:80180 Length: 381 94.3 0.0049 3.1E-06 33.2 14.8 262 1-286 11-336 (381) 37 protein:vir:105334 Length: 276 94.0 0.0058 3.6E-06 32.8 13.4 260 1-286 1-271 (276) 38 protein:vir:1541 Length: 347 # 93.6 0.0069 4.3E-06 32.4 15.6 265 1-286 1-346 (347) 39 protein:vir:102655 Length: 322 93.3 0.0082 5.1E-06 32.0 13.8 256 1-286 13-321 (322) 40 protein:vir:94711 Length: 347 93.1 0.0089 5.5E-06 31.8 14.4 261 1-286 1-346 (347) 41 protein:vir:1239 Length: 274 # 92.7 0.01 6.4E-06 31.5 15.9 258 1-286 1-269 (274) 42 protein:vir:94576 Length: 347 92.1 0.013 8E-06 30.9 13.1 256 1-286 1-307 (347) 43 protein:vir:94622 Length: 341 91.3 0.017 1E-05 30.4 11.4 262 1-286 1-339 (341) 44 protein:vir:3364 Length: 347 # 90.5 0.02 1.3E-05 29.8 14.5 264 1-286 1-346 (347) 45 protein:vir:3613 Length: 272 # 86.2 0.047 2.9E-05 27.8 15.3 253 1-286 1-271 (272) 46 protein:vir:99075 Length: 392 79.0 0.11 6.8E-05 25.9 14.6 262 8-286 1-301 (392) 47 protein:vir:105645 Length: 400 77.8 0.12 7.5E-05 25.6 13.5 238 1-286 1-293 (400) 48 protein:vir:99675 Length: 324 77.5 0.12 7.7E-05 25.6 11.8 214 20-286 1-248 (324) 49 protein:vir:97031 Length: 402 75.9 0.14 8.7E-05 25.2 14.7 241 1-286 1-293 (402) 50 protein:vir:95107 Length: 270 74.2 0.16 0.0001 24.9 11.9 254 1-286 1-266 (270) 51 protein:vir:7019 Length: 401 # 73.1 0.17 0.00011 24.7 13.1 235 1-286 1-285 (401) 52 protein:vir:103323 Length: 364 58.3 0.42 0.00026 22.7 15.3 245 1-286 1-299 (364) 53 protein:vir:108303 Length: 418 38.1 1.1 0.00067 20.4 12.2 250 1-286 1-284 (418) 54 protein:vir:99424 Length: 360 35.9 1.2 0.00075 20.1 7.3 263 1-286 1-360 (360) 55 protein:vir:1781 Length: 221 # 30.6 1.6 0.00097 19.5 10.0 164 92-286 1-201 (221) 56 protein:vir:4830 Length: 397 # 25.8 2 0.0012 18.9 14.6 256 1-286 111-384 (397) 57 protein:vir:10324 Length: 320 23.6 2.3 0.0014 18.6 12.7 249 10-286 1-316 (320) No 1 >protein:vir:94528 Length: 286 # NCBI annotation: major head protein # Family: family:all:3269 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223889;genbank:gi:62327101;genbank:GeneID:5075544 Probab=100.00 E-value=1.1e-187 Score=1045.50 Aligned_cols=286 Identities=69% Similarity=1.046 Sum_probs=285.6 Q ss_pred CCCCcccceeeeechhHHHHHHHHHhhhhhhhhhhcccchhcCCcccceeEEEeecccceEeecccCCCcceeecCcCCc Q lcl|NC_019916. 1 MATNNNNLAARTYTKQFAQLMQTVFGAQSVFGPTFGDLQALDGVQNNATAFSVKTNNVPVVVGEYSQDEAVAFGAGTAKS 80 (286) Q Consensus 1 M~t~nnn~a~r~Y~kq~~~ll~~vf~~qa~F~~~fgglQ~lDGVqnN~taf~vKtnd~pVvvg~Y~td~NvaFGtGTg~s 80 (286) |+|.|||||+|.|+|||+||||+||++|++|+|+|||||+|||||||+|||||||||+|||||+|+||||++||+|||+| T Consensus 1 m~t~N~n~avr~Y~Kqf~glL~~vf~~qa~F~~~fgglQalDGV~~N~tafsvKt~D~pVVig~Y~TdeNv~FGtgTg~S 80 (286) T protein:vir:94 1 MATTNNDLPVRVYSKEFLQLLSTVYQAQSVFTPTFGALQALDGVPNNATAFSVKTNDMAVVVGEYSTDANTAFGTGTSNS 80 (286) T ss_pred CCCCccccceeehhHHHHHHHHHHHhhHHHhhhhhcchhhhhCCCccceEEEEeecCcceEEecccCCCccccccCCccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccceeEEEEecccccccccchhhhccccccccCChhHHHHHHHhhHHHHHHHHHHHHHHHHHhhhhhhhhhhhhHHHH Q lcl|NC_019916. 81 TRFGERTEIVYTDTDVPYEFTWAIHEGLDRFTVNNDLNAAVADRLDLQAQAKVRMFNNALGKKLADASTDLGAVDDVNVM 160 (286) Q Consensus 81 ~RFG~rkEIiy~DtdVpY~~~~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~~~gk~ls~~a~~~~t~d~V~kl 160 (286) |||||||||||.||||||+|+|+|||||||+|||||+|+||||||+||||||+|+||+++||+||++|++|+++|+|+|| T Consensus 81 sRFG~rkEi~y~dtdV~Y~~~~~iHEGiD~~TVNnd~~aaVAdRL~lQA~Akt~~~n~~~Gk~ls~~A~~t~~~D~V~~L 160 (286) T protein:vir:94 81 SRFGEMKEVIYADTDVPYTAGWAIHEGLDQMTVNNDLDAAVADRLNLQAQAKTRLFNVAMGEALATAGTDLGAVDDVNAL 160 (286) T ss_pred cccCceeeEEeecccccccccchhhhccccccccCChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHhhhhhceeeeEEEEEEECchhhhhhhccccccccccceeeeccCceeeecCeEEEEchHHhhcCceEEEeecceee Q lcl|NC_019916. 161 FETASAKYTNLEVVVPVRAYVTADVYNAIIDHNLVTSQKGSAVNIDENGIVRFRDIIITKVPEKYMQGKAIMFVPDNIGR 240 (286) Q Consensus 161 F~~~~~~yvn~ev~~~~~ayV~~evYNaIvD~~l~Ts~K~Ss~NiD~ngi~~fKgf~l~e~p~~y~qg~~~ifs~dnIg~ 240 (286) ||++|++|||+||++|++|||+||||||||||+|+||+|+||+|||||||||||||+|+|+|++||+|+++||||||||| T Consensus 161 F~~as~~yvn~ev~~~~~ayV~~evYnaiiD~~l~TsaK~SsaNiDengi~~FkGf~i~e~P~~~~~g~~aifs~dnig~ 240 (286) T protein:vir:94 161 FESAVEKYTDLEVIAPVRAYVTASVYNAIIDLANVTTAKNSAVNIDTNGMLSFRGIAITKVPTQYMGGKAVIFAPDNVAR 240 (286) T ss_pred HHHHHHHhhhhheeeeeEEEEchhHHHHHhccccccccccceeeeccCCcceecceEEeecchhhccCceEEEcccccee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eccceEEEEEeeccCccceeeeecccccccCCCcCcceEEEEecCC Q lcl|NC_019916. 241 AFTGIVTTRTIESEDFDGVALQGAGKAGSFILDDNKAAIFSATPKA 286 (286) Q Consensus 241 af~GI~taRtieSEDFdGVaLQgAgK~G~~IlddNKkAI~k~t~ka 286 (286) +||||+|||+|||||||||+||||||||+||||||||||+|+|||| T Consensus 241 aftGIn~aR~IesEdF~GValQgAGK~G~~I~edNk~Ai~~~~~k~ 286 (286) T protein:vir:94 241 VFTGINIARTIQAIDFAGVELQGAGKYGTFILDDNKKAIFTATPKA 286 (286) T ss_pred eeccceeeeeeeccccCceeeeccccccccccccCceeEEEeecCC Confidence 9999999999999999999999999999999999999999999999 No 2 >protein:vir:98871 Length: 314 # NCBI annotation: major capsid protein # Family: family:all:3269 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164418;genbank:gi:56694908;genbank:GeneID:3197261 Probab=100.00 E-value=2.7e-182 Score=1015.98 Aligned_cols=286 Identities=55% Similarity=0.858 Sum_probs=280.4 Q ss_pred CCCCcccceeeeechhHHHHHHHHHhhhhhhhhhhcc-cchhcCCcccceeEEEeecccceEee-cccCCCcceeecCcC Q lcl|NC_019916. 1 MATNNNNLAARTYTKQFAQLMQTVFGAQSVFGPTFGD-LQALDGVQNNATAFSVKTNNVPVVVG-EYSQDEAVAFGAGTA 78 (286) Q Consensus 1 M~t~nnn~a~r~Y~kq~~~ll~~vf~~qa~F~~~fgg-lQ~lDGVqnN~taf~vKtnd~pVvvg-~Y~td~NvaFGtGTg 78 (286) -+|.|||+|+|.|+|||+||||+||++|++|+|+||| ||+|||||||+|||||||||+||||| +|+||||||||+||| T Consensus 19 ~~t~N~n~avr~Y~Kqf~glL~~vf~~qa~F~~~FGg~lQalDGV~~N~tafsvKtsD~pVVig~~Y~TdeNvaFGtGTg 98 (314) T protein:vir:98 19 SGTANQNKAARSYQKEFRQLLQAVFRSQAYFRDFFGGGIEALDGVQHNDTAFYVKTSDIPVVVGNEYNKDENVGFGEGTS 98 (314) T ss_pred eccccCccceeeecHHHHHHHHHHHhhHhhhhhhcccceeeccCCCccceEEEEeecccceeecCcccCCCCcccccCCc Confidence 7899999999999999999999999999999999999 99999999999999999999999998 899999999999999 Q ss_pred CcccccceeEEEEecccccccccchhhhccccccccCChhHHHHHHHhhHHHHHHHHHHHHHHHHHhhhhhhhhhh---- Q lcl|NC_019916. 79 KSTRFGERTEIVYTDTDVPYEFTWAIHEGLDRFTVNNDLNAAVADRLDLQAQAKVRMFNNALGKKLADASTDLGAV---- 154 (286) Q Consensus 79 ~s~RFG~rkEIiy~DtdVpY~~~~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~~~gk~ls~~a~~~~t~---- 154 (286) +||||||||||||.||||||+|+|+|||||||+|||||+|+||||||+||||||+|+||+++||+||++|++++.+ T Consensus 99 ~SsRFGprkEi~y~dtdVpY~~~~~iHEGiD~~TVNnd~~aaVAdRL~LQA~Akt~~~n~~~Gk~lS~~As~te~ltd~~ 178 (314) T protein:vir:98 99 RSTRFGPRREIIYQDTPVPYTWEWVYHEGIDKHTVNNDFQAAVADRLDLQANAKIKQFNAQHSKFISSIAEKTETLTDYS 178 (314) T ss_pred cccccCceeEEEeecccccccccchhhhccccccccCChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhcc Confidence 9999999999999999999999999999999999999999999999999999999999999999999999887644 Q ss_pred -hhHHHHHHHHhhhhhceeeeEEEEEEECchhhhhhhccccccccccceeeeccCceeeecCeEEEEchHHhhc-CceEE Q lcl|NC_019916. 155 -DDVNVMFETASAKYTNLEVVVPVRAYVTADVYNAIIDHNLVTSQKGSAVNIDENGIVRFRDIIITKVPEKYMQ-GKAIM 232 (286) Q Consensus 155 -d~V~klF~~~~~~yvn~ev~~~~~ayV~~evYNaIvD~~l~Ts~K~Ss~NiD~ngi~~fKgf~l~e~p~~y~q-g~~~i 232 (286) |+|+||||+||++|||+|+++||+|||+||||||||||+|+||+|+||+|||||||||||||+|+|+|++||| |++++ T Consensus 179 ~d~V~~LF~~as~~yvn~ev~~~~~AyV~~evYnaiiD~~l~TsaK~SsaNIDengi~~FkGf~i~e~P~~~~q~g~ia~ 258 (314) T protein:vir:98 179 ADNVLRLFNELSKYYVNIEAIGTKAAKVSPELYNAIVDHPLTTSAKSSSANIDQNGIVNFKGFAIQEIPESMLQSGDVAY 258 (314) T ss_pred hhhHHHHHHHHHhhhhcceeeEEEEEEEchhHHhHhhccccccccccceeeeccCCcceecceEEEecchhhcCCCcEEE Confidence 7899999999999999999999999999999999999999999999999999999999999999999999999 99999 Q ss_pred EeecceeeeccceEEEEEeeccCccceeeeecccccccCCCcCcceEEE--EecCC Q lcl|NC_019916. 233 FVPDNIGRAFTGIVTTRTIESEDFDGVALQGAGKAGSFILDDNKAAIFS--ATPKA 286 (286) Q Consensus 233 fs~dnIg~af~GI~taRtieSEDFdGVaLQgAgK~G~~IlddNKkAI~k--~t~ka 286 (286) |+||||||+||||+|||+||||||||||||||||||+||||||||||+| +||++ T Consensus 259 ~s~dnig~aftGIn~aR~IesEdF~GValQgAGK~G~~I~edNk~Ai~k~t~tp~~ 314 (314) T protein:vir:98 259 TYITNIGKAFTGINTSRIIESEDFDGVALQGAGKAGEFILDDNKKAVAKVTSTPEG 314 (314) T ss_pred EccccceeecccceeeeeeecccccceeeecccccccccccccceeeEEEecCCCC Confidence 9999999999999999999999999999999999999999999999966 67888 No 3 >protein:vir:3969 Length: 287 # NCBI annotation: major capsid protein # Family: family:all:3269 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663677;genbank:gi:21716114;genbank:GeneID:951200 Probab=100.00 E-value=1.2e-179 Score=1001.46 Aligned_cols=279 Identities=39% Similarity=0.659 Sum_probs=275.4 Q ss_pred ceeeeechhHHHHHHHHHhhhhhhhhhhcc-cchhcCCcccceeEEEeecccceEeecccCCCcceeecCcCCcccccce Q lcl|NC_019916. 8 LAARTYTKQFAQLMQTVFGAQSVFGPTFGD-LQALDGVQNNATAFSVKTNNVPVVVGEYSQDEAVAFGAGTAKSTRFGER 86 (286) Q Consensus 8 ~a~r~Y~kq~~~ll~~vf~~qa~F~~~fgg-lQ~lDGVqnN~taf~vKtnd~pVvvg~Y~td~NvaFGtGTg~s~RFG~r 86 (286) ||+|.|+|||+||||+||++||+|+|+||| ||+|||||||+|||||||||+|||||+|+||||||||+|||+||||||| T Consensus 1 ~avr~y~Kq~~glL~~vf~~qa~F~~~FGg~lQ~~DGV~~N~taf~vKtsD~pVVi~~Y~Td~Nv~FGtGTg~ssRFG~r 80 (287) T protein:vir:39 1 MAIKYFTKQYAGMLPDLFAKKSAFLRAFGGVLQVKDGVTENDTFMELKVSDTDVVIQAYSTDANVGFGSGTGNTSRFGQR 80 (287) T ss_pred CCcccccHHHHHHHHHHHHHHHhhhhhcccceeeecCCcccceEEEEEecCcceEEecccCCCCcccccCCCccccccce Confidence 999999999999999999999999999999 9999999999999999999999999999999999999999999999999 Q ss_pred eEEEEecccccccccchhhhccccccccCChhHHHHHHHhhHHHHHHHHHHHHHHHHHhhhhhhhhhh----hhHHHHHH Q lcl|NC_019916. 87 TEIVYTDTDVPYEFTWAIHEGLDRFTVNNDLNAAVADRLDLQAQAKVRMFNNALGKKLADASTDLGAV----DDVNVMFE 162 (286) Q Consensus 87 kEIiy~DtdVpY~~~~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~~~gk~ls~~a~~~~t~----d~V~klF~ 162 (286) |||||.||||||+|||+|||||||+|||||+|+||||||+||||||+|+||+++||+||++|++++++ |+|+|||| T Consensus 81 kEi~y~dt~V~Y~~~~~ihEGiD~~TVNnd~~aaVAdRL~Lqa~A~t~~~n~~~Gk~ls~~A~~t~~~~~t~d~V~~LF~ 160 (287) T protein:vir:39 81 KEVKSVNKQVSYDAPLAINEGIDDFTVNDIKDQVVAERLALHGVAWAQHVDKLLGKLLSDSASETLTVKLDEDSVTKLFS 160 (287) T ss_pred eEEEEecccccceeccccccccccccccCChhHHHHHHHHhHHHHHHHHHHHHHHHHHHhhcchheeeeecccchHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999887 89999999 Q ss_pred HHhhhhhc--eeeeEEEEEEECchhhhhhhccccccccccceeeeccCceeeecCeEEEEchHHhhc-CceEEEeeccee Q lcl|NC_019916. 163 TASAKYTN--LEVVVPVRAYVTADVYNAIIDHNLVTSQKGSAVNIDENGIVRFRDIIITKVPEKYMQ-GKAIMFVPDNIG 239 (286) Q Consensus 163 ~~~~~yvn--~ev~~~~~ayV~~evYNaIvD~~l~Ts~K~Ss~NiD~ngi~~fKgf~l~e~p~~y~q-g~~~ifs~dnIg 239 (286) ++|++||| +|+++||+|||+||+||+||||+|+||+||||+|||||||||||||+|+|+|+++|| |+++|||||||| T Consensus 161 ~a~~~yvNn~v~~~~~~~AyV~aevYnaiiD~~l~TsaK~SsaNiDen~i~kFkGf~l~e~P~~~~q~g~~a~fs~dnig 240 (287) T protein:vir:39 161 DAHKKFVNNNVSIAVPWVAYVNADIYDLLIDSKLATTAKNSSANVDEQTLYKFKGFILSELPDEKFQLNEGAYFAADNVG 240 (287) T ss_pred HHHHHhhccceeeEEEEEEEEChhHHhHHhccccccccccceeeeccCCcceecceEEEecchHhhccCcEEEEccccce Confidence 99999998 555699999999999999999999999999999999999999999999999999999 999999999999 Q ss_pred eeccceEEEEEeeccCccceeeeecccccccCCCcCcceEEEEecCC Q lcl|NC_019916. 240 RAFTGIVTTRTIESEDFDGVALQGAGKAGSFILDDNKAAIFSATPKA 286 (286) Q Consensus 240 ~af~GI~taRtieSEDFdGVaLQgAgK~G~~IlddNKkAI~k~t~ka 286 (286) ||||||+|+|+||||||||||||||||||+||||||||||+|+|++- T Consensus 241 ~af~GI~vaR~i~sEdF~GvalQgAgK~G~~i~e~Nk~Ai~k~t~~k 287 (287) T protein:vir:39 241 VAGVGIQVTRAMDSEDFAGTALQAAAKYGKYLPEKNKKAILKATVTK 287 (287) T ss_pred eecccceeEEeeecccccceeeecccccccccccccceEEEEEecCC Confidence 99999999999999999999999999999999999999999999988 No 4 >protein:vir:4786 Length: 295 # NCBI annotation: hypothetical protein # Family: family:all:3269 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150166;swissprot:trembl:q94m45;genbank:gi:15088777;uniprot:Q94M45;genbank:GeneID:955980 Probab=100.00 E-value=3e-166 Score=928.04 Aligned_cols=270 Identities=46% Similarity=0.767 Sum_probs=260.7 Q ss_pred CCCCcccceeeeechhHHHHHHHHHhhhhhhhhhhcccchhcCCcccceeEEEeecccceEeecccCCCcce-eecCcCC Q lcl|NC_019916. 1 MATNNNNLAARTYTKQFAQLMQTVFGAQSVFGPTFGDLQALDGVQNNATAFSVKTNNVPVVVGEYSQDEAVA-FGAGTAK 79 (286) Q Consensus 1 M~t~nnn~a~r~Y~kq~~~ll~~vf~~qa~F~~~fgglQ~lDGVqnN~taf~vKtnd~pVvvg~Y~td~Nva-FGtGTg~ 79 (286) |++ |||+|+|.|+|||+||||+||++|++|+|+|||||++||||||+|||||||||+|||||+|+||||+| ||+|||+ T Consensus 1 mp~-N~n~avr~Y~Kqf~glL~~vf~~qa~F~~~FGglQalDGV~~N~tafsvKt~D~pVVig~Y~TdeNvagFGtGTg~ 79 (295) T protein:vir:47 1 MPS-NQNNAVRRYEKQYAGILETVFGVRAAFSNALAPIQILDGVQENSKAFSVKTNNTPVVIGEYKTGENDGGFGDNSGA 79 (295) T ss_pred CCC-CCCccchhhhHHHHHHHHHHHhHHHHHhhhhcchhhhhCCCccceEEEEeecCcceEeecccCCCcccccccCCcc Confidence 999 88889999999999999999999999999999999999999999999999999999999999999995 9999999 Q ss_pred cccccceeEEEEecccccccccchhhhccccccccCChhHHHHHHHhhHHHHHHHHHHHHHHHHHhhhhhhhhhh----- Q lcl|NC_019916. 80 STRFGERTEIVYTDTDVPYEFTWAIHEGLDRFTVNNDLNAAVADRLDLQAQAKVRMFNNALGKKLADASTDLGAV----- 154 (286) Q Consensus 80 s~RFG~rkEIiy~DtdVpY~~~~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~~~gk~ls~~a~~~~t~----- 154 (286) ||||||||||||.||||||+|+|+|||||||+|||||+|+||||||+||||||+|+||+++||+||++|++|+++ T Consensus 80 SsRFG~rkEi~y~dtdV~Y~~~~~iHEGiD~~TVNnd~~aaVAdRL~LQA~Akt~~~n~~~Gk~ls~~A~~te~~td~t~ 159 (295) T protein:vir:47 80 QSRFGGVTEVKYENTDVNYDYTLTIHEGLDRYTVNNDLNAAVADRLKLQSEAQTRTVNKRIGKYLSDTATKTEALADFTD 159 (295) T ss_pred ccccCceeeEEeecccccccccchhhhccccccccCChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccc Confidence 999999999999999999999999999999999999999999999999999999999999999999999998877 Q ss_pred hhHHHHHHHHhhhhhceeeeEEEEEEECchhhhhhhccccccccccceeeeccCceeeecCeEEEEchHHhhc-CceEEE Q lcl|NC_019916. 155 DDVNVMFETASAKYTNLEVVVPVRAYVTADVYNAIIDHNLVTSQKGSAVNIDENGIVRFRDIIITKVPEKYMQ-GKAIMF 233 (286) Q Consensus 155 d~V~klF~~~~~~yvn~ev~~~~~ayV~~evYNaIvD~~l~Ts~K~Ss~NiD~ngi~~fKgf~l~e~p~~y~q-g~~~if 233 (286) |+|+||||++|++|||+||++|++|||+||||||||||+|+||+||||+|||||||+|||||+|+|+|++||| |+++|| T Consensus 160 d~V~~LF~~as~~yvn~ev~~~~~AyV~~evYnaiiD~~l~TsaK~SsaNiDengi~~FkGf~i~e~P~~~~q~G~~aif 239 (295) T protein:vir:47 160 DKVKALFNKLSAFYTNNEVTAPITVYLRSEFYNAIVDMASVTSAKGATISLDENGLPKYKGFTLEETPAQYFETGVIAIF 239 (295) T ss_pred hhHHHHHHHHHHHhhhhheeeeeEEEEchhHHHHHhccccccccccceeeeccCCcceecceEEEeccHhhccCCcEEEE Confidence 8999999999999999999999999999999999999999999999999999999999999999999999999 999999 Q ss_pred eecceeeeccceEEEEEeeccCccceeeeecccccccCCCcCcceEEEEecCC Q lcl|NC_019916. 234 VPDNIGRAFTGIVTTRTIESEDFDGVALQGAGKAGSFILDDNKAAIFSATPKA 286 (286) Q Consensus 234 s~dnIg~af~GI~taRtieSEDFdGVaLQgAgK~G~~IlddNKkAI~k~t~ka 286 (286) ||||||||||||+|||+||||||||||||=-- .|.+.- T Consensus 240 s~dnig~aftGIn~aR~IesEdF~GValQ~~~---------------~~~~~~ 277 (295) T protein:vir:47 240 SPNGIIIPFVGISTARVIEAENFDGVNCKLLL---------------RVVLTL 277 (295) T ss_pred ccccceeecccceeeeeeecccccchHHHHHH---------------HHHHHH Confidence 99999999999999999999999999999431 121111 No 5 >protein:vir:79712 Length: 285 # NCBI annotation: major capsid protein gp34 # Family: family:all:701 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285883;genbank:gi:148750840;genbank:GeneID:5220414 Probab=98.95 E-value=1.3e-10 Score=74.85 Aligned_cols=263 Identities=15% Similarity=0.167 Sum_probs=171.5 Q ss_pred ceeeeechhHHHHHHHHHhhhhhhhhhhcccchhcCCcccceeEEEeecccc-e-EeecccCCCcceeecCcCCcccccc Q lcl|NC_019916. 8 LAARTYTKQFAQLMQTVFGAQSVFGPTFGDLQALDGVQNNATAFSVKTNNVP-V-VVGEYSQDEAVAFGAGTAKSTRFGE 85 (286) Q Consensus 8 ~a~r~Y~kq~~~ll~~vf~~qa~F~~~fgglQ~lDGVqnN~taf~vKtnd~p-V-vvg~Y~td~NvaFGtGTg~s~RFG~ 85 (286) ||+- |.++|-.+|...|...+++....-+=-....--+++ =+||+..+. + =++.|+. |.+|-.|+-+ T Consensus 1 Main-~~~k~~~~ld~~~~~~~~~~~l~~~~n~~~~~~~ga--k~VkIp~ist~~gl~dY~R--~~g~~~g~v~------ 69 (285) T protein:vir:79 1 MTVV-LDSKDLARIDEEYKADSQVWSYLTGGNGVTQRFRGH--NEVRINKLSGFVDATAYKR--GQDNARKTIS------ 69 (285) T ss_pred Ccch-hhHHHHHHHHHHHHHhhhhhhhcccCCcceeEecCC--CEEEEeeeccccccccccc--ccCccccccc------ Confidence 6655 677888889999998888766543310000001112 256777665 3 3778866 3455554442 Q ss_pred eeEEEEecccccccccchhhhccccccccCChhHHHHHHHhhHHHHHH-HHHHHH-HHHHHhhhh---hhhhhhhhHHHH Q lcl|NC_019916. 86 RTEIVYTDTDVPYEFTWAIHEGLDRFTVNNDLNAAVADRLDLQAQAKV-RMFNNA-LGKKLADAS---TDLGAVDDVNVM 160 (286) Q Consensus 86 rkEIiy~DtdVpY~~~~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~-~~~n~~-~gk~ls~~a---~~~~t~d~V~kl 160 (286) .-|+.....+|-.|.+. ||.+-|+......+|-=++.+..-++ -.++.. +.|..+.+. +.+.|-+.+.+. T Consensus 70 ---~~~et~tl~~DR~~~f~--iD~mDvdEn~~~~~~ni~~ef~~~~vvPEiDayrfskla~~a~~~~~~~~T~~nv~~~ 144 (285) T protein:vir:79 70 ---VGKETVKLTHEDWFGYD--LDQFDMDENGAYTVENVVREHNKMITIPHRDKVAVQKLFDSAAKKATDSITKDNALDA 144 (285) T ss_pred ---eeeeEEEeeccccceec--ccccchhhhhhhhHHHHHHHHHhhhhcchhhHHHHHHHHhhcccccccccCHHHHHHH Confidence 34555566677777765 89999976443334444444333332 334433 332222221 223466889999 Q ss_pred HHHHhhhhhceeeeEEEEEEECchhhhhhhccccccccccceeeeccCc----eeeecC-eEEEEchHHhhcC----ce- Q lcl|NC_019916. 161 FETASAKYTNLEVVVPVRAYVTADVYNAIIDHNLVTSQKGSAVNIDENG----IVRFRD-IIITKVPEKYMQG----KA- 230 (286) Q Consensus 161 F~~~~~~yvn~ev~~~~~ayV~~evYNaIvD~~l~Ts~K~Ss~NiD~ng----i~~fKg-f~l~e~p~~y~qg----~~- 230 (286) +-.+.++--+.+|-.+.++||+|++|..|-..+.-+...+.+.+.-..| +-.+.| +.|.+||..+|++ +. T Consensus 145 i~~~~~~lde~~vp~~rvl~vTp~~~~~Lk~s~~~~r~~~~~~~~~~~~i~~~V~~lDg~v~ii~Vps~r~kt~~~~k~I 224 (285) T protein:vir:79 145 YDTAEAYMFDNEVPGGFVMFVSSAYYTALKQSAAVTRTFSTDGTMVINGIDRRVAQLDGGVPIVRVSSDRLKGLGITNHV 224 (285) T ss_pred HHHHHHHHHHcCCCCceEEEEChHHHHHHHhhhhhheecccccceeccceeeeeccccceeEEEEcchhhccCcCcchhc Confidence 9999999999999888999999999999998877766554433333334 468888 8999999999984 22 Q ss_pred -EEEeecceeeeccceEEEEEeecc---CccceeeeecccccccCCCcCcceEEEEecCC Q lcl|NC_019916. 231 -IMFVPDNIGRAFTGIVTTRTIESE---DFDGVALQGAGKAGSFILDDNKAAIFSATPKA 286 (286) Q Consensus 231 -~ifs~dnIg~af~GI~taRtieSE---DFdGVaLQgAgK~G~~IlddNKkAI~k~t~ka 286 (286) .|..|...-++.+=.+..|.++.+ +=||=..|+--=++-||+|.=|++|.--+.+| T Consensus 225 nfiiv~~~a~i~~~K~~~~~~f~P~~~~~~d~~~~~~R~Y~d~fv~~nk~~~Iy~~~~a~ 284 (285) T protein:vir:79 225 NFILTPLSAIAPIVKYDSVSVIDPSTDRSGNRWTIKGLSYYDAIVLDNAKKGIYVAATAG 284 (285) T ss_pred cEEEecCceeccceeeeeeEeECCCCCCCcceeeeeeeeeeeeeehhhccceeeeeeccc Confidence 255566677888888899999877 67788888888889999966566665444444 No 6 >protein:vir:97331 Length: 319 # NCBI annotation: ORF011 # Family: family:all:701 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240611;genbank:gi:66396278;genbank:GeneID:5133687 Probab=98.88 E-value=2.4e-10 Score=73.34 Aligned_cols=267 Identities=14% Similarity=0.129 Sum_probs=165.9 Q ss_pred CC------------------CCcccceeeeechhHHHHHHHHHhhhhhhhhhhcccchhcCCcccceeEEEeecccceE- Q lcl|NC_019916. 1 MA------------------TNNNNLAARTYTKQFAQLMQTVFGAQSVFGPTFGDLQALDGVQNNATAFSVKTNNVPVV- 61 (286) Q Consensus 1 M~------------------t~nnn~a~r~Y~kq~~~ll~~vf~~qa~F~~~fgglQ~lDGVqnN~taf~vKtnd~pVv- 61 (286) |- -.|-.-=--.++++|..+|..++...++-.+....-+ ..|. + +=+||++.+.++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~nt~~l~~k~~~~LD~~~~~~~~s~~~~~N~~-~e~~--g--g~tVkIp~i~~~g 75 (319) T protein:vir:97 1 MNKTIKNATGMLKLNLQHFANKSVEPGQTLLKNKHVGILERVTAVNAYSTPALISND-AIFM--E--GRSFTVMKGDTTE 75 (319) T ss_pred CCcccccccceeEeehhhhhccCCCcchHHHHHHHHHHHHHHHHHhhhhhhcccCcc-eEec--c--CcEEEEeeecccc Confidence 11 1000000124788999999999988887665544411 2332 2 225777777652 Q ss_pred eecccCCCcceeecCcCCcccccceeEEEEecccccccccchhh-hccccccccCChhH--HHHHHHhhHHHHHHHHHHH Q lcl|NC_019916. 62 VGEYSQDEAVAFGAGTAKSTRFGERTEIVYTDTDVPYEFTWAIH-EGLDRFTVNNDLNA--AVADRLDLQAQAKVRMFNN 138 (286) Q Consensus 62 vg~Y~td~NvaFGtGTg~s~RFG~rkEIiy~DtdVpY~~~~aiH-EGiDr~TVNndl~a--avAdRl~Lqa~Ak~~~~n~ 138 (286) ++.|+.+ .+|..|+-+. -|+.-..-++-.|+|- ..+|.-..|.++.+ ++++....| .+..+|. T Consensus 76 l~DY~R~--~g~~~g~vt~---------~~~t~tidqdR~~~F~VD~~D~~Etn~~l~a~~i~~~~~~~~---v~PEiDa 141 (319) T protein:vir:97 76 LKDYKRN--ATNEFDHPKI---------EETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEV---VAPYLDN 141 (319) T ss_pred cccccCC--CCcccCCccc---------ceeEEEeecccccccccchhhHhhhhchhhHHHHHHHHHHHH---hhhhhhH Confidence 3467654 3444443221 2223333445555543 44555555655543 234333333 3445676 Q ss_pred HHHHHHhhhhhhh----hhhhhHHHHHHHHhhhhhceeeeEEEEEEECchhhhhhhccccccccccceeeeccCce-eee Q lcl|NC_019916. 139 ALGKKLADASTDL----GAVDDVNVMFETASAKYTNLEVVVPVRAYVTADVYNAIIDHNLVTSQKGSAVNIDENGI-VRF 213 (286) Q Consensus 139 ~~gk~ls~~a~~~----~t~d~V~klF~~~~~~yvn~ev~~~~~ayV~~evYNaIvD~~l~Ts~K~Ss~NiD~ngi-~~f 213 (286) .....|+..+... .|.+.+...+-.+.++.-+.+|-...++||+|++|.+|-..+.-+...+-.-.+--||. -++ T Consensus 142 y~~skla~~a~~~~~~~~t~~n~y~~i~~a~~~Lde~~VP~~Rvl~Vtp~~~~~L~~~~~f~~~~~~~~~~~~~g~Vg~i 221 (319) T protein:vir:97 142 LRFATLARNKAKHLTVGTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGVQGEL 221 (319) T ss_pred HHHHHHHhhcccccccccCHHHHHHHHHHHHHHHHhcCCCCCcEEEeCHHHHHHHHhhhhhhccccccccceeeeeceee Confidence 6555565544332 24456667777777777777776668899999999999887766654432222223443 468 Q ss_pred cCeEEEEchHHhhcCceEEEeecceeeeccceEEEEEee-ccCccceeeeecccccccCCCcCcceEEEEecCC Q lcl|NC_019916. 214 RDIIITKVPEKYMQGKAIMFVPDNIGRAFTGIVTTRTIE-SEDFDGVALQGAGKAGSFILDDNKAAIFSATPKA 286 (286) Q Consensus 214 Kgf~l~e~p~~y~qg~~~ifs~dnIg~af~GI~taRtie-SEDFdGVaLQgAgK~G~~IlddNKkAI~k~t~ka 286 (286) .||.|-++|...|.+-..|..+.+--.+.+=+...|... +++=.|=+.||--=||-||++.-+++|+.....+ T Consensus 222 dG~~Vi~vps~~~k~in~i~~h~~A~~~~~k~~~~~~~~p~~~~~a~~v~gr~y~d~~V~~~k~~~Iy~~~~~~ 295 (319) T protein:vir:97 222 DGFVIVKVPTKLLQGLQAIAVVGEVLASPIQADLAKTNSNIPGMFGTLAEQLLYTGAFVPEHLQKYIFTIGGTE 295 (319) T ss_pred cCeEEEEecccccccceEEEEcCCeeeeeeeeeeeeccCCCccccceeeeeeeeeeeEEeccccceEEEeecCC Confidence 999999999999998777777666666667777788765 6776689999999999999999999999754444 No 7 >protein:vir:94800 Length: 319 # NCBI annotation: ORF012 # Family: family:all:701 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240536;genbank:gi:66396203;genbank:GeneID:5133580 Probab=98.88 E-value=2.4e-10 Score=73.34 Aligned_cols=267 Identities=14% Similarity=0.129 Sum_probs=165.9 Q ss_pred CC------------------CCcccceeeeechhHHHHHHHHHhhhhhhhhhhcccchhcCCcccceeEEEeecccceE- Q lcl|NC_019916. 1 MA------------------TNNNNLAARTYTKQFAQLMQTVFGAQSVFGPTFGDLQALDGVQNNATAFSVKTNNVPVV- 61 (286) Q Consensus 1 M~------------------t~nnn~a~r~Y~kq~~~ll~~vf~~qa~F~~~fgglQ~lDGVqnN~taf~vKtnd~pVv- 61 (286) |- -.|-.-=--.++++|..+|..++...++-.+....-+ ..|. + +=+||++.+.++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~nt~~l~~k~~~~LD~~~~~~~~s~~~~~N~~-~e~~--g--g~tVkIp~i~~~g 75 (319) T protein:vir:94 1 MNKTIKNATGMLKLNLQHFANKSVEPGQTLLKNKHVGILERVTAVNAYSTPALISND-AIFM--E--GRSFTVMKGDTTE 75 (319) T ss_pred CCcccccccceeEeehhhhhccCCCcchHHHHHHHHHHHHHHHHHhhhhhhcccCcc-eEec--c--CcEEEEeeecccc Confidence 11 1000000124788999999999988887665544411 2332 2 225777777652 Q ss_pred eecccCCCcceeecCcCCcccccceeEEEEecccccccccchhh-hccccccccCChhH--HHHHHHhhHHHHHHHHHHH Q lcl|NC_019916. 62 VGEYSQDEAVAFGAGTAKSTRFGERTEIVYTDTDVPYEFTWAIH-EGLDRFTVNNDLNA--AVADRLDLQAQAKVRMFNN 138 (286) Q Consensus 62 vg~Y~td~NvaFGtGTg~s~RFG~rkEIiy~DtdVpY~~~~aiH-EGiDr~TVNndl~a--avAdRl~Lqa~Ak~~~~n~ 138 (286) ++.|+.+ .+|..|+-+. -|+.-..-++-.|+|- ..+|.-..|.++.+ ++++....| .+..+|. T Consensus 76 l~DY~R~--~g~~~g~vt~---------~~~t~tidqdR~~~F~VD~~D~~Etn~~l~a~~i~~~~~~~~---v~PEiDa 141 (319) T protein:vir:94 76 LKDYKRN--ATNEFDHPKI---------EETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEV---VAPYLDN 141 (319) T ss_pred cccccCC--CCcccCCccc---------ceeEEEeecccccccccchhhHhhhhchhhHHHHHHHHHHHH---hhhhhhH Confidence 3467654 3444443221 2223333445555543 44555555655543 234333333 3445676 Q ss_pred HHHHHHhhhhhhh----hhhhhHHHHHHHHhhhhhceeeeEEEEEEECchhhhhhhccccccccccceeeeccCce-eee Q lcl|NC_019916. 139 ALGKKLADASTDL----GAVDDVNVMFETASAKYTNLEVVVPVRAYVTADVYNAIIDHNLVTSQKGSAVNIDENGI-VRF 213 (286) Q Consensus 139 ~~gk~ls~~a~~~----~t~d~V~klF~~~~~~yvn~ev~~~~~ayV~~evYNaIvD~~l~Ts~K~Ss~NiD~ngi-~~f 213 (286) .....|+..+... .|.+.+...+-.+.++.-+.+|-...++||+|++|.+|-..+.-+...+-.-.+--||. -++ T Consensus 142 y~~skla~~a~~~~~~~~t~~n~y~~i~~a~~~Lde~~VP~~Rvl~Vtp~~~~~L~~~~~f~~~~~~~~~~~~~g~Vg~i 221 (319) T protein:vir:94 142 LRFATLARNKAKHLTVGTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGVQGEL 221 (319) T ss_pred HHHHHHHhhcccccccccCHHHHHHHHHHHHHHHHhcCCCCCcEEEeCHHHHHHHHhhhhhhccccccccceeeeeceee Confidence 6555565544332 24456667777777777777776668899999999999887766654432222223443 468 Q ss_pred cCeEEEEchHHhhcCceEEEeecceeeeccceEEEEEee-ccCccceeeeecccccccCCCcCcceEEEEecCC Q lcl|NC_019916. 214 RDIIITKVPEKYMQGKAIMFVPDNIGRAFTGIVTTRTIE-SEDFDGVALQGAGKAGSFILDDNKAAIFSATPKA 286 (286) Q Consensus 214 Kgf~l~e~p~~y~qg~~~ifs~dnIg~af~GI~taRtie-SEDFdGVaLQgAgK~G~~IlddNKkAI~k~t~ka 286 (286) .||.|-++|...|.+-..|..+.+--.+.+=+...|... +++=.|=+.||--=||-||++.-+++|+.....+ T Consensus 222 dG~~Vi~vps~~~k~in~i~~h~~A~~~~~k~~~~~~~~p~~~~~a~~v~gr~y~d~~V~~~k~~~Iy~~~~~~ 295 (319) T protein:vir:94 222 DGFVIVKVPTKLLQGLQAIAVVGEVLASPIQADLAKTNSNIPGMFGTLAEQLLYTGAFVPEHLQKYIFTIGGTE 295 (319) T ss_pred cCeEEEEecccccccceEEEEcCCeeeeeeeeeeeeccCCCccccceeeeeeeeeeeEEeccccceEEEeecCC Confidence 999999999999998777777666666667777788765 6776689999999999999999999999754444 No 8 >protein:vir:107120 Length: 329 # NCBI annotation: conserved phage protein # Family: family:all:701 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950606;genbank:gi:119953686;genbank:GeneID:4643129 Probab=98.67 E-value=2.5e-09 Score=67.73 Aligned_cols=265 Identities=16% Similarity=0.168 Sum_probs=159.9 Q ss_pred CCCC-----------cccce-------eeeechhHHHHHHHHHhhhhhhhhhhcc--cchhcCCcccceeEEEeecccce Q lcl|NC_019916. 1 MATN-----------NNNLA-------ARTYTKQFAQLMQTVFGAQSVFGPTFGD--LQALDGVQNNATAFSVKTNNVPV 60 (286) Q Consensus 1 M~t~-----------nnn~a-------~r~Y~kq~~~ll~~vf~~qa~F~~~fgg--lQ~lDGVqnN~taf~vKtnd~pV 60 (286) |--+ =+..| .=.|.++|..+|..+|..+++=.+.... .+ +. + +=+||++.+.+ T Consensus 12 ~~~~~~~~~~~~~~~~~~~~~~~~~~nt~~l~~k~~~~LD~~~~~~~~s~~~~~N~~~e---~~--~--g~tVkIp~i~~ 84 (329) T protein:vir:10 12 MNKEIKNATGKLKLNLQHFANKSVEPGDTLLKNKHVGILEKVTAANSYSAPAVISNDAI---FM--Q--GRSFTVIKGDV 84 (329) T ss_pred hhhhhhcccceeEEehhhhcCCccCCchhHHHHHHHHHHHHHHHhhceeeeeeccccee---ec--c--CcEEEEeeecc Confidence 1100 01111 1147899999999999877654333222 22 11 2 22677776655 Q ss_pred E-eecccCCCcceeecCcCCcccccceeEEEEecccccccccchhh-hccccccccCChhH--HHHHHHhhHHHHHHHHH Q lcl|NC_019916. 61 V-VGEYSQDEAVAFGAGTAKSTRFGERTEIVYTDTDVPYEFTWAIH-EGLDRFTVNNDLNA--AVADRLDLQAQAKVRMF 136 (286) Q Consensus 61 v-vg~Y~td~NvaFGtGTg~s~RFG~rkEIiy~DtdVpY~~~~aiH-EGiDr~TVNndl~a--avAdRl~Lqa~Ak~~~~ 136 (286) + ++.|+.+. +|..|+-+. -|+.-..-++-.|+|- ..+|.-..|.++.+ ++++. |....+..+ T Consensus 85 ~gl~DY~R~~--g~~~g~vt~---------~~~t~tidqdR~~~F~VD~~D~dEtn~~l~a~~i~~~~---~~~~v~pEi 150 (329) T protein:vir:10 85 TELKDYKRNA--TNEFDHPQI---------QETTYFLDQEKYWGRFVDALDRRDTEGNIDINYVVAKQ---ASEVVAPYL 150 (329) T ss_pred cccccccCCC--Ccccccccc---------ceeEEEeecccceeeecchhhHhhhhhhhhHHHHHHHH---HHHHhhhHH Confidence 2 34676543 343333221 2233333445555543 34555555555532 23333 333344566 Q ss_pred HHHHHHHHhhhhhhh----hhhhhHHHHHHHHhhhhhceeeeEEEEEEECchhhhhhhccccccccccceeeeccCcee- Q lcl|NC_019916. 137 NNALGKKLADASTDL----GAVDDVNVMFETASAKYTNLEVVVPVRAYVTADVYNAIIDHNLVTSQKGSAVNIDENGIV- 211 (286) Q Consensus 137 n~~~gk~ls~~a~~~----~t~d~V~klF~~~~~~yvn~ev~~~~~ayV~~evYNaIvD~~l~Ts~K~Ss~NiD~ngi~- 211 (286) +...-..|+..+... .|.+.+...+-.+..+--..+|.....+||+|++|.+|-+.+..+....-..++--||.+ T Consensus 151 Day~~skla~~a~~~~~~~~t~~nay~~i~~a~~~Lde~~vp~~Rvl~VtP~~~~~Lk~~~~f~~~~~~~~~~~~~g~Vg 230 (329) T protein:vir:10 151 DNLRFATLARNKAKHLTVGSGADAQYDAVLDVSVELDEIGAGASRILFVTPKFYKGIKKFVIELPQGDNRQQVLGKGVQG 230 (329) T ss_pred HHHHHHHHHhhcccccccccCHHHHHHHHHHHHHHHHhcCCCCCcEEEeCHHHHHHHHhhhhhhccccccccceeeeeee Confidence 665555554443221 233455566666666655566666788999999999999887765432222222234544 Q ss_pred eecCeEEEEchHHhhcCceEEEeecceeeeccceEEEEEee-ccCccceeeeecccccccCCCcCcceEEEEecCC Q lcl|NC_019916. 212 RFRDIIITKVPEKYMQGKAIMFVPDNIGRAFTGIVTTRTIE-SEDFDGVALQGAGKAGSFILDDNKAAIFSATPKA 286 (286) Q Consensus 212 ~fKgf~l~e~p~~y~qg~~~ifs~dnIg~af~GI~taRtie-SEDFdGVaLQgAgK~G~~IlddNKkAI~k~t~ka 286 (286) ++.||.|-++|...|.+-..|..+.+--.+.+=++..|... +++=+|=+.||--=||-||++.-+++|+....+| T Consensus 231 ~idG~~Ii~vps~~~k~in~ii~~~~A~~~~~K~~~~~~~~p~~~~~a~~v~gr~yyd~~V~~~k~~~I~~~~~~a 306 (329) T protein:vir:10 231 ELDGFTIVKVPSKMLQGVEAMAVIGEVMASPIQANEAKLNSNVPGMFGTLAEQMLYTGAFVPEHLQKYIFTIGGKE 306 (329) T ss_pred eecCeEEEEecCCcccceeEEEEcCCceeeeeeeeeeeeeCCCCccchheeeeeeeeeeEEEccccCEEEEecccC Confidence 68999999999999997666666555555666677777765 6777899999999999999999999998876666 No 9 >protein:vir:78090 Length: 302 # NCBI annotation: Cps # Family: family:all:701 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468790;genbank:gi:157325371;genbank:GeneID:5601852 Probab=98.63 E-value=4.5e-09 Score=66.36 Aligned_cols=259 Identities=14% Similarity=0.223 Sum_probs=167.6 Q ss_pred CCCCcccceeeeechhHHHHHHHHHhhhhhhhhhhcccchhcCCcccceeEEEeecccce------EeecccCCCcceee Q lcl|NC_019916. 1 MATNNNNLAARTYTKQFAQLMQTVFGAQSVFGPTFGDLQALDGVQNNATAFSVKTNNVPV------VVGEYSQDEAVAFG 74 (286) Q Consensus 1 M~t~nnn~a~r~Y~kq~~~ll~~vf~~qa~F~~~fgglQ~lDGVqnN~taf~vKtnd~pV------vvg~Y~td~NvaFG 74 (286) ||. + + -|.++|.+.|..+|...+.+...-+.-+..- =++++ .||+..+-| =.+.|+-+ .+|- T Consensus 1 Man-t----l-~ya~~~~~~Ld~~~~~~~~t~~l~~~~~~v~--~~Gak--~vkIp~is~~~~~TsGl~dy~R~--~g~~ 68 (302) T protein:vir:78 1 MAN-S----L-ALAQIYQDNIDKAIAVNSKSAFLEANPNNVQ--YNGGN--TIKIADISFGSGTTGDLKAYNRS--TGFT 68 (302) T ss_pred CCc-h----h-HHHHHHHHHHHHHHHhhhceeecccCCceEE--EecCc--EEEEEEEEeeccccccccccccc--cCcc Confidence 771 1 2 4789999999999998887665444311111 12222 456655554 24468774 3444 Q ss_pred cCcCCcccccceeEEEEecccccccccchhhhccccccccC-ChhHHHHHHHhhHHHH-HHHHHHHH-HHHHHhhhh--- Q lcl|NC_019916. 75 AGTAKSTRFGERTEIVYTDTDVPYEFTWAIHEGLDRFTVNN-DLNAAVADRLDLQAQA-KVRMFNNA-LGKKLADAS--- 148 (286) Q Consensus 75 tGTg~s~RFG~rkEIiy~DtdVpY~~~~aiHEGiDr~TVNn-dl~aavAdRl~Lqa~A-k~~~~n~~-~gk~ls~~a--- 148 (286) .|+= +.-|+.....+|-.|.|. ||++-|+. .....+|.=++.|..- .+-.+|.. +.|-.+.+. T Consensus 69 ~g~v---------~~~~et~tlt~DR~~~f~--vD~mDvdETn~~~~~ani~~ef~r~~vvPEiDayrfskla~~a~~~~ 137 (302) T protein:vir:78 69 QGSV---------TLAWSDYTLDYDLAQSFQ--IDAMDVDETKNLATVGNVLSEYQRTKIVPAIDKYRFTKLANDGTGVG 137 (302) T ss_pred ccce---------eeeeeeEEeeeccceeee--ccccchhhhhhhhHHHHHHHHHHHhhhcchhhHHHHHHHHHhhhccC Confidence 4332 356666677788888875 89988877 3334356555554333 23344444 222222221 Q ss_pred ------hhhhhhhhHHHHHHHHhhhhhceeeeEEEEEEECchhhhhhhccccccccccce----eeeccCceeeecCeEE Q lcl|NC_019916. 149 ------TDLGAVDDVNVMFETASAKYTNLEVVVPVRAYVTADVYNAIIDHNLVTSQKGSA----VNIDENGIVRFRDIII 218 (286) Q Consensus 149 ------~~~~t~d~V~klF~~~~~~yvn~ev~~~~~ayV~~evYNaIvD~~l~Ts~K~Ss----~NiD~ngi~~fKgf~l 218 (286) ....|.+.|.+-|-.+.++..+.+ +.++||+|.+|.+|-+....+...+.. -.||.+ +-.+.|+.| T Consensus 138 ~~~~~~~~~~t~~nvl~~i~~~~~~~~e~~---~~vl~vtp~~~~~Lk~a~~~~~~~~~~~~~~~~i~~~-V~~lDgv~I 213 (302) T protein:vir:78 138 GVIDLSKPDASAQALMGDIATAMELVDDSN---QLILVTSPTTLAGLLNTALIRESKNTQVLRRGEVDTK-ITFIQDVEV 213 (302) T ss_pred ccccccccchhHHHHHHHHHHHHHHhhccC---CeEEEEChHHHHHHhcchhhccceeccccccccccce-eeeecccEE Confidence 112355778877888888877764 999999999999999877665443322 125444 888999999 Q ss_pred EEchHHhhcCc----------------eEEEeecceeeeccceEEEEEeeccC---ccceeeeecccccccCCCcCcceE Q lcl|NC_019916. 219 TKVPEKYMQGK----------------AIMFVPDNIGRAFTGIVTTRTIESED---FDGVALQGAGKAGSFILDDNKAAI 279 (286) Q Consensus 219 ~e~p~~y~qg~----------------~~ifs~dnIg~af~GI~taRtieSED---FdGVaLQgAgK~G~~IlddNKkAI 279 (286) .+||+.+|+++ ..|..|...-++.+=.+..|..+.+- =||=..|+--=++-||+|.=|++| T Consensus 214 i~VPs~r~~t~~~f~~G~~~~~~ak~INfiiv~~~a~ia~~K~~~~~if~P~~~~~gd~~l~~~R~Y~D~fV~~nk~~gI 293 (302) T protein:vir:78 214 LQVPSEYLYDKVAPKVGVPDYTGAKKIPYMIFKRDAPTGIVKTDKVRVFEPDTNQSADAYKVDLRLYHDLIVPKNQRPGI 293 (302) T ss_pred EEchhhhcccceeccCCccccCCccceeEEEECCCeeeeeeeeeeeEeeCCCCCCCcceeeeeeeeEeeeeeeccccCeE Confidence 99999999863 34666777788888888888886532 223466776677889998888888 Q ss_pred EEEecCC Q lcl|NC_019916. 280 FSATPKA 286 (286) Q Consensus 280 ~k~t~ka 286 (286) .--+.+| T Consensus 294 ~~~~~~~ 300 (302) T protein:vir:78 294 IKASFGT 300 (302) T ss_pred EEeeccc Confidence 8666666 No 10 >protein:vir:99523 Length: 311 # NCBI annotation: putative protein # Family: family:all:701 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958538;genbank:gi:41179320;genbank:GeneID:2717161 Probab=98.49 E-value=2.9e-08 Score=61.92 Aligned_cols=263 Identities=16% Similarity=0.154 Sum_probs=158.0 Q ss_pred CCCCcccceeeeechhHHHHHHHHHhhhhhhhhhhcccchhcCCc-ccceeEEEeecccceEee--cccCCCcceeecCc Q lcl|NC_019916. 1 MATNNNNLAARTYTKQFAQLMQTVFGAQSVFGPTFGDLQALDGVQ-NNATAFSVKTNNVPVVVG--EYSQDEAVAFGAGT 77 (286) Q Consensus 1 M~t~nnn~a~r~Y~kq~~~ll~~vf~~qa~F~~~fgglQ~lDGVq-nN~taf~vKtnd~pVvvg--~Y~td~NvaFGtGT 77 (286) |+|.-|+||+. |.++|.+.|..+|...+.. |.|..-++.. ++++ +||+..+.+ .| .|+-+ .+|-.|+ T Consensus 1 ~~~~an~mAln-ya~~~~~~Ld~~~~~~~~t----~~l~~~~~~~~~Gak--~VkIp~i~~-~gl~dY~R~--~g~~~g~ 70 (311) T protein:vir:99 1 MPTDAETRGFN-YVTKDGNLLDQKITAGLFT----AALGTPEVDLVNGGR--SFTLKTIST-SGLKDHTRG--KGFNSGT 70 (311) T ss_pred CCCcchhhHHH-HHHHHHHHHHHHHHhhhcc----cceecCchheeecCC--EEEEEeeee-ccccccccc--cCccccc Confidence 99999999954 8999999999999887632 2243333222 2244 677777764 44 58764 3333222 Q ss_pred CCcccccceeEEEEecccccccccchhhhccccccccC-ChhHHHHHHHhhHHHHH-HHHHHHHHHHHHhhhh------- Q lcl|NC_019916. 78 AKSTRFGERTEIVYTDTDVPYEFTWAIHEGLDRFTVNN-DLNAAVADRLDLQAQAK-VRMFNNALGKKLADAS------- 148 (286) Q Consensus 78 g~s~RFG~rkEIiy~DtdVpY~~~~aiHEGiDr~TVNn-dl~aavAdRl~Lqa~Ak-~~~~n~~~gk~ls~~a------- 148 (286) - ..-|+.....+|-.|.|. ||++-|+. .....+|.=++.|...+ +-.+++..=..|+..+ T Consensus 71 v---------~~~~et~tl~~DR~~~f~--vD~mDvdETn~~~~~ani~~~f~r~~vvPEiDayrfskla~~a~~~~~~~ 139 (311) T protein:vir:99 71 I---------SDEKTIYTMGQDRDVEFY--LDRQDVDETDNELAMANISNVFITEHVQPELDSYRFSKIATSFDNLDGTD 139 (311) T ss_pred e---------eeeeeEEEeeeccceeee--cchhchhhhhhhhHHHHHHHHHHHhhhcchhhHHHHHHHHhhhhcccccc Confidence 1 345666667778888775 88888875 23333343333332222 2233332222222111 Q ss_pred ------------hhhhhhhhHHHHHHHHhhhhhceeeeEEEEEEECchhhhhhhcccccccccc----ceeeeccCceee Q lcl|NC_019916. 149 ------------TDLGAVDDVNVMFETASAKYTNLEVVVPVRAYVTADVYNAIIDHNLVTSQKG----SAVNIDENGIVR 212 (286) Q Consensus 149 ------------~~~~t~d~V~klF~~~~~~yvn~ev~~~~~ayV~~evYNaIvD~~l~Ts~K~----Ss~NiD~ngi~~ 212 (286) +.+.+.++|...+-.+-.+--.. ...+.++||+|++|.+|=+.+.-+...+ +.-.||.. +-. T Consensus 140 ~~~~~~~~~~~~~~~lt~~nvl~~l~~~~~~~~~v-~~~~rvl~vTp~~~~lLk~~~~~~r~~~~~~~~~~~i~~~-V~~ 217 (311) T protein:vir:99 140 TEGTLLAKTHKTEETLDETNAYSQLKTGIGKVRKY-GTQNLVGYVSSEVMDALERSKEFTRNITNQNVGTTALESR-ITS 217 (311) T ss_pred cchhhhccccccccccCHHHHHHHHHHHHHHHHhc-CCCCeEEEEChHHHHHHhhchhhheeeecccccccccccc-cce Confidence 11122333433222222222111 2357899999999998766543332111 22235554 789 Q ss_pred ecCeEEEEc-hHHhhc-------Cc---------eEEEeecceeeeccceEEEEEee---ccCccceeeeecccccccCC Q lcl|NC_019916. 213 FRDIIITKV-PEKYMQ-------GK---------AIMFVPDNIGRAFTGIVTTRTIE---SEDFDGVALQGAGKAGSFIL 272 (286) Q Consensus 213 fKgf~l~e~-p~~y~q-------g~---------~~ifs~dnIg~af~GI~taRtie---SEDFdGVaLQgAgK~G~~Il 272 (286) +.|+.|.|+ |+..|+ |. +.|..|...-++.+=.+..|.++ ..+=||=..|+--=+.-||+ T Consensus 218 lDgv~Ii~V~ps~r~~t~~~ft~G~~~~~~ak~INfiiv~~~a~i~~~K~~~v~~f~P~~~~~gd~~l~~~R~Y~D~fv~ 297 (311) T protein:vir:99 218 IDGVQLIEVYESNRFMTKYDFTDGAKPTEDAKAINFLVVAKPAVISIVKENAVFLFAPGQHTDGDGYLYQNRLYHDLFIK 297 (311) T ss_pred ecCeEEEEecCchhhcchhhhcCCccccCcccccceEEeCCCeeeeeeeeeeeeeeCCCCCCCcceeeeeeeeeeeeeee Confidence 999999999 999986 32 23555666777777777888885 44456888888888899999 Q ss_pred CcCcceEEEEecCC Q lcl|NC_019916. 273 DDNKAAIFSATPKA 286 (286) Q Consensus 273 ddNKkAI~k~t~ka 286 (286) |.=+++|.-....| T Consensus 298 ~nk~~~Iyv~~k~A 311 (311) T protein:vir:99 298 KHKRDGIFVSVKKA 311 (311) T ss_pred ccccCeEEEeeecC Confidence 87778886666677 No 11 >protein:vir:105464 Length: 346 # NCBI annotation: putative phage major capsid protein # Family: family:all:701 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529874;genbank:gi:90592614;genbank:GeneID:3974528 Probab=98.43 E-value=3.5e-08 Score=61.45 Aligned_cols=262 Identities=11% Similarity=0.143 Sum_probs=153.9 Q ss_pred ceeeeechhHHHHHHHHHhhhhhhhhhhcc-cchhcCCcccceeEEEeecccceEe--ecccCCCcceeec-CcCCcccc Q lcl|NC_019916. 8 LAARTYTKQFAQLMQTVFGAQSVFGPTFGD-LQALDGVQNNATAFSVKTNNVPVVV--GEYSQDEAVAFGA-GTAKSTRF 83 (286) Q Consensus 8 ~a~r~Y~kq~~~ll~~vf~~qa~F~~~fgg-lQ~lDGVqnN~taf~vKtnd~pVvv--g~Y~td~NvaFGt-GTg~s~RF 83 (286) ||+ -|.++|...|...|...+.......+ .-. .-|+-| -+=+||+..+-|.. +.|+. +.+|+. |+- T Consensus 1 Mai-nya~~~~~~Ld~~~~~~~lts~~l~~~~~~-~~v~~~-ggktVkIp~is~tsGl~DY~R--~~g~~~~g~v----- 70 (346) T protein:vir:10 1 MTI-NYAEKYQAAVQQAFYDGHLYSAELWNSPSN-SIIKFD-GAKHIKVPRLEITSGRKDRQR--RTITTPVANY----- 70 (346) T ss_pred Ccc-hhHHHHHHHHHHHHHhhhccchhhcccccc-cceEec-CCCEEEEEEeeeecccccccc--cCCccccccc----- Confidence 665 35788999999999776543322211 111 111111 13367877776654 46764 233331 221 Q ss_pred cceeEEEEecccccccccchhhhccccccccC-ChhHHHHHHHhhHHHHH-HHHHHHHHHHHHhhhh---------hhhh Q lcl|NC_019916. 84 GERTEIVYTDTDVPYEFTWAIHEGLDRFTVNN-DLNAAVADRLDLQAQAK-VRMFNNALGKKLADAS---------TDLG 152 (286) Q Consensus 84 G~rkEIiy~DtdVpY~~~~aiHEGiDr~TVNn-dl~aavAdRl~Lqa~Ak-~~~~n~~~gk~ls~~a---------~~~~ 152 (286) ..-|+.-...+|-.|.|. ||.+-|+. .....+|.=++.+..-+ +-.+++..=..|...+ +.+. T Consensus 71 ----~~~~et~tl~qDR~~~F~--vD~mDvDETn~~~~~anv~~ef~r~~vvPEiDayrfskLa~~a~~~~~~~~~~~a~ 144 (346) T protein:vir:10 71 ----SNDWDSYELKNERYWSTL--VDPSDIDETNMVVSLANITKQFNLDSKMPEKDRYMFSHLYSGKEAAHDGGITTNTL 144 (346) T ss_pred ----ccceeEEEeeccccceec--ccccchHHHHHHhHHHHHHHHHHHHhhcchhhHHHHHHHHHhhhhhcccccccccc Confidence 233445556667777664 88887765 23344555554433333 2344554333332211 1123 Q ss_pred hhhhHHHHHHHHhhhhhceee-eEEEEEEECchhhhhhhccccccccc--cceeeeccCceeeecCeEEEEchHHhhc-- Q lcl|NC_019916. 153 AVDDVNVMFETASAKYTNLEV-VVPVRAYVTADVYNAIIDHNLVTSQK--GSAVNIDENGIVRFRDIIITKVPEKYMQ-- 227 (286) Q Consensus 153 t~d~V~klF~~~~~~yvn~ev-~~~~~ayV~~evYNaIvD~~l~Ts~K--~Ss~NiD~ngi~~fKgf~l~e~p~~y~q-- 227 (286) |.+.+.+.+-.+..+--+.+| ..+.++||+|++|..|=..+.-+..- ++.-+|+ --+-++-||.|.|||+..|+ T Consensus 145 T~~ni~~~i~~~~~~lde~~vp~~~rvl~vTp~~~~lLk~s~~f~k~~~v~~~~~i~-~~V~siDGv~Ii~VPs~r~~t~ 223 (346) T protein:vir:10 145 DEKNILPAFDNMMLDFDEARIPSTNRILYVTPKTNAILKRAEAMNRALTLKDPNNIQ-RTVYSLDDVTIRVVPSDLMQTA 223 (346) T ss_pred CHHHHHHHHHHHHHHHHHccCCCCCeEEEECHHHHHHHhhchhheeccccccccccc-eeeeeecCeEEEEcchhhcccc Confidence 557788889999999888999 47899999999999887666433111 1222342 22567899999999999996 Q ss_pred -----C---------ceEEEeecceeeeccceEEEEEeecc-Cccce-eeeecccccccCCCcCcceEEEEecCC Q lcl|NC_019916. 228 -----G---------KAIMFVPDNIGRAFTGIVTTRTIESE-DFDGV-ALQGAGKAGSFILDDNKAAIFSATPKA 286 (286) Q Consensus 228 -----g---------~~~ifs~dnIg~af~GI~taRtieSE-DFdGV-aLQgAgK~G~~IlddNKkAI~k~t~ka 286 (286) | -..|..|-..-++.+=.+..|..+.. .-.|- ..|+--=+.-||+|.=+++|.-....| T Consensus 224 ~~f~~G~~~~t~ak~INfiiv~~~A~ia~~K~~~~~if~P~~~~~g~~l~~~R~Y~D~fv~~nk~~~Iyv~~~~a 298 (346) T protein:vir:10 224 YDFSDGSKIIDTAKQIEMFLIYNGVQIAPEKYSFVGFDQPSAATSGNYLYYEQSYDDVLLLNTKTKGIQFVVSDK 298 (346) T ss_pred hhhccCccccCCccceeEEEECCceeeeeeeeeeeEeeCCCCCcccceeeeeeeeeeeeeeccccceEEEeeecc Confidence 3 22355566777778888888887653 22332 344444467889977777774422222 No 12 >protein:vir:79008 Length: 299 # NCBI annotation: putative main capsid protein # Family: family:all:701 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110725;genbank:gi:134287342;genbank:GeneID:4955182 Probab=98.43 E-value=3.9e-08 Score=61.23 Aligned_cols=261 Identities=15% Similarity=0.138 Sum_probs=149.8 Q ss_pred CCCCcccceeeeechhHHHHHHHHHhhhhhhhhhhcccchhcCCc-ccceeEEEeecccce-EeecccCCCcceeecCcC Q lcl|NC_019916. 1 MATNNNNLAARTYTKQFAQLMQTVFGAQSVFGPTFGDLQALDGVQ-NNATAFSVKTNNVPV-VVGEYSQDEAVAFGAGTA 78 (286) Q Consensus 1 M~t~nnn~a~r~Y~kq~~~ll~~vf~~qa~F~~~fgglQ~lDGVq-nN~taf~vKtnd~pV-vvg~Y~td~NvaFGtGTg 78 (286) ||+ =-|.++|.+.|...|.+.++|..-.+.-.. .-|+ ++ +=+||++.+.+ -++.|+.+ +.+|-.|+- T Consensus 1 MA~-------~n~a~~~~~~Ld~~~~~~l~~~~L~~~~~~-~~v~~~g--g~tVkI~~i~~~gl~DY~R~-~~g~~~g~~ 69 (299) T protein:vir:79 1 MAA-------LNYAKEYSNVLAQAYPYTLNFGDLYATPNN-GRYRWTG--SKTIEIPTISTTGRVDSNRD-TIAVAQRNY 69 (299) T ss_pred Ccc-------chhHHHHHHHHHHHHHhhceeeeeccCccc-ceeeecC--CCEEEEeccccccccccccC-CCccccccc Confidence 663 236789999999999999987644432111 1112 22 22566666654 34467652 224444321 Q ss_pred CcccccceeEEEEecccccccccchhhhccccccccC---ChhH--HHHHHHhhHHHHHHHHHHHHHHHHHhhhhh---- Q lcl|NC_019916. 79 KSTRFGERTEIVYTDTDVPYEFTWAIHEGLDRFTVNN---DLNA--AVADRLDLQAQAKVRMFNNALGKKLADAST---- 149 (286) Q Consensus 79 ~s~RFG~rkEIiy~DtdVpY~~~~aiHEGiDr~TVNn---dl~a--avAdRl~Lqa~Ak~~~~n~~~gk~ls~~a~---- 149 (286) . .-|+.-..-++-.|+|. ||.+-|+. .+.+ ++++..+ ...+..+|+.....|...+. T Consensus 70 -----~----~~~~t~~ldqdr~~~f~--vD~~Dvdet~~~~~~a~v~~~~~~---~~v~pEiDay~~skl~~~a~~~g~ 135 (299) T protein:vir:79 70 -----D----NAWEPKVLTNQRKWSTL--VHPADINQTNYVASIGNITKVYNE---EQKFPEMDAYCISKIYADWTALGN 135 (299) T ss_pred -----C----cceeEEEeeccccceec--cchhhHHHHhhhhHHHHHHHHHHH---HHhhhHhhHHHHHHHHHhhhhcCC Confidence 1 12233333444445442 55555443 2211 2333333 33344556655555533331 Q ss_pred ----hhhhhhhHHHHHHHHhhhhhceee-eEEEEEEECchhhhhhhccccccccccce-eeeccCce-eeecCeEEEEch Q lcl|NC_019916. 150 ----DLGAVDDVNVMFETASAKYTNLEV-VVPVRAYVTADVYNAIIDHNLVTSQKGSA-VNIDENGI-VRFRDIIITKVP 222 (286) Q Consensus 150 ----~~~t~d~V~klF~~~~~~yvn~ev-~~~~~ayV~~evYNaIvD~~l~Ts~K~Ss-~NiD~ngi-~~fKgf~l~e~p 222 (286) .+.|.+++...+-.+.++--+.+| .....+||+|++|.+|-.++.-+...... .++.-||. -++-||.|.||| T Consensus 136 ~~~~~~~T~~n~y~~i~~~~~~lde~~vP~~~rvl~vtp~~~~~L~~~~~f~k~~~~~~~~~~~~g~Vg~idG~~Ii~Vp 215 (299) T protein:vir:79 136 TADTTVLTTTNVLEVFDKLMEKMTEARVPENGRILYVTPVVNTLIKNAKEIQRTVNIKDAGTSLNRQTTDIDTVKIIKVP 215 (299) T ss_pred cccccccCHHHHHHHHHHHHHHHHhcCCCCCCeEEEeCHHHHHHHhhchhhhcccccccccceeeeeeeeecceEEEEec Confidence 123557788888888888888888 46799999999999999887644333221 22233444 368999999999 Q ss_pred HHhhcCc----------------eEEEeecceeeeccceEEEEEeeccCc-cceeeeecccc-cccCCCcCcceEEEEec Q lcl|NC_019916. 223 EKYMQGK----------------AIMFVPDNIGRAFTGIVTTRTIESEDF-DGVALQGAGKA-GSFILDDNKAAIFSATP 284 (286) Q Consensus 223 ~~y~qg~----------------~~ifs~dnIg~af~GI~taRtieSEDF-dGVaLQgAgK~-G~~IlddNKkAI~k~t~ 284 (286) +..|... ..|..+...-.+.+=++..|..+.+-. .|=+|..=-+| .-||+|.=|++|.-... T Consensus 216 s~r~~t~~~~~~G~~~~~~ak~in~ii~~~~a~~~~~K~~~~~~~~P~~~~~~~~~~~~r~y~d~~v~~nk~~~i~~~~~ 295 (299) T protein:vir:79 216 SNLMKTAYDFTTGWKVGAGAKQIFMSLVHPSAIITPVSYQFSKLDEPTAVTEGKYFYFEESFEDVFILNKKADAIQFVVE 295 (299) T ss_pred hhhcCccceeccCccccCcccccceEEEcCCeeeeeEeeeeEEeecCCCCCccceeeeeeeeeeeeeeccccCeEEEEee Confidence 9999852 345556677778888888888765432 22233333344 56777555555533333 Q ss_pred CC Q lcl|NC_019916. 285 KA 286 (286) Q Consensus 285 ka 286 (286) .| T Consensus 296 ~a 297 (299) T protein:vir:79 296 GA 297 (299) T ss_pred ec Confidence 33 No 13 >protein:vir:102605 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655002;genbank:gi:109392192;genbank:GeneID:4157227 Probab=98.39 E-value=8.8e-08 Score=59.27 Aligned_cols=259 Identities=16% Similarity=0.180 Sum_probs=148.2 Q ss_pred ceeeeec-hhHHHHHHHHHhhhhhhhhhhcccchhcCCcccceeEEEeecccceEeecccCCCcceeecC-cCCcccccc Q lcl|NC_019916. 8 LAARTYT-KQFAQLMQTVFGAQSVFGPTFGDLQALDGVQNNATAFSVKTNNVPVVVGEYSQDEAVAFGAG-TAKSTRFGE 85 (286) Q Consensus 8 ~a~r~Y~-kq~~~ll~~vf~~qa~F~~~fgglQ~lDGVqnN~taf~vKtnd~pVvvg~Y~td~NvaFGtG-Tg~s~RFG~ 85 (286) ||+-.|. ++|.+.+...|++++.|.+..---...+|. +.+|.---|... +-+.+|... |+. +...-.-++ T Consensus 1 MA~~~~~pe~~~~~v~~~~~~~lv~~~l~~~~~~~~~~-~Gdtv~ip~~~~--~~~~d~~~~-----~~~~~~~~~~~~~ 72 (273) T protein:vir:10 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYEGTAS-KGNVVHIAGVVA--PTVKDYKAA-----GRQTSADAISDTG 72 (273) T ss_pred CcchhhhHHHHHHHHHHHHHhhhccchhhccccccccc-cCceEEEeeccc--ccccccccC-----CCccCccccccce Confidence 7777664 679999999999999988754332222332 223221112221 223445321 111 111111122 Q ss_pred eeEEEEecccccccccchhhhccccccccCChhHHHHHHHhhHHHHHHHHHHHHHHHHHhhhhhhh-----hhhhhHHHH Q lcl|NC_019916. 86 RTEIVYTDTDVPYEFTWAIHEGLDRFTVNNDLNAAVADRLDLQAQAKVRMFNNALGKKLADASTDL-----GAVDDVNVM 160 (286) Q Consensus 86 rkEIiy~DtdVpY~~~~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~~~gk~ls~~a~~~-----~t~d~V~kl 160 (286) ..=.|-.+..+++..+ .+|+.....|+.+ +.+ -|++|-.+.++..+...+..++... .+.+++... T Consensus 73 ~~~tid~~~~~~~~i~-----d~d~~~~~~~~~~-~~~---~~~~alA~~vD~~i~~~~~~a~~~~~~~~~~~~~~~~~~ 143 (273) T protein:vir:10 73 VDLLIDQEKSIDFLVD-----DIDRVQVAGSLEA-YTR---AGATALATDTDKFIADMLVDNGTALTGSAPTDADDAFDL 143 (273) T ss_pred EEEEEeeeeecceEee-----cHHHhhhhccHHH-HHH---HHHHHHHHHHHHHHHHHHhccccccccccccchhHHHHH Confidence 2212223334443322 6778888888754 444 4677777889988877776544322 233344444 Q ss_pred HHHHhhhhhceee-eEEEEEEECchhhhhhhcccc-ccc-cc-cceeeeccCceeeecCeEEEE---chHHhhcCceEEE Q lcl|NC_019916. 161 FETASAKYTNLEV-VVPVRAYVTADVYNAIIDHNL-VTS-QK-GSAVNIDENGIVRFRDIIITK---VPEKYMQGKAIMF 233 (286) Q Consensus 161 F~~~~~~yvn~ev-~~~~~ayV~~evYNaIvD~~l-~Ts-~K-~Ss~NiD~ngi~~fKgf~l~e---~p~~y~qg~~~if 233 (286) |-.+....-+..| ...-.++|+|+.|..|.-.+. .+. .+ ++...+=+--|-++-||.+-+ +|..-- ...+.| T Consensus 144 i~~a~~~ld~~~vP~~~R~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~i~G~~v~~s~~lp~~~~-~~~~~~ 222 (273) T protein:vir:10 144 IAKALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDD-EQFVAF 222 (273) T ss_pred HHHHHHHhhhcCCCcCCCEEEECHHHHHHHhcchhhhhhhhccccccceeeeeeeEEeceEEEEecccccCCc-cEEEEE Confidence 5555555444444 234578999999999986552 332 22 222222233356899999998 574211 235677 Q ss_pred eecceeeeccceEEEEEeeccCccceeeeecccccccCCCcCcceEEEEecC Q lcl|NC_019916. 234 VPDNIGRAFTGIVTTRTIESEDFDGVALQGAGKAGSFILDDNKAAIFSATPK 285 (286) Q Consensus 234 s~dnIg~af~GI~taRtieSEDFdGVaLQgAgK~G~~IlddNKkAI~k~t~k 285 (286) .+..+|.+ ..|........++--|-.+.|---||..+++.-+.++++.+-. T Consensus 223 ~~~A~~~a-~q~~~~e~~r~~~~~~~~v~~~~~yg~~v~~~~~~~~l~~~g~ 273 (273) T protein:vir:10 223 HPSAAAYV-SQIDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred eccceeee-eeeehhhcccCCCcceeeeeeeeeeeeeEeccceEEEEeccCC Confidence 88888765 4565555555555558889988889999999876666665544 No 14 >protein:vir:105822 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655767;genbank:gi:109522090;genbank:GeneID:4157630 Probab=98.39 E-value=8.8e-08 Score=59.27 Aligned_cols=259 Identities=16% Similarity=0.180 Sum_probs=148.2 Q ss_pred ceeeeec-hhHHHHHHHHHhhhhhhhhhhcccchhcCCcccceeEEEeecccceEeecccCCCcceeecC-cCCcccccc Q lcl|NC_019916. 8 LAARTYT-KQFAQLMQTVFGAQSVFGPTFGDLQALDGVQNNATAFSVKTNNVPVVVGEYSQDEAVAFGAG-TAKSTRFGE 85 (286) Q Consensus 8 ~a~r~Y~-kq~~~ll~~vf~~qa~F~~~fgglQ~lDGVqnN~taf~vKtnd~pVvvg~Y~td~NvaFGtG-Tg~s~RFG~ 85 (286) ||+-.|. ++|.+.+...|++++.|.+..---...+|. +.+|.---|... +-+.+|... |+. +...-.-++ T Consensus 1 MA~~~~~pe~~~~~v~~~~~~~lv~~~l~~~~~~~~~~-~Gdtv~ip~~~~--~~~~d~~~~-----~~~~~~~~~~~~~ 72 (273) T protein:vir:10 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYEGTAS-KGNVVHIAGVVA--PTVKDYKAA-----GRQTSADAISDTG 72 (273) T ss_pred CcchhhhHHHHHHHHHHHHHhhhccchhhccccccccc-cCceEEEeeccc--ccccccccC-----CCccCccccccce Confidence 7777664 679999999999999988754332222332 223221112221 223445321 111 111111122 Q ss_pred eeEEEEecccccccccchhhhccccccccCChhHHHHHHHhhHHHHHHHHHHHHHHHHHhhhhhhh-----hhhhhHHHH Q lcl|NC_019916. 86 RTEIVYTDTDVPYEFTWAIHEGLDRFTVNNDLNAAVADRLDLQAQAKVRMFNNALGKKLADASTDL-----GAVDDVNVM 160 (286) Q Consensus 86 rkEIiy~DtdVpY~~~~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~~~gk~ls~~a~~~-----~t~d~V~kl 160 (286) ..=.|-.+..+++..+ .+|+.....|+.+ +.+ -|++|-.+.++..+...+..++... .+.+++... T Consensus 73 ~~~tid~~~~~~~~i~-----d~d~~~~~~~~~~-~~~---~~~~alA~~vD~~i~~~~~~a~~~~~~~~~~~~~~~~~~ 143 (273) T protein:vir:10 73 VDLLIDQEKSIDFLVD-----DIDRVQVAGSLEA-YTR---AGATALATDTDKFIADMLVDNGTALTGSAPTDADDAFDL 143 (273) T ss_pred EEEEEeeeeecceEee-----cHHHhhhhccHHH-HHH---HHHHHHHHHHHHHHHHHHhccccccccccccchhHHHHH Confidence 2212223334443322 6778888888754 444 4677777889988877776544322 233344444 Q ss_pred HHHHhhhhhceee-eEEEEEEECchhhhhhhcccc-ccc-cc-cceeeeccCceeeecCeEEEE---chHHhhcCceEEE Q lcl|NC_019916. 161 FETASAKYTNLEV-VVPVRAYVTADVYNAIIDHNL-VTS-QK-GSAVNIDENGIVRFRDIIITK---VPEKYMQGKAIMF 233 (286) Q Consensus 161 F~~~~~~yvn~ev-~~~~~ayV~~evYNaIvD~~l-~Ts-~K-~Ss~NiD~ngi~~fKgf~l~e---~p~~y~qg~~~if 233 (286) |-.+....-+..| ...-.++|+|+.|..|.-.+. .+. .+ ++...+=+--|-++-||.+-+ +|..-- ...+.| T Consensus 144 i~~a~~~ld~~~vP~~~R~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~i~G~~v~~s~~lp~~~~-~~~~~~ 222 (273) T protein:vir:10 144 IAKALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDD-EQFVAF 222 (273) T ss_pred HHHHHHHhhhcCCCcCCCEEEECHHHHHHHhcchhhhhhhhccccccceeeeeeeEEeceEEEEecccccCCc-cEEEEE Confidence 5555555444444 234578999999999986552 332 22 222222233356899999998 574211 235677 Q ss_pred eecceeeeccceEEEEEeeccCccceeeeecccccccCCCcCcceEEEEecC Q lcl|NC_019916. 234 VPDNIGRAFTGIVTTRTIESEDFDGVALQGAGKAGSFILDDNKAAIFSATPK 285 (286) Q Consensus 234 s~dnIg~af~GI~taRtieSEDFdGVaLQgAgK~G~~IlddNKkAI~k~t~k 285 (286) .+..+|.+ ..|........++--|-.+.|---||..+++.-+.++++.+-. T Consensus 223 ~~~A~~~a-~q~~~~e~~r~~~~~~~~v~~~~~yg~~v~~~~~~~~l~~~g~ 273 (273) T protein:vir:10 223 HPSAAAYV-SQIDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred eccceeee-eeeehhhcccCCCcceeeeeeeeeeeeeEeccceEEEEeccCC Confidence 88888765 4565555555555558889988889999999876666665544 No 15 >protein:vir:78920 Length: 290 # NCBI annotation: Cps # Family: family:all:701 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468846;genbank:gi:157325479;genbank:GeneID:5601917 Probab=98.20 E-value=3.9e-07 Score=55.69 Aligned_cols=256 Identities=14% Similarity=0.127 Sum_probs=154.7 Q ss_pred ceeeeechhHHHHHHHHHhhhhhhhhhhcccchhcCCcccceeEEEeecccce-EeecccCCCcceeecCcCCcccccce Q lcl|NC_019916. 8 LAARTYTKQFAQLMQTVFGAQSVFGPTFGDLQALDGVQNNATAFSVKTNNVPV-VVGEYSQDEAVAFGAGTAKSTRFGER 86 (286) Q Consensus 8 ~a~r~Y~kq~~~ll~~vf~~qa~F~~~fgglQ~lDGVqnN~taf~vKtnd~pV-vvg~Y~td~NvaFGtGTg~s~RFG~r 86 (286) ||+- |.++|.+.|...|.+.+++...--+ + ..+ ++ +=+||++.+.+ -++.|+.+. +|..|+-+ T Consensus 1 Main-~a~~~~~~Ld~~~~~~~~t~~l~~~-~-~~~--~g--gktVkI~~i~~~gl~DY~R~~--g~~~g~v~------- 64 (290) T protein:vir:78 1 MAIN-YVDKYGKELDQKLVFGTYTNELETP-N-LLW--LD--AKTFKIQTITTTGLKAHTRNK--GYNEGSAS------- 64 (290) T ss_pred Cchh-HHHHHHHHHHHHHHhhheeeecccc-c-eee--cc--CCEEEEeeeccCcccccccCC--CcccCccc------- Confidence 6664 5688999999999999886553211 1 112 12 22467666554 345688753 55444332 Q ss_pred eEEEEecccccccccchhhhccccccccCC-hhHHHHHHHh-hHHHHHHHHHHHHHHHHHhhhhh-------hhhhhhhH Q lcl|NC_019916. 87 TEIVYTDTDVPYEFTWAIHEGLDRFTVNND-LNAAVADRLD-LQAQAKVRMFNNALGKKLADAST-------DLGAVDDV 157 (286) Q Consensus 87 kEIiy~DtdVpY~~~~aiHEGiDr~TVNnd-l~aavAdRl~-Lqa~Ak~~~~n~~~gk~ls~~a~-------~~~t~d~V 157 (286) .-|+.....++-.|+|. ||.+-|+.- ....+|.-++ .|+...+-.+|+..-..|...|. .+.|.+.+ T Consensus 65 --~~~et~tl~qdR~~~F~--vD~~DvDEt~~~~~~~nv~~ef~~~~v~PEiDayr~skla~~a~~~~~~~~~t~t~~n~ 140 (290) T protein:vir:78 65 --NTNKSYTIDFDRDVEFF--VDVMDVDETGQALSAANVTKEFNSRHAGPEMDAYRFSKLATAAKTNSNSVAEEITKDNV 140 (290) T ss_pred --cceeeEEeeccccceee--ccccchhHHhhhhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhhhccCcccccccCHHHH Confidence 23344445566666653 777666431 2222333322 23344455667665445544331 12244556 Q ss_pred HHHHHHHhhhhhceeeeEEEEEEECchhhhhhhcccccccccc----ceeeeccCceeeecCeEEEEchHH-hhc----- Q lcl|NC_019916. 158 NVMFETASAKYTNLEVVVPVRAYVTADVYNAIIDHNLVTSQKG----SAVNIDENGIVRFRDIIITKVPEK-YMQ----- 227 (286) Q Consensus 158 ~klF~~~~~~yvn~ev~~~~~ayV~~evYNaIvD~~l~Ts~K~----Ss~NiD~ngi~~fKgf~l~e~p~~-y~q----- 227 (286) .+.+-.+..+--+ -...+..+||+|++|.+|-.++.-+...+ +...|+. -+-++.||.|.|+|.+ .|. T Consensus 141 ~~~i~~~~~~lde-vp~~~rvl~vtp~~~~lL~~~~~f~r~~~~~~~~~~~i~~-~V~~idG~~ii~vps~~r~~t~~~f 218 (290) T protein:vir:78 141 FTKLKAAIRKVKK-YGTQNLVMYVSPDVMAALELSDDFVRAINVQNIGPSSIET-RITAIDGTRIVEVEAEDRFYDTFDF 218 (290) T ss_pred HHHHHHHHHHHHh-cCCCCeEEEECHHHHHHHhhChhhhccccccccccccccc-eeeeecCcEEEEecccchhhhhhhh Confidence 5555555554322 12457899999999999987765544222 1223322 3468999999999964 442 Q ss_pred ---------Cce--EEEeecceeeeccceEEEEEeeccCc---cceeeeecccccccCCCcCcceEEEEecC Q lcl|NC_019916. 228 ---------GKA--IMFVPDNIGRAFTGIVTTRTIESEDF---DGVALQGAGKAGSFILDDNKAAIFSATPK 285 (286) Q Consensus 228 ---------g~~--~ifs~dnIg~af~GI~taRtieSEDF---dGVaLQgAgK~G~~IlddNKkAI~k~t~k 285 (286) ++. .|..|.+.-++.+=.+..|..+.+-. ||=..|+--=+.-||+|.=|++|..-+-- T Consensus 219 ~~G~~~~~~ak~in~ii~~~~a~i~~~K~~~~~~~~P~~~~~~d~~~~~~r~y~d~~v~~nk~~~i~~~~~~ 290 (290) T protein:vir:78 219 TDGYKPAAGAKKLNFLLVNKGSVVGGAKHASIYLHAPGSVGQGDGWLYQYRVYHDIFVLDQQKDGVIASTEV 290 (290) T ss_pred cccccccCCccceeEEEEcCCceeeeeeeeEEEeeCCCCCcCcceeeeeeeeeeeeeeeccccCeeEEEeeC Confidence 333 35567777888888889999987766 77788888888999998888888872222 No 16 >protein:vir:7990 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817344;genbank:gi:29565772;genbank:GeneID:1258978 Probab=98.15 E-value=5.2e-07 Score=55.05 Aligned_cols=258 Identities=17% Similarity=0.185 Sum_probs=146.4 Q ss_pred ceeeee-chhHHHHHHHHHhhhhhhhhhhcccchhcCCcccceeEEEeecccc-eEeecccCCCcceeec-CcCCccccc Q lcl|NC_019916. 8 LAARTY-TKQFAQLMQTVFGAQSVFGPTFGDLQALDGVQNNATAFSVKTNNVP-VVVGEYSQDEAVAFGA-GTAKSTRFG 84 (286) Q Consensus 8 ~a~r~Y-~kq~~~ll~~vf~~qa~F~~~fgglQ~lDGVqnN~taf~vKtnd~p-Vvvg~Y~td~NvaFGt-GTg~s~RFG 84 (286) ||+-.+ .++|.+.+...|+++..|.+..-.---..|.+ .+ +|+..-.+ +-+.+|... |+ ++...-=.+ T Consensus 1 MA~~~~~pei~~~~v~~~~~~~lv~~~l~~~~~~~~~~~-Gd---Tv~ip~~~~~~~~d~~~~-----~~~~~~~~~~~~ 71 (273) T protein:vir:79 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASK-GN---VVHIAGVVAPTVKDYKAA-----GRQTSADAISDT 71 (273) T ss_pred CcchhhhHHHHHHHHHHHHHhhccchhhhhccccccccC-Cc---EEEEeecCcccccccccC-----CCccCccccccc Confidence 555545 58899999999999998876531111111221 22 23332222 122334321 11 111111112 Q ss_pred ceeEEEEecccccccccchhhhccccccccCChhHHHHHHHhhHHHHHHHHHHHHHHHHHhhhhhhh-----hhhhhHHH Q lcl|NC_019916. 85 ERTEIVYTDTDVPYEFTWAIHEGLDRFTVNNDLNAAVADRLDLQAQAKVRMFNNALGKKLADASTDL-----GAVDDVNV 159 (286) Q Consensus 85 ~rkEIiy~DtdVpY~~~~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~~~gk~ls~~a~~~-----~t~d~V~k 159 (286) +..-.|-.+..+++..+ .+|+....-|+.+ +.+ -|++|-.+.++..+-..+..++... .+.+.+.. T Consensus 72 ~~~~tid~~~~~~~~i~-----d~d~~~~~~~~~~-~~~---~~~~ala~~vD~~i~~~~~~a~~~~~~~~~~~~~~~~~ 142 (273) T protein:vir:79 72 GVDLLIDQEKSIDFLVD-----DIDRVQVAGSLEA-YTR---AGATALATDTDKFIADMLVDNGTALTGSAPSDADDAFD 142 (273) T ss_pred eEEEEEeeecccceeec-----cHHHHhhcccHHH-HHH---HHHHHHHHHHHHHHHHHHhhcccccccccccchhhHHH Confidence 22222334444544432 6677777778764 444 3667778888988887776654332 22344445 Q ss_pred HHHHHhhhhhceeee-EEEEEEECchhhhhhhcccc-ccccc-c-ceeeeccCceeeecCeEEEE---chHHhhcCceEE Q lcl|NC_019916. 160 MFETASAKYTNLEVV-VPVRAYVTADVYNAIIDHNL-VTSQK-G-SAVNIDENGIVRFRDIIITK---VPEKYMQGKAIM 232 (286) Q Consensus 160 lF~~~~~~yvn~ev~-~~~~ayV~~evYNaIvD~~l-~Ts~K-~-Ss~NiD~ngi~~fKgf~l~e---~p~~y~qg~~~i 232 (286) .|-.+....=+..|- ..-.++|+|+.|..|.-.+. .+.+. . +...+-+--+-++.||.|-+ +|..-- ...+. T Consensus 143 ~i~~a~~~ld~~~vP~~~R~lvv~p~~~~~Ll~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~i~~s~~lp~~~~-~~~~a 221 (273) T protein:vir:79 143 LIASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDD-EQFVA 221 (273) T ss_pred HHHHHHHHhhhccCCccCcEEEECHHHHHHHhhchhhhhhhhhcccccceeeeEeeEEeceEEEecccccccCc-eEEEE Confidence 555566555555552 34578999999999975542 33222 2 22223333355899999988 464211 23557 Q ss_pred EeecceeeeccceEEEEEeeccCccceeeeecccccccCCCcCcceEEEEecC Q lcl|NC_019916. 233 FVPDNIGRAFTGIVTTRTIESEDFDGVALQGAGKAGSFILDDNKAAIFSATPK 285 (286) Q Consensus 233 fs~dnIg~af~GI~taRtieSEDFdGVaLQgAgK~G~~IlddNKkAI~k~t~k 285 (286) |.+..++-+ ..|....+-..++--|-.+-|---||..+++..+.++++.+-. T Consensus 222 ~~~~A~~~a-~~~~~~e~~r~~~~~~~~v~~~~~yg~~v~~p~~vv~~~~~g~ 273 (273) T protein:vir:79 222 FHPSAAAYV-SQIDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred Eeccceeee-eehhhhhcccCcccceeeeeeeeeeeeEEecCceEEEEeccCC Confidence 788887754 4555444444455448888888889999999887777765544 No 17 >protein:vir:102335 Length: 312 # NCBI annotation: putative capsid protein # Family: family:all:701 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529560;genbank:gi:90592716;genbank:GeneID:3974467 Probab=98.10 E-value=8.8e-07 Score=53.79 Aligned_cols=263 Identities=16% Similarity=0.163 Sum_probs=159.6 Q ss_pred CCCCcccceeeeechhHHHHHHHHHhhhhhhhhhhcccchhcCCc-ccceeEEEeecccceE-eecccCCCcceeecCcC Q lcl|NC_019916. 1 MATNNNNLAARTYTKQFAQLMQTVFGAQSVFGPTFGDLQALDGVQ-NNATAFSVKTNNVPVV-VGEYSQDEAVAFGAGTA 78 (286) Q Consensus 1 M~t~nnn~a~r~Y~kq~~~ll~~vf~~qa~F~~~fgglQ~lDGVq-nN~taf~vKtnd~pVv-vg~Y~td~NvaFGtGTg 78 (286) ||. + + -|.++|.+.|..+|...+.+..-=+.- +.|+ ++ +=.||+..+.+. ++.|+.+...+|..|+= T Consensus 1 Man-t----l-~ya~~~~~~LD~~~~~~~~s~~l~~~~---~~v~~~g--gktVkIp~i~~~gl~DY~R~~g~~~~~g~v 69 (312) T protein:vir:10 1 MAN-T----L-AYGQVLQQGLDKQATQELLTGWMDSNA---KQIKYEG--GKEVKIGKLSTDGLGDYSRGSANAYVGGDV 69 (312) T ss_pred CCc-c----h-hHHHHHHHHHHHHHHhhhccccccCCC---ceEEEec--CcEEEEEeeecccccccccccCCccccccc Confidence 772 1 2 477999999999999988665322220 1121 22 224566655531 33577766556655542 Q ss_pred CcccccceeEEEEecccccccccchhhhccccccccC-ChhHHHHHHHhh-HHHHHHHHHHHH-HHHHHhhhhh------ Q lcl|NC_019916. 79 KSTRFGERTEIVYTDTDVPYEFTWAIHEGLDRFTVNN-DLNAAVADRLDL-QAQAKVRMFNNA-LGKKLADAST------ 149 (286) Q Consensus 79 ~s~RFG~rkEIiy~DtdVpY~~~~aiHEGiDr~TVNn-dl~aavAdRl~L-qa~Ak~~~~n~~-~gk~ls~~a~------ 149 (286) +. -|+.....+|-.|.|. ||++-|+. .+...+|.=++. |....+-.+|+. +.|--+.+.. T Consensus 70 ~~---------~~et~tl~qDR~~~F~--vD~mDvDETn~~~s~anv~~ef~r~~vvPEiDayrfskla~~a~~~~~~~~ 138 (312) T protein:vir:10 70 KF---------EYETKTMTQDRGRKFT--LDAMDVDETNFLVTATTVMGEFQRLKVIPEIDAYRLSRLATIAIGIKGDTN 138 (312) T ss_pred cc---------cceeEEeeecccceee--ccccchhhHhhHHHHHHHHHHHHHhhhcchhhHHHHHHHHhhhhccccccc Confidence 21 2334445566677765 78888775 344445655555 344444566665 3332222211 Q ss_pred ----hhhhhhhHHHHHHHHhhhhhceeeeEEEEEEECchhhhhhhcccc--ccccccceeeeccCceeeecCeEEEEchH Q lcl|NC_019916. 150 ----DLGAVDDVNVMFETASAKYTNLEVVVPVRAYVTADVYNAIIDHNL--VTSQKGSAVNIDENGIVRFRDIIITKVPE 223 (286) Q Consensus 150 ----~~~t~d~V~klF~~~~~~yvn~ev~~~~~ayV~~evYNaIvD~~l--~Ts~K~Ss~NiD~ngi~~fKgf~l~e~p~ 223 (286) .+.|.+.+...+-.+.++--+.+|-.+.++||+|++|.+|=+... .++...+..+|| --+-++.|+.|.|||+ T Consensus 139 ~~~~~~~T~~ni~~~i~~~~~~lde~~vp~~rvl~vTp~~~~lLk~~~~~~~~~~~~~~~~i~-~~V~~iDgv~Ii~VPs 217 (312) T protein:vir:10 139 VEYSYSVNSSTIINKIKTGIKIIRENGYNGPLVCHLTYDSMFAIEEKVLEKLTAVTFAQGGIQ-TQVPSIDGCALIKTPQ 217 (312) T ss_pred cccccccCHHHHHHHHHHHHHHHHHccCCCceEEEeChHHHHHHhhhhhceecccccccceee-eeeeeecccEEEEchh Confidence 123557788888888888888898889999999999976655321 122222333342 2245788999999999 Q ss_pred Hhhc-------C----------------c--eEEEeecceeeeccceEEEEEeecc---CccceeeeecccccccCCCcC Q lcl|NC_019916. 224 KYMQ-------G----------------K--AIMFVPDNIGRAFTGIVTTRTIESE---DFDGVALQGAGKAGSFILDDN 275 (286) Q Consensus 224 ~y~q-------g----------------~--~~ifs~dnIg~af~GI~taRtieSE---DFdGVaLQgAgK~G~~IlddN 275 (286) ..|. | + ..|..|...-++.+=.+..|.++.+ +=||=..|+--=+.-||+|.= T Consensus 218 ~r~~t~~~f~dG~t~~~~~gg~~~~~~ak~INfiiv~~~a~i~~~K~~~~~if~P~~~~~~d~~~~~~R~Y~D~fv~~nk 297 (312) T protein:vir:10 218 NRMYSSILLNDGTTSNQTAGGYLKGTKALDTNFIIAPVDVPLAITKQDKMRIFDPETNQTANAWSMDYRRYHDLWVTDNK 297 (312) T ss_pred hhccceeeeccCcccccccCceeecCcccccceEEeCCceeeceeeeeeeeeeCCCCCCCcceeeeeeeeeeeeeeeccc Confidence 9994 1 1 2355566677777777788877543 223456666666788999888 Q ss_pred cceEEEEecCC Q lcl|NC_019916. 276 KAAIFSATPKA 286 (286) Q Consensus 276 KkAI~k~t~ka 286 (286) +++|.--...| T Consensus 298 ~~~Iyv~~k~a 308 (312) T protein:vir:10 298 ANSVYANFKDA 308 (312) T ss_pred cCeEEEEeecc Confidence 88884433334 No 18 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=97.88 E-value=9.2e-06 Score=48.20 Aligned_cols=256 Identities=13% Similarity=0.098 Sum_probs=144.0 Q ss_pred CCCCcccceeeeechhHHHHHHHHHhhhhhhhhhhcccchhcCCcccceeEEEeecccceE-----eecccCCCcceeec Q lcl|NC_019916. 1 MATNNNNLAARTYTKQFAQLMQTVFGAQSVFGPTFGDLQALDGVQNNATAFSVKTNNVPVV-----VGEYSQDEAVAFGA 75 (286) Q Consensus 1 M~t~nnn~a~r~Y~kq~~~ll~~vf~~qa~F~~~fgglQ~lDGVqnN~taf~vKtnd~pVv-----vg~Y~td~NvaFGt 75 (286) |+..+..++-=+--.+|..++..-+..+..|.+.---...+.|..-+ + ++ +|+. ...|.-++..+... T Consensus 1 MA~~~T~~~~~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~-t---v~---iP~~~~~~~a~~v~eg~~i~~~~ 73 (272) T protein:vir:30 1 MAVGTTKMAQMLDPEVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGT-T---LT---VPKWDYIGDAEDVAEGEAIPMTQ 73 (272) T ss_pred CCCccccchheechHHHHHHHHHHHHHHhhhhccccccccccCCCCC-E---EE---EEEecCCCCcccccCCCcccccc Confidence 99666555543333467777777777777664321111223333222 1 11 2332 11233333444333 Q ss_pred CcCCcccccceeEEEEecccccccccchhhhccccccccCChhHHHHHHHhhHHHHHHHHHHHHHHHHHhhhhhh---hh Q lcl|NC_019916. 76 GTAKSTRFGERTEIVYTDTDVPYEFTWAIHEGLDRFTVNNDLNAAVADRLDLQAQAKVRMFNNALGKKLADASTD---LG 152 (286) Q Consensus 76 GTg~s~RFG~rkEIiy~DtdVpY~~~~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~~~gk~ls~~a~~---~~ 152 (286) -+ |++.+-.+.. +-.-|. +++.......-..++..++-++.+|.|.+++.+-..+..+... .. T Consensus 74 ~~-----~~~~~~~~~~-----~~~~~~----itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a~~~~~~~~ 139 (272) T protein:vir:30 74 LG-----FKKTTMTIKK-----AGKGVE----ITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSKSTQTVEATA 139 (272) T ss_pred cc-----cceEEEEeee-----eeeeee----ecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccc Confidence 32 2222111111 011121 3333333333335666666778899999998888777554322 34 Q ss_pred hhhhHHHHHHHHhhhhhceeeeEEEEEEECchhhhhhhcccccccccccee--eeccCce-eeecCeEEEEchHHhhc-C Q lcl|NC_019916. 153 AVDDVNVMFETASAKYTNLEVVVPVRAYVTADVYNAIIDHNLVTSQKGSAV--NIDENGI-VRFRDIIITKVPEKYMQ-G 228 (286) Q Consensus 153 t~d~V~klF~~~~~~yvn~ev~~~~~ayV~~evYNaIvD~~l~Ts~K~Ss~--NiD~ngi-~~fKgf~l~e~p~~y~q-g 228 (286) +.|.+..+...+...+.. +-...|+|++|..|.-..+....+.+.. ++=.+|. -++.|+.+-+.+. +- | T Consensus 140 t~d~i~da~~~l~~~~~~-----~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~g~ig~i~G~~Vi~s~~--~p~~ 212 (272) T protein:vir:30 140 TVDGVSKALDIFNDEDDA-----ETVIVMNPADASTLRLDAAKEWLGATEVGANRVVSGVYGEVLGVQIVRSRK--CPKG 212 (272) T ss_pred CHHHHHHHHHHHhccCCC-----ccEEEEcHHHHHHHHHhccccccccccccccccccccchhhcCeeEEEcCC--CCcc Confidence 667777777666655433 3468899999999977665555554443 3333453 4899998877653 55 7 Q ss_pred ceEEEeecceeeeccceEEEEEeeccCccceeeeecccccccCCCcCcceEEEEecCC Q lcl|NC_019916. 229 KAIMFVPDNIGRAFTGIVTTRTIESEDFDGVALQGAGKAGSFILDDNKAAIFSATPKA 286 (286) Q Consensus 229 ~~~ifs~dnIg~af~GI~taRtieSEDFdGVaLQgAgK~G~~IlddNKkAI~k~t~ka 286 (286) ...+|.+..++..--+-.+..+-.+++..-..+.+-.-||-.+++. .+|+++|.++ T Consensus 213 t~~~~~~~a~~~~~~~~~~ve~~r~~~~~~~~i~~~~~~~~~v~~~--~~vv~~t~~~ 268 (272) T protein:vir:30 213 TAYMVRKGALRIMLKRNTMVETDRDITKAINQIVANKHYGVYLYKA--EKAVKITLKD 268 (272) T ss_pred eEEEEcCCeEEEEecCCceeeeccccccceeEEEEEEEEEEEEEcC--CceEEEEecc Confidence 8888888877765322222233334455568888989999888864 4677777777 No 19 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=97.88 E-value=9.2e-06 Score=48.20 Aligned_cols=256 Identities=13% Similarity=0.098 Sum_probs=144.0 Q ss_pred CCCCcccceeeeechhHHHHHHHHHhhhhhhhhhhcccchhcCCcccceeEEEeecccceE-----eecccCCCcceeec Q lcl|NC_019916. 1 MATNNNNLAARTYTKQFAQLMQTVFGAQSVFGPTFGDLQALDGVQNNATAFSVKTNNVPVV-----VGEYSQDEAVAFGA 75 (286) Q Consensus 1 M~t~nnn~a~r~Y~kq~~~ll~~vf~~qa~F~~~fgglQ~lDGVqnN~taf~vKtnd~pVv-----vg~Y~td~NvaFGt 75 (286) |+..+..++-=+--.+|..++..-+..+..|.+.---...+.|..-+ + ++ +|+. ...|.-++..+... T Consensus 1 MA~~~T~~~~~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~-t---v~---iP~~~~~~~a~~v~eg~~i~~~~ 73 (272) T protein:vir:98 1 MAVGTTKMAQMLDPEVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGT-T---LT---VPKWDYIGDAEDVAEGEAIPMTQ 73 (272) T ss_pred CCCccccchheechHHHHHHHHHHHHHHhhhhccccccccccCCCCC-E---EE---EEEecCCCCcccccCCCcccccc Confidence 99666555543333467777777777777664321111223333222 1 11 2332 11233333444333 Q ss_pred CcCCcccccceeEEEEecccccccccchhhhccccccccCChhHHHHHHHhhHHHHHHHHHHHHHHHHHhhhhhh---hh Q lcl|NC_019916. 76 GTAKSTRFGERTEIVYTDTDVPYEFTWAIHEGLDRFTVNNDLNAAVADRLDLQAQAKVRMFNNALGKKLADASTD---LG 152 (286) Q Consensus 76 GTg~s~RFG~rkEIiy~DtdVpY~~~~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~~~gk~ls~~a~~---~~ 152 (286) -+ |++.+-.+.. +-.-|. +++.......-..++..++-++.+|.|.+++.+-..+..+... .. T Consensus 74 ~~-----~~~~~~~~~~-----~~~~~~----itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a~~~~~~~~ 139 (272) T protein:vir:98 74 LG-----FKKTTMTIKK-----AGKGVE----ITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSKSTQTVEATA 139 (272) T ss_pred cc-----cceEEEEeee-----eeeeee----ecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccc Confidence 32 2222111111 011121 3333333333335666666778899999998888777554322 34 Q ss_pred hhhhHHHHHHHHhhhhhceeeeEEEEEEECchhhhhhhcccccccccccee--eeccCce-eeecCeEEEEchHHhhc-C Q lcl|NC_019916. 153 AVDDVNVMFETASAKYTNLEVVVPVRAYVTADVYNAIIDHNLVTSQKGSAV--NIDENGI-VRFRDIIITKVPEKYMQ-G 228 (286) Q Consensus 153 t~d~V~klF~~~~~~yvn~ev~~~~~ayV~~evYNaIvD~~l~Ts~K~Ss~--NiD~ngi-~~fKgf~l~e~p~~y~q-g 228 (286) +.|.+..+...+...+.. +-...|+|++|..|.-..+....+.+.. ++=.+|. -++.|+.+-+.+. +- | T Consensus 140 t~d~i~da~~~l~~~~~~-----~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~g~ig~i~G~~Vi~s~~--~p~~ 212 (272) T protein:vir:98 140 TVDGVSKALDIFNDEDDA-----ETVIVMNPADASTLRLDAAKEWLGATEVGANRVVSGVYGEVLGVQIVRSRK--CPKG 212 (272) T ss_pred CHHHHHHHHHHHhccCCC-----ccEEEEcHHHHHHHHHhccccccccccccccccccccchhhcCeeEEEcCC--CCcc Confidence 667777777666655433 3468899999999977665555554443 3333453 4899998877653 55 7 Q ss_pred ceEEEeecceeeeccceEEEEEeeccCccceeeeecccccccCCCcCcceEEEEecCC Q lcl|NC_019916. 229 KAIMFVPDNIGRAFTGIVTTRTIESEDFDGVALQGAGKAGSFILDDNKAAIFSATPKA 286 (286) Q Consensus 229 ~~~ifs~dnIg~af~GI~taRtieSEDFdGVaLQgAgK~G~~IlddNKkAI~k~t~ka 286 (286) ...+|.+..++..--+-.+..+-.+++..-..+.+-.-||-.+++. .+|+++|.++ T Consensus 213 t~~~~~~~a~~~~~~~~~~ve~~r~~~~~~~~i~~~~~~~~~v~~~--~~vv~~t~~~ 268 (272) T protein:vir:98 213 TAYMVRKGALRIMLKRNTMVETDRDITKAINQIVANKHYGVYLYKA--EKAVKITLKD 268 (272) T ss_pred eEEEEcCCeEEEEecCCceeeeccccccceeEEEEEEEEEEEEEcC--CceEEEEecc Confidence 8888888877765322222233334455568888989999888864 4677777777 No 20 >protein:vir:78739 Length: 332 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285448;genbank:gi:148724482;genbank:GeneID:5220210 Probab=97.62 E-value=7.9e-06 Score=48.57 Aligned_cols=261 Identities=17% Similarity=0.164 Sum_probs=149.0 Q ss_pred CCCCccc--------ceee--eechhHHHHHHHHHhhhhhhhhhhcccchhcCCcccceeEEEeecccc-eEeecccCCC Q lcl|NC_019916. 1 MATNNNN--------LAAR--TYTKQFAQLMQTVFGAQSVFGPTFGDLQALDGVQNNATAFSVKTNNVP-VVVGEYSQDE 69 (286) Q Consensus 1 M~t~nnn--------~a~r--~Y~kq~~~ll~~vf~~qa~F~~~fgglQ~lDGVqnN~taf~vKtnd~p-Vvvg~Y~td~ 69 (286) |...|+- -.++ +|-|+|.+.+.+-|++++.|++..-- +.. .+ -.- |+.+-++ +-++.|..+. T Consensus 7 ~~~~~~~~~~~~~~~~d~~~al~le~~~geV~~~f~~~s~~~~~~~~-r~i---~~-G~t--v~i~~ig~~~~~~~~~g~ 79 (332) T protein:vir:78 7 FSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRS-YDL---RG-GKS--KQFMFTGKLSAGYHTPGT 79 (332) T ss_pred ccCCccccCCccccccccchhhhhhhhhhhHHHHHHHHhhhhhcccc-ccc---cc-cce--EEEEeccceeEeeecCCC Confidence 4433322 1155 89999999999999999999876542 211 11 111 2222211 2234444433 Q ss_pred cceeecCcCCcccccceeEEEEecccccccccchhhhccccccccCChhHHHHHHHhhHHHHHHHHHHHHHHHHHhhhhh Q lcl|NC_019916. 70 AVAFGAGTAKSTRFGERTEIVYTDTDVPYEFTWAIHEGLDRFTVNNDLNAAVADRLDLQAQAKVRMFNNALGKKLADAST 149 (286) Q Consensus 70 NvaFGtGTg~s~RFG~rkEIiy~DtdVpY~~~~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~~~gk~ls~~a~ 149 (286) ..-.-.. -=.-+-++-.|+..=+ .+.| +.||+..++-|+ ++++..-++.|=.|.++..+...|.+++. T Consensus 80 ~l~~~~~------~~~~~~~l~ID~~ky~--~~~V-ddiD~~q~~~dl---~~~~~~~~g~aLA~~~D~~i~~~l~~aa~ 147 (332) T protein:vir:78 80 PIVGDAG------IKANEKTLVMDDLLVS--SQFV-YSLDEIFSQYST---RAEVSKQIGEALATHYDERIARVLAKASA 147 (332) T ss_pred CCCCCCC------CCCceEEEEEehhhhh--HHHH-HhHHHHhcCcch---HHHHHHHHHHHHHHHHHHHHHHHHHhhhc Confidence 2111000 0001112445544322 2223 568899988876 66777788999999999999888866542 Q ss_pred h-------------------hhhh----hhHHHHHHHHhhhhhceeeeEEEEEEECchhhhhhhc---cccccc-cccce Q lcl|NC_019916. 150 D-------------------LGAV----DDVNVMFETASAKYTNLEVVVPVRAYVTADVYNAIID---HNLVTS-QKGSA 202 (286) Q Consensus 150 ~-------------------~~t~----d~V~klF~~~~~~yvn~ev~~~~~ayV~~evYNaIvD---~~l~Ts-~K~Ss 202 (286) . +.+. |.+.++..++.++.|- .....+.|+|+.|..|+. ..++.. ..+++ T Consensus 148 ~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP---~~gR~~vv~P~~y~~Ll~~~d~~~~n~~~~~~~ 224 (332) T protein:vir:78 148 EASPVTGEPGGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAP---QEGRVAVLSPRQYYSLISSVDTNILNREIGNSQ 224 (332) T ss_pred ccCcccccccccccccCCccccCHHHHHHHHHHHHHHHhhcCCC---ccCCEEEeCHHHHHHHHhhcCceeeeeeccccc Confidence 2 1122 3455666666666543 223568899999999985 333333 23444 Q ss_pred eeeccC-ceeeecCeEEEEchHHhhc-C----------------------ceEEEeecceeeec---cceEEEEEeeccC Q lcl|NC_019916. 203 VNIDEN-GIVRFRDIIITKVPEKYMQ-G----------------------KAIMFVPDNIGRAF---TGIVTTRTIESED 255 (286) Q Consensus 203 ~NiD~n-gi~~fKgf~l~e~p~~y~q-g----------------------~~~ifs~dnIg~af---~GI~taRtieSED 255 (286) -.+..- +|.+.-||.|-+.+.--.. | ..++|.|+-+|.+= .=|+++|..-.|+ T Consensus 225 ~~~~~g~~i~~i~G~~V~~Sn~lp~~~g~~~~~~~~~~~~n~~~~~~~~~~~~~~h~~a~~~v~~~~~~~~~t~~~~~~~ 304 (332) T protein:vir:78 225 GDMNSGKGLYSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQ 304 (332) T ss_pred cceecceeeeEEeeeEEEecCccccCcccccccccccccccccccccccceEEeecccceeeeeeeccchhhhhcccchh Confidence 445443 4789999999886533111 0 13577777655431 2244455555566 Q ss_pred ccceeeeecccccccCCCcCcceEEEEe Q lcl|NC_019916. 256 FDGVALQGAGKAGSFILDDNKAAIFSAT 283 (286) Q Consensus 256 FdGVaLQgAgK~G~~IlddNKkAI~k~t 283 (286) .-+-.+-|---||-=++.-...+.+++. T Consensus 305 ~~~d~i~~~~~~G~~v~rPe~~v~l~~a 332 (332) T protein:vir:78 305 YQGDLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) T ss_pred hhHhhhhhhhhhcCceecccceEEEeeC Confidence 6555555555567667777777777766 No 21 >protein:vir:93742 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240459;genbank:gi:66396126;genbank:GeneID:5133511 Probab=97.39 E-value=7.7e-05 Score=43.12 Aligned_cols=258 Identities=12% Similarity=0.090 Sum_probs=145.7 Q ss_pred CCCCcccceeeeechhHHHHHHHHHhhhhhhhhhhcccchhcCCcccceeEEEeecccceEeecccCCCcceeecCcCCc Q lcl|NC_019916. 1 MATNNNNLAARTYTKQFAQLMQTVFGAQSVFGPTFGDLQALDGVQNNATAFSVKTNNVPVVVGEYSQDEAVAFGAGTAKS 80 (286) Q Consensus 1 M~t~nnn~a~r~Y~kq~~~ll~~vf~~qa~F~~~fgglQ~lDGVqnN~taf~vKtnd~pVvvg~Y~td~NvaFGtGTg~s 80 (286) ||.+--.++-=+--..|..++..=+.++..|.+..=-.-.+.|.. -.|.=.-|.+.++ -..+|.-++..+.+.-|. T Consensus 1 ma~~~T~~~~~iiPev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~-G~tv~ip~~~~~g-~~~~~~eg~~i~~~~it~-- 76 (274) T protein:vir:93 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQP-GDTLTFPAFVYSG-DAQVVAEGEKIPTDILET-- 76 (274) T ss_pred CCccceehhheechHHHHHHHHHHHHhhhhhcccccccccccCCC-CCEEEEEeeccCC-CcccccCCCccccccccc-- Confidence 887444333333334577777766666655544322222233322 1121111222221 122465555555444432 Q ss_pred ccccceeEEEEecccccccccchhhhccccccccCChhHHHHHHHhhHHHHHHHHHHHHHHHHHhhhhhhh----hhhhh Q lcl|NC_019916. 81 TRFGERTEIVYTDTDVPYEFTWAIHEGLDRFTVNNDLNAAVADRLDLQAQAKVRMFNNALGKKLADASTDL----GAVDD 156 (286) Q Consensus 81 ~RFG~rkEIiy~DtdVpY~~~~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~~~gk~ls~~a~~~----~t~d~ 156 (286) ++.+-.|. . +-+.|.+.. +++.....|+ +++..+-++.++.+.+++.+-..|.++.... .+.|. T Consensus 77 ---~~~~~~i~--~---~~~~~~i~D-~~~~~~~~d~---~~~~~~~~~~~~a~~~d~~~~~~~~~a~~~~~~~~~~~d~ 144 (274) T protein:vir:93 77 ---KKREAKIR--K---IAKGTSITD-EALLSGYGDP---QGEQVRQHGLAHANKVDNDVLEALMGAKLTVNADITKLNG 144 (274) T ss_pred ---ceeEEEee--e---ecccccccH-HHHHhhccch---HHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCHHH Confidence 22222221 1 234566655 5666666665 4555666789999999999888886654332 24455 Q ss_pred HHHHHHHHhhhhhceeeeEEEEEEECchhhhhhhccccccccccce--eeeccCcee-eecCeEEEEchHHhhc-CceEE Q lcl|NC_019916. 157 VNVMFETASAKYTNLEVVVPVRAYVTADVYNAIIDHNLVTSQKGSA--VNIDENGIV-RFRDIIITKVPEKYMQ-GKAIM 232 (286) Q Consensus 157 V~klF~~~~~~yvn~ev~~~~~ayV~~evYNaIvD~~l~Ts~K~Ss--~NiD~ngi~-~fKgf~l~e~p~~y~q-g~~~i 232 (286) +.....++... -..+-.+.|+|++|..|.-.++-...+.|. -++=.+|.+ +|.||.+-+.+. +. +...+ T Consensus 145 i~dA~~~l~d~-----~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~--~p~~t~~l 217 (274) T protein:vir:93 145 LQSAIDKFNDE-----DLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNK--LEAGTAIL 217 (274) T ss_pred HHHHHHHhhhc-----cCCccEEEeCHHHHHHHHhhhhhcccccccccccceeecccceecCeeEEEcCC--CCcceEEE Confidence 55444444332 234668999999999998554333222222 233334443 899999988753 44 88888 Q ss_pred Eeecceeee---ccceEEEEEeeccCccceeeeecccccccCCCcCcceEEEEecCC Q lcl|NC_019916. 233 FVPDNIGRA---FTGIVTTRTIESEDFDGVALQGAGKAGSFILDDNKAAIFSATPKA 286 (286) Q Consensus 233 fs~dnIg~a---f~GI~taRtieSEDFdGVaLQgAgK~G~~IlddNKkAI~k~t~ka 286 (286) |.+..||-. .+-+++.| .++...-.+-|---||..++++++-..+ |.++ T Consensus 218 ~~~gai~~~~~~~~~vE~~R---d~~~~~d~i~~~~~y~~~~~~~~~~v~~--t~~~ 269 (274) T protein:vir:93 218 AKKGAVKLILKRDFFLEVAR---DASTKTTALYSDKHYVAYLYDESKAVKI--TKGS 269 (274) T ss_pred EeCCeEEEEecCCccccccc---chhhcccEEEEEEEEEEEEEcCCceEEE--eeCc Confidence 888888853 12344444 4455667888999999999988875544 4444 No 22 >protein:vir:10450 Length: 344 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848297;genbank:gi:30387487;genbank:GeneID:1733971 Probab=97.36 E-value=2.1e-05 Score=46.23 Aligned_cols=261 Identities=14% Similarity=0.143 Sum_probs=144.8 Q ss_pred CCCC------cccc---------eeeeechhHHHHHHHHHhhhhhhhhhhcccchhcCCcccceeEEEeecccceE---- Q lcl|NC_019916. 1 MATN------NNNL---------AARTYTKQFAQLMQTVFGAQSVFGPTFGDLQALDGVQNNATAFSVKTNNVPVV---- 61 (286) Q Consensus 1 M~t~------nnn~---------a~r~Y~kq~~~ll~~vf~~qa~F~~~fgglQ~lDGVqnN~taf~vKtnd~pVv---- 61 (286) |+.. |++. +.-+|-|+|-+-+-+-|+.++.|++..=- +.+.| - |+--.|.+ T Consensus 1 ma~~~~~~~~n~~~~~~~~~~~~~~al~ie~~~geV~~~f~~~s~~~~~~~~-r~i~~--g-------~s~~~~~iG~~~ 70 (344) T protein:vir:10 1 MANMTGGQQLGTNQGKDVMAAGDKLALFLKVFGGEVLTAFARTSVTTSRHMV-RSISS--G-------KSAQFPVLGRTQ 70 (344) T ss_pred CccccccccCCcccCCccCCccchhHHHHHHHHHHHHHHHHHHhhhccccee-eeecc--c-------ceEEEEeeceeE Confidence 5522 2221 33468899999999999999999976543 22222 1 11222322 Q ss_pred eecccCCCcceeecCcCCcccccceeEEEEecccccccccchhhhccccccccCChhHHHHHHHhhHHHHHHHHHHHHHH Q lcl|NC_019916. 62 VGEYSQDEAVAFGAGTAKSTRFGERTEIVYTDTDVPYEFTWAIHEGLDRFTVNNDLNAAVADRLDLQAQAKVRMFNNALG 141 (286) Q Consensus 62 vg~Y~td~NvaFGtGTg~s~RFG~rkEIiy~DtdVpY~~~~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~~~g 141 (286) ++-|..++... || .+..=.-+-+|-.|+..-+.+.- .-||+.+.+=|+-+..+++ +++|=.|.+|..+. T Consensus 71 ~~~~~~G~~l~---~t--~~~~~~~e~~l~ID~~~y~~~~V---dDiD~~q~~~D~r~~~~~~---~G~aLA~~~D~~i~ 139 (344) T protein:vir:10 71 AAYLAPGENLD---DI--RKDIKHTEKVITIDGLLTADVLI---YDIEDAMNHYDVRSEYTSQ---LGESLAMAADGAVL 139 (344) T ss_pred EEeeecCCCCC---CC--CCCcccceEEEEEcchhhhhhhh---hhHHHHhcCcchHHHHHHH---HHHHHHHHHHHHHH Confidence 22233333221 11 11222223467777766554332 3678888887877666655 56888999998887 Q ss_pred HHHhhhh------------------------hhhh-----hhhh----HHHHHHHHhhhhhceeeeEEEEEEECchhhhh Q lcl|NC_019916. 142 KKLADAS------------------------TDLG-----AVDD----VNVMFETASAKYTNLEVVVPVRAYVTADVYNA 188 (286) Q Consensus 142 k~ls~~a------------------------~~~~-----t~d~----V~klF~~~~~~yvn~ev~~~~~ayV~~evYNa 188 (286) ..|...+ ..++ +.+. +.++-..+.+..|- ...-.++|+|+.|.+ T Consensus 140 ~~la~~a~~~~~~~~~~~g~~~~~~~~~~~~~~~~t~~~~~~~~~~~~i~~a~~~Lde~~VP---~~gR~~vv~P~~y~~ 216 (344) T protein:vir:10 140 AEIAGLCNVESQYNENITGLGTATVIETTQDKTTLTDQVALGKEIIAALTKARAALTKNYVP---SSDRVFYCDPDSYSA 216 (344) T ss_pred HHHHhhhccccccccccccccccceeecccccccccchhhhHHHHHHHHHHHHHHHhhcCCC---ccCCEEEeChHHHHH Confidence 6664311 1111 1122 33344455555543 224578899999999 Q ss_pred hhccccccccccceeeeccCc-eeeecCeEEEEchHHhhc----------Cc--------------------eEEEeecc Q lcl|NC_019916. 189 IIDHNLVTSQKGSAVNIDENG-IVRFRDIIITKVPEKYMQ----------GK--------------------AIMFVPDN 237 (286) Q Consensus 189 IvD~~l~Ts~K~Ss~NiD~ng-i~~fKgf~l~e~p~~y~q----------g~--------------------~~ifs~dn 237 (286) |++++..+.....+-+.-.+| +.+..||.|-+.|.--.. |. .++|.|+- T Consensus 217 Ll~~~~~~~~~~~~~~~~~~G~V~~v~G~~V~~Sn~lp~~~~~~~~~~~tg~~~~~~~~~~~~~~~~~s~~~~l~~h~~A 296 (344) T protein:vir:10 217 ILAALMPNAANYAALIDPEKGSIRNVMGFEVVEVPHLTAGGAGTSREGTTGQKHAFPATKSGNDKVAKDNVIGLFMHRSA 296 (344) T ss_pred HhhcccccccccccccceeeeEEEEEeceEEEeccccccccCCcccccccCccccccCCcccceeeecceeEEEeechhh Confidence 999988776655555555678 568899999988753211 10 01233333 Q ss_pred eeeeccceEEEEEeeccCccceeeeecccccccCCCcCcceEEEEecC Q lcl|NC_019916. 238 IGRAFTGIVTTRTIESEDFDGVALQGAGKAGSFILDDNKAAIFSATPK 285 (286) Q Consensus 238 Ig~af~GI~taRtieSEDFdGVaLQgAgK~G~~IlddNKkAI~k~t~k 285 (286) +|..=.=--+.+.-.+|..-|=.+-|---||-=++.-.-.+.++.+.| T Consensus 297 ~~~v~~~~~~~e~~r~~~~~~d~i~g~~~~G~~vlRPe~a~~v~~~~~ 344 (344) T protein:vir:10 297 VGTVKLRDLALERARRANFQADQIIAKYAMGHGGLRPEAAGAVVFKTK 344 (344) T ss_pred hhhhhhccceeecccchhHHHHHHHHHhhcccceecccceEEEEeecC Confidence 322111000112222444444444444456666777777777888888 No 23 >protein:vir:80213 Length: 334 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522884;genbank:gi:158345177;genbank:GeneID:5687476 Probab=97.19 E-value=0.00014 Score=41.77 Aligned_cols=238 Identities=14% Similarity=0.131 Sum_probs=137.3 Q ss_pred CCCCcccc-----------eeeeechhHHHHHHHHHhhhhhhhhhhcccchhcCCcccceeEEEeecccceEeecccCCC Q lcl|NC_019916. 1 MATNNNNL-----------AARTYTKQFAQLMQTVFGAQSVFGPTFGDLQALDGVQNNATAFSVKTNNVPVVVGEYSQDE 69 (286) Q Consensus 1 M~t~nnn~-----------a~r~Y~kq~~~ll~~vf~~qa~F~~~fgglQ~lDGVqnN~taf~vKtnd~pVvvg~Y~td~ 69 (286) |++-++|. -.-+|-|+|-+.+.+-|+.++.|++..-= +.+.| -|.--|. .+ -.+.++.|..++ T Consensus 1 m~~~~~~~~t~~~~~~~~~~~~l~le~~~geV~~af~~~s~~~~~~~~-r~i~~--G~s~~~~-~i--G~~~~~~~~~g~ 74 (334) T protein:vir:80 1 MTYPAANTHTRPGWGGANSDVSLHIEEHLGLVDASFMYSSKFASWMNV-RSLRG--TNQLRVD-RV--GASTIAGRKAGE 74 (334) T ss_pred CCCCcCCCccccccccccchheehhhhhhhHHHHHHHHhhhhhcccee-eeccc--cceEEEe-ee--cceeeeeecCCC Confidence 77764332 13578899999999999999999976533 33222 1111111 11 123345566666 Q ss_pred cceeecCcCCcccccceeEEEEecccccccccchhhhccccccccCChhHHHHHHHhhHHHHHHHHHHHHHHHHHhhhhh Q lcl|NC_019916. 70 AVAFGAGTAKSTRFGERTEIVYTDTDVPYEFTWAIHEGLDRFTVNNDLNAAVADRLDLQAQAKVRMFNNALGKKLADAST 149 (286) Q Consensus 70 NvaFGtGTg~s~RFG~rkEIiy~DtdVpY~~~~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~~~gk~ls~~a~ 149 (286) .. ++ +|.-.-+-+|-.|+..-+... =+=||+.+.+=|+-+.++.. +++|-.|++|..+..-|..++. T Consensus 75 ~l---~~----~~~~~~~~~l~ID~~l~~~~~---VddiD~~q~~~D~rse~~~~---~G~aLA~~~D~~~~~~l~kaa~ 141 (334) T protein:vir:80 75 EL---VV----QKNVSDKLNLTVDTVLYARHF---FDKFDEWTSNLDVRKETARE---DGIALARQYDQACIIQLQKCGD 141 (334) T ss_pred CC---CC----CCcccCceEEEEeeeeehhhh---HhhHHHHhcCcchHHHHHHH---HHHHHHHHHHHHHHHHHHHhhh Confidence 65 22 233334556777776544432 24588888888887777654 6789999999987766644331 Q ss_pred hh---------------------------hhhh----hHHHHHHHHhhhhhceeeeEEEEEEECchhhhhhhcccccccc Q lcl|NC_019916. 150 DL---------------------------GAVD----DVNVMFETASAKYTNLEVVVPVRAYVTADVYNAIIDHNLVTSQ 198 (286) Q Consensus 150 ~~---------------------------~t~d----~V~klF~~~~~~yvn~ev~~~~~ayV~~evYNaIvD~~l~Ts~ 198 (286) .. .+.| -+..++..+.++.+.-+.+.+..++|+|+.|.+|+.++-.... T Consensus 142 ~~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~l~~a~~~a~~~L~e~dvp~~~~~~R~~vv~P~~y~~Ll~~~r~~n~ 221 (334) T protein:vir:80 142 FLAPAHLKPAFHDGILLPSTISGLAADAAADADVLVAAHRQGVEAMVFRDLGDQLMSEGVTLLDPVIFSFLLEHDRLMNV 221 (334) T ss_pred hcccccccccccCCcceeecccccccchhhhHHHHHHHHHHHHHHHHhcCCCCCcCCceEEEeChHHHHHHhcccccccc Confidence 10 1112 2446777888888887777889999999999999999765432 Q ss_pred -cc---ceeeeccCceeeecCeEEEEchHHhhcCceEEEeecceeeeccceEEEEEeeccCccceeeeecccccccCCCc Q lcl|NC_019916. 199 -KG---SAVNIDENGIVRFRDIIITKVPEKYMQGKAIMFVPDNIGRAFTGIVTTRTIESEDFDGVALQGAGKAGSFILDD 274 (286) Q Consensus 199 -K~---Ss~NiD~ngi~~fKgf~l~e~p~~y~qg~~~ifs~dnIg~af~GI~taRtieSEDFdGVaLQgAgK~G~~Ildd 274 (286) -+ +...+-.-.+.++-||.|-+.+.- |.+ .|. .... .+.+..|--|- T Consensus 222 d~~~s~~~~~~~~g~i~~v~G~~V~~Sn~~----------P~~------~~t-------~~~~------g~~~~~~agd~ 272 (334) T protein:vir:80 222 EFGAKEGGNSFVGGRIAMLNGVRVVETPRF----------PQS------AIT-------ANAL------GADFNVTDAEV 272 (334) T ss_pred eeccccccccccceeEEEEeceEEEeecCC----------CCc------ccc-------cccc------ccccccccccc Confidence 12 222334344777778888776431 111 000 0000 12333333333 Q ss_pred C--------cceEEEEecCC Q lcl|NC_019916. 275 N--------KAAIFSATPKA 286 (286) Q Consensus 275 N--------KkAI~k~t~ka 286 (286) . +.|+..+.... T Consensus 273 t~~~~~~~~~~Al~t~~~~~ 292 (334) T protein:vir:80 273 RRKMITFIPSMALISAQVHP 292 (334) T ss_pred cceEEEEEeCceEEEEEEee Confidence 3 34555444433 No 24 >protein:vir:80930 Length: 278 # NCBI annotation: Cps # Family: family:all:522 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468392;genbank:gi:157324966;genbank:GeneID:5601363 Probab=97.17 E-value=0.00011 Score=42.38 Aligned_cols=262 Identities=11% Similarity=0.094 Sum_probs=146.4 Q ss_pred CCCCcccceeeeechh-HHHHHHHHHhhhhhhhhhhcccchhcCCcccceeEEEeecccceEeecccCCCcceeecCcCC Q lcl|NC_019916. 1 MATNNNNLAARTYTKQ-FAQLMQTVFGAQSVFGPTFGDLQALDGVQNNATAFSVKTNNVPVVVGEYSQDEAVAFGAGTAK 79 (286) Q Consensus 1 M~t~nnn~a~r~Y~kq-~~~ll~~vf~~qa~F~~~fgglQ~lDGVqnN~taf~vKtnd~pVvvg~Y~td~NvaFGtGTg~ 79 (286) |+..--- ..-+..|| |..+++.=|.++..|.+..=-.-.+.|..- ++.=.-|.+.+. -...|.-+.......-| T Consensus 1 Ma~~~T~-~~~~iiPev~s~~v~~~~~~~~v~~~~~~~~~~l~g~~G-~tv~ip~~~~~g-~a~~~~~g~~i~~~~lt-- 75 (278) T protein:vir:80 1 MADLTTK-LANLIDPEVMGPMISAKLPKAIKFGKIAPIDNSLEGQPG-SEITVPKYKYIG-DAQDVAEGAAIDYSALE-- 75 (278) T ss_pred CCCccee-hhheecHHHHHHHHHHHHHHhhhhcccceecccccCCCC-CEEEEeeeccCC-cceeecCCCcCcccccc-- Confidence 7631001 12235554 888888888877777554221222233211 111111222111 11235544444444333 Q ss_pred cccccceeEEEEecccccccccchhhhccccccccCChhHHHHHHHhhHHHHHHHHHHHHHHHHHhhhhhhh------hh Q lcl|NC_019916. 80 STRFGERTEIVYTDTDVPYEFTWAIHEGLDRFTVNNDLNAAVADRLDLQAQAKVRMFNNALGKKLADASTDL------GA 153 (286) Q Consensus 80 s~RFG~rkEIiy~DtdVpY~~~~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~~~gk~ls~~a~~~------~t 153 (286) +++.+-.| +. +-+.|.+.. +|+.....|+ +++..+-++.+|.|.+++.+-..|..+.... .+ T Consensus 76 ---~~~~~~~i--~~---~~~a~~v~D-~~~~~~~~d~---~~~~~~~~a~~~a~~~d~~l~~~l~~a~~~~~~~~t~~~ 143 (278) T protein:vir:80 76 ---TESVKHGI--KK---AGKGVKLTD-ESVLSGYGDP---VEEAQKQIRMAIASKVDNDILEEALTTTLEVKGAINIGL 143 (278) T ss_pred ---cceeeEee--eh---hhccccccH-HHHhhccccH---HHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccch Confidence 22222222 11 223455544 5666666665 5667777899999999998888885543221 12 Q ss_pred hhhHHHHHHHHhhhhhceeeeEEEEEEECchhhhhhhccccc--cccccceeeeccCcee-eecCeEEEEchHHhhc-Cc Q lcl|NC_019916. 154 VDDVNVMFETASAKYTNLEVVVPVRAYVTADVYNAIIDHNLV--TSQKGSAVNIDENGIV-RFRDIIITKVPEKYMQ-GK 229 (286) Q Consensus 154 ~d~V~klF~~~~~~yvn~ev~~~~~ayV~~evYNaIvD~~l~--Ts~K~Ss~NiD~ngi~-~fKgf~l~e~p~~y~q-g~ 229 (286) .+....+|..+..+.-...+-.+-++.|+|++|..|.-.++. ++.....-++-.||.+ +|.||.|-+.+. +- |. T Consensus 144 ~~~~~~~~~da~~~l~~~~~~~~~~ivv~p~~~~~L~k~~~~~~~~~~~~g~~~~~~G~ig~~~G~~Vi~s~~--~p~~t 221 (278) T protein:vir:80 144 IDKIENTFTDAPDAIEDESITTTGVLFLNYKDTAKLREEAAGSWTKASQLGDDLLVKGAFGELLGWEIVRTKK--LADGN 221 (278) T ss_pred hhhHHHHHHHHHHhhcccCCCcccEEEECHHHHHHHHhhhhhhccccccccccceeeccceeecceeEEEcCC--CCcce Confidence 345556676665554333333345788999999999755422 2222222244445554 899999987653 33 88 Q ss_pred eEEEeecceee---eccceEEEEEeeccCccceeeeecccccccCCCcCcceEEEEecCC Q lcl|NC_019916. 230 AIMFVPDNIGR---AFTGIVTTRTIESEDFDGVALQGAGKAGSFILDDNKAAIFSATPKA 286 (286) Q Consensus 230 ~~ifs~dnIg~---af~GI~taRtieSEDFdGVaLQgAgK~G~~IlddNKkAI~k~t~ka 286 (286) ..+|.+..||- +.+-+++-| .++.-.-.|.+---||..+++.. +++++|..| T Consensus 222 ~~l~~~gAi~~~~~~~~~vE~~R---d~~~~~d~i~~~~~yg~~v~~~~--~~v~it~~a 276 (278) T protein:vir:80 222 ALAVKAGALKTFLKRNLLAESGR---DMDHKLTKFNADQHYAVALVDET--KAVKVVPVA 276 (278) T ss_pred EEEEeccceeeeecCCccccccc---chhhccceeeeeeEEEEEEEcCc--ceEEEeecc Confidence 88998888873 222344444 34445567888888899888654 577888888 No 25 >protein:vir:8885 Length: 347 # NCBI annotation: major capsid protein A # Family: family:all:975 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813774;genbank:gi:29366729;genbank:GeneID:1258837 Probab=97.12 E-value=2e-05 Score=46.39 Aligned_cols=260 Identities=17% Similarity=0.224 Sum_probs=139.4 Q ss_pred CC---------CCccc-----ceeeeechhHHHHHHHHHhhhhhhhhhhcccchhcCCcccceeEEEeecccceEeec-- Q lcl|NC_019916. 1 MA---------TNNNN-----LAARTYTKQFAQLMQTVFGAQSVFGPTFGDLQALDGVQNNATAFSVKTNNVPVVVGE-- 64 (286) Q Consensus 1 M~---------t~nnn-----~a~r~Y~kq~~~ll~~vf~~qa~F~~~fgglQ~lDGVqnN~taf~vKtnd~pVvvg~-- 64 (286) |+ |.+.. -++-+|-|+|.+.+.+-|+.++.|++..-- +. +++. |+--.| .||+ T Consensus 1 ~a~~~~~~~~~~~~g~~~~~~d~~al~ie~~~geV~~~f~~~s~~~~~~~~-r~---i~~G------~sv~~~-~iG~~~ 69 (347) T protein:vir:88 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMV-RT---IQNG------KSASFP-VMGRTK 69 (347) T ss_pred CCCcccchhhhccCCCCccccchHHHHHHHHHHHHHHHHHHHhhhhhcccc-cc---ccCc------ceEEEe-eeccee Confidence 43 32211 136789999999999999999999887543 21 1111 222222 2222 Q ss_pred ---ccCCCcceeecCcCCcccccceeEEEEecccccccccchhhhccccccccCChhHHHHHHHhhHHHHHHHHHHHHHH Q lcl|NC_019916. 65 ---YSQDEAVAFGAGTAKSTRFGERTEIVYTDTDVPYEFTWAIHEGLDRFTVNNDLNAAVADRLDLQAQAKVRMFNNALG 141 (286) Q Consensus 65 ---Y~td~NvaFGtGTg~s~RFG~rkEIiy~DtdVpY~~~~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~~~g 141 (286) |..++.. .++.++-...++ +|-.|+..-+.+ .=+-+|+...+-|+-+.+++ .+++|=.|.+|..+- T Consensus 70 ~~~~~~g~~l---~~~~~~~~~~~~--~i~ID~~~y~~~---~Vdd~D~~q~~~D~r~~~~~---~~g~aLA~~~D~~i~ 138 (347) T protein:vir:88 70 GYYLAPGENL---DDKRKDIKHSEK--VIQIDGLLTSDV---LIYDIEDAMNHYDVRAEYSA---QLGEALAIAADGAVL 138 (347) T ss_pred eeeeccccCC---CCCCCCCccceE--EEEEechhhhhh---hhhhHHHHhhcCCchHHHHH---HHHHHHHHHHHHHHH Confidence 3333321 122233233332 233343221111 11467888888787665554 567777888888776 Q ss_pred HHHhhhhhhh------------------h--------------hhhhHHHHHHHHhhhhhceeeeEEEEEEECchhhhhh Q lcl|NC_019916. 142 KKLADASTDL------------------G--------------AVDDVNVMFETASAKYTNLEVVVPVRAYVTADVYNAI 189 (286) Q Consensus 142 k~ls~~a~~~------------------~--------------t~d~V~klF~~~~~~yvn~ev~~~~~ayV~~evYNaI 189 (286) ..|...+... + ..|.+..+...+.+..|.. ...++.|+|+.|..| T Consensus 139 ~~l~~~a~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~---~gR~~vv~P~~y~~L 215 (347) T protein:vir:88 139 AEMAKLCNLPAASNENIAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPA---GDRRFYCAPEDYSAI 215 (347) T ss_pred HHHHHhhccccccccccCCccccccccccccccccchhhhHHHHHHHHHHHHHHHhhcCCCC---CCCEEEeCHHHHHHH Confidence 5553322110 0 0233444555555544422 357889999999999 Q ss_pred hccccccccccce-eeeccCceeeecCeEEEEchHHhhc--Cce--------------------------------EEEe Q lcl|NC_019916. 190 IDHNLVTSQKGSA-VNIDENGIVRFRDIIITKVPEKYMQ--GKA--------------------------------IMFV 234 (286) Q Consensus 190 vD~~l~Ts~K~Ss-~NiD~ngi~~fKgf~l~e~p~~y~q--g~~--------------------------------~ifs 234 (286) ++++-.+++--.+ ..+..-++.++-||.|-+.|.-=+. |.. ++|. T Consensus 216 l~~~~~~~~~~~~~~~~~~G~vg~i~G~~V~~s~nlp~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~d~~~~~~l~~~ 295 (347) T protein:vir:88 216 LSALMPNAANYAALIDPETGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNH 295 (347) T ss_pred hcchhhhhhhhccccchhcceeeeeccceEEEeecccccccccccccccccccccccccccccccccccccCcEEEEEec Confidence 9987665544333 3344344678999998887644221 100 1222 Q ss_pred ecceeee-ccceEEEEEeeccCccceeeeecccccccCCCcCcceEEEEecCC Q lcl|NC_019916. 235 PDNIGRA-FTGIVTTRTIESEDFDGVALQGAGKAGSFILDDNKAAIFSATPKA 286 (286) Q Consensus 235 ~dnIg~a-f~GI~taRtieSEDFdGVaLQgAgK~G~~IlddNKkAI~k~t~ka 286 (286) +.-+|.. -..+ +.+.-..++.-+-.+-|---||-=++...-.+.++.++.| T Consensus 296 ~~a~g~v~~~d~-~~e~~r~~~~~~d~i~~~~~~G~~~~rPe~a~~~~~~~a~ 347 (347) T protein:vir:88 296 RSAVGTVKLKDM-ALERARRPEFQADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) T ss_pred hhhhhheecccc-eeeeeechhhHHHHhhhhhhhcCceeccceEEEEEeCCCC Confidence 2222211 0000 1111223334455555666677778888888888888888 No 26 >protein:vir:2201 Length: 345 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041998;swissprot:sw:p19726;genbank:gi:9627469;goa:P19726;uniprot:P19726;genbank:GeneID:1261026 Probab=96.83 E-value=0.00019 Score=41.00 Aligned_cols=264 Identities=13% Similarity=0.119 Sum_probs=144.9 Q ss_pred CC---------CCcc------cceeeeechhHHHHHHHHHhhhhhhhhhhcccchhcCCcccceeEEEeecccceEeecc Q lcl|NC_019916. 1 MA---------TNNN------NLAARTYTKQFAQLMQTVFGAQSVFGPTFGDLQALDGVQNNATAFSVKTNNVPVVVGEY 65 (286) Q Consensus 1 M~---------t~nn------n~a~r~Y~kq~~~ll~~vf~~qa~F~~~fgglQ~lDGVqnN~taf~vKtnd~pVvvg~Y 65 (286) |+ ++++ +-+.-+|-|+|-+.+-+-|+.++.|++..=- +.+.| -|.-.|. .+. .+-+..| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~al~le~f~geV~~~f~~~s~~~~~~~~-r~i~~--gks~~~~-~iG--~~~~~~~ 74 (345) T protein:vir:22 1 MASMTGGQQMGTNQGKGVVAAGDKLALFLKVFGGEVLTAFARTSVTTSRHMV-RSISS--GKSAQFP-VLG--RTQAAYL 74 (345) T ss_pred CcccccchhcccccccccccCCchhHHHHHHHhHHHHHHHHHHhhhccccee-eeccc--cceEEEe-eec--ceEEEee Confidence 22 2111 1234688999999999999999999976543 33222 1211111 000 1112224 Q ss_pred cCCCcceeecCcCCcccccceeEEEEecccccccccchhhhccccccccCChhHHHHHHHhhHHHHHHHHHHHHHHHHHh Q lcl|NC_019916. 66 SQDEAVAFGAGTAKSTRFGERTEIVYTDTDVPYEFTWAIHEGLDRFTVNNDLNAAVADRLDLQAQAKVRMFNNALGKKLA 145 (286) Q Consensus 66 ~td~NvaFGtGTg~s~RFG~rkEIiy~DtdVpY~~~~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~~~gk~ls 145 (286) ..++... ++.++-.. -+-+|-.|+..-+.+.. .-||+.+.+=|+-+..+++ +++|-.|.+|+.+-..|. T Consensus 75 ~~G~~l~---~~~~~~~~--~e~~ltID~~~y~~~~V---ddiD~~q~~~D~r~~~s~~---~G~aLA~~~D~~i~~~l~ 143 (345) T protein:vir:22 75 APGENLD---DKRKDIKH--TEKVITIDGLLTADVLI---YDIEDAMNHYDVRSEYTSQ---LGESLAMAADGAVLAEIA 143 (345) T ss_pred ecCCCCC---CCCCCccc--ceEEEEecchhhhhhhH---hhHHHHhcCchhHHHHHHH---HHHHHHHHHHHHHHHHHH Confidence 4443321 11111111 34568888877665433 3688888888876665554 678888999987765553 Q ss_pred hhhh------------------------h----hh-hhhhHH----HHHHHHhhhhhceeeeEEEEEEECchhhhhhhcc Q lcl|NC_019916. 146 DAST------------------------D----LG-AVDDVN----VMFETASAKYTNLEVVVPVRAYVTADVYNAIIDH 192 (286) Q Consensus 146 ~~a~------------------------~----~~-t~d~V~----klF~~~~~~yvn~ev~~~~~ayV~~evYNaIvD~ 192 (286) ..+. . .. +.+.+. .+-..+.++.|.. ..-.++|+|+.|.+|+++ T Consensus 144 k~a~~~~~~~~~~~~~~~~~~~~~~~~g~~~t~~~~~~~~~~~ai~~a~~~Lde~~VP~---~~R~~vv~P~~y~~Ll~~ 220 (345) T protein:vir:22 144 GLCNVESKYNENIEGLGTATVIETTQNKAALTDQVALGKEIIAALTKARAALTKNYVPA---ADRVFYCDPDSYSAILAA 220 (345) T ss_pred HhhcccccccccccccccccccccccccccccccccCHHHHHHHHHHHHHHhhhcCCCc---cCCEEEeChHHHHHHhcc Confidence 2111 0 00 112233 3334445555443 246799999999999999 Q ss_pred ccccccccceeeeccCc-eeeecCeEEEEchHHhhc--C-----------------------------ceEEEeecceee Q lcl|NC_019916. 193 NLVTSQKGSAVNIDENG-IVRFRDIIITKVPEKYMQ--G-----------------------------KAIMFVPDNIGR 240 (286) Q Consensus 193 ~l~Ts~K~Ss~NiD~ng-i~~fKgf~l~e~p~~y~q--g-----------------------------~~~ifs~dnIg~ 240 (286) +..+..--...+...+| +.+.-||.|-|.|.--.. + ..++|.|+-+|. T Consensus 221 ~~~~~~~~~~~~~~~~G~V~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~l~~h~~A~~~ 300 (345) T protein:vir:22 221 LMPNAANYAALIDPEKGSIRNVMGFEVVEVPHLTAGGAGTAREGTTGQKHVFPANKGEGNVKVAKDNVIGLFMHRSAVGT 300 (345) T ss_pred ccccccccccccccccceEEEEeceEEEecccccccccCccccCcccccccccccccceeeeeccCceEEEEEehhheee Confidence 98766555555556788 789999999997632211 0 113555554441 Q ss_pred eccceE-EEEEeeccCccceeeeecccccccCCCcCcceEEEEecC Q lcl|NC_019916. 241 AFTGIV-TTRTIESEDFDGVALQGAGKAGSFILDDNKAAIFSATPK 285 (286) Q Consensus 241 af~GI~-taRtieSEDFdGVaLQgAgK~G~~IlddNKkAI~k~t~k 285 (286) .= =|. +.+.-.+|+.-+=.+-|---||-=++.-.-.+.++...+ T Consensus 301 v~-~~~~~~e~~r~~~~~~d~I~~~~a~G~~vlRPeaa~~i~~~~~ 345 (345) T protein:vir:22 301 VK-LRDLALERARRANFQADQIIAKYAMGHGGLRPEAAGAVVFKVE 345 (345) T ss_pred ee-eecceeeeeechhHHHHHHHHHHhcCCcccccceeEEEEEeeC Confidence 11 011 112222444444444444455655666666666666666 No 27 >protein:vir:96833 Length: 275 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240157;genbank:gi:66395822;genbank:GeneID:5133174 Probab=96.78 E-value=0.00035 Score=39.55 Aligned_cols=260 Identities=14% Similarity=0.089 Sum_probs=143.8 Q ss_pred CCCCcccceeeeech-hHHHHHHHHHhhhhhhhhhhcccchhcCCcccceeEEEeecccceEeecccCCCcceeecCcCC Q lcl|NC_019916. 1 MATNNNNLAARTYTK-QFAQLMQTVFGAQSVFGPTFGDLQALDGVQNNATAFSVKTNNVPVVVGEYSQDEAVAFGAGTAK 79 (286) Q Consensus 1 M~t~nnn~a~r~Y~k-q~~~ll~~vf~~qa~F~~~fgglQ~lDGVqnN~taf~vKtnd~pVvvg~Y~td~NvaFGtGTg~ 79 (286) |+-.|.-.-.-+-.| -|..+++.-+.++..|.+..--.-.+.|..-+ +.=.-|.+.++ =..+|.-++..+.+.-|.. T Consensus 1 ~~~~~~T~l~d~i~PEv~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~-tv~iP~~~~ig-~a~~~~~g~~i~~~~lt~~ 78 (275) T protein:vir:96 1 MALENMTKLANMVNPEVLAPMMQAELDKKLKFAQFADIDNTLVGQPGN-TITFPAFVYSG-DAKVVPEGEEIPIDLIETK 78 (275) T ss_pred CCCcccchhhhhhchHHHHHHHHHHHHHhhhhcccceecccccCCCCC-EEEeeeeccCC-ccccccCCCCcchhhcccc Confidence 665554444445545 47777777777777774432112223332211 11111112211 0112444333333332221 Q ss_pred cccccceeEEEEecccccccccchhhhccccccccCChhHHHHHHHhhHHHHHHHHHHHHHHHHHhhhhhh----hhhhh Q lcl|NC_019916. 80 STRFGERTEIVYTDTDVPYEFTWAIHEGLDRFTVNNDLNAAVADRLDLQAQAKVRMFNNALGKKLADASTD----LGAVD 155 (286) Q Consensus 80 s~RFG~rkEIiy~DtdVpY~~~~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~~~gk~ls~~a~~----~~t~d 155 (286) + .+-.| .. +-+.|.+.. +++.....|+ ++++++.++.++.+.++..+-..|.++... ..+.| T Consensus 79 ~-----~~~~i---~~--~~~~~~i~D-~~~~~~~~d~---~~~~~~~~a~~~a~~~d~~ll~~l~~a~~~~~~~~~~~d 144 (275) T protein:vir:96 79 K-----RQATI---RK--IGKGTVLTD-EALLSGYGDP---KGEAVRQHGLAIANKVDNDVLEALQGATLKVEADITKLA 144 (275) T ss_pred e-----eeEEe---eh--hcccccccH-HHHHhhccch---HHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCHH Confidence 1 11111 11 233454443 4555555555 777888899999999999887777554322 22445 Q ss_pred hHHHHHHHHhhhhhceeeeEEEEEEECchhhhhhhccccc--cccccceeeeccCc-eeeecCeEEEEchHHhhc-CceE Q lcl|NC_019916. 156 DVNVMFETASAKYTNLEVVVPVRAYVTADVYNAIIDHNLV--TSQKGSAVNIDENG-IVRFRDIIITKVPEKYMQ-GKAI 231 (286) Q Consensus 156 ~V~klF~~~~~~yvn~ev~~~~~ayV~~evYNaIvD~~l~--Ts~K~Ss~NiD~ng-i~~fKgf~l~e~p~~y~q-g~~~ 231 (286) .+...-..+.. +-..+-.+.|+|++|..|.-+++- +.+..+.-++=.|| |-+|.|+.+-+... +- |... T Consensus 145 ~i~dA~~~lgd-----~~~~~~~ivv~p~~~~~L~k~~~~~f~~~~~~g~~~~~~G~ig~~~G~~Vi~s~~--~p~~t~~ 217 (275) T protein:vir:96 145 GLQTAIDKFND-----EDLEPMVLFVNPLDAGKLRASATDNFTRATLLGDNVIVKGAFGEALGAIIVRSNK--IKEGEAI 217 (275) T ss_pred HHHHHHHHhcc-----ccCCccEEEeCHHHHHHHHhcccccccccccccccceeccccceecCeeEEEeCC--CCcceEE Confidence 54433333322 223456799999999999666432 22333333444466 56899999977652 43 7788 Q ss_pred EEeecceeee---ccceEEEEEeeccCccceeeeecccccccCCCcCcceEEEEecCC Q lcl|NC_019916. 232 MFVPDNIGRA---FTGIVTTRTIESEDFDGVALQGAGKAGSFILDDNKAAIFSATPKA 286 (286) Q Consensus 232 ifs~dnIg~a---f~GI~taRtieSEDFdGVaLQgAgK~G~~IlddNKkAI~k~t~ka 286 (286) +|.+..+|.. -+-+|+-|-+++ ---.+.+---||..++++.|-+.++.+|.- T Consensus 218 i~~~gA~~~~~~~~~~vE~~Rd~~~---~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~ 272 (275) T protein:vir:96 218 LAKRGAVKLITKRDFFLETERHASH---KSTALFSDKHYVAYLYDESKVVKITKSASG 272 (275) T ss_pred EEeccceeeeecCCcccccccchhh---cCcEEEEeEEEEEEEEcCccEEEEEecccc Confidence 8887777652 233566665444 346788888899999988877777666665 No 28 >protein:vir:6324 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877471;genbank:gi:33300843;uniprot:Q7Y2D3;genbank:GeneID:1482613 Probab=96.77 E-value=0.00029 Score=39.96 Aligned_cols=241 Identities=17% Similarity=0.233 Sum_probs=135.4 Q ss_pred CCCCcc---------cceeeeechhHHHHHHHHHhhhhhhhhhhcccchhcCCcccceeEEEeecccceEeec-----cc Q lcl|NC_019916. 1 MATNNN---------NLAARTYTKQFAQLMQTVFGAQSVFGPTFGDLQALDGVQNNATAFSVKTNNVPVVVGE-----YS 66 (286) Q Consensus 1 M~t~nn---------n~a~r~Y~kq~~~ll~~vf~~qa~F~~~fgglQ~lDGVqnN~taf~vKtnd~pVvvg~-----Y~ 66 (286) |.+-|+ +--+-+|-|+|-+.+.+-|+.++.|++..-- +.+.| - |+--.|.+ |+ +. T Consensus 1 ms~~~~~tr~~~~~s~~d~al~le~f~geV~~af~~~s~~~~~~~~-rti~~--g-------~s~~~~~i-G~~~~~~~~ 69 (335) T protein:vir:63 1 MSFLNDLTRPNYAGKNADVDIHLEEHLGIVDKHFAYTSKFAPLMNI-RDLRG--S-------NVVRLDRL-GNVEAKGRR 69 (335) T ss_pred CCCcccchhhhcccccchhheehhhhhhhHHHHHHhhhhhccccce-eeecc--c-------eeEEEeee-eeeeeeccc Confidence 766542 1113467799999999999999999976543 33222 1 22233433 54 44 Q ss_pred CCCcceeecCcCCcccccceeEEEEecccccccccchhhhccccccccCChhHHHHHHHhhHHHHHHHHHHHHHHHHHhh Q lcl|NC_019916. 67 QDEAVAFGAGTAKSTRFGERTEIVYTDTDVPYEFTWAIHEGLDRFTVNNDLNAAVADRLDLQAQAKVRMFNNALGKKLAD 146 (286) Q Consensus 67 td~NvaFGtGTg~s~RFG~rkEIiy~DtdVpY~~~~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~~~gk~ls~ 146 (286) .+++. .++|.-.-|-+|-.|+-+ |.--|- .=||+.+-.=|+-+.++. -+++|-.|++|..+-..|.. T Consensus 70 pG~~l-------~~~~~~~~k~~itVD~ll-~a~~~I--~dlDe~~~~yDvRse~s~---e~G~aLA~~~D~~~~~~i~~ 136 (335) T protein:vir:63 70 AGEEL-------ERSRVVNDKWNLTVDTLL-YLRHQF--DHQDEWTQSFDMRKEVAE---LDGQELARKFDQACLIQVIK 136 (335) T ss_pred CCcCc-------CCCCccccceEEEeccee-echhhh--hhHHHHhcCchhHHHHHH---HHHHHHHHHHHHHHHHHHHh Confidence 44444 223444445588888876 333332 336777766676655554 56788999999988666644 Q ss_pred hhhh-----h------h--------------hhhh----HHHHHHHHhhhhhceeeeEEEEEEECchhhhhhhccccccc Q lcl|NC_019916. 147 ASTD-----L------G--------------AVDD----VNVMFETASAKYTNLEVVVPVRAYVTADVYNAIIDHNLVTS 197 (286) Q Consensus 147 ~a~~-----~------~--------------t~d~----V~klF~~~~~~yvn~ev~~~~~ayV~~evYNaIvD~~l~Ts 197 (286) +|.. + + ..+. +..++.++.+++|.-+...+..++|+|+.|.+|++++-.-. T Consensus 137 aa~~~a~~~~~~~~~~G~~~~~~~tg~~~~~~~~~l~~a~~~a~~~L~e~dVP~~~~~dr~~vv~P~~y~~Ll~~~~l~n 216 (335) T protein:vir:63 137 AAAMDAPVDLEDAFSPGVLEKLDLTGLTAKQAADKIVRMHRRVVETFIDRDLGDAVYSEGLTPMSPRVFSLLLEHDKLMN 216 (335) T ss_pred hccccCccccCCCcCCCcceeeeeccCcccccHHHHHHHHHHHHHHHHhccCCCcccCceEEEeChHHHHHHhccccccc Confidence 4421 0 0 1222 44566888888888888888999999999999999864322 Q ss_pred cc--cceee--eccCceeeecCeEEEEchHHhhcCceEEEeecceeeeccceEEEEEeeccCccceeeeecccccccCCC Q lcl|NC_019916. 198 QK--GSAVN--IDENGIVRFRDIIITKVPEKYMQGKAIMFVPDNIGRAFTGIVTTRTIESEDFDGVALQGAGKAGSFILD 273 (286) Q Consensus 198 ~K--~Ss~N--iD~ngi~~fKgf~l~e~p~~y~qg~~~ifs~dnIg~af~GI~taRtieSEDFdGVaLQgAgK~G~~Ild 273 (286) .- +|... .=.-++++.-||.|-+.|.-= ++. -+..-+|.+|-++. .||. -..|.|. T Consensus 217 ~~~~~s~~~~~~~~g~v~~v~Gv~V~~sn~lP-~~~---~t~~~lg~a~n~~~-------~d~~-------~~~~~~~-- 276 (335) T protein:vir:63 217 VEYQATGATNDYVKSRVAILNGVKVLETPRFA-TKA---IAAHPLGRHFNVSA-------EESE-------RQIALFL-- 276 (335) T ss_pred cccccccccccccCceeEEeeceEEEeeccCC-CCC---cccccccccCCccc-------cccc-------eeEEEEE-- Confidence 11 11111 112247888888888865210 111 01111222332211 1221 1112222 Q ss_pred cCcceEEEEecCC Q lcl|NC_019916. 274 DNKAAIFSATPKA 286 (286) Q Consensus 274 dNKkAI~k~t~ka 286 (286) .++|+..+..+. T Consensus 277 -~~~Al~t~~~~~ 288 (335) T protein:vir:63 277 -PSKTLITAQVAP 288 (335) T ss_pred -ecceEEEEEEee Confidence 244666665554 No 29 >protein:vir:97433 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240749;genbank:gi:66396420;genbank:GeneID:5133789 Probab=96.45 E-value=0.00061 Score=38.19 Aligned_cols=260 Identities=12% Similarity=0.089 Sum_probs=143.2 Q ss_pred CCCCcccceeeeechhHHHHHHHHHhhhhhhhhhhcccchhcCCcccceeEEEeecccceEeecccCCCcceeecCcCCc Q lcl|NC_019916. 1 MATNNNNLAARTYTKQFAQLMQTVFGAQSVFGPTFGDLQALDGVQNNATAFSVKTNNVPVVVGEYSQDEAVAFGAGTAKS 80 (286) Q Consensus 1 M~t~nnn~a~r~Y~kq~~~ll~~vf~~qa~F~~~fgglQ~lDGVqnN~taf~vKtnd~pVvvg~Y~td~NvaFGtGTg~s 80 (286) |+...--++==+--..|..++..=+.++-.|.+..--.-.+.|..-+ |.=.-+.+.+. =..+|.-++..+.+.-| T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~-tv~iP~~~~~g-~a~~~~~g~~i~~~~lt--- 75 (274) T protein:vir:97 1 MPQGLTKTSDQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGD-TLTFPAFVYSG-DAQVVAEGEKIPTDILE--- 75 (274) T ss_pred CCccceehhheechHHHHHHHHHhhhhhhhhcccceecccccCCCCC-EEEEeeecCCC-ccccccCCCcccccccc--- Confidence 87643333322323356677766666555554432222233332211 11111111110 11235444434333332 Q ss_pred ccccceeEEEEecccccccccchhhhccccccccCChhHHHHHHHhhHHHHHHHHHHHHHHHHHhhhhhhh----hhhhh Q lcl|NC_019916. 81 TRFGERTEIVYTDTDVPYEFTWAIHEGLDRFTVNNDLNAAVADRLDLQAQAKVRMFNNALGKKLADASTDL----GAVDD 156 (286) Q Consensus 81 ~RFG~rkEIiy~DtdVpY~~~~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~~~gk~ls~~a~~~----~t~d~ 156 (286) .++.+-.|.. +-+.|.+.. +++..-..|+ +++.++.++.++.+.++..+-..|.++.... .+.|. T Consensus 76 --~~~~~~~i~~-----~~~~~~i~D-~~~~~~~~dp---~~~~~~~~a~a~a~~vd~~~~~~l~~a~~~~~~~~~~~d~ 144 (274) T protein:vir:97 76 --TKKREAKIRK-----IAKGTSITD-EALLSGYGDP---QGEQVRQHGLAHANKVDNDVLEALMGAKLTVNADITKLNG 144 (274) T ss_pred --cceeEEEeee-----ecceecccH-HHHHhccchH---HHHHHHHHHHHHHHHHHHHHHHHHhccCccccccccCHHH Confidence 2222222211 234566665 5666666665 6777888899999999999998886655332 23454 Q ss_pred HHHHHHHHhhhhhceeeeEEEEEEECchhhhhhhccccccccccce--eeeccCcee-eecCeEEEEchHHhhc-CceEE Q lcl|NC_019916. 157 VNVMFETASAKYTNLEVVVPVRAYVTADVYNAIIDHNLVTSQKGSA--VNIDENGIV-RFRDIIITKVPEKYMQ-GKAIM 232 (286) Q Consensus 157 V~klF~~~~~~yvn~ev~~~~~ayV~~evYNaIvD~~l~Ts~K~Ss--~NiD~ngi~-~fKgf~l~e~p~~y~q-g~~~i 232 (286) +...-..+.. +-..+-.+.|+|++|..|.-.++-...+.|. -++=.||.+ +|.||.|-+.+. +. +...+ T Consensus 145 i~dA~~~l~d-----~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~--~p~~t~~l 217 (274) T protein:vir:97 145 LQSAIDKFND-----EDLEPMVLFVNPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNK--LEAGTAIL 217 (274) T ss_pred HHHHHHHhhc-----cCCCceEEEeCHHHHHHHHhhhhhhccccCcccccceeccccceecCeeEEEcCC--CCcceEEE Confidence 4433333322 2235668999999999998654333222222 233345544 899999987653 44 88888 Q ss_pred Eeecceeee---ccceEEEEEeeccCccceeeeecccccccCCCcCcceEEEEecCC Q lcl|NC_019916. 233 FVPDNIGRA---FTGIVTTRTIESEDFDGVALQGAGKAGSFILDDNKAAIFSATPKA 286 (286) Q Consensus 233 fs~dnIg~a---f~GI~taRtieSEDFdGVaLQgAgK~G~~IlddNKkAI~k~t~ka 286 (286) |.+..+|.. -+-+++.|- ++.---.|-+-.-||..++++.|-+.++-+-.. T Consensus 218 ~~~gA~~~~~~~~~~vE~~Rd---~~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~ 271 (274) T protein:vir:97 218 AKKGAVKLILKRDFFLEVARD---ASTKTTALYSDKHYVAYLYDESKAVKITKGSGS 271 (274) T ss_pred EeCcceEeeecCCceeccccc---hhhcccEEEEEEEEEEEEEcCCceEEEecCccc Confidence 888887742 223555553 334456788888999999988876665532222 No 30 >protein:vir:94494 Length: 274 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240676;genbank:gi:66396348;genbank:GeneID:5133758 Probab=96.45 E-value=0.00061 Score=38.19 Aligned_cols=260 Identities=12% Similarity=0.089 Sum_probs=143.2 Q ss_pred CCCCcccceeeeechhHHHHHHHHHhhhhhhhhhhcccchhcCCcccceeEEEeecccceEeecccCCCcceeecCcCCc Q lcl|NC_019916. 1 MATNNNNLAARTYTKQFAQLMQTVFGAQSVFGPTFGDLQALDGVQNNATAFSVKTNNVPVVVGEYSQDEAVAFGAGTAKS 80 (286) Q Consensus 1 M~t~nnn~a~r~Y~kq~~~ll~~vf~~qa~F~~~fgglQ~lDGVqnN~taf~vKtnd~pVvvg~Y~td~NvaFGtGTg~s 80 (286) |+...--++==+--..|..++..=+.++-.|.+..--.-.+.|..-+ |.=.-+.+.+. =..+|.-++..+.+.-| T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~-tv~iP~~~~~g-~a~~~~~g~~i~~~~lt--- 75 (274) T protein:vir:94 1 MPQGLTKTSDQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGD-TLTFPAFVYSG-DAQVVAEGEKIPTDILE--- 75 (274) T ss_pred CCccceehhheechHHHHHHHHHhhhhhhhhcccceecccccCCCCC-EEEEeeecCCC-ccccccCCCcccccccc--- Confidence 87643333322323356677766666555554432222233332211 11111111110 11235444434333332 Q ss_pred ccccceeEEEEecccccccccchhhhccccccccCChhHHHHHHHhhHHHHHHHHHHHHHHHHHhhhhhhh----hhhhh Q lcl|NC_019916. 81 TRFGERTEIVYTDTDVPYEFTWAIHEGLDRFTVNNDLNAAVADRLDLQAQAKVRMFNNALGKKLADASTDL----GAVDD 156 (286) Q Consensus 81 ~RFG~rkEIiy~DtdVpY~~~~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~~~gk~ls~~a~~~----~t~d~ 156 (286) .++.+-.|.. +-+.|.+.. +++..-..|+ +++.++.++.++.+.++..+-..|.++.... .+.|. T Consensus 76 --~~~~~~~i~~-----~~~~~~i~D-~~~~~~~~dp---~~~~~~~~a~a~a~~vd~~~~~~l~~a~~~~~~~~~~~d~ 144 (274) T protein:vir:94 76 --TKKREAKIRK-----IAKGTSITD-EALLSGYGDP---QGEQVRQHGLAHANKVDNDVLEALMGAKLTVNADITKLNG 144 (274) T ss_pred --cceeEEEeee-----ecceecccH-HHHHhccchH---HHHHHHHHHHHHHHHHHHHHHHHHhccCccccccccCHHH Confidence 2222222211 234566665 5666666665 6777888899999999999998886655332 23454 Q ss_pred HHHHHHHHhhhhhceeeeEEEEEEECchhhhhhhccccccccccce--eeeccCcee-eecCeEEEEchHHhhc-CceEE Q lcl|NC_019916. 157 VNVMFETASAKYTNLEVVVPVRAYVTADVYNAIIDHNLVTSQKGSA--VNIDENGIV-RFRDIIITKVPEKYMQ-GKAIM 232 (286) Q Consensus 157 V~klF~~~~~~yvn~ev~~~~~ayV~~evYNaIvD~~l~Ts~K~Ss--~NiD~ngi~-~fKgf~l~e~p~~y~q-g~~~i 232 (286) +...-..+.. +-..+-.+.|+|++|..|.-.++-...+.|. -++=.||.+ +|.||.|-+.+. +. +...+ T Consensus 145 i~dA~~~l~d-----~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~--~p~~t~~l 217 (274) T protein:vir:94 145 LQSAIDKFND-----EDLEPMVLFVNPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNK--LEAGTAIL 217 (274) T ss_pred HHHHHHHhhc-----cCCCceEEEeCHHHHHHHHhhhhhhccccCcccccceeccccceecCeeEEEcCC--CCcceEEE Confidence 4433333322 2235668999999999998654333222222 233345544 899999987653 44 88888 Q ss_pred Eeecceeee---ccceEEEEEeeccCccceeeeecccccccCCCcCcceEEEEecCC Q lcl|NC_019916. 233 FVPDNIGRA---FTGIVTTRTIESEDFDGVALQGAGKAGSFILDDNKAAIFSATPKA 286 (286) Q Consensus 233 fs~dnIg~a---f~GI~taRtieSEDFdGVaLQgAgK~G~~IlddNKkAI~k~t~ka 286 (286) |.+..+|.. -+-+++.|- ++.---.|-+-.-||..++++.|-+.++-+-.. T Consensus 218 ~~~gA~~~~~~~~~~vE~~Rd---~~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~ 271 (274) T protein:vir:94 218 AKKGAVKLILKRDFFLEVARD---ASTKTTALYSDKHYVAYLYDESKAVKITKGSGS 271 (274) T ss_pred EeCcceEeeecCCceeccccc---hhhcccEEEEEEEEEEEEEcCCceEEEecCccc Confidence 888887742 223555553 334456788888999999988876665532222 No 31 >protein:vir:96123 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240078;genbank:gi:66395742;genbank:GeneID:5133103 Probab=96.23 E-value=0.00059 Score=38.30 Aligned_cols=254 Identities=15% Similarity=0.145 Sum_probs=137.6 Q ss_pred CCCCcccceeeeech-hHHHHHHHHHhhhhhhhhhhcccchhcCCcccceeEEEeecccceE--ee---cccCCCcceee Q lcl|NC_019916. 1 MATNNNNLAARTYTK-QFAQLMQTVFGAQSVFGPTFGDLQALDGVQNNATAFSVKTNNVPVV--VG---EYSQDEAVAFG 74 (286) Q Consensus 1 M~t~nnn~a~r~Y~k-q~~~ll~~vf~~qa~F~~~fgglQ~lDGVqnN~taf~vKtnd~pVv--vg---~Y~td~NvaFG 74 (286) ||+.+--++ .++.| .|..++..-+.++..|.+..--.-.+.|..- ++ ++ +|.. +| +|.-++-.+.. T Consensus 1 ma~~~T~~~-d~i~Pev~s~~v~~~~~~~~~~~~~~~~~~~l~g~~G-~t---v~---ip~~~~~g~~~~~~~g~~i~~~ 72 (274) T protein:vir:96 1 MAQGTTKVS-NLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPG-DT---LT---FPAFTYSGDAQVIAEGEKIPVD 72 (274) T ss_pred CCccccchh-hhhhhHHHHHHHHHHHHhhhhhcccccccccccCCCC-CE---EE---EEeeccCCCccccCCCCcCchh Confidence 997664444 56666 4666777777666665442211112222211 11 11 2221 22 35444444444 Q ss_pred cCcCCcccccceeEEEEecccccccccchhhhccccccccCChhHHHHHHHhhHHHHHHHHHHHHHHHHHhhhhhhh--- Q lcl|NC_019916. 75 AGTAKSTRFGERTEIVYTDTDVPYEFTWAIHEGLDRFTVNNDLNAAVADRLDLQAQAKVRMFNNALGKKLADASTDL--- 151 (286) Q Consensus 75 tGTg~s~RFG~rkEIiy~DtdVpY~~~~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~~~gk~ls~~a~~~--- 151 (286) .-|.. +.+-.| +. +-+.|.+.. +++....-|+ +++.++-++.+|.|.+++.+-..|..+.... T Consensus 73 ~it~~-----~~~~~i--~~---~~~~~~i~D-~~~~~~~~d~---~~~~~~~~~~~~a~~~d~~i~~~l~~a~~~~~~~ 138 (274) T protein:vir:96 73 QIGTS-----KREAKV--RK---IGKGTELTD-EAVLSGFGDP---QGEAVRQHGLAIANKVDNDVLEALKGATLTVEAD 138 (274) T ss_pred hcccc-----eeEEEE--Ee---eeceeeecH-HHHHhhcchH---HHHHHHHHHHHHHHHHHHHHHHHHhcCCCCcCcc Confidence 33322 222111 11 223454432 5555554444 5666677889999999999888875543222 Q ss_pred -hhhhhHHHHHHHHhhhhhceeeeEEEEEEECchhhhhhhccccccccccce--eeeccCc-eeeecCeEEEEchHHhhc Q lcl|NC_019916. 152 -GAVDDVNVMFETASAKYTNLEVVVPVRAYVTADVYNAIIDHNLVTSQKGSA--VNIDENG-IVRFRDIIITKVPEKYMQ 227 (286) Q Consensus 152 -~t~d~V~klF~~~~~~yvn~ev~~~~~ayV~~evYNaIvD~~l~Ts~K~Ss--~NiD~ng-i~~fKgf~l~e~p~~y~q 227 (286) .+.|.+.. |...+=+ +-..+-.+.|+|++|..|.-.++-...+.|. -++-.+| +-+|-||.|-+.+. +- T Consensus 139 ~~~~d~i~d----A~~~l~d-~~~~~~~ivv~p~~~~~L~k~~~~~f~~~~~~g~~~~~~g~ig~~~G~~Vi~s~~--~p 211 (274) T protein:vir:96 139 ITKLDGLQT----AIDKFND-EDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNK--LN 211 (274) T ss_pred cccHHHHHH----HHHHhcc-cCCCceEEEeCHHHHHHHHhcccccccccccccccceeecccceecCeeEEEcCC--CC Confidence 23344432 2222212 2234667999999999997765432222222 2333445 66899999876543 43 Q ss_pred -CceEEEeecceeeeccceE-EEEEeeccCccceeeeecccccccCCCcCcceEEEEecCC Q lcl|NC_019916. 228 -GKAIMFVPDNIGRAFTGIV-TTRTIESEDFDGVALQGAGKAGSFILDDNKAAIFSATPKA 286 (286) Q Consensus 228 -g~~~ifs~dnIg~af~GI~-taRtieSEDFdGVaLQgAgK~G~~IlddNKkAI~k~t~ka 286 (286) +...+|.+..+|-. .+.. ..++-..++...-.|-|---||..+++..| ++++|..+ T Consensus 212 ~~t~~l~~~gA~~~~-~~~~~~vE~~Rd~~~~~d~i~~~~~yg~~~~~~~~--vv~~t~~~ 269 (274) T protein:vir:96 212 KGEALLAKKGAVKLI-TKRDFFLEKDRDASRKSTALYSDKHYVAYLYDESK--VVKITKGA 269 (274) T ss_pred cceEEEEeCcceeee-ecCCcccccccchhhcccEEEEeeEEEEEEEcCcc--EEEEEcCc Confidence 77788888777752 2211 223333444456678888889999998765 55566655 No 32 >protein:vir:100057 Length: 375 # NCBI annotation: T7-like capsid protein # Family: family:all:975 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214206;genbank:gi:61806429;genbank:GeneID:3294737 Probab=95.66 E-value=0.00095 Score=37.16 Aligned_cols=253 Identities=15% Similarity=0.082 Sum_probs=126.4 Q ss_pred CCCCccc-----------------ceeeeechhHHHHHHHHHhhhhhhhhhhcccchhcCCcccceeEEEeecccceEee Q lcl|NC_019916. 1 MATNNNN-----------------LAARTYTKQFAQLMQTVFGAQSVFGPTFGDLQALDGVQNNATAFSVKTNNVPVVVG 63 (286) Q Consensus 1 M~t~nnn-----------------~a~r~Y~kq~~~ll~~vf~~qa~F~~~fgglQ~lDGVqnN~taf~vKtnd~pVvvg 63 (286) |+..|+. -+.-+|-|+|.+.+.+-|++++.|++..=- +.+.| .+...+..+ =.+-+. T Consensus 1 ~~~~~~~~~~~~n~~t~~~~~~~~~~~al~le~f~geV~~~f~~~si~~~~~~~-rti~~-Gksv~f~~i----G~~t~~ 74 (375) T protein:vir:10 1 MANANQVALGRSNLSTGTGYGGATDKYALYLKLFSGEMFKGFQHETIARDLVTK-RTLKN-GKSLQFIYT----GRMTSS 74 (375) T ss_pred CccccccccCccccCCccccccccchHHHHHHHHhHHHHHHHHHHHhhhccccc-ccccc-CceEEEEee----eeeEEe Confidence 3332221 244688899999999999999999976543 22222 111111111 112222 Q ss_pred cccCCCcceeecCcCCcccccc--eeE-EEEecccccccccchhhhccccccccCChhHHHHHHHhhHHHHHHHHHHHHH Q lcl|NC_019916. 64 EYSQDEAVAFGAGTAKSTRFGE--RTE-IVYTDTDVPYEFTWAIHEGLDRFTVNNDLNAAVADRLDLQAQAKVRMFNNAL 140 (286) Q Consensus 64 ~Y~td~NvaFGtGTg~s~RFG~--rkE-Iiy~DtdVpY~~~~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~~~ 140 (286) .|..++.. ..+..-. -.| +|-.|+..=+.+. =+-||+.+.+=|+-+.. ..-+++|-.|.+|..+ T Consensus 75 ~~t~G~~i-------~~~~~~d~~~te~~l~ID~~~y~~~~---VdDiD~aqa~~Dlr~e~---s~~~G~aLA~~~D~~i 141 (375) T protein:vir:10 75 FHTPGTPI-------LGNADKAPPVAEKTIVMDDLLISSAF---VYDLDETLAHYELRGEI---SKKIGYALAEKYDRLI 141 (375) T ss_pred eecCCcCc-------CCccccCCCCCceEEEecchhhhhhh---HhhHHHHhcCchhHHHH---HHHHHHHHHHHHHHHH Confidence 34332221 1111100 111 3667766544332 24688888887766554 4557788999999888 Q ss_pred HHHHhhhhhh-------------------------------hhhhhhHHHHHHHHhhhhhceeeeEEEEEEECchhhhhh Q lcl|NC_019916. 141 GKKLADASTD-------------------------------LGAVDDVNVMFETASAKYTNLEVVVPVRAYVTADVYNAI 189 (286) Q Consensus 141 gk~ls~~a~~-------------------------------~~t~d~V~klF~~~~~~yvn~ev~~~~~ayV~~evYNaI 189 (286) -.-|..+|.. ....|.+..+..++.++.|-.+ .-.++|+|+.|.+| T Consensus 142 ~~~l~kaa~~~~p~~~~~~~~~Gg~~i~~~sg~~~~~~~ta~~~~~ai~~a~~~Lde~~VP~~---~R~~vv~P~~y~~L 218 (375) T protein:vir:10 142 FRSITRGARSASPVSATNFVEPGGTQIRVGSGTNESDAFTASALVNAFYDAAAAMDEKGVSSQ---GRCAVLNPRQYYAL 218 (375) T ss_pred HHHHHHhhhhccccccccccccCcceeeeccccccccccCHHHHHHHHHHHHHHHhhcCCCCC---CCEEEeChHHHHHH Confidence 7666443211 1122556677778888777632 46788999999999 Q ss_pred hcc---c-cccccccceeeecc-CceeeecCeEEEEchHHhhc-C-ceE------EEeecceeee--ccceEEEEEeecc Q lcl|NC_019916. 190 IDH---N-LVTSQKGSAVNIDE-NGIVRFRDIIITKVPEKYMQ-G-KAI------MFVPDNIGRA--FTGIVTTRTIESE 254 (286) Q Consensus 190 vD~---~-l~Ts~K~Ss~NiD~-ngi~~fKgf~l~e~p~~y~q-g-~~~------ifs~dnIg~a--f~GI~taRtieSE 254 (286) +.+ + +....- ..-.+.. .++.+..||.|-+.+.--.- + +.. .-+|...++. +.++++. .-.. T Consensus 219 l~~~d~~~~~n~d~-~~~~~~~~g~v~~i~Gv~V~~Sn~lP~~~~~~~~~g~~~~~~a~~~~~~~~~~~~~~~~--~~~g 295 (375) T protein:vir:10 219 IQDIGSNGLVNRDV-QGSALQSGNGVIEIAGIHIYKSMNIPFLGKYGVKYGGTTGETSPGNLGSHIGPTPENAN--ATGG 295 (375) T ss_pred HhcCCccceeeecc-cccceeccceEEEEeceEEEEeccccccccccccccccccccchhhhhccccccCCcce--eecc Confidence 866 2 322222 1222333 45678888888775542221 1 111 1112222211 1111111 0000 Q ss_pred Cccceeeeeccccc-ccCCCc-------CcceEEEEecCC Q lcl|NC_019916. 255 DFDGVALQGAGKAG-SFILDD-------NKAAIFSATPKA 286 (286) Q Consensus 255 DFdGVaLQgAgK~G-~~Ildd-------NKkAI~k~t~ka 286 (286) -| .+|+ .|+.+. +|.|...+.++. T Consensus 296 ~~--------~~y~~d~~~~~~~~~~~~~~~A~g~v~~~~ 327 (375) T protein:vir:10 296 VN--------NDYGTNAELGAKSCGLIFQKEAAGVVEAIG 327 (375) T ss_pred cc--------ccccccccccCceEEEEEchhheeeeeeec Confidence 00 1221 333333 444444444444 No 33 >protein:vir:95898 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240385;genbank:gi:66396054;genbank:GeneID:5133409 Probab=95.39 E-value=0.0022 Score=35.20 Aligned_cols=257 Identities=12% Similarity=0.086 Sum_probs=142.9 Q ss_pred CCCCcccceeeeech-hHHHHHHHHHhhhhhhhhhhcccchhcCCcccceeEEEeecccceEeecccCCCcceeecCcCC Q lcl|NC_019916. 1 MATNNNNLAARTYTK-QFAQLMQTVFGAQSVFGPTFGDLQALDGVQNNATAFSVKTNNVPVVVGEYSQDEAVAFGAGTAK 79 (286) Q Consensus 1 M~t~nnn~a~r~Y~k-q~~~ll~~vf~~qa~F~~~fgglQ~lDGVqnN~taf~vKtnd~pVvvg~Y~td~NvaFGtGTg~ 79 (286) |+...--++ -+..| .|..+++.-+.+.-.|.+..--...+.|..- +|.=.=|.+.+. =...|.-++-...+.-|.. T Consensus 1 m~~~~T~l~-d~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l~g~~G-~tv~iP~~~~ig-~a~~~~~g~~i~~~~lt~~ 77 (274) T protein:vir:95 1 MAQGMTKLT-NQIVPEVLAPMMQAELEKKLRFASFAEIDNTLVGQPG-DTLTFPAFIYSG-DAKVVAEGEKIPTDILETK 77 (274) T ss_pred CCcceeehh-heechHHHHHHHHHHHHhhhhccccceecccccCCCC-CEEEeeeecCCC-ccccccCCCccchhhcccc Confidence 776433322 23333 5777777777766666544222222333221 111011112211 0123444333333332222 Q ss_pred cccccceeEEEEecccccccccchhhhccccccccCChhHHHHHHHhhHHHHHHHHHHHHHHHHHhhhhhhh----hhhh Q lcl|NC_019916. 80 STRFGERTEIVYTDTDVPYEFTWAIHEGLDRFTVNNDLNAAVADRLDLQAQAKVRMFNNALGKKLADASTDL----GAVD 155 (286) Q Consensus 80 s~RFG~rkEIiy~DtdVpY~~~~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~~~gk~ls~~a~~~----~t~d 155 (286) + .+-.| +. +-+.|.+.. +|+.....|+ ++++++.++.+|.+.+++.+-..|.++.... .+.| T Consensus 78 ~-----~~~~i--~~---~~~a~~i~D-~~~~~~~~d~---~~~~~~~~~~~~a~~vd~~i~~~l~~a~~~~~~~~~~~d 143 (274) T protein:vir:95 78 K-----REAKI--RK---IAKGTSISD-EALLSGYGDP---QGEQVRQHGLAHANKVDDDVLEALKSAKLTVEADITKLT 143 (274) T ss_pred e-----eEEEe--ee---eecceeehH-HHHhhccchH---HHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCHH Confidence 1 11111 11 223455553 6777766665 6677788899999999999888776544322 2344 Q ss_pred hHHHHHHHHhhhhhceeeeEEEEEEECchhhhhhhccccccccccce--eeeccCcee-eecCeEEEEchHHhhc-CceE Q lcl|NC_019916. 156 DVNVMFETASAKYTNLEVVVPVRAYVTADVYNAIIDHNLVTSQKGSA--VNIDENGIV-RFRDIIITKVPEKYMQ-GKAI 231 (286) Q Consensus 156 ~V~klF~~~~~~yvn~ev~~~~~ayV~~evYNaIvD~~l~Ts~K~Ss--~NiD~ngi~-~fKgf~l~e~p~~y~q-g~~~ 231 (286) .+...-..+. -|-..+-.+.|+|++|..|.-.++-...+.|. .++=-||.+ +|.||.+-+.+ .+. +... T Consensus 144 ~i~~A~~~lg-----d~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~--~~~~~t~~ 216 (274) T protein:vir:95 144 GLQTAIDKFN-----DEDLEPMVLFISPLDAGKLRGDATTNFTRATELGDDVIVKGAFGEALGAVIVRSN--KLEAGTAI 216 (274) T ss_pred HHHHHHHHhc-----cccccccEEEeCHHHHHHHHhhccccccccccccccceeccccceecCeEEEEeC--CCCCceEE Confidence 4443332232 22335667999999999998877655444443 344445544 89999988764 233 7777 Q ss_pred EEeecceee---eccceEEEEEeeccCccceeeeecccccccCCCcCcceEEEEecCC Q lcl|NC_019916. 232 MFVPDNIGR---AFTGIVTTRTIESEDFDGVALQGAGKAGSFILDDNKAAIFSATPKA 286 (286) Q Consensus 232 ifs~dnIg~---af~GI~taRtieSEDFdGVaLQgAgK~G~~IlddNKkAI~k~t~ka 286 (286) +|.+..+|- +-+-+|+-|-+.+ ---.|.+---||..+++..| ++++|+.+ T Consensus 217 l~~~gA~~~~~~~~~~vE~~Rd~~~---~~d~i~~~~~y~~~~~~~~~--~v~~tk~~ 269 (274) T protein:vir:95 217 LAKKGAVKLITKRDFFLETDRDPST---KTTALYSDKHYVAYLYDESK--AVKITKGS 269 (274) T ss_pred EEeccceeeeecCCccccccccccc---ccCEEEEeEEEEEEEEcCCc--EEEEEcCC Confidence 888777664 2233566664443 44678888899999998876 45555555 No 34 >protein:vir:96262 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240311;genbank:gi:66395978;genbank:GeneID:5133339 Probab=95.39 E-value=0.0022 Score=35.20 Aligned_cols=257 Identities=12% Similarity=0.086 Sum_probs=142.9 Q ss_pred CCCCcccceeeeech-hHHHHHHHHHhhhhhhhhhhcccchhcCCcccceeEEEeecccceEeecccCCCcceeecCcCC Q lcl|NC_019916. 1 MATNNNNLAARTYTK-QFAQLMQTVFGAQSVFGPTFGDLQALDGVQNNATAFSVKTNNVPVVVGEYSQDEAVAFGAGTAK 79 (286) Q Consensus 1 M~t~nnn~a~r~Y~k-q~~~ll~~vf~~qa~F~~~fgglQ~lDGVqnN~taf~vKtnd~pVvvg~Y~td~NvaFGtGTg~ 79 (286) |+...--++ -+..| .|..+++.-+.+.-.|.+..--...+.|..- +|.=.=|.+.+. =...|.-++-...+.-|.. T Consensus 1 m~~~~T~l~-d~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l~g~~G-~tv~iP~~~~ig-~a~~~~~g~~i~~~~lt~~ 77 (274) T protein:vir:96 1 MAQGMTKLT-NQIVPEVLAPMMQAELEKKLRFASFAEIDNTLVGQPG-DTLTFPAFIYSG-DAKVVAEGEKIPTDILETK 77 (274) T ss_pred CCcceeehh-heechHHHHHHHHHHHHhhhhccccceecccccCCCC-CEEEeeeecCCC-ccccccCCCccchhhcccc Confidence 776433322 23333 5777777777766666544222222333221 111011112211 0123444333333332222 Q ss_pred cccccceeEEEEecccccccccchhhhccccccccCChhHHHHHHHhhHHHHHHHHHHHHHHHHHhhhhhhh----hhhh Q lcl|NC_019916. 80 STRFGERTEIVYTDTDVPYEFTWAIHEGLDRFTVNNDLNAAVADRLDLQAQAKVRMFNNALGKKLADASTDL----GAVD 155 (286) Q Consensus 80 s~RFG~rkEIiy~DtdVpY~~~~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~~~gk~ls~~a~~~----~t~d 155 (286) + .+-.| +. +-+.|.+.. +|+.....|+ ++++++.++.+|.+.+++.+-..|.++.... .+.| T Consensus 78 ~-----~~~~i--~~---~~~a~~i~D-~~~~~~~~d~---~~~~~~~~~~~~a~~vd~~i~~~l~~a~~~~~~~~~~~d 143 (274) T protein:vir:96 78 K-----REAKI--RK---IAKGTSISD-EALLSGYGDP---QGEQVRQHGLAHANKVDDDVLEALKSAKLTVEADITKLT 143 (274) T ss_pred e-----eEEEe--ee---eecceeehH-HHHhhccchH---HHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCHH Confidence 1 11111 11 223455553 6777766665 6677788899999999999888776544322 2344 Q ss_pred hHHHHHHHHhhhhhceeeeEEEEEEECchhhhhhhccccccccccce--eeeccCcee-eecCeEEEEchHHhhc-CceE Q lcl|NC_019916. 156 DVNVMFETASAKYTNLEVVVPVRAYVTADVYNAIIDHNLVTSQKGSA--VNIDENGIV-RFRDIIITKVPEKYMQ-GKAI 231 (286) Q Consensus 156 ~V~klF~~~~~~yvn~ev~~~~~ayV~~evYNaIvD~~l~Ts~K~Ss--~NiD~ngi~-~fKgf~l~e~p~~y~q-g~~~ 231 (286) .+...-..+. -|-..+-.+.|+|++|..|.-.++-...+.|. .++=-||.+ +|.||.+-+.+ .+. +... T Consensus 144 ~i~~A~~~lg-----d~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~--~~~~~t~~ 216 (274) T protein:vir:96 144 GLQTAIDKFN-----DEDLEPMVLFISPLDAGKLRGDATTNFTRATELGDDVIVKGAFGEALGAVIVRSN--KLEAGTAI 216 (274) T ss_pred HHHHHHHHhc-----cccccccEEEeCHHHHHHHHhhccccccccccccccceeccccceecCeEEEEeC--CCCCceEE Confidence 4443332232 22335667999999999998877655444443 344445544 89999988764 233 7777 Q ss_pred EEeecceee---eccceEEEEEeeccCccceeeeecccccccCCCcCcceEEEEecCC Q lcl|NC_019916. 232 MFVPDNIGR---AFTGIVTTRTIESEDFDGVALQGAGKAGSFILDDNKAAIFSATPKA 286 (286) Q Consensus 232 ifs~dnIg~---af~GI~taRtieSEDFdGVaLQgAgK~G~~IlddNKkAI~k~t~ka 286 (286) +|.+..+|- +-+-+|+-|-+.+ ---.|.+---||..+++..| ++++|+.+ T Consensus 217 l~~~gA~~~~~~~~~~vE~~Rd~~~---~~d~i~~~~~y~~~~~~~~~--~v~~tk~~ 269 (274) T protein:vir:96 217 LAKKGAVKLITKRDFFLETDRDPST---KTTALYSDKHYVAYLYDESK--AVKITKGS 269 (274) T ss_pred EEeccceeeeecCCccccccccccc---ccCEEEEeEEEEEEEEcCCc--EEEEEcCC Confidence 888777664 2233566664443 44678888899999998876 45555555 No 35 >protein:vir:78935 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522824;genbank:gi:158345059;genbank:GeneID:5687425 Probab=94.83 E-value=0.0034 Score=34.13 Aligned_cols=241 Identities=15% Similarity=0.220 Sum_probs=130.3 Q ss_pred CCCCcc---------cceeeeechhHHHHHHHHHhhhhhhhhhhcccchhcCCcccceeEEEeecccceEeec----ccC Q lcl|NC_019916. 1 MATNNN---------NLAARTYTKQFAQLMQTVFGAQSVFGPTFGDLQALDGVQNNATAFSVKTNNVPVVVGE----YSQ 67 (286) Q Consensus 1 M~t~nn---------n~a~r~Y~kq~~~ll~~vf~~qa~F~~~fgglQ~lDGVqnN~taf~vKtnd~pVvvg~----Y~t 67 (286) |.+-|+ +--.-+|-|+|-+.+.+-|+.++.|++.+--. .+.| =|+--.| .+|+ |.+ T Consensus 1 ms~~~~~t~~~~~~s~~d~al~le~f~geV~~af~~~s~~~~~~~~r-ti~~---------g~s~~~~-~iG~~~~~~~~ 69 (335) T protein:vir:78 1 MSFLNDLTRPNYAGKNADVDIHLEEHLGIVDKHFAYTSKFAPLMNIR-DLRG---------SNVVRLD-RLGNVEAKGRR 69 (335) T ss_pred CCccccccccccccccchhhhhhhhhhhHHHHHHHHhhhhcccccee-eecc---------ceeEEEe-eeeeeeecccc Confidence 766541 01124778999999999999999999765432 2211 1333445 3465 322 Q ss_pred -CCcceeecCcCCcccccceeEEEEecccccccccchhhhccccccccCChhHHHHHHHhhHHHHHHHHHHHHHHHHHhh Q lcl|NC_019916. 68 -DEAVAFGAGTAKSTRFGERTEIVYTDTDVPYEFTWAIHEGLDRFTVNNDLNAAVADRLDLQAQAKVRMFNNALGKKLAD 146 (286) Q Consensus 68 -d~NvaFGtGTg~s~RFG~rkEIiy~DtdVpY~~~~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~~~gk~ls~ 146 (286) ++.. .++|.-.-|-+|-.|+-+ |.--| =.=||..+-+=|+-+.+++ -+++|-.|++|..+-..|.. T Consensus 70 pG~~l-------~~~~~~~~k~~itID~ll-~a~~~--VddlDe~~~~yDvR~e~s~---~~G~aLA~~~Dq~~~~~l~~ 136 (335) T protein:vir:78 70 AGEEL-------ERSRVVNDKWNLTVDTLL-YLRHQ--FDHQDEWTQSFDMRKEVAE---LDGQELARKFDQACLIQVIK 136 (335) T ss_pred cCccc-------CCCCcccCCeEEEeccee-echhh--HhhHHHhhcCchhHHHHHH---HHHHHHHHHHHHHHHHHHHh Confidence 2222 233454545588889877 33333 2347777877777776665 46788999999988766655 Q ss_pred hhhhh-------------------------hhhhhHHHHHHHHhhhhhceee----eEEEEEEECchhhhhhhccccccc Q lcl|NC_019916. 147 ASTDL-------------------------GAVDDVNVMFETASAKYTNLEV----VVPVRAYVTADVYNAIIDHNLVTS 197 (286) Q Consensus 147 ~a~~~-------------------------~t~d~V~klF~~~~~~yvn~ev----~~~~~ayV~~evYNaIvD~~l~Ts 197 (286) ++... ...+.+...+-.+.+......| .-.-+++|+|+.|.+|++++-.-. T Consensus 137 aa~~~a~~~~~~~~~~G~~~~~~~tg~~~~~~~~~l~~a~~~a~~~l~ekdvP~~~~~~rv~vv~P~~y~~Ll~~~~l~n 216 (335) T protein:vir:78 137 AAAMDAPVDLEDAFSPGVLEKLDLTGLTAKEAAEKIVRMHRRVVETFIERDLGDAVYSEGLTPMSPRVFSLLLEHDKLMS 216 (335) T ss_pred hcccccccccCCCcCCCcceeeeeccccccccHHHHHHHHHHHHHHHHhccCCCCCCCccEEEeChHHHHHHhccccccc Confidence 44210 0112333444444444443333 334689999999999999864322 Q ss_pred cc--ccee--eeccCceeeecCeEEEEchHHhhcCceEEEeecceeeeccceEEEEEeeccCccceeeeecccccccCCC Q lcl|NC_019916. 198 QK--GSAV--NIDENGIVRFRDIIITKVPEKYMQGKAIMFVPDNIGRAFTGIVTTRTIESEDFDGVALQGAGKAGSFILD 273 (286) Q Consensus 198 ~K--~Ss~--NiD~ngi~~fKgf~l~e~p~~y~qg~~~ifs~dnIg~af~GI~taRtieSEDFdGVaLQgAgK~G~~Ild 273 (286) .- +|.. ..=.-++++.-||.|.+.|. | |.+ + .++.++ +..|++-+=-..-..|.| T Consensus 217 ~~~~~s~~~~~~~~g~v~~v~Gv~V~~Sn~--l--------P~~------~-~t~~~l-g~a~n~~~~d~~~~~~~~--- 275 (335) T protein:vir:78 217 VEYQATGATNDYVKSRVAILNGVKVLETPR--F--------ATK------A-ISAHPL-GRHFNVSAEEAERQIALF--- 275 (335) T ss_pred ccccccccccccccceeEEeeceEEEeecc--C--------CCC------C-Cccccc-cccCCcccccccceEEEE--- Confidence 11 1111 11122478888888887652 1 211 0 112222 112222110000112333 Q ss_pred cCcceEEEEecCC Q lcl|NC_019916. 274 DNKAAIFSATPKA 286 (286) Q Consensus 274 dNKkAI~k~t~ka 286 (286) =.++|+..+.++. T Consensus 276 ~~~~Al~t~~~~~ 288 (335) T protein:vir:78 276 LPSKTLITAQVAP 288 (335) T ss_pred EecceEEEEEEEe Confidence 3566777666654 No 36 >protein:vir:80180 Length: 381 # NCBI annotation: capsid protein # Family: family:all:2203 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285797;genbank:gi:148747831;genbank:GeneID:5220456 Probab=94.25 E-value=0.0049 Score=33.23 Aligned_cols=262 Identities=8% Similarity=0.002 Sum_probs=120.6 Q ss_pred CCCCcccceeeeec-hhHHHHHHHHHhhhhhhhhhhcccchhcCCcccceeEEEeecccce-EeecccCCCcceeecCcC Q lcl|NC_019916. 1 MATNNNNLAARTYT-KQFAQLMQTVFGAQSVFGPTFGDLQALDGVQNNATAFSVKTNNVPV-VVGEYSQDEAVAFGAGTA 78 (286) Q Consensus 1 M~t~nnn~a~r~Y~-kq~~~ll~~vf~~qa~F~~~fgglQ~lDGVqnN~taf~vKtnd~pV-vvg~Y~td~NvaFGtGTg 78 (286) |+...+.-.+..|- ++|.+.+...|+++..|.+...- ..+.| ++.+| |++..++- -+..|..+....+. T Consensus 11 ~~~~~~~t~~~~fiPev~s~~v~~~l~~~lv~~~l~~~-~~~~~-~~GdT---V~ip~~g~~~a~d~~~g~~i~~~---- 81 (381) T protein:vir:80 11 KGSAVDLSNVQVFIPEVWSSEVRMFRDQKFAALEATKK-IPFEG-KKGDL---IHIPNISRAAVYDKQPQTPVNLQ---- 81 (381) T ss_pred cCcccchhhHHhhhhHHHHHHHHHHHHHhhhhhhcccc-cccee-ecCce---EEeeccCcceeeeecCCCccccc---- Confidence 44444444444454 69999999999999999764322 11122 22222 22222211 23446544332221 Q ss_pred CcccccceeEEEEecccccccccchhhhccccccccCChhHHHHHHHhhHHHHHHHHHHHHHHHHHhhhh---------- Q lcl|NC_019916. 79 KSTRFGERTEIVYTDTDVPYEFTWAIHEGLDRFTVNNDLNAAVADRLDLQAQAKVRMFNNALGKKLADAS---------- 148 (286) Q Consensus 79 ~s~RFG~rkEIiy~DtdVpY~~~~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~~~gk~ls~~a---------- 148 (286) +-...+..+-.|+..=+... ..-+|+....-|+.+.+..++. +|-.|.++..+=..+.... T Consensus 82 ---~~~~~~~~itID~~~~~~~~---Idd~D~~~~~~D~~~~~~~~~~---~aLA~~~D~~i~~~~~~~~~~~~~~~~t~ 152 (381) T protein:vir:80 82 ---ARTDSEFTFTVTKYKESSFM---IEDIVNTQASYTLRQYYTKEAG---YALARDMDNFALAHRAVINAFPSQRIYSY 152 (381) T ss_pred ---ccCCceEEEEEeeeeeccee---echHHHHhhccChHHHHHHHHH---HHHHHHHHHHHHHHHhhcccccccccccc Confidence 11122222334443322221 2356667777777777666654 5556666665433321100 Q ss_pred ----------------hhhhhhhhHHHHHHHHhhhhhceeeeEEEEEEECchhhhhhhccccccccccceeeeccCc-ee Q lcl|NC_019916. 149 ----------------TDLGAVDDVNVMFETASAKYTNLEVVVPVRAYVTADVYNAIIDHNLVTSQKGSAVNIDENG-IV 211 (286) Q Consensus 149 ----------------~~~~t~d~V~klF~~~~~~yvn~ev~~~~~ayV~~evYNaIvD~~l~Ts~K~Ss~NiD~ng-i~ 211 (286) ....+.+.+.++-..+.+..+. .....++|+|+.|..|.-++--+.+...+.++-.+| |- T Consensus 153 ~~~i~~~~~~~~~t~~~~~~t~~~i~~a~~~Lde~~VP---~egR~lvv~P~~~~~Ll~~~~~~~ad~~~~~~l~~G~Ig 229 (381) T protein:vir:80 153 DTTLGDGTVNAHLTGTPAPLTYAALLLAKQKLDEADVP---QEGRIVMVSPAQYIDLLSINQFISVDFSQVKPVTSGVVG 229 (381) T ss_pred cccccccccccccccchhhHHHHHHHHHHHHHhhcCCC---cCCcEEEeCHHHHHHHhhchhhhhhhhccchhhhceeee Confidence 0011223344444444444332 134688999999999998765555443333444555 67 Q ss_pred eecCeEEEE---chHHhhcCceEEE-eecceeeeccceEEEEEeeccCccceeeeecccccc----------------cC Q lcl|NC_019916. 212 RFRDIIITK---VPEKYMQGKAIMF-VPDNIGRAFTGIVTTRTIESEDFDGVALQGAGKAGS----------------FI 271 (286) Q Consensus 212 ~fKgf~l~e---~p~~y~qg~~~if-s~dnIg~af~GI~taRtieSEDFdGVaLQgAgK~G~----------------~I 271 (286) ++-||.|-+ +|..--.+....+ .|. .+.-+|+-.+.-.++.+.+.++-.--.|+- +. T Consensus 230 ~i~G~~Vv~Sn~lp~~~~t~~~~~agap~---~~~~~~~~~~~~g~~s~~a~av~~~k~yd~~~~~~~~~~~~~~g~~~~ 306 (381) T protein:vir:80 230 TILGMEVIVTTQIGINSLTGYVNGQGAPT---QPTPGVLGSPYLPDQAGTANVVNTGSASDLAVSLSYFGLPVFSGAGAT 306 (381) T ss_pred EEcceEEEeecccccccccceeeeccccc---cccccccccccccccccceeeeeeeeeeceeeeeeeccceeeecceee Confidence 899999998 4431111111111 121 112233334444444443333322111111 11 Q ss_pred CCcCc----------ceEEEEe-----cCC Q lcl|NC_019916. 272 LDDNK----------AAIFSAT-----PKA 286 (286) Q Consensus 272 lddNK----------kAI~k~t-----~ka 286 (286) .++.| +.+..+. +.+ T Consensus 307 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 336 (381) T protein:vir:80 307 AADGGQTLGSFGGANRWATAVVCHPDWLAV 336 (381) T ss_pred ecCCCceeeeehhhhhhhhhcccccccccc Confidence 22222 2222222 111 No 37 >protein:vir:105334 Length: 276 # NCBI annotation: putative phage major capsid protein # Family: family:all:522 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950669;genbank:gi:119967839;genbank:GeneID:4643213 Probab=93.96 E-value=0.0058 Score=32.84 Aligned_cols=260 Identities=13% Similarity=0.090 Sum_probs=134.2 Q ss_pred CCCCcccceeeeechhHHHHHHHHHhhhhhhhhhhcccchhcCCcccceeEEEeecccceEeecccCCCcceeecCcCCc Q lcl|NC_019916. 1 MATNNNNLAARTYTKQFAQLMQTVFGAQSVFGPTFGDLQALDGVQNNATAFSVKTNNVPVVVGEYSQDEAVAFGAGTAKS 80 (286) Q Consensus 1 M~t~nnn~a~r~Y~kq~~~ll~~vf~~qa~F~~~fgglQ~lDGVqnN~taf~vKtnd~pVvvg~Y~td~NvaFGtGTg~s 80 (286) |+...--++-=+-=.-|..++..-+.+...|.+.---.-.|.|..-+ +.=.-+.+++. =..+|..++..+.+.=|. T Consensus 1 Ma~~~T~l~d~i~Pev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~-ti~iP~~~~ig-da~~~~eg~~i~~~~lt~-- 76 (276) T protein:vir:10 1 MAQGTTTKSTQIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGD-TLTFPAFVYSG-DATVVPEGQKIPVDKIET-- 76 (276) T ss_pred CCcceeehhhhhchHHHHHHHHHHHHhhhhhcccceecccccCCCCC-EEEeeeecCCC-ccccccCCCccCcccccc-- Confidence 77432222222333345666666666666663321112223332211 11001111110 000122222222221111 Q ss_pred ccccceeEEEEecccccccccchhhhccccccccCChhHHHHHHHhhHHHHHHHHHHHHHHHHHhhhhhh----hhhhhh Q lcl|NC_019916. 81 TRFGERTEIVYTDTDVPYEFTWAIHEGLDRFTVNNDLNAAVADRLDLQAQAKVRMFNNALGKKLADASTD----LGAVDD 156 (286) Q Consensus 81 ~RFG~rkEIiy~DtdVpY~~~~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~~~gk~ls~~a~~----~~t~d~ 156 (286) ++.+-.|. -+-+.|.+. -++...-.-|+ +++.++.++.+|.|.+++.+=..|..+... ..+.|. T Consensus 77 ---~~~~a~i~-----~~~k~~~~t-D~a~~~~~~dp---~~~~~~~~~~~~a~~~d~~~~~~l~~~~~~~~~~~~t~d~ 144 (276) T protein:vir:10 77 ---NRREAKIH-----KIGKGTDIT-DEALLSGYGDP---QGEAVRQHGLAIANKVDNDVLEALRGTKLTVSADIGTLAG 144 (276) T ss_pred ---ceeeEEee-----hcccccccc-HHHHHhhccch---HHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCHHH Confidence 11111110 012333332 23344444444 677888899999999998887777554322 234566 Q ss_pred HHHHHHHHhhhhhceeeeEEEEEEECchhhhhhhccc--cccccccceeeeccCc-eeeecCeEEEEchHHhhc-CceEE Q lcl|NC_019916. 157 VNVMFETASAKYTNLEVVVPVRAYVTADVYNAIIDHN--LVTSQKGSAVNIDENG-IVRFRDIIITKVPEKYMQ-GKAIM 232 (286) Q Consensus 157 V~klF~~~~~~yvn~ev~~~~~ayV~~evYNaIvD~~--l~Ts~K~Ss~NiD~ng-i~~fKgf~l~e~p~~y~q-g~~~i 232 (286) +.+....+... -..+.++.|+|++|..|.-.. -.+......-++-.|| |-+|.|+.+-..+. +- |...+ T Consensus 145 i~~A~~~lgd~-----~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~--~p~~t~~l 217 (276) T protein:vir:10 145 LEAAIDTFDDE-----DLEPMVLFINPKDAGKLRSSASDNFTRATELGDNIIVKGAFGEALGAVIVRSKK--LDEGEAIL 217 (276) T ss_pred HHHHHHHhccc-----cCcccEEEEcHHHHHHHHHhccccccccccccccceeccccceecceeEEEcCC--CCcceEEE Confidence 55544444332 124567899999999996432 2232333334555666 55899999887653 43 78888 Q ss_pred Eeecceee---eccceEEEEEeeccCccceeeeecccccccCCCcCcceEEEEecCC Q lcl|NC_019916. 233 FVPDNIGR---AFTGIVTTRTIESEDFDGVALQGAGKAGSFILDDNKAAIFSATPKA 286 (286) Q Consensus 233 fs~dnIg~---af~GI~taRtieSEDFdGVaLQgAgK~G~~IlddNKkAI~k~t~ka 286 (286) |.+.-++. .-+-+++-|-+.+ ---.+-|---||..+.++.|...++...+. T Consensus 218 ~~~gAi~~~~~~~~~vE~dRd~~~---~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~ 271 (276) T protein:vir:10 218 AKRGAVKLITKRDFFLETDRDPST---KTTALYSDKHYVAYLYDESKAVKVTKGAGT 271 (276) T ss_pred EeccceeeeecCCceeecccchhh---cccEEEEeeEEEEEEEcCcceEEEecCCcC Confidence 88887774 2233566664443 345666777799999998887777644444 No 38 >protein:vir:1541 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052109;swissprot:trembl:q9t107;genbank:gi:9634035;uniprot:Q9T107;genbank:GeneID:1262383 Probab=93.62 E-value=0.0069 Score=32.42 Aligned_cols=265 Identities=15% Similarity=0.129 Sum_probs=132.5 Q ss_pred CCCC---c-----ccce-----ee-eechhHHHHHHHHHhhhhhhhhhhcccchhcCCcccceeEEEeeccc-ceEeecc Q lcl|NC_019916. 1 MATN---N-----NNLA-----AR-TYTKQFAQLMQTVFGAQSVFGPTFGDLQALDGVQNNATAFSVKTNNV-PVVVGEY 65 (286) Q Consensus 1 M~t~---n-----nn~a-----~r-~Y~kq~~~ll~~vf~~qa~F~~~fgglQ~lDGVqnN~taf~vKtnd~-pVvvg~Y 65 (286) |+.. + ..++ +. +|-|+|.+.+-+-|++++.|++.+-- +...| -+ +++..-+ ++.++.| T Consensus 1 ma~~~~~~~~~t~~~~~~~~~~~~a~~ie~f~g~V~~~f~~~s~~~~~~~~-~~~~~----G~--sv~i~~ig~~t~~~~ 73 (347) T protein:vir:15 1 MANIQGGQQIGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTMPRHML-RSIAS----GK--SAQFPVIGRTKAAYL 73 (347) T ss_pred CCccccCCccccccccCCCcchHHHHHHHHHHHHHHHHHHHhhhhhhcccc-ccccc----cc--eeEeeeccceeeeee Confidence 4321 1 1111 22 78899999999999999999987643 21111 11 2222211 2334445 Q ss_pred cCCCcceeecCcCCcccccceeEEEEecccccccccchhhhccccccccCChhHHHHHHHhhHHHHHHHHHHHHHHHHHh Q lcl|NC_019916. 66 SQDEAVAFGAGTAKSTRFGERTEIVYTDTDVPYEFTWAIHEGLDRFTVNNDLNAAVADRLDLQAQAKVRMFNNALGKKLA 145 (286) Q Consensus 66 ~td~NvaFGtGTg~s~RFG~rkEIiy~DtdVpY~~~~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~~~gk~ls 145 (286) ..+....-. -.+- -.-+-++-.|+..-+. +.| +-||+...+-|+- ++...-++.|=.|.+|..+-..|. T Consensus 74 ~~g~~l~~~---~~~~--~~~e~~ltID~~~~~~--~~V-ddlD~~q~~~D~~---~~~~~~~g~aLA~~~D~~i~~~l~ 142 (347) T protein:vir:15 74 KPGENLDDK---RKDI--KHTEKVIHIDGLLTAD--VLI-YDIEDAMNHYDVR---AEYTAQLGESLAMAADGAVLAELA 142 (347) T ss_pred ccCCCCCCC---CCCC--ccceEEEEechhhhhh--HHh-hhHHHHhcCCcch---HHHHHHHHHHHHHHHHHHHHHHHH Confidence 554443211 0000 1112235556554333 223 6888888887765 455666888999999988876664 Q ss_pred hhhh--------------------------hh----hhhhhHHHHHHHHhhhhhceee-eEEEEEEECchhhhhhhcccc Q lcl|NC_019916. 146 DAST--------------------------DL----GAVDDVNVMFETASAKYTNLEV-VVPVRAYVTADVYNAIIDHNL 194 (286) Q Consensus 146 ~~a~--------------------------~~----~t~d~V~klF~~~~~~yvn~ev-~~~~~ayV~~evYNaIvD~~l 194 (286) ..+. +. .+.+.+..++-.+.+..-...| ...-.+.|+|+.|.+|..++- T Consensus 143 ~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~d~~~~a~~~Lde~~VP~~gR~~vv~P~~y~~LL~~~~ 222 (347) T protein:vir:15 143 GLVNLPDASNENIEGLGKPTVLTLVKPTTGDLTDPVELGKAIIAQLTIARASLTKNYVPAADRTFYTTPDNYSAILAALM 222 (347) T ss_pred HHhhccccccccccccCccccccccccccccchhhhhHHHHHHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHhcccc Confidence 3210 00 0112333444444444444444 235679999999999998866 Q ss_pred ccccccceeeeccCcee-eecCeEEEEchHHhhc-C-c-----------------------------eEEEeecceeeec Q lcl|NC_019916. 195 VTSQKGSAVNIDENGIV-RFRDIIITKVPEKYMQ-G-K-----------------------------AIMFVPDNIGRAF 242 (286) Q Consensus 195 ~Ts~K~Ss~NiD~ngi~-~fKgf~l~e~p~~y~q-g-~-----------------------------~~ifs~dnIg~af 242 (286) .++.-..+...-.+|.+ +.-||.|-+.+.--.. + . -++|.+.-+|..= T Consensus 223 ~~~~d~~~~~~~~~G~Vg~i~G~~V~~Sn~lp~~~~t~~~~~~~~g~~~~~~~~~~~~~~~~f~~~~~l~~h~~A~g~v~ 302 (347) T protein:vir:15 223 PNAANYQALIDHERGTIRNVMGFEVVEVPHLTAGGAGDTREDAPADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVK 302 (347) T ss_pred cccccccccccccceEEEEEeceEEEecccccccccccccccccccccccccccccceeeeccccceeeeeccceeeeeE Confidence 55433333344577854 8889999885542211 1 0 1122222222111 Q ss_pred -c--ceEEEEEeeccCccceeeeecccccccCCCcCcceEEEEecCC Q lcl|NC_019916. 243 -T--GIVTTRTIESEDFDGVALQGAGKAGSFILDDNKAAIFSATPKA 286 (286) Q Consensus 243 -~--GI~taRtieSEDFdGVaLQgAgK~G~~IlddNKkAI~k~t~ka 286 (286) . =++..| .+..-+-.+-|-=-||-=+++.....-++...-+ T Consensus 303 ~~~~~~e~~~---~~~~~~d~i~~~~~~G~~vlrP~~av~~~~~~~~ 346 (347) T protein:vir:15 303 LKDLALERAR---RANYQADQIIAKYAMGHGGLRPEAAGAIVLPKVS 346 (347) T ss_pred eeceeeeecc---cchhhhhhhehhhhcCCceeccccEEEEecCCCC Confidence 0 112222 2333333333333444445555544433222112 No 39 >protein:vir:102655 Length: 322 # NCBI annotation: Hypothetical protein # Family: family:all:6384 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_052979;genbank:gi:50282923;genbank:GeneID:2948122 Probab=93.25 E-value=0.0082 Score=32.01 Aligned_cols=256 Identities=13% Similarity=0.071 Sum_probs=144.8 Q ss_pred CCCCcccceeeeechhHHHHHHHHHhhhh-hhhhhhcccchhcCCcccceeEEEeecccceE----eec------ccCCC Q lcl|NC_019916. 1 MATNNNNLAARTYTKQFAQLMQTVFGAQS-VFGPTFGDLQALDGVQNNATAFSVKTNNVPVV----VGE------YSQDE 69 (286) Q Consensus 1 M~t~nnn~a~r~Y~kq~~~ll~~vf~~qa-~F~~~fgglQ~lDGVqnN~taf~vKtnd~pVv----vg~------Y~td~ 69 (286) |++ .--..|.+||..-...+|+-+. -|+++ .+...++...++.=+...++.+++ +++ |+|.. T Consensus 13 Ms~----~i~~~fv~qy~~~v~~~~qq~~s~L~~t---V~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~dtp~ 85 (322) T protein:vir:10 13 IAG----DIDQAFVQTYETTLRILSQQKSAKLKQY---CQHKNESSESHNWETLASMDPDAVKRKRSRQQSADGTYPTPV 85 (322) T ss_pred eec----hhhhHHHHHHHHHHHHHHHHhhhhhhcc---cccccccccccceeecccccccccccccccccccCcccCCCc Confidence 666 2356799999999999987554 34433 777778888888777777777776 221 22222 Q ss_pred c-ceeecCcCCcccccceeEEEEecccccccccch-hhhccccccccCChhHHHHHHHhhHHHHHHHHHHHHHHHHHhhh Q lcl|NC_019916. 70 A-VAFGAGTAKSTRFGERTEIVYTDTDVPYEFTWA-IHEGLDRFTVNNDLNAAVADRLDLQAQAKVRMFNNALGKKLADA 147 (286) Q Consensus 70 N-vaFGtGTg~s~RFG~rkEIiy~DtdVpY~~~~a-iHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~~~gk~ls~~ 147 (286) | .+ ++.|. +.-. +|.|+ .=+.+|+.-.+-|+.+..++ .|+-|..|-.+..+=.++... T Consensus 86 ~~~~----------~~~r~-~~~~------d~~~~~~VDd~D~~k~~~D~~~~~~~---~~a~AL~R~~D~~I~~a~~g~ 145 (322) T protein:vir:10 86 NNKP----------FAKRR-TNVD------TYDTGHVVEQEDISQMLLDPNSALIT---SQAYAMARKTDDLIIAGAWKP 145 (322) T ss_pred cccc----------cceEE-Eeec------ccccceecchHHHHHhhcCchHHHHH---HHHHHhhhHHHHHHHhhhhcc Confidence 1 22 33444 3322 23343 23678888899999999987 577888888887665555443 Q ss_pred hhhh--------------------hhhhhHHHHHHHHhhhhhceeeeEEEEEEECchhhhhhhccccccccccceee-ec Q lcl|NC_019916. 148 STDL--------------------GAVDDVNVMFETASAKYTNLEVVVPVRAYVTADVYNAIIDHNLVTSQKGSAVN-ID 206 (286) Q Consensus 148 a~~~--------------------~t~d~V~klF~~~~~~yvn~ev~~~~~ayV~~evYNaIvD~~l~Ts~K~Ss~N-iD 206 (286) |..- .|.+.+.+++..+.+..++-+ ++-.+=|+|+-|+.|.--+-.|++--.+.+ +- T Consensus 146 a~~~~~gt~v~~~ss~~i~~g~~g~t~~kl~~a~~~l~~~dvp~d--~~R~~vv~p~~~~~LL~d~~~ts~D~~~~~~l~ 223 (322) T protein:vir:10 146 ASIKGTGQPVEFLATQEIGDGTKPISFDYVTEITERFLENEIEPE--VSKVIVIGPTQARKLLQITEATSADYTSAMDLQ 223 (322) T ss_pred ccccccccccccCCCcccccCccchhHHHHHHHHHHHHhcCCCCC--CCeEEEeCHHHHHHHhcchhhhhhhcccchhhh Confidence 3210 123555556666665555433 233577899999998855555544333333 33 Q ss_pred cCc-eeeecCeEE---EEch----HHhhcC----------ceEEEeecceeeeccceEEEEEee-ccCccceeeeecccc Q lcl|NC_019916. 207 ENG-IVRFRDIII---TKVP----EKYMQG----------KAIMFVPDNIGRAFTGIVTTRTIE-SEDFDGVALQGAGKA 267 (286) Q Consensus 207 ~ng-i~~fKgf~l---~e~p----~~y~qg----------~~~ifs~dnIg~af~GI~taRtie-SEDFdGVaLQgAgK~ 267 (286) .+| +=+|=||.+ +.+| .++-+| ..+++....||.+--=.-+++.-+ .+-+....+-+...+ T Consensus 224 ~~G~ig~~lGf~~i~s~~lp~~~~t~~~~~~~~~~~~~~~~~~a~~k~Av~~a~~~dv~~~i~~~~~~~~a~~I~~~~~~ 303 (322) T protein:vir:10 224 SKGIITNWMGYTWIVSTRLDKFDPTQWGMAAEDGPQGDEIWCIAMTDMALGYHSCKDIWTKVAEDPSASFAWRIYSAFTA 303 (322) T ss_pred hcCeeeeeeeEEEEEeccCCccccccccccccCCCCccceeEEEEecCceeEEEeeeeeEEeeccCCcchhhhhhhhhhh Confidence 567 445666665 4444 222221 133555666665521112223322 111223334455556 Q ss_pred cccCCCcCcceEEEEecCC Q lcl|NC_019916. 268 GSFILDDNKAAIFSATPKA 286 (286) Q Consensus 268 G~~IlddNKkAI~k~t~ka 286 (286) |-=.+|+++-.-+.. +.+ T Consensus 304 Ga~ri~~~gVv~i~~-~e~ 321 (322) T protein:vir:10 304 DCVRVEDEHIFKLRL-KNS 321 (322) T ss_pred CceEeccCcEEEEEE-ecc Confidence 666778866555555 334 No 40 >protein:vir:94711 Length: 347 # NCBI annotation: capsid # Family: family:all:975 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338120;genbank:gi:77118198;genbank:GeneID:3707734 Probab=93.06 E-value=0.0089 Score=31.82 Aligned_cols=261 Identities=18% Similarity=0.137 Sum_probs=133.7 Q ss_pred CCCCcccc-------------eeeeechhHHHHHHHHHhhhhhhhhhhcccchhcCCcccceeEEEeecccceEeecccC Q lcl|NC_019916. 1 MATNNNNL-------------AARTYTKQFAQLMQTVFGAQSVFGPTFGDLQALDGVQNNATAFSVKTNNVPVVVGEYSQ 67 (286) Q Consensus 1 M~t~nnn~-------------a~r~Y~kq~~~ll~~vf~~qa~F~~~fgglQ~lDGVqnN~taf~vKtnd~pVvvg~Y~t 67 (286) |+-.|... ++-+|-|+|.+.+-+-|+.++.|++..- .+. |+. -+-..+..- =++.++.|.. T Consensus 1 m~~~~~~~~~t~~g~~~~~~d~~al~ik~f~~eV~~~f~~~s~~~~~~~-~r~---i~~-G~sv~i~~i-G~~tv~~~t~ 74 (347) T protein:vir:94 1 MANVPGQKIGTDQGKGKSSSDALALFLKVFAGEVLTAFTRRSVTADKHI-VRT---IQN-GKSAQFPVM-GRTSGVYLAP 74 (347) T ss_pred CCCCCccccccccccCCccccHHHHHHHHHhHHHHHHHHHHHhhhcccc-ccc---ccc-cceEEEecc-cceeeeeecC Confidence 43333221 2567889999999999999999987552 221 111 111111111 1233334443 Q ss_pred CCcceeecCcCCcccccceeEEEEecccccccccchhhhccccccccCChhHHHHHHHhhHHHHHHHHHHHHHHHHHhhh Q lcl|NC_019916. 68 DEAVAFGAGTAKSTRFGERTEIVYTDTDVPYEFTWAIHEGLDRFTVNNDLNAAVADRLDLQAQAKVRMFNNALGKKLADA 147 (286) Q Consensus 68 d~NvaFGtGTg~s~RFG~rkEIiy~DtdVpY~~~~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~~~gk~ls~~ 147 (286) ++... |+-.+ --.-+-+|-.|+.. |-+.+=.-||+...+=|+.+ +....+++|=.|.++..+...|... T Consensus 75 G~~l~---~~~~~--~~~~e~~itID~~~---~~~~~VddiD~~q~~~D~~~---~~~~~~g~aLa~~~D~~i~~~~~~~ 143 (347) T protein:vir:94 75 GERLS---DKRKG--IKHTEKVITIDGLL---TADVMIFDIEDAMNHYDVAG---EYSNQLGEALAIAADGAVLAEMAIL 143 (347) T ss_pred CCCcC---CCCCC--CCcceEEEEecchh---hhhHHhhhHHHHhcCcchHH---HHHHHHHHHHHHHHHHHHHHHHHHH Confidence 32211 00001 11112255666543 33334457888888878655 5666788899999998776544321 Q ss_pred h------------------------hhhh----hhhhHHHHHHHHhhhhhceeee-EEEEEEECchhhhhhhcccccccc Q lcl|NC_019916. 148 S------------------------TDLG----AVDDVNVMFETASAKYTNLEVV-VPVRAYVTADVYNAIIDHNLVTSQ 198 (286) Q Consensus 148 a------------------------~~~~----t~d~V~klF~~~~~~yvn~ev~-~~~~ayV~~evYNaIvD~~l~Ts~ 198 (286) + .++. +.+.+...+-.+.+..-+..|- ....+.|+|+.|-.|++++..++. T Consensus 144 aa~~~~~~~~~~g~~~~s~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~~R~~vv~P~~~~~Ll~~~~~~~~ 223 (347) T protein:vir:94 144 CNLPAASNENIAGLGTASVLEVGKKADLDTPAKLGEAIIGQLTIARAKLTSNYVPAGDRYFYTTPDNYSAILAALMPNAA 223 (347) T ss_pred hccccccccccCCCcccceeeccccccccchhhhHHHHHHHHHHHHHHHhhcCCCCCCcEEEeCHHHHHHHhccchhhhh Confidence 1 0000 1123333333344444444442 256899999999999999887665 Q ss_pred ccce-eeeccCceeeecCeEEEEchHHhh-------c--------Cc--------------------eEEEeecceeeec Q lcl|NC_019916. 199 KGSA-VNIDENGIVRFRDIIITKVPEKYM-------Q--------GK--------------------AIMFVPDNIGRAF 242 (286) Q Consensus 199 K~Ss-~NiD~ngi~~fKgf~l~e~p~~y~-------q--------g~--------------------~~ifs~dnIg~af 242 (286) -.++ ..++.-.|.++-||.|-+.|.-=. + |+ .++|.|+-+| T Consensus 224 ~~~~~~~~~~G~Vg~i~G~~V~~Sn~lp~~~~t~~~~~~~~~~~aG~~~~~~~~~~~~~~~~~~~~~~l~~h~~A~~--- 300 (347) T protein:vir:94 224 NYAALIDPETGNIRNVMGFVVVEVPHLVQGGAGETRGDDGITIASGQKHAFPATASSDVKVTMDNVVGLFSHRSAVG--- 300 (347) T ss_pred hccccccccccceEEEeceEEEecCcccccccccccccCcceecCcccccccccchhhhcccccceeEEEeehhhhh--- Confidence 4333 334434567999999988763211 1 11 1122233222 Q ss_pred cceEEEEEe--eccCccce-----eeeecccccccCCCcCcceEEEEecCC Q lcl|NC_019916. 243 TGIVTTRTI--ESEDFDGV-----ALQGAGKAGSFILDDNKAAIFSATPKA 286 (286) Q Consensus 243 ~GI~taRti--eSEDFdGV-----aLQgAgK~G~~IlddNKkAI~k~t~ka 286 (286) +++.| +.|-|.-. .+-|---||-=++.-.-.+.++++ .| T Consensus 301 ----~v~~~~~~~e~~r~~~~~~d~i~~~~~~G~~~~rP~~a~~~~~~-~A 346 (347) T protein:vir:94 301 ----TVKLRDLALERDRDVDAQGDLIVGKYAMGHGGLRPEAAGALVFS-PA 346 (347) T ss_pred ----hhhcccccccchhchhhHHHHhhhhhhhcCcccccceeEEEEec-CC Confidence 22233 34444433 333333455445544444444444 34 No 41 >protein:vir:1239 Length: 274 # NCBI annotation: similar to phage B1 major head protein # Family: family:all:522 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510938;genbank:gi:17426272;genbank:GeneID:927376 Probab=92.72 E-value=0.01 Score=31.49 Aligned_cols=258 Identities=13% Similarity=0.099 Sum_probs=136.7 Q ss_pred CCCCcccceeeeechhHHHHHHHHHhhhhhhhhhhcccchhcCCcccceeEEEeecccceEeecccCCCcceeecCcCCc Q lcl|NC_019916. 1 MATNNNNLAARTYTKQFAQLMQTVFGAQSVFGPTFGDLQALDGVQNNATAFSVKTNNVPVVVGEYSQDEAVAFGAGTAKS 80 (286) Q Consensus 1 M~t~nnn~a~r~Y~kq~~~ll~~vf~~qa~F~~~fgglQ~lDGVqnN~taf~vKtnd~pVvvg~Y~td~NvaFGtGTg~s 80 (286) |+...--++-=+--..|..++..=+.++-.|.+..--.-.+.|..- +|.=.-|.+.++ =..+|.-++......-|.. T Consensus 1 ma~~~T~l~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G-~tv~iP~~~~ig-~a~~~~~g~~i~~~~lt~~- 77 (274) T protein:vir:12 1 MAQGLTKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPG-DTLTFPAFVYSG-DAQVVAEGEKIPTDILETK- 77 (274) T ss_pred CCcceeehhhhhchHHHHHHHHHHHHhhhhhcccceecccccCCCC-CEEEEeeecCCC-ccccccCCCccchhhcccc- Confidence 7764433332232334666665555555444332211222223211 111011111110 0123444333333322222 Q ss_pred ccccceeEEEEecccccccccchhhhccccccccCChhHHHHHHHhhHHHHHHHHHHHHHHHHHhhhhhh----hhhhhh Q lcl|NC_019916. 81 TRFGERTEIVYTDTDVPYEFTWAIHEGLDRFTVNNDLNAAVADRLDLQAQAKVRMFNNALGKKLADASTD----LGAVDD 156 (286) Q Consensus 81 ~RFG~rkEIiy~DtdVpY~~~~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~~~gk~ls~~a~~----~~t~d~ 156 (286) +.+-.| +. +-+.|.+.. +++..-+-|+ +++.++.++.++.+.++..+-..+..+..+ ..+.|. T Consensus 78 ----~~~~~i--~~---~~~~~~i~D-~~~~~~~~d~---~~~~~~q~~~~~a~~vd~~~l~~~~~a~~~~~~~a~~~d~ 144 (274) T protein:vir:12 78 ----KREAKI--RK---IAKGTSITD-EALLSGYGDP---QGEQVRQHGLAHANKVDNDVLEALMGAKLTVNADITKLNG 144 (274) T ss_pred ----eeeEEe--ee---ecceeeecH-HHHHhcccch---HHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCHHH Confidence 222222 11 123566655 6777777676 566777888999999999888877554322 223444 Q ss_pred HHHHHHHHhhhhhceeeeEEEEEEECchhhhhhhccccccccccce--eeeccCcee-eecCeEEEEchHHhhc-CceEE Q lcl|NC_019916. 157 VNVMFETASAKYTNLEVVVPVRAYVTADVYNAIIDHNLVTSQKGSA--VNIDENGIV-RFRDIIITKVPEKYMQ-GKAIM 232 (286) Q Consensus 157 V~klF~~~~~~yvn~ev~~~~~ayV~~evYNaIvD~~l~Ts~K~Ss--~NiD~ngi~-~fKgf~l~e~p~~y~q-g~~~i 232 (286) +...-.++ +.+-..+-.+.|+|++|..|.-.++-...+.|. .++=.||.+ +|.||.+-+.+ .+. +...+ T Consensus 145 i~dA~~~l-----gd~~~~~~~ivv~p~~~~~L~k~~~~~fv~~s~~g~~~~~~G~ig~~~G~~Vi~s~--~~p~~t~~l 217 (274) T protein:vir:12 145 LQSAIDKF-----NDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRSN--KLEAGTAIL 217 (274) T ss_pred HHHHHHHh-----ccccccccEEEeCHHHHHHHHhhhhhhccccccccccceecccceeecCeeEEEeC--CCCcceEEE Confidence 43322222 223335567999999999998776543333332 233344443 79999988764 344 77788 Q ss_pred Eeecceee---eccceEEEEEeeccCccceeeeecccccccCCCcCcceEEEEecCC Q lcl|NC_019916. 233 FVPDNIGR---AFTGIVTTRTIESEDFDGVALQGAGKAGSFILDDNKAAIFSATPKA 286 (286) Q Consensus 233 fs~dnIg~---af~GI~taRtieSEDFdGVaLQgAgK~G~~IlddNKkAI~k~t~ka 286 (286) |.+..+|. +-+-+|+-|-+.+ ---.|-+---||..+.+..|-..++ ... T Consensus 218 ~~~gA~~~~~~~~~~vE~~Rd~~~---~~d~i~~~~~y~~~~~~~~~vv~~t--~~~ 269 (274) T protein:vir:12 218 AKKGAVKLILKRDFFLEVARDAST---KTTALYSDKHYVAYLYDESKAVKIT--KGS 269 (274) T ss_pred EeccceeeeecCCceeccccchhh---cccEEEeeeEEEEEEEcCCceEEEE--cCC Confidence 88777764 2233666664443 3346777777999999877765555 333 No 42 >protein:vir:94576 Length: 347 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919012;genbank:gi:119637776;genbank:GeneID:5179336 Probab=92.07 E-value=0.013 Score=30.93 Aligned_cols=256 Identities=18% Similarity=0.123 Sum_probs=123.3 Q ss_pred CC---CC-----cccce------eeeechhHHHHHHHHHhhhhhhhhhhcccchhcCCcccceeEEEeecccceEeec-- Q lcl|NC_019916. 1 MA---TN-----NNNLA------ARTYTKQFAQLMQTVFGAQSVFGPTFGDLQALDGVQNNATAFSVKTNNVPVVVGE-- 64 (286) Q Consensus 1 M~---t~-----nnn~a------~r~Y~kq~~~ll~~vf~~qa~F~~~fgglQ~lDGVqnN~taf~vKtnd~pVvvg~-- 64 (286) |+ +- +..++ .-+|-|+|-+-+.+-|+.++.|++..-- +..-| =|+--.| .||+ T Consensus 1 ma~~~~~~~~~t~~g~~~~~~d~~al~ie~~~geV~~~f~~~s~~~~~~~~-rti~~---------G~sv~~~-~iG~~~ 69 (347) T protein:vir:94 1 MANMNGGQQMGKDQGKGMSAGDKLALFLKVFGGEVLTAFTRTSVTMNKHLV-RSIQS---------GKSAQFP-VLGRTK 69 (347) T ss_pred CCccccccccccccccCCcccchHHHHHHHHhHHHHHHHHHHHhhhhhhhh-eeccc---------cceEEee-ecccee Confidence 43 11 22212 2279999999999999999999987643 22111 1222223 2332 Q ss_pred ---ccCCCcceeecCcCCcccccceeEEEEecccccccccchhhhccccccccCChhHHHHHHHhhHHHHHHHHHHHHHH Q lcl|NC_019916. 65 ---YSQDEAVAFGAGTAKSTRFGERTEIVYTDTDVPYEFTWAIHEGLDRFTVNNDLNAAVADRLDLQAQAKVRMFNNALG 141 (286) Q Consensus 65 ---Y~td~NvaFGtGTg~s~RFG~rkEIiy~DtdVpY~~~~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~~~g 141 (286) |..+.+.. ++.+.-...++ +|-.|+..-+.+. =+=||+.+.+=|+.+..+ ..+++|=.|.+|+.+= T Consensus 70 ~~~~~~G~~l~---~~~~~~~~~e~--~ltID~~~y~~~~---VddiD~~q~~~D~rs~~~---~~~g~ALA~~~D~~i~ 138 (347) T protein:vir:94 70 AAYLQPGENLD---DKRKDMKHTEK--TINIDGLLTADVL---IYDIEDAMNHYDVRSEYT---AQLGESLAMAADGAVL 138 (347) T ss_pred EeeeecCcCCC---CCcCCccccce--EEEEcchhhhhhh---hhhHHHHhcCcchHHHHH---HHHHHHHHHHHHHHHH Confidence 23222211 11111122332 3556654433321 135777777777766655 5678899999997664 Q ss_pred HHHhhhhh-------------------------hh----hhhhhHHHHHHHHhhhhhceeee-EEEEEEECchhhhhhhc Q lcl|NC_019916. 142 KKLADAST-------------------------DL----GAVDDVNVMFETASAKYTNLEVV-VPVRAYVTADVYNAIID 191 (286) Q Consensus 142 k~ls~~a~-------------------------~~----~t~d~V~klF~~~~~~yvn~ev~-~~~~ayV~~evYNaIvD 191 (286) .-|...+. .. .+.+.+...|-.+.+......|- .+.+++|+|+.|..|+. T Consensus 139 ~~l~~~a~~~~~~~~~~~g~~~~~~v~i~~~~~~~~~~~~~~~~~~d~i~~a~~~Lde~dVP~~~R~~vv~P~~y~~LLk 218 (347) T protein:vir:94 139 AEMAKLCNLPTANNENIAGLGKAHVLEVGDQATLQGDQVKLGQAIIAQLTLARAKLTGNYVPSSDRVFYTTPDNYSAILA 218 (347) T ss_pred HHHHHhhccccccccccccCCcceeEeeeccccccccccccHHHHHHHHHHHHHHhhhcCCCCCCCEEEeChHHHHHHHH Confidence 33321110 00 01112223343444444444443 47899999999999997 Q ss_pred cccccccccceeeeccCc-eeeecCeEEEEchHHhhcCceEEEeecceeeeccceEEE-EEeeccCccceeeeecccccc Q lcl|NC_019916. 192 HNLVTSQKGSAVNIDENG-IVRFRDIIITKVPEKYMQGKAIMFVPDNIGRAFTGIVTT-RTIESEDFDGVALQGAGKAGS 269 (286) Q Consensus 192 ~~l~Ts~K~Ss~NiD~ng-i~~fKgf~l~e~p~~y~qg~~~ifs~dnIg~af~GI~ta-RtieSEDFdGVaLQgAgK~G~ 269 (286) ..-.++....+.+...+| +.+.-||.|-+.|.-=..+- .-.+-+=|.++++.--+ +.-++++.+|-.=..+ |- T Consensus 219 ~~~~~~~~~~~~~~~~~G~V~~v~G~~V~~Sn~~p~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~y~~d~~~~~---~l 293 (347) T protein:vir:94 219 ALMPNAANYQALIDPSTGSIRNVMGFEVIEVPHLTAGGA--GDNRAEEGVAPTNQKHAFPDTASGDTRVALDNVV---GL 293 (347) T ss_pred hhcccccccccccccccceeEEeeceEEEEcCccccccC--cccccccccccccccccccccccccccccccceE---EE Confidence 655666555556655555 56788888877765322110 11122222223332110 1112222221000000 11 Q ss_pred cCCCcCcceEEEEecCC Q lcl|NC_019916. 270 FILDDNKAAIFSATPKA 286 (286) Q Consensus 270 ~IlddNKkAI~k~t~ka 286 (286) +. .+.|+..|.++. T Consensus 294 ~~---~~~A~~tv~~~~ 307 (347) T protein:vir:94 294 FN---HRSAVGTVKLKD 307 (347) T ss_pred Ee---chhhhhhhhhcc Confidence 11 333444443333 No 43 >protein:vir:94622 Length: 341 # NCBI annotation: PfWMP4_37 # Family: family:all:2203 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762667;genbank:gi:115304375;genbank:GeneID:5142322 Probab=91.30 E-value=0.017 Score=30.36 Aligned_cols=262 Identities=12% Similarity=0.024 Sum_probs=113.3 Q ss_pred CCCCccc-------ceeeee-chhHHHHHHHHHhhhhhhhhhhcccchhcCCcccceeEEEeecccceEeecccCCCcce Q lcl|NC_019916. 1 MATNNNN-------LAARTY-TKQFAQLMQTVFGAQSVFGPTFGDLQALDGVQNNATAFSVKTNNVPVVVGEYSQDEAVA 72 (286) Q Consensus 1 M~t~nnn-------~a~r~Y-~kq~~~ll~~vf~~qa~F~~~fgglQ~lDGVqnN~taf~vKtnd~pVvvg~Y~td~Nva 72 (286) |+--|-- --++-| .+.|.+.+.+.|+++..|++..-.. -.++ .+.+| +.+..-. .+.+..|..+.... T Consensus 1 ~~~~~~~~~~~~~t~~v~~fipei~s~~i~~~l~~~~v~~~~~~d~-~~~~-~~Gdt-v~ip~~g-~~~~~d~~~~~~i~ 76 (341) T protein:vir:94 1 MALGNTITGPSINTQRGQQFIPEQWLSEVQMFRKAKMLDTSVVKTW-GAQV-KKGDT-FHVPRIS-ELGVEDKATDVPVG 76 (341) T ss_pred CcchhhhccccccchhHHHHHHHHHHHHHHHHHHhhcchhhccccc-cccc-cCCce-EEEeccC-cceeeeecCCCccc Confidence 4332211 011112 4777888999999988887754221 1122 22222 2222212 12355676554444 Q ss_pred eecCcCCcccccceeEEEEecccccccccchhhhccccccccCChhHHHHHHHhhHHHHHHHHHHHHHHHHHhhhhhhh- Q lcl|NC_019916. 73 FGAGTAKSTRFGERTEIVYTDTDVPYEFTWAIHEGLDRFTVNNDLNAAVADRLDLQAQAKVRMFNNALGKKLADASTDL- 151 (286) Q Consensus 73 FGtGTg~s~RFG~rkEIiy~DtdVpY~~~~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~~~gk~ls~~a~~~- 151 (286) +..-+.. -++ +-.|+..=+.+.+ ..+|+...+-|+-+ +.+..|++|=.|.++..+-..++..+... T Consensus 77 ~~~~~~~------~~~-itiD~~~~~~~~i---~d~d~~~~~~d~~~---~~~~~~~~aLA~~~D~~i~~~~a~~~~~~~ 143 (341) T protein:vir:94 77 VQPVNDT------DFV-ITVDTDRTTAVAL---DDLLEIQASYDLRA---PYLEAMGYALAKDMTGSILGLRAAVQNTAS 143 (341) T ss_pred cccccCc------eEE-EEEeeeeecceee---chHHHHhhccchHH---HHHHHHHHHHHHHHHHHHHHHhhhcccccc Confidence 3322211 122 2233332222221 25666666666544 44456777888888888766664433211 Q ss_pred -----------------hhhhhHHHHHHHHhhhhhceeee-EEEEEEECchhhhhhhccccccc-cccceeeeccCce-e Q lcl|NC_019916. 152 -----------------GAVDDVNVMFETASAKYTNLEVV-VPVRAYVTADVYNAIIDHNLVTS-QKGSAVNIDENGI-V 211 (286) Q Consensus 152 -----------------~t~d~V~klF~~~~~~yvn~ev~-~~~~ayV~~evYNaIvD~~l~Ts-~K~Ss~NiD~ngi-~ 211 (286) .+.|.+..+...+.+ .+|- ..-.++|+|+.|..|.-.+.-+. ....+.-+ .+|. - T Consensus 144 ~~~~~~~~~~~t~~~~~~~~~~i~~a~~~Lde----~~VP~~gR~lvv~P~~~~~Ll~~~~~~~~~~~g~~~l-~~G~ig 218 (341) T protein:vir:94 144 QNVFSSSNGAITGNGQAFSFAVFLAARRLLLE----ADVPEEKIVLLISPGQESALFTIPQFISKDFINNAPI-AQGQIG 218 (341) T ss_pred CccccCccccccCchhhhhHHHHHHHHHHHhh----cCCCccCCEEEeCHHHHHHHhhchhhhhhhccccchh-heeeee Confidence 112334444444444 4442 34578999999999975543332 22222223 4564 4 Q ss_pred eecCeEEEEchHHhhcCceEEEeecceee-----eccceEEEEEeeccCccceeeeecc-----ccc------------- Q lcl|NC_019916. 212 RFRDIIITKVPEKYMQGKAIMFVPDNIGR-----AFTGIVTTRTIESEDFDGVALQGAG-----KAG------------- 268 (286) Q Consensus 212 ~fKgf~l~e~p~~y~qg~~~ifs~dnIg~-----af~GI~taRtieSEDFdGVaLQgAg-----K~G------------- 268 (286) ++-||.|-+.+.--.. . +.-.+.+-+. +-.+|+-.++.-.++-+.-...|-. =++ T Consensus 219 ~i~G~~V~~Sn~lp~~-~-~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~gl~~~~~av~~~k~~~~~~~~~~~ 296 (341) T protein:vir:94 219 SLMGVRVIRTSLIGNN-S-ATGWRNGAPTIAPAEATPGFTGSRYLPKQDSFTSLPATFTGNSRPVHTAVMCHMDWAAAVV 296 (341) T ss_pred eEeceEEEEecccccc-c-cccccccccceecccccccccccccccccccccccEEEEEEecccccceeeecchhhhccc Confidence 8899999884431111 1 1101111110 1123333343333333333333321 001 Q ss_pred ----------------ccCCCc--------CcceEEEEec-CC Q lcl|NC_019916. 269 ----------------SFILDD--------NKAAIFSATP-KA 286 (286) Q Consensus 269 ----------------~~Ildd--------NKkAI~k~t~-ka 286 (286) .+|.-+ +-.+++.+.. .+ T Consensus 297 ~~~~~~~~~~~~~~~~~~i~~~~~~G~~~lrp~~~v~~~~~~~ 339 (341) T protein:vir:94 297 SKAPRVTQSFENREQVWLMVGRQAYGARLYRPLHAVNIHTTGD 339 (341) T ss_pred cccccccccchhhhhhhhhhhhhhhcccccCcceeEEEecCcC Confidence 111111 1111111111 11 No 44 >protein:vir:3364 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523335;genbank:gi:17570826;genbank:GeneID:927448 Probab=90.52 E-value=0.02 Score=29.85 Aligned_cols=264 Identities=16% Similarity=0.135 Sum_probs=127.4 Q ss_pred CC---CC-----cccce------eeeechhHHHHHHHHHhhhhhhhhhhcccchhcCCcccceeEEEeecccceEeeccc Q lcl|NC_019916. 1 MA---TN-----NNNLA------ARTYTKQFAQLMQTVFGAQSVFGPTFGDLQALDGVQNNATAFSVKTNNVPVVVGEYS 66 (286) Q Consensus 1 M~---t~-----nnn~a------~r~Y~kq~~~ll~~vf~~qa~F~~~fgglQ~lDGVqnN~taf~vKtnd~pVvvg~Y~ 66 (286) |+ +- +..++ .-+|-|+|.+.+.+-|++++.|++..-- +..-| -+...+..-. ++-++.|. T Consensus 1 ~~~~~~~~~~~t~~g~~~~~~~~~al~ie~~~g~V~~~f~~~s~~~~~v~~-r~~~~----G~sv~i~~iG-~~t~~~~~ 74 (347) T protein:vir:33 1 MANIQGGQQIGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTMPRHML-RSIAS----GKSAQFPVIG-RTKAAYLK 74 (347) T ss_pred CCCCccCcccccccccCCcccchHHHHHHHHHHHHHHHHHHHHhhhhhhcc-ccccc----cceeEeeecc-ceeeeeec Confidence 44 11 12112 2378899999999999999999887643 21111 1111111111 12234454 Q ss_pred CCCcceeecCcCCccccc--ceeEEEEecccccccccchhhhccccccccCChhHHHHHHHhhHHHHHHHHHHHHHHHHH Q lcl|NC_019916. 67 QDEAVAFGAGTAKSTRFG--ERTEIVYTDTDVPYEFTWAIHEGLDRFTVNNDLNAAVADRLDLQAQAKVRMFNNALGKKL 144 (286) Q Consensus 67 td~NvaFGtGTg~s~RFG--~rkEIiy~DtdVpY~~~~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~~~gk~l 144 (286) .+....- ++-. .-+-+|..|+..-+.+ +=+-||+...+=|+.+..+ .-++.|=.|.+|..+-..| T Consensus 75 ~g~~l~~-------~~~~~~~~e~~ltiD~~~y~~~---~VddiD~~q~~~D~~~~~~---~~~g~aLA~~~D~~i~~~l 141 (347) T protein:vir:33 75 PGENLDD-------KRKDIKHTEKVIHIDGLLTADV---LIYDIEDAMNHYDVRAEYT---AQLGESLAMAADGAVLAEL 141 (347) T ss_pred CCCCCCC-------CCCCCccceEEEEechhhhhhH---HHhhHHHHhcCCchhHHHH---HHHHHHHHHHHHHHHHHHH Confidence 4332211 1111 1122344555432221 2346888888878766554 4567888888888775444 Q ss_pred hhhh--------------------h-h--hh-------hhhhHHHHHHHHhhhhhceeee-EEEEEEECchhhhhhhccc Q lcl|NC_019916. 145 ADAS--------------------T-D--LG-------AVDDVNVMFETASAKYTNLEVV-VPVRAYVTADVYNAIIDHN 193 (286) Q Consensus 145 s~~a--------------------~-~--~~-------t~d~V~klF~~~~~~yvn~ev~-~~~~ayV~~evYNaIvD~~ 193 (286) .... . . ++ +.+.+.+.+-.+.+.....+|- ..-.+.|+|+.|.+|+.++ T Consensus 142 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tg~~~d~~~~a~~i~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~ 221 (347) T protein:vir:33 142 AGLVNLPDGSNENIEGLGKPTVLTLVKPTTGSLTDPVELGKAIIAQLTIARASLTKNYVPAADRTFYTTPDNYSAILAAL 221 (347) T ss_pred HHhhhhhcccccccccccccccccccccccccccchhhhHHHHHHHHHHHHHHHhhcCCCccCcEEEeCHHHHHHHhccc Confidence 2210 0 0 00 1122333333344444444442 3567889999999999988 Q ss_pred cccccccceeeeccCc-eeeecCeEEEEchHHhhcCc-------------------------------eEEEeecceeee Q lcl|NC_019916. 194 LVTSQKGSAVNIDENG-IVRFRDIIITKVPEKYMQGK-------------------------------AIMFVPDNIGRA 241 (286) Q Consensus 194 l~Ts~K~Ss~NiD~ng-i~~fKgf~l~e~p~~y~qg~-------------------------------~~ifs~dnIg~a 241 (286) -.++..-.+...=.+| +.+.-||.|-+.+.=-..+. -++|.++-+|.. T Consensus 222 ~~~~~d~~~~~~~~~G~V~~i~G~~V~~Sn~lp~~~~~~~~~~~~ag~~~~~~~~~~~~~~~a~~~~~gl~~h~~A~g~v 301 (347) T protein:vir:33 222 MPNAANYQALLDPERGTIRNVMGFEVVEVPHLTAGGAGDTREDAPADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTV 301 (347) T ss_pred cccccccccccccccceeEEEeceeEEEecccccCccccccccccccccccccCCcccceeccccceeeeeecchhheee Confidence 6665433322223456 45889999887664211100 123334433311 Q ss_pred -ccc--eEEEEEeeccCccceeeeecccccccCCCcCcceEEEEecCC Q lcl|NC_019916. 242 -FTG--IVTTRTIESEDFDGVALQGAGKAGSFILDDNKAAIFSATPKA 286 (286) Q Consensus 242 -f~G--I~taRtieSEDFdGVaLQgAgK~G~~IlddNKkAI~k~t~ka 286 (286) ..+ ++..| .++.-|=.+-|-=-||-=+++.....-+|...-+ T Consensus 302 ~~~~~~~e~~r---~~~~~~d~i~~~~~~G~~vlrP~~av~i~~~~~~ 346 (347) T protein:vir:33 302 KLKDLALERAR---RANYQADQIIAKYAMGHGGLRPEAAGAIVLPKVS 346 (347) T ss_pred eeeceeeeecc---chhhhhHhhhhhhhcCCceecccceEEEecCCCC Confidence 111 22222 2333333333333344445555544444322222 No 45 >protein:vir:3613 Length: 272 # NCBI annotation: MHP # Family: family:all:522 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112699;genbank:gi:13786567;genbank:GeneID:921035 Probab=86.19 E-value=0.047 Score=27.85 Aligned_cols=253 Identities=13% Similarity=0.121 Sum_probs=135.4 Q ss_pred CCCCcccceeeeechhHHHHHHHHHhhhhhhhhhhcccchhcCCcccceeEEEeecccceE--eec---ccCCCcceeec Q lcl|NC_019916. 1 MATNNNNLAARTYTKQFAQLMQTVFGAQSVFGPTFGDLQALDGVQNNATAFSVKTNNVPVV--VGE---YSQDEAVAFGA 75 (286) Q Consensus 1 M~t~nnn~a~r~Y~kq~~~ll~~vf~~qa~F~~~fgglQ~lDGVqnN~taf~vKtnd~pVv--vg~---Y~td~NvaFGt 75 (286) |+...--++-=+=-.-|..+++.=|.++..|.+.---...+.|..-+ | -.+|.. +|+ |.-++..+.+. T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~-t------i~iP~~~~~gda~~~~eg~~i~~~~ 73 (272) T protein:vir:36 1 MSKQKTTLADLVNPEVLAPIVSYELNKALRFAPLAQVDTTLQGQPGN-T------LKFPAFTYIGDAADVAEGGEISLDK 73 (272) T ss_pred CCCcceehhhhhchHHHHHHHHHHHHhhhhhccccccccccccCCCC-E------EEEeeeccCccccccCCCCccChhh Confidence 87543333322223346666766666666665432113334443211 1 122221 122 22222222221 Q ss_pred CcCCcccccceeEEEEecccccccccchhhhccccccccCChhHHHHHHHhhHHHHHHHHHHHHHHHHHhhhhhhh---h Q lcl|NC_019916. 76 GTAKSTRFGERTEIVYTDTDVPYEFTWAIHEGLDRFTVNNDLNAAVADRLDLQAQAKVRMFNNALGKKLADASTDL---G 152 (286) Q Consensus 76 GTg~s~RFG~rkEIiy~DtdVpY~~~~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~~~gk~ls~~a~~~---~ 152 (286) =| .++.+-.| + -+-+.|.+.. +|+..-.-|+ +.+..+.++.+|.|.+++.+-..|+.+..+. . T Consensus 74 lt-----~~~~~~~i--~---~~~k~~~vtD-~~~~~~~~d~---~~~~~~~~a~~~a~~~d~~i~~~l~~~~~~~~~~~ 139 (272) T protein:vir:36 74 IG-----TTTKSVTI--K---KAAKGTEITD-EAALSGYGDP---IGESNKQLGLSLANKVDDDLLSAAKTTSQTVSTKA 139 (272) T ss_pred cC-----CcceeEee--e---hhhccccccH-HHHhhccchH---HHHHHHHHHHHHHHHHHHHHHHHhccccccccccc Confidence 11 12222111 1 1233455533 6666666665 4555566778999999998877776544332 3 Q ss_pred hhhhHHHHHHHHhhhhhceeeeEEEEEEECchhhhhhhccccccccccce-eeeccCc-eeeecCeEEEEchHHhhc-Cc Q lcl|NC_019916. 153 AVDDVNVMFETASAKYTNLEVVVPVRAYVTADVYNAIIDHNLVTSQKGSA-VNIDENG-IVRFRDIIITKVPEKYMQ-GK 229 (286) Q Consensus 153 t~d~V~klF~~~~~~yvn~ev~~~~~ayV~~evYNaIvD~~l~Ts~K~Ss-~NiD~ng-i~~fKgf~l~e~p~~y~q-g~ 229 (286) +.|.+......+..... .+-...|+|.+|..|.-.+..+....+. .++-.|| |-+|-|+.+-+... +. |. T Consensus 140 ~~d~i~~A~~~lgd~~~-----~~~~ivv~p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~~--~p~~~ 212 (272) T protein:vir:36 140 NVDGVQAALDIFNDEDA-----QAYVLIVNPKDAAKIRKDANAKNIGSEVGANALINGTYADVLGAQIVRSKK--LAEGS 212 (272) T ss_pred cHHHHHHHHHHhhhcCC-----CceEEEEcHHHHHHHhcccccccccccccccceeeeccceecCeeEEEeCC--CCCCc Confidence 55666666665554433 2456999999999997766655554332 3344455 45899998866442 33 44 Q ss_pred e----EEEeecceee---eccceEEEEEeeccCccceeeeecccccccCCCcCcceEEEEecCC Q lcl|NC_019916. 230 A----IMFVPDNIGR---AFTGIVTTRTIESEDFDGVALQGAGKAGSFILDDNKAAIFSATPKA 286 (286) Q Consensus 230 ~----~ifs~dnIg~---af~GI~taRtieSEDFdGVaLQgAgK~G~~IlddNKkAI~k~t~ka 286 (286) . ++|.+..+|. .-+-+|+-|-+. .-.-.|-|---||..++++.|-+ ++|-|- T Consensus 213 ~~~~~~~~~~gA~~~~~~~~~~vE~~R~~~---~~~d~i~~~~~y~~~v~~~~~vv--~~t~~g 271 (272) T protein:vir:36 213 ALMFKIVSNSPALKLVLKRGVQVETDRDIV---TKTTVITADEHYAAYLYDLTKVV--NITFTG 271 (272) T ss_pred eeEEEEEecccceeeeecCCcccccccchh---hcCcEEEEEEEEEEEEEcCccEE--EEeecC Confidence 3 4566777762 233466666333 33456777777899998876654 445455 No 46 >protein:vir:99075 Length: 392 # NCBI annotation: gp30 # Family: family:all:10837 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655895;genbank:gi:109521467;genbank:GeneID:4158040 Probab=78.98 E-value=0.11 Score=25.86 Aligned_cols=262 Identities=13% Similarity=0.057 Sum_probs=109.3 Q ss_pred ceeeeechh-HHHHHHHHHhhhhhhhhhh-----cccchhcCCcccceeEEEeecccceEeecccCCCcceeecCcCCcc Q lcl|NC_019916. 8 LAARTYTKQ-FAQLMQTVFGAQSVFGPTF-----GDLQALDGVQNNATAFSVKTNNVPVVVGEYSQDEAVAFGAGTAKST 81 (286) Q Consensus 8 ~a~r~Y~kq-~~~ll~~vf~~qa~F~~~f-----gglQ~lDGVqnN~taf~vKtnd~pVvvg~Y~td~NvaFGtGTg~s~ 81 (286) ||..++.+| ++..+-..|++...|.+.. |.+.. +.++| +.++.-. ++.+..|. ..+.++|....-.. T Consensus 1 Ma~~~~~p~~~a~~~l~~l~~~lv~~~lv~~~~~~~~~~----~~Gdt-V~i~~~~-~~~~~~~~-~~~~~~~~~~~~~~ 73 (392) T protein:vir:99 1 MANAFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAH----KFNDT-ITVRVPA-PSRGHTRK-LRGAGAERNLTVSD 73 (392) T ss_pred CccccccHHHHHHHHHHHHHhhccchhhhcccccccccc----CCCCe-EEEeecc-cccceeee-ccccccCCcccccc Confidence 555567776 7777777788888775532 11211 12222 3444322 34555554 22233332221111 Q ss_pred cccceeEEE-EecccccccccchhhhccccccccCChhHHHHHHHhhHHHHHHHHHHHHHHHHHhhhhhhh------hhh Q lcl|NC_019916. 82 RFGERTEIV-YTDTDVPYEFTWAIHEGLDRFTVNNDLNAAVADRLDLQAQAKVRMFNNALGKKLADASTDL------GAV 154 (286) Q Consensus 82 RFG~rkEIi-y~DtdVpY~~~~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~~~gk~ls~~a~~~------~t~ 154 (286) .--.-.++. =.+..+++.++ .+|+..... ...++.|+-|.+|=.+.++..+.+.+..+.... .+. T Consensus 74 ~~~~~~~~~id~~k~~~~~i~-----d~e~~~~~~---~~~~~~~~~a~~ala~~vd~~i~~~~~~a~~~~~~~~~~~~~ 145 (392) T protein:vir:99 74 FTEDSFPVTLTDVAYHLGVLT-----DEELTFDLE---SFATQILPRQVRGVADILEEGVRDMIVGAPYEAAGAVHEVAP 145 (392) T ss_pred cccceEEEEEeeeeecceeec-----hHHHhhhhh---hhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccCh Confidence 111222232 23333333332 445444333 345666777888888888888877775433211 111 Q ss_pred hhHHHHHHHHhhhhhceeeeEEEEEEECchhhhhhhccccccccccce---eeeccCcee-eecCeEEEEchHHhhcCce Q lcl|NC_019916. 155 DDVNVMFETASAKYTNLEVVVPVRAYVTADVYNAIIDHNLVTSQKGSA---VNIDENGIV-RFRDIIITKVPEKYMQGKA 230 (286) Q Consensus 155 d~V~klF~~~~~~yvn~ev~~~~~ayV~~evYNaIvD~~l~Ts~K~Ss---~NiD~ngi~-~fKgf~l~e~p~~y~qg~~ 230 (286) +...+-|-.+.+..-..+|-..-++.|+|+.|.+|.-.+-.+....+. ..+=.+|.+ ++-||.+-+-+.--. +.. T Consensus 146 ~~~~~~i~~a~~~L~~~~vP~~R~~vv~p~~~~~l~~~~~~~~~~~~g~~~~~~l~~G~vg~i~G~~v~~s~~~~~-~t~ 224 (392) T protein:vir:99 146 DEFFKGVNGARRALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYEIVESTLIPH-GDA 224 (392) T ss_pred hhhHHHHHHHHHHHhhcCCCCCCEEEEcHHHHHHHhcccceeecccccchhhhhhhcceeeeeeeeEEEeeccccc-ccc Confidence 222233333333333333333468889999999998665444332221 122234543 677777765443211 222 Q ss_pred EEEeecceeeeccceEE-------EEEee---------ccCccceeeeeccc----cccc-CCCcCcceEEE-EecCC Q lcl|NC_019916. 231 IMFVPDNIGRAFTGIVT-------TRTIE---------SEDFDGVALQGAGK----AGSF-ILDDNKAAIFS-ATPKA 286 (286) Q Consensus 231 ~ifs~dnIg~af~GI~t-------aRtie---------SEDFdGVaLQgAgK----~G~~-IlddNKkAI~k-~t~ka 286 (286) +.|.+..+..+ ++..+ ..... .-++++...+...- .|.. +.+.+...+.. ...++ T Consensus 225 ~a~~~~a~~~a-t~a~v~~~~~~~~~s~s~~~~v~~~~~~~~~~t~~s~~~~v~~~~g~~~v~~~~~~~~~~~~~~~~ 301 (392) T protein:vir:99 225 YLYHPTAFIMA-TRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHL 301 (392) T ss_pred eeeeccccccc-cccccccccccceeEEecccceecceeecccceeeccccccceeEEEEEEeeccccceeeeeeeee Confidence 33333322110 00000 00000 01112221111000 0000 11111111111 00111 No 47 >protein:vir:105645 Length: 400 # NCBI annotation: putative major capsid protein # Family: family:all:2806 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425009;genbank:gi:83571757;uniprot:Q2WC43;genbank:GeneID:3837286 Probab=77.79 E-value=0.12 Score=25.61 Aligned_cols=238 Identities=13% Similarity=0.101 Sum_probs=126.2 Q ss_pred CCCCccc---------ceeeeechhHHHHHHHHHhhhhhhhhhhcccchhcCCcccceeEEEeecccceE----eecccC Q lcl|NC_019916. 1 MATNNNN---------LAARTYTKQFAQLMQTVFGAQSVFGPTFGDLQALDGVQNNATAFSVKTNNVPVV----VGEYSQ 67 (286) Q Consensus 1 M~t~nnn---------~a~r~Y~kq~~~ll~~vf~~qa~F~~~fgglQ~lDGVqnN~taf~vKtnd~pVv----vg~Y~t 67 (286) |.+-|+. -+.-+|-|+|-|.+.+-|+.++.|++.+- .+++.| =|+--.|.+ ++-+.. T Consensus 1 Ms~~n~~t~p~~~gsg~~~aL~Le~f~GeV~taF~~~si~~~~~~-vRtI~~---------gkS~qf~~lG~s~a~y~~p 70 (400) T protein:vir:10 1 MSTPNNLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENIMSYFD-VQTVTG---------TNTVSNKYLGETELQVLAP 70 (400) T ss_pred CCCCccccccccccccchhhhHHhHhcchHHHHHHHHhhhcccce-eeeecc---------cceEEEEEeeeeEEeeecC Confidence 7776542 35678999999999999999999997653 333222 122333433 222444 Q ss_pred CCcceeecCcCCcccccceeEEEEeccccccccc-chhhhcccccc-ccCChhHHHHHHHhhHHHHHHHHHHHHHHHHHh Q lcl|NC_019916. 68 DEAVAFGAGTAKSTRFGERTEIVYTDTDVPYEFT-WAIHEGLDRFT-VNNDLNAAVADRLDLQAQAKVRMFNNALGKKLA 145 (286) Q Consensus 68 d~NvaFGtGTg~s~RFG~rkEIiy~DtdVpY~~~-~aiHEGiDr~T-VNndl~aavAdRl~Lqa~Ak~~~~n~~~gk~ls 145 (286) ++.. .|+ + .---|-+|-.|+.+--... |-|||=.+.|+ | . ++-...+.+|-.++||..+=..+- T Consensus 71 G~~l---dg~--~--~~~dk~~ItIDtLL~a~~~V~dlDd~q~~yD~v----R---se~s~e~G~ALA~~~Dq~iiq~i~ 136 (400) T protein:vir:10 71 GQSP---AAT--S--TQADKNQLVIDATVIARNTVAHLHDVQGDIDSL----K---PKLATNQAKQLKKMEDEMLIQQML 136 (400) T ss_pred CCCc---CCC--C--cccCcEEEEeCceeeecchhhhHHHHhhccccc----c---HHHHHHHHHHHHHHHHHHHHHHHH Confidence 5543 122 3 3333667888887654433 35566555554 4 2 444456778888888875432221 Q ss_pred hh--h------------hh------------h-hhhhh----HHHHHHHHhhhhhceeeeEEEEEEECchhhhhhhccc- Q lcl|NC_019916. 146 DA--S------------TD------------L-GAVDD----VNVMFETASAKYTNLEVVVPVRAYVTADVYNAIIDHN- 193 (286) Q Consensus 146 ~~--a------------~~------------~-~t~d~----V~klF~~~~~~yvn~ev~~~~~ayV~~evYNaIvD~~- 193 (286) .+ | +. . ...+. +..++..+.+++|..+ ..+.+++|+.|++|.+++ T Consensus 137 ~a~~a~t~~~~~~~~g~~~g~s~~v~~~~~~~~~~~~~l~~A~~~A~~~LdEkdVP~~---d~vvl~pp~~Ys~Ll~~dk 213 (400) T protein:vir:10 137 LGGIANTQAKRTNPRVKGHGFSVNVEVNEGEALVNPQYVMAAVEFALEQQLEQEVDIS---DVAILMPWRYFNVLRDADR 213 (400) T ss_pred HhcccccccccccCCccccccceeecccccccccCHHHHHHHHHHHHHHHHhcCCCcc---ceEEEcCHHHHHHHHhCCc Confidence 10 0 00 0 01112 4456777778888733 578888999999999987 Q ss_pred cccccccceee--eccCceeeecCeEEEEchHHhhc---CceEEEeecceeeecc---ceEEEEEeeccCccceeeeecc Q lcl|NC_019916. 194 LVTSQKGSAVN--IDENGIVRFRDIIITKVPEKYMQ---GKAIMFVPDNIGRAFT---GIVTTRTIESEDFDGVALQGAG 265 (286) Q Consensus 194 l~Ts~K~Ss~N--iD~ngi~~fKgf~l~e~p~~y~q---g~~~ifs~dnIg~af~---GI~taRtieSEDFdGVaLQgAg 265 (286) +....=+-+.+ .=.-.+++.-|+.|.|.|.-=+. ...--.|+.+-|.+|- +...++.+ T Consensus 214 Lvnrdf~~s~~g~~~~g~v~~v~Gv~Iv~Sn~lP~~a~~~~~~~lS~a~~G~~y~~t~d~s~~~av-------------- 279 (400) T protein:vir:10 214 IVDKSYTISQSGATIQGFVLSSYNCPVIPSNRFPKYSQGQKHHLLSNEDNGYRYDPIAEMNGAIAV-------------- 279 (400) T ss_pred ccchhccccCCCccccceEEEEeceEEEeeCcCCcccCcccccccccCCCCccCCccccccceeEE-------------- Confidence 44443221111 11223456777777776543221 1112344444455443 11111111 Q ss_pred cccccCCCcCcceEEEEecCC Q lcl|NC_019916. 266 KAGSFILDDNKAAIFSATPKA 286 (286) Q Consensus 266 K~G~~IlddNKkAI~k~t~ka 286 (286) .|.++ |+..+.++. T Consensus 280 ---~F~~s----Av~tvk~~~ 293 (400) T protein:vir:10 280 ---LFTAD----ALLVGRSID 293 (400) T ss_pred ---EEehh----heEEEEeec Confidence 22222 444444443 No 48 >protein:vir:99675 Length: 324 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249589;genbank:gi:68299740;genbank:GeneID:3799990 Probab=77.50 E-value=0.12 Score=25.55 Aligned_cols=214 Identities=13% Similarity=0.123 Sum_probs=95.3 Q ss_pred HHHHHHhhhhhhhhhhcccchhcCCcccceeEEEeecccceEeecccCCCcceeecCcCCccc--ccceeEEEEeccccc Q lcl|NC_019916. 20 LMQTVFGAQSVFGPTFGDLQALDGVQNNATAFSVKTNNVPVVVGEYSQDEAVAFGAGTAKSTR--FGERTEIVYTDTDVP 97 (286) Q Consensus 20 ll~~vf~~qa~F~~~fgglQ~lDGVqnN~taf~vKtnd~pVvvg~Y~td~NvaFGtGTg~s~R--FG~rkEIiy~DtdVp 97 (286) |+++|=+-+++.-+..| .+-++-|..++..- +++ ...-+-+|-.|+..- T Consensus 1 ~vr~i~~g~s~~~~~iG----------------------~~~~~~~~~G~~l~-------~~~~~~~~~e~~itID~~l~ 51 (324) T protein:vir:99 1 MTRTITSGKSAQFPVMG----------------------RTKARYLKQGQSLD-------DGREDIKHTEKVITIDGLLT 51 (324) T ss_pred CeeeeecCceEEEeeee----------------------eeEeccccCCCCcC-------CCcCCcCcccEEEEecchhh Confidence 11111111111111110 01122233333211 111 111222455555443 Q ss_pred ccccchhhhccccccccCChhHHHHHHHhhHHHHHHHHHHHHHHHHHhhhh-----------h----------------h Q lcl|NC_019916. 98 YEFTWAIHEGLDRFTVNNDLNAAVADRLDLQAQAKVRMFNNALGKKLADAS-----------T----------------D 150 (286) Q Consensus 98 Y~~~~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~~~gk~ls~~a-----------~----------------~ 150 (286) +.+ +=+-||+.+.+=|+ ..+...-+++|-.+.+|+.+-..|...+ . . T Consensus 52 ~~~---~VdDiD~~qa~~Dl---r~e~s~~~G~aLA~~~Dq~i~~~~a~~~~~~a~~~~~~~~~~g~~~~~~~~~~~~~~ 125 (324) T protein:vir:99 52 TDV---LIYDIEDAMNHYDV---RSEYSTQMGEALAMAADVANYAEMAKLVNSRKETTNENIEGLGAASLVKITGKKEDP 125 (324) T ss_pred hhh---hhhhHHHHhcCccc---hhHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccCCcccCCccceeccccccccc Confidence 332 22467777766564 4566677889999999976643332110 0 0 Q ss_pred hhhhhhHHHHHHHHhhhhhceeee-EEEEEEECchhhhhhhccccccccccceee-eccCceeeecCeEEEEchHHhhcC Q lcl|NC_019916. 151 LGAVDDVNVMFETASAKYTNLEVV-VPVRAYVTADVYNAIIDHNLVTSQKGSAVN-IDENGIVRFRDIIITKVPEKYMQG 228 (286) Q Consensus 151 ~~t~d~V~klF~~~~~~yvn~ev~-~~~~ayV~~evYNaIvD~~l~Ts~K~Ss~N-iD~ngi~~fKgf~l~e~p~~y~qg 228 (286) ..+.+.+.+.|-.+.+......|- ..-.++|+|+.|.+|.|++..+...-.+.+ +=.-.|.+.-||.|-+.+.--.. T Consensus 126 ~~~~~~~~dai~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~~~~~~~~~~~G~V~~i~Gf~V~~Sn~lp~~- 204 (324) T protein:vir:99 126 AKYGTQVIQALTYARAAFAKKYIPAGDRTFYTDPDTYSAILAALMPNAANYAALIDPETGNIRNVMGFEVVETPHMTAQ- 204 (324) T ss_pred ccCHHHHHHHHHHHHHHHhhcCCCCCCCEEEeChHHHHHHhhcccccccccccccceecceEEEEeceEEEecCCcccc- Confidence 111223333343333333333332 356799999999999999887765333333 22234567778877766543222 Q ss_pred ceEEEeecceeeeccceEEEEEeeccCccceeeeecccc---cccCCCcCcceEEEEecCC Q lcl|NC_019916. 229 KAIMFVPDNIGRAFTGIVTTRTIESEDFDGVALQGAGKA---GSFILDDNKAAIFSATPKA 286 (286) Q Consensus 229 ~~~ifs~dnIg~af~GI~taRtieSEDFdGVaLQgAgK~---G~~IlddNKkAI~k~t~ka 286 (286) .|. +. .+.-++.|.+|+-+|-. ++|-+|..+-+=+-..+.| T Consensus 205 ---------~~t-----~~---~~a~~~~~~~~~~~~~~~~~~ky~~d~~~~~gl~~~~~a 248 (324) T protein:vir:99 205 ---------MVT-----NP---TDAFDGTGHIFPATGDSTTTGKMTVGADNVVGLFVHRSA 248 (324) T ss_pred ---------ccc-----cc---cccccccccccccccccccccccccccCceeEEEEehhh Confidence 111 11 12223344455444432 3444444333222222222 No 49 >protein:vir:97031 Length: 402 # NCBI annotation: 31 # Family: family:all:2806 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654132;genbank:gi:108862016;genbank:GeneID:5075980 Probab=75.95 E-value=0.14 Score=25.25 Aligned_cols=241 Identities=13% Similarity=0.108 Sum_probs=121.0 Q ss_pred CCCCcc---------cceeeeechhHHHHHHHHHhhhhhhhhhhcccchhcCCcccceeEEEeecccceEeec-----cc Q lcl|NC_019916. 1 MATNNN---------NLAARTYTKQFAQLMQTVFGAQSVFGPTFGDLQALDGVQNNATAFSVKTNNVPVVVGE-----YS 66 (286) Q Consensus 1 M~t~nn---------n~a~r~Y~kq~~~ll~~vf~~qa~F~~~fgglQ~lDGVqnN~taf~vKtnd~pVvvg~-----Y~ 66 (286) |..-|. .-..-+|-|+|-|.+.+-|+.++.|++.+=- +++.| =|+--.|.+ |+ |. T Consensus 1 Ms~~n~~t~~~~~~s~~~~al~le~f~geV~taF~~~si~~~~~~v-rti~~---------GkS~qf~~i-G~~~a~y~~ 69 (402) T protein:vir:97 1 MSTPNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDV-QTVTG---------TNTVSNKYL-GETELQVLA 69 (402) T ss_pred CCCcccccccccccccchhhhhhhhhhhhHHHHHHHHHhhcCccee-eeecc---------cceEEEEEE-eeeEEeeec Confidence 654432 2346789999999999999999999976643 22211 122333443 33 33 Q ss_pred CCCcceeecCcCCcccccceeEEEEecccccccccchhhhccccccccCC-hhHHHHHHHhhHHHHHHHHHHHHHHHHHh Q lcl|NC_019916. 67 QDEAVAFGAGTAKSTRFGERTEIVYTDTDVPYEFTWAIHEGLDRFTVNND-LNAAVADRLDLQAQAKVRMFNNALGKKLA 145 (286) Q Consensus 67 td~NvaFGtGTg~s~RFG~rkEIiy~DtdVpY~~~~aiHEGiDr~TVNnd-l~aavAdRl~Lqa~Ak~~~~n~~~gk~ls 145 (286) .++.. .|+ +.---|-+|-.|+-. |.--|.. =||.+.-+=| +.+.+++ -+++|-.|++|..+=..+. T Consensus 70 ~G~~l---dg~----~~~~~k~~ItID~lL-~a~~~V~--diDeaq~~yD~vRse~s~---e~G~ALA~~~Dq~ii~~i~ 136 (402) T protein:vir:97 70 PGQSP---NAT----PTQADKNQLVIDTTV-IARNTVA--HIHDVQGDIDSLKPKLAM---NQAKQLKRLEDQMAIQQML 136 (402) T ss_pred ccccc---CCC----CcccccEEEEeCcee-echhhhh--hHHHHHhcccchhHHHHH---HHHHHHHHHHHHHHHHHHH Confidence 34443 222 222235578899876 3333322 2555555555 5555544 4578888888886633221 Q ss_pred h--hh---hh---------------hh-------hh----hhHHHHHHHHhhhhhceeeeEEEEEEECchhhhhhhcccc Q lcl|NC_019916. 146 D--AS---TD---------------LG-------AV----DDVNVMFETASAKYTNLEVVVPVRAYVTADVYNAIIDHNL 194 (286) Q Consensus 146 ~--~a---~~---------------~~-------t~----d~V~klF~~~~~~yvn~ev~~~~~ayV~~evYNaIvD~~l 194 (286) . .+ .. ++ +. +-+..++.++.+++|-. ...+++|+|+.|++|+.++- T Consensus 137 ~aa~a~t~~~~~~~~~~~~g~s~~~~~t~~~a~~~~~~l~~ai~~a~~~LdEkdVP~---~dRv~vv~P~~y~~Ll~~~r 213 (402) T protein:vir:97 137 LGGIANTKAERNKPRVKGHGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDI---SDVAIMMPWKFFNALRDADR 213 (402) T ss_pred HhhccccccccccCcccccccccccccccchhhcCHHHHHHHHHHHHHHHHhcCCCc---cccEEEeChHHHHHHhhccc Confidence 1 10 00 01 11 22445667777777764 34799999999999998754 Q ss_pred cc-cccc-ceee-eccCceeeecCeEEEEchHHhhcCceE---EEeecceeeeccceEEEEEeeccCccceeeeeccccc Q lcl|NC_019916. 195 VT-SQKG-SAVN-IDENGIVRFRDIIITKVPEKYMQGKAI---MFVPDNIGRAFTGIVTTRTIESEDFDGVALQGAGKAG 268 (286) Q Consensus 195 ~T-s~K~-Ss~N-iD~ngi~~fKgf~l~e~p~~y~qg~~~---ifs~dnIg~af~GI~taRtieSEDFdGVaLQgAgK~G 268 (286) -- ..=+ ++.+ +=.-++++--||.|.+.|.-=+.+..+ =-|+.+-|.+|. -.-||.. .+|. T Consensus 214 l~n~d~~~~~~g~~~~G~v~~v~Gv~Vv~SnnlP~~a~~it~~~ls~a~~G~~y~--------~t~d~t~--~~~~---- 279 (402) T protein:vir:97 214 IVDKTYTISQSGATINGFVLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYD--------PIAEMNG--AVAV---- 279 (402) T ss_pred ccchhhccccCCccccceeEEEeceEEEecCccccccccccccccccCCCCccCC--------cCcccce--eEEE---- Confidence 22 1110 1111 112235556666665554322111000 012222233332 0001110 0111 Q ss_pred ccCCCcCcceEEEEecCC Q lcl|NC_019916. 269 SFILDDNKAAIFSATPKA 286 (286) Q Consensus 269 ~~IlddNKkAI~k~t~ka 286 (286) .| .++|+..+.++. T Consensus 280 ~f----~~~Av~tvk~~~ 293 (402) T protein:vir:97 280 LF----TSDALLVGRTIE 293 (402) T ss_pred EE----ecceEEEEEeec Confidence 22 234666666655 No 50 >protein:vir:95107 Length: 270 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240822;genbank:gi:66394683;genbank:GeneID:5133901 Probab=74.20 E-value=0.16 Score=24.93 Aligned_cols=254 Identities=15% Similarity=0.088 Sum_probs=129.0 Q ss_pred CCCC-cccceeeeechhHHHHHHHHHhhhhhhhhhhcccchhcCCcccceeEEEeecccceEeecccCCCcceeecCcCC Q lcl|NC_019916. 1 MATN-NNNLAARTYTKQFAQLMQTVFGAQSVFGPTFGDLQALDGVQNNATAFSVKTNNVPVVVGEYSQDEAVAFGAGTAK 79 (286) Q Consensus 1 M~t~-nnn~a~r~Y~kq~~~ll~~vf~~qa~F~~~fgglQ~lDGVqnN~taf~vKtnd~pVvvg~Y~td~NvaFGtGTg~ 79 (286) |+.+ =.+| +--.-|..+++.=+.++..|.+.---.-.|.|..-+. -.+|.- +|.-|+.. -..|+.- T Consensus 1 Ma~T~~~d~---I~Pev~~~~V~e~~~~~~~~~~~~~~d~~L~g~~G~t-------i~~P~~--~~igdae~-~~eg~~i 67 (270) T protein:vir:95 1 MTQTKKANL---INPEVLANVVSAQMQNAIRFTPYAVTDDTLVGQPGDT-------ITRPKY--AYIGAAED-LQEGVAM 67 (270) T ss_pred CCceehhhh---cchHHHHHHHHHHHHhHHhhccccccccccCCCCCCE-------EEeeee--cCCCcccc-ccCCCcc Confidence 6532 1111 2334688888888888887765322123333322111 111211 11111110 1111111 Q ss_pred cccccceeEEEEecccc---cccccchhhhccccccccCChhHHHHHHHhhHHHHHHHHHHHHHHHHHhhhh---hhhhh Q lcl|NC_019916. 80 STRFGERTEIVYTDTDV---PYEFTWAIHEGLDRFTVNNDLNAAVADRLDLQAQAKVRMFNNALGKKLADAS---TDLGA 153 (286) Q Consensus 80 s~RFG~rkEIiy~DtdV---pY~~~~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~~~gk~ls~~a---~~~~t 153 (286) +..++-.....+ -+-..|.+.. +++++--.|+ +.+..+.++.+|.|.+++.+=..|..+. +...+ T Consensus 68 -----~~~~lt~~~~~a~i~~~gk~~~itD-~a~~~~~~dp---~~~~~~q~a~~~a~~~d~~li~~l~~a~~~~~~~~t 138 (270) T protein:vir:95 68 -----DTTQMSMTTTKVTVKETGKAVEVTQ-TAIITNVNGT---LQEASRQLAMSLADKVEIDYIAELNKSKQTATVSAD 138 (270) T ss_pred -----chhhcccchheeeeehhhCcceecH-HHHhhhccch---HHHHHHHHHHHHHHHHHHHHHHHhcccccccccccC Confidence 011111111111 0122343332 2444444455 5666677889999998876644443221 11223 Q ss_pred hhhHHHHHHHHhhhhhceeeeEEEEEEECchhhhhhhccccccccccceeeeccCc-eeeecCeEEEEchHHhhc-CceE Q lcl|NC_019916. 154 VDDVNVMFETASAKYTNLEVVVPVRAYVTADVYNAIIDHNLVTSQKGSAVNIDENG-IVRFRDIIITKVPEKYMQ-GKAI 231 (286) Q Consensus 154 ~d~V~klF~~~~~~yvn~ev~~~~~ayV~~evYNaIvD~~l~Ts~K~Ss~NiD~ng-i~~fKgf~l~e~p~~y~q-g~~~ 231 (286) .+++ ..+...+ +-|...+-.++|+|.+|..+--.++.+..+.+. |+=.|| +-.|.|+.+-. .+..-. |... T Consensus 139 ~~~~----~dA~~~l-gd~~~~~~~i~vhs~~~~~Lrk~~~~~~~~~~~-~~~~~G~ig~~~G~~Viv-~s~~~~~~~~~ 211 (270) T protein:vir:95 139 ATGI----LDAIEVF-NSENDEDYVLYVNPKDYNKLVKSLFKVGGNVQD-RAISKGDLVEIVGVSDIV-KSKRVSENTAF 211 (270) T ss_pred HHHH----HHHHHHh-ccccCCCcEEEEcHHHHHHHHhhhccccccccc-chhcccccceecceeEEE-eCCCCCceeEE Confidence 3333 2232332 334455667999999999987666666555543 443333 66789986421 122222 7888 Q ss_pred EEeecceee---eccceEEEEEeeccCccceeeeecccccccCCCcCcceEEEEecCC Q lcl|NC_019916. 232 MFVPDNIGR---AFTGIVTTRTIESEDFDGVALQGAGKAGSFILDDNKAAIFSATPKA 286 (286) Q Consensus 232 ifs~dnIg~---af~GI~taRtieSEDFdGVaLQgAgK~G~~IlddNKkAI~k~t~ka 286 (286) +|.+..||. .-+-||+-|-+..- =-.|=+-=-||.++.++-|...++..|.- T Consensus 212 l~~~gAi~~~~~~~~~vEtdRd~~~~---~d~i~~~~~y~v~~~~~skvv~~t~~~a~ 266 (270) T protein:vir:95 212 LQRYGAMEIVNKKKPEAYTDFDILKR---THLLSTNYHYSVNLKDETGVVKVTFKPSG 266 (270) T ss_pred EEeccceeeeecCCceeeeccchhhc---ccEEEeeeEEEEEEEccceEEEEEecCCC Confidence 999888872 22347777765541 23555666689999998887777765544 No 51 >protein:vir:7019 Length: 401 # NCBI annotation: major capsid protein # Family: family:all:2806 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853592;genbank:gi:31711674;genbank:GeneID:1481800 Probab=73.11 E-value=0.17 Score=24.74 Aligned_cols=235 Identities=15% Similarity=0.098 Sum_probs=123.0 Q ss_pred CCCCccc---------ceeeeechhHHHHHHHHHhhhhhhhhhhcccchhcCCcccceeEEEeecccceEeec-----cc Q lcl|NC_019916. 1 MATNNNN---------LAARTYTKQFAQLMQTVFGAQSVFGPTFGDLQALDGVQNNATAFSVKTNNVPVVVGE-----YS 66 (286) Q Consensus 1 M~t~nnn---------~a~r~Y~kq~~~ll~~vf~~qa~F~~~fgglQ~lDGVqnN~taf~vKtnd~pVvvg~-----Y~ 66 (286) |.+-|+. -+.-+|-|+|-|.+.+-|+.++.|++.+- .+++.| =|+--.|.+ |+ +. T Consensus 1 Ms~~n~~t~~~~~~sg~~~al~Le~f~GeV~taF~~~si~~~~~~-vRti~~---------gkS~qf~~~-G~s~~~~~~ 69 (401) T protein:vir:70 1 MSTPNNLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENIMSYFD-VQTVTG---------TNTVSNKYL-GETELQVLA 69 (401) T ss_pred CCCCccccccccccccchhHhHHhHhcchHHHHHHHHhhhcccce-eeeecc---------cceEEEEEe-eeeEeeeec Confidence 7776553 34568999999999999999999997653 333222 123333443 43 44 Q ss_pred CCCcceeecCcCCcccccceeEEEEeccccccccc-chhhhccccccccCChhHHHHHHHhhHHHHHHHHHHHHHHHHHh Q lcl|NC_019916. 67 QDEAVAFGAGTAKSTRFGERTEIVYTDTDVPYEFT-WAIHEGLDRFTVNNDLNAAVADRLDLQAQAKVRMFNNALGKKLA 145 (286) Q Consensus 67 td~NvaFGtGTg~s~RFG~rkEIiy~DtdVpY~~~-~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~~~gk~ls 145 (286) .++.. .++|.---|-+|-.|+-+--... |-|||=-..|+ -+. ++=...+.+|-.|+||..+...|- T Consensus 70 pG~~l-------d~~~~~~dK~~ItID~lL~a~~~V~dlDe~q~~yD---~vR---se~s~e~G~ALA~~~Dq~iiq~i~ 136 (401) T protein:vir:70 70 PGQSP-------AATSTQADKNQLVIDATVIARNTVAHLHDVQGDID---SLK---PKLATNQAKQLKRMEDEMLIQQMM 136 (401) T ss_pred CCCCc-------CCCCcccccEEEEeCceeehhhhhhhHHHHHhccc---ccc---hHHHHHHHHHHHHHHHHHHHHHHH Confidence 44543 12344344567888887644332 23333333332 023 333455678888999987755551 Q ss_pred hhh-------hhhh--------------------hh----hhHHHHHHHHhhhhhceeeeEEEEEEECchhhhhhhccc- Q lcl|NC_019916. 146 DAS-------TDLG--------------------AV----DDVNVMFETASAKYTNLEVVVPVRAYVTADVYNAIIDHN- 193 (286) Q Consensus 146 ~~a-------~~~~--------------------t~----d~V~klF~~~~~~yvn~ev~~~~~ayV~~evYNaIvD~~- 193 (286) .++ +... +. +.+..++..+.+++|.. ...+.+..|+.|+.|.+|+ T Consensus 137 ~aa~ana~~~~~~p~~~~~G~~i~v~~~~~~~~~~~~~l~~ai~dA~~~LdEkdVP~---~r~vvl~pp~~Ys~Ll~~d~ 213 (401) T protein:vir:70 137 LGGIANTQAKRTNPRVKGHGFSINVEVAEGEALVNPQYVMAAVEFALEQQLEQEVDI---SDVAILMPWRYFNVLRDADR 213 (401) T ss_pred HhccccccccccCCCcCCCceEEeccccccccccCHHHHHHHHHHHHHHHHhcCCCc---cceEEEcCHHHHHHHHhcCc Confidence 110 0000 11 22557777888888873 3577888999999999997 Q ss_pred ccccccc-ce--eeeccCceeeecCeEEEEchHHhhcCceEEEeecceeeeccceEEEEEeeccCccceeeeeccccccc Q lcl|NC_019916. 194 LVTSQKG-SA--VNIDENGIVRFRDIIITKVPEKYMQGKAIMFVPDNIGRAFTGIVTTRTIESEDFDGVALQGAGKAGSF 270 (286) Q Consensus 194 l~Ts~K~-Ss--~NiD~ngi~~fKgf~l~e~p~~y~qg~~~ifs~dnIg~af~GI~taRtieSEDFdGVaLQgAgK~G~~ 270 (286) +....=+ |+ ..+. -.+++--||.|.|.|.-=+.+.. | +...+ .-.|.|-...+ T Consensus 214 L~nrd~~~s~~g~~~~-G~v~~vaGv~Vv~SnnlP~~a~~-i--------------t~~~l--------s~a~~G~~y~~ 269 (401) T protein:vir:70 214 IVDKTYTISQSGATIQ-GFTLSSYNCPVIPSNRFPKYSQG-Q--------------THHLL--------SNEDNGYRYDP 269 (401) T ss_pred ccchhhccccCCcccc-ceEEEEeceEEEeeccccccccc-c--------------ccccc--------cccCCCccCCC Confidence 4332211 11 1111 12445566666665432221100 0 00000 01233333344 Q ss_pred CCCcCcceEEEEecCC Q lcl|NC_019916. 271 ILDDNKAAIFSATPKA 286 (286) Q Consensus 271 IlddNKkAI~k~t~ka 286 (286) --|..|...+--+|.| T Consensus 270 ~~d~s~~~~v~f~~~A 285 (401) T protein:vir:70 270 LPAMNGAIAVLFTADA 285 (401) T ss_pred CccccceeEEEEehhh Confidence 4455444444444444 No 52 >protein:vir:103323 Length: 364 # NCBI annotation: major capsid-like protein # Family: family:all:2806 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039668;genbank:gi:125999997;genbank:GeneID:4818399 Probab=58.30 E-value=0.42 Score=22.67 Aligned_cols=245 Identities=13% Similarity=0.115 Sum_probs=120.6 Q ss_pred CCCCcc---------cceeeeechhHHHHHHHHHhhhhhhhhhhcccchhcCCcccceeEEEeecccceE----eecccC Q lcl|NC_019916. 1 MATNNN---------NLAARTYTKQFAQLMQTVFGAQSVFGPTFGDLQALDGVQNNATAFSVKTNNVPVV----VGEYSQ 67 (286) Q Consensus 1 M~t~nn---------n~a~r~Y~kq~~~ll~~vf~~qa~F~~~fgglQ~lDGVqnN~taf~vKtnd~pVv----vg~Y~t 67 (286) |..-|. .-+.-+|-|+|-|.+.+-|+.++.|++..=- +++.| =|+--.|.+ ++-|.. T Consensus 1 ms~~n~~t~~~~~~~~~~~al~le~f~geV~taf~~~s~~~~~~~~-rti~~---------gkS~q~~~iG~~~~~~~~~ 70 (364) T protein:vir:10 1 MSNPNVLTQPAVSASGEVDSLLIEKFNNRVHEQYLKGENLLQWFDV-QEVVG---------TNSVSNKYIGETELQVLSP 70 (364) T ss_pred CCCcccccccccccccchhhhhhhhhhhhHHHHHHHHHhhcCccee-eeecc---------cceEEeeeeeeeEEeeecc Confidence 654432 2356789999999999999999999976543 22221 122233433 333444 Q ss_pred CCcceeecCcCCcccccceeEEEEecccccccccchhhhccccccccCC-hhHHHHHHHhhHHHHHHHHHHHHHHHHHhh Q lcl|NC_019916. 68 DEAVAFGAGTAKSTRFGERTEIVYTDTDVPYEFTWAIHEGLDRFTVNND-LNAAVADRLDLQAQAKVRMFNNALGKKLAD 146 (286) Q Consensus 68 d~NvaFGtGTg~s~RFG~rkEIiy~DtdVpY~~~~aiHEGiDr~TVNnd-l~aavAdRl~Lqa~Ak~~~~n~~~gk~ls~ 146 (286) +++. .|+ +.-.-|=+|-.|+-. |.--+. .=||.+.-+=| +.+.++. .+++|-.+++|..+-..+.. T Consensus 71 G~~l---d~~----~~~~~k~~itID~ll-~a~~~V--~diDe~q~~~D~vR~e~s~---e~G~ALA~~~Dq~i~~~v~~ 137 (364) T protein:vir:10 71 GKSP---DAS----PTEFDKNRLVVDTTV-IARNTV--AHFHDVQNDIDGLKSKLSV---NQAKKLKKMEDSMVIQQLVL 137 (364) T ss_pred Cccc---CCC----CcccCcEEEEeccee-eechhh--hhHHHHhcCccchhHHHHH---HHHHHHHHHHHHHHHHHHHh Confidence 5553 232 222235588888866 322222 22455555555 4555554 45788888888877433321 Q ss_pred hh-hh--------------------------hhhh----hhHHHHHHHHhhhhhceeeeEEEEEEECchhhhhhhcccc- Q lcl|NC_019916. 147 AS-TD--------------------------LGAV----DDVNVMFETASAKYTNLEVVVPVRAYVTADVYNAIIDHNL- 194 (286) Q Consensus 147 ~a-~~--------------------------~~t~----d~V~klF~~~~~~yvn~ev~~~~~ayV~~evYNaIvD~~l- 194 (286) ++ .. +... +-+..++..+.+++|.. ...+++|+|+.|.+|+.++- T Consensus 138 aa~a~~~~~~~~~~~~~~g~~i~~~~~a~~~~~~~~~l~~ai~~a~~~LdEkdVP~---~~R~~vv~P~~y~~Ll~~~~l 214 (364) T protein:vir:10 138 GGISNTEAIRKNPRVAGHGFSIHIVGLASSFLTSPQYMMAAIEMAMEQQTEQEVDT---SELCGLMPWTAFNCLRDADRI 214 (364) T ss_pred hhhhcccccccCCcccCCcceeeecccCcchhhhHHHHHHHHHHHHHHHhhcCCCc---cccEEEeChHHHHHHhcCCcc Confidence 11 00 0011 22345666666666654 34799999999999999753 Q ss_pred cccc---ccceeeeccCceeeecCeEEEEchHHhhcCceEEEeecceeeeccceEEEEEeeccCccceeee---e--ccc Q lcl|NC_019916. 195 VTSQ---KGSAVNIDENGIVRFRDIIITKVPEKYMQGKAIMFVPDNIGRAFTGIVTTRTIESEDFDGVALQ---G--AGK 266 (286) Q Consensus 195 ~Ts~---K~Ss~NiD~ngi~~fKgf~l~e~p~~y~qg~~~ifs~dnIg~af~GI~taRtieSEDFdGVaLQ---g--AgK 266 (286) .... .++...++. .+.+.-||.|.|.|.-=+.+... ..+|.-+.+.+..+.. |-+.. + .-+ T Consensus 215 vn~d~~~~~~~~~~~G-~v~~v~Gv~Vv~Sn~lP~~~~~~---------~~t~~~t~h~ls~~~~-g~~y~v~~d~~~~~ 283 (364) T protein:vir:10 215 VDKSYTIAASDNTVDG-FVLKSWNTPIVPSNRFPKLSDNT---------EGTGNTKHHKLSNAGN-GNRYDVTAGQTSAQ 283 (364) T ss_pred ccccccccCCCccccc-eeEEEeceEEEeccccccccccc---------cccccccccccccccC-CcccccccccceeE Confidence 2111 112222222 23456677766654321111100 0123333333321111 11110 0 000 Q ss_pred ccccCCCcCcceEEEEecCC Q lcl|NC_019916. 267 AGSFILDDNKAAIFSATPKA 286 (286) Q Consensus 267 ~G~~IlddNKkAI~k~t~ka 286 (286) +=.| .++|+..+.++. T Consensus 284 ~~~f----~~~Al~tv~~~~ 299 (364) T protein:vir:10 284 AVLF----TQDALLVGRTIS 299 (364) T ss_pred EEEE----ecceEEEEEEec Confidence 1123 234666655554 No 53 >protein:vir:108303 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:1412 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552282;genbank:gi:160700607;genbank:GeneID:5758819 Probab=38.08 E-value=1.1 Score=20.39 Aligned_cols=250 Identities=14% Similarity=0.017 Sum_probs=110.1 Q ss_pred CCCCcccceeeeechhHHHHHHHHHhhhhhhhhh-----hcccchhcCCcccceeEEEeecccceEeecccCCCcceeec Q lcl|NC_019916. 1 MATNNNNLAARTYTKQFAQLMQTVFGAQSVFGPT-----FGDLQALDGVQNNATAFSVKTNNVPVVVGEYSQDEAVAFGA 75 (286) Q Consensus 1 M~t~nnn~a~r~Y~kq~~~ll~~vf~~qa~F~~~-----fgglQ~lDGVqnN~taf~vKtnd~pVvvg~Y~td~NvaFGt 75 (286) |++-+|+. ++ -+..+..+-..|+++..|.+. .|.++ .-++ -+.+.. .-++.+.+|.+ ..+ T Consensus 1 m~~~~N~~-lt--p~iia~~~l~~l~~~lV~~~lv~r~y~~e~~-----~~GD-TV~I~v-p~~~~v~dg~~--~~~--- 65 (418) T protein:vir:10 1 MAVQDNNL-LT--DDVIAKEALRLLKNNLVMAKCVYRNYEKTFG-----KVGD-TIRLKL-PYRVKSASGRT--LVK--- 65 (418) T ss_pred CCcccccc-cc--HHHHHHHHHHHHHHhccchhhhcCCCchHHh-----hCCC-EEEEee-CCceeecccCC--ccc--- Confidence 99977774 22 123334444556766665442 22222 1122 233433 22233333321 110 Q ss_pred CcCCcccccceeEEEEecccccccccchhhhccccccccCChhHHHHHHHhhHHHHHHHHHHHHHHHHHhhhhhhhh--- Q lcl|NC_019916. 76 GTAKSTRFGERTEIVYTDTDVPYEFTWAIHEGLDRFTVNNDLNAAVADRLDLQAQAKVRMFNNALGKKLADASTDLG--- 152 (286) Q Consensus 76 GTg~s~RFG~rkEIiy~DtdVpY~~~~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~~~gk~ls~~a~~~~--- 152 (286) .+- -+.+--+-.|++.-+.+.|. .+|+--.+. .-..++|.-+..|=.+.+|..+..-+..++...+ T Consensus 66 -~~~----te~~v~l~id~~k~~~~~it---D~e~a~~~~---d~~~~~l~~A~~aLA~~vD~~ia~l~~~a~~~~gt~g 134 (418) T protein:vir:10 66 -QPM----VDQTIPFKIAYQEHVGLEYT---VKDKTLDIM---QFSERYLKSGMVQIANQIDRSLALTLKKAFHSSGTPG 134 (418) T ss_pred -ccc----ccceEEEEEecccccceeec---hHHHhhhhh---HHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccCC Confidence 000 01111122233222222222 444433333 3456777778888899999888776655443322 Q ss_pred hhhhHHHHHHHHhhhhhceeee--EEEEEEECchhhhhhhccccccccccceeeeccCcee-eecCeEEEEchHH----- Q lcl|NC_019916. 153 AVDDVNVMFETASAKYTNLEVV--VPVRAYVTADVYNAIIDHNLVTSQKGSAVNIDENGIV-RFRDIIITKVPEK----- 224 (286) Q Consensus 153 t~d~V~klF~~~~~~yvn~ev~--~~~~ayV~~evYNaIvD~~l~Ts~K~Ss~NiD~ngi~-~fKgf~l~e~p~~----- 224 (286) +-.....-|-.+.++.-+..|- +...+-|+|+.|..|.+.....-.+..+-..=.||.+ +.-||.+-+...- T Consensus 135 t~~~~~~~i~~a~~~Ld~~~VP~~G~R~lVv~P~~~~~L~~~~~~~~~~~~~~~~lr~G~IG~i~GF~V~~S~nip~~ta 214 (418) T protein:vir:10 135 VRPGAFIDFANAGAKQTTYAVPQDGMRHAVLDPFTCASLSDEVTKLFKESMVEQAYKMGYRGNVAAYEVYESQNLPKHTV 214 (418) T ss_pred cCcchHHHHHHHHHHHHhcCCCCCCceEEEeCHHHHHHHhhhccccccccccchhhheeeeeeeeceEEEEecCCCcccc Confidence 1122223344455555556664 3467789999999999866543222211111235544 7888888764321 Q ss_pred hhc-CceEEEeecceeeeccceEEEEEeeccCccceeeeecccccccCCCcC-------------cceEEEEec----CC Q lcl|NC_019916. 225 YMQ-GKAIMFVPDNIGRAFTGIVTTRTIESEDFDGVALQGAGKAGSFILDDN-------------KAAIFSATP----KA 286 (286) Q Consensus 225 y~q-g~~~ifs~dnIg~af~GI~taRtieSEDFdGVaLQgAgK~G~~IlddN-------------KkAI~k~t~----ka 286 (286) ... |...+. +-+.....+ +-+.+-+..-|--+.|.++-=.. +..-+.|+- .+ T Consensus 215 g~~~~t~~v~---ga~~~~~~~-------~~~~~t~s~~g~l~~Gd~~ti~gv~~v~~~t~~~~~~~~~f~V~~~~~~~~ 284 (418) T protein:vir:10 215 GDHGGTPLVN---GTVVNGDTV-------GFDGGTASTTGFLKAGDVITFGGVFGVNPQNYETTGLLQEFVVLEDVDTDA 284 (418) T ss_pred cccccceeee---cccccceeE-------EEeecceeeccceeeccEEEECceeecccccccccccceEEEEEeeccccc Confidence 111 111121 111111111 11233333345556665422111 111222211 11 No 54 >protein:vir:99424 Length: 360 # NCBI annotation: hypothetical protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919080;genbank:gi:119757038;genbank:GeneID:4606077 Probab=35.88 E-value=1.2 Score=20.14 Aligned_cols=263 Identities=14% Similarity=0.176 Sum_probs=112.4 Q ss_pred CCCCcccceeeeechhHHHHHHHHHhhhhhhhhhhcc--cc------hhcCCcccceeEEEeecccceEe-ecccCC-Cc Q lcl|NC_019916. 1 MATNNNNLAARTYTKQFAQLMQTVFGAQSVFGPTFGD--LQ------ALDGVQNNATAFSVKTNNVPVVV-GEYSQD-EA 70 (286) Q Consensus 1 M~t~nnn~a~r~Y~kq~~~ll~~vf~~qa~F~~~fgg--lQ------~lDGVqnN~taf~vKtnd~pVvv-g~Y~td-~N 70 (286) |..+ -.+=++-.++-.-|...-..... +|+ |+ -++-||++..++ ..+.++- +.+..+ .. T Consensus 1 ~~~~--~~~~~~~n~~~~~i~k~~it~~~-----l~~g~L~p~~a~~Fl~~v~~~t~iL----~~~r~~~~~s~~~ei~k 69 (360) T protein:vir:99 1 MSSN--STIDSVRNQNMNSLSQKDIGLAE-----LDGFQLPVDVTEEFLERMQKGVQIL----GMADTMTLARLEMEVPQ 69 (360) T ss_pred Ccch--hHHHHHhhhHHHHHHhhhccccc-----cCceeecHHHHHHHHHHHhhccchh----hhcceeecccccccccc Confidence 5432 11111111111112111111111 111 11 234455555554 2233332 223222 26 Q ss_pred ceeec--------CcCCcccccc-eeEEEEecccccccccchhhhcccccccc-------CChhHHHHHHHhh------- Q lcl|NC_019916. 71 VAFGA--------GTAKSTRFGE-RTEIVYTDTDVPYEFTWAIHEGLDRFTVN-------NDLNAAVADRLDL------- 127 (286) Q Consensus 71 vaFGt--------GTg~s~RFG~-rkEIiy~DtdVpY~~~~aiHEGiDr~TVN-------ndl~aavAdRl~L------- 127 (286) ++||- +...+.|.+. +..|.|..++ .+..+|-|++=-.+-.++ +.+-+++|+|.+. T Consensus 70 ig~G~r~~r~~~e~~~~~~~~~~~~~~v~~~~~~-~~~~~~~i~~~~~~~n~~~~~~~f~~~i~~~~ae~~~~Dle~l~~ 148 (360) T protein:vir:99 70 FGVPRLSGHTRDEEGSRTENSEAESGSVKFNATD-KSYYILVEPKRDALKNTHYGPDQFGDYIVDQFIERYGNDLGLMGI 148 (360) T ss_pred cccceeeccccccCCCCCcCCcCccccCcccccc-ceeeEeechHHHHHhhhhcccchhHHHHHHHHHHHHHHHHHHHHh Confidence 77764 1123444443 2244444332 566677665433233333 2334666655443 Q ss_pred HHHHH-------------HHHHHHHHHHHHhh-----hhhhh----------------------------hhhhh--HHH Q lcl|NC_019916. 128 QAQAK-------------VRMFNNALGKKLAD-----ASTDL----------------------------GAVDD--VNV 159 (286) Q Consensus 128 qa~Ak-------------~~~~n~~~gk~ls~-----~a~~~----------------------------~t~d~--V~k 159 (286) |.-+- -.+.++.+=++-++ .|++. ..+++ +.+ T Consensus 149 ~g~~ds~d~~~~~~~d~fl~~~dGwlKka~~~~~~id~a~d~t~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~lf~~ 228 (360) T protein:vir:99 149 RAGASSGNLQSIGGAAELDNTFKGWIARAEGDAQSVDDAGDSTRIGLEDTATADADSMPSIANTDGSGNPQPVDTSLFNE 228 (360) T ss_pred hccchhcccccCcccchhhhhhHHHHHHhhcccchhhccccccccccccccccccccchhhhccccccccccchHHHHHH Confidence 33221 22334444343211 11100 01112 347 Q ss_pred HHHHHhhhhhceeeeEEEEEEECch---hhhhhhccccccccccceeeeccCceeeecCeEEEEchHHhhcCceEEEe-e Q lcl|NC_019916. 160 MFETASAKYTNLEVVVPVRAYVTAD---VYNAIIDHNLVTSQKGSAVNIDENGIVRFRDIIITKVPEKYMQGKAIMFV-P 235 (286) Q Consensus 160 lF~~~~~~yvn~ev~~~~~ayV~~e---vYNaIvD~~l~Ts~K~Ss~NiD~ngi~~fKgf~l~e~p~~y~qg~~~ifs-~ 235 (286) +...+-.+|-|...+.+ +-+++|. .|.-.+...-+ .-|+++-+-. +.+.++|+.|+.+| -|.-..+||+ | T Consensus 229 ~~~~Lp~kyr~~~~~~~-~~~~s~~~~~~yr~~L~~R~t--~LGd~~l~g~-~~~~~~Gipi~~v~--~~pd~~~mlT~p 302 (360) T protein:vir:99 229 TIQTLDSRYRESDAYSP-VLMTSPNQVQSYTMSLTERED--PLGSAVIFGD-SDITPFSYDLVGVN--GFPDEYMMFTDP 302 (360) T ss_pred HHHhcchhhhcCcccce-EEEccCchHHHHHHHHhccCc--ccchhheecc-cccccceeeeEEcC--CCCCCceEEecc Confidence 77777888988766544 4445544 45555544433 5677776644 55789999999999 5674456665 9 Q ss_pred cceeeeccceEEE-------EEeeccCccceee-eecccccccCCCcCcceEEEE----ecCC Q lcl|NC_019916. 236 DNIGRAFTGIVTT-------RTIESEDFDGVAL-QGAGKAGSFILDDNKAAIFSA----TPKA 286 (286) Q Consensus 236 dnIg~af~GI~ta-------RtieSEDFdGVaL-QgAgK~G~~IlddNKkAI~k~----t~ka 286 (286) +|+ +.+.+-++= +.|+.++|.=... ++ +. .|+. .+..|++.| +|+| T Consensus 303 ~NL-i~g~~~~iri~~~~e~~~~~~~~~~~~~~~~~--~~-D~~i-ee~~Av~~vt~~~~~~~ 360 (360) T protein:vir:99 303 NNL-AFGLYEEMELDQSTDTDKVHEQRLHSRNWLEG--QF-DFQI-KEQQAGVLVTDLETPTA 360 (360) T ss_pred Cce-eEEeeeeeEEeecccchhhhhhceeeeEEEEE--Ee-eEEE-EecccEEEEecCCCCCC Confidence 999 444443331 1122333321110 00 00 1111 122333333 3555 No 55 >protein:vir:1781 Length: 221 # NCBI annotation: minor capsid protein # Family: family:all:975 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570347;genbank:gi:18640506;genbank:GeneID:932719 Probab=30.64 E-value=1.6 Score=19.53 Aligned_cols=164 Identities=14% Similarity=0.115 Sum_probs=80.3 Q ss_pred ecccccccccchhhhccccccccCChhHHHHHHHhhHHHHHHHHHHHHHHHHHhhhhhh-------------------hh Q lcl|NC_019916. 92 TDTDVPYEFTWAIHEGLDRFTVNNDLNAAVADRLDLQAQAKVRMFNNALGKKLADASTD-------------------LG 152 (286) Q Consensus 92 ~DtdVpY~~~~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~~~gk~ls~~a~~-------------------~~ 152 (286) .|+...-+ .+=+=||+...+=|+-+...++ +++|-.+.+|+.+...|.++|.. +. T Consensus 1 iD~lL~a~---~~VdDiD~aqa~~dvr~e~t~e---~G~ALA~~~D~~i~~~~~~aA~~~~p~~~~~~g~~~~~~a~~t~ 74 (221) T protein:vir:17 1 MDDLLVAS---QFVYDLDEILAQWNTRSEISKQ---IGEALAIHYDERIARVLASASIAAAPVTGQDGGFSVNIGAGNTN 74 (221) T ss_pred CCcchhHH---HHHHhHHHHHhhhHHHHHHHHH---HHHHHHHHHHHHHHHHHHhhhhhcCcccccccCcceeccccccC Confidence 33332211 1123344555555554444444 56788889999998888665422 11 Q ss_pred hh----hhHHHHHHHHhhhhhceeeeEEEEEEECchhhhhhhc--ccccccc-c-cceeeeccC-ceeeecCeEEEEchH Q lcl|NC_019916. 153 AV----DDVNVMFETASAKYTNLEVVVPVRAYVTADVYNAIID--HNLVTSQ-K-GSAVNIDEN-GIVRFRDIIITKVPE 223 (286) Q Consensus 153 t~----d~V~klF~~~~~~yvn~ev~~~~~ayV~~evYNaIvD--~~l~Ts~-K-~Ss~NiD~n-gi~~fKgf~l~e~p~ 223 (286) +. |.+.++..++.++.|-. .--+++|+|+.|-+|+- ++..+.. . +|...++.- ++.+.-||.|-+.+. T Consensus 75 ~~~~l~dai~~a~~~LdekdVP~---~gR~~vv~P~~y~~LL~~~d~~~~n~d~~~s~g~~~~g~~i~~v~G~~V~~Snn 151 (221) T protein:vir:17 75 NAQAIVDGFFEAAAVLDERSAPM---DGRVAVLSPRQYYSLISSVDTNILNREIGNTQGDMNTGKGLYVNAGIRIYKSNV 151 (221) T ss_pred CHHHHHHHHHHHHHHHhhcCCCC---CCCEEEeCcHHHHHHHHhcCcceeeeecccccccccccceeeeecCcEEEEecc Confidence 11 33445555666655542 34468999999999983 3433332 2 234445544 477888888877663 Q ss_pred Hhhc-CceEEEeecceeeeccceEEEEEeeccCccceeeeecccccccCCCc--------CcceEEEEecCC Q lcl|NC_019916. 224 KYMQ-GKAIMFVPDNIGRAFTGIVTTRTIESEDFDGVALQGAGKAGSFILDD--------NKAAIFSATPKA 286 (286) Q Consensus 224 ~y~q-g~~~ifs~dnIg~af~GI~taRtieSEDFdGVaLQgAgK~G~~Ildd--------NKkAI~k~t~ka 286 (286) -=.. |.... +..| .....+...++|=.|. .+.|+-+|.+-. T Consensus 152 lP~~~gt~~~--------~~ag--------------~~~~~~~~~~~yr~~fs~~~glv~~~~Avgtvkl~~ 201 (221) T protein:vir:17 152 LASLYGTNLV--------TDPG--------------DATTSGENNGSYRPAITDRAGLVFHKEAADTVEVLL 201 (221) T ss_pred CCcccccccc--------cCCc--------------cccccccccccccccccceEEEEEcchheeeeeeec Confidence 3222 22110 1111 1122222222333332 233333333333 No 56 >protein:vir:4830 Length: 397 # NCBI annotation: MPL-7201 # Family: family:all:21 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038327;genbank:gi:9634653;genbank:GeneID:1262632 Probab=25.85 E-value=2 Score=18.92 Aligned_cols=256 Identities=10% Similarity=0.075 Sum_probs=104.2 Q ss_pred CCCCcccceeeeechhHHHHHHHHHhhhhhhhhhhcccchhcCCcccceeEEEee-cccceEeecccCCCcceeecCcCC Q lcl|NC_019916. 1 MATNNNNLAARTYTKQFAQLMQTVFGAQSVFGPTFGDLQALDGVQNNATAFSVKT-NNVPVVVGEYSQDEAVAFGAGTAK 79 (286) Q Consensus 1 M~t~nnn~a~r~Y~kq~~~ll~~vf~~qa~F~~~fgglQ~lDGVqnN~taf~vKt-nd~pVvvg~Y~td~NvaFGtGTg~ 79 (286) +.+..+- --+--+++..-+-......+..++. +..+-.++-. -+....+. +..|. .|-.+|+....+. . T Consensus 111 ~~t~~~g--g~~iP~~~~~~ii~~~~~~~~l~~~-~~~~~~~~~~--~~~~~~~~~~~~~~---a~~v~E~~~~~~~--~ 180 (397) T protein:vir:48 111 DASGSDA--GLTIPQDIQTAIHTLVRQYDSLQEY-VNVENVTTLT--GSRVYEKWADITGL---AKLDDEAGSIGTN--D 180 (397) T ss_pred ccCCccc--cccccHHHHHHHHHHHHHHHHHHhh-hceeeccCCc--ceEEEEeecCCCcc---eeeeccccccccc--c Confidence 1111110 0011122211111122222222221 2211111111 11111111 11111 1222222111111 0 Q ss_pred cccccceeEEE-EecccccccccchhhhccccccccCChhHHHHHHHhhHHHHHHHHHHHHHHHHH--hhhhhhhhhhhh Q lcl|NC_019916. 80 STRFGERTEIV-YTDTDVPYEFTWAIHEGLDRFTVNNDLNAAVADRLDLQAQAKVRMFNNALGKKL--ADASTDLGAVDD 156 (286) Q Consensus 80 s~RFG~rkEIi-y~DtdVpY~~~~aiHEGiDr~TVNndl~aavAdRl~Lqa~Ak~~~~n~~~gk~l--s~~a~~~~t~d~ 156 (286) .--|++.+--. -.-.-+|++..+-- . + +-|+.+-|.++|. +|-.+..|..+=.-. ....+.+.+.|+ T Consensus 181 ~~~~~~v~~~~~k~~~~~~iS~ell~-----d-s-~~~l~~~v~~~l~---~~~~~~~d~~il~G~g~~~~~~~~~~~d~ 250 (397) T protein:vir:48 181 DPKLYPIRYAIKRYAGISTVTNSLLA-----D-S-AENILAWLSGWIA---KKVVVTRNKAILEAIATLPTKPTLTKWDD 250 (397) T ss_pred ccceeeEEeeheeeeeehhhHHHHHh-----h-c-hHHHHHHHHHHHH---HHHHHHHHHHHhhcccccccccccccHHH Confidence 11344332111 01112344433211 0 0 1134555555543 222222333211001 111223346689 Q ss_pred HHHHHHHHhhhhhceeeeEEEEEEECchhhhhhhcccccccccccee---eeccCceeeecCeEEEEchHHhhc----Cc Q lcl|NC_019916. 157 VNVMFETASAKYTNLEVVVPVRAYVTADVYNAIIDHNLVTSQKGSAV---NIDENGIVRFRDIIITKVPEKYMQ----GK 229 (286) Q Consensus 157 V~klF~~~~~~yvn~ev~~~~~ayV~~evYNaIvD~~l~Ts~K~Ss~---NiD~ngi~~fKgf~l~e~p~~y~q----g~ 229 (286) +.++..++...|.++ -..+++|..|++|-=..-+ -|.-+ ++-..+-..+-|+-+..++...+. +. T Consensus 251 i~~~~~~l~~~~~~~-----a~~v~n~~~~~~L~~lkd~---~G~~i~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~ 322 (397) T protein:vir:48 251 IIDLQAKVDPAIKQT-----SFFLTNTSGFTALKKVKNA---FGDYLMERDVKSPTGYSIDGFAVKEVADRWLANASSGA 322 (397) T ss_pred HHHHHHHhhhhhcCC-----CEEEECHHHHHHHHHhhcC---CCceeeccCcCCCCCceeccceeEEecccccCCcCCCc Confidence 999999999888765 3557899999988643211 12222 222334457788877777766664 23 Q ss_pred e-EEEe-ec-ce-eeeccceEEEEEeecc---CccceeeeecccccccCCCcCcceEEEEecCC Q lcl|NC_019916. 230 A-IMFV-PD-NI-GRAFTGIVTTRTIESE---DFDGVALQGAGKAGSFILDDNKAAIFSATPKA 286 (286) Q Consensus 230 ~-~ifs-~d-nI-g~af~GI~taRtieSE---DFdGVaLQgAgK~G~~IlddNKkAI~k~t~ka 286 (286) . ++|- .. .+ .....|+.+...-+.+ +-+-+++.+....+--+.+. +|+++++.++ T Consensus 323 ~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~--~a~~~~~~~~ 384 (397) T protein:vir:48 323 MPLYFGDLKQAVTLFDRQQMSLLSTNIGGGAFETDTTKIRVIDRFDVVATDT--ESFVPASFKA 384 (397) T ss_pred eEEEEEeccceEEEEeecceEEEEeccchhhhhcCceeEEEEeeeccEEecc--cceEEEEecc Confidence 2 3332 12 22 1223344443322222 34568888888888777665 5777777766 No 57 >protein:vir:10324 Length: 320 # NCBI annotation: ORF26 # Family: family:all:570 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758919;genbank:gi:27311193;genbank:GeneID:956155 Probab=23.60 E-value=2.3 Score=18.62 Aligned_cols=249 Identities=18% Similarity=0.173 Sum_probs=112.3 Q ss_pred eeeechhHHHHHHHHHhhhhhhhhhhcccchhcCCcccceeEEEeecccceEeecccCCCcceeecCcCCcccccceeEE Q lcl|NC_019916. 10 ARTYTKQFAQLMQTVFGAQSVFGPTFGDLQALDGVQNNATAFSVKTNNVPVVVGEYSQDEAVAFGAGTAKSTRFGERTEI 89 (286) Q Consensus 10 ~r~Y~kq~~~ll~~vf~~qa~F~~~fgglQ~lDGVqnN~taf~vKtnd~pVvvg~Y~td~NvaFGtGTg~s~RFG~rkEI 89 (286) |.. -+..-+++..+| .. ..+|.--.-++..+-.. -+-=+.++.|.. |.-.+-+.|+-. T Consensus 1 i~~-~P~~~g~~~glf----------f~---~~~v~T~~V~ie~~~~~-------l~lip~v~rg~~-g~~~~~~~~~~~ 58 (320) T protein:vir:10 1 MNL-LPVNYGDSRALF----------AR---EKKVRTRTILVEEKNGV-------LTLIQSREPGST-ENVAKRGKRKVR 58 (320) T ss_pred CCc-CCchhhhhhhhc----------cC---CCCcccceEEEEEecCc-------eeeeeccCCCCC-ceeecCCcceEE Confidence 111 111111111111 00 11222222222222111 111233556654 334444777777 Q ss_pred EEecccccccccchhhh--ccccccccCChh---HHHHHHHhhHHHHHHH-----HHHHHHHHHHhhhhh---------- Q lcl|NC_019916. 90 VYTDTDVPYEFTWAIHE--GLDRFTVNNDLN---AAVADRLDLQAQAKVR-----MFNNALGKKLADAST---------- 149 (286) Q Consensus 90 iy~DtdVpY~~~~aiHE--GiDr~TVNndl~---aavAdRl~Lqa~Ak~~-----~~n~~~gk~ls~~a~---------- 149 (286) .+.=.-+|......-+| |+-.|= .+.++ .+|++||.-+.+.--+ ....+.||-| |+-+ T Consensus 59 ~f~~p~~~~~d~i~a~eiq~~Ra~G-~~~~~~~~~~v~~~l~~lr~~~~~T~E~m~~~AL~G~il-dadGtv~~d~y~~f 136 (320) T protein:vir:10 59 SFVIPHLPLEDVILPDEYEGLRGFG-TTALAAKSELVKERXETMKSSHDITHEHLRMGAKKGQIL-DADGTVLYDLYAEF 136 (320) T ss_pred EEecceeccCCccCHHHHcCcccCC-CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCeEE-cCCCcEEEechhhh Confidence 77766777777766666 555442 12333 3467766554432111 1222334422 1100 Q ss_pred ---------hh--------hhhhhHHHHHHHHhhhhhceeeeEEEEEEECchhhhhhhcccccccc-c--cceeeec--- Q lcl|NC_019916. 150 ---------DL--------GAVDDVNVMFETASAKYTNLEVVVPVRAYVTADVYNAIIDHNLVTSQ-K--GSAVNID--- 206 (286) Q Consensus 150 ---------~~--------~t~d~V~klF~~~~~~yvn~ev~~~~~ayV~~evYNaIvD~~l~Ts~-K--~Ss~NiD--- 206 (286) ++ ...+++....+... .-+.+..++|.+.+++|++|+.|+.+..+ + ..+.++. T Consensus 137 Gi~~~~i~~~l~~a~~dv~~~~~~~~~~i~~~l----~g~~~t~v~al~g~~f~~al~~h~~Vke~y~~~~~~~~~l~~~ 212 (320) T protein:vir:10 137 GITKKTIYFGLDNKDANVAESCRQVLRHVEDNL----RGDVMKDVSVDVSEEFFDKFIKHASVKEVFLNHEAAVNRLGGD 212 (320) T ss_pred CCccceeEEecCCCCccHHHHHHHHHHHHHHHh----ccCCCCceEEEEChHHHHHHhcCHHHHHHHHhhhhhhhhcccc Confidence 00 01122222222211 12345568999999999999999986432 1 1122222 Q ss_pred cCceeeecCeEEEEchHHhhc--CceEEEeecceeeec-cce---EEEEEeeccCccceeeeecccccccCCCcC----- Q lcl|NC_019916. 207 ENGIVRFRDIIITKVPEKYMQ--GKAIMFVPDNIGRAF-TGI---VTTRTIESEDFDGVALQGAGKAGSFILDDN----- 275 (286) Q Consensus 207 ~ngi~~fKgf~l~e~p~~y~q--g~~~ifs~dnIg~af-~GI---~taRtieSEDFdGVaLQgAgK~G~~IlddN----- 275 (286) .-+-+.|.|+++++--..|.. |..--|-|++=++.| +|. -.++-=..++++=|-.+|.=-|.+-.++++ T Consensus 213 ~~~~f~~gGi~~~~Y~g~~~d~~g~~~~~I~~~~~~~~p~g~~~~f~~~~apad~~e~vnt~g~p~y~k~~~~~~~~g~~ 292 (320) T protein:vir:10 213 TRKGFKFGGLIFNENRARHVDEEGKETRFIKAGKGHAFPTGTTNTFFTALAPADFNETAGTLGKRYYAKMEPRRMGRGFD 292 (320) T ss_pred ccceEEecCEEEEEcccEEEcCCCCeeEeecCCeeEEEEecCchhheeeecccCcHhhcCCcccccccccccccCCCeEE Confidence 234568899999986554433 444444455533333 221 111222233333344555555555555544 Q ss_pred -------------cceEEEEecCC Q lcl|NC_019916. 276 -------------KAAIFSATPKA 286 (286) Q Consensus 276 -------------KkAI~k~t~ka 286 (286) -.|++|+|-.| T Consensus 293 l~~qS~PLpi~~rP~~lv~~~~~a 316 (320) T protein:vir:10 293 LHSQSNVLPMCCRPGVLVELDAAA 316 (320) T ss_pred EEeeecccccccCcceEEEEEecC Confidence 35666766666 Done!