Query lcl|NC_020198.1_cdsid_YP_007392345.1 [gene=H380_gp38] [protein=hypothetical protein] [protein_id=YP_007392345.1] [location=20609..21523] Match_columns 304 No_of_seqs 203 out of 339 Neff 5.8 Searched_HMMs 1612 Date Thu Nov 7 16:36:31 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_38 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_38_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:99228 Length: 304 100.0 1E-157 8E-161 880.8 21.1 304 1-304 1-304 (304) 2 protein:vir:79246 Length: 304 100.0 2E-157 1E-160 879.6 21.0 304 1-304 1-304 (304) 3 protein:vir:1991 Length: 305 # 100.0 2E-152 1E-155 852.6 23.4 302 1-303 1-305 (305) 4 protein:vir:103886 Length: 302 100.0 6.4E-93 4E-96 526.0 19.7 232 1-304 1-232 (302) 5 protein:vir:95512 Length: 693 100.0 3.4E-66 2.1E-69 379.5 14.6 220 1-304 394-627 (693) 6 protein:vir:79548 Length: 652 100.0 6.3E-63 3.9E-66 361.6 14.2 217 1-304 359-587 (652) 7 protein:vir:103886 Length: 302 99.9 4.7E-30 2.9E-33 181.4 1.6 174 1-235 122-302 (302) 8 protein:vir:108211 Length: 318 95.9 0.00016 9.9E-08 41.4 8.6 206 1-304 20-250 (318) 9 protein:vir:3033 Length: 272 # 95.4 0.0021 1.3E-06 35.3 13.7 185 1-304 1-210 (272) 10 protein:vir:9820 Length: 272 # 95.4 0.0021 1.3E-06 35.3 13.7 185 1-304 1-210 (272) 11 protein:vir:79928 Length: 393 93.3 0.00073 4.5E-07 37.8 5.7 262 1-287 74-393 (393) 12 protein:vir:3613 Length: 272 # 92.9 0.0096 6E-06 31.6 13.5 185 1-304 1-209 (272) 13 protein:vir:1638 Length: 298 # 92.7 0.0041 2.6E-06 33.7 9.0 229 1-304 1-259 (298) 14 protein:vir:105334 Length: 276 91.3 0.016 1E-05 30.4 14.4 187 1-304 1-211 (276) 15 protein:vir:9574 Length: 300 # 89.7 0.012 7.6E-06 31.1 8.5 228 1-254 1-300 (300) 16 protein:vir:99920 Length: 311 89.2 0.01 6.3E-06 31.5 7.7 234 1-304 1-267 (311) 17 protein:vir:2504 Length: 305 # 88.0 0.011 6.9E-06 31.3 7.1 230 1-269 1-305 (305) 18 protein:vir:7771 Length: 330 # 88.0 0.016 9.7E-06 30.5 7.9 236 1-271 1-330 (330) 19 protein:vir:96833 Length: 275 86.8 0.043 2.7E-05 28.1 14.2 194 1-304 1-212 (275) 20 protein:vir:94771 Length: 298 86.5 0.041 2.5E-05 28.2 9.3 225 1-238 1-298 (298) 21 protein:vir:96223 Length: 324 86.5 0.045 2.8E-05 28.0 11.9 225 1-304 30-271 (324) 22 protein:vir:95898 Length: 274 83.9 0.065 4E-05 27.1 13.2 186 1-304 1-211 (274) 23 protein:vir:96262 Length: 274 83.9 0.065 4E-05 27.1 13.2 186 1-304 1-211 (274) 24 protein:vir:2344 Length: 397 # 83.6 0.067 4.2E-05 27.0 13.8 222 1-304 1-262 (397) 25 protein:vir:8187 Length: 311 # 83.6 0.038 2.3E-05 28.4 7.7 228 1-272 1-311 (311) 26 protein:vir:103955 Length: 324 82.0 0.08 5E-05 26.6 13.4 223 1-304 30-271 (324) 27 protein:vir:100247 Length: 425 81.8 0.074 4.6E-05 26.8 8.6 225 1-236 105-425 (425) 28 protein:vir:97148 Length: 324 72.7 0.18 0.00011 24.7 12.7 225 1-304 27-271 (324) 29 protein:vir:4226 Length: 326 # 72.5 0.18 0.00011 24.6 8.2 235 1-242 1-326 (326) 30 protein:vir:79642 Length: 329 71.7 0.19 0.00012 24.5 9.2 213 1-304 26-261 (329) 31 protein:vir:104085 Length: 320 71.0 0.14 8.5E-05 25.3 6.9 230 1-269 1-320 (320) 32 protein:vir:1239 Length: 274 # 71.0 0.2 0.00012 24.4 14.2 186 1-304 1-211 (274) 33 protein:vir:93742 Length: 274 70.2 0.21 0.00013 24.3 13.9 187 1-304 1-211 (274) 34 protein:vir:78830 Length: 324 66.3 0.27 0.00017 23.7 12.0 225 1-304 27-271 (324) 35 protein:vir:96392 Length: 324 66.3 0.27 0.00017 23.7 12.0 225 1-304 27-271 (324) 36 protein:vir:107687 Length: 319 65.8 0.28 0.00017 23.6 9.7 203 1-304 31-254 (319) 37 protein:vir:104256 Length: 458 65.0 0.29 0.00018 23.5 8.0 238 1-304 163-428 (458) 38 protein:vir:96762 Length: 632 64.9 0.29 0.00018 23.5 8.2 232 1-246 332-632 (632) 39 protein:vir:2430 Length: 318 # 62.6 0.33 0.00021 23.2 10.1 227 1-304 14-269 (318) 40 protein:vir:97433 Length: 274 62.5 0.33 0.00021 23.2 14.2 187 1-304 1-211 (274) 41 protein:vir:94494 Length: 274 62.5 0.33 0.00021 23.2 14.2 187 1-304 1-211 (274) 42 protein:vir:104342 Length: 314 61.2 0.36 0.00022 23.0 8.8 206 1-304 1-246 (314) 43 protein:vir:9759 Length: 303 # 58.8 0.41 0.00025 22.7 10.8 237 1-304 1-263 (303) 44 protein:vir:102655 Length: 322 58.2 0.42 0.00026 22.7 14.7 210 1-304 10-243 (322) 45 protein:vir:78523 Length: 338 57.7 0.43 0.00027 22.6 10.7 236 1-304 1-289 (338) 46 protein:vir:99749 Length: 324 56.8 0.45 0.00028 22.5 12.1 223 1-304 30-271 (324) 47 protein:vir:96123 Length: 274 56.3 0.46 0.00029 22.4 13.2 186 1-304 1-211 (274) 48 protein:vir:80930 Length: 278 54.7 0.5 0.00031 22.2 10.8 193 1-304 1-218 (278) 49 protein:vir:105038 Length: 428 52.1 0.39 0.00024 22.8 5.8 260 1-272 98-428 (428) 50 protein:vir:103285 Length: 296 49.2 0.65 0.0004 21.6 9.2 207 1-304 1-228 (296) 51 protein:vir:485 Length: 407 # 42.4 0.89 0.00055 20.9 9.6 225 1-236 90-407 (407) 52 protein:vir:6242 Length: 390 # 39.8 0.87 0.00054 20.9 5.7 242 1-269 84-390 (390) 53 protein:vir:78223 Length: 333 39.5 1 0.00063 20.5 9.8 242 1-304 1-289 (333) 54 protein:vir:95763 Length: 297 39.4 1 0.00063 20.5 10.9 223 1-304 9-250 (297) 55 protein:vir:105905 Length: 304 39.3 1 0.00063 20.5 11.0 231 1-304 1-259 (304) 56 protein:vir:94142 Length: 304 39.3 1 0.00063 20.5 11.0 231 1-304 1-259 (304) 57 protein:vir:7990 Length: 273 # 38.5 1.1 0.00066 20.4 14.9 191 1-304 1-212 (273) 58 protein:vir:80068 Length: 301 33.4 1.4 0.00084 19.9 9.5 211 1-304 6-236 (301) 59 protein:vir:7409 Length: 408 # 32.7 1.4 0.00087 19.8 8.8 233 1-239 105-408 (408) 60 protein:vir:4456 Length: 401 # 31.6 1.5 0.00092 19.6 8.5 224 1-242 107-401 (401) 61 protein:vir:1268 Length: 397 # 31.2 1.5 0.00094 19.6 6.7 231 1-237 123-397 (397) 62 protein:vir:1025 Length: 408 # 30.9 1.5 0.00095 19.6 7.5 233 1-239 105-408 (408) 63 protein:vir:41 Length: 299 # N 30.6 1.6 0.00097 19.5 12.2 230 1-304 6-242 (299) 64 protein:vir:7855 Length: 497 # 27.9 1.8 0.0011 19.2 7.6 269 1-304 151-459 (497) 65 protein:vir:101650 Length: 497 27.9 1.8 0.0011 19.2 7.6 269 1-304 151-459 (497) 66 protein:vir:4092 Length: 390 # 27.4 1.8 0.0011 19.1 9.1 241 1-304 61-336 (390) 67 protein:vir:102605 Length: 273 27.4 1.8 0.0011 19.1 15.7 192 1-304 1-212 (273) 68 protein:vir:105822 Length: 273 27.4 1.8 0.0011 19.1 15.7 192 1-304 1-212 (273) 69 protein:vir:80213 Length: 334 26.7 1.9 0.0012 19.0 12.1 216 1-304 1-252 (334) 70 protein:vir:100057 Length: 375 23.0 2.4 0.0015 18.5 11.7 213 1-304 1-258 (375) 71 protein:vir:107593 Length: 392 22.3 2.5 0.0015 18.4 8.4 217 1-304 106-350 (392) 72 protein:vir:102873 Length: 392 22.3 2.5 0.0015 18.4 8.4 217 1-304 106-350 (392) 73 protein:vir:102082 Length: 392 22.3 2.5 0.0015 18.4 8.4 217 1-304 106-350 (392) 74 protein:vir:105004 Length: 392 22.3 2.5 0.0015 18.4 8.4 217 1-304 106-350 (392) 75 protein:vir:80684 Length: 315 21.7 2.6 0.0016 18.3 11.6 235 1-304 1-266 (315) 76 protein:vir:8102 Length: 543 # 20.3 2.8 0.0017 18.1 11.2 226 1-304 251-506 (543) 77 protein:vir:80180 Length: 381 20.2 2.8 0.0017 18.1 14.2 214 1-304 1-243 (381) No 1 >protein:vir:99228 Length: 304 # NCBI annotation: hypothetical protein # Family: family:all:776 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950457;genbank:gi:119953658;genbank:GeneID:4643088 Probab=100.00 E-value=1.2e-157 Score=880.82 Aligned_cols=304 Identities=99% Similarity=1.468 Sum_probs=302.9 Q ss_pred CCCccHHHHHHHHHHHHHHHHHHHhhcchhhcceEEEecCCccccccccccCCccchhcccceeecccccccceeeeecc Q lcl|NC_020198. 1 MAIITPALISALKTSFQKHFQDALATAPSTYLQVATVIPSTTASNTYGWLGQFPKLREWIGQRVIKDMAAQGYQITNKLF 80 (304) Q Consensus 1 maii~~~~l~~l~~~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~lrEw~Ge~~~~~l~~~~~~i~nk~f 80 (304) |||||+++|++|+++||+.|++||+.++++|++|||+|||++++|+|+|||+||+||||||||++++|++|+|+|+||+| T Consensus 1 M~ii~~~~L~~l~~~~~~~f~~~~~~a~~~~~~iA~~VpSt~~~~~Y~WLg~~P~mreWiG~r~i~~l~~~~y~I~Nk~f 80 (304) T protein:vir:99 1 MAIITPALISALKTSFQKHFQDALATAPSTYLQVATVIPSTTASNTYGWLGQFPKLREWIGQRVIKDMAAQGYQITNKLF 80 (304) T ss_pred CCccCHHHHHHHHHHHHHHHHHHHhhcCcccceeEeEeecCccccccchhcccccchhhhhhhhhhhhhhccceeecccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cceeecchhhhhcCCcchhHHHHHHHHHHHHhcHHHHHHHHHhccCCCcccCcccccccccccccccccccchhhhhhhh Q lcl|NC_020198. 81 ESTVGVKRTDIEDDNLGVYGPLMQEMGRAAGAHPDELVFALLKAGNANLCYDGQNFFDTDHPVYPNVDGTGTATTVSNLF 160 (304) Q Consensus 81 e~tv~v~R~~I~dDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~cyDGq~fF~tdH~v~~~~~~tg~~~s~snl~ 160 (304) |.||+|+|+|||||+||+|+|++++||++|++|||+|||+||++||+++||||||||||||||++|++++|..+++||+. T Consensus 81 E~Tv~V~R~dIEDD~~Giy~p~~~~~G~~aa~~Pd~lvf~lL~~Gf~t~CyDGq~FFdtDHpv~~~~dg~g~~~~vsn~~ 160 (304) T protein:vir:99 81 ESTVGVKRTDIEDDNLGVYGPLMQEMGRAAGAHPDELVFALLKAGNANLCYDGQNFFDTDHPVYPNVDGTGTATTVSNLF 160 (304) T ss_pred ccccccccccccccccCchHHHHHHHHHHHhcCchhhHHHHHHhhhcccCCCcccccccCCcccccccccCcccccceec Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccCCCCccceeccCCccchhhhhhhccccchhhcccCcccccccccceEEEeeccccccccchhhhhccccccchhHHH Q lcl|NC_020198. 161 APAADPGAAWYLLDTSRSLKPLIYQERMKPSFTSMTKEDDEQVFMADEYRYGVRSRCNVGFGFWQLAAMSTEELNQVNFE 240 (304) Q Consensus 161 ~~~~~~g~~w~L~d~~~~~kP~i~Q~r~~~~~~~~~~~~~~~vf~~~~~~~Gvd~R~n~G~g~wq~a~~s~~~l~~~~l~ 240 (304) ++++++|++|||||++|+|||||||+||+|+|+++++|+|+||||++||+||||+|||+||||||+||+|+++|+++||+ T Consensus 161 ~~~~~~g~~w~Lld~~r~iKP~I~Q~Rk~~~~~~~~~~~d~~Vf~~~e~~yGvd~R~n~GygfWQlA~gS~a~Lt~~nl~ 240 (304) T protein:vir:99 161 APAADPGAAWYLLDTSRSLKPLIYQERMKPSFTSMTKEDDEQVFMADEYRYGVRSRCNVGFGFWQLAAMSTEELNTANFE 240 (304) T ss_pred cCCCCCCCcEEEEeCCCCccceeeeccccceeeeccCCCchhhhhhcceeEeeeeeeccchhhhhhhhhcCCCcChHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhccCCCceeceecCeEEecchHHHHHHHHHhhhccCCCCcceecceeeEEeccccC Q lcl|NC_020198. 241 KVYDAMRNQKADGGRPLDIRPNLLVVPTTLRSKAKEVVGVQRLANGADNPNFELVQVLDTAWLN 304 (304) Q Consensus 241 aar~aM~~~k~~~G~~L~i~P~~LvVpp~le~~A~~ll~~~~~~~g~~N~~~g~~~~iv~p~Ld 304 (304) +||+|||+||+++|+||+|+|++|||||+||.+|+|||+++.+++|++|||+|++||||+|||| T Consensus 241 aAr~aMr~qk~d~G~pL~I~P~~LvVPp~LE~aA~~ll~a~~~~~G~tNp~~g~~eliV~P~Ld 304 (304) T protein:vir:99 241 KVYDAMRNQKADGGRPLDIRPNLLVVPTTLRSKAKEVVGVQRLANGADNPNFELVQVLDTAWLN 304 (304) T ss_pred HHHHHHHhhcCCCCceeccccCeEEecchHHHHHHHHHhhhccCCCCcceecceEEEEeecccC Confidence 9999999999999999999999999999999999999999999999999999999999999999 No 2 >protein:vir:79246 Length: 304 # NCBI annotation: conserved hypothetical protein # Family: family:all:776 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469162;genbank:gi:157835004;genbank:GeneID:5648827 Probab=100.00 E-value=2.1e-157 Score=879.57 Aligned_cols=304 Identities=99% Similarity=1.463 Sum_probs=302.9 Q ss_pred CCCccHHHHHHHHHHHHHHHHHHHhhcchhhcceEEEecCCccccccccccCCccchhcccceeecccccccceeeeecc Q lcl|NC_020198. 1 MAIITPALISALKTSFQKHFQDALATAPSTYLQVATVIPSTTASNTYGWLGQFPKLREWIGQRVIKDMAAQGYQITNKLF 80 (304) Q Consensus 1 maii~~~~l~~l~~~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~lrEw~Ge~~~~~l~~~~~~i~nk~f 80 (304) |||||+++|++|+++||+.|++||+.++++|++|||+|||++++|+|+|||+||+||||||||++++|++|+|+|+||+| T Consensus 1 M~ii~~~~L~~l~~~~~~~f~~~~~~a~~~~~~iA~~VpSt~~~~tY~WLg~~P~mreWiG~r~i~~l~~~~y~I~Nk~f 80 (304) T protein:vir:79 1 MAIITPALISALKTSFQKHFQDALATAPSTYLQVATVIPSTTASNTYGWLGQFPKLREWIGQRVIKDMAAQGYQITNKLF 80 (304) T ss_pred CCccCHHHHHHHHHHHHHHHHHHHhhcCcccceeEeEeecCccccccchhcccccchhhhhhhhhhhhhhccceeecccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cceeecchhhhhcCCcchhHHHHHHHHHHHHhcHHHHHHHHHhccCCCcccCcccccccccccccccccccchhhhhhhh Q lcl|NC_020198. 81 ESTVGVKRTDIEDDNLGVYGPLMQEMGRAAGAHPDELVFALLKAGNANLCYDGQNFFDTDHPVYPNVDGTGTATTVSNLF 160 (304) Q Consensus 81 e~tv~v~R~~I~dDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~cyDGq~fF~tdH~v~~~~~~tg~~~s~snl~ 160 (304) |.||+|+|+|||||+||+|+|++++||++|++|||+|||+||++||+++||||||||||||||+++++++|..+++||+. T Consensus 81 E~Tv~V~R~dIEDD~~Giy~p~~~~~G~~aa~~Pd~lvf~lL~~Gf~t~CyDGq~FFdtDHpv~~~~d~~g~~~~vsn~~ 160 (304) T protein:vir:79 81 ESTVGVKRTDIEDDNLGVYGPLMQEMGRAAGAHPDELVFALLKAGNANLCYDGQNFFDTDHPVYPNVDGTGTATTVSNLF 160 (304) T ss_pred ccceeeccccccccccCchHHHHHHHHHHHhcCchhhHHHHHHhhhcccCCCcccccccCCccccccccccccccceeec Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccCCCCccceeccCCccchhhhhhhccccchhhcccCcccccccccceEEEeeccccccccchhhhhccccccchhHHH Q lcl|NC_020198. 161 APAADPGAAWYLLDTSRSLKPLIYQERMKPSFTSMTKEDDEQVFMADEYRYGVRSRCNVGFGFWQLAAMSTEELNQVNFE 240 (304) Q Consensus 161 ~~~~~~g~~w~L~d~~~~~kP~i~Q~r~~~~~~~~~~~~~~~vf~~~~~~~Gvd~R~n~G~g~wq~a~~s~~~l~~~~l~ 240 (304) ++++++|++|||||++|+|||||||+||+|+|+++|+|+|+||||++||+||||+|||+||||||+||+|+++|+++||+ T Consensus 161 ~~~~~~g~~w~LlD~sr~iKP~I~Q~Rk~~~~~~~~~~~d~~Vf~~~e~~yGvd~R~n~GygfWQlA~gS~a~Ls~~nl~ 240 (304) T protein:vir:79 161 APAADPGAAWYLLDTSRSLKPLIYQERMKPSFTSLTKEDNEQVFMADEYVYGVRSRCNVGFGFWQLAAMSTEELNQVNFE 240 (304) T ss_pred cCCCCCCCeEEEEeCCCcccceeeeccccceeeecCCCCchhhhhhcceEEeeeeeeccchhhhhhhhhcCCccchHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhccCCCceeceecCeEEecchHHHHHHHHHhhhccCCCCcceecceeeEEeccccC Q lcl|NC_020198. 241 KVYDAMRNQKADGGRPLDIRPNLLVVPTTLRSKAKEVVGVQRLANGADNPNFELVQVLDTAWLN 304 (304) Q Consensus 241 aar~aM~~~k~~~G~~L~i~P~~LvVpp~le~~A~~ll~~~~~~~g~~N~~~g~~~~iv~p~Ld 304 (304) +||+|||+||+++|+||+|+|++|||||+||.+|+|||+++.+++|++|||+|++||||+|||| T Consensus 241 aAr~aMr~qk~d~G~pL~I~P~~LvVPp~LE~~A~~ll~a~~~~~G~tNp~~g~~eliV~P~Ld 304 (304) T protein:vir:79 241 KVYDAMRNQKADGGRPLDIRPNLLVVPTTLRSKAKEVVGVQRLANGADNPNFELVQVLDTAWLN 304 (304) T ss_pred HHHHHHHhhcCCCCceeccccCEEEecchhHHHHHHHHhhhhcCCCCcceecceEEEEeecccC Confidence 9999999999999999999999999999999999999999999999999999999999999999 No 3 >protein:vir:1991 Length: 305 # NCBI annotation: major head subunit # Family: family:all:776 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050638;genbank:gi:9633525;genbank:GeneID:2636267 Probab=100.00 E-value=1.7e-152 Score=852.61 Aligned_cols=302 Identities=49% Similarity=0.887 Sum_probs=297.5 Q ss_pred CCCccHHHHHHHHHHHHHHHHHHHhhcchhhcceEEEecCCccccccccccCCccchhcccceeecccccccceeeeecc Q lcl|NC_020198. 1 MAIITPALISALKTSFQKHFQDALATAPSTYLQVATVIPSTTASNTYGWLGQFPKLREWIGQRVIKDMAAQGYQITNKLF 80 (304) Q Consensus 1 maii~~~~l~~l~~~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~lrEw~Ge~~~~~l~~~~~~i~nk~f 80 (304) |.| |+++|++|+++||+.|++||+.+||+|++|||+|||++++|+|+|||+||+||||+|||+|++|++|+|+|+||+| T Consensus 1 M~i-~~~~l~~l~~~~~~~f~~~~~~a~~~~~~iA~~vpSt~~~~tY~wLg~fP~lrewiGer~i~~l~~~~y~i~Nk~f 79 (305) T protein:vir:19 1 MIV-TPASIKALMTSWRKDFQGGLEDAPSQYNKIAMVVNSSTRSNTYGWLGKFPTLKEWVGKRTIQQMEAHGYSIANKTF 79 (305) T ss_pred Ccc-CHHHHHHHHHHHHHHHHHHHhhcCcccceEEeEecCCCCcccccccccCCccchhhcceeeeeccccceeEeeccc Confidence 754 8999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cceeecchhhhhcCCcchhHHHHHHHHHHHHhcHHHHHHHHHhccCCCcccCcccccccccccccccccccchhhhhhhh Q lcl|NC_020198. 81 ESTVGVKRTDIEDDNLGVYGPLMQEMGRAAGAHPDELVFALLKAGNANLCYDGQNFFDTDHPVYPNVDGTGTATTVSNLF 160 (304) Q Consensus 81 e~tv~v~R~~I~dDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~cyDGq~fF~tdH~v~~~~~~tg~~~s~snl~ 160 (304) |.||+|+|+|||||+||+|+|++++||++|++|||+|||+||++||+++||||||||||||||+++++++|..+++||+. T Consensus 80 e~tV~V~R~dIeDD~lG~y~p~~~~~G~~aa~~pd~lv~~lL~~Gf~~~cyDGq~FFdtDHpv~~~~~~tg~~~~vsn~~ 159 (305) T protein:vir:19 80 EGTVGISRDDFEDDNLGIYAPIFQEMGRSAAVQPDELIFKLLKDGFTQPCYDGQNFFDKEHPVYPNVDGTGSAVNTSNIV 159 (305) T ss_pred cceeccchhhccccccCchHHHHHHHHHHHhhchhhHHHHHHHhcCCccCCCCCcccCCCCCcccCCcccccccchhhhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccCCCCccceeccCCccchhhhhhhccccchhhcccCcccccccccceEEEeeccccccccchhhhhccccccchhHHH Q lcl|NC_020198. 161 APAADPGAAWYLLDTSRSLKPLIYQERMKPSFTSMTKEDDEQVFMADEYRYGVRSRCNVGFGFWQLAAMSTEELNQVNFE 240 (304) Q Consensus 161 ~~~~~~g~~w~L~d~~~~~kP~i~Q~r~~~~~~~~~~~~~~~vf~~~~~~~Gvd~R~n~G~g~wq~a~~s~~~l~~~~l~ 240 (304) .+.+++|++|||+|++|+|||||||+||+|+|+++++|+|+|||+++||+||||+|||+||||||+||+|+++||++||+ T Consensus 160 ~~~~~~g~~w~Lld~~~~ikP~I~Q~Rk~~~~~~~~~~~d~~vf~~~e~~ygvd~R~n~Gygfwq~a~gS~~~Ls~~nl~ 239 (305) T protein:vir:19 160 EQDSFSGLPFYLLDCSRAVKPLIFQERRKPELVARTRIDDDHVFMDNEFLFGASTRRAAGYGFWQMAVAVKGDLTLDNLW 239 (305) T ss_pred cCCCCCCceeeeeecCCcceeEEEecccccceeeccCCCchhhhhhceeeeeeeeeeeccccchhheecCCCCCCHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhccCCCceeceecCeEEecchHHHHHHHHHhhhccCCCC---cceecceeeEEecccc Q lcl|NC_020198. 241 KVYDAMRNQKADGGRPLDIRPNLLVVPTTLRSKAKEVVGVQRLANGA---DNPNFELVQVLDTAWL 303 (304) Q Consensus 241 aar~aM~~~k~~~G~~L~i~P~~LvVpp~le~~A~~ll~~~~~~~g~---~N~~~g~~~~iv~p~L 303 (304) +||+|||+||+++|+||+|+|++|||||+||.+|+|||+++.+++|. +|||+|++||||+||| T Consensus 240 aar~aM~~qk~d~G~pL~I~P~~LvVPp~LE~~A~qll~s~~i~~g~~~~~Np~~g~~eliV~P~L 305 (305) T protein:vir:19 240 KGWQLMRSFEGDGGKKLGLKPTHIVVPVGLEKAAEQLLNRELFADGNTTVSNEMKGKLQLVVADYL 305 (305) T ss_pred HHHHHHHhhcCCCCceeeeecCeEEeCchhHHHHHHHHhhcccCCccccccceecceEEEEecccC Confidence 99999999999999999999999999999999999999999887765 5999999999999999 No 4 >protein:vir:103886 Length: 302 # NCBI annotation: putative major head subunit protein # Family: family:all:776 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938242;genbank:gi:38229147;genbank:GeneID:2648201 Probab=100.00 E-value=6.4e-93 Score=526.01 Aligned_cols=232 Identities=36% Similarity=0.620 Sum_probs=223.9 Q ss_pred CCCccHHHHHHHHHHHHHHHHHHHhhcchhhcceEEEecCCccccccccccCCccchhcccceeecccccccceeeeecc Q lcl|NC_020198. 1 MAIITPALISALKTSFQKHFQDALATAPSTYLQVATVIPSTTASNTYGWLGQFPKLREWIGQRVIKDMAAQGYQITNKLF 80 (304) Q Consensus 1 maii~~~~l~~l~~~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~lrEw~Ge~~~~~l~~~~~~i~nk~f 80 (304) |.| |+++|++|++++|+.|++||+++|++|++||+++||++++++|+|||+||+||||+|||++++|++++|+|+|++| T Consensus 1 m~i-t~~~l~~l~~~~~~~~~~~y~~a~~~~~~~a~~~~sdf~~~~~~~lg~~p~l~e~~Ge~~~~~l~~~~~~i~~~~~ 79 (302) T protein:vir:10 1 MLI-NKQSLNAAFVAIKTIFNNAFAAAPTTWQKIAMEVPSNTSSNDYKWLSTFPKMRRWIGAKVVKNLKAYKYVVENEDF 79 (302) T ss_pred Ccc-cHHHHHHHHHHHHHHHHHHHHhhhhhhhceeeecCCCcceeeceecCCCCCccccccceeeccccccceeEEeecc Confidence 765 8999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cceeecchhhhhcCCcchhHHHHHHHHHHHHhcHHHHHHHHHhccCCCcccCcccccccccccccccccccchhhhhhhh Q lcl|NC_020198. 81 ESTVGVKRTDIEDDNLGVYGPLMQEMGRAAGAHPDELVFALLKAGNANLCYDGQNFFDTDHPVYPNVDGTGTATTVSNLF 160 (304) Q Consensus 81 e~tv~v~R~~I~dDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~cyDGq~fF~tdH~v~~~~~~tg~~~s~snl~ 160 (304) |++|+|+|++|||||||+|++++++||++|++||+++||++|++|++++|||||+|||+|||++ T Consensus 80 g~~v~i~R~~i~nDdlg~~~~~~~~~G~aaa~~~~~lv~~~L~~g~~~~~~DG~~fF~~dH~~g---------------- 143 (302) T protein:vir:10 80 EATVEVDRNDIEDDQIGIYSPQAKMAGYSAAQLPDELVYEAVNGAFTKPCFDGQYFIDTDHPVG---------------- 143 (302) T ss_pred cceecccHHhhcccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCCcccCCcceeccccccc---------------- Confidence 9999999999999999999999999999999999999999999999999999999999999753 Q ss_pred cccCCCCccceeccCCccchhhhhhhccccchhhcccCcccccccccceEEEeeccccccccchhhhhccccccchhHHH Q lcl|NC_020198. 161 APAADPGAAWYLLDTSRSLKPLIYQERMKPSFTSMTKEDDEQVFMADEYRYGVRSRCNVGFGFWQLAAMSTEELNQVNFE 240 (304) Q Consensus 161 ~~~~~~g~~w~L~d~~~~~kP~i~Q~r~~~~~~~~~~~~~~~vf~~~~~~~Gvd~R~n~G~g~wq~a~~s~~~l~~~~l~ 240 (304) .+.|||.|+++||++ +.+|++++|. T Consensus 144 ----------------------------------------------------~~~~~N~g~~~~~~~---~~~l~~~~~~ 168 (302) T protein:vir:10 144 ----------------------------------------------------DASVSNKGTAPLSNA---SQAAAKAGYG 168 (302) T ss_pred ----------------------------------------------------ccccccccchhhhhc---ccccchHHHH Confidence 223799999999976 6789999999 Q ss_pred HHHHHHHHhccCCCceeceecCeEEecchHHHHHHHHHhhhccCCCCcceecceeeEEeccccC Q lcl|NC_020198. 241 KVYDAMRNQKADGGRPLDIRPNLLVVPTTLRSKAKEVVGVQRLANGADNPNFELVQVLDTAWLN 304 (304) Q Consensus 241 aar~aM~~~k~~~G~~L~i~P~~LvVpp~le~~A~~ll~~~~~~~g~~N~~~g~~~~iv~p~Ld 304 (304) ++|.+|++||+++|++|+|+|++|||||+||.+|++|+.+++.++|++|||+|++++||+|||+ T Consensus 169 aa~~am~~~k~~~G~~L~i~P~~LiVp~~le~~A~~ll~~~~~~~g~~Np~~g~~~~vv~p~L~ 232 (302) T protein:vir:10 169 AARTAMKKFKDEEGRSLNVSPNVLLVGPALEDVAKMLLTNPKLADNTPNPYVGTAELVVDGRIE 232 (302) T ss_pred HHHHHHHHHhhhcccccccCCCEEEecchhHHHHHHHhhccccCCCCcceeccceEEEEeeccC Confidence 9999999999999999999999999999999999999999998899999999999999999998 No 5 >protein:vir:95512 Length: 693 # NCBI annotation: Putative Clp protease # Family: family:all:62 # ACLAME annotation(s): go:0008236 - serine-type peptidase activity; phi:0000017 - phage prohead/capsid assembly # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293349;genbank:gi:148912770;genbank:GeneID:5228164 Probab=100.00 E-value=3.4e-66 Score=379.50 Aligned_cols=220 Identities=25% Similarity=0.358 Sum_probs=196.6 Q ss_pred CCC--ccHHHHHHHHHHHHHHHHHHHhhcchhhcceEEEec-CCccccccccccCCccchhcc--cceeeccccccccee Q lcl|NC_020198. 1 MAI--ITPALISALKTSFQKHFQDALATAPSTYLQVATVIP-STTASNTYGWLGQFPKLREWI--GQRVIKDMAAQGYQI 75 (304) Q Consensus 1 mai--i~~~~l~~l~~~~~~~f~~a~~~a~~~~~~~a~~v~-S~~~~~~y~~Lg~~P~lrEw~--Ge~~~~~l~~~~~~i 75 (304) ||+ .|+..-..|...+||.+.++|+++|+||++||.+.+ ++|+..+..+||+||.|++.- ||+++++++|.+++| T Consensus 394 ~a~~htTSDFp~IL~~~~nk~l~~~y~~a~~t~~~~~~~~~~~DFk~~~~~~lg~~~~L~~V~E~gEyk~~t~~e~~e~~ 473 (693) T protein:vir:95 394 LAFTHTSSDFGLILLDVANKSVLAGWEEAEETFPLWTKSGILTDFKPARRVGLGEFSSLRQVREGAEYKYVTLGERGEQI 473 (693) T ss_pred HHHhcCcchhHHHHHHHHHHHHHHHHHhhhhHHHHHhccCCCCcccccceeecCCCCChhhcCCCCceeeeecCCcccee Confidence 444 355566666778999999999999999999997765 899988888899999998874 999999999999999 Q ss_pred eeecccceeecchhhhhcCCcchhHHHHHHHHHHHHhcHHHHHHHHHhccCCCcccCcccccccccccccccccccchhh Q lcl|NC_020198. 76 TNKLFESTVGVKRTDIEDDNLGVYGPLMQEMGRAAGAHPDELVFALLKAGNANLCYDGQNFFDTDHPVYPNVDGTGTATT 155 (304) Q Consensus 76 ~nk~fe~tv~v~R~~I~dDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~cyDGq~fF~tdH~v~~~~~~tg~~~s 155 (304) +.+|||++|+||||+|||||||+|+++++.||++|+++++++||++|.+ ||+|+|||+|||+||. T Consensus 474 ~l~tyG~~~~iTRqaiINDDLga~~~ip~~~g~aA~~~~~~~vy~~L~~--Np~m~DGk~LFhadH~------------- 538 (693) T protein:vir:95 474 ILATYGELFSITRQAIINDDLQMLSDIPFKLGQAAKATIGDLVYAVLTG--NPAMSDGKTLFHADHS------------- 538 (693) T ss_pred ehhhcCCeeeecHHhhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHhc--CccccCCcceeecccc------------- Confidence 9999999999999999999999999999999999999999999999998 8999999999999994 Q ss_pred hhhhhcccCCCCccceeccCCccchhhhhhhccccchhhcccCcccccccccceEEEeeccccccccchhhhhccccccc Q lcl|NC_020198. 156 VSNLFAPAADPGAAWYLLDTSRSLKPLIYQERMKPSFTSMTKEDDEQVFMADEYRYGVRSRCNVGFGFWQLAAMSTEELN 235 (304) Q Consensus 156 ~snl~~~~~~~g~~w~L~d~~~~~kP~i~Q~r~~~~~~~~~~~~~~~vf~~~~~~~Gvd~R~n~G~g~wq~a~~s~~~l~ 235 (304) |+.++. ..+|+ T Consensus 539 --Nl~tga-------------------------------------------------------------------~sals 549 (693) T protein:vir:95 539 --NLLTGA-------------------------------------------------------------------ASALS 549 (693) T ss_pred --cccccc-------------------------------------------------------------------ccccC Confidence 332211 24788 Q ss_pred hhHHHHHHHHHHHhccC----CCceeceecCeEEecchHHHHHHHHHhhhccC-----CCCcceecceeeEEeccccC Q lcl|NC_020198. 236 QVNFEKVYDAMRNQKAD----GGRPLDIRPNLLVVPTTLRSKAKEVVGVQRLA-----NGADNPNFELVQVLDTAWLN 304 (304) Q Consensus 236 ~~~l~aar~aM~~~k~~----~G~~L~i~P~~LvVpp~le~~A~~ll~~~~~~-----~g~~N~~~g~~~~iv~p~Ld 304 (304) ++++.+||++|+.||+. +|++|+|+|.||||||+||++|+||++++.++ .|..|||+|.++||++|||| T Consensus 550 ~~sl~~a~~am~~qk~~~~~~~g~~L~i~P~~llvP~~le~~a~~l~~s~~~~~a~~~~~~~NP~~~~~~vi~~prL~ 627 (693) T protein:vir:95 550 IDSLSKAKTQMATQKAQVEKGKGRTLNIRPGFVLTPVALEDKANQIINSESVPGADVNSGIVNPIRAFAQVIGEPRLD 627 (693) T ss_pred hHHHHHHHHHHHHhhcchhccCCceeecccceEEecchHHHHHHHHhccccccccccccccccchhccccccccceec Confidence 99999999999999964 68999999999999999999999999987754 35689999999999999998 No 6 >protein:vir:79548 Length: 652 # NCBI annotation: putative protease/scaffold protein # Family: family:all:62 # ACLAME annotation(s): go:0008236 - serine-type peptidase activity; phi:0000017 - phage prohead/capsid assembly # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272518;genbank:gi:148609387;genbank:GeneID:5204384 Probab=100.00 E-value=6.3e-63 Score=361.61 Aligned_cols=217 Identities=24% Similarity=0.341 Sum_probs=195.1 Q ss_pred CCC--ccHHHHHHHHHHHHHHHHHHHhhcchhhcceEEEec-CCccccccccccCCccchhcc--cceeeccccccccee Q lcl|NC_020198. 1 MAI--ITPALISALKTSFQKHFQDALATAPSTYLQVATVIP-STTASNTYGWLGQFPKLREWI--GQRVIKDMAAQGYQI 75 (304) Q Consensus 1 mai--i~~~~l~~l~~~~~~~f~~a~~~a~~~~~~~a~~v~-S~~~~~~y~~Lg~~P~lrEw~--Ge~~~~~l~~~~~~i 75 (304) +|+ -|+..-..|...+||.+.++|+.+|+||++||.+.+ ++|+......||+||.|+++- ||+++++++|.+++| T Consensus 359 ~A~~hsTsDFp~IL~~~~nk~l~~~y~~a~~t~~~~~~~~~~~DFk~~~~~~lg~~~~L~~V~E~gEyk~~t~~e~~e~~ 438 (652) T protein:vir:79 359 AAFTHSTSDFGNILLDVANKAILQGWEDAPETYEQWTRKGQLSDFKIAHRVGMGGFSALRQVREGAEYKYVTTGDKQATI 438 (652) T ss_pred HHhhcCcchHHHHHHHHHHHHHHHHHhhhHHHHHHHhccCCCccccccceeecCCCCCccccCCCCccceeeecCcccee Confidence 554 366677778888999999999999999999997766 899998888899999999984 999999999999999 Q ss_pred eeecccceeecchhhhhcCCcchhHHHHHHHHHHHHhcHHHHHHHHHhccCCCccc-Cccccc-ccccccccccccccch Q lcl|NC_020198. 76 TNKLFESTVGVKRTDIEDDNLGVYGPLMQEMGRAAGAHPDELVFALLKAGNANLCY-DGQNFF-DTDHPVYPNVDGTGTA 153 (304) Q Consensus 76 ~nk~fe~tv~v~R~~I~dDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~cy-DGq~fF-~tdH~v~~~~~~tg~~ 153 (304) +.+|||++|+||||+|+|||||+|+++++.||++|+++++++||++|.+ ||+|+ |||+|| |+||. T Consensus 439 ~l~tyG~~~~iTRqaiINDDL~a~~~ip~~~g~aA~~~~~~~vy~~l~~--Np~~~~DGk~LF~hA~H~----------- 505 (652) T protein:vir:79 439 ALATYGELFSITRQAIINDDLNMLTDVPMKLGRAAKSTIADLVYAILTS--NPKISTDNVSLFDKAKHA----------- 505 (652) T ss_pred eeecccCeeeeehheeeccchhHHHHHHHHHHHHHHHHHHHHHHHHHhc--CcccccCCceeecccccc----------- Confidence 9999999999999999999999999999999999999999999999998 89996 999999 89994 Q ss_pred hhhhhhhcccCCCCccceeccCCccchhhhhhhccccchhhcccCcccccccccceEEEeeccccccccchhhhhccccc Q lcl|NC_020198. 154 TTVSNLFAPAADPGAAWYLLDTSRSLKPLIYQERMKPSFTSMTKEDDEQVFMADEYRYGVRSRCNVGFGFWQLAAMSTEE 233 (304) Q Consensus 154 ~s~snl~~~~~~~g~~w~L~d~~~~~kP~i~Q~r~~~~~~~~~~~~~~~vf~~~~~~~Gvd~R~n~G~g~wq~a~~s~~~ 233 (304) |+. ..++ T Consensus 506 ----Nl~---------------------------------------------------------------------~~aa 512 (652) T protein:vir:79 506 ----NVL---------------------------------------------------------------------ESAA 512 (652) T ss_pred ----ccc---------------------------------------------------------------------cccc Confidence 221 0246 Q ss_pred cchhHHHHHHHHHHHhccCCCceeceecCeEEecchHHHHHHHHHhhhccC-----CCCcceecceeeEEeccccC Q lcl|NC_020198. 234 LNQVNFEKVYDAMRNQKADGGRPLDIRPNLLVVPTTLRSKAKEVVGVQRLA-----NGADNPNFELVQVLDTAWLN 304 (304) Q Consensus 234 l~~~~l~aar~aM~~~k~~~G~~L~i~P~~LvVpp~le~~A~~ll~~~~~~-----~g~~N~~~g~~~~iv~p~Ld 304 (304) |+++++++||++|++||+ ++++|+|+|.+|||||+||++|+||+++..++ .|..||+++.+++||+|||| T Consensus 513 ~~~~~l~~ar~aM~~Qk~-g~~~l~i~P~~llvp~~le~~a~~ll~s~~v~~a~~~~~~~Np~~~~~~~i~eprL~ 587 (652) T protein:vir:79 513 MDVASLDKARQLMRVQKE-GERHLNIRPAFVLVPTAMESVANQVIRSSSVKGADINAGIINPVKDFATVIAEPRLD 587 (652) T ss_pred CCHHHHHHHHHHHHHhcc-CCccccccccEEEecchhHHHHHHHhccCCCcccccccccccccccccccccccccC Confidence 788999999999999995 45789999999999999999999999886543 36789999999999999998 No 7 >protein:vir:103886 Length: 302 # NCBI annotation: putative major head subunit protein # Family: family:all:776 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938242;genbank:gi:38229147;genbank:GeneID:2648201 Probab=99.91 E-value=4.7e-30 Score=181.38 Aligned_cols=174 Identities=22% Similarity=0.370 Sum_probs=112.1 Q ss_pred CCCccHHHH--HHHHHHHHHHHHHHHhhcchhhcceEEEecCCccccccccccCCccchhcccceeecccccccceeeee Q lcl|NC_020198. 1 MAIITPALI--SALKTSFQKHFQDALATAPSTYLQVATVIPSTTASNTYGWLGQFPKLREWIGQRVIKDMAAQGYQITNK 78 (304) Q Consensus 1 maii~~~~l--~~l~~~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~lrEw~Ge~~~~~l~~~~~~i~nk 78 (304) -+-.++..= +.+|.+-++...+ ..+++... T Consensus 122 ~~g~~~~~~DG~~fF~~dH~~g~~-----------------------------------------~~~N~g~~------- 153 (302) T protein:vir:10 122 NGAFTKPCFDGQYFIDTDHPVGDA-----------------------------------------SVSNKGTA------- 153 (302) T ss_pred hccCCCcccCCcceeccccccccc-----------------------------------------ccccccch------- Confidence 000000000 0122111110000 00000000 Q ss_pred cccceeecchhhhhcCCcc----hhHHHHHHHHHHHHhcHHHHHHH-HHhccCCCcccCcccccccccccccccccccch Q lcl|NC_020198. 79 LFESTVGVKRTDIEDDNLG----VYGPLMQEMGRAAGAHPDELVFA-LLKAGNANLCYDGQNFFDTDHPVYPNVDGTGTA 153 (304) Q Consensus 79 ~fe~tv~v~R~~I~dDdlG----~~~~~~~~~G~aAa~~~~~lv~~-lL~~g~~~~cyDGq~fF~tdH~v~~~~~~tg~~ 153 (304) .+.. +..++..+.++ ++..+....|+....+|+.||+. .|..+....|++++++++++||+++. - T Consensus 154 ~~~~----~~~~l~~~~~~aa~~am~~~k~~~G~~L~i~P~~LiVp~~le~~A~~ll~~~~~~~g~~Np~~g~----~-- 223 (302) T protein:vir:10 154 PLSN----ASQAAAKAGYGAARTAMKKFKDEEGRSLNVSPNVLLVGPALEDVAKMLLTNPKLADNTPNPYVGT----A-- 223 (302) T ss_pred hhhh----cccccchHHHHHHHHHHHHHhhhcccccccCCCEEEecchhHHHHHHHhhccccCCCCcceeccc----e-- Confidence 0000 00111111111 12223444588889999999996 67777788999999999999997532 1 Q ss_pred hhhhhhhcccCCCCccceeccCCccchhhhhhhccccchhhcccCcccccccccceEEEeeccccccccchhhhhccccc Q lcl|NC_020198. 154 TTVSNLFAPAADPGAAWYLLDTSRSLKPLIYQERMKPSFTSMTKEDDEQVFMADEYRYGVRSRCNVGFGFWQLAAMSTEE 233 (304) Q Consensus 154 ~s~snl~~~~~~~g~~w~L~d~~~~~kP~i~Q~r~~~~~~~~~~~~~~~vf~~~~~~~Gvd~R~n~G~g~wq~a~~s~~~ 233 (304) ..+..+...++++|||+|.+++|||+|+|.|+.|++++++++++++||++.++.||||+|||+||||||++|+|+++ T Consensus 224 ---~~vv~p~L~s~~aWyL~a~~~~i~~~~l~g~~~P~~~~~~~~~~dgv~~k~~~d~Gvd~R~~~G~~~wq~a~~s~g~ 300 (302) T protein:vir:10 224 ---ELVVDGRIESDTAWFLLDTTKPVKPFIFQPRKQPEFVSQVNLDSDDVFNLRKLKFGAEARAAAGYGFWQLAYGSTGT 300 (302) T ss_pred ---EEEEeeccCCCCceEEEecCCccceEEEcCccccEEEeccCCCCCceEEEEEEEEeeeeeeecchhhhhhhhccCcc Confidence 22222334467899999999999999999999999999999999999999999999999999999999999999886 Q ss_pred cc Q lcl|NC_020198. 234 LN 235 (304) Q Consensus 234 l~ 235 (304) -+ T Consensus 301 ~~ 302 (302) T protein:vir:10 301 GA 302 (302) T ss_pred CC Confidence 55 No 8 >protein:vir:108211 Length: 318 # NCBI annotation: gp9 # Family: family:all:6420 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552338;genbank:gi:160700658;genbank:GeneID:5758931 Probab=95.90 E-value=0.00016 Score=41.41 Aligned_cols=206 Identities=14% Similarity=0.067 Sum_probs=101.3 Q ss_pred CCCccHHHHHH-----H---HHHHHHHHHHHHh-hcchhhcceEEEecCCccccccccccCCccchhcccceeecccccc Q lcl|NC_020198. 1 MAIITPALISA-----L---KTSFQKHFQDALA-TAPSTYLQVATVIPSTTASNTYGWLGQFPKLREWIGQRVIKDMAAQ 71 (304) Q Consensus 1 maii~~~~l~~-----l---~~~~~~~f~~a~~-~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~lrEw~Ge~~~~~l~~~ 71 (304) +-+=++..|.. + +.+ .+.|+++-. .+. .+ ++-.-.|+.... ++. ..-| -||+..-..... T Consensus 20 ~ll~~P~~I~~~i~e~~~~~~ia-d~lf~~~~a~~~~-~v-~f~~~~p~~~~~-d~e------~VaE-ggEiP~~~~~~G 88 (318) T protein:vir:10 20 ELVGNPLWIPTALKKMMVNQFIS-ESLFRNGGANPNG-VV-AYNEGNPSFLED-DVA------DVAE-FGEIPVSAGARG 88 (318) T ss_pred HhhCCchhHHHHHHHHHhccchh-hhhhhcccccccc-ee-EEEecccccccC-cHh------hccC-cccccccCCCCC Confidence 11112222211 1 111 222332110 000 00 000111221111 110 1111 255555555443 Q ss_pred ccee-eeecccceeecchhhhhcCCcchhHHHHHHHHHHHHhcHHHHHHHHHhccCCCcccCcccccccccccccccccc Q lcl|NC_020198. 72 GYQI-TNKLFESTVGVKRTDIEDDNLGVYGPLMQEMGRAAGAHPDELVFALLKAGNANLCYDGQNFFDTDHPVYPNVDGT 150 (304) Q Consensus 72 ~~~i-~nk~fe~tv~v~R~~I~dDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~cyDGq~fF~tdH~v~~~~~~t 150 (304) .-.| +.++||..+.||++++..-+.+.+.+.+++++.+-+++-|..+|+.|.++-++.. ++....+ T Consensus 89 ~~~ia~~~K~G~~~~vS~Em~~~n~~~~v~r~~~~l~Nti~r~~d~~a~dal~sa~t~~~-----------~~s~~w~-- 155 (318) T protein:vir:10 89 LPRTAFAVKKALGVRVSKEMIDENRVGAVNDQMLQLRNTFIRANDRSAKALLQSPIVPTL-----------AVPTAWD-- 155 (318) T ss_pred chhhhhhehhccceeccHHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccc-----------cCCcCCC-- Confidence 4444 4479999999999999999999999999999999999999999999987544332 1100000 Q ss_pred cchhhhhhhhcccCCCCccceeccCCccchhhhhhhccccchhhcccCcccccccccceEEEeeccccccccchhhhhcc Q lcl|NC_020198. 151 GTATTVSNLFAPAADPGAAWYLLDTSRSLKPLIYQERMKPSFTSMTKEDDEQVFMADEYRYGVRSRCNVGFGFWQLAAMS 230 (304) Q Consensus 151 g~~~s~snl~~~~~~~g~~w~L~d~~~~~kP~i~Q~r~~~~~~~~~~~~~~~vf~~~~~~~Gvd~R~n~G~g~wq~a~~s 230 (304) + -++... |..-+ T Consensus 156 ~----~~~~~~------------d~~~A---------------------------------------------------- 167 (318) T protein:vir:10 156 N----GGKVRT------------DIAIA---------------------------------------------------- 167 (318) T ss_pred C----cccccc------------cchhh---------------------------------------------------- Confidence 0 000000 00000 Q ss_pred ccccchhHHHHHHH-HHHHhccCCCceeceecCeEEecchHHHHH------HHHHhhhcc-----CCCCcce---eccee Q lcl|NC_020198. 231 TEELNQVNFEKVYD-AMRNQKADGGRPLDIRPNLLVVPTTLRSKA------KEVVGVQRL-----ANGADNP---NFELV 295 (304) Q Consensus 231 ~~~l~~~~l~aar~-aM~~~k~~~G~~L~i~P~~LvVpp~le~~A------~~ll~~~~~-----~~g~~N~---~~g~~ 295 (304) .++...|+. ++-..........+-+|+.||+.|.....- ++++..+.. .+.+.+- +.| + T Consensus 168 -----~e~v~~a~~~~~~a~~~~~~~~~GY~pdtIVlhP~~~~~l~~n~~~~~~y~~~a~~~~~~~~~tg~~~g~~lG-l 241 (318) T protein:vir:10 168 -----IEQISTAAPTAYPAGVGSSDEYFGFIPDTIVMHYALLPILMDNENFMKVYERNANYVSTAPDWTGNFPGSVMG-L 241 (318) T ss_pred -----hhhhhhhhhhhhhhhhhhhhhccCccceeeEECHHHHHHHhcchhhhhhhhccchhhhhcccccccccceeec-e Confidence 001111111 111122222357888999999999988776 444432211 1112232 234 9 Q ss_pred eEEeccccC Q lcl|NC_020198. 296 QVLDTAWLN 304 (304) Q Consensus 296 ~~iv~p~Ld 304 (304) ++|++|.+. T Consensus 242 ~vi~s~~~p 250 (318) T protein:vir:10 242 NVIRSRTFP 250 (318) T ss_pred EEeecCccC Confidence 999999999 No 9 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=95.42 E-value=0.0021 Score=35.25 Aligned_cols=185 Identities=14% Similarity=0.108 Sum_probs=103.4 Q ss_pred CCC--------ccHHHHHHHHHHHHHHHHHHHhhcchhhcceEEEec----CCccccccccccCCccchhcc---cceee Q lcl|NC_020198. 1 MAI--------ITPALISALKTSFQKHFQDALATAPSTYLQVATVIP----STTASNTYGWLGQFPKLREWI---GQRVI 65 (304) Q Consensus 1 mai--------i~~~~l~~l~~~~~~~f~~a~~~a~~~~~~~a~~v~----S~~~~~~y~~Lg~~P~lrEw~---Ge~~~ 65 (304) ||. |.|+.+..+. ...+++.. -+.+++.+-. .....-++..+...+.. +|+ ++... T Consensus 1 MA~~~T~~~~~~iPev~s~~v---~~~~~~~~-----~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~a-~~v~eg~~i~~ 71 (272) T protein:vir:30 1 MAVGTTKMAQMLDPEVLADMI---DAEVGKAI-----RFAPLAEVDTTLEGQPGTTLTVPKWDYIGDA-EDVAEGEAIPM 71 (272) T ss_pred CCCccccchheechHHHHHHH---HHHHHHHh-----hhhccccccccccCCCCCEEEEEEecCCCCc-ccccCCCcccc Confidence 994 4555555542 22222222 1233332211 11111112222222232 454 34566 Q ss_pred cccccccceeeeecccceeecchhhhhcCCcchhHHHHHHHHHHHHhcHHHHHHHHHhccCCCcccCccccccccccccc Q lcl|NC_020198. 66 KDMAAQGYQITNKLFESTVGVKRTDIEDDNLGVYGPLMQEMGRAAGAHPDELVFALLKAGNANLCYDGQNFFDTDHPVYP 145 (304) Q Consensus 66 ~~l~~~~~~i~nk~fe~tv~v~R~~I~dDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~cyDGq~fF~tdH~v~~ 145 (304) .++....-+++.++++..+.|+++++.+....+...+.++++++.++..+..+++.|+...+ . T Consensus 72 ~~~~~~~~~~~~~~~~~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a~~--------------~--- 134 (272) T protein:vir:30 72 TQLGFKKTTMTIKKAGKGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSKSTQ--------------T--- 134 (272) T ss_pred cccccceEEEEeeeeeeeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhccccc--------------c--- Confidence 66777777888899999999999999998889999999999999999999888776533100 0 Q ss_pred ccccccchhhhhhhhcccCCCCccceeccCCccchhhhhhhccccchhhcccCcccccccccceEEEeeccccccccchh Q lcl|NC_020198. 146 NVDGTGTATTVSNLFAPAADPGAAWYLLDTSRSLKPLIYQERMKPSFTSMTKEDDEQVFMADEYRYGVRSRCNVGFGFWQ 225 (304) Q Consensus 146 ~~~~tg~~~s~snl~~~~~~~g~~w~L~d~~~~~kP~i~Q~r~~~~~~~~~~~~~~~vf~~~~~~~Gvd~R~n~G~g~wq 225 (304) T Consensus 135 -------------------------------------------------------------------------------- 134 (272) T protein:vir:30 135 -------------------------------------------------------------------------------- 134 (272) T ss_pred -------------------------------------------------------------------------------- Confidence Q ss_pred hhhccccccchhHHHHHHHHHHHhccCCCceeceecCeEEecchHHHHHHHHHhhhcc--CCCCcc--------eeccee Q lcl|NC_020198. 226 LAAMSTEELNQVNFEKVYDAMRNQKADGGRPLDIRPNLLVVPTTLRSKAKEVVGVQRL--ANGADN--------PNFELV 295 (304) Q Consensus 226 ~a~~s~~~l~~~~l~aar~aM~~~k~~~G~~L~i~P~~LvVpp~le~~A~~ll~~~~~--~~g~~N--------~~~g~~ 295 (304) .++..+.+.+.+|+..+... + ..++.+||+|.....-++.-.-+.. .....+ -+.| + T Consensus 135 ----~~~~~t~d~i~da~~~l~~~----~----~~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~g~ig~i~G-~ 201 (272) T protein:vir:30 135 ----VEATATVDGVSKALDIFNDE----D----DAETVIVMNPADASTLRLDAAKEWLGATEVGANRVVSGVYGEVLG-V 201 (272) T ss_pred ----cccccCHHHHHHHHHHHhcc----C----CCccEEEEcHHHHHHHHHhccccccccccccccccccccchhhcC-e Confidence 00112334555555555322 2 3367899999865544332111111 111111 1345 6 Q ss_pred eEEeccccC Q lcl|NC_020198. 296 QVLDTAWLN 304 (304) Q Consensus 296 ~~iv~p~Ld 304 (304) .|++++.++ T Consensus 202 ~Vi~s~~~p 210 (272) T protein:vir:30 202 QIVRSRKCP 210 (272) T ss_pred eEEEcCCCC Confidence 899999998 No 10 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=95.42 E-value=0.0021 Score=35.25 Aligned_cols=185 Identities=14% Similarity=0.108 Sum_probs=103.4 Q ss_pred CCC--------ccHHHHHHHHHHHHHHHHHHHhhcchhhcceEEEec----CCccccccccccCCccchhcc---cceee Q lcl|NC_020198. 1 MAI--------ITPALISALKTSFQKHFQDALATAPSTYLQVATVIP----STTASNTYGWLGQFPKLREWI---GQRVI 65 (304) Q Consensus 1 mai--------i~~~~l~~l~~~~~~~f~~a~~~a~~~~~~~a~~v~----S~~~~~~y~~Lg~~P~lrEw~---Ge~~~ 65 (304) ||. |.|+.+..+. ...+++.. -+.+++.+-. .....-++..+...+.. +|+ ++... T Consensus 1 MA~~~T~~~~~~iPev~s~~v---~~~~~~~~-----~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~a-~~v~eg~~i~~ 71 (272) T protein:vir:98 1 MAVGTTKMAQMLDPEVLADMI---DAEVGKAI-----RFAPLAEVDTTLEGQPGTTLTVPKWDYIGDA-EDVAEGEAIPM 71 (272) T ss_pred CCCccccchheechHHHHHHH---HHHHHHHh-----hhhccccccccccCCCCCEEEEEEecCCCCc-ccccCCCcccc Confidence 994 4555555542 22222222 1233332211 11111112222222232 454 34566 Q ss_pred cccccccceeeeecccceeecchhhhhcCCcchhHHHHHHHHHHHHhcHHHHHHHHHhccCCCcccCccccccccccccc Q lcl|NC_020198. 66 KDMAAQGYQITNKLFESTVGVKRTDIEDDNLGVYGPLMQEMGRAAGAHPDELVFALLKAGNANLCYDGQNFFDTDHPVYP 145 (304) Q Consensus 66 ~~l~~~~~~i~nk~fe~tv~v~R~~I~dDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~cyDGq~fF~tdH~v~~ 145 (304) .++....-+++.++++..+.|+++++.+....+...+.++++++.++..+..+++.|+...+ . T Consensus 72 ~~~~~~~~~~~~~~~~~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a~~--------------~--- 134 (272) T protein:vir:98 72 TQLGFKKTTMTIKKAGKGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSKSTQ--------------T--- 134 (272) T ss_pred cccccceEEEEeeeeeeeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhccccc--------------c--- Confidence 66777777888899999999999999998889999999999999999999888776533100 0 Q ss_pred ccccccchhhhhhhhcccCCCCccceeccCCccchhhhhhhccccchhhcccCcccccccccceEEEeeccccccccchh Q lcl|NC_020198. 146 NVDGTGTATTVSNLFAPAADPGAAWYLLDTSRSLKPLIYQERMKPSFTSMTKEDDEQVFMADEYRYGVRSRCNVGFGFWQ 225 (304) Q Consensus 146 ~~~~tg~~~s~snl~~~~~~~g~~w~L~d~~~~~kP~i~Q~r~~~~~~~~~~~~~~~vf~~~~~~~Gvd~R~n~G~g~wq 225 (304) T Consensus 135 -------------------------------------------------------------------------------- 134 (272) T protein:vir:98 135 -------------------------------------------------------------------------------- 134 (272) T ss_pred -------------------------------------------------------------------------------- Confidence Q ss_pred hhhccccccchhHHHHHHHHHHHhccCCCceeceecCeEEecchHHHHHHHHHhhhcc--CCCCcc--------eeccee Q lcl|NC_020198. 226 LAAMSTEELNQVNFEKVYDAMRNQKADGGRPLDIRPNLLVVPTTLRSKAKEVVGVQRL--ANGADN--------PNFELV 295 (304) Q Consensus 226 ~a~~s~~~l~~~~l~aar~aM~~~k~~~G~~L~i~P~~LvVpp~le~~A~~ll~~~~~--~~g~~N--------~~~g~~ 295 (304) .++..+.+.+.+|+..+... + ..++.+||+|.....-++.-.-+.. .....+ -+.| + T Consensus 135 ----~~~~~t~d~i~da~~~l~~~----~----~~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~g~ig~i~G-~ 201 (272) T protein:vir:98 135 ----VEATATVDGVSKALDIFNDE----D----DAETVIVMNPADASTLRLDAAKEWLGATEVGANRVVSGVYGEVLG-V 201 (272) T ss_pred ----cccccCHHHHHHHHHHHhcc----C----CCccEEEEcHHHHHHHHHhccccccccccccccccccccchhhcC-e Confidence 00112334555555555322 2 3367899999865544332111111 111111 1345 6 Q ss_pred eEEeccccC Q lcl|NC_020198. 296 QVLDTAWLN 304 (304) Q Consensus 296 ~~iv~p~Ld 304 (304) .|++++.++ T Consensus 202 ~Vi~s~~~p 210 (272) T protein:vir:98 202 QIVRSRKCP 210 (272) T ss_pred eEEEcCCCC Confidence 899999998 No 11 >protein:vir:79928 Length: 393 # NCBI annotation: major head protein # Family: family:all:30335 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429616;genbank:gi:156564106;genbank:GeneID:5525693 Probab=93.27 E-value=0.00073 Score=37.77 Aligned_cols=262 Identities=16% Similarity=0.209 Sum_probs=133.2 Q ss_pred CC-----CccHHHHHHHHHHHHHHHHHHHhhcchhh------cceEEEecCCccccccccccCCccchhc-c---cceee Q lcl|NC_020198. 1 MA-----IITPALISALKTSFQKHFQDALATAPSTY------LQVATVIPSTTASNTYGWLGQFPKLREW-I---GQRVI 65 (304) Q Consensus 1 ma-----ii~~~~l~~l~~~~~~~f~~a~~~a~~~~------~~~a~~v~S~~~~~~y~~Lg~~P~lrEw-~---Ge~~~ 65 (304) |+ |+-|..|.. .+ .++|+|-| ++++. ...++..+.-+| -||+. | ||+.- T Consensus 74 mtt~~a~IliP~vis~---v~-------~Eaaepl~~~~kl~qk~~L---~~Grsm~F~~~g---~~Ra~~IgEGgE~~~ 137 (393) T protein:vir:79 74 MATPSAQILIPRVIVG---TM-------REAAEPLYIGTKMLQKIRL---KSGQSMIFPSIG---IMRAYDVAEGQEIPE 137 (393) T ss_pred hcCCCcceechhhhhh---hh-------hhcccchhHHHHHHHHHhh---hcCcceeccchh---eeeeccccccccccc Confidence 22 222222221 11 22333322 11111 011122222111 45555 2 67777 Q ss_pred cccc---cccceeeeecccceeecchhhhhcCCcchhHHHHHHHHHHHHhcHHHHHHHHHhccCCCcccCcccccccccc Q lcl|NC_020198. 66 KDMA---AQGYQITNKLFESTVGVKRTDIEDDNLGVYGPLMQEMGRAAGAHPDELVFALLKAGNANLCYDGQNFFDTDHP 142 (304) Q Consensus 66 ~~l~---~~~~~i~nk~fe~tv~v~R~~I~dDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~cyDGq~fF~tdH~ 142 (304) .+|+ +.+-.++.+++|-.|+++-++|+|-++.+..-+...+||+-+++-++.+|.++++ .....|||-.==---|| T Consensus 138 ~sld~~T~dsv~~~~gK~G~~Ia~SqEmIsDSg~Dvin~~l~aA~RaMaRkKee~a~n~fk~-~ghtvfDa~st~t~ahp 216 (393) T protein:vir:79 138 DSIDWQTHESPEIRVGKSGIRLRFTDEMISDSQWDLMSMMIKQAGRAMGRHKEQKAYHQFRS-HGHTVFDNYSTNKLAHT 216 (393) T ss_pred cchhhhcCCceeEEechhhhhhhhHHHHhhcchHHHHHHHHHHHHHHHHhhhHHHHHhhhhc-ccceeeeccccCcccee Confidence 7776 3456788899999999999999999999999999999999999999999999976 34567888221112466 Q ss_pred ccccccc-ccchhhhhhhhcc------cCCC-------Cccceec----------------------cCCccchhhhhhh Q lcl|NC_020198. 143 VYPNVDG-TGTATTVSNLFAP------AADP-------GAAWYLL----------------------DTSRSLKPLIYQE 186 (304) Q Consensus 143 v~~~~~~-tg~~~s~snl~~~------~~~~-------g~~w~L~----------------------d~~~~~kP~i~Q~ 186 (304) +|-++++ -...+|..++.+- ...+ -.+|-++ -+++.+-|-..|. T Consensus 217 tGr~~~~~qNGTlSleDllDm~~av~~~hyt~svi~MHPLAWnv~AKna~me~~~~na~gN~~~~~~~ts~algp~~i~~ 296 (393) T protein:vir:79 217 TGLDKNGVQNDTFSAEDFLDLIIAVMANEYTPSDLMMHPLAWTVFAKNELMGSLQANPYGNYPAKGAPSSMALGPDSIQG 296 (393) T ss_pred ecCCccccccccccHHHHHHHHHHHhcccCCcceEEEcCchhhhhhhhhhhcceeeccccccCccccchhhhhchhhhcc Confidence 6522221 1123444443220 1111 1235443 4788889999999 Q ss_pred ccccchhhcccCcccccccccceEEEeeccccccccchhhhhccccccchhHHHHHHHHHHHhcc--CCCc-eeceecCe Q lcl|NC_020198. 187 RMKPSFTSMTKEDDEQVFMADEYRYGVRSRCNVGFGFWQLAAMSTEELNQVNFEKVYDAMRNQKA--DGGR-PLDIRPNL 263 (304) Q Consensus 187 r~~~~~~~~~~~~~~~vf~~~~~~~Gvd~R~n~G~g~wq~a~~s~~~l~~~~l~aar~aM~~~k~--~~G~-~L~i~P~~ 263 (304) |-+++|.-...|==+.--....|-|=.=.|-|+|.=| -+..|+.+.++.-..-+++.|= ..|- .|+=.-. T Consensus 297 ~~~~nlnv~~sPfvp~d~k~~rFd~~~Vd~NnvgvlL------V~D~i~tdq~ddk~rdiq~iKl~ERYG~gvLn~gka- 369 (393) T protein:vir:79 297 RLPFNFNVNLSPFIPLDKKSRRFDVYAVDRNNVGVLL------VRDDLKTDQWDEKARGLQNIKMIERYGIGILNEGKA- 369 (393) T ss_pred ccccceeEEEecccccccccceeeEEEeecCCceEEE------EecCcceeccccccccceeeeeeeeeceeeeeCCce- Confidence 9888877665552221112445555444566777655 2347777777654444433332 1121 1110000 Q ss_pred EEecchHHHHHHHHHhhhcc-CCCC Q lcl|NC_020198. 264 LVVPTTLRSKAKEVVGVQRL-ANGA 287 (304) Q Consensus 264 LvVpp~le~~A~~ll~~~~~-~~g~ 287 (304) +.|- -+-+.+...-....+ +-|. T Consensus 370 iava-kNI~~~k~y~~P~~~~~~~~ 393 (393) T protein:vir:79 370 IAVA-KNISMDKSYAEPMLIKNVGN 393 (393) T ss_pred EEEE-ecceeecccccchhhhccCC Confidence 0000 000000000000000 0011 No 12 >protein:vir:3613 Length: 272 # NCBI annotation: MHP # Family: family:all:522 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112699;genbank:gi:13786567;genbank:GeneID:921035 Probab=92.87 E-value=0.0096 Score=31.63 Aligned_cols=185 Identities=14% Similarity=0.208 Sum_probs=101.1 Q ss_pred CCC--------ccHHHHHHHHHHHHHHHHHHHhhcchhhcceEEEecC-------CccccccccccCCccchhcccceee Q lcl|NC_020198. 1 MAI--------ITPALISALKTSFQKHFQDALATAPSTYLQVATVIPS-------TTASNTYGWLGQFPKLREWIGQRVI 65 (304) Q Consensus 1 mai--------i~~~~l~~l~~~~~~~f~~a~~~a~~~~~~~a~~v~S-------~~~~~~y~~Lg~~P~lrEw~Ge~~~ 65 (304) ||. |.|+++.... ...|++.+ -+..++..-+. +-....|.-+|+.-.+.| -++... T Consensus 1 ma~~~T~~~d~iiPev~~~~v---~~~~~~~~-----~~~~~~~~~~~l~g~~G~ti~iP~~~~~gda~~~~e-g~~i~~ 71 (272) T protein:vir:36 1 MSKQKTTLADLVNPEVLAPIV---SYELNKAL-----RFAPLAQVDTTLQGQPGNTLKFPAFTYIGDAADVAE-GGEISL 71 (272) T ss_pred CCCcceehhhhhchHHHHHHH---HHHHHhhh-----hhccccccccccccCCCCEEEEeeeccCccccccCC-CCccCh Confidence 883 4455555532 22333332 12223322111 111112222333322222 145666 Q ss_pred cccccccceeeeecccceeecchhhhhcCCcchhHHHHHHHHHHHHhcHHHHHHHHHhccCCCcccCccccccccccccc Q lcl|NC_020198. 66 KDMAAQGYQITNKLFESTVGVKRTDIEDDNLGVYGPLMQEMGRAAGAHPDELVFALLKAGNANLCYDGQNFFDTDHPVYP 145 (304) Q Consensus 66 ~~l~~~~~~i~nk~fe~tv~v~R~~I~dDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~cyDGq~fF~tdH~v~~ 145 (304) ..+....-+.+.+.+++.++|+..+...---.....+.++++++.++..|..+++.|..... T Consensus 72 ~~lt~~~~~~~i~~~~k~~~vtD~~~~~~~~d~~~~~~~~~a~~~a~~~d~~i~~~l~~~~~------------------ 133 (272) T protein:vir:36 72 DKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLANKVDDDLLSAAKTTSQ------------------ 133 (272) T ss_pred hhcCCcceeEeeehhhccccccHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhccccc------------------ Confidence 77777777788889999999999888766556677788888888888888777766532000 Q ss_pred ccccccchhhhhhhhcccCCCCccceeccCCccchhhhhhhccccchhhcccCcccccccccceEEEeeccccccccchh Q lcl|NC_020198. 146 NVDGTGTATTVSNLFAPAADPGAAWYLLDTSRSLKPLIYQERMKPSFTSMTKEDDEQVFMADEYRYGVRSRCNVGFGFWQ 225 (304) Q Consensus 146 ~~~~tg~~~s~snl~~~~~~~g~~w~L~d~~~~~kP~i~Q~r~~~~~~~~~~~~~~~vf~~~~~~~Gvd~R~n~G~g~wq 225 (304) + T Consensus 134 ---------~---------------------------------------------------------------------- 134 (272) T protein:vir:36 134 ---------T---------------------------------------------------------------------- 134 (272) T ss_pred ---------c---------------------------------------------------------------------- Confidence 0 Q ss_pred hhhccccccchhHHHHHHHHHHHhccCCCceeceecCeEEecchHHHHHHHHHhhhccC--C-------CCcceecceee Q lcl|NC_020198. 226 LAAMSTEELNQVNFEKVYDAMRNQKADGGRPLDIRPNLLVVPTTLRSKAKEVVGVQRLA--N-------GADNPNFELVQ 296 (304) Q Consensus 226 ~a~~s~~~l~~~~l~aar~aM~~~k~~~G~~L~i~P~~LvVpp~le~~A~~ll~~~~~~--~-------g~~N~~~g~~~ 296 (304) ....++.+.+..|++.|.... . .+++++|+|.....-++....+... . |..-.+.| ++ T Consensus 135 ----~~~~~~~d~i~~A~~~lgd~~----~----~~~~ivv~p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~~~G-~~ 201 (272) T protein:vir:36 135 ----VSTKANVDGVQAALDIFNDED----A----QAYVLIVNPKDAAKIRKDANAKNIGSEVGANALINGTYADVLG-AQ 201 (272) T ss_pred ----ccccccHHHHHHHHHHhhhcC----C----CceEEEEcHHHHHHHhcccccccccccccccceeeeccceecC-ee Confidence 011233445556666665332 2 3578999998655444433322221 1 11224556 69 Q ss_pred EEeccccC Q lcl|NC_020198. 297 VLDTAWLN 304 (304) Q Consensus 297 ~iv~p~Ld 304 (304) +|++..+. T Consensus 202 Vv~s~~~p 209 (272) T protein:vir:36 202 IVRSKKLA 209 (272) T ss_pred EEEeCCCC Confidence 99999998 No 13 >protein:vir:1638 Length: 298 # NCBI annotation: Structural protein # Family: family:all:966 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695059;genbank:gi:23455750;genbank:GeneID:955469 Probab=92.74 E-value=0.0041 Score=33.66 Aligned_cols=229 Identities=10% Similarity=0.061 Sum_probs=104.5 Q ss_pred CCCccHHHHHHHHHHHHHHHHHHHhhcchhhcceEEEecCCccccccccccCCccchhccccee---ecccccccceeee Q lcl|NC_020198. 1 MAIITPALISALKTSFQKHFQDALATAPSTYLQVATVIPSTTASNTYGWLGQFPKLREWIGQRV---IKDMAAQGYQITN 77 (304) Q Consensus 1 maii~~~~l~~l~~~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~lrEw~Ge~~---~~~l~~~~~~i~n 77 (304) ||.-....+-. .+...+-+.+... +...++|+++|-.+...++.++..-|.- .|+||-. ..+++=..-+++- T Consensus 1 ma~~gG~lvp~---~~~~~ii~~~~~~-s~i~~l~~~~~~~~~~~~ip~~~~~~~a-~~v~E~~~~~~~~~~f~~v~l~~ 75 (298) T protein:vir:16 1 MVLNKGTLFDP---TLVTDLISKVAGK-SSIARLSAQKPIPFNGEKVFTFTMDSEI-DVVAESGKKTHGGVTLAPQTMVP 75 (298) T ss_pred CcccCcceech---hHHHHHHHHHHhh-hhhhhhcceeeccCCceEEEEEecCcce-EEecCCccccccccceeEEEEee Confidence 88644332211 1112222222222 4466778888765555567766666654 7887643 3333334557788 Q ss_pred ecccceeecchhhh---hcCCcchhHHHHHHHHHHHHhcHHHHHHHHHhccCCCcccCccc--ccccccccccccccccc Q lcl|NC_020198. 78 KLFESTVGVKRTDI---EDDNLGVYGPLMQEMGRAAGAHPDELVFALLKAGNANLCYDGQN--FFDTDHPVYPNVDGTGT 152 (304) Q Consensus 78 k~fe~tv~v~R~~I---~dDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~cyDGq~--fF~tdH~v~~~~~~tg~ 152 (304) ++++..+.||++.+ .++..++..-+..+++++.++..++.++. |..+. +|++ +....+ ... T Consensus 76 ~k~a~~~~iS~ell~~s~d~~~~l~~~i~~~la~ai~~~~d~~~l~----G~~~~--~g~~~~~~~~~~--------~~~ 141 (298) T protein:vir:16 76 IKVEYGARISDEFMYASDEEKINILQEFNDGFAKKVARGIDLMAFH----GVNPR--LGTASAVIGTNH--------FDS 141 (298) T ss_pred eeEEEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHHhhc----cccCC--CCcccccccccc--------ccc Confidence 99999999999999 56678999999999999999998887763 32221 2222 111111 000 Q ss_pred hhhhhhhhcccCCCCccceeccCCccchhhhhhhccccchhhcccCcccccccccceEEEeeccccccccchhhhhcccc Q lcl|NC_020198. 153 ATTVSNLFAPAADPGAAWYLLDTSRSLKPLIYQERMKPSFTSMTKEDDEQVFMADEYRYGVRSRCNVGFGFWQLAAMSTE 232 (304) Q Consensus 153 ~~s~snl~~~~~~~g~~w~L~d~~~~~kP~i~Q~r~~~~~~~~~~~~~~~vf~~~~~~~Gvd~R~n~G~g~wq~a~~s~~ 232 (304) .+.+..... ...+-+.+ .+.-++.+... .+ +.+++ | T Consensus 142 --~~~~~~~~~---~~~~~~~~---~i~~~~~~~~~----------~~---------------~~~~~---~-------- 177 (298) T protein:vir:16 142 --KVTQKVEAP---RGIADPNG---AIENAVELLTG----------VD---------------ADVTG---I-------- 177 (298) T ss_pred --ccccccccc---cccccHHH---HHHHHHHHhhh----------cC---------------CCccE---E-------- Confidence 000000000 00010110 01111111000 00 00000 1 Q ss_pred ccchhHHHHHHHHHHHhccCCCceece------ecCeEEecchHHHHHHHHHhhhccCCC-Ccc---ee----------- Q lcl|NC_020198. 233 ELNQVNFEKVYDAMRNQKADGGRPLDI------RPNLLVVPTTLRSKAKEVVGVQRLANG-ADN---PN----------- 291 (304) Q Consensus 233 ~l~~~~l~aar~aM~~~k~~~G~~L~i------~P~~LvVpp~le~~A~~ll~~~~~~~g-~~N---~~----------- 291 (304) -++ ...+.++++.|+..|+||-. .|..|. .+-++-++.++.+ .++ .+ T Consensus 178 vmn----~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~l~--------G~PV~~~~~v~~~~~~~~~~~~~GDfs~~~~~~ 245 (298) T protein:vir:16 178 AIN----PSFRSALAKQKDLQDNALFPELKWGATPDTIN--------GLPVDVNKTVSDMSLTQRDRAIIGDFANGFKWG 245 (298) T ss_pred EEc----HHHHHHHHHhhccCCCeeecCcccCCCCceec--------ceeeEEecccccccCCCccEEEEeeccceEEEE Confidence 111 12346678888989988732 122111 0111111122211 111 11 Q ss_pred -cceeeEEeccccC Q lcl|NC_020198. 292 -FELVQVLDTAWLN 304 (304) Q Consensus 292 -~g~~~~iv~p~Ld 304 (304) ++.+++-+.++-| T Consensus 246 ~~~~~~~~~~~~~~ 259 (298) T protein:vir:16 246 YAKEVPLEVIQYGD 259 (298) T ss_pred EecCceEEEeeccC Confidence 2233444444434 No 14 >protein:vir:105334 Length: 276 # NCBI annotation: putative phage major capsid protein # Family: family:all:522 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950669;genbank:gi:119967839;genbank:GeneID:4643213 Probab=91.31 E-value=0.016 Score=30.36 Aligned_cols=187 Identities=13% Similarity=0.141 Sum_probs=110.1 Q ss_pred CCC--------ccHHHHHHHHHHHHHHHHHHHhhcchhhcceEEEecC----CccccccccccCCccchhcc--cceeec Q lcl|NC_020198. 1 MAI--------ITPALISALKTSFQKHFQDALATAPSTYLQVATVIPS----TTASNTYGWLGQFPKLREWI--GQRVIK 66 (304) Q Consensus 1 mai--------i~~~~l~~l~~~~~~~f~~a~~~a~~~~~~~a~~v~S----~~~~~~y~~Lg~~P~lrEw~--Ge~~~~ 66 (304) ||- |.|+++.... ++.+.++ .-+.++|....+ ....-+..........+++. .+.... T Consensus 1 Ma~~~T~l~d~i~Pev~~~~v-------~~~~~~~-~~~~~~~~~~~~l~g~~G~ti~iP~~~~igda~~~~eg~~i~~~ 72 (276) T protein:vir:10 1 MAQGTTTKSTQIVPEVLAPMM-------QAELDKK-LRFAQFADIDSTLVGQPGDTLTFPAFVYSGDATVVPEGQKIPVD 72 (276) T ss_pred CCcceeehhhhhchHHHHHHH-------HHHHHhh-hhhcccceecccccCCCCCEEEeeeecCCCccccccCCCccCcc Confidence 883 4555555542 2233222 222344433221 11112222222222233332 456777 Q ss_pred ccccccceeeeecccceeecchhhhhcCCcchhHHHHHHHHHHHHhcHHHHHHHHHhccCCCcccCcccccccccccccc Q lcl|NC_020198. 67 DMAAQGYQITNKLFESTVGVKRTDIEDDNLGVYGPLMQEMGRAAGAHPDELVFALLKAGNANLCYDGQNFFDTDHPVYPN 146 (304) Q Consensus 67 ~l~~~~~~i~nk~fe~tv~v~R~~I~dDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~cyDGq~fF~tdH~v~~~ 146 (304) .|....-+.+.+.+++.++++..+..---...+..+.+++|.+-++..+.-+++.|+.+... T Consensus 73 ~lt~~~~~a~i~~~~k~~~~tD~a~~~~~~dp~~~~~~~~~~~~a~~~d~~~~~~l~~~~~~------------------ 134 (276) T protein:vir:10 73 KIETNRREAKIHKIGKGTDITDEALLSGYGDPQGEAVRQHGLAIANKVDNDVLEALRGTKLT------------------ 134 (276) T ss_pred ccccceeeEEeehccccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhccccc------------------ Confidence 78777777788899999999999998866667888899999999988888887766531000 Q ss_pred cccccchhhhhhhhcccCCCCccceeccCCccchhhhhhhccccchhhcccCcccccccccceEEEeeccccccccchhh Q lcl|NC_020198. 147 VDGTGTATTVSNLFAPAADPGAAWYLLDTSRSLKPLIYQERMKPSFTSMTKEDDEQVFMADEYRYGVRSRCNVGFGFWQL 226 (304) Q Consensus 147 ~~~tg~~~s~snl~~~~~~~g~~w~L~d~~~~~kP~i~Q~r~~~~~~~~~~~~~~~vf~~~~~~~Gvd~R~n~G~g~wq~ 226 (304) T Consensus 135 -------------------------------------------------------------------------------- 134 (276) T protein:vir:10 135 -------------------------------------------------------------------------------- 134 (276) T ss_pred -------------------------------------------------------------------------------- Confidence Q ss_pred hhccccccchhHHHHHHHHHHHhccCCCceeceecCeEEecchHHHHHHHHHhhhcc--CC--------CCcceecceee Q lcl|NC_020198. 227 AAMSTEELNQVNFEKVYDAMRNQKADGGRPLDIRPNLLVVPTTLRSKAKEVVGVQRL--AN--------GADNPNFELVQ 296 (304) Q Consensus 227 a~~s~~~l~~~~l~aar~aM~~~k~~~G~~L~i~P~~LvVpp~le~~A~~ll~~~~~--~~--------g~~N~~~g~~~ 296 (304) .+..+++.+.+.+|++.|.... ..++.|+|+|.....-++....+.+ .. |...-+.| ++ T Consensus 135 --~~~~~~t~d~i~~A~~~lgd~~--------~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G-~~ 203 (276) T protein:vir:10 135 --VSADIGTLAGLEAAIDTFDDED--------LEPMVLFINPKDAGKLRSSASDNFTRATELGDNIIVKGAFGEALG-AV 203 (276) T ss_pred --ccccccCHHHHHHHHHHhcccc--------CcccEEEEcHHHHHHHHHhccccccccccccccceeccccceecc-ee Confidence 0112344555667777765432 2356899999988777665332222 11 22234455 69 Q ss_pred EEeccccC Q lcl|NC_020198. 297 VLDTAWLN 304 (304) Q Consensus 297 ~iv~p~Ld 304 (304) ||+++.+. T Consensus 204 Vi~s~~~p 211 (276) T protein:vir:10 204 IVRSKKLD 211 (276) T ss_pred EEEcCCCC Confidence 99999998 No 15 >protein:vir:9574 Length: 300 # NCBI annotation: gp40 # Family: family:all:966 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862879;genbank:gi:32469471;genbank:GeneID:1461316 Probab=89.70 E-value=0.012 Score=31.07 Aligned_cols=228 Identities=9% Similarity=0.036 Sum_probs=96.9 Q ss_pred CCCccHHHHHHHHHHHHHHHHHHHhhcchhhcceEEEecCCccccccccccCCccchhcccceeecc---cccccceeee Q lcl|NC_020198. 1 MAIITPALISALKTSFQKHFQDALATAPSTYLQVATVIPSTTASNTYGWLGQFPKLREWIGQRVIKD---MAAQGYQITN 77 (304) Q Consensus 1 maii~~~~l~~l~~~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~lrEw~Ge~~~~~---l~~~~~~i~n 77 (304) ||.-+...=.-+-..+...+-+.+.. .+.-.++|+.+|-.....+|..+..-|. -.|+||-.-.. ++=..-+++- T Consensus 1 ma~~t~~~G~lip~~~~~~ii~~l~~-~s~i~~l~~~~~~~~~~~~~p~~~~~~~-a~wv~Eg~~~~~s~~~f~~v~l~~ 78 (300) T protein:vir:95 1 MSEAQLSKGNLFNPELVTKVINKVKG-HSSIAKLSPQKPIPFNGQREFVFDFDSD-IDIVAENGKKTHGGVSLDPVTIVP 78 (300) T ss_pred CcccccCCcceechhhHHHHHHHHHh-hhhhhhhcceeeccCCceEEEEEecCcc-eEEeeCCcccccccccceeeEeee Confidence 87644332000111111111111111 1233456666664433444555444444 36876543322 2223455667 Q ss_pred ecccceeecchhhh---hcCCcchhHHHHHHHHHHHHhcHHHHHHHHHhccC-------CCcccCccc----------cc Q lcl|NC_020198. 78 KLFESTVGVKRTDI---EDDNLGVYGPLMQEMGRAAGAHPDELVFALLKAGN-------ANLCYDGQN----------FF 137 (304) Q Consensus 78 k~fe~tv~v~R~~I---~dDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~-------~~~cyDGq~----------fF 137 (304) ++++..+.||++.+ .||..++.+-+...++++.++.+++.++.=..++. ....+++.. .+ T Consensus 79 ~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~l~~aia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (300) T protein:vir:95 79 LKVEYGARVSDEFLHASEEAKVDMLTDFVEGFSKKLARGLDIMSIHGINPRTKQASTIIGDNCFDKKVTQTVPFKDTNPD 158 (300) T ss_pred EEEEEeehhhHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCCCCCcccccccccccccceeecccccchH Confidence 88999999999998 57779999999999999999999988873321110 011111111 00 Q ss_pred c-------cccccccccccccchhhhhhhh---cccCCCCc-----------c------------------------cee Q lcl|NC_020198. 138 D-------TDHPVYPNVDGTGTATTVSNLF---APAADPGA-----------A------------------------WYL 172 (304) Q Consensus 138 ~-------tdH~v~~~~~~tg~~~s~snl~---~~~~~~g~-----------~------------------------w~L 172 (304) + .-|..+. ..++-..+.+.+. .-....|. + .++ T Consensus 159 ~~i~~~~~~~~~~~~--~~~~~vmn~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~~~~~ 236 (300) T protein:vir:95 159 ESMEDAVGMIDGSER--DITGAILDPIFTTALSKMKNAEGGKLYPELAWGGVPDAINGLAVDKNRTVSYSQTDPKNTAIV 236 (300) T ss_pred HHHHHHHHHhhhcCC--CccEEEECHHHHHHHHHhhccCCCeeccCccccCCCceecceeeEEecCCCCCCCCCccEEEE Confidence 0 0000000 0000001111100 00001111 1 111 Q ss_pred ccCCccchhhhhhhccccchhhcccCccc----ccccccceEEEeeccccccccchhhhhccccccchhHHHHHHHHHHH Q lcl|NC_020198. 173 LDTSRSLKPLIYQERMKPSFTSMTKEDDE----QVFMADEYRYGVRSRCNVGFGFWQLAAMSTEELNQVNFEKVYDAMRN 248 (304) Q Consensus 173 ~d~~~~~kP~i~Q~r~~~~~~~~~~~~~~----~vf~~~~~~~Gvd~R~n~G~g~wq~a~~s~~~l~~~~l~aar~aM~~ 248 (304) -|-++.+. +-.|...++...+..+.+ +.|.+|.+.|-+..|+..+. .+ . +|+-. T Consensus 237 GDf~~~~~---~~~~~~~~~~v~~~~~~d~~~~~~f~~~~v~~r~~~r~d~~v--~~----------~-------~a~~~ 294 (300) T protein:vir:95 237 GDFETMFK---WGYAKEVPMEIIKYGDPDNSGRDLKGYNQIYIRCEAYIGWGI--MD----------A-------ASFAR 294 (300) T ss_pred eeccceEE---EEEecccEEEEeeccCCCCcchhhhhcCcEEEEEEEeeccee--ec----------c-------cceEE Confidence 12222221 222333333322221111 23666666665555554321 11 1 23333 Q ss_pred hccCCC Q lcl|NC_020198. 249 QKADGG 254 (304) Q Consensus 249 ~k~~~G 254 (304) .|+..| T Consensus 295 l~~~~g 300 (300) T protein:vir:95 295 IVKTGG 300 (300) T ss_pred EecCCC Confidence 445455 No 16 >protein:vir:99920 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655524;genbank:gi:109392294;genbank:GeneID:4157089 Probab=89.24 E-value=0.01 Score=31.51 Aligned_cols=234 Identities=13% Similarity=0.035 Sum_probs=104.0 Q ss_pred CCCccHHHHHHHHHHHHHHHHHHHhhcchhhcceEEEecCCccccccccccCCccchhcccce---eecccccccceeee Q lcl|NC_020198. 1 MAIITPALISALKTSFQKHFQDALATAPSTYLQVATVIPSTTASNTYGWLGQFPKLREWIGQR---VIKDMAAQGYQITN 77 (304) Q Consensus 1 maii~~~~l~~l~~~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~lrEw~Ge~---~~~~l~~~~~~i~n 77 (304) ||-+++..--.+=.-+...+.+.+.+. +...++|+++|......+|.++..-|.. .|+||- ...+..=..-++.- T Consensus 1 Mat~tt~~g~~vP~~~~~~ii~~~~~~-s~l~~~~~~i~~~~~~~~~p~~~~~~~a-~wv~Eg~~~~~~~~~f~~v~l~~ 78 (311) T protein:vir:99 1 MATFGTGNLKNLPRNIADGMVKDVVQG-STVAVLSARKPQRFGNEDIITFNGRPKA-EFVGEGQQKSSTTGEFDFVTSTP 78 (311) T ss_pred CceecCCCceeccHHHHHHHHHHHHhh-chhhhhcceeeccCCceEEEEEeCCcee-EEeecCcccccccceeeEEEEee Confidence 887655432222222333333333322 3356778888866666678777666654 788653 33333334566778 Q ss_pred ecccceeecchhhh---hcCCcchhHHHHHHHHHHHHhcHHHHHHHHHhccCCCcccCcccccccccccccccccccchh Q lcl|NC_020198. 78 KLFESTVGVKRTDI---EDDNLGVYGPLMQEMGRAAGAHPDELVFALLKAGNANLCYDGQNFFDTDHPVYPNVDGTGTAT 154 (304) Q Consensus 78 k~fe~tv~v~R~~I---~dDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~cyDGq~fF~tdH~v~~~~~~tg~~~ 154 (304) ++++..+.|||+.+ +|+...+..-+...|+++.++..++.++.-...| .|..+--..+.. .... T Consensus 79 ~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~g~~------~g~~~~g~~~~~----~~~~--- 145 (311) T protein:vir:99 79 KKAQVTMRFNEEVQWADEDYQLGVLQTLSEAGAEALARALDLGLYHRINPL------TGTVIPGWSNYL----GAAS--- 145 (311) T ss_pred EEEEEeehhhHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHhhcccCcc------cCcccccccccc----cccc--- Confidence 89999999999998 4667889999999999999999988776322111 122221111100 0000 Q ss_pred hhhhhhcccCCCCccceeccCCccchhhhhhhccccchhhcccCcccccccccceEEEeeccccccccchhhhhcccccc Q lcl|NC_020198. 155 TVSNLFAPAADPGAAWYLLDTSRSLKPLIYQERMKPSFTSMTKEDDEQVFMADEYRYGVRSRCNVGFGFWQLAAMSTEEL 234 (304) Q Consensus 155 s~snl~~~~~~~g~~w~L~d~~~~~kP~i~Q~r~~~~~~~~~~~~~~~vf~~~~~~~Gvd~R~n~G~g~wq~a~~s~~~l 234 (304) +... .+.++......| +.-++...+. .+.+.+.+ | | -+ T Consensus 146 ---~~~~-~~~~~~~~~~~~----i~~~~~~~~~----------------------~~~~~~~~-~---~--------vm 183 (311) T protein:vir:99 146 ---KRVE-LTADTIANPDLA----IEAAVGLLVA----------------------NGHPTPVN-G---L--------AL 183 (311) T ss_pred ---ceee-ccccccchhHHH----HHHHHHHHhh----------------------hccCCCcc-E---E--------EE Confidence 0000 000000000011 1111100000 00000000 0 1 11 Q ss_pred chhHHHHHHHHHHHhccCCCceece------ecCeEE-ecchHHHHHHHHHhhhccCC-------------CC-cceecc Q lcl|NC_020198. 235 NQVNFEKVYDAMRNQKADGGRPLDI------RPNLLV-VPTTLRSKAKEVVGVQRLAN-------------GA-DNPNFE 293 (304) Q Consensus 235 ~~~~l~aar~aM~~~k~~~G~~L~i------~P~~Lv-Vpp~le~~A~~ll~~~~~~~-------------g~-~N~~~g 293 (304) +. ....++++.|+..|+||-. .|..|. .| +.-++.+++ +. .-.+.| T Consensus 184 n~----~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~P---------v~~s~~i~~~~~~~~~~~~~~~~~~~~~~~G 250 (311) T protein:vir:99 184 HP----SIAWGLSTARYTDGRKKFPELGLGIGVSSFEGID---------ASVSDTVNGGDEADPDDEDLDAARAVRGIVG 250 (311) T ss_pred cH----HHHHHHHhhhccCCCeeecCcccCCCCceeccee---------eEeecccccccccccccchhhccCcceEEEe Confidence 21 2346678889999998721 121110 11 011111111 00 111112 Q ss_pred e----eeEEec--cccC Q lcl|NC_020198. 294 L----VQVLDT--AWLN 304 (304) Q Consensus 294 ~----~~~iv~--p~Ld 304 (304) . +.+-+. ..|. T Consensus 251 df~~~~~~~~~~~~~~~ 267 (311) T protein:vir:99 251 DFANGIHWGVQRDIPVE 267 (311) T ss_pred eccccEEEEEecCceEE Confidence 1 111111 1122 No 17 >protein:vir:2504 Length: 305 # NCBI annotation: major capsid subunit gp9 # Family: family:all:507 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569745;genbank:gi:18496895;genbank:GeneID:932268 Probab=88.05 E-value=0.011 Score=31.29 Aligned_cols=230 Identities=15% Similarity=0.199 Sum_probs=97.8 Q ss_pred CCCccHHHHHHHH-HHHHHHHHHHHhhcchhhcceEEEecCCccccccccccCCccchhcccceee--------cccccc Q lcl|NC_020198. 1 MAIITPALISALK-TSFQKHFQDALATAPSTYLQVATVIPSTTASNTYGWLGQFPKLREWIGQRVI--------KDMAAQ 71 (304) Q Consensus 1 maii~~~~l~~l~-~~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~lrEw~Ge~~~--------~~l~~~ 71 (304) ||.++...-..|. .-+...+.+..... +...++++.++......+|..+..-|.. .|+||-.- .+.+=. T Consensus 1 ma~~t~~~gg~liP~~~~~~Ii~~~~~~-s~l~~l~~~~~~~~~~~~~p~~~~~~~a-~wv~E~~~~~~~~~~~s~~~f~ 78 (305) T protein:vir:25 1 MADISRAEVASLIQEAYSDTLLAAAKQG-STVLSAFQNVNMGTKTTHLPVLATLPEA-DWVGESATDPKGVKPTSKVTWA 78 (305) T ss_pred CCCccCCccceecCHHHHHHHHHHHHhh-chhhhhcceeeccCCcEEEEEEeCCcce-EEeeccccccccccccccccee Confidence 9988876544432 22222322222222 2355677888866555566555555543 68766421 122223 Q ss_pred cceeeeecccceeecchhhhhcCCcchhHHHHHHHHHHHHhcHHHHHHHHHhccCCCcccCcccccccccccc-----cc Q lcl|NC_020198. 72 GYQITNKLFESTVGVKRTDIEDDNLGVYGPLMQEMGRAAGAHPDELVFALLKAGNANLCYDGQNFFDTDHPVY-----PN 146 (304) Q Consensus 72 ~~~i~nk~fe~tv~v~R~~I~dDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~cyDGq~fF~tdH~v~-----~~ 146 (304) .-+++-++++..+.|+++.+.|-...+.+-+.+.++++.++..++.++. |.+ .++.++....-.. .. T Consensus 79 ~i~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~~~a~~~d~a~~~----G~g----~~~~~~~~~~~~~~~~~~~~ 150 (305) T protein:vir:25 79 NRTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIF----GTD----KPASWVSPALIPAAVTAGQA 150 (305) T ss_pred eEEeeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHhhhhee----ccC----CCCCcccccccccccccccc Confidence 4467778999999999999998889999999999999999999988772 211 1111111110000 00 Q ss_pred cccccch----------------------------hhh---hhhhcccCCCCccceeccCCccc-hhhhh---------- Q lcl|NC_020198. 147 VDGTGTA----------------------------TTV---SNLFAPAADPGAAWYLLDTSRSL-KPLIY---------- 184 (304) Q Consensus 147 ~~~tg~~----------------------------~s~---snl~~~~~~~g~~w~L~d~~~~~-kP~i~---------- 184 (304) ...++.. .+. ..+..-...+| .+|+-..... .|+++ T Consensus 151 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G--~~i~~~~~l~G~Pv~~~~~~~~~~~~ 228 (305) T protein:vir:25 151 VEVVGGVANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANG--NPVFRDDSFAGFRTFFNRNGAWDADA 228 (305) T ss_pred ccccccchhhhHHHHHHHHHHHhhhhcccccceeEecHHHHHHHHHhhccCC--ceeecCCcccccceEEcCccCCCCCc Confidence 0000000 000 00000000111 1121100000 12211 Q ss_pred -------------hhccccchhhccc-----Cc-ccccccccceEEEeeccccccccchhhhhccccccchhHHHHHHHH Q lcl|NC_020198. 185 -------------QERMKPSFTSMTK-----ED-DEQVFMADEYRYGVRSRCNVGFGFWQLAAMSTEELNQVNFEKVYDA 245 (304) Q Consensus 185 -------------Q~r~~~~~~~~~~-----~~-~~~vf~~~~~~~Gvd~R~n~G~g~wq~a~~s~~~l~~~~l~aar~a 245 (304) -.|...++...+. .. .-..|.++.+.+=+..|+..+..=..... T Consensus 229 ~~~~~gd~s~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~R~~~r~~~~v~~p~a~v----------------- 291 (305) T protein:vir:25 229 AIEVIADSSRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQ----------------- 291 (305) T ss_pred cEEEEEecceEEEEEecCeEEEEeeeeeeecCCceeeeeecCcEEEEEEEeecceeeCcccEE----------------- Confidence 1111111111100 00 00123333333333333322211111111 Q ss_pred HHHhccCCCceeceecCeEEecch Q lcl|NC_020198. 246 MRNQKADGGRPLDIRPNLLVVPTT 269 (304) Q Consensus 246 M~~~k~~~G~~L~i~P~~LvVpp~ 269 (304) .+...|--.|-|.+ T Consensus 292 ----------~~~~~~~~~~~pa~ 305 (305) T protein:vir:25 292 ----------GANKTPVAVVAPAA 305 (305) T ss_pred ----------EEccccccccCCCC Confidence 12222222233333 No 18 >protein:vir:7771 Length: 330 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817605;genbank:gi:29566035;genbank:GeneID:1259229 Probab=88.02 E-value=0.016 Score=30.48 Aligned_cols=236 Identities=10% Similarity=0.035 Sum_probs=93.1 Q ss_pred CCCccHHHHHHHH---------HHHHHHHHHHHhhcchhhcceEEEecCCccccccccccCCccchhcccce---eeccc Q lcl|NC_020198. 1 MAIITPALISALK---------TSFQKHFQDALATAPSTYLQVATVIPSTTASNTYGWLGQFPKLREWIGQR---VIKDM 68 (304) Q Consensus 1 maii~~~~l~~l~---------~~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~lrEw~Ge~---~~~~l 68 (304) |+=-......+.. .-+...+-+.+... +...++++.++-....-.|..+..-|. -.|++|- .-.++ T Consensus 1 m~~~~~~a~~~~~t~~~g~~i~~~~~~~ii~~~~~~-s~l~~~~~~~~~~~~~~~~p~~~~~~~-a~~v~Eg~~~~~~~~ 78 (330) T protein:vir:77 1 MAGSTVPSTQVALTGDFSAFLTPEQSQDYFAEIEKT-SIVQRIARKVPMGPTGISIPHWTGAVS-ASWTGEAERKPITKG 78 (330) T ss_pred CcccccchhhccccCCCcceechhHHHHHHHHHHhc-cchhhhcceeeccCCceEEEEEcCCcc-eeEecCCCccccccc Confidence 3311111111100 01111222222222 234566777775444455666655555 3577543 33333 Q ss_pred ccccceeeeecccceeecchhhhhcCCcchhHHHHHHHHHHHHhcHHHHHHH----------HHhcc--CC----Ccc-- Q lcl|NC_020198. 69 AAQGYQITNKLFESTVGVKRTDIEDDNLGVYGPLMQEMGRAAGAHPDELVFA----------LLKAG--NA----NLC-- 130 (304) Q Consensus 69 ~~~~~~i~nk~fe~tv~v~R~~I~dDdlG~~~~~~~~~G~aAa~~~~~lv~~----------lL~~g--~~----~~c-- 130 (304) .=..-++..++++..+.|+|+.++|-...+..-+.+.++++.++..+..++. +++.. .+ ... T Consensus 79 ~f~~i~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~ai~~~~~~~~l~G~g~~~~~~g~~~~~~~~~~~~~~~~~~ 158 (330) T protein:vir:77 79 SFGKQELEPVKITTIFAESAEVVRLNPLNYLNTMRTKIAEAIALKFDAAAIHGIDKPSAFKGYLAETTKVVSLADTNLTT 158 (330) T ss_pred eeeEEEEeEEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCCccccccccccccceeecccccc Confidence 3345678889999999999999999889999999999999999999887661 11100 00 000 Q ss_pred --------c------------------------------------Ccccccccccccccccccc-----cchhh-hhhhh Q lcl|NC_020198. 131 --------Y------------------------------------DGQNFFDTDHPVYPNVDGT-----GTATT-VSNLF 160 (304) Q Consensus 131 --------y------------------------------------DGq~fF~tdH~v~~~~~~t-----g~~~s-~snl~ 160 (304) | +|+++|......+.-.... |-.+- .+++. T Consensus 159 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~l~G~PV~~~~~~p 238 (330) T protein:vir:77 159 ASGPQGNAYLAVNNALSLLVNSGKKWTGTLLDNVTEPILNTAVDGNGRPLFVESTYTEQVGAIREGRILGRPTYVADNVV 238 (330) T ss_pred cccccchhHHHHHHHHHhhhhcCCCccEEEEcHHHHHHHHHHhccCCceeecCccccccccccCCceecceeeEEecccc Confidence 0 0111111100000000000 00000 01111 Q ss_pred cccCCCCccceeccCCccchhhhhhhccccchhhccc--------------CcccccccccceEEEeeccccccccchhh Q lcl|NC_020198. 161 APAADPGAAWYLLDTSRSLKPLIYQERMKPSFTSMTK--------------EDDEQVFMADEYRYGVRSRCNVGFGFWQL 226 (304) Q Consensus 161 ~~~~~~g~~w~L~d~~~~~kP~i~Q~r~~~~~~~~~~--------------~~~~~vf~~~~~~~Gvd~R~n~G~g~wq~ 226 (304) ++.+.+...-++.|-++.+ +..|+..++...++ ...-+.|.+|...|=+..|+....--.++ T Consensus 239 ~~~~~~~~~~~~gd~s~~~----i~~~~~~~i~~~~e~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a 314 (330) T protein:vir:77 239 NGTVGNRVVGVMGDFSQVI----WGQIGGLSFDVTDQATLDFGEEQGGVWVPKLISLWQHNMVAVRCEAEFAFMVNDKDA 314 (330) T ss_pred CCCCCCccEEEEEecceEE----EEEecCcEEEEeecceeeecccccccccccccchhhcCcEEEEEEEEeccEEecccc Confidence 1111111112222222221 12222322221111 01123455555554444444332211111 Q ss_pred hhccccccchhHHHHHHHHHHHhccCCCceeceecCeEEecchHH Q lcl|NC_020198. 227 AAMSTEELNQVNFEKVYDAMRNQKADGGRPLDIRPNLLVVPTTLR 271 (304) Q Consensus 227 a~~s~~~l~~~~l~aar~aM~~~k~~~G~~L~i~P~~LvVpp~le 271 (304) -.. |....=--||+-| T Consensus 315 ~~~-----------------------------i~~~~~~~~~~~~ 330 (330) T protein:vir:77 315 FVK-----------------------------LTDQVAGTDPEEE 330 (330) T ss_pred eEE-----------------------------EEeccCCcCCCCC Confidence 000 0000000022222 No 19 >protein:vir:96833 Length: 275 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240157;genbank:gi:66395822;genbank:GeneID:5133174 Probab=86.76 E-value=0.043 Score=28.06 Aligned_cols=194 Identities=12% Similarity=0.085 Sum_probs=106.2 Q ss_pred CCCccHHHHHHHH--HHHHHHHHHHHhhcchhhcceEEEecCC----ccccccccccCCccchhcc--cceeeccccccc Q lcl|NC_020198. 1 MAIITPALISALK--TSFQKHFQDALATAPSTYLQVATVIPST----TASNTYGWLGQFPKLREWI--GQRVIKDMAAQG 72 (304) Q Consensus 1 maii~~~~l~~l~--~~~~~~f~~a~~~a~~~~~~~a~~v~S~----~~~~~y~~Lg~~P~lrEw~--Ge~~~~~l~~~~ 72 (304) ||-.|...|.++. .-|....++.+.+ ..-+..+|..-+.- ...-+.......+...++. .+.....+.... T Consensus 1 ~~~~~~T~l~d~i~PEv~~~~v~~~~~~-~~~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~~~lt~~~ 79 (275) T protein:vir:96 1 MALENMTKLANMVNPEVLAPMMQAELDK-KLKFAQFADIDNTLVGQPGNTITFPAFVYSGDAKVVPEGEEIPIDLIETKK 79 (275) T ss_pred CCCcccchhhhhhchHHHHHHHHHHHHH-hhhhcccceecccccCCCCCEEEeeeeccCCccccccCCCCcchhhcccce Confidence 6665544455543 1233333333322 12233444332210 1111111111111222222 456677777777 Q ss_pred ceeeeecccceeecchhhhhcCCcchhHHHHHHHHHHHHhcHHHHHHHHHhccCCCcccCcccccccccccccccccccc Q lcl|NC_020198. 73 YQITNKLFESTVGVKRTDIEDDNLGVYGPLMQEMGRAAGAHPDELVFALLKAGNANLCYDGQNFFDTDHPVYPNVDGTGT 152 (304) Q Consensus 73 ~~i~nk~fe~tv~v~R~~I~dDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~cyDGq~fF~tdH~v~~~~~~tg~ 152 (304) -+.+.+.+++.++|+..+..----.......+++|.+-++..|.-++..|+.+. +. T Consensus 80 ~~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~ll~~l~~a~--------------~~---------- 135 (275) T protein:vir:96 80 RQATIRKIGKGTVLTDEALLSGYGDPKGEAVRQHGLAIANKVDNDVLEALQGAT--------------LK---------- 135 (275) T ss_pred eeEEeehhcccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhccc--------------cc---------- Confidence 788889999999999988875444567778888999888888888877665310 00 Q ss_pred hhhhhhhhcccCCCCccceeccCCccchhhhhhhccccchhhcccCcccccccccceEEEeeccccccccchhhhhcccc Q lcl|NC_020198. 153 ATTVSNLFAPAADPGAAWYLLDTSRSLKPLIYQERMKPSFTSMTKEDDEQVFMADEYRYGVRSRCNVGFGFWQLAAMSTE 232 (304) Q Consensus 153 ~~s~snl~~~~~~~g~~w~L~d~~~~~kP~i~Q~r~~~~~~~~~~~~~~~vf~~~~~~~Gvd~R~n~G~g~wq~a~~s~~ 232 (304) .+.. T Consensus 136 ----------------------------------------------------------------------------~~~~ 139 (275) T protein:vir:96 136 ----------------------------------------------------------------------------VEAD 139 (275) T ss_pred ----------------------------------------------------------------------------cccc Confidence 0011 Q ss_pred ccchhHHHHHHHHHHHhccCCCceeceecCeEEecchHHHHHHHHHhhhcc--CCCCcc--------eecceeeEEeccc Q lcl|NC_020198. 233 ELNQVNFEKVYDAMRNQKADGGRPLDIRPNLLVVPTTLRSKAKEVVGVQRL--ANGADN--------PNFELVQVLDTAW 302 (304) Q Consensus 233 ~l~~~~l~aar~aM~~~k~~~G~~L~i~P~~LvVpp~le~~A~~ll~~~~~--~~g~~N--------~~~g~~~~iv~p~ 302 (304) +++.+.+-.|++.|... + ..++.|+|+|.....-++....+.+ .....| -+.| ++||++.. T Consensus 140 ~~~~d~i~dA~~~lgd~----~----~~~~~ivv~p~~~~~L~k~~~~~f~~~~~~g~~~~~~G~ig~~~G-~~Vi~s~~ 210 (275) T protein:vir:96 140 ITKLAGLQTAIDKFNDE----D----LEPMVLFVNPLDAGKLRASATDNFTRATLLGDNVIVKGAFGEALG-AIIVRSNK 210 (275) T ss_pred ccCHHHHHHHHHHhccc----c----CCccEEEeCHHHHHHHHhcccccccccccccccceeccccceecC-eeEEEeCC Confidence 23455566666666422 1 2467999999987766554322222 111122 2444 69999999 Q ss_pred cC Q lcl|NC_020198. 303 LN 304 (304) Q Consensus 303 Ld 304 (304) +. T Consensus 211 ~p 212 (275) T protein:vir:96 211 IK 212 (275) T ss_pred CC Confidence 98 No 20 >protein:vir:94771 Length: 298 # NCBI annotation: major head protein # Family: family:all:966 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996706;genbank:gi:45597421;genbank:GeneID:2769044 Probab=86.54 E-value=0.041 Score=28.20 Aligned_cols=225 Identities=9% Similarity=0.006 Sum_probs=94.3 Q ss_pred CCCccHHHHHHHHHHHHHHHHHHHhhcchhhcceEEEecCCccccccccccCCccchhccccee---ecccccccceeee Q lcl|NC_020198. 1 MAIITPALISALKTSFQKHFQDALATAPSTYLQVATVIPSTTASNTYGWLGQFPKLREWIGQRV---IKDMAAQGYQITN 77 (304) Q Consensus 1 maii~~~~l~~l~~~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~lrEw~Ge~~---~~~l~~~~~~i~n 77 (304) |+.-+...+-. -+...+.+.+... +...++|+.++-.....+|..+..-|.- .|++|-. ..++.=..-+++- T Consensus 1 ma~~gG~lip~---~~~~~ii~~~~~~-s~i~~~~~~~~~~~~~~~~p~~~~~~~a-~~v~Eg~~~~~~~~~f~~v~l~~ 75 (298) T protein:vir:94 1 MVLNKGTLFDP---ELVTDLISKVAGK-SSIARLSAQKPIPFNGEKVFTFTMDSEI-DVVAESGKKTHGGVTLAPQTMVP 75 (298) T ss_pred CeeccccccCh---hHHHHHHHHHHhh-chhhhhcceeeccCCceEEEEEecCcce-EEeeCCccccccccceeEEEEee Confidence 88743322211 1222222223222 2355667777655455567666655653 6886543 2233334556677 Q ss_pred ecccceeecchhhhh---cCCcchhHHHHHHHHHHHHhcHHHHHHHHHhccCCCcccCcccccccccccc---------- Q lcl|NC_020198. 78 KLFESTVGVKRTDIE---DDNLGVYGPLMQEMGRAAGAHPDELVFALLKAGNANLCYDGQNFFDTDHPVY---------- 144 (304) Q Consensus 78 k~fe~tv~v~R~~I~---dDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~cyDGq~fF~tdH~v~---------- 144 (304) ++++..+.|||+.+. +|..++..-+...|+++.++..+..++.-...|-. .--.+...-...+... T Consensus 76 ~k~~~~~~iS~ell~~~~~~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g-~~~~~~~~~~~~~~~~~~~~~~~~~~ 154 (298) T protein:vir:94 76 IKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLG-TASAVIGTNHFDSKVTQKVEAPRGIA 154 (298) T ss_pred eEEEEeeehhHHHhccCCccHHHHHHHHHHHHHHHHHHHHHHHhhcccccCCC-cccccccccccccccccccccccccc Confidence 899999999999984 66688889999999999999988877632110000 0000000000000000 Q ss_pred ---------------cccccccchhhhhhh---hcccCCCCc-----------ccee----------------------- Q lcl|NC_020198. 145 ---------------PNVDGTGTATTVSNL---FAPAADPGA-----------AWYL----------------------- 172 (304) Q Consensus 145 ---------------~~~~~tg~~~s~snl---~~~~~~~g~-----------~w~L----------------------- 172 (304) .+...++-.++.+.. ..-...+|. +..| T Consensus 155 ~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~tl~G~PV~~~~~v~~~~~~~~~~~~ 234 (298) T protein:vir:94 155 DPNGAIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQRDRAI 234 (298) T ss_pred cHHHHHHHHHHhhhhcCCCccEEEEcHHHHHHHHHhhccCCCeeecCcccCCCCceecceeeEEecccccccCCCccEEE Confidence 000000000000000 000000111 1111 Q ss_pred -ccCCccchhhhhhhccccchhhcccCccc----ccccccceEEEeeccccccccchhhhhccccccchhH Q lcl|NC_020198. 173 -LDTSRSLKPLIYQERMKPSFTSMTKEDDE----QVFMADEYRYGVRSRCNVGFGFWQLAAMSTEELNQVN 238 (304) Q Consensus 173 -~d~~~~~kP~i~Q~r~~~~~~~~~~~~~~----~vf~~~~~~~Gvd~R~n~G~g~wq~a~~s~~~l~~~~ 238 (304) -|-++. +.+-.|...++...+..+.+ +.|.+|...|=+..|+....--.+.- ..|...+ T Consensus 235 ~Gdfs~~---~~~~~~~~~~~~~~~~~~~d~~~~~~f~~~~v~~r~~~r~~~~~~~~~a~----~~l~~~t 298 (298) T protein:vir:94 235 IGDFANG---FKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKF----ARVTEAN 298 (298) T ss_pred Eeeccce---EEEEEecCceEEEeecCCCcCcchhhhhcCcEEEEEEEEeccEeecccce----EEEEecC Confidence 111111 11223333333322211111 23556655555555554321111110 0111111 No 21 >protein:vir:96223 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239571;genbank:gi:66395304;genbank:GeneID:5132771 Probab=86.47 E-value=0.045 Score=27.95 Aligned_cols=225 Identities=11% Similarity=0.068 Sum_probs=96.7 Q ss_pred CCCccHHHHHHHHHHHHHHHHHHHhhcchhhcceEEEecCCccccccccccCCccchhcccce---eecccccccceeee Q lcl|NC_020198. 1 MAIITPALISALKTSFQKHFQDALATAPSTYLQVATVIPSTTASNTYGWLGQFPKLREWIGQR---VIKDMAAQGYQITN 77 (304) Q Consensus 1 maii~~~~l~~l~~~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~lrEw~Ge~---~~~~l~~~~~~i~n 77 (304) |+--+...+ |-..+...+.+...+ .+...++++++|-.+...+|..+..-|.. .|+||- ...++.=..-++.- T Consensus 30 ~~~~~~~~l--ip~~~~~~ii~~~~~-~s~l~~l~~~~~~~~~~~~~p~~~~~~~a-~~v~Eg~~~~~~~~~f~~v~~~~ 105 (324) T protein:vir:96 30 MMHEKKDGT--LLNDFTTPILQEVME-NSKIMQLGKYEPMEGTEKKFTFWADKPGA-YWVGEGQKIETSKATWVNATMRA 105 (324) T ss_pred cccCCCcce--echhHHHHHHHHHHh-hchhhhhcceeeccCCceEEEEEecCcce-eeecCCccccccccceeEEEEEe Confidence 111000000 001111222221211 12245567777765555567666555554 787654 33334445567788 Q ss_pred ecccceeecchhhhhcCCcchhHHHHHHHHHHHHhcHHHHHHHHHhccCCCcccCcccccccccccccccccccchhhhh Q lcl|NC_020198. 78 KLFESTVGVKRTDIEDDNLGVYGPLMQEMGRAAGAHPDELVFALLKAGNANLCYDGQNFFDTDHPVYPNVDGTGTATTVS 157 (304) Q Consensus 78 k~fe~tv~v~R~~I~dDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~cyDGq~fF~tdH~v~~~~~~tg~~~s~s 157 (304) ++++..+.|+|+.+.|.+..+..-+.+.++++.++..++.++. |.. .-..+..++++-+. . . T Consensus 106 ~k~~~~~~is~ell~ds~~~l~~~i~~~l~~aia~~~d~~~l~----G~g-~~~~~~~~~~~~~~----------~---~ 167 (324) T protein:vir:96 106 FKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGIL----NQG-NNPFGKSIAQSIKK----------T---N 167 (324) T ss_pred EEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhh----cCC-CCCcCccccccccc----------c---c Confidence 9999999999999999889999999999999999998886652 211 11111111111110 0 0 Q ss_pred hhhcccCCCCccceeccCCccchhhhhhhccccchhhcccCcccccccccceEEEeeccccccccchhhhhccccccchh Q lcl|NC_020198. 158 NLFAPAADPGAAWYLLDTSRSLKPLIYQERMKPSFTSMTKEDDEQVFMADEYRYGVRSRCNVGFGFWQLAAMSTEELNQV 237 (304) Q Consensus 158 nl~~~~~~~g~~w~L~d~~~~~kP~i~Q~r~~~~~~~~~~~~~~~vf~~~~~~~Gvd~R~n~G~g~wq~a~~s~~~l~~~ 237 (304) .. ... ...|- .|.-++.+. .+ .-+..+. | -++. T Consensus 168 ~~-~~~---~~~~~------~i~~~~~~i----------~~---~~~~~~~---------------~--------i~n~- 200 (324) T protein:vir:96 168 KV-IKG---DFTQD------NIIDLEALL----------ED---DELEANA---------------F--------ISKT- 200 (324) T ss_pred ee-ccc---ccchH------HHHHHHHhh----------hh---ccCCCCE---------------E--------EEcH- Confidence 00 000 00010 011111110 00 0000000 1 0111 Q ss_pred HHHHHHHHHHHhccCCCceece--ecC-eEEecchHHHHHHHHHhhhccCC-----CCc-ceec---ceeeEEec--ccc Q lcl|NC_020198. 238 NFEKVYDAMRNQKADGGRPLDI--RPN-LLVVPTTLRSKAKEVVGVQRLAN-----GAD-NPNF---ELVQVLDT--AWL 303 (304) Q Consensus 238 ~l~aar~aM~~~k~~~G~~L~i--~P~-~LvVpp~le~~A~~ll~~~~~~~-----g~~-N~~~---g~~~~iv~--p~L 303 (304) ..+.++++.|+..|+++-. .|. ++=+|.-. ..+..... |+. +-+. +-+++-++ ..+ T Consensus 201 ---~~~~~L~~lkd~~G~~~~~~~~~~~l~G~PV~~-------~~~~~~~~~~~~~gd~s~~~~~~~~~~~i~~~~~~~~ 270 (324) T protein:vir:96 201 ---QNRSLLRKIVDPETKERIYDRNSDSLDGLPVVN-------LKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQL 270 (324) T ss_pred ---HHHHHHHHhhCCCCCeeecCCCCCcccceeeEe-------ecCCCCCcceEEEEecceEEEEEecCcEEEEeecccc Confidence 2345677889999998732 222 22122100 00000111 111 1111 11122111 111 Q ss_pred C Q lcl|NC_020198. 304 N 304 (304) Q Consensus 304 d 304 (304) . T Consensus 271 ~ 271 (324) T protein:vir:96 271 S 271 (324) T ss_pred c Confidence 1 No 22 >protein:vir:95898 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240385;genbank:gi:66396054;genbank:GeneID:5133409 Probab=83.90 E-value=0.065 Score=27.10 Aligned_cols=186 Identities=13% Similarity=0.169 Sum_probs=103.1 Q ss_pred CCC--------ccHHHHHHHHHHHHHHHHHHHhhcchhhcceEEEec-------CCccccccccccCCccchhcccceee Q lcl|NC_020198. 1 MAI--------ITPALISALKTSFQKHFQDALATAPSTYLQVATVIP-------STTASNTYGWLGQFPKLREWIGQRVI 65 (304) Q Consensus 1 mai--------i~~~~l~~l~~~~~~~f~~a~~~a~~~~~~~a~~v~-------S~~~~~~y~~Lg~~P~lrEw~Ge~~~ 65 (304) ||- |.|+++.... ...+.+.+. +..+|..-. .+-....|.-+|+.-.+.| -.+... T Consensus 1 m~~~~T~l~d~i~Pev~~~~v---~~~~~~~l~-----~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~-g~~i~~ 71 (274) T protein:vir:95 1 MAQGMTKLTNQIVPEVLAPMM---QAELEKKLR-----FASFAEIDNTLVGQPGDTLTFPAFIYSGDAKVVAE-GEKIPT 71 (274) T ss_pred CCcceeehhheechHHHHHHH---HHHHHhhhh-----ccccceecccccCCCCCEEEeeeecCCCccccccC-CCccch Confidence 664 5566655543 222333222 122222111 1111122222444332222 145667 Q ss_pred cccccccceeeeecccceeecchhhhhcCCcchhHHHHHHHHHHHHhcHHHHHHHHHhccCCCcccCccccccccccccc Q lcl|NC_020198. 66 KDMAAQGYQITNKLFESTVGVKRTDIEDDNLGVYGPLMQEMGRAAGAHPDELVFALLKAGNANLCYDGQNFFDTDHPVYP 145 (304) Q Consensus 66 ~~l~~~~~~i~nk~fe~tv~v~R~~I~dDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~cyDGq~fF~tdH~v~~ 145 (304) .++....-+.+.+.+++.+.|+..+..-.--.....+.+++|.+-++.-|..+++.|+.+. ..+ T Consensus 72 ~~lt~~~~~~~i~~~~~a~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~--------------~~~-- 135 (274) T protein:vir:95 72 DILETKKREAKIRKIAKGTSISDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAK--------------LTV-- 135 (274) T ss_pred hhcccceeEEEeeeeecceeehHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccc--------------ccc-- Confidence 7787777788888889999999887776555567778888888888888887777765421 000 Q ss_pred ccccccchhhhhhhhcccCCCCccceeccCCccchhhhhhhccccchhhcccCcccccccccceEEEeeccccccccchh Q lcl|NC_020198. 146 NVDGTGTATTVSNLFAPAADPGAAWYLLDTSRSLKPLIYQERMKPSFTSMTKEDDEQVFMADEYRYGVRSRCNVGFGFWQ 225 (304) Q Consensus 146 ~~~~tg~~~s~snl~~~~~~~g~~w~L~d~~~~~kP~i~Q~r~~~~~~~~~~~~~~~vf~~~~~~~Gvd~R~n~G~g~wq 225 (304) T Consensus 136 -------------------------------------------------------------------------------- 135 (274) T protein:vir:95 136 -------------------------------------------------------------------------------- 135 (274) T ss_pred -------------------------------------------------------------------------------- Confidence Q ss_pred hhhccccccchhHHHHHHHHHHHhccCCCceeceecCeEEecchHHHHHHHHHhhhccC--CCCcce--------eccee Q lcl|NC_020198. 226 LAAMSTEELNQVNFEKVYDAMRNQKADGGRPLDIRPNLLVVPTTLRSKAKEVVGVQRLA--NGADNP--------NFELV 295 (304) Q Consensus 226 ~a~~s~~~l~~~~l~aar~aM~~~k~~~G~~L~i~P~~LvVpp~le~~A~~ll~~~~~~--~g~~N~--------~~g~~ 295 (304) +..+++.+.+-+|++.|.... ..++.|+|+|.....-++-..-+.+. .+..|+ +.| + T Consensus 136 ----~~~~~~~d~i~~A~~~lgd~~--------~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G-~ 202 (274) T protein:vir:95 136 ----EADITKLTGLQTAIDKFNDED--------LEPMVLFISPLDAGKLRGDATTNFTRATELGDDVIVKGAFGEALG-A 202 (274) T ss_pred ----cccccCHHHHHHHHHHhcccc--------ccccEEEeCHHHHHHHHhhccccccccccccccceeccccceecC-e Confidence 011234445556666554321 25679999998876655532112221 122232 444 7 Q ss_pred eEEeccccC Q lcl|NC_020198. 296 QVLDTAWLN 304 (304) Q Consensus 296 ~~iv~p~Ld 304 (304) +|++++.++ T Consensus 203 ~Vi~s~~~~ 211 (274) T protein:vir:95 203 VIVRSNKLE 211 (274) T ss_pred EEEEeCCCC Confidence 899999998 No 23 >protein:vir:96262 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240311;genbank:gi:66395978;genbank:GeneID:5133339 Probab=83.90 E-value=0.065 Score=27.10 Aligned_cols=186 Identities=13% Similarity=0.169 Sum_probs=103.1 Q ss_pred CCC--------ccHHHHHHHHHHHHHHHHHHHhhcchhhcceEEEec-------CCccccccccccCCccchhcccceee Q lcl|NC_020198. 1 MAI--------ITPALISALKTSFQKHFQDALATAPSTYLQVATVIP-------STTASNTYGWLGQFPKLREWIGQRVI 65 (304) Q Consensus 1 mai--------i~~~~l~~l~~~~~~~f~~a~~~a~~~~~~~a~~v~-------S~~~~~~y~~Lg~~P~lrEw~Ge~~~ 65 (304) ||- |.|+++.... ...+.+.+. +..+|..-. .+-....|.-+|+.-.+.| -.+... T Consensus 1 m~~~~T~l~d~i~Pev~~~~v---~~~~~~~l~-----~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~-g~~i~~ 71 (274) T protein:vir:96 1 MAQGMTKLTNQIVPEVLAPMM---QAELEKKLR-----FASFAEIDNTLVGQPGDTLTFPAFIYSGDAKVVAE-GEKIPT 71 (274) T ss_pred CCcceeehhheechHHHHHHH---HHHHHhhhh-----ccccceecccccCCCCCEEEeeeecCCCccccccC-CCccch Confidence 664 5566655543 222333222 122222111 1111122222444332222 145667 Q ss_pred cccccccceeeeecccceeecchhhhhcCCcchhHHHHHHHHHHHHhcHHHHHHHHHhccCCCcccCccccccccccccc Q lcl|NC_020198. 66 KDMAAQGYQITNKLFESTVGVKRTDIEDDNLGVYGPLMQEMGRAAGAHPDELVFALLKAGNANLCYDGQNFFDTDHPVYP 145 (304) Q Consensus 66 ~~l~~~~~~i~nk~fe~tv~v~R~~I~dDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~cyDGq~fF~tdH~v~~ 145 (304) .++....-+.+.+.+++.+.|+..+..-.--.....+.+++|.+-++.-|..+++.|+.+. ..+ T Consensus 72 ~~lt~~~~~~~i~~~~~a~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~--------------~~~-- 135 (274) T protein:vir:96 72 DILETKKREAKIRKIAKGTSISDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAK--------------LTV-- 135 (274) T ss_pred hhcccceeEEEeeeeecceeehHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccc--------------ccc-- Confidence 7787777788888889999999887776555567778888888888888887777765421 000 Q ss_pred ccccccchhhhhhhhcccCCCCccceeccCCccchhhhhhhccccchhhcccCcccccccccceEEEeeccccccccchh Q lcl|NC_020198. 146 NVDGTGTATTVSNLFAPAADPGAAWYLLDTSRSLKPLIYQERMKPSFTSMTKEDDEQVFMADEYRYGVRSRCNVGFGFWQ 225 (304) Q Consensus 146 ~~~~tg~~~s~snl~~~~~~~g~~w~L~d~~~~~kP~i~Q~r~~~~~~~~~~~~~~~vf~~~~~~~Gvd~R~n~G~g~wq 225 (304) T Consensus 136 -------------------------------------------------------------------------------- 135 (274) T protein:vir:96 136 -------------------------------------------------------------------------------- 135 (274) T ss_pred -------------------------------------------------------------------------------- Confidence Q ss_pred hhhccccccchhHHHHHHHHHHHhccCCCceeceecCeEEecchHHHHHHHHHhhhccC--CCCcce--------eccee Q lcl|NC_020198. 226 LAAMSTEELNQVNFEKVYDAMRNQKADGGRPLDIRPNLLVVPTTLRSKAKEVVGVQRLA--NGADNP--------NFELV 295 (304) Q Consensus 226 ~a~~s~~~l~~~~l~aar~aM~~~k~~~G~~L~i~P~~LvVpp~le~~A~~ll~~~~~~--~g~~N~--------~~g~~ 295 (304) +..+++.+.+-+|++.|.... ..++.|+|+|.....-++-..-+.+. .+..|+ +.| + T Consensus 136 ----~~~~~~~d~i~~A~~~lgd~~--------~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G-~ 202 (274) T protein:vir:96 136 ----EADITKLTGLQTAIDKFNDED--------LEPMVLFISPLDAGKLRGDATTNFTRATELGDDVIVKGAFGEALG-A 202 (274) T ss_pred ----cccccCHHHHHHHHHHhcccc--------ccccEEEeCHHHHHHHHhhccccccccccccccceeccccceecC-e Confidence 011234445556666554321 25679999998876655532112221 122232 444 7 Q ss_pred eEEeccccC Q lcl|NC_020198. 296 QVLDTAWLN 304 (304) Q Consensus 296 ~~iv~p~Ld 304 (304) +|++++.++ T Consensus 203 ~Vi~s~~~~ 211 (274) T protein:vir:96 203 VIVRSNKLE 211 (274) T ss_pred EEEEeCCCC Confidence 899999998 No 24 >protein:vir:2344 Length: 397 # NCBI annotation: gp14 # Family: family:all:507 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075281;genbank:gi:12657868;genbank:GeneID:920118 Probab=83.63 E-value=0.067 Score=27.02 Aligned_cols=222 Identities=16% Similarity=0.133 Sum_probs=101.3 Q ss_pred CCCccHHHHHHHHHH------------HHHHHHHHHhhcchhhcceEEEecCCccccccccccCCccchhcccce---ee Q lcl|NC_020198. 1 MAIITPALISALKTS------------FQKHFQDALATAPSTYLQVATVIPSTTASNTYGWLGQFPKLREWIGQR---VI 65 (304) Q Consensus 1 maii~~~~l~~l~~~------------~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~lrEw~Ge~---~~ 65 (304) |.. |+++...+.++ .+..+....+ .+.-.+++++++-.....+|..+..-|.. .|+||- .. T Consensus 1 ~g~-~~e~~~~~~~~t~~~~g~l~~~~~~~ii~~l~~--~s~i~~l~~~~~~~~~~~~ip~~~~~~~a-~wv~Eg~~~~~ 76 (397) T protein:vir:23 1 MGF-SADHSQIAQTKDTMFTGYLDPVQAKDYFAEAEK--TSIVQRVAQKIPMGATGIVIPHWTGDVSA-QWIGEGDMKPI 76 (397) T ss_pred CCc-CHHHHHHhhccCCCCccccchhHHHHHHHHHHh--ccchhhhcceeeccCCceEEEEEcCCcce-EEecCCccccc Confidence 433 33333332211 2233333332 23345667777744444455555555553 677553 33 Q ss_pred cccccccceeeeecccceeecchhhhhcCCcchhHHHHHHHHHHHHhcHHHHHHHHHhccCCCcccCccccccccccccc Q lcl|NC_020198. 66 KDMAAQGYQITNKLFESTVGVKRTDIEDDNLGVYGPLMQEMGRAAGAHPDELVFALLKAGNANLCYDGQNFFDTDHPVYP 145 (304) Q Consensus 66 ~~l~~~~~~i~nk~fe~tv~v~R~~I~dDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~cyDGq~fF~tdH~v~~ 145 (304) .+++=..-+++.++++..+.|+++.++|...++..-+.+.++++.++..|+.++ +|.+. +++...-.. T Consensus 77 s~~~f~~v~l~~~k~~~~v~iS~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l----~G~gt----~~~~~~~~~---- 144 (397) T protein:vir:23 77 TKGNMTKRDVHPAKIATIFVASAETVRANPANYLGTMRTKVATAIAMAFDNAAL----HGTNA----PSAFQGYLD---- 144 (397) T ss_pred cccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHh----hcccC----Ccccccccc---- Confidence 333334466778999999999999999999999999999999999999998765 23222 233221111 Q ss_pred ccccccchhhhhhhhcccCCCCccceeccCCccchhhhhhhccccchhhcccCcccccccccceEEEeeccccccccchh Q lcl|NC_020198. 146 NVDGTGTATTVSNLFAPAADPGAAWYLLDTSRSLKPLIYQERMKPSFTSMTKEDDEQVFMADEYRYGVRSRCNVGFGFWQ 225 (304) Q Consensus 146 ~~~~tg~~~s~snl~~~~~~~g~~w~L~d~~~~~kP~i~Q~r~~~~~~~~~~~~~~~vf~~~~~~~Gvd~R~n~G~g~wq 225 (304) .++.... .+.......+++.-..+++ +.+.+++ | T Consensus 145 ---~~~~~~~-------~~~~~~~~~~~~~~~~l~~--------------------------------~~~~~a~---~- 178 (397) T protein:vir:23 145 ---QSNKTQS-------ISPNAYQGLGVSGLTKLVT--------------------------------DGKKWTH---T- 178 (397) T ss_pred ---cccceee-------ecccchhHHHHHHHHhhhh--------------------------------cccCCCE---E- Confidence 0100000 0000011111111000100 0000100 1 Q ss_pred hhhccccccchhHHHHHHHHHHHhccCCCceeceec------------CeEEecchHHHHHHHHHhhhccCCCCcceecc Q lcl|NC_020198. 226 LAAMSTEELNQVNFEKVYDAMRNQKADGGRPLDIRP------------NLLVVPTTLRSKAKEVVGVQRLANGADNPNFE 293 (304) Q Consensus 226 ~a~~s~~~l~~~~l~aar~aM~~~k~~~G~~L~i~P------------~~LvVpp~le~~A~~ll~~~~~~~g~~N~~~g 293 (304) -++. ..+.++++.|+.+|++|-... +++=+|. .-++..+.|.+-.+.| T Consensus 179 -------vmn~----~~~~~L~~lkd~~G~~i~~~~~~~~~~~~~~~~tl~G~Pv---------~~s~~~~~g~~~~~~g 238 (397) T protein:vir:23 179 -------LLDD----TVEPVLNGSVDANGRPLFVESTYESLTTPFREGRILGRPT---------ILSDHVAEGDVVGYAG 238 (397) T ss_pred -------EEcH----HHHHHHHHhhccCCceeecccccccccccccCceeeeeeE---------EEeCCCCCCceEEEEe Confidence 1111 235678888999998873311 1111121 1112233333221211 Q ss_pred e-------------eeEEeccccC Q lcl|NC_020198. 294 L-------------VQVLDTAWLN 304 (304) Q Consensus 294 ~-------------~~~iv~p~Ld 304 (304) . +++.-+..+. T Consensus 239 Dfs~~~i~~~~~i~i~~~~e~~~~ 262 (397) T protein:vir:23 239 DFSQIIWGQVGGLSFDVTDQATLN 262 (397) T ss_pred ecceEEEEEEeceEEEEeeeeeee Confidence 1 1111111111 No 25 >protein:vir:8187 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817980;genbank:gi:29566414;genbank:GeneID:2700968 Probab=83.60 E-value=0.038 Score=28.38 Aligned_cols=228 Identities=15% Similarity=0.107 Sum_probs=94.5 Q ss_pred CCCccHH-HHHHHHHHHHHHHHHHHhhcchhhcceEEEecCCccccccccccCCccchhcccce---eecccccccceee Q lcl|NC_020198. 1 MAIITPA-LISALKTSFQKHFQDALATAPSTYLQVATVIPSTTASNTYGWLGQFPKLREWIGQR---VIKDMAAQGYQIT 76 (304) Q Consensus 1 maii~~~-~l~~l~~~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~lrEw~Ge~---~~~~l~~~~~~i~ 76 (304) ||-++.. .+ +=.-+...+-+.... .+...++|+++|-.....+|..+..-|.- .|+||- ...+.+=..-+++ T Consensus 1 mat~~~gg~l--vP~~~~~~ii~~~~~-~s~i~~~~~~i~~~~~~~~~p~~~~~~~a-~wv~Eg~~~~~~~~~f~~v~l~ 76 (311) T protein:vir:81 1 MVALATGTFQ--LPKHLVPGVWQKAQG-QSVLARLSMAEPQEFGEQQYMTLTAPPRG-EVVGEGAQKSESTATFAPVTAI 76 (311) T ss_pred CceecCCceE--cchhHHHHHHHHHHh-cchhhhhcceeecCCCceEEEEEeCCcee-EEeecCcccccccceeeEEEEe Confidence 7754331 11 001122222222222 23456778888765555666666555553 788654 3333444566777 Q ss_pred eecccceeecchhhh---hcCCcchhHHHHHHHHHHHHhcHHHHHHHHHhccCCCcccCccc--cccccccccccccccc Q lcl|NC_020198. 77 NKLFESTVGVKRTDI---EDDNLGVYGPLMQEMGRAAGAHPDELVFALLKAGNANLCYDGQN--FFDTDHPVYPNVDGTG 151 (304) Q Consensus 77 nk~fe~tv~v~R~~I---~dDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~cyDGq~--fF~tdH~v~~~~~~tg 151 (304) -++++..+.|+++.+ .||..++..-+..+++++.++.+++.++.--.+| ++....|.. ..++-..+. ..... T Consensus 77 ~~kl~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~a~l~G~~~~-~~~~~~gi~~~~~~~~~~~~--~~~~~ 153 (311) T protein:vir:81 77 PRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPL-TGAALSGSPAKILDTTNIVE--LTTGT 153 (311) T ss_pred eEEEEEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHhhhccccCC-CCcccccccccccccceeee--ecccc Confidence 889999999999988 4677899999999999999999988876432211 112222211 112111110 00000 Q ss_pred chh---hhhhhh---cccCCCCccceec-----------c-CCccc---------------hhhhhhh------------ Q lcl|NC_020198. 152 TAT---TVSNLF---APAADPGAAWYLL-----------D-TSRSL---------------KPLIYQE------------ 186 (304) Q Consensus 152 ~~~---s~snl~---~~~~~~g~~w~L~-----------d-~~~~~---------------kP~i~Q~------------ 186 (304) ... .+..+. .........|.+= | ..+++ .|+++.. T Consensus 154 ~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~tl~G~Pv~~~~~i~~~~~~~~~~ 233 (311) T protein:vir:81 154 SATPDLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTAS 233 (311) T ss_pred cchHHHHHHHHHHHhhhcCCCceEEEEcHHHHHHHHhhhccCCCeeecCccccCCCceecceeEEecccccccccccccc Confidence 000 011110 0000111112110 0 00111 1222221 Q ss_pred -------------------------ccccchhhcccCccc---ccccccceEEEeeccccccccchhhhhccccccchhH Q lcl|NC_020198. 187 -------------------------RMKPSFTSMTKEDDE---QVFMADEYRYGVRSRCNVGFGFWQLAAMSTEELNQVN 238 (304) Q Consensus 187 -------------------------r~~~~~~~~~~~~~~---~vf~~~~~~~Gvd~R~n~G~g~wq~a~~s~~~l~~~~ 238 (304) |+.+++......+.+ +.|.++.+.|=+..|+ T Consensus 234 ~~~~~~~~~~~~~~~gDfs~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~r~~~r~--------------------- 292 (311) T protein:vir:81 234 TGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVY--------------------- 292 (311) T ss_pred cchhcccCCccEEEEEecccEEEEEeccceEEEeccCCCCcchhhhhcCcEEEEEEEEe--------------------- Confidence 222222111111111 1233333333333333 Q ss_pred HHHHHHHHHHhccCCCceeceecCeE-EecchHHH Q lcl|NC_020198. 239 FEKVYDAMRNQKADGGRPLDIRPNLL-VVPTTLRS 272 (304) Q Consensus 239 l~aar~aM~~~k~~~G~~L~i~P~~L-vVpp~le~ 272 (304) ++.++. |+-+ ++-..-+. T Consensus 293 --------------d~~v~~--~~a~~~l~~a~~~ 311 (311) T protein:vir:81 293 --------------GIGIMS--TDAFAVVRDADES 311 (311) T ss_pred --------------ccEeec--ccceEEEEeeccC Confidence 222222 1111 01000000 No 26 >protein:vir:103955 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873992;genbank:gi:118430767;genbank:GeneID:4525449 Probab=82.03 E-value=0.08 Score=26.59 Aligned_cols=223 Identities=13% Similarity=0.093 Sum_probs=95.6 Q ss_pred CCCccHHHHHHHHHHHHHHHHHHHhhcchhhcceEEEecCCccccccccccCCccchhcccce---eecccccccceeee Q lcl|NC_020198. 1 MAIITPALISALKTSFQKHFQDALATAPSTYLQVATVIPSTTASNTYGWLGQFPKLREWIGQR---VIKDMAAQGYQITN 77 (304) Q Consensus 1 maii~~~~l~~l~~~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~lrEw~Ge~---~~~~l~~~~~~i~n 77 (304) |.--+... -|-..+...+.+...+. +...++|+.+|..+..-+|..+..-|. -+|+||- ...++.-..-++.- T Consensus 30 ~~~~~~~~--liP~~~~~~ii~~~~~~-s~l~~~~~~~~~~~~~~~~p~~~~~~~-a~~v~Eg~~~~~~~~~~~~v~~~~ 105 (324) T protein:vir:10 30 MMHEKKDG--TLLNDFTTPILQEVMEN-SKIMQLGKYEPMEGTEKKFTFWADKPG-AYWVGEGQKIETSKATWVNATMRA 105 (324) T ss_pred eccCCCcc--eechhHHHHHHHHHHhh-chhhhhcceeeccCCceEEEEEeCCcc-eeEeccCccccccccceeEEEEee Confidence 11100000 01111222222223222 234556777775555556666544455 3787554 33334445567778 Q ss_pred ecccceeecchhhhhcCCcchhHHHHHHHHHHHHhcHHHHHHHHHhccCCCcccCcccccccccccccccccccchhhhh Q lcl|NC_020198. 78 KLFESTVGVKRTDIEDDNLGVYGPLMQEMGRAAGAHPDELVFALLKAGNANLCYDGQNFFDTDHPVYPNVDGTGTATTVS 157 (304) Q Consensus 78 k~fe~tv~v~R~~I~dDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~cyDGq~fF~tdH~v~~~~~~tg~~~s~s 157 (304) ++++..+.|+|+.+.|....+.+-+.+.++++.++..+..++. |... -..+..++.+.... ...+...++.. T Consensus 106 ~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~ai~~~~d~a~l~----G~g~-~~~~~~i~~~~~~~---~~~~~~~~t~~ 177 (324) T protein:vir:10 106 FKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGIL----NQGN-NPFGKSIAQSIEKT---NKVIKGDFTQD 177 (324) T ss_pred EEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhh----cCCC-CccCcccccccccc---ceeccccCCHH Confidence 9999999999999999889999999999999999988886642 2111 01122222111100 00000000000 Q ss_pred hhhcccCCCCccceeccCCccchhhhhhhccccchhhcccCcccccccccceEEEeeccccccccchhhhhccccccchh Q lcl|NC_020198. 158 NLFAPAADPGAAWYLLDTSRSLKPLIYQERMKPSFTSMTKEDDEQVFMADEYRYGVRSRCNVGFGFWQLAAMSTEELNQV 237 (304) Q Consensus 158 nl~~~~~~~g~~w~L~d~~~~~kP~i~Q~r~~~~~~~~~~~~~~~vf~~~~~~~Gvd~R~n~G~g~wq~a~~s~~~l~~~ 237 (304) .+ ++.-..|++ . .+.+.+ | -++.. T Consensus 178 ~i-------------~~~~~~l~~-----------------~---------------~~~~~~---~--------v~n~~ 201 (324) T protein:vir:10 178 NI-------------IDLEALLED-----------------D---------------ELEANA---F--------ISKTQ 201 (324) T ss_pred HH-------------HHHHHhhhh-----------------c---------------cCCCCE---E--------EEcHH Confidence 00 000000000 0 000000 1 11222 Q ss_pred HHHHHHHHHHHhccCCCceece--ecCeEE-ecchHHHHHHHHHhhhccCCCC--------cceec----c-eeeEEecc Q lcl|NC_020198. 238 NFEKVYDAMRNQKADGGRPLDI--RPNLLV-VPTTLRSKAKEVVGVQRLANGA--------DNPNF----E-LVQVLDTA 301 (304) Q Consensus 238 ~l~aar~aM~~~k~~~G~~L~i--~P~~Lv-Vpp~le~~A~~ll~~~~~~~g~--------~N~~~----g-~~~~iv~p 301 (304) . +..+++.|+..|+++-. .|..|. +|. +-....+.+. .+.+. + .+++.-++ T Consensus 202 ~----~~~L~~l~d~~g~~~~~~~~~~~l~G~PV---------~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~ 268 (324) T protein:vir:10 202 N----RSLLRKIVDPETKERIYDRNSDTLDGLPV---------VNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETA 268 (324) T ss_pred H----HHHHHHhhccCCceeecCCCCccccceeE---------EeecCCCCCcceEEEEecccEEEEEecCcEEEEeecc Confidence 2 34566788888987622 222111 121 0001111111 11111 1 12222222 Q ss_pred ccC Q lcl|NC_020198. 302 WLN 304 (304) Q Consensus 302 ~Ld 304 (304) .+. T Consensus 269 ~~~ 271 (324) T protein:vir:10 269 QLS 271 (324) T ss_pred ccc Confidence 222 No 27 >protein:vir:100247 Length: 425 # NCBI annotation: gp76 # Family: family:all:21 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355412;genbank:gi:77864702;genbank:GeneID:3725969 Probab=81.80 E-value=0.074 Score=26.78 Aligned_cols=225 Identities=12% Similarity=0.092 Sum_probs=88.7 Q ss_pred CCCccH--------------HHHHHHH------------HHHHHHHHHHHhhcchhhcceEEEecCCccccccccccCCc Q lcl|NC_020198. 1 MAIITP--------------ALISALK------------TSFQKHFQDALATAPSTYLQVATVIPSTTASNTYGWLGQFP 54 (304) Q Consensus 1 maii~~--------------~~l~~l~------------~~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P 54 (304) ...+.. +.-+++. +-+...+.+.+... +...++|+.+|.++...+|...-.-| T Consensus 105 ~~~~~~~~~~~af~~~l~~~e~~~al~~~t~~~gG~lvP~~~~~~ii~~~~~~-s~l~~l~~~~~~~~~~~~~~~~~~~~ 183 (425) T protein:vir:10 105 VKPLRDPEYTEAFKAHVKRGDVQAALNKGEDSEGGYLTPIEWDRTITNKLVLI-SPMRQLCRVQPVSKAGFSKLFNMGGT 183 (425) T ss_pred ccccccHHHHHHHHHHhhhhhhHHHhhcCcCCCCceeccHhHHHHHHHHHHhh-hhhhhhceeeeccCCceEEEEEcCCc Confidence 000000 0000110 01122222222222 23345677777665556665554455 Q ss_pred cchhcccceeecc---c-ccccceeeeecccceeecchhhhhcCCcchhHHHHHHHHHHHHhcHHHHHHH---------H Q lcl|NC_020198. 55 KLREWIGQRVIKD---M-AAQGYQITNKLFESTVGVKRTDIEDDNLGVYGPLMQEMGRAAGAHPDELVFA---------L 121 (304) Q Consensus 55 ~lrEw~Ge~~~~~---l-~~~~~~i~nk~fe~tv~v~R~~I~dDdlG~~~~~~~~~G~aAa~~~~~lv~~---------l 121 (304) .. .|+||-.-.. . .=..-++..++++..+.||++.+.|-...+..-+.+.++++.++..+..++. + T Consensus 184 ~a-~wv~E~~~~~~~~~~~f~~v~~~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~ai~~~~d~~~l~G~G~~~p~Gi 262 (425) T protein:vir:10 184 TS-GWVGEASQRPQTNAATFQPLSFASGEIYANPAATQQILDDAEIDLESWLATEVQTEFAKQEGKAFLAGDGTNKPNGL 262 (425) T ss_pred ce-eeeccccccccccccccceeeeeheeeEeehHhHHHHHhcchhHHHHHHHHHHHHHHHHHHHhhhhcccCCCCccee Confidence 54 7887654321 1 2244678889999999999999998889999999999999999988875542 1 Q ss_pred Hhc---cCC------------------Ccc------------------------------------cCcccccccccccc Q lcl|NC_020198. 122 LKA---GNA------------------NLC------------------------------------YDGQNFFDTDHPVY 144 (304) Q Consensus 122 L~~---g~~------------------~~c------------------------------------yDGq~fF~tdH~v~ 144 (304) |+. +.+ ..- -+|+|+|..+...+ T Consensus 263 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~l~~l~~~l~~~~~~~a~~vmn~~~~~~L~~lkD~~G~~l~~~~~~~g 342 (425) T protein:vir:10 263 LTYIAGGANAAKHPFGAIEVVNSGAAADITSDGIIDLVYDLPSAFTGNARFAMNRNTQRQVRKLKDGQGNYLWQPSYVAG 342 (425) T ss_pred eeccccccccccccccccccccccccccccHHHHHHHHhhhhhhhccCCEEEEchHHHHHHHHhhcCCCceeeccCccCC Confidence 110 000 000 01122221111000 Q ss_pred cccccccchhhhhhhhcccCCCCccceeccCCccchhhhhhhccccchhhcccCcccccccccceEEEeeccccccccch Q lcl|NC_020198. 145 PNVDGTGTATTVSNLFAPAADPGAAWYLLDTSRSLKPLIYQERMKPSFTSMTKEDDEQVFMADEYRYGVRSRCNVGFGFW 224 (304) Q Consensus 145 ~~~~~tg~~~s~snl~~~~~~~g~~w~L~d~~~~~kP~i~Q~r~~~~~~~~~~~~~~~vf~~~~~~~Gvd~R~n~G~g~w 224 (304) ..--=-|..+.+++-....+....+.++-|.++.+. | ..|...++ ..+.-|.++...|=+..|+..+.--. T Consensus 343 ~~~~l~G~PV~~~~~~p~~~~~~~~i~~Gd~~~~~~--i-~~~~~~~v------~~d~~~~~~~~~~~~~~r~d~~v~~~ 413 (425) T protein:vir:10 343 QPATLAGYPVTEVPDMPDVAANSTPILFGDFQQTYL--I-IDRIGVRV------LRDPYTAKPYVLFYTTKRVGGGLLNP 413 (425) T ss_pred CCceecceeeEEecCcCCccCCccEEEEEehhccEE--E-EEecceEE------EecccccCCcEEEEEEEEeccEeecc Confidence 000000001111110000000111122223222211 1 11211111 11222344554444444443322111 Q ss_pred hhhhccccccch Q lcl|NC_020198. 225 QLAAMSTEELNQ 236 (304) Q Consensus 225 q~a~~s~~~l~~ 236 (304) +.-..-+-+-+. T Consensus 414 ~A~~~l~~~as~ 425 (425) T protein:vir:10 414 EPMRAMKVAASE 425 (425) T ss_pred cceEEEEeeccC Confidence 100000000000 No 28 >protein:vir:97148 Length: 324 # NCBI annotation: ORF010 # Family: family:all:507 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239726;genbank:gi:66394880;genbank:GeneID:5130881 Probab=72.72 E-value=0.18 Score=24.68 Aligned_cols=225 Identities=12% Similarity=0.084 Sum_probs=95.5 Q ss_pred CCCccHHHHHHH-HHHHHHHHHHHHhhcchhhcceEEEecCCccccccccccCCccchhcccce---eecccccccceee Q lcl|NC_020198. 1 MAIITPALISAL-KTSFQKHFQDALATAPSTYLQVATVIPSTTASNTYGWLGQFPKLREWIGQR---VIKDMAAQGYQIT 76 (304) Q Consensus 1 maii~~~~l~~l-~~~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~lrEw~Ge~---~~~~l~~~~~~i~ 76 (304) +.+.....-..+ =..+...+.+..... +...++|+++|.......|..+..-|.. .|+||- ...++.-..-++. T Consensus 27 ~~~~~~~~~~~~iP~~~~~~ii~~~~~~-s~l~~~~~~~~~~~~~~~ip~~~~~~~a-~~v~Eg~~~~~~~~~f~~v~~~ 104 (324) T protein:vir:97 27 DNVMMHEKKDGTLMNEFTTPILQEVMEN-SKIMQLGKYEPMEGTEKKFTFWADKPGA-YWVGEGQKIETSKATWVNATMR 104 (324) T ss_pred ccccccCCCcceechhHHHHHHHHHHhh-cchhhhcceeeccCCceEEEEEecCcce-eEeccCccccccccceeEEEEe Confidence 100000000000 011222222222222 2245667888866555566655555553 687554 3333444556778 Q ss_pred eecccceeecchhhhhcCCcchhHHHHHHHHHHHHhcHHHHHHHHHhccCCCcccCcccccccccccccccccccchhhh Q lcl|NC_020198. 77 NKLFESTVGVKRTDIEDDNLGVYGPLMQEMGRAAGAHPDELVFALLKAGNANLCYDGQNFFDTDHPVYPNVDGTGTATTV 156 (304) Q Consensus 77 nk~fe~tv~v~R~~I~dDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~cyDGq~fF~tdH~v~~~~~~tg~~~s~ 156 (304) .++++..+.|+|+.+.|-...+..-+.+.++++.++..++.++. |.... ..+.-++++.+.. ...+...++. T Consensus 105 ~~k~~~~~~is~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~----G~g~~-~~~~gi~~~~~~~---~~~~~~~~~~ 176 (324) T protein:vir:97 105 AFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGIL----NQGNN-PFGKSIAQSIEKT---NKVIKGDFTQ 176 (324) T ss_pred eEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhc----cCCCC-ccCcccccccccc---ceeccccCCH Confidence 89999999999999998888999999999999999998886653 21111 0111112211110 0000000111 Q ss_pred hhhhcccCCCCccceeccCCccchhhhhhhccccchhhcccCcccccccccceEEEeeccccccccchhhhhccccccch Q lcl|NC_020198. 157 SNLFAPAADPGAAWYLLDTSRSLKPLIYQERMKPSFTSMTKEDDEQVFMADEYRYGVRSRCNVGFGFWQLAAMSTEELNQ 236 (304) Q Consensus 157 snl~~~~~~~g~~w~L~d~~~~~kP~i~Q~r~~~~~~~~~~~~~~~vf~~~~~~~Gvd~R~n~G~g~wq~a~~s~~~l~~ 236 (304) .++ ++ ++.... +. .+.+++ | -++. T Consensus 177 ~~i-------------~~-------~~~~l~----------~~---------------~~~~~~---~--------v~n~ 200 (324) T protein:vir:97 177 DNI-------------ID-------LEALLE----------DD---------------ELEANA---F--------ISKT 200 (324) T ss_pred HHH-------------HH-------HHHhhh----------hc---------------cCCCCE---E--------EEcH Confidence 111 00 110000 00 000000 1 1222 Q ss_pred hHHHHHHHHHHHhccCCCceece--ecCeEE-ecchHHHHHHHHHhhhccCCCCc--------ceecc-----eeeEEec Q lcl|NC_020198. 237 VNFEKVYDAMRNQKADGGRPLDI--RPNLLV-VPTTLRSKAKEVVGVQRLANGAD--------NPNFE-----LVQVLDT 300 (304) Q Consensus 237 ~~l~aar~aM~~~k~~~G~~L~i--~P~~Lv-Vpp~le~~A~~ll~~~~~~~g~~--------N~~~g-----~~~~iv~ 300 (304) .. +..+++.|+..|+++-. .+..|. +| ++.++..+.+.. +-+.+ .+++.-+ T Consensus 201 ~~----~~~L~~lkd~~g~~~~~~~~~~tl~G~P---------V~~~~~~~~~~~~~~~gd~~~~~i~~~~~~~i~~~~~ 267 (324) T protein:vir:97 201 QN----RSLLRKIVDPETKERIYDRNSDTLDGLP---------VVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDET 267 (324) T ss_pred HH----HHHHHHhhcCCCceeecCCCCcccccee---------eEeecCCCCCcceEEEEecccEEEEEecCcEEEEeec Confidence 22 34567788889987632 111111 11 111111111111 11111 1122222 Q ss_pred cccC Q lcl|NC_020198. 301 AWLN 304 (304) Q Consensus 301 p~Ld 304 (304) ..+. T Consensus 268 ~~~~ 271 (324) T protein:vir:97 268 AQLS 271 (324) T ss_pred cccc Confidence 2222 No 29 >protein:vir:4226 Length: 326 # NCBI annotation: observed 35.2Kd protein # Family: family:all:507 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039681;swissprot:sw:q05223;genbank:gi:9625447;uniprot:Q05223;genbank:GeneID:2942929 Probab=72.47 E-value=0.18 Score=24.64 Aligned_cols=235 Identities=14% Similarity=0.131 Sum_probs=94.7 Q ss_pred CCC--------ccHHHHHHHHHH-----------H-HHHHHHHHhhcchhhcceEEEecCCccccccccccCCccchhcc Q lcl|NC_020198. 1 MAI--------ITPALISALKTS-----------F-QKHFQDALATAPSTYLQVATVIPSTTASNTYGWLGQFPKLREWI 60 (304) Q Consensus 1 mai--------i~~~~l~~l~~~-----------~-~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~lrEw~ 60 (304) |++ +.....+++.++ + +..+....+.. .-.++|+++|.....-.|..+..-|.. .|+ T Consensus 1 ~~~~~~r~~~~~~~~e~~a~~~~~~~~g~~ip~~~~~~ii~~~~~~s--~i~~~~~~~~~~~~~~~~p~~~~~~~a-~~v 77 (326) T protein:vir:42 1 MAVNPDRTTPFLGVNDPKVAQTGDSMFEGYLEPEQAQDYFAEAEKIS--IVQQFAQKIPMGTTGQKIPHWTGDVSA-SWI 77 (326) T ss_pred CCCCccchhhhcCcchhhheeccccCCcceechhhHHHHHHHHHhcc--hhhhhcceeeccCCceEEEEEeCCcce-EEe Confidence 111 111111111110 0 11122222222 234456777754444445444444443 466 Q ss_pred cc---eeecccccccceeeeecccceeecchhhhhcCCcchhHHHHHHHHHHHHhcHHHHHHH---------HHhc--cC Q lcl|NC_020198. 61 GQ---RVIKDMAAQGYQITNKLFESTVGVKRTDIEDDNLGVYGPLMQEMGRAAGAHPDELVFA---------LLKA--GN 126 (304) Q Consensus 61 Ge---~~~~~l~~~~~~i~nk~fe~tv~v~R~~I~dDdlG~~~~~~~~~G~aAa~~~~~lv~~---------lL~~--g~ 126 (304) || +.-.++.=..-++.-++++..|.|||+.+.+=...+.+-+.++++++.++..++.++. +++. +. T Consensus 78 ~Eg~~~~~~~~~f~~i~~~~~k~~~~v~iS~ell~~s~~~~~~~i~~~l~~a~~~~~d~a~l~G~gs~~p~gi~~~~~~~ 157 (326) T protein:vir:42 78 GEGDMKPITKGNMTSQTIAPHKIATIFVASAETVRANPANYLGTMRTKVATAFAMAFDNAAINGTDSPFPTFLAQTTKEV 157 (326) T ss_pred cCCccccccccceeEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccc Confidence 44 3333444455778889999999999999998789999999999999999999987762 1100 00 Q ss_pred CCcccCccc--------------ccccccccccc-----------------cccccchhhhhhhhcccCC-------CCc Q lcl|NC_020198. 127 ANLCYDGQN--------------FFDTDHPVYPN-----------------VDGTGTATTVSNLFAPAAD-------PGA 168 (304) Q Consensus 127 ~~~cyDGq~--------------fF~tdH~v~~~-----------------~~~tg~~~s~snl~~~~~~-------~g~ 168 (304) ...-..++. +....++.+.. .+++|..+-......+... -|- T Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~~~~~~l~G~ 237 (326) T protein:vir:42 158 SLVDPDGTGSNADLTVYDAVAVNALSLLVNAGKKWTHTLLDDITEPILNGAKDKSGRPLFIESTYTEENSPFRLGRIVAR 237 (326) T ss_pred ceeecccccccccchhHHHHHHHHHhhhhhhccCccEEEEeHHHHHHHHHhhccCCceeeccccccCccccccCceeeee Confidence 000000000 00000000000 0111111111111100000 011 Q ss_pred cceecc---CCccc------hhhhhhhccccchhhc--------ccCccc--ccccccceEEEeeccccccccchhhhhc Q lcl|NC_020198. 169 AWYLLD---TSRSL------KPLIYQERMKPSFTSM--------TKEDDE--QVFMADEYRYGVRSRCNVGFGFWQLAAM 229 (304) Q Consensus 169 ~w~L~d---~~~~~------kP~i~Q~r~~~~~~~~--------~~~~~~--~vf~~~~~~~Gvd~R~n~G~g~wq~a~~ 229 (304) |.++-+ ..+.+ +-+++-.|...++... ++++.. +.|.+|+..|=+..|+..+. .+-.- T Consensus 238 pv~~~~~~~~~~~~~~~Gd~s~~~~~~~~~~~v~~~~e~~~~~~~~~~~~~~~~~~~d~~~~r~~~~~d~~v--~~~~a- 314 (326) T protein:vir:42 238 PTILSDHVASGTVVGYQGDFRQLVWGQVGGLSFDVTDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYAFHC--NDKDA- 314 (326) T ss_pred eEEEcCCCCCCceEEEEeecceEEEEEecceEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEE--ecccc- Confidence 111111 11100 1112223333333211 122222 33777777776666664322 22110 Q ss_pred cccccchhHHHHH Q lcl|NC_020198. 230 STEELNQVNFEKV 242 (304) Q Consensus 230 s~~~l~~~~l~aa 242 (304) =..|+...-.+| T Consensus 315 -~~~l~~~~~~~~ 326 (326) T protein:vir:42 315 -FVKLTNVDATEA 326 (326) T ss_pred -eEEEeeccccCC Confidence 011211111111 No 30 >protein:vir:79642 Length: 329 # NCBI annotation: HsbB # Family: family:all:463 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285525;genbank:gi:148734508;genbank:GeneID:5220000 Probab=71.68 E-value=0.19 Score=24.51 Aligned_cols=213 Identities=13% Similarity=0.099 Sum_probs=96.0 Q ss_pred CCCccHH---HHHHHHHHHHHHHHHHHhh--cchhhcceE---EEecCCccccccccccCCccchhccccee----eccc Q lcl|NC_020198. 1 MAIITPA---LISALKTSFQKHFQDALAT--APSTYLQVA---TVIPSTTASNTYGWLGQFPKLREWIGQRV----IKDM 68 (304) Q Consensus 1 maii~~~---~l~~l~~~~~~~f~~a~~~--a~~~~~~~a---~~v~S~~~~~~y~~Lg~~P~lrEw~Ge~~----~~~l 68 (304) |...+.. ...-+..-+...=+.-|+. ++-+|+++- +.++-...+-+|...... +.-+|+|++. .-+. T Consensus 26 ~~~~~~~~~~~~~f~~~ql~~id~~v~e~~~~~l~~~~~i~i~~~~~~~~~~~t~~~~~~~-G~a~~~~d~~~dip~vd~ 104 (329) T protein:vir:79 26 LRGAKNDASDMGIWTSQELHKIKAQAYEKEYPAGSALRVFPVTSELSDTDKTFEYQTFDKV-GHAKIIADYTDDLSTVDA 104 (329) T ss_pred cccceeccchhhHHHHHHHHHHHHHHHhhhhcccchhhhcccccCCCCceeEEEeeeeecc-eeeeeecCcccccceeec Confidence 2221111 1111111112222223332 233344443 222222333345444332 3346776542 2333 Q ss_pred ccccceeeeecccceeecchhhhhcCC---cchhHHHHHHHHHHHHhcHHHHHHHHHhccCCCcccCccccccccccccc Q lcl|NC_020198. 69 AAQGYQITNKLFESTVGVKRTDIEDDN---LGVYGPLMQEMGRAAGAHPDELVFALLKAGNANLCYDGQNFFDTDHPVYP 145 (304) Q Consensus 69 ~~~~~~i~nk~fe~tv~v~R~~I~dDd---lG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~cyDGq~fF~tdH~v~~ 145 (304) .-..++.+...|+..+.++.++++--. +-+=++....+.++.+++.|+++| .| ++.|.+++ T Consensus 105 ~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f----~G------------~~~~g~~G 168 (329) T protein:vir:79 105 LMTSEFGKVFRLGNAFLISIDEIKAGQRTGKSLSTRKANAAQNAHDQLVNHLVF----KG------------SKPHKIIS 168 (329) T ss_pred ccceeEEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEE----ee------------ccccccee Confidence 445677888899999999999988663 333344555555555666666655 22 22332221 Q ss_pred ccccccchhhhhhhhcccCCCCccceeccCCccchhhhhhhccccchhhcccCcccccccccceEEEeeccccccccchh Q lcl|NC_020198. 146 NVDGTGTATTVSNLFAPAADPGAAWYLLDTSRSLKPLIYQERMKPSFTSMTKEDDEQVFMADEYRYGVRSRCNVGFGFWQ 225 (304) Q Consensus 146 ~~~~tg~~~s~snl~~~~~~~g~~w~L~d~~~~~kP~i~Q~r~~~~~~~~~~~~~~~vf~~~~~~~Gvd~R~n~G~g~wq 225 (304) -+...+ +..... ++.+.+.|.-- T Consensus 169 LlN~p~----v~~~~~-~~~~~~~w~~k---------------------------------------------------- 191 (329) T protein:vir:79 169 VFEHPN----LTTINS-AGWNNAAGTGK---------------------------------------------------- 191 (329) T ss_pred eecCCC----cccccc-CCCCCcccccc---------------------------------------------------- Confidence 110000 000000 01111122100 Q ss_pred hhhccccccchhHHHHHHHHHHHhccCCCceeceecCeEEecchHHHHHHHHHhhhccCCCCc--------ceecceeeE Q lcl|NC_020198. 226 LAAMSTEELNQVNFEKVYDAMRNQKADGGRPLDIRPNLLVVPTTLRSKAKEVVGVQRLANGAD--------NPNFELVQV 297 (304) Q Consensus 226 ~a~~s~~~l~~~~l~aar~aM~~~k~~~G~~L~i~P~~LvVpp~le~~A~~ll~~~~~~~g~~--------N~~~g~~~~ 297 (304) +..--.+-+.+++.++..+... .+.|+.|++||++... |.+...+.|.+ || .+++ T Consensus 192 -----t~~ei~~di~~~~~~l~~~s~g-----~~~p~~L~Lpp~~~~~----L~~~~~~~~~tvl~~lk~~~~---~l~I 254 (329) T protein:vir:79 192 -----KPETAQDELEQAIEKIETLTNG-----QHRANMILIPPSMRKV----LMVRMPETTMSYLDYFKQQNG---GITI 254 (329) T ss_pred -----CHHHHHHHHHHHHHHHHHhcCc-----eecccEEEecHHHHHH----hhcccCCCCccHHHHHHHhCC---CcEE Confidence 0011123355666666665442 3579999999987643 33322222221 33 3678 Q ss_pred EeccccC Q lcl|NC_020198. 298 LDTAWLN 304 (304) Q Consensus 298 iv~p~Ld 304 (304) .-.|+|+ T Consensus 255 ~~~~el~ 261 (329) T protein:vir:79 255 ESISELE 261 (329) T ss_pred EEccccc Confidence 8889998 No 31 >protein:vir:104085 Length: 320 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655596;genbank:gi:109392467;genbank:GeneID:4156953 Probab=71.02 E-value=0.14 Score=25.31 Aligned_cols=230 Identities=13% Similarity=0.110 Sum_probs=92.3 Q ss_pred CCCc---cHHHHHH-----------HHHHHHHHHHHHHhhcchhhcceEEEecCCccccccccccCCccchhcccce--- Q lcl|NC_020198. 1 MAII---TPALISA-----------LKTSFQKHFQDALATAPSTYLQVATVIPSTTASNTYGWLGQFPKLREWIGQR--- 63 (304) Q Consensus 1 maii---~~~~l~~-----------l~~~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~lrEw~Ge~--- 63 (304) |+-= +.+.... |-..+...+.+..... +.-.+++++++-....-+|..+..-|.. .|++|- T Consensus 1 ~~~~~~~~~~~~~~~~t~~~~~~~~ip~~~~~~ii~~~~~~-s~l~~~~~~~~~~~~~~~~p~~~~~~~a-~~v~E~~~~ 78 (320) T protein:vir:10 1 MAAGTAFQVDHAQIAQTGDTMFKGYLEPEQAKDYFAEAEKT-SIVQQFAQKVPMGTTGQKIPHWIGDVSA-QWIGEGDMK 78 (320) T ss_pred CCCCccCCHHHHHhhccccccccccccHHHHHHHHHHHHhc-cchhhhcceeeccCCceEEEEEeCCcce-EEecCCccc Confidence 2210 1111111 1111222222222222 2245567777754444455555555554 577543 Q ss_pred eecccccccceeeeecccceeecchhhhhcCCcchhHHHHHHHHHHHHhcHHHHHHHHHhcc-----------CCC---- Q lcl|NC_020198. 64 VIKDMAAQGYQITNKLFESTVGVKRTDIEDDNLGVYGPLMQEMGRAAGAHPDELVFALLKAG-----------NAN---- 128 (304) Q Consensus 64 ~~~~l~~~~~~i~nk~fe~tv~v~R~~I~dDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g-----------~~~---- 128 (304) ...++.=..-+++-++++..+.|+|+.+.|-...+..-+.+.|+++.++..++.++.==.+| .+. T Consensus 79 ~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~l~~a~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~ 158 (320) T protein:vir:10 79 PITKGNMTSQNIAPHKIATIFVASAETVRANPANYLGTMRTKVATAFAMAFDSAALNGTDSPFPTYLAQTTKSVSLADPG 158 (320) T ss_pred cccccceeEEEEeeEEEEEeehhhHHHHhcChHHHHHHHHHHHHHHHHHHHHHHhhcccCCCCCcccccccccccceecc Confidence 33333334567788999999999999999989999999999999999999998875200000 000 Q ss_pred -cccCcccccccc--------ccccc-----------------ccccccchhhhhhhhcccC------------------ Q lcl|NC_020198. 129 -LCYDGQNFFDTD--------HPVYP-----------------NVDGTGTATTVSNLFAPAA------------------ 164 (304) Q Consensus 129 -~cyDGq~fF~td--------H~v~~-----------------~~~~tg~~~s~snl~~~~~------------------ 164 (304) .-+++-..++.+ |..+. -.++.|..+-......+.. T Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~~~~~~i~g~pv~~~~~ 238 (320) T protein:vir:10 159 GATASDLTAYDAVAVNGLSLLVNAKKKWTHTLLDDIVEPILNGAKDKNGRPLFIESTYTDENSPFRAGRIVSRPTILSDH 238 (320) T ss_pred cccccccccHHHHHHHHHhhhhcccCCCcEEEEcHHHHHHHHHhhccCCceeeccccccCccccccCceeeeeeeEecCC Confidence 000100000000 00000 0001111100000000000 Q ss_pred -CCCccc-eeccCCccchhhhhhhccccchhhccc--------Ccc--cccccccceEEEeeccccccccchhhh-hcc- Q lcl|NC_020198. 165 -DPGAAW-YLLDTSRSLKPLIYQERMKPSFTSMTK--------EDD--EQVFMADEYRYGVRSRCNVGFGFWQLA-AMS- 230 (304) Q Consensus 165 -~~g~~w-~L~d~~~~~kP~i~Q~r~~~~~~~~~~--------~~~--~~vf~~~~~~~Gvd~R~n~G~g~wq~a-~~s- 230 (304) ..+..+ ++-|.++ +++..|..+++...+. +.. -+.|.+|+..|=+..|+.... ++-. ... T Consensus 239 ~~~~~~~~~~gd~~~----~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~~~d~~v--~~~~a~~~l 312 (320) T protein:vir:10 239 VADGTTVGYMGDFRN----VIWGQVGGLSFDVTDQATLNLGTPTEPNFVSLWQHNLVAVRVEAEYAFHN--NDKDAFVKL 312 (320) T ss_pred CCCCceEEEEeecce----EEEEEecCeEEEEeecceeeeccccccccchhhhcCcEEEEEEEeeccEE--ecccceEEE Confidence 000000 1112211 2233344443322111 111 123555555554444443221 1110 000 Q ss_pred ccccchhHHHHHHHHHHHhccCCCceeceecCeEEecch Q lcl|NC_020198. 231 TEELNQVNFEKVYDAMRNQKADGGRPLDIRPNLLVVPTT 269 (304) Q Consensus 231 ~~~l~~~~l~aar~aM~~~k~~~G~~L~i~P~~LvVpp~ 269 (304) +. ...|++ T Consensus 313 ~~-------------------------------~~ap~~ 320 (320) T protein:vir:10 313 TN-------------------------------VVTPDA 320 (320) T ss_pred Ee-------------------------------ccCCCC Confidence 00 011222 No 32 >protein:vir:1239 Length: 274 # NCBI annotation: similar to phage B1 major head protein # Family: family:all:522 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510938;genbank:gi:17426272;genbank:GeneID:927376 Probab=71.00 E-value=0.2 Score=24.40 Aligned_cols=186 Identities=12% Similarity=0.155 Sum_probs=100.1 Q ss_pred CCC--------ccHHHHHHHHHHHHHHHHHHHhhcchhhcceEEEecC----Ccc---ccccccccCCccchhcccceee Q lcl|NC_020198. 1 MAI--------ITPALISALKTSFQKHFQDALATAPSTYLQVATVIPS----TTA---SNTYGWLGQFPKLREWIGQRVI 65 (304) Q Consensus 1 mai--------i~~~~l~~l~~~~~~~f~~a~~~a~~~~~~~a~~v~S----~~~---~~~y~~Lg~~P~lrEw~Ge~~~ 65 (304) ||. |.|+++.+.. ...+.+.+ -+..++.+-.+ -.. ...|.-+|+.-.+.| -.+... T Consensus 1 ma~~~T~l~d~iiPev~~~~v---~~~~~~~l-----~~~~~~~~d~~l~g~~G~tv~iP~~~~ig~a~~~~~-g~~i~~ 71 (274) T protein:vir:12 1 MAQGLTKTSNQIIPEVLAPMM---QAQLEKKL-----RFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAE-GEKIPT 71 (274) T ss_pred CCcceeehhhhhchHHHHHHH---HHHHHhhh-----hhcccceecccccCCCCCEEEEeeecCCCccccccC-CCccch Confidence 775 5666666543 22222222 12223222111 011 111222333222222 145667 Q ss_pred cccccccceeeeecccceeecchhhhhcCCcchhHHHHHHHHHHHHhcHHHHHHHHHhccCCCcccCccccccccccccc Q lcl|NC_020198. 66 KDMAAQGYQITNKLFESTVGVKRTDIEDDNLGVYGPLMQEMGRAAGAHPDELVFALLKAGNANLCYDGQNFFDTDHPVYP 145 (304) Q Consensus 66 ~~l~~~~~~i~nk~fe~tv~v~R~~I~dDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~cyDGq~fF~tdH~v~~ 145 (304) .++....-+.+.+..++.++|+..+..----.......+++|.+-++.-|.-+...|+.+ ... T Consensus 72 ~~lt~~~~~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~q~~~~~a~~vd~~~l~~~~~a--------------~~~--- 134 (274) T protein:vir:12 72 DILETKKREAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGA--------------KLT--- 134 (274) T ss_pred hhcccceeeEEeeeecceeeecHHHHHhcccchHHHHHHHHHHHHHHHHHHHHHHHHhcc--------------ccc--- Confidence 777777777888888999999887766543345667778888887777777666655421 000 Q ss_pred ccccccchhhhhhhhcccCCCCccceeccCCccchhhhhhhccccchhhcccCcccccccccceEEEeeccccccccchh Q lcl|NC_020198. 146 NVDGTGTATTVSNLFAPAADPGAAWYLLDTSRSLKPLIYQERMKPSFTSMTKEDDEQVFMADEYRYGVRSRCNVGFGFWQ 225 (304) Q Consensus 146 ~~~~tg~~~s~snl~~~~~~~g~~w~L~d~~~~~kP~i~Q~r~~~~~~~~~~~~~~~vf~~~~~~~Gvd~R~n~G~g~wq 225 (304) T Consensus 135 -------------------------------------------------------------------------------- 134 (274) T protein:vir:12 135 -------------------------------------------------------------------------------- 134 (274) T ss_pred -------------------------------------------------------------------------------- Confidence Q ss_pred hhhccccccchhHHHHHHHHHHHhccCCCceeceecCeEEecchHHHHHHHHHhhhccC--CCCcc--------eeccee Q lcl|NC_020198. 226 LAAMSTEELNQVNFEKVYDAMRNQKADGGRPLDIRPNLLVVPTTLRSKAKEVVGVQRLA--NGADN--------PNFELV 295 (304) Q Consensus 226 ~a~~s~~~l~~~~l~aar~aM~~~k~~~G~~L~i~P~~LvVpp~le~~A~~ll~~~~~~--~g~~N--------~~~g~~ 295 (304) .+..+++.+.+-.|++.|.... ..+++|+|+|.....-++-..-+.+. .+..| -+.| + T Consensus 135 ---~~~~a~~~d~i~dA~~~lgd~~--------~~~~~ivv~p~~~~~L~k~~~~~fv~~s~~g~~~~~~G~ig~~~G-~ 202 (274) T protein:vir:12 135 ---VNADITKLNGLQSAIDKFNDED--------LEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALG-A 202 (274) T ss_pred ---ccccccCHHHHHHHHHHhcccc--------ccccEEEeCHHHHHHHHhhhhhhccccccccccceecccceeecC-e Confidence 0011234555556666653321 25678999999876655542222221 12223 2555 6 Q ss_pred eEEeccccC Q lcl|NC_020198. 296 QVLDTAWLN 304 (304) Q Consensus 296 ~~iv~p~Ld 304 (304) +||+++.+. T Consensus 203 ~Vi~s~~~p 211 (274) T protein:vir:12 203 IIVRSNKLE 211 (274) T ss_pred eEEEeCCCC Confidence 999999998 No 33 >protein:vir:93742 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240459;genbank:gi:66396126;genbank:GeneID:5133511 Probab=70.24 E-value=0.21 Score=24.28 Aligned_cols=187 Identities=12% Similarity=0.132 Sum_probs=103.5 Q ss_pred CCC--------ccHHHHHHHHHHHHHHHHHHHhhcchhhcceEEEecCC----ccccccccccCCccchhcc--cceeec Q lcl|NC_020198. 1 MAI--------ITPALISALKTSFQKHFQDALATAPSTYLQVATVIPST----TASNTYGWLGQFPKLREWI--GQRVIK 66 (304) Q Consensus 1 mai--------i~~~~l~~l~~~~~~~f~~a~~~a~~~~~~~a~~v~S~----~~~~~y~~Lg~~P~lrEw~--Ge~~~~ 66 (304) ||. |.|+++.+.. ...+.+.+ -+..++.+..+- ...-+.......+...++. .+.... T Consensus 1 ma~~~T~~~~~iiPev~~~~v---~~~~~~~~-----~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~~~~eg~~i~~~ 72 (274) T protein:vir:93 1 MPQGITKTSNQIIPEVLAPMM---QAQLEKKL-----RFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTD 72 (274) T ss_pred CCccceehhheechHHHHHHH---HHHHHhhh-----hhcccccccccccCCCCCEEEEEeeccCCCcccccCCCccccc Confidence 775 3344444432 22222221 123333322110 1111111112222222322 456677 Q ss_pred ccccccceeeeecccceeecchhhhhcCCcchhHHHHHHHHHHHHhcHHHHHHHHHhccCCCcccCcccccccccccccc Q lcl|NC_020198. 67 DMAAQGYQITNKLFESTVGVKRTDIEDDNLGVYGPLMQEMGRAAGAHPDELVFALLKAGNANLCYDGQNFFDTDHPVYPN 146 (304) Q Consensus 67 ~l~~~~~~i~nk~fe~tv~v~R~~I~dDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~cyDGq~fF~tdH~v~~~ 146 (304) ++....-+.+.+.++..++|+..+....--.......++++++.++..|.-++..|+.+. . T Consensus 73 ~it~~~~~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~~d~~~~~~~~~a~--------------~----- 133 (274) T protein:vir:93 73 ILETKKREAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAK--------------L----- 133 (274) T ss_pred ccccceeEEEeeeecccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhccc--------------c----- Confidence 777778888888999999999999888766778888899999999999988887764310 0 Q ss_pred cccccchhhhhhhhcccCCCCccceeccCCccchhhhhhhccccchhhcccCcccccccccceEEEeeccccccccchhh Q lcl|NC_020198. 147 VDGTGTATTVSNLFAPAADPGAAWYLLDTSRSLKPLIYQERMKPSFTSMTKEDDEQVFMADEYRYGVRSRCNVGFGFWQL 226 (304) Q Consensus 147 ~~~tg~~~s~snl~~~~~~~g~~w~L~d~~~~~kP~i~Q~r~~~~~~~~~~~~~~~vf~~~~~~~Gvd~R~n~G~g~wq~ 226 (304) . T Consensus 134 -~------------------------------------------------------------------------------ 134 (274) T protein:vir:93 134 -T------------------------------------------------------------------------------ 134 (274) T ss_pred -c------------------------------------------------------------------------------ Confidence 0 Q ss_pred hhccccccchhHHHHHHHHHHHhccCCCceeceecCeEEecchHHHHHHHHHhhhcc--CCCCcc--------eecceee Q lcl|NC_020198. 227 AAMSTEELNQVNFEKVYDAMRNQKADGGRPLDIRPNLLVVPTTLRSKAKEVVGVQRL--ANGADN--------PNFELVQ 296 (304) Q Consensus 227 a~~s~~~l~~~~l~aar~aM~~~k~~~G~~L~i~P~~LvVpp~le~~A~~ll~~~~~--~~g~~N--------~~~g~~~ 296 (304) .+..+++.+.+-.|++.+... +..++.|+|+|.....-++--.-+.+ .....+ -+.| +. T Consensus 135 --~~~~~~~~d~i~dA~~~l~d~--------~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G-~~ 203 (274) T protein:vir:93 135 --VNADITKLNGLQSAIDKFNDE--------DLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALG-AI 203 (274) T ss_pred --ccccccCHHHHHHHHHHhhhc--------cCCccEEEeCHHHHHHHHhhhhhcccccccccccceeecccceecC-ee Confidence 001122344455566655432 13578999999976655543111111 111112 2445 68 Q ss_pred EEeccccC Q lcl|NC_020198. 297 VLDTAWLN 304 (304) Q Consensus 297 ~iv~p~Ld 304 (304) |++++.+. T Consensus 204 Vi~s~~~p 211 (274) T protein:vir:93 204 IVRTNKLE 211 (274) T ss_pred EEEcCCCC Confidence 99999998 No 34 >protein:vir:78830 Length: 324 # NCBI annotation: major head protein # Family: family:all:507 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285361;genbank:gi:148717889;genbank:GeneID:5246961 Probab=66.26 E-value=0.27 Score=23.70 Aligned_cols=225 Identities=10% Similarity=0.040 Sum_probs=96.6 Q ss_pred CCCccHHHHHHH-HHHHHHHHHHHHhhcchhhcceEEEecCCccccccccccCCccchhcccce---eecccccccceee Q lcl|NC_020198. 1 MAIITPALISAL-KTSFQKHFQDALATAPSTYLQVATVIPSTTASNTYGWLGQFPKLREWIGQR---VIKDMAAQGYQIT 76 (304) Q Consensus 1 maii~~~~l~~l-~~~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~lrEw~Ge~---~~~~l~~~~~~i~ 76 (304) +.......-..+ -..+...+.+..... +...++++++|......+|..+..-|.. .|++|- ...++.=..-+++ T Consensus 27 ~~~~~~~~~~~~iP~~~~~~ii~~~~~~-s~l~~l~~~~~~~~~~~~~p~~~~~~~a-~~v~Eg~~~~~~~~~~~~v~~~ 104 (324) T protein:vir:78 27 DNVMMHEKKDGTLMNEFTTPILQEVMEN-SKIMQLGKYEPMEGTEKKFTFWADKPGA-YWVGEGQKIETSKATWVNATMR 104 (324) T ss_pred ccccccCcCccccchhHHHHHHHHHHhh-chhhhhcceeeccCCceEEEEEecCcce-eEecCCccccccccceeEEEEe Confidence 111111110000 011222222222222 2244567777765555566666555543 777553 3333444556778 Q ss_pred eecccceeecchhhhhcCCcchhHHHHHHHHHHHHhcHHHHHHHHHhccCCCcccCcccccccccccccccccccchhhh Q lcl|NC_020198. 77 NKLFESTVGVKRTDIEDDNLGVYGPLMQEMGRAAGAHPDELVFALLKAGNANLCYDGQNFFDTDHPVYPNVDGTGTATTV 156 (304) Q Consensus 77 nk~fe~tv~v~R~~I~dDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~cyDGq~fF~tdH~v~~~~~~tg~~~s~ 156 (304) .++++..+.|+|+.+.|.+..+..-+.+.++++.++..+..++. |.... ..+.-+.+.... T Consensus 105 ~~k~~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~----G~g~~-~~~~gi~~~~~~-------------- 165 (324) T protein:vir:78 105 AFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGIL----NQGNN-PFGKSIAQSIEK-------------- 165 (324) T ss_pred eEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhc----cCCCC-CcCccccccccc-------------- Confidence 89999999999999999999999999999999999998886652 21110 111111111110 Q ss_pred hhhhcccCCCCccceeccCCccchhhhhhhccccchhhcccCcccccccccceEEEeeccccccccchhhhhccccccch Q lcl|NC_020198. 157 SNLFAPAADPGAAWYLLDTSRSLKPLIYQERMKPSFTSMTKEDDEQVFMADEYRYGVRSRCNVGFGFWQLAAMSTEELNQ 236 (304) Q Consensus 157 snl~~~~~~~g~~w~L~d~~~~~kP~i~Q~r~~~~~~~~~~~~~~~vf~~~~~~~Gvd~R~n~G~g~wq~a~~s~~~l~~ 236 (304) .+..... ...|-- |.=++.+.. +. .+.+++ | -++. T Consensus 166 ~~~~~~~---~~t~~~------i~~~~~~l~----------~~---------------~~~~~~---~--------vmn~ 200 (324) T protein:vir:78 166 TNKVIKG---DFTQDN------IIDLEALLE----------DD---------------ELEANA---F--------ISKT 200 (324) T ss_pred cceeccc---cccHHH------HHHHHHhhh----------hc---------------cCCCCE---E--------EEcH Confidence 0000000 000100 000111000 00 000000 1 1122 Q ss_pred hHHHHHHHHHHHhccCCCceece--ecC-eEEecchHHHHHHHHHhhhccCCCCcceecc-------------eeeEEec Q lcl|NC_020198. 237 VNFEKVYDAMRNQKADGGRPLDI--RPN-LLVVPTTLRSKAKEVVGVQRLANGADNPNFE-------------LVQVLDT 300 (304) Q Consensus 237 ~~l~aar~aM~~~k~~~G~~L~i--~P~-~LvVpp~le~~A~~ll~~~~~~~g~~N~~~g-------------~~~~iv~ 300 (304) +.+.++++.|+..|+++-. .|. ++=+|. +.....+.+..-.+.| .+++.-+ T Consensus 201 ----~~~~~L~~l~d~~G~~~~~~~~~~~l~G~PV---------~~~~~~~~~~~~~~~gd~~~~~~g~~~~~~i~~~~~ 267 (324) T protein:vir:78 201 ----QNRSLLRKIVDPETKERIYDRNSDSLDGLPV---------VNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDET 267 (324) T ss_pred ----HHHHHHHHhhccCCCeeecCCCCCcccceee---------EeeCCCCCCcceEEEEecceEEEEEecCcEEEEeec Confidence 2345678889999988732 222 221221 1111111111111111 1122222 Q ss_pred cccC Q lcl|NC_020198. 301 AWLN 304 (304) Q Consensus 301 p~Ld 304 (304) +.+. T Consensus 268 ~~~~ 271 (324) T protein:vir:78 268 AQLS 271 (324) T ss_pred cccc Confidence 2221 No 35 >protein:vir:96392 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239648;genbank:gi:66395381;genbank:GeneID:5132868 Probab=66.26 E-value=0.27 Score=23.70 Aligned_cols=225 Identities=10% Similarity=0.040 Sum_probs=96.6 Q ss_pred CCCccHHHHHHH-HHHHHHHHHHHHhhcchhhcceEEEecCCccccccccccCCccchhcccce---eecccccccceee Q lcl|NC_020198. 1 MAIITPALISAL-KTSFQKHFQDALATAPSTYLQVATVIPSTTASNTYGWLGQFPKLREWIGQR---VIKDMAAQGYQIT 76 (304) Q Consensus 1 maii~~~~l~~l-~~~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~lrEw~Ge~---~~~~l~~~~~~i~ 76 (304) +.......-..+ -..+...+.+..... +...++++++|......+|..+..-|.. .|++|- ...++.=..-+++ T Consensus 27 ~~~~~~~~~~~~iP~~~~~~ii~~~~~~-s~l~~l~~~~~~~~~~~~~p~~~~~~~a-~~v~Eg~~~~~~~~~~~~v~~~ 104 (324) T protein:vir:96 27 DNVMMHEKKDGTLMNEFTTPILQEVMEN-SKIMQLGKYEPMEGTEKKFTFWADKPGA-YWVGEGQKIETSKATWVNATMR 104 (324) T ss_pred ccccccCcCccccchhHHHHHHHHHHhh-chhhhhcceeeccCCceEEEEEecCcce-eEecCCccccccccceeEEEEe Confidence 111111110000 011222222222222 2244567777765555566666555543 777553 3333444556778 Q ss_pred eecccceeecchhhhhcCCcchhHHHHHHHHHHHHhcHHHHHHHHHhccCCCcccCcccccccccccccccccccchhhh Q lcl|NC_020198. 77 NKLFESTVGVKRTDIEDDNLGVYGPLMQEMGRAAGAHPDELVFALLKAGNANLCYDGQNFFDTDHPVYPNVDGTGTATTV 156 (304) Q Consensus 77 nk~fe~tv~v~R~~I~dDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~cyDGq~fF~tdH~v~~~~~~tg~~~s~ 156 (304) .++++..+.|+|+.+.|.+..+..-+.+.++++.++..+..++. |.... ..+.-+.+.... T Consensus 105 ~~k~~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~----G~g~~-~~~~gi~~~~~~-------------- 165 (324) T protein:vir:96 105 AFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGIL----NQGNN-PFGKSIAQSIEK-------------- 165 (324) T ss_pred eEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhc----cCCCC-CcCccccccccc-------------- Confidence 89999999999999999999999999999999999998886652 21110 111111111110 Q ss_pred hhhhcccCCCCccceeccCCccchhhhhhhccccchhhcccCcccccccccceEEEeeccccccccchhhhhccccccch Q lcl|NC_020198. 157 SNLFAPAADPGAAWYLLDTSRSLKPLIYQERMKPSFTSMTKEDDEQVFMADEYRYGVRSRCNVGFGFWQLAAMSTEELNQ 236 (304) Q Consensus 157 snl~~~~~~~g~~w~L~d~~~~~kP~i~Q~r~~~~~~~~~~~~~~~vf~~~~~~~Gvd~R~n~G~g~wq~a~~s~~~l~~ 236 (304) .+..... ...|-- |.=++.+.. +. .+.+++ | -++. T Consensus 166 ~~~~~~~---~~t~~~------i~~~~~~l~----------~~---------------~~~~~~---~--------vmn~ 200 (324) T protein:vir:96 166 TNKVIKG---DFTQDN------IIDLEALLE----------DD---------------ELEANA---F--------ISKT 200 (324) T ss_pred cceeccc---cccHHH------HHHHHHhhh----------hc---------------cCCCCE---E--------EEcH Confidence 0000000 000100 000111000 00 000000 1 1122 Q ss_pred hHHHHHHHHHHHhccCCCceece--ecC-eEEecchHHHHHHHHHhhhccCCCCcceecc-------------eeeEEec Q lcl|NC_020198. 237 VNFEKVYDAMRNQKADGGRPLDI--RPN-LLVVPTTLRSKAKEVVGVQRLANGADNPNFE-------------LVQVLDT 300 (304) Q Consensus 237 ~~l~aar~aM~~~k~~~G~~L~i--~P~-~LvVpp~le~~A~~ll~~~~~~~g~~N~~~g-------------~~~~iv~ 300 (304) +.+.++++.|+..|+++-. .|. ++=+|. +.....+.+..-.+.| .+++.-+ T Consensus 201 ----~~~~~L~~l~d~~G~~~~~~~~~~~l~G~PV---------~~~~~~~~~~~~~~~gd~~~~~~g~~~~~~i~~~~~ 267 (324) T protein:vir:96 201 ----QNRSLLRKIVDPETKERIYDRNSDSLDGLPV---------VNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDET 267 (324) T ss_pred ----HHHHHHHHhhccCCCeeecCCCCCcccceee---------EeeCCCCCCcceEEEEecceEEEEEecCcEEEEeec Confidence 2345678889999988732 222 221221 1111111111111111 1122222 Q ss_pred cccC Q lcl|NC_020198. 301 AWLN 304 (304) Q Consensus 301 p~Ld 304 (304) +.+. T Consensus 268 ~~~~ 271 (324) T protein:vir:96 268 AQLS 271 (324) T ss_pred cccc Confidence 2221 No 36 >protein:vir:107687 Length: 319 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003898;genbank:gi:45686314;genbank:GeneID:2773027 Probab=65.75 E-value=0.28 Score=23.63 Aligned_cols=203 Identities=13% Similarity=0.145 Sum_probs=97.2 Q ss_pred CCCccHHHHHHHHHHHHHHHHHHHhhc--chhhcceE---EEecCCccccccccccCCccchhcccce----eecccccc Q lcl|NC_020198. 1 MAIITPALISALKTSFQKHFQDALATA--PSTYLQVA---TVIPSTTASNTYGWLGQFPKLREWIGQR----VIKDMAAQ 71 (304) Q Consensus 1 maii~~~~l~~l~~~~~~~f~~a~~~a--~~~~~~~a---~~v~S~~~~~~y~~Lg~~P~lrEw~Ge~----~~~~l~~~ 71 (304) +++++.+.|+.|-.. .|+.. +-+|+++- +.++-...+-+|...... +.-+|+|+. ..-+..-. T Consensus 31 ~g~~~~~ql~~id~~-------v~e~~~~~l~~~~~i~v~~~~~~~~~~~~~~~~~~~-G~a~~~~d~~~dip~v~~~~~ 102 (319) T protein:vir:10 31 MGIWTAQELHRIKSQ-------SYEEDYPVGSALRVFPVTTELSPTDKTFEYMTFDKV-GTAQIIADYTDDLPLVDALGT 102 (319) T ss_pred hhhHHHHHHHHHHHH-------HHhhhhcceechhhcccccCCCCceEEEEeeeeccc-cceeeecCccccccceeccce Confidence 333444444443322 23221 22444443 222222223334433332 334577654 22233445 Q ss_pred cceeeeecccceeecchhhhhcCC---cchhHHHHHHHHHHHHhcHHHHHHHHHhccCCCcccCcccccccccccccccc Q lcl|NC_020198. 72 GYQITNKLFESTVGVKRTDIEDDN---LGVYGPLMQEMGRAAGAHPDELVFALLKAGNANLCYDGQNFFDTDHPVYPNVD 148 (304) Q Consensus 72 ~~~i~nk~fe~tv~v~R~~I~dDd---lG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~cyDGq~fF~tdH~v~~~~~ 148 (304) .++.+...|+..+.+++++++.-. +-+=......+.++.++++|+++| .| ++.|.+++ T Consensus 103 ~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f----~G------------~~~~g~~G--- 163 (319) T protein:vir:10 103 SEFGKVFRLGNAYLISIDEIKAGQATGRPLSTRKASACQLAHDQLVNRLVF----KG------------SAPHKIVS--- 163 (319) T ss_pred eeEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEE----ee------------ccccccee--- Confidence 677888999999999999998763 333455566667777777777766 22 22232111 Q ss_pred cccchhhhhhhhcccCCCCccceeccCCccchhhhhhhccccchhhcccCcccccccccceEEEeeccccccccchhhhh Q lcl|NC_020198. 149 GTGTATTVSNLFAPAADPGAAWYLLDTSRSLKPLIYQERMKPSFTSMTKEDDEQVFMADEYRYGVRSRCNVGFGFWQLAA 228 (304) Q Consensus 149 ~tg~~~s~snl~~~~~~~g~~w~L~d~~~~~kP~i~Q~r~~~~~~~~~~~~~~~vf~~~~~~~Gvd~R~n~G~g~wq~a~ 228 (304) |+..+. +- ..+++.+.+. + T Consensus 164 -----------------------LlN~p~-------------------------~~-----------~~~~~~~~~~-~- 182 (319) T protein:vir:10 164 -----------------------VFNHPN-------------------------IT-----------KITSGKWIDV-S- 182 (319) T ss_pred -----------------------EEeCCC-------------------------ce-----------eeecCCCCCc-c- Confidence 111100 00 0001111000 0 Q ss_pred cccc-ccchhHHHHHHHHHHHhccCCCceeceecCeEEecchHHHHHHHHHhhhccCCCC--------cceecceeeEEe Q lcl|NC_020198. 229 MSTE-ELNQVNFEKVYDAMRNQKADGGRPLDIRPNLLVVPTTLRSKAKEVVGVQRLANGA--------DNPNFELVQVLD 299 (304) Q Consensus 229 ~s~~-~l~~~~l~aar~aM~~~k~~~G~~L~i~P~~LvVpp~le~~A~~ll~~~~~~~g~--------~N~~~g~~~~iv 299 (304) ++. .--.+.+.+++.++..+.. | .+.|+.|++||++.....+ ...+.|. .|| .+++.- T Consensus 183 -t~t~~~i~~di~~~~~~l~~~s~--g---~~~p~~L~L~p~~~~~L~~----~~~~~~~t~l~~lk~~~~---~l~I~~ 249 (319) T protein:vir:10 183 -TMKPETAEAELTQAIETIETITR--G---QHRATNILIPPSMRKVLAI----RMPETTMSYLDYFKSQNS---GIEIDS 249 (319) T ss_pred -ccCHHHHHHHHHHHHHHHHHhcC--c---eeeceEEEecHHHHHhhhc----ccCCCCeeHHHHHHHhcC---CceEEE Confidence 000 0112335555555555433 3 3589999999998754432 2222221 133 367777 Q ss_pred ccccC Q lcl|NC_020198. 300 TAWLN 304 (304) Q Consensus 300 ~p~Ld 304 (304) .|+|+ T Consensus 250 ~pel~ 254 (319) T protein:vir:10 250 IAELE 254 (319) T ss_pred eeeec Confidence 88887 No 37 >protein:vir:104256 Length: 458 # NCBI annotation: major head protein precursor # Family: family:all:27070 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006977;genbank:gi:46401878;genbank:GeneID:2777673 Probab=64.95 E-value=0.29 Score=23.52 Aligned_cols=238 Identities=11% Similarity=-0.010 Sum_probs=97.3 Q ss_pred CCCccHHHHHHHHHHHHHHHHHHHhhcchhhcceEEEecCCccccccccccCCccchhcccceeeccc---------ccc Q lcl|NC_020198. 1 MAIITPALISALKTSFQKHFQDALATAPSTYLQVATVIPSTTASNTYGWLGQFPKLREWIGQRVIKDM---------AAQ 71 (304) Q Consensus 1 maii~~~~l~~l~~~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~lrEw~Ge~~~~~l---------~~~ 71 (304) .+..++..-..+-..+...+.......-+ ..++|.++|-......|.....-|.. .|++|-....- .=. T Consensus 163 ~~~~~~~g~~~ip~~~~~~ii~~~~~~~~-l~~~~~~~~~~~~~~~~~~~~~~~~a-~~v~e~~~~~~~~~~~~~~~~~~ 240 (458) T protein:vir:10 163 QSSSVEVSSESYETIFSQRIIRDLQKELV-VGALFEELPMSSKILTMLVEPDAGKA-TWVAASTYGTDTTTGEEVKGALK 240 (458) T ss_pred hcccCccccceehhhHhHHHHHHHHhhhh-HHhhcceeecCCcceEEEEecCCcce-eecccccccccccccccccccce Confidence 00000000000001122222222222222 34457777755555555545444443 67655433221 112 Q ss_pred cceeeeecccceeecchhhhhcCCcchhHHHHHHHHHHHHhcHHHHHHHHHhccCCCcccCccccccccccccccccccc Q lcl|NC_020198. 72 GYQITNKLFESTVGVKRTDIEDDNLGVYGPLMQEMGRAAGAHPDELVFALLKAGNANLCYDGQNFFDTDHPVYPNVDGTG 151 (304) Q Consensus 72 ~~~i~nk~fe~tv~v~R~~I~dDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~cyDGq~fF~tdH~v~~~~~~tg 151 (304) .-++..++++..|.|+++.+.|-+.++.+-+...|+++.++..+..++ . |... |+| .|--...++ T Consensus 241 ~i~~~~~k~~~~v~is~ell~ds~~~~~~~i~~~l~~~i~~~~d~~~l---~-G~G~----~~p-------~Gi~~~~~~ 305 (458) T protein:vir:10 241 EIHFSTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFM---T-GDGS----GKP-------KGLLTLASE 305 (458) T ss_pred eeEeeeeeEEeeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhh---c-CCCC----Ccc-------ceeeecccc Confidence 235777899999999999998878899999999999999999888553 2 2111 232 111000110 Q ss_pred chhhh-hhhhcccCCCCccceeccCCccchhhhhhhccccchhhcccCcccccccccceEEEeeccccccccchhhhhcc Q lcl|NC_020198. 152 TATTV-SNLFAPAADPGAAWYLLDTSRSLKPLIYQERMKPSFTSMTKEDDEQVFMADEYRYGVRSRCNVGFGFWQLAAMS 230 (304) Q Consensus 152 ~~~s~-snl~~~~~~~g~~w~L~d~~~~~kP~i~Q~r~~~~~~~~~~~~~~~vf~~~~~~~Gvd~R~n~G~g~wq~a~~s 230 (304) ....+ ....+... + .+ ..+.|+- +...-. ...+.+++ | T Consensus 306 ~~~~~~~~~~~~~~-~-----~~----~~~~i~~-------~~~~l~---------------~~~~~~~~---~------ 344 (458) T protein:vir:10 306 DSAKVVTEAKADGS-V-----LV----TAKTISK-------LRRKLG---------------RHGLKLSK---L------ 344 (458) T ss_pred cccceeeccccccc-c-----cc----cHHHHHH-------HHHhhh---------------hhhcCCCE---E------ Confidence 00000 00000000 0 00 0111110 000000 00011111 1 Q ss_pred ccccchhHHHHHHHHHHHhccCCCceecee---cCeE-EecchHHHHHHHHHhhhccCCCC--ccee------------c Q lcl|NC_020198. 231 TEELNQVNFEKVYDAMRNQKADGGRPLDIR---PNLL-VVPTTLRSKAKEVVGVQRLANGA--DNPN------------F 292 (304) Q Consensus 231 ~~~l~~~~l~aar~aM~~~k~~~G~~L~i~---P~~L-vVpp~le~~A~~ll~~~~~~~g~--~N~~------------~ 292 (304) -++... +.+++..|+.+|+||... +... -.|..|- ...++-++..+.++ ..++ + T Consensus 345 --v~~~~~----~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~l~--G~pv~~~~~~p~~~~~~~~~~~~f~~~~~~~~~ 416 (458) T protein:vir:10 345 --VLIVSM----DAYYDLLEDEEWQDVAQVGNDSVKLQGQVGRIY--GLPVVVSEYFPAKANSAEFAVIVYKDNFVMPRQ 416 (458) T ss_pred --EEcHHH----HHHHHhhcccCCceeeccccccccccCcCceec--ceeeEEccccccccCCcceEEEEecccEEEEEe Confidence 112222 356778888899887421 1110 0111111 11222222222211 1111 1 Q ss_pred ceeeEEeccccC Q lcl|NC_020198. 293 ELVQVLDTAWLN 304 (304) Q Consensus 293 g~~~~iv~p~Ld 304 (304) .-+++..+|+-+ T Consensus 417 ~~~~v~~d~~~~ 428 (458) T protein:vir:10 417 RAVTVERERQAG 428 (458) T ss_pred eceEEEeecccC Confidence 235666666655 No 38 >protein:vir:96762 Length: 632 # NCBI annotation: putative phage-related protein # Family: family:all:21 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039818;genbank:gi:126010917;genbank:GeneID:5076272 Probab=64.89 E-value=0.29 Score=23.51 Aligned_cols=232 Identities=16% Similarity=0.141 Sum_probs=94.1 Q ss_pred CCCc------------cHHHH--HHHHHH-------------HHHHHHHHHhhcchhhcce-EEEecCCccccccccccC Q lcl|NC_020198. 1 MAII------------TPALI--SALKTS-------------FQKHFQDALATAPSTYLQV-ATVIPSTTASNTYGWLGQ 52 (304) Q Consensus 1 maii------------~~~~l--~~l~~~-------------~~~~f~~a~~~a~~~~~~~-a~~v~S~~~~~~y~~Lg~ 52 (304) +++- ....+ +++.++ +...|-+.+.. .+...++ ++.+|.....-++..+.. T Consensus 332 ~~~a~~~G~~arg~~~~~~~l~~ra~~~~t~~~gg~lvp~~~~~~~iie~lr~-~s~i~~l~~~~~~~~~g~~~ip~~~~ 410 (632) T protein:vir:96 332 LAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRN-KAIIGQMGARMLPGLVGDVDIPKKTS 410 (632) T ss_pred HHHHHhhhhhhhhhhhhHHHHHHhhhhcccccccccccccccchHHHHHHHhh-cchhhhhcceEeecCCcceEEEEEeC Confidence 0000 00000 010000 01112222211 1222333 455554433333333333 Q ss_pred Cccchhccc---ceeecccccccceeeeecccceeecchhhhhcCCcchhHHHHHHHHHHHHhcHHHHHHHHHhccCCC- Q lcl|NC_020198. 53 FPKLREWIG---QRVIKDMAAQGYQITNKLFESTVGVKRTDIEDDNLGVYGPLMQEMGRAAGAHPDELVFALLKAGNAN- 128 (304) Q Consensus 53 ~P~lrEw~G---e~~~~~l~~~~~~i~nk~fe~tv~v~R~~I~dDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~- 128 (304) -|.. -|+| +....++.-...++.-++++..|.|||+.++|++.++..-+...|+.+.+...|..++. |..+ T Consensus 411 ~~~a-~wv~E~~~~~~s~~~f~~i~l~~~k~~~~v~iS~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~----G~G~~ 485 (632) T protein:vir:96 411 GANF-YWIGEDEDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLT----GTGLA 485 (632) T ss_pred Ccee-EeecCCccccccccceeeEEeeeeEEEEehhhHHHHHhccchHHHHHHHHHHHHHHHHHHHHHhhc----ccCCC Confidence 3332 3554 44555555566788899999999999999999999999999999999999999987652 2211 Q ss_pred cccCcccccccccccccccccccchhhhhhhh-------cccCC-CCccceeccC-Cc-cc----------hhhhhhhc- Q lcl|NC_020198. 129 LCYDGQNFFDTDHPVYPNVDGTGTATTVSNLF-------APAAD-PGAAWYLLDT-SR-SL----------KPLIYQER- 187 (304) Q Consensus 129 ~cyDGq~fF~tdH~v~~~~~~tg~~~s~snl~-------~~~~~-~g~~w~L~d~-~~-~~----------kP~i~Q~r- 187 (304) ..--|- +.+++++. +..++..++...+. ..... ....| ++.. .+ .+ .| |||.- T Consensus 486 ~~p~Gi-~~~~~~~~---~~~~~~~~~~~~i~~~~~~i~~~~~~~~~~~~-~~~~~~~~~l~~~~l~d~~G~~-i~~~~~ 559 (632) T protein:vir:96 486 NDPVGL-LNMTGVPA---LTYPAGGVDWASVVDMETKISTFNADAGRLAY-LTSVTQRGAAKKAQVFDNTGER-IWQNNE 559 (632) T ss_pred Ccccee-eecccccc---eecccccCCHHHHHHHHHHHhhcccccCccEE-EEchhHHHHHHHHhccCCCCce-eecCCe Confidence 111221 44555532 22222222222221 11111 11223 2210 00 00 11 22211 Q ss_pred --cccchhhcccCcccccccc-cceEEEeeccc-------------cccccchhhhhccccccchhHHHHHHHHH Q lcl|NC_020198. 188 --MKPSFTSMTKEDDEQVFMA-DEYRYGVRSRC-------------NVGFGFWQLAAMSTEELNQVNFEKVYDAM 246 (304) Q Consensus 188 --~~~~~~~~~~~~~~~vf~~-~~~~~Gvd~R~-------------n~G~g~wq~a~~s~~~l~~~~l~aar~aM 246 (304) --|..++-.-|.+.-+|-. .+|.+|..+.. .+.+-.||-. ..+....+.|-.++.+= T Consensus 560 l~G~pv~~s~~ip~~~~~~gd~s~~~i~~~~~~~i~~~~~~~~~~~~v~~~~~~~~--d~~v~~~~af~~~k~~A 632 (632) T protein:vir:96 560 VNGYRAEASNQIPADTWIFGDWSQIVIAMWGVLDLKVDPYTKAASDGLVLRVFQDV--DAGVRRKEAFCIAKKGA 632 (632) T ss_pred ecccceEeccccccCcEEEeecceEEEEEecceEEEEccccccccCceEEEEEeec--CceeechhhhhheeecC Confidence 1122222222333333322 23344432211 1112122211 11222333332222111 No 39 >protein:vir:2430 Length: 318 # NCBI annotation: major head subunit # Family: family:all:507 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046832;genbank:gi:9630400;genbank:GeneID:1261582 Probab=62.65 E-value=0.33 Score=23.21 Aligned_cols=227 Identities=16% Similarity=0.155 Sum_probs=96.9 Q ss_pred CCCccHHHHHH-HHHHHHHHHHHHHhhcchhhcceEEEecCCccccccccccCCccchhccccee---ecccccccceee Q lcl|NC_020198. 1 MAIITPALISA-LKTSFQKHFQDALATAPSTYLQVATVIPSTTASNTYGWLGQFPKLREWIGQRV---IKDMAAQGYQIT 76 (304) Q Consensus 1 maii~~~~l~~-l~~~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~lrEw~Ge~~---~~~l~~~~~~i~ 76 (304) |+..+...-.. |-..+...+.+.+... +...++|+++|-....-.|..+..-|.. +|+||-. -.+++=..-+++ T Consensus 14 ~~~~~~~~~~~~ip~~~~~~ii~~~~~~-~~l~~~~~~~~~~~~~~~ip~~~~~~~a-~~v~Eg~~~~~~~~~f~~i~~~ 91 (318) T protein:vir:24 14 IAQTGDTMFKGYLEPEQAKDYFAEAEKT-SIVQQFAQKVPMGTTGQKIPHWVGDVSA-QWIGEGDMKPITKGNMTSQTIA 91 (318) T ss_pred hhcccCcccceeechhHHHHHHHHHHhh-chhhhhcceeeccCCceEEEEEeCCcce-EEecCCccccccccceeEEEEe Confidence 22222111111 0011222222222222 2344567777755555566666666664 7876543 233333445667 Q ss_pred eecccceeecchhhhhcCCcchhHHHHHHHHHHHHhcHHHHHHHHHhccCCCcccCcccccccccccccccccccchhhh Q lcl|NC_020198. 77 NKLFESTVGVKRTDIEDDNLGVYGPLMQEMGRAAGAHPDELVFALLKAGNANLCYDGQNFFDTDHPVYPNVDGTGTATTV 156 (304) Q Consensus 77 nk~fe~tv~v~R~~I~dDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~cyDGq~fF~tdH~v~~~~~~tg~~~s~ 156 (304) -++++..+.|||+.+.|....+..-+.+.++++.++..++.++ +|.+... +..+.+... .++. T Consensus 92 ~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~~~~~~~d~a~l----~G~g~~~--~~~~~~~~~-----------~~~~ 154 (318) T protein:vir:24 92 PHKIATIFVASAETVRANPANYLGTMRTKVATAFAMAFDGAAM----HGTDSPF--PTYIGQTTK-----------AISI 154 (318) T ss_pred eEEEEEeehhhHHHhhcChHHHHHHHHHHHHHHHHHHHHHhhh----cccCCCC--Ccccccccc-----------cccc Confidence 7999999999999999988899999999999999999988764 2322111 111111111 0111 Q ss_pred hhhhcccCCCCccceeccCCccchhhhhhhccccchhhcccCcccccccccceEEEeeccccccccchhhhhccccccch Q lcl|NC_020198. 157 SNLFAPAADPGAAWYLLDTSRSLKPLIYQERMKPSFTSMTKEDDEQVFMADEYRYGVRSRCNVGFGFWQLAAMSTEELNQ 236 (304) Q Consensus 157 snl~~~~~~~g~~w~L~d~~~~~kP~i~Q~r~~~~~~~~~~~~~~~vf~~~~~~~Gvd~R~n~G~g~wq~a~~s~~~l~~ 236 (304) .... ....++-.+ .++- +.. ..+ .. +.+. .| -++. T Consensus 155 ~~~~-----~~~~~~~~~---~~~~-~~~----------~~~----~~-----------~~~~---~~--------v~n~ 189 (318) T protein:vir:24 155 ADTT-----GATTVYDQV---AVNG-LSL----------LVN----DG-----------KKWT---HT--------LLDD 189 (318) T ss_pred cccc-----cccchHHHH---HHHH-HHh----------hcc----cc-----------CCCC---EE--------EEcH Confidence 1110 011111000 0000 000 000 00 0000 00 1122 Q ss_pred hHHHHHHHHHHHhccCCCceeceec------------CeEEecchHHHHHHHHHhhhccCCCCcceec-----------c Q lcl|NC_020198. 237 VNFEKVYDAMRNQKADGGRPLDIRP------------NLLVVPTTLRSKAKEVVGVQRLANGADNPNF-----------E 293 (304) Q Consensus 237 ~~l~aar~aM~~~k~~~G~~L~i~P------------~~LvVpp~le~~A~~ll~~~~~~~g~~N~~~-----------g 293 (304) . .+.++++.|+.+|++|-... ..+.+|.-. ++..+.|..=.+. + T Consensus 190 ~----~~~~L~~lkd~~G~~l~~~~~~~~~~~~~~~~~i~g~pv~~---------~~~~~~~~~~~~~gdfs~~~~~~~~ 256 (318) T protein:vir:24 190 I----TEPILNGAKDQNGRPLFIESTYGEAASPFRSGRIVARPTIL---------SDHVVEGTTVGFMGDFSQLIWGQIG 256 (318) T ss_pred H----HHHHHHHhhccCCceeecCccccCccccccCceEEEEeeEE---------eCCCCCCccEEEEeecceEEEEEec Confidence 2 23566778888888863221 112222211 1112222211111 1 Q ss_pred --eeeEEeccccC Q lcl|NC_020198. 294 --LVQVLDTAWLN 304 (304) Q Consensus 294 --~~~~iv~p~Ld 304 (304) .+++.-+..|. T Consensus 257 ~l~i~~~~~~~~~ 269 (318) T protein:vir:24 257 GLSFDVTDQATLN 269 (318) T ss_pred CeEEEEeecccee Confidence 11111222222 No 40 >protein:vir:97433 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240749;genbank:gi:66396420;genbank:GeneID:5133789 Probab=62.51 E-value=0.33 Score=23.20 Aligned_cols=187 Identities=13% Similarity=0.139 Sum_probs=102.0 Q ss_pred CCC--------ccHHHHHHHHHHHHHHHHHHHhhcchhhcceEEEecC----CccccccccccCCccchhcc--cceeec Q lcl|NC_020198. 1 MAI--------ITPALISALKTSFQKHFQDALATAPSTYLQVATVIPS----TTASNTYGWLGQFPKLREWI--GQRVIK 66 (304) Q Consensus 1 mai--------i~~~~l~~l~~~~~~~f~~a~~~a~~~~~~~a~~v~S----~~~~~~y~~Lg~~P~lrEw~--Ge~~~~ 66 (304) ||. |.|+++.+.. ...+.+.+ -+..++..-.. -...-+....+......++. .+.... T Consensus 1 ma~~~T~~~d~iiPev~~~~v---~~~~~~~l-----~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~a~~~~~g~~i~~~ 72 (274) T protein:vir:97 1 MPQGLTKTSDQIIPEVLAPMM---QAQLEKKL-----RFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTD 72 (274) T ss_pred CCccceehhheechHHHHHHH---HHhhhhhh-----hhcccceecccccCCCCCEEEEeeecCCCccccccCCCccccc Confidence 775 6666666653 22222222 13334433211 01111111111122222332 456677 Q ss_pred ccccccceeeeecccceeecchhhhhcCCcchhHHHHHHHHHHHHhcHHHHHHHHHhccCCCcccCcccccccccccccc Q lcl|NC_020198. 67 DMAAQGYQITNKLFESTVGVKRTDIEDDNLGVYGPLMQEMGRAAGAHPDELVFALLKAGNANLCYDGQNFFDTDHPVYPN 146 (304) Q Consensus 67 ~l~~~~~~i~nk~fe~tv~v~R~~I~dDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~cyDGq~fF~tdH~v~~~ 146 (304) .+....-+.+.+.++..++|+-.+...---.......+++|++-++..|.-+++.|+.+ +. T Consensus 73 ~lt~~~~~~~i~~~~~~~~i~D~~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a--~~----------------- 133 (274) T protein:vir:97 73 ILETKKREAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGA--KL----------------- 133 (274) T ss_pred ccccceeEEEeeeecceecccHHHHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhcc--Cc----------------- Confidence 77777777888888999999988877754446777888888888888888888776431 00 Q ss_pred cccccchhhhhhhhcccCCCCccceeccCCccchhhhhhhccccchhhcccCcccccccccceEEEeeccccccccchhh Q lcl|NC_020198. 147 VDGTGTATTVSNLFAPAADPGAAWYLLDTSRSLKPLIYQERMKPSFTSMTKEDDEQVFMADEYRYGVRSRCNVGFGFWQL 226 (304) Q Consensus 147 ~~~tg~~~s~snl~~~~~~~g~~w~L~d~~~~~kP~i~Q~r~~~~~~~~~~~~~~~vf~~~~~~~Gvd~R~n~G~g~wq~ 226 (304) . + T Consensus 134 -~-------~---------------------------------------------------------------------- 135 (274) T protein:vir:97 134 -T-------V---------------------------------------------------------------------- 135 (274) T ss_pred -c-------c---------------------------------------------------------------------- Confidence 0 0 Q ss_pred hhccccccchhHHHHHHHHHHHhccCCCceeceecCeEEecchHHHHHHHHHhhhccC--CCCcc--------eecceee Q lcl|NC_020198. 227 AAMSTEELNQVNFEKVYDAMRNQKADGGRPLDIRPNLLVVPTTLRSKAKEVVGVQRLA--NGADN--------PNFELVQ 296 (304) Q Consensus 227 a~~s~~~l~~~~l~aar~aM~~~k~~~G~~L~i~P~~LvVpp~le~~A~~ll~~~~~~--~g~~N--------~~~g~~~ 296 (304) +..+++.+.+-.|++.+... + ..++.|+|+|.....-++-..-+.+. ....+ -+.| ++ T Consensus 136 ---~~~~~~~d~i~dA~~~l~d~----~----~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G-~~ 203 (274) T protein:vir:97 136 ---NADITKLNGLQSAIDKFNDE----D----LEPMVLFVNPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALG-AI 203 (274) T ss_pred ---cccccCHHHHHHHHHHhhcc----C----CCceEEEeCHHHHHHHHhhhhhhccccCcccccceeccccceecC-ee Confidence 00122334455555555432 1 24678999999776655432112211 11112 2445 59 Q ss_pred EEeccccC Q lcl|NC_020198. 297 VLDTAWLN 304 (304) Q Consensus 297 ~iv~p~Ld 304 (304) |++++.+. T Consensus 204 Vi~s~~~p 211 (274) T protein:vir:97 204 IVRTNKLE 211 (274) T ss_pred EEEcCCCC Confidence 99999888 No 41 >protein:vir:94494 Length: 274 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240676;genbank:gi:66396348;genbank:GeneID:5133758 Probab=62.51 E-value=0.33 Score=23.20 Aligned_cols=187 Identities=13% Similarity=0.139 Sum_probs=102.0 Q ss_pred CCC--------ccHHHHHHHHHHHHHHHHHHHhhcchhhcceEEEecC----CccccccccccCCccchhcc--cceeec Q lcl|NC_020198. 1 MAI--------ITPALISALKTSFQKHFQDALATAPSTYLQVATVIPS----TTASNTYGWLGQFPKLREWI--GQRVIK 66 (304) Q Consensus 1 mai--------i~~~~l~~l~~~~~~~f~~a~~~a~~~~~~~a~~v~S----~~~~~~y~~Lg~~P~lrEw~--Ge~~~~ 66 (304) ||. |.|+++.+.. ...+.+.+ -+..++..-.. -...-+....+......++. .+.... T Consensus 1 ma~~~T~~~d~iiPev~~~~v---~~~~~~~l-----~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~a~~~~~g~~i~~~ 72 (274) T protein:vir:94 1 MPQGLTKTSDQIIPEVLAPMM---QAQLEKKL-----RFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTD 72 (274) T ss_pred CCccceehhheechHHHHHHH---HHhhhhhh-----hhcccceecccccCCCCCEEEEeeecCCCccccccCCCccccc Confidence 775 6666666653 22222222 13334433211 01111111111122222332 456677 Q ss_pred ccccccceeeeecccceeecchhhhhcCCcchhHHHHHHHHHHHHhcHHHHHHHHHhccCCCcccCcccccccccccccc Q lcl|NC_020198. 67 DMAAQGYQITNKLFESTVGVKRTDIEDDNLGVYGPLMQEMGRAAGAHPDELVFALLKAGNANLCYDGQNFFDTDHPVYPN 146 (304) Q Consensus 67 ~l~~~~~~i~nk~fe~tv~v~R~~I~dDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~cyDGq~fF~tdH~v~~~ 146 (304) .+....-+.+.+.++..++|+-.+...---.......+++|++-++..|.-+++.|+.+ +. T Consensus 73 ~lt~~~~~~~i~~~~~~~~i~D~~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a--~~----------------- 133 (274) T protein:vir:94 73 ILETKKREAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGA--KL----------------- 133 (274) T ss_pred ccccceeEEEeeeecceecccHHHHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhcc--Cc----------------- Confidence 77777777888888999999988877754446777888888888888888888776431 00 Q ss_pred cccccchhhhhhhhcccCCCCccceeccCCccchhhhhhhccccchhhcccCcccccccccceEEEeeccccccccchhh Q lcl|NC_020198. 147 VDGTGTATTVSNLFAPAADPGAAWYLLDTSRSLKPLIYQERMKPSFTSMTKEDDEQVFMADEYRYGVRSRCNVGFGFWQL 226 (304) Q Consensus 147 ~~~tg~~~s~snl~~~~~~~g~~w~L~d~~~~~kP~i~Q~r~~~~~~~~~~~~~~~vf~~~~~~~Gvd~R~n~G~g~wq~ 226 (304) . + T Consensus 134 -~-------~---------------------------------------------------------------------- 135 (274) T protein:vir:94 134 -T-------V---------------------------------------------------------------------- 135 (274) T ss_pred -c-------c---------------------------------------------------------------------- Confidence 0 0 Q ss_pred hhccccccchhHHHHHHHHHHHhccCCCceeceecCeEEecchHHHHHHHHHhhhccC--CCCcc--------eecceee Q lcl|NC_020198. 227 AAMSTEELNQVNFEKVYDAMRNQKADGGRPLDIRPNLLVVPTTLRSKAKEVVGVQRLA--NGADN--------PNFELVQ 296 (304) Q Consensus 227 a~~s~~~l~~~~l~aar~aM~~~k~~~G~~L~i~P~~LvVpp~le~~A~~ll~~~~~~--~g~~N--------~~~g~~~ 296 (304) +..+++.+.+-.|++.+... + ..++.|+|+|.....-++-..-+.+. ....+ -+.| ++ T Consensus 136 ---~~~~~~~d~i~dA~~~l~d~----~----~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G-~~ 203 (274) T protein:vir:94 136 ---NADITKLNGLQSAIDKFNDE----D----LEPMVLFVNPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALG-AI 203 (274) T ss_pred ---cccccCHHHHHHHHHHhhcc----C----CCceEEEeCHHHHHHHHhhhhhhccccCcccccceeccccceecC-ee Confidence 00122334455555555432 1 24678999999776655432112211 11112 2445 59 Q ss_pred EEeccccC Q lcl|NC_020198. 297 VLDTAWLN 304 (304) Q Consensus 297 ~iv~p~Ld 304 (304) |++++.+. T Consensus 204 Vi~s~~~p 211 (274) T protein:vir:94 204 IVRTNKLE 211 (274) T ss_pred EEEcCCCC Confidence 99999888 No 42 >protein:vir:104342 Length: 314 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398971;genbank:gi:81343955;genbank:GeneID:3778874 Probab=61.19 E-value=0.36 Score=23.03 Aligned_cols=206 Identities=17% Similarity=0.168 Sum_probs=94.1 Q ss_pred CCCccHHHHHHHHHH-------------------HHHHHHHHHhh--cchhhcceE---EEecCCccccccccccCCccc Q lcl|NC_020198. 1 MAIITPALISALKTS-------------------FQKHFQDALAT--APSTYLQVA---TVIPSTTASNTYGWLGQFPKL 56 (304) Q Consensus 1 maii~~~~l~~l~~~-------------------~~~~f~~a~~~--a~~~~~~~a---~~v~S~~~~~~y~~Lg~~P~l 56 (304) ||+==.+.+..+... +...=+.-|+. ++-+|+++- +.++.-..+-+|...... +. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~d~~~~fl~~ql~~id~~v~e~~~~~~~~~~~i~v~~~~~~~~et~~~~~~e~~-G~ 79 (314) T protein:vir:10 1 MAIKFDAEQAKITTHLEQMGVEKADAAGIWAVSQLTAALNRAYEKEYAENSVVNIFPVTNEIPGHAKYFEYPEFDGV-GI 79 (314) T ss_pred CccchHHHHHHHHHHHHhhcccchhhhHHHHHHHHHHHHHHHhhhhccccccceeeccccCCCCceeEEEeeeeccc-cc Confidence 333111122221111 11111122321 123344442 122222223344433222 33 Q ss_pred hhccccee----ecccccccceeeeecccceeecchhhhhcCC---cchhHHHHHHHHHHHHhcHHHHHHHHHhccCCCc Q lcl|NC_020198. 57 REWIGQRV----IKDMAAQGYQITNKLFESTVGVKRTDIEDDN---LGVYGPLMQEMGRAAGAHPDELVFALLKAGNANL 129 (304) Q Consensus 57 rEw~Ge~~----~~~l~~~~~~i~nk~fe~tv~v~R~~I~dDd---lG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~ 129 (304) -+|+|++. .-+..-..++.+...|+..+.++.++++-=. +-+=++....+.++..+++|+++| .| T Consensus 80 a~~~~d~~~dip~vd~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f----~G---- 151 (314) T protein:vir:10 80 AQIIADYSDDLPLVDAFMTEKQGKVFRFGNAFLISTDEIKAGAATGQSLSARKQALAFEAHDNLLDKLVW----SG---- 151 (314) T ss_pred eeeeCCcccccceeecccceeEEEEEEEEeeEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEE----ee---- Confidence 35776542 2233446678888999999999999998652 333345555556666666666655 12 Q ss_pred ccCcccccccccccccccccccchhhhhhhhcccCCCCccceeccCCccchhhhhhhccccchhhcccCcccccccccce Q lcl|NC_020198. 130 CYDGQNFFDTDHPVYPNVDGTGTATTVSNLFAPAADPGAAWYLLDTSRSLKPLIYQERMKPSFTSMTKEDDEQVFMADEY 209 (304) Q Consensus 130 cyDGq~fF~tdH~v~~~~~~tg~~~s~snl~~~~~~~g~~w~L~d~~~~~kP~i~Q~r~~~~~~~~~~~~~~~vf~~~~~ 209 (304) |+.|.+++ |+..+. |.+ T Consensus 152 --------~~~~g~~G--------------------------LlN~p~-v~~---------------------------- 168 (314) T protein:vir:10 152 --------SAPHGIVS--------------------------VFDQPN-INN---------------------------- 168 (314) T ss_pred --------ccccccee--------------------------EeecCC-Ccc---------------------------- Confidence 22332211 110000 000 Q ss_pred EEEeeccccccccchhhhhccccccchhHHHHHHHHHHHhccCCCceeceecCeEEecchHHHHHHHHHhhhccCCCCc- Q lcl|NC_020198. 210 RYGVRSRCNVGFGFWQLAAMSTEELNQVNFEKVYDAMRNQKADGGRPLDIRPNLLVVPTTLRSKAKEVVGVQRLANGAD- 288 (304) Q Consensus 210 ~~Gvd~R~n~G~g~wq~a~~s~~~l~~~~l~aar~aM~~~k~~~G~~L~i~P~~LvVpp~le~~A~~ll~~~~~~~g~~- 288 (304) + ++ -+-| ++..--.+-+.+++.++..|.. | .+.|+.|++||+... +|.+.. ++++. T Consensus 169 ---~----~~-~~~W-----aT~~ei~~Di~~~~~~l~~~s~--g---~~~p~~l~Lpp~~~~----~L~~~~-~~~~~t 225 (314) T protein:vir:10 169 ---V----VA-TPNW-----SVPQNAIDDVTAMIDAVESSTQ--G---LHHVTDILLPASARR----VMQGLV-PQTNLS 225 (314) T ss_pred ---c----cC-CCCc-----ccHHHHHHHHHHHHHHHHHhcC--c---cccceeEEecHHHHH----hhcccc-cCCCcc Confidence 0 00 0012 0111112344555666665543 3 357999999999764 444322 22222 Q ss_pred --------ceecceeeEEeccccC Q lcl|NC_020198. 289 --------NPNFELVQVLDTAWLN 304 (304) Q Consensus 289 --------N~~~g~~~~iv~p~Ld 304 (304) || .+++.-.|+|+ T Consensus 226 vl~~l~~n~~---~l~I~~~~el~ 246 (314) T protein:vir:10 226 YGELFTRNNP---GLTIRFLQFLD 246 (314) T ss_pred HHHHHHHhCC---CcEEEEccccc Confidence 33 48888899999 No 43 >protein:vir:9759 Length: 303 # NCBI annotation: putative structural protein # Family: family:all:966 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795521;genbank:gi:28876283;genbank:GeneID:1257824 Probab=58.81 E-value=0.41 Score=22.74 Aligned_cols=237 Identities=8% Similarity=0.045 Sum_probs=102.1 Q ss_pred CCCccHHHHHHHHHHHHHHHHHHHhhcchhhcceEEEecCCccccccccccCCccchhcccce---eecccccccceeee Q lcl|NC_020198. 1 MAIITPALISALKTSFQKHFQDALATAPSTYLQVATVIPSTTASNTYGWLGQFPKLREWIGQR---VIKDMAAQGYQITN 77 (304) Q Consensus 1 maii~~~~l~~l~~~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~lrEw~Ge~---~~~~l~~~~~~i~n 77 (304) |+--+..-. .+=..+...+.+.+.. .+...++|+++|-......|.++..-|. -.|+||- ...++.=..-+++- T Consensus 1 m~t~t~gg~-liP~~~~~~ii~~l~~-~s~i~~l~~~~~~~~~~~~ip~~~~~~~-a~wv~E~~~~~~s~~~f~~v~l~~ 77 (303) T protein:vir:97 1 MGTETSKAS-LFDKHLVSDLINKVKG-HSSLAKLSSQKPIPFNGSKEFTFTLDSD-IDVVAENGKKTHGGLSLEPVTIVP 77 (303) T ss_pred CcccCCCCe-EcchhHHHHHHHHHHh-hchhhhhcceeecCCCceEEEEEecCcc-eEEeecCccccccccceeeEEeee Confidence 774322110 0111222222222222 3446677887776555556666655554 3788654 33333335667788 Q ss_pred ecccceeecchhhh---hcCCcchhHHHHHHHHHHHHhcHHHHHHHHHhccCCCcccCcccccccccccccccccccchh Q lcl|NC_020198. 78 KLFESTVGVKRTDI---EDDNLGVYGPLMQEMGRAAGAHPDELVFALLKAGNANLCYDGQNFFDTDHPVYPNVDGTGTAT 154 (304) Q Consensus 78 k~fe~tv~v~R~~I---~dDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~cyDGq~fF~tdH~v~~~~~~tg~~~ 154 (304) ++.+..+.|+|+.+ .||..++..-+..+++++.++..+..++. |.++. +|...=-... .++.+. . T Consensus 78 ~kl~~~~~iS~ell~~~~d~~~~l~~~i~~~la~a~~~~ld~a~l~----G~~~~--~g~~~~~~~~---~~~~~~--~- 145 (303) T protein:vir:97 78 IKVEYGARLSDEFLYATEEEKIDILKAFNEGFAKKLARGIDLMAMH----GINPR--TKKASDVIGT---NHFDSK--V- 145 (303) T ss_pred EEEEEeehhhHHHhhcCccchHHHHHHHHHHHHHHHHHHHHhhhhc----ccccC--Cccccccccc---cccccc--c- Confidence 89999999999988 57788999999999999999988876552 21111 1111000000 000000 0 Q ss_pred hhhhhhcccCCCCccceeccCCccchhhhhhhccccchhhcccCcccccccccceEEEeeccccccccchhhhhcccccc Q lcl|NC_020198. 155 TVSNLFAPAADPGAAWYLLDTSRSLKPLIYQERMKPSFTSMTKEDDEQVFMADEYRYGVRSRCNVGFGFWQLAAMSTEEL 234 (304) Q Consensus 155 s~snl~~~~~~~g~~w~L~d~~~~~kP~i~Q~r~~~~~~~~~~~~~~~vf~~~~~~~Gvd~R~n~G~g~wq~a~~s~~~l 234 (304) .+.. ..+.....|- .|.=++..... ...+... ++ + T Consensus 146 --~~~~-~~~~~~~~~~------~i~~~~~~~~~--------~~~~~~~-----~v-----------------------m 180 (303) T protein:vir:97 146 --TQVV-KFTESEDADA------NIEAAVNLIQG--------AEGVVTG-----LA-----------------------M 180 (303) T ss_pred --cccc-ccccccchHH------HHHHHHHHHhh--------cCCCccE-----EE-----------------------E Confidence 0000 0000001110 01100000000 0000000 11 1 Q ss_pred chhHHHHHHHHHHHhccCCCceeceecCeE--EecchHHHHHHHHHhhhccCCCC-----cc-eecc------------e Q lcl|NC_020198. 235 NQVNFEKVYDAMRNQKADGGRPLDIRPNLL--VVPTTLRSKAKEVVGVQRLANGA-----DN-PNFE------------L 294 (304) Q Consensus 235 ~~~~l~aar~aM~~~k~~~G~~L~i~P~~L--vVpp~le~~A~~ll~~~~~~~g~-----~N-~~~g------------~ 294 (304) +. ..+.++++.|+..|+++- .|+.= ..|..|. ...++-++..+.+. .+ .+.| - T Consensus 181 n~----~~~~~L~~lkd~~g~~~~-~~~~~~~~~~~~l~--G~Pv~~s~~v~~~~~~~~~~~~~~~Gdf~~~~~~~~~~~ 253 (303) T protein:vir:97 181 DT----EFSTALAKVTNGEMGPKM-YPELAWGANPDSIN--GLKSSVNTTVGAGADEAESKDLVIIGDFESMFKWGYAKQ 253 (303) T ss_pred cH----HHHHHHHHhhccCCCeEE-ecCccCCCCCceec--ceeeEEecccCCccccCCCccEEEEeeccccEEEEEecC Confidence 22 234567788999998763 22110 0011111 11122222222111 11 1222 2 Q ss_pred eeEEeccccC Q lcl|NC_020198. 295 VQVLDTAWLN 304 (304) Q Consensus 295 ~~~iv~p~Ld 304 (304) +++-+.++-| T Consensus 254 ~~~~~~~~~~ 263 (303) T protein:vir:97 254 IPMEIIKYGD 263 (303) T ss_pred cEEEEeeccC Confidence 3444455554 No 44 >protein:vir:102655 Length: 322 # NCBI annotation: Hypothetical protein # Family: family:all:6384 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_052979;genbank:gi:50282923;genbank:GeneID:2948122 Probab=58.20 E-value=0.42 Score=22.66 Aligned_cols=210 Identities=11% Similarity=0.136 Sum_probs=116.0 Q ss_pred CCCccHHHHHHHHHHHHHHHHHHHhhcchhhcceEEEecCCcccccccccc--CCccchh-----cccceeec----ccc Q lcl|NC_020198. 1 MAIITPALISALKTSFQKHFQDALATAPSTYLQVATVIPSTTASNTYGWLG--QFPKLRE-----WIGQRVIK----DMA 69 (304) Q Consensus 1 maii~~~~l~~l~~~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg--~~P~lrE-----w~Ge~~~~----~l~ 69 (304) .++++.++-++.-.-|...++.+|++-.+-.++-++......+...+..++ .++..++ =.++-.+. ... T Consensus 10 ~~~Ms~~i~~~fv~qy~~~v~~~~qq~~s~L~~tV~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~dtp~~~~~ 89 (322) T protein:vir:10 10 LPLIAGDIDQAFVQTYETTLRILSQQKSAKLKQYCQHKNESSESHNWETLASMDPDAVKRKRSRQQSADGTYPTPVNNKP 89 (322) T ss_pred eeeeechhhhHHHHHHHHHHHHHHHHhhhhhhcccccccccccccceeecccccccccccccccccccCcccCCCccccc Confidence 144555444444466788888888887777776654333344444444333 3333321 11222222 222 Q ss_pred cccceeeeecccceeecchhhhhcCCcchhHHHHHHHHHHHHhcHHHHHHHHHhccCCCcccCccccccccccccccccc Q lcl|NC_020198. 70 AQGYQITNKLFESTVGVKRTDIEDDNLGVYGPLMQEMGRAAGAHPDELVFALLKAGNANLCYDGQNFFDTDHPVYPNVDG 149 (304) Q Consensus 70 ~~~~~i~nk~fe~tv~v~R~~I~dDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~cyDGq~fF~tdH~v~~~~~~ 149 (304) ...-.+...+|.--+-|++.|...=.+..-++..+..|.+-++--|+.++..+..+.+ . .+ T Consensus 90 ~~~r~~~~~d~~~~~~VDd~D~~k~~~D~~~~~~~~~a~AL~R~~D~~I~~a~~g~a~-~----------~~-------- 150 (322) T protein:vir:10 90 FAKRRTNVDTYDTGHVVEQEDISQMLLDPNSALITSQAYAMARKTDDLIIAGAWKPAS-I----------KG-------- 150 (322) T ss_pred cceEEEeecccccceecchHHHHHhhcCchHHHHHHHHHHhhhHHHHHHHhhhhcccc-c----------cc-------- Confidence 3334456667777788888886655566667777899999999999999876654211 1 11 Q ss_pred ccchhhhhhhhcccCCCCccceeccCCccchhhhhhhccccchhhcccCcccccccccceEEEeeccccccccchhhhhc Q lcl|NC_020198. 150 TGTATTVSNLFAPAADPGAAWYLLDTSRSLKPLIYQERMKPSFTSMTKEDDEQVFMADEYRYGVRSRCNVGFGFWQLAAM 229 (304) Q Consensus 150 tg~~~s~snl~~~~~~~g~~w~L~d~~~~~kP~i~Q~r~~~~~~~~~~~~~~~vf~~~~~~~Gvd~R~n~G~g~wq~a~~ 229 (304) .|..+ .+. .++.+ . . T Consensus 151 ~gt~v----------------~~~-------------------------ss~~i--------------~----------~ 165 (322) T protein:vir:10 151 TGQPV----------------EFL-------------------------ATQEI--------------G----------D 165 (322) T ss_pred ccccc----------------ccC-------------------------CCccc--------------c----------c Confidence 11000 000 00000 0 0 Q ss_pred cccccchhHHHHHHHHHHHhccC-CCceeceecCeEEecchHHHHHHHHHhh------hcc------CCCCcceecceee Q lcl|NC_020198. 230 STEELNQVNFEKVYDAMRNQKAD-GGRPLDIRPNLLVVPTTLRSKAKEVVGV------QRL------ANGADNPNFELVQ 296 (304) Q Consensus 230 s~~~l~~~~l~aar~aM~~~k~~-~G~~L~i~P~~LvVpp~le~~A~~ll~~------~~~------~~g~~N~~~g~~~ 296 (304) +...++.+-|-+|++.+++..-+ +|. +++||+|+.+.. ||+- ++. .+|...-|.| ++ T Consensus 166 g~~g~t~~kl~~a~~~l~~~dvp~d~~------R~~vv~p~~~~~---LL~d~~~ts~D~~~~~~l~~~G~ig~~lG-f~ 235 (322) T protein:vir:10 166 GTKPISFDYVTEITERFLENEIEPEVS------KVIVIGPTQARK---LLQITEATSADYTSAMDLQSKGIITNWMG-YT 235 (322) T ss_pred CccchhHHHHHHHHHHHHhcCCCCCCC------eEEEeCHHHHHH---HhcchhhhhhhcccchhhhhcCeeeeeee-EE Confidence 13456666677777777666543 342 279999998654 4432 221 1344444656 67 Q ss_pred EEeccccC Q lcl|NC_020198. 297 VLDTAWLN 304 (304) Q Consensus 297 ~iv~p~Ld 304 (304) ++++.+|. T Consensus 236 ~i~s~~lp 243 (322) T protein:vir:10 236 WIVSTRLD 243 (322) T ss_pred EEEeccCC Confidence 88888887 No 45 >protein:vir:78523 Length: 338 # NCBI annotation: Putative head structural protein # Family: family:all:507 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491585;genbank:gi:157786408;genbank:GeneID:5625675 Probab=57.69 E-value=0.43 Score=22.60 Aligned_cols=236 Identities=15% Similarity=0.129 Sum_probs=98.2 Q ss_pred CCCccHHHHHHHHHH--------------HHHHHHH----HHhhcchhhcceEEEecCCccccccccccCCccc------ Q lcl|NC_020198. 1 MAIITPALISALKTS--------------FQKHFQD----ALATAPSTYLQVATVIPSTTASNTYGWLGQFPKL------ 56 (304) Q Consensus 1 maii~~~~l~~l~~~--------------~~~~f~~----a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~l------ 56 (304) ||.+|. |++..++ +...+.+ -.. ..+...++|.++|-.....+|..+..-|.. T Consensus 1 ~~~~~e--~~~~~~~~~~~~~~~~~~~~liP~~~~~~ii~~~~-~~s~l~~l~~~~~~~~~~~~ip~~~~~~~a~~v~~~ 77 (338) T protein:vir:78 1 MATLNE--LAPNTAGSNHQGRLAHVPSDLLPKEIVGPIFDKAQ-ESSLVLRLGENIPISYGETIIPTTVKRPEVGQVGVG 77 (338) T ss_pred CcchHH--hhhhhcccccccceecccccccchHHHHHHHHHHH-hhchhhhhcceeeccCCceEEEEEecCccceeeccc Confidence 666553 3333221 1111211 111 123345667777755444444433333322 Q ss_pred -hhcccc---eeecccccccceeeeecccceeecchhhhhcCCcchhHHHHHHHHHHHHhcHHHHHHHHHhccCCCcccC Q lcl|NC_020198. 57 -REWIGQ---RVIKDMAAQGYQITNKLFESTVGVKRTDIEDDNLGVYGPLMQEMGRAAGAHPDELVFALLKAGNANLCYD 132 (304) Q Consensus 57 -rEw~Ge---~~~~~l~~~~~~i~nk~fe~tv~v~R~~I~dDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~cyD 132 (304) -.|+|| +...++.=..-+++-++++..+.|+++.+.|....+..-+.+.++++.++..++.++. |..+.... T Consensus 78 ~~~~~~Eg~~~~~~~~~f~~v~l~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~a~~~~~d~~~l~----G~g~~~~~ 153 (338) T protein:vir:78 78 TSNEQREGGTKPLSGTAWDTRSVAPIKLATIVTVSEEFARMNPSGLYTKLQADLAYAIGRGIDLAVFH----GKSPLTGS 153 (338) T ss_pred ccccccccccccccccceeEEEEEEEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHhhc----ccCCCccc Confidence 234444 3333444445667788999999999999999999999999999999999999886552 32221111 Q ss_pred cccccccccccccccccccchhhhhhhhcccCCCCccceeccCCccchhhhhhhccccchhhcccCcccccccccceEEE Q lcl|NC_020198. 133 GQNFFDTDHPVYPNVDGTGTATTVSNLFAPAADPGAAWYLLDTSRSLKPLIYQERMKPSFTSMTKEDDEQVFMADEYRYG 212 (304) Q Consensus 133 Gq~fF~tdH~v~~~~~~tg~~~s~snl~~~~~~~g~~w~L~d~~~~~kP~i~Q~r~~~~~~~~~~~~~~~vf~~~~~~~G 212 (304) +-.-+-+++.. ......+.. .. .... ++| .+.=.+.+.+... ..+..+| T Consensus 154 ~~~gi~~~~~~-------~~~~~~~~~---~~-~~~~--~~~---~~~~~~~~~~~~~-------~~~~~~~-------- 202 (338) T protein:vir:78 154 ALQGIDTNNVI-------VNTTNVDYL---QT-GTTP--LLD---RFLDGYDLVSANT-------DVDFNGW-------- 202 (338) T ss_pred ccccccccccc-------ccccccccc---cc-cchh--hHH---HHHHHHHHhhhhc-------cccceEE-------- Confidence 10001111110 000000000 00 0000 000 0000000000000 0011111 Q ss_pred eeccccccccchhhhhccccccchhHHHHHHHHHHHhccCCCceece------ecCeEE-ecchHHHHHHHHHhhhccCC Q lcl|NC_020198. 213 VRSRCNVGFGFWQLAAMSTEELNQVNFEKVYDAMRNQKADGGRPLDI------RPNLLV-VPTTLRSKAKEVVGVQRLAN 285 (304) Q Consensus 213 vd~R~n~G~g~wq~a~~s~~~l~~~~l~aar~aM~~~k~~~G~~L~i------~P~~Lv-Vpp~le~~A~~ll~~~~~~~ 285 (304) -++. ...+....++..|+.+|+||-. .|..|. +|. +-++.++. T Consensus 203 --------------------~m~~-~~~~~L~~~~~l~d~~g~~l~~~~~~~~~~~~l~G~PV---------~~~~~ip~ 252 (338) T protein:vir:78 203 --------------------AADP-RYRARLLRSQAYRDANGNVDPTRINLAASAGDLLGLPV---------QFGKAVGG 252 (338) T ss_pred --------------------EEch-HHHHHHHHHhhhccCCCceeecccccCCCCceeeeeeE---------EEccccCc Confidence 0111 1223345566778888888732 111111 111 11111110 Q ss_pred --------------CC-ccee---cceeeEEeccccC Q lcl|NC_020198. 286 --------------GA-DNPN---FELVQVLDTAWLN 304 (304) Q Consensus 286 --------------g~-~N~~---~g~~~~iv~p~Ld 304 (304) |+ ++.+ ++-+++-+++.-. T Consensus 253 ~~~~~~~~~~~~~~gdfs~~~~~~~~~~~i~~~~~~~ 289 (338) T protein:vir:78 253 DLGAATDSKVRVVGGDFSQLKYGFADEIRVKMSDTAT 289 (338) T ss_pred cccccCCcccEEEEEecceEEEEeecccEEEEeeccc Confidence 11 1111 1223444444333 No 46 >protein:vir:99749 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004307;genbank:gi:122891761;genbank:GeneID:4712304 Probab=56.83 E-value=0.45 Score=22.50 Aligned_cols=223 Identities=12% Similarity=0.066 Sum_probs=94.7 Q ss_pred CCCccHHHHHHHHHHHHHHHHHHHhhcchhhcceEEEecCCccccccccccCCccchhcccce---eecccccccceeee Q lcl|NC_020198. 1 MAIITPALISALKTSFQKHFQDALATAPSTYLQVATVIPSTTASNTYGWLGQFPKLREWIGQR---VIKDMAAQGYQITN 77 (304) Q Consensus 1 maii~~~~l~~l~~~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~lrEw~Ge~---~~~~l~~~~~~i~n 77 (304) |.--+... .|-..+...+.+...+. +...++|+.+|..+....|..+..-|.. +|++|- ...++.-..-++.- T Consensus 30 ~~~~~~~~--lip~~~~~~ii~~~~~~-s~l~~~~~~~~~~~~~~~~p~~~~~~~a-~~v~Eg~~~~~~~~~~~~v~~~~ 105 (324) T protein:vir:99 30 MMHEKKDG--TLLNDFTTPILQEVMEN-SKIMRLGKYEPMEGTEKKFTFWADKPGA-YWVGEGQKIETSKATWVNATMRA 105 (324) T ss_pred eccCCCcc--eechhHHHHHHHHHHhh-chhhhhcceeeccCCceEEEEEecCcce-eEeccCccccccccceeEEEEee Confidence 11100000 01111222222222222 2245567777755555556555444443 787543 33444445567788 Q ss_pred ecccceeecchhhhhcCCcchhHHHHHHHHHHHHhcHHHHHHHHHhccCCCcccCcccccccccccccccccccchhhhh Q lcl|NC_020198. 78 KLFESTVGVKRTDIEDDNLGVYGPLMQEMGRAAGAHPDELVFALLKAGNANLCYDGQNFFDTDHPVYPNVDGTGTATTVS 157 (304) Q Consensus 78 k~fe~tv~v~R~~I~dDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~cyDGq~fF~tdH~v~~~~~~tg~~~s~s 157 (304) ++++..+.|+|+.+.|-...+..-+.+.++++.++..++.++. |.+.. ..+..+..+-.. . ...+...++.. T Consensus 106 ~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~ai~~~~d~~~l~----G~g~~-~~~~~~~~~~~~--~-~~~~~~~~~~~ 177 (324) T protein:vir:99 106 FKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGIL----NQGNN-PFGKSIAQSIEK--T-NKVIKGDFTQD 177 (324) T ss_pred EEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhh----cCCCC-ccCccccccccc--c-ceeccccCCHH Confidence 9999999999999999889999999999999999999886652 21110 111111111000 0 00000000000 Q ss_pred hhhcccCCCCccceeccCCccchhhhhhhccccchhhcccCcccccccccceEEEeeccccccccchhhhhccccccchh Q lcl|NC_020198. 158 NLFAPAADPGAAWYLLDTSRSLKPLIYQERMKPSFTSMTKEDDEQVFMADEYRYGVRSRCNVGFGFWQLAAMSTEELNQV 237 (304) Q Consensus 158 nl~~~~~~~g~~w~L~d~~~~~kP~i~Q~r~~~~~~~~~~~~~~~vf~~~~~~~Gvd~R~n~G~g~wq~a~~s~~~l~~~ 237 (304) . |++.-..|++ . .+.+. -| -++.. T Consensus 178 ~-------------i~~~~~~l~~-----------------~---------------~~~~~---~~--------v~n~~ 201 (324) T protein:vir:99 178 N-------------IIDLEALLED-----------------D---------------ELEAN---AF--------ISKTQ 201 (324) T ss_pred H-------------HHHHHHhhhh-----------------c---------------cCCCC---EE--------EEcHH Confidence 0 0000000000 0 00000 01 12222 Q ss_pred HHHHHHHHHHHhccCCCceece--ecCeEE-ecchHHHHHHHHHhhhccCCCCc--------cee----cc-eeeEEecc Q lcl|NC_020198. 238 NFEKVYDAMRNQKADGGRPLDI--RPNLLV-VPTTLRSKAKEVVGVQRLANGAD--------NPN----FE-LVQVLDTA 301 (304) Q Consensus 238 ~l~aar~aM~~~k~~~G~~L~i--~P~~Lv-Vpp~le~~A~~ll~~~~~~~g~~--------N~~----~g-~~~~iv~p 301 (304) . +..++++|+..|+++-. .|..|. +|. +-+...+.+.. +-+ .+ .+++.-++ T Consensus 202 ~----~~~L~~l~d~~g~~~~~~~~~~~l~G~PV---------v~~~~~~~~~~~~i~gd~~~~~~~~~~~~~i~~~~~~ 268 (324) T protein:vir:99 202 N----RSLLRKIVDPETKERIYDRNSDTLDGLPV---------VNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETA 268 (324) T ss_pred H----HHHHHHhhcCCCceeecCCCCccccceeE---------EeecCCCCCcceEEEEecccEEEEEecCcEEEEeecc Confidence 2 34567789988987622 222211 111 11111111110 111 11 12222222 Q ss_pred ccC Q lcl|NC_020198. 302 WLN 304 (304) Q Consensus 302 ~Ld 304 (304) .+. T Consensus 269 ~~~ 271 (324) T protein:vir:99 269 QLS 271 (324) T ss_pred ccc Confidence 222 No 47 >protein:vir:96123 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240078;genbank:gi:66395742;genbank:GeneID:5133103 Probab=56.26 E-value=0.46 Score=22.43 Aligned_cols=186 Identities=15% Similarity=0.186 Sum_probs=98.7 Q ss_pred CCC--------ccHHHHHHHHHHHHHHHHHHHhhcchhhcceEEEecC----Cccccccc-c--ccCCccchhcccceee Q lcl|NC_020198. 1 MAI--------ITPALISALKTSFQKHFQDALATAPSTYLQVATVIPS----TTASNTYG-W--LGQFPKLREWIGQRVI 65 (304) Q Consensus 1 mai--------i~~~~l~~l~~~~~~~f~~a~~~a~~~~~~~a~~v~S----~~~~~~y~-~--Lg~~P~lrEw~Ge~~~ 65 (304) ||- |.|+++..+. ...|++.+ -+..++..-.+ ....-++. | +|+.-.+.|. .+... T Consensus 1 ma~~~T~~~d~i~Pev~s~~v---~~~~~~~~-----~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~~~~~g-~~i~~ 71 (274) T protein:vir:96 1 MAQGTTKVSNLIVPEVLAPMM---QAELDKKL-----RFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEG-EKIPV 71 (274) T ss_pred CCccccchhhhhhhHHHHHHH---HHHHHhhh-----hhcccccccccccCCCCCEEEEEeeccCCCccccCCC-CcCch Confidence 885 3334444332 22222222 12233322111 01111111 2 2333333332 35667 Q ss_pred cccccccceeeeecccceeecchhhhhcCCcchhHHHHHHHHHHHHhcHHHHHHHHHhccCCCcccCccccccccccccc Q lcl|NC_020198. 66 KDMAAQGYQITNKLFESTVGVKRTDIEDDNLGVYGPLMQEMGRAAGAHPDELVFALLKAGNANLCYDGQNFFDTDHPVYP 145 (304) Q Consensus 66 ~~l~~~~~~i~nk~fe~tv~v~R~~I~dDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~cyDGq~fF~tdH~v~~ 145 (304) .++....-+.+.+.++..+.|+-.+..----.....+.+++|++-++..|.-+++.|+.+ + . T Consensus 72 ~~it~~~~~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~~d~~i~~~l~~a--~------------~---- 133 (274) T protein:vir:96 72 DQIGTSKREAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGA--T------------L---- 133 (274) T ss_pred hhcccceeEEEEEeeeceeeecHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHHHHhcC--C------------C---- Confidence 777777777788888999999988776655567788888899998888888887766321 0 0 Q ss_pred ccccccchhhhhhhhcccCCCCccceeccCCccchhhhhhhccccchhhcccCcccccccccceEEEeeccccccccchh Q lcl|NC_020198. 146 NVDGTGTATTVSNLFAPAADPGAAWYLLDTSRSLKPLIYQERMKPSFTSMTKEDDEQVFMADEYRYGVRSRCNVGFGFWQ 225 (304) Q Consensus 146 ~~~~tg~~~s~snl~~~~~~~g~~w~L~d~~~~~kP~i~Q~r~~~~~~~~~~~~~~~vf~~~~~~~Gvd~R~n~G~g~wq 225 (304) + T Consensus 134 ---------~---------------------------------------------------------------------- 134 (274) T protein:vir:96 134 ---------T---------------------------------------------------------------------- 134 (274) T ss_pred ---------C---------------------------------------------------------------------- Confidence 0 Q ss_pred hhhccccccchhHHHHHHHHHHHhccCCCceeceecCeEEecchHHHHHHHHHhhhccC--CCCcc--------eeccee Q lcl|NC_020198. 226 LAAMSTEELNQVNFEKVYDAMRNQKADGGRPLDIRPNLLVVPTTLRSKAKEVVGVQRLA--NGADN--------PNFELV 295 (304) Q Consensus 226 ~a~~s~~~l~~~~l~aar~aM~~~k~~~G~~L~i~P~~LvVpp~le~~A~~ll~~~~~~--~g~~N--------~~~g~~ 295 (304) .+..+++.+.+-.|++.+... + ..++.|+|+|.....-++.-.-+.+. .+..| -+.| + T Consensus 135 ---~~~~~~~~d~i~dA~~~l~d~----~----~~~~~ivv~p~~~~~L~k~~~~~f~~~~~~g~~~~~~g~ig~~~G-~ 202 (274) T protein:vir:96 135 ---VEADITKLDGLQTAIDKFNDE----D----LEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALG-A 202 (274) T ss_pred ---cCcccccHHHHHHHHHHhccc----C----CCceEEEeCHHHHHHHHhcccccccccccccccceeecccceecC-e Confidence 001122333444555555332 1 25789999999766655532112211 11112 2344 6 Q ss_pred eEEeccccC Q lcl|NC_020198. 296 QVLDTAWLN 304 (304) Q Consensus 296 ~~iv~p~Ld 304 (304) +|++++.|. T Consensus 203 ~Vi~s~~~p 211 (274) T protein:vir:96 203 VIVRSNKLN 211 (274) T ss_pred eEEEcCCCC Confidence 899999997 No 48 >protein:vir:80930 Length: 278 # NCBI annotation: Cps # Family: family:all:522 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468392;genbank:gi:157324966;genbank:GeneID:5601363 Probab=54.66 E-value=0.5 Score=22.24 Aligned_cols=193 Identities=12% Similarity=0.109 Sum_probs=95.3 Q ss_pred CCC--------ccHHHHHHHHHHHHHHHHHHHhhcchhhcceEEEecC-------CccccccccccCCccchhcccceee Q lcl|NC_020198. 1 MAI--------ITPALISALKTSFQKHFQDALATAPSTYLQVATVIPS-------TTASNTYGWLGQFPKLREWIGQRVI 65 (304) Q Consensus 1 mai--------i~~~~l~~l~~~~~~~f~~a~~~a~~~~~~~a~~v~S-------~~~~~~y~~Lg~~P~lrEw~Ge~~~ 65 (304) ||- |.|+.+.++- ...|++.+ -+.+++....+ +-....|.-+|+.-.+.| -.+... T Consensus 1 Ma~~~T~~~~~iiPev~s~~v---~~~~~~~~-----v~~~~~~~~~~l~g~~G~tv~ip~~~~~g~a~~~~~-g~~i~~ 71 (278) T protein:vir:80 1 MADLTTKLANLIDPEVMGPMI---SAKLPKAI-----KFGKIAPIDNSLEGQPGSEITVPKYKYIGDAQDVAE-GAAIDY 71 (278) T ss_pred CCCcceehhheecHHHHHHHH---HHHHHHhh-----hhcccceecccccCCCCCEEEEeeeccCCcceeecC-CCcCcc Confidence 884 5566555543 22333322 12233322211 111122222332211111 134455 Q ss_pred cccccccceeeeecccceeecchhhhhcCCcchhHHHHHHHHHHHHhcHHHHHHHHHhccCCCcccCccccccccccccc Q lcl|NC_020198. 66 KDMAAQGYQITNKLFESTVGVKRTDIEDDNLGVYGPLMQEMGRAAGAHPDELVFALLKAGNANLCYDGQNFFDTDHPVYP 145 (304) Q Consensus 66 ~~l~~~~~~i~nk~fe~tv~v~R~~I~dDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~cyDGq~fF~tdH~v~~ 145 (304) .++....-+.+.+.++..+.|+..+...---.....+.+++|++.++..|..+++.|+..++. .++ ++ T Consensus 72 ~~lt~~~~~~~i~~~~~a~~v~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~l~~~l~~a~~~--~~~-------~~--- 139 (278) T protein:vir:80 72 SALETESVKHGIKKAGKGVKLTDESVLSGYGDPVEEAQKQIRMAIASKVDNDILEEALTTTLE--VKG-------AI--- 139 (278) T ss_pred cccccceeeEeeehhhccccccHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHHhccccc--ccc-------cc--- Confidence 667666667777888999999999988877778899999999999999999999888642110 000 00 Q ss_pred ccccccchhhhhhhhcccCCCCccceeccCCccchhhhhhhccccchhhcccCcccccccccceEEEeeccccccccchh Q lcl|NC_020198. 146 NVDGTGTATTVSNLFAPAADPGAAWYLLDTSRSLKPLIYQERMKPSFTSMTKEDDEQVFMADEYRYGVRSRCNVGFGFWQ 225 (304) Q Consensus 146 ~~~~tg~~~s~snl~~~~~~~g~~w~L~d~~~~~kP~i~Q~r~~~~~~~~~~~~~~~vf~~~~~~~Gvd~R~n~G~g~wq 225 (304) +.. ... T Consensus 140 ---------t~~--------~~~--------------------------------------------------------- 145 (278) T protein:vir:80 140 ---------NIG--------LID--------------------------------------------------------- 145 (278) T ss_pred ---------ccc--------hhh--------------------------------------------------------- Confidence 000 000 Q ss_pred hhhccccccchhHHHHHHHHHHHhccCCCceeceecCeEEecchHHHHHHHHHhhhccC--CCCcc--------eeccee Q lcl|NC_020198. 226 LAAMSTEELNQVNFEKVYDAMRNQKADGGRPLDIRPNLLVVPTTLRSKAKEVVGVQRLA--NGADN--------PNFELV 295 (304) Q Consensus 226 ~a~~s~~~l~~~~l~aar~aM~~~k~~~G~~L~i~P~~LvVpp~le~~A~~ll~~~~~~--~g~~N--------~~~g~~ 295 (304) -..+.+..++.+|.. .+.| .+.+|+|+|.-...-++.-..+.+. ....| -+.| + T Consensus 146 --------~~~~~~~da~~~l~~----~~~~---~~~~ivv~p~~~~~L~k~~~~~~~~~~~~g~~~~~~G~ig~~~G-~ 209 (278) T protein:vir:80 146 --------KIENTFTDAPDAIED----ESIT---TTGVLFLNYKDTAKLREEAAGSWTKASQLGDDLLVKGAFGELLG-W 209 (278) T ss_pred --------hHHHHHHHHHHhhcc----cCCC---cccEEEECHHHHHHHHhhhhhhccccccccccceeeccceeecc-e Confidence 000122222222211 1112 1346777776554333332112211 11111 2333 5 Q ss_pred eEEeccccC Q lcl|NC_020198. 296 QVLDTAWLN 304 (304) Q Consensus 296 ~~iv~p~Ld 304 (304) ++++++.|. T Consensus 210 ~Vi~s~~~p 218 (278) T protein:vir:80 210 EIVRTKKLA 218 (278) T ss_pred eEEEcCCCC Confidence 888888888 No 49 >protein:vir:105038 Length: 428 # NCBI annotation: major capsid head protein precursor # Family: family:all:21 # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006586;genbank:gi:46402092;genbank:GeneID:2777903 Probab=52.10 E-value=0.39 Score=22.82 Aligned_cols=260 Identities=14% Similarity=0.148 Sum_probs=83.8 Q ss_pred CCCccH-HHH-HHH---------------------------HHHHHHHHHHHHhhcchhhcce-EEEecCCccccccccc Q lcl|NC_020198. 1 MAIITP-ALI-SAL---------------------------KTSFQKHFQDALATAPSTYLQV-ATVIPSTTASNTYGWL 50 (304) Q Consensus 1 maii~~-~~l-~~l---------------------------~~~~~~~f~~a~~~a~~~~~~~-a~~v~S~~~~~~y~~L 50 (304) |++... ..+ ++. -+-+...+-+-+...-+ -.++ ++.+|..+..-++..+ T Consensus 98 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~liP~~~~~~ii~~l~~~~~-l~~~~~~~~~~~~g~~~~p~~ 176 (428) T protein:vir:10 98 MSIAAAQGNLQDAAKFASDELNDQSVSMAISTAAGSGGVLIPQNIHSEVIELLRDRTI-VRKLGARSIPLPNGNMSLPRL 176 (428) T ss_pred HHHHHhhhhHHHHHHHhhhhhhhhhHhhhhcccccCCccccchhHHHHHHHHHhhhch-hhhhcceeeecCCcceEEEEE Confidence 000000 000 000 00001111111111111 1222 4555544444455555 Q ss_pred cCCccchhcccce---eecccccccceeeeecccceeecchhhhhcCCcchhHHHHHHHHHHHHhcHHHHHHHHHhccCC Q lcl|NC_020198. 51 GQFPKLREWIGQR---VIKDMAAQGYQITNKLFESTVGVKRTDIEDDNLGVYGPLMQEMGRAAGAHPDELVFALLKAGNA 127 (304) Q Consensus 51 g~~P~lrEw~Ge~---~~~~l~~~~~~i~nk~fe~tv~v~R~~I~dDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~ 127 (304) ..-|.. .|+||- ...+++=..-++.-++++..+.|||+.+.|-+.++..-+.+.|+++.+...++.++. -.|.+ T Consensus 177 ~~~~~a-~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~v~is~ell~ds~~~l~~~i~~~l~~ai~~~~d~~~l~--G~G~~ 253 (428) T protein:vir:10 177 AGGATA-SYTGENQDAKVSEARFDDVKLTAKTMIAMVPISNALIGRAGFNVEQLVLQDILTAISVREDKAFMR--DDGTG 253 (428) T ss_pred eCCcce-eeeccCccccccccceeeEEeeeEEEEEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhc--cCCCC Confidence 444543 577543 333333344577889999999999999998889999999999999999999886641 11111 Q ss_pred CcccCcc--------cccccccccccccc-------------------cccc--hhhhhh---hhcccCCCCccceecc- Q lcl|NC_020198. 128 NLCYDGQ--------NFFDTDHPVYPNVD-------------------GTGT--ATTVSN---LFAPAADPGAAWYLLD- 174 (304) Q Consensus 128 ~~cyDGq--------~fF~tdH~v~~~~~-------------------~tg~--~~s~sn---l~~~~~~~g~~w~L~d- 174 (304) .-.+|- .+..+........+ ..++ .++.+. |..-...+|. |++. T Consensus 254 -~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~--~i~~~ 330 (428) T protein:vir:10 254 -DTPIGMKARATQWNRLLPWAADAAVNLDTIDTYLDSIILMSMDGNSNMISSGWGMSNRTYMKLFGLRDGNGN--KVYPE 330 (428) T ss_pred -ccccccccccccccccccccccccccHHHHHHHHHHHHHhhhccccccccCEEEEcHHHHHHHHHhhccCCc--eeccC Confidence 011111 00000000000000 0000 000000 0000111222 2221 Q ss_pred -CCccc--hhhhhhhccccchhhcccCcccccc--cccceEEEeeccccccccchhhhhccccccchhHHHHHHHHHHHh Q lcl|NC_020198. 175 -TSRSL--KPLIYQERMKPSFTSMTKEDDEQVF--MADEYRYGVRSRCNVGFGFWQLAAMSTEELNQVNFEKVYDAMRNQ 249 (304) Q Consensus 175 -~~~~~--kP~i~Q~r~~~~~~~~~~~~~~~vf--~~~~~~~Gvd~R~n~G~g~wq~a~~s~~~l~~~~l~aar~aM~~~ 249 (304) ....| .|++.-.-.+-.... ..+...++ .-.+|++|.+.-...-. ..+..+.....-....+..--.++|.. T Consensus 331 ~~~g~l~G~pv~~~~~~p~~~~~--~~~~~~i~~gd~s~~~i~~~~~i~i~~-~~~~~~~~~~~~~~~~f~~~~~~~R~~ 407 (428) T protein:vir:10 331 MAQGMLKGYPIQRTSAIPANLGE--GGKESEIYFADFNDVVIGEDGNMKVDF-SKEASYIDTDGKLVSAFSRNQSLIRVV 407 (428) T ss_pred CCCCeeeceeeEEeccccccccC--CCccceEEEEecceEEEEEecceEEEe-ecccccccccccccchhhcchhheeee Confidence 11011 122211111110000 00011111 11122222211000000 000000000000001111111111211 Q ss_pred ccCCCceeceecCeEEecchHHH Q lcl|NC_020198. 250 KADGGRPLDIRPNLLVVPTTLRS 272 (304) Q Consensus 250 k~~~G~~L~i~P~~LvVpp~le~ 272 (304) ...++.. .+|.-+++-...+. T Consensus 408 ~r~d~~v--~~p~a~~~~t~~~~ 428 (428) T protein:vir:10 408 TEHDIGF--RHPEGLVLGTGVLF 428 (428) T ss_pred eeeCcee--eccceEEEEeccCC Confidence 1112111 12333333333333 No 50 >protein:vir:103285 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277465;genbank:gi:71834107;genbank:GeneID:3562396 Probab=49.18 E-value=0.65 Score=21.62 Aligned_cols=207 Identities=16% Similarity=0.166 Sum_probs=96.2 Q ss_pred CCCccHHH-HHHHHHHHHHHHHHHHhhc--chhhcceE---EEecCCccccccccccCCccchhccccee----eccccc Q lcl|NC_020198. 1 MAIITPAL-ISALKTSFQKHFQDALATA--PSTYLQVA---TVIPSTTASNTYGWLGQFPKLREWIGQRV----IKDMAA 70 (304) Q Consensus 1 maii~~~~-l~~l~~~~~~~f~~a~~~a--~~~~~~~a---~~v~S~~~~~~y~~Lg~~P~lrEw~Ge~~----~~~l~~ 70 (304) |-+--.+. +.-+..-+...=+.-|+.. +=+|+++- +.++.-..+-+|...... ..-+|+|++. .-+..- T Consensus 1 ~~~~~a~~~~~f~~~ql~~id~~v~e~~~~~l~~~~~i~v~~~~~~~~~~~~~~~~~~~-G~a~~~~~~~~dip~v~~~~ 79 (296) T protein:vir:10 1 MGVDKADAAGIWTVKQLTASLNKAYETEYDQNSVVNLFPVSNEIPGYAKYFEYPVFDGV-GIAQIVADYTDDLPLVDALA 79 (296) T ss_pred CcccchhhhHHHHHHHHHHHHHHHHhhhhcccccceecccccCCCCceeEEEeeeeecc-CceeEeCCCccccceeeccc Confidence 55532111 1111111222223334322 22455543 222222233345443332 3345776542 223344 Q ss_pred ccceeeeecccceeecchhhhhcCC---cchhHHHHHHHHHHHHhcHHHHHHHHHhccCCCcccCccccccccccccccc Q lcl|NC_020198. 71 QGYQITNKLFESTVGVKRTDIEDDN---LGVYGPLMQEMGRAAGAHPDELVFALLKAGNANLCYDGQNFFDTDHPVYPNV 147 (304) Q Consensus 71 ~~~~i~nk~fe~tv~v~R~~I~dDd---lG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~cyDGq~fF~tdH~v~~~~ 147 (304) ..++.+...|+..+.++.++|+.=. +-+=.+....+.++.++++|+++| .| +++|.+.+ T Consensus 80 ~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~ka~aA~~~~~~~~n~~~f----~G------------~~~~g~~G-- 141 (296) T protein:vir:10 80 TERQGKVFRFGNAFLISIDEIKVGQATGQSLSTRKQSLAFEAHDKLLDKLVW----SG------------STAHGIPS-- 141 (296) T ss_pred eeEEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEE----ee------------ccccccee-- Confidence 5677888999999999999987542 333345555666666666666665 12 23332211 Q ss_pred ccccchhhhhhhhcccCCCCccceeccCCccchhhhhhhccccchhhcccCcccccccccceEEEeeccccccccchhhh Q lcl|NC_020198. 148 DGTGTATTVSNLFAPAADPGAAWYLLDTSRSLKPLIYQERMKPSFTSMTKEDDEQVFMADEYRYGVRSRCNVGFGFWQLA 227 (304) Q Consensus 148 ~~tg~~~s~snl~~~~~~~g~~w~L~d~~~~~kP~i~Q~r~~~~~~~~~~~~~~~vf~~~~~~~Gvd~R~n~G~g~wq~a 227 (304) |+..+.. | .. ++ -+-|. T Consensus 142 ------------------------LlN~p~v--~------------~~----------------------~~-~~~W~-- 158 (296) T protein:vir:10 142 ------------------------VFDYPNI--N------------NV----------------------VS-GGSWS-- 158 (296) T ss_pred ------------------------EeecCCC--c------------cc----------------------cc-cCCcc-- Confidence 1110000 0 00 00 00121 Q ss_pred hccccccchhHHHHHHHHHHHhccCCCceeceecCeEEecchHHHHHHHHHhhhccCCCCc--------ceecceeeEEe Q lcl|NC_020198. 228 AMSTEELNQVNFEKVYDAMRNQKADGGRPLDIRPNLLVVPTTLRSKAKEVVGVQRLANGAD--------NPNFELVQVLD 299 (304) Q Consensus 228 ~~s~~~l~~~~l~aar~aM~~~k~~~G~~L~i~P~~LvVpp~le~~A~~ll~~~~~~~g~~--------N~~~g~~~~iv 299 (304) ... --.+.+.+++.++..+.. | .+.|+.|++||++...-.+. ..+.|-+ || .++++- T Consensus 159 --~~t-~i~~Di~~~~~~l~~~s~--g---~~~p~~l~L~p~~~~~L~~~----~~~~~~t~l~~ik~~~~---~l~i~~ 223 (296) T protein:vir:10 159 --QPT-TAVSDITSLLDIIETSTN--G---QHRATHLLLPTTARRIMQNL----VPGTSVSYGEFFRQNNS---GVTVEF 223 (296) T ss_pred --CHH-HHHHHHHHHHHHHHHhhC--c---eecceeEEeCHHHHHHHhhc----cCCCCccHHHHHHHhcC---CceEEE Confidence 111 113345555555555533 3 36799999999988644322 2222211 33 366777 Q ss_pred ccccC Q lcl|NC_020198. 300 TAWLN 304 (304) Q Consensus 300 ~p~Ld 304 (304) .|+|+ T Consensus 224 ~~~l~ 228 (296) T protein:vir:10 224 VQYLN 228 (296) T ss_pred eeeec Confidence 88887 No 51 >protein:vir:485 Length: 407 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543092;swissprot:trembl:q8w627;genbank:gi:18249904;uniprot:Q8W627;genbank:GeneID:929693 Probab=42.43 E-value=0.89 Score=20.87 Aligned_cols=225 Identities=11% Similarity=0.029 Sum_probs=86.8 Q ss_pred CC---C--ccHHHHHHHH------------HHHHHHHHHHHhhcchhhcceEEEecCCccccccccccCCccchhcccce Q lcl|NC_020198. 1 MA---I--ITPALISALK------------TSFQKHFQDALATAPSTYLQVATVIPSTTASNTYGWLGQFPKLREWIGQR 63 (304) Q Consensus 1 ma---i--i~~~~l~~l~------------~~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~lrEw~Ge~ 63 (304) |. . ++..-.+++. .-+...+.+.+....+ ..++|+.+|-.+....|.-...-+. -.|++|- T Consensus 90 l~~g~~~~~~~~e~~a~~~~t~~~gG~~iP~~~~~~I~~~~~~~~~-l~~~~~~~~~~~~~~~~~~~~~~~~-a~~v~E~ 167 (407) T protein:vir:48 90 MRKGREDGLRELERKALQVGNDEDGGYAIPEELDRTILTLLKDEVV-MRQEATVITLGGSDYKKLVNLGGTT-SGWVGET 167 (407) T ss_pred HhccchhhhhHHHHHhhhcccCCCCcccccHhHHHHHHHHHHhhhh-hhhhceeeecCCCceEEEEecCCcc-eeeeccc Confidence 00 0 0000000110 0112222233322222 3456777764444444433333333 3677654 Q ss_pred ee-cccc--c-ccceeeeecccceeecchhhhhcCCcchhHHHHHHHHHHHHhcHHHHHHH---------HHhc------ Q lcl|NC_020198. 64 VI-KDMA--A-QGYQITNKLFESTVGVKRTDIEDDNLGVYGPLMQEMGRAAGAHPDELVFA---------LLKA------ 124 (304) Q Consensus 64 ~~-~~l~--~-~~~~i~nk~fe~tv~v~R~~I~dDdlG~~~~~~~~~G~aAa~~~~~lv~~---------lL~~------ 124 (304) .- ...+ . ..-++..++++..+.||++.+.|-...+..-+.+.|+++.+...+..+.. +|.. T Consensus 168 ~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~~~a~l~G~G~~~p~Gil~~~~~~~~ 247 (407) T protein:vir:48 168 DARPETATSKLGLIEPFMGEIYGNPQATQKMLDDAFFNVEDWINSELALEFAEQEEIAFTSGDGSKKPKGFLAYESTDED 247 (407) T ss_pred ccccccccccceeEEeeeeeeEeehhhHHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhccCCCCccceeeeccccccc Confidence 32 1121 1 33567778999999999999999888999999999999988887765431 0100 Q ss_pred ----------------------------------cC--C---------------CcccCcccccccccccccccccccch Q lcl|NC_020198. 125 ----------------------------------GN--A---------------NLCYDGQNFFDTDHPVYPNVDGTGTA 153 (304) Q Consensus 125 ----------------------------------g~--~---------------~~cyDGq~fF~tdH~v~~~~~~tg~~ 153 (304) +. + -+=-+|+|+|..+=..+..--=-|.. T Consensus 248 ~~~~~~~~~~~~~~~~~~~~~~d~i~~l~~~l~~~~~~~a~~v~n~~~~~~L~~lkD~~Gr~l~~~~~~~g~~~~l~G~P 327 (407) T protein:vir:48 248 DKTRAFGKLQHIASGAASGVTADAIIKLIYTLRKAHRSGAKFMMNNSSLFAIRLLKDNDGNYLWRPGIELGQPSSLAGYG 327 (407) T ss_pred ccccccccccccccccccccChHHHHHHHHhhchhhhcCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCCceeccee Confidence 00 0 00011222221110000000000011 Q ss_pred hhhhhhhcccCCCCccceeccCCccchhhhhhhccccchhhcccCcccccccccceEEEeeccccccc------cchhhh Q lcl|NC_020198. 154 TTVSNLFAPAADPGAAWYLLDTSRSLKPLIYQERMKPSFTSMTKEDDEQVFMADEYRYGVRSRCNVGF------GFWQLA 227 (304) Q Consensus 154 ~s~snl~~~~~~~g~~w~L~d~~~~~kP~i~Q~r~~~~~~~~~~~~~~~vf~~~~~~~Gvd~R~n~G~------g~wq~a 227 (304) +.+++-....+....+.++-|.++.+. ...|...++. . +.-|.++...|=+..|+..+. -.-..+ T Consensus 328 V~~~~~~p~~~~~~~~i~~Gd~~~~~~---i~~~~~~~i~-----~-d~~~~~~~~~~~~~~r~d~~v~~~~a~~~l~~~ 398 (407) T protein:vir:48 328 IVENEQMPDIAADAKAIAFGNFKRGYT---IVDRIGTRIL-----R-DPYTNKPFVGFYTTKRTGGMLVDSQAIKLMKIG 398 (407) T ss_pred eEEecCcCCccCCccEEEEEeccccEE---EEEeeceEEE-----e-eccccCCcEEEEEEEEeccEEecccceEEEEee Confidence 111110000000111122233333221 2233333321 1 122445555555555553322 111111 Q ss_pred hccccccch Q lcl|NC_020198. 228 AMSTEELNQ 236 (304) Q Consensus 228 ~~s~~~l~~ 236 (304) -.+++.-.+ T Consensus 399 aa~~~~~~~ 407 (407) T protein:vir:48 399 AATRQKAAA 407 (407) T ss_pred ccCCCCCCC Confidence 111111111 No 52 >protein:vir:6242 Length: 390 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813696;swissprot:trembl:q859c1;genbank:gi:29366756;interpro:IPR006444;uniprot:Q859C1;genbank:GeneID:1258897 Probab=39.79 E-value=0.87 Score=20.92 Aligned_cols=242 Identities=17% Similarity=0.138 Sum_probs=88.0 Q ss_pred CCC-ccHHHHHHHHHH---------------------------HHHHHHHHHhhcchhhcceEEEecCCcc-cccccccc Q lcl|NC_020198. 1 MAI-ITPALISALKTS---------------------------FQKHFQDALATAPSTYLQVATVIPSTTA-SNTYGWLG 51 (304) Q Consensus 1 mai-i~~~~l~~l~~~---------------------------~~~~f~~a~~~a~~~~~~~a~~v~S~~~-~~~y~~Lg 51 (304) ... -..+.+++...+ ++..+...... .+..+.+|++++.+.. .-.+.... T Consensus 84 ~~~~~~~~~~r~~~~~~~r~~~~~~~~~~~t~~~~g~~~~~~~~~~~i~~~~~~-~~~l~~~~~~~~~~~~~~~~~p~~~ 162 (390) T protein:vir:62 84 SADVDDDATLRAGNLGEARSFEFAPEKRDGTKAGNPNVLSRTLYGQLIAQAVER-SAIMRGGATTFTTSDANPLDFTVIT 162 (390) T ss_pred hcchHHHHHHhhhhhhhhHHHHhhhhhhcccccCCCccccccchHHHHHHHHhh-hhhhhhcceeeecCCCceeEEEEEc Confidence 000 000011110000 11111112211 1224556666653322 12233333 Q ss_pred CCccchhcccc---eeecccccccceeeeecccceeecchhhhhcCCcchhHHHHHHHHHHHHhcHHHHHHHHHhccCCC Q lcl|NC_020198. 52 QFPKLREWIGQ---RVIKDMAAQGYQITNKLFESTVGVKRTDIEDDNLGVYGPLMQEMGRAAGAHPDELVFALLKAGNAN 128 (304) Q Consensus 52 ~~P~lrEw~Ge---~~~~~l~~~~~~i~nk~fe~tv~v~R~~I~dDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~ 128 (304) .-|. -.|++| +.-.+..=..-++..++++..+-||++.++|-...+..-+.+.++++.+...++.++ +|.+. T Consensus 163 ~~~~-a~wv~E~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~l----~G~G~ 237 (390) T protein:vir:62 163 GRSS-ASIVGETAEIPESYPATAQRSMGGFKYGFASVVSYEFATDQVLDLVGFLVSDAGPAIGDAMGRHFI----TGTGQ 237 (390) T ss_pred CCcc-eeeecccccccccccceeeeEeeeeeEEeehHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhhhh----ccCCc Confidence 3333 257654 233444445567788999999999999999988899999999999999999888544 33211 Q ss_pred cccCccccccccccccccc-ccccchhhhhhhhc-----ccCC-CCccceeccCCc-c---c-----hhhhhhhc----- Q lcl|NC_020198. 129 LCYDGQNFFDTDHPVYPNV-DGTGTATTVSNLFA-----PAAD-PGAAWYLLDTSR-S---L-----KPLIYQER----- 187 (304) Q Consensus 129 ~cyDGq~fF~tdH~v~~~~-~~tg~~~s~snl~~-----~~~~-~g~~w~L~d~~~-~---~-----kP~i~Q~r----- 187 (304) ++.++...-+..... .++...++..++.. ..+. ....|++=..+. . + +| |||.- T Consensus 238 ----p~Gi~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~l~~~~~~~a~~vmn~~~~~~L~~lkd~~g~~-l~~~~~~~g~ 312 (390) T protein:vir:62 238 ----PRGILTDASPATATFLATDTDSKVSDALIDLFHEVPSAYRANAKYVVNDLRAAQMRKLKDANGQY-LWQSGLTVGA 312 (390) T ss_pred ----cccccccccccccceecccccccchHHHHHHHHhhhhhhhcCCEEEEchHHHHHHHHhhccCCCe-eecCCcCCCc Confidence 122333211100000 01111122222211 0111 111232210000 0 0 01 11100 Q ss_pred -----cccchhhcccCcccccccccceEEEeeccccccccchhhhhccccccchh--HHHHHHH---HHHHhccCCCcee Q lcl|NC_020198. 188 -----MKPSFTSMTKEDDEQVFMADEYRYGVRSRCNVGFGFWQLAAMSTEELNQV--NFEKVYD---AMRNQKADGGRPL 257 (304) Q Consensus 188 -----~~~~~~~~~~~~~~~vf~~~~~~~Gvd~R~n~G~g~wq~a~~s~~~l~~~--~l~aar~---aM~~~k~~~G~~L 257 (304) -.|..+.-.-| .+.+.||--+++.. +. ...++.+ .-..+.. +.+.....+|.++ T Consensus 313 ~~~l~G~Pv~~~~~~p-------~~~i~~gd~s~~~i--~~-------~~~~~v~~~~~~~~~~~~~~~~~~~r~d~~~~ 376 (390) T protein:vir:62 313 PSLFNGKVVETDDGMP-------ADKILFADLSKYRV--RF-------AGSLRVDRSVDAKFSTDQIVYRFLQRADGLLV 376 (390) T ss_pred cceecccceEEecCCC-------CccEEEeeccceeE--Ee-------ecceEEEeeccccccCCcEEEEEEEEeCcEee Confidence 01111111111 12333432111111 00 1111111 0011111 2222223344443 Q ss_pred ceecCe-E-Eecch Q lcl|NC_020198. 258 DIRPNL-L-VVPTT 269 (304) Q Consensus 258 ~i~P~~-L-vVpp~ 269 (304) +-..=. | |.+.+ T Consensus 377 ~~~A~~~l~~~~~a 390 (390) T protein:vir:62 377 DARGAKVLTVTPGA 390 (390) T ss_pred chhheEEEEeecCC Confidence 222211 1 22333 No 53 >protein:vir:78223 Length: 333 # NCBI annotation: Putative major head protein # Family: family:all:966 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491666;genbank:gi:157786490;genbank:GeneID:5625701 Probab=39.47 E-value=1 Score=20.55 Aligned_cols=242 Identities=14% Similarity=0.120 Sum_probs=95.4 Q ss_pred CCCccHHHHHHHHH------------------HHHHHHHHHHhhcchhhcceEEEecCCccccccccccCCccchhcccc Q lcl|NC_020198. 1 MAIITPALISALKT------------------SFQKHFQDALATAPSTYLQVATVIPSTTASNTYGWLGQFPKLREWIGQ 62 (304) Q Consensus 1 maii~~~~l~~l~~------------------~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~lrEw~Ge 62 (304) ||.++. |++... .+...+-+.+...- ...++|+++|-.+....+..+..-|. -.|+|| T Consensus 1 ~a~l~e--l~~~~~~~~~~g~~~~~~~~liP~~~~~~ii~~l~~~s-~l~~~~~~~~~~~~~~~~p~~~~~~~-a~~v~e 76 (333) T protein:vir:78 1 MATLNE--LLPNSAGSNHQGRLAHVPSDLLPKEIVGPIFDKAQESS-LVLRMGEQIPISYGETIIPTTVKRPE-VGQVGV 76 (333) T ss_pred CchhHH--hhhhcccccccCceecCCccccchhHHHHHHHHHHhhc-hhhhhcceeeccCCceEEEEEeCCce-eEeecC Confidence 554432 211111 11122222222211 23455666664444444444433333 234443 Q ss_pred ee-----------ecccccccceeeeecccceeecchhhhhcCCcchhHHHHHHHHHHHHhcHHHHHHHHHhccCCCccc Q lcl|NC_020198. 63 RV-----------IKDMAAQGYQITNKLFESTVGVKRTDIEDDNLGVYGPLMQEMGRAAGAHPDELVFALLKAGNANLCY 131 (304) Q Consensus 63 ~~-----------~~~l~~~~~~i~nk~fe~tv~v~R~~I~dDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~cy 131 (304) -+ ..+..=..-++.-++.+..+.|+|+.+.+....+..-+.+.|+++.++.+++.++. |.+..- T Consensus 77 g~~~~~~e~~~~~~~~~~f~~i~l~~~kl~~~~~is~ell~~s~~~~~~~i~~~la~ai~~~~d~~~l~----G~g~~~- 151 (333) T protein:vir:78 77 GTSNEQREGGLKPLSGTAWDTRSVSPIKLATIVTVSEEFARMNPSGLYTKLQGDLAYAIGRGIDLAVFH----GKSPLT- 151 (333) T ss_pred cccccccccccccccccceeEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhc----ccCCCC- Confidence 32 22222233467778999999999999999999999999999999999999987753 222110 Q ss_pred CcccccccccccccccccccchhhhhhhhcccCCCCccceeccCCccchhhhhhhccccchhhcccCcccccccccceEE Q lcl|NC_020198. 132 DGQNFFDTDHPVYPNVDGTGTATTVSNLFAPAADPGAAWYLLDTSRSLKPLIYQERMKPSFTSMTKEDDEQVFMADEYRY 211 (304) Q Consensus 132 DGq~fF~tdH~v~~~~~~tg~~~s~snl~~~~~~~g~~w~L~d~~~~~kP~i~Q~r~~~~~~~~~~~~~~~vf~~~~~~~ 211 (304) +..+-..-+ .++ ..+....... +..+.. ..| .+.=++....... ....++| + T Consensus 152 -~~~~~g~~~-------~~~-~~~~~~~~~~-~~~~~~--~~~---~i~~~~~~~~~~~-------~~~~~~~-----v- 203 (333) T protein:vir:78 152 -GSALQGIDT-------DNV-IANTTNVDYL-QETGDP--LLD---RLLDGYDLVSANT-------DVEFNGW-----A- 203 (333) T ss_pred -Ccccccccc-------ccc-cccccccccc-ccccch--hHH---HHHHHHHhhcccc-------ccCceEE-----E- Confidence 000000000 000 0000010000 000111 010 0000000000000 0001111 0 Q ss_pred EeeccccccccchhhhhccccccchhHHHHHHHHHHHhccCCCceeceecCeEEecchHHHHHHHHHhhhccCC------ Q lcl|NC_020198. 212 GVRSRCNVGFGFWQLAAMSTEELNQVNFEKVYDAMRNQKADGGRPLDIRPNLLVVPTTLRSKAKEVVGVQRLAN------ 285 (304) Q Consensus 212 Gvd~R~n~G~g~wq~a~~s~~~l~~~~l~aar~aM~~~k~~~G~~L~i~P~~LvVpp~le~~A~~ll~~~~~~~------ 285 (304) ++. ........++..|+.+|+++-...-.-..|..+. ..-++.++.++. T Consensus 204 ----------------------mn~-~~~~~L~~~~~~~d~~G~~i~~~~~~~~~~~~l~--G~Pv~~~~~i~~~~~~~~ 258 (333) T protein:vir:78 204 ----------------------VDP-RFRAHLLRAQAYRDANGNVDPSRINLAAQTGDVL--GLPAQFGRAVGGDLGAAV 258 (333) T ss_pred ----------------------Ecc-hHHHHHHHHhhhcCCCCceeecCccccCCCceee--ceeeEEccccCCCccccC Confidence 111 1122233466677778887642110000000000 011111111111 Q ss_pred --------CC-ccee---cceeeEEeccccC Q lcl|NC_020198. 286 --------GA-DNPN---FELVQVLDTAWLN 304 (304) Q Consensus 286 --------g~-~N~~---~g~~~~iv~p~Ld 304 (304) |+ .+-+ ++-+++.++++-+ T Consensus 259 ~~~~~~~~gD~~~~~~g~~~~~~i~~~~~~~ 289 (333) T protein:vir:78 259 DSKTRIIGGDFSQLKFGFADEIRIKMSDTAT 289 (333) T ss_pred CCccEEEEEecccEEEEEeeccEEEEecccc Confidence 11 1111 2346666666543 No 54 >protein:vir:95763 Length: 297 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950590;genbank:gi:119953785;genbank:GeneID:5076833 Probab=39.44 E-value=1 Score=20.54 Aligned_cols=223 Identities=8% Similarity=0.009 Sum_probs=95.2 Q ss_pred CCCccHHHHHHHH-HHHHHHHHHHHhhcchhhcceEEEecCCccc-cccccccCCccchhcccc---eeeccccccccee Q lcl|NC_020198. 1 MAIITPALISALK-TSFQKHFQDALATAPSTYLQVATVIPSTTAS-NTYGWLGQFPKLREWIGQ---RVIKDMAAQGYQI 75 (304) Q Consensus 1 maii~~~~l~~l~-~~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~-~~y~~Lg~~P~lrEw~Ge---~~~~~l~~~~~~i 75 (304) |.+.+++.-..|. ..+...+-+...+. +...++|+++|..... ..+.+...-|.. .|++| +...+..=..-++ T Consensus 9 ~~~~~t~~~~~lvP~~~~~~ii~~~~~~-s~l~~~~~~~~~~~~~~~~~~~~~~~~~a-~~v~Eg~~~~~~~~~f~~v~l 86 (297) T protein:vir:95 9 ENVLVSQKKDGTLHKEFTDIIMKEVAQN-SLVMQLGQYQEMEGEQEKTVYVQTDGISA-YWVNETEKIKTDKPEVVPVTL 86 (297) T ss_pred ccccccCCCcceechhHHHHHHHHHHhh-chhhhhcceeecCCCccEEEEEEcCCcee-EEeecCccccccccceeEEEE Confidence 4333333222221 22222333333222 2355667777744333 344445555543 57654 3333333355678 Q ss_pred eeecccceeecchhhhhcCCcchhHHHHHHHHHHHHhcHHHHHHHHHhccCCCcccCcccccccccccccccccccchhh Q lcl|NC_020198. 76 TNKLFESTVGVKRTDIEDDNLGVYGPLMQEMGRAAGAHPDELVFALLKAGNANLCYDGQNFFDTDHPVYPNVDGTGTATT 155 (304) Q Consensus 76 ~nk~fe~tv~v~R~~I~dDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~cyDGq~fF~tdH~v~~~~~~tg~~~s 155 (304) +-++++..+.|+|+.++|-+..+.+-+...++++.++..++.++ +|. |.+- |.+ .+...+. T Consensus 87 ~~~k~~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l----~G~------g~~~-----~~g-i~~~~~~--- 147 (297) T protein:vir:95 87 KAHKLGIILVTSREALNYTWKKFFEDMKPQIVEAFYKKIDEAGL----LGH------DTPF-----ANS-VAKAAKD--- 147 (297) T ss_pred eeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHh----ccc------CCcc-----ccc-ccccccc--- Confidence 88999999999999999988999999999999999999988876 232 2110 000 0000000 Q ss_pred hhhhhcccCCCCccceeccCCccchhhhhhhccccchhhcccCccccc-ccccceEEEeeccccccccchhhhhcccccc Q lcl|NC_020198. 156 VSNLFAPAADPGAAWYLLDTSRSLKPLIYQERMKPSFTSMTKEDDEQV-FMADEYRYGVRSRCNVGFGFWQLAAMSTEEL 234 (304) Q Consensus 156 ~snl~~~~~~~g~~w~L~d~~~~~kP~i~Q~r~~~~~~~~~~~~~~~v-f~~~~~~~Gvd~R~n~G~g~wq~a~~s~~~l 234 (304) ..+.. .. . ++-+++ -+.... . ...+.+ .-| -+ T Consensus 148 ~~~~~-~~---~------------------------------~t~~~i~~~~~~l-~-~~~~~~---~~~--------v~ 180 (297) T protein:vir:95 148 ANKVI-GG---P------------------------------INYDNILKLQDAL-Y-DADVEP---NAF--------VS 180 (297) T ss_pred cceec-cc---c------------------------------cCHHHHHHHHHHh-h-hccCCc---CEE--------EE Confidence 00000 00 0 000000 000000 0 000000 011 12 Q ss_pred chhHHHHHHHHHHHhccCCCceecee-cCeEE-ecchHHHHHHHHHhhhccCCCCcceec-----------ceeeEEecc Q lcl|NC_020198. 235 NQVNFEKVYDAMRNQKADGGRPLDIR-PNLLV-VPTTLRSKAKEVVGVQRLANGADNPNF-----------ELVQVLDTA 301 (304) Q Consensus 235 ~~~~l~aar~aM~~~k~~~G~~L~i~-P~~Lv-Vpp~le~~A~~ll~~~~~~~g~~N~~~-----------g~~~~iv~p 301 (304) +.. .+..+++.|+..|++|--. +..|. .| ++.+...+....-++. +-+++-+.. T Consensus 181 ~~~----~~~~L~~l~d~~G~~i~~~~~~~l~G~P---------v~~~~~~~~~~~~~~~gd~s~~~~~~~~~~~i~~~~ 247 (297) T protein:vir:95 181 KIQ----NRSALREARDGNKVSIYDKAANTIDGIT---------TVDLKSARFEKGDLLAGDFDNLIYGVPYNITYKISE 247 (297) T ss_pred cHH----HHHHHHHhhccCCceeecCCCCccccee---------eEeecCCCCCCceEEEEecccEEEEEecCeEEEEee Confidence 222 2355667788888875321 11111 11 0111111111111111 112222222 Q ss_pred ccC Q lcl|NC_020198. 302 WLN 304 (304) Q Consensus 302 ~Ld 304 (304) ... T Consensus 248 ~~~ 250 (297) T protein:vir:95 248 EGQ 250 (297) T ss_pred ccc Confidence 221 No 55 >protein:vir:105905 Length: 304 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004375;genbank:gi:122891830;genbank:GeneID:4712376 Probab=39.32 E-value=1 Score=20.53 Aligned_cols=231 Identities=13% Similarity=0.105 Sum_probs=94.8 Q ss_pred CCCcc--HHHHH------H-HHHHHHHHHHHHHhhcchhhcceEEEecCCccccccccccCCccchhccccee---eccc Q lcl|NC_020198. 1 MAIIT--PALIS------A-LKTSFQKHFQDALATAPSTYLQVATVIPSTTASNTYGWLGQFPKLREWIGQRV---IKDM 68 (304) Q Consensus 1 maii~--~~~l~------~-l~~~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~lrEw~Ge~~---~~~l 68 (304) ||.-. +.+.. . +-..+...+.+.....-+ ..++|+.+|-....-+|..+..-+.. .|++|-. ..+. T Consensus 1 ma~~~~~~~~~~~t~~gg~lip~~~~~~ii~~~~~~~~-l~~~~~~~~~~~~~~~ip~~~~~~~a-~~v~E~~~~~~~~~ 78 (304) T protein:vir:10 1 MATPTYTPGNVILSDFKNGVIPAEQGTLIMKDIMANSA-IMKLAKNEPMTAQKKKFTYLAKGVGA-YWVSETERIQTSKP 78 (304) T ss_pred CcccccccccccccCCCceecchhHHHHHHHHHHhccc-hhhhcceeeccCCceEEEEEeCCcce-EEeecCcccccccc Confidence 65422 11110 0 111222333333333322 45567777755444455555544443 6876543 2233 Q ss_pred ccccceeeeecccceeecchhhhhcCCcchhHHHHHHHHHHHHhcHHHHHHHHHhccCCCcccCcccccccccccccccc Q lcl|NC_020198. 69 AAQGYQITNKLFESTVGVKRTDIEDDNLGVYGPLMQEMGRAAGAHPDELVFALLKAGNANLCYDGQNFFDTDHPVYPNVD 148 (304) Q Consensus 69 ~~~~~~i~nk~fe~tv~v~R~~I~dDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~cyDGq~fF~tdH~v~~~~~ 148 (304) .=..-+++.++++..+.|+|+.++|-...+..-+.+.++++.++..++.++. |.... ++..- .+ . T Consensus 79 ~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~ia~~~d~~~l~----G~g~~----~~~~~--~~-----~ 143 (304) T protein:vir:10 79 EYAQAEMEAKKIGVIIPLSKEFLKWTAKDFFNEVKPLIAEAFYKAFDQAVIF----GTKSP----YNTST--SG-----K 143 (304) T ss_pred eeeEEEEEEEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhee----ccCCC----ccccc--cc-----c Confidence 3345667789999999999999999999999999999999999888776531 21110 11000 00 0 Q ss_pred cccchhhhhhhhcccCCCCccceeccCCccchhhhhhhccccchhhcccCcccccccccceEEEeeccccccccchhhhh Q lcl|NC_020198. 149 GTGTATTVSNLFAPAADPGAAWYLLDTSRSLKPLIYQERMKPSFTSMTKEDDEQVFMADEYRYGVRSRCNVGFGFWQLAA 228 (304) Q Consensus 149 ~tg~~~s~snl~~~~~~~g~~w~L~d~~~~~kP~i~Q~r~~~~~~~~~~~~~~~vf~~~~~~~Gvd~R~n~G~g~wq~a~ 228 (304) +.-......... ...+...+- .|.-++.+.. +. .+.+++ | T Consensus 144 ~~~~~~~~~~~~--~~~~~~~~~------~i~~~~~~l~----------~~---------------~~~~~~---~---- 183 (304) T protein:vir:10 144 PLVEGAEEKGNV--VTDTNNLYV------DLSALMATIE----------DE---------------ELDPNG---V---- 183 (304) T ss_pred cccccccccccc--cccccchHH------HHHHHHHHhh----------hc---------------cCCcCE---E---- Confidence 000000000000 000000000 0111111100 00 000110 1 Q ss_pred ccccccchhHHHHHHHHHHHhccCCCceece-ecCeEEecchHHHHHHHHHhhhccCCC---------C-cceecce--- Q lcl|NC_020198. 229 MSTEELNQVNFEKVYDAMRNQKADGGRPLDI-RPNLLVVPTTLRSKAKEVVGVQRLANG---------A-DNPNFEL--- 294 (304) Q Consensus 229 ~s~~~l~~~~l~aar~aM~~~k~~~G~~L~i-~P~~LvVpp~le~~A~~ll~~~~~~~g---------~-~N~~~g~--- 294 (304) -++. ..+.++++.|+..|+||-. .|..|. -..+.-++..+.. + .+.+.+. T Consensus 184 ----v~~~----~~~~~L~~lkd~~G~~l~~~~~~~l~--------G~PV~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~ 247 (304) T protein:vir:10 184 ----LTTR----SFRSKMRNALDANDRPLFDANGNEIM--------GLPLSYTGADVYDKKKSLALMGDWDYARYGILQG 247 (304) T ss_pred ----EEcH----HHHHHHHHhhccCCcEeecCCCcccc--------ceeeEEecccccCCCCcEEEEEehhhEEEEEecc Confidence 1122 2345667788999998722 111110 0111111111111 1 1111111 Q ss_pred eeEEe--ccccC Q lcl|NC_020198. 295 VQVLD--TAWLN 304 (304) Q Consensus 295 ~~~iv--~p~Ld 304 (304) +++-+ ++-+. T Consensus 248 ~~i~~~~e~~~~ 259 (304) T protein:vir:10 248 IEYAISEDATLT 259 (304) T ss_pred eEEEEeecceee Confidence 11111 11111 No 56 >protein:vir:94142 Length: 304 # NCBI annotation: ORF013 # Family: family:all:507 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240234;genbank:gi:66395898;genbank:GeneID:5133311 Probab=39.32 E-value=1 Score=20.53 Aligned_cols=231 Identities=13% Similarity=0.105 Sum_probs=94.8 Q ss_pred CCCcc--HHHHH------H-HHHHHHHHHHHHHhhcchhhcceEEEecCCccccccccccCCccchhccccee---eccc Q lcl|NC_020198. 1 MAIIT--PALIS------A-LKTSFQKHFQDALATAPSTYLQVATVIPSTTASNTYGWLGQFPKLREWIGQRV---IKDM 68 (304) Q Consensus 1 maii~--~~~l~------~-l~~~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~lrEw~Ge~~---~~~l 68 (304) ||.-. +.+.. . +-..+...+.+.....-+ ..++|+.+|-....-+|..+..-+.. .|++|-. ..+. T Consensus 1 ma~~~~~~~~~~~t~~gg~lip~~~~~~ii~~~~~~~~-l~~~~~~~~~~~~~~~ip~~~~~~~a-~~v~E~~~~~~~~~ 78 (304) T protein:vir:94 1 MATPTYTPGNVILSDFKNGVIPAEQGTLIMKDIMANSA-IMKLAKNEPMTAQKKKFTYLAKGVGA-YWVSETERIQTSKP 78 (304) T ss_pred CcccccccccccccCCCceecchhHHHHHHHHHHhccc-hhhhcceeeccCCceEEEEEeCCcce-EEeecCcccccccc Confidence 65422 11110 0 111222333333333322 45567777755444455555544443 6876543 2233 Q ss_pred ccccceeeeecccceeecchhhhhcCCcchhHHHHHHHHHHHHhcHHHHHHHHHhccCCCcccCcccccccccccccccc Q lcl|NC_020198. 69 AAQGYQITNKLFESTVGVKRTDIEDDNLGVYGPLMQEMGRAAGAHPDELVFALLKAGNANLCYDGQNFFDTDHPVYPNVD 148 (304) Q Consensus 69 ~~~~~~i~nk~fe~tv~v~R~~I~dDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~cyDGq~fF~tdH~v~~~~~ 148 (304) .=..-+++.++++..+.|+|+.++|-...+..-+.+.++++.++..++.++. |.... ++..- .+ . T Consensus 79 ~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~ia~~~d~~~l~----G~g~~----~~~~~--~~-----~ 143 (304) T protein:vir:94 79 EYAQAEMEAKKIGVIIPLSKEFLKWTAKDFFNEVKPLIAEAFYKAFDQAVIF----GTKSP----YNTST--SG-----K 143 (304) T ss_pred eeeEEEEEEEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhee----ccCCC----ccccc--cc-----c Confidence 3345667789999999999999999999999999999999999888776531 21110 11000 00 0 Q ss_pred cccchhhhhhhhcccCCCCccceeccCCccchhhhhhhccccchhhcccCcccccccccceEEEeeccccccccchhhhh Q lcl|NC_020198. 149 GTGTATTVSNLFAPAADPGAAWYLLDTSRSLKPLIYQERMKPSFTSMTKEDDEQVFMADEYRYGVRSRCNVGFGFWQLAA 228 (304) Q Consensus 149 ~tg~~~s~snl~~~~~~~g~~w~L~d~~~~~kP~i~Q~r~~~~~~~~~~~~~~~vf~~~~~~~Gvd~R~n~G~g~wq~a~ 228 (304) +.-......... ...+...+- .|.-++.+.. +. .+.+++ | T Consensus 144 ~~~~~~~~~~~~--~~~~~~~~~------~i~~~~~~l~----------~~---------------~~~~~~---~---- 183 (304) T protein:vir:94 144 PLVEGAEEKGNV--VTDTNNLYV------DLSALMATIE----------DE---------------ELDPNG---V---- 183 (304) T ss_pred cccccccccccc--cccccchHH------HHHHHHHHhh----------hc---------------cCCcCE---E---- Confidence 000000000000 000000000 0111111100 00 000110 1 Q ss_pred ccccccchhHHHHHHHHHHHhccCCCceece-ecCeEEecchHHHHHHHHHhhhccCCC---------C-cceecce--- Q lcl|NC_020198. 229 MSTEELNQVNFEKVYDAMRNQKADGGRPLDI-RPNLLVVPTTLRSKAKEVVGVQRLANG---------A-DNPNFEL--- 294 (304) Q Consensus 229 ~s~~~l~~~~l~aar~aM~~~k~~~G~~L~i-~P~~LvVpp~le~~A~~ll~~~~~~~g---------~-~N~~~g~--- 294 (304) -++. ..+.++++.|+..|+||-. .|..|. -..+.-++..+.. + .+.+.+. T Consensus 184 ----v~~~----~~~~~L~~lkd~~G~~l~~~~~~~l~--------G~PV~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~ 247 (304) T protein:vir:94 184 ----LTTR----SFRSKMRNALDANDRPLFDANGNEIM--------GLPLSYTGADVYDKKKSLALMGDWDYARYGILQG 247 (304) T ss_pred ----EEcH----HHHHHHHHhhccCCcEeecCCCcccc--------ceeeEEecccccCCCCcEEEEEehhhEEEEEecc Confidence 1122 2345667788999998722 111110 0111111111111 1 1111111 Q ss_pred eeEEe--ccccC Q lcl|NC_020198. 295 VQVLD--TAWLN 304 (304) Q Consensus 295 ~~~iv--~p~Ld 304 (304) +++-+ ++-+. T Consensus 248 ~~i~~~~e~~~~ 259 (304) T protein:vir:94 248 IEYAISEDATLT 259 (304) T ss_pred eEEEEeecceee Confidence 11111 11111 No 57 >protein:vir:7990 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817344;genbank:gi:29565772;genbank:GeneID:1258978 Probab=38.48 E-value=1.1 Score=20.43 Aligned_cols=191 Identities=11% Similarity=0.117 Sum_probs=88.9 Q ss_pred CCCcc--HHHHHHHHHHHHHHHHHHHhhc---chhhcceEEEecCCccccccccccCCccchhcc---cceeeccccccc Q lcl|NC_020198. 1 MAIIT--PALISALKTSFQKHFQDALATA---PSTYLQVATVIPSTTASNTYGWLGQFPKLREWI---GQRVIKDMAAQG 72 (304) Q Consensus 1 maii~--~~~l~~l~~~~~~~f~~a~~~a---~~~~~~~a~~v~S~~~~~~y~~Lg~~P~lrEw~---Ge~~~~~l~~~~ 72 (304) ||+-+ +++... .+-..|++.+--+ ...|+.. .....+-+...+|. +...+.. +......+.+.. T Consensus 1 MA~~~~~pei~~~---~v~~~~~~~lv~~~l~~~~~~~~----~~~GdTv~ip~~~~-~~~~d~~~~~~~~~~~~~~~~~ 72 (273) T protein:vir:79 1 MAFNNFIPELWSD---MLLEEWTAQTVFANLVNREYEGI----ASKGNVVHIAGVVA-PTVKDYKAAGRQTSADAISDTG 72 (273) T ss_pred CcchhhhHHHHHH---HHHHHHHhhccchhhhhcccccc----ccCCcEEEEeecCc-ccccccccCCCccCccccccce Confidence 99833 333333 2334444443211 1122211 11111222222221 2223322 223455677777 Q ss_pred ceeeeecc-cceeecchhhhhcCCcchhHHHHHHHHHHHHhcHHHHHHHHHhccCCCcccCccccccccccccccccccc Q lcl|NC_020198. 73 YQITNKLF-ESTVGVKRTDIEDDNLGVYGPLMQEMGRAAGAHPDELVFALLKAGNANLCYDGQNFFDTDHPVYPNVDGTG 151 (304) Q Consensus 73 ~~i~nk~f-e~tv~v~R~~I~dDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~cyDGq~fF~tdH~v~~~~~~tg 151 (304) -+++..++ ...+.|+..+-.-+... +..+.++++++-++--|..+++++..+.+ ++ ..+ T Consensus 73 ~~~tid~~~~~~~~i~d~d~~~~~~~-~~~~~~~~~~ala~~vD~~i~~~~~~a~~--------------~~-----~~~ 132 (273) T protein:vir:79 73 VDLLIDQEKSIDFLVDDIDRVQVAGS-LEAYTRAGATALATDTDKFIADMLVDNGT--------------AL-----TGS 132 (273) T ss_pred EEEEEeeecccceeeccHHHHhhccc-HHHHHHHHHHHHHHHHHHHHHHHHhhccc--------------cc-----ccc Confidence 77777553 66677775444443333 46788889999888889988888854210 00 000 Q ss_pred chhhhhhhhcccCCCCccceeccCCccchhhhhhhccccchhhcccCcccccccccceEEEeeccccccccchhhhhccc Q lcl|NC_020198. 152 TATTVSNLFAPAADPGAAWYLLDTSRSLKPLIYQERMKPSFTSMTKEDDEQVFMADEYRYGVRSRCNVGFGFWQLAAMST 231 (304) Q Consensus 152 ~~~s~snl~~~~~~~g~~w~L~d~~~~~kP~i~Q~r~~~~~~~~~~~~~~~vf~~~~~~~Gvd~R~n~G~g~wq~a~~s~ 231 (304) ..++ T Consensus 133 ~~~~---------------------------------------------------------------------------- 136 (273) T protein:vir:79 133 APSD---------------------------------------------------------------------------- 136 (273) T ss_pred cccc---------------------------------------------------------------------------- Confidence 0000 Q ss_pred cccchhHHHHHHHHHHHhccCCCceeceecCeEEecchHHHHHHH----HHhhhccCCCCccee--------cceeeEEe Q lcl|NC_020198. 232 EELNQVNFEKVYDAMRNQKADGGRPLDIRPNLLVVPTTLRSKAKE----VVGVQRLANGADNPN--------FELVQVLD 299 (304) Q Consensus 232 ~~l~~~~l~aar~aM~~~k~~~G~~L~i~P~~LvVpp~le~~A~~----ll~~~~~~~g~~N~~--------~g~~~~iv 299 (304) .....+.+.+|+++|.... -|- ..++|||+|.....-++ +.++.. .|+.+.+ .| ++++. T Consensus 137 ~~~~~~~i~~a~~~ld~~~----vP~--~~R~lvv~p~~~~~Ll~~~~~~~~~~~--~~~~~~l~~G~ig~~~G-~~i~~ 207 (273) T protein:vir:79 137 ADDAFDLIASALKELTKAN----VPN--VGRVVVVNAEMAFWLRSSGSKLTSADT--SGDAAGLRAGTIGNLLG-ARIVE 207 (273) T ss_pred hhhHHHHHHHHHHHhhhcc----CCc--cCcEEEECHHHHHHHhhchhhhhhhhh--cccccceeeeEeeEEec-eEEEe Confidence 0000122334444443332 222 12478898877664322 222211 2333333 34 68888 Q ss_pred ccccC Q lcl|NC_020198. 300 TAWLN 304 (304) Q Consensus 300 ~p~Ld 304 (304) ++.|. T Consensus 208 s~~lp 212 (273) T protein:vir:79 208 SNNLR 212 (273) T ss_pred ccccc Confidence 88886 No 58 >protein:vir:80068 Length: 301 # NCBI annotation: gp8 # Family: family:all:463 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468712;genbank:gi:157325292;genbank:GeneID:5601759 Probab=33.45 E-value=1.4 Score=19.86 Aligned_cols=211 Identities=11% Similarity=0.063 Sum_probs=100.5 Q ss_pred CCCccHHHHHHHHHHHHHHHHHHHhhcchhhcce---EEEecCCccccccccccCCccchhcccceee----cccccccc Q lcl|NC_020198. 1 MAIITPALISALKTSFQKHFQDALATAPSTYLQV---ATVIPSTTASNTYGWLGQFPKLREWIGQRVI----KDMAAQGY 73 (304) Q Consensus 1 maii~~~~l~~l~~~~~~~f~~a~~~a~~~~~~~---a~~v~S~~~~~~y~~Lg~~P~lrEw~Ge~~~----~~l~~~~~ 73 (304) .+.+..+.++.|-..+.....+-+. .+++ .+.++-...+-+|... +.-..-+|+|+..- -+..-..+ T Consensus 6 ~g~f~~~~l~~id~~v~e~~~~~l~-----~r~l~~v~~~~~~~~~~~~~~~~-~~~G~~~~~~~~~~dip~~~~~~~~~ 79 (301) T protein:vir:80 6 TATIEARDLQAIDNVIYEPKQEELT-----ARSVFPQKFDVNEGAESYSFDVM-TRSGAAKIIANGADDLPLVDVDMVRK 79 (301) T ss_pred cchhhHHHHHHHHHHHHHhhhhhhh-----hhhhcccccCCCCceEEEEEeee-ccceeEEEecCcccccccccccceeE Confidence 4445556666665444433333222 2222 1222222222233322 22234466655321 12223567 Q ss_pred eeeeecccceeecchhhhhcCC---cchhHHHHHHHHHHHHhcHHHHHHHHHhccCCCcccCcccccccccccccccccc Q lcl|NC_020198. 74 QITNKLFESTVGVKRTDIEDDN---LGVYGPLMQEMGRAAGAHPDELVFALLKAGNANLCYDGQNFFDTDHPVYPNVDGT 150 (304) Q Consensus 74 ~i~nk~fe~tv~v~R~~I~dDd---lG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~cyDGq~fF~tdH~v~~~~~~t 150 (304) ..+...|+..+.+++++++.=. +.+=.+....+.++.++++|+++| .|. +.|.+++-+... T Consensus 80 ~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~f----~G~------------~~~g~~GLlN~p 143 (301) T protein:vir:80 80 SVPIYSIGIGLSYTIQDLRAARMQGTTVDAAKATTVRRAIAEKENSIAF----RGE------------KKYAIKGAFEAT 143 (301) T ss_pred EEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEe----eec------------ccccceeeecCC Confidence 8888999999999999998763 444466677777888888888777 221 122111111000 Q ss_pred cchhhhhhhhcc--cCCCCccceeccCCccchhhhhhhccccchhhcccCcccccccccceEEEeeccccccccchhhhh Q lcl|NC_020198. 151 GTATTVSNLFAP--AADPGAAWYLLDTSRSLKPLIYQERMKPSFTSMTKEDDEQVFMADEYRYGVRSRCNVGFGFWQLAA 228 (304) Q Consensus 151 g~~~s~snl~~~--~~~~g~~w~L~d~~~~~kP~i~Q~r~~~~~~~~~~~~~~~vf~~~~~~~Gvd~R~n~G~g~wq~a~ 228 (304) + +....++ .+.....|--- T Consensus 144 ~----~~~~~~~~~~~~~~~~w~~~------------------------------------------------------- 164 (301) T protein:vir:80 144 G----IQIDVSPTTGVGNVSKWEKK------------------------------------------------------- 164 (301) T ss_pred C----cccccccCcccccccccccC------------------------------------------------------- Confidence 0 0000000 00011122100 Q ss_pred ccccccchhHHHHHHHHHHHhccCCCceeceecCeEEecchHHHHHHHHHhhhccCCCC--------cceecceeeEEec Q lcl|NC_020198. 229 MSTEELNQVNFEKVYDAMRNQKADGGRPLDIRPNLLVVPTTLRSKAKEVVGVQRLANGA--------DNPNFELVQVLDT 300 (304) Q Consensus 229 ~s~~~l~~~~l~aar~aM~~~k~~~G~~L~i~P~~LvVpp~le~~A~~ll~~~~~~~g~--------~N~~~g~~~~iv~ 300 (304) +..=-.+-+.+++.++..+.+ | ...|..|++||+....-.+...++. .|. .||+ ++++-. T Consensus 165 --t~~ei~~di~~~~~~l~~~s~--g---~~~p~~L~L~p~~~~~L~~~~~~~~--~~~tvl~~l~~~~~~---~~I~~~ 232 (301) T protein:vir:80 165 --TAEQIIDEIGEAHTKITVLPG--Y---GTASLKLCLPPKQFELINKKRYSNE--DSRSVLKVLQDNAWF---SAIVRV 232 (301) T ss_pred --CHHHHHHHHHHHHHHHHHhcC--c---eecccEEEecHHHHHhhhhccccCC--CCeeHHHHHHHHcCc---ceEEEc Confidence 000012335556666655533 2 2468999999997765433222111 111 2443 667777 Q ss_pred cccC Q lcl|NC_020198. 301 AWLN 304 (304) Q Consensus 301 p~Ld 304 (304) |+|+ T Consensus 233 p~L~ 236 (301) T protein:vir:80 233 PDLA 236 (301) T ss_pred ceec Confidence 8887 No 59 >protein:vir:7409 Length: 408 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839926;genbank:gi:30089896;genbank:GeneID:1260683 Probab=32.68 E-value=1.4 Score=19.77 Aligned_cols=233 Identities=12% Similarity=0.084 Sum_probs=95.6 Q ss_pred CCCccHHHHHHHHH------------HHHHHHHHHHhhcchhhcceEEEecCCccccccccc--cCCccchhccccee-e Q lcl|NC_020198. 1 MAIITPALISALKT------------SFQKHFQDALATAPSTYLQVATVIPSTTASNTYGWL--GQFPKLREWIGQRV-I 65 (304) Q Consensus 1 maii~~~~l~~l~~------------~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~L--g~~P~lrEw~Ge~~-~ 65 (304) +...+....+++.. -+...+.+..... +....+|+.+|.+.....+.+. ......-.|++|-. + T Consensus 105 ~~~~~~~~~~a~~~~~~~~gg~~vP~~~~~~Ii~~~~~~-~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~ 183 (408) T protein:vir:74 105 MAFLNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQY-DSLQQYVRVESVSTSSGSRVYEKWTDVTPLKAMDEEDGKI 183 (408) T ss_pred hhhhhhhhhhhhcccccCCCceeechhHhhHHHHHHhhh-cchhhhcceeeccCCcceEEEEeecCCccccccccccccc Confidence 11111111111100 0111111111111 1133445666644444444432 33444456876532 2 Q ss_pred c---ccccccceeeeecccceeecchhhhhcCCcchhHHHHHHHHHHHHhcHHHHHHHHHhccCC---CcccCcc-cccc Q lcl|NC_020198. 66 K---DMAAQGYQITNKLFESTVGVKRTDIEDDNLGVYGPLMQEMGRAAGAHPDELVFALLKAGNA---NLCYDGQ-NFFD 138 (304) Q Consensus 66 ~---~l~~~~~~i~nk~fe~tv~v~R~~I~dDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~---~~cyDGq-~fF~ 138 (304) . +..=..-++..++++..+.||++.+.|-..++.+-+.+.++++.+...++.++.-..++-. ..-||+- ..+. T Consensus 184 ~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~il~G~G~~~~~~~~~~~~~i~~~~~ 263 (408) T protein:vir:74 184 PDLDNPRLTIIKYLIKRYAGIITATNTLLKDTAENILAWLSSWIAKKVVVTRNQAIIAAMGTVPKKPTIANFDDVITMIN 263 (408) T ss_pred ccccccceeeEEeeeeeEEeeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccHHHHHHHHH Confidence 2 2333556778889999999999999988899999999999999999988865542211100 0011110 0010 Q ss_pred -cccccccc-----------------cccccchhhhhhhhccc--CCCCcc------------------ceeccCCccch Q lcl|NC_020198. 139 -TDHPVYPN-----------------VDGTGTATTVSNLFAPA--ADPGAA------------------WYLLDTSRSLK 180 (304) Q Consensus 139 -tdH~v~~~-----------------~~~tg~~~s~snl~~~~--~~~g~~------------------w~L~d~~~~~k 180 (304) .=|+.+.+ .+++|..+-..++..+. ..-|-| .++-|-++.+. T Consensus 264 ~~l~~~~~~~a~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~i~~gd~~~~~~ 343 (408) T protein:vir:74 264 TSVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPNSGSTVYPLYYGDMSQAIT 343 (408) T ss_pred HhhhhhhcCCCEEEEcHHHHHHHHHhhcCCCceEeccCcCCCCCceecceeeEEecCcccccccCCcceEEEEehhccEE Confidence 00110000 01111111111110000 001111 11112222111 Q ss_pred hhhhhhccccchhhcccCcccccccccceEEEeeccccccccchhhh-----------hccccccchhHH Q lcl|NC_020198. 181 PLIYQERMKPSFTSMTKEDDEQVFMADEYRYGVRSRCNVGFGFWQLA-----------AMSTEELNQVNF 239 (304) Q Consensus 181 P~i~Q~r~~~~~~~~~~~~~~~vf~~~~~~~Gvd~R~n~G~g~wq~a-----------~~s~~~l~~~~l 239 (304) +=.|+...+. .++....-|.++...|.+..|+..+.-....- -+.+...++.++ T Consensus 344 ---~~~~~~~~i~--~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~ 408 (408) T protein:vir:74 344 ---LFDRENMSLL--PTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFTAIADQVGNFKTTTSTAV 408 (408) T ss_pred ---EEEecceEEE--EeccccchhhcceeeEEEEEeeCcEEecccceEEEEeecccCCCCCCCCCccccC Confidence 0112222221 22233344777877777777776543222210 011223333333 No 60 >protein:vir:4456 Length: 401 # NCBI annotation: Major capsid protein precursor # Family: family:all:21 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700379;genbank:gi:23505451;genbank:GeneID:955658 Probab=31.65 E-value=1.5 Score=19.65 Aligned_cols=224 Identities=9% Similarity=0.003 Sum_probs=86.6 Q ss_pred CCCccHHHHH-HHHHHHHHHHHHHHhhcchhhcceEEEecCCccccccccccCCccchhcccceee-ccc---cccccee Q lcl|NC_020198. 1 MAIITPALIS-ALKTSFQKHFQDALATAPSTYLQVATVIPSTTASNTYGWLGQFPKLREWIGQRVI-KDM---AAQGYQI 75 (304) Q Consensus 1 maii~~~~l~-~l~~~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~lrEw~Ge~~~-~~l---~~~~~~i 75 (304) |...+...=- .+=.-+...+...+... +...++|+.+|-++....+.-.-.-+.. .|+||-.- ... .=..-++ T Consensus 107 ~~~~~~~~GG~~iP~~~~~~ii~~~~~~-~~l~~~~~~~~~~~~~~~~~~~~~~~~a-~wv~E~~~~~~~~~~~~~~v~~ 184 (401) T protein:vir:44 107 LQVGTDEDGGYAVPEELDRSILSLLKDE-VVMRQEATVITVGGSDYKKLVNLGGTAS-GWVGETDTRSQTATSRLGLIEP 184 (401) T ss_pred hhcCCCCCCceeccHhHHHHHHHHHHhh-hhhhhhceeeecCCCceEEEEecCCccc-eeeccccccCccccccceeeee Confidence 1111100000 00011222222222222 2344567777644433333333233332 57766532 211 1233466 Q ss_pred eeecccceeecchhhhhcCCcchhHHHHHHHHHHHHhcHHHHHHH---------HHhc---------------------- Q lcl|NC_020198. 76 TNKLFESTVGVKRTDIEDDNLGVYGPLMQEMGRAAGAHPDELVFA---------LLKA---------------------- 124 (304) Q Consensus 76 ~nk~fe~tv~v~R~~I~dDdlG~~~~~~~~~G~aAa~~~~~lv~~---------lL~~---------------------- 124 (304) .-++++..+.||++.+.|-...+..-+.+.|+++-++..+..++. +|+. T Consensus 185 ~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~ai~~~~~~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~t~~ 264 (401) T protein:vir:44 185 FMGEIYGNPQATQKMLDDAFFNVEAWINSELATEFAEQEEIAFTTGDGTKKPKGFLAYESTEESDKARAFGKLQHIVSGE 264 (401) T ss_pred ehhheeeehhhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhccCCCCccceeecccccccccccccccccccccccc Confidence 778899999999999998888999999999999888877665541 0000 Q ss_pred ------------------c--CC---------------CcccCcccccccccccccccccccchhhhhhhhcccCCCCcc Q lcl|NC_020198. 125 ------------------G--NA---------------NLCYDGQNFFDTDHPVYPNVDGTGTATTVSNLFAPAADPGAA 169 (304) Q Consensus 125 ------------------g--~~---------------~~cyDGq~fF~tdH~v~~~~~~tg~~~s~snl~~~~~~~g~~ 169 (304) + .+ -+--+|+|+|..+=..+..--=-|..+.+++-....+.+..+ T Consensus 265 ~~~~~~d~i~~~~~~l~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~g~~~~l~G~PVv~~~~~p~~~~~~~~ 344 (401) T protein:vir:44 265 ATAVTADAIIKLIYTLRKAHRTGAKFMMNNNSLFAIRLLKDTEGNYLWRPGLELGQPSSLAGYGIAENEQMPDIAADAKA 344 (401) T ss_pred ccccCHHHHHHHHHhcchhhhcCCEEEEcHHHHHHHHHhhccCCceeecCCcCCCCCceecceeeEEecCcCCccCCccE Confidence 0 00 001122333322211000000011111111111000111111 Q ss_pred ceeccCCccchhhhhhhccccchhhcccCcccccccccceEEEeeccccccccchhhhhccccccchhHHHHH Q lcl|NC_020198. 170 WYLLDTSRSLKPLIYQERMKPSFTSMTKEDDEQVFMADEYRYGVRSRCNVGFGFWQLAAMSTEELNQVNFEKV 242 (304) Q Consensus 170 w~L~d~~~~~kP~i~Q~r~~~~~~~~~~~~~~~vf~~~~~~~Gvd~R~n~G~g~wq~a~~s~~~l~~~~l~aa 242 (304) -++.|.++.+. +..|...++. .++.|.++...|-+..|.... +++-. ++-.-.+.+| T Consensus 345 i~~Gd~~~~~~---i~~~~~~~~~------~~~~~~~~~v~~~a~~r~d~~--~~~~~-----a~~~l~~~aa 401 (401) T protein:vir:44 345 IAFGNFKRGYT---IVDRIGTRIL------RDPYTNKPFVGFYTTKRTGGM--LVDSQ-----AIKLLKIAAA 401 (401) T ss_pred EEEeehhccEE---EEEecceEEe------eeccccCCcEEEEEEEEeccE--Eeccc-----ceEEEEeecC Confidence 22234443332 1233333321 112244555555444444322 11111 0000001111 No 61 >protein:vir:1268 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690760;genbank:gi:22855000;genbank:GeneID:955203 Probab=31.21 E-value=1.5 Score=19.59 Aligned_cols=231 Identities=15% Similarity=0.121 Sum_probs=90.4 Q ss_pred CCCccHHHHHHH-HHHHHHHHHHHHhhcchhhcceEEEecCCcccccccc--ccCCccchhccccee-ec---ccccccc Q lcl|NC_020198. 1 MAIITPALISAL-KTSFQKHFQDALATAPSTYLQVATVIPSTTASNTYGW--LGQFPKLREWIGQRV-IK---DMAAQGY 73 (304) Q Consensus 1 maii~~~~l~~l-~~~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~--Lg~~P~lrEw~Ge~~-~~---~l~~~~~ 73 (304) |.-.+...=..+ =+-+...+.+......+ ..++|..+|-+.....|.. -...|. -.|++|-. +. ...=..- T Consensus 123 ~~~~~~~~gg~lvP~~~~~~ii~~~~~~~~-l~~~~~~~~~~~~~~~~~~~~~~~~~~-a~~v~Eg~~~~~~~~~~~~~v 200 (397) T protein:vir:12 123 MSGINDEDGGILIPEDIGRQIHEFKRQFEP-LEQYVTVEPVTTRSGTRLLEKNADMVP-FSPVEELGNLPEIDQPRFTKV 200 (397) T ss_pred ccccccccCcccCchhHHHHHHHhhhhhhh-HHhhcceeeccCCceeEEEEEecCCcc-eeeecccccccccccccceeE Confidence 111111000000 00111112122221112 2234555543332223322 233333 25775542 11 1223455 Q ss_pred eeeeecccceeecchhhhhcCCcchhHHHHHHHHHHHHhcHHHHHHHHHhccCC--CcccCccc-cc-ccccccccc--- Q lcl|NC_020198. 74 QITNKLFESTVGVKRTDIEDDNLGVYGPLMQEMGRAAGAHPDELVFALLKAGNA--NLCYDGQN-FF-DTDHPVYPN--- 146 (304) Q Consensus 74 ~i~nk~fe~tv~v~R~~I~dDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~--~~cyDGq~-fF-~tdH~v~~~--- 146 (304) ++.-++++..+.||++.+.|-...+..-+...++++-++..+..++.-..+|.. ..-||+-- .+ +.=++.+.. T Consensus 201 ~~~~~k~~~~~~is~e~l~ds~~~l~~~i~~~l~~~~~~~~d~~il~G~g~~~~~g~~~~~~i~~~~~~~l~~~~~~~a~ 280 (397) T protein:vir:12 201 SYSIIDYGGIMTLSNSMLNDSDQAIMTYVAKWFAKKSVVTRNNLILAAIASLKKVDIDGLDGIKKALNVTLDPMVAPGSI 280 (397) T ss_pred EeeheeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccHHHHHHHHhhccchhhhCCCE Confidence 788899999999999999988889999999999999999998877654333211 11122110 01 011111000 Q ss_pred --------------cccccchhhhhhhhcccC--CCCccceeccC-----Cccchhhhhh---------hccccchhhcc Q lcl|NC_020198. 147 --------------VDGTGTATTVSNLFAPAA--DPGAAWYLLDT-----SRSLKPLIYQ---------ERMKPSFTSMT 196 (304) Q Consensus 147 --------------~~~tg~~~s~snl~~~~~--~~g~~w~L~d~-----~~~~kP~i~Q---------~r~~~~~~~~~ 196 (304) .++.|..+-..++..+.. .-|-|-++.+. ...-.++++- .|....+.. T Consensus 281 ~~~n~~~~~~L~~lkd~~G~~l~~~~~~~g~~~~l~G~pv~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~-- 358 (397) T protein:vir:12 281 VLTNQDGYDWLDTLKDGTGRYLLQPDPTNPTKKLLDGRPVVPFTNRVLKTQKGKAPLIIGNLKEAIVLFDREQQSIAS-- 358 (397) T ss_pred EEEcHHHHHHHHHhhccCCceeecccccCCCCccccceeeEEecccccccCCCccEEEEEehhceEEEEeecceEEEE-- Confidence 011111111001000000 00111111100 0000011111 111111111 Q ss_pred cCcccccccccceEEEeeccccccccchhhhhccccccchh Q lcl|NC_020198. 197 KEDDEQVFMADEYRYGVRSRCNVGFGFWQLAAMSTEELNQV 237 (304) Q Consensus 197 ~~~~~~vf~~~~~~~Gvd~R~n~G~g~wq~a~~s~~~l~~~ 237 (304) ......-|.++...|-+..|+.. ..++-..-....++++ T Consensus 359 ~~~~~~~f~~~~~~~r~~~r~d~--~~~~~~a~~~~~~t~~ 397 (397) T protein:vir:12 359 TDTGAGAFETNSTKVRGIEREDV--RKWDEDAVVFGQITVE 397 (397) T ss_pred eccccchhhcCceEEEEEEeecc--EEecccceEEEEEeeC Confidence 11222335556555555555543 3344443344555655 No 62 >protein:vir:1025 Length: 408 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076679;genbank:gi:13095788;genbank:GeneID:920362 Probab=30.94 E-value=1.5 Score=19.56 Aligned_cols=233 Identities=12% Similarity=0.097 Sum_probs=92.2 Q ss_pred CCCccHHHHHHHHH------------HHHHHHHHHHhhcchhhcceEEEecCCccccccccc--cCCccchhccccee-e Q lcl|NC_020198. 1 MAIITPALISALKT------------SFQKHFQDALATAPSTYLQVATVIPSTTASNTYGWL--GQFPKLREWIGQRV-I 65 (304) Q Consensus 1 maii~~~~l~~l~~------------~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~L--g~~P~lrEw~Ge~~-~ 65 (304) +...+....+++.. .+...+.+..... ....++|+.+|-++....+..+ ...-..-.|+||-. + T Consensus 105 ~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~Ii~~~~~~-~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~ 183 (408) T protein:vir:10 105 MAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQY-DSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKI 183 (408) T ss_pred hhhhhhhhhhhhhcccccCCceeccHhHHHHHHHHHHhh-chhhhhcceeeccCCcceEEEeeccccccceeeecCcccc Confidence 11111111111111 1111111111111 1234456666644444444332 22223347886542 2 Q ss_pred cc---cccccceeeeecccceeecchhhhhcCCcchhHHHHHHHHHHHHhcHHHHHHHHHhccCC---CcccCcc-cccc Q lcl|NC_020198. 66 KD---MAAQGYQITNKLFESTVGVKRTDIEDDNLGVYGPLMQEMGRAAGAHPDELVFALLKAGNA---NLCYDGQ-NFFD 138 (304) Q Consensus 66 ~~---l~~~~~~i~nk~fe~tv~v~R~~I~dDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~---~~cyDGq-~fF~ 138 (304) .. ..=..-++..++++..+.||++.+.|-..++-+-+...++++.+...++-++.-...|-. ..-||.- ..+. T Consensus 184 ~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~~~~~~~~~~~~~l~~~~~ 263 (408) T protein:vir:10 184 PDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVMKAAPKKPTIAKFDDVITMIN 263 (408) T ss_pred ccccCcceeeEEeeeeeEEeeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccHHHHHHHHH Confidence 11 222455778889999999999999998889999999999999999988876655433211 1111110 0000 Q ss_pred c-cccccc-c----------------cccccchhhhhhhhc-------------------cc-CCCCccceeccCCccch Q lcl|NC_020198. 139 T-DHPVYP-N----------------VDGTGTATTVSNLFA-------------------PA-ADPGAAWYLLDTSRSLK 180 (304) Q Consensus 139 t-dH~v~~-~----------------~~~tg~~~s~snl~~-------------------~~-~~~g~~w~L~d~~~~~k 180 (304) . =++.+. + .+++|..+-..++.. +. +....+.++.|-++.+. T Consensus 264 ~~~~~~~~~~a~~v~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~i~~gd~~~~~~ 343 (408) T protein:vir:10 264 TAVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAIT 343 (408) T ss_pred HhhhhhhccCCEEEEcHHHHHHHHHhhccCCceEeccCcCCCCCceecceeeEEecccccCccCCCceEEEEEehhccEE Confidence 0 000000 0 011111111000000 00 00001122222222111 Q ss_pred hhhhhhccccchhhcccCcccccccccceEEEeeccccccccchhhhh-----------ccccccchhHH Q lcl|NC_020198. 181 PLIYQERMKPSFTSMTKEDDEQVFMADEYRYGVRSRCNVGFGFWQLAA-----------MSTEELNQVNF 239 (304) Q Consensus 181 P~i~Q~r~~~~~~~~~~~~~~~vf~~~~~~~Gvd~R~n~G~g~wq~a~-----------~s~~~l~~~~l 239 (304) .-.|....+. .++....-|.++...|-+..|+..+.-....-. +...+.+..++ T Consensus 344 ---~~~~~~~~v~--~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~~~~~~~~~~~~~~~~~~~~~~ 408 (408) T protein:vir:10 344 ---LFDRENMSLL--PTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFSAIADQVGNFKTTTSTAV 408 (408) T ss_pred ---EEEecceEEE--EcccccchhhcCceEEEEEEeeccEEeccccEEEEEeeccccCCCCCCCCCcccC Confidence 0012222221 112222345666666665555554322211100 11112222222 No 63 >protein:vir:41 Length: 299 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463467;swissprot:trembl:q9t1b7;genbank:gi:16798789;uniprot:Q9T1B7;genbank:GeneID:922353 Probab=30.61 E-value=1.6 Score=19.52 Aligned_cols=230 Identities=12% Similarity=0.113 Sum_probs=96.5 Q ss_pred CCCccHHHHHHH-HHHHHHHHHHHHhhcchhhcceEEEecCCccccccccccCCccchhcccce---eecccccccceee Q lcl|NC_020198. 1 MAIITPALISAL-KTSFQKHFQDALATAPSTYLQVATVIPSTTASNTYGWLGQFPKLREWIGQR---VIKDMAAQGYQIT 76 (304) Q Consensus 1 maii~~~~l~~l-~~~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~lrEw~Ge~---~~~~l~~~~~~i~ 76 (304) |...+.+.-..+ =..+...+.+.+.+.- .-.++|+.+|.......+..... |. -.|++|- ...+.+=..-++. T Consensus 6 ~~~~~~~~~~~~iP~~~~~~ii~~~~~~s-~l~~~~~~~~~~~~~~~~~~~~~-~~-a~~v~E~~~~~~~~~~f~~v~l~ 82 (299) T protein:vir:41 6 DTTTMQSAKTGSIPINISEQIITGVKNGS-AAMKLAKAVPMTKPEEEFTFMSG-VG-AFWVDEAERIQTSKPTFTKAKMR 82 (299) T ss_pred CcccccCCCceecchhHHHHHHHHHHhcc-hhhhhceeeecCCCcEEEEEEcC-Cc-eeeeecCccccccccceeEEEEe Confidence 332222111111 0112222222222222 24556777775544444443332 33 3576543 3333444566788 Q ss_pred eecccceeecchhhhhcCCcchhHHHHHHHHHHHHhcHHHHHHHHHhccCCCcccCcccccccccccccccccccchhhh Q lcl|NC_020198. 77 NKLFESTVGVKRTDIEDDNLGVYGPLMQEMGRAAGAHPDELVFALLKAGNANLCYDGQNFFDTDHPVYPNVDGTGTATTV 156 (304) Q Consensus 77 nk~fe~tv~v~R~~I~dDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~cyDGq~fF~tdH~v~~~~~~tg~~~s~ 156 (304) -++++..+.|+++.+.|-...+..-+...++++.++..++.++ +|... +++. + -+.... .. T Consensus 83 ~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l----~G~g~----~~~~-------g-il~~~~---~~ 143 (299) T protein:vir:41 83 SKKMGVIIPTTKENLNYSVTNFFSLMQAEIVEAFYKKFDQAVF----TGVES----PYNW-------N-ILKSAT---DA 143 (299) T ss_pred eEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHh----hcccC----cccc-------c-cccccc---cc Confidence 8999999999999999888899999999999999999987554 23211 1221 0 000000 00 Q ss_pred hhhhcccCCCCccceeccCCccchhhhhhhccccchhhcccCcccccccccceEEEeeccccccccchhhhhccccccch Q lcl|NC_020198. 157 SNLFAPAADPGAAWYLLDTSRSLKPLIYQERMKPSFTSMTKEDDEQVFMADEYRYGVRSRCNVGFGFWQLAAMSTEELNQ 236 (304) Q Consensus 157 snl~~~~~~~g~~w~L~d~~~~~kP~i~Q~r~~~~~~~~~~~~~~~vf~~~~~~~Gvd~R~n~G~g~wq~a~~s~~~l~~ 236 (304) .+..... ...| | .|.=++.+.. ..+ +.+++ |- ++. T Consensus 144 ~~~~~~~---~~~~---~---~l~~~~~~l~------------~~~-------------~~~~~---~v--------~n~ 178 (299) T protein:vir:41 144 SNLVEET---ANKY---D---DLNEAIGLIE------------AED-------------LEPNG---IA--------TIR 178 (299) T ss_pred ceeeccc---cccH---H---HHHHHHHhhh------------ccc-------------CCcCE---EE--------EcH Confidence 0100000 0001 0 0000111100 000 00000 10 122 Q ss_pred hHHHHHHHHHHHhccCCCceeceecCeEEecchHHHHHHHHHhhhccCCCCcce--ecc-eeeEEeccccC Q lcl|NC_020198. 237 VNFEKVYDAMRNQKADGGRPLDIRPNLLVVPTTLRSKAKEVVGVQRLANGADNP--NFE-LVQVLDTAWLN 304 (304) Q Consensus 237 ~~l~aar~aM~~~k~~~G~~L~i~P~~LvVpp~le~~A~~ll~~~~~~~g~~N~--~~g-~~~~iv~p~Ld 304 (304) ..+.++++.|+.+|+||-. |..---.+.+- ...++-++..+.|+.++ +.| .-.+++-.|-+ T Consensus 179 ----~~~~~L~~lkd~~G~~l~~-~~~~~~~~~l~--G~PV~~~~~~~~~~~~~~~~~gdfs~~~i~~~~~ 242 (299) T protein:vir:41 179 ----KQRVKYRSTKDGNGMPIFN-TATSNGVDDVL--GLPIAYTPKYTFGDKDISELVGDWNQAYYGILRG 242 (299) T ss_pred ----HHHHHHHHhhccCCceeec-CCcCCCCceec--ceeeEEecccCCCCCceEEEEEecccEEEEEecC Confidence 2356778889999999843 21100000000 11122223333333221 111 11111211111 No 64 >protein:vir:7855 Length: 497 # NCBI annotation: gp12 # Family: family:all:585 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817462;genbank:gi:29565891;genbank:GeneID:1259081 Probab=27.92 E-value=1.8 Score=19.19 Aligned_cols=269 Identities=12% Similarity=0.017 Sum_probs=95.6 Q ss_pred CC---------CccHHHHHHHHHHHHHHHHHHHhhcchhhcceEEEecCCcccccccc-ccCCccchhccccee---ecc Q lcl|NC_020198. 1 MA---------IITPALISALKTSFQKHFQDALATAPSTYLQVATVIPSTTASNTYGW-LGQFPKLREWIGQRV---IKD 67 (304) Q Consensus 1 ma---------ii~~~~l~~l~~~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~-Lg~~P~lrEw~Ge~~---~~~ 67 (304) |. .|-+.....| -+-+. ..+....++.++|-+...-+|.. .+.-|. -.|+||-. ..+ T Consensus 151 ~~~~~~~~gg~~vp~~~~~~i--------i~~~~-~~~~i~~l~~~~~~~~~~~~~~~~~~~~~~-a~wv~E~~~~~~s~ 220 (497) T protein:vir:78 151 NPFGSTGTFAPGILPTFLPGI--------VEQLF-YELSLADLISSRPVTSPNLSYLTESAAHNN-AAAVAEAGTYPFSS 220 (497) T ss_pred hhcccCcccccccchhhhHHH--------HHHHH-hhhhHHhhccccccCCCceEEEEEcCCCCc-ceeeccCccccccc Confidence 11 1112222121 11111 12344556666664433333432 222222 24775532 333 Q ss_pred cccccceeeeecccceeecchhhhhcCCcchhHHHHHHHHHHHHhcHHHHHHHHHhccCCCcccCcccccc-cccccccc Q lcl|NC_020198. 68 MAAQGYQITNKLFESTVGVKRTDIEDDNLGVYGPLMQEMGRAAGAHPDELVFALLKAGNANLCYDGQNFFD-TDHPVYPN 146 (304) Q Consensus 68 l~~~~~~i~nk~fe~tv~v~R~~I~dDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~cyDGq~fF~-tdH~v~~~ 146 (304) +.=..-++.-++++..+.||++.+. |--.+-.-+...++++-++.+|..+ |. |..+..-.| ++. +... .. T Consensus 221 ~~f~~i~~~~~k~a~~~~iS~ell~-d~~~l~~~i~~~l~~~i~~~~d~~~---l~-G~G~~~p~G--il~~~~~~--~~ 291 (497) T protein:vir:78 221 EEFARVYEQVGKVANALTITDEGLR-DAPELFNFVQGRLLEGIQRKEEVQL---LA-GGGYPGVNG--LLQRSTGF--TA 291 (497) T ss_pred ccceeeEeeeeeeEeecHhHHHHHH-hHHHHHHHHHHHHHHHHHHHHHHHh---hc-CCCcccccc--cccccccc--cc Confidence 3334566777888999999999886 5556777788888888888887543 22 211111111 110 0000 00 Q ss_pred cccccchhhhh----hhhcccCCCCccceec-cCCccchhhhhh--hccccchhhcccCccccc----ccccceEEEeec Q lcl|NC_020198. 147 VDGTGTATTVS----NLFAPAADPGAAWYLL-DTSRSLKPLIYQ--ERMKPSFTSMTKEDDEQV----FMADEYRYGVRS 215 (304) Q Consensus 147 ~~~tg~~~s~s----nl~~~~~~~g~~w~L~-d~~~~~kP~i~Q--~r~~~~~~~~~~~~~~~v----f~~~~~~~Gvd~ 215 (304) ..+.+.....+ ++.....+. ..|..- +.-..++-.... ......+.....+.+... |..-..++-... T Consensus 292 ~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 370 (497) T protein:vir:78 292 SSASSLFGATSATVSNVKFPADGT-NGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLF 370 (497) T ss_pred cccccchhhhhhhhhhhhhhcccc-cchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcc Confidence 00000000000 000000001 111111 000000000000 000000111111111111 100000000000 Q ss_pred cccccccchhhhhccccccchhHHHHHHHHHHHhccCCCceeceecCeE-EecchHHH---HHHHHHhhhccCCC----- Q lcl|NC_020198. 216 RCNVGFGFWQLAAMSTEELNQVNFEKVYDAMRNQKADGGRPLDIRPNLL-VVPTTLRS---KAKEVVGVQRLANG----- 286 (304) Q Consensus 216 R~n~G~g~wq~a~~s~~~l~~~~l~aar~aM~~~k~~~G~~L~i~P~~L-vVpp~le~---~A~~ll~~~~~~~g----- 286 (304) +.. +.-.++. ....+|++.|+-+|++|..-|.-. ...|.... -.+-++.++.++.| T Consensus 371 ~~~-----------~~~vmn~----~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~pV~~t~~~~~~~~~~G 435 (497) T protein:vir:78 371 QTP-----------NAVVMNP----RDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVG 435 (497) T ss_pred cCC-----------CeEEEch----HHHHHHHHhhcCCCceeccCcccccccccccCCceeeceeeEecCCCCCCceEEe Confidence 000 0011222 345778999999999876432111 00000000 00111222223322 Q ss_pred Ccce-----e-cceeeEEeccccC Q lcl|NC_020198. 287 ADNP-----N-FELVQVLDTAWLN 304 (304) Q Consensus 287 ~~N~-----~-~g~~~~iv~p~Ld 304 (304) +-+. + +.-++|.++++.. T Consensus 436 d~~~~~~~i~~r~~~~v~~~~~~~ 459 (497) T protein:vir:78 436 HFAPSVIQTARREGVTMQMTNSNG 459 (497) T ss_pred ecccceEEEEEecccEEEeecccc Confidence 2221 1 3456666666543 No 65 >protein:vir:101650 Length: 497 # NCBI annotation: gp13 # Family: family:all:585 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654768;genbank:gi:109302766;genbank:GeneID:4156084 Probab=27.92 E-value=1.8 Score=19.19 Aligned_cols=269 Identities=12% Similarity=0.017 Sum_probs=95.6 Q ss_pred CC---------CccHHHHHHHHHHHHHHHHHHHhhcchhhcceEEEecCCcccccccc-ccCCccchhccccee---ecc Q lcl|NC_020198. 1 MA---------IITPALISALKTSFQKHFQDALATAPSTYLQVATVIPSTTASNTYGW-LGQFPKLREWIGQRV---IKD 67 (304) Q Consensus 1 ma---------ii~~~~l~~l~~~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~-Lg~~P~lrEw~Ge~~---~~~ 67 (304) |. .|-+.....| -+-+. ..+....++.++|-+...-+|.. .+.-|. -.|+||-. ..+ T Consensus 151 ~~~~~~~~gg~~vp~~~~~~i--------i~~~~-~~~~i~~l~~~~~~~~~~~~~~~~~~~~~~-a~wv~E~~~~~~s~ 220 (497) T protein:vir:10 151 NPFGSTGTFAPGILPTFLPGI--------VEQLF-YELSLADLISSRPVTSPNLSYLTESAAHNN-AAAVAEAGTYPFSS 220 (497) T ss_pred hhcccCcccccccchhhhHHH--------HHHHH-hhhhHHhhccccccCCCceEEEEEcCCCCc-ceeeccCccccccc Confidence 11 1112222121 11111 12344556666664433333432 222222 24775532 333 Q ss_pred cccccceeeeecccceeecchhhhhcCCcchhHHHHHHHHHHHHhcHHHHHHHHHhccCCCcccCcccccc-cccccccc Q lcl|NC_020198. 68 MAAQGYQITNKLFESTVGVKRTDIEDDNLGVYGPLMQEMGRAAGAHPDELVFALLKAGNANLCYDGQNFFD-TDHPVYPN 146 (304) Q Consensus 68 l~~~~~~i~nk~fe~tv~v~R~~I~dDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~cyDGq~fF~-tdH~v~~~ 146 (304) +.=..-++.-++++..+.||++.+. |--.+-.-+...++++-++.+|..+ |. |..+..-.| ++. +... .. T Consensus 221 ~~f~~i~~~~~k~a~~~~iS~ell~-d~~~l~~~i~~~l~~~i~~~~d~~~---l~-G~G~~~p~G--il~~~~~~--~~ 291 (497) T protein:vir:10 221 EEFARVYEQVGKVANALTITDEGLR-DAPELFNFVQGRLLEGIQRKEEVQL---LA-GGGYPGVNG--LLQRSTGF--TA 291 (497) T ss_pred ccceeeEeeeeeeEeecHhHHHHHH-hHHHHHHHHHHHHHHHHHHHHHHHh---hc-CCCcccccc--cccccccc--cc Confidence 3334566777888999999999886 5556777788888888888887543 22 211111111 110 0000 00 Q ss_pred cccccchhhhh----hhhcccCCCCccceec-cCCccchhhhhh--hccccchhhcccCccccc----ccccceEEEeec Q lcl|NC_020198. 147 VDGTGTATTVS----NLFAPAADPGAAWYLL-DTSRSLKPLIYQ--ERMKPSFTSMTKEDDEQV----FMADEYRYGVRS 215 (304) Q Consensus 147 ~~~tg~~~s~s----nl~~~~~~~g~~w~L~-d~~~~~kP~i~Q--~r~~~~~~~~~~~~~~~v----f~~~~~~~Gvd~ 215 (304) ..+.+.....+ ++.....+. ..|..- +.-..++-.... ......+.....+.+... |..-..++-... T Consensus 292 ~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 370 (497) T protein:vir:10 292 SSASSLFGATSATVSNVKFPADGT-NGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLF 370 (497) T ss_pred cccccchhhhhhhhhhhhhhcccc-cchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcc Confidence 00000000000 000000001 111111 000000000000 000000111111111111 100000000000 Q ss_pred cccccccchhhhhccccccchhHHHHHHHHHHHhccCCCceeceecCeE-EecchHHH---HHHHHHhhhccCCC----- Q lcl|NC_020198. 216 RCNVGFGFWQLAAMSTEELNQVNFEKVYDAMRNQKADGGRPLDIRPNLL-VVPTTLRS---KAKEVVGVQRLANG----- 286 (304) Q Consensus 216 R~n~G~g~wq~a~~s~~~l~~~~l~aar~aM~~~k~~~G~~L~i~P~~L-vVpp~le~---~A~~ll~~~~~~~g----- 286 (304) +.. +.-.++. ....+|++.|+-+|++|..-|.-. ...|.... -.+-++.++.++.| T Consensus 371 ~~~-----------~~~vmn~----~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~pV~~t~~~~~~~~~~G 435 (497) T protein:vir:10 371 QTP-----------NAVVMNP----RDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVG 435 (497) T ss_pred cCC-----------CeEEEch----HHHHHHHHhhcCCCceeccCcccccccccccCCceeeceeeEecCCCCCCceEEe Confidence 000 0011222 345778999999999876432111 00000000 00111222223322 Q ss_pred Ccce-----e-cceeeEEeccccC Q lcl|NC_020198. 287 ADNP-----N-FELVQVLDTAWLN 304 (304) Q Consensus 287 ~~N~-----~-~g~~~~iv~p~Ld 304 (304) +-+. + +.-++|.++++.. T Consensus 436 d~~~~~~~i~~r~~~~v~~~~~~~ 459 (497) T protein:vir:10 436 HFAPSVIQTARREGVTMQMTNSNG 459 (497) T ss_pred ecccceEEEEEecccEEEeecccc Confidence 2221 1 3456666666543 No 66 >protein:vir:4092 Length: 390 # NCBI annotation: major capsid protein a # Family: family:all:635 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510986;swissprot:trembl:q8w604;genbank:gi:17488508;uniprot:Q8W604;genbank:GeneID:1260361 Probab=27.43 E-value=1.8 Score=19.13 Aligned_cols=241 Identities=10% Similarity=0.024 Sum_probs=98.6 Q ss_pred CCCccHHHHHHHHHHHHHHHHHHH------------------------hhcchhhcceEEEecCCccccccccccCCccc Q lcl|NC_020198. 1 MAIITPALISALKTSFQKHFQDAL------------------------ATAPSTYLQVATVIPSTTASNTYGWLGQFPKL 56 (304) Q Consensus 1 maii~~~~l~~l~~~~~~~f~~a~------------------------~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~l 56 (304) +..........|....++.+++.. ... +.-.+.|+.+|.......+.+...-|.. T Consensus 61 ~~~~~~~~~~~l~~~~r~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~~~~~-s~i~~~~~~~~~~~~~~~i~~~~~~~~a 139 (390) T protein:vir:40 61 NNVLASRGANALTSDESKYYNEVIAGNGFAGVTALLPPTVFERVFEDLTVE-HPLLSKINFVNTTATTEWIISVGDVATA 139 (390) T ss_pred HHHHHhcCchhccHHHHHHHHHHHhccCcccCcccccHHHHHHHHHHHHhh-hhhhhhceeeecCCceeEEEEEcCCcce Confidence 000001111111111222222211 111 1223456777765555556665555543 Q ss_pred hhcccc---ee-ecccccccceeeeecccceeecchhhhhcCCcchhHHHHHHHHHHHHhcHHHHHHHHHhccCCCcccC Q lcl|NC_020198. 57 REWIGQ---RV-IKDMAAQGYQITNKLFESTVGVKRTDIEDDNLGVYGPLMQEMGRAAGAHPDELVFALLKAGNANLCYD 132 (304) Q Consensus 57 rEw~Ge---~~-~~~l~~~~~~i~nk~fe~tv~v~R~~I~dDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~cyD 132 (304) .|++| ++ ..+..=..-++.-+++...|.||++.++|-..++..-+.+.++++.+...++.++. |.. . T Consensus 140 -~~~~E~~~~~~~~~~~f~~i~l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~~i~~~~~~a~l~----G~G----~ 210 (390) T protein:vir:40 140 -WWGPLCAEIKEVLDNGFDKIQTGMYKLSAYIPVCNAMLDLGPSWLDQYVRTILGEAMALGLEAGIVN----GSG----K 210 (390) T ss_pred -eeeccccccCccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhc----ccC----C Confidence 66654 22 22333355677888999999999999999999999999999999999999875442 321 1 Q ss_pred cccccccccccccccccccchhhhhhhhcccCCCCccceeccCCccchhhhhhhccccchhhcccCcccccccccceEEE Q lcl|NC_020198. 133 GQNFFDTDHPVYPNVDGTGTATTVSNLFAPAADPGAAWYLLDTSRSLKPLIYQERMKPSFTSMTKEDDEQVFMADEYRYG 212 (304) Q Consensus 133 Gq~fF~tdH~v~~~~~~tg~~~s~snl~~~~~~~g~~w~L~d~~~~~kP~i~Q~r~~~~~~~~~~~~~~~vf~~~~~~~G 212 (304) |+| .|--...++..... ....+....--.|....+.-++-. |+ T Consensus 211 ~~P-------~Gil~~~~~~~~~~-----~~~~~~~~~t~~~~~~~~~~l~~~-------------------------~~ 253 (390) T protein:vir:40 211 DQP-------IGMMRDLNNVTAGE-----HPVKTATPLTDLTPATLATKVMLP-------------------------LT 253 (390) T ss_pred Ccc-------ceeeeccccccccc-----cccccccccchhhHHHHHHHHHHH-------------------------hh Confidence 233 11000000000000 000000000000000011100000 00 Q ss_pred eeccccccccchhhhhccccccchhHHHHHHHHHHHhccCCCceece---ecCeEEecchHHHHHHHHHhhhccCCCCcc Q lcl|NC_020198. 213 VRSRCNVGFGFWQLAAMSTEELNQVNFEKVYDAMRNQKADGGRPLDI---RPNLLVVPTTLRSKAKEVVGVQRLANGADN 289 (304) Q Consensus 213 vd~R~n~G~g~wq~a~~s~~~l~~~~l~aar~aM~~~k~~~G~~L~i---~P~~LvVpp~le~~A~~ll~~~~~~~g~~N 289 (304) ..+....+-+.| -++...+..-..+++.+++.+|+++.- .+.-+|+.+.+- +-.++ -|+-+ T Consensus 254 ~~~~~~~~~a~~--------i~n~~t~~~~l~~~~~~~d~~G~~v~~~~~~g~pvv~~~~~p--~~~i~------~Gd~s 317 (390) T protein:vir:40 254 DNGKKSVSDAIL--------VINPADYWSKIYAATSYMTPQGVWVTGILPVPLEIVQSVAVP--VGKAV------AGRAK 317 (390) T ss_pred cchhhhhcCceE--------EEcchhHHHHHHHHhhccCCCCccccccCCCceeEEEcCCCC--CCcEE------EEeec Confidence 000000011111 122222333345677888888887531 121222222111 00000 12111 Q ss_pred ee----cceeeEEeccccC Q lcl|NC_020198. 290 PN----FELVQVLDTAWLN 304 (304) Q Consensus 290 ~~----~g~~~~iv~p~Ld 304 (304) -| ++-+++.+++..- T Consensus 318 ~~~i~~~~~~~v~~~~~~~ 336 (390) T protein:vir:40 318 DYFMGIGSEQVIRTSTEYR 336 (390) T ss_pred eEEEEeecceEEEecchhh Confidence 11 2334555544332 No 67 >protein:vir:102605 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655002;genbank:gi:109392192;genbank:GeneID:4157227 Probab=27.42 E-value=1.8 Score=19.13 Aligned_cols=192 Identities=10% Similarity=0.064 Sum_probs=88.3 Q ss_pred CCCcc--HHHHHHHHHHHHHHHHHHHhhcchhhcceEEEecCCccccccccccCCccchhcc---cceeeccccccccee Q lcl|NC_020198. 1 MAIIT--PALISALKTSFQKHFQDALATAPSTYLQVATVIPSTTASNTYGWLGQFPKLREWI---GQRVIKDMAAQGYQI 75 (304) Q Consensus 1 maii~--~~~l~~l~~~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~lrEw~---Ge~~~~~l~~~~~~i 75 (304) ||+-+ ++...+ .+-..|++.+--++-.++...-.. ....+-+...++.. ...+.. +......+.+...++ T Consensus 1 MA~~~~~pe~~~~---~v~~~~~~~lv~~~l~~~~~~~~~-~~Gdtv~ip~~~~~-~~~d~~~~~~~~~~~~~~~~~~~~ 75 (273) T protein:vir:10 1 MAFNNFIPELWSD---MLLEEWTAQTVFANLVNREYEGTA-SKGNVVHIAGVVAP-TVKDYKAAGRQTSADAISDTGVDL 75 (273) T ss_pred CcchhhhHHHHHH---HHHHHHHhhhccchhhcccccccc-ccCceEEEeecccc-cccccccCCCccCccccccceEEE Confidence 99833 343333 234445555433332222211000 11112222222221 222222 223445666666666 Q ss_pred eeecc-cceeecchhhhhcCCcchhHHHHHHHHHHHHhcHHHHHHHHHhccCCCcccCcccccccccccccccccccchh Q lcl|NC_020198. 76 TNKLF-ESTVGVKRTDIEDDNLGVYGPLMQEMGRAAGAHPDELVFALLKAGNANLCYDGQNFFDTDHPVYPNVDGTGTAT 154 (304) Q Consensus 76 ~nk~f-e~tv~v~R~~I~dDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~cyDGq~fF~tdH~v~~~~~~tg~~~ 154 (304) +..++ ...+.|+..+-...... +..+.++++++-++.-|..+++++..+.++ + .. T Consensus 76 tid~~~~~~~~i~d~d~~~~~~~-~~~~~~~~~~alA~~vD~~i~~~~~~a~~~--------------~-----~~---- 131 (273) T protein:vir:10 76 LIDQEKSIDFLVDDIDRVQVAGS-LEAYTRAGATALATDTDKFIADMLVDNGTA--------------L-----TG---- 131 (273) T ss_pred EEeeeeecceEeecHHHhhhhcc-HHHHHHHHHHHHHHHHHHHHHHHHhccccc--------------c-----cc---- Confidence 66443 55666665443333333 466788899999988898888877642110 0 00 Q ss_pred hhhhhhcccCCCCccceeccCCccchhhhhhhccccchhhcccCcccccccccceEEEeeccccccccchhhhhcccccc Q lcl|NC_020198. 155 TVSNLFAPAADPGAAWYLLDTSRSLKPLIYQERMKPSFTSMTKEDDEQVFMADEYRYGVRSRCNVGFGFWQLAAMSTEEL 234 (304) Q Consensus 155 s~snl~~~~~~~g~~w~L~d~~~~~kP~i~Q~r~~~~~~~~~~~~~~~vf~~~~~~~Gvd~R~n~G~g~wq~a~~s~~~l 234 (304) +.++ T Consensus 132 ----------------------------------------------------------------------------~~~~ 135 (273) T protein:vir:10 132 ----------------------------------------------------------------------------SAPT 135 (273) T ss_pred ----------------------------------------------------------------------------cccc Confidence 0011 Q ss_pred ch----hHHHHHHHHHHHhccCCCceeceecCeEEecchHHHHHHH---HHhhhccCCCCcceec-ce------eeEEec Q lcl|NC_020198. 235 NQ----VNFEKVYDAMRNQKADGGRPLDIRPNLLVVPTTLRSKAKE---VVGVQRLANGADNPNF-EL------VQVLDT 300 (304) Q Consensus 235 ~~----~~l~aar~aM~~~k~~~G~~L~i~P~~LvVpp~le~~A~~---ll~~~~~~~g~~N~~~-g~------~~~iv~ 300 (304) +. +.+-+|+++| ++..-|- ..++|||+|.....-++ .+.... ..|+.+.++ |. ++++.+ T Consensus 136 ~~~~~~~~i~~a~~~l----d~~~vP~--~~R~lvv~p~~~~~L~~~~~~~~~~~-~~~~~~~l~~G~ig~i~G~~v~~s 208 (273) T protein:vir:10 136 DADDAFDLIAKALKEL----TKANVPN--VGRVVVVNAEMAFWLRSSGSKLTSAD-TSGDAAGLRAGTIGNLLGARIVES 208 (273) T ss_pred chhHHHHHHHHHHHHh----hhcCCCc--CCCEEEECHHHHHHHhcchhhhhhhh-ccccccceeeeeeeEEeceEEEEe Confidence 11 1233333333 3333332 23578999977665432 221111 123333332 21 678888 Q ss_pred cccC Q lcl|NC_020198. 301 AWLN 304 (304) Q Consensus 301 p~Ld 304 (304) ..|- T Consensus 209 ~~lp 212 (273) T protein:vir:10 209 NNLR 212 (273) T ss_pred cccc Confidence 7774 No 68 >protein:vir:105822 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655767;genbank:gi:109522090;genbank:GeneID:4157630 Probab=27.42 E-value=1.8 Score=19.13 Aligned_cols=192 Identities=10% Similarity=0.064 Sum_probs=88.3 Q ss_pred CCCcc--HHHHHHHHHHHHHHHHHHHhhcchhhcceEEEecCCccccccccccCCccchhcc---cceeeccccccccee Q lcl|NC_020198. 1 MAIIT--PALISALKTSFQKHFQDALATAPSTYLQVATVIPSTTASNTYGWLGQFPKLREWI---GQRVIKDMAAQGYQI 75 (304) Q Consensus 1 maii~--~~~l~~l~~~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~lrEw~---Ge~~~~~l~~~~~~i 75 (304) ||+-+ ++...+ .+-..|++.+--++-.++...-.. ....+-+...++.. ...+.. +......+.+...++ T Consensus 1 MA~~~~~pe~~~~---~v~~~~~~~lv~~~l~~~~~~~~~-~~Gdtv~ip~~~~~-~~~d~~~~~~~~~~~~~~~~~~~~ 75 (273) T protein:vir:10 1 MAFNNFIPELWSD---MLLEEWTAQTVFANLVNREYEGTA-SKGNVVHIAGVVAP-TVKDYKAAGRQTSADAISDTGVDL 75 (273) T ss_pred CcchhhhHHHHHH---HHHHHHHhhhccchhhcccccccc-ccCceEEEeecccc-cccccccCCCccCccccccceEEE Confidence 99833 343333 234445555433332222211000 11112222222221 222222 223445666666666 Q ss_pred eeecc-cceeecchhhhhcCCcchhHHHHHHHHHHHHhcHHHHHHHHHhccCCCcccCcccccccccccccccccccchh Q lcl|NC_020198. 76 TNKLF-ESTVGVKRTDIEDDNLGVYGPLMQEMGRAAGAHPDELVFALLKAGNANLCYDGQNFFDTDHPVYPNVDGTGTAT 154 (304) Q Consensus 76 ~nk~f-e~tv~v~R~~I~dDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~cyDGq~fF~tdH~v~~~~~~tg~~~ 154 (304) +..++ ...+.|+..+-...... +..+.++++++-++.-|..+++++..+.++ + .. T Consensus 76 tid~~~~~~~~i~d~d~~~~~~~-~~~~~~~~~~alA~~vD~~i~~~~~~a~~~--------------~-----~~---- 131 (273) T protein:vir:10 76 LIDQEKSIDFLVDDIDRVQVAGS-LEAYTRAGATALATDTDKFIADMLVDNGTA--------------L-----TG---- 131 (273) T ss_pred EEeeeeecceEeecHHHhhhhcc-HHHHHHHHHHHHHHHHHHHHHHHHhccccc--------------c-----cc---- Confidence 66443 55666665443333333 466788899999988898888877642110 0 00 Q ss_pred hhhhhhcccCCCCccceeccCCccchhhhhhhccccchhhcccCcccccccccceEEEeeccccccccchhhhhcccccc Q lcl|NC_020198. 155 TVSNLFAPAADPGAAWYLLDTSRSLKPLIYQERMKPSFTSMTKEDDEQVFMADEYRYGVRSRCNVGFGFWQLAAMSTEEL 234 (304) Q Consensus 155 s~snl~~~~~~~g~~w~L~d~~~~~kP~i~Q~r~~~~~~~~~~~~~~~vf~~~~~~~Gvd~R~n~G~g~wq~a~~s~~~l 234 (304) +.++ T Consensus 132 ----------------------------------------------------------------------------~~~~ 135 (273) T protein:vir:10 132 ----------------------------------------------------------------------------SAPT 135 (273) T ss_pred ----------------------------------------------------------------------------cccc Confidence 0011 Q ss_pred ch----hHHHHHHHHHHHhccCCCceeceecCeEEecchHHHHHHH---HHhhhccCCCCcceec-ce------eeEEec Q lcl|NC_020198. 235 NQ----VNFEKVYDAMRNQKADGGRPLDIRPNLLVVPTTLRSKAKE---VVGVQRLANGADNPNF-EL------VQVLDT 300 (304) Q Consensus 235 ~~----~~l~aar~aM~~~k~~~G~~L~i~P~~LvVpp~le~~A~~---ll~~~~~~~g~~N~~~-g~------~~~iv~ 300 (304) +. +.+-+|+++| ++..-|- ..++|||+|.....-++ .+.... ..|+.+.++ |. ++++.+ T Consensus 136 ~~~~~~~~i~~a~~~l----d~~~vP~--~~R~lvv~p~~~~~L~~~~~~~~~~~-~~~~~~~l~~G~ig~i~G~~v~~s 208 (273) T protein:vir:10 136 DADDAFDLIAKALKEL----TKANVPN--VGRVVVVNAEMAFWLRSSGSKLTSAD-TSGDAAGLRAGTIGNLLGARIVES 208 (273) T ss_pred chhHHHHHHHHHHHHh----hhcCCCc--CCCEEEECHHHHHHHhcchhhhhhhh-ccccccceeeeeeeEEeceEEEEe Confidence 11 1233333333 3333332 23578999977665432 221111 123333332 21 678888 Q ss_pred cccC Q lcl|NC_020198. 301 AWLN 304 (304) Q Consensus 301 p~Ld 304 (304) ..|- T Consensus 209 ~~lp 212 (273) T protein:vir:10 209 NNLR 212 (273) T ss_pred cccc Confidence 7774 No 69 >protein:vir:80213 Length: 334 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522884;genbank:gi:158345177;genbank:GeneID:5687476 Probab=26.65 E-value=1.9 Score=19.03 Aligned_cols=216 Identities=13% Similarity=0.036 Sum_probs=101.4 Q ss_pred CCCccHHHH-----------HHHH-HHHHHHHHHHHhhcchhhcceEEEecCCccccccccccCCccchhc-ccceeecc Q lcl|NC_020198. 1 MAIITPALI-----------SALK-TSFQKHFQDALATAPSTYLQVATVIPSTTASNTYGWLGQFPKLREW-IGQRVIKD 67 (304) Q Consensus 1 maii~~~~l-----------~~l~-~~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~lrEw-~Ge~~~~~ 67 (304) |+-++...| .+|+ .-|..+...+|+.. +-+..+.++ .+-...+++ +||.+-+- ++.++-+. T Consensus 1 m~~~~~~~~t~~~~~~~~~~~~l~le~~~geV~~af~~~-s~~~~~~~~-r~i~~G~s~----~~~~iG~~~~~~~~~g~ 74 (334) T protein:vir:80 1 MTYPAANTHTRPGWGGANSDVSLHIEEHLGLVDASFMYS-SKFASWMNV-RSLRGTNQL----RVDRVGASTIAGRKAGE 74 (334) T ss_pred CCCCcCCCccccccccccchheehhhhhhhHHHHHHHHh-hhhhcccee-eeccccceE----EEeeecceeeeeecCCC Confidence 776633222 3555 56677777777665 333333322 122222222 34443221 11111111 Q ss_pred -cccccceeeeecccceeecch-----hhhhc---C--CcchhHHHHHHHHHHHHhcHHHHHHHHHhccCCC-cccCccc Q lcl|NC_020198. 68 -MAAQGYQITNKLFESTVGVKR-----TDIED---D--NLGVYGPLMQEMGRAAGAHPDELVFALLKAGNAN-LCYDGQN 135 (304) Q Consensus 68 -l~~~~~~i~nk~fe~tv~v~R-----~~I~d---D--dlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~-~cyDGq~ 135 (304) |.. - .+++=+.+|.|+- ..|.| = .+.+=+++.+++|++=++..|+.++..|.+|... .-..-++ T Consensus 75 ~l~~--~--~~~~~~~~l~ID~~l~~~~~VddiD~~q~~~D~rse~~~~~G~aLA~~~D~~~~~~l~kaa~~~~~~~~~~ 150 (334) T protein:vir:80 75 ELVV--Q--KNVSDKLNLTVDTVLYARHFFDKFDEWTSNLDVRKETAREDGIALARQYDQACIIQLQKCGDFLAPAHLKP 150 (334) T ss_pred CCCC--C--CcccCceEEEEeeeeehhhhHhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccc Confidence 111 1 1223344444432 22222 1 2457788999999999999999999887654321 1111222 Q ss_pred ccccccccccccccccchhhhhhhhcccCCCCccceeccCCccchhhhhhhccccchhhcccCcccccccccceEEEeec Q lcl|NC_020198. 136 FFDTDHPVYPNVDGTGTATTVSNLFAPAADPGAAWYLLDTSRSLKPLIYQERMKPSFTSMTKEDDEQVFMADEYRYGVRS 215 (304) Q Consensus 136 fF~tdH~v~~~~~~tg~~~s~snl~~~~~~~g~~w~L~d~~~~~kP~i~Q~r~~~~~~~~~~~~~~~vf~~~~~~~Gvd~ 215 (304) -|+.-|. ......+.+ T Consensus 151 ~~~~G~~-------------~~~~~~g~~--------------------------------------------------- 166 (334) T protein:vir:80 151 AFHDGIL-------------LPSTISGLA--------------------------------------------------- 166 (334) T ss_pred cccCCcc-------------eeecccccc--------------------------------------------------- Confidence 2222110 000000000 Q ss_pred cccccccchhhhhccccccchhHHHHHHHHHHHhccCCCceec-eecCeEEecchHHHH---HHHHHhhhccCCCCccee Q lcl|NC_020198. 216 RCNVGFGFWQLAAMSTEELNQVNFEKVYDAMRNQKADGGRPLD-IRPNLLVVPTTLRSK---AKEVVGVQRLANGADNPN 291 (304) Q Consensus 216 R~n~G~g~wq~a~~s~~~l~~~~l~aar~aM~~~k~~~G~~L~-i~P~~LvVpp~le~~---A~~ll~~~~~~~g~~N~~ 291 (304) ....-+...+..|...++.+-+....|-. ..++++||+|....+ +.++++++....++.|++ T Consensus 167 --------------~~~~~~~~~l~~a~~~a~~~L~e~dvp~~~~~~R~~vv~P~~y~~Ll~~~r~~n~d~~~s~~~~~~ 232 (334) T protein:vir:80 167 --------------ADAAADADVLVAAHRQGVEAMVFRDLGDQLMSEGVTLLDPVIFSFLLEHDRLMNVEFGAKEGGNSF 232 (334) T ss_pred --------------cchhhhHHHHHHHHHHHHHHHHhcCCCCCcCCceEEEeChHHHHHHhcccccccceeccccccccc Confidence 01112223333333333333333333322 345789999976655 455666665544556766 Q ss_pred cce-------eeEEeccccC Q lcl|NC_020198. 292 FEL-------VQVLDTAWLN 304 (304) Q Consensus 292 ~g~-------~~~iv~p~Ld 304 (304) .+. ++|+.+++|= T Consensus 233 ~~g~i~~v~G~~V~~Sn~~P 252 (334) T protein:vir:80 233 VGGRIAMLNGVRVVETPRFP 252 (334) T ss_pred cceeEEEEeceEEEeecCCC Confidence 543 6788888776 No 70 >protein:vir:100057 Length: 375 # NCBI annotation: T7-like capsid protein # Family: family:all:975 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214206;genbank:gi:61806429;genbank:GeneID:3294737 Probab=22.98 E-value=2.4 Score=18.53 Aligned_cols=213 Identities=12% Similarity=0.151 Sum_probs=104.5 Q ss_pred CCCccHHHHH-----------------HHH-HHHHHHHHHHHhhcchhhcceEEEecCCccccccccccCCccchhcccc Q lcl|NC_020198. 1 MAIITPALIS-----------------ALK-TSFQKHFQDALATAPSTYLQVATVIPSTTASNTYGWLGQFPKLREWIGQ 62 (304) Q Consensus 1 maii~~~~l~-----------------~l~-~~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~lrEw~Ge 62 (304) |+..|+.-|- +|+ +-|..+...+|+.. +-...+.+. .+-...+++ +||.+ |+ T Consensus 1 ~~~~~~~~~~~~n~~t~~~~~~~~~~~al~le~f~geV~~~f~~~-si~~~~~~~-rti~~Gksv----~f~~i----G~ 70 (375) T protein:vir:10 1 MANANQVALGRSNLSTGTGYGGATDKYALYLKLFSGEMFKGFQHE-TIARDLVTK-RTLKNGKSL----QFIYT----GR 70 (375) T ss_pred CccccccccCccccCCccccccccchHHHHHHHHhHHHHHHHHHH-Hhhhccccc-cccccCceE----EEEee----ee Confidence 7777766554 444 34666666677655 333333322 232333333 34442 33 Q ss_pred eeecccccccce----e-eeecccceeecchh--------hhhcC--CcchhHHHHHHHHHHHHhcHHHHHHHHHhccCC Q lcl|NC_020198. 63 RVIKDMAAQGYQ----I-TNKLFESTVGVKRT--------DIEDD--NLGVYGPLMQEMGRAAGAHPDELVFALLKAGNA 127 (304) Q Consensus 63 ~~~~~l~~~~~~----i-~nk~fe~tv~v~R~--------~I~dD--dlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~ 127 (304) -+++.......- . -.++=|.++.|+-. ||++= .+.+-+++.+++|++=++..|+.++..|.+|-. T Consensus 71 ~t~~~~t~G~~i~~~~~~d~~~te~~l~ID~~~y~~~~VdDiD~aqa~~Dlr~e~s~~~G~aLA~~~D~~i~~~l~kaa~ 150 (375) T protein:vir:10 71 MTSSFHTPGTPILGNADKAPPVAEKTIVMDDLLISSAFVYDLDETLAHYELRGEISKKIGYALAEKYDRLIFRSITRGAR 150 (375) T ss_pred eEEeeecCCcCcCCccccCCCCCceEEEecchhhhhhhHhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhh Confidence 333332211110 0 11122334444433 22111 256778899999999999999999998865321 Q ss_pred Ccc-cCcccccccccccccccccccchhhhhhhhcccCCCCccceeccCCccchhhhhhhccccchhhcccCcccccccc Q lcl|NC_020198. 128 NLC-YDGQNFFDTDHPVYPNVDGTGTATTVSNLFAPAADPGAAWYLLDTSRSLKPLIYQERMKPSFTSMTKEDDEQVFMA 206 (304) Q Consensus 128 ~~c-yDGq~fF~tdH~v~~~~~~tg~~~s~snl~~~~~~~g~~w~L~d~~~~~kP~i~Q~r~~~~~~~~~~~~~~~vf~~ 206 (304) ... -.+++.|... |. .+..+. T Consensus 151 ~~~p~~~~~~~~~G----------g~-----~i~~~s------------------------------------------- 172 (375) T protein:vir:10 151 SASPVSATNFVEPG----------GT-----QIRVGS------------------------------------------- 172 (375) T ss_pred hccccccccccccC----------cc-----eeeecc------------------------------------------- Confidence 111 1222221100 00 000000 Q ss_pred cceEEEeeccccccccchhhhhccccccchhHHHHHHHHHHHhccCCCceeceecCeEEecchHHHHH------HHHHhh Q lcl|NC_020198. 207 DEYRYGVRSRCNVGFGFWQLAAMSTEELNQVNFEKVYDAMRNQKADGGRPLDIRPNLLVVPTTLRSKA------KEVVGV 280 (304) Q Consensus 207 ~~~~~Gvd~R~n~G~g~wq~a~~s~~~l~~~~l~aar~aM~~~k~~~G~~L~i~P~~LvVpp~le~~A------~~ll~~ 280 (304) ...+..++++.++.++...++.+-+...-|- ..+++||+|..-.+- .++++. T Consensus 173 --------------------g~~~~~~~ta~~~~~ai~~a~~~Lde~~VP~--~~R~~vv~P~~y~~Ll~~~d~~~~~n~ 230 (375) T protein:vir:10 173 --------------------GTNESDAFTASALVNAFYDAAAAMDEKGVSS--QGRCAVLNPRQYYALIQDIGSNGLVNR 230 (375) T ss_pred --------------------ccccccccCHHHHHHHHHHHHHHHhhcCCCC--CCCEEEeChHHHHHHHhcCCccceeee Confidence 0012345677777777666666666666662 257899999875443 345554 Q ss_pred hccCCCCc-c----eecceeeEEeccccC Q lcl|NC_020198. 281 QRLANGAD-N----PNFELVQVLDTAWLN 304 (304) Q Consensus 281 ~~~~~g~~-N----~~~g~~~~iv~p~Ld 304 (304) +...+|.. + ... -+.++.++.|- T Consensus 231 d~~~~~~~~~g~v~~i~-Gv~V~~Sn~lP 258 (375) T protein:vir:10 231 DVQGSALQSGNGVIEIA-GIHIYKSMNIP 258 (375) T ss_pred cccccceeccceEEEEe-ceEEEEecccc Confidence 43222211 1 111 25667776666 No 71 >protein:vir:107593 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338188;genbank:gi:77020144;genbank:GeneID:3703724 Probab=22.34 E-value=2.5 Score=18.44 Aligned_cols=217 Identities=12% Similarity=0.078 Sum_probs=89.7 Q ss_pred CCCccHHHHHH-HHHHHHHHHHHHHhhcchhhcceEEEecCCccc--cccccccCCccchhcccceee----cccccccc Q lcl|NC_020198. 1 MAIITPALISA-LKTSFQKHFQDALATAPSTYLQVATVIPSTTAS--NTYGWLGQFPKLREWIGQRVI----KDMAAQGY 73 (304) Q Consensus 1 maii~~~~l~~-l~~~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~--~~y~~Lg~~P~lrEw~Ge~~~----~~l~~~~~ 73 (304) |..++...=.. +=+-+...+.+.....-+ -.++|+.++-++.. ..+.....-|.. .|++|-.- .+..=..- T Consensus 106 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~-l~~~~~~~~~~~~~~~~~~~~~~~~~~a-~~v~E~~~~~~~~~~~~~~v 183 (392) T protein:vir:10 106 MSGLTGEDGGLVIPQDIQTQINELARSFDA-LEQYVTVEPVRTRSGSRVLEKNSDMIPF-AEITEMGEIPETDNPKFSNV 183 (392) T ss_pred ccccccCCCceecchhHHHHHHHHHHhhhh-hhhhceeeeccCCceeEEEEeecCCccc-eeecccccccccccccceeE Confidence 22221110000 000111122222222111 22345554422222 223333444443 57755321 12233455 Q ss_pred eeeeecccceeecchhhhhcCCcchhHHHHHHHHHHHHhcHHHHHHHHHhccCCCcccCcccccccccccccccccccch Q lcl|NC_020198. 74 QITNKLFESTVGVKRTDIEDDNLGVYGPLMQEMGRAAGAHPDELVFALLKAGNANLCYDGQNFFDTDHPVYPNVDGTGTA 153 (304) Q Consensus 74 ~i~nk~fe~tv~v~R~~I~dDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~cyDGq~fF~tdH~v~~~~~~tg~~ 153 (304) ++.-++++..+.|+|+.+.|-+..+..-+.+.++++.++..+..+......+ + .... T Consensus 184 ~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~--------~---------------~~~~ 240 (392) T protein:vir:10 184 QYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKL--------T---------------KQAI 240 (392) T ss_pred EeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccc--------c---------------ccCc Confidence 7788899999999999998878888999999999999998888775432211 0 0001 Q ss_pred hhhhhhhcccCCCCccceeccC-CccchhhhhhhccccchhhcccCcccccccccceEEEeeccccccccchhhhhcccc Q lcl|NC_020198. 154 TTVSNLFAPAADPGAAWYLLDT-SRSLKPLIYQERMKPSFTSMTKEDDEQVFMADEYRYGVRSRCNVGFGFWQLAAMSTE 232 (304) Q Consensus 154 ~s~snl~~~~~~~g~~w~L~d~-~~~~kP~i~Q~r~~~~~~~~~~~~~~~vf~~~~~~~Gvd~R~n~G~g~wq~a~~s~~ 232 (304) ++...+ ++. ...+++ ..|.|+. | T Consensus 241 ~~~d~i-------------~~~~~~~l~~--------------------------------~~~~~a~---~-------- 264 (392) T protein:vir:10 241 KSLDDI-------------KDVLNVKLDP--------------------------------AISPNAI---L-------- 264 (392) T ss_pred cCHHHH-------------HHHHHHhhhh--------------------------------hhccCCE---E-------- Confidence 111111 000 000110 0001100 1 Q ss_pred ccchhHHHHHHHHHHHhccCCCceeceecC--------eEEecchHHHHHHHHHhhhccCCCCccee------------c Q lcl|NC_020198. 233 ELNQVNFEKVYDAMRNQKADGGRPLDIRPN--------LLVVPTTLRSKAKEVVGVQRLANGADNPN------------F 292 (304) Q Consensus 233 ~l~~~~l~aar~aM~~~k~~~G~~L~i~P~--------~LvVpp~le~~A~~ll~~~~~~~g~~N~~------------~ 292 (304) -++... +.++++.|+.+|+||= .|+ +|=.||-.... ...+.......+....+ + T Consensus 265 vm~~~~----~~~L~~lkd~~G~~l~-~~~~~~~~~~tllG~~~v~~~~-~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~ 338 (392) T protein:vir:10 265 LTNQDG----FNYLDKLKDKDGKYIL-QSDPTQKNKKLFAGTNPVVVVS-NRFLKSKGTTAKKAPLIIGDLKEAIVLFKR 338 (392) T ss_pred EEcHHH----HHHHHHhhccCCCeEe-ecCccCCccccccCcccEEEec-ccccCCCcccCCceEEEEEehhceEEEEee Confidence 122222 3567778888898872 221 11122210000 00000000111121122 2 Q ss_pred ceeeEEeccccC Q lcl|NC_020198. 293 ELVQVLDTAWLN 304 (304) Q Consensus 293 g~~~~iv~p~Ld 304 (304) .-+++.++++=+ T Consensus 339 ~~~~~~~~~~~~ 350 (392) T protein:vir:10 339 EDMELASTDVGG 350 (392) T ss_pred cceEEEEecccc Confidence 334444444444 No 72 >protein:vir:102873 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338137;genbank:gi:77020198;genbank:GeneID:3703782 Probab=22.34 E-value=2.5 Score=18.44 Aligned_cols=217 Identities=12% Similarity=0.078 Sum_probs=89.7 Q ss_pred CCCccHHHHHH-HHHHHHHHHHHHHhhcchhhcceEEEecCCccc--cccccccCCccchhcccceee----cccccccc Q lcl|NC_020198. 1 MAIITPALISA-LKTSFQKHFQDALATAPSTYLQVATVIPSTTAS--NTYGWLGQFPKLREWIGQRVI----KDMAAQGY 73 (304) Q Consensus 1 maii~~~~l~~-l~~~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~--~~y~~Lg~~P~lrEw~Ge~~~----~~l~~~~~ 73 (304) |..++...=.. +=+-+...+.+.....-+ -.++|+.++-++.. ..+.....-|.. .|++|-.- .+..=..- T Consensus 106 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~-l~~~~~~~~~~~~~~~~~~~~~~~~~~a-~~v~E~~~~~~~~~~~~~~v 183 (392) T protein:vir:10 106 MSGLTGEDGGLVIPQDIQTQINELARSFDA-LEQYVTVEPVRTRSGSRVLEKNSDMIPF-AEITEMGEIPETDNPKFSNV 183 (392) T ss_pred ccccccCCCceecchhHHHHHHHHHHhhhh-hhhhceeeeccCCceeEEEEeecCCccc-eeecccccccccccccceeE Confidence 22221110000 000111122222222111 22345554422222 223333444443 57755321 12233455 Q ss_pred eeeeecccceeecchhhhhcCCcchhHHHHHHHHHHHHhcHHHHHHHHHhccCCCcccCcccccccccccccccccccch Q lcl|NC_020198. 74 QITNKLFESTVGVKRTDIEDDNLGVYGPLMQEMGRAAGAHPDELVFALLKAGNANLCYDGQNFFDTDHPVYPNVDGTGTA 153 (304) Q Consensus 74 ~i~nk~fe~tv~v~R~~I~dDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~cyDGq~fF~tdH~v~~~~~~tg~~ 153 (304) ++.-++++..+.|+|+.+.|-+..+..-+.+.++++.++..+..+......+ + .... T Consensus 184 ~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~--------~---------------~~~~ 240 (392) T protein:vir:10 184 QYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKL--------T---------------KQAI 240 (392) T ss_pred EeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccc--------c---------------ccCc Confidence 7788899999999999998878888999999999999998888775432211 0 0001 Q ss_pred hhhhhhhcccCCCCccceeccC-CccchhhhhhhccccchhhcccCcccccccccceEEEeeccccccccchhhhhcccc Q lcl|NC_020198. 154 TTVSNLFAPAADPGAAWYLLDT-SRSLKPLIYQERMKPSFTSMTKEDDEQVFMADEYRYGVRSRCNVGFGFWQLAAMSTE 232 (304) Q Consensus 154 ~s~snl~~~~~~~g~~w~L~d~-~~~~kP~i~Q~r~~~~~~~~~~~~~~~vf~~~~~~~Gvd~R~n~G~g~wq~a~~s~~ 232 (304) ++...+ ++. ...+++ ..|.|+. | T Consensus 241 ~~~d~i-------------~~~~~~~l~~--------------------------------~~~~~a~---~-------- 264 (392) T protein:vir:10 241 KSLDDI-------------KDVLNVKLDP--------------------------------AISPNAI---L-------- 264 (392) T ss_pred cCHHHH-------------HHHHHHhhhh--------------------------------hhccCCE---E-------- Confidence 111111 000 000110 0001100 1 Q ss_pred ccchhHHHHHHHHHHHhccCCCceeceecC--------eEEecchHHHHHHHHHhhhccCCCCccee------------c Q lcl|NC_020198. 233 ELNQVNFEKVYDAMRNQKADGGRPLDIRPN--------LLVVPTTLRSKAKEVVGVQRLANGADNPN------------F 292 (304) Q Consensus 233 ~l~~~~l~aar~aM~~~k~~~G~~L~i~P~--------~LvVpp~le~~A~~ll~~~~~~~g~~N~~------------~ 292 (304) -++... +.++++.|+.+|+||= .|+ +|=.||-.... ...+.......+....+ + T Consensus 265 vm~~~~----~~~L~~lkd~~G~~l~-~~~~~~~~~~tllG~~~v~~~~-~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~ 338 (392) T protein:vir:10 265 LTNQDG----FNYLDKLKDKDGKYIL-QSDPTQKNKKLFAGTNPVVVVS-NRFLKSKGTTAKKAPLIIGDLKEAIVLFKR 338 (392) T ss_pred EEcHHH----HHHHHHhhccCCCeEe-ecCccCCccccccCcccEEEec-ccccCCCcccCCceEEEEEehhceEEEEee Confidence 122222 3567778888898872 221 11122210000 00000000111121122 2 Q ss_pred ceeeEEeccccC Q lcl|NC_020198. 293 ELVQVLDTAWLN 304 (304) Q Consensus 293 g~~~~iv~p~Ld 304 (304) .-+++.++++=+ T Consensus 339 ~~~~~~~~~~~~ 350 (392) T protein:vir:10 339 EDMELASTDVGG 350 (392) T ss_pred cceEEEEecccc Confidence 334444444444 No 73 >protein:vir:102082 Length: 392 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512315;genbank:gi:89152484;genbank:GeneID:3953075 Probab=22.34 E-value=2.5 Score=18.44 Aligned_cols=217 Identities=12% Similarity=0.078 Sum_probs=89.7 Q ss_pred CCCccHHHHHH-HHHHHHHHHHHHHhhcchhhcceEEEecCCccc--cccccccCCccchhcccceee----cccccccc Q lcl|NC_020198. 1 MAIITPALISA-LKTSFQKHFQDALATAPSTYLQVATVIPSTTAS--NTYGWLGQFPKLREWIGQRVI----KDMAAQGY 73 (304) Q Consensus 1 maii~~~~l~~-l~~~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~--~~y~~Lg~~P~lrEw~Ge~~~----~~l~~~~~ 73 (304) |..++...=.. +=+-+...+.+.....-+ -.++|+.++-++.. ..+.....-|.. .|++|-.- .+..=..- T Consensus 106 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~-l~~~~~~~~~~~~~~~~~~~~~~~~~~a-~~v~E~~~~~~~~~~~~~~v 183 (392) T protein:vir:10 106 MSGLTGEDGGLVIPQDIQTQINELARSFDA-LEQYVTVEPVRTRSGSRVLEKNSDMIPF-AEITEMGEIPETDNPKFSNV 183 (392) T ss_pred ccccccCCCceecchhHHHHHHHHHHhhhh-hhhhceeeeccCCceeEEEEeecCCccc-eeecccccccccccccceeE Confidence 22221110000 000111122222222111 22345554422222 223333444443 57755321 12233455 Q ss_pred eeeeecccceeecchhhhhcCCcchhHHHHHHHHHHHHhcHHHHHHHHHhccCCCcccCcccccccccccccccccccch Q lcl|NC_020198. 74 QITNKLFESTVGVKRTDIEDDNLGVYGPLMQEMGRAAGAHPDELVFALLKAGNANLCYDGQNFFDTDHPVYPNVDGTGTA 153 (304) Q Consensus 74 ~i~nk~fe~tv~v~R~~I~dDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~cyDGq~fF~tdH~v~~~~~~tg~~ 153 (304) ++.-++++..+.|+|+.+.|-+..+..-+.+.++++.++..+..+......+ + .... T Consensus 184 ~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~--------~---------------~~~~ 240 (392) T protein:vir:10 184 QYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKL--------T---------------KQAI 240 (392) T ss_pred EeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccc--------c---------------ccCc Confidence 7788899999999999998878888999999999999998888775432211 0 0001 Q ss_pred hhhhhhhcccCCCCccceeccC-CccchhhhhhhccccchhhcccCcccccccccceEEEeeccccccccchhhhhcccc Q lcl|NC_020198. 154 TTVSNLFAPAADPGAAWYLLDT-SRSLKPLIYQERMKPSFTSMTKEDDEQVFMADEYRYGVRSRCNVGFGFWQLAAMSTE 232 (304) Q Consensus 154 ~s~snl~~~~~~~g~~w~L~d~-~~~~kP~i~Q~r~~~~~~~~~~~~~~~vf~~~~~~~Gvd~R~n~G~g~wq~a~~s~~ 232 (304) ++...+ ++. ...+++ ..|.|+. | T Consensus 241 ~~~d~i-------------~~~~~~~l~~--------------------------------~~~~~a~---~-------- 264 (392) T protein:vir:10 241 KSLDDI-------------KDVLNVKLDP--------------------------------AISPNAI---L-------- 264 (392) T ss_pred cCHHHH-------------HHHHHHhhhh--------------------------------hhccCCE---E-------- Confidence 111111 000 000110 0001100 1 Q ss_pred ccchhHHHHHHHHHHHhccCCCceeceecC--------eEEecchHHHHHHHHHhhhccCCCCccee------------c Q lcl|NC_020198. 233 ELNQVNFEKVYDAMRNQKADGGRPLDIRPN--------LLVVPTTLRSKAKEVVGVQRLANGADNPN------------F 292 (304) Q Consensus 233 ~l~~~~l~aar~aM~~~k~~~G~~L~i~P~--------~LvVpp~le~~A~~ll~~~~~~~g~~N~~------------~ 292 (304) -++... +.++++.|+.+|+||= .|+ +|=.||-.... ...+.......+....+ + T Consensus 265 vm~~~~----~~~L~~lkd~~G~~l~-~~~~~~~~~~tllG~~~v~~~~-~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~ 338 (392) T protein:vir:10 265 LTNQDG----FNYLDKLKDKDGKYIL-QSDPTQKNKKLFAGTNPVVVVS-NRFLKSKGTTAKKAPLIIGDLKEAIVLFKR 338 (392) T ss_pred EEcHHH----HHHHHHhhccCCCeEe-ecCccCCccccccCcccEEEec-ccccCCCcccCCceEEEEEehhceEEEEee Confidence 122222 3567778888898872 221 11122210000 00000000111121122 2 Q ss_pred ceeeEEeccccC Q lcl|NC_020198. 293 ELVQVLDTAWLN 304 (304) Q Consensus 293 g~~~~iv~p~Ld 304 (304) .-+++.++++=+ T Consensus 339 ~~~~~~~~~~~~ 350 (392) T protein:vir:10 339 EDMELASTDVGG 350 (392) T ss_pred cceEEEEecccc Confidence 334444444444 No 74 >protein:vir:105004 Length: 392 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459969;genbank:gi:85701384;genbank:GeneID:3882145 Probab=22.34 E-value=2.5 Score=18.44 Aligned_cols=217 Identities=12% Similarity=0.078 Sum_probs=89.7 Q ss_pred CCCccHHHHHH-HHHHHHHHHHHHHhhcchhhcceEEEecCCccc--cccccccCCccchhcccceee----cccccccc Q lcl|NC_020198. 1 MAIITPALISA-LKTSFQKHFQDALATAPSTYLQVATVIPSTTAS--NTYGWLGQFPKLREWIGQRVI----KDMAAQGY 73 (304) Q Consensus 1 maii~~~~l~~-l~~~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~--~~y~~Lg~~P~lrEw~Ge~~~----~~l~~~~~ 73 (304) |..++...=.. +=+-+...+.+.....-+ -.++|+.++-++.. ..+.....-|.. .|++|-.- .+..=..- T Consensus 106 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~-l~~~~~~~~~~~~~~~~~~~~~~~~~~a-~~v~E~~~~~~~~~~~~~~v 183 (392) T protein:vir:10 106 MSGLTGEDGGLVIPQDIQTQINELARSFDA-LEQYVTVEPVRTRSGSRVLEKNSDMIPF-AEITEMGEIPETDNPKFSNV 183 (392) T ss_pred ccccccCCCceecchhHHHHHHHHHHhhhh-hhhhceeeeccCCceeEEEEeecCCccc-eeecccccccccccccceeE Confidence 22221110000 000111122222222111 22345554422222 223333444443 57755321 12233455 Q ss_pred eeeeecccceeecchhhhhcCCcchhHHHHHHHHHHHHhcHHHHHHHHHhccCCCcccCcccccccccccccccccccch Q lcl|NC_020198. 74 QITNKLFESTVGVKRTDIEDDNLGVYGPLMQEMGRAAGAHPDELVFALLKAGNANLCYDGQNFFDTDHPVYPNVDGTGTA 153 (304) Q Consensus 74 ~i~nk~fe~tv~v~R~~I~dDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~cyDGq~fF~tdH~v~~~~~~tg~~ 153 (304) ++.-++++..+.|+|+.+.|-+..+..-+.+.++++.++..+..+......+ + .... T Consensus 184 ~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~--------~---------------~~~~ 240 (392) T protein:vir:10 184 QYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKL--------T---------------KQAI 240 (392) T ss_pred EeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccc--------c---------------ccCc Confidence 7788899999999999998878888999999999999998888775432211 0 0001 Q ss_pred hhhhhhhcccCCCCccceeccC-CccchhhhhhhccccchhhcccCcccccccccceEEEeeccccccccchhhhhcccc Q lcl|NC_020198. 154 TTVSNLFAPAADPGAAWYLLDT-SRSLKPLIYQERMKPSFTSMTKEDDEQVFMADEYRYGVRSRCNVGFGFWQLAAMSTE 232 (304) Q Consensus 154 ~s~snl~~~~~~~g~~w~L~d~-~~~~kP~i~Q~r~~~~~~~~~~~~~~~vf~~~~~~~Gvd~R~n~G~g~wq~a~~s~~ 232 (304) ++...+ ++. ...+++ ..|.|+. | T Consensus 241 ~~~d~i-------------~~~~~~~l~~--------------------------------~~~~~a~---~-------- 264 (392) T protein:vir:10 241 KSLDDI-------------KDVLNVKLDP--------------------------------AISPNAI---L-------- 264 (392) T ss_pred cCHHHH-------------HHHHHHhhhh--------------------------------hhccCCE---E-------- Confidence 111111 000 000110 0001100 1 Q ss_pred ccchhHHHHHHHHHHHhccCCCceeceecC--------eEEecchHHHHHHHHHhhhccCCCCccee------------c Q lcl|NC_020198. 233 ELNQVNFEKVYDAMRNQKADGGRPLDIRPN--------LLVVPTTLRSKAKEVVGVQRLANGADNPN------------F 292 (304) Q Consensus 233 ~l~~~~l~aar~aM~~~k~~~G~~L~i~P~--------~LvVpp~le~~A~~ll~~~~~~~g~~N~~------------~ 292 (304) -++... +.++++.|+.+|+||= .|+ +|=.||-.... ...+.......+....+ + T Consensus 265 vm~~~~----~~~L~~lkd~~G~~l~-~~~~~~~~~~tllG~~~v~~~~-~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~ 338 (392) T protein:vir:10 265 LTNQDG----FNYLDKLKDKDGKYIL-QSDPTQKNKKLFAGTNPVVVVS-NRFLKSKGTTAKKAPLIIGDLKEAIVLFKR 338 (392) T ss_pred EEcHHH----HHHHHHhhccCCCeEe-ecCccCCccccccCcccEEEec-ccccCCCcccCCceEEEEEehhceEEEEee Confidence 122222 3567778888898872 221 11122210000 00000000111121122 2 Q ss_pred ceeeEEeccccC Q lcl|NC_020198. 293 ELVQVLDTAWLN 304 (304) Q Consensus 293 g~~~~iv~p~Ld 304 (304) .-+++.++++=+ T Consensus 339 ~~~~~~~~~~~~ 350 (392) T protein:vir:10 339 EDMELASTDVGG 350 (392) T ss_pred cceEEEEecccc Confidence 334444444444 No 75 >protein:vir:80684 Length: 315 # NCBI annotation: gp6 # Family: family:all:966 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285582;genbank:gi:148727088;genbank:GeneID:5247055 Probab=21.73 E-value=2.6 Score=18.35 Aligned_cols=235 Identities=13% Similarity=0.078 Sum_probs=97.9 Q ss_pred CCCccHHHHHH-HHHHHHHHHHHHHhhcchhhcceEEEecCCccccccccccCCccchhcccceee---cccccccceee Q lcl|NC_020198. 1 MAIITPALISA-LKTSFQKHFQDALATAPSTYLQVATVIPSTTASNTYGWLGQFPKLREWIGQRVI---KDMAAQGYQIT 76 (304) Q Consensus 1 maii~~~~l~~-l~~~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~lrEw~Ge~~~---~~l~~~~~~i~ 76 (304) ||.-+.+.=.. +-..+...+-+.+.+ .+.-+++|+++|.....-+|..+..-|.- .|+||-.- .+..=..-+++ T Consensus 1 Ma~~~~~~gg~~vP~~~~~~ii~~l~~-~s~i~~l~~~i~~~~~~~~ip~~~~~~~a-~wv~Eg~~~~~s~~~f~~v~l~ 78 (315) T protein:vir:80 1 MADDFLSAGKLELPGSMIGAVRDRAID-SGVLAKLSPEQPTIFGPVKGAVFSGVPRA-KIVGEGEVKPSASVDVSAFTAQ 78 (315) T ss_pred CCCCcCCcCceEcchHHHHHHHHHHHh-hchhhhhcceeecCCCceEEEEEeCCcce-EEeeCCccccccccceeeeEee Confidence 77533221000 011122222222222 23356778888866655666666555543 68866432 22333445667 Q ss_pred eecccceeecchhhhhcCCcch----hHHHHHHHHHHHHhcHHHHHHHHHhccCCCcccCcccccccccccccccccccc Q lcl|NC_020198. 77 NKLFESTVGVKRTDIEDDNLGV----YGPLMQEMGRAAGAHPDELVFALLKAGNANLCYDGQNFFDTDHPVYPNVDGTGT 152 (304) Q Consensus 77 nk~fe~tv~v~R~~I~dDdlG~----~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~cyDGq~fF~tdH~v~~~~~~tg~ 152 (304) -++++..+.||++.+.++.... ..-+.+.++++.++..|+.++. |.++. .|+++ .+... T Consensus 79 ~~kl~~~~~iS~ell~~s~~~~~~~l~~~i~~~la~ai~~~~d~a~~~----G~~~~--~~~~~-----------~~~~~ 141 (315) T protein:vir:80 79 PIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFH----GIDPA--TGKAA-----------SAVHT 141 (315) T ss_pred eeeEEeeehhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHhhheee----ccCCC--CCccc-----------ccccc Confidence 7889999999999998777663 3556777788777776654441 21110 11110 00000 Q ss_pred hhh-hhhhhcccCCCCccceeccCCccchhhhhhhccccchhhcccCcccccccccceEEEeeccccccccchhhhhccc Q lcl|NC_020198. 153 ATT-VSNLFAPAADPGAAWYLLDTSRSLKPLIYQERMKPSFTSMTKEDDEQVFMADEYRYGVRSRCNVGFGFWQLAAMST 231 (304) Q Consensus 153 ~~s-~snl~~~~~~~g~~w~L~d~~~~~kP~i~Q~r~~~~~~~~~~~~~~~vf~~~~~~~Gvd~R~n~G~g~wq~a~~s~ 231 (304) .++ ..+-. ..++..|--+ ++ ++.+. ...+. +.+.+ |-+ T Consensus 142 ~~~~~~~~~---~~~~~~~~d~-----~~-~~~~~------------~~~~~------------~~~~~---~im----- 180 (315) T protein:vir:80 142 SLNKTKNIV---DATDSATADL-----VK-AVGLI------------AGAGL------------QVPNG---VAL----- 180 (315) T ss_pred cccccccee---eccccchHHH-----HH-HHHHH------------hhccC------------ccceE---EEE----- Confidence 000 00000 0001111000 00 00000 00001 11111 111 Q ss_pred cccchhHHHHHHHHHHHhccCCCceeceecCeEE----ecchHHHHHHHHHhhhccCCC----Cc-----------cee- Q lcl|NC_020198. 232 EELNQVNFEKVYDAMRNQKADGGRPLDIRPNLLV----VPTTLRSKAKEVVGVQRLANG----AD-----------NPN- 291 (304) Q Consensus 232 ~~l~~~~l~aar~aM~~~k~~~G~~L~i~P~~Lv----Vpp~le~~A~~ll~~~~~~~g----~~-----------N~~- 291 (304) + .+.+.++++.|+..|++++-.|-+-- .|..|- .+-++-++..+.+ .. +.+ T Consensus 181 ---n----~~~~~~L~~l~~~~g~~~~g~~~~~~~~~g~~~tl~--G~PV~~~~~~~~~~~~~~~~~~~~~~GDfs~~~~ 251 (315) T protein:vir:80 181 ---D----PAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWR--GLNVGASSTVSGAPEMSPASGVKAIVGDFSRVHW 251 (315) T ss_pred ---c----HHHHHHHHHHhhccCCcccccccccccccCCCceec--ceeeEecCcCCcccccccccccEEEEeecccEEE Confidence 1 12357778888888877665442210 111111 1222223333321 11 111 Q ss_pred --cceeeEEeccccC Q lcl|NC_020198. 292 --FELVQVLDTAWLN 304 (304) Q Consensus 292 --~g~~~~iv~p~Ld 304 (304) ++.+++-+.++-| T Consensus 252 g~~~~~~i~i~~~~~ 266 (315) T protein:vir:80 252 GFQRNFPIELIEYGD 266 (315) T ss_pred EEecCeeEEEecccc Confidence 1223343444444 No 76 >protein:vir:8102 Length: 543 # NCBI annotation: gp6 # Family: family:all:21 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817683;genbank:gi:29566114;genbank:GeneID:1259308 Probab=20.34 E-value=2.8 Score=18.14 Aligned_cols=226 Identities=15% Similarity=0.087 Sum_probs=95.3 Q ss_pred CCC--------ccHHHHHHHHHHHHHHHHHHHhhcchhhcceEEEecCCccccccccccCCccchhcccc---eeecccc Q lcl|NC_020198. 1 MAI--------ITPALISALKTSFQKHFQDALATAPSTYLQVATVIPSTTASNTYGWLGQFPKLREWIGQ---RVIKDMA 69 (304) Q Consensus 1 mai--------i~~~~l~~l~~~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~lrEw~Ge---~~~~~l~ 69 (304) +.. |-.... ...+...+.... ...++++.++.+. .-.+.+...-|. -.|+|| ..-.++. T Consensus 251 ~~~t~~~gg~lip~~~~-------~~ii~~~~~~~~-~l~~~~~~~~~~g-~~~~~~~~~~~~-a~~v~Eg~~~~~~~~~ 320 (543) T protein:vir:81 251 MGLTKADGGYLVPFQLD-------PTVIITSNGSLN-DIRRFARQVVATG-DVWHGVSSAAVQ-WSWDAEFEEVSDDSPE 320 (543) T ss_pred cccccccCcccCchhhh-------hHHHHHHHhhhc-hhhhhcccccCCc-ceEEEEecCCcc-eeecccCccccccccc Confidence 000 111111 122333333222 2344555544432 233444444443 356643 3333344 Q ss_pred cccceeeeecccceeecchhhhhcCCcchhHHHHHHHHHHHHhcHHHHHHHHHhccCCCcccCccccccccccccccccc Q lcl|NC_020198. 70 AQGYQITNKLFESTVGVKRTDIEDDNLGVYGPLMQEMGRAAGAHPDELVFALLKAGNANLCYDGQNFFDTDHPVYPNVDG 149 (304) Q Consensus 70 ~~~~~i~nk~fe~tv~v~R~~I~dDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~cyDGq~fF~tdH~v~~~~~~ 149 (304) =..-++..++++..+.|||+.+. |...+..-+...|+++.++..+..++ +| ||.. ++|.|--... T Consensus 321 ~~~i~~~~~k~~~~~~is~ell~-d~~~~~~~i~~~l~~~~~~~~d~ail----~G------~Gt~----~~p~Gi~~~~ 385 (543) T protein:vir:81 321 FGQPEIPVKKAQGFVPISIEALQ-DEANVTETVALLFAEGKDELEAVTLT----TG------TGQG----NQPTGIVTAL 385 (543) T ss_pred cceeeeeeeeeEeeehhhHHHHh-ccHHHHHHHHHHHHHHHHHHHHHHHh----cc------CCCC----cccccchhhc Confidence 45567888999999999999885 66899999999999999998888654 22 2321 2332210000 Q ss_pred ccchhhhhhhhcccCCCCccceeccCCccchhhhhhhccccchhhcccCcccccccccceEEEeeccccccccchhhhhc Q lcl|NC_020198. 150 TGTATTVSNLFAPAADPGAAWYLLDTSRSLKPLIYQERMKPSFTSMTKEDDEQVFMADEYRYGVRSRCNVGFGFWQLAAM 229 (304) Q Consensus 150 tg~~~s~snl~~~~~~~g~~w~L~d~~~~~kP~i~Q~r~~~~~~~~~~~~~~~vf~~~~~~~Gvd~R~n~G~g~wq~a~~ 229 (304) ++... ...++.+.+ ..|-. +.=++... .+ ..+.++ .| T Consensus 386 ~~~~~---~~~~~~~~~-~~~~~------~~~~~~~l----------~~---------------~~~~~~---~~----- 422 (543) T protein:vir:81 386 AGTAA---EIAPVTAET-FALAD------VYAVYEQL----------AA---------------RHRRQG---AW----- 422 (543) T ss_pred ccccc---ccccccccc-ccHHH------HHHHHHhh----------hc---------------cccCCc---EE----- Confidence 11000 010000000 00100 00000000 00 000110 01 Q ss_pred cccccchhHHHHHHHHHHHhccCCCceeceecCeEEecchHHHHHHHHHhhhccCC--------CCcceec--------- Q lcl|NC_020198. 230 STEELNQVNFEKVYDAMRNQKADGGRPLDIRPNLLVVPTTLRSKAKEVVGVQRLAN--------GADNPNF--------- 292 (304) Q Consensus 230 s~~~l~~~~l~aar~aM~~~k~~~G~~L~i~P~~LvVpp~le~~A~~ll~~~~~~~--------g~~N~~~--------- 292 (304) -++. ..+..++++|+..|+||--.+.- =.|+.+- ..-++-++..+. |..-.+. T Consensus 423 ---v~n~----~~~~~l~~lkd~~G~~l~~~~~~-g~~~~l~--G~pv~~~~~~~~~~~~~~~~~~~~i~~gd~~~~~i~ 492 (543) T protein:vir:81 423 ---LANN----LIYNKIRQFDTQGGAGLWTTIGN-GEPSQLL--GRPVGEAEAMDANWNTSASADNFVLLYGNFQNYVIA 492 (543) T ss_pred ---EEcH----HHHHHHHHhhcCCCceeccCcCC-CCCcccc--ceeeEEeccccccccccccCCcceEEEeeccceeEE Confidence 1222 23456778999999987421100 0111111 111111222111 1111111 Q ss_pred --ceeeEEeccccC Q lcl|NC_020198. 293 --ELVQVLDTAWLN 304 (304) Q Consensus 293 --g~~~~iv~p~Ld 304 (304) +-+++.++|+.+ T Consensus 493 ~~~~~~i~~~~~~~ 506 (543) T protein:vir:81 493 DRIGMTVEFIPHLF 506 (543) T ss_pred eecccEEEEecccc Confidence 235666667666 No 77 >protein:vir:80180 Length: 381 # NCBI annotation: capsid protein # Family: family:all:2203 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285797;genbank:gi:148747831;genbank:GeneID:5220456 Probab=20.25 E-value=2.8 Score=18.13 Aligned_cols=214 Identities=11% Similarity=0.057 Sum_probs=104.6 Q ss_pred CCC-----------ccHHHHHHHH-HHHHHHHHHHHhhcchhhcceEEEec-C--CccccccccccCCccchhcc--cce Q lcl|NC_020198. 1 MAI-----------ITPALISALK-TSFQKHFQDALATAPSTYLQVATVIP-S--TTASNTYGWLGQFPKLREWI--GQR 63 (304) Q Consensus 1 mai-----------i~~~~l~~l~-~~~~~~f~~a~~~a~~~~~~~a~~v~-S--~~~~~~y~~Lg~~P~lrEw~--Ge~ 63 (304) ||- ++.+..+++. .-+...++..|+.. .-+..++.+.. + ..++-+...+|. |...... ++. T Consensus 1 ~~~~~~~~~~~~~~~~~t~~~~fiPev~s~~v~~~l~~~-lv~~~l~~~~~~~~~~GdTV~ip~~g~-~~a~d~~~g~~i 78 (381) T protein:vir:80 1 MATIQGTGGYKGSAVDLSNVQVFIPEVWSSEVRMFRDQK-FAALEATKKIPFEGKKGDLIHIPNISR-AAVYDKQPQTPV 78 (381) T ss_pred CceecccccccCcccchhhHHhhhhHHHHHHHHHHHHHh-hhhhhccccccceeecCceEEeeccCc-ceeeeecCCCcc Confidence 332 2333344432 23344444444332 11122222111 0 111122233442 2332222 456 Q ss_pred eecccccccceeeeecc-cceeecchhhhhcCCcchhHHHHHHHHHHHHhcHHHHHHHHHhccCCCcccCcccccccccc Q lcl|NC_020198. 64 VIKDMAAQGYQITNKLF-ESTVGVKRTDIEDDNLGVYGPLMQEMGRAAGAHPDELVFALLKAGNANLCYDGQNFFDTDHP 142 (304) Q Consensus 64 ~~~~l~~~~~~i~nk~f-e~tv~v~R~~I~dDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~cyDGq~fF~tdH~ 142 (304) .+.++.+...+++..++ ...+.|+..|.....+..-....+++|.+-++.-|+.++.++....... -+.. T Consensus 79 ~~~~~~~~~~~itID~~~~~~~~Idd~D~~~~~~D~~~~~~~~~~~aLA~~~D~~i~~~~~~~~~~~--~~~~------- 149 (381) T protein:vir:80 79 NLQARTDSEFTFTVTKYKESSFMIEDIVNTQASYTLRQYYTKEAGYALARDMDNFALAHRAVINAFP--SQRI------- 149 (381) T ss_pred cccccCCceEEEEEeeeeecceeechHHHHhhccChHHHHHHHHHHHHHHHHHHHHHHHHhhccccc--cccc------- Confidence 67778887777877554 4568888888888888899999999999999999999999876521100 0000 Q ss_pred cccccccccchhhhhhhhcccCCCCccceeccCCccchhhhhhhccccchhhcccCcccccccccceEEEeecccccccc Q lcl|NC_020198. 143 VYPNVDGTGTATTVSNLFAPAADPGAAWYLLDTSRSLKPLIYQERMKPSFTSMTKEDDEQVFMADEYRYGVRSRCNVGFG 222 (304) Q Consensus 143 v~~~~~~tg~~~s~snl~~~~~~~g~~w~L~d~~~~~kP~i~Q~r~~~~~~~~~~~~~~~vf~~~~~~~Gvd~R~n~G~g 222 (304) ...+..+...+.. T Consensus 150 -----~t~~~~i~~~~~~-------------------------------------------------------------- 162 (381) T protein:vir:80 150 -----YSYDTTLGDGTVN-------------------------------------------------------------- 162 (381) T ss_pred -----ccccccccccccc-------------------------------------------------------------- Confidence 0000000000000 Q ss_pred chhhhhccccccchhHHHHHHHHHHHhccCCCceeceecCeEEecchHHHHHHH---HHhhhccCCCCcce--------e Q lcl|NC_020198. 223 FWQLAAMSTEELNQVNFEKVYDAMRNQKADGGRPLDIRPNLLVVPTTLRSKAKE---VVGVQRLANGADNP--------N 291 (304) Q Consensus 223 ~wq~a~~s~~~l~~~~l~aar~aM~~~k~~~G~~L~i~P~~LvVpp~le~~A~~---ll~~~~~~~g~~N~--------~ 291 (304) ....+....++.+.+-+|++.|.+.. -|. ..++|||+|.....-++ +++++. +.++. + T Consensus 163 --~~~t~~~~~~t~~~i~~a~~~Lde~~----VP~--egR~lvv~P~~~~~Ll~~~~~~~ad~---~~~~~l~~G~Ig~i 231 (381) T protein:vir:80 163 --AHLTGTPAPLTYAALLLAKQKLDEAD----VPQ--EGRIVMVSPAQYIDLLSINQFISVDF---SQVKPVTSGVVGTI 231 (381) T ss_pred --cccccchhhHHHHHHHHHHHHHhhcC----CCc--CCcEEEeCHHHHHHHhhchhhhhhhh---ccchhhhceeeeEE Confidence 00001112334444555555554332 232 24578888877665443 223222 22222 2 Q ss_pred cceeeEEeccccC Q lcl|NC_020198. 292 FELVQVLDTAWLN 304 (304) Q Consensus 292 ~g~~~~iv~p~Ld 304 (304) .| ++++++++|- T Consensus 232 ~G-~~Vv~Sn~lp 243 (381) T protein:vir:80 232 LG-MEVIVTTQIG 243 (381) T ss_pred cc-eEEEeecccc Confidence 23 7888888886 Done!