Query lcl|NC_013594.1_cdsid_YP_003335782.1 [gene=T] [protein=major head protein] [protein_id=YP_003335782.1] [location=19187..20104] Match_columns 305 No_of_seqs 197 out of 398 Neff 6.3 Searched_HMMs 1612 Date Thu Nov 7 13:07:07 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_34 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_34_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:1991 Length: 305 # 100.0 2E-147 1E-150 824.8 22.1 305 1-305 1-305 (305) 2 protein:vir:99228 Length: 304 100.0 1E-142 9E-146 798.1 19.1 302 1-305 1-303 (304) 3 protein:vir:79246 Length: 304 100.0 3E-142 2E-145 796.2 19.1 302 1-305 1-303 (304) 4 protein:vir:103886 Length: 302 100.0 3.2E-91 2E-94 516.7 19.5 231 1-305 1-231 (302) 5 protein:vir:95512 Length: 693 100.0 6.4E-69 4E-72 394.4 16.1 221 1-305 394-626 (693) 6 protein:vir:79548 Length: 652 100.0 1.3E-66 8E-70 381.8 15.4 218 1-305 359-586 (652) 7 protein:vir:103886 Length: 302 99.9 6.5E-28 4E-31 169.6 -1.9 203 1-234 89-302 (302) 8 protein:vir:108211 Length: 318 96.4 0.00011 7.1E-08 42.2 10.1 210 1-305 21-249 (318) 9 protein:vir:9820 Length: 272 # 95.3 0.0024 1.5E-06 35.0 14.9 186 1-305 1-209 (272) 10 protein:vir:3033 Length: 272 # 95.3 0.0024 1.5E-06 35.0 14.9 186 1-305 1-209 (272) 11 protein:vir:105334 Length: 276 94.4 0.0044 2.7E-06 33.5 14.5 188 1-305 1-210 (276) 12 protein:vir:79928 Length: 393 93.6 0.00028 1.7E-07 40.0 4.0 266 1-305 42-387 (393) 13 protein:vir:95898 Length: 274 92.8 0.01 6.3E-06 31.5 13.9 192 1-305 10-210 (274) 14 protein:vir:96262 Length: 274 92.8 0.01 6.3E-06 31.5 13.9 192 1-305 10-210 (274) 15 protein:vir:3613 Length: 272 # 92.3 0.012 7.4E-06 31.1 13.6 193 1-305 1-208 (272) 16 protein:vir:8187 Length: 311 # 88.6 0.0065 4E-06 32.6 6.2 262 1-270 1-311 (311) 17 protein:vir:96833 Length: 275 85.8 0.05 3.1E-05 27.7 13.6 195 1-305 1-211 (275) 18 protein:vir:93742 Length: 274 85.5 0.053 3.3E-05 27.6 14.0 187 1-305 1-210 (274) 19 protein:vir:1239 Length: 274 # 80.8 0.092 5.7E-05 26.3 13.9 187 1-305 10-210 (274) 20 protein:vir:96123 Length: 274 77.6 0.12 7.6E-05 25.6 13.2 187 1-305 1-210 (274) 21 protein:vir:9759 Length: 303 # 77.4 0.055 3.4E-05 27.5 6.4 256 1-269 1-303 (303) 22 protein:vir:2344 Length: 397 # 75.8 0.14 8.8E-05 25.2 11.9 224 1-305 1-261 (397) 23 protein:vir:103955 Length: 324 75.6 0.15 9E-05 25.2 11.2 225 1-305 30-270 (324) 24 protein:vir:94771 Length: 298 74.9 0.1 6.4E-05 26.0 7.2 249 1-268 1-298 (298) 25 protein:vir:97433 Length: 274 73.9 0.16 0.0001 24.9 13.7 185 1-305 10-210 (274) 26 protein:vir:94494 Length: 274 73.9 0.16 0.0001 24.9 13.7 185 1-305 10-210 (274) 27 protein:vir:96223 Length: 324 71.6 0.19 0.00012 24.5 11.8 225 1-305 4-270 (324) 28 protein:vir:99920 Length: 311 70.1 0.19 0.00012 24.5 7.5 258 1-273 1-311 (311) 29 protein:vir:1638 Length: 298 # 69.6 0.14 9E-05 25.2 6.7 253 1-268 1-298 (298) 30 protein:vir:485 Length: 407 # 69.4 0.22 0.00014 24.2 7.7 244 1-305 96-369 (407) 31 protein:vir:104256 Length: 458 66.1 0.076 4.7E-05 26.7 4.4 239 1-305 143-427 (458) 32 protein:vir:80930 Length: 278 65.5 0.28 0.00018 23.6 14.0 195 1-305 1-217 (278) 33 protein:vir:96762 Length: 632 64.7 0.24 0.00015 24.0 6.8 245 1-281 345-632 (632) 34 protein:vir:8102 Length: 543 # 64.2 0.3 0.00019 23.4 8.4 229 1-305 251-505 (543) 35 protein:vir:100247 Length: 425 62.1 0.28 0.00017 23.6 6.7 233 1-242 120-425 (425) 36 protein:vir:4226 Length: 326 # 61.5 0.35 0.00022 23.1 7.5 228 1-305 1-278 (326) 37 protein:vir:739 Length: 231 # 60.5 0.37 0.00023 22.9 10.3 161 34-305 1-167 (231) 38 protein:vir:94142 Length: 304 60.2 0.054 3.3E-05 27.5 2.5 239 1-268 1-304 (304) 39 protein:vir:105905 Length: 304 60.2 0.054 3.3E-05 27.5 2.5 239 1-268 1-304 (304) 40 protein:vir:4092 Length: 390 # 58.1 0.42 0.00026 22.6 10.4 238 1-305 62-335 (390) 41 protein:vir:7771 Length: 330 # 57.8 0.3 0.00019 23.4 6.1 253 1-270 10-330 (330) 42 protein:vir:95107 Length: 270 57.4 0.43 0.00027 22.6 13.3 181 1-305 1-204 (270) 43 protein:vir:3870 Length: 400 # 56.4 0.46 0.00028 22.4 9.1 213 1-305 123-359 (400) 44 protein:vir:80684 Length: 315 55.7 0.45 0.00028 22.5 6.7 235 1-305 1-265 (315) 45 protein:vir:99749 Length: 324 55.1 0.49 0.0003 22.3 11.2 225 1-305 30-270 (324) 46 protein:vir:9574 Length: 300 # 54.6 0.5 0.00031 22.2 9.0 244 1-253 1-300 (300) 47 protein:vir:104085 Length: 320 53.4 0.38 0.00024 22.9 6.0 249 1-268 7-320 (320) 48 protein:vir:7990 Length: 273 # 52.2 0.56 0.00035 22.0 14.3 199 1-305 1-211 (273) 49 protein:vir:1383 Length: 421 # 49.6 0.48 0.0003 22.3 5.8 276 1-305 90-416 (421) 50 protein:vir:9704 Length: 394 # 48.9 0.21 0.00013 24.3 3.8 243 1-304 137-394 (394) 51 protein:vir:8324 Length: 410 # 47.8 0.48 0.0003 22.3 5.6 216 1-236 141-410 (410) 52 protein:vir:105822 Length: 273 47.7 0.69 0.00043 21.5 14.8 193 1-305 1-211 (273) 53 protein:vir:102605 Length: 273 47.7 0.69 0.00043 21.5 14.8 193 1-305 1-211 (273) 54 protein:vir:2504 Length: 305 # 43.6 0.5 0.00031 22.2 5.0 236 1-268 1-305 (305) 55 protein:vir:191 Length: 385 # 43.2 0.85 0.00053 21.0 7.2 232 1-305 105-349 (385) 56 protein:vir:1886 Length: 385 # 43.2 0.85 0.00053 21.0 7.2 232 1-305 105-349 (385) 57 protein:vir:4456 Length: 401 # 40.7 0.73 0.00045 21.3 5.4 221 1-261 97-401 (401) 58 protein:vir:78523 Length: 338 40.0 0.99 0.00061 20.6 11.2 239 1-305 1-288 (338) 59 protein:vir:78830 Length: 324 32.4 1.4 0.00089 19.7 11.9 224 1-305 4-270 (324) 60 protein:vir:96392 Length: 324 32.4 1.4 0.00089 19.7 11.9 224 1-305 4-270 (324) 61 protein:vir:4953 Length: 397 # 32.3 1.4 0.00089 19.7 7.9 215 1-305 84-367 (397) 62 protein:vir:6242 Length: 390 # 30.8 1.5 0.00096 19.5 5.8 237 1-268 84-390 (390) 63 protein:vir:101650 Length: 497 29.2 1.7 0.001 19.3 7.7 279 1-305 151-458 (497) 64 protein:vir:7855 Length: 497 # 29.2 1.7 0.001 19.3 7.7 279 1-305 151-458 (497) 65 protein:vir:107593 Length: 392 27.9 1.3 0.00083 19.9 4.6 265 1-285 106-392 (392) 66 protein:vir:102082 Length: 392 27.9 1.3 0.00083 19.9 4.6 265 1-285 106-392 (392) 67 protein:vir:102873 Length: 392 27.9 1.3 0.00083 19.9 4.6 265 1-285 106-392 (392) 68 protein:vir:105004 Length: 392 27.9 1.3 0.00083 19.9 4.6 265 1-285 106-392 (392) 69 protein:vir:2430 Length: 318 # 27.4 1.8 0.0011 19.1 9.8 223 1-305 14-268 (318) 70 protein:vir:97148 Length: 324 25.6 2 0.0013 18.9 13.3 226 1-305 1-270 (324) 71 protein:vir:1025 Length: 408 # 22.7 2.4 0.0015 18.5 6.9 214 1-305 106-358 (408) 72 protein:vir:1328 Length: 392 # 22.2 2.5 0.0015 18.4 6.7 235 1-273 97-392 (392) 73 protein:vir:4600 Length: 415 # 21.9 2.5 0.0016 18.4 12.8 235 1-305 99-374 (415) 74 protein:vir:4700 Length: 415 # 21.9 2.5 0.0016 18.4 12.8 235 1-305 99-374 (415) 75 protein:vir:962 Length: 397 # 20.4 2.7 0.0016 18.3 4.7 247 1-268 110-397 (397) No 1 >protein:vir:1991 Length: 305 # NCBI annotation: major head subunit # Family: family:all:776 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050638;genbank:gi:9633525;genbank:GeneID:2636267 Probab=100.00 E-value=2.1e-147 Score=824.75 Aligned_cols=305 Identities=99% Similarity=1.515 Sum_probs=303.4 Q ss_pred CCcCHHHHHHHHHHHHHHHHHHHhhhhhhhhceeeecCCccchhhhhhccCCCchhhhccceeeeccccccceeeeeccc Q lcl|NC_013594. 1 MIVTPASIKALMTSWRKDFQGGLEDAPSQYNKIAMVVNSSTRSNTYGWLGKFPTLKEWVGKRTIQQMEAHGYSIANKTFE 80 (305) Q Consensus 1 M~i~~~~l~~L~~~~~~~~~~~~~~a~~t~~~~a~~v~S~~~~~~y~~Lg~~P~l~ew~Ge~~~~~l~~~~~~i~n~tfg 80 (305) ||||+++|++|+++||+.|++||+.+|++|++||++|||++++++|+|||+||+||||+|||++++|++|+|+|+||+|| T Consensus 1 M~i~~~~l~~l~~~~~~~f~~~~~~a~~~~~~iA~~vpSt~~~~tY~wLg~fP~lrewiGer~i~~l~~~~y~i~Nk~fe 80 (305) T protein:vir:19 1 MIVTPASIKALMTSWRKDFQGGLEDAPSQYNKIAMVVNSSTRSNTYGWLGKFPTLKEWVGKRTIQQMEAHGYSIANKTFE 80 (305) T ss_pred CccCHHHHHHHHHHHHHHHHHHHhhcCcccceEEeEecCCCCcccccccccCCccchhhcceeeeeccccceeEeecccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceeeccHHHhhccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCccccCCcccccccccccccccccchhhhhHhhhc Q lcl|NC_013594. 81 GTVGISRDDFEDDNLGIYAPIFQEMGRSAAVQPDELIFKLLKDGFTQPCYDGQNFFDKEHPVYPNVDGTGSAVNTSNIVE 160 (305) Q Consensus 81 ~~i~i~R~~I~nDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~~~DGk~fF~adH~~~~n~~~tg~~~~~s~l~~ 160 (305) .||+|+|+|||||+||+|++++++||++|++|||+|||+||++||+++|||||+||||||||++|++|+|.+.++|||+. T Consensus 81 ~tV~V~R~dIeDD~lG~y~p~~~~~G~~aa~~pd~lv~~lL~~Gf~~~cyDGq~FFdtDHpv~~~~~~tg~~~~vsn~~~ 160 (305) T protein:vir:19 81 GTVGISRDDFEDDNLGIYAPIFQEMGRSAAVQPDELIFKLLKDGFTQPCYDGQNFFDKEHPVYPNVDGTGSAVNTSNIVE 160 (305) T ss_pred ceeccchhhccccccCchHHHHHHHHHHHhhchhhHHHHHHHhcCCccCCCCCcccCCCCCcccCCcccccccchhhhhc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cCCCccceeeecccccccccchhhcccchhhhhhhccccccchhcccceeeeecccccccchhhhhhhcccccchHHHHH Q lcl|NC_013594. 161 QDSFSGLPFYLLDCSRAVKPLIFQERRKPELVARTRIDDDHVFMDNEFLFGASARRAAGYGFWQMAVAVKGDLTLDNLWK 240 (305) Q Consensus 161 ~~~~~G~~~~l~~~~~~vkP~i~Q~r~~~~f~a~t~~~~~nvf~~~~~~~g~d~r~n~G~g~wq~A~~s~~~Lt~~~l~~ 240 (305) +++++|++|+|+|++++|||+|||+|++|+|+++++|+|+|||+++||+||+|+|||+|||||||||||+++||++||++ T Consensus 161 ~~~~~g~~w~Lld~~~~ikP~I~Q~Rk~~~~~~~~~~~d~~vf~~~e~~ygvd~R~n~Gygfwq~a~gS~~~Ls~~nl~a 240 (305) T protein:vir:19 161 QDSFSGLPFYLLDCSRAVKPLIFQERRKPELVARTRIDDDHVFMDNEFLFGASTRRAAGYGFWQMAVAVKGDLTLDNLWK 240 (305) T ss_pred CCCCCCceeeeeecCCcceeEEEecccccceeeccCCCchhhhhhceeeeeeeeeeeccccchhheecCCCCCCHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHhhcCCCceeccccCeEEecchhHHHHHHHHhhcccCCccccccccccceeeEEecccC Q lcl|NC_013594. 241 GWQLMRAFEGDGGKKLGLKPTHIVVPVGLEKAAEQLLNRELFADGNTTVSNEMKGKLQLVVADYL 305 (305) Q Consensus 241 ar~aM~~~k~~~G~~L~i~P~~LvVpp~le~~A~~ll~~~~~~~~~~~~~N~~~~~~~~iv~p~L 305 (305) ||++|++||+++|+||+|+|++|||||+||.+|+|||+++.+++++++++|||+|++||||+||| T Consensus 241 ar~aM~~qk~d~G~pL~I~P~~LvVPp~LE~~A~qll~s~~i~~g~~~~~Np~~g~~eliV~P~L 305 (305) T protein:vir:19 241 GWQLMRSFEGDGGKKLGLKPTHIVVPVGLEKAAEQLLNRELFADGNTTVSNEMKGKLQLVVADYL 305 (305) T ss_pred HHHHHHhhcCCCCceeeeecCeEEeCchhHHHHHHHHhhcccCCccccccceecceEEEEecccC Confidence 99999999999999999999999999999999999999999999988899999999999999999 No 2 >protein:vir:99228 Length: 304 # NCBI annotation: hypothetical protein # Family: family:all:776 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950457;genbank:gi:119953658;genbank:GeneID:4643088 Probab=100.00 E-value=1.5e-142 Score=798.14 Aligned_cols=302 Identities=49% Similarity=0.888 Sum_probs=298.0 Q ss_pred CC-cCHHHHHHHHHHHHHHHHHHHhhhhhhhhceeeecCCccchhhhhhccCCCchhhhccceeeeccccccceeeeecc Q lcl|NC_013594. 1 MI-VTPASIKALMTSWRKDFQGGLEDAPSQYNKIAMVVNSSTRSNTYGWLGKFPTLKEWVGKRTIQQMEAHGYSIANKTF 79 (305) Q Consensus 1 M~-i~~~~l~~L~~~~~~~~~~~~~~a~~t~~~~a~~v~S~~~~~~y~~Lg~~P~l~ew~Ge~~~~~l~~~~~~i~n~tf 79 (305) |. ||+++|++|+++||+.|++||+.+|+.|+++||+|||++++|+|+|||+||+||||+|||++++|++|+|+|+||+| T Consensus 1 M~ii~~~~L~~l~~~~~~~f~~~~~~a~~~~~~iA~~VpSt~~~~~Y~WLg~~P~mreWiG~r~i~~l~~~~y~I~Nk~f 80 (304) T protein:vir:99 1 MAIITPALISALKTSFQKHFQDALATAPSTYLQVATVIPSTTASNTYGWLGQFPKLREWIGQRVIKDMAAQGYQITNKLF 80 (304) T ss_pred CCccCHHHHHHHHHHHHHHHHHHHhhcCcccceeEeEeecCccccccchhcccccchhhhhhhhhhhhhhccceeecccc Confidence 87 99999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cceeeccHHHhhccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCccccCCcccccccccccccccccchhhhhHhhh Q lcl|NC_013594. 80 EGTVGISRDDFEDDNLGIYAPIFQEMGRSAAVQPDELIFKLLKDGFTQPCYDGQNFFDKEHPVYPNVDGTGSAVNTSNIV 159 (305) Q Consensus 80 g~~i~i~R~~I~nDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~~~DGk~fF~adH~~~~n~~~tg~~~~~s~l~ 159 (305) |.||+|+|+|||||+||+|++++++||++|++|||+|||+||++||+++|||||+||||||||++|++|+|...++||+. T Consensus 81 E~Tv~V~R~dIEDD~~Giy~p~~~~~G~~aa~~Pd~lvf~lL~~Gf~t~CyDGq~FFdtDHpv~~~~dg~g~~~~vsn~~ 160 (304) T protein:vir:99 81 ESTVGVKRTDIEDDNLGVYGPLMQEMGRAAGAHPDELVFALLKAGNANLCYDGQNFFDTDHPVYPNVDGTGTATTVSNLF 160 (304) T ss_pred ccccccccccccccccCchHHHHHHHHHHHhcCchhhHHHHHHhhhcccCCCcccccccCCcccccccccCcccccceec Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccCCCccceeeecccccccccchhhcccchhhhhhhccccccchhcccceeeeecccccccchhhhhhhcccccchHHHH Q lcl|NC_013594. 160 EQDSFSGLPFYLLDCSRAVKPLIFQERRKPELVARTRIDDDHVFMDNEFLFGASARRAAGYGFWQMAVAVKGDLTLDNLW 239 (305) Q Consensus 160 ~~~~~~G~~~~l~~~~~~vkP~i~Q~r~~~~f~a~t~~~~~nvf~~~~~~~g~d~r~n~G~g~wq~A~~s~~~Lt~~~l~ 239 (305) .+.+.+|++|||+|++++|||+|||+|++|+|+++++|+|+|||+++||+||+|+|||+|||||||||||+++||++||+ T Consensus 161 ~~~~~~g~~w~Lld~~r~iKP~I~Q~Rk~~~~~~~~~~~d~~Vf~~~e~~yGvd~R~n~GygfWQlA~gS~a~Lt~~nl~ 240 (304) T protein:vir:99 161 APAADPGAAWYLLDTSRSLKPLIYQERMKPSFTSMTKEDDEQVFMADEYRYGVRSRCNVGFGFWQLAAMSTEELNTANFE 240 (304) T ss_pred cCCCCCCCcEEEEeCCCCccceeeeccccceeeeccCCCchhhhhhcceeEeeeeeeccchhhhhhhhhcCCCcChHHHH Confidence 88888999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhhcCCCceeccccCeEEecchhHHHHHHHHhhcccCCccccccccccceeeEEecccC Q lcl|NC_013594. 240 KGWQLMRAFEGDGGKKLGLKPTHIVVPVGLEKAAEQLLNRELFADGNTTVSNEMKGKLQLVVADYL 305 (305) Q Consensus 240 ~ar~aM~~~k~~~G~~L~i~P~~LvVpp~le~~A~~ll~~~~~~~~~~~~~N~~~~~~~~iv~p~L 305 (305) +||+|||+||+++|+||+|+|++|||||+||.+|+|||+++.+++|. +|||+|+++|||+||| T Consensus 241 aAr~aMr~qk~d~G~pL~I~P~~LvVPp~LE~aA~~ll~a~~~~~G~---tNp~~g~~eliV~P~L 303 (304) T protein:vir:99 241 KVYDAMRNQKADGGRPLDIRPNLLVVPTTLRSKAKEVVGVQRLANGA---DNPNFELVQVLDTAWL 303 (304) T ss_pred HHHHHHHhhcCCCCceeccccCeEEecchHHHHHHHHHhhhccCCCC---cceecceEEEEeeccc Confidence 99999999999999999999999999999999999999999887654 6999999999999999 No 3 >protein:vir:79246 Length: 304 # NCBI annotation: conserved hypothetical protein # Family: family:all:776 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469162;genbank:gi:157835004;genbank:GeneID:5648827 Probab=100.00 E-value=3.3e-142 Score=796.25 Aligned_cols=302 Identities=49% Similarity=0.886 Sum_probs=297.8 Q ss_pred CC-cCHHHHHHHHHHHHHHHHHHHhhhhhhhhceeeecCCccchhhhhhccCCCchhhhccceeeeccccccceeeeecc Q lcl|NC_013594. 1 MI-VTPASIKALMTSWRKDFQGGLEDAPSQYNKIAMVVNSSTRSNTYGWLGKFPTLKEWVGKRTIQQMEAHGYSIANKTF 79 (305) Q Consensus 1 M~-i~~~~l~~L~~~~~~~~~~~~~~a~~t~~~~a~~v~S~~~~~~y~~Lg~~P~l~ew~Ge~~~~~l~~~~~~i~n~tf 79 (305) |. ||+++|++|+++||+.|++||+.+|+.|+++||+|||++++|+|+|||+||+||||+|||++++|++|+|+|+||+| T Consensus 1 M~ii~~~~L~~l~~~~~~~f~~~~~~a~~~~~~iA~~VpSt~~~~tY~WLg~~P~mreWiG~r~i~~l~~~~y~I~Nk~f 80 (304) T protein:vir:79 1 MAIITPALISALKTSFQKHFQDALATAPSTYLQVATVIPSTTASNTYGWLGQFPKLREWIGQRVIKDMAAQGYQITNKLF 80 (304) T ss_pred CCccCHHHHHHHHHHHHHHHHHHHhhcCcccceeEeEeecCccccccchhcccccchhhhhhhhhhhhhhccceeecccc Confidence 87 59999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cceeeccHHHhhccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCccccCCcccccccccccccccccchhhhhHhhh Q lcl|NC_013594. 80 EGTVGISRDDFEDDNLGIYAPIFQEMGRSAAVQPDELIFKLLKDGFTQPCYDGQNFFDKEHPVYPNVDGTGSAVNTSNIV 159 (305) Q Consensus 80 g~~i~i~R~~I~nDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~~~DGk~fF~adH~~~~n~~~tg~~~~~s~l~ 159 (305) |.||+|+|+|||||+||+|++++++||++|++|||+|||+||++||+++|||||+||||||||++|++|+|+..++||+. T Consensus 81 E~Tv~V~R~dIEDD~~Giy~p~~~~~G~~aa~~Pd~lvf~lL~~Gf~t~CyDGq~FFdtDHpv~~~~d~~g~~~~vsn~~ 160 (304) T protein:vir:79 81 ESTVGVKRTDIEDDNLGVYGPLMQEMGRAAGAHPDELVFALLKAGNANLCYDGQNFFDTDHPVYPNVDGTGTATTVSNLF 160 (304) T ss_pred ccceeeccccccccccCchHHHHHHHHHHHhcCchhhHHHHHHhhhcccCCCcccccccCCccccccccccccccceeec Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccCCCccceeeecccccccccchhhcccchhhhhhhccccccchhcccceeeeecccccccchhhhhhhcccccchHHHH Q lcl|NC_013594. 160 EQDSFSGLPFYLLDCSRAVKPLIFQERRKPELVARTRIDDDHVFMDNEFLFGASARRAAGYGFWQMAVAVKGDLTLDNLW 239 (305) Q Consensus 160 ~~~~~~G~~~~l~~~~~~vkP~i~Q~r~~~~f~a~t~~~~~nvf~~~~~~~g~d~r~n~G~g~wq~A~~s~~~Lt~~~l~ 239 (305) .+.+..|++|||+|++++|||+|||+|++++|+++++|+|+|||+++||+||+|+|||+|||||||||||+++||++||+ T Consensus 161 ~~~~~~g~~w~LlD~sr~iKP~I~Q~Rk~~~~~~~~~~~d~~Vf~~~e~~yGvd~R~n~GygfWQlA~gS~a~Ls~~nl~ 240 (304) T protein:vir:79 161 APAADPGAAWYLLDTSRSLKPLIYQERMKPSFTSLTKEDNEQVFMADEYVYGVRSRCNVGFGFWQLAAMSTEELNQVNFE 240 (304) T ss_pred cCCCCCCCeEEEEeCCCcccceeeeccccceeeecCCCCchhhhhhcceEEeeeeeeccchhhhhhhhhcCCccchHHHH Confidence 98888999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhhcCCCceeccccCeEEecchhHHHHHHHHhhcccCCccccccccccceeeEEecccC Q lcl|NC_013594. 240 KGWQLMRAFEGDGGKKLGLKPTHIVVPVGLEKAAEQLLNRELFADGNTTVSNEMKGKLQLVVADYL 305 (305) Q Consensus 240 ~ar~aM~~~k~~~G~~L~i~P~~LvVpp~le~~A~~ll~~~~~~~~~~~~~N~~~~~~~~iv~p~L 305 (305) +||+|||+||+++|+||+|+|++|||||+||.+|+|||+++.+++|. +|||+|+++|||+||| T Consensus 241 aAr~aMr~qk~d~G~pL~I~P~~LvVPp~LE~~A~~ll~a~~~~~G~---tNp~~g~~eliV~P~L 303 (304) T protein:vir:79 241 KVYDAMRNQKADGGRPLDIRPNLLVVPTTLRSKAKEVVGVQRLANGA---DNPNFELVQVLDTAWL 303 (304) T ss_pred HHHHHHHhhcCCCCceeccccCEEEecchhHHHHHHHHhhhhcCCCC---cceecceEEEEeeccc Confidence 99999999999999999999999999999999999999999887765 6999999999999999 No 4 >protein:vir:103886 Length: 302 # NCBI annotation: putative major head subunit protein # Family: family:all:776 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938242;genbank:gi:38229147;genbank:GeneID:2648201 Probab=100.00 E-value=3.2e-91 Score=516.74 Aligned_cols=231 Identities=34% Similarity=0.624 Sum_probs=222.0 Q ss_pred CCcCHHHHHHHHHHHHHHHHHHHhhhhhhhhceeeecCCccchhhhhhccCCCchhhhccceeeeccccccceeeeeccc Q lcl|NC_013594. 1 MIVTPASIKALMTSWRKDFQGGLEDAPSQYNKIAMVVNSSTRSNTYGWLGKFPTLKEWVGKRTIQQMEAHGYSIANKTFE 80 (305) Q Consensus 1 M~i~~~~l~~L~~~~~~~~~~~~~~a~~t~~~~a~~v~S~~~~~~y~~Lg~~P~l~ew~Ge~~~~~l~~~~~~i~n~tfg 80 (305) |+||+++|++|++++||.|++||+.+|++|++||++++|++|+++|.|||+||.|+||+|||++++|+|++|+|+|++|| T Consensus 1 m~it~~~l~~l~~~~~~~~~~~y~~a~~~~~~~a~~~~sdf~~~~~~~lg~~p~l~e~~Ge~~~~~l~~~~~~i~~~~~g 80 (302) T protein:vir:10 1 MLINKQSLNAAFVAIKTIFNNAFAAAPTTWQKIAMEVPSNTSSNDYKWLSTFPKMRRWIGAKVVKNLKAYKYVVENEDFE 80 (302) T ss_pred CcccHHHHHHHHHHHHHHHHHHHHhhhhhhhceeeecCCCcceeeceecCCCCCccccccceeeccccccceeEEeeccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceeeccHHHhhccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCccccCCcccccccccccccccccchhhhhHhhhc Q lcl|NC_013594. 81 GTVGISRDDFEDDNLGIYAPIFQEMGRSAAVQPDELIFKLLKDGFTQPCYDGQNFFDKEHPVYPNVDGTGSAVNTSNIVE 160 (305) Q Consensus 81 ~~i~i~R~~I~nDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~~~DGk~fF~adH~~~~n~~~tg~~~~~s~l~~ 160 (305) ++|+|+||+|||||||+|++++++||++|++||+++||++|++|++++|||||+|||+|||++. T Consensus 81 ~~v~i~R~~i~nDdlg~~~~~~~~~G~aaa~~~~~lv~~~L~~g~~~~~~DG~~fF~~dH~~g~---------------- 144 (302) T protein:vir:10 81 ATVEVDRNDIEDDQIGIYSPQAKMAGYSAAQLPDELVYEAVNGAFTKPCFDGQYFIDTDHPVGD---------------- 144 (302) T ss_pred ceecccHHhhcccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCCcccCCcceecccccccc---------------- Confidence 9999999999999999999999999999999999999999999999999999999999997432 Q ss_pred cCCCccceeeecccccccccchhhcccchhhhhhhccccccchhcccceeeeecccccccchhhhhhhcccccchHHHHH Q lcl|NC_013594. 161 QDSFSGLPFYLLDCSRAVKPLIFQERRKPELVARTRIDDDHVFMDNEFLFGASARRAAGYGFWQMAVAVKGDLTLDNLWK 240 (305) Q Consensus 161 ~~~~~G~~~~l~~~~~~vkP~i~Q~r~~~~f~a~t~~~~~nvf~~~~~~~g~d~r~n~G~g~wq~A~~s~~~Lt~~~l~~ 240 (305) +.|+|.|+++||++ +.+|+.++|.+ T Consensus 145 ----------------------------------------------------~~~~N~g~~~~~~~---~~~l~~~~~~a 169 (302) T protein:vir:10 145 ----------------------------------------------------ASVSNKGTAPLSNA---SQAAAKAGYGA 169 (302) T ss_pred ----------------------------------------------------cccccccchhhhhc---ccccchHHHHH Confidence 23799999999975 67899999999 Q ss_pred HHHHHHHhhcCCCceeccccCeEEecchhHHHHHHHHhhcccCCccccccccccceeeEEecccC Q lcl|NC_013594. 241 GWQLMRAFEGDGGKKLGLKPTHIVVPVGLEKAAEQLLNRELFADGNTTVSNEMKGKLQLVVADYL 305 (305) Q Consensus 241 ar~aM~~~k~~~G~~L~i~P~~LvVpp~le~~A~~ll~~~~~~~~~~~~~N~~~~~~~~iv~p~L 305 (305) +|.+|++||+++|++|+|+|++|||||+||..|++|+.++.++++. +|||+|++++||+||| T Consensus 170 a~~am~~~k~~~G~~L~i~P~~LiVp~~le~~A~~ll~~~~~~~g~---~Np~~g~~~~vv~p~L 231 (302) T protein:vir:10 170 ARTAMKKFKDEEGRSLNVSPNVLLVGPALEDVAKMLLTNPKLADNT---PNPYVGTAELVVDGRI 231 (302) T ss_pred HHHHHHHHhhhcccccccCCCEEEecchhHHHHHHHhhccccCCCC---cceeccceEEEEeecc Confidence 9999999999999999999999999999999999999998876554 6999999999999999 No 5 >protein:vir:95512 Length: 693 # NCBI annotation: Putative Clp protease # Family: family:all:62 # ACLAME annotation(s): go:0008236 - serine-type peptidase activity; phi:0000017 - phage prohead/capsid assembly # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293349;genbank:gi:148912770;genbank:GeneID:5228164 Probab=100.00 E-value=6.4e-69 Score=394.43 Aligned_cols=221 Identities=22% Similarity=0.345 Sum_probs=200.2 Q ss_pred CCc---CHHHHHHHHHHHHHHHHHHHhhhhhhhhceeeecC-CccchhhhhhccCCCchhhhc--cceeeecccccccee Q lcl|NC_013594. 1 MIV---TPASIKALMTSWRKDFQGGLEDAPSQYNKIAMVVN-SSTRSNTYGWLGKFPTLKEWV--GKRTIQQMEAHGYSI 74 (305) Q Consensus 1 M~i---~~~~l~~L~~~~~~~~~~~~~~a~~t~~~~a~~v~-S~~~~~~y~~Lg~~P~l~ew~--Ge~~~~~l~~~~~~i 74 (305) |.+ |+.+-..|...+||.++++|+++|+||++||.+.+ ++||..+...||+||.|+++. |||+|++++|.+++| T Consensus 394 ~a~~htTSDFp~IL~~~~nk~l~~~y~~a~~t~~~~~~~~~~~DFk~~~~~~lg~~~~L~~V~E~gEyk~~t~~e~~e~~ 473 (693) T protein:vir:95 394 LAFTHTSSDFGLILLDVANKSVLAGWEEAEETFPLWTKSGILTDFKPARRVGLGEFSSLRQVREGAEYKYVTLGERGEQI 473 (693) T ss_pred HHHhcCcchhHHHHHHHHHHHHHHHHHhhhhHHHHHhccCCCCcccccceeecCCCCChhhcCCCCceeeeecCCcccee Confidence 444 56677788888999999999999999999998766 899999998899999999984 999999999999999 Q ss_pred eeecccceeeccHHHhhccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCccccCCcccccccccccccccccchhhh Q lcl|NC_013594. 75 ANKTFEGTVGISRDDFEDDNLGIYAPIFQEMGRSAAVQPDELIFKLLKDGFTQPCYDGQNFFDKEHPVYPNVDGTGSAVN 154 (305) Q Consensus 75 ~n~tfg~~i~i~R~~I~nDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~~~DGk~fF~adH~~~~n~~~tg~~~~ 154 (305) +++|||++|+||||+|||||||+|+++++.||++|+++++++||.+|.+ ||+|+|||+|||+||. |+. +| T Consensus 474 ~l~tyG~~~~iTRqaiINDDLga~~~ip~~~g~aA~~~~~~~vy~~L~~--Np~m~DGk~LFhadH~---Nl~-tg---- 543 (693) T protein:vir:95 474 ILATYGELFSITRQAIINDDLQMLSDIPFKLGQAAKATIGDLVYAVLTG--NPAMSDGKTLFHADHS---NLL-TG---- 543 (693) T ss_pred ehhhcCCeeeecHHhhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHhc--CccccCCcceeecccc---ccc-cc---- Confidence 9999999999999999999999999999999999999999999999998 8999999999999994 321 00 Q ss_pred hHhhhccCCCccceeeecccccccccchhhcccchhhhhhhccccccchhcccceeeeecccccccchhhhhhhcccccc Q lcl|NC_013594. 155 TSNIVEQDSFSGLPFYLLDCSRAVKPLIFQERRKPELVARTRIDDDHVFMDNEFLFGASARRAAGYGFWQMAVAVKGDLT 234 (305) Q Consensus 155 ~s~l~~~~~~~G~~~~l~~~~~~vkP~i~Q~r~~~~f~a~t~~~~~nvf~~~~~~~g~d~r~n~G~g~wq~A~~s~~~Lt 234 (305) +..+|+ T Consensus 544 --------------------------------------------------------------------------a~sals 549 (693) T protein:vir:95 544 --------------------------------------------------------------------------AASALS 549 (693) T ss_pred --------------------------------------------------------------------------cccccC Confidence 012689 Q ss_pred hHHHHHHHHHHHHhhcC----CCceeccccCeEEecchhHHHHHHHHhhcccCCc--cccccccccceeeEEecccC Q lcl|NC_013594. 235 LDNLWKGWQLMRAFEGD----GGKKLGLKPTHIVVPVGLEKAAEQLLNRELFADG--NTTVSNEMKGKLQLVVADYL 305 (305) Q Consensus 235 ~~~l~~ar~aM~~~k~~----~G~~L~i~P~~LvVpp~le~~A~~ll~~~~~~~~--~~~~~N~~~~~~~~iv~p~L 305 (305) +++|.+||++|+.||+. +|++|+|+|+||||||+||.+|+||++++.+++. ++++.||+++.++||++||| T Consensus 550 ~~sl~~a~~am~~qk~~~~~~~g~~L~i~P~~llvP~~le~~a~~l~~s~~~~~a~~~~~~~NP~~~~~~vi~~prL 626 (693) T protein:vir:95 550 IDSLSKAKTQMATQKAQVEKGKGRTLNIRPGFVLTPVALEDKANQIINSESVPGADVNSGIVNPIRAFAQVIGEPRL 626 (693) T ss_pred hHHHHHHHHHHHHhhcchhccCCceeecccceEEecchHHHHHHHHhccccccccccccccccchhcccccccccee Confidence 99999999999999964 6899999999999999999999999999988864 45678999999999999999 No 6 >protein:vir:79548 Length: 652 # NCBI annotation: putative protease/scaffold protein # Family: family:all:62 # ACLAME annotation(s): go:0008236 - serine-type peptidase activity; phi:0000017 - phage prohead/capsid assembly # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272518;genbank:gi:148609387;genbank:GeneID:5204384 Probab=100.00 E-value=1.3e-66 Score=381.82 Aligned_cols=218 Identities=22% Similarity=0.329 Sum_probs=198.2 Q ss_pred CCc---CHHHHHHHHHHHHHHHHHHHhhhhhhhhceeeecC-CccchhhhhhccCCCchhhhc--cceeeecccccccee Q lcl|NC_013594. 1 MIV---TPASIKALMTSWRKDFQGGLEDAPSQYNKIAMVVN-SSTRSNTYGWLGKFPTLKEWV--GKRTIQQMEAHGYSI 74 (305) Q Consensus 1 M~i---~~~~l~~L~~~~~~~~~~~~~~a~~t~~~~a~~v~-S~~~~~~y~~Lg~~P~l~ew~--Ge~~~~~l~~~~~~i 74 (305) +.+ |+.+-..|..++||.++++|+++|+||++||.+.+ ++||..+...||+||.|+++. |||++++++|++++| T Consensus 359 ~A~~hsTsDFp~IL~~~~nk~l~~~y~~a~~t~~~~~~~~~~~DFk~~~~~~lg~~~~L~~V~E~gEyk~~t~~e~~e~~ 438 (652) T protein:vir:79 359 AAFTHSTSDFGNILLDVANKAILQGWEDAPETYEQWTRKGQLSDFKIAHRVGMGGFSALRQVREGAEYKYVTTGDKQATI 438 (652) T ss_pred HHhhcCcchHHHHHHHHHHHHHHHHHhhhHHHHHHHhccCCCccccccceeecCCCCCccccCCCCccceeeecCcccee Confidence 333 66778888899999999999999999999998866 899999999999999999994 999999999999999 Q ss_pred eeecccceeeccHHHhhccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCcccc-CCcccc-cccccccccccccchh Q lcl|NC_013594. 75 ANKTFEGTVGISRDDFEDDNLGIYAPIFQEMGRSAAVQPDELIFKLLKDGFTQPCY-DGQNFF-DKEHPVYPNVDGTGSA 152 (305) Q Consensus 75 ~n~tfg~~i~i~R~~I~nDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~~~-DGk~fF-~adH~~~~n~~~tg~~ 152 (305) +++|||++|+||||+|||||||+|+++|+.||++|+++++++||++|.+ ||+|+ |||+|| |+||. |+.+ T Consensus 439 ~l~tyG~~~~iTRqaiINDDL~a~~~ip~~~g~aA~~~~~~~vy~~l~~--Np~~~~DGk~LF~hA~H~---Nl~~---- 509 (652) T protein:vir:79 439 ALATYGELFSITRQAIINDDLNMLTDVPMKLGRAAKSTIADLVYAILTS--NPKISTDNVSLFDKAKHA---NVLE---- 509 (652) T ss_pred eeecccCeeeeehheeeccchhHHHHHHHHHHHHHHHHHHHHHHHHHhc--CcccccCCceeecccccc---cccc---- Confidence 9999999999999999999999999999999999999999999999998 89996 999999 89994 3310 Q ss_pred hhhHhhhccCCCccceeeecccccccccchhhcccchhhhhhhccccccchhcccceeeeecccccccchhhhhhhcccc Q lcl|NC_013594. 153 VNTSNIVEQDSFSGLPFYLLDCSRAVKPLIFQERRKPELVARTRIDDDHVFMDNEFLFGASARRAAGYGFWQMAVAVKGD 232 (305) Q Consensus 153 ~~~s~l~~~~~~~G~~~~l~~~~~~vkP~i~Q~r~~~~f~a~t~~~~~nvf~~~~~~~g~d~r~n~G~g~wq~A~~s~~~ 232 (305) .++ T Consensus 510 -----------------------------------------------------------------------------~aa 512 (652) T protein:vir:79 510 -----------------------------------------------------------------------------SAA 512 (652) T ss_pred -----------------------------------------------------------------------------ccc Confidence 125 Q ss_pred cchHHHHHHHHHHHHhhcCCCceeccccCeEEecchhHHHHHHHHhhcccCCc--cccccccccceeeEEecccC Q lcl|NC_013594. 233 LTLDNLWKGWQLMRAFEGDGGKKLGLKPTHIVVPVGLEKAAEQLLNRELFADG--NTTVSNEMKGKLQLVVADYL 305 (305) Q Consensus 233 Lt~~~l~~ar~aM~~~k~~~G~~L~i~P~~LvVpp~le~~A~~ll~~~~~~~~--~~~~~N~~~~~~~~iv~p~L 305 (305) |++++|.+||++|++||+ ++++|+|+|+||||||+||.+|+||+++..+++. ++++.||+++.+++||+||| T Consensus 513 ~~~~~l~~ar~aM~~Qk~-g~~~l~i~P~~llvp~~le~~a~~ll~s~~v~~a~~~~~~~Np~~~~~~~i~eprL 586 (652) T protein:vir:79 513 MDVASLDKARQLMRVQKE-GERHLNIRPAFVLVPTAMESVANQVIRSSSVKGADINAGIINPVKDFATVIAEPRL 586 (652) T ss_pred CCHHHHHHHHHHHHHhcc-CCccccccccEEEecchhHHHHHHHhccCCCccccccccccccccccccccccccc Confidence 778899999999999996 4478999999999999999999999998888755 44679999999999999999 No 7 >protein:vir:103886 Length: 302 # NCBI annotation: putative major head subunit protein # Family: family:all:776 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938242;genbank:gi:38229147;genbank:GeneID:2648201 Probab=99.87 E-value=6.5e-28 Score=169.63 Aligned_cols=203 Identities=26% Similarity=0.351 Sum_probs=125.7 Q ss_pred CCcCHHHHHHHHHHHHHHHHHHHhhhh-----hhhhceeeecCCcc-chhhhhhccCCCchhhhccceeeecccccccee Q lcl|NC_013594. 1 MIVTPASIKALMTSWRKDFQGGLEDAP-----SQYNKIAMVVNSST-RSNTYGWLGKFPTLKEWVGKRTIQQMEAHGYSI 74 (305) Q Consensus 1 M~i~~~~l~~L~~~~~~~~~~~~~~a~-----~t~~~~a~~v~S~~-~~~~y~~Lg~~P~l~ew~Ge~~~~~l~~~~~~i 74 (305) +|||... .+ +.+.+++-=+++. --|..+..-. +.. ....+-+--++|-. ... . T Consensus 89 ~i~nDdl-g~----~~~~~~~~G~aaa~~~~~lv~~~L~~g~-~~~~~DG~~fF~~dH~~g-----~~~----------~ 147 (302) T protein:vir:10 89 DIEDDQI-GI----YSPQAKMAGYSAAQLPDELVYEAVNGAF-TKPCFDGQYFIDTDHPVG-----DAS----------V 147 (302) T ss_pred hhccccc-ch----hHHHHHHHHHHHHhhHHHHHHHHHhccC-CCcccCCcceeccccccc-----ccc----------c Confidence 6666432 11 1111111111111 1122221100 000 00000001111111 111 1 Q ss_pred eeecccceeeccHHHhhccccc----hhHHHHHHHHHHHHhhHHHHHHH-HHhccCCccccCCccccccccccccccccc Q lcl|NC_013594. 75 ANKTFEGTVGISRDDFEDDNLG----IYAPIFQEMGRSAAVQPDELIFK-LLKDGFTQPCYDGQNFFDKEHPVYPNVDGT 149 (305) Q Consensus 75 ~n~tfg~~i~i~R~~I~nDdlG----~~~~~~~~~G~aAa~~~~~lv~~-lL~~g~~~~~~DGk~fF~adH~~~~n~~~t 149 (305) .|..++. +..+..++.-+.++ ++..+....|+....+|+.||+. .|..++...|++++++++++||+.+. T Consensus 148 ~N~g~~~-~~~~~~~l~~~~~~aa~~am~~~k~~~G~~L~i~P~~LiVp~~le~~A~~ll~~~~~~~g~~Np~~g~---- 222 (302) T protein:vir:10 148 SNKGTAP-LSNASQAAAKAGYGAARTAMKKFKDEEGRSLNVSPNVLLVGPALEDVAKMLLTNPKLADNTPNPYVGT---- 222 (302) T ss_pred ccccchh-hhhcccccchHHHHHHHHHHHHHhhhcccccccCCCEEEecchhHHHHHHHhhccccCCCCcceeccc---- Confidence 1222211 11222344444444 34445555699999999999996 67777788999999999999997532 Q ss_pred chhhhhHhhhccCCCccceeeecccccccccchhhcccchhhhhhhccccccchhcccceeeeecccccccchhhhhhhc Q lcl|NC_013594. 150 GSAVNTSNIVEQDSFSGLPFYLLDCSRAVKPLIFQERRKPELVARTRIDDDHVFMDNEFLFGASARRAAGYGFWQMAVAV 229 (305) Q Consensus 150 g~~~~~s~l~~~~~~~G~~~~l~~~~~~vkP~i~Q~r~~~~f~a~t~~~~~nvf~~~~~~~g~d~r~n~G~g~wq~A~~s 229 (305) ... +....-.+|++|||++..+.++|++||.|+.|++++++++++++||++.++.||||+|+|+|||||||||+| T Consensus 223 --~~~---vv~p~L~s~~aWyL~a~~~~i~~~~l~g~~~P~~~~~~~~~~dgv~~k~~~d~Gvd~R~~~G~~~wq~a~~s 297 (302) T protein:vir:10 223 --AEL---VVDGRIESDTAWFLLDTTKPVKPFIFQPRKQPEFVSQVNLDSDDVFNLRKLKFGAEARAAAGYGFWQLAYGS 297 (302) T ss_pred --eEE---EEeeccCCCCceEEEecCCccceEEEcCccccEEEeccCCCCCceEEEEEEEEeeeeeeecchhhhhhhhcc Confidence 111 222233478999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccc Q lcl|NC_013594. 230 KGDLT 234 (305) Q Consensus 230 ~~~Lt 234 (305) +++-+ T Consensus 298 ~g~~~ 302 (302) T protein:vir:10 298 TGTGA 302 (302) T ss_pred CccCC Confidence 98766 No 8 >protein:vir:108211 Length: 318 # NCBI annotation: gp9 # Family: family:all:6420 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552338;genbank:gi:160700658;genbank:GeneID:5758931 Probab=96.40 E-value=0.00011 Score=42.20 Aligned_cols=210 Identities=14% Similarity=0.099 Sum_probs=103.8 Q ss_pred CCcCHHHHHH-HHHHHHHHHHHHHhhhhhhhhceeeecCCc-cchhhhhhc-cC--CCchhhh--ccceeeeccccccce Q lcl|NC_013594. 1 MIVTPASIKA-LMTSWRKDFQGGLEDAPSQYNKIAMVVNSS-TRSNTYGWL-GK--FPTLKEW--VGKRTIQQMEAHGYS 73 (305) Q Consensus 1 M~i~~~~l~~-L~~~~~~~~~~~~~~a~~t~~~~a~~v~S~-~~~~~y~~L-g~--~P~l~ew--~Ge~~~~~l~~~~~~ 73 (305) |+=+|..|.. +...+.+.| -++.-+++. .++ ..+-.|.-. .. .....++ .||+.........-. T Consensus 21 ll~~P~~I~~~i~e~~~~~~-----iad~lf~~~----~a~~~~~v~f~~~~p~~~~~d~e~VaEggEiP~~~~~~G~~~ 91 (318) T protein:vir:10 21 LVGNPLWIPTALKKMMVNQF-----ISESLFRNG----GANPNGVVAYNEGNPSFLEDDVADVAEFGEIPVSAGARGLPR 91 (318) T ss_pred hhCCchhHHHHHHHHHhccc-----hhhhhhhcc----cccccceeEEEecccccccCcHhhccCcccccccCCCCCchh Confidence 3333444321 111111111 112222222 111 011111000 00 0122332 377776666554444 Q ss_pred e-eeecccceeeccHHHhhccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCccccCCcccccccccccccccccchh Q lcl|NC_013594. 74 I-ANKTFEGTVGISRDDFEDDNLGIYAPIFQEMGRSAAVQPDELIFKLLKDGFTQPCYDGQNFFDKEHPVYPNVDGTGSA 152 (305) Q Consensus 74 i-~n~tfg~~i~i~R~~I~nDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~~~DGk~fF~adH~~~~n~~~tg~~ 152 (305) | +.++||..+.||++++..-+.+.+.+.+.+++.+.+++.+..+|+.|.++..+.. ++. ++... T Consensus 92 ia~~~K~G~~~~vS~Em~~~n~~~~v~r~~~~l~Nti~r~~d~~a~dal~sa~t~~~-----------~~s----~~w~~ 156 (318) T protein:vir:10 92 TAFAVKKALGVRVSKEMIDENRVGAVNDQMLQLRNTFIRANDRSAKALLQSPIVPTL-----------AVP----TAWDN 156 (318) T ss_pred hhhhehhccceeccHHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccc-----------cCC----cCCCC Confidence 5 4579999999999999999999999999999999999999999999987544321 110 00000 Q ss_pred hhhHhhhccCCCccceeeecccccccccchhhcccchhhhhhhccccccchhcccceeeeecccccccchhhhhhhcccc Q lcl|NC_013594. 153 VNTSNIVEQDSFSGLPFYLLDCSRAVKPLIFQERRKPELVARTRIDDDHVFMDNEFLFGASARRAAGYGFWQMAVAVKGD 232 (305) Q Consensus 153 ~~~s~l~~~~~~~G~~~~l~~~~~~vkP~i~Q~r~~~~f~a~t~~~~~nvf~~~~~~~g~d~r~n~G~g~wq~A~~s~~~ 232 (305) .+++.. |...+ .+.++..++ T Consensus 157 ------------~~~~~~--d~~~A-----------~e~v~~a~~----------------------------------- 176 (318) T protein:vir:10 157 ------------GGKVRT--DIAIA-----------IEQISTAAP----------------------------------- 176 (318) T ss_pred ------------cccccc--cchhh-----------hhhhhhhhh----------------------------------- Confidence 000000 00000 000000000 Q ss_pred cchHHHHHHHHHHHHhhcCCCceeccccCeEEecchhHHHH------HHHHhhcc--cCCcccccccc---ccceeeEEe Q lcl|NC_013594. 233 LTLDNLWKGWQLMRAFEGDGGKKLGLKPTHIVVPVGLEKAA------EQLLNREL--FADGNTTVSNE---MKGKLQLVV 301 (305) Q Consensus 233 Lt~~~l~~ar~aM~~~k~~~G~~L~i~P~~LvVpp~le~~A------~~ll~~~~--~~~~~~~~~N~---~~~~~~~iv 301 (305) .++-.......+..+-+|+.||++|.....- +.++..+. +..+...+.+- +.| +++|+ T Consensus 177 ----------~~~~a~~~~~~~~~GY~pdtIVlhP~~~~~l~~n~~~~~~y~~~a~~~~~~~~~tg~~~g~~lG-l~vi~ 245 (318) T protein:vir:10 177 ----------TAYPAGVGSSDEYFGFIPDTIVMHYALLPILMDNENFMKVYERNANYVSTAPDWTGNFPGSVMG-LNVIR 245 (318) T ss_pred ----------hhhhhhhhhhhhccCccceeeEECHHHHHHHhcchhhhhhhhccchhhhhcccccccccceeec-eEEee Confidence 0111112222357889999999999988776 44432211 00000001121 123 99999 Q ss_pred cccC Q lcl|NC_013594. 302 ADYL 305 (305) Q Consensus 302 ~p~L 305 (305) +|.+ T Consensus 246 s~~~ 249 (318) T protein:vir:10 246 SRTF 249 (318) T ss_pred cCcc Confidence 9999 No 9 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=95.28 E-value=0.0024 Score=34.97 Aligned_cols=186 Identities=15% Similarity=0.113 Sum_probs=101.6 Q ss_pred CCcC---------HHHHHHHHHHHHHHHHHHHhhhhhhhhceeeec----CCccchhhhhhccCCCchhhhc---cceee Q lcl|NC_013594. 1 MIVT---------PASIKALMTSWRKDFQGGLEDAPSQYNKIAMVV----NSSTRSNTYGWLGKFPTLKEWV---GKRTI 64 (305) Q Consensus 1 M~i~---------~~~l~~L~~~~~~~~~~~~~~a~~t~~~~a~~v----~S~~~~~~y~~Lg~~P~l~ew~---Ge~~~ 64 (305) |..+ |+.+..+ +...+.+.. -+..++..- ......-++......+. -+|+ ++... T Consensus 1 MA~~~T~~~~~~iPev~s~~---v~~~~~~~~-----~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~-a~~v~eg~~i~~ 71 (272) T protein:vir:98 1 MAVGTTKMAQMLDPEVLADM---IDAEVGKAI-----RFAPLAEVDTTLEGQPGTTLTVPKWDYIGD-AEDVAEGEAIPM 71 (272) T ss_pred CCCccccchheechHHHHHH---HHHHHHHHh-----hhhccccccccccCCCCCEEEEEEecCCCC-cccccCCCcccc Confidence 8743 3333333 222222211 123333221 11111111222222233 2343 44556 Q ss_pred eccccccceeeeecccceeeccHHHhhccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCccccCCcccccccccccc Q lcl|NC_013594. 65 QQMEAHGYSIANKTFEGTVGISRDDFEDDNLGIYAPIFQEMGRSAAVQPDELIFKLLKDGFTQPCYDGQNFFDKEHPVYP 144 (305) Q Consensus 65 ~~l~~~~~~i~n~tfg~~i~i~R~~I~nDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~~~DGk~fF~adH~~~~ 144 (305) .++....-++..++++..+.|+++++....-.+...+.+.++++.++..++.+++.|....+. T Consensus 72 ~~~~~~~~~~~~~~~~~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a~~~----------------- 134 (272) T protein:vir:98 72 TQLGFKKTTMTIKKAGKGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSKSTQT----------------- 134 (272) T ss_pred cccccceEEEEeeeeeeeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccc----------------- Confidence 667767778888999999999999999988889999999999999999998888765431000 Q ss_pred cccccchhhhhHhhhccCCCccceeeecccccccccchhhcccchhhhhhhccccccchhcccceeeeecccccccchhh Q lcl|NC_013594. 145 NVDGTGSAVNTSNIVEQDSFSGLPFYLLDCSRAVKPLIFQERRKPELVARTRIDDDHVFMDNEFLFGASARRAAGYGFWQ 224 (305) Q Consensus 145 n~~~tg~~~~~s~l~~~~~~~G~~~~l~~~~~~vkP~i~Q~r~~~~f~a~t~~~~~nvf~~~~~~~g~d~r~n~G~g~wq 224 (305) T Consensus 135 -------------------------------------------------------------------------------- 134 (272) T protein:vir:98 135 -------------------------------------------------------------------------------- 134 (272) T ss_pred -------------------------------------------------------------------------------- Confidence Q ss_pred hhhhcccccchHHHHHHHHHHHHhhcCCCceeccccCeEEecchhHHHHHHHHhhcccCCccc---cccc----ccccee Q lcl|NC_013594. 225 MAVAVKGDLTLDNLWKGWQLMRAFEGDGGKKLGLKPTHIVVPVGLEKAAEQLLNRELFADGNT---TVSN----EMKGKL 297 (305) Q Consensus 225 ~A~~s~~~Lt~~~l~~ar~aM~~~k~~~G~~L~i~P~~LvVpp~le~~A~~ll~~~~~~~~~~---~~~N----~~~~~~ 297 (305) .+++.+.+.+..|+..+.. .+ ..+++++|+|.....-++.-.-+....+.. ...| -+.| + T Consensus 135 ----~~~~~t~d~i~da~~~l~~----~~----~~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~g~ig~i~G-~ 201 (272) T protein:vir:98 135 ----VEATATVDGVSKALDIFND----ED----DAETVIVMNPADASTLRLDAAKEWLGATEVGANRVVSGVYGEVLG-V 201 (272) T ss_pred ----cccccCHHHHHHHHHHHhc----cC----CCccEEEEcHHHHHHHHHhccccccccccccccccccccchhhcC-e Confidence 0011234556666655532 22 336789999986554443311122211111 0111 1334 6 Q ss_pred eEEecccC Q lcl|NC_013594. 298 QLVVADYL 305 (305) Q Consensus 298 ~~iv~p~L 305 (305) .+|+++.+ T Consensus 202 ~Vi~s~~~ 209 (272) T protein:vir:98 202 QIVRSRKC 209 (272) T ss_pred eEEEcCCC Confidence 89999999 No 10 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=95.28 E-value=0.0024 Score=34.97 Aligned_cols=186 Identities=15% Similarity=0.113 Sum_probs=101.6 Q ss_pred CCcC---------HHHHHHHHHHHHHHHHHHHhhhhhhhhceeeec----CCccchhhhhhccCCCchhhhc---cceee Q lcl|NC_013594. 1 MIVT---------PASIKALMTSWRKDFQGGLEDAPSQYNKIAMVV----NSSTRSNTYGWLGKFPTLKEWV---GKRTI 64 (305) Q Consensus 1 M~i~---------~~~l~~L~~~~~~~~~~~~~~a~~t~~~~a~~v----~S~~~~~~y~~Lg~~P~l~ew~---Ge~~~ 64 (305) |..+ |+.+..+ +...+.+.. -+..++..- ......-++......+. -+|+ ++... T Consensus 1 MA~~~T~~~~~~iPev~s~~---v~~~~~~~~-----~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~-a~~v~eg~~i~~ 71 (272) T protein:vir:30 1 MAVGTTKMAQMLDPEVLADM---IDAEVGKAI-----RFAPLAEVDTTLEGQPGTTLTVPKWDYIGD-AEDVAEGEAIPM 71 (272) T ss_pred CCCccccchheechHHHHHH---HHHHHHHHh-----hhhccccccccccCCCCCEEEEEEecCCCC-cccccCCCcccc Confidence 8743 3333333 222222211 123333221 11111111222222233 2343 44556 Q ss_pred eccccccceeeeecccceeeccHHHhhccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCccccCCcccccccccccc Q lcl|NC_013594. 65 QQMEAHGYSIANKTFEGTVGISRDDFEDDNLGIYAPIFQEMGRSAAVQPDELIFKLLKDGFTQPCYDGQNFFDKEHPVYP 144 (305) Q Consensus 65 ~~l~~~~~~i~n~tfg~~i~i~R~~I~nDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~~~DGk~fF~adH~~~~ 144 (305) .++....-++..++++..+.|+++++....-.+...+.+.++++.++..++.+++.|....+. T Consensus 72 ~~~~~~~~~~~~~~~~~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a~~~----------------- 134 (272) T protein:vir:30 72 TQLGFKKTTMTIKKAGKGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSKSTQT----------------- 134 (272) T ss_pred cccccceEEEEeeeeeeeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccc----------------- Confidence 667767778888999999999999999988889999999999999999998888765431000 Q ss_pred cccccchhhhhHhhhccCCCccceeeecccccccccchhhcccchhhhhhhccccccchhcccceeeeecccccccchhh Q lcl|NC_013594. 145 NVDGTGSAVNTSNIVEQDSFSGLPFYLLDCSRAVKPLIFQERRKPELVARTRIDDDHVFMDNEFLFGASARRAAGYGFWQ 224 (305) Q Consensus 145 n~~~tg~~~~~s~l~~~~~~~G~~~~l~~~~~~vkP~i~Q~r~~~~f~a~t~~~~~nvf~~~~~~~g~d~r~n~G~g~wq 224 (305) T Consensus 135 -------------------------------------------------------------------------------- 134 (272) T protein:vir:30 135 -------------------------------------------------------------------------------- 134 (272) T ss_pred -------------------------------------------------------------------------------- Confidence Q ss_pred hhhhcccccchHHHHHHHHHHHHhhcCCCceeccccCeEEecchhHHHHHHHHhhcccCCccc---cccc----ccccee Q lcl|NC_013594. 225 MAVAVKGDLTLDNLWKGWQLMRAFEGDGGKKLGLKPTHIVVPVGLEKAAEQLLNRELFADGNT---TVSN----EMKGKL 297 (305) Q Consensus 225 ~A~~s~~~Lt~~~l~~ar~aM~~~k~~~G~~L~i~P~~LvVpp~le~~A~~ll~~~~~~~~~~---~~~N----~~~~~~ 297 (305) .+++.+.+.+..|+..+.. .+ ..+++++|+|.....-++.-.-+....+.. ...| -+.| + T Consensus 135 ----~~~~~t~d~i~da~~~l~~----~~----~~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~g~ig~i~G-~ 201 (272) T protein:vir:30 135 ----VEATATVDGVSKALDIFND----ED----DAETVIVMNPADASTLRLDAAKEWLGATEVGANRVVSGVYGEVLG-V 201 (272) T ss_pred ----cccccCHHHHHHHHHHHhc----cC----CCccEEEEcHHHHHHHHHhccccccccccccccccccccchhhcC-e Confidence 0011234556666655532 22 336789999986554443311122211111 0111 1334 6 Q ss_pred eEEecccC Q lcl|NC_013594. 298 QLVVADYL 305 (305) Q Consensus 298 ~~iv~p~L 305 (305) .+|+++.+ T Consensus 202 ~Vi~s~~~ 209 (272) T protein:vir:30 202 QIVRSRKC 209 (272) T ss_pred eEEEcCCC Confidence 89999999 No 11 >protein:vir:105334 Length: 276 # NCBI annotation: putative phage major capsid protein # Family: family:all:522 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950669;genbank:gi:119967839;genbank:GeneID:4643213 Probab=94.45 E-value=0.0044 Score=33.51 Aligned_cols=188 Identities=12% Similarity=0.079 Sum_probs=109.4 Q ss_pred CC---------cCHHHHHHHHHHHHHHHHHHHhhhhhhhhceeeecCC----ccchhhhhhccCCCchhhh--ccceeee Q lcl|NC_013594. 1 MI---------VTPASIKALMTSWRKDFQGGLEDAPSQYNKIAMVVNS----STRSNTYGWLGKFPTLKEW--VGKRTIQ 65 (305) Q Consensus 1 M~---------i~~~~l~~L~~~~~~~~~~~~~~a~~t~~~~a~~v~S----~~~~~~y~~Lg~~P~l~ew--~Ge~~~~ 65 (305) |. |.|+.+... +++.++.+ .-+..++...++ ....-+.......+..+++ ..+.... T Consensus 1 Ma~~~T~l~d~i~Pev~~~~-------v~~~~~~~-~~~~~~~~~~~~l~g~~G~ti~iP~~~~igda~~~~eg~~i~~~ 72 (276) T protein:vir:10 1 MAQGTTTKSTQIVPEVLAPM-------MQAELDKK-LRFAQFADIDSTLVGQPGDTLTFPAFVYSGDATVVPEGQKIPVD 72 (276) T ss_pred CCcceeehhhhhchHHHHHH-------HHHHHHhh-hhhcccceecccccCCCCCEEEeeeecCCCccccccCCCccCcc Confidence 77 445554433 23333222 123344432211 1111122111222233333 2456677 Q ss_pred ccccccceeeeecccceeeccHHHhhccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCccccCCccccccccccccc Q lcl|NC_013594. 66 QMEAHGYSIANKTFEGTVGISRDDFEDDNLGIYAPIFQEMGRSAAVQPDELIFKLLKDGFTQPCYDGQNFFDKEHPVYPN 145 (305) Q Consensus 66 ~l~~~~~~i~n~tfg~~i~i~R~~I~nDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~~~DGk~fF~adH~~~~n 145 (305) .+....-+.+.+++++.++++..+..---...+..+.+++|.+-++..+..+++.|.++... T Consensus 73 ~lt~~~~~a~i~~~~k~~~~tD~a~~~~~~dp~~~~~~~~~~~~a~~~d~~~~~~l~~~~~~------------------ 134 (276) T protein:vir:10 73 KIETNRREAKIHKIGKGTDITDEALLSGYGDPQGEAVRQHGLAIANKVDNDVLEALRGTKLT------------------ 134 (276) T ss_pred ccccceeeEEeehccccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhccccc------------------ Confidence 78777777778899999999999998866667888889999999999888888776541000 Q ss_pred ccccchhhhhHhhhccCCCccceeeecccccccccchhhcccchhhhhhhccccccchhcccceeeeecccccccchhhh Q lcl|NC_013594. 146 VDGTGSAVNTSNIVEQDSFSGLPFYLLDCSRAVKPLIFQERRKPELVARTRIDDDHVFMDNEFLFGASARRAAGYGFWQM 225 (305) Q Consensus 146 ~~~tg~~~~~s~l~~~~~~~G~~~~l~~~~~~vkP~i~Q~r~~~~f~a~t~~~~~nvf~~~~~~~g~d~r~n~G~g~wq~ 225 (305) T Consensus 135 -------------------------------------------------------------------------------- 134 (276) T protein:vir:10 135 -------------------------------------------------------------------------------- 134 (276) T ss_pred -------------------------------------------------------------------------------- Confidence Q ss_pred hhhcccccchHHHHHHHHHHHHhhcCCCceeccccCeEEecchhHHHHHHHHhhcccCCcccc-------ccccccceee Q lcl|NC_013594. 226 AVAVKGDLTLDNLWKGWQLMRAFEGDGGKKLGLKPTHIVVPVGLEKAAEQLLNRELFADGNTT-------VSNEMKGKLQ 298 (305) Q Consensus 226 A~~s~~~Lt~~~l~~ar~aM~~~k~~~G~~L~i~P~~LvVpp~le~~A~~ll~~~~~~~~~~~-------~~N~~~~~~~ 298 (305) .+..+++.+.+..|+..|.... ..++.|+|+|+....-++....+.+.....+ ...-+.| ++ T Consensus 135 --~~~~~~t~d~i~~A~~~lgd~~--------~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G-~~ 203 (276) T protein:vir:10 135 --VSADIGTLAGLEAAIDTFDDED--------LEPMVLFINPKDAGKLRSSASDNFTRATELGDNIIVKGAFGEALG-AV 203 (276) T ss_pred --ccccccCHHHHHHHHHHhcccc--------CcccEEEEcHHHHHHHHHhccccccccccccccceeccccceecc-ee Confidence 0111344555667777765432 2467899999998887775444444332211 1223344 79 Q ss_pred EEecccC Q lcl|NC_013594. 299 LVVADYL 305 (305) Q Consensus 299 ~iv~p~L 305 (305) ||+++.+ T Consensus 204 Vi~s~~~ 210 (276) T protein:vir:10 204 IVRSKKL 210 (276) T ss_pred EEEcCCC Confidence 9999999 No 12 >protein:vir:79928 Length: 393 # NCBI annotation: major head protein # Family: family:all:30335 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429616;genbank:gi:156564106;genbank:GeneID:5525693 Probab=93.64 E-value=0.00028 Score=40.05 Aligned_cols=266 Identities=14% Similarity=0.115 Sum_probs=130.5 Q ss_pred CCcCHHHHHHHHHHHHHHHH----------------------------HH-Hhhhhhh------hhceeeecCCccchhh Q lcl|NC_013594. 1 MIVTPASIKALMTSWRKDFQ----------------------------GG-LEDAPSQ------YNKIAMVVNSSTRSNT 45 (305) Q Consensus 1 M~i~~~~l~~L~~~~~~~~~----------------------------~~-~~~a~~t------~~~~a~~v~S~~~~~~ 45 (305) ..+++..++-+ ..|-|.+. .- .++|.|- +++++. -..++.. T Consensus 42 ~~~~~~e~el~-E~f~Kmm~G~~p~~eV~~~e~mtt~~a~IliP~vis~v~~Eaaepl~~~~kl~qk~~L---~~Grsm~ 117 (393) T protein:vir:79 42 LALNEEETQIL-ESFAKMMEGETPTNEVNLREFMATPSAQILIPRVIVGTMREAAEPLYIGTKMLQKIRL---KSGQSMI 117 (393) T ss_pred hhcchhHHHHH-HHHHHHhcCCCchhheehhhhhcCCCcceechhhhhhhhhhcccchhHHHHHHHHHhh---hcCccee Confidence 33333332221 01111000 00 1111111 111111 0112222 Q ss_pred hhhccCCCchhhh----ccceeeecccc---ccceeeeecccceeeccHHHhhccccchhHHHHHHHHHHHHhhHHHHHH Q lcl|NC_013594. 46 YGWLGKFPTLKEW----VGKRTIQQMEA---HGYSIANKTFEGTVGISRDDFEDDNLGIYAPIFQEMGRSAAVQPDELIF 118 (305) Q Consensus 46 y~~Lg~~P~l~ew----~Ge~~~~~l~~---~~~~i~n~tfg~~i~i~R~~I~nDdlG~~~~~~~~~G~aAa~~~~~lv~ 118 (305) +.-.| .||+. .||++-.+|+. .+-+++.+++|-.|.+|-++|.|-++.+..-+...+|++-+|+-++.+| T Consensus 118 F~~~g---~~Ra~~IgEGgE~~~~sld~~T~dsv~~~~gK~G~~Ia~SqEmIsDSg~Dvin~~l~aA~RaMaRkKee~a~ 194 (393) T protein:vir:79 118 FPSIG---IMRAYDVAEGQEIPEDSIDWQTHESPEIRVGKSGIRLRFTDEMISDSQWDLMSMMIKQAGRAMGRHKEQKAY 194 (393) T ss_pred ccchh---eeeeccccccccccccchhhhcCCceeEEechhhhhhhhHHHHhhcchHHHHHHHHHHHHHHHHhhhHHHHH Confidence 22122 45655 37888888873 4568889999999999999999999999999999999999999999999 Q ss_pred HHHhccCCccccCCccccccccccc----ccccccchhhhhHhhhcc---CCCccc------------------------ Q lcl|NC_013594. 119 KLLKDGFTQPCYDGQNFFDKEHPVY----PNVDGTGSAVNTSNIVEQ---DSFSGL------------------------ 167 (305) Q Consensus 119 ~lL~~g~~~~~~DGk~fF~adH~~~----~n~~~tg~~~~~s~l~~~---~~~~G~------------------------ 167 (305) ..+.+ ...+.|||-+==---||+| +-..|+-++...-.|..+ ....+. T Consensus 195 n~fk~-~ghtvfDa~st~t~ahptGr~~~~~qNGTlSleDllDm~~av~~~hyt~svi~MHPLAWnv~AKna~me~~~~n 273 (393) T protein:vir:79 195 HQFRS-HGHTVFDNYSTNKLAHTTGLDKNGVQNDTFSAEDFLDLIIAVMANEYTPSDLMMHPLAWTVFAKNELMGSLQAN 273 (393) T ss_pred hhhhc-ccceeeeccccCccceeecCCccccccccccHHHHHHHHHHHhcccCCcceEEEcCchhhhhhhhhhhcceeec Confidence 99987 2456688822222246665 223344444443222111 111221 Q ss_pred -----eeeecccccccccchhhcccchhhhhhhccccccchhcccce-eeeecccccccchhhhhhhcccccchHHHHHH Q lcl|NC_013594. 168 -----PFYLLDCSRAVKPLIFQERRKPELVARTRIDDDHVFMDNEFL-FGASARRAAGYGFWQMAVAVKGDLTLDNLWKG 241 (305) Q Consensus 168 -----~~~l~~~~~~vkP~i~Q~r~~~~f~a~t~~~~~nvf~~~~~~-~g~d~r~n~G~g~wq~A~~s~~~Lt~~~l~~a 241 (305) +-...++++.+.|-..|.|-++.|.-...|==+.--....|- |-|| |.|+|--+ .+..|+.+.+..- T Consensus 274 a~gN~~~~~~~ts~algp~~i~~~~~~nlnv~~sPfvp~d~k~~rFd~~~Vd-~NnvgvlL------V~D~i~tdq~ddk 346 (393) T protein:vir:79 274 PYGNYPAKGAPSSMALGPDSIQGRLPFNFNVNLSPFIPLDKKSRRFDVYAVD-RNNVGVLL------VRDDLKTDQWDEK 346 (393) T ss_pred cccccCccccchhhhhchhhhccccccceeEEEecccccccccceeeEEEee-cCCceEEE------EecCcceeccccc Confidence 122335777888888888876666544433100000122343 3455 67777554 4457777777654 Q ss_pred HHHHHHhhcCCCceeccccCeEE-ecchhHHHHHHHHhhcccCCccccccccccceeeEEecccC Q lcl|NC_013594. 242 WQLMRAFEGDGGKKLGLKPTHIV-VPVGLEKAAEQLLNRELFADGNTTVSNEMKGKLQLVVADYL 305 (305) Q Consensus 242 r~aM~~~k~~~G~~L~i~P~~Lv-Vpp~le~~A~~ll~~~~~~~~~~~~~N~~~~~~~~iv~p~L 305 (305) ..-.++.| ++-+|=+ |=. +..|-.+....+ ..+-..+|-| T Consensus 347 ~rdiq~iK--------l~ERYG~gvLn--~gkaiavakNI~--------------~~k~y~~P~~ 387 (393) T protein:vir:79 347 ARGLQNIK--------MIERYGIGILN--EGKAIAVAKNIS--------------MDKSYAEPML 387 (393) T ss_pred cccceeee--------eeeeeceeeee--CCceEEEEecce--------------eecccccchh Confidence 33333332 2222211 000 000000000000 0011111222 No 13 >protein:vir:95898 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240385;genbank:gi:66396054;genbank:GeneID:5133409 Probab=92.75 E-value=0.01 Score=31.52 Aligned_cols=192 Identities=13% Similarity=0.080 Sum_probs=103.1 Q ss_pred CCcCHHHHHHHHHHHHHHHHHHHhhhhh--hhhceeeecCCccchhhhhhccCCCchhhhccceeeeccccccceeeeec Q lcl|NC_013594. 1 MIVTPASIKALMTSWRKDFQGGLEDAPS--QYNKIAMVVNSSTRSNTYGWLGKFPTLKEWVGKRTIQQMEAHGYSIANKT 78 (305) Q Consensus 1 M~i~~~~l~~L~~~~~~~~~~~~~~a~~--t~~~~a~~v~S~~~~~~y~~Lg~~P~l~ew~Ge~~~~~l~~~~~~i~n~t 78 (305) =+|.|+.+...- ...+.+.+.-.+- ..+.+..+...+-....|..+|+.-.+.| ..+.....+....-+.+.+. T Consensus 10 d~i~Pev~~~~v---~~~~~~~l~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~-g~~i~~~~lt~~~~~~~i~~ 85 (274) T protein:vir:95 10 NQIVPEVLAPMM---QAELEKKLRFASFAEIDNTLVGQPGDTLTFPAFIYSGDAKVVAE-GEKIPTDILETKKREAKIRK 85 (274) T ss_pred heechHHHHHHH---HHHHHhhhhccccceecccccCCCCCEEEeeeecCCCccccccC-CCccchhhcccceeEEEeee Confidence 234555554442 2222222211110 01111111111112222333444333332 34566677877777888888 Q ss_pred ccceeeccHHHhhccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCccccCCcccccccccccccccccchhhhhHhh Q lcl|NC_013594. 79 FEGTVGISRDDFEDDNLGIYAPIFQEMGRSAAVQPDELIFKLLKDGFTQPCYDGQNFFDKEHPVYPNVDGTGSAVNTSNI 158 (305) Q Consensus 79 fg~~i~i~R~~I~nDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~~~DGk~fF~adH~~~~n~~~tg~~~~~s~l 158 (305) +++.+.|+..+..-.--.....+.+++|.+.++..+..++..|..+... + T Consensus 86 ~~~a~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~~~---------------------------~--- 135 (274) T protein:vir:95 86 IAKGTSISDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAKLT---------------------------V--- 135 (274) T ss_pred eecceeehHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccccc---------------------------c--- Confidence 9999999988877654556778888888888888888877777542100 0 Q ss_pred hccCCCccceeeecccccccccchhhcccchhhhhhhccccccchhcccceeeeecccccccchhhhhhhcccccchHHH Q lcl|NC_013594. 159 VEQDSFSGLPFYLLDCSRAVKPLIFQERRKPELVARTRIDDDHVFMDNEFLFGASARRAAGYGFWQMAVAVKGDLTLDNL 238 (305) Q Consensus 159 ~~~~~~~G~~~~l~~~~~~vkP~i~Q~r~~~~f~a~t~~~~~nvf~~~~~~~g~d~r~n~G~g~wq~A~~s~~~Lt~~~l 238 (305) +..+++.+.+ T Consensus 136 ----------------------------------------------------------------------~~~~~~~d~i 145 (274) T protein:vir:95 136 ----------------------------------------------------------------------EADITKLTGL 145 (274) T ss_pred ----------------------------------------------------------------------cccccCHHHH Confidence 0012445556 Q ss_pred HHHHHHHHHhhcCCCceeccccCeEEecchhHHHHHHHHhhcccCCcccc---ccc----cccceeeEEecccC Q lcl|NC_013594. 239 WKGWQLMRAFEGDGGKKLGLKPTHIVVPVGLEKAAEQLLNRELFADGNTT---VSN----EMKGKLQLVVADYL 305 (305) Q Consensus 239 ~~ar~aM~~~k~~~G~~L~i~P~~LvVpp~le~~A~~ll~~~~~~~~~~~---~~N----~~~~~~~~iv~p~L 305 (305) ..|++.+... + ..+++|+|+|.....-++...-+++.+...+ ..| -+.| +++|+++.+ T Consensus 146 ~~A~~~lgd~----~----~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G-~~Vi~s~~~ 210 (274) T protein:vir:95 146 QTAIDKFNDE----D----LEPMVLFISPLDAGKLRGDATTNFTRATELGDDVIVKGAFGEALG-AVIVRSNKL 210 (274) T ss_pred HHHHHHhccc----c----ccccEEEeCHHHHHHHHhhccccccccccccccceeccccceecC-eEEEEeCCC Confidence 6666665432 1 2578999999887765554222333332211 111 2333 789999999 No 14 >protein:vir:96262 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240311;genbank:gi:66395978;genbank:GeneID:5133339 Probab=92.75 E-value=0.01 Score=31.52 Aligned_cols=192 Identities=13% Similarity=0.080 Sum_probs=103.1 Q ss_pred CCcCHHHHHHHHHHHHHHHHHHHhhhhh--hhhceeeecCCccchhhhhhccCCCchhhhccceeeeccccccceeeeec Q lcl|NC_013594. 1 MIVTPASIKALMTSWRKDFQGGLEDAPS--QYNKIAMVVNSSTRSNTYGWLGKFPTLKEWVGKRTIQQMEAHGYSIANKT 78 (305) Q Consensus 1 M~i~~~~l~~L~~~~~~~~~~~~~~a~~--t~~~~a~~v~S~~~~~~y~~Lg~~P~l~ew~Ge~~~~~l~~~~~~i~n~t 78 (305) =+|.|+.+...- ...+.+.+.-.+- ..+.+..+...+-....|..+|+.-.+.| ..+.....+....-+.+.+. T Consensus 10 d~i~Pev~~~~v---~~~~~~~l~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~-g~~i~~~~lt~~~~~~~i~~ 85 (274) T protein:vir:96 10 NQIVPEVLAPMM---QAELEKKLRFASFAEIDNTLVGQPGDTLTFPAFIYSGDAKVVAE-GEKIPTDILETKKREAKIRK 85 (274) T ss_pred heechHHHHHHH---HHHHHhhhhccccceecccccCCCCCEEEeeeecCCCccccccC-CCccchhhcccceeEEEeee Confidence 234555554442 2222222211110 01111111111112222333444333332 34566677877777888888 Q ss_pred ccceeeccHHHhhccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCccccCCcccccccccccccccccchhhhhHhh Q lcl|NC_013594. 79 FEGTVGISRDDFEDDNLGIYAPIFQEMGRSAAVQPDELIFKLLKDGFTQPCYDGQNFFDKEHPVYPNVDGTGSAVNTSNI 158 (305) Q Consensus 79 fg~~i~i~R~~I~nDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~~~DGk~fF~adH~~~~n~~~tg~~~~~s~l 158 (305) +++.+.|+..+..-.--.....+.+++|.+.++..+..++..|..+... + T Consensus 86 ~~~a~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~~~---------------------------~--- 135 (274) T protein:vir:96 86 IAKGTSISDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAKLT---------------------------V--- 135 (274) T ss_pred eecceeehHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccccc---------------------------c--- Confidence 9999999988877654556778888888888888888877777542100 0 Q ss_pred hccCCCccceeeecccccccccchhhcccchhhhhhhccccccchhcccceeeeecccccccchhhhhhhcccccchHHH Q lcl|NC_013594. 159 VEQDSFSGLPFYLLDCSRAVKPLIFQERRKPELVARTRIDDDHVFMDNEFLFGASARRAAGYGFWQMAVAVKGDLTLDNL 238 (305) Q Consensus 159 ~~~~~~~G~~~~l~~~~~~vkP~i~Q~r~~~~f~a~t~~~~~nvf~~~~~~~g~d~r~n~G~g~wq~A~~s~~~Lt~~~l 238 (305) +..+++.+.+ T Consensus 136 ----------------------------------------------------------------------~~~~~~~d~i 145 (274) T protein:vir:96 136 ----------------------------------------------------------------------EADITKLTGL 145 (274) T ss_pred ----------------------------------------------------------------------cccccCHHHH Confidence 0012445556 Q ss_pred HHHHHHHHHhhcCCCceeccccCeEEecchhHHHHHHHHhhcccCCcccc---ccc----cccceeeEEecccC Q lcl|NC_013594. 239 WKGWQLMRAFEGDGGKKLGLKPTHIVVPVGLEKAAEQLLNRELFADGNTT---VSN----EMKGKLQLVVADYL 305 (305) Q Consensus 239 ~~ar~aM~~~k~~~G~~L~i~P~~LvVpp~le~~A~~ll~~~~~~~~~~~---~~N----~~~~~~~~iv~p~L 305 (305) ..|++.+... + ..+++|+|+|.....-++...-+++.+...+ ..| -+.| +++|+++.+ T Consensus 146 ~~A~~~lgd~----~----~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G-~~Vi~s~~~ 210 (274) T protein:vir:96 146 QTAIDKFNDE----D----LEPMVLFISPLDAGKLRGDATTNFTRATELGDDVIVKGAFGEALG-AVIVRSNKL 210 (274) T ss_pred HHHHHHhccc----c----ccccEEEeCHHHHHHHHhhccccccccccccccceeccccceecC-eEEEEeCCC Confidence 6666665432 1 2578999999887765554222333332211 111 2333 789999999 No 15 >protein:vir:3613 Length: 272 # NCBI annotation: MHP # Family: family:all:522 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112699;genbank:gi:13786567;genbank:GeneID:921035 Probab=92.31 E-value=0.012 Score=31.13 Aligned_cols=193 Identities=13% Similarity=0.139 Sum_probs=99.6 Q ss_pred CCcCHHHHHHHH--HHHHHHHHHHHhhhhhhhhceeeecC-------CccchhhhhhccCCCchhhhccceeeecccccc Q lcl|NC_013594. 1 MIVTPASIKALM--TSWRKDFQGGLEDAPSQYNKIAMVVN-------SSTRSNTYGWLGKFPTLKEWVGKRTIQQMEAHG 71 (305) Q Consensus 1 M~i~~~~l~~L~--~~~~~~~~~~~~~a~~t~~~~a~~v~-------S~~~~~~y~~Lg~~P~l~ew~Ge~~~~~l~~~~ 71 (305) |.-+.-.|.++. .-+...+++.+..+ --+..++..-+ ++-....|.-+|+.-.+.| .++.....+.... T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~-~~~~~~~~~~~~l~g~~G~ti~iP~~~~~gda~~~~e-g~~i~~~~lt~~~ 78 (272) T protein:vir:36 1 MSKQKTTLADLVNPEVLAPIVSYELNKA-LRFAPLAQVDTTLQGQPGNTLKFPAFTYIGDAADVAE-GGEISLDKIGTTT 78 (272) T ss_pred CCCcceehhhhhchHHHHHHHHHHHHhh-hhhccccccccccccCCCCEEEEeeeccCccccccCC-CCccChhhcCCcc Confidence 663222222211 01111122222111 11233332211 1111222232343322222 2456666777777 Q ss_pred ceeeeecccceeeccHHHhhccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCccccCCcccccccccccccccccch Q lcl|NC_013594. 72 YSIANKTFEGTVGISRDDFEDDNLGIYAPIFQEMGRSAAVQPDELIFKLLKDGFTQPCYDGQNFFDKEHPVYPNVDGTGS 151 (305) Q Consensus 72 ~~i~n~tfg~~i~i~R~~I~nDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~~~DGk~fF~adH~~~~n~~~tg~ 151 (305) -+.+.+.+++.+.|+-.+..----.....+.++++.+.++..+..++..|.....+ T Consensus 79 ~~~~i~~~~k~~~vtD~~~~~~~~d~~~~~~~~~a~~~a~~~d~~i~~~l~~~~~~------------------------ 134 (272) T protein:vir:36 79 KSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLANKVDDDLLSAAKTTSQT------------------------ 134 (272) T ss_pred eeEeeehhhccccccHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcccccc------------------------ Confidence 77788899999999998887655556777888888888888888777665420000 Q ss_pred hhhhHhhhccCCCccceeeecccccccccchhhcccchhhhhhhccccccchhcccceeeeecccccccchhhhhhhccc Q lcl|NC_013594. 152 AVNTSNIVEQDSFSGLPFYLLDCSRAVKPLIFQERRKPELVARTRIDDDHVFMDNEFLFGASARRAAGYGFWQMAVAVKG 231 (305) Q Consensus 152 ~~~~s~l~~~~~~~G~~~~l~~~~~~vkP~i~Q~r~~~~f~a~t~~~~~nvf~~~~~~~g~d~r~n~G~g~wq~A~~s~~ 231 (305) .+. T Consensus 135 -----------------------------------------------------------------------------~~~ 137 (272) T protein:vir:36 135 -----------------------------------------------------------------------------VST 137 (272) T ss_pred -----------------------------------------------------------------------------ccc Confidence 011 Q ss_pred ccchHHHHHHHHHHHHhhcCCCceeccccCeEEecchhHHHHHHHHhhcccCC--ccc----cccccccceeeEEecccC Q lcl|NC_013594. 232 DLTLDNLWKGWQLMRAFEGDGGKKLGLKPTHIVVPVGLEKAAEQLLNRELFAD--GNT----TVSNEMKGKLQLVVADYL 305 (305) Q Consensus 232 ~Lt~~~l~~ar~aM~~~k~~~G~~L~i~P~~LvVpp~le~~A~~ll~~~~~~~--~~~----~~~N~~~~~~~~iv~p~L 305 (305) .++.+.+..|+..|... +. .+++++|+|.....-++....+.... ++. +....+.| +++|++..+ T Consensus 138 ~~~~d~i~~A~~~lgd~----~~----~~~~ivv~p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~~~G-~~Vv~s~~~ 208 (272) T protein:vir:36 138 KANVDGVQAALDIFNDE----DA----QAYVLIVNPKDAAKIRKDANAKNIGSEVGANALINGTYADVLG-AQIVRSKKL 208 (272) T ss_pred cccHHHHHHHHHHhhhc----CC----CceEEEEcHHHHHHHhcccccccccccccccceeeeccceecC-eeEEEeCCC Confidence 23445566666666533 22 35789999986655444333222221 111 11223445 799999999 No 16 >protein:vir:8187 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817980;genbank:gi:29566414;genbank:GeneID:2700968 Probab=88.61 E-value=0.0065 Score=32.59 Aligned_cols=262 Identities=14% Similarity=0.024 Sum_probs=101.3 Q ss_pred CCcCHHH--H--HHHHHHHHHHHHHHHhhhhhhhhceeeecCCccchhhhhhccCCCchhhhccceeee---ccccccce Q lcl|NC_013594. 1 MIVTPAS--I--KALMTSWRKDFQGGLEDAPSQYNKIAMVVNSSTRSNTYGWLGKFPTLKEWVGKRTIQ---QMEAHGYS 73 (305) Q Consensus 1 M~i~~~~--l--~~L~~~~~~~~~~~~~~a~~t~~~~a~~v~S~~~~~~y~~Lg~~P~l~ew~Ge~~~~---~l~~~~~~ 73 (305) |.....- + ..+. +..+....+. ..-+++|+.++-.....+|..+..-|. -.|++|-.-. ++.=..-+ T Consensus 1 mat~~~gg~lvP~~~~---~~ii~~~~~~--s~i~~~~~~i~~~~~~~~~p~~~~~~~-a~wv~Eg~~~~~~~~~f~~v~ 74 (311) T protein:vir:81 1 MVALATGTFQLPKHLV---PGVWQKAQGQ--SVLARLSMAEPQEFGEQQYMTLTAPPR-GEVVGEGAQKSESTATFAPVT 74 (311) T ss_pred CceecCCceEcchhHH---HHHHHHHHhc--chhhhhcceeecCCCceEEEEEeCCce-eEEeecCcccccccceeeEEE Confidence 4443221 0 1111 1122222222 225667777764444455555544454 3777664433 33335567 Q ss_pred eeeecccceeeccHHHh---hccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCccccCCcc--ccccccccccc-cc Q lcl|NC_013594. 74 IANKTFEGTVGISRDDF---EDDNLGIYAPIFQEMGRSAAVQPDELIFKLLKDGFTQPCYDGQN--FFDKEHPVYPN-VD 147 (305) Q Consensus 74 i~n~tfg~~i~i~R~~I---~nDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~~~DGk~--fF~adH~~~~n-~~ 147 (305) ++.++++..+.||++.+ .+|..++..-+...++++.++.+++.++.--.+| +.....|.. ..++-+.+... .. T Consensus 75 l~~~kl~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~a~l~G~~~~-~~~~~~gi~~~~~~~~~~~~~~~~~ 153 (311) T protein:vir:81 75 AIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPL-TGAALSGSPAKILDTTNIVELTTGT 153 (311) T ss_pred EeeEEEEEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHhhhccccCC-CCcccccccccccccceeeeecccc Confidence 78899999999999988 4667889999999999999999988876432111 111112211 11221111000 00 Q ss_pred ccchhhhh------------------------HhhhccCCCccceeeecc------cccccccchhhcccchhhhhhhcc Q lcl|NC_013594. 148 GTGSAVNT------------------------SNIVEQDSFSGLPFYLLD------CSRAVKPLIFQERRKPELVARTRI 197 (305) Q Consensus 148 ~tg~~~~~------------------------s~l~~~~~~~G~~~~l~~------~~~~vkP~i~Q~r~~~~f~a~t~~ 197 (305) .......+ ..|..-++..|.+++.-. .++.=.|+++.++-+-.....++. T Consensus 154 ~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~tl~G~Pv~~~~~i~~~~~~~~~~ 233 (311) T protein:vir:81 154 SATPDLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTAS 233 (311) T ss_pred cchHHHHHHHHHHHhhhcCCCceEEEEcHHHHHHHHhhhccCCCeeecCccccCCCceecceeEEecccccccccccccc Confidence 00000001 112222344455443211 111112333322221111110000 Q ss_pred ccccch-hcccceeeeecccccccchhhh-hhhcccccchH----HHHHHHHHHHHhhcCCCceeccccCeEEecchhH Q lcl|NC_013594. 198 DDDHVF-MDNEFLFGASARRAAGYGFWQM-AVAVKGDLTLD----NLWKGWQLMRAFEGDGGKKLGLKPTHIVVPVGLE 270 (305) Q Consensus 198 ~~~nvf-~~~~~~~g~d~r~n~G~g~wq~-A~~s~~~Lt~~----~l~~ar~aM~~~k~~~G~~L~i~P~~LvVpp~le 270 (305) .....+ ..+..++-.|... ..+|.|+- .+......+.+ -+...-.+.|.....++.++.-..=..|.+.... T Consensus 234 ~~~~~~~~~~~~~~~gDfs~-~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~r~~~r~d~~v~~~~a~~~l~~a~~~ 311 (311) T protein:vir:81 234 TGVYRTTNPNVKAIAGDFSA-FRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) T ss_pred cchhcccCCccEEEEEeccc-EEEEEeccceEEEeccCCCCcchhhhhcCcEEEEEEEEeccEeecccceEEEEeeccC Confidence 000000 0111112223211 22333321 11111111111 1222223333333344444432110112221111 No 17 >protein:vir:96833 Length: 275 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240157;genbank:gi:66395822;genbank:GeneID:5133174 Probab=85.78 E-value=0.05 Score=27.70 Aligned_cols=195 Identities=10% Similarity=0.067 Sum_probs=102.5 Q ss_pred CCc-CHHHHHHHHH--HHHHHHHHHHhhhhhhhhceeeecCC-------ccchhhhhhccCCCchhhhccceeeeccccc Q lcl|NC_013594. 1 MIV-TPASIKALMT--SWRKDFQGGLEDAPSQYNKIAMVVNS-------STRSNTYGWLGKFPTLKEWVGKRTIQQMEAH 70 (305) Q Consensus 1 M~i-~~~~l~~L~~--~~~~~~~~~~~~a~~t~~~~a~~v~S-------~~~~~~y~~Lg~~P~l~ew~Ge~~~~~l~~~ 70 (305) |.. +.-.|.++.. -+....++.+..+ .-+..++..-+. +-....|..+|+.-.+.| ..+.....+... T Consensus 1 ~~~~~~T~l~d~i~PEv~~~~v~~~~~~~-~~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~-g~~i~~~~lt~~ 78 (275) T protein:vir:96 1 MALENMTKLANMVNPEVLAPMMQAELDKK-LKFAQFADIDNTLVGQPGNTITFPAFVYSGDAKVVPE-GEEIPIDLIETK 78 (275) T ss_pred CCCcccchhhhhhchHHHHHHHHHHHHHh-hhhcccceecccccCCCCCEEEeeeeccCCccccccC-CCCcchhhcccc Confidence 443 1122222211 1222223323221 122334332111 111112222333221211 255666677777 Q ss_pred cceeeeecccceeeccHHHhhccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCccccCCcccccccccccccccccc Q lcl|NC_013594. 71 GYSIANKTFEGTVGISRDDFEDDNLGIYAPIFQEMGRSAAVQPDELIFKLLKDGFTQPCYDGQNFFDKEHPVYPNVDGTG 150 (305) Q Consensus 71 ~~~i~n~tfg~~i~i~R~~I~nDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~~~DGk~fF~adH~~~~n~~~tg 150 (305) .-+.+.+.+++.+.|+..+..----.......+++|.+-++..+..++..|..+... T Consensus 79 ~~~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~ll~~l~~a~~~----------------------- 135 (275) T protein:vir:96 79 KRQATIRKIGKGTVLTDEALLSGYGDPKGEAVRQHGLAIANKVDNDVLEALQGATLK----------------------- 135 (275) T ss_pred eeeEEeehhcccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhccccc----------------------- Confidence 777888999999999999887543446777888889888888888887776531000 Q ss_pred hhhhhHhhhccCCCccceeeecccccccccchhhcccchhhhhhhccccccchhcccceeeeecccccccchhhhhhhcc Q lcl|NC_013594. 151 SAVNTSNIVEQDSFSGLPFYLLDCSRAVKPLIFQERRKPELVARTRIDDDHVFMDNEFLFGASARRAAGYGFWQMAVAVK 230 (305) Q Consensus 151 ~~~~~s~l~~~~~~~G~~~~l~~~~~~vkP~i~Q~r~~~~f~a~t~~~~~nvf~~~~~~~g~d~r~n~G~g~wq~A~~s~ 230 (305) .+. T Consensus 136 -----------------------------------------------------------------------------~~~ 138 (275) T protein:vir:96 136 -----------------------------------------------------------------------------VEA 138 (275) T ss_pred -----------------------------------------------------------------------------ccc Confidence 001 Q ss_pred cccchHHHHHHHHHHHHhhcCCCceeccccCeEEecchhHHHHHHHHhhcccCCcccc---ccccc---cceeeEEeccc Q lcl|NC_013594. 231 GDLTLDNLWKGWQLMRAFEGDGGKKLGLKPTHIVVPVGLEKAAEQLLNRELFADGNTT---VSNEM---KGKLQLVVADY 304 (305) Q Consensus 231 ~~Lt~~~l~~ar~aM~~~k~~~G~~L~i~P~~LvVpp~le~~A~~ll~~~~~~~~~~~---~~N~~---~~~~~~iv~p~ 304 (305) .+++.+.+-.|+..|.. .+ ..+++|+|+|.....-++....+.+.+...+ ..|-. ..-++||+++. T Consensus 139 ~~~~~d~i~dA~~~lgd----~~----~~~~~ivv~p~~~~~L~k~~~~~f~~~~~~g~~~~~~G~ig~~~G~~Vi~s~~ 210 (275) T protein:vir:96 139 DITKLAGLQTAIDKFND----ED----LEPMVLFVNPLDAGKLRASATDNFTRATLLGDNVIVKGAFGEALGAIIVRSNK 210 (275) T ss_pred cccCHHHHHHHHHHhcc----cc----CCccEEEeCHHHHHHHHhcccccccccccccccceeccccceecCeeEEEeCC Confidence 12345556666666642 22 2578999999987776665433344332211 11211 22379999999 Q ss_pred C Q lcl|NC_013594. 305 L 305 (305) Q Consensus 305 L 305 (305) + T Consensus 211 ~ 211 (275) T protein:vir:96 211 I 211 (275) T ss_pred C Confidence 9 No 18 >protein:vir:93742 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240459;genbank:gi:66396126;genbank:GeneID:5133511 Probab=85.46 E-value=0.053 Score=27.60 Aligned_cols=187 Identities=12% Similarity=0.108 Sum_probs=101.9 Q ss_pred CCc---------CHHHHHHHHHHHHHHHHHHHhhhhhhhhceeeecCC-------ccchhhhhhccCCCchhhhccceee Q lcl|NC_013594. 1 MIV---------TPASIKALMTSWRKDFQGGLEDAPSQYNKIAMVVNS-------STRSNTYGWLGKFPTLKEWVGKRTI 64 (305) Q Consensus 1 M~i---------~~~~l~~L~~~~~~~~~~~~~~a~~t~~~~a~~v~S-------~~~~~~y~~Lg~~P~l~ew~Ge~~~ 64 (305) |.= .|+.+...- ...+.+.+ -+..++..-.+ +-....+.-+|+.-.+.| ..+... T Consensus 1 ma~~~T~~~~~iiPev~~~~v---~~~~~~~~-----~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~~~~e-g~~i~~ 71 (274) T protein:vir:93 1 MPQGITKTSNQIIPEVLAPMM---QAQLEKKL-----RFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAE-GEKIPT 71 (274) T ss_pred CCccceehhheechHHHHHHH---HHHHHhhh-----hhcccccccccccCCCCCEEEEEeeccCCCcccccC-CCcccc Confidence 433 344433321 22222211 12333322111 111111121333222222 355667 Q ss_pred eccccccceeeeecccceeeccHHHhhccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCccccCCcccccccccccc Q lcl|NC_013594. 65 QQMEAHGYSIANKTFEGTVGISRDDFEDDNLGIYAPIFQEMGRSAAVQPDELIFKLLKDGFTQPCYDGQNFFDKEHPVYP 144 (305) Q Consensus 65 ~~l~~~~~~i~n~tfg~~i~i~R~~I~nDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~~~DGk~fF~adH~~~~ 144 (305) ..+....-+.+.++++..+.|+..+....--.......+.++++.++..+..++..|..+... T Consensus 72 ~~it~~~~~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~~d~~~~~~~~~a~~~----------------- 134 (274) T protein:vir:93 72 DILETKKREAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLT----------------- 134 (274) T ss_pred cccccceeEEEeeeecccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhccccc----------------- Confidence 777777788888999999999999988766667888889999999999998888776541000 Q ss_pred cccccchhhhhHhhhccCCCccceeeecccccccccchhhcccchhhhhhhccccccchhcccceeeeecccccccchhh Q lcl|NC_013594. 145 NVDGTGSAVNTSNIVEQDSFSGLPFYLLDCSRAVKPLIFQERRKPELVARTRIDDDHVFMDNEFLFGASARRAAGYGFWQ 224 (305) Q Consensus 145 n~~~tg~~~~~s~l~~~~~~~G~~~~l~~~~~~vkP~i~Q~r~~~~f~a~t~~~~~nvf~~~~~~~g~d~r~n~G~g~wq 224 (305) T Consensus 135 -------------------------------------------------------------------------------- 134 (274) T protein:vir:93 135 -------------------------------------------------------------------------------- 134 (274) T ss_pred -------------------------------------------------------------------------------- Confidence Q ss_pred hhhhcccccchHHHHHHHHHHHHhhcCCCceeccccCeEEecchhHHHHHHHHhhcccCCcccc---ccc----ccccee Q lcl|NC_013594. 225 MAVAVKGDLTLDNLWKGWQLMRAFEGDGGKKLGLKPTHIVVPVGLEKAAEQLLNRELFADGNTT---VSN----EMKGKL 297 (305) Q Consensus 225 ~A~~s~~~Lt~~~l~~ar~aM~~~k~~~G~~L~i~P~~LvVpp~le~~A~~ll~~~~~~~~~~~---~~N----~~~~~~ 297 (305) .+..+++.+.+-.|+..+.. . +..+++|+|+|.....-++--.-+.+.....+ ..| -+.| + T Consensus 135 ---~~~~~~~~d~i~dA~~~l~d----~----~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G-~ 202 (274) T protein:vir:93 135 ---VNADITKLNGLQSAIDKFND----E----DLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALG-A 202 (274) T ss_pred ---ccccccCHHHHHHHHHHhhh----c----cCCccEEEeCHHHHHHHHhhhhhcccccccccccceeecccceecC-e Confidence 00112334455566665543 2 23578999999977665543212333322211 112 2344 7 Q ss_pred eEEecccC Q lcl|NC_013594. 298 QLVVADYL 305 (305) Q Consensus 298 ~~iv~p~L 305 (305) +|++++.+ T Consensus 203 ~Vi~s~~~ 210 (274) T protein:vir:93 203 IIVRTNKL 210 (274) T ss_pred eEEEcCCC Confidence 89999999 No 19 >protein:vir:1239 Length: 274 # NCBI annotation: similar to phage B1 major head protein # Family: family:all:522 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510938;genbank:gi:17426272;genbank:GeneID:927376 Probab=80.79 E-value=0.092 Score=26.27 Aligned_cols=187 Identities=12% Similarity=0.086 Sum_probs=99.1 Q ss_pred CCcCHHHHHHHHHHHHHHHHHHHhhhhhhhhceeeec------C-CccchhhhhhccCCCchhhhccceeeeccccccce Q lcl|NC_013594. 1 MIVTPASIKALMTSWRKDFQGGLEDAPSQYNKIAMVV------N-SSTRSNTYGWLGKFPTLKEWVGKRTIQQMEAHGYS 73 (305) Q Consensus 1 M~i~~~~l~~L~~~~~~~~~~~~~~a~~t~~~~a~~v------~-S~~~~~~y~~Lg~~P~l~ew~Ge~~~~~l~~~~~~ 73 (305) =+|.|+.+...- ...+.+.+ -+..++..- + .+-....|..+|+.-.+.| ..+.....+....-+ T Consensus 10 d~iiPev~~~~v---~~~~~~~l-----~~~~~~~~d~~l~g~~G~tv~iP~~~~ig~a~~~~~-g~~i~~~~lt~~~~~ 80 (274) T protein:vir:12 10 NQIIPEVLAPMM---QAQLEKKL-----RFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAE-GEKIPTDILETKKRE 80 (274) T ss_pred hhhchHHHHHHH---HHHHHhhh-----hhcccceecccccCCCCCEEEEeeecCCCccccccC-CCccchhhcccceee Confidence 224455554432 22222221 122233221 1 1111122232343322222 245666777777778 Q ss_pred eeeecccceeeccHHHhhccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCccccCCcccccccccccccccccchhh Q lcl|NC_013594. 74 IANKTFEGTVGISRDDFEDDNLGIYAPIFQEMGRSAAVQPDELIFKLLKDGFTQPCYDGQNFFDKEHPVYPNVDGTGSAV 153 (305) Q Consensus 74 i~n~tfg~~i~i~R~~I~nDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~~~DGk~fF~adH~~~~n~~~tg~~~ 153 (305) .+.+.+++.++|+..+..----.......++++.+-++..+..+...|..+. .. T Consensus 81 ~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~q~~~~~a~~vd~~~l~~~~~a~--------------~~------------ 134 (274) T protein:vir:12 81 AKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAK--------------LT------------ 134 (274) T ss_pred EEeeeecceeeecHHHHHhcccchHHHHHHHHHHHHHHHHHHHHHHHHhccc--------------cc------------ Confidence 8888899999999887665433456677788888877777777666654310 00 Q ss_pred hhHhhhccCCCccceeeecccccccccchhhcccchhhhhhhccccccchhcccceeeeecccccccchhhhhhhccccc Q lcl|NC_013594. 154 NTSNIVEQDSFSGLPFYLLDCSRAVKPLIFQERRKPELVARTRIDDDHVFMDNEFLFGASARRAAGYGFWQMAVAVKGDL 233 (305) Q Consensus 154 ~~s~l~~~~~~~G~~~~l~~~~~~vkP~i~Q~r~~~~f~a~t~~~~~nvf~~~~~~~g~d~r~n~G~g~wq~A~~s~~~L 233 (305) .+..++ T Consensus 135 --------------------------------------------------------------------------~~~~a~ 140 (274) T protein:vir:12 135 --------------------------------------------------------------------------VNADIT 140 (274) T ss_pred --------------------------------------------------------------------------cccccc Confidence 001124 Q ss_pred chHHHHHHHHHHHHhhcCCCceeccccCeEEecchhHHHHHHHHhhcccCCccccc-------cccccceeeEEecccC Q lcl|NC_013594. 234 TLDNLWKGWQLMRAFEGDGGKKLGLKPTHIVVPVGLEKAAEQLLNRELFADGNTTV-------SNEMKGKLQLVVADYL 305 (305) Q Consensus 234 t~~~l~~ar~aM~~~k~~~G~~L~i~P~~LvVpp~le~~A~~ll~~~~~~~~~~~~-------~N~~~~~~~~iv~p~L 305 (305) +.+.+-.|++.|.... ..+++|+|+|.....-++...-+++.+...+. .--+.| +++|+++.+ T Consensus 141 ~~d~i~dA~~~lgd~~--------~~~~~ivv~p~~~~~L~k~~~~~fv~~s~~g~~~~~~G~ig~~~G-~~Vi~s~~~ 210 (274) T protein:vir:12 141 KLNGLQSAIDKFNDED--------LEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALG-AIIVRSNKL 210 (274) T ss_pred CHHHHHHHHHHhcccc--------ccccEEEeCHHHHHHHHhhhhhhccccccccccceecccceeecC-eeEEEeCCC Confidence 4556666666653321 25789999999877666543223444332211 112444 799999999 No 20 >protein:vir:96123 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240078;genbank:gi:66395742;genbank:GeneID:5133103 Probab=77.59 E-value=0.12 Score=25.57 Aligned_cols=187 Identities=13% Similarity=0.143 Sum_probs=98.4 Q ss_pred CC---------cCHHHHHHHHHHHHHHHHHHHhhhhhhhhceeeecC-------CccchhhhhhccCCCchhhhccceee Q lcl|NC_013594. 1 MI---------VTPASIKALMTSWRKDFQGGLEDAPSQYNKIAMVVN-------SSTRSNTYGWLGKFPTLKEWVGKRTI 64 (305) Q Consensus 1 M~---------i~~~~l~~L~~~~~~~~~~~~~~a~~t~~~~a~~v~-------S~~~~~~y~~Lg~~P~l~ew~Ge~~~ 64 (305) |. |.|+.+..+. ...+.+.. -+..++..-+ .+-....|..+|+.-.+.| ..+... T Consensus 1 ma~~~T~~~d~i~Pev~s~~v---~~~~~~~~-----~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~~~~~-g~~i~~ 71 (274) T protein:vir:96 1 MAQGTTKVSNLIVPEVLAPMM---QAELDKKL-----RFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAE-GEKIPV 71 (274) T ss_pred CCccccchhhhhhhHHHHHHH---HHHHHhhh-----hhcccccccccccCCCCCEEEEEeeccCCCccccCC-CCcCch Confidence 54 2333333321 22222221 1223332211 0111111222333322222 245666 Q ss_pred eccccccceeeeecccceeeccHHHhhccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCccccCCcccccccccccc Q lcl|NC_013594. 65 QQMEAHGYSIANKTFEGTVGISRDDFEDDNLGIYAPIFQEMGRSAAVQPDELIFKLLKDGFTQPCYDGQNFFDKEHPVYP 144 (305) Q Consensus 65 ~~l~~~~~~i~n~tfg~~i~i~R~~I~nDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~~~DGk~fF~adH~~~~ 144 (305) ..+....-+.+.+.++..+.|+-.+..----.....+.+++|.+.++..+..++..|..+ + . T Consensus 72 ~~it~~~~~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~~d~~i~~~l~~a--~------------~---- 133 (274) T protein:vir:96 72 DQIGTSKREAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGA--T------------L---- 133 (274) T ss_pred hhcccceeEEEEEeeeceeeecHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHHHHhcC--C------------C---- Confidence 677777777788889999999988876655567788888889888888888887766431 0 0 Q ss_pred cccccchhhhhHhhhccCCCccceeeecccccccccchhhcccchhhhhhhccccccchhcccceeeeecccccccchhh Q lcl|NC_013594. 145 NVDGTGSAVNTSNIVEQDSFSGLPFYLLDCSRAVKPLIFQERRKPELVARTRIDDDHVFMDNEFLFGASARRAAGYGFWQ 224 (305) Q Consensus 145 n~~~tg~~~~~s~l~~~~~~~G~~~~l~~~~~~vkP~i~Q~r~~~~f~a~t~~~~~nvf~~~~~~~g~d~r~n~G~g~wq 224 (305) . T Consensus 134 --~----------------------------------------------------------------------------- 134 (274) T protein:vir:96 134 --T----------------------------------------------------------------------------- 134 (274) T ss_pred --C----------------------------------------------------------------------------- Confidence 0 Q ss_pred hhhhcccccchHHHHHHHHHHHHhhcCCCceeccccCeEEecchhHHHHHHHHhhcccCCcccc---ccc----ccccee Q lcl|NC_013594. 225 MAVAVKGDLTLDNLWKGWQLMRAFEGDGGKKLGLKPTHIVVPVGLEKAAEQLLNRELFADGNTT---VSN----EMKGKL 297 (305) Q Consensus 225 ~A~~s~~~Lt~~~l~~ar~aM~~~k~~~G~~L~i~P~~LvVpp~le~~A~~ll~~~~~~~~~~~---~~N----~~~~~~ 297 (305) .+..+++.+.+-.|++.+... +..+++|+|+|.....-++.-.-+.+.....+ ..| -+.| + T Consensus 135 ---~~~~~~~~d~i~dA~~~l~d~--------~~~~~~ivv~p~~~~~L~k~~~~~f~~~~~~g~~~~~~g~ig~~~G-~ 202 (274) T protein:vir:96 135 ---VEADITKLDGLQTAIDKFNDE--------DLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALG-A 202 (274) T ss_pred ---cCcccccHHHHHHHHHHhccc--------CCCceEEEeCHHHHHHHHhcccccccccccccccceeecccceecC-e Confidence 001123334455555555322 12578999999976665554222333322211 111 1233 6 Q ss_pred eEEecccC Q lcl|NC_013594. 298 QLVVADYL 305 (305) Q Consensus 298 ~~iv~p~L 305 (305) ++|+++.| T Consensus 203 ~Vi~s~~~ 210 (274) T protein:vir:96 203 VIVRSNKL 210 (274) T ss_pred eEEEcCCC Confidence 89999999 No 21 >protein:vir:9759 Length: 303 # NCBI annotation: putative structural protein # Family: family:all:966 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795521;genbank:gi:28876283;genbank:GeneID:1257824 Probab=77.35 E-value=0.055 Score=27.47 Aligned_cols=256 Identities=10% Similarity=-0.022 Sum_probs=98.3 Q ss_pred CCcCHHHHHHHHHHHHHHHHHHHhhhhhhhhceeeecCCccchhhhhhccCCCchhhhccceeee---ccccccceeeee Q lcl|NC_013594. 1 MIVTPASIKALMTSWRKDFQGGLEDAPSQYNKIAMVVNSSTRSNTYGWLGKFPTLKEWVGKRTIQ---QMEAHGYSIANK 77 (305) Q Consensus 1 M~i~~~~l~~L~~~~~~~~~~~~~~a~~t~~~~a~~v~S~~~~~~y~~Lg~~P~l~ew~Ge~~~~---~l~~~~~~i~n~ 77 (305) |..+...=--+=..+...+.+.... .+...++|++++-.....+|.....-|. -.|++|-.-. ++.=..-+++-+ T Consensus 1 m~t~t~gg~liP~~~~~~ii~~l~~-~s~i~~l~~~~~~~~~~~~ip~~~~~~~-a~wv~E~~~~~~s~~~f~~v~l~~~ 78 (303) T protein:vir:97 1 MGTETSKASLFDKHLVSDLINKVKG-HSSLAKLSSQKPIPFNGSKEFTFTLDSD-IDVVAENGKKTHGGLSLEPVTIVPI 78 (303) T ss_pred CcccCCCCeEcchhHHHHHHHHHHh-hchhhhhcceeecCCCceEEEEEecCcc-eEEeecCccccccccceeeEEeeeE Confidence 6643221000001111112221211 2336677777664444445555544443 4787664333 333345677889 Q ss_pred cccceeeccHHHh---hccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCccccCCcccccccccccccccccchhh- Q lcl|NC_013594. 78 TFEGTVGISRDDF---EDDNLGIYAPIFQEMGRSAAVQPDELIFKLLKDGFTQPCYDGQNFFDKEHPVYPNVDGTGSAV- 153 (305) Q Consensus 78 tfg~~i~i~R~~I---~nDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~~~DGk~fF~adH~~~~n~~~tg~~~- 153 (305) +.+..+.|||+.+ .+|..++..-+...++++.++..+..++.=..++ ...--.+++....+.....-...+.... T Consensus 79 kl~~~~~iS~ell~~~~d~~~~l~~~i~~~la~a~~~~ld~a~l~G~~~~-~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (303) T protein:vir:97 79 KVEYGARLSDEFLYATEEEKIDILKAFNEGFAKKLARGIDLMAMHGINPR-TKKASDVIGTNHFDSKVTQVVKFTESEDA 157 (303) T ss_pred EEEEeehhhHHHhhcCccchHHHHHHHHHHHHHHHHHHHHhhhhcccccC-Cccccccccccccccccccccccccccch Confidence 9999999999988 5777889999999999999999888665322110 1111111221111110000000000000 Q ss_pred --hh------------------------HhhhccCCCccceeeecccc-------cccccchhhcccchhhhhhhccccc Q lcl|NC_013594. 154 --NT------------------------SNIVEQDSFSGLPFYLLDCS-------RAVKPLIFQERRKPELVARTRIDDD 200 (305) Q Consensus 154 --~~------------------------s~l~~~~~~~G~~~~l~~~~-------~~vkP~i~Q~r~~~~f~a~t~~~~~ 200 (305) .+ ..|..-++..|.+++.-+.. +.=.|+++-..-+. ......+.+. T Consensus 158 ~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~g~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~-~~~~~~~~~~ 236 (303) T protein:vir:97 158 DANIEAAVNLIQGAEGVVTGLAMDTEFSTALAKVTNGEMGPKMYPELAWGANPDSINGLKSSVNTTVGA-GADEAESKDL 236 (303) T ss_pred HHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCCeEEecCccCCCCCceecceeeEEecccCC-ccccCCCccE Confidence 11 11222234444444322211 11123322111100 0000001111 Q ss_pred cch--hcccceeeeecccccccchhhhhhhcccccchHHHHHHHHHHHHhhc---CCCceeccccCeE--Eecchh Q lcl|NC_013594. 201 HVF--MDNEFLFGASARRAAGYGFWQMAVAVKGDLTLDNLWKGWQLMRAFEG---DGGKKLGLKPTHI--VVPVGL 269 (305) Q Consensus 201 nvf--~~~~~~~g~d~r~n~G~g~wq~A~~s~~~Lt~~~l~~ar~aM~~~k~---~~G~~L~i~P~~L--vVpp~l 269 (305) -+| +.+.|.+|.. .+..+-. .-++.. +........+.|..++. .++.+++ |+-+ |++..- T Consensus 237 ~~~Gdf~~~~~~~~~--~~~~~~~--~~~~~~---d~~~~~~~~~n~~~~r~~~r~~~~v~~--p~af~~l~~~~~ 303 (303) T protein:vir:97 237 VIIGDFESMFKWGYA--KQIPMEI--IKYGDP---DNSGKDLKGYNQIYLRAEAYIGWGILD--AKSFARVTKGEV 303 (303) T ss_pred EEEeeccccEEEEEe--cCcEEEE--eeccCC---CCcchhhhhcCcEEEEEEEEeccEeec--ccceEEeeCCCC Confidence 111 1122333321 1111000 000000 00011122333333332 2333332 3322 222222 No 22 >protein:vir:2344 Length: 397 # NCBI annotation: gp14 # Family: family:all:507 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075281;genbank:gi:12657868;genbank:GeneID:920118 Probab=75.80 E-value=0.14 Score=25.22 Aligned_cols=224 Identities=10% Similarity=0.077 Sum_probs=102.4 Q ss_pred CCcCHHHHHHHHHH------------HHHHHHHHHhhhhhhhhceeeecCCccchhhhhhccCCCchhhhccceee---e Q lcl|NC_013594. 1 MIVTPASIKALMTS------------WRKDFQGGLEDAPSQYNKIAMVVNSSTRSNTYGWLGKFPTLKEWVGKRTI---Q 65 (305) Q Consensus 1 M~i~~~~l~~L~~~------------~~~~~~~~~~~a~~t~~~~a~~v~S~~~~~~y~~Lg~~P~l~ew~Ge~~~---~ 65 (305) |=.|+++-..+.++ .+..+....+. ..-.++++.++-.....+|.....-|. -.|+||-.- . T Consensus 1 ~g~~~e~~~~~~~~t~~~~g~l~~~~~~~ii~~l~~~--s~i~~l~~~~~~~~~~~~ip~~~~~~~-a~wv~Eg~~~~~s 77 (397) T protein:vir:23 1 MGFSADHSQIAQTKDTMFTGYLDPVQAKDYFAEAEKT--SIVQRVAQKIPMGATGIVIPHWTGDVS-AQWIGEGDMKPIT 77 (397) T ss_pred CCcCHHHHHHhhccCCCCccccchhHHHHHHHHHHhc--cchhhhcceeeccCCceEEEEEcCCcc-eEEecCCcccccc Confidence 66666665444332 23333333322 224556666663333334444444444 367755332 3 Q ss_pred ccccccceeeeecccceeeccHHHhhccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCccccCCccccccccccccc Q lcl|NC_013594. 66 QMEAHGYSIANKTFEGTVGISRDDFEDDNLGIYAPIFQEMGRSAAVQPDELIFKLLKDGFTQPCYDGQNFFDKEHPVYPN 145 (305) Q Consensus 66 ~l~~~~~~i~n~tfg~~i~i~R~~I~nDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~~~DGk~fF~adH~~~~n 145 (305) ++.=..-++..++++..+.||++.+.+...++..-+.+.++++.++.+++.++. |.+. ++++.+-... T Consensus 78 ~~~f~~v~l~~~k~~~~v~iS~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~----G~gt----~~~~~~~~~~---- 145 (397) T protein:vir:23 78 KGNMTKRDVHPAKIATIFVASAETVRANPANYLGTMRTKVATAIAMAFDNAALH----GTNA----PSAFQGYLDQ---- 145 (397) T ss_pred ccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhh----cccC----Cccccccccc---- Confidence 333344677889999999999999999999999999999999999999986652 2221 2222222111 Q ss_pred ccccchhhhhHhhhccCCCccceeeecccccccccchhhcccchhhhhhhccccccchhcccceeeeecccccccchhhh Q lcl|NC_013594. 146 VDGTGSAVNTSNIVEQDSFSGLPFYLLDCSRAVKPLIFQERRKPELVARTRIDDDHVFMDNEFLFGASARRAAGYGFWQM 225 (305) Q Consensus 146 ~~~tg~~~~~s~l~~~~~~~G~~~~l~~~~~~vkP~i~Q~r~~~~f~a~t~~~~~nvf~~~~~~~g~d~r~n~G~g~wq~ 225 (305) ++.... .........+++....+.+ +.+.+++ | T Consensus 146 ---~~~~~~-------~~~~~~~~~~~~~~~~l~~--------------------------------~~~~~a~---~-- 178 (397) T protein:vir:23 146 ---SNKTQS-------ISPNAYQGLGVSGLTKLVT--------------------------------DGKKWTH---T-- 178 (397) T ss_pred ---ccceee-------ecccchhHHHHHHHHhhhh--------------------------------cccCCCE---E-- Confidence 000000 0000000000000000000 0111111 0 Q ss_pred hhhcccccchHHHHHHHHHHHHhhcCCCceecccc-----------CeEE-ecchhHHHHHHHHhhcccCCcccc----- Q lcl|NC_013594. 226 AVAVKGDLTLDNLWKGWQLMRAFEGDGGKKLGLKP-----------THIV-VPVGLEKAAEQLLNRELFADGNTT----- 288 (305) Q Consensus 226 A~~s~~~Lt~~~l~~ar~aM~~~k~~~G~~L~i~P-----------~~Lv-Vpp~le~~A~~ll~~~~~~~~~~~----- 288 (305) -++. ..+.++++.||.+|++|-... ..|+ +| +.-++.++.+.+. T Consensus 179 ------vmn~----~~~~~L~~lkd~~G~~i~~~~~~~~~~~~~~~~tl~G~P---------v~~s~~~~~g~~~~~~gD 239 (397) T protein:vir:23 179 ------LLDD----TVEPVLNGSVDANGRPLFVESTYESLTTPFREGRILGRP---------TILSDHVAEGDVVGYAGD 239 (397) T ss_pred ------EEcH----HHHHHHHHhhccCCceeecccccccccccccCceeeeee---------EEEeCCCCCCceEEEEee Confidence 1121 234677788999998863211 1111 11 1112223332210 Q ss_pred ccccccc-----eeeEEecccC Q lcl|NC_013594. 289 VSNEMKG-----KLQLVVADYL 305 (305) Q Consensus 289 ~~N~~~~-----~~~~iv~p~L 305 (305) -.+-+.+ .+++.-+..+ T Consensus 240 fs~~~i~~~~~i~i~~~~e~~~ 261 (397) T protein:vir:23 240 FSQIIWGQVGGLSFDVTDQATL 261 (397) T ss_pred cceEEEEEEeceEEEEeeeeee Confidence 0111111 1111112222 No 23 >protein:vir:103955 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873992;genbank:gi:118430767;genbank:GeneID:4525449 Probab=75.58 E-value=0.15 Score=25.18 Aligned_cols=225 Identities=9% Similarity=0.071 Sum_probs=92.4 Q ss_pred CCcCHHHHHHHHHHHHHHHHHHHhhhhhhhhceeeecCCccchhhhhhccCCCchhhhccceee---eccccccceeeee Q lcl|NC_013594. 1 MIVTPASIKALMTSWRKDFQGGLEDAPSQYNKIAMVVNSSTRSNTYGWLGKFPTLKEWVGKRTI---QQMEAHGYSIANK 77 (305) Q Consensus 1 M~i~~~~l~~L~~~~~~~~~~~~~~a~~t~~~~a~~v~S~~~~~~y~~Lg~~P~l~ew~Ge~~~---~~l~~~~~~i~n~ 77 (305) |..+.+.. -|-..+...+.+...+.. ...+.|+.++..+..-+|..+..-|. -+|+||..- .++.=..-++..+ T Consensus 30 ~~~~~~~~-liP~~~~~~ii~~~~~~s-~l~~~~~~~~~~~~~~~~p~~~~~~~-a~~v~Eg~~~~~~~~~~~~v~~~~~ 106 (324) T protein:vir:10 30 MMHEKKDG-TLLNDFTTPILQEVMENS-KIMQLGKYEPMEGTEKKFTFWADKPG-AYWVGEGQKIETSKATWVNATMRAF 106 (324) T ss_pred eccCCCcc-eechhHHHHHHHHHHhhc-hhhhhcceeeccCCceEEEEEeCCcc-eeEeccCccccccccceeEEEEeeE Confidence 11111100 011112222222222222 24455666664444445554443344 467765433 3333355677889 Q ss_pred cccceeeccHHHhhccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCccccCCcccccccccccccccccchhhhhHh Q lcl|NC_013594. 78 TFEGTVGISRDDFEDDNLGIYAPIFQEMGRSAAVQPDELIFKLLKDGFTQPCYDGQNFFDKEHPVYPNVDGTGSAVNTSN 157 (305) Q Consensus 78 tfg~~i~i~R~~I~nDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~~~DGk~fF~adH~~~~n~~~tg~~~~~s~ 157 (305) +++..+.|||+.+.|....+..-+.+.++++.++.+++.++.- .|.+ ..+..++.+-... ...+....+... T Consensus 107 k~~~~~~iS~ell~ds~~~l~~~i~~~l~~ai~~~~d~a~l~G--~g~~---~~~~~i~~~~~~~---~~~~~~~~t~~~ 178 (324) T protein:vir:10 107 KLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILN--QGNN---PFGKSIAQSIEKT---NKVIKGDFTQDN 178 (324) T ss_pred EEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhhc--CCCC---ccCcccccccccc---ceeccccCCHHH Confidence 9999999999999998899999999999999999988866421 1111 1112222211100 000000000000 Q ss_pred hhccCCCccceeeecccccccccchhhcccchhhhhhhccccccchhcccceeeeecccccccchhhhhhhcccccchHH Q lcl|NC_013594. 158 IVEQDSFSGLPFYLLDCSRAVKPLIFQERRKPELVARTRIDDDHVFMDNEFLFGASARRAAGYGFWQMAVAVKGDLTLDN 237 (305) Q Consensus 158 l~~~~~~~G~~~~l~~~~~~vkP~i~Q~r~~~~f~a~t~~~~~nvf~~~~~~~g~d~r~n~G~g~wq~A~~s~~~Lt~~~ 237 (305) |++..-.|++ ..+.+.+ | -++... T Consensus 179 -------------i~~~~~~l~~--------------------------------~~~~~~~---~--------v~n~~~ 202 (324) T protein:vir:10 179 -------------IIDLEALLED--------------------------------DELEANA---F--------ISKTQN 202 (324) T ss_pred -------------HHHHHHhhhh--------------------------------ccCCCCE---E--------EEcHHH Confidence 0000000000 0011111 1 122233 Q ss_pred HHHHHHHHHHhhcCCCceec--cccCeEE-ecchhHHHHHHHHhhcccCCcccc-----cccccc----c-eeeEEeccc Q lcl|NC_013594. 238 LWKGWQLMRAFEGDGGKKLG--LKPTHIV-VPVGLEKAAEQLLNRELFADGNTT-----VSNEMK----G-KLQLVVADY 304 (305) Q Consensus 238 l~~ar~aM~~~k~~~G~~L~--i~P~~Lv-Vpp~le~~A~~ll~~~~~~~~~~~-----~~N~~~----~-~~~~iv~p~ 304 (305) + ..+++.|+..|+++- -.|..|+ +|. +-+...+.+... -.+.+. + .+++.-++. T Consensus 203 ~----~~L~~l~d~~g~~~~~~~~~~~l~G~PV---------~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~ 269 (324) T protein:vir:10 203 R----SLLRKIVDPETKERIYDRNSDTLDGLPV---------VNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQ 269 (324) T ss_pred H----HHHHHhhccCCceeecCCCCccccceeE---------EeecCCCCCcceEEEEecccEEEEEecCcEEEEeeccc Confidence 2 345678899998762 1222211 221 000111111100 011111 1 122222222 Q ss_pred C Q lcl|NC_013594. 305 L 305 (305) Q Consensus 305 L 305 (305) + T Consensus 270 ~ 270 (324) T protein:vir:10 270 L 270 (324) T ss_pred c Confidence 2 No 24 >protein:vir:94771 Length: 298 # NCBI annotation: major head protein # Family: family:all:966 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996706;genbank:gi:45597421;genbank:GeneID:2769044 Probab=74.90 E-value=0.1 Score=25.98 Aligned_cols=249 Identities=10% Similarity=0.011 Sum_probs=95.6 Q ss_pred CCcCHHHH--HHHHHHHHHHHHHHHhhhhhhhhceeeecCCccchhhhhhccCCCchhhhccceeeec---cccccceee Q lcl|NC_013594. 1 MIVTPASI--KALMTSWRKDFQGGLEDAPSQYNKIAMVVNSSTRSNTYGWLGKFPTLKEWVGKRTIQQ---MEAHGYSIA 75 (305) Q Consensus 1 M~i~~~~l--~~L~~~~~~~~~~~~~~a~~t~~~~a~~v~S~~~~~~y~~Lg~~P~l~ew~Ge~~~~~---l~~~~~~i~ 75 (305) |.++...+ .-+ -..+.+..... +...++|+.++-.....+|..+..-|. -.|++|-.-.. +.=..-+++ T Consensus 1 ma~~gG~lip~~~----~~~ii~~~~~~-s~i~~~~~~~~~~~~~~~~p~~~~~~~-a~~v~Eg~~~~~~~~~f~~v~l~ 74 (298) T protein:vir:94 1 MVLNKGTLFDPEL----VTDLISKVAGK-SSIARLSAQKPIPFNGEKVFTFTMDSE-IDVVAESGKKTHGGVTLAPQTMV 74 (298) T ss_pred CeeccccccChhH----HHHHHHHHHhh-chhhhhcceeeccCCceEEEEEecCcc-eEEeeCCccccccccceeEEEEe Confidence 88876543 222 22222222222 225666766654434445655554454 36776654332 222445667 Q ss_pred eecccceeeccHHHhh---ccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCccccCCcccccccccccccccccc-- Q lcl|NC_013594. 76 NKTFEGTVGISRDDFE---DDNLGIYAPIFQEMGRSAAVQPDELIFKLLKDGFTQPCYDGQNFFDKEHPVYPNVDGTG-- 150 (305) Q Consensus 76 n~tfg~~i~i~R~~I~---nDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~~~DGk~fF~adH~~~~n~~~tg-- 150 (305) -++++..+.|||+.+. +|..++..-+...++++.++.++..++.-...+ +..-..+...-+..+... +...++ T Consensus 75 ~~k~~~~~~iS~ell~~~~~~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~-~g~~~~~~~~~~~~~~~~-~~~~~~~~ 152 (298) T protein:vir:94 75 PIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPR-LGTASAVIGTNHFDSKVT-QKVEAPRG 152 (298) T ss_pred eeEEEEeeehhHHHhccCCccHHHHHHHHHHHHHHHHHHHHHHHhhcccccC-CCcccccccccccccccc-cccccccc Confidence 8899999999999984 566788899999999999999988776421111 000001111111111110 000000 Q ss_pred hh---hhhH------------------------hhhccCCCccceeeecc------cccccccchhhcccchhhhhhhcc Q lcl|NC_013594. 151 SA---VNTS------------------------NIVEQDSFSGLPFYLLD------CSRAVKPLIFQERRKPELVARTRI 197 (305) Q Consensus 151 ~~---~~~s------------------------~l~~~~~~~G~~~~l~~------~~~~vkP~i~Q~r~~~~f~a~t~~ 197 (305) .. ..+. .|..-++..|.+++--+ ..+.=.|+++...-+ ..... T Consensus 153 ~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~tl~G~PV~~~~~v~----~~~~~ 228 (298) T protein:vir:94 153 IADPNGAIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVS----DMSLT 228 (298) T ss_pred cccHHHHHHHHHHhhhhcCCCccEEEEcHHHHHHHHHhhccCCCeeecCcccCCCCceecceeeEEecccc----cccCC Confidence 00 0011 11112344454443211 011112332221100 00000 Q ss_pred ccccchhcccceeeeecccccccchhhhh---hhcccccchHHHHHHHHHHHHhh---cCCCceeccccCeEEecch Q lcl|NC_013594. 198 DDDHVFMDNEFLFGASARRAAGYGFWQMA---VAVKGDLTLDNLWKGWQLMRAFE---GDGGKKLGLKPTHIVVPVG 268 (305) Q Consensus 198 ~~~nvf~~~~~~~g~d~r~n~G~g~wq~A---~~s~~~Lt~~~l~~ar~aM~~~k---~~~G~~L~i~P~~LvVpp~ 268 (305) +.+.+|. .|.....-||.|+.. +-.....+-.......+-|..++ -.++.++.-..=..|.+.. T Consensus 229 ~~~~~~~-------Gdfs~~~~~~~~~~~~~~~~~~~~~d~~~~~~f~~~~v~~r~~~r~~~~~~~~~a~~~l~~~t 298 (298) T protein:vir:94 229 QRDRAII-------GDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) T ss_pred CccEEEE-------eeccceEEEEEecCceEEEeecCCCcCcchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 1111221 111111111211111 00000000001111122222221 1233333211111233333 No 25 >protein:vir:97433 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240749;genbank:gi:66396420;genbank:GeneID:5133789 Probab=73.89 E-value=0.16 Score=24.88 Aligned_cols=185 Identities=12% Similarity=0.087 Sum_probs=100.5 Q ss_pred CCcCHHHHHHHHHHHHHHHHHHHhhhhhhhhceeeecCC----cc---chhhhhhccCCCchhhh--ccceeeecccccc Q lcl|NC_013594. 1 MIVTPASIKALMTSWRKDFQGGLEDAPSQYNKIAMVVNS----ST---RSNTYGWLGKFPTLKEW--VGKRTIQQMEAHG 71 (305) Q Consensus 1 M~i~~~~l~~L~~~~~~~~~~~~~~a~~t~~~~a~~v~S----~~---~~~~y~~Lg~~P~l~ew--~Ge~~~~~l~~~~ 71 (305) =+|.|+.+..+- ...+.+.+ -+..++..-.. .. ....|..+|+ ..++ ..+.....+.... T Consensus 10 d~iiPev~~~~v---~~~~~~~l-----~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~---a~~~~~g~~i~~~~lt~~~ 78 (274) T protein:vir:97 10 DQIIPEVLAPMM---QAQLEKKL-----RFASFAEVDSTLQGQPGDTLTFPAFVYSGD---AQVVAEGEKIPTDILETKK 78 (274) T ss_pred heechHHHHHHH---HHhhhhhh-----hhcccceecccccCCCCCEEEEeeecCCCc---cccccCCCcccccccccce Confidence 234555554442 22222221 13334332110 01 1111222333 2333 2456667777777 Q ss_pred ceeeeecccceeeccHHHhhccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCccccCCcccccccccccccccccch Q lcl|NC_013594. 72 YSIANKTFEGTVGISRDDFEDDNLGIYAPIFQEMGRSAAVQPDELIFKLLKDGFTQPCYDGQNFFDKEHPVYPNVDGTGS 151 (305) Q Consensus 72 ~~i~n~tfg~~i~i~R~~I~nDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~~~DGk~fF~adH~~~~n~~~tg~ 151 (305) -+.+.+.++..++|+-.+..----.....+.+++|++-++..+..++..|..+... T Consensus 79 ~~~~i~~~~~~~~i~D~~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~~~------------------------ 134 (274) T protein:vir:97 79 REAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLT------------------------ 134 (274) T ss_pred eEEEeeeecceecccHHHHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccCcc------------------------ Confidence 78888889999999998877754446778888888888888888888776541000 Q ss_pred hhhhHhhhccCCCccceeeecccccccccchhhcccchhhhhhhccccccchhcccceeeeecccccccchhhhhhhccc Q lcl|NC_013594. 152 AVNTSNIVEQDSFSGLPFYLLDCSRAVKPLIFQERRKPELVARTRIDDDHVFMDNEFLFGASARRAAGYGFWQMAVAVKG 231 (305) Q Consensus 152 ~~~~s~l~~~~~~~G~~~~l~~~~~~vkP~i~Q~r~~~~f~a~t~~~~~nvf~~~~~~~g~d~r~n~G~g~wq~A~~s~~ 231 (305) + +.. T Consensus 135 ---~-------------------------------------------------------------------------~~~ 138 (274) T protein:vir:97 135 ---V-------------------------------------------------------------------------NAD 138 (274) T ss_pred ---c-------------------------------------------------------------------------ccc Confidence 0 001 Q ss_pred ccchHHHHHHHHHHHHhhcCCCceeccccCeEEecchhHHHHHHHHhhcccCCcccc---ccc----cccceeeEEeccc Q lcl|NC_013594. 232 DLTLDNLWKGWQLMRAFEGDGGKKLGLKPTHIVVPVGLEKAAEQLLNRELFADGNTT---VSN----EMKGKLQLVVADY 304 (305) Q Consensus 232 ~Lt~~~l~~ar~aM~~~k~~~G~~L~i~P~~LvVpp~le~~A~~ll~~~~~~~~~~~---~~N----~~~~~~~~iv~p~ 304 (305) +++.+.+-.|+..+... + ..+++|+|+|.....-++-..-+.+..+..+ ..| -+.| ++|++++. T Consensus 139 ~~~~d~i~dA~~~l~d~----~----~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G-~~Vi~s~~ 209 (274) T protein:vir:97 139 ITKLNGLQSAIDKFNDE----D----LEPMVLFVNPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALG-AIIVRTNK 209 (274) T ss_pred ccCHHHHHHHHHHhhcc----C----CCceEEEeCHHHHHHHHhhhhhhccccCcccccceeccccceecC-eeEEEcCC Confidence 22344455566555432 1 2568999999987766553222333322211 112 2344 69999999 Q ss_pred C Q lcl|NC_013594. 305 L 305 (305) Q Consensus 305 L 305 (305) + T Consensus 210 ~ 210 (274) T protein:vir:97 210 L 210 (274) T ss_pred C Confidence 9 No 26 >protein:vir:94494 Length: 274 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240676;genbank:gi:66396348;genbank:GeneID:5133758 Probab=73.89 E-value=0.16 Score=24.88 Aligned_cols=185 Identities=12% Similarity=0.087 Sum_probs=100.5 Q ss_pred CCcCHHHHHHHHHHHHHHHHHHHhhhhhhhhceeeecCC----cc---chhhhhhccCCCchhhh--ccceeeecccccc Q lcl|NC_013594. 1 MIVTPASIKALMTSWRKDFQGGLEDAPSQYNKIAMVVNS----ST---RSNTYGWLGKFPTLKEW--VGKRTIQQMEAHG 71 (305) Q Consensus 1 M~i~~~~l~~L~~~~~~~~~~~~~~a~~t~~~~a~~v~S----~~---~~~~y~~Lg~~P~l~ew--~Ge~~~~~l~~~~ 71 (305) =+|.|+.+..+- ...+.+.+ -+..++..-.. .. ....|..+|+ ..++ ..+.....+.... T Consensus 10 d~iiPev~~~~v---~~~~~~~l-----~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~---a~~~~~g~~i~~~~lt~~~ 78 (274) T protein:vir:94 10 DQIIPEVLAPMM---QAQLEKKL-----RFASFAEVDSTLQGQPGDTLTFPAFVYSGD---AQVVAEGEKIPTDILETKK 78 (274) T ss_pred heechHHHHHHH---HHhhhhhh-----hhcccceecccccCCCCCEEEEeeecCCCc---cccccCCCcccccccccce Confidence 234555554442 22222221 13334332110 01 1111222333 2333 2456667777777 Q ss_pred ceeeeecccceeeccHHHhhccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCccccCCcccccccccccccccccch Q lcl|NC_013594. 72 YSIANKTFEGTVGISRDDFEDDNLGIYAPIFQEMGRSAAVQPDELIFKLLKDGFTQPCYDGQNFFDKEHPVYPNVDGTGS 151 (305) Q Consensus 72 ~~i~n~tfg~~i~i~R~~I~nDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~~~DGk~fF~adH~~~~n~~~tg~ 151 (305) -+.+.+.++..++|+-.+..----.....+.+++|++-++..+..++..|..+... T Consensus 79 ~~~~i~~~~~~~~i~D~~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~~~------------------------ 134 (274) T protein:vir:94 79 REAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLT------------------------ 134 (274) T ss_pred eEEEeeeecceecccHHHHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccCcc------------------------ Confidence 78888889999999998877754446778888888888888888888776541000 Q ss_pred hhhhHhhhccCCCccceeeecccccccccchhhcccchhhhhhhccccccchhcccceeeeecccccccchhhhhhhccc Q lcl|NC_013594. 152 AVNTSNIVEQDSFSGLPFYLLDCSRAVKPLIFQERRKPELVARTRIDDDHVFMDNEFLFGASARRAAGYGFWQMAVAVKG 231 (305) Q Consensus 152 ~~~~s~l~~~~~~~G~~~~l~~~~~~vkP~i~Q~r~~~~f~a~t~~~~~nvf~~~~~~~g~d~r~n~G~g~wq~A~~s~~ 231 (305) + +.. T Consensus 135 ---~-------------------------------------------------------------------------~~~ 138 (274) T protein:vir:94 135 ---V-------------------------------------------------------------------------NAD 138 (274) T ss_pred ---c-------------------------------------------------------------------------ccc Confidence 0 001 Q ss_pred ccchHHHHHHHHHHHHhhcCCCceeccccCeEEecchhHHHHHHHHhhcccCCcccc---ccc----cccceeeEEeccc Q lcl|NC_013594. 232 DLTLDNLWKGWQLMRAFEGDGGKKLGLKPTHIVVPVGLEKAAEQLLNRELFADGNTT---VSN----EMKGKLQLVVADY 304 (305) Q Consensus 232 ~Lt~~~l~~ar~aM~~~k~~~G~~L~i~P~~LvVpp~le~~A~~ll~~~~~~~~~~~---~~N----~~~~~~~~iv~p~ 304 (305) +++.+.+-.|+..+... + ..+++|+|+|.....-++-..-+.+..+..+ ..| -+.| ++|++++. T Consensus 139 ~~~~d~i~dA~~~l~d~----~----~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G-~~Vi~s~~ 209 (274) T protein:vir:94 139 ITKLNGLQSAIDKFNDE----D----LEPMVLFVNPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALG-AIIVRTNK 209 (274) T ss_pred ccCHHHHHHHHHHhhcc----C----CCceEEEeCHHHHHHHHhhhhhhccccCcccccceeccccceecC-eeEEEcCC Confidence 22344455566555432 1 2568999999987766553222333322211 112 2344 69999999 Q ss_pred C Q lcl|NC_013594. 305 L 305 (305) Q Consensus 305 L 305 (305) + T Consensus 210 ~ 210 (274) T protein:vir:94 210 L 210 (274) T ss_pred C Confidence 9 No 27 >protein:vir:96223 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239571;genbank:gi:66395304;genbank:GeneID:5132771 Probab=71.60 E-value=0.19 Score=24.49 Aligned_cols=225 Identities=9% Similarity=0.044 Sum_probs=92.2 Q ss_pred CCcCHHHHHHH-------------------------HHHH-HHHHHHHHhhhhhhhhceeeecCCccchhhhhhccCCCc Q lcl|NC_013594. 1 MIVTPASIKAL-------------------------MTSW-RKDFQGGLEDAPSQYNKIAMVVNSSTRSNTYGWLGKFPT 54 (305) Q Consensus 1 M~i~~~~l~~L-------------------------~~~~-~~~~~~~~~~a~~t~~~~a~~v~S~~~~~~y~~Lg~~P~ 54 (305) |.-....++.. -..+ ++.+....+..+ ..+.++.++-.+...+|..+..-|. T Consensus 4 ~~~~~~~~~~f~~~~~~~~~~~a~~~~~~~~~~~lip~~~~~~ii~~~~~~s~--l~~l~~~~~~~~~~~~~p~~~~~~~ 81 (324) T protein:vir:96 4 TQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSK--IMQLGKYEPMEGTEKKFTFWADKPG 81 (324) T ss_pred chhhhHHHHHHHHhhhhhhhcccccccccCCCcceechhHHHHHHHHHHhhch--hhhhcceeeccCCceEEEEEecCcc Confidence 11111111100 0001 111111111111 3445666664444445555544454 Q ss_pred hhhhccceee---eccccccceeeeecccceeeccHHHhhccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCccccC Q lcl|NC_013594. 55 LKEWVGKRTI---QQMEAHGYSIANKTFEGTVGISRDDFEDDNLGIYAPIFQEMGRSAAVQPDELIFKLLKDGFTQPCYD 131 (305) Q Consensus 55 l~ew~Ge~~~---~~l~~~~~~i~n~tfg~~i~i~R~~I~nDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~~~D 131 (305) . .|+||-.- .++.=..-++..++++..+.|||+.+.+.+..+..-+.+.++++.++..++.++. + ...-.. T Consensus 82 a-~~v~Eg~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~l~~aia~~~d~~~l~---G--~g~~~~ 155 (324) T protein:vir:96 82 A-YWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGIL---N--QGNNPF 155 (324) T ss_pred e-eeecCCccccccccceeEEEEEeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhh---c--CCCCCc Confidence 3 67765433 3344355677889999999999999999889999999999999999999886552 2 100011 Q ss_pred CcccccccccccccccccchhhhhHhhhccCCCccceeeecccccccccchhhcccchhhhhhhccccccchhcccceee Q lcl|NC_013594. 132 GQNFFDKEHPVYPNVDGTGSAVNTSNIVEQDSFSGLPFYLLDCSRAVKPLIFQERRKPELVARTRIDDDHVFMDNEFLFG 211 (305) Q Consensus 132 Gk~fF~adH~~~~n~~~tg~~~~~s~l~~~~~~~G~~~~l~~~~~~vkP~i~Q~r~~~~f~a~t~~~~~nvf~~~~~~~g 211 (305) +..++++-+. ... ...+... .+. ++ .+.....+ . T Consensus 156 ~~~~~~~~~~----------~~~--------~~~~~~~--~~~--i~-----------~~~~~i~~---~---------- 189 (324) T protein:vir:96 156 GKSIAQSIKK----------TNK--------VIKGDFT--QDN--II-----------DLEALLED---D---------- 189 (324) T ss_pred Cccccccccc----------cce--------ecccccc--hHH--HH-----------HHHHhhhh---c---------- Confidence 1111111110 000 0000000 000 00 00000000 0 Q ss_pred eecccccccchhhhhhhcccccchHHHHHHHHHHHHhhcCCCceecc--ccCe-EEecchhHHHHHHHHhhcccCCccc- Q lcl|NC_013594. 212 ASARRAAGYGFWQMAVAVKGDLTLDNLWKGWQLMRAFEGDGGKKLGL--KPTH-IVVPVGLEKAAEQLLNRELFADGNT- 287 (305) Q Consensus 212 ~d~r~n~G~g~wq~A~~s~~~Lt~~~l~~ar~aM~~~k~~~G~~L~i--~P~~-LvVpp~le~~A~~ll~~~~~~~~~~- 287 (305) .+.+.+ | -++. ..+.++++.|+.+|+++-. .|.. +=+|. .-+...+.+.. T Consensus 190 --~~~~~~---~--------i~n~----~~~~~L~~lkd~~G~~~~~~~~~~~l~G~PV---------~~~~~~~~~~~~ 243 (324) T protein:vir:96 190 --ELEANA---F--------ISKT----QNRSLLRKIVDPETKERIYDRNSDSLDGLPV---------VNLKSSNLKRGE 243 (324) T ss_pred --cCCCCE---E--------EEcH----HHHHHHHHhhCCCCCeeecCCCCCcccceee---------EeecCCCCCcce Confidence 001111 1 1122 2235677789999998632 2222 21221 00111111110 Q ss_pred ---cc-ccccc---ceeeEEec--ccC Q lcl|NC_013594. 288 ---TV-SNEMK---GKLQLVVA--DYL 305 (305) Q Consensus 288 ---~~-~N~~~---~~~~~iv~--p~L 305 (305) ++ .+-+. +-+++-++ ..+ T Consensus 244 ~~~gd~s~~~~~~~~~~~i~~~~~~~~ 270 (324) T protein:vir:96 244 LITGDFDKLIYGIPQLIEYKIDETAQL 270 (324) T ss_pred EEEEecceEEEEEecCcEEEEeecccc Confidence 00 11111 11222222 212 No 28 >protein:vir:99920 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655524;genbank:gi:109392294;genbank:GeneID:4157089 Probab=70.06 E-value=0.19 Score=24.48 Aligned_cols=258 Identities=11% Similarity=-0.055 Sum_probs=99.2 Q ss_pred CCcCHHHHHH-HHHHHHHHHHHHHhhhhhhhhceeeecCCccchhhhhhccCCCchhhhccceee---eccccccceeee Q lcl|NC_013594. 1 MIVTPASIKA-LMTSWRKDFQGGLEDAPSQYNKIAMVVNSSTRSNTYGWLGKFPTLKEWVGKRTI---QQMEAHGYSIAN 76 (305) Q Consensus 1 M~i~~~~l~~-L~~~~~~~~~~~~~~a~~t~~~~a~~v~S~~~~~~y~~Lg~~P~l~ew~Ge~~~---~~l~~~~~~i~n 76 (305) |.-+...-.. +-.-+.+.+.+...+. ....++|++++......+|..+..-|. -.|+||-.- .+..=..-++.- T Consensus 1 Mat~tt~~g~~vP~~~~~~ii~~~~~~-s~l~~~~~~i~~~~~~~~~p~~~~~~~-a~wv~Eg~~~~~~~~~f~~v~l~~ 78 (311) T protein:vir:99 1 MATFGTGNLKNLPRNIADGMVKDVVQG-STVAVLSARKPQRFGNEDIITFNGRPK-AEFVGEGQQKSSTTGEFDFVTSTP 78 (311) T ss_pred CceecCCCceeccHHHHHHHHHHHHhh-chhhhhcceeeccCCceEEEEEeCCce-eEEeecCcccccccceeeEEEEee Confidence 6643322000 0011112222222222 225677777775555556766655555 367765433 333334567778 Q ss_pred ecccceeeccHHHhh---ccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCccccCCccccccccc-------c---- Q lcl|NC_013594. 77 KTFEGTVGISRDDFE---DDNLGIYAPIFQEMGRSAAVQPDELIFKLLKDGFTQPCYDGQNFFDKEHP-------V---- 142 (305) Q Consensus 77 ~tfg~~i~i~R~~I~---nDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~~~DGk~fF~adH~-------~---- 142 (305) ++++..+.||++.+. |+...+..-+...|+++.++.+++.++.-...+ .|..+-...+. + T Consensus 79 ~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~g~~------~g~~~~g~~~~~~~~~~~~~~~~ 152 (311) T protein:vir:99 79 KKAQVTMRFNEEVQWADEDYQLGVLQTLSEAGAEALARALDLGLYHRINPL------TGTVIPGWSNYLGAASKRVELTA 152 (311) T ss_pred EEEEEeehhhHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHhhcccCcc------cCccccccccccccccceeeccc Confidence 999999999999984 566889999999999999999998777432211 11111111110 0 Q ss_pred --------------------cccccccc---hhhhhHhhhccCCCccceeeeccc------ccccccchhhcccchhhhh Q lcl|NC_013594. 143 --------------------YPNVDGTG---SAVNTSNIVEQDSFSGLPFYLLDC------SRAVKPLIFQERRKPELVA 193 (305) Q Consensus 143 --------------------~~n~~~tg---~~~~~s~l~~~~~~~G~~~~l~~~------~~~vkP~i~Q~r~~~~f~a 193 (305) +.+....+ +......|..-++..|.+++.-+. ...=.|++.-...+.... T Consensus 153 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~Pv~~s~~i~~~~~- 231 (311) T protein:vir:99 153 DTIANPDLAIEAAVGLLVANGHPTPVNGLALHPSIAWGLSTARYTDGRKKFPELGLGIGVSSFEGIDASVSDTVNGGDE- 231 (311) T ss_pred cccchhHHHHHHHHHHHhhhccCCCccEEEEcHHHHHHHHhhhccCCCeeecCcccCCCCceecceeeEeecccccccc- Confidence 00000000 000001122224455555542110 111123322111100000 Q ss_pred hhccccccc-hhcccceeeeecccccccchhhhhh-hcccccchH-HHHHHHHHHHHhh---cCCCceeccccCeEEecc Q lcl|NC_013594. 194 RTRIDDDHV-FMDNEFLFGASARRAAGYGFWQMAV-AVKGDLTLD-NLWKGWQLMRAFE---GDGGKKLGLKPTHIVVPV 267 (305) Q Consensus 194 ~t~~~~~nv-f~~~~~~~g~d~r~n~G~g~wq~A~-~s~~~Lt~~-~l~~ar~aM~~~k---~~~G~~L~i~P~~LvVpp 267 (305) +..+...+ +.+....|..|.....-|+..+-.. .....-+.+ .......-|..++ -.++..+ .|.++++=. T Consensus 232 -~~~~~~~~~~~~~~~~~~Gdf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~r~~~r~d~~v~--~~~~v~~~~ 308 (311) T protein:vir:99 232 -ADPDDEDLDAARAVRGIVGDFANGIHWGVQRDIPVELIKYGDPDGQGDLKRHNQIALRLEIVYGWYVF--TDRFVVIEN 308 (311) T ss_pred -cccccchhhccCcceEEEeeccccEEEEEecCceEEEeecCCCCcchhhhhcCcEEEEEEEeecceec--ChhHeeeec Confidence 00111111 1111122222221111111111100 000000000 0111222222222 2333332 244433211 Q ss_pred hhHHHH Q lcl|NC_013594. 268 GLEKAA 273 (305) Q Consensus 268 ~le~~A 273 (305) .+| T Consensus 309 ---~~A 311 (311) T protein:vir:99 309 ---AVA 311 (311) T ss_pred ---ccC Confidence 111 No 29 >protein:vir:1638 Length: 298 # NCBI annotation: Structural protein # Family: family:all:966 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695059;genbank:gi:23455750;genbank:GeneID:955469 Probab=69.55 E-value=0.14 Score=25.19 Aligned_cols=253 Identities=12% Similarity=0.023 Sum_probs=97.5 Q ss_pred CCcCHHHH--HHHHHHHHHHHHHHHhhhhhhhhceeeecCCccchhhhhhccCCCchhhhccceeeec---cccccceee Q lcl|NC_013594. 1 MIVTPASI--KALMTSWRKDFQGGLEDAPSQYNKIAMVVNSSTRSNTYGWLGKFPTLKEWVGKRTIQQ---MEAHGYSIA 75 (305) Q Consensus 1 M~i~~~~l--~~L~~~~~~~~~~~~~~a~~t~~~~a~~v~S~~~~~~y~~Lg~~P~l~ew~Ge~~~~~---l~~~~~~i~ 75 (305) |..+...+ ..+. +..+ +..... ...+++|+.++-.....++.+...-|. -.|+||-.-.. +.=..-++. T Consensus 1 ma~~gG~lvp~~~~---~~ii-~~~~~~-s~i~~l~~~~~~~~~~~~ip~~~~~~~-a~~v~E~~~~~~~~~~f~~v~l~ 74 (298) T protein:vir:16 1 MVLNKGTLFDPTLV---TDLI-SKVAGK-SSIARLSAQKPIPFNGEKVFTFTMDSE-IDVVAESGKKTHGGVTLAPQTMV 74 (298) T ss_pred CcccCcceechhHH---HHHH-HHHHhh-hhhhhhcceeeccCCceEEEEEecCcc-eEEecCCccccccccceeEEEEe Confidence 87765443 2221 2222 222222 336777777765444455655555555 47887754333 332445778 Q ss_pred eecccceeeccHHHh---hccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCccccCCccccccccccc--ccccccc Q lcl|NC_013594. 76 NKTFEGTVGISRDDF---EDDNLGIYAPIFQEMGRSAAVQPDELIFKLLKDGFTQPCYDGQNFFDKEHPVY--PNVDGTG 150 (305) Q Consensus 76 n~tfg~~i~i~R~~I---~nDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~~~DGk~fF~adH~~~--~n~~~tg 150 (305) .++++..+.||++.+ .++..++..-+...++++.++.+++.++.=. ++.-.-...+....+... .+...++ T Consensus 75 ~~k~a~~~~iS~ell~~s~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~----~~~~g~~~~~~~~~~~~~~~~~~~~~~ 150 (298) T protein:vir:16 75 PIKVEYGARISDEFMYASDEEKINILQEFNDGFAKKVARGIDLMAFHGV----NPRLGTASAVIGTNHFDSKVTQKVEAP 150 (298) T ss_pred eeeEEEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHHhhccc----cCCCCcccccccccccccccccccccc Confidence 899999999999999 4666889999999999999999988777421 111000111222211100 0000000 Q ss_pred h--h---hhhHh------------------------hhccCCCccceeeecccccccccchhhcccchhhhhhhcccccc Q lcl|NC_013594. 151 S--A---VNTSN------------------------IVEQDSFSGLPFYLLDCSRAVKPLIFQERRKPELVARTRIDDDH 201 (305) Q Consensus 151 ~--~---~~~s~------------------------l~~~~~~~G~~~~l~~~~~~vkP~i~Q~r~~~~f~a~t~~~~~n 201 (305) . . ..+.. |..-++..|.+++.-+. ..-.|-.+.-+ |..+...-|+.. T Consensus 151 ~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~i~~~~~-~~~~~~~l~G~--PV~~~~~v~~~~- 226 (298) T protein:vir:16 151 RGIADPNGAIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQDNALFPELK-WGATPDTINGL--PVDVNKTVSDMS- 226 (298) T ss_pred cccccHHHHHHHHHHHhhhcCCCccEEEEcHHHHHHHHHhhccCCCeeecCcc-cCCCCceecce--eeEEeccccccc- Confidence 0 0 00111 11123344444332110 00001000011 111111111100 Q ss_pred chhcccceeeeecccccccchhhhh-hhcccccchHH--HHHHHHHH---HHhhcCCCceeccccCeEEecch Q lcl|NC_013594. 202 VFMDNEFLFGASARRAAGYGFWQMA-VAVKGDLTLDN--LWKGWQLM---RAFEGDGGKKLGLKPTHIVVPVG 268 (305) Q Consensus 202 vf~~~~~~~g~d~r~n~G~g~wq~A-~~s~~~Lt~~~--l~~ar~aM---~~~k~~~G~~L~i~P~~LvVpp~ 268 (305) -..+..++..|.+...-|+.++-. +.....-+.++ .+...+-| |.....++..+.-..=-.|.+.. T Consensus 227 -~~~~~~~~~GDfs~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~ra~~r~d~~v~~~~a~~~l~~at 298 (298) T protein:vir:16 227 -LTQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) T ss_pred -CCCccEEEEeeccceEEEEEecCceEEEeeccCCcCcchhhhhcCcEEEEEEEEEccEeecccceEEEeecC Confidence 011112232333322223322210 00000000000 00011111 11111222222111111222322 No 30 >protein:vir:485 Length: 407 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543092;swissprot:trembl:q8w627;genbank:gi:18249904;uniprot:Q8W627;genbank:GeneID:929693 Probab=69.41 E-value=0.22 Score=24.15 Aligned_cols=244 Identities=13% Similarity=0.053 Sum_probs=91.4 Q ss_pred CCcCHHHHHHHHH------------HHHHHHHHHHhhhhhhhhceeeecCCccchhhhhhccCCCchhhhccceeeec-c Q lcl|NC_013594. 1 MIVTPASIKALMT------------SWRKDFQGGLEDAPSQYNKIAMVVNSSTRSNTYGWLGKFPTLKEWVGKRTIQQ-M 67 (305) Q Consensus 1 M~i~~~~l~~L~~------------~~~~~~~~~~~~a~~t~~~~a~~v~S~~~~~~y~~Lg~~P~l~ew~Ge~~~~~-l 67 (305) ..++..-.+++.+ .+...+.+...... ...++|+.++-......|.....-+. -.|++|..-.. . T Consensus 96 ~~~~~~e~~a~~~~t~~~gG~~iP~~~~~~I~~~~~~~~-~l~~~~~~~~~~~~~~~~~~~~~~~~-a~~v~E~~~~~~~ 173 (407) T protein:vir:48 96 DGLRELERKALQVGNDEDGGYAIPEELDRTILTLLKDEV-VMRQEATVITLGGSDYKKLVNLGGTT-SGWVGETDARPET 173 (407) T ss_pred hhhhHHHHHhhhcccCCCCcccccHhHHHHHHHHHHhhh-hhhhhceeeecCCCceEEEEecCCcc-eeeeccccccccc Confidence 0000100111111 01111222222221 12445666553333333322222233 46776644321 2 Q ss_pred c---cccceeeeecccceeeccHHHhhccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCccccCCcc--cccccccc Q lcl|NC_013594. 68 E---AHGYSIANKTFEGTVGISRDDFEDDNLGIYAPIFQEMGRSAAVQPDELIFKLLKDGFTQPCYDGQN--FFDKEHPV 142 (305) Q Consensus 68 ~---~~~~~i~n~tfg~~i~i~R~~I~nDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~~~DGk~--fF~adH~~ 142 (305) + -..-++..++++..+.||++.+.|-...+..-+.+.|+++.++.++.. +|. |..+ |+| ++. |+. T Consensus 174 ~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~~~a---~l~-G~G~----~~p~Gil~--~~~ 243 (407) T protein:vir:48 174 ATSKLGLIEPFMGEIYGNPQATQKMLDDAFFNVEDWINSELALEFAEQEEIA---FTS-GDGS----KKPKGFLA--YES 243 (407) T ss_pred ccccceeEEeeeeeeEeehhhHHHHHhcchHHHHHHHHHHHHHHHHHHHHhh---hhc-cCCC----Cccceeee--ccc Confidence 2 233577889999999999999998888999999999999999988874 333 2111 222 110 000 Q ss_pred cccccccchhhhhHhhhccCCCccceeeecccccccccchhhcccchhhhhhhccccccchhcccceeeeecccccccch Q lcl|NC_013594. 143 YPNVDGTGSAVNTSNIVEQDSFSGLPFYLLDCSRAVKPLIFQERRKPELVARTRIDDDHVFMDNEFLFGASARRAAGYGF 222 (305) Q Consensus 143 ~~n~~~tg~~~~~s~l~~~~~~~G~~~~l~~~~~~vkP~i~Q~r~~~~f~a~t~~~~~nvf~~~~~~~g~d~r~n~G~g~ 222 (305) ......+........+.......-..-.|++....|+| ..|.|+. T Consensus 244 ~~~~~~~~~~~~~~~~~~~~~~~~~~d~i~~l~~~l~~--------------------------------~~~~~a~--- 288 (407) T protein:vir:48 244 TDEDDKTRAFGKLQHIASGAASGVTADAIIKLIYTLRK--------------------------------AHRSGAK--- 288 (407) T ss_pred ccccccccccccccccccccccccChHHHHHHHHhhch--------------------------------hhhcCCE--- Confidence 00000000000000000000000000001111111111 0112211 Q ss_pred hhhhhhcccccchHHHHHHHHHHHHhhcCCCceeccccCeEEecchhHHHHHHHHhhcccCCccccccccc--------- Q lcl|NC_013594. 223 WQMAVAVKGDLTLDNLWKGWQLMRAFEGDGGKKLGLKPTHIVVPVGLEKAAEQLLNRELFADGNTTVSNEM--------- 293 (305) Q Consensus 223 wq~A~~s~~~Lt~~~l~~ar~aM~~~k~~~G~~L~i~P~~LvVpp~le~~A~~ll~~~~~~~~~~~~~N~~--------- 293 (305) | -++... +..+++.||.+|+|| +.|..-- +...-...+-++-++.+|...++..-.+ T Consensus 289 ~--------v~n~~~----~~~L~~lkD~~Gr~l-~~~~~~~-g~~~~l~G~PV~~~~~~p~~~~~~~~i~~Gd~~~~~~ 354 (407) T protein:vir:48 289 F--------MMNNSS----LFAIRLLKDNDGNYL-WRPGIEL-GQPSSLAGYGIVENEQMPDIAADAKAIAFGNFKRGYT 354 (407) T ss_pred E--------EEcHHH----HHHHHHhhccCCcee-eccCcCC-CCCceecceeeEEecCcCCccCCccEEEEEeccccEE Confidence 1 122222 356677888888886 2222100 0000000111222222222111100011 Q ss_pred ---cceeeEEecccC Q lcl|NC_013594. 294 ---KGKLQLVVADYL 305 (305) Q Consensus 294 ---~~~~~~iv~p~L 305 (305) +..+++..+|+- T Consensus 355 i~~~~~~~i~~d~~~ 369 (407) T protein:vir:48 355 IVDRIGTRILRDPYT 369 (407) T ss_pred EEEeeceEEEeeccc Confidence 122344444443 No 31 >protein:vir:104256 Length: 458 # NCBI annotation: major head protein precursor # Family: family:all:27070 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006977;genbank:gi:46401878;genbank:GeneID:2777673 Probab=66.07 E-value=0.076 Score=26.71 Aligned_cols=239 Identities=10% Similarity=0.005 Sum_probs=92.7 Q ss_pred CC-----cCHHHH---HH-------------HHHHHHHHHHHHHhhhhhhhhceeeecCCccchhhhhhccCCCchhhhc Q lcl|NC_013594. 1 MI-----VTPASI---KA-------------LMTSWRKDFQGGLEDAPSQYNKIAMVVNSSTRSNTYGWLGKFPTLKEWV 59 (305) Q Consensus 1 M~-----i~~~~l---~~-------------L~~~~~~~~~~~~~~a~~t~~~~a~~v~S~~~~~~y~~Lg~~P~l~ew~ 59 (305) |. -....+ .+ +-..+.+.+.......- ...++|..+|-......|.....-|. -.|+ T Consensus 143 ~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~-~l~~~~~~~~~~~~~~~~~~~~~~~~-a~~v 220 (458) T protein:vir:10 143 VMEKGVFETEHGQRHLKAVNQSSSVEVSSESYETIFSQRIIRDLQKEL-VVGALFEELPMSSKILTMLVEPDAGK-ATWV 220 (458) T ss_pred HHhhccchhhhhhhhhhhhhhcccCccccceehhhHhHHHHHHHHhhh-hHHhhcceeecCCcceEEEEecCCcc-eeec Confidence 00 000000 00 00011112222222111 13445666664444444444444344 3666 Q ss_pred cceeeecc---------ccccceeeeecccceeeccHHHhhccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCcccc Q lcl|NC_013594. 60 GKRTIQQM---------EAHGYSIANKTFEGTVGISRDDFEDDNLGIYAPIFQEMGRSAAVQPDELIFKLLKDGFTQPCY 130 (305) Q Consensus 60 Ge~~~~~l---------~~~~~~i~n~tfg~~i~i~R~~I~nDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~~~ 130 (305) +|.....- .=..-++..++++..+.||++.+.|-+.++.+-+...|+++.++..+..+ |.+ ... T Consensus 221 ~e~~~~~~~~~~~~~~~~~~~i~~~~~k~~~~v~is~ell~ds~~~~~~~i~~~l~~~i~~~~d~~~---l~G-~G~--- 293 (458) T protein:vir:10 221 AASTYGTDTTTGEEVKGALKEIHFSTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAF---MTG-DGS--- 293 (458) T ss_pred ccccccccccccccccccceeeEeeeeeEEeeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHh---hcC-CCC--- Confidence 55543321 11223677789999999999988777789999999999999999988855 332 111 Q ss_pred CCcccccccccccccccccchhhhhHhhhccCCCcccee---eecccccccccchhhcccchhhhhhhccccccchhccc Q lcl|NC_013594. 131 DGQNFFDKEHPVYPNVDGTGSAVNTSNIVEQDSFSGLPF---YLLDCSRAVKPLIFQERRKPELVARTRIDDDHVFMDNE 207 (305) Q Consensus 131 DGk~fF~adH~~~~n~~~tg~~~~~s~l~~~~~~~G~~~---~l~~~~~~vkP~i~Q~r~~~~f~a~t~~~~~nvf~~~~ 207 (305) |+|.=--.|+ +..+...+... .+.....+ .|++....++ T Consensus 294 -~~p~Gi~~~~------~~~~~~~~~~~---~~~~~~~~~~~~i~~~~~~l~---------------------------- 335 (458) T protein:vir:10 294 -GKPKGLLTLA------SEDSAKVVTEA---KADGSVLVTAKTISKLRRKLG---------------------------- 335 (458) T ss_pred -Cccceeeecc------cccccceeecc---cccccccccHHHHHHHHHhhh---------------------------- Confidence 2221000010 00000000000 00000000 0000000000 Q ss_pred ceeeeecccccccchhhhhhhcccccchHHHHHHHHHHHHhhcCCCceecccc--CeEEecchhHHHHHHHHhhcccCCc Q lcl|NC_013594. 208 FLFGASARRAAGYGFWQMAVAVKGDLTLDNLWKGWQLMRAFEGDGGKKLGLKP--THIVVPVGLEKAAEQLLNRELFADG 285 (305) Q Consensus 208 ~~~g~d~r~n~G~g~wq~A~~s~~~Lt~~~l~~ar~aM~~~k~~~G~~L~i~P--~~LvVpp~le~~A~~ll~~~~~~~~ 285 (305) ...+.+++ | -++... +.++++.|+.+|+||.... .....++..-...+.++-++.+|.+ T Consensus 336 ----~~~~~~~~---~--------v~~~~~----~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~l~G~pv~~~~~~p~~ 396 (458) T protein:vir:10 336 ----RHGLKLSK---L--------VLIVSM----DAYYDLLEDEEWQDVAQVGNDSVKLQGQVGRIYGLPVVVSEYFPAK 396 (458) T ss_pred ----hhhcCCCE---E--------EEcHHH----HHHHHhhcccCCceeeccccccccccCcCceecceeeEEccccccc Confidence 01122222 1 112222 3456778888888764211 0010000000112222333334432 Q ss_pred cccccccc-----------cceeeEEecccC Q lcl|NC_013594. 286 NTTVSNEM-----------KGKLQLVVADYL 305 (305) Q Consensus 286 ~~~~~N~~-----------~~~~~~iv~p~L 305 (305) +......+ +.-+++..+|+- T Consensus 397 ~~~~~~~~~~f~~~~~~~~~~~~~v~~d~~~ 427 (458) T protein:vir:10 397 ANSAEFAVIVYKDNFVMPRQRAVTVERERQA 427 (458) T ss_pred cCCcceEEEEecccEEEEEeeceEEEeeccc Confidence 21111111 112344444443 No 32 >protein:vir:80930 Length: 278 # NCBI annotation: Cps # Family: family:all:522 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468392;genbank:gi:157324966;genbank:GeneID:5601363 Probab=65.52 E-value=0.28 Score=23.60 Aligned_cols=195 Identities=11% Similarity=0.085 Sum_probs=99.0 Q ss_pred CC---------cCHHHHHHHHHHHHHHHHHHHhhhhhhhhceeeecC-------CccchhhhhhccCCCchhhhccceee Q lcl|NC_013594. 1 MI---------VTPASIKALMTSWRKDFQGGLEDAPSQYNKIAMVVN-------SSTRSNTYGWLGKFPTLKEWVGKRTI 64 (305) Q Consensus 1 M~---------i~~~~l~~L~~~~~~~~~~~~~~a~~t~~~~a~~v~-------S~~~~~~y~~Lg~~P~l~ew~Ge~~~ 64 (305) |. |.|+.+..+ +...|.+.+ -+.+++.... .+-....|..+|+.-.+.| ..+... T Consensus 1 Ma~~~T~~~~~iiPev~s~~---v~~~~~~~~-----v~~~~~~~~~~l~g~~G~tv~ip~~~~~g~a~~~~~-g~~i~~ 71 (278) T protein:vir:80 1 MADLTTKLANLIDPEVMGPM---ISAKLPKAI-----KFGKIAPIDNSLEGQPGSEITVPKYKYIGDAQDVAE-GAAIDY 71 (278) T ss_pred CCCcceehhheecHHHHHHH---HHHHHHHhh-----hhcccceecccccCCCCCEEEEeeeccCCcceeecC-CCcCcc Confidence 65 555554443 222333222 1233332211 1111222222333221211 244455 Q ss_pred eccccccceeeeecccceeeccHHHhhccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCccccCCcccccccccccc Q lcl|NC_013594. 65 QQMEAHGYSIANKTFEGTVGISRDDFEDDNLGIYAPIFQEMGRSAAVQPDELIFKLLKDGFTQPCYDGQNFFDKEHPVYP 144 (305) Q Consensus 65 ~~l~~~~~~i~n~tfg~~i~i~R~~I~nDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~~~DGk~fF~adH~~~~ 144 (305) .++....-+.+.+.++..+.|+..+..----.....+.++++++.++..+..+++.|...++. + T Consensus 72 ~~lt~~~~~~~i~~~~~a~~v~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~l~~~l~~a~~~--------------~-- 135 (278) T protein:vir:80 72 SALETESVKHGIKKAGKGVKLTDESVLSGYGDPVEEAQKQIRMAIASKVDNDILEEALTTTLE--------------V-- 135 (278) T ss_pred cccccceeeEeeehhhccccccHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHHhccccc--------------c-- Confidence 666666667778889999999999988876678899999999999999999999888652110 0 Q ss_pred cccccchhhhhHhhhccCCCccceeeecccccccccchhhcccchhhhhhhccccccchhcccceeeeecccccccchhh Q lcl|NC_013594. 145 NVDGTGSAVNTSNIVEQDSFSGLPFYLLDCSRAVKPLIFQERRKPELVARTRIDDDHVFMDNEFLFGASARRAAGYGFWQ 224 (305) Q Consensus 145 n~~~tg~~~~~s~l~~~~~~~G~~~~l~~~~~~vkP~i~Q~r~~~~f~a~t~~~~~nvf~~~~~~~g~d~r~n~G~g~wq 224 (305) +++ .+. + ++| T Consensus 136 ----~~~-~t~----------------------------------------~--------------~~~----------- 145 (278) T protein:vir:80 136 ----KGA-INI----------------------------------------G--------------LID----------- 145 (278) T ss_pred ----ccc-ccc----------------------------------------c--------------hhh----------- Confidence 000 000 0 000 Q ss_pred hhhhcccccchHHHHHHHHHHHHhhcCCCceeccccCeEEecchhHHHHHHHHhhcccCCcc---ccccccc---cceee Q lcl|NC_013594. 225 MAVAVKGDLTLDNLWKGWQLMRAFEGDGGKKLGLKPTHIVVPVGLEKAAEQLLNRELFADGN---TTVSNEM---KGKLQ 298 (305) Q Consensus 225 ~A~~s~~~Lt~~~l~~ar~aM~~~k~~~G~~L~i~P~~LvVpp~le~~A~~ll~~~~~~~~~---~~~~N~~---~~~~~ 298 (305) -..+.+..++.++. ..+-| .+.+|+|+|.-...-++.-..+.+.... ....|-. ..-++ T Consensus 146 --------~~~~~~~da~~~l~----~~~~~---~~~~ivv~p~~~~~L~k~~~~~~~~~~~~g~~~~~~G~ig~~~G~~ 210 (278) T protein:vir:80 146 --------KIENTFTDAPDAIE----DESIT---TTGVLFLNYKDTAKLREEAAGSWTKASQLGDDLLVKGAFGELLGWE 210 (278) T ss_pred --------hHHHHHHHHHHhhc----ccCCC---cccEEEECHHHHHHHHhhhhhhccccccccccceeeccceeeccee Confidence 00122333333332 22222 2457899988665444432223332211 1111211 22379 Q ss_pred EEecccC Q lcl|NC_013594. 299 LVVADYL 305 (305) Q Consensus 299 ~iv~p~L 305 (305) +++++.| T Consensus 211 Vi~s~~~ 217 (278) T protein:vir:80 211 IVRTKKL 217 (278) T ss_pred EEEcCCC Confidence 9999999 No 33 >protein:vir:96762 Length: 632 # NCBI annotation: putative phage-related protein # Family: family:all:21 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039818;genbank:gi:126010917;genbank:GeneID:5076272 Probab=64.66 E-value=0.24 Score=24.00 Aligned_cols=245 Identities=15% Similarity=0.153 Sum_probs=87.8 Q ss_pred CCcCHHHH--HHHHHH-------------HHHHHHHHHhhhhhhhhce-eeecCCccchhhhhhccCCCchhhhcc---c Q lcl|NC_013594. 1 MIVTPASI--KALMTS-------------WRKDFQGGLEDAPSQYNKI-AMVVNSSTRSNTYGWLGKFPTLKEWVG---K 61 (305) Q Consensus 1 M~i~~~~l--~~L~~~-------------~~~~~~~~~~~a~~t~~~~-a~~v~S~~~~~~y~~Lg~~P~l~ew~G---e 61 (305) ..+....+ +++.++ +...|.+.+... +...++ ++.++.....-++.....-|.. -|+| + T Consensus 345 ~~~~~~~l~~ra~~~~t~~~gg~lvp~~~~~~~iie~lr~~-s~i~~l~~~~~~~~~g~~~ip~~~~~~~a-~wv~E~~~ 422 (632) T protein:vir:96 345 FYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNK-AIIGQMGARMLPGLVGDVDIPKKTSGANF-YWIGEDED 422 (632) T ss_pred hhhhHHHHHHhhhhcccccccccccccccchHHHHHHHhhc-chhhhhcceEeecCCcceEEEEEeCCcee-EeecCCcc Confidence 00000000 111100 001111111110 112233 3444433222223222222332 3544 3 Q ss_pred eeeeccccccceeeeecccceeeccHHHhhccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCc-cccCCcccccccc Q lcl|NC_013594. 62 RTIQQMEAHGYSIANKTFEGTVGISRDDFEDDNLGIYAPIFQEMGRSAAVQPDELIFKLLKDGFTQ-PCYDGQNFFDKEH 140 (305) Q Consensus 62 ~~~~~l~~~~~~i~n~tfg~~i~i~R~~I~nDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~-~~~DGk~fF~adH 140 (305) ....++.-...++..++++..+.|||+.++|++.++..-+...|+.+.++.++..++. |..+ .-.-|- +.++++ T Consensus 423 ~~~s~~~f~~i~l~~~k~~~~v~iS~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~----G~G~~~~p~Gi-~~~~~~ 497 (632) T protein:vir:96 423 VQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLT----GTGLANDPVGL-LNMTGV 497 (632) T ss_pred ccccccceeeEEeeeeEEEEehhhHHHHHhccchHHHHHHHHHHHHHHHHHHHHHhhc----ccCCCCcccee-eecccc Confidence 4445555566788999999999999999999999999999999999999999987652 2111 111231 445555 Q ss_pred cccccccccchhhh---hHhhh----c-cCCCccceeeeccccc-cc---------ccchhhcc---cchhhhhhhcccc Q lcl|NC_013594. 141 PVYPNVDGTGSAVN---TSNIV----E-QDSFSGLPFYLLDCSR-AV---------KPLIFQER---RKPELVARTRIDD 199 (305) Q Consensus 141 ~~~~n~~~tg~~~~---~s~l~----~-~~~~~G~~~~l~~~~~-~v---------kP~i~Q~r---~~~~f~a~t~~~~ 199 (305) +. ...+....+ +..|. . ..+.....|..-+... .+ .-.|||.- -.|..++..-|.+ T Consensus 498 ~~---~~~~~~~~~~~~i~~~~~~i~~~~~~~~~~~~~~~~~~~~~l~~~~l~d~~G~~i~~~~~l~G~pv~~s~~ip~~ 574 (632) T protein:vir:96 498 PA---LTYPAGGVDWASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQVFDNTGERIWQNNEVNGYRAEASNQIPAD 574 (632) T ss_pred cc---eecccccCCHHHHHHHHHHHhhcccccCccEEEEchhHHHHHHHHhccCCCCceeecCCeecccceEeccccccC Confidence 32 111111111 11111 0 1111112222211110 00 00011100 0011111112222 Q ss_pred ccchhcccceeeeecccccccchhhhhhhcccccchHHHHHHHHHHHHhhcCCCceecc--ccCeEEecchhHHHHHHHH Q lcl|NC_013594. 200 DHVFMDNEFLFGASARRAAGYGFWQMAVAVKGDLTLDNLWKGWQLMRAFEGDGGKKLGL--KPTHIVVPVGLEKAAEQLL 277 (305) Q Consensus 200 ~nvf~~~~~~~g~d~r~n~G~g~wq~A~~s~~~Lt~~~l~~ar~aM~~~k~~~G~~L~i--~P~~LvVpp~le~~A~~ll 277 (305) . .+|| |.... =.|.|. .-.|..+.+..+ ..++..+.. +=++-|+.|+ |..++ T Consensus 575 ~-------~~~g-d~s~~-~i~~~~-----~~~i~~~~~~~~--------~~~~v~~~~~~~~d~~v~~~~----af~~~ 628 (632) T protein:vir:96 575 T-------WIFG-DWSQI-VIAMWG-----VLDLKVDPYTKA--------ASDGLVLRVFQDVDAGVRRKE----AFCIA 628 (632) T ss_pred c-------EEEe-ecceE-EEEEec-----ceEEEEcccccc--------ccCceEEEEEeecCceeechh----hhhhe Confidence 1 1221 21110 011111 111222211110 112222222 2222232221 11111 Q ss_pred hhcc Q lcl|NC_013594. 278 NREL 281 (305) Q Consensus 278 ~~~~ 281 (305) +..- T Consensus 629 k~~A 632 (632) T protein:vir:96 629 KKGA 632 (632) T ss_pred eecC Confidence 1100 No 34 >protein:vir:8102 Length: 543 # NCBI annotation: gp6 # Family: family:all:21 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817683;genbank:gi:29566114;genbank:GeneID:1259308 Probab=64.22 E-value=0.3 Score=23.42 Aligned_cols=229 Identities=15% Similarity=0.091 Sum_probs=93.4 Q ss_pred CCcCHHH-----HHHHHHHHHHHHHHHHhhhhhhhhceeeecCCccchhhhhhccCCCchhhhccce---eeeccccccc Q lcl|NC_013594. 1 MIVTPAS-----IKALMTSWRKDFQGGLEDAPSQYNKIAMVVNSSTRSNTYGWLGKFPTLKEWVGKR---TIQQMEAHGY 72 (305) Q Consensus 1 M~i~~~~-----l~~L~~~~~~~~~~~~~~a~~t~~~~a~~v~S~~~~~~y~~Lg~~P~l~ew~Ge~---~~~~l~~~~~ 72 (305) +-+|... ...+ ....+...+.... ...++++.++.+. ...+.+....|. -.|+||- ...++.=..- T Consensus 251 ~~~t~~~gg~lip~~~---~~~ii~~~~~~~~-~l~~~~~~~~~~g-~~~~~~~~~~~~-a~~v~Eg~~~~~~~~~~~~i 324 (543) T protein:vir:81 251 MGLTKADGGYLVPFQL---DPTVIITSNGSLN-DIRRFARQVVATG-DVWHGVSSAAVQ-WSWDAEFEEVSDDSPEFGQP 324 (543) T ss_pred cccccccCcccCchhh---hhHHHHHHHhhhc-hhhhhcccccCCc-ceEEEEecCCcc-eeecccCcccccccccccee Confidence 1111100 0000 1222333332222 2344555444322 222333333333 4566543 3333444556 Q ss_pred eeeeecccceeeccHHHhhccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCccccCCcccccccccccccccccchh Q lcl|NC_013594. 73 SIANKTFEGTVGISRDDFEDDNLGIYAPIFQEMGRSAAVQPDELIFKLLKDGFTQPCYDGQNFFDKEHPVYPNVDGTGSA 152 (305) Q Consensus 73 ~i~n~tfg~~i~i~R~~I~nDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~~~DGk~fF~adH~~~~n~~~tg~~ 152 (305) ++..++++..+.|||+.+ +|...+..-+...++++.++..+..++ . | ||.. ++|.|--...++.. T Consensus 325 ~~~~~k~~~~~~is~ell-~d~~~~~~~i~~~l~~~~~~~~d~ail---~-G------~Gt~----~~p~Gi~~~~~~~~ 389 (543) T protein:vir:81 325 EIPVKKAQGFVPISIEAL-QDEANVTETVALLFAEGKDELEAVTLT---T-G------TGQG----NQPTGIVTALAGTA 389 (543) T ss_pred eeeeeeeEeeehhhHHHH-hccHHHHHHHHHHHHHHHHHHHHHHHh---c-c------CCCC----cccccchhhccccc Confidence 788999999999999977 567899999999999999998888653 2 2 2221 23321100000000 Q ss_pred hhhHhhhccCCCccc-eee-ecccccccccchhhcccchhhhhhhccccccchhcccceeeeecccccccchhhhhhhcc Q lcl|NC_013594. 153 VNTSNIVEQDSFSGL-PFY-LLDCSRAVKPLIFQERRKPELVARTRIDDDHVFMDNEFLFGASARRAAGYGFWQMAVAVK 230 (305) Q Consensus 153 ~~~s~l~~~~~~~G~-~~~-l~~~~~~vkP~i~Q~r~~~~f~a~t~~~~~nvf~~~~~~~g~d~r~n~G~g~wq~A~~s~ 230 (305) ..+. ....+. .+- +++....|+| ..+.++. | T Consensus 390 ~~~~-----~~~~~~~~~~~~~~~~~~l~~--------------------------------~~~~~~~---~------- 422 (543) T protein:vir:81 390 AEIA-----PVTAETFALADVYAVYEQLAA--------------------------------RHRRQGA---W------- 422 (543) T ss_pred cccc-----ccccccccHHHHHHHHHhhhc--------------------------------cccCCcE---E------- Confidence 0000 000000 000 0000000000 0111111 1 Q ss_pred cccchHHHHHHHHHHHHhhcCCCceeccccCeEEecchhHHHHHHHHhhcccCCcc-----ccc--------cccc---c Q lcl|NC_013594. 231 GDLTLDNLWKGWQLMRAFEGDGGKKLGLKPTHIVVPVGLEKAAEQLLNRELFADGN-----TTV--------SNEM---K 294 (305) Q Consensus 231 ~~Lt~~~l~~ar~aM~~~k~~~G~~L~i~P~~LvVpp~le~~A~~ll~~~~~~~~~-----~~~--------~N~~---~ 294 (305) -++. ..+..+++.||.+|+||-- |..-=.|+.+ ...-++-+..++.+. ++. .+.+ + T Consensus 423 -v~n~----~~~~~l~~lkd~~G~~l~~-~~~~g~~~~l--~G~pv~~~~~~~~~~~~~~~~~~~~i~~gd~~~~~i~~~ 494 (543) T protein:vir:81 423 -LANN----LIYNKIRQFDTQGGAGLWT-TIGNGEPSQL--LGRPVGEAEAMDANWNTSASADNFVLLYGNFQNYVIADR 494 (543) T ss_pred -EEcH----HHHHHHHHhhcCCCceecc-CcCCCCCccc--cceeeEEeccccccccccccCCcceEEEeeccceeEEee Confidence 1222 2234566788888888632 1000001111 111222222222211 111 1111 1 Q ss_pred ceeeEEecccC Q lcl|NC_013594. 295 GKLQLVVADYL 305 (305) Q Consensus 295 ~~~~~iv~p~L 305 (305) +-+++.++|+. T Consensus 495 ~~~~i~~~~~~ 505 (543) T protein:vir:81 495 IGMTVEFIPHL 505 (543) T ss_pred cccEEEEeccc Confidence 23667777776 No 35 >protein:vir:100247 Length: 425 # NCBI annotation: gp76 # Family: family:all:21 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355412;genbank:gi:77864702;genbank:GeneID:3725969 Probab=62.12 E-value=0.28 Score=23.64 Aligned_cols=233 Identities=12% Similarity=0.074 Sum_probs=89.5 Q ss_pred CCcCHHHHHHHHH------------HHHHHHHHHHhhhhhhhhceeeecCCccchhhhhhccCCCchhhhccceeeec-c Q lcl|NC_013594. 1 MIVTPASIKALMT------------SWRKDFQGGLEDAPSQYNKIAMVVNSSTRSNTYGWLGKFPTLKEWVGKRTIQQ-M 67 (305) Q Consensus 1 M~i~~~~l~~L~~------------~~~~~~~~~~~~a~~t~~~~a~~v~S~~~~~~y~~Lg~~P~l~ew~Ge~~~~~-l 67 (305) ++-+.+.-+++.. .+...+.+...... ...++|+.++.++...+|...-..+.. .|+||..-.. . T Consensus 120 ~l~~~e~~~al~~~t~~~gG~lvP~~~~~~ii~~~~~~s-~l~~l~~~~~~~~~~~~~~~~~~~~~a-~wv~E~~~~~~~ 197 (425) T protein:vir:10 120 HVKRGDVQAALNKGEDSEGGYLTPIEWDRTITNKLVLIS-PMRQLCRVQPVSKAGFSKLFNMGGTTS-GWVGEASQRPQT 197 (425) T ss_pred HhhhhhhHHHhhcCcCCCCceeccHhHHHHHHHHHHhhh-hhhhhceeeeccCCceEEEEEcCCcce-eeeccccccccc Confidence 1111111111111 01111111111111 133456666644444455444334443 6877754322 1 Q ss_pred ---ccccceeeeecccceeeccHHHhhccccchhHHHHHHHHHHHHhhHHHHHHH---------HHhccC---Cc----- Q lcl|NC_013594. 68 ---EAHGYSIANKTFEGTVGISRDDFEDDNLGIYAPIFQEMGRSAAVQPDELIFK---------LLKDGF---TQ----- 127 (305) Q Consensus 68 ---~~~~~~i~n~tfg~~i~i~R~~I~nDdlG~~~~~~~~~G~aAa~~~~~lv~~---------lL~~g~---~~----- 127 (305) .=..-++..++++..+.||++.+.|-...+..-+.+.++++.++..+..++. +|+.-. +. T Consensus 198 ~~~~f~~v~~~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~ai~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~ 277 (425) T protein:vir:10 198 NAATFQPLSFASGEIYANPAATQQILDDAEIDLESWLATEVQTEFAKQEGKAFLAGDGTNKPNGLLTYIAGGANAAKHPF 277 (425) T ss_pred cccccceeeeeheeeEeehHhHHHHHhcchhHHHHHHHHHHHHHHHHHHHhhhhcccCCCCcceeeeccccccccccccc Confidence 2244588899999999999999988889999999999999999999885442 111000 00 Q ss_pred -------------cccCC-cccccccccccccccc-cchhhhhHhhhccCCCccceeeecccc------cccccchhhcc Q lcl|NC_013594. 128 -------------PCYDG-QNFFDKEHPVYPNVDG-TGSAVNTSNIVEQDSFSGLPFYLLDCS------RAVKPLIFQER 186 (305) Q Consensus 128 -------------~~~DG-k~fF~adH~~~~n~~~-tg~~~~~s~l~~~~~~~G~~~~l~~~~------~~vkP~i~Q~r 186 (305) .-||. ..+++.=|+.+.+.+. .-+......|..-++..|.+++.-+.. +.=+|+++-.- T Consensus 278 ~~~~~~~~~~~~~~~~d~l~~l~~~l~~~~~~~a~~vmn~~~~~~L~~lkD~~G~~l~~~~~~~g~~~~l~G~PV~~~~~ 357 (425) T protein:vir:10 278 GAIEVVNSGAAADITSDGIIDLVYDLPSAFTGNARFAMNRNTQRQVRKLKDGQGNYLWQPSYVAGQPATLAGYPVTEVPD 357 (425) T ss_pred cccccccccccccccHHHHHHHHhhhhhhhccCCEEEEchHHHHHHHHhhcCCCceeeccCccCCCCceecceeeEEecC Confidence 00000 0011111111000000 000111122333356777766543211 11134332211 Q ss_pred cch-----------------hhhh--hhccccccchhcccceeeeecccccccchhhhhhhcccccchHHHHHHH Q lcl|NC_013594. 187 RKP-----------------ELVA--RTRIDDDHVFMDNEFLFGASARRAAGYGFWQMAVAVKGDLTLDNLWKGW 242 (305) Q Consensus 187 ~~~-----------------~f~a--~t~~~~~nvf~~~~~~~g~d~r~n~G~g~wq~A~~s~~~Lt~~~l~~ar 242 (305) .+. .+.. ...-..+.-|.++...|-+..|..++ .+. ..++-.-.+.++. T Consensus 358 ~p~~~~~~~~i~~Gd~~~~~~i~~~~~~~v~~d~~~~~~~~~~~~~~r~d~~--v~~-----~~A~~~l~~~as~ 425 (425) T protein:vir:10 358 MPDVAANSTPILFGDFQQTYLIIDRIGVRVLRDPYTAKPYVLFYTTKRVGGG--LLN-----PEPMRAMKVAASE 425 (425) T ss_pred cCCccCCccEEEEEehhccEEEEEecceEEEecccccCCcEEEEEEEEeccE--eec-----ccceEEEEeeccC Confidence 100 0000 00000111122333333333332221 000 0000000011111 No 36 >protein:vir:4226 Length: 326 # NCBI annotation: observed 35.2Kd protein # Family: family:all:507 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039681;swissprot:sw:q05223;genbank:gi:9625447;uniprot:Q05223;genbank:GeneID:2942929 Probab=61.49 E-value=0.35 Score=23.07 Aligned_cols=228 Identities=11% Similarity=0.076 Sum_probs=92.3 Q ss_pred CCcCH-HHHHHHHHHH--------------------HHHHHHHHhhhhhhhhceeeecCCccchhhhhhccCCCchhhhc Q lcl|NC_013594. 1 MIVTP-ASIKALMTSW--------------------RKDFQGGLEDAPSQYNKIAMVVNSSTRSNTYGWLGKFPTLKEWV 59 (305) Q Consensus 1 M~i~~-~~l~~L~~~~--------------------~~~~~~~~~~a~~t~~~~a~~v~S~~~~~~y~~Lg~~P~l~ew~ 59 (305) |.+|+ ..++-+...- +..+....+..| -.++|++++-......|..+..-|.. .|+ T Consensus 1 ~~~~~~r~~~~~~~~e~~a~~~~~~~~g~~ip~~~~~~ii~~~~~~s~--i~~~~~~~~~~~~~~~~p~~~~~~~a-~~v 77 (326) T protein:vir:42 1 MAVNPDRTTPFLGVNDPKVAQTGDSMFEGYLEPEQAQDYFAEAEKISI--VQQFAQKIPMGTTGQKIPHWTGDVSA-SWI 77 (326) T ss_pred CCCCccchhhhcCcchhhheeccccCCcceechhhHHHHHHHHHhcch--hhhhcceeeccCCceEEEEEeCCcce-EEe Confidence 55554 1111111111 111122122221 33445565533333334333333443 455 Q ss_pred cce---eeeccccccceeeeecccceeeccHHHhhccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCccccCCcccc Q lcl|NC_013594. 60 GKR---TIQQMEAHGYSIANKTFEGTVGISRDDFEDDNLGIYAPIFQEMGRSAAVQPDELIFKLLKDGFTQPCYDGQNFF 136 (305) Q Consensus 60 Ge~---~~~~l~~~~~~i~n~tfg~~i~i~R~~I~nDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~~~DGk~fF 136 (305) ||- .-.++.=..-++..++++..+.|||+.+.+=...+..-+.+.++++.++..++.++ . |... |++.. T Consensus 78 ~Eg~~~~~~~~~f~~i~~~~~k~~~~v~iS~ell~~s~~~~~~~i~~~l~~a~~~~~d~a~l---~-G~gs----~~p~g 149 (326) T protein:vir:42 78 GEGDMKPITKGNMTSQTIAPHKIATIFVASAETVRANPANYLGTMRTKVATAFAMAFDNAAI---N-GTDS----PFPTF 149 (326) T ss_pred cCCccccccccceeEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHhh---c-ccCC----Ccccc Confidence 443 33334445567889999999999999998878899999999999999999988665 2 2111 22211 Q ss_pred cccccccccccccchhhhhHhhhccCCCccc-e---eeecccccccccchhhcccchhhhhhhccccccchhcccceeee Q lcl|NC_013594. 137 DKEHPVYPNVDGTGSAVNTSNIVEQDSFSGL-P---FYLLDCSRAVKPLIFQERRKPELVARTRIDDDHVFMDNEFLFGA 212 (305) Q Consensus 137 ~adH~~~~n~~~tg~~~~~s~l~~~~~~~G~-~---~~l~~~~~~vkP~i~Q~r~~~~f~a~t~~~~~nvf~~~~~~~g~ 212 (305) -..+. .+...... .. ....+. . ..+++....+.+ T Consensus 150 i~~~~-----~~~~~~~~-~~----~~~~~~~~~~~~~~~~~~~~~~~-------------------------------- 187 (326) T protein:vir:42 150 LAQTT-----KEVSLVDP-DG----TGSNADLTVYDAVAVNALSLLVN-------------------------------- 187 (326) T ss_pred ccccc-----cccceeec-cc----ccccccchhHHHHHHHHHhhhhh-------------------------------- Confidence 11110 00000000 00 000000 0 000000000000 Q ss_pred ecccccccchhhhhhhcccccchHHHHHHHHHHHHhhcCCCceeccc------c------CeEEecchhHHHHHHHHhhc Q lcl|NC_013594. 213 SARRAAGYGFWQMAVAVKGDLTLDNLWKGWQLMRAFEGDGGKKLGLK------P------THIVVPVGLEKAAEQLLNRE 280 (305) Q Consensus 213 d~r~n~G~g~wq~A~~s~~~Lt~~~l~~ar~aM~~~k~~~G~~L~i~------P------~~LvVpp~le~~A~~ll~~~ 280 (305) ..+.+++ | -++. ..+.++++.|+.+|++|-.. | +++-+|. .-++ T Consensus 188 ~~~~~a~---~--------v~n~----~~~~~L~~lkd~~G~~l~~~~~~~~~~~~~~~~~l~G~pv---------~~~~ 243 (326) T protein:vir:42 188 AGKKWTH---T--------LLDD----ITEPILNGAKDKSGRPLFIESTYTEENSPFRLGRIVARPT---------ILSD 243 (326) T ss_pred hccCccE---E--------EEeH----HHHHHHHHhhccCCceeeccccccCccccccCceeeeeeE---------EEcC Confidence 0000110 0 0111 22346666788888876321 1 1222222 1122 Q ss_pred ccCCcccc----c-ccccc---ceeeEE--ecccC Q lcl|NC_013594. 281 LFADGNTT----V-SNEMK---GKLQLV--VADYL 305 (305) Q Consensus 281 ~~~~~~~~----~-~N~~~---~~~~~i--v~p~L 305 (305) .++.+..- + +.-+. +-+++- -+..+ T Consensus 244 ~~~~~~~~~~~Gd~s~~~~~~~~~~~v~~~~e~~~ 278 (326) T protein:vir:42 244 HVASGTVVGYQGDFRQLVWGQVGGLSFDVTDQATL 278 (326) T ss_pred CCCCCceEEEEeecceEEEEEecceEEEEeeccee Confidence 23332210 0 11111 112222 22222 No 37 >protein:vir:739 Length: 231 # NCBI annotation: major structural protein 4 # Family: family:all:522 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108716;genbank:gi:13487838;genbank:GeneID:920884 Probab=60.52 E-value=0.37 Score=22.95 Aligned_cols=161 Identities=14% Similarity=0.109 Sum_probs=83.6 Q ss_pred eeecCCccchhhhhhccCCCchhhhccceeeeccccccceeeeecccceeeccHHHhhccccchhHHHHHHHHHHHHhhH Q lcl|NC_013594. 34 AMVVNSSTRSNTYGWLGKFPTLKEWVGKRTIQQMEAHGYSIANKTFEGTVGISRDDFEDDNLGIYAPIFQEMGRSAAVQP 113 (305) Q Consensus 34 a~~v~S~~~~~~y~~Lg~~P~l~ew~Ge~~~~~l~~~~~~i~n~tfg~~i~i~R~~I~nDdlG~~~~~~~~~G~aAa~~~ 113 (305) -.-+..-...+...|+|++-.+.| ..+.....|+-..-+.+.+..|+.+.|+-.+..-===.-+....+++|.+-++.. T Consensus 1 ~~~~~~Gdtit~P~~iGda~~v~e-G~~i~~~~l~~t~~~atIk~~gk~~~itD~a~l~~~gDp~~ea~~Q~~~~iA~kv 79 (231) T protein:vir:73 1 ENGINLANLCEYPNDIGDAADVAE-GGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLANKV 79 (231) T ss_pred CccccCCceEEecccccchhhhcC-CCcCChhhccccceeeeEeeeccceeeeHHHHhhccCchHHHHHHHHHHHHHHhh Confidence 111111111111235776655444 4556677777777788889999999999998763101123456667777777777 Q ss_pred HHHHHHHHhccCCccccCCcccccccccccccccccchhhhhHhhhccCCCccceeeecccccccccchhhcccchhhhh Q lcl|NC_013594. 114 DELIFKLLKDGFTQPCYDGQNFFDKEHPVYPNVDGTGSAVNTSNIVEQDSFSGLPFYLLDCSRAVKPLIFQERRKPELVA 193 (305) Q Consensus 114 ~~lv~~lL~~g~~~~~~DGk~fF~adH~~~~n~~~tg~~~~~s~l~~~~~~~G~~~~l~~~~~~vkP~i~Q~r~~~~f~a 193 (305) |.-+++.|... +. T Consensus 80 D~di~~~~~~a--~l----------------------------------------------------------------- 92 (231) T protein:vir:73 80 DDDLLKAAKTT--SQ----------------------------------------------------------------- 92 (231) T ss_pred hHHHHHhhccc--cc----------------------------------------------------------------- Confidence 77655444320 00 Q ss_pred hhccccccchhcccceeeeecccccccchhhhhhhcccccchHHHHHHHHHHHHhhcCCCceeccccCeEEecchhHHHH Q lcl|NC_013594. 194 RTRIDDDHVFMDNEFLFGASARRAAGYGFWQMAVAVKGDLTLDNLWKGWQLMRAFEGDGGKKLGLKPTHIVVPVGLEKAA 273 (305) Q Consensus 194 ~t~~~~~nvf~~~~~~~g~d~r~n~G~g~wq~A~~s~~~Lt~~~l~~ar~aM~~~k~~~G~~L~i~P~~LvVpp~le~~A 273 (305) .++.++|.+.+..|.+.+... + -.|.+++|+|.--..- T Consensus 93 ----------------------------------~~~~~~t~d~i~~A~~~fgde---~-----~~~~vivv~p~~~~~L 130 (231) T protein:vir:73 93 ----------------------------------TVSTKANVDGVQAALDIFNDE---D-----AQAYVLIVNPKDAAKI 130 (231) T ss_pred ----------------------------------cccccccHHHHHHHHHHhccc---c-----ccceEEEEcchHHHhh Confidence 011234555555555554321 1 2356788888654444 Q ss_pred HHHHhhcccC--Ccccccccc----ccceeeEEecccC Q lcl|NC_013594. 274 EQLLNRELFA--DGNTTVSNE----MKGKLQLVVADYL 305 (305) Q Consensus 274 ~~ll~~~~~~--~~~~~~~N~----~~~~~~~iv~p~L 305 (305) |+........ .++.-..|- +.| +++++++.+ T Consensus 131 rk~~~~~~~~~~~g~~i~~~G~iG~i~G-~~Vi~S~~~ 167 (231) T protein:vir:73 131 RKDANAKNIGSEVGANALINGTYADVLG-AQIVRSKKL 167 (231) T ss_pred hhccchhhhhhhhccceeeecccceEcc-eEEEEcCCC Confidence 4433322211 111111222 233 688888888 No 38 >protein:vir:94142 Length: 304 # NCBI annotation: ORF013 # Family: family:all:507 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240234;genbank:gi:66395898;genbank:GeneID:5133311 Probab=60.24 E-value=0.054 Score=27.54 Aligned_cols=239 Identities=11% Similarity=0.067 Sum_probs=93.7 Q ss_pred CCcCH---------HHHHH-HHHHHHHHHHHHHhhhhhhhhceeeecCCccchhhhhhccCCCchhhhccceeee---cc Q lcl|NC_013594. 1 MIVTP---------ASIKA-LMTSWRKDFQGGLEDAPSQYNKIAMVVNSSTRSNTYGWLGKFPTLKEWVGKRTIQ---QM 67 (305) Q Consensus 1 M~i~~---------~~l~~-L~~~~~~~~~~~~~~a~~t~~~~a~~v~S~~~~~~y~~Lg~~P~l~ew~Ge~~~~---~l 67 (305) |.... +.-.. +-..+...+.+.....-+ ..+.|+.++-.....++..+..-+. -.|++|..-. +. T Consensus 1 ma~~~~~~~~~~~t~~gg~lip~~~~~~ii~~~~~~~~-l~~~~~~~~~~~~~~~ip~~~~~~~-a~~v~E~~~~~~~~~ 78 (304) T protein:vir:94 1 MATPTYTPGNVILSDFKNGVIPAEQGTLIMKDIMANSA-IMKLAKNEPMTAQKKKFTYLAKGVG-AYWVSETERIQTSKP 78 (304) T ss_pred CcccccccccccccCCCceecchhHHHHHHHHHHhccc-hhhhcceeeccCCceEEEEEeCCcc-eEEeecCcccccccc Confidence 33221 11000 111122222222222222 4555666664433334444444344 3677654432 23 Q ss_pred ccccceeeeecccceeeccHHHhhccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCccccCCccccccccc---ccc Q lcl|NC_013594. 68 EAHGYSIANKTFEGTVGISRDDFEDDNLGIYAPIFQEMGRSAAVQPDELIFKLLKDGFTQPCYDGQNFFDKEHP---VYP 144 (305) Q Consensus 68 ~~~~~~i~n~tfg~~i~i~R~~I~nDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~~~DGk~fF~adH~---~~~ 144 (305) .=..-+++.++++..+.||++.+.+-...+..-+.+.++++.++.+++.++. |.... .+...+-..+. ... T Consensus 79 ~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~ia~~~d~~~l~----G~g~~--~~~~~~~~~~~~~~~~~ 152 (304) T protein:vir:94 79 EYAQAEMEAKKIGVIIPLSKEFLKWTAKDFFNEVKPLIAEAFYKAFDQAVIF----GTKSP--YNTSTSGKPLVEGAEEK 152 (304) T ss_pred eeeEEEEEEEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhee----ccCCC--ccccccccccccccccc Confidence 3345577889999999999999999888999999999999999998877642 11100 00000000000 000 Q ss_pred cccccchhhhhH---------------------------hhhccCCCccceeeecccc-cccccchhhcccchhhhhhhc Q lcl|NC_013594. 145 NVDGTGSAVNTS---------------------------NIVEQDSFSGLPFYLLDCS-RAVKPLIFQERRKPELVARTR 196 (305) Q Consensus 145 n~~~tg~~~~~s---------------------------~l~~~~~~~G~~~~l~~~~-~~vkP~i~Q~r~~~~f~a~t~ 196 (305) ...++....... .|..-++..|.+++..+.. +.=+|++.... - T Consensus 153 ~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~lkd~~G~~l~~~~~~~l~G~PV~~~~~---------~ 223 (304) T protein:vir:94 153 GNVVTDTNNLYVDLSALMATIEDEELDPNGVLTTRSFRSKMRNALDANDRPLFDANGNEIMGLPLSYTGA---------D 223 (304) T ss_pred ccccccccchHHHHHHHHHHhhhccCCcCEEEEcHHHHHHHHHhhccCCcEeecCCCccccceeeEEecc---------c Confidence 000111111111 1222234444444332211 11123322111 1 Q ss_pred cccccchhcccceeeeecccccccchhhhhhhcccccchHHHHHHHHHHHHhhcCCCceec------------------- Q lcl|NC_013594. 197 IDDDHVFMDNEFLFGASARRAAGYGFWQMAVAVKGDLTLDNLWKGWQLMRAFEGDGGKKLG------------------- 257 (305) Q Consensus 197 ~~~~nvf~~~~~~~g~d~r~n~G~g~wq~A~~s~~~Lt~~~l~~ar~aM~~~k~~~G~~L~------------------- 257 (305) |.+.+ +...+| .|.. ++-+|.| +.++++-+..+-..+....+.+|+..+ T Consensus 224 ~~~~~---~~~~~~-gd~~-~~~~~~~-------~~~~i~~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v 291 (304) T protein:vir:94 224 VYDKK---KSLALM-GDWD-YARYGIL-------QGIEYAISEDATLTTLQASDASGQPVSLFERDMFALRATMHIAYMN 291 (304) T ss_pred ccCCC---CcEEEE-Eehh-hEEEEEe-------cceEEEEeecceeeeecccccCccchhhhhcCcEEEEEEEEeccEe Confidence 11100 011222 1221 1122222 222222222221122222223332111 Q ss_pred cccCe--EEecch Q lcl|NC_013594. 258 LKPTH--IVVPVG 268 (305) Q Consensus 258 i~P~~--LvVpp~ 268 (305) +.|.- +|.+.+ T Consensus 292 ~~~~a~~~l~~a~ 304 (304) T protein:vir:94 292 VKPEAFATLKPTE 304 (304) T ss_pred ecccceEEEEecC Confidence 12442 223333 No 39 >protein:vir:105905 Length: 304 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004375;genbank:gi:122891830;genbank:GeneID:4712376 Probab=60.24 E-value=0.054 Score=27.54 Aligned_cols=239 Identities=11% Similarity=0.067 Sum_probs=93.7 Q ss_pred CCcCH---------HHHHH-HHHHHHHHHHHHHhhhhhhhhceeeecCCccchhhhhhccCCCchhhhccceeee---cc Q lcl|NC_013594. 1 MIVTP---------ASIKA-LMTSWRKDFQGGLEDAPSQYNKIAMVVNSSTRSNTYGWLGKFPTLKEWVGKRTIQ---QM 67 (305) Q Consensus 1 M~i~~---------~~l~~-L~~~~~~~~~~~~~~a~~t~~~~a~~v~S~~~~~~y~~Lg~~P~l~ew~Ge~~~~---~l 67 (305) |.... +.-.. +-..+...+.+.....-+ ..+.|+.++-.....++..+..-+. -.|++|..-. +. T Consensus 1 ma~~~~~~~~~~~t~~gg~lip~~~~~~ii~~~~~~~~-l~~~~~~~~~~~~~~~ip~~~~~~~-a~~v~E~~~~~~~~~ 78 (304) T protein:vir:10 1 MATPTYTPGNVILSDFKNGVIPAEQGTLIMKDIMANSA-IMKLAKNEPMTAQKKKFTYLAKGVG-AYWVSETERIQTSKP 78 (304) T ss_pred CcccccccccccccCCCceecchhHHHHHHHHHHhccc-hhhhcceeeccCCceEEEEEeCCcc-eEEeecCcccccccc Confidence 33221 11000 111122222222222222 4555666664433334444444344 3677654432 23 Q ss_pred ccccceeeeecccceeeccHHHhhccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCccccCCccccccccc---ccc Q lcl|NC_013594. 68 EAHGYSIANKTFEGTVGISRDDFEDDNLGIYAPIFQEMGRSAAVQPDELIFKLLKDGFTQPCYDGQNFFDKEHP---VYP 144 (305) Q Consensus 68 ~~~~~~i~n~tfg~~i~i~R~~I~nDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~~~DGk~fF~adH~---~~~ 144 (305) .=..-+++.++++..+.||++.+.+-...+..-+.+.++++.++.+++.++. |.... .+...+-..+. ... T Consensus 79 ~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~ia~~~d~~~l~----G~g~~--~~~~~~~~~~~~~~~~~ 152 (304) T protein:vir:10 79 EYAQAEMEAKKIGVIIPLSKEFLKWTAKDFFNEVKPLIAEAFYKAFDQAVIF----GTKSP--YNTSTSGKPLVEGAEEK 152 (304) T ss_pred eeeEEEEEEEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhee----ccCCC--ccccccccccccccccc Confidence 3345577889999999999999999888999999999999999998877642 11100 00000000000 000 Q ss_pred cccccchhhhhH---------------------------hhhccCCCccceeeecccc-cccccchhhcccchhhhhhhc Q lcl|NC_013594. 145 NVDGTGSAVNTS---------------------------NIVEQDSFSGLPFYLLDCS-RAVKPLIFQERRKPELVARTR 196 (305) Q Consensus 145 n~~~tg~~~~~s---------------------------~l~~~~~~~G~~~~l~~~~-~~vkP~i~Q~r~~~~f~a~t~ 196 (305) ...++....... .|..-++..|.+++..+.. +.=+|++.... - T Consensus 153 ~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~lkd~~G~~l~~~~~~~l~G~PV~~~~~---------~ 223 (304) T protein:vir:10 153 GNVVTDTNNLYVDLSALMATIEDEELDPNGVLTTRSFRSKMRNALDANDRPLFDANGNEIMGLPLSYTGA---------D 223 (304) T ss_pred ccccccccchHHHHHHHHHHhhhccCCcCEEEEcHHHHHHHHHhhccCCcEeecCCCccccceeeEEecc---------c Confidence 000111111111 1222234444444332211 11123322111 1 Q ss_pred cccccchhcccceeeeecccccccchhhhhhhcccccchHHHHHHHHHHHHhhcCCCceec------------------- Q lcl|NC_013594. 197 IDDDHVFMDNEFLFGASARRAAGYGFWQMAVAVKGDLTLDNLWKGWQLMRAFEGDGGKKLG------------------- 257 (305) Q Consensus 197 ~~~~nvf~~~~~~~g~d~r~n~G~g~wq~A~~s~~~Lt~~~l~~ar~aM~~~k~~~G~~L~------------------- 257 (305) |.+.+ +...+| .|.. ++-+|.| +.++++-+..+-..+....+.+|+..+ T Consensus 224 ~~~~~---~~~~~~-gd~~-~~~~~~~-------~~~~i~~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v 291 (304) T protein:vir:10 224 VYDKK---KSLALM-GDWD-YARYGIL-------QGIEYAISEDATLTTLQASDASGQPVSLFERDMFALRATMHIAYMN 291 (304) T ss_pred ccCCC---CcEEEE-Eehh-hEEEEEe-------cceEEEEeecceeeeecccccCccchhhhhcCcEEEEEEEEeccEe Confidence 11100 011222 1221 1122222 222222222221122222223332111 Q ss_pred cccCe--EEecch Q lcl|NC_013594. 258 LKPTH--IVVPVG 268 (305) Q Consensus 258 i~P~~--LvVpp~ 268 (305) +.|.- +|.+.+ T Consensus 292 ~~~~a~~~l~~a~ 304 (304) T protein:vir:10 292 VKPEAFATLKPTE 304 (304) T ss_pred ecccceEEEEecC Confidence 12442 223333 No 40 >protein:vir:4092 Length: 390 # NCBI annotation: major capsid protein a # Family: family:all:635 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510986;swissprot:trembl:q8w604;genbank:gi:17488508;uniprot:Q8W604;genbank:GeneID:1260361 Probab=58.08 E-value=0.42 Score=22.65 Aligned_cols=238 Identities=11% Similarity=0.063 Sum_probs=95.9 Q ss_pred CCcCHHHHHHHHHHHHHHHHHHHhhh---------hhhh--------------hceeeecCCccchhhhhhccCCCchhh Q lcl|NC_013594. 1 MIVTPASIKALMTSWRKDFQGGLEDA---------PSQY--------------NKIAMVVNSSTRSNTYGWLGKFPTLKE 57 (305) Q Consensus 1 M~i~~~~l~~L~~~~~~~~~~~~~~a---------~~t~--------------~~~a~~v~S~~~~~~y~~Lg~~P~l~e 57 (305) ..........|....++.+++..+.. |+++ .+.|+.++.......+.....-|. -. T Consensus 62 ~~~~~~~~~~l~~~~r~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~~~~~s~i~~~~~~~~~~~~~~~i~~~~~~~~-a~ 140 (390) T protein:vir:40 62 NVLASRGANALTSDESKYYNEVIAGNGFAGVTALLPPTVFERVFEDLTVEHPLLSKINFVNTTATTEWIISVGDVAT-AW 140 (390) T ss_pred HHHHhcCchhccHHHHHHHHHHHhccCcccCcccccHHHHHHHHHHHHhhhhhhhhceeeecCCceeEEEEEcCCcc-ee Confidence 00000001111112222222211111 2222 234566554444444444444444 35 Q ss_pred hccce-ee---eccccccceeeeecccceeeccHHHhhccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCccccCCc Q lcl|NC_013594. 58 WVGKR-TI---QQMEAHGYSIANKTFEGTVGISRDDFEDDNLGIYAPIFQEMGRSAAVQPDELIFKLLKDGFTQPCYDGQ 133 (305) Q Consensus 58 w~Ge~-~~---~~l~~~~~~i~n~tfg~~i~i~R~~I~nDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~~~DGk 133 (305) |++|- .+ .+..=..-++..++++..+.||++.+.|-..++..-+.+.++++.++..++.++ . |... |+ T Consensus 141 ~~~E~~~~~~~~~~~f~~i~l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~~i~~~~~~a~l---~-G~G~----~~ 212 (390) T protein:vir:40 141 WGPLCAEIKEVLDNGFDKIQTGMYKLSAYIPVCNAMLDLGPSWLDQYVRTILGEAMALGLEAGIV---N-GSGK----DQ 212 (390) T ss_pred eeccccccCccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhh---c-ccCC----Cc Confidence 66542 22 223335567888999999999999999999999999999999999999997443 2 2111 23 Q ss_pred ccccccccccccccccchhhhhHhhhccCCCccceeeecccccccccchhhcccchhhhhhhccccccchhcccceeeee Q lcl|NC_013594. 134 NFFDKEHPVYPNVDGTGSAVNTSNIVEQDSFSGLPFYLLDCSRAVKPLIFQERRKPELVARTRIDDDHVFMDNEFLFGAS 213 (305) Q Consensus 134 ~fF~adH~~~~n~~~tg~~~~~s~l~~~~~~~G~~~~l~~~~~~vkP~i~Q~r~~~~f~a~t~~~~~nvf~~~~~~~g~d 213 (305) |.---.. ..+...... .......+--.+....+.-+ ...-. ++.. + T Consensus 213 P~Gil~~-----~~~~~~~~~-------~~~~~~~~t~~~~~~~~~~l----------~~~~~-~~~~-----------~ 258 (390) T protein:vir:40 213 PIGMMRD-----LNNVTAGEH-------PVKTATPLTDLTPATLATKV----------MLPLT-DNGK-----------K 258 (390) T ss_pred cceeeec-----ccccccccc-------ccccccccchhhHHHHHHHH----------HHHhh-cchh-----------h Confidence 2100000 000000000 00000000000000000000 00000 0000 0 Q ss_pred cccccccchhhhhhhcccccchHHHHHHHHHHHHhhcCCCceecc---ccCeEEecchhHHHHHHHHhhcccCCcccc-- Q lcl|NC_013594. 214 ARRAAGYGFWQMAVAVKGDLTLDNLWKGWQLMRAFEGDGGKKLGL---KPTHIVVPVGLEKAAEQLLNRELFADGNTT-- 288 (305) Q Consensus 214 ~r~n~G~g~wq~A~~s~~~Lt~~~l~~ar~aM~~~k~~~G~~L~i---~P~~LvVpp~le~~A~~ll~~~~~~~~~~~-- 288 (305) .+.|+. | -++...+..-..+++.+++.+|+++.- .+.-+|+. ..++.+..- T Consensus 259 ~~~~a~---~--------i~n~~t~~~~l~~~~~~~d~~G~~v~~~~~~g~pvv~~-------------~~~p~~~i~~G 314 (390) T protein:vir:40 259 SVSDAI---L--------VINPADYWSKIYAATSYMTPQGVWVTGILPVPLEIVQS-------------VAVPVGKAVAG 314 (390) T ss_pred hhcCce---E--------EEcchhHHHHHHHHhhccCCCCccccccCCCceeEEEc-------------CCCCCCcEEEE Confidence 111111 1 022222223334567888888887521 12222222 222222110 Q ss_pred c-ccc---ccceeeEEecccC Q lcl|NC_013594. 289 V-SNE---MKGKLQLVVADYL 305 (305) Q Consensus 289 ~-~N~---~~~~~~~iv~p~L 305 (305) + .+- .++-+++.+++.. T Consensus 315 d~s~~~i~~~~~~~v~~~~~~ 335 (390) T protein:vir:40 315 RAKDYFMGIGSEQVIRTSTEY 335 (390) T ss_pred eeceEEEEeecceEEEecchh Confidence 0 011 1334556565554 No 41 >protein:vir:7771 Length: 330 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817605;genbank:gi:29566035;genbank:GeneID:1259229 Probab=57.76 E-value=0.3 Score=23.42 Aligned_cols=253 Identities=11% Similarity=0.034 Sum_probs=93.8 Q ss_pred CCcCHHHHHH-HHHH-HHHHHHHHHhhhhhhhhceeeecCCccchhhhhhccCCCchhhhcccee---eeccccccceee Q lcl|NC_013594. 1 MIVTPASIKA-LMTS-WRKDFQGGLEDAPSQYNKIAMVVNSSTRSNTYGWLGKFPTLKEWVGKRT---IQQMEAHGYSIA 75 (305) Q Consensus 1 M~i~~~~l~~-L~~~-~~~~~~~~~~~a~~t~~~~a~~v~S~~~~~~y~~Lg~~P~l~ew~Ge~~---~~~l~~~~~~i~ 75 (305) |..++..-.. |-.- .+..+....+.. ...++++.++-....-+|..+..-+. -.|++|-. -.++.=..-++. T Consensus 10 ~~~~t~~~g~~i~~~~~~~ii~~~~~~s--~l~~~~~~~~~~~~~~~~p~~~~~~~-a~~v~Eg~~~~~~~~~f~~i~~~ 86 (330) T protein:vir:77 10 QVALTGDFSAFLTPEQSQDYFAEIEKTS--IVQRIARKVPMGPTGISIPHWTGAVS-ASWTGEAERKPITKGSFGKQELE 86 (330) T ss_pred hccccCCCcceechhHHHHHHHHHHhcc--chhhhcceeeccCCceEEEEEcCCcc-eeEecCCCccccccceeeEEEEe Confidence 2222111000 0000 122222222222 24556666664333344555544454 35765533 233333446788 Q ss_pred eecccceeeccHHHhhccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCccccCCccc---------cccc-cccccc Q lcl|NC_013594. 76 NKTFEGTVGISRDDFEDDNLGIYAPIFQEMGRSAAVQPDELIFKLLKDGFTQPCYDGQNF---------FDKE-HPVYPN 145 (305) Q Consensus 76 n~tfg~~i~i~R~~I~nDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~~~DGk~f---------F~ad-H~~~~n 145 (305) .++++..+.|||+.+.+-+..+..-+.+.++++.++.+++.++ . |... |+++ .... +..... T Consensus 87 ~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~ai~~~~~~~~l---~-G~g~----~~~~~g~~~~~~~~~~~~~~~~~~ 158 (330) T protein:vir:77 87 PVKITTIFAESAEVVRLNPLNYLNTMRTKIAEAIALKFDAAAI---H-GIDK----PSAFKGYLAETTKVVSLADTNLTT 158 (330) T ss_pred EEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhh---c-ccCC----CCccccccccccccceeecccccc Confidence 8999999999999999888999999999999999999998665 1 1110 0110 0000 000000 Q ss_pred cc---------------------ccc-----hhhhhHhhhccCCCccceeeeccc-----------ccccccchhhcccc Q lcl|NC_013594. 146 VD---------------------GTG-----SAVNTSNIVEQDSFSGLPFYLLDC-----------SRAVKPLIFQERRK 188 (305) Q Consensus 146 ~~---------------------~tg-----~~~~~s~l~~~~~~~G~~~~l~~~-----------~~~vkP~i~Q~r~~ 188 (305) .. ... +......|..-++..|.+++.-.. .+.=+|++.....+ T Consensus 159 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~l~G~PV~~~~~~p 238 (330) T protein:vir:77 159 ASGPQGNAYLAVNNALSLLVNSGKKWTGTLLDNVTEPILNTAVDGNGRPLFVESTYTEQVGAIREGRILGRPTYVADNVV 238 (330) T ss_pred cccccchhHHHHHHHHHhhhhcCCCccEEEEcHHHHHHHHHHhccCCceeecCccccccccccCCceecceeeEEecccc Confidence 00 000 000112233335566666554211 11113433222211 Q ss_pred hhhhhhhccccccchhc--ccceeeeecccccccchhhhhh--------hcccccchHHHHHHHHHHHHhhcCCCceec- Q lcl|NC_013594. 189 PELVARTRIDDDHVFMD--NEFLFGASARRAAGYGFWQMAV--------AVKGDLTLDNLWKGWQLMRAFEGDGGKKLG- 257 (305) Q Consensus 189 ~~f~a~t~~~~~nvf~~--~~~~~g~d~r~n~G~g~wq~A~--------~s~~~Lt~~~l~~ar~aM~~~k~~~G~~L~- 257 (305) ..+..+...+|.- ++++++-. .+.-.-...-++ ..........+..-..++|...-.++.++. T Consensus 239 ----~~~~~~~~~~~~gd~s~~~i~~~--~~~~i~~~~e~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~ 312 (330) T protein:vir:77 239 ----NGTVGNRVVGVMGDFSQVIWGQI--GGLSFDVTDQATLDFGEEQGGVWVPKLISLWQHNMVAVRCEAEFAFMVNDK 312 (330) T ss_pred ----CCCCCCccEEEEEecceEEEEEe--cCcEEEEeecceeeecccccccccccccchhhcCcEEEEEEEEeccEEecc Confidence 0000011112211 12222210 000000000000 000011111111111222222222222211 Q ss_pred -----cccCeEEecchhH Q lcl|NC_013594. 258 -----LKPTHIVVPVGLE 270 (305) Q Consensus 258 -----i~P~~LvVpp~le 270 (305) |....=--+|+-| T Consensus 313 ~a~~~i~~~~~~~~~~~~ 330 (330) T protein:vir:77 313 DAFVKLTDQVAGTDPEEE 330 (330) T ss_pred cceEEEEeccCCcCCCCC Confidence 1111111123333 No 42 >protein:vir:95107 Length: 270 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240822;genbank:gi:66394683;genbank:GeneID:5133901 Probab=57.41 E-value=0.43 Score=22.57 Aligned_cols=181 Identities=10% Similarity=0.052 Sum_probs=94.9 Q ss_pred CCcCH-------HHHHHHHHHHHHHHHHHHhhhhhhhhceeeecCC----cc---chhhhhhccCCCchhhh--ccceee Q lcl|NC_013594. 1 MIVTP-------ASIKALMTSWRKDFQGGLEDAPSQYNKIAMVVNS----ST---RSNTYGWLGKFPTLKEW--VGKRTI 64 (305) Q Consensus 1 M~i~~-------~~l~~L~~~~~~~~~~~~~~a~~t~~~~a~~v~S----~~---~~~~y~~Lg~~P~l~ew--~Ge~~~ 64 (305) |..|. +.+... +...+.+.. -+.++|..-++ -. ..-.|..+|+ ..+| ..+... T Consensus 1 Ma~T~~~d~I~Pev~~~~---V~e~~~~~~-----~~~~~~~~d~~L~g~~G~ti~~P~~~~igd---ae~~~eg~~i~~ 69 (270) T protein:vir:95 1 MTQTKKANLINPEVLANV---VSAQMQNAI-----RFTPYAVTDDTLVGQPGDTITRPKYAYIGA---AEDLQEGVAMDT 69 (270) T ss_pred CCceehhhhcchHHHHHH---HHHHHHhHH-----hhccccccccccCCCCCCEEEeeeecCCCc---cccccCCCccch Confidence 77653 333332 222222221 12334332111 01 1112233343 3333 245566 Q ss_pred eccccccceeeeecccceeeccHHHhhc---cccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCccccCCccccccccc Q lcl|NC_013594. 65 QQMEAHGYSIANKTFEGTVGISRDDFED---DNLGIYAPIFQEMGRSAAVQPDELIFKLLKDGFTQPCYDGQNFFDKEHP 141 (305) Q Consensus 65 ~~l~~~~~~i~n~tfg~~i~i~R~~I~n---DdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~~~DGk~fF~adH~ 141 (305) ..++-..-..+.+.+|+.++|+-.+..- |- ...+..++|.+-++..+..+.+.|+..+.. T Consensus 70 ~~lt~~~~~a~i~~~gk~~~itD~a~~~~~~dp---~~~~~~q~a~~~a~~~d~~li~~l~~a~~~-------------- 132 (270) T protein:vir:95 70 TQMSMTTTKVTVKETGKAVEVTQTAIITNVNGT---LQEASRQLAMSLADKVEIDYIAELNKSKQT-------------- 132 (270) T ss_pred hhcccchheeeeehhhCcceecHHHHhhhccch---HHHHHHHHHHHHHHHHHHHHHHHhcccccc-------------- Confidence 6777777788889999999999988763 44 455666788888888888777777641100 Q ss_pred ccccccccchhhhhHhhhccCCCccceeeecccccccccchhhcccchhhhhhhccccccchhcccceeeeecccccccc Q lcl|NC_013594. 142 VYPNVDGTGSAVNTSNIVEQDSFSGLPFYLLDCSRAVKPLIFQERRKPELVARTRIDDDHVFMDNEFLFGASARRAAGYG 221 (305) Q Consensus 142 ~~~n~~~tg~~~~~s~l~~~~~~~G~~~~l~~~~~~vkP~i~Q~r~~~~f~a~t~~~~~nvf~~~~~~~g~d~r~n~G~g 221 (305) T Consensus 133 -------------------------------------------------------------------------------- 132 (270) T protein:vir:95 133 -------------------------------------------------------------------------------- 132 (270) T ss_pred -------------------------------------------------------------------------------- Confidence Q ss_pred hhhhhhhcccccchHHHHHHHHHHHHhhcCCCceeccccCeEEecchhHHHHHHHHhhcccCCccc----ccccccccee Q lcl|NC_013594. 222 FWQMAVAVKGDLTLDNLWKGWQLMRAFEGDGGKKLGLKPTHIVVPVGLEKAAEQLLNRELFADGNT----TVSNEMKGKL 297 (305) Q Consensus 222 ~wq~A~~s~~~Lt~~~l~~ar~aM~~~k~~~G~~L~i~P~~LvVpp~le~~A~~ll~~~~~~~~~~----~~~N~~~~~~ 297 (305) ++...+.+.+..|...+ ++.+.. |+.++|+|.....-++-..-+....++. +....+.| + T Consensus 133 -------~~~~~t~~~~~dA~~~l----gd~~~~----~~~i~vhs~~~~~Lrk~~~~~~~~~~~~~~~~G~ig~~~G-~ 196 (270) T protein:vir:95 133 -------ATVSADATGILDAIEVF----NSENDE----DYVLYVNPKDYNKLVKSLFKVGGNVQDRAISKGDLVEIVG-V 196 (270) T ss_pred -------cccccCHHHHHHHHHHh----ccccCC----CcEEEEcHHHHHHHHhhhcccccccccchhcccccceecc-e Confidence 00012345555666554 333333 4679999987766554322111222221 12333444 6 Q ss_pred eEEecccC Q lcl|NC_013594. 298 QLVVADYL 305 (305) Q Consensus 298 ~~iv~p~L 305 (305) ++||.... T Consensus 197 ~Viv~s~~ 204 (270) T protein:vir:95 197 SDIVKSKR 204 (270) T ss_pred eEEEeCCC Confidence 78887766 No 43 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=56.38 E-value=0.46 Score=22.44 Aligned_cols=213 Identities=12% Similarity=0.141 Sum_probs=81.5 Q ss_pred CCcCHHHHHHHHHH-------------HHHHHHHHHhhhhhhhhceeeecCCccchhhhhhcc-CCCchhhhccceeeec Q lcl|NC_013594. 1 MIVTPASIKALMTS-------------WRKDFQGGLEDAPSQYNKIAMVVNSSTRSNTYGWLG-KFPTLKEWVGKRTIQQ 66 (305) Q Consensus 1 M~i~~~~l~~L~~~-------------~~~~~~~~~~~a~~t~~~~a~~v~S~~~~~~y~~Lg-~~P~l~ew~Ge~~~~~ 66 (305) +.........+..+ +...+.+..... ....+.++.++.+...-+|..+. ..+....|.++-.... T Consensus 123 ~~~~~~~~~~~~~~~~~~~gg~~vP~~~~~~ii~~~~~~-~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~ 201 (400) T protein:vir:38 123 RAVPTDASDAVNAGVKAADAASTIPETISNTPQRELQTV-VDLKPFTNVFQASTQKGTYPTVANATTKMVTVAELEKNPA 201 (400) T ss_pred hhhhHHHHHHHhhcccccCCcccccHHHHHHHHHHHHhh-hhhhhcceeEeccCcceEEEEEecCCCccccccccccccc Confidence 11100000000000 111111111111 12334455555433333343332 2244443332233333 Q ss_pred ccc---ccceeeeecccceeeccHHHhhccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCccccCCccccccccccc Q lcl|NC_013594. 67 MEA---HGYSIANKTFEGTVGISRDDFEDDNLGIYAPIFQEMGRSAAVQPDELIFKLLKDGFTQPCYDGQNFFDKEHPVY 143 (305) Q Consensus 67 l~~---~~~~i~n~tfg~~i~i~R~~I~nDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~~~DGk~fF~adH~~~ 143 (305) .+. ..-++..++++..+.|||+.+.|-+..+.+-+.+.++++.+...++.+..-...+ +.+ T Consensus 202 ~~~~~f~~i~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~~~~~~~~~~i~~~~~~~-------------~~~--- 265 (400) T protein:vir:38 202 MAKPEFKPVNWSVETYRQALPVSQESIDDSAIDLVGLIAQNGQQIKVNTTNGAVATLLKGF-------------TAK--- 265 (400) T ss_pred cccccceeeEeehhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhhhhccccc-------------ccc--- Confidence 322 4457788999999999999999888888888999999988888776555332211 000 Q ss_pred ccccccchhhhhHhhhccCCCccceeeecccccccccchhhcccchhhhhhhccccccchhcccceeeeecccccccchh Q lcl|NC_013594. 144 PNVDGTGSAVNTSNIVEQDSFSGLPFYLLDCSRAVKPLIFQERRKPELVARTRIDDDHVFMDNEFLFGASARRAAGYGFW 223 (305) Q Consensus 144 ~n~~~tg~~~~~s~l~~~~~~~G~~~~l~~~~~~vkP~i~Q~r~~~~f~a~t~~~~~nvf~~~~~~~g~d~r~n~G~g~w 223 (305) ...+...+.. ++. ..++| ..++. | T Consensus 266 -------~~~~~~~~~~----------~~~--~~~~~----------------------------------~~~a~---~ 289 (400) T protein:vir:38 266 -------TISSVDDLKH----------INN--VDLDP----------------------------------AYSRV---I 289 (400) T ss_pred -------ccccHHHHHH----------HHH--hhhhh----------------------------------hhCcE---E Confidence 0011111100 000 00000 00110 1 Q ss_pred hhhhhcccccchHHHHHHHHHHHHhhcCCCceeccccCeEEecchhHHHHHHHHhh-----cccCCccccccccccceee Q lcl|NC_013594. 224 QMAVAVKGDLTLDNLWKGWQLMRAFEGDGGKKLGLKPTHIVVPVGLEKAAEQLLNR-----ELFADGNTTVSNEMKGKLQ 298 (305) Q Consensus 224 q~A~~s~~~Lt~~~l~~ar~aM~~~k~~~G~~L~i~P~~LvVpp~le~~A~~ll~~-----~~~~~~~~~~~N~~~~~~~ 298 (305) -++.+ ...++++.||.+|+||- .|... ......|+.- +..+.++.+....+.|.+. T Consensus 290 --------v~~~~----~~~~l~~lkd~~G~~i~-~~~~~------~~~~~~l~G~pv~~~~~~~~~~~g~~~~~~gd~s 350 (400) T protein:vir:38 290 --------IASQS----FYNFLDTVKDGNGRYLL-QDSIL------TPSGKSVLGMPIAVVSDDTLGAAGEAHAFLGDIK 350 (400) T ss_pred --------EEcHH----HHHHHHHhhccCCCeee-ecCcC------CCCccccccceeEEecccccCCCCceEEEEEecc Confidence 11111 12345555677777652 22210 0000112221 1122222222222222211 Q ss_pred --EEecccC Q lcl|NC_013594. 299 --LVVADYL 305 (305) Q Consensus 299 --~iv~p~L 305 (305) +++..|- T Consensus 351 ~~~~~~~~~ 359 (400) T protein:vir:38 351 RAILFANRA 359 (400) T ss_pred ccEEEEeec Confidence 1111111 No 44 >protein:vir:80684 Length: 315 # NCBI annotation: gp6 # Family: family:all:966 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285582;genbank:gi:148727088;genbank:GeneID:5247055 Probab=55.74 E-value=0.45 Score=22.49 Aligned_cols=235 Identities=11% Similarity=0.021 Sum_probs=92.9 Q ss_pred CCcCHHH------HHHHHHHHHHHHHHHHhhhhhhhhceeeecCCccchhhhhhccCCCchhhhccceeeec---ccccc Q lcl|NC_013594. 1 MIVTPAS------IKALMTSWRKDFQGGLEDAPSQYNKIAMVVNSSTRSNTYGWLGKFPTLKEWVGKRTIQQ---MEAHG 71 (305) Q Consensus 1 M~i~~~~------l~~L~~~~~~~~~~~~~~a~~t~~~~a~~v~S~~~~~~y~~Lg~~P~l~ew~Ge~~~~~---l~~~~ 71 (305) |..+... ...+. ...+....+.. .-+++|++++.....-++..+..-|.- .|+||-.-.. ..=.. T Consensus 1 Ma~~~~~~gg~~vP~~~~---~~ii~~l~~~s--~i~~l~~~i~~~~~~~~ip~~~~~~~a-~wv~Eg~~~~~s~~~f~~ 74 (315) T protein:vir:80 1 MADDFLSAGKLELPGSMI---GAVRDRAIDSG--VLAKLSPEQPTIFGPVKGAVFSGVPRA-KIVGEGEVKPSASVDVSA 74 (315) T ss_pred CCCCcCCcCceEcchHHH---HHHHHHHHhhc--hhhhhcceeecCCCceEEEEEeCCcce-EEeeCCccccccccceee Confidence 6654311 11111 11222211222 256677777755444455555444442 5876654333 33344 Q ss_pred ceeeeecccceeeccHHHhhccccch----hHHHHHHHHHHHHhhHHHHHHHHHhccCCccccCCccccccccccccccc Q lcl|NC_013594. 72 YSIANKTFEGTVGISRDDFEDDNLGI----YAPIFQEMGRSAAVQPDELIFKLLKDGFTQPCYDGQNFFDKEHPVYPNVD 147 (305) Q Consensus 72 ~~i~n~tfg~~i~i~R~~I~nDdlG~----~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~~~DGk~fF~adH~~~~n~~ 147 (305) -+++-++++..+.||++.+.++.... ..-+.+.++++.++.+++.++. |.++. .|+++ . T Consensus 75 v~l~~~kl~~~~~iS~ell~~s~~~~~~~l~~~i~~~la~ai~~~~d~a~~~----G~~~~--~~~~~-----------~ 137 (315) T protein:vir:80 75 FTAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFH----GIDPA--TGKAA-----------S 137 (315) T ss_pred eEeeeeeEEeeehhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHhhheee----ccCCC--CCccc-----------c Confidence 56778999999999999887766653 3556777788777776654442 21110 01100 0 Q ss_pred ccchhhhhHhhhccCCCccceeeecccccccccchhhcccchhhhhhhccccccchhcccceeeeecccccccchhhhhh Q lcl|NC_013594. 148 GTGSAVNTSNIVEQDSFSGLPFYLLDCSRAVKPLIFQERRKPELVARTRIDDDHVFMDNEFLFGASARRAAGYGFWQMAV 227 (305) Q Consensus 148 ~tg~~~~~s~l~~~~~~~G~~~~l~~~~~~vkP~i~Q~r~~~~f~a~t~~~~~nvf~~~~~~~g~d~r~n~G~g~wq~A~ 227 (305) +...... .-.......+..+..+.. +. ...... +.+.+.+ |-| T Consensus 138 ~~~~~~~--~~~~~~~~~~~~~~d~~~----------------~~--~~~~~~------------~~~~~~~---~im-- 180 (315) T protein:vir:80 138 AVHTSLN--KTKNIVDATDSATADLVK----------------AV--GLIAGA------------GLQVPNG---VAL-- 180 (315) T ss_pred ccccccc--cccceeeccccchHHHHH----------------HH--HHHhhc------------cCccceE---EEE-- Confidence 0000000 000000011111100000 00 000000 1111111 111 Q ss_pred hcccccchHHHHHHHHHHHHhhcCCCceeccccCeEEec--chhHHHHHHHHhhcccCCcccc-cc-c--cccc------ Q lcl|NC_013594. 228 AVKGDLTLDNLWKGWQLMRAFEGDGGKKLGLKPTHIVVP--VGLEKAAEQLLNRELFADGNTT-VS-N--EMKG------ 295 (305) Q Consensus 228 ~s~~~Lt~~~l~~ar~aM~~~k~~~G~~L~i~P~~LvVp--p~le~~A~~ll~~~~~~~~~~~-~~-N--~~~~------ 295 (305) +. +.+.++++.|+.+|++++-.|-+--+. .......+-++-++.++.+... .. . .+.| T Consensus 181 ------n~----~~~~~L~~l~~~~g~~~~g~~~~~~~~~g~~~tl~G~PV~~~~~~~~~~~~~~~~~~~~~~GDfs~~~ 250 (315) T protein:vir:80 181 ------DP----AFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMSPASGVKAIVGDFSRVH 250 (315) T ss_pred ------cH----HHHHHHHHHhhccCCcccccccccccccCCCceecceeeEecCcCCcccccccccccEEEEeecccEE Confidence 11 234666777777777665444221000 0001112223333334432110 00 0 1212 Q ss_pred -----eeeEEecccC Q lcl|NC_013594. 296 -----KLQLVVADYL 305 (305) Q Consensus 296 -----~~~~iv~p~L 305 (305) .+++-+.++- T Consensus 251 ~g~~~~~~i~i~~~~ 265 (315) T protein:vir:80 251 WGFQRNFPIELIEYG 265 (315) T ss_pred EEEecCeeEEEeccc Confidence 2233333333 No 45 >protein:vir:99749 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004307;genbank:gi:122891761;genbank:GeneID:4712304 Probab=55.06 E-value=0.49 Score=22.29 Aligned_cols=225 Identities=9% Similarity=0.085 Sum_probs=90.8 Q ss_pred CCcCHHHHHHHHHHHHHHHHHHHhhhhhhhhceeeecCCccchhhhhhccCCCchhhhccce---eeeccccccceeeee Q lcl|NC_013594. 1 MIVTPASIKALMTSWRKDFQGGLEDAPSQYNKIAMVVNSSTRSNTYGWLGKFPTLKEWVGKR---TIQQMEAHGYSIANK 77 (305) Q Consensus 1 M~i~~~~l~~L~~~~~~~~~~~~~~a~~t~~~~a~~v~S~~~~~~y~~Lg~~P~l~ew~Ge~---~~~~l~~~~~~i~n~ 77 (305) |.-+...- -|-..+...+.+...+.. ...++|+.++.......|..+..-|. -+|++|- ...++.=..-++..+ T Consensus 30 ~~~~~~~~-lip~~~~~~ii~~~~~~s-~l~~~~~~~~~~~~~~~~p~~~~~~~-a~~v~Eg~~~~~~~~~~~~v~~~~~ 106 (324) T protein:vir:99 30 MMHEKKDG-TLLNDFTTPILQEVMENS-KIMRLGKYEPMEGTEKKFTFWADKPG-AYWVGEGQKIETSKATWVNATMRAF 106 (324) T ss_pred eccCCCcc-eechhHHHHHHHHHHhhc-hhhhhcceeeccCCceEEEEEecCcc-eeEeccCccccccccceeEEEEeeE Confidence 11111000 000111111122222211 14455666664444344444433344 3676553 333344455677889 Q ss_pred cccceeeccHHHhhccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCccccCCcccccccccccccccccchhhhhHh Q lcl|NC_013594. 78 TFEGTVGISRDDFEDDNLGIYAPIFQEMGRSAAVQPDELIFKLLKDGFTQPCYDGQNFFDKEHPVYPNVDGTGSAVNTSN 157 (305) Q Consensus 78 tfg~~i~i~R~~I~nDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~~~DGk~fF~adH~~~~n~~~tg~~~~~s~ 157 (305) +++..+.|||+.+.|-...+..-+.+.++++.++.+++.++. + .+.. ..+..+..+-.. .+ ..+....+... T Consensus 107 k~~~~~~iS~ell~ds~~~l~~~i~~~l~~ai~~~~d~~~l~---G-~g~~-~~~~~~~~~~~~--~~-~~~~~~~~~~~ 178 (324) T protein:vir:99 107 KLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGIL---N-QGNN-PFGKSIAQSIEK--TN-KVIKGDFTQDN 178 (324) T ss_pred EEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhh---c-CCCC-ccCccccccccc--cc-eeccccCCHHH Confidence 999999999999998889999999999999999999886642 2 1110 111111111000 00 00000000000 Q ss_pred hhccCCCccceeeecccccccccchhhcccchhhhhhhccccccchhcccceeeeecccccccchhhhhhhcccccchHH Q lcl|NC_013594. 158 IVEQDSFSGLPFYLLDCSRAVKPLIFQERRKPELVARTRIDDDHVFMDNEFLFGASARRAAGYGFWQMAVAVKGDLTLDN 237 (305) Q Consensus 158 l~~~~~~~G~~~~l~~~~~~vkP~i~Q~r~~~~f~a~t~~~~~nvf~~~~~~~g~d~r~n~G~g~wq~A~~s~~~Lt~~~ 237 (305) |++..-.|++ . + +.+.+ | -++... T Consensus 179 -------------i~~~~~~l~~--------------------~-----------~-~~~~~---~--------v~n~~~ 202 (324) T protein:vir:99 179 -------------IIDLEALLED--------------------D-----------E-LEANA---F--------ISKTQN 202 (324) T ss_pred -------------HHHHHHhhhh--------------------c-----------c-CCCCE---E--------EEcHHH Confidence 0000000000 0 0 11111 1 122222 Q ss_pred HHHHHHHHHHhhcCCCceec--cccCeEE-ecchhHHHHHHHHhhcccCCcccc-----ccccc----cc-eeeEEeccc Q lcl|NC_013594. 238 LWKGWQLMRAFEGDGGKKLG--LKPTHIV-VPVGLEKAAEQLLNRELFADGNTT-----VSNEM----KG-KLQLVVADY 304 (305) Q Consensus 238 l~~ar~aM~~~k~~~G~~L~--i~P~~Lv-Vpp~le~~A~~ll~~~~~~~~~~~-----~~N~~----~~-~~~~iv~p~ 304 (305) +..+++.||.+|+++- -.|..|. +|. .-+...+.+... -.+-+ .+ .+++.-++. T Consensus 203 ----~~~L~~l~d~~g~~~~~~~~~~~l~G~PV---------v~~~~~~~~~~~~i~gd~~~~~~~~~~~~~i~~~~~~~ 269 (324) T protein:vir:99 203 ----RSLLRKIVDPETKERIYDRNSDTLDGLPV---------VNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQ 269 (324) T ss_pred ----HHHHHHhhcCCCceeecCCCCccccceeE---------EeecCCCCCcceEEEEecccEEEEEecCcEEEEeeccc Confidence 2355678999998752 2222221 211 111111111100 00111 11 122222222 Q ss_pred C Q lcl|NC_013594. 305 L 305 (305) Q Consensus 305 L 305 (305) + T Consensus 270 ~ 270 (324) T protein:vir:99 270 L 270 (324) T ss_pred c Confidence 2 No 46 >protein:vir:9574 Length: 300 # NCBI annotation: gp40 # Family: family:all:966 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862879;genbank:gi:32469471;genbank:GeneID:1461316 Probab=54.61 E-value=0.5 Score=22.24 Aligned_cols=244 Identities=11% Similarity=0.025 Sum_probs=96.1 Q ss_pred CCcCHHHHHHH-HHHH-HHHHHHHHhhhhhhhhceeeecCCccchhhhhhccCCCchhhhccceeeecccc---ccceee Q lcl|NC_013594. 1 MIVTPASIKAL-MTSW-RKDFQGGLEDAPSQYNKIAMVVNSSTRSNTYGWLGKFPTLKEWVGKRTIQQMEA---HGYSIA 75 (305) Q Consensus 1 M~i~~~~l~~L-~~~~-~~~~~~~~~~a~~t~~~~a~~v~S~~~~~~y~~Lg~~P~l~ew~Ge~~~~~l~~---~~~~i~ 75 (305) |+-+...=-.| -..+ ........+. +.-.++|+.++-.....+|..+-.-|. -.|+||-.-...++ ..-+++ T Consensus 1 ma~~t~~~G~lip~~~~~~ii~~l~~~--s~i~~l~~~~~~~~~~~~~p~~~~~~~-a~wv~Eg~~~~~s~~~f~~v~l~ 77 (300) T protein:vir:95 1 MSEAQLSKGNLFNPELVTKVINKVKGH--SSIAKLSPQKPIPFNGQREFVFDFDSD-IDIVAENGKKTHGGVSLDPVTIV 77 (300) T ss_pred CcccccCCcceechhhHHHHHHHHHhh--hhhhhhcceeeccCCceEEEEEecCcc-eEEeeCCcccccccccceeeEee Confidence 66554321000 0001 1111111111 123455655553322333444333343 36776654333222 345667 Q ss_pred eecccceeeccHHHh---hccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCC-ccccCCcccccccccccccccccch Q lcl|NC_013594. 76 NKTFEGTVGISRDDF---EDDNLGIYAPIFQEMGRSAAVQPDELIFKLLKDGFT-QPCYDGQNFFDKEHPVYPNVDGTGS 151 (305) Q Consensus 76 n~tfg~~i~i~R~~I---~nDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~-~~~~DGk~fF~adH~~~~n~~~tg~ 151 (305) -++++..+.||++.+ .+|..++.+-+...++++.++.+++.++.=..++.. +.-..|...+++-........++.. T Consensus 78 ~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~l~~aia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (300) T protein:vir:95 78 PLKVEYGARVSDEFLHASEEAKVDMLTDFVEGFSKKLARGLDIMSIHGINPRTKQASTIIGDNCFDKKVTQTVPFKDTNP 157 (300) T ss_pred eEEEEEeehhhHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCCCCCcccccccccccccceeecccccch Confidence 789999999999998 567799999999999999999999888743211100 0000111111111100000000000 Q ss_pred hhhh------------------------HhhhccCCCccceeeecc------cccccccchhhcccchhhhhhhcccccc Q lcl|NC_013594. 152 AVNT------------------------SNIVEQDSFSGLPFYLLD------CSRAVKPLIFQERRKPELVARTRIDDDH 201 (305) Q Consensus 152 ~~~~------------------------s~l~~~~~~~G~~~~l~~------~~~~vkP~i~Q~r~~~~f~a~t~~~~~n 201 (305) ...+ ..|..-++..|.+++--+ ..+.=.|+++-+..+ ......+.- T Consensus 158 ~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~l~G~Pv~~s~~v~----~~~~~~~~~ 233 (300) T protein:vir:95 158 DESMEDAVGMIDGSERDITGAILDPIFTTALSKMKNAEGGKLYPELAWGGVPDAINGLAVDKNRTVS----YSQTDPKNT 233 (300) T ss_pred HHHHHHHHHHhhhcCCCccEEEECHHHHHHHHHhhccCCCeeccCccccCCCceecceeeEEecCCC----CCCCCCccE Confidence 0000 112222455555544211 111112322221110 000000111 Q ss_pred chhc---ccceeeeec-----------ccccccchhh---hhhhcccccchHHHHHHHHHHHHhhcCCC Q lcl|NC_013594. 202 VFMD---NEFLFGASA-----------RRAAGYGFWQ---MAVAVKGDLTLDNLWKGWQLMRAFEGDGG 253 (305) Q Consensus 202 vf~~---~~~~~g~d~-----------r~n~G~g~wq---~A~~s~~~Lt~~~l~~ar~aM~~~k~~~G 253 (305) +|.- +-+.+|+.. .-+.+..+|| .++....-++..-.. -.|+...|+..| T Consensus 234 ~~~GDf~~~~~~~~~~~~~~~v~~~~~~d~~~~~~f~~~~v~~r~~~r~d~~v~~--~~a~~~l~~~~g 300 (300) T protein:vir:95 234 AIVGDFETMFKWGYAKEVPMEIIKYGDPDNSGRDLKGYNQIYIRCEAYIGWGIMD--AASFARIVKTGG 300 (300) T ss_pred EEEeeccceEEEEEecccEEEEeeccCCCCcchhhhhcCcEEEEEEEeecceeec--ccceEEEecCCC Confidence 2211 112233211 1122333444 222111111111000 145556677777 No 47 >protein:vir:104085 Length: 320 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655596;genbank:gi:109392467;genbank:GeneID:4156953 Probab=53.43 E-value=0.38 Score=22.86 Aligned_cols=249 Identities=14% Similarity=0.121 Sum_probs=91.7 Q ss_pred CCcCHHHHHH---------HHHHH-HHHHHHHHhhhhhhhhceeeecCCccchhhhhhccCCCchhhhccce---eeecc Q lcl|NC_013594. 1 MIVTPASIKA---------LMTSW-RKDFQGGLEDAPSQYNKIAMVVNSSTRSNTYGWLGKFPTLKEWVGKR---TIQQM 67 (305) Q Consensus 1 M~i~~~~l~~---------L~~~~-~~~~~~~~~~a~~t~~~~a~~v~S~~~~~~y~~Lg~~P~l~ew~Ge~---~~~~l 67 (305) |......+.. |-... +..++...+..+ -.++|++++-....-+|.....-|.. .|++|. ...++ T Consensus 7 ~~~~~~~~~~t~~~~~~~~ip~~~~~~ii~~~~~~s~--l~~~~~~~~~~~~~~~~p~~~~~~~a-~~v~E~~~~~~~~~ 83 (320) T protein:vir:10 7 FQVDHAQIAQTGDTMFKGYLEPEQAKDYFAEAEKTSI--VQQFAQKVPMGTTGQKIPHWIGDVSA-QWIGEGDMKPITKG 83 (320) T ss_pred CCHHHHHhhccccccccccccHHHHHHHHHHHHhccc--hhhhcceeeccCCceEEEEEeCCcce-EEecCCcccccccc Confidence 2111111110 01111 222222222222 45566666643333444444444553 576553 33333 Q ss_pred ccccceeeeecccceeeccHHHhhccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCccccCCccccccccccc-ccc Q lcl|NC_013594. 68 EAHGYSIANKTFEGTVGISRDDFEDDNLGIYAPIFQEMGRSAAVQPDELIFKLLKDGFTQPCYDGQNFFDKEHPVY-PNV 146 (305) Q Consensus 68 ~~~~~~i~n~tfg~~i~i~R~~I~nDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~~~DGk~fF~adH~~~-~n~ 146 (305) .=..-+++.++++..+.|||+.+.+-...+..-+.+.++++.++.+++.++ .+ .... .+..+-+.-+.+. ... T Consensus 84 ~f~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~l~~a~a~~~d~a~l---~G-~g~~--~~~~~~~~~~~~~~~~~ 157 (320) T protein:vir:10 84 NMTSQNIAPHKIATIFVASAETVRANPANYLGTMRTKVATAFAMAFDSAAL---NG-TDSP--FPTYLAQTTKSVSLADP 157 (320) T ss_pred ceeEEEEeeEEEEEeehhhHHHHhcChHHHHHHHHHHHHHHHHHHHHHHhh---cc-cCCC--CCcccccccccccceec Confidence 334567888999999999999999988999999999999999999998864 22 1100 0111111111100 000 Q ss_pred ccc-chhh------h-----------------------hHhhhccCCCccceeeecccccccccchhhccc---chhhhh Q lcl|NC_013594. 147 DGT-GSAV------N-----------------------TSNIVEQDSFSGLPFYLLDCSRAVKPLIFQERR---KPELVA 193 (305) Q Consensus 147 ~~t-g~~~------~-----------------------~s~l~~~~~~~G~~~~l~~~~~~vkP~i~Q~r~---~~~f~a 193 (305) .+. .... . ...|..-++..|.+++.-... .-.|..++.++ .|..++ T Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~-~~~~~~~~~~~i~g~pv~~~ 236 (320) T protein:vir:10 158 GGATASDLTAYDAVAVNGLSLLVNAKKKWTHTLLDDIVEPILNGAKDKNGRPLFIESTY-TDENSPFRAGRIVSRPTILS 236 (320) T ss_pred ccccccccccHHHHHHHHHhhhhcccCCCcEEEEcHHHHHHHHHhhccCCceeeccccc-cCccccccCceeeeeeeEec Confidence 000 0000 0 001111223333333221100 00000000000 011111 Q ss_pred hhccccccchhcccceeeeecccccccchhhhh-h-----hc--cccc----chHHHHHHHHHHHHhhcCCCceeccccC Q lcl|NC_013594. 194 RTRIDDDHVFMDNEFLFGASARRAAGYGFWQMA-V-----AV--KGDL----TLDNLWKGWQLMRAFEGDGGKKLGLKPT 261 (305) Q Consensus 194 ~t~~~~~nvf~~~~~~~g~d~r~n~G~g~wq~A-~-----~s--~~~L----t~~~l~~ar~aM~~~k~~~G~~L~i~P~ 261 (305) ..-|.+..+ .+-+|.... -+|.|+-. + .+ .... ...-+..-..++|.....++.+ ++|+ T Consensus 237 ~~~~~~~~~------~~~gd~~~~-~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~~~d~~v--~~~~ 307 (320) T protein:vir:10 237 DHVADGTTV------GYMGDFRNV-IWGQVGGLSFDVTDQATLNLGTPTEPNFVSLWQHNLVAVRVEAEYAFHN--NDKD 307 (320) T ss_pred CCCCCCceE------EEEeecceE-EEEEecCeEEEEeecceeeeccccccccchhhhcCcEEEEEEEeeccEE--eccc Confidence 111111111 111121111 12222111 0 00 0000 0011111222222222233333 2222 Q ss_pred eE------Eecch Q lcl|NC_013594. 262 HI------VVPVG 268 (305) Q Consensus 262 ~L------vVpp~ 268 (305) -+ ..|++ T Consensus 308 a~~~l~~~~ap~~ 320 (320) T protein:vir:10 308 AFVKLTNVVTPDA 320 (320) T ss_pred ceEEEEeccCCCC Confidence 11 22344 No 48 >protein:vir:7990 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817344;genbank:gi:29565772;genbank:GeneID:1258978 Probab=52.17 E-value=0.56 Score=21.96 Aligned_cols=199 Identities=12% Similarity=0.065 Sum_probs=88.7 Q ss_pred CCcCHHHHHHHHHHHHHHHHHHHhhhhhhhhceeeecCCccchhhhhhccCCCchhhhc---cceeeeccccccceeeee Q lcl|NC_013594. 1 MIVTPASIKALMTSWRKDFQGGLEDAPSQYNKIAMVVNSSTRSNTYGWLGKFPTLKEWV---GKRTIQQMEAHGYSIANK 77 (305) Q Consensus 1 M~i~~~~l~~L~~~~~~~~~~~~~~a~~t~~~~a~~v~S~~~~~~y~~Lg~~P~l~ew~---Ge~~~~~l~~~~~~i~n~ 77 (305) |.++.-.-+-....+-+.|++..--++-..+.. ........+-+...+|. +...+.. +......+.++..+++.. T Consensus 1 MA~~~~~pei~~~~v~~~~~~~lv~~~l~~~~~-~~~~~~GdTv~ip~~~~-~~~~d~~~~~~~~~~~~~~~~~~~~tid 78 (273) T protein:vir:79 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREY-EGIASKGNVVHIAGVVA-PTVKDYKAAGRQTSADAISDTGVDLLID 78 (273) T ss_pred CcchhhhHHHHHHHHHHHHHhhccchhhhhccc-cccccCCcEEEEeecCc-ccccccccCCCccCccccccceEEEEEe Confidence 999742222222223444444332211111111 00111111222222232 2222222 223445677777777775 Q ss_pred cc-cceeeccHHHhhccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCccccCCcccccccccccccccccchhhhhH Q lcl|NC_013594. 78 TF-EGTVGISRDDFEDDNLGIYAPIFQEMGRSAAVQPDELIFKLLKDGFTQPCYDGQNFFDKEHPVYPNVDGTGSAVNTS 156 (305) Q Consensus 78 tf-g~~i~i~R~~I~nDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~~~DGk~fF~adH~~~~n~~~tg~~~~~s 156 (305) ++ +..+.|+..+-.-+... +..+.+.++++-++-.|..+.+++..+.+. +. + +. T Consensus 79 ~~~~~~~~i~d~d~~~~~~~-~~~~~~~~~~ala~~vD~~i~~~~~~a~~~-----------------~~-~-~~----- 133 (273) T protein:vir:79 79 QEKSIDFLVDDIDRVQVAGS-LEAYTRAGATALATDTDKFIADMLVDNGTA-----------------LT-G-SA----- 133 (273) T ss_pred eecccceeeccHHHHhhccc-HHHHHHHHHHHHHHHHHHHHHHHHhhcccc-----------------cc-c-cc----- Confidence 53 66777776554443333 467788888888888899888888652110 00 0 00 Q ss_pred hhhccCCCccceeeecccccccccchhhcccchhhhhhhccccccchhcccceeeeecccccccchhhhhhhcccccchH Q lcl|NC_013594. 157 NIVEQDSFSGLPFYLLDCSRAVKPLIFQERRKPELVARTRIDDDHVFMDNEFLFGASARRAAGYGFWQMAVAVKGDLTLD 236 (305) Q Consensus 157 ~l~~~~~~~G~~~~l~~~~~~vkP~i~Q~r~~~~f~a~t~~~~~nvf~~~~~~~g~d~r~n~G~g~wq~A~~s~~~Lt~~ 236 (305) +. . .....+ T Consensus 134 -----------------------~~--------------~----------------------------------~~~~~~ 142 (273) T protein:vir:79 134 -----------------------PS--------------D----------------------------------ADDAFD 142 (273) T ss_pred -----------------------cc--------------c----------------------------------hhhHHH Confidence 00 0 000112 Q ss_pred HHHHHHHHHHHhhcCCCceeccccCeEEecchhHHHHHH----HHhhcccCCccccccc----cccceeeEEecccC Q lcl|NC_013594. 237 NLWKGWQLMRAFEGDGGKKLGLKPTHIVVPVGLEKAAEQ----LLNRELFADGNTTVSN----EMKGKLQLVVADYL 305 (305) Q Consensus 237 ~l~~ar~aM~~~k~~~G~~L~i~P~~LvVpp~le~~A~~----ll~~~~~~~~~~~~~N----~~~~~~~~iv~p~L 305 (305) .+-.|+++|. +.+-|- ..++|||+|.....-++ +.++.. .++.....| -+.| ++++.++.| T Consensus 143 ~i~~a~~~ld----~~~vP~--~~R~lvv~p~~~~~Ll~~~~~~~~~~~-~~~~~~l~~G~ig~~~G-~~i~~s~~l 211 (273) T protein:vir:79 143 LIASALKELT----KANVPN--VGRVVVVNAEMAFWLRSSGSKLTSADT-SGDAAGLRAGTIGNLLG-ARIVESNNL 211 (273) T ss_pred HHHHHHHHhh----hccCCc--cCcEEEECHHHHHHHhhchhhhhhhhh-cccccceeeeEeeEEec-eEEEecccc Confidence 3444444443 333232 23588999876654322 111111 111110111 1333 788888888 No 49 >protein:vir:1383 Length: 421 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612835;genbank:gi:20065969;genbank:GeneID:935826 Probab=49.57 E-value=0.48 Score=22.34 Aligned_cols=276 Identities=12% Similarity=0.091 Sum_probs=105.3 Q ss_pred CCcCHHHHHHH----------------HH----------HHHHHHHHHHhhhhhhhhceeeecCCccchhhhhhccCCCc Q lcl|NC_013594. 1 MIVTPASIKAL----------------MT----------SWRKDFQGGLEDAPSQYNKIAMVVNSSTRSNTYGWLGKFPT 54 (305) Q Consensus 1 M~i~~~~l~~L----------------~~----------~~~~~~~~~~~~a~~t~~~~a~~v~S~~~~~~y~~Lg~~P~ 54 (305) +.-......++ .+ .+...+.+..... ..-.++|+.++-.+....|......+ T Consensus 90 ~~~~~~~~~~~~~~~~~~~~~~~~ra~~t~~~gg~liP~~~~~~Ii~~~~~~-~~l~~l~~~~~~~~~~~~~~~~~~~~- 167 (421) T protein:vir:13 90 EEKRSLQLSAMSKTIRGIQLSEEERDIMSSTNNGAVIPQEFVNEFEKLKEGY-PSLKEHCHVIPVNRNAGKMPVRAGAS- 167 (421) T ss_pred HHHHHHHHHHHHHhhhccchhHHHhhccccCCcceecchhhHHHHHHHHHhh-hhhhhhceeeeccCCceEEEEeecCC- Confidence 00000000000 00 0111111111111 11234455555333333333222211 Q ss_pred hhhh--ccceeeec---cccccceeeeecccceeeccHHHhhccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCc-- Q lcl|NC_013594. 55 LKEW--VGKRTIQQ---MEAHGYSIANKTFEGTVGISRDDFEDDNLGIYAPIFQEMGRSAAVQPDELIFKLLKDGFTQ-- 127 (305) Q Consensus 55 l~ew--~Ge~~~~~---l~~~~~~i~n~tfg~~i~i~R~~I~nDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~-- 127 (305) ..+| ++|..-.. +.=..-+++.++++..+.||++.+.|-...+..-+.+.++++...+.+..+...+++..+. T Consensus 168 ~~~~~~~~E~~~~~~s~~~f~~i~~~~~k~~~~v~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~~~~g~~~~~~ 247 (421) T protein:vir:13 168 VDKLANLAKDTELVKAMLKTQPMAYDIDDYGLLAPIDNSLLEDSEINFLEFVNEEFAEFAVNTENAEIVKQAKAVLAEET 247 (421) T ss_pred ccceeeccccccccccccceeEEEeeeeeeEeehhhhHHHHhhhHHHHHHHHHHHHHHHHHHHhhhhHhhhhhhcccccc Confidence 1222 44443322 2223356778999999999999998888888999999999999999998877665432211 Q ss_pred -cccCCc-ccccccccccccccc-cchhhhhHhhhccCCCccceeeeccccc------ccccchhhcccchhhhhhhccc Q lcl|NC_013594. 128 -PCYDGQ-NFFDKEHPVYPNVDG-TGSAVNTSNIVEQDSFSGLPFYLLDCSR------AVKPLIFQERRKPELVARTRID 198 (305) Q Consensus 128 -~~~DGk-~fF~adH~~~~n~~~-tg~~~~~s~l~~~~~~~G~~~~l~~~~~------~vkP~i~Q~r~~~~f~a~t~~~ 198 (305) ..||.- -++..=++.+...++ --+......|..-++..|.+++- |... .=.|++.-.. ...++ T Consensus 248 ~~~~d~i~~~~~~l~~~~~~~a~~v~n~~~~~~l~~lkd~~G~~i~~-~~~~~~~~tl~G~pV~~~~~-------~~~~~ 319 (421) T protein:vir:13 248 INDYAGLVKTINSLVPNARKRAIIVTNSDGRAYLDGLMDKQGRPLLK-ELSDGGDLVFKGRPVIELEE-------SIFDV 319 (421) T ss_pred ccchHHHHHHHHHhhhhhcCCCEEEEcHHHHHHHHHhhcCCCceeec-CcCCCCCceecceeeEEecc-------ccccC Confidence 111110 001111110000000 00001112233334555655442 2111 1122221110 00000 Q ss_pred cccchhcccceeeeecccccccchhhh-hhhcccccchHHHHHHHHHHHHhhcCCCceeccccCeEEecchhHHH-HHH- Q lcl|NC_013594. 199 DDHVFMDNEFLFGASARRAAGYGFWQM-AVAVKGDLTLDNLWKGWQLMRAFEGDGGKKLGLKPTHIVVPVGLEKA-AEQ- 275 (305) Q Consensus 199 ~~nvf~~~~~~~g~d~r~n~G~g~wq~-A~~s~~~Lt~~~l~~ar~aM~~~k~~~G~~L~i~P~~LvVpp~le~~-A~~- 275 (305) .+ ...++||- |++. .++....++++...... -.++.-+=..-.+-++.++-+ +.. +.. T Consensus 320 ~~----~~~~~~gd---------~~~~~~~~~~~~~~v~~~~~~~----f~~~~~~~r~~~r~d~~~~~~--~a~~~~~~ 380 (421) T protein:vir:13 320 GD----ETKFIVSD---------FKTLIKFMDRKQYLIDQSKEAG----YTKNETIARIIERFDVNSPLD--KSSDAEKI 380 (421) T ss_pred CC----ceEEEEEe---------ccccEEEEEecceEEEeecccc----cccCeeEEEEEeeecceeecc--hhhheeee Confidence 00 01112211 1111 11223334443332221 001000000111111111111 110 000 Q ss_pred -----HHh-hcccCCccccccccccceeeEEecccC Q lcl|NC_013594. 276 -----LLN-RELFADGNTTVSNEMKGKLQLVVADYL 305 (305) Q Consensus 276 -----ll~-~~~~~~~~~~~~N~~~~~~~~iv~p~L 305 (305) ++. .+..+++.+...+|--|+-++-++|.- T Consensus 381 ~~~~a~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 416 (421) T protein:vir:13 381 RKFGVIVKLQEVLKSSPRSGKNKNESKEEIKEEGEA 416 (421) T ss_pred cccceeeccccccCCCCcCCCCccccchheeecccc Confidence 011 122233334557889999999999998 No 50 >protein:vir:9704 Length: 394 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795466;genbank:gi:28876225;genbank:GeneID:1257769 Probab=48.91 E-value=0.21 Score=24.26 Aligned_cols=243 Identities=15% Similarity=0.076 Sum_probs=88.3 Q ss_pred CCcCHHHHHHHHHHHHHHHHHHHhhhhhhhhceeeecCCccchhhhhhccCCCchhhhccceee----eccccccceeee Q lcl|NC_013594. 1 MIVTPASIKALMTSWRKDFQGGLEDAPSQYNKIAMVVNSSTRSNTYGWLGKFPTLKEWVGKRTI----QQMEAHGYSIAN 76 (305) Q Consensus 1 M~i~~~~l~~L~~~~~~~~~~~~~~a~~t~~~~a~~v~S~~~~~~y~~Lg~~P~l~ew~Ge~~~----~~l~~~~~~i~n 76 (305) ..+-|..+ ...+....... .....+|+.++-......|..+..-..=-.|++|-.- .++.=..-++.. T Consensus 137 g~liP~~~-------~~~ii~~~~~~-~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~l~~ 208 (394) T protein:vir:97 137 KPVSSEEI-------LYTPAREVKTV-VDLKPFTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNI 208 (394) T ss_pred cccChHHH-------HHHHHHHhhhh-hhhhhhceeeeccCcceEEEEEecCCCccceecccccccccccccceeEEeeh Confidence 11111111 01111111111 1134456665543333334333222112246655322 223335567888 Q ss_pred ecccceeeccHHHhhccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCcc--ccCCc-ccccccccccccccccchhh Q lcl|NC_013594. 77 KTFEGTVGISRDDFEDDNLGIYAPIFQEMGRSAAVQPDELIFKLLKDGFTQP--CYDGQ-NFFDKEHPVYPNVDGTGSAV 153 (305) Q Consensus 77 ~tfg~~i~i~R~~I~nDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~--~~DGk-~fF~adH~~~~n~~~tg~~~ 153 (305) ++++..+.|||+.+.|-..++..-+...++++.++..++.+..-+.++.... -||+- ..+...++...+..---+.. T Consensus 209 ~k~~~~i~is~ell~ds~~~~~~~i~~~la~~~~~~~~~~i~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~n~~ 288 (394) T protein:vir:97 209 DTYRGAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVLKSFTTKTVKNLDEIKALLNGGFDPAYNVSLIVSQS 288 (394) T ss_pred hheeeehhhHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccHHHHHHHHHhhhhhhhCCEEEEcHH Confidence 9999999999999998888888889999999999998877655443322110 01110 00111110000000000000 Q ss_pred hhHhhhccCCCccceeeecccc------cccccchhhcccchhhhhhhccccccchhcccceeeeecccccccchhhhhh Q lcl|NC_013594. 154 NTSNIVEQDSFSGLPFYLLDCS------RAVKPLIFQERRKPELVARTRIDDDHVFMDNEFLFGASARRAAGYGFWQMAV 227 (305) Q Consensus 154 ~~s~l~~~~~~~G~~~~l~~~~------~~vkP~i~Q~r~~~~f~a~t~~~~~nvf~~~~~~~g~d~r~n~G~g~wq~A~ 227 (305) ....|..-++..|.+++.-+.. +.=+|+++- .+...++ +.++|| |.+.. |. . T Consensus 289 ~~~~l~~lkd~~G~~i~~~~~~~~~~~~l~G~pv~~~-------~~~~~~~-------~~~~~g-d~~~~--~~-----~ 346 (394) T protein:vir:97 289 FYQTLDTLKDGNGRYLLQDDITAVSGKVLLGKPVFVL-------SDEVLGA-------NKAFIG-DFKRG--VL-----F 346 (394) T ss_pred HHHHHHHhhccCCCeeeecCcCCCCCceeccceeEEe-------cccccCC-------ccEEEe-ecccc--EE-----E Confidence 1112222345556665433211 111232210 0001111 112221 11100 00 0 Q ss_pred hcccccchHHHHHH--HHHHHHhhcCCCceeccccCeEEecchhHHHHHHHHhhcccCCccccccccccceeeEEeccc Q lcl|NC_013594. 228 AVKGDLTLDNLWKG--WQLMRAFEGDGGKKLGLKPTHIVVPVGLEKAAEQLLNRELFADGNTTVSNEMKGKLQLVVADY 304 (305) Q Consensus 228 ~s~~~Lt~~~l~~a--r~aM~~~k~~~G~~L~i~P~~LvVpp~le~~A~~ll~~~~~~~~~~~~~N~~~~~~~~iv~p~ 304 (305) +....++++..... -++++...-.+|.++. |+ -....+++....|. T Consensus 347 ~~~~~~~~~~~~~~~~~~~~~~~~r~d~~v~~--~~-----------------------------a~~~~~~~~~~~p~ 394 (394) T protein:vir:97 347 ADRKDLGLRWADNEIYGQYLQAVLRFGVSKVD--DK-----------------------------AGYYVTFTPEPLPL 394 (394) T ss_pred EEecceEEEEecccccceeEEEEEEEccEEec--cc-----------------------------ceEEEEecccccCC Confidence 11111111110000 0001111111111110 10 11122344444444 No 51 >protein:vir:8324 Length: 410 # NCBI annotation: gp41 # Family: family:all:30827 # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817892;genbank:gi:29566325;genbank:GeneID:1259520 Probab=47.84 E-value=0.48 Score=22.31 Aligned_cols=216 Identities=14% Similarity=0.157 Sum_probs=83.4 Q ss_pred CCcCHHHHHHHHHHH--HHHHHHHHhhhhhhhhcee-eecCCccchhhhhhccCCCchhhhccceeeeccccccceeeee Q lcl|NC_013594. 1 MIVTPASIKALMTSW--RKDFQGGLEDAPSQYNKIA-MVVNSSTRSNTYGWLGKFPTLKEWVGKRTIQQMEAHGYSIANK 77 (305) Q Consensus 1 M~i~~~~l~~L~~~~--~~~~~~~~~~a~~t~~~~a-~~v~S~~~~~~y~~Lg~~P~l~ew~Ge~~~~~l~~~~~~i~n~ 77 (305) -.|++..+..+-.=+ ++.+..-|...|.+=..+- ..+.++..+..|.-.|. .=+| .++-.++++.=...+-..| T Consensus 141 ~~i~~~~v~d~i~li~q~r~i~slf~tLP~~g~T~eY~v~t~~~tV~~q~~~~k--qa~E-Gd~L~~gKl~~~t~tA~ik 217 (410) T protein:vir:83 141 GVIPDPIVGPVIDFIDSARPLVSTLGTLPLNNATFYRPIVSQRPAVGLQGVAGG--ASDE-KTELDSQKMVIDRLTVNAK 217 (410) T ss_pred cccchhHhhhHHHHHhhccchhhhhhhCCCCCCeeEEeeecccccccccccccc--cccc-cccccccceeeeeccceee Confidence 223333322211100 0111111111221100000 11223333333322221 1122 5667888888788888999 Q ss_pred cccceeeccHHHhhccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCccccCCcccccccccccccccccchhhh--- Q lcl|NC_013594. 78 TFEGTVGISRDDFEDDNLGIYAPIFQEMGRSAAVQPDELIFKLLKDGFTQPCYDGQNFFDKEHPVYPNVDGTGSAVN--- 154 (305) Q Consensus 78 tfg~~i~i~R~~I~nDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~~~DGk~fF~adH~~~~n~~~tg~~~~--- 154 (305) |||.-..+|||.||.-...++.-.++.|+.+.+.+-...|=+.|.. +..--+++..-.+|.-...-.+..+.... T Consensus 218 TyGGyt~LSRQ~IERs~v~~L~~~lraL~~AYA~atea~vra~L~~--t~t~~~a~~~~Tad~~~~~i~da~~~v~da~~ 295 (410) T protein:vir:83 218 TLGGYVNVSRQAIDFSSPSALDLVVNGLGQQYAIETEALVGAALAS--TSTGAVGYGNATADNVASAIWQAAGAVYTAVK 295 (410) T ss_pred hhcCcccccceeeecCChhhHHHHHHHHHHHHHHHHHHHHHHHHHH--hhhhhhhhhhccHHHHHHHHHHHHHHHhhhhc Confidence 9999999999999999999999999999999988888887777744 21112222222222100000000000000 Q ss_pred --------hH------------hhhc-cCCCcc----------------ceeeeccccc-----ccccchhhcccchhhh Q lcl|NC_013594. 155 --------TS------------NIVE-QDSFSG----------------LPFYLLDCSR-----AVKPLIFQERRKPELV 192 (305) Q Consensus 155 --------~s------------~l~~-~~~~~G----------------~~~~l~~~~~-----~vkP~i~Q~r~~~~f~ 192 (305) +| ++.. ..+..| -|..++|... ..-|...+..... + T Consensus 296 ~~~~~~i~vS~DVl~~~~~~f~~~~~~~~dt~Gfg~~~lg~gi~G~~~~ipVvm~~~a~AgTA~f~~~~Ai~~~eS~--~ 373 (410) T protein:vir:83 296 GMGRLVIAIAPDVLGDFGPLFAPVNPTNAHSTGFEAGRFGQGVMGSISGIPVVMSAALGSGDAYLFSTAAIECFEQR--V 373 (410) T ss_pred cceeeeEEechhhhhhccceeeccCCCCcccccccccccccchhhhhcccceEEecCCCcCeeeEeccceeeeeecC--C Confidence 00 0000 001111 1111221110 0011111111100 0 Q ss_pred hhhccccccchhc-ccceeeeecccccccchhhhhhhccc-----ccchH Q lcl|NC_013594. 193 ARTRIDDDHVFMD-NEFLFGASARRAAGYGFWQMAVAVKG-----DLTLD 236 (305) Q Consensus 193 a~t~~~~~nvf~~-~~~~~g~d~r~n~G~g~wq~A~~s~~-----~Lt~~ 236 (305) ..++..+++++.- +.|- ||= +++..- +|+-. T Consensus 374 gp~qL~d~~i~nLt~~yS---------gY~----a~a~~~~~gliPv~g~ 410 (410) T protein:vir:83 374 GTLQVVEPSVFGLQVAYA---------GYF----STLVVNEDAIVPLVGS 410 (410) T ss_pred ceeEeeCCchhhhhhhhe---------eee----eeccccccceeeeccC Confidence 1123334444321 1111 111 111000 01100 No 52 >protein:vir:105822 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655767;genbank:gi:109522090;genbank:GeneID:4157630 Probab=47.71 E-value=0.69 Score=21.46 Aligned_cols=193 Identities=12% Similarity=0.044 Sum_probs=90.5 Q ss_pred CCcCHHHHHHHHHHHHHHHHHHHhhhhhhhhceeeecCCccchhhhhhccCCCchhhhc---cceeeeccccccceeeee Q lcl|NC_013594. 1 MIVTPASIKALMTSWRKDFQGGLEDAPSQYNKIAMVVNSSTRSNTYGWLGKFPTLKEWV---GKRTIQQMEAHGYSIANK 77 (305) Q Consensus 1 M~i~~~~l~~L~~~~~~~~~~~~~~a~~t~~~~a~~v~S~~~~~~y~~Lg~~P~l~ew~---Ge~~~~~l~~~~~~i~n~ 77 (305) |.++.-.-+-....+-+.|++..--++-.++...... ....+-+....+. +...+.. +......+.+...+++.. T Consensus 1 MA~~~~~pe~~~~~v~~~~~~~lv~~~l~~~~~~~~~-~~Gdtv~ip~~~~-~~~~d~~~~~~~~~~~~~~~~~~~~tid 78 (273) T protein:vir:10 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYEGTA-SKGNVVHIAGVVA-PTVKDYKAAGRQTSADAISDTGVDLLID 78 (273) T ss_pred CcchhhhHHHHHHHHHHHHHhhhccchhhcccccccc-ccCceEEEeeccc-ccccccccCCCccCccccccceEEEEEe Confidence 9998432232322344555555443443333321111 1111222222232 1222221 233445666666677664 Q ss_pred cc-cceeeccHHHhhccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCccccCCcccccccccccccccccchhhhhH Q lcl|NC_013594. 78 TF-EGTVGISRDDFEDDNLGIYAPIFQEMGRSAAVQPDELIFKLLKDGFTQPCYDGQNFFDKEHPVYPNVDGTGSAVNTS 156 (305) Q Consensus 78 tf-g~~i~i~R~~I~nDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~~~DGk~fF~adH~~~~n~~~tg~~~~~s 156 (305) ++ ...+.|+..+-..+... +..+.+.++++-++..|..++.++..+.+. + . + T Consensus 79 ~~~~~~~~i~d~d~~~~~~~-~~~~~~~~~~alA~~vD~~i~~~~~~a~~~--------------~---~-~-------- 131 (273) T protein:vir:10 79 QEKSIDFLVDDIDRVQVAGS-LEAYTRAGATALATDTDKFIADMLVDNGTA--------------L---T-G-------- 131 (273) T ss_pred eeeecceEeecHHHhhhhcc-HHHHHHHHHHHHHHHHHHHHHHHHhccccc--------------c---c-c-------- Confidence 43 56666775444333333 466788888888888898888877652110 0 0 0 Q ss_pred hhhccCCCccceeeecccccccccchhhcccchhhhhhhccccccchhcccceeeeecccccccchhhhhhhcccccch- Q lcl|NC_013594. 157 NIVEQDSFSGLPFYLLDCSRAVKPLIFQERRKPELVARTRIDDDHVFMDNEFLFGASARRAAGYGFWQMAVAVKGDLTL- 235 (305) Q Consensus 157 ~l~~~~~~~G~~~~l~~~~~~vkP~i~Q~r~~~~f~a~t~~~~~nvf~~~~~~~g~d~r~n~G~g~wq~A~~s~~~Lt~- 235 (305) +.+++. T Consensus 132 -------------------------------------------------------------------------~~~~~~~ 138 (273) T protein:vir:10 132 -------------------------------------------------------------------------SAPTDAD 138 (273) T ss_pred -------------------------------------------------------------------------ccccchh Confidence 001111 Q ss_pred ---HHHHHHHHHHHHhhcCCCceeccccCeEEecchhHHHHHH---HHhhcccCCcccccccccc-c------eeeEEec Q lcl|NC_013594. 236 ---DNLWKGWQLMRAFEGDGGKKLGLKPTHIVVPVGLEKAAEQ---LLNRELFADGNTTVSNEMK-G------KLQLVVA 302 (305) Q Consensus 236 ---~~l~~ar~aM~~~k~~~G~~L~i~P~~LvVpp~le~~A~~---ll~~~~~~~~~~~~~N~~~-~------~~~~iv~ 302 (305) +.+-+|+++| ++.+-|- ..++|||+|.....-++ .+......++ .+.++ | -++++.+ T Consensus 139 ~~~~~i~~a~~~l----d~~~vP~--~~R~lvv~p~~~~~L~~~~~~~~~~~~~~~----~~~l~~G~ig~i~G~~v~~s 208 (273) T protein:vir:10 139 DAFDLIAKALKEL----TKANVPN--VGRVVVVNAEMAFWLRSSGSKLTSADTSGD----AAGLRAGTIGNLLGARIVES 208 (273) T ss_pred HHHHHHHHHHHHh----hhcCCCc--CCCEEEECHHHHHHHhcchhhhhhhhcccc----ccceeeeeeeEEeceEEEEe Confidence 2233333443 4444443 34688999976664332 1211111111 12221 1 2688888 Q ss_pred ccC Q lcl|NC_013594. 303 DYL 305 (305) Q Consensus 303 p~L 305 (305) ..| T Consensus 209 ~~l 211 (273) T protein:vir:10 209 NNL 211 (273) T ss_pred ccc Confidence 888 No 53 >protein:vir:102605 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655002;genbank:gi:109392192;genbank:GeneID:4157227 Probab=47.71 E-value=0.69 Score=21.46 Aligned_cols=193 Identities=12% Similarity=0.044 Sum_probs=90.5 Q ss_pred CCcCHHHHHHHHHHHHHHHHHHHhhhhhhhhceeeecCCccchhhhhhccCCCchhhhc---cceeeeccccccceeeee Q lcl|NC_013594. 1 MIVTPASIKALMTSWRKDFQGGLEDAPSQYNKIAMVVNSSTRSNTYGWLGKFPTLKEWV---GKRTIQQMEAHGYSIANK 77 (305) Q Consensus 1 M~i~~~~l~~L~~~~~~~~~~~~~~a~~t~~~~a~~v~S~~~~~~y~~Lg~~P~l~ew~---Ge~~~~~l~~~~~~i~n~ 77 (305) |.++.-.-+-....+-+.|++..--++-.++...... ....+-+....+. +...+.. +......+.+...+++.. T Consensus 1 MA~~~~~pe~~~~~v~~~~~~~lv~~~l~~~~~~~~~-~~Gdtv~ip~~~~-~~~~d~~~~~~~~~~~~~~~~~~~~tid 78 (273) T protein:vir:10 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYEGTA-SKGNVVHIAGVVA-PTVKDYKAAGRQTSADAISDTGVDLLID 78 (273) T ss_pred CcchhhhHHHHHHHHHHHHHhhhccchhhcccccccc-ccCceEEEeeccc-ccccccccCCCccCccccccceEEEEEe Confidence 9998432232322344555555443443333321111 1111222222232 1222221 233445666666677664 Q ss_pred cc-cceeeccHHHhhccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCccccCCcccccccccccccccccchhhhhH Q lcl|NC_013594. 78 TF-EGTVGISRDDFEDDNLGIYAPIFQEMGRSAAVQPDELIFKLLKDGFTQPCYDGQNFFDKEHPVYPNVDGTGSAVNTS 156 (305) Q Consensus 78 tf-g~~i~i~R~~I~nDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~~~DGk~fF~adH~~~~n~~~tg~~~~~s 156 (305) ++ ...+.|+..+-..+... +..+.+.++++-++..|..++.++..+.+. + . + T Consensus 79 ~~~~~~~~i~d~d~~~~~~~-~~~~~~~~~~alA~~vD~~i~~~~~~a~~~--------------~---~-~-------- 131 (273) T protein:vir:10 79 QEKSIDFLVDDIDRVQVAGS-LEAYTRAGATALATDTDKFIADMLVDNGTA--------------L---T-G-------- 131 (273) T ss_pred eeeecceEeecHHHhhhhcc-HHHHHHHHHHHHHHHHHHHHHHHHhccccc--------------c---c-c-------- Confidence 43 56666775444333333 466788888888888898888877652110 0 0 0 Q ss_pred hhhccCCCccceeeecccccccccchhhcccchhhhhhhccccccchhcccceeeeecccccccchhhhhhhcccccch- Q lcl|NC_013594. 157 NIVEQDSFSGLPFYLLDCSRAVKPLIFQERRKPELVARTRIDDDHVFMDNEFLFGASARRAAGYGFWQMAVAVKGDLTL- 235 (305) Q Consensus 157 ~l~~~~~~~G~~~~l~~~~~~vkP~i~Q~r~~~~f~a~t~~~~~nvf~~~~~~~g~d~r~n~G~g~wq~A~~s~~~Lt~- 235 (305) +.+++. T Consensus 132 -------------------------------------------------------------------------~~~~~~~ 138 (273) T protein:vir:10 132 -------------------------------------------------------------------------SAPTDAD 138 (273) T ss_pred -------------------------------------------------------------------------ccccchh Confidence 001111 Q ss_pred ---HHHHHHHHHHHHhhcCCCceeccccCeEEecchhHHHHHH---HHhhcccCCcccccccccc-c------eeeEEec Q lcl|NC_013594. 236 ---DNLWKGWQLMRAFEGDGGKKLGLKPTHIVVPVGLEKAAEQ---LLNRELFADGNTTVSNEMK-G------KLQLVVA 302 (305) Q Consensus 236 ---~~l~~ar~aM~~~k~~~G~~L~i~P~~LvVpp~le~~A~~---ll~~~~~~~~~~~~~N~~~-~------~~~~iv~ 302 (305) +.+-+|+++| ++.+-|- ..++|||+|.....-++ .+......++ .+.++ | -++++.+ T Consensus 139 ~~~~~i~~a~~~l----d~~~vP~--~~R~lvv~p~~~~~L~~~~~~~~~~~~~~~----~~~l~~G~ig~i~G~~v~~s 208 (273) T protein:vir:10 139 DAFDLIAKALKEL----TKANVPN--VGRVVVVNAEMAFWLRSSGSKLTSADTSGD----AAGLRAGTIGNLLGARIVES 208 (273) T ss_pred HHHHHHHHHHHHh----hhcCCCc--CCCEEEECHHHHHHHhcchhhhhhhhcccc----ccceeeeeeeEEeceEEEEe Confidence 2233333443 4444443 34688999976664332 1211111111 12221 1 2688888 Q ss_pred ccC Q lcl|NC_013594. 303 DYL 305 (305) Q Consensus 303 p~L 305 (305) ..| T Consensus 209 ~~l 211 (273) T protein:vir:10 209 NNL 211 (273) T ss_pred ccc Confidence 888 No 54 >protein:vir:2504 Length: 305 # NCBI annotation: major capsid subunit gp9 # Family: family:all:507 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569745;genbank:gi:18496895;genbank:GeneID:932268 Probab=43.57 E-value=0.5 Score=22.22 Aligned_cols=236 Identities=12% Similarity=0.168 Sum_probs=99.4 Q ss_pred CCcCHHH------HHHHHHHHHHHHHHHHhhhhhhhhceeeecCCccchhhhhhccCCCchhhhccceee--------ec Q lcl|NC_013594. 1 MIVTPAS------IKALMTSWRKDFQGGLEDAPSQYNKIAMVVNSSTRSNTYGWLGKFPTLKEWVGKRTI--------QQ 66 (305) Q Consensus 1 M~i~~~~------l~~L~~~~~~~~~~~~~~a~~t~~~~a~~v~S~~~~~~y~~Lg~~P~l~ew~Ge~~~--------~~ 66 (305) |..+... ...+. +..+....+..| ..++++.++......++..+..-|. -.|+||..- .+ T Consensus 1 ma~~t~~~gg~liP~~~~---~~Ii~~~~~~s~--l~~l~~~~~~~~~~~~~p~~~~~~~-a~wv~E~~~~~~~~~~~s~ 74 (305) T protein:vir:25 1 MADISRAEVASLIQEAYS---DTLLAAAKQGST--VLSAFQNVNMGTKTTHLPVLATLPE-ADWVGESATDPKGVKPTSK 74 (305) T ss_pred CCCccCCccceecCHHHH---HHHHHHHHhhch--hhhhcceeeccCCcEEEEEEeCCcc-eEEeecccccccccccccc Confidence 5444321 11111 222333333332 5666777765544445544444443 357766432 12 Q ss_pred cccccceeeeecccceeeccHHHhhccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCccccCCcccccccccccc-- Q lcl|NC_013594. 67 MEAHGYSIANKTFEGTVGISRDDFEDDNLGIYAPIFQEMGRSAAVQPDELIFKLLKDGFTQPCYDGQNFFDKEHPVYP-- 144 (305) Q Consensus 67 l~~~~~~i~n~tfg~~i~i~R~~I~nDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~~~DGk~fF~adH~~~~-- 144 (305) +.=..-+++.++++..+.||++.+.+-...+.+-+.+.++++.++.+++.++. |.+ .++.++....-... T Consensus 75 ~~f~~i~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~~~a~~~d~a~~~----G~g----~~~~~~~~~~~~~~~~ 146 (305) T protein:vir:25 75 VTWANRTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIF----GTD----KPASWVSPALIPAAVT 146 (305) T ss_pred cceeeEEeeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHhhhhee----ccC----CCCCcccccccccccc Confidence 22234567789999999999999988889999999999999999999988772 211 12223322110000 Q ss_pred ---cccccchhh-------------------------------hhHhhhccCCCccceeeecccccccccchhhcccchh Q lcl|NC_013594. 145 ---NVDGTGSAV-------------------------------NTSNIVEQDSFSGLPFYLLDCSRAVKPLIFQERRKPE 190 (305) Q Consensus 145 ---n~~~tg~~~-------------------------------~~s~l~~~~~~~G~~~~l~~~~~~vkP~i~Q~r~~~~ 190 (305) ....+.... ....+..-++..|.+++. |....=.|+++-...+ T Consensus 147 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~i~~-~~~l~G~Pv~~~~~~~-- 223 (305) T protein:vir:25 147 AGQAVEVVGGVANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFR-DDSFAGFRTFFNRNGA-- 223 (305) T ss_pred ccccccccccchhhhHHHHHHHHHHHhhhhcccccceeEecHHHHHHHHHhhccCCceeec-CCcccccceEEcCccC-- Confidence 000000000 011122224455555442 2222223443322110 Q ss_pred hhhhhccccccchhc--ccceeeeecc-------------cccccchhhh---hhhcccccchHHHHHHHHHHHHhhcCC Q lcl|NC_013594. 191 LVARTRIDDDHVFMD--NEFLFGASAR-------------RAAGYGFWQM---AVAVKGDLTLDNLWKGWQLMRAFEGDG 252 (305) Q Consensus 191 f~a~t~~~~~nvf~~--~~~~~g~d~r-------------~n~G~g~wq~---A~~s~~~Lt~~~l~~ar~aM~~~k~~~ 252 (305) .+.++..++.- +.|.+|.... ....+.+||- ++....-+. . ...+.. T Consensus 224 ----~~~~~~~~~~gd~s~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~R~~~r~~----------~-~v~~p~ 288 (305) T protein:vir:25 224 ----WDADAAIEVIADSSRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFA----------Y-VLGVSA 288 (305) T ss_pred ----CCCCccEEEEEecceEEEEEecCeEEEEeeeeeeecCCceeeeeecCcEEEEEEEeec----------c-eeeCcc Confidence 01111122211 1222221100 0111223321 111100000 0 000011 Q ss_pred C-ceeccccCeEEecch Q lcl|NC_013594. 253 G-KKLGLKPTHIVVPVG 268 (305) Q Consensus 253 G-~~L~i~P~~LvVpp~ 268 (305) . ..+...|--.|-|.+ T Consensus 289 a~v~~~~~~~~~~~pa~ 305 (305) T protein:vir:25 289 TAQGANKTPVAVVAPAA 305 (305) T ss_pred cEEEEccccccccCCCC Confidence 1 123444444455555 No 55 >protein:vir:191 Length: 385 # NCBI annotation: major head subunit precursor # Family: family:all:585 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037701;genbank:gi:9634158;genbank:GeneID:1262530 Probab=43.20 E-value=0.85 Score=20.96 Aligned_cols=232 Identities=12% Similarity=0.032 Sum_probs=84.1 Q ss_pred CCcCHHHHH-HHHHHHHHHHHHHHhhhhhhhhceeeecCCccchhhhhhccCCCchhhhcccee---eeccccccceeee Q lcl|NC_013594. 1 MIVTPASIK-ALMTSWRKDFQGGLEDAPSQYNKIAMVVNSSTRSNTYGWLGKFPTLKEWVGKRT---IQQMEAHGYSIAN 76 (305) Q Consensus 1 M~i~~~~l~-~L~~~~~~~~~~~~~~a~~t~~~~a~~v~S~~~~~~y~~Lg~~P~l~ew~Ge~~---~~~l~~~~~~i~n 76 (305) |..+...=. -+-.-+...+....... ....++|+.++-....-+|......+.--.|++|.. -.++.=..-++.. T Consensus 105 ~~~~~~~~g~~i~~~~~~~ii~~~~~~-~~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~~~~ 183 (385) T protein:vir:19 105 LGSDADSAGSLIQPMQIPGIIMPGLRR-LTIRDLLAQGRTSSNALEYVREEVFTNNADVVAEKALKPESDITFSKQTANV 183 (385) T ss_pred hccccccCCceecchhhhHHHHHhhhc-cchhhhcceecccCcceEEEEEecCCcceeeeccCccccccccceeEEEEee Confidence 111110000 00000111122222221 123444555553333333433333322335665532 2333335567889 Q ss_pred ecccceeeccHHHhhccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCccccCCcccccccccccccccccchhhhhH Q lcl|NC_013594. 77 KTFEGTVGISRDDFEDDNLGIYAPIFQEMGRSAAVQPDELIFKLLKDGFTQPCYDGQNFFDKEHPVYPNVDGTGSAVNTS 156 (305) Q Consensus 77 ~tfg~~i~i~R~~I~nDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~~~DGk~fF~adH~~~~n~~~tg~~~~~s 156 (305) ++++..+.|||+.+. |--.+..-+...++++.++..+..++ . |..+ |+ +|.+ -....... T Consensus 184 ~k~~~~~~is~ell~-d~~~l~~~i~~~la~a~~~~~d~~~l---~-G~g~----~~------~~~G-----i~~~~~~~ 243 (385) T protein:vir:19 184 KTIAHWVQASRQVMD-DAPMLQSYINNRLMYGLALKEEGQLL---N-GDGT----GD------NLEG-----LNKVATAY 243 (385) T ss_pred eeEEEeehhhHHHHh-hHHHHHHHHHHHHHHHHHHHHHHHHH---h-ccCC----CC------cccc-----cccccccc Confidence 999999999999766 55567778888999999888886544 2 2111 11 1111 00000000 Q ss_pred hhhccCCCccceee--ecccccccccchhhcccchhhhhhhccccccchhcccceeeeecccccccchhhhhhhcccccc Q lcl|NC_013594. 157 NIVEQDSFSGLPFY--LLDCSRAVKPLIFQERRKPELVARTRIDDDHVFMDNEFLFGASARRAAGYGFWQMAVAVKGDLT 234 (305) Q Consensus 157 ~l~~~~~~~G~~~~--l~~~~~~vkP~i~Q~r~~~~f~a~t~~~~~nvf~~~~~~~g~d~r~n~G~g~wq~A~~s~~~Lt 234 (305) ... ....+...+ |.+....+++ . .+.++. | -++ T Consensus 244 ~~~--~~~~~~~~~d~i~~~~~~l~~-------------------------------~-~~~~~~---~--------~~~ 278 (385) T protein:vir:19 244 DTS--LNATGDTRADIIAHAIYQVTE-------------------------------S-EFSASG---I--------VLN 278 (385) T ss_pred ccc--ccccccchHHHHHHHHHhhcc-------------------------------c-cCCCCE---E--------EEc Confidence 000 000000000 0000000000 0 000100 0 122 Q ss_pred hHHHHHHHHHHHHhhcCCCceeccccCeEEecchhHHHHHHHHhhcccCCcccccccc---c----cceeeEEecccC Q lcl|NC_013594. 235 LDNLWKGWQLMRAFEGDGGKKLGLKPTHIVVPVGLEKAAEQLLNRELFADGNTTVSNE---M----KGKLQLVVADYL 305 (305) Q Consensus 235 ~~~l~~ar~aM~~~k~~~G~~L~i~P~~LvVpp~le~~A~~ll~~~~~~~~~~~~~N~---~----~~~~~~iv~p~L 305 (305) ... +.++++.||.+|++|--.|. =-.+..| ....++.++.+|.+..-..|. + +.-+++.+.... T Consensus 279 ~~~----~~~l~~lkd~~G~~l~~~~~-~~~~~~l--~G~pV~~~~~~p~~~~~~gd~~~~~~~~~~~~~~v~~~~~~ 349 (385) T protein:vir:19 279 PRD----WHNIALLKDNEGRYIFGGPQ-AFTSNIM--WGLPVVPTKAQAAGTFTVGGFDMASQVWDRMDATVEVSRED 349 (385) T ss_pred HHH----HHHHHHhhcCCCceeccCcc-cCCCcee--cceeeEEcCcCCCCcEEEeecccEEEEEEecceEEEEeccc Confidence 222 34566677777777631110 0000000 011122223333332111110 0 112333333222 No 56 >protein:vir:1886 Length: 385 # NCBI annotation: major capsid subunit precursor # Family: family:all:585 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037666;genbank:gi:9634124;genbank:GeneID:1262513 Probab=43.20 E-value=0.85 Score=20.96 Aligned_cols=232 Identities=12% Similarity=0.032 Sum_probs=84.1 Q ss_pred CCcCHHHHH-HHHHHHHHHHHHHHhhhhhhhhceeeecCCccchhhhhhccCCCchhhhcccee---eeccccccceeee Q lcl|NC_013594. 1 MIVTPASIK-ALMTSWRKDFQGGLEDAPSQYNKIAMVVNSSTRSNTYGWLGKFPTLKEWVGKRT---IQQMEAHGYSIAN 76 (305) Q Consensus 1 M~i~~~~l~-~L~~~~~~~~~~~~~~a~~t~~~~a~~v~S~~~~~~y~~Lg~~P~l~ew~Ge~~---~~~l~~~~~~i~n 76 (305) |..+...=. -+-.-+...+....... ....++|+.++-....-+|......+.--.|++|.. -.++.=..-++.. T Consensus 105 ~~~~~~~~g~~i~~~~~~~ii~~~~~~-~~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~~~~ 183 (385) T protein:vir:18 105 LGSDADSAGSLIQPMQIPGIIMPGLRR-LTIRDLLAQGRTSSNALEYVREEVFTNNADVVAEKALKPESDITFSKQTANV 183 (385) T ss_pred hccccccCCceecchhhhHHHHHhhhc-cchhhhcceecccCcceEEEEEecCCcceeeeccCccccccccceeEEEEee Confidence 111110000 00000111122222221 123444555553333333433333322335665532 2333335567889 Q ss_pred ecccceeeccHHHhhccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCccccCCcccccccccccccccccchhhhhH Q lcl|NC_013594. 77 KTFEGTVGISRDDFEDDNLGIYAPIFQEMGRSAAVQPDELIFKLLKDGFTQPCYDGQNFFDKEHPVYPNVDGTGSAVNTS 156 (305) Q Consensus 77 ~tfg~~i~i~R~~I~nDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~~~DGk~fF~adH~~~~n~~~tg~~~~~s 156 (305) ++++..+.|||+.+. |--.+..-+...++++.++..+..++ . |..+ |+ +|.+ -....... T Consensus 184 ~k~~~~~~is~ell~-d~~~l~~~i~~~la~a~~~~~d~~~l---~-G~g~----~~------~~~G-----i~~~~~~~ 243 (385) T protein:vir:18 184 KTIAHWVQASRQVMD-DAPMLQSYINNRLMYGLALKEEGQLL---N-GDGT----GD------NLEG-----LNKVATAY 243 (385) T ss_pred eeEEEeehhhHHHHh-hHHHHHHHHHHHHHHHHHHHHHHHHH---h-ccCC----CC------cccc-----cccccccc Confidence 999999999999766 55567778888999999888886544 2 2111 11 1111 00000000 Q ss_pred hhhccCCCccceee--ecccccccccchhhcccchhhhhhhccccccchhcccceeeeecccccccchhhhhhhcccccc Q lcl|NC_013594. 157 NIVEQDSFSGLPFY--LLDCSRAVKPLIFQERRKPELVARTRIDDDHVFMDNEFLFGASARRAAGYGFWQMAVAVKGDLT 234 (305) Q Consensus 157 ~l~~~~~~~G~~~~--l~~~~~~vkP~i~Q~r~~~~f~a~t~~~~~nvf~~~~~~~g~d~r~n~G~g~wq~A~~s~~~Lt 234 (305) ... ....+...+ |.+....+++ . .+.++. | -++ T Consensus 244 ~~~--~~~~~~~~~d~i~~~~~~l~~-------------------------------~-~~~~~~---~--------~~~ 278 (385) T protein:vir:18 244 DTS--LNATGDTRADIIAHAIYQVTE-------------------------------S-EFSASG---I--------VLN 278 (385) T ss_pred ccc--ccccccchHHHHHHHHHhhcc-------------------------------c-cCCCCE---E--------EEc Confidence 000 000000000 0000000000 0 000100 0 122 Q ss_pred hHHHHHHHHHHHHhhcCCCceeccccCeEEecchhHHHHHHHHhhcccCCcccccccc---c----cceeeEEecccC Q lcl|NC_013594. 235 LDNLWKGWQLMRAFEGDGGKKLGLKPTHIVVPVGLEKAAEQLLNRELFADGNTTVSNE---M----KGKLQLVVADYL 305 (305) Q Consensus 235 ~~~l~~ar~aM~~~k~~~G~~L~i~P~~LvVpp~le~~A~~ll~~~~~~~~~~~~~N~---~----~~~~~~iv~p~L 305 (305) ... +.++++.||.+|++|--.|. =-.+..| ....++.++.+|.+..-..|. + +.-+++.+.... T Consensus 279 ~~~----~~~l~~lkd~~G~~l~~~~~-~~~~~~l--~G~pV~~~~~~p~~~~~~gd~~~~~~~~~~~~~~v~~~~~~ 349 (385) T protein:vir:18 279 PRD----WHNIALLKDNEGRYIFGGPQ-AFTSNIM--WGLPVVPTKAQAAGTFTVGGFDMASQVWDRMDATVEVSRED 349 (385) T ss_pred HHH----HHHHHHhhcCCCceeccCcc-cCCCcee--cceeeEEcCcCCCCcEEEeecccEEEEEEecceEEEEeccc Confidence 222 34566677777777631110 0000000 011122223333332111110 0 112333333222 No 57 >protein:vir:4456 Length: 401 # NCBI annotation: Major capsid protein precursor # Family: family:all:21 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700379;genbank:gi:23505451;genbank:GeneID:955658 Probab=40.65 E-value=0.73 Score=21.32 Aligned_cols=221 Identities=14% Similarity=-0.001 Sum_probs=85.7 Q ss_pred CCcCHHHHHHHH------------HHHHHHHHHHHhhhhhhhhceeeecCCccchhhhhhccCCCchhhhccceeeec-c Q lcl|NC_013594. 1 MIVTPASIKALM------------TSWRKDFQGGLEDAPSQYNKIAMVVNSSTRSNTYGWLGKFPTLKEWVGKRTIQQ-M 67 (305) Q Consensus 1 M~i~~~~l~~L~------------~~~~~~~~~~~~~a~~t~~~~a~~v~S~~~~~~y~~Lg~~P~l~ew~Ge~~~~~-l 67 (305) ..+...-.+++. .-+...+.+...... ...++|+.++-+.....+...-.-+. -.|+||..-.. . T Consensus 97 ~~~~~~e~~a~~~~~~~~GG~~iP~~~~~~ii~~~~~~~-~l~~~~~~~~~~~~~~~~~~~~~~~~-a~wv~E~~~~~~~ 174 (401) T protein:vir:44 97 DGLRDLERKALQVGTDEDGGYAVPEELDRSILSLLKDEV-VMRQEATVITVGGSDYKKLVNLGGTA-SGWVGETDTRSQT 174 (401) T ss_pred hhhHHHHHHHhhcCCCCCCceeccHhHHHHHHHHHHhhh-hhhhhceeeecCCCceEEEEecCCcc-ceeeccccccCcc Confidence 000000000000 011222222222221 23445666553333323332222222 35776654321 1 Q ss_pred ---ccccceeeeecccceeeccHHHhhccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCccccCCccccc------- Q lcl|NC_013594. 68 ---EAHGYSIANKTFEGTVGISRDDFEDDNLGIYAPIFQEMGRSAAVQPDELIFKLLKDGFTQPCYDGQNFFD------- 137 (305) Q Consensus 68 ---~~~~~~i~n~tfg~~i~i~R~~I~nDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~~~DGk~fF~------- 137 (305) .=..-++..++++..+.||++.+.|-...+..-+.+.|+++-++.++..++. |..+.-..| +++ T Consensus 175 ~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~ai~~~~~~~~l~----G~G~~~p~G--il~~~~~~~~ 248 (401) T protein:vir:44 175 ATSRLGLIEPFMGEIYGNPQATQKMLDDAFFNVEAWINSELATEFAEQEEIAFTT----GDGTKKPKG--FLAYESTEES 248 (401) T ss_pred ccccceeeeeehhheeeehhhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhc----cCCCCccce--eecccccccc Confidence 2233477788999999999999998888999999999999999888776552 111000000 111 Q ss_pred ------------------------------ccccccccccc-cchhhhhHhhhccCCCccceeeeccc------cccccc Q lcl|NC_013594. 138 ------------------------------KEHPVYPNVDG-TGSAVNTSNIVEQDSFSGLPFYLLDC------SRAVKP 180 (305) Q Consensus 138 ------------------------------adH~~~~n~~~-tg~~~~~s~l~~~~~~~G~~~~l~~~------~~~vkP 180 (305) +=|+.+...+. --+......|..-++..|.+++.-+. ++.=.| T Consensus 249 ~~~~~~~~~~~~~t~~~~~~~~d~i~~~~~~l~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~g~~~~l~G~P 328 (401) T protein:vir:44 249 DKARAFGKLQHIVSGEATAVTADAIIKLIYTLRKAHRTGAKFMMNNNSLFAIRLLKDTEGNYLWRPGLELGQPSSLAGYG 328 (401) T ss_pred ccccccccccccccccccccCHHHHHHHHHhcchhhhcCCEEEEcHHHHHHHHHhhccCCceeecCCcCCCCCceeccee Confidence 11110000000 00001112233445667776654321 122234 Q ss_pred chhhcccchhhhhhhccccc------------------------cchhcccceeeeecccccccchhhhhhhcccccchH Q lcl|NC_013594. 181 LIFQERRKPELVARTRIDDD------------------------HVFMDNEFLFGASARRAAGYGFWQMAVAVKGDLTLD 236 (305) Q Consensus 181 ~i~Q~r~~~~f~a~t~~~~~------------------------nvf~~~~~~~g~d~r~n~G~g~wq~A~~s~~~Lt~~ 236 (305) +++-...+ . ...++. +-|.++...|-+..|.... + +..+ T Consensus 329 Vv~~~~~p-~----~~~~~~~i~~Gd~~~~~~i~~~~~~~~~~~~~~~~~~v~~~a~~r~d~~--~----------~~~~ 391 (401) T protein:vir:44 329 IAENEQMP-D----IAADAKAIAFGNFKRGYTIVDRIGTRILRDPYTNKPFVGFYTTKRTGGM--L----------VDSQ 391 (401) T ss_pred eEEecCcC-C----ccCCccEEEEeehhccEEEEEecceEEeeeccccCCcEEEEEEEEeccE--E----------eccc Confidence 43322211 0 000111 1122222223222222211 0 0000 Q ss_pred HHHHHHHHHHHhhcCCCceeccccC Q lcl|NC_013594. 237 NLWKGWQLMRAFEGDGGKKLGLKPT 261 (305) Q Consensus 237 ~l~~ar~aM~~~k~~~G~~L~i~P~ 261 (305) + -+.|.+... T Consensus 392 a---------------~~~l~~~aa 401 (401) T protein:vir:44 392 A---------------IKLLKIAAA 401 (401) T ss_pred c---------------eEEEEeecC Confidence 0 011111111 No 58 >protein:vir:78523 Length: 338 # NCBI annotation: Putative head structural protein # Family: family:all:507 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491585;genbank:gi:157786408;genbank:GeneID:5625675 Probab=40.01 E-value=0.99 Score=20.60 Aligned_cols=239 Identities=12% Similarity=0.079 Sum_probs=90.5 Q ss_pred CCcCHHHHHHHHH------------------HHHHHHHHHHhhhhhhhhceeeecCCccchhhhhhccCCC-------ch Q lcl|NC_013594. 1 MIVTPASIKALMT------------------SWRKDFQGGLEDAPSQYNKIAMVVNSSTRSNTYGWLGKFP-------TL 55 (305) Q Consensus 1 M~i~~~~l~~L~~------------------~~~~~~~~~~~~a~~t~~~~a~~v~S~~~~~~y~~Lg~~P-------~l 55 (305) |. +-.-|++..+ .+...+.+-... .....++|++++-.....++..+..-| .- T Consensus 1 ~~-~~~e~~~~~~~~~~~~~~~~~~~~liP~~~~~~ii~~~~~-~s~l~~l~~~~~~~~~~~~ip~~~~~~~a~~v~~~~ 78 (338) T protein:vir:78 1 MA-TLNELAPNTAGSNHQGRLAHVPSDLLPKEIVGPIFDKAQE-SSLVLRLGENIPISYGETIIPTTVKRPEVGQVGVGT 78 (338) T ss_pred Cc-chHHhhhhhcccccccceecccccccchHHHHHHHHHHHh-hchhhhhcceeeccCCceEEEEEecCccceeecccc Confidence 10 0001111100 111111111111 122455566655333333332222222 12 Q ss_pred hhhccc---eeeeccccccceeeeecccceeeccHHHhhccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCccccCC Q lcl|NC_013594. 56 KEWVGK---RTIQQMEAHGYSIANKTFEGTVGISRDDFEDDNLGIYAPIFQEMGRSAAVQPDELIFKLLKDGFTQPCYDG 132 (305) Q Consensus 56 ~ew~Ge---~~~~~l~~~~~~i~n~tfg~~i~i~R~~I~nDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~~~DG 132 (305) -.|+|| ....++.=..-+++.++++..+.||++.+.+....+..-+.+.++++.++.+++.++. |..+....+ T Consensus 79 ~~~~~Eg~~~~~~~~~f~~v~l~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~a~~~~~d~~~l~----G~g~~~~~~ 154 (338) T protein:vir:78 79 SNEQREGGTKPLSGTAWDTRSVAPIKLATIVTVSEEFARMNPSGLYTKLQADLAYAIGRGIDLAVFH----GKSPLTGSA 154 (338) T ss_pred cccccccccccccccceeEEEEEEEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHhhc----ccCCCcccc Confidence 234444 3333344455677889999999999999999889999999999999999999886552 222211000 Q ss_pred cccccccccccccccccchhhhhHhhhccCCCccceeeecccccccccchhhcccchhhhhhhccccccchhcccceeee Q lcl|NC_013594. 133 QNFFDKEHPVYPNVDGTGSAVNTSNIVEQDSFSGLPFYLLDCSRAVKPLIFQERRKPELVARTRIDDDHVFMDNEFLFGA 212 (305) Q Consensus 133 k~fF~adH~~~~n~~~tg~~~~~s~l~~~~~~~G~~~~l~~~~~~vkP~i~Q~r~~~~f~a~t~~~~~nvf~~~~~~~g~ 212 (305) -.-+-+++.. ........ ...+.. .+.+ .+...... +..+. T Consensus 155 ~~gi~~~~~~-------~~~~~~~~-----~~~~~~-~~~~----------------~~~~~~~~-----~~~~~----- 195 (338) T protein:vir:78 155 LQGIDTNNVI-------VNTTNVDY-----LQTGTT-PLLD----------------RFLDGYDL-----VSANT----- 195 (338) T ss_pred cccccccccc-------cccccccc-----ccccch-hhHH----------------HHHHHHHH-----hhhhc----- Confidence 0001111110 00000000 000000 0000 00000000 00000 Q ss_pred ecccccccchhhhhhhcccccchHHHHHHHHHHHHhhcCCCceecc------ccCeEEecchhHHHHHHHHhhcccCCcc Q lcl|NC_013594. 213 SARRAAGYGFWQMAVAVKGDLTLDNLWKGWQLMRAFEGDGGKKLGL------KPTHIVVPVGLEKAAEQLLNRELFADGN 286 (305) Q Consensus 213 d~r~n~G~g~wq~A~~s~~~Lt~~~l~~ar~aM~~~k~~~G~~L~i------~P~~LvVpp~le~~A~~ll~~~~~~~~~ 286 (305) +...+ + | -++.. ..+....++..||.+|+||-. .|..|. .+-++-++.+|... T Consensus 196 ~~~~~-~---~--------~m~~~-~~~~L~~~~~l~d~~g~~l~~~~~~~~~~~~l~--------G~PV~~~~~ip~~~ 254 (338) T protein:vir:78 196 DVDFN-G---W--------AADPR-YRARLLRSQAYRDANGNVDPTRINLAASAGDLL--------GLPVQFGKAVGGDL 254 (338) T ss_pred cccce-E---E--------EEchH-HHHHHHHHhhhccCCCceeecccccCCCCceee--------eeeEEEccccCccc Confidence 00000 0 0 01111 223335566778888888731 121221 11111222222110 Q ss_pred ----cc-------c-cccc---cceeeEEecccC Q lcl|NC_013594. 287 ----TT-------V-SNEM---KGKLQLVVADYL 305 (305) Q Consensus 287 ----~~-------~-~N~~---~~~~~~iv~p~L 305 (305) .. + ++.+ ++-+++-++++- T Consensus 255 ~~~~~~~~~~~~gdfs~~~~~~~~~~~i~~~~~~ 288 (338) T protein:vir:78 255 GAATDSKVRVVGGDFSQLKYGFADEIRVKMSDTA 288 (338) T ss_pred cccCCcccEEEEEecceEEEEeecccEEEEeecc Confidence 00 0 1111 122444444444 No 59 >protein:vir:78830 Length: 324 # NCBI annotation: major head protein # Family: family:all:507 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285361;genbank:gi:148717889;genbank:GeneID:5246961 Probab=32.37 E-value=1.4 Score=19.73 Aligned_cols=224 Identities=8% Similarity=0.052 Sum_probs=92.0 Q ss_pred CCcCHHHHHHHH-------------------------HHHHHHHHHHHhhhhhhhhceeeecCCccchhhhhhccCCCch Q lcl|NC_013594. 1 MIVTPASIKALM-------------------------TSWRKDFQGGLEDAPSQYNKIAMVVNSSTRSNTYGWLGKFPTL 55 (305) Q Consensus 1 M~i~~~~l~~L~-------------------------~~~~~~~~~~~~~a~~t~~~~a~~v~S~~~~~~y~~Lg~~P~l 55 (305) |..+..+++... ..+...+.+..... ....+++++++......+|..+..-|. T Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~-s~l~~l~~~~~~~~~~~~~p~~~~~~~- 81 (324) T protein:vir:78 4 TQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMEN-SKIMQLGKYEPMEGTEKKFTFWADKPG- 81 (324) T ss_pred chhhhHHHHHHHHHhhhhhhhccccccccCcCccccchhHHHHHHHHHHhh-chhhhhcceeeccCCceEEEEEecCcc- Confidence 222221111100 00111111111111 113344566664444444555544444 Q ss_pred hhhccceee---eccccccceeeeecccceeeccHHHhhccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCccccCC Q lcl|NC_013594. 56 KEWVGKRTI---QQMEAHGYSIANKTFEGTVGISRDDFEDDNLGIYAPIFQEMGRSAAVQPDELIFKLLKDGFTQPCYDG 132 (305) Q Consensus 56 ~ew~Ge~~~---~~l~~~~~~i~n~tfg~~i~i~R~~I~nDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~~~DG 132 (305) -.|++|-.- .++.=..-+++.++++..+.|||+.+.+.+..+..-+.+.++++.++..++.++. + .... ..+ T Consensus 82 a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~---G-~g~~-~~~ 156 (324) T protein:vir:78 82 AYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGIL---N-QGNN-PFG 156 (324) T ss_pred eeEecCCccccccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhc---c-CCCC-CcC Confidence 367655333 3344455677889999999999999999889999999999999999999886642 2 1110 111 Q ss_pred cccccccccccccccccchhhhhHhhhccCCCccce-ee-ecccccccccchhhcccchhhhhhhccccccchhccccee Q lcl|NC_013594. 133 QNFFDKEHPVYPNVDGTGSAVNTSNIVEQDSFSGLP-FY-LLDCSRAVKPLIFQERRKPELVARTRIDDDHVFMDNEFLF 210 (305) Q Consensus 133 k~fF~adH~~~~n~~~tg~~~~~s~l~~~~~~~G~~-~~-l~~~~~~vkP~i~Q~r~~~~f~a~t~~~~~nvf~~~~~~~ 210 (305) .-+.+.... .. ....+.. +- |.+..-.|.+ T Consensus 157 ~gi~~~~~~--------~~----------~~~~~~~t~~~i~~~~~~l~~------------------------------ 188 (324) T protein:vir:78 157 KSIAQSIEK--------TN----------KVIKGDFTQDNIIDLEALLED------------------------------ 188 (324) T ss_pred ccccccccc--------cc----------eeccccccHHHHHHHHHhhhh------------------------------ Confidence 111111110 00 0000000 00 0000000000 Q ss_pred eeecccccccchhhhhhhcccccchHHHHHHHHHHHHhhcCCCceecc--ccCeEE-ecchhHHHHHHHHhhcccCCccc Q lcl|NC_013594. 211 GASARRAAGYGFWQMAVAVKGDLTLDNLWKGWQLMRAFEGDGGKKLGL--KPTHIV-VPVGLEKAAEQLLNRELFADGNT 287 (305) Q Consensus 211 g~d~r~n~G~g~wq~A~~s~~~Lt~~~l~~ar~aM~~~k~~~G~~L~i--~P~~Lv-Vpp~le~~A~~ll~~~~~~~~~~ 287 (305) ..+.+++ | -++. +.+.++++.|+..|+++-. .|..|. +|. +.+...+.+.. T Consensus 189 --~~~~~~~---~--------vmn~----~~~~~L~~l~d~~G~~~~~~~~~~~l~G~PV---------~~~~~~~~~~~ 242 (324) T protein:vir:78 189 --DELEANA---F--------ISKT----QNRSLLRKIVDPETKERIYDRNSDSLDGLPV---------VNLKSSNLKRG 242 (324) T ss_pred --ccCCCCE---E--------EEcH----HHHHHHHHhhccCCCeeecCCCCCcccceee---------EeeCCCCCCcc Confidence 0011111 1 1222 2235677889999998622 222221 221 11111111110 Q ss_pred c-----ccccc---cceeeE--EecccC Q lcl|NC_013594. 288 T-----VSNEM---KGKLQL--VVADYL 305 (305) Q Consensus 288 ~-----~~N~~---~~~~~~--iv~p~L 305 (305) . -.+.+ ++-+++ .-++.+ T Consensus 243 ~~~~gd~~~~~~g~~~~~~i~~~~~~~~ 270 (324) T protein:vir:78 243 ELITGDFDKLIYGIPQLIEYKIDETAQL 270 (324) T ss_pred eEEEEecceEEEEEecCcEEEEeecccc Confidence 0 00111 111222 222222 No 60 >protein:vir:96392 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239648;genbank:gi:66395381;genbank:GeneID:5132868 Probab=32.37 E-value=1.4 Score=19.73 Aligned_cols=224 Identities=8% Similarity=0.052 Sum_probs=92.0 Q ss_pred CCcCHHHHHHHH-------------------------HHHHHHHHHHHhhhhhhhhceeeecCCccchhhhhhccCCCch Q lcl|NC_013594. 1 MIVTPASIKALM-------------------------TSWRKDFQGGLEDAPSQYNKIAMVVNSSTRSNTYGWLGKFPTL 55 (305) Q Consensus 1 M~i~~~~l~~L~-------------------------~~~~~~~~~~~~~a~~t~~~~a~~v~S~~~~~~y~~Lg~~P~l 55 (305) |..+..+++... ..+...+.+..... ....+++++++......+|..+..-|. T Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~-s~l~~l~~~~~~~~~~~~~p~~~~~~~- 81 (324) T protein:vir:96 4 TQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMEN-SKIMQLGKYEPMEGTEKKFTFWADKPG- 81 (324) T ss_pred chhhhHHHHHHHHHhhhhhhhccccccccCcCccccchhHHHHHHHHHHhh-chhhhhcceeeccCCceEEEEEecCcc- Confidence 222221111100 00111111111111 113344566664444444555544444 Q ss_pred hhhccceee---eccccccceeeeecccceeeccHHHhhccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCccccCC Q lcl|NC_013594. 56 KEWVGKRTI---QQMEAHGYSIANKTFEGTVGISRDDFEDDNLGIYAPIFQEMGRSAAVQPDELIFKLLKDGFTQPCYDG 132 (305) Q Consensus 56 ~ew~Ge~~~---~~l~~~~~~i~n~tfg~~i~i~R~~I~nDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~~~DG 132 (305) -.|++|-.- .++.=..-+++.++++..+.|||+.+.+.+..+..-+.+.++++.++..++.++. + .... ..+ T Consensus 82 a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~---G-~g~~-~~~ 156 (324) T protein:vir:96 82 AYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGIL---N-QGNN-PFG 156 (324) T ss_pred eeEecCCccccccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhc---c-CCCC-CcC Confidence 367655333 3344455677889999999999999999889999999999999999999886642 2 1110 111 Q ss_pred cccccccccccccccccchhhhhHhhhccCCCccce-ee-ecccccccccchhhcccchhhhhhhccccccchhccccee Q lcl|NC_013594. 133 QNFFDKEHPVYPNVDGTGSAVNTSNIVEQDSFSGLP-FY-LLDCSRAVKPLIFQERRKPELVARTRIDDDHVFMDNEFLF 210 (305) Q Consensus 133 k~fF~adH~~~~n~~~tg~~~~~s~l~~~~~~~G~~-~~-l~~~~~~vkP~i~Q~r~~~~f~a~t~~~~~nvf~~~~~~~ 210 (305) .-+.+.... .. ....+.. +- |.+..-.|.+ T Consensus 157 ~gi~~~~~~--------~~----------~~~~~~~t~~~i~~~~~~l~~------------------------------ 188 (324) T protein:vir:96 157 KSIAQSIEK--------TN----------KVIKGDFTQDNIIDLEALLED------------------------------ 188 (324) T ss_pred ccccccccc--------cc----------eeccccccHHHHHHHHHhhhh------------------------------ Confidence 111111110 00 0000000 00 0000000000 Q ss_pred eeecccccccchhhhhhhcccccchHHHHHHHHHHHHhhcCCCceecc--ccCeEE-ecchhHHHHHHHHhhcccCCccc Q lcl|NC_013594. 211 GASARRAAGYGFWQMAVAVKGDLTLDNLWKGWQLMRAFEGDGGKKLGL--KPTHIV-VPVGLEKAAEQLLNRELFADGNT 287 (305) Q Consensus 211 g~d~r~n~G~g~wq~A~~s~~~Lt~~~l~~ar~aM~~~k~~~G~~L~i--~P~~Lv-Vpp~le~~A~~ll~~~~~~~~~~ 287 (305) ..+.+++ | -++. +.+.++++.|+..|+++-. .|..|. +|. +.+...+.+.. T Consensus 189 --~~~~~~~---~--------vmn~----~~~~~L~~l~d~~G~~~~~~~~~~~l~G~PV---------~~~~~~~~~~~ 242 (324) T protein:vir:96 189 --DELEANA---F--------ISKT----QNRSLLRKIVDPETKERIYDRNSDSLDGLPV---------VNLKSSNLKRG 242 (324) T ss_pred --ccCCCCE---E--------EEcH----HHHHHHHHhhccCCCeeecCCCCCcccceee---------EeeCCCCCCcc Confidence 0011111 1 1222 2235677889999998622 222221 221 11111111110 Q ss_pred c-----ccccc---cceeeE--EecccC Q lcl|NC_013594. 288 T-----VSNEM---KGKLQL--VVADYL 305 (305) Q Consensus 288 ~-----~~N~~---~~~~~~--iv~p~L 305 (305) . -.+.+ ++-+++ .-++.+ T Consensus 243 ~~~~gd~~~~~~g~~~~~~i~~~~~~~~ 270 (324) T protein:vir:96 243 ELITGDFDKLIYGIPQLIEYKIDETAQL 270 (324) T ss_pred eEEEEecceEEEEEecCcEEEEeecccc Confidence 0 00111 111222 222222 No 61 >protein:vir:4953 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049929;genbank:gi:9632900;genbank:GeneID:1262076 Probab=32.35 E-value=1.4 Score=19.73 Aligned_cols=215 Identities=12% Similarity=0.110 Sum_probs=82.8 Q ss_pred CCcCHHHHHHHHHHHH---------------------------HHHHHHHhhhhhhhhceeeecCCccchhhhhh--ccC Q lcl|NC_013594. 1 MIVTPASIKALMTSWR---------------------------KDFQGGLEDAPSQYNKIAMVVNSSTRSNTYGW--LGK 51 (305) Q Consensus 1 M~i~~~~l~~L~~~~~---------------------------~~~~~~~~~a~~t~~~~a~~v~S~~~~~~y~~--Lg~ 51 (305) +.......+++...++ ..+.......-+ ..++|+.++.+.....+.+ ... T Consensus 84 ~~~~~~~~~~~~~~l~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~-l~~~~~~~~~~~~~~~~~~~~~~~ 162 (397) T protein:vir:49 84 EEVKAGFVKDFKNLVRGRYQNLLDSKTDASGSDAGLTIPQDIQTAIHTLVSQYDS-LQEYVNVENVTTLTGSRVYEKWTD 162 (397) T ss_pred hHHHHHHHHHHHHHHhcchhHHHHHhhccccccCcccccHhHHHHHHHHHHhhhh-HHhhhceeecccCccceEEEeecc Confidence 1111111111111111 111111111111 2334555543332222222 122 Q ss_pred CCchhhhccceeee----ccccccceeeeecccceeeccHHHhhccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCc Q lcl|NC_013594. 52 FPTLKEWVGKRTIQ----QMEAHGYSIANKTFEGTVGISRDDFEDDNLGIYAPIFQEMGRSAAVQPDELIFKLLKDGFTQ 127 (305) Q Consensus 52 ~P~l~ew~Ge~~~~----~l~~~~~~i~n~tfg~~i~i~R~~I~nDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~ 127 (305) .-..-.|++|..-. .+.=..-++..++++..+.||++.+.|-+.++..-+...++++.++.+++.++.-... +. T Consensus 163 ~~~~a~~v~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~ai~~G~g~--~~ 240 (397) T protein:vir:49 163 ITGLANIDDEAGKIADVDDPKLSLIKYTIKRYAGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILEAIAA--LP 240 (397) T ss_pred CCcceeeecCccccccccccceeeEEeeeeeEEeeehhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhccc--cc Confidence 22234677653321 2233455888999999999999999887888999999999999999998865432211 00 Q ss_pred cccCCcccccccccccccccccchhhhhHhhhccCCCccceeeecccccccccchhhcccchhhhhhhccccccchhccc Q lcl|NC_013594. 128 PCYDGQNFFDKEHPVYPNVDGTGSAVNTSNIVEQDSFSGLPFYLLDCSRAVKPLIFQERRKPELVARTRIDDDHVFMDNE 207 (305) Q Consensus 128 ~~~DGk~fF~adH~~~~n~~~tg~~~~~s~l~~~~~~~G~~~~l~~~~~~vkP~i~Q~r~~~~f~a~t~~~~~nvf~~~~ 207 (305) ++ +...+...+ ++....|+| T Consensus 241 ------------~~--------~~~~~~d~i-------------~~~~~~l~~--------------------------- 260 (397) T protein:vir:49 241 ------------TK--------PTLTKWDDI-------------IDLEAKVDP--------------------------- 260 (397) T ss_pred ------------cc--------cccccHHHH-------------HHHHHhhhh--------------------------- Confidence 00 000000000 000000000 Q ss_pred ceeeeecccccccchhhhhhhcccccchHHHHHHHHHHHHhhcCCCceeccc------cCeEE-ecchhHHHHHHHHhhc Q lcl|NC_013594. 208 FLFGASARRAAGYGFWQMAVAVKGDLTLDNLWKGWQLMRAFEGDGGKKLGLK------PTHIV-VPVGLEKAAEQLLNRE 280 (305) Q Consensus 208 ~~~g~d~r~n~G~g~wq~A~~s~~~Lt~~~l~~ar~aM~~~k~~~G~~L~i~------P~~Lv-Vpp~le~~A~~ll~~~ 280 (305) ..+.+++ | -++...+ ..+++.||.+|+||-.. |..|+ .|.- +..+. T Consensus 261 -----~~~~~a~---~--------vmn~~~~----~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~-------~~~~~ 313 (397) T protein:vir:49 261 -----AIKQTSF---F--------LTNTSGF----TALKKVKNALGDYLMERDVKSPTGYSIDGFAVK-------EVADR 313 (397) T ss_pred -----hhcCCCE---E--------EEcHHHH----HHHHHhhcCCCceeeccCcCCCCCceecceeeE-------Eeccc Confidence 0011111 1 1222222 44555666677765221 11111 0000 00001 Q ss_pred ccCCccccccccc------------cceeeEEecc-----------------cC Q lcl|NC_013594. 281 LFADGNTTVSNEM------------KGKLQLVVAD-----------------YL 305 (305) Q Consensus 281 ~~~~~~~~~~N~~------------~~~~~~iv~p-----------------~L 305 (305) ..+.+..+....+ ++-+++.+++ |+ T Consensus 314 ~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~ 367 (397) T protein:vir:49 314 WLANGTGGAMPLYFGDLKQAVTLFDRQHMSLLSTNIGGGAFETDTTKVRVIDRF 367 (397) T ss_pred ccccccCCceeEEEeeccceEEEEeecceEEEEeccccchhhcCceeEEEEeee Confidence 1111111111111 1122333332 22 No 62 >protein:vir:6242 Length: 390 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813696;swissprot:trembl:q859c1;genbank:gi:29366756;interpro:IPR006444;uniprot:Q859C1;genbank:GeneID:1258897 Probab=30.83 E-value=1.5 Score=19.55 Aligned_cols=237 Identities=13% Similarity=0.136 Sum_probs=90.9 Q ss_pred CC--cCHHHHHHHHH---------------------------HHHHHHHHHHhhhhhhhhceeeecCCcc-chhhhhhcc Q lcl|NC_013594. 1 MI--VTPASIKALMT---------------------------SWRKDFQGGLEDAPSQYNKIAMVVNSST-RSNTYGWLG 50 (305) Q Consensus 1 M~--i~~~~l~~L~~---------------------------~~~~~~~~~~~~a~~t~~~~a~~v~S~~-~~~~y~~Lg 50 (305) .. -..+.+++... -.++.+.+.....+ -.+.+|++++... ..-.+.... T Consensus 84 ~~~~~~~~~~r~~~~~~~r~~~~~~~~~~~t~~~~g~~~~~~~~~~~i~~~~~~~~-~l~~~~~~~~~~~~~~~~~p~~~ 162 (390) T protein:vir:62 84 SADVDDDATLRAGNLGEARSFEFAPEKRDGTKAGNPNVLSRTLYGQLIAQAVERSA-IMRGGATTFTTSDANPLDFTVIT 162 (390) T ss_pred hcchHHHHHHhhhhhhhhHHHHhhhhhhcccccCCCccccccchHHHHHHHHhhhh-hhhhcceeeecCCCceeEEEEEc Confidence 00 00000000000 01222222222222 2455666554222 111222232 Q ss_pred CCCchhhhccce---eeeccccccceeeeecccceeeccHHHhhccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCc Q lcl|NC_013594. 51 KFPTLKEWVGKR---TIQQMEAHGYSIANKTFEGTVGISRDDFEDDNLGIYAPIFQEMGRSAAVQPDELIFKLLKDGFTQ 127 (305) Q Consensus 51 ~~P~l~ew~Ge~---~~~~l~~~~~~i~n~tfg~~i~i~R~~I~nDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~ 127 (305) .-|. -.|++|- ...++.=..-++..++++..+.||++.+.|-...+..-+.+.++++.+...++.+. + | T Consensus 163 ~~~~-a~wv~E~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~l---~-G--- 234 (390) T protein:vir:62 163 GRSS-ASIVGETAEIPESYPATAQRSMGGFKYGFASVVSYEFATDQVLDLVGFLVSDAGPAIGDAMGRHFI---T-G--- 234 (390) T ss_pred CCcc-eeeecccccccccccceeeeEeeeeeEEeehHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhhhh---c-c--- Confidence 2232 3566543 33444445567888999999999999999888889999999999999999988543 2 2 Q ss_pred cccCCcc--ccccccccccccc-ccchhhhh---------------------------HhhhccCCCccceeeecccccc Q lcl|NC_013594. 128 PCYDGQN--FFDKEHPVYPNVD-GTGSAVNT---------------------------SNIVEQDSFSGLPFYLLDCSRA 177 (305) Q Consensus 128 ~~~DGk~--fF~adH~~~~n~~-~tg~~~~~---------------------------s~l~~~~~~~G~~~~l~~~~~~ 177 (305) ||+| +++..-+...... +.....+. ..|..-++..|.+++.-+... T Consensus 235 ---~G~p~Gi~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~l~~~~~~~a~~vmn~~~~~~L~~lkd~~g~~l~~~~~~~- 310 (390) T protein:vir:62 235 ---TGQPRGILTDASPATATFLATDTDSKVSDALIDLFHEVPSAYRANAKYVVNDLRAAQMRKLKDANGQYLWQSGLTV- 310 (390) T ss_pred ---CCccccccccccccccceecccccccchHHHHHHHHhhhhhhhcCCEEEEchHHHHHHHHhhccCCCeeecCCcCC- Confidence 2332 3332111000000 00001111 111112344454433221110 Q ss_pred cccchhhcccchhhhhhhccccccchhcccceeeeecccccccchhhhhhhcccccchHH-----HHHHHHHHHHhhcCC Q lcl|NC_013594. 178 VKPLIFQERRKPELVARTRIDDDHVFMDNEFLFGASARRAAGYGFWQMAVAVKGDLTLDN-----LWKGWQLMRAFEGDG 252 (305) Q Consensus 178 vkP~i~Q~r~~~~f~a~t~~~~~nvf~~~~~~~g~d~r~n~G~g~wq~A~~s~~~Lt~~~-----l~~ar~aM~~~k~~~ 252 (305) -.|-.+.-+ |..+...-|. +..+|| |... | ..+....++++. +.....+++...-.+ T Consensus 311 g~~~~l~G~--Pv~~~~~~p~-------~~i~~g-d~s~---~-----~i~~~~~~~v~~~~~~~~~~~~~~~~~~~r~d 372 (390) T protein:vir:62 311 GAPSLFNGK--VVETDDGMPA-------DKILFA-DLSK---Y-----RVRFAGSLRVDRSVDAKFSTDQIVYRFLQRAD 372 (390) T ss_pred Cccceeccc--ceEEecCCCC-------ccEEEe-eccc---e-----eEEeecceEEEeeccccccCCcEEEEEEEEeC Confidence 001001111 1111111121 122232 2111 0 011111222211 122223333333455 Q ss_pred CceeccccCe-E-Eecch Q lcl|NC_013594. 253 GKKLGLKPTH-I-VVPVG 268 (305) Q Consensus 253 G~~L~i~P~~-L-vVpp~ 268 (305) |.+++-..=. | |.+.+ T Consensus 373 ~~~~~~~A~~~l~~~~~a 390 (390) T protein:vir:62 373 GLLVDARGAKVLTVTPGA 390 (390) T ss_pred cEeechhheEEEEeecCC Confidence 6554333311 1 22333 No 63 >protein:vir:101650 Length: 497 # NCBI annotation: gp13 # Family: family:all:585 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654768;genbank:gi:109302766;genbank:GeneID:4156084 Probab=29.19 E-value=1.7 Score=19.35 Aligned_cols=279 Identities=9% Similarity=-0.024 Sum_probs=93.9 Q ss_pred CCcCHHH-H-HHHHHHHHHHHHHHHhhhhhhhhceeeecCCccchhhhhh-ccCCCchhhhccceeee---cccccccee Q lcl|NC_013594. 1 MIVTPAS-I-KALMTSWRKDFQGGLEDAPSQYNKIAMVVNSSTRSNTYGW-LGKFPTLKEWVGKRTIQ---QMEAHGYSI 74 (305) Q Consensus 1 M~i~~~~-l-~~L~~~~~~~~~~~~~~a~~t~~~~a~~v~S~~~~~~y~~-Lg~~P~l~ew~Ge~~~~---~l~~~~~~i 74 (305) |..+... - -.+-.-+...+.+-.. .......++++++-+...-+|.. .+.-| --.|+||-.-. ++.=..-++ T Consensus 151 ~~~~~~~~gg~~vp~~~~~~ii~~~~-~~~~i~~l~~~~~~~~~~~~~~~~~~~~~-~a~wv~E~~~~~~s~~~f~~i~~ 228 (497) T protein:vir:10 151 NPFGSTGTFAPGILPTFLPGIVEQLF-YELSLADLISSRPVTSPNLSYLTESAAHN-NAAAVAEAGTYPFSSEEFARVYE 228 (497) T ss_pred hhcccCcccccccchhhhHHHHHHHH-hhhhHHhhccccccCCCceEEEEEcCCCC-cceeeccCcccccccccceeeEe Confidence 1111000 0 0000001111111111 12334555665553333333332 22222 22477664433 333345677 Q ss_pred eeecccceeeccHHHhhccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCccccCCcccccc-cccccccccccchhh Q lcl|NC_013594. 75 ANKTFEGTVGISRDDFEDDNLGIYAPIFQEMGRSAAVQPDELIFKLLKDGFTQPCYDGQNFFDK-EHPVYPNVDGTGSAV 153 (305) Q Consensus 75 ~n~tfg~~i~i~R~~I~nDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~~~DGk~fF~a-dH~~~~n~~~tg~~~ 153 (305) ..++++..+.||++.+. |--.+-.-+...++++-++.++.- +|.+ ..+.-..| ++.. ... ....+.+... T Consensus 229 ~~~k~a~~~~iS~ell~-d~~~l~~~i~~~l~~~i~~~~d~~---~l~G-~G~~~p~G--il~~~~~~--~~~~~~~~~~ 299 (497) T protein:vir:10 229 QVGKVANALTITDEGLR-DAPELFNFVQGRLLEGIQRKEEVQ---LLAG-GGYPGVNG--LLQRSTGF--TASSASSLFG 299 (497) T ss_pred eeeeeEeecHhHHHHHH-hHHHHHHHHHHHHHHHHHHHHHHH---hhcC-CCcccccc--cccccccc--cccccccchh Confidence 88999999999999886 555677778888888888888754 3332 11111111 1110 000 0000000000 Q ss_pred hhHhh--hccCCCccc-eeeeccccc-ccccch--hhcccchhhhhhhccccccc----hhcccceeeeecccccccchh Q lcl|NC_013594. 154 NTSNI--VEQDSFSGL-PFYLLDCSR-AVKPLI--FQERRKPELVARTRIDDDHV----FMDNEFLFGASARRAAGYGFW 223 (305) Q Consensus 154 ~~s~l--~~~~~~~G~-~~~l~~~~~-~vkP~i--~Q~r~~~~f~a~t~~~~~nv----f~~~~~~~g~d~r~n~G~g~w 223 (305) ....+ ......+|. .+..-.... .++-.. ..............+++... |.--..++....+.. +-| T Consensus 300 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~ 376 (497) T protein:vir:10 300 ATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTP---NAV 376 (497) T ss_pred hhhhhhhhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcccCC---CeE Confidence 00000 000111111 111100000 000000 00000000001111111111 100000000000000 001 Q ss_pred hhhhhcccccchHHHHHHHHHHHHhhcCCCceeccccCeE-EecchhH---HHHHHHHhhcccCCccc--ccccc----- Q lcl|NC_013594. 224 QMAVAVKGDLTLDNLWKGWQLMRAFEGDGGKKLGLKPTHI-VVPVGLE---KAAEQLLNRELFADGNT--TVSNE----- 292 (305) Q Consensus 224 q~A~~s~~~Lt~~~l~~ar~aM~~~k~~~G~~L~i~P~~L-vVpp~le---~~A~~ll~~~~~~~~~~--~~~N~----- 292 (305) -++ .....++++.||.+|++|..-|... ...|... ...+-++.++.++.+.. ++.+. T Consensus 377 --------vmn----~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~pV~~t~~~~~~~~~~Gd~~~~~~~i 444 (497) T protein:vir:10 377 --------VMN----PRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQT 444 (497) T ss_pred --------EEc----hHHHHHHHHhhcCCCceeccCcccccccccccCCceeeceeeEecCCCCCCceEEeecccceEEE Confidence 122 2345678999999999875432111 0000000 00111122233333321 11111 Q ss_pred -ccceeeEEecccC Q lcl|NC_013594. 293 -MKGKLQLVVADYL 305 (305) Q Consensus 293 -~~~~~~~iv~p~L 305 (305) -+..++|.++++. T Consensus 445 ~~r~~~~v~~~~~~ 458 (497) T protein:vir:10 445 ARREGVTMQMTNSN 458 (497) T ss_pred EEecccEEEeeccc Confidence 1233455555543 No 64 >protein:vir:7855 Length: 497 # NCBI annotation: gp12 # Family: family:all:585 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817462;genbank:gi:29565891;genbank:GeneID:1259081 Probab=29.19 E-value=1.7 Score=19.35 Aligned_cols=279 Identities=9% Similarity=-0.024 Sum_probs=93.9 Q ss_pred CCcCHHH-H-HHHHHHHHHHHHHHHhhhhhhhhceeeecCCccchhhhhh-ccCCCchhhhccceeee---cccccccee Q lcl|NC_013594. 1 MIVTPAS-I-KALMTSWRKDFQGGLEDAPSQYNKIAMVVNSSTRSNTYGW-LGKFPTLKEWVGKRTIQ---QMEAHGYSI 74 (305) Q Consensus 1 M~i~~~~-l-~~L~~~~~~~~~~~~~~a~~t~~~~a~~v~S~~~~~~y~~-Lg~~P~l~ew~Ge~~~~---~l~~~~~~i 74 (305) |..+... - -.+-.-+...+.+-.. .......++++++-+...-+|.. .+.-| --.|+||-.-. ++.=..-++ T Consensus 151 ~~~~~~~~gg~~vp~~~~~~ii~~~~-~~~~i~~l~~~~~~~~~~~~~~~~~~~~~-~a~wv~E~~~~~~s~~~f~~i~~ 228 (497) T protein:vir:78 151 NPFGSTGTFAPGILPTFLPGIVEQLF-YELSLADLISSRPVTSPNLSYLTESAAHN-NAAAVAEAGTYPFSSEEFARVYE 228 (497) T ss_pred hhcccCcccccccchhhhHHHHHHHH-hhhhHHhhccccccCCCceEEEEEcCCCC-cceeeccCcccccccccceeeEe Confidence 1111000 0 0000001111111111 12334555665553333333332 22222 22477664433 333345677 Q ss_pred eeecccceeeccHHHhhccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCccccCCcccccc-cccccccccccchhh Q lcl|NC_013594. 75 ANKTFEGTVGISRDDFEDDNLGIYAPIFQEMGRSAAVQPDELIFKLLKDGFTQPCYDGQNFFDK-EHPVYPNVDGTGSAV 153 (305) Q Consensus 75 ~n~tfg~~i~i~R~~I~nDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~~~DGk~fF~a-dH~~~~n~~~tg~~~ 153 (305) ..++++..+.||++.+. |--.+-.-+...++++-++.++.- +|.+ ..+.-..| ++.. ... ....+.+... T Consensus 229 ~~~k~a~~~~iS~ell~-d~~~l~~~i~~~l~~~i~~~~d~~---~l~G-~G~~~p~G--il~~~~~~--~~~~~~~~~~ 299 (497) T protein:vir:78 229 QVGKVANALTITDEGLR-DAPELFNFVQGRLLEGIQRKEEVQ---LLAG-GGYPGVNG--LLQRSTGF--TASSASSLFG 299 (497) T ss_pred eeeeeEeecHhHHHHHH-hHHHHHHHHHHHHHHHHHHHHHHH---hhcC-CCcccccc--cccccccc--cccccccchh Confidence 88999999999999886 555677778888888888888754 3332 11111111 1110 000 0000000000 Q ss_pred hhHhh--hccCCCccc-eeeeccccc-ccccch--hhcccchhhhhhhccccccc----hhcccceeeeecccccccchh Q lcl|NC_013594. 154 NTSNI--VEQDSFSGL-PFYLLDCSR-AVKPLI--FQERRKPELVARTRIDDDHV----FMDNEFLFGASARRAAGYGFW 223 (305) Q Consensus 154 ~~s~l--~~~~~~~G~-~~~l~~~~~-~vkP~i--~Q~r~~~~f~a~t~~~~~nv----f~~~~~~~g~d~r~n~G~g~w 223 (305) ....+ ......+|. .+..-.... .++-.. ..............+++... |.--..++....+.. +-| T Consensus 300 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~ 376 (497) T protein:vir:78 300 ATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTP---NAV 376 (497) T ss_pred hhhhhhhhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcccCC---CeE Confidence 00000 000111111 111100000 000000 00000000001111111111 100000000000000 001 Q ss_pred hhhhhcccccchHHHHHHHHHHHHhhcCCCceeccccCeE-EecchhH---HHHHHHHhhcccCCccc--ccccc----- Q lcl|NC_013594. 224 QMAVAVKGDLTLDNLWKGWQLMRAFEGDGGKKLGLKPTHI-VVPVGLE---KAAEQLLNRELFADGNT--TVSNE----- 292 (305) Q Consensus 224 q~A~~s~~~Lt~~~l~~ar~aM~~~k~~~G~~L~i~P~~L-vVpp~le---~~A~~ll~~~~~~~~~~--~~~N~----- 292 (305) -++ .....++++.||.+|++|..-|... ...|... ...+-++.++.++.+.. ++.+. T Consensus 377 --------vmn----~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~pV~~t~~~~~~~~~~Gd~~~~~~~i 444 (497) T protein:vir:78 377 --------VMN----PRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQT 444 (497) T ss_pred --------EEc----hHHHHHHHHhhcCCCceeccCcccccccccccCCceeeceeeEecCCCCCCceEEeecccceEEE Confidence 122 2345678999999999875432111 0000000 00111122233333321 11111 Q ss_pred -ccceeeEEecccC Q lcl|NC_013594. 293 -MKGKLQLVVADYL 305 (305) Q Consensus 293 -~~~~~~~iv~p~L 305 (305) -+..++|.++++. T Consensus 445 ~~r~~~~v~~~~~~ 458 (497) T protein:vir:78 445 ARREGVTMQMTNSN 458 (497) T ss_pred EEecccEEEeeccc Confidence 1233455555543 No 65 >protein:vir:107593 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338188;genbank:gi:77020144;genbank:GeneID:3703724 Probab=27.86 E-value=1.3 Score=19.88 Aligned_cols=265 Identities=13% Similarity=0.098 Sum_probs=86.8 Q ss_pred CCcC-HHHHH-HHHHHHHHHHHHHHhhhhhhhhceeeecC--CccchhhhhhccCCCchhhhccceeee----ccccccc Q lcl|NC_013594. 1 MIVT-PASIK-ALMTSWRKDFQGGLEDAPSQYNKIAMVVN--SSTRSNTYGWLGKFPTLKEWVGKRTIQ----QMEAHGY 72 (305) Q Consensus 1 M~i~-~~~l~-~L~~~~~~~~~~~~~~a~~t~~~~a~~v~--S~~~~~~y~~Lg~~P~l~ew~Ge~~~~----~l~~~~~ 72 (305) |..+ .++=. .+=+.+...+.+.....-+ -.++|+.++ +......+....+.+.. .|++|..-. .+.=..- T Consensus 106 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~-l~~~~~~~~~~~~~~~~~~~~~~~~~~a-~~v~E~~~~~~~~~~~~~~v 183 (392) T protein:vir:10 106 MSGLTGEDGGLVIPQDIQTQINELARSFDA-LEQYVTVEPVRTRSGSRVLEKNSDMIPF-AEITEMGEIPETDNPKFSNV 183 (392) T ss_pred ccccccCCCceecchhHHHHHHHHHHhhhh-hhhhceeeeccCCceeEEEEeecCCccc-eeecccccccccccccceeE Confidence 1100 00000 0000011111111111111 223444433 22222222223333333 466553221 2333556 Q ss_pred eeeeecccceeeccHHHhhccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCccc--cCCc-ccccccc-ccc-cccc Q lcl|NC_013594. 73 SIANKTFEGTVGISRDDFEDDNLGIYAPIFQEMGRSAAVQPDELIFKLLKDGFTQPC--YDGQ-NFFDKEH-PVY-PNVD 147 (305) Q Consensus 73 ~i~n~tfg~~i~i~R~~I~nDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~~--~DGk-~fF~adH-~~~-~n~~ 147 (305) ++..++++..+.|||+.+.|-+..+..-+.+.++++.++.++..+......+..+.. ||.- ..+...- +.+ .|.. T Consensus 184 ~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~~~~~~~d~i~~~~~~~l~~~~~~~a~ 263 (392) T protein:vir:10 184 QYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTKQAIKSLDDIKDVLNVKLDPAISPNAI 263 (392) T ss_pred EeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCccCHHHHHHHHHHhhhhhhccCCE Confidence 788899999999999999877788899999999999999999888765543221110 1110 0000000 000 0000 Q ss_pred ccchhhhhHhhhccCCCccceeeecccccccccchhhcccchhhhhhhccccccchhcc-cceeeeecccccccchhhhh Q lcl|NC_013594. 148 GTGSAVNTSNIVEQDSFSGLPFYLLDCSRAVKPLIFQERRKPELVARTRIDDDHVFMDN-EFLFGASARRAAGYGFWQMA 226 (305) Q Consensus 148 ~tg~~~~~s~l~~~~~~~G~~~~l~~~~~~vkP~i~Q~r~~~~f~a~t~~~~~nvf~~~-~~~~g~d~r~n~G~g~wq~A 226 (305) =--+......|..-++..|.+++.-+....-.+.|+-.+.-. .+....+.+..+-..+ .++|| |.....-- T Consensus 264 ~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~tllG~~~v~-~~~~~~~~~~~~~~~~~~~~~g-dfs~~~~i------ 335 (392) T protein:vir:10 264 LLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVV-VVSNRFLKSKGTTAKKAPLIIG-DLKEAIVL------ 335 (392) T ss_pred EEEcHHHHHHHHHhhccCCCeEeecCccCCccccccCcccEE-EecccccCCCcccCCceEEEEE-ehhceEEE------ Confidence 000011112233335556666553321111111111100000 0000000000111111 11221 21110000 Q ss_pred hhcccccchHH-------HHHHHHHHHHhhcCCCceeccccCeEEecchhHHHHHHHHhhcc-cCCc Q lcl|NC_013594. 227 VAVKGDLTLDN-------LWKGWQLMRAFEGDGGKKLGLKPTHIVVPVGLEKAAEQLLNREL-FADG 285 (305) Q Consensus 227 ~~s~~~Lt~~~-------l~~ar~aM~~~k~~~G~~L~i~P~~LvVpp~le~~A~~ll~~~~-~~~~ 285 (305) +....++.+- +..-..+.+.....+|.++. |.-+++-. .. ..+.. -|.| T Consensus 336 -~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~--~~a~~~l~-~~------~~a~~~~~~~ 392 (392) T protein:vir:10 336 -FKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWD--NEAAVYGE-ID------LSAPVEQPQG 392 (392) T ss_pred -EeecceEEEEeccccchhhcCceEEEEEEeeccEEec--ccceEEEE-ec------ccccccCCCC Confidence 0011111111 11111111111112222221 22111100 00 00000 1222 No 66 >protein:vir:102082 Length: 392 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512315;genbank:gi:89152484;genbank:GeneID:3953075 Probab=27.86 E-value=1.3 Score=19.88 Aligned_cols=265 Identities=13% Similarity=0.098 Sum_probs=86.8 Q ss_pred CCcC-HHHHH-HHHHHHHHHHHHHHhhhhhhhhceeeecC--CccchhhhhhccCCCchhhhccceeee----ccccccc Q lcl|NC_013594. 1 MIVT-PASIK-ALMTSWRKDFQGGLEDAPSQYNKIAMVVN--SSTRSNTYGWLGKFPTLKEWVGKRTIQ----QMEAHGY 72 (305) Q Consensus 1 M~i~-~~~l~-~L~~~~~~~~~~~~~~a~~t~~~~a~~v~--S~~~~~~y~~Lg~~P~l~ew~Ge~~~~----~l~~~~~ 72 (305) |..+ .++=. .+=+.+...+.+.....-+ -.++|+.++ +......+....+.+.. .|++|..-. .+.=..- T Consensus 106 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~-l~~~~~~~~~~~~~~~~~~~~~~~~~~a-~~v~E~~~~~~~~~~~~~~v 183 (392) T protein:vir:10 106 MSGLTGEDGGLVIPQDIQTQINELARSFDA-LEQYVTVEPVRTRSGSRVLEKNSDMIPF-AEITEMGEIPETDNPKFSNV 183 (392) T ss_pred ccccccCCCceecchhHHHHHHHHHHhhhh-hhhhceeeeccCCceeEEEEeecCCccc-eeecccccccccccccceeE Confidence 1100 00000 0000011111111111111 223444433 22222222223333333 466553221 2333556 Q ss_pred eeeeecccceeeccHHHhhccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCccc--cCCc-ccccccc-ccc-cccc Q lcl|NC_013594. 73 SIANKTFEGTVGISRDDFEDDNLGIYAPIFQEMGRSAAVQPDELIFKLLKDGFTQPC--YDGQ-NFFDKEH-PVY-PNVD 147 (305) Q Consensus 73 ~i~n~tfg~~i~i~R~~I~nDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~~--~DGk-~fF~adH-~~~-~n~~ 147 (305) ++..++++..+.|||+.+.|-+..+..-+.+.++++.++.++..+......+..+.. ||.- ..+...- +.+ .|.. T Consensus 184 ~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~~~~~~~d~i~~~~~~~l~~~~~~~a~ 263 (392) T protein:vir:10 184 QYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTKQAIKSLDDIKDVLNVKLDPAISPNAI 263 (392) T ss_pred EeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCccCHHHHHHHHHHhhhhhhccCCE Confidence 788899999999999999877788899999999999999999888765543221110 1110 0000000 000 0000 Q ss_pred ccchhhhhHhhhccCCCccceeeecccccccccchhhcccchhhhhhhccccccchhcc-cceeeeecccccccchhhhh Q lcl|NC_013594. 148 GTGSAVNTSNIVEQDSFSGLPFYLLDCSRAVKPLIFQERRKPELVARTRIDDDHVFMDN-EFLFGASARRAAGYGFWQMA 226 (305) Q Consensus 148 ~tg~~~~~s~l~~~~~~~G~~~~l~~~~~~vkP~i~Q~r~~~~f~a~t~~~~~nvf~~~-~~~~g~d~r~n~G~g~wq~A 226 (305) =--+......|..-++..|.+++.-+....-.+.|+-.+.-. .+....+.+..+-..+ .++|| |.....-- T Consensus 264 ~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~tllG~~~v~-~~~~~~~~~~~~~~~~~~~~~g-dfs~~~~i------ 335 (392) T protein:vir:10 264 LLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVV-VVSNRFLKSKGTTAKKAPLIIG-DLKEAIVL------ 335 (392) T ss_pred EEEcHHHHHHHHHhhccCCCeEeecCccCCccccccCcccEE-EecccccCCCcccCCceEEEEE-ehhceEEE------ Confidence 000011112233335556666553321111111111100000 0000000000111111 11221 21110000 Q ss_pred hhcccccchHH-------HHHHHHHHHHhhcCCCceeccccCeEEecchhHHHHHHHHhhcc-cCCc Q lcl|NC_013594. 227 VAVKGDLTLDN-------LWKGWQLMRAFEGDGGKKLGLKPTHIVVPVGLEKAAEQLLNREL-FADG 285 (305) Q Consensus 227 ~~s~~~Lt~~~-------l~~ar~aM~~~k~~~G~~L~i~P~~LvVpp~le~~A~~ll~~~~-~~~~ 285 (305) +....++.+- +..-..+.+.....+|.++. |.-+++-. .. ..+.. -|.| T Consensus 336 -~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~--~~a~~~l~-~~------~~a~~~~~~~ 392 (392) T protein:vir:10 336 -FKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWD--NEAAVYGE-ID------LSAPVEQPQG 392 (392) T ss_pred -EeecceEEEEeccccchhhcCceEEEEEEeeccEEec--ccceEEEE-ec------ccccccCCCC Confidence 0011111111 11111111111112222221 22111100 00 00000 1222 No 67 >protein:vir:102873 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338137;genbank:gi:77020198;genbank:GeneID:3703782 Probab=27.86 E-value=1.3 Score=19.88 Aligned_cols=265 Identities=13% Similarity=0.098 Sum_probs=86.8 Q ss_pred CCcC-HHHHH-HHHHHHHHHHHHHHhhhhhhhhceeeecC--CccchhhhhhccCCCchhhhccceeee----ccccccc Q lcl|NC_013594. 1 MIVT-PASIK-ALMTSWRKDFQGGLEDAPSQYNKIAMVVN--SSTRSNTYGWLGKFPTLKEWVGKRTIQ----QMEAHGY 72 (305) Q Consensus 1 M~i~-~~~l~-~L~~~~~~~~~~~~~~a~~t~~~~a~~v~--S~~~~~~y~~Lg~~P~l~ew~Ge~~~~----~l~~~~~ 72 (305) |..+ .++=. .+=+.+...+.+.....-+ -.++|+.++ +......+....+.+.. .|++|..-. .+.=..- T Consensus 106 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~-l~~~~~~~~~~~~~~~~~~~~~~~~~~a-~~v~E~~~~~~~~~~~~~~v 183 (392) T protein:vir:10 106 MSGLTGEDGGLVIPQDIQTQINELARSFDA-LEQYVTVEPVRTRSGSRVLEKNSDMIPF-AEITEMGEIPETDNPKFSNV 183 (392) T ss_pred ccccccCCCceecchhHHHHHHHHHHhhhh-hhhhceeeeccCCceeEEEEeecCCccc-eeecccccccccccccceeE Confidence 1100 00000 0000011111111111111 223444433 22222222223333333 466553221 2333556 Q ss_pred eeeeecccceeeccHHHhhccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCccc--cCCc-ccccccc-ccc-cccc Q lcl|NC_013594. 73 SIANKTFEGTVGISRDDFEDDNLGIYAPIFQEMGRSAAVQPDELIFKLLKDGFTQPC--YDGQ-NFFDKEH-PVY-PNVD 147 (305) Q Consensus 73 ~i~n~tfg~~i~i~R~~I~nDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~~--~DGk-~fF~adH-~~~-~n~~ 147 (305) ++..++++..+.|||+.+.|-+..+..-+.+.++++.++.++..+......+..+.. ||.- ..+...- +.+ .|.. T Consensus 184 ~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~~~~~~~d~i~~~~~~~l~~~~~~~a~ 263 (392) T protein:vir:10 184 QYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTKQAIKSLDDIKDVLNVKLDPAISPNAI 263 (392) T ss_pred EeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCccCHHHHHHHHHHhhhhhhccCCE Confidence 788899999999999999877788899999999999999999888765543221110 1110 0000000 000 0000 Q ss_pred ccchhhhhHhhhccCCCccceeeecccccccccchhhcccchhhhhhhccccccchhcc-cceeeeecccccccchhhhh Q lcl|NC_013594. 148 GTGSAVNTSNIVEQDSFSGLPFYLLDCSRAVKPLIFQERRKPELVARTRIDDDHVFMDN-EFLFGASARRAAGYGFWQMA 226 (305) Q Consensus 148 ~tg~~~~~s~l~~~~~~~G~~~~l~~~~~~vkP~i~Q~r~~~~f~a~t~~~~~nvf~~~-~~~~g~d~r~n~G~g~wq~A 226 (305) =--+......|..-++..|.+++.-+....-.+.|+-.+.-. .+....+.+..+-..+ .++|| |.....-- T Consensus 264 ~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~tllG~~~v~-~~~~~~~~~~~~~~~~~~~~~g-dfs~~~~i------ 335 (392) T protein:vir:10 264 LLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVV-VVSNRFLKSKGTTAKKAPLIIG-DLKEAIVL------ 335 (392) T ss_pred EEEcHHHHHHHHHhhccCCCeEeecCccCCccccccCcccEE-EecccccCCCcccCCceEEEEE-ehhceEEE------ Confidence 000011112233335556666553321111111111100000 0000000000111111 11221 21110000 Q ss_pred hhcccccchHH-------HHHHHHHHHHhhcCCCceeccccCeEEecchhHHHHHHHHhhcc-cCCc Q lcl|NC_013594. 227 VAVKGDLTLDN-------LWKGWQLMRAFEGDGGKKLGLKPTHIVVPVGLEKAAEQLLNREL-FADG 285 (305) Q Consensus 227 ~~s~~~Lt~~~-------l~~ar~aM~~~k~~~G~~L~i~P~~LvVpp~le~~A~~ll~~~~-~~~~ 285 (305) +....++.+- +..-..+.+.....+|.++. |.-+++-. .. ..+.. -|.| T Consensus 336 -~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~--~~a~~~l~-~~------~~a~~~~~~~ 392 (392) T protein:vir:10 336 -FKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWD--NEAAVYGE-ID------LSAPVEQPQG 392 (392) T ss_pred -EeecceEEEEeccccchhhcCceEEEEEEeeccEEec--ccceEEEE-ec------ccccccCCCC Confidence 0011111111 11111111111112222221 22111100 00 00000 1222 No 68 >protein:vir:105004 Length: 392 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459969;genbank:gi:85701384;genbank:GeneID:3882145 Probab=27.86 E-value=1.3 Score=19.88 Aligned_cols=265 Identities=13% Similarity=0.098 Sum_probs=86.8 Q ss_pred CCcC-HHHHH-HHHHHHHHHHHHHHhhhhhhhhceeeecC--CccchhhhhhccCCCchhhhccceeee----ccccccc Q lcl|NC_013594. 1 MIVT-PASIK-ALMTSWRKDFQGGLEDAPSQYNKIAMVVN--SSTRSNTYGWLGKFPTLKEWVGKRTIQ----QMEAHGY 72 (305) Q Consensus 1 M~i~-~~~l~-~L~~~~~~~~~~~~~~a~~t~~~~a~~v~--S~~~~~~y~~Lg~~P~l~ew~Ge~~~~----~l~~~~~ 72 (305) |..+ .++=. .+=+.+...+.+.....-+ -.++|+.++ +......+....+.+.. .|++|..-. .+.=..- T Consensus 106 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~-l~~~~~~~~~~~~~~~~~~~~~~~~~~a-~~v~E~~~~~~~~~~~~~~v 183 (392) T protein:vir:10 106 MSGLTGEDGGLVIPQDIQTQINELARSFDA-LEQYVTVEPVRTRSGSRVLEKNSDMIPF-AEITEMGEIPETDNPKFSNV 183 (392) T ss_pred ccccccCCCceecchhHHHHHHHHHHhhhh-hhhhceeeeccCCceeEEEEeecCCccc-eeecccccccccccccceeE Confidence 1100 00000 0000011111111111111 223444433 22222222223333333 466553221 2333556 Q ss_pred eeeeecccceeeccHHHhhccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCccc--cCCc-ccccccc-ccc-cccc Q lcl|NC_013594. 73 SIANKTFEGTVGISRDDFEDDNLGIYAPIFQEMGRSAAVQPDELIFKLLKDGFTQPC--YDGQ-NFFDKEH-PVY-PNVD 147 (305) Q Consensus 73 ~i~n~tfg~~i~i~R~~I~nDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~~--~DGk-~fF~adH-~~~-~n~~ 147 (305) ++..++++..+.|||+.+.|-+..+..-+.+.++++.++.++..+......+..+.. ||.- ..+...- +.+ .|.. T Consensus 184 ~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~~~~~~~d~i~~~~~~~l~~~~~~~a~ 263 (392) T protein:vir:10 184 QYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTKQAIKSLDDIKDVLNVKLDPAISPNAI 263 (392) T ss_pred EeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCccCHHHHHHHHHHhhhhhhccCCE Confidence 788899999999999999877788899999999999999999888765543221110 1110 0000000 000 0000 Q ss_pred ccchhhhhHhhhccCCCccceeeecccccccccchhhcccchhhhhhhccccccchhcc-cceeeeecccccccchhhhh Q lcl|NC_013594. 148 GTGSAVNTSNIVEQDSFSGLPFYLLDCSRAVKPLIFQERRKPELVARTRIDDDHVFMDN-EFLFGASARRAAGYGFWQMA 226 (305) Q Consensus 148 ~tg~~~~~s~l~~~~~~~G~~~~l~~~~~~vkP~i~Q~r~~~~f~a~t~~~~~nvf~~~-~~~~g~d~r~n~G~g~wq~A 226 (305) =--+......|..-++..|.+++.-+....-.+.|+-.+.-. .+....+.+..+-..+ .++|| |.....-- T Consensus 264 ~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~tllG~~~v~-~~~~~~~~~~~~~~~~~~~~~g-dfs~~~~i------ 335 (392) T protein:vir:10 264 LLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVV-VVSNRFLKSKGTTAKKAPLIIG-DLKEAIVL------ 335 (392) T ss_pred EEEcHHHHHHHHHhhccCCCeEeecCccCCccccccCcccEE-EecccccCCCcccCCceEEEEE-ehhceEEE------ Confidence 000011112233335556666553321111111111100000 0000000000111111 11221 21110000 Q ss_pred hhcccccchHH-------HHHHHHHHHHhhcCCCceeccccCeEEecchhHHHHHHHHhhcc-cCCc Q lcl|NC_013594. 227 VAVKGDLTLDN-------LWKGWQLMRAFEGDGGKKLGLKPTHIVVPVGLEKAAEQLLNREL-FADG 285 (305) Q Consensus 227 ~~s~~~Lt~~~-------l~~ar~aM~~~k~~~G~~L~i~P~~LvVpp~le~~A~~ll~~~~-~~~~ 285 (305) +....++.+- +..-..+.+.....+|.++. |.-+++-. .. ..+.. -|.| T Consensus 336 -~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~--~~a~~~l~-~~------~~a~~~~~~~ 392 (392) T protein:vir:10 336 -FKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWD--NEAAVYGE-ID------LSAPVEQPQG 392 (392) T ss_pred -EeecceEEEEeccccchhhcCceEEEEEEeeccEEec--ccceEEEE-ec------ccccccCCCC Confidence 0011111111 11111111111112222221 22111100 00 00000 1222 No 69 >protein:vir:2430 Length: 318 # NCBI annotation: major head subunit # Family: family:all:507 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046832;genbank:gi:9630400;genbank:GeneID:1261582 Probab=27.38 E-value=1.8 Score=19.12 Aligned_cols=223 Identities=12% Similarity=0.124 Sum_probs=92.6 Q ss_pred CCcCHH-HHHH-HHHHHHHHHHHHHhhhhhhhhceeeecCCccchhhhhhccCCCchhhhccceeee---ccccccceee Q lcl|NC_013594. 1 MIVTPA-SIKA-LMTSWRKDFQGGLEDAPSQYNKIAMVVNSSTRSNTYGWLGKFPTLKEWVGKRTIQ---QMEAHGYSIA 75 (305) Q Consensus 1 M~i~~~-~l~~-L~~~~~~~~~~~~~~a~~t~~~~a~~v~S~~~~~~y~~Lg~~P~l~ew~Ge~~~~---~l~~~~~~i~ 75 (305) |..+.. .-.+ |-..+...+.+...... -..++|+.++-....-.|.....-|.. .|+||-.-. +++=..-+++ T Consensus 14 ~~~~~~~~~~~~ip~~~~~~ii~~~~~~~-~l~~~~~~~~~~~~~~~ip~~~~~~~a-~~v~Eg~~~~~~~~~f~~i~~~ 91 (318) T protein:vir:24 14 IAQTGDTMFKGYLEPEQAKDYFAEAEKTS-IVQQFAQKVPMGTTGQKIPHWVGDVSA-QWIGEGDMKPITKGNMTSQTIA 91 (318) T ss_pred hhcccCcccceeechhHHHHHHHHHHhhc-hhhhhcceeeccCCceEEEEEeCCcce-EEecCCccccccccceeEEEEe Confidence 221110 0000 00011122222222222 235556666644444455555555553 687664433 3333445677 Q ss_pred eecccceeeccHHHhhccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCccccCCcc--cccccccccccccccchhh Q lcl|NC_013594. 76 NKTFEGTVGISRDDFEDDNLGIYAPIFQEMGRSAAVQPDELIFKLLKDGFTQPCYDGQN--FFDKEHPVYPNVDGTGSAV 153 (305) Q Consensus 76 n~tfg~~i~i~R~~I~nDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~~~DGk~--fF~adH~~~~n~~~tg~~~ 153 (305) .++++..+.||++.+.|....+..-+.+.++++.++.+++.++ + |.+. |++ +.+.... . T Consensus 92 ~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~~~~~~~d~a~l---~-G~g~----~~~~~~~~~~~~-----------~ 152 (318) T protein:vir:24 92 PHKIATIFVASAETVRANPANYLGTMRTKVATAFAMAFDGAAM---H-GTDS----PFPTYIGQTTKA-----------I 152 (318) T ss_pred eEEEEEeehhhHHHhhcChHHHHHHHHHHHHHHHHHHHHHhhh---c-ccCC----CCCccccccccc-----------c Confidence 8999999999999999988889999999999999999988664 2 2121 222 1111110 0 Q ss_pred hhHhhhccCCCccceee---ecccccccccchhhcccchhhhhhhccccccchhcccceeeeecccccccchhhhhhhcc Q lcl|NC_013594. 154 NTSNIVEQDSFSGLPFY---LLDCSRAVKPLIFQERRKPELVARTRIDDDHVFMDNEFLFGASARRAAGYGFWQMAVAVK 230 (305) Q Consensus 154 ~~s~l~~~~~~~G~~~~---l~~~~~~vkP~i~Q~r~~~~f~a~t~~~~~nvf~~~~~~~g~d~r~n~G~g~wq~A~~s~ 230 (305) .... ...+..+. +++....+. ...+.+.. | T Consensus 153 ~~~~-----~~~~~~~~~~~~~~~~~~~~--------------------------------~~~~~~~~---~------- 185 (318) T protein:vir:24 153 SIAD-----TTGATTVYDQVAVNGLSLLV--------------------------------NDGKKWTH---T------- 185 (318) T ss_pred cccc-----cccccchHHHHHHHHHHhhc--------------------------------cccCCCCE---E------- Confidence 0000 00000000 000000000 00011110 0 Q ss_pred cccchHHHHHHHHHHHHhhcCCCceecccc------------CeEEecchhHHHHHHHHhhcccCCcccc-c----cccc Q lcl|NC_013594. 231 GDLTLDNLWKGWQLMRAFEGDGGKKLGLKP------------THIVVPVGLEKAAEQLLNRELFADGNTT-V----SNEM 293 (305) Q Consensus 231 ~~Lt~~~l~~ar~aM~~~k~~~G~~L~i~P------------~~LvVpp~le~~A~~ll~~~~~~~~~~~-~----~N~~ 293 (305) -++... +..+++.||.+|++|-... ..+.+|.- -++.++.+..- . ...+ T Consensus 186 -v~n~~~----~~~L~~lkd~~G~~l~~~~~~~~~~~~~~~~~i~g~pv~---------~~~~~~~~~~~~~~gdfs~~~ 251 (318) T protein:vir:24 186 -LLDDIT----EPILNGAKDQNGRPLFIESTYGEAASPFRSGRIVARPTI---------LSDHVVEGTTVGFMGDFSQLI 251 (318) T ss_pred -EEcHHH----HHHHHHhhccCCceeecCccccCccccccCceEEEEeeE---------EeCCCCCCccEEEEeecceEE Confidence 112222 2455677888888763211 01111111 11122222210 0 0111 Q ss_pred c---c--eeeEEecccC Q lcl|NC_013594. 294 K---G--KLQLVVADYL 305 (305) Q Consensus 294 ~---~--~~~~iv~p~L 305 (305) . + .+++.-+..| T Consensus 252 ~~~~~~l~i~~~~~~~~ 268 (318) T protein:vir:24 252 WGQIGGLSFDVTDQATL 268 (318) T ss_pred EEEecCeEEEEeeccce Confidence 1 1 1222223333 No 70 >protein:vir:97148 Length: 324 # NCBI annotation: ORF010 # Family: family:all:507 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239726;genbank:gi:66394880;genbank:GeneID:5130881 Probab=25.55 E-value=2 Score=18.88 Aligned_cols=226 Identities=10% Similarity=0.066 Sum_probs=92.0 Q ss_pred CCcCHHHHHHHHH----------------------------HHHHHHHHHHhhhhhhhhceeeecCCccchhhhhhccCC Q lcl|NC_013594. 1 MIVTPASIKALMT----------------------------SWRKDFQGGLEDAPSQYNKIAMVVNSSTRSNTYGWLGKF 52 (305) Q Consensus 1 M~i~~~~l~~L~~----------------------------~~~~~~~~~~~~a~~t~~~~a~~v~S~~~~~~y~~Lg~~ 52 (305) |.=++..-..+.. .+...+.+...+. ....+.|+.++.......|..+..- T Consensus 1 ~~~~~~~~~~~~~f~~~~~~~~~~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~-s~l~~~~~~~~~~~~~~~ip~~~~~ 79 (324) T protein:vir:97 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMEN-SKIMQLGKYEPMEGTEKKFTFWADK 79 (324) T ss_pred CccchhHHHHHHHHHHhhhhhhhhccccccccCCCcceechhHHHHHHHHHHhh-cchhhhcceeeccCCceEEEEEecC Confidence 2221111111100 0111111111111 1134456666654444455544444 Q ss_pred Cchhhhccceee---eccccccceeeeecccceeeccHHHhhccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCccc Q lcl|NC_013594. 53 PTLKEWVGKRTI---QQMEAHGYSIANKTFEGTVGISRDDFEDDNLGIYAPIFQEMGRSAAVQPDELIFKLLKDGFTQPC 129 (305) Q Consensus 53 P~l~ew~Ge~~~---~~l~~~~~~i~n~tfg~~i~i~R~~I~nDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~~ 129 (305) |. -.|+||..- .++.=..-++..++++..+.|||+.+.|-...+..-+.+.++++.++.+++.++. + .... T Consensus 80 ~~-a~~v~Eg~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~---G-~g~~- 153 (324) T protein:vir:97 80 PG-AYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGIL---N-QGNN- 153 (324) T ss_pred cc-eeEeccCccccccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhc---c-CCCC- Confidence 44 367765443 3333355677889999999999999998888999999999999999999886553 2 1110 Q ss_pred cCCcccccccccccccccccchhhhhHhhhccCCCccceeeecccccccccchhhcccchhhhhhhccccccchhcccce Q lcl|NC_013594. 130 YDGQNFFDKEHPVYPNVDGTGSAVNTSNIVEQDSFSGLPFYLLDCSRAVKPLIFQERRKPELVARTRIDDDHVFMDNEFL 209 (305) Q Consensus 130 ~DGk~fF~adH~~~~n~~~tg~~~~~s~l~~~~~~~G~~~~l~~~~~~vkP~i~Q~r~~~~f~a~t~~~~~nvf~~~~~~ 209 (305) ..+.-++++.+.. + ..+....+..++ .+..-.|++ T Consensus 154 ~~~~gi~~~~~~~--~-~~~~~~~~~~~i-------------~~~~~~l~~----------------------------- 188 (324) T protein:vir:97 154 PFGKSIAQSIEKT--N-KVIKGDFTQDNI-------------IDLEALLED----------------------------- 188 (324) T ss_pred ccCcccccccccc--c-eeccccCCHHHH-------------HHHHHhhhh----------------------------- Confidence 0111122211100 0 000000000000 000000000 Q ss_pred eeeecccccccchhhhhhhcccccchHHHHHHHHHHHHhhcCCCceecc--ccCeEE-ecchhHHHHHHHHhhcccCCcc Q lcl|NC_013594. 210 FGASARRAAGYGFWQMAVAVKGDLTLDNLWKGWQLMRAFEGDGGKKLGL--KPTHIV-VPVGLEKAAEQLLNRELFADGN 286 (305) Q Consensus 210 ~g~d~r~n~G~g~wq~A~~s~~~Lt~~~l~~ar~aM~~~k~~~G~~L~i--~P~~Lv-Vpp~le~~A~~ll~~~~~~~~~ 286 (305) ..+.++. | -++...+ ..+++.||.+|+++-. .+..|. +| ++.++..+.+. T Consensus 189 ---~~~~~~~---~--------v~n~~~~----~~L~~lkd~~g~~~~~~~~~~tl~G~P---------V~~~~~~~~~~ 241 (324) T protein:vir:97 189 ---DELEANA---F--------ISKTQNR----SLLRKIVDPETKERIYDRNSDTLDGLP---------VVNLKSSNLKR 241 (324) T ss_pred ---ccCCCCE---E--------EEcHHHH----HHHHHhhcCCCceeecCCCCcccccee---------eEeecCCCCCc Confidence 0011111 1 1222222 3566789999987632 111111 21 11111111111 Q ss_pred cc----c-ccccc---ceee--EEecccC Q lcl|NC_013594. 287 TT----V-SNEMK---GKLQ--LVVADYL 305 (305) Q Consensus 287 ~~----~-~N~~~---~~~~--~iv~p~L 305 (305) .. + .+-+. +-++ +.-+..+ T Consensus 242 ~~~~~gd~~~~~i~~~~~~~i~~~~~~~~ 270 (324) T protein:vir:97 242 GELITGDFDKLIYGIPQLIEYKIDETAQL 270 (324) T ss_pred ceEEEEecccEEEEEecCcEEEEeecccc Confidence 00 0 01111 1112 2222222 No 71 >protein:vir:1025 Length: 408 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076679;genbank:gi:13095788;genbank:GeneID:920362 Probab=22.66 E-value=2.4 Score=18.48 Aligned_cols=214 Identities=13% Similarity=0.131 Sum_probs=88.0 Q ss_pred CCcCHHHHHHHHH------------HHHH-HHHHHHhhhhhhhhceeeecCCccchhhhhh--ccCCCchhhhccceeee Q lcl|NC_013594. 1 MIVTPASIKALMT------------SWRK-DFQGGLEDAPSQYNKIAMVVNSSTRSNTYGW--LGKFPTLKEWVGKRTIQ 65 (305) Q Consensus 1 M~i~~~~l~~L~~------------~~~~-~~~~~~~~a~~t~~~~a~~v~S~~~~~~y~~--Lg~~P~l~ew~Ge~~~~ 65 (305) -..+....+++.. .+.. ......+..| ..++|+.++-++....+.. ....-..-.|+||-.-. T Consensus 106 ~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~Ii~~~~~~~~--l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~ 183 (408) T protein:vir:10 106 AFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDS--LQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKI 183 (408) T ss_pred hhhhhhhhhhhhcccccCCceeccHhHHHHHHHHHHhhch--hhhhcceeeccCCcceEEEeeccccccceeeecCcccc Confidence 0011111111111 1111 1111111111 3455666553333333322 22222345677664322 Q ss_pred -cc---ccccceeeeecccceeeccHHHhhccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCccccCCccccccccc Q lcl|NC_013594. 66 -QM---EAHGYSIANKTFEGTVGISRDDFEDDNLGIYAPIFQEMGRSAAVQPDELIFKLLKDGFTQPCYDGQNFFDKEHP 141 (305) Q Consensus 66 -~l---~~~~~~i~n~tfg~~i~i~R~~I~nDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~~~DGk~fF~adH~ 141 (305) .. .=..-++..++++..+.||++.+.|-..++..-+...++++.++.+++-++.-... |++ T Consensus 184 ~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~--------~~~------- 248 (408) T protein:vir:10 184 PDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVMKA--------APK------- 248 (408) T ss_pred ccccCcceeeEEeeeeeEEeeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhcccc--------ccc------- Confidence 12 22455788899999999999999988888899999999999999988765432211 000 Q ss_pred ccccccccchhhhhHhhhccCCCccceeeeccc-ccccccchhhcccchhhhhhhccccccchhcccceeeeeccccccc Q lcl|NC_013594. 142 VYPNVDGTGSAVNTSNIVEQDSFSGLPFYLLDC-SRAVKPLIFQERRKPELVARTRIDDDHVFMDNEFLFGASARRAAGY 220 (305) Q Consensus 142 ~~~n~~~tg~~~~~s~l~~~~~~~G~~~~l~~~-~~~vkP~i~Q~r~~~~f~a~t~~~~~nvf~~~~~~~g~d~r~n~G~ 220 (305) .+...+...+. +. ...++| ..|.|+. T Consensus 249 -------~~~~~~~~~l~-------------~~~~~~~~~--------------------------------~~~~~a~- 275 (408) T protein:vir:10 249 -------KPTIAKFDDVI-------------TMINTAVDP--------------------------------AIIATSS- 275 (408) T ss_pred -------ccccccHHHHH-------------HHHHHhhhh--------------------------------hhccCCE- Confidence 00000111110 00 000000 0011111 Q ss_pred chhhhhhhcccccchHHHHHHHHHHHHhhcCCCceeccccCeEEecchhHHHHHHHHhh-------cccCCccccccccc Q lcl|NC_013594. 221 GFWQMAVAVKGDLTLDNLWKGWQLMRAFEGDGGKKLGLKPTHIVVPVGLEKAAEQLLNR-------ELFADGNTTVSNEM 293 (305) Q Consensus 221 g~wq~A~~s~~~Lt~~~l~~ar~aM~~~k~~~G~~L~i~P~~LvVpp~le~~A~~ll~~-------~~~~~~~~~~~N~~ 293 (305) | -++. ..+.++++.||.+|++|- .|..-- + .-..|+.. ...+..+.+..-.+ T Consensus 276 --~--------v~n~----~~~~~l~~lkd~~G~~i~-~~~~~~-~-----~~~~l~G~PV~~~~~~~~~~~~~~~~~i~ 334 (408) T protein:vir:10 276 --L--------LTNQ----SGLNKLALVKTAEGKYLL-EPDPTK-P-----NSYLIKGKQVIVVADRWLPNTGSTVYPLY 334 (408) T ss_pred --E--------EEcH----HHHHHHHHhhccCCceEe-ccCcCC-C-----CCceecceeeEEecccccCccCCCceEEE Confidence 0 0122 223556677888888763 221000 0 00011111 01111111000011 Q ss_pred ------------cceeeEEecccC Q lcl|NC_013594. 294 ------------KGKLQLVVADYL 305 (305) Q Consensus 294 ------------~~~~~~iv~p~L 305 (305) +.-+++.++++. T Consensus 335 ~gd~~~~~~~~~~~~~~v~~~~~~ 358 (408) T protein:vir:10 335 YGDMSQAITLFDRENMSLLPTNIG 358 (408) T ss_pred EEehhccEEEEEecceEEEEcccc Confidence 222344444432 No 72 >protein:vir:1328 Length: 392 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047927;swissprot:trembl:q9zwv6;genbank:gi:9631145;uniprot:Q9ZWV6;genbank:GeneID:2715889 Probab=22.24 E-value=2.5 Score=18.42 Aligned_cols=235 Identities=11% Similarity=0.055 Sum_probs=88.4 Q ss_pred CCcCHHHHHH------HHH----------HHHHHHHHHHhhhhhhhhceeeecCCccc-hhhhhhccCCCchhhhccce- Q lcl|NC_013594. 1 MIVTPASIKA------LMT----------SWRKDFQGGLEDAPSQYNKIAMVVNSSTR-SNTYGWLGKFPTLKEWVGKR- 62 (305) Q Consensus 1 M~i~~~~l~~------L~~----------~~~~~~~~~~~~a~~t~~~~a~~v~S~~~-~~~y~~Lg~~P~l~ew~Ge~- 62 (305) |......+.. ... -.+..+...... -+-.+.+|+.++.... .-.+.....-|. -.|++|. T Consensus 97 ~~~~~~~~~~~~~~~~~t~~~~g~~~~~~~~~~~i~~~~~~-~~~l~~~~~~~~~~~~~~~~~~~~~~~~~-a~~v~E~~ 174 (392) T protein:vir:13 97 NLGEARSFEFAPEKRDGTKAGNPNVLSRTLYGQLIAQAVER-SAIMRGGASTFTTSDANPMDFTVITGRAT-AGIVGETA 174 (392) T ss_pred chhhhHHHHhhhhhhcccccCCCccccccchHHHHHHHHhh-hhhhhhcceeeecCCCceeEEEEEcCCcc-eeeecccc Confidence 1110000000 000 001111111111 1123445554442221 112222222233 2466554 Q ss_pred --eeeccccccceeeeecccceeeccHHHhhccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCccccCCcc--cccc Q lcl|NC_013594. 63 --TIQQMEAHGYSIANKTFEGTVGISRDDFEDDNLGIYAPIFQEMGRSAAVQPDELIFKLLKDGFTQPCYDGQN--FFDK 138 (305) Q Consensus 63 --~~~~l~~~~~~i~n~tfg~~i~i~R~~I~nDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~~~DGk~--fF~a 138 (305) ...++.=..-++..++++..+.||++.+.|-+.++..-+.+.++++.++.++..++. |..+ |+| +++. T Consensus 175 ~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~l~----G~Gt----~~p~Gil~~ 246 (392) T protein:vir:13 175 EIPESYPATTQRSMGGFKYGFASVVSYEFATDQVLDLVGFLVSDAGPAIGDAMGRHFLT----GTGT----GQPRGILTD 246 (392) T ss_pred cccccccceeeEEeeeeeEEeeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhc----ccCC----ccccccccc Confidence 333344355678889999999999999999888999999999999999998886552 2111 111 2221 Q ss_pred ccccccc-ccccchhhhh---------------------------HhhhccCCCccceeeecccc------cccccchhh Q lcl|NC_013594. 139 EHPVYPN-VDGTGSAVNT---------------------------SNIVEQDSFSGLPFYLLDCS------RAVKPLIFQ 184 (305) Q Consensus 139 dH~~~~n-~~~tg~~~~~---------------------------s~l~~~~~~~G~~~~l~~~~------~~vkP~i~Q 184 (305) .-..... ..++....+. ..|..-++..|.+++.-+.. +.=.|++ T Consensus 247 ~~~~~~~~~~~~~~~~~~d~l~~~~~~l~~~~~~~a~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~g~~~~l~G~Pv~-- 324 (392) T protein:vir:13 247 ATGANAAFGEADADSKVSDALIDLFHEVPSAYRKNAKFVVNDLRAAQMRKLKDANGQYLWQSALTVGAPDTFNGKVVE-- 324 (392) T ss_pred cccccccccccccccccHHHHHHHHHhhhhhhhcCCEEEEcHHHHHHHHHhhccCCceeecCCcCCCCCceecceeeE-- Confidence 1100000 0000011111 11112244555554432211 1112322 Q ss_pred cccchhhhhhhccccccchhcccceeeeecccccccchhhhhhhcccccch-----HHHHHHHHHHHHhhcCCCceeccc Q lcl|NC_013594. 185 ERRKPELVARTRIDDDHVFMDNEFLFGASARRAAGYGFWQMAVAVKGDLTL-----DNLWKGWQLMRAFEGDGGKKLGLK 259 (305) Q Consensus 185 ~r~~~~f~a~t~~~~~nvf~~~~~~~g~d~r~n~G~g~wq~A~~s~~~Lt~-----~~l~~ar~aM~~~k~~~G~~L~i~ 259 (305) +...-|.+ ..+|| |.. ..-.+.|+ .+++ .-+......++...-.+|.+.+ T Consensus 325 -------~~~~~~~~-------~i~~G-df~-~~~i~~~~-------~~~i~~~~~~~~~~~~~~~r~~~r~d~~~~~-- 379 (392) T protein:vir:13 325 -------TDDGMPAD-------KVLFA-DLS-KYRVRFAG-------SLRVDRSVDAKFSTDQIVYRFLQRADGLLVD-- 379 (392) T ss_pred -------EcCCCCCC-------cEEEe-ecc-ceeEEeec-------ceEEEeeccccccCCcEEEEEEEEeccEEec-- Confidence 11111221 22222 221 11111111 1111 1122222334444445555443 Q ss_pred cCeEEecchhHHHH Q lcl|NC_013594. 260 PTHIVVPVGLEKAA 273 (305) Q Consensus 260 P~~LvVpp~le~~A 273 (305) |.-.+ --.+..+| T Consensus 380 ~~A~~-~~~~~~aa 392 (392) T protein:vir:13 380 ARGAK-VLTVTPAA 392 (392) T ss_pred ccceE-EEEeeccC Confidence 22111 01111111 No 73 >protein:vir:4600 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058445;genbank:gi:9635171;genbank:GeneID:1262708 Probab=21.93 E-value=2.5 Score=18.38 Aligned_cols=235 Identities=15% Similarity=0.090 Sum_probs=91.7 Q ss_pred CCcCHHHHHHHHH------------------------HHHHHHHHHHhhhhhhhhceeeecCCccchhhhhhccCCCc-h Q lcl|NC_013594. 1 MIVTPASIKALMT------------------------SWRKDFQGGLEDAPSQYNKIAMVVNSSTRSNTYGWLGKFPT-L 55 (305) Q Consensus 1 M~i~~~~l~~L~~------------------------~~~~~~~~~~~~a~~t~~~~a~~v~S~~~~~~y~~Lg~~P~-l 55 (305) .....+..++... .+...+.+..... ....++++.++-++...+|.++-.-+. - T Consensus 99 ~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~g~~~iP~~~~~~ii~~~~~~-~~l~~~~~~~~~~~~~~~~~~~~~~~~~~ 177 (415) T protein:vir:46 99 TKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVE-FNLDKYVTVKRVTNGSGKYPVVRQSEVAA 177 (415) T ss_pred hhhhHHHHHHHHHHHhhhhhhhhccccccCCcccccHHHHHHHHHHHHhh-hhhhhhcceeeccCCceeEEEEEecCCcc Confidence 1111111111100 0111111111111 112334555443333333433311111 1 Q ss_pred hhhcccee-ee---ccccccceeeeecccceeeccHHHhhccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCccccC Q lcl|NC_013594. 56 KEWVGKRT-IQ---QMEAHGYSIANKTFEGTVGISRDDFEDDNLGIYAPIFQEMGRSAAVQPDELIFKLLKDGFTQPCYD 131 (305) Q Consensus 56 ~ew~Ge~~-~~---~l~~~~~~i~n~tfg~~i~i~R~~I~nDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~~~D 131 (305) -.|++|.. .. ...=..-++..++++..+.|||+.+.|-+.++..-+...++++.++..++.++.-+.+|.+.. . T Consensus 178 ~~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~~--~ 255 (415) T protein:vir:46 178 LEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGS--T 255 (415) T ss_pred eeecccccccccccccceeeEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCCccc--c Confidence 24555432 11 223355688899999999999999988788899999999999999999987765443321111 0 Q ss_pred CcccccccccccccccccchhhhhHhhhccCCCccceeeecccccccccchhhcccchhhhhhhccccccchhcccceee Q lcl|NC_013594. 132 GQNFFDKEHPVYPNVDGTGSAVNTSNIVEQDSFSGLPFYLLDCSRAVKPLIFQERRKPELVARTRIDDDHVFMDNEFLFG 211 (305) Q Consensus 132 Gk~fF~adH~~~~n~~~tg~~~~~s~l~~~~~~~G~~~~l~~~~~~vkP~i~Q~r~~~~f~a~t~~~~~nvf~~~~~~~g 211 (305) +. + .+ ... . . ....+... .+. |.=++. +... T Consensus 256 ~~---~-~~------~~~-~--~-------~~~~~~~~--~~~---i~~~~~---------~~~~--------------- 286 (415) T protein:vir:46 256 SS---G-FE------KEG-K--K-------LEVKKAKS--LDD---IKDAIN---------LNVK--------------- 286 (415) T ss_pred cc---c-cc------ccc-c--e-------eccccccc--hHH---HHHHHH---------hhhh--------------- Confidence 00 0 00 000 0 0 00000000 000 000000 0000 Q ss_pred eecccccccchhhhhhhcccccchHHHHHHHHHHHHhhcCCCceeccccCeEEecchhHHHHHHHHhhcccCCccccccc Q lcl|NC_013594. 212 ASARRAAGYGFWQMAVAVKGDLTLDNLWKGWQLMRAFEGDGGKKLGLKPTHIVVPVGLEKAAEQLLNRELFADGNTTVSN 291 (305) Q Consensus 212 ~d~r~n~G~g~wq~A~~s~~~Lt~~~l~~ar~aM~~~k~~~G~~L~i~P~~LvVpp~le~~A~~ll~~~~~~~~~~~~~N 291 (305) ..+.+++ | -++.+. +.++++.||.+|+||- .|+.- -+....-....+.-.+..+.++.+... T Consensus 287 -~~~~~~~---~--------v~n~~~----~~~L~~lkd~~G~~i~-~~~~~-~~~~~~l~G~pV~~~~~~~~~~~~~~~ 348 (415) T protein:vir:46 287 -PNYEHNV---A--------IVSQTM----FAKLDKMKDKLGNYLI-QPDVK-EKTQQRLLGAKIEILPDEVLGQKGNNT 348 (415) T ss_pred -hccCCCE---E--------EEcHHH----HHHHHHhhccCCCeee-ccCcC-CCCCccccceeeEEeccccccCCCccE Confidence 0111211 1 122222 3466788999999883 33210 000000001111122223333322223 Q ss_pred ccccee------------eEEecccC Q lcl|NC_013594. 292 EMKGKL------------QLVVADYL 305 (305) Q Consensus 292 ~~~~~~------------~~iv~p~L 305 (305) .+.|-+ ++..+++. T Consensus 349 ~~~gd~~~~~~~~~~~~~~v~~~~~~ 374 (415) T protein:vir:46 349 LIIGNLKDAIVLFDRSQYQASWTDYM 374 (415) T ss_pred EEEEehhccEEEEeecceEEEeeccc Confidence 343321 11111111 No 74 >protein:vir:4700 Length: 415 # NCBI annotation: phi PVL ORF 7 homologue # Family: family:all:21 # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061632;genbank:gi:9635719;genbank:GeneID:1262976 Probab=21.93 E-value=2.5 Score=18.38 Aligned_cols=235 Identities=15% Similarity=0.090 Sum_probs=91.7 Q ss_pred CCcCHHHHHHHHH------------------------HHHHHHHHHHhhhhhhhhceeeecCCccchhhhhhccCCCc-h Q lcl|NC_013594. 1 MIVTPASIKALMT------------------------SWRKDFQGGLEDAPSQYNKIAMVVNSSTRSNTYGWLGKFPT-L 55 (305) Q Consensus 1 M~i~~~~l~~L~~------------------------~~~~~~~~~~~~a~~t~~~~a~~v~S~~~~~~y~~Lg~~P~-l 55 (305) .....+..++... .+...+.+..... ....++++.++-++...+|.++-.-+. - T Consensus 99 ~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~g~~~iP~~~~~~ii~~~~~~-~~l~~~~~~~~~~~~~~~~~~~~~~~~~~ 177 (415) T protein:vir:47 99 TKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVE-FNLDKYVTVKRVTNGSGKYPVVRQSEVAA 177 (415) T ss_pred hhhhHHHHHHHHHHHhhhhhhhhccccccCCcccccHHHHHHHHHHHHhh-hhhhhhcceeeccCCceeEEEEEecCCcc Confidence 1111111111100 0111111111111 112334555443333333433311111 1 Q ss_pred hhhcccee-ee---ccccccceeeeecccceeeccHHHhhccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCccccC Q lcl|NC_013594. 56 KEWVGKRT-IQ---QMEAHGYSIANKTFEGTVGISRDDFEDDNLGIYAPIFQEMGRSAAVQPDELIFKLLKDGFTQPCYD 131 (305) Q Consensus 56 ~ew~Ge~~-~~---~l~~~~~~i~n~tfg~~i~i~R~~I~nDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~~~~~~D 131 (305) -.|++|.. .. ...=..-++..++++..+.|||+.+.|-+.++..-+...++++.++..++.++.-+.+|.+.. . T Consensus 178 ~~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~~--~ 255 (415) T protein:vir:47 178 LEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGS--T 255 (415) T ss_pred eeecccccccccccccceeeEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCCccc--c Confidence 24555432 11 223355688899999999999999988788899999999999999999987765443321111 0 Q ss_pred CcccccccccccccccccchhhhhHhhhccCCCccceeeecccccccccchhhcccchhhhhhhccccccchhcccceee Q lcl|NC_013594. 132 GQNFFDKEHPVYPNVDGTGSAVNTSNIVEQDSFSGLPFYLLDCSRAVKPLIFQERRKPELVARTRIDDDHVFMDNEFLFG 211 (305) Q Consensus 132 Gk~fF~adH~~~~n~~~tg~~~~~s~l~~~~~~~G~~~~l~~~~~~vkP~i~Q~r~~~~f~a~t~~~~~nvf~~~~~~~g 211 (305) +. + .+ ... . . ....+... .+. |.=++. +... T Consensus 256 ~~---~-~~------~~~-~--~-------~~~~~~~~--~~~---i~~~~~---------~~~~--------------- 286 (415) T protein:vir:47 256 SS---G-FE------KEG-K--K-------LEVKKAKS--LDD---IKDAIN---------LNVK--------------- 286 (415) T ss_pred cc---c-cc------ccc-c--e-------eccccccc--hHH---HHHHHH---------hhhh--------------- Confidence 00 0 00 000 0 0 00000000 000 000000 0000 Q ss_pred eecccccccchhhhhhhcccccchHHHHHHHHHHHHhhcCCCceeccccCeEEecchhHHHHHHHHhhcccCCccccccc Q lcl|NC_013594. 212 ASARRAAGYGFWQMAVAVKGDLTLDNLWKGWQLMRAFEGDGGKKLGLKPTHIVVPVGLEKAAEQLLNRELFADGNTTVSN 291 (305) Q Consensus 212 ~d~r~n~G~g~wq~A~~s~~~Lt~~~l~~ar~aM~~~k~~~G~~L~i~P~~LvVpp~le~~A~~ll~~~~~~~~~~~~~N 291 (305) ..+.+++ | -++.+. +.++++.||.+|+||- .|+.- -+....-....+.-.+..+.++.+... T Consensus 287 -~~~~~~~---~--------v~n~~~----~~~L~~lkd~~G~~i~-~~~~~-~~~~~~l~G~pV~~~~~~~~~~~~~~~ 348 (415) T protein:vir:47 287 -PNYEHNV---A--------IVSQTM----FAKLDKMKDKLGNYLI-QPDVK-EKTQQRLLGAKIEILPDEVLGQKGNNT 348 (415) T ss_pred -hccCCCE---E--------EEcHHH----HHHHHHhhccCCCeee-ccCcC-CCCCccccceeeEEeccccccCCCccE Confidence 0111211 1 122222 3466788999999883 33210 000000001111122223333322223 Q ss_pred ccccee------------eEEecccC Q lcl|NC_013594. 292 EMKGKL------------QLVVADYL 305 (305) Q Consensus 292 ~~~~~~------------~~iv~p~L 305 (305) .+.|-+ ++..+++. T Consensus 349 ~~~gd~~~~~~~~~~~~~~v~~~~~~ 374 (415) T protein:vir:47 349 LIIGNLKDAIVLFDRSQYQASWTDYM 374 (415) T ss_pred EEEEehhccEEEEeecceEEEeeccc Confidence 343321 11111111 No 75 >protein:vir:962 Length: 397 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076616;genbank:gi:13095724;genbank:GeneID:920264 Probab=20.35 E-value=2.7 Score=18.26 Aligned_cols=247 Identities=16% Similarity=0.073 Sum_probs=93.6 Q ss_pred CCcCHHHHHHHHHHHHHHHHHH---------Hhhhhhh-------------hhceeeecCCccchhhhhhccC-CCchhh Q lcl|NC_013594. 1 MIVTPASIKALMTSWRKDFQGG---------LEDAPSQ-------------YNKIAMVVNSSTRSNTYGWLGK-FPTLKE 57 (305) Q Consensus 1 M~i~~~~l~~L~~~~~~~~~~~---------~~~a~~t-------------~~~~a~~v~S~~~~~~y~~Lg~-~P~l~e 57 (305) +.-......++-..++..-..+ -...|++ -...|+.++-+.....|..+.. -... . T Consensus 110 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~-~ 188 (397) T protein:vir:96 110 EEELAEKRSAINAFVKSKGAEKRDGFTSVEGGALIPQELLQPQLEPKDIVDLSKYVRSVPVNSASGKFPVISKSGSKM-A 188 (397) T ss_pred hHHHHHHHHHHHHHHHhhhhhhhhcccccccccchhHHHHHHHHHhhhhhhHHHhhhhccccccceeEEEEeccCCcc-c Confidence 0000000000000000000000 0000111 1222332221222222222211 1122 2 Q ss_pred hccc---ee-eeccccccceeeeecccceeeccHHHhhccccchhHHHHHHHHHHHHhhHHHHHHHHHhccC--CccccC Q lcl|NC_013594. 58 WVGK---RT-IQQMEAHGYSIANKTFEGTVGISRDDFEDDNLGIYAPIFQEMGRSAAVQPDELIFKLLKDGF--TQPCYD 131 (305) Q Consensus 58 w~Ge---~~-~~~l~~~~~~i~n~tfg~~i~i~R~~I~nDdlG~~~~~~~~~G~aAa~~~~~lv~~lL~~g~--~~~~~D 131 (305) |++| .. .....-..-++..++++..+.||++.+.|-...+..-+.+.++++.++..+..+..-...+. +..-|| T Consensus 189 ~~~E~~~~~~~~~~~~~~i~~~~~~~~~~~~~s~ell~ds~~~l~~~i~~~l~~~~~~~~~~~i~~g~g~~~~~~~~~~d 268 (397) T protein:vir:96 189 TVQQLEKNPQLANPKMVEIDYSVATRRGYIPISQEMIDDASYDVTGLIADEIQDQSLNTKNADIAAVLKTATAKSVVGVD 268 (397) T ss_pred cccccccccccccccccceeecHhHhhcchhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccchH Confidence 3322 22 12233344567788999999999999998788888889999999999999988776554321 112233 Q ss_pred Cc-ccccccccccccccccchhhhhHhhhccCCCccceeeecccccc------cccchhhcccchhhhhhhccccccchh Q lcl|NC_013594. 132 GQ-NFFDKEHPVYPNVDGTGSAVNTSNIVEQDSFSGLPFYLLDCSRA------VKPLIFQERRKPELVARTRIDDDHVFM 204 (305) Q Consensus 132 Gk-~fF~adH~~~~n~~~tg~~~~~s~l~~~~~~~G~~~~l~~~~~~------vkP~i~Q~r~~~~f~a~t~~~~~nvf~ 204 (305) +- .++++.++...+..---+......|..-++..|.+++.-+..-. =+|++...- ..+... . T Consensus 269 ~~~~~~~~~~~~~~~a~~v~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~l~G~pv~~~~~--------~~~~~~---~ 337 (397) T protein:vir:96 269 GLKDLINKEIKKVYDVKLFISASMYSELDKLKDKNGRYLLQDSITAASGKQLLGKEVVVLDD--------DVIGKS---V 337 (397) T ss_pred HHHHHHHHhhhhhcCcEEEEcHHHHHHHHHhhccCCCeEeccCccCCCcccccccceEEecc--------cccCCC---C Confidence 32 23333332111100000111122333335667776654332221 133322110 011000 0 Q ss_pred c-ccceeeeecccccccchh-hhhhhcccccchHHHHHHHHHHHHhhcCCCceeccccCeE--E-ecch Q lcl|NC_013594. 205 D-NEFLFGASARRAAGYGFW-QMAVAVKGDLTLDNLWKGWQLMRAFEGDGGKKLGLKPTHI--V-VPVG 268 (305) Q Consensus 205 ~-~~~~~g~d~r~n~G~g~w-q~A~~s~~~Lt~~~l~~ar~aM~~~k~~~G~~L~i~P~~L--v-Vpp~ 268 (305) . ...+|| |.+..+-.+-| ++.+. ..+...+ -+.|+...-.+|.++ .|.-+ + |.++ T Consensus 338 ~~~~~~~g-d~~~~~~~~~~~~~~~~---~~~~~~~---~~~~~~~~r~d~~~~--~~~a~~~~~~~~a 397 (397) T protein:vir:96 338 GNVVGFIG-DAKAFASFFDRKQVSVS---WVDNNIY---GQLLAGIIRYDVKAT--DKKAGFYVTFTIG 397 (397) T ss_pred CceEEEEe-ehhcceEeEeecceEEE---Eeccccc---ceeEEEEEEEccEEe--cccceEEEEeecC Confidence 0 112232 32211111100 00110 1111111 234555555666655 44422 2 4555 Done!