Query lcl|NC_020866.1_cdsid_YP_007676421.1 [gene=RHVG_00042] [protein=major head protein] [protein_id=YP_007676421.1] [location=25019..25912]
Match_columns 297
No_of_seqs 177 out of 358
Neff 6.3
Searched_HMMs 1612
Date Thu Nov 7 17:25:59 2013
Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_42 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_42_vs_rec_db.hhr

No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 protein:vir:1991 Length: 305 # 100.0 2E-140 1E-143 786.1 17.8 296 1-296 1-305 (305)
2 protein:vir:99228 Length: 304 100.0 2E-140 1E-143 787.0 14.8 297 1-297 1-304 (304)
3 protein:vir:79246 Length: 304 100.0 7E-140 4E-143 783.6 14.8 297 1-297 1-304 (304)
4 protein:vir:103886 Length: 302 100.0 2.7E-92 1.7E-95 522.6 20.3 232 1-297 1-232 (302)
5 protein:vir:95512 Length: 693 100.0 8.3E-65 5.2E-68 371.9 15.7 219 1-297 394-627 (693)
6 protein:vir:79548 Length: 652 100.0 3.5E-62 2.1E-65 357.5 15.4 216 1-297 359-587 (652)
7 protein:vir:103886 Length: 302 99.8 1.3E-24 7.9E-28 151.6 -2.7 142 1-228 160-302 (302)
8 protein:vir:108211 Length: 318 97.4 1.5E-05 9.2E-09 47.1 12.7 206 1-297 21-250 (318)
9 protein:vir:95512 Length: 693 96.2 1.2E-07 7.5E-11 58.5 -7.3 250 1-297 338-616 (693)
10 protein:vir:79928 Length: 393 96.0 7.9E-05 4.9E-08 43.1 7.3 228 1-297 42-311 (393)
11 protein:vir:1638 Length: 298 # 95.6 0.00098 6.1E-07 37.1 11.8 227 1-297 1-259 (298)
12 protein:vir:2344 Length: 397 # 94.6 0.004 2.5E-06 33.7 13.6 222 1-297 1-262 (397)
13 protein:vir:79548 Length: 652 94.2 1.8E-06 1.1E-09 52.1 -6.9 237 1-297 319-579 (652)
14 protein:vir:9820 Length: 272 # 93.5 0.0073 4.5E-06 32.3 14.2 191 1-297 1-210 (272)
15 protein:vir:3033 Length: 272 # 93.5 0.0073 4.5E-06 32.3 14.2 191 1-297 1-210 (272)
16 protein:vir:94771 Length: 298 93.3 0.0082 5.1E-06 32.0 11.9 228 1-297 1-259 (298)
17 protein:vir:96223 Length: 324 89.4 0.027 1.7E-05 29.2 11.2 226 1-297 4-271 (324)
18 protein:vir:104085 Length: 320 88.4 0.023 1.4E-05 29.6 9.0 229 1-297 7-271 (320)
19 protein:vir:99920 Length: 311 88.2 0.034 2.1E-05 28.7 12.7 233 1-297 1-265 (311)
20 protein:vir:105334 Length: 276 86.6 0.045 2.8E-05 28.0 15.1 186 1-297 1-211 (276)
21 protein:vir:8187 Length: 311 # 86.2 0.048 2.9E-05 27.8 13.0 228 1-297 1-261 (311)
22 protein:vir:2430 Length: 318 # 86.0 0.049 3E-05 27.8 11.8 226 1-297 5-269 (318)
23 protein:vir:9759 Length: 303 # 82.8 0.074 4.6E-05 26.8 11.3 231 1-297 1-263 (303)
24 protein:vir:4600 Length: 415 # 82.5 0.076 4.7E-05 26.7 12.5 232 1-297 99-375 (415)
25 protein:vir:4700 Length: 415 # 82.5 0.076 4.7E-05 26.7 12.5 232 1-297 99-375 (415)
26 protein:vir:9574 Length: 300 # 81.8 0.083 5.1E-05 26.5 12.6 235 1-297 1-260 (300)
27 protein:vir:78223 Length: 333 80.7 0.093 5.7E-05 26.3 10.7 242 1-297 1-289 (333)
28 protein:vir:96762 Length: 632 79.5 0.1 6.5E-05 26.0 13.2 224 1-297 345-601 (632)
29 protein:vir:7771 Length: 330 # 79.1 0.11 6.7E-05 25.9 13.3 239 1-297 10-273 (330)
30 protein:vir:3613 Length: 272 # 78.3 0.12 7.2E-05 25.7 13.3 191 1-297 1-209 (272)
31 protein:vir:8102 Length: 543 # 78.2 0.12 7.2E-05 25.7 11.4 232 1-297 251-506 (543)
32 protein:vir:4226 Length: 326 # 77.5 0.12 7.7E-05 25.6 13.3 228 1-297 1-279 (326)
33 protein:vir:4092 Length: 390 # 75.3 0.15 9.2E-05 25.1 10.1 236 1-297 70-336 (390)
34 protein:vir:99749 Length: 324 74.6 0.16 9.7E-05 25.0 12.8 223 1-297 1-271 (324)
35 protein:vir:100247 Length: 425 73.7 0.17 0.0001 24.8 14.2 238 1-297 120-394 (425)
36 protein:vir:93742 Length: 274 69.8 0.22 0.00014 24.2 13.9 193 1-297 1-211 (274)
37 protein:vir:80684 Length: 315 69.4 0.22 0.00014 24.2 11.2 235 1-297 1-266 (315)
38 protein:vir:97148 Length: 324 67.4 0.25 0.00016 23.9 13.9 224 1-297 1-271 (324)
39 protein:vir:96262 Length: 274 67.1 0.26 0.00016 23.8 14.2 185 1-297 10-211 (274)
40 protein:vir:95898 Length: 274 67.1 0.26 0.00016 23.8 14.2 185 1-297 10-211 (274)
41 protein:vir:103955 Length: 324 66.7 0.26 0.00016 23.8 14.3 224 1-297 1-271 (324)
42 protein:vir:78523 Length: 338 65.8 0.28 0.00017 23.6 13.5 236 1-297 1-289 (338)
43 protein:vir:107687 Length: 319 63.9 0.31 0.00019 23.4 8.8 206 1-297 4-254 (319)
44 protein:vir:105905 Length: 304 63.8 0.31 0.00019 23.4 13.4 230 1-297 1-257 (304)
45 protein:vir:94142 Length: 304 63.8 0.31 0.00019 23.4 13.4 230 1-297 1-257 (304)
46 protein:vir:79642 Length: 329 63.7 0.31 0.00019 23.3 10.6 211 1-297 29-261 (329)
47 protein:vir:96833 Length: 275 60.3 0.37 0.00023 22.9 15.2 193 1-297 1-212 (275)
48 protein:vir:9704 Length: 394 # 55.6 0.48 0.0003 22.3 9.9 203 1-297 137-361 (394)
49 protein:vir:1886 Length: 385 # 54.4 0.5 0.00031 22.2 11.1 232 1-297 105-350 (385)
50 protein:vir:191 Length: 385 # 54.4 0.5 0.00031 22.2 11.1 232 1-297 105-350 (385)
51 protein:vir:96392 Length: 324 54.2 0.51 0.00032 22.2 13.3 224 1-297 4-271 (324)
52 protein:vir:78830 Length: 324 54.2 0.51 0.00032 22.2 13.3 224 1-297 4-271 (324)
53 protein:vir:2504 Length: 305 # 53.2 0.54 0.00033 22.1 13.4 231 1-297 1-256 (305)
54 protein:vir:485 Length: 407 # 50.8 0.6 0.00037 21.8 12.9 238 1-297 96-370 (407)
55 protein:vir:81070 Length: 390 48.1 0.68 0.00042 21.5 12.2 223 1-297 123-361 (390)
56 protein:vir:6242 Length: 390 # 46.7 0.73 0.00045 21.3 13.3 221 1-297 84-357 (390)
57 protein:vir:105038 Length: 428 45.4 0.77 0.00048 21.2 11.4 235 1-297 98-385 (428)
58 protein:vir:102655 Length: 322 44.9 0.79 0.00049 21.1 15.0 209 1-297 1-243 (322)
59 protein:vir:102605 Length: 273 44.1 0.82 0.00051 21.1 16.2 191 1-297 1-212 (273)
60 protein:vir:105822 Length: 273 44.1 0.82 0.00051 21.1 16.2 191 1-297 1-212 (273)
61 protein:vir:4456 Length: 401 # 40.9 0.95 0.00059 20.7 12.8 242 1-297 97-371 (401)
62 protein:vir:8420 Length: 477 # 40.7 0.96 0.0006 20.7 12.4 243 1-297 135-438 (477)
63 protein:vir:9309 Length: 324 # 39.5 1 0.00063 20.5 14.2 224 1-297 4-269 (324)
64 protein:vir:80213 Length: 334 38.3 1.1 0.00067 20.4 13.4 215 1-297 1-252 (334)
65 protein:vir:96123 Length: 274 38.2 1.1 0.00067 20.4 14.1 192 1-297 1-211 (274)
66 protein:vir:80068 Length: 301 37.8 1.1 0.00068 20.4 9.4 212 1-297 7-236 (301)
67 protein:vir:79987 Length: 415 37.2 1.1 0.0007 20.3 13.0 224 1-297 99-375 (415)
68 protein:vir:81100 Length: 415 37.2 1.1 0.0007 20.3 13.0 224 1-297 99-375 (415)
69 protein:vir:98339 Length: 415 37.2 1.1 0.0007 20.3 13.0 224 1-297 99-375 (415)
70 protein:vir:99675 Length: 324 36.8 1.2 0.00071 20.2 11.5 177 33-297 1-202 (324)
71 protein:vir:1239 Length: 274 # 36.7 1.2 0.00072 20.2 14.6 183 1-297 1-211 (274)
72 protein:vir:4339 Length: 395 # 36.7 1.2 0.00072 20.2 13.1 234 1-297 87-361 (395)
73 protein:vir:104342 Length: 314 35.4 1.2 0.00076 20.1 9.4 206 1-297 1-246 (314)
74 protein:vir:94494 Length: 274 32.4 1.4 0.00089 19.7 15.0 186 1-297 1-211 (274)
75 protein:vir:97433 Length: 274 32.4 1.4 0.00089 19.7 15.0 186 1-297 1-211 (274)
76 protein:vir:7990 Length: 273 # 30.2 1.6 0.00099 19.5 15.9 192 1-297 1-212 (273)
77 protein:vir:739 Length: 231 # 30.1 1.6 0.001 19.5 10.7 159 32-297 1-168 (231)
78 protein:vir:80180 Length: 381 27.0 1.9 0.0012 19.1 15.7 214 1-297 13-243 (381)
79 protein:vir:4856 Length: 293 # 24.9 2.1 0.0013 18.8 10.9 213 1-297 5-250 (293)
80 protein:vir:94576 Length: 347 24.9 2.1 0.0013 18.8 15.3 216 1-297 1-253 (347)
81 protein:vir:3870 Length: 400 # 24.5 2.2 0.0013 18.7 10.1 210 1-297 123-362 (400)
82 protein:vir:1433 Length: 435 # 24.4 2.2 0.0014 18.7 12.0 230 1-297 101-390 (435)
83 protein:vir:100135 Length: 418 24.3 2.2 0.0014 18.7 11.6 232 1-297 120-381 (418)
84 protein:vir:9410 Length: 415 # 23.8 2.3 0.0014 18.6 12.2 231 1-297 99-375 (415)
85 protein:vir:41 Length: 299 # N 23.5 2.3 0.0014 18.6 13.1 229 1-297 1-254 (299)
86 protein:vir:5739 Length: 366 # 22.9 2.4 0.0015 18.5 12.5 233 1-297 64-323 (366)
87 protein:vir:80376 Length: 435 22.7 2.4 0.0015 18.5 13.6 231 1-297 105-390 (435)
88 protein:vir:80930 Length: 278 22.6 2.4 0.0015 18.5 14.5 192 1-297 1-218 (278)
89 protein:vir:6324 Length: 335 # 22.0 2.5 0.0016 18.4 12.7 212 1-297 1-248 (335)
90 protein:vir:97031 Length: 402 21.5 2.6 0.0016 18.3 12.1 213 1-297 1-247 (402)
91 protein:vir:97053 Length: 390 20.4 2.8 0.0017 18.1 11.1 232 1-297 102-362 (390)

No 1
>protein:vir:1991 Length: 305 # NCBI annotation: major head subunit # Family: family:all:776 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050638;genbank:gi:9633525;genbank:GeneID:2636267
Probab=100.00 E-value=2.3e-140 Score=786.12 Aligned_cols=296 Identities=48% Similarity=0.869 Sum_probs=290.4

Q ss_pred CCcCHHHHHHHHHHHHHHHHHHHhhcchhhcceeeeecCCccceecccccCCCcchhcccceeeeeeccccceeeeeccc
Q lcl|NC_020866. 1 MQVTAANLDALRVGFKTSFQGALDQAPSQYLRLTTVVPSSTKEQRYGWMGKIPNVREWIGPRAIQNLTESDYSIREKPWE 80 (297)
Q Consensus 1 M~i~~~~l~~l~~~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~lrew~Ge~~~~~l~~~~~~i~n~~fe 80 (297)
||||+++|++|+++||+.|++||+.+||+|++|||+|||++++|+|+|||+||+||||+|||++++|++|+|+|+||+||
T Consensus 1 M~i~~~~l~~l~~~~~~~f~~~~~~a~~~~~~iA~~vpSt~~~~tY~wLg~fP~lrewiGer~i~~l~~~~y~i~Nk~fe 80 (305)
T protein:vir:19 1 MIVTPASIKALMTSWRKDFQGGLEDAPSQYNKIAMVVNSSTRSNTYGWLGKFPTLKEWVGKRTIQQMEAHGYSIANKTFE 80 (305)
T ss_pred CccCHHHHHHHHHHHHHHHHHHHhhcCcccceEEeEecCCCCcccccccccCCccchhhcceeeeeccccceeEeecccc
Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999

Q ss_pred cceeeccHHHhhccCcchhHHHHHHHHHHHHhhHHHHHHHHHhcccCccccCcccccccccccc---cccccceeeccc--
Q lcl|NC_020866.
81 LTIGVDRDDIETDNLGIYSPLFQEMGRSAGSKWDMLVFELLKLGFATECYDGQNFFDTDHPVL---DEDGKTVTVSNT-- 155 (297) Q Consensus 81 ~tv~v~R~~i~dD~lG~~~~~~~~~g~aaa~~~~~lv~~lL~~G~~~~~~DGk~~F~tdH~~~---~g~~~~~svsn~-- 155 (297) .||+|+|+|||||+||+|+|++++||++|++|||+|||+||++||+++|||||+||||||||+ +|++...+|||+ T Consensus 81 ~tV~V~R~dIeDD~lG~y~p~~~~~G~~aa~~pd~lv~~lL~~Gf~~~cyDGq~FFdtDHpv~~~~~~tg~~~~vsn~~~ 160 (305) T protein:vir:19 81 GTVGISRDDFEDDNLGIYAPIFQEMGRSAAVQPDELIFKLLKDGFTQPCYDGQNFFDKEHPVYPNVDGTGSAVNTSNIVE 160 (305) T ss_pred ceeccchhhccccccCchHHHHHHHHHHHhhchhhHHHHHHHhcCCccCCCCCcccCCCCCcccCCcccccccchhhhhc Confidence 999999999999999999999999999999999999999999999999999999999999984 578899999997 Q ss_pred -cccchhHHHHHHHHHHHHHHHhhcccccchhhhccccccccccccceeeeccccccccccccchhhcCCccCCHHHHHH Q lcl|NC_020866. 156 -GGGTGTPWFLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLYGTDARANVGFGFWQMAYGSKQTLDGTAYAA 234 (297) Q Consensus 156 -~ag~~~awylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~g~d~r~~~G~~l~q~a~~~~~~l~~~~l~a 234 (297) .++++.+|||+|+++++||||+|+|++++|+++++++|++||++++|+||+|+|||+||||||+||+|+++|+++||++ T Consensus 161 ~~~~~g~~w~Lld~~~~ikP~I~Q~Rk~~~~~~~~~~~d~~vf~~~e~~ygvd~R~n~Gygfwq~a~gS~~~Ls~~nl~a 240 (305) T protein:vir:19 161 QDSFSGLPFYLLDCSRAVKPLIFQERRKPELVARTRIDDDHVFMDNEFLFGASTRRAAGYGFWQMAVAVKGDLTLDNLWK 240 (305) T ss_pred CCCCCCceeeeeecCCcceeEEEecccccceeeccCCCchhhhhhceeeeeeeeeeeccccchhheecCCCCCCHHHHHH Confidence 6788999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHhhccCCCcccccccCeEEecchHHHHHHHHHhhhccCCCC---cceecceeeEEecccc Q lcl|NC_020866. 235 ARAALSGMKGDYGRPLGLMPNLLVVPPALEEAGRKILNSENASGGE---TNPWKGTAELLVVPWL 296 (297) Q Consensus 235 ar~aM~~~k~~~G~~L~i~P~~LvVp~~le~~A~~ll~~~~~~~g~---~N~~~~~~~~iv~p~L 296 (297) ||++|++||+++|+||+|+|++|||||+||.+|+|||+++.+++|. 
+|||+|++||||+||| T Consensus 241 ar~aM~~qk~d~G~pL~I~P~~LvVPp~LE~~A~qll~s~~i~~g~~~~~Np~~g~~eliV~P~L 305 (305) T protein:vir:19 241 GWQLMRSFEGDGGKKLGLKPTHIVVPVGLEKAAEQLLNRELFADGNTTVSNEMKGKLQLVVADYL 305 (305) T ss_pred HHHHHHhhcCCCCceeeeecCeEEeCchhHHHHHHHHhhcccCCccccccceecceEEEEecccC Confidence 9999999999999999999999999999999999999999887765 6999999999999999 No 2 >protein:vir:99228 Length: 304 # NCBI annotation: hypothetical protein # Family: family:all:776 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950457;genbank:gi:119953658;genbank:GeneID:4643088 Probab=100.00 E-value=1.6e-140 Score=786.96 Aligned_cols=297 Identities=48% Similarity=0.859 Sum_probs=292.6 Q ss_pred CC-cCHHHHHHHHHHHHHHHHHHHhhcchhhcceeeeecCCccceecccccCCCcchhcccceeeeeeccccceeeeecc Q lcl|NC_020866. 1 MQ-VTAANLDALRVGFKTSFQGALDQAPSQYLRLTTVVPSSTKEQRYGWMGKIPNVREWIGPRAIQNLTESDYSIREKPW 79 (297) Q Consensus 1 M~-i~~~~l~~l~~~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~lrew~Ge~~~~~l~~~~~~i~n~~f 79 (297) |. ||+++|++|+++||+.|++||+.+|++|++|||+|||++++|+|+|||+||+||||||||++++|++|+|+|+||+| T Consensus 1 M~ii~~~~L~~l~~~~~~~f~~~~~~a~~~~~~iA~~VpSt~~~~~Y~WLg~~P~mreWiG~r~i~~l~~~~y~I~Nk~f 80 (304) T protein:vir:99 1 MAIITPALISALKTSFQKHFQDALATAPSTYLQVATVIPSTTASNTYGWLGQFPKLREWIGQRVIKDMAAQGYQITNKLF 80 (304) T ss_pred CCccCHHHHHHHHHHHHHHHHHHHhhcCcccceeEeEeecCccccccchhcccccchhhhhhhhhhhhhhccceeecccc Confidence 87 99999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cceeeccHHHhhccCcchhHHHHHHHHHHHHhhHHHHHHHHHhcccCccccCcccccccccccc---cccccceeecccc Q lcl|NC_020866. 80 ELTIGVDRDDIETDNLGIYSPLFQEMGRSAGSKWDMLVFELLKLGFATECYDGQNFFDTDHPVL---DEDGKTVTVSNTG 156 (297) Q Consensus 80 e~tv~v~R~~i~dD~lG~~~~~~~~~g~aaa~~~~~lv~~lL~~G~~~~~~DGk~~F~tdH~~~---~g~~~~~svsn~~ 156 (297) |.||+|+|+|||||+||+|+|++++||++|++|||+|||+||++||+++||||||||||||||+ +|++...+|||+. 
T Consensus 81 E~Tv~V~R~dIEDD~~Giy~p~~~~~G~~aa~~Pd~lvf~lL~~Gf~t~CyDGq~FFdtDHpv~~~~dg~g~~~~vsn~~ 160 (304) T protein:vir:99 81 ESTVGVKRTDIEDDNLGVYGPLMQEMGRAAGAHPDELVFALLKAGNANLCYDGQNFFDTDHPVYPNVDGTGTATTVSNLF 160 (304) T ss_pred ccccccccccccccccCchHHHHHHHHHHHhcCchhhHHHHHHhhhcccCCCcccccccCCcccccccccCcccccceec Confidence 9999999999999999999999999999999999999999999999999999999999999994 6889999999996 Q ss_pred cc---chhHHHHHHHHHHHHHHHhhcccccchhhhccccccccccccceeeeccccccccccccchhhcCCccCCHHHHH Q lcl|NC_020866. 157 GG---TGTPWFLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLYGTDARANVGFGFWQMAYGSKQTLDGTAYA 233 (297) Q Consensus 157 ag---~~~awylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~g~d~r~~~G~~l~q~a~~~~~~l~~~~l~ 233 (297) ++ ++.+|||+|+++++||||+|+|++++++++++++|++||++++|+||+|+|+|+||||||++|+|+++|+++||+ T Consensus 161 ~~~~~~g~~w~Lld~~r~iKP~I~Q~Rk~~~~~~~~~~~d~~Vf~~~e~~yGvd~R~n~GygfWQlA~gS~a~Lt~~nl~ 240 (304) T protein:vir:99 161 APAADPGAAWYLLDTSRSLKPLIYQERMKPSFTSMTKEDDEQVFMADEYRYGVRSRCNVGFGFWQLAAMSTEELNTANFE 240 (304) T ss_pred cCCCCCCCcEEEEeCCCCccceeeeccccceeeeccCCCchhhhhhcceeEeeeeeeccchhhhhhhhhcCCCcChHHHH Confidence 65 789999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHhhccCCCcccccccCeEEecchHHHHHHHHHhhhccCCCCcceecceeeEEeccccC Q lcl|NC_020866. 234 AARAALSGMKGDYGRPLGLMPNLLVVPPALEEAGRKILNSENASGGETNPWKGTAELLVVPWLA 297 (297) Q Consensus 234 aar~aM~~~k~~~G~~L~i~P~~LvVp~~le~~A~~ll~~~~~~~g~~N~~~~~~~~iv~p~La 297 (297) +||+|||+||+++|+||+|+|++|||||+||.+|+|||+++.+++|++|||+|++||||+|||. 
T Consensus 241 aAr~aMr~qk~d~G~pL~I~P~~LvVPp~LE~aA~~ll~a~~~~~G~tNp~~g~~eliV~P~Ld 304 (304) T protein:vir:99 241 KVYDAMRNQKADGGRPLDIRPNLLVVPTTLRSKAKEVVGVQRLANGADNPNFELVQVLDTAWLN 304 (304) T ss_pred HHHHHHHhhcCCCCceeccccCeEEecchHHHHHHHHHhhhccCCCCcceecceEEEEeecccC Confidence 9999999999999999999999999999999999999999999999999999999999999999 No 3 >protein:vir:79246 Length: 304 # NCBI annotation: conserved hypothetical protein # Family: family:all:776 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469162;genbank:gi:157835004;genbank:GeneID:5648827 Probab=100.00 E-value=6.5e-140 Score=783.64 Aligned_cols=297 Identities=48% Similarity=0.859 Sum_probs=291.9 Q ss_pred CC-cCHHHHHHHHHHHHHHHHHHHhhcchhhcceeeeecCCccceecccccCCCcchhcccceeeeeeccccceeeeecc Q lcl|NC_020866. 1 MQ-VTAANLDALRVGFKTSFQGALDQAPSQYLRLTTVVPSSTKEQRYGWMGKIPNVREWIGPRAIQNLTESDYSIREKPW 79 (297) Q Consensus 1 M~-i~~~~l~~l~~~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~lrew~Ge~~~~~l~~~~~~i~n~~f 79 (297) |. ||+++|++|+++||+.|++||+.+|++|++|||+|||++++|+|+|||+||+||||||||++++|++|+|+|+||+| T Consensus 1 M~ii~~~~L~~l~~~~~~~f~~~~~~a~~~~~~iA~~VpSt~~~~tY~WLg~~P~mreWiG~r~i~~l~~~~y~I~Nk~f 80 (304) T protein:vir:79 1 MAIITPALISALKTSFQKHFQDALATAPSTYLQVATVIPSTTASNTYGWLGQFPKLREWIGQRVIKDMAAQGYQITNKLF 80 (304) T ss_pred CCccCHHHHHHHHHHHHHHHHHHHhhcCcccceeEeEeecCccccccchhcccccchhhhhhhhhhhhhhccceeecccc Confidence 87 59999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cceeeccHHHhhccCcchhHHHHHHHHHHHHhhHHHHHHHHHhcccCccccCcccccccccccc---cccccceeecccc Q lcl|NC_020866. 80 ELTIGVDRDDIETDNLGIYSPLFQEMGRSAGSKWDMLVFELLKLGFATECYDGQNFFDTDHPVL---DEDGKTVTVSNTG 156 (297) Q Consensus 80 e~tv~v~R~~i~dD~lG~~~~~~~~~g~aaa~~~~~lv~~lL~~G~~~~~~DGk~~F~tdH~~~---~g~~~~~svsn~~ 156 (297) |.||+|+|+|||||+||+|+|++++||++|++|||+|||+||++||+++||||||||||||||+ +|++...+|||+. 
T Consensus 81 E~Tv~V~R~dIEDD~~Giy~p~~~~~G~~aa~~Pd~lvf~lL~~Gf~t~CyDGq~FFdtDHpv~~~~d~~g~~~~vsn~~ 160 (304) T protein:vir:79 81 ESTVGVKRTDIEDDNLGVYGPLMQEMGRAAGAHPDELVFALLKAGNANLCYDGQNFFDTDHPVYPNVDGTGTATTVSNLF 160 (304) T ss_pred ccceeeccccccccccCchHHHHHHHHHHHhcCchhhHHHHHHhhhcccCCCcccccccCCccccccccccccccceeec Confidence 9999999999999999999999999999999999999999999999999999999999999995 6889999999996 Q ss_pred cc---chhHHHHHHHHHHHHHHHhhcccccchhhhccccccccccccceeeeccccccccccccchhhcCCccCCHHHHH Q lcl|NC_020866. 157 GG---TGTPWFLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLYGTDARANVGFGFWQMAYGSKQTLDGTAYA 233 (297) Q Consensus 157 ag---~~~awylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~g~d~r~~~G~~l~q~a~~~~~~l~~~~l~ 233 (297) ++ ++.+|||+|+++++||||+|+|++++++++++++|++||++++|+||+|+|+|+||||||+||+|+++|+++||+ T Consensus 161 ~~~~~~g~~w~LlD~sr~iKP~I~Q~Rk~~~~~~~~~~~d~~Vf~~~e~~yGvd~R~n~GygfWQlA~gS~a~Ls~~nl~ 240 (304) T protein:vir:79 161 APAADPGAAWYLLDTSRSLKPLIYQERMKPSFTSLTKEDNEQVFMADEYVYGVRSRCNVGFGFWQLAAMSTEELNQVNFE 240 (304) T ss_pred cCCCCCCCeEEEEeCCCcccceeeeccccceeeecCCCCchhhhhhcceEEeeeeeeccchhhhhhhhhcCCccchHHHH Confidence 66 489999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHhhccCCCcccccccCeEEecchHHHHHHHHHhhhccCCCCcceecceeeEEeccccC Q lcl|NC_020866. 234 AARAALSGMKGDYGRPLGLMPNLLVVPPALEEAGRKILNSENASGGETNPWKGTAELLVVPWLA 297 (297) Q Consensus 234 aar~aM~~~k~~~G~~L~i~P~~LvVp~~le~~A~~ll~~~~~~~g~~N~~~~~~~~iv~p~La 297 (297) +||+|||+||+++|+||+|+|++|||||+||.+|+|||+++.+++|++|||+|++||||+|||. 
T Consensus 241 aAr~aMr~qk~d~G~pL~I~P~~LvVPp~LE~~A~~ll~a~~~~~G~tNp~~g~~eliV~P~Ld 304 (304) T protein:vir:79 241 KVYDAMRNQKADGGRPLDIRPNLLVVPTTLRSKAKEVVGVQRLANGADNPNFELVQVLDTAWLN 304 (304) T ss_pred HHHHHHHhhcCCCCceeccccCEEEecchhHHHHHHHHhhhhcCCCCcceecceEEEEeecccC Confidence 9999999999999999999999999999999999999999999999999999999999999999 No 4 >protein:vir:103886 Length: 302 # NCBI annotation: putative major head subunit protein # Family: family:all:776 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938242;genbank:gi:38229147;genbank:GeneID:2648201 Probab=100.00 E-value=2.7e-92 Score=522.57 Aligned_cols=232 Identities=38% Similarity=0.655 Sum_probs=223.9 Q ss_pred CCcCHHHHHHHHHHHHHHHHHHHhhcchhhcceeeeecCCccceecccccCCCcchhcccceeeeeeccccceeeeeccc Q lcl|NC_020866. 1 MQVTAANLDALRVGFKTSFQGALDQAPSQYLRLTTVVPSSTKEQRYGWMGKIPNVREWIGPRAIQNLTESDYSIREKPWE 80 (297) Q Consensus 1 M~i~~~~l~~l~~~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~lrew~Ge~~~~~l~~~~~~i~n~~fe 80 (297) |+||+++|++|+++||+.|++||+++|++|++||+++||+||+++|+|||+||.|+||+|||++++|+|++|+|+|++|| T Consensus 1 m~it~~~l~~l~~~~~~~~~~~y~~a~~~~~~~a~~~~sdf~~~~~~~lg~~p~l~e~~Ge~~~~~l~~~~~~i~~~~~g 80 (302) T protein:vir:10 1 MLINKQSLNAAFVAIKTIFNNAFAAAPTTWQKIAMEVPSNTSSNDYKWLSTFPKMRRWIGAKVVKNLKAYKYVVENEDFE 80 (302) T ss_pred CcccHHHHHHHHHHHHHHHHHHHHhhhhhhhceeeecCCCcceeeceecCCCCCccccccceeeccccccceeEEeeccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceeeccHHHhhccCcchhHHHHHHHHHHHHhhHHHHHHHHHhcccCccccCcccccccccccccccccceeeccccccch Q lcl|NC_020866. 81 LTIGVDRDDIETDNLGIYSPLFQEMGRSAGSKWDMLVFELLKLGFATECYDGQNFFDTDHPVLDEDGKTVTVSNTGGGTG 160 (297) Q Consensus 81 ~tv~v~R~~i~dD~lG~~~~~~~~~g~aaa~~~~~lv~~lL~~G~~~~~~DGk~~F~tdH~~~~g~~~~~svsn~~ag~~ 160 (297) ++|+|+|++|||||||+|++++++||++|++||+++||++|.+|++++|||||+|||+||+++.. 
T Consensus 81 ~~v~i~R~~i~nDdlg~~~~~~~~~G~aaa~~~~~lv~~~L~~g~~~~~~DG~~fF~~dH~~g~~--------------- 145 (302) T protein:vir:10 81 ATVEVDRNDIEDDQIGIYSPQAKMAGYSAAQLPDELVYEAVNGAFTKPCFDGQYFIDTDHPVGDA--------------- 145 (302) T ss_pred ceecccHHhhcccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCCcccCCcceeccccccccc--------------- Confidence 99999999999999999999999999999999999999999999999999999999999987642 Q ss_pred hHHHHHHHHHHHHHHHhhcccccchhhhccccccccccccceeeeccccccccccccchhhcCCccCCHHHHHHHHHHHH Q lcl|NC_020866. 161 TPWFLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLYGTDARANVGFGFWQMAYGSKQTLDGTAYAAARAALS 240 (297) Q Consensus 161 ~awylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~g~d~r~~~G~~l~q~a~~~~~~l~~~~l~aar~aM~ 240 (297) .++|.|+++|+++ ..+++.++|.++|++|+ T Consensus 146 -----------------------------------------------~~~N~g~~~~~~~---~~~l~~~~~~aa~~am~ 175 (302) T protein:vir:10 146 -----------------------------------------------SVSNKGTAPLSNA---SQAAAKAGYGAARTAMK 175 (302) T ss_pred -----------------------------------------------ccccccchhhhhc---ccccchHHHHHHHHHHH Confidence 3567778888765 56899999999999999 Q ss_pred hhccCCCcccccccCeEEecchHHHHHHHHHhhhccCCCCcceecceeeEEeccccC Q lcl|NC_020866. 241 GMKGDYGRPLGLMPNLLVVPPALEEAGRKILNSENASGGETNPWKGTAELLVVPWLA 297 (297) Q Consensus 241 ~~k~~~G~~L~i~P~~LvVp~~le~~A~~ll~~~~~~~g~~N~~~~~~~~iv~p~La 297 (297) +||+++|++|+|+|++|||||+||..|++|+.++++++|++|||+|++++||+|||. 
T Consensus 176 ~~k~~~G~~L~i~P~~LiVp~~le~~A~~ll~~~~~~~g~~Np~~g~~~~vv~p~L~ 232 (302) T protein:vir:10 176 KFKDEEGRSLNVSPNVLLVGPALEDVAKMLLTNPKLADNTPNPYVGTAELVVDGRIE 232 (302) T ss_pred HHhhhcccccccCCCEEEecchhHHHHHHHhhccccCCCCcceeccceEEEEeeccC Confidence 999999999999999999999999999999999999999999999999999999997 No 5 >protein:vir:95512 Length: 693 # NCBI annotation: Putative Clp protease # Family: family:all:62 # ACLAME annotation(s): go:0008236 - serine-type peptidase activity; phi:0000017 - phage prohead/capsid assembly # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293349;genbank:gi:148912770;genbank:GeneID:5228164 Probab=100.00 E-value=8.3e-65 Score=371.89 Aligned_cols=219 Identities=22% Similarity=0.312 Sum_probs=191.7 Q ss_pred CCcC--HHHHHHH-HHHHHHHHHHHHhhcchhhcceeeeec-CCccceecccccCCCcchhcc--cceeeeeecccccee Q lcl|NC_020866. 1 MQVT--AANLDAL-RVGFKTSFQGALDQAPSQYLRLTTVVP-SSTKEQRYGWMGKIPNVREWI--GPRAIQNLTESDYSI 74 (297) Q Consensus 1 M~i~--~~~l~~l-~~~~~~~f~~a~~~a~~~~~~~a~~v~-S~~~~~~y~~Lg~~P~lrew~--Ge~~~~~l~~~~~~i 74 (297) |.++ ..-.-.| --.+||.+.++|+.+|+||++||.+.. +|||+.+..+||+||.|+++. ||+++++++|.+++| T Consensus 394 ~a~~htTSDFp~IL~~~~nk~l~~~y~~a~~t~~~~~~~~~~~DFk~~~~~~lg~~~~L~~V~E~gEyk~~t~~e~~e~~ 473 (693) T protein:vir:95 394 LAFTHTSSDFGLILLDVANKSVLAGWEEAEETFPLWTKSGILTDFKPARRVGLGEFSSLRQVREGAEYKYVTLGERGEQI 473 (693) T ss_pred HHHhcCcchhHHHHHHHHHHHHHHHHHhhhhHHHHHhccCCCCcccccceeecCCCCChhhcCCCCceeeeecCCcccee Confidence 2221 1111111 124799999999999999999997654 899999999999999999985 999999999999999 Q ss_pred eeecccceeeccHHHhhccCcchhHHHHHHHHHHHHhhHHHHHHHHHhcccCccccCcccccccccccccccccceeecc Q lcl|NC_020866. 75 REKPWELTIGVDRDDIETDNLGIYSPLFQEMGRSAGSKWDMLVFELLKLGFATECYDGQNFFDTDHPVLDEDGKTVTVSN 154 (297) Q Consensus 75 ~n~~fe~tv~v~R~~i~dD~lG~~~~~~~~~g~aaa~~~~~lv~~lL~~G~~~~~~DGk~~F~tdH~~~~g~~~~~svsn 154 (297) +.++||++|+||||+|||||||+|+++++.||++|+++++++||.+|.+ |++|+|||+|||+||+|+... 
T Consensus 474 ~l~tyG~~~~iTRqaiINDDLga~~~ip~~~g~aA~~~~~~~vy~~L~~--Np~m~DGk~LFhadH~Nl~tg-------- 543 (693) T protein:vir:95 474 ILATYGELFSITRQAIINDDLQMLSDIPFKLGQAAKATIGDLVYAVLTG--NPAMSDGKTLFHADHSNLLTG-------- 543 (693) T ss_pred ehhhcCCeeeecHHhhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHhc--CccccCCcceeeccccccccc-------- Confidence 9999999999999999999999999999999999999999999999998 999999999999999885310 Q ss_pred ccccchhHHHHHHHHHHHHHHHhhcccccchhhhccccccccccccceeeeccccccccccccchhhcCCccCCHHHHHH Q lcl|NC_020866. 155 TGGGTGTPWFLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLYGTDARANVGFGFWQMAYGSKQTLDGTAYAA 234 (297) Q Consensus 155 ~~ag~~~awylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~g~d~r~~~G~~l~q~a~~~~~~l~~~~l~a 234 (297) ..++++++++++ T Consensus 544 --------------------------------------------------------------------a~sals~~sl~~ 555 (693) T protein:vir:95 544 --------------------------------------------------------------------AASALSIDSLSK 555 (693) T ss_pred --------------------------------------------------------------------cccccChHHHHH Confidence 023678999999 Q ss_pred HHHHHHhhccC----CCcccccccCeEEecchHHHHHHHHHhhhccC-----CCCcceecceeeEEeccccC Q lcl|NC_020866. 235 ARAALSGMKGD----YGRPLGLMPNLLVVPPALEEAGRKILNSENAS-----GGETNPWKGTAELLVVPWLA 297 (297) Q Consensus 235 ar~aM~~~k~~----~G~~L~i~P~~LvVp~~le~~A~~ll~~~~~~-----~g~~N~~~~~~~~iv~p~La 297 (297) +|++|++||+. +|++|+|+|+||||||+||++|+||++++.++ .|..|||++.++||++|||. 
T Consensus 556 a~~am~~qk~~~~~~~g~~L~i~P~~llvP~~le~~a~~l~~s~~~~~a~~~~~~~NP~~~~~~vi~~prL~ 627 (693) T protein:vir:95 556 AKTQMATQKAQVEKGKGRTLNIRPGFVLTPVALEDKANQIINSESVPGADVNSGIVNPIRAFAQVIGEPRLD 627 (693) T ss_pred HHHHHHHhhcchhccCCceeecccceEEecchHHHHHHHHhccccccccccccccccchhccccccccceec Confidence 99999999964 68899999999999999999999999987754 45689999999999999994 No 6 >protein:vir:79548 Length: 652 # NCBI annotation: putative protease/scaffold protein # Family: family:all:62 # ACLAME annotation(s): go:0008236 - serine-type peptidase activity; phi:0000017 - phage prohead/capsid assembly # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272518;genbank:gi:148609387;genbank:GeneID:5204384 Probab=100.00 E-value=3.5e-62 Score=357.54 Aligned_cols=216 Identities=21% Similarity=0.288 Sum_probs=189.5 Q ss_pred CCcC--HHHH-HHHHHHHHHHHHHHHhhcchhhcceeeeec-CCccceecccccCCCcchhcc--cceeeeeecccccee Q lcl|NC_020866. 1 MQVT--AANL-DALRVGFKTSFQGALDQAPSQYLRLTTVVP-SSTKEQRYGWMGKIPNVREWI--GPRAIQNLTESDYSI 74 (297) Q Consensus 1 M~i~--~~~l-~~l~~~~~~~f~~a~~~a~~~~~~~a~~v~-S~~~~~~y~~Lg~~P~lrew~--Ge~~~~~l~~~~~~i 74 (297) +.++ ..-. .-|--.+||.+.++|+.+|+||++||.+.. +|||+.+...||+||.|+++. ||+++++++|.+++| T Consensus 359 ~A~~hsTsDFp~IL~~~~nk~l~~~y~~a~~t~~~~~~~~~~~DFk~~~~~~lg~~~~L~~V~E~gEyk~~t~~e~~e~~ 438 (652) T protein:vir:79 359 AAFTHSTSDFGNILLDVANKAILQGWEDAPETYEQWTRKGQLSDFKIAHRVGMGGFSALRQVREGAEYKYVTTGDKQATI 438 (652) T ss_pred HHhhcCcchHHHHHHHHHHHHHHHHHhhhHHHHHHHhccCCCccccccceeecCCCCCccccCCCCccceeeecCcccee Confidence 2221 1111 112224699999999999999999997755 899999999999999999985 999999999999999 Q ss_pred eeecccceeeccHHHhhccCcchhHHHHHHHHHHHHhhHHHHHHHHHhcccCcccc-Cccccc-ccccccccccccceee Q lcl|NC_020866. 
75 REKPWELTIGVDRDDIETDNLGIYSPLFQEMGRSAGSKWDMLVFELLKLGFATECY-DGQNFF-DTDHPVLDEDGKTVTV 152 (297) Q Consensus 75 ~n~~fe~tv~v~R~~i~dD~lG~~~~~~~~~g~aaa~~~~~lv~~lL~~G~~~~~~-DGk~~F-~tdH~~~~g~~~~~sv 152 (297) +.+|||+.|+||||+|+|||||+|+++++.||++|+++++++||.+|.+ ||+++ |||+|| |+||.|+.+ T Consensus 439 ~l~tyG~~~~iTRqaiINDDL~a~~~ip~~~g~aA~~~~~~~vy~~l~~--Np~~~~DGk~LF~hA~H~Nl~~------- 509 (652) T protein:vir:79 439 ALATYGELFSITRQAIINDDLNMLTDVPMKLGRAAKSTIADLVYAILTS--NPKISTDNVSLFDKAKHANVLE------- 509 (652) T ss_pred eeecccCeeeeehheeeccchhHHHHHHHHHHHHHHHHHHHHHHHHHhc--CcccccCCceeecccccccccc------- Confidence 9999999999999999999999999999999999999999999999999 99996 999999 899987642 Q ss_pred ccccccchhHHHHHHHHHHHHHHHhhcccccchhhhccccccccccccceeeeccccccccccccchhhcCCccCCHHHH Q lcl|NC_020866. 153 SNTGGGTGTPWFLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLYGTDARANVGFGFWQMAYGSKQTLDGTAY 232 (297) Q Consensus 153 sn~~ag~~~awylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~g~d~r~~~G~~l~q~a~~~~~~l~~~~l 232 (297) .++++++++ T Consensus 510 -----------------------------------------------------------------------~aa~~~~~l 518 (652) T protein:vir:79 510 -----------------------------------------------------------------------SAAMDVASL 518 (652) T ss_pred -----------------------------------------------------------------------cccCCHHHH Confidence 125778999 Q ss_pred HHHHHHHHhhccCCCcccccccCeEEecchHHHHHHHHHhhhccC-----CCCcceecceeeEEeccccC Q lcl|NC_020866. 233 AAARAALSGMKGDYGRPLGLMPNLLVVPPALEEAGRKILNSENAS-----GGETNPWKGTAELLVVPWLA 297 (297) Q Consensus 233 ~aar~aM~~~k~~~G~~L~i~P~~LvVp~~le~~A~~ll~~~~~~-----~g~~N~~~~~~~~iv~p~La 297 (297) ++||++|++||+ ++++|+|+|+||||||+||++|+||+.+..++ +|..||+++.++|||+|||. 
T Consensus 519 ~~ar~aM~~Qk~-g~~~l~i~P~~llvp~~le~~a~~ll~s~~v~~a~~~~~~~Np~~~~~~~i~eprL~ 587 (652) T protein:vir:79 519 DKARQLMRVQKE-GERHLNIRPAFVLVPTAMESVANQVIRSSSVKGADINAGIINPVKDFATVIAEPRLD 587 (652) T ss_pred HHHHHHHHHhcc-CCccccccccEEEecchhHHHHHHHhccCCCcccccccccccccccccccccccccC Confidence 999999999996 44689999999999999999999999876653 46799999999999999995 No 7 >protein:vir:103886 Length: 302 # NCBI annotation: putative major head subunit protein # Family: family:all:776 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938242;genbank:gi:38229147;genbank:GeneID:2648201 Probab=99.78 E-value=1.3e-24 Score=151.58 Aligned_cols=142 Identities=34% Similarity=0.507 Sum_probs=114.2 Q ss_pred CCcCHHHHHHHHHHHHHHHHHHHhhcchhhcceeeeecCCccceecccccCCCcchhcccceeeeeeccccceeeeeccc Q lcl|NC_020866. 1 MQVTAANLDALRVGFKTSFQGALDQAPSQYLRLTTVVPSSTKEQRYGWMGKIPNVREWIGPRAIQNLTESDYSIREKPWE 80 (297) Q Consensus 1 M~i~~~~l~~l~~~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~lrew~Ge~~~~~l~~~~~~i~n~~fe 80 (297) ..+++++|++...++++ T Consensus 160 ~~l~~~~~~aa~~am~~--------------------------------------------------------------- 176 (302) T protein:vir:10 160 QAAAKAGYGAARTAMKK--------------------------------------------------------------- 176 (302) T ss_pred cccchHHHHHHHHHHHH--------------------------------------------------------------- Confidence 23333333333333322 Q ss_pred ceeeccHHHhhccCcchhHHHHHHHHHHHHhhHHHHHHH-HHhcccCccccCcccccccccccccccccceeeccccccc Q lcl|NC_020866. 81 LTIGVDRDDIETDNLGIYSPLFQEMGRSAGSKWDMLVFE-LLKLGFATECYDGQNFFDTDHPVLDEDGKTVTVSNTGGGT 159 (297) Q Consensus 81 ~tv~v~R~~i~dD~lG~~~~~~~~~g~aaa~~~~~lv~~-lL~~G~~~~~~DGk~~F~tdH~~~~g~~~~~svsn~~ag~ 159 (297) +....|+....+|+.||+. .|..+....|++++++++++||+.. . 
...+++..-++ T Consensus 177 --------------------~k~~~G~~L~i~P~~LiVp~~le~~A~~ll~~~~~~~g~~Np~~g-~--~~~vv~p~L~s 233 (302) T protein:vir:10 177 --------------------FKDEEGRSLNVSPNVLLVGPALEDVAKMLLTNPKLADNTPNPYVG-T--AELVVDGRIES 233 (302) T ss_pred --------------------HhhhcccccccCCCEEEecchhHHHHHHHhhccccCCCCcceecc-c--eEEEEeeccCC Confidence 3444578889999999995 7888888999999999999999863 2 23344443445 Q ss_pred hhHHHHHHHHHHHHHHHhhcccccchhhhccccccccccccceeeeccccccccccccchhhcCCccCC Q lcl|NC_020866. 160 GTPWFLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLYGTDARANVGFGFWQMAYGSKQTLD 228 (297) Q Consensus 160 ~~awylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~g~d~r~~~G~~l~q~a~~~~~~l~ 228 (297) +.+|||++.+++++|+++|.++.|+++.+++++++.||++++|.||+|+|+++||+|||++|+|+++-+ T Consensus 234 ~~aWyL~a~~~~i~~~~l~g~~~P~~~~~~~~~~dgv~~k~~~d~Gvd~R~~~G~~~wq~a~~s~g~~~ 302 (302) T protein:vir:10 234 DTAWFLLDTTKPVKPFIFQPRKQPEFVSQVNLDSDDVFNLRKLKFGAEARAAAGYGFWQLAYGSTGTGA 302 (302) T ss_pred CCceEEEecCCccceEEEcCccccEEEeccCCCCCceEEEEEEEEeeeeeeecchhhhhhhhccCccCC Confidence 678999999999999999999999999999999999999999999999999999999999999988766 No 8 >protein:vir:108211 Length: 318 # NCBI annotation: gp9 # Family: family:all:6420 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552338;genbank:gi:160700658;genbank:GeneID:5758931 Probab=97.39 E-value=1.5e-05 Score=47.06 Aligned_cols=206 Identities=13% Similarity=0.091 Sum_probs=102.8 Q ss_pred CCcCHHHHH-HHHHHHHHHHHHHHhhcchhhcceeeeecC-Cccceecc-----c-ccCCCcchhcccceeeeeeccccc Q lcl|NC_020866. 1 MQVTAANLD-ALRVGFKTSFQGALDQAPSQYLRLTTVVPS-STKEQRYG-----W-MGKIPNVREWIGPRAIQNLTESDY 72 (297) Q Consensus 1 M~i~~~~l~-~l~~~~~~~f~~a~~~a~~~~~~~a~~v~S-~~~~~~y~-----~-Lg~~P~lrew~Ge~~~~~l~~~~~ 72 (297) |+=+|..|. .+...+...|- ++.-+++. .+ ...+-.|. 
+ -++.=..-| -||+..-......- T Consensus 21 ll~~P~~I~~~i~e~~~~~~i-----ad~lf~~~----~a~~~~~v~f~~~~p~~~~~d~e~VaE-ggEiP~~~~~~G~~ 90 (318) T protein:vir:10 21 LVGNPLWIPTALKKMMVNQFI-----SESLFRNG----GANPNGVVAYNEGNPSFLEDDVADVAE-FGEIPVSAGARGLP 90 (318) T ss_pred hhCCchhHHHHHHHHHhccch-----hhhhhhcc----cccccceeEEEecccccccCcHhhccC-cccccccCCCCCch Confidence 333344432 22111111111 11111111 11 00111110 0 011111111 25555544444333 Q ss_pred ee-eeecccceeeccHHHhhccCcchhHHHHHHHHHHHHhhHHHHHHHHHhcccCccccCccccccccccccccccccee Q lcl|NC_020866. 73 SI-REKPWELTIGVDRDDIETDNLGIYSPLFQEMGRSAGSKWDMLVFELLKLGFATECYDGQNFFDTDHPVLDEDGKTVT 151 (297) Q Consensus 73 ~i-~n~~fe~tv~v~R~~i~dD~lG~~~~~~~~~g~aaa~~~~~lv~~lL~~G~~~~~~DGk~~F~tdH~~~~g~~~~~s 151 (297) .| +.++||..+.|+++++..-+.+.+.+.+++++.+.+++-|..+|+.|.++..+.. +.-.++.. T Consensus 91 ~ia~~~K~G~~~~vS~Em~~~n~~~~v~r~~~~l~Nti~r~~d~~a~dal~sa~t~~~-----------~~s~~w~~--- 156 (318) T protein:vir:10 91 RTAFAVKKALGVRVSKEMIDENRVGAVNDQMLQLRNTFIRANDRSAKALLQSPIVPTL-----------AVPTAWDN--- 156 (318) T ss_pred hhhhhehhccceeccHHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccc-----------cCCcCCCC--- Confidence 33 4579999999999999999999999999999999999999999999988543321 11100000 Q ss_pred eccccccchhHHHHHHHHHHHHHHHhhcccccchhhhccccccccccccceeeeccccccccccccchhhcCCccCCHHH Q lcl|NC_020866. 152 VSNTGGGTGTPWFLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLYGTDARANVGFGFWQMAYGSKQTLDGTA 231 (297) Q Consensus 152 vsn~~ag~~~awylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~g~d~r~~~G~~l~q~a~~~~~~l~~~~ 231 (297) -++...+.. .-.++ T Consensus 157 ~~~~~~d~~------------------------------------------------------------------~A~e~ 170 (318) T protein:vir:10 157 GGKVRTDIA------------------------------------------------------------------IAIEQ 170 (318) T ss_pred cccccccch------------------------------------------------------------------hhhhh Confidence 000000000 00111 Q ss_pred HHHHHH-HHHhhccCCCcccccccCeEEecchHHHHH------HHHHhhhcc-----CCCCcce---ecceeeEEecccc Q lcl|NC_020866. 
232 YAAARA-ALSGMKGDYGRPLGLMPNLLVVPPALEEAG------RKILNSENA-----SGGETNP---WKGTAELLVVPWL 296 (297) Q Consensus 232 l~aar~-aM~~~k~~~G~~L~i~P~~LvVp~~le~~A------~~ll~~~~~-----~~g~~N~---~~~~~~~iv~p~L 296 (297) +..|+. ++-..........+-+|+.||++|.....- +.++..+.. ...+.+- +.| +++|++|.+ T Consensus 171 v~~a~~~~~~a~~~~~~~~~GY~pdtIVlhP~~~~~l~~n~~~~~~y~~~a~~~~~~~~~tg~~~g~~lG-l~vi~s~~~ 249 (318) T protein:vir:10 171 ISTAAPTAYPAGVGSSDEYFGFIPDTIVMHYALLPILMDNENFMKVYERNANYVSTAPDWTGNFPGSVMG-LNVIRSRTF 249 (318) T ss_pred hhhhhhhhhhhhhhhhhhccCccceeeEECHHHHHHHhcchhhhhhhhccchhhhhcccccccccceeec-eEEeecCcc Confidence 111111 111122222357889999999999988876 444432221 1112222 123 899999999 Q ss_pred C Q lcl|NC_020866. 297 A 297 (297) Q Consensus 297 a 297 (297) . T Consensus 250 p 250 (318) T protein:vir:10 250 P 250 (318) T ss_pred C Confidence 9 No 9 >protein:vir:95512 Length: 693 # NCBI annotation: Putative Clp protease # Family: family:all:62 # ACLAME annotation(s): go:0008236 - serine-type peptidase activity; phi:0000017 - phage prohead/capsid assembly # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293349;genbank:gi:148912770;genbank:GeneID:5228164 Probab=96.19 E-value=1.2e-07 Score=58.52 Aligned_cols=250 Identities=13% Similarity=0.052 Sum_probs=112.5 Q ss_pred CCcCHHHHHHHHHHHHHHHHHHHhhcchhhccee----------ee---ecCCccceecc-----cccCCCcchhcccce Q lcl|NC_020866. 1 MQVTAANLDALRVGFKTSFQGALDQAPSQYLRLT----------TV---VPSSTKEQRYG-----WMGKIPNVREWIGPR 62 (297) Q Consensus 1 M~i~~~~l~~l~~~~~~~f~~a~~~a~~~~~~~a----------~~---v~S~~~~~~y~-----~Lg~~P~lrew~Ge~ 62 (297) =++..+.-++|.. |.- .+=..++..|+-.. 
.+ +.+-.+.+..+ =-+|||.+.+=+..+ T Consensus 338 ~~~~d~~~~al~~--R~g--~~~~~~~n~~~g~~L~elAr~~L~~rg~~~~~~~~~~~~~~a~~htTSDFp~IL~~~~nk 413 (693) T protein:vir:95 338 NLVGDSVRASVLA--RIG--RGERQADNAYNGMTLRELARASLVDRGIGVASLNAPQMVGLAFTHTSSDFGLILLDVANK 413 (693) T ss_pred hHHHHHHHHHHHH--hcC--cccccCCccccCCcHHHHHHHHHHhcCCccCCCCHHHHHHHHHhcCcchhHHHHHHHHHH Confidence 0000000000000 000 00000000011000 00 00000000000 124666553322222 Q ss_pred eeeeeccccceeeeecccceeeccHHHhhccCcchhHHHHHHHHHHHHhhHHHHHHHHHhcccCccccCccccccccccc Q lcl|NC_020866. 63 AIQNLTESDYSIREKPWELTIGVDRDDIETDNLGIYSPLFQEMGRSAGSKWDMLVFELLKLGFATECYDGQNFFDTDHPV 142 (297) Q Consensus 63 ~~~~l~~~~~~i~n~~fe~tv~v~R~~i~dD~lG~~~~~~~~~g~aaa~~~~~lv~~lL~~G~~~~~~DGk~~F~tdH~~ 142 (297) .+ -.+|.-.-.+|..-.. |..+-| |.+.-. +. .++|... -+| T Consensus 414 ~l----~~~y~~a~~t~~~~~~--~~~~~D-----Fk~~~~--------------~~--lg~~~~L-----------~~V 455 (693) T protein:vir:95 414 SV----LAGWEEAEETFPLWTK--SGILTD-----FKPARR--------------VG--LGEFSSL-----------RQV 455 (693) T ss_pred HH----HHHHHhhhhHHHHHhc--cCCCCc-----ccccce--------------ee--cCCCCCh-----------hhc Confidence 21 1233333333333221 111111 221110 00 0111100 011 Q ss_pred c-cccccceeeccccccchhHHHHHHHHHHHHHHHhhcccccchhhh--ccc----cccccccccceeeecccccccccc Q lcl|NC_020866. 143 L-DEDGKTVTVSNTGGGTGTPWFLLDTTRALKPIILQKRRDFQFVSK--TKL----DDDHVFMNKEFLYGTDARANVGFG 215 (297) Q Consensus 143 ~-~g~~~~~svsn~~ag~~~awylld~s~alkpiI~q~r~~~~~~~~--~~~----~~~~vf~~~~~~~g~d~r~~~G~~ 215 (297) . .|++++.+++.-.......+|+..+++++|.|||+|...++-++. .+. -++.|| .++..+..+.||+. 
T Consensus 456 ~E~gEyk~~t~~e~~e~~~l~tyG~~~~iTRqaiINDDLga~~~ip~~~g~aA~~~~~~~vy----~~L~~Np~m~DGk~ 531 (693) T protein:vir:95 456 REGAEYKYVTLGERGEQIILATYGELFSITRQAIINDDLQMLSDIPFKLGQAAKATIGDLVY----AVLTGNPAMSDGKT 531 (693) T ss_pred CCCCceeeeecCCccceeehhhcCCeeeecHHhhhccchHHHHHHHHHHHHHHHHHHHHHHH----HHHhcCccccCCcc Confidence 1 256666777766667778999999999999999999999887653 222 233333 22345778999999 Q ss_pred ccchhhcCC--ccCCHHHHHHHHHHHHhhccCCCcccccccCeEEecchHHHH--HHHHHhhhccCCCCcceecceeeEE Q lcl|NC_020866. 216 FWQMAYGSK--QTLDGTAYAAARAALSGMKGDYGRPLGLMPNLLVVPPALEEA--GRKILNSENASGGETNPWKGTAELL 291 (297) Q Consensus 216 l~q~a~~~~--~~l~~~~l~aar~aM~~~k~~~G~~L~i~P~~LvVp~~le~~--A~~ll~~~~~~~g~~N~~~~~~~~i 291 (297) |||++|+|. ++-+.-++...-++...+..-.+..-.-..+.|-+.|.+--. +.+. .++++-+..+-|-.+.-.-+ T Consensus 532 LFhadH~Nl~tga~sals~~sl~~a~~am~~qk~~~~~~~g~~L~i~P~~llvP~~le~-~a~~l~~s~~~~~a~~~~~~ 610 (693) T protein:vir:95 532 LFHADHSNLLTGAASALSIDSLSKAKTQMATQKAQVEKGKGRTLNIRPGFVLTPVALED-KANQIINSESVPGADVNSGI 610 (693) T ss_pred eeeccccccccccccccChHHHHHHHHHHHHhhcchhccCCceeecccceEEecchHHH-HHHHHhcccccccccccccc Confidence 999999996 456777887777777777777776433345667677765433 2222 22232222333322222234 Q ss_pred eccccC Q lcl|NC_020866. 292 VVPWLA 297 (297) Q Consensus 292 v~p~La 297 (297) |.|+-. T Consensus 611 ~NP~~~ 616 (693) T protein:vir:95 611 VNPIRA 616 (693) T ss_pred ccchhc Confidence 566433 No 10 >protein:vir:79928 Length: 393 # NCBI annotation: major head protein # Family: family:all:30335 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429616;genbank:gi:156564106;genbank:GeneID:5525693 Probab=96.00 E-value=7.9e-05 Score=43.09 Aligned_cols=228 Identities=13% Similarity=0.174 Sum_probs=110.9 Q ss_pred CCcCHHHHHHHHHHHHH----------------------------HHH-HHHhhcchhhc--ceeeeec-CCccceeccc Q lcl|NC_020866. 
1 MQVTAANLDALRVGFKT----------------------------SFQ-GALDQAPSQYL--RLTTVVP-SSTKEQRYGW 48 (297) Q Consensus 1 M~i~~~~l~~l~~~~~~----------------------------~f~-~a~~~a~~~~~--~~a~~v~-S~~~~~~y~~ 48 (297) ..+++++++-+- .|-| +.- --.++|+|-|- ++...+. ...++..+.- T Consensus 42 ~~~~~~e~el~E-~f~Kmm~G~~p~~eV~~~e~mtt~~a~IliP~vis~v~~Eaaepl~~~~kl~qk~~L~~Grsm~F~~ 120 (393) T protein:vir:79 42 LALNEEETQILE-SFAKMMEGETPTNEVNLREFMATPSAQILIPRVIVGTMREAAEPLYIGTKMLQKIRLKSGQSMIFPS 120 (393) T ss_pred hhcchhHHHHHH-HHHHHhcCCCchhheehhhhhcCCCcceechhhhhhhhhhcccchhHHHHHHHHHhhhcCcceeccc Confidence 333333322210 0000 000 01233344331 1222222 2344444443 Q ss_pred ccCCCcchhc-c---cceeeeeec---cccceeeeecccceeeccHHHhhccCcchhHHHHHHHHHHHHhhHHHHHHHHH Q lcl|NC_020866. 49 MGKIPNVREW-I---GPRAIQNLT---ESDYSIREKPWELTIGVDRDDIETDNLGIYSPLFQEMGRSAGSKWDMLVFELL 121 (297) Q Consensus 49 Lg~~P~lrew-~---Ge~~~~~l~---~~~~~i~n~~fe~tv~v~R~~i~dD~lG~~~~~~~~~g~aaa~~~~~lv~~lL 121 (297) +| .||+. | ||+.-.+|+ +..-.++.+++|-.|+++-++|.|-++.+..-+...+|++-+++-++.||..+ T Consensus 121 ~g---~~Ra~~IgEGgE~~~~sld~~T~dsv~~~~gK~G~~Ia~SqEmIsDSg~Dvin~~l~aA~RaMaRkKee~a~n~f 197 (393) T protein:vir:79 121 IG---IMRAYDVAEGQEIPEDSIDWQTHESPEIRVGKSGIRLRFTDEMISDSQWDLMSMMIKQAGRAMGRHKEQKAYHQF 197 (393) T ss_pred hh---eeeeccccccccccccchhhhcCCceeEEechhhhhhhhHHHHhhcchHHHHHHHHHHHHHHHHhhhHHHHHhhh Confidence 33 67776 3 777777777 34567889999999999999999999999999999999999999999999999 Q ss_pred hcccCccccCccccccc---ccccccccccceeeccccccchhHHHHHHHHHHHHHHHhhcccccchhhhcccccccccc Q lcl|NC_020866. 
122 KLGFATECYDGQNFFDT---DHPVLDEDGKTVTVSNTGGGTGTPWFLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFM 198 (297) Q Consensus 122 ~~G~~~~~~DGk~~F~t---dH~~~~g~~~~~svsn~~ag~~~awylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~ 198 (297) ++ .....||| |.+ .||.+-+ ..+...|+-.+==++|...++.|- --...+|+ T Consensus 198 k~-~ghtvfDa---~st~t~ahptGr~------~~~~qNGTlSleDllDm~~av~~~---------------hyt~svi~ 252 (393) T protein:vir:79 198 RS-HGHTVFDN---YSTNKLAHTTGLD------KNGVQNDTFSAEDFLDLIIAVMAN---------------EYTPSDLM 252 (393) T ss_pred hc-ccceeeec---cccCccceeecCC------ccccccccccHHHHHHHHHHHhcc---------------cCCcceEE Confidence 77 34455776 333 3555421 112344443344455555555542 11222222 Q ss_pred ccceeeeccccccccccccchhhcCCccCCHHHHHHHHHHHHhhccCCCcccccccCeEEecchHHHHHHHHHhhhccCC Q lcl|NC_020866. 199 NKEFLYGTDARANVGFGFWQMAYGSKQTLDGTAYAAARAALSGMKGDYGRPLGLMPNLLVVPPALEEAGRKILNSENASG 278 (297) Q Consensus 199 ~~~~~~g~d~r~~~G~~l~q~a~~~~~~l~~~~l~aar~aM~~~k~~~G~~L~i~P~~LvVp~~le~~A~~ll~~~~~~~ 278 (297) -..+-|-+=++.+.=-++.+.+++|-.+-...+ ..|-.-....|+ T Consensus 253 MHPLAWnv~AKna~me~~~~na~gN~~~~~~~t-----s~algp~~i~~~------------------------------ 297 (393) T protein:vir:79 253 MHPLAWTVFAKNELMGSLQANPYGNYPAKGAPS-----SMALGPDSIQGR------------------------------ 297 (393) T ss_pred EcCchhhhhhhhhhhcceeeccccccCccccch-----hhhhchhhhccc------------------------------ Confidence 222222211111111123333333211100000 000000011122 Q ss_pred CCcceecceeeEEeccccC Q lcl|NC_020866. 
279 GETNPWKGTAELLVVPWLA 297 (297) Q Consensus 279 g~~N~~~~~~~~iv~p~La 297 (297) +-=.++|+++|..+ T Consensus 298 -----~~~nlnv~~sPfvp 311 (393) T protein:vir:79 298 -----LPFNFNVNLSPFIP 311 (393) T ss_pred -----cccceeEEEecccc Confidence 11125566666665 No 11 >protein:vir:1638 Length: 298 # NCBI annotation: Structural protein # Family: family:all:966 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695059;genbank:gi:23455750;genbank:GeneID:955469 Probab=95.60 E-value=0.00098 Score=37.07 Aligned_cols=227 Identities=14% Similarity=0.131 Sum_probs=104.5 Q ss_pred CCcCHHHH--HHHHHHHHHHHHHHHhhcchhhcceeeeecCCccceecccccCCCcchhcccceee---eeeccccceee Q lcl|NC_020866. 1 MQVTAANL--DALRVGFKTSFQGALDQAPSQYLRLTTVVPSSTKEQRYGWMGKIPNVREWIGPRAI---QNLTESDYSIR 75 (297) Q Consensus 1 M~i~~~~l--~~l~~~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~lrew~Ge~~~---~~l~~~~~~i~ 75 (297) |..+...| ..+. +..+ +.... .+..+++|+++|......++.++..-|.- .|+||-.- .++.=..-++. T Consensus 1 ma~~gG~lvp~~~~---~~ii-~~~~~-~s~i~~l~~~~~~~~~~~~ip~~~~~~~a-~~v~E~~~~~~~~~~f~~v~l~ 74 (298) T protein:vir:16 1 MVLNKGTLFDPTLV---TDLI-SKVAG-KSSIARLSAQKPIPFNGEKVFTFTMDSEI-DVVAESGKKTHGGVTLAPQTMV 74 (298) T ss_pred CcccCcceechhHH---HHHH-HHHHh-hhhhhhhcceeeccCCceEEEEEecCcce-EEecCCccccccccceeEEEEe Confidence 88765443 2221 2222 22222 24467778787766565667777666664 79877432 33333445678 Q ss_pred eecccceeeccHHHh---hccCcchhHHHHHHHHHHHHhhHHHHHHHHHhcccCccccCcc--cccccccccccccccce Q lcl|NC_020866. 76 EKPWELTIGVDRDDI---ETDNLGIYSPLFQEMGRSAGSKWDMLVFELLKLGFATECYDGQ--NFFDTDHPVLDEDGKTV 150 (297) Q Consensus 76 n~~fe~tv~v~R~~i---~dD~lG~~~~~~~~~g~aaa~~~~~lv~~lL~~G~~~~~~DGk--~~F~tdH~~~~g~~~~~ 150 (297) .++++..+.|+++.+ .++..++..-+...++++.++..++.++. |.++. +|+ .+....+... ... 
T Consensus 75 ~~k~a~~~~iS~ell~~s~d~~~~l~~~i~~~la~ai~~~~d~~~l~----G~~~~--~g~~~~~~~~~~~~~----~~~ 144 (298) T protein:vir:16 75 PIKVEYGARISDEFMYASDEEKINILQEFNDGFAKKVARGIDLMAFH----GVNPR--LGTASAVIGTNHFDS----KVT 144 (298) T ss_pred eeeEEEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHHhhc----cccCC--CCccccccccccccc----ccc Confidence 899999999999999 46678899999999999999998876663 32221 111 1111111000 000 Q ss_pred eeccccccchhHHHHHHHHHHHHHHHhhcccccchhhhccccccccccccceeeeccccccccccccchhhcCCccCCHH Q lcl|NC_020866. 151 TVSNTGGGTGTPWFLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLYGTDARANVGFGFWQMAYGSKQTLDGT 230 (297) Q Consensus 151 svsn~~ag~~~awylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~g~d~r~~~G~~l~q~a~~~~~~l~~~ 230 (297) ........... . ..+....+..+.-..+. ..+ |=+ + T Consensus 145 ~~~~~~~~~~~-~-~~~i~~~~~~~~~~~~~-----------------------------~~~---~vm--------n-- 180 (298) T protein:vir:16 145 QKVEAPRGIAD-P-NGAIENAVELLTGVDAD-----------------------------VTG---IAI--------N-- 180 (298) T ss_pred ccccccccccc-H-HHHHHHHHHHhhhcCCC-----------------------------ccE---EEE--------c-- Confidence 00000000111 1 11222222211100000 000 111 1 Q ss_pred HHHHHHHHHHhhccCCCccccc------ccCeEEecchHHHHHHHHHhhhccCCC-Cc---ceecce------------e Q lcl|NC_020866. 231 AYAAARAALSGMKGDYGRPLGL------MPNLLVVPPALEEAGRKILNSENASGG-ET---NPWKGT------------A 288 (297) Q Consensus 231 ~l~aar~aM~~~k~~~G~~L~i------~P~~LvVp~~le~~A~~ll~~~~~~~g-~~---N~~~~~------------~ 288 (297) ...+.++++.||-+|++|-. .|..|. .+-++-+..++.+ .+ -.+.|. + T Consensus 181 --~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~l~--------G~PV~~~~~v~~~~~~~~~~~~~GDfs~~~~~~~~~~~ 250 (298) T protein:vir:16 181 --PSFRSALAKQKDLQDNALFPELKWGATPDTIN--------GLPVDVNKTVSDMSLTQRDRAIIGDFANGFKWGYAKEV 250 (298) T ss_pred --HHHHHHHHHhhccCCCeeecCcccCCCCceec--------ceeeEEecccccccCCCccEEEEeeccceEEEEEecCc Confidence 33456678888999987631 111111 0111111222221 11 112222 2 Q ss_pred eEEeccccC Q lcl|NC_020866. 
289 ELLVVPWLA 297 (297) Q Consensus 289 ~~iv~p~La 297 (297) ++-+.++=- T Consensus 251 ~~~~~~~~~ 259 (298) T protein:vir:16 251 PLEVIQYGD 259 (298) T ss_pred eEEEeeccC Confidence 222222111 No 12 >protein:vir:2344 Length: 397 # NCBI annotation: gp14 # Family: family:all:507 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075281;genbank:gi:12657868;genbank:GeneID:920118 Probab=94.59 E-value=0.004 Score=33.74 Aligned_cols=222 Identities=14% Similarity=0.177 Sum_probs=107.4 Q ss_pred CCcCHHHHHHHHHH------------HHHHHHHHHhhcchhhcceeeeecCCccceecccccCCCcchhcccce---eee Q lcl|NC_020866. 1 MQVTAANLDALRVG------------FKTSFQGALDQAPSQYLRLTTVVPSSTKEQRYGWMGKIPNVREWIGPR---AIQ 65 (297) Q Consensus 1 M~i~~~~l~~l~~~------------~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~lrew~Ge~---~~~ 65 (297) |=.|+++...+.++ .+..+....+. +.-.+++++++-.....+|.....-|.. .|+||- ... T Consensus 1 ~g~~~e~~~~~~~~t~~~~g~l~~~~~~~ii~~l~~~--s~i~~l~~~~~~~~~~~~ip~~~~~~~a-~wv~Eg~~~~~s 77 (397) T protein:vir:23 1 MGFSADHSQIAQTKDTMFTGYLDPVQAKDYFAEAEKT--SIVQRVAQKIPMGATGIVIPHWTGDVSA-QWIGEGDMKPIT 77 (397) T ss_pred CCcCHHHHHHhhccCCCCccccchhHHHHHHHHHHhc--cchhhhcceeeccCCceEEEEEcCCcce-EEecCCcccccc Confidence 77777665555442 23334433322 3345667777754444556655555553 787653 333 Q ss_pred eeccccceeeeecccceeeccHHHhhccCcchhHHHHHHHHHHHHhhHHHHHHHHHhcccCccccCcccccccccccccc Q lcl|NC_020866. 66 NLTESDYSIREKPWELTIGVDRDDIETDNLGIYSPLFQEMGRSAGSKWDMLVFELLKLGFATECYDGQNFFDTDHPVLDE 145 (297) Q Consensus 66 ~l~~~~~~i~n~~fe~tv~v~R~~i~dD~lG~~~~~~~~~g~aaa~~~~~lv~~lL~~G~~~~~~DGk~~F~tdH~~~~g 145 (297) +..=..-++..++++..+.|+++.++|...++..-+.+.++++.++..++.++. |.++ ++++..-.. . 
T Consensus 78 ~~~f~~v~l~~~k~~~~v~iS~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~----G~gt----~~~~~~~~~----~ 145 (397) T protein:vir:23 78 KGNMTKRDVHPAKIATIFVASAETVRANPANYLGTMRTKVATAIAMAFDNAALH----GTNA----PSAFQGYLD----Q 145 (397) T ss_pred ccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhh----cccC----Ccccccccc----c Confidence 333344667889999999999999999999999999999999999999986652 3222 222221111 0 Q ss_pred cccceeeccccccchhHHHHHHHHHHHHHHHhhcccccchhhhccccccccccccceeeeccccccccccccchhhcCCc Q lcl|NC_020866. 146 DGKTVTVSNTGGGTGTPWFLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLYGTDARANVGFGFWQMAYGSKQ 225 (297) Q Consensus 146 ~~~~~svsn~~ag~~~awylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~g~d~r~~~G~~l~q~a~~~~~ 225 (297) ... +...++. .++..+ -..+..+....+ .+.+ |-+ T Consensus 146 ~~~----~~~~~~~--~~~~~~-~~~~~~l~~~~~-----------------------------~~a~---~vm------ 180 (397) T protein:vir:23 146 SNK----TQSISPN--AYQGLG-VSGLTKLVTDGK-----------------------------KWTH---TLL------ 180 (397) T ss_pred ccc----eeeeccc--chhHHH-HHHHHhhhhccc-----------------------------CCCE---EEE------ Confidence 000 0111111 111111 111111111000 0000 111 Q ss_pred cCCHHHHHHHHHHHHhhccCCCccccccc-----------CeEE-ecchHHHHHHHHHhhhccCCCCcceecce------ Q lcl|NC_020866. 226 TLDGTAYAAARAALSGMKGDYGRPLGLMP-----------NLLV-VPPALEEAGRKILNSENASGGETNPWKGT------ 287 (297) Q Consensus 226 ~l~~~~l~aar~aM~~~k~~~G~~L~i~P-----------~~Lv-Vp~~le~~A~~ll~~~~~~~g~~N~~~~~------ 287 (297) + ...+.++++.||.+|++|-... ..|+ +| ++-++..+.|.+-.+.+. T Consensus 181 --n----~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~~~~~tl~G~P---------v~~s~~~~~g~~~~~~gDfs~~~i 245 (397) T protein:vir:23 181 --D----DTVEPVLNGSVDANGRPLFVESTYESLTTPFREGRILGRP---------TILSDHVAEGDVVGYAGDFSQIIW 245 (397) T ss_pred --c----HHHHHHHHHhhccCCceeecccccccccccccCceeeeee---------EEEeCCCCCCceEEEEeecceEEE Confidence 1 2345677788888888763211 0111 11 111122333332211111 Q ss_pred -----eeEEe--ccccC Q lcl|NC_020866. 288 -----AELLV--VPWLA 297 (297) Q Consensus 288 -----~~~iv--~p~La 297 (297) +++-+ +..+. 
T Consensus 246 ~~~~~i~i~~~~e~~~~ 262 (397) T protein:vir:23 246 GQVGGLSFDVTDQATLN 262 (397) T ss_pred EEEeceEEEEeeeeeee Confidence 11111 11111 No 13 >protein:vir:79548 Length: 652 # NCBI annotation: putative protease/scaffold protein # Family: family:all:62 # ACLAME annotation(s): go:0008236 - serine-type peptidase activity; phi:0000017 - phage prohead/capsid assembly # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272518;genbank:gi:148609387;genbank:GeneID:5204384 Probab=94.21 E-value=1.8e-06 Score=52.05 Aligned_cols=237 Identities=10% Similarity=0.064 Sum_probs=93.2 Q ss_pred CCcCHHH-------HHHHHHHHHHHH-HHHHhhcchhhcc--eeeeecCCccceecccccCCCcchhcccceeeeeeccc Q lcl|NC_020866. 1 MQVTAAN-------LDALRVGFKTSF-QGALDQAPSQYLR--LTTVVPSSTKEQRYGWMGKIPNVREWIGPRAIQNLTES 70 (297) Q Consensus 1 M~i~~~~-------l~~l~~~~~~~f-~~a~~~a~~~~~~--~a~~v~S~~~~~~y~~Lg~~P~lrew~Ge~~~~~l~~~ 70 (297) +..++.. |..|- |..+ ..|.. ...+.+ +..+ -+ .+ =-+|||.+..=+..+ .| -. T Consensus 319 ~~~~~~~~~~~g~~L~elA---r~~L~~~G~~--~~~~~~~~~v~~---A~-~h---sTsDFp~IL~~~~nk---~l-~~ 382 (652) T protein:vir:79 319 FEKTERDNVYNGMTLREYA---RMSLTERGIG--VSSYNPMQMVGA---AF-TH---STSDFGNILLDVANK---AI-LQ 382 (652) T ss_pred CcccccCccccCccHHHHH---HHHHHhhccC--CCCCCHHHHHHH---Hh-hc---CcchHHHHHHHHHHH---HH-HH Confidence 1111000 00000 0000 00110 111110 0000 00 00 025666542221111 11 11 Q ss_pred cceeeeecccceeeccHHHhhccCcchhHHHHHHHHHHHHhhHHHHHHHHHhcccCccccCcccccccccccc-cccccc Q lcl|NC_020866. 71 DYSIREKPWELTIGVDRDDIETDNLGIYSPLFQEMGRSAGSKWDMLVFELLKLGFATECYDGQNFFDTDHPVL-DEDGKT 149 (297) Q Consensus 71 ~~~i~n~~fe~tv~v~R~~i~dD~lG~~~~~~~~~g~aaa~~~~~lv~~lL~~G~~~~~~DGk~~F~tdH~~~-~g~~~~ 149 (297) +|.-.-.+|.+-.. +..+-| |++.-. +. .++|... -+|. .|+++. 
T Consensus 383 ~y~~a~~t~~~~~~--~~~~~D-----Fk~~~~--------------~~--lg~~~~L-----------~~V~E~gEyk~ 428 (652) T protein:vir:79 383 GWEDAPETYEQWTR--KGQLSD-----FKIAHR--------------VG--MGGFSAL-----------RQVREGAEYKY 428 (652) T ss_pred HHhhhHHHHHHHhc--cCCCcc-----ccccce--------------ee--cCCCCCc-----------cccCCCCccce Confidence 23222333322221 011111 221100 00 0111110 1122 367777 Q ss_pred eeeccccccchhHHHHHHHHHHHHHHHhhcccccchhhhc--cc----cccccccccceeeecccccc-cccccc-chhh Q lcl|NC_020866. 150 VTVSNTGGGTGTPWFLLDTTRALKPIILQKRRDFQFVSKT--KL----DDDHVFMNKEFLYGTDARAN-VGFGFW-QMAY 221 (297) Q Consensus 150 ~svsn~~ag~~~awylld~s~alkpiI~q~r~~~~~~~~~--~~----~~~~vf~~~~~~~g~d~r~~-~G~~l~-q~a~ 221 (297) .+++.-.......+|+..+++++|.|||+|...++-++.. +. -++.|| -++-.++.+. ||+.|| |++| T Consensus 429 ~t~~e~~e~~~l~tyG~~~~iTRqaiINDDL~a~~~ip~~~g~aA~~~~~~~vy----~~l~~Np~~~~DGk~LF~hA~H 504 (652) T protein:vir:79 429 VTTGDKQATIALATYGELFSITRQAIINDDLNMLTDVPMKLGRAAKSTIADLVY----AILTSNPKISTDNVSLFDKAKH 504 (652) T ss_pred eeecCccceeeeecccCeeeeehheeeccchhHHHHHHHHHHHHHHHHHHHHHH----HHHhcCcccccCCceeeccccc Confidence 7777777778889999999999999999999998876532 21 123332 1122456665 999999 9999 Q ss_pred cCCccCCHHHHHHHHHHHHhhccCCCcccccccCeEEecchH-HHHHHHHHhhhccCCCCcc----eecceeeEEecccc Q lcl|NC_020866. 222 GSKQTLDGTAYAAARAALSGMKGDYGRPLGLMPNLLVVPPAL-EEAGRKILNSENASGGETN----PWKGTAELLVVPWL 296 (297) Q Consensus 222 ~~~~~l~~~~l~aar~aM~~~k~~~G~~L~i~P~~LvVp~~l-e~~A~~ll~~~~~~~g~~N----~~~~~~~~iv~p~L 296 (297) +|..+-..=++. +..+++......-. + . ..|-+.|.. -.-...-..++++-...+. ...+.+.++-. .+ T Consensus 505 ~Nl~~~aa~~~~-~l~~ar~aM~~Qk~--g-~-~~l~i~P~~llvp~~le~~a~~ll~s~~v~~a~~~~~~~Np~~~-~~ 578 (652) T protein:vir:79 505 ANVLESAAMDVA-SLDKARQLMRVQKE--G-E-RHLNIRPAFVLVPTAMESVANQVIRSSSVKGADINAGIINPVKD-FA 578 (652) T ss_pred ccccccccCCHH-HHHHHHHHHHHhcc--C-C-ccccccccEEEecchhHHHHHHHhccCCCccccccccccccccc-cc Confidence 999864322222 34444444333321 1 1 123333332 2222211222222222221 22233332211 11 Q ss_pred C Q lcl|NC_020866. 
297 A 297 (297) Q Consensus 297 a 297 (297) - T Consensus 579 ~ 579 (652) T protein:vir:79 579 T 579 (652) T ss_pred c Confidence 1 No 14 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=93.51 E-value=0.0073 Score=32.29 Aligned_cols=191 Identities=13% Similarity=0.038 Sum_probs=101.3 Q ss_pred CCcCHHHHHHHHHH--HHHHHHHHHhhcchhhcceeeee----cCCccceecccccCCCcchhcc---cceeeeeecccc Q lcl|NC_020866. 1 MQVTAANLDALRVG--FKTSFQGALDQAPSQYLRLTTVV----PSSTKEQRYGWMGKIPNVREWI---GPRAIQNLTESD 71 (297) Q Consensus 1 M~i~~~~l~~l~~~--~~~~f~~a~~~a~~~~~~~a~~v----~S~~~~~~y~~Lg~~P~lrew~---Ge~~~~~l~~~~ 71 (297) |..+...+.++.+- +...+.+.+.+. .-+.+++.+- ......-++..+...+.. +|+ ++....++.... T Consensus 1 MA~~~T~~~~~~iPev~s~~v~~~~~~~-~~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~a-~~v~eg~~i~~~~~~~~~ 78 (272) T protein:vir:98 1 MAVGTTKMAQMLDPEVLADMIDAEVGKA-IRFAPLAEVDTTLEGQPGTTLTVPKWDYIGDA-EDVAEGEAIPMTQLGFKK 78 (272) T ss_pred CCCccccchheechHHHHHHHHHHHHHH-hhhhccccccccccCCCCCEEEEEEecCCCCc-ccccCCCcccccccccce Confidence 88544333333321 111222222111 1233333221 111111222222233332 354 345566677777 Q ss_pred ceeeeecccceeeccHHHhhccCcchhHHHHHHHHHHHHhhHHHHHHHHHhcccCccccCccccccccccccccccccee Q lcl|NC_020866. 72 YSIREKPWELTIGVDRDDIETDNLGIYSPLFQEMGRSAGSKWDMLVFELLKLGFATECYDGQNFFDTDHPVLDEDGKTVT 151 (297) Q Consensus 72 ~~i~n~~fe~tv~v~R~~i~dD~lG~~~~~~~~~g~aaa~~~~~lv~~lL~~G~~~~~~DGk~~F~tdH~~~~g~~~~~s 151 (297) -+++.++++..+.|+++++.+....+...+.+.++++.++..+..+++.+....+. 
T Consensus 79 ~~~~~~~~~~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a~~~------------------------ 134 (272) T protein:vir:98 79 TTMTIKKAGKGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSKSTQT------------------------ 134 (272) T ss_pred EEEEeeeeeeeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccc------------------------ Confidence 78888999999999999999988889999999999999999888877765331000 Q ss_pred eccccccchhHHHHHHHHHHHHHHHhhcccccchhhhccccccccccccceeeeccccccccccccchhhcCCccCCHHH Q lcl|NC_020866. 152 VSNTGGGTGTPWFLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLYGTDARANVGFGFWQMAYGSKQTLDGTA 231 (297) Q Consensus 152 vsn~~ag~~~awylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~g~d~r~~~G~~l~q~a~~~~~~l~~~~ 231 (297) .+.+.+.+. T Consensus 135 -----------------------------------------------------------------------~~~~~t~d~ 143 (272) T protein:vir:98 135 -----------------------------------------------------------------------VEATATVDG 143 (272) T ss_pred -----------------------------------------------------------------------cccccCHHH Confidence 001122345 Q ss_pred HHHHHHHHHhhccCCCcccccccCeEEecchHHHHHHHH--HhhhccCCCCcc--------eecceeeEEeccccC Q lcl|NC_020866. 232 YAAARAALSGMKGDYGRPLGLMPNLLVVPPALEEAGRKI--LNSENASGGETN--------PWKGTAELLVVPWLA 297 (297) Q Consensus 232 l~aar~aM~~~k~~~G~~L~i~P~~LvVp~~le~~A~~l--l~~~~~~~g~~N--------~~~~~~~~iv~p~La 297 (297) +.+|+..+... + ..++++||+|.....-++. 
...........+ -+.| +.||+++.+- T Consensus 144 i~da~~~l~~~----~----~~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~g~ig~i~G-~~Vi~s~~~p 210 (272) T protein:vir:98 144 VSKALDIFNDE----D----DAETVIVMNPADASTLRLDAAKEWLGATEVGANRVVSGVYGEVLG-VQIVRSRKCP 210 (272) T ss_pred HHHHHHHHhcc----C----CCccEEEEcHHHHHHHHHhccccccccccccccccccccchhhcC-eeEEEcCCCC Confidence 55555554322 2 3357899999765444332 111111111111 1234 5888888887 No 15 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=93.51 E-value=0.0073 Score=32.29 Aligned_cols=191 Identities=13% Similarity=0.038 Sum_probs=101.3 Q ss_pred CCcCHHHHHHHHHH--HHHHHHHHHhhcchhhcceeeee----cCCccceecccccCCCcchhcc---cceeeeeecccc Q lcl|NC_020866. 1 MQVTAANLDALRVG--FKTSFQGALDQAPSQYLRLTTVV----PSSTKEQRYGWMGKIPNVREWI---GPRAIQNLTESD 71 (297) Q Consensus 1 M~i~~~~l~~l~~~--~~~~f~~a~~~a~~~~~~~a~~v----~S~~~~~~y~~Lg~~P~lrew~---Ge~~~~~l~~~~ 71 (297) |..+...+.++.+- +...+.+.+.+. .-+.+++.+- ......-++..+...+.. +|+ ++....++.... T Consensus 1 MA~~~T~~~~~~iPev~s~~v~~~~~~~-~~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~a-~~v~eg~~i~~~~~~~~~ 78 (272) T protein:vir:30 1 MAVGTTKMAQMLDPEVLADMIDAEVGKA-IRFAPLAEVDTTLEGQPGTTLTVPKWDYIGDA-EDVAEGEAIPMTQLGFKK 78 (272) T ss_pred CCCccccchheechHHHHHHHHHHHHHH-hhhhccccccccccCCCCCEEEEEEecCCCCc-ccccCCCcccccccccce Confidence 88544333333321 111222222111 1233333221 111111222222233332 354 345566677777 Q ss_pred ceeeeecccceeeccHHHhhccCcchhHHHHHHHHHHHHhhHHHHHHHHHhcccCccccCccccccccccccccccccee Q lcl|NC_020866. 72 YSIREKPWELTIGVDRDDIETDNLGIYSPLFQEMGRSAGSKWDMLVFELLKLGFATECYDGQNFFDTDHPVLDEDGKTVT 151 (297) Q Consensus 72 ~~i~n~~fe~tv~v~R~~i~dD~lG~~~~~~~~~g~aaa~~~~~lv~~lL~~G~~~~~~DGk~~F~tdH~~~~g~~~~~s 151 (297) -+++.++++..+.|+++++.+....+...+.+.++++.++..+..+++.+....+. 
T Consensus 79 ~~~~~~~~~~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a~~~------------------------ 134 (272) T protein:vir:30 79 TTMTIKKAGKGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSKSTQT------------------------ 134 (272) T ss_pred EEEEeeeeeeeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccc------------------------ Confidence 78888999999999999999988889999999999999999888877765331000 Q ss_pred eccccccchhHHHHHHHHHHHHHHHhhcccccchhhhccccccccccccceeeeccccccccccccchhhcCCccCCHHH Q lcl|NC_020866. 152 VSNTGGGTGTPWFLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLYGTDARANVGFGFWQMAYGSKQTLDGTA 231 (297) Q Consensus 152 vsn~~ag~~~awylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~g~d~r~~~G~~l~q~a~~~~~~l~~~~ 231 (297) .+.+.+.+. T Consensus 135 -----------------------------------------------------------------------~~~~~t~d~ 143 (272) T protein:vir:30 135 -----------------------------------------------------------------------VEATATVDG 143 (272) T ss_pred -----------------------------------------------------------------------cccccCHHH Confidence 001122345 Q ss_pred HHHHHHHHHhhccCCCcccccccCeEEecchHHHHHHHH--HhhhccCCCCcc--------eecceeeEEeccccC Q lcl|NC_020866. 232 YAAARAALSGMKGDYGRPLGLMPNLLVVPPALEEAGRKI--LNSENASGGETN--------PWKGTAELLVVPWLA 297 (297) Q Consensus 232 l~aar~aM~~~k~~~G~~L~i~P~~LvVp~~le~~A~~l--l~~~~~~~g~~N--------~~~~~~~~iv~p~La 297 (297) +.+|+..+... + ..++++||+|.....-++. 
...........+ -+.| +.||+++.+- T Consensus 144 i~da~~~l~~~----~----~~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~g~ig~i~G-~~Vi~s~~~p 210 (272) T protein:vir:30 144 VSKALDIFNDE----D----DAETVIVMNPADASTLRLDAAKEWLGATEVGANRVVSGVYGEVLG-VQIVRSRKCP 210 (272) T ss_pred HHHHHHHHhcc----C----CCccEEEEcHHHHHHHHHhccccccccccccccccccccchhhcC-eeEEEcCCCC Confidence 55555554322 2 3357899999765444332 111111111111 1234 5888888887 No 16 >protein:vir:94771 Length: 298 # NCBI annotation: major head protein # Family: family:all:966 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996706;genbank:gi:45597421;genbank:GeneID:2769044 Probab=93.26 E-value=0.0082 Score=32.03 Aligned_cols=228 Identities=14% Similarity=0.121 Sum_probs=101.9 Q ss_pred CCcCHHHHHHHHHHHHHHHHHHHhhcchhhcceeeeecCCccceecccccCCCcchhcccceee---eeeccccceeeee Q lcl|NC_020866. 1 MQVTAANLDALRVGFKTSFQGALDQAPSQYLRLTTVVPSSTKEQRYGWMGKIPNVREWIGPRAI---QNLTESDYSIREK 77 (297) Q Consensus 1 M~i~~~~l~~l~~~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~lrew~Ge~~~---~~l~~~~~~i~n~ 77 (297) |.++...| +=.-+-..+-+.+... +...++|+.++-......|..+..-|.- .|++|-.- .+..=..-+++.+ T Consensus 1 ma~~gG~l--ip~~~~~~ii~~~~~~-s~i~~~~~~~~~~~~~~~~p~~~~~~~a-~~v~Eg~~~~~~~~~f~~v~l~~~ 76 (298) T protein:vir:94 1 MVLNKGTL--FDPELVTDLISKVAGK-SSIARLSAQKPIPFNGEKVFTFTMDSEI-DVVAESGKKTHGGVTLAPQTMVPI 76 (298) T ss_pred Ceeccccc--cChhHHHHHHHHHHhh-chhhhhcceeeccCCceEEEEEecCcce-EEeeCCccccccccceeEEEEeee Confidence 88876543 0011122222222222 2356667777655555667777666663 78866432 2333344556778 Q ss_pred cccceeeccHHHhh---ccCcchhHHHHHHHHHHHHhhHHHHHHHHHhcccCccccCccc--ccccccccccccccceee Q lcl|NC_020866. 78 PWELTIGVDRDDIE---TDNLGIYSPLFQEMGRSAGSKWDMLVFELLKLGFATECYDGQN--FFDTDHPVLDEDGKTVTV 152 (297) Q Consensus 78 ~fe~tv~v~R~~i~---dD~lG~~~~~~~~~g~aaa~~~~~lv~~lL~~G~~~~~~DGk~--~F~tdH~~~~g~~~~~sv 152 (297) +++..+.|+|+.+. +|..++..-+.+.++++.++..+..++. |.++. +|++ +..+.+.. 
..+ T Consensus 77 k~~~~~~iS~ell~~~~~~~~~l~~~i~~~la~ai~~~~d~~~l~----G~~~~--~g~~~~~~~~~~~~-------~~~ 143 (298) T protein:vir:94 77 KVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFH----GVNPR--LGTASAVIGTNHFD-------SKV 143 (298) T ss_pred EEEEeeehhHHHhccCCccHHHHHHHHHHHHHHHHHHHHHHHhhc----ccccC--CCcccccccccccc-------ccc Confidence 99999999999984 6667888999999999999998876653 32211 1111 11111100 000 Q ss_pred cccc-ccchhHHHHHHHHHHHHHHHhhcccccchhhhccccccccccccceeeeccccccccccccchhhcCCccCCHHH Q lcl|NC_020866. 153 SNTG-GGTGTPWFLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLYGTDARANVGFGFWQMAYGSKQTLDGTA 231 (297) Q Consensus 153 sn~~-ag~~~awylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~g~d~r~~~G~~l~q~a~~~~~~l~~~~ 231 (297) .+.. .+....-...+....+..+.-.+ +... .|-+ + T Consensus 144 ~~~~~~~~~~~~~~~~i~~~~~~~~~~~-----------------------------~~~~---~~vm--------n--- 180 (298) T protein:vir:94 144 TQKVEAPRGIADPNGAIENAVELLTGVD-----------------------------ADVT---GIAI--------N--- 180 (298) T ss_pred ccccccccccccHHHHHHHHHHhhhhcC-----------------------------CCcc---EEEE--------c--- Confidence 1110 11110001111111111110000 0000 0211 1 Q ss_pred HHHHHHHHHhhccCCCccccc------ccCeEEecchHHHHHHHHHhhhccCCCC-cc---eecce------------ee Q lcl|NC_020866. 232 YAAARAALSGMKGDYGRPLGL------MPNLLVVPPALEEAGRKILNSENASGGE-TN---PWKGT------------AE 289 (297) Q Consensus 232 l~aar~aM~~~k~~~G~~L~i------~P~~LvVp~~le~~A~~ll~~~~~~~g~-~N---~~~~~------------~~ 289 (297) ...+.++++.||-+|++|-. .|..|+ ...++-++.++.+. .+ .+.|. ++ T Consensus 181 -~~~~~~l~~lkd~~G~~l~~~~~~~~~~~tl~--------G~PV~~~~~v~~~~~~~~~~~~~Gdfs~~~~~~~~~~~~ 251 (298) T protein:vir:94 181 -PSFRSALAKQKDLQGNALFPELKWGATPDTIN--------GLPVDVNKTVSDMSLTQRDRAIIGDFANGFKWGYAKEVP 251 (298) T ss_pred -HHHHHHHHHhhccCCCeeecCcccCCCCceec--------ceeeEEecccccccCCCccEEEEeeccceEEEEEecCce Confidence 23456677788888887631 121111 11112222333221 11 12121 22 Q ss_pred EEeccccC Q lcl|NC_020866. 
290 LLVVPWLA 297 (297) Q Consensus 290 ~iv~p~La 297 (297) +-+.++-- T Consensus 252 ~~~~~~~~ 259 (298) T protein:vir:94 252 LEVIQYGD 259 (298) T ss_pred EEEeecCC Confidence 22222211 No 17 >protein:vir:96223 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239571;genbank:gi:66395304;genbank:GeneID:5132771 Probab=89.36 E-value=0.027 Score=29.20 Aligned_cols=226 Identities=13% Similarity=0.100 Sum_probs=97.6 Q ss_pred CCcCHHHHHHHH-------------------------HHHHHHHHHHHhhcchhhcceeeeecCCccceecccccCCCcc Q lcl|NC_020866. 1 MQVTAANLDALR-------------------------VGFKTSFQGALDQAPSQYLRLTTVVPSSTKEQRYGWMGKIPNV 55 (297) Q Consensus 1 M~i~~~~l~~l~-------------------------~~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~l 55 (297) |...+.+++... .-+...+.+... ..+...++++.+|-.+....|..+..-|.- T Consensus 4 ~~~~~~~~~~f~~~~~~~~~~~a~~~~~~~~~~~lip~~~~~~ii~~~~-~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a 82 (324) T protein:vir:96 4 TQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVM-ENSKIMQLGKYEPMEGTEKKFTFWADKPGA 82 (324) T ss_pred chhhhHHHHHHHHhhhhhhhcccccccccCCCcceechhHHHHHHHHHH-hhchhhhhcceeeccCCceEEEEEecCcce Confidence 111111111100 011111111111 112245566777765566677777656654 Q ss_pred hhcccce---eeeeeccccceeeeecccceeeccHHHhhccCcchhHHHHHHHHHHHHhhHHHHHHHHHhcccCccccCc Q lcl|NC_020866. 56 REWIGPR---AIQNLTESDYSIREKPWELTIGVDRDDIETDNLGIYSPLFQEMGRSAGSKWDMLVFELLKLGFATECYDG 132 (297) Q Consensus 56 rew~Ge~---~~~~l~~~~~~i~n~~fe~tv~v~R~~i~dD~lG~~~~~~~~~g~aaa~~~~~lv~~lL~~G~~~~~~DG 132 (297) .|+||- ...++.-..-++..++++..+.|+|+.+.|.+..+..-+.+.++++.++..++.++. |-. 
.-..+ T Consensus 83 -~~v~Eg~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~l~~aia~~~d~~~l~----G~g-~~~~~ 156 (324) T protein:vir:96 83 -YWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGIL----NQG-NNPFG 156 (324) T ss_pred -eeecCCccccccccceeEEEEEeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhh----cCC-CCCcC Confidence 788654 333344455667889999999999999999889999999999999999999886552 211 10111 Q ss_pred ccccccccccccccccceeeccccccchhHHHHHHHHHHHHHHHhhcccccchhhhccccccccccccceeeeccccccc Q lcl|NC_020866. 133 QNFFDTDHPVLDEDGKTVTVSNTGGGTGTPWFLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLYGTDARANV 212 (297) Q Consensus 133 k~~F~tdH~~~~g~~~~~svsn~~ag~~~awylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~g~d~r~~~ 212 (297) ..++++-+ ..+....... +| ..+.-++..... ..+..+ T Consensus 157 ~~~~~~~~-----------~~~~~~~~~~-~~-----~~i~~~~~~i~~-------------------------~~~~~~ 194 (324) T protein:vir:96 157 KSIAQSIK-----------KTNKVIKGDF-TQ-----DNIIDLEALLED-------------------------DELEAN 194 (324) T ss_pred cccccccc-----------ccceeccccc-ch-----HHHHHHHHhhhh-------------------------ccCCCC Confidence 11111111 0011000000 11 011111110000 000000 Q ss_pred cccccchhhcCCccCCHHHHHHHHHHHHhhccCCCccccc--ccC-eEEecchHHHHHHHHHhhhccCC-----CC-cce Q lcl|NC_020866. 213 GFGFWQMAYGSKQTLDGTAYAAARAALSGMKGDYGRPLGL--MPN-LLVVPPALEEAGRKILNSENASG-----GE-TNP 283 (297) Q Consensus 213 G~~l~q~a~~~~~~l~~~~l~aar~aM~~~k~~~G~~L~i--~P~-~LvVp~~le~~A~~ll~~~~~~~-----g~-~N~ 283 (297) + |-+ + ...+.++++.||.+|+++-. .|. ++=+|.-. ..+...+. |+ .+. T Consensus 195 ~---~i~--------n----~~~~~~L~~lkd~~G~~~~~~~~~~~l~G~PV~~-------~~~~~~~~~~~~~gd~s~~ 252 (324) T protein:vir:96 195 A---FIS--------K----TQNRSLLRKIVDPETKERIYDRNSDSLDGLPVVN-------LKSSNLKRGELITGDFDKL 252 (324) T ss_pred E---EEE--------c----HHHHHHHHHhhCCCCCeeecCCCCCcccceeeEe-------ecCCCCCcceEEEEecceE Confidence 0 111 1 23355677789999987532 222 22122100 00001111 11 111 Q ss_pred e---cceeeEEeccc--cC Q lcl|NC_020866. 
284 W---KGTAELLVVPW--LA 297 (297) Q Consensus 284 ~---~~~~~~iv~p~--La 297 (297) + ++-+++-++.. +. T Consensus 253 ~~~~~~~~~i~~~~~~~~~ 271 (324) T protein:vir:96 253 IYGIPQLIEYKIDETAQLS 271 (324) T ss_pred EEEEecCcEEEEeeccccc Confidence 1 11122222222 11 No 18 >protein:vir:104085 Length: 320 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655596;genbank:gi:109392467;genbank:GeneID:4156953 Probab=88.36 E-value=0.023 Score=29.60 Aligned_cols=229 Identities=16% Similarity=0.198 Sum_probs=99.4 Q ss_pred CCcCHHHHHH---------HHHHHHHHHHHHHhhcchhhcceeeeecCCccceecccccCCCcchhcccce---eeeeec Q lcl|NC_020866. 1 MQVTAANLDA---------LRVGFKTSFQGALDQAPSQYLRLTTVVPSSTKEQRYGWMGKIPNVREWIGPR---AIQNLT 68 (297) Q Consensus 1 M~i~~~~l~~---------l~~~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~lrew~Ge~---~~~~l~ 68 (297) |......+.. |-.-+...+.+..... +.-.+++++++-.....+|..+..-|.. .|++|- ...+.. T Consensus 7 ~~~~~~~~~~t~~~~~~~~ip~~~~~~ii~~~~~~-s~l~~~~~~~~~~~~~~~~p~~~~~~~a-~~v~E~~~~~~~~~~ 84 (320) T protein:vir:10 7 FQVDHAQIAQTGDTMFKGYLEPEQAKDYFAEAEKT-SIVQQFAQKVPMGTTGQKIPHWIGDVSA-QWIGEGDMKPITKGN 84 (320) T ss_pred CCHHHHHhhccccccccccccHHHHHHHHHHHHhc-cchhhhcceeeccCCceEEEEEeCCcce-EEecCCccccccccc Confidence 2222221111 1111222222222222 2355667777755555566666666665 687553 333333 Q ss_pred cccceeeeecccceeeccHHHhhccCcchhHHHHHHHHHHHHhhHHHHHHHHHhcccCccccCccccccccccccccccc Q lcl|NC_020866. 69 ESDYSIREKPWELTIGVDRDDIETDNLGIYSPLFQEMGRSAGSKWDMLVFELLKLGFATECYDGQNFFDTDHPVLDEDGK 148 (297) Q Consensus 69 ~~~~~i~n~~fe~tv~v~R~~i~dD~lG~~~~~~~~~g~aaa~~~~~lv~~lL~~G~~~~~~DGk~~F~tdH~~~~g~~~ 148 (297) =..-+++.++++..+.|+|+.+.|-...+..-+.+.|+++.++..++.++. |.... 
.+..+-...+.+ T Consensus 85 f~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~l~~a~a~~~d~a~l~----G~g~~--~~~~~~~~~~~~------ 152 (320) T protein:vir:10 85 MTSQNIAPHKIATIFVASAETVRANPANYLGTMRTKVATAFAMAFDSAALN----GTDSP--FPTYLAQTTKSV------ 152 (320) T ss_pred eeEEEEeeEEEEEeehhhHHHHhcChHHHHHHHHHHHHHHHHHHHHHHhhc----ccCCC--CCcccccccccc------ Confidence 344567889999999999999999899999999999999999999887532 21110 001111111100 Q ss_pred ceeeccccccch-hHHHHHHHHHHHHHHHhhcccccchhhhccccccccccccceeeeccccccccccccchhhcCCccC Q lcl|NC_020866. 149 TVTVSNTGGGTG-TPWFLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLYGTDARANVGFGFWQMAYGSKQTL 227 (297) Q Consensus 149 ~~svsn~~ag~~-~awylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~g~d~r~~~G~~l~q~a~~~~~~l 227 (297) .+......+. ..+...+.-......+.... .+.. .|- + T Consensus 153 --~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----------~~~~--------------------~~v--------~ 191 (320) T protein:vir:10 153 --SLADPGGATASDLTAYDAVAVNGLSLLVNAK-----------KKWT--------------------HTL--------L 191 (320) T ss_pred --cceecccccccccccHHHHHHHHHhhhhccc-----------CCCc--------------------EEE--------E Confidence 0000000000 00110000000000000000 0000 010 0 Q ss_pred CHHHHHHHHHHHHhhccCCCccccccc------C------eEEecchHHHHHHHHHhhhccCCCCc--------cee--- Q lcl|NC_020866. 228 DGTAYAAARAALSGMKGDYGRPLGLMP------N------LLVVPPALEEAGRKILNSENASGGET--------NPW--- 284 (297) Q Consensus 228 ~~~~l~aar~aM~~~k~~~G~~L~i~P------~------~LvVp~~le~~A~~ll~~~~~~~g~~--------N~~--- 284 (297) + ...+.++++.|+-+|++|.... . ++-+|. +-++..+.+.+ +-+ T Consensus 192 n----~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~~~~~~i~g~pv---------~~~~~~~~~~~~~~~gd~~~~~~~~ 258 (320) T protein:vir:10 192 D----DIVEPILNGAKDKNGRPLFIESTYTDENSPFRAGRIVSRPT---------ILSDHVADGTTVGYMGDFRNVIWGQ 258 (320) T ss_pred c----HHHHHHHHHhhccCCceeeccccccCccccccCceeeeeee---------EecCCCCCCceEEEEeecceEEEEE Confidence 1 2345666778888888764321 1 111111 11122222221 121 Q ss_pred cceeeEEeccccC Q lcl|NC_020866. 285 KGTAELLVVPWLA 297 (297) Q Consensus 285 ~~~~~~iv~p~La 297 (297) ++-+++.++.... 
T Consensus 259 ~~~~~i~~~~~~~ 271 (320) T protein:vir:10 259 VGGLSFDVTDQAT 271 (320) T ss_pred ecCeEEEEeecce Confidence 2234444444333 No 19 >protein:vir:99920 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655524;genbank:gi:109392294;genbank:GeneID:4157089 Probab=88.22 E-value=0.034 Score=28.66 Aligned_cols=233 Identities=13% Similarity=0.054 Sum_probs=102.8 Q ss_pred CCcCHHHHH-HHHHHHHHHHHHHHhhcchhhcceeeeecCCccceecccccCCCcchhccccee---eeeeccccceeee Q lcl|NC_020866. 1 MQVTAANLD-ALRVGFKTSFQGALDQAPSQYLRLTTVVPSSTKEQRYGWMGKIPNVREWIGPRA---IQNLTESDYSIRE 76 (297) Q Consensus 1 M~i~~~~l~-~l~~~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~lrew~Ge~~---~~~l~~~~~~i~n 76 (297) |.-+...-. .+=.-+...+-+...+. +...++++++|......+|.++..-|.. .|+||-. ..+..=..-++.. T Consensus 1 Mat~tt~~g~~vP~~~~~~ii~~~~~~-s~l~~~~~~i~~~~~~~~~p~~~~~~~a-~wv~Eg~~~~~~~~~f~~v~l~~ 78 (311) T protein:vir:99 1 MATFGTGNLKNLPRNIADGMVKDVVQG-STVAVLSARKPQRFGNEDIITFNGRPKA-EFVGEGQQKSSTTGEFDFVTSTP 78 (311) T ss_pred CceecCCCceeccHHHHHHHHHHHHhh-chhhhhcceeeccCCceEEEEEeCCcee-EEeecCcccccccceeeEEEEee Confidence 664322110 00111112222222222 3367778888876666778787666764 7886542 2333334466778 Q ss_pred ecccceeeccHHHh---hccCcchhHHHHHHHHHHHHhhHHHHHHHHHhcccCccccCcccccccccccccccccceeec Q lcl|NC_020866. 77 KPWELTIGVDRDDI---ETDNLGIYSPLFQEMGRSAGSKWDMLVFELLKLGFATECYDGQNFFDTDHPVLDEDGKTVTVS 153 (297) Q Consensus 77 ~~fe~tv~v~R~~i---~dD~lG~~~~~~~~~g~aaa~~~~~lv~~lL~~G~~~~~~DGk~~F~tdH~~~~g~~~~~svs 153 (297) ++++..+.|+++.+ .|+...+..-+...|+++.++..++.++. |.++ ..|..+-...+....++. .+. 
T Consensus 79 ~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~~~l~----G~g~--~~g~~~~g~~~~~~~~~~---~~~ 149 (311) T protein:vir:99 79 KKAQVTMRFNEEVQWADEDYQLGVLQTLSEAGAEALARALDLGLYH----RINP--LTGTVIPGWSNYLGAASK---RVE 149 (311) T ss_pred EEEEEeehhhHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHhhc----ccCc--ccCccccccccccccccc---eee Confidence 89999999999998 46678899999999999999999976664 2211 112222211111111100 000 Q ss_pred cccccchhHHHHHHHHHHHHHHHhhcccccchhhhccccccccccccceeeeccccccccccccchhhcCCccCCHHHHH Q lcl|NC_020866. 154 NTGGGTGTPWFLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLYGTDARANVGFGFWQMAYGSKQTLDGTAYA 233 (297) Q Consensus 154 n~~ag~~~awylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~g~d~r~~~G~~l~q~a~~~~~~l~~~~l~ 233 (297) ....+...+..+....+.-+.- ....+ ..+ | |- ++ . T Consensus 150 --~~~~~~~~~~~~i~~~~~~~~~-~~~~~--------------~~~------------~---~v--------mn----~ 185 (311) T protein:vir:99 150 --LTADTIANPDLAIEAAVGLLVA-NGHPT--------------PVN------------G---LA--------LH----P 185 (311) T ss_pred --ccccccchhHHHHHHHHHHHhh-hccCC--------------Ccc------------E---EE--------Ec----H Confidence 0111111122222222211100 00000 000 0 11 11 2 Q ss_pred HHHHHHHhhccCCCcccc------cccCeEE-ecchHHHHHHHHHhhhccCCC--------------Ccceecce----e Q lcl|NC_020866. 234 AARAALSGMKGDYGRPLG------LMPNLLV-VPPALEEAGRKILNSENASGG--------------ETNPWKGT----A 288 (297) Q Consensus 234 aar~aM~~~k~~~G~~L~------i~P~~Lv-Vp~~le~~A~~ll~~~~~~~g--------------~~N~~~~~----~ 288 (297) ....++++.||.+|+||- -.|..|. .| +.-++.++++ ..-.+.|. + T Consensus 186 ~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~P---------v~~s~~i~~~~~~~~~~~~~~~~~~~~~~~Gdf~~~~ 256 (311) T protein:vir:99 186 SIAWGLSTARYTDGRKKFPELGLGIGVSSFEGID---------ASVSDTVNGGDEADPDDEDLDAARAVRGIVGDFANGI 256 (311) T ss_pred HHHHHHHhhhccCCCeeecCcccCCCCceeccee---------eEeecccccccccccccchhhccCcceEEEeeccccE Confidence 345667888999998762 1111111 11 1111111110 01111121 1 Q ss_pred eEEeccccC Q lcl|NC_020866. 289 ELLVVPWLA 297 (297) Q Consensus 289 ~~iv~p~La 297 (297) .+-+.-.+. 
T Consensus 257 ~~~~~~~~~ 265 (311) T protein:vir:99 257 HWGVQRDIP 265 (311) T ss_pred EEEEecCce Confidence 111222222 No 20 >protein:vir:105334 Length: 276 # NCBI annotation: putative phage major capsid protein # Family: family:all:522 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950669;genbank:gi:119967839;genbank:GeneID:4643213 Probab=86.59 E-value=0.045 Score=27.99 Aligned_cols=186 Identities=12% Similarity=0.065 Sum_probs=107.4 Q ss_pred CC---------cCHHHHHHHHHHHHHHHHHHHhhcchhhcceeeeecC----CccceecccccCCCcchhcc--cceeee Q lcl|NC_020866. 1 MQ---------VTAANLDALRVGFKTSFQGALDQAPSQYLRLTTVVPS----STKEQRYGWMGKIPNVREWI--GPRAIQ 65 (297) Q Consensus 1 M~---------i~~~~l~~l~~~~~~~f~~a~~~a~~~~~~~a~~v~S----~~~~~~y~~Lg~~P~lrew~--Ge~~~~ 65 (297) |. |.|+.+... +++.+.++ .-+.+++....+ ....-+.......+..+++. .+.... T Consensus 1 Ma~~~T~l~d~i~Pev~~~~-------v~~~~~~~-~~~~~~~~~~~~l~g~~G~ti~iP~~~~igda~~~~eg~~i~~~ 72 (276) T protein:vir:10 1 MAQGTTTKSTQIVPEVLAPM-------MQAELDKK-LRFAQFADIDSTLVGQPGDTLTFPAFVYSGDATVVPEGQKIPVD 72 (276) T ss_pred CCcceeehhhhhchHHHHHH-------HHHHHHhh-hhhcccceecccccCCCCCEEEeeeecCCCccccccCCCccCcc Confidence 77 444444332 22223222 223444433221 11112222222223333332 456677 Q ss_pred eeccccceeeeecccceeeccHHHhhccCcchhHHHHHHHHHHHHhhHHHHHHHHHhcccCccccCcccccccccccccc Q lcl|NC_020866. 66 NLTESDYSIREKPWELTIGVDRDDIETDNLGIYSPLFQEMGRSAGSKWDMLVFELLKLGFATECYDGQNFFDTDHPVLDE 145 (297) Q Consensus 66 ~l~~~~~~i~n~~fe~tv~v~R~~i~dD~lG~~~~~~~~~g~aaa~~~~~lv~~lL~~G~~~~~~DGk~~F~tdH~~~~g 145 (297) .+....-+.+.+++++.+.++..+..---...+....+++|.+.++.-|.-++..|..+... 
T Consensus 73 ~lt~~~~~a~i~~~~k~~~~tD~a~~~~~~dp~~~~~~~~~~~~a~~~d~~~~~~l~~~~~~------------------ 134 (276) T protein:vir:10 73 KIETNRREAKIHKIGKGTDITDEALLSGYGDPQGEAVRQHGLAIANKVDNDVLEALRGTKLT------------------ 134 (276) T ss_pred ccccceeeEEeehccccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhccccc------------------ Confidence 78777777888999999999999988866667888889999998888888777766541100 Q ss_pred cccceeeccccccchhHHHHHHHHHHHHHHHhhcccccchhhhccccccccccccceeeeccccccccccccchhhcCCc Q lcl|NC_020866. 146 DGKTVTVSNTGGGTGTPWFLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLYGTDARANVGFGFWQMAYGSKQ 225 (297) Q Consensus 146 ~~~~~svsn~~ag~~~awylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~g~d~r~~~G~~l~q~a~~~~~ 225 (297) .+.. T Consensus 135 ----------------------------------------------------------------------------~~~~ 138 (276) T protein:vir:10 135 ----------------------------------------------------------------------------VSAD 138 (276) T ss_pred ----------------------------------------------------------------------------cccc Confidence 0112 Q ss_pred cCCHHHHHHHHHHHHhhccCCCcccccccCeEEecchHHHHHHHHHhhhcc--CCCCc--------ceecceeeEEeccc Q lcl|NC_020866. 226 TLDGTAYAAARAALSGMKGDYGRPLGLMPNLLVVPPALEEAGRKILNSENA--SGGET--------NPWKGTAELLVVPW 295 (297) Q Consensus 226 ~l~~~~l~aar~aM~~~k~~~G~~L~i~P~~LvVp~~le~~A~~ll~~~~~--~~g~~--------N~~~~~~~~iv~p~ 295 (297) +++.+.+.+|++.|.... ..++.|+|+|+....-++....+.. ..+.. .-+.| ++||+++. T Consensus 139 ~~t~d~i~~A~~~lgd~~--------~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G-~~Vi~s~~ 209 (276) T protein:vir:10 139 IGTLAGLEAAIDTFDDED--------LEPMVLFINPKDAGKLRSSASDNFTRATELGDNIIVKGAFGEALG-AVIVRSKK 209 (276) T ss_pred ccCHHHHHHHHHHhcccc--------CcccEEEEcHHHHHHHHHhccccccccccccccceeccccceecc-eeEEEcCC Confidence 344566677777765432 2457899999988777665322222 11222 23444 69999998 Q ss_pred cC Q lcl|NC_020866. 
296 LA 297 (297) Q Consensus 296 La 297 (297) +- T Consensus 210 ~p 211 (276) T protein:vir:10 210 LD 211 (276) T ss_pred CC Confidence 87 No 21 >protein:vir:8187 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817980;genbank:gi:29566414;genbank:GeneID:2700968 Probab=86.16 E-value=0.048 Score=27.84 Aligned_cols=228 Identities=17% Similarity=0.126 Sum_probs=99.7 Q ss_pred CCcCHHH--H--HHHHHHHHHHHHHHHhhcchhhcceeeeecCCccceecccccCCCcchhccccee---eeeeccccce Q lcl|NC_020866. 1 MQVTAAN--L--DALRVGFKTSFQGALDQAPSQYLRLTTVVPSSTKEQRYGWMGKIPNVREWIGPRA---IQNLTESDYS 73 (297) Q Consensus 1 M~i~~~~--l--~~l~~~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~lrew~Ge~~---~~~l~~~~~~ 73 (297) |.-.... | ..+. +..++...+ .+...++|+++|-.....+|..+..-|.- .|+||-. ..+..=..-+ T Consensus 1 mat~~~gg~lvP~~~~---~~ii~~~~~--~s~i~~~~~~i~~~~~~~~~p~~~~~~~a-~wv~Eg~~~~~~~~~f~~v~ 74 (311) T protein:vir:81 1 MVALATGTFQLPKHLV---PGVWQKAQG--QSVLARLSMAEPQEFGEQQYMTLTAPPRG-EVVGEGAQKSESTATFAPVT 74 (311) T ss_pred CceecCCceEcchhHH---HHHHHHHHh--cchhhhhcceeecCCCceEEEEEeCCcee-EEeecCcccccccceeeEEE Confidence 4433211 0 1111 111111111 23456777777765556667666666654 7886543 3333445566 Q ss_pred eeeecccceeeccHHHh---hccCcchhHHHHHHHHHHHHhhHHHHHHHHHhcccCccccCcccccccccccccccccce Q lcl|NC_020866. 74 IREKPWELTIGVDRDDI---ETDNLGIYSPLFQEMGRSAGSKWDMLVFELLKLGFATECYDGQNFFDTDHPVLDEDGKTV 150 (297) Q Consensus 74 i~n~~fe~tv~v~R~~i---~dD~lG~~~~~~~~~g~aaa~~~~~lv~~lL~~G~~~~~~DGk~~F~tdH~~~~g~~~~~ 150 (297) ++.++++..+.|+++.+ .||..++..-+...++++.++.++..++.-..+| +.....|. ..+..... 
T Consensus 75 l~~~kl~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~a~l~G~~~~-~~~~~~gi---------~~~~~~~~ 144 (311) T protein:vir:81 75 AIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPL-TGAALSGS---------PAKILDTT 144 (311) T ss_pred EeeEEEEEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHhhhccccCC-CCcccccc---------cccccccc Confidence 77889999999999988 4677889999999999999999987665321111 11111110 00000000 Q ss_pred eeccccccchhHHHHHHHHHHHHHHHhhcccccchhhhccccccccccccceeeeccccccccccccchhhcCCccCCHH Q lcl|NC_020866. 151 TVSNTGGGTGTPWFLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLYGTDARANVGFGFWQMAYGSKQTLDGT 230 (297) Q Consensus 151 svsn~~ag~~~awylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~g~d~r~~~G~~l~q~a~~~~~~l~~~ 230 (297) .+..... ........+....+.-+ ...+.. . .+ |-|. T Consensus 145 ~~~~~~~-~~~~~~~~~i~~~~~~~-~~~~~~---------------------------~-~~---~vmn---------- 181 (311) T protein:vir:81 145 NIVELTT-GTSATPDLAVEAAVGLV-LGDNLS---------------------------P-DG---VALD---------- 181 (311) T ss_pred eeeeecc-cccchHHHHHHHHHHHh-hhcCCC---------------------------c-eE---EEEc---------- Confidence 0101111 11111111222111110 000000 0 00 2211 Q ss_pred HHHHHHHHHHhhccCCCccccc------ccCeEEecchHHHHHHHHHhhhccCCCC----------------cceeccee Q lcl|NC_020866. 231 AYAAARAALSGMKGDYGRPLGL------MPNLLVVPPALEEAGRKILNSENASGGE----------------TNPWKGTA 288 (297) Q Consensus 231 ~l~aar~aM~~~k~~~G~~L~i------~P~~LvVp~~le~~A~~ll~~~~~~~g~----------------~N~~~~~~ 288 (297) ...+.++++.||.+|++|-. .|..|. ..-++.++.++++. .-.+-|.+ T Consensus 182 --~~~~~~l~~lkd~~G~~l~~~~~~~~~~~tl~--------G~Pv~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~gDf 251 (311) T protein:vir:81 182 --NTFSFMLATQRDSQGRKLYPELGFGTDVASFA--------GLNAAVSDTVRGGPEAVTASTGVYRTTNPNVKAIAGDF 251 (311) T ss_pred --HHHHHHHHhhhccCCCeeecCccccCCCceec--------ceeEEecccccccccccccccchhcccCCccEEEEEec Confidence 33456778889999998632 122221 00011111221110 01111211 Q ss_pred -eEEeccccC Q lcl|NC_020866. 289 -ELLVVPWLA 297 (297) Q Consensus 289 -~~iv~p~La 297 (297) .+++-.|-. 
T Consensus 252 s~~~i~~~~~ 261 (311) T protein:vir:81 252 SAFRWGVQVS 261 (311) T ss_pred ccEEEEEecc Confidence 122222222 No 22 >protein:vir:2430 Length: 318 # NCBI annotation: major head subunit # Family: family:all:507 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046832;genbank:gi:9630400;genbank:GeneID:1261582 Probab=85.99 E-value=0.049 Score=27.78 Aligned_cols=226 Identities=14% Similarity=0.169 Sum_probs=97.8 Q ss_pred CCcCHHHHHHHH-----------HHHHHHHHHHHhhcchhhcceeeeecCCccceecccccCCCcchhcccceee---ee Q lcl|NC_020866. 1 MQVTAANLDALR-----------VGFKTSFQGALDQAPSQYLRLTTVVPSSTKEQRYGWMGKIPNVREWIGPRAI---QN 66 (297) Q Consensus 1 M~i~~~~l~~l~-----------~~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~lrew~Ge~~~---~~ 66 (297) =...+++-.... .-+...+.+.+... +...++++++|-......|..+..-|.. .|+||-.- .+ T Consensus 5 ~~~~~e~~~~~~~~~~~~~~~ip~~~~~~ii~~~~~~-~~l~~~~~~~~~~~~~~~ip~~~~~~~a-~~v~Eg~~~~~~~ 82 (318) T protein:vir:24 5 TAFAVDHAQIAQTGDTMFKGYLEPEQAKDYFAEAEKT-SIVQQFAQKVPMGTTGQKIPHWVGDVSA-QWIGEGDMKPITK 82 (318) T ss_pred CCCCHHHHHhhcccCcccceeechhHHHHHHHHHHhh-chhhhhcceeeccCCceEEEEEeCCcce-EEecCCccccccc Confidence 000111111110 11222222222222 2345567777755555667666666665 78866433 33 Q ss_pred eccccceeeeecccceeeccHHHhhccCcchhHHHHHHHHHHHHhhHHHHHHHHHhcccCccccCccccccccccccccc Q lcl|NC_020866. 67 LTESDYSIREKPWELTIGVDRDDIETDNLGIYSPLFQEMGRSAGSKWDMLVFELLKLGFATECYDGQNFFDTDHPVLDED 146 (297) Q Consensus 67 l~~~~~~i~n~~fe~tv~v~R~~i~dD~lG~~~~~~~~~g~aaa~~~~~lv~~lL~~G~~~~~~DGk~~F~tdH~~~~g~ 146 (297) ..=..-+++.++++..+.|+++.+.|....+..-+.+.++++.++..++.++ +|.+.. .+..+.+... 
T Consensus 83 ~~f~~i~~~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~~~~~~~d~a~l----~G~g~~--~~~~~~~~~~------ 150 (318) T protein:vir:24 83 GNMTSQTIAPHKIATIFVASAETVRANPANYLGTMRTKVATAFAMAFDGAAM----HGTDSP--FPTYIGQTTK------ 150 (318) T ss_pred cceeEEEEeeEEEEEeehhhHHHhhcChHHHHHHHHHHHHHHHHHHHHHhhh----cccCCC--CCcccccccc------ Confidence 3334456677999999999999999988889999999999999999988664 232211 0111111110 Q ss_pred ccceeeccccccchhHHHHHHHHHHHHHHHhhcccccchhhhccccccccccccceeeeccccccccccccchhhcCCcc Q lcl|NC_020866. 147 GKTVTVSNTGGGTGTPWFLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLYGTDARANVGFGFWQMAYGSKQT 226 (297) Q Consensus 147 ~~~~svsn~~ag~~~awylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~g~d~r~~~G~~l~q~a~~~~~~ 226 (297) ..+.+...+. ..++..+.. .....+.. ..+.+. .|= T Consensus 151 --~~~~~~~~~~--~~~~~~~~~-~~~~~~~~----------------------------~~~~~~---~~v-------- 186 (318) T protein:vir:24 151 --AISIADTTGA--TTVYDQVAV-NGLSLLVN----------------------------DGKKWT---HTL-------- 186 (318) T ss_pred --cccccccccc--cchHHHHHH-HHHHhhcc----------------------------ccCCCC---EEE-------- Confidence 1111111111 111111111 01000000 000000 011 Q ss_pred CCHHHHHHHHHHHHhhccCCCccccccc------------CeEEecchHHHHHHHHHhhhccCCCCcceec--------- Q lcl|NC_020866. 227 LDGTAYAAARAALSGMKGDYGRPLGLMP------------NLLVVPPALEEAGRKILNSENASGGETNPWK--------- 285 (297) Q Consensus 227 l~~~~l~aar~aM~~~k~~~G~~L~i~P------------~~LvVp~~le~~A~~ll~~~~~~~g~~N~~~--------- 285 (297) ++ ...+..+++.||.+|++|-... ..+.+|.- -++..+.|..=.+. T Consensus 187 ~n----~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~~~~~~i~g~pv~---------~~~~~~~~~~~~~~gdfs~~~~~ 253 (318) T protein:vir:24 187 LD----DITEPILNGAKDQNGRPLFIESTYGEAASPFRSGRIVARPTI---------LSDHVVEGTTVGFMGDFSQLIWG 253 (318) T ss_pred Ec----HHHHHHHHHhhccCCceeecCccccCccccccCceEEEEeeE---------EeCCCCCCccEEEEeecceEEEE Confidence 11 2234566778888888763221 11112211 11122222211111 Q ss_pred --ceeeE--EeccccC Q lcl|NC_020866. 286 --GTAEL--LVVPWLA 297 (297) Q Consensus 286 --~~~~~--iv~p~La 297 (297) +-+++ .-+..|. 
T Consensus 254 ~~~~l~i~~~~~~~~~ 269 (318) T protein:vir:24 254 QIGGLSFDVTDQATLN 269 (318) T ss_pred EecCeEEEEeecccee Confidence 11122 1222222 No 23 >protein:vir:9759 Length: 303 # NCBI annotation: putative structural protein # Family: family:all:966 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795521;genbank:gi:28876283;genbank:GeneID:1257824 Probab=82.81 E-value=0.074 Score=26.79 Aligned_cols=231 Identities=10% Similarity=0.076 Sum_probs=101.3 Q ss_pred CCcCHHHHHHHHHHHHHHHHHHHhhcchhhcceeeeecCCccceecccccCCCcchhcccceee---eeeccccceeeee Q lcl|NC_020866. 1 MQVTAANLDALRVGFKTSFQGALDQAPSQYLRLTTVVPSSTKEQRYGWMGKIPNVREWIGPRAI---QNLTESDYSIREK 77 (297) Q Consensus 1 M~i~~~~l~~l~~~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~lrew~Ge~~~---~~l~~~~~~i~n~ 77 (297) |..+...=--|=+.+...+-+.+. ..+...++|+++|-......|.+...-|. -.|+||-.- .+..=..-+++.+ T Consensus 1 m~t~t~gg~liP~~~~~~ii~~l~-~~s~i~~l~~~~~~~~~~~~ip~~~~~~~-a~wv~E~~~~~~s~~~f~~v~l~~~ 78 (303) T protein:vir:97 1 MGTETSKASLFDKHLVSDLINKVK-GHSSLAKLSSQKPIPFNGSKEFTFTLDSD-IDVVAENGKKTHGGLSLEPVTIVPI 78 (303) T ss_pred CcccCCCCeEcchhHHHHHHHHHH-hhchhhhhcceeecCCCceEEEEEecCcc-eEEeecCccccccccceeeEEeeeE Confidence 664322100011111122222222 13446777877776556667766655555 378866433 3333345667888 Q ss_pred cccceeeccHHHh---hccCcchhHHHHHHHHHHHHhhHHHHHHHHHhcccCccccCcccccccccccccccccceeecc Q lcl|NC_020866. 78 PWELTIGVDRDDI---ETDNLGIYSPLFQEMGRSAGSKWDMLVFELLKLGFATECYDGQNFFDTDHPVLDEDGKTVTVSN 154 (297) Q Consensus 78 ~fe~tv~v~R~~i---~dD~lG~~~~~~~~~g~aaa~~~~~lv~~lL~~G~~~~~~DGk~~F~tdH~~~~g~~~~~svsn 154 (297) +.+..+.|+|+.+ .||..++..-+...++++.++..+.-++. |.++ .+|.+.=.....+..+. 
..+ T Consensus 79 kl~~~~~iS~ell~~~~d~~~~l~~~i~~~la~a~~~~ld~a~l~----G~~~--~~g~~~~~~~~~~~~~~-----~~~ 147 (303) T protein:vir:97 79 KVEYGARLSDEFLYATEEEKIDILKAFNEGFAKKLARGIDLMAMH----GINP--RTKKASDVIGTNHFDSK-----VTQ 147 (303) T ss_pred EEEEeehhhHHHhhcCccchHHHHHHHHHHHHHHHHHHHHhhhhc----cccc--CCccccccccccccccc-----ccc Confidence 9999999999988 57778899999999999999998875553 2211 11111100000011100 001 Q ss_pred cc-ccchhHHHHHHHHHHHHHHHhhcccccchhhhccccccccccccceeeeccccccccccccchhhcCCccCCHHHHH Q lcl|NC_020866. 155 TG-GGTGTPWFLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLYGTDARANVGFGFWQMAYGSKQTLDGTAYA 233 (297) Q Consensus 155 ~~-ag~~~awylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~g~d~r~~~G~~l~q~a~~~~~~l~~~~l~ 233 (297) .. .+.+...| .+....+..+.-..+ ....++. + . T Consensus 148 ~~~~~~~~~~~-~~i~~~~~~~~~~~~-----------------~~~~~vm-----------------------n----~ 182 (303) T protein:vir:97 148 VVKFTESEDAD-ANIEAAVNLIQGAEG-----------------VVTGLAM-----------------------D----T 182 (303) T ss_pred ccccccccchH-HHHHHHHHHHhhcCC-----------------CccEEEE-----------------------c----H Confidence 10 11111111 122222211100000 0000111 1 2 Q ss_pred HHHHHHHhhccCCCcccc-------cccCeEEecchHHHHHHHHHhhhccCCCC-----cc-eecce------------e Q lcl|NC_020866. 234 AARAALSGMKGDYGRPLG-------LMPNLLVVPPALEEAGRKILNSENASGGE-----TN-PWKGT------------A 288 (297) Q Consensus 234 aar~aM~~~k~~~G~~L~-------i~P~~LvVp~~le~~A~~ll~~~~~~~g~-----~N-~~~~~------------~ 288 (297) ..+.++++.||.+|+++- ..|..|. ...++.++..+.+. .+ .+.|. + T Consensus 183 ~~~~~L~~lkd~~g~~~~~~~~~~~~~~~~l~--------G~Pv~~s~~v~~~~~~~~~~~~~~~Gdf~~~~~~~~~~~~ 254 (303) T protein:vir:97 183 EFSTALAKVTNGEMGPKMYPELAWGANPDSIN--------GLKSSVNTTVGAGADEAESKDLVIIGDFESMFKWGYAKQI 254 (303) T ss_pred HHHHHHHHhhccCCCeEEecCccCCCCCceec--------ceeeEEecccCCccccCCCccEEEEeeccccEEEEEecCc Confidence 344566778888887653 1222222 01111122222210 11 12221 2 Q ss_pred eEEeccccC Q lcl|NC_020866. 289 ELLVVPWLA 297 (297) Q Consensus 289 ~~iv~p~La 297 (297) ++-+.++-. 
T Consensus 255 ~~~~~~~~~ 263 (303) T protein:vir:97 255 PMEIIKYGD 263 (303) T ss_pred EEEEeeccC Confidence 222222222 No 24 >protein:vir:4600 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058445;genbank:gi:9635171;genbank:GeneID:1262708 Probab=82.55 E-value=0.076 Score=26.72 Aligned_cols=232 Identities=13% Similarity=0.059 Sum_probs=93.6 Q ss_pred CCcCHHHHHHHHH------------------------HHHHHHHHHHhhcchhhcceeeeecCCccceecccccCCCc-c Q lcl|NC_020866. 1 MQVTAANLDALRV------------------------GFKTSFQGALDQAPSQYLRLTTVVPSSTKEQRYGWMGKIPN-V 55 (297) Q Consensus 1 M~i~~~~l~~l~~------------------------~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~-l 55 (297) ..+..+.+++... -+...+.+..... .....+++.+|-++....|.++-.-+. - T Consensus 99 ~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~g~~~iP~~~~~~ii~~~~~~-~~l~~~~~~~~~~~~~~~~~~~~~~~~~~ 177 (415) T protein:vir:46 99 TKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVE-FNLDKYVTVKRVTNGSGKYPVVRQSEVAA 177 (415) T ss_pred hhhhHHHHHHHHHHHhhhhhhhhccccccCCcccccHHHHHHHHHHHHhh-hhhhhhcceeeccCCceeEEEEEecCCcc Confidence 1111111111111 0111111111111 122334555555555555554422221 1 Q ss_pred hhcccce-eee---eeccccceeeeecccceeeccHHHhhccCcchhHHHHHHHHHHHHhhHHHHHHHHHhcccCccccC Q lcl|NC_020866. 56 REWIGPR-AIQ---NLTESDYSIREKPWELTIGVDRDDIETDNLGIYSPLFQEMGRSAGSKWDMLVFELLKLGFATECYD 131 (297) Q Consensus 56 rew~Ge~-~~~---~l~~~~~~i~n~~fe~tv~v~R~~i~dD~lG~~~~~~~~~g~aaa~~~~~lv~~lL~~G~~~~~~D 131 (297) -.|++|- .+. ...=..-++..++++..+.|+|+.+.|-..++..-+.+.++++.++..++.++.-+.+|.+... 
T Consensus 178 ~~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~~~-- 255 (415) T protein:vir:46 178 LEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGST-- 255 (415) T ss_pred eeecccccccccccccceeeEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCCcccc-- Confidence 2466543 121 2233456788899999999999999887888999999999999999999877654433222110 Q ss_pred cccccccccccccccccceeeccccccchhHHHHHHHHHHHHHHHhhcccccchhhhccccccccccccceeeecccccc Q lcl|NC_020866. 132 GQNFFDTDHPVLDEDGKTVTVSNTGGGTGTPWFLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLYGTDARAN 211 (297) Q Consensus 132 Gk~~F~tdH~~~~g~~~~~svsn~~ag~~~awylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~g~d~r~~ 211 (297) .... ....+.....+...| ..+.-+|.+...+ .+.+ T Consensus 256 -----------~~~~---~~~~~~~~~~~~~~~-----~~i~~~~~~~~~~-------------------------~~~~ 291 (415) T protein:vir:46 256 -----------SSGF---EKEGKKLEVKKAKSL-----DDIKDAINLNVKP-------------------------NYEH 291 (415) T ss_pred -----------cccc---ccccceeccccccch-----HHHHHHHHhhhhh-------------------------ccCC Confidence 0000 000001011111111 0111122211110 0000 Q ss_pred ccccccchhhcCCccCCHHHHHHHHHHHHhhccCCCcccccccCeEE-ecchHHHHHHHHHhhhccC---CCCcceecce Q lcl|NC_020866. 212 VGFGFWQMAYGSKQTLDGTAYAAARAALSGMKGDYGRPLGLMPNLLV-VPPALEEAGRKILNSENAS---GGETNPWKGT 287 (297) Q Consensus 212 ~G~~l~q~a~~~~~~l~~~~l~aar~aM~~~k~~~G~~L~i~P~~Lv-Vp~~le~~A~~ll~~~~~~---~g~~N~~~~~ 287 (297) . .|- ++. ..+.++++.||-+|++|- .|+.-= +|..+ ....++.++..+ +|+...+.|. T Consensus 292 ~---~~v--------~n~----~~~~~L~~lkd~~G~~i~-~~~~~~~~~~~l--~G~pV~~~~~~~~~~~~~~~~~~gd 353 (415) T protein:vir:46 292 N---VAI--------VSQ----TMFAKLDKMKDKLGNYLI-QPDVKEKTQQRL--LGAKIEILPDEVLGQKGNNTLIIGN 353 (415) T ss_pred C---EEE--------EcH----HHHHHHHHhhccCCCeee-ccCcCCCCCccc--cceeeEEeccccccCCCccEEEEEe Confidence 0 011 112 224456778888888763 222100 00000 000011111111 2222333332 Q ss_pred ------------eeEEeccccC Q lcl|NC_020866. 
288 ------------AELLVVPWLA 297 (297) Q Consensus 288 ------------~~~iv~p~La 297 (297) +++..+++.. T Consensus 354 ~~~~~~~~~~~~~~v~~~~~~~ 375 (415) T protein:vir:46 354 LKDAIVLFDRSQYQASWTDYMH 375 (415) T ss_pred hhccEEEEeecceEEEeecccc Confidence 2222222222 No 25 >protein:vir:4700 Length: 415 # NCBI annotation: phi PVL ORF 7 homologue # Family: family:all:21 # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061632;genbank:gi:9635719;genbank:GeneID:1262976 Probab=82.55 E-value=0.076 Score=26.72 Aligned_cols=232 Identities=13% Similarity=0.059 Sum_probs=93.6 Q ss_pred CCcCHHHHHHHHH------------------------HHHHHHHHHHhhcchhhcceeeeecCCccceecccccCCCc-c Q lcl|NC_020866. 1 MQVTAANLDALRV------------------------GFKTSFQGALDQAPSQYLRLTTVVPSSTKEQRYGWMGKIPN-V 55 (297) Q Consensus 1 M~i~~~~l~~l~~------------------------~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~-l 55 (297) ..+..+.+++... -+...+.+..... .....+++.+|-++....|.++-.-+. - T Consensus 99 ~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~g~~~iP~~~~~~ii~~~~~~-~~l~~~~~~~~~~~~~~~~~~~~~~~~~~ 177 (415) T protein:vir:47 99 TKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVE-FNLDKYVTVKRVTNGSGKYPVVRQSEVAA 177 (415) T ss_pred hhhhHHHHHHHHHHHhhhhhhhhccccccCCcccccHHHHHHHHHHHHhh-hhhhhhcceeeccCCceeEEEEEecCCcc Confidence 1111111111111 0111111111111 122334555555555555554422221 1 Q ss_pred hhcccce-eee---eeccccceeeeecccceeeccHHHhhccCcchhHHHHHHHHHHHHhhHHHHHHHHHhcccCccccC Q lcl|NC_020866. 56 REWIGPR-AIQ---NLTESDYSIREKPWELTIGVDRDDIETDNLGIYSPLFQEMGRSAGSKWDMLVFELLKLGFATECYD 131 (297) Q Consensus 56 rew~Ge~-~~~---~l~~~~~~i~n~~fe~tv~v~R~~i~dD~lG~~~~~~~~~g~aaa~~~~~lv~~lL~~G~~~~~~D 131 (297) -.|++|- .+. ...=..-++..++++..+.|+|+.+.|-..++..-+.+.++++.++..++.++.-+.+|.+... 
T Consensus 178 ~~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~~~-- 255 (415) T protein:vir:47 178 LEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGST-- 255 (415) T ss_pred eeecccccccccccccceeeEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCCcccc-- Confidence 2466543 121 2233456788899999999999999887888999999999999999999877654433222110 Q ss_pred cccccccccccccccccceeeccccccchhHHHHHHHHHHHHHHHhhcccccchhhhccccccccccccceeeecccccc Q lcl|NC_020866. 132 GQNFFDTDHPVLDEDGKTVTVSNTGGGTGTPWFLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLYGTDARAN 211 (297) Q Consensus 132 Gk~~F~tdH~~~~g~~~~~svsn~~ag~~~awylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~g~d~r~~ 211 (297) .... ....+.....+...| ..+.-+|.+...+ .+.+ T Consensus 256 -----------~~~~---~~~~~~~~~~~~~~~-----~~i~~~~~~~~~~-------------------------~~~~ 291 (415) T protein:vir:47 256 -----------SSGF---EKEGKKLEVKKAKSL-----DDIKDAINLNVKP-------------------------NYEH 291 (415) T ss_pred -----------cccc---ccccceeccccccch-----HHHHHHHHhhhhh-------------------------ccCC Confidence 0000 000001011111111 0111122211110 0000 Q ss_pred ccccccchhhcCCccCCHHHHHHHHHHHHhhccCCCcccccccCeEE-ecchHHHHHHHHHhhhccC---CCCcceecce Q lcl|NC_020866. 212 VGFGFWQMAYGSKQTLDGTAYAAARAALSGMKGDYGRPLGLMPNLLV-VPPALEEAGRKILNSENAS---GGETNPWKGT 287 (297) Q Consensus 212 ~G~~l~q~a~~~~~~l~~~~l~aar~aM~~~k~~~G~~L~i~P~~Lv-Vp~~le~~A~~ll~~~~~~---~g~~N~~~~~ 287 (297) . .|- ++. ..+.++++.||-+|++|- .|+.-= +|..+ ....++.++..+ +|+...+.|. T Consensus 292 ~---~~v--------~n~----~~~~~L~~lkd~~G~~i~-~~~~~~~~~~~l--~G~pV~~~~~~~~~~~~~~~~~~gd 353 (415) T protein:vir:47 292 N---VAI--------VSQ----TMFAKLDKMKDKLGNYLI-QPDVKEKTQQRL--LGAKIEILPDEVLGQKGNNTLIIGN 353 (415) T ss_pred C---EEE--------EcH----HHHHHHHHhhccCCCeee-ccCcCCCCCccc--cceeeEEeccccccCCCccEEEEEe Confidence 0 011 112 224456778888888763 222100 00000 000011111111 2222333332 Q ss_pred ------------eeEEeccccC Q lcl|NC_020866. 
288 ------------AELLVVPWLA 297 (297) Q Consensus 288 ------------~~~iv~p~La 297 (297) +++..+++.. T Consensus 354 ~~~~~~~~~~~~~~v~~~~~~~ 375 (415) T protein:vir:47 354 LKDAIVLFDRSQYQASWTDYMH 375 (415) T ss_pred hhccEEEEeecceEEEeecccc Confidence 2222222222 No 26 >protein:vir:9574 Length: 300 # NCBI annotation: gp40 # Family: family:all:966 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862879;genbank:gi:32469471;genbank:GeneID:1461316 Probab=81.79 E-value=0.083 Score=26.52 Aligned_cols=235 Identities=16% Similarity=0.102 Sum_probs=98.7 Q ss_pred CCcCHHHHHHH-HHHHHHHHHHHHhhcchhhcceeeeecCCccceecccccCCCcchhcccceeee---eeccccceeee Q lcl|NC_020866. 1 MQVTAANLDAL-RVGFKTSFQGALDQAPSQYLRLTTVVPSSTKEQRYGWMGKIPNVREWIGPRAIQ---NLTESDYSIRE 76 (297) Q Consensus 1 M~i~~~~l~~l-~~~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~lrew~Ge~~~~---~l~~~~~~i~n 76 (297) |+-+...=-.+ -.-+...+-+.+. ..+.-.++|+.+|-.....+|..+..-|. -.|+||-.-. +..=..-+++- T Consensus 1 ma~~t~~~G~lip~~~~~~ii~~l~-~~s~i~~l~~~~~~~~~~~~~p~~~~~~~-a~wv~Eg~~~~~s~~~f~~v~l~~ 78 (300) T protein:vir:95 1 MSEAQLSKGNLFNPELVTKVINKVK-GHSSIAKLSPQKPIPFNGQREFVFDFDSD-IDIVAENGKKTHGGVSLDPVTIVP 78 (300) T ss_pred CcccccCCcceechhhHHHHHHHHH-hhhhhhhhcceeeccCCceEEEEEecCcc-eEEeeCCcccccccccceeeEeee Confidence 66554221000 0001111111111 11233456666654434445555544454 3688664332 22223455667 Q ss_pred ecccceeeccHHHh---hccCcchhHHHHHHHHHHHHhhHHHHHHHHHhcccCccccCccc--cccccccccccccccee Q lcl|NC_020866. 77 KPWELTIGVDRDDI---ETDNLGIYSPLFQEMGRSAGSKWDMLVFELLKLGFATECYDGQN--FFDTDHPVLDEDGKTVT 151 (297) Q Consensus 77 ~~fe~tv~v~R~~i---~dD~lG~~~~~~~~~g~aaa~~~~~lv~~lL~~G~~~~~~DGk~--~F~tdH~~~~g~~~~~s 151 (297) ++++..+.|+++.+ .||..++..-+...++++.++.+++.++. |.++. +|++ +..... .. .. 
T Consensus 79 ~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~l~~aia~~~d~~~l~----G~~~~--~g~~~~~~~~~~------~~-~~ 145 (300) T protein:vir:95 79 LKVEYGARVSDEFLHASEEAKVDMLTDFVEGFSKKLARGLDIMSIH----GINPR--TKQASTIIGDNC------FD-KK 145 (300) T ss_pred EEEEEeehhhHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHhhhh----cccCC--CCCCcccccccc------cc-cc Confidence 89999999999998 57779999999999999999999987763 21111 1111 111100 00 00 Q ss_pred eccccccchhHHHHHHHHHHHHHHHhhcccccchhhhccccccccccccceeeeccccccccccccchhhcCCccCCHHH Q lcl|NC_020866. 152 VSNTGGGTGTPWFLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLYGTDARANVGFGFWQMAYGSKQTLDGTA 231 (297) Q Consensus 152 vsn~~ag~~~awylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~g~d~r~~~G~~l~q~a~~~~~~l~~~~ 231 (297) ..+...+.+...| .+......-+.-. .+...+ |-| + T Consensus 146 ~~~~~~~~~~~~~-~~i~~~~~~~~~~-----------------------------~~~~~~---~vm--------n--- 181 (300) T protein:vir:95 146 VTQTVPFKDTNPD-ESMEDAVGMIDGS-----------------------------ERDITG---AIL--------D--- 181 (300) T ss_pred cceeecccccchH-HHHHHHHHHhhhc-----------------------------CCCccE---EEE--------C--- Confidence 0111111111111 1111111100000 000000 221 1 Q ss_pred HHHHHHHHHhhccCCCcccccccCeEEecchHHHHHHHHHhhhccCCCCc---c-eeccee------------eEEeccc Q lcl|NC_020866. 232 YAAARAALSGMKGDYGRPLGLMPNLLVVPPALEEAGRKILNSENASGGET---N-PWKGTA------------ELLVVPW 295 (297) Q Consensus 232 l~aar~aM~~~k~~~G~~L~i~P~~LvVp~~le~~A~~ll~~~~~~~g~~---N-~~~~~~------------~~iv~p~ 295 (297) .....++++.||-+|++|-.....--.|..| .-..++-++.++.+.+ + .+-|.+ ++-++++ T Consensus 182 -~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~l--~G~Pv~~s~~v~~~~~~~~~~~~~GDf~~~~~~~~~~~~~~~v~~~ 258 (300) T protein:vir:95 182 -PIFTTALSKMKNAEGGKLYPELAWGGVPDAI--NGLAVDKNRTVSYSQTDPKNTAIVGDFETMFKWGYAKEVPMEIIKY 258 (300) T ss_pred -HHHHHHHHHhhccCCCeeccCccccCCCcee--cceeeEEecCCCCCCCCCccEEEEeeccceEEEEEecccEEEEeec Confidence 2345678889999999874111110111111 1122222333433321 1 122222 2222222 Q ss_pred cC Q lcl|NC_020866. 296 LA 297 (297) Q Consensus 296 La 297 (297) -. 
T Consensus 259 ~~ 260 (300) T protein:vir:95 259 GD 260 (300) T ss_pred cC Confidence 11 No 27 >protein:vir:78223 Length: 333 # NCBI annotation: Putative major head protein # Family: family:all:966 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491666;genbank:gi:157786490;genbank:GeneID:5625701 Probab=80.69 E-value=0.093 Score=26.25 Aligned_cols=242 Identities=14% Similarity=0.101 Sum_probs=93.7 Q ss_pred CCcCHHHHHH------------------HHHHHHHHHHHHHhhcchhhcceeeeecCCccceecccccCCCcchhcccce Q lcl|NC_020866. 1 MQVTAANLDA------------------LRVGFKTSFQGALDQAPSQYLRLTTVVPSSTKEQRYGWMGKIPNVREWIGPR 62 (297) Q Consensus 1 M~i~~~~l~~------------------l~~~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~lrew~Ge~ 62 (297) |..-.+. ++ |=+-+...+-+.+... +...++++++|-......+..+..-|. -.|+||- T Consensus 1 ~a~l~el-~~~~~~~~~~g~~~~~~~~liP~~~~~~ii~~l~~~-s~l~~~~~~~~~~~~~~~~p~~~~~~~-a~~v~eg 77 (333) T protein:vir:78 1 MATLNEL-LPNSAGSNHQGRLAHVPSDLLPKEIVGPIFDKAQES-SLVLRMGEQIPISYGETIIPTTVKRPE-VGQVGVG 77 (333) T ss_pred CchhHHh-hhhcccccccCceecCCccccchhHHHHHHHHHHhh-chhhhhcceeeccCCceEEEEEeCCce-eEeecCc Confidence 1111000 00 0001111111122111 123555666664444444444444333 2344433 Q ss_pred ee-----------eeeccccceeeeecccceeeccHHHhhccCcchhHHHHHHHHHHHHhhHHHHHHHHHhcccCccccC Q lcl|NC_020866. 63 AI-----------QNLTESDYSIREKPWELTIGVDRDDIETDNLGIYSPLFQEMGRSAGSKWDMLVFELLKLGFATECYD 131 (297) Q Consensus 63 ~~-----------~~l~~~~~~i~n~~fe~tv~v~R~~i~dD~lG~~~~~~~~~g~aaa~~~~~lv~~lL~~G~~~~~~D 131 (297) +. .+..=..-++.-++.+..+.|+|+.+.|....+..-+.+.|+++.++.+++.++. 
|.+..- T Consensus 78 ~~~~~~e~~~~~~~~~~f~~i~l~~~kl~~~~~is~ell~~s~~~~~~~i~~~la~ai~~~~d~~~l~----G~g~~~-- 151 (333) T protein:vir:78 78 TSNEQREGGLKPLSGTAWDTRSVSPIKLATIVTVSEEFARMNPSGLYTKLQGDLAYAIGRGIDLAVFH----GKSPLT-- 151 (333) T ss_pred ccccccccccccccccceeEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhc----ccCCCC-- Confidence 22 2222233467778999999999999999999999999999999999999986653 222110 Q ss_pred cccccccccccccccccceeeccccccchhHHHHHHHHHHHHHHHhhcccccchhhhccccccccccccceeeecccccc Q lcl|NC_020866. 132 GQNFFDTDHPVLDEDGKTVTVSNTGGGTGTPWFLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLYGTDARAN 211 (297) Q Consensus 132 Gk~~F~tdH~~~~g~~~~~svsn~~ag~~~awylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~g~d~r~~ 211 (297) +..+-..-+....... ...... +..+..+|- +.-..+..+ .... ....++ +++ T Consensus 152 ~~~~~g~~~~~~~~~~--~~~~~~-~~~~~~~~~-~i~~~~~~~----~~~~-------~~~~~~-----~vm------- 204 (333) T protein:vir:78 152 GSALQGIDTDNVIANT--TNVDYL-QETGDPLLD-RLLDGYDLV----SANT-------DVEFNG-----WAV------- 204 (333) T ss_pred Cccccccccccccccc--cccccc-ccccchhHH-HHHHHHHhh----cccc-------ccCceE-----EEE------- Confidence 0000000010000000 000000 111111110 010011000 0000 000000 111 Q ss_pred ccccccchhhcCCccCCHHHHHHHHHHHHhhccCCCcccccccCeEEecchHHHHHHHHHhhhccCC------------- Q lcl|NC_020866. 212 VGFGFWQMAYGSKQTLDGTAYAAARAALSGMKGDYGRPLGLMPNLLVVPPALEEAGRKILNSENASG------------- 278 (297) Q Consensus 212 ~G~~l~q~a~~~~~~l~~~~l~aar~aM~~~k~~~G~~L~i~P~~LvVp~~le~~A~~ll~~~~~~~------------- 278 (297) + .+.......++..||.+|+++-...-.--.|..+. ..-++.++.++. T Consensus 205 ----------------n-~~~~~~L~~~~~~~d~~G~~i~~~~~~~~~~~~l~--G~Pv~~~~~i~~~~~~~~~~~~~~~ 265 (333) T protein:vir:78 205 ----------------D-PRFRAHLLRAQAYRDANGNVDPSRINLAAQTGDVL--GLPAQFGRAVGGDLGAAVDSKTRII 265 (333) T ss_pred ----------------c-chHHHHHHHHhhhcCCCCceeecCccccCCCceee--ceeeEEccccCCCccccCCCccEEE Confidence 0 11123334566777888887642110000000000 011122222221 Q ss_pred -CC-ccee---cceeeEEeccccC Q lcl|NC_020866. 
279 -GE-TNPW---KGTAELLVVPWLA 297 (297) Q Consensus 279 -g~-~N~~---~~~~~~iv~p~La 297 (297) |+ .+-+ ++-+++.++++-. T Consensus 266 ~gD~~~~~~g~~~~~~i~~~~~~~ 289 (333) T protein:vir:78 266 GGDFSQLKFGFADEIRIKMSDTAT 289 (333) T ss_pred EEecccEEEEEeeccEEEEecccc Confidence 11 1111 2335666666655 No 28 >protein:vir:96762 Length: 632 # NCBI annotation: putative phage-related protein # Family: family:all:21 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039818;genbank:gi:126010917;genbank:GeneID:5076272 Probab=79.46 E-value=0.1 Score=25.97 Aligned_cols=224 Identities=15% Similarity=0.109 Sum_probs=99.3 Q ss_pred CCcCHHHH--HHHHHH-------------HHHHHHHHHhhcchhhcce-eeeecCCccceecccccCCCcchhcccc--- Q lcl|NC_020866. 1 MQVTAANL--DALRVG-------------FKTSFQGALDQAPSQYLRL-TTVVPSSTKEQRYGWMGKIPNVREWIGP--- 61 (297) Q Consensus 1 M~i~~~~l--~~l~~~-------------~~~~f~~a~~~a~~~~~~~-a~~v~S~~~~~~y~~Lg~~P~lrew~Ge--- 61 (297) ..+....| +++.++ +...|-+.+.. .+...++ ++.+|.....-++..+..-|.. -|+|| T Consensus 345 ~~~~~~~l~~ra~~~~t~~~gg~lvp~~~~~~~iie~lr~-~s~i~~l~~~~~~~~~g~~~ip~~~~~~~a-~wv~E~~~ 422 (632) T protein:vir:96 345 FYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRN-KAIIGQMGARMLPGLVGDVDIPKKTSGANF-YWIGEDED 422 (632) T ss_pred hhhhHHHHHHhhhhcccccccccccccccchHHHHHHHhh-cchhhhhcceEeecCCcceEEEEEeCCcee-EeecCCcc Confidence 01111111 111110 01122222211 1223334 4555654444444444444443 36644 Q ss_pred eeeeeeccccceeeeecccceeeccHHHhhccCcchhHHHHHHHHHHHHhhHHHHHHHHHhcccCccccCccc---cccc Q lcl|NC_020866. 
62 RAIQNLTESDYSIREKPWELTIGVDRDDIETDNLGIYSPLFQEMGRSAGSKWDMLVFELLKLGFATECYDGQN---FFDT 138 (297) Q Consensus 62 ~~~~~l~~~~~~i~n~~fe~tv~v~R~~i~dD~lG~~~~~~~~~g~aaa~~~~~lv~~lL~~G~~~~~~DGk~---~F~t 138 (297) ....++.-...++..++++..+.|+|+.++|++.++..-+...|+.+.+...|..++ +|..+ .++| +.++ T Consensus 423 ~~~s~~~f~~i~l~~~k~~~~v~iS~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l----~G~G~---~~~p~Gi~~~~ 495 (632) T protein:vir:96 423 VQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAML----TGTGL---ANDPVGLLNMT 495 (632) T ss_pred ccccccceeeEEeeeeEEEEehhhHHHHHhccchHHHHHHHHHHHHHHHHHHHHHhh----cccCC---CCccceeeecc Confidence 444555556678889999999999999999999999999999999999999998654 23211 1222 1112 Q ss_pred ccccccccccceeeccccccchhHHHHHHHHHHHHHHHhhcccccchhhhccccccccccccceeeeccccccccccccc Q lcl|NC_020866. 139 DHPVLDEDGKTVTVSNTGGGTGTPWFLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLYGTDARANVGFGFWQ 218 (297) Q Consensus 139 dH~~~~g~~~~~svsn~~ag~~~awylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~g~d~r~~~G~~l~q 218 (297) .++. .....+...|.. . ..+...|. . .++ + .+...|- T Consensus 496 ~~~~------------~~~~~~~~~~~~-i-~~~~~~i~---~------------~~~----------~----~~~~~~~ 532 (632) T protein:vir:96 496 GVPA------------LTYPAGGVDWAS-V-VDMETKIS---T------------FNA----------D----AGRLAYL 532 (632) T ss_pred cccc------------eecccccCCHHH-H-HHHHHHHh---h------------ccc----------c----cCccEEE Confidence 1111 110111111110 0 00111100 0 000 0 0001121 Q ss_pred hhhcCCccCCHHHHHHHHHHHH--hhccCCCcccccccCeEEecchHHHHHHHHHhhhccCC-----CCc-cee---cce Q lcl|NC_020866. 219 MAYGSKQTLDGTAYAAARAALS--GMKGDYGRPLGLMPNLLVVPPALEEAGRKILNSENASG-----GET-NPW---KGT 287 (297) Q Consensus 219 ~a~~~~~~l~~~~l~aar~aM~--~~k~~~G~~L~i~P~~LvVp~~le~~A~~ll~~~~~~~-----g~~-N~~---~~~ 287 (297) + +.. 
.+.+++ ..+|-.|++|-- |..|.-.| ++.+..+++ |+- ..+ .+- T Consensus 533 ~--------~~~----~~~~l~~~~l~d~~G~~i~~-~~~l~G~p--------v~~s~~ip~~~~~~gd~s~~~i~~~~~ 591 (632) T protein:vir:96 533 T--------SVT----QRGAAKKAQVFDNTGERIWQ-NNEVNGYR--------AEASNQIPADTWIFGDWSQIVIAMWGV 591 (632) T ss_pred E--------chh----HHHHHHHHhccCCCCceeec-CCeecccc--------eEeccccccCcEEEeecceEEEEEecc Confidence 1 111 112222 356888888742 33332111 122233333 222 222 245 Q ss_pred eeEEeccccC Q lcl|NC_020866. 288 AELLVVPWLA 297 (297) Q Consensus 288 ~~~iv~p~La 297 (297) +++.++|+-- T Consensus 592 ~~i~~~~~~~ 601 (632) T protein:vir:96 592 LDLKVDPYTK 601 (632) T ss_pred eEEEEccccc Confidence 7788887643 No 29 >protein:vir:7771 Length: 330 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817605;genbank:gi:29566035;genbank:GeneID:1259229 Probab=79.06 E-value=0.11 Score=25.88 Aligned_cols=239 Identities=14% Similarity=0.132 Sum_probs=99.5 Q ss_pred CCcCHHHHHH-HHHH-HHHHHHHHHhhcchhhcceeeeecCCccceecccccCCCcchhcccce---eeeeeccccceee Q lcl|NC_020866. 1 MQVTAANLDA-LRVG-FKTSFQGALDQAPSQYLRLTTVVPSSTKEQRYGWMGKIPNVREWIGPR---AIQNLTESDYSIR 75 (297) Q Consensus 1 M~i~~~~l~~-l~~~-~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~lrew~Ge~---~~~~l~~~~~~i~ 75 (297) |..++..-.. |-.- .+..++ .... .+...++++.++-......|..+..-|. -.|++|- ...+..=..-++. T Consensus 10 ~~~~t~~~g~~i~~~~~~~ii~-~~~~-~s~l~~~~~~~~~~~~~~~~p~~~~~~~-a~~v~Eg~~~~~~~~~f~~i~~~ 86 (330) T protein:vir:77 10 QVALTGDFSAFLTPEQSQDYFA-EIEK-TSIVQRIARKVPMGPTGISIPHWTGAVS-ASWTGEAERKPITKGSFGKQELE 86 (330) T ss_pred hccccCCCcceechhHHHHHHH-HHHh-ccchhhhcceeeccCCceEEEEEcCCcc-eeEecCCCccccccceeeEEEEe Confidence 2222111000 0000 122222 2222 2235566777775444456666666666 3677553 3333333446788 Q ss_pred eecccceeeccHHHhhccCcchhHHHHHHHHHHHHhhHHHHHHHHHhcccCccccCcccccccccccccccccceeeccc Q lcl|NC_020866. 
76 EKPWELTIGVDRDDIETDNLGIYSPLFQEMGRSAGSKWDMLVFELLKLGFATECYDGQNFFDTDHPVLDEDGKTVTVSNT 155 (297) Q Consensus 76 n~~fe~tv~v~R~~i~dD~lG~~~~~~~~~g~aaa~~~~~lv~~lL~~G~~~~~~DGk~~F~tdH~~~~g~~~~~svsn~ 155 (297) .++++..+.|+|+.++|-...+..-+.+.++++.++..+..++ +|... |+++-.--+.... .......+. T Consensus 87 ~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~ai~~~~~~~~l----~G~g~----~~~~~g~~~~~~~--~~~~~~~~~ 156 (330) T protein:vir:77 87 PVKITTIFAESAEVVRLNPLNYLNTMRTKIAEAIALKFDAAAI----HGIDK----PSAFKGYLAETTK--VVSLADTNL 156 (330) T ss_pred EEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhh----cccCC----CCccccccccccc--cceeecccc Confidence 8999999999999999888999999999999999999987544 23221 1111000000000 000000000 Q ss_pred --cccchhHHHHHHHHHHHHHHHhhcccccchhhhccccccccccccceeeeccccccccccccchhhcCCccCCHHHHH Q lcl|NC_020866. 156 --GGGTGTPWFLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLYGTDARANVGFGFWQMAYGSKQTLDGTAYA 233 (297) Q Consensus 156 --~ag~~~awylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~g~d~r~~~G~~l~q~a~~~~~~l~~~~l~ 233 (297) .++.....| .+....+.-+....+ +..+ |- ++ . T Consensus 157 ~~~~~~~~~~~-~~l~~~~~~~~~~~~------------~~~~--------------------~v--------mn----~ 191 (330) T protein:vir:77 157 TTASGPQGNAY-LAVNNALSLLVNSGK------------KWTG--------------------TL--------LD----N 191 (330) T ss_pred cccccccchhH-HHHHHHHHhhhhcCC------------CccE--------------------EE--------Ec----H Confidence 011111111 111111111000000 0000 11 11 2 Q ss_pred HHHHHHHhhccCCCcccccccCeEEec---chHHHHHHHHHhhhccCCCCc-c---eec-----------ceeeEEeccc Q lcl|NC_020866. 234 AARAALSGMKGDYGRPLGLMPNLLVVP---PALEEAGRKILNSENASGGET-N---PWK-----------GTAELLVVPW 295 (297) Q Consensus 234 aar~aM~~~k~~~G~~L~i~P~~LvVp---~~le~~A~~ll~~~~~~~g~~-N---~~~-----------~~~~~iv~p~ 295 (297) ..+..+++.|+-+|++|-.....--.| ....-...-++.++..+.++. | .+. +-+++-++.. 
T Consensus 192 ~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~l~G~PV~~~~~~p~~~~~~~~~~~~gd~s~~~i~~~~~~~i~~~~e 271 (330) T protein:vir:77 192 VTEPILNTAVDGNGRPLFVESTYTEQVGAIREGRILGRPTYVADNVVNGTVGNRVVGVMGDFSQVIWGQIGGLSFDVTDQ 271 (330) T ss_pred HHHHHHHHHhccCCceeecCccccccccccCCceecceeeEEeccccCCCCCCccEEEEEecceEEEEEecCcEEEEeec Confidence 334567778888888763211100000 000000111222233333221 1 111 1233333333 Q ss_pred cC Q lcl|NC_020866. 296 LA 297 (297) Q Consensus 296 La 297 (297) .. T Consensus 272 ~~ 273 (330) T protein:vir:77 272 AT 273 (330) T ss_pred ce Confidence 32 No 30 >protein:vir:3613 Length: 272 # NCBI annotation: MHP # Family: family:all:522 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112699;genbank:gi:13786567;genbank:GeneID:921035 Probab=78.28 E-value=0.12 Score=25.71 Aligned_cols=191 Identities=18% Similarity=0.150 Sum_probs=96.7 Q ss_pred CCcCHHHHHHHHH--HHHHHHHHHHhhcchhhcceeeeec-------CCccceecccccCCCcchhcccceeeeeecccc Q lcl|NC_020866. 1 MQVTAANLDALRV--GFKTSFQGALDQAPSQYLRLTTVVP-------SSTKEQRYGWMGKIPNVREWIGPRAIQNLTESD 71 (297) Q Consensus 1 M~i~~~~l~~l~~--~~~~~f~~a~~~a~~~~~~~a~~v~-------S~~~~~~y~~Lg~~P~lrew~Ge~~~~~l~~~~ 71 (297) |.-+...|..+.. -|....++.+..+ --+..++..-. ++-....|.-+|+.-.+.| -++.....+.... T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~-~~~~~~~~~~~~l~g~~G~ti~iP~~~~~gda~~~~e-g~~i~~~~lt~~~ 78 (272) T protein:vir:36 1 MSKQKTTLADLVNPEVLAPIVSYELNKA-LRFAPLAQVDTTLQGQPGNTLKFPAFTYIGDAADVAE-GGEISLDKIGTTT 78 (272) T ss_pred CCCcceehhhhhchHHHHHHHHHHHHhh-hhhccccccccccccCCCCEEEEeeeccCccccccCC-CCccChhhcCCcc Confidence 7643333333322 1111122222111 12233332211 1111112222333311111 1345666777777 Q ss_pred ceeeeecccceeeccHHHhhccCcchhHHHHHHHHHHHHhhHHHHHHHHHhcccCccccCccccccccccccccccccee Q lcl|NC_020866. 72 YSIREKPWELTIGVDRDDIETDNLGIYSPLFQEMGRSAGSKWDMLVFELLKLGFATECYDGQNFFDTDHPVLDEDGKTVT 151 (297) Q Consensus 72 ~~i~n~~fe~tv~v~R~~i~dD~lG~~~~~~~~~g~aaa~~~~~lv~~lL~~G~~~~~~DGk~~F~tdH~~~~g~~~~~s 151 (297) -+.+.+.++..+.|+-.+...---.......++++.+.++.-|.-++..|... +. 
+ T Consensus 79 ~~~~i~~~~k~~~vtD~~~~~~~~d~~~~~~~~~a~~~a~~~d~~i~~~l~~~--~~----------------------~ 134 (272) T protein:vir:36 79 KSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLANKVDDDLLSAAKTT--SQ----------------------T 134 (272) T ss_pred eeEeeehhhccccccHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhccc--cc----------------------c Confidence 77888899999999988877655556677777888888887777666655320 00 0 Q ss_pred eccccccchhHHHHHHHHHHHHHHHhhcccccchhhhccccccccccccceeeeccccccccccccchhhcCCccCCHHH Q lcl|NC_020866. 152 VSNTGGGTGTPWFLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLYGTDARANVGFGFWQMAYGSKQTLDGTA 231 (297) Q Consensus 152 vsn~~ag~~~awylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~g~d~r~~~G~~l~q~a~~~~~~l~~~~ 231 (297) ....++.+. T Consensus 135 -----------------------------------------------------------------------~~~~~~~d~ 143 (272) T protein:vir:36 135 -----------------------------------------------------------------------VSTKANVDG 143 (272) T ss_pred -----------------------------------------------------------------------ccccccHHH Confidence 011234455 Q ss_pred HHHHHHHHHhhccCCCcccccccCeEEecchHHHHHHHHHhhhccC-C--------CCcceecceeeEEeccccC Q lcl|NC_020866. 232 YAAARAALSGMKGDYGRPLGLMPNLLVVPPALEEAGRKILNSENAS-G--------GETNPWKGTAELLVVPWLA 297 (297) Q Consensus 232 l~aar~aM~~~k~~~G~~L~i~P~~LvVp~~le~~A~~ll~~~~~~-~--------g~~N~~~~~~~~iv~p~La 297 (297) +..|++.|.... . .+++++|+|.....=++...-+... . 
|..-.+.| ++||++..+- T Consensus 144 i~~A~~~lgd~~----~----~~~~ivv~p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~~~G-~~Vv~s~~~p 209 (272) T protein:vir:36 144 VQAALDIFNDED----A----QAYVLIVNPKDAAKIRKDANAKNIGSEVGANALINGTYADVLG-AQIVRSKKLA 209 (272) T ss_pred HHHHHHHhhhcC----C----CceEEEEcHHHHHHHhcccccccccccccccceeeeccceecC-eeEEEeCCCC Confidence 666666665332 2 3578999998655443332222221 1 11223455 6899999887 No 31 >protein:vir:8102 Length: 543 # NCBI annotation: gp6 # Family: family:all:21 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817683;genbank:gi:29566114;genbank:GeneID:1259308 Probab=78.24 E-value=0.12 Score=25.70 Aligned_cols=232 Identities=14% Similarity=0.086 Sum_probs=95.4 Q ss_pred CCcCHHHHHH-HHHH-HHHHHHHHHhhcchhhcceeeeecCCccceecccccCCCcchhcccc---eeeeeeccccceee Q lcl|NC_020866. 1 MQVTAANLDA-LRVG-FKTSFQGALDQAPSQYLRLTTVVPSSTKEQRYGWMGKIPNVREWIGP---RAIQNLTESDYSIR 75 (297) Q Consensus 1 M~i~~~~l~~-l~~~-~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~lrew~Ge---~~~~~l~~~~~~i~ 75 (297) +-+|...=.. +=.- ....+.+.+.... ....+++.++.+ ....+.+....|. -.|+|| ....++.-..-++. T Consensus 251 ~~~t~~~gg~lip~~~~~~ii~~~~~~~~-~l~~~~~~~~~~-g~~~~~~~~~~~~-a~~v~Eg~~~~~~~~~~~~i~~~ 327 (543) T protein:vir:81 251 MGLTKADGGYLVPFQLDPTVIITSNGSLN-DIRRFARQVVAT-GDVWHGVSSAAVQ-WSWDAEFEEVSDDSPEFGQPEIP 327 (543) T ss_pred cccccccCcccCchhhhhHHHHHHHhhhc-hhhhhcccccCC-cceEEEEecCCcc-eeecccCccccccccccceeeee Confidence 1111100000 0000 1223333333322 345556555443 3334445544444 357654 33344444556788 Q ss_pred eecccceeeccHHHhhccCcchhHHHHHHHHHHHHhhHHHHHHHHHhcccCccccCcccccccccccccccccceeeccc Q lcl|NC_020866. 76 EKPWELTIGVDRDDIETDNLGIYSPLFQEMGRSAGSKWDMLVFELLKLGFATECYDGQNFFDTDHPVLDEDGKTVTVSNT 155 (297) Q Consensus 76 n~~fe~tv~v~R~~i~dD~lG~~~~~~~~~g~aaa~~~~~lv~~lL~~G~~~~~~DGk~~F~tdH~~~~g~~~~~svsn~ 155 (297) .++++..+.|||+.+. |...+..-+.+.++++.++..+..++ +| ||.+ ++|.+--+......... 
T Consensus 328 ~~k~~~~~~is~ell~-d~~~~~~~i~~~l~~~~~~~~d~ail----~G------~Gt~----~~p~Gi~~~~~~~~~~~ 392 (543) T protein:vir:81 328 VKKAQGFVPISIEALQ-DEANVTETVALLFAEGKDELEAVTLT----TG------TGQG----NQPTGIVTALAGTAAEI 392 (543) T ss_pred eeeeEeeehhhHHHHh-ccHHHHHHHHHHHHHHHHHHHHHHHh----cc------CCCC----cccccchhhcccccccc Confidence 8999999999999885 66899999999999999999887553 23 2221 23332111111111111 Q ss_pred cccchhHHHHHHHHHHHHHHHhhcccccchhhhccccccccccccceeeeccccccccccccchhhcCCccCCHHHHHHH Q lcl|NC_020866. 156 GGGTGTPWFLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLYGTDARANVGFGFWQMAYGSKQTLDGTAYAAA 235 (297) Q Consensus 156 ~ag~~~awylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~g~d~r~~~G~~l~q~a~~~~~~l~~~~l~aa 235 (297) ..+.....-+.+.. -++...... | +.+ +.|- ++ ... T Consensus 393 ~~~~~~~~~~~~~~----~~~~~l~~~---------------------~----~~~---~~~v--------~n----~~~ 428 (543) T protein:vir:81 393 APVTAETFALADVY----AVYEQLAAR---------------------H----RRQ---GAWL--------AN----NLI 428 (543) T ss_pred cccccccccHHHHH----HHHHhhhcc---------------------c----cCC---cEEE--------Ec----HHH Confidence 11100000000100 000000000 0 000 0111 11 223 Q ss_pred HHHHHhhccCCCcccccccCeEEecchHHHHHHHHHhhhccCC--------CCc--------cee---cceeeEEecccc Q lcl|NC_020866. 236 RAALSGMKGDYGRPLGLMPNLLVVPPALEEAGRKILNSENASG--------GET--------NPW---KGTAELLVVPWL 296 (297) Q Consensus 236 r~aM~~~k~~~G~~L~i~P~~LvVp~~le~~A~~ll~~~~~~~--------g~~--------N~~---~~~~~~iv~p~L 296 (297) +..+++.||.+|+||--.+.- =.|+.+ ...-++.+...+. +.. |.+ ++-+++.++|+. T Consensus 429 ~~~l~~lkd~~G~~l~~~~~~-g~~~~l--~G~pv~~~~~~~~~~~~~~~~~~~~i~~gd~~~~~i~~~~~~~i~~~~~~ 505 (543) T protein:vir:81 429 YNKIRQFDTQGGAGLWTTIGN-GEPSQL--LGRPVGEAEAMDANWNTSASADNFVLLYGNFQNYVIADRIGMTVEFIPHL 505 (543) T ss_pred HHHHHHhhcCCCceeccCcCC-CCCccc--cceeeEEeccccccccccccCCcceEEEeeccceeEEeecccEEEEeccc Confidence 456677888888876321100 001111 0111111111111 111 111 123566666665 Q ss_pred C Q lcl|NC_020866. 297 A 297 (297) Q Consensus 297 a 297 (297) . 
T Consensus 506 ~ 506 (543) T protein:vir:81 506 F 506 (543) T ss_pred c Confidence 5 No 32 >protein:vir:4226 Length: 326 # NCBI annotation: observed 35.2Kd protein # Family: family:all:507 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039681;swissprot:sw:q05223;genbank:gi:9625447;uniprot:Q05223;genbank:GeneID:2942929 Probab=77.54 E-value=0.12 Score=25.56 Aligned_cols=228 Identities=15% Similarity=0.192 Sum_probs=97.4 Q ss_pred CCcCH-HHH--------HHHHHH------------HHHHHHHHHhhcchhhcceeeeecCCccceecccccCCCcchhcc Q lcl|NC_020866. 1 MQVTA-ANL--------DALRVG------------FKTSFQGALDQAPSQYLRLTTVVPSSTKEQRYGWMGKIPNVREWI 59 (297) Q Consensus 1 M~i~~-~~l--------~~l~~~------------~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~lrew~ 59 (297) |.+|+ ... +++.++ .+..+....+.. .-.++++++|.......|..+..-|.. .|+ T Consensus 1 ~~~~~~r~~~~~~~~e~~a~~~~~~~~g~~ip~~~~~~ii~~~~~~s--~i~~~~~~~~~~~~~~~~p~~~~~~~a-~~v 77 (326) T protein:vir:42 1 MAVNPDRTTPFLGVNDPKVAQTGDSMFEGYLEPEQAQDYFAEAEKIS--IVQQFAQKIPMGTTGQKIPHWTGDVSA-SWI 77 (326) T ss_pred CCCCccchhhhcCcchhhheeccccCCcceechhhHHHHHHHHHhcc--hhhhhcceeeccCCceEEEEEeCCcce-EEe Confidence 55554 111 222111 012222222222 234556777755444555555555554 466 Q ss_pred cce---eeeeeccccceeeeecccceeeccHHHhhccCcchhHHHHHHHHHHHHhhHHHHHHHHHhcccCccccCccccc Q lcl|NC_020866. 60 GPR---AIQNLTESDYSIREKPWELTIGVDRDDIETDNLGIYSPLFQEMGRSAGSKWDMLVFELLKLGFATECYDGQNFF 136 (297) Q Consensus 60 Ge~---~~~~l~~~~~~i~n~~fe~tv~v~R~~i~dD~lG~~~~~~~~~g~aaa~~~~~lv~~lL~~G~~~~~~DGk~~F 136 (297) ||- .-.+..=..-++..++++..+.|+|+.+.+=...+..-+.++++++.++..++.++ .|..+ |++.. 
T Consensus 78 ~Eg~~~~~~~~~f~~i~~~~~k~~~~v~iS~ell~~s~~~~~~~i~~~l~~a~~~~~d~a~l----~G~gs----~~p~g 149 (326) T protein:vir:42 78 GEGDMKPITKGNMTSQTIAPHKIATIFVASAETVRANPANYLGTMRTKVATAFAMAFDNAAI----NGTDS----PFPTF 149 (326) T ss_pred cCCccccccccceeEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHhh----cccCC----Ccccc Confidence 543 33344445567888999999999999999878899999999999999999988664 22221 22111 Q ss_pred cccccccccccc--ceeeccccccchhHHHHHHHHHHHHHHHhhcccccchhhhccccccccccccceeeeccccccccc Q lcl|NC_020866. 137 DTDHPVLDEDGK--TVTVSNTGGGTGTPWFLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLYGTDARANVGF 214 (297) Q Consensus 137 ~tdH~~~~g~~~--~~svsn~~ag~~~awylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~g~d~r~~~G~ 214 (297) ....... ....+.........++-.+.... ....... .+.+ T Consensus 150 -----i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~----------------------------~~~~--- 192 (326) T protein:vir:42 150 -----LAQTTKEVSLVDPDGTGSNADLTVYDAVAVNA-LSLLVNA----------------------------GKKW--- 192 (326) T ss_pred -----ccccccccceeecccccccccchhHHHHHHHH-Hhhhhhh----------------------------ccCc--- Confidence 1111100 00000000000001111100000 0000000 0000 Q ss_pred cccchhhcCCccCCHHHHHHHHHHHHhhccCCCccccccc------------CeEEecchHHHHHHHHHhhhccCCCCcc Q lcl|NC_020866. 215 GFWQMAYGSKQTLDGTAYAAARAALSGMKGDYGRPLGLMP------------NLLVVPPALEEAGRKILNSENASGGETN 282 (297) Q Consensus 215 ~l~q~a~~~~~~l~~~~l~aar~aM~~~k~~~G~~L~i~P------------~~LvVp~~le~~A~~ll~~~~~~~g~~N 282 (297) ..|-+ + .+.+.++++.||-.|++|-... +++-+|... ++..+.+..= T Consensus 193 a~~v~--------n----~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~~~~~~l~G~pv~~---------~~~~~~~~~~ 251 (326) T protein:vir:42 193 THTLL--------D----DITEPILNGAKDKSGRPLFIESTYTEENSPFRLGRIVARPTIL---------SDHVASGTVV 251 (326) T ss_pred cEEEE--------e----HHHHHHHHHhhccCCceeeccccccCccccccCceeeeeeEEE---------cCCCCCCceE Confidence 00111 1 2344567778888888763211 122222211 1222222211 Q ss_pred --------ee---cceeeEEecc--ccC Q lcl|NC_020866. 
283 --------PW---KGTAELLVVP--WLA 297 (297) Q Consensus 283 --------~~---~~~~~~iv~p--~La 297 (297) -+ ++-+++-++- .+. T Consensus 252 ~~~Gd~s~~~~~~~~~~~v~~~~e~~~~ 279 (326) T protein:vir:42 252 GYQGDFRQLVWGQVGGLSFDVTDQATLN 279 (326) T ss_pred EEEeecceEEEEEecceEEEEeecceee Confidence 11 1112222222 221 No 33 >protein:vir:4092 Length: 390 # NCBI annotation: major capsid protein a # Family: family:all:635 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510986;swissprot:trembl:q8w604;genbank:gi:17488508;uniprot:Q8W604;genbank:GeneID:1260361 Probab=75.30 E-value=0.15 Score=25.13 Aligned_cols=236 Identities=9% Similarity=-0.006 Sum_probs=101.5 Q ss_pred CCcCHHHHHHHHH----------------HHHHHHHHHHhhcchhhcceeeeecCCccceecccccCCCcchhcccc--- Q lcl|NC_020866. 1 MQVTAANLDALRV----------------GFKTSFQGALDQAPSQYLRLTTVVPSSTKEQRYGWMGKIPNVREWIGP--- 61 (297) Q Consensus 1 M~i~~~~l~~l~~----------------~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~lrew~Ge--- 61 (297) +.+++..-+.+.. -+...+.+.+... +.-...|+.+|.......+.+...-|.. .|++| T Consensus 70 ~~l~~~~r~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~~~~~-s~i~~~~~~~~~~~~~~~i~~~~~~~~a-~~~~E~~~ 147 (390) T protein:vir:40 70 NALTSDESKYYNEVIAGNGFAGVTALLPPTVFERVFEDLTVE-HPLLSKINFVNTTATTEWIISVGDVATA-WWGPLCAE 147 (390) T ss_pred hhccHHHHHHHHHHHhccCcccCcccccHHHHHHHHHHHHhh-hhhhhhceeeecCCceeEEEEEcCCcce-eeeccccc Confidence 2233221111100 0111111111111 1223456777766655666666665553 66654 Q ss_pred ee-eeeeccccceeeeecccceeeccHHHhhccCcchhHHHHHHHHHHHHhhHHHHHHHHHhcccCccccCcccc--ccc Q lcl|NC_020866. 62 RA-IQNLTESDYSIREKPWELTIGVDRDDIETDNLGIYSPLFQEMGRSAGSKWDMLVFELLKLGFATECYDGQNF--FDT 138 (297) Q Consensus 62 ~~-~~~l~~~~~~i~n~~fe~tv~v~R~~i~dD~lG~~~~~~~~~g~aaa~~~~~lv~~lL~~G~~~~~~DGk~~--F~t 138 (297) +. ..+..=..-++..+++...+.|+++.++|-..++..-+.+.++++.+...++.++. |... |+|. +.. 
T Consensus 148 ~~~~~~~~f~~i~l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~~i~~~~~~a~l~----G~G~----~~P~Gil~~ 219 (390) T protein:vir:40 148 IKEVLDNGFDKIQTGMYKLSAYIPVCNAMLDLGPSWLDQYVRTILGEAMALGLEAGIVN----GSGK----DQPIGMMRD 219 (390) T ss_pred cCccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhc----ccCC----Cccceeeec Confidence 22 22333355677889999999999999999999999999999999999999974442 3221 2221 111 Q ss_pred ccccccccccceeeccccccchhHHHH--HHHHHHHHHHHhhcccccchhhhccccccccccccceeeeccccccccccc Q lcl|NC_020866. 139 DHPVLDEDGKTVTVSNTGGGTGTPWFL--LDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLYGTDARANVGFGF 216 (297) Q Consensus 139 dH~~~~g~~~~~svsn~~ag~~~awyl--ld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~g~d~r~~~G~~l 216 (297) .-.+-.+ . .....+. ...+. .+.-..++... ........+.+. T Consensus 220 ~~~~~~~--~---~~~~~~~--~~t~~~~~~~~~~l~~~~----------------------------~~~~~~~~~~a~ 264 (390) T protein:vir:40 220 LNNVTAG--E---HPVKTAT--PLTDLTPATLATKVMLPL----------------------------TDNGKKSVSDAI 264 (390) T ss_pred ccccccc--c---ccccccc--ccchhhHHHHHHHHHHHh----------------------------hcchhhhhcCce Confidence 0000000 0 0000000 00000 00001111100 000010111111 Q ss_pred cchhhcCCccCCHHHHHHHHHHHHhhccCCCcccc---cccCeEEecchHHHHHHHHHhhhccCCCCccee----cceee Q lcl|NC_020866. 217 WQMAYGSKQTLDGTAYAAARAALSGMKGDYGRPLG---LMPNLLVVPPALEEAGRKILNSENASGGETNPW----KGTAE 289 (297) Q Consensus 217 ~q~a~~~~~~l~~~~l~aar~aM~~~k~~~G~~L~---i~P~~LvVp~~le~~A~~ll~~~~~~~g~~N~~----~~~~~ 289 (297) |- ++..++..-..+++.+++.+|+++. +.+.-+|+.+..- +-.+ --|+-+-| ++-++ T Consensus 265 ~i--------~n~~t~~~~l~~~~~~~d~~G~~v~~~~~~g~pvv~~~~~p--~~~i------~~Gd~s~~~i~~~~~~~ 328 (390) T protein:vir:40 265 LV--------INPADYWSKIYAATSYMTPQGVWVTGILPVPLEIVQSVAVP--VGKA------VAGRAKDYFMGIGSEQV 328 (390) T ss_pred EE--------EcchhHHHHHHHHhhccCCCCccccccCCCceeEEEcCCCC--CCcE------EEEeeceEEEEeecceE Confidence 21 1222223334567788888888743 1122222222110 0000 01211111 34456 Q ss_pred EEeccccC Q lcl|NC_020866. 290 LLVVPWLA 297 (297) Q Consensus 290 ~iv~p~La 297 (297) +-+++... 
T Consensus 329 v~~~~~~~ 336 (390) T protein:vir:40 329 IRTSTEYR 336 (390) T ss_pred EEecchhh Confidence 66666554 No 34 >protein:vir:99749 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004307;genbank:gi:122891761;genbank:GeneID:4712304 Probab=74.63 E-value=0.16 Score=25.01 Aligned_cols=223 Identities=14% Similarity=0.135 Sum_probs=95.9 Q ss_pred CCcC---HHHHHH-------------------------HHHHHHHHHHHHHhhcchhhcceeeeecCCccceecccccCC Q lcl|NC_020866. 1 MQVT---AANLDA-------------------------LRVGFKTSFQGALDQAPSQYLRLTTVVPSSTKEQRYGWMGKI 52 (297) Q Consensus 1 M~i~---~~~l~~-------------------------l~~~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~ 52 (297) |.=+ +.+++. |-..+...+.+...+. +...++|+.+|.......|..+..- T Consensus 1 ~~k~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~lip~~~~~~ii~~~~~~-s~l~~~~~~~~~~~~~~~~p~~~~~ 79 (324) T protein:vir:99 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMEN-SKIMRLGKYEPMEGTEKKFTFWADK 79 (324) T ss_pred CCCchHhhHHHHHHHHHhhhhhhccccceeccCCCcceechhHHHHHHHHHHhh-chhhhhcceeeccCCceEEEEEecC Confidence 1111 111110 0111112222222222 2245567777755555566666555 Q ss_pred Ccchhcccce---eeeeeccccceeeeecccceeeccHHHhhccCcchhHHHHHHHHHHHHhhHHHHHHHHHhcccCccc Q lcl|NC_020866. 53 PNVREWIGPR---AIQNLTESDYSIREKPWELTIGVDRDDIETDNLGIYSPLFQEMGRSAGSKWDMLVFELLKLGFATEC 129 (297) Q Consensus 53 P~lrew~Ge~---~~~~l~~~~~~i~n~~fe~tv~v~R~~i~dD~lG~~~~~~~~~g~aaa~~~~~lv~~lL~~G~~~~~ 129 (297) |.. +|++|- ...+..-..-++..++++..+.|+|+.+.|-...+..-+.+.++++.++..++.++. |.+.. 
T Consensus 80 ~~a-~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~ai~~~~d~~~l~----G~g~~- 153 (324) T protein:vir:99 80 PGA-YWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGIL----NQGNN- 153 (324) T ss_pred cce-eEeccCccccccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhh----cCCCC- Confidence 553 787553 334444455667889999999999999999889999999999999999999886642 21110 Q ss_pred cCcccccccccccccccccceeeccccccchhHHH-HHHHHHHHHHHHhhcccccchhhhccccccccccccceeeeccc Q lcl|NC_020866. 130 YDGQNFFDTDHPVLDEDGKTVTVSNTGGGTGTPWF-LLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLYGTDA 208 (297) Q Consensus 130 ~DGk~~F~tdH~~~~g~~~~~svsn~~ag~~~awy-lld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~g~d~ 208 (297) ..+..+..+. .. ......+. ..|- +.+.--.+++ .. T Consensus 154 ~~~~~~~~~~-------~~---~~~~~~~~-~~~~~i~~~~~~l~~--------------------------------~~ 190 (324) T protein:vir:99 154 PFGKSIAQSI-------EK---TNKVIKGD-FTQDNIIDLEALLED--------------------------------DE 190 (324) T ss_pred ccCccccccc-------cc---cceecccc-CCHHHHHHHHHhhhh--------------------------------cc Confidence 1111111110 00 00000110 0000 0000000000 00 Q ss_pred cccccccccchhhcCCccCCHHHHHHHHHHHHhhccCCCcccc--cccCeEE-ecchHHHHHHHHHhhhccCCCCc---- Q lcl|NC_020866. 209 RANVGFGFWQMAYGSKQTLDGTAYAAARAALSGMKGDYGRPLG--LMPNLLV-VPPALEEAGRKILNSENASGGET---- 281 (297) Q Consensus 209 r~~~G~~l~q~a~~~~~~l~~~~l~aar~aM~~~k~~~G~~L~--i~P~~Lv-Vp~~le~~A~~ll~~~~~~~g~~---- 281 (297) +..+ .|- ++ ...+..+++.||..|+++- -.|..|. +|. +.+...+.+.. T Consensus 191 ~~~~---~~v--------~n----~~~~~~L~~l~d~~g~~~~~~~~~~~l~G~PV---------v~~~~~~~~~~~~i~ 246 (324) T protein:vir:99 191 LEAN---AFI--------SK----TQNRSLLRKIVDPETKERIYDRNSDTLDGLPV---------VNLKSSNLKRGELIT 246 (324) T ss_pred CCCC---EEE--------Ec----HHHHHHHHHhhcCCCceeecCCCCccccceeE---------EeecCCCCCcceEEE Confidence 0000 021 11 2234466778999998752 2222221 221 11111111111 Q ss_pred ----ceec---cee--eEEeccccC Q lcl|NC_020866. 
282 ----NPWK---GTA--ELLVVPWLA 297 (297) Q Consensus 282 ----N~~~---~~~--~~iv~p~La 297 (297) +-+. +-+ ++.-++.+. T Consensus 247 gd~~~~~~~~~~~~~i~~~~~~~~~ 271 (324) T protein:vir:99 247 GDFDKLIYGIPQLIEYKIDETAQLS 271 (324) T ss_pred EecccEEEEEecCcEEEEeeccccc Confidence 1111 111 222222222 No 35 >protein:vir:100247 Length: 425 # NCBI annotation: gp76 # Family: family:all:21 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355412;genbank:gi:77864702;genbank:GeneID:3725969 Probab=73.65 E-value=0.17 Score=24.84 Aligned_cols=238 Identities=15% Similarity=0.137 Sum_probs=97.3 Q ss_pred CCcCHHHHHHHHHH------------HHHHHHHHHhhcchhhcceeeeecCCccceecccccCCCcchhcccceeeee-- Q lcl|NC_020866. 1 MQVTAANLDALRVG------------FKTSFQGALDQAPSQYLRLTTVVPSSTKEQRYGWMGKIPNVREWIGPRAIQN-- 66 (297) Q Consensus 1 M~i~~~~l~~l~~~------------~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~lrew~Ge~~~~~-- 66 (297) ++-+.+.-+++.++ +...+.+..... +...++|+.+|.++....|......|.. .|+||-.-.. T Consensus 120 ~l~~~e~~~al~~~t~~~gG~lvP~~~~~~ii~~~~~~-s~l~~l~~~~~~~~~~~~~~~~~~~~~a-~wv~E~~~~~~~ 197 (425) T protein:vir:10 120 HVKRGDVQAALNKGEDSEGGYLTPIEWDRTITNKLVLI-SPMRQLCRVQPVSKAGFSKLFNMGGTTS-GWVGEASQRPQT 197 (425) T ss_pred HhhhhhhHHHhhcCcCCCCceeccHhHHHHHHHHHHhh-hhhhhhceeeeccCCceEEEEEcCCcce-eeeccccccccc Confidence 11111111111110 112222222221 2334457777766566666665555655 7887754321 Q ss_pred -e-ccccceeeeecccceeeccHHHhhccCcchhHHHHHHHHHHHHhhHHHHHHHHHhcccCccccCccc--cccc-ccc Q lcl|NC_020866. 67 -L-TESDYSIREKPWELTIGVDRDDIETDNLGIYSPLFQEMGRSAGSKWDMLVFELLKLGFATECYDGQN--FFDT-DHP 141 (297) Q Consensus 67 -l-~~~~~~i~n~~fe~tv~v~R~~i~dD~lG~~~~~~~~~g~aaa~~~~~lv~~lL~~G~~~~~~DGk~--~F~t-dH~ 141 (297) . .-..-++..++++..+.|+++.+.|-...+..-+.+.++++.++..+..++ +|..+ |+| ++.. ... 
T Consensus 198 ~~~~f~~v~~~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~ai~~~~d~~~l----~G~G~----~~p~Gil~~~~~~ 269 (425) T protein:vir:10 198 NAATFQPLSFASGEIYANPAATQQILDDAEIDLESWLATEVQTEFAKQEGKAFL----AGDGT----NKPNGLLTYIAGG 269 (425) T ss_pred cccccceeeeeheeeEeehHhHHHHHhcchhHHHHHHHHHHHHHHHHHHHhhhh----cccCC----CCcceeeeccccc Confidence 1 224457888999999999999999888999999999999999999887433 23221 111 1110 000 Q ss_pred cccccccceee--ccccccchhHH-HHHHHHHHHHHHHhhcccccchhhhccccccccccccceeeeccccccccccccc Q lcl|NC_020866. 142 VLDEDGKTVTV--SNTGGGTGTPW-FLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLYGTDARANVGFGFWQ 218 (297) Q Consensus 142 ~~~g~~~~~sv--sn~~ag~~~aw-ylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~g~d~r~~~G~~l~q 218 (297) ........... ..........| -+.+.-..+.|. .+.+ +.|- T Consensus 270 ~~~~~~~~~~~~~~~~~~~~~~~~d~l~~l~~~l~~~--------------------------------~~~~---a~~v 314 (425) T protein:vir:10 270 ANAAKHPFGAIEVVNSGAAADITSDGIIDLVYDLPSA--------------------------------FTGN---ARFA 314 (425) T ss_pred cccccccccccccccccccccccHHHHHHHHhhhhhh--------------------------------hccC---CEEE Confidence 00000000000 00000000000 011111111110 0111 1121 Q ss_pred hhhcCCccCCHHHHHHHHHHHHhhccCCCcccccccCeEEecchHHHHHHHHHhhhccCC-C-Ccce-e----------- Q lcl|NC_020866. 219 MAYGSKQTLDGTAYAAARAALSGMKGDYGRPLGLMPNLLVVPPALEEAGRKILNSENASG-G-ETNP-W----------- 284 (297) Q Consensus 219 ~a~~~~~~l~~~~l~aar~aM~~~k~~~G~~L~i~P~~LvVp~~le~~A~~ll~~~~~~~-g-~~N~-~----------- 284 (297) + + ...+.++++.||-+|+||- .|+.---.+. .-..+-++-++..+. + +.++ + T Consensus 315 m--------n----~~~~~~L~~lkD~~G~~l~-~~~~~~g~~~-~l~G~PV~~~~~~p~~~~~~~~i~~Gd~~~~~~i~ 380 (425) T protein:vir:10 315 M--------N----RNTQRQVRKLKDGQGNYLW-QPSYVAGQPA-TLAGYPVTEVPDMPDVAANSTPILFGDFQQTYLII 380 (425) T ss_pred E--------c----hHHHHHHHHhhcCCCceee-ccCccCCCCc-eecceeeEEecCcCCccCCccEEEEEehhccEEEE Confidence 1 1 1234556677888888762 1110000000 000011111111121 1 1111 1 Q ss_pred -cceeeEEeccccC Q lcl|NC_020866. 285 -KGTAELLVVPWLA 297 (297) Q Consensus 285 -~~~~~~iv~p~La 297 (297) +..+++..+|+-. 
T Consensus 381 ~~~~~~v~~d~~~~ 394 (425) T protein:vir:10 381 DRIGVRVLRDPYTA 394 (425) T ss_pred EecceEEEeccccc Confidence 1224444455433 No 36 >protein:vir:93742 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240459;genbank:gi:66396126;genbank:GeneID:5133511 Probab=69.76 E-value=0.22 Score=24.21 Aligned_cols=193 Identities=11% Similarity=0.027 Sum_probs=102.1 Q ss_pred CCcCHHHHHHHHHH--HHHHHHHHHhhcchhhcceeeeecC----CccceecccccCCCcchhcc--cceeeeeeccccc Q lcl|NC_020866. 1 MQVTAANLDALRVG--FKTSFQGALDQAPSQYLRLTTVVPS----STKEQRYGWMGKIPNVREWI--GPRAIQNLTESDY 72 (297) Q Consensus 1 M~i~~~~l~~l~~~--~~~~f~~a~~~a~~~~~~~a~~v~S----~~~~~~y~~Lg~~P~lrew~--Ge~~~~~l~~~~~ 72 (297) |.=+...|..+.+- +...+.+.+... --+..++.+..+ -...-+.......+...++. .+.....+....- T Consensus 1 ma~~~T~~~~~iiPev~~~~v~~~~~~~-~~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~~~~eg~~i~~~~it~~~~ 79 (274) T protein:vir:93 1 MPQGITKTSNQIIPEVLAPMMQAQLEKK-LRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKR 79 (274) T ss_pred CCccceehhheechHHHHHHHHHHHHhh-hhhcccccccccccCCCCCEEEEEeeccCCCcccccCCCccccccccccee Confidence 55444333332221 111111122111 112333322111 01111111112223333332 4556677777788 Q ss_pred eeeeecccceeeccHHHhhccCcchhHHHHHHHHHHHHhhHHHHHHHHHhcccCccccCcccccccccccccccccceee Q lcl|NC_020866. 73 SIREKPWELTIGVDRDDIETDNLGIYSPLFQEMGRSAGSKWDMLVFELLKLGFATECYDGQNFFDTDHPVLDEDGKTVTV 152 (297) Q Consensus 73 ~i~n~~fe~tv~v~R~~i~dD~lG~~~~~~~~~g~aaa~~~~~lv~~lL~~G~~~~~~DGk~~F~tdH~~~~g~~~~~sv 152 (297) +.+.+.++..+.|+..+....--.......++++++.++..+.-++..|..+... 
T Consensus 80 ~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~~d~~~~~~~~~a~~~------------------------- 134 (274) T protein:vir:93 80 EAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLT------------------------- 134 (274) T ss_pred EEEeeeecccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhccccc------------------------- Confidence 8888999999999999988776667888889999999999988887766441000 Q ss_pred ccccccchhHHHHHHHHHHHHHHHhhcccccchhhhccccccccccccceeeeccccccccccccchhhcCCccCCHHHH Q lcl|NC_020866. 153 SNTGGGTGTPWFLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLYGTDARANVGFGFWQMAYGSKQTLDGTAY 232 (297) Q Consensus 153 sn~~ag~~~awylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~g~d~r~~~G~~l~q~a~~~~~~l~~~~l 232 (297) .+..+++.+.+ T Consensus 135 ---------------------------------------------------------------------~~~~~~~~d~i 145 (274) T protein:vir:93 135 ---------------------------------------------------------------------VNADITKLNGL 145 (274) T ss_pred ---------------------------------------------------------------------ccccccCHHHH Confidence 00112334556 Q ss_pred HHHHHHHHhhccCCCcccccccCeEEecchHHHHHHHHHhhhcc--CCCCcc--------eecceeeEEeccccC Q lcl|NC_020866. 233 AAARAALSGMKGDYGRPLGLMPNLLVVPPALEEAGRKILNSENA--SGGETN--------PWKGTAELLVVPWLA 297 (297) Q Consensus 233 ~aar~aM~~~k~~~G~~L~i~P~~LvVp~~le~~A~~ll~~~~~--~~g~~N--------~~~~~~~~iv~p~La 297 (297) ..|++.+... +..+++|+|+|.....-++--.-+.. 
.....+ -+.| +.||+++.+- T Consensus 146 ~dA~~~l~d~--------~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G-~~Vi~s~~~p 211 (274) T protein:vir:93 146 QSAIDKFNDE--------DLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALG-AIIVRTNKLE 211 (274) T ss_pred HHHHHHhhhc--------cCCccEEEeCHHHHHHHHhhhhhcccccccccccceeecccceecC-eeEEEcCCCC Confidence 6666665432 23578999999877665543111111 111112 2334 6899999887 No 37 >protein:vir:80684 Length: 315 # NCBI annotation: gp6 # Family: family:all:966 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285582;genbank:gi:148727088;genbank:GeneID:5247055 Probab=69.44 E-value=0.22 Score=24.16 Aligned_cols=235 Identities=15% Similarity=0.132 Sum_probs=97.6 Q ss_pred CCcCHHHHHH--HHHHHHHHHHHHHhhcchhhcceeeeecCCccceecccccCCCcchhcccceee---eeeccccceee Q lcl|NC_020866. 1 MQVTAANLDA--LRVGFKTSFQGALDQAPSQYLRLTTVVPSSTKEQRYGWMGKIPNVREWIGPRAI---QNLTESDYSIR 75 (297) Q Consensus 1 M~i~~~~l~~--l~~~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~lrew~Ge~~~---~~l~~~~~~i~ 75 (297) |..+...--+ +-.-+...+-+.+.+ .+.-+++|+++|......++..+..-|.- .|+||-.- .+..=..-++. T Consensus 1 Ma~~~~~~gg~~vP~~~~~~ii~~l~~-~s~i~~l~~~i~~~~~~~~ip~~~~~~~a-~wv~Eg~~~~~s~~~f~~v~l~ 78 (315) T protein:vir:80 1 MADDFLSAGKLELPGSMIGAVRDRAID-SGVLAKLSPEQPTIFGPVKGAVFSGVPRA-KIVGEGEVKPSASVDVSAFTAQ 78 (315) T ss_pred CCCCcCCcCceEcchHHHHHHHHHHHh-hchhhhhcceeecCCCceEEEEEeCCcce-EEeeCCccccccccceeeeEee Confidence 6654311000 000111111111211 23356778888866666677776666654 68876433 23233345667 Q ss_pred eecccceeeccHHHhhccCcch----hHHHHHHHHHHHHhhHHHHHHHHHhcccCccccCccccccccccccccccccee Q lcl|NC_020866. 76 EKPWELTIGVDRDDIETDNLGI----YSPLFQEMGRSAGSKWDMLVFELLKLGFATECYDGQNFFDTDHPVLDEDGKTVT 151 (297) Q Consensus 76 n~~fe~tv~v~R~~i~dD~lG~----~~~~~~~~g~aaa~~~~~lv~~lL~~G~~~~~~DGk~~F~tdH~~~~g~~~~~s 151 (297) -++++..+.|+++.+.++.... ..-+.+.++++.++..++.++. |.++. .|+++=.-.+ .... 
T Consensus 79 ~~kl~~~~~iS~ell~~s~~~~~~~l~~~i~~~la~ai~~~~d~a~~~----G~~~~--~~~~~~~~~~-------~~~~ 145 (315) T protein:vir:80 79 PIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFH----GIDPA--TGKAASAVHT-------SLNK 145 (315) T ss_pred eeeEEeeehhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHhhheee----ccCCC--CCcccccccc-------cccc Confidence 7899999999999998776663 3556677788877777754442 22211 1111000000 0001 Q ss_pred eccccccchhHHHHHHHHHHHHHHHhhcccccchhhhccccccccccccceeeeccccccccccccchhhcCCccCCHHH Q lcl|NC_020866. 152 VSNTGGGTGTPWFLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLYGTDARANVGFGFWQMAYGSKQTLDGTA 231 (297) Q Consensus 152 vsn~~ag~~~awylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~g~d~r~~~G~~l~q~a~~~~~~l~~~~ 231 (297) ..+.....+..| .+....+.-+. +.+.+.+.+ |-|. T Consensus 146 ~~~~~~~~~~~~--~d~~~~~~~~~----------------------------~~~~~~~~~---~imn----------- 181 (315) T protein:vir:80 146 TKNIVDATDSAT--ADLVKAVGLIA----------------------------GAGLQVPNG---VALD----------- 181 (315) T ss_pred ccceeeccccch--HHHHHHHHHHh----------------------------hccCccceE---EEEc----------- Confidence 111111111111 01111110000 000111111 2221 Q ss_pred HHHHHHHHHhhccCCCcccccccCeE---E-ecchHHHHHHHHHhhhccCCCC-----cc--eecc-----------eee Q lcl|NC_020866. 232 YAAARAALSGMKGDYGRPLGLMPNLL---V-VPPALEEAGRKILNSENASGGE-----TN--PWKG-----------TAE 289 (297) Q Consensus 232 l~aar~aM~~~k~~~G~~L~i~P~~L---v-Vp~~le~~A~~ll~~~~~~~g~-----~N--~~~~-----------~~~ 289 (297) .+.+.++++.|+.+|++++-.|-+- . .|..| ..+-++.++..+.+. .+ .+.| .++ T Consensus 182 -~~~~~~L~~l~~~~g~~~~g~~~~~~~~~g~~~tl--~G~PV~~~~~~~~~~~~~~~~~~~~~~GDfs~~~~g~~~~~~ 258 (315) T protein:vir:80 182 -PAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNW--RGLNVGASSTVSGAPEMSPASGVKAIVGDFSRVHWGFQRNFP 258 (315) T ss_pred -HHHHHHHHHHhhccCCcccccccccccccCCCcee--cceeeEecCcCCcccccccccccEEEEeecccEEEEEecCee Confidence 3446777888888887765443210 0 01011 112222223332211 11 1112 223 Q ss_pred EEeccccC Q lcl|NC_020866. 
290 LLVVPWLA 297 (297) Q Consensus 290 ~iv~p~La 297 (297) +-+.++-- T Consensus 259 i~i~~~~~ 266 (315) T protein:vir:80 259 IELIEYGD 266 (315) T ss_pred EEEecccc Confidence 33333322 No 38 >protein:vir:97148 Length: 324 # NCBI annotation: ORF010 # Family: family:all:507 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239726;genbank:gi:66394880;genbank:GeneID:5130881 Probab=67.41 E-value=0.25 Score=23.86 Aligned_cols=224 Identities=13% Similarity=0.090 Sum_probs=96.5 Q ss_pred CCcCHHHHHHHH----------------H------------HHHHHHHHHHhhcchhhcceeeeecCCccceecccccCC Q lcl|NC_020866. 1 MQVTAANLDALR----------------V------------GFKTSFQGALDQAPSQYLRLTTVVPSSTKEQRYGWMGKI 52 (297) Q Consensus 1 M~i~~~~l~~l~----------------~------------~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~ 52 (297) |.=++..-..+. + -+...+.+.... .+...++++++|.......|..+..- T Consensus 1 ~~~~~~~~~~~~~f~~~~~~~~~~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~-~s~l~~~~~~~~~~~~~~~ip~~~~~ 79 (324) T protein:vir:97 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVME-NSKIMQLGKYEPMEGTEKKFTFWADK 79 (324) T ss_pred CccchhHHHHHHHHHHhhhhhhhhccccccccCCCcceechhHHHHHHHHHHh-hcchhhhcceeeccCCceEEEEEecC Confidence 222211111110 0 011111111111 12245567777766665666666555 Q ss_pred Ccchhcccce---eeeeeccccceeeeecccceeeccHHHhhccCcchhHHHHHHHHHHHHhhHHHHHHHHHhcccCccc Q lcl|NC_020866. 53 PNVREWIGPR---AIQNLTESDYSIREKPWELTIGVDRDDIETDNLGIYSPLFQEMGRSAGSKWDMLVFELLKLGFATEC 129 (297) Q Consensus 53 P~lrew~Ge~---~~~~l~~~~~~i~n~~fe~tv~v~R~~i~dD~lG~~~~~~~~~g~aaa~~~~~lv~~lL~~G~~~~~ 129 (297) |.. .|+||- ...++.-..-++..++++..+.|+|+.+.|-...+..-+.+.++++.++..++.++. |.... 
T Consensus 80 ~~a-~~v~Eg~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~----G~g~~- 153 (324) T protein:vir:97 80 PGA-YWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGIL----NQGNN- 153 (324) T ss_pred cce-eEeccCccccccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhc----cCCCC- Confidence 654 788654 333334445667889999999999999998888999999999999999998886553 21110 Q ss_pred cCcccccccccccccccccceeeccccccchhHHHHHHHHHHHHHHHhhcccccchhhhccccccccccccceeeecccc Q lcl|NC_020866. 130 YDGQNFFDTDHPVLDEDGKTVTVSNTGGGTGTPWFLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLYGTDAR 209 (297) Q Consensus 130 ~DGk~~F~tdH~~~~g~~~~~svsn~~ag~~~awylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~g~d~r 209 (297) ..+.-++++.+ ..+........|- + +.-++..... ..+ T Consensus 154 ~~~~gi~~~~~-----------~~~~~~~~~~~~~--~----i~~~~~~l~~-------------------------~~~ 191 (324) T protein:vir:97 154 PFGKSIAQSIE-----------KTNKVIKGDFTQD--N----IIDLEALLED-------------------------DEL 191 (324) T ss_pred ccCcccccccc-----------ccceeccccCCHH--H----HHHHHHhhhh-------------------------ccC Confidence 01111111111 0011111111111 0 1001100000 000 Q ss_pred ccccccccchhhcCCccCCHHHHHHHHHHHHhhccCCCccccc--ccCeEE-ecchHHHHHHHHHhhhccCCCCc----- Q lcl|NC_020866. 210 ANVGFGFWQMAYGSKQTLDGTAYAAARAALSGMKGDYGRPLGL--MPNLLV-VPPALEEAGRKILNSENASGGET----- 281 (297) Q Consensus 210 ~~~G~~l~q~a~~~~~~l~~~~l~aar~aM~~~k~~~G~~L~i--~P~~Lv-Vp~~le~~A~~ll~~~~~~~g~~----- 281 (297) ..+ .|- ++ .+.+..+++.||..|+++-. .+..|. .| ++.+...+.+.. T Consensus 192 ~~~---~~v--------~n----~~~~~~L~~lkd~~g~~~~~~~~~~tl~G~P---------V~~~~~~~~~~~~~~~g 247 (324) T protein:vir:97 192 EAN---AFI--------SK----TQNRSLLRKIVDPETKERIYDRNSDTLDGLP---------VVNLKSSNLKRGELITG 247 (324) T ss_pred CCC---EEE--------Ec----HHHHHHHHHhhcCCCceeecCCCCcccccee---------eEeecCCCCCcceEEEE Confidence 001 121 11 22344577889999987532 111111 22 111111111111 Q ss_pred ---ceec---ceeeE--EeccccC Q lcl|NC_020866. 282 ---NPWK---GTAEL--LVVPWLA 297 (297) Q Consensus 282 ---N~~~---~~~~~--iv~p~La 297 (297) +-+. 
+-+++ .-+..+. T Consensus 248 d~~~~~i~~~~~~~i~~~~~~~~~ 271 (324) T protein:vir:97 248 DFDKLIYGIPQLIEYKIDETAQLS 271 (324) T ss_pred ecccEEEEEecCcEEEEeeccccc Confidence 1111 11122 2222222 No 39 >protein:vir:96262 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240311;genbank:gi:66395978;genbank:GeneID:5133339 Probab=67.06 E-value=0.26 Score=23.81 Aligned_cols=185 Identities=15% Similarity=0.093 Sum_probs=98.1 Q ss_pred CCcCHHHHHHHHHHHHHHHHHHHhhcchhhcceeeee------c-CCccceecccccCCCcchhcccceeeeeeccccce Q lcl|NC_020866. 1 MQVTAANLDALRVGFKTSFQGALDQAPSQYLRLTTVV------P-SSTKEQRYGWMGKIPNVREWIGPRAIQNLTESDYS 73 (297) Q Consensus 1 M~i~~~~l~~l~~~~~~~f~~a~~~a~~~~~~~a~~v------~-S~~~~~~y~~Lg~~P~lrew~Ge~~~~~l~~~~~~ 73 (297) =+|.|+.+...... .+.+.+- +..++..- | ++-....|.-+|+.-.+.| -.+....++....-+ T Consensus 10 d~i~Pev~~~~v~~---~~~~~l~-----~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~-g~~i~~~~lt~~~~~ 80 (274) T protein:vir:96 10 NQIVPEVLAPMMQA---ELEKKLR-----FASFAEIDNTLVGQPGDTLTFPAFIYSGDAKVVAE-GEKIPTDILETKKRE 80 (274) T ss_pred heechHHHHHHHHH---HHHhhhh-----ccccceecccccCCCCCEEEeeeecCCCccccccC-CCccchhhcccceeE Confidence 23345555443321 1222221 12222111 1 1111112222343322222 134566777777778 Q ss_pred eeeecccceeeccHHHhhccCcchhHHHHHHHHHHHHhhHHHHHHHHHhcccCccccCcccccccccccccccccceeec Q lcl|NC_020866. 74 IREKPWELTIGVDRDDIETDNLGIYSPLFQEMGRSAGSKWDMLVFELLKLGFATECYDGQNFFDTDHPVLDEDGKTVTVS 153 (297) Q Consensus 74 i~n~~fe~tv~v~R~~i~dD~lG~~~~~~~~~g~aaa~~~~~lv~~lL~~G~~~~~~DGk~~F~tdH~~~~g~~~~~svs 153 (297) .+.+.++..+.|+..+..-.--.....+.+++|.+.++.-|.-++..|+.+... 
+ T Consensus 81 ~~i~~~~~a~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~~~------------------------~- 135 (274) T protein:vir:96 81 AKIRKIAKGTSISDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAKLT------------------------V- 135 (274) T ss_pred EEeeeeecceeehHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccccc------------------------c- Confidence 888888999999987776654456677778888887777777777666542100 0 Q ss_pred cccccchhHHHHHHHHHHHHHHHhhcccccchhhhccccccccccccceeeeccccccccccccchhhcCCccCCHHHHH Q lcl|NC_020866. 154 NTGGGTGTPWFLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLYGTDARANVGFGFWQMAYGSKQTLDGTAYA 233 (297) Q Consensus 154 n~~ag~~~awylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~g~d~r~~~G~~l~q~a~~~~~~l~~~~l~ 233 (297) +..+++.+.+. T Consensus 136 ---------------------------------------------------------------------~~~~~~~d~i~ 146 (274) T protein:vir:96 136 ---------------------------------------------------------------------EADITKLTGLQ 146 (274) T ss_pred ---------------------------------------------------------------------cccccCHHHHH Confidence 01133455566 Q ss_pred HHHHHHHhhccCCCcccccccCeEEecchHHHHHHHHHhhhcc--CCCCcce--------ecceeeEEeccccC Q lcl|NC_020866. 234 AARAALSGMKGDYGRPLGLMPNLLVVPPALEEAGRKILNSENA--SGGETNP--------WKGTAELLVVPWLA 297 (297) Q Consensus 234 aar~aM~~~k~~~G~~L~i~P~~LvVp~~le~~A~~ll~~~~~--~~g~~N~--------~~~~~~~iv~p~La 297 (297) +|++.|.... ..+++|+|+|.....-++-..-+.. 
..+..|+ +.| ++||++..+- T Consensus 147 ~A~~~lgd~~--------~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G-~~Vi~s~~~~ 211 (274) T protein:vir:96 147 TAIDKFNDED--------LEPMVLFISPLDAGKLRGDATTNFTRATELGDDVIVKGAFGEALG-AVIVRSNKLE 211 (274) T ss_pred HHHHHhcccc--------ccccEEEeCHHHHHHHHhhccccccccccccccceeccccceecC-eEEEEeCCCC Confidence 6666664322 2568999999887765553211111 1222233 333 6899988876 No 40 >protein:vir:95898 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240385;genbank:gi:66396054;genbank:GeneID:5133409 Probab=67.06 E-value=0.26 Score=23.81 Aligned_cols=185 Identities=15% Similarity=0.093 Sum_probs=98.1 Q ss_pred CCcCHHHHHHHHHHHHHHHHHHHhhcchhhcceeeee------c-CCccceecccccCCCcchhcccceeeeeeccccce Q lcl|NC_020866. 1 MQVTAANLDALRVGFKTSFQGALDQAPSQYLRLTTVV------P-SSTKEQRYGWMGKIPNVREWIGPRAIQNLTESDYS 73 (297) Q Consensus 1 M~i~~~~l~~l~~~~~~~f~~a~~~a~~~~~~~a~~v------~-S~~~~~~y~~Lg~~P~lrew~Ge~~~~~l~~~~~~ 73 (297) =+|.|+.+...... .+.+.+- +..++..- | ++-....|.-+|+.-.+.| -.+....++....-+ T Consensus 10 d~i~Pev~~~~v~~---~~~~~l~-----~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~-g~~i~~~~lt~~~~~ 80 (274) T protein:vir:95 10 NQIVPEVLAPMMQA---ELEKKLR-----FASFAEIDNTLVGQPGDTLTFPAFIYSGDAKVVAE-GEKIPTDILETKKRE 80 (274) T ss_pred heechHHHHHHHHH---HHHhhhh-----ccccceecccccCCCCCEEEeeeecCCCccccccC-CCccchhhcccceeE Confidence 23345555443321 1222221 12222111 1 1111112222343322222 134566777777778 Q ss_pred eeeecccceeeccHHHhhccCcchhHHHHHHHHHHHHhhHHHHHHHHHhcccCccccCcccccccccccccccccceeec Q lcl|NC_020866. 74 IREKPWELTIGVDRDDIETDNLGIYSPLFQEMGRSAGSKWDMLVFELLKLGFATECYDGQNFFDTDHPVLDEDGKTVTVS 153 (297) Q Consensus 74 i~n~~fe~tv~v~R~~i~dD~lG~~~~~~~~~g~aaa~~~~~lv~~lL~~G~~~~~~DGk~~F~tdH~~~~g~~~~~svs 153 (297) .+.+.++..+.|+..+..-.--.....+.+++|.+.++.-|.-++..|+.+... 
+ T Consensus 81 ~~i~~~~~a~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~~~------------------------~- 135 (274) T protein:vir:95 81 AKIRKIAKGTSISDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAKLT------------------------V- 135 (274) T ss_pred EEeeeeecceeehHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccccc------------------------c- Confidence 888888999999987776654456677778888887777777777666542100 0 Q ss_pred cccccchhHHHHHHHHHHHHHHHhhcccccchhhhccccccccccccceeeeccccccccccccchhhcCCccCCHHHHH Q lcl|NC_020866. 154 NTGGGTGTPWFLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLYGTDARANVGFGFWQMAYGSKQTLDGTAYA 233 (297) Q Consensus 154 n~~ag~~~awylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~g~d~r~~~G~~l~q~a~~~~~~l~~~~l~ 233 (297) +..+++.+.+. T Consensus 136 ---------------------------------------------------------------------~~~~~~~d~i~ 146 (274) T protein:vir:95 136 ---------------------------------------------------------------------EADITKLTGLQ 146 (274) T ss_pred ---------------------------------------------------------------------cccccCHHHHH Confidence 01133455566 Q ss_pred HHHHHHHhhccCCCcccccccCeEEecchHHHHHHHHHhhhcc--CCCCcce--------ecceeeEEeccccC Q lcl|NC_020866. 234 AARAALSGMKGDYGRPLGLMPNLLVVPPALEEAGRKILNSENA--SGGETNP--------WKGTAELLVVPWLA 297 (297) Q Consensus 234 aar~aM~~~k~~~G~~L~i~P~~LvVp~~le~~A~~ll~~~~~--~~g~~N~--------~~~~~~~iv~p~La 297 (297) +|++.|.... ..+++|+|+|.....-++-..-+.. 
..+..|+ +.| ++||++..+- T Consensus 147 ~A~~~lgd~~--------~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G-~~Vi~s~~~~ 211 (274) T protein:vir:95 147 TAIDKFNDED--------LEPMVLFISPLDAGKLRGDATTNFTRATELGDDVIVKGAFGEALG-AVIVRSNKLE 211 (274) T ss_pred HHHHHhcccc--------ccccEEEeCHHHHHHHHhhccccccccccccccceeccccceecC-eEEEEeCCCC Confidence 6666664322 2568999999887765553211111 1222233 333 6899988876 No 41 >protein:vir:103955 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873992;genbank:gi:118430767;genbank:GeneID:4525449 Probab=66.71 E-value=0.26 Score=23.76 Aligned_cols=224 Identities=12% Similarity=0.085 Sum_probs=96.3 Q ss_pred CC---cCHHHHHH-------------------------HHHHHHHHHHHHHhhcchhhcceeeeecCCccceecccccCC Q lcl|NC_020866. 1 MQ---VTAANLDA-------------------------LRVGFKTSFQGALDQAPSQYLRLTTVVPSSTKEQRYGWMGKI 52 (297) Q Consensus 1 M~---i~~~~l~~-------------------------l~~~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~ 52 (297) |. -.+.+++. |-..+...+.+...+. +...++|+.+|..+....|..+..- T Consensus 1 ~~~~~~~~~~~~~f~~~~~~~~~~~a~~~~~~~~~~~liP~~~~~~ii~~~~~~-s~l~~~~~~~~~~~~~~~~p~~~~~ 79 (324) T protein:vir:10 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMEN-SKIMQLGKYEPMEGTEKKFTFWADK 79 (324) T ss_pred CCCchHHHHHHHHHHHHhhccceecccceeccCCCcceechhHHHHHHHHHHhh-chhhhhcceeeccCCceEEEEEeCC Confidence 11 00111110 1111222222222222 2345567777765555666666555 Q ss_pred Ccchhcccce---eeeeeccccceeeeecccceeeccHHHhhccCcchhHHHHHHHHHHHHhhHHHHHHHHHhcccCccc Q lcl|NC_020866. 53 PNVREWIGPR---AIQNLTESDYSIREKPWELTIGVDRDDIETDNLGIYSPLFQEMGRSAGSKWDMLVFELLKLGFATEC 129 (297) Q Consensus 53 P~lrew~Ge~---~~~~l~~~~~~i~n~~fe~tv~v~R~~i~dD~lG~~~~~~~~~g~aaa~~~~~lv~~lL~~G~~~~~ 129 (297) |. -+|+||- ...+..-..-++..++++..+.|+|+.+.|....+..-+.+.++++.++..+..++. |.... 
T Consensus 80 ~~-a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~ai~~~~d~a~l~----G~g~~- 153 (324) T protein:vir:10 80 PG-AYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGIL----NQGNN- 153 (324) T ss_pred cc-eeEeccCccccccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhh----cCCCC- Confidence 55 4788654 333334445667889999999999999999889999999999999999998886542 21110 Q ss_pred cCcccccccccccccccccceeeccccccchhHHHHHHHHHHHHHHHhhcccccchhhhccccccccccccceeeecccc Q lcl|NC_020866. 130 YDGQNFFDTDHPVLDEDGKTVTVSNTGGGTGTPWFLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLYGTDAR 209 (297) Q Consensus 130 ~DGk~~F~tdH~~~~g~~~~~svsn~~ag~~~awylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~g~d~r 209 (297) ..+..++.+.. ..+....... +| ..+.-++..... ..+ T Consensus 154 ~~~~~i~~~~~-----------~~~~~~~~~~-t~-----~~i~~~~~~l~~-------------------------~~~ 191 (324) T protein:vir:10 154 PFGKSIAQSIE-----------KTNKVIKGDF-TQ-----DNIIDLEALLED-------------------------DEL 191 (324) T ss_pred ccCcccccccc-----------ccceeccccC-CH-----HHHHHHHHhhhh-------------------------ccC Confidence 11222221110 0001000000 01 000001000000 000 Q ss_pred ccccccccchhhcCCccCCHHHHHHHHHHHHhhccCCCcccc--cccCeEE-ecchHHHHHHHHHhhhccCCCCc----- Q lcl|NC_020866. 210 ANVGFGFWQMAYGSKQTLDGTAYAAARAALSGMKGDYGRPLG--LMPNLLV-VPPALEEAGRKILNSENASGGET----- 281 (297) Q Consensus 210 ~~~G~~l~q~a~~~~~~l~~~~l~aar~aM~~~k~~~G~~L~--i~P~~Lv-Vp~~le~~A~~ll~~~~~~~g~~----- 281 (297) ..++ |- ++ .+.+..+++.||..|+++- ..|..|. +|. +.....+.+.. T Consensus 192 ~~~~---~v--------~n----~~~~~~L~~l~d~~g~~~~~~~~~~~l~G~PV---------~~~~~~~~~~~~~~~g 247 (324) T protein:vir:10 192 EANA---FI--------SK----TQNRSLLRKIVDPETKERIYDRNSDTLDGLPV---------VNLKSSNLKRGELITG 247 (324) T ss_pred CCCE---EE--------Ec----HHHHHHHHHhhccCCceeecCCCCccccceeE---------EeecCCCCCcceEEEE Confidence 0010 11 11 2234456778999998752 2222211 221 00011111111 Q ss_pred ---ceec---ce--eeEEeccccC Q lcl|NC_020866. 282 ---NPWK---GT--AELLVVPWLA 297 (297) Q Consensus 282 ---N~~~---~~--~~~iv~p~La 297 (297) +.+. 
+- +++.-++.+. T Consensus 248 d~~~~~~~~~~~~~i~~~~~~~~~ 271 (324) T protein:vir:10 248 DFDKLIYGIPQLIEYKIDETAQLS 271 (324) T ss_pred ecccEEEEEecCcEEEEeeccccc Confidence 1111 11 2222222222 No 42 >protein:vir:78523 Length: 338 # NCBI annotation: Putative head structural protein # Family: family:all:507 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491585;genbank:gi:157786408;genbank:GeneID:5625675 Probab=65.77 E-value=0.28 Score=23.63 Aligned_cols=236 Identities=17% Similarity=0.154 Sum_probs=96.6 Q ss_pred CCcCHHHHHHHHH------------------HHHHHHHHHHhhcchhhcceeeeecCCccceecccccCCCcc------- Q lcl|NC_020866. 1 MQVTAANLDALRV------------------GFKTSFQGALDQAPSQYLRLTTVVPSSTKEQRYGWMGKIPNV------- 55 (297) Q Consensus 1 M~i~~~~l~~l~~------------------~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~l------- 55 (297) |. +-..|++..+ .+...+-+-.. ..+...++|.++|-.+....+..+...|.- T Consensus 1 ~~-~~~e~~~~~~~~~~~~~~~~~~~~liP~~~~~~ii~~~~-~~s~l~~l~~~~~~~~~~~~ip~~~~~~~a~~v~~~~ 78 (338) T protein:vir:78 1 MA-TLNELAPNTAGSNHQGRLAHVPSDLLPKEIVGPIFDKAQ-ESSLVLRLGENIPISYGETIIPTTVKRPEVGQVGVGT 78 (338) T ss_pred Cc-chHHhhhhhcccccccceecccccccchHHHHHHHHHHH-hhchhhhhcceeeccCCceEEEEEecCccceeecccc Confidence 11 1111111111 11111111121 123345667666655444444444333322 Q ss_pred hhcccc---eeeeeeccccceeeeecccceeeccHHHhhccCcchhHHHHHHHHHHHHhhHHHHHHHHHhcccCccccCc Q lcl|NC_020866. 56 REWIGP---RAIQNLTESDYSIREKPWELTIGVDRDDIETDNLGIYSPLFQEMGRSAGSKWDMLVFELLKLGFATECYDG 132 (297) Q Consensus 56 rew~Ge---~~~~~l~~~~~~i~n~~fe~tv~v~R~~i~dD~lG~~~~~~~~~g~aaa~~~~~lv~~lL~~G~~~~~~DG 132 (297) -.|+|| ....+..-..-+++.++++..+.|+++.+.|....+..-+.+.++++.++..+..++. 
|..+..-.+ T Consensus 79 ~~~~~Eg~~~~~~~~~f~~v~l~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~a~~~~~d~~~l~----G~g~~~~~~ 154 (338) T protein:vir:78 79 SNEQREGGTKPLSGTAWDTRSVAPIKLATIVTVSEEFARMNPSGLYTKLQADLAYAIGRGIDLAVFH----GKSPLTGSA 154 (338) T ss_pred cccccccccccccccceeEEEEEEEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHhhc----ccCCCcccc Confidence 234444 2333333344567779999999999999999889999999999999999999986552 333211111 Q ss_pred ccccccccccccccccceeeccccccchhHHHHHHHHHHHHHHHhhcccccchhhhccccccccccccceeeeccccccc Q lcl|NC_020866. 133 QNFFDTDHPVLDEDGKTVTVSNTGGGTGTPWFLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLYGTDARANV 212 (297) Q Consensus 133 k~~F~tdH~~~~g~~~~~svsn~~ag~~~awylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~g~d~r~~~ 212 (297) -.-+-+++.....+ ..+....+ ...+| -+....+..++-... .+..+ T Consensus 155 ~~gi~~~~~~~~~~----~~~~~~~~-~~~~~-~~~~~~~~~~~~~~~-----------~~~~~---------------- 201 (338) T protein:vir:78 155 LQGIDTNNVIVNTT----NVDYLQTG-TTPLL-DRFLDGYDLVSANTD-----------VDFNG---------------- 201 (338) T ss_pred cccccccccccccc----cccccccc-chhhH-HHHHHHHHHhhhhcc-----------ccceE---------------- Confidence 00111111111100 00111110 11111 111111111100000 00000 Q ss_pred cccccchhhcCCccCCHHHHHHHHHHHHhhccCCCccccc------ccCeEE-ecchHHHHHHHHHhhhccCC------- Q lcl|NC_020866. 213 GFGFWQMAYGSKQTLDGTAYAAARAALSGMKGDYGRPLGL------MPNLLV-VPPALEEAGRKILNSENASG------- 278 (297) Q Consensus 213 G~~l~q~a~~~~~~l~~~~l~aar~aM~~~k~~~G~~L~i------~P~~Lv-Vp~~le~~A~~ll~~~~~~~------- 278 (297) |- ++ ....+....++..||.+|++|-. .|..|. +|. +-++.+++ T Consensus 202 ----~~--------m~-~~~~~~L~~~~~l~d~~g~~l~~~~~~~~~~~~l~G~PV---------~~~~~ip~~~~~~~~ 259 (338) T protein:vir:78 202 ----WA--------AD-PRYRARLLRSQAYRDANGNVDPTRINLAASAGDLLGLPV---------QFGKAVGGDLGAATD 259 (338) T ss_pred ----EE--------Ec-hHHHHHHHHHhhhccCCCceeecccccCCCCceeeeeeE---------EEccccCccccccCC Confidence 11 11 11233445566778888887631 111111 111 11111111 Q ss_pred -------CC-ccee---cceeeEEeccccC Q lcl|NC_020866. 
279 -------GE-TNPW---KGTAELLVVPWLA 297 (297) Q Consensus 279 -------g~-~N~~---~~~~~~iv~p~La 297 (297) |+ ++.+ ++-+++-+++.-. T Consensus 260 ~~~~~~~gdfs~~~~~~~~~~~i~~~~~~~ 289 (338) T protein:vir:78 260 SKVRVVGGDFSQLKYGFADEIRVKMSDTAT 289 (338) T ss_pred cccEEEEEecceEEEEeecccEEEEeeccc Confidence 11 1111 1224444444444 No 43 >protein:vir:107687 Length: 319 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003898;genbank:gi:45686314;genbank:GeneID:2773027 Probab=63.93 E-value=0.31 Score=23.38 Aligned_cols=206 Identities=12% Similarity=0.108 Sum_probs=97.1 Q ss_pred CCcCHHHHHH-------------------HHHH--HHHHHHHHHhh--cchhhcceee---eecCCccceecccccCCCc Q lcl|NC_020866. 1 MQVTAANLDA-------------------LRVG--FKTSFQGALDQ--APSQYLRLTT---VVPSSTKEQRYGWMGKIPN 54 (297) Q Consensus 1 M~i~~~~l~~-------------------l~~~--~~~~f~~a~~~--a~~~~~~~a~---~v~S~~~~~~y~~Lg~~P~ 54 (297) |--+.+-+.+ +|+. +..+=.+.|+. ++-+|+++-- .++-...+-+|...... . T Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~da~~~~g~~~~~ql~~id~~v~e~~~~~l~~~~~i~v~~~~~~~~~~~~~~~~~~~-G 82 (319) T protein:vir:10 4 KKFDEADKSNVEMYLIQAGVKQDAAATMGIWTAQELHRIKSQSYEEDYPVGSALRVFPVTTELSPTDKTFEYMTFDKV-G 82 (319) T ss_pred cchhHHhhHHHHHHHhhccchhhhhhhhhhHHHHHHHHHHHHHHhhhhcceechhhcccccCCCCceEEEEeeeeccc-c Confidence 2222222221 2221 12222223332 1224555432 22222333344444333 3 Q ss_pred chhcccce----eeeeeccccceeeeecccceeeccHHHhhccC---cchhHHHHHHHHHHHHhhHHHHHHHHHhcccCc Q lcl|NC_020866. 55 VREWIGPR----AIQNLTESDYSIREKPWELTIGVDRDDIETDN---LGIYSPLFQEMGRSAGSKWDMLVFELLKLGFAT 127 (297) Q Consensus 55 lrew~Ge~----~~~~l~~~~~~i~n~~fe~tv~v~R~~i~dD~---lG~~~~~~~~~g~aaa~~~~~lv~~lL~~G~~~ 127 (297) .-+|+|++ ...+..-..++.+...|+..+.+++++++.-. +-+=.+....+.++.++++|+++|- |. 
T Consensus 83 ~a~~~~d~~~dip~v~~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f~----G~-- 156 (319) T protein:vir:10 83 TAQIIADYTDDLPLVDALGTSEFGKVFRLGNAYLISIDEIKAGQATGRPLSTRKASACQLAHDQLVNRLVFK----GS-- 156 (319) T ss_pred ceeeecCccccccceeccceeeEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEe----ec-- Confidence 34576554 22334445677888999999999999999864 3334566667777888888887762 21 Q ss_pred cccCcccccccccccccccccceeeccccccchhHHHHHHHHHHHHHHHhhcccccchhhhccccccccccccceeeecc Q lcl|NC_020866. 128 ECYDGQNFFDTDHPVLDEDGKTVTVSNTGGGTGTPWFLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLYGTD 207 (297) Q Consensus 128 ~~~DGk~~F~tdH~~~~g~~~~~svsn~~ag~~~awylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~g~d 207 (297) +.|++.. -....+... T Consensus 157 ----------~~~g~~G----LlN~p~~~~-------------------------------------------------- 172 (319) T protein:vir:10 157 ----------APHKIVS----VFNHPNITK-------------------------------------------------- 172 (319) T ss_pred ----------cccccee----EEeCCCcee-------------------------------------------------- Confidence 1221110 000000000 Q ss_pred ccccccccccchhhcCCccCC----HHHHHHHHHHHHhhccCCCcccccccCeEEecchHHHHHHHHHhhhccCCCC--- Q lcl|NC_020866. 208 ARANVGFGFWQMAYGSKQTLD----GTAYAAARAALSGMKGDYGRPLGLMPNLLVVPPALEEAGRKILNSENASGGE--- 280 (297) Q Consensus 208 ~r~~~G~~l~q~a~~~~~~l~----~~~l~aar~aM~~~k~~~G~~L~i~P~~LvVp~~le~~A~~ll~~~~~~~g~--- 280 (297) .+.+.+ |. .++.+ .+-+.+++.++..+.. | .+.|+.|++||++...-. +...+.|. T Consensus 173 --~~~~~~-~~-----~~t~t~~~i~~di~~~~~~l~~~s~--g---~~~p~~L~L~p~~~~~L~----~~~~~~~~t~l 235 (319) T protein:vir:10 173 --ITSGKW-ID-----VSTMKPETAEAELTQAIETIETITR--G---QHRATNILIPPSMRKVLA----IRMPETTMSYL 235 (319) T ss_pred --eecCCC-CC-----ccccCHHHHHHHHHHHHHHHHHhcC--c---eeeceEEEecHHHHHhhh----cccCCCCeeHH Confidence 000000 00 01112 2335556666665532 2 257999999999875442 22222221 Q ss_pred -----cceecceeeEEeccccC Q lcl|NC_020866. 281 -----TNPWKGTAELLVVPWLA 297 (297) Q Consensus 281 -----~N~~~~~~~~iv~p~La 297 (297) .|| .+++.-.|+|. 
T Consensus 236 ~~lk~~~~---~l~I~~~pel~ 254 (319) T protein:vir:10 236 DYFKSQNS---GIEIDSIAELE 254 (319) T ss_pred HHHHHhcC---CceEEEeeeec Confidence 133 36677778887 No 44 >protein:vir:105905 Length: 304 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004375;genbank:gi:122891830;genbank:GeneID:4712376 Probab=63.77 E-value=0.31 Score=23.36 Aligned_cols=230 Identities=10% Similarity=0.077 Sum_probs=94.8 Q ss_pred CCcCHHH-HHH---------HHHHHHHHHHHHHhhcchhhcceeeeecCCccceecccccCCCcchhcccceee---eee Q lcl|NC_020866. 1 MQVTAAN-LDA---------LRVGFKTSFQGALDQAPSQYLRLTTVVPSSTKEQRYGWMGKIPNVREWIGPRAI---QNL 67 (297) Q Consensus 1 M~i~~~~-l~~---------l~~~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~lrew~Ge~~~---~~l 67 (297) |....-+ .+. |=.-+...+.+..... ....++++.+|-+.....+..+..-|.. .|++|-.- .+. T Consensus 1 ma~~~~~~~~~~~t~~gg~lip~~~~~~ii~~~~~~-~~l~~~~~~~~~~~~~~~ip~~~~~~~a-~~v~E~~~~~~~~~ 78 (304) T protein:vir:10 1 MATPTYTPGNVILSDFKNGVIPAEQGTLIMKDIMAN-SAIMKLAKNEPMTAQKKKFTYLAKGVGA-YWVSETERIQTSKP 78 (304) T ss_pred CcccccccccccccCCCceecchhHHHHHHHHHHhc-cchhhhcceeeccCCceEEEEEeCCcce-EEeecCcccccccc Confidence 4433211 000 1111222233333322 2245567777755555556666555554 68765432 222 Q ss_pred ccccceeeeecccceeeccHHHhhccCcchhHHHHHHHHHHHHhhHHHHHHHHHhcccCccccCcccccccccccccccc Q lcl|NC_020866. 68 TESDYSIREKPWELTIGVDRDDIETDNLGIYSPLFQEMGRSAGSKWDMLVFELLKLGFATECYDGQNFFDTDHPVLDEDG 147 (297) Q Consensus 68 ~~~~~~i~n~~fe~tv~v~R~~i~dD~lG~~~~~~~~~g~aaa~~~~~lv~~lL~~G~~~~~~DGk~~F~tdH~~~~g~~ 147 (297) .=..-+++.++++..+.|+|+.++|-...+..-+.+.++++.++..++.++. |.... ++.... + .+.. 
T Consensus 79 ~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~ia~~~d~~~l~----G~g~~----~~~~~~--~--~~~~ 146 (304) T protein:vir:10 79 EYAQAEMEAKKIGVIIPLSKEFLKWTAKDFFNEVKPLIAEAFYKAFDQAVIF----GTKSP----YNTSTS--G--KPLV 146 (304) T ss_pred eeeEEEEEEEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhee----ccCCC----cccccc--c--cccc Confidence 2344567789999999999999999888899999999999999988875532 21110 111000 0 0000 Q ss_pred cceeeccccccchhHHHHHHHHHHHHHHHhhcccccchhhhccccccccccccceeeeccccccccccccchhhcCCccC Q lcl|NC_020866. 148 KTVTVSNTGGGTGTPWFLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLYGTDARANVGFGFWQMAYGSKQTL 227 (297) Q Consensus 148 ~~~svsn~~ag~~~awylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~g~d~r~~~G~~l~q~a~~~~~~l 227 (297) ............+..+| .+....+.-+ ... .+.+++ |- + T Consensus 147 ~~~~~~~~~~~~~~~~~-~~i~~~~~~l----~~~-------------------------~~~~~~---~v--------~ 185 (304) T protein:vir:10 147 EGAEEKGNVVTDTNNLY-VDLSALMATI----EDE-------------------------ELDPNG---VL--------T 185 (304) T ss_pred ccccccccccccccchH-HHHHHHHHHh----hhc-------------------------cCCcCE---EE--------E Confidence 00011111111111111 1111111100 000 000000 11 1 Q ss_pred CHHHHHHHHHHHHhhccCCCcccccc-cCeEEecchHHHHHHHHHhhhccCCCC----------cceec---ceeeEEec Q lcl|NC_020866. 228 DGTAYAAARAALSGMKGDYGRPLGLM-PNLLVVPPALEEAGRKILNSENASGGE----------TNPWK---GTAELLVV 293 (297) Q Consensus 228 ~~~~l~aar~aM~~~k~~~G~~L~i~-P~~LvVp~~le~~A~~ll~~~~~~~g~----------~N~~~---~~~~~iv~ 293 (297) + ...+..+++.||-+|+||-.. |..|. -..+..++..+... .+.+. +-+++-++ T Consensus 186 ~----~~~~~~L~~lkd~~G~~l~~~~~~~l~--------G~PV~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~ 253 (304) T protein:vir:10 186 T----RSFRSKMRNALDANDRPLFDANGNEIM--------GLPLSYTGADVYDKKKSLALMGDWDYARYGILQGIEYAIS 253 (304) T ss_pred c----HHHHHHHHHhhccCCcEeecCCCcccc--------ceeeEEecccccCCCCcEEEEEehhhEEEEEecceEEEEe Confidence 1 234455667788888875211 11111 01111111111110 01111 11111111 Q ss_pred cccC Q lcl|NC_020866. 294 PWLA 297 (297) Q Consensus 294 p~La 297 (297) -+-. 
T Consensus 254 ~e~~ 257 (304) T protein:vir:10 254 EDAT 257 (304) T ss_pred ecce Confidence 1111 No 45 >protein:vir:94142 Length: 304 # NCBI annotation: ORF013 # Family: family:all:507 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240234;genbank:gi:66395898;genbank:GeneID:5133311 Probab=63.77 E-value=0.31 Score=23.36 Aligned_cols=230 Identities=10% Similarity=0.077 Sum_probs=94.8 Q ss_pred CCcCHHH-HHH---------HHHHHHHHHHHHHhhcchhhcceeeeecCCccceecccccCCCcchhcccceee---eee Q lcl|NC_020866. 1 MQVTAAN-LDA---------LRVGFKTSFQGALDQAPSQYLRLTTVVPSSTKEQRYGWMGKIPNVREWIGPRAI---QNL 67 (297) Q Consensus 1 M~i~~~~-l~~---------l~~~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~lrew~Ge~~~---~~l 67 (297) |....-+ .+. |=.-+...+.+..... ....++++.+|-+.....+..+..-|.. .|++|-.- .+. T Consensus 1 ma~~~~~~~~~~~t~~gg~lip~~~~~~ii~~~~~~-~~l~~~~~~~~~~~~~~~ip~~~~~~~a-~~v~E~~~~~~~~~ 78 (304) T protein:vir:94 1 MATPTYTPGNVILSDFKNGVIPAEQGTLIMKDIMAN-SAIMKLAKNEPMTAQKKKFTYLAKGVGA-YWVSETERIQTSKP 78 (304) T ss_pred CcccccccccccccCCCceecchhHHHHHHHHHHhc-cchhhhcceeeccCCceEEEEEeCCcce-EEeecCcccccccc Confidence 4433211 000 1111222233333322 2245567777755555556666555554 68765432 222 Q ss_pred ccccceeeeecccceeeccHHHhhccCcchhHHHHHHHHHHHHhhHHHHHHHHHhcccCccccCcccccccccccccccc Q lcl|NC_020866. 68 TESDYSIREKPWELTIGVDRDDIETDNLGIYSPLFQEMGRSAGSKWDMLVFELLKLGFATECYDGQNFFDTDHPVLDEDG 147 (297) Q Consensus 68 ~~~~~~i~n~~fe~tv~v~R~~i~dD~lG~~~~~~~~~g~aaa~~~~~lv~~lL~~G~~~~~~DGk~~F~tdH~~~~g~~ 147 (297) .=..-+++.++++..+.|+|+.++|-...+..-+.+.++++.++..++.++. |.... ++.... + .+.. 
T Consensus 79 ~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~ia~~~d~~~l~----G~g~~----~~~~~~--~--~~~~ 146 (304) T protein:vir:94 79 EYAQAEMEAKKIGVIIPLSKEFLKWTAKDFFNEVKPLIAEAFYKAFDQAVIF----GTKSP----YNTSTS--G--KPLV 146 (304) T ss_pred eeeEEEEEEEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhee----ccCCC----cccccc--c--cccc Confidence 2344567789999999999999999888899999999999999988875532 21110 111000 0 0000 Q ss_pred cceeeccccccchhHHHHHHHHHHHHHHHhhcccccchhhhccccccccccccceeeeccccccccccccchhhcCCccC Q lcl|NC_020866. 148 KTVTVSNTGGGTGTPWFLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLYGTDARANVGFGFWQMAYGSKQTL 227 (297) Q Consensus 148 ~~~svsn~~ag~~~awylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~g~d~r~~~G~~l~q~a~~~~~~l 227 (297) ............+..+| .+....+.-+ ... .+.+++ |- + T Consensus 147 ~~~~~~~~~~~~~~~~~-~~i~~~~~~l----~~~-------------------------~~~~~~---~v--------~ 185 (304) T protein:vir:94 147 EGAEEKGNVVTDTNNLY-VDLSALMATI----EDE-------------------------ELDPNG---VL--------T 185 (304) T ss_pred ccccccccccccccchH-HHHHHHHHHh----hhc-------------------------cCCcCE---EE--------E Confidence 00011111111111111 1111111100 000 000000 11 1 Q ss_pred CHHHHHHHHHHHHhhccCCCcccccc-cCeEEecchHHHHHHHHHhhhccCCCC----------cceec---ceeeEEec Q lcl|NC_020866. 228 DGTAYAAARAALSGMKGDYGRPLGLM-PNLLVVPPALEEAGRKILNSENASGGE----------TNPWK---GTAELLVV 293 (297) Q Consensus 228 ~~~~l~aar~aM~~~k~~~G~~L~i~-P~~LvVp~~le~~A~~ll~~~~~~~g~----------~N~~~---~~~~~iv~ 293 (297) + ...+..+++.||-+|+||-.. |..|. -..+..++..+... .+.+. +-+++-++ T Consensus 186 ~----~~~~~~L~~lkd~~G~~l~~~~~~~l~--------G~PV~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~ 253 (304) T protein:vir:94 186 T----RSFRSKMRNALDANDRPLFDANGNEIM--------GLPLSYTGADVYDKKKSLALMGDWDYARYGILQGIEYAIS 253 (304) T ss_pred c----HHHHHHHHHhhccCCcEeecCCCcccc--------ceeeEEecccccCCCCcEEEEEehhhEEEEEecceEEEEe Confidence 1 234455667788888875211 11111 01111111111110 01111 11111111 Q ss_pred cccC Q lcl|NC_020866. 294 PWLA 297 (297) Q Consensus 294 p~La 297 (297) -+-. 
T Consensus 254 ~e~~ 257 (304) T protein:vir:94 254 EDAT 257 (304) T ss_pred ecce Confidence 1111 No 46 >protein:vir:79642 Length: 329 # NCBI annotation: HsbB # Family: family:all:463 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285525;genbank:gi:148734508;genbank:GeneID:5220000 Probab=63.67 E-value=0.31 Score=23.35 Aligned_cols=211 Identities=12% Similarity=0.100 Sum_probs=97.2 Q ss_pred CCcCHHHHHHHHHH--HHHHHHHHHhh--cchhhccee---eeecCCccceecccccCCCcchhccccee----eeeecc Q lcl|NC_020866. 1 MQVTAANLDALRVG--FKTSFQGALDQ--APSQYLRLT---TVVPSSTKEQRYGWMGKIPNVREWIGPRA----IQNLTE 69 (297) Q Consensus 1 M~i~~~~l~~l~~~--~~~~f~~a~~~--a~~~~~~~a---~~v~S~~~~~~y~~Lg~~P~lrew~Ge~~----~~~l~~ 69 (297) |..+....-+ |+. +..+=+.-|+. ++-+|+++- +.++-...+-+|...... ..-+|+|++. ..+..- T Consensus 29 ~~~~~~~~~~-f~~~ql~~id~~v~e~~~~~l~~~~~i~i~~~~~~~~~~~t~~~~~~~-G~a~~~~d~~~dip~vd~~~ 106 (329) T protein:vir:79 29 AKNDASDMGI-WTSQELHKIKAQAYEKEYPAGSALRVFPVTSELSDTDKTFEYQTFDKV-GHAKIIADYTDDLSTVDALM 106 (329) T ss_pred ceeccchhhH-HHHHHHHHHHHHHHhhhhcccchhhhcccccCCCCceeEEEeeeeecc-eeeeeecCcccccceeeccc Confidence 2222111111 221 12222223331 234455443 223333334455554333 3346776532 234444 Q ss_pred ccceeeeecccceeeccHHHhhccC---cchhHHHHHHHHHHHHhhHHHHHHHHHhcccCccccCccccccccccccccc Q lcl|NC_020866. 70 SDYSIREKPWELTIGVDRDDIETDN---LGIYSPLFQEMGRSAGSKWDMLVFELLKLGFATECYDGQNFFDTDHPVLDED 146 (297) Q Consensus 70 ~~~~i~n~~fe~tv~v~R~~i~dD~---lG~~~~~~~~~g~aaa~~~~~lv~~lL~~G~~~~~~DGk~~F~tdH~~~~g~ 146 (297) ..++.+...|+..+.++.++++--. +-+=.+..+.+.++.++++|+++| .|. +.|++.. 
T Consensus 107 ~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f----~G~------------~~~g~~G-- 168 (329) T protein:vir:79 107 TSEFGKVFRLGNAFLISIDEIKAGQRTGKSLSTRKANAAQNAHDQLVNHLVF----KGS------------KPHKIIS-- 168 (329) T ss_pred ceeEEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEE----eec------------cccccee-- Confidence 5667888999999999999998763 333345555566666667776665 221 1222110 Q ss_pred ccceeeccccccchhHHHHHHHHHHHHHHHhhcccccchhhhccccccccccccceeeeccccccccccccchhhcCCcc Q lcl|NC_020866. 147 GKTVTVSNTGGGTGTPWFLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLYGTDARANVGFGFWQMAYGSKQT 226 (297) Q Consensus 147 ~~~~svsn~~ag~~~awylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~g~d~r~~~G~~l~q~a~~~~~~ 226 (297) -....++.. . .....+..-|.- + +.. T Consensus 169 --LlN~p~v~~------------------------------~-------------------~~~~~~~~~w~~--k-t~~ 194 (329) T protein:vir:79 169 --VFEHPNLTT------------------------------I-------------------NSAGWNNAAGTG--K-KPE 194 (329) T ss_pred --eecCCCccc------------------------------c-------------------ccCCCCCccccc--c-CHH Confidence 000000000 0 000000111210 0 001 Q ss_pred CCHHHHHHHHHHHHhhccCCCcccccccCeEEecchHHHHHHHHHhhhccCCCC--------cceecceeeEEeccccC Q lcl|NC_020866. 227 LDGTAYAAARAALSGMKGDYGRPLGLMPNLLVVPPALEEAGRKILNSENASGGE--------TNPWKGTAELLVVPWLA 297 (297) Q Consensus 227 l~~~~l~aar~aM~~~k~~~G~~L~i~P~~LvVp~~le~~A~~ll~~~~~~~g~--------~N~~~~~~~~iv~p~La 297 (297) --.+-+.+++.++..+... .+.|+.|++||++.. +|.+...+.|. .|| .+++.-.|+|. 
T Consensus 195 ei~~di~~~~~~l~~~s~g-----~~~p~~L~Lpp~~~~----~L~~~~~~~~~tvl~~lk~~~~---~l~I~~~~el~ 261 (329) T protein:vir:79 195 TAQDELEQAIEKIETLTNG-----QHRANMILIPPSMRK----VLMVRMPETTMSYLDYFKQQNG---GITIESISELE 261 (329) T ss_pred HHHHHHHHHHHHHHHhcCc-----eecccEEEecHHHHH----HhhcccCCCCccHHHHHHHhCC---CcEEEEccccc Confidence 1123456666666665432 357999999998764 33333333332 243 36777788887 No 47 >protein:vir:96833 Length: 275 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240157;genbank:gi:66395822;genbank:GeneID:5133174 Probab=60.33 E-value=0.37 Score=22.92 Aligned_cols=193 Identities=11% Similarity=0.027 Sum_probs=101.2 Q ss_pred CCc-CHHHHHHHHH--HHHHHHHHHHhhcchhhcceeeeecC----CccceecccccCCCcchhcc--cceeeeeecccc Q lcl|NC_020866. 1 MQV-TAANLDALRV--GFKTSFQGALDQAPSQYLRLTTVVPS----STKEQRYGWMGKIPNVREWI--GPRAIQNLTESD 71 (297) Q Consensus 1 M~i-~~~~l~~l~~--~~~~~f~~a~~~a~~~~~~~a~~v~S----~~~~~~y~~Lg~~P~lrew~--Ge~~~~~l~~~~ 71 (297) |.. +...|..+.. -|....++.+.+ ..-+..++..-+. -...-+.......+...++. .+.....+.... T Consensus 1 ~~~~~~T~l~d~i~PEv~~~~v~~~~~~-~~~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~~~lt~~~ 79 (275) T protein:vir:96 1 MALENMTKLANMVNPEVLAPMMQAELDK-KLKFAQFADIDNTLVGQPGNTITFPAFVYSGDAKVVPEGEEIPIDLIETKK 79 (275) T ss_pred CCCcccchhhhhhchHHHHHHHHHHHHH-hhhhcccceecccccCCCCCEEEeeeeccCCccccccCCCCcchhhcccce Confidence 544 2233433322 122222333321 1223344433211 01111111111122222332 445667777777 Q ss_pred ceeeeecccceeeccHHHhhccCcchhHHHHHHHHHHHHhhHHHHHHHHHhcccCccccCccccccccccccccccccee Q lcl|NC_020866. 
72 YSIREKPWELTIGVDRDDIETDNLGIYSPLFQEMGRSAGSKWDMLVFELLKLGFATECYDGQNFFDTDHPVLDEDGKTVT 151 (297) Q Consensus 72 ~~i~n~~fe~tv~v~R~~i~dD~lG~~~~~~~~~g~aaa~~~~~lv~~lL~~G~~~~~~DGk~~F~tdH~~~~g~~~~~s 151 (297) -+.+.+.++..+.|+..+..----.......+++|.+.++.-|.-++..|+.+..+ T Consensus 80 ~~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~ll~~l~~a~~~------------------------ 135 (275) T protein:vir:96 80 RQATIRKIGKGTVLTDEALLSGYGDPKGEAVRQHGLAIANKVDNDVLEALQGATLK------------------------ 135 (275) T ss_pred eeEEeehhcccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhccccc------------------------ Confidence 77888999999999998877544445777778888888888887777666441100 Q ss_pred eccccccchhHHHHHHHHHHHHHHHhhcccccchhhhccccccccccccceeeeccccccccccccchhhcCCccCCHHH Q lcl|NC_020866. 152 VSNTGGGTGTPWFLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLYGTDARANVGFGFWQMAYGSKQTLDGTA 231 (297) Q Consensus 152 vsn~~ag~~~awylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~g~d~r~~~G~~l~q~a~~~~~~l~~~~ 231 (297) .+..+++.+. T Consensus 136 ----------------------------------------------------------------------~~~~~~~~d~ 145 (275) T protein:vir:96 136 ----------------------------------------------------------------------VEADITKLAG 145 (275) T ss_pred ----------------------------------------------------------------------ccccccCHHH Confidence 0112334566 Q ss_pred HHHHHHHHHhhccCCCcccccccCeEEecchHHHHHHHHHhhhcc--CCCCcce--------ecceeeEEeccccC Q lcl|NC_020866. 232 YAAARAALSGMKGDYGRPLGLMPNLLVVPPALEEAGRKILNSENA--SGGETNP--------WKGTAELLVVPWLA 297 (297) Q Consensus 232 l~aar~aM~~~k~~~G~~L~i~P~~LvVp~~le~~A~~ll~~~~~--~~g~~N~--------~~~~~~~iv~p~La 297 (297) +..|++.|.... ..++.|+|+|.....-++....+.. .....|. 
+.| ++||++..+- T Consensus 146 i~dA~~~lgd~~--------~~~~~ivv~p~~~~~L~k~~~~~f~~~~~~g~~~~~~G~ig~~~G-~~Vi~s~~~p 212 (275) T protein:vir:96 146 LQTAIDKFNDED--------LEPMVLFVNPLDAGKLRASATDNFTRATLLGDNVIVKGAFGEALG-AIIVRSNKIK 212 (275) T ss_pred HHHHHHHhcccc--------CCccEEEeCHHHHHHHHhcccccccccccccccceeccccceecC-eeEEEeCCCC Confidence 667777664321 2578999999977666554322221 1112222 333 6889998886 No 48 >protein:vir:9704 Length: 394 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795466;genbank:gi:28876225;genbank:GeneID:1257769 Probab=55.56 E-value=0.48 Score=22.35 Aligned_cols=203 Identities=14% Similarity=0.123 Sum_probs=85.4 Q ss_pred CCcCHHHHHHHHHHHHHHHHHHHhhcchhhcceeeeecCCccceecccccCCCcchhccccee----eeeeccccceeee Q lcl|NC_020866. 1 MQVTAANLDALRVGFKTSFQGALDQAPSQYLRLTTVVPSSTKEQRYGWMGKIPNVREWIGPRA----IQNLTESDYSIRE 76 (297) Q Consensus 1 M~i~~~~l~~l~~~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~lrew~Ge~~----~~~l~~~~~~i~n 76 (297) ..+-|+.+ ...+.+.... .+....+|+.+|-+.....|..+..-..=-.|+||-. ..+..-..-++.. T Consensus 137 g~liP~~~-------~~~ii~~~~~-~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~l~~ 208 (394) T protein:vir:97 137 KPVSSEEI-------LYTPAREVKT-VVDLKPFTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNI 208 (394) T ss_pred cccChHHH-------HHHHHHHhhh-hhhhhhhceeeeccCcceEEEEEecCCCccceecccccccccccccceeEEeeh Confidence 11111111 0111111111 1233445667665444444444332222235775532 1223335567888 Q ss_pred ecccceeeccHHHhhccCcchhHHHHHHHHHHHHhhHHHHHHHHHhcccCccccCcccccccccccccccccceeecccc Q lcl|NC_020866. 77 KPWELTIGVDRDDIETDNLGIYSPLFQEMGRSAGSKWDMLVFELLKLGFATECYDGQNFFDTDHPVLDEDGKTVTVSNTG 156 (297) Q Consensus 77 ~~fe~tv~v~R~~i~dD~lG~~~~~~~~~g~aaa~~~~~lv~~lL~~G~~~~~~DGk~~F~tdH~~~~g~~~~~svsn~~ 156 (297) ++++..+.|+|+.+.|-..++..-+...++++.+...+..+..-+.++ +. 
.+..+ T Consensus 209 ~k~~~~i~is~ell~ds~~~~~~~i~~~la~~~~~~~~~~i~~g~~~~---------------------~~--~~~~~-- 263 (394) T protein:vir:97 209 DTYRGAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVLKSF---------------------TT--KTVKN-- 263 (394) T ss_pred hheeeehhhHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhhccccc---------------------cc--ccccc-- Confidence 999999999999999988888888999999999988886544322111 00 00000 Q ss_pred ccchhHHHHHHHHHHHHHHHhhcccccchhhhccccccccccccceeeeccccccccccccchhhcCCccCCHHHHHHHH Q lcl|NC_020866. 157 GGTGTPWFLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLYGTDARANVGFGFWQMAYGSKQTLDGTAYAAAR 236 (297) Q Consensus 157 ag~~~awylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~g~d~r~~~G~~l~q~a~~~~~~l~~~~l~aar 236 (297) | ..+..+++... +...++ -|- ++.. .. T Consensus 264 -------~-----~~~~~~~~~~~--------------------------~~~~~a---~~v--------~n~~----~~ 290 (394) T protein:vir:97 264 -------L-----DEIKALLNGGF--------------------------DPAYNV---SLI--------VSQS----FY 290 (394) T ss_pred -------H-----HHHHHHHHhhh--------------------------hhhhCC---EEE--------EcHH----HH Confidence 0 01111111000 000001 121 1222 24 Q ss_pred HHHHhhccCCCcccccccC-------eEE-ecchHHHHHHHHHhhhccCC-----CC-ccee----cceeeEEeccccC Q lcl|NC_020866. 237 AALSGMKGDYGRPLGLMPN-------LLV-VPPALEEAGRKILNSENASG-----GE-TNPW----KGTAELLVVPWLA 297 (297) Q Consensus 237 ~aM~~~k~~~G~~L~i~P~-------~Lv-Vp~~le~~A~~ll~~~~~~~-----g~-~N~~----~~~~~~iv~p~La 297 (297) .+++..||-+|+||- .|+ .|+ .|.-. ..+...+. |+ .+-+ +..+++-.+..-. 
T Consensus 291 ~~l~~lkd~~G~~i~-~~~~~~~~~~~l~G~pv~~-------~~~~~~~~~~~~~gd~~~~~~~~~~~~~~~~~~~~~~ 361 (394) T protein:vir:97 291 QTLDTLKDGNGRYLL-QDDITAVSGKVLLGKPVFV-------LSDEVLGANKAFIGDFKRGVLFADRKDLGLRWADNEI 361 (394) T ss_pred HHHHHhhccCCCeee-ecCcCCCCCceeccceeEE-------ecccccCCccEEEeeccccEEEEEecceEEEEecccc Confidence 456778888888762 222 111 11100 00001111 11 1100 1112222211111 No 49 >protein:vir:1886 Length: 385 # NCBI annotation: major capsid subunit precursor # Family: family:all:585 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037666;genbank:gi:9634124;genbank:GeneID:1262513 Probab=54.39 E-value=0.5 Score=22.21 Aligned_cols=232 Identities=13% Similarity=-0.012 Sum_probs=88.5 Q ss_pred CCcCHHHHH-HHHHHHHHHHHHHHhhcchhhcceeeeecCCccceecccccCCCcchhcccce---eeeeeccccceeee Q lcl|NC_020866. 1 MQVTAANLD-ALRVGFKTSFQGALDQAPSQYLRLTTVVPSSTKEQRYGWMGKIPNVREWIGPR---AIQNLTESDYSIRE 76 (297) Q Consensus 1 M~i~~~~l~-~l~~~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~lrew~Ge~---~~~~l~~~~~~i~n 76 (297) |..+...=. .+-.-+...+.+..... +....+++.+|-......|......+.--.|++|- .-.+..=..-++.. T Consensus 105 ~~~~~~~~g~~i~~~~~~~ii~~~~~~-~~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~~~~ 183 (385) T protein:vir:18 105 LGSDADSAGSLIQPMQIPGIIMPGLRR-LTIRDLLAQGRTSSNALEYVREEVFTNNADVVAEKALKPESDITFSKQTANV 183 (385) T ss_pred hccccccCCceecchhhhHHHHHhhhc-cchhhhcceecccCcceEEEEEecCCcceeeeccCccccccccceeEEEEee Confidence 111100000 00000112222222222 22444555655444444454443333334677553 22333334466788 Q ss_pred ecccceeeccHHHhhccCcchhHHHHHHHHHHHHhhHHHHHHHHHhcccCccccCcccccccccccccccccceeecccc Q lcl|NC_020866. 77 KPWELTIGVDRDDIETDNLGIYSPLFQEMGRSAGSKWDMLVFELLKLGFATECYDGQNFFDTDHPVLDEDGKTVTVSNTG 156 (297) Q Consensus 77 ~~fe~tv~v~R~~i~dD~lG~~~~~~~~~g~aaa~~~~~lv~~lL~~G~~~~~~DGk~~F~tdH~~~~g~~~~~svsn~~ 156 (297) ++++..+.|+|+.+. |--.+..-+.+.++++.++..+..++ +|..+ |+ ++-+-........ ... 
T Consensus 184 ~k~~~~~~is~ell~-d~~~l~~~i~~~la~a~~~~~d~~~l----~G~g~----~~------~~~Gi~~~~~~~~-~~~ 247 (385) T protein:vir:18 184 KTIAHWVQASRQVMD-DAPMLQSYINNRLMYGLALKEEGQLL----NGDGT----GD------NLEGLNKVATAYD-TSL 247 (385) T ss_pred eeEEEeehhhHHHHh-hHHHHHHHHHHHHHHHHHHHHHHHHH----hccCC----CC------ccccccccccccc-ccc Confidence 999999999999776 44557777888899999998886544 23211 11 1111000000000 000 Q ss_pred ccchhHHHHHHHHHHHHHHHhhcccccchhhhccccccccccccceeeeccccccccccccchhhcCCccCCHHHHHHHH Q lcl|NC_020866. 157 GGTGTPWFLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLYGTDARANVGFGFWQMAYGSKQTLDGTAYAAAR 236 (297) Q Consensus 157 ag~~~awylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~g~d~r~~~G~~l~q~a~~~~~~l~~~~l~aar 236 (297) ...+..+| -+.... +.+-. . ...+..+ |- ++. ..+ T Consensus 248 ~~~~~~~~-d~i~~~----~~~l~------~--~~~~~~~--------------------~~--------~~~----~~~ 282 (385) T protein:vir:18 248 NATGDTRA-DIIAHA----IYQVT------E--SEFSASG--------------------IV--------LNP----RDW 282 (385) T ss_pred cccccchH-HHHHHH----HHhhc------c--ccCCCCE--------------------EE--------EcH----HHH Confidence 11111111 000000 00000 0 0000000 11 122 234 Q ss_pred HHHHhhccCCCcccccccCeEEecchHHHHHHHHHhhhccCCCC-----cce-e----cceeeEEeccccC Q lcl|NC_020866. 237 AALSGMKGDYGRPLGLMPNLLVVPPALEEAGRKILNSENASGGE-----TNP-W----KGTAELLVVPWLA 297 (297) Q Consensus 237 ~aM~~~k~~~G~~L~i~P~~LvVp~~le~~A~~ll~~~~~~~g~-----~N~-~----~~~~~~iv~p~La 297 (297) .++++.||-+|++|--.|.- -.+..| ....++.+...+.+. .+- + +.-++|.++.... 
T Consensus 283 ~~l~~lkd~~G~~l~~~~~~-~~~~~l--~G~pV~~~~~~p~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~ 350 (385) T protein:vir:18 283 HNIALLKDNEGRYIFGGPQA-FTSNIM--WGLPVVPTKAQAAGTFTVGGFDMASQVWDRMDATVEVSREDR 350 (385) T ss_pred HHHHHhhcCCCceeccCccc-CCCcee--cceeeEEcCcCCCCcEEEeecccEEEEEEecceEEEEecccc Confidence 56777888888876321100 000000 011112222233221 111 1 1123333333332 No 50 >protein:vir:191 Length: 385 # NCBI annotation: major head subunit precursor # Family: family:all:585 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037701;genbank:gi:9634158;genbank:GeneID:1262530 Probab=54.39 E-value=0.5 Score=22.21 Aligned_cols=232 Identities=13% Similarity=-0.012 Sum_probs=88.5 Q ss_pred CCcCHHHHH-HHHHHHHHHHHHHHhhcchhhcceeeeecCCccceecccccCCCcchhcccce---eeeeeccccceeee Q lcl|NC_020866. 1 MQVTAANLD-ALRVGFKTSFQGALDQAPSQYLRLTTVVPSSTKEQRYGWMGKIPNVREWIGPR---AIQNLTESDYSIRE 76 (297) Q Consensus 1 M~i~~~~l~-~l~~~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~lrew~Ge~---~~~~l~~~~~~i~n 76 (297) |..+...=. .+-.-+...+.+..... +....+++.+|-......|......+.--.|++|- .-.+..=..-++.. T Consensus 105 ~~~~~~~~g~~i~~~~~~~ii~~~~~~-~~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~~~~ 183 (385) T protein:vir:19 105 LGSDADSAGSLIQPMQIPGIIMPGLRR-LTIRDLLAQGRTSSNALEYVREEVFTNNADVVAEKALKPESDITFSKQTANV 183 (385) T ss_pred hccccccCCceecchhhhHHHHHhhhc-cchhhhcceecccCcceEEEEEecCCcceeeeccCccccccccceeEEEEee Confidence 111100000 00000112222222222 22444555655444444454443333334677553 22333334466788 Q ss_pred ecccceeeccHHHhhccCcchhHHHHHHHHHHHHhhHHHHHHHHHhcccCccccCcccccccccccccccccceeecccc Q lcl|NC_020866. 77 KPWELTIGVDRDDIETDNLGIYSPLFQEMGRSAGSKWDMLVFELLKLGFATECYDGQNFFDTDHPVLDEDGKTVTVSNTG 156 (297) Q Consensus 77 ~~fe~tv~v~R~~i~dD~lG~~~~~~~~~g~aaa~~~~~lv~~lL~~G~~~~~~DGk~~F~tdH~~~~g~~~~~svsn~~ 156 (297) ++++..+.|+|+.+. |--.+..-+.+.++++.++..+..++ +|..+ |+ ++-+-........ ... 
T Consensus 184 ~k~~~~~~is~ell~-d~~~l~~~i~~~la~a~~~~~d~~~l----~G~g~----~~------~~~Gi~~~~~~~~-~~~ 247 (385) T protein:vir:19 184 KTIAHWVQASRQVMD-DAPMLQSYINNRLMYGLALKEEGQLL----NGDGT----GD------NLEGLNKVATAYD-TSL 247 (385) T ss_pred eeEEEeehhhHHHHh-hHHHHHHHHHHHHHHHHHHHHHHHHH----hccCC----CC------ccccccccccccc-ccc Confidence 999999999999776 44557777888899999998886544 23211 11 1111000000000 000 Q ss_pred ccchhHHHHHHHHHHHHHHHhhcccccchhhhccccccccccccceeeeccccccccccccchhhcCCccCCHHHHHHHH Q lcl|NC_020866. 157 GGTGTPWFLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLYGTDARANVGFGFWQMAYGSKQTLDGTAYAAAR 236 (297) Q Consensus 157 ag~~~awylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~g~d~r~~~G~~l~q~a~~~~~~l~~~~l~aar 236 (297) ...+..+| -+.... +.+-. . ...+..+ |- ++. ..+ T Consensus 248 ~~~~~~~~-d~i~~~----~~~l~------~--~~~~~~~--------------------~~--------~~~----~~~ 282 (385) T protein:vir:19 248 NATGDTRA-DIIAHA----IYQVT------E--SEFSASG--------------------IV--------LNP----RDW 282 (385) T ss_pred cccccchH-HHHHHH----HHhhc------c--ccCCCCE--------------------EE--------EcH----HHH Confidence 11111111 000000 00000 0 0000000 11 122 234 Q ss_pred HHHHhhccCCCcccccccCeEEecchHHHHHHHHHhhhccCCCC-----cce-e----cceeeEEeccccC Q lcl|NC_020866. 237 AALSGMKGDYGRPLGLMPNLLVVPPALEEAGRKILNSENASGGE-----TNP-W----KGTAELLVVPWLA 297 (297) Q Consensus 237 ~aM~~~k~~~G~~L~i~P~~LvVp~~le~~A~~ll~~~~~~~g~-----~N~-~----~~~~~~iv~p~La 297 (297) .++++.||-+|++|--.|.- -.+..| ....++.+...+.+. .+- + +.-++|.++.... 
T Consensus 283 ~~l~~lkd~~G~~l~~~~~~-~~~~~l--~G~pV~~~~~~p~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~ 350 (385) T protein:vir:19 283 HNIALLKDNEGRYIFGGPQA-FTSNIM--WGLPVVPTKAQAAGTFTVGGFDMASQVWDRMDATVEVSREDR 350 (385) T ss_pred HHHHHhhcCCCceeccCccc-CCCcee--cceeeEEcCcCCCCcEEEeecccEEEEEEecceEEEEecccc Confidence 56777888888876321100 000000 011112222233221 111 1 1123333333332 No 51 >protein:vir:96392 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239648;genbank:gi:66395381;genbank:GeneID:5132868 Probab=54.16 E-value=0.51 Score=22.19 Aligned_cols=224 Identities=12% Similarity=0.080 Sum_probs=97.7 Q ss_pred CCcCHHHHHHHH-------------------------HHHHHHHHHHHhhcchhhcceeeeecCCccceecccccCCCcc Q lcl|NC_020866. 1 MQVTAANLDALR-------------------------VGFKTSFQGALDQAPSQYLRLTTVVPSSTKEQRYGWMGKIPNV 55 (297) Q Consensus 1 M~i~~~~l~~l~-------------------------~~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~l 55 (297) |..+..++.... .-+...+.+.... .+...++++++|.......|..+..-|.. T Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~-~s~l~~l~~~~~~~~~~~~~p~~~~~~~a 82 (324) T protein:vir:96 4 TQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVME-NSKIMQLGKYEPMEGTEKKFTFWADKPGA 82 (324) T ss_pred chhhhHHHHHHHHHhhhhhhhccccccccCcCccccchhHHHHHHHHHHh-hchhhhhcceeeccCCceEEEEEecCcce Confidence 222222222100 0011111111111 12234456777765555667666555654 Q ss_pred hhcccce---eeeeeccccceeeeecccceeeccHHHhhccCcchhHHHHHHHHHHHHhhHHHHHHHHHhcccCccccCc Q lcl|NC_020866. 56 REWIGPR---AIQNLTESDYSIREKPWELTIGVDRDDIETDNLGIYSPLFQEMGRSAGSKWDMLVFELLKLGFATECYDG 132 (297) Q Consensus 56 rew~Ge~---~~~~l~~~~~~i~n~~fe~tv~v~R~~i~dD~lG~~~~~~~~~g~aaa~~~~~lv~~lL~~G~~~~~~DG 132 (297) .|++|- ...++.=..-+++.++++..+.|+|+.+.|....+..-+.+.++++.++..+..++. |-... 
..+ T Consensus 83 -~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~----G~g~~-~~~ 156 (324) T protein:vir:96 83 -YWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGIL----NQGNN-PFG 156 (324) T ss_pred -eEecCCccccccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhc----cCCCC-CcC Confidence 787653 333344445667889999999999999999889999999999999999998886642 21110 111 Q ss_pred ccccccccccccccccceeeccccccchhHHHHHHHHHHHHHHHhhcccccchhhhccccccccccccceeeeccccccc Q lcl|NC_020866. 133 QNFFDTDHPVLDEDGKTVTVSNTGGGTGTPWFLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLYGTDARANV 212 (297) Q Consensus 133 k~~F~tdH~~~~g~~~~~svsn~~ag~~~awylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~g~d~r~~~ 212 (297) .-+.+... ..+....... +| -+ +.-++..-.. ..+..+ T Consensus 157 ~gi~~~~~-----------~~~~~~~~~~-t~-~~----i~~~~~~l~~-------------------------~~~~~~ 194 (324) T protein:vir:96 157 KSIAQSIE-----------KTNKVIKGDF-TQ-DN----IIDLEALLED-------------------------DELEAN 194 (324) T ss_pred cccccccc-----------ccceeccccc-cH-HH----HHHHHHhhhh-------------------------ccCCCC Confidence 11111110 0011110000 00 00 1111110000 000000 Q ss_pred cccccchhhcCCccCCHHHHHHHHHHHHhhccCCCcccc--cccCeE-EecchHHHHHHHHHhhhccCCCCc-------- Q lcl|NC_020866. 213 GFGFWQMAYGSKQTLDGTAYAAARAALSGMKGDYGRPLG--LMPNLL-VVPPALEEAGRKILNSENASGGET-------- 281 (297) Q Consensus 213 G~~l~q~a~~~~~~l~~~~l~aar~aM~~~k~~~G~~L~--i~P~~L-vVp~~le~~A~~ll~~~~~~~g~~-------- 281 (297) .|- ++ .+.+.++++.|+..|+++- ..|..| =+|. +.....+.+.. T Consensus 195 ---~~v--------mn----~~~~~~L~~l~d~~G~~~~~~~~~~~l~G~PV---------~~~~~~~~~~~~~~~gd~~ 250 (324) T protein:vir:96 195 ---AFI--------SK----TQNRSLLRKIVDPETKERIYDRNSDSLDGLPV---------VNLKSSNLKRGELITGDFD 250 (324) T ss_pred ---EEE--------Ec----HHHHHHHHHhhccCCCeeecCCCCCcccceee---------EeeCCCCCCcceEEEEecc Confidence 121 11 2345567888999998763 222222 1221 11111111111 Q ss_pred cee---cceeeEEec--cccC Q lcl|NC_020866. 
282 NPW---KGTAELLVV--PWLA 297 (297) Q Consensus 282 N~~---~~~~~~iv~--p~La 297 (297) +.+ ++-+++-++ +.+. T Consensus 251 ~~~~g~~~~~~i~~~~~~~~~ 271 (324) T protein:vir:96 251 KLIYGIPQLIEYKIDETAQLS 271 (324) T ss_pred eEEEEEecCcEEEEeeccccc Confidence 111 111222222 2222 No 52 >protein:vir:78830 Length: 324 # NCBI annotation: major head protein # Family: family:all:507 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285361;genbank:gi:148717889;genbank:GeneID:5246961 Probab=54.16 E-value=0.51 Score=22.19 Aligned_cols=224 Identities=12% Similarity=0.080 Sum_probs=97.7 Q ss_pred CCcCHHHHHHHH-------------------------HHHHHHHHHHHhhcchhhcceeeeecCCccceecccccCCCcc Q lcl|NC_020866. 1 MQVTAANLDALR-------------------------VGFKTSFQGALDQAPSQYLRLTTVVPSSTKEQRYGWMGKIPNV 55 (297) Q Consensus 1 M~i~~~~l~~l~-------------------------~~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~l 55 (297) |..+..++.... .-+...+.+.... .+...++++++|.......|..+..-|.. T Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~-~s~l~~l~~~~~~~~~~~~~p~~~~~~~a 82 (324) T protein:vir:78 4 TQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVME-NSKIMQLGKYEPMEGTEKKFTFWADKPGA 82 (324) T ss_pred chhhhHHHHHHHHHhhhhhhhccccccccCcCccccchhHHHHHHHHHHh-hchhhhhcceeeccCCceEEEEEecCcce Confidence 222222222100 0011111111111 12234456777765555667666555654 Q ss_pred hhcccce---eeeeeccccceeeeecccceeeccHHHhhccCcchhHHHHHHHHHHHHhhHHHHHHHHHhcccCccccCc Q lcl|NC_020866. 56 REWIGPR---AIQNLTESDYSIREKPWELTIGVDRDDIETDNLGIYSPLFQEMGRSAGSKWDMLVFELLKLGFATECYDG 132 (297) Q Consensus 56 rew~Ge~---~~~~l~~~~~~i~n~~fe~tv~v~R~~i~dD~lG~~~~~~~~~g~aaa~~~~~lv~~lL~~G~~~~~~DG 132 (297) .|++|- ...++.=..-+++.++++..+.|+|+.+.|....+..-+.+.++++.++..+..++. |-... 
..+ T Consensus 83 -~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~----G~g~~-~~~ 156 (324) T protein:vir:78 83 -YWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGIL----NQGNN-PFG 156 (324) T ss_pred -eEecCCccccccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhc----cCCCC-CcC Confidence 787653 333344445667889999999999999999889999999999999999998886642 21110 111 Q ss_pred ccccccccccccccccceeeccccccchhHHHHHHHHHHHHHHHhhcccccchhhhccccccccccccceeeeccccccc Q lcl|NC_020866. 133 QNFFDTDHPVLDEDGKTVTVSNTGGGTGTPWFLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLYGTDARANV 212 (297) Q Consensus 133 k~~F~tdH~~~~g~~~~~svsn~~ag~~~awylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~g~d~r~~~ 212 (297) .-+.+... ..+....... +| -+ +.-++..-.. ..+..+ T Consensus 157 ~gi~~~~~-----------~~~~~~~~~~-t~-~~----i~~~~~~l~~-------------------------~~~~~~ 194 (324) T protein:vir:78 157 KSIAQSIE-----------KTNKVIKGDF-TQ-DN----IIDLEALLED-------------------------DELEAN 194 (324) T ss_pred cccccccc-----------ccceeccccc-cH-HH----HHHHHHhhhh-------------------------ccCCCC Confidence 11111110 0011110000 00 00 1111110000 000000 Q ss_pred cccccchhhcCCccCCHHHHHHHHHHHHhhccCCCcccc--cccCeE-EecchHHHHHHHHHhhhccCCCCc-------- Q lcl|NC_020866. 213 GFGFWQMAYGSKQTLDGTAYAAARAALSGMKGDYGRPLG--LMPNLL-VVPPALEEAGRKILNSENASGGET-------- 281 (297) Q Consensus 213 G~~l~q~a~~~~~~l~~~~l~aar~aM~~~k~~~G~~L~--i~P~~L-vVp~~le~~A~~ll~~~~~~~g~~-------- 281 (297) .|- ++ .+.+.++++.|+..|+++- ..|..| =+|. +.....+.+.. T Consensus 195 ---~~v--------mn----~~~~~~L~~l~d~~G~~~~~~~~~~~l~G~PV---------~~~~~~~~~~~~~~~gd~~ 250 (324) T protein:vir:78 195 ---AFI--------SK----TQNRSLLRKIVDPETKERIYDRNSDSLDGLPV---------VNLKSSNLKRGELITGDFD 250 (324) T ss_pred ---EEE--------Ec----HHHHHHHHHhhccCCCeeecCCCCCcccceee---------EeeCCCCCCcceEEEEecc Confidence 121 11 2345567888999998763 222222 1221 11111111111 Q ss_pred cee---cceeeEEec--cccC Q lcl|NC_020866. 
282 NPW---KGTAELLVV--PWLA 297 (297) Q Consensus 282 N~~---~~~~~~iv~--p~La 297 (297) +.+ ++-+++-++ +.+. T Consensus 251 ~~~~g~~~~~~i~~~~~~~~~ 271 (324) T protein:vir:78 251 KLIYGIPQLIEYKIDETAQLS 271 (324) T ss_pred eEEEEEecCcEEEEeeccccc Confidence 111 111222222 2222 No 53 >protein:vir:2504 Length: 305 # NCBI annotation: major capsid subunit gp9 # Family: family:all:507 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569745;genbank:gi:18496895;genbank:GeneID:932268 Probab=53.15 E-value=0.54 Score=22.07 Aligned_cols=231 Identities=13% Similarity=0.147 Sum_probs=102.6 Q ss_pred CCcCHHH-HHH-HHHHH-HHHHHHHHhhcchhhcceeeeecCCccceecccccCCCcchhcccceee--------eeecc Q lcl|NC_020866. 1 MQVTAAN-LDA-LRVGF-KTSFQGALDQAPSQYLRLTTVVPSSTKEQRYGWMGKIPNVREWIGPRAI--------QNLTE 69 (297) Q Consensus 1 M~i~~~~-l~~-l~~~~-~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~lrew~Ge~~~--------~~l~~ 69 (297) |.-+... ... +=.-+ +..++...+. +...++++.++.......+..+..-|.. .|+||-.- .+.+= T Consensus 1 ma~~t~~~gg~liP~~~~~~Ii~~~~~~--s~l~~l~~~~~~~~~~~~~p~~~~~~~a-~wv~E~~~~~~~~~~~s~~~f 77 (305) T protein:vir:25 1 MADISRAEVASLIQEAYSDTLLAAAKQG--STVLSAFQNVNMGTKTTHLPVLATLPEA-DWVGESATDPKGVKPTSKVTW 77 (305) T ss_pred CCCccCCccceecCHHHHHHHHHHHHhh--chhhhhcceeeccCCcEEEEEEeCCcce-EEeecccccccccccccccce Confidence 4433211 000 00011 2222222222 2356667777766666666666665653 68866431 12222 Q ss_pred ccceeeeecccceeeccHHHhhccCcchhHHHHHHHHHHHHhhHHHHHHHHHhcccCccccCcccccccccccccccccc Q lcl|NC_020866. 70 SDYSIREKPWELTIGVDRDDIETDNLGIYSPLFQEMGRSAGSKWDMLVFELLKLGFATECYDGQNFFDTDHPVLDEDGKT 149 (297) Q Consensus 70 ~~~~i~n~~fe~tv~v~R~~i~dD~lG~~~~~~~~~g~aaa~~~~~lv~~lL~~G~~~~~~DGk~~F~tdH~~~~g~~~~ 149 (297) ..-+++.++++..+.|+++.+.|-...+.+-+.+.++++.++..++.++ +|.+ .++.++....-....... 
T Consensus 78 ~~i~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~~~a~~~d~a~~----~G~g----~~~~~~~~~~~~~~~~~~- 148 (305) T protein:vir:25 78 ANRTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVI----FGTD----KPASWVSPALIPAAVTAG- 148 (305) T ss_pred eeEEeeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHhhhhe----eccC----CCCCcccccccccccccc- Confidence 3345778999999999999999888999999999999999999998766 2422 133333322111100000 Q ss_pred eeeccccccchhHHHHHHHHHHHHHHHhhcccccchhhhccccccccccccceeeeccccccccccccchhhcCCccCCH Q lcl|NC_020866. 150 VTVSNTGGGTGTPWFLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLYGTDARANVGFGFWQMAYGSKQTLDG 229 (297) Q Consensus 150 ~svsn~~ag~~~awylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~g~d~r~~~G~~l~q~a~~~~~~l~~ 229 (297) ...........+. +.-..+.-+.-.... ..+..+. | -++ T Consensus 149 --~~~~~~~~~~~~~--~~~~~~~~~~~~~~~-------------~~~~~~~---------------~--------v~~- 187 (305) T protein:vir:25 149 --QAVEVVGGVANES--DIVGATNRAAKAVAS-------------AGWAPDT---------------L--------LSS- 187 (305) T ss_pred --ccccccccchhhh--HHHHHHHHHHHhhhh-------------cccccce---------------e--------Eec- Confidence 0000011111111 100111100000000 0000000 1 111 Q ss_pred HHHHHHHHHHHhhccCCCcccccccCeEE-ecchHHHHHHHHHhhhccCC---------CC-ccee---cceeeEEeccc Q lcl|NC_020866. 230 TAYAAARAALSGMKGDYGRPLGLMPNLLV-VPPALEEAGRKILNSENASG---------GE-TNPW---KGTAELLVVPW 295 (297) Q Consensus 230 ~~l~aar~aM~~~k~~~G~~L~i~P~~Lv-Vp~~le~~A~~ll~~~~~~~---------g~-~N~~---~~~~~~iv~p~ 295 (297) .+.+..+++.||-+|++|- .|+.|. .|. +-++..+. |+ .+-+ ++-+++-++.+ T Consensus 188 ---~~~~~~l~~lkd~~G~~i~-~~~~l~G~Pv---------~~~~~~~~~~~~~~~~~gd~s~~~i~~~~~~~i~~~~~ 254 (305) T protein:vir:25 188 ---LALRYEVANIRDANGNPVF-RDDSFAGFRT---------FFNRNGAWDADAAIEVIADSSRVKIGVRQDITVKFLDQ 254 (305) T ss_pred ---HHHHHHHHHhhccCCceee-cCCcccccce---------EEcCccCCCCCccEEEEEecceEEEEEecCeEEEEeee Confidence 2234557788999999874 343322 111 00111111 11 1111 12234444444 Q ss_pred cC Q lcl|NC_020866. 
296 LA 297 (297) Q Consensus 296 La 297 (297) .+ T Consensus 255 ~~ 256 (305) T protein:vir:25 255 AT 256 (305) T ss_pred ee Confidence 43 No 54 >protein:vir:485 Length: 407 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543092;swissprot:trembl:q8w627;genbank:gi:18249904;uniprot:Q8W627;genbank:GeneID:929693 Probab=50.81 E-value=0.6 Score=21.80 Aligned_cols=238 Identities=12% Similarity=0.095 Sum_probs=94.5 Q ss_pred CCcCHHHHHHHHHH------------HHHHHHHHHhhcchhhcceeeeecCCccceecccccCCCcchhcccceeee--- Q lcl|NC_020866. 1 MQVTAANLDALRVG------------FKTSFQGALDQAPSQYLRLTTVVPSSTKEQRYGWMGKIPNVREWIGPRAIQ--- 65 (297) Q Consensus 1 M~i~~~~l~~l~~~------------~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~lrew~Ge~~~~--- 65 (297) ..++....+++.++ +...+.+...... ....+|+.+|-......|.-...-+. -.|++|-.-. T Consensus 96 ~~~~~~e~~a~~~~t~~~gG~~iP~~~~~~I~~~~~~~~-~l~~~~~~~~~~~~~~~~~~~~~~~~-a~~v~E~~~~~~~ 173 (407) T protein:vir:48 96 DGLRELERKALQVGNDEDGGYAIPEELDRTILTLLKDEV-VMRQEATVITLGGSDYKKLVNLGGTT-SGWVGETDARPET 173 (407) T ss_pred hhhhHHHHHhhhcccCCCCcccccHhHHHHHHHHHHhhh-hhhhhceeeecCCCceEEEEecCCcc-eeeeccccccccc Confidence 01111111111110 1112222222221 23445666664444444433333344 3687664321 Q ss_pred eec-cccceeeeecccceeeccHHHhhccCcchhHHHHHHHHHHHHhhHHHHHHHHHhcccCccccCccc--cccc-ccc Q lcl|NC_020866. 66 NLT-ESDYSIREKPWELTIGVDRDDIETDNLGIYSPLFQEMGRSAGSKWDMLVFELLKLGFATECYDGQN--FFDT-DHP 141 (297) Q Consensus 66 ~l~-~~~~~i~n~~fe~tv~v~R~~i~dD~lG~~~~~~~~~g~aaa~~~~~lv~~lL~~G~~~~~~DGk~--~F~t-dH~ 141 (297) ... -..-++..++++..+.|+++.+.|-...+..-+.+.|+++.+...+..+ | .|-.+ ||| ++.. ... 
T Consensus 174 ~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~~~a~---l-~G~G~----~~p~Gil~~~~~~ 245 (407) T protein:vir:48 174 ATSKLGLIEPFMGEIYGNPQATQKMLDDAFFNVEDWINSELALEFAEQEEIAF---T-SGDGS----KKPKGFLAYESTD 245 (407) T ss_pred ccccceeEEeeeeeeEeehhhHHHHHhcchHHHHHHHHHHHHHHHHHHHHhhh---h-ccCCC----Cccceeeeccccc Confidence 111 2335677899999999999999998889999999999999999988743 3 33222 222 1100 000 Q ss_pred cccccc--cceeecccc-ccchhHHHHHHHHHHHHHHHhhcccccchhhhccccccccccccceeeeccccccccccccc Q lcl|NC_020866. 142 VLDEDG--KTVTVSNTG-GGTGTPWFLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLYGTDARANVGFGFWQ 218 (297) Q Consensus 142 ~~~g~~--~~~svsn~~-ag~~~awylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~g~d~r~~~G~~l~q 218 (297) ...... ......... ++.-..--+.+.-..+++- .+.+. -|- T Consensus 246 ~~~~~~~~~~~~~~~~~~~~~~~~d~i~~l~~~l~~~--------------------------------~~~~a---~~v 290 (407) T protein:vir:48 246 EDDKTRAFGKLQHIASGAASGVTADAIIKLIYTLRKA--------------------------------HRSGA---KFM 290 (407) T ss_pred ccccccccccccccccccccccChHHHHHHHHhhchh--------------------------------hhcCC---EEE Confidence 000000 000000000 0000000011111111110 01110 121 Q ss_pred hhhcCCccCCHHHHHHHHHHHHhhccCCCcccccccCeEEecchHHHHHHHHHhhhccCC-CC-cc-ee----------- Q lcl|NC_020866. 219 MAYGSKQTLDGTAYAAARAALSGMKGDYGRPLGLMPNLLVVPPALEEAGRKILNSENASG-GE-TN-PW----------- 284 (297) Q Consensus 219 ~a~~~~~~l~~~~l~aar~aM~~~k~~~G~~L~i~P~~LvVp~~le~~A~~ll~~~~~~~-g~-~N-~~----------- 284 (297) ++ ...+..+++.||.+|+|| +.|+.---.|. .-..+-++.++..+. +. .. .+ T Consensus 291 --------~n----~~~~~~L~~lkD~~Gr~l-~~~~~~~g~~~-~l~G~PV~~~~~~p~~~~~~~~i~~Gd~~~~~~i~ 356 (407) T protein:vir:48 291 --------MN----NSSLFAIRLLKDNDGNYL-WRPGIELGQPS-SLAGYGIVENEQMPDIAADAKAIAFGNFKRGYTIV 356 (407) T ss_pred --------Ec----HHHHHHHHHhhccCCcee-eccCcCCCCCc-eecceeeEEecCcCCccCCccEEEEEeccccEEEE Confidence 11 223456778888888886 23321000000 000111122222221 11 11 11 Q ss_pred -cceeeEEeccccC Q lcl|NC_020866. 285 -KGTAELLVVPWLA 297 (297) Q Consensus 285 -~~~~~~iv~p~La 297 (297) +..+++..+|+-. 
T Consensus 357 ~~~~~~i~~d~~~~ 370 (407) T protein:vir:48 357 DRIGTRILRDPYTN 370 (407) T ss_pred EeeceEEEeecccc Confidence 2234455555433 No 55 >protein:vir:81070 Length: 390 # NCBI annotation: p09 # Family: family:all:585 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285679;genbank:gi:148727187;genbank:GeneID:5247115 Probab=48.13 E-value=0.68 Score=21.50 Aligned_cols=223 Identities=15% Similarity=0.005 Sum_probs=88.3 Q ss_pred CCcCHHHHHHHHHHHHHHHHHHHhhcchhhcceeeeecCCccceecccccCCCcchhccccee---eeeeccccceeeee Q lcl|NC_020866. 1 MQVTAANLDALRVGFKTSFQGALDQAPSQYLRLTTVVPSSTKEQRYGWMGKIPNVREWIGPRA---IQNLTESDYSIREK 77 (297) Q Consensus 1 M~i~~~~l~~l~~~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~lrew~Ge~~---~~~l~~~~~~i~n~ 77 (297) -.|.++.++. +-+.+... +....+++.+|.+.....|........--.|++|-. -.+..=..-++..+ T Consensus 123 ~~~~~~~~~~--------ii~~~~~~-~~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~ 193 (390) T protein:vir:81 123 ALTTPNRLPG--------FITPPDAR-LTVRDLIGSGRTDSALIEYVQETGFVNNAAIVAEGALKPESSLKFAKKTDTTH 193 (390) T ss_pred ceechhhhHH--------HHHHHhhh-hhhhhhcceeeccCCceEEEEEecCCcceeeecCCcccccccceeeEEEEeee Confidence 1111112221 22222222 234455667665544444544443333347875532 23333345667888 Q ss_pred cccceeeccHHHhhccCcchhHHHHHHHHHHHHhhHHHHHHHHHhcccCccccCcccccccccccccccccceeecc-cc Q lcl|NC_020866. 78 PWELTIGVDRDDIETDNLGIYSPLFQEMGRSAGSKWDMLVFELLKLGFATECYDGQNFFDTDHPVLDEDGKTVTVSN-TG 156 (297) Q Consensus 78 ~fe~tv~v~R~~i~dD~lG~~~~~~~~~g~aaa~~~~~lv~~lL~~G~~~~~~DGk~~F~tdH~~~~g~~~~~svsn-~~ 156 (297) +++..+.|+++.+.| ...+..-+...++++.++..+..++ +| ||.+ .+|-+- .+...... .. 
T Consensus 194 k~~~~~~is~ell~d-~~~~~~~i~~~l~~~~~~~~d~a~l----~G------~g~~----~~~~Gi--~~~~~~~~~~~ 256 (390) T protein:vir:81 194 VIAHTMKATRQILSD-APQLASYMNNRLIRGLKVKEDAEIL----RG------TGAN----DGLLGL--IPQATTYAAPT 256 (390) T ss_pred EEEEeehhhHHHHHh-HHHHHHHHHHHHHHHHHHHHHHHHH----hc------CCCC----Ccccce--eeccccccccc Confidence 999999999998764 4567777888899999999887443 22 1111 111110 00000000 01 Q ss_pred ccchhHHHHHHHHHHHHHHHhhcccccchhhhccccccccccccceeeeccccccccccccchhhcCCccCCHHHHHHHH Q lcl|NC_020866. 157 GGTGTPWFLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLYGTDARANVGFGFWQMAYGSKQTLDGTAYAAAR 236 (297) Q Consensus 157 ag~~~awylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~g~d~r~~~G~~l~q~a~~~~~~l~~~~l~aar 236 (297) +..+..+| -+ +.-++.+.... .+.+. -|- ++ ...+ T Consensus 257 ~~~~~~~~-~~----~~~~~~~~~~~-------------------------~~~~~---~~v--------~~----~~~~ 291 (390) T protein:vir:81 257 TIAGATRV-DQ----LRLAMLQASLA-------------------------EYNPS---GIV--------IN----PIDW 291 (390) T ss_pred ccccchhH-HH----HHHHHHhhccc-------------------------cCCCC---EEE--------Ec----HHHH Confidence 11111222 11 11111111000 00000 011 11 1234 Q ss_pred HHHHhhccCCCcccccccCeEEecchHHHHHHHHHhhhccCCCC-----c-cee----cceeeEEecc--ccC Q lcl|NC_020866. 237 AALSGMKGDYGRPLGLMPNLLVVPPALEEAGRKILNSENASGGE-----T-NPW----KGTAELLVVP--WLA 297 (297) Q Consensus 237 ~aM~~~k~~~G~~L~i~P~~LvVp~~le~~A~~ll~~~~~~~g~-----~-N~~----~~~~~~iv~p--~La 297 (297) .++++.||-+|++|--.|.-. .++.| ....++.+...+.|. - +-+ ++-+++.++. .+. 
T Consensus 292 ~~l~~lkd~~G~~l~~~~~~~-~~~~l--~G~pv~~~~~~p~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~ 361 (390) T protein:vir:81 292 AAIELAKDANNQYLIGNARGT-LTPTL--WGLPVVATQAMAPGEFLVGAFDLAAQIFDQWDARVEIGYVGEDF 361 (390) T ss_pred HHHHHhhcCCCceeecCcccc-cCcee--cceeeEEcCCCCCCcEEEEehhceEEEEEecceEEEEecccchh Confidence 466677788887653211100 00000 011122222333322 1 111 1122222211 111 No 56 >protein:vir:6242 Length: 390 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813696;swissprot:trembl:q859c1;genbank:gi:29366756;interpro:IPR006444;uniprot:Q859C1;genbank:GeneID:1258897 Probab=46.69 E-value=0.73 Score=21.35 Aligned_cols=221 Identities=14% Similarity=0.126 Sum_probs=91.4 Q ss_pred CC--cCHHHHHHHHHH---------------------------HHHHHHHHHhhcchhhcceeeeecCCc-cceeccccc Q lcl|NC_020866. 1 MQ--VTAANLDALRVG---------------------------FKTSFQGALDQAPSQYLRLTTVVPSST-KEQRYGWMG 50 (297) Q Consensus 1 M~--i~~~~l~~l~~~---------------------------~~~~f~~a~~~a~~~~~~~a~~v~S~~-~~~~y~~Lg 50 (297) .. -..+.+++...+ ++..+.+..... +..+.+|++++.+. ..-.+.... T Consensus 84 ~~~~~~~~~~r~~~~~~~r~~~~~~~~~~~t~~~~g~~~~~~~~~~~i~~~~~~~-~~l~~~~~~~~~~~~~~~~~p~~~ 162 (390) T protein:vir:62 84 SADVDDDATLRAGNLGEARSFEFAPEKRDGTKAGNPNVLSRTLYGQLIAQAVERS-AIMRGGATTFTTSDANPLDFTVIT 162 (390) T ss_pred hcchHHHHHHhhhhhhhhHHHHhhhhhhcccccCCCccccccchHHHHHHHHhhh-hhhhhcceeeecCCCceeEEEEEc Confidence 00 000000000000 122222222222 22455666665332 222344444 Q ss_pred CCCcchhcccce---eeeeeccccceeeeecccceeeccHHHhhccCcchhHHHHHHHHHHHHhhHHHHHHHHHhcccCc Q lcl|NC_020866. 51 KIPNVREWIGPR---AIQNLTESDYSIREKPWELTIGVDRDDIETDNLGIYSPLFQEMGRSAGSKWDMLVFELLKLGFAT 127 (297) Q Consensus 51 ~~P~lrew~Ge~---~~~~l~~~~~~i~n~~fe~tv~v~R~~i~dD~lG~~~~~~~~~g~aaa~~~~~lv~~lL~~G~~~ 127 (297) .-|. 
-.|++|- ...+..=..-++..++++..+.|+++.++|-...+..-+.+.++++.+...+..++ +| T Consensus 163 ~~~~-a~wv~E~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~l----~G--- 234 (390) T protein:vir:62 163 GRSS-ASIVGETAEIPESYPATAQRSMGGFKYGFASVVSYEFATDQVLDLVGFLVSDAGPAIGDAMGRHFI----TG--- 234 (390) T ss_pred CCcc-eeeecccccccccccceeeeEeeeeeEEeehHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhhhh----cc--- Confidence 4343 3677543 33444445567788999999999999999988888999999999999999887433 33 Q ss_pred cccCccc--ccccccccccccccceeeccccccchhHHH--HHHHHHHHHHHHhhcccccchhhhcccccccccccccee Q lcl|NC_020866. 128 ECYDGQN--FFDTDHPVLDEDGKTVTVSNTGGGTGTPWF--LLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFL 203 (297) Q Consensus 128 ~~~DGk~--~F~tdH~~~~g~~~~~svsn~~ag~~~awy--lld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~ 203 (297) ||+| ++...- ... ..+.. +..+...| +.+.-..++|. T Consensus 235 ---~G~p~Gi~~~~~-----~~~-~~~~~--~~~~~~~~~~l~~~~~~l~~~---------------------------- 275 (390) T protein:vir:62 235 ---TGQPRGILTDAS-----PAT-ATFLA--TDTDSKVSDALIDLFHEVPSA---------------------------- 275 (390) T ss_pred ---CCcccccccccc-----ccc-cceec--ccccccchHHHHHHHHhhhhh---------------------------- Confidence 4443 232110 000 00000 01111111 11111111110 Q ss_pred eeccccccccccccchhhcCCccCCHHHHHHHHHHHHhhccCCCcccccccC-------eEEecchHHHHHHHHHhhhcc Q lcl|NC_020866. 204 YGTDARANVGFGFWQMAYGSKQTLDGTAYAAARAALSGMKGDYGRPLGLMPN-------LLVVPPALEEAGRKILNSENA 276 (297) Q Consensus 204 ~g~d~r~~~G~~l~q~a~~~~~~l~~~~l~aar~aM~~~k~~~G~~L~i~P~-------~LvVp~~le~~A~~ll~~~~~ 276 (297) .+.+ +.|-+ +.+. ...+++.||-+|++|- .|. .|+=.| ++.++.. T Consensus 276 ----~~~~---a~~vm--------n~~~----~~~L~~lkd~~g~~l~-~~~~~~g~~~~l~G~P--------v~~~~~~ 327 (390) T protein:vir:62 276 ----YRAN---AKYVV--------NDLR----AAQMRKLKDANGQYLW-QSGLTVGAPSLFNGKV--------VETDDGM 327 (390) T ss_pred ----hhcC---CEEEE--------chHH----HHHHHHhhccCCCeee-cCCcCCCccceecccc--------eEEecCC Confidence 0000 11211 1122 3455667788887752 221 111001 1112222 Q ss_pred CCC-----Cccee----cceeeEEeccccC Q lcl|NC_020866. 
277 SGG-----ETNPW----KGTAELLVVPWLA 297 (297) Q Consensus 277 ~~g-----~~N~~----~~~~~~iv~p~La 297 (297) +.+ +-+-| ++.+++-++.... T Consensus 328 p~~~i~~gd~s~~~i~~~~~~~v~~~~~~~ 357 (390) T protein:vir:62 328 PADKILFADLSKYRVRFAGSLRVDRSVDAK 357 (390) T ss_pred CCccEEEeeccceeEEeecceEEEeecccc Confidence 221 11000 1222333222222 No 57 >protein:vir:105038 Length: 428 # NCBI annotation: major capsid head protein precursor # Family: family:all:21 # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006586;genbank:gi:46402092;genbank:GeneID:2777903 Probab=45.42 E-value=0.77 Score=21.20 Aligned_cols=235 Identities=14% Similarity=0.149 Sum_probs=92.4 Q ss_pred CCcC--HHHH-HHHHH---------------------------HHHHHHHHHHhhcchhhcceeeeecCCccceeccccc Q lcl|NC_020866. 1 MQVT--AANL-DALRV---------------------------GFKTSFQGALDQAPSQYLRLTTVVPSSTKEQRYGWMG 50 (297) Q Consensus 1 M~i~--~~~l-~~l~~---------------------------~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg 50 (297) |.+. ..++ ++... -+...+-+-+....+-.+..++.+|..+....+..+. T Consensus 98 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~liP~~~~~~ii~~l~~~~~l~~~~~~~~~~~~g~~~~p~~~ 177 (428) T protein:vir:10 98 MSIAAAQGNLQDAAKFASDELNDQSVSMAISTAAGSGGVLIPQNIHSEVIELLRDRTIVRKLGARSIPLPNGNMSLPRLA 177 (428) T ss_pred HHHHHhhhhHHHHHHHhhhhhhhhhHhhhhcccccCCccccchhHHHHHHHHHhhhchhhhhcceeeecCCcceEEEEEe Confidence 0000 0000 00000 0011111111111111111245566544445566565 Q ss_pred CCCcchhccccee---eeeeccccceeeeecccceeeccHHHhhccCcchhHHHHHHHHHHHHhhHHHHHHHHHhcccCc Q lcl|NC_020866. 51 KIPNVREWIGPRA---IQNLTESDYSIREKPWELTIGVDRDDIETDNLGIYSPLFQEMGRSAGSKWDMLVFELLKLGFAT 127 (297) Q Consensus 51 ~~P~lrew~Ge~~---~~~l~~~~~~i~n~~fe~tv~v~R~~i~dD~lG~~~~~~~~~g~aaa~~~~~lv~~lL~~G~~~ 127 (297) .-|.. .|+||-. 
..+..=..-++..++++..+.|||+.+.|-..++..-+.+.|+++.+...++.++ .|..+ T Consensus 178 ~~~~a-~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~v~is~ell~ds~~~l~~~i~~~l~~ai~~~~d~~~l----~G~G~ 252 (428) T protein:vir:10 178 GGATA-SYTGENQDAKVSEARFDDVKLTAKTMIAMVPISNALIGRAGFNVEQLVLQDILTAISVREDKAFM----RDDGT 252 (428) T ss_pred CCcce-eeeccCccccccccceeeEEeeeEEEEEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHh----ccCCC Confidence 55554 5775543 3333333456788999999999999999888999999999999999999988553 22111 Q ss_pred -cccCcccccccccccccccccceeeccccccchhHHHHHHHHHHHHHHHhhcccccchhhhccccccccccccceeeec Q lcl|NC_020866. 128 -ECYDGQNFFDTDHPVLDEDGKTVTVSNTGGGTGTPWFLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLYGT 206 (297) Q Consensus 128 -~~~DGk~~F~tdH~~~~g~~~~~svsn~~ag~~~awylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~g~ 206 (297) .-.+| ++.. .+....+.......... +......+..++.... T Consensus 253 ~~~p~G--i~~~-----~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~-------------------------- 295 (428) T protein:vir:10 253 GDTPIG--MKAR-----ATQWNRLLPWAADAAVN----LDTIDTYLDSIILMSM-------------------------- 295 (428) T ss_pred Cccccc--cccc-----ccccccccccccccccc----HHHHHHHHHHHHHhhh-------------------------- Confidence 00011 1100 00000000000000000 0001111111110000 Q ss_pred cccccccccccchhhcCCccCCHHHHHHHHHHHHhhccCCCcccc--cccCeEEecchHHHHHHHHHhhhccCC----CC Q lcl|NC_020866. 207 DARANVGFGFWQMAYGSKQTLDGTAYAAARAALSGMKGDYGRPLG--LMPNLLVVPPALEEAGRKILNSENASG----GE 280 (297) Q Consensus 207 d~r~~~G~~l~q~a~~~~~~l~~~~l~aar~aM~~~k~~~G~~L~--i~P~~LvVp~~le~~A~~ll~~~~~~~----g~ 280 (297) ..+.+.+...|-+ + ...+.++++.||.+|+||- +.+..|. ..-++.++..+. +. T Consensus 296 ~~~~~~~~~~~v~--------n----~~~~~~L~~lkd~~G~~i~~~~~~g~l~--------G~pv~~~~~~p~~~~~~~ 355 (428) T protein:vir:10 296 DGNSNMISSGWGM--------S----NRTYMKLFGLRDGNGNKVYPEMAQGMLK--------GYPIQRTSAIPANLGEGG 355 (428) T ss_pred ccccccccCEEEE--------c----HHHHHHHHHhhccCCceeccCCCCCeee--------ceeeEEeccccccccCCC Confidence 0000111111222 1 2234566777888888763 1111110 011111122221 00 Q ss_pred --ccee-----------cceeeEEeccccC Q lcl|NC_020866. 
281 --TNPW-----------KGTAELLVVPWLA 297 (297) Q Consensus 281 --~N~~-----------~~~~~~iv~p~La 297 (297) +-.+ ++-+++.++++=+ T Consensus 356 ~~~~i~~gd~s~~~i~~~~~i~i~~~~~~~ 385 (428) T protein:vir:10 356 KESEIYFADFNDVVIGEDGNMKVDFSKEAS 385 (428) T ss_pred ccceEEEEecceEEEEEecceEEEeecccc Confidence 0111 1334444554433 No 58 >protein:vir:102655 Length: 322 # NCBI annotation: Hypothetical protein # Family: family:all:6384 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_052979;genbank:gi:50282923;genbank:GeneID:2948122 Probab=44.89 E-value=0.79 Score=21.15 Aligned_cols=209 Identities=14% Similarity=0.136 Sum_probs=107.5 Q ss_pred CCcC---------HHHHHHHHH-HHHHHHHHHHhhcchhhcceeeeecCCccceeccccc--CCCcchhc-cc----cee Q lcl|NC_020866. 1 MQVT---------AANLDALRV-GFKTSFQGALDQAPSQYLRLTTVVPSSTKEQRYGWMG--KIPNVREW-IG----PRA 63 (297) Q Consensus 1 M~i~---------~~~l~~l~~-~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg--~~P~lrew-~G----e~~ 63 (297) |.|+ ...|...|. -|...+..+|++-.+-.+.-++......++..+..++ .++..++= ++ +-+ T Consensus 1 ~~~~~~~~~~~~Ms~~i~~~fv~qy~~~v~~~~qq~~s~L~~tV~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 80 (322) T protein:vir:10 1 MKLNAIMSMLPLIAGDIDQAFVQTYETTLRILSQQKSAKLKQYCQHKNESSESHNWETLASMDPDAVKRKRSRQQSADGT 80 (322) T ss_pred CcccceeeeeeeeechhhhHHHHHHHHHHHHHHHHhhhhhhcccccccccccccceeecccccccccccccccccccCcc Confidence 2222 234444443 3566666666666666665554322333443333332 34443221 11 112 Q ss_pred ee----eeccccceeeeecccceeeccHHHhhccCcchhHHHHHHHHHHHHhhHHHHHHHHHhcccCccccCcccccccc Q lcl|NC_020866. 64 IQ----NLTESDYSIREKPWELTIGVDRDDIETDNLGIYSPLFQEMGRSAGSKWDMLVFELLKLGFATECYDGQNFFDTD 139 (297) Q Consensus 64 ~~----~l~~~~~~i~n~~fe~tv~v~R~~i~dD~lG~~~~~~~~~g~aaa~~~~~lv~~lL~~G~~~~~~DGk~~F~td 139 (297) +. ......-.+...+|..-+-|++.|...=.+..-.+..++.|.+-++--|+.++..+..+.+. 
T Consensus 81 ~dtp~~~~~~~~r~~~~~d~~~~~~VDd~D~~k~~~D~~~~~~~~~a~AL~R~~D~~I~~a~~g~a~~------------ 148 (322) T protein:vir:10 81 YPTPVNNKPFAKRRTNVDTYDTGHVVEQEDISQMLLDPNSALITSQAYAMARKTDDLIIAGAWKPASI------------ 148 (322) T ss_pred cCCCccccccceEEEeecccccceecchHHHHHhhcCchHHHHHHHHHHhhhHHHHHHHhhhhccccc------------ Confidence 21 22223344666777777888888866655666677778999999999999999776553211 Q ss_pred cccccccccceeeccccccchhHHHHHHHHHHHHHHHhhcccccchhhhccccccccccccceeeeccccccccccccch Q lcl|NC_020866. 140 HPVLDEDGKTVTVSNTGGGTGTPWFLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLYGTDARANVGFGFWQM 219 (297) Q Consensus 140 H~~~~g~~~~~svsn~~ag~~~awylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~g~d~r~~~G~~l~q~ 219 (297) ++.+. ++...+ +. ...+ T Consensus 149 ----~~~gt--~v~~~s------------------------------------s~--------------~i~~------- 165 (322) T protein:vir:10 149 ----KGTGQ--PVEFLA------------------------------------TQ--------------EIGD------- 165 (322) T ss_pred ----ccccc--ccccCC------------------------------------Cc--------------cccc------- Confidence 00010 000000 00 0000 Q ss_pred hhcCCccCCHHHHHHHHHHHHhhccC-CCcccccccCeEEecchHHHHHHHHHhhhcc------------CCCCcceecc Q lcl|NC_020866. 220 AYGSKQTLDGTAYAAARAALSGMKGD-YGRPLGLMPNLLVVPPALEEAGRKILNSENA------------SGGETNPWKG 286 (297) Q Consensus 220 a~~~~~~l~~~~l~aar~aM~~~k~~-~G~~L~i~P~~LvVp~~le~~A~~ll~~~~~------------~~g~~N~~~~ 286 (297) ++..++.+-|-+|.+.+++..=+ +|. +++||+|+.... ||.-... ..|...-|.| T Consensus 166 ---g~~g~t~~kl~~a~~~l~~~dvp~d~~------R~~vv~p~~~~~---LL~d~~~ts~D~~~~~~l~~~G~ig~~lG 233 (322) T protein:vir:10 166 ---GTKPISFDYVTEITERFLENEIEPEVS------KVIVIGPTQARK---LLQITEATSADYTSAMDLQSKGIITNWMG 233 (322) T ss_pred ---CccchhHHHHHHHHHHHHhcCCCCCCC------eEEEeCHHHHHH---HhcchhhhhhhcccchhhhhcCeeeeeee Confidence 12245556666676666655433 342 289999998554 5433221 1243333444 Q ss_pred eeeEEeccccC Q lcl|NC_020866. 
287 TAELLVVPWLA 297 (297) Q Consensus 287 ~~~~iv~p~La 297 (297) ++++++-+|- T Consensus 234 -f~~i~s~~lp 243 (322) T protein:vir:10 234 -YTWIVSTRLD 243 (322) T ss_pred -EEEEEeccCC Confidence 6778888874 No 59 >protein:vir:102605 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655002;genbank:gi:109392192;genbank:GeneID:4157227 Probab=44.07 E-value=0.82 Score=21.06 Aligned_cols=191 Identities=13% Similarity=0.076 Sum_probs=87.7 Q ss_pred CCcCH---HHHHHHHHHHHHHHHHHHhhcchhhcceeeeecCCccceecccccCCCcchhcc---cceeeeeecccccee Q lcl|NC_020866. 1 MQVTA---ANLDALRVGFKTSFQGALDQAPSQYLRLTTVVPSSTKEQRYGWMGKIPNVREWI---GPRAIQNLTESDYSI 74 (297) Q Consensus 1 M~i~~---~~l~~l~~~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~lrew~---Ge~~~~~l~~~~~~i 74 (297) |.++. +...+ .+-..|++.+--++-.++...-. .....+-+...++. +...+.. +......+.+...++ T Consensus 1 MA~~~~~pe~~~~---~v~~~~~~~lv~~~l~~~~~~~~-~~~Gdtv~ip~~~~-~~~~d~~~~~~~~~~~~~~~~~~~~ 75 (273) T protein:vir:10 1 MAFNNFIPELWSD---MLLEEWTAQTVFANLVNREYEGT-ASKGNVVHIAGVVA-PTVKDYKAAGRQTSADAISDTGVDL 75 (273) T ss_pred CcchhhhHHHHHH---HHHHHHHhhhccchhhccccccc-cccCceEEEeeccc-ccccccccCCCccCccccccceEEE Confidence 99973 33321 12333444432222222211101 11112222222222 1223322 233445666666667 Q ss_pred eeecc-cceeeccHHHhhccCcchhHHHHHHHHHHHHhhHHHHHHHHHhcccCccccCcccccccccccccccccceeec Q lcl|NC_020866. 75 REKPW-ELTIGVDRDDIETDNLGIYSPLFQEMGRSAGSKWDMLVFELLKLGFATECYDGQNFFDTDHPVLDEDGKTVTVS 153 (297) Q Consensus 75 ~n~~f-e~tv~v~R~~i~dD~lG~~~~~~~~~g~aaa~~~~~lv~~lL~~G~~~~~~DGk~~F~tdH~~~~g~~~~~svs 153 (297) +..++ ...+.|+..+-..+... +..+.++++++-++.-|.-++.++..+.+. +.. 
T Consensus 76 tid~~~~~~~~i~d~d~~~~~~~-~~~~~~~~~~alA~~vD~~i~~~~~~a~~~--------------~~~--------- 131 (273) T protein:vir:10 76 LIDQEKSIDFLVDDIDRVQVAGS-LEAYTRAGATALATDTDKFIADMLVDNGTA--------------LTG--------- 131 (273) T ss_pred EEeeeeecceEeecHHHhhhhcc-HHHHHHHHHHHHHHHHHHHHHHHHhccccc--------------ccc--------- Confidence 66443 56666765444443333 467788889998888888888877652111 000 Q ss_pred cccccchhHHHHHHHHHHHHHHHhhcccccchhhhccccccccccccceeeeccccccccccccchhhcCCccCCH---- Q lcl|NC_020866. 154 NTGGGTGTPWFLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLYGTDARANVGFGFWQMAYGSKQTLDG---- 229 (297) Q Consensus 154 n~~ag~~~awylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~g~d~r~~~G~~l~q~a~~~~~~l~~---- 229 (297) +.+++. T Consensus 132 ----------------------------------------------------------------------~~~~~~~~~~ 141 (273) T protein:vir:10 132 ----------------------------------------------------------------------SAPTDADDAF 141 (273) T ss_pred ----------------------------------------------------------------------ccccchhHHH Confidence 001111 Q ss_pred HHHHHHHHHHHhhccCCCcccccccCeEEecchHHHHHHH---HHhhhccCCCCcceec-c------eeeEEeccccC Q lcl|NC_020866. 230 TAYAAARAALSGMKGDYGRPLGLMPNLLVVPPALEEAGRK---ILNSENASGGETNPWK-G------TAELLVVPWLA 297 (297) Q Consensus 230 ~~l~aar~aM~~~k~~~G~~L~i~P~~LvVp~~le~~A~~---ll~~~~~~~g~~N~~~-~------~~~~iv~p~La 297 (297) +.+.+|+++| ++..-|- ..++|||+|.....-++ .+.... 
..|+.+.++ | =++++.+..|- T Consensus 142 ~~i~~a~~~l----d~~~vP~--~~R~lvv~p~~~~~L~~~~~~~~~~~-~~~~~~~l~~G~ig~i~G~~v~~s~~lp 212 (273) T protein:vir:10 142 DLIAKALKEL----TKANVPN--VGRVVVVNAEMAFWLRSSGSKLTSAD-TSGDAAGLRAGTIGNLLGARIVESNNLR 212 (273) T ss_pred HHHHHHHHHh----hhcCCCc--CCCEEEECHHHHHHHhcchhhhhhhh-ccccccceeeeeeeEEeceEEEEecccc Confidence 2233344444 3333332 24588999977665332 222111 122333332 2 16888887775 No 60 >protein:vir:105822 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655767;genbank:gi:109522090;genbank:GeneID:4157630 Probab=44.07 E-value=0.82 Score=21.06 Aligned_cols=191 Identities=13% Similarity=0.076 Sum_probs=87.7 Q ss_pred CCcCH---HHHHHHHHHHHHHHHHHHhhcchhhcceeeeecCCccceecccccCCCcchhcc---cceeeeeecccccee Q lcl|NC_020866. 1 MQVTA---ANLDALRVGFKTSFQGALDQAPSQYLRLTTVVPSSTKEQRYGWMGKIPNVREWI---GPRAIQNLTESDYSI 74 (297) Q Consensus 1 M~i~~---~~l~~l~~~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~lrew~---Ge~~~~~l~~~~~~i 74 (297) |.++. +...+ .+-..|++.+--++-.++...-. .....+-+...++. +...+.. +......+.+...++ T Consensus 1 MA~~~~~pe~~~~---~v~~~~~~~lv~~~l~~~~~~~~-~~~Gdtv~ip~~~~-~~~~d~~~~~~~~~~~~~~~~~~~~ 75 (273) T protein:vir:10 1 MAFNNFIPELWSD---MLLEEWTAQTVFANLVNREYEGT-ASKGNVVHIAGVVA-PTVKDYKAAGRQTSADAISDTGVDL 75 (273) T ss_pred CcchhhhHHHHHH---HHHHHHHhhhccchhhccccccc-cccCceEEEeeccc-ccccccccCCCccCccccccceEEE Confidence 99973 33321 12333444432222222211101 11112222222222 1223322 233445666666667 Q ss_pred eeecc-cceeeccHHHhhccCcchhHHHHHHHHHHHHhhHHHHHHHHHhcccCccccCcccccccccccccccccceeec Q lcl|NC_020866. 75 REKPW-ELTIGVDRDDIETDNLGIYSPLFQEMGRSAGSKWDMLVFELLKLGFATECYDGQNFFDTDHPVLDEDGKTVTVS 153 (297) Q Consensus 75 ~n~~f-e~tv~v~R~~i~dD~lG~~~~~~~~~g~aaa~~~~~lv~~lL~~G~~~~~~DGk~~F~tdH~~~~g~~~~~svs 153 (297) +..++ ...+.|+..+-..+... +..+.++++++-++.-|.-++.++..+.+. +.. 
T Consensus 76 tid~~~~~~~~i~d~d~~~~~~~-~~~~~~~~~~alA~~vD~~i~~~~~~a~~~--------------~~~--------- 131 (273) T protein:vir:10 76 LIDQEKSIDFLVDDIDRVQVAGS-LEAYTRAGATALATDTDKFIADMLVDNGTA--------------LTG--------- 131 (273) T ss_pred EEeeeeecceEeecHHHhhhhcc-HHHHHHHHHHHHHHHHHHHHHHHHhccccc--------------ccc--------- Confidence 66443 56666765444443333 467788889998888888888877652111 000 Q ss_pred cccccchhHHHHHHHHHHHHHHHhhcccccchhhhccccccccccccceeeeccccccccccccchhhcCCccCCH---- Q lcl|NC_020866. 154 NTGGGTGTPWFLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLYGTDARANVGFGFWQMAYGSKQTLDG---- 229 (297) Q Consensus 154 n~~ag~~~awylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~g~d~r~~~G~~l~q~a~~~~~~l~~---- 229 (297) +.+++. T Consensus 132 ----------------------------------------------------------------------~~~~~~~~~~ 141 (273) T protein:vir:10 132 ----------------------------------------------------------------------SAPTDADDAF 141 (273) T ss_pred ----------------------------------------------------------------------ccccchhHHH Confidence 001111 Q ss_pred HHHHHHHHHHHhhccCCCcccccccCeEEecchHHHHHHH---HHhhhccCCCCcceec-c------eeeEEeccccC Q lcl|NC_020866. 230 TAYAAARAALSGMKGDYGRPLGLMPNLLVVPPALEEAGRK---ILNSENASGGETNPWK-G------TAELLVVPWLA 297 (297) Q Consensus 230 ~~l~aar~aM~~~k~~~G~~L~i~P~~LvVp~~le~~A~~---ll~~~~~~~g~~N~~~-~------~~~~iv~p~La 297 (297) +.+.+|+++| ++..-|- ..++|||+|.....-++ .+.... 
..|+.+.++ | =++++.+..|- T Consensus 142 ~~i~~a~~~l----d~~~vP~--~~R~lvv~p~~~~~L~~~~~~~~~~~-~~~~~~~l~~G~ig~i~G~~v~~s~~lp 212 (273) T protein:vir:10 142 DLIAKALKEL----TKANVPN--VGRVVVVNAEMAFWLRSSGSKLTSAD-TSGDAAGLRAGTIGNLLGARIVESNNLR 212 (273) T ss_pred HHHHHHHHHh----hhcCCCc--CCCEEEECHHHHHHHhcchhhhhhhh-ccccccceeeeeeeEEeceEEEEecccc Confidence 2233344444 3333332 24588999977665332 222111 122333332 2 16888887775 No 61 >protein:vir:4456 Length: 401 # NCBI annotation: Major capsid protein precursor # Family: family:all:21 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700379;genbank:gi:23505451;genbank:GeneID:955658 Probab=40.87 E-value=0.95 Score=20.70 Aligned_cols=242 Identities=13% Similarity=0.077 Sum_probs=93.3 Q ss_pred CCcCHHHHHHHHH------------HHHHHHHHHHhhcchhhcceeeeecCCccceecccccCCCcchhcccceeeee-e Q lcl|NC_020866. 1 MQVTAANLDALRV------------GFKTSFQGALDQAPSQYLRLTTVVPSSTKEQRYGWMGKIPNVREWIGPRAIQN-L 67 (297) Q Consensus 1 M~i~~~~l~~l~~------------~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~lrew~Ge~~~~~-l 67 (297) ..+.....+++.+ -+...+.+..... +...++|+.+|-+.....+.-...-+.. .|+||-.-.. . T Consensus 97 ~~~~~~e~~a~~~~~~~~GG~~iP~~~~~~ii~~~~~~-~~l~~~~~~~~~~~~~~~~~~~~~~~~a-~wv~E~~~~~~~ 174 (401) T protein:vir:44 97 DGLRDLERKALQVGTDEDGGYAVPEELDRSILSLLKDE-VVMRQEATVITVGGSDYKKLVNLGGTAS-GWVGETDTRSQT 174 (401) T ss_pred hhhHHHHHHHhhcCCCCCCceeccHhHHHHHHHHHHhh-hhhhhhceeeecCCCceEEEEecCCccc-eeeccccccCcc Confidence 1111111111111 1122222323222 2345567776644444344433333333 6887654221 1 Q ss_pred ---ccccceeeeecccceeeccHHHhhccCcchhHHHHHHHHHHHHhhHHHHHHHHHhcccCccccCccccccccccccc Q lcl|NC_020866. 68 ---TESDYSIREKPWELTIGVDRDDIETDNLGIYSPLFQEMGRSAGSKWDMLVFELLKLGFATECYDGQNFFDTDHPVLD 144 (297) Q Consensus 68 ---~~~~~~i~n~~fe~tv~v~R~~i~dD~lG~~~~~~~~~g~aaa~~~~~lv~~lL~~G~~~~~~DGk~~F~tdH~~~~ 144 (297) .=..-++..++++..+.|+++.+.|-...+..-+.+.|+++.++..+..++. |..+.-- +-++........ 
T Consensus 175 ~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~ai~~~~~~~~l~----G~G~~~p--~Gil~~~~~~~~ 248 (401) T protein:vir:44 175 ATSRLGLIEPFMGEIYGNPQATQKMLDDAFFNVEAWINSELATEFAEQEEIAFTT----GDGTKKP--KGFLAYESTEES 248 (401) T ss_pred ccccceeeeeehhheeeehhhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhc----cCCCCcc--ceeecccccccc Confidence 2233467778899999999999998888999999999999999888764442 2221100 112221111100 Q ss_pred -ccccceeecccccc-chhHHHHHHHHHHHHHHHhhcccccchhhhccccccccccccceeeeccccccccccccchhhc Q lcl|NC_020866. 145 -EDGKTVTVSNTGGG-TGTPWFLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLYGTDARANVGFGFWQMAYG 222 (297) Q Consensus 145 -g~~~~~svsn~~ag-~~~awylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~g~d~r~~~G~~l~q~a~~ 222 (297) .............+ .+...| ..+..+|+..... .+.+ +.|-+ T Consensus 249 ~~~~~~~~~~~~~t~~~~~~~~-----d~i~~~~~~l~~~-------------------------~~~~---a~~v~--- 292 (401) T protein:vir:44 249 DKARAFGKLQHIVSGEATAVTA-----DAIIKLIYTLRKA-------------------------HRTG---AKFMM--- 292 (401) T ss_pred ccccccccccccccccccccCH-----HHHHHHHHhcchh-------------------------hhcC---CEEEE--- Confidence 00000000000000 000000 0011111110000 0000 11211 Q ss_pred CCccCCHHHHHHHHHHHHhhccCCCcccccccCeEEecchHHHHHHHHHhhhccCC-CC-cc-eecc------------e Q lcl|NC_020866. 223 SKQTLDGTAYAAARAALSGMKGDYGRPLGLMPNLLVVPPALEEAGRKILNSENASG-GE-TN-PWKG------------T 287 (297) Q Consensus 223 ~~~~l~~~~l~aar~aM~~~k~~~G~~L~i~P~~LvVp~~le~~A~~ll~~~~~~~-g~-~N-~~~~------------~ 287 (297) + ...+..+++.||-+|+||- .|+.- .++.-.-...-++-++..+. +. .. .+.| - T Consensus 293 -----n----~~~~~~L~~lkd~~G~~l~-~~~~~-~g~~~~l~G~PVv~~~~~p~~~~~~~~i~~Gd~~~~~~i~~~~~ 361 (401) T protein:vir:44 293 -----N----NNSLFAIRLLKDTEGNYLW-RPGLE-LGQPSSLAGYGIAENEQMPDIAADAKAIAFGNFKRGYTIVDRIG 361 (401) T ss_pred -----c----HHHHHHHHHhhccCCceee-cCCcC-CCCCceecceeeEEecCcCCccCCccEEEEeehhccEEEEEecc Confidence 1 2235666677888888762 22100 00000000111111112111 11 11 1112 2 Q ss_pred eeEEeccccC Q lcl|NC_020866. 288 AELLVVPWLA 297 (297) Q Consensus 288 ~~~iv~p~La 297 (297) +++..+|+-. 
T Consensus 362 ~~~~~~~~~~ 371 (401) T protein:vir:44 362 TRILRDPYTN 371 (401) T ss_pred eEEeeecccc Confidence 3444444432 No 62 >protein:vir:8420 Length: 477 # NCBI annotation: gp15 # Family: family:all:21 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818316;genbank:gi:29566752;genbank:GeneID:1260033 Probab=40.70 E-value=0.96 Score=20.68 Aligned_cols=243 Identities=13% Similarity=0.057 Sum_probs=93.4 Q ss_pred CCcCH---HHH---------HHHHH-----------H-HHHHHHHHHhhcchhhcceeeeec--CCccceecccccCCCc Q lcl|NC_020866. 1 MQVTA---ANL---------DALRV-----------G-FKTSFQGALDQAPSQYLRLTTVVP--SSTKEQRYGWMGKIPN 54 (297) Q Consensus 1 M~i~~---~~l---------~~l~~-----------~-~~~~f~~a~~~a~~~~~~~a~~v~--S~~~~~~y~~Lg~~P~ 54 (297) +.... ... +++.+ - +...+.+.+....+ ..+++..++ .......+.....-+. T Consensus 135 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~lv~~~~~~~~ii~~l~~~~~-i~~~~~~~~~~~~~~~~~ip~~~~~~~ 213 (477) T protein:vir:84 135 MVDVESDKEIRKIAKVGEEYRDLDRNGGTGGYAVPPLWMMNRFIELARAGRT-YANLCPTEPLPGGTSSINIPKILTGTS 213 (477) T ss_pred HhhhhhhhhHHHHHHhhhhhccccccCCCcceeeccchhHHHHHHHhhhcch-HHHhhceeeecCCcceeEEEEEecCcc Confidence 00000 000 00000 0 11223333322222 233343333 2333334444444444 Q ss_pred chhccccee--------eeeeccccceeeeecccceeeccHHHhhccCcchhHHHHHHHHHHHHhhHHHHHHHHHhcccC Q lcl|NC_020866. 55 VREWIGPRA--------IQNLTESDYSIREKPWELTIGVDRDDIETDNLGIYSPLFQEMGRSAGSKWDMLVFELLKLGFA 126 (297) Q Consensus 55 lrew~Ge~~--------~~~l~~~~~~i~n~~fe~tv~v~R~~i~dD~lG~~~~~~~~~g~aaa~~~~~lv~~lL~~G~~ 126 (297) --.|+||-. 
..++.=..-++..++++..+.|||+.|+|-...+-.-+...++++.+...+..++ +|.+ T Consensus 214 ~a~~~~Eg~~~~~~~~~~s~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~~l----~G~G 289 (477) T protein:vir:84 214 TAIQAADNAALTAPSAHEVDLTDGFVQANVKTIAGQQGIAIQLLDQAAVSVDEFVFRDLAADYANKLNVQVI----SGTG 289 (477) T ss_pred eeeeeccCcccccccccccccceeeEEEeeeeEEeeeHHHHHHHhccchhHHHHHHHHHHHHHHHHHHHHHh----ccCC Confidence 456766532 1222223356778899999999999999999999999999999999999996433 3322 Q ss_pred ccccCcccccccccccccccccceeeccccccchhHHHHHHHHHHHHHHHhhcccccchhhhccccccccccccceeeec Q lcl|NC_020866. 127 TECYDGQNFFDTDHPVLDEDGKTVTVSNTGGGTGTPWFLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLYGT 206 (297) Q Consensus 127 ~~~~DGk~~F~tdH~~~~g~~~~~svsn~~ag~~~awylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~g~ 206 (297) + .|+| +++....+. ..++...++...+ +.......++.-... +.. T Consensus 290 t---~~~p-----~Gi~~~~~~-~~~~~~~~~~t~~----~~~~~~~~i~~~~~~---------------------~~~- 334 (477) T protein:vir:84 290 S---NNQV-----VGVRATAGI-TQVTATSAGSALE----KHQIIYQKIADAIQR---------------------VHT- 334 (477) T ss_pred C---CCcc-----ceeeecccc-ccccccccccchh----hHHHHHHHHHHHHhh---------------------ccc- Confidence 1 1121 111111100 0011111111100 000000001000000 000 Q ss_pred cccccccccccchhhcCCccCCHHHHHHHHHHHHhhccCCCccccccc------CeEEec-----chHHHHHHHHHhhhc Q lcl|NC_020866. 207 DARANVGFGFWQMAYGSKQTLDGTAYAAARAALSGMKGDYGRPLGLMP------NLLVVP-----PALEEAGRKILNSEN 275 (297) Q Consensus 207 d~r~~~G~~l~q~a~~~~~~l~~~~l~aar~aM~~~k~~~G~~L~i~P------~~LvVp-----~~le~~A~~ll~~~~ 275 (297) ..+.+ -..|-+. .....++++.||-+|+||-... ..++.+ +.-.-...-++.+.. 
T Consensus 335 ~~~~~--~~~~v~~------------~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~~~~~~~~~l~G~pVv~s~~ 400 (477) T protein:vir:84 335 SRFLE--PEVIVMH------------PRRWASFHAIFAGDDRPLIVPSGPGFNNLGVLTEVASQRVVGQMHGLPVVTDPT 400 (477) T ss_pred cccCC--ccEEEEc------------HHHHHHHHHhhccCCCeeeecCcccccccccccccccccccchhcccceEecCc Confidence 00000 0123221 2234667777888888764211 111110 000111222333333 Q ss_pred cCCC---Ccc---eecc----------eeeEEeccccC Q lcl|NC_020866. 276 ASGG---ETN---PWKG----------TAELLVVPWLA 297 (297) Q Consensus 276 ~~~g---~~N---~~~~----------~~~~iv~p~La 297 (297) .|.+ ..| .+.| -+.+.++|+.- T Consensus 401 ~p~~~~~~~d~~~i~~gd~~~~~i~~~~~~~~~~~~~~ 438 (477) T protein:vir:84 401 LPTTLGTGTDQDVIHVLRASDLALFESSVRMRALQETR 438 (477) T ss_pred ccccccccCCcceEEEEEeceEEEEeeceeEEeccccc Confidence 3321 011 1111 12333444433 No 63 >protein:vir:9309 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803287;genbank:gi:29028597;genbank:GeneID:1258044 Probab=39.49 E-value=1 Score=20.55 Aligned_cols=224 Identities=13% Similarity=0.101 Sum_probs=96.4 Q ss_pred CCcCHHHHH-------------HHHH------------HHHHHHHHHHhhcchhhcceeeeecCCccceecccccCCCcc Q lcl|NC_020866. 1 MQVTAANLD-------------ALRV------------GFKTSFQGALDQAPSQYLRLTTVVPSSTKEQRYGWMGKIPNV 55 (297) Q Consensus 1 M~i~~~~l~-------------~l~~------------~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~l 55 (297) |..++.+++ +..+ -+...+-+... ..+...++++.+|-......|.++..-|.- T Consensus 4 ~~~~~~~~~~f~~~~~~~~~~~a~~~~~~~~~~~liP~~~~~~ii~~~~-~~s~l~~l~~~~~~~~~~~~ip~~~~~~~a 82 (324) T protein:vir:93 4 TQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVM-ENSKIMQLGKYEPMEGTEKKFTFWADKPGA 82 (324) T ss_pred hHHHHHHHHHHHHhhhhhhhcccccccccCCCcceechhHHHHHHHHHH-hhchhhhhcceeeccCCceEEEEEecCcce Confidence 222222221 1100 01111111111 112245567777755555667776555654 Q ss_pred hhcccce---eeeeeccccceeeeecccceeeccHHHhhccCcchhHHHHHHHHHHHHhhHHHHHHHHHhcccCccccCc Q lcl|NC_020866. 
56 REWIGPR---AIQNLTESDYSIREKPWELTIGVDRDDIETDNLGIYSPLFQEMGRSAGSKWDMLVFELLKLGFATECYDG 132 (297) Q Consensus 56 rew~Ge~---~~~~l~~~~~~i~n~~fe~tv~v~R~~i~dD~lG~~~~~~~~~g~aaa~~~~~lv~~lL~~G~~~~~~DG 132 (297) +|+||- ...++.-..-+++.++++..+.|+|+.+.|-+..+..-+.+.++++.++..++.++ . |.... ..+ T Consensus 83 -~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l---~-G~g~~-~~~ 156 (324) T protein:vir:93 83 -YWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGI---L-NQGNN-PFG 156 (324) T ss_pred -eeecCCccccccccceeEEEEEeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHh---c-CCCCC-CcC Confidence 687553 33334445566788999999999999999888899999999999999999888653 2 21110 111 Q ss_pred ccccccccccccccccceeeccccccchhHHHHHHHHHHHHHHHhhcccccchhhhccccccccccccceeeeccccccc Q lcl|NC_020866. 133 QNFFDTDHPVLDEDGKTVTVSNTGGGTGTPWFLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLYGTDARANV 212 (297) Q Consensus 133 k~~F~tdH~~~~g~~~~~svsn~~ag~~~awylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~g~d~r~~~ 212 (297) ..++++ ... .+.... +..+| .+ +.-++.+... . .+..+ T Consensus 157 ~~~~~~-------~~~----~~~~~~-~~~~~-~~----i~~~~~~l~~------------~-------------~~~~~ 194 (324) T protein:vir:93 157 KSIAQS-------IEK----TNKVIK-GDFTQ-DN----IIDLEALLED------------D-------------ELEAN 194 (324) T ss_pred cccccc-------ccc----cceecc-ccccH-HH----HHHHHHhhhh------------c-------------cCCCC Confidence 111111 000 011000 00111 00 1111110000 0 00000 Q ss_pred cccccchhhcCCccCCHHHHHHHHHHHHhhccCCCcccccc--cCeEE-ecchHHHHHHHHHhhhccCC-------CCc- Q lcl|NC_020866. 213 GFGFWQMAYGSKQTLDGTAYAAARAALSGMKGDYGRPLGLM--PNLLV-VPPALEEAGRKILNSENASG-------GET- 281 (297) Q Consensus 213 G~~l~q~a~~~~~~l~~~~l~aar~aM~~~k~~~G~~L~i~--P~~Lv-Vp~~le~~A~~ll~~~~~~~-------g~~- 281 (297) -|-+ + ...+..+++.||.+|+++-.. |..|. +|. +.....+. 
|+- T Consensus 195 ---~~v~--------n----~~~~~~L~~l~d~~G~~~~~~~~~~~l~G~PV---------v~~~~~~~~~~~i~~gdfs 250 (324) T protein:vir:93 195 ---AFIS--------K----TQNRSLLRKIVDPETKERIYDRNSDSLDGLPV---------VNLKSSNLKRGELITGDFD 250 (324) T ss_pred ---EEEE--------c----HHHHHHHHHhhCCCCCeeecCCCCCcccceee---------EeecCCCCCcceEEEEecc Confidence 1211 1 234556778899999976321 21111 111 10111111 111 Q ss_pred cee---cceeeEEeccccC Q lcl|NC_020866. 282 NPW---KGTAELLVVPWLA 297 (297) Q Consensus 282 N~~---~~~~~~iv~p~La 297 (297) +.+ ++-+++-++.... T Consensus 251 ~~~~~~~~~~~i~~~~~~~ 269 (324) T protein:vir:93 251 KLIYGIPQLIEYKIDETAQ 269 (324) T ss_pred eEEEEEecCcEEEEeeccc Confidence 111 1112222222222 No 64 >protein:vir:80213 Length: 334 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522884;genbank:gi:158345177;genbank:GeneID:5687476 Probab=38.29 E-value=1.1 Score=20.41 Aligned_cols=215 Identities=13% Similarity=0.091 Sum_probs=100.3 Q ss_pred CC-----------cCH-HHHHHHH-HHHHHHHHHHHhhcchhhcceeeeecCCccceecccccCCCcchhcccceeeeee Q lcl|NC_020866. 1 MQ-----------VTA-ANLDALR-VGFKTSFQGALDQAPSQYLRLTTVVPSSTKEQRYGWMGKIPNVREWIGPRAIQNL 67 (297) Q Consensus 1 M~-----------i~~-~~l~~l~-~~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~lrew~Ge~~~~~l 67 (297) |- ... ..-.+|| .-|..+...+|+.. +-+..+.++ .+-...+++ +||.+ |+-+++.. T Consensus 1 m~~~~~~~~t~~~~~~~~~~~~l~le~~~geV~~af~~~-s~~~~~~~~-r~i~~G~s~----~~~~i----G~~~~~~~ 70 (334) T protein:vir:80 1 MTYPAANTHTRPGWGGANSDVSLHIEEHLGLVDASFMYS-SKFASWMNV-RSLRGTNQL----RVDRV----GASTIAGR 70 (334) T ss_pred CCCCcCCCccccccccccchheehhhhhhhHHHHHHHHh-hhhhcccee-eeccccceE----EEeee----cceeeeee Confidence 21 111 1124566 55667777777655 333333322 111122222 34543 22222111 Q ss_pred cccc--ceeeeecccceeeccH-----HHhhc---c--CcchhHHHHHHHHHHHHhhHHHHHHHHHhcccCcc-ccCccc Q lcl|NC_020866. 
68 TESD--YSIREKPWELTIGVDR-----DDIET---D--NLGIYSPLFQEMGRSAGSKWDMLVFELLKLGFATE-CYDGQN 134 (297) Q Consensus 68 ~~~~--~~i~n~~fe~tv~v~R-----~~i~d---D--~lG~~~~~~~~~g~aaa~~~~~lv~~lL~~G~~~~-~~DGk~ 134 (297) .-.. -.=.+++=+.+|.|+- ..|.| = .+..=+++.+++|++=|++-|+.++..|..|.... -..-++ T Consensus 71 ~~g~~l~~~~~~~~~~~l~ID~~l~~~~~VddiD~~q~~~D~rse~~~~~G~aLA~~~D~~~~~~l~kaa~~~~~~~~~~ 150 (334) T protein:vir:80 71 KAGEELVVQKNVSDKLNLTVDTVLYARHFFDKFDEWTSNLDVRKETAREDGIALARQYDQACIIQLQKCGDFLAPAHLKP 150 (334) T ss_pred cCCCCCCCCCcccCceEEEEeeeeehhhhHhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccc Confidence 1100 0001233344444432 22221 1 24567889999999999999999998776543221 111122 Q ss_pred ccccccccccccccceeeccccccchhHHHHHHHHHHHHHHHhhcccccchhhhccccccccccccceeeeccccccccc Q lcl|NC_020866. 135 FFDTDHPVLDEDGKTVTVSNTGGGTGTPWFLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLYGTDARANVGF 214 (297) Q Consensus 135 ~F~tdH~~~~g~~~~~svsn~~ag~~~awylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~g~d~r~~~G~ 214 (297) -|+.-|.. +....+.. T Consensus 151 ~~~~G~~~----------~~~~~g~~------------------------------------------------------ 166 (334) T protein:vir:80 151 AFHDGILL----------PSTISGLA------------------------------------------------------ 166 (334) T ss_pred cccCCcce----------eecccccc------------------------------------------------------ Confidence 22111100 00000000 Q ss_pred cccchhhcCCccCCHHHHHHHHHHHHhhccCCCcccc-cccCeEEecchHHHH---HHHHHhhhccCCCCcceecce--- Q lcl|NC_020866. 215 GFWQMAYGSKQTLDGTAYAAARAALSGMKGDYGRPLG-LMPNLLVVPPALEEA---GRKILNSENASGGETNPWKGT--- 287 (297) Q Consensus 215 ~l~q~a~~~~~~l~~~~l~aar~aM~~~k~~~G~~L~-i~P~~LvVp~~le~~---A~~ll~~~~~~~g~~N~~~~~--- 287 (297) ....-+...+-+|...++.+-+...-|-. ..++++||+|..... +.++++++....+..|++.+. 
T Consensus 167 --------~~~~~~~~~l~~a~~~a~~~L~e~dvp~~~~~~R~~vv~P~~y~~Ll~~~r~~n~d~~~s~~~~~~~~g~i~ 238 (334) T protein:vir:80 167 --------ADAAADADVLVAAHRQGVEAMVFRDLGDQLMSEGVTLLDPVIFSFLLEHDRLMNVEFGAKEGGNSFVGGRIA 238 (334) T ss_pred --------cchhhhHHHHHHHHHHHHHHHHhcCCCCCcCCceEEEeChHHHHHHhcccccccceeccccccccccceeEE Confidence 00112233344444444444444444322 356799999987665 445666665555556766543 Q ss_pred ----eeEEeccccC Q lcl|NC_020866. 288 ----AELLVVPWLA 297 (297) Q Consensus 288 ----~~~iv~p~La 297 (297) ++|+.+++|- T Consensus 239 ~v~G~~V~~Sn~~P 252 (334) T protein:vir:80 239 MLNGVRVVETPRFP 252 (334) T ss_pred EEeceEEEeecCCC Confidence 7888899988 No 65 >protein:vir:96123 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240078;genbank:gi:66395742;genbank:GeneID:5133103 Probab=38.16 E-value=1.1 Score=20.40 Aligned_cols=192 Identities=15% Similarity=0.089 Sum_probs=95.9 Q ss_pred CCcCHHHHHHHHHH--HHHHHHHHHhhcchhhcceeeeecC----Ccc-ceeccc--ccCCCcchhcccceeeeeecccc Q lcl|NC_020866. 1 MQVTAANLDALRVG--FKTSFQGALDQAPSQYLRLTTVVPS----STK-EQRYGW--MGKIPNVREWIGPRAIQNLTESD 71 (297) Q Consensus 1 M~i~~~~l~~l~~~--~~~~f~~a~~~a~~~~~~~a~~v~S----~~~-~~~y~~--Lg~~P~lrew~Ge~~~~~l~~~~ 71 (297) |.=....|..|..= |...+++.+.. .--+..++..-.+ ... .+...| +|+.-.+.|. .+....++.... T Consensus 1 ma~~~T~~~d~i~Pev~s~~v~~~~~~-~~~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~~~~~g-~~i~~~~it~~~ 78 (274) T protein:vir:96 1 MAQGTTKVSNLIVPEVLAPMMQAELDK-KLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEG-EKIPVDQIGTSK 78 (274) T ss_pred CCccccchhhhhhhHHHHHHHHHHHHh-hhhhcccccccccccCCCCCEEEEEeeccCCCccccCCC-CcCchhhcccce Confidence 65322222222221 11111111111 1112233322110 011 111112 2333222221 345666777777 Q ss_pred ceeeeecccceeeccHHHhhccCcchhHHHHHHHHHHHHhhHHHHHHHHHhcccCccccCccccccccccccccccccee Q lcl|NC_020866. 
72 YSIREKPWELTIGVDRDDIETDNLGIYSPLFQEMGRSAGSKWDMLVFELLKLGFATECYDGQNFFDTDHPVLDEDGKTVT 151 (297) Q Consensus 72 ~~i~n~~fe~tv~v~R~~i~dD~lG~~~~~~~~~g~aaa~~~~~lv~~lL~~G~~~~~~DGk~~F~tdH~~~~g~~~~~s 151 (297) -+.+.+.++..+.|+-.+..--.-.......+++|++.++.-+.-++..|+.+ +. + T Consensus 79 ~~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~~d~~i~~~l~~a--~~----------------------~ 134 (274) T protein:vir:96 79 REAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGA--TL----------------------T 134 (274) T ss_pred eEEEEEeeeceeeecHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHHHHhcC--CC----------------------C Confidence 77788888999999877776655567777888888888888887777666331 00 0 Q ss_pred eccccccchhHHHHHHHHHHHHHHHhhcccccchhhhccccccccccccceeeeccccccccccccchhhcCCccCCHHH Q lcl|NC_020866. 152 VSNTGGGTGTPWFLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLYGTDARANVGFGFWQMAYGSKQTLDGTA 231 (297) Q Consensus 152 vsn~~ag~~~awylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~g~d~r~~~G~~l~q~a~~~~~~l~~~~ 231 (297) .+..+++.+. T Consensus 135 ----------------------------------------------------------------------~~~~~~~~d~ 144 (274) T protein:vir:96 135 ----------------------------------------------------------------------VEADITKLDG 144 (274) T ss_pred ----------------------------------------------------------------------cCcccccHHH Confidence 0011233455 Q ss_pred HHHHHHHHHhhccCCCcccccccCeEEecchHHHHHHHHHhhhccC--CCCcce--------ecceeeEEeccccC Q lcl|NC_020866. 232 YAAARAALSGMKGDYGRPLGLMPNLLVVPPALEEAGRKILNSENAS--GGETNP--------WKGTAELLVVPWLA 297 (297) Q Consensus 232 l~aar~aM~~~k~~~G~~L~i~P~~LvVp~~le~~A~~ll~~~~~~--~g~~N~--------~~~~~~~iv~p~La 297 (297) +-.|++.+.... ..+++|+|+|.....-++.-.-+... 
.+..|+ +.| ++||+++.|- T Consensus 145 i~dA~~~l~d~~--------~~~~~ivv~p~~~~~L~k~~~~~f~~~~~~g~~~~~~g~ig~~~G-~~Vi~s~~~p 211 (274) T protein:vir:96 145 LQTAIDKFNDED--------LEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALG-AVIVRSNKLN 211 (274) T ss_pred HHHHHHHhcccC--------CCceEEEeCHHHHHHHHhcccccccccccccccceeecccceecC-eeEEEcCCCC Confidence 556666654321 25789999999766655532111111 111121 233 6899999986 No 66 >protein:vir:80068 Length: 301 # NCBI annotation: gp8 # Family: family:all:463 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468712;genbank:gi:157325292;genbank:GeneID:5601759 Probab=37.85 E-value=1.1 Score=20.36 Aligned_cols=212 Identities=13% Similarity=0.092 Sum_probs=97.8 Q ss_pred CCcCHHHHHHHHHHHHHHHHHHHhhcchhhccee---eeecCCccceecccccCCCcchhcccceee----eeeccccce Q lcl|NC_020866. 1 MQVTAANLDALRVGFKTSFQGALDQAPSQYLRLT---TVVPSSTKEQRYGWMGKIPNVREWIGPRAI----QNLTESDYS 73 (297) Q Consensus 1 M~i~~~~l~~l~~~~~~~f~~a~~~a~~~~~~~a---~~v~S~~~~~~y~~Lg~~P~lrew~Ge~~~----~~l~~~~~~ 73 (297) ..+..+.|+.|-..+.....+- -..+++. +.++-...+-+|... +.-..-+|+|++.- -+..-..+. T Consensus 7 g~f~~~~l~~id~~v~e~~~~~-----l~~r~l~~v~~~~~~~~~~~~~~~~-~~~G~~~~~~~~~~dip~~~~~~~~~~ 80 (301) T protein:vir:80 7 ATIEARDLQAIDNVIYEPKQEE-----LTARSVFPQKFDVNEGAESYSFDVM-TRSGAAKIIANGADDLPLVDVDMVRKS 80 (301) T ss_pred chhhHHHHHHHHHHHHHhhhhh-----hhhhhhcccccCCCCceEEEEEeee-ccceeEEEecCcccccccccccceeEE Confidence 3344444444444332222221 1223332 222222233333333 22244456654321 122234667 Q ss_pred eeeecccceeeccHHHhhccC---cchhHHHHHHHHHHHHhhHHHHHHHHHhcccCccccCcccccccccccccccccce Q lcl|NC_020866. 74 IREKPWELTIGVDRDDIETDN---LGIYSPLFQEMGRSAGSKWDMLVFELLKLGFATECYDGQNFFDTDHPVLDEDGKTV 150 (297) Q Consensus 74 i~n~~fe~tv~v~R~~i~dD~---lG~~~~~~~~~g~aaa~~~~~lv~~lL~~G~~~~~~DGk~~F~tdH~~~~g~~~~~ 150 (297) .+...|+..+.+++++++.=. +.+=.+....+.++.++++|+++|- |.. .|++.. -. 
T Consensus 81 ~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~f~----G~~------------~~g~~G----Ll 140 (301) T protein:vir:80 81 VPIYSIGIGLSYTIQDLRAARMQGTTVDAAKATTVRRAIAEKENSIAFR----GEK------------KYAIKG----AF 140 (301) T ss_pred EEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEee----ecc------------ccccee----ee Confidence 889999999999999999864 3444566777778888888888772 211 111110 00 Q ss_pred eeccccccchhHHHHHHHHHHHHHHHhhcccccchhhhccccccccccccceeeeccccccccccccchhhcCCccCCHH Q lcl|NC_020866. 151 TVSNTGGGTGTPWFLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLYGTDARANVGFGFWQMAYGSKQTLDGT 230 (297) Q Consensus 151 svsn~~ag~~~awylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~g~d~r~~~G~~l~q~a~~~~~~l~~~ 230 (297) ...++... . ..+-...+-.-|..+ +..--.+ T Consensus 141 N~p~~~~~------------------------------~----------------~~~~~~~~~~~w~~~---t~~ei~~ 171 (301) T protein:vir:80 141 EATGIQID------------------------------V----------------SPTTGVGNVSKWEKK---TAEQIID 171 (301) T ss_pred cCCCcccc------------------------------c----------------ccCcccccccccccC---CHHHHHH Confidence 00000000 0 000000111112100 0000123 Q ss_pred HHHHHHHHHHhhccCCCcccccccCeEEecchHHHHHHHHHhhhccCCCC--------cceecceeeEEeccccC Q lcl|NC_020866. 231 AYAAARAALSGMKGDYGRPLGLMPNLLVVPPALEEAGRKILNSENASGGE--------TNPWKGTAELLVVPWLA 297 (297) Q Consensus 231 ~l~aar~aM~~~k~~~G~~L~i~P~~LvVp~~le~~A~~ll~~~~~~~g~--------~N~~~~~~~~iv~p~La 297 (297) -+.+++.++..+.. | ...|..|++||+....-.+-..++. .|. .||+ ++++-.|+|. 
T Consensus 172 di~~~~~~l~~~s~--g---~~~p~~L~L~p~~~~~L~~~~~~~~--~~~tvl~~l~~~~~~---~~I~~~p~L~ 236 (301) T protein:vir:80 172 EIGEAHTKITVLPG--Y---GTASLKLCLPPKQFELINKKRYSNE--DSRSVLKVLQDNAWF---SAIVRVPDLA 236 (301) T ss_pred HHHHHHHHHHHhcC--c---eecccEEEecHHHHHhhhhccccCC--CCeeHHHHHHHHcCc---ceEEEcceec Confidence 45666666655532 2 2468999999997765432221111 111 2443 5666778887 No 67 >protein:vir:79987 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430002;genbank:gi:156604057;genbank:GeneID:5525447 Probab=37.19 E-value=1.1 Score=20.29 Aligned_cols=224 Identities=12% Similarity=0.065 Sum_probs=90.8 Q ss_pred CCcCHHHHHHHHH------------------------HHHHHHHHHHhhcchhhcceeeeecCCccceeccccc--CCCc Q lcl|NC_020866. 1 MQVTAANLDALRV------------------------GFKTSFQGALDQAPSQYLRLTTVVPSSTKEQRYGWMG--KIPN 54 (297) Q Consensus 1 M~i~~~~l~~l~~------------------------~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg--~~P~ 54 (297) ..+..+..++... -+...+.+..... ..-..+++.++-++...+|..+- .-+. T Consensus 99 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~iP~~~~~~ii~~~~~~-~~l~~~~~~~~~~~~~~~~~~~~~~~~~~ 177 (415) T protein:vir:79 99 TKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVE-FNLDKYVTVKRVTNGSGKYPVVRQSEVAA 177 (415) T ss_pred hhhHHHHHHHHHHHHhhhhhhhhccccccccccccchHHHHHHHHHHHhh-hhhhhheeeeeccCCceeEEEEeecCCcc Confidence 1111111111100 0111111111111 11233455555333333443332 2222 Q ss_pred chhcccce-eee---eeccccceeeeecccceeeccHHHhhccCcchhHHHHHHHHHHHHhhHHHHHHHHHhcccCcccc Q lcl|NC_020866. 55 VREWIGPR-AIQ---NLTESDYSIREKPWELTIGVDRDDIETDNLGIYSPLFQEMGRSAGSKWDMLVFELLKLGFATECY 130 (297) Q Consensus 55 lrew~Ge~-~~~---~l~~~~~~i~n~~fe~tv~v~R~~i~dD~lG~~~~~~~~~g~aaa~~~~~lv~~lL~~G~~~~~~ 130 (297) -.|++|- .+. ...=..-++..++++..+.|+|+.+.|-...+..-+.+.++++.++..+..+..-+..|-+. 
T Consensus 178 -~~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~--- 253 (415) T protein:vir:79 178 -LEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTG--- 253 (415) T ss_pred -ceeeccccccCcccccceeeEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCccc--- Confidence 3577543 222 12234567788999999999999998878888888999999999999887765544332111 Q ss_pred CcccccccccccccccccceeeccccccchhHHHHHHHHHHHHHHHhhcccccchhhhccccccccccccceeeeccccc Q lcl|NC_020866. 131 DGQNFFDTDHPVLDEDGKTVTVSNTGGGTGTPWFLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLYGTDARA 210 (297) Q Consensus 131 DGk~~F~tdH~~~~g~~~~~svsn~~ag~~~awylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~g~d~r~ 210 (297) +...... ...+..+..+...| ..+.-+|...... .+. T Consensus 254 ----------~~~~~~~---~~~~~~~~~~~~~~-----~~i~~~~~~~~~~-------------------------~~~ 290 (415) T protein:vir:79 254 ----------STSSGFE---KEGKKLEVKKAKSL-----DDIKDAINLNVKP-------------------------NYE 290 (415) T ss_pred ----------ccccccc---ccccccccccccch-----hHHHHHHHhhhhh-------------------------ccC Confidence 0110000 00111111111111 0111111110000 000 Q ss_pred cccccccchhhcCCccCCHHHHHHHHHHHHhhccCCCcccccccC-------eEE-ecchHHHHHHHHHhhhccC---CC Q lcl|NC_020866. 211 NVGFGFWQMAYGSKQTLDGTAYAAARAALSGMKGDYGRPLGLMPN-------LLV-VPPALEEAGRKILNSENAS---GG 279 (297) Q Consensus 211 ~~G~~l~q~a~~~~~~l~~~~l~aar~aM~~~k~~~G~~L~i~P~-------~Lv-Vp~~le~~A~~ll~~~~~~---~g 279 (297) +. -|- ++. ..+.++++.||-+|+||- .|+ .|. .| +...+..+ .| T Consensus 291 ~~---~~v--------~n~----~~~~~l~~lkd~~G~~l~-~~~~~~~~~~~l~G~p---------V~~~~~~~~~~~~ 345 (415) T protein:vir:79 291 HN---VAI--------VSQ----TMFAKLDKMKDKLGNYLI-QPDVKEKTQQRLLGAK---------IEILPDEVLGQKG 345 (415) T ss_pred CC---EEE--------EcH----HHHHHHHHhhccCCceee-ccCcCCCCCceeccee---------eEEecccccCCCC Confidence 00 011 112 234567778898998873 222 111 11 01111111 22 Q ss_pred Ccceecc------------eeeEEeccccC Q lcl|NC_020866. 
280 ETNPWKG------------TAELLVVPWLA 297 (297) Q Consensus 280 ~~N~~~~------------~~~~iv~p~La 297 (297) +...+.| -+++..+++.. T Consensus 346 ~~~~~~Gd~~~~~~~~~~~~~~v~~~~~~~ 375 (415) T protein:vir:79 346 NNTLIIGNLKDAIVLFDRSQYQASWTDYMH 375 (415) T ss_pred ccEEEEEehhccEEEEeecceEEEEecccc Confidence 2223333 23333334333 No 68 >protein:vir:81100 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429874;genbank:gi:156603927;genbank:GeneID:5525320 Probab=37.19 E-value=1.1 Score=20.29 Aligned_cols=224 Identities=12% Similarity=0.065 Sum_probs=90.8 Q ss_pred CCcCHHHHHHHHH------------------------HHHHHHHHHHhhcchhhcceeeeecCCccceeccccc--CCCc Q lcl|NC_020866. 1 MQVTAANLDALRV------------------------GFKTSFQGALDQAPSQYLRLTTVVPSSTKEQRYGWMG--KIPN 54 (297) Q Consensus 1 M~i~~~~l~~l~~------------------------~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg--~~P~ 54 (297) ..+..+..++... -+...+.+..... ..-..+++.++-++...+|..+- .-+. T Consensus 99 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~iP~~~~~~ii~~~~~~-~~l~~~~~~~~~~~~~~~~~~~~~~~~~~ 177 (415) T protein:vir:81 99 TKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVE-FNLDKYVTVKRVTNGSGKYPVVRQSEVAA 177 (415) T ss_pred hhhHHHHHHHHHHHHhhhhhhhhccccccccccccchHHHHHHHHHHHhh-hhhhhheeeeeccCCceeEEEEeecCCcc Confidence 1111111111100 0111111111111 11233455555333333443332 2222 Q ss_pred chhcccce-eee---eeccccceeeeecccceeeccHHHhhccCcchhHHHHHHHHHHHHhhHHHHHHHHHhcccCcccc Q lcl|NC_020866. 55 VREWIGPR-AIQ---NLTESDYSIREKPWELTIGVDRDDIETDNLGIYSPLFQEMGRSAGSKWDMLVFELLKLGFATECY 130 (297) Q Consensus 55 lrew~Ge~-~~~---~l~~~~~~i~n~~fe~tv~v~R~~i~dD~lG~~~~~~~~~g~aaa~~~~~lv~~lL~~G~~~~~~ 130 (297) -.|++|- .+. ...=..-++..++++..+.|+|+.+.|-...+..-+.+.++++.++..+..+..-+..|-+. 
T Consensus 178 -~~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~--- 253 (415) T protein:vir:81 178 -LEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTG--- 253 (415) T ss_pred -ceeeccccccCcccccceeeEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCccc--- Confidence 3577543 222 12234567788999999999999998878888888999999999999887765544332111 Q ss_pred CcccccccccccccccccceeeccccccchhHHHHHHHHHHHHHHHhhcccccchhhhccccccccccccceeeeccccc Q lcl|NC_020866. 131 DGQNFFDTDHPVLDEDGKTVTVSNTGGGTGTPWFLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLYGTDARA 210 (297) Q Consensus 131 DGk~~F~tdH~~~~g~~~~~svsn~~ag~~~awylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~g~d~r~ 210 (297) +...... ...+..+..+...| ..+.-+|...... .+. T Consensus 254 ----------~~~~~~~---~~~~~~~~~~~~~~-----~~i~~~~~~~~~~-------------------------~~~ 290 (415) T protein:vir:81 254 ----------STSSGFE---KEGKKLEVKKAKSL-----DDIKDAINLNVKP-------------------------NYE 290 (415) T ss_pred ----------ccccccc---ccccccccccccch-----hHHHHHHHhhhhh-------------------------ccC Confidence 0110000 00111111111111 0111111110000 000 Q ss_pred cccccccchhhcCCccCCHHHHHHHHHHHHhhccCCCcccccccC-------eEE-ecchHHHHHHHHHhhhccC---CC Q lcl|NC_020866. 211 NVGFGFWQMAYGSKQTLDGTAYAAARAALSGMKGDYGRPLGLMPN-------LLV-VPPALEEAGRKILNSENAS---GG 279 (297) Q Consensus 211 ~~G~~l~q~a~~~~~~l~~~~l~aar~aM~~~k~~~G~~L~i~P~-------~Lv-Vp~~le~~A~~ll~~~~~~---~g 279 (297) +. -|- ++. ..+.++++.||-+|+||- .|+ .|. .| +...+..+ .| T Consensus 291 ~~---~~v--------~n~----~~~~~l~~lkd~~G~~l~-~~~~~~~~~~~l~G~p---------V~~~~~~~~~~~~ 345 (415) T protein:vir:81 291 HN---VAI--------VSQ----TMFAKLDKMKDKLGNYLI-QPDVKEKTQQRLLGAK---------IEILPDEVLGQKG 345 (415) T ss_pred CC---EEE--------EcH----HHHHHHHHhhccCCceee-ccCcCCCCCceeccee---------eEEecccccCCCC Confidence 00 011 112 234567778898998873 222 111 11 01111111 22 Q ss_pred Ccceecc------------eeeEEeccccC Q lcl|NC_020866. 
280 ETNPWKG------------TAELLVVPWLA 297 (297) Q Consensus 280 ~~N~~~~------------~~~~iv~p~La 297 (297) +...+.| -+++..+++.. T Consensus 346 ~~~~~~Gd~~~~~~~~~~~~~~v~~~~~~~ 375 (415) T protein:vir:81 346 NNTLIIGNLKDAIVLFDRSQYQASWTDYMH 375 (415) T ss_pred ccEEEEEehhccEEEEeecceEEEEecccc Confidence 2223333 23333334333 No 69 >protein:vir:98339 Length: 415 # NCBI annotation: putative capsid protein # Family: family:all:21 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918931;genbank:gi:119443693;genbank:GeneID:4594501 Probab=37.19 E-value=1.1 Score=20.29 Aligned_cols=224 Identities=12% Similarity=0.065 Sum_probs=90.8 Q ss_pred CCcCHHHHHHHHH------------------------HHHHHHHHHHhhcchhhcceeeeecCCccceeccccc--CCCc Q lcl|NC_020866. 1 MQVTAANLDALRV------------------------GFKTSFQGALDQAPSQYLRLTTVVPSSTKEQRYGWMG--KIPN 54 (297) Q Consensus 1 M~i~~~~l~~l~~------------------------~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg--~~P~ 54 (297) ..+..+..++... -+...+.+..... ..-..+++.++-++...+|..+- .-+. T Consensus 99 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~iP~~~~~~ii~~~~~~-~~l~~~~~~~~~~~~~~~~~~~~~~~~~~ 177 (415) T protein:vir:98 99 TKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVE-FNLDKYVTVKRVTNGSGKYPVVRQSEVAA 177 (415) T ss_pred hhhHHHHHHHHHHHHhhhhhhhhccccccccccccchHHHHHHHHHHHhh-hhhhhheeeeeccCCceeEEEEeecCCcc Confidence 1111111111100 0111111111111 11233455555333333443332 2222 Q ss_pred chhcccce-eee---eeccccceeeeecccceeeccHHHhhccCcchhHHHHHHHHHHHHhhHHHHHHHHHhcccCcccc Q lcl|NC_020866. 55 VREWIGPR-AIQ---NLTESDYSIREKPWELTIGVDRDDIETDNLGIYSPLFQEMGRSAGSKWDMLVFELLKLGFATECY 130 (297) Q Consensus 55 lrew~Ge~-~~~---~l~~~~~~i~n~~fe~tv~v~R~~i~dD~lG~~~~~~~~~g~aaa~~~~~lv~~lL~~G~~~~~~ 130 (297) -.|++|- .+. ...=..-++..++++..+.|+|+.+.|-...+..-+.+.++++.++..+..+..-+..|-+. 
T Consensus 178 -~~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~--- 253 (415) T protein:vir:98 178 -LEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTG--- 253 (415) T ss_pred -ceeeccccccCcccccceeeEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCccc--- Confidence 3577543 222 12234567788999999999999998878888888999999999999887765544332111 Q ss_pred CcccccccccccccccccceeeccccccchhHHHHHHHHHHHHHHHhhcccccchhhhccccccccccccceeeeccccc Q lcl|NC_020866. 131 DGQNFFDTDHPVLDEDGKTVTVSNTGGGTGTPWFLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLYGTDARA 210 (297) Q Consensus 131 DGk~~F~tdH~~~~g~~~~~svsn~~ag~~~awylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~g~d~r~ 210 (297) +...... ...+..+..+...| ..+.-+|...... .+. T Consensus 254 ----------~~~~~~~---~~~~~~~~~~~~~~-----~~i~~~~~~~~~~-------------------------~~~ 290 (415) T protein:vir:98 254 ----------STSSGFE---KEGKKLEVKKAKSL-----DDIKDAINLNVKP-------------------------NYE 290 (415) T ss_pred ----------ccccccc---ccccccccccccch-----hHHHHHHHhhhhh-------------------------ccC Confidence 0110000 00111111111111 0111111110000 000 Q ss_pred cccccccchhhcCCccCCHHHHHHHHHHHHhhccCCCcccccccC-------eEE-ecchHHHHHHHHHhhhccC---CC Q lcl|NC_020866. 211 NVGFGFWQMAYGSKQTLDGTAYAAARAALSGMKGDYGRPLGLMPN-------LLV-VPPALEEAGRKILNSENAS---GG 279 (297) Q Consensus 211 ~~G~~l~q~a~~~~~~l~~~~l~aar~aM~~~k~~~G~~L~i~P~-------~Lv-Vp~~le~~A~~ll~~~~~~---~g 279 (297) +. -|- ++. ..+.++++.||-+|+||- .|+ .|. .| +...+..+ .| T Consensus 291 ~~---~~v--------~n~----~~~~~l~~lkd~~G~~l~-~~~~~~~~~~~l~G~p---------V~~~~~~~~~~~~ 345 (415) T protein:vir:98 291 HN---VAI--------VSQ----TMFAKLDKMKDKLGNYLI-QPDVKEKTQQRLLGAK---------IEILPDEVLGQKG 345 (415) T ss_pred CC---EEE--------EcH----HHHHHHHHhhccCCceee-ccCcCCCCCceeccee---------eEEecccccCCCC Confidence 00 011 112 234567778898998873 222 111 11 01111111 22 Q ss_pred Ccceecc------------eeeEEeccccC Q lcl|NC_020866. 
280 ETNPWKG------------TAELLVVPWLA 297 (297) Q Consensus 280 ~~N~~~~------------~~~~iv~p~La 297 (297) +...+.| -+++..+++.. T Consensus 346 ~~~~~~Gd~~~~~~~~~~~~~~v~~~~~~~ 375 (415) T protein:vir:98 346 NNTLIIGNLKDAIVLFDRSQYQASWTDYMH 375 (415) T ss_pred ccEEEEEehhccEEEEeecceEEEEecccc Confidence 2223333 23333334333 No 70 >protein:vir:99675 Length: 324 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249589;genbank:gi:68299740;genbank:GeneID:3799990 Probab=36.83 E-value=1.2 Score=20.25 Aligned_cols=177 Identities=16% Similarity=0.107 Sum_probs=76.8 Q ss_pred eeeeecCCccceecccccCCCcchhcccceeeeeeccccce----eeeecccceeeccHH--------Hhhcc--Ccchh Q lcl|NC_020866. 33 LTTVVPSSTKEQRYGWMGKIPNVREWIGPRAIQNLTESDYS----IREKPWELTIGVDRD--------DIETD--NLGIY 98 (297) Q Consensus 33 ~a~~v~S~~~~~~y~~Lg~~P~lrew~Ge~~~~~l~~~~~~----i~n~~fe~tv~v~R~--------~i~dD--~lG~~ 98 (297) ..+.+.+ .+++ +||.+ |+-+++...-...- =....=+.+|.|+-. ||++= .+.+- T Consensus 1 ~vr~i~~---g~s~----~~~~i----G~~~~~~~~~G~~l~~~~~~~~~~e~~itID~~l~~~~~VdDiD~~qa~~Dlr 69 (324) T protein:vir:99 1 MTRTITS---GKSA----QFPVM----GRTKARYLKQGQSLDDGREDIKHTEKVITIDGLLTTDVLIYDIEDAMNHYDVR 69 (324) T ss_pred Ceeeeec---CceE----EEeee----eeeEeccccCCCCcCCCcCCcCcccEEEEecchhhhhhhhhhHHHHhcCccch Confidence 2222222 2222 34443 32222222211110 001233334443322 22111 24567 Q ss_pred HHHHHHHHHHHHhhHHHHHHHHHhcccCc-cccCcccccccccccccccccceeeccccccchhHHHHHHHHHHHHHHHh Q lcl|NC_020866. 99 SPLFQEMGRSAGSKWDMLVFELLKLGFAT-ECYDGQNFFDTDHPVLDEDGKTVTVSNTGGGTGTPWFLLDTTRALKPIIL 177 (297) Q Consensus 99 ~~~~~~~g~aaa~~~~~lv~~lL~~G~~~-~~~DGk~~F~tdH~~~~g~~~~~svsn~~ag~~~awylld~s~alkpiI~ 177 (297) ++..+++|++=++.-|+.++..+..+... .--...+.+..+|.... 
+.++ T Consensus 70 ~e~s~~~G~aLA~~~Dq~i~~~~a~~~~~~a~~~~~~~~~~g~~~~~---------~~~~-------------------- 120 (324) T protein:vir:99 70 SEYSTQMGEALAMAADVANYAEMAKLVNSRKETTNENIEGLGAASLV---------KITG-------------------- 120 (324) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccCCcccCCcccee---------cccc-------------------- Confidence 78899999999999999999887553211 10111222222221100 0000 Q ss_pred hcccccchhhhccccccccccccceeeeccccccccccccchhhcCCccCCHHHHHHHHHHHHhhccCCCcccccccCeE Q lcl|NC_020866. 178 QKRRDFQFVSKTKLDDDHVFMNKEFLYGTDARANVGFGFWQMAYGSKQTLDGTAYAAARAALSGMKGDYGRPLGLMPNLL 257 (297) Q Consensus 178 q~r~~~~~~~~~~~~~~~vf~~~~~~~g~d~r~~~G~~l~q~a~~~~~~l~~~~l~aar~aM~~~k~~~G~~L~i~P~~L 257 (297) -....+.+.+.+-++...++..-+...=| -..+++ T Consensus 121 -------------------------------------------~~~~~~~~~~~~~dai~~a~~~Lde~~VP--~~gR~~ 155 (324) T protein:vir:99 121 -------------------------------------------KKEDPAKYGTQVIQALTYARAAFAKKYIP--AGDRTF 155 (324) T ss_pred -------------------------------------------cccccccCHHHHHHHHHHHHHHHhhcCCC--CCCCEE Confidence 00112333444444444444444433333 235789 Q ss_pred EecchHHHHHHHHHhhhccC---CCCcceec-c------eeeEEeccccC Q lcl|NC_020866. 258 VVPPALEEAGRKILNSENAS---GGETNPWK-G------TAELLVVPWLA 297 (297) Q Consensus 258 vVp~~le~~A~~ll~~~~~~---~g~~N~~~-~------~~~~iv~p~La 297 (297) ||+|.... .|+...... .+..+.+. 
| =++|+.+++|- T Consensus 156 vv~P~~y~---~Ll~~~~~~~~~~~~~~~~~~G~V~~i~Gf~V~~Sn~lp 202 (324) T protein:vir:99 156 YTDPDTYS---AILAALMPNAANYAALIDPETGNIRNVMGFEVVETPHMT 202 (324) T ss_pred EeChHHHH---HHhhcccccccccccccceecceEEEEeceEEEecCCcc Confidence 99998864 333332221 12223332 2 26888999886 No 71 >protein:vir:1239 Length: 274 # NCBI annotation: similar to phage B1 major head protein # Family: family:all:522 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510938;genbank:gi:17426272;genbank:GeneID:927376 Probab=36.71 E-value=1.2 Score=20.23 Aligned_cols=183 Identities=13% Similarity=0.072 Sum_probs=93.9 Q ss_pred CC---------cCHHHHHHHHHHHHHHHHHHHhhcchhhcceeeee------cC-CccceecccccCCCcchhcc--cce Q lcl|NC_020866. 1 MQ---------VTAANLDALRVGFKTSFQGALDQAPSQYLRLTTVV------PS-STKEQRYGWMGKIPNVREWI--GPR 62 (297) Q Consensus 1 M~---------i~~~~l~~l~~~~~~~f~~a~~~a~~~~~~~a~~v------~S-~~~~~~y~~Lg~~P~lrew~--Ge~ 62 (297) |. |.|+.+.+... ..+.+.+ -+..++.+- |- +-....|.-+|+ ..++. .+. T Consensus 1 ma~~~T~l~d~iiPev~~~~v~---~~~~~~l-----~~~~~~~~d~~l~g~~G~tv~iP~~~~ig~---a~~~~~g~~i 69 (274) T protein:vir:12 1 MAQGLTKTSNQIIPEVLAPMMQ---AQLEKKL-----RFASFAEVDSTLQGQPGDTLTFPAFVYSGD---AQVVAEGEKI 69 (274) T ss_pred CCcceeehhhhhchHHHHHHHH---HHHHhhh-----hhcccceecccccCCCCCEEEEeeecCCCc---cccccCCCcc Confidence 33 34444433321 1122221 122232221 10 111111222333 23332 345 Q ss_pred eeeeeccccceeeeecccceeeccHHHhhccCcchhHHHHHHHHHHHHhhHHHHHHHHHhcccCccccCccccccccccc Q lcl|NC_020866. 
63 AIQNLTESDYSIREKPWELTIGVDRDDIETDNLGIYSPLFQEMGRSAGSKWDMLVFELLKLGFATECYDGQNFFDTDHPV 142 (297) Q Consensus 63 ~~~~l~~~~~~i~n~~fe~tv~v~R~~i~dD~lG~~~~~~~~~g~aaa~~~~~lv~~lL~~G~~~~~~DGk~~F~tdH~~ 142 (297) ...++....-+.+.+..+..++|+..+..----.......+++|.+.++.-|.-+...+..+..+ T Consensus 70 ~~~~lt~~~~~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~q~~~~~a~~vd~~~l~~~~~a~~~--------------- 134 (274) T protein:vir:12 70 PTDILETKKREAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLT--------------- 134 (274) T ss_pred chhhcccceeeEEeeeecceeeecHHHHHhcccchHHHHHHHHHHHHHHHHHHHHHHHHhccccc--------------- Confidence 66777777777888888999999887665543345566667777776766666555554331000 Q ss_pred ccccccceeeccccccchhHHHHHHHHHHHHHHHhhcccccchhhhccccccccccccceeeeccccccccccccchhhc Q lcl|NC_020866. 143 LDEDGKTVTVSNTGGGTGTPWFLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLYGTDARANVGFGFWQMAYG 222 (297) Q Consensus 143 ~~g~~~~~svsn~~ag~~~awylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~g~d~r~~~G~~l~q~a~~ 222 (297) . T Consensus 135 -------------------------------------------------------------------------------~ 135 (274) T protein:vir:12 135 -------------------------------------------------------------------------------V 135 (274) T ss_pred -------------------------------------------------------------------------------c Confidence 0 Q ss_pred CCccCCHHHHHHHHHHHHhhccCCCcccccccCeEEecchHHHHHHHHHhhhcc--CCCCcce--------ecceeeEEe Q lcl|NC_020866. 223 SKQTLDGTAYAAARAALSGMKGDYGRPLGLMPNLLVVPPALEEAGRKILNSENA--SGGETNP--------WKGTAELLV 292 (297) Q Consensus 223 ~~~~l~~~~l~aar~aM~~~k~~~G~~L~i~P~~LvVp~~le~~A~~ll~~~~~--~~g~~N~--------~~~~~~~iv 292 (297) +..+++.+.+-.|++.|.... ..+++|+|+|.....-++...-+.. ..+..|. 
+.| ++||+ T Consensus 136 ~~~a~~~d~i~dA~~~lgd~~--------~~~~~ivv~p~~~~~L~k~~~~~fv~~s~~g~~~~~~G~ig~~~G-~~Vi~ 206 (274) T protein:vir:12 136 NADITKLNGLQSAIDKFNDED--------LEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALG-AIIVR 206 (274) T ss_pred cccccCHHHHHHHHHHhcccc--------ccccEEEeCHHHHHHHHhhhhhhccccccccccceecccceeecC-eeEEE Confidence 011344566667776664322 2567899999987765554222222 1222232 444 68999 Q ss_pred ccccC Q lcl|NC_020866. 293 VPWLA 297 (297) Q Consensus 293 ~p~La 297 (297) ++.+- T Consensus 207 s~~~p 211 (274) T protein:vir:12 207 SNKLE 211 (274) T ss_pred eCCCC Confidence 98886 No 72 >protein:vir:4339 Length: 395 # NCBI annotation: major head protein # Family: family:all:585 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061502;genbank:gi:9635591;genbank:GeneID:1262860 Probab=36.70 E-value=1.2 Score=20.23 Aligned_cols=234 Identities=12% Similarity=-0.004 Sum_probs=89.9 Q ss_pred CCcCHHHHHHHHHH----------------------------HHHHHHHHHhhcchhhcceeeeecCCccceecccccCC Q lcl|NC_020866. 1 MQVTAANLDALRVG----------------------------FKTSFQGALDQAPSQYLRLTTVVPSSTKEQRYGWMGKI 52 (297) Q Consensus 1 M~i~~~~l~~l~~~----------------------------~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~ 52 (297) +.......++.... +...+.+.+... +....+++.+|-.....+|...... T Consensus 87 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~vp~~~~~~ii~~~~~~-~~l~~l~~~~~~~~~~~~~~~~~~~ 165 (395) T protein:vir:43 87 MVAESLKEQGVTSSLRGSHRVSMPRSAITSIDGSGGALVAPDRRPGVVAAPQRR-LTIRDLVAPGTTESNSVEYVRETGF 165 (395) T ss_pred HHHHHHHHHHHHHHhhhhhhhhhhhhhhcccCCCCccccchhhHHHHHHHHHhh-hhHHhhccceecCCCceEEEEEecC Confidence 11111111111111 111122222222 2233445555543444455544333 Q ss_pred Ccchhccccee---eeeeccccceeeeecccceeeccHHHhhccCcchhHHHHHHHHHHHHhhHHHHHHHHHhcccCccc Q lcl|NC_020866. 53 PNVREWIGPRA---IQNLTESDYSIREKPWELTIGVDRDDIETDNLGIYSPLFQEMGRSAGSKWDMLVFELLKLGFATEC 129 (297) Q Consensus 53 P~lrew~Ge~~---~~~l~~~~~~i~n~~fe~tv~v~R~~i~dD~lG~~~~~~~~~g~aaa~~~~~lv~~lL~~G~~~~~ 129 (297) ..--.|+||-. -.++.=..-++..++++..+.|+++.+. 
|--.+..-+.+.++++.++..+..++ +|..+ T Consensus 166 ~~~a~~v~E~~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~-d~~~l~~~v~~~la~a~~~~~d~~~l----~G~g~-- 238 (395) T protein:vir:43 166 VNNAAPVSEGTQKPYSDLTFELENAPVRTIAHLFKASRQILD-DASALQSYIDARARYGLMLVEECQLL----YGNGT-- 238 (395) T ss_pred CCceeeecCCccccccccceeEEEEeeeeEEEeehhhHHHHH-hHHHHHHHHHHHHHHHHHHHHHHHHH----hccCC-- Confidence 23347876543 2333334466788899999999999875 44445566778899999998887554 22111 Q ss_pred cCcccccccccccccccccceeeccccccchhHHHHHHHHHHHHHHHhhcccccchhhhccccccccccccceeeecccc Q lcl|NC_020866. 130 YDGQNFFDTDHPVLDEDGKTVTVSNTGGGTGTPWFLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLYGTDAR 209 (297) Q Consensus 130 ~DGk~~F~tdH~~~~g~~~~~svsn~~ag~~~awylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~g~d~r 209 (297) |+++ ..+...... ..+.......+..+| .+....+.-+ . + . .+ T Consensus 239 --~~~~----~Gi~~~~~~-~~~~~~~~~~~~~~~-~~i~~~~~~~----~------~----~---------------~~ 281 (395) T protein:vir:43 239 --GANL----HGIIPQAQA-YAPPSGVVVTAEQRI-DRIRLAILQA----Q------L----A---------------EF 281 (395) T ss_pred --CCcc----ccccccccc-cccccccccccchhH-HHHHHHHHhh----c------c----c---------------cC Confidence 1111 001111100 000001111111111 0111111000 0 0 0 00 Q ss_pred ccccccccchhhcCCccCCHHHHHHHHHHHHhhccCCCcccccccCeEEecchHHHHHHHHHhhhccCCCC------cce Q lcl|NC_020866. 210 ANVGFGFWQMAYGSKQTLDGTAYAAARAALSGMKGDYGRPLGLMPNLLVVPPALEEAGRKILNSENASGGE------TNP 283 (297) Q Consensus 210 ~~~G~~l~q~a~~~~~~l~~~~l~aar~aM~~~k~~~G~~L~i~P~~LvVp~~le~~A~~ll~~~~~~~g~------~N~ 283 (297) .+. -|- ++.. ...++++.||-+|++|.-.|.-. .++.+. ...++.+...+.+. .+- T Consensus 282 ~~~---~~v--------mn~~----~~~~l~~lkd~~G~~i~~~~~~~-~~~~l~--G~pVv~~~~~~~~~~~~gd~~~~ 343 (395) T protein:vir:43 282 PAS---GIV--------LNPI----DWALIELNKDAENRYIIGSPQNG-TTPTLW--RLPVVETQAITQDEFLTGAFSLG 343 (395) T ss_pred CCc---EEE--------EcHH----HHHHHHHhhccCCceeccccccC-CCceec--ceeeEEcCCCCCCcEEEEeccce Confidence 000 011 1222 23456677888888765221100 000000 11122233333322 111 Q ss_pred e----cceeeEEeccccC Q lcl|NC_020866. 
284 W----KGTAELLVVPWLA 297 (297) Q Consensus 284 ~----~~~~~~iv~p~La 297 (297) + ++-+++.+++.-. T Consensus 344 ~~~~~~~~~~i~~~~~~~ 361 (395) T protein:vir:43 344 AQIFDRMDIEVLVSTEND 361 (395) T ss_pred EEEEEecceEEEEecccc Confidence 1 2334455554322 No 73 >protein:vir:104342 Length: 314 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398971;genbank:gi:81343955;genbank:GeneID:3778874 Probab=35.41 E-value=1.2 Score=20.09 Aligned_cols=206 Identities=12% Similarity=0.151 Sum_probs=98.7 Q ss_pred CCcC-HHHHHHHHHHHHHH----------H---------HHHHhh--cchhhccee---eeecCCccceecccccCCCcc Q lcl|NC_020866. 1 MQVT-AANLDALRVGFKTS----------F---------QGALDQ--APSQYLRLT---TVVPSSTKEQRYGWMGKIPNV 55 (297) Q Consensus 1 M~i~-~~~l~~l~~~~~~~----------f---------~~a~~~--a~~~~~~~a---~~v~S~~~~~~y~~Lg~~P~l 55 (297) |.++ .+.+.++...+.+. | ++-|+. ++-+|+++- +.++.-..+-+|...... +. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~d~~~~fl~~ql~~id~~v~e~~~~~~~~~~~i~v~~~~~~~~et~~~~~~e~~-G~ 79 (314) T protein:vir:10 1 MAIKFDAEQAKITTHLEQMGVEKADAAGIWAVSQLTAALNRAYEKEYAENSVVNIFPVTNEIPGHAKYFEYPEFDGV-GI 79 (314) T ss_pred CccchHHHHHHHHHHHHhhcccchhhhHHHHHHHHHHHHHHHhhhhccccccceeeccccCCCCceeEEEeeeeccc-cc Confidence 5555 34444443332111 1 111211 122344332 122222233344444322 23 Q ss_pred hhccccee----eeeeccccceeeeecccceeeccHHHhhccC---cchhHHHHHHHHHHHHhhHHHHHHHHHhcccCcc Q lcl|NC_020866. 56 REWIGPRA----IQNLTESDYSIREKPWELTIGVDRDDIETDN---LGIYSPLFQEMGRSAGSKWDMLVFELLKLGFATE 128 (297) Q Consensus 56 rew~Ge~~----~~~l~~~~~~i~n~~fe~tv~v~R~~i~dD~---lG~~~~~~~~~g~aaa~~~~~lv~~lL~~G~~~~ 128 (297) -+|+|++. ..+..-...+.+...|+..+.++.++++--. 
+.+=++....+.++.++++|+++| .| T Consensus 80 a~~~~d~~~dip~vd~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f----~G---- 151 (314) T protein:vir:10 80 AQIIADYSDDLPLVDAFMTEKQGKVFRFGNAFLISTDEIKAGAATGQSLSARKQALAFEAHDNLLDKLVW----SG---- 151 (314) T ss_pred eeeeCCcccccceeecccceeEEEEEEEEeeEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEE----ee---- Confidence 35665542 2333446677888999999999999998752 333345556666666777777666 12 Q ss_pred ccCcccccccccccccccccceeeccccccchhHHHHHHHHHHHHHHHhhcccccchhhhccccccccccccceeeeccc Q lcl|NC_020866. 129 CYDGQNFFDTDHPVLDEDGKTVTVSNTGGGTGTPWFLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLYGTDA 208 (297) Q Consensus 129 ~~DGk~~F~tdH~~~~g~~~~~svsn~~ag~~~awylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~g~d~ 208 (297) |+.|++.. -...+++.. . T Consensus 152 --------~~~~g~~G----LlN~p~v~~------------------------------------------------~-- 169 (314) T protein:vir:10 152 --------SAPHGIVS----VFDQPNINN------------------------------------------------V-- 169 (314) T ss_pred --------ccccccee----EeecCCCcc------------------------------------------------c-- Confidence 12221110 000000000 0 Q ss_pred cccccccccchhhcCCccCCHHHHHHHHHHHHhhccCCCcccccccCeEEecchHHHHHHHHHhhhccCCCC-------- Q lcl|NC_020866. 209 RANVGFGFWQMAYGSKQTLDGTAYAAARAALSGMKGDYGRPLGLMPNLLVVPPALEEAGRKILNSENASGGE-------- 280 (297) Q Consensus 209 r~~~G~~l~q~a~~~~~~l~~~~l~aar~aM~~~k~~~G~~L~i~P~~LvVp~~le~~A~~ll~~~~~~~g~-------- 280 (297) + +-+-|. +..--.+-+.++++++..|.. | ...|+.|++||++.. +|.+...+.|. T Consensus 170 --~-~~~~Wa-----T~~ei~~Di~~~~~~l~~~s~--g---~~~p~~l~Lpp~~~~----~L~~~~~~~~~tvl~~l~~ 232 (314) T protein:vir:10 170 --V-ATPNWS-----VPQNAIDDVTAMIDAVESSTQ--G---LHHVTDILLPASARR----VMQGLVPQTNLSYGELFTR 232 (314) T ss_pred --c-CCCCcc-----cHHHHHHHHHHHHHHHHHhcC--c---cccceeEEecHHHHH----hhcccccCCCccHHHHHHH Confidence 0 000131 111113445666666666543 2 357999999998764 45443322221 Q ss_pred cceecceeeEEeccccC Q lcl|NC_020866. 281 TNPWKGTAELLVVPWLA 297 (297) Q Consensus 281 ~N~~~~~~~~iv~p~La 297 (297) .|| .+++.-.|+|. 
T Consensus 233 n~~---~l~I~~~~el~ 246 (314) T protein:vir:10 233 NNP---GLTIRFLQFLD 246 (314) T ss_pred hCC---CcEEEEccccc Confidence 133 47888889998 No 74 >protein:vir:94494 Length: 274 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240676;genbank:gi:66396348;genbank:GeneID:5133758 Probab=32.38 E-value=1.4 Score=19.73 Aligned_cols=186 Identities=12% Similarity=0.056 Sum_probs=97.2 Q ss_pred CC---------cCHHHHHHHHHHHHHHHHHHHhhcchhhcceeeeecC----CccceecccccCCCcchhcc--cceeee Q lcl|NC_020866. 1 MQ---------VTAANLDALRVGFKTSFQGALDQAPSQYLRLTTVVPS----STKEQRYGWMGKIPNVREWI--GPRAIQ 65 (297) Q Consensus 1 M~---------i~~~~l~~l~~~~~~~f~~a~~~a~~~~~~~a~~v~S----~~~~~~y~~Lg~~P~lrew~--Ge~~~~ 65 (297) |. |.|+.+.+... ..+.+.+ -+..++..-.. ....-++...+..+...++. .+.... T Consensus 1 ma~~~T~~~d~iiPev~~~~v~---~~~~~~l-----~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~a~~~~~g~~i~~~ 72 (274) T protein:vir:94 1 MPQGLTKTSDQIIPEVLAPMMQ---AQLEKKL-----RFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTD 72 (274) T ss_pred CCccceehhheechHHHHHHHH---Hhhhhhh-----hhcccceecccccCCCCCEEEEeeecCCCccccccCCCccccc Confidence 43 34444433322 1111111 13334333111 01111111112222233332 445667 Q ss_pred eeccccceeeeecccceeeccHHHhhccCcchhHHHHHHHHHHHHhhHHHHHHHHHhcccCccccCcccccccccccccc Q lcl|NC_020866. 66 NLTESDYSIREKPWELTIGVDRDDIETDNLGIYSPLFQEMGRSAGSKWDMLVFELLKLGFATECYDGQNFFDTDHPVLDE 145 (297) Q Consensus 66 ~l~~~~~~i~n~~fe~tv~v~R~~i~dD~lG~~~~~~~~~g~aaa~~~~~lv~~lL~~G~~~~~~DGk~~F~tdH~~~~g 145 (297) .+....-+.+.+.++..++|+-.+...---.......+++|++-++.-|.-++..|..+ +. 
T Consensus 73 ~lt~~~~~~~i~~~~~~~~i~D~~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a--~~----------------- 133 (274) T protein:vir:94 73 ILETKKREAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGA--KL----------------- 133 (274) T ss_pred ccccceeEEEeeeecceecccHHHHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhcc--Cc----------------- Confidence 77777777888889999999988877644445677778888888888888777766441 00 Q ss_pred cccceeeccccccchhHHHHHHHHHHHHHHHhhcccccchhhhccccccccccccceeeeccccccccccccchhhcCCc Q lcl|NC_020866. 146 DGKTVTVSNTGGGTGTPWFLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLYGTDARANVGFGFWQMAYGSKQ 225 (297) Q Consensus 146 ~~~~~svsn~~ag~~~awylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~g~d~r~~~G~~l~q~a~~~~~ 225 (297) .+ +.. T Consensus 134 -----~~----------------------------------------------------------------------~~~ 138 (274) T protein:vir:94 134 -----TV----------------------------------------------------------------------NAD 138 (274) T ss_pred -----cc----------------------------------------------------------------------ccc Confidence 00 011 Q ss_pred cCCHHHHHHHHHHHHhhccCCCcccccccCeEEecchHHHHHHHHHhhhcc--CCCCcc--------eecceeeEEeccc Q lcl|NC_020866. 226 TLDGTAYAAARAALSGMKGDYGRPLGLMPNLLVVPPALEEAGRKILNSENA--SGGETN--------PWKGTAELLVVPW 295 (297) Q Consensus 226 ~l~~~~l~aar~aM~~~k~~~G~~L~i~P~~LvVp~~le~~A~~ll~~~~~--~~g~~N--------~~~~~~~~iv~p~ 295 (297) +++.+.+..|++.+.... ..++.|+|+|.....-++-..-+.+ .....+ -+.| ++|++++. T Consensus 139 ~~~~d~i~dA~~~l~d~~--------~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G-~~Vi~s~~ 209 (274) T protein:vir:94 139 ITKLNGLQSAIDKFNDED--------LEPMVLFVNPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALG-AIIVRTNK 209 (274) T ss_pred ccCHHHHHHHHHHhhccC--------CCceEEEeCHHHHHHHHhhhhhhccccCcccccceeccccceecC-eeEEEcCC Confidence 233455566666664321 2568999999977665543211111 111122 2334 58999988 Q ss_pred cC Q lcl|NC_020866. 
296 LA 297 (297) Q Consensus 296 La 297 (297) +- T Consensus 210 ~p 211 (274) T protein:vir:94 210 LE 211 (274) T ss_pred CC Confidence 87 No 75 >protein:vir:97433 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240749;genbank:gi:66396420;genbank:GeneID:5133789 Probab=32.38 E-value=1.4 Score=19.73 Aligned_cols=186 Identities=12% Similarity=0.056 Sum_probs=97.2 Q ss_pred CC---------cCHHHHHHHHHHHHHHHHHHHhhcchhhcceeeeecC----CccceecccccCCCcchhcc--cceeee Q lcl|NC_020866. 1 MQ---------VTAANLDALRVGFKTSFQGALDQAPSQYLRLTTVVPS----STKEQRYGWMGKIPNVREWI--GPRAIQ 65 (297) Q Consensus 1 M~---------i~~~~l~~l~~~~~~~f~~a~~~a~~~~~~~a~~v~S----~~~~~~y~~Lg~~P~lrew~--Ge~~~~ 65 (297) |. |.|+.+.+... ..+.+.+ -+..++..-.. ....-++...+..+...++. .+.... T Consensus 1 ma~~~T~~~d~iiPev~~~~v~---~~~~~~l-----~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~a~~~~~g~~i~~~ 72 (274) T protein:vir:97 1 MPQGLTKTSDQIIPEVLAPMMQ---AQLEKKL-----RFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTD 72 (274) T ss_pred CCccceehhheechHHHHHHHH---Hhhhhhh-----hhcccceecccccCCCCCEEEEeeecCCCccccccCCCccccc Confidence 43 34444433322 1111111 13334333111 01111111112222233332 445667 Q ss_pred eeccccceeeeecccceeeccHHHhhccCcchhHHHHHHHHHHHHhhHHHHHHHHHhcccCccccCcccccccccccccc Q lcl|NC_020866. 66 NLTESDYSIREKPWELTIGVDRDDIETDNLGIYSPLFQEMGRSAGSKWDMLVFELLKLGFATECYDGQNFFDTDHPVLDE 145 (297) Q Consensus 66 ~l~~~~~~i~n~~fe~tv~v~R~~i~dD~lG~~~~~~~~~g~aaa~~~~~lv~~lL~~G~~~~~~DGk~~F~tdH~~~~g 145 (297) .+....-+.+.+.++..++|+-.+...---.......+++|++-++.-|.-++..|..+ +. 
T Consensus 73 ~lt~~~~~~~i~~~~~~~~i~D~~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a--~~----------------- 133 (274) T protein:vir:97 73 ILETKKREAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGA--KL----------------- 133 (274) T ss_pred ccccceeEEEeeeecceecccHHHHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhcc--Cc----------------- Confidence 77777777888889999999988877644445677778888888888888777766441 00 Q ss_pred cccceeeccccccchhHHHHHHHHHHHHHHHhhcccccchhhhccccccccccccceeeeccccccccccccchhhcCCc Q lcl|NC_020866. 146 DGKTVTVSNTGGGTGTPWFLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLYGTDARANVGFGFWQMAYGSKQ 225 (297) Q Consensus 146 ~~~~~svsn~~ag~~~awylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~g~d~r~~~G~~l~q~a~~~~~ 225 (297) .+ +.. T Consensus 134 -----~~----------------------------------------------------------------------~~~ 138 (274) T protein:vir:97 134 -----TV----------------------------------------------------------------------NAD 138 (274) T ss_pred -----cc----------------------------------------------------------------------ccc Confidence 00 011 Q ss_pred cCCHHHHHHHHHHHHhhccCCCcccccccCeEEecchHHHHHHHHHhhhcc--CCCCcc--------eecceeeEEeccc Q lcl|NC_020866. 226 TLDGTAYAAARAALSGMKGDYGRPLGLMPNLLVVPPALEEAGRKILNSENA--SGGETN--------PWKGTAELLVVPW 295 (297) Q Consensus 226 ~l~~~~l~aar~aM~~~k~~~G~~L~i~P~~LvVp~~le~~A~~ll~~~~~--~~g~~N--------~~~~~~~~iv~p~ 295 (297) +++.+.+..|++.+.... ..++.|+|+|.....-++-..-+.+ .....+ -+.| ++|++++. T Consensus 139 ~~~~d~i~dA~~~l~d~~--------~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G-~~Vi~s~~ 209 (274) T protein:vir:97 139 ITKLNGLQSAIDKFNDED--------LEPMVLFVNPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALG-AIIVRTNK 209 (274) T ss_pred ccCHHHHHHHHHHhhccC--------CCceEEEeCHHHHHHHHhhhhhhccccCcccccceeccccceecC-eeEEEcCC Confidence 233455566666664321 2568999999977665543211111 111122 2334 58999988 Q ss_pred cC Q lcl|NC_020866. 
296 LA 297 (297) Q Consensus 296 La 297 (297) +- T Consensus 210 ~p 211 (274) T protein:vir:97 210 LE 211 (274) T ss_pred CC Confidence 87 No 76 >protein:vir:7990 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817344;genbank:gi:29565772;genbank:GeneID:1258978 Probab=30.18 E-value=1.6 Score=19.47 Aligned_cols=192 Identities=13% Similarity=0.085 Sum_probs=87.3 Q ss_pred CCcCH---HHHHHHHHHHHHHHHHHHhhc---chhhcceeeeecCCccceecccccCCCcchhcc---cceeeeeecccc Q lcl|NC_020866. 1 MQVTA---ANLDALRVGFKTSFQGALDQA---PSQYLRLTTVVPSSTKEQRYGWMGKIPNVREWI---GPRAIQNLTESD 71 (297) Q Consensus 1 M~i~~---~~l~~l~~~~~~~f~~a~~~a---~~~~~~~a~~v~S~~~~~~y~~Lg~~P~lrew~---Ge~~~~~l~~~~ 71 (297) |.++. +.... .+-..|++.+--+ ...|... .....+-+...++. +...+.. +......+.+.. T Consensus 1 MA~~~~~pei~~~---~v~~~~~~~lv~~~l~~~~~~~~----~~~GdTv~ip~~~~-~~~~d~~~~~~~~~~~~~~~~~ 72 (273) T protein:vir:79 1 MAFNNFIPELWSD---MLLEEWTAQTVFANLVNREYEGI----ASKGNVVHIAGVVA-PTVKDYKAAGRQTSADAISDTG 72 (273) T ss_pred CcchhhhHHHHHH---HHHHHHHhhccchhhhhcccccc----ccCCcEEEEeecCc-ccccccccCCCccCccccccce Confidence 99973 32211 1223333332111 1112211 11112222222222 2233332 223455667777 Q ss_pred ceeeeecc-cceeeccHHHhhccCcchhHHHHHHHHHHHHhhHHHHHHHHHhcccCccccCcccccccccccccccccce Q lcl|NC_020866. 72 YSIREKPW-ELTIGVDRDDIETDNLGIYSPLFQEMGRSAGSKWDMLVFELLKLGFATECYDGQNFFDTDHPVLDEDGKTV 150 (297) Q Consensus 72 ~~i~n~~f-e~tv~v~R~~i~dD~lG~~~~~~~~~g~aaa~~~~~lv~~lL~~G~~~~~~DGk~~F~tdH~~~~g~~~~~ 150 (297) -+++..++ ...+.|+..+-.-+... +....++++++-++--|..++.++..+.+. +.. . 
T Consensus 73 ~~~tid~~~~~~~~i~d~d~~~~~~~-~~~~~~~~~~ala~~vD~~i~~~~~~a~~~--------------~~~--~--- 132 (273) T protein:vir:79 73 VDLLIDQEKSIDFLVDDIDRVQVAGS-LEAYTRAGATALATDTDKFIADMLVDNGTA--------------LTG--S--- 132 (273) T ss_pred EEEEEeeecccceeeccHHHHhhccc-HHHHHHHHHHHHHHHHHHHHHHHHhhcccc--------------ccc--c--- Confidence 77777553 66677775554444443 467888888888888888888887552100 000 0 Q ss_pred eeccccccchhHHHHHHHHHHHHHHHhhcccccchhhhccccccccccccceeeeccccccccccccchhhcCCccCCHH Q lcl|NC_020866. 151 TVSNTGGGTGTPWFLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLYGTDARANVGFGFWQMAYGSKQTLDGT 230 (297) Q Consensus 151 svsn~~ag~~~awylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~g~d~r~~~G~~l~q~a~~~~~~l~~~ 230 (297) .. . +..-..+ T Consensus 133 ---~~--~-----------------------------------------------------------------~~~~~~~ 142 (273) T protein:vir:79 133 ---AP--S-----------------------------------------------------------------DADDAFD 142 (273) T ss_pred ---cc--c-----------------------------------------------------------------chhhHHH Confidence 00 0 0000012 Q ss_pred HHHHHHHHHHhhccCCCcccccccCeEEecchHHHHHHH---HHhhhccCCCCcceec-c------eeeEEeccccC Q lcl|NC_020866. 231 AYAAARAALSGMKGDYGRPLGLMPNLLVVPPALEEAGRK---ILNSENASGGETNPWK-G------TAELLVVPWLA 297 (297) Q Consensus 231 ~l~aar~aM~~~k~~~G~~L~i~P~~LvVp~~le~~A~~---ll~~~~~~~g~~N~~~-~------~~~~iv~p~La 297 (297) .+..++++|.+.. -|- ..++|||+|.....-.+ .+... 
...|+.+.++ | =++++.++.|- T Consensus 143 ~i~~a~~~ld~~~----vP~--~~R~lvv~p~~~~~Ll~~~~~~~~~-~~~~~~~~l~~G~ig~~~G~~i~~s~~lp 212 (273) T protein:vir:79 143 LIASALKELTKAN----VPN--VGRVVVVNAEMAFWLRSSGSKLTSA-DTSGDAAGLRAGTIGNLLGARIVESNNLR 212 (273) T ss_pred HHHHHHHHhhhcc----CCc--cCcEEEECHHHHHHHhhchhhhhhh-hhcccccceeeeEeeEEeceEEEeccccc Confidence 3344444443332 221 23488899877664322 12111 1122333332 1 16888887775 No 77 >protein:vir:739 Length: 231 # NCBI annotation: major structural protein 4 # Family: family:all:522 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108716;genbank:gi:13487838;genbank:GeneID:920884 Probab=30.09 E-value=1.6 Score=19.46 Aligned_cols=159 Identities=16% Similarity=0.081 Sum_probs=81.6 Q ss_pred ceeeeecCCccceecccccCCCcchhcccceeeeeeccccceeeeecccceeeccHHHhhccCcchhHHHHHHHHHHHHh Q lcl|NC_020866. 32 RLTTVVPSSTKEQRYGWMGKIPNVREWIGPRAIQNLTESDYSIREKPWELTIGVDRDDIETDNLGIYSPLFQEMGRSAGS 111 (297) Q Consensus 32 ~~a~~v~S~~~~~~y~~Lg~~P~lrew~Ge~~~~~l~~~~~~i~n~~fe~tv~v~R~~i~dD~lG~~~~~~~~~g~aaa~ 111 (297) .=...+..+ .+-..|+|+.-.+-| -.+.....|+-..-+.+.++.++.+.|+-.+..---=.-+....+++|.+-++ T Consensus 1 ~~~~~~Gdt--it~P~~iGda~~v~e-G~~i~~~~l~~t~~~atIk~~gk~~~itD~a~l~~~gDp~~ea~~Q~~~~iA~ 77 (231) T protein:vir:73 1 ENGINLANL--CEYPNDIGDAADVAE-GGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLAN 77 (231) T ss_pred CccccCCce--EEecccccchhhhcC-CCcCChhhccccceeeeEeeeccceeeeHHHHhhccCchHHHHHHHHHHHHHH Confidence 001111111 111134666644444 23456667777777888899999999998887531111234455666666666 Q ss_pred hHHHHHHHHHhcccCccccCcccccccccccccccccceeeccccccchhHHHHHHHHHHHHHHHhhcccccchhhhccc Q lcl|NC_020866. 112 KWDMLVFELLKLGFATECYDGQNFFDTDHPVLDEDGKTVTVSNTGGGTGTPWFLLDTTRALKPIILQKRRDFQFVSKTKL 191 (297) Q Consensus 112 ~~~~lv~~lL~~G~~~~~~DGk~~F~tdH~~~~g~~~~~svsn~~ag~~~awylld~s~alkpiI~q~r~~~~~~~~~~~ 191 (297) ..|.-++..|... +. 
T Consensus 78 kvD~di~~~~~~a--~l--------------------------------------------------------------- 92 (231) T protein:vir:73 78 KVDDDLLKAAKTT--SQ--------------------------------------------------------------- 92 (231) T ss_pred hhhHHHHHhhccc--cc--------------------------------------------------------------- Confidence 6665444333210 00 Q ss_pred cccccccccceeeeccccccccccccchhhcCCccCCHHHHHHHHHHHHhhccCCCcccccccCeEEecchHHHHHHHHH Q lcl|NC_020866. 192 DDDHVFMNKEFLYGTDARANVGFGFWQMAYGSKQTLDGTAYAAARAALSGMKGDYGRPLGLMPNLLVVPPALEEAGRKIL 271 (297) Q Consensus 192 ~~~~vf~~~~~~~g~d~r~~~G~~l~q~a~~~~~~l~~~~l~aar~aM~~~k~~~G~~L~i~P~~LvVp~~le~~A~~ll 271 (297) ....+++.+.+..|.+.+... + -.|..++|+|.--..=|+.. T Consensus 93 ------------------------------~~~~~~t~d~i~~A~~~fgde---~-----~~~~vivv~p~~~~~Lrk~~ 134 (231) T protein:vir:73 93 ------------------------------TVSTKANVDGVQAALDIFNDE---D-----AQAYVLIVNPKDAAKIRKDA 134 (231) T ss_pred ------------------------------cccccccHHHHHHHHHHhccc---c-----ccceEEEEcchHHHhhhhcc Confidence 001234555566666665432 1 23567888886544444433 Q ss_pred hhhccC---------CCCcceecceeeEEeccccC Q lcl|NC_020866. 272 NSENAS---------GGETNPWKGTAELLVVPWLA 297 (297) Q Consensus 272 ~~~~~~---------~g~~N~~~~~~~~iv~p~La 297 (297) ...... +|..=-+.| ++|++++.+. T Consensus 135 ~~~~~~~~~g~~i~~~G~iG~i~G-~~Vi~S~~~~ 168 (231) T protein:vir:73 135 NAKNIGSEVGANALINGTYADVLG-AQIVRSKKLA 168 (231) T ss_pred chhhhhhhhccceeeecccceEcc-eEEEEcCCCC Confidence 222211 112223344 6888888888 No 78 >protein:vir:80180 Length: 381 # NCBI annotation: capsid protein # Family: family:all:2203 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285797;genbank:gi:148747831;genbank:GeneID:5220456 Probab=26.98 E-value=1.9 Score=19.07 Aligned_cols=214 Identities=15% Similarity=0.130 Sum_probs=101.9 Q ss_pred CCcCHHHHHHHH-HHHHHHHHHHHhhcchhhcceeeeec-C--CccceecccccCCCcchhcc--cceeeeeecccccee Q lcl|NC_020866. 
1 MQVTAANLDALR-VGFKTSFQGALDQAPSQYLRLTTVVP-S--STKEQRYGWMGKIPNVREWI--GPRAIQNLTESDYSI 74 (297) Q Consensus 1 M~i~~~~l~~l~-~~~~~~f~~a~~~a~~~~~~~a~~v~-S--~~~~~~y~~Lg~~P~lrew~--Ge~~~~~l~~~~~~i 74 (297) =-++.++.+++. .=+....++.|+.. .-+..++.+.. + ..++-+...+|. |...... ++..+.++.+...++ T Consensus 13 ~~~~~t~~~~fiPev~s~~v~~~l~~~-lv~~~l~~~~~~~~~~GdTV~ip~~g~-~~a~d~~~g~~i~~~~~~~~~~~i 90 (381) T protein:vir:80 13 SAVDLSNVQVFIPEVWSSEVRMFRDQK-FAALEATKKIPFEGKKGDLIHIPNISR-AAVYDKQPQTPVNLQARTDSEFTF 90 (381) T ss_pred cccchhhHHhhhhHHHHHHHHHHHHHh-hhhhhccccccceeecCceEEeeccCc-ceeeeecCCCcccccccCCceEEE Confidence 112222222211 12223333333221 11112221110 0 111222333442 3333322 455677778887778 Q ss_pred eeecc-cceeeccHHHhhccCcchhHHHHHHHHHHHHhhHHHHHHHHHhcccCccccCcccccccccccccccccceeec Q lcl|NC_020866. 75 REKPW-ELTIGVDRDDIETDNLGIYSPLFQEMGRSAGSKWDMLVFELLKLGFATECYDGQNFFDTDHPVLDEDGKTVTVS 153 (297) Q Consensus 75 ~n~~f-e~tv~v~R~~i~dD~lG~~~~~~~~~g~aaa~~~~~lv~~lL~~G~~~~~~DGk~~F~tdH~~~~g~~~~~svs 153 (297) +..++ ...+.|+..|.....+..-....++++.+-++.-|+.++.++....... .+........+. T Consensus 91 tID~~~~~~~~Idd~D~~~~~~D~~~~~~~~~~~aLA~~~D~~i~~~~~~~~~~~-----------~~~~~t~~~~i~-- 157 (381) T protein:vir:80 91 TVTKYKESSFMIEDIVNTQASYTLRQYYTKEAGYALARDMDNFALAHRAVINAFP-----------SQRIYSYDTTLG-- 157 (381) T ss_pred EEeeeeecceeechHHHHhhccChHHHHHHHHHHHHHHHHHHHHHHHHhhccccc-----------cccccccccccc-- Confidence 77554 4567888888888888888999999999999999999998876522110 000000000000 Q ss_pred cccccchhHHHHHHHHHHHHHHHhhcccccchhhhccccccccccccceeeeccccccccccccchhhcCCccCCHHHHH Q lcl|NC_020866. 154 NTGGGTGTPWFLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLYGTDARANVGFGFWQMAYGSKQTLDGTAYA 233 (297) Q Consensus 154 n~~ag~~~awylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~g~d~r~~~G~~l~q~a~~~~~~l~~~~l~ 233 (297) +..+ ....-.....++.+.|. 
T Consensus 158 ---------------------------------------~~~~--------------------~~~~t~~~~~~t~~~i~ 178 (381) T protein:vir:80 158 ---------------------------------------DGTV--------------------NAHLTGTPAPLTYAALL 178 (381) T ss_pred ---------------------------------------cccc--------------------ccccccchhhHHHHHHH Confidence 0000 00000112233344555 Q ss_pred HHHHHHHhhccCCCcccccccCeEEecchHHHHHHHH---HhhhccCCCCcceecc-------eeeEEeccccC Q lcl|NC_020866. 234 AARAALSGMKGDYGRPLGLMPNLLVVPPALEEAGRKI---LNSENASGGETNPWKG-------TAELLVVPWLA 297 (297) Q Consensus 234 aar~aM~~~k~~~G~~L~i~P~~LvVp~~le~~A~~l---l~~~~~~~g~~N~~~~-------~~~~iv~p~La 297 (297) +|++.|.+.. -|. ..++|||+|.....-++. ++++. +..+.+++ =++++++++|- T Consensus 179 ~a~~~Lde~~----VP~--egR~lvv~P~~~~~Ll~~~~~~~ad~---~~~~~l~~G~Ig~i~G~~Vv~Sn~lp 243 (381) T protein:vir:80 179 LAKQKLDEAD----VPQ--EGRIVMVSPAQYIDLLSINQFISVDF---SQVKPVTSGVVGTILGMEVIVTTQIG 243 (381) T ss_pred HHHHHHhhcC----CCc--CCcEEEeCHHHHHHHhhchhhhhhhh---ccchhhhceeeeEEcceEEEeecccc Confidence 5555554332 221 346889998776654432 22222 22222221 27888998886 No 79 >protein:vir:4856 Length: 293 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049396;genbank:gi:9632424;genbank:GeneID:1258532 Probab=24.95 E-value=2.1 Score=18.80 Aligned_cols=213 Identities=11% Similarity=0.033 Sum_probs=93.8 Q ss_pred CCcCHHHHH--HHHHHHHHHHHHHHhhcchhhcceeeeecCCcccee--cccccCCCcchhccccee-ee---eeccccc Q lcl|NC_020866. 1 MQVTAANLD--ALRVGFKTSFQGALDQAPSQYLRLTTVVPSSTKEQR--YGWMGKIPNVREWIGPRA-IQ---NLTESDY 72 (297) Q Consensus 1 M~i~~~~l~--~l~~~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~--y~~Lg~~P~lrew~Ge~~-~~---~l~~~~~ 72 (297) |..+...=- .+=.-+...+.+......+ -.+++..+|.+..... +......-..-+|+||-. +. 
...=..- T Consensus 5 ~~~~t~~~gg~liP~~~~~~Ii~~~~~~~~-l~~~~~~~~~~~~~g~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~~i 83 (293) T protein:vir:48 5 KTDHSGSDAGLTIPQDIRTAINTLVRQYDS-LQEYVNVENVTTLTGSRVYEKWTDITGLANIDDEAGKIADIDDPKLSLI 83 (293) T ss_pred ecccccCcCceEechhHHHHHHHHHHhhhh-hhhhceeeeccCCcceEEEEeecCCCcceeeecCCcccccccccceeEE Confidence 221110000 0000111222222222222 3455766664433333 333333333457886632 21 2333455 Q ss_pred eeeeecccceeeccHHHhhccCcchhHHHHHHHHHHHHhhHHHHHHHHHhcccCccccCcccccccccccccccccceee Q lcl|NC_020866. 73 SIREKPWELTIGVDRDDIETDNLGIYSPLFQEMGRSAGSKWDMLVFELLKLGFATECYDGQNFFDTDHPVLDEDGKTVTV 152 (297) Q Consensus 73 ~i~n~~fe~tv~v~R~~i~dD~lG~~~~~~~~~g~aaa~~~~~lv~~lL~~G~~~~~~DGk~~F~tdH~~~~g~~~~~sv 152 (297) ++..++++..+.|+|+.+.|-..++..-+.+.++++.++.+++.++.-+..+.. .....+ T Consensus 84 ~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~g~~~~~~-------------------~~~~~~- 143 (293) T protein:vir:48 84 KYTIKRYAGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILGVVDKLPT-------------------KPTLTK- 143 (293) T ss_pred EEeeeEEEEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHhHHhhccccccc-------------------cccccC- Confidence 788899999999999999998899999999999999999998877654322110 000000 Q ss_pred ccccccchhHHHHHHHHHHHHHHHhhcccccchhhhccccccccccccceeeeccccccccccccchhhcCCccCCHHHH Q lcl|NC_020866. 153 SNTGGGTGTPWFLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLYGTDARANVGFGFWQMAYGSKQTLDGTAY 232 (297) Q Consensus 153 sn~~ag~~~awylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~g~d~r~~~G~~l~q~a~~~~~~l~~~~l 232 (297) | ..+..++..-. + ..+.+ +.|=+ + T Consensus 144 -----------~-----d~i~~~~~~l~------~-------------------~~~~~---a~~vm--------n---- 167 (293) T protein:vir:48 144 -----------W-----DDIIDLEAKVD------P-------------------AIKQT---SFFLT--------N---- 167 (293) T ss_pred -----------H-----HHHHHHHHhhh------h-------------------hhcCC---CEEEE--------c---- Confidence 0 01111111100 0 00001 11211 1 Q ss_pred HHHHHHHHhhccCCCcccccc------cCeEE-ecchHHHHHHHHHhhhccCC---CCcceecc------------eeeE Q lcl|NC_020866. 
233 AAARAALSGMKGDYGRPLGLM------PNLLV-VPPALEEAGRKILNSENASG---GETNPWKG------------TAEL 290 (297) Q Consensus 233 ~aar~aM~~~k~~~G~~L~i~------P~~Lv-Vp~~le~~A~~ll~~~~~~~---g~~N~~~~------------~~~~ 290 (297) ...+..+++.||-+|++|-.. |..|+ .|.- +......+. +....+.| -+++ T Consensus 168 ~~~~~~L~~lkd~~g~~l~~~~~~~~~~~~l~G~Pv~-------~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i 240 (293) T protein:vir:48 168 TSGFTALKKVKNALGDYLMERDVKSPTGYSIAGFAVK-------EISDRWLPNASSGVMPLYFGDLKQAVTLFDRQQMSL 240 (293) T ss_pred HHHHHHHHHhhccCCceEeecCcCCCCCceecceeeE-------EecccccCCccCCceEEEEEeccceEEEEEecceEE Confidence 234567888999999976311 11111 1110 011111221 11112222 2223 Q ss_pred Eeccc---cC Q lcl|NC_020866. 291 LVVPW---LA 297 (297) Q Consensus 291 iv~p~---La 297 (297) .+++. .. T Consensus 241 ~~~~~~~~~~ 250 (293) T protein:vir:48 241 LSTNIGGGAF 250 (293) T ss_pred EEecccchhh Confidence 33221 11 No 80 >protein:vir:94576 Length: 347 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919012;genbank:gi:119637776;genbank:GeneID:5179336 Probab=24.92 E-value=2.1 Score=18.80 Aligned_cols=216 Identities=19% Similarity=0.130 Sum_probs=96.9 Q ss_pred CC---------cCH------HHHHHHHH-HHHHHHHHHHhhcchhhcceeeeecCCccceecccccCCCcchhcccceee Q lcl|NC_020866. 1 MQ---------VTA------ANLDALRV-GFKTSFQGALDQAPSQYLRLTTVVPSSTKEQRYGWMGKIPNVREWIGPRAI 64 (297) Q Consensus 1 M~---------i~~------~~l~~l~~-~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~lrew~Ge~~~ 64 (297) |. -.+ ...++||. -|..+...+|+.. 
+-+..+.++ .+-...++ -+||.+ |+-++ T Consensus 1 ma~~~~~~~~~t~~g~~~~~~d~~al~ie~~~geV~~~f~~~-s~~~~~~~~-rti~~G~s----v~~~~i----G~~~~ 70 (347) T protein:vir:94 1 MANMNGGQQMGKDQGKGMSAGDKLALFLKVFGGEVLTAFTRT-SVTMNKHLV-RSIQSGKS----AQFPVL----GRTKA 70 (347) T ss_pred CCccccccccccccccCCcccchHHHHHHHHhHHHHHHHHHH-Hhhhhhhhh-eeccccce----EEeeec----cceeE Confidence 33 111 12355554 3566666677655 333333321 01011111 234543 33333 Q ss_pred eeeccccc---eee-eecccceeeccHH--------Hhhcc--CcchhHHHHHHHHHHHHhhHHHHHHHHHhcccCcccc Q lcl|NC_020866. 65 QNLTESDY---SIR-EKPWELTIGVDRD--------DIETD--NLGIYSPLFQEMGRSAGSKWDMLVFELLKLGFATECY 130 (297) Q Consensus 65 ~~l~~~~~---~i~-n~~fe~tv~v~R~--------~i~dD--~lG~~~~~~~~~g~aaa~~~~~lv~~lL~~G~~~~~~ 130 (297) ........ ++. .++=+.++.|+-. ||++- .+..-+++.+++|++=++.-|+.++..|..+.+...- T Consensus 71 ~~~~~G~~l~~~~~~~~~~e~~ltID~~~y~~~~VddiD~~q~~~D~rs~~~~~~g~ALA~~~D~~i~~~l~~~a~~~~~ 150 (347) T protein:vir:94 71 AYLQPGENLDDKRKDMKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTAQLGESLAMAADGAVLAEMAKLCNLPTA 150 (347) T ss_pred eeeecCcCCCCCcCCccccceEEEEcchhhhhhhhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccc Confidence 22221111 111 1233344444332 22222 2467788999999999999999999776553322111 Q ss_pred CcccccccccccccccccceeeccccccchhHHHHHHHHHHHHHHHhhcccccchhhhccccccccccccceeeeccccc Q lcl|NC_020866. 131 DGQNFFDTDHPVLDEDGKTVTVSNTGGGTGTPWFLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLYGTDARA 210 (297) Q Consensus 131 DGk~~F~tdH~~~~g~~~~~svsn~~ag~~~awylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~g~d~r~ 210 (297) -.++. .|.+...++ +..++. T Consensus 151 ~~~~~--------~g~~~~~~v-~i~~~~--------------------------------------------------- 170 (347) T protein:vir:94 151 NNENI--------AGLGKAHVL-EVGDQA--------------------------------------------------- 170 (347) T ss_pred ccccc--------ccCCcceeE-eeeccc--------------------------------------------------- Confidence 11110 111100000 000000 Q ss_pred cccccccchhhcCCccCCHHHHHHHHHHHHhhccCCCcccccccCeEEecchHHHHHHHHHhhhccCCCCcceec-c--- Q lcl|NC_020866. 
211 NVGFGFWQMAYGSKQTLDGTAYAAARAALSGMKGDYGRPLGLMPNLLVVPPALEEAGRKILNSENASGGETNPWK-G--- 286 (297) Q Consensus 211 ~~G~~l~q~a~~~~~~l~~~~l~aar~aM~~~k~~~G~~L~i~P~~LvVp~~le~~A~~ll~~~~~~~g~~N~~~-~--- 286 (297) .-.+..+.+..++-.+...++..-+...-| -.+.++||+|.....-.+.+.......+..|.+. | T Consensus 171 ---------~~~~~~~~~~~~~~d~i~~a~~~Lde~dVP--~~~R~~vv~P~~y~~LLk~~~~~~~~~~~~~~~~~G~V~ 239 (347) T protein:vir:94 171 ---------TLQGDQVKLGQAIIAQLTLARAKLTGNYVP--SSDRVFYTTPDNYSAILAALMPNAANYQALIDPSTGSIR 239 (347) T ss_pred ---------cccccccccHHHHHHHHHHHHHHhhhcCCC--CCCCEEEeChHHHHHHHHhhcccccccccccccccceeE Confidence 000111233344444444444444444434 2378999999888766665544443333344432 1 Q ss_pred ---eeeEEeccccC Q lcl|NC_020866. 287 ---TAELLVVPWLA 297 (297) Q Consensus 287 ---~~~~iv~p~La 297 (297) =++|+++|+|. T Consensus 240 ~v~G~~V~~Sn~~p 253 (347) T protein:vir:94 240 NVMGFEVIEVPHLT 253 (347) T ss_pred EeeceEEEEcCccc Confidence 16899999997 No 81 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=24.47 E-value=2.2 Score=18.74 Aligned_cols=210 Identities=18% Similarity=0.195 Sum_probs=87.2 Q ss_pred CCcCHHHHHHHHHH-------------HHHHHHHHHhhcchhhcceeeeecCCccceeccccc-CCCcchhcccc-eeee Q lcl|NC_020866. 1 MQVTAANLDALRVG-------------FKTSFQGALDQAPSQYLRLTTVVPSSTKEQRYGWMG-KIPNVREWIGP-RAIQ 65 (297) Q Consensus 1 M~i~~~~l~~l~~~-------------~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg-~~P~lrew~Ge-~~~~ 65 (297) +........++..+ +...+.+.... .+....+++.+|.+....+|..+. .-+... |++| -... 
T Consensus 123 ~~~~~~~~~~~~~~~~~~~gg~~vP~~~~~~ii~~~~~-~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~E~~~~~ 200 (400) T protein:vir:38 123 RAVPTDASDAVNAGVKAADAASTIPETISNTPQRELQT-VVDLKPFTNVFQASTQKGTYPTVANATTKMV-TVAELEKNP 200 (400) T ss_pred hhhhHHHHHHHhhcccccCCcccccHHHHHHHHHHHHh-hhhhhhcceeEeccCcceEEEEEecCCCccc-ccccccccc Confidence 11111111111110 11111111111 122344566666554444444432 234443 4433 2222 Q ss_pred ee---ccccceeeeecccceeeccHHHhhccCcchhHHHHHHHHHHHHhhHHHHHHHHHhcccCccccCccccccccccc Q lcl|NC_020866. 66 NL---TESDYSIREKPWELTIGVDRDDIETDNLGIYSPLFQEMGRSAGSKWDMLVFELLKLGFATECYDGQNFFDTDHPV 142 (297) Q Consensus 66 ~l---~~~~~~i~n~~fe~tv~v~R~~i~dD~lG~~~~~~~~~g~aaa~~~~~lv~~lL~~G~~~~~~DGk~~F~tdH~~ 142 (297) .. .=..-++..++++..+.|||+.+.|-...+-+-+.+.++++.+...++.+..-...| T Consensus 201 ~~~~~~f~~i~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~~~~~~~~~~i~~~~~~~------------------ 262 (400) T protein:vir:38 201 AMAKPEFKPVNWSVETYRQALPVSQESIDDSAIDLVGLIAQNGQQIKVNTTNGAVATLLKGF------------------ 262 (400) T ss_pred ccccccceeeEeehhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhhhhccccc------------------ Confidence 22 224456778899999999999999888888888999999888887776554322211 Q ss_pred ccccccceeeccccccchhHHHHHHHHHHHHHHHhhcccccchhhhccccccccccccceeeeccccccccccccchhhc Q lcl|NC_020866. 143 LDEDGKTVTVSNTGGGTGTPWFLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLYGTDARANVGFGFWQMAYG 222 (297) Q Consensus 143 ~~g~~~~~svsn~~ag~~~awylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~g~d~r~~~G~~l~q~a~~ 222 (297) +. ...+.. . .+.-++.....+ .. + +.|-+ T Consensus 263 ---~~--~~~~~~------~--------~~~~~~~~~~~~---------~~-----------------~---a~~v~--- 291 (400) T protein:vir:38 263 ---TA--KTISSV------D--------DLKHINNVDLDP---------AY-----------------S---RVIIA--- 291 (400) T ss_pred ---cc--cccccH------H--------HHHHHHHhhhhh---------hh-----------------C---cEEEE--- Confidence 00 000000 0 011111110000 00 0 11211 Q ss_pred CCccCCHHHHHHHHHHHHhhccCCCcccccccCeEEecchHHHHHHHHHhh-----hcc---CCCCcceecceee--EEe Q lcl|NC_020866. 
223 SKQTLDGTAYAAARAALSGMKGDYGRPLGLMPNLLVVPPALEEAGRKILNS-----ENA---SGGETNPWKGTAE--LLV 292 (297) Q Consensus 223 ~~~~l~~~~l~aar~aM~~~k~~~G~~L~i~P~~LvVp~~le~~A~~ll~~-----~~~---~~g~~N~~~~~~~--~iv 292 (297) + .....++++.||.+|+|| +.|+..- + ....|+.- +.. ..|+...+.|.+. ++. T Consensus 292 -----~----~~~~~~l~~lkd~~G~~i-~~~~~~~-~-----~~~~l~G~pv~~~~~~~~~~~g~~~~~~gd~s~~~~~ 355 (400) T protein:vir:38 292 -----S----QSFYNFLDTVKDGNGRYL-LQDSILT-P-----SGKSVLGMPIAVVSDDTLGAAGEAHAFLGDIKRAILF 355 (400) T ss_pred -----c----HHHHHHHHHhhccCCCee-eecCcCC-C-----CccccccceeEEecccccCCCCceEEEEEeccccEEE Confidence 1 223455666788888876 2232110 0 00112221 111 1233333433321 112 Q ss_pred ccc--cC Q lcl|NC_020866. 293 VPW--LA 297 (297) Q Consensus 293 ~p~--La 297 (297) ..| +. T Consensus 356 ~~~~~~~ 362 (400) T protein:vir:38 356 ANRADFM 362 (400) T ss_pred EeecceE Confidence 111 11 No 82 >protein:vir:1433 Length: 435 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536362;genbank:gi:17975167;genbank:GeneID:929171 Probab=24.40 E-value=2.2 Score=18.73 Aligned_cols=230 Identities=13% Similarity=0.115 Sum_probs=89.6 Q ss_pred CCcC----HHHH-----------------HHHH------------HHHHHHHHHHHhhcchhhcce-eeeecCCccceec Q lcl|NC_020866. 1 MQVT----AANL-----------------DALR------------VGFKTSFQGALDQAPSQYLRL-TTVVPSSTKEQRY 46 (297) Q Consensus 1 M~i~----~~~l-----------------~~l~------------~~~~~~f~~a~~~a~~~~~~~-a~~v~S~~~~~~y 46 (297) |+-. ...+ .++. .-+...+-+.+... 
....++ ++.+|..+..-.| T Consensus 101 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~l~~~-~~i~~~~~~~~~~~~~~~~~ 179 (435) T protein:vir:14 101 MVRALAAARGDAQLASKLAIERGFGEEVAMSLNTLSPGAGGVLVPENLSSEVIELLRPK-SVVRKLGARTLPLSNGNITI 179 (435) T ss_pred HHHHHHhhcchhhHHHHHHHhhhhhhhhhhhcccCCcCCCccccchhHHHHHHHHHhhh-chhhhhcceeeecCCCceEE Confidence 0000 0000 0000 00111111111111 112222 5566655555566 Q ss_pred ccccCCCcchhccccee---eeeeccccceeeeecccceeeccHHHhhccCc--chhHHHHHHHHHHHHhhHHHHHHHHH Q lcl|NC_020866. 47 GWMGKIPNVREWIGPRA---IQNLTESDYSIREKPWELTIGVDRDDIETDNL--GIYSPLFQEMGRSAGSKWDMLVFELL 121 (297) Q Consensus 47 ~~Lg~~P~lrew~Ge~~---~~~l~~~~~~i~n~~fe~tv~v~R~~i~dD~l--G~~~~~~~~~g~aaa~~~~~lv~~lL 121 (297) ..+..-|.. .|+||-. -.+..=..-++..++++..+.||++.+.|-.. .+-.-+...++++.++..++.++ T Consensus 180 p~~~~~~~a-~~v~E~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~~~l~~~i~~~l~~ai~~~~d~a~l--- 255 (435) T protein:vir:14 180 PRLKGGAIV-GYIGADTDIPTTQQQFDDLKLTAKKMAALVPIANDLIKYAGVNPNVDQIVVGDLTAAIGAREDKAFI--- 255 (435) T ss_pred EEEeCCcce-eeeccCccccccccceeEEEeeeEEEEEeehhhHHHHHhhccCHHHHHHHHHHHHHHHHHHHHHHhh--- Confidence 666555553 5775532 23332234567788999999999999888533 46677888899999999887654 Q ss_pred hcccCccccCccccccccccccc--ccccceeeccccccchhHHHHHHHHHHHHHHHhhcccccchhhhccccccccccc Q lcl|NC_020866. 122 KLGFATECYDGQNFFDTDHPVLD--EDGKTVTVSNTGGGTGTPWFLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMN 199 (297) Q Consensus 122 ~~G~~~~~~DGk~~F~tdH~~~~--g~~~~~svsn~~ag~~~awylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~ 199 (297) .| ||.. ..|.+- .... ..+.....+........+....+..+..... 
T Consensus 256 -~G------~G~~----~~p~Gi~~~~~~-~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~------------------- 304 (435) T protein:vir:14 256 -RD------DGTA----NTPKGLRFWALP-SNVITASDASTLQKIETDLGKVILALENADA------------------- 304 (435) T ss_pred -cc------CCCC----ccccceeecccc-cceeccccccchhhHHHHHHHHHHHhhhccc------------------- Confidence 22 1211 011110 0000 0000111111100011111111111100000 Q ss_pred cceeeeccccccccccccchhhcCCccCCHHHHHHHHHHHHhhccCCCcccc--cccCeEEecchHHHHHHHHHhhhccC Q lcl|NC_020866. 200 KEFLYGTDARANVGFGFWQMAYGSKQTLDGTAYAAARAALSGMKGDYGRPLG--LMPNLLVVPPALEEAGRKILNSENAS 277 (297) Q Consensus 200 ~~~~~g~d~r~~~G~~l~q~a~~~~~~l~~~~l~aar~aM~~~k~~~G~~L~--i~P~~LvVp~~le~~A~~ll~~~~~~ 277 (297) .+.+. .|- ++ .....++++.||.+|++|- ..+..|. ..-++.++..+ T Consensus 305 --------~~~~~---~~v--------~n----~~~~~~L~~lkd~~G~~l~~~~~~g~l~--------G~Pv~~~~~~p 353 (435) T protein:vir:14 305 --------NLTQP---GWI--------MA----PRTFRFLEGLRDGNGNKVYPELANGMLK--------GYPVGKTTQVP 353 (435) T ss_pred --------cccCC---EEE--------Ec----HHHHHHHHHhhccCCceeccCCCCCeee--------cceeEeecccc Confidence 00000 111 11 1233567778888888753 1111111 01111112221 Q ss_pred C-------------CCc-cee---cceeeEEeccccC Q lcl|NC_020866. 278 G-------------GET-NPW---KGTAELLVVPWLA 297 (297) Q Consensus 278 ~-------------g~~-N~~---~~~~~~iv~p~La 297 (297) . |+- ..+ ++-+++.++++-. T Consensus 354 ~~~~~~~~~~~i~~gd~s~~~i~~~~~~~~~~~~~~~ 390 (435) T protein:vir:14 354 INLGETGKESEIYFTDFGDVFIGEEETLEIDYSKEAT 390 (435) T ss_pred ccccCCCccceEEEeecccEEEEEecccEEEEecccc Confidence 1 110 111 2345556666544 No 83 >protein:vir:100135 Length: 418 # NCBI annotation: gp5 # Family: family:all:585 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945035;genbank:gi:38707895;genbank:GeneID:2744182 Probab=24.34 E-value=2.2 Score=18.72 Aligned_cols=232 Identities=11% Similarity=0.021 Sum_probs=89.1 Q ss_pred CCcCH--HHHHHH---------------HHHHHHHHHHHHhhcchhhcceeeeecCCccceecccccCCCcchhcccce- Q lcl|NC_020866. 
1 MQVTA--ANLDAL---------------RVGFKTSFQGALDQAPSQYLRLTTVVPSSTKEQRYGWMGKIPNVREWIGPR- 62 (297) Q Consensus 1 M~i~~--~~l~~l---------------~~~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~lrew~Ge~- 62 (297) +.... ..+..+ -.-+...+-+.+... .....+++.+|-.....+|........--.|++|- T Consensus 120 ~~~~~~~~~~~~~~~~~~~~~~~~g~lvp~~~~~~ii~~~~~~-~~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~ 198 (418) T protein:vir:10 120 VRVRVDRKSIMNVPATVGSGVSGSNSLVVADRQAGIIAPPQRK-MTIRDLLMPGQTSSSSIEYTVETGFTNNAAAVAEGA 198 (418) T ss_pred hhhhhHHHHHHHhhhhccCCCCCCccccchhHHHHHHHHHhhh-hhHHhhcceeeccCCceeEEEEecCCCceeeeccCc Confidence 00000 000000 001111222222222 22333455555333333344333322223677543 Q ss_pred --eeeeeccccceeeeecccceeeccHHHhhccCcchhHHHHHHHHHHHHhhHHHHHHHHHhcccCccccCccccccccc Q lcl|NC_020866. 63 --AIQNLTESDYSIREKPWELTIGVDRDDIETDNLGIYSPLFQEMGRSAGSKWDMLVFELLKLGFATECYDGQNFFDTDH 140 (297) Q Consensus 63 --~~~~l~~~~~~i~n~~fe~tv~v~R~~i~dD~lG~~~~~~~~~g~aaa~~~~~lv~~lL~~G~~~~~~DGk~~F~tdH 140 (297) ...+..=..-++..++++..+.|+++.+. |--.+..-+.+.++++.++..+..++ .| ||.. .+ T Consensus 199 ~~~~~~~~f~~v~~~~~k~~~~~~is~ell~-ds~~l~~~i~~~l~~a~~~~~d~a~l----~G------~g~~----~~ 263 (418) T protein:vir:10 199 QKPTSDLKFNLKNQPVRTIAHLFKASRQILD-DAPALQSYIDGRARYGLQLTEEGQIL----KG------DGTG----AN 263 (418) T ss_pred cccccccceeeEEEeeeeEEEeehhhHHHHH-hHHHHHHHHHHHHHHHHHHHHHHHHh----cc------CCCC----cc Confidence 33333334456777899999999999775 55566667778889999988887554 23 2211 12 Q ss_pred ccccccccceeeccccccchhHHHHHHHHHHHHHHHhhcccccchhhhccccccccccccceeeeccccccccccccchh Q lcl|NC_020866. 141 PVLDEDGKTVTVSNTGGGTGTPWFLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLYGTDARANVGFGFWQMA 220 (297) Q Consensus 141 ~~~~g~~~~~svsn~~ag~~~awylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~g~d~r~~~G~~l~q~a 220 (297) |.+-............. .+..+|- .+.-++...... 
.+.+.+ |- T Consensus 264 p~Gi~~~~~~~~~~~~~-~~~~~~~-----~i~~~~~~~~~~-------------------------~~~~~~---~v-- 307 (418) T protein:vir:10 264 ILGILPQASAFMPSITL-ANATPID-----KIRLALLQAVLA-------------------------EFPATG---IV-- 307 (418) T ss_pred ccccccccccccccccc-cccccHH-----HHHHHHHhhccc-------------------------cCCCCE---EE-- Confidence 22111111011111111 1111110 011111100000 000000 11 Q ss_pred hcCCccCCHHHHHHHHHHHHhhccCCCcccccccCeEEecchHHHHHHHHHhhhccCCCC------ccee----cceeeE Q lcl|NC_020866. 221 YGSKQTLDGTAYAAARAALSGMKGDYGRPLGLMPNLLVVPPALEEAGRKILNSENASGGE------TNPW----KGTAEL 290 (297) Q Consensus 221 ~~~~~~l~~~~l~aar~aM~~~k~~~G~~L~i~P~~LvVp~~le~~A~~ll~~~~~~~g~------~N~~----~~~~~~ 290 (297) ++. .....+++.||.+|++|.-.|.-. .++.+. ...++.+...+.|. ++-+ ++-+++ T Consensus 308 ------~n~----~~~~~L~~lkd~~G~~i~~~~~~~-~~~~l~--G~pV~~~~~~p~~~~~~gd~s~~~~~~~~~~~~i 374 (418) T protein:vir:10 308 ------LNP----IDWASIELTKDSQGRYIVGNPVNG-TTPRLW--NLPVVETQAMTANEFLVGAFSMAAQIFDRMEIEV 374 (418) T ss_pred ------EcH----HHHHHHHHhhcCCCceeccccccC-CCceec--ceeeEEcCCCCCCcEEEeeccceEEEEEecceEE Confidence 112 234567778888888765222100 111110 11222233333332 1211 234555 Q ss_pred EeccccC Q lcl|NC_020866. 291 LVVPWLA 297 (297) Q Consensus 291 iv~p~La 297 (297) .++++=. T Consensus 375 ~~~~~~~ 381 (418) T protein:vir:10 375 LLSTENV 381 (418) T ss_pred EEecccc Confidence 5555422 No 84 >protein:vir:9410 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803388;genbank:gi:29028700;genbank:GeneID:1258136 Probab=23.83 E-value=2.3 Score=18.65 Aligned_cols=231 Identities=13% Similarity=0.070 Sum_probs=92.1 Q ss_pred CCcCHHHHHHHHHH------------------------HHHHHHHHHhhcchhhcceeeeecCCccceeccccc--CCCc Q lcl|NC_020866. 1 MQVTAANLDALRVG------------------------FKTSFQGALDQAPSQYLRLTTVVPSSTKEQRYGWMG--KIPN 54 (297) Q Consensus 1 M~i~~~~l~~l~~~------------------------~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg--~~P~ 54 (297) ..+..+..++.... +...+.+..... 
....++++.++-++....|.++. ..+. T Consensus 99 ~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~g~~~iP~~~~~~ii~~~~~~-~~l~~~~~~~~~~~~~~~~~~~~~~~~~~ 177 (415) T protein:vir:94 99 TKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVE-FNLDKYVTVKRVTNGSGKYPVVRQSEVAA 177 (415) T ss_pred hhhhHHHHHHHHHHhhhhhhhhhhccccccccccCcHHHHHHHHHHHHhh-hhhhhhcceeeccCCceeEEEEeecCCcc Confidence 11111112111110 111111111111 11233455555344444444432 2233 Q ss_pred chhcccce-eeee---eccccceeeeecccceeeccHHHhhccCcchhHHHHHHHHHHHHhhHHHHHHHHHhcccCcccc Q lcl|NC_020866. 55 VREWIGPR-AIQN---LTESDYSIREKPWELTIGVDRDDIETDNLGIYSPLFQEMGRSAGSKWDMLVFELLKLGFATECY 130 (297) Q Consensus 55 lrew~Ge~-~~~~---l~~~~~~i~n~~fe~tv~v~R~~i~dD~lG~~~~~~~~~g~aaa~~~~~lv~~lL~~G~~~~~~ 130 (297) -.|++|- .+.. ..=..-++..++++..+.|+|+.+.|-..++..-+.+.++++.++..++.+..-+..|-.. T Consensus 178 -~~~v~Eg~~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~~~~~~~~~~il~g~g~g~~~--- 253 (415) T protein:vir:94 178 -LEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTG--- 253 (415) T ss_pred -ceeccccccccccccccceeeEeeheeeeeechhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCccc--- Confidence 3477553 2221 2234566788999999999999999888999999999999999999988776544332111 Q ss_pred CcccccccccccccccccceeeccccccchhHHHHHHHHHHHHHHHhhcccccchhhhccccccccccccceeeeccccc Q lcl|NC_020866. 131 DGQNFFDTDHPVLDEDGKTVTVSNTGGGTGTPWFLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLYGTDARA 210 (297) Q Consensus 131 DGk~~F~tdH~~~~g~~~~~svsn~~ag~~~awylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~g~d~r~ 210 (297) ...-+ .. ...+.....+...| ..+.-+++..... .+. 
T Consensus 254 -----~~~~~-----~~---~~~~~~~~~~~~~~-----~~i~~~~~~~~~~-------------------------~~~ 290 (415) T protein:vir:94 254 -----STSSG-----FE---KEGKKLEVKKAKSL-----DDIKDAINLNVKP-------------------------NYE 290 (415) T ss_pred -----ccccc-----cc---ccccccccccccch-----HHHHHHHHhhhhh-------------------------ccC Confidence 11100 00 00011111111111 0111111110000 000 Q ss_pred cccccccchhhcCCccCCHHHHHHHHHHHHhhccCCCcccccccCeE-EecchHHHHHHHHHhhhccC---CCCcceecc Q lcl|NC_020866. 211 NVGFGFWQMAYGSKQTLDGTAYAAARAALSGMKGDYGRPLGLMPNLL-VVPPALEEAGRKILNSENAS---GGETNPWKG 286 (297) Q Consensus 211 ~~G~~l~q~a~~~~~~l~~~~l~aar~aM~~~k~~~G~~L~i~P~~L-vVp~~le~~A~~ll~~~~~~---~g~~N~~~~ 286 (297) +. .|- ++. ..+.++++.||-+|+||- .|+.. -.|..+. ...++-++..+ .|+..++.| T Consensus 291 ~~---~~v--------mn~----~~~~~l~~lkd~~G~~l~-~~~~~~~~~~~l~--G~pV~~~~~~~~~~~~~~~i~~g 352 (415) T protein:vir:94 291 HN---VAI--------VSQ----TMFAKLDKMKDKLGNYLI-QPDVKEKTQQRLL--GAKIEILPDEVLGQKGNNTLIIG 352 (415) T ss_pred CC---EEE--------EcH----HHHHHHHHhhccCCCeee-ccCcCCCCCceec--ceeeEEecccccCCCCccEEEEE Confidence 00 111 122 234566778898998863 22210 0000000 00011111112 122233333 Q ss_pred e------------eeEEeccccC Q lcl|NC_020866. 287 T------------AELLVVPWLA 297 (297) Q Consensus 287 ~------------~~~iv~p~La 297 (297) . +++..+++.. T Consensus 353 d~~~~~~~~~~~~~~v~~~~~~~ 375 (415) T protein:vir:94 353 NLKDAIVLFDRSQYQASWTDYMH 375 (415) T ss_pred ehhccEEEEeecceEEEEecccc Confidence 2 2222222222 No 85 >protein:vir:41 Length: 299 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463467;swissprot:trembl:q9t1b7;genbank:gi:16798789;uniprot:Q9T1B7;genbank:GeneID:922353 Probab=23.55 E-value=2.3 Score=18.61 Aligned_cols=229 Identities=17% Similarity=0.213 Sum_probs=96.4 Q ss_pred CCcCHHHHHH-------HHHHHHHHHHHHHhhcchhhcceeeeecCCccceecccccCCCcchhccccee---eeeeccc Q lcl|NC_020866. 
1 MQVTAANLDA-------LRVGFKTSFQGALDQAPSQYLRLTTVVPSSTKEQRYGWMGKIPNVREWIGPRA---IQNLTES 70 (297) Q Consensus 1 M~i~~~~l~~-------l~~~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~lrew~Ge~~---~~~l~~~ 70 (297) |=.|+.+... |=.-+...+.+..... +.-+++++.+|.......+..... |. -.|++|-. ..+..=. T Consensus 1 ~g~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~-s~l~~~~~~~~~~~~~~~~~~~~~-~~-a~~v~E~~~~~~~~~~f~ 77 (299) T protein:vir:41 1 MGFNPDTTTMQSAKTGSIPINISEQIITGVKNG-SAAMKLAKAVPMTKPEEEFTFMSG-VG-AFWVDEAERIQTSKPTFT 77 (299) T ss_pred CCcCCCcccccCCCceecchhHHHHHHHHHHhc-chhhhhceeeecCCCcEEEEEEcC-Cc-eeeeecCcccccccccee Confidence 2222111100 1111222222222222 225566777775554445544433 44 36775532 2333445 Q ss_pred cceeeeecccceeeccHHHhhccCcchhHHHHHHHHHHHHhhHHHHHHHHHhcccCccccCcccccccccccccccccce Q lcl|NC_020866. 71 DYSIREKPWELTIGVDRDDIETDNLGIYSPLFQEMGRSAGSKWDMLVFELLKLGFATECYDGQNFFDTDHPVLDEDGKTV 150 (297) Q Consensus 71 ~~~i~n~~fe~tv~v~R~~i~dD~lG~~~~~~~~~g~aaa~~~~~lv~~lL~~G~~~~~~DGk~~F~tdH~~~~g~~~~~ 150 (297) .-++..++++..+.|+++.+.|-...+..-+...++++.++..++.++ .|-.+ +++. ..+... . T Consensus 78 ~v~l~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l----~G~g~----~~~~-----gil~~~---~ 141 (299) T protein:vir:41 78 KAKMRSKKMGVIIPTTKENLNYSVTNFFSLMQAEIVEAFYKKFDQAVF----TGVES----PYNW-----NILKSA---T 141 (299) T ss_pred EEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHh----hcccC----cccc-----cccccc---c Confidence 567888999999999999999888888999999999999999987554 23211 1111 011100 0 Q ss_pred eeccccccchhHHHHHHHHHHHHHHHhhcccccchhhhccccccccccccceeeeccccccccccccchhhcCCccCCHH Q lcl|NC_020866. 151 TVSNTGGGTGTPWFLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLYGTDARANVGFGFWQMAYGSKQTLDGT 230 (297) Q Consensus 151 svsn~~ag~~~awylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~g~d~r~~~G~~l~q~a~~~~~~l~~~ 230 (297) ..++........| .+ +.-++.+.. ... 
+.+.+ |- ++ T Consensus 142 ~~~~~~~~~~~~~--~~----l~~~~~~l~------------~~~-------------~~~~~---~v--------~n-- 177 (299) T protein:vir:41 142 DASNLVEETANKY--DD----LNEAIGLIE------------AED-------------LEPNG---IA--------TI-- 177 (299) T ss_pred ccceeeccccccH--HH----HHHHHHhhh------------ccc-------------CCcCE---EE--------Ec-- Confidence 0111111111100 11 111111000 000 00000 11 11 Q ss_pred HHHHHHHHHHhhccCCCcccccccCeEEecchHHHHHHHHHhhhccCCCCcce--ecce-----------ee--EEeccc Q lcl|NC_020866. 231 AYAAARAALSGMKGDYGRPLGLMPNLLVVPPALEEAGRKILNSENASGGETNP--WKGT-----------AE--LLVVPW 295 (297) Q Consensus 231 ~l~aar~aM~~~k~~~G~~L~i~P~~LvVp~~le~~A~~ll~~~~~~~g~~N~--~~~~-----------~~--~iv~p~ 295 (297) ...+.++++.|+-+|+||-..+ .---.+.+ ...-++.++..+.++.++ +.|. ++ +.-++. T Consensus 178 --~~~~~~L~~lkd~~G~~l~~~~-~~~~~~~l--~G~PV~~~~~~~~~~~~~~~~~gdfs~~~i~~~~~~~i~~~~~~~ 252 (299) T protein:vir:41 178 --RKQRVKYRSTKDGNGMPIFNTA-TSNGVDDV--LGLPIAYTPKYTFGDKDISELVGDWNQAYYGILRGVEYEILTEAT 252 (299) T ss_pred --HHHHHHHHHhhccCCceeecCC-cCCCCcee--cceeeEEecccCCCCCceEEEEEecccEEEEEecCcEEEEeeccc Confidence 2346677888999999874321 10000000 011122223333333211 1111 11 111111 Q ss_pred cC Q lcl|NC_020866. 296 LA 297 (297) Q Consensus 296 La 297 (297) +. T Consensus 253 ~~ 254 (299) T protein:vir:41 253 LT 254 (299) T ss_pred cc Confidence 11 No 86 >protein:vir:5739 Length: 366 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892050;genbank:gi:33770513;interpro:IPR006444;uniprot:Q7Y410;genbank:GeneID:1732928 Probab=22.94 E-value=2.4 Score=18.52 Aligned_cols=233 Identities=14% Similarity=0.157 Sum_probs=93.8 Q ss_pred CCcCHHHHHH---HHHHHHHHHHHHHhhcchhhcce-eeeecCCccceecccccCCCcchhccccee---eeeeccccce Q lcl|NC_020866. 1 MQVTAANLDA---LRVGFKTSFQGALDQAPSQYLRL-TTVVPSSTKEQRYGWMGKIPNVREWIGPRA---IQNLTESDYS 73 (297) Q Consensus 1 M~i~~~~l~~---l~~~~~~~f~~a~~~a~~~~~~~-a~~v~S~~~~~~y~~Lg~~P~lrew~Ge~~---~~~l~~~~~~ 73 (297) |.++...=.+ +=+-+...|.+.+.. 
.+..+++ ++.+|..+..-.+..+..-|.- .|++|-. -.+..=..-+ T Consensus 64 ~a~~~~~~~Gg~lvP~~~~~~ii~~l~~-~s~l~~lg~~~v~~~~g~~~~p~~t~~~~a-~wv~E~~~~~~s~~~f~~i~ 141 (366) T protein:vir:57 64 MAISTAAGSGGALIPQNMQNEVIELLRD-RTVVRILGARSIPLPNGNLSMPRLSGGATA-GYVGEGKDVVATGATFDDVK 141 (366) T ss_pred hhccccccCCccccchhHHHHHHHHHhh-hcchhhhceeeeecCCCceEEEEEeCCcce-eeeccCccccccccceeEEE Confidence 1111000000 000111222222221 1223444 5556644444455544444443 5775532 2333334567 Q ss_pred eeeecccceeeccHHHhhccCcchhHHHHHHHHHHHHhhHHHHHHHHHhcccCc-cccCcccccccccccccccccceee Q lcl|NC_020866. 74 IREKPWELTIGVDRDDIETDNLGIYSPLFQEMGRSAGSKWDMLVFELLKLGFAT-ECYDGQNFFDTDHPVLDEDGKTVTV 152 (297) Q Consensus 74 i~n~~fe~tv~v~R~~i~dD~lG~~~~~~~~~g~aaa~~~~~lv~~lL~~G~~~-~~~DGk~~F~tdH~~~~g~~~~~sv 152 (297) ++.++++..+.|+|+.+.|-...+..-+.+.++++.++.+|..++ .|..+ .-..| +++. .+.....+ T Consensus 142 ~~~~k~~~~~~iS~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l----~G~G~~~~p~G--i~~~------~~~~~~~~ 209 (366) T protein:vir:57 142 LSAKTMIALVPVSNQLIGRAGFNVEQLLLGDILSAIATREDKAFL----RDDGTGDTPKG--MKAV------ATAANRLV 209 (366) T ss_pred EeeEEEEEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHhh----ccCCCCccccc--eeec------ccccccee Confidence 888999999999999999888888888999999999998886443 22111 00001 1110 00000111 Q ss_pred ccccccchhHHHHHHHHHHHHHHHhhcccccchhhhccccccccccccceeeeccccccccccccchhhcCCccCCHHHH Q lcl|NC_020866. 153 SNTGGGTGTPWFLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLYGTDARANVGFGFWQMAYGSKQTLDGTAY 232 (297) Q Consensus 153 sn~~ag~~~awylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~g~d~r~~~G~~l~q~a~~~~~~l~~~~l 232 (297) .. .+++..+--.+.-. .-++.. 
..+.+...+.+.|- ++ T Consensus 210 ~~--~~t~~~~~~~~~~~--~~~~~~--------------------------~~~~~~~~~~a~~v--------mn---- 247 (366) T protein:vir:57 210 AW--TGTAINLTTIDEYL--DSLILK--------------------------HMDSNSNMIRCGWG--------LS---- 247 (366) T ss_pred ec--cccccchhhHHHHH--HHHHHh--------------------------hhccccccccCEEE--------ec---- Confidence 11 11111111111000 000000 00011111111221 12 Q ss_pred HHHHHHHHhhccCCCcccc--cccCeEEecchHHHHHHHHHhhhccCC---CCcc---ee-----------cceeeEEec Q lcl|NC_020866. 233 AAARAALSGMKGDYGRPLG--LMPNLLVVPPALEEAGRKILNSENASG---GETN---PW-----------KGTAELLVV 293 (297) Q Consensus 233 ~aar~aM~~~k~~~G~~L~--i~P~~LvVp~~le~~A~~ll~~~~~~~---g~~N---~~-----------~~~~~~iv~ 293 (297) ...+.++++.||.+|++|- +.+..|. ..-++.++.+++ ...| .+ ++-+++-++ T Consensus 248 ~~~~~~L~~lkd~~G~~l~~~~~~g~l~--------G~Pvv~s~~ip~~~~~~~~~~~i~~gdfs~~~i~~~~~i~i~~~ 319 (366) T protein:vir:57 248 NRTYMTLFGLRDGNGNKVYPEMSQGILK--------GYPIQRTSAIPANLGDDGNESEIYFCDFNDVVIGEDGMMKVDFS 319 (366) T ss_pred HHHHHHHHhhhccCCceeccCCCCCeec--------ceeeEEccccccccccCCCccEEEEEecceEEEEEecceEEEEe Confidence 2234557777888888753 1111111 111122222221 0111 11 233444454 Q ss_pred cccC Q lcl|NC_020866. 294 PWLA 297 (297) Q Consensus 294 p~La 297 (297) ++-+ T Consensus 320 ~ea~ 323 (366) T protein:vir:57 320 TEAT 323 (366) T ss_pred eccc Confidence 5433 No 87 >protein:vir:80376 Length: 435 # NCBI annotation: gp6, major capsid head protein # Family: family:all:21 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111085;genbank:gi:134288639;genbank:GeneID:4960624 Probab=22.74 E-value=2.4 Score=18.50 Aligned_cols=231 Identities=14% Similarity=0.136 Sum_probs=92.2 Q ss_pred CCcCHHH-----HHHHHHH------------------------HHHHHHHHHhhcchhhcce-eeeecCCccceeccccc Q lcl|NC_020866. 1 MQVTAAN-----LDALRVG------------------------FKTSFQGALDQAPSQYLRL-TTVVPSSTKEQRYGWMG 50 (297) Q Consensus 1 M~i~~~~-----l~~l~~~------------------------~~~~f~~a~~~a~~~~~~~-a~~v~S~~~~~~y~~Lg 50 (297) |.-.+.. -.++..+ +.+.|-+.+... 
+...++ |+.+|..+....|..+. T Consensus 105 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~lvP~~~~~~ii~~l~~~-~~i~~~~~~~v~~~~~~~~~p~~~ 183 (435) T protein:vir:80 105 LAAARGDAQLASKLAIERGFGEEVAMSLNTLSPGAGGVLVPENLSSEVIELLRPK-SVVRKLGARTLPLSNGNITIPRLK 183 (435) T ss_pred HHhccchhHHHHHHHHhhhhhhhhhhhhcccCCCCCccccchhHHHHHHHHHhhh-chhhhccceeeecCCCceEEEEEe Confidence 0000000 0000000 011111111111 222334 55666555566676665 Q ss_pred CCCcchhcccce---eeeeeccccceeeeecccceeeccHHHhhccCc--chhHHHHHHHHHHHHhhHHHHHHHHHhccc Q lcl|NC_020866. 51 KIPNVREWIGPR---AIQNLTESDYSIREKPWELTIGVDRDDIETDNL--GIYSPLFQEMGRSAGSKWDMLVFELLKLGF 125 (297) Q Consensus 51 ~~P~lrew~Ge~---~~~~l~~~~~~i~n~~fe~tv~v~R~~i~dD~l--G~~~~~~~~~g~aaa~~~~~lv~~lL~~G~ 125 (297) .-|.. .|++|- .-.+..=..-++..++++..+.|+++.+.|-.. .+..-+.+.++++.++..++.++. |. T Consensus 184 ~~~~a-~~v~E~~~~~~~~~~f~~i~~~~~k~~~~~~is~ell~ds~~~~~l~~~i~~~l~~a~~~~~d~a~l~----G~ 258 (435) T protein:vir:80 184 GGAIV-GYIGADTDIPTTQQQFDDLKLTAKKMAALVPIANDLIKYAGVNPNVDQIVVGDLTAAIGAREDKAFIR----DD 258 (435) T ss_pred CCcce-eeeccCccccccccceeeEEEeeEEEEEeehhhHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHHhhc----cC Confidence 56664 577553 333333345667788999999999999887543 577789999999999999885542 21 Q ss_pred Cc-cccCcccccccccccccccccceeeccccccchhHHHHHHHHHHHHHHHhhcccccchhhhccccccccccccceee Q lcl|NC_020866. 126 AT-ECYDGQNFFDTDHPVLDEDGKTVTVSNTGGGTGTPWFLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLY 204 (297) Q Consensus 126 ~~-~~~DGk~~F~tdH~~~~g~~~~~svsn~~ag~~~awylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~ 204 (297) .+ .--.| +.+. .+... +.....+........+....+-.+.... 
+ T Consensus 259 G~~~~p~G--i~~~-----~~~~~---~~~~~~~~~~~~~~~d~~~~~~~~~~~~----------------~-------- 304 (435) T protein:vir:80 259 GTANTPKG--LRFW-----ALPGN---VITASDGSTLQKIETDLGKAILALENAD----------------A-------- 304 (435) T ss_pred CCCCcccc--eeec-----ccccc---eeecccccchhhHHHHHHHHHHHhhccc----------------c-------- Confidence 10 00000 1110 00000 0011111111111111111111110000 0 Q ss_pred eccccccccccccchhhcCCccCCHHHHHHHHHHHHhhccCCCcccc--cccCeEEecchHHHHHHHHHhhhccCCC--C Q lcl|NC_020866. 205 GTDARANVGFGFWQMAYGSKQTLDGTAYAAARAALSGMKGDYGRPLG--LMPNLLVVPPALEEAGRKILNSENASGG--E 280 (297) Q Consensus 205 g~d~r~~~G~~l~q~a~~~~~~l~~~~l~aar~aM~~~k~~~G~~L~--i~P~~LvVp~~le~~A~~ll~~~~~~~g--~ 280 (297) .+.+ +.|- ++.. ...++++.||-+|++|- +.+..|. ...++.++..+.. . T Consensus 305 ---~~~~---~~~v--------mn~~----~~~~L~~lkd~~G~~l~~~~~~~~l~--------G~pv~~~~~~p~~~~~ 358 (435) T protein:vir:80 305 ---NLTQ---PGWI--------MAPR----TFRFLEGLRDGNGNKVYPELANGMLK--------GYPVGKTTQVPINLGE 358 (435) T ss_pred ---cccc---CEEE--------EcHH----HHHHHHhhhccCCceeccCCCCCeEe--------eeeeEEeccccccccC Confidence 0001 1121 1112 33567788888888763 1111111 0111111222210 0 Q ss_pred -cc---ee-----------cceeeEEeccccC Q lcl|NC_020866. 281 -TN---PW-----------KGTAELLVVPWLA 297 (297) Q Consensus 281 -~N---~~-----------~~~~~~iv~p~La 297 (297) .| .+ ++.+++-++++=. T Consensus 359 ~~~~~~i~~gd~s~~~i~~~~~~~i~~~~~~~ 390 (435) T protein:vir:80 359 AGKESEIYFTDFGDVFIGEEETLEIDYSKEAT 390 (435) T ss_pred CCCcceEEEEEcccEEEEeecceEEEEecccc Confidence 01 11 2233343444333 No 88 >protein:vir:80930 Length: 278 # NCBI annotation: Cps # Family: family:all:522 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468392;genbank:gi:157324966;genbank:GeneID:5601363 Probab=22.62 E-value=2.4 Score=18.48 Aligned_cols=192 Identities=11% Similarity=0.076 Sum_probs=92.6 Q ss_pred CC---------cCHHHHHHHHHHHHHHHHHHHhhcchhhcceeeeecC-------CccceecccccCCCcchhcccceee Q lcl|NC_020866. 
1 MQ---------VTAANLDALRVGFKTSFQGALDQAPSQYLRLTTVVPS-------STKEQRYGWMGKIPNVREWIGPRAI 64 (297) Q Consensus 1 M~---------i~~~~l~~l~~~~~~~f~~a~~~a~~~~~~~a~~v~S-------~~~~~~y~~Lg~~P~lrew~Ge~~~ 64 (297) |. |.|+.+.++- ...|++.+ -+.+++....+ +.....|.-+|+.-.+.| -.+... T Consensus 1 Ma~~~T~~~~~iiPev~s~~v---~~~~~~~~-----v~~~~~~~~~~l~g~~G~tv~ip~~~~~g~a~~~~~-g~~i~~ 71 (278) T protein:vir:80 1 MADLTTKLANLIDPEVMGPMI---SAKLPKAI-----KFGKIAPIDNSLEGQPGSEITVPKYKYIGDAQDVAE-GAAIDY 71 (278) T ss_pred CCCcceehhheecHHHHHHHH---HHHHHHhh-----hhcccceecccccCCCCCEEEEeeeccCCcceeecC-CCcCcc Confidence 65 5555554432 22222221 12233322111 111112222232211111 133445 Q ss_pred eeeccccceeeeecccceeeccHHHhhccCcchhHHHHHHHHHHHHhhHHHHHHHHHhcccCccccCccccccccccccc Q lcl|NC_020866. 65 QNLTESDYSIREKPWELTIGVDRDDIETDNLGIYSPLFQEMGRSAGSKWDMLVFELLKLGFATECYDGQNFFDTDHPVLD 144 (297) Q Consensus 65 ~~l~~~~~~i~n~~fe~tv~v~R~~i~dD~lG~~~~~~~~~g~aaa~~~~~lv~~lL~~G~~~~~~DGk~~F~tdH~~~~ 144 (297) .++....-+.+.+.++..+.|+..+....--.....+.+++|++.++.-+..+++.|+...+. .++ + T Consensus 72 ~~lt~~~~~~~i~~~~~a~~v~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~l~~~l~~a~~~--~~~-------~---- 138 (278) T protein:vir:80 72 SALETESVKHGIKKAGKGVKLTDESVLSGYGDPVEEAQKQIRMAIASKVDNDILEEALTTTLE--VKG-------A---- 138 (278) T ss_pred cccccceeeEeeehhhccccccHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHHhccccc--ccc-------c---- Confidence 666666666778888999999999988877778899999999999999999999888652111 000 0 Q ss_pred ccccceeeccccccchhHHHHHHHHHHHHHHHhhcccccchhhhccccccccccccceeeeccccccccccccchhhcCC Q lcl|NC_020866. 145 EDGKTVTVSNTGGGTGTPWFLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLYGTDARANVGFGFWQMAYGSK 224 (297) Q Consensus 145 g~~~~~svsn~~ag~~~awylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~g~d~r~~~G~~l~q~a~~~~ 224 (297) .++ + ... 
T Consensus 139 -----~t~-~----~~~--------------------------------------------------------------- 145 (278) T protein:vir:80 139 -----INI-G----LID--------------------------------------------------------------- 145 (278) T ss_pred -----ccc-c----hhh--------------------------------------------------------------- Confidence 000 0 000 Q ss_pred ccCCHHHHHHHHHHHHhhccCCCcccccccCeEEecchHHHHHHHHHhhhccC--CCCcc--------eecceeeEEecc Q lcl|NC_020866. 225 QTLDGTAYAAARAALSGMKGDYGRPLGLMPNLLVVPPALEEAGRKILNSENAS--GGETN--------PWKGTAELLVVP 294 (297) Q Consensus 225 ~~l~~~~l~aar~aM~~~k~~~G~~L~i~P~~LvVp~~le~~A~~ll~~~~~~--~g~~N--------~~~~~~~~iv~p 294 (297) -..+.+..++.++.. .+-| .+.+|+|+|.....-++.-..+... ....+ -+.| ++|++++ T Consensus 146 --~~~~~~~da~~~l~~----~~~~---~~~~ivv~p~~~~~L~k~~~~~~~~~~~~g~~~~~~G~ig~~~G-~~Vi~s~ 215 (278) T protein:vir:80 146 --KIENTFTDAPDAIED----ESIT---TTGVLFLNYKDTAKLREEAAGSWTKASQLGDDLLVKGAFGELLG-WEIVRTK 215 (278) T ss_pred --hHHHHHHHHHHhhcc----cCCC---cccEEEECHHHHHHHHhhhhhhccccccccccceeeccceeecc-eeEEEcC Confidence 001222333332211 1112 1346778876554433332112211 11111 1223 5888888 Q ss_pred ccC Q lcl|NC_020866. 295 WLA 297 (297) Q Consensus 295 ~La 297 (297) .|- T Consensus 216 ~~p 218 (278) T protein:vir:80 216 KLA 218 (278) T ss_pred CCC Confidence 877 No 89 >protein:vir:6324 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877471;genbank:gi:33300843;uniprot:Q7Y2D3;genbank:GeneID:1482613 Probab=21.96 E-value=2.5 Score=18.38 Aligned_cols=212 Identities=12% Similarity=0.100 Sum_probs=95.9 Q ss_pred CCcCHHHH-----------HHHHH-HHHHHHHHHHhhcchhhcceeeeecCCccceecccccCCCcchhcc------cce Q lcl|NC_020866. 1 MQVTAANL-----------DALRV-GFKTSFQGALDQAPSQYLRLTTVVPSSTKEQRYGWMGKIPNVREWI------GPR 62 (297) Q Consensus 1 M~i~~~~l-----------~~l~~-~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~lrew~------Ge~ 62 (297) |- |+.+| .+||. -|.-+...+|+.. 
+-+..+.++ .+-...+++ +||.+-+-- |+- T Consensus 1 ms-~~~~~tr~~~~~s~~d~al~le~f~geV~~af~~~-s~~~~~~~~-rti~~g~s~----~~~~iG~~~~~~~~pG~~ 73 (335) T protein:vir:63 1 MS-FLNDLTRPNYAGKNADVDIHLEEHLGIVDKHFAYT-SKFAPLMNI-RDLRGSNVV----RLDRLGNVEAKGRRAGEE 73 (335) T ss_pred CC-CcccchhhhcccccchhheehhhhhhhHHHHHHhh-hhhccccce-eeeccceeE----EEeeeeeeeeecccCCcC Confidence 32 11111 12332 3455555566542 222233211 111222222 455442210 111 Q ss_pred eeee-eccccceeeeecccceeeccHHHhhcc-----CcchhHHHHHHHHHHHHhhHHHHHHHHHhcccCc-cccCcccc Q lcl|NC_020866. 63 AIQN-LTESDYSIREKPWELTIGVDRDDIETD-----NLGIYSPLFQEMGRSAGSKWDMLVFELLKLGFAT-ECYDGQNF 135 (297) Q Consensus 63 ~~~~-l~~~~~~i~n~~fe~tv~v~R~~i~dD-----~lG~~~~~~~~~g~aaa~~~~~lv~~lL~~G~~~-~~~DGk~~ 135 (297) --++ ....+.+|+..+ .-+.|..|.|= ++..=+++.+++|++=|++-|+.++..|.++... .-...++. T Consensus 74 l~~~~~~~~k~~itVD~----ll~a~~~I~dlDe~~~~yDvRse~s~e~G~aLA~~~D~~~~~~i~~aa~~~a~~~~~~~ 149 (335) T protein:vir:63 74 LERSRVVNDKWNLTVDT----LLYLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDLEDA 149 (335) T ss_pred cCCCCccccceEEEecc----eeechhhhhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccCCC Confidence 1111 111122333222 12344444331 3567789999999999999999999777665433 22233333 Q ss_pred cccccccccccccceeeccccccchhHHHHHHHHHHHHHHHhhcccccchhhhccccccccccccceeeecccccccccc Q lcl|NC_020866. 136 FDTDHPVLDEDGKTVTVSNTGGGTGTPWFLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLYGTDARANVGFG 215 (297) Q Consensus 136 F~tdH~~~~g~~~~~svsn~~ag~~~awylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~g~d~r~~~G~~ 215 (297) |+.-+ +++-...+ T Consensus 150 ~~~G~----------~~~~~~tg--------------------------------------------------------- 162 (335) T protein:vir:63 150 FSPGV----------LEKLDLTG--------------------------------------------------------- 162 (335) T ss_pred cCCCc----------ceeeeecc--------------------------------------------------------- Confidence 33111 00000000 Q ss_pred ccchhhcCCccCCHHHHHHHHHHHHhhccCCCccc-ccccCeEEecchHHHH---HHHHHhhhccCCCCcceecc----- Q lcl|NC_020866. 
216 FWQMAYGSKQTLDGTAYAAARAALSGMKGDYGRPL-GLMPNLLVVPPALEEA---GRKILNSENASGGETNPWKG----- 286 (297) Q Consensus 216 l~q~a~~~~~~l~~~~l~aar~aM~~~k~~~G~~L-~i~P~~LvVp~~le~~---A~~ll~~~~~~~g~~N~~~~----- 286 (297) .+..-+.+.+.+|......+-+...-|- ++.+.+.||+|..... +.++++.+..+.+..|.+.+ T Consensus 163 -------~~~~~~~~~l~~a~~~a~~~L~e~dVP~~~~~dr~~vv~P~~y~~Ll~~~~l~n~~~~~s~~~~~~~~g~v~~ 235 (335) T protein:vir:63 163 -------LTAKQAADKIVRMHRRVVETFIDRDLGDAVYSEGLTPMSPRVFSLLLEHDKLMNVEYQATGATNDYVKSRVAI 235 (335) T ss_pred -------CcccccHHHHHHHHHHHHHHHHhccCCCcccCceEEEeChHHHHHHhccccccccccccccccccccCceeEE Confidence 0011123444444444444443333332 2456789999887665 45567776554444343321 Q ss_pred --eeeEEeccccC Q lcl|NC_020866. 287 --TAELLVVPWLA 297 (297) Q Consensus 287 --~~~~iv~p~La 297 (297) =++|+.+|+|. T Consensus 236 v~Gv~V~~sn~lP 248 (335) T protein:vir:63 236 LNGVKVLETPRFA 248 (335) T ss_pred eeceEEEeeccCC Confidence 16788999997 No 90 >protein:vir:97031 Length: 402 # NCBI annotation: 31 # Family: family:all:2806 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654132;genbank:gi:108862016;genbank:GeneID:5075980 Probab=21.52 E-value=2.6 Score=18.32 Aligned_cols=213 Identities=12% Similarity=0.079 Sum_probs=100.7 Q ss_pred CCc----------CHHHHHHHH-HHHHHHHHHHHhhcchhhcceeeeecCCccceecccccCCCcchhc-ccceeeeeec Q lcl|NC_020866. 1 MQV----------TAANLDALR-VGFKTSFQGALDQAPSQYLRLTTVVPSSTKEQRYGWMGKIPNVREW-IGPRAIQNLT 68 (297) Q Consensus 1 M~i----------~~~~l~~l~-~~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~lrew-~Ge~~~~~l~ 68 (297) |-. .+..-.+|+ +-|..+...+|+.. +-+..+.+. 
.+-...+++ +||.+-+- ++.++-|..- T Consensus 1 Ms~~n~~t~~~~~~s~~~~al~le~f~geV~taF~~~-si~~~~~~v-rti~~GkS~----qf~~iG~~~a~y~~~G~~l 74 (402) T protein:vir:97 1 MSTPNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKG-ENILSYFDV-QTVTGTNTV----SNKYLGETELQVLAPGQSP 74 (402) T ss_pred CCCcccccccccccccchhhhhhhhhhhhHHHHHHHH-HhhcCccee-eeecccceE----EEEEEeeeEEeeecccccc Confidence 422 123356677 56777777788653 223333222 121222222 34444221 1222222222 Q ss_pred cccceeeeecccceeecc-----HHHhhccC-----cc-hhHHHHHHHHHHHHhhHHHHHHHHHhc-cc-CccccCcccc Q lcl|NC_020866. 69 ESDYSIREKPWELTIGVD-----RDDIETDN-----LG-IYSPLFQEMGRSAGSKWDMLVFELLKL-GF-ATECYDGQNF 135 (297) Q Consensus 69 ~~~~~i~n~~fe~tv~v~-----R~~i~dD~-----lG-~~~~~~~~~g~aaa~~~~~lv~~lL~~-G~-~~~~~DGk~~ 135 (297) + +..+++ -+.+|.|+ |..|.|-| +. .=+.+.+++|++=+++.|+.+++++.. |. +..-.++.+- T Consensus 75 d-g~~~~~--~k~~ItID~lL~a~~~V~diDeaq~~yD~vRse~s~e~G~ALA~~~Dq~ii~~i~~aa~a~t~~~~~~~~ 151 (402) T protein:vir:97 75 N-ATPTQA--DKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPR 151 (402) T ss_pred C-CCCccc--ccEEEEeCceeechhhhhhHHHHHhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccCc Confidence 2 223333 35555555 66664433 43 335778999999999999999988755 21 1110110000 Q ss_pred cccccccccccccceeeccccccchhHHHHHHHHHHHHHHHhhcccccchhhhccccccccccccceeeecccccccccc Q lcl|NC_020866. 136 FDTDHPVLDEDGKTVTVSNTGGGTGTPWFLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLYGTDARANVGFG 215 (297) Q Consensus 136 F~tdH~~~~g~~~~~svsn~~ag~~~awylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~g~d~r~~~G~~ 215 (297) +.. .+....+ .+ T Consensus 152 ~~~-------~g~s~~~----~~--------------------------------------------------------- 163 (402) T protein:vir:97 152 VKG-------HGFSINV----NV--------------------------------------------------------- 163 (402) T ss_pred ccc-------ccccccc----cc--------------------------------------------------------- Confidence 000 0000000 00 Q ss_pred ccchhhcCCccCCHHHHHHHHHHHHhhccCCCcccccccCeEEecchHHHHH---HHHHhhhccCCCCcceecc------ Q lcl|NC_020866. 
216 FWQMAYGSKQTLDGTAYAAARAALSGMKGDYGRPLGLMPNLLVVPPALEEAG---RKILNSENASGGETNPWKG------ 286 (297) Q Consensus 216 l~q~a~~~~~~l~~~~l~aar~aM~~~k~~~G~~L~i~P~~LvVp~~le~~A---~~ll~~~~~~~g~~N~~~~------ 286 (297) .......+...+.+|...-..+.+...-|..= .++||+|.....- .+|++.+...++...+-.| T Consensus 164 -----t~~~a~~~~~~l~~ai~~a~~~LdEkdVP~~d--Rv~vv~P~~y~~Ll~~~rl~n~d~~~~~~g~~~~G~v~~v~ 236 (402) T protein:vir:97 164 -----TESEALANPQYVMAAVEYALEQQLEQEVDISD--VAIMMPWKFFNALRDADRIVDKTYTISQSGATINGFVLSSY 236 (402) T ss_pred -----ccchhhcCHHHHHHHHHHHHHHHHhcCCCccc--cEEEeChHHHHHHhhcccccchhhccccCCccccceeEEEe Confidence 00011234455555555544444444444322 5889998766654 4456666543322212222 Q ss_pred eeeEEeccccC Q lcl|NC_020866. 287 TAELLVVPWLA 297 (297) Q Consensus 287 ~~~~iv~p~La 297 (297) -++|+.+++|. T Consensus 237 Gv~Vv~SnnlP 247 (402) T protein:vir:97 237 NCPVIPSNRFP 247 (402) T ss_pred ceEEEecCccc Confidence 27889999997 No 91 >protein:vir:97053 Length: 390 # NCBI annotation: putative head protein # Family: family:all:585 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453565;genbank:gi:84662600;genbank:GeneID:5142468 Probab=20.38 E-value=2.8 Score=18.15 Aligned_cols=232 Identities=15% Similarity=0.036 Sum_probs=88.1 Q ss_pred CCcC-HHHHHHHHH------------HHHHHHHHHHhhcchhhcceeeeecCCccceecccccCCCcchhcccce---ee Q lcl|NC_020866. 1 MQVT-AANLDALRV------------GFKTSFQGALDQAPSQYLRLTTVVPSSTKEQRYGWMGKIPNVREWIGPR---AI 64 (297) Q Consensus 1 M~i~-~~~l~~l~~------------~~~~~f~~a~~~a~~~~~~~a~~v~S~~~~~~y~~Lg~~P~lrew~Ge~---~~ 64 (297) +... 
...+++..+ -+...+.+.+....+ ...+++.+|-.....+|........--.|++|- .- T Consensus 102 ~~~~~~~~~~~~~~~~~~~~g~lip~~~~~~ii~~~~~~~~-i~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~ 180 (390) T protein:vir:97 102 ATMNIKAALNTASTDAAGSAGALTTPNRLPGFITPPDARLT-VRDLIGSGRTDSALIEYVQETGFVNNAAIVAEGALKPE 180 (390) T ss_pred hhhHHHHHHHhhhcccccccccccchhhhHHHHHHHhhhhh-hHhhcceeeccCCceEEEEEecCCcceeeecCCccccc Confidence 1000 000111110 011222222322223 344566666444445555554433334677543 22 Q ss_pred eeeccccceeeeecccceeeccHHHhhccCcchhHHHHHHHHHHHHhhHHHHHHHHHhcccCccccCccccccccccccc Q lcl|NC_020866. 65 QNLTESDYSIREKPWELTIGVDRDDIETDNLGIYSPLFQEMGRSAGSKWDMLVFELLKLGFATECYDGQNFFDTDHPVLD 144 (297) Q Consensus 65 ~~l~~~~~~i~n~~fe~tv~v~R~~i~dD~lG~~~~~~~~~g~aaa~~~~~lv~~lL~~G~~~~~~DGk~~F~tdH~~~~ 144 (297) .+..=..-++..++++..+.|+|+.+. |-..+.+-+.+.++++.++..+..++. | ||.. .+|.+- T Consensus 181 ~~~~~~~i~~~~~k~~~~~~is~ell~-ds~~l~~~i~~~la~a~~~~~d~a~l~----G------~g~~----~~p~Gi 245 (390) T protein:vir:97 181 SSLKFAKKTDTTHVIAHTMKATRQILS-DAPQLASYMNNRLIRGLKVKEDAEILR----G------TGAN----DGLLGL 245 (390) T ss_pred cccceeEEEEeeeeEEEeehhhHHHHH-hHHHHHHHHHHHHHHHHHHHHHHHHhh----c------CCCC----ccccce Confidence 333334566788899999999999775 455666778888999999998875542 2 2211 122111 Q ss_pred ccccceeeccccccchhHHHHHHHHHHHHHHHhhcccccchhhhccccccccccccceeeeccccccccccccchhhcCC Q lcl|NC_020866. 145 EDGKTVTVSNTGGGTGTPWFLLDTTRALKPIILQKRRDFQFVSKTKLDDDHVFMNKEFLYGTDARANVGFGFWQMAYGSK 224 (297) Q Consensus 145 g~~~~~svsn~~ag~~~awylld~s~alkpiI~q~r~~~~~~~~~~~~~~~vf~~~~~~~g~d~r~~~G~~l~q~a~~~~ 224 (297) -... .........++..++-. +.-++..... . 
.+.++ -|- T Consensus 246 ~~~~-~~~~~~~~~~~~~~~d~-----~~~~~~~~~~------------~-------------~~~~~---~~v------ 285 (390) T protein:vir:97 246 IPQA-TTYAAPTTIAGATRVDQ-----LRLAMLQASL------------A-------------EYPAS---GIV------ 285 (390) T ss_pred eecc-ccccccccccccchHHH-----HHHHHHhhcc------------c-------------cCCCC---EEE------ Confidence 0000 00000111112222210 1101110000 0 00000 011 Q ss_pred ccCCHHHHHHHHHHHHhhccCCCcccccccCeEEecchHHHHHHHHHhhhccCCCC-----cc-ee----cceeeEEec- Q lcl|NC_020866. 225 QTLDGTAYAAARAALSGMKGDYGRPLGLMPNLLVVPPALEEAGRKILNSENASGGE-----TN-PW----KGTAELLVV- 293 (297) Q Consensus 225 ~~l~~~~l~aar~aM~~~k~~~G~~L~i~P~~LvVp~~le~~A~~ll~~~~~~~g~-----~N-~~----~~~~~~iv~- 293 (297) ++. ..+.++++.||-+|++|--.|.- ..++.| ....++.+...+.+. -+ -+ +.-+++.++ T Consensus 286 --~n~----~~~~~L~~lkd~~G~~l~~~~~~-~~~~~l--~G~pV~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~ 356 (390) T protein:vir:97 286 --INP----IDWAAIELAKDANNQYLIGNARG-TLTPTL--WGLPVVATQAMAPGEFLVGAFDLAAQIFDQWDARVEIGY 356 (390) T ss_pred --EcH----HHHHHHHHhhcCCCceeecCccC-CCCcee--cceeeEEcCCCCCCcEEEEeccceEEEEEecceEEEEee Confidence 111 23455667778888776311100 000000 011112222333322 11 11 111222221 Q ss_pred --cccC Q lcl|NC_020866. 294 --PWLA 297 (297) Q Consensus 294 --p~La 297 (297) +... T Consensus 357 ~~~~f~ 362 (390) T protein:vir:97 357 VNDDFQ 362 (390) T ss_pred cccccc Confidence 1111 Done!