Query lcl|NC_017674.1_cdsid_YP_006200789.1 [gene=F358_gp24] [protein=putative major strucutral protein] [protein_id=YP_006200789.1] [location=15455..16603] Match_columns 382 No_of_seqs 106 out of 111 Neff 6.4 Searched_HMMs 1612 Date Thu Nov 7 16:35:03 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_24 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_24_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:96079 Length: 382 100.0 1E-142 6E-146 799.1 32.3 382 1-382 1-382 (382) 2 protein:vir:99576 Length: 388 100.0 7E-135 4E-138 756.2 31.2 378 1-382 1-388 (388) 3 protein:vir:107732 Length: 379 100.0 3E-124 2E-127 697.5 31.6 364 1-382 1-379 (379) 4 protein:vir:106734 Length: 336 100.0 3E-114 2E-117 643.3 30.1 334 21-382 1-336 (336) 5 protein:vir:78558 Length: 336 100.0 5E-114 3E-117 641.8 29.8 334 21-382 1-336 (336) 6 protein:vir:3643 Length: 336 # 100.0 2E-112 1E-115 632.8 29.9 334 21-382 1-336 (336) 7 protein:vir:94070 Length: 339 100.0 3E-112 2E-115 631.7 29.9 337 19-382 1-339 (339) 8 protein:vir:101557 Length: 336 100.0 3E-112 2E-115 631.8 29.8 334 21-382 1-336 (336) 9 protein:vir:79642 Length: 329 100.0 1.6E-90 9.8E-94 512.9 27.9 320 34-382 1-326 (329) 10 protein:vir:104342 Length: 314 100.0 8.4E-89 5.2E-92 503.4 26.7 309 35-382 1-311 (314) 11 protein:vir:107687 Length: 319 100.0 2.9E-87 1.8E-90 495.0 27.1 316 21-382 1-319 (319) 12 protein:vir:80068 Length: 301 100.0 3.9E-86 2.4E-89 488.8 26.4 292 70-382 1-301 (301) 13 protein:vir:5255 Length: 304 # 100.0 8.7E-85 5.4E-88 481.4 27.6 289 73-381 1-304 (304) 14 protein:vir:103285 Length: 296 100.0 3.8E-85 2.4E-88 483.4 25.0 291 61-382 1-293 (296) 15 protein:vir:7771 Length: 330 # 98.6 4.5E-09 2.8E-12 66.4 13.5 300 47-382 1-321 (330) 16 protein:vir:104085 Length: 320 98.5 4.3E-09 2.7E-12 66.4 11.5 297 57-382 1-315 (320) 17 protein:vir:1638 Length: 298 # 98.4 1.5E-08 9.4E-12 63.4 11.3 278 70-382 1-297 (298) 18 protein:vir:105778 Length: 358 98.3 1.4E-08 8.5E-12 63.7 9.5 325 1-382 1-357 (358) 19 protein:vir:94771 Length: 298 98.3 2.7E-08 1.7E-11 62.1 11.1 280 61-382 1-297 (298) 20 protein:vir:8187 Length: 311 # 98.3 4.7E-08 2.9E-11 60.7 11.9 281 70-382 1-308 (311) 21 protein:vir:9574 Length: 300 # 98.2 8.4E-08 5.2E-11 59.4 12.6 281 61-382 1-298 (300) 22 protein:vir:80376 Length: 435 98.1 3E-07 1.9E-10 56.3 13.1 350 1-382 65-431 (435) 23 protein:vir:99920 Length: 311 98.1 1.5E-07 9.1E-11 58.0 11.4 287 60-382 1-310 (311) 24 protein:vir:5739 Length: 366 # 98.1 4.2E-07 2.6E-10 55.5 13.8 345 1-382 1-364 (366) 25 protein:vir:1433 Length: 435 # 98.0 8.6E-07 5.3E-10 53.8 14.9 352 1-382 45-431 (435) 26 protein:vir:96392 Length: 324 98.0 1.5E-06 9.6E-10 52.4 15.5 304 1-382 1-313 (324) 27 protein:vir:78830 Length: 324 98.0 1.5E-06 9.6E-10 52.4 15.5 304 1-382 1-313 (324) 28 protein:vir:41 Length: 299 # N 97.9 1E-06 6.2E-10 53.5 13.6 283 61-382 1-296 (299) 29 protein:vir:2504 Length: 305 # 97.9 1.1E-06 6.9E-10 53.2 13.9 278 68-382 1-296 (305) 30 protein:vir:78523 Length: 338 97.9 1.3E-06 8.3E-10 52.8 13.4 303 26-382 1-333 (338) 31 protein:vir:103955 Length: 324 97.9 3.5E-06 2.1E-09 50.5 15.3 305 1-382 1-313 (324) 32 protein:vir:97148 Length: 324 97.8 4.1E-06 2.6E-09 50.1 15.5 303 1-382 1-313 (324) 33 protein:vir:95763 Length: 297 97.8 3.9E-06 2.4E-09 50.3 14.7 289 47-382 1-294 (297) 34 protein:vir:2430 Length: 318 # 97.8 2.6E-06 1.6E-09 51.2 13.6 291 51-382 1-311 (318) 35 protein:vir:9309 Length: 324 # 97.8 4.7E-06 2.9E-09 49.8 14.8 305 19-382 1-313 (324) 36 protein:vir:80684 Length: 315 97.7 3.9E-06 2.4E-09 50.2 13.4 281 61-382 1-304 (315) 37 protein:vir:105905 Length: 304 97.7 6.7E-06 4.2E-09 48.9 14.6 288 47-382 1-303 (304) 38 protein:vir:94142 Length: 304 97.7 6.7E-06 4.2E-09 48.9 14.6 288 47-382 1-303 (304) 39 protein:vir:96223 Length: 324 97.7 8.8E-06 5.4E-09 48.3 15.0 304 1-382 1-313 (324) 40 protein:vir:99749 Length: 324 97.7 9.8E-06 6.1E-09 48.0 15.2 301 1-382 1-313 (324) 41 protein:vir:9759 Length: 303 # 97.7 4.2E-06 2.6E-09 50.1 13.1 282 61-382 1-301 (303) 42 protein:vir:78223 Length: 333 97.6 6.5E-06 4E-09 49.0 13.4 301 26-382 1-330 (333) 43 protein:vir:4226 Length: 326 # 97.6 7.3E-06 4.5E-09 48.7 13.1 306 37-382 1-321 (326) 44 protein:vir:94673 Length: 419 97.4 1.8E-05 1.1E-08 46.6 13.6 328 1-382 50-415 (419) 45 protein:vir:108211 Length: 318 97.3 7.7E-06 4.8E-09 48.6 10.3 269 63-382 1-315 (318) 46 protein:vir:2344 Length: 397 # 97.2 1.6E-05 1E-08 46.8 11.4 290 37-382 1-304 (397) 47 protein:vir:105038 Length: 428 97.1 0.00015 9E-08 41.6 15.6 352 1-382 46-426 (428) 48 protein:vir:10364 Length: 390 96.8 0.00013 8.1E-08 41.9 12.7 323 1-382 61-390 (390) 49 protein:vir:100135 Length: 418 96.8 6.3E-05 3.9E-08 43.6 10.8 321 1-382 77-413 (418) 50 protein:vir:1886 Length: 385 # 96.7 3.3E-05 2.1E-08 45.1 9.2 318 1-382 18-382 (385) 51 protein:vir:191 Length: 385 # 96.7 3.3E-05 2.1E-08 45.1 9.2 318 1-382 18-382 (385) 52 protein:vir:101650 Length: 497 96.7 7.4E-05 4.6E-08 43.2 10.9 341 1-382 67-491 (497) 53 protein:vir:7855 Length: 497 # 96.7 7.4E-05 4.6E-08 43.2 10.9 341 1-382 67-491 (497) 54 protein:vir:4339 Length: 395 # 96.6 0.00016 1E-07 41.4 11.9 325 1-382 36-393 (395) 55 protein:vir:8420 Length: 477 # 96.3 0.00033 2E-07 39.7 12.3 341 1-382 93-469 (477) 56 protein:vir:97053 Length: 390 96.3 0.0004 2.5E-07 39.2 12.6 317 1-382 61-390 (390) 57 protein:vir:81227 Length: 413 96.3 0.00078 4.8E-07 37.6 14.1 325 1-382 51-408 (413) 58 protein:vir:104256 Length: 458 96.2 0.00014 8.7E-08 41.7 9.8 335 1-382 99-456 (458) 59 protein:vir:81070 Length: 390 95.6 0.0016 9.8E-07 35.9 12.8 323 1-382 58-390 (390) 60 protein:vir:93616 Length: 645 95.1 0.0028 1.8E-06 34.6 13.1 345 1-382 242-637 (645) 61 protein:vir:8102 Length: 543 # 95.0 0.0011 6.7E-07 36.9 10.0 333 1-382 188-540 (543) 62 protein:vir:4600 Length: 415 # 94.8 0.0014 8.6E-07 36.3 10.2 330 1-382 44-402 (415) 63 protein:vir:4700 Length: 415 # 94.8 0.0014 8.6E-07 36.3 10.2 330 1-382 44-402 (415) 64 protein:vir:100247 Length: 425 94.5 0.00061 3.8E-07 38.2 7.6 334 1-382 71-422 (425) 65 protein:vir:1328 Length: 392 # 94.4 0.0015 9.2E-07 36.1 9.5 332 1-382 20-389 (392) 66 protein:vir:3613 Length: 272 # 94.3 0.0048 3E-06 33.3 12.4 260 64-382 1-272 (272) 67 protein:vir:80930 Length: 278 93.9 0.0055 3.4E-06 33.0 11.7 263 64-382 1-275 (278) 68 protein:vir:96833 Length: 275 93.9 0.0049 3E-06 33.2 11.2 255 61-382 1-275 (275) 69 protein:vir:485 Length: 407 # 93.6 0.003 1.9E-06 34.4 9.6 326 1-382 50-398 (407) 70 protein:vir:4456 Length: 401 # 93.4 0.0022 1.3E-06 35.2 8.6 326 1-382 51-399 (401) 71 protein:vir:96762 Length: 632 93.4 0.0069 4.3E-06 32.4 11.2 334 1-382 269-631 (632) 72 protein:vir:6212 Length: 434 # 93.1 0.0012 7.6E-07 36.6 6.7 332 1-382 56-427 (434) 73 protein:vir:96123 Length: 274 92.9 0.0095 5.9E-06 31.7 13.3 257 64-382 1-268 (274) 74 protein:vir:9410 Length: 415 # 92.8 0.0071 4.4E-06 32.4 10.4 333 1-382 58-402 (415) 75 protein:vir:3033 Length: 272 # 92.5 0.011 6.9E-06 31.3 13.6 256 64-382 1-267 (272) 76 protein:vir:9820 Length: 272 # 92.5 0.011 6.9E-06 31.3 13.6 256 64-382 1-267 (272) 77 protein:vir:6242 Length: 390 # 92.4 0.0042 2.6E-06 33.6 8.7 330 1-382 20-387 (390) 78 protein:vir:98339 Length: 415 92.1 0.013 8E-06 30.9 11.2 332 1-382 48-402 (415) 79 protein:vir:81100 Length: 415 92.1 0.013 8E-06 30.9 11.2 332 1-382 48-402 (415) 80 protein:vir:79987 Length: 415 92.1 0.013 8E-06 30.9 11.2 332 1-382 48-402 (415) 81 protein:vir:93742 Length: 274 90.7 0.019 1.2E-05 30.0 12.5 254 63-382 1-268 (274) 82 protein:vir:97433 Length: 274 90.2 0.022 1.4E-05 29.6 13.2 257 63-382 1-268 (274) 83 protein:vir:94494 Length: 274 90.2 0.022 1.4E-05 29.6 13.2 257 63-382 1-268 (274) 84 protein:vir:97255 Length: 310 89.4 0.026 1.6E-05 29.2 10.5 275 54-382 1-308 (310) 85 protein:vir:107882 Length: 307 88.5 0.032 2E-05 28.8 11.6 269 60-382 1-300 (307) 86 protein:vir:1239 Length: 274 # 87.2 0.04 2.5E-05 28.2 12.5 251 63-382 1-268 (274) 87 protein:vir:4856 Length: 293 # 84.6 0.059 3.7E-05 27.3 11.4 267 62-382 1-279 (293) 88 protein:vir:102119 Length: 404 84.2 0.032 2E-05 28.7 7.6 328 1-382 44-398 (404) 89 protein:vir:105334 Length: 276 83.7 0.066 4.1E-05 27.0 12.4 254 64-382 1-268 (276) 90 protein:vir:101607 Length: 379 82.7 0.044 2.7E-05 28.0 7.7 316 1-382 39-379 (379) 91 protein:vir:79078 Length: 307 82.5 0.076 4.7E-05 26.7 11.3 272 60-382 1-300 (307) 92 protein:vir:99888 Length: 309 80.4 0.095 5.9E-05 26.2 13.1 271 47-382 1-301 (309) 93 protein:vir:95107 Length: 270 79.6 0.079 4.9E-05 26.6 7.9 251 64-382 1-263 (270) 94 protein:vir:4830 Length: 397 # 76.2 0.14 8.6E-05 25.3 11.4 308 1-382 63-385 (397) 95 protein:vir:94933 Length: 330 74.9 0.15 9.5E-05 25.0 9.5 302 15-382 1-327 (330) 96 protein:vir:3158 Length: 321 # 74.8 0.15 9.5E-05 25.0 15.2 294 11-382 1-309 (321) 97 protein:vir:3845 Length: 395 # 74.7 0.12 7.1E-05 25.7 7.4 310 1-382 1-381 (395) 98 protein:vir:4997 Length: 397 # 72.3 0.19 0.00011 24.6 12.2 309 1-382 53-383 (397) 99 protein:vir:78640 Length: 352 71.5 0.19 0.00012 24.5 10.0 314 1-382 1-344 (352) 100 protein:vir:95898 Length: 274 71.5 0.19 0.00012 24.5 12.4 253 63-382 1-268 (274) 101 protein:vir:96262 Length: 274 71.5 0.19 0.00012 24.5 12.4 253 63-382 1-268 (274) 102 protein:vir:8843 Length: 317 # 70.3 0.21 0.00013 24.3 10.1 280 61-382 1-313 (317) 103 protein:vir:4511 Length: 409 # 68.0 0.24 0.00015 23.9 12.6 331 1-382 41-404 (409) 104 protein:vir:3870 Length: 400 # 67.7 0.17 0.0001 24.8 6.6 306 1-382 72-397 (400) 105 protein:vir:4159 Length: 315 # 65.3 0.29 0.00018 23.6 13.8 298 18-382 1-315 (315) 106 protein:vir:94424 Length: 387 63.7 0.18 0.00011 24.6 6.0 311 1-382 53-379 (387) 107 protein:vir:96978 Length: 387 63.7 0.18 0.00011 24.6 6.0 311 1-382 53-379 (387) 108 protein:vir:2685 Length: 387 # 63.7 0.18 0.00011 24.6 6.0 311 1-382 53-379 (387) 109 protein:vir:9361 Length: 402 # 61.2 0.26 0.00016 23.8 6.4 312 1-382 68-396 (402) 110 protein:vir:3991 Length: 404 # 57.1 0.44 0.00027 22.5 11.6 310 1-382 56-391 (404) 111 protein:vir:93881 Length: 387 55.1 0.32 0.0002 23.3 5.8 312 1-382 53-379 (387) 112 protein:vir:4197 Length: 314 # 51.1 0.59 0.00037 21.8 14.8 292 26-382 1-310 (314) 113 protein:vir:4092 Length: 390 # 50.8 0.6 0.00037 21.8 13.0 325 1-382 8-368 (390) 114 protein:vir:1383 Length: 421 # 48.5 0.67 0.00041 21.5 9.9 305 1-382 54-392 (421) 115 protein:vir:4953 Length: 397 # 46.1 0.75 0.00046 21.3 11.3 307 1-382 53-383 (397) 116 protein:vir:739 Length: 231 # 44.8 0.79 0.00049 21.1 11.9 219 108-382 1-231 (231) 117 protein:vir:6324 Length: 335 # 44.3 0.81 0.0005 21.1 6.5 291 26-382 1-328 (335) 118 protein:vir:94622 Length: 341 43.3 0.85 0.00053 21.0 12.4 294 61-382 1-337 (341) 119 protein:vir:100172 Length: 394 42.3 0.89 0.00055 20.9 8.6 303 1-382 54-382 (394) 120 protein:vir:1025 Length: 408 # 41.3 0.93 0.00058 20.7 12.9 315 1-382 56-391 (408) 121 protein:vir:9704 Length: 394 # 40.1 0.98 0.00061 20.6 9.2 314 1-382 53-388 (394) 122 protein:vir:99675 Length: 324 30.5 1.6 0.00097 19.5 7.1 258 103-382 1-301 (324) 123 protein:vir:102655 Length: 322 29.6 1.6 0.001 19.4 9.6 289 61-382 1-319 (322) 124 protein:vir:7409 Length: 408 # 29.0 1.7 0.0011 19.3 12.0 312 1-382 56-391 (408) 125 protein:vir:106647 Length: 303 25.5 2.1 0.0013 18.9 8.3 272 61-382 1-288 (303) 126 protein:vir:1268 Length: 397 # 25.1 2.1 0.0013 18.8 9.2 317 1-382 60-395 (397) 127 protein:vir:107593 Length: 392 23.7 2.3 0.0014 18.6 11.4 320 1-382 35-382 (392) 128 protein:vir:102082 Length: 392 23.7 2.3 0.0014 18.6 11.4 320 1-382 35-382 (392) 129 protein:vir:105004 Length: 392 23.7 2.3 0.0014 18.6 11.4 320 1-382 35-382 (392) 130 protein:vir:102873 Length: 392 23.7 2.3 0.0014 18.6 11.4 320 1-382 35-382 (392) 131 protein:vir:80213 Length: 334 21.8 2.5 0.0016 18.4 7.4 303 47-382 1-332 (334) No 1 >protein:vir:96079 Length: 382 # NCBI annotation: hypothetical protein ORF023 # Family: family:all:1653 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294440;genbank:gi:149408337;genbank:GeneID:5237198 Probab=100.00 E-value=9.9e-143 Score=799.11 Aligned_cols=382 Identities=95% Similarity=1.414 Sum_probs=377.0 Q ss_pred CCCcceeeeecCccccccccccccchHHHHHHhhcceeccccchhhhhhhhcccccchhhhhhcccccCcccccchhHHH Q lcl|NC_017674. 1 MSQISKTHSRLAGRNAKPFDLKNITNDAVASLSRIGLVFDHAVVQDQIKALAKAGAFRSGSAMDSNFTAPVTTPSIPTPI 80 (382) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~amDa~~~~~~t~~~~~~~~ 80 (382) |+|+||+||||+||+|||||+++++.+++++|+|+||+||+++.+++++.++++.+....+||||++++++|++|+|+|+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~gi~~~~~~~~~~~~~~~~~~~~~~~~amDa~~~~~~t~~~~g~p~ 80 (382) T protein:vir:96 1 MSHISKTHSRLAGRHAKPFDLKNVTHEAVAALGRIGLVFDHAVVQDQIKALAKAGAFRSGSAMDSNFTAPVTTPSIPTPI 80 (382) T ss_pred CCCcceeeeecCCccccchhhhcccHHHHHHHhccccccCcccchhHhhhhhhhhhhhhhcccccccCCccccCCccHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEEeeecccceeecccccCCceeeeeeeeeEeeEEEEEEEEEe Q lcl|NC_017674. 81 QFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEPAGTAVEYGDHTNIPLTSWNANFERRTIVRGELGMMV 160 (382) Q Consensus 81 ~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e~~G~a~~ygd~~DiP~vd~~~~~~~~~v~~~~~g~~y 160 (382) +||+||||++||++|+||++++||||+|+|+|.+++++|+++|.+|+|++|||++|+|++|+++++++++++++++||+| T Consensus 81 ~~l~~~~p~~~~~~~~p~~~~~l~pv~t~g~W~~~t~ty~~~e~~G~A~~ygd~~D~Pl~d~~~~~~~r~v~~~~~g~~y 160 (382) T protein:vir:96 81 QFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEPAGTAVEYGDHTNIPLTSWNANFERRTIVRGELGLLV 160 (382) T ss_pred HHHhhhhhhhhhhhhhhhhhhhhccccccCCccceEEEEeeeecccceEEeecccCCCccccccceeEEEEEEEEEeeee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCcceeccCCCCccccCHHHHHHHHH Q lcl|NC_017674. 161 GTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAFQTPPSQGWSTADWAGIIGDIR 240 (382) Q Consensus 161 ~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~~~~a~~~Wa~kT~~eI~~Di~ 240 (382) +.+|+++|+++|++|+++|+.+||+++|+++|+++|||+++|+++++||||||||||+..++++++|++||++||++||+ T Consensus 161 g~lE~~rAa~~~~~l~~~Ka~aA~~ale~~~N~i~f~G~~~g~~~~~yGllNdP~l~a~~t~a~~~Wa~kT~~eI~~Di~ 240 (382) T protein:vir:96 161 GTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPPFQTPPSQGWATADWAGIIGDIR 240 (382) T ss_pred cHHHHHHHHhhCCCcHHHHHHHHHHHHHHhhceEEEEeeecCcCcceEEEEeCCCcccccccCCCCcccccHHHHHHHHH Confidence 99999999999999999999999999999999999999999889999999999999999888899999999999999999 Q ss_pred HHHHHHHHhcCCeeeeccccceEecCHHHHhhccccCCCCccHHHHHHHhcCccEEEEccccccccCCCCCceeEEEEcc Q lcl|NC_017674. 241 EAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSVTTPYGISVSDWIEQTYPKMRIVSAPELSGVQMKAQEPEDALVLFV 320 (382) Q Consensus 241 ~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~t~~~~~Tvl~~l~~n~pnl~i~~~peL~~a~g~g~~~~~~~~~~~ 320 (382) +++++|++||+|.|+++++|++|+|||+++.+|+++|++|+||++|||+||||++|+++|||++++++|+++.+++|+|. T Consensus 241 ~l~~~i~~qt~G~~~~~~~~~~L~LP~~~~~~Ls~~n~~g~Tvl~~lk~n~Pnl~i~t~peL~~a~~~g~g~~~~~~~~~ 320 (382) T protein:vir:96 241 EAVRQLRIQSQDQIDPKAEKITMALATSKVDYLSVTTPYGISVSDWIEQTYPKMRIVSAPELSGVQMQGKTPEDALVLFV 320 (382) T ss_pred HHHHHHHhccCCeeeecccceEEeechHHHhhccccCccCccHHHHHHHhcCCcEEEEccccccccCCCccceeEEEEec Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhhhhccccccchhhhhhhhhhhhcccceecCCceEeccccceeeeEeeccchheeecCC Q lcl|NC_017674. 321 EDVNAAVDGSTDGGSVFSQLVQSKFITLGVEKRAKSYVEDFSNGTAGALCKRPWAVVRYLGI 382 (382) Q Consensus 321 ~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~~~~~~~~~~~t~Gv~i~~P~aia~~~GI 382 (382) ++++..+++++++.++|+|.+|++++.+|+|++.++|++||++|||||+||||+||+|++|| T Consensus 321 ~e~~~~~~~s~~~p~~f~q~~p~~~~~l~ve~~~~~~~~~~s~~t~Gv~i~~P~ai~~~~GI 382 (382) T protein:vir:96 321 EEVDASVDGSTDGGSVFSQLVQSKFITLGVEKRAKSYVEDFSNGTAGALCKRPWAVVRYLGI 382 (382) T ss_pred chhhhhcccccccCcceeccccceeeeccceeecceeEeccccceeeeEEEcchhhhhccCC Confidence 99999889999999999999999999999999999999999999999999999999999999 No 2 >protein:vir:99576 Length: 388 # NCBI annotation: hypothetical protein # Family: family:all:1653 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039801;genbank:gi:126011051;genbank:GeneID:4818271 Probab=100.00 E-value=6.7e-135 Score=756.17 Aligned_cols=378 Identities=62% Similarity=1.012 Sum_probs=358.9 Q ss_pred CCCcceeeeecCccccccccccc------cchHHHHHHhhcceeccccchhhhhhhhcccccchhhhhhcccccCccccc Q lcl|NC_017674. 1 MSQISKTHSRLAGRNAKPFDLKN------ITNDAVASLSRIGLVFDHAVVQDQIKALAKAGAFRSGSAMDSNFTAPVTTP 74 (382) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~amDa~~~~~~t~~ 74 (382) |+||||+||+|+||++||++|+. ++.++++||+|+||+||++..+.+...+.... .+++||||++.+++|++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~--~~~~a~da~~~~~~t~~ 78 (388) T protein:vir:99 1 MKQLSKVHQSLAGRSVRAFDMANGKADYRLTDMAVRELKKFGLVFDHATVKRQIELLHEGG--VATQAFDSAYVAPTTQA 78 (388) T ss_pred CCCccceeeecCCcccchhhhhcCCcceeeechhhHhhhhcceeccCccchhhhhhhhhhh--hhhcccCcccccccccC Confidence 99999999999999999999866 66788999999999999998888777766543 57899999999999999 Q ss_pred chhHHHHHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEEeeecccceeecccccCCceeeeeeeeeEeeEEEE Q lcl|NC_017674. 75 SIPTPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEPAGTAVEYGDHTNIPLTSWNANFERRTIVRG 154 (382) Q Consensus 75 ~~~~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e~~G~a~~ygd~~DiP~vd~~~~~~~~~v~~~ 154 (382) |+|||++||+||||+|||++++|+++++||||+|+|+|.+++++|+++|.+|+|++|||++|+|++|++++++++++|++ T Consensus 79 ~~gip~~~~~~~~p~~~~~~~~p~~~~~l~pv~t~g~W~~~~~~f~v~e~~G~A~~ygd~~D~Pl~d~~~~~~~r~v~~~ 158 (388) T protein:vir:99 79 SIPTPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVEPAGTAMEYGDLTNIPLSSWNVNFERRTIVRG 158 (388) T ss_pred cccHHHHHhhhhccceeeeeechhhhhhhccccccCCccceeEEEeeeecceeEEEeecccCCCceeccceeeeeeEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCcceeccCC----CCcccc Q lcl|NC_017674. 155 ELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAFQTPPS----QGWSTA 230 (382) Q Consensus 155 ~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~~~~a~----~~Wa~k 230 (382) ++||+|+++|+++|+++|++|+++|+.+||+++|+++|+++|||++.+.++++|||||||||++.+++++ ++|++| T Consensus 159 ~~g~~yg~~El~~A~~~g~~l~~~Ka~AA~~ale~~~N~i~f~G~~g~~~~~~yGllNdP~l~a~v~at~~~~~~~Wa~k 238 (388) T protein:vir:99 159 EMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPAIASTTPGGWVSGGAN 238 (388) T ss_pred EeeeeecHHHHHHHHhhCCCcHHHHHHHHHHHHHhhhceEEEEeecCCCccceEEEeeCCCcccccccccCCcCcccccC Confidence 9999999999999999999999999999999999999999999997655679999999999997765432 469999 Q ss_pred CHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhccccCCCCccHHHHHHHhcCccEEEEccccccccCCCC Q lcl|NC_017674. 231 DWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSVTTPYGISVSDWIEQTYPKMRIVSAPELSGVQMKAQ 310 (382) Q Consensus 231 T~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~t~~~~~Tvl~~l~~n~pnl~i~~~peL~~a~g~g~ 310 (382) |++||++||++++++|+.||+|+|++++.|++|+|||+++.+|+++|++|+||++|||+||||++|+++|||+++++ + T Consensus 239 T~~eI~~Di~~~~~~i~~qs~g~~~~~~~~~tL~LP~~~~~~Ls~~n~~g~Tvl~~lk~n~Pnl~i~t~pEl~~a~~--t 316 (388) T protein:vir:99 239 AFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVVTDLGISVRDWLKQTYPRVRVMSAPELQGGNP--D 316 (388) T ss_pred CHHHHHHHHHHHHHHHHHhcCCeeeecccceEEEechHHHHhccccCcCCccHHHHHHHhcCCcEEEEecccccccc--c Confidence 99999999999999999999999999989999999999999999999999999999999999999999999998765 4 Q ss_pred CceeEEEEcchhhhhhhccccccchhhhhhhhhhhhcccceecCCceEeccccceeeeEeeccchheeecCC Q lcl|NC_017674. 311 EPEDALVLFVEDVNAAVDGSTDGGSVFSQLVQSKFITLGVEKRAKSYVEDFSNGTAGALCKRPWAVVRYLGI 382 (382) Q Consensus 311 ~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~~~~~~~~~~~t~Gv~i~~P~aia~~~GI 382 (382) ++++++++++++++...++++++..++.+.+|++|++||+|+++++|+|||++|||||+||||+||+|++|| T Consensus 317 gg~~~~~~~~~~~~~~~~~~~~~~~t~~~~~p~~~~~l~vq~~~~~~~~~~~~rt~Gv~ir~P~Ai~~~~GI 388 (388) T protein:vir:99 317 DGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVTLGVEKRVKNYVEAYSNATAGVMLKRPWAVVRLIGL 388 (388) T ss_pred CCceeEEEEecccccccccCccCcceeEEecccccccccceecCceeEeccccceeeeEEeccchhheeccC Confidence 567899999999999999999999999999999999999999999999999999999999999999999999 No 3 >protein:vir:107732 Length: 379 # NCBI annotation: gp23 # Family: family:all:1653 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024871;genbank:gi:48697513;genbank:GeneID:2948349 Probab=100.00 E-value=3.3e-124 Score=697.54 Aligned_cols=364 Identities=39% Similarity=0.661 Sum_probs=329.6 Q ss_pred CCCcceeeeecCcccccc--ccccccchHHHHHHhhcceeccccchhhhhhhhcccccchhhhhhcccccCc-------- Q lcl|NC_017674. 1 MSQISKTHSRLAGRNAKP--FDLKNITNDAVASLSRIGLVFDHAVVQDQIKALAKAGAFRSGSAMDSNFTAP-------- 70 (382) Q Consensus 1 ~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~amDa~~~~~-------- 70 (382) |+|+||+||||+||++|| ++.++++.++|++|+||||+|++...+.+ ....+||||++.++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~gi~~~~~~~~~~---------~~~~~amd~~~~~~~~~~~~~l 71 (379) T protein:vir:10 1 MPQISKIHSSLNARQMTQMVMDSADVTLDNLKHLESYGIHLNGRKNKLF---------ELMQFAMDSNDIGPIPTPLSPL 71 (379) T ss_pred CCCcceeeeecCccccchhhhccccccHHHHHHHHhcCccccchhhhhh---------hhhhhhhccccccccccccCcc Confidence 999999999999999999 57788999999999999999997754332 13457999996664 Q ss_pred ccccchhHHHHHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEEeeecccceeecccccCCceeeeeeeeeEee Q lcl|NC_017674. 71 VTTPSIPTPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEPAGTAVEYGDHTNIPLTSWNANFERRT 150 (382) Q Consensus 71 ~t~~~~~~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e~~G~a~~ygd~~DiP~vd~~~~~~~~~ 150 (382) ++.++.|+|. ||++|.|++|+++++||++++||||+|+|+|++++++|+++|.+|+|++|||++|+|++|+++++++++ T Consensus 72 ~~~~~~g~~~-~l~~~~p~~i~~~tap~~a~~l~pv~t~g~W~~~~~~~~v~e~~G~A~~ygd~~d~pl~d~~~~~~~r~ 150 (379) T protein:vir:10 72 SPVSIPGLIQ-FLQNWLPGHVRILTAVREADEFLGLSTVGQWDDEQIVQRVLEGLGTAQPYTDGGNMALMSWTPTFETRT 150 (379) T ss_pred ccccccchHH-HHHhhcchHHHHHhhhhhhhhhcccccCCCceeeeEEEeeeeeeeeeEEeccccCCCeeeeeeeeeeee Confidence 3344556555 666666999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCcceeccCC-----C Q lcl|NC_017674. 151 IVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAFQTPPS-----Q 225 (382) Q Consensus 151 v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~~~~a~-----~ 225 (382) +|++++||+|+++|+++|+++|++|+++|+.+||+++|+++|+++|||+.+. ++++|||||||||++.+++++ + T Consensus 151 v~~~~~g~~yg~~El~~Aa~~g~~l~~~Ka~aA~~ale~~~N~i~f~G~~d~-~~~~yGllNdP~l~a~~t~atg~~~~t 229 (379) T protein:vir:10 151 VVRFEAGLQVAPLEEARSSRVQVSSADEKRAMVGEALEVQRNRVAFYGYNDG-SGRTFGFLNDPNLPAYVAVPNGAGGSP 229 (379) T ss_pred eEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCC-CcceEEEEeCCCCcccccccCCccccc Confidence 9999999999999999999999999999999999999999999999997553 589999999999997765543 4 Q ss_pred CccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhccccCCCCccHHHHHHHhcCccEEEEccccccc Q lcl|NC_017674. 226 GWSTADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSVTTPYGISVSDWIEQTYPKMRIVSAPELSGV 305 (382) Q Consensus 226 ~Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~t~~~~~Tvl~~l~~n~pnl~i~~~peL~~a 305 (382) +|++||++||++||++++++++.+|+|.|+++..|++|+|||+++.+|+++|++|+||++|||+|||||+|+++|||+++ T Consensus 230 ~Wa~kT~~eI~~Di~~~~~~l~~qs~g~~~~~~~~~tL~LP~~~~~~L~~~n~~g~Tvl~~lk~n~Pnl~i~t~pEL~~a 309 (379) T protein:vir:10 230 LWAQKTTLEIIADLRNGLTALQVQSMGRIKSNKTPITIGIPNAYENYITTPTELGYSVAQYMRESYPNVTFVSAPELNDA 309 (379) T ss_pred ccccCCHHHHHHHHHHHHHHHHHhhCCeecccccceeEEecHHHHHhhccccccCccHHHHHHHhcCCcEEEEccccccc Confidence 69999999999999999999999999999999899999999999999999999999999999999999999999999997 Q ss_pred cCCCCCceeEEEEcchhhhhhhccccccchhhhhhhhhhhhcccceecCCceEeccccceeeeEeeccchheeecCC Q lcl|NC_017674. 306 QMKAQEPEDALVLFVEDVNAAVDGSTDGGSVFSQLVQSKFITLGVEKRAKSYVEDFSNGTAGALCKRPWAVVRYLGI 382 (382) Q Consensus 306 ~g~g~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~~~~~~~~~~~t~Gv~i~~P~aia~~~GI 382 (382) +|+ +..+++|.+++.+ .+++...++.+.+|++|++||+|+++++|+|||++|||||+||||+||+|++|- T Consensus 310 ggg----~~~~~~~~~~~~~---~~t~~~~~~~~~~p~k~~~l~ve~~~~~~~~~~~~rt~Gv~ir~P~Ai~~~~G~ 379 (379) T protein:vir:10 310 NGG----SSAIYYYADAVEN---NGTDDGRTWLQVVPTKMFTLGVEKKIKGYAEGYTNATAGAMLKRPFATYRQTGA 379 (379) T ss_pred CCC----ccEEEEEeeccCC---CccCCcceEEEecchhhhhccceecCceeEeccccceeeeeeecchhhheecCC Confidence 653 3367888888763 345556778899999999999999999999999999999999999999999999 No 4 >protein:vir:106734 Length: 336 # NCBI annotation: gp13 # Family: family:all:1653 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944321;genbank:gi:38638620;genbank:GeneID:2657363 Probab=100.00 E-value=2.6e-114 Score=643.31 Aligned_cols=334 Identities=24% Similarity=0.353 Sum_probs=307.1 Q ss_pred ccccchHHHHHHhhcceeccccchhhhhhhhcccccchhhhhhcccccCc--ccccchhHHHHHHhhhhhhheecccccc Q lcl|NC_017674. 21 LKNITNDAVASLSRIGLVFDHAVVQDQIKALAKAGAFRSGSAMDSNFTAP--VTTPSIPTPIQFLQTWLPGFVKVMTAAR 98 (382) Q Consensus 21 ~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~amDa~~~~~--~t~~~~~~~~~~l~~idp~v~~~~~~~~ 98 (382) |.+. +++++|+|+||+||++. +.+...+ ..+||||++.++ +|.++.|+++++++||||++||++|+|+ T Consensus 1 ~~~~--~~~~~l~~~gi~~~~~~-~~~~~~~-------~~~a~da~d~~~~~~t~~~~g~~~~l~~~i~p~~~~~~~~~~ 70 (336) T protein:vir:10 1 MRDA--QRIQNLARAGVILPRSV-KNVSTPL-------AEYAMDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPM 70 (336) T ss_pred CchH--HHHHHHhccCeecchhh-hhhhHHH-------HHHHHhhhhhccccccCCCcchHHHHHhhcCcceeeeeechh Confidence 4444 57999999999999864 3344333 456888776654 6788899999999999999999999999 Q ss_pred chhhhCccccCCCcceeeEEEEeeecccceeecccccCCceeeeeeeeeEeeEEEEEEEEEecHHHHHHHHHhCCChHHH Q lcl|NC_017674. 99 KIDEIIGIDTVGSWEDQEIVQGIVEPAGTAVEYGDHTNIPLTSWNANFERRTIVRGELGMMVGTLEEGRASAIRLNSAET 178 (382) Q Consensus 99 ~~~~l~~v~t~g~~~~~t~t~~v~e~~G~a~~ygd~~DiP~vd~~~~~~~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~ 178 (382) ++++||||+|+|+||+++++|+++|.+|++++|||++|+|++|+++++++++++++++||+||++|+++|+++|++|+++ T Consensus 71 ~~~~l~~v~t~g~w~~~~~~~~~~e~~G~a~~ygd~~d~P~~d~~~~~~~~~v~~~~~g~~yg~~El~~A~~~g~~l~~~ 150 (336) T protein:vir:10 71 KAAELVGESKKGDWTTLVAAFITAEPTTKVATYGDYSSDGDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASE 150 (336) T ss_pred chhhhcccccCCCcceeeEEEEeeeeeeeEEEccccCCCcceeeeeeeeeeeEEEEEEEEeeCHHHHHHHHHhCCCcHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCcceeccCCCCccccCHHHHHHHHHHHHHHHHHhcCCeeeecc Q lcl|NC_017674. 179 KRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAFQTPPSQGWSTADWAGIIGDIREAVRQLRIQSQDQIDPKA 258 (382) Q Consensus 179 K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~~~~a~~~Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~ 258 (382) |+.+||+++|+++|+++||||+ ++++|||||||||++.+++++++|++||++||++||++++++|+.||+|.|+++ T Consensus 151 Ka~aA~~ale~~~N~~~~~Gd~---~~~~~GllN~P~l~a~~t~~~~~w~~~T~~eI~~Di~~~~~~l~~qt~g~i~~~- 226 (336) T protein:vir:10 151 LNYSSALGLAKFLNGSYLFGVA---GLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQE- 226 (336) T ss_pred HHHHHHHHHHHhhCeEEEEeec---ccceEEEeecCCCCcccccCcCcccccCHHHHHHHHHHHHHHHHHhcCCeeeec- Confidence 9999999999999999999986 588999999999999888888999999999999999999999999999999887 Q ss_pred ccceEecCHHHHhhccccCCCCccHHHHHHHhcCccEEEEccccccccCCCCCceeEEEEcchhhhhhhccccccchhhh Q lcl|NC_017674. 259 EKITLALATSKVDYLSVTTPYGISVSDWIEQTYPKMRIVSAPELSGVQMKAQEPEDALVLFVEDVNAAVDGSTDGGSVFS 338 (382) Q Consensus 259 ~p~~L~Lp~~~~~~Ls~t~~~~~Tvl~~l~~n~pnl~i~~~peL~~a~g~g~~~~~~~~~~~~~v~~~~~~~~~~~~~~~ 338 (382) +|++|+|||+++.+|+++|++|+|+++|||+|||||+|+++|||++++| ++++++++++. +. .+++ T Consensus 227 ~~~tL~Lp~~~~~~L~~~n~~g~tv~~~lk~n~Pnl~i~t~pel~~Agg------~~~~~~~~~~~----~~----~t~~ 292 (336) T protein:vir:10 227 AVLHMGLPPTAMSDLSKTNQYGLSAAAKLKEIFPKLEFVTIPEYDTASG------RLVQLWAPRVE----GK----DTAT 292 (336) T ss_pred cceEEEechHHHHhccCCCccCccHHHHHHHhCCccEEEEcccccccCC------ceEEEEEeccc----CC----ccee Confidence 6999999999999999999999999999999999999999999987643 46888888864 22 3456 Q ss_pred hhhhhhhhcccceecCCceEeccccceeeeEeeccchheeecCC Q lcl|NC_017674. 339 QLVQSKFITLGVEKRAKSYVEDFSNGTAGALCKRPWAVVRYLGI 382 (382) Q Consensus 339 ~~~p~~~~~l~~~~~~~~~~~~~~~~t~Gv~i~~P~aia~~~GI 382 (382) +.+|++|++||+|+++++|+|||++|||||+||||+||+|++|| T Consensus 293 ~~~P~~f~~lpvq~~~~~~~v~~~~rt~Gv~i~rP~ai~~~~GI 336 (336) T protein:vir:10 293 CGFTEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMLGV 336 (336) T ss_pred eecChhhhccceeecCceeEeccccceeeeeeeccchheeeccC Confidence 67899999999999999999999999999999999999999999 No 5 >protein:vir:78558 Length: 336 # NCBI annotation: major capsid protein # Family: family:all:1653 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294848;genbank:gi:149882911;genbank:GeneID:5291029 Probab=100.00 E-value=4.9e-114 Score=641.79 Aligned_cols=334 Identities=24% Similarity=0.353 Sum_probs=306.9 Q ss_pred ccccchHHHHHHhhcceeccccchhhhhhhhcccccchhhhhhcccccCc--ccccchhHHHHHHhhhhhhheecccccc Q lcl|NC_017674. 21 LKNITNDAVASLSRIGLVFDHAVVQDQIKALAKAGAFRSGSAMDSNFTAP--VTTPSIPTPIQFLQTWLPGFVKVMTAAR 98 (382) Q Consensus 21 ~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~amDa~~~~~--~t~~~~~~~~~~l~~idp~v~~~~~~~~ 98 (382) |.+. +++++|+|+||+||++. +.+...+ ..+||||++.++ +|.++.|+++++++||||++||++++|+ T Consensus 1 ~~~~--~~~~~l~~~gi~~~~~~-~~~~~~~-------~~~a~da~d~~~~~~t~~~~g~~~~l~~~i~p~~~~~~~~~~ 70 (336) T protein:vir:78 1 MRDA--QRIQNLARAGVILPRSV-KNVSTPL-------AEYAMDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPM 70 (336) T ss_pred CchH--HHHHHHhccCeecchhh-hhhhHHH-------HHHHHhhhhhccccccCCCcchHHHHHHhcccceeeehhhhh Confidence 4444 57999999999999864 3344333 456888776654 6788899999999999999999999999 Q ss_pred chhhhCccccCCCcceeeEEEEeeecccceeecccccCCceeeeeeeeeEeeEEEEEEEEEecHHHHHHHHHhCCChHHH Q lcl|NC_017674. 99 KIDEIIGIDTVGSWEDQEIVQGIVEPAGTAVEYGDHTNIPLTSWNANFERRTIVRGELGMMVGTLEEGRASAIRLNSAET 178 (382) Q Consensus 99 ~~~~l~~v~t~g~~~~~t~t~~v~e~~G~a~~ygd~~DiP~vd~~~~~~~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~ 178 (382) ++++||||+|+|+|.+++++|+++|.+|+|++|||++|+|++|+++++++++++++++||+||++|+++|+++|++|+++ T Consensus 71 ~~~~l~~v~t~g~W~~~~~~~~~~e~~G~a~~ygd~~D~P~vd~~~~~~~~~v~~~~~g~~yg~~El~~A~~~g~~l~~~ 150 (336) T protein:vir:78 71 KAAELVGESKKGDWTTLVAAFITAEPTTTVATYGDYSSDGDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASE 150 (336) T ss_pred hhhhhcccccCCCccccEEEEeeeecceeeEEeecccCCCeeecceeeEEEEEEEEEeeeeecHHHHHHHHHhCCCcHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCcceeccCCCCccccCHHHHHHHHHHHHHHHHHhcCCeeeecc Q lcl|NC_017674. 179 KRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAFQTPPSQGWSTADWAGIIGDIREAVRQLRIQSQDQIDPKA 258 (382) Q Consensus 179 K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~~~~a~~~Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~ 258 (382) |+.+||+++|+++|+++|||++ ++++|||||||||++.+++++++|++||+|||++||++++++|+.||+|.|+++ T Consensus 151 Ka~aA~~ale~~~N~~~~~Gd~---~~~~~GllN~P~l~a~~t~~~~~w~~~T~~~I~~Di~~~~~~l~~qt~g~~~~~- 226 (336) T protein:vir:78 151 LNYSSALGLAKFLNGSYLFGVA---GLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQE- 226 (336) T ss_pred HHHHHHHHHHHhhCeEEEEecc---ccceEEEEeCCCCCcccccCcCcccccCHHHHHHHHHHHHHHHHHhcCCeeeec- Confidence 9999999999999999999985 588999999999999888888999999999999999999999999999999887 Q ss_pred ccceEecCHHHHhhccccCCCCccHHHHHHHhcCccEEEEccccccccCCCCCceeEEEEcchhhhhhhccccccchhhh Q lcl|NC_017674. 259 EKITLALATSKVDYLSVTTPYGISVSDWIEQTYPKMRIVSAPELSGVQMKAQEPEDALVLFVEDVNAAVDGSTDGGSVFS 338 (382) Q Consensus 259 ~p~~L~Lp~~~~~~Ls~t~~~~~Tvl~~l~~n~pnl~i~~~peL~~a~g~g~~~~~~~~~~~~~v~~~~~~~~~~~~~~~ 338 (382) +|++|+|||+++.+|+++|++|+||++|||+|||||+|+++|||++++| ++++++++++. + ..++. T Consensus 227 ~~~tL~Lp~~~~~~L~~~n~~g~tv~~~lk~n~Pnl~i~t~pel~~Agg------~~~~~~~~~~~----~----~~t~~ 292 (336) T protein:vir:78 227 AVLHMGLPPTAMSDLSKTNQYGLSAAAKLKEIFPKLEFVTIPEYDTASG------RLVQLWAPRVE----G----KDTAT 292 (336) T ss_pred cceEEEechHHHHhccCCCccCccHHHHHHHhcCccEEEEcccccccCc------ceEEEEEeecc----C----Cccee Confidence 6999999999999999999999999999999999999999999987643 46788888763 2 23456 Q ss_pred hhhhhhhhcccceecCCceEeccccceeeeEeeccchheeecCC Q lcl|NC_017674. 339 QLVQSKFITLGVEKRAKSYVEDFSNGTAGALCKRPWAVVRYLGI 382 (382) Q Consensus 339 ~~~p~~~~~l~~~~~~~~~~~~~~~~t~Gv~i~~P~aia~~~GI 382 (382) +.+|++|++||+|+++++|+|||++|||||+||||+||+|++|| T Consensus 293 ~~~p~~f~~lpvq~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:78 293 CGFTEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) T ss_pred eecchhhhccceeecCceeEeccccceeeeeeeccchheeeccC Confidence 77899999999999999999999999999999999999999999 No 6 >protein:vir:3643 Length: 336 # NCBI annotation: gp12 # Family: family:all:1653 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705638;genbank:gi:23752323;genbank:GeneID:955719 Probab=100.00 E-value=2.1e-112 Score=632.85 Aligned_cols=334 Identities=23% Similarity=0.331 Sum_probs=303.2 Q ss_pred ccccchHHHHHHhhcceeccccchhhhhhhhcccccchhhhhhcccccC--cccccchhHHHHHHhhhhhhheecccccc Q lcl|NC_017674. 21 LKNITNDAVASLSRIGLVFDHAVVQDQIKALAKAGAFRSGSAMDSNFTA--PVTTPSIPTPIQFLQTWLPGFVKVMTAAR 98 (382) Q Consensus 21 ~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~amDa~~~~--~~t~~~~~~~~~~l~~idp~v~~~~~~~~ 98 (382) |.+. +++++|+|+||+|+++..+.... ...++|||++.+ ++|+.+.|+|..+++||||++||++|+|+ T Consensus 1 ~~~~--~~~~~l~~~gi~~~~~~~~~~~~--------~~~~~~da~d~~~~~~~~~~~~~~~~l~~~i~p~~~~~~~~~~ 70 (336) T protein:vir:36 1 MRDA--QRIQNLARAGVILPRSVQNVSTP--------LTEYAMDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPM 70 (336) T ss_pred CchH--HHHHHHhhcCeeecchhhhhhhH--------HHHhhhhhhhccCccccCCCcchHHHHHHhhccceEeeecchh Confidence 4444 57999999999999975433221 245788887665 46788999999999999999999999999 Q ss_pred chhhhCccccCCCcceeeEEEEeeecccceeecccccCCceeeeeeeeeEeeEEEEEEEEEecHHHHHHHHHhCCChHHH Q lcl|NC_017674. 99 KIDEIIGIDTVGSWEDQEIVQGIVEPAGTAVEYGDHTNIPLTSWNANFERRTIVRGELGMMVGTLEEGRASAIRLNSAET 178 (382) Q Consensus 99 ~~~~l~~v~t~g~~~~~t~t~~v~e~~G~a~~ygd~~DiP~vd~~~~~~~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~ 178 (382) ++++||||+|+|+|.+++++|+++|.+|+|++|||++|+|++|+++++++++|+++++||+||++|+++|+++|++|+++ T Consensus 71 ~~~~l~pv~t~g~W~~~~~~~~~~e~~G~a~~ygd~~D~P~~d~~~~~~~~~v~~~~~g~~yg~~E~~~Aa~~~~~l~~~ 150 (336) T protein:vir:36 71 KAAELVGESKKGDWTTLVAAFITAEPTTKVATYGDYSSDGDSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASE 150 (336) T ss_pred hhhhhccccccCCccceeEEEeeeeceeeEEEeeccCCCceeecccceeeeeEEEEEeeeeeCHHHHHHHHHhCCCcHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCcceeccCCCCccccCHHHHHHHHHHHHHHHHHhcCCeeeecc Q lcl|NC_017674. 179 KRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAFQTPPSQGWSTADWAGIIGDIREAVRQLRIQSQDQIDPKA 258 (382) Q Consensus 179 K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~~~~a~~~Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~ 258 (382) |+.+||+++|+++|+++||||+ ++++|||||||||++.+++++++|++||+|||++||++++++|+.||+|.++.+ T Consensus 151 Ka~aA~~ale~~~N~i~~~Gd~---~~~~yGllNdP~l~a~~t~~t~~~~~~t~~ei~~Di~~~~~~l~~qt~G~i~~~- 226 (336) T protein:vir:36 151 LNYSSALGLAKFLNGSYLFGVA---GLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQE- 226 (336) T ss_pred HHHHHHHHHHHhhCcEEEEecc---ccceEEEEecCCCccccccCCCcccccCHHHHHHHHHHHHHHHHHhcCCeeeec- Confidence 9999999999999999999985 588999999999999888888899999999999999999999999999998865 Q ss_pred ccceEecCHHHHhhccccCCCCccHHHHHHHhcCccEEEEccccccccCCCCCceeEEEEcchhhhhhhccccccchhhh Q lcl|NC_017674. 259 EKITLALATSKVDYLSVTTPYGISVSDWIEQTYPKMRIVSAPELSGVQMKAQEPEDALVLFVEDVNAAVDGSTDGGSVFS 338 (382) Q Consensus 259 ~p~~L~Lp~~~~~~Ls~t~~~~~Tvl~~l~~n~pnl~i~~~peL~~a~g~g~~~~~~~~~~~~~v~~~~~~~~~~~~~~~ 338 (382) .|++|+|||+++.+|+++|++|+||++|||+|||||+|+++|||++++| +.++++.+++. +.+ +.. T Consensus 227 ~~~tL~LP~~~~~~Ls~~n~~g~Tvl~~lk~n~Pnl~i~t~pEl~~a~g------~~~~l~~~~~~----~~~----t~~ 292 (336) T protein:vir:36 227 DVLRMGLPPTAMSDLSKTNQYGLAAAAKLKDIFPKLEFVTIPEYDTASG------RLVQLWAPRVE----GKD----TAT 292 (336) T ss_pred cccEEEechHHHHhccCCCccCccHHHHHHHhcCccEEEEccccccCCC------ceEEEEEEecC----CCc----cee Confidence 6999999999999999999999999999999999999999999988743 24566666543 222 345 Q ss_pred hhhhhhhhcccceecCCceEeccccceeeeEeeccchheeecCC Q lcl|NC_017674. 339 QLVQSKFITLGVEKRAKSYVEDFSNGTAGALCKRPWAVVRYLGI 382 (382) Q Consensus 339 ~~~p~~~~~l~~~~~~~~~~~~~~~~t~Gv~i~~P~aia~~~GI 382 (382) ..+|++|++||+|+++++|+|||++|||||+||||+||+|++|| T Consensus 293 ~~~p~~~~~l~vq~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:36 293 CGFTEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) T ss_pred eecchhhhccceeecCceeEeccccceeeeeeeccchheeeecC Confidence 57899999999999999999999999999999999999999999 No 7 >protein:vir:94070 Length: 339 # NCBI annotation: putative structural protein # Family: family:all:1653 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453625;genbank:gi:84662661;genbank:GeneID:5142580 Probab=100.00 E-value=3.4e-112 Score=631.68 Aligned_cols=337 Identities=26% Similarity=0.365 Sum_probs=304.7 Q ss_pred ccccccchHHHHHHhhcceeccccchhhhhhhhcccccchhhhhhcccccC--cccccchhHHHHHHhhhhhhheecccc Q lcl|NC_017674. 19 FDLKNITNDAVASLSRIGLVFDHAVVQDQIKALAKAGAFRSGSAMDSNFTA--PVTTPSIPTPIQFLQTWLPGFVKVMTA 96 (382) Q Consensus 19 ~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~amDa~~~~--~~t~~~~~~~~~~l~~idp~v~~~~~~ 96 (382) +.+ +++-.+++||+|+||+||++..+.+... +..+||||++.+ +.|+++++||+++|+||||++||++|+ T Consensus 1 ~~~-~~~~~~~~~l~~~g~~~~~~~~~~~~~~-------~~~~a~d~~~~~~~~~~~~~~~i~a~~~~~i~~~vy~~~~~ 72 (339) T protein:vir:94 1 MSI-NNDRTDIKQLEKVGIIFDGYSPKSISSE-------VSAYAMDAVNLTPTLQTTANAGIPAWMTTFVDRRVIDIQLA 72 (339) T ss_pred Cce-echHHHHHHHHhhceeeccchhhhcchh-------hHhhhccccccccccccccccchhhhhhhhhchhheeeccc Confidence 222 2344678999999999999987755433 456899998655 578889999999999999999999999 Q ss_pred ccchhhhCccccCCCcceeeEEEEeeecccceeecccccCCceeeeeeeeeEeeEEEEEEEEEecHHHHHHHHHhCCChH Q lcl|NC_017674. 97 ARKIDEIIGIDTVGSWEDQEIVQGIVEPAGTAVEYGDHTNIPLTSWNANFERRTIVRGELGMMVGTLEEGRASAIRLNSA 176 (382) Q Consensus 97 ~~~~~~l~~v~t~g~~~~~t~t~~v~e~~G~a~~ygd~~DiP~vd~~~~~~~~~v~~~~~g~~y~~~El~~A~~~g~~l~ 176 (382) ++++++||||+|+|+|++++++|+++|.+|+|++|||++|+|++|+++++++++++++++||+|+++|+++|+++|++|+ T Consensus 73 ~~~~~~l~pv~t~g~w~~~t~~y~~~e~~G~a~~ygd~ad~Pl~~~~v~~~~~~v~~~~~g~~y~~~E~~~A~~~g~~l~ 152 (339) T protein:vir:94 73 PMAAAKIFPEVKKGDWTTTYGVFIIAEPVGQVATYSDWSANGMSKANVNFESRQNYRYQTWTEYGDLEMATYGEAGIDYV 152 (339) T ss_pred ccchhhhcccccCCCCcccEEEEeeeecccceEEcccccCCCcccccceeeEEeEEEEEEEEeecHHHHHHHHhhCCChH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCcceeccCCCCccccCHHHHHHHHHHHHHHHHHhcCCeeee Q lcl|NC_017674. 177 ETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAFQTPPSQGWSTADWAGIIGDIREAVRQLRIQSQDQIDP 256 (382) Q Consensus 177 ~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~~~~a~~~Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~ 256 (382) ++|+.+||+++++++|+++|||++ ++++|||||||||++.+ +++++|++||++||++||++++++|+.+|+|.|++ T Consensus 153 ~~Ka~aA~~al~~~~N~i~~~Gd~---~~~~~GLlN~P~l~~~v-~~s~~Wa~kT~~eI~~Di~~~~~~l~~~s~g~~~~ 228 (339) T protein:vir:94 153 ARQEISASLVMAKFANSSYLLGVA---GIANYGLMNDPSLPAPV-AATVNWATAAPEDIANDVVAMVGRLISQSGGLITG 228 (339) T ss_pred HHHHHHHHHHHHHhhceEEeeeec---ccceEEEEeCCCccccc-cCCCCcccCCHHHHHHHHHHHHHHHHHhcCCeeee Confidence 999999999999999999999996 47899999999998765 45678999999999999999999999999999998 Q ss_pred ccccceEecCHHHHhhccccCCCCccHHHHHHHhcCccEEEEccccccccCCCCCceeEEEEcchhhhhhhccccccchh Q lcl|NC_017674. 257 KAEKITLALATSKVDYLSVTTPYGISVSDWIEQTYPKMRIVSAPELSGVQMKAQEPEDALVLFVEDVNAAVDGSTDGGSV 336 (382) Q Consensus 257 ~~~p~~L~Lp~~~~~~Ls~t~~~~~Tvl~~l~~n~pnl~i~~~peL~~a~g~g~~~~~~~~~~~~~v~~~~~~~~~~~~~ 336 (382) + +|++|+|||+++.+|+++|++|+|+++|||+|||||+|+++|||+++++ +.++++++++. +. .+ T Consensus 229 ~-~~~~L~LP~~~~~~L~~~n~~~~Tvl~~lk~n~pnl~i~~~~el~~a~g------~~~~~~~~~~~----~~----~~ 293 (339) T protein:vir:94 229 Q-ERMVMALAPSALNNVNRTNNFGLSAGAKIAQTYPNIQFVAVPEFDTASG------RLVQLWVPEVN----GQ----PT 293 (339) T ss_pred c-cCcEEEecHHHHHhcccCCcCCccHHHHHHHhcCCcEEEEccccccCCC------ceEEEEEEecc----CC----cc Confidence 7 5999999999999999999999999999999999999999999987643 24555555542 22 23 Q ss_pred hhhhhhhhhhcccceecCCceEeccccceeeeEeeccchheeecCC Q lcl|NC_017674. 337 FSQLVQSKFITLGVEKRAKSYVEDFSNGTAGALCKRPWAVVRYLGI 382 (382) Q Consensus 337 ~~~~~p~~~~~l~~~~~~~~~~~~~~~~t~Gv~i~~P~aia~~~GI 382 (382) ..+.+||+|++||+|+++++|+|||++|||||+||||+||+|++|| T Consensus 294 ~~~~~p~~~~~lpvq~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 339 (339) T protein:vir:94 294 GEVAFAEKLRSHSIERYSTTTRQKHSGATFGAVIYQPWAVTQELGV 339 (339) T ss_pred eEEEcchhhhccccEEcCceEEecceeeeeeEEEEccceeeeeecC Confidence 4577899999999999999999999999999999999999999999 No 8 >protein:vir:101557 Length: 336 # NCBI annotation: gp12 # Family: family:all:1653 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958117;genbank:gi:41057663;genbank:GeneID:2716814 Probab=100.00 E-value=3.2e-112 Score=631.83 Aligned_cols=334 Identities=23% Similarity=0.334 Sum_probs=302.2 Q ss_pred ccccchHHHHHHhhcceeccccchhhhhhhhcccccchhhhhhcccccC--cccccchhHHHHHHhhhhhhheecccccc Q lcl|NC_017674. 21 LKNITNDAVASLSRIGLVFDHAVVQDQIKALAKAGAFRSGSAMDSNFTA--PVTTPSIPTPIQFLQTWLPGFVKVMTAAR 98 (382) Q Consensus 21 ~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~amDa~~~~--~~t~~~~~~~~~~l~~idp~v~~~~~~~~ 98 (382) |.+. +++++|+|+||+|+++..+.... ...+||||++.+ ++|+++.|+|+.+.+||||++||++++|| T Consensus 1 ~~~~--~~~~~l~~~gi~~~~~~~~~~~~--------~~~~~~da~d~~~~~~~~~~~~i~~~l~~~i~p~~~~~~~~p~ 70 (336) T protein:vir:10 1 MRDA--QRIQNLARAGVILPRSVQNVSTP--------LTEYAMDAADLSPHLSSTGSSGIPNYLTTYVDPAVIDILVAPM 70 (336) T ss_pred CchH--HHHHHHhhcCeeecchhhhhhhh--------HHHhhhhhhhccCccccCCCchhHHHHHhhcccceeeehhhhh Confidence 4444 57999999999999975433221 234678776554 46788999999999999999999999999 Q ss_pred chhhhCccccCCCcceeeEEEEeeecccceeecccccCCceeeeeeeeeEeeEEEEEEEEEecHHHHHHHHHhCCChHHH Q lcl|NC_017674. 99 KIDEIIGIDTVGSWEDQEIVQGIVEPAGTAVEYGDHTNIPLTSWNANFERRTIVRGELGMMVGTLEEGRASAIRLNSAET 178 (382) Q Consensus 99 ~~~~l~~v~t~g~~~~~t~t~~v~e~~G~a~~ygd~~DiP~vd~~~~~~~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~ 178 (382) ++++||||+|+|+|.+++++|+++|.+|+|++|||++|+|++|+++++++++|+++++||+||++|+++|+++|++|+++ T Consensus 71 ~a~~l~pv~t~g~W~~~~~~~~~~e~~G~a~~ygd~~D~P~~d~~~~~~~~~v~~~~~g~~yg~~El~~A~~~g~~l~~~ 150 (336) T protein:vir:10 71 KAAELVGESKKGDWTTLVAAFITAEPTTKVATYGDYSSDGDSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASE 150 (336) T ss_pred hhhhhccccccCCccceeEEEeeeeceeeEEEeeccCCCceeecccceeeeeEEEEEeeeeeCHHHHHHHHHhCCCcHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCcceeccCCCCccccCHHHHHHHHHHHHHHHHHhcCCeeeecc Q lcl|NC_017674. 179 KRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAFQTPPSQGWSTADWAGIIGDIREAVRQLRIQSQDQIDPKA 258 (382) Q Consensus 179 K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~~~~a~~~Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~ 258 (382) |+.+||+++|+++|+++||||+ ++++|||||||||++.+++++++|++||+|||++||++++++|+.||+|+++.+ T Consensus 151 Ka~aA~~ale~~~N~i~~~Gd~---~~~~yGllN~P~l~a~~t~~t~~~~~~t~eei~~Di~~~~~~l~~qs~G~i~~~- 226 (336) T protein:vir:10 151 LNYSSALGLAKFLNGSYLFGVA---GLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQE- 226 (336) T ss_pred HHHHHHHHHHHhhCcEEEEecc---ccceEEEEeCCCCccccccCCCcccccCHHHHHHHHHHHHHHHHHhcCCeeccc- Confidence 9999999999999999999985 588999999999998888888899999999999999999999999999998765 Q ss_pred ccceEecCHHHHhhccccCCCCccHHHHHHHhcCccEEEEccccccccCCCCCceeEEEEcchhhhhhhccccccchhhh Q lcl|NC_017674. 259 EKITLALATSKVDYLSVTTPYGISVSDWIEQTYPKMRIVSAPELSGVQMKAQEPEDALVLFVEDVNAAVDGSTDGGSVFS 338 (382) Q Consensus 259 ~p~~L~Lp~~~~~~Ls~t~~~~~Tvl~~l~~n~pnl~i~~~peL~~a~g~g~~~~~~~~~~~~~v~~~~~~~~~~~~~~~ 338 (382) .|++|+|||+++.+|+++|++|+||++|||+|||||+|+++|||++++|+ .++++.+++. +.+ +.. T Consensus 227 ~~~tL~LP~~~~~~Ls~~n~~g~Tvl~~lk~n~Pnl~i~t~pEl~~a~G~------~~~l~~~~~~----~~~----t~~ 292 (336) T protein:vir:10 227 DVLRMGLPPTAMSDLSKTNQYGLAAAAKLKDIFPKLEFVTIPEYDTASGR------LVQLWAPRVE----GKD----TAT 292 (336) T ss_pred CcceEEecHHHHHhccCCCccCccHHHHHHHhcCccEEEEccccccCCCc------eEEEEEEecC----CCc----cee Confidence 69999999999999999999999999999999999999999999887432 4566666543 222 344 Q ss_pred hhhhhhhhcccceecCCceEeccccceeeeEeeccchheeecCC Q lcl|NC_017674. 339 QLVQSKFITLGVEKRAKSYVEDFSNGTAGALCKRPWAVVRYLGI 382 (382) Q Consensus 339 ~~~p~~~~~l~~~~~~~~~~~~~~~~t~Gv~i~~P~aia~~~GI 382 (382) ..+|++|++||+|+++++|+|||++|||||+||||+||+|++|| T Consensus 293 ~~~p~~~~~l~vq~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:10 293 CGFTEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) T ss_pred eecchhhhccceeecCceeEeccccceeeeeeeccchheeeecC Confidence 56899999999999999999999999999999999999999999 No 9 >protein:vir:79642 Length: 329 # NCBI annotation: HsbB # Family: family:all:463 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285525;genbank:gi:148734508;genbank:GeneID:5220000 Probab=100.00 E-value=1.6e-90 Score=512.91 Aligned_cols=320 Identities=14% Similarity=0.090 Sum_probs=280.8 Q ss_pred hcceeccccchhhhhhhhcccccchhhhhhcccccCcccccc--hhHHHHHHhhhhhhheeccccccchhhhCccccCCC Q lcl|NC_017674. 34 RIGLVFDHAVVQDQIKALAKAGAFRSGSAMDSNFTAPVTTPS--IPTPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGS 111 (382) Q Consensus 34 ~~g~~~~~~~~~~~~~~~~~~~~~~~~~amDa~~~~~~t~~~--~~~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~ 111 (382) -.|.++ .+.+..+.+..+.+|||++......+.+ ..|++++|++|||++||+++++++++++||++++++ T Consensus 1 ~~~~~~--------~~~~~~d~~~~~~~a~~~~~~~~~~~~~~~~~f~~~ql~~id~~v~e~~~~~l~~~~~i~i~~~~~ 72 (329) T protein:vir:79 1 MRGNIM--------SKEMKYDEFEANVIANHMQLRGAKNDASDMGIWTSQELHKIKAQAYEKEYPAGSALRVFPVTSELS 72 (329) T ss_pred Cccchh--------hhhhccchhhhhhHhhhcccccceeccchhhHHHHHHHHHHHHHHHhhhhcccchhhhcccccCCC Confidence 223332 2344455555566888888555443332 569999999999999999999999999999999999 Q ss_pred cceeeEEEEeeecccceeecccc-cCCceeeeeeeeeEeeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHh Q lcl|NC_017674. 112 WEDQEIVQGIVEPAGTAVEYGDH-TNIPLTSWNANFERRTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIF 190 (382) Q Consensus 112 ~~~~t~t~~v~e~~G~a~~ygd~-~DiP~vd~~~~~~~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~ 190 (382) ||+++++|+++|.+|++++|||+ +|+|++|++++++++++++++.+|+|+++|+++|+++|++|+++|+.+|+++++++ T Consensus 73 ~~~~~~t~~~~~~~G~a~~~~d~~~dip~vd~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~ 152 (329) T protein:vir:79 73 DTDKTFEYQTFDKVGHAKIIADYTDDLSTVDALMTSEFGKVFRLGNAFLISIDEIKAGQRTGKSLSTRKANAAQNAHDQL 152 (329) T ss_pred CceeEEEeeeeecceeeeeecCcccccceeecccceeEEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHh Confidence 99999999999999999999996 46799999999999999999999999999999999999999999999999999999 Q ss_pred hccEEEEeeccCCcccceEEEeCCCCcceeccCC--CCccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHH Q lcl|NC_017674. 191 RNAIGFYGWQSGLGNRTYGFLNDPNLPAFQTPPS--QGWSTADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATS 268 (382) Q Consensus 191 ~n~i~~~Gd~~g~~~g~~GllN~P~l~~~~~~a~--~~Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~ 268 (382) +|+++|||++ ++++||||||||+++..++++ ++|++||++||++||++++++++.+|+|.+ .|++|+|||+ T Consensus 153 ~n~i~f~G~~---~~g~~GLlN~p~v~~~~~~~~~~~~w~~kt~~ei~~di~~~~~~l~~~s~g~~----~p~~L~Lpp~ 225 (329) T protein:vir:79 153 VNHLVFKGSK---PHKIISVFEHPNLTTINSAGWNNAAGTGKKPETAQDELEQAIEKIETLTNGQH----RANMILIPPS 225 (329) T ss_pred hccEEEeecc---cccceeeecCCCccccccCCCCCccccccCHHHHHHHHHHHHHHHHHhcCcee----cccEEEecHH Confidence 9999999985 588999999999987655443 469999999999999999999999999987 4779999999 Q ss_pred HHhhccc-cCCCCccHHHHHHHhcCccEEEEccccccccCCCCCceeEEEEcchhhhhhhccccccchhhhhhhhhhhhc Q lcl|NC_017674. 269 KVDYLSV-TTPYGISVSDWIEQTYPKMRIVSAPELSGVQMKAQEPEDALVLFVEDVNAAVDGSTDGGSVFSQLVQSKFIT 347 (382) Q Consensus 269 ~~~~Ls~-t~~~~~Tvl~~l~~n~pnl~i~~~peL~~a~g~g~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~p~~~~~ 347 (382) ++.+|++ .+++|+|+++||++|||+++|+++|||++++++ ++++++.|.++. ..+.+.+|++|++ T Consensus 226 ~~~~L~~~~~~~~~tvl~~lk~~~~~l~I~~~~el~~ag~~---g~~~~v~y~~~~-----------~~~~~~vp~~~~~ 291 (329) T protein:vir:79 226 MRKVLMVRMPETTMSYLDYFKQQNGGITIESISELEDIDGA---GTKAALVYEKDP-----------MNMSIEIPEAFNM 291 (329) T ss_pred HHHHhhcccCCCCccHHHHHHHhCCCcEEEEcccccccCCC---CceEEEEEecCC-----------ceEEEecCcceee Confidence 9999975 467899999999999999999999999988554 455677765543 2356678999999 Q ss_pred ccceecCCceEeccccceeeeEeeccchheeecCC Q lcl|NC_017674. 348 LGVEKRAKSYVEDFSNGTAGALCKRPWAVVRYLGI 382 (382) Q Consensus 348 l~~~~~~~~~~~~~~~~t~Gv~i~~P~aia~~~GI 382 (382) ||+|+++++|++||++|||||+||||+||+|++|| T Consensus 292 l~~q~~~~~~~v~~~~r~~Gv~i~~P~ai~~~dGI 326 (329) T protein:vir:79 292 LTAQPKDLHFKVPCTSKCTGLTIYRPLTLVLIKGL 326 (329) T ss_pred eeceecCceEEEceeeeEEEEEEECcceeeeeeee Confidence 99999999999999999999999999999999999 No 10 >protein:vir:104342 Length: 314 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398971;genbank:gi:81343955;genbank:GeneID:3778874 Probab=100.00 E-value=8.4e-89 Score=503.44 Aligned_cols=309 Identities=13% Similarity=0.112 Sum_probs=265.2 Q ss_pred cceeccccchhhhhhhhcccccchhhhhhcccccCcccccchhHHHHHHhhhhhhheeccccccchhhhCccccCCCcce Q lcl|NC_017674. 35 IGLVFDHAVVQDQIKALAKAGAFRSGSAMDSNFTAPVTTPSIPTPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWED 114 (382) Q Consensus 35 ~g~~~~~~~~~~~~~~~~~~~~~~~~~amDa~~~~~~t~~~~~~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~ 114 (382) .-+.|+. ..-.+... .+.||.. .-+....|++++|++|||+|||+++++++++++||++++++||+ T Consensus 1 ~~~~~~~-~~~~~~~~---------~~~~~~~----~~d~~~~fl~~ql~~id~~v~e~~~~~~~~~~~i~v~~~~~~~~ 66 (314) T protein:vir:10 1 MAIKFDA-EQAKITTH---------LEQMGVE----KADAAGIWAVSQLTAALNRAYEKEYAENSVVNIFPVTNEIPGHA 66 (314) T ss_pred CccchHH-HHHHHHHH---------HHhhccc----chhhhHHHHHHHHHHHHHHHhhhhccccccceeeccccCCCCce Confidence 2233331 11111111 1122211 11123569999999999999999999999999999999999999 Q ss_pred eeEEEEeeecccceeeccccc-CCceeeeeeeeeEeeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhcc Q lcl|NC_017674. 115 QEIVQGIVEPAGTAVEYGDHT-NIPLTSWNANFERRTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNA 193 (382) Q Consensus 115 ~t~t~~v~e~~G~a~~ygd~~-DiP~vd~~~~~~~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~ 193 (382) ++++|+++|.+|++++|||++ |+|++|++++++++++++|+.+|+|+++|+++|+++|++|+++|+.+|++++++++|+ T Consensus 67 et~~~~~~e~~G~a~~~~d~~~dip~vd~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~ 146 (314) T protein:vir:10 67 KYFEYPEFDGVGIAQIIADYSDDLPLVDAFMTEKQGKVFRFGNAFLISTDEIKAGAATGQSLSARKQALAFEAHDNLLDK 146 (314) T ss_pred eEEEeeeeccccceeeeCCcccccceeecccceeEEEEEEEEeeEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhce Confidence 999999999999999999975 5799999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEeeccCCcccceEEEeCCCCcceeccCCCCccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhc Q lcl|NC_017674. 194 IGFYGWQSGLGNRTYGFLNDPNLPAFQTPPSQGWSTADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYL 273 (382) Q Consensus 194 i~~~Gd~~g~~~g~~GllN~P~l~~~~~~a~~~Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~L 273 (382) ++|+|++ ++|+|||||||||+.. +++++| +|++||++||++++++++++|+|.|. |++|+|||+++.+| T Consensus 147 i~f~G~~---~~g~~GLlN~p~v~~~--~~~~~W--aT~~ei~~Di~~~~~~l~~~s~g~~~----p~~l~Lpp~~~~~L 215 (314) T protein:vir:10 147 LVWSGSA---PHGIVSVFDQPNINNV--VATPNW--SVPQNAIDDVTAMIDAVESSTQGLHH----VTDILLPASARRVM 215 (314) T ss_pred EEEeecc---cccceeEeecCCCccc--cCCCCc--ccHHHHHHHHHHHHHHHHHhcCcccc----ceeEEecHHHHHhh Confidence 9999985 4789999999999854 346789 59999999999999999999999884 67999999999999 Q ss_pred cccCC-CCccHHHHHHHhcCccEEEEccccccccCCCCCceeEEEEcchhhhhhhccccccchhhhhhhhhhhhccccee Q lcl|NC_017674. 274 SVTTP-YGISVSDWIEQTYPKMRIVSAPELSGVQMKAQEPEDALVLFVEDVNAAVDGSTDGGSVFSQLVQSKFITLGVEK 352 (382) Q Consensus 274 s~t~~-~~~Tvl~~l~~n~pnl~i~~~peL~~a~g~g~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~~~ 352 (382) +++++ +|+|+++||++||||++|+++|||+++++++ +++++.|.++. ..+.+++|++|++||+|+ T Consensus 216 ~~~~~~~~~tvl~~l~~n~~~l~I~~~~el~~ag~~g---~~~~v~y~~~~-----------~~~~~~vp~~~~~l~~e~ 281 (314) T protein:vir:10 216 QGLVPQTNLSYGELFTRNNPGLTIRFLQFLDNYDGAG---GKAALAFEKSP-----------LNMSIEIPEVTNVLPAQP 281 (314) T ss_pred cccccCCCccHHHHHHHhCCCcEEEEcccccccCCCc---ceEEEEEecCC-----------cEEEEecCccceeeccee Confidence 98754 6999999999999999999999999887654 44566664432 245678899999999999 Q ss_pred cCCceEeccccceeeeEeeccchheeecCC Q lcl|NC_017674. 353 RAKSYVEDFSNGTAGALCKRPWAVVRYLGI 382 (382) Q Consensus 353 ~~~~~~~~~~~~t~Gv~i~~P~aia~~~GI 382 (382) ++++|++||++|||||+||||+||+|++|| T Consensus 282 ~~~~~~~~~~~r~~Gv~i~~P~ai~~~dGI 311 (314) T protein:vir:10 282 KDLHFRYPVTSKATGLIVYRPLTMAVIKGI 311 (314) T ss_pred cCceEEEcceeeeEEEEEECcceeEeeeee Confidence 999999999999999999999999999999 No 11 >protein:vir:107687 Length: 319 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003898;genbank:gi:45686314;genbank:GeneID:2773027 Probab=100.00 E-value=2.9e-87 Score=495.03 Aligned_cols=316 Identities=15% Similarity=0.138 Sum_probs=270.1 Q ss_pred ccccchHHHHHHhhcceeccccchhhhhhhhcccccchhhhhhcccccCcccccch-hHHHHHHhhhhhhheeccccccc Q lcl|NC_017674. 21 LKNITNDAVASLSRIGLVFDHAVVQDQIKALAKAGAFRSGSAMDSNFTAPVTTPSI-PTPIQFLQTWLPGFVKVMTAARK 99 (382) Q Consensus 21 ~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~amDa~~~~~~t~~~~-~~~~~~l~~idp~v~~~~~~~~~ 99 (382) |+.. ++|.+....+.....+ +.||.+. ..+. .|++++|++|||++||+++++++ T Consensus 1 ~~~~-------------~~~~~~~~~~~~~~~~-------~~~~~da-----~~~~g~~~~~ql~~id~~v~e~~~~~l~ 55 (319) T protein:vir:10 1 MTTK-------------KFDEADKSNVEMYLIQ-------AGVKQDA-----AATMGIWTAQELHRIKSQSYEEDYPVGS 55 (319) T ss_pred CCCc-------------chhHHhhHHHHHHHhh-------ccchhhh-----hhhhhhHHHHHHHHHHHHHHhhhhccee Confidence 2221 1111111100001000 1122221 1233 48899999999999999999999 Q ss_pred hhhhCccccCCCcceeeEEEEeeecccceeecccccC-CceeeeeeeeeEeeEEEEEEEEEecHHHHHHHHHhCCChHHH Q lcl|NC_017674. 100 IDEIIGIDTVGSWEDQEIVQGIVEPAGTAVEYGDHTN-IPLTSWNANFERRTIVRGELGMMVGTLEEGRASAIRLNSAET 178 (382) Q Consensus 100 ~~~l~~v~t~g~~~~~t~t~~v~e~~G~a~~ygd~~D-iP~vd~~~~~~~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~ 178 (382) ++++||+.++++||+++++|.++|.+|++++|||+++ +|++|++++++.+++++++.+|+|+++|+++|+++|++|+++ T Consensus 56 ~~~~i~v~~~~~~~~~~~~~~~~~~~G~a~~~~d~~~dip~v~~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~ 135 (319) T protein:vir:10 56 ALRVFPVTTELSPTDKTFEYMTFDKVGTAQIIADYTDDLPLVDALGTSEFGKVFRLGNAYLISIDEIKAGQATGRPLSTR 135 (319) T ss_pred chhhcccccCCCCceEEEEeeeeccccceeeecCccccccceeccceeeEEEEEEEEeeeeecHHHHHHHHHhCCChHHH Confidence 9999999999999999999999999999999999764 799999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCcceeccCCCCccccCHHHHHHHHHHHHHHHHHhcCCeeeecc Q lcl|NC_017674. 179 KRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAFQTPPSQGWSTADWAGIIGDIREAVRQLRIQSQDQIDPKA 258 (382) Q Consensus 179 K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~~~~a~~~Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~ 258 (382) |+.+|++++++++|+++|+|++ +.|+||||||||+++.+++...+|++||++||++||++++++++++|+|.|. T Consensus 136 k~~aA~~~~~~~~n~i~f~G~~---~~g~~GLlN~p~~~~~~~~~~~~~~t~t~~~i~~di~~~~~~l~~~s~g~~~--- 209 (319) T protein:vir:10 136 KASACQLAHDQLVNRLVFKGSA---PHKIVSVFNHPNITKITSGKWIDVSTMKPETAEAELTQAIETIETITRGQHR--- 209 (319) T ss_pred HHHHHHHHHHHhhceEEEeecc---cccceeEEeCCCceeeecCCCCCccccCHHHHHHHHHHHHHHHHHhcCceee--- Confidence 9999999999999999999985 5789999999999988776667789999999999999999999999999984 Q ss_pred ccceEecCHHHHhhccc-cCCCCccHHHHHHHhcCccEEEEccccccccCCCCCceeEEEEcchhhhhhhccccccchhh Q lcl|NC_017674. 259 EKITLALATSKVDYLSV-TTPYGISVSDWIEQTYPKMRIVSAPELSGVQMKAQEPEDALVLFVEDVNAAVDGSTDGGSVF 337 (382) Q Consensus 259 ~p~~L~Lp~~~~~~Ls~-t~~~~~Tvl~~l~~n~pnl~i~~~peL~~a~g~g~~~~~~~~~~~~~v~~~~~~~~~~~~~~ 337 (382) |++|+|||+++.+|++ .+++|+|+++||++||||++|+++|||++++++ |++++++|.++. ..+ T Consensus 210 -p~~L~L~p~~~~~L~~~~~~~~~t~l~~lk~~~~~l~I~~~pel~~ag~~---g~~~~v~y~~~~-----------~~~ 274 (319) T protein:vir:10 210 -ATNILIPPSMRKVLAIRMPETTMSYLDYFKSQNSGIEIDSIAELEDIDGA---GTKGVLVYEKNP-----------MNM 274 (319) T ss_pred -ceEEEecHHHHHhhhcccCCCCeeHHHHHHHhcCCceEEEeeeecccCCC---cceEEEEEecCC-----------ceE Confidence 6799999999999975 567899999999999999999999999988654 345677765542 245 Q ss_pred hhhhhhhhhcccceecCCceEeccccceeeeEeeccchheeecCC Q lcl|NC_017674. 338 SQLVQSKFITLGVEKRAKSYVEDFSNGTAGALCKRPWAVVRYLGI 382 (382) Q Consensus 338 ~~~~p~~~~~l~~~~~~~~~~~~~~~~t~Gv~i~~P~aia~~~GI 382 (382) .+.+|++|++||+|+++++|++||++|||||+||||.||+|++|| T Consensus 275 ~~~v~~~~~~~~~e~~~l~~~~~~~~r~~Gv~i~~P~ai~~~dGI 319 (319) T protein:vir:10 275 SIEIPEAFNMLPAQPKDLHFKVPCTSKCTGLTIYRPMTIVLITGV 319 (319) T ss_pred EEecCcceeeeeeeecCceEEEeeeeeeEEEEEEccceeEeeecC Confidence 678899999999999999999999999999999999999999999 No 12 >protein:vir:80068 Length: 301 # NCBI annotation: gp8 # Family: family:all:463 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468712;genbank:gi:157325292;genbank:GeneID:5601759 Probab=100.00 E-value=3.9e-86 Score=488.82 Aligned_cols=292 Identities=15% Similarity=0.132 Sum_probs=269.0 Q ss_pred cccccchhHHHHHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEEeeecccceeeccccc-CCceeeeeeeeeE Q lcl|NC_017674. 70 PVTTPSIPTPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEPAGTAVEYGDHT-NIPLTSWNANFER 148 (382) Q Consensus 70 ~~t~~~~~~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e~~G~a~~ygd~~-DiP~vd~~~~~~~ 148 (382) ++++.+.+|++++|++|||++||++++++++++|||++++++||+++++|.++|.+|++++|||++ |+|+++++++++. T Consensus 1 ~~~~~~g~f~~~~l~~id~~v~e~~~~~l~~r~l~~v~~~~~~~~~~~~~~~~~~~G~~~~~~~~~~dip~~~~~~~~~~ 80 (301) T protein:vir:80 1 MQGKITATIEARDLQAIDNVIYEPKQEELTARSVFPQKFDVNEGAESYSFDVMTRSGAAKIIANGADDLPLVDVDMVRKS 80 (301) T ss_pred CCccccchhhHHHHHHHHHHHHHhhhhhhhhhhhcccccCCCCceEEEEEeeeccceeEEEecCcccccccccccceeEE Confidence 566677889999999999999999999999999999999999999999999999999999999965 5799999999999 Q ss_pred eeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCcceeccCC---- Q lcl|NC_017674. 149 RTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAFQTPPS---- 224 (382) Q Consensus 149 ~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~~~~a~---- 224 (382) +++++++.+|+|+++||++|+++|++|+++|+.+|++++++++|+++|+|++ +.|+|||||+||+++..++++ T Consensus 81 ~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~f~G~~---~~g~~GLlN~p~~~~~~~~~~~~~~ 157 (301) T protein:vir:80 81 VPIYSIGIGLSYTIQDLRAARMQGTTVDAAKATTVRRAIAEKENSIAFRGEK---KYAIKGAFEATGIQIDVSPTTGVGN 157 (301) T ss_pred EEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEeeecc---cccceeeecCCCcccccccCccccc Confidence 9999999999999999999999999999999999999999999999999985 578999999999987765543 Q ss_pred -CCccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhcccc---CCCCccHHHHHHHhcCccEEEEcc Q lcl|NC_017674. 225 -QGWSTADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSVT---TPYGISVSDWIEQTYPKMRIVSAP 300 (382) Q Consensus 225 -~~Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~t---~~~~~Tvl~~l~~n~pnl~i~~~p 300 (382) ++|++||++||++||++++++++.+|+|.+. |++|+|||++|.+|+++ +.+|+|+++||++|||+++|+++| T Consensus 158 ~~~w~~~t~~ei~~di~~~~~~l~~~s~g~~~----p~~L~L~p~~~~~L~~~~~~~~~~~tvl~~l~~~~~~~~I~~~p 233 (301) T protein:vir:80 158 VSKWEKKTAEQIIDEIGEAHTKITVLPGYGTA----SLKLCLPPKQFELINKKRYSNEDSRSVLKVLQDNAWFSAIVRVP 233 (301) T ss_pred ccccccCCHHHHHHHHHHHHHHHHHhcCceec----ccEEEecHHHHHhhhhccccCCCCeeHHHHHHHHcCcceEEEcc Confidence 5799999999999999999999999999874 77999999999999865 678999999999999999999999 Q ss_pred ccccccCCCCCceeEEEEcchhhhhhhccccccchhhhhhhhhhhhcccceecCCceEeccccceeeeEeeccchheeec Q lcl|NC_017674. 301 ELSGVQMKAQEPEDALVLFVEDVNAAVDGSTDGGSVFSQLVQSKFITLGVEKRAKSYVEDFSNGTAGALCKRPWAVVRYL 380 (382) Q Consensus 301 eL~~a~g~g~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~~~~~~~~~~~t~Gv~i~~P~aia~~~ 380 (382) ||++++++ +.+++++|.++. ..+.+.+|++|++||+|+++++|++||++|||||+||||.||+|++ T Consensus 234 ~L~~~g~~---g~~~~v~~~~~~-----------d~~~~~v~~~~~~~~~e~~~~~~~~~~~~r~~Gv~i~~P~ai~~~~ 299 (301) T protein:vir:80 234 DLAGMGTA---GSDSFAVIHDSN-----------ETAELIIPMDITRHPEEYSFPRTKVPFEERTAGVVVRFPAAIVRVD 299 (301) T ss_pred eeccCCCC---cccEEEEEecCC-----------cEEEEEecCceeeecceecCceeEeeeeeeeEEEEEEccceEEEEe Confidence 99988654 455777775543 2345678999999999999999999999999999999999999999 Q ss_pred CC Q lcl|NC_017674. 381 GI 382 (382) Q Consensus 381 GI 382 (382) || T Consensus 300 GI 301 (301) T protein:vir:80 300 GI 301 (301) T ss_pred cC Confidence 99 No 13 >protein:vir:5255 Length: 304 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852760;genbank:gi:31544035;uniprot:Q7Y5U0;genbank:GeneID:2753552 Probab=100.00 E-value=8.7e-85 Score=481.44 Aligned_cols=289 Identities=10% Similarity=0.055 Sum_probs=258.4 Q ss_pred ccchhHHHHHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEEeeeccccee--eccc-ccCCceeeeeeeeeEe Q lcl|NC_017674. 73 TPSIPTPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEPAGTAV--EYGD-HTNIPLTSWNANFERR 149 (382) Q Consensus 73 ~~~~~~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e~~G~a~--~ygd-~~DiP~vd~~~~~~~~ 149 (382) .+.++|++++|++||+++||.++++++++++||++++++||+++++|+++|.+|+|+ ++++ .+|+|++|++++++++ T Consensus 1 ~~~lafl~~qL~~id~~vye~~~~~~~~~~lipv~t~~~~~~~~~~~~~~d~~G~a~~~~i~~~a~dip~vd~~~~~~~~ 80 (304) T protein:vir:52 1 MSLLAYVKNGLTAVSKDIAETKYPEIVFPQFVYVDQQTAVGITEKLHYGADEHGSLDDGLITVGTSTLDQVEVGFTPTRS 80 (304) T ss_pred CchHHHHHHHHHHHhhhhhccccccchhhhhccccCCCCcccceEEEeeeeccCcccccccCCcCCccceeecccceeEE Confidence 457889999999999999999999999999999999999999999999999999999 5566 5678999999999999 Q ss_pred eEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCcceecc---CCCC Q lcl|NC_017674. 150 TIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAFQTP---PSQG 226 (382) Q Consensus 150 ~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~~~~---a~~~ 226 (382) +|+++++||+|+++||++|+++|++|+++|+++||+++++++|+++|+|++. ..|++||||||||+...++ ++++ T Consensus 81 ~i~~~~~~~~y~~~El~~a~~~g~~l~~~ka~aa~~a~~~~~n~v~~~Gd~~--~~g~~GllN~p~v~~~~~~~~~a~~~ 158 (304) T protein:vir:52 81 YIVPWAKSVTWTKPELEQGKLLGLALNTAKIMALNKNAQQTLQKVAFLGHAK--DSRLTGLLNNKSVEVYAIKGAAQNTK 158 (304) T ss_pred EEEEEeeeeeecHHHHHHHHHhCCCcHHHHHHHHHHHHHhhhceEEEEeecc--ccceEEEEeCCCcceeeecCCccCCc Confidence 9999999999999999999999999999999999999999999999999853 3579999999999976544 3467 Q ss_pred ccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhcccc--CCCCccHHHHHHHhcC-----ccEEEEc Q lcl|NC_017674. 227 WSTADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSVT--TPYGISVSDWIEQTYP-----KMRIVSA 299 (382) Q Consensus 227 Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~t--~~~~~Tvl~~l~~n~p-----nl~i~~~ 299 (382) |++||++||++||+++++++|.+|++.+ .|++|+|||+++.+|+.+ +++|+|+|+||++||| +|+|+.+ T Consensus 159 w~~~T~~eI~~di~~~~~~i~~~s~~~~----~p~tl~Lpp~~~~~l~~~~~~~~~~Tvl~~l~~n~~~~~g~~l~I~~v 234 (304) T protein:vir:52 159 VQAMDFDKAVAFFKEIFLKGMEKTKRIE----APNTFAIDSLDLAHLALVQRANTDTTALEFLTKHLSAAAGRQVAIKAL 234 (304) T ss_pred cccCCHHHHHHHHHHHHHHHHhccCcee----cCceEEeCHHHHHHHhhccCCCCCchHHHHHHHhcccccCCcceEEEe Confidence 9999999999999999999999999987 467999999999999754 6789999999999988 6789999 Q ss_pred cc-cccccCCCCCceeEEEEcchhhhhhhccccccchhhhhhhhhhhhcccceecCC-ceEeccccceeeeEeeccchhe Q lcl|NC_017674. 300 PE-LSGVQMKAQEPEDALVLFVEDVNAAVDGSTDGGSVFSQLVQSKFITLGVEKRAK-SYVEDFSNGTAGALCKRPWAVV 377 (382) Q Consensus 300 pe-L~~a~g~g~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~~-~~~~~~~~~t~Gv~i~~P~aia 377 (382) |+ +.+++ .+|+++|+.|.++.. ...+++||+|++||+|++++ .|++||++|||||+||||++|+ T Consensus 235 ~~~~~~~g---~~g~~r~vvY~~d~~-----------~~~~~vP~p~~~l~~q~~~~~~~~vp~~~r~gGv~v~~P~a~~ 300 (304) T protein:vir:52 235 PSNYGTRV---TDGKTRAMVYVNSKE-----------HVIFDVPMSPTVLDAQPKGLLAFESGLRMAFGGVTFMEPDSAL 300 (304) T ss_pred cccccccC---CCCceEEEEEecChh-----------heEEecCccccccchhhcCCceEEecceeeeeeEEEEccceee Confidence 84 55443 345566666655432 34567899999999999986 7999999999999999999999 Q ss_pred eecC Q lcl|NC_017674. 378 RYLG 381 (382) Q Consensus 378 ~~~G 381 (382) |+|= T Consensus 301 y~D~ 304 (304) T protein:vir:52 301 YVDY 304 (304) T ss_pred eecC Confidence 9999 No 14 >protein:vir:103285 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277465;genbank:gi:71834107;genbank:GeneID:3562396 Probab=100.00 E-value=3.8e-85 Score=483.38 Aligned_cols=291 Identities=13% Similarity=0.122 Sum_probs=264.5 Q ss_pred hhhcccccCcccccchhHHHHHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEEeeecccceeeccccc-CCce Q lcl|NC_017674. 61 SAMDSNFTAPVTTPSIPTPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEPAGTAVEYGDHT-NIPL 139 (382) Q Consensus 61 ~amDa~~~~~~t~~~~~~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e~~G~a~~ygd~~-DiP~ 139 (382) |.||.++. ...|++++|++|||++||+++++++++++||+.++++||+++++|+++|.+|++++|||++ |+|+ T Consensus 1 ~~~~~a~~------~~~f~~~ql~~id~~v~e~~~~~l~~~~~i~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~ 74 (296) T protein:vir:10 1 MGVDKADA------AGIWTVKQLTASLNKAYETEYDQNSVVNLFPVSNEIPGYAKYFEYPVFDGVGIAQIVADYTDDLPL 74 (296) T ss_pred Ccccchhh------hHHHHHHHHHHHHHHHHhhhhcccccceecccccCCCCceeEEEeeeeeccCceeEeCCCccccce Confidence 67886543 3568999999999999999999999999999999999999999999999999999999975 5799 Q ss_pred eeeeeeeeEeeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCcce Q lcl|NC_017674. 140 TSWNANFERRTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAF 219 (382) Q Consensus 140 vd~~~~~~~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~ 219 (382) +|++++++++++++++.+|+|+++||++|+++|++|+++|+.+|++++++++|+++|+|++ ++|+|||||||+++.. T Consensus 75 v~~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~ka~aA~~~~~~~~n~~~f~G~~---~~g~~GLlN~p~v~~~ 151 (296) T protein:vir:10 75 VDALATERQGKVFRFGNAFLISIDEIKVGQATGQSLSTRKQSLAFEAHDKLLDKLVWSGST---AHGIPSVFDYPNINNV 151 (296) T ss_pred eeccceeEEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeecc---cccceeEeecCCCccc Confidence 9999999999999999999999999999999999999999999999999999999999985 4789999999999854 Q ss_pred eccCCCCccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhcccc-CCCCccHHHHHHHhcCccEEEE Q lcl|NC_017674. 220 QTPPSQGWSTADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSVT-TPYGISVSDWIEQTYPKMRIVS 298 (382) Q Consensus 220 ~~~a~~~Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~t-~~~~~Tvl~~l~~n~pnl~i~~ 298 (382) +++++|+++ .||++||++++++++.+|+|.|. |.+|+|||+++.+|+++ +++|+|+++||++||||++|++ T Consensus 152 --~~~~~W~~~--t~i~~Di~~~~~~l~~~s~g~~~----p~~l~L~p~~~~~L~~~~~~~~~t~l~~ik~~~~~l~i~~ 223 (296) T protein:vir:10 152 --VSGGSWSQP--TTAVSDITSLLDIIETSTNGQHR----ATHLLLPTTARRIMQNLVPGTSVSYGEFFRQNNSGVTVEF 223 (296) T ss_pred --cccCCccCH--HHHHHHHHHHHHHHHHhhCceec----ceeEEeCHHHHHHHhhccCCCCccHHHHHHHhcCCceEEE Confidence 346789655 49999999999999999999985 56899999999999875 7889999999999999999999 Q ss_pred ccccccccCCCCCceeEEEEcchhhhhhhccccccchhhhhhhhhhhhcccceecCCceEeccccceeeeEeeccchhee Q lcl|NC_017674. 299 APELSGVQMKAQEPEDALVLFVEDVNAAVDGSTDGGSVFSQLVQSKFITLGVEKRAKSYVEDFSNGTAGALCKRPWAVVR 378 (382) Q Consensus 299 ~peL~~a~g~g~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~~~~~~~~~~~t~Gv~i~~P~aia~ 378 (382) +|||++++++| ++++++|.++. ..+.+.+|+++++||+|+++++|++||++|||||+||||.||+| T Consensus 224 ~~~l~~a~~~g---~~~~v~~~~~~-----------~~~~~~v~~~~~~~~~e~~~l~~~~~~~~~~~Gv~i~~P~ai~~ 289 (296) T protein:vir:10 224 VQYLNDYNGTG---TSAAIAYEKDP-----------NNMAIEIPEATNALPAQPKDLHFKIPVTSKATGLIVYRPLTMAV 289 (296) T ss_pred eeeeccCCCCc---ceEEEEEEcCC-----------ceEEEEcCcceeeecccccCceEEEeeEeeEEEEEEECCceeEE Confidence 99999886543 45666664432 24567789999999999999999999999999999999999999 Q ss_pred ecCC Q lcl|NC_017674. 379 YLGI 382 (382) Q Consensus 379 ~~GI 382 (382) ++|| T Consensus 290 ~dGI 293 (296) T protein:vir:10 290 MKGI 293 (296) T ss_pred Eeee Confidence 9999 No 15 >protein:vir:7771 Length: 330 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817605;genbank:gi:29566035;genbank:GeneID:1259229 Probab=98.57 E-value=4.5e-09 Score=66.36 Aligned_cols=300 Identities=12% Similarity=-0.020 Sum_probs=161.4 Q ss_pred hhhhhcccccchhhhhhcccccCcccccchh-HHHHHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEEeeecc Q lcl|NC_017674. 47 QIKALAKAGAFRSGSAMDSNFTAPVTTPSIP-TPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEPA 125 (382) Q Consensus 47 ~~~~~~~~~~~~~~~amDa~~~~~~t~~~~~-~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e~~ 125 (382) |.... +.+.. ...|..+.+ +|-.+. .++++.+......+.+.++..... ....|++.+.. T Consensus 1 m~~~~-----------~~a~~-~~~t~~~g~~i~~~~~----~~ii~~~~~~s~l~~~~~~~~~~~---~~~~~p~~~~~ 61 (330) T protein:vir:77 1 MAGST-----------VPSTQ-VALTGDFSAFLTPEQS----QDYFAEIEKTSIVQRIARKVPMGP---TGISIPHWTGA 61 (330) T ss_pred Ccccc-----------cchhh-ccccCCCcceechhHH----HHHHHHHHhccchhhhcceeeccC---CceEEEEEcCC Confidence 11111 11111 112222233 333222 256666666666677666544332 44678888888 Q ss_pred cceeecccccCCceeeeeeeeeEeeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcc Q lcl|NC_017674. 126 GTAVEYGDHTNIPLTSWNANFERRTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGN 205 (382) Q Consensus 126 G~a~~ygd~~DiP~vd~~~~~~~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~ 205 (382) +.+.+.+....+|..+...++.....+.++..+.++.+=++. ...++.+.-.....+++.+.+|+-.++|+.. .+ T Consensus 62 ~~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~~~is~ell~d---s~~~~~~~i~~~l~~ai~~~~~~~~l~G~g~--~~ 136 (330) T protein:vir:77 62 VSASWTGEAERKPITKGSFGKQELEPVKITTIFAESAEVVRL---NPLNYLNTMRTKIAEAIALKFDAAAIHGIDK--PS 136 (330) T ss_pred cceeEecCCCccccccceeeEEEEeEEEEEEeehhhHHHHhc---chHHHHHHHHHHHHHHHHHHHHHHhhcccCC--CC Confidence 888889999999999999999999999999999999854433 3567899999999999999999999999854 35 Q ss_pred cceEEEeCCCCcceeccCCCCccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhccc-cCCCCccHH Q lcl|NC_017674. 206 RTYGFLNDPNLPAFQTPPSQGWSTADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSV-TTPYGISVS 284 (382) Q Consensus 206 g~~GllN~P~l~~~~~~a~~~Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~-t~~~~~Tvl 284 (382) ...|++|++............-..++...+++||.+++..+...-. .+..++|.++.+..|.. .+..|.-++ T Consensus 137 ~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~-------~~~~~vmn~~~~~~l~~lkd~~G~~l~ 209 (330) T protein:vir:77 137 AFKGYLAETTKVVSLADTNLTTASGPQGNAYLAVNNALSLLVNSGK-------KWTGTLLDNVTEPILNTAVDGNGRPLF 209 (330) T ss_pred ccccccccccccceeecccccccccccchhHHHHHHHHHhhhhcCC-------CccEEEEcHHHHHHHHHHhccCCceee Confidence 6689999875332222222222345566778999988887765421 23478999999888854 233333332 Q ss_pred HHHHHh-----cCccE-----EEEccccccccCCCCCceeEEEE--cchhh-----hhhhccccccchhhhhh--hhhhh Q lcl|NC_017674. 285 DWIEQT-----YPKMR-----IVSAPELSGVQMKAQEPEDALVL--FVEDV-----NAAVDGSTDGGSVFSQL--VQSKF 345 (382) Q Consensus 285 ~~l~~n-----~pnl~-----i~~~peL~~a~g~g~~~~~~~~~--~~~~v-----~~~~~~~~~~~~~~~~~--~p~~~ 345 (382) .--... ..+.+ ++....+.. +.. +.+..+++ +.+-+ ...++-+++....+... ..... T Consensus 210 ~~~~~~~~~~~~~~~~l~G~PV~~~~~~p~-~~~--~~~~~~~~gd~s~~~i~~~~~~~i~~~~e~~~~~~~~~~~~~~~ 286 (330) T protein:vir:77 210 VESTYTEQVGAIREGRILGRPTYVADNVVN-GTV--GNRVVGVMGDFSQVIWGQIGGLSFDVTDQATLDFGEEQGGVWVP 286 (330) T ss_pred cCccccccccccCCceecceeeEEeccccC-CCC--CCccEEEEEecceEEEEEecCcEEEEeecceeeecccccccccc Confidence 110000 11122 222222211 111 11111111 11100 00000000000000000 00000 Q ss_pred hcccceecCCceEeccccceeeeEeeccchheeecCC Q lcl|NC_017674. 346 ITLGVEKRAKSYVEDFSNGTAGALCKRPWAVVRYLGI 382 (382) Q Consensus 346 ~~l~~~~~~~~~~~~~~~~t~Gv~i~~P~aia~~~GI 382 (382) ..... ...-...+.+..|.++.+ ++|.||+.+.+. T Consensus 287 ~~~~~-f~~~~~~~r~~~r~d~~v-~~~~a~~~i~~~ 321 (330) T protein:vir:77 287 KLISL-WQHNMVAVRCEAEFAFMV-NDKDAFVKLTDQ 321 (330) T ss_pred cccch-hhcCcEEEEEEEEeccEE-ecccceEEEEec Confidence 00000 001124456666766655 679999999999 No 16 >protein:vir:104085 Length: 320 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655596;genbank:gi:109392467;genbank:GeneID:4156953 Probab=98.49 E-value=4.3e-09 Score=66.44 Aligned_cols=297 Identities=10% Similarity=-0.028 Sum_probs=153.1 Q ss_pred chhhhhhcccccCcc---cccchh-HHHHHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEEeeecccceeecc Q lcl|NC_017674. 57 FRSGSAMDSNFTAPV---TTPSIP-TPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEPAGTAVEYG 132 (382) Q Consensus 57 ~~~~~amDa~~~~~~---t~~~~~-~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e~~G~a~~yg 132 (382) ..+..+||++..... ++.+.+ +|..+. .++++.+......+.+.++...+. ...+|++.+..+.+.+.+ T Consensus 1 ~~~~~~~~~~~~~~~~t~~~~~~~~ip~~~~----~~ii~~~~~~s~l~~~~~~~~~~~---~~~~~p~~~~~~~a~~v~ 73 (320) T protein:vir:10 1 MAAGTAFQVDHAQIAQTGDTMFKGYLEPEQA----KDYFAEAEKTSIVQQFAQKVPMGT---TGQKIPHWIGDVSAQWIG 73 (320) T ss_pred CCCCccCCHHHHHhhccccccccccccHHHH----HHHHHHHHhccchhhhcceeeccC---CceEEEEEeCCcceEEec Confidence 112234554422221 111222 444433 466677666666777776655433 456788888888888999 Q ss_pred cccCCceeeeeeeeeEeeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEe Q lcl|NC_017674. 133 DHTNIPLTSWNANFERRTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLN 212 (382) Q Consensus 133 d~~DiP~vd~~~~~~~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN 212 (382) ...++|..+...++...+++.++..+.++.+=++. ...++.+.-.....+++.+.+|+-.++|+..+..+++-|.++ T Consensus 74 E~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~d---s~~~l~~~i~~~l~~a~a~~~d~a~l~G~g~~~~~~~~~~~~ 150 (320) T protein:vir:10 74 EGDMKPITKGNMTSQNIAPHKIATIFVASAETVRA---NPANYLGTMRTKVATAFAMAFDSAALNGTDSPFPTYLAQTTK 150 (320) T ss_pred CCccccccccceeEEEEeeEEEEEeehhhHHHHhc---ChHHHHHHHHHHHHHHHHHHHHHHhhcccCCCCCcccccccc Confidence 99999999999999999999999999999866543 336788888889999999999999999986544444344433 Q ss_pred CCCCcceeccCCCCccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhccc-cCCCCccHHHH-HH-- Q lcl|NC_017674. 213 DPNLPAFQTPPSQGWSTADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSV-TTPYGISVSDW-IE-- 288 (382) Q Consensus 213 ~P~l~~~~~~a~~~Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~-t~~~~~Tvl~~-l~-- 288 (382) .-++. ...+..+++-+. .-+++.+++..+... . ..+..+++.|+.+..|.+ .+..|..++.- +. T Consensus 151 ~~~~~---~~~~~~~~~~~~--~~~~~~~~~~~~~~~---~----~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~ 218 (320) T protein:vir:10 151 SVSLA---DPGGATASDLTA--YDAVAVNGLSLLVNA---K----KKWTHTLLDDIVEPILNGAKDKNGRPLFIESTYTD 218 (320) T ss_pred cccce---eccccccccccc--HHHHHHHHHhhhhcc---c----CCCcEEEEcHHHHHHHHHhhccCCceeeccccccC Confidence 32221 111122222111 112233333332221 1 134589999999999964 23334333221 11 Q ss_pred --HhcCccEEEEccccccccCCCCCceeEEE--------EcchhhhhhhccccccchhhhhhhhhhhhcccceecCCceE Q lcl|NC_017674. 289 --QTYPKMRIVSAPELSGVQMKAQEPEDALV--------LFVEDVNAAVDGSTDGGSVFSQLVQSKFITLGVEKRAKSYV 358 (382) Q Consensus 289 --~n~pnl~i~~~peL~~a~g~g~~~~~~~~--------~~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~~~~~ 358 (382) .+++..++...|=+....... +...+++ ....++.. +-++... +.-.-+.+-..... ...-... T Consensus 219 ~~~~~~~~~i~g~pv~~~~~~~~-~~~~~~~gd~~~~~~~~~~~~~i--~~~~~~~--~~~~~~~~~~~~~~-f~~~~~~ 292 (320) T protein:vir:10 219 ENSPFRAGRIVSRPTILSDHVAD-GTTVGYMGDFRNVIWGQVGGLSF--DVTDQAT--LNLGTPTEPNFVSL-WQHNLVA 292 (320) T ss_pred ccccccCceeeeeeeEecCCCCC-CceEEEEeecceEEEEEecCeEE--EEeecce--eeeccccccccchh-hhcCcEE Confidence 112334454444432211111 1111111 01111100 0000000 00000000000000 0001112 Q ss_pred eccccceeeeEeeccchheeecCC Q lcl|NC_017674. 359 EDFSNGTAGALCKRPWAVVRYLGI 382 (382) Q Consensus 359 ~~~~~~t~Gv~i~~P~aia~~~GI 382 (382) +.+..+. |+.+.+|.||+.+.|+ T Consensus 293 ~r~~~~~-d~~v~~~~a~~~l~~~ 315 (320) T protein:vir:10 293 VRVEAEY-AFHNNDKDAFVKLTNV 315 (320) T ss_pred EEEEEee-ccEEecccceEEEEec Confidence 3344444 6777999999999999 No 17 >protein:vir:1638 Length: 298 # NCBI annotation: Structural protein # Family: family:all:966 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695059;genbank:gi:23455750;genbank:GeneID:955469 Probab=98.35 E-value=1.5e-08 Score=63.44 Aligned_cols=278 Identities=9% Similarity=-0.032 Sum_probs=149.0 Q ss_pred cccccchhHHHHHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEEeeecccceeecccccCCceeeeeeeeeEe Q lcl|NC_017674. 70 PVTTPSIPTPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEPAGTAVEYGDHTNIPLTSWNANFERR 149 (382) Q Consensus 70 ~~t~~~~~~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e~~G~a~~ygd~~DiP~vd~~~~~~~~ 149 (382) ++++.+.-+|-.+.+ +|++.+......+.+.++..... ....+++....+.|.++|...++|..+...+...- T Consensus 1 ma~~gG~lvp~~~~~----~ii~~~~~~s~i~~l~~~~~~~~---~~~~ip~~~~~~~a~~v~E~~~~~~~~~~f~~v~l 73 (298) T protein:vir:16 1 MVLNKGTLFDPTLVT----DLISKVAGKSSIARLSAQKPIPF---NGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQTM 73 (298) T ss_pred CcccCcceechhHHH----HHHHHHHhhhhhhhhcceeeccC---CceEEEEEecCcceEEecCCccccccccceeEEEE Confidence 222222224444443 55555555555566665443322 34567888888889999999999999999999999 Q ss_pred eEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccC--CcccceEEEeCCCCcceeccCCCCc Q lcl|NC_017674. 150 TIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSG--LGNRTYGFLNDPNLPAFQTPPSQGW 227 (382) Q Consensus 150 ~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g--~~~g~~GllN~P~l~~~~~~a~~~W 227 (382) ..+.++....+|.+=++.+.-...++.+.-+...++++.+.+++-.++|..++ ...+..|+....+.....+.. T Consensus 74 ~~~k~a~~~~iS~ell~~s~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~---- 149 (298) T protein:vir:16 74 VPIKVEYGARISDEFMYASDEEKINILQEFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEA---- 149 (298) T ss_pred eeeeEEEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCCccccccccccccccccccccc---- Confidence 99999999999886555444556788888888999999999999999996432 122233333222211111111 Q ss_pred cccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhcccc-CCCCccHHHHHHHh-----cCccEEEEccc Q lcl|NC_017674. 228 STADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSVT-TPYGISVSDWIEQT-----YPKMRIVSAPE 301 (382) Q Consensus 228 a~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~t-~~~~~Tvl~~l~~n-----~pnl~i~~~pe 301 (382) .......++||.+++..+...-. .+..++|.++.+..|.+. +..|.-++.-.-.+ .-++.++.... T Consensus 150 -~~~~~~~~~~i~~~~~~~~~~~~-------~~~~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~l~G~PV~~~~~ 221 (298) T protein:vir:16 150 -PRGIADPNGAIENAVELLTGVDA-------DVTGIAINPSFRSALAKQKDLQDNALFPELKWGATPDTINGLPVDVNKT 221 (298) T ss_pred -ccccccHHHHHHHHHHHhhhcCC-------CccEEEEcHHHHHHHHHhhccCCCeeecCcccCCCCceecceeeEEecc Confidence 11223457788888887665321 234689999988888643 44454443211111 11122222221 Q ss_pred cccccCCCCCceeEEE--Ecchhhhhhhccccccchhhhhhhhhhhhcccc-eecC---Cce-----EeccccceeeeEe Q lcl|NC_017674. 302 LSGVQMKAQEPEDALV--LFVEDVNAAVDGSTDGGSVFSQLVQSKFITLGV-EKRA---KSY-----VEDFSNGTAGALC 370 (382) Q Consensus 302 L~~a~g~g~~~~~~~~--~~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~-~~~~---~~~-----~~~~~~~t~Gv~i 370 (382) +.... ...+..++ .|..-+.- + ..+.+... ..+. ...+ ..| ...++.| .|..+ T Consensus 222 v~~~~---~~~~~~~~~GDfs~~~~~---~-------~~~~~~~~--~~~~~~~~~~~~~~f~~~~v~~ra~~r-~d~~v 285 (298) T protein:vir:16 222 VSDMS---LTQRDRAIIGDFANGFKW---G-------YAKEVPLE--VIQYGDPDNSGLDLKGYNQVYIRAELF-LGWGI 285 (298) T ss_pred ccccc---CCCccEEEEeeccceEEE---E-------EecCceEE--EeeccCCcCcchhhhhcCcEEEEEEEE-EccEe Confidence 21111 11111111 11110000 0 00000000 0000 0000 001 1122223 46778 Q ss_pred eccchheeecCC Q lcl|NC_017674. 371 KRPWAVVRYLGI 382 (382) Q Consensus 371 ~~P~aia~~~GI 382 (382) ++|.||+++.|. T Consensus 286 ~~~~a~~~l~~a 297 (298) T protein:vir:16 286 LDATKFARVTEA 297 (298) T ss_pred ecccceEEEeec Confidence 999999999999 No 18 >protein:vir:105778 Length: 358 # NCBI annotation: gp9 # Family: family:all:10995 # MgeID: mge:1501 # MgeName: ES18 # Cross-refs: genbank:acc:YP_224147;genbank:gi:62362222;genbank:GeneID:3342531 Probab=98.28 E-value=1.4e-08 Score=63.68 Aligned_cols=325 Identities=13% Similarity=0.030 Sum_probs=180.3 Q ss_pred CCCcceeeeecCccccccccccc--cchHHHHHHhhcceeccccchhhhhhhhcccccchhhhhhcccccCccccc-c-- Q lcl|NC_017674. 1 MSQISKTHSRLAGRNAKPFDLKN--ITNDAVASLSRIGLVFDHAVVQDQIKALAKAGAFRSGSAMDSNFTAPVTTP-S-- 75 (382) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~amDa~~~~~~t~~-~-- 75 (382) |-= ||. .+.+ +.-..+++|.-.-..|+.... +.||-+++...+.+.+ | T Consensus 1 ~~f-~K~------------~~an~~~~~~qw~~L~~~Rna~n~~~~--------------a~maan~a~~~~~~~~~NAv 53 (358) T protein:vir:10 1 MYF-SKE------------TLATNSRLGGHWNELWANRNMWNAQHD--------------AMIAANRSNMTPEWLAVNAV 53 (358) T ss_pred Cee-chh------------hhhhHHHHHHHHHHHHHHHHHhhhhhh--------------hHHhhhHHHhhhhhheeccc Confidence 100 110 0001 111224444332223322110 1111122211111111 1 Q ss_pred hhHHHHHHhhhhhhheeccccc---cchhhhCccccCCCcceeeEEEEeeec-ccceee--ccccc-CCceeeeeeeeeE Q lcl|NC_017674. 76 IPTPIQFLQTWLPGFVKVMTAA---RKIDEIIGIDTVGSWEDQEIVQGIVEP-AGTAVE--YGDHT-NIPLTSWNANFER 148 (382) Q Consensus 76 ~~~~~~~l~~idp~v~~~~~~~---~~~~~l~~v~t~g~~~~~t~t~~v~e~-~G~a~~--ygd~~-DiP~vd~~~~~~~ 148 (382) .+|+..+--.||.+++++..++ .-..+|.|+.+..+-......|.+... .|++.. .|... ..--+..+.+-. T Consensus 54 ~~v~~D~wr~~D~~~~q~fr~e~~~~l~NDLm~ls~sv~Igktv~~y~~~gd~~~~v~~SmsGQ~~~~lD~~~y~~dGt- 132 (358) T protein:vir:10 54 GGFTRDFWAEIDRQVLQLRDQEVGMEIVNDLIGVQTVLPVGKTAKLYNVIGDIADDVSVSIDGQAPFSFDHTEYASDGD- 132 (358) T ss_pred ccCCHHHHHHHhhhhhhhcccchhHHHHhhhhhccccccHHHHHHHHhhhcCCCceEEEEecccCcccccceeeeccCC- Confidence 3467777888888888877665 235788899888876655555665444 776643 34321 222233333333 Q ss_pred eeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccC--CcccceEEEeCCCCcceecc---- Q lcl|NC_017674. 149 RTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSG--LGNRTYGFLNDPNLPAFQTP---- 222 (382) Q Consensus 149 ~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g--~~~g~~GllN~P~l~~~~~~---- 222 (382) +|=-+-.||+.+.+|+.-.+--|+++..+-+.+..+++.+++-+.+|.|+.+- ..+-+|||-||||.--..-+ T Consensus 133 -piPIfdsg~~f~WR~~~~~~~~g~d~~~daQ~~~~~kv~~~~vdy~lNG~~~I~v~g~t~~Glrn~~n~~qv~l~~~s~ 211 (358) T protein:vir:10 133 -PIPVFTAGYGVNWRHAAGLNSLGIDLVLDSQMAKMRKFNQKRVNYYLNGDPNIQVQSYPAQGIKNHRNTKKINLGSGSG 211 (358) T ss_pred -EeeeeccCccccccchhhcCccccchhHHHHHHHHHHHHHHHHhhhhccCCceeecCcccccccCCcceeEEEeccCCC Confidence 44445566677778888788889999999999999999999999999998321 13668999999996522222 Q ss_pred -CCCCccccCHHHHHHHH-HHHHHHHHHhcCCeeeeccccceEecCHHHHhhcccc-CCC---CccHHHHHHHhcCcc-E Q lcl|NC_017674. 223 -PSQGWSTADWAGIIGDI-REAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSVT-TPY---GISVSDWIEQTYPKM-R 295 (382) Q Consensus 223 -a~~~Wa~kT~~eI~~Di-~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~t-~~~---~~Tvl~~l~~n~pnl-~ 295 (382) ..-+..++|+++++.-+ .+++.++-...+-. ...++..+|+.+..|.++ .+. .-|||+++++- +++ + T Consensus 212 g~NiDlttat~~a~~~~f~~~l~~~~~~~N~~~-----~~~~~~vs~ei~~n~~r~Y~~~~~~~gTIl~~vl~~-~~va~ 285 (358) T protein:vir:10 212 GANIDLTTADMTALFAFFGKGAFGTLARANKVA-----QYDVMWVSPEIWANLAQPYVVNGVVSGNVLNAVLPF-APVRE 285 (358) T ss_pred cceeeeccCCHHHHHHHHHHHHHHHHHhhcccc-----eeeEEEEcHHHHhhhhcccccccccchhhHHHhhcc-cCccc Confidence 12357889998888888 66777766555422 234899999999999874 322 34999999764 444 4 Q ss_pred EEEccccccccCCCCCceeEEEEcchhhhhhhccccccchhhhhhhhhhhhcccceecCCceEecccccee---eeEeec Q lcl|NC_017674. 296 IVSAPELSGVQMKAQEPEDALVLFVEDVNAAVDGSTDGGSVFSQLVQSKFITLGVEKRAKSYVEDFSNGTA---GALCKR 372 (382) Q Consensus 296 i~~~peL~~a~g~g~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~~~~~~~~~~~t~---Gv~i~~ 372 (382) |...+.|++ .-.+.|....++-..+.|+..+...- |.+ .+...|.-++| |++|+. T Consensus 286 I~~~~~Lsg-------Neii~~~~~~~vi~plvG~~~gt~~~----pR~-----------~p~ddY~f~vwsA~glqik~ 343 (358) T protein:vir:10 286 IRQTFALSG-------NEFIAYVRRQDIISPLVGMAVGVVPL----PRP-----------LPNVNYNFQIMSAEGLQITA 343 (358) T ss_pred ccccccCCC-------ccEEEEEeCCceeeeeecceeeeecC----CCC-----------CCCcchhhhhhhhhceeeee Confidence 666666642 23455555555444444433332211 111 11123444444 334443 Q ss_pred cc----hheeecCC Q lcl|NC_017674. 373 PW----AVVRYLGI 382 (382) Q Consensus 373 P~----aia~~~GI 382 (382) =. .+.+..-+ T Consensus 344 D~~Gks~Vv~~~~~ 357 (358) T protein:vir:10 344 DDQGLSGVVYGANL 357 (358) T ss_pred ccccceeeEeeccc Confidence 21 12222222 No 19 >protein:vir:94771 Length: 298 # NCBI annotation: major head protein # Family: family:all:966 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996706;genbank:gi:45597421;genbank:GeneID:2769044 Probab=98.28 E-value=2.7e-08 Score=62.06 Aligned_cols=280 Identities=10% Similarity=-0.007 Sum_probs=146.7 Q ss_pred hhhcccccCcccccchhHHHHHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEEeeecccceeecccccCCcee Q lcl|NC_017674. 61 SAMDSNFTAPVTTPSIPTPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEPAGTAVEYGDHTNIPLT 140 (382) Q Consensus 61 ~amDa~~~~~~t~~~~~~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e~~G~a~~ygd~~DiP~v 140 (382) ||.+ .+.-+|..+.+ +|++.+...-..+.+.++.+.+. ....+++....+.|.+.+.+.++|.. T Consensus 1 ma~~---------gG~lip~~~~~----~ii~~~~~~s~i~~~~~~~~~~~---~~~~~p~~~~~~~a~~v~Eg~~~~~~ 64 (298) T protein:vir:94 1 MVLN---------KGTLFDPELVT----DLISKVAGKSSIARLSAQKPIPF---NGEKVFTFTMDSEIDVVAESGKKTHG 64 (298) T ss_pred Ceec---------cccccChhHHH----HHHHHHHhhchhhhhcceeeccC---CceEEEEEecCcceEEeeCCcccccc Confidence 2222 22224444433 55666655555666666554433 34567888777888899999999999 Q ss_pred eeeeeeeEeeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcc--cceEEEeCCCCcc Q lcl|NC_017674. 141 SWNANFERRTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGN--RTYGFLNDPNLPA 218 (382) Q Consensus 141 d~~~~~~~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~--g~~GllN~P~l~~ 218 (382) +...+...-..+.++....+|.+=++...-...+|.+.-+...++++.+.+++..++|..++... ...|..+..+... T Consensus 65 ~~~f~~v~l~~~k~~~~~~iS~ell~~~~~~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~ 144 (298) T protein:vir:94 65 GVTLAPQTMVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVT 144 (298) T ss_pred ccceeEEEEeeeEEEEeeehhHHHhccCCccHHHHHHHHHHHHHHHHHHHHHHHhhcccccCCCcccccccccccccccc Confidence 99999999999999998888875443333445678888888999999999999999996432211 1122111111111 Q ss_pred eeccCCCCccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhccc-cCCCCccHHHHHHHh-cC---- Q lcl|NC_017674. 219 FQTPPSQGWSTADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSV-TTPYGISVSDWIEQT-YP---- 292 (382) Q Consensus 219 ~~~~a~~~Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~-t~~~~~Tvl~~l~~n-~p---- 292 (382) ..+ -.......+++||.+++..+...-. .+..++|.|+.+..|.+ .+..|.-++.=...+ -| T Consensus 145 ~~~-----~~~~~~~~~~~~i~~~~~~~~~~~~-------~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~tl~ 212 (298) T protein:vir:94 145 QKV-----EAPRGIADPNGAIENAVELLTGVDA-------DVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTIN 212 (298) T ss_pred ccc-----ccccccccHHHHHHHHHHhhhhcCC-------CccEEEEcHHHHHHHHHhhccCCCeeecCcccCCCCceec Confidence 000 0122344567899999887665421 23468999999988854 233343332111111 11 Q ss_pred ccEEEEccccccccCCCCCceeEEEEcchhhhhhhccccccchhhhhhhhhhhhcccc-ee--------cCCceEecccc Q lcl|NC_017674. 293 KMRIVSAPELSGVQMKAQEPEDALVLFVEDVNAAVDGSTDGGSVFSQLVQSKFITLGV-EK--------RAKSYVEDFSN 363 (382) Q Consensus 293 nl~i~~~peL~~a~g~g~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~-~~--------~~~~~~~~~~~ 363 (382) ++.++....+... .+++....++-.|..-+. .+ ..+.+... ..+. .. ..-..-..++. T Consensus 213 G~PV~~~~~v~~~-~~~~~~~~~~Gdfs~~~~---~~-------~~~~~~~~--~~~~~~~d~~~~~~f~~~~v~~r~~~ 279 (298) T protein:vir:94 213 GLPVDVNKTVSDM-SLTQRDRAIIGDFANGFK---WG-------YAKEVPLE--VIQYGDPDNSGLDLKGYNQVYIRAEL 279 (298) T ss_pred ceeeEEecccccc-cCCCccEEEEeeccceEE---EE-------EecCceEE--EeecCCCcCcchhhhhcCcEEEEEEE Confidence 1122222222111 111111101001111000 00 00000000 0000 00 00011123344 Q ss_pred ceeeeEeeccchheeecCC Q lcl|NC_017674. 364 GTAGALCKRPWAVVRYLGI 382 (382) Q Consensus 364 ~t~Gv~i~~P~aia~~~GI 382 (382) |. |+.+++|.||+++.|. T Consensus 280 r~-~~~~~~~~a~~~l~~~ 297 (298) T protein:vir:94 280 FL-GWGILDATKFARVTEA 297 (298) T ss_pred Ee-ccEeecccceEEEEec Confidence 44 6677889999999999 No 20 >protein:vir:8187 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817980;genbank:gi:29566414;genbank:GeneID:2700968 Probab=98.25 E-value=4.7e-08 Score=60.75 Aligned_cols=281 Identities=10% Similarity=-0.074 Sum_probs=146.9 Q ss_pred cccccc--hhHHHHHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEEeeecccceeecccccCCceeeeeeeee Q lcl|NC_017674. 70 PVTTPS--IPTPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEPAGTAVEYGDHTNIPLTSWNANFE 147 (382) Q Consensus 70 ~~t~~~--~~~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e~~G~a~~ygd~~DiP~vd~~~~~~ 147 (382) +.|... .-+|..+.+ +|++.+...-..+.+.++.+... ...++++.+..+.|.+.+.+..+|..+...++. T Consensus 1 mat~~~gg~lvP~~~~~----~ii~~~~~~s~i~~~~~~i~~~~---~~~~~p~~~~~~~a~wv~Eg~~~~~~~~~f~~v 73 (311) T protein:vir:81 1 MVALATGTFQLPKHLVP----GVWQKAQGQSVLARLSMAEPQEF---GEQQYMTLTAPPRGEVVGEGAQKSESTATFAPV 73 (311) T ss_pred CceecCCceEcchhHHH----HHHHHHHhcchhhhhcceeecCC---CceEEEEEeCCceeEEeecCcccccccceeeEE Confidence 222222 224555544 55555555555555555443322 346788888888888999999999999998888 Q ss_pred EeeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCcceeccCCCCc Q lcl|NC_017674. 148 RRTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAFQTPPSQGW 227 (382) Q Consensus 148 ~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~~~~a~~~W 227 (382) .-+.+.++..+.+|.+=++...-...+|.+.-....++++.+.+++.+++|+.++.....-|+++... ..... ... T Consensus 74 ~l~~~kl~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~a~l~G~~~~~~~~~~gi~~~~~--~~~~~--~~~ 149 (311) T protein:vir:81 74 TAIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKIL--DTTNI--VEL 149 (311) T ss_pred EEeeEEEEEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHhhhccccCCCCccccccccccc--cccee--eee Confidence 88899998888888753333334556788888899999999999999999986554444556665421 11111 111 Q ss_pred cccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhccc-cCCCCccHHHHHHH-hcCc----cEEE---E Q lcl|NC_017674. 228 STADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSV-TTPYGISVSDWIEQ-TYPK----MRIV---S 298 (382) Q Consensus 228 a~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~-t~~~~~Tvl~~l~~-n~pn----l~i~---~ 298 (382) ...+...+..+|.+++..+...- . .+..++|.|..+..|.+ .+..|.-++.-... ..|+ ..++ . T Consensus 150 ~~~~~~~~~~~i~~~~~~~~~~~---~----~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~tl~G~Pv~~~~~ 222 (311) T protein:vir:81 150 TTGTSATPDLAVEAAVGLVLGDN---L----SPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDT 222 (311) T ss_pred cccccchHHHHHHHHHHHhhhcC---C----CceEEEEcHHHHHHHHhhhccCCCeeecCccccCCCceecceeEEeccc Confidence 22233344566777776654332 1 23458999998888854 23334333321110 0110 1111 1 Q ss_pred ccccc--------cccCCCCCceeEEEEc-------chhhhhhhccccccchhhhhhhhhhhhcccc-eecCCceEeccc Q lcl|NC_017674. 299 APELS--------GVQMKAQEPEDALVLF-------VEDVNAAVDGSTDGGSVFSQLVQSKFITLGV-EKRAKSYVEDFS 362 (382) Q Consensus 299 ~peL~--------~a~g~g~~~~~~~~~~-------~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~-~~~~~~~~~~~~ 362 (382) +|.-. .....+.+...++-.| ..++.. +. .+....--.. -...-...+.+. T Consensus 223 i~~~~~~~~~~~~~~~~~~~~~~~~~gDfs~~~i~~~~~~~~--~~-----------~~~~~~~~~~~~~~~~~v~~r~~ 289 (311) T protein:vir:81 223 VRGGPEAVTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPL--EL-----------IEFGDPDGLGDLKRQNQIAIRAE 289 (311) T ss_pred ccccccccccccchhcccCCccEEEEEecccEEEEEeccceE--EE-----------eccCCCCcchhhhhcCcEEEEEE Confidence 11100 0000001111111111 111100 00 0000000000 000111233444 Q ss_pred cceeeeEeeccchheeecCC Q lcl|NC_017674. 363 NGTAGALCKRPWAVVRYLGI 382 (382) Q Consensus 363 ~~t~Gv~i~~P~aia~~~GI 382 (382) .|+ |..+.+|.||+++.|. T Consensus 290 ~r~-d~~v~~~~a~~~l~~a 308 (311) T protein:vir:81 290 VVY-GIGIMSTDAFAVVRDA 308 (311) T ss_pred EEe-ccEeecccceEEEEee Confidence 555 5566779999999999 No 21 >protein:vir:9574 Length: 300 # NCBI annotation: gp40 # Family: family:all:966 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862879;genbank:gi:32469471;genbank:GeneID:1461316 Probab=98.22 E-value=8.4e-08 Score=59.37 Aligned_cols=281 Identities=9% Similarity=-0.032 Sum_probs=149.8 Q ss_pred hhhcccccCcccccchhHHHHHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEEeeecccceeecccccCCcee Q lcl|NC_017674. 61 SAMDSNFTAPVTTPSIPTPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEPAGTAVEYGDHTNIPLT 140 (382) Q Consensus 61 ~amDa~~~~~~t~~~~~~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e~~G~a~~ygd~~DiP~v 140 (382) ||-... +.+.-+|..+.. ++++.+...-..+.+.++..... ....|++.+..+.|.+.|...++|.. T Consensus 1 ma~~t~------~~G~lip~~~~~----~ii~~l~~~s~i~~l~~~~~~~~---~~~~~p~~~~~~~a~wv~Eg~~~~~s 67 (300) T protein:vir:95 1 MSEAQL------SKGNLFNPELVT----KVINKVKGHSSIAKLSPQKPIPF---NGQREFVFDFDSDIDIVAENGKKTHG 67 (300) T ss_pred Cccccc------CCcceechhhHH----HHHHHHHhhhhhhhhcceeeccC---CceEEEEEecCcceEEeeCCcccccc Confidence 221111 112224544444 55665555544555555443222 34567888877888899999999999 Q ss_pred eeeeeeeEeeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeecc--CCcccceEEEeCCCCcc Q lcl|NC_017674. 141 SWNANFERRTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQS--GLGNRTYGFLNDPNLPA 218 (382) Q Consensus 141 d~~~~~~~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~--g~~~g~~GllN~P~l~~ 218 (382) +...++..-+.+.++....+|.+=+++......++.+.-.....+++...+++-+++|+.. |......|..+.++... T Consensus 68 ~~~f~~v~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~l~~aia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~ 147 (300) T protein:vir:95 68 GVSLDPVTIVPLKVEYGARVSDEFLHASEEAKVDMLTDFVEGFSKKLARGLDIMSIHGINPRTKQASTIIGDNCFDKKVT 147 (300) T ss_pred cccceeeEeeeEEEEEeehhhHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCCCCCcccccccccccccc Confidence 9999999999999999999987533322345678888888899999999999999999632 22234456555555432 Q ss_pred eeccCCCCccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhccc-cCCCCccHHHHHHHh--c---C Q lcl|NC_017674. 219 FQTPPSQGWSTADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSV-TTPYGISVSDWIEQT--Y---P 292 (382) Q Consensus 219 ~~~~a~~~Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~-t~~~~~Tvl~~l~~n--~---p 292 (382) .++.. +....++||.+++..+...- . .+..++|.|..+..|.. .+..|..++.-.... . - T Consensus 148 ~~~~~-------~~~~~~~~i~~~~~~~~~~~---~----~~~~~vmn~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~l~ 213 (300) T protein:vir:95 148 QTVPF-------KDTNPDESMEDAVGMIDGSE---R----DITGAILDPIFTTALSKMKNAEGGKLYPELAWGGVPDAIN 213 (300) T ss_pred eeecc-------cccchHHHHHHHHHHhhhcC---C----CccEEEECHHHHHHHHHhhccCCCeeccCccccCCCceec Confidence 22111 11122567777777664432 1 24468999999988864 344455443211111 1 1 Q ss_pred ccEEEEccccccccCCCCCceeEEE--Ecchhhhhhhccccccchhhhhhhhhhhhccc-ce------ecCCceEecccc Q lcl|NC_017674. 293 KMRIVSAPELSGVQMKAQEPEDALV--LFVEDVNAAVDGSTDGGSVFSQLVQSKFITLG-VE------KRAKSYVEDFSN 363 (382) Q Consensus 293 nl~i~~~peL~~a~g~g~~~~~~~~--~~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~-~~------~~~~~~~~~~~~ 363 (382) ++.++...... .+.+.....++ .|..-+. -+ ..+.+..++.... .. ...-..-+.++. T Consensus 214 G~Pv~~s~~v~---~~~~~~~~~~~~GDf~~~~~---~~-------~~~~~~~~v~~~~~~d~~~~~~f~~~~v~~r~~~ 280 (300) T protein:vir:95 214 GLAVDKNRTVS---YSQTDPKNTAIVGDFETMFK---WG-------YAKEVPMEIIKYGDPDNSGRDLKGYNQIYIRCEA 280 (300) T ss_pred ceeeEEecCCC---CCCCCCccEEEEeeccceEE---EE-------EecccEEEEeeccCCCCcchhhhhcCcEEEEEEE Confidence 12222211111 11111111111 1111000 00 0000010000000 00 000112234444 Q ss_pred ceeeeEeeccchheeecCC Q lcl|NC_017674. 364 GTAGALCKRPWAVVRYLGI 382 (382) Q Consensus 364 ~t~Gv~i~~P~aia~~~GI 382 (382) |+ |+.|++|.||+++.|+ T Consensus 281 r~-d~~v~~~~a~~~l~~~ 298 (300) T protein:vir:95 281 YI-GWGIMDAASFARIVKT 298 (300) T ss_pred ee-cceeecccceEEEecC Confidence 55 6667789999999999 No 22 >protein:vir:80376 Length: 435 # NCBI annotation: gp6, major capsid head protein # Family: family:all:21 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111085;genbank:gi:134288639;genbank:GeneID:4960624 Probab=98.08 E-value=3e-07 Score=56.32 Aligned_cols=350 Identities=10% Similarity=0.024 Sum_probs=155.5 Q ss_pred CCCcceeee-ecCccccccccccc----cchHHHHHHhhcceeccccchhhhhhhhcccccchhhhhhcccc-cCccccc Q lcl|NC_017674. 1 MSQISKTHS-RLAGRNAKPFDLKN----ITNDAVASLSRIGLVFDHAVVQDQIKALAKAGAFRSGSAMDSNF-TAPVTTP 74 (382) Q Consensus 1 ~~~~~~~~~-~~~~~~~~~~~~~~----~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~amDa~~-~~~~t~~ 74 (382) +.+...... .......++...+. .....+.++.+ .+....+........ +.......+... ....+.. T Consensus 65 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~ 138 (435) T protein:vir:80 65 AAVPVDPNPAAVTASAAAPVYAQPKAPEVKGAKMARMVR-ALAAARGDAQLASKL-----AIERGFGEEVAMSLNTLSPG 138 (435) T ss_pred hcccccchhhhhccccccccccccchhhhhHHHHHHHHH-HHHhccchhHHHHHH-----HHhhhhhhhhhhhhcccCCC Confidence 111000000 00000001111111 00011111100 000000000000000 000000001000 0011112 Q ss_pred chh--HHHHHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEEeeecccceeecccccCCceeeeeeeeeEeeEE Q lcl|NC_017674. 75 SIP--TPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEPAGTAVEYGDHTNIPLTSWNANFERRTIV 152 (382) Q Consensus 75 ~~~--~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e~~G~a~~ygd~~DiP~vd~~~~~~~~~v~ 152 (382) +.| +|..+.+ +|++.+......+.+-. +.-+.....+.|++.+..+.+.+.+....+|..+...+...-.++ T Consensus 139 ~gg~lvP~~~~~----~ii~~l~~~~~i~~~~~--~~v~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~f~~i~~~~~ 212 (435) T protein:vir:80 139 AGGVLVPENLSS----EVIELLRPKSVVRKLGA--RTLPLSNGNITIPRLKGGAIVGYIGADTDIPTTQQQFDDLKLTAK 212 (435) T ss_pred CCccccchhHHH----HHHHHHhhhchhhhccc--eeeecCCCceEEEEEeCCcceeeeccCccccccccceeeEEEeeE Confidence 222 4444333 45554433333333211 111112234667777777777788888889999999899999999 Q ss_pred EEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCcceeccCCCCccccCH Q lcl|NC_017674. 153 RGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAFQTPPSQGWSTADW 232 (382) Q Consensus 153 ~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~~~~a~~~Wa~kT~ 232 (382) .++..+.+|.+=|+ -...+-++.+.-......++...+++-+++|+.. .+...|++|+..+.....++ ...|. T Consensus 213 k~~~~~~is~ell~-ds~~~~~l~~~i~~~l~~a~~~~~d~a~l~G~G~--~~~p~Gi~~~~~~~~~~~~~----~~~~~ 285 (435) T protein:vir:80 213 KMAALVPIANDLIK-YAGVNPNVDQIVVGDLTAAIGAREDKAFIRDDGT--ANTPKGLRFWALPGNVITAS----DGSTL 285 (435) T ss_pred EEEEeehhhHHHHH-hhcccHHHHHHHHHHHHHHHHHHHHHHhhccCCC--CCcccceeecccccceeecc----cccch Confidence 99998888864433 3333456778888888889999999999999633 34567999987554322221 23577 Q ss_pred HHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhccc-cCCCCccHHHHHHHh-cCccEEEEccccccccCCCC Q lcl|NC_017674. 233 AGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSV-TTPYGISVSDWIEQT-YPKMRIVSAPELSGVQMKAQ 310 (382) Q Consensus 233 ~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~-t~~~~~Tvl~~l~~n-~pnl~i~~~peL~~a~g~g~ 310 (382) +.+..|+.+++..+.....+. .+..++|.+..+..|.. .+..|.-++.-+..+ +-++.++....+..-.+.+. T Consensus 286 ~~~~~d~~~~~~~~~~~~~~~-----~~~~~vmn~~~~~~L~~lkd~~G~~l~~~~~~~~l~G~pv~~~~~~p~~~~~~~ 360 (435) T protein:vir:80 286 QKIETDLGKAILALENADANL-----TQPGWIMAPRTFRFLEGLRDGNGNKVYPELANGMLKGYPVGKTTQVPINLGEAG 360 (435) T ss_pred hhHHHHHHHHHHHhhcccccc-----ccCEEEEcHHHHHHHHhhhccCCceeccCCCCCeEeeeeeEEeccccccccCCC Confidence 778889998888776654321 13468999999998864 344444443211111 11122222222211111111 Q ss_pred CceeEEEEcchhhhhhhccccccchhhhhhhhhh-hhc------ccceecCCceEeccccceeeeEeeccchheeecCC Q lcl|NC_017674. 311 EPEDALVLFVEDVNAAVDGSTDGGSVFSQLVQSK-FIT------LGVEKRAKSYVEDFSNGTAGALCKRPWAVVRYLGI 382 (382) Q Consensus 311 ~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~p~~-~~~------l~~~~~~~~~~~~~~~~t~Gv~i~~P~aia~~~GI 382 (382) +...+++-.-.++. -+ .-+...+ +..+.. +.. .-.+. + ...+.+..| .++.+++|.||+++.|| T Consensus 361 ~~~~i~~gd~s~~~---i~-~~~~~~i-~~~~~~~~~~~~~~~~~~f~~-n-~~~~r~~~r-~d~~~~~~~a~~~l~~~ 431 (435) T protein:vir:80 361 KESEIYFTDFGDVF---IG-EEETLEI-DYSKEATYKDADGHMVSAFQR-D-QTLIRVIAK-NDFGPRHVESIAVLSGV 431 (435) T ss_pred CcceEEEEEcccEE---EE-eecceEE-EEeccccccccccchhhhhhc-C-cceeeeeee-eCcEeecccceEEEecc Confidence 12222221111110 00 0000000 000000 000 00000 1 112233333 36678899999999999 No 23 >protein:vir:99920 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655524;genbank:gi:109392294;genbank:GeneID:4157089 Probab=98.08 E-value=1.5e-07 Score=58.04 Aligned_cols=287 Identities=13% Similarity=-0.042 Sum_probs=151.1 Q ss_pred hhhhcccccCcccccchhHHHHHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEEeeecccceeecccccCCce Q lcl|NC_017674. 60 GSAMDSNFTAPVTTPSIPTPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEPAGTAVEYGDHTNIPL 139 (382) Q Consensus 60 ~~amDa~~~~~~t~~~~~~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e~~G~a~~ygd~~DiP~ 139 (382) +..+++ ..+.-+|..+.+ +|++.+.+....+.+..+...+. ....|++....+.|.+.|....+|. T Consensus 1 Mat~tt-------~~g~~vP~~~~~----~ii~~~~~~s~l~~~~~~i~~~~---~~~~~p~~~~~~~a~wv~Eg~~~~~ 66 (311) T protein:vir:99 1 MATFGT-------GNLKNLPRNIAD----GMVKDVVQGSTVAVLSARKPQRF---GNEDIITFNGRPKAEFVGEGQQKSS 66 (311) T ss_pred CceecC-------CCceeccHHHHH----HHHHHHHhhchhhhhcceeeccC---CceEEEEEeCCceeEEeecCccccc Confidence 111221 122224554444 55555555544555554433322 3457888888888889999999999 Q ss_pred eeeeeeeeEeeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCcce Q lcl|NC_017674. 140 TSWNANFERRTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAF 219 (382) Q Consensus 140 vd~~~~~~~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~ 219 (382) .+....+..-..+.++..+.+|.+=++..--...++.+.-.....+++.+.+++-+++|+..+...+..|+.|-....+. T Consensus 67 ~~~~f~~v~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~g~~~g~~~~g~~~~~~~~~~ 146 (311) T protein:vir:99 67 TTGEFDFVTSTPKKAQVTMRFNEEVQWADEDYQLGVLQTLSEAGAEALARALDLGLYHRINPLTGTVIPGWSNYLGAASK 146 (311) T ss_pred ccceeeEEEEeeEEEEEeehhhHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccCccccccccccccccc Confidence 99999999999999999998887533333355678999999999999999999999999854333444454443332211 Q ss_pred eccCCCCccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhcccc-CCCCccHHHHHHHh-cC----c Q lcl|NC_017674. 220 QTPPSQGWSTADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSVT-TPYGISVSDWIEQT-YP----K 293 (382) Q Consensus 220 ~~~a~~~Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~t-~~~~~Tvl~~l~~n-~p----n 293 (382) .. + -...+......||.+++..+....... .++.++|.+..+..|.+. +..|.-+++-.... -| + T Consensus 147 ~~--~--~~~~~~~~~~~~i~~~~~~~~~~~~~~-----~~~~~vmn~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G 217 (311) T protein:vir:99 147 RV--E--LTADTIANPDLAIEAAVGLLVANGHPT-----PVNGLALHPSIAWGLSTARYTDGRKKFPELGLGIGVSSFEG 217 (311) T ss_pred ee--e--ccccccchhHHHHHHHHHHHhhhccCC-----CccEEEEcHHHHHHHHhhhccCCCeeecCcccCCCCceecc Confidence 11 1 112233444667777777665554321 234588999988888643 33344333211110 00 1 Q ss_pred cEEEE---ccccccccCCC----CCceeEEEEcchhhhhhhccccccchhhhhhhhhhh--hcccc---e-----ecCCc Q lcl|NC_017674. 294 MRIVS---APELSGVQMKA----QEPEDALVLFVEDVNAAVDGSTDGGSVFSQLVQSKF--ITLGV---E-----KRAKS 356 (382) Q Consensus 294 l~i~~---~peL~~a~g~g----~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~p~~~--~~l~~---~-----~~~~~ 356 (382) +.++. +|.-....... ......++ + -+.... ..+ .++... ..+.. . ...-- T Consensus 218 ~Pv~~s~~i~~~~~~~~~~~~~~~~~~~~~~-~-Gdf~~~--------~~~--~~~~~~~~~~~~~~~~~~~~~~~~~d~ 285 (311) T protein:vir:99 218 IDASVSDTVNGGDEADPDDEDLDAARAVRGI-V-GDFANG--------IHW--GVQRDIPVELIKYGDPDGQGDLKRHNQ 285 (311) T ss_pred eeeEeecccccccccccccchhhccCcceEE-E-eecccc--------EEE--EEecCceEEEeecCCCCcchhhhhcCc Confidence 11111 11100000000 00000000 0 000000 000 000000 00000 0 01112 Q ss_pred eEeccccceeeeEeeccchheeecCC Q lcl|NC_017674. 357 YVEDFSNGTAGALCKRPWAVVRYLGI 382 (382) Q Consensus 357 ~~~~~~~~t~Gv~i~~P~aia~~~GI 382 (382) .-..++.|+++. |++|.+++..++. T Consensus 286 ~~~r~~~r~d~~-v~~~~~v~~~~~~ 310 (311) T protein:vir:99 286 IALRLEIVYGWY-VFTDRFVVIENAV 310 (311) T ss_pred EEEEEEEeecce-ecChhHeeeeccc Confidence 345778899887 5679888888888 No 24 >protein:vir:5739 Length: 366 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892050;genbank:gi:33770513;interpro:IPR006444;uniprot:Q7Y410;genbank:GeneID:1732928 Probab=98.08 E-value=4.2e-07 Score=55.55 Aligned_cols=345 Identities=11% Similarity=0.015 Sum_probs=153.7 Q ss_pred CCC----cceeeeecCccccccccccccchHHHHH----Hhh-cceeccccchhhhhhhhcccccchhhhhhcccccCcc Q lcl|NC_017674. 1 MSQ----ISKTHSRLAGRNAKPFDLKNITNDAVAS----LSR-IGLVFDHAVVQDQIKALAKAGAFRSGSAMDSNFTAPV 71 (382) Q Consensus 1 ~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~----l~~-~g~~~~~~~~~~~~~~~~~~~~~~~~~amDa~~~~~~ 71 (382) |-- ..|.|+..++.-+++= .+.....++.+ |.+ .|- +..+ .++...... .....+++- . T Consensus 1 ~a~~~a~~~~~~~~~~~~~~~~~-~~~~kg~~~~~~~~a~a~~~g~-~~~a--~~~a~~~~~--~~~~~~a~~------~ 68 (366) T protein:vir:57 1 MAAAVAVPVKAHSVAPGIIIKEE-LQQYKGAGMTRMVMSIAAGKGN-LADA--AKFAATELG--DTGLSMAIS------T 68 (366) T ss_pred Ccccccccccccccccccccccc-cccccchhHHHHHHHHHhcccc-hhHH--HHHHHHhhc--chhhhhhcc------c Confidence 111 1133333222222211 00111111111 111 111 1100 000000000 000011110 1 Q ss_pred cccchh--HHHHHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEEeeecccceeecccccCCceeeeeeeeeEe Q lcl|NC_017674. 72 TTPSIP--TPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEPAGTAVEYGDHTNIPLTSWNANFERR 149 (382) Q Consensus 72 t~~~~~--~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e~~G~a~~ygd~~DiP~vd~~~~~~~~ 149 (382) ++.+.| +|..+- .+|++.+....-.+.+ +.. .-+-....+.+++.+..+.+.+.+...++|..+...++..- T Consensus 69 ~~~~Gg~lvP~~~~----~~ii~~l~~~s~l~~l-g~~-~v~~~~g~~~~p~~t~~~~a~wv~E~~~~~~s~~~f~~i~~ 142 (366) T protein:vir:57 69 AAGSGGALIPQNMQ----NEVIELLRDRTVVRIL-GAR-SIPLPNGNLSMPRLSGGATAGYVGEGKDVVATGATFDDVKL 142 (366) T ss_pred cccCCccccchhHH----HHHHHHHhhhcchhhh-cee-eeecCCCceEEEEEeCCcceeeeccCccccccccceeEEEE Confidence 122333 444432 3455554433222222 110 00111134667777777777788998999999999999999 Q ss_pred eEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCcceeccCCCCccc Q lcl|NC_017674. 150 TIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAFQTPPSQGWST 229 (382) Q Consensus 150 ~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~~~~a~~~Wa~ 229 (382) +.+.++....+|.+=|+ ....++.+.-.....+++.+.+|+-.++|+.. .+.-.|++|.+.........+ -.. T Consensus 143 ~~~k~~~~~~iS~ell~---ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~G~--~~~p~Gi~~~~~~~~~~~~~~--~t~ 215 (366) T protein:vir:57 143 SAKTMIALVPVSNQLIG---RAGFNVEQLLLGDILSAIATREDKAFLRDDGT--GDTPKGMKAVATAANRLVAWT--GTA 215 (366) T ss_pred eeEEEEEeehhhHHHHh---hhhHHHHHHHHHHHHHHHHHHHHHHhhccCCC--Cccccceeeccccccceeecc--ccc Confidence 99999999988864443 33457888888888889999999999999743 244579999876543222111 122 Q ss_pred cCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhccc-cCCCCccHHHHHHHh-cCccEEEEccccccccC Q lcl|NC_017674. 230 ADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSV-TTPYGISVSDWIEQT-YPKMRIVSAPELSGVQM 307 (382) Q Consensus 230 kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~-t~~~~~Tvl~~l~~n-~pnl~i~~~peL~~a~g 307 (382) .+...+..++..+.........+. .....+|.+..+..|.. .+..|..++.-+... .-++.++....+..-.+ T Consensus 216 ~~~~~~~~~~~~~~~~~~~~~~~~-----~~a~~vmn~~~~~~L~~lkd~~G~~l~~~~~~g~l~G~Pvv~s~~ip~~~~ 290 (366) T protein:vir:57 216 INLTTIDEYLDSLILKHMDSNSNM-----IRCGWGLSNRTYMTLFGLRDGNGNKVYPEMSQGILKGYPIQRTSAIPANLG 290 (366) T ss_pred cchhhHHHHHHHHHHhhhcccccc-----ccCEEEecHHHHHHHHhhhccCCceeccCCCCCeecceeeEEccccccccc Confidence 344445444444433222222111 12367899999888864 355555554211111 11222333333321111 Q ss_pred CCCCceeEEEEcchhhh------hhhccccccchhhhhhhhhhhhcccceecCCceEeccccceeeeEeeccchheeecC Q lcl|NC_017674. 308 KAQEPEDALVLFVEDVN------AAVDGSTDGGSVFSQLVQSKFITLGVEKRAKSYVEDFSNGTAGALCKRPWAVVRYLG 381 (382) Q Consensus 308 ~g~~~~~~~~~~~~~v~------~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~~~~~~~~~~~t~Gv~i~~P~aia~~~G 381 (382) .+.+...+++-.-.++. ..++.+++. +|.. +.-...... ..-...+.+..++ ++.+++|.||+++.| T Consensus 291 ~~~~~~~i~~gdfs~~~i~~~~~i~i~~~~ea--~~~~--~~g~~~~~f--~~~~~~iR~~~~~-d~~v~~~~a~~~lt~ 363 (366) T protein:vir:57 291 DDGNESEIYFCDFNDVVIGEDGMMKVDFSTEA--TYKD--ADGQLVSAF--ARNQSLIRVVTEH-DIGFRHPEGLVLGTG 363 (366) T ss_pred cCCCccEEEEEecceEEEEEecceEEEEeecc--cccc--ccccchhhh--hcCceeEEeeeee-CcEeeccccEEEEec Confidence 12222222221111110 000000000 0000 000000000 0111223444444 455699999999999 Q ss_pred C Q lcl|NC_017674. 382 I 382 (382) Q Consensus 382 I 382 (382) | T Consensus 364 ~ 364 (366) T protein:vir:57 364 V 364 (366) T ss_pred c Confidence 9 No 25 >protein:vir:1433 Length: 435 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536362;genbank:gi:17975167;genbank:GeneID:929171 Probab=98.04 E-value=8.6e-07 Score=53.85 Aligned_cols=352 Identities=9% Similarity=0.026 Sum_probs=155.2 Q ss_pred CCCcc-------------------eeee--ecCccccccccccccchH-HHHHHhhcceeccc--cchhhhhhhhcccc- Q lcl|NC_017674. 1 MSQIS-------------------KTHS--RLAGRNAKPFDLKNITND-AVASLSRIGLVFDH--AVVQDQIKALAKAG- 55 (382) Q Consensus 1 ~~~~~-------------------~~~~--~~~~~~~~~~~~~~~~~~-~~~~l~~~g~~~~~--~~~~~~~~~~~~~~- 55 (382) |+++. .... ....+...+...+...+. .-.++.++--.+-. +..........+.. T Consensus 45 i~~l~~~I~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 124 (435) T protein:vir:14 45 FSELTAQIERAEAAERMAAAAAVPVDPNPTAVAAPAAAPVHAQPKALEVKGAKMARMVRALAAARGDAQLASKLAIERGF 124 (435) T ss_pred HHHHHHHHHHHHHHHHHHHhhcccccchhhhhhhccccccccccchhhhhHHHHHHHHHHHHhhcchhhHHHHHHHhhhh Confidence 11110 0000 000000011111110000 00011110000000 00000000000000 Q ss_pred cchhhhhhcccccCcccccchh--HHHHHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEEeeecccceeeccc Q lcl|NC_017674. 56 AFRSGSAMDSNFTAPVTTPSIP--TPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEPAGTAVEYGD 133 (382) Q Consensus 56 ~~~~~~amDa~~~~~~t~~~~~--~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e~~G~a~~ygd 133 (382) ......++. ..+..+.| +|..+ ..+|++.+....-.+.+.. +.-+.....+.|++.+..+.+.+.+. T Consensus 125 ~~~~~~~~~-----~~t~~~gg~~vP~~~----~~~ii~~l~~~~~i~~~~~--~~~~~~~~~~~~p~~~~~~~a~~v~E 193 (435) T protein:vir:14 125 GEEVAMSLN-----TLSPGAGGVLVPENL----SSEVIELLRPKSVVRKLGA--RTLPLSNGNITIPRLKGGAIVGYIGA 193 (435) T ss_pred hhhhhhhcc-----cCCcCCCccccchhH----HHHHHHHHhhhchhhhhcc--eeeecCCCceEEEEEeCCcceeeecc Confidence 000001111 11222222 44433 2355555544333333311 11111223567788877777888888 Q ss_pred ccCCceeeeeeeeeEeeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeC Q lcl|NC_017674. 134 HTNIPLTSWNANFERRTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLND 213 (382) Q Consensus 134 ~~DiP~vd~~~~~~~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~ 213 (382) ...+|..+.......-.++.++..+.+|.+=+.- +..+.+|.+.-......++.+.+|+..++|+.. .+...|+++. T Consensus 194 ~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~d-s~~~~~l~~~i~~~l~~ai~~~~d~a~l~G~G~--~~~p~Gi~~~ 270 (435) T protein:vir:14 194 DTDIPTTQQQFDDLKLTAKKMAALVPIANDLIKY-AGVNPNVDQIVVGDLTAAIGAREDKAFIRDDGT--ANTPKGLRFW 270 (435) T ss_pred CccccccccceeEEEeeeEEEEEeehhhHHHHHh-hccCHHHHHHHHHHHHHHHHHHHHHHhhccCCC--Cccccceeec Confidence 8889999988888888899999988888643332 222345778888888889999999999999743 2346799987 Q ss_pred CCCcceeccCCCCccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhcccc-CCCCccHHHHHHHh-c Q lcl|NC_017674. 214 PNLPAFQTPPSQGWSTADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSVT-TPYGISVSDWIEQT-Y 291 (382) Q Consensus 214 P~l~~~~~~a~~~Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~t-~~~~~Tvl~~l~~n-~ 291 (382) ...+...+.+ . ..|.+.+.+|+.+++..+.....+. .+..++|.+..+..|... +..|.-++.-+... . T Consensus 271 ~~~~~~~~~~--~--~~~~~~~~~~~~~l~~~~~~~~~~~-----~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~g~l 341 (435) T protein:vir:14 271 ALPSNVITAS--D--ASTLQKIETDLGKVILALENADANL-----TQPGWIMAPRTFRFLEGLRDGNGNKVYPELANGML 341 (435) T ss_pred ccccceeccc--c--ccchhhHHHHHHHHHHHhhhccccc-----cCCEEEEcHHHHHHHHHhhccCCceeccCCCCCee Confidence 6544322221 2 2577778899999988877654332 123688999999888643 33344333111000 0 Q ss_pred CccEEEEccccccccCCCCCceeEEEEcchhhh------hhhccccccchhhhhhhhhhhhcccceecCCceEeccccce Q lcl|NC_017674. 292 PKMRIVSAPELSGVQMKAQEPEDALVLFVEDVN------AAVDGSTDGGSVFSQLVQSKFITLGVEKRAKSYVEDFSNGT 365 (382) Q Consensus 292 pnl~i~~~peL~~a~g~g~~~~~~~~~~~~~v~------~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~~~~~~~~~~~t 365 (382) -++.++..+.+-.-.+.+++...+++-.-..+. -.++-++. ..|...-... ..-++.. ...+.+..|. T Consensus 342 ~G~Pv~~~~~~p~~~~~~~~~~~i~~gd~s~~~i~~~~~~~~~~~~~--~~~~~~~~~~--~~~f~~~--~~~~r~~~r~ 415 (435) T protein:vir:14 342 KGYPVGKTTQVPINLGETGKESEIYFTDFGDVFIGEEETLEIDYSKE--ATYKDADGHM--VSAFQRD--QTLIRVIAKN 415 (435) T ss_pred ecceeEeeccccccccCCCccceEEEeecccEEEEEecccEEEEecc--ccccccccch--hhhhhcC--hhheeeeeee Confidence 011222222221101111111122221111110 00000000 0000000000 0000000 1123445555 Q ss_pred eeeEeeccchheeecCC Q lcl|NC_017674. 366 AGALCKRPWAVVRYLGI 382 (382) Q Consensus 366 ~Gv~i~~P~aia~~~GI 382 (382) ++ .+++|.||+++.|+ T Consensus 416 d~-~~~~~~a~~~l~~~ 431 (435) T protein:vir:14 416 DF-GPRHVESIAVLAGV 431 (435) T ss_pred Cc-eeecccceEEEecC Confidence 44 88999999999999 No 26 >protein:vir:96392 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239648;genbank:gi:66395381;genbank:GeneID:5132868 Probab=98.00 E-value=1.5e-06 Score=52.45 Aligned_cols=304 Identities=9% Similarity=-0.095 Sum_probs=148.3 Q ss_pred CCCcceeeeecCccccccccccccchHHHHHHhhcceeccccchhhhhhhhcccccchhhhhhcccccCcccccchhHHH Q lcl|NC_017674. 1 MSQISKTHSRLAGRNAKPFDLKNITNDAVASLSRIGLVFDHAVVQDQIKALAKAGAFRSGSAMDSNFTAPVTTPSIPTPI 80 (382) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~amDa~~~~~~t~~~~~~~~ 80 (382) |+| .+ ..++++++.... ..... ..+|.-......++.-+|. T Consensus 1 ~~~---------------~~---~~~~~~~~~~~~---------------~~~~~------~~~a~~~~~~~~~~~~iP~ 41 (324) T protein:vir:96 1 MEQ---------------TQ---KLKLNLQHFASN---------------NVKPQ------VFNPDNVMMHEKKDGTLMN 41 (324) T ss_pred CCc---------------ch---hhhHHHHHHHHH---------------hhhhh------hhccccccccCcCccccch Confidence 333 11 111122221110 00000 0111111111112233555 Q ss_pred HHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEEeeecccceeecccccCCceeeeeeeeeEeeEEEEEEEEEe Q lcl|NC_017674. 81 QFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEPAGTAVEYGDHTNIPLTSWNANFERRTIVRGELGMMV 160 (382) Q Consensus 81 ~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e~~G~a~~ygd~~DiP~vd~~~~~~~~~v~~~~~g~~y 160 (382) .+.+ +|++.+......+.++++.+... .+..|++.+..+.+.+.+.+..+|..+....+.....+.++..+.+ T Consensus 42 ~~~~----~ii~~~~~~s~l~~l~~~~~~~~---~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~i 114 (324) T protein:vir:96 42 EFTT----PILQEVMENSKIMQLGKYEPMEG---TEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPV 114 (324) T ss_pred hHHH----HHHHHHHhhchhhhhcceeeccC---CceEEEEEecCcceeEecCCccccccccceeEEEEeeEEEEEeehh Confidence 5444 55555555555666665544322 4577888888888889999999999999999999999999999999 Q ss_pred cHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCcceeccCCCCccccCHHHHHHHHH Q lcl|NC_017674. 161 GTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAFQTPPSQGWSTADWAGIIGDIR 240 (382) Q Consensus 161 ~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~~~~a~~~Wa~kT~~eI~~Di~ 240 (382) +.+=++.+ ..++.+.-.....+++.+.+++.+++|+..+. .-.|+++..+.....+. ...-++||. T Consensus 115 s~ell~ds---~~~l~~~i~~~la~ai~~~~d~a~l~G~g~~~--~~~gi~~~~~~~~~~~~---------~~~t~~~i~ 180 (324) T protein:vir:96 115 TKEFLNYT---YSQFFEEMKPMIAEAFYKKFDEAGILNQGNNP--FGKSIAQSIEKTNKVIK---------GDFTQDNII 180 (324) T ss_pred hHHHHhcc---hHHHHHHHHHHHHHHHHHHHHHHHhccCCCCC--cCccccccccccceecc---------ccccHHHHH Confidence 98555433 36788888888888889999999999975432 22466654432211111 111267777 Q ss_pred HHHHHHHHhcCCeeeeccccceEecCHHHHhhcccc-CCCCccHHHHHHHhcCccEEEEccccccccCCCCCceeEEEEc Q lcl|NC_017674. 241 EAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSVT-TPYGISVSDWIEQTYPKMRIVSAPELSGVQMKAQEPEDALVLF 319 (382) Q Consensus 241 ~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~t-~~~~~Tvl~~l~~n~pnl~i~~~peL~~a~g~g~~~~~~~~~~ 319 (382) +++..+...- . .+..++|.++.+..|... +..|..++. .-..-++...|=........+.+..++-.+ T Consensus 181 ~~~~~l~~~~---~----~~~~~vmn~~~~~~L~~l~d~~G~~~~~----~~~~~~l~G~PV~~~~~~~~~~~~~~~gd~ 249 (324) T protein:vir:96 181 DLEALLEDDE---L----EANAFISKTQNRSLLRKIVDPETKERIY----DRNSDSLDGLPVVNLKSSNLKRGELITGDF 249 (324) T ss_pred HHHHhhhhcc---C----CCCEEEEcHHHHHHHHHhhccCCCeeec----CCCCCcccceeeEeeCCCCCCcceEEEEec Confidence 7777664421 1 244789999999888643 333332221 100011111111111111111111111111 Q ss_pred -------chhhhhhhccccccchhhhhhhhhh-hhcccceecCCceEeccccceeeeEeeccchheeecCC Q lcl|NC_017674. 320 -------VEDVNAAVDGSTDGGSVFSQLVQSK-FITLGVEKRAKSYVEDFSNGTAGALCKRPWAVVRYLGI 382 (382) Q Consensus 320 -------~~~v~~~~~~~~~~~~~~~~~~p~~-~~~l~~~~~~~~~~~~~~~~t~Gv~i~~P~aia~~~GI 382 (382) ..++.- +.++... +...-... -...-.+ .-.....+..|. |+.+++|.||+++.|. T Consensus 250 ~~~~~g~~~~~~i--~~~~~~~--~~~~~~~~~~~~~~f~--~d~~~~r~~~r~-d~~v~~~~A~~~l~~a 313 (324) T protein:vir:96 250 DKLIYGIPQLIEY--KIDETAQ--LSTVKNEDGTPVNLFE--QDMVALRATMHV-ALHIADDKAFAKLVPA 313 (324) T ss_pred ceEEEEEecCcEE--EEeeccc--ccccccccccchhhhh--cCcEEEEEEEEE-ccEEecccceEEEecc Confidence 111100 0000000 00000000 0000000 011223344444 5666679999999998 No 27 >protein:vir:78830 Length: 324 # NCBI annotation: major head protein # Family: family:all:507 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285361;genbank:gi:148717889;genbank:GeneID:5246961 Probab=98.00 E-value=1.5e-06 Score=52.45 Aligned_cols=304 Identities=9% Similarity=-0.095 Sum_probs=148.3 Q ss_pred CCCcceeeeecCccccccccccccchHHHHHHhhcceeccccchhhhhhhhcccccchhhhhhcccccCcccccchhHHH Q lcl|NC_017674. 1 MSQISKTHSRLAGRNAKPFDLKNITNDAVASLSRIGLVFDHAVVQDQIKALAKAGAFRSGSAMDSNFTAPVTTPSIPTPI 80 (382) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~amDa~~~~~~t~~~~~~~~ 80 (382) |+| .+ ..++++++.... ..... ..+|.-......++.-+|. T Consensus 1 ~~~---------------~~---~~~~~~~~~~~~---------------~~~~~------~~~a~~~~~~~~~~~~iP~ 41 (324) T protein:vir:78 1 MEQ---------------TQ---KLKLNLQHFASN---------------NVKPQ------VFNPDNVMMHEKKDGTLMN 41 (324) T ss_pred CCc---------------ch---hhhHHHHHHHHH---------------hhhhh------hhccccccccCcCccccch Confidence 333 11 111122221110 00000 0111111111112233555 Q ss_pred HHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEEeeecccceeecccccCCceeeeeeeeeEeeEEEEEEEEEe Q lcl|NC_017674. 81 QFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEPAGTAVEYGDHTNIPLTSWNANFERRTIVRGELGMMV 160 (382) Q Consensus 81 ~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e~~G~a~~ygd~~DiP~vd~~~~~~~~~v~~~~~g~~y 160 (382) .+.+ +|++.+......+.++++.+... .+..|++.+..+.+.+.+.+..+|..+....+.....+.++..+.+ T Consensus 42 ~~~~----~ii~~~~~~s~l~~l~~~~~~~~---~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~i 114 (324) T protein:vir:78 42 EFTT----PILQEVMENSKIMQLGKYEPMEG---TEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPV 114 (324) T ss_pred hHHH----HHHHHHHhhchhhhhcceeeccC---CceEEEEEecCcceeEecCCccccccccceeEEEEeeEEEEEeehh Confidence 5444 55555555555666665544322 4577888888888889999999999999999999999999999999 Q ss_pred cHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCcceeccCCCCccccCHHHHHHHHH Q lcl|NC_017674. 161 GTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAFQTPPSQGWSTADWAGIIGDIR 240 (382) Q Consensus 161 ~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~~~~a~~~Wa~kT~~eI~~Di~ 240 (382) +.+=++.+ ..++.+.-.....+++.+.+++.+++|+..+. .-.|+++..+.....+. ...-++||. T Consensus 115 s~ell~ds---~~~l~~~i~~~la~ai~~~~d~a~l~G~g~~~--~~~gi~~~~~~~~~~~~---------~~~t~~~i~ 180 (324) T protein:vir:78 115 TKEFLNYT---YSQFFEEMKPMIAEAFYKKFDEAGILNQGNNP--FGKSIAQSIEKTNKVIK---------GDFTQDNII 180 (324) T ss_pred hHHHHhcc---hHHHHHHHHHHHHHHHHHHHHHHHhccCCCCC--cCccccccccccceecc---------ccccHHHHH Confidence 98555433 36788888888888889999999999975432 22466654432211111 111267777 Q ss_pred HHHHHHHHhcCCeeeeccccceEecCHHHHhhcccc-CCCCccHHHHHHHhcCccEEEEccccccccCCCCCceeEEEEc Q lcl|NC_017674. 241 EAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSVT-TPYGISVSDWIEQTYPKMRIVSAPELSGVQMKAQEPEDALVLF 319 (382) Q Consensus 241 ~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~t-~~~~~Tvl~~l~~n~pnl~i~~~peL~~a~g~g~~~~~~~~~~ 319 (382) +++..+...- . .+..++|.++.+..|... +..|..++. .-..-++...|=........+.+..++-.+ T Consensus 181 ~~~~~l~~~~---~----~~~~~vmn~~~~~~L~~l~d~~G~~~~~----~~~~~~l~G~PV~~~~~~~~~~~~~~~gd~ 249 (324) T protein:vir:78 181 DLEALLEDDE---L----EANAFISKTQNRSLLRKIVDPETKERIY----DRNSDSLDGLPVVNLKSSNLKRGELITGDF 249 (324) T ss_pred HHHHhhhhcc---C----CCCEEEEcHHHHHHHHHhhccCCCeeec----CCCCCcccceeeEeeCCCCCCcceEEEEec Confidence 7777664421 1 244789999999888643 333332221 100011111111111111111111111111 Q ss_pred -------chhhhhhhccccccchhhhhhhhhh-hhcccceecCCceEeccccceeeeEeeccchheeecCC Q lcl|NC_017674. 320 -------VEDVNAAVDGSTDGGSVFSQLVQSK-FITLGVEKRAKSYVEDFSNGTAGALCKRPWAVVRYLGI 382 (382) Q Consensus 320 -------~~~v~~~~~~~~~~~~~~~~~~p~~-~~~l~~~~~~~~~~~~~~~~t~Gv~i~~P~aia~~~GI 382 (382) ..++.- +.++... +...-... -...-.+ .-.....+..|. |+.+++|.||+++.|. T Consensus 250 ~~~~~g~~~~~~i--~~~~~~~--~~~~~~~~~~~~~~f~--~d~~~~r~~~r~-d~~v~~~~A~~~l~~a 313 (324) T protein:vir:78 250 DKLIYGIPQLIEY--KIDETAQ--LSTVKNEDGTPVNLFE--QDMVALRATMHV-ALHIADDKAFAKLVPA 313 (324) T ss_pred ceEEEEEecCcEE--EEeeccc--ccccccccccchhhhh--cCcEEEEEEEEE-ccEEecccceEEEecc Confidence 111100 0000000 00000000 0000000 011223344444 5666679999999998 No 28 >protein:vir:41 Length: 299 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463467;swissprot:trembl:q9t1b7;genbank:gi:16798789;uniprot:Q9T1B7;genbank:GeneID:922353 Probab=97.94 E-value=1e-06 Score=53.47 Aligned_cols=283 Identities=8% Similarity=0.009 Sum_probs=146.6 Q ss_pred hhhcccccCcccccchhHHHHHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEEeeecccceeecccccCCcee Q lcl|NC_017674. 61 SAMDSNFTAPVTTPSIPTPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEPAGTAVEYGDHTNIPLT 140 (382) Q Consensus 61 ~amDa~~~~~~t~~~~~~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e~~G~a~~ygd~~DiP~v 140 (382) |..+|+......++..-+|..+.+ +|++.+......+.+..+...+. .+..+.+.+. ..+.+.+.+.++|.. T Consensus 1 ~g~~a~~~~~~~~~~~~iP~~~~~----~ii~~~~~~s~l~~~~~~~~~~~---~~~~~~~~~~-~~a~~v~E~~~~~~~ 72 (299) T protein:vir:41 1 MGFNPDTTTMQSAKTGSIPINISE----QIITGVKNGSAAMKLAKAVPMTK---PEEEFTFMSG-VGAFWVDEAERIQTS 72 (299) T ss_pred CCcCCCcccccCCCceecchhHHH----HHHHHHHhcchhhhhceeeecCC---CcEEEEEEcC-CceeeeecCcccccc Confidence 333443211111112225555444 55565555555555555444332 2344555543 446778888899999 Q ss_pred eeeeeeeEeeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCccee Q lcl|NC_017674. 141 SWNANFERRTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAFQ 220 (382) Q Consensus 141 d~~~~~~~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~~ 220 (382) +...++.....+.++..+.++.+=++ ....++.+.-.....+++.+.+|+-.++|+..+ .-.|+++.......+ T Consensus 73 ~~~f~~v~l~~~k~~~~~~is~ell~---ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~g~~---~~~gil~~~~~~~~~ 146 (299) T protein:vir:41 73 KPTFTKAKMRSKKMGVIIPTTKENLN---YSVTNFFSLMQAEIVEAFYKKFDQAVFTGVESP---YNWNILKSATDASNL 146 (299) T ss_pred ccceeEEEEeeEEEEEeehhhHHHHh---cCHHHHHHHHHHHHHHHHHHHHHHHHhhcccCc---cccccccccccccee Confidence 99999999999999999999985543 333678888999999999999999999998543 225888765432222 Q ss_pred ccCCCCccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhccc-cCCCCccHHHHHHHh-cC---ccE Q lcl|NC_017674. 221 TPPSQGWSTADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSV-TTPYGISVSDWIEQT-YP---KMR 295 (382) Q Consensus 221 ~~a~~~Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~-t~~~~~Tvl~~l~~n-~p---nl~ 295 (382) +.. ... -++||.+++.++...-. .+..+++.+..+..|.+ .+..|.-++.=--.+ -+ ++. T Consensus 147 ~~~----~~~----~~~~l~~~~~~l~~~~~-------~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~l~G~P 211 (299) T protein:vir:41 147 VEE----TAN----KYDDLNEAIGLIEAEDL-------EPNGIATIRKQRVKYRSTKDGNGMPIFNTATSNGVDDVLGLP 211 (299) T ss_pred ecc----ccc----cHHHHHHHHHhhhcccC-------CcCEEEEcHHHHHHHHHhhccCCceeecCCcCCCCceeccee Confidence 111 112 26788888887654221 24478999999988864 333333332100000 00 112 Q ss_pred EEEccccccccCCCCCceeEEEEcchhhhhhhccccccchhhhhhhhhhhhcccce--------ecCCceEeccccceee Q lcl|NC_017674. 296 IVSAPELSGVQMKAQEPEDALVLFVEDVNAAVDGSTDGGSVFSQLVQSKFITLGVE--------KRAKSYVEDFSNGTAG 367 (382) Q Consensus 296 i~~~peL~~a~g~g~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~~--------~~~~~~~~~~~~~t~G 367 (382) ++..+.+. .+.+.....+-.+.+ + +-+-. +...+. .....+...... ...-...+.+..|. | T Consensus 212 V~~~~~~~---~~~~~~~~~~gdfs~-~---~i~~~-~~~~i~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~-d 281 (299) T protein:vir:41 212 IAYTPKYT---FGDKDISELVGDWNQ-A---YYGIL-RGVEYE-ILTEATLTTVADETGKPLNLAERDMAAIKATFEV-G 281 (299) T ss_pred eEEecccC---CCCCceEEEEEeccc-E---EEEEe-cCcEEE-EeecccccccccccccchhhhhcCcEEEEEEEEe-c Confidence 22222221 111111111111111 0 00000 000000 000000000000 00012233555565 5 Q ss_pred eEeeccchheeecCC Q lcl|NC_017674. 368 ALCKRPWAVVRYLGI 382 (382) Q Consensus 368 v~i~~P~aia~~~GI 382 (382) ..+++|.||+.+.+- T Consensus 282 ~~v~~~~A~~~l~~~ 296 (299) T protein:vir:41 282 FMVVKDEAFSAVQPK 296 (299) T ss_pred cEEecccceEEEEec Confidence 567779999999999 No 29 >protein:vir:2504 Length: 305 # NCBI annotation: major capsid subunit gp9 # Family: family:all:507 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569745;genbank:gi:18496895;genbank:GeneID:932268 Probab=97.94 E-value=1.1e-06 Score=53.21 Aligned_cols=278 Identities=8% Similarity=0.000 Sum_probs=143.5 Q ss_pred cCcccccchh--HHHHHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEEeeecccceeeccccc-----CCcee Q lcl|NC_017674. 68 TAPVTTPSIP--TPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEPAGTAVEYGDHT-----NIPLT 140 (382) Q Consensus 68 ~~~~t~~~~~--~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e~~G~a~~ygd~~-----DiP~v 140 (382) -+..++.+.| +|..+.+ +|++.+......+.+..+.+... .+..+++....+.|.+.|... ++|.. T Consensus 1 ma~~t~~~gg~liP~~~~~----~Ii~~~~~~s~l~~l~~~~~~~~---~~~~~p~~~~~~~a~wv~E~~~~~~~~~~~s 73 (305) T protein:vir:25 1 MADISRAEVASLIQEAYSD----TLLAAAKQGSTVLSAFQNVNMGT---KTTHLPVLATLPEADWVGESATDPKGVKPTS 73 (305) T ss_pred CCCccCCccceecCHHHHH----HHHHHHHhhchhhhhcceeeccC---CcEEEEEEeCCcceEEeeccccccccccccc Confidence 0112222223 5555544 66666666666666666555433 356677777777787776643 35777 Q ss_pred eeeeeeeEeeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCccee Q lcl|NC_017674. 141 SWNANFERRTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAFQ 220 (382) Q Consensus 141 d~~~~~~~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~~ 220 (382) +.......-..+.++..+.++.+=++ ....++.+.-.....+++.+.+++..++|+..+. |+.+...++... T Consensus 74 ~~~f~~i~~~~~k~~~~~~is~ell~---ds~~~~~~~i~~~l~~~~a~~~d~a~~~G~g~~~-----~~~~~~~~~~~~ 145 (305) T protein:vir:25 74 KVTWANRTLVAEEIAVIIPVHENVID---DATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPA-----SWVSPALIPAAV 145 (305) T ss_pred ccceeeEEeeeEEEEEeehhhHHHHh---cchHHHHHHHHHHHHHHHHHHHhhhheeccCCCC-----Cccccccccccc Confidence 88888888889999999999984442 3446789999999999999999999999985432 333322222211 Q ss_pred cc--CCCCc-cccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhccc-cCCCCccHHHHHHHhcCccEE Q lcl|NC_017674. 221 TP--PSQGW-STADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSV-TTPYGISVSDWIEQTYPKMRI 296 (382) Q Consensus 221 ~~--a~~~W-a~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~-t~~~~~Tvl~~l~~n~pnl~i 296 (382) .. ....+ ...+..++++++..+...+.... . .+..+++.+..+..|.+ .+..|.-++. -...-++.+ T Consensus 146 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~----~~~~~v~~~~~~~~l~~lkd~~G~~i~~--~~~l~G~Pv 216 (305) T protein:vir:25 146 TAGQAVEVVGGVANESDIVGATNRAAKAVASAG---W----APDTLLSSLALRYEVANIRDANGNPVFR--DDSFAGFRT 216 (305) T ss_pred cccccccccccchhhhHHHHHHHHHHHhhhhcc---c----ccceeEecHHHHHHHHHhhccCCceeec--CCcccccce Confidence 11 11112 22334556777776665543322 1 23358899998888854 3444544431 001111112 Q ss_pred EEccccccccCCCCCceeEEEEcchhhh------hhhccccccchhhhhhhhhhhhcccce-ecCCceEeccccceeeeE Q lcl|NC_017674. 297 VSAPELSGVQMKAQEPEDALVLFVEDVN------AAVDGSTDGGSVFSQLVQSKFITLGVE-KRAKSYVEDFSNGTAGAL 369 (382) Q Consensus 297 ~~~peL~~a~g~g~~~~~~~~~~~~~v~------~~~~~~~~~~~~~~~~~p~~~~~l~~~-~~~~~~~~~~~~~t~Gv~ 369 (382) +...... ...+++ .+++-.-.++. ..++-++ ...|.. .. . +.. ...-.....+..|.+ .. T Consensus 217 ~~~~~~~---~~~~~~-~~~~gd~s~~~i~~~~~~~i~~~~--~~~~~~--~~--~--~~~~~~~~~~~~R~~~r~~-~~ 283 (305) T protein:vir:25 217 FFNRNGA---WDADAA-IEVIADSSRVKIGVRQDITVKFLD--QATLGT--GE--N--QINLAERDMVALRLKARFA-YV 283 (305) T ss_pred EEcCccC---CCCCcc-EEEEEecceEEEEEecCeEEEEee--eeeeec--CC--c--eeeeeecCcEEEEEEEeec-ce Confidence 2111111 111111 11111100000 0000000 000000 00 0 000 011122344555664 55 Q ss_pred eeccchheeecCC Q lcl|NC_017674. 370 CKRPWAVVRYLGI 382 (382) Q Consensus 370 i~~P~aia~~~GI 382 (382) |.+|.+|+.++|+ T Consensus 284 v~~p~a~v~~~~~ 296 (305) T protein:vir:25 284 LGVSATAQGANKT 296 (305) T ss_pred eeCcccEEEEccc Confidence 8889999999999 No 30 >protein:vir:78523 Length: 338 # NCBI annotation: Putative head structural protein # Family: family:all:507 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491585;genbank:gi:157786408;genbank:GeneID:5625675 Probab=97.89 E-value=1.3e-06 Score=52.80 Aligned_cols=303 Identities=10% Similarity=-0.014 Sum_probs=153.4 Q ss_pred hHHHHHHhhcceeccccchhhhhhhhcccccchhhhhhcccccCcccc-cchhHHHHHHhhhhhhheeccccccchhhhC Q lcl|NC_017674. 26 NDAVASLSRIGLVFDHAVVQDQIKALAKAGAFRSGSAMDSNFTAPVTT-PSIPTPIQFLQTWLPGFVKVMTAARKIDEII 104 (382) Q Consensus 26 ~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~amDa~~~~~~t~-~~~~~~~~~l~~idp~v~~~~~~~~~~~~l~ 104 (382) +-.+++|+. +++-++..+..++ ++.-+|-.+.+ +|++.+......+.+. T Consensus 1 ~~~~~e~~~--------------------------~~~~~~~~~~~~~~~~~liP~~~~~----~ii~~~~~~s~l~~l~ 50 (338) T protein:vir:78 1 MATLNELAP--------------------------NTAGSNHQGRLAHVPSDLLPKEIVG----PIFDKAQESSLVLRLG 50 (338) T ss_pred CcchHHhhh--------------------------hhcccccccceecccccccchHHHH----HHHHHHHhhchhhhhc Confidence 111222221 2222232322222 22235655555 5666666666666666 Q ss_pred ccccCCCcceeeEEEEeeec--------ccceeecccccCCceeeeeeeeeEeeEEEEEEEEEecHHHHHHHHHhCCChH Q lcl|NC_017674. 105 GIDTVGSWEDQEIVQGIVEP--------AGTAVEYGDHTNIPLTSWNANFERRTIVRGELGMMVGTLEEGRASAIRLNSA 176 (382) Q Consensus 105 ~v~t~g~~~~~t~t~~v~e~--------~G~a~~ygd~~DiP~vd~~~~~~~~~v~~~~~g~~y~~~El~~A~~~g~~l~ 176 (382) ++..... ....+++... .+.+...+++..+|..+...+......+.++..+.++.+=++ ....++. T Consensus 51 ~~~~~~~---~~~~ip~~~~~~~a~~v~~~~~~~~~Eg~~~~~~~~~f~~v~l~~~k~~~~~~is~ell~---ds~~~~~ 124 (338) T protein:vir:78 51 ENIPISY---GETIIPTTVKRPEVGQVGVGTSNEQREGGTKPLSGTAWDTRSVAPIKLATIVTVSEEFAR---MNPSGLY 124 (338) T ss_pred ceeeccC---CceEEEEEecCccceeecccccccccccccccccccceeEEEEEEEEEEEeehhhHHHHh---cCHHHHH Confidence 6544333 3444555432 244456677788899999999999999999999988884333 2346788 Q ss_pred HHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCcceeccCCCCccccCHHHHHHHHHHHHHHHHHhcCCeeee Q lcl|NC_017674. 177 ETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAFQTPPSQGWSTADWAGIIGDIREAVRQLRIQSQDQIDP 256 (382) Q Consensus 177 ~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~~~~a~~~Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~ 256 (382) +.-.....+++.+.+|+-+++|+..+..++..|++++..+...++ ....+ +.....++++.+++..+...... T Consensus 125 ~~i~~~la~a~~~~~d~~~l~G~g~~~~~~~~gi~~~~~~~~~~~-~~~~~--~~~~~~~~~~~~~~~~~~~~~~~---- 197 (338) T protein:vir:78 125 TKLQADLAYAIGRGIDLAVFHGKSPLTGSALQGIDTNNVIVNTTN-VDYLQ--TGTTPLLDRFLDGYDLVSANTDV---- 197 (338) T ss_pred HHHHHHHHHHHHHHHHHHhhcccCCCccccccccccccccccccc-ccccc--ccchhhHHHHHHHHHHhhhhccc---- Confidence 888889999999999999999996655566778887766543222 11222 33445688888888776543321 Q ss_pred ccccceEecCHHHHhhccc----cCCCCccHHHHHHHh-cC----ccEEEE---ccccccccCCCCCceeEEE-Ecchhh Q lcl|NC_017674. 257 KAEKITLALATSKVDYLSV----TTPYGISVSDWIEQT-YP----KMRIVS---APELSGVQMKAQEPEDALV-LFVEDV 323 (382) Q Consensus 257 ~~~p~~L~Lp~~~~~~Ls~----t~~~~~Tvl~~l~~n-~p----nl~i~~---~peL~~a~g~g~~~~~~~~-~~~~~v 323 (382) .+..++|.|..+..|.. .+..|.-++.-.... -| ++.++. +|.-..+ +.++.. .+++ .|.. + T Consensus 198 --~~~~~~m~~~~~~~L~~~~~l~d~~g~~l~~~~~~~~~~~~l~G~PV~~~~~ip~~~~~-~~~~~~-~~~~gdfs~-~ 272 (338) T protein:vir:78 198 --DFNGWAADPRYRARLLRSQAYRDANGNVDPTRINLAASAGDLLGLPVQFGKAVGGDLGA-ATDSKV-RVVGGDFSQ-L 272 (338) T ss_pred --cceEEEEchHHHHHHHHHhhhccCCCceeecccccCCCCceeeeeeEEEccccCccccc-cCCccc-EEEEEecce-E Confidence 23468899888777642 233333332111111 11 122222 2321111 111111 1111 1111 0 Q ss_pred hhhhccccccchhhhhhhhhhhhcc---c-cee----cCCceEeccccceeeeEeeccchheeecCC Q lcl|NC_017674. 324 NAAVDGSTDGGSVFSQLVQSKFITL---G-VEK----RAKSYVEDFSNGTAGALCKRPWAVVRYLGI 382 (382) Q Consensus 324 ~~~~~~~~~~~~~~~~~~p~~~~~l---~-~~~----~~~~~~~~~~~~t~Gv~i~~P~aia~~~GI 382 (382) +.+ +.+...+ +..+.--..- + .+. ..-.....|+.|. |+.+.+|.||+++... T Consensus 273 ---~~~-~~~~~~i-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~-d~~v~~~~a~~~l~~~ 333 (338) T protein:vir:78 273 ---KYG-FADEIRV-KMSDTATLTDNTSPTPQTVSMWQTNQIAILIEVTF-GWLLGDKQAFVKFVDD 333 (338) T ss_pred ---EEE-eecccEE-EEeecccccccccccccchhhhhcCcEEEEEEEEe-ccEeecccceEEEecc Confidence 000 0000000 0000000000 0 000 0011223344444 5677899999999999 No 31 >protein:vir:103955 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873992;genbank:gi:118430767;genbank:GeneID:4525449 Probab=97.86 E-value=3.5e-06 Score=50.53 Aligned_cols=305 Identities=9% Similarity=-0.093 Sum_probs=147.0 Q ss_pred CCCcceeeeecCccccccccccccchHHHHHHhhcceeccccchhhhhhhhcccccchhhhhhcccccCcccccchhHHH Q lcl|NC_017674. 1 MSQISKTHSRLAGRNAKPFDLKNITNDAVASLSRIGLVFDHAVVQDQIKALAKAGAFRSGSAMDSNFTAPVTTPSIPTPI 80 (382) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~amDa~~~~~~t~~~~~~~~ 80 (382) |+|. +..+.+...|+.=...|-.| +|+-......++.-+|- T Consensus 1 ~~~~---------------~~~~~~~~~f~~~~~~~~~~------------------------~a~~~~~~~~~~~liP~ 41 (324) T protein:vir:10 1 MEQT---------------QKLKLNLQHFASNNVKPQVF------------------------NPDNVMMHEKKDGTLLN 41 (324) T ss_pred CCCc---------------hHHHHHHHHHHHHhhcccee------------------------cccceeccCCCcceech Confidence 3331 11111111111111111111 11111111111222454 Q ss_pred HHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEEeeecccceeecccccCCceeeeeeeeeEeeEEEEEEEEEe Q lcl|NC_017674. 81 QFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEPAGTAVEYGDHTNIPLTSWNANFERRTIVRGELGMMV 160 (382) Q Consensus 81 ~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e~~G~a~~ygd~~DiP~vd~~~~~~~~~v~~~~~g~~y 160 (382) .+.+ +|++.+......+.++++.+.+. .+..|++.+..+.+.+.+.+..+|..+....+..-..+.++..+.+ T Consensus 42 ~~~~----~ii~~~~~~s~l~~~~~~~~~~~---~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~i 114 (324) T protein:vir:10 42 DFTT----PILQEVMENSKIMQLGKYEPMEG---TEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPV 114 (324) T ss_pred hHHH----HHHHHHHhhchhhhhcceeeccC---CceEEEEEeCCcceeEeccCccccccccceeEEEEeeEEEEEeehh Confidence 4444 44444444444555555444332 4567888888888899999999999999999999999999999999 Q ss_pred cHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCcceeccCCCCccccCHHHHHHHHH Q lcl|NC_017674. 161 GTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAFQTPPSQGWSTADWAGIIGDIR 240 (382) Q Consensus 161 ~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~~~~a~~~Wa~kT~~eI~~Di~ 240 (382) +.+-++.+ ..++.+.-.....+++.+.+++..++|+..+. ...|+++..... ++. . +...-++||. T Consensus 115 S~ell~ds---~~~l~~~i~~~l~~ai~~~~d~a~l~G~g~~~--~~~~i~~~~~~~--~~~----~---~~~~t~~~i~ 180 (324) T protein:vir:10 115 TKEFLNYT---YSQFFEEMKPMIAEAFYKKFDEAGILNQGNNP--FGKSIAQSIEKT--NKV----I---KGDFTQDNII 180 (324) T ss_pred hHHHHhcc---hHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCc--cCcccccccccc--cee----c---cccCCHHHHH Confidence 98655433 35788888888888889999999999975432 224555533211 111 0 1112256777 Q ss_pred HHHHHHHHhcCCeeeeccccceEecCHHHHhhcccc-CCCCccHHHHHHHhcCc---cEEEEccccccccCCCCCceeEE Q lcl|NC_017674. 241 EAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSVT-TPYGISVSDWIEQTYPK---MRIVSAPELSGVQMKAQEPEDAL 316 (382) Q Consensus 241 ~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~t-~~~~~Tvl~~l~~n~pn---l~i~~~peL~~a~g~g~~~~~~~ 316 (382) +++..+...- . .+..+++.|+.+..|.+. +..|.-++ .-.+... +.++..+.. ..+.+. ++ T Consensus 181 ~~~~~l~~~~---~----~~~~~v~n~~~~~~L~~l~d~~g~~~~--~~~~~~~l~G~PV~~~~~~-----~~~~~~-~~ 245 (324) T protein:vir:10 181 DLEALLEDDE---L----EANAFISKTQNRSLLRKIVDPETKERI--YDRNSDTLDGLPVVNLKSS-----NLKRGE-LI 245 (324) T ss_pred HHHHhhhhcc---C----CCCEEEEcHHHHHHHHHhhccCCceee--cCCCCccccceeEEeecCC-----CCCcce-EE Confidence 7777664421 1 244789999999988643 33333221 1111111 112211111 111221 12 Q ss_pred EEcchhhhhhhccccccchhhhhhhhhhhhcccce----ecCCceEeccccceeeeEeeccchheeecCC Q lcl|NC_017674. 317 VLFVEDVNAAVDGSTDGGSVFSQLVQSKFITLGVE----KRAKSYVEDFSNGTAGALCKRPWAVVRYLGI 382 (382) Q Consensus 317 ~~~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~~----~~~~~~~~~~~~~t~Gv~i~~P~aia~~~GI 382 (382) +-.-.++. ..+.+........+..-......... ...-.....+..|.++ .+..|.||+.+.|. T Consensus 246 ~gd~~~~~-~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~d~-~v~~~~A~~~l~~a 313 (324) T protein:vir:10 246 TGDFDKLI-YGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVAL-HIADDKAFAKLVPA 313 (324) T ss_pred EEecccEE-EEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEcc-EEecccceEEEEec Confidence 11111110 00000000000000000000000000 0111233445556654 45569999999999 No 32 >protein:vir:97148 Length: 324 # NCBI annotation: ORF010 # Family: family:all:507 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239726;genbank:gi:66394880;genbank:GeneID:5130881 Probab=97.85 E-value=4.1e-06 Score=50.10 Aligned_cols=303 Identities=9% Similarity=-0.090 Sum_probs=147.8 Q ss_pred CCCcceeeeecCccccccccccccchHHHHHHhhcceeccccchhhhhhhhcccccchhhhhhcccccCcccccchhHHH Q lcl|NC_017674. 1 MSQISKTHSRLAGRNAKPFDLKNITNDAVASLSRIGLVFDHAVVQDQIKALAKAGAFRSGSAMDSNFTAPVTTPSIPTPI 80 (382) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~amDa~~~~~~t~~~~~~~~ 80 (382) |.|. +.|- .+ ++++-.- +.. .. ..+|.-....+..+.-+|. T Consensus 1 ~~~~-~~~~-----------------~~---~~~f~~~------------~~~----~~--~~~a~~~~~~~~~~~~iP~ 41 (324) T protein:vir:97 1 MEQT-QKLK-----------------LN---LQHFASN------------NVK----PQ--VFNPDNVMMHEKKDGTLMN 41 (324) T ss_pred Cccc-hhHH-----------------HH---HHHHHHh------------hhh----hh--hhccccccccCCCcceech Confidence 3331 1110 11 1111000 000 00 0111111111112223555 Q ss_pred HHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEEeeecccceeecccccCCceeeeeeeeeEeeEEEEEEEEEe Q lcl|NC_017674. 81 QFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEPAGTAVEYGDHTNIPLTSWNANFERRTIVRGELGMMV 160 (382) Q Consensus 81 ~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e~~G~a~~ygd~~DiP~vd~~~~~~~~~v~~~~~g~~y 160 (382) .+.+ +|++.+......+.++.+.+.+ ..+..+++....+.+.+.+.+..+|..+...+......+.++..+.+ T Consensus 42 ~~~~----~ii~~~~~~s~l~~~~~~~~~~---~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~f~~v~~~~~k~~~~~~i 114 (324) T protein:vir:97 42 EFTT----PILQEVMENSKIMQLGKYEPME---GTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPV 114 (324) T ss_pred hHHH----HHHHHHHhhcchhhhcceeecc---CCceEEEEEecCcceeEeccCccccccccceeEEEEeeEEEEEeehh Confidence 4444 4555554444455555444333 24567888888888899999999999999999999999999999999 Q ss_pred cHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCcceeccCCCCccccCHHHHHHHHH Q lcl|NC_017674. 161 GTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAFQTPPSQGWSTADWAGIIGDIR 240 (382) Q Consensus 161 ~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~~~~a~~~Wa~kT~~eI~~Di~ 240 (382) +.+-++.+ ..++.+.-.....+++.+.+++.++.|+..+.+ ..|+++........++. +-| ++||. T Consensus 115 s~ell~ds---~~~l~~~i~~~l~~aia~~~d~a~l~G~g~~~~--~~gi~~~~~~~~~~~~~-----~~~----~~~i~ 180 (324) T protein:vir:97 115 TKEFLNYT---YSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPF--GKSIAQSIEKTNKVIKG-----DFT----QDNII 180 (324) T ss_pred hHHHHhcc---hHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcc--Cccccccccccceeccc-----cCC----HHHHH Confidence 98555433 467888888899999999999999999865432 24666654332211111 112 56677 Q ss_pred HHHHHHHHhcCCeeeeccccceEecCHHHHhhccc-cCCCCccHHHHHHHhcC---ccEEEEccccccccCCCCCceeEE Q lcl|NC_017674. 241 EAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSV-TTPYGISVSDWIEQTYP---KMRIVSAPELSGVQMKAQEPEDAL 316 (382) Q Consensus 241 ~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~-t~~~~~Tvl~~l~~n~p---nl~i~~~peL~~a~g~g~~~~~~~ 316 (382) +++..+...- . .+.+++|.|..+..|.. .+..|..++. -.... +..++..+- ...+.+..++ T Consensus 181 ~~~~~l~~~~---~----~~~~~v~n~~~~~~L~~lkd~~g~~~~~--~~~~~tl~G~PV~~~~~-----~~~~~~~~~~ 246 (324) T protein:vir:97 181 DLEALLEDDE---L----EANAFISKTQNRSLLRKIVDPETKERIY--DRNSDTLDGLPVVNLKS-----SNLKRGELIT 246 (324) T ss_pred HHHHhhhhcc---C----CCCEEEEcHHHHHHHHHhhcCCCceeec--CCCCccccceeeEeecC-----CCCCcceEEE Confidence 7777664421 1 24478999999988864 3433433321 01101 111222111 1111111111 Q ss_pred EEcchhhhhhhccccccchhhhhhhhhhhhcc------cceecCCceEeccccceeeeEeeccchheeecCC Q lcl|NC_017674. 317 VLFVEDVNAAVDGSTDGGSVFSQLVQSKFITL------GVEKRAKSYVEDFSNGTAGALCKRPWAVVRYLGI 382 (382) Q Consensus 317 ~~~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l------~~~~~~~~~~~~~~~~t~Gv~i~~P~aia~~~GI 382 (382) -.+.+-+-. +.+.-......+..-...... -.+. -.....+..|. |+.+++|.||+.+.+. T Consensus 247 gd~~~~~i~--~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~--d~~~~r~~~r~-d~~v~~~~a~~~l~~~ 313 (324) T protein:vir:97 247 GDFDKLIYG--IPQLIEYKIDETAQLSTVKNEDGTPVNLFEQ--DMVALRATMHV-ALHIADDKAFAKLVPA 313 (324) T ss_pred EecccEEEE--EecCcEEEEeecccccccccccccchhhhhc--CcEEEEEEEEe-ccEEecccceEEEEec Confidence 111110000 000000000000000000000 0000 01122334444 5555679999999999 No 33 >protein:vir:95763 Length: 297 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950590;genbank:gi:119953785;genbank:GeneID:5076833 Probab=97.80 E-value=3.9e-06 Score=50.25 Aligned_cols=289 Identities=7% Similarity=-0.091 Sum_probs=145.7 Q ss_pred hhhhhcccccchhhhhhcccccCcccccchhHHHHHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEEeeeccc Q lcl|NC_017674. 47 QIKALAKAGAFRSGSAMDSNFTAPVTTPSIPTPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEPAG 126 (382) Q Consensus 47 ~~~~~~~~~~~~~~~amDa~~~~~~t~~~~~~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e~~G 126 (382) |. .-.||+.-....++++.-+|..+.+ +|++.+......+.+.++...+.- ...++++..... T Consensus 1 m~-----------~~~~~~~~~~~t~~~~~lvP~~~~~----~ii~~~~~~s~l~~~~~~~~~~~~--~~~~~~~~~~~~ 63 (297) T protein:vir:95 1 MT-----------VQTFNPENVLVSQKKDGTLHKEFTD----IIMKEVAQNSLVMQLGQYQEMEGE--QEKTVYVQTDGI 63 (297) T ss_pred CC-----------ccccccccccccCCCcceechhHHH----HHHHHHHhhchhhhhcceeecCCC--ccEEEEEEcCCc Confidence 10 0123433221111222225555554 566666555555666555433221 234456666677 Q ss_pred ceeecccccCCceeeeeeeeeEeeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCccc Q lcl|NC_017674. 127 TAVEYGDHTNIPLTSWNANFERRTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNR 206 (382) Q Consensus 127 ~a~~ygd~~DiP~vd~~~~~~~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g 206 (382) .+.+.+++.++|..+..........+.++..+.++.+-++.+. .++.+.-....++++.+.+++-+++|+..+ + T Consensus 64 ~a~~v~Eg~~~~~~~~~f~~v~l~~~k~~~~~~is~ell~ds~---~~l~~~i~~~la~ai~~~~d~a~l~G~g~~---~ 137 (297) T protein:vir:95 64 SAYWVNETEKIKTDKPEVVPVTLKAHKLGIILVTSREALNYTW---KKFFEDMKPQIVEAFYKKIDEAGLLGHDTP---F 137 (297) T ss_pred eeEEeecCccccccccceeEEEEeeEEEEEeehhhHHHHhcCH---HHHHHHHHHHHHHHHHHHHHHHHhcccCCc---c Confidence 7888899899999999999999999999999999986555333 578888889999999999999999998542 2 Q ss_pred ceEEEeCCCCcceeccCCCCccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhcccc-CCCCccHHH Q lcl|NC_017674. 207 TYGFLNDPNLPAFQTPPSQGWSTADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSVT-TPYGISVSD 285 (382) Q Consensus 207 ~~GllN~P~l~~~~~~a~~~Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~t-~~~~~Tvl~ 285 (382) -.|+++.........+ ..-| ++||.+++.++...-. .+..+++.+..+..|.+. +..|.-++ T Consensus 138 ~~gi~~~~~~~~~~~~-----~~~t----~~~i~~~~~~l~~~~~-------~~~~~v~~~~~~~~L~~l~d~~G~~i~- 200 (297) T protein:vir:95 138 ANSVAKAAKDANKVIG-----GPIN----YDNILKLQDALYDADV-------EPNAFVSKIQNRSALREARDGNKVSIY- 200 (297) T ss_pred cccccccccccceecc-----cccC----HHHHHHHHHHhhhccC-------CcCEEEEcHHHHHHHHHhhccCCceee- Confidence 3577764432111111 0112 5677777777654321 234789999999988642 33333222 Q ss_pred HHHHhcCccEEEEccccccccCCCCCceeEEEEcchhhhhhhccccccchhhhhhhhhhhhc-ccce---ecCCceEecc Q lcl|NC_017674. 286 WIEQTYPKMRIVSAPELSGVQMKAQEPEDALVLFVEDVNAAVDGSTDGGSVFSQLVQSKFIT-LGVE---KRAKSYVEDF 361 (382) Q Consensus 286 ~l~~n~pnl~i~~~peL~~a~g~g~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~p~~~~~-l~~~---~~~~~~~~~~ 361 (382) ..... ++...|-.-.-....+.+..++-.+.. +. ..+.+......+.+.-...... -+.. ...-.....+ T Consensus 201 --~~~~~--~l~G~Pv~~~~~~~~~~~~~~~gd~s~-~~-~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~ 274 (297) T protein:vir:95 201 --DKAAN--TIDGITTVDLKSARFEKGDLLAGDFDN-LI-YGVPYNITYKISEEGQISTITNADGTPINLFEQEMIAIRA 274 (297) T ss_pred --cCCCC--cccceeeEeecCCCCCCceEEEEeccc-EE-EEEecCeEEEEeeccccccccccCccchhhhhcCcEEEEE Confidence 11111 111122110001111222211111111 00 0000000000000000000000 0000 0011222334 Q ss_pred ccceeeeEeeccchheeecCC Q lcl|NC_017674. 362 SNGTAGALCKRPWAVVRYLGI 382 (382) Q Consensus 362 ~~~t~Gv~i~~P~aia~~~GI 382 (382) ..|. |..+++|.||+.+..- T Consensus 275 ~~~~-d~~v~~~~a~~~l~~a 294 (297) T protein:vir:95 275 TMDI-AVMITKTDAFAKLTPA 294 (297) T ss_pred EEEe-ccEeecccceEEEeec Confidence 4444 5556779999998877 No 34 >protein:vir:2430 Length: 318 # NCBI annotation: major head subunit # Family: family:all:507 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046832;genbank:gi:9630400;genbank:GeneID:1261582 Probab=97.79 E-value=2.6e-06 Score=51.18 Aligned_cols=291 Identities=12% Similarity=0.009 Sum_probs=143.2 Q ss_pred hcccccchhhhhhcccccCc----ccccchhHHHHHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEEeeeccc Q lcl|NC_017674. 51 LAKAGAFRSGSAMDSNFTAP----VTTPSIPTPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEPAG 126 (382) Q Consensus 51 ~~~~~~~~~~~amDa~~~~~----~t~~~~~~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e~~G 126 (382) +++. -+||+.-... .+..+.-+|..+.+ +|++.+.+....+.+..+.... ..+..+++....+ T Consensus 1 ~~~~------~~~~~e~~~~~~~~~~~~~~~ip~~~~~----~ii~~~~~~~~l~~~~~~~~~~---~~~~~ip~~~~~~ 67 (318) T protein:vir:24 1 MAAG------TAFAVDHAQIAQTGDTMFKGYLEPEQAK----DYFAEAEKTSIVQQFAQKVPMG---TTGQKIPHWVGDV 67 (318) T ss_pred CCCC------CCCCHHHHHhhcccCcccceeechhHHH----HHHHHHHhhchhhhhcceeecc---CCceEEEEEeCCc Confidence 2222 2233222111 11112225555544 4455554444455555443322 2356677777778 Q ss_pred ceeecccccCCceeeeeeeeeEeeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCccc Q lcl|NC_017674. 127 TAVEYGDHTNIPLTSWNANFERRTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNR 206 (382) Q Consensus 127 ~a~~ygd~~DiP~vd~~~~~~~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g 206 (382) .+.+.+....+|..+...++..-..+.++..+.+|.+-++. ...++.+.-.....+++...+|+-+++|+..+.. T Consensus 68 ~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~~~iS~e~l~d---s~~~~~~~i~~~l~~~~~~~~d~a~l~G~g~~~~-- 142 (318) T protein:vir:24 68 SAQWIGEGDMKPITKGNMTSQTIAPHKIATIFVASAETVRA---NPANYLGTMRTKVATAFAMAFDGAAMHGTDSPFP-- 142 (318) T ss_pred ceEEecCCccccccccceeEEEEeeEEEEEeehhhHHHhhc---ChHHHHHHHHHHHHHHHHHHHHHhhhcccCCCCC-- Confidence 88899999999999999999999999999999998865543 3367888888999999999999999999854322 Q ss_pred ceEEEeCCC-CcceeccCCCCccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhccc-cCCCCccHH Q lcl|NC_017674. 207 TYGFLNDPN-LPAFQTPPSQGWSTADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSV-TTPYGISVS 284 (382) Q Consensus 207 ~~GllN~P~-l~~~~~~a~~~Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~-t~~~~~Tvl 284 (382) .|+++... +....+...+ ... .+++.+++..+...- ..+..++|.|+.+..|.. .+..|..++ T Consensus 143 -~~~~~~~~~~~~~~~~~~~----~~~---~~~~~~~~~~~~~~~-------~~~~~~v~n~~~~~~L~~lkd~~G~~l~ 207 (318) T protein:vir:24 143 -TYIGQTTKAISIADTTGAT----TVY---DQVAVNGLSLLVNDG-------KKWTHTLLDDITEPILNGAKDQNGRPLF 207 (318) T ss_pred -ccccccccccccccccccc----chH---HHHHHHHHHhhcccc-------CCCCEEEEcHHHHHHHHHhhccCCceee Confidence 35554332 1111111111 111 223334443332211 124578999999998864 244444433 Q ss_pred HHHHHh-----cCccEEEEccccccccCCCCCceeEEEEcchhhhhhhccccccchhhhhhh-hhhhhcccc----e--- Q lcl|NC_017674. 285 DWIEQT-----YPKMRIVSAPELSGVQMKAQEPEDALVLFVEDVNAAVDGSTDGGSVFSQLV-QSKFITLGV----E--- 351 (382) Q Consensus 285 ~~l~~n-----~pnl~i~~~peL~~a~g~g~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~-p~~~~~l~~----~--- 351 (382) .-...+ +...++...|-.-......++....+-.+.+ + +.+- .+.. ...+ ......-.. . T Consensus 208 ~~~~~~~~~~~~~~~~i~g~pv~~~~~~~~~~~~~~~gdfs~-~---~~~~-~~~l--~i~~~~~~~~~~~~~~~~~~~~ 280 (318) T protein:vir:24 208 IESTYGEAASPFRSGRIVARPTILSDHVVEGTTVGFMGDFSQ-L---IWGQ-IGGL--SFDVTDQATLNLGTVESPNFVS 280 (318) T ss_pred cCccccCccccccCceEEEEeeEEeCCCCCCccEEEEeecce-E---EEEE-ecCe--EEEEeeccceeccccccccchh Confidence 211111 1112333333321111111111001001100 0 0000 0000 0000 000000000 0 Q ss_pred -ecCCceEeccccceeeeEeeccchheeecCC Q lcl|NC_017674. 352 -KRAKSYVEDFSNGTAGALCKRPWAVVRYLGI 382 (382) Q Consensus 352 -~~~~~~~~~~~~~t~Gv~i~~P~aia~~~GI 382 (382) ...-.....+..|. |+.+.+|.||+.+.++ T Consensus 281 ~f~~~~~~~r~~~r~-d~~v~~~~a~~~i~~~ 311 (318) T protein:vir:24 281 LWQHNLVAVRVEAEY-AFHCNDAEAFVALTNV 311 (318) T ss_pred hhhcCcEEEEEEEEE-ccEEecccceEEEEee Confidence 00111223445555 5666889999999999 No 35 >protein:vir:9309 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803287;genbank:gi:29028597;genbank:GeneID:1258044 Probab=97.78 E-value=4.7e-06 Score=49.79 Aligned_cols=305 Identities=8% Similarity=-0.074 Sum_probs=146.1 Q ss_pred ccccccchHHHHHHhhcceeccccchhhhhhhhcccccchhhhhhcccccCcccccchhHHHHHHhhhhhhheecccccc Q lcl|NC_017674. 19 FDLKNITNDAVASLSRIGLVFDHAVVQDQIKALAKAGAFRSGSAMDSNFTAPVTTPSIPTPIQFLQTWLPGFVKVMTAAR 98 (382) Q Consensus 19 ~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~amDa~~~~~~t~~~~~~~~~~l~~idp~v~~~~~~~~ 98 (382) |+.......+++. |-. .... ...+.|+-....++++.-+|-.+.+ +|++.+.... T Consensus 1 ~~~~~~~~~~~~~-------f~~--------~~~~------~~~~~a~~~~~~~~~~~liP~~~~~----~ii~~~~~~s 55 (324) T protein:vir:93 1 MEQTQKLKLNLQH-------FAS--------NNVK------PQVFNPDNVMMHEKKDGTLLNDFTT----PILQEVMENS 55 (324) T ss_pred CchhHHHHHHHHH-------HHH--------hhhh------hhhcccccccccCCCcceechhHHH----HHHHHHHhhc Confidence 1111111111111 110 0000 0111222111111122224544444 4555554444 Q ss_pred chhhhCccccCCCcceeeEEEEeeecccceeecccccCCceeeeeeeeeEeeEEEEEEEEEecHHHHHHHHHhCCChHHH Q lcl|NC_017674. 99 KIDEIIGIDTVGSWEDQEIVQGIVEPAGTAVEYGDHTNIPLTSWNANFERRTIVRGELGMMVGTLEEGRASAIRLNSAET 178 (382) Q Consensus 99 ~~~~l~~v~t~g~~~~~t~t~~v~e~~G~a~~ygd~~DiP~vd~~~~~~~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~ 178 (382) ..+.+..+...+. ..+.|++.+..+.+.+.+.+.++|..+...+...-..+.++..+.+|.+-++.+ ..++.+. T Consensus 56 ~l~~l~~~~~~~~---~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds---~~~l~~~ 129 (324) T protein:vir:93 56 KIMQLGKYEPMEG---TEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYT---YSQFFEE 129 (324) T ss_pred hhhhhcceeeccC---CceEEEEEecCcceeeecCCccccccccceeEEEEEeEEEEEeehhhHHHHhcc---hHHHHHH Confidence 4555555443322 346788888888888999999999999999999999999999999988555433 3578888 Q ss_pred HHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCcceeccCCCCccccCHHHHHHHHHHHHHHHHHhcCCeeeecc Q lcl|NC_017674. 179 KRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAFQTPPSQGWSTADWAGIIGDIREAVRQLRIQSQDQIDPKA 258 (382) Q Consensus 179 K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~~~~a~~~Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~ 258 (382) -.....+++.+.+++-+++|+..+. ...|+++........+. + ..-++||.+++..|...- . T Consensus 130 i~~~l~~aia~~~d~a~l~G~g~~~--~~~~~~~~~~~~~~~~~-~--------~~~~~~i~~~~~~l~~~~---~---- 191 (324) T protein:vir:93 130 MKPMIAEAFYKKFDEAGILNQGNNP--FGKSIAQSIEKTNKVIK-G--------DFTQDNIIDLEALLEDDE---L---- 191 (324) T ss_pred HHHHHHHHHHHHHHHHHhcCCCCCC--cCccccccccccceecc-c--------cccHHHHHHHHHhhhhcc---C---- Confidence 8888888888889999999975432 22456654332211111 1 112677888887765432 1 Q ss_pred ccceEecCHHHHhhccc-cCCCCccHHHHHHHhcCccEEEEccccccccCCCCCceeEEEEc-------chhhhhhhccc Q lcl|NC_017674. 259 EKITLALATSKVDYLSV-TTPYGISVSDWIEQTYPKMRIVSAPELSGVQMKAQEPEDALVLF-------VEDVNAAVDGS 330 (382) Q Consensus 259 ~p~~L~Lp~~~~~~Ls~-t~~~~~Tvl~~l~~n~pnl~i~~~peL~~a~g~g~~~~~~~~~~-------~~~v~~~~~~~ 330 (382) .+..+++.++.+..|.+ .+..|.-++. ....+ ++...|=.-......+.+..++-.+ ..++. ++.+ T Consensus 192 ~~~~~v~n~~~~~~L~~l~d~~G~~~~~--~~~~~--~l~G~PVv~~~~~~~~~~~i~~gdfs~~~~~~~~~~~--i~~~ 265 (324) T protein:vir:93 192 EANAFISKTQNRSLLRKIVDPETKERIY--DRNSD--SLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIE--YKID 265 (324) T ss_pred CCCEEEEcHHHHHHHHHhhCCCCCeeec--CCCCC--cccceeeEeecCCCCCcceEEEEecceEEEEEecCcE--EEEe Confidence 24478999999998864 3333432211 00111 1111111100111112221111111 11100 0000 Q ss_pred cccchhhhhhhhhhhhcccceecCCceEeccccceeeeEeeccchheeecCC Q lcl|NC_017674. 331 TDGGSVFSQLVQSKFITLGVEKRAKSYVEDFSNGTAGALCKRPWAVVRYLGI 382 (382) Q Consensus 331 ~~~~~~~~~~~p~~~~~l~~~~~~~~~~~~~~~~t~Gv~i~~P~aia~~~GI 382 (382) +.. .+...-...-...-. ...-...+.+..|. |+.+.+|.||+++.+. T Consensus 266 ~~~--~~~~~~~~~~~~~~~-f~~n~~~~r~~~r~-d~~v~~~~a~~~l~~a 313 (324) T protein:vir:93 266 ETA--QLSTVKNEDGTPVNL-FEQDMVALRATMHV-ALHIADDKAFAKLVPA 313 (324) T ss_pred ecc--cccccccccccchhh-hhcCcEEEEEEEEe-ccEEecccceEEEecc Confidence 000 000000000000000 00011233444455 5667889999999998 No 36 >protein:vir:80684 Length: 315 # NCBI annotation: gp6 # Family: family:all:966 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285582;genbank:gi:148727088;genbank:GeneID:5247055 Probab=97.70 E-value=3.9e-06 Score=50.22 Aligned_cols=281 Identities=9% Similarity=-0.052 Sum_probs=141.2 Q ss_pred hhhcccccCcccccchhHHHHHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEEeeecccceeecccccCCcee Q lcl|NC_017674. 61 SAMDSNFTAPVTTPSIPTPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEPAGTAVEYGDHTNIPLT 140 (382) Q Consensus 61 ~amDa~~~~~~t~~~~~~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e~~G~a~~ygd~~DiP~v 140 (382) ||..+...+ +.-+|..+.+ +|++.+...-..+.+..+...+. ...++++....+.|.+.|.+..+|.. T Consensus 1 Ma~~~~~~g-----g~~vP~~~~~----~ii~~l~~~s~i~~l~~~i~~~~---~~~~ip~~~~~~~a~wv~Eg~~~~~s 68 (315) T protein:vir:80 1 MADDFLSAG-----KLELPGSMIG----AVRDRAIDSGVLAKLSPEQPTIF---GPVKGAVFSGVPRAKIVGEGEVKPSA 68 (315) T ss_pred CCCCcCCcC-----ceEcchHHHH----HHHHHHHhhchhhhhcceeecCC---CceEEEEEeCCcceEEeeCCcccccc Confidence 333332122 2235555544 45555554444454444332222 35678888888888999999999999 Q ss_pred eeeeeeeEeeEEEEEEEEEecHHHHHHHHHhC-CChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCcce Q lcl|NC_017674. 141 SWNANFERRTIVRGELGMMVGTLEEGRASAIR-LNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAF 219 (382) Q Consensus 141 d~~~~~~~~~v~~~~~g~~y~~~El~~A~~~g-~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~ 219 (382) +...++..-..+.++....+|.+=++.....- -.|.+.-....++++.+.+++-+|+|+..+...+.-|+.+.-+.. T Consensus 69 ~~~f~~v~l~~~kl~~~~~iS~ell~~s~~~~~~~l~~~i~~~la~ai~~~~d~a~~~G~~~~~~~~~~~~~~~~~~~-- 146 (315) T protein:vir:80 69 SVDVSAFTAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKAASAVHTSLNKT-- 146 (315) T ss_pred ccceeeeEeeeeeEEeeehhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHhhheeeccCCCCCccccccccccccc-- Confidence 99988888888888888888875333222111 126677778889999999999999997543333333433321110 Q ss_pred eccCCCCccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhcccc-CC-----CCccHHHHHHHhcC- Q lcl|NC_017674. 220 QTPPSQGWSTADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSVT-TP-----YGISVSDWIEQTYP- 292 (382) Q Consensus 220 ~~~a~~~Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~t-~~-----~~~Tvl~~l~~n~p- 292 (382) +. ........++||.+++..+...... .+..++|.|+.+..|.+. +. .+..++.=+...-| T Consensus 147 -----~~-~~~~~~~~~~d~~~~~~~~~~~~~~------~~~~~imn~~~~~~L~~l~~~~g~~~~g~~~~~~~~~g~~~ 214 (315) T protein:vir:80 147 -----KN-IVDATDSATADLVKAVGLIAGAGLQ------VPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLD 214 (315) T ss_pred -----cc-eeeccccchHHHHHHHHHHhhccCc------cceEEEEcHHHHHHHHHHhhccCCcccccccccccccCCCc Confidence 00 1122334567888888776543321 123578998888887532 11 11111110111101 Q ss_pred ---ccEEEE---ccccccccCCCCCceeEEE---------EcchhhhhhhccccccchhhhhhhhhhhhcccceecCCce Q lcl|NC_017674. 293 ---KMRIVS---APELSGVQMKAQEPEDALV---------LFVEDVNAAVDGSTDGGSVFSQLVQSKFITLGVEKRAKSY 357 (382) Q Consensus 293 ---nl~i~~---~peL~~a~g~g~~~~~~~~---------~~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~~~~ 357 (382) ++.++. +|..... +.+....++ -+..++.- +-.+.. ..-..+..+ ...-.. T Consensus 215 tl~G~PV~~~~~~~~~~~~---~~~~~~~~~~GDfs~~~~g~~~~~~i--~i~~~~---~~~~~~~~~------~~~~~v 280 (315) T protein:vir:80 215 NWRGLNVGASSTVSGAPEM---SPASGVKAIVGDFSRVHWGFQRNFPI--ELIEYG---DPDQTGRDL------KGHNEV 280 (315) T ss_pred eecceeeEecCcCCccccc---ccccccEEEEeecccEEEEEecCeeE--EEeccc---cccCcccch------hhcCcE Confidence 111221 2221111 111111111 11111100 000000 000000000 001112 Q ss_pred EeccccceeeeEeeccchheeecCC Q lcl|NC_017674. 358 VEDFSNGTAGALCKRPWAVVRYLGI 382 (382) Q Consensus 358 ~~~~~~~t~Gv~i~~P~aia~~~GI 382 (382) .+.+..|. |..|++|.||+++.+. T Consensus 281 ~~r~~~r~-~~~v~~~~a~~~l~~~ 304 (315) T protein:vir:80 281 MVRAEAVL-YVAIESLDSFAVVKEK 304 (315) T ss_pred EEEEEEEe-cceeecccceEEEeec Confidence 33455554 6678899999999998 No 37 >protein:vir:105905 Length: 304 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004375;genbank:gi:122891830;genbank:GeneID:4712376 Probab=97.70 E-value=6.7e-06 Score=48.94 Aligned_cols=288 Identities=8% Similarity=-0.086 Sum_probs=147.4 Q ss_pred hhhhhcccccchhhhhhcccccCcccccchh--HHHHHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEEeeec Q lcl|NC_017674. 47 QIKALAKAGAFRSGSAMDSNFTAPVTTPSIP--TPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEP 124 (382) Q Consensus 47 ~~~~~~~~~~~~~~~amDa~~~~~~t~~~~~--~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e~ 124 (382) |. .. ..+|.-. .++.+.+ +|..+. .+|++.+......+.+..+...+. ....+++.+. T Consensus 1 ma----~~-------~~~~~~~--~~t~~gg~lip~~~~----~~ii~~~~~~~~l~~~~~~~~~~~---~~~~ip~~~~ 60 (304) T protein:vir:10 1 MA----TP-------TYTPGNV--ILSDFKNGVIPAEQG----TLIMKDIMANSAIMKLAKNEPMTA---QKKKFTYLAK 60 (304) T ss_pred Cc----cc-------ccccccc--cccCCCceecchhHH----HHHHHHHHhccchhhhcceeeccC---CceEEEEEeC Confidence 11 00 1111111 1122222 555443 356666655555666665544333 3456778877 Q ss_pred ccceeecccccCCceeeeeeeeeEeeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCc Q lcl|NC_017674. 125 AGTAVEYGDHTNIPLTSWNANFERRTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLG 204 (382) Q Consensus 125 ~G~a~~ygd~~DiP~vd~~~~~~~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~ 204 (382) .+.+.+.+....+|..+...++.....+.++..+.++.+=++ ....++.+.-.....+++.+.+|+-+++|+..+. T Consensus 61 ~~~a~~v~E~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~---ds~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~- 136 (304) T protein:vir:10 61 GVGAYWVSETERIQTSKPEYAQAEMEAKKIGVIIPLSKEFLK---WTAKDFFNEVKPLIAEAFYKAFDQAVIFGTKSPY- 136 (304) T ss_pred CcceEEeecCcccccccceeeEEEEEEEEEEEeehhhHHHHh---cchHHHHHHHHHHHHHHHHHHHHhhheeccCCCc- Confidence 778888898889999999999999999999999999885443 3347788888899999999999999999985421 Q ss_pred ccceEEEeCCCCcceeccCCCCccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhccc-cCCCCccH Q lcl|NC_017674. 205 NRTYGFLNDPNLPAFQTPPSQGWSTADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSV-TTPYGISV 283 (382) Q Consensus 205 ~g~~GllN~P~l~~~~~~a~~~Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~-t~~~~~Tv 283 (382) -.|.+....+....... .. .++....++||.+++.++...-. .+..++|.++.+..|.+ .+..|.-+ T Consensus 137 --~~~~~~~~~~~~~~~~~--~~-~~~~~~~~~~i~~~~~~l~~~~~-------~~~~~v~~~~~~~~L~~lkd~~G~~l 204 (304) T protein:vir:10 137 --NTSTSGKPLVEGAEEKG--NV-VTDTNNLYVDLSALMATIEDEEL-------DPNGVLTTRSFRSKMRNALDANDRPL 204 (304) T ss_pred --ccccccccccccccccc--cc-cccccchHHHHHHHHHHhhhccC-------CcCEEEEcHHHHHHHHHhhccCCcEe Confidence 12333333332211111 11 12233458888888877754321 23468999999999864 33334332 Q ss_pred HHHHHHh---cCccEEEEccccccccCCCCCceeEEEEcchhhhhhhccccccchhhhhhhhhhhhcccce--------- Q lcl|NC_017674. 284 SDWIEQT---YPKMRIVSAPELSGVQMKAQEPEDALVLFVEDVNAAVDGSTDGGSVFSQLVQSKFITLGVE--------- 351 (382) Q Consensus 284 l~~l~~n---~pnl~i~~~peL~~a~g~g~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~~--------- 351 (382) ++ .+ .-++.++..+.+-. ..+++. +++-.-.++. ..+.+........+. .+....-+ T Consensus 205 ~~---~~~~~l~G~PV~~~~~~~~---~~~~~~-~~~gd~~~~~-~~~~~~~~i~~~~e~---~~~~~~~~~~~g~~~~~ 273 (304) T protein:vir:10 205 FD---ANGNEIMGLPLSYTGADVY---DKKKSL-ALMGDWDYAR-YGILQGIEYAISEDA---TLTTLQASDASGQPVSL 273 (304) T ss_pred ec---CCCccccceeeEEeccccc---CCCCcE-EEEEehhhEE-EEEecceEEEEeecc---eeeeecccccCccchhh Confidence 21 11 00122222222211 111221 1111101100 000000000000000 00000000 Q ss_pred ecCCceEeccccceeeeEeeccchheeecCC Q lcl|NC_017674. 352 KRAKSYVEDFSNGTAGALCKRPWAVVRYLGI 382 (382) Q Consensus 352 ~~~~~~~~~~~~~t~Gv~i~~P~aia~~~GI 382 (382) ...--+...++.|.++ .+++|.||+.+..- T Consensus 274 f~~~~~~~r~~~r~~~-~v~~~~a~~~l~~a 303 (304) T protein:vir:10 274 FERDMFALRATMHIAY-MNVKPEAFATLKPT 303 (304) T ss_pred hhcCcEEEEEEEEecc-EeecccceEEEEec Confidence 0011122344555554 45569999999888 No 38 >protein:vir:94142 Length: 304 # NCBI annotation: ORF013 # Family: family:all:507 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240234;genbank:gi:66395898;genbank:GeneID:5133311 Probab=97.70 E-value=6.7e-06 Score=48.94 Aligned_cols=288 Identities=8% Similarity=-0.086 Sum_probs=147.4 Q ss_pred hhhhhcccccchhhhhhcccccCcccccchh--HHHHHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEEeeec Q lcl|NC_017674. 47 QIKALAKAGAFRSGSAMDSNFTAPVTTPSIP--TPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEP 124 (382) Q Consensus 47 ~~~~~~~~~~~~~~~amDa~~~~~~t~~~~~--~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e~ 124 (382) |. .. ..+|.-. .++.+.+ +|..+. .+|++.+......+.+..+...+. ....+++.+. T Consensus 1 ma----~~-------~~~~~~~--~~t~~gg~lip~~~~----~~ii~~~~~~~~l~~~~~~~~~~~---~~~~ip~~~~ 60 (304) T protein:vir:94 1 MA----TP-------TYTPGNV--ILSDFKNGVIPAEQG----TLIMKDIMANSAIMKLAKNEPMTA---QKKKFTYLAK 60 (304) T ss_pred Cc----cc-------ccccccc--cccCCCceecchhHH----HHHHHHHHhccchhhhcceeeccC---CceEEEEEeC Confidence 11 00 1111111 1122222 555443 356666655555666665544333 3456778877 Q ss_pred ccceeecccccCCceeeeeeeeeEeeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCc Q lcl|NC_017674. 125 AGTAVEYGDHTNIPLTSWNANFERRTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLG 204 (382) Q Consensus 125 ~G~a~~ygd~~DiP~vd~~~~~~~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~ 204 (382) .+.+.+.+....+|..+...++.....+.++..+.++.+=++ ....++.+.-.....+++.+.+|+-+++|+..+. T Consensus 61 ~~~a~~v~E~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~---ds~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~- 136 (304) T protein:vir:94 61 GVGAYWVSETERIQTSKPEYAQAEMEAKKIGVIIPLSKEFLK---WTAKDFFNEVKPLIAEAFYKAFDQAVIFGTKSPY- 136 (304) T ss_pred CcceEEeecCcccccccceeeEEEEEEEEEEEeehhhHHHHh---cchHHHHHHHHHHHHHHHHHHHHhhheeccCCCc- Confidence 778888898889999999999999999999999999885443 3347788888899999999999999999985421 Q ss_pred ccceEEEeCCCCcceeccCCCCccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhccc-cCCCCccH Q lcl|NC_017674. 205 NRTYGFLNDPNLPAFQTPPSQGWSTADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSV-TTPYGISV 283 (382) Q Consensus 205 ~g~~GllN~P~l~~~~~~a~~~Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~-t~~~~~Tv 283 (382) -.|.+....+....... .. .++....++||.+++.++...-. .+..++|.++.+..|.+ .+..|.-+ T Consensus 137 --~~~~~~~~~~~~~~~~~--~~-~~~~~~~~~~i~~~~~~l~~~~~-------~~~~~v~~~~~~~~L~~lkd~~G~~l 204 (304) T protein:vir:94 137 --NTSTSGKPLVEGAEEKG--NV-VTDTNNLYVDLSALMATIEDEEL-------DPNGVLTTRSFRSKMRNALDANDRPL 204 (304) T ss_pred --ccccccccccccccccc--cc-cccccchHHHHHHHHHHhhhccC-------CcCEEEEcHHHHHHHHHhhccCCcEe Confidence 12333333332211111 11 12233458888888877754321 23468999999999864 33334332 Q ss_pred HHHHHHh---cCccEEEEccccccccCCCCCceeEEEEcchhhhhhhccccccchhhhhhhhhhhhcccce--------- Q lcl|NC_017674. 284 SDWIEQT---YPKMRIVSAPELSGVQMKAQEPEDALVLFVEDVNAAVDGSTDGGSVFSQLVQSKFITLGVE--------- 351 (382) Q Consensus 284 l~~l~~n---~pnl~i~~~peL~~a~g~g~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~~--------- 351 (382) ++ .+ .-++.++..+.+-. ..+++. +++-.-.++. ..+.+........+. .+....-+ T Consensus 205 ~~---~~~~~l~G~PV~~~~~~~~---~~~~~~-~~~gd~~~~~-~~~~~~~~i~~~~e~---~~~~~~~~~~~g~~~~~ 273 (304) T protein:vir:94 205 FD---ANGNEIMGLPLSYTGADVY---DKKKSL-ALMGDWDYAR-YGILQGIEYAISEDA---TLTTLQASDASGQPVSL 273 (304) T ss_pred ec---CCCccccceeeEEeccccc---CCCCcE-EEEEehhhEE-EEEecceEEEEeecc---eeeeecccccCccchhh Confidence 21 11 00122222222211 111221 1111101100 000000000000000 00000000 Q ss_pred ecCCceEeccccceeeeEeeccchheeecCC Q lcl|NC_017674. 352 KRAKSYVEDFSNGTAGALCKRPWAVVRYLGI 382 (382) Q Consensus 352 ~~~~~~~~~~~~~t~Gv~i~~P~aia~~~GI 382 (382) ...--+...++.|.++ .+++|.||+.+..- T Consensus 274 f~~~~~~~r~~~r~~~-~v~~~~a~~~l~~a 303 (304) T protein:vir:94 274 FERDMFALRATMHIAY-MNVKPEAFATLKPT 303 (304) T ss_pred hhcCcEEEEEEEEecc-EeecccceEEEEec Confidence 0011122344555554 45569999999888 No 39 >protein:vir:96223 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239571;genbank:gi:66395304;genbank:GeneID:5132771 Probab=97.68 E-value=8.8e-06 Score=48.31 Aligned_cols=304 Identities=9% Similarity=-0.063 Sum_probs=144.1 Q ss_pred CCCcceeeeecCccccccccccccchHHHHHHhhcceeccccchhhhhhhhcccccchhhhhhcccccCcccccchhHHH Q lcl|NC_017674. 1 MSQISKTHSRLAGRNAKPFDLKNITNDAVASLSRIGLVFDHAVVQDQIKALAKAGAFRSGSAMDSNFTAPVTTPSIPTPI 80 (382) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~amDa~~~~~~t~~~~~~~~ 80 (382) |+| -.+.. .++++ |- ..+.+.. ..++.-.......+.-+|- T Consensus 1 ~~~--------------------~~~~~-~~~~~----f~--------~~~~~~~------~~~a~~~~~~~~~~~lip~ 41 (324) T protein:vir:96 1 MEQ--------------------TQKLK-LNLQH----FA--------SNNVKPQ------VFNPDNVMMHEKKDGTLLN 41 (324) T ss_pred CCc--------------------chhhh-HHHHH----HH--------Hhhhhhh------hcccccccccCCCcceech Confidence 222 11111 11111 10 0111111 1121111111112222454 Q ss_pred HHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEEeeecccceeecccccCCceeeeeeeeeEeeEEEEEEEEEe Q lcl|NC_017674. 81 QFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEPAGTAVEYGDHTNIPLTSWNANFERRTIVRGELGMMV 160 (382) Q Consensus 81 ~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e~~G~a~~ygd~~DiP~vd~~~~~~~~~v~~~~~g~~y 160 (382) .+.+ +|++.+......+.++++..... .++.|++.+..+.+.+.|....+|..+....+.....+.++..+.+ T Consensus 42 ~~~~----~ii~~~~~~s~l~~l~~~~~~~~---~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~f~~v~~~~~k~~~~~~i 114 (324) T protein:vir:96 42 DFTT----PILQEVMENSKIMQLGKYEPMEG---TEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPV 114 (324) T ss_pred hHHH----HHHHHHHhhchhhhhcceeeccC---CceEEEEEecCcceeeecCCccccccccceeEEEEEeEEEEEeehh Confidence 4444 44444444444555555544332 3577888888888889999999999999999999999999999999 Q ss_pred cHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCcceeccCCCCccccCHHHHHHHHH Q lcl|NC_017674. 161 GTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAFQTPPSQGWSTADWAGIIGDIR 240 (382) Q Consensus 161 ~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~~~~a~~~Wa~kT~~eI~~Di~ 240 (382) +.+=++.+ ..++.+.-.....+++.+.+++.+++|+..+.. -.|+++.-.. ...+..++ .-++||. T Consensus 115 s~ell~ds---~~~l~~~i~~~l~~aia~~~d~~~l~G~g~~~~--~~~~~~~~~~-------~~~~~~~~--~~~~~i~ 180 (324) T protein:vir:96 115 TKEFLNYT---YSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPF--GKSIAQSIKK-------TNKVIKGD--FTQDNII 180 (324) T ss_pred hHHHHhcc---hHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCCc--Cccccccccc-------cceecccc--cchHHHH Confidence 88555433 367888888888899999999999999754322 2344442211 11111111 1256666 Q ss_pred HHHHHHHHhcCCeeeeccccceEecCHHHHhhccc-cCCCCccHHHHHH-HhcCccEEEEccccccccCCCCCceeEE-- Q lcl|NC_017674. 241 EAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSV-TTPYGISVSDWIE-QTYPKMRIVSAPELSGVQMKAQEPEDAL-- 316 (382) Q Consensus 241 ~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~-t~~~~~Tvl~~l~-~n~pnl~i~~~peL~~a~g~g~~~~~~~-- 316 (382) +++..+...- . .+..+++.++.+..|.. .+..|..++.--. .++-++.++..+.. ..+.+..++ T Consensus 181 ~~~~~i~~~~---~----~~~~~i~n~~~~~~L~~lkd~~G~~~~~~~~~~~l~G~PV~~~~~~-----~~~~~~~~~gd 248 (324) T protein:vir:96 181 DLEALLEDDE---L----EANAFISKTQNRSLLRKIVDPETKERIYDRNSDSLDGLPVVNLKSS-----NLKRGELITGD 248 (324) T ss_pred HHHHhhhhcc---C----CCCEEEEcHHHHHHHHHhhCCCCCeeecCCCCCcccceeeEeecCC-----CCCcceEEEEe Confidence 6776664321 1 24578999999988864 3333432221000 00111122211111 111111111 Q ss_pred -----EEcchhhhhhhccccccchhhhhhhhhhhhcccceecCCceEeccccceeeeEeeccchheeecCC Q lcl|NC_017674. 317 -----VLFVEDVNAAVDGSTDGGSVFSQLVQSKFITLGVEKRAKSYVEDFSNGTAGALCKRPWAVVRYLGI 382 (382) Q Consensus 317 -----~~~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~~~~~~~~~~~t~Gv~i~~P~aia~~~GI 382 (382) +.+..++. ++.++.. .+.......-..+-... .-.....+..|. |+.+++|.||+++.+- T Consensus 249 ~s~~~~~~~~~~~--i~~~~~~--~~~~~~~~~~~~~~~~~-~n~v~~r~~~r~-d~~v~~~~a~~~l~~a 313 (324) T protein:vir:96 249 FDKLIYGIPQLIE--YKIDETA--QLSTVKNEDGTPVNLFE-QDMVALRATMHV-ALHIADDKAFAKLVPA 313 (324) T ss_pred cceEEEEEecCcE--EEEeecc--cccccccccccchhhhh-cCcEEEEEEEEe-ccEEecccceEEEecc Confidence 11111110 0000000 00000000000000000 001223444455 5567779999999988 No 40 >protein:vir:99749 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004307;genbank:gi:122891761;genbank:GeneID:4712304 Probab=97.68 E-value=9.8e-06 Score=48.05 Aligned_cols=301 Identities=8% Similarity=-0.072 Sum_probs=146.8 Q ss_pred CCCcceeeeecCccccccccccccchHHHHHHhhcceeccccchhhhhhhhcccccchhhhhhcccccCcccccchhHHH Q lcl|NC_017674. 1 MSQISKTHSRLAGRNAKPFDLKNITNDAVASLSRIGLVFDHAVVQDQIKALAKAGAFRSGSAMDSNFTAPVTTPSIPTPI 80 (382) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~amDa~~~~~~t~~~~~~~~ 80 (382) |+|. .+.. .+++++- ..+.... ..+|+-.......+.-+|- T Consensus 1 ~~k~--------------------~~~~-~~~~~~~------------~~~~~~~------~~~a~~~~~~~~~~~lip~ 41 (324) T protein:vir:99 1 MEQT--------------------QKLK-LNLQHFA------------SNNVKPQ------VFNPDNVMMHEKKDGTLLN 41 (324) T ss_pred CCCc--------------------hHhh-HHHHHHH------------HHhhhhh------hccccceeccCCCcceech Confidence 3331 1000 1122211 1111110 1122111111111222454 Q ss_pred HHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEEeeecccceeecccccCCceeeeeeeeeEeeEEEEEEEEEe Q lcl|NC_017674. 81 QFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEPAGTAVEYGDHTNIPLTSWNANFERRTIVRGELGMMV 160 (382) Q Consensus 81 ~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e~~G~a~~ygd~~DiP~vd~~~~~~~~~v~~~~~g~~y 160 (382) .+.+ +|++.+......+.++.+...+. .+..|++.+..+.+.+.+.+..+|..+.......-..+.++..+.+ T Consensus 42 ~~~~----~ii~~~~~~s~l~~~~~~~~~~~---~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~i 114 (324) T protein:vir:99 42 DFTT----PILQEVMENSKIMRLGKYEPMEG---TEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPV 114 (324) T ss_pred hHHH----HHHHHHHhhchhhhhcceeeccC---CceEEEEEecCcceeEeccCccccccccceeEEEEeeEEEEEeehh Confidence 4444 45555444444555555443332 4567788877788889999899999999999999999999999999 Q ss_pred cHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCcceeccCCCCccccCHHHHHHHHH Q lcl|NC_017674. 161 GTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAFQTPPSQGWSTADWAGIIGDIR 240 (382) Q Consensus 161 ~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~~~~a~~~Wa~kT~~eI~~Di~ 240 (382) |.+-++.+ ..++.+.-.....+++.+.+++.+++|+..++. ..|+++........++ ...-++||. T Consensus 115 S~ell~ds---~~~l~~~i~~~l~~ai~~~~d~~~l~G~g~~~~--~~~~~~~~~~~~~~~~---------~~~~~~~i~ 180 (324) T protein:vir:99 115 TKEFLNYT---YSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPF--GKSIAQSIEKTNKVIK---------GDFTQDNII 180 (324) T ss_pred hHHHHhcc---hHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCcc--Cccccccccccceecc---------ccCCHHHHH Confidence 98655543 357888888888888888999999999754322 2455554332111111 111256777 Q ss_pred HHHHHHHHhcCCeeeeccccceEecCHHHHhhcccc-CCCCccHHHHHHHhcC---ccEEEEccccccccCCCCCceeEE Q lcl|NC_017674. 241 EAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSVT-TPYGISVSDWIEQTYP---KMRIVSAPELSGVQMKAQEPEDAL 316 (382) Q Consensus 241 ~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~t-~~~~~Tvl~~l~~n~p---nl~i~~~peL~~a~g~g~~~~~~~ 316 (382) +++..|...- . .+..+++.|+.+..|.+. +..|..++. -.... ++.++..+.. ..+.+. ++ T Consensus 181 ~~~~~l~~~~---~----~~~~~v~n~~~~~~L~~l~d~~g~~~~~--~~~~~~l~G~PVv~~~~~-----~~~~~~-~i 245 (324) T protein:vir:99 181 DLEALLEDDE---L----EANAFISKTQNRSLLRKIVDPETKERIY--DRNSDTLDGLPVVNLKSS-----NLKRGE-LI 245 (324) T ss_pred HHHHhhhhcc---C----CCCEEEEcHHHHHHHHHhhcCCCceeec--CCCCccccceeEEeecCC-----CCCcce-EE Confidence 7777664421 1 244789999999888642 333332211 01111 1112222111 112221 12 Q ss_pred EEcchhhhhhhccccccchhhhhhhhhhhhcccce--------ecCCceEeccccceeeeEeeccchheeecCC Q lcl|NC_017674. 317 VLFVEDVNAAVDGSTDGGSVFSQLVQSKFITLGVE--------KRAKSYVEDFSNGTAGALCKRPWAVVRYLGI 382 (382) Q Consensus 317 ~~~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~~--------~~~~~~~~~~~~~t~Gv~i~~P~aia~~~GI 382 (382) +-.-.++. .+- .+...+ +............ ...-.....+..|. |+.+.+|.||+.+.|. T Consensus 246 ~gd~~~~~---~~~-~~~~~i-~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~-d~~v~~~~a~~~lt~a 313 (324) T protein:vir:99 246 TGDFDKLI---YGI-PQLIEY-KIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHV-ALHIADDKAFAKLVPA 313 (324) T ss_pred EEecccEE---EEE-ecCcEE-EEeecccccccccccccchhhhhcCcEEEEEEEEE-ccEEecccceEEEEec Confidence 11111110 000 000000 0000000000000 01112334455566 4455579999999999 No 41 >protein:vir:9759 Length: 303 # NCBI annotation: putative structural protein # Family: family:all:966 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795521;genbank:gi:28876283;genbank:GeneID:1257824 Probab=97.67 E-value=4.2e-06 Score=50.05 Aligned_cols=282 Identities=8% Similarity=-0.054 Sum_probs=143.4 Q ss_pred hhhcccccCcccccchhHHHHHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEEeeecccceeecccccCCcee Q lcl|NC_017674. 61 SAMDSNFTAPVTTPSIPTPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEPAGTAVEYGDHTNIPLT 140 (382) Q Consensus 61 ~amDa~~~~~~t~~~~~~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e~~G~a~~ygd~~DiP~v 140 (382) ||-+ ++.+.-+|..+.+ +|++.+......+.+.++..... .+..+++....+.|.+.+.+..+|.. T Consensus 1 m~t~-------t~gg~liP~~~~~----~ii~~l~~~s~i~~l~~~~~~~~---~~~~ip~~~~~~~a~wv~E~~~~~~s 66 (303) T protein:vir:97 1 MGTE-------TSKASLFDKHLVS----DLINKVKGHSSLAKLSSQKPIPF---NGSKEFTFTLDSDIDVVAENGKKTHG 66 (303) T ss_pred Cccc-------CCCCeEcchhHHH----HHHHHHHhhchhhhhcceeecCC---CceEEEEEecCcceEEeecCcccccc Confidence 2211 1122225555544 55555555555556555443322 45677888888888999998899999 Q ss_pred eeeeeeeEeeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCC-CCcce Q lcl|NC_017674. 141 SWNANFERRTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDP-NLPAF 219 (382) Q Consensus 141 d~~~~~~~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P-~l~~~ 219 (382) +...+...-+.+.++..+.+|.+=++.......++.+.-.....+++.+.+|+-.++|+..+ .+..+..... ++... T Consensus 67 ~~~f~~v~l~~~kl~~~~~iS~ell~~~~d~~~~l~~~i~~~la~a~~~~ld~a~l~G~~~~--~g~~~~~~~~~~~~~~ 144 (303) T protein:vir:97 67 GLSLEPVTIVPIKVEYGARLSDEFLYATEEEKIDILKAFNEGFAKKLARGIDLMAMHGINPR--TKKASDVIGTNHFDSK 144 (303) T ss_pred ccceeeEEeeeEEEEEeehhhHHHhhcCccchHHHHHHHHHHHHHHHHHHHHhhhhcccccC--Cccccccccccccccc Confidence 99999999999999999999885343333456788888999999999999999999996422 1211111110 11100 Q ss_pred eccCCCCccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhccc-cCCCCccHHH-HHHHh-----cC Q lcl|NC_017674. 220 QTPPSQGWSTADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSV-TTPYGISVSD-WIEQT-----YP 292 (382) Q Consensus 220 ~~~a~~~Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~-t~~~~~Tvl~-~l~~n-----~p 292 (382) .+..=...+.+..++||.+++..+...- . .+..++|.|+.+..|.. .+..|.-++. -+... .- T Consensus 145 ---~~~~~~~~~~~~~~~~i~~~~~~~~~~~---~----~~~~~vmn~~~~~~L~~lkd~~g~~~~~~~~~~~~~~~~l~ 214 (303) T protein:vir:97 145 ---VTQVVKFTESEDADANIEAAVNLIQGAE---G----VVTGLAMDTEFSTALAKVTNGEMGPKMYPELAWGANPDSIN 214 (303) T ss_pred ---cccccccccccchHHHHHHHHHHHhhcC---C----CccEEEEcHHHHHHHHHhhccCCCeEEecCccCCCCCceec Confidence 0000011223345788888888765421 1 24568999988887753 2333322210 00000 00 Q ss_pred ccEEEEccccccccCCCCCceeEEEE--cchhhhhhhccccccchhhhhhhhhhhhcc---------cceecCCceEecc Q lcl|NC_017674. 293 KMRIVSAPELSGVQMKAQEPEDALVL--FVEDVNAAVDGSTDGGSVFSQLVQSKFITL---------GVEKRAKSYVEDF 361 (382) Q Consensus 293 nl~i~~~peL~~a~g~g~~~~~~~~~--~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l---------~~~~~~~~~~~~~ 361 (382) ++.++.-..+.+-...+.. ...+++ |..-+. -+- .+.+....... -.+.. ..-+.+ T Consensus 215 G~Pv~~s~~v~~~~~~~~~-~~~~~~Gdf~~~~~---~~~-------~~~~~~~~~~~~~~d~~~~~~~~~n--~~~~r~ 281 (303) T protein:vir:97 215 GLKSSVNTTVGAGADEAES-KDLVIIGDFESMFK---WGY-------AKQIPMEIIKYGDPDNSGKDLKGYN--QIYLRA 281 (303) T ss_pred ceeeEEecccCCccccCCC-ccEEEEeeccccEE---EEE-------ecCcEEEEeeccCCCCcchhhhhcC--cEEEEE Confidence 1222221111111111111 111111 100000 000 00000000000 00000 111233 Q ss_pred ccceeeeEeeccchheeecCC Q lcl|NC_017674. 362 SNGTAGALCKRPWAVVRYLGI 382 (382) Q Consensus 362 ~~~t~Gv~i~~P~aia~~~GI 382 (382) +.|+ |..|++|.||+++.-. T Consensus 282 ~~r~-~~~v~~p~af~~l~~~ 301 (303) T protein:vir:97 282 EAYI-GWGILDAKSFARVTKG 301 (303) T ss_pred EEEe-ccEeecccceEEeeCC Confidence 4444 5567789999999998 No 42 >protein:vir:78223 Length: 333 # NCBI annotation: Putative major head protein # Family: family:all:966 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491666;genbank:gi:157786490;genbank:GeneID:5625701 Probab=97.61 E-value=6.5e-06 Score=49.04 Aligned_cols=301 Identities=10% Similarity=-0.014 Sum_probs=147.9 Q ss_pred hHHHHHHhhcceeccccchhhhhhhhcccccchhhhhhcccccCcccc-cchhHHHHHHhhhhhhheeccccccchhhhC Q lcl|NC_017674. 26 NDAVASLSRIGLVFDHAVVQDQIKALAKAGAFRSGSAMDSNFTAPVTT-PSIPTPIQFLQTWLPGFVKVMTAARKIDEII 104 (382) Q Consensus 26 ~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~amDa~~~~~~t~-~~~~~~~~~l~~idp~v~~~~~~~~~~~~l~ 104 (382) +..+++|+.- .+=++..+..+. ++.-+|-.+.+ +|++.+...-..+.+. T Consensus 1 ~a~l~el~~~--------------------------~~~~~~~g~~~~~~~~liP~~~~~----~ii~~l~~~s~l~~~~ 50 (333) T protein:vir:78 1 MATLNELLPN--------------------------SAGSNHQGRLAHVPSDLLPKEIVG----PIFDKAQESSLVLRMG 50 (333) T ss_pred CchhHHhhhh--------------------------cccccccCceecCCccccchhHHH----HHHHHHHhhchhhhhc Confidence 1222222211 000111111111 11124554444 4555555554455555 Q ss_pred ccccCCCcceeeEEEEeeeccc--------ceeecccccCCceeeeeeeeeEeeEEEEEEEEEecHHHHHHHHHhCCChH Q lcl|NC_017674. 105 GIDTVGSWEDQEIVQGIVEPAG--------TAVEYGDHTNIPLTSWNANFERRTIVRGELGMMVGTLEEGRASAIRLNSA 176 (382) Q Consensus 105 ~v~t~g~~~~~t~t~~v~e~~G--------~a~~ygd~~DiP~vd~~~~~~~~~v~~~~~g~~y~~~El~~A~~~g~~l~ 176 (382) .+...+. ....+++..... .+...++...+|..+....+..-..+.++....++.+=++ ....++. T Consensus 51 ~~~~~~~---~~~~~p~~~~~~~a~~v~eg~~~~~~e~~~~~~~~~~f~~i~l~~~kl~~~~~is~ell~---~s~~~~~ 124 (333) T protein:vir:78 51 EQIPISY---GETIIPTTVKRPEVGQVGVGTSNEQREGGLKPLSGTAWDTRSVSPIKLATIVTVSEEFAR---MNPSGLY 124 (333) T ss_pred ceeeccC---CceEEEEEeCCceeEeecCcccccccccccccccccceeEEEEeeEEEEEeehhhHHHHh---cCHHHHH Confidence 5443322 334455554443 3334456667899999999999999999999999884443 3345788 Q ss_pred HHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCcceeccCCCCccccCHHHHHHHHHHHHHHHHHhcCCeeee Q lcl|NC_017674. 177 ETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAFQTPPSQGWSTADWAGIIGDIREAVRQLRIQSQDQIDP 256 (382) Q Consensus 177 ~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~~~~a~~~Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~ 256 (382) +.-+....+++.+.+++-+++|+.++...+.-|++|...+...+. ......+.+..++||.+++..+..... . T Consensus 125 ~~i~~~la~ai~~~~d~~~l~G~g~~~~~~~~g~~~~~~~~~~~~---~~~~~~~~~~~~~~i~~~~~~~~~~~~--~-- 197 (333) T protein:vir:78 125 TKLQGDLAYAIGRGIDLAVFHGKSPLTGSALQGIDTDNVIANTTN---VDYLQETGDPLLDRLLDGYDLVSANTD--V-- 197 (333) T ss_pred HHHHHHHHHHHHHHHHHHHhcccCCCCCccccccccccccccccc---ccccccccchhHHHHHHHHHhhccccc--c-- Confidence 888888999999999999999986655566778887766543221 122223344457788888776544322 1 Q ss_pred ccccceEecCHHHHhhccc----cCCCCccHHHHHHHhc-----CccEEEEccccccccCCCCCce-eEEEEcchhhhhh Q lcl|NC_017674. 257 KAEKITLALATSKVDYLSV----TTPYGISVSDWIEQTY-----PKMRIVSAPELSGVQMKAQEPE-DALVLFVEDVNAA 326 (382) Q Consensus 257 ~~~p~~L~Lp~~~~~~Ls~----t~~~~~Tvl~~l~~n~-----pnl~i~~~peL~~a~g~g~~~~-~~~~~~~~~v~~~ 326 (382) .+..++|.|..+..|.+ .+..|.-++......- -++.++....+..-.+.+.+.. .+++-+-.++. T Consensus 198 --~~~~~vmn~~~~~~L~~~~~~~d~~G~~i~~~~~~~~~~~~l~G~Pv~~~~~i~~~~~~~~~~~~~~~~gD~~~~~-- 273 (333) T protein:vir:78 198 --EFNGWAVDPRFRAHLLRAQAYRDANGNVDPSRINLAAQTGDVLGLPAQFGRAVGGDLGAAVDSKTRIIGGDFSQLK-- 273 (333) T ss_pred --CceEEEEcchHHHHHHHHhhhcCCCCceeecCccccCCCceeeceeeEEccccCCCccccCCCccEEEEEecccEE-- Confidence 23468888887766632 2333444433222111 1222332222221111111111 11111111110 Q ss_pred hccccccchhhhhhhhhhhhccc-ce---------ecCCceEeccccceeeeEeeccchheeecCC Q lcl|NC_017674. 327 VDGSTDGGSVFSQLVQSKFITLG-VE---------KRAKSYVEDFSNGTAGALCKRPWAVVRYLGI 382 (382) Q Consensus 327 ~~~~~~~~~~~~~~~p~~~~~l~-~~---------~~~~~~~~~~~~~t~Gv~i~~P~aia~~~GI 382 (382) .+- .+.+........ .. ...-...+-++.|. |+.|+.|.||+++.+- T Consensus 274 -~g~-------~~~~~i~~~~~~~~~~~~~~~~~~~~~~~v~~r~~~r~-d~~v~~~~a~~~l~~~ 330 (333) T protein:vir:78 274 -FGF-------ADEIRIKMSDTATLTDSGSATVSMWQTNQIAILIEVTF-GWLLGDKQAFVKFVDD 330 (333) T ss_pred -EEE-------eeccEEEEeccccccccccceeehhhcCcEEEEEEEEE-ccEEecccceEEEecc Confidence 000 000000000000 00 00001112344444 5566999999999988 No 43 >protein:vir:4226 Length: 326 # NCBI annotation: observed 35.2Kd protein # Family: family:all:507 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039681;swissprot:sw:q05223;genbank:gi:9625447;uniprot:Q05223;genbank:GeneID:2942929 Probab=97.57 E-value=7.3e-06 Score=48.74 Aligned_cols=306 Identities=9% Similarity=-0.051 Sum_probs=144.7 Q ss_pred eeccccchhhhhhhhcccccchhhhhhcccccCcccccchh-HHHHHHhhhhhhheeccccccchhhhCccccCCCccee Q lcl|NC_017674. 37 LVFDHAVVQDQIKALAKAGAFRSGSAMDSNFTAPVTTPSIP-TPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQ 115 (382) Q Consensus 37 ~~~~~~~~~~~~~~~~~~~~~~~~~amDa~~~~~~t~~~~~-~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~ 115 (382) +++...-..+.... ....++ ...++.+.+ +|..+.+ ++++.+....-.+.+..+...+. . T Consensus 1 ~~~~~~r~~~~~~~-------~e~~a~-----~~~~~~~g~~ip~~~~~----~ii~~~~~~s~i~~~~~~~~~~~---~ 61 (326) T protein:vir:42 1 MAVNPDRTTPFLGV-------NDPKVA-----QTGDSMFEGYLEPEQAQ----DYFAEAEKISIVQQFAQKIPMGT---T 61 (326) T ss_pred CCCCccchhhhcCc-------chhhhe-----eccccCCcceechhhHH----HHHHHHHhcchhhhhcceeeccC---C Confidence 22111100000000 000000 001111222 4443333 56666666655566555544332 4 Q ss_pred eEEEEeeecccceeecccccCCceeeeeeeeeEeeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEE Q lcl|NC_017674. 116 EIVQGIVEPAGTAVEYGDHTNIPLTSWNANFERRTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIG 195 (382) Q Consensus 116 t~t~~v~e~~G~a~~ygd~~DiP~vd~~~~~~~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~ 195 (382) ...|++.+..+.+.+.+.+..+|..+...++..-..+.++..+.+|.+=++. ...++.+.-.....+++...+++-. T Consensus 62 ~~~~p~~~~~~~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~v~iS~ell~~---s~~~~~~~i~~~l~~a~~~~~d~a~ 138 (326) T protein:vir:42 62 GQKIPHWTGDVSASWIGEGDMKPITKGNMTSQTIAPHKIATIFVASAETVRA---NPANYLGTMRTKVATAFAMAFDNAA 138 (326) T ss_pred ceEEEEEeCCcceEEecCCccccccccceeEEEEeeEEEEEeehhhHHHHhc---CHHHHHHHHHHHHHHHHHHHHHHHh Confidence 4667788877888888999999999999999999999999999999855443 3467888888888999999999999 Q ss_pred EEeeccCCcccceEEEeCCCCcceec-cCCCCccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhcc Q lcl|NC_017674. 196 FYGWQSGLGNRTYGFLNDPNLPAFQT-PPSQGWSTADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLS 274 (382) Q Consensus 196 ~~Gd~~g~~~g~~GllN~P~l~~~~~-~a~~~Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls 274 (382) ++|+..+ .-.|++|.+....... ..+..++..+..++. +..++..+...- . ....++|.+..+..|. T Consensus 139 l~G~gs~---~p~gi~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~----~---~~a~~v~n~~~~~~L~ 206 (326) T protein:vir:42 139 INGTDSP---FPTFLAQTTKEVSLVDPDGTGSNADLTVYDAV--AVNALSLLVNAG----K---KWTHTLLDDITEPILN 206 (326) T ss_pred hcccCCC---ccccccccccccceeecccccccccchhHHHH--HHHHHhhhhhhc----c---CccEEEEeHHHHHHHH Confidence 9998643 2357777664322211 111222223333221 222222221111 1 1236789999888885 Q ss_pred c-cCCCCccHHHHHHHh-----cCccEEEEccccc-cccCCCCC------ceeEEEEcchhhhhhhccccccchhhhhhh Q lcl|NC_017674. 275 V-TTPYGISVSDWIEQT-----YPKMRIVSAPELS-GVQMKAQE------PEDALVLFVEDVNAAVDGSTDGGSVFSQLV 341 (382) Q Consensus 275 ~-t~~~~~Tvl~~l~~n-----~pnl~i~~~peL~-~a~g~g~~------~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~ 341 (382) + .+..|.-++.--..+ ++..++...|=.- ..-..++. -...++-...++. ++-+++..... .- T Consensus 207 ~lkd~~G~~l~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~Gd~s~~~~~~~~~~~--v~~~~e~~~~~--~~ 282 (326) T protein:vir:42 207 GAKDKSGRPLFIESTYTEENSPFRLGRIVARPTILSDHVASGTVVGYQGDFRQLVWGQVGGLS--FDVTDQATLNL--GT 282 (326) T ss_pred HhhccCCceeeccccccCccccccCceeeeeeEEEcCCCCCCceEEEEeecceEEEEEecceE--EEEeecceeee--cc Confidence 4 333343332110011 1112222222211 00001110 0001110111110 00000000000 00 Q ss_pred hhhhhcccceecCCceEeccccceeeeEeeccchheeecCC Q lcl|NC_017674. 342 QSKFITLGVEKRAKSYVEDFSNGTAGALCKRPWAVVRYLGI 382 (382) Q Consensus 342 p~~~~~l~~~~~~~~~~~~~~~~t~Gv~i~~P~aia~~~GI 382 (382) +..-..... ...-.....+..+. ++.+.+|.||+++.++ T Consensus 283 ~~~~~~~~~-~~~d~~~~r~~~~~-d~~v~~~~a~~~l~~~ 321 (326) T protein:vir:42 283 PQAPNFVSL-WQHNLVAVRVEAEY-AFHCNDKDAFVKLTNV 321 (326) T ss_pred cccccchhh-hhcCcEEEEEEEEe-ccEEecccceEEEeec Confidence 000000000 00112334455555 5677999999999999 No 44 >protein:vir:94673 Length: 419 # NCBI annotation: major capsid protein # Family: family:all:585 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579208;genbank:gi:93007444;genbank:GeneID:5076792 Probab=97.43 E-value=1.8e-05 Score=46.64 Aligned_cols=328 Identities=11% Similarity=0.043 Sum_probs=148.5 Q ss_pred CCCc----------------ceeeeecCccccccccccccchHHHHHHhhcceeccccchhhhhhhhcccccchhhhhhc Q lcl|NC_017674. 1 MSQI----------------SKTHSRLAGRNAKPFDLKNITNDAVASLSRIGLVFDHAVVQDQIKALAKAGAFRSGSAMD 64 (382) Q Consensus 1 ~~~~----------------~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~amD 64 (382) +.++ ...-..-.....+.+.........+..+... .. .+..+....... ......++ T Consensus 50 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~-~~~~~~~~~~~~----~~~~~~~~ 122 (419) T protein:vir:94 50 AARAALLRTAPPAPKGPADGGTPLTPAEAGTFRSLAQRFADSDGLREYRAR--DK-RGQFQVEMRDID----PNRLLSRD 122 (419) T ss_pred HHHHHHHHHHHHHHHHHhhhhccccccccccccchhhhhhhHHHHHHHHHh--hh-hhhhhHHHHHHH----HHHhhccc Confidence 0000 0000000000011111111111111111110 00 000000000000 01111222 Q ss_pred ccccCcccccch-hHHHHHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEE--------eeecccceeeccccc Q lcl|NC_017674. 65 SNFTAPVTTPSI-PTPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQG--------IVEPAGTAVEYGDHT 135 (382) Q Consensus 65 a~~~~~~t~~~~-~~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~--------v~e~~G~a~~ygd~~ 135 (382) .. .+..++++. -+|..... .+......+...+.++.+..... ..+.|. +....+.+.+.+.+. T Consensus 123 ~~-~~~~~~~~~~~~p~~~~~----~i~~~~~~~~~i~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~ 194 (419) T protein:vir:94 123 AP-AGTITNPNVPHLPQLVPG----IVPTTPDLPLLVADLLDQQNADY---NVLEYIRDTSGTAGAGSTWNKAAVVPEGT 194 (419) T ss_pred cc-cccccCCcccccchhhhH----HHHHHHhhhhhhhhcceeeeccC---CceeeeeeccccccccccCcccceecCCc Confidence 21 122222221 12222222 22222233333444444333222 222222 233344567788888 Q ss_pred CCceeeeeeeeeEeeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCC Q lcl|NC_017674. 136 NIPLTSWNANFERRTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPN 215 (382) Q Consensus 136 DiP~vd~~~~~~~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~ 215 (382) ..|..+....+.....+.++....+|.+=++-+. ++.+.-.....+++...+|+.+++|+.. ....|++|.++ T Consensus 195 ~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~~----~l~~~i~~~la~a~~~~~d~aii~G~G~---~~p~Gi~~~~~ 267 (419) T protein:vir:94 195 AKPQSTLSFDTITTTLKTVAHWLPITRQAADDNS----QLMGYIQGRLTYGLRFLRDRQLLNGNGS---TEMQGILTTPG 267 (419) T ss_pred cccccccceeeEEeeeeeEEEeehhhHHHHHhHH----HHHHHHHHHHHHHHHHHHHHHHHhccCc---ccccceecccc Confidence 8999999999999999999999999986555332 4777777778888888899999999853 35689999999 Q ss_pred CcceeccCCCCccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhcccc-CCCCccH-HH-HHHHh-- Q lcl|NC_017674. 216 LPAFQTPPSQGWSTADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSVT-TPYGISV-SD-WIEQT-- 290 (382) Q Consensus 216 l~~~~~~a~~~Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~t-~~~~~Tv-l~-~l~~n-- 290 (382) +....+ ...+...|....++||.+++..+...-. .+..++|.++.+..|... +..+..+ +. .+... T Consensus 268 ~~~~~~--~~~~~~~t~~~~~~~l~~~~~~~~~~~~-------~~~~~v~n~~~~~~l~~~k~~~~~~~~~~~~~~~~~~ 338 (419) T protein:vir:94 268 IGTYQQ--PKPTAPATDEPPLVDIRRAKTVAEIAGF-------PPDGVVVHPQDWESIELDQAPGSGVFRVIANVQGEAT 338 (419) T ss_pred cccccc--cccccccccchhHHHHHHHHHhhhhccC-------CCCEEEEcHHHHHHHHHHhhcCCCceeecCCcccCCC Confidence 765443 2345567777889999999988764321 234689999988888532 2212111 10 00000 Q ss_pred --cCccEEEEccccccccCCCCCceeEEEEcchhhhhhhccccccchhhhhhhhhhhhcccceec------CCceEeccc Q lcl|NC_017674. 291 --YPKMRIVSAPELSGVQMKAQEPEDALVLFVEDVNAAVDGSTDGGSVFSQLVQSKFITLGVEKR------AKSYVEDFS 362 (382) Q Consensus 291 --~pnl~i~~~peL~~a~g~g~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~~~~------~~~~~~~~~ 362 (382) .-++.++....+. .+..++-.+...+. ..+ +..+.. -+..+ .-....-+. T Consensus 339 ~~l~G~pV~~~~~~~-------~~~~~~gd~~~~~~-~~~-------------~~~~~v-~~~~~~~~~~~~~~~~~r~~ 396 (419) T protein:vir:94 339 PRIWGLNVVSTVAIA-------QGTALVGGFRQGAT-LWS-------------RQGITV-LMTDSHADFFTANTLVILAE 396 (419) T ss_pred ccccceeeEEcCCCC-------CccEEEeeccceEE-EEE-------------ecceEE-EEeccccchhhcCcEEEEEE Confidence 0011222221111 11111111111000 000 000000 00000 111223444 Q ss_pred cceeeeEeeccchheeecCC Q lcl|NC_017674. 363 NGTAGALCKRPWAVVRYLGI 382 (382) Q Consensus 363 ~~t~Gv~i~~P~aia~~~GI 382 (382) .|.+ +.++.|.||+++..- T Consensus 397 ~r~d-~~v~~~~a~~~~~~~ 415 (419) T protein:vir:94 397 FRAN-LAVYQPKAFVRVTFA 415 (419) T ss_pred Eeec-cEEeccccEEEEEec Confidence 5555 455779999998877 No 45 >protein:vir:108211 Length: 318 # NCBI annotation: gp9 # Family: family:all:6420 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552338;genbank:gi:160700658;genbank:GeneID:5758931 Probab=97.30 E-value=7.7e-06 Score=48.62 Aligned_cols=269 Identities=15% Similarity=0.177 Sum_probs=134.7 Q ss_pred hcccccC--cccccchhHHHHHHh---hhhhhheeccccccchhhhCccccCCCcceeeEEEEeeecc---cceeecccc Q lcl|NC_017674. 63 MDSNFTA--PVTTPSIPTPIQFLQ---TWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEPA---GTAVEYGDH 134 (382) Q Consensus 63 mDa~~~~--~~t~~~~~~~~~~l~---~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e~~---G~a~~ygd~ 134 (382) |-+- .+ ++.+.+.-....++. .|..++.+.+-..+-++.||- +.+.-....+.|.-..+. |.+.-.+.+ T Consensus 1 ~~~~-~~i~s~~~~~~itv~~ll~~P~~I~~~i~e~~~~~~iad~lf~--~~~a~~~~~v~f~~~~p~~~~~d~e~VaEg 77 (318) T protein:vir:10 1 MTAP-TGIVSVSDGPAITVRELVGNPLWIPTALKKMMVNQFISESLFR--NGGANPNGVVAYNEGNPSFLEDDVADVAEF 77 (318) T ss_pred CCCC-CcceeeecCCceehHHhhCCchhHHHHHHHHHhccchhhhhhh--cccccccceeEEEecccccccCcHhhccCc Confidence 1111 00 011111111222222 122222222212222333442 222222344545443333 666666667 Q ss_pred cCCceeeeeeeeeEe-eEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeC Q lcl|NC_017674. 135 TNIPLTSWNANFERR-TIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLND 213 (382) Q Consensus 135 ~DiP~vd~~~~~~~~-~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~ 213 (382) ..+|+++..-.+.+. .+.-++.++++|.+.+ ...+++...+....++++...+.|+.+ |..|.+ T Consensus 78 gEiP~~~~~~G~~~ia~~~K~G~~~~vS~Em~---~~n~~~~v~r~~~~l~Nti~r~~d~~a------------~dal~s 142 (318) T protein:vir:10 78 GEIPVSAGARGLPRTAFAVKKALGVRVSKEMI---DENRVGAVNDQMLQLRNTFIRANDRSA------------KALLQS 142 (318) T ss_pred ccccccCCCCCchhhhhhehhccceeccHHHH---hhcChhHHHHHHHHHHHHHHHHHHHHH------------HHHHhc Confidence 789999987766555 5579999999998654 445678888888888888888877553 345666 Q ss_pred CCCcceeccCCCCccccCHHHHHHHHHHHHHHH--------------HHhcCCeeeeccccceEecCHHHHhhccccCCC Q lcl|NC_017674. 214 PNLPAFQTPPSQGWSTADWAGIIGDIREAVRQL--------------RIQSQDQIDPKAEKITLALATSKVDYLSVTTPY 279 (382) Q Consensus 214 P~l~~~~~~a~~~Wa~kT~~eI~~Di~~~~~~l--------------~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~t~~~ 279 (382) ++++. .++++.|.... ....|+-.+...+ ....-| + .|++|+|.|..+..|.+-. T Consensus 143 a~t~~--~~~s~~w~~~~--~~~~d~~~A~e~v~~a~~~~~~a~~~~~~~~~G-Y----~pdtIVlhP~~~~~l~~n~-- 211 (318) T protein:vir:10 143 PIVPT--LAVPTAWDNGG--KVRTDIAIAIEQISTAAPTAYPAGVGSSDEYFG-F----IPDTIVMHYALLPILMDNE-- 211 (318) T ss_pred ccccc--ccCCcCCCCcc--cccccchhhhhhhhhhhhhhhhhhhhhhhhccC-c----cceeeEECHHHHHHHhcch-- Confidence 66553 34455665311 1112322222211 112222 2 4679999999999996431 Q ss_pred CccHHHHHH-------------HhcC----ccEEEEccccccccCCCCCceeEEEEcchhhhhhhccccccchhhhhhhh Q lcl|NC_017674. 280 GISVSDWIE-------------QTYP----KMRIVSAPELSGVQMKAQEPEDALVLFVEDVNAAVDGSTDGGSVFSQLVQ 342 (382) Q Consensus 280 ~~Tvl~~l~-------------~n~p----nl~i~~~peL~~a~g~g~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~p 342 (382) .+.+++. -+|| +++++.-|-+.. + .+|+...... |... -..| T Consensus 212 --~~~~~y~~~a~~~~~~~~~tg~~~g~~lGl~vi~s~~~p~-------~--~alvlq~g~v--------G~~~--d~~p 270 (318) T protein:vir:10 212 --NFMKVYERNANYVSTAPDWTGNFPGSVMGLNVIRSRTFPI-------D--RVLIMERGTV--------GFYS--DTRP 270 (318) T ss_pred --hhhhhhhccchhhhhcccccccccceeeceEEeecCccCC-------C--eeEEEecCCc--------ceee--cccc Confidence 1222221 1233 256666555531 1 1333322222 2111 1123 Q ss_pred hhhhcccce------ecCCceEeccccceeeeEeeccchheeecCC Q lcl|NC_017674. 343 SKFITLGVE------KRAKSYVEDFSNGTAGALCKRPWAVVRYLGI 382 (382) Q Consensus 343 ~~~~~l~~~------~~~~~~~~~~~~~t~Gv~i~~P~aia~~~GI 382 (382) ..+..+-+| -.+.+|...++..+ ...|..|+|+..++|| T Consensus 271 l~~t~~~~egg~~~g~~~~s~~~~~~~~~-~~~V~~PkA~~~itgi 315 (318) T protein:vir:10 271 LQFTALYPEGNGPNGGPTESYRADASHKR-ALAVDQPKAALWLTGI 315 (318) T ss_pred ceeeecccCCCCCCCCcchhhheehheee-eeeeeCcceeEEEeec Confidence 222222222 14556777666554 5678999999999999 No 46 >protein:vir:2344 Length: 397 # NCBI annotation: gp14 # Family: family:all:507 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075281;genbank:gi:12657868;genbank:GeneID:920118 Probab=97.23 E-value=1.6e-05 Score=46.83 Aligned_cols=290 Identities=12% Similarity=0.011 Sum_probs=140.2 Q ss_pred eeccccchhhhhhhhcccccchhhhhhcccccCcccccchh-HHHHHHhhhhhhheeccccccchhhhCccccCCCccee Q lcl|NC_017674. 37 LVFDHAVVQDQIKALAKAGAFRSGSAMDSNFTAPVTTPSIP-TPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQ 115 (382) Q Consensus 37 ~~~~~~~~~~~~~~~~~~~~~~~~~amDa~~~~~~t~~~~~-~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~ 115 (382) ++|+.. ... +... .++...+ ++-.+.+ ++++.+......+.+..+.+... . T Consensus 1 ~g~~~e---------------~~~-~~~~-----~t~~~~g~l~~~~~~----~ii~~l~~~s~i~~l~~~~~~~~---~ 52 (397) T protein:vir:23 1 MGFSAD---------------HSQ-IAQT-----KDTMFTGYLDPVQAK----DYFAEAEKTSIVQRVAQKIPMGA---T 52 (397) T ss_pred CCcCHH---------------HHH-Hhhc-----cCCCCccccchhHHH----HHHHHHHhccchhhhcceeeccC---C Confidence 222111 000 1111 1111222 3333333 44555444555555555544332 3 Q ss_pred eEEEEeeecccceeecccccCCceeeeeeeeeEeeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEE Q lcl|NC_017674. 116 EIVQGIVEPAGTAVEYGDHTNIPLTSWNANFERRTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIG 195 (382) Q Consensus 116 t~t~~v~e~~G~a~~ygd~~DiP~vd~~~~~~~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~ 195 (382) +..|++.+....+.+.++...+|..+....+....++.++..+.++.+=++.+ ..++.+.-+...++++.+.+|+.+ T Consensus 53 ~~~ip~~~~~~~a~wv~Eg~~~~~s~~~f~~v~l~~~k~~~~v~iS~ell~ds---~~~l~~~i~~~l~~aia~~~d~a~ 129 (397) T protein:vir:23 53 GIVIPHWTGDVSAQWIGEGDMKPITKGNMTKRDVHPAKIATIFVASAETVRAN---PANYLGTMRTKVATAIAMAFDNAA 129 (397) T ss_pred ceEEEEEcCCcceEEecCCccccccccceeEEEEeeEEEEEeehhhHHHHhcc---hHHHHHHHHHHHHHHHHHHHHHHH Confidence 46788888788888899999999999999999999999999999998655433 477899999999999999999999 Q ss_pred EEeeccCCcccceEEEeCCCCcceeccCCCCccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhccc Q lcl|NC_017674. 196 FYGWQSGLGNRTYGFLNDPNLPAFQTPPSQGWSTADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSV 275 (382) Q Consensus 196 ~~Gd~~g~~~g~~GllN~P~l~~~~~~a~~~Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~ 275 (382) ++|+..+ .+.-|+++..+..... ++ ..+ .+|+.+++..+...- . .+..++|.+..+..|.+ T Consensus 130 l~G~gt~--~~~~~~~~~~~~~~~~--~~----~~~----~~~~~~~~~~l~~~~----~---~~a~~vmn~~~~~~L~~ 190 (397) T protein:vir:23 130 LHGTNAP--SAFQGYLDQSNKTQSI--SP----NAY----QGLGVSGLTKLVTDG----K---KWTHTLLDDTVEPVLNG 190 (397) T ss_pred hhcccCC--cccccccccccceeee--cc----cch----hHHHHHHHHhhhhcc----c---CCCEEEEcHHHHHHHHH Confidence 9998542 3445655544322111 11 112 233444444433221 1 23478999998888864 Q ss_pred c-CCCCccHHHHHHHh-cC----ccEEEEccccccccCC-CCC------ceeEEEEcchhhhhhhccccccchhhhhhhh Q lcl|NC_017674. 276 T-TPYGISVSDWIEQT-YP----KMRIVSAPELSGVQMK-AQE------PEDALVLFVEDVNAAVDGSTDGGSVFSQLVQ 342 (382) Q Consensus 276 t-~~~~~Tvl~~l~~n-~p----nl~i~~~peL~~a~g~-g~~------~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~p 342 (382) . +..|..++.=-..+ .| .-++...|-.-..... +.. -...++....++. ++.++. ..+..... T Consensus 191 lkd~~G~~i~~~~~~~~~~~~~~~~tl~G~Pv~~s~~~~~g~~~~~~gDfs~~~i~~~~~i~--i~~~~e--~~~~~~~~ 266 (397) T protein:vir:23 191 SVDANGRPLFVESTYESLTTPFREGRILGRPTILSDHVAEGDVVGYAGDFSQIIWGQVGGLS--FDVTDQ--ATLNLGSQ 266 (397) T ss_pred hhccCCceeecccccccccccccCceeeeeeEEEeCCCCCCceEEEEeecceEEEEEEeceE--EEEeee--eeeeeccc Confidence 3 33344332211111 11 1123333322111000 000 0001111111100 000000 00000000 Q ss_pred hhhhcccceecCCceEeccccceeeeEeeccchheeecCC Q lcl|NC_017674. 343 SKFITLGVEKRAKSYVEDFSNGTAGALCKRPWAVVRYLGI 382 (382) Q Consensus 343 ~~~~~l~~~~~~~~~~~~~~~~t~Gv~i~~P~aia~~~GI 382 (382) .--..+-.-. .-.....+..| .++.+++|.+|++..+- T Consensus 267 ~~~~~~~lf~-~d~v~~ra~~r-~d~~v~~~~a~~~~~~~ 304 (397) T protein:vir:23 267 ESPNFVSLWQ-HNLVAVRVEAE-YGLLINDVNAFVKLTFD 304 (397) T ss_pred cccceeeeee-ccceeEEEEee-eccceecccceEEEeec Confidence 0000000000 00011222233 35688899999999886 No 47 >protein:vir:105038 Length: 428 # NCBI annotation: major capsid head protein precursor # Family: family:all:21 # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006586;genbank:gi:46402092;genbank:GeneID:2777903 Probab=97.12 E-value=0.00015 Score=41.62 Aligned_cols=352 Identities=11% Similarity=0.069 Sum_probs=143.2 Q ss_pred CCCcce------------------eeeecCcccc-ccccccccchHHHHHHhhcceeccccchhhhhhhhccc-ccchhh Q lcl|NC_017674. 1 MSQISK------------------THSRLAGRNA-KPFDLKNITNDAVASLSRIGLVFDHAVVQDQIKALAKA-GAFRSG 60 (382) Q Consensus 1 ~~~~~~------------------~~~~~~~~~~-~~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~-~~~~~~ 60 (382) ++++.. .++...++.. ...+.+......+....+. +..-.+..++........ ...... T Consensus 46 ~~~l~~~i~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~ 124 (428) T protein:vir:10 46 FTDISAKMDRMEATERAAALVAKPVKATQHGPAVIVKAEPKQYTGAGMTRMVMS-IAAAQGNLQDAAKFASDELNDQSVS 124 (428) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhhhhchhhccccccccccchhhhHHHHHHHHH-HHHhhhhHHHHHHHhhhhhhhhhHh Confidence 111100 0000000000 0001111111111110000 000000000110000000 000000 Q ss_pred hhhcccccCcccccchhHHHHHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEEeeecccceeecccccCCcee Q lcl|NC_017674. 61 SAMDSNFTAPVTTPSIPTPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEPAGTAVEYGDHTNIPLT 140 (382) Q Consensus 61 ~amDa~~~~~~t~~~~~~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e~~G~a~~ygd~~DiP~v 140 (382) +++.. ..+.++.-+|-.+. ++|++.+....-.+.+.. +..+.....+.+++....+.+.+.+.+..+|.. T Consensus 125 ~~~~~----~~~~gg~liP~~~~----~~ii~~l~~~~~l~~~~~--~~~~~~~g~~~~p~~~~~~~a~~v~Eg~~~~~~ 194 (428) T protein:vir:10 125 MAIST----AAGSGGVLIPQNIH----SEVIELLRDRTIVRKLGA--RSIPLPNGNMSLPRLAGGATASYTGENQDAKVS 194 (428) T ss_pred hhhcc----cccCCccccchhHH----HHHHHHHhhhchhhhhcc--eeeecCCcceEEEEEeCCcceeeeccCcccccc Confidence 11110 01111122444333 355555443333333311 111111233567777766777788888899999 Q ss_pred eeeeeeeEeeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCccee Q lcl|NC_017674. 141 SWNANFERRTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAFQ 220 (382) Q Consensus 141 d~~~~~~~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~~ 220 (382) +...+...-..+.++..+.+|.+=+..+ ..++.+--......++...+|+.+++|+..+ +...|++|........ T Consensus 195 ~~~f~~i~~~~~k~~~~v~is~ell~ds---~~~l~~~i~~~l~~ai~~~~d~~~l~G~G~~--~~p~Gi~~~~~~~~~~ 269 (428) T protein:vir:10 195 EARFDDVKLTAKTMIAMVPISNALIGRA---GFNVEQLVLQDILTAISVREDKAFMRDDGTG--DTPIGMKARATQWNRL 269 (428) T ss_pred ccceeeEEeeeEEEEEeehhhHHHHhhh---hHHHHHHHHHHHHHHHHHHHHHHHhccCCCC--cccccccccccccccc Confidence 9998899999999999999998655433 4568888888888899999999999997532 3457999976543221 Q ss_pred ccCCCCccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhccc-cCCCCccHHHHHHH-hcCccEEEE Q lcl|NC_017674. 221 TPPSQGWSTADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSV-TTPYGISVSDWIEQ-TYPKMRIVS 298 (382) Q Consensus 221 ~~a~~~Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~-t~~~~~Tvl~~l~~-n~pnl~i~~ 298 (382) ... ..-+..+.+.+-..++.+.. ...... ........+|.+..+..|.. .+..|.-++.-... .+-++.++. T Consensus 270 ~~~-~~~~~~~~~~~~~~~~~~~~-~~~~~~----~~~~~~~~v~n~~~~~~L~~lkd~~G~~i~~~~~~g~l~G~pv~~ 343 (428) T protein:vir:10 270 LPW-AADAAVNLDTIDTYLDSIIL-MSMDGN----SNMISSGWGMSNRTYMKLFGLRDGNGNKVYPEMAQGMLKGYPIQR 343 (428) T ss_pred ccc-cccccccHHHHHHHHHHHHH-hhhccc----cccccCEEEEcHHHHHHHHHhhccCCceeccCCCCCeeeceeeEE Confidence 111 11122333333222222221 111111 11122367889988888864 34444544321100 011222222 Q ss_pred ccccccccCCCCCceeEEEEcchhhhhhhccccccchhhhhhhhhh-hh------cccceecCCceEeccccceeeeEee Q lcl|NC_017674. 299 APELSGVQMKAQEPEDALVLFVEDVNAAVDGSTDGGSVFSQLVQSK-FI------TLGVEKRAKSYVEDFSNGTAGALCK 371 (382) Q Consensus 299 ~peL~~a~g~g~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~p~~-~~------~l~~~~~~~~~~~~~~~~t~Gv~i~ 371 (382) ..-+-.-.+.+.+...+++-...++ +.+. .+...+ +.-+.. +. ....+.. ...+.+..| -|+.++ T Consensus 344 ~~~~p~~~~~~~~~~~i~~gd~s~~---~i~~-~~~i~i-~~~~~~~~~~~~~~~~~~f~~~--~~~~R~~~r-~d~~v~ 415 (428) T protein:vir:10 344 TSAIPANLGEGGKESEIYFADFNDV---VIGE-DGNMKV-DFSKEASYIDTDGKLVSAFSRN--QSLIRVVTE-HDIGFR 415 (428) T ss_pred eccccccccCCCccceEEEEecceE---EEEE-ecceEE-Eeecccccccccccccchhhcc--hhheeeeee-eCceee Confidence 2111110111111111221111110 0000 000000 000000 00 0000000 011123333 377889 Q ss_pred ccchheeecCC Q lcl|NC_017674. 372 RPWAVVRYLGI 382 (382) Q Consensus 372 ~P~aia~~~GI 382 (382) +|.||+.++|| T Consensus 416 ~p~a~~~~t~~ 426 (428) T protein:vir:10 416 HPEGLVLGTGV 426 (428) T ss_pred ccceEEEEecc Confidence 99999999999 No 48 >protein:vir:10364 Length: 390 # NCBI annotation: head protein; major capsid subunit precursor # Family: family:all:585 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858956;genbank:gi:32128421;genbank:GeneID:2648357 Probab=96.78 E-value=0.00013 Score=41.87 Aligned_cols=323 Identities=12% Similarity=0.014 Sum_probs=148.4 Q ss_pred CCCcceeeeecCccccccccccccchHHHHHHhhcceeccccchhhhhhhhcccccchhhhhhcccccCcccccchh-HH Q lcl|NC_017674. 1 MSQISKTHSRLAGRNAKPFDLKNITNDAVASLSRIGLVFDHAVVQDQIKALAKAGAFRSGSAMDSNFTAPVTTPSIP-TP 79 (382) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~amDa~~~~~~t~~~~~-~~ 79 (382) +.++.+. ..-.....+...-.......+......+..-.... . .....+.++.... .+..+.+ +| T Consensus 61 ~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~------~~~~~~~~~~~~~-~~~~~g~~~~ 126 (390) T protein:vir:10 61 VAELEGN-GAGGDVQHVSVGDLFVASEQFQASAGRWNDRSARA------T------MNIKAALNTASTD-AAGSAGALTT 126 (390) T ss_pred HHHHHhh-cccccccccchhhhhhhhHHHHHHHHhhhhhhhhh------h------hHHHHHHHhhhcc-cccccccccc Confidence 1110000 00000000000000001111221111111100000 0 0001112222122 1222222 33 Q ss_pred HHHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEEeeec-ccceeecccccCCceeeeeeeeeEeeEEEEEEEE Q lcl|NC_017674. 80 IQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEP-AGTAVEYGDHTNIPLTSWNANFERRTIVRGELGM 158 (382) Q Consensus 80 ~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e~-~G~a~~ygd~~DiP~vd~~~~~~~~~v~~~~~g~ 158 (382) -.+. +++++.+......+.++.+.+.+. ..+.|...+. .+.+.+.+.+..+|..+.........++.++..+ T Consensus 127 ~~~~----~~ii~~~~~~~~l~~~~~~~~~~~---~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~ 199 (390) T protein:vir:10 127 PNRL----PGFITQPDARLTVRDLIGSGRTDS---ALIEYVQETGFVNNAAIVAEGALKPESSLKFAKKTDTTHVIAHTM 199 (390) T ss_pred hhHH----HHHHHHHHhhchhhhhcceeeccC---CceEEEEEecCCcceeeecCCccccccccceeEEEEeeEEEEEee Confidence 3222 467777777667777777665543 3456666554 4677788888899999999999999999999999 Q ss_pred EecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCcceeccCCCCccccCHHHHHHH Q lcl|NC_017674. 159 MVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAFQTPPSQGWSTADWAGIIGD 238 (382) Q Consensus 159 ~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~~~~a~~~Wa~kT~~eI~~D 238 (382) .+|.+ +..-. .++.+.-....++++...+|+-+++|+.. .....|++|.++....... .+....++| T Consensus 200 ~is~e-ll~d~---~~l~~~i~~~l~~~~~~~~~~~il~G~G~--~~~p~Gi~~~~~~~~~~~~-------~~~~~~~~~ 266 (390) T protein:vir:10 200 KATRQ-ILSDA---PQLASYMNNRLIRGLKVKEDAEILRGTGA--NDGLLGLIPQATTYAAPTT-------IAGATRVDQ 266 (390) T ss_pred hhhHH-HHHhH---HHHHHHHHHHHHHHHHHHHHHHHhhcCCC--Ccccccccccccccccccc-------ccccchHHH Confidence 99985 43322 26888888888889999999999999743 3456899998875433221 112223566 Q ss_pred HHHHHHHHHHhcCCeeeeccccceEecCHHHHhhccc-cCCCCccHHHHHHHhcC----ccEEEEccccccccCCCCCce Q lcl|NC_017674. 239 IREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSV-TTPYGISVSDWIEQTYP----KMRIVSAPELSGVQMKAQEPE 313 (382) Q Consensus 239 i~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~-t~~~~~Tvl~~l~~n~p----nl~i~~~peL~~a~g~g~~~~ 313 (382) +.+++..+...-. .+..++|.|+.+..|.+ .+..|.-++.--...-+ ++.++..+.+. .+. T Consensus 267 ~~~~~~~l~~~~~-------~~~~~v~n~~~~~~L~~lkd~~g~~l~~~~~~~~~~~l~G~pv~~~~~~p-------~~~ 332 (390) T protein:vir:10 267 LRLAMLQASLAEY-------PASGIVINPIDWAAIELAKDANNQYLIGNARGTLTPTLWGLPVVATQAMA-------PGE 332 (390) T ss_pred HHHHHHhhccccC-------CCCEEEEcHHHHHHHHHhhcCCCceeecCCcCcCCceecceeeEEcCCCC-------CCc Confidence 7777766643221 23468999998888864 23334333221111101 11222221111 011 Q ss_pred eEEEEcchhhhhhhccccccchhhhhhhhhhhhcccceecCCceEeccccceeeeEeeccchheeecCC Q lcl|NC_017674. 314 DALVLFVEDVNAAVDGSTDGGSVFSQLVQSKFITLGVEKRAKSYVEDFSNGTAGALCKRPWAVVRYLGI 382 (382) Q Consensus 314 ~~~~~~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~~~~~~~~~~~t~Gv~i~~P~aia~~~GI 382 (382) +++-.-.+.....+. +-+...+.........-....-+..|. |+.+++|.||+..+== T Consensus 333 -~~~gdf~~~~~~~~~---------~~~~i~~~~~~~~~~~~~~~~r~~~r~-d~~v~~~~a~~~~~~a 390 (390) T protein:vir:10 333 -FLVGAFDLAAQIFDQ---------WDARVEIGYVNDDFQRNMVTVLAEERL-ALVVYRPEALISGSFA 390 (390) T ss_pred -EEEEeccceEEEEEe---------cceEEEEeecccccccCcEEEEEEEee-ccEEeccccEEEEEeC Confidence 111110000000000 000000000000000111222333444 4577888888654311 No 49 >protein:vir:100135 Length: 418 # NCBI annotation: gp5 # Family: family:all:585 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945035;genbank:gi:38707895;genbank:GeneID:2744182 Probab=96.76 E-value=6.3e-05 Score=43.63 Aligned_cols=321 Identities=12% Similarity=0.039 Sum_probs=149.6 Q ss_pred CCCcceeeeec----CccccccccccccchHHHHHHhhcceeccccchhhhhhhhcccccchhhhhhcccccCcccccch Q lcl|NC_017674. 1 MSQISKTHSRL----AGRNAKPFDLKNITNDAVASLSRIGLVFDHAVVQDQIKALAKAGAFRSGSAMDSNFTAPVTTPSI 76 (382) Q Consensus 1 ~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~amDa~~~~~~t~~~~ 76 (382) +..+.+.-... .....++..-.......++.+....-. . .+...... ...........++++. T Consensus 77 ~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~---~~~~~~~~-------~~~~~~~~~~~~~~~~ 143 (418) T protein:vir:10 77 LLEAEQKLARGGGSAELETPKTLGQLVTESEEMKGMDGSARK---S---VRVRVDRK-------SIMNVPATVGSGVSGS 143 (418) T ss_pred HHHHHHHHhhcccccccchhhhhhHHhhhHHHHHHHHHHHhh---h---hhhhhHHH-------HHHHhhhhccCCCCCC Confidence 00000000000 000000000000111112222111100 0 00000000 0000000111222233 Q ss_pred h--HHHHHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEEeeec-ccceeecccccCCceeeeeeeeeEeeEEE Q lcl|NC_017674. 77 P--TPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEP-AGTAVEYGDHTNIPLTSWNANFERRTIVR 153 (382) Q Consensus 77 ~--~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e~-~G~a~~ygd~~DiP~vd~~~~~~~~~v~~ 153 (382) + +|..+.+ +|++.+......++++++...+. .++.+..... .+.+.+.+.+..+|..+...+...-..+. T Consensus 144 g~lvp~~~~~----~ii~~~~~~~~l~~~~~~~~~~~---~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~f~~v~~~~~k 216 (418) T protein:vir:10 144 NSLVVADRQA----GIIAPPQRKMTIRDLLMPGQTSS---SSIEYTVETGFTNNAAAVAEGAQKPTSDLKFNLKNQPVRT 216 (418) T ss_pred ccccchhHHH----HHHHHHhhhhhHHhhcceeeccC---CceeEEEEecCCCceeeeccCccccccccceeeEEEeeee Confidence 3 4444443 67777777777777776655443 3455666544 35566778888999999999999999999 Q ss_pred EEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCcceeccCCCCccccCHH Q lcl|NC_017674. 154 GELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAFQTPPSQGWSTADWA 233 (382) Q Consensus 154 ~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~~~~a~~~Wa~kT~~ 233 (382) ++..+.+|.+=+..+ .++.+--.....+++...+|+-+++|+..+ ....|++|..+....... ..++ T Consensus 217 ~~~~~~is~ell~ds----~~l~~~i~~~l~~a~~~~~d~a~l~G~g~~--~~p~Gi~~~~~~~~~~~~-~~~~------ 283 (418) T protein:vir:10 217 IAHLFKASRQILDDA----PALQSYIDGRARYGLQLTEEGQILKGDGTG--ANILGILPQASAFMPSIT-LANA------ 283 (418) T ss_pred EEEeehhhHHHHHhH----HHHHHHHHHHHHHHHHHHHHHHHhccCCCC--cccccccccccccccccc-cccc------ Confidence 999999998644332 268888888888899999999999997542 336799998775433221 1111 Q ss_pred HHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhcccc-CCCCccHHHHHHHh----cCccEEEEccccccccCC Q lcl|NC_017674. 234 GIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSVT-TPYGISVSDWIEQT----YPKMRIVSAPELSGVQMK 308 (382) Q Consensus 234 eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~t-~~~~~Tvl~~l~~n----~pnl~i~~~peL~~a~g~ 308 (382) .-++||.+++..+...- ..+..++|.|..+..|... +..|.-++.=.... +-++.++..+.+. T Consensus 284 ~~~~~i~~~~~~~~~~~-------~~~~~~v~n~~~~~~L~~lkd~~G~~i~~~~~~~~~~~l~G~pV~~~~~~p----- 351 (418) T protein:vir:10 284 TPIDKIRLALLQAVLAE-------FPATGIVLNPIDWASIELTKDSQGRYIVGNPVNGTTPRLWNLPVVETQAMT----- 351 (418) T ss_pred ccHHHHHHHHHhhcccc-------CCCCEEEEcHHHHHHHHHhhcCCCceeccccccCCCceecceeeEEcCCCC----- Confidence 12566666666553221 1234689999999888542 33344333211110 0011222221111 Q ss_pred CCCceeEEEE-cchhhhhhhccccccchhhhhhhhhhhhcccce---ecCCceEeccccceeeeEeeccchheeecCC Q lcl|NC_017674. 309 AQEPEDALVL-FVEDVNAAVDGSTDGGSVFSQLVQSKFITLGVE---KRAKSYVEDFSNGTAGALCKRPWAVVRYLGI 382 (382) Q Consensus 309 g~~~~~~~~~-~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~~---~~~~~~~~~~~~~t~Gv~i~~P~aia~~~GI 382 (382) .+ .+++- +...+. ..+. +-+. +...+-. ...-....-+..+.+| .++.|.||++.+.. T Consensus 352 --~~-~~~~gd~s~~~~-~~~~---------~~~~--i~~~~~~~~~f~~~~~~~r~~~~~d~-~~~~~~a~~~~~~~ 413 (418) T protein:vir:10 352 --AN-EFLVGAFSMAAQ-IFDR---------MEIE--VLLSTENVDDFEKNMVSIRAEERLAL-AVYRPESFVTGALV 413 (418) T ss_pred --CC-cEEEeeccceEE-EEEe---------cceE--EEEecccchhhhcCceEEEEEEeecc-EEecccceEEEEec Confidence 00 11111 100000 0000 0000 0000000 0011122234445544 68899999999988 No 50 >protein:vir:1886 Length: 385 # NCBI annotation: major capsid subunit precursor # Family: family:all:585 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037666;genbank:gi:9634124;genbank:GeneID:1262513 Probab=96.74 E-value=3.3e-05 Score=45.13 Aligned_cols=318 Identities=11% Similarity=-0.014 Sum_probs=152.9 Q ss_pred CCCcc----------------------eeeeecC----------ccccccccc----cccchHHHHHHhhcceeccccch Q lcl|NC_017674. 1 MSQIS----------------------KTHSRLA----------GRNAKPFDL----KNITNDAVASLSRIGLVFDHAVV 44 (382) Q Consensus 1 ~~~~~----------------------~~~~~~~----------~~~~~~~~~----~~~~~~~~~~l~~~g~~~~~~~~ 44 (382) |.++. +..-.+. -+..+.-.. ........+++.+..-. T Consensus 18 ~~~l~~~~~~e~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------ 91 (385) T protein:vir:18 18 MTQLFDAQKAEIESTGQVSKQLQSDLMKVQEELTKSGTRLFDLEQKLASGAENPGEKKSFSERAAEELIKSWDG------ 91 (385) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccchhhhhHHHHHHHHHHHHHH------ Confidence 00000 0000000 000000000 00000011111111000 Q ss_pred hhhhhhhcccccchhhhhhcccccCcccccchhHHHHHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEEeeec Q lcl|NC_017674. 45 QDQIKALAKAGAFRSGSAMDSNFTAPVTTPSIPTPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEP 124 (382) Q Consensus 45 ~~~~~~~~~~~~~~~~~amDa~~~~~~t~~~~~~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e~ 124 (382) ............+|.+. .+.++.-+|- .+.+.|++.+......+.++++...+. ..+.|.+.+. T Consensus 92 -----~~~~~~~~~~~~~~~~~----~~~~g~~i~~----~~~~~ii~~~~~~~~l~~~~~~~~~~~---~~~~~~~~~~ 155 (385) T protein:vir:18 92 -----KQGTFGAKTFNKSLGSD----ADSAGSLIQP----MQIPGIIMPGLRRLTIRDLLAQGRTSS---NALEYVREEV 155 (385) T ss_pred -----hhccchhhHHHhhhccc----cccCCceecc----hhhhHHHHHhhhccchhhhcceecccC---cceEEEEEec Confidence 00000000011122211 1111111332 234567777777777788777755443 3466777664 Q ss_pred -ccceeecccccCCceeeeeeeeeEeeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCC Q lcl|NC_017674. 125 -AGTAVEYGDHTNIPLTSWNANFERRTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGL 203 (382) Q Consensus 125 -~G~a~~ygd~~DiP~vd~~~~~~~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~ 203 (382) .+.+.+.+.+..+|..+....+.....+.++..+.++.+ +..-. .++.+.-....++++...+|+-.++|+.. T Consensus 156 ~~~~a~~v~E~~~~~~~~~~~~~~~~~~~k~~~~~~is~e-ll~d~---~~l~~~i~~~la~a~~~~~d~~~l~G~g~-- 229 (385) T protein:vir:18 156 FTNNADVVAEKALKPESDITFSKQTANVKTIAHWVQASRQ-VMDDA---PMLQSYINNRLMYGLALKEEGQLLNGDGT-- 229 (385) T ss_pred CCcceeeeccCccccccccceeEEEEeeeeEEEeehhhHH-HHhhH---HHHHHHHHHHHHHHHHHHHHHHHHhccCC-- Confidence 456677888889999999999999999999999999974 43322 25777777888888889999999999754 Q ss_pred cccceEEEeCCCCcceeccCCCCccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhccc-cCCCCcc Q lcl|NC_017674. 204 GNRTYGFLNDPNLPAFQTPPSQGWSTADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSV-TTPYGIS 282 (382) Q Consensus 204 ~~g~~GllN~P~l~~~~~~a~~~Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~-t~~~~~T 282 (382) .....|+++.++....... .+.+..++||.+++.++...-. .+..++|+|..+..|.. .+..|.- T Consensus 230 ~~~~~Gi~~~~~~~~~~~~-------~~~~~~~d~i~~~~~~l~~~~~-------~~~~~~~~~~~~~~l~~lkd~~G~~ 295 (385) T protein:vir:18 230 GDNLEGLNKVATAYDTSLN-------ATGDTRADIIAHAIYQVTESEF-------SASGIVLNPRDWHNIALLKDNEGRY 295 (385) T ss_pred CCccccccccccccccccc-------ccccchHHHHHHHHHhhccccC-------CCCEEEEcHHHHHHHHHhhcCCCce Confidence 3456799998765432211 1222346778778776643221 24578999999988854 2444443 Q ss_pred HHHHHHHhcC----ccEEEEccccccccCCCCCceeEEEEcchhhhhhhccccccchhhhhhhhhhhhcccc-e----ec Q lcl|NC_017674. 283 VSDWIEQTYP----KMRIVSAPELSGVQMKAQEPEDALVLFVEDVNAAVDGSTDGGSVFSQLVQSKFITLGV-E----KR 353 (382) Q Consensus 283 vl~~l~~n~p----nl~i~~~peL~~a~g~g~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~-~----~~ 353 (382) ++.-....-+ ++.++..+.+. .+. +++..-.+.. ....+........ + .. T Consensus 296 l~~~~~~~~~~~l~G~pV~~~~~~p-------~~~-~~~gd~~~~~-------------~~~~~~~~~v~~~~~~~~~~~ 354 (385) T protein:vir:18 296 IFGGPQAFTSNIMWGLPVVPTKAQA-------AGT-FTVGGFDMAS-------------QVWDRMDATVEVSREDRDNFV 354 (385) T ss_pred eccCcccCCCceecceeeEEcCcCC-------CCc-EEEeecccEE-------------EEEEecceEEEEeccccchhh Confidence 3321111111 12222222111 011 1111101000 0000000000000 0 00 Q ss_pred CCceEeccccceeeeEeeccchheeecCC Q lcl|NC_017674. 354 AKSYVEDFSNGTAGALCKRPWAVVRYLGI 382 (382) Q Consensus 354 ~~~~~~~~~~~t~Gv~i~~P~aia~~~GI 382 (382) .-.+..-+..|++ +.+++|.||+.++.- T Consensus 355 ~~~~~~~~~~r~~-~~v~~~~a~~~~~~~ 382 (385) T protein:vir:18 355 KNMLTILCEERLA-LAHYRPTAIIKGTFS 382 (385) T ss_pred cCcEEEEEEEeec-cEEecccceEEEEec Confidence 1122344555655 455889999999988 No 51 >protein:vir:191 Length: 385 # NCBI annotation: major head subunit precursor # Family: family:all:585 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037701;genbank:gi:9634158;genbank:GeneID:1262530 Probab=96.74 E-value=3.3e-05 Score=45.13 Aligned_cols=318 Identities=11% Similarity=-0.014 Sum_probs=152.9 Q ss_pred CCCcc----------------------eeeeecC----------ccccccccc----cccchHHHHHHhhcceeccccch Q lcl|NC_017674. 1 MSQIS----------------------KTHSRLA----------GRNAKPFDL----KNITNDAVASLSRIGLVFDHAVV 44 (382) Q Consensus 1 ~~~~~----------------------~~~~~~~----------~~~~~~~~~----~~~~~~~~~~l~~~g~~~~~~~~ 44 (382) |.++. +..-.+. -+..+.-.. ........+++.+..-. T Consensus 18 ~~~l~~~~~~e~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------ 91 (385) T protein:vir:19 18 MTQLFDAQKAEIESTGQVSKQLQSDLMKVQEELTKSGTRLFDLEQKLASGAENPGEKKSFSERAAEELIKSWDG------ 91 (385) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccchhhhhHHHHHHHHHHHHHH------ Confidence 00000 0000000 000000000 00000011111111000 Q ss_pred hhhhhhhcccccchhhhhhcccccCcccccchhHHHHHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEEeeec Q lcl|NC_017674. 45 QDQIKALAKAGAFRSGSAMDSNFTAPVTTPSIPTPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEP 124 (382) Q Consensus 45 ~~~~~~~~~~~~~~~~~amDa~~~~~~t~~~~~~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e~ 124 (382) ............+|.+. .+.++.-+|- .+.+.|++.+......+.++++...+. ..+.|.+.+. T Consensus 92 -----~~~~~~~~~~~~~~~~~----~~~~g~~i~~----~~~~~ii~~~~~~~~l~~~~~~~~~~~---~~~~~~~~~~ 155 (385) T protein:vir:19 92 -----KQGTFGAKTFNKSLGSD----ADSAGSLIQP----MQIPGIIMPGLRRLTIRDLLAQGRTSS---NALEYVREEV 155 (385) T ss_pred -----hhccchhhHHHhhhccc----cccCCceecc----hhhhHHHHHhhhccchhhhcceecccC---cceEEEEEec Confidence 00000000011122211 1111111332 234567777777777788777755443 3466777664 Q ss_pred -ccceeecccccCCceeeeeeeeeEeeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCC Q lcl|NC_017674. 125 -AGTAVEYGDHTNIPLTSWNANFERRTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGL 203 (382) Q Consensus 125 -~G~a~~ygd~~DiP~vd~~~~~~~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~ 203 (382) .+.+.+.+.+..+|..+....+.....+.++..+.++.+ +..-. .++.+.-....++++...+|+-.++|+.. T Consensus 156 ~~~~a~~v~E~~~~~~~~~~~~~~~~~~~k~~~~~~is~e-ll~d~---~~l~~~i~~~la~a~~~~~d~~~l~G~g~-- 229 (385) T protein:vir:19 156 FTNNADVVAEKALKPESDITFSKQTANVKTIAHWVQASRQ-VMDDA---PMLQSYINNRLMYGLALKEEGQLLNGDGT-- 229 (385) T ss_pred CCcceeeeccCccccccccceeEEEEeeeeEEEeehhhHH-HHhhH---HHHHHHHHHHHHHHHHHHHHHHHHhccCC-- Confidence 456677888889999999999999999999999999974 43322 25777777888888889999999999754 Q ss_pred cccceEEEeCCCCcceeccCCCCccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhccc-cCCCCcc Q lcl|NC_017674. 204 GNRTYGFLNDPNLPAFQTPPSQGWSTADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSV-TTPYGIS 282 (382) Q Consensus 204 ~~g~~GllN~P~l~~~~~~a~~~Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~-t~~~~~T 282 (382) .....|+++.++....... .+.+..++||.+++.++...-. .+..++|+|..+..|.. .+..|.- T Consensus 230 ~~~~~Gi~~~~~~~~~~~~-------~~~~~~~d~i~~~~~~l~~~~~-------~~~~~~~~~~~~~~l~~lkd~~G~~ 295 (385) T protein:vir:19 230 GDNLEGLNKVATAYDTSLN-------ATGDTRADIIAHAIYQVTESEF-------SASGIVLNPRDWHNIALLKDNEGRY 295 (385) T ss_pred CCccccccccccccccccc-------ccccchHHHHHHHHHhhccccC-------CCCEEEEcHHHHHHHHHhhcCCCce Confidence 3456799998765432211 1222346778778776643221 24578999999988854 2444443 Q ss_pred HHHHHHHhcC----ccEEEEccccccccCCCCCceeEEEEcchhhhhhhccccccchhhhhhhhhhhhcccc-e----ec Q lcl|NC_017674. 283 VSDWIEQTYP----KMRIVSAPELSGVQMKAQEPEDALVLFVEDVNAAVDGSTDGGSVFSQLVQSKFITLGV-E----KR 353 (382) Q Consensus 283 vl~~l~~n~p----nl~i~~~peL~~a~g~g~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~-~----~~ 353 (382) ++.-....-+ ++.++..+.+. .+. +++..-.+.. ....+........ + .. T Consensus 296 l~~~~~~~~~~~l~G~pV~~~~~~p-------~~~-~~~gd~~~~~-------------~~~~~~~~~v~~~~~~~~~~~ 354 (385) T protein:vir:19 296 IFGGPQAFTSNIMWGLPVVPTKAQA-------AGT-FTVGGFDMAS-------------QVWDRMDATVEVSREDRDNFV 354 (385) T ss_pred eccCcccCCCceecceeeEEcCcCC-------CCc-EEEeecccEE-------------EEEEecceEEEEeccccchhh Confidence 3321111111 12222222111 011 1111101000 0000000000000 0 00 Q ss_pred CCceEeccccceeeeEeeccchheeecCC Q lcl|NC_017674. 354 AKSYVEDFSNGTAGALCKRPWAVVRYLGI 382 (382) Q Consensus 354 ~~~~~~~~~~~t~Gv~i~~P~aia~~~GI 382 (382) .-.+..-+..|++ +.+++|.||+.++.- T Consensus 355 ~~~~~~~~~~r~~-~~v~~~~a~~~~~~~ 382 (385) T protein:vir:19 355 KNMLTILCEERLA-LAHYRPTAIIKGTFS 382 (385) T ss_pred cCcEEEEEEEeec-cEEecccceEEEEec Confidence 1122344555655 455889999999988 No 52 >protein:vir:101650 Length: 497 # NCBI annotation: gp13 # Family: family:all:585 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654768;genbank:gi:109302766;genbank:GeneID:4156084 Probab=96.71 E-value=7.4e-05 Score=43.23 Aligned_cols=341 Identities=11% Similarity=0.038 Sum_probs=150.0 Q ss_pred CCCcc---eeeeecCcccc--------ccccccc--cchHHHHHHhhcceec-----c--cc-chhhhhhhhcccccchh Q lcl|NC_017674. 1 MSQIS---KTHSRLAGRNA--------KPFDLKN--ITNDAVASLSRIGLVF-----D--HA-VVQDQIKALAKAGAFRS 59 (382) Q Consensus 1 ~~~~~---~~~~~~~~~~~--------~~~~~~~--~~~~~~~~l~~~g~~~-----~--~~-~~~~~~~~~~~~~~~~~ 59 (382) +.++. .....+..+.. ++..... .....+....+.+... . .+ ...+......+. .. T Consensus 67 ~a~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~ 143 (497) T protein:vir:10 67 DAAKDGLDNDIPEVEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADG---ET 143 (497) T ss_pred HHHHHHHHHHHHHHHhhhhhhHHHHHHHHHhhhHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhh---hh Confidence 00000 00000000000 0000000 0000000000000000 0 00 000000000000 00 Q ss_pred hhhhcccccCcccccchh--HHHHHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEEeeec-ccceeecccccC Q lcl|NC_017674. 60 GSAMDSNFTAPVTTPSIP--TPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEP-AGTAVEYGDHTN 136 (382) Q Consensus 60 ~~amDa~~~~~~t~~~~~--~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e~-~G~a~~ygd~~D 136 (382) ..+...... ..+++..| +|.. +.++|++.+......+.++++.+.+. ..+.|..... .+.+.+.+.+.. T Consensus 144 ~~~~~~~~~-~~~~~~gg~~vp~~----~~~~ii~~~~~~~~i~~l~~~~~~~~---~~~~~~~~~~~~~~a~wv~E~~~ 215 (497) T protein:vir:10 144 APAAIGQNP-FGSTGTFAPGILPT----FLPGIVEQLFYELSLADLISSRPVTS---PNLSYLTESAAHNNAAAVAEAGT 215 (497) T ss_pred hHHHHHhhh-cccCcccccccchh----hhHHHHHHHHhhhhHHhhccccccCC---CceEEEEEcCCCCcceeeccCcc Confidence 000000101 11122222 4433 44577888777777788877655544 3466665433 457788899999 Q ss_pred CceeeeeeeeeEeeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCC Q lcl|NC_017674. 137 IPLTSWNANFERRTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNL 216 (382) Q Consensus 137 iP~vd~~~~~~~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l 216 (382) +|..+...+......+.++..+.+|.+=|+-+ . .|.+--.....+++...+|+-.++|+.. ....|+||++.. T Consensus 216 ~~~s~~~f~~i~~~~~k~a~~~~iS~ell~d~--~--~l~~~i~~~l~~~i~~~~d~~~l~G~G~---~~p~Gil~~~~~ 288 (497) T protein:vir:10 216 YPFSSEEFARVYEQVGKVANALTITDEGLRDA--P--ELFNFVQGRLLEGIQRKEEVQLLAGGGY---PGVNGLLQRSTG 288 (497) T ss_pred cccccccceeeEeeeeeeEeecHhHHHHHHhH--H--HHHHHHHHHHHHHHHHHHHHHhhcCCCc---cccccccccccc Confidence 99999999999999999999988887533322 2 4788888888999999999999999743 346799999875 Q ss_pred cceeccCCCC-------------------c----------------------------cccCHHHHHHHHHHHHHHHHHh Q lcl|NC_017674. 217 PAFQTPPSQG-------------------W----------------------------STADWAGIIGDIREAVRQLRIQ 249 (382) Q Consensus 217 ~~~~~~a~~~-------------------W----------------------------a~kT~~eI~~Di~~~~~~l~~~ 249 (382) .+.....++. | ...|...++.++..++..+... T Consensus 289 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 368 (497) T protein:vir:10 289 FTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLT 368 (497) T ss_pred ccccccccchhhhhhhhhhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhh Confidence 4322111000 0 0012334445555555544443 Q ss_pred cCCeeeeccccceEecCHHHHhhcccc-CCCCccHHHHHH---------Hh--cCccEEEEccccccccCCCCCc-eeEE Q lcl|NC_017674. 250 SQDQIDPKAEKITLALATSKVDYLSVT-TPYGISVSDWIE---------QT--YPKMRIVSAPELSGVQMKAQEP-EDAL 316 (382) Q Consensus 250 t~g~~~~~~~p~~L~Lp~~~~~~Ls~t-~~~~~Tvl~~l~---------~n--~pnl~i~~~peL~~a~g~g~~~-~~~~ 316 (382) .. . .|..++|.+..+..|.+. +..|.-++.--. .. .-+..++..+.+. ++..--+. .... T Consensus 369 ~~--~----~~~~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~pV~~t~~~~-~~~~~~Gd~~~~~ 441 (497) T protein:vir:10 369 LF--Q----TPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIP-LGTILVGHFAPSV 441 (497) T ss_pred cc--c----CCCeEEEchHHHHHHHHhhcCCCceeccCcccccccccccCCceeeceeeEecCCCC-CCceEEeecccce Confidence 21 1 234677888777776532 333433321100 00 0011222222111 00000000 0001 Q ss_pred EEcchhhhhhhccccccchhhhhhhhhhhhcccceecCCceEeccccceeeeEeeccchheeecCC Q lcl|NC_017674. 317 VLFVEDVNAAVDGSTDGGSVFSQLVQSKFITLGVEKRAKSYVEDFSNGTAGALCKRPWAVVRYLGI 382 (382) Q Consensus 317 ~~~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~~~~~~~~~~~t~Gv~i~~P~aia~~~GI 382 (382) |...+.....++.+ +. .......-...+-++.|.+| .|++|.||+++.-. T Consensus 442 ~~i~~r~~~~v~~~-----------~~----~~~~f~~n~v~~r~~~r~~~-~v~~p~A~~~l~~~ 491 (497) T protein:vir:10 442 IQTARREGVTMQMT-----------NS----NGTDFVDGKVTVRAEERLGL-LVYRPSAFQLIQLK 491 (497) T ss_pred EEEEEecccEEEee-----------cc----cchhhhcCcEEEEEEEeecc-eeeccccEEEEEec Confidence 11111000000000 00 00001112334555666655 77899999999888 No 53 >protein:vir:7855 Length: 497 # NCBI annotation: gp12 # Family: family:all:585 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817462;genbank:gi:29565891;genbank:GeneID:1259081 Probab=96.71 E-value=7.4e-05 Score=43.23 Aligned_cols=341 Identities=11% Similarity=0.038 Sum_probs=150.0 Q ss_pred CCCcc---eeeeecCcccc--------ccccccc--cchHHHHHHhhcceec-----c--cc-chhhhhhhhcccccchh Q lcl|NC_017674. 1 MSQIS---KTHSRLAGRNA--------KPFDLKN--ITNDAVASLSRIGLVF-----D--HA-VVQDQIKALAKAGAFRS 59 (382) Q Consensus 1 ~~~~~---~~~~~~~~~~~--------~~~~~~~--~~~~~~~~l~~~g~~~-----~--~~-~~~~~~~~~~~~~~~~~ 59 (382) +.++. .....+..+.. ++..... .....+....+.+... . .+ ...+......+. .. T Consensus 67 ~a~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~ 143 (497) T protein:vir:78 67 DAAKDGLDNDIPEVEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADG---ET 143 (497) T ss_pred HHHHHHHHHHHHHHHhhhhhhHHHHHHHHHhhhHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhh---hh Confidence 00000 00000000000 0000000 0000000000000000 0 00 000000000000 00 Q ss_pred hhhhcccccCcccccchh--HHHHHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEEeeec-ccceeecccccC Q lcl|NC_017674. 60 GSAMDSNFTAPVTTPSIP--TPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEP-AGTAVEYGDHTN 136 (382) Q Consensus 60 ~~amDa~~~~~~t~~~~~--~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e~-~G~a~~ygd~~D 136 (382) ..+...... ..+++..| +|.. +.++|++.+......+.++++.+.+. ..+.|..... .+.+.+.+.+.. T Consensus 144 ~~~~~~~~~-~~~~~~gg~~vp~~----~~~~ii~~~~~~~~i~~l~~~~~~~~---~~~~~~~~~~~~~~a~wv~E~~~ 215 (497) T protein:vir:78 144 APAAIGQNP-FGSTGTFAPGILPT----FLPGIVEQLFYELSLADLISSRPVTS---PNLSYLTESAAHNNAAAVAEAGT 215 (497) T ss_pred hHHHHHhhh-cccCcccccccchh----hhHHHHHHHHhhhhHHhhccccccCC---CceEEEEEcCCCCcceeeccCcc Confidence 000000101 11122222 4433 44577888777777788877655544 3466665433 457788899999 Q ss_pred CceeeeeeeeeEeeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCC Q lcl|NC_017674. 137 IPLTSWNANFERRTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNL 216 (382) Q Consensus 137 iP~vd~~~~~~~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l 216 (382) +|..+...+......+.++..+.+|.+=|+-+ . .|.+--.....+++...+|+-.++|+.. ....|+||++.. T Consensus 216 ~~~s~~~f~~i~~~~~k~a~~~~iS~ell~d~--~--~l~~~i~~~l~~~i~~~~d~~~l~G~G~---~~p~Gil~~~~~ 288 (497) T protein:vir:78 216 YPFSSEEFARVYEQVGKVANALTITDEGLRDA--P--ELFNFVQGRLLEGIQRKEEVQLLAGGGY---PGVNGLLQRSTG 288 (497) T ss_pred cccccccceeeEeeeeeeEeecHhHHHHHHhH--H--HHHHHHHHHHHHHHHHHHHHHhhcCCCc---cccccccccccc Confidence 99999999999999999999988887533322 2 4788888888999999999999999743 346799999875 Q ss_pred cceeccCCCC-------------------c----------------------------cccCHHHHHHHHHHHHHHHHHh Q lcl|NC_017674. 217 PAFQTPPSQG-------------------W----------------------------STADWAGIIGDIREAVRQLRIQ 249 (382) Q Consensus 217 ~~~~~~a~~~-------------------W----------------------------a~kT~~eI~~Di~~~~~~l~~~ 249 (382) .+.....++. | ...|...++.++..++..+... T Consensus 289 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 368 (497) T protein:vir:78 289 FTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLT 368 (497) T ss_pred ccccccccchhhhhhhhhhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhh Confidence 4322111000 0 0012334445555555544443 Q ss_pred cCCeeeeccccceEecCHHHHhhcccc-CCCCccHHHHHH---------Hh--cCccEEEEccccccccCCCCCc-eeEE Q lcl|NC_017674. 250 SQDQIDPKAEKITLALATSKVDYLSVT-TPYGISVSDWIE---------QT--YPKMRIVSAPELSGVQMKAQEP-EDAL 316 (382) Q Consensus 250 t~g~~~~~~~p~~L~Lp~~~~~~Ls~t-~~~~~Tvl~~l~---------~n--~pnl~i~~~peL~~a~g~g~~~-~~~~ 316 (382) .. . .|..++|.+..+..|.+. +..|.-++.--. .. .-+..++..+.+. ++..--+. .... T Consensus 369 ~~--~----~~~~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~pV~~t~~~~-~~~~~~Gd~~~~~ 441 (497) T protein:vir:78 369 LF--Q----TPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIP-LGTILVGHFAPSV 441 (497) T ss_pred cc--c----CCCeEEEchHHHHHHHHhhcCCCceeccCcccccccccccCCceeeceeeEecCCCC-CCceEEeecccce Confidence 21 1 234677888777776532 333433321100 00 0011222222111 00000000 0001 Q ss_pred EEcchhhhhhhccccccchhhhhhhhhhhhcccceecCCceEeccccceeeeEeeccchheeecCC Q lcl|NC_017674. 317 VLFVEDVNAAVDGSTDGGSVFSQLVQSKFITLGVEKRAKSYVEDFSNGTAGALCKRPWAVVRYLGI 382 (382) Q Consensus 317 ~~~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~~~~~~~~~~~t~Gv~i~~P~aia~~~GI 382 (382) |...+.....++.+ +. .......-...+-++.|.+| .|++|.||+++.-. T Consensus 442 ~~i~~r~~~~v~~~-----------~~----~~~~f~~n~v~~r~~~r~~~-~v~~p~A~~~l~~~ 491 (497) T protein:vir:78 442 IQTARREGVTMQMT-----------NS----NGTDFVDGKVTVRAEERLGL-LVYRPSAFQLIQLK 491 (497) T ss_pred EEEEEecccEEEee-----------cc----cchhhhcCcEEEEEEEeecc-eeeccccEEEEEec Confidence 11111000000000 00 00001112334555666655 77899999999888 No 54 >protein:vir:4339 Length: 395 # NCBI annotation: major head protein # Family: family:all:585 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061502;genbank:gi:9635591;genbank:GeneID:1262860 Probab=96.57 E-value=0.00016 Score=41.36 Aligned_cols=325 Identities=13% Similarity=0.048 Sum_probs=151.0 Q ss_pred CCCcce---------------eeeecC----------ccccccccccccchHHHHHHhhcceeccccchhhhhhhhcccc Q lcl|NC_017674. 1 MSQISK---------------THSRLA----------GRNAKPFDLKNITNDAVASLSRIGLVFDHAVVQDQIKALAKAG 55 (382) Q Consensus 1 ~~~~~~---------------~~~~~~----------~~~~~~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~ 55 (382) +.+... ....+. .+...+-..+.............++.- . . T Consensus 36 ~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~------------~ 101 (395) T protein:vir:43 36 FGEMNKETRAKVDELLTAQGELQARLSAAEQAMLANEKRDGGEEAPKTAGQMVAESLKEQGVTS--S------------L 101 (395) T ss_pred HhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccchhhhHHHHHHHHHHHHHHHH--H------------h Confidence 000000 000000 000000000000000001111111100 0 0 Q ss_pred cchhhhhhcccccCcccccchh--HHHHHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEEeeec-ccceeecc Q lcl|NC_017674. 56 AFRSGSAMDSNFTAPVTTPSIP--TPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEP-AGTAVEYG 132 (382) Q Consensus 56 ~~~~~~amDa~~~~~~t~~~~~--~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e~-~G~a~~yg 132 (382) .......+...... .++.+.| +|-.+ .++|++.+........++++.+.+. ..+.|..... .+.+.+.| T Consensus 102 ~~~~~~~~~~~~~~-~~~~~~g~~vp~~~----~~~ii~~~~~~~~l~~l~~~~~~~~---~~~~~~~~~~~~~~a~~v~ 173 (395) T protein:vir:43 102 RGSHRVSMPRSAIT-SIDGSGGALVAPDR----RPGVVAAPQRRLTIRDLVAPGTTES---NSVEYVRETGFVNNAAPVS 173 (395) T ss_pred hhhhhhhhhhhhhc-ccCCCCccccchhh----HHHHHHHHHhhhhHHhhccceecCC---CceEEEEEecCCCceeeec Confidence 00000111111011 1122222 33333 3467777777777777777665543 3466666543 56777889 Q ss_pred cccCCceeeeeeeeeEeeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEe Q lcl|NC_017674. 133 DHTNIPLTSWNANFERRTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLN 212 (382) Q Consensus 133 d~~DiP~vd~~~~~~~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN 212 (382) .+...|..+.........++.++..+.++.+=++.+ . ++.+--....++++...+|.-.++|+.. .....|+++ T Consensus 174 E~~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~---~-~l~~~v~~~la~a~~~~~d~~~l~G~g~--~~~~~Gi~~ 247 (395) T protein:vir:43 174 EGTQKPYSDLTFELENAPVRTIAHLFKASRQILDDA---S-ALQSYIDARARYGLMLVEECQLLYGNGT--GANLHGIIP 247 (395) T ss_pred CCccccccccceeEEEEeeeeEEEeehhhHHHHHhH---H-HHHHHHHHHHHHHHHHHHHHHHHhccCC--CCccccccc Confidence 888999999999999999999999999997544322 2 5777777888888888888889999743 344579999 Q ss_pred CCCCcceeccCCCCccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhccc-cCCCCccHHHHHHHh- Q lcl|NC_017674. 213 DPNLPAFQTPPSQGWSTADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSV-TTPYGISVSDWIEQT- 290 (382) Q Consensus 213 ~P~l~~~~~~a~~~Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~-t~~~~~Tvl~~l~~n- 290 (382) .+++.+.... ...+.+..++||.+++..+...-. .+..++|.|..+..|.. .+..|.-++.-.... T Consensus 248 ~~~~~~~~~~-----~~~~~~~~~~~i~~~~~~~~~~~~-------~~~~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~ 315 (395) T protein:vir:43 248 QAQAYAPPSG-----VVVTAEQRIDRIRLAILQAQLAEF-------PASGIVLNPIDWALIELNKDAENRYIIGSPQNGT 315 (395) T ss_pred cccccccccc-----cccccchhHHHHHHHHHhhccccC-------CCcEEEEcHHHHHHHHHhhccCCceeccccccCC Confidence 8775432211 123455678888888877643321 23478999999888853 344444333211111 Q ss_pred ---cCccEEEEccccccccCCCCCceeEEEEcchhhhhhhccccccchhhhhhhhhhhhcccceecCCceEeccccceee Q lcl|NC_017674. 291 ---YPKMRIVSAPELSGVQMKAQEPEDALVLFVEDVNAAVDGSTDGGSVFSQLVQSKFITLGVEKRAKSYVEDFSNGTAG 367 (382) Q Consensus 291 ---~pnl~i~~~peL~~a~g~g~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~~~~~~~~~~~t~G 367 (382) .-++.++..+.+. .+ .+++-.-.+.....+. ....+ +.-+.. ....+. -.+..-+..| .| T Consensus 316 ~~~l~G~pVv~~~~~~-------~~-~~~~gd~~~~~~~~~~---~~~~i-~~~~~~--~~~f~~--~~~~~r~~~r-~d 378 (395) T protein:vir:43 316 TPTLWRLPVVETQAIT-------QD-EFLTGAFSLGAQIFDR---MDIEV-LVSTEN--DKDFEN--NMVTIRAEER-LA 378 (395) T ss_pred CceecceeeEEcCCCC-------CC-cEEEEeccceEEEEEe---cceEE-EEeccc--cchhhc--CcEEEEEEEe-ec Confidence 0112233222211 01 1111111110000000 00000 000000 000000 0111122222 36 Q ss_pred eEeeccchheeecCC Q lcl|NC_017674. 368 ALCKRPWAVVRYLGI 382 (382) Q Consensus 368 v~i~~P~aia~~~GI 382 (382) +.+++|.||++++-= T Consensus 379 ~~v~~~~a~~~~~~t 393 (395) T protein:vir:43 379 FAVYRPEAFVTGSLT 393 (395) T ss_pred cEEecccceEEEEec Confidence 667889999887544 No 55 >protein:vir:8420 Length: 477 # NCBI annotation: gp15 # Family: family:all:21 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818316;genbank:gi:29566752;genbank:GeneID:1260033 Probab=96.35 E-value=0.00033 Score=39.68 Aligned_cols=341 Identities=10% Similarity=0.041 Sum_probs=141.2 Q ss_pred CCCcceeeeecCccc----ccccc---ccccchHHHHHHhhcceeccccchhhhhhhhcccccchhhhhhcccccCcccc Q lcl|NC_017674. 1 MSQISKTHSRLAGRN----AKPFD---LKNITNDAVASLSRIGLVFDHAVVQDQIKALAKAGAFRSGSAMDSNFTAPVTT 73 (382) Q Consensus 1 ~~~~~~~~~~~~~~~----~~~~~---~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~amDa~~~~~~t~ 73 (382) .......+....... .+.+. ...-.....+.+.+........ .. .......+... .+..++ T Consensus 93 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~---------~~~~~~~~~~~-~~~~~~ 160 (477) T protein:vir:84 93 ATVEVNEALTYEKGNGQSYFRDLAMQTVGMADEPAKERLRRHMVDVESD--KE---------IRKIAKVGEEY-RDLDRN 160 (477) T ss_pred cccccccchhhhhhHHHHHHHHHHHHHhhhhhhHHHHHHHHHHhhhhhh--hh---------HHHHHHhhhhh-cccccc Confidence 000000000000000 00000 0000000011111100000000 00 00000111111 111111 Q ss_pred cc-hh--HHHHHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEEeeecccc-eeeccccc-----CCceeeeee Q lcl|NC_017674. 74 PS-IP--TPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEPAGT-AVEYGDHT-----NIPLTSWNA 144 (382) Q Consensus 74 ~~-~~--~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e~~G~-a~~ygd~~-----DiP~vd~~~ 144 (382) .. .| +|-.+ +-.+|++.+........++++..... ....+.++..+..+. +.+.++++ +.|..+... T Consensus 161 ~~~gg~lv~~~~---~~~~ii~~l~~~~~i~~~~~~~~~~~-~~~~~~ip~~~~~~~~a~~~~Eg~~~~~~~~~~s~~~f 236 (477) T protein:vir:84 161 GGTGGYAVPPLW---MMNRFIELARAGRTYANLCPTEPLPG-GTSSINIPKILTGTSTAIQAADNAALTAPSAHEVDLTD 236 (477) T ss_pred CCCcceeeccch---hHHHHHHHhhhcchHHHhhceeeecC-CcceeEEEEEecCcceeeeeccCcccccccccccccce Confidence 11 12 32222 22356666655555556655433211 223456666554333 23445442 457788888 Q ss_pred eeeEeeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCcceecc-C Q lcl|NC_017674. 145 NFERRTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAFQTP-P 223 (382) Q Consensus 145 ~~~~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~~~~-a 223 (382) +...-+.+.++..+.+|.+=| .....++.+--......++...+|.-.++|+.. .+...|++|.+++...+.+ + T Consensus 237 ~~i~~~~~k~~~~~~iS~ell---~ds~~~l~~~i~~~l~~~~~~~~d~~~l~G~Gt--~~~p~Gi~~~~~~~~~~~~~~ 311 (477) T protein:vir:84 237 GFVQANVKTIAGQQGIAIQLL---DQAAVSVDEFVFRDLAADYANKLNVQVISGTGS--NNQVVGVRATAGITQVTATSA 311 (477) T ss_pred eeEEEeeeeEEeeeHHHHHHH---hccchhHHHHHHHHHHHHHHHHHHHHHhccCCC--CCccceeeecccccccccccc Confidence 888888889988888887444 334567888889999999999999999999743 3346899999987644322 2 Q ss_pred CCCccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhccc-cCCCCccHHH----------HHHHh-- Q lcl|NC_017674. 224 SQGWSTADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSV-TTPYGISVSD----------WIEQT-- 290 (382) Q Consensus 224 ~~~Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~-t~~~~~Tvl~----------~l~~n-- 290 (382) ++.|+ ..+..+++|.+++..+-.... . .+...+|.|..+..|.. .+..|.-+++ ++..+ T Consensus 312 ~~t~~--~~~~~~~~i~~~~~~~~~~~~--~----~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~~~ 383 (477) T protein:vir:84 312 GSALE--KHQIIYQKIADAIQRVHTSRF--L----EPEVIVMHPRRWASFHAIFAGDDRPLIVPSGPGFNNLGVLTEVAS 383 (477) T ss_pred ccchh--hHHHHHHHHHHHHhhcccccc--C----CccEEEEcHHHHHHHHHhhccCCCeeeecCccccccccccccccc Confidence 22332 234455555555544332211 1 12346778877777743 2222322221 11111 Q ss_pred ------cCccEEEEccccccccCCCCCceeEEEEcchhhhhhhccccccchhhhhhhhhhhhcccceecCCceEeccccc Q lcl|NC_017674. 291 ------YPKMRIVSAPELSGVQMKAQEPEDALVLFVEDVNAAVDGSTDGGSVFSQLVQSKFITLGVEKRAKSYVEDFSNG 364 (382) Q Consensus 291 ------~pnl~i~~~peL~~a~g~g~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~~~~~~~~~~~ 364 (382) .-+..++..+.+-.-.|.+.+...+++-.-.++ +.+. .+... +..|..+ .- .-...|++ .+. T Consensus 384 ~~~~~~l~G~pVv~s~~~p~~~~~~~d~~~i~~gd~~~~---~i~~-~~~~~--~~~~~~~---~~-~~~~~~~v--~~~ 451 (477) T protein:vir:84 384 QRVVGQMHGLPVVTDPTLPTTLGTGTDQDVIHVLRASDL---ALFE-SSVRM--RALQETR---AE-NLSVLLQV--YGY 451 (477) T ss_pred ccccchhcccceEecCcccccccccCCcceEEEEEeceE---EEEe-eceeE--Eeccccc---cc-cceeeeee--hhh Confidence 001223333322111111222222222111111 0000 00000 0011110 00 00011111 111 Q ss_pred eeeeEeeccchheeecCC Q lcl|NC_017674. 365 TAGALCKRPWAVVRYLGI 382 (382) Q Consensus 365 t~Gv~i~~P~aia~~~GI 382 (382) ...+.+|+|.||+.++|. T Consensus 452 ~~~~~~r~~~afv~~t~~ 469 (477) T protein:vir:84 452 LAFTAARFPQSVVEIGGT 469 (477) T ss_pred hhhhhhccccceEEeecc Confidence 223567889999999999 No 56 >protein:vir:97053 Length: 390 # NCBI annotation: putative head protein # Family: family:all:585 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453565;genbank:gi:84662600;genbank:GeneID:5142468 Probab=96.30 E-value=0.0004 Score=39.22 Aligned_cols=317 Identities=13% Similarity=0.019 Sum_probs=145.3 Q ss_pred CCCcceeeeecCccccccccccccchHHHHHHhhcceeccccchhhhhhhhcccccchhhhhhcccccCcccccchh--H Q lcl|NC_017674. 1 MSQISKTHSRLAGRNAKPFDLKNITNDAVASLSRIGLVFDHAVVQDQIKALAKAGAFRSGSAMDSNFTAPVTTPSIP--T 78 (382) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~amDa~~~~~~t~~~~~--~ 78 (382) +.+..+.... .....++..-.......++.+.+.+..-... ... .......+.... ++.+.| + T Consensus 61 ~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~-----~~~~~~~~~~~~--~~~~~g~li 125 (390) T protein:vir:97 61 VAELEGNGAG-GDVQHVSVGDMFVASEQFQASTGRWNDRSAR-------ATM-----NIKAALNTASTD--AAGSAGALT 125 (390) T ss_pred HHHHHhcccc-cccccccchhhhhhhHHHHHHHHHhhhhhhh-------hhh-----HHHHHHHhhhcc--ccccccccc Confidence 0000000000 0000000000001111222221111110000 000 001111111111 122223 3 Q ss_pred HHHHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEEeeec-ccceeecccccCCceeeeeeeeeEeeEEEEEEE Q lcl|NC_017674. 79 PIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEP-AGTAVEYGDHTNIPLTSWNANFERRTIVRGELG 157 (382) Q Consensus 79 ~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e~-~G~a~~ygd~~DiP~vd~~~~~~~~~v~~~~~g 157 (382) |..+. ++|++.+......+.++++...+. ....|..... .+.+.+.+.+..+|..+..........+.++.. T Consensus 126 p~~~~----~~ii~~~~~~~~i~~~~~~~~~~~---~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~ 198 (390) T protein:vir:97 126 TPNRL----PGFITPPDARLTVRDLIGSGRTDS---ALIEYVQETGFVNNAAIVAEGALKPESSLKFAKKTDTTHVIAHT 198 (390) T ss_pred chhhh----HHHHHHHhhhhhhHhhcceeeccC---CceEEEEEecCCcceeeecCCccccccccceeEEEEeeeeEEEe Confidence 33333 366666666666666666554433 3456666654 467778888889999999999999999999999 Q ss_pred EEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCcceeccCCCCccccCHHHHHH Q lcl|NC_017674. 158 MMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAFQTPPSQGWSTADWAGIIG 237 (382) Q Consensus 158 ~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~~~~a~~~Wa~kT~~eI~~ 237 (382) +.++.+ +-.-. .++.+.-.....+++.+.+|+-+++|+.. .....|++|.++...... ..+.+..++ T Consensus 199 ~~is~e-ll~ds---~~l~~~i~~~la~a~~~~~d~a~l~G~g~--~~~p~Gi~~~~~~~~~~~-------~~~~~~~~d 265 (390) T protein:vir:97 199 MKATRQ-ILSDA---PQLASYMNNRLIRGLKVKEDAEILRGTGA--NDGLLGLIPQATTYAAPT-------TIAGATRVD 265 (390) T ss_pred ehhhHH-HHHhH---HHHHHHHHHHHHHHHHHHHHHHHhhcCCC--Cccccceeeccccccccc-------cccccchHH Confidence 999885 43322 25788888888899999999999999743 344689999876443221 123334466 Q ss_pred HHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhccc-cCCCCccHHHHHHHh----cCccEEEEccccccccCCCCCc Q lcl|NC_017674. 238 DIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSV-TTPYGISVSDWIEQT----YPKMRIVSAPELSGVQMKAQEP 312 (382) Q Consensus 238 Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~-t~~~~~Tvl~~l~~n----~pnl~i~~~peL~~a~g~g~~~ 312 (382) ||.+++..+...-. .+..++|.|+.+..|.+ .+..|.-++.-.... +-++.++..+.+. .+ T Consensus 266 ~~~~~~~~~~~~~~-------~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~l~G~pV~~~~~~~-------~~ 331 (390) T protein:vir:97 266 QLRLAMLQASLAEY-------PASGIVINPIDWAAIELAKDANNQYLIGNARGTLTPTLWGLPVVATQAMA-------PG 331 (390) T ss_pred HHHHHHHhhccccC-------CCCEEEEcHHHHHHHHHhhcCCCceeecCccCCCCceecceeeEEcCCCC-------CC Confidence 77777765543221 23478999999988864 233343332110000 0112222222111 01 Q ss_pred eeEEEEcchhhhhhhccccccchhhhhhhhhhhhcccceecCCce-----EeccccceeeeEeeccchheeecCC Q lcl|NC_017674. 313 EDALVLFVEDVNAAVDGSTDGGSVFSQLVQSKFITLGVEKRAKSY-----VEDFSNGTAGALCKRPWAVVRYLGI 382 (382) Q Consensus 313 ~~~~~~~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~~~~-----~~~~~~~t~Gv~i~~P~aia~~~GI 382 (382) .+++-.-... +....+..... ........| ..-+..| .|..+++|.||++.+== T Consensus 332 -~~~~gd~~~~-------------~~~~~~~~~~i-~~~~~~~~f~~~~~~~r~~~r-~d~~v~~~~a~v~~~~a 390 (390) T protein:vir:97 332 -EFLVGAFDLA-------------AQIFDQWDARV-EIGYVNDDFQRNMVTVLAEER-LALVVYRPEALITGSFA 390 (390) T ss_pred -cEEEEeccce-------------EEEEEecceEE-EEeecccccccCcEEEEEEEe-eccEEeccccEEEEEeC Confidence 1111110000 00000000000 000000011 1122222 35567778887665422 No 57 >protein:vir:81227 Length: 413 # NCBI annotation: gp6, major capsid protein # Family: family:all:585 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456736;genbank:gi:157168379;hssp:P49861;interpro:IPR006444;uniprot:Q9MBJ9;genbank:GeneID:5580350 Probab=96.29 E-value=0.00078 Score=37.63 Aligned_cols=325 Identities=11% Similarity=0.046 Sum_probs=149.8 Q ss_pred CCC-----c-ceeeeec--Cccc---ccccccccc--chHHHHHHhhcceeccccchhhhhhhhcccccchhhhhhcccc Q lcl|NC_017674. 1 MSQ-----I-SKTHSRL--AGRN---AKPFDLKNI--TNDAVASLSRIGLVFDHAVVQDQIKALAKAGAFRSGSAMDSNF 67 (382) Q Consensus 1 ~~~-----~-~~~~~~~--~~~~---~~~~~~~~~--~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~amDa~~ 67 (382) +.+ . ......+ ++.. .+....++. ........+..+....... .. ......+... T Consensus 51 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~-------~~~~~~~~~~ 118 (413) T protein:vir:81 51 LQEATAGSVDSEKSGELTRKGEGYKSIGEFFAKRAGDQIKQQAGGAQLNYSVGEYV-----AP-------RVKAASDPAS 118 (413) T ss_pred HHHHHHhHHhHHHhhhHhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhhh-----hh-------HHHhhhhhhh Confidence 000 0 0000000 0000 111111111 1112222222222211100 00 0001111111 Q ss_pred c-CcccccchhHHHHHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEEeeec----ccceeecccccCCceeee Q lcl|NC_017674. 68 T-APVTTPSIPTPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEP----AGTAVEYGDHTNIPLTSW 142 (382) Q Consensus 68 ~-~~~t~~~~~~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e~----~G~a~~ygd~~DiP~vd~ 142 (382) . +..+..+..+|.. +.++|++.+......++++++..... .+..|.+... .+.+.+.+.+..+|..+. T Consensus 119 ~~~~~~~~~~~vp~~----~~~~ii~~~~~~~~l~~~~~~~~~~~---~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~ 191 (413) T protein:vir:81 119 TATLTDEFQGGYGTT----WNRNIIYRRREKLVVADLMDNLTMTN---TTIKYLMEKANRVVEGGFKTVAEGGKKPYMRF 191 (413) T ss_pred hcccccccccccchh----hHHHHHHHHhhhhhHHhhcceeeccC---CceeEEEeccccccccccceecCcccccccCc Confidence 1 1112223334433 44578888888888888877654433 3344444322 245567788888888774 Q ss_pred -eeeeeEeeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCcceec Q lcl|NC_017674. 143 -NANFERRTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAFQT 221 (382) Q Consensus 143 -~~~~~~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~~~ 221 (382) ........++.++..+.+|.+=|..+. .|.+--....+.++...+|+-+++|+..+ ....|++|.+++.+... T Consensus 192 ~~f~~i~~~~~k~~~~~~iS~ell~ds~----~l~~~i~~~la~~~~~~~d~~~l~G~G~~--~~~~Gi~~~~~~~~~~~ 265 (413) T protein:vir:81 192 ADFDIVTESLSKIAGLTKITDEMIEDYD----FLVSYINARLLEELAIEEERQLLLGDGTG--NNLTGLLKRDGIQTLAV 265 (413) T ss_pred ccceeeEeeeeeEEEeehhhHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHhccCCCC--Ccccccccccccccccc Confidence 678888889999999999986444332 37777777778888888888899997442 34579999888653322 Q ss_pred cCCCCccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhccc-cCCCCccHHHHHHHh---cCc---- Q lcl|NC_017674. 222 PPSQGWSTADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSV-TTPYGISVSDWIEQT---YPK---- 293 (382) Q Consensus 222 ~a~~~Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~-t~~~~~Tvl~~l~~n---~pn---- 293 (382) . +.+..+++|..++..+....+. . +..++|.++.+..|.. .+..|.-++.-.... ++. T Consensus 266 ~--------~~~~~~~~i~~~~~~~~~~~~~--~----~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~ 331 (413) T protein:vir:81 266 S--------NKDELADSIYKAMTNISLATPF--Q----ADALVINPLDYQELRLAKDANGQYYGGGVFQGQYGSGGIMLD 331 (413) T ss_pred c--------ccchhHHHHHHHHHHhhhhccC--C----CcEEEEcHHHHHHHHHhhccCCceeccccccccccccccccC Confidence 1 2334577777777665443332 2 3468899998888853 243444333211110 000 Q ss_pred cEEEEccccccccCCCCCceeEEEEcchhhhhhhccccccchhhhhhhhhhhhccccee------cCCceEeccccceee Q lcl|NC_017674. 294 MRIVSAPELSGVQMKAQEPEDALVLFVEDVNAAVDGSTDGGSVFSQLVQSKFITLGVEK------RAKSYVEDFSNGTAG 367 (382) Q Consensus 294 l~i~~~peL~~a~g~g~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~~~------~~~~~~~~~~~~t~G 367 (382) -++...|=.-...- ..+ .+++-+-.... ....+..+. +.+.. ..-....-+..|. + T Consensus 332 ~~l~G~pv~~s~~~--~~~-~~~~gd~~~~~-------------~~~~~~~~~-v~~~~~~~~~~~~~~~~~r~~~r~-d 393 (413) T protein:vir:81 332 PAPWGLRTVQSQVV--PVG-KPVVGAFRSAA-------------SVLRKGGVR-IDSTNTNVDDFENNLITVRAEERV-G 393 (413) T ss_pred ceecceeeEEcCCC--Ccc-cEEEEecccEE-------------EEEEecceE-EEEeccccchhhcCcEEEEEEEee-c Confidence 01111111110000 001 11111111100 000000000 00000 0111233444444 4 Q ss_pred eEeeccchheeecCC Q lcl|NC_017674. 368 ALCKRPWAVVRYLGI 382 (382) Q Consensus 368 v~i~~P~aia~~~GI 382 (382) +.+++|.||+.++.= T Consensus 394 ~~~~~~~a~~~l~~~ 408 (413) T protein:vir:81 394 LMVTFPEAIVQLDVA 408 (413) T ss_pred cEEecccceEEEEec Confidence 556889999988766 No 58 >protein:vir:104256 Length: 458 # NCBI annotation: major head protein precursor # Family: family:all:27070 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006977;genbank:gi:46401878;genbank:GeneID:2777673 Probab=96.25 E-value=0.00014 Score=41.70 Aligned_cols=335 Identities=10% Similarity=0.003 Sum_probs=139.5 Q ss_pred CCC----ccee-eeecCcccc-ccccccccchHHHHHHhhcceeccccch-hhhhhhhcccccchhhhhhcccccCcccc Q lcl|NC_017674. 1 MSQ----ISKT-HSRLAGRNA-KPFDLKNITNDAVASLSRIGLVFDHAVV-QDQIKALAKAGAFRSGSAMDSNFTAPVTT 73 (382) Q Consensus 1 ~~~----~~~~-~~~~~~~~~-~~~~~~~~~~~~~~~l~~~g~~~~~~~~-~~~~~~~~~~~~~~~~~amDa~~~~~~t~ 73 (382) +.+ .... .-....+.. +.... -......++++..+ ..... +........ .....++... ...+. T Consensus 99 ~~~~~~~~~~~e~~~~~~~~~~~~~~~--~~~~~~~~~e~~~~--~~~~~~~~~~~~~~~---~~~~~a~~~~--~~~~~ 169 (458) T protein:vir:10 99 QDEIKSLLTAREGRSFVGDSVAKALYG--TQENFEDEVEKLVL--LSYVMEKGVFETEHG---QRHLKAVNQS--SSVEV 169 (458) T ss_pred HHHHHHHHHHHHhhhhhhhhhhccchh--hhhhHHHHHHHHHH--HHHHHhhccchhhhh---hhhhhhhhhc--ccCcc Confidence 000 0000 000000000 00000 00000111111000 00000 000000000 0001111111 01111 Q ss_pred cchhHHHHHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEEeeecccceeecccccCCc------eeeeeeeee Q lcl|NC_017674. 74 PSIPTPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEPAGTAVEYGDHTNIP------LTSWNANFE 147 (382) Q Consensus 74 ~~~~~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e~~G~a~~ygd~~DiP------~vd~~~~~~ 147 (382) .+.-+|..+ .+.|++.+......+.+..+..... ....|.+....+.|.+.+.+...| ..+...... T Consensus 170 g~~~ip~~~----~~~ii~~~~~~~~l~~~~~~~~~~~---~~~~~~~~~~~~~a~~v~e~~~~~~~~~~~~~~~~~~~i 242 (458) T protein:vir:10 170 SSESYETIF----SQRIIRDLQKELVVGALFEELPMSS---KILTMLVEPDAGKATWVAASTYGTDTTTGEEVKGALKEI 242 (458) T ss_pred ccceehhhH----hHHHHHHHHhhhhHHhhcceeecCC---cceEEEEecCCcceeecccccccccccccccccccceee Confidence 222344433 3456666555555555555433322 344555555556666666554444 334455666 Q ss_pred EeeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCcceeccCCCCc Q lcl|NC_017674. 148 RRTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAFQTPPSQGW 227 (382) Q Consensus 148 ~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~~~~a~~~W 227 (382) ....+.++..+.+|.+=+.- ...++.+.-......++...+|.-+++|+.. ....|++|++......++....+ T Consensus 243 ~~~~~k~~~~v~is~ell~d---s~~~~~~~i~~~l~~~i~~~~d~~~l~G~G~---~~p~Gi~~~~~~~~~~~~~~~~~ 316 (458) T protein:vir:10 243 HFSTYKLAAKSFITDETEED---AIFSLLPLLRKRLIEAHAVSIEEAFMTGDGS---GKPKGLLTLASEDSAKVVTEAKA 316 (458) T ss_pred EeeeeeEEeeehhhHHHHhc---chHHHHHHHHHHHHHHHHHHHHHHhhcCCCC---Cccceeeecccccccceeecccc Confidence 67778888888888754432 3356888888888889999999999999743 45679999998664433322222 Q ss_pred cccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhcccc-CCCCccHHHH-HHHh--------cCccEEE Q lcl|NC_017674. 228 STADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSVT-TPYGISVSDW-IEQT--------YPKMRIV 297 (382) Q Consensus 228 a~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~t-~~~~~Tvl~~-l~~n--------~pnl~i~ 297 (382) ... ..--++||.+++..+...- . .+..++|.+..+..|... +..|.-++.. +... +-+..++ T Consensus 317 ~~~-~~~~~~~i~~~~~~l~~~~----~---~~~~~v~~~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~l~G~pv~ 388 (458) T protein:vir:10 317 DGS-VLVTAKTISKLRRKLGRHG----L---KLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDSVKLQGQVGRIYGLPVV 388 (458) T ss_pred ccc-ccccHHHHHHHHHhhhhhh----c---CCCEEEEcHHHHHHHHhhcccCCceeeccccccccccCcCceecceeeE Confidence 111 1112456666776653321 1 134689999998888542 3333323221 1111 0112232 Q ss_pred EccccccccCCCCCceeEEEEcchhhhhhhccccccchhhhhhhhhhhhcccceecCCceEeccccceeeeEeeccchhe Q lcl|NC_017674. 298 SAPELSGVQMKAQEPEDALVLFVEDVNAAVDGSTDGGSVFSQLVQSKFITLGVEKRAKSYVEDFSNGTAGALCKRPWAVV 377 (382) Q Consensus 298 ~~peL~~a~g~g~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~~~~~~~~~~~t~Gv~i~~P~aia 377 (382) ....+- ..++.+..++..|.+.+. ..+.+ + . + ....+|.. .-...+-...| -|..+++|.+|+ T Consensus 389 ~~~~~p---~~~~~~~~~~~~f~~~~~-~~~~~--~-~--~-v~~d~~~~------~~~~~~~~~~r-~~~~v~~~~a~v 451 (458) T protein:vir:10 389 VSEYFP---AKANSAEFAVIVYKDNFV-MPRQR--A-V--T-VERERQAG------KQRDAYYVTQR-VNLQRYFANGVV 451 (458) T ss_pred Eccccc---cccCCcceEEEEecccEE-EEEee--c-e--E-EEeecccC------CCceEEEEEEE-ecceEecccceE Confidence 221111 111222223323322111 00000 0 0 0 00111110 11122333444 467888999887 Q ss_pred eecCC Q lcl|NC_017674. 378 RYLGI 382 (382) Q Consensus 378 ~~~GI 382 (382) ..+== T Consensus 452 ~~~~a 456 (458) T protein:vir:10 452 SGTYA 456 (458) T ss_pred EEeec Confidence 62111 No 59 >protein:vir:81070 Length: 390 # NCBI annotation: p09 # Family: family:all:585 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285679;genbank:gi:148727187;genbank:GeneID:5247115 Probab=95.57 E-value=0.0016 Score=35.93 Aligned_cols=323 Identities=14% Similarity=0.049 Sum_probs=147.1 Q ss_pred CCCcceeeeecCcccc--ccccccccchHHHHHHhhcceeccccchhhhhhhhcccccchhhhhhcccccCcccccchh- Q lcl|NC_017674. 1 MSQISKTHSRLAGRNA--KPFDLKNITNDAVASLSRIGLVFDHAVVQDQIKALAKAGAFRSGSAMDSNFTAPVTTPSIP- 77 (382) Q Consensus 1 ~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~amDa~~~~~~t~~~~~- 77 (382) ..++........+... +...........++.+.+.+-...+.. .. + ...+..+. .. .++.+.| T Consensus 58 e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~--~-----~~~~~~~~-~~-~~~~~~g~ 123 (390) T protein:vir:81 58 RQRVAELEGNGAGGDVQHVSVGDMFVASEQFQASAGRWNDRSARA-----TM--N-----IKAALNTA-ST-DAAGSAGA 123 (390) T ss_pred HHHHHHHHhcccccccccccchhhhhhhHHHHHHHHHHhhhhhhh-----hh--H-----HHHHHHhh-cc-ccccCCcc Confidence 0001111111111111 111111111122222222221111000 00 0 00011111 11 1122222 Q ss_pred -HHHHHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEEeeec-ccceeecccccCCceeeeeeeeeEeeEEEEE Q lcl|NC_017674. 78 -TPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEP-AGTAVEYGDHTNIPLTSWNANFERRTIVRGE 155 (382) Q Consensus 78 -~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e~-~G~a~~ygd~~DiP~vd~~~~~~~~~v~~~~ 155 (382) +|-.+. +++++.+......+.++.+..... ....|..... .+.+.+.+.+..+|..+.........++.++ T Consensus 124 ~~~~~~~----~~ii~~~~~~~~l~~~~~~~~~~~---~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~ 196 (390) T protein:vir:81 124 LTTPNRL----PGFITPPDARLTVRDLIGSGRTDS---ALIEYVQETGFVNNAAIVAEGALKPESSLKFAKKTDTTHVIA 196 (390) T ss_pred eechhhh----HHHHHHHhhhhhhhhhcceeeccC---CceEEEEEecCCcceeeecCCcccccccceeeEEEEeeeEEE Confidence 222222 357777766666777766554433 3455565543 4677788888899999999999999999999 Q ss_pred EEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCcceeccCCCCccccCHHHH Q lcl|NC_017674. 156 LGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAFQTPPSQGWSTADWAGI 235 (382) Q Consensus 156 ~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~~~~a~~~Wa~kT~~eI 235 (382) ..+.+|.+=++.+ .++.+.-.....+++...+|+-+++|+.. .....|++|.+........ .+.... T Consensus 197 ~~~~is~ell~d~----~~~~~~i~~~l~~~~~~~~d~a~l~G~g~--~~~~~Gi~~~~~~~~~~~~-------~~~~~~ 263 (390) T protein:vir:81 197 HTMKATRQILSDA----PQLASYMNNRLIRGLKVKEDAEILRGTGA--NDGLLGLIPQATTYAAPTT-------IAGATR 263 (390) T ss_pred EeehhhHHHHHhH----HHHHHHHHHHHHHHHHHHHHHHHHhcCCC--CCcccceeecccccccccc-------cccchh Confidence 9999988533322 25888888888888888999999999754 3457899998775433221 112223 Q ss_pred HHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhccc-cCCCCccHHHHHHHhc-C---ccEEEEccccccccCCCC Q lcl|NC_017674. 236 IGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSV-TTPYGISVSDWIEQTY-P---KMRIVSAPELSGVQMKAQ 310 (382) Q Consensus 236 ~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~-t~~~~~Tvl~~l~~n~-p---nl~i~~~peL~~a~g~g~ 310 (382) ++||.+++.++...- + .+..++|.|+.+..|.+ .+..|.-++.-....- + ++.++..+.+. T Consensus 264 ~~~~~~~~~~~~~~~---~----~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~l~G~pv~~~~~~p------- 329 (390) T protein:vir:81 264 VDQLRLAMLQASLAE---Y----NPSGIVINPIDWAAIELAKDANNQYLIGNARGTLTPTLWGLPVVATQAMA------- 329 (390) T ss_pred HHHHHHHHHhhcccc---C----CCCEEEEcHHHHHHHHHhhcCCCceeecCcccccCceecceeeEEcCCCC------- Confidence 567777776654331 1 23468999999888864 3444443332111110 1 11222222111 Q ss_pred CceeEEEEcchhhhhhhccccccchhhhhhhhhhhhcccceecCCceEeccccceeeeEeeccchheeecCC Q lcl|NC_017674. 311 EPEDALVLFVEDVNAAVDGSTDGGSVFSQLVQSKFITLGVEKRAKSYVEDFSNGTAGALCKRPWAVVRYLGI 382 (382) Q Consensus 311 ~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~~~~~~~~~~~t~Gv~i~~P~aia~~~GI 382 (382) .+. +++-.-.+.....+. +-+.......+.-...-....-+..|.+ +.++.|.||+..+== T Consensus 330 ~~~-~~~gd~~~~~~~~~~---------~~~~v~~~~~~~~~~~~~v~~r~~~r~d-~~v~~~~a~v~~t~a 390 (390) T protein:vir:81 330 PGE-FLVGAFDLAAQIFDQ---------WDARVEIGYVGEDFQRNMITVLAEERLA-LVVYRPEALISGSFA 390 (390) T ss_pred CCc-EEEEehhceEEEEEe---------cceEEEEecccchhhcCcEEEEEEEeec-cEEecccceEEEEeC Confidence 011 111110110000000 0000000000000001112233444444 466777777654311 No 60 >protein:vir:93616 Length: 645 # NCBI annotation: putative major head protein/prohead protease # Family: family:all:21 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449293;genbank:gi:157166041;goa:Q6H9U8;interpro:IPR006433;uniprot:Q6H9U8;genbank:GeneID:5580438 Probab=95.07 E-value=0.0028 Score=34.55 Aligned_cols=345 Identities=12% Similarity=0.002 Sum_probs=135.3 Q ss_pred CCCcceee--------------------------eecCccccccccccccchHHHHH----Hhhcceeccccchhhhhhh Q lcl|NC_017674. 1 MSQISKTH--------------------------SRLAGRNAKPFDLKNITNDAVAS----LSRIGLVFDHAVVQDQIKA 50 (382) Q Consensus 1 ~~~~~~~~--------------------------~~~~~~~~~~~~~~~~~~~~~~~----l~~~g~~~~~~~~~~~~~~ 50 (382) .+.+..+- .-...+..++- .+......|.+ |...+-.+.. ..++... T Consensus 242 i~~l~~~i~r~e~~e~~~a~~a~pv~~~~~~~~~~~~~~~~~~~~-~~~~kg~~f~~~~~al~~~~g~~~~--a~e~a~~ 318 (645) T protein:vir:93 242 IRQVDAHLKRLRELEAGKAATAQPVKQAGNGNVAAVASAPVIRVE-QKLDKGIGFARFAKSLAAAKGVRSE--ALEVARR 318 (645) T ss_pred HHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccch-hhhhhhhhHHHHHHHHHhcccchhH--HHHHHHh Confidence 00000000 00000000000 00000001111 1111100000 0001100 Q ss_pred hcccc---cchhhhhhcccccCcccc--cchhHHHHHHhhhhhhheeccccccchhhhCccccCCCcc-eeeEEEEeeec Q lcl|NC_017674. 51 LAKAG---AFRSGSAMDSNFTAPVTT--PSIPTPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWE-DQEIVQGIVEP 124 (382) Q Consensus 51 ~~~~~---~~~~~~amDa~~~~~~t~--~~~~~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~-~~t~t~~v~e~ 124 (382) ..... ......++-+.... .+. .+.-+|..+.. +|++.+.+..-.+.+-.....+-.. ...+..+.... T Consensus 319 ~~~~~~~~~~~~~~a~~~~~~~-~~~~~Gg~~vp~~~~~----~ii~~l~~~svv~~l~~~~~~~~~~~~~~~~ip~~t~ 393 (645) T protein:vir:93 319 QYPDDSRLHHVLKSAVGAGTTT-DPQWAGSLSEYQEYAQ----DFIDYLRPQTIIGRFGQGGIPALRQVPFNIRVHAQVS 393 (645) T ss_pred hcccchhhhhhhhhhhhccccc-cccccCCccCchhhHH----HHHHhhhhhhhHHhhccccccccccccCceeeeeeec Confidence 00000 00001111111000 011 11112322222 4444444433333332221111111 12344555555 Q ss_pred ccceeecccccCCceeeeeeeeeEeeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCC- Q lcl|NC_017674. 125 AGTAVEYGDHTNIPLTSWNANFERRTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGL- 203 (382) Q Consensus 125 ~G~a~~ygd~~DiP~vd~~~~~~~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~- 203 (382) .+.+.+.|...++|..+...+...-+.+.++.-..+|.+=|+ ....++.+--.....+++...+++-++.|+..+. T Consensus 394 ~~~a~wv~Eg~~~~~s~~~f~~v~l~~~kla~~~~iS~ell~---ds~~~~~~~i~~~l~~aia~~~d~a~l~g~g~~~~ 470 (645) T protein:vir:93 394 GGAAGWVGEGKTKPLTKFDFESITFSHAKVSAIAVLTEELIR---FSSPAADALVRNALAEAVVARLDTDFVDPKKAAVA 470 (645) T ss_pred CcceEEeccCccccccccceeEEEEeeEEEEEeehhHHHHHh---hchHHHHHHHHHHHHHHHHHHHHHHhhcCCCcccC Confidence 566778888889999999999999999999998888874443 3356677777788888888999988888874321 Q ss_pred cccceEEEeCCCCcceeccCCCCccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhcccc-CCCCcc Q lcl|NC_017674. 204 GNRTYGFLNDPNLPAFQTPPSQGWSTADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSVT-TPYGIS 282 (382) Q Consensus 204 ~~g~~GllN~P~l~~~~~~a~~~Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~t-~~~~~T 282 (382) ...-.|++|. +. .+ ++......|+..++..+...... +.. ...+|.|..+..|... +..|.- T Consensus 471 ~~~p~gi~~~--~~--~~--------~~~~~~~~d~~~~~~~~~~a~~~---~~~--a~~vmn~~~~~~L~~lkd~~G~~ 533 (645) T protein:vir:93 471 DVSPASITHD--VK--GT--------ASSGNPDADAEAAFGQFVAANLQ---PTG--AVWLMSSTNALALSMRKNALGQK 533 (645) T ss_pred Cccccceecc--cc--cc--------ccccchHHHHHHHHHHHHhcCCC---ccc--cEEEEcHHHHHHHHhccccCCce Confidence 1112344441 11 11 11112346777777776554321 111 2478899888888643 333332 Q ss_pred HHHHHHHhcCccEEEEcccccccc-CC---CCCceeEEE--------Ecchhhhhhhccccccchhh-hhhhhhhhhccc Q lcl|NC_017674. 283 VSDWIEQTYPKMRIVSAPELSGVQ-MK---AQEPEDALV--------LFVEDVNAAVDGSTDGGSVF-SQLVQSKFITLG 349 (382) Q Consensus 283 vl~~l~~n~pnl~i~~~peL~~a~-g~---g~~~~~~~~--------~~~~~v~~~~~~~~~~~~~~-~~~~p~~~~~l~ 349 (382) ++--+-.. +-++...|=+.... .. ..+...+.+ .+.++..-.+....++..+- ....+..+ T Consensus 534 ~~~~~~~~--~~tL~G~PV~~s~~vp~~~~~gd~s~~~ig~~~~v~i~~s~~a~~~~~~~~~~~~~~~~~~~~v~l---- 607 (645) T protein:vir:93 534 EYPDMTLL--GGSFQGLPVIVSQYVGDQLVLVNAPDIYLADDGGVAVDMSREASLEMQSEPTGDSTTPSPVELVSM---- 607 (645) T ss_pred eecCCCCC--CceeeceeeEEeccCCcceeEeccccEEEEEecceEEEeecceeEEEeecccccccccccccchhH---- Confidence 21000000 00111111110000 00 000111111 11111000000000000000 00000000 Q ss_pred ceecCCceEeccccceeeeEeeccchheeecCC Q lcl|NC_017674. 350 VEKRAKSYVEDFSNGTAGALCKRPWAVVRYLGI 382 (382) Q Consensus 350 ~~~~~~~~~~~~~~~t~Gv~i~~P~aia~~~GI 382 (382) .| .-.+-+.++.+++. .+++|.||++++|| T Consensus 608 f~--~d~vaira~~r~d~-~~~~p~a~~~lt~~ 637 (645) T protein:vir:93 608 FQ--TGSVAIRAERWINW-RRRRTAAVAVITGV 637 (645) T ss_pred hh--cCceEEEEEEEEcc-eeeCccceEEEecc Confidence 00 11223455566554 45999999999999 No 61 >protein:vir:8102 Length: 543 # NCBI annotation: gp6 # Family: family:all:21 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817683;genbank:gi:29566114;genbank:GeneID:1259308 Probab=94.95 E-value=0.0011 Score=36.86 Aligned_cols=333 Identities=12% Similarity=-0.016 Sum_probs=144.9 Q ss_pred CCC----cceeeeecC---ccccccc-cc-cccchHHHHHHhhcceeccccchhhhhhhhcccccchhhhhhcccccCcc Q lcl|NC_017674. 1 MSQ----ISKTHSRLA---GRNAKPF-DL-KNITNDAVASLSRIGLVFDHAVVQDQIKALAKAGAFRSGSAMDSNFTAPV 71 (382) Q Consensus 1 ~~~----~~~~~~~~~---~~~~~~~-~~-~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~amDa~~~~~~ 71 (382) +.+ +.+....+. ....++. .. ......++.+..+-+. ...+.+. ...++........ T Consensus 188 ~e~~~~~~~~~~~~~d~~e~~~~~~~~~~~~~~~~~a~~~~~~~~~----------~~~l~~~----e~~~~~~~~~~~~ 253 (543) T protein:vir:81 188 SDNVRAAATKIIERFDDEDSTLARQCLATSSPAYLRAWSKMARNPH----------AAILTEE----EKRAINEVRAMGL 253 (543) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhhhHHHHHHHhhH----------HHHhhhh----hhhhhhhhhhccc Confidence 000 000000000 0000000 00 0000001111100000 0000000 0011111111123 Q ss_pred cccchh--HHHHHHhhhhhhheeccccc-cchhhhCccccCCCcceeeEEEEeeecccceeecccccCCceeeeeeeeeE Q lcl|NC_017674. 72 TTPSIP--TPIQFLQTWLPGFVKVMTAA-RKIDEIIGIDTVGSWEDQEIVQGIVEPAGTAVEYGDHTNIPLTSWNANFER 148 (382) Q Consensus 72 t~~~~~--~~~~~l~~idp~v~~~~~~~-~~~~~l~~v~t~g~~~~~t~t~~v~e~~G~a~~ygd~~DiP~vd~~~~~~~ 148 (382) ++.+.| +|..+. ++++.....+ -....+..+.+. ...+.+++....+.+.+.|.+..+|..+....... T Consensus 254 t~~~gg~lip~~~~----~~ii~~~~~~~~~l~~~~~~~~~----~g~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~ 325 (543) T protein:vir:81 254 TKADGGYLVPFQLD----PTVIITSNGSLNDIRRFARQVVA----TGDVWHGVSSAAVQWSWDAEFEEVSDDSPEFGQPE 325 (543) T ss_pred ccccCcccCchhhh----hHHHHHHHhhhchhhhhcccccC----CcceEEEEecCCcceeecccCccccccccccceee Confidence 333444 343333 2333222222 223344333222 13345666666777788888889999999999999 Q ss_pred eeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCcceeccCCCCcc Q lcl|NC_017674. 149 RTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAFQTPPSQGWS 228 (382) Q Consensus 149 ~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~~~~a~~~Wa 228 (382) ..++.++..+.+|.+ +..- ..++.+.-.....+++...+|.-+++|+.. .+...|+++++....... .. T Consensus 326 ~~~~k~~~~~~is~e-ll~d---~~~~~~~i~~~l~~~~~~~~d~ail~G~Gt--~~~p~Gi~~~~~~~~~~~-----~~ 394 (543) T protein:vir:81 326 IPVKKAQGFVPISIE-ALQD---EANVTETVALLFAEGKDELEAVTLTTGTGQ--GNQPTGIVTALAGTAAEI-----AP 394 (543) T ss_pred eeeeeeEeeehhhHH-HHhc---cHHHHHHHHHHHHHHHHHHHHHHHhccCCC--Ccccccchhhcccccccc-----cc Confidence 999999999999984 4332 248899999999999999999999999743 345679998765332211 11 Q ss_pred ccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhccc-cCCCCccHHHHHHHhcC----ccEEEEc---c Q lcl|NC_017674. 229 TADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSV-TTPYGISVSDWIEQTYP----KMRIVSA---P 300 (382) Q Consensus 229 ~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~-t~~~~~Tvl~~l~~n~p----nl~i~~~---p 300 (382) ..+..-.++|+.+++..+-.. +.. ...++|.+..+..|.. .+..|.=++.-+...-| ++.++.. | T Consensus 395 ~~~~~~~~~~~~~~~~~l~~~----~~~---~~~~v~n~~~~~~l~~lkd~~G~~l~~~~~~g~~~~l~G~pv~~~~~~~ 467 (543) T protein:vir:81 395 VTAETFALADVYAVYEQLAAR----HRR---QGAWLANNLIYNKIRQFDTQGGAGLWTTIGNGEPSQLLGRPVGEAEAMD 467 (543) T ss_pred cccccccHHHHHHHHHhhhcc----ccC---CcEEEEcHHHHHHHHHhhcCCCceeccCcCCCCCccccceeeEEecccc Confidence 122233467787787766422 122 2368999999988854 23333322221111111 1222222 2 Q ss_pred ccccccCCCCCceeEEEEcchhhhhhhccccccchhhhhhhhhhhhcccceecCCceEeccccceeeeEeeccchheeec Q lcl|NC_017674. 301 ELSGVQMKAQEPEDALVLFVEDVNAAVDGSTDGGSVFSQLVQSKFITLGVEKRAKSYVEDFSNGTAGALCKRPWAVVRYL 380 (382) Q Consensus 301 eL~~a~g~g~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~~~~~~~~~~~t~Gv~i~~P~aia~~~ 380 (382) ......+. .+...++ |.+ ....+.+...+ ..+ +..|.- ...-.........-...+.+ +.+++|.||+.+. T Consensus 468 ~~~~~~~~-~~~~~i~--~gd-~~~~~i~~~~~-~~i-~~~~~~--~~~~~~~~~~~~~~~~~r~d-~~v~~~~A~~~l~ 538 (543) T protein:vir:81 468 ANWNTSAS-ADNFVLL--YGN-FQNYVIADRIG-MTV-EFIPHL--FGTNRRPNGSRGWFAYYRMG-ADVVNPNAFRLLN 538 (543) T ss_pred cccccccc-CCcceEE--Eee-ccceeEEeecc-cEE-EEeccc--cccchhhcCceEEEEEEeec-cEeecccceEEEE Confidence 21111111 1111121 111 11110000000 000 000000 00000001112223344444 4557799998877 Q ss_pred CC Q lcl|NC_017674. 381 GI 382 (382) Q Consensus 381 GI 382 (382) -- T Consensus 539 ~~ 540 (543) T protein:vir:81 539 VE 540 (543) T ss_pred ec Confidence 66 No 62 >protein:vir:4600 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058445;genbank:gi:9635171;genbank:GeneID:1262708 Probab=94.78 E-value=0.0014 Score=36.25 Aligned_cols=330 Identities=9% Similarity=-0.002 Sum_probs=138.4 Q ss_pred CCCccee--------------eeecC--ccccccccccccchHHHHHHhhcceeccccchh-hhhhhhcccccchhhhhh Q lcl|NC_017674. 1 MSQISKT--------------HSRLA--GRNAKPFDLKNITNDAVASLSRIGLVFDHAVVQ-DQIKALAKAGAFRSGSAM 63 (382) Q Consensus 1 ~~~~~~~--------------~~~~~--~~~~~~~~~~~~~~~~~~~l~~~g~~~~~~~~~-~~~~~~~~~~~~~~~~am 63 (382) +..+.+. ..... .+....-+.... .........+..+...... ...+.+... .... T Consensus 44 v~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~ 116 (415) T protein:vir:46 44 ITDLRSQIQEKQEELDKLKEKDRTSENNQQSVEVNEARTY--RNQANINDLGISIQNTKVTSQEVRDFTEY-----LETR 116 (415) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccchhhhh--HHHHHHHHHHHhhhhhhhhHHHHHHHHHH-----Hhhh Confidence 0000000 00000 000000000000 0001111111111110000 000011000 0001 Q ss_pred cccccCcccccch--hHHHHHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEEeee--cccceeecccccCCce Q lcl|NC_017674. 64 DSNFTAPVTTPSI--PTPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVE--PAGTAVEYGDHTNIPL 139 (382) Q Consensus 64 Da~~~~~~t~~~~--~~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e--~~G~a~~ygd~~DiP~ 139 (382) .....+..++.+. -+|..+ .++|++.+........++.+..... .+..+.+.. ..+.+...+.+..+|- T Consensus 117 ~~~~~~~~~t~~g~~~iP~~~----~~~ii~~~~~~~~l~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~v~Eg~~~~~ 189 (415) T protein:vir:46 117 NDIQGGSLKTDSGFVVIPEEI----VTDILKLKEVEFNLDKYVTVKRVTN---GSGKYPVVRQSEVAALEKVEELEENPE 189 (415) T ss_pred hhhhhccccccCCcccccHHH----HHHHHHHHHhhhhhhhhcceeeccC---CceeEEEEEecCCcceeeccccccccc Confidence 1111112222233 255444 3467777666666777665543322 223344443 3345567788888886 Q ss_pred ee-eeeeeeEeeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCcc Q lcl|NC_017674. 140 TS-WNANFERRTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPA 218 (382) Q Consensus 140 vd-~~~~~~~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~ 218 (382) .+ ...+......+.++..+.+|.+=+ .....+|.+.-....++++.+.+|+-++.|+..+... .++......+. T Consensus 190 ~~~~~~~~v~~~~~k~~~~~~iS~ell---~ds~~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~--~~~~~~~~~~~ 264 (415) T protein:vir:46 190 LAVKPFFQLAYDINTHRGYFRISREAI---EDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTG--STSSGFEKEGK 264 (415) T ss_pred ccccceeeEEeeeeeeEeeehhhHHHH---hhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCCcc--ccccccccccc Confidence 54 578888889999999998888544 3345688888888899999999999999998543222 12222111111 Q ss_pred eeccCCCCccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhccc-cCCCCccHHH-HHHHhcC---- Q lcl|NC_017674. 219 FQTPPSQGWSTADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSV-TTPYGISVSD-WIEQTYP---- 292 (382) Q Consensus 219 ~~~~a~~~Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~-t~~~~~Tvl~-~l~~n~p---- 292 (382) . +. .+...-++||.+++.++...-. .+..++|.++.+..|.. .+..|.-++. -+....| T Consensus 265 ~-------~~-~~~~~~~~~i~~~~~~~~~~~~-------~~~~~v~n~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~l~ 329 (415) T protein:vir:46 265 K-------LE-VKKAKSLDDIKDAINLNVKPNY-------EHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLL 329 (415) T ss_pred e-------ec-cccccchHHHHHHHHhhhhhcc-------CCCEEEEcHHHHHHHHHhhccCCCeeeccCcCCCCCcccc Confidence 1 11 1111125667777776654321 23478999999988854 2333443321 0011111 Q ss_pred ccEEEEccccccccCCCCCceeEEEE-cchhhhhhhccccccchhhhhhhhhhhhcccceecCCceEeccccceeeeEee Q lcl|NC_017674. 293 KMRIVSAPELSGVQMKAQEPEDALVL-FVEDVNAAVDGSTDGGSVFSQLVQSKFITLGVEKRAKSYVEDFSNGTAGALCK 371 (382) Q Consensus 293 nl~i~~~peL~~a~g~g~~~~~~~~~-~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~~~~~~~~~~~t~Gv~i~ 371 (382) +..++..+.+- .... ++ ..+++- |.+-+.. .+. +-+. +........ ....-...|. |+.+. T Consensus 330 G~pV~~~~~~~-~~~~-~~-~~~~~gd~~~~~~~-~~~---------~~~~--v~~~~~~~~--~~~~~~~~r~-d~~v~ 391 (415) T protein:vir:46 330 GAKIEILPDEV-LGQK-GN-NTLIIGNLKDAIVL-FDR---------SQYQ--ASWTDYMHF--GECLMIAVRQ-DCRIL 391 (415) T ss_pred ceeeEEecccc-ccCC-Cc-cEEEEEehhccEEE-Eee---------cceE--EEeeccccC--ceEEEEEEEe-ccEEe Confidence 12233322221 1111 11 112211 1111100 000 0000 000010000 1112234454 66777 Q ss_pred ccchheeecCC Q lcl|NC_017674. 372 RPWAVVRYLGI 382 (382) Q Consensus 372 ~P~aia~~~GI 382 (382) +|.||++++-- T Consensus 392 ~~~a~~~~~~~ 402 (415) T protein:vir:46 392 DYKSAIVIEYD 402 (415) T ss_pred ccccEEEEEee Confidence 89999888644 No 63 >protein:vir:4700 Length: 415 # NCBI annotation: phi PVL ORF 7 homologue # Family: family:all:21 # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061632;genbank:gi:9635719;genbank:GeneID:1262976 Probab=94.78 E-value=0.0014 Score=36.25 Aligned_cols=330 Identities=9% Similarity=-0.002 Sum_probs=138.4 Q ss_pred CCCccee--------------eeecC--ccccccccccccchHHHHHHhhcceeccccchh-hhhhhhcccccchhhhhh Q lcl|NC_017674. 1 MSQISKT--------------HSRLA--GRNAKPFDLKNITNDAVASLSRIGLVFDHAVVQ-DQIKALAKAGAFRSGSAM 63 (382) Q Consensus 1 ~~~~~~~--------------~~~~~--~~~~~~~~~~~~~~~~~~~l~~~g~~~~~~~~~-~~~~~~~~~~~~~~~~am 63 (382) +..+.+. ..... .+....-+.... .........+..+...... ...+.+... .... T Consensus 44 v~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~ 116 (415) T protein:vir:47 44 ITDLRSQIQEKQEELDKLKEKDRTSENNQQSVEVNEARTY--RNQANINDLGISIQNTKVTSQEVRDFTEY-----LETR 116 (415) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccchhhhh--HHHHHHHHHHHhhhhhhhhHHHHHHHHHH-----Hhhh Confidence 0000000 00000 000000000000 0001111111111110000 000011000 0001 Q ss_pred cccccCcccccch--hHHHHHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEEeee--cccceeecccccCCce Q lcl|NC_017674. 64 DSNFTAPVTTPSI--PTPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVE--PAGTAVEYGDHTNIPL 139 (382) Q Consensus 64 Da~~~~~~t~~~~--~~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e--~~G~a~~ygd~~DiP~ 139 (382) .....+..++.+. -+|..+ .++|++.+........++.+..... .+..+.+.. ..+.+...+.+..+|- T Consensus 117 ~~~~~~~~~t~~g~~~iP~~~----~~~ii~~~~~~~~l~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~v~Eg~~~~~ 189 (415) T protein:vir:47 117 NDIQGGSLKTDSGFVVIPEEI----VTDILKLKEVEFNLDKYVTVKRVTN---GSGKYPVVRQSEVAALEKVEELEENPE 189 (415) T ss_pred hhhhhccccccCCcccccHHH----HHHHHHHHHhhhhhhhhcceeeccC---CceeEEEEEecCCcceeeccccccccc Confidence 1111112222233 255444 3467777666666777665543322 223344443 3345567788888886 Q ss_pred ee-eeeeeeEeeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCcc Q lcl|NC_017674. 140 TS-WNANFERRTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPA 218 (382) Q Consensus 140 vd-~~~~~~~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~ 218 (382) .+ ...+......+.++..+.+|.+=+ .....+|.+.-....++++.+.+|+-++.|+..+... .++......+. T Consensus 190 ~~~~~~~~v~~~~~k~~~~~~iS~ell---~ds~~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~--~~~~~~~~~~~ 264 (415) T protein:vir:47 190 LAVKPFFQLAYDINTHRGYFRISREAI---EDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTG--STSSGFEKEGK 264 (415) T ss_pred ccccceeeEEeeeeeeEeeehhhHHHH---hhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCCcc--ccccccccccc Confidence 54 578888889999999998888544 3345688888888899999999999999998543222 12222111111 Q ss_pred eeccCCCCccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhccc-cCCCCccHHH-HHHHhcC---- Q lcl|NC_017674. 219 FQTPPSQGWSTADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSV-TTPYGISVSD-WIEQTYP---- 292 (382) Q Consensus 219 ~~~~a~~~Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~-t~~~~~Tvl~-~l~~n~p---- 292 (382) . +. .+...-++||.+++.++...-. .+..++|.++.+..|.. .+..|.-++. -+....| T Consensus 265 ~-------~~-~~~~~~~~~i~~~~~~~~~~~~-------~~~~~v~n~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~l~ 329 (415) T protein:vir:47 265 K-------LE-VKKAKSLDDIKDAINLNVKPNY-------EHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLL 329 (415) T ss_pred e-------ec-cccccchHHHHHHHHhhhhhcc-------CCCEEEEcHHHHHHHHHhhccCCCeeeccCcCCCCCcccc Confidence 1 11 1111125667777776654321 23478999999988854 2333443321 0011111 Q ss_pred ccEEEEccccccccCCCCCceeEEEE-cchhhhhhhccccccchhhhhhhhhhhhcccceecCCceEeccccceeeeEee Q lcl|NC_017674. 293 KMRIVSAPELSGVQMKAQEPEDALVL-FVEDVNAAVDGSTDGGSVFSQLVQSKFITLGVEKRAKSYVEDFSNGTAGALCK 371 (382) Q Consensus 293 nl~i~~~peL~~a~g~g~~~~~~~~~-~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~~~~~~~~~~~t~Gv~i~ 371 (382) +..++..+.+- .... ++ ..+++- |.+-+.. .+. +-+. +........ ....-...|. |+.+. T Consensus 330 G~pV~~~~~~~-~~~~-~~-~~~~~gd~~~~~~~-~~~---------~~~~--v~~~~~~~~--~~~~~~~~r~-d~~v~ 391 (415) T protein:vir:47 330 GAKIEILPDEV-LGQK-GN-NTLIIGNLKDAIVL-FDR---------SQYQ--ASWTDYMHF--GECLMIAVRQ-DCRIL 391 (415) T ss_pred ceeeEEecccc-ccCC-Cc-cEEEEEehhccEEE-Eee---------cceE--EEeeccccC--ceEEEEEEEe-ccEEe Confidence 12233322221 1111 11 112211 1111100 000 0000 000010000 1112234454 66777 Q ss_pred ccchheeecCC Q lcl|NC_017674. 372 RPWAVVRYLGI 382 (382) Q Consensus 372 ~P~aia~~~GI 382 (382) +|.||++++-- T Consensus 392 ~~~a~~~~~~~ 402 (415) T protein:vir:47 392 DYKSAIVIEYD 402 (415) T ss_pred ccccEEEEEee Confidence 89999888644 No 64 >protein:vir:100247 Length: 425 # NCBI annotation: gp76 # Family: family:all:21 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355412;genbank:gi:77864702;genbank:GeneID:3725969 Probab=94.52 E-value=0.00061 Score=38.19 Aligned_cols=334 Identities=10% Similarity=0.026 Sum_probs=144.2 Q ss_pred CCCcceeeeecCccccccccccccchHHHHHHhhcce--eccccchh-hhhhhhcccccchhhhhhcccccCcccccchh Q lcl|NC_017674. 1 MSQISKTHSRLAGRNAKPFDLKNITNDAVASLSRIGL--VFDHAVVQ-DQIKALAKAGAFRSGSAMDSNFTAPVTTPSIP 77 (382) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~g~--~~~~~~~~-~~~~~~~~~~~~~~~~amDa~~~~~~t~~~~~ 77 (382) +.++.+.+.-+..++. . +.. .-..+.+... .-.....+ +..+++...... . -.-++... .++++.| T Consensus 71 ~~~~~~~~~ei~~~~~-~-----~~~-~~~~~~~~~~~~~~~~~~~~~~~~~af~~~l~~-~--e~~~al~~-~t~~~gG 139 (425) T protein:vir:10 71 LAKVDKVSADLEALQA-A-----VDE-ANIKIAAAQMGANGVKPLRDPEYTEAFKAHVKR-G--DVQAALNK-GEDSEGG 139 (425) T ss_pred HHHHHHHHHHHHHHHH-H-----HHH-HHHHHHhhhcccccccccccHHHHHHHHHHhhh-h--hhHHHhhc-CcCCCCc Confidence 1111111110000000 0 000 0000000000 00000000 000000000000 0 00011111 1223333 Q ss_pred --HHHHHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEEeeecccceeecccccCCceeee-eeeeeEeeEEEE Q lcl|NC_017674. 78 --TPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEPAGTAVEYGDHTNIPLTSW-NANFERRTIVRG 154 (382) Q Consensus 78 --~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e~~G~a~~ygd~~DiP~vd~-~~~~~~~~v~~~ 154 (382) +|-. +.++|++.+......+.+..+.+... ....+++......+.+.|.+..+|-.+. ..++..-..+.+ T Consensus 140 ~lvP~~----~~~~ii~~~~~~s~l~~l~~~~~~~~---~~~~~~~~~~~~~a~wv~E~~~~~~~~~~~f~~v~~~~~k~ 212 (425) T protein:vir:10 140 YLTPIE----WDRTITNKLVLISPMRQLCRVQPVSK---AGFSKLFNMGGTTSGWVGEASQRPQTNAATFQPLSFASGEI 212 (425) T ss_pred eeccHh----HHHHHHHHHHhhhhhhhhceeeeccC---CceEEEEEcCCcceeeeccccccccccccccceeeeeheee Confidence 4433 33466676665555666655443332 2345555555556667788888887764 678888888889 Q ss_pred EEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCcceeccCCC-Ccc----c Q lcl|NC_017674. 155 ELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAFQTPPSQ-GWS----T 229 (382) Q Consensus 155 ~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~~~~a~~-~Wa----~ 229 (382) +..+.+|.+=++ ....++.+.-......++...+|+-+++|+.. +...|+||++...+....... .+. . T Consensus 213 ~~~i~iS~ell~---ds~~~l~~~i~~~la~ai~~~~d~~~l~G~G~---~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~ 286 (425) T protein:vir:10 213 YANPAATQQILD---DAEIDLESWLATEVQTEFAKQEGKAFLAGDGT---NKPNGLLTYIAGGANAAKHPFGAIEVVNSG 286 (425) T ss_pred EeehHhHHHHHh---cchhHHHHHHHHHHHHHHHHHHHhhhhcccCC---CCcceeeecccccccccccccccccccccc Confidence 888888875443 44578889999999999999999999999742 456799998875433221111 111 1 Q ss_pred cCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhccc-cCCCCccHHHH-HHHhcC----ccEEEEccccc Q lcl|NC_017674. 230 ADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSV-TTPYGISVSDW-IEQTYP----KMRIVSAPELS 303 (382) Q Consensus 230 kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~-t~~~~~Tvl~~-l~~n~p----nl~i~~~peL~ 303 (382) .+..--++||.+++..|...- ..+ -+++|.++.+..|.. .+..|.-++.- +....| +..++....+. T Consensus 287 ~~~~~~~d~l~~l~~~l~~~~----~~~---a~~vmn~~~~~~L~~lkD~~G~~l~~~~~~~g~~~~l~G~PV~~~~~~p 359 (425) T protein:vir:10 287 AAADITSDGIIDLVYDLPSAF----TGN---ARFAMNRNTQRQVRKLKDGQGNYLWQPSYVAGQPATLAGYPVTEVPDMP 359 (425) T ss_pred ccccccHHHHHHHHhhhhhhh----ccC---CEEEEchHHHHHHHHhhcCCCceeeccCccCCCCceecceeeEEecCcC Confidence 223334667777776653321 112 267899998888853 34334333210 011011 11222222222 Q ss_pred cccCCCCCceeEEEE-cchhhhhhhccccccchhhhhhhhhhhhcccceecCCceEeccccceeeeEeeccchheeecCC Q lcl|NC_017674. 304 GVQMKAQEPEDALVL-FVEDVNAAVDGSTDGGSVFSQLVQSKFITLGVEKRAKSYVEDFSNGTAGALCKRPWAVVRYLGI 382 (382) Q Consensus 304 ~a~g~g~~~~~~~~~-~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~~~~~~~~~~~t~Gv~i~~P~aia~~~GI 382 (382) .. +++..-+++- +..-+. ..+ ..+ +. ....++. .......-...|.+| .++.|.||+.+..= T Consensus 360 ~~---~~~~~~i~~Gd~~~~~~-i~~--~~~---~~-v~~d~~~------~~~~~~~~~~~r~d~-~v~~~~A~~~l~~~ 422 (425) T protein:vir:10 360 DV---AANSTPILFGDFQQTYL-IID--RIG---VR-VLRDPYT------AKPYVLFYTTKRVGG-GLLNPEPMRAMKVA 422 (425) T ss_pred Cc---cCCccEEEEEehhccEE-EEE--ecc---eE-EEecccc------cCCcEEEEEEEEecc-EeecccceEEEEee Confidence 11 1111111111 111000 000 000 00 0011110 011122334445444 45558888664433 No 65 >protein:vir:1328 Length: 392 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047927;swissprot:trembl:q9zwv6;genbank:gi:9631145;uniprot:Q9ZWV6;genbank:GeneID:2715889 Probab=94.43 E-value=0.0015 Score=36.09 Aligned_cols=332 Identities=10% Similarity=0.021 Sum_probs=134.2 Q ss_pred CCCcceeeeecCccccc-----cc-----cccccch-------------HH---HHHHhhcceeccccchhhhhhhhccc Q lcl|NC_017674. 1 MSQISKTHSRLAGRNAK-----PF-----DLKNITN-------------DA---VASLSRIGLVFDHAVVQDQIKALAKA 54 (382) Q Consensus 1 ~~~~~~~~~~~~~~~~~-----~~-----~~~~~~~-------------~~---~~~l~~~g~~~~~~~~~~~~~~~~~~ 54 (382) |..+-. -...+... -+ +++.++. .. .+.+++.+-.-+...... ...+.+. T Consensus 20 ~~~l~~---~~~~~~~~~e~~~~~~~l~~e~~~l~~~i~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~r~ 95 (392) T protein:vir:13 20 LRSLTD---EFAGKEMTAEAREKEERLLTAVADFDGRIKRGIDAIKATDAVTSLLSGLQGSGSGAQRSADHD-DDAVLRA 95 (392) T ss_pred HHHHHH---HhhcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccCCcccchhhhhhHH-HHHHHhc Confidence 100000 00000000 00 0000000 00 000000000000000000 0000000 Q ss_pred cc----chhhhhhcccccCcccccchh-HHHHHHhhhhhhheeccccccc-hhhhCccccCCCcceeeEEEEeeecccce Q lcl|NC_017674. 55 GA----FRSGSAMDSNFTAPVTTPSIP-TPIQFLQTWLPGFVKVMTAARK-IDEIIGIDTVGSWEDQEIVQGIVEPAGTA 128 (382) Q Consensus 55 ~~----~~~~~amDa~~~~~~t~~~~~-~~~~~l~~idp~v~~~~~~~~~-~~~l~~v~t~g~~~~~t~t~~v~e~~G~a 128 (382) .. .....+... ....++++.+ ++-.+. . ++++.+..... .+.+..+.... ....+.+++....+.+ T Consensus 96 g~~~~~~~~~~~~~~--~~~t~~~~g~~~~~~~~---~-~~i~~~~~~~~~l~~~~~~~~~~--~~~~~~~~~~~~~~~a 167 (392) T protein:vir:13 96 GNLGEARSFEFAPEK--RDGTKAGNPNVLSRTLY---G-QLIAQAVERSAIMRGGASTFTTS--DANPMDFTVITGRATA 167 (392) T ss_pred cchhhhHHHHhhhhh--hcccccCCCccccccch---H-HHHHHHHhhhhhhhhcceeeecC--CCceeEEEEEcCCcce Confidence 00 000011111 1111112211 111111 1 11111111111 12222221111 1134566777777778 Q ss_pred eecccccCCceeeeeeeeeEeeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccce Q lcl|NC_017674. 129 VEYGDHTNIPLTSWNANFERRTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTY 208 (382) Q Consensus 129 ~~ygd~~DiP~vd~~~~~~~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~ 208 (382) .+.+.+..+|..+...+...-.++.++..+.+|.+=|+ ....++.+--....+.++.+.+|.-+++|+.. +... T Consensus 168 ~~v~E~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~---ds~~~l~~~i~~~l~~~i~~~~d~~~l~G~Gt---~~p~ 241 (392) T protein:vir:13 168 GIVGETAEIPESYPATTQRSMGGFKYGFASVVSYEFAT---DQVLDLVGFLVSDAGPAIGDAMGRHFLTGTGT---GQPR 241 (392) T ss_pred eeecccccccccccceeeEEeeeeeEEeeehhHHHHHh---cchHHHHHHHHHHHHHHHHHHHHHHHhcccCC---cccc Confidence 88899999999999999999999999999888876554 34557888888888888999999999999743 3457 Q ss_pred EEEeCCCCcceeccCCCCccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhccc-cCCCCccHHHH- Q lcl|NC_017674. 209 GFLNDPNLPAFQTPPSQGWSTADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSV-TTPYGISVSDW- 286 (382) Q Consensus 209 GllN~P~l~~~~~~a~~~Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~-t~~~~~Tvl~~- 286 (382) |+|+++..... ...|++++ .-.++||.+++..|-..- .. .-.+++.+..+..|.. .+..|.-++.- T Consensus 242 Gil~~~~~~~~----~~~~~~~~-~~~~d~l~~~~~~l~~~~----~~---~a~~v~n~~~~~~l~~lkd~~G~~l~~~~ 309 (392) T protein:vir:13 242 GILTDATGANA----AFGEADAD-SKVSDALIDLFHEVPSAY----RK---NAKFVVNDLRAAQMRKLKDANGQYLWQSA 309 (392) T ss_pred ccccccccccc----cccccccc-cccHHHHHHHHHhhhhhh----hc---CCEEEEcHHHHHHHHHhhccCCceeecCC Confidence 99987753211 11222111 112556666766653321 11 1257888888887753 33334322210 Q ss_pred HHHhcCccEEEEccccccccCCCCCceeEEE-Ecchhhhhhhccccccchhhhhhhhhhhhcc---cceecCCceEeccc Q lcl|NC_017674. 287 IEQTYPKMRIVSAPELSGVQMKAQEPEDALV-LFVEDVNAAVDGSTDGGSVFSQLVQSKFITL---GVEKRAKSYVEDFS 362 (382) Q Consensus 287 l~~n~pnl~i~~~peL~~a~g~g~~~~~~~~-~~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l---~~~~~~~~~~~~~~ 362 (382) +...-| -++...|=...... ..+. +++ .|. .+ ....+..+... -..........-+. T Consensus 310 ~~~g~~-~~l~G~Pv~~~~~~--~~~~-i~~Gdf~-~~--------------~i~~~~~~~i~~~~~~~~~~~~~~~r~~ 370 (392) T protein:vir:13 310 LTVGAP-DTFNGKVVETDDGM--PADK-VLFADLS-KY--------------RVRFAGSLRVDRSVDAKFSTDQIVYRFL 370 (392) T ss_pred cCCCCC-ceecceeeEEcCCC--CCCc-EEEeecc-ce--------------eEEeecceEEEeeccccccCCcEEEEEE Confidence 000001 01111111111000 0011 111 110 00 00000000000 00011122334455 Q ss_pred cceeeeEeeccchheeecCC Q lcl|NC_017674. 363 NGTAGALCKRPWAVVRYLGI 382 (382) Q Consensus 363 ~~t~Gv~i~~P~aia~~~GI 382 (382) .|.+| .+++|.||+.+..- T Consensus 371 ~r~d~-~~~~~~A~~~~~~~ 389 (392) T protein:vir:13 371 QRADG-LLVDARGAKVLTVT 389 (392) T ss_pred EEecc-EEecccceEEEEee Confidence 56554 47789998866655 No 66 >protein:vir:3613 Length: 272 # NCBI annotation: MHP # Family: family:all:522 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112699;genbank:gi:13786567;genbank:GeneID:921035 Probab=94.30 E-value=0.0048 Score=33.29 Aligned_cols=260 Identities=12% Similarity=0.047 Sum_probs=129.3 Q ss_pred cccccCcccccchhHHHHHHhhhhhhheeccccccchhhhCccccCCCc-ceeeEEEEeeecccceeecccccCCceeee Q lcl|NC_017674. 64 DSNFTAPVTTPSIPTPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSW-EDQEIVQGIVEPAGTAVEYGDHTNIPLTSW 142 (382) Q Consensus 64 Da~~~~~~t~~~~~~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~-~~~t~t~~v~e~~G~a~~ygd~~DiP~vd~ 142 (382) -|+. ....+++-+|.-+..++..++-+ ......+..++....- -=.+++++.++..|.+..+++++++|.-.. T Consensus 1 ma~~--~T~~~d~iiPev~~~~v~~~~~~----~~~~~~~~~~~~~l~g~~G~ti~iP~~~~~gda~~~~eg~~i~~~~l 74 (272) T protein:vir:36 1 MSKQ--KTTLADLVNPEVLAPIVSYELNK----ALRFAPLAQVDTTLQGQPGNTLKFPAFTYIGDAADVAEGGEISLDKI 74 (272) T ss_pred CCCc--ceehhhhhchHHHHHHHHHHHHh----hhhhccccccccccccCCCCEEEEeeeccCccccccCCCCccChhhc Confidence 1110 12334566677777776554422 2333444444433211 126889999999999999999999999999 Q ss_pred eeeeeEeeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCcceecc Q lcl|NC_017674. 143 NANFERRTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAFQTP 222 (382) Q Consensus 143 ~~~~~~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~~~~ 222 (382) +.......+...+-++++ .++.+++ .+-++..+-...+.+++...+++-.+ . .++- +.... T Consensus 75 t~~~~~~~i~~~~k~~~v--tD~~~~~-~~~d~~~~~~~~~a~~~a~~~d~~i~-~-----------~l~~----~~~~~ 135 (272) T protein:vir:36 75 GTTTKSVTIKKAAKGTEI--TDEAALS-GYGDPIGESNKQLGLSLANKVDDDLL-S-----------AAKT----TSQTV 135 (272) T ss_pred CCcceeEeeehhhccccc--cHHHHhh-ccchHHHHHHHHHHHHHHHHHHHHHH-H-----------Hhcc----ccccc Confidence 999999999887665555 4454444 34455555555555556666554222 1 1110 00000 Q ss_pred CCCCccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhccccCC-------CCccHHH-HHHHhcCcc Q lcl|NC_017674. 223 PSQGWSTADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSVTTP-------YGISVSD-WIEQTYPKM 294 (382) Q Consensus 223 a~~~Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~t~~-------~~~Tvl~-~l~~n~pnl 294 (382) + ...+ +++|.+++..+-..- ..+..++++|..+..|.+-.. .+..++. -.--.|-++ T Consensus 136 ~----~~~~----~d~i~~A~~~lgd~~-------~~~~~ivv~p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~~~G~ 200 (272) T protein:vir:36 136 S----TKAN----VDGVQAALDIFNDED-------AQAYVLIVNPKDAAKIRKDANAKNIGSEVGANALINGTYADVLGA 200 (272) T ss_pred c----cccc----HHHHHHHHHHhhhcC-------CCceEEEEcHHHHHHHhcccccccccccccccceeeeccceecCe Confidence 1 1123 445555655443221 135689999999998864211 1111100 000113355 Q ss_pred EEEEccccccccCCCCCceeEEEEcchhhhhhhccccccchhhhhhhhhhhhcccceecCCceEeccc-cceeeeEeecc Q lcl|NC_017674. 295 RIVSAPELSGVQMKAQEPEDALVLFVEDVNAAVDGSTDGGSVFSQLVQSKFITLGVEKRAKSYVEDFS-NGTAGALCKRP 373 (382) Q Consensus 295 ~i~~~peL~~a~g~g~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~~~~~~~~~-~~t~Gv~i~~P 373 (382) +|+.-..+-. ..+....|++.+.--.. +.+. .... ..++..+....... -..+|+-+.+| T Consensus 201 ~Vv~s~~~p~-----~~~~~~~~~~~~gA~~~----------~~~~-~~~v---E~~R~~~~~~d~i~~~~~y~~~v~~~ 261 (272) T protein:vir:36 201 QIVRSKKLAE-----GSALMFKIVSNSPALKL----------VLKR-GVQV---ETDRDIVTKTTVITADEHYAAYLYDL 261 (272) T ss_pred eEEEeCCCCC-----CceeEEEEEecccceee----------eecC-Cccc---ccccchhhcCcEEEEEEEEEEEEEcC Confidence 6654433321 11222333433322111 1000 0000 01111111111111 24579999999 Q ss_pred chheee--cCC Q lcl|NC_017674. 374 WAVVRY--LGI 382 (382) Q Consensus 374 ~aia~~--~GI 382 (382) .+++.+ .|+ T Consensus 262 ~~vv~~t~~g~ 272 (272) T protein:vir:36 262 TKVVNITFTGV 272 (272) T ss_pred ccEEEEeecCC Confidence 987765 688 No 67 >protein:vir:80930 Length: 278 # NCBI annotation: Cps # Family: family:all:522 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468392;genbank:gi:157324966;genbank:GeneID:5601363 Probab=93.94 E-value=0.0055 Score=32.95 Aligned_cols=263 Identities=12% Similarity=0.039 Sum_probs=132.2 Q ss_pred cccccCcccccchhHHHHHHhhhhhhheeccccccchhhhCccccC--CCcceeeEEEEeeecccceeecccccCCceee Q lcl|NC_017674. 64 DSNFTAPVTTPSIPTPIQFLQTWLPGFVKVMTAARKIDEIIGIDTV--GSWEDQEIVQGIVEPAGTAVEYGDHTNIPLTS 141 (382) Q Consensus 64 Da~~~~~~t~~~~~~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~--g~~~~~t~t~~v~e~~G~a~~ygd~~DiP~vd 141 (382) -|+ .....+++-+|..+..++.-++.+. .....+..++.. |.- -.+++++.++..|.+..|.++++++.-+ T Consensus 1 Ma~--~~T~~~~~iiPev~s~~v~~~~~~~----~v~~~~~~~~~~l~g~~-G~tv~ip~~~~~g~a~~~~~g~~i~~~~ 73 (278) T protein:vir:80 1 MAD--LTTKLANLIDPEVMGPMISAKLPKA----IKFGKIAPIDNSLEGQP-GSEITVPKYKYIGDAQDVAEGAAIDYSA 73 (278) T ss_pred CCC--cceehhheecHHHHHHHHHHHHHHh----hhhcccceecccccCCC-CCEEEEeeeccCCcceeecCCCcCcccc Confidence 111 0112234557777777776554332 222232222222 111 2688899999999999999999999999 Q ss_pred eeeeeeEeeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCcceec Q lcl|NC_017674. 142 WNANFERRTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAFQT 221 (382) Q Consensus 142 ~~~~~~~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~~~ 221 (382) .+.......+...+-+++++. +. +...+.++.++-...+.+++.+.+++..+-.. -|..+.. T Consensus 74 lt~~~~~~~i~~~~~a~~v~D--~~-~~~~~~d~~~~~~~~~a~~~a~~~d~~l~~~l--------~~a~~~~------- 135 (278) T protein:vir:80 74 LETESVKHGIKKAGKGVKLTD--ES-VLSGYGDPVEEAQKQIRMAIASKVDNDILEEA--------LTTTLEV------- 135 (278) T ss_pred cccceeeEeeehhhccccccH--HH-HhhccccHHHHHHHHHHHHHHHHHHHHHHHHH--------hcccccc------- Confidence 999999888888766655554 43 33457778888888888888888887554321 1211110 Q ss_pred cCCCCccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhccccC--------CCCccHH-HHHHHhcC Q lcl|NC_017674. 222 PPSQGWSTADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSVTT--------PYGISVS-DWIEQTYP 292 (382) Q Consensus 222 ~a~~~Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~t~--------~~~~Tvl-~~l~~n~p 292 (382) ++.....+.+..++.+.++...+.... + + .+..|+++|..+..|.+-. .++..++ .-.--.|- T Consensus 136 --~~~~t~~~~~~~~~~~~da~~~l~~~~---~-~--~~~~ivv~p~~~~~L~k~~~~~~~~~~~~g~~~~~~G~ig~~~ 207 (278) T protein:vir:80 136 --KGAINIGLIDKIENTFTDAPDAIEDES---I-T--TTGVLFLNYKDTAKLREEAAGSWTKASQLGDDLLVKGAFGELL 207 (278) T ss_pred --ccccccchhhhHHHHHHHHHHhhcccC---C-C--cccEEEECHHHHHHHHhhhhhhccccccccccceeeccceeec Confidence 001111233344555555544432221 1 1 1236889999988885321 1111110 00000122 Q ss_pred ccEEEEccccccccCCCCCceeEEEEcchhhhhhhccccccchhhhhhhhhhhhcccceecCCceEeccc-cceeeeEee Q lcl|NC_017674. 293 KMRIVSAPELSGVQMKAQEPEDALVLFVEDVNAAVDGSTDGGSVFSQLVQSKFITLGVEKRAKSYVEDFS-NGTAGALCK 371 (382) Q Consensus 293 nl~i~~~peL~~a~g~g~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~~~~~~~~~-~~t~Gv~i~ 371 (382) +++|+.-..+- .+ ..|++.+.--.. +... +..... .+..+....-.. -..+|+-+. T Consensus 208 G~~Vi~s~~~p-------~~--t~~l~~~gAi~~----------~~~~-~~~vE~---~Rd~~~~~d~i~~~~~yg~~v~ 264 (278) T protein:vir:80 208 GWEIVRTKKLA-------DG--NALAVKAGALKT----------FLKR-NLLAES---GRDMDHKLTKFNADQHYAVALV 264 (278) T ss_pred ceeEEEcCCCC-------cc--eEEEEeccceee----------eecC-Cccccc---ccchhhccceeeeeeEEEEEEE Confidence 34444333221 01 223333321110 1110 111111 111111111111 134699999 Q ss_pred ccchheeecCC Q lcl|NC_017674. 372 RPWAVVRYLGI 382 (382) Q Consensus 372 ~P~aia~~~GI 382 (382) +|.+++.+.=- T Consensus 265 ~~~~~v~it~~ 275 (278) T protein:vir:80 265 DETKAVKVVPV 275 (278) T ss_pred cCcceEEEeec Confidence 99999888766 No 68 >protein:vir:96833 Length: 275 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240157;genbank:gi:66395822;genbank:GeneID:5133174 Probab=93.86 E-value=0.0049 Score=33.24 Aligned_cols=255 Identities=14% Similarity=0.085 Sum_probs=128.9 Q ss_pred hhhcccccCcccccchhHHHHHHhhhhhhheeccccccchhhhCccccCCC-cceeeEEEEeeecccceeecccccCCce Q lcl|NC_017674. 61 SAMDSNFTAPVTTPSIPTPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGS-WEDQEIVQGIVEPAGTAVEYGDHTNIPL 139 (382) Q Consensus 61 ~amDa~~~~~~t~~~~~~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~-~~~~t~t~~v~e~~G~a~~ygd~~DiP~ 139 (382) |||=+. ...+++-+|..+..++..++-+ -.....+..+++... ---.+++++.++..|.+..|.++++++. T Consensus 1 ~~~~~~----T~l~d~i~PEv~~~~v~~~~~~----~~~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~ 72 (275) T protein:vir:96 1 MALENM----TKLANMVNPEVLAPMMQAELDK----KLKFAQFADIDNTLVGQPGNTITFPAFVYSGDAKVVPEGEEIPI 72 (275) T ss_pred CCCccc----chhhhhhchHHHHHHHHHHHHH----hhhhcccceecccccCCCCCEEEeeeeccCCccccccCCCCcch Confidence 444322 3345666788888887765532 233344443333311 1136899999999999999999999999 Q ss_pred eeeeeeeeEeeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCcce Q lcl|NC_017674. 140 TSWNANFERRTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAF 219 (382) Q Consensus 140 vd~~~~~~~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~ 219 (382) -.+........+...+-+++++.+ .+.+. +.++..+-...+..++.+.+++-.+ .. ++.-... T Consensus 73 ~~lt~~~~~~~i~~~~~~~~i~D~--~~~~~-~~d~~~~~~~~~a~~~a~~~d~~ll-~~-----------l~~a~~~-- 135 (275) T protein:vir:96 73 DLIETKKRQATIRKIGKGTVLTDE--ALLSG-YGDPKGEAVRQHGLAIANKVDNDVL-EA-----------LQGATLK-- 135 (275) T ss_pred hhcccceeeEEeehhcccccccHH--HHHhh-ccchHHHHHHHHHHHHHHHHHHHHH-HH-----------Hhccccc-- Confidence 999999999889887666666554 33343 4455555555566666666665432 21 1110111 Q ss_pred eccCCCCccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhccccC--------CCCccHHHHHHH-- Q lcl|NC_017674. 220 QTPPSQGWSTADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSVTT--------PYGISVSDWIEQ-- 289 (382) Q Consensus 220 ~~~a~~~Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~t~--------~~~~Tvl~~l~~-- 289 (382) +.++. -+ ++.|.+++..+-.. +..+..|+++|..+..|.+-. ..+-. .+.. T Consensus 136 -~~~~~----~~----~d~i~dA~~~lgd~-------~~~~~~ivv~p~~~~~L~k~~~~~f~~~~~~g~~---~~~~G~ 196 (275) T protein:vir:96 136 -VEADI----TK----LAGLQTAIDKFNDE-------DLEPMVLFVNPLDAGKLRASATDNFTRATLLGDN---VIVKGA 196 (275) T ss_pred -ccccc----cC----HHHHHHHHHHhccc-------cCCccEEEeCHHHHHHHHhccccccccccccccc---ceeccc Confidence 11111 12 44455555544221 124678999999999884321 11111 1111 Q ss_pred --hcCccEEEEccccccccCCCCCceeEEEEcchhhhhhhccccccchhhhhhhhhhhhcccceecCCceEecccc-cee Q lcl|NC_017674. 290 --TYPKMRIVSAPELSGVQMKAQEPEDALVLFVEDVNAAVDGSTDGGSVFSQLVQSKFITLGVEKRAKSYVEDFSN-GTA 366 (382) Q Consensus 290 --n~pnl~i~~~peL~~a~g~g~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~~~~~~~~~~-~t~ 366 (382) .|-+++|+.-..+. .+ ..|++.+.-- .+-..-+... ..++..+...-.... ..+ T Consensus 197 ig~~~G~~Vi~s~~~p-------~~--t~~i~~~gA~-----------~~~~~~~~~v---E~~Rd~~~~~d~i~~~~~y 253 (275) T protein:vir:96 197 FGEALGAIIVRSNKIK-------EG--EAILAKRGAV-----------KLITKRDFFL---ETERHASHKSTALFSDKHY 253 (275) T ss_pred cceecCeeEEEeCCCC-------cc--eEEEEeccce-----------eeeecCCccc---ccccchhhcCcEEEEeEEE Confidence 12344554333221 01 1233322211 1100000000 111111221111111 356 Q ss_pred eeEeeccchheee------cCC Q lcl|NC_017674. 367 GALCKRPWAVVRY------LGI 382 (382) Q Consensus 367 Gv~i~~P~aia~~------~GI 382 (382) |+-+.+|..++.+ +|+ T Consensus 254 ~~~~~~~~~vv~~t~~~~~~~~ 275 (275) T protein:vir:96 254 VAYLYDESKVVKITKSASGLGV 275 (275) T ss_pred EEEEEcCccEEEEEecccccCC Confidence 8899999888875 344 No 69 >protein:vir:485 Length: 407 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543092;swissprot:trembl:q8w627;genbank:gi:18249904;uniprot:Q8W627;genbank:GeneID:929693 Probab=93.62 E-value=0.003 Score=34.42 Aligned_cols=326 Identities=13% Similarity=0.061 Sum_probs=143.4 Q ss_pred CCCcceeeeecCccc---cccccc--cccc---hHHHHHHhhcceeccccchhhhhhhhcccccchhhhhhcccccCccc Q lcl|NC_017674. 1 MSQISKTHSRLAGRN---AKPFDL--KNIT---NDAVASLSRIGLVFDHAVVQDQIKALAKAGAFRSGSAMDSNFTAPVT 72 (382) Q Consensus 1 ~~~~~~~~~~~~~~~---~~~~~~--~~~~---~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~amDa~~~~~~t 72 (382) +..+-.....+.... -++... .+.. ...+..--+-|.. ..+.. ....+|.. .+ T Consensus 50 ~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~a~~~~l~~g~~----------~~~~~----~e~~a~~~-----~t 110 (407) T protein:vir:48 50 LAELENLKSDLEAELAEVKRPAGGTQNKVASEHKEAFIGFMRKGRE----------DGLRE----LERKALQV-----GN 110 (407) T ss_pred HHHHHHHHHHHHHHHHHhhccccccccchhhHHHHHHHHHHhccch----------hhhhH----HHHHhhhc-----cc Confidence 000000000000000 011100 0000 1111111111111 00000 00112211 12 Q ss_pred ccchh--HHHHHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEEeeecccceeecccccCCceee-eeeeeeEe Q lcl|NC_017674. 73 TPSIP--TPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEPAGTAVEYGDHTNIPLTS-WNANFERR 149 (382) Q Consensus 73 ~~~~~--~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e~~G~a~~ygd~~DiP~vd-~~~~~~~~ 149 (382) .++.| +|..+ .++|++.+...-..+.+..+.+.+. ....+++......+.+.+.+...|-.+ .......- T Consensus 111 ~~~gG~~iP~~~----~~~I~~~~~~~~~l~~~~~~~~~~~---~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~f~~i~~ 183 (407) T protein:vir:48 111 DEDGGYAIPEEL----DRTILTLLKDEVVMRQEATVITLGG---SDYKKLVNLGGTTSGWVGETDARPETATSKLGLIEP 183 (407) T ss_pred CCCCcccccHhH----HHHHHHHHHhhhhhhhhceeeecCC---CceEEEEecCCcceeeecccccccccccccceeEEe Confidence 22223 45443 4456655544444444444333322 345666666666677778777787654 46778888 Q ss_pred eEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCcceeccCCCC--- Q lcl|NC_017674. 150 TIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAFQTPPSQG--- 226 (382) Q Consensus 150 ~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~~~~a~~~--- 226 (382) .++.++..+.+|.+=++ ....++.+.-.....+++...+++-.++|+.. +...|+|+++.+.........+ T Consensus 184 ~~~k~~~~~~iS~ell~---ds~~~l~~~i~~~l~~~i~~~~~~a~l~G~G~---~~p~Gil~~~~~~~~~~~~~~~~~~ 257 (407) T protein:vir:48 184 FMGEIYGNPQATQKMLD---DAFFNVEDWINSELALEFAEQEEIAFTSGDGS---KKPKGFLAYESTDEDDKTRAFGKLQ 257 (407) T ss_pred eeeeeEeehhhHHHHHh---cchHHHHHHHHHHHHHHHHHHHHhhhhccCCC---Cccceeeeccccccccccccccccc Confidence 88999998988886543 34467888888888888899999999999854 3457999998765432211100 Q ss_pred -ccccCHHH-HHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhccc-cCCCCccHHHH-HHHhcC----ccEEEE Q lcl|NC_017674. 227 -WSTADWAG-IIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSV-TTPYGISVSDW-IEQTYP----KMRIVS 298 (382) Q Consensus 227 -Wa~kT~~e-I~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~-t~~~~~Tvl~~-l~~n~p----nl~i~~ 298 (382) -.+.+... -++||.+++..|... +..+ -.+++.+..+..|.. .+..|.-++.- +....| +..++. T Consensus 258 ~~~~~~~~~~~~d~i~~l~~~l~~~----~~~~---a~~v~n~~~~~~L~~lkD~~Gr~l~~~~~~~g~~~~l~G~PV~~ 330 (407) T protein:vir:48 258 HIASGAASGVTADAIIKLIYTLRKA----HRSG---AKFMMNNSSLFAIRLLKDNDGNYLWRPGIELGQPSSLAGYGIVE 330 (407) T ss_pred ccccccccccChHHHHHHHHhhchh----hhcC---CEEEEcHHHHHHHHHhhccCCceeeccCcCCCCCceecceeeEE Confidence 01111122 256777777766432 1122 157889988888853 23334333210 011111 111222 Q ss_pred ccccccccCCCCCceeEEEE-cchhhhhhhccccccchhhhhhhhhhhhcccceecCCceEeccccceeeeEeeccchhe Q lcl|NC_017674. 299 APELSGVQMKAQEPEDALVL-FVEDVNAAVDGSTDGGSVFSQLVQSKFITLGVEKRAKSYVEDFSNGTAGALCKRPWAVV 377 (382) Q Consensus 299 ~peL~~a~g~g~~~~~~~~~-~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~~~~~~~~~~~t~Gv~i~~P~aia 377 (382) ...+... +++...+++- +...+. ..+. .+ +.. ...++. ..-....-+..|.+| .+..|.||+ T Consensus 331 ~~~~p~~---~~~~~~i~~Gd~~~~~~-i~~~--~~---~~i-~~d~~~------~~~~~~~~~~~r~d~-~v~~~~a~~ 393 (407) T protein:vir:48 331 NEQMPDI---AADAKAIAFGNFKRGYT-IVDR--IG---TRI-LRDPYT------NKPFVGFYTTKRTGG-MLVDSQAIK 393 (407) T ss_pred ecCcCCc---cCCccEEEEEeccccEE-EEEe--ec---eEE-Eeeccc------cCCcEEEEEEEEecc-EEecccceE Confidence 2222111 1122112211 111110 0000 00 000 001110 112223345556655 455699987 Q ss_pred eecCC Q lcl|NC_017674. 378 RYLGI 382 (382) Q Consensus 378 ~~~GI 382 (382) .+..= T Consensus 394 ~l~~~ 398 (407) T protein:vir:48 394 LMKIG 398 (407) T ss_pred EEEee Confidence 65443 No 70 >protein:vir:4456 Length: 401 # NCBI annotation: Major capsid protein precursor # Family: family:all:21 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700379;genbank:gi:23505451;genbank:GeneID:955658 Probab=93.45 E-value=0.0022 Score=35.18 Aligned_cols=326 Identities=12% Similarity=0.017 Sum_probs=142.2 Q ss_pred CCC----cceeeeecCcccccccccc-c-cc---hHHHHHHhhcceeccccchhhhhhhhcccccchhhhhhcccccCcc Q lcl|NC_017674. 1 MSQ----ISKTHSRLAGRNAKPFDLK-N-IT---NDAVASLSRIGLVFDHAVVQDQIKALAKAGAFRSGSAMDSNFTAPV 71 (382) Q Consensus 1 ~~~----~~~~~~~~~~~~~~~~~~~-~-~~---~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~amDa~~~~~~ 71 (382) +++ ++...-..... -+|.... . .. ...|...-|-|.. .+.. .+ ...+|- .+.. T Consensus 51 ~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~e~~~a~~~~lr~~~~------~~~~-~~-------e~~a~~---~~~~ 112 (401) T protein:vir:44 51 LSELENLKSDLEKELLEL-KRPARGAQNKVAAEHKDAFVGFLRKGRE------DGLR-DL-------ERKALQ---VGTD 112 (401) T ss_pred HHHHHHHHHHHHHHHHHh-hccccccccchhHHHHHHHHHHHhhhhh------hhhH-HH-------HHHHhh---cCCC Confidence 110 00000000000 0111110 0 10 1112221111110 0000 00 001111 0111 Q ss_pred cccchhHHHHHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEEeeecccceeecccccCCceee-eeeeeeEee Q lcl|NC_017674. 72 TTPSIPTPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEPAGTAVEYGDHTNIPLTS-WNANFERRT 150 (382) Q Consensus 72 t~~~~~~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e~~G~a~~ygd~~DiP~vd-~~~~~~~~~ 150 (382) ..++.-+|..+. ++|++.+......+.+..+.+.+. ....+++......+.+.+.....|-.+ ...++..-. T Consensus 113 ~~GG~~iP~~~~----~~ii~~~~~~~~l~~~~~~~~~~~---~~~~~~~~~~~~~a~wv~E~~~~~~~~~~~~~~v~~~ 185 (401) T protein:vir:44 113 EDGGYAVPEELD----RSILSLLKDEVVMRQEATVITVGG---SDYKKLVNLGGTASGWVGETDTRSQTATSRLGLIEPF 185 (401) T ss_pred CCCceeccHhHH----HHHHHHHHhhhhhhhhceeeecCC---CceEEEEecCCccceeeccccccCccccccceeeeee Confidence 112223554443 466665554444455544433322 234455555545555667766777555 367777888 Q ss_pred EEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCcceeccCCCC---- Q lcl|NC_017674. 151 IVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAFQTPPSQG---- 226 (382) Q Consensus 151 v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~~~~a~~~---- 226 (382) ++.++..+.+|.+=+. ....++.+.-....+.++...++.-.++|+.. +...|+||.+...........+ T Consensus 186 ~~k~~~~~~iS~ell~---ds~~~l~~~i~~~la~ai~~~~~~~~l~G~G~---~~p~Gil~~~~~~~~~~~~~~~~~~~ 259 (401) T protein:vir:44 186 MGEIYGNPQATQKMLD---DAFFNVEAWINSELATEFAEQEEIAFTTGDGT---KKPKGFLAYESTEESDKARAFGKLQH 259 (401) T ss_pred hhheeeehhhhHHHHh---cchHHHHHHHHHHHHHHHHHHHHhhhhccCCC---Cccceeeccccccccccccccccccc Confidence 8888888888885544 34568888888899999999999999999853 3457999988765432211100 Q ss_pred ccccCHH-HHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhccc-cCCCCccHHHH-HHHhcC----ccEEEEc Q lcl|NC_017674. 227 WSTADWA-GIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSV-TTPYGISVSDW-IEQTYP----KMRIVSA 299 (382) Q Consensus 227 Wa~kT~~-eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~-t~~~~~Tvl~~-l~~n~p----nl~i~~~ 299 (382) =.+.+.. --++||.+++..|...- ..+ .++++.++.+..|.. .+..|.-++.- +...-| +.-++.. T Consensus 260 ~~t~~~~~~~~d~i~~~~~~l~~~~----~~~---a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~g~~~~l~G~PVv~~ 332 (401) T protein:vir:44 260 IVSGEATAVTADAIIKLIYTLRKAH----RTG---AKFMMNNNSLFAIRLLKDTEGNYLWRPGLELGQPSSLAGYGIAEN 332 (401) T ss_pred cccccccccCHHHHHHHHHhcchhh----hcC---CEEEEcHHHHHHHHHhhccCCceeecCCcCCCCCceecceeeEEe Confidence 0111111 22677777777664321 112 257899998888853 34344433210 111111 1112222 Q ss_pred cccccccCCCCCceeEEEE-cchhhhhhhccccccchhhhhhhhhhhhccc-ceecCCceEeccccceeeeEeeccchhe Q lcl|NC_017674. 300 PELSGVQMKAQEPEDALVL-FVEDVNAAVDGSTDGGSVFSQLVQSKFITLG-VEKRAKSYVEDFSNGTAGALCKRPWAVV 377 (382) Q Consensus 300 peL~~a~g~g~~~~~~~~~-~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~-~~~~~~~~~~~~~~~t~Gv~i~~P~aia 377 (382) ..+.. .++++.-+++- +...+. ..-+..+..+- .....-....-+..|.+|.+ ..|.||+ T Consensus 333 ~~~p~---~~~~~~~i~~Gd~~~~~~--------------i~~~~~~~~~~~~~~~~~~v~~~a~~r~d~~~-~~~~a~~ 394 (401) T protein:vir:44 333 EQMPD---IAADAKAIAFGNFKRGYT--------------IVDRIGTRILRDPYTNKPFVGFYTTKRTGGML-VDSQAIK 394 (401) T ss_pred cCcCC---ccCCccEEEEeehhccEE--------------EEEecceEEeeeccccCCcEEEEEEEEeccEE-ecccceE Confidence 11111 11222111111 111100 00011111100 00011223334444555544 4488887 Q ss_pred eecCC Q lcl|NC_017674. 378 RYLGI 382 (382) Q Consensus 378 ~~~GI 382 (382) .+..= T Consensus 395 ~l~~~ 399 (401) T protein:vir:44 395 LLKIA 399 (401) T ss_pred EEEee Confidence 65444 No 71 >protein:vir:96762 Length: 632 # NCBI annotation: putative phage-related protein # Family: family:all:21 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039818;genbank:gi:126010917;genbank:GeneID:5076272 Probab=93.37 E-value=0.0069 Score=32.42 Aligned_cols=334 Identities=10% Similarity=0.038 Sum_probs=137.6 Q ss_pred CCCcceeeeecCccccccc----cc-cccch-----HH------HHHHhhcc---eeccccchhhhhhhhcccccchhhh Q lcl|NC_017674. 1 MSQISKTHSRLAGRNAKPF----DL-KNITN-----DA------VASLSRIG---LVFDHAVVQDQIKALAKAGAFRSGS 61 (382) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~----~~-~~~~~-----~~------~~~l~~~g---~~~~~~~~~~~~~~~~~~~~~~~~~ 61 (382) |.+.+..+..-.+....+- +. ..+.. .. ++.+.+-. -.+.............+. +....+ T Consensus 269 l~~~~~a~~~~~~a~~~~~~~~~~~~~~i~~~~re~~~~~l~rai~a~a~~~~~~a~~~~e~a~~~a~~~G~~-arg~~~ 347 (632) T protein:vir:96 269 MNPGQPGNFEKPGAGDLPGKPAIHSARDLGIQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGKE-ARGFYM 347 (632) T ss_pred HhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHHhhhccchhhhhhhhHHHHHHHHhhhhh-hhhhhh Confidence 3332222211111100000 00 00000 00 00000000 000000000111110000 000000 Q ss_pred hhcc---cccCcccccchh--HHHHHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEEeeecccceeecccccC Q lcl|NC_017674. 62 AMDS---NFTAPVTTPSIP--TPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEPAGTAVEYGDHTN 136 (382) Q Consensus 62 amDa---~~~~~~t~~~~~--~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e~~G~a~~ygd~~D 136 (382) .++. ......|+.+.| +|-.++. ..|++.+.+..-++.+ +... -+-....++++.....+.+.+.|.... T Consensus 348 ~~~~l~~ra~~~~t~~~gg~lvp~~~~~---~~iie~lr~~s~i~~l-~~~~-~~~~~g~~~ip~~~~~~~a~wv~E~~~ 422 (632) T protein:vir:96 348 PHEVLVQRQLEKKTAGKGGELVATELLS---EEFIDILRNKAIIGQM-GARM-LPGLVGDVDIPKKTSGANFYWIGEDED 422 (632) T ss_pred hHHHHHHhhhhcccccccccccccccch---HHHHHHHhhcchhhhh-cceE-eecCCcceEEEEEeCCceeEeecCCcc Confidence 1110 000011222233 3322221 2455554433333332 2111 011123466777777777778888888 Q ss_pred CceeeeeeeeeEeeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCC Q lcl|NC_017674. 137 IPLTSWNANFERRTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNL 216 (382) Q Consensus 137 iP~vd~~~~~~~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l 216 (382) +|..+...+...-..+.++..+.+|.+=|.. ...++.+.-......++...+++-+++|+.. .+...|++|..++ T Consensus 423 ~~~s~~~f~~i~l~~~k~~~~v~iS~ell~d---s~~~~~~~i~~~l~~a~~~~~d~a~l~G~G~--~~~p~Gi~~~~~~ 497 (632) T protein:vir:96 423 VQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQ---SSIHVENLIREDLIEGIGVALDLAMLTGTGL--ANDPVGLLNMTGV 497 (632) T ss_pred ccccccceeeEEeeeeEEEEehhhHHHHHhc---cchHHHHHHHHHHHHHHHHHHHHHhhcccCC--CCccceeeecccc Confidence 9999988888898999999988888754432 3567888888888889999999999999743 3446799998887 Q ss_pred cceeccCCCCccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhccc---cCCCCccHHH--HHHHhc Q lcl|NC_017674. 217 PAFQTPPSQGWSTADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSV---TTPYGISVSD--WIEQTY 291 (382) Q Consensus 217 ~~~~~~a~~~Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~---t~~~~~Tvl~--~l~~n~ 291 (382) ++...+. +..| ++||.++...+...... . .+...++.+..+..|.. .+..|.-+++ .| .-| T Consensus 498 ~~~~~~~----~~~~----~~~i~~~~~~i~~~~~~---~--~~~~~~~~~~~~~~l~~~~l~d~~G~~i~~~~~l-~G~ 563 (632) T protein:vir:96 498 PALTYPA----GGVD----WASVVDMETKISTFNAD---A--GRLAYLTSVTQRGAAKKAQVFDNTGERIWQNNEV-NGY 563 (632) T ss_pred cceeccc----ccCC----HHHHHHHHHHHhhcccc---c--CccEEEEchhHHHHHHHHhccCCCCceeecCCee-ccc Confidence 6432221 1122 34566666665544321 1 12356788777666642 2333333321 00 012 Q ss_pred CccEEEEccccccccCCCCCceeEEEEcchhhhhhhccccccchhhhhhhhhhhhcccceecCCceEeccccceeeeEee Q lcl|NC_017674. 292 PKMRIVSAPELSGVQMKAQEPEDALVLFVEDVNAAVDGSTDGGSVFSQLVQSKFITLGVEKRAKSYVEDFSNGTAGALCK 371 (382) Q Consensus 292 pnl~i~~~peL~~a~g~g~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~~~~~~~~~~~t~Gv~i~ 371 (382) |-+.-..+|.-....|.. ...++-....+.-.+ -| +.. .......+-+..+ .++-++ T Consensus 564 pv~~s~~ip~~~~~~gd~---s~~~i~~~~~~~i~~-------------~~--~~~----~~~~~v~~~~~~~-~d~~v~ 620 (632) T protein:vir:96 564 RAEASNQIPADTWIFGDW---SQIVIAMWGVLDLKV-------------DP--YTK----AASDGLVLRVFQD-VDAGVR 620 (632) T ss_pred ceEeccccccCcEEEeec---ceEEEEEecceEEEE-------------cc--ccc----cccCceEEEEEee-cCceee Confidence 211111122111110100 011110001100000 00 000 0011111122222 355677 Q ss_pred ccchheeecCC Q lcl|NC_017674. 372 RPWAVVRYLGI 382 (382) Q Consensus 372 ~P~aia~~~GI 382 (382) +|.+|+...== T Consensus 621 ~~~af~~~k~~ 631 (632) T protein:vir:96 621 RKEAFCIAKKG 631 (632) T ss_pred chhhhhheeec Confidence 78777643222 No 72 >protein:vir:6212 Length: 434 # NCBI annotation: prohead protease # Family: family:all:21 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852592;genbank:gi:31415852;genbank:GeneID:1489210 Probab=93.11 E-value=0.0012 Score=36.56 Aligned_cols=332 Identities=10% Similarity=-0.010 Sum_probs=144.4 Q ss_pred CCCc---ceee----e-ecCccccc-cccccccc-----hHHHHHHh----hcceecccc--chh-hhhhh----hcccc Q lcl|NC_017674. 1 MSQI---SKTH----S-RLAGRNAK-PFDLKNIT-----NDAVASLS----RIGLVFDHA--VVQ-DQIKA----LAKAG 55 (382) Q Consensus 1 ~~~~---~~~~----~-~~~~~~~~-~~~~~~~~-----~~~~~~l~----~~g~~~~~~--~~~-~~~~~----~~~~~ 55 (382) |..+ .+.- . .-.++... +...+... ..+.+... .-+....+. ... ..-+. +.... T Consensus 56 i~~le~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~e~r~a~~~~l~~~~ 135 (434) T protein:vir:62 56 LAKLEEKEKEEDPAKKKDDDPEKKEDPTAKENPNEKTELSEEQRSAISASIAAALSTKGHRTNKETEIRSVFANYIVGNI 135 (434) T ss_pred HHHHHHHHHHHHHHhhhcchhhhhcchhhhcchhhhHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHHHHHHhcccc Confidence 0000 0000 0 00000000 00000000 00000000 000000000 000 00000 00000 Q ss_pred cchhhhhhcccccCcccccchh--HHHHHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEEeeecccceeec-- Q lcl|NC_017674. 56 AFRSGSAMDSNFTAPVTTPSIP--TPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEPAGTAVEY-- 131 (382) Q Consensus 56 ~~~~~~amDa~~~~~~t~~~~~--~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e~~G~a~~y-- 131 (382) ......++ ..++++.| +|-.+ ...|++.+......+.+..+...+ ..+.|++....+.+... T Consensus 136 ~~~e~~a~------~~~t~~GG~lvP~~~----~~~Ii~~l~~~~~i~~~~~~~~~~----~~~~~p~~~~~~~a~~~~~ 201 (434) T protein:vir:62 136 DEKEARAL------GLVTGNGSVTIPDFL----SKEIITYAQEENFLRRLGTGVKTK----ENIKYPVLVKKAEAQGHKN 201 (434) T ss_pred chhhhhhh------cccccccceecchhh----HHHHHHhhhhhhhhhhhcceeccC----CceEEEEEecCCcccceec Confidence 00001111 12223344 44433 345655554444444444332222 23567776655555433 Q ss_pred -ccccCCceeeeeeeeeEeeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEE Q lcl|NC_017674. 132 -GDHTNIPLTSWNANFERRTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGF 210 (382) Q Consensus 132 -gd~~DiP~vd~~~~~~~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~Gl 210 (382) +...+.|..+.......-.++.++.-+.+|.+=|. ....++.+.-......++...+++-.++|+..+ ...-|+ T Consensus 202 ~~e~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~---ds~~~l~~~i~~~la~~~~~~~d~~~l~G~G~~--~~~~g~ 276 (434) T protein:vir:62 202 ERTNNEMPETDIEFDEIELSPTEFDALATVTKKLLA---RTGLPIEQIVMDELKKAYVRKETQYMVNGDEAN--NINDGA 276 (434) T ss_pred ccccccccccccceeeEEeeheeeEeehhhHHHHHh---cchHHHHHHHHHHHHHHHHHHHHHHHhccCCCC--ccccce Confidence 33567788888888899999999998888875443 345678888888999999999999999998543 334588 Q ss_pred EeCCCCcceeccCCCCccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhccc-cCCCCccHHH-HHH Q lcl|NC_017674. 211 LNDPNLPAFQTPPSQGWSTADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSV-TTPYGISVSD-WIE 288 (382) Q Consensus 211 lN~P~l~~~~~~a~~~Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~-t~~~~~Tvl~-~l~ 288 (382) ++.+.+.... +....++||.+++.++...- ..+. .++|.+..+..|.. .+..|.-++. ... T Consensus 277 ~~~~~~~~~~----------~~~~~~d~l~~l~~~l~~~~----~~~a---~~v~n~~~~~~L~~lkd~~G~~l~~~~~~ 339 (434) T protein:vir:62 277 LAKKAVEFKT----------DEKNLYDALVKMKNTPVKEV----RKKA---RWVLNTAALTKIETMKTDDGFPLLRPFNQ 339 (434) T ss_pred eecccccccc----------cccchhhHHHHHHhhcchhh----hcCC---EEEEcHHHHHHHHHhhccCCCEeeccCCC Confidence 8776653221 11123567777777664322 1221 56888888888854 3433443332 110 Q ss_pred Hh--cC----ccEEEEccccccccCCCCCceeEEE-Ecchhhhhhhccccccchhhhhhhhhhhhcccc-eecCCceEec Q lcl|NC_017674. 289 QT--YP----KMRIVSAPELSGVQMKAQEPEDALV-LFVEDVNAAVDGSTDGGSVFSQLVQSKFITLGV-EKRAKSYVED 360 (382) Q Consensus 289 ~n--~p----nl~i~~~peL~~a~g~g~~~~~~~~-~~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~-~~~~~~~~~~ 360 (382) -+ .| +..++....+.. .+.++. ..+++ .|..-+ .+..-+ +..+..+.. .......-.- T Consensus 340 ~~~g~~~tl~G~pV~~~~~~~~-~~~~~~-~~i~~Gdfs~~~----i~~~~g--------~~~i~~~~~~~~~~~~v~~~ 405 (434) T protein:vir:62 340 AEGGIGYTLLGFPVEEEDAIDI-PDSPDT-PVFYFGDFSKFY----IQDVIG--------SLEVQKLVELFSRTNRVGFR 405 (434) T ss_pred ccCCCCceecceeeEEecCccC-ccCCCc-eEEEEeeccceE----EEEeec--------eeEEEeehhhhcccCceEEE Confidence 00 01 122333322221 111111 11111 111100 000000 001111100 0012223345 Q ss_pred cccceeeeEeeccchheeecCC Q lcl|NC_017674. 361 FSNGTAGALCKRPWAVVRYLGI 382 (382) Q Consensus 361 ~~~~t~Gv~i~~P~aia~~~GI 382 (382) +..|..|-.||.|.+++-..+. T Consensus 406 ~~~r~Dgk~i~~~~~~~~~~~~ 427 (434) T protein:vir:62 406 IWNLLDAQLIHSPFEVPVYKYV 427 (434) T ss_pred EEeeecceeecCcccceEEEEE Confidence 6677888889999999877555 No 73 >protein:vir:96123 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240078;genbank:gi:66395742;genbank:GeneID:5133103 Probab=92.91 E-value=0.0095 Score=31.67 Aligned_cols=257 Identities=11% Similarity=0.021 Sum_probs=129.9 Q ss_pred cccccCcccccchhHHHHHHhhhhhhheeccccccchhhhCccccCCCc-ceeeEEEEeeecccceeecccccCCceeee Q lcl|NC_017674. 64 DSNFTAPVTTPSIPTPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSW-EDQEIVQGIVEPAGTAVEYGDHTNIPLTSW 142 (382) Q Consensus 64 Da~~~~~~t~~~~~~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~-~~~t~t~~v~e~~G~a~~ygd~~DiP~vd~ 142 (382) -|+.+ ...+++-+|..+..++..++- ..+....+..++++..- --.+++++.++..|.+..|.+++++|.-++ T Consensus 1 ma~~~--T~~~d~i~Pev~s~~v~~~~~----~~~~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~~~~~g~~i~~~~i 74 (274) T protein:vir:96 1 MAQGT--TKVSNLIVPEVLAPMMQAELD----KKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQI 74 (274) T ss_pred CCccc--cchhhhhhhHHHHHHHHHHHH----hhhhhcccccccccccCCCCCEEEEEeeccCCCccccCCCCcCchhhc Confidence 11101 223456678777777765543 33344445544443211 125899999999999999999999999999 Q ss_pred eeeeeEeeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCcceecc Q lcl|NC_017674. 143 NANFERRTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAFQTP 222 (382) Q Consensus 143 ~~~~~~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~~~~ 222 (382) ........+...+-+++++. +.+++ .+.++-.+....+..++.+.+++..+-- ++.-.. ... T Consensus 75 t~~~~~~~i~~~~~~~~i~D--~~~~~-~~~d~~~~~~~~~~~~~a~~~d~~i~~~------------l~~a~~---~~~ 136 (274) T protein:vir:96 75 GTSKREAKVRKIGKGTELTD--EAVLS-GFGDPQGEAVRQHGLAIANKVDNDVLEA------------LKGATL---TVE 136 (274) T ss_pred ccceeEEEEEeeeceeeecH--HHHHh-hcchHHHHHHHHHHHHHHHHHHHHHHHH------------HhcCCC---CcC Confidence 99999888888766666655 44434 4556667777777777777777644321 110000 011 Q ss_pred CCCCccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhccccC--------CCCccHH-HHHHHhcCc Q lcl|NC_017674. 223 PSQGWSTADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSVTT--------PYGISVS-DWIEQTYPK 293 (382) Q Consensus 223 a~~~Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~t~--------~~~~Tvl-~~l~~n~pn 293 (382) ++.. + ++.|.++...+-.. +..+..|+++|..+..|.+-+ +.|..++ .-.--+|-+ T Consensus 137 ~~~~----~----~d~i~dA~~~l~d~-------~~~~~~ivv~p~~~~~L~k~~~~~f~~~~~~g~~~~~~g~ig~~~G 201 (274) T protein:vir:96 137 ADIT----K----LDGLQTAIDKFNDE-------DLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALG 201 (274) T ss_pred cccc----c----HHHHHHHHHHhccc-------CCCceEEEeCHHHHHHHHhcccccccccccccccceeecccceecC Confidence 1111 2 44555555554322 124678999999999885421 1111110 000001223 Q ss_pred cEEEEccccccccCCCCCceeEEEEcchhhhhhhccccccchhhhhhhhhhhhcccceecCCceEecccc-ceeeeEeec Q lcl|NC_017674. 294 MRIVSAPELSGVQMKAQEPEDALVLFVEDVNAAVDGSTDGGSVFSQLVQSKFITLGVEKRAKSYVEDFSN-GTAGALCKR 372 (382) Q Consensus 294 l~i~~~peL~~a~g~g~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~~~~~~~~~~-~t~Gv~i~~ 372 (382) ++|+.-+.+-. ...|++.+.--. +-...+.... .++..+........ ..+|+-+.+ T Consensus 202 ~~Vi~s~~~p~---------~t~~l~~~gA~~-----------~~~~~~~~vE---~~Rd~~~~~d~i~~~~~yg~~~~~ 258 (274) T protein:vir:96 202 AVIVRSNKLNK---------GEALLAKKGAVK-----------LITKRDFFLE---KDRDASRKSTALYSDKHYVAYLYD 258 (274) T ss_pred eeEEEcCCCCc---------ceEEEEeCccee-----------eeecCCcccc---cccchhhcccEEEEeeEEEEEEEc Confidence 44443222210 012233221111 1000010011 11111111112222 247999999 Q ss_pred cchheeecCC Q lcl|NC_017674. 373 PWAVVRYLGI 382 (382) Q Consensus 373 P~aia~~~GI 382 (382) |.+++.+.== T Consensus 259 ~~~vv~~t~~ 268 (274) T protein:vir:96 259 ESKVVKITKG 268 (274) T ss_pred CccEEEEEcC Confidence 9887766543 No 74 >protein:vir:9410 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803388;genbank:gi:29028700;genbank:GeneID:1258136 Probab=92.75 E-value=0.0071 Score=32.36 Aligned_cols=333 Identities=9% Similarity=0.010 Sum_probs=140.2 Q ss_pred CCCcc-eeeeecCccccccccccccchHHHHHHhhcceeccccchhhh-hhhhcccccchhhhhhcccccCccccc--ch Q lcl|NC_017674. 1 MSQIS-KTHSRLAGRNAKPFDLKNITNDAVASLSRIGLVFDHAVVQDQ-IKALAKAGAFRSGSAMDSNFTAPVTTP--SI 76 (382) Q Consensus 1 ~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~-~~~~~~~~~~~~~~amDa~~~~~~t~~--~~ 76 (382) +.+.. +.+.....++....... ............+..+.....+.. .+.+.+.. ....+.. .+..++. +. T Consensus 58 ~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~----~~~~~~~-~~~~~~~~g~~ 131 (415) T protein:vir:94 58 LDKLKEKDGTSENNQQSVEVNEA-STYRNQANINDLGISIQNTKVTSQEVRDFTEYL----ETRNDIQ-GGSLKTDSGFV 131 (415) T ss_pred HHHHHHHHHhhhhccccccccch-hhHHHHHHHHHHHhhhhhhhhhHHHHHHHHHHh----hhhhhhh-hhccccccccc Confidence 00000 00000000000000000 000111111111111111110000 00000000 0001111 1112222 23 Q ss_pred hHHHHHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEEeeecccceeecccccCCcee-eeeeeeeEeeEEEEE Q lcl|NC_017674. 77 PTPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEPAGTAVEYGDHTNIPLT-SWNANFERRTIVRGE 155 (382) Q Consensus 77 ~~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e~~G~a~~ygd~~DiP~v-d~~~~~~~~~v~~~~ 155 (382) -+|.. +.++|++.+........++.+..... ....+.+......+.+...+.+.++|-. ....+.....++.++ T Consensus 132 ~iP~~----~~~~ii~~~~~~~~l~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~~~i~~~~~k~~ 206 (415) T protein:vir:94 132 VIPEE----IVTDILKLKEVEFNLDKYVTVKRVTN-GSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHR 206 (415) T ss_pred cCcHH----HHHHHHHHHHhhhhhhhhcceeeccC-CceeEEEEeecCCccceeccccccccccccccceeeEeeheeee Confidence 35533 44577777777777777766544321 1122333333444556677888888854 457888889999999 Q ss_pred EEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCcceeccCCCCccccCHHHH Q lcl|NC_017674. 156 LGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAFQTPPSQGWSTADWAGI 235 (382) Q Consensus 156 ~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~~~~a~~~Wa~kT~~eI 235 (382) ..+.+|.+=++ ....++.+.-....++++...+|+-++.|+..+... .+..+.......... + ...+ T Consensus 207 ~~~~is~ell~---ds~~~~~~~i~~~l~~~~~~~~~~~il~g~g~g~~~--~~~~~~~~~~~~~~~--~--~~~~---- 273 (415) T protein:vir:94 207 GYFRISREAIE---DAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTG--STSSGFEKEGKKLEV--K--KAKS---- 273 (415) T ss_pred eechhhHHHHh---hchHHHHHHHHHHHHHHHHHHHHHHHhhccccCccc--ccccccccccccccc--c--cccc---- Confidence 99988885333 334678888888888888999999999987554322 222222111111111 1 1122 Q ss_pred HHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhccc-cCCCCccHHH-HHHHhcC----ccEEEEccccccccCCC Q lcl|NC_017674. 236 IGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSV-TTPYGISVSD-WIEQTYP----KMRIVSAPELSGVQMKA 309 (382) Q Consensus 236 ~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~-t~~~~~Tvl~-~l~~n~p----nl~i~~~peL~~a~g~g 309 (382) ++||.+++..+...- . .+..++|.|+.+..|.. .+..|.-++. -+....| +..++..+.+- .+..+ T Consensus 274 ~~~i~~~~~~~~~~~---~----~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~-~~~~~ 345 (415) T protein:vir:94 274 LDDIKDAINLNVKPN---Y----EHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEV-LGQKG 345 (415) T ss_pred hHHHHHHHHhhhhhc---c----CCCEEEEcHHHHHHHHHhhccCCCeeeccCcCCCCCceecceeeEEecccc-cCCCC Confidence 567777777654321 1 24578999999988864 3444443321 0110011 12233333221 11111 Q ss_pred CCceeEEEE-cchhhhhhhccccccchhhhhhhhhhhhcccceecCCceEeccccceeeeEeeccchheeecCC Q lcl|NC_017674. 310 QEPEDALVL-FVEDVNAAVDGSTDGGSVFSQLVQSKFITLGVEKRAKSYVEDFSNGTAGALCKRPWAVVRYLGI 382 (382) Q Consensus 310 ~~~~~~~~~-~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~~~~~~~~~~~t~Gv~i~~P~aia~~~GI 382 (382) ...+++- +.+-+.. .+. ....+. ..+|.. -....-...|. ++.+.+|.||++..-- T Consensus 346 --~~~i~~gd~~~~~~~-~~~---~~~~v~---~~~~~~-------~~~~~r~~~r~-d~~~~~~~a~~~~~~~ 402 (415) T protein:vir:94 346 --NNTLIIGNLKDAIVL-FDR---SQYQAS---WTDYMH-------FGECLMIAVRQ-DCRILDYKSAIVIEYD 402 (415) T ss_pred --ccEEEEEehhccEEE-Eee---cceEEE---Eecccc-------CceEEEEEEEe-ccEEeccccEEEEEEe Confidence 1112221 1111110 000 000000 001100 01111223343 5667789999888644 No 75 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=92.49 E-value=0.011 Score=31.28 Aligned_cols=256 Identities=9% Similarity=0.019 Sum_probs=131.1 Q ss_pred cccccCcccccchhHHHHHHhhhhhhheeccccccchhhhCccccCCCc-ceeeEEEEeeecccceeecccccCCceeee Q lcl|NC_017674. 64 DSNFTAPVTTPSIPTPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSW-EDQEIVQGIVEPAGTAVEYGDHTNIPLTSW 142 (382) Q Consensus 64 Da~~~~~~t~~~~~~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~-~~~t~t~~v~e~~G~a~~ygd~~DiP~vd~ 142 (382) -|... .+.+++-+|..+..++.-++-+. .....+.-++....- --.++.++.++..|.+..++++.++|..+. T Consensus 1 MA~~~--T~~~~~~iPev~s~~v~~~~~~~----~~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~a~~v~eg~~i~~~~~ 74 (272) T protein:vir:30 1 MAVGT--TKMAQMLDPEVLADMIDAEVGKA----IRFAPLAEVDTTLEGQPGTTLTVPKWDYIGDAEDVAEGEAIPMTQL 74 (272) T ss_pred CCCcc--ccchheechHHHHHHHHHHHHHH----hhhhccccccccccCCCCCEEEEEEecCCCCcccccCCCccccccc Confidence 11101 22334557777777765444322 222222222221110 114788899998999999999999999999 Q ss_pred eeeeeEeeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCcceecc Q lcl|NC_017674. 143 NANFERRTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAFQTP 222 (382) Q Consensus 143 ~~~~~~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~~~~ 222 (382) ..+.....+..++..+.++.++... ...++.+.-...+.+++.+.+++..+ +-. . |- +.+. T Consensus 75 ~~~~~~~~~~~~~~~~~itd~~~~~---s~~d~~~~~~~~~~~~~a~~~d~~i~-~~~----~---~a--------~~~~ 135 (272) T protein:vir:30 75 GFKKTTMTIKKAGKGVEITDEAILS---GYGDPVGQAAKQIVEAIDHKVDADVL-DAL----S---KS--------TQTV 135 (272) T ss_pred ccceEEEEeeeeeeeeeecHHHHhh---ccccHHHHHHHHHHHHHHHHHHHHHH-HHh----c---cc--------cccc Confidence 9999999999999998988776543 45577777777777777777775433 210 1 11 1111 Q ss_pred CCCCccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhcccc---C--CCCccHHHHHHH----hcCc Q lcl|NC_017674. 223 PSQGWSTADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSVT---T--PYGISVSDWIEQ----TYPK 293 (382) Q Consensus 223 a~~~Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~t---~--~~~~Tvl~~l~~----n~pn 293 (382) .+. .| +++|.+++..+-..- ..+..++++|..+..|.+. + ..+......+.. ++-+ T Consensus 136 ~~~----~t----~d~i~da~~~l~~~~-------~~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~g~ig~i~G 200 (272) T protein:vir:30 136 EAT----AT----VDGVSKALDIFNDED-------DAETVIVMNPADASTLRLDAAKEWLGATEVGANRVVSGVYGEVLG 200 (272) T ss_pred ccc----cC----HHHHHHHHHHHhccC-------CCccEEEEcHHHHHHHHHhccccccccccccccccccccchhhcC Confidence 111 12 456666666553221 1356899999998887432 1 101111111111 1234 Q ss_pred cEEEEccccccccCCCCCceeEEEEcchhhhhhhccccccchhhhhhhhhhhhcccceecCCceEecccc-ceeeeEeec Q lcl|NC_017674. 294 MRIVSAPELSGVQMKAQEPEDALVLFVEDVNAAVDGSTDGGSVFSQLVQSKFITLGVEKRAKSYVEDFSN-GTAGALCKR 372 (382) Q Consensus 294 l~i~~~peL~~a~g~g~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~~~~~~~~~~-~t~Gv~i~~ 372 (382) ++++.-+-+.. + ..|++.+.--.... ..... ...++.......-... +..|+-+.+ T Consensus 201 ~~Vi~s~~~p~-------~--t~~~~~~~a~~~~~-------------~~~~~-ve~~r~~~~~~~~i~~~~~~~~~v~~ 257 (272) T protein:vir:30 201 VQIVRSRKCPK-------G--TAYMVRKGALRIML-------------KRNTM-VETDRDITKAINQIVANKHYGVYLYK 257 (272) T ss_pred eeEEEcCCCCc-------c--eEEEEcCCeEEEEe-------------cCCce-eeeccccccceeEEEEEEEEEEEEEc Confidence 45544333310 0 12333222110000 00000 0111111111222221 345788999 Q ss_pred cchheeecCC Q lcl|NC_017674. 373 PWAVVRYLGI 382 (382) Q Consensus 373 P~aia~~~GI 382 (382) |.+++...-= T Consensus 258 ~~~vv~~t~~ 267 (272) T protein:vir:30 258 AEKAVKITLK 267 (272) T ss_pred CCceEEEEec Confidence 9988877644 No 76 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=92.49 E-value=0.011 Score=31.28 Aligned_cols=256 Identities=9% Similarity=0.019 Sum_probs=131.1 Q ss_pred cccccCcccccchhHHHHHHhhhhhhheeccccccchhhhCccccCCCc-ceeeEEEEeeecccceeecccccCCceeee Q lcl|NC_017674. 64 DSNFTAPVTTPSIPTPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSW-EDQEIVQGIVEPAGTAVEYGDHTNIPLTSW 142 (382) Q Consensus 64 Da~~~~~~t~~~~~~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~-~~~t~t~~v~e~~G~a~~ygd~~DiP~vd~ 142 (382) -|... .+.+++-+|..+..++.-++-+. .....+.-++....- --.++.++.++..|.+..++++.++|..+. T Consensus 1 MA~~~--T~~~~~~iPev~s~~v~~~~~~~----~~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~a~~v~eg~~i~~~~~ 74 (272) T protein:vir:98 1 MAVGT--TKMAQMLDPEVLADMIDAEVGKA----IRFAPLAEVDTTLEGQPGTTLTVPKWDYIGDAEDVAEGEAIPMTQL 74 (272) T ss_pred CCCcc--ccchheechHHHHHHHHHHHHHH----hhhhccccccccccCCCCCEEEEEEecCCCCcccccCCCccccccc Confidence 11101 22334557777777765444322 222222222221110 114788899998999999999999999999 Q ss_pred eeeeeEeeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCcceecc Q lcl|NC_017674. 143 NANFERRTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAFQTP 222 (382) Q Consensus 143 ~~~~~~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~~~~ 222 (382) ..+.....+..++..+.++.++... ...++.+.-...+.+++.+.+++..+ +-. . |- +.+. T Consensus 75 ~~~~~~~~~~~~~~~~~itd~~~~~---s~~d~~~~~~~~~~~~~a~~~d~~i~-~~~----~---~a--------~~~~ 135 (272) T protein:vir:98 75 GFKKTTMTIKKAGKGVEITDEAILS---GYGDPVGQAAKQIVEAIDHKVDADVL-DAL----S---KS--------TQTV 135 (272) T ss_pred ccceEEEEeeeeeeeeeecHHHHhh---ccccHHHHHHHHHHHHHHHHHHHHHH-HHh----c---cc--------cccc Confidence 9999999999999998988776543 45577777777777777777775433 210 1 11 1111 Q ss_pred CCCCccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhcccc---C--CCCccHHHHHHH----hcCc Q lcl|NC_017674. 223 PSQGWSTADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSVT---T--PYGISVSDWIEQ----TYPK 293 (382) Q Consensus 223 a~~~Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~t---~--~~~~Tvl~~l~~----n~pn 293 (382) .+. .| +++|.+++..+-..- ..+..++++|..+..|.+. + ..+......+.. ++-+ T Consensus 136 ~~~----~t----~d~i~da~~~l~~~~-------~~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~g~ig~i~G 200 (272) T protein:vir:98 136 EAT----AT----VDGVSKALDIFNDED-------DAETVIVMNPADASTLRLDAAKEWLGATEVGANRVVSGVYGEVLG 200 (272) T ss_pred ccc----cC----HHHHHHHHHHHhccC-------CCccEEEEcHHHHHHHHHhccccccccccccccccccccchhhcC Confidence 111 12 456666666553221 1356899999998887432 1 101111111111 1234 Q ss_pred cEEEEccccccccCCCCCceeEEEEcchhhhhhhccccccchhhhhhhhhhhhcccceecCCceEecccc-ceeeeEeec Q lcl|NC_017674. 294 MRIVSAPELSGVQMKAQEPEDALVLFVEDVNAAVDGSTDGGSVFSQLVQSKFITLGVEKRAKSYVEDFSN-GTAGALCKR 372 (382) Q Consensus 294 l~i~~~peL~~a~g~g~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~~~~~~~~~~-~t~Gv~i~~ 372 (382) ++++.-+-+.. + ..|++.+.--.... ..... ...++.......-... +..|+-+.+ T Consensus 201 ~~Vi~s~~~p~-------~--t~~~~~~~a~~~~~-------------~~~~~-ve~~r~~~~~~~~i~~~~~~~~~v~~ 257 (272) T protein:vir:98 201 VQIVRSRKCPK-------G--TAYMVRKGALRIML-------------KRNTM-VETDRDITKAINQIVANKHYGVYLYK 257 (272) T ss_pred eeEEEcCCCCc-------c--eEEEEcCCeEEEEe-------------cCCce-eeeccccccceeEEEEEEEEEEEEEc Confidence 45544333310 0 12333222110000 00000 0111111111222221 345788999 Q ss_pred cchheeecCC Q lcl|NC_017674. 373 PWAVVRYLGI 382 (382) Q Consensus 373 P~aia~~~GI 382 (382) |.+++...-= T Consensus 258 ~~~vv~~t~~ 267 (272) T protein:vir:98 258 AEKAVKITLK 267 (272) T ss_pred CCceEEEEec Confidence 9988877644 No 77 >protein:vir:6242 Length: 390 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813696;swissprot:trembl:q859c1;genbank:gi:29366756;interpro:IPR006444;uniprot:Q859C1;genbank:GeneID:1258897 Probab=92.42 E-value=0.0042 Score=33.62 Aligned_cols=330 Identities=9% Similarity=-0.002 Sum_probs=134.0 Q ss_pred CCCcceeeeecCccccc-----ccc-----ccccch------HHH-------HHHhh-cceeccccchhhh--hhhhccc Q lcl|NC_017674. 1 MSQISKTHSRLAGRNAK-----PFD-----LKNITN------DAV-------ASLSR-IGLVFDHAVVQDQ--IKALAKA 54 (382) Q Consensus 1 ~~~~~~~~~~~~~~~~~-----~~~-----~~~~~~------~~~-------~~l~~-~g~~~~~~~~~~~--~~~~~~~ 54 (382) |+.+-. -...|... .++ +..+.. ... ..+.+ .+........... ...+... T Consensus 20 ~~~L~~---~~~~~~lt~e~~~~~~~l~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~ 96 (390) T protein:vir:62 20 LRTLTD---EFAGKEMTDEAREKEERLITAVSDYDARIKRGIEAIKAIDPVTSLLSGLQGSGSGAQRSADVDDDATLRAG 96 (390) T ss_pred HHHHHH---HhhcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccchhhcchHHHHHHhhh Confidence 111100 00111100 000 000000 000 00000 0111000000000 0000000 Q ss_pred ---ccchhhhhhcccccCcccccchhH-HHHHHhhhhhhheecc--cccc-chhhhCccccCCCcceeeEEEEeeecccc Q lcl|NC_017674. 55 ---GAFRSGSAMDSNFTAPVTTPSIPT-PIQFLQTWLPGFVKVM--TAAR-KIDEIIGIDTVGSWEDQEIVQGIVEPAGT 127 (382) Q Consensus 55 ---~~~~~~~amDa~~~~~~t~~~~~~-~~~~l~~idp~v~~~~--~~~~-~~~~l~~v~t~g~~~~~t~t~~v~e~~G~ 127 (382) .......+... ....++.+.++ +-.+.. ..|.+.+ ...+ .+...++..+ ...+.+++....+. T Consensus 97 ~~~~~r~~~~~~~~--~~~t~~~~g~~~~~~~~~---~~i~~~~~~~~~l~~~~~~~~~~~-----~~~~~~p~~~~~~~ 166 (390) T protein:vir:62 97 NLGEARSFEFAPEK--RDGTKAGNPNVLSRTLYG---QLIAQAVERSAIMRGGATTFTTSD-----ANPLDFTVITGRSS 166 (390) T ss_pred hhhhhHHHHhhhhh--hcccccCCCccccccchH---HHHHHHHhhhhhhhhcceeeecCC-----CceeEEEEEcCCcc Confidence 00000111111 11122222222 211221 1122221 1111 1223333211 12356777777778 Q ss_pred eeecccccCCceeeeeeeeeEeeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccc Q lcl|NC_017674. 128 AVEYGDHTNIPLTSWNANFERRTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRT 207 (382) Q Consensus 128 a~~ygd~~DiP~vd~~~~~~~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~ 207 (382) +.+.+....+|..+....+..-.++.++..+.+|.+=|+. ..+++.+.-....+.++...+|+-.++|+. + - T Consensus 167 a~wv~E~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~d---s~~~l~~~i~~~l~~~i~~~~d~~~l~G~G---~--p 238 (390) T protein:vir:62 167 ASIVGETAEIPESYPATAQRSMGGFKYGFASVVSYEFATD---QVLDLVGFLVSDAGPAIGDAMGRHFITGTG---Q--P 238 (390) T ss_pred eeeecccccccccccceeeeEeeeeeEEeehHHHHHHHhh---hhHHHHHHHHHHHHHHHHHHHHhhhhccCC---c--c Confidence 8888888899999999999999999999999998765543 456788888888899999999999999973 1 1 Q ss_pred eEEEeCCCCcceeccCCCCccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhccc-cCCCCccHHH- Q lcl|NC_017674. 208 YGFLNDPNLPAFQTPPSQGWSTADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSV-TTPYGISVSD- 285 (382) Q Consensus 208 ~GllN~P~l~~~~~~a~~~Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~-t~~~~~Tvl~- 285 (382) -|++|+++........+ .-...| ++||.+++..|...- ..+ -.++|.++.+..|.. .+..|.=++. T Consensus 239 ~Gi~~~~~~~~~~~~~~-~~~~~~----~~~l~~~~~~l~~~~----~~~---a~~vmn~~~~~~L~~lkd~~g~~l~~~ 306 (390) T protein:vir:62 239 RGILTDASPATATFLAT-DTDSKV----SDALIDLFHEVPSAY----RAN---AKYVVNDLRAAQMRKLKDANGQYLWQS 306 (390) T ss_pred ccccccccccccceecc-cccccc----hHHHHHHHHhhhhhh----hcC---CEEEEchHHHHHHHHhhccCCCeeecC Confidence 48999876543221111 111233 455555665553221 111 157888888887753 2322221210 Q ss_pred HHHHhcCccEEEEccccccccCCCCCceeEEEEcchhhhhhhccccccchhhhhhhhhhhhc--c-cceecCCceEeccc Q lcl|NC_017674. 286 WIEQTYPKMRIVSAPELSGVQMKAQEPEDALVLFVEDVNAAVDGSTDGGSVFSQLVQSKFIT--L-GVEKRAKSYVEDFS 362 (382) Q Consensus 286 ~l~~n~pnl~i~~~peL~~a~g~g~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~p~~~~~--l-~~~~~~~~~~~~~~ 362 (382) -+...-|+ ++...|=...... .. . .+++- + ... +....+..+.. . -.....-...+... T Consensus 307 ~~~~g~~~-~l~G~Pv~~~~~~-p~-~-~i~~g--d-~s~-----------~~i~~~~~~~v~~~~~~~~~~~~~~~~~~ 368 (390) T protein:vir:62 307 GLTVGAPS-LFNGKVVETDDGM-PA-D-KILFA--D-LSK-----------YRVRFAGSLRVDRSVDAKFSTDQIVYRFL 368 (390) T ss_pred CcCCCccc-eecccceEEecCC-CC-c-cEEEe--e-ccc-----------eeEEeecceEEEeeccccccCCcEEEEEE Confidence 00000010 1111110000000 00 0 01110 0 000 00000000000 0 00011112233444 Q ss_pred cceeeeEeeccchheeecCC Q lcl|NC_017674. 363 NGTAGALCKRPWAVVRYLGI 382 (382) Q Consensus 363 ~~t~Gv~i~~P~aia~~~GI 382 (382) .|.+ +.+..|.||+.+..= T Consensus 369 ~r~d-~~~~~~~A~~~l~~~ 387 (390) T protein:vir:62 369 QRAD-GLLVDARGAKVLTVT 387 (390) T ss_pred EEeC-cEeechhheEEEEee Confidence 5555 468889998777744 No 78 >protein:vir:98339 Length: 415 # NCBI annotation: putative capsid protein # Family: family:all:21 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918931;genbank:gi:119443693;genbank:GeneID:4594501 Probab=92.07 E-value=0.013 Score=30.93 Aligned_cols=332 Identities=10% Similarity=-0.018 Sum_probs=136.9 Q ss_pred CCCcce-----------eeeecCccccccccccccchHHHHHHhhcceeccccchhhhhhhhcccccchhhhhhcccccC Q lcl|NC_017674. 1 MSQISK-----------THSRLAGRNAKPFDLKNITNDAVASLSRIGLVFDHAVVQDQIKALAKAGAFRSGSAMDSNFTA 69 (382) Q Consensus 1 ~~~~~~-----------~~~~~~~~~~~~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~amDa~~~~ 69 (382) -+|+.. ...........+..... ......+....+..+..... .....+... ......+....+ T Consensus 48 ~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~-~~~~~~~~~~~~ 122 (415) T protein:vir:98 48 RSQIQEKQEELDKLKEKDGTSENNQQSVEVNEAR-TYRNQANINDLGISIQNTKV---TSQEVRDFT-EYLETRNDIQGG 122 (415) T ss_pred HHHHHHHHHHHHHHHHHHhhhhhcccccccchhh-hHHHHHHHHHHhhhhhhhhh---HHHHHHHHH-HHHhhhhhhhhc Confidence 000000 00000000000000000 00011111111111111100 000000000 000011111112 Q ss_pred cccccchh--HHHHHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEEeee--cccceeecccccCCceee-eee Q lcl|NC_017674. 70 PVTTPSIP--TPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVE--PAGTAVEYGDHTNIPLTS-WNA 144 (382) Q Consensus 70 ~~t~~~~~--~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e--~~G~a~~ygd~~DiP~vd-~~~ 144 (382) ..++.+.+ +|..+ .++|++.+........++.+..... .+..+.+.. ..+.+...+.+.++|-.+ ... T Consensus 123 ~~~~~~gg~~iP~~~----~~~ii~~~~~~~~l~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~ 195 (415) T protein:vir:98 123 SLKTDSGFVVIPEEI----VTDILKLKEVEFNLDKYVTVKRVTN---GSGKYPVVRQSEVAALEKVEELEENPELAVKPF 195 (415) T ss_pred cccccccccccchHH----HHHHHHHHHhhhhhhhheeeeeccC---CceeEEEEeecCCccceeeccccccCcccccce Confidence 22333333 55444 4466666666666666665543221 122333333 334455667778888554 578 Q ss_pred eeeEeeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCcceeccCC Q lcl|NC_017674. 145 NFERRTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAFQTPPS 224 (382) Q Consensus 145 ~~~~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~~~~a~ 224 (382) ......++.++..+.+|.+=+ .....++.+.-.....+++.+.+|+-++.|+..+... .++.+........+. T Consensus 196 ~~v~~~~~k~~~~~~iS~ell---~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~--~~~~~~~~~~~~~~~-- 268 (415) T protein:vir:98 196 FQLAYDINTHRGYFRISREAI---EDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTG--STSSGFEKEGKKLEV-- 268 (415) T ss_pred eeEEeeeeeeEeeehhhHHHH---hhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCccc--ccccccccccccccc-- Confidence 888889999999888887533 3345678888888888888889999999997554222 233332221111111 Q ss_pred CCccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhccc-cCCCCccHHHH-HHHhcC----ccEEEE Q lcl|NC_017674. 225 QGWSTADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSV-TTPYGISVSDW-IEQTYP----KMRIVS 298 (382) Q Consensus 225 ~~Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~-t~~~~~Tvl~~-l~~n~p----nl~i~~ 298 (382) . ...+ ++||.+++..+...- . .+..++|.++.+..|.. .+..|.-++.- +....+ +..++. T Consensus 269 ~--~~~~----~~~i~~~~~~~~~~~---~----~~~~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~ 335 (415) T protein:vir:98 269 K--KAKS----LDDIKDAINLNVKPN---Y----EHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEI 335 (415) T ss_pred c--cccc----hhHHHHHHHhhhhhc---c----CCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCCceecceeeEE Confidence 1 1122 566767776654321 1 23478999998888854 23334322210 001011 112333 Q ss_pred ccccccccCCCCCceeEEEE-cchhhhhhhccccccchhhhhhhhhhhhcccceecCCceEeccccceeeeEeeccchhe Q lcl|NC_017674. 299 APELSGVQMKAQEPEDALVL-FVEDVNAAVDGSTDGGSVFSQLVQSKFITLGVEKRAKSYVEDFSNGTAGALCKRPWAVV 377 (382) Q Consensus 299 ~peL~~a~g~g~~~~~~~~~-~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~~~~~~~~~~~t~Gv~i~~P~aia 377 (382) .+.+- .+. +.+ ..+++- |.+-+. ..+.+ . +. +...+..... ...-...|. |+.+++|.||+ T Consensus 336 ~~~~~-~~~-~~~-~~~~~Gd~~~~~~-~~~~~---~------~~--v~~~~~~~~~--~~~~~~~r~-d~~v~~~~a~~ 397 (415) T protein:vir:98 336 LPDEV-LGQ-KGN-NTLIIGNLKDAIV-LFDRS---Q------YQ--ASWTDYMHFG--ECLMIAVRQ-DCRILDYKSAI 397 (415) T ss_pred ecccc-cCC-CCc-cEEEEEehhccEE-EEeec---c------eE--EEEeccccCc--eEEEEEEEe-ccEEeccccEE Confidence 33221 111 111 112221 111110 00000 0 00 0000100011 111233454 56667899998 Q ss_pred eecCC Q lcl|NC_017674. 378 RYLGI 382 (382) Q Consensus 378 ~~~GI 382 (382) .++-- T Consensus 398 ~~~~~ 402 (415) T protein:vir:98 398 VIEYD 402 (415) T ss_pred EEEEe Confidence 88655 No 79 >protein:vir:81100 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429874;genbank:gi:156603927;genbank:GeneID:5525320 Probab=92.07 E-value=0.013 Score=30.93 Aligned_cols=332 Identities=10% Similarity=-0.018 Sum_probs=136.9 Q ss_pred CCCcce-----------eeeecCccccccccccccchHHHHHHhhcceeccccchhhhhhhhcccccchhhhhhcccccC Q lcl|NC_017674. 1 MSQISK-----------THSRLAGRNAKPFDLKNITNDAVASLSRIGLVFDHAVVQDQIKALAKAGAFRSGSAMDSNFTA 69 (382) Q Consensus 1 ~~~~~~-----------~~~~~~~~~~~~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~amDa~~~~ 69 (382) -+|+.. ...........+..... ......+....+..+..... .....+... ......+....+ T Consensus 48 ~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~-~~~~~~~~~~~~ 122 (415) T protein:vir:81 48 RSQIQEKQEELDKLKEKDGTSENNQQSVEVNEAR-TYRNQANINDLGISIQNTKV---TSQEVRDFT-EYLETRNDIQGG 122 (415) T ss_pred HHHHHHHHHHHHHHHHHHhhhhhcccccccchhh-hHHHHHHHHHHhhhhhhhhh---HHHHHHHHH-HHHhhhhhhhhc Confidence 000000 00000000000000000 00011111111111111100 000000000 000011111112 Q ss_pred cccccchh--HHHHHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEEeee--cccceeecccccCCceee-eee Q lcl|NC_017674. 70 PVTTPSIP--TPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVE--PAGTAVEYGDHTNIPLTS-WNA 144 (382) Q Consensus 70 ~~t~~~~~--~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e--~~G~a~~ygd~~DiP~vd-~~~ 144 (382) ..++.+.+ +|..+ .++|++.+........++.+..... .+..+.+.. ..+.+...+.+.++|-.+ ... T Consensus 123 ~~~~~~gg~~iP~~~----~~~ii~~~~~~~~l~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~ 195 (415) T protein:vir:81 123 SLKTDSGFVVIPEEI----VTDILKLKEVEFNLDKYVTVKRVTN---GSGKYPVVRQSEVAALEKVEELEENPELAVKPF 195 (415) T ss_pred cccccccccccchHH----HHHHHHHHHhhhhhhhheeeeeccC---CceeEEEEeecCCccceeeccccccCcccccce Confidence 22333333 55444 4466666666666666665543221 122333333 334455667778888554 578 Q ss_pred eeeEeeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCcceeccCC Q lcl|NC_017674. 145 NFERRTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAFQTPPS 224 (382) Q Consensus 145 ~~~~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~~~~a~ 224 (382) ......++.++..+.+|.+=+ .....++.+.-.....+++.+.+|+-++.|+..+... .++.+........+. T Consensus 196 ~~v~~~~~k~~~~~~iS~ell---~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~--~~~~~~~~~~~~~~~-- 268 (415) T protein:vir:81 196 FQLAYDINTHRGYFRISREAI---EDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTG--STSSGFEKEGKKLEV-- 268 (415) T ss_pred eeEEeeeeeeEeeehhhHHHH---hhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCccc--ccccccccccccccc-- Confidence 888889999999888887533 3345678888888888888889999999997554222 233332221111111 Q ss_pred CCccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhccc-cCCCCccHHHH-HHHhcC----ccEEEE Q lcl|NC_017674. 225 QGWSTADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSV-TTPYGISVSDW-IEQTYP----KMRIVS 298 (382) Q Consensus 225 ~~Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~-t~~~~~Tvl~~-l~~n~p----nl~i~~ 298 (382) . ...+ ++||.+++..+...- . .+..++|.++.+..|.. .+..|.-++.- +....+ +..++. T Consensus 269 ~--~~~~----~~~i~~~~~~~~~~~---~----~~~~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~ 335 (415) T protein:vir:81 269 K--KAKS----LDDIKDAINLNVKPN---Y----EHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEI 335 (415) T ss_pred c--cccc----hhHHHHHHHhhhhhc---c----CCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCCceecceeeEE Confidence 1 1122 566767776654321 1 23478999998888854 23334322210 001011 112333 Q ss_pred ccccccccCCCCCceeEEEE-cchhhhhhhccccccchhhhhhhhhhhhcccceecCCceEeccccceeeeEeeccchhe Q lcl|NC_017674. 299 APELSGVQMKAQEPEDALVL-FVEDVNAAVDGSTDGGSVFSQLVQSKFITLGVEKRAKSYVEDFSNGTAGALCKRPWAVV 377 (382) Q Consensus 299 ~peL~~a~g~g~~~~~~~~~-~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~~~~~~~~~~~t~Gv~i~~P~aia 377 (382) .+.+- .+. +.+ ..+++- |.+-+. ..+.+ . +. +...+..... ...-...|. |+.+++|.||+ T Consensus 336 ~~~~~-~~~-~~~-~~~~~Gd~~~~~~-~~~~~---~------~~--v~~~~~~~~~--~~~~~~~r~-d~~v~~~~a~~ 397 (415) T protein:vir:81 336 LPDEV-LGQ-KGN-NTLIIGNLKDAIV-LFDRS---Q------YQ--ASWTDYMHFG--ECLMIAVRQ-DCRILDYKSAI 397 (415) T ss_pred ecccc-cCC-CCc-cEEEEEehhccEE-EEeec---c------eE--EEEeccccCc--eEEEEEEEe-ccEEeccccEE Confidence 33221 111 111 112221 111110 00000 0 00 0000100011 111233454 56667899998 Q ss_pred eecCC Q lcl|NC_017674. 378 RYLGI 382 (382) Q Consensus 378 ~~~GI 382 (382) .++-- T Consensus 398 ~~~~~ 402 (415) T protein:vir:81 398 VIEYD 402 (415) T ss_pred EEEEe Confidence 88655 No 80 >protein:vir:79987 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430002;genbank:gi:156604057;genbank:GeneID:5525447 Probab=92.07 E-value=0.013 Score=30.93 Aligned_cols=332 Identities=10% Similarity=-0.018 Sum_probs=136.9 Q ss_pred CCCcce-----------eeeecCccccccccccccchHHHHHHhhcceeccccchhhhhhhhcccccchhhhhhcccccC Q lcl|NC_017674. 1 MSQISK-----------THSRLAGRNAKPFDLKNITNDAVASLSRIGLVFDHAVVQDQIKALAKAGAFRSGSAMDSNFTA 69 (382) Q Consensus 1 ~~~~~~-----------~~~~~~~~~~~~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~amDa~~~~ 69 (382) -+|+.. ...........+..... ......+....+..+..... .....+... ......+....+ T Consensus 48 ~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~-~~~~~~~~~~~~ 122 (415) T protein:vir:79 48 RSQIQEKQEELDKLKEKDGTSENNQQSVEVNEAR-TYRNQANINDLGISIQNTKV---TSQEVRDFT-EYLETRNDIQGG 122 (415) T ss_pred HHHHHHHHHHHHHHHHHHhhhhhcccccccchhh-hHHHHHHHHHHhhhhhhhhh---HHHHHHHHH-HHHhhhhhhhhc Confidence 000000 00000000000000000 00011111111111111100 000000000 000011111112 Q ss_pred cccccchh--HHHHHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEEeee--cccceeecccccCCceee-eee Q lcl|NC_017674. 70 PVTTPSIP--TPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVE--PAGTAVEYGDHTNIPLTS-WNA 144 (382) Q Consensus 70 ~~t~~~~~--~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e--~~G~a~~ygd~~DiP~vd-~~~ 144 (382) ..++.+.+ +|..+ .++|++.+........++.+..... .+..+.+.. ..+.+...+.+.++|-.+ ... T Consensus 123 ~~~~~~gg~~iP~~~----~~~ii~~~~~~~~l~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~ 195 (415) T protein:vir:79 123 SLKTDSGFVVIPEEI----VTDILKLKEVEFNLDKYVTVKRVTN---GSGKYPVVRQSEVAALEKVEELEENPELAVKPF 195 (415) T ss_pred cccccccccccchHH----HHHHHHHHHhhhhhhhheeeeeccC---CceeEEEEeecCCccceeeccccccCcccccce Confidence 22333333 55444 4466666666666666665543221 122333333 334455667778888554 578 Q ss_pred eeeEeeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCcceeccCC Q lcl|NC_017674. 145 NFERRTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAFQTPPS 224 (382) Q Consensus 145 ~~~~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~~~~a~ 224 (382) ......++.++..+.+|.+=+ .....++.+.-.....+++.+.+|+-++.|+..+... .++.+........+. T Consensus 196 ~~v~~~~~k~~~~~~iS~ell---~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~--~~~~~~~~~~~~~~~-- 268 (415) T protein:vir:79 196 FQLAYDINTHRGYFRISREAI---EDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTG--STSSGFEKEGKKLEV-- 268 (415) T ss_pred eeEEeeeeeeEeeehhhHHHH---hhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCccc--ccccccccccccccc-- Confidence 888889999999888887533 3345678888888888888889999999997554222 233332221111111 Q ss_pred CCccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhccc-cCCCCccHHHH-HHHhcC----ccEEEE Q lcl|NC_017674. 225 QGWSTADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSV-TTPYGISVSDW-IEQTYP----KMRIVS 298 (382) Q Consensus 225 ~~Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~-t~~~~~Tvl~~-l~~n~p----nl~i~~ 298 (382) . ...+ ++||.+++..+...- . .+..++|.++.+..|.. .+..|.-++.- +....+ +..++. T Consensus 269 ~--~~~~----~~~i~~~~~~~~~~~---~----~~~~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~ 335 (415) T protein:vir:79 269 K--KAKS----LDDIKDAINLNVKPN---Y----EHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEI 335 (415) T ss_pred c--cccc----hhHHHHHHHhhhhhc---c----CCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCCceecceeeEE Confidence 1 1122 566767776654321 1 23478999998888854 23334322210 001011 112333 Q ss_pred ccccccccCCCCCceeEEEE-cchhhhhhhccccccchhhhhhhhhhhhcccceecCCceEeccccceeeeEeeccchhe Q lcl|NC_017674. 299 APELSGVQMKAQEPEDALVL-FVEDVNAAVDGSTDGGSVFSQLVQSKFITLGVEKRAKSYVEDFSNGTAGALCKRPWAVV 377 (382) Q Consensus 299 ~peL~~a~g~g~~~~~~~~~-~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~~~~~~~~~~~t~Gv~i~~P~aia 377 (382) .+.+- .+. +.+ ..+++- |.+-+. ..+.+ . +. +...+..... ...-...|. |+.+++|.||+ T Consensus 336 ~~~~~-~~~-~~~-~~~~~Gd~~~~~~-~~~~~---~------~~--v~~~~~~~~~--~~~~~~~r~-d~~v~~~~a~~ 397 (415) T protein:vir:79 336 LPDEV-LGQ-KGN-NTLIIGNLKDAIV-LFDRS---Q------YQ--ASWTDYMHFG--ECLMIAVRQ-DCRILDYKSAI 397 (415) T ss_pred ecccc-cCC-CCc-cEEEEEehhccEE-EEeec---c------eE--EEEeccccCc--eEEEEEEEe-ccEEeccccEE Confidence 33221 111 111 112221 111110 00000 0 00 0000100011 111233454 56667899998 Q ss_pred eecCC Q lcl|NC_017674. 378 RYLGI 382 (382) Q Consensus 378 ~~~GI 382 (382) .++-- T Consensus 398 ~~~~~ 402 (415) T protein:vir:79 398 VIEYD 402 (415) T ss_pred EEEEe Confidence 88655 No 81 >protein:vir:93742 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240459;genbank:gi:66396126;genbank:GeneID:5133511 Probab=90.73 E-value=0.019 Score=29.98 Aligned_cols=254 Identities=11% Similarity=0.043 Sum_probs=127.3 Q ss_pred hcccccCcccccchhHHHHHHhhhhhhheeccccccchhhhCccccCCCc-ceeeEEEEeeecccceeecccccCCceee Q lcl|NC_017674. 63 MDSNFTAPVTTPSIPTPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSW-EDQEIVQGIVEPAGTAVEYGDHTNIPLTS 141 (382) Q Consensus 63 mDa~~~~~~t~~~~~~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~-~~~t~t~~v~e~~G~a~~ygd~~DiP~vd 141 (382) |=.. ....++.-+|..+..++..++-+ .+....+..++.+..- --.+++++.++..|.+..|.++++++..+ T Consensus 1 ma~~---~T~~~~~iiPev~~~~v~~~~~~----~~~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~~~~eg~~i~~~~ 73 (274) T protein:vir:93 1 MPQG---ITKTSNQIIPEVLAPMMQAQLEK----KLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDI 73 (274) T ss_pred CCcc---ceehhheechHHHHHHHHHHHHh----hhhhcccccccccccCCCCCEEEEEeeccCCCcccccCCCcccccc Confidence 1111 12233555787777777655433 2333444444333221 12588999999999999999999999999 Q ss_pred eeeeeeEeeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCcceec Q lcl|NC_017674. 142 WNANFERRTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAFQT 221 (382) Q Consensus 142 ~~~~~~~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~~~ 221 (382) +........+...+-+++++. +.+++. +.++.++-...+.+++..++++-.+-.- +.-... + T Consensus 74 it~~~~~~~i~~~~~~~~i~D--~~~~~~-~~d~~~~~~~~~~~~~a~~~d~~~~~~~------------~~a~~~---~ 135 (274) T protein:vir:93 74 LETKKREAKIRKIAKGTSITD--EALLSG-YGDPQGEQVRQHGLAHANKVDNDVLEAL------------MGAKLT---V 135 (274) T ss_pred cccceeEEEeeeecccccccH--HHHHhh-ccchHHHHHHHHHHHHHHHHHHHHHHHH------------hccccc---c Confidence 999999999988776655555 444443 4566666667777777777775443211 100000 0 Q ss_pred cCCCCccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhccccC--------CCCccHHHHHHH---- Q lcl|NC_017674. 222 PPSQGWSTADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSVTT--------PYGISVSDWIEQ---- 289 (382) Q Consensus 222 ~a~~~Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~t~--------~~~~Tvl~~l~~---- 289 (382) .+ ...+ +++|.+++..+-.. +..+..|+++|..+..|.+-. ..+-.+ +++ T Consensus 136 -~~---~~~~----~d~i~dA~~~l~d~-------~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~---~~~G~ig 197 (274) T protein:vir:93 136 -NA---DITK----LNGLQSAIDKFNDE-------DLEPMVLFINPLDAGKLRGDASTNFTRATELGDDI---IVKGAFG 197 (274) T ss_pred -cc---cccC----HHHHHHHHHHhhhc-------cCCccEEEeCHHHHHHHHhhhhhcccccccccccc---eeecccc Confidence 00 0112 44555555554322 124668999999999886421 111111 111 Q ss_pred hcCccEEEEccccccccCCCCCceeEEEEcchhhhhhhccccccchhhhhhhhhhhhcccceecCCceEecccc-ceeee Q lcl|NC_017674. 290 TYPKMRIVSAPELSGVQMKAQEPEDALVLFVEDVNAAVDGSTDGGSVFSQLVQSKFITLGVEKRAKSYVEDFSN-GTAGA 368 (382) Q Consensus 290 n~pnl~i~~~peL~~a~g~g~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~~~~~~~~~~-~t~Gv 368 (382) .+-+++|+.-+.+- .+ ..|++.+.--. +...-+... ..++..+........ ..+|+ T Consensus 198 ~~~G~~Vi~s~~~p-------~~--t~~l~~~gai~-----------~~~~~~~~v---E~~Rd~~~~~d~i~~~~~y~~ 254 (274) T protein:vir:93 198 EALGAIIVRTNKLE-------AG--TAILAKKGAVK-----------LILKRDFFL---EVARDASTKTTALYSDKHYVA 254 (274) T ss_pred eecCeeEEEcCCCC-------cc--eEEEEeCCeEE-----------EEecCCccc---ccccchhhcccEEEEEEEEEE Confidence 12344554333221 00 12222211100 000000000 011111111111111 34688 Q ss_pred EeeccchheeecCC Q lcl|NC_017674. 369 LCKRPWAVVRYLGI 382 (382) Q Consensus 369 ~i~~P~aia~~~GI 382 (382) -+.+|.+++.+.-= T Consensus 255 ~~~~~~~~v~~t~~ 268 (274) T protein:vir:93 255 YLYDESKAVKITKG 268 (274) T ss_pred EEEcCCceEEEeeC Confidence 88888887776643 No 82 >protein:vir:97433 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240749;genbank:gi:66396420;genbank:GeneID:5133789 Probab=90.17 E-value=0.022 Score=29.64 Aligned_cols=257 Identities=11% Similarity=0.033 Sum_probs=125.9 Q ss_pred hcccccCcccccchhHHHHHHhhhhhhheeccccccchhhhCccccCCCc-ceeeEEEEeeecccceeecccccCCceee Q lcl|NC_017674. 63 MDSNFTAPVTTPSIPTPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSW-EDQEIVQGIVEPAGTAVEYGDHTNIPLTS 141 (382) Q Consensus 63 mDa~~~~~~t~~~~~~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~-~~~t~t~~v~e~~G~a~~ygd~~DiP~vd 141 (382) |= + +....+++-+|..+..++.-++ ...+....+..++....- --.+++++.+...|.+..|.++++++.-. T Consensus 1 ma-~--~~T~~~d~iiPev~~~~v~~~~----~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~a~~~~~g~~i~~~~ 73 (274) T protein:vir:97 1 MP-Q--GLTKTSDQIIPEVLAPMMQAQL----EKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDI 73 (274) T ss_pred CC-c--cceehhheechHHHHHHHHHhh----hhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCCCcccccc Confidence 11 1 1123345567877777776544 233344444444433211 13689999999999999999999999999 Q ss_pred eeeeeeEeeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCcceec Q lcl|NC_017674. 142 WNANFERRTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAFQT 221 (382) Q Consensus 142 ~~~~~~~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~~~ 221 (382) .........+...+-++++ .++.+++. +-++-.+-...+.+++..++++-.+-- ++.-.+. + T Consensus 74 lt~~~~~~~i~~~~~~~~i--~D~~~~~~-~~dp~~~~~~~~a~a~a~~vd~~~~~~------------l~~a~~~---~ 135 (274) T protein:vir:97 74 LETKKREAKIRKIAKGTSI--TDEALLSG-YGDPQGEQVRQHGLAHANKVDNDVLEA------------LMGAKLT---V 135 (274) T ss_pred cccceeEEEeeeecceecc--cHHHHHhc-cchHHHHHHHHHHHHHHHHHHHHHHHH------------HhccCcc---c Confidence 9999999998887655555 45544554 345555666666677777777543311 1110010 0 Q ss_pred cCCCCccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhccccC--------CCCccHHH-HHHHhcC Q lcl|NC_017674. 222 PPSQGWSTADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSVTT--------PYGISVSD-WIEQTYP 292 (382) Q Consensus 222 ~a~~~Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~t~--------~~~~Tvl~-~l~~n~p 292 (382) .+ +.-+ +++|.++...+-.. +..+..|+++|..+..|.+-+ .++-.++. =.--.|- T Consensus 136 -~~---~~~~----~d~i~dA~~~l~d~-------~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~ 200 (274) T protein:vir:97 136 -NA---DITK----LNGLQSAIDKFNDE-------DLEPMVLFVNPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEAL 200 (274) T ss_pred -cc---cccC----HHHHHHHHHHhhcc-------CCCceEEEeCHHHHHHHHhhhhhhccccCcccccceeccccceec Confidence 00 0112 44555565554332 124668999999999886421 11111110 0000122 Q ss_pred ccEEEEccccccccCCCCCceeEEEEcchhhhhhhccccccchhhhhhhhhhhhcccceecCCceEeccc-cceeeeEee Q lcl|NC_017674. 293 KMRIVSAPELSGVQMKAQEPEDALVLFVEDVNAAVDGSTDGGSVFSQLVQSKFITLGVEKRAKSYVEDFS-NGTAGALCK 371 (382) Q Consensus 293 nl~i~~~peL~~a~g~g~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~~~~~~~~~-~~t~Gv~i~ 371 (382) +++|+.-+.+- . ...|++.+.--.. +... +... ..++..+...-... -..+||-+. T Consensus 201 G~~Vi~s~~~p-------~--~t~~l~~~gA~~~----------~~~~-~~~v---E~~Rd~~~~~d~i~~~~~y~~~~~ 257 (274) T protein:vir:97 201 GAIIVRTNKLE-------A--GTAILAKKGAVKL----------ILKR-DFFL---EVARDASTKTTALYSDKHYVAYLY 257 (274) T ss_pred CeeEEEcCCCC-------c--ceEEEEeCcceEe----------eecC-Ccee---ccccchhhcccEEEEEEEEEEEEE Confidence 44444322211 0 0122222211100 0000 0000 01111111111111 125688888 Q ss_pred ccchheeecCC Q lcl|NC_017674. 372 RPWAVVRYLGI 382 (382) Q Consensus 372 ~P~aia~~~GI 382 (382) +|..++.+.-= T Consensus 258 ~~~~vv~~t~~ 268 (274) T protein:vir:97 258 DESKAVKITKG 268 (274) T ss_pred cCCceEEEecC Confidence 88777775543 No 83 >protein:vir:94494 Length: 274 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240676;genbank:gi:66396348;genbank:GeneID:5133758 Probab=90.17 E-value=0.022 Score=29.64 Aligned_cols=257 Identities=11% Similarity=0.033 Sum_probs=125.9 Q ss_pred hcccccCcccccchhHHHHHHhhhhhhheeccccccchhhhCccccCCCc-ceeeEEEEeeecccceeecccccCCceee Q lcl|NC_017674. 63 MDSNFTAPVTTPSIPTPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSW-EDQEIVQGIVEPAGTAVEYGDHTNIPLTS 141 (382) Q Consensus 63 mDa~~~~~~t~~~~~~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~-~~~t~t~~v~e~~G~a~~ygd~~DiP~vd 141 (382) |= + +....+++-+|..+..++.-++ ...+....+..++....- --.+++++.+...|.+..|.++++++.-. T Consensus 1 ma-~--~~T~~~d~iiPev~~~~v~~~~----~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~a~~~~~g~~i~~~~ 73 (274) T protein:vir:94 1 MP-Q--GLTKTSDQIIPEVLAPMMQAQL----EKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDI 73 (274) T ss_pred CC-c--cceehhheechHHHHHHHHHhh----hhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCCCcccccc Confidence 11 1 1123345567877777776544 233344444444433211 13689999999999999999999999999 Q ss_pred eeeeeeEeeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCcceec Q lcl|NC_017674. 142 WNANFERRTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAFQT 221 (382) Q Consensus 142 ~~~~~~~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~~~ 221 (382) .........+...+-++++ .++.+++. +-++-.+-...+.+++..++++-.+-- ++.-.+. + T Consensus 74 lt~~~~~~~i~~~~~~~~i--~D~~~~~~-~~dp~~~~~~~~a~a~a~~vd~~~~~~------------l~~a~~~---~ 135 (274) T protein:vir:94 74 LETKKREAKIRKIAKGTSI--TDEALLSG-YGDPQGEQVRQHGLAHANKVDNDVLEA------------LMGAKLT---V 135 (274) T ss_pred cccceeEEEeeeecceecc--cHHHHHhc-cchHHHHHHHHHHHHHHHHHHHHHHHH------------HhccCcc---c Confidence 9999999998887655555 45544554 345555666666677777777543311 1110010 0 Q ss_pred cCCCCccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhccccC--------CCCccHHH-HHHHhcC Q lcl|NC_017674. 222 PPSQGWSTADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSVTT--------PYGISVSD-WIEQTYP 292 (382) Q Consensus 222 ~a~~~Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~t~--------~~~~Tvl~-~l~~n~p 292 (382) .+ +.-+ +++|.++...+-.. +..+..|+++|..+..|.+-+ .++-.++. =.--.|- T Consensus 136 -~~---~~~~----~d~i~dA~~~l~d~-------~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~ 200 (274) T protein:vir:94 136 -NA---DITK----LNGLQSAIDKFNDE-------DLEPMVLFVNPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEAL 200 (274) T ss_pred -cc---cccC----HHHHHHHHHHhhcc-------CCCceEEEeCHHHHHHHHhhhhhhccccCcccccceeccccceec Confidence 00 0112 44555565554332 124668999999999886421 11111110 0000122 Q ss_pred ccEEEEccccccccCCCCCceeEEEEcchhhhhhhccccccchhhhhhhhhhhhcccceecCCceEeccc-cceeeeEee Q lcl|NC_017674. 293 KMRIVSAPELSGVQMKAQEPEDALVLFVEDVNAAVDGSTDGGSVFSQLVQSKFITLGVEKRAKSYVEDFS-NGTAGALCK 371 (382) Q Consensus 293 nl~i~~~peL~~a~g~g~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~~~~~~~~~-~~t~Gv~i~ 371 (382) +++|+.-+.+- . ...|++.+.--.. +... +... ..++..+...-... -..+||-+. T Consensus 201 G~~Vi~s~~~p-------~--~t~~l~~~gA~~~----------~~~~-~~~v---E~~Rd~~~~~d~i~~~~~y~~~~~ 257 (274) T protein:vir:94 201 GAIIVRTNKLE-------A--GTAILAKKGAVKL----------ILKR-DFFL---EVARDASTKTTALYSDKHYVAYLY 257 (274) T ss_pred CeeEEEcCCCC-------c--ceEEEEeCcceEe----------eecC-Ccee---ccccchhhcccEEEEEEEEEEEEE Confidence 44444322211 0 0122222211100 0000 0000 01111111111111 125688888 Q ss_pred ccchheeecCC Q lcl|NC_017674. 372 RPWAVVRYLGI 382 (382) Q Consensus 372 ~P~aia~~~GI 382 (382) +|..++.+.-= T Consensus 258 ~~~~vv~~t~~ 268 (274) T protein:vir:94 258 DESKAVKITKG 268 (274) T ss_pred cCCceEEEecC Confidence 88777775543 No 84 >protein:vir:97255 Length: 310 # NCBI annotation: hypothetical protein ORF017 # Family: family:all:1120 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294525;genbank:gi:149408246;genbank:GeneID:5237120 Probab=89.43 E-value=0.026 Score=29.24 Aligned_cols=275 Identities=10% Similarity=0.034 Sum_probs=119.3 Q ss_pred cccchhhhhhcccccCcccccchhHHHHHHhhhhhhheeccccccchhhhCcccc-CCCcceeeEEEEeee-cccce--e Q lcl|NC_017674. 54 AGAFRSGSAMDSNFTAPVTTPSIPTPIQFLQTWLPGFVKVMTAARKIDEIIGIDT-VGSWEDQEIVQGIVE-PAGTA--V 129 (382) Q Consensus 54 ~~~~~~~~amDa~~~~~~t~~~~~~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t-~g~~~~~t~t~~v~e-~~G~a--~ 129 (382) |++ .|.+.++.+ .-..+.-+|+|.+-..-...+.+|=+. +|. .+.|.-.. ..|.+ . T Consensus 1 mpa--------------ltLaea~k~--~~d~l~~~ViE~~~~~s~lL~~LpF~~veg~----~~~ynR~~~~~~~~~~~ 60 (310) T protein:vir:97 1 MAS--------------VTLAESAKL--AQDELVAGVIENIITVNRMFDVLPFDSIEGN----SLAYNRENVLGDVIMAG 60 (310) T ss_pred Ccc--------------cchHHHhhc--CcchHHHHHHHHHhccchHHHhCCcccccCC----cceeeEeeccCCccccc Confidence 110 010001100 001122345555544445556665321 121 23443322 22222 2 Q ss_pred ecccccCC--ceeeeeeeeeEeeEEEEEEEEEecHHHHHH--HHHh-CCC--hHHHHHHHHHHHHHHhhccEEEEeeccC Q lcl|NC_017674. 130 EYGDHTNI--PLTSWNANFERRTIVRGELGMMVGTLEEGR--ASAI-RLN--SAETKRQQAAIGLEIFRNAIGFYGWQSG 202 (382) Q Consensus 130 ~ygd~~Di--P~vd~~~~~~~~~v~~~~~g~~y~~~El~~--A~~~-g~~--l~~~K~~aAr~a~~~~~n~i~~~Gd~~g 202 (382) ..-.+++. |......++....+..++..+ |+-+ +... +-+ .-+..-....+++.++.+...++||.+ T Consensus 61 v~~~~~~~g~~~~~~t~~~~~~~L~i~~g~~-----~Vd~~i~dl~~~~~~dq~~~Ql~~~iea~~~~~e~~lINGD~a- 134 (310) T protein:vir:97 61 VGTTFSGAGAGKAAATFTKVNSNLTTIMGDA-----EVNGLIQATRSGDGNDQTAVQIASKAKSAGRKYQDQLINGNGA- 134 (310) T ss_pred ccccccCCCccccccccceeeeeeeeeeehh-----hhhhHHHhhhcCChHHHHHHHHHHHHHHHHHHHHHHhhccccC- Confidence 22123222 333333444444455555444 4433 2322 322 333445566678888888889999965 Q ss_pred CcccceEEEeCCCCcceec-cCCCCccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHH---Hhhcccc-- Q lcl|NC_017674. 203 LGNRTYGFLNDPNLPAFQT-PPSQGWSTADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSK---VDYLSVT-- 276 (382) Q Consensus 203 ~~~g~~GllN~P~l~~~~~-~a~~~Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~---~~~Ls~t-- 276 (382) .++.+||+.. +..... .+++.-+.-| ++|+.+++..+|..-+ .|..|++.|.. +.-+.+. T Consensus 135 -~n~F~GL~~~--~~~~q~i~~~~~gg~~t----~d~LDeLl~~v~~~~g-------~p~~~l~~~~~~r~i~A~~R~~~ 200 (310) T protein:vir:97 135 -GNEFAGLIQL--CASGQKATTGATGSAIS----FAILDELMDLVVDKDG-------QVDYLTMHARTLRSYKALLRALG 200 (310) T ss_pred -CCcccchhhc--CCccceeecCCCCCCCC----HHHHHHHHHHHhcCCC-------CCCEEEecHHHHHHHHHHHHHhc Confidence 4678899884 222111 1111223334 4788889988876543 35678888864 4433321 Q ss_pred ---------CCCCccHHHHHHHhcCccEEEEcccccc--ccCCCCCceeEEEEcchh---hhhhhccccccchhhhhhhh Q lcl|NC_017674. 277 ---------TPYGISVSDWIEQTYPKMRIVSAPELSG--VQMKAQEPEDALVLFVED---VNAAVDGSTDGGSVFSQLVQ 342 (382) Q Consensus 277 ---------~~~~~Tvl~~l~~n~pnl~i~~~peL~~--a~g~g~~~~~~~~~~~~~---v~~~~~~~~~~~~~~~~~~p 342 (382) +.+|.-|+ .|-++-|...-.+.. ..+.+++-..+..+...+ ..+. .|-+...... T Consensus 201 ~~g~~~~~~~~~G~~v~-----~~~GiPi~~~d~ip~~~~~~~~~gtTsIya~r~Ge~~~~~Gv-~Gl~~~~~~g----- 269 (310) T protein:vir:97 201 GASINEVVELPSGAEVP-----AYSGTPIFRNDYIPTNQTKGGTTGCTTIFAGTLDDGSRTHGI-AGLTATQAAG----- 269 (310) T ss_pred CCCCCCccccCCCCEEe-----eeCCeEEEEeCccCCCccccccCCceeEEEEeeCccccccce-eccccCCccc----- Confidence 12333332 233444443322111 011111112222222222 1111 1211111000 Q ss_pred hhhhccc-c-eecCCceEeccccceeeeEeeccchheeecCC Q lcl|NC_017674. 343 SKFITLG-V-EKRAKSYVEDFSNGTAGALCKRPWAVVRYLGI 382 (382) Q Consensus 343 ~~~~~l~-~-~~~~~~~~~~~~~~t~Gv~i~~P~aia~~~GI 382 (382) ...+..+ + ++...+|.+.. ..|+.+.-|.|++.+.|| T Consensus 270 lsVr~~G~~~~~~v~~~~V~~---Y~~~av~~~~A~a~L~~V 308 (310) T protein:vir:97 270 IQVVDVGESEDSDEHIWRVKW---YCGLALFSEKGLACADGI 308 (310) T ss_pred eeEEeCCcccCCcceeEEEEE---eeeEEEecccceeeeccc Confidence 0111112 1 11123444433 369999999999999999 No 85 >protein:vir:107882 Length: 307 # NCBI annotation: gp34 # Family: family:all:908 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024707;genbank:gi:48696944;genbank:GeneID:2845970 Probab=88.54 E-value=0.032 Score=28.80 Aligned_cols=269 Identities=10% Similarity=0.054 Sum_probs=114.4 Q ss_pred hhhhcccccCcccccchhHHHHH-HhhhhhhheeccccccchhhhCccccCCCcceeeEEEEeeecccce------eecc Q lcl|NC_017674. 60 GSAMDSNFTAPVTTPSIPTPIQF-LQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEPAGTA------VEYG 132 (382) Q Consensus 60 ~~amDa~~~~~~t~~~~~~~~~~-l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e~~G~a------~~yg 132 (382) ++.|....+ .+| +|..+ +.|-.+ .+-+..+||....+-..-...+|+ .-+-. ..-| T Consensus 1 m~~~~~~~~---~dp---~LT~~A~gy~n~--------~~ia~~l~P~vpv~~~~~k~~~f~---~eaF~~~~t~r~~~~ 63 (307) T protein:vir:10 1 MGRLSKLRI---VDP---VLTNLAIGYTNA--------EFIGQSLMPVVEVEKEGGKIPKFG---KESFRLYKTERALRA 63 (307) T ss_pred CCCCCCCcc---cCh---hHHHHHHhhcch--------hhhhhhcCCcccccccccceeeEC---cccccchhhhcccCC Confidence 233332211 111 11111 122222 356677888766655444444443 21100 0111 Q ss_pred cccCCceeeeeeeeeEeeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHH----HHHHHhhccEEEEeeccCCcccce Q lcl|NC_017674. 133 DHTNIPLTSWNANFERRTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAA----IGLEIFRNAIGFYGWQSGLGNRTY 208 (382) Q Consensus 133 d~~DiP~vd~~~~~~~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr----~a~~~~~n~i~~~Gd~~g~~~g~~ 208 (382) +.+-+ +... ........-+.+..+.++ .+.++....++.+++...++ +..|...-++++.... T Consensus 64 ~~~~v---~~~~-~~~~~~~~~~~~L~~~id-~r~~~~~~~~~~~~av~~l~d~I~l~~E~~~A~l~~~~~~-------- 130 (307) T protein:vir:10 64 RSNRM---NPED-LGSIDIVLDEHDLEYPID-YREDQESAFPLEQAAVQTATEAIQLRREKMVADLAQNPNS-------- 130 (307) T ss_pred Cccee---eccc-ccccccccccccccccCC-hhhcCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCccc-------- Confidence 11111 1000 000011111111222221 12344455665555444433 3335544555554421 Q ss_pred EEEeCCCCcceeccCCCCccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhccc---------cCCC Q lcl|NC_017674. 209 GFLNDPNLPAFQTPPSQGWSTADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSV---------TTPY 279 (382) Q Consensus 209 GllN~P~l~~~~~~a~~~Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~---------t~~~ 279 (382) .|+-...+.+.+..|.+++ -+++.||.+...++...++- .|++++|....+..|.. .+.. T Consensus 131 ----y~~~~k~tLsGt~~Wsd~~-sDPi~di~~~~~ai~~~~g~------~Pn~~vlg~~a~~al~~hp~i~e~lk~~~~ 199 (307) T protein:vir:10 131 ----YAGGNKKQLSATEKFTAAG-SDPVGVIEDGKEAIRTKIGR------RPNTMVIGASAYKTLKAHPQLIEKIKYSMK 199 (307) T ss_pred ----cCCCceEEeccccccCCCC-CCcHHHHHHHHHHHHhhhCC------ccceEEeCHHHHHHHhcCHHHHHHhCCccc Confidence 1111111222345798876 45789999999999887762 47799999999998863 1222 Q ss_pred C-ccHHHHHHHhcCccEEEEcccc--ccccCCCCC--ceeEEEEcchhhhhhhccccccchhhhhhhhhhhhcccceecC Q lcl|NC_017674. 280 G-ISVSDWIEQTYPKMRIVSAPEL--SGVQMKAQE--PEDALVLFVEDVNAAVDGSTDGGSVFSQLVQSKFITLGVEKRA 354 (382) Q Consensus 280 ~-~Tvl~~l~~n~pnl~i~~~peL--~~a~g~g~~--~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~ 354 (382) + +| .+.|++-+ .++.+.+-+- ..+.+.-.. +-+++.+|++..... +.. +...|.- -.-.+..+ T Consensus 200 g~it-~~~la~ll-~v~~i~vg~a~~~~~~~~~~~iw~~~~vl~yv~~~~~~--~~~------~~~epsf--GyT~~~~g 267 (307) T protein:vir:10 200 GIVT-VDLLKEIF-EVENIAVGEAIYADDKDRFTDIWGANIVLAYVPLQRGG--QQR------TPYEPSY--GYTLRKKG 267 (307) T ss_pred cccC-HHHHHHHh-CceeEEEeeeeeeccCCccceeCCCceEEEecccccCC--CCC------ccccccc--ceeEEEcC Confidence 3 34 34555443 4544444332 122111000 223555555443210 000 0111210 11223445 Q ss_pred CceEeccccceeeeE------eeccchheeecCC Q lcl|NC_017674. 355 KSYVEDFSNGTAGAL------CKRPWAVVRYLGI 382 (382) Q Consensus 355 ~~~~~~~~~~t~Gv~------i~~P~aia~~~GI 382 (382) ..+..++.+. +|+. ++.|.-++.-.|. T Consensus 268 ~~~~d~~~~~-~~~~~~r~~~~~~~~i~~~~~G~ 300 (307) T protein:vir:10 268 NPVVDTRIED-GKLELVRSTDIFRPYLLGADAGY 300 (307) T ss_pred CeEeeceecC-CceeEEeccccccceeecccccc Confidence 5555555553 4443 3456666666665 No 86 >protein:vir:1239 Length: 274 # NCBI annotation: similar to phage B1 major head protein # Family: family:all:522 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510938;genbank:gi:17426272;genbank:GeneID:927376 Probab=87.23 E-value=0.04 Score=28.24 Aligned_cols=251 Identities=12% Similarity=0.025 Sum_probs=123.0 Q ss_pred hcccccCcccccchhHHHHHHhhhhhhheeccccccchhhhCccccCCCc-ceeeEEEEeeecccceeecccccCCceee Q lcl|NC_017674. 63 MDSNFTAPVTTPSIPTPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSW-EDQEIVQGIVEPAGTAVEYGDHTNIPLTS 141 (382) Q Consensus 63 mDa~~~~~~t~~~~~~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~-~~~t~t~~v~e~~G~a~~ygd~~DiP~vd 141 (382) |=.. ....+++-+|..+..++..++. ..+....+..++.+..- --.+++++.+...|.+..|.++++++.-. T Consensus 1 ma~~---~T~l~d~iiPev~~~~v~~~~~----~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~~~ 73 (274) T protein:vir:12 1 MAQG---LTKTSNQIIPEVLAPMMQAQLE----KKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDI 73 (274) T ss_pred CCcc---eeehhhhhchHHHHHHHHHHHH----hhhhhcccceecccccCCCCCEEEEeeecCCCccccccCCCccchhh Confidence 1111 1233456688888888776543 33344444444433211 13788999999999999999999999999 Q ss_pred eeeeeeEeeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCcceec Q lcl|NC_017674. 142 WNANFERRTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAFQT 221 (382) Q Consensus 142 ~~~~~~~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~~~ 221 (382) .........+...+-+++++. +.+++. +-++..+-...+..++...+++-.+ .. ++..... T Consensus 74 lt~~~~~~~i~~~~~~~~i~D--~~~~~~-~~d~~~~~~~q~~~~~a~~vd~~~l-~~-----------~~~a~~~---- 134 (274) T protein:vir:12 74 LETKKREAKIRKIAKGTSITD--EALLSG-YGDPQGEQVRQHGLAHANKVDNDVL-EA-----------LMGAKLT---- 134 (274) T ss_pred cccceeeEEeeeecceeeecH--HHHHhc-ccchHHHHHHHHHHHHHHHHHHHHH-HH-----------Hhccccc---- Confidence 999999999988766655554 444444 4455555556666666666665322 11 1100000 Q ss_pred cCCCCccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhccccC--------CCCccHHHHHHH---- Q lcl|NC_017674. 222 PPSQGWSTADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSVTT--------PYGISVSDWIEQ---- 289 (382) Q Consensus 222 ~a~~~Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~t~--------~~~~Tvl~~l~~---- 289 (382) .. .+++. ++.|++++..+-.. +..+..|+++|..+..|.+-. +++.. .+.. T Consensus 135 ~~----~~a~~---~d~i~dA~~~lgd~-------~~~~~~ivv~p~~~~~L~k~~~~~fv~~s~~g~~---~~~~G~ig 197 (274) T protein:vir:12 135 VN----ADITK---LNGLQSAIDKFNDE-------DLEPMVLFINPLDAGKLRGDASTNFTRATELGDD---IIVKGAFG 197 (274) T ss_pred cc----ccccC---HHHHHHHHHHhccc-------cccccEEEeCHHHHHHHHhhhhhhcccccccccc---ceecccce Confidence 00 11111 34444455444221 114568999999998886421 11211 1110 Q ss_pred hcCccEEEE---ccccccccCCCCCceeEEEEcchhhhhhhccccccchhhhhhhhhhhhcccceecCCceEe-ccccce Q lcl|NC_017674. 290 TYPKMRIVS---APELSGVQMKAQEPEDALVLFVEDVNAAVDGSTDGGSVFSQLVQSKFITLGVEKRAKSYVE-DFSNGT 365 (382) Q Consensus 290 n~pnl~i~~---~peL~~a~g~g~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~~~~~~-~~~~~t 365 (382) .|-+++|+. +|+.. .|++.+.--.. +... +... ...+..+...- =..-.. T Consensus 198 ~~~G~~Vi~s~~~p~~t------------~~l~~~gA~~~----------~~~~-~~~v---E~~Rd~~~~~d~i~~~~~ 251 (274) T protein:vir:12 198 EALGAIIVRSNKLEAGT------------AILAKKGAVKL----------ILKR-DFFL---EVARDASTKTTALYSDKH 251 (274) T ss_pred eecCeeEEEeCCCCcce------------EEEEeccceee----------eecC-Ccee---ccccchhhcccEEEeeeE Confidence 123444443 23221 12222111000 0000 0000 00011111100 111134 Q ss_pred eeeEeeccchheeecCC Q lcl|NC_017674. 366 AGALCKRPWAVVRYLGI 382 (382) Q Consensus 366 ~Gv~i~~P~aia~~~GI 382 (382) +||-+.+|..++.+..= T Consensus 252 y~~~~~~~~~vv~~t~~ 268 (274) T protein:vir:12 252 YVAYLYDESKAVKITKG 268 (274) T ss_pred EEEEEEcCCceEEEEcC Confidence 57777777766666644 No 87 >protein:vir:4856 Length: 293 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049396;genbank:gi:9632424;genbank:GeneID:1258532 Probab=84.63 E-value=0.059 Score=27.32 Aligned_cols=267 Identities=12% Similarity=-0.034 Sum_probs=124.9 Q ss_pred hhcccccCcccccchhHHHHHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEEeee-cccceeecccccCCcee Q lcl|NC_017674. 62 AMDSNFTAPVTTPSIPTPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVE-PAGTAVEYGDHTNIPLT 140 (382) Q Consensus 62 amDa~~~~~~t~~~~~~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e-~~G~a~~ygd~~DiP~v 140 (382) .+.+...+..+.++.-+|..+.+ +|++.+......+.+..+..... ......+.... ..+.+.+.+....+|-. T Consensus 1 ~l~~~~~~t~~~gg~liP~~~~~----~Ii~~~~~~~~l~~~~~~~~~~~-~~g~~~~~~~~~~~~~a~~v~Eg~~~~~~ 75 (293) T protein:vir:48 1 MLDSKTDHSGSDAGLTIPQDIRT----AINTLVRQYDSLQEYVNVENVTT-LTGSRVYEKWTDITGLANIDDEAGKIADI 75 (293) T ss_pred CceeecccccCcCceEechhHHH----HHHHHHHhhhhhhhhceeeeccC-CcceEEEEeecCCCcceeeecCCcccccc Confidence 11211111111112225665554 55555555544555443322111 11233344443 45667888888888854 Q ss_pred -eeeeeeeEeeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCcce Q lcl|NC_017674. 141 -SWNANFERRTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAF 219 (382) Q Consensus 141 -d~~~~~~~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~ 219 (382) +....+..-..+.++..+.+|.+=++ ....+|.+.-....++++...+|+-.+.|... . + T Consensus 76 ~~~~~~~i~l~~~k~~~~~~iS~ell~---ds~~~l~~~i~~~la~~~~~~~~~~i~~g~~~---~-------------~ 136 (293) T protein:vir:48 76 DDPKLSLIKYTIKRYAGISTVTNSLLA---DSAENILAWLSGWIAKKVVVTRNKAILGVVDK---L-------------P 136 (293) T ss_pred cccceeEEEEeeeEEEEeehhhHHHHh---hhhHHHHHHHHHHHHHHHHHHHHhHHhhcccc---c-------------c Confidence 56788888899999999998875554 34567888888888888888888877777521 0 0 Q ss_pred eccCCCCccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhccc-cCCCCccHHHH-HHHhcCccEEE Q lcl|NC_017674. 220 QTPPSQGWSTADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSV-TTPYGISVSDW-IEQTYPKMRIV 297 (382) Q Consensus 220 ~~~a~~~Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~-t~~~~~Tvl~~-l~~n~pnl~i~ 297 (382) . .....| ++||.+++.++...- ..+ ..++|.++.+..|.. .+..|.-+++= +....+ -+|. T Consensus 137 ~-----~~~~~~----~d~i~~~~~~l~~~~----~~~---a~~vmn~~~~~~L~~lkd~~g~~l~~~~~~~~~~-~~l~ 199 (293) T protein:vir:48 137 T-----KPTLTK----WDDIIDLEAKVDPAI----KQT---SFFLTNTSGFTALKKVKNALGDYLMERDVKSPTG-YSIA 199 (293) T ss_pred c-----cccccC----HHHHHHHHHhhhhhh----cCC---CEEEEcHHHHHHHHHhhccCCceEeecCcCCCCC-ceec Confidence 0 011122 456777777664321 122 368999999988854 23333322210 111111 0111 Q ss_pred Eccc--ccc-ccCCCCCce-eEEE-Ecchhhhhhhccccccchhhhhhhhhhhhccc---ceecCCceEeccccceeeeE Q lcl|NC_017674. 298 SAPE--LSG-VQMKAQEPE-DALV-LFVEDVNAAVDGSTDGGSVFSQLVQSKFITLG---VEKRAKSYVEDFSNGTAGAL 369 (382) Q Consensus 298 ~~pe--L~~-a~g~g~~~~-~~~~-~~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~---~~~~~~~~~~~~~~~t~Gv~ 369 (382) ..|= ... .-.....+. .+++ .+.+-+.. .+. +.+. +.... .....-....-+..|.+| . T Consensus 200 G~Pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~-~~~---------~~~~--i~~~~~~~~~~~~~~~~~r~~~r~d~-~ 266 (293) T protein:vir:48 200 GFAVKEISDRWLPNASSGVMPLYFGDLKQAVTL-FDR---------QQMS--LLSTNIGGGAFETDTTKVRVIDRFDV-V 266 (293) T ss_pred ceeeEEecccccCCccCCceEEEEEeccceEEE-EEe---------cceE--EEEecccchhhhcCeEEEEEEEeeCc-E Confidence 1110 000 000001111 1111 11111100 000 0000 00000 000011122344455544 5 Q ss_pred eeccchheeecCC Q lcl|NC_017674. 370 CKRPWAVVRYLGI 382 (382) Q Consensus 370 i~~P~aia~~~GI 382 (382) +++|.||+.+..= T Consensus 267 ~~~~~a~~~l~~~ 279 (293) T protein:vir:48 267 ATDTEAFVPASFK 279 (293) T ss_pred EecccceEEEEee Confidence 6889999977643 No 88 >protein:vir:102119 Length: 404 # NCBI annotation: phage major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699941;genbank:gi:110804052;genbank:GeneID:4206662 Probab=84.20 E-value=0.032 Score=28.74 Aligned_cols=328 Identities=9% Similarity=-0.004 Sum_probs=136.1 Q ss_pred CCCcceeee------ecCcccccccccccc-c-hHHHHHHhhcceeccccchhhhhhhhcccccchhhhhhcccccCccc Q lcl|NC_017674. 1 MSQISKTHS------RLAGRNAKPFDLKNI-T-NDAVASLSRIGLVFDHAVVQDQIKALAKAGAFRSGSAMDSNFTAPVT 72 (382) Q Consensus 1 ~~~~~~~~~------~~~~~~~~~~~~~~~-~-~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~amDa~~~~~~t 72 (382) +.|+...+. .+.....++..-..- . ....++..+.. -.+..++....- .........+. . ..+ T Consensus 44 ~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~----~~~~~~e~~a~-~-~~~ 114 (404) T protein:vir:10 44 QAKIEAQKRKENIENNFNEDNVKSLNTGKEENVIYNGALFVRAI---ADNLLKQKNQRG----LNLSEKEINAI-S-ENI 114 (404) T ss_pred HHHHHHHHHHHHHHHHHhhhhccccccccchhhHHHHHHHHHHH---HHHHHHHHHhhh----hcchhhHHhhh-c-ccc Confidence 122111000 011111111111000 0 00000000000 000000000000 00000000111 1 111 Q ss_pred ccchh--HHHHHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEEeeecccceeecccccCCcee--eeeeeeeE Q lcl|NC_017674. 73 TPSIP--TPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEPAGTAVEYGDHTNIPLT--SWNANFER 148 (382) Q Consensus 73 ~~~~~--~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e~~G~a~~ygd~~DiP~v--d~~~~~~~ 148 (382) +++.| +|..+ .++|++.+........++++.....- ...+.|........+.+.+.+..+|.. +....... T Consensus 115 ~~~gg~~vP~~~----~~~ii~~~~~~~~l~~l~~~~~~~~~-~g~~~~~~~~~~~~~~~v~e~~~~~~~~~~~~f~~i~ 189 (404) T protein:vir:10 115 DEDGGYAVPEDI----QTKINTRLKDTTDLYNMVDYEPVFTR-SGSRTYEKRSKQKPMKPLSENQQIPTNGDNGKLERFN 189 (404) T ss_pred CCCCceeechhH----HHHHHHHHhhhhhHhhhhceeeccCC-ccceEEEEecCCcceeeccccccccccccccceeeeE Confidence 22223 44433 34666666666666666665443211 123444454445556666666677664 34566667 Q ss_pred eeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCcceeccCCCCcc Q lcl|NC_017674. 149 RTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAFQTPPSQGWS 228 (382) Q Consensus 149 ~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~~~~a~~~Wa 228 (382) ..++.++..+.+|.+=+. ....++.+.-.....+++...+|+-+++|+.. .+...|+++.+.+.+..+.. T Consensus 190 ~~~~k~~~~~~iS~ell~---ds~~~l~~~i~~~la~~~~~~~~~~il~G~g~--~~~~~gi~~~~~~~~~~~~~----- 259 (404) T protein:vir:10 190 FKLKDLADFMSIPNDLLK---FADKSLEDWIINWFVDKVRITRNAEILYGAGG--DEHATGIMTANKFKKITLPK----- 259 (404) T ss_pred eeheeeEeeehhhHHHHh---hcHHHHHHHHHHHHHHHHHHHHHHHHhhcCCC--CCcccceeeccccceeeccc----- Confidence 777888888888884332 33457888888888888999999999999753 34567999887765333221 Q ss_pred ccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhcccc-CCCCccHHH-HHHHhcC----ccEEEEcccc Q lcl|NC_017674. 229 TADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSVT-TPYGISVSD-WIEQTYP----KMRIVSAPEL 302 (382) Q Consensus 229 ~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~t-~~~~~Tvl~-~l~~n~p----nl~i~~~peL 302 (382) ..+ ++|+..+++.... .+ +.++ ..++|.|..+..|.+. +..|.-++. -+....| +..++.++.. T Consensus 260 ~~~----~~~~~~~~~~~l~-~~--~~~~---~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~~~~~ 329 (404) T protein:vir:10 260 SPA----LKDFKKCKNVELL-NV--FKAT---SSWIVNQDGFNYLDSLEDKTGRPYLQPDPKDPTQYRFLGLPVIELPND 329 (404) T ss_pred ccc----HHHHHHHHHhhhh-cc--ccCC---CEEEEcHHHHHHHHHhhccCCceeeccCcCCCCCccccceeeEEeccc Confidence 123 3455545442211 11 1222 3678999888888542 333332221 0111111 1122223221 Q ss_pred ccccCCCCCceeEEEE-cchhhhhhhccccccchhhhhhhhhhhhcccceec--------CCceEeccccceeeeEeecc Q lcl|NC_017674. 303 SGVQMKAQEPEDALVL-FVEDVNAAVDGSTDGGSVFSQLVQSKFITLGVEKR--------AKSYVEDFSNGTAGALCKRP 373 (382) Q Consensus 303 ~~a~g~g~~~~~~~~~-~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~~~~--------~~~~~~~~~~~t~Gv~i~~P 373 (382) ...++ +++. .+++- +.+-+.. + .+.. +.++.. .-....-+..|. |+.+.+| T Consensus 330 ~~~~~-~~~~-~~~~gd~s~~~~~-----------~---~~~~---~~i~~~~~~~~~~~~~~~~~~~~~r~-d~~v~~~ 389 (404) T protein:vir:10 330 LLLST-ESAI-PVLLGDTKEAYKY-----------V---SDGA---YELATTNIGAGAFETNTTKARIIMRI-DGNVKDS 389 (404) T ss_pred ccCCC-CCcc-EEEEEeccccEEE-----------E---Eecc---eEEEEeccccchhhcCceEEEEEEee-ccEEecc Confidence 11111 1111 11111 1110000 0 0000 011110 011122333343 5577888 Q ss_pred chheeecCC Q lcl|NC_017674. 374 WAVVRYLGI 382 (382) Q Consensus 374 ~aia~~~GI 382 (382) .||+.++=- T Consensus 390 ~a~~~~~~~ 398 (404) T protein:vir:10 390 EALLIAEIP 398 (404) T ss_pred cceEEEEee Confidence 888766544 No 89 >protein:vir:105334 Length: 276 # NCBI annotation: putative phage major capsid protein # Family: family:all:522 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950669;genbank:gi:119967839;genbank:GeneID:4643213 Probab=83.68 E-value=0.066 Score=27.04 Aligned_cols=254 Identities=12% Similarity=0.048 Sum_probs=126.3 Q ss_pred cccccCcccccchhHHHHHHhhhhhhheeccccccchhhhCccccCCC-cceeeEEEEeeecccceeecccccCCceeee Q lcl|NC_017674. 64 DSNFTAPVTTPSIPTPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGS-WEDQEIVQGIVEPAGTAVEYGDHTNIPLTSW 142 (382) Q Consensus 64 Da~~~~~~t~~~~~~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~-~~~~t~t~~v~e~~G~a~~ygd~~DiP~vd~ 142 (382) -|+. ..+.+++-+|.-|..|+..++-+ ......+..++++.. ---.+++++.++..|.+..+++++++|.-.+ T Consensus 1 Ma~~--~T~l~d~i~Pev~~~~v~~~~~~----~~~~~~~~~~~~~l~g~~G~ti~iP~~~~igda~~~~eg~~i~~~~l 74 (276) T protein:vir:10 1 MAQG--TTTKSTQIVPEVLAPMMQAELDK----KLRFAQFADIDSTLVGQPGDTLTFPAFVYSGDATVVPEGQKIPVDKI 74 (276) T ss_pred CCcc--eeehhhhhchHHHHHHHHHHHHh----hhhhcccceecccccCCCCCEEEeeeecCCCccccccCCCccCcccc Confidence 1111 12334555777777777654422 233344444444321 1236899999999999999999999999999 Q ss_pred eeeeeEeeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCcceecc Q lcl|NC_017674. 143 NANFERRTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAFQTP 222 (382) Q Consensus 143 ~~~~~~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~~~~ 222 (382) ........+..++-++.++.++. .+ .+.+.-.+-.+.+..++...+++-.+ .. ++.-... .+ T Consensus 75 t~~~~~a~i~~~~k~~~~tD~a~--~~-~~~dp~~~~~~~~~~~~a~~~d~~~~-~~-----------l~~~~~~--~~- 136 (276) T protein:vir:10 75 ETNRREAKIHKIGKGTDITDEAL--LS-GYGDPQGEAVRQHGLAIANKVDNDVL-EA-----------LRGTKLT--VS- 136 (276) T ss_pred ccceeeEEeehccccccccHHHH--Hh-hccchHHHHHHHHHHHHHHHHHHHHH-HH-----------Hhccccc--cc- Confidence 99999999988776666665443 33 35556555556666666666664322 11 1100000 00 Q ss_pred CCCCccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhcccc--------CCCCccHHHHHHH----h Q lcl|NC_017674. 223 PSQGWSTADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSVT--------TPYGISVSDWIEQ----T 290 (382) Q Consensus 223 a~~~Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~t--------~~~~~Tvl~~l~~----n 290 (382) + ..-| ++.|.+++..+-.. +..+..|++.|..+..|.+- ++++-. .+.. . T Consensus 137 ~----~~~t----~d~i~~A~~~lgd~-------~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~---~~~~G~ig~ 198 (276) T protein:vir:10 137 A----DIGT----LAGLEAAIDTFDDE-------DLEPMVLFINPKDAGKLRSSASDNFTRATELGDN---IIVKGAFGE 198 (276) T ss_pred c----cccC----HHHHHHHHHHhccc-------cCcccEEEEcHHHHHHHHHhcccccccccccccc---ceeccccce Confidence 0 0123 34444555444322 11356899999999888431 111111 1110 1 Q ss_pred cCccEEEEccccccccCCCCCceeEEEEcchhhhhhhccccccchhhhhhhhhhhhcccceecCCceEecc-ccceeeeE Q lcl|NC_017674. 291 YPKMRIVSAPELSGVQMKAQEPEDALVLFVEDVNAAVDGSTDGGSVFSQLVQSKFITLGVEKRAKSYVEDF-SNGTAGAL 369 (382) Q Consensus 291 ~pnl~i~~~peL~~a~g~g~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~~~~~~~~-~~~t~Gv~ 369 (382) |-+++|+.-+.+. .+ ..|++.+.--... +.-+... ...+..+...-.. .-..+|+- T Consensus 199 ~~G~~Vi~s~~~p-------~~--t~~l~~~gAi~~~-----------~~~~~~v---E~dRd~~~~~d~i~~~~~y~~~ 255 (276) T protein:vir:10 199 ALGAVIVRSKKLD-------EG--EAILAKRGAVKLI-----------TKRDFFL---ETDRDPSTKTTALYSDKHYVAY 255 (276) T ss_pred ecceeEEEcCCCC-------cc--eEEEEeccceeee-----------ecCCcee---ecccchhhcccEEEEeeEEEEE Confidence 2245555433321 01 2233332211110 0000000 0111111111111 11356888 Q ss_pred eeccchheeecCC Q lcl|NC_017674. 370 CKRPWAVVRYLGI 382 (382) Q Consensus 370 i~~P~aia~~~GI 382 (382) +..|..++.+.=- T Consensus 256 ~~~~~~vv~~t~~ 268 (276) T protein:vir:10 256 LYDESKAVKVTKG 268 (276) T ss_pred EEcCcceEEEecC Confidence 9999877776633 No 90 >protein:vir:101607 Length: 379 # NCBI annotation: major capsid protein precursor # Family: family:all:585 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112497;genbank:gi:53793597;uniprot:Q5ZGF6;genbank:GeneID:3101715 Probab=82.71 E-value=0.044 Score=28.01 Aligned_cols=316 Identities=9% Similarity=-0.003 Sum_probs=129.1 Q ss_pred CCCcceeeeecCccccccccc--cccchHHHHHH----hhcceeccc-c-chhhhhhhhcccccc--hhhhhhcccccCc Q lcl|NC_017674. 1 MSQISKTHSRLAGRNAKPFDL--KNITNDAVASL----SRIGLVFDH-A-VVQDQIKALAKAGAF--RSGSAMDSNFTAP 70 (382) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~l----~~~g~~~~~-~-~~~~~~~~~~~~~~~--~~~~amDa~~~~~ 70 (382) |+..-+. .+..+.. ..+. ..+.++ ++-+-.-.. . ..+............ .....+.++. +. T Consensus 39 ~~~~~~~-------~~~e~~~~~~~l~-~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~ 109 (379) T protein:vir:10 39 MTSEKDL-------AVNELKSDMAALQ-AHADKLDVKLKEKAKSEDKSDSLVKSITENFNDIKEVRNGKSIQVKAVG-DM 109 (379) T ss_pred hhHHHHH-------HHHHHHHHHHHHH-HHHHHHHHHHHhcccccccchhHHHHHHHHHHhHHHHHhhhhhhhhhhc-cc Confidence 1100000 0000000 0000 001111 111100000 0 000000000000000 0001222221 11 Q ss_pred ccccch--hHHHHHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEEeeecccce--eecccccCCceeeeeeee Q lcl|NC_017674. 71 VTTPSI--PTPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEPAGTA--VEYGDHTNIPLTSWNANF 146 (382) Q Consensus 71 ~t~~~~--~~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e~~G~a--~~ygd~~DiP~vd~~~~~ 146 (382) .+..+. .+|.. +.+.|++.+......+.++.+.+... .++.|......+.+ .+.+.+...|..+..... T Consensus 110 ~~~~~~~~~ip~~----~~~~ii~~~~~~~~i~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~f~~ 182 (379) T protein:vir:10 110 TLPVNLTGAQPKD----YNFDVVLNPSQMLNVSDIVGAVSISG---GTYTFVRENGAGEGAIGAQVEGATKGQKDYDISM 182 (379) T ss_pred ccCCCCccccchh----hhhHHHHhHHhhhhHHhhceeeeccC---CceEEEEeecCCCcccccccCCccccccccceee Confidence 222222 23333 34466666666666666665544432 45667666544433 345777889999999999 Q ss_pred eEeeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCcceeccCCCC Q lcl|NC_017674. 147 ERRTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAFQTPPSQG 226 (382) Q Consensus 147 ~~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~~~~a~~~ 226 (382) ....++.++..+.+|.+=|+-+. .|.+--....++++...+|.-.+.|+.. .++.+. .... + T Consensus 183 i~~~~~k~~~~~~iS~ell~D~~----~l~~~i~~~la~~~~~~~~~~~~~g~~~---~~~~~~---------~~~~--~ 244 (379) T protein:vir:10 183 IDVNTDFIAGFTRYSKKMANNLP----FLTSFIPNALRRDYAKAENAAFNAVLAA---NATAST---------EIIT--N 244 (379) T ss_pred eEeeeeeEEeeehhhHHHHhhHH----HHHHHHHHHHHHHHHHHHHHHHhccccc---cccccc---------cccc--C Confidence 99999999999999975444332 3666666666667777777655555421 111110 0011 1 Q ss_pred ccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhcccc-CCCCccHHH--HHHHh-----cCccEEEE Q lcl|NC_017674. 227 WSTADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSVT-TPYGISVSD--WIEQT-----YPKMRIVS 298 (382) Q Consensus 227 Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~t-~~~~~Tvl~--~l~~n-----~pnl~i~~ 298 (382) ..+ ++||.+++..+... + . .+..++|.|..+..|... +..|.-++. ...++ .-++.++. T Consensus 245 --~~~----~d~i~~~~~~~~~~--~-~----~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~l~G~pvv~ 311 (379) T protein:vir:10 245 --KNK----VEMLINEIAKQENL--D-F----PVTAIVLRPTDYYDILVTQKSVGAGYGLPGVVTQDNGVLRINGIPLFR 311 (379) T ss_pred --ccc----HHHHHHHHHhhhhc--c-C----CCCEEEEcHHHHHHHHHhhccCCceeccCCccCCCCCcceecceeeEe Confidence 112 45666666555432 1 1 234688999888887532 333433321 00001 00122222 Q ss_pred ccccccccCCCCCceeEEEEcchhhhhhhccccccchhhhhhhhhhhhcccc-eecCCceEeccccceeeeEeeccchhe Q lcl|NC_017674. 299 APELSGVQMKAQEPEDALVLFVEDVNAAVDGSTDGGSVFSQLVQSKFITLGV-EKRAKSYVEDFSNGTAGALCKRPWAVV 377 (382) Q Consensus 299 ~peL~~a~g~g~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~-~~~~~~~~~~~~~~t~Gv~i~~P~aia 377 (382) -+.+. .|..++-.|.. ... .+.+.+...+..... ....-...+-++.|. |+.|++|.||+ T Consensus 312 s~~~~-------ag~~~~gdf~~-~~~----------~~~~~~~i~~~~~~~~~f~~~~~~~r~~~R~-~~~v~~p~a~v 372 (379) T protein:vir:10 312 ATWLA-------ANKYYVGDWTR-VTK----------VTTEGLSLEFSEVEGTNFVKNNITARIEAQV-ALAVEQPAALI 372 (379) T ss_pred cCCCC-------CCceEEeeccc-EEE----------EEEeceEEEEeecccccccCCcEEEEEEEEe-ccEEecCccEE Confidence 22211 11111111111 000 000000000000000 001112223344455 66677899999 Q ss_pred e--ecCC Q lcl|NC_017674. 378 R--YLGI 382 (382) Q Consensus 378 ~--~~GI 382 (382) + ..+| T Consensus 373 ~~~~~~~ 379 (379) T protein:vir:10 373 FGDFTAV 379 (379) T ss_pred EEEecCC Confidence 8 7788 No 91 >protein:vir:79078 Length: 307 # NCBI annotation: gp8 # Family: family:all:908 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111208;genbank:gi:134288798;genbank:GeneID:4960752 Probab=82.50 E-value=0.076 Score=26.71 Aligned_cols=272 Identities=10% Similarity=0.043 Sum_probs=110.6 Q ss_pred hhhhcccccCcccccchhHHHHHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEEeeecccce------eeccc Q lcl|NC_017674. 60 GSAMDSNFTAPVTTPSIPTPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEPAGTA------VEYGD 133 (382) Q Consensus 60 ~~amDa~~~~~~t~~~~~~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e~~G~a------~~ygd 133 (382) ++.|....+ +. +.|..+.. -|+ -+++-++.|||....+...-...+|. .-+-. ..-|+ T Consensus 1 m~~~~~~~~---~d---p~LT~~A~-----gy~--n~~~Iad~lfP~vpV~~~~~k~~~f~---~e~f~~~~t~ra~~~~ 64 (307) T protein:vir:79 1 MGRLSKLRI---VD---PVLTNLAI-----GYT--NAEFIGQTLMPVVEVEKEGGKIPKFG---KESFRLYQTERALRAK 64 (307) T ss_pred CCCCCCCcc---cC---HHHHHHHh-----hcc--chhhhhhhcCCcccccccccceeeec---cccccccccccccCCC Confidence 233332211 11 11211111 111 23466778888766555333334442 11100 01111 Q ss_pred ccCCceeeeeeeeeEeeEEEEEEEEEecHHHHHHHHHhCCChHHHHHH----HHHHHHHHhhccEEEEeeccCCcccceE Q lcl|NC_017674. 134 HTNIPLTSWNANFERRTIVRGELGMMVGTLEEGRASAIRLNSAETKRQ----QAAIGLEIFRNAIGFYGWQSGLGNRTYG 209 (382) Q Consensus 134 ~~DiP~vd~~~~~~~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~----aAr~a~~~~~n~i~~~Gd~~g~~~g~~G 209 (382) .+.+...+ ++.....+. +.+..+.++ .+..+..+.++.+++.. .+.+..|...-+++|-+.+ | T Consensus 65 ~~~v~~~~--~~~~~~~~~--~~~l~~~id-~r~~~~~~~~~~~~Av~~l~d~I~l~~E~~~A~l~~~~~~-------y- 131 (307) T protein:vir:79 65 SNRMNPED--IDSVDVNLD--EHDLEYPID-YREDQESAFPLEQAAVQTATDAIQLRREKMIADLSQNPSS-------Y- 131 (307) T ss_pred cceeeeec--ccccccccc--ccchhhccc-chhcCCCCCCHHHHHHHHHHHHHHhHHHHHHHHHhccccc-------c- Confidence 11111100 111110111 111122111 12233445555444333 3445556666666665432 1 Q ss_pred EEeCCCCcceeccCCCCccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhccc---------cCCCC Q lcl|NC_017674. 210 FLNDPNLPAFQTPPSQGWSTADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSV---------TTPYG 280 (382) Q Consensus 210 llN~P~l~~~~~~a~~~Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~---------t~~~~ 280 (382) |+-...+.+.+..|.+++ -+++.||.+...++...++- .|++++|....+..|.+ .+..+ T Consensus 132 ----~~~~k~tLsgt~~Wsd~~-sDPi~di~~~~~ai~~~~g~------~Pn~~vlg~~a~~~l~~h~~i~~~lk~~~~g 200 (307) T protein:vir:79 132 ----AAGNKKQLSATEKFTAAN-SDPVGVIEDGKEAIRTKIGR------RPNTMVIGASAYKTLKAHPQLIEKIKYSMKG 200 (307) T ss_pred ----CCCceEEEccCcccCCCC-CCcHHHHHHHHHHHHHhhCC------ccceEEeCHHHHHHHhcCHHHHHHhcCcccc Confidence 211122222345698876 45789999999999887762 47899999999998853 12223 Q ss_pred ccHHHHHHHhcCccEEEEcccc--ccccCCCC--CceeEEEEcchhhhhhhccccccchhhhhhhhhhhhcccceecCCc Q lcl|NC_017674. 281 ISVSDWIEQTYPKMRIVSAPEL--SGVQMKAQ--EPEDALVLFVEDVNAAVDGSTDGGSVFSQLVQSKFITLGVEKRAKS 356 (382) Q Consensus 281 ~Tvl~~l~~n~pnl~i~~~peL--~~a~g~g~--~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~~~ 356 (382) +--.++|++-+ .++.+.+-+- ..+.+.-. -+-+++..|++..... ....... |. .-.-.+..+.. T Consensus 201 ~it~~~la~l~-~v~~V~vg~a~y~~~~~~~~~iw~~~~~l~y~~~~~~~-~~~~~~~-------ps--~Gyt~~~~g~~ 269 (307) T protein:vir:79 201 IVTVDLLKEIF-EVENIAVGEAIYADDKDRFTDIWGANIVLAYVPLQRGG-QQRTPYE-------PS--YGYTLRKKGNP 269 (307) T ss_pred ccCHHHHHHHh-CceeEEEeeeeeecccccchhcCCCceEEEecccccCC-CCCcccc-------cc--cceeEEecCce Confidence 32245666543 3443333221 22221110 1224555555432210 0000000 10 00111222332 Q ss_pred eEeccccc-----eeeeEeeccchheeecCC Q lcl|NC_017674. 357 YVEDFSNG-----TAGALCKRPWAVVRYLGI 382 (382) Q Consensus 357 ~~~~~~~~-----t~Gv~i~~P~aia~~~GI 382 (382) ...++.+. ....++..|.-++.-.|. T Consensus 270 ~~d~~~~~~~~~~vrv~~~~~~~i~~~~~G~ 300 (307) T protein:vir:79 270 VVDTRIEDGKLELVRATDIFRPYLLGADAGY 300 (307) T ss_pred EEecccCCCceeEEeecccccceeeccccch Confidence 33333331 222234556666655555 No 92 >protein:vir:99888 Length: 309 # NCBI annotation: capsid protein # Family: family:all:908 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164075;genbank:gi:56692607;genbank:GeneID:3192616 Probab=80.42 E-value=0.095 Score=26.19 Aligned_cols=271 Identities=13% Similarity=0.058 Sum_probs=119.7 Q ss_pred hhhhhcccccchhhhhhcccccCcccccchhHHHHH-HhhhhhhheeccccccchhhhCccccCCCcceeeEEEEeeecc Q lcl|NC_017674. 47 QIKALAKAGAFRSGSAMDSNFTAPVTTPSIPTPIQF-LQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEPA 125 (382) Q Consensus 47 ~~~~~~~~~~~~~~~amDa~~~~~~t~~~~~~~~~~-l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e~~ 125 (382) |...+ .--|. .|..+ +.|-.+ .+-+++|||....+-..-...+|+-.|.- T Consensus 1 ~~~~~---------~~~dp------------~LT~~A~gy~n~--------~~Ia~~l~P~vpV~~~~~~~~~f~~~e~F 51 (309) T protein:vir:99 1 MSNAP---------FPIDP------------ELTAIAIAYRNG--------RMISDEVLPRVPVGKQEFKFWKYDLAQGF 51 (309) T ss_pred CCCCC---------cCcCH------------hHHHHHhhccCh--------hhhhhhcCCccccCccccceeeechhhcc Confidence 10000 00011 11111 222233 34567888887666544444444432211 Q ss_pred ccee-ecccccCCceeeeeeeeeEeeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHH----HHhhccEEEEeec Q lcl|NC_017674. 126 GTAV-EYGDHTNIPLTSWNANFERRTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGL----EIFRNAIGFYGWQ 200 (382) Q Consensus 126 G~a~-~ygd~~DiP~vd~~~~~~~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~----~~~~n~i~~~Gd~ 200 (382) -... ..+=..+--.++.........+...+.-+-+...|...|. .+.++.++....++..+ |...-++++.-. T Consensus 52 ~~~~t~r~~~~~~~~v~~~~~~~~~~~~~~~L~~~i~~~~~~~a~-~~~d~~~~Av~~l~~~i~l~rE~~~A~lv~~~a- 129 (309) T protein:vir:99 52 TVPETLVGRKSKPNEVEFSATDETGSTEDHGLDAPVPQADIDNAP-TNYNPLGHATEQTTNLILLDREARTSKLVFSPN- 129 (309) T ss_pred cccchhhccCCCcceEeecccCceeeecccceeecCCchhhhhcc-CCCCHHHHHHHHHHHHHHHHHHHHHHHHhcChh- Confidence 0000 0011122234555555555555566666666666665443 35666665555444433 333344433321 Q ss_pred cCCcccceEEEeCCCCcceeccCCCCccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhccc----- Q lcl|NC_017674. 201 SGLGNRTYGFLNDPNLPAFQTPPSQGWSTADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSV----- 275 (382) Q Consensus 201 ~g~~~g~~GllN~P~l~~~~~~a~~~Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~----- 275 (382) |.|.=..-+.+.+..|.+++.| ++.||.+....+ |. .|++++|....+.+|.+ T Consensus 130 -----------~y~~~~k~~Lsgt~~wsd~~SD-Pi~~i~~~~~~~-----g~-----~PN~~vlg~~~~~~l~~hp~i~ 187 (309) T protein:vir:99 130 -----------SYAAGNKTTLSGADQWSDPTSN-PLPVITDALDSV-----IL-----RPNIGVLGRRTATILRRHPKIV 187 (309) T ss_pred -----------hcCCCceEEecCccccCCCCCC-cHHHHHHHHHhh-----CC-----CcceEEechHHHHHHhhCHHHH Confidence 1121111122223468886644 678888887664 32 47799999999988853 Q ss_pred ----cCC--CCccHHHHHHHhcCcc-EEEEcc-cccccc-C---CCCC--ceeEEEEcchhhhhhhccccccchhhhhhh Q lcl|NC_017674. 276 ----TTP--YGISVSDWIEQTYPKM-RIVSAP-ELSGVQ-M---KAQE--PEDALVLFVEDVNAAVDGSTDGGSVFSQLV 341 (382) Q Consensus 276 ----t~~--~~~Tvl~~l~~n~pnl-~i~~~p-eL~~a~-g---~g~~--~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~ 341 (382) .+. .++--.++|++-|- + +|..-- -+..+. + .-.. +-+++.+|.+.-....++ ++-+.+|.- T Consensus 188 ~~ik~~~~~~g~it~~~la~l~~-ve~V~vg~a~~n~a~~g~~~~~~~iwg~~~~L~y~~~~~~~~~~-ps~G~t~~~-- 263 (309) T protein:vir:99 188 KAYNGSLGDEGMVPMAFLQELLE-LDAIYIGEARLNIARPGQNPNLIRAWGPHASFIYRDRLADTRNG-TTFGLTAQW-- 263 (309) T ss_pred HHhcCCCccccccCHHHHHHHhC-cceEEeecceeeccccccccccccccCCcEEEEEcCCCCCCccc-ccccceeec-- Confidence 111 12223577777653 3 222110 111111 1 0000 223455554432211111 111111100 Q ss_pred hhhhhcccceecCCceEeccccceeeeEee-----ccchheeecCC Q lcl|NC_017674. 342 QSKFITLGVEKRAKSYVEDFSNGTAGALCK-----RPWAVVRYLGI 382 (382) Q Consensus 342 p~~~~~l~~~~~~~~~~~~~~~~t~Gv~i~-----~P~aia~~~GI 382 (382) . .+....|..|+...-||-.|| .|.-++.-.|. T Consensus 264 -------~-~r~~g~~~d~~~~~~g~~~vr~~~~~k~~i~~~d~G~ 301 (309) T protein:vir:99 264 -------G-DRVSGSIADPNIGLRGGQRVRVGESVKELVTAPDLGF 301 (309) T ss_pred -------c-cccCCceeeeeeccCCceEEEEeccccchhcchhcch Confidence 0 123445666666655554443 56666666665 No 93 >protein:vir:95107 Length: 270 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240822;genbank:gi:66394683;genbank:GeneID:5133901 Probab=79.61 E-value=0.079 Score=26.63 Aligned_cols=251 Identities=12% Similarity=-0.022 Sum_probs=121.8 Q ss_pred cccccCcccccchhHHHHHHhhhhhhheeccccccchhhhCccccCCCc-ceeeEEEEeeecccceeecccccCCceeee Q lcl|NC_017674. 64 DSNFTAPVTTPSIPTPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSW-EDQEIVQGIVEPAGTAVEYGDHTNIPLTSW 142 (382) Q Consensus 64 Da~~~~~~t~~~~~~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~-~~~t~t~~v~e~~G~a~~ygd~~DiP~vd~ 142 (382) -| ....+++-+|--|..|+..++-+ -.+...+..+++...- -=.+++++.++..|.+..+.++++++.-.. T Consensus 1 Ma----~T~~~d~I~Pev~~~~V~e~~~~----~~~~~~~~~~d~~L~g~~G~ti~~P~~~~igdae~~~eg~~i~~~~l 72 (270) T protein:vir:95 1 MT----QTKKANLINPEVLANVVSAQMQN----AIRFTPYAVTDDTLVGQPGDTITRPKYAYIGAAEDLQEGVAMDTTQM 72 (270) T ss_pred CC----ceehhhhcchHHHHHHHHHHHHh----HHhhccccccccccCCCCCCEEEeeeecCCCccccccCCCccchhhc Confidence 01 12234555777777777655422 2233344444444221 137899999999999999999999999999 Q ss_pred eeeeeEeeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCcceecc Q lcl|NC_017674. 143 NANFERRTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAFQTP 222 (382) Q Consensus 143 ~~~~~~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~~~~ 222 (382) .......++...+-+++++.+ .+....+ +...+-.....+.+..++++..+ +. . . |... +. T Consensus 73 t~~~~~a~i~~~gk~~~itD~--a~~~~~~-dp~~~~~~q~a~~~a~~~d~~li-~~-l---~---~a~~--------~~ 133 (270) T protein:vir:95 73 SMTTTKVTVKETGKAVEVTQT--AIITNVN-GTLQEASRQLAMSLADKVEIDYI-AE-L---N---KSKQ--------TA 133 (270) T ss_pred ccchheeeeehhhCcceecHH--HHhhhcc-chHHHHHHHHHHHHHHHHHHHHH-HH-h---c---cccc--------cc Confidence 999999999998777766654 3333334 44444444455555555554322 21 0 1 1100 00 Q ss_pred CCCCccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhccccC-----CCCccHHHHHHH-h---cCc Q lcl|NC_017674. 223 PSQGWSTADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSVTT-----PYGISVSDWIEQ-T---YPK 293 (382) Q Consensus 223 a~~~Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~t~-----~~~~Tvl~~l~~-n---~pn 293 (382) + ..-|. ++|++++..+ +. . ...+..|++.|..+..|.+-. .++.. .+.. . |-+ T Consensus 134 -~---~~~t~----~~~~dA~~~l----gd-~--~~~~~~i~vhs~~~~~Lrk~~~~~~~~~~~~---~~~~G~ig~~~G 195 (270) T protein:vir:95 134 -T---VSADA----TGILDAIEVF----NS-E--NDEDYVLYVNPKDYNKLVKSLFKVGGNVQDR---AISKGDLVEIVG 195 (270) T ss_pred -c---cccCH----HHHHHHHHHh----cc-c--cCCCcEEEEcHHHHHHHHhhhcccccccccc---hhcccccceecc Confidence 0 01233 4444444433 11 1 124678999999998885421 11111 1111 0 223 Q ss_pred cEE-EEccccccccCCCCCceeEEEEcchhhhhhhccccccchhhhhhhhhhhhcccceecCCceEec-cccceeeeEee Q lcl|NC_017674. 294 MRI-VSAPELSGVQMKAQEPEDALVLFVEDVNAAVDGSTDGGSVFSQLVQSKFITLGVEKRAKSYVED-FSNGTAGALCK 371 (382) Q Consensus 294 l~i-~~~peL~~a~g~g~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~~~~~~~-~~~~t~Gv~i~ 371 (382) +++ +.- .+. +.+ ..|++.+..-+ +- ...... ...++..+...-. ..-..+||-++ T Consensus 196 ~~Viv~s----~~~---~~~--~~~l~~~gAi~-----------~~--~~~~~~-vEtdRd~~~~~d~i~~~~~y~v~~~ 252 (270) T protein:vir:95 196 VSDIVKS----KRV---SEN--TAFLQRYGAME-----------IV--NKKKPE-AYTDFDILKRTHLLSTNYHYSVNLK 252 (270) T ss_pred eeEEEeC----CCC---Cce--eEEEEecccee-----------ee--ecCCce-eeeccchhhcccEEEeeeEEEEEEE Confidence 342 221 111 111 23444332111 10 111111 0111111111111 12246778888 Q ss_pred ccchheeecCC Q lcl|NC_017674. 372 RPWAVVRYLGI 382 (382) Q Consensus 372 ~P~aia~~~GI 382 (382) .|..++.++== T Consensus 253 ~~skvv~~t~~ 263 (270) T protein:vir:95 253 DETGVVKVTFK 263 (270) T ss_pred ccceEEEEEec Confidence 88777755322 No 94 >protein:vir:4830 Length: 397 # NCBI annotation: MPL-7201 # Family: family:all:21 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038327;genbank:gi:9634653;genbank:GeneID:1262632 Probab=76.19 E-value=0.14 Score=25.29 Aligned_cols=308 Identities=10% Similarity=0.002 Sum_probs=128.9 Q ss_pred CCCcceeeeecCcccccccc--ccccchHHHHHHhhcceeccccchhhhhhhhcccccchhhhhhcccccCcccccchh- Q lcl|NC_017674. 1 MSQISKTHSRLAGRNAKPFD--LKNITNDAVASLSRIGLVFDHAVVQDQIKALAKAGAFRSGSAMDSNFTAPVTTPSIP- 77 (382) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~amDa~~~~~~t~~~~~- 77 (382) +...... ....+..++.. .........+.++++ +... ...+..+. ...++.+.| T Consensus 63 ~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------------~~~~----~~~~~~~~--~~~t~~~gg~ 119 (397) T protein:vir:48 63 ARANEVV--NMSEEEKKPLTKSEEEVKAGFVKDFKNL---------------VRGR----YQNLLDSK--TDASGSDAGL 119 (397) T ss_pred HHHhhhh--hhhhhccccccchhhHHHHHHHHHHHHH---------------Hhhh----hhHHHHHh--hccCCccccc Confidence 1110000 00111111111 111111111111111 0000 00011111 112222333 Q ss_pred -HHHHHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEEeeecccceeecccccCCceee-eeeeeeEeeEEEEE Q lcl|NC_017674. 78 -TPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEPAGTAVEYGDHTNIPLTS-WNANFERRTIVRGE 155 (382) Q Consensus 78 -~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e~~G~a~~ygd~~DiP~vd-~~~~~~~~~v~~~~ 155 (382) +|..+. ++|++.+........++++.......-....+...+..+.+.+.+....+|-.+ .......-.++.++ T Consensus 120 ~iP~~~~----~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~~~~~k~~ 195 (397) T protein:vir:48 120 TIPQDIQ----TAIHTLVRQYDSLQEYVNVENVTTLTGSRVYEKWADITGLAKLDDEAGSIGTNDDPKLYPIRYAIKRYA 195 (397) T ss_pred cccHHHH----HHHHHHHHHHHHHHhhhceeeccCCcceEEEEeecCCCcceeeeccccccccccccceeeEEeeheeee Confidence 555544 467777666666666665543332222222233334556677777777888654 67888888899999 Q ss_pred EEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCcceeccCCCCccccCHHHH Q lcl|NC_017674. 156 LGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAFQTPPSQGWSTADWAGI 235 (382) Q Consensus 156 ~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~~~~a~~~Wa~kT~~eI 235 (382) ..+.+|.+=++. ...++.+.-.....+++...+|+-.+.|+..+ . +.. .. .+ T Consensus 196 ~~~~iS~ell~d---s~~~l~~~v~~~l~~~~~~~~d~~il~G~g~~--~-----------~~~---~~-----~~---- 247 (397) T protein:vir:48 196 GISTVTNSLLAD---SAENILAWLSGWIAKKVVVTRNKAILEAIATL--P-----------TKP---TL-----TK---- 247 (397) T ss_pred eehhhHHHHHhh---chHHHHHHHHHHHHHHHHHHHHHHHhhccccc--c-----------ccc---cc-----cc---- Confidence 999888765543 34577877788888888889999999997321 0 000 00 12 Q ss_pred HHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhcccc-CCCCccHHHH-HHHhcCccEEEEcccc----ccccCCC Q lcl|NC_017674. 236 IGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSVT-TPYGISVSDW-IEQTYPKMRIVSAPEL----SGVQMKA 309 (382) Q Consensus 236 ~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~t-~~~~~Tvl~~-l~~n~pnl~i~~~peL----~~a~g~g 309 (382) ++||.+++.+|...- .. ...+++.+..+..|... +..|.-++.- +...-+ -+|...|=. ......+ T Consensus 248 ~d~i~~~~~~l~~~~----~~---~a~~v~n~~~~~~L~~lkd~~G~~i~~~~~~~~~~-~~l~G~PV~~~~~~~~~~~~ 319 (397) T protein:vir:48 248 WDDIIDLQAKVDPAI----KQ---TSFFLTNTSGFTALKKVKNAFGDYLMERDVKSPTG-YSIDGFAVKEVADRWLANAS 319 (397) T ss_pred HHHHHHHHHHhhhhh----cC---CCEEEECHHHHHHHHHhhcCCCceeeccCcCCCCC-ceeccceeEEecccccCCcC Confidence 345555655554321 11 23688999999888642 3334333210 011101 011111100 0000011 Q ss_pred CCceeEEE-Ecchhhhhhhccccccchhhhhhhhhhhhccc-ceecCCceEeccccceeeeEeeccchheeec--CC Q lcl|NC_017674. 310 QEPEDALV-LFVEDVNAAVDGSTDGGSVFSQLVQSKFITLG-VEKRAKSYVEDFSNGTAGALCKRPWAVVRYL--GI 382 (382) Q Consensus 310 ~~~~~~~~-~~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~-~~~~~~~~~~~~~~~t~Gv~i~~P~aia~~~--GI 382 (382) .+...+++ .+.+-+.. + + ...+....-.+. .....-....-+..|.+ +.++.|.+|+..+ +. T Consensus 320 ~~~~~~~~gd~~~~~~~---~-~------~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d-~~~~~~~a~~~~~~~~~ 385 (397) T protein:vir:48 320 SGAMPLYFGDLKQAVTL---F-D------RQQMSLLSTNIGGGAFETDTTKIRVIDRFD-VVATDTESFVPASFKAI 385 (397) T ss_pred CCceEEEEEeccceEEE---E-e------ecceEEEEeccchhhhhcCceeEEEEeeec-cEEecccceEEEEeccc Confidence 11111111 11111100 0 0 000000000000 00001112223444444 4567788886554 33 No 95 >protein:vir:94933 Length: 330 # NCBI annotation: putative phage structural protein # Family: family:all:1120 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239278;genbank:gi:66392060;genbank:GeneID:5076578 Probab=74.86 E-value=0.15 Score=25.05 Aligned_cols=302 Identities=12% Similarity=0.036 Sum_probs=125.3 Q ss_pred ccccccccccchHHHHHHhhcceeccccchhhhhhhhcccccchhhhhhcccccCcccccchhHHHHHHhhhhhhheecc Q lcl|NC_017674. 15 NAKPFDLKNITNDAVASLSRIGLVFDHAVVQDQIKALAKAGAFRSGSAMDSNFTAPVTTPSIPTPIQFLQTWLPGFVKVM 94 (382) Q Consensus 15 ~~~~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~amDa~~~~~~t~~~~~~~~~~l~~idp~v~~~~ 94 (382) -|| |--|. . -.-|..-.+-...++|=| .|.+..+.++. ..+..+++|.+ T Consensus 1 ~~~-------------------~~~~~--~---~~~~~~~~~~~p~l~m~a-----lTLaea~~l~~--d~~~~~VIE~l 49 (330) T protein:vir:94 1 MVR-------------------ICTPP--L---RGRWRTLTHQFPELKMPT-----VTLAESAKLSQ--DHLVSGLIETI 49 (330) T ss_pred Cce-------------------ecCCc--c---ccceeehhccccccchhh-----hhhhHHhhcCc--hhhHHHHHHhh Confidence 000 00000 0 000000000001122221 22222332221 22345677777 Q ss_pred ccccchhhhCcccc-CCCcceeeEEEEeeecccceeecccccCC-ceeeeeeeeeEeeEEEEEEEEEecHHHHHHHHHhC Q lcl|NC_017674. 95 TAARKIDEIIGIDT-VGSWEDQEIVQGIVEPAGTAVEYGDHTNI-PLTSWNANFERRTIVRGELGMMVGTLEEGRASAIR 172 (382) Q Consensus 95 ~~~~~~~~l~~v~t-~g~~~~~t~t~~v~e~~G~a~~ygd~~Di-P~vd~~~~~~~~~v~~~~~g~~y~~~El~~A~~~g 172 (382) ...-...+.+|-+. +++ .+.|.....-+.+.+..=+.-+ |.-.....+.+..+..++..+++..+ -|...| T Consensus 50 ~~~s~iL~~lpf~~ve~~----~~~~~r~~~lp~a~~r~~n~~~~~~~~~Tf~q~t~~l~~l~~~~~Vd~~---iadl~g 122 (330) T protein:vir:94 50 VEVNPLYEMMPFTEIEGN----ALAYNRENVLGDVQFLAVGGTITAKNPATFTKVTSELTTLIGDAEVNGL---IQATRS 122 (330) T ss_pred hccchHHhhcccccccCC----cceeeeeecCCcceeeeccccccccCcceeeeeeechhhhhhhHHHHHH---HHHhcC Confidence 66666667776432 222 3444443333333322111111 21122223333345555555444432 233444 Q ss_pred C--ChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCcceeccCC-CCccccCHHHHHHHHHHHHHHHHHh Q lcl|NC_017674. 173 L--NSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAFQTPPS-QGWSTADWAGIIGDIREAVRQLRIQ 249 (382) Q Consensus 173 ~--~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~~~~a~-~~Wa~kT~~eI~~Di~~~~~~l~~~ 249 (382) - ++.........+++.+++....++||.. .+++.||++ ++.....-.+ ..-+.-| ++|+.+|+..++.. T Consensus 123 ~~~d~~~~q~~~~ieal~~~~e~~linGDs~--~~~F~GL~~--~~~~~q~i~tg~~gg~~T----~d~LDeLl~~v~~~ 194 (330) T protein:vir:94 123 DFMDQTSVQVASKAKSIGRQYQASMITGDGT--GNSFQGMMG--LVAASQTISAGANGGTLT----FELLDQLLDLVKDK 194 (330) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHhhccCCC--Cccccchhh--cCCcccEEecCCCCCCCC----HHHHHHHHHHhcCC Confidence 3 4445555567778888899999999854 367789976 3322222111 1122334 57888888888765 Q ss_pred cCCeeeeccccceEecCHHHHhhcc---c-c----------CCCCccHHHHHHHhcCccEEEEccccc-cccCCCCCcee Q lcl|NC_017674. 250 SQDQIDPKAEKITLALATSKVDYLS---V-T----------TPYGISVSDWIEQTYPKMRIVSAPELS-GVQMKAQEPED 314 (382) Q Consensus 250 t~g~~~~~~~p~~L~Lp~~~~~~Ls---~-t----------~~~~~Tvl~~l~~n~pnl~i~~~peL~-~a~g~g~~~~~ 314 (382) -+ .|..|+++......+. + . +.+|.-|+. |-++-|...--+. +++.+..++.. T Consensus 195 ~g-------~~~~~l~n~a~~r~I~a~~R~~~~~~v~~~~~~~~G~~v~~-----~~GvPi~~~d~ip~~~~~~~~~~tt 262 (330) T protein:vir:94 195 DG-------QVDYLMSSFAMRRKYFSLLRALGGAAIGEVMTLPSGRQIPT-----YRGVPWFVNDFIPSNMTQGTATNAT 262 (330) T ss_pred CC-------CCcEEEechhHHHHHHHHHHhccCCCCCCcccccCCCEEee-----eCCeEEEecccccCCCCcccCCCce Confidence 43 3567877665444442 2 1 122332222 2233333321111 11111112222 Q ss_pred EEE-Ecc-hh-hhhhhccccccchhhhhhhhhhhhccc-ce-ecCCceEeccccceeeeEeeccchheeecCC Q lcl|NC_017674. 315 ALV-LFV-ED-VNAAVDGSTDGGSVFSQLVQSKFITLG-VE-KRAKSYVEDFSNGTAGALCKRPWAVVRYLGI 382 (382) Q Consensus 315 ~~~-~~~-~~-v~~~~~~~~~~~~~~~~~~p~~~~~l~-~~-~~~~~~~~~~~~~t~Gv~i~~P~aia~~~GI 382 (382) .+| +.. ++ ...-+.|-+..... -..+ +..+ .+ +.-.+|.+.. ..|+.+.-|.|++.+.|| T Consensus 263 sIyav~~G~~~~~qgV~Gl~~~g~~-glsV----r~~G~~~~k~v~~~~v~~---y~~~av~~~~a~~~L~~V 327 (330) T protein:vir:94 263 AIFAGTFDDGSNKYGIAGLTARGSA-GLRV----QNVGAKENADETITRVKM---YCGFANFSQLGLAAIKGL 327 (330) T ss_pred eEEEEeecccccccceEeecCCCCC-ccee----eeCCCccccceeeEEEEE---eeeeEEechhheeeeccc Confidence 222 221 21 11111121111100 0000 1111 11 1123344433 358889999999999999 No 96 >protein:vir:3158 Length: 321 # NCBI annotation: capsid protein gpE # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665929;genbank:gi:22091115;genbank:GeneID:951342 Probab=74.82 E-value=0.15 Score=25.04 Aligned_cols=294 Identities=10% Similarity=-0.009 Sum_probs=113.7 Q ss_pred cCccccccccccccchHHHHHHhh-cceeccccchhhhhhhhcccccchhhhhhcccccCcccccchhHHHHHHhhhhhh Q lcl|NC_017674. 11 LAGRNAKPFDLKNITNDAVASLSR-IGLVFDHAVVQDQIKALAKAGAFRSGSAMDSNFTAPVTTPSIPTPIQFLQTWLPG 89 (382) Q Consensus 11 ~~~~~~~~~~~~~~~~~~~~~l~~-~g~~~~~~~~~~~~~~~~~~~~~~~~~amDa~~~~~~t~~~~~~~~~~l~~idp~ 89 (382) |+.| ..++ .++.+++ .++..+ |++ .+..++-.+.+.+.-. T Consensus 1 ~~~k--------~~~~-~l~~~~~~~~~~~~-----------------------~~~-------~g~~v~~~~~~~l~~~ 41 (321) T protein:vir:31 1 MASR--------TINN-DLSRITEKNALTVD-----------------------DLD-------AGGTLPDPLWDEFWTD 41 (321) T ss_pred CchH--------HHHH-HHHHHHHhcccccc-----------------------ccC-------CcceeCHHHHHHHHHH Confidence 1111 0110 0011111 111110 111 1111333444433222 Q ss_pred heeccccccchhhhCccccCCCcceeeEEEEeeecccceeecccc--cCCceeeeeeeeeEeeEEEEEEEEEecHHHHHH Q lcl|NC_017674. 90 FVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEPAGTAVEYGDH--TNIPLTSWNANFERRTIVRGELGMMVGTLEEGR 167 (382) Q Consensus 90 v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e~~G~a~~ygd~--~DiP~vd~~~~~~~~~v~~~~~g~~y~~~El~~ 167 (382) +.+- .+-++...++|++.. . .....++..|.+...++. ...+..+...+......+....-..++.+-|. T Consensus 42 i~e~-s~~l~~i~v~~v~~~---~---~~i~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~k~~~~~~it~e~L~- 113 (321) T protein:vir:31 42 MIEE-TPLLDAIRTETVGAK---K---TRIPTLNIGERHRRPQDEGEWNENESDVSTGTIDISTEKATVAWDLPREVVQ- 113 (321) T ss_pred HHHh-hhhhhhceeeeccCc---c---eeeeeeccCCcccccccccccccccccceeeeeeeeeEEEEeehhccHHHHH- Confidence 3221 122333333443211 1 111222222222222221 12233344444445555566655566665443 Q ss_pred HHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCc---ccceEEEeCCCCcceeccCCCCcccc-CHHHHHHHHHHHH Q lcl|NC_017674. 168 ASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLG---NRTYGFLNDPNLPAFQTPPSQGWSTA-DWAGIIGDIREAV 243 (382) Q Consensus 168 A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~---~g~~GllN~P~l~~~~~~a~~~Wa~k-T~~eI~~Di~~~~ 243 (382) ....+-++.+.-.....+++...++.++|+|+..... .---|+|+.+.-.+. +..++.. .. ++++.+++ T Consensus 114 d~a~~~d~e~~i~~~ia~~~a~~~~~~~~nGd~~~~~~~~~~n~G~l~~a~~~~~----~~~~~~~~~~---~d~l~~l~ 186 (321) T protein:vir:31 114 ENPEGEALADRILNLMTDAWSADVEDLAANGDEDAEDSFENQNDGFITVAEGDVE----TIDAADDILD---NDLVIRTI 186 (321) T ss_pred hhhcchhHHHHHHHHHHHHHHHHHHhheeeccccCCCcccccchhhhhhhccccc----cccccccccC---HHHHHHHH Confidence 3344678999999999999999999999999843211 011366654321111 1112111 11 23334444 Q ss_pred HHHHHhcCCeeeeccccceEecCHHHHhhccc--cC-C--CCccHHHHH-HHhcCccEEEEccccccccCCCCCceeEEE Q lcl|NC_017674. 244 RQLRIQSQDQIDPKAEKITLALATSKVDYLSV--TT-P--YGISVSDWI-EQTYPKMRIVSAPELSGVQMKAQEPEDALV 317 (382) Q Consensus 244 ~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~--t~-~--~~~Tvl~~l-~~n~pnl~i~~~peL~~a~g~g~~~~~~~~ 317 (382) ..|-.. ++.+ .-...+|....+..+.. .+ + .+...+.-- ..++=++.++.+|.+-. +. +++ T Consensus 187 ~~l~~~----yr~~-~~~v~im~~~~~~~~~~~l~~~~~~~~~~~l~~~~~~tl~G~pvv~~~~mP~-------~~-il~ 253 (321) T protein:vir:31 187 AGLDSK----YRAR-MNPALIVSEDQLLSYHYTLTDRDTPLGDNVIMGEADVNPFSFPIIGSGLWPD-------DK-AMF 253 (321) T ss_pred HhccHh----HhcC-CCeEEEechHHHHHHHHHHhcCCCccccchhhccccccccceeEEEcCCCCC-------Cc-EEE Confidence 443211 1111 11356677765543311 11 1 111111110 01233455666666532 11 111 Q ss_pred Ecchhhhhhhccccccchhhhhhhhhhhhccccee--cCCceEeccccceeeeEeeccchheeecCC Q lcl|NC_017674. 318 LFVEDVNAAVDGSTDGGSVFSQLVQSKFITLGVEK--RAKSYVEDFSNGTAGALCKRPWAVVRYLGI 382 (382) Q Consensus 318 ~~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~~~--~~~~~~~~~~~~t~Gv~i~~P~aia~~~GI 382 (382) -.-++.. . .+.+.+..+...-..+. +...+.--++ +--|.+|..+.+++.+.|| T Consensus 254 t~~~nl~---~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~ve~~~a~a~~~~i 309 (321) T protein:vir:31 254 TDPQNLI---Y-------ALYRDLEIDVLTESDKVSERDLHARYFMR-GDDDFAIENTEAVVLAEGL 309 (321) T ss_pred eccccEE---E-------EEeeccEEEEeecCccccccceeeEeeee-eecceeEeccccEEEEecC Confidence 1111110 0 01111111111111111 1112222222 2257888999999999999 No 97 >protein:vir:3845 Length: 395 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050151;swissprot:trembl:q9t1f6;genbank:gi:9633043;uniprot:Q9T1F6;genbank:GeneID:1262163 Probab=74.65 E-value=0.12 Score=25.73 Aligned_cols=310 Identities=11% Similarity=-0.029 Sum_probs=126.1 Q ss_pred CC-----------------------------------------------------------C-cceeeeecCcccccccc Q lcl|NC_017674. 1 MS-----------------------------------------------------------Q-ISKTHSRLAGRNAKPFD 20 (382) Q Consensus 1 ~~-----------------------------------------------------------~-~~~~~~~~~~~~~~~~~ 20 (382) |+ + .........++...+.. T Consensus 1 M~~~eL~~~~~~~~~~~~~l~e~~~~~~~~~~~~~~~~~~ee~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (395) T protein:vir:38 1 MNINQLKDAFDMAGQKVQDLEDKRAQFAIDLGNDASSHSVDDINKLNASLKNAKMAQELAKSAYEDARANLNAEPVNKKP 80 (395) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccc Confidence 00 0 00000011111111111 Q ss_pred ccccchHHHHHHhhcceeccccchhhhhhhhcccccchhhhhhcccccCcccccchh--HHHHHHhhhhhhheecccccc Q lcl|NC_017674. 21 LKNITNDAVASLSRIGLVFDHAVVQDQIKALAKAGAFRSGSAMDSNFTAPVTTPSIP--TPIQFLQTWLPGFVKVMTAAR 98 (382) Q Consensus 21 ~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~amDa~~~~~~t~~~~~--~~~~~l~~idp~v~~~~~~~~ 98 (382) ....+... . .+.+...+.+. .....+. +..++++.| +|..+.. +|++.+.... T Consensus 81 ~~~~~~~~--~------------~~~~~~~~~~~-----~~~~~~~--~~~~~~~gg~~vP~~~~~----~ii~~~~~~~ 135 (395) T protein:vir:38 81 LPVKDGKP--D------------AQAMKNQFVKD-----FKNLVTS--GTTGTGNAGLTIPEDIQL----QIRTLTRSFT 135 (395) T ss_pred cchhhhhH--H------------HHHHHHHHHHH-----HHHHHhh--ccCccCCCceecchhHhh----HHHHHHHhhc Confidence 11000000 0 00111111110 0000111 112222333 5555443 5666666555 Q ss_pred chhhhCccccCCCcceeeEEEEee-ecccceeecccccCCcee-eeeeeeeEeeEEEEEEEEEecHHHHHHHHHhCCChH Q lcl|NC_017674. 99 KIDEIIGIDTVGSWEDQEIVQGIV-EPAGTAVEYGDHTNIPLT-SWNANFERRTIVRGELGMMVGTLEEGRASAIRLNSA 176 (382) Q Consensus 99 ~~~~l~~v~t~g~~~~~t~t~~v~-e~~G~a~~ygd~~DiP~v-d~~~~~~~~~v~~~~~g~~y~~~El~~A~~~g~~l~ 176 (382) ..+.+..+.....-. ..+.+... +..+.+.+.+....+|-. ....+....+.+.++..+.+|.+=++ ....+|. T Consensus 136 ~l~~~~~~~~~~~~~-~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~---ds~~~l~ 211 (395) T protein:vir:38 136 SLESLANVENVTTSH-GSRVYEKLADITPLKDLDDESALIGDNDDPELTVVKYLIHRYAGITTVTNTLLK---DTVDNII 211 (395) T ss_pred chhhhcceeeccCCc-ceEEEEeeccCCccccccccccccccccccceeeEEeeeeeeEeehhhHHHHHh---hhHHHHH Confidence 566654433221111 12333333 333445566777778754 46778888889999988888874332 3455778 Q ss_pred HHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCcceeccCCCCccccCHHHHHHHHHHHHHHHHHhcCCeeee Q lcl|NC_017674. 177 ETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAFQTPPSQGWSTADWAGIIGDIREAVRQLRIQSQDQIDP 256 (382) Q Consensus 177 ~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~~~~a~~~Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~ 256 (382) +--......++...+|+-+++|+..+.. .+ . ..+ ++||.++++..... .+.. T Consensus 212 ~~i~~~la~~~~~~~~~~il~g~g~~~~-----------~~--~--------~~~----~~~i~~~~~~~l~~---~~~~ 263 (395) T protein:vir:38 212 QWLVNWAAKKDVVTRNAKILEVMGKAPK-----------KP--T--------ISQ----FDNIKDLENNTLDP---AIES 263 (395) T ss_pred HHHHHHHHHHHHHHHHHHHhhccccccc-----------cc--c--------ccc----HHHHHHHHHHhhhh---hhcC Confidence 8888888889999999999999743210 00 0 012 23444444322111 1122 Q ss_pred ccccceEecCHHHHhhccc-cCCCCccHHHH-HHHhcCccEEEEcccc--ccc-cCCCCCceeEEE-Ecchhhhhhhccc Q lcl|NC_017674. 257 KAEKITLALATSKVDYLSV-TTPYGISVSDW-IEQTYPKMRIVSAPEL--SGV-QMKAQEPEDALV-LFVEDVNAAVDGS 330 (382) Q Consensus 257 ~~~p~~L~Lp~~~~~~Ls~-t~~~~~Tvl~~-l~~n~pnl~i~~~peL--~~a-~g~g~~~~~~~~-~~~~~v~~~~~~~ 330 (382) + ..++|.|..+..|.. .+..|.-++.- +....|+ +|...|=+ ..+ .+.+.+...+++ .+.+.+.. .+ T Consensus 264 ~---a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~-~l~G~pV~~~~~~~~~~~~~~~~i~~gd~~~~~~i-~~-- 336 (395) T protein:vir:38 264 T---SSFITNQSGYNILSKVKDADGRYLMQPDVTSPDKY-LIDGKPVIRIADKWLPDVSGSHPLYFGDLKQGITL-FD-- 336 (395) T ss_pred C---CEEEEcHHHHHHHHHhhccCCceeeccCcCCCCcc-eeccceeEEecccccCcCCCcceEEEEeccccEEE-EE-- Confidence 2 258899998888854 34344433211 1111111 11111110 011 011111111111 11111100 00 Q ss_pred cccchhhhhhhhhhhhcc-cceecCCceEeccccceeeeEeeccchheeecCC Q lcl|NC_017674. 331 TDGGSVFSQLVQSKFITL-GVEKRAKSYVEDFSNGTAGALCKRPWAVVRYLGI 382 (382) Q Consensus 331 ~~~~~~~~~~~p~~~~~l-~~~~~~~~~~~~~~~~t~Gv~i~~P~aia~~~GI 382 (382) . +.+...+... ......-.+..-+..|. |+.+.+|.||+.++.- T Consensus 337 ~-------~~~~i~~~~~~~~~~~~~~~~~r~~~r~-d~~~~~~~a~~~~~~~ 381 (395) T protein:vir:38 337 R-------QQMQIDTTNVGAGSFEHDTTKLRFIDRF-DVQLIDDGAFAAASFK 381 (395) T ss_pred e-------cceEEEEeccccchhhcCceEEEEEEee-ccEEecccceEEEEee Confidence 0 0000000000 00000111223444444 4566779999999977 No 98 >protein:vir:4997 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049971;genbank:gi:9632943;genbank:GeneID:1262106 Probab=72.25 E-value=0.19 Score=24.60 Aligned_cols=309 Identities=11% Similarity=0.005 Sum_probs=128.9 Q ss_pred CCCcceee--------eecCccccccccc--cccchHHHHHHhhcceeccccchhhhhhhhcccccchhhhhhcccccCc Q lcl|NC_017674. 1 MSQISKTH--------SRLAGRNAKPFDL--KNITNDAVASLSRIGLVFDHAVVQDQIKALAKAGAFRSGSAMDSNFTAP 70 (382) Q Consensus 1 ~~~~~~~~--------~~~~~~~~~~~~~--~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~amDa~~~~~ 70 (382) |..+.... .-.+.+..++... +.......+.++++ +... ...+..+...+. T Consensus 53 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------------l~~~----~~~~~~~~~~~t 113 (397) T protein:vir:49 53 RDLFKEQYTEARANEVANMSEEEKKPLTKNEEEVKANFVKDFKNL---------------VRGR----YQNLLDSKTDGS 113 (397) T ss_pred HHHHHHHHHHHHHhhhhcccccccccccchhhHHHHHHHHHHHHH---------------hhcc----hhhHHHhhhccC Confidence 00000000 0000000011100 00000111111110 0000 000111110111 Q ss_pred ccccchhHHHHHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEEee-ecccceeecccccCCceee-eeeeeeE Q lcl|NC_017674. 71 VTTPSIPTPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIV-EPAGTAVEYGDHTNIPLTS-WNANFER 148 (382) Q Consensus 71 ~t~~~~~~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~-e~~G~a~~ygd~~DiP~vd-~~~~~~~ 148 (382) .+.++.-+|..+.. .|++.+........+..+..... ....+.+... +..+.+.+.+....+|..+ ....... T Consensus 114 ~~~gg~~iP~~~~~----~ii~~~~~~~~l~~~~~~~~~~~-~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~ 188 (397) T protein:vir:49 114 GSDAGLTIPQDIRT----AINTLVRQFDSLQEYVNVENVTT-LTGSRVYEKWADITGLAKLDDEGGQIGQNDDPKLSLIR 188 (397) T ss_pred CccCcceecHHHHH----HHHHHHHhhhhHhhhcceeeccC-CcceEEEEeeccCCcceeeeccccccccccccceeeeE Confidence 11122235544443 55555555555555554432221 1122334433 3446677777777888665 4678888 Q ss_pred eeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCcceeccCCCCcc Q lcl|NC_017674. 149 RTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAFQTPPSQGWS 228 (382) Q Consensus 149 ~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~~~~a~~~Wa 228 (382) -..+.++..+.+|.+=++ ....++.+.-.....+++.+.+|+-+++|+..+. | .. .. T Consensus 189 ~~~~k~~~~~~iS~ell~---ds~~~l~~~i~~~l~~~~~~~~d~ail~G~g~~~----------~---~~---~~---- 245 (397) T protein:vir:49 189 YAIKRYAGISTVTNSLLA---DSAENILAWLSGWIAKKVVVTRNKAILEAIGTLP----------N---KP---TL---- 245 (397) T ss_pred eeeeeeEeehhhHHHHHh---hhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccc----------c---cc---cc---- Confidence 888899888888875443 3346788888888888899999999999973210 1 00 00 Q ss_pred ccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhcccc-CCCCccHHH-HHHHhcCccEEEEccc--ccc Q lcl|NC_017674. 229 TADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSVT-TPYGISVSD-WIEQTYPKMRIVSAPE--LSG 304 (382) Q Consensus 229 ~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~t-~~~~~Tvl~-~l~~n~pnl~i~~~pe--L~~ 304 (382) .| ++||.+++..+...- . .+..++|.|..+..|... +..|.-++. -+....+ -+|...|= ... T Consensus 246 -~~----~d~i~~~~~~l~~~~----~---~~a~~v~n~~~~~~l~~lkd~~g~~l~~~~~~~g~~-~~l~G~pV~~~~~ 312 (397) T protein:vir:49 246 -AK----WDDIIDLQAKVDPAI----K---QTSLFLTNTSGFTALKKVKNAMGDYLMERDVKSPTG-YSIDGFVVKEISD 312 (397) T ss_pred -cC----HHHHHHHHHhhhhhh----c---CCCEEEEcHHHHHHHHHhhccCCceeecccccCCCC-ceecceeeEEecc Confidence 12 456777777664321 1 234789999999888543 333332221 0111111 11111110 000 Q ss_pred --ccCCCCCceeEEEE-cchhhhhhhccccccchhhhhhhhhhhhcccc---eecCCceEeccccceeeeEeeccchhee Q lcl|NC_017674. 305 --VQMKAQEPEDALVL-FVEDVNAAVDGSTDGGSVFSQLVQSKFITLGV---EKRAKSYVEDFSNGTAGALCKRPWAVVR 378 (382) Q Consensus 305 --a~g~g~~~~~~~~~-~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~---~~~~~~~~~~~~~~t~Gv~i~~P~aia~ 378 (382) ......+...+++- +.+.+.. + ..-...+...+. ....-....-+..|.+|. +++|.||+. T Consensus 313 ~~~~~~~~~~~~~~~gd~~~~~~~-----------~-~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~d~~-~~~~~a~~~ 379 (397) T protein:vir:49 313 RFLPNGTGGAMPLYFGDLKQAVTL-----------F-DRQHLSLLSTNIGGGAFETDTTKVRVIDRFDVV-STDTEAFVP 379 (397) T ss_pred cccccccCCceeEEEeeccceEEE-----------E-eecccEEEEeccccchhhcCeeeEEEEEeeccE-EecccceEE Confidence 00000111111111 1111100 0 000000000000 001112234455666665 678999987 Q ss_pred ecCC Q lcl|NC_017674. 379 YLGI 382 (382) Q Consensus 379 ~~GI 382 (382) ...= T Consensus 380 ~~~~ 383 (397) T protein:vir:49 380 ASFK 383 (397) T ss_pred EEec Confidence 7633 No 99 >protein:vir:78640 Length: 352 # NCBI annotation: phage capsid # Family: family:all:658 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429943;genbank:gi:156603997;genbank:GeneID:5525386 Probab=71.53 E-value=0.19 Score=24.48 Aligned_cols=314 Identities=12% Similarity=0.012 Sum_probs=131.1 Q ss_pred CCCcceeeeecCccc------------------------ccccccccc---chHHHHHHhhcceeccccchhhhhhhhcc Q lcl|NC_017674. 1 MSQISKTHSRLAGRN------------------------AKPFDLKNI---TNDAVASLSRIGLVFDHAVVQDQIKALAK 53 (382) Q Consensus 1 ~~~~~~~~~~~~~~~------------------------~~~~~~~~~---~~~~~~~l~~~g~~~~~~~~~~~~~~~~~ 53 (382) |..+.+.-.-+..-+ .++.....- ....+.+..+-+..-. ...+ .... T Consensus 1 ~eei~~l~~~~~~l~~~~~~l~~~~d~~e~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~-~~~~----~~~~ 75 (352) T protein:vir:78 1 MEDIKQLETEKAGLQQRFNIVERQVQDIEEKEKAKVKDKGEAYQSLNDNEKLVKAKAEFYRHAILPN-EFEK----PSME 75 (352) T ss_pred ChhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccchhhhHHHHHHHHHHHHhhhh-HHHH----HHhh Confidence 221111111100000 000000000 0011222222111100 0000 0000 Q ss_pred cccchhhhhhcccccCcccccchhHHHHHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEEeeecccceeeccc Q lcl|NC_017674. 54 AGAFRSGSAMDSNFTAPVTTPSIPTPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEPAGTAVEYGD 133 (382) Q Consensus 54 ~~~~~~~~amDa~~~~~~t~~~~~~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e~~G~a~~ygd 133 (382) .. ....+|- .+..+..+.-+|..+. .+|++.+......+.+..+.+.++.. ...+....+.+.+.+. T Consensus 76 ~~--~~~~al~---~~~~~~gG~lIP~~~~----~~Ii~~l~~~s~l~~~~~v~~~~~~~----~p~~~~~~~~a~~v~E 142 (352) T protein:vir:78 76 AQ--RLLHALP---TGNDSGGDKLLPKTLS----KEIVSEPFAKNQLREKARLTNIKGLE----IPRVSYTLDDDDFITD 142 (352) T ss_pred HH--HHHHHhc---cCCCCCCceeccHhHH----HHHHHHHHhhcchhhheeeEecCCce----EEEEecCCCccccccc Confidence 00 0001111 1111112222554443 35666555555566666666655432 1222334456778888 Q ss_pred ccCCceeeeeeeeeEeeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEE-EeeccCCcccceEEEe Q lcl|NC_017674. 134 HTNIPLTSWNANFERRTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGF-YGWQSGLGNRTYGFLN 212 (382) Q Consensus 134 ~~DiP~vd~~~~~~~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~-~Gd~~g~~~g~~GllN 212 (382) ...+|..+...++..-.++.++..+.+|.+=|+ ....++.+--....++++...++..+| .|+..+ .-.|.++ T Consensus 143 ~~~~~~~~~~f~~v~~~~~k~~~~i~is~ell~---Ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~~---~~~g~l~ 216 (352) T protein:vir:78 143 VETAKELKLKGDTVKFTTNKFKVFAAISDTVIH---GSDVDLVNWVENALQSGLAAKERKDALAVSPKSG---LEHMSFY 216 (352) T ss_pred ccccccccccceeeeecceeEEeechhhHHHHh---hhhHHHHHHHHHHHHHHHHHHHHHhhhhcCCCCc---cccccee Confidence 888999999999999999999998888886443 334667777777777777666666554 343221 2247777 Q ss_pred CCCCcceeccCCCCccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhc-cccCCCCccHHHHHHHhc Q lcl|NC_017674. 213 DPNLPAFQTPPSQGWSTADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYL-SVTTPYGISVSDWIEQTY 291 (382) Q Consensus 213 ~P~l~~~~~~a~~~Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~L-s~t~~~~~Tvl~~l~~n~ 291 (382) ++.+...+ ....++||.+++.+|...-. .+ -+.+|-+..+..| ...++.|..++. .- T Consensus 217 ~~~~~~~t-----------~~~~~d~i~~~~~~l~~~~~----~~---a~~~mn~~t~~~l~~~~~~~~~~~~~----~~ 274 (352) T protein:vir:78 217 NGSVKEVE-----------GANMYDAIINALADLHEDYR----DN---ATIYMRYADYVKIISVLSNGTTNFFD----TP 274 (352) T ss_pred cccccccc-----------ccchHHHHHHHHhccChhhh----cC---CEEEEehHHHHHHHHHHhccCCcccc----cC Confidence 76654321 11125666667766532211 11 1356655554444 333333443331 11 Q ss_pred CccEEEEccccccccCCCCCceeEEE-EcchhhhhhhccccccchhhhhhhhhhhhcccceecCCceEeccccceeeeEe Q lcl|NC_017674. 292 PKMRIVSAPELSGVQMKAQEPEDALV-LFVEDVNAAVDGSTDGGSVFSQLVQSKFITLGVEKRAKSYVEDFSNGTAGALC 370 (382) Q Consensus 292 pnl~i~~~peL~~a~g~g~~~~~~~~-~~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~~~~~~~~~~~t~Gv~i 370 (382) |+ ++...|=.-..+ ...+++ -|..- +...-++-+..+. +........-+..|..|. + T Consensus 275 ~~-~llG~PV~~~~~-----~~~~~~Gdf~~~--------------~~~~~~~~~~~~~-~~~~g~~~f~~~~r~Dg~-~ 332 (352) T protein:vir:78 275 AE-KVFGKPVVFTDA-----AVKPIVGDFNYF--------------GINYDGTTYDTDK-DVKKGEYLFVLTAWYDQQ-R 332 (352) T ss_pred Cc-cccccceEEecC-----CCceeEeehhhh--------------hhhhhhheeeeec-cccCCeeEEEEEeeeCce-e Confidence 21 111111111110 001111 11000 0000011111110 111223344555666666 4 Q ss_pred eccchheeecCC Q lcl|NC_017674. 371 KRPWAVVRYLGI 382 (382) Q Consensus 371 ~~P~aia~~~GI 382 (382) ++|.||+.+.-= T Consensus 333 ~~~eA~~~l~~~ 344 (352) T protein:vir:78 333 TLDSAFRIAKAK 344 (352) T ss_pred echhheEEEEee Confidence 569998665433 No 100 >protein:vir:95898 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240385;genbank:gi:66396054;genbank:GeneID:5133409 Probab=71.52 E-value=0.19 Score=24.48 Aligned_cols=253 Identities=11% Similarity=0.080 Sum_probs=125.0 Q ss_pred hcccccCcccccchhHHHHHHhhhhhhheeccccccchhhhCccccC--CCcceeeEEEEeeecccceeecccccCCcee Q lcl|NC_017674. 63 MDSNFTAPVTTPSIPTPIQFLQTWLPGFVKVMTAARKIDEIIGIDTV--GSWEDQEIVQGIVEPAGTAVEYGDHTNIPLT 140 (382) Q Consensus 63 mDa~~~~~~t~~~~~~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~--g~~~~~t~t~~v~e~~G~a~~ygd~~DiP~v 140 (382) |= + +....++.-+|..+..++.-++. ..+....+..+++. |.- -.+++++.+...|.+..|.++++++.- T Consensus 1 m~-~--~~T~l~d~i~Pev~~~~v~~~~~----~~l~~~~~~~~~~~l~g~~-G~tv~iP~~~~ig~a~~~~~g~~i~~~ 72 (274) T protein:vir:95 1 MA-Q--GMTKLTNQIVPEVLAPMMQAELE----KKLRFASFAEIDNTLVGQP-GDTLTFPAFIYSGDAKVVAEGEKIPTD 72 (274) T ss_pred CC-c--ceeehhheechHHHHHHHHHHHH----hhhhccccceecccccCCC-CCEEEeeeecCCCccccccCCCccchh Confidence 11 1 12233456677777777765543 23333333333322 211 378999999999999999999999988 Q ss_pred eeeeeeeEeeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCccee Q lcl|NC_017674. 141 SWNANFERRTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAFQ 220 (382) Q Consensus 141 d~~~~~~~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~~ 220 (382) ..........+...+-++.++ ++.+.+. +.++..+-...+..++...+++-.+ . . ++.....+ T Consensus 73 ~lt~~~~~~~i~~~~~a~~i~--D~~~~~~-~~d~~~~~~~~~~~~~a~~vd~~i~-~-~----------l~~a~~~~-- 135 (274) T protein:vir:95 73 ILETKKREAKIRKIAKGTSIS--DEALLSG-YGDPQGEQVRQHGLAHANKVDDDVL-E-A----------LKSAKLTV-- 135 (274) T ss_pred hcccceeEEEeeeeecceeeh--HHHHhhc-cchHHHHHHHHHHHHHHHHHHHHHH-H-H----------Hhcccccc-- Confidence 999999998888876665555 5544443 4455556666666677666665332 1 1 11000000 Q ss_pred ccCCCCccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhccccC--------CCCccHHHHHHH--- Q lcl|NC_017674. 221 TPPSQGWSTADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSVTT--------PYGISVSDWIEQ--- 289 (382) Q Consensus 221 ~~a~~~Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~t~--------~~~~Tvl~~l~~--- 289 (382) .+ ...+ ++.|++++..+-.. +..+..|+++|..+..|.+-. +.+.. -++. T Consensus 136 -~~----~~~~----~d~i~~A~~~lgd~-------~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~---~~~~G~i 196 (274) T protein:vir:95 136 -EA----DITK----LTGLQTAIDKFNDE-------DLEPMVLFISPLDAGKLRGDATTNFTRATELGDD---VIVKGAF 196 (274) T ss_pred -cc----cccC----HHHHHHHHHHhccc-------cccccEEEeCHHHHHHHHhhcccccccccccccc---ceecccc Confidence 00 0112 44455555544221 124668999999999885421 11111 1110 Q ss_pred -hcCccEEEEccccccccCCCCCceeEEEEcchhhhhhhccccccchhhhhhhhhhhhcccceecCCceEec-cccceee Q lcl|NC_017674. 290 -TYPKMRIVSAPELSGVQMKAQEPEDALVLFVEDVNAAVDGSTDGGSVFSQLVQSKFITLGVEKRAKSYVED-FSNGTAG 367 (382) Q Consensus 290 -n~pnl~i~~~peL~~a~g~g~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~~~~~~~-~~~~t~G 367 (382) .+-+++|+.-..+. . ...|++.+.--. .+... +... ...+..+...-. ..-..+| T Consensus 197 g~~~G~~Vi~s~~~~-------~--~t~~l~~~gA~~----------~~~~~-~~~v---E~~Rd~~~~~d~i~~~~~y~ 253 (274) T protein:vir:95 197 GEALGAVIVRSNKLE-------A--GTAILAKKGAVK----------LITKR-DFFL---ETDRDPSTKTTALYSDKHYV 253 (274) T ss_pred ceecCeEEEEeCCCC-------C--ceEEEEecccee----------eeecC-Cccc---ccccccccccCEEEEeEEEE Confidence 12234444322211 0 012222221110 00000 0000 111111111111 1124579 Q ss_pred eEeeccchheeecCC Q lcl|NC_017674. 368 ALCKRPWAVVRYLGI 382 (382) Q Consensus 368 v~i~~P~aia~~~GI 382 (382) +-+.+|..++.+.== T Consensus 254 ~~~~~~~~~v~~tk~ 268 (274) T protein:vir:95 254 AYLYDESKAVKITKG 268 (274) T ss_pred EEEEcCCcEEEEEcC Confidence 999999887776633 No 101 >protein:vir:96262 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240311;genbank:gi:66395978;genbank:GeneID:5133339 Probab=71.52 E-value=0.19 Score=24.48 Aligned_cols=253 Identities=11% Similarity=0.080 Sum_probs=125.0 Q ss_pred hcccccCcccccchhHHHHHHhhhhhhheeccccccchhhhCccccC--CCcceeeEEEEeeecccceeecccccCCcee Q lcl|NC_017674. 63 MDSNFTAPVTTPSIPTPIQFLQTWLPGFVKVMTAARKIDEIIGIDTV--GSWEDQEIVQGIVEPAGTAVEYGDHTNIPLT 140 (382) Q Consensus 63 mDa~~~~~~t~~~~~~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~--g~~~~~t~t~~v~e~~G~a~~ygd~~DiP~v 140 (382) |= + +....++.-+|..+..++.-++. ..+....+..+++. |.- -.+++++.+...|.+..|.++++++.- T Consensus 1 m~-~--~~T~l~d~i~Pev~~~~v~~~~~----~~l~~~~~~~~~~~l~g~~-G~tv~iP~~~~ig~a~~~~~g~~i~~~ 72 (274) T protein:vir:96 1 MA-Q--GMTKLTNQIVPEVLAPMMQAELE----KKLRFASFAEIDNTLVGQP-GDTLTFPAFIYSGDAKVVAEGEKIPTD 72 (274) T ss_pred CC-c--ceeehhheechHHHHHHHHHHHH----hhhhccccceecccccCCC-CCEEEeeeecCCCccccccCCCccchh Confidence 11 1 12233456677777777765543 23333333333322 211 378999999999999999999999988 Q ss_pred eeeeeeeEeeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCccee Q lcl|NC_017674. 141 SWNANFERRTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAFQ 220 (382) Q Consensus 141 d~~~~~~~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~~ 220 (382) ..........+...+-++.++ ++.+.+. +.++..+-...+..++...+++-.+ . . ++.....+ T Consensus 73 ~lt~~~~~~~i~~~~~a~~i~--D~~~~~~-~~d~~~~~~~~~~~~~a~~vd~~i~-~-~----------l~~a~~~~-- 135 (274) T protein:vir:96 73 ILETKKREAKIRKIAKGTSIS--DEALLSG-YGDPQGEQVRQHGLAHANKVDDDVL-E-A----------LKSAKLTV-- 135 (274) T ss_pred hcccceeEEEeeeeecceeeh--HHHHhhc-cchHHHHHHHHHHHHHHHHHHHHHH-H-H----------Hhcccccc-- Confidence 999999998888876665555 5544443 4455556666666677666665332 1 1 11000000 Q ss_pred ccCCCCccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhccccC--------CCCccHHHHHHH--- Q lcl|NC_017674. 221 TPPSQGWSTADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSVTT--------PYGISVSDWIEQ--- 289 (382) Q Consensus 221 ~~a~~~Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~t~--------~~~~Tvl~~l~~--- 289 (382) .+ ...+ ++.|++++..+-.. +..+..|+++|..+..|.+-. +.+.. -++. T Consensus 136 -~~----~~~~----~d~i~~A~~~lgd~-------~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~---~~~~G~i 196 (274) T protein:vir:96 136 -EA----DITK----LTGLQTAIDKFNDE-------DLEPMVLFISPLDAGKLRGDATTNFTRATELGDD---VIVKGAF 196 (274) T ss_pred -cc----cccC----HHHHHHHHHHhccc-------cccccEEEeCHHHHHHHHhhcccccccccccccc---ceecccc Confidence 00 0112 44455555544221 124668999999999885421 11111 1110 Q ss_pred -hcCccEEEEccccccccCCCCCceeEEEEcchhhhhhhccccccchhhhhhhhhhhhcccceecCCceEec-cccceee Q lcl|NC_017674. 290 -TYPKMRIVSAPELSGVQMKAQEPEDALVLFVEDVNAAVDGSTDGGSVFSQLVQSKFITLGVEKRAKSYVED-FSNGTAG 367 (382) Q Consensus 290 -n~pnl~i~~~peL~~a~g~g~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~~~~~~~-~~~~t~G 367 (382) .+-+++|+.-..+. . ...|++.+.--. .+... +... ...+..+...-. ..-..+| T Consensus 197 g~~~G~~Vi~s~~~~-------~--~t~~l~~~gA~~----------~~~~~-~~~v---E~~Rd~~~~~d~i~~~~~y~ 253 (274) T protein:vir:96 197 GEALGAVIVRSNKLE-------A--GTAILAKKGAVK----------LITKR-DFFL---ETDRDPSTKTTALYSDKHYV 253 (274) T ss_pred ceecCeEEEEeCCCC-------C--ceEEEEecccee----------eeecC-Cccc---ccccccccccCEEEEeEEEE Confidence 12234444322211 0 012222221110 00000 0000 111111111111 1124579 Q ss_pred eEeeccchheeecCC Q lcl|NC_017674. 368 ALCKRPWAVVRYLGI 382 (382) Q Consensus 368 v~i~~P~aia~~~GI 382 (382) +-+.+|..++.+.== T Consensus 254 ~~~~~~~~~v~~tk~ 268 (274) T protein:vir:96 254 AYLYDESKAVKITKG 268 (274) T ss_pred EEEEcCCcEEEEEcC Confidence 999999887776633 No 102 >protein:vir:8843 Length: 317 # NCBI annotation: major head protein # Family: family:all:3919 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775251;genbank:gi:27476049;genbank:GeneID:2700597 Probab=70.33 E-value=0.21 Score=24.29 Aligned_cols=280 Identities=10% Similarity=-0.073 Sum_probs=122.5 Q ss_pred hhhcccccCcccccchhHHHHHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEEeeecccceeec-ccccCCce Q lcl|NC_017674. 61 SAMDSNFTAPVTTPSIPTPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEPAGTAVEY-GDHTNIPL 139 (382) Q Consensus 61 ~amDa~~~~~~t~~~~~~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e~~G~a~~y-gd~~DiP~ 139 (382) ||.=|. +-.|..+.+.-..+.. .|+.+--..+-..+++.- +.-....+.|..-+....+... ..++|-|. T Consensus 1 ma~~~~--~~~t~~~~g~~~dl~~----~I~~isp~dTPf~S~i~~---~~a~~~~~~W~~d~l~~~~~~~~~EG~da~~ 71 (317) T protein:vir:88 1 MATPTN--AVSTVEINGKREDLID----IIYNIAPYDTPFMSAIGK---GVATAITHEWQTDELRQPGKNTRVEGEDATI 71 (317) T ss_pred CCcccc--ceEeeeeeeeeechhh----hheecCCccCcceeeecC---ceecccEEEEEeeecCCccccccccCccccc Confidence 111111 1112222222111111 333332222333334442 2233456666655444333211 12223222 Q ss_pred eeeee---eeeEeeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeecc------CCcccceEE Q lcl|NC_017674. 140 TSWNA---NFERRTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQS------GLGNRTYGF 210 (382) Q Consensus 140 vd~~~---~~~~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~------g~~~g~~Gl 210 (382) ..... ..-..+|++=...+.++.+....+.. -+.-+....-+...+.+.++...++|.++ .....+-|+ T Consensus 72 ~~~~~r~~~~N~tQIf~k~v~VSgTa~av~~~G~--~~ela~q~~kk~~EikrdmE~~li~g~~a~~~~~~t~~r~~~Gl 149 (317) T protein:vir:88 72 KAGSFTTMLNNYCQISDETLQVTGTADRVKKAGR--KNELAYQLAKKSKELKLDMEYALVGAPQAKVQRNTTTPGQMANI 149 (317) T ss_pred ccccCCEEeccEEEEEEeEEEEeehhhhhhhcCc--cchhHHHHHHHHHHHHHHHHHHHhcCeeeccCCCCccchhhhhH Confidence 22111 11123588888888888877654432 34333333333344444455555555432 112455566 Q ss_pred EeC--CC-Cc-----ceeccCCCCccccCHHHH-HHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhccccCC--- Q lcl|NC_017674. 211 LND--PN-LP-----AFQTPPSQGWSTADWAGI-IGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSVTTP--- 278 (382) Q Consensus 211 lN~--P~-l~-----~~~~~a~~~Wa~kT~~eI-~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~t~~--- 278 (382) ++. ++ +- ......+..|-+.|+..+ -+||++++.++|...+ .|..+.+++.+...|+.-.. T Consensus 150 ~~~i~t~~~~~~~g~~~~~~~~~~~t~~t~~~lte~~l~~~l~~i~~~Gg-------~~~~i~v~a~~k~~i~~~~~~~~ 222 (317) T protein:vir:88 150 FAYYKTNGSLGANGVAPVGDGSNTGTAGDLRLLTEDMLLNASESIWRNGG-------QANSIQTSSSIKKAISKNMKGRA 222 (317) T ss_pred HHHhccCceeccCccccccCCCccccccccccccHHHHHHHHHHHHhcCC-------CCCEEEeChHHHHHHHHHhcCCc Confidence 553 11 10 001112234544444444 4558899999999664 24568899988777763211 Q ss_pred -----------CCccHHHHHHHhcCccEEEEccccccccCCCCCceeEEEEcchhhhhhhccccccchhhhhhhhhhhhc Q lcl|NC_017674. 279 -----------YGISVSDWIEQTYPKMRIVSAPELSGVQMKAQEPEDALVLFVEDVNAAVDGSTDGGSVFSQLVQSKFIT 347 (382) Q Consensus 279 -----------~~~Tvl~~l~~n~pnl~i~~~peL~~a~g~g~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~p~~~~~ 347 (382) .+.+|-.|. .+|-.++|+.-+-+.. ..++.+..+.+... + .-|.+... T Consensus 223 ~~i~~~~~~~~~g~~v~~~~-tdfG~v~ii~~r~lp~--------~~~~~~D~~~~~l~----------~--Lr~~~~e~ 281 (317) T protein:vir:88 223 TEITLDASDNRIAQTVDVYE-SDFGKYTIRANRWFHE--------NTLFVFDPKMHSLC----------Y--LRPFFQHE 281 (317) T ss_pred eeEEEcccCeEEEEEEEEEE-eCCeEEEEEeCCCCCC--------CeEEEEccccccee----------e--cccceeec Confidence 111111111 1233344444444421 11232332222110 1 01111111 Q ss_pred ccceecCCceEeccccceeeeEeeccchheeecCC Q lcl|NC_017674. 348 LGVEKRAKSYVEDFSNGTAGALCKRPWAVVRYLGI 382 (382) Q Consensus 348 l~~~~~~~~~~~~~~~~t~Gv~i~~P~aia~~~GI 382 (382) | .+....+--....=+|++++-|.|.+...|+ T Consensus 282 l---aKtGd~~k~~i~~E~tLe~~N~~a~a~i~~l 313 (317) T protein:vir:88 282 L---AKTGDSEKRQLLVEYTFRVNNEKSGALIRDV 313 (317) T ss_pred c---CCCcccceeEEEEEEEEEEcCccceeEEEEe Confidence 1 2233333344445679999999999999999 No 103 >protein:vir:4511 Length: 409 # NCBI annotation: capsid # Family: family:all:21 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599037;genbank:gi:19548995;genbank:GeneID:935211 Probab=68.01 E-value=0.24 Score=23.95 Aligned_cols=331 Identities=10% Similarity=0.026 Sum_probs=134.5 Q ss_pred CCCcceeeeec--------------C---cccccccccccc---ch---HHHHHHhhcceeccccchhhhhhhhcccccc Q lcl|NC_017674. 1 MSQISKTHSRL--------------A---GRNAKPFDLKNI---TN---DAVASLSRIGLVFDHAVVQDQIKALAKAGAF 57 (382) Q Consensus 1 ~~~~~~~~~~~--------------~---~~~~~~~~~~~~---~~---~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~ 57 (382) +.++......+ . .......+..+- +. ..+.+.-+-|.. .+...... T Consensus 41 ~~e~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~l~~~~~-----------~~~~~e~~ 109 (409) T protein:vir:45 41 KSELEALDERIAREEELRRQDQAYIESNEEEQRQNLDPENNSQQDEKRAQVFDKWMRHGAS-----------ELTSEERK 109 (409) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhcccCCCCCcchhhHHHHHHHHHHHHhhhh-----------hccHHHHH Confidence 00000000000 0 000001111110 00 112211111111 11000000 Q ss_pred hhhhhhcccccCcccccchh--HHHHHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEEeeeccc-ceeecccc Q lcl|NC_017674. 58 RSGSAMDSNFTAPVTTPSIP--TPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEPAG-TAVEYGDH 134 (382) Q Consensus 58 ~~~~amDa~~~~~~t~~~~~--~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e~~G-~a~~ygd~ 134 (382) ......+. +..+.++.| +|..+.. +|++.+......+.+..+.+... .....+...+..+ .+.+.+.. T Consensus 110 -~~~~~~a~--~~~~~~~gg~liP~~~~~----~ii~~~~~~~~l~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~v~E~ 180 (409) T protein:vir:45 110 -ALRELRAQ--GVAQDEKGGYTVPETFLA----KVVEKMKSYGGIASVAQILTTSD--GRTMEWATADGTSEVGVLLGEN 180 (409) T ss_pred -HHHHHhhc--cCccCcCCceeccHhHHH----HHHHHHHhhhhhhhhceeeecCC--CceEEEEeeccCcccccccccc Confidence 00111111 112222223 5555544 56666554444444443332222 1233444444433 34566777 Q ss_pred cCCceeeeeeeeeEeeEEEEE-EEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeC Q lcl|NC_017674. 135 TNIPLTSWNANFERRTIVRGE-LGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLND 213 (382) Q Consensus 135 ~DiP~vd~~~~~~~~~v~~~~-~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~ 213 (382) ..+|..+.......-..+... ..+.+|.+=+.- ...++.+.-......++...+|+-.++|+..+..+...|+++. T Consensus 181 ~~~~~~~~~f~~~~l~~~k~~~~~i~is~ell~d---s~~~l~~~i~~~la~a~~~~~~~a~l~G~G~~~~~~p~Gil~~ 257 (409) T protein:vir:45 181 EEAGEEDTDFGMGSLGALKMTSKIIRVSNELLQD---SAIDMEAYLARRIAERIGRGEARYLIQGTGAGTPKQPKGLAAS 257 (409) T ss_pred ccccccccccceeeeeeeeeeeeehhhhHHHHhc---cHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCccccceeeec Confidence 778887776666554444443 334566644432 3467888888888889999999999999865544567899997 Q ss_pred CCCcceeccCCCCccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhccc-cCCCCccHHH-HHHHh- Q lcl|NC_017674. 214 PNLPAFQTPPSQGWSTADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSV-TTPYGISVSD-WIEQT- 290 (382) Q Consensus 214 P~l~~~~~~a~~~Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~-t~~~~~Tvl~-~l~~n- 290 (382) +.....+..+ ..-| ++||.+++..|-..-.. + .-..+++.+..+..|.. .+..|.-++. -+... T Consensus 258 ~~~~~~~~~~----~~~~----~d~i~~l~~~l~~~~~~----~-a~~~~~~n~~~~~~l~~lkd~~G~~i~~~~~~~~~ 324 (409) T protein:vir:45 258 VTGTTQTAAA----NAVK----WQEILALKHSIDPAYRR----G-PKFRLAFNDNTLKLISEMEDGQGRPLWLPDIVGVA 324 (409) T ss_pred cccccccccc----cccc----hHHHHHHHHhhhhhhcc----C-CeEEEEECHHHHHHHHHhhcCCCceeeccCcCCCC Confidence 7633221111 1123 45566666665332111 1 11135677777777743 2333433321 00001 Q ss_pred ---cCccEEEEccccccccCCCCCceeEEEEcchhhhhhhccccccchhhhhhhhhhhhcccceecCCceEeccccceee Q lcl|NC_017674. 291 ---YPKMRIVSAPELSGVQMKAQEPEDALVLFVEDVNAAVDGSTDGGSVFSQLVQSKFITLGVEKRAKSYVEDFSNGTAG 367 (382) Q Consensus 291 ---~pnl~i~~~peL~~a~g~g~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~~~~~~~~~~~t~G 367 (382) .-+..++....+... +++...+++-.-.+.. .+ ..+...+ +....+|. ......+-+..|. | T Consensus 325 ~~~l~G~PV~~~~~~p~~---~~~~~~i~~Gd~~~~~---i~-~~~~~~~-~~~~d~~~------~~~~~~~~~~~r~-d 389 (409) T protein:vir:45 325 PASVLNVPYVIDQEIDDI---GAGKKFMFCGDFDRFI---IR-RVRYMIL-KRLVERYA------EYDQTGFLAFHRF-D 389 (409) T ss_pred CceecceeeEEecCcCCc---cCCccEEEEeehhhhh---ee-eccceEE-EEeecccc------cCCcEEEEEEEEe-c Confidence 111222222222111 1111112221111110 00 0000000 00011110 1122233444555 4 Q ss_pred eEeeccchheeecCC Q lcl|NC_017674. 368 ALCKRPWAVVRYLGI 382 (382) Q Consensus 368 v~i~~P~aia~~~GI 382 (382) +.+..|.||+.+.+= T Consensus 390 ~~~~~~~A~~~l~~k 404 (409) T protein:vir:45 390 CILEDTSAIKALVGK 404 (409) T ss_pred cEeechhheEEEEec Confidence 458889998877765 No 104 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=67.75 E-value=0.17 Score=24.85 Aligned_cols=306 Identities=8% Similarity=-0.055 Sum_probs=124.1 Q ss_pred CCCcceeee----ecCccccc-----cccccccchHHHHHHhhcceeccccchhhhhhhhcccccchhhhhhcccccCcc Q lcl|NC_017674. 1 MSQISKTHS----RLAGRNAK-----PFDLKNITNDAVASLSRIGLVFDHAVVQDQIKALAKAGAFRSGSAMDSNFTAPV 71 (382) Q Consensus 1 ~~~~~~~~~----~~~~~~~~-----~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~amDa~~~~~~ 71 (382) +........ .-..+..+ ..+...-....++...+....+. ...........++. .+. T Consensus 72 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------~~~~~~~~~~~~~~----~~~ 137 (400) T protein:vir:38 72 LKGNEQSSGKKPDHPEEHSYRDALNAYLHTRGRNTDGVNFEKTDVGTFA----------VLRAVPTDASDAVN----AGV 137 (400) T ss_pred HHHHhhcccccccchhhhhHHHHHHHHHhhHHHHHHHHHHHHHHHHHHh----------hhhhhhHHHHHHHh----hcc Confidence 000000000 00000000 00000000000000000000000 00000000001111 122 Q ss_pred cccchh--HHHHHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEEeee-cccceeecccccCCce-eeeeeeee Q lcl|NC_017674. 72 TTPSIP--TPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVE-PAGTAVEYGDHTNIPL-TSWNANFE 147 (382) Q Consensus 72 t~~~~~--~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e-~~G~a~~ygd~~DiP~-vd~~~~~~ 147 (382) +.++.| +|..+ .+.|++.+......+.++++.+.+. .+..|++.. ..|.+...+.....|- .+...+.. T Consensus 138 ~~~~gg~~vP~~~----~~~ii~~~~~~~~l~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~i 210 (400) T protein:vir:38 138 KAADAASTIPETI----SNTPQRELQTVVDLKPFTNVFQAST---QKGTYPTVANATTKMVTVAELEKNPAMAKPEFKPV 210 (400) T ss_pred cccCCcccccHHH----HHHHHHHHHhhhhhhhcceeEeccC---cceEEEEEecCCCccccccccccccccccccceee Confidence 333333 44433 3466666655555666665543332 355677765 4566777887777774 56677788 Q ss_pred EeeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCcceeccCCCCc Q lcl|NC_017674. 148 RRTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAFQTPPSQGW 227 (382) Q Consensus 148 ~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~~~~a~~~W 227 (382) .-.++.++..+.+|.+=| .....++.+.-......++...+|.-.++|...+ ++.+ T Consensus 211 ~~~~~k~~~~~~is~ell---~ds~~~~~~~i~~~l~~~~~~~~~~~i~~~~~~~------------------~~~~--- 266 (400) T protein:vir:38 211 NWSVETYRQALPVSQESI---DDSAIDLVGLIAQNGQQIKVNTTNGAVATLLKGF------------------TAKT--- 266 (400) T ss_pred EeehhheeeehhhHHHHH---hhhHHHHHHHHHHHHHHHHHHHHHHhhhhccccc------------------cccc--- Confidence 888888888888887433 2334567777777788888888888888775210 0000 Q ss_pred cccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhccc-cCCCCccHHHH-HHHhcC----ccEEEEccc Q lcl|NC_017674. 228 STADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSV-TTPYGISVSDW-IEQTYP----KMRIVSAPE 301 (382) Q Consensus 228 a~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~-t~~~~~Tvl~~-l~~n~p----nl~i~~~pe 301 (382) ..|. +||.+++........ .-.++|.|+.+..|.. .+..|.-++.- +...-| +..++.... T Consensus 267 -~~~~----~~~~~~~~~~~~~~~--------~a~~v~~~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~l~G~pv~~~~~ 333 (400) T protein:vir:38 267 -ISSV----DDLKHINNVDLDPAY--------SRVIIASQSFYNFLDTVKDGNGRYLLQDSILTPSGKSVLGMPIAVVSD 333 (400) T ss_pred -cccH----HHHHHHHHhhhhhhh--------CcEEEEcHHHHHHHHHhhccCCCeeeecCcCCCCccccccceeEEecc Confidence 1233 344444432221111 1267899988888864 33334433210 111101 112222221 Q ss_pred cccccCCCCCceeEEEEc-chhhhhhhccccccchhhhhhhhhhhhcccceecCCceEeccccceeeeEeeccchheeec Q lcl|NC_017674. 302 LSGVQMKAQEPEDALVLF-VEDVNAAVDGSTDGGSVFSQLVQSKFITLGVEKRAKSYVEDFSNGTAGALCKRPWAVVRYL 380 (382) Q Consensus 302 L~~a~g~g~~~~~~~~~~-~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~~~~~~~~~~~t~Gv~i~~P~aia~~~ 380 (382) .- ..+. +...+++-. .+-+.. .+. +.+..++..+ ... .....+..|.+|. +..|.+|+.+. T Consensus 334 ~~-~~~~--g~~~~~~gd~s~~~~~-~~~---------~~~~~~~~~~--~~~--~~~~~~~~r~d~~-~~~~~a~~~l~ 395 (400) T protein:vir:38 334 DT-LGAA--GEAHAFLGDIKRAILF-ANR---------ADFMVRWVDD--QIY--GQFLQAGMRFGVS-VADEKAGYFLT 395 (400) T ss_pred cc-cCCC--CceEEEEEeccccEEE-Eee---------cceEEEEecc--ccc--ceeEEEEEEeccE-EecccceEEEE Confidence 11 1111 111122111 110000 000 0001010000 000 1123344555444 44588888866 Q ss_pred CC Q lcl|NC_017674. 381 GI 382 (382) Q Consensus 381 GI 382 (382) .- T Consensus 396 ~~ 397 (400) T protein:vir:38 396 YT 397 (400) T ss_pred ee Confidence 66 No 105 >protein:vir:4159 Length: 315 # NCBI annotation: structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046968;genbank:gi:9630538;genbank:GeneID:1261712 Probab=65.32 E-value=0.29 Score=23.57 Aligned_cols=298 Identities=11% Similarity=0.007 Sum_probs=127.6 Q ss_pred cccccccchHHHHHHhhcceeccccchhhhhhhhcccccchhhhhhcccccCcccccchh--HHHHHHhhhhhhheeccc Q lcl|NC_017674. 18 PFDLKNITNDAVASLSRIGLVFDHAVVQDQIKALAKAGAFRSGSAMDSNFTAPVTTPSIP--TPIQFLQTWLPGFVKVMT 95 (382) Q Consensus 18 ~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~amDa~~~~~~t~~~~~--~~~~~l~~idp~v~~~~~ 95 (382) -|-..+|- ........-++... +.+.| .|..+-+ +++.+. T Consensus 1 ~~~~~~~~---------------------------~~~~~~~~k~~t~~------d~~Gg~l~P~~~~~-----~i~~~~ 42 (315) T protein:vir:41 1 MLTIEDIR---------------------------GGKPFEIVPKIDVP------DLGRGVLSVDRFGE-----FVKAVR 42 (315) T ss_pred Ccccchhh---------------------------cCChhhhhhhcCCc------CCCCceechHHHHH-----HHHHHH Confidence 00111111 10111111112211 11112 2333222 222222 Q ss_pred cccchhhhCcccc-CCCccee--eEEEEeeecccceeecccccCCceeeeeeeeeEeeEEEEEEEEEecHHHHHHHHHhC Q lcl|NC_017674. 96 AARKIDEIIGIDT-VGSWEDQ--EIVQGIVEPAGTAVEYGDHTNIPLTSWNANFERRTIVRGELGMMVGTLEEGRASAIR 172 (382) Q Consensus 96 ~~~~~~~l~~v~t-~g~~~~~--t~t~~v~e~~G~a~~ygd~~DiP~vd~~~~~~~~~v~~~~~g~~y~~~El~~A~~~g 172 (382) ..--.+.+.-+.+ .+...-+ ..-+...-..| ....|+..+.|..+..........+.+..-...+.+.|+ -..-+ T Consensus 43 e~s~~l~~~~vi~~~~~~~~~i~~~g~~~~~~~g-~~~~~~~~~~~~~~~~f~~~~l~~~~l~~~~~it~elL~-D~~~~ 120 (315) T protein:vir:41 43 DSAVIIPEARIDNALKSYEKDISRLSLVLDVGPG-RDETGQKLAPPESTAEVKTNTLYMREMVTKVVIHEDAIE-DNIEG 120 (315) T ss_pred hhhhhhhhceeeeccccccccccccccCcccccc-cccccCcCCCCCCccccceeeeceeeeeeeccccHHHHH-hhhcc Confidence 2222233333221 1111110 00000000011 123344444555555555555555666655566665555 34457 Q ss_pred CChHHHHHHHHHHHHHHhhccEEEEeeccCC---cccceEEEeCCCCcceeccCCCCccc-cCHHHHHHHHHHHHHHHHH Q lcl|NC_017674. 173 LNSAETKRQQAAIGLEIFRNAIGFYGWQSGL---GNRTYGFLNDPNLPAFQTPPSQGWST-ADWAGIIGDIREAVRQLRI 248 (382) Q Consensus 173 ~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~---~~g~~GllN~P~l~~~~~~a~~~Wa~-kT~~eI~~Di~~~~~~l~~ 248 (382) .++.+.-......++...++...|.||.... .+.-.|+|+.....+.. ....++. ..+.+.+.|+...+..-.. T Consensus 121 ~~~e~~l~~~~a~~~a~~~~~~~~nGdg~s~~p~~~~~~G~l~~a~~~~~~--~~~~~~a~~~~~d~l~~l~~sl~~~yr 198 (315) T protein:vir:41 121 KAFEQKIVTLLGEGISYVLEKYYLHGDTSSSDPLLRMSDGWLKLASEKLTE--SDVDPEAEDWPMNLFDTMIESLPTPYR 198 (315) T ss_pred ccHHHHHHHHHHHHHHHHHHHHhhccCCcCcCccccccccceecccccccc--cccccccccccHHHHHHHHHhcChHHh Confidence 8999999999999999999999999974210 12335888876543322 1223332 2244445555444433232 Q ss_pred hcCCeeeeccccceEecCHHHHhhccc-cCCCCccHHHHHHHh-----cCccEEEEccccccccCCCCCceeEEEEcchh Q lcl|NC_017674. 249 QSQDQIDPKAEKITLALATSKVDYLSV-TTPYGISVSDWIEQT-----YPKMRIVSAPELSGVQMKAQEPEDALVLFVED 322 (382) Q Consensus 249 ~t~g~~~~~~~p~~L~Lp~~~~~~Ls~-t~~~~~Tvl~~l~~n-----~pnl~i~~~peL~~a~g~g~~~~~~~~~~~~~ 322 (382) ... ...+++|..+.+..+.+ .+..|.-+++=.... +-+..++.+|.+...+. +...+++-.-++ T Consensus 199 ~~~-------~~~~~imn~~t~~~~rklk~~~g~~lw~~~~~~g~~~tl~G~PV~~~~~m~~~~~---~~~~ilf~d~~n 268 (315) T protein:vir:41 199 NNL-------PNMKFYVTWDIYRAYRDALKGRETGLGDQALTGANSILYDGRPVQYVPALEALND---GKSRALFVVPTQ 268 (315) T ss_pred hcC-------CceEEEEcHHHHHHHHHHhccCCCccccchhhcCCCceecccceEecccccccCC---CCccEEEecccc Confidence 221 12357788877665532 122222223211111 11334666666643321 111222211111 Q ss_pred hhhhhccccccchhhhhhhhhhhhccccee--cCCceEeccccceeeeEeeccchheeecCC Q lcl|NC_017674. 323 VNAAVDGSTDGGSVFSQLVQSKFITLGVEK--RAKSYVEDFSNGTAGALCKRPWAVVRYLGI 382 (382) Q Consensus 323 v~~~~~~~~~~~~~~~~~~p~~~~~l~~~~--~~~~~~~~~~~~t~Gv~i~~P~aia~~~GI 382 (382) . +. .++...+..+ ++ +...+..-.+.|.+|-.+-...+++....| T Consensus 269 l---~~-----------~~~~~i~i~~-~~~a~~~~~~~~~~~r~d~~~~~~~~~a~~~~~v 315 (315) T protein:vir:41 269 L---VY-----------GFWRNIKVVP-DYDAEMRLTKYVASLRTDNHYEDEEGAVSATITV 315 (315) T ss_pred e---EE-----------EeccccEEEe-eecCCCCceEEEEEEEeceeEEeccceeEeeeeC Confidence 1 00 1111111111 11 122344444567777666688889999999 No 106 >protein:vir:94424 Length: 387 # NCBI annotation: ORF010 # Family: family:all:658 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240005;genbank:gi:66395666;genbank:GeneID:5133084 Probab=63.69 E-value=0.18 Score=24.62 Aligned_cols=311 Identities=10% Similarity=0.005 Sum_probs=124.2 Q ss_pred CCCcceeeeecCccc-------cccccccccc-hH---HHHHHhhcceeccccchhhhhhhhcccccchhhhhhcccccC Q lcl|NC_017674. 1 MSQISKTHSRLAGRN-------AKPFDLKNIT-ND---AVASLSRIGLVFDHAVVQDQIKALAKAGAFRSGSAMDSNFTA 69 (382) Q Consensus 1 ~~~~~~~~~~~~~~~-------~~~~~~~~~~-~~---~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~amDa~~~~ 69 (382) ...+-++|-.+-... -.+.. .+.. .. ++...-|.+..-. ........ ......+- . T Consensus 53 ~~~l~~~~~~~e~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~r~~~~~~-----~~~~~~~~-----~~~~~~a~-~- 119 (387) T protein:vir:94 53 FNIVERQVQDIEEKEKAKVKDKGEAYQ-SLSDNEKMVKAKAEFYRHAILPN-----EFEKPSME-----AQRLLHAL-P- 119 (387) T ss_pred HHHHHHHHHHHHHHHHhhhhhccccCC-CCchhHHHHHHHHHHHHHHHhhh-----hHHHHHHH-----HHHHHhhh-c- Confidence 000000000000000 00000 0000 00 0111001000000 00000000 00000110 0 Q ss_pred cccccchh--HHHHHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEEeee-cccceeecccccCCceeeeeeee Q lcl|NC_017674. 70 PVTTPSIP--TPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVE-PAGTAVEYGDHTNIPLTSWNANF 146 (382) Q Consensus 70 ~~t~~~~~--~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e-~~G~a~~ygd~~DiP~vd~~~~~ 146 (382) ..+..+.| +|-.+ ..+|++.+...-..+.+..+.+.+.. +++.++ ..+.+.+.+.+...|..+...++ T Consensus 120 ~~~~~~gG~lIP~~~----~~~Ii~~~~~~~~l~~~~~~~~~~~~-----~~p~~~~~~~~a~~v~Eg~~~~~~~~~f~~ 190 (387) T protein:vir:94 120 TGNDSGGDKLLPKTL----SKEIVSEPFAKNQLREKARLTNIKGL-----EIPRVSYTLDDDDFITDVETAKELKAKGDT 190 (387) T ss_pred cCCCCCCceeechhH----HHHHHHHHHhhchhhhhceeeecCCc-----eeeeeeccCCccccccccccccccccccce Confidence 11112223 55444 34666666555556666666555542 223333 34556677888888888888888 Q ss_pred eEeeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCcceeccCCCC Q lcl|NC_017674. 147 ERRTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAFQTPPSQG 226 (382) Q Consensus 147 ~~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~~~~a~~~ 226 (382) ..-..+.++..+.+|.+=|. ....++.+--.....+++...++..+|.+ .+|.. .-.|.++++.+...+ T Consensus 191 v~l~~~k~~~~i~iS~ell~---ds~~~l~~~i~~~la~~~~~~e~~~~~~~-g~g~g-~~~g~~~~~~~~~~~------ 259 (387) T protein:vir:94 191 VKFTTNKFKVFAAISDTVIH---GSDVDLVNWVENALQSGLAAKERKDALAV-SPKSG-LEHMSFYNGSVKEVE------ 259 (387) T ss_pred eeechheeeeechhhHHHHh---hhHHHHHHHHHHHHHHHHHHHHHHhHhhc-CCCcc-ccceeeecccccccc------ Confidence 88899999998899865343 33556676666666666666666655533 22221 224777766554221 Q ss_pred ccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHH-HhhccccCCCCccHHHHHHHhcCccEEEEccccccc Q lcl|NC_017674. 227 WSTADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSK-VDYLSVTTPYGISVSDWIEQTYPKMRIVSAPELSGV 305 (382) Q Consensus 227 Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~-~~~Ls~t~~~~~Tvl~~l~~n~pnl~i~~~peL~~a 305 (382) .+..++||.+++.+|...-. .+. ..+|-+.. ...+....+.|..++. .-|+ ++...|=.-.. T Consensus 260 -----~~~~~d~i~~~~~~l~~~y~----~na---~~imn~~t~~~~~~~~~~~~~~~~~----~~~~-~llG~PV~~~~ 322 (387) T protein:vir:94 260 -----GADMYDAIINALADLHEDYR----DNA---TIYMRYADYVKIISVLSNGTTNFFD----TPAE-KVFGKPVVFTD 322 (387) T ss_pred -----ccchHHHHHHHHhccChhhh----cCC---EEEEechHHHHHHHHHhcCCCcccc----cCCc-cccccceEEec Confidence 11236677777776644221 121 34554443 3334333333332221 1121 11111111000 Q ss_pred cCCCCCceeEEE-EcchhhhhhhccccccchhhhhhhhhhhhcccceecCCceEeccccceeeeEeeccchheeecCC Q lcl|NC_017674. 306 QMKAQEPEDALV-LFVEDVNAAVDGSTDGGSVFSQLVQSKFITLGVEKRAKSYVEDFSNGTAGALCKRPWAVVRYLGI 382 (382) Q Consensus 306 ~g~g~~~~~~~~-~~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~~~~~~~~~~~t~Gv~i~~P~aia~~~GI 382 (382) + ...+++ -|..-+. .+. ++-+.... +.......+-+..|..|.+ ++|.||+.+.== T Consensus 323 ~-----~~~~~~GDf~~~~~-----------~~~---~~~~~~~~-~~~~~~~~~~~~~r~Dg~v-~~~~A~~~l~~k 379 (387) T protein:vir:94 323 A-----AVKPIVGDFNYFGI-----------NYD---GTTYDTDK-DVKKGEYLFVLTAWYDQQR-TLDSAFRIAKAK 379 (387) T ss_pred C-----CCceeeechhhhhh-----------hhh---hhhheecc-cccCCceEEEEEEEeCcEe-echhheEEEEee Confidence 0 001111 1111000 000 01010000 0112233444555655554 569998875432 No 107 >protein:vir:96978 Length: 387 # NCBI annotation: ORF009 # Family: family:all:658 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239859;genbank:gi:66395517;genbank:GeneID:5133011 Probab=63.69 E-value=0.18 Score=24.62 Aligned_cols=311 Identities=10% Similarity=0.005 Sum_probs=124.2 Q ss_pred CCCcceeeeecCccc-------cccccccccc-hH---HHHHHhhcceeccccchhhhhhhhcccccchhhhhhcccccC Q lcl|NC_017674. 1 MSQISKTHSRLAGRN-------AKPFDLKNIT-ND---AVASLSRIGLVFDHAVVQDQIKALAKAGAFRSGSAMDSNFTA 69 (382) Q Consensus 1 ~~~~~~~~~~~~~~~-------~~~~~~~~~~-~~---~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~amDa~~~~ 69 (382) ...+-++|-.+-... -.+.. .+.. .. ++...-|.+..-. ........ ......+- . T Consensus 53 ~~~l~~~~~~~e~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~r~~~~~~-----~~~~~~~~-----~~~~~~a~-~- 119 (387) T protein:vir:96 53 FNIVERQVQDIEEKEKAKVKDKGEAYQ-SLSDNEKMVKAKAEFYRHAILPN-----EFEKPSME-----AQRLLHAL-P- 119 (387) T ss_pred HHHHHHHHHHHHHHHHhhhhhccccCC-CCchhHHHHHHHHHHHHHHHhhh-----hHHHHHHH-----HHHHHhhh-c- Confidence 000000000000000 00000 0000 00 0111001000000 00000000 00000110 0 Q ss_pred cccccchh--HHHHHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEEeee-cccceeecccccCCceeeeeeee Q lcl|NC_017674. 70 PVTTPSIP--TPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVE-PAGTAVEYGDHTNIPLTSWNANF 146 (382) Q Consensus 70 ~~t~~~~~--~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e-~~G~a~~ygd~~DiP~vd~~~~~ 146 (382) ..+..+.| +|-.+ ..+|++.+...-..+.+..+.+.+.. +++.++ ..+.+.+.+.+...|..+...++ T Consensus 120 ~~~~~~gG~lIP~~~----~~~Ii~~~~~~~~l~~~~~~~~~~~~-----~~p~~~~~~~~a~~v~Eg~~~~~~~~~f~~ 190 (387) T protein:vir:96 120 TGNDSGGDKLLPKTL----SKEIVSEPFAKNQLREKARLTNIKGL-----EIPRVSYTLDDDDFITDVETAKELKAKGDT 190 (387) T ss_pred cCCCCCCceeechhH----HHHHHHHHHhhchhhhhceeeecCCc-----eeeeeeccCCccccccccccccccccccce Confidence 11112223 55444 34666666555556666666555542 223333 34556677888888888888888 Q ss_pred eEeeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCcceeccCCCC Q lcl|NC_017674. 147 ERRTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAFQTPPSQG 226 (382) Q Consensus 147 ~~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~~~~a~~~ 226 (382) ..-..+.++..+.+|.+=|. ....++.+--.....+++...++..+|.+ .+|.. .-.|.++++.+...+ T Consensus 191 v~l~~~k~~~~i~iS~ell~---ds~~~l~~~i~~~la~~~~~~e~~~~~~~-g~g~g-~~~g~~~~~~~~~~~------ 259 (387) T protein:vir:96 191 VKFTTNKFKVFAAISDTVIH---GSDVDLVNWVENALQSGLAAKERKDALAV-SPKSG-LEHMSFYNGSVKEVE------ 259 (387) T ss_pred eeechheeeeechhhHHHHh---hhHHHHHHHHHHHHHHHHHHHHHHhHhhc-CCCcc-ccceeeecccccccc------ Confidence 88899999998899865343 33556676666666666666666655533 22221 224777766554221 Q ss_pred ccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHH-HhhccccCCCCccHHHHHHHhcCccEEEEccccccc Q lcl|NC_017674. 227 WSTADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSK-VDYLSVTTPYGISVSDWIEQTYPKMRIVSAPELSGV 305 (382) Q Consensus 227 Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~-~~~Ls~t~~~~~Tvl~~l~~n~pnl~i~~~peL~~a 305 (382) .+..++||.+++.+|...-. .+. ..+|-+.. ...+....+.|..++. .-|+ ++...|=.-.. T Consensus 260 -----~~~~~d~i~~~~~~l~~~y~----~na---~~imn~~t~~~~~~~~~~~~~~~~~----~~~~-~llG~PV~~~~ 322 (387) T protein:vir:96 260 -----GADMYDAIINALADLHEDYR----DNA---TIYMRYADYVKIISVLSNGTTNFFD----TPAE-KVFGKPVVFTD 322 (387) T ss_pred -----ccchHHHHHHHHhccChhhh----cCC---EEEEechHHHHHHHHHhcCCCcccc----cCCc-cccccceEEec Confidence 11236677777776644221 121 34554443 3334333333332221 1121 11111111000 Q ss_pred cCCCCCceeEEE-EcchhhhhhhccccccchhhhhhhhhhhhcccceecCCceEeccccceeeeEeeccchheeecCC Q lcl|NC_017674. 306 QMKAQEPEDALV-LFVEDVNAAVDGSTDGGSVFSQLVQSKFITLGVEKRAKSYVEDFSNGTAGALCKRPWAVVRYLGI 382 (382) Q Consensus 306 ~g~g~~~~~~~~-~~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~~~~~~~~~~~t~Gv~i~~P~aia~~~GI 382 (382) + ...+++ -|..-+. .+. ++-+.... +.......+-+..|..|.+ ++|.||+.+.== T Consensus 323 ~-----~~~~~~GDf~~~~~-----------~~~---~~~~~~~~-~~~~~~~~~~~~~r~Dg~v-~~~~A~~~l~~k 379 (387) T protein:vir:96 323 A-----AVKPIVGDFNYFGI-----------NYD---GTTYDTDK-DVKKGEYLFVLTAWYDQQR-TLDSAFRIAKAK 379 (387) T ss_pred C-----CCceeeechhhhhh-----------hhh---hhhheecc-cccCCceEEEEEEEeCcEe-echhheEEEEee Confidence 0 001111 1111000 000 01010000 0112233444555655554 569998875432 No 108 >protein:vir:2685 Length: 387 # NCBI annotation: hypothetical protein # Family: family:all:658 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075504;genbank:gi:12719433;genbank:GeneID:920169 Probab=63.69 E-value=0.18 Score=24.62 Aligned_cols=311 Identities=10% Similarity=0.005 Sum_probs=124.2 Q ss_pred CCCcceeeeecCccc-------cccccccccc-hH---HHHHHhhcceeccccchhhhhhhhcccccchhhhhhcccccC Q lcl|NC_017674. 1 MSQISKTHSRLAGRN-------AKPFDLKNIT-ND---AVASLSRIGLVFDHAVVQDQIKALAKAGAFRSGSAMDSNFTA 69 (382) Q Consensus 1 ~~~~~~~~~~~~~~~-------~~~~~~~~~~-~~---~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~amDa~~~~ 69 (382) ...+-++|-.+-... -.+.. .+.. .. ++...-|.+..-. ........ ......+- . T Consensus 53 ~~~l~~~~~~~e~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~r~~~~~~-----~~~~~~~~-----~~~~~~a~-~- 119 (387) T protein:vir:26 53 FNIVERQVQDIEEKEKAKVKDKGEAYQ-SLSDNEKMVKAKAEFYRHAILPN-----EFEKPSME-----AQRLLHAL-P- 119 (387) T ss_pred HHHHHHHHHHHHHHHHhhhhhccccCC-CCchhHHHHHHHHHHHHHHHhhh-----hHHHHHHH-----HHHHHhhh-c- Confidence 000000000000000 00000 0000 00 0111001000000 00000000 00000110 0 Q ss_pred cccccchh--HHHHHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEEeee-cccceeecccccCCceeeeeeee Q lcl|NC_017674. 70 PVTTPSIP--TPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVE-PAGTAVEYGDHTNIPLTSWNANF 146 (382) Q Consensus 70 ~~t~~~~~--~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e-~~G~a~~ygd~~DiP~vd~~~~~ 146 (382) ..+..+.| +|-.+ ..+|++.+...-..+.+..+.+.+.. +++.++ ..+.+.+.+.+...|..+...++ T Consensus 120 ~~~~~~gG~lIP~~~----~~~Ii~~~~~~~~l~~~~~~~~~~~~-----~~p~~~~~~~~a~~v~Eg~~~~~~~~~f~~ 190 (387) T protein:vir:26 120 TGNDSGGDKLLPKTL----SKEIVSEPFAKNQLREKARLTNIKGL-----EIPRVSYTLDDDDFITDVETAKELKAKGDT 190 (387) T ss_pred cCCCCCCceeechhH----HHHHHHHHHhhchhhhhceeeecCCc-----eeeeeeccCCccccccccccccccccccce Confidence 11112223 55444 34666666555556666666555542 223333 34556677888888888888888 Q ss_pred eEeeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCcceeccCCCC Q lcl|NC_017674. 147 ERRTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAFQTPPSQG 226 (382) Q Consensus 147 ~~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~~~~a~~~ 226 (382) ..-..+.++..+.+|.+=|. ....++.+--.....+++...++..+|.+ .+|.. .-.|.++++.+...+ T Consensus 191 v~l~~~k~~~~i~iS~ell~---ds~~~l~~~i~~~la~~~~~~e~~~~~~~-g~g~g-~~~g~~~~~~~~~~~------ 259 (387) T protein:vir:26 191 VKFTTNKFKVFAAISDTVIH---GSDVDLVNWVENALQSGLAAKERKDALAV-SPKSG-LEHMSFYNGSVKEVE------ 259 (387) T ss_pred eeechheeeeechhhHHHHh---hhHHHHHHHHHHHHHHHHHHHHHHhHhhc-CCCcc-ccceeeecccccccc------ Confidence 88899999998899865343 33556676666666666666666655533 22221 224777766554221 Q ss_pred ccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHH-HhhccccCCCCccHHHHHHHhcCccEEEEccccccc Q lcl|NC_017674. 227 WSTADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSK-VDYLSVTTPYGISVSDWIEQTYPKMRIVSAPELSGV 305 (382) Q Consensus 227 Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~-~~~Ls~t~~~~~Tvl~~l~~n~pnl~i~~~peL~~a 305 (382) .+..++||.+++.+|...-. .+. ..+|-+.. ...+....+.|..++. .-|+ ++...|=.-.. T Consensus 260 -----~~~~~d~i~~~~~~l~~~y~----~na---~~imn~~t~~~~~~~~~~~~~~~~~----~~~~-~llG~PV~~~~ 322 (387) T protein:vir:26 260 -----GADMYDAIINALADLHEDYR----DNA---TIYMRYADYVKIISVLSNGTTNFFD----TPAE-KVFGKPVVFTD 322 (387) T ss_pred -----ccchHHHHHHHHhccChhhh----cCC---EEEEechHHHHHHHHHhcCCCcccc----cCCc-cccccceEEec Confidence 11236677777776644221 121 34554443 3334333333332221 1121 11111111000 Q ss_pred cCCCCCceeEEE-EcchhhhhhhccccccchhhhhhhhhhhhcccceecCCceEeccccceeeeEeeccchheeecCC Q lcl|NC_017674. 306 QMKAQEPEDALV-LFVEDVNAAVDGSTDGGSVFSQLVQSKFITLGVEKRAKSYVEDFSNGTAGALCKRPWAVVRYLGI 382 (382) Q Consensus 306 ~g~g~~~~~~~~-~~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~~~~~~~~~~~t~Gv~i~~P~aia~~~GI 382 (382) + ...+++ -|..-+. .+. ++-+.... +.......+-+..|..|.+ ++|.||+.+.== T Consensus 323 ~-----~~~~~~GDf~~~~~-----------~~~---~~~~~~~~-~~~~~~~~~~~~~r~Dg~v-~~~~A~~~l~~k 379 (387) T protein:vir:26 323 A-----AVKPIVGDFNYFGI-----------NYD---GTTYDTDK-DVKKGEYLFVLTAWYDQQR-TLDSAFRIAKAK 379 (387) T ss_pred C-----CCceeeechhhhhh-----------hhh---hhhheecc-cccCCceEEEEEEEeCcEe-echhheEEEEee Confidence 0 001111 1111000 000 01010000 0112233444555655554 569998875432 No 109 >protein:vir:9361 Length: 402 # NCBI annotation: SLT orf 37-like protein # Family: family:all:658 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803339;genbank:gi:29028650;genbank:GeneID:1258088 Probab=61.16 E-value=0.26 Score=23.80 Aligned_cols=312 Identities=11% Similarity=-0.010 Sum_probs=125.2 Q ss_pred CCCcceeeeecC-------ccccccccccc---cchHHHHHHhhcceeccccchhhhhhhhcccccchhhhhhcccccCc Q lcl|NC_017674. 1 MSQISKTHSRLA-------GRNAKPFDLKN---ITNDAVASLSRIGLVFDHAVVQDQIKALAKAGAFRSGSAMDSNFTAP 70 (382) Q Consensus 1 ~~~~~~~~~~~~-------~~~~~~~~~~~---~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~amDa~~~~~ 70 (382) +..+-.+|-.+- ....++..-.. -...++.+..|-... + +......... .....+| . . T Consensus 68 ~~~l~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~---~--~~~~~~~~~~--~~~~~a~----~-~ 135 (402) T protein:vir:93 68 FNIVERQVQDIEEKEKAKVKDKGEAYQSLSDNEKMVKAKAEFYRHAIL---P--NEFEKPSMEA--QRLLHAL----P-T 135 (402) T ss_pred HHHHHHHHHHHHHHHHhhhhhccccCCCCchhHHHHHHHHHHHHHHHh---h--hhHHHHHHhH--HHHHhhh----c-c Confidence 000000000000 00000000000 000011111010000 0 0000000000 0000011 0 1 Q ss_pred ccccchh--HHHHHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEEeee-cccceeecccccCCceeeeeeeee Q lcl|NC_017674. 71 VTTPSIP--TPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVE-PAGTAVEYGDHTNIPLTSWNANFE 147 (382) Q Consensus 71 ~t~~~~~--~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e-~~G~a~~ygd~~DiP~vd~~~~~~ 147 (382) .+.++.| +|..+. .+|++.+...-..+.+..+.+.++. +++.++ ..+.+.+.+.....|..+....+. T Consensus 136 ~t~~~GG~lIP~~~~----~~Ii~~~~~~~~l~~~~~v~~~~~~-----~~p~~~~~~~~a~~v~Eg~~~~~~~~~f~~i 206 (402) T protein:vir:93 136 GNDSGGDKLLPKTLS----KEIVSEPFAKNQLREKARLTNIKGL-----EIPRVSYTLDDDDFITDVETAKELKAKGDTV 206 (402) T ss_pred CCCcCCccccchhHH----HHHHHhHHhhhhhhhhceeeecCCc-----eeeeeeccCCcccccccccccccccccccee Confidence 1222223 555443 3566666555555666666555542 233333 345566778888888888888888 Q ss_pred EeeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCcceeccCCCCc Q lcl|NC_017674. 148 RRTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAFQTPPSQGW 227 (382) Q Consensus 148 ~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~~~~a~~~W 227 (382) .-.++.++..+.+|.+=|. -...++.+--.....+++...+++.+|.+ .+|. ..-.|.++++.+...+ T Consensus 207 ~~~~~k~~~~i~iS~ell~---Ds~~~l~~~i~~~la~~~~~~e~~~~~~~-g~g~-g~p~g~~~~~~~~~~~------- 274 (402) T protein:vir:93 207 KFTTNKFKVFAAISDTVIH---GSDVDLVNWVENALQSGLAAKERKDALAV-SPKS-GLEHMSFYNGSVKEVE------- 274 (402) T ss_pred eecceeeeeechhhHHHHh---hhHHHHHHHHHHHHHHHHHHHHHHhHhhc-CCCc-cccceeeecccccccc------- Confidence 8899999988888865343 33456776666666666666666554433 2221 1234777766544221 Q ss_pred cccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHH-HhhccccCCCCccHHHHHHHhcCccEEEEcccccccc Q lcl|NC_017674. 228 STADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSK-VDYLSVTTPYGISVSDWIEQTYPKMRIVSAPELSGVQ 306 (382) Q Consensus 228 a~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~-~~~Ls~t~~~~~Tvl~~l~~n~pnl~i~~~peL~~a~ 306 (382) ....++||.+++.+|...- ..+. .++|-+.. ...+....+.|..++. .-|+ ++...|=.-..+ T Consensus 275 ----~~~~~d~l~~~~~~l~~~y----~~na---~~imn~~t~~~~~~~~~d~~~~~~~----~~~~-~llG~PV~~t~~ 338 (402) T protein:vir:93 275 ----GADMYDAIINALADLHEDY----RDNA---TIYMRYADYVKIISVLSNGTTNFFD----TPAE-KVFGKPVVFTDA 338 (402) T ss_pred ----ccchHHHHHHHHhccChhh----hcCC---EEEEechHHHHHHHHHhcCCCcccc----cCCc-cccccceEEecC Confidence 1123677777777654321 1121 34565443 3434333333333321 1121 111111111110 Q ss_pred CCCCCceeEEE-EcchhhhhhhccccccchhhhhhhhhhhhcccceecCCceEeccccceeeeEeeccchheeec--CC Q lcl|NC_017674. 307 MKAQEPEDALV-LFVEDVNAAVDGSTDGGSVFSQLVQSKFITLGVEKRAKSYVEDFSNGTAGALCKRPWAVVRYL--GI 382 (382) Q Consensus 307 g~g~~~~~~~~-~~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~~~~~~~~~~~t~Gv~i~~P~aia~~~--GI 382 (382) ...+++ .|..-+- .+. ++-+... -+.......+-+..|.+|.++ .|.||+.+. +- T Consensus 339 -----~~~i~~GDf~~~~~-----------~~~---~~~~~~~-~~~~~~~~~~~~~~r~Dg~v~-~~~A~~~l~ik~~ 396 (402) T protein:vir:93 339 -----AVKPIVGDFNYFGI-----------NYD---GTTYDTD-KDVKKGEYLFVLTAWYDQQRT-LDSAFRIAKAKEN 396 (402) T ss_pred -----CCceeeechhhhhh-----------hhh---hhhhhhh-hcccCCceEEEEEEEeCcEEe-chhheEEEEeecC Confidence 011111 1111000 000 0111110 001123344555667766554 599987543 22 No 110 >protein:vir:3991 Length: 404 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116499;genbank:gi:14251132;genbank:GeneID:921252 Probab=57.12 E-value=0.44 Score=22.53 Aligned_cols=310 Identities=9% Similarity=-0.010 Sum_probs=127.7 Q ss_pred CCCcceeeeec------Cccc--cccccccc--cch---HHHHHHhhcceeccccchhhhhhhhcccccchhhhhhcccc Q lcl|NC_017674. 1 MSQISKTHSRL------AGRN--AKPFDLKN--ITN---DAVASLSRIGLVFDHAVVQDQIKALAKAGAFRSGSAMDSNF 67 (382) Q Consensus 1 ~~~~~~~~~~~------~~~~--~~~~~~~~--~~~---~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~amDa~~ 67 (382) +.++...+-.. ..+. ..+..... ... .++...-+-|...... ....++ T Consensus 56 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------------~e~~a~---- 116 (404) T protein:vir:39 56 RDALREQLVEAQAEQVVNMREEEKGPLNKSEYELKDKFVKEFVNMVRNPMAFLNT---------------VSSKTE---- 116 (404) T ss_pred HHHHHHHHHHHHHHHHhccccccccccccchhhhHHHHHHHHHHHHhcchhhhhh---------------hhhhhh---- Confidence 00000000000 0000 00000000 000 0111111111110000 000111 Q ss_pred cCcccccchh--HHHHHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEEeeecccceeecccccCCce-eeeee Q lcl|NC_017674. 68 TAPVTTPSIP--TPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEPAGTAVEYGDHTNIPL-TSWNA 144 (382) Q Consensus 68 ~~~~t~~~~~--~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e~~G~a~~ygd~~DiP~-vd~~~ 144 (382) ...+..+.| +|..+.+ .|++.+......+.++.+.....-.-........+..+.+.+.+....+|- .+... T Consensus 117 -~~~t~~~gg~~iP~~~~~----~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f 191 (404) T protein:vir:39 117 -TSGSDSAAGLTIPQDIRT----MINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPRL 191 (404) T ss_pred -hcccccCCceeccHHHHH----HHHHHHHhhhhHHhhcceeeccCCcceEEEEeecCCccceeeecCccccccccccce Confidence 111222222 5655554 566665555556666554332221111122223344566777888788885 55788 Q ss_pred eeeEeeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCcceeccCC Q lcl|NC_017674. 145 NFERRTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAFQTPPS 224 (382) Q Consensus 145 ~~~~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~~~~a~ 224 (382) ......++.++..+.+|.+=++ ....+|.+.-.....+++.+.+|+-+++|+..+ .+.. T Consensus 192 ~~i~~~~~k~~~~~~iS~ell~---ds~~~l~~~i~~~l~~~~~~~~d~~il~g~g~~-------------~~~~----- 250 (404) T protein:vir:39 192 TIIKYLIKRYAGIITATNTLLK---DTAENILAWLSSWIAKKVVVTRNQAIIAAMGTV-------------PKKP----- 250 (404) T ss_pred eeEEeeeeeEEeeehhHHHHHh---hchHHHHHHHHHHHHHHHHHHHHHHHHhccccc-------------cccc----- Confidence 8888999999988888875443 334678888888888888889999999996321 0100 Q ss_pred CCccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhccc-cCCCCccHHHH-HHHhcCccEEEEcc-- Q lcl|NC_017674. 225 QGWSTADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSV-TTPYGISVSDW-IEQTYPKMRIVSAP-- 300 (382) Q Consensus 225 ~~Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~-t~~~~~Tvl~~-l~~n~pnl~i~~~p-- 300 (382) ...+.+ ||.+++...... .+..+ ..++|.|+.+..|.. .+..|.-++.- +....| -+|...| T Consensus 251 ---~~~~~~----~i~~~~~~~~~~---~~~~~---a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~-~~l~G~pV~ 316 (404) T protein:vir:39 251 ---TIAKFD----DVITMINTSVDP---AIIAT---SSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNS-YLIKGKKVI 316 (404) T ss_pred ---ccccHH----HHHHHHHHhhhh---hhccC---CEEEEcHHHHHHHHHhhccCCceeeccCcCCCCc-ceecceeEE Confidence 012333 444443321111 11222 258899999888864 23334333210 000011 0111111 Q ss_pred --ccccccCCCCCceeEEE-Ecchhhhhhhccccccchhhhhhhhhhhhcccce---ecCCceEeccccceeeeEeeccc Q lcl|NC_017674. 301 --ELSGVQMKAQEPEDALV-LFVEDVNAAVDGSTDGGSVFSQLVQSKFITLGVE---KRAKSYVEDFSNGTAGALCKRPW 374 (382) Q Consensus 301 --eL~~a~g~g~~~~~~~~-~~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~~---~~~~~~~~~~~~~t~Gv~i~~P~ 374 (382) +-......+.+...+++ .+.+-+.. +. .+.+.. ...+.. ...-....-+..|. |+.+++|. T Consensus 317 ~~~~~~~~~~~~~~~~~~~gd~~~~~~~---~~-------~~~~~i--~~~~~~~~~~~~~~~~~r~~~r~-d~~~~~~~ 383 (404) T protein:vir:39 317 VVADRWLPNSGSTVYPLYYGDMSQAITL---FD-------RENMSL--LPTNIGAGAFETDTTKIRVIDRF-DVKTTDSE 383 (404) T ss_pred EecccccCccCCCccEEEEEeccccEEE---Ee-------ecceEE--EEeccchhhhhhceeeEEEEeee-ccEEeccc Confidence 10011111111111111 11110100 00 000000 000000 00011223344444 56788899 Q ss_pred hheeecCC Q lcl|NC_017674. 375 AVVRYLGI 382 (382) Q Consensus 375 aia~~~GI 382 (382) ||+....- T Consensus 384 a~~~~~~~ 391 (404) T protein:vir:39 384 ALVAGSFT 391 (404) T ss_pred ceEEEEee Confidence 99998877 No 111 >protein:vir:93881 Length: 387 # NCBI annotation: ORF011 # Family: family:all:658 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239938;genbank:gi:66395599;genbank:GeneID:5130947 Probab=55.05 E-value=0.32 Score=23.27 Aligned_cols=312 Identities=10% Similarity=-0.028 Sum_probs=126.3 Q ss_pred CCCcceeee--------ecCcccccccc--ccccchHHHHHHhhcceeccccchhhhhhhhcccccchhhhhhcccccCc Q lcl|NC_017674. 1 MSQISKTHS--------RLAGRNAKPFD--LKNITNDAVASLSRIGLVFDHAVVQDQIKALAKAGAFRSGSAMDSNFTAP 70 (382) Q Consensus 1 ~~~~~~~~~--------~~~~~~~~~~~--~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~amDa~~~~~ 70 (382) +..+.+++- .+......+-. ........+...-|-++.-.... ...... .....+| .. T Consensus 53 ~~~l~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~-----~~~~~~--~~~~~al-----~~ 120 (387) T protein:vir:93 53 FNIVERQVKDIEEKEKAKVKDTGEAYQSLNDHEKMVKAKAEFYRHAILPNEFE-----KPSMEA--QRLLHAL-----PT 120 (387) T ss_pred HHHHHHHHHHHHHHHHHhhhhccccCCCcchhhHHHHHHHHHHHHHhhhhhhh-----hhhhhh--HHHHHhh-----cc Confidence 000000000 00000000000 00000111111111111000000 000000 0000010 11 Q ss_pred ccccchh--HHHHHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEEee-ecccceeecccccCCceeeeeeeee Q lcl|NC_017674. 71 VTTPSIP--TPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIV-EPAGTAVEYGDHTNIPLTSWNANFE 147 (382) Q Consensus 71 ~t~~~~~--~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~-e~~G~a~~ygd~~DiP~vd~~~~~~ 147 (382) .+.++.| +|..+.+ +|++.+...-..+.+..+.+.++. +++.. ...+.+.+.+.....|..+...+.. T Consensus 121 ~t~s~gG~~IP~~~~~----~Ii~~~~~~~~l~~~~~v~~~~~~-----~~p~~~~~~~~a~~v~E~~~~~~~~~~f~~v 191 (387) T protein:vir:93 121 GNDSGGDKLLPKTLSK----EIVSEPFAKNQLREKARLTNIKGL-----EIPRVSYTLDDDDFITDVETAKELKLKGDTV 191 (387) T ss_pred CcCCCCceeechhHHH----HHHHHHHhhchhhhheeeeecCCc-----eEEEEeecCCccccccCccccccccccccee Confidence 1222223 5655544 555555554445666666555542 23333 3445566778888888888888888 Q ss_pred EeeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCcceeccCCCCc Q lcl|NC_017674. 148 RRTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAFQTPPSQGW 227 (382) Q Consensus 148 ~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~~~~a~~~W 227 (382) .-..+.++..+.+|.+=| .-...++.+--......++...++..+|.+ .+|. ..-.|.|+++.+... T Consensus 192 ~~~~~k~~~~~~iS~ell---~Ds~~~l~~~i~~~la~~~~~~e~~~~~~~-g~g~-g~p~g~l~~~~~~~v-------- 258 (387) T protein:vir:93 192 KFTTNKFKVFAAISDTVI---HGSDVDLVNWVENALQSGLAAKERKDALAV-SPKS-GLDHMSFYNGSVKEV-------- 258 (387) T ss_pred eeeheeeeeechhhHHHH---hhhHHHHHHHHHHHHHHHHHHHHHHhHhhc-CCCc-cccceeeeccccccc-------- Confidence 888999988888886433 334556777777777777777777765543 2222 123577777654321 Q ss_pred cccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHH-HhhccccCCCCccHHHHHHHhcCccEEEEcccccccc Q lcl|NC_017674. 228 STADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSK-VDYLSVTTPYGISVSDWIEQTYPKMRIVSAPELSGVQ 306 (382) Q Consensus 228 a~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~-~~~Ls~t~~~~~Tvl~~l~~n~pnl~i~~~peL~~a~ 306 (382) +....++||.+++.+|-..- ..+. .++|.+.. ...+....+.|..++ .. .|+ +|...|=.-..+ T Consensus 259 ---~~~~~~d~i~~~~~~l~~~~----~~~a---~~~mn~~t~~~~~~~~~d~~~~~~---~~-~~~-~llG~PV~~~~~ 323 (387) T protein:vir:93 259 ---EGADMYDAIINALADLHEDY----RDNA---TIYMRYADYVKIISVLSNGTTNFF---DT-PAE-KVFGKPVVFTDA 323 (387) T ss_pred ---cccchHHHHHHHHhccChhh----hcCC---EEEEechHHHHHHHHHhcCCCccc---cc-CCc-cccccceEEecC Confidence 11122567777777654322 1121 34565543 344433322232222 11 121 121211111110 Q ss_pred CCCCCceeEEE-EcchhhhhhhccccccchhhhhhhhhhhhcccceecCCceEeccccceeeeEeeccchheeecCC Q lcl|NC_017674. 307 MKAQEPEDALV-LFVEDVNAAVDGSTDGGSVFSQLVQSKFITLGVEKRAKSYVEDFSNGTAGALCKRPWAVVRYLGI 382 (382) Q Consensus 307 g~g~~~~~~~~-~~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~~~~~~~~~~~t~Gv~i~~P~aia~~~GI 382 (382) ...+++ .|..- +...-++-+.+.. +.....+.+-...|.+|. +++|.||+.+.-= T Consensus 324 -----~~~~~~GDf~~~--------------~~~~~~~~~~~~~-~~~~~~~~~~~~~r~d~~-v~~~eA~~~l~~k 379 (387) T protein:vir:93 324 -----AVKPIVGDFNYF--------------GINYDGTTYDTDK-DVKKGEYLFVLTAWYDQQ-RTLDSAFRIAKAK 379 (387) T ss_pred -----CCceeeeehhhh--------------heehhhheeeecc-cccCCceeEEEEeeeCce-eechhheEEEEee Confidence 000110 01000 0000011111110 111222333344566555 4569999876432 No 112 >protein:vir:4197 Length: 314 # NCBI annotation: putative structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071822;genbank:gi:11863105;genbank:GeneID:1257607 Probab=51.13 E-value=0.59 Score=21.84 Aligned_cols=292 Identities=9% Similarity=-0.040 Sum_probs=132.6 Q ss_pred hHHHHHHhhcceeccccchhhhhhhhcccccchhhhhhcccccCcccccchhHHHHHHhhhhhhheeccccccchhhhCc Q lcl|NC_017674. 26 NDAVASLSRIGLVFDHAVVQDQIKALAKAGAFRSGSAMDSNFTAPVTTPSIPTPIQFLQTWLPGFVKVMTAARKIDEIIG 105 (382) Q Consensus 26 ~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~amDa~~~~~~t~~~~~~~~~~l~~idp~v~~~~~~~~~~~~l~~ 105 (382) ++.++.+.+ .. +. ...-|++ ++.. .|.++ + ++++.+...--.+.+.- T Consensus 1 ~~~~~~~~~---~~-------------k~-----it~~d~~--gG~L-----~P~~~-~----~~i~~l~e~s~i~~~a~ 47 (314) T protein:vir:41 1 MDFLNKPFQ---IT-------------PK-----IDVPDLG--KGIL-----AVQRF-G----EFVREVRENSAIIKDAR 47 (314) T ss_pred CchhhhHHH---hh-------------cc-----cccccCC--Ccee-----ChHHH-H----HHHHHHHhccchhhhee Confidence 222222221 00 00 0011211 1111 23332 2 34444444444444444 Q ss_pred cc-cCCCcceeeEEEEeeecc----cceeecccccCCceeeeeeeeeEeeEEEEEEEEEecHHHHHHHHHhCCChHHHHH Q lcl|NC_017674. 106 ID-TVGSWEDQEIVQGIVEPA----GTAVEYGDHTNIPLTSWNANFERRTIVRGELGMMVGTLEEGRASAIRLNSAETKR 180 (382) Q Consensus 106 v~-t~g~~~~~t~t~~v~e~~----G~a~~ygd~~DiP~vd~~~~~~~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~ 180 (382) +. +.+. ....+..+... ..+...|+.++.|..+.......-..+.+..-+.++.+.|+- ..-|.++.+.-. T Consensus 48 vi~t~~s---~~~~i~~i~~g~~~~~~~~~~~~~~~~~~~~~tf~~~~l~~~kl~~~v~is~e~L~D-~a~~~~le~~i~ 123 (314) T protein:vir:41 48 VLNALKS---YEVDISRISLGVELEPGRNTSGTKVAPTADEVTVSTNTLEMKELVTKVVLEDEALED-NIEQSAFEQTIT 123 (314) T ss_pred eecccCc---cceeecccccCcccccccccccCCccCCcccccccceeeeeEEEEEeecccHHHHHh-hhchhhHHHHHH Confidence 43 2232 11222222211 112234555666777777777777777777778888777763 335678999999 Q ss_pred HHHHHHHHHhhccEEEEeeccCC-----cccceEEEeCCCCcceeccCCCCccccCHHHHHHHHHHHHHHHHHhcCCeee Q lcl|NC_017674. 181 QQAAIGLEIFRNAIGFYGWQSGL-----GNRTYGFLNDPNLPAFQTPPSQGWSTADWAGIIGDIREAVRQLRIQSQDQID 255 (382) Q Consensus 181 ~aAr~a~~~~~n~i~~~Gd~~g~-----~~g~~GllN~P~l~~~~~~a~~~Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~ 255 (382) ....+.+...+....|.||.+.. .+...|+|+.....++.+. .-..+.+++.+.|+...+..-..+..+ T Consensus 124 ~~~Ae~~g~~~~~~~~nGdg~~~s~~~~~~~p~G~l~~a~~~~~~~~---~~~~~~~~~~~~~l~~sl~~~yr~~~~--- 197 (314) T protein:vir:41 124 SLLASGVTYDLECFFLHADSSLTTGRELYRINDGWMKLAGNQYTDAE---PEDENWPLNLFDGMMDELDTRYLQLKP--- 197 (314) T ss_pred HHHHHHHHHHHHHHhhccccCCcCcccchhcchhhhhhcccceeecC---ccccccHHHHHHHHHHhcCchhhcCCC--- Confidence 99999999999999999985321 1123477765433221111 111234444444444444332333321 Q ss_pred eccccceEecCHHHHhhcccc-CCCCccHHHHHHHh-----cCccEEEEccccccccCCCCCceeEEEEcchhhhhhhcc Q lcl|NC_017674. 256 PKAEKITLALATSKVDYLSVT-TPYGISVSDWIEQT-----YPKMRIVSAPELSGVQMKAQEPEDALVLFVEDVNAAVDG 329 (382) Q Consensus 256 ~~~~p~~L~Lp~~~~~~Ls~t-~~~~~Tvl~~l~~n-----~pnl~i~~~peL~~a~g~g~~~~~~~~~~~~~v~~~~~~ 329 (382) ..+.+|++..+..+.+. .+-+..+++..... +-+..++.+|.+...+. +...+++=.-++. +. T Consensus 198 ----~~~~~m~~~t~~~~r~~l~~~~~~l~~~~~~~~~~~~l~G~PV~~~~~~~~~~~---~~~~i~fgd~~nl---v~- 266 (314) T protein:vir:41 198 ----RMKFYVSNEIYNGYRKQLLVRETGLGDSALIGATGLQYDGIPIQYVPALDALGD---DKARALLTVPTNL---VY- 266 (314) T ss_pred ----ceEEEecHHHHHHHHHHHhccCCcccchhhhCCCCceecceeeEecccccccCC---CCceEEEechhhe---EE- Confidence 23577888766554321 11122223222222 22445777777755432 2222332222221 11 Q ss_pred ccccchhhhhhhhhhhhcccceec--CCceEeccccceeeeEeeccchheeecCC Q lcl|NC_017674. 330 STDGGSVFSQLVQSKFITLGVEKR--AKSYVEDFSNGTAGALCKRPWAVVRYLGI 382 (382) Q Consensus 330 ~~~~~~~~~~~~p~~~~~l~~~~~--~~~~~~~~~~~t~Gv~i~~P~aia~~~GI 382 (382) .+....+..+ ++. ...+..-.+.|+.....-.+.++....+= T Consensus 267 ----------~~~~~ir~~~-~~~a~~~~~~~~~~~r~d~~~~~~~aa~~~~~~~ 310 (314) T protein:vir:41 267 ----------GFWRNIRIEP-KRDAAMRRTEYIASLRADCNYEDENAAVAAVIDM 310 (314) T ss_pred ----------EeeceeEEee-cccCcCCeEEEEEEEEeceEEEEcCcEEEEEeec Confidence 1111222211 111 22333334445544444466666666555 No 113 >protein:vir:4092 Length: 390 # NCBI annotation: major capsid protein a # Family: family:all:635 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510986;swissprot:trembl:q8w604;genbank:gi:17488508;uniprot:Q8W604;genbank:GeneID:1260361 Probab=50.79 E-value=0.6 Score=21.80 Aligned_cols=325 Identities=11% Similarity=-0.035 Sum_probs=132.0 Q ss_pred CCCcceeee----ecCcc-----cccccc----------------ccccchHHHHHHhhcceeccccchhhhhhhhcccc Q lcl|NC_017674. 1 MSQISKTHS----RLAGR-----NAKPFD----------------LKNITNDAVASLSRIGLVFDHAVVQDQIKALAKAG 55 (382) Q Consensus 1 ~~~~~~~~~----~~~~~-----~~~~~~----------------~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~ 55 (382) +.++...+. .+..+ ..+.++ ...-.......+.+.|..--....+ T Consensus 8 ~~e~~e~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~r---------- 77 (390) T protein:vir:40 8 DSETLNISTAFLNAIKEGATEAEQVTAFTNMAEQIQNNIIAQARKEVNREMNDNNVLASRGANALTSDES---------- 77 (390) T ss_pred HHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCchhccHHHH---------- Confidence 000000000 00000 000000 0000000111111222110000000 Q ss_pred cchhhhhhcccccC-cccccchhHHHHHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEEeeecccceeecccc Q lcl|NC_017674. 56 AFRSGSAMDSNFTA-PVTTPSIPTPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEPAGTAVEYGDH 134 (382) Q Consensus 56 ~~~~~~amDa~~~~-~~t~~~~~~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e~~G~a~~ygd~ 134 (382) -++.+.... ..+..+.-+|..+.+ +|++.+...-..+.++.+.+.+. ....++.....+.+.+.+.. T Consensus 78 -----~~~~~~~~~~~~~~gg~lvP~~~~~----~I~~~~~~~s~i~~~~~~~~~~~---~~~~i~~~~~~~~a~~~~E~ 145 (390) T protein:vir:40 78 -----KYYNEVIAGNGFAGVTALLPPTVFE----RVFEDLTVEHPLLSKINFVNTTA---TTEWIISVGDVATAWWGPLC 145 (390) T ss_pred -----HHHHHHHhccCcccCcccccHHHHH----HHHHHHHhhhhhhhhceeeecCC---ceeEEEEEcCCcceeeeccc Confidence 011111011 111122235655554 44444433333444444433332 34446666777777777766 Q ss_pred cCCc-eeeeeeeeeEeeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeC Q lcl|NC_017674. 135 TNIP-LTSWNANFERRTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLND 213 (382) Q Consensus 135 ~DiP-~vd~~~~~~~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~ 213 (382) ..+| ..+.......-..+.+...+.+|.+=++ ....++.+.-.....+++...+|+-+++|+.. ..-.|+||+ T Consensus 146 ~~~~~~~~~~f~~i~l~~~k~~~~i~iS~ell~---ds~~~l~~~i~~~la~~i~~~~~~a~l~G~G~---~~P~Gil~~ 219 (390) T protein:vir:40 146 AEIKEVLDNGFDKIQTGMYKLSAYIPVCNAMLD---LGPSWLDQYVRTILGEAMALGLEAGIVNGSGK---DQPIGMMRD 219 (390) T ss_pred cccCccccccceeeEeeeeeEEEeehhhHHHHh---cchHHHHHHHHHHHHHHHHHHHHhhhhcccCC---Cccceeeec Confidence 6665 4567778888888889888888865554 34557888899999999999999999999743 334699998 Q ss_pred CCCcceeccCCCCccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHH-HHhhcc----ccCCCCccHHHHHH Q lcl|NC_017674. 214 PNLPAFQTPPSQGWSTADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATS-KVDYLS----VTTPYGISVSDWIE 288 (382) Q Consensus 214 P~l~~~~~~a~~~Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~-~~~~Ls----~t~~~~~Tvl~~l~ 288 (382) +..............+-|.+.+.+.+..+...+....... ..+ -.++|.++ .+.+|. ..+..|.-++..+- T Consensus 220 ~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~-~~~---a~~i~n~~t~~~~l~~~~~~~d~~G~~v~~~~~ 295 (390) T protein:vir:40 220 LNNVTAGEHPVKTATPLTDLTPATLATKVMLPLTDNGKKS-VSD---AILVINPADYWSKIYAATSYMTPQGVWVTGILP 295 (390) T ss_pred cccccccccccccccccchhhHHHHHHHHHHHhhcchhhh-hcC---ceEEEcchhHHHHHHHHhhccCCCCccccccCC Confidence 7533211111111222233333333333333332222111 111 13455543 334332 11223332221110 Q ss_pred HhcCccEEEEccccccccCCCCCceeEEEEcchhhhhhhccccccchhhhhhhhhhhhccccee--cCCceEecccccee Q lcl|NC_017674. 289 QTYPKMRIVSAPELSGVQMKAQEPEDALVLFVEDVNAAVDGSTDGGSVFSQLVQSKFITLGVEK--RAKSYVEDFSNGTA 366 (382) Q Consensus 289 ~n~pnl~i~~~peL~~a~g~g~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~~~--~~~~~~~~~~~~t~ 366 (382) + ++.++..+.+. .+. +++-.-.++. ..+. + .+.+.... +. .......-...|.+ T Consensus 296 --~-g~pvv~~~~~p-------~~~-i~~Gd~s~~~-i~~~---------~--~~~v~~~~-~~~f~~~~~~~r~~~r~d 351 (390) T protein:vir:40 296 --V-PLEIVQSVAVP-------VGK-AVAGRAKDYF-MGIG---------S--EQVIRTST-EYRLLDDETLYYAKQYAN 351 (390) T ss_pred --C-ceeEEEcCCCC-------CCc-EEEEeeceEE-EEee---------c--ceEEEecc-hhhhhcCcEEEEEEEEeC Confidence 1 23333222211 111 2211111110 0000 0 00011110 01 11123334455554 Q ss_pred eeEeeccchhee--ecCC Q lcl|NC_017674. 367 GALCKRPWAVVR--YLGI 382 (382) Q Consensus 367 Gv~i~~P~aia~--~~GI 382 (382) | .++.|.||+. +.++ T Consensus 352 g-~v~~~~A~~~l~~~~~ 368 (390) T protein:vir:40 352 G-RPKDNSSFLVFDITGL 368 (390) T ss_pred C-EEecccceEEEEeecc Confidence 4 4556998884 3555 No 114 >protein:vir:1383 Length: 421 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612835;genbank:gi:20065969;genbank:GeneID:935826 Probab=48.51 E-value=0.67 Score=21.55 Aligned_cols=305 Identities=11% Similarity=0.031 Sum_probs=118.7 Q ss_pred CCCcc--------------eeeeecCccccccccccccc-hHHHHHHhhcceeccccchhhhhhhhcccccchhhhhhcc Q lcl|NC_017674. 1 MSQIS--------------KTHSRLAGRNAKPFDLKNIT-NDAVASLSRIGLVFDHAVVQDQIKALAKAGAFRSGSAMDS 65 (382) Q Consensus 1 ~~~~~--------------~~~~~~~~~~~~~~~~~~~~-~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~amDa 65 (382) |..+. +......++....-+..... ......+.++. .... +-. T Consensus 54 i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------------~~~~-------~~~ 111 (421) T protein:vir:13 54 MEIIEEEIESVMTAIDEERKNTNFTGGRVIINGDSKEEKRSLQLSAMSKTI---------------RGIQ-------LSE 111 (421) T ss_pred HHHHHHHHHHHHHHHHHHHhhhcccccccccccchhHHHHHHHHHHHHHhh---------------hccc-------hhH Confidence 11000 00000000000000000000 00011111110 0000 000 Q ss_pred cccCcccccchh--HHHHHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEEeeecccc--eeecccccCCceee Q lcl|NC_017674. 66 NFTAPVTTPSIP--TPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEPAGT--AVEYGDHTNIPLTS 141 (382) Q Consensus 66 ~~~~~~t~~~~~--~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e~~G~--a~~ygd~~DiP~vd 141 (382) ...+..++++.| +|..+.+ .|++.+......+.++.+.+... .+..|.+...... +...+...++|..+ T Consensus 112 ~~ra~~t~~~gg~liP~~~~~----~Ii~~~~~~~~l~~l~~~~~~~~---~~~~~~~~~~~~~~~~~~~~E~~~~~~s~ 184 (421) T protein:vir:13 112 EERDIMSSTNNGAVIPQEFVN----EFEKLKEGYPSLKEHCHVIPVNR---NAGKMPVRAGASVDKLANLAKDTELVKAM 184 (421) T ss_pred HHhhccccCCcceecchhhHH----HHHHHHHhhhhhhhhceeeeccC---CceEEEEeecCCccceeeccccccccccc Confidence 011122333333 5544443 55555554444555554433222 2344444433333 33456677888888 Q ss_pred eeeeeeEeeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCcceec Q lcl|NC_017674. 142 WNANFERRTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAFQT 221 (382) Q Consensus 142 ~~~~~~~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~~~ 221 (382) .......-.++.++..+.+|.+=|+-+ ..+|.+--....++++..++|.-+. +...|+++.+. T Consensus 185 ~~f~~i~~~~~k~~~~v~iS~ell~ds---~~~l~~~i~~~la~~~~~~~~~~i~--------~~~~g~~~~~~------ 247 (421) T protein:vir:13 185 LKTQPMAYDIDDYGLLAPIDNSLLEDS---EINFLEFVNEEFAEFAVNTENAEIV--------KQAKAVLAEET------ 247 (421) T ss_pred cceeEEEeeeeeeEeehhhhHHHHhhh---HHHHHHHHHHHHHHHHHHHhhhhHh--------hhhhhcccccc------ Confidence 888888888999998888887544333 3456655555556666666553211 12234433211 Q ss_pred cCCCCccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhccc-cCCCCccHHHHHHHhcC----ccEE Q lcl|NC_017674. 222 PPSQGWSTADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSV-TTPYGISVSDWIEQTYP----KMRI 296 (382) Q Consensus 222 ~a~~~Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~-t~~~~~Tvl~~l~~n~p----nl~i 296 (382) ..+ ++||.++++.+...- .. ...++|.+..+..|.. .+..|.=++.-+...-| +..+ T Consensus 248 -------~~~----~d~i~~~~~~l~~~~----~~---~a~~v~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~tl~G~pV 309 (421) T protein:vir:13 248 -------IND----YAGLVKTINSLVPNA----RK---RAIIVTNSDGRAYLDGLMDKQGRPLLKELSDGGDLVFKGRPV 309 (421) T ss_pred -------ccc----hHHHHHHHHHhhhhh----cC---CCEEEEcHHHHHHHHHhhcCCCceeecCcCCCCCceecceee Confidence 012 456777777664321 11 2368999999988864 34444433332221111 1223 Q ss_pred EEccccccccCCCCCceeEEEEcchhhhhhhccccccchhhhhhhhhhhhcccceecCCceEecccccee---------- Q lcl|NC_017674. 297 VSAPELSGVQMKAQEPEDALVLFVEDVNAAVDGSTDGGSVFSQLVQSKFITLGVEKRAKSYVEDFSNGTA---------- 366 (382) Q Consensus 297 ~~~peL~~a~g~g~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~~~~~~~~~~~t~---------- 366 (382) +..+..-. +.+.....++-.+.+-+.. .+.+. ..+.. .+..+ ...-.+.+.+..|.+ T Consensus 310 ~~~~~~~~--~~~~~~~~~~gd~~~~~~~-~~~~~---~~v~~-~~~~~------f~~~~~~~r~~~r~d~~~~~~~a~~ 376 (421) T protein:vir:13 310 IELEESIF--DVGDETKFIVSDFKTLIKF-MDRKQ---YLIDQ-SKEAG------YTKNETIARIIERFDVNSPLDKSSD 376 (421) T ss_pred EEeccccc--cCCCceEEEEEeccccEEE-EEecc---eEEEe-ecccc------cccCeeEEEEEeeecceeecchhhh Confidence 33322211 1111111111111111100 00000 00000 00000 001112223334443 Q ss_pred eeEeeccchheeecCC Q lcl|NC_017674. 367 GALCKRPWAVVRYLGI 382 (382) Q Consensus 367 Gv~i~~P~aia~~~GI 382 (382) .+.+.+|.+++...++ T Consensus 377 ~~~~~~~~a~v~~~~~ 392 (421) T protein:vir:13 377 AEKIRKFGVIVKLQEV 392 (421) T ss_pred eeeecccceeeccccc Confidence 3344555556666565 No 115 >protein:vir:4953 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049929;genbank:gi:9632900;genbank:GeneID:1262076 Probab=46.05 E-value=0.75 Score=21.27 Aligned_cols=307 Identities=10% Similarity=-0.015 Sum_probs=130.8 Q ss_pred CCCc--------ceeeeecCccccccccccc--cchHHHHHHhhcceeccccchhhhhhhhcccccchhhhhhcccccCc Q lcl|NC_017674. 1 MSQI--------SKTHSRLAGRNAKPFDLKN--ITNDAVASLSRIGLVFDHAVVQDQIKALAKAGAFRSGSAMDSNFTAP 70 (382) Q Consensus 1 ~~~~--------~~~~~~~~~~~~~~~~~~~--~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~amDa~~~~~ 70 (382) +... .....-...+.-++..-.. ......+.++++ +.... ....-++ . . T Consensus 53 ~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------------l~~~~-~~~~~~~----~-~ 111 (397) T protein:vir:49 53 RDMFKEQYTEARANEVANMSEEEKKPLTKSEEEVKAGFVKDFKNL---------------VRGRY-QNLLDSK----T-D 111 (397) T ss_pred HHHHHHHHHHHHHHhhhccccccccccccchhHHHHHHHHHHHHH---------------Hhcch-hHHHHHh----h-c Confidence 0000 0000000111111111110 000111111111 10000 0000011 1 1 Q ss_pred ccccchh--HHHHHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEEee-ecccceeecccccCCce-eeeeeee Q lcl|NC_017674. 71 VTTPSIP--TPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIV-EPAGTAVEYGDHTNIPL-TSWNANF 146 (382) Q Consensus 71 ~t~~~~~--~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~-e~~G~a~~ygd~~DiP~-vd~~~~~ 146 (382) .++.+.| +|..+. +.|++.+........++.+.....-. ..+.|... +..|.+.+.+.+..+|- ......+ T Consensus 112 ~t~~~gg~~vP~~~~----~~ii~~~~~~~~l~~~~~~~~~~~~~-~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~ 186 (397) T protein:vir:49 112 ASGSDAGLTIPQDIQ----TAIHTLVSQYDSLQEYVNVENVTTLT-GSRVYEKWTDITGLANIDDEAGKIADVDDPKLSL 186 (397) T ss_pred cccccCcccccHhHH----HHHHHHHHhhhhHHhhhceeecccCc-cceEEEeeccCCcceeeecCccccccccccceee Confidence 1222333 454443 46666666666666665554332211 12333333 34567788888888885 5678888 Q ss_pred eEeeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCcceeccCCCC Q lcl|NC_017674. 147 ERRTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAFQTPPSQG 226 (382) Q Consensus 147 ~~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~~~~a~~~ 226 (382) ..-.++.++..+.+|.+=++. ...++.+--.....+++...+|+-.+.|+..+ . +. +... T Consensus 187 i~~~~~k~~~~~~iS~ell~d---s~~~l~~~i~~~l~~~~~~~~d~ai~~G~g~~--~-----------~~---~~~~- 246 (397) T protein:vir:49 187 IKYTIKRYAGISTVTNSLLAD---SAENILAWLSGWIAKKVVVTRNKAILEAIAAL--P-----------TK---PTLT- 246 (397) T ss_pred EEeeeeeEEeeehhHHHHHhh---hHHHHHHHHHHHHHHHHHHHHHHHHHhhcccc--c-----------cc---cccc- Confidence 899999999999988754432 34567887888888888888999899997321 0 00 0011 Q ss_pred ccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhccc-cCCCCccHHHH-HHHhcCccEEEEcc---- Q lcl|NC_017674. 227 WSTADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSV-TTPYGISVSDW-IEQTYPKMRIVSAP---- 300 (382) Q Consensus 227 Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~-t~~~~~Tvl~~-l~~n~pnl~i~~~p---- 300 (382) + ++||.+++..+...- ..+ ..++|.++.+..|.. .+..|.-++.= +....+ -++...| T Consensus 247 ----~----~d~i~~~~~~l~~~~----~~~---a~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~-~~l~G~PV~~~ 310 (397) T protein:vir:49 247 ----K----WDDIIDLEAKVDPAI----KQT---SFFLTNTSGFTALKKVKNALGDYLMERDVKSPTG-YSIDGFAVKEV 310 (397) T ss_pred ----c----HHHHHHHHHhhhhhh----cCC---CEEEEcHHHHHHHHHhhcCCCceeeccCcCCCCC-ceecceeeEEe Confidence 2 456666776664432 122 368899999988854 23334433210 111111 1121111 Q ss_pred ccccccCCCCCceeEEEE-cchhhhhhhccccccchhhhhhhhhhhhcccce---ecCCceEeccccceeeeEeeccchh Q lcl|NC_017674. 301 ELSGVQMKAQEPEDALVL-FVEDVNAAVDGSTDGGSVFSQLVQSKFITLGVE---KRAKSYVEDFSNGTAGALCKRPWAV 376 (382) Q Consensus 301 eL~~a~g~g~~~~~~~~~-~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~~---~~~~~~~~~~~~~t~Gv~i~~P~ai 376 (382) +-.....+..+...+++- +.+-+. .+ + .+.+. +...+.. ...-....-+..|. |+.+++|.+| T Consensus 311 ~~~~~~~~~~~~~~i~~gd~~~~~~---~~-~------~~~~~--i~~~~~~~~~~~~~~~~~r~~~r~-d~~~~~~~a~ 377 (397) T protein:vir:49 311 ADRWLANGTGGAMPLYFGDLKQAVT---LF-D------RQHMS--LLSTNIGGGAFETDTTKVRVIDRF-DVVATDTEAF 377 (397) T ss_pred cccccccccCCceeEEEeeccceEE---EE-e------ecceE--EEEeccccchhhcCceeEEEEeee-CcEEecccce Confidence 101111111111112211 111010 00 0 00000 0000000 00001112233343 5577888888 Q ss_pred eeecCC Q lcl|NC_017674. 377 VRYLGI 382 (382) Q Consensus 377 a~~~GI 382 (382) +.+..= T Consensus 378 ~~~~~~ 383 (397) T protein:vir:49 378 VPASFK 383 (397) T ss_pred EEEEee Confidence 876643 No 116 >protein:vir:739 Length: 231 # NCBI annotation: major structural protein 4 # Family: family:all:522 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108716;genbank:gi:13487838;genbank:GeneID:920884 Probab=44.79 E-value=0.79 Score=21.13 Aligned_cols=219 Identities=10% Similarity=0.008 Sum_probs=104.1 Q ss_pred cCCCcceeeEEEEeeecccceeecccccCCceeeeeeeeeEeeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHH Q lcl|NC_017674. 108 TVGSWEDQEIVQGIVEPAGTAVEYGDHTNIPLTSWNANFERRTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGL 187 (382) Q Consensus 108 t~g~~~~~t~t~~v~e~~G~a~~ygd~~DiP~vd~~~~~~~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~ 187 (382) ..|--.=.+++|+.+ .|.|..+++++.+|.-.+.......+|...+-+++++..+.. +..|=++ .+-.....+++ T Consensus 1 ~~~~~~Gdtit~P~~--iGda~~v~eG~~i~~~~l~~t~~~atIk~~gk~~~itD~a~l--~~~gDp~-~ea~~Q~~~~i 75 (231) T protein:vir:73 1 ENGINLANLCEYPND--IGDAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAAL--SGYGDPI-GESNKQLGLSL 75 (231) T ss_pred CccccCCceEEeccc--ccchhhhcCCCcCChhhccccceeeeEeeeccceeeeHHHHh--hccCchH-HHHHHHHHHHH Confidence 222222368888865 899999999999999999999999999999888888776653 4455444 33333444444 Q ss_pred HHhhccEEEEeeccCCcccceEEEeCCCCcceeccCCCCccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCH Q lcl|NC_017674. 188 EIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAFQTPPSQGWSTADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALAT 267 (382) Q Consensus 188 ~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~~~~a~~~Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~ 267 (382) ..++|.=.+ + + +. +..|..++. --+++|++++..+-.. +..+..+++.| T Consensus 76 A~kvD~di~-~-~----------~~-----------~a~l~~~~~-~t~d~i~~A~~~fgde-------~~~~~vivv~p 124 (231) T protein:vir:73 76 ANKVDDDLL-K-A----------AK-----------TTSQTVSTK-ANVDGVQAALDIFNDE-------DAQAYVLIVNP 124 (231) T ss_pred HHhhhHHHH-H-h----------hc-----------ccccccccc-ccHHHHHHHHHHhccc-------cccceEEEEcc Confidence 444443111 1 1 00 001111111 1155555555554221 12467899999 Q ss_pred HHHhhccccCCCCccHHHHHHHh---------cCccEEEEccccccccCCCCCceeEEEEcchhhhhhhccccccchhhh Q lcl|NC_017674. 268 SKVDYLSVTTPYGISVSDWIEQT---------YPKMRIVSAPELSGVQMKAQEPEDALVLFVEDVNAAVDGSTDGGSVFS 338 (382) Q Consensus 268 ~~~~~Ls~t~~~~~Tvl~~l~~n---------~pnl~i~~~peL~~a~g~g~~~~~~~~~~~~~v~~~~~~~~~~~~~~~ 338 (382) ..+..|-+--....+ -.....+ +-+++|+.-+-+.. ++ +...-|...+.- ..+ T Consensus 125 ~~~~~Lrk~~~~~~~-~~~~g~~i~~~G~iG~i~G~~Vi~S~~~~~----~~-~~~~~~i~~~gA-----------l~~- 186 (231) T protein:vir:73 125 KDAAKIRKDANAKNI-GSEVGANALINGTYADVLGAQIVRSKKLAE----GS-ALMFKIVSNSPA-----------LKL- 186 (231) T ss_pred hHHHhhhhccchhhh-hhhhccceeeecccceEcceEEEEcCCCCC----Cc-eeeeeEEeeccc-----------eee- Confidence 988888441111000 0001010 12334443222211 11 111111111110 000 Q ss_pred hhhhhhhhcccceecCCceEec-cccceeeeEeeccchheee--cCC Q lcl|NC_017674. 339 QLVQSKFITLGVEKRAKSYVED-FSNGTAGALCKRPWAVVRY--LGI 382 (382) Q Consensus 339 ~~~p~~~~~l~~~~~~~~~~~~-~~~~t~Gv~i~~P~aia~~--~GI 382 (382) .+-...+ ...++..+...-. ..-..+||-++.|..++.+ .|+ T Consensus 187 -~~k~~~~-vEtdRd~~~k~~~i~~~~~y~v~l~~~~~vv~~t~~g~ 231 (231) T protein:vir:73 187 -VLKRGVQ-VETDRDIVTKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) T ss_pred -eecccce-eeccccccccccEEEEeEEEEEEEEcCccEEEEEeecC Confidence 0000000 0011111111111 1113479999999998876 677 No 117 >protein:vir:6324 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877471;genbank:gi:33300843;uniprot:Q7Y2D3;genbank:GeneID:1482613 Probab=44.33 E-value=0.81 Score=21.08 Aligned_cols=291 Identities=10% Similarity=0.041 Sum_probs=112.9 Q ss_pred hHHHHHHhhcceeccccchhhhhhhhcccccchhhhhhcccccCcccccchhHHHHHHhhhhhhheeccccccchhhhCc Q lcl|NC_017674. 26 NDAVASLSRIGLVFDHAVVQDQIKALAKAGAFRSGSAMDSNFTAPVTTPSIPTPIQFLQTWLPGFVKVMTAARKIDEIIG 105 (382) Q Consensus 26 ~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~amDa~~~~~~t~~~~~~~~~~l~~idp~v~~~~~~~~~~~~l~~ 105 (382) |-..+.|-|-|. .++.+-.+-||+.+.-+|.+.....-+.+.++. T Consensus 1 ms~~~~~tr~~~-----------------------------------~~s~~d~al~le~f~geV~~af~~~s~~~~~~~ 45 (335) T protein:vir:63 1 MSFLNDLTRPNY-----------------------------------AGKNADVDIHLEEHLGIVDKHFAYTSKFAPLMN 45 (335) T ss_pred CCCcccchhhhc-----------------------------------ccccchhheehhhhhhhHHHHHHhhhhhccccc Confidence 000111111111 011111222444444444444433444455555 Q ss_pred cccCCCcceeeEEEEeeecccceeec----ccc-cCCceeeeeeeeeEeeE--EEEEEEEEecHHHHHHHHHhCCChHHH Q lcl|NC_017674. 106 IDTVGSWEDQEIVQGIVEPAGTAVEY----GDH-TNIPLTSWNANFERRTI--VRGELGMMVGTLEEGRASAIRLNSAET 178 (382) Q Consensus 106 v~t~g~~~~~t~t~~v~e~~G~a~~y----gd~-~DiP~vd~~~~~~~~~v--~~~~~g~~y~~~El~~A~~~g~~l~~~ 178 (382) +.+-- ...++.|+.. |..+.. |.. ...|... ++....| ..+.-.+=|.++|. ++..++-++ T Consensus 46 ~rti~--~g~s~~~~~i---G~~~~~~~~pG~~l~~~~~~~---~k~~itVD~ll~a~~~I~dlDe~----~~~yDvRse 113 (335) T protein:vir:63 46 IRDLR--GSNVVRLDRL---GNVEAKGRRAGEELERSRVVN---DKWNLTVDTLLYLRHQFDHQDEW----TQSFDMRKE 113 (335) T ss_pred eeeec--cceeEEEeee---eeeeeecccCCcCcCCCCccc---cceEEEecceeechhhhhhHHHH----hcCchhHHH Confidence 44431 1256655544 666554 221 1112211 2212111 11222333445553 233444444 Q ss_pred HHHHHHHHHHHhhccEEE------EeeccCCcccceEEEeCCCCcceeccCCCCccccCHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_017674. 179 KRQQAAIGLEIFRNAIGF------YGWQSGLGNRTYGFLNDPNLPAFQTPPSQGWSTADWAGIIGDIREAVRQLRIQSQD 252 (382) Q Consensus 179 K~~aAr~a~~~~~n~i~~------~Gd~~g~~~g~~GllN~P~l~~~~~~a~~~Wa~kT~~eI~~Di~~~~~~l~~~t~g 252 (382) -....-.++.++.|+.++ .+..+ .-...|-++ |++...+..+ +.-+...++.+.+=+..+..++.++--. T Consensus 114 ~s~e~G~aLA~~~D~~~~~~i~~aa~~~a--~~~~~~~~~-~G~~~~~~~t-g~~~~~~~~~l~~a~~~a~~~L~e~dVP 189 (335) T protein:vir:63 114 VAELDGQELARKFDQACLIQVIKAAAMDA--PVDLEDAFS-PGVLEKLDLT-GLTAKQAADKIVRMHRRVVETFIDRDLG 189 (335) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhccccC--ccccCCCcC-CCcceeeeec-cCcccccHHHHHHHHHHHHHHHHhccCC Confidence 444555555555555432 22211 111223333 3333222222 2333346888888777777777654421 Q ss_pred eeeeccccceEecCHHHHhhccccC-----CCCcc--HHHHHHHh---cCccEEEEccccccccCCCCC----------- Q lcl|NC_017674. 253 QIDPKAEKITLALATSKVDYLSVTT-----PYGIS--VSDWIEQT---YPKMRIVSAPELSGVQMKAQE----------- 311 (382) Q Consensus 253 ~~~~~~~p~~L~Lp~~~~~~Ls~t~-----~~~~T--vl~~l~~n---~pnl~i~~~peL~~a~g~g~~----------- 311 (382) ++-..+...+++|.+|..|-.-+ +++.| .-.+.+.. --+++|+..+.|-+..+.++. T Consensus 190 --~~~~~dr~~vv~P~~y~~Ll~~~~l~n~~~~~s~~~~~~~~g~v~~v~Gv~V~~sn~lP~~~~t~~~lg~a~n~~~~d 267 (335) T protein:vir:63 190 --DAVYSEGLTPMSPRVFSLLLEHDKLMNVEYQATGATNDYVKSRVAILNGVKVLETPRFATKAIAAHPLGRHFNVSAEE 267 (335) T ss_pred --CcccCceEEEeChHHHHHHhccccccccccccccccccccCceeEEeeceEEEeeccCCCCCcccccccccCCccccc Confidence 00012357899999999985421 12111 11111111 124556666665221111100 Q ss_pred -ceeEEEEcchhhhhhhccccccchhhhhhhhhhhhcccceecCCceEeccccceeeeEeeccch--heeecCC Q lcl|NC_017674. 312 -PEDALVLFVEDVNAAVDGSTDGGSVFSQLVQSKFITLGVEKRAKSYVEDFSNGTAGALCKRPWA--VVRYLGI 382 (382) Q Consensus 312 -~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~~~~~~~~~~~t~Gv~i~~P~a--ia~~~GI 382 (382) .+-+.+.+.++.... .+.++..-.. --+.+.-.|.+.+... .|+-++||.+ +....|| T Consensus 268 ~~~~~~~~~~~~Al~t-----------~~~~~vt~e~-~~~~~~~~~~i~~~~a-~G~g~lRPe~a~~i~~tg~ 328 (335) T protein:vir:63 268 SERQIALFLPSKTLIT-----------AQVAPVQAKL-WEDNEKFSWVLDTFQM-YNIGARRPDTAGAIELKGI 328 (335) T ss_pred cceeEEEEEecceEEE-----------EEEeecccce-eeccchhhHHhHHHHH-cCCcccccceEEEEEEcCC Confidence 001222222211100 0001100000 0011112333333333 6999999955 4567888 No 118 >protein:vir:94622 Length: 341 # NCBI annotation: PfWMP4_37 # Family: family:all:2203 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762667;genbank:gi:115304375;genbank:GeneID:5142322 Probab=43.32 E-value=0.85 Score=20.97 Aligned_cols=294 Identities=12% Similarity=0.033 Sum_probs=104.1 Q ss_pred hhhcccccCc-ccccchh--HHHHHHhhhhhhheeccccccchhhhCccccCCCcc-eeeEEEEeeecccceeecccccC Q lcl|NC_017674. 61 SAMDSNFTAP-VTTPSIP--TPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWE-DQEIVQGIVEPAGTAVEYGDHTN 136 (382) Q Consensus 61 ~amDa~~~~~-~t~~~~~--~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~-~~t~t~~v~e~~G~a~~ygd~~D 136 (382) |+|==..+++ .++++.. +| +-|...+.+.+.....+..++. +.++... -+++.++... ...+.-|.-... T Consensus 1 ~~~~~~~~~~~~~t~~v~~fip----ei~s~~i~~~l~~~~v~~~~~~-d~~~~~~~Gdtv~ip~~g-~~~~~d~~~~~~ 74 (341) T protein:vir:94 1 MALGNTITGPSINTQRGQQFIP----EQWLSEVQMFRKAKMLDTSVVK-TWGAQVKKGDTFHVPRIS-ELGVEDKATDVP 74 (341) T ss_pred CcchhhhccccccchhHHHHHH----HHHHHHHHHHHHhhcchhhccc-cccccccCCceEEEeccC-cceeeeecCCCc Confidence 2222122221 1222221 22 3333344444444455555553 3222221 2678787653 333444543445 Q ss_pred CceeeeeeeeeEeeE-EEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCC Q lcl|NC_017674. 137 IPLTSWNANFERRTI-VRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPN 215 (382) Q Consensus 137 iP~vd~~~~~~~~~v-~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~ 215 (382) ++.-+.+..+...++ ..-..++.++..|.. +...++-++-...+..++.+..++..+--.+........+-+..++ T Consensus 75 i~~~~~~~~~~~itiD~~~~~~~~i~d~d~~---~~~~d~~~~~~~~~~~aLA~~~D~~i~~~~a~~~~~~~~~~~~~~~ 151 (341) T protein:vir:94 75 VGVQPVNDTDFVITVDTDRTTAVALDDLLEI---QASYDLRAPYLEAMGYALAKDMTGSILGLRAAVQNTASQNVFSSSN 151 (341) T ss_pred cccccccCceEEEEEeeeeecceeechHHHH---hhccchHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCccccCcc Confidence 555555555555555 223455666655542 3355666666666666666666554321111100010011111111 Q ss_pred CcceeccCCCCccccCHHH-HHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhccccCC------CCccHHHHHH Q lcl|NC_017674. 216 LPAFQTPPSQGWSTADWAG-IIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSVTTP------YGISVSDWIE 288 (382) Q Consensus 216 l~~~~~~a~~~Wa~kT~~e-I~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~t~~------~~~Tvl~~l~ 288 (382) .. . +.+++. .++.|.++...+-.. ++ |. ....++++|..+..|.+-+. .+.. -++ T Consensus 152 ~~------~----t~~~~~~~~~~i~~a~~~Lde~--~V--P~-~gR~lvv~P~~~~~Ll~~~~~~~~~~~g~~---~l~ 213 (341) T protein:vir:94 152 GA------I----TGNGQAFSFAVFLAARRLLLEA--DV--PE-EKIVLLISPGQESALFTIPQFISKDFINNA---PIA 213 (341) T ss_pred cc------c----cCchhhhhHHHHHHHHHHHhhc--CC--Cc-cCCEEEeCHHHHHHHhhchhhhhhhccccc---hhh Confidence 11 0 111222 234444444443332 11 32 23478999999999964321 1111 122 Q ss_pred H----hcCccEEEEccccccccCCCC--CceeEEEEcch-hhhhh-------hccccccchhh-------hhhh-hhhhh Q lcl|NC_017674. 289 Q----TYPKMRIVSAPELSGVQMKAQ--EPEDALVLFVE-DVNAA-------VDGSTDGGSVF-------SQLV-QSKFI 346 (382) Q Consensus 289 ~----n~pnl~i~~~peL~~a~g~g~--~~~~~~~~~~~-~v~~~-------~~~~~~~~~~~-------~~~~-p~~~~ 346 (382) + +.-+++|...+.+-...+.+. +...++..... -+.+. .+........| ...+ |+-++ T Consensus 214 ~G~ig~i~G~~V~~Sn~lp~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~gl~~~~~av~~~k~~~~~~~~ 293 (341) T protein:vir:94 214 QGQIGSLMGVRVIRTSLIGNNSATGWRNGAPTIAPAEATPGFTGSRYLPKQDSFTSLPATFTGNSRPVHTAVMCHMDWAA 293 (341) T ss_pred eeeeeeEeceEEEEeccccccccccccccccceecccccccccccccccccccccccEEEEEEecccccceeeecchhhh Confidence 1 112334444333321111000 00000000000 00000 00000000000 0000 11111 Q ss_pred cccce-ecC-CceEecccc------c-eeeeEeeccchheeecCC Q lcl|NC_017674. 347 TLGVE-KRA-KSYVEDFSN------G-TAGALCKRPWAVVRYLGI 382 (382) Q Consensus 347 ~l~~~-~~~-~~~~~~~~~------~-t~Gv~i~~P~aia~~~GI 382 (382) ...+| .+. ..|. +.++ + ..|+-+.||.+++.+.=- T Consensus 294 ~~~~~~~~~~~~~~-~~~~~~~i~~~~~~G~~~lrp~~~v~~~~~ 337 (341) T protein:vir:94 294 AVVSKAPRVTQSFE-NREQVWLMVGRQAYGARLYRPLHAVNIHTT 337 (341) T ss_pred ccccccccccccch-hhhhhhhhhhhhhhcccccCcceeEEEecC Confidence 11111 000 1111 1111 1 236666666665433322 No 119 >protein:vir:100172 Length: 394 # NCBI annotation: putative major head protein # Family: family:all:21 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025031;genbank:gi:48697264;genbank:GeneID:2948270 Probab=42.26 E-value=0.89 Score=20.86 Aligned_cols=303 Identities=11% Similarity=-0.010 Sum_probs=128.4 Q ss_pred CCCc----------ceeeeecCccccccccc-cccchHHHHHHhhcceeccccchhhhhhhhcccccchhhhhhcccccC Q lcl|NC_017674. 1 MSQI----------SKTHSRLAGRNAKPFDL-KNITNDAVASLSRIGLVFDHAVVQDQIKALAKAGAFRSGSAMDSNFTA 69 (382) Q Consensus 1 ~~~~----------~~~~~~~~~~~~~~~~~-~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~amDa~~~~ 69 (382) ..++ ...+.....+.-.--+. ........+.+++ .+ +. .....+.+. . T Consensus 54 ~~~i~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------------~l-~~----~~~~~~~~~-~ 112 (394) T protein:vir:10 54 NDQIKDLEAENKANSDPDKPVDNAQPNGTDLKKKPIDAKKKAIND---------------FI-HS----HGKVIDNAA-G 112 (394) T ss_pred HHHHHHHHHHHHhhcchhhhhhhhcccccchhhhHHHHHHHHHHH---------------HH-hc----cchhhhhhh-c Confidence 0000 00000000000000000 0000000011111 00 00 001112221 1 Q ss_pred cccccchh--HHHHHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEEeeec-ccceeecccccCCce-eeeeee Q lcl|NC_017674. 70 PVTTPSIP--TPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEP-AGTAVEYGDHTNIPL-TSWNAN 145 (382) Q Consensus 70 ~~t~~~~~--~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e~-~G~a~~ygd~~DiP~-vd~~~~ 145 (382) ..++++.| +|..+ ..+|++.+......+.++.+..... .+..|.+... .+.+.+.+...++|- .+...+ T Consensus 113 ~~t~~~gg~~vP~~~----~~~ii~~~~~~~~l~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~ 185 (394) T protein:vir:10 113 HVTSTEAGVLIPEEI----IYDPTAEVNSVVDLSTLVTKTPVTT---PKGTYPILKRATDRFSSVAELAENPALAEPEFE 185 (394) T ss_pred ccccccCceeccHHH----HHHHHHHHHhhhhhhhhceeeeccC---CceEEEEEecCCCccccccccccccccccccce Confidence 12333333 45443 4467777777666777665443322 3455666554 466677888888885 556888 Q ss_pred eeEeeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCcceeccCCC Q lcl|NC_017674. 146 FERRTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAFQTPPSQ 225 (382) Q Consensus 146 ~~~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~~~~a~~ 225 (382) +....++.++.-+.+|.+=|+.+ ..++.+--.....+++...+|+-++.|... +..+ .. . T Consensus 186 ~v~l~~~k~~~~~~iS~ell~ds---~~~l~~~i~~~la~~~~~~~~~~il~g~g~----~~~~---------~~--~-- 245 (394) T protein:vir:10 186 QVDWSVSTYRGAIPLSEEAIADS---AVDLTSLVGQSINEKSVNTYNAMIAPVLQS----FTAK---------AT--T-- 245 (394) T ss_pred eEEeeeeeeEeeehhHHHHHhhh---hHHHHHHHHHHHHHHHHHHHHHHHhhcccc----cccc---------cc--c-- Confidence 88888999998888988666543 356777777778888888888888877532 1100 00 0 Q ss_pred CccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhccc-cCCCCccHHHHHHHh-----cC----ccE Q lcl|NC_017674. 226 GWSTADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSV-TTPYGISVSDWIEQT-----YP----KMR 295 (382) Q Consensus 226 ~Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~-t~~~~~Tvl~~l~~n-----~p----nl~ 295 (382) ...+. +||.+++......- . + ..++|.++.+..|.. .+..|.-++.--..+ .| ++. T Consensus 246 --~~~~~----d~l~~~~~~~~~~~---~--~---a~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~L~G~P 311 (394) T protein:vir:10 246 --TDTLV----DSLKHILNVDLDPA---Y--S---RALVVTQSLFNTLDTLKDKNGRYLLHDASDSITDGTAKGTVLGVP 311 (394) T ss_pred --ccccH----HHHHHHHHhhhhhh---c--c---CEEEecHHHHHHHHHhhccCCCeeeeccccccccCCcccccccce Confidence 11233 34444443222111 1 1 258899988888864 233343332211000 11 122 Q ss_pred EEEccccccccCCCCCceeEEEE-cchhhhhhhccccccchhhhhhhhhhhhcccceecCCceEeccccceeeeEeeccc Q lcl|NC_017674. 296 IVSAPELSGVQMKAQEPEDALVL-FVEDVNAAVDGSTDGGSVFSQLVQSKFITLGVEKRAKSYVEDFSNGTAGALCKRPW 374 (382) Q Consensus 296 i~~~peL~~a~g~g~~~~~~~~~-~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~~~~~~~~~~~t~Gv~i~~P~ 374 (382) ++.... ... +.+.+...+++- +.+-+.. .+. +.+...+.... ..... +-...|.+ +.+++|. T Consensus 312 V~~~~~-~~~-~~~~~~~~i~~gd~s~~~~~-~~~---------~~~~v~~~~~~--~~~~~--~~~~~r~d-~~~~~~~ 374 (394) T protein:vir:10 312 VYVVGD-ALL-GSAAGDQKAFVGDLKRGVLF-ADR---------QQVTLAWEDSK--IYGRY--LGAAFRFG-VKQADSN 374 (394) T ss_pred eEEecc-ccc-CCCCCceEEEEeeccccEEE-Eee---------cceEEEEeccc--cccee--EEEEEEec-cEEeccc Confidence 222211 111 111111222221 1111110 000 00000000000 00111 22334554 4566699 Q ss_pred hheeecCC Q lcl|NC_017674. 375 AVVRYLGI 382 (382) Q Consensus 375 aia~~~GI 382 (382) +|+.+..= T Consensus 375 ai~~~~~~ 382 (394) T protein:vir:10 375 AGYFVTNT 382 (394) T ss_pred cEEEEEee Confidence 99886644 No 120 >protein:vir:1025 Length: 408 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076679;genbank:gi:13095788;genbank:GeneID:920362 Probab=41.27 E-value=0.93 Score=20.75 Aligned_cols=315 Identities=9% Similarity=-0.046 Sum_probs=129.2 Q ss_pred CCCcceeeeecCcc--------cccccc--ccccchHHHHHHhhcceeccccchhhhhhhhcccccchhhhhhcccccCc Q lcl|NC_017674. 1 MSQISKTHSRLAGR--------NAKPFD--LKNITNDAVASLSRIGLVFDHAVVQDQIKALAKAGAFRSGSAMDSNFTAP 70 (382) Q Consensus 1 ~~~~~~~~~~~~~~--------~~~~~~--~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~amDa~~~~~ 70 (382) +.++.........+ .-.+.. .........+.+.++.-...+. .. .....++ .. T Consensus 56 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~-------~~~~~a~-----~~ 118 (408) T protein:vir:10 56 RDALREQLVEAQAEQVVNMREEEKGPLNKSENELKDKFVKDFVNMVRNPMAF-----MN-------TVSSKTE-----TS 118 (408) T ss_pred HHHHHHHHHHHHHHHHhccccccccccccchhhhHHHHHHHHHHHhhcchhh-----hh-------hhhhhhh-----hc Confidence 00000000000000 000110 0111111111111111000000 00 0000111 11 Q ss_pred ccccchh--HHHHHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEE-eeecccceeecccccCCceee-eeeee Q lcl|NC_017674. 71 VTTPSIP--TPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQG-IVEPAGTAVEYGDHTNIPLTS-WNANF 146 (382) Q Consensus 71 ~t~~~~~--~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~-v~e~~G~a~~ygd~~DiP~vd-~~~~~ 146 (382) .+.++.| +|..+. +.|++.+......+.+..+.....-. ..+.+. ..+..+.+.+.|....+|-.+ ...+. T Consensus 119 ~t~~~gg~~vP~~~~----~~Ii~~~~~~~~l~~~~~~~~~~~~~-~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~ 193 (408) T protein:vir:10 119 GSDSAAGLTIPQDIR----TMINTLVRQYDSLQQYVRVESVSTSN-GSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTI 193 (408) T ss_pred ccccCCceeccHhHH----HHHHHHHHhhchhhhhcceeeccCCc-ceEEEeeccccccceeeecCccccccccCcceee Confidence 2222333 454443 46666666666666665443322211 112222 224446677888888888655 57888 Q ss_pred eEeeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCcceeccCCCC Q lcl|NC_017674. 147 ERRTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAFQTPPSQG 226 (382) Q Consensus 147 ~~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~~~~a~~~ 226 (382) ..-+.+.++....+|.+=++ ....++.+--.....+++...+|+-++.|+..+. +..+ T Consensus 194 i~~~~~k~~~~~~iS~ell~---ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~~~-------------~~~~------ 251 (408) T protein:vir:10 194 IKYLIKRYAGIITATNTSLK---DTAENILAWLSSWIAKKVVVTRNQAIIEVMKAAP-------------KKPT------ 251 (408) T ss_pred EEeeeeeEEeeehhHHHHHh---hchHHHHHHHHHHHHHHHHHHHHHHHhhcccccc-------------cccc------ Confidence 88899999998888875443 3456778888888888888999988888873210 0000 Q ss_pred ccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhcccc-CCCCccHHHH-HHHhcCccEEEEcc---- Q lcl|NC_017674. 227 WSTADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSVT-TPYGISVSDW-IEQTYPKMRIVSAP---- 300 (382) Q Consensus 227 Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~t-~~~~~Tvl~~-l~~n~pnl~i~~~p---- 300 (382) ..+.+++++.++..+.. .+..+ -.+++.+..+..|..- +..|.-+++- +....|+ ++...| T Consensus 252 --~~~~~~l~~~~~~~~~~-------~~~~~---a~~v~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~-~l~G~PV~~~ 318 (408) T protein:vir:10 252 --IAKFDDVITMINTAVDP-------AIIAT---SSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSY-LIKGKQVIVV 318 (408) T ss_pred --cccHHHHHHHHHHhhhh-------hhccC---CEEEEcHHHHHHHHHhhccCCceEeccCcCCCCCc-eecceeeEEe Confidence 12344444433322211 11222 2588999999888643 4445444321 1111111 121111 Q ss_pred ccccccCCCCCceeEEE-EcchhhhhhhccccccchhhhhhhhhhhhcccceecCCceEeccccceeeeEeeccchheee Q lcl|NC_017674. 301 ELSGVQMKAQEPEDALV-LFVEDVNAAVDGSTDGGSVFSQLVQSKFITLGVEKRAKSYVEDFSNGTAGALCKRPWAVVRY 379 (382) Q Consensus 301 eL~~a~g~g~~~~~~~~-~~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~~~~~~~~~~~t~Gv~i~~P~aia~~ 379 (382) +-......+.+...+++ .+.+-+.. + +.....+ +.-+..+ .....-.....+..|.+| .++.|.+|+.+ T Consensus 319 ~~~~~~~~~~~~~~i~~gd~~~~~~~---~-~~~~~~v-~~~~~~~----~~f~~~~~~~r~~~r~d~-~v~~~~a~~~~ 388 (408) T protein:vir:10 319 ADRWLPNTGSTVYPLYYGDMSQAITL---F-DRENMSL-LPTNIGA----GAFETDTTKIRVIDRFDV-KATDSEALVAG 388 (408) T ss_pred cccccCccCCCceEEEEEehhccEEE---E-EecceEE-EEccccc----chhhcCceEEEEEEeecc-EEeccccEEEE Confidence 00111111111111111 11110100 0 0000000 0000000 000011223345555544 66679999988 Q ss_pred cCC Q lcl|NC_017674. 380 LGI 382 (382) Q Consensus 380 ~GI 382 (382) +.- T Consensus 389 ~~~ 391 (408) T protein:vir:10 389 SFS 391 (408) T ss_pred Eee Confidence 755 No 121 >protein:vir:9704 Length: 394 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795466;genbank:gi:28876225;genbank:GeneID:1257769 Probab=40.15 E-value=0.98 Score=20.62 Aligned_cols=314 Identities=8% Similarity=-0.041 Sum_probs=122.5 Q ss_pred CCCcce----eeeecCccc-----cccccccccc-hHHH-HHHhhcceeccccchhhhhhhhcccccchhhhhhcccccC Q lcl|NC_017674. 1 MSQISK----THSRLAGRN-----AKPFDLKNIT-NDAV-ASLSRIGLVFDHAVVQDQIKALAKAGAFRSGSAMDSNFTA 69 (382) Q Consensus 1 ~~~~~~----~~~~~~~~~-----~~~~~~~~~~-~~~~-~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~amDa~~~~ 69 (382) +.++-. ....+.++. .++..-..-. ...+ ..++..+.........+......... ......... .. T Consensus 53 i~~l~~~~~~~e~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~-~~ 129 (394) T protein:vir:97 53 LVEAENDLKLYESSVEVGGAENIGGKEVTQEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPI--NETTPVEPQ-KD 129 (394) T ss_pred HHHHHHHHHHHHHHhhhhccccccccccchhhHHHHHHHHHHHHHHHHHhhhhhhhhhHHHHHHHH--Hhhhhhhhh-cc Confidence 000000 000000000 1111000000 0001 11111111111110011001110000 000111111 12 Q ss_pred cccccchh--HHHHHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEEeeecc-cceeecccccCCce-eeeeee Q lcl|NC_017674. 70 PVTTPSIP--TPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEPA-GTAVEYGDHTNIPL-TSWNAN 145 (382) Q Consensus 70 ~~t~~~~~--~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e~~-G~a~~ygd~~DiP~-vd~~~~ 145 (382) ..+.++.| +|..+. +.|++.+......+.+..+.+... .+..|++.... +.+.+.+.....|- .+...+ T Consensus 130 ~~t~~~gg~liP~~~~----~~ii~~~~~~~~l~~~~~~~~~~~---~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~ 202 (394) T protein:vir:97 130 GIKKENAKPVSSEEIL----YTPAREVKTVVDLKPFTTVYQAKK---ASGKYPVLQRATTKMVTVAELEKNPALAKPDFK 202 (394) T ss_pred ccccccccccChHHHH----HHHHHHhhhhhhhhhhceeeeccC---cceEEEEEecCCCccceecccccccccccccce Confidence 33444444 554443 466776665555555555433221 23455665533 45567788778875 446777 Q ss_pred eeEeeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCcceeccCCC Q lcl|NC_017674. 146 FERRTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAFQTPPSQ 225 (382) Q Consensus 146 ~~~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~~~~a~~ 225 (382) ...-..+.++..+.+|.+=++ ....++.+--....++++...+|.-.+.|...+ ++. T Consensus 203 ~v~l~~~k~~~~i~is~ell~---ds~~~~~~~i~~~la~~~~~~~~~~i~~g~~~~------------------~~~-- 259 (394) T protein:vir:97 203 DVAWNIDTYRGAIPLSQESID---DADVDLVGIVSESISQIKVNTTNDAIAKVLKSF------------------TTK-- 259 (394) T ss_pred eEEeehhheeeehhhHHHHHh---hhhHHHHHHHHHHHHHHHHHHHHHHHhhccccc------------------ccc-- Confidence 888888888888888875443 234466666666667777777776666664210 000 Q ss_pred CccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhccc-cCCCCccHHH-HHHHhcC----ccEEEEc Q lcl|NC_017674. 226 GWSTADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSV-TTPYGISVSD-WIEQTYP----KMRIVSA 299 (382) Q Consensus 226 ~Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~-t~~~~~Tvl~-~l~~n~p----nl~i~~~ 299 (382) ...+. +||.++++....... + -.++|.|..+..|.. .+..|.-++. -+....+ +..++.. T Consensus 260 --~~~~~----~~~~~~~~~~~~~~~-----~---a~~v~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~l~G~pv~~~ 325 (394) T protein:vir:97 260 --TVKNL----DEIKALLNGGFDPAY-----N---VSLIVSQSFYQTLDTLKDGNGRYLLQDDITAVSGKVLLGKPVFVL 325 (394) T ss_pred --ccccH----HHHHHHHHhhhhhhh-----C---CEEEEcHHHHHHHHHhhccCCCeeeecCcCCCCCceeccceeEEe Confidence 01233 444444443222111 1 257899988888853 2333433321 0111001 1112222 Q ss_pred cccccccCCCCCceeEEEEc-chhhhhhhccccccchhhhhhhhhhhhcccceecCCceEeccccceeeeEeeccchhee Q lcl|NC_017674. 300 PELSGVQMKAQEPEDALVLF-VEDVNAAVDGSTDGGSVFSQLVQSKFITLGVEKRAKSYVEDFSNGTAGALCKRPWAVVR 378 (382) Q Consensus 300 peL~~a~g~g~~~~~~~~~~-~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~~~~~~~~~~~t~Gv~i~~P~aia~ 378 (382) +. .+.+ .+ .+++-. ..-+.. .+. +.+..++ ..-..... ...+..|. |+.+.+|.+|+. T Consensus 326 ~~--~~~~--~~--~~~~gd~~~~~~~-~~~---------~~~~~~~--~~~~~~~~--~~~~~~r~-d~~v~~~~a~~~ 384 (394) T protein:vir:97 326 SD--EVLG--AN--KAFIGDFKRGVLF-ADR---------KDLGLRW--ADNEIYGQ--YLQAVLRF-GVSKVDDKAGYY 384 (394) T ss_pred cc--cccC--Cc--cEEEeeccccEEE-EEe---------cceEEEE--ecccccce--eEEEEEEE-ccEEecccceEE Confidence 21 1111 11 111111 000000 000 0000000 00000111 12344555 445668999998 Q ss_pred ecCC Q lcl|NC_017674. 379 YLGI 382 (382) Q Consensus 379 ~~GI 382 (382) ++.= T Consensus 385 ~~~~ 388 (394) T protein:vir:97 385 VTFT 388 (394) T ss_pred EEec Confidence 7776 No 122 >protein:vir:99675 Length: 324 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249589;genbank:gi:68299740;genbank:GeneID:3799990 Probab=30.55 E-value=1.6 Score=19.51 Aligned_cols=258 Identities=10% Similarity=-0.011 Sum_probs=96.3 Q ss_pred hCccccCCCcceeeEEEEeeecccceeeccc--cc-------CCceeeeeeeeeEeeEEEEEEEEEecHHHHHHHHHhCC Q lcl|NC_017674. 103 IIGIDTVGSWEDQEIVQGIVEPAGTAVEYGD--HT-------NIPLTSWNANFERRTIVRGELGMMVGTLEEGRASAIRL 173 (382) Q Consensus 103 l~~v~t~g~~~~~t~t~~v~e~~G~a~~ygd--~~-------DiP~vd~~~~~~~~~v~~~~~g~~y~~~El~~A~~~g~ 173 (382) ++=--+.| .++.|+ ..|..++..= ++ +++-......-.....++ .+ +.++-.++ +.. T Consensus 1 ~vr~i~~g----~s~~~~---~iG~~~~~~~~~G~~l~~~~~~~~~~e~~itID~~l~~~---~~---VdDiD~~q-a~~ 66 (324) T protein:vir:99 1 MTRTITSG----KSAQFP---VMGRTKARYLKQGQSLDDGREDIKHTEKVITIDGLLTTD---VL---IYDIEDAM-NHY 66 (324) T ss_pred CeeeeecC----ceEEEe---eeeeeEeccccCCCCcCCCcCCcCcccEEEEecchhhhh---hh---hhhHHHHh-cCc Confidence 11111222 344443 3466654321 22 233333222111212222 12 23333333 446 Q ss_pred ChHHHHHHHHHHHHHHhhccEEE----EeeccCCcccceEEEeCCCCcceeccCCCCccccCHHHHHHHHHHHHHHHHHh Q lcl|NC_017674. 174 NSAETKRQQAAIGLEIFRNAIGF----YGWQSGLGNRTYGFLNDPNLPAFQTPPSQGWSTADWAGIIGDIREAVRQLRIQ 249 (382) Q Consensus 174 ~l~~~K~~aAr~a~~~~~n~i~~----~Gd~~g~~~g~~GllN~P~l~~~~~~a~~~Wa~kT~~eI~~Di~~~~~~l~~~ 249 (382) ++-++-.+.+-.++.+..|+.++ .+-.......-.+.....+........+..-+..+++.+++-|.++-..|.+. T Consensus 67 Dlr~e~s~~~G~aLA~~~Dq~i~~~~a~~~~~~a~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~dai~~a~~~Lde~ 146 (324) T protein:vir:99 67 DVRSEYSTQMGEALAMAADVANYAEMAKLVNSRKETTNENIEGLGAASLVKITGKKEDPAKYGTQVIQALTYARAAFAKK 146 (324) T ss_pred cchhHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccCCcccCCccceecccccccccccCHHHHHHHHHHHHHHHhhc Confidence 66666666666677776665432 01000000000011111111111111222233567889999888887777665 Q ss_pred cCCeeeeccccceEecCHHHHhhcccc---C--CCCccHHHHHHH---hcCccEEEEccccccccCCCCC-cee---EEE Q lcl|NC_017674. 250 SQDQIDPKAEKITLALATSKVDYLSVT---T--PYGISVSDWIEQ---TYPKMRIVSAPELSGVQMKAQE-PED---ALV 317 (382) Q Consensus 250 t~g~~~~~~~p~~L~Lp~~~~~~Ls~t---~--~~~~Tvl~~l~~---n~pnl~i~~~peL~~a~g~g~~-~~~---~~~ 317 (382) .= |. ....+++||.+|..|... + .++ +.-.+-+. +.-+++|+..+.|-...+.... +.. ... T Consensus 147 ~V----P~-~gR~~vv~P~~y~~Ll~~~~~~~~~~~-~~~~~~~G~V~~i~Gf~V~~Sn~lp~~~~t~~~~a~~~~~~~~ 220 (324) T protein:vir:99 147 YI----PA-GDRTFYTDPDTYSAILAALMPNAANYA-ALIDPETGNIRNVMGFEVVETPHMTAQMVTNPTDAFDGTGHIF 220 (324) T ss_pred CC----CC-CCCEEEeChHHHHHHhhcccccccccc-cccceecceEEEEeceEEEecCCcccccccccccccccccccc Confidence 52 33 235789999999988532 1 111 00111110 0124556655555322111000 000 000 Q ss_pred Ecchhhhh----hhccccccchhhhhh-------hhhhhhcccceecCCceEeccccceeeeEeeccchheeec------ Q lcl|NC_017674. 318 LFVEDVNA----AVDGSTDGGSVFSQL-------VQSKFITLGVEKRAKSYVEDFSNGTAGALCKRPWAVVRYL------ 380 (382) Q Consensus 318 ~~~~~v~~----~~~~~~~~~~~~~~~-------~p~~~~~l~~~~~~~~~~~~~~~~t~Gv~i~~P~aia~~~------ 380 (382) -...+... ..+-+......|... ++.......-+. .-.|-+..... .|+.+.||.|++... T Consensus 221 ~~~~~~~~~~ky~~d~~~~~gl~~~~~a~~tv~~~~~~~e~~~~~~-~~~d~i~~~~a-~G~~~lRPe~a~~v~l~~~~~ 298 (324) T protein:vir:99 221 PATGDSTTTGKMTVGADNVVGLFVHRSAVATLKLKDMALERARRPE-YQADQIIAKYA-MGHGGLRPEAVGAIIFEDGET 298 (324) T ss_pred ccccccccccccccccCceeEEEEehhheEEEeeecceecceechh-hHHHhhhhhhh-hcCcccccceEEEEEEccCcc Confidence 00000000 000001111111111 111111111111 11222222222 388899998886554 Q ss_pred -CC Q lcl|NC_017674. 381 -GI 382 (382) Q Consensus 381 -GI 382 (382) |+ T Consensus 299 ~~~ 301 (324) T protein:vir:99 299 PAV 301 (324) T ss_pred ccc Confidence 34 No 123 >protein:vir:102655 Length: 322 # NCBI annotation: Hypothetical protein # Family: family:all:6384 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_052979;genbank:gi:50282923;genbank:GeneID:2948122 Probab=29.58 E-value=1.6 Score=19.39 Aligned_cols=289 Identities=8% Similarity=-0.035 Sum_probs=108.0 Q ss_pred hhhcccccCcccccchhHHHHHHhhhhhhheeccccccchhhhCcccc---CCCcceeeEEEEe--eecccce---eecc Q lcl|NC_017674. 61 SAMDSNFTAPVTTPSIPTPIQFLQTWLPGFVKVMTAARKIDEIIGIDT---VGSWEDQEIVQGI--VEPAGTA---VEYG 132 (382) Q Consensus 61 ~amDa~~~~~~t~~~~~~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t---~g~~~~~t~t~~v--~e~~G~a---~~yg 132 (382) |++.+-..+..-. +..|+..|.+...-++ +..+.... ..|-|-++ .+...+.-..|.+ +..+|+. ...+ T Consensus 1 ~~~~~~~~~~~~M-s~~i~~~fv~qy~~~v-~~~~qq~~-s~L~~tV~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (322) T protein:vir:10 1 MKLNAIMSMLPLI-AGDIDQAFVQTYETTL-RILSQQKS-AKLKQYCQHKNESSESHNWETLASMDPDAVKRKRSRQQSA 77 (322) T ss_pred Ccccceeeeeeee-echhhhHHHHHHHHHH-HHHHHHhh-hhhhcccccccccccccceeeccccccccccccccccccc Confidence 3333322221111 1124444444333222 22222222 12222111 1111111111122 1123333 2345 Q ss_pred ccc-CCceeeeeeeeeEeeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEE---eeccCCcccce Q lcl|NC_017674. 133 DHT-NIPLTSWNANFERRTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFY---GWQSGLGNRTY 208 (382) Q Consensus 133 d~~-DiP~vd~~~~~~~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~---Gd~~g~~~g~~ 208 (382) |.+ |.|..............-+..+..+...+. .++..+..+.-.+++..|++++.+++.+- |.+. . T Consensus 78 d~~~dtp~~~~~~~~r~~~~~d~~~~~~VDd~D~---~k~~~D~~~~~~~~~a~AL~R~~D~~I~~a~~g~a~---~--- 148 (322) T protein:vir:10 78 DGTYPTPVNNKPFAKRRTNVDTYDTGHVVEQEDI---SQMLLDPNSALITSQAYAMARKTDDLIIAGAWKPAS---I--- 148 (322) T ss_pred CcccCCCccccccceEEEeecccccceecchHHH---HHhhcCchHHHHHHHHHHhhhHHHHHHHhhhhcccc---c--- Confidence 544 668777666665666666777776666665 34556666777777777888887776554 3221 1 Q ss_pred EEEeCCCCcceeccCCCCccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhccccC---CCCccHHH Q lcl|NC_017674. 209 GFLNDPNLPAFQTPPSQGWSTADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSVTT---PYGISVSD 285 (382) Q Consensus 209 GllN~P~l~~~~~~a~~~Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~t~---~~~~Tvl~ 285 (382) | .++.++. .+++..=...+..--++.|.++...+.... + |+..+-.++++|+++..|-.-. +.+..--+ T Consensus 149 ~---~~gt~v~-~~ss~~i~~g~~g~t~~kl~~a~~~l~~~d---v-p~d~~R~~vv~p~~~~~LL~d~~~ts~D~~~~~ 220 (322) T protein:vir:10 149 K---GTGQPVE-FLATQEIGDGTKPISFDYVTEITERFLENE---I-EPEVSKVIVIGPTQARKLLQITEATSADYTSAM 220 (322) T ss_pred c---ccccccc-cCCCcccccCccchhHHHHHHHHHHHHhcC---C-CCCCCeEEEeCHHHHHHHhcchhhhhhhcccch Confidence 1 1111110 000100000111111333444444443333 1 2222336899999998885321 11111123 Q ss_pred HHHHh-----cCccEEEEccccc---c-------ccCCCCCceeEEEEcchhhhhhhccccccchhhhhhhhhhhhcccc Q lcl|NC_017674. 286 WIEQT-----YPKMRIVSAPELS---G-------VQMKAQEPEDALVLFVEDVNAAVDGSTDGGSVFSQLVQSKFITLGV 350 (382) Q Consensus 286 ~l~~n-----~pnl~i~~~peL~---~-------a~g~g~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~ 350 (382) .|+.+ +=.++|.....|- + +...+.+-+. .+.|.++-.....+.+ ..+...+.|.+.+ T Consensus 221 ~l~~~G~ig~~lGf~~i~s~~lp~~~~t~~~~~~~~~~~~~~~~-~~a~~k~Av~~a~~~d--v~~~i~~~~~~~~---- 293 (322) T protein:vir:10 221 DLQSKGIITNWMGYTWIVSTRLDKFDPTQWGMAAEDGPQGDEIW-CIAMTDMALGYHSCKD--IWTKVAEDPSASF---- 293 (322) T ss_pred hhhhcCeeeeeeeEEEEEeccCCccccccccccccCCCCcccee-EEEEecCceeEEEeee--eeEEeeccCCcch---- Confidence 33322 2222332222221 1 0111111112 2233332211110000 0000011122211 Q ss_pred eecCCceEeccccceeeeEeeccchheeecCC Q lcl|NC_017674. 351 EKRAKSYVEDFSNGTAGALCKRPWAVVRYLGI 382 (382) Q Consensus 351 ~~~~~~~~~~~~~~t~Gv~i~~P~aia~~~GI 382 (382) .|.+.... ..|+.+-.|..|+.++=- T Consensus 294 -----a~~I~~~~-~~Ga~ri~~~gVv~i~~~ 319 (322) T protein:vir:10 294 -----AWRIYSAF-TADCVRVEDEHIFKLRLK 319 (322) T ss_pred -----hhhhhhhh-hhCceEeccCcEEEEEEe Confidence 11122222 235555577777665544 No 124 >protein:vir:7409 Length: 408 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839926;genbank:gi:30089896;genbank:GeneID:1260683 Probab=29.04 E-value=1.7 Score=19.33 Aligned_cols=312 Identities=10% Similarity=-0.014 Sum_probs=129.7 Q ss_pred CCCcceee--------eecCcccccccccc--ccchHHHHHHhhcceeccccchhhhhhhhcccccchhhhhhcccccCc Q lcl|NC_017674. 1 MSQISKTH--------SRLAGRNAKPFDLK--NITNDAVASLSRIGLVFDHAVVQDQIKALAKAGAFRSGSAMDSNFTAP 70 (382) Q Consensus 1 ~~~~~~~~--------~~~~~~~~~~~~~~--~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~amDa~~~~~ 70 (382) +..+-... .-+.....++.... .......+.+.++.-...+. + ......++ .. T Consensus 56 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------~----~~~~~~a~----~~- 118 (408) T protein:vir:74 56 RDALREQLVEAQAEQVVNMREEEKGPLNKSENELKDKFVKDFVNMVRNPMAF--------L----NTVSSKTE----TS- 118 (408) T ss_pred HHHHHHHHHHHHHHHHhhccccccccccchhhhhHHHHHHHHHHHHhcchhh--------h----hhhhhhhh----cc- Confidence 00000000 00000001111111 11111111111111010000 0 00001111 11 Q ss_pred ccccchh--HHHHHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEEeeeccc-ceeecccccCCce-eeeeeee Q lcl|NC_017674. 71 VTTPSIP--TPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEPAG-TAVEYGDHTNIPL-TSWNANF 146 (382) Q Consensus 71 ~t~~~~~--~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e~~G-~a~~ygd~~DiP~-vd~~~~~ 146 (382) .+..+.| +|-.+. +.|++.+......+.++++..... ....+.+......+ .+.+.+...++|- .+...+. T Consensus 119 ~~~~~gg~~vP~~~~----~~Ii~~~~~~~~l~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~ 193 (408) T protein:vir:74 119 GSDSAAGLTIPQDIR----TMINTLVRQYDSLQQYVRVESVST-SSGSRVYEKWTDVTPLKAMDEEDGKIPDLDNPRLTI 193 (408) T ss_pred cccCCCceeechhHh----hHHHHHHhhhcchhhhcceeeccC-CcceEEEEeecCCcccccccccccccccccccceee Confidence 1112222 454443 466676666666666665443322 12333444444433 3346677778875 5578899 Q ss_pred eEeeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCcceeccCCCC Q lcl|NC_017674. 147 ERRTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAFQTPPSQG 226 (382) Q Consensus 147 ~~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~~~~a~~~ 226 (382) .....+.++..+.+|.+=+ .....++.+.-.....+++...+|+-.++|+..+ .| .. T Consensus 194 i~~~~~k~~~~~~iS~ell---~ds~~~l~~~i~~~l~~~~~~~~d~~il~G~G~~----------~~---~~------- 250 (408) T protein:vir:74 194 IKYLIKRYAGIITATNTLL---KDTAENILAWLSSWIAKKVVVTRNQAIIAAMGTV----------PK---KP------- 250 (408) T ss_pred EEeeeeeEEeeehhHHHHH---hhchHHHHHHHHHHHHHHHHHHHHHHHhhccccc----------cc---cc------- Confidence 9999999999999988544 3445678888888888889999999999996311 00 00 Q ss_pred ccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhcccc-CCCCccHHHH-HHHhcCccEEEEcc---- Q lcl|NC_017674. 227 WSTADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSVT-TPYGISVSDW-IEQTYPKMRIVSAP---- 300 (382) Q Consensus 227 Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~t-~~~~~Tvl~~-l~~n~pnl~i~~~p---- 300 (382) ...+.+.+++.++..+.. . +..+ ..++|.|..+..|..- +..|.-++.- +....| -+|...| T Consensus 251 -~~~~~~~i~~~~~~~l~~-----~--~~~~---a~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~~~~-~~l~G~pV~~~ 318 (408) T protein:vir:74 251 -TIANFDDVITMINTSVDP-----A--IIAT---SSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNS-YLIKGKQVIVV 318 (408) T ss_pred -ccccHHHHHHHHHHhhhh-----h--hcCC---CEEEEcHHHHHHHHHhhcCCCceEeccCcCCCCC-ceecceeeEEe Confidence 012344444433322211 1 1112 2578999988888542 3334333210 111111 1121111 Q ss_pred ccccccCCCCCceeEEE-Ecchhhhhhhccccccchhhhhhhhhhhhcccce---ecCCceEeccccceeeeEeeccchh Q lcl|NC_017674. 301 ELSGVQMKAQEPEDALV-LFVEDVNAAVDGSTDGGSVFSQLVQSKFITLGVE---KRAKSYVEDFSNGTAGALCKRPWAV 376 (382) Q Consensus 301 eL~~a~g~g~~~~~~~~-~~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~~---~~~~~~~~~~~~~t~Gv~i~~P~ai 376 (382) +-......+.+...+++ .+.+-+.. .+. +.+. +.+.+.. ...-....-+..|.+| .++.|.|| T Consensus 319 ~~~~~~~~~~~~~~i~~gd~~~~~~~-~~~---------~~~~--i~~~~~~~~~f~~~~~~~r~~~r~d~-~~~~~~a~ 385 (408) T protein:vir:74 319 ADRWLPNSGSTVYPLYYGDMSQAITL-FDR---------ENMS--LLPTNIGAGAFETDTTKIRVIDRFDV-KATDSEAL 385 (408) T ss_pred cCcccccccCCcceEEEEehhccEEE-EEe---------cceE--EEEeccccchhhcceeeEEEEEeeCc-EEecccce Confidence 00011111111111221 11111100 000 0000 0000000 0011122345556555 47779998 Q ss_pred eeecCC Q lcl|NC_017674. 377 VRYLGI 382 (382) Q Consensus 377 a~~~GI 382 (382) +..+.- T Consensus 386 ~~~~~~ 391 (408) T protein:vir:74 386 VAGSFT 391 (408) T ss_pred EEEEee Confidence 888754 No 125 >protein:vir:106647 Length: 303 # NCBI annotation: ORF011 # Family: family:all:1178 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239493;genbank:gi:66395226;genbank:GeneID:4555801 Probab=25.49 E-value=2.1 Score=18.87 Aligned_cols=272 Identities=13% Similarity=0.077 Sum_probs=119.3 Q ss_pred hhhcccccC---cccccchhHHHHHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEEeeecccceeecccccCC Q lcl|NC_017674. 61 SAMDSNFTA---PVTTPSIPTPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEPAGTAVEYGDHTNI 137 (382) Q Consensus 61 ~amDa~~~~---~~t~~~~~~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e~~G~a~~ygd~~Di 137 (382) |+-+..... +...-+.-|-.+|-..|+ ++.+ .+-.-+..|.. . +-.-+++.|++.+..|.+.-.+.+..| T Consensus 1 M~~e~nl~~~~dL~~a~siDF~~~f~~~i~-~L~~----~LGv~r~~pla-~-Gt~iktyK~~~~~y~gda~dVaEGe~I 73 (303) T protein:vir:10 1 MSAENNLINVEALGKAKSIDFANKLGVGLN-KLFE----ALAIQNKIPMN-V-GSALKQYRFKVEDSEKPNGDVAEGDVI 73 (303) T ss_pred CCCCcCCcchhhcccceeehhhhhhhhhHH-HHHH----Hhhhhcccccc-C-CceeeeeeeeceeeccccccccCCccc Confidence 222222111 111123335555555444 2222 22233344443 2 334567888889999999888888899 Q ss_pred ceeeeeeeee---EeeEEEEEEEEEecHHHHHHHHHhCCChH-HHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeC Q lcl|NC_017674. 138 PLTSWNANFE---RRTIVRGELGMMVGTLEEGRASAIRLNSA-ETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLND 213 (382) Q Consensus 138 P~vd~~~~~~---~~~v~~~~~g~~y~~~El~~A~~~g~~l~-~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~ 213 (382) |+..+..... ..++.-+.=+. +.+-+ |+.|...+ .+=.....+++++++++=.|-= |- T Consensus 74 plskvt~~~~~t~~~~~kK~rK~t--TdEAI---qlsGyg~aVgetd~qL~~~Iq~kIdnd~~~~------------lk- 135 (303) T protein:vir:10 74 PLTKVTREQVDITELQFAKYRKST--SAEAI---QAHGYDLAINQTDNEMIKYVQKKFRAKFFET------------LK- 135 (303) T ss_pred chhhheeeecceEEEEeecccccc--cHHHH---HhhcCCchhHHHHHHHHHHHHhhhhHHHHHH------------Hh- Confidence 9999887543 33344433333 44333 34454433 2222234445555554221110 00 Q ss_pred CCCcceeccCCCCccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhcccc------CCCCccHHHHH Q lcl|NC_017674. 214 PNLPAFQTPPSQGWSTADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSVT------TPYGISVSDWI 287 (382) Q Consensus 214 P~l~~~~~~a~~~Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~t------~~~~~Tvl~~l 287 (382) ++..+.-.+++.+--.+-|.+++...|..-....+.+..+..++=|-..+.||... .++|.+.++ T Consensus 136 -------taT~t~~~t~~t~~s~~glq~Al~~~~~kl~~~~ed~~~~V~FvNP~Daa~yl~~A~i~~~~t~fG~n~L~-- 206 (303) T protein:vir:10 136 -------SAIENGKRTNKTKLSAENLQGALSKGRANLSVLLDDEITPIAFVNPNDTAEYLANGFINSTGAQFGVNLLT-- 206 (303) T ss_pred -------hcccccccccceeecHHHHHHHHHhhhhhccccccccccEEEEEchHHHHHHhhcCCcchhhhhhhhhhhh-- Confidence 00000001122223366777788777766555444444444444455777888543 235666554 Q ss_pred HHhcCccEEEEccccccccCCCCCceeEEEEcchhhhhhhccccccchhhhhhhhhh---hhcccceecCCceEeccccc Q lcl|NC_017674. 288 EQTYPKMRIVSAPELSGVQMKAQEPEDALVLFVEDVNAAVDGSTDGGSVFSQLVQSK---FITLGVEKRAKSYVEDFSNG 364 (382) Q Consensus 288 ~~n~pnl~i~~~peL~~a~g~g~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~p~~---~~~l~~~~~~~~~~~~~~~~ 364 (382) ||-+..|+..+++..-..-.+-.-++.+.|++- .+. -...|..-.-+- -..|..+.+.+.++-- T Consensus 207 --nfLG~~II~S~kv~~G~~~~T~~~Ni~~ay~~~-~g~------l~~~f~~t~D~tglIGv~h~~~~~~~t~eT~---- 273 (303) T protein:vir:10 207 --PYVGVKIVEFADVPQGEVWMTVAENLNVAYANP-RGE------LSRAFAFATDATGFVGVLHDIQPQRLTSDTI---- 273 (303) T ss_pred --hhhcceEEEeccCCCceEEEeeccceEEEEecC-chh------hhhhhhhccccccceEEEeccccceeeehhH---- Confidence 777877776655532111111122233444332 111 112232211000 0012222223222211 Q ss_pred eeeeEeeccchheeecCC Q lcl|NC_017674. 365 TAGALCKRPWAVVRYLGI 382 (382) Q Consensus 365 t~Gv~i~~P~aia~~~GI 382 (382) .-+....+|. +.||| T Consensus 274 ~~~~~~lfpE---~~dgi 288 (303) T protein:vir:10 274 YASAISMFPE---NIDAV 288 (303) T ss_pred hHhHHHhccc---ccceE Confidence 1122333443 45666 No 126 >protein:vir:1268 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690760;genbank:gi:22855000;genbank:GeneID:955203 Probab=25.13 E-value=2.1 Score=18.83 Aligned_cols=317 Identities=9% Similarity=-0.053 Sum_probs=131.2 Q ss_pred CCCc----ceee----eecCccccccccccccchHHHHHHhhcceeccccchhhhhhhhcccccchhhhhhcccccCccc Q lcl|NC_017674. 1 MSQI----SKTH----SRLAGRNAKPFDLKNITNDAVASLSRIGLVFDHAVVQDQIKALAKAGAFRSGSAMDSNFTAPVT 72 (382) Q Consensus 1 ~~~~----~~~~----~~~~~~~~~~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~amDa~~~~~~t 72 (382) ..+. ...+ ....++.. +-..........+.+.++ +.+....+..+.+.. .....+|. ..+ T Consensus 60 ~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~a~~~~---~~~~~~~~~~~~~~~---~~~~~a~~-----~~~ 127 (397) T protein:vir:12 60 VPDLPGGVNFVPEQERNPEGQRSQ-GQGNEERQQQYSKAFLKG---LRGKRLTDEERDLLD---SPEFRAMS-----GIN 127 (397) T ss_pred HHHHHHHhhhhhhhhhhhcccccc-cchhhHHHHHHHHHHHHH---HhccCCcHHHHHHHh---hhhhhhcc-----ccc Confidence 0000 0000 00000000 000000000011111111 001110011011000 00111221 222 Q ss_pred ccchh--HHHHHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEEeeecccceeecccccCCcee-eeeeeeeEe Q lcl|NC_017674. 73 TPSIP--TPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEPAGTAVEYGDHTNIPLT-SWNANFERR 149 (382) Q Consensus 73 ~~~~~--~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e~~G~a~~ygd~~DiP~v-d~~~~~~~~ 149 (382) +++.| +|-.+ .+.|++.+........+.++.....- ...+.+......+.+.+.+.+..+|-. ....++... T Consensus 128 ~~~gg~lvP~~~----~~~ii~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~~v~~ 202 (397) T protein:vir:12 128 DEDGGILIPEDI----GRQIHEFKRQFEPLEQYVTVEPVTTR-SGTRLLEKNADMVPFSPVEELGNLPEIDQPRFTKVSY 202 (397) T ss_pred cccCcccCchhH----HHHHHHhhhhhhhHHhhcceeeccCC-ceeEEEEEecCCcceeeecccccccccccccceeEEe Confidence 33333 44433 45677776666666666554332211 123444455555667788888888854 467888888 Q ss_pred eEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCcceeccCCCCccc Q lcl|NC_017674. 150 TIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAFQTPPSQGWST 229 (382) Q Consensus 150 ~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~~~~a~~~Wa~ 229 (382) ..+.++..+.+|.+=+ .....++.+--.....+++.+.+|.-+++|+..+ ...|. T Consensus 203 ~~~k~~~~~~is~e~l---~ds~~~l~~~i~~~l~~~~~~~~d~~il~G~g~~---~~~g~------------------- 257 (397) T protein:vir:12 203 SIIDYGGIMTLSNSML---NDSDQAIMTYVAKWFAKKSVVTRNNLILAAIASL---KKVDI------------------- 257 (397) T ss_pred eheeeEeeehhhHHHH---hhchHHHHHHHHHHHHHHHHHHHHHHHHhccccc---ccccc------------------- Confidence 9999999998887544 3344678888888888888888999999997321 00111 Q ss_pred cCHHHHHHHHHHHHH-HHHHhcCCeeeeccccceEecCHHHHhhccc-cCCCCccHHH-HHHHhcC----ccEEEEcccc Q lcl|NC_017674. 230 ADWAGIIGDIREAVR-QLRIQSQDQIDPKAEKITLALATSKVDYLSV-TTPYGISVSD-WIEQTYP----KMRIVSAPEL 302 (382) Q Consensus 230 kT~~eI~~Di~~~~~-~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~-t~~~~~Tvl~-~l~~n~p----nl~i~~~peL 302 (382) .+ ++||.+++. .+.. .+.. ...+++.|..+..|.. .+..|.-++. -+....| +..+...+.. T Consensus 258 ~~----~~~i~~~~~~~l~~----~~~~---~a~~~~n~~~~~~L~~lkd~~G~~l~~~~~~~g~~~~l~G~pv~~~~~~ 326 (397) T protein:vir:12 258 DG----LDGIKKALNVTLDP----MVAP---GSIVLTNQDGYDWLDTLKDGTGRYLLQPDPTNPTKKLLDGRPVVPFTNR 326 (397) T ss_pred cc----HHHHHHHHhhccch----hhhC---CCEEEEcHHHHHHHHHhhccCCceeecccccCCCCccccceeeEEeccc Confidence 12 345554443 2211 1122 2368899988888854 3433432221 0111111 1112222111 Q ss_pred ccccCCCCCceeEEEEcchhhhhhhccccccchhhhhhhhhhhhcc-cceecCCceEeccccceeeeEeeccchheeecC Q lcl|NC_017674. 303 SGVQMKAQEPEDALVLFVEDVNAAVDGSTDGGSVFSQLVQSKFITL-GVEKRAKSYVEDFSNGTAGALCKRPWAVVRYLG 381 (382) Q Consensus 303 ~~a~g~g~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l-~~~~~~~~~~~~~~~~t~Gv~i~~P~aia~~~G 381 (382) . .+...++...++-.|.+-+... + .+.+...+-.. ......-....-+..|.+| .++.|.||+.++- T Consensus 327 ~-~~~~~~~~~~~~gd~~~~~~~~-~---------~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~-~~~~~~a~~~~~~ 394 (397) T protein:vir:12 327 V-LKTQKGKAPLIIGNLKEAIVLF-D---------REQQSIASTDTGAGAFETNSTKVRGIEREDV-RKWDEDAVVFGQI 394 (397) T ss_pred c-cccCCCccEEEEEehhceEEEE-e---------ecceEEEEeccccchhhcCceEEEEEEeecc-EEecccceEEEEE Confidence 0 0111111111111111101000 0 00000000000 0000111233455566655 5688888887765 Q ss_pred C Q lcl|NC_017674. 382 I 382 (382) Q Consensus 382 I 382 (382) = T Consensus 395 t 395 (397) T protein:vir:12 395 T 395 (397) T ss_pred e Confidence 5 No 127 >protein:vir:107593 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338188;genbank:gi:77020144;genbank:GeneID:3703724 Probab=23.72 E-value=2.3 Score=18.63 Aligned_cols=320 Identities=11% Similarity=0.012 Sum_probs=129.8 Q ss_pred CCCcc------eeeeec-----Ccc-ccccccccccchHHHHHHhhccee-ccccchhhhhhhhcccccchhhhhhcccc Q lcl|NC_017674. 1 MSQIS------KTHSRL-----AGR-NAKPFDLKNITNDAVASLSRIGLV-FDHAVVQDQIKALAKAGAFRSGSAMDSNF 67 (382) Q Consensus 1 ~~~~~------~~~~~~-----~~~-~~~~~~~~~~~~~~~~~l~~~g~~-~~~~~~~~~~~~~~~~~~~~~~~amDa~~ 67 (382) +.++. +.+..+ ..+ ..++..-... ....+.++.... +.+...+......... ..... . T Consensus 35 ~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~l~~~~~~~~~~~~~~~-----~~~~~--~ 105 (392) T protein:vir:10 35 MEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNV--DGEMEYRDVFMKALRNKPLNAEEREFLED-----DLEQR--A 105 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCc--cchHHHHHHHHHHHhcccccHHHHHHHhh-----hhhhh--h Confidence 00000 000000 000 0001100000 011111110000 0011111111111000 00000 0 Q ss_pred cCcccccchh--HHHHHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEEeeecccceeecccccCCceee-eee Q lcl|NC_017674. 68 TAPVTTPSIP--TPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEPAGTAVEYGDHTNIPLTS-WNA 144 (382) Q Consensus 68 ~~~~t~~~~~--~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e~~G~a~~ygd~~DiP~vd-~~~ 144 (382) ....++++.| +|..+. ++|++.+........+..+.....- ...+.+......+.+.+.+.+...|-.+ ... T Consensus 106 ~~~~t~~~gg~~vP~~~~----~~ii~~~~~~s~l~~~~~~~~~~~~-~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~ 180 (392) T protein:vir:10 106 MSGLTGEDGGLVIPQDIQ----TQINELARSFDALEQYVTVEPVRTR-SGSRVLEKNSDMIPFAEITEMGEIPETDNPKF 180 (392) T ss_pred ccccccCCCceecchhHH----HHHHHHHHhhhhhhhhceeeeccCC-ceeEEEEeecCCccceeecccccccccccccc Confidence 1112333343 444443 4666766666656666554332211 1123334444445666778877887654 578 Q ss_pred eeeEeeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCcceeccCC Q lcl|NC_017674. 145 NFERRTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAFQTPPS 224 (382) Q Consensus 145 ~~~~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~~~~a~ 224 (382) +...-..+.++..+.+|.+=|+. ...+|.+.-.....+++...+|.-++.|+..+. .+.. T Consensus 181 ~~v~l~~~k~~~~~~iS~ell~d---s~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~-----------------~~~~ 240 (392) T protein:vir:10 181 SNVQYAVKDRAGILPLSRSLLQD---SDQNILKYVTKWLGKKSKVTRNVLILGVIEKLT-----------------KQAI 240 (392) T ss_pred eeEEeeeeeEEEeehhhHHHHhh---hHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc-----------------ccCc Confidence 88888889999999999866543 346788888888888999999988888863210 0011 Q ss_pred CCccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhcccc-CCCCccHHHH-HHHhcC----ccEEEE Q lcl|NC_017674. 225 QGWSTADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSVT-TPYGISVSDW-IEQTYP----KMRIVS 298 (382) Q Consensus 225 ~~Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~t-~~~~~Tvl~~-l~~n~p----nl~i~~ 298 (382) .+. +||.++++..... .+.++ -.++|.|+.+..|.+. +..|.-++.- +....+ +..++. T Consensus 241 -----~~~----d~i~~~~~~~l~~---~~~~~---a~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~tllG~~~v~ 305 (392) T protein:vir:10 241 -----KSL----DDIKDVLNVKLDP---AISPN---AILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVV 305 (392) T ss_pred -----cCH----HHHHHHHHHhhhh---hhccC---CEEEEcHHHHHHHHHhhccCCCeEeecCccCCccccccCcccEE Confidence 122 3444443321111 11222 3689999988888542 3333322211 111111 111111 Q ss_pred -cc--ccccccCCCCCceeEEEEcchhhhhhhccccccchhhhhhhhhhhhcccc---eecCCceEeccccceeeeEeec Q lcl|NC_017674. 299 -AP--ELSGVQMKAQEPEDALVLFVEDVNAAVDGSTDGGSVFSQLVQSKFITLGV---EKRAKSYVEDFSNGTAGALCKR 372 (382) Q Consensus 299 -~p--eL~~a~g~g~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~---~~~~~~~~~~~~~~t~Gv~i~~ 372 (382) .. -+... +.+.+ ...+++.+ .... ..+.....+.+...+. ....-....-+..|.+| .+++ T Consensus 306 ~~~~~~~~~~-~~~~~--~~~~~~gd-fs~~--------~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~-~v~~ 372 (392) T protein:vir:10 306 VVSNRFLKSK-GTTAK--KAPLIIGD-LKEA--------IVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDV-QMWD 372 (392) T ss_pred EecccccCCC-cccCC--ceEEEEEe-hhce--------EEEEeecceEEEEeccccchhhcCceEEEEEEeecc-EEec Confidence 00 01111 11111 11122211 0000 0000000001111110 00011223456666654 6778 Q ss_pred cchheeecCC Q lcl|NC_017674. 373 PWAVVRYLGI 382 (382) Q Consensus 373 P~aia~~~GI 382 (382) |.+|+.+..- T Consensus 373 ~~a~~~l~~~ 382 (392) T protein:vir:10 373 NEAAVYGEID 382 (392) T ss_pred ccceEEEEec Confidence 9999998765 No 128 >protein:vir:102082 Length: 392 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512315;genbank:gi:89152484;genbank:GeneID:3953075 Probab=23.72 E-value=2.3 Score=18.63 Aligned_cols=320 Identities=11% Similarity=0.012 Sum_probs=129.8 Q ss_pred CCCcc------eeeeec-----Ccc-ccccccccccchHHHHHHhhccee-ccccchhhhhhhhcccccchhhhhhcccc Q lcl|NC_017674. 1 MSQIS------KTHSRL-----AGR-NAKPFDLKNITNDAVASLSRIGLV-FDHAVVQDQIKALAKAGAFRSGSAMDSNF 67 (382) Q Consensus 1 ~~~~~------~~~~~~-----~~~-~~~~~~~~~~~~~~~~~l~~~g~~-~~~~~~~~~~~~~~~~~~~~~~~amDa~~ 67 (382) +.++. +.+..+ ..+ ..++..-... ....+.++.... +.+...+......... ..... . T Consensus 35 ~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~l~~~~~~~~~~~~~~~-----~~~~~--~ 105 (392) T protein:vir:10 35 MEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNV--DGEMEYRDVFMKALRNKPLNAEEREFLED-----DLEQR--A 105 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCc--cchHHHHHHHHHHHhcccccHHHHHHHhh-----hhhhh--h Confidence 00000 000000 000 0001100000 011111110000 0011111111111000 00000 0 Q ss_pred cCcccccchh--HHHHHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEEeeecccceeecccccCCceee-eee Q lcl|NC_017674. 68 TAPVTTPSIP--TPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEPAGTAVEYGDHTNIPLTS-WNA 144 (382) Q Consensus 68 ~~~~t~~~~~--~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e~~G~a~~ygd~~DiP~vd-~~~ 144 (382) ....++++.| +|..+. ++|++.+........+..+.....- ...+.+......+.+.+.+.+...|-.+ ... T Consensus 106 ~~~~t~~~gg~~vP~~~~----~~ii~~~~~~s~l~~~~~~~~~~~~-~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~ 180 (392) T protein:vir:10 106 MSGLTGEDGGLVIPQDIQ----TQINELARSFDALEQYVTVEPVRTR-SGSRVLEKNSDMIPFAEITEMGEIPETDNPKF 180 (392) T ss_pred ccccccCCCceecchhHH----HHHHHHHHhhhhhhhhceeeeccCC-ceeEEEEeecCCccceeecccccccccccccc Confidence 1112333343 444443 4666766666656666554332211 1123334444445666778877887654 578 Q ss_pred eeeEeeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCcceeccCC Q lcl|NC_017674. 145 NFERRTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAFQTPPS 224 (382) Q Consensus 145 ~~~~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~~~~a~ 224 (382) +...-..+.++..+.+|.+=|+. ...+|.+.-.....+++...+|.-++.|+..+. .+.. T Consensus 181 ~~v~l~~~k~~~~~~iS~ell~d---s~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~-----------------~~~~ 240 (392) T protein:vir:10 181 SNVQYAVKDRAGILPLSRSLLQD---SDQNILKYVTKWLGKKSKVTRNVLILGVIEKLT-----------------KQAI 240 (392) T ss_pred eeEEeeeeeEEEeehhhHHHHhh---hHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc-----------------ccCc Confidence 88888889999999999866543 346788888888888999999988888863210 0011 Q ss_pred CCccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhcccc-CCCCccHHHH-HHHhcC----ccEEEE Q lcl|NC_017674. 225 QGWSTADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSVT-TPYGISVSDW-IEQTYP----KMRIVS 298 (382) Q Consensus 225 ~~Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~t-~~~~~Tvl~~-l~~n~p----nl~i~~ 298 (382) .+. +||.++++..... .+.++ -.++|.|+.+..|.+. +..|.-++.- +....+ +..++. T Consensus 241 -----~~~----d~i~~~~~~~l~~---~~~~~---a~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~tllG~~~v~ 305 (392) T protein:vir:10 241 -----KSL----DDIKDVLNVKLDP---AISPN---AILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVV 305 (392) T ss_pred -----cCH----HHHHHHHHHhhhh---hhccC---CEEEEcHHHHHHHHHhhccCCCeEeecCccCCccccccCcccEE Confidence 122 3444443321111 11222 3689999988888542 3333322211 111111 111111 Q ss_pred -cc--ccccccCCCCCceeEEEEcchhhhhhhccccccchhhhhhhhhhhhcccc---eecCCceEeccccceeeeEeec Q lcl|NC_017674. 299 -AP--ELSGVQMKAQEPEDALVLFVEDVNAAVDGSTDGGSVFSQLVQSKFITLGV---EKRAKSYVEDFSNGTAGALCKR 372 (382) Q Consensus 299 -~p--eL~~a~g~g~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~---~~~~~~~~~~~~~~t~Gv~i~~ 372 (382) .. -+... +.+.+ ...+++.+ .... ..+.....+.+...+. ....-....-+..|.+| .+++ T Consensus 306 ~~~~~~~~~~-~~~~~--~~~~~~gd-fs~~--------~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~-~v~~ 372 (392) T protein:vir:10 306 VVSNRFLKSK-GTTAK--KAPLIIGD-LKEA--------IVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDV-QMWD 372 (392) T ss_pred EecccccCCC-cccCC--ceEEEEEe-hhce--------EEEEeecceEEEEeccccchhhcCceEEEEEEeecc-EEec Confidence 00 01111 11111 11122211 0000 0000000001111110 00011223456666654 6778 Q ss_pred cchheeecCC Q lcl|NC_017674. 373 PWAVVRYLGI 382 (382) Q Consensus 373 P~aia~~~GI 382 (382) |.+|+.+..- T Consensus 373 ~~a~~~l~~~ 382 (392) T protein:vir:10 373 NEAAVYGEID 382 (392) T ss_pred ccceEEEEec Confidence 9999998765 No 129 >protein:vir:105004 Length: 392 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459969;genbank:gi:85701384;genbank:GeneID:3882145 Probab=23.72 E-value=2.3 Score=18.63 Aligned_cols=320 Identities=11% Similarity=0.012 Sum_probs=129.8 Q ss_pred CCCcc------eeeeec-----Ccc-ccccccccccchHHHHHHhhccee-ccccchhhhhhhhcccccchhhhhhcccc Q lcl|NC_017674. 1 MSQIS------KTHSRL-----AGR-NAKPFDLKNITNDAVASLSRIGLV-FDHAVVQDQIKALAKAGAFRSGSAMDSNF 67 (382) Q Consensus 1 ~~~~~------~~~~~~-----~~~-~~~~~~~~~~~~~~~~~l~~~g~~-~~~~~~~~~~~~~~~~~~~~~~~amDa~~ 67 (382) +.++. +.+..+ ..+ ..++..-... ....+.++.... +.+...+......... ..... . T Consensus 35 ~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~l~~~~~~~~~~~~~~~-----~~~~~--~ 105 (392) T protein:vir:10 35 MEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNV--DGEMEYRDVFMKALRNKPLNAEEREFLED-----DLEQR--A 105 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCc--cchHHHHHHHHHHHhcccccHHHHHHHhh-----hhhhh--h Confidence 00000 000000 000 0001100000 011111110000 0011111111111000 00000 0 Q ss_pred cCcccccchh--HHHHHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEEeeecccceeecccccCCceee-eee Q lcl|NC_017674. 68 TAPVTTPSIP--TPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEPAGTAVEYGDHTNIPLTS-WNA 144 (382) Q Consensus 68 ~~~~t~~~~~--~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e~~G~a~~ygd~~DiP~vd-~~~ 144 (382) ....++++.| +|..+. ++|++.+........+..+.....- ...+.+......+.+.+.+.+...|-.+ ... T Consensus 106 ~~~~t~~~gg~~vP~~~~----~~ii~~~~~~s~l~~~~~~~~~~~~-~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~ 180 (392) T protein:vir:10 106 MSGLTGEDGGLVIPQDIQ----TQINELARSFDALEQYVTVEPVRTR-SGSRVLEKNSDMIPFAEITEMGEIPETDNPKF 180 (392) T ss_pred ccccccCCCceecchhHH----HHHHHHHHhhhhhhhhceeeeccCC-ceeEEEEeecCCccceeecccccccccccccc Confidence 1112333343 444443 4666766666656666554332211 1123334444445666778877887654 578 Q ss_pred eeeEeeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCcceeccCC Q lcl|NC_017674. 145 NFERRTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAFQTPPS 224 (382) Q Consensus 145 ~~~~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~~~~a~ 224 (382) +...-..+.++..+.+|.+=|+. ...+|.+.-.....+++...+|.-++.|+..+. .+.. T Consensus 181 ~~v~l~~~k~~~~~~iS~ell~d---s~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~-----------------~~~~ 240 (392) T protein:vir:10 181 SNVQYAVKDRAGILPLSRSLLQD---SDQNILKYVTKWLGKKSKVTRNVLILGVIEKLT-----------------KQAI 240 (392) T ss_pred eeEEeeeeeEEEeehhhHHHHhh---hHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc-----------------ccCc Confidence 88888889999999999866543 346788888888888999999988888863210 0011 Q ss_pred CCccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhcccc-CCCCccHHHH-HHHhcC----ccEEEE Q lcl|NC_017674. 225 QGWSTADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSVT-TPYGISVSDW-IEQTYP----KMRIVS 298 (382) Q Consensus 225 ~~Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~t-~~~~~Tvl~~-l~~n~p----nl~i~~ 298 (382) .+. +||.++++..... .+.++ -.++|.|+.+..|.+. +..|.-++.- +....+ +..++. T Consensus 241 -----~~~----d~i~~~~~~~l~~---~~~~~---a~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~tllG~~~v~ 305 (392) T protein:vir:10 241 -----KSL----DDIKDVLNVKLDP---AISPN---AILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVV 305 (392) T ss_pred -----cCH----HHHHHHHHHhhhh---hhccC---CEEEEcHHHHHHHHHhhccCCCeEeecCccCCccccccCcccEE Confidence 122 3444443321111 11222 3689999988888542 3333322211 111111 111111 Q ss_pred -cc--ccccccCCCCCceeEEEEcchhhhhhhccccccchhhhhhhhhhhhcccc---eecCCceEeccccceeeeEeec Q lcl|NC_017674. 299 -AP--ELSGVQMKAQEPEDALVLFVEDVNAAVDGSTDGGSVFSQLVQSKFITLGV---EKRAKSYVEDFSNGTAGALCKR 372 (382) Q Consensus 299 -~p--eL~~a~g~g~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~---~~~~~~~~~~~~~~t~Gv~i~~ 372 (382) .. -+... +.+.+ ...+++.+ .... ..+.....+.+...+. ....-....-+..|.+| .+++ T Consensus 306 ~~~~~~~~~~-~~~~~--~~~~~~gd-fs~~--------~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~-~v~~ 372 (392) T protein:vir:10 306 VVSNRFLKSK-GTTAK--KAPLIIGD-LKEA--------IVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDV-QMWD 372 (392) T ss_pred EecccccCCC-cccCC--ceEEEEEe-hhce--------EEEEeecceEEEEeccccchhhcCceEEEEEEeecc-EEec Confidence 00 01111 11111 11122211 0000 0000000001111110 00011223456666654 6778 Q ss_pred cchheeecCC Q lcl|NC_017674. 373 PWAVVRYLGI 382 (382) Q Consensus 373 P~aia~~~GI 382 (382) |.+|+.+..- T Consensus 373 ~~a~~~l~~~ 382 (392) T protein:vir:10 373 NEAAVYGEID 382 (392) T ss_pred ccceEEEEec Confidence 9999998765 No 130 >protein:vir:102873 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338137;genbank:gi:77020198;genbank:GeneID:3703782 Probab=23.72 E-value=2.3 Score=18.63 Aligned_cols=320 Identities=11% Similarity=0.012 Sum_probs=129.8 Q ss_pred CCCcc------eeeeec-----Ccc-ccccccccccchHHHHHHhhccee-ccccchhhhhhhhcccccchhhhhhcccc Q lcl|NC_017674. 1 MSQIS------KTHSRL-----AGR-NAKPFDLKNITNDAVASLSRIGLV-FDHAVVQDQIKALAKAGAFRSGSAMDSNF 67 (382) Q Consensus 1 ~~~~~------~~~~~~-----~~~-~~~~~~~~~~~~~~~~~l~~~g~~-~~~~~~~~~~~~~~~~~~~~~~~amDa~~ 67 (382) +.++. +.+..+ ..+ ..++..-... ....+.++.... +.+...+......... ..... . T Consensus 35 ~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~l~~~~~~~~~~~~~~~-----~~~~~--~ 105 (392) T protein:vir:10 35 MEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNV--DGEMEYRDVFMKALRNKPLNAEEREFLED-----DLEQR--A 105 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCc--cchHHHHHHHHHHHhcccccHHHHHHHhh-----hhhhh--h Confidence 00000 000000 000 0001100000 011111110000 0011111111111000 00000 0 Q ss_pred cCcccccchh--HHHHHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEEeeecccceeecccccCCceee-eee Q lcl|NC_017674. 68 TAPVTTPSIP--TPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEPAGTAVEYGDHTNIPLTS-WNA 144 (382) Q Consensus 68 ~~~~t~~~~~--~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e~~G~a~~ygd~~DiP~vd-~~~ 144 (382) ....++++.| +|..+. ++|++.+........+..+.....- ...+.+......+.+.+.+.+...|-.+ ... T Consensus 106 ~~~~t~~~gg~~vP~~~~----~~ii~~~~~~s~l~~~~~~~~~~~~-~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~ 180 (392) T protein:vir:10 106 MSGLTGEDGGLVIPQDIQ----TQINELARSFDALEQYVTVEPVRTR-SGSRVLEKNSDMIPFAEITEMGEIPETDNPKF 180 (392) T ss_pred ccccccCCCceecchhHH----HHHHHHHHhhhhhhhhceeeeccCC-ceeEEEEeecCCccceeecccccccccccccc Confidence 1112333343 444443 4666766666656666554332211 1123334444445666778877887654 578 Q ss_pred eeeEeeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEEeeccCCcccceEEEeCCCCcceeccCC Q lcl|NC_017674. 145 NFERRTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPAFQTPPS 224 (382) Q Consensus 145 ~~~~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~Gd~~g~~~g~~GllN~P~l~~~~~~a~ 224 (382) +...-..+.++..+.+|.+=|+. ...+|.+.-.....+++...+|.-++.|+..+. .+.. T Consensus 181 ~~v~l~~~k~~~~~~iS~ell~d---s~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~-----------------~~~~ 240 (392) T protein:vir:10 181 SNVQYAVKDRAGILPLSRSLLQD---SDQNILKYVTKWLGKKSKVTRNVLILGVIEKLT-----------------KQAI 240 (392) T ss_pred eeEEeeeeeEEEeehhhHHHHhh---hHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc-----------------ccCc Confidence 88888889999999999866543 346788888888888999999988888863210 0011 Q ss_pred CCccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhcccc-CCCCccHHHH-HHHhcC----ccEEEE Q lcl|NC_017674. 225 QGWSTADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSVT-TPYGISVSDW-IEQTYP----KMRIVS 298 (382) Q Consensus 225 ~~Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~t-~~~~~Tvl~~-l~~n~p----nl~i~~ 298 (382) .+. +||.++++..... .+.++ -.++|.|+.+..|.+. +..|.-++.- +....+ +..++. T Consensus 241 -----~~~----d~i~~~~~~~l~~---~~~~~---a~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~tllG~~~v~ 305 (392) T protein:vir:10 241 -----KSL----DDIKDVLNVKLDP---AISPN---AILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVV 305 (392) T ss_pred -----cCH----HHHHHHHHHhhhh---hhccC---CEEEEcHHHHHHHHHhhccCCCeEeecCccCCccccccCcccEE Confidence 122 3444443321111 11222 3689999988888542 3333322211 111111 111111 Q ss_pred -cc--ccccccCCCCCceeEEEEcchhhhhhhccccccchhhhhhhhhhhhcccc---eecCCceEeccccceeeeEeec Q lcl|NC_017674. 299 -AP--ELSGVQMKAQEPEDALVLFVEDVNAAVDGSTDGGSVFSQLVQSKFITLGV---EKRAKSYVEDFSNGTAGALCKR 372 (382) Q Consensus 299 -~p--eL~~a~g~g~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~~---~~~~~~~~~~~~~~t~Gv~i~~ 372 (382) .. -+... +.+.+ ...+++.+ .... ..+.....+.+...+. ....-....-+..|.+| .+++ T Consensus 306 ~~~~~~~~~~-~~~~~--~~~~~~gd-fs~~--------~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~-~v~~ 372 (392) T protein:vir:10 306 VVSNRFLKSK-GTTAK--KAPLIIGD-LKEA--------IVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDV-QMWD 372 (392) T ss_pred EecccccCCC-cccCC--ceEEEEEe-hhce--------EEEEeecceEEEEeccccchhhcCceEEEEEEeecc-EEec Confidence 00 01111 11111 11122211 0000 0000000001111110 00011223456666654 6778 Q ss_pred cchheeecCC Q lcl|NC_017674. 373 PWAVVRYLGI 382 (382) Q Consensus 373 P~aia~~~GI 382 (382) |.+|+.+..- T Consensus 373 ~~a~~~l~~~ 382 (392) T protein:vir:10 373 NEAAVYGEID 382 (392) T ss_pred ccceEEEEec Confidence 9999998765 No 131 >protein:vir:80213 Length: 334 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522884;genbank:gi:158345177;genbank:GeneID:5687476 Probab=21.81 E-value=2.5 Score=18.36 Aligned_cols=303 Identities=12% Similarity=0.037 Sum_probs=106.4 Q ss_pred hhhhhcccccchhhhhhcccccCcccccchhHHHHHHhhhhhhheeccccccchhhhCccccCCCcceeeEEEEeeeccc Q lcl|NC_017674. 47 QIKALAKAGAFRSGSAMDSNFTAPVTTPSIPTPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEPAG 126 (382) Q Consensus 47 ~~~~~~~~~~~~~~~amDa~~~~~~t~~~~~~~~~~l~~idp~v~~~~~~~~~~~~l~~v~t~g~~~~~t~t~~v~e~~G 126 (382) |...-.+ ..+.+.-....+-.+-||+.+.-+|.......-..+.++.+.+.-. -.++.|+. .| T Consensus 1 m~~~~~~------------~~t~~~~~~~~~~~~l~le~~~geV~~af~~~s~~~~~~~~r~i~~--G~s~~~~~---iG 63 (334) T protein:vir:80 1 MTYPAAN------------THTRPGWGGANSDVSLHIEEHLGLVDASFMYSSKFASWMNVRSLRG--TNQLRVDR---VG 63 (334) T ss_pred CCCCcCC------------CccccccccccchheehhhhhhhHHHHHHHHhhhhhccceeeeccc--cceEEEee---ec Confidence 2111110 0111111111111222344444444333323333444444433211 25666654 46 Q ss_pred ceeecc--cccCCceeeeeeeeeEeeEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEE----Eeec Q lcl|NC_017674. 127 TAVEYG--DHTNIPLTSWNANFERRTIVRGELGMMVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGF----YGWQ 200 (382) Q Consensus 127 ~a~~yg--d~~DiP~vd~~~~~~~~~v~~~~~g~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~----~Gd~ 200 (382) .+++.. -+..+.--...-++..-.|-.. --++.-+.++.+++ +..++-++-.+.+..++.++.|+..+ .|-. T Consensus 64 ~~~~~~~~~g~~l~~~~~~~~~~~l~ID~~-l~~~~~VddiD~~q-~~~D~rse~~~~~G~aLA~~~D~~~~~~l~kaa~ 141 (334) T protein:vir:80 64 ASTIAGRKAGEELVVQKNVSDKLNLTVDTV-LYARHFFDKFDEWT-SNLDVRKETAREDGIALARQYDQACIIQLQKCGD 141 (334) T ss_pred ceeeeeecCCCCCCCCCcccCceEEEEeee-eehhhhHhhHHHHh-cCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhh Confidence 665432 1222211111112211111110 01112223333333 33444444445555555555554322 1100 Q ss_pred cCCcccceEEEeCCCCcceeccCC-CCccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccceEecCHHHHhhcccc--- Q lcl|NC_017674. 201 SGLGNRTYGFLNDPNLPAFQTPPS-QGWSTADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITLALATSKVDYLSVT--- 276 (382) Q Consensus 201 ~g~~~g~~GllN~P~l~~~~~~a~-~~Wa~kT~~eI~~Di~~~~~~l~~~t~g~~~~~~~p~~L~Lp~~~~~~Ls~t--- 276 (382) .........-+++. ....+...+ +.-...+++.+.+=+..+...+.++.-- ++-..+..++++|.+|..|-.- T Consensus 142 ~~~~~~~~~~~~~G-~~~~~~~~g~~~~~~~~~~~l~~a~~~a~~~L~e~dvp--~~~~~~R~~vv~P~~y~~Ll~~~r~ 218 (334) T protein:vir:80 142 FLAPAHLKPAFHDG-ILLPSTISGLAADAAADADVLVAAHRQGVEAMVFRDLG--DQLMSEGVTLLDPVIFSFLLEHDRL 218 (334) T ss_pred hcccccccccccCC-cceeecccccccchhhhHHHHHHHHHHHHHHHHhcCCC--CCcCCceEEEeChHHHHHHhccccc Confidence 00000000011111 111111111 1223466888888777777777665431 1001235789999999998532 Q ss_pred -C-CCCc--cHHHHHHH---hcCccEEEEccccccccCCCCCceeEEEEcchhhhhhhccccccchhhhhhhhhhhhccc Q lcl|NC_017674. 277 -T-PYGI--SVSDWIEQ---TYPKMRIVSAPELSGVQMKAQEPEDALVLFVEDVNAAVDGSTDGGSVFSQLVQSKFITLG 349 (382) Q Consensus 277 -~-~~~~--Tvl~~l~~---n~pnl~i~~~peL~~a~g~g~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~p~~~~~l~ 349 (382) | +++- +...+-+. +.-+++|+..+.|=.....+...+...-.|..+.. . ....|.. |+- ..- T Consensus 219 ~n~d~~~s~~~~~~~~g~i~~v~G~~V~~Sn~~P~~~~t~~~~g~~~~~~agd~t------~-~~~~~~~--~~A--l~t 287 (334) T protein:vir:80 219 MNVEFGAKEGGNSFVGGRIAMLNGVRVVETPRFPQSAITANALGADFNVTDAEVR------R-KMITFIP--SMA--LIS 287 (334) T ss_pred ccceeccccccccccceeEEEEeceEEEeecCCCCcccccccccccccccccccc------c-eEEEEEe--Cce--EEE Confidence 1 1111 11111111 12235555544442211110000000000111000 0 0000000 000 000 Q ss_pred ceec---CCceEeccccc-------eeeeEeecc--chheeecCC Q lcl|NC_017674. 350 VEKR---AKSYVEDFSNG-------TAGALCKRP--WAVVRYLGI 382 (382) Q Consensus 350 ~~~~---~~~~~~~~~~~-------t~Gv~i~~P--~aia~~~GI 382 (382) ++.. ...|..+.+.. ..|+-++|| .++..++++ T Consensus 288 ~~~~~~~~e~~~~~~~~~d~i~~~~a~G~g~lRPeaa~vv~~~~~ 332 (334) T protein:vir:80 288 AQVHPVSAQFWEEKKDFGHYLDTFQSYNIGQRRPDAVAVHDITVT 332 (334) T ss_pred EEEeecceeeeechhhHHHHHHHHHHcCCceeccceEEEEEEeee Confidence 1111 12233333333 569999999 556677888 Done!