Query lcl|NC_021342.1_cdsid_YP_008060416.1 [gene=M171_gp13] [protein=hypothetical protein] [protein_id=YP_008060416.1] [location=8394..9458]
Match_columns 354
No_of_seqs 146 out of 190
Neff 7.6
Searched_HMMs 1612
Date Thu Nov 7 16:42:33 2013
Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_13 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_13_vs_rec_db.hhr

No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 protein:vir:79642 Length: 329 100.0 1.4E-91 8.6E-95 518.7 31.7 322 14-354 1-326 (329)
2 protein:vir:5255 Length: 304 # 100.0 2.8E-91 1.8E-94 517.0 29.1 295 50-353 1-304 (304)
3 protein:vir:104342 Length: 314 100.0 6.2E-90 3.9E-93 509.6 30.9 311 1-354 1-311 (314)
4 protein:vir:107687 Length: 319 100.0 2.3E-88 1.5E-91 501.0 32.0 316 19-354 1-319 (319)
5 protein:vir:103285 Length: 296 100.0 1.9E-88 1.2E-91 501.5 30.8 292 45-354 1-293 (296)
6 protein:vir:80068 Length: 301 100.0 7.5E-87 4.7E-90 492.7 30.6 294 45-354 1-301 (301)
7 protein:vir:94070 Length: 339 100.0 1.9E-82 1.2E-85 468.6 28.3 332 1-354 1-339 (339)
8 protein:vir:101557 Length: 336 100.0 2.8E-79 1.7E-82 451.2 26.9 327 5-354 1-336 (336)
9 protein:vir:78558 Length: 336 100.0 3.3E-79 2.1E-82 450.8 25.7 324 5-354 1-336 (336)
10 protein:vir:3643 Length: 336 # 100.0 4E-79 2.5E-82 450.4 26.1 327 5-354 1-336 (336)
11 protein:vir:106734 Length: 336 100.0 6.4E-79 4E-82 449.3 25.2 324 5-354 1-336 (336)
12 protein:vir:107732 Length: 379 100.0 8.9E-78 5.5E-81 443.0 28.7 328 1-354 19-379 (379)
13 protein:vir:99576 Length: 388 100.0 4.1E-76 2.5E-79 433.9 25.7 334 1-354 21-388 (388)
14 protein:vir:96079 Length: 382 100.0 6.7E-74 4.1E-77 421.8 27.0 336 1-354 19-382 (382)
15 protein:vir:105778 Length: 358 99.7 4.3E-19 2.7E-22 121.3 12.4 325 1-354 1-357 (358)
16 protein:vir:94673 Length: 419 98.9 7.9E-10 4.9E-13 70.5 20.1 322 1-354 51-415 (419)
17 protein:vir:9574 Length: 300 # 98.9 4.2E-10 2.6E-13 72.0 18.3 282 45-354 1-298 (300)
18 protein:vir:8187 Length: 311 # 98.9 6.3E-10 3.9E-13 71.0 18.2 287 37-354 1-308 (311)
19 protein:vir:101650 Length: 497 98.9 9.6E-10 5.9E-13 70.0 19.1 327 1-354 87-491 (497)
20 protein:vir:7855 Length: 497 # 98.9 9.6E-10 5.9E-13 70.0 19.1 327 1-354 87-491 (497)
21 protein:vir:99920 Length: 311 98.9 5E-10 3.1E-13 71.6 16.4 289 44-354 1-310 (311)
22 protein:vir:9759 Length: 303 # 98.8 1.8E-09 1.1E-12 68.5 19.2 285 45-354 1-301 (303)
23 protein:vir:10364 Length: 390 98.8 5.8E-09 3.6E-12 65.7 20.5 303 1-354 73-390 (390)
24 protein:vir:80684 Length: 315 98.8 1.6E-09 1E-12 68.8 17.4 289 45-354 1-304 (315)
25 protein:vir:94142 Length: 304 98.8 2.6E-09 1.6E-12 67.6 18.0 284 35-354 1-303 (304)
26 protein:vir:105905 Length: 304 98.8 2.6E-09 1.6E-12 67.6 18.0 284 35-354 1-303 (304)
27 protein:vir:104256 Length: 458 98.8 4.5E-09 2.8E-12 66.3 19.2 322 1-354 118-456 (458)
28 protein:vir:41 Length: 299 # N 98.8 4.7E-09 2.9E-12 66.3 19.2 277 40-354 1-296 (299)
29 protein:vir:1638 Length: 298 # 98.8 2.7E-09 1.7E-12 67.6 17.7 280 45-354 1-297 (298)
30 protein:vir:95763 Length: 297 98.8 4.6E-09 2.9E-12 66.3 18.6 277 37-354 1-294 (297)
31 protein:vir:1433 Length: 435 # 98.7 1.3E-08 8E-12 63.8 20.7 326 1-354 53-431 (435)
32 protein:vir:97053 Length: 390 98.7 7.8E-09 4.8E-12 65.0 19.4 314 1-354 45-390 (390)
33 protein:vir:104085 Length: 320 98.7 4.5E-09 2.8E-12 66.4 17.2 294 24-354 1-315 (320)
34 protein:vir:81070 Length: 390 98.7 1.2E-08 7.3E-12 64.0 19.2 314 1-354 45-390 (390)
35 protein:vir:7771 Length: 330 # 98.7 9.3E-09 5.7E-12 64.6 18.0 294 35-354 1-321 (330)
36 protein:vir:81227 Length: 413 98.7 2.9E-08 1.8E-11 61.9 20.5 320 1-354 72-408 (413)
37 protein:vir:94771 Length: 298 98.7 9.7E-09 6E-12 64.5 17.8 280 45-354 1-297 (298)
38 protein:vir:80376 Length: 435 98.7 2E-08 1.2E-11 62.8 19.3 328 1-354 50-431 (435)
39 protein:vir:4339 Length: 395 # 98.6 2.7E-08 1.7E-11 62.1 19.7 314 1-354 52-393 (395)
40 protein:vir:78223 Length: 333 98.6 2.2E-08 1.4E-11 62.6 18.7 302 29-354 1-330 (333)
41 protein:vir:98339 Length: 415 98.6 2.3E-08 1.5E-11 62.4 18.6 321 1-354 42-402 (415)
42 protein:vir:79987 Length: 415 98.6 2.3E-08 1.5E-11 62.4 18.6 321 1-354 42-402 (415)
43 protein:vir:81100 Length: 415 98.6 2.3E-08 1.5E-11 62.4 18.6 321 1-354 42-402 (415)
44 protein:vir:78523 Length: 338 98.6 3.9E-08 2.4E-11 61.2 19.6 304 29-354 1-333 (338)
45 protein:vir:4700 Length: 415 # 98.6 1.9E-08 1.2E-11 62.9 17.7 322 1-354 49-402 (415)
46 protein:vir:4600 Length: 415 # 98.6 1.9E-08 1.2E-11 62.9 17.7 322 1-354 49-402 (415)
47 protein:vir:96223 Length: 324 98.6 4.4E-08 2.8E-11 60.9 19.5 294 3-354 1-313 (324)
48 protein:vir:2504 Length: 305 # 98.6 4.3E-08 2.7E-11 61.0 18.9 275 37-354 1-296 (305)
49 protein:vir:8102 Length: 543 # 98.6 4.3E-08 2.7E-11 61.0 18.8 320 1-354 197-540 (543)
50 protein:vir:1886 Length: 385 # 98.6 6.2E-08 3.8E-11 60.1 19.6 314 1-354 41-382 (385)
51 protein:vir:191 Length: 385 # 98.6 6.2E-08 3.8E-11 60.1 19.6 314 1-354 41-382 (385)
52 protein:vir:2430 Length: 318 # 98.6 4.7E-08 2.9E-11 60.8 18.7 292 24-354 1-311 (318)
53 protein:vir:9410 Length: 415 # 98.6 1.8E-08 1.1E-11 63.0 16.4 320 1-354 42-402 (415)
54 protein:vir:103955 Length: 324 98.5 7.7E-08 4.8E-11 59.6 19.4 294 3-354 1-313 (324)
55 protein:vir:100135 Length: 418 98.5 8E-08 4.9E-11 59.5 19.3 314 1-354 75-413 (418)
56 protein:vir:8420 Length: 477 # 98.5 2.2E-08 1.4E-11 62.5 16.0 336 1-354 93-469 (477)
57 protein:vir:9309 Length: 324 # 98.5 1.2E-07 7.4E-11 58.5 19.0 291 1-354 4-313 (324)
58 protein:vir:99749 Length: 324 98.5 1.7E-07 1.1E-10 57.7 19.7 294 3-354 1-313 (324)
59 protein:vir:4226 Length: 326 # 98.5 1E-07 6.4E-11 58.9 18.2 306 24-354 1-321 (326)
60 protein:vir:97148 Length: 324 98.4 2.3E-07 1.4E-10 57.0 19.9 296 17-354 1-313 (324)
61 protein:vir:100247 Length: 425 98.4 5.1E-08 3.2E-11 60.6 16.3 325 1-354 63-422 (425)
62 protein:vir:4456 Length: 401 # 98.4 2.5E-08 1.6E-11 62.2 13.4 326 1-354 56-399 (401)
63 protein:vir:108211 Length: 318 98.4 1.5E-08 9.5E-12 63.4 12.0 280 37-354 1-315 (318)
64 protein:vir:78830 Length: 324 98.4 5.2E-07 3.2E-10 55.0 20.0 294 17-354 1-313 (324)
65 protein:vir:96392 Length: 324 98.4 5.2E-07 3.2E-10 55.0 20.0 294 17-354 1-313 (324)
66 protein:vir:5739 Length: 366 # 98.4 4.4E-07 2.8E-10 55.4 19.5 316 1-354 1-364 (366)
67 protein:vir:485 Length: 407 # 98.3 1.9E-07 1.1E-10 57.5 16.8 319 1-354 41-398 (407)
68 protein:vir:105038 Length: 428 98.3 9.7E-07 6E-10 53.5 20.0 327 1-354 44-426 (428)
69 protein:vir:102119 Length: 404 98.3 8.1E-08 5E-11 59.5 13.7 315 1-354 38-398 (404)
70 protein:vir:96762 Length: 632 98.2 2.7E-07 1.7E-10 56.6 15.5 316 1-354 286-631 (632)
71 protein:vir:2344 Length: 397 # 98.2 5.4E-07 3.3E-10 54.9 16.8 288 24-354 1-304 (397)
72 protein:vir:9643 Length: 377 # 98.2 3.3E-06 2E-09 50.6 20.0 311 1-354 37-375 (377)
73 protein:vir:1328 Length: 392 # 98.1 3.4E-06 2.1E-09 50.5 19.5 319 1-354 45-389 (392)
74 protein:vir:101607 Length: 379 98.1 4.6E-06 2.9E-09 49.8 19.9 305 1-354 52-377 (379)
75 protein:vir:4856 Length: 293 # 98.0 1.6E-06 1E-09 52.3 16.3 272 36-354 1-279 (293)
76 protein:vir:4092 Length: 390 # 98.0 6.9E-06 4.3E-09 48.9 19.3 312 1-354 38-366 (390)
77 protein:vir:6212 Length: 434 # 98.0 1.3E-06 7.9E-10 52.9 15.2 319 1-354 82-427 (434)
78 protein:vir:1268 Length: 397 # 98.0 1.8E-06 1.1E-09 52.0 15.8 305 1-354 84-395 (397)
79 protein:vir:4197 Length: 314 # 98.0 6.2E-06 3.8E-09 49.1 18.5 297 23-354 1-311 (314)
80 protein:vir:1025 Length: 408 # 98.0 8.6E-07 5.3E-10 53.8 13.8 312 1-354 54-391 (408)
81 protein:vir:3991 Length: 404 # 98.0 1E-06 6.3E-10 53.5 14.1 313 1-354 54-391 (404)
82 protein:vir:102082 Length: 392 97.9 4.9E-06 3E-09 49.7 17.2 307 1-354 46-382 (392)
83 protein:vir:102873 Length: 392 97.9 4.9E-06 3E-09 49.7 17.2 307 1-354 46-382 (392)
84 protein:vir:105004 Length: 392 97.9 4.9E-06 3E-09 49.7 17.2 307 1-354 46-382 (392)
85 protein:vir:107593 Length: 392 97.9 4.9E-06 3E-09 49.7 17.2 307 1-354 46-382 (392)
86 protein:vir:6242 Length: 390 # 97.9 3.9E-06 2.4E-09 50.2 16.4 317 1-354 45-387 (390)
87 protein:vir:98635 Length: 377 97.9 9.1E-06 5.6E-09 48.2 18.2 312 1-354 20-375 (377)
88 protein:vir:7409 Length: 408 # 97.8 2.6E-06 1.6E-09 51.2 14.1 312 1-354 51-391 (408)
89 protein:vir:4953 Length: 397 # 97.8 5E-06 3.1E-09 49.7 15.5 312 1-354 51-383 (397)
90 protein:vir:102655 Length: 322 97.8 1.6E-05 9.9E-09 46.9 18.1 301 35-354 1-319 (322)
91 protein:vir:4511 Length: 409 # 97.8 1.9E-05 1.2E-08 46.5 18.5 317 1-354 49-404 (409)
92 protein:vir:93616 Length: 645 97.8 2E-05 1.2E-08 46.4 20.0 319 1-354 288-637 (645)
93 protein:vir:4830 Length: 397 # 97.8 6.4E-06 4E-09 49.1 15.3 310 1-354 51-383 (397)
94 protein:vir:81160 Length: 371 97.7 1.3E-05 8.3E-09 47.3 16.9 313 1-354 36-369 (371)
95 protein:vir:3613 Length: 272 # 97.7 2E-05 1.2E-08 46.4 17.3 268 41-354 1-270 (272)
96 protein:vir:78640 Length: 352 97.7 7.2E-06 4.5E-09 48.8 14.7 301 1-354 23-344 (352)
97 protein:vir:4997 Length: 397 # 97.7 7.2E-06 4.4E-09 48.8 14.7 310 1-354 51-383 (397)
98 protein:vir:3033 Length: 272 # 97.7 2.8E-05 1.7E-08 45.6 19.9 263 40-354 1-267 (272)
99 protein:vir:9820 Length: 272 # 97.7 2.8E-05 1.7E-08 45.6 19.9 263 40-354 1-267 (272)
100 protein:vir:4159 Length: 315 # 97.7 1.8E-05 1.1E-08 46.6 16.9 303 3-353 1-315 (315)
101 protein:vir:9509 Length: 381 # 97.6 3.8E-05 2.3E-08 44.8 20.0 309 1-354 1-366 (381)
102 protein:vir:101291 Length: 381 97.6 3.8E-05 2.3E-08 44.8 20.0 309 1-354 1-366 (381)
103 protein:vir:78350 Length: 383 97.6 4.2E-05 2.6E-08 44.6 17.3 309 1-354 38-373 (383)
104 protein:vir:95963 Length: 395 97.5 5.6E-05 3.5E-08 43.9 20.0 305 1-354 23-374 (395)
105 protein:vir:80930 Length: 278 97.5 5.8E-05 3.6E-08 43.8 18.5 272 40-354 1-275 (278)
106 protein:vir:3870 Length: 400 # 97.3 4.4E-05 2.7E-08 44.5 14.6 295 1-354 63-397 (400)
107 protein:vir:80128 Length: 466 97.3 9.9E-05 6.1E-08 42.5 18.7 320 1-354 82-446 (466)
108 protein:vir:95376 Length: 425 97.2 0.00014 8.4E-08 41.8 19.8 312 1-354 68-419 (425)
109 protein:vir:3845 Length: 395 # 97.1 0.00013 7.8E-08 42.0 14.9 309 1-354 48-381 (395)
110 protein:vir:100172 Length: 394 97.1 0.00018 1.1E-07 41.1 18.5 310 1-354 55-382 (394)
111 protein:vir:100632 Length: 381 97.1 0.00019 1.2E-07 41.0 19.6 308 1-354 1-366 (381)
112 protein:vir:93881 Length: 387 97.0 0.00016 1E-07 41.4 14.8 303 1-354 51-379 (387)
113 protein:vir:93742 Length: 274 97.0 0.00024 1.5E-07 40.4 19.9 263 41-354 1-268 (274)
114 protein:vir:100884 Length: 389 96.7 0.00038 2.4E-07 39.3 18.7 306 1-354 55-380 (389)
115 protein:vir:97433 Length: 274 96.7 0.00043 2.7E-07 39.1 19.6 265 41-354 1-268 (274)
116 protein:vir:94494 Length: 274 96.7 0.00043 2.7E-07 39.1 19.6 265 41-354 1-268 (274)
117 protein:vir:96833 Length: 275 96.6 0.00047 2.9E-07 38.8 18.4 266 37-354 1-269 (275)
118 protein:vir:96123 Length: 274 96.6 0.00047 2.9E-07 38.8 20.4 263 40-354 1-268 (274)
119 protein:vir:9361 Length: 402 # 96.5 0.00054 3.3E-07 38.5 15.2 303 1-354 73-394 (402)
120 protein:vir:96978 Length: 387 96.5 0.00043 2.7E-07 39.0 13.6 303 1-354 58-379 (387)
121 protein:vir:2685 Length: 387 # 96.5 0.00043 2.7E-07 39.0 13.6 303 1-354 58-379 (387)
122 protein:vir:94424 Length: 387 96.5 0.00043 2.7E-07 39.0 13.6 303 1-354 58-379 (387)
123 protein:vir:105334 Length: 276 96.2 0.00083 5.2E-07 37.5 19.3 266 41-354 1-268 (276)
124 protein:vir:80213 Length: 334 96.1 0.0011 6.5E-07 36.9 14.5 300 37-354 1-330 (334)
125 protein:vir:96262 Length: 274 96.0 0.0011 6.7E-07 36.9 18.5 265 41-354 1-268 (274)
126 protein:vir:95898 Length: 274 96.0 0.0011 6.7E-07 36.9 18.5 265 41-354 1-268 (274)
127 protein:vir:9704 Length: 394 # 95.9 0.0013 7.8E-07 36.5 15.6 299 1-354 51-388 (394)
128 protein:vir:962 Length: 397 # 95.5 0.002 1.2E-06 35.4 14.1 304 1-354 68-395 (397)
129 protein:vir:739 Length: 231 # 95.4 0.0021 1.3E-06 35.2 14.2 228 84-354 1-229 (231)
130 protein:vir:94576 Length: 347 95.2 0.0025 1.6E-06 34.8 14.8 294 24-354 1-347 (347)
131 protein:vir:8843 Length: 317 # 95.1 0.0029 1.8E-06 34.5 15.7 282 40-354 1-314 (317)
132 protein:vir:1239 Length: 274 # 95.0 0.0031 1.9E-06 34.4 18.3 262 41-354 1-268 (274)
133 protein:vir:99675 Length: 324 94.7 0.0037 2.3E-06 33.9 14.5 254 80-354 1-294 (324)
134 protein:vir:1383 Length: 421 # 94.7 0.0037 2.3E-06 33.9 17.6 307 1-354 52-381 (421)
135 protein:vir:8885 Length: 347 # 94.5 0.0042 2.6E-06 33.6 15.5 303 24-354 1-344 (347)
136 protein:vir:97255 Length: 310 94.5 0.0043 2.7E-06 33.6 17.0 279 42-354 1-308 (310)
137 protein:vir:95107 Length: 270 94.4 0.0046 2.8E-06 33.4 15.9 260 43-354 1-263 (270)
138 protein:vir:78739 Length: 332 94.0 0.0059 3.6E-06 32.8 13.0 297 24-354 1-332 (332)
139 protein:vir:10450 Length: 344 93.8 0.0064 4E-06 32.6 13.8 296 24-354 1-342 (344)
140 protein:vir:99888 Length: 309 93.5 0.0074 4.6E-06 32.3 14.3 265 40-354 1-295 (309)
141 protein:vir:2201 Length: 345 # 93.4 0.0075 4.7E-06 32.2 15.7 292 37-354 1-343 (345)
142 protein:vir:102823 Length: 470 93.3 0.0012 7.5E-07 36.6 6.9 292 19-354 1-339 (470)
143 protein:vir:3158 Length: 321 # 93.2 0.0083 5.2E-06 32.0 20.3 294 1-354 1-310 (321)
144 protein:vir:6324 Length: 335 # 92.2 0.012 7.8E-06 31.0 13.5 294 23-354 1-326 (335)
145 protein:vir:78935 Length: 335 91.2 0.017 1E-05 30.3 17.2 294 23-354 1-326 (335)
146 protein:vir:1541 Length: 347 # 89.9 0.024 1.5E-05 29.5 18.2 302 1-354 1-343 (347)
147 protein:vir:94711 Length: 347 89.1 0.029 1.8E-05 29.1 14.2 301 1-354 1-344 (347)
148 protein:vir:94622 Length: 341 88.7 0.031 1.9E-05 28.9 20.3 292 35-354 1-337 (341)
149 protein:vir:96666 Length: 462 87.8 0.036 2.3E-05 28.5 10.7 313 1-354 1-337 (462)
150 protein:vir:102605 Length: 273 84.9 0.057 3.5E-05 27.4 21.9 264 45-354 1-271 (273)
151 protein:vir:105822 Length: 273 84.9 0.057 3.5E-05 27.4 21.9 264 45-354 1-271 (273)
152 protein:vir:7990 Length: 273 # 83.5 0.068 4.2E-05 27.0 20.4 267 40-354 1-271 (273)
153 protein:vir:100057 Length: 375 82.4 0.077 4.8E-05 26.7 18.9 301 23-354 1-368 (375)
154 protein:vir:97397 Length: 517 81.8 0.083 5.1E-05 26.5 12.1 311 1-354 181-512 (517)
155 protein:vir:6378 Length: 346 # 81.5 0.085 5.3E-05 26.5 18.0 282 47-354 1-346 (346)
156 protein:vir:105645 Length: 400 81.0 0.09 5.6E-05 26.3 16.4 291 23-354 1-331 (400)
157 protein:vir:1084 Length: 437 # 78.3 0.12 7.2E-05 25.7 13.5 306 1-354 102-425 (437)
158 protein:vir:107882 Length: 307 76.5 0.14 8.4E-05 25.3 14.9 274 37-354 1-299 (307)
159 protein:vir:79078 Length: 307 75.3 0.15 9.2E-05 25.1 14.4 276 37-354 1-299 (307)
160 protein:vir:63741 Length: 468 72.1 0.19 0.00012 24.6 11.7 297 19-354 1-323 (468)
161 protein:vir:94933 Length: 330 70.1 0.21 0.00013 24.3 15.3 304 1-354 2-327 (330)
162 protein:vir:80491 Length: 467 69.4 0.22 0.00014 24.2 11.0 301 1-354 1-322 (467)
163 protein:vir:94800 Length: 319 67.1 0.26 0.00016 23.8 15.4 288 2-354 1-293 (319)
164 protein:vir:97331 Length: 319 67.1 0.26 0.00016 23.8 15.4 288 2-354 1-293 (319)
165 protein:vir:98480 Length: 348 63.7 0.31 0.00019 23.4 15.3 278 37-354 1-347 (348)
166 protein:vir:3364 Length: 347 # 61.7 0.35 0.00022 23.1 16.1 302 24-354 1-343 (347)
167 protein:vir:97031 Length: 402 60.3 0.37 0.00023 22.9 14.4 297 23-354 1-331 (402)
168 protein:vir:7019 Length: 401 # 47.8 0.69 0.00043 21.5 14.4 291 23-354 1-331 (401)
169 protein:vir:79928 Length: 393 45.0 0.79 0.00049 21.2 7.8 320 1-354 32-377 (393)
170 protein:vir:4074 Length: 480 # 33.8 1.3 0.00083 19.9 9.6 302 1-354 145-475 (480)
171 protein:vir:4902 Length: 348 # 31.2 1.5 0.00094 19.6 17.3 288 42-354 1-345 (348)
172 protein:vir:95603 Length: 463 30.2 1.6 0.00099 19.5 10.2 275 1-354 3-297 (463)
173 protein:vir:99311 Length: 463 30.2 1.6 0.00099 19.5 10.2 275 1-354 3-297 (463)
174 protein:vir:106590 Length: 349 24.2 2.2 0.0014 18.7 15.1 275 40-354 1-329 (349)
175 protein:vir:2736 Length: 348 # 21.9 2.5 0.0016 18.4 21.9 287 42-354 1-345 (348)

No 1
>protein:vir:79642 Length: 329 # NCBI annotation: HsbB # Family: family:all:463 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285525;genbank:gi:148734508;genbank:GeneID:5220000
Probab=100.00
E-value=1.4e-91 Score=518.71 Aligned_cols=322 Identities=19% Similarity=0.235 Sum_probs=298.3 Q ss_pred hhhhhcccccccccchhhhhhhhhhhccCCceeccchhhHHHHHHHHHHHHHHHHHhhhhcccchhhccccCCCCCceeE Q lcl|NC_021342. 14 NQWLVHKGYVSRNGDQWVINNTALDAIGNPNIMLDADGGIAFYISQLAGIEATVYETPYGDITYRFDVPMAANIPEYADT 93 (354) Q Consensus 14 ~~~~~~~~~~~~~~~~~~~~~~am~a~~~~~~~~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~ 93 (354) -.|.|..+++.+ |++.+...||+++. +..+.|+++.++|+++||++||++|||+++++++++++||+.++++||+++ T Consensus 1 ~~~~~~~~~~~~--d~~~~~~~a~~~~~-~~~~~~~~~~~~f~~~ql~~id~~v~e~~~~~l~~~~~i~i~~~~~~~~~~ 77 (329) T protein:vir:79 1 MRGNIMSKEMKY--DEFEANVIANHMQL-RGAKNDASDMGIWTSQELHKIKAQAYEKEYPAGSALRVFPVTSELSDTDKT 77 (329) T ss_pred Cccchhhhhhcc--chhhhhhHhhhccc-ccceeccchhhHHHHHHHHHHHHHHHhhhhcccchhhhcccccCCCCceeE Confidence 677777777764 56667667887754 556888888999999999999999999999999999999999999999999 Q ss_pred EEEEeeccccceeEecCCCcccceeeeccceeEEEEEEEEeeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhhee Q lcl|NC_021342. 94 WMYRSYDGVTMGKFIGANGQDLPRVAQSAQMHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVA 173 (354) Q Consensus 94 ~~~~~~~~~G~a~~~~~~~~dip~v~~~~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~ 173 (354) ++|++++.+|++++|+++++|+|+++++.+++.+|++.|+.+|+|+++||+++++.|+||+++|+.+|++++++++|+++ T Consensus 78 ~t~~~~~~~G~a~~~~d~~~dip~vd~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~ 157 (329) T protein:vir:79 78 FEYQTFDKVGHAKIIADYTDDLSTVDALMTSEFGKVFRLGNAFLISIDEIKAGQRTGKSLSTRKANAAQNAHDQLVNHLV 157 (329) T ss_pred EEeeeeecceeeeeecCcccccceeecccceeEEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeehhhCceeeeecCCccccccc----ccccccCHHHHHHHHHHHHHHHHHHhCCcccccEEEeCHHHHHHHhhcccCC Q lcl|NC_021342. 
174 YFGDASRGMYGLFNNPNVTLSSAT----KDYKTMNGQELFNMLNAPIFSVINLSRRFHVPNTALMFPDLWNQANNQLMTG 249 (354) Q Consensus 174 f~G~~~~gi~GLlN~p~~~~~~~~----~~W~~~T~~ei~~di~~~~~~l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~~ 249 (354) |+|++++|++||||+||+++...+ ++|++||++||++||++++++++.+|+|++.|++|+|||+.|.+|++++ . T Consensus 158 f~G~~~~g~~GLlN~p~v~~~~~~~~~~~~w~~kt~~ei~~di~~~~~~l~~~s~g~~~p~~L~Lpp~~~~~L~~~~--~ 235 (329) T protein:vir:79 158 FKGSKPHKIISVFEHPNLTTINSAGWNNAAGTGKKPETAQDELEQAIEKIETLTNGQHRANMILIPPSMRKVLMVRM--P 235 (329) T ss_pred EeecccccceeeecCCCccccccCCCCCccccccCHHHHHHHHHHHHHHHHHhcCceecccEEEecHHHHHHhhccc--C Confidence 999999999999999999865432 4699999999999999999999999999999999999999999998754 4 Q ss_pred CCCchHHHHHHhhCcccccccccceeeeeeeeeeccccccccccCcccEEEEEEcCcceEEEeeCchhhhccccccCcee Q lcl|NC_021342. 250 YTDRTVMQHFMEANSYTLLTGNELDIQIRFQLDAAELAANGVSNSNKPRYMVYDKSDRNLAMANPIPFRMLAPQMASLGI 329 (354) Q Consensus 250 ~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~~g~g~~g~d~~v~y~~~~~~~~~~vp~~~~~~~~~~~~l~~ 329 (354) ++++|+++||++||+ +++|+.+++|+. +|.+|+|||++|+++++++++++||||++||+|+++++| T Consensus 236 ~~~~tvl~~lk~~~~-------~l~I~~~~el~~-------ag~~g~~~~v~y~~~~~~~~~~vp~~~~~l~~q~~~~~~ 301 (329) T protein:vir:79 236 ETTMSYLDYFKQQNG-------GITIESISELED-------IDGAGTKAALVYEKDPMNMSIEIPEAFNMLTAQPKDLHF 301 (329) T ss_pred CCCccHHHHHHHhCC-------CcEEEEcccccc-------cCCCCceEEEEEecCCceEEEecCcceeeeeceecCceE Confidence 679999999999875 578999999864 356789999999999999999999999999999999999 Q ss_pred EEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 
330 TVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 330 ~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) ++||++|||||+||||+||+|+|== T Consensus 302 ~v~~~~r~~Gv~i~~P~ai~~~dGI 326 (329) T protein:vir:79 302 KVPCTSKCTGLTIYRPLTLVLIKGL 326 (329) T ss_pred EEceeeeEEEEEEECcceeeeeeee Confidence 9999999999999999999999955 No 2 >protein:vir:5255 Length: 304 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852760;genbank:gi:31544035;uniprot:Q7Y5U0;genbank:GeneID:2753552 Probab=100.00 E-value=2.8e-91 Score=517.00 Aligned_cols=295 Identities=18% Similarity=0.190 Sum_probs=278.2 Q ss_pred hhhHHHHHHHHHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeecccccee--EecCCCcccceeeeccceeEE Q lcl|NC_021342. 50 DGGIAFYISQLAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYDGVTMGK--FIGANGQDLPRVAQSAQMHTV 127 (354) Q Consensus 50 ~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~G~a~--~~~~~~~dip~v~~~~~~~~~ 127 (354) =++++||++||++||++|||+++++++++++||+.++++||+++++|.+++.+|+++ +++++++|||+++++++++.. T Consensus 1 ~~~lafl~~qL~~id~~vye~~~~~~~~~~lipv~t~~~~~~~~~~~~~~d~~G~a~~~~i~~~a~dip~vd~~~~~~~~ 80 (304) T protein:vir:52 1 MSLLAYVKNGLTAVSKDIAETKYPEIVFPQFVYVDQQTAVGITEKLHYGADEHGSLDDGLITVGTSTLDQVEVGFTPTRS 80 (304) T ss_pred CchHHHHHHHHHHHhhhhhccccccchhhhhccccCCCCcccceEEEeeeeccCcccccccCCcCCccceeecccceeEE Confidence 356899999999999999999999999999999999999999999999999999999 999999999999999999999 Q ss_pred EEEEEEeeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehh-hCceeeeecCCccccc-----cccccc Q lcl|NC_021342. 128 PLGYAGNECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDAS-RGMYGLFNNPNVTLSS-----ATKDYK 201 (354) Q Consensus 128 pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~-~gi~GLlN~p~~~~~~-----~~~~W~ 201 (354) |++.|+.+|+|+++||++|++.|++|+++|+++|++++++++|+++|+|++. 
.|++||||+|+++..+ ++++|+ T Consensus 81 ~i~~~~~~~~y~~~El~~a~~~g~~l~~~ka~aa~~a~~~~~n~v~~~Gd~~~~g~~GllN~p~v~~~~~~~~~a~~~w~ 160 (304) T protein:vir:52 81 YIVPWAKSVTWTKPELEQGKLLGLALNTAKIMALNKNAQQTLQKVAFLGHAKDSRLTGLLNNKSVEVYAIKGAAQNTKVQ 160 (304) T ss_pred EEEEEeeeeeecHHHHHHHHHhCCCcHHHHHHHHHHHHHhhhceEEEEeeccccceEEEEeCCCcceeeecCCccCCccc Confidence 9999999999999999999999999999999999999999999999999985 7999999999998532 346799 Q ss_pred ccCHHHHHHHHHHHHHHHHHHhCCcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeeeee Q lcl|NC_021342. 202 TMNGQELFNMLNAPIFSVINLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQL 281 (354) Q Consensus 202 ~~T~~ei~~di~~~~~~l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L 281 (354) +||++||++||++++++++.+|+|.+.|++|+|||+.|.+|++++++ ++++|+|+||++||++ .+|++|+|+.+++. T Consensus 161 ~~T~~eI~~di~~~~~~i~~~s~~~~~p~tl~Lpp~~~~~l~~~~~~-~~~~Tvl~~l~~n~~~--~~g~~l~I~~v~~~ 237 (304) T protein:vir:52 161 AMDFDKAVAFFKEIFLKGMEKTKRIEAPNTFAIDSLDLAHLALVQRA-NTDTTALEFLTKHLSA--AAGRQVAIKALPSN 237 (304) T ss_pred cCCHHHHHHHHHHHHHHHHhccCceecCceEEeCHHHHHHHhhccCC-CCCchHHHHHHHhccc--ccCCcceEEEeccc Confidence 99999999999999999999999999999999999999999987755 5889999999999986 47999999999864 Q ss_pred eeccccccccccCcccEEEEEEcCcceEEEeeCchhhhccccccCc-eeEEeeeeeeeeEEEECCceeEeeec Q lcl|NC_021342. 282 DAAELAANGVSNSNKPRYMVYDKSDRNLAMANPIPFRMLAPQMASL-GITVPAEYKISGTEFRYPLCAAYVDM 353 (354) Q Consensus 282 ~~~~~~~~g~g~~g~d~~v~y~~~~~~~~~~vp~~~~~~~~~~~~l-~~~~~~~~~~gGv~i~~P~ai~y~D~ 353 (354) . 
.++|.+|+||||+|++|++++++++||||++||+|++++ .|++||++|+|||+||||+|++|+|+ T Consensus 238 ~------~~~g~~g~~r~vvY~~d~~~~~~~vP~p~~~l~~q~~~~~~~~vp~~~r~gGv~v~~P~a~~y~D~ 304 (304) T protein:vir:52 238 Y------GTRVTDGKTRAMVYVNSKEHVIFDVPMSPTVLDAQPKGLLAFESGLRMAFGGVTFMEPDSALYVDY 304 (304) T ss_pred c------cccCCCCceEEEEEecChhheEEecCccccccchhhcCCceEEecceeeeeeEEEEccceeeeecC Confidence 3 234678999999999999999999999999999999986 79999999999999999999999999 No 3 >protein:vir:104342 Length: 314 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398971;genbank:gi:81343955;genbank:GeneID:3778874 Probab=100.00 E-value=6.2e-90 Score=509.64 Aligned_cols=311 Identities=22% Similarity=0.274 Sum_probs=281.2 Q ss_pred CcccchhHHHHhhhhhhhcccccccccchhhhhhhhhhhccCCceeccchhhHHHHHHHHHHHHHHHHHhhhhcccchhh Q lcl|NC_021342. 1 MAIKTIDAQTIQGNQWLVHKGYVSRNGDQWVINNTALDAIGNPNIMLDADGGIAFYISQLAGIEATVYETPYGDITYRFD 80 (354) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~am~a~~~~~~~~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~ 80 (354) || + +|+.+..++.+....--..++|++++|+++||++||++|||+++++++++++ T Consensus 1 ~~--------------------~-----~~~~~~~~~~~~~~~~~~~~~d~~~~fl~~ql~~id~~v~e~~~~~~~~~~~ 55 (314) T protein:vir:10 1 MA--------------------I-----KFDAEQAKITTHLEQMGVEKADAAGIWAVSQLTAALNRAYEKEYAENSVVNI 55 (314) T ss_pred Cc--------------------c-----chHHHHHHHHHHHHhhcccchhhhHHHHHHHHHHHHHHHhhhhcccccccee Confidence 11 1 1222222222211111136677889999999999999999999999999999 Q ss_pred ccccCCCCCceeEEEEEeeccccceeEecCCCcccceeeeccceeEEEEEEEEeeEeecHHHHHHHHHhCCCcchHHHHH Q lcl|NC_021342. 
81 VPMAANIPEYADTWMYRSYDGVTMGKFIGANGQDLPRVAQSAQMHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQARL 160 (354) Q Consensus 81 v~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~v~~~~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~a 160 (354) ||+.++++||+++++|.+++.+|++++|+++++|+|+++++++++++|+++|+.+|+|+++||+++++.|+||+++|+.+ T Consensus 56 i~v~~~~~~~~et~~~~~~e~~G~a~~~~d~~~dip~vd~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~a 135 (314) T protein:vir:10 56 FPVTNEIPGHAKYFEYPEFDGVGIAQIIADYSDDLPLVDAFMTEKQGKVFRFGNAFLISTDEIKAGAATGQSLSARKQAL 135 (314) T ss_pred eccccCCCCceeEEEeeeeccccceeeeCCcccccceeecccceeEEEEEEEEeeEEecHHHHHHHHHhCCChHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhhheeeeeehhhCceeeeecCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCcccccEEEeCHHHHH Q lcl|NC_021342. 161 AFRGAEEHSQSVAYFGDASRGMYGLFNNPNVTLSSATKDYKTMNGQELFNMLNAPIFSVINLSRRFHVPNTALMFPDLWN 240 (354) Q Consensus 161 A~~~~a~~~n~~~f~G~~~~gi~GLlN~p~~~~~~~~~~W~~~T~~ei~~di~~~~~~l~~~s~g~~~p~~L~l~p~~~~ 240 (354) |++++++++|+++|+|++++|++||||+||++..+++++|+ |++||++||++++++++++|+|.+.|++|+|||+.|. T Consensus 136 A~~~~~~~~n~i~f~G~~~~g~~GLlN~p~v~~~~~~~~Wa--T~~ei~~Di~~~~~~l~~~s~g~~~p~~l~Lpp~~~~ 213 (314) T protein:vir:10 136 AFEAHDNLLDKLVWSGSAPHGIVSVFDQPNINNVVATPNWS--VPQNAIDDVTAMIDAVESSTQGLHHVTDILLPASARR 213 (314) T ss_pred HHHHHHHhhceEEEeecccccceeEeecCCCccccCCCCcc--cHHHHHHHHHHHHHHHHHhcCccccceeEEecHHHHH Confidence 99999999999999999999999999999999888888994 7999999999999999999999999999999999999 Q ss_pred HHhhcccCCCCCchHHHHHHhhCcccccccccceeeeeeeeeeccccccccccCcccEEEEEEcCcceEEEeeCchhhhc Q lcl|NC_021342. 
241 QANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQLDAAELAANGVSNSNKPRYMVYDKSDRNLAMANPIPFRML 320 (354) Q Consensus 241 ~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~~g~g~~g~d~~v~y~~~~~~~~~~vp~~~~~~ 320 (354) +|+++ ++++++|+++||++||+ +|+|+.+++|+++ |.+|++||++|+++++++++++||||++| T Consensus 214 ~L~~~--~~~~~~tvl~~l~~n~~-------~l~I~~~~el~~a-------g~~g~~~~v~y~~~~~~~~~~vp~~~~~l 277 (314) T protein:vir:10 214 VMQGL--VPQTNLSYGELFTRNNP-------GLTIRFLQFLDNY-------DGAGGKAALAFEKSPLNMSIEIPEVTNVL 277 (314) T ss_pred hhccc--ccCCCccHHHHHHHhCC-------CcEEEEccccccc-------CCCcceEEEEEecCCcEEEEecCccceee Confidence 99754 36789999999999875 6889999998643 56789999999999999999999999999 Q ss_pred cccccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 321 APQMASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 321 ~~~~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) |+|+++++|++||++|||||+||||.||+|+|== T Consensus 278 ~~e~~~~~~~~~~~~r~~Gv~i~~P~ai~~~dGI 311 (314) T protein:vir:10 278 PAQPKDLHFRYPVTSKATGLIVYRPLTMAVIKGI 311 (314) T ss_pred cceecCceEEEcceeeeEEEEEECcceeEeeeee Confidence 9999999999999999999999999999999844 No 4 >protein:vir:107687 Length: 319 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003898;genbank:gi:45686314;genbank:GeneID:2773027 Probab=100.00 E-value=2.3e-88 Score=501.00 Aligned_cols=316 Identities=21% Similarity=0.273 Sum_probs=287.7 Q ss_pred cccccccccchhhhhhhhhhhccCCceeccch-hhHHHHHHHHHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEE Q lcl|NC_021342. 19 HKGYVSRNGDQWVINNTALDAIGNPNIMLDAD-GGIAFYISQLAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYR 97 (354) Q Consensus 19 ~~~~~~~~~~~~~~~~~am~a~~~~~~~~da~-~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~ 97 (354) +++ + ..|+++...+++.++.- .+.-||. +.+.|+++||++||++++|++++++++|++||+.++++||+++++|. 
T Consensus 1 ~~~-~--~~~~~~~~~~~~~~~~~-~~~~da~~~~g~~~~~ql~~id~~v~e~~~~~l~~~~~i~v~~~~~~~~~~~~~~ 76 (319) T protein:vir:10 1 MTT-K--KFDEADKSNVEMYLIQA-GVKQDAAATMGIWTAQELHRIKSQSYEEDYPVGSALRVFPVTTELSPTDKTFEYM 76 (319) T ss_pred CCC-c--chhHHhhHHHHHHHhhc-cchhhhhhhhhhHHHHHHHHHHHHHHhhhhcceechhhcccccCCCCceEEEEee Confidence 333 2 23566777777777542 2455553 45689999999999999999999999999999999999999999999 Q ss_pred eeccccceeEecCCCcccceeeeccceeEEEEEEEEeeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeee Q lcl|NC_021342. 98 SYDGVTMGKFIGANGQDLPRVAQSAQMHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGD 177 (354) Q Consensus 98 ~~~~~G~a~~~~~~~~dip~v~~~~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~ 177 (354) +++.+|++++|+++++|+|+++++.+++.+|+++|+.+|+|+++||++++++|+||+++|+.+|++++++++|+++|+|+ T Consensus 77 ~~~~~G~a~~~~d~~~dip~v~~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f~G~ 156 (319) T protein:vir:10 77 TFDKVGTAQIIADYTDDLPLVDALGTSEFGKVFRLGNAYLISIDEIKAGQATGRPLSTRKASACQLAHDQLVNRLVFKGS 156 (319) T ss_pred eeccccceeeecCccccccceeccceeeEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeec Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhCceeeeecCCcccccccc--cccccCHHHHHHHHHHHHHHHHHHhCCcccccEEEeCHHHHHHHhhcccCCCCCchH Q lcl|NC_021342. 
178 ASRGMYGLFNNPNVTLSSATK--DYKTMNGQELFNMLNAPIFSVINLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRTV 255 (354) Q Consensus 178 ~~~gi~GLlN~p~~~~~~~~~--~W~~~T~~ei~~di~~~~~~l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tv 255 (354) +++|++||||+||++..++++ +|+++|++||++||++++++++++|+|++.|++|+|||+.|.+|++++ +++++|+ T Consensus 157 ~~~g~~GLlN~p~~~~~~~~~~~~~~t~t~~~i~~di~~~~~~l~~~s~g~~~p~~L~L~p~~~~~L~~~~--~~~~~t~ 234 (319) T protein:vir:10 157 APHKIVSVFNHPNITKITSGKWIDVSTMKPETAEAELTQAIETIETITRGQHRATNILIPPSMRKVLAIRM--PETTMSY 234 (319) T ss_pred ccccceeEEeCCCceeeecCCCCCccccCHHHHHHHHHHHHHHHHHhcCceeeceEEEecHHHHHhhhccc--CCCCeeH Confidence 999999999999998876653 467899999999999999999999999999999999999999998654 4689999 Q ss_pred HHHHHhhCcccccccccceeeeeeeeeeccccccccccCcccEEEEEEcCcceEEEeeCchhhhccccccCceeEEeeee Q lcl|NC_021342. 256 MQHFMEANSYTLLTGNELDIQIRFQLDAAELAANGVSNSNKPRYMVYDKSDRNLAMANPIPFRMLAPQMASLGITVPAEY 335 (354) Q Consensus 256 l~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~~g~g~~g~d~~v~y~~~~~~~~~~vp~~~~~~~~~~~~l~~~~~~~~ 335 (354) ++||++||+ +++|+.+++|+.+ +.+|+|||++|+++++++++++||||++||+|+++++|++||++ T Consensus 235 l~~lk~~~~-------~l~I~~~pel~~a-------g~~g~~~~v~y~~~~~~~~~~v~~~~~~~~~e~~~l~~~~~~~~ 300 (319) T protein:vir:10 235 LDYFKSQNS-------GIEIDSIAELEDI-------DGAGTKGVLVYEKNPMNMSIEIPEAFNMLPAQPKDLHFKVPCTS 300 (319) T ss_pred HHHHHHhcC-------CceEEEeeeeccc-------CCCcceEEEEEecCCceEEEecCcceeeeeeeecCceEEEeeee Confidence 999999875 5789999998643 56789999999999999999999999999999999999999999 Q ss_pred eeeeEEEECCceeEeeecC Q lcl|NC_021342. 
336 KISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 336 ~~gGv~i~~P~ai~y~D~~ 354 (354) |||||+||||+||+|+|== T Consensus 301 r~~Gv~i~~P~ai~~~dGI 319 (319) T protein:vir:10 301 KCTGLTIYRPMTIVLITGV 319 (319) T ss_pred eeEEEEEEccceeEeeecC Confidence 9999999999999999966 No 5 >protein:vir:103285 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277465;genbank:gi:71834107;genbank:GeneID:3562396 Probab=100.00 E-value=1.9e-88 Score=501.51 Aligned_cols=292 Identities=20% Similarity=0.285 Sum_probs=277.2 Q ss_pred eecc-chhhHHHHHHHHHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeeccccceeEecCCCcccceeeeccc Q lcl|NC_021342. 45 IMLD-ADGGIAFYISQLAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYDGVTMGKFIGANGQDLPRVAQSAQ 123 (354) Q Consensus 45 ~~~d-a~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~v~~~~~ 123 (354) |++| ||++++|+++||++||++|+|++++++++|++||+.++++||+++++|++++.+|++++|+++++|+|+++++.+ T Consensus 1 ~~~~~a~~~~~f~~~ql~~id~~v~e~~~~~l~~~~~i~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~v~~~~~ 80 (296) T protein:vir:10 1 MGVDKADAAGIWTVKQLTASLNKAYETEYDQNSVVNLFPVSNEIPGYAKYFEYPVFDGVGIAQIVADYTDDLPLVDALAT 80 (296) T ss_pred CcccchhhhHHHHHHHHHHHHHHHHhhhhcccccceecccccCCCCceeEEEeeeeeccCceeEeCCCccccceeeccce Confidence 6666 688899999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeEEEEEEEEeeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehhhCceeeeecCCccccccccccccc Q lcl|NC_021342. 
124 MHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDASRGMYGLFNNPNVTLSSATKDYKTM 203 (354) Q Consensus 124 ~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~gi~GLlN~p~~~~~~~~~~W~~~ 203 (354) ++.+|+++++.+|+|+++||++|++.|+||+++|+.+|++++++++|+++|||++++|++||||+||++..+++++|++ T Consensus 81 ~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~ka~aA~~~~~~~~n~~~f~G~~~~g~~GLlN~p~v~~~~~~~~W~~- 159 (296) T protein:vir:10 81 ERQGKVFRFGNAFLISIDEIKVGQATGQSLSTRKQSLAFEAHDKLLDKLVWSGSTAHGIPSVFDYPNINNVVSGGSWSQ- 159 (296) T ss_pred eEEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeecccccceeEeecCCCccccccCCccC- Confidence 9999999999999999999999999999999999999999999999999999999999999999999998888889976 Q ss_pred CHHHHHHHHHHHHHHHHHHhCCcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeeeeeee Q lcl|NC_021342. 204 NGQELFNMLNAPIFSVINLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQLDA 283 (354) Q Consensus 204 T~~ei~~di~~~~~~l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~ 283 (354) +.+|++||++++++++++|+|++.|++|+|||+.|.+|++++ +++++|+++||++|++ +++|+.+++|+. T Consensus 160 -~t~i~~Di~~~~~~l~~~s~g~~~p~~l~L~p~~~~~L~~~~--~~~~~t~l~~ik~~~~-------~l~i~~~~~l~~ 229 (296) T protein:vir:10 160 -PTTAVSDITSLLDIIETSTNGQHRATHLLLPTTARRIMQNLV--PGTSVSYGEFFRQNNS-------GVTVEFVQYLND 229 (296) T ss_pred -HHHHHHHHHHHHHHHHHhhCceecceeEEeCHHHHHHHhhcc--CCCCccHHHHHHHhcC-------CceEEEeeeecc Confidence 459999999999999999999999999999999999998654 5789999999999875 578999999864 Q ss_pred ccccccccccCcccEEEEEEcCcceEEEeeCchhhhccccccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 
284 AELAANGVSNSNKPRYMVYDKSDRNLAMANPIPFRMLAPQMASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 284 ~~~~~~g~g~~g~d~~v~y~~~~~~~~~~vp~~~~~~~~~~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) + +.+|+++||+|+++++++++++||||+++|+|+++++|++||++|+|||+||||.||+|+|== T Consensus 230 a-------~~~g~~~~v~~~~~~~~~~~~v~~~~~~~~~e~~~l~~~~~~~~~~~Gv~i~~P~ai~~~dGI 293 (296) T protein:vir:10 230 Y-------NGTGTSAAIAYEKDPNNMAIEIPEATNALPAQPKDLHFKIPVTSKATGLIVYRPLTMAVMKGI 293 (296) T ss_pred C-------CCCcceEEEEEEcCCceEEEEcCcceeeecccccCceEEEeeEeeEEEEEEECCceeEEEeee Confidence 3 556899999999999999999999999999999999999999999999999999999999743 No 6 >protein:vir:80068 Length: 301 # NCBI annotation: gp8 # Family: family:all:463 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468712;genbank:gi:157325292;genbank:GeneID:5601759 Probab=100.00 E-value=7.5e-87 Score=492.74 Aligned_cols=294 Identities=17% Similarity=0.256 Sum_probs=278.6 Q ss_pred eeccchhhHHHHHHHHHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeeccccceeEecCCCcccceeeeccce Q lcl|NC_021342. 45 IMLDADGGIAFYISQLAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYDGVTMGKFIGANGQDLPRVAQSAQM 124 (354) Q Consensus 45 ~~~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~v~~~~~~ 124 (354) +..| +.++|+++||++||++++|++++++.+|+|+|+.++++||++++.|++++.+|++++++++++|+|++++++++ T Consensus 1 ~~~~--~~g~f~~~~l~~id~~v~e~~~~~l~~r~l~~v~~~~~~~~~~~~~~~~~~~G~~~~~~~~~~dip~~~~~~~~ 78 (301) T protein:vir:80 1 MQGK--ITATIEARDLQAIDNVIYEPKQEELTARSVFPQKFDVNEGAESYSFDVMTRSGAAKIIANGADDLPLVDVDMVR 78 (301) T ss_pred CCcc--ccchhhHHHHHHHHHHHHHhhhhhhhhhhhcccccCCCCceEEEEEeeeccceeEEEecCccccccccccccee Confidence 3444 55789999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eEEEEEEEEeeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehhhCceeeeecCCccccc-------cc Q lcl|NC_021342. 
125 HTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDASRGMYGLFNNPNVTLSS-------AT 197 (354) Q Consensus 125 ~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~gi~GLlN~p~~~~~~-------~~ 197 (354) +.+|++.++.+|+|+++||++++++|+||+++|+.+|++++++++|+++|+|++++|++||||+||++... .. T Consensus 79 ~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~f~G~~~~g~~GLlN~p~~~~~~~~~~~~~~~ 158 (301) T protein:vir:80 79 KSVPIYSIGIGLSYTIQDLRAARMQGTTVDAAKATTVRRAIAEKENSIAFRGEKKYAIKGAFEATGIQIDVSPTTGVGNV 158 (301) T ss_pred EEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEeeecccccceeeecCCCcccccccCcccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999986532 23 Q ss_pred ccccccCHHHHHHHHHHHHHHHHHHhCCcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeee Q lcl|NC_021342. 198 KDYKTMNGQELFNMLNAPIFSVINLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQI 277 (354) Q Consensus 198 ~~W~~~T~~ei~~di~~~~~~l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~ 277 (354) ++|++||++||++||++++++++.+++|++.|++|+|||+.|.+|+++++++++++|+++||++|+++ ++|+. T Consensus 159 ~~w~~~t~~ei~~di~~~~~~l~~~s~g~~~p~~L~L~p~~~~~L~~~~~~~~~~~tvl~~l~~~~~~-------~~I~~ 231 (301) T protein:vir:80 159 SKWEKKTAEQIIDEIGEAHTKITVLPGYGTASLKLCLPPKQFELINKKRYSNEDSRSVLKVLQDNAWF-------SAIVR 231 (301) T ss_pred cccccCCHHHHHHHHHHHHHHHHHhcCceecccEEEecHHHHHhhhhccccCCCCeeHHHHHHHHcCc-------ceEEE Confidence 57999999999999999999999999999999999999999999999999999999999999999875 67999 Q ss_pred eeeeeeccccccccccCcccEEEEEEcCcceEEEeeCchhhhccccccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 
278 RFQLDAAELAANGVSNSNKPRYMVYDKSDRNLAMANPIPFRMLAPQMASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 278 ~~~L~~~~~~~~g~g~~g~d~~v~y~~~~~~~~~~vp~~~~~~~~~~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) +++|+.+ |.+|+|||++|+++++++++++||||++||+|+++++|++||++|+|||+||||+||+|+|== T Consensus 232 ~p~L~~~-------g~~g~~~~v~~~~~~d~~~~~v~~~~~~~~~e~~~~~~~~~~~~r~~Gv~i~~P~ai~~~~GI 301 (301) T protein:vir:80 232 VPDLAGM-------GTAGSDSFAVIHDSNETAELIIPMDITRHPEEYSFPRTKVPFEERTAGVVVRFPAAIVRVDGI 301 (301) T ss_pred cceeccC-------CCCcccEEEEEecCCcEEEEEecCceeeecceecCceeEeeeeeeeEEEEEEccceEEEEecC Confidence 9998643 556899999999999999999999999999999999999999999999999999999999966 No 7 >protein:vir:94070 Length: 339 # NCBI annotation: putative structural protein # Family: family:all:1653 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453625;genbank:gi:84662661;genbank:GeneID:5142580 Probab=100.00 E-value=1.9e-82 Score=468.62 Aligned_cols=332 Identities=14% Similarity=0.074 Sum_probs=291.7 Q ss_pred CcccchhHHHHh--hhhhhhcccccccccchhhhhhhhhhhcc-CCceeccchhhHHHHHHHHHHHHHHHHHhhhhcccc Q lcl|NC_021342. 1 MAIKTIDAQTIQ--GNQWLVHKGYVSRNGDQWVINNTALDAIG-NPNIMLDADGGIAFYISQLAGIEATVYETPYGDITY 77 (354) Q Consensus 1 ~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~am~a~~-~~~~~~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~ 77 (354) |+|+ +|.+.|. ++.|+++++....+-+. .+.-.|||++. .|.+.+-+++++ .+++|++||++|||++++++++ T Consensus 1 ~~~~-~~~~~~~~l~~~g~~~~~~~~~~~~~-~~~~~a~d~~~~~~~~~~~~~~~i--~a~~~~~i~~~vy~~~~~~~~~ 76 (339) T protein:vir:94 1 MSIN-NDRTDIKQLEKVGIIFDGYSPKSISS-EVSAYAMDAVNLTPTLQTTANAGI--PAWMTTFVDRRVIDIQLAPMAA 76 (339) T ss_pred Ccee-chHHHHHHHHhhceeeccchhhhcch-hhHhhhccccccccccccccccch--hhhhhhhhchhheeecccccch Confidence 7775 6888775 78999998666654322 23346999964 333444444333 3557899999999999999999 Q ss_pred hhhccccCCCCCceeEEEEEeeccccceeEecCCCcccceeeeccceeEEEEEEEEeeEeecHHHHHHHHHhCCCcchHH Q lcl|NC_021342. 
78 RFDVPMAANIPEYADTWMYRSYDGVTMGKFIGANGQDLPRVAQSAQMHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQ 157 (354) Q Consensus 78 r~~v~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~v~~~~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k 157 (354) +++||+.++++|++++++|.++|.+|+|++|+|++++ |++++++++++++++.++.+|+|+++|+++|+++|++|+++| T Consensus 77 ~~l~pv~t~g~w~~~t~~y~~~e~~G~a~~ygd~ad~-Pl~~~~v~~~~~~v~~~~~g~~y~~~E~~~A~~~g~~l~~~K 155 (339) T protein:vir:94 77 AKIFPEVKKGDWTTTYGVFIIAEPVGQVATYSDWSAN-GMSKANVNFESRQNYRYQTWTEYGDLEMATYGEAGIDYVARQ 155 (339) T ss_pred hhhcccccCCCCcccEEEEeeeecccceEEcccccCC-CcccccceeeEEeEEEEEEEEeecHHHHHHHHhhCCChHHHH Confidence 9999999999999999999999999999999999876 999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHhhheeeeeehhhCceeeeecCCccc-ccccccccccCHHHHHHHHHHHHHHHHHHhCCcc---cccEEE Q lcl|NC_021342. 158 ARLAFRGAEEHSQSVAYFGDASRGMYGLFNNPNVTL-SSATKDYKTMNGQELFNMLNAPIFSVINLSRRFH---VPNTAL 233 (354) Q Consensus 158 ~~aA~~~~a~~~n~~~f~G~~~~gi~GLlN~p~~~~-~~~~~~W~~~T~~ei~~di~~~~~~l~~~s~g~~---~p~~L~ 233 (354) +.+|++++++++|+++|+|++++|++||||||+++. .+++++|++||++||++||++++++|+.+|+|.+ .|++|+ T Consensus 156 a~aA~~al~~~~N~i~~~Gd~~~~~~GLlN~P~l~~~v~~s~~Wa~kT~~eI~~Di~~~~~~l~~~s~g~~~~~~~~~L~ 235 (339) T protein:vir:94 156 EISASLVMAKFANSSYLLGVAGIANYGLMNDPSLPAPVAATVNWATAAPEDIANDVVAMVGRLISQSGGLITGQERMVMA 235 (339) T ss_pred HHHHHHHHHHhhceEEeeeecccceEEEEeCCCccccccCCCCcccCCHHHHHHHHHHHHHHHHHhcCCeeeeccCcEEE Confidence 999999999999999999999999999999999975 4567899999999999999999999999999875 577999 Q ss_pred eCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeeeeeeeccccccccccCcccEEEEEEcCcceEEEee Q lcl|NC_021342. 
234 MFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQLDAAELAANGVSNSNKPRYMVYDKSDRNLAMAN 313 (354) Q Consensus 234 l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~~g~g~~g~d~~v~y~~~~~~~~~~v 313 (354) |||+.|.+|+++ +.+++|+++||++|++ +++|+.++||+.+ +.++..+|+.|.++++++++++ T Consensus 236 LP~~~~~~L~~~---n~~~~Tvl~~lk~n~p-------nl~i~~~~el~~a-------~g~~~~~~~~~~~~~~~~~~~~ 298 (339) T protein:vir:94 236 LAPSALNNVNRT---NNFGLSAGAKIAQTYP-------NIQFVAVPEFDTA-------SGRLVQLWVPEVNGQPTGEVAF 298 (339) T ss_pred ecHHHHHhcccC---CcCCccHHHHHHHhcC-------CcEEEEccccccC-------CCceEEEEEEeccCCcceEEEc Confidence 999999999865 4578999999999865 4789999998632 3345678888888999999999 Q ss_pred CchhhhccccccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 314 PIPFRMLAPQMASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 314 p~~~~~~~~~~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) ||||++||+|+++++|++||++|||||+||||+||+|+|== T Consensus 299 p~~~~~lpvq~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 339 (339) T protein:vir:94 299 AEKLRSHSIERYSTTTRQKHSGATFGAVIYQPWAVTQELGV 339 (339) T ss_pred chhhhccccEEcCceEEecceeeeeeEEEEccceeeeeecC Confidence 99999999999999999999999999999999999999866 No 8 >protein:vir:101557 Length: 336 # NCBI annotation: gp12 # Family: family:all:1653 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958117;genbank:gi:41057663;genbank:GeneID:2716814 Probab=100.00 E-value=2.8e-79 Score=451.24 Aligned_cols=327 Identities=13% Similarity=0.036 Sum_probs=288.1 Q ss_pred chhHHHHh--hhhhhhcccccccccchhhhhhhhhhhcc-CCceeccchhhH-HHHHHHHHHHHHHHHHhhhhcccchhh Q lcl|NC_021342. 5 TIDAQTIQ--GNQWLVHKGYVSRNGDQWVINNTALDAIG-NPNIMLDADGGI-AFYISQLAGIEATVYETPYGDITYRFD 80 (354) Q Consensus 5 ~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~am~a~~-~~~~~~da~~~~-~fl~~~L~~Id~~v~e~~~~~l~~r~~ 80 (354) -=|+|++. ++.|++++++..+++++... 
.||||+- .|.+.+-+++++ .||+ ++|||++|+++++++++.++ T Consensus 1 ~~~~~~~~~l~~~gi~~~~~~~~~~~~~~~--~~~da~d~~~~~~~~~~~~i~~~l~---~~i~p~~~~~~~~p~~a~~l 75 (336) T protein:vir:10 1 MRDAQRIQNLARAGVILPRSVQNVSTPLTE--YAMDAADLSPHLSSTGSSGIPNYLT---TYVDPAVIDILVAPMKAAEL 75 (336) T ss_pred CchHHHHHHHhhcCeeecchhhhhhhhHHH--hhhhhhhccCccccCCCchhHHHHH---hhcccceeeehhhhhhhhhh Confidence 56999998 99999999999999988755 5666532 355555555554 8888 89999999999999999999 Q ss_pred ccccCCCCCceeEEEEEeeccccceeEecCCCcccceeeeccceeEEEEEEEEeeEeecHHHHHHHHHhCCCcchHHHHH Q lcl|NC_021342. 81 VPMAANIPEYADTWMYRSYDGVTMGKFIGANGQDLPRVAQSAQMHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQARL 160 (354) Q Consensus 81 v~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~v~~~~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~a 160 (354) +|+.+.++|.+++++|.++|.+|++++|+|++| +|++|++++++++++++++.+|+|+++|+++|+++|++|+.+|+.+ T Consensus 76 ~pv~t~g~W~~~~~~~~~~e~~G~a~~ygd~~D-~P~~d~~~~~~~~~v~~~~~g~~yg~~El~~A~~~g~~l~~~Ka~a 154 (336) T protein:vir:10 76 VGESKKGDWTTLVAAFITAEPTTKVATYGDYSS-DGDSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYS 154 (336) T ss_pred ccccccCCccceeEEEeeeeceeeEEEeeccCC-CceeecccceeeeeEEEEEeeeeeCHHHHHHHHHhCCCcHHHHHHH Confidence 999998887789999999999999999999865 5999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhhheeeeeehhhCceeeeecCCccc--ccccccccccCHHHHHHHHHHHHHHHHHHhCCc---ccccEEEeC Q lcl|NC_021342. 161 AFRGAEEHSQSVAYFGDASRGMYGLFNNPNVTL--SSATKDYKTMNGQELFNMLNAPIFSVINLSRRF---HVPNTALMF 235 (354) Q Consensus 161 A~~~~a~~~n~~~f~G~~~~gi~GLlN~p~~~~--~~~~~~W~~~T~~ei~~di~~~~~~l~~~s~g~---~~p~~L~l~ 235 (354) |++++++++|+++|+|+++++++||||||+++. +.++++|.++|++||++||++++++|+.||+|. 
+.|++|+|| T Consensus 155 A~~ale~~~N~i~~~Gd~~~~~yGllN~P~l~a~~t~~t~~~~~~t~eei~~Di~~~~~~l~~qs~G~i~~~~~~tL~LP 234 (336) T protein:vir:10 155 SALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLP 234 (336) T ss_pred HHHHHHHhhCcEEEEeccccceEEEEeCCCCccccccCCCcccccCHHHHHHHHHHHHHHHHHhcCCeecccCcceEEec Confidence 999999999999999999999999999999974 334556788899999999999999999999986 779999999 Q ss_pred HHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeeeeeeeccccccccccCcccEEEEEEcCcceEEEeeCc Q lcl|NC_021342. 236 PDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQLDAAELAANGVSNSNKPRYMVYDKSDRNLAMANPI 315 (354) Q Consensus 236 p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~~g~g~~g~d~~v~y~~~~~~~~~~vp~ 315 (354) |+.+.+|+++ +.+++|+++||++|+|+ ++|+..++|+.+ +..+..+|+-+..+++..++.+|+ T Consensus 235 ~~~~~~Ls~~---n~~g~Tvl~~lk~n~Pn-------l~i~t~pEl~~a-------~G~~~~l~~~~~~~~~t~~~~~p~ 297 (336) T protein:vir:10 235 PTAMSDLSKT---NQYGLAAAAKLKDIFPK-------LEFVTIPEYDTA-------SGRLVQLWAPRVEGKDTATCGFTE 297 (336) T ss_pred HHHHHhccCC---CccCccHHHHHHHhcCc-------cEEEEccccccC-------CCceEEEEEEecCCCcceeeecch Confidence 9999999764 46789999999998664 678888887432 223444555556678899999999 Q ss_pred hhhhccccccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 
316 PFRMLAPQMASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 316 ~~~~~~~~~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) +|++||+|+++++|++||++|||||+||||++|+|+|== T Consensus 298 ~~~~l~vq~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:10 298 KMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) T ss_pred hhhccceeecCceeEeccccceeeeeeeccchheeeecC Confidence 999999999999999999999999999999999998855 No 9 >protein:vir:78558 Length: 336 # NCBI annotation: major capsid protein # Family: family:all:1653 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294848;genbank:gi:149882911;genbank:GeneID:5291029 Probab=100.00 E-value=3.3e-79 Score=450.82 Aligned_cols=324 Identities=13% Similarity=0.057 Sum_probs=286.7 Q ss_pred chhHHHHh--hhhhhhcccccccccchhhhhhhhhhhcc-CCceeccchhhH-HHHHHHHHHHHHHHHHhhhhcccchhh Q lcl|NC_021342. 5 TIDAQTIQ--GNQWLVHKGYVSRNGDQWVINNTALDAIG-NPNIMLDADGGI-AFYISQLAGIEATVYETPYGDITYRFD 80 (354) Q Consensus 5 ~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~am~a~~-~~~~~~da~~~~-~fl~~~L~~Id~~v~e~~~~~l~~r~~ 80 (354) -=|+|++. ++.|+++++......+++.. .||||+- .|.+.+-+++++ .||+ ++|||++|+++++++++.++ T Consensus 1 ~~~~~~~~~l~~~gi~~~~~~~~~~~~~~~--~a~da~d~~~~~~t~~~~g~~~~l~---~~i~p~~~~~~~~~~~~~~l 75 (336) T protein:vir:78 1 MRDAQRIQNLARAGVILPRSVKNVSTPLAE--YAMDAADLSPHLSSTGSSGIPNYLT---TYVDPSVIDILVAPMKAAEL 75 (336) T ss_pred CchHHHHHHHhccCeecchhhhhhhHHHHH--HHHhhhhhccccccCCCcchHHHHH---Hhcccceeeehhhhhhhhhh Confidence 46888887 89999999999888888655 5777543 355666666654 8998 89999999999999999999 Q ss_pred ccccCCCCCceeEEEEEeeccccceeEecCCCcccceeeeccceeEEEEEEEEeeEeecHHHHHHHHHhCCCcchHHHHH Q lcl|NC_021342. 
81 VPMAANIPEYADTWMYRSYDGVTMGKFIGANGQDLPRVAQSAQMHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQARL 160 (354) Q Consensus 81 v~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~v~~~~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~a 160 (354) +|+.+.++|.+++++|.++|.+|++++|+|++|+ |++|++++++++++++++.+|+|+++|+++|+++|++|+.+|+.+ T Consensus 76 ~~v~t~g~W~~~~~~~~~~e~~G~a~~ygd~~D~-P~vd~~~~~~~~~v~~~~~g~~yg~~El~~A~~~g~~l~~~Ka~a 154 (336) T protein:vir:78 76 VGESKKGDWTTLVAAFITAEPTTTVATYGDYSSD-GDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYS 154 (336) T ss_pred cccccCCCccccEEEEeeeecceeeEEeecccCC-CeeecceeeEEEEEEEEEeeeeecHHHHHHHHHhCCCcHHHHHHH Confidence 9999987766789999999999999999998765 999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhhheeeeeehhhCceeeeecCCccc--ccccccccccCHHHHHHHHHHHHHHHHHHhCCc---ccccEEEeC Q lcl|NC_021342. 161 AFRGAEEHSQSVAYFGDASRGMYGLFNNPNVTL--SSATKDYKTMNGQELFNMLNAPIFSVINLSRRF---HVPNTALMF 235 (354) Q Consensus 161 A~~~~a~~~n~~~f~G~~~~gi~GLlN~p~~~~--~~~~~~W~~~T~~ei~~di~~~~~~l~~~s~g~---~~p~~L~l~ 235 (354) |++++++++|+++|+|++++|++||||||+++. +.++++|+++|++||++||++++++|+.+|+|. +.|++|+|| T Consensus 155 A~~ale~~~N~~~~~Gd~~~~~~GllN~P~l~a~~t~~~~~w~~~T~~~I~~Di~~~~~~l~~qt~g~~~~~~~~tL~Lp 234 (336) T protein:vir:78 155 SALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLP 234 (336) T ss_pred HHHHHHHhhCeEEEEeccccceEEEEeCCCCCcccccCcCcccccCHHHHHHHHHHHHHHHHHhcCCeeeeccceEEEec Confidence 999999999999999999999999999999975 335567899999999999999999999999986 468899999 Q ss_pred HHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeeeeeeeccccccccccCcccEEEEE---EcCcceEEEe Q lcl|NC_021342. 
236 PDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQLDAAELAANGVSNSNKPRYMVY---DKSDRNLAMA 312 (354) Q Consensus 236 p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~~g~g~~g~d~~v~y---~~~~~~~~~~ 312 (354) |+.+.+|+++ +.+++|+++||++|+|+ ++|+.+++|+.+ | ++++.+| ..++++++++ T Consensus 235 ~~~~~~L~~~---n~~g~tv~~~lk~n~Pn-------l~i~t~pel~~A-------g---g~~~~~~~~~~~~~~t~~~~ 294 (336) T protein:vir:78 235 PTAMSDLSKT---NQYGLSAAAKLKEIFPK-------LEFVTIPEYDTA-------S---GRLVQLWAPRVEGKDTATCG 294 (336) T ss_pred hHHHHhccCC---CccCccHHHHHHHhcCc-------cEEEEccccccc-------C---cceEEEEEeeccCCcceeee Confidence 9999999864 46789999999999663 678888988532 2 2345555 4457899999 Q ss_pred eCchhhhccccccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 313 NPIPFRMLAPQMASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 313 vp~~~~~~~~~~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) +|++|++||+|+++++|++||++|||||+||||++|+|+|== T Consensus 295 ~p~~f~~lpvq~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:78 295 FTEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) T ss_pred cchhhhccceeecCceeEeccccceeeeeeeccchheeeccC Confidence 999999999999999999999999999999999999998855 No 10 >protein:vir:3643 Length: 336 # NCBI annotation: gp12 # Family: family:all:1653 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705638;genbank:gi:23752323;genbank:GeneID:955719 Probab=100.00 E-value=4e-79 Score=450.37 Aligned_cols=327 Identities=13% Similarity=0.037 Sum_probs=287.9 Q ss_pred chhHHHHh--hhhhhhcccccccccchhhhhhhhhhhcc-CCceeccchhhH-HHHHHHHHHHHHHHHHhhhhcccchhh Q lcl|NC_021342. 5 TIDAQTIQ--GNQWLVHKGYVSRNGDQWVINNTALDAIG-NPNIMLDADGGI-AFYISQLAGIEATVYETPYGDITYRFD 80 (354) Q Consensus 5 ~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~am~a~~-~~~~~~da~~~~-~fl~~~L~~Id~~v~e~~~~~l~~r~~ 80 (354) -=|+|++. ++.|++++++..+++.+... 
.||||+- .|.+.+-+++++ .||+ ++|||++||++++++++.++ T Consensus 1 ~~~~~~~~~l~~~gi~~~~~~~~~~~~~~~--~~~da~d~~~~~~~~~~~~~~~~l~---~~i~p~~~~~~~~~~~~~~l 75 (336) T protein:vir:36 1 MRDAQRIQNLARAGVILPRSVQNVSTPLTE--YAMDAADLSPHLSSTGSSGIPNYLT---TYVDPSVIDILVAPMKAAEL 75 (336) T ss_pred CchHHHHHHHhhcCeeecchhhhhhhHHHH--hhhhhhhccCccccCCCcchHHHHH---HhhccceEeeecchhhhhhh Confidence 56999998 99999999999999988655 5676532 355555555554 8888 89999999999999999999 Q ss_pred ccccCCCCCceeEEEEEeeccccceeEecCCCcccceeeeccceeEEEEEEEEeeEeecHHHHHHHHHhCCCcchHHHHH Q lcl|NC_021342. 81 VPMAANIPEYADTWMYRSYDGVTMGKFIGANGQDLPRVAQSAQMHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQARL 160 (354) Q Consensus 81 v~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~v~~~~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~a 160 (354) +|+.+.++|.+++++|.++|.+|++++|+|++| +|++|++++++++++++++.+|+|+++|+++|+++|++|..+|+.+ T Consensus 76 ~pv~t~g~W~~~~~~~~~~e~~G~a~~ygd~~D-~P~~d~~~~~~~~~v~~~~~g~~yg~~E~~~Aa~~~~~l~~~Ka~a 154 (336) T protein:vir:36 76 VGESKKGDWTTLVAAFITAEPTTKVATYGDYSS-DGDSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYS 154 (336) T ss_pred ccccccCCccceeEEEeeeeceeeEEEeeccCC-CceeecccceeeeeEEEEEeeeeeCHHHHHHHHHhCCCcHHHHHHH Confidence 999998887789999999999999999999865 5999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhhheeeeeehhhCceeeeecCCccc--ccccccccccCHHHHHHHHHHHHHHHHHHhCCc---ccccEEEeC Q lcl|NC_021342. 161 AFRGAEEHSQSVAYFGDASRGMYGLFNNPNVTL--SSATKDYKTMNGQELFNMLNAPIFSVINLSRRF---HVPNTALMF 235 (354) Q Consensus 161 A~~~~a~~~n~~~f~G~~~~gi~GLlN~p~~~~--~~~~~~W~~~T~~ei~~di~~~~~~l~~~s~g~---~~p~~L~l~ 235 (354) |++++++++|+++|+|+++++++||||||+++. +.++++|.++|++||++||++++++|+.+|+|. 
+.|++|+|| T Consensus 155 A~~ale~~~N~i~~~Gd~~~~~yGllNdP~l~a~~t~~t~~~~~~t~~ei~~Di~~~~~~l~~qt~G~i~~~~~~tL~LP 234 (336) T protein:vir:36 155 SALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLP 234 (336) T ss_pred HHHHHHHhhCcEEEEeccccceEEEEecCCCccccccCCCcccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccEEEec Confidence 999999999999999999999999999999974 334556788999999999999999999999986 689999999 Q ss_pred HHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeeeeeeeccccccccccCcccEEEEEEcCcceEEEeeCc Q lcl|NC_021342. 236 PDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQLDAAELAANGVSNSNKPRYMVYDKSDRNLAMANPI 315 (354) Q Consensus 236 p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~~g~g~~g~d~~v~y~~~~~~~~~~vp~ 315 (354) |+.+.+|+++ +.+++|+++||++|+|+ ++|+..++|+.+ +..+..+|+-+..+++..++.+|+ T Consensus 235 ~~~~~~Ls~~---n~~g~Tvl~~lk~n~Pn-------l~i~t~pEl~~a-------~g~~~~l~~~~~~~~~t~~~~~p~ 297 (336) T protein:vir:36 235 PTAMSDLSKT---NQYGLAAAAKLKDIFPK-------LEFVTIPEYDTA-------SGRLVQLWAPRVEGKDTATCGFTE 297 (336) T ss_pred hHHHHhccCC---CccCccHHHHHHHhcCc-------cEEEEccccccC-------CCceEEEEEEecCCCcceeeecch Confidence 9999999764 46789999999998664 678888887432 223444455556678899999999 Q ss_pred hhhhccccccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 
316 PFRMLAPQMASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 316 ~~~~~~~~~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) +|++||+|+++++|++||++|||||+||||++|+|+|== T Consensus 298 ~~~~l~vq~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:36 298 KMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) T ss_pred hhhccceeecCceeEeccccceeeeeeeccchheeeecC Confidence 999999999999999999999999999999999998855 No 11 >protein:vir:106734 Length: 336 # NCBI annotation: gp13 # Family: family:all:1653 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944321;genbank:gi:38638620;genbank:GeneID:2657363 Probab=100.00 E-value=6.4e-79 Score=449.27 Aligned_cols=324 Identities=13% Similarity=0.047 Sum_probs=288.9 Q ss_pred chhHHHHh--hhhhhhcccccccccchhhhhhhhhhhcc-CCceeccchhhH-HHHHHHHHHHHHHHHHhhhhcccchhh Q lcl|NC_021342. 5 TIDAQTIQ--GNQWLVHKGYVSRNGDQWVINNTALDAIG-NPNIMLDADGGI-AFYISQLAGIEATVYETPYGDITYRFD 80 (354) Q Consensus 5 ~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~am~a~~-~~~~~~da~~~~-~fl~~~L~~Id~~v~e~~~~~l~~r~~ 80 (354) -=|+|++. ++.|+++++......+++.. .||||+- .|.+.+-+++++ .||+ ++|||++|+++++++++.++ T Consensus 1 ~~~~~~~~~l~~~gi~~~~~~~~~~~~~~~--~a~da~d~~~~~~t~~~~g~~~~l~---~~i~p~~~~~~~~~~~~~~l 75 (336) T protein:vir:10 1 MRDAQRIQNLARAGVILPRSVKNVSTPLAE--YAMDAADLSPHLSSTGSSGIPNYLT---TYVDPSVIDILVAPMKAAEL 75 (336) T ss_pred CchHHHHHHHhccCeecchhhhhhhHHHHH--HHHhhhhhccccccCCCcchHHHHH---hhcCcceeeeeechhchhhh Confidence 46888887 89999999999988888655 5777543 355666666654 8998 89999999999999999999 Q ss_pred ccccCCCCCceeEEEEEeeccccceeEecCCCcccceeeeccceeEEEEEEEEeeEeecHHHHHHHHHhCCCcchHHHHH Q lcl|NC_021342. 
81 VPMAANIPEYADTWMYRSYDGVTMGKFIGANGQDLPRVAQSAQMHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQARL 160 (354) Q Consensus 81 v~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~v~~~~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~a 160 (354) +|+.+.++|++++++|.++|.+|++++|+|+ +|+|++|++++++++++++++.+|+|+.+|+++|+++|++|+.+|+.+ T Consensus 76 ~~v~t~g~w~~~~~~~~~~e~~G~a~~ygd~-~d~P~~d~~~~~~~~~v~~~~~g~~yg~~El~~A~~~g~~l~~~Ka~a 154 (336) T protein:vir:10 76 VGESKKGDWTTLVAAFITAEPTTKVATYGDY-SSDGDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYS 154 (336) T ss_pred cccccCCCcceeeEEEEeeeeeeeEEEcccc-CCCcceeeeeeeeeeeEEEEEEEEeeCHHHHHHHHHhCCCcHHHHHHH Confidence 9999999999999999999999999999987 678999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhhheeeeeehhhCceeeeecCCccc--ccccccccccCHHHHHHHHHHHHHHHHHHhCCc---ccccEEEeC Q lcl|NC_021342. 161 AFRGAEEHSQSVAYFGDASRGMYGLFNNPNVTL--SSATKDYKTMNGQELFNMLNAPIFSVINLSRRF---HVPNTALMF 235 (354) Q Consensus 161 A~~~~a~~~n~~~f~G~~~~gi~GLlN~p~~~~--~~~~~~W~~~T~~ei~~di~~~~~~l~~~s~g~---~~p~~L~l~ 235 (354) |++++++++|+++|+|++++|++||||||+++. +.++++|++||++||++||++++++|+.+|+|. +.|++|+|| T Consensus 155 A~~ale~~~N~~~~~Gd~~~~~~GllN~P~l~a~~t~~~~~w~~~T~~eI~~Di~~~~~~l~~qt~g~i~~~~~~tL~Lp 234 (336) T protein:vir:10 155 SALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLP 234 (336) T ss_pred HHHHHHHhhCeEEEEeecccceEEEeecCCCCcccccCcCcccccCHHHHHHHHHHHHHHHHHhcCCeeeeccceEEEec Confidence 999999999999999999999999999999975 335567899999999999999999999999987 468899999 Q ss_pred HHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeeeeeeeccccccccccCcccEEEEEEc---CcceEEEe Q lcl|NC_021342. 236 PDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQLDAAELAANGVSNSNKPRYMVYDK---SDRNLAMA 312 (354) Q Consensus 236 p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~~g~g~~g~d~~v~y~~---~~~~~~~~ 312 (354) |+.+.+|+++ +.+++|+++||++|+|+ ++|+.+++|..+ |++++.+|.. 
++++++++ T Consensus 235 ~~~~~~L~~~---n~~g~tv~~~lk~n~Pn-------l~i~t~pel~~A----------gg~~~~~~~~~~~~~~t~~~~ 294 (336) T protein:vir:10 235 PTAMSDLSKT---NQYGLSAAAKLKEIFPK-------LEFVTIPEYDTA----------SGRLVQLWAPRVEGKDTATCG 294 (336) T ss_pred hHHHHhccCC---CccCccHHHHHHHhCCc-------cEEEEccccccc----------CCceEEEEEecccCCcceeee Confidence 9999999864 56789999999999664 678888888532 2245566644 47899999 Q ss_pred eCchhhhccccccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 313 NPIPFRMLAPQMASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 313 vp~~~~~~~~~~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) +|++|++||+|+++++|++||++|||||+||||++|+|+|== T Consensus 295 ~P~~f~~lpvq~~~~~~~v~~~~rt~Gv~i~rP~ai~~~~GI 336 (336) T protein:vir:10 295 FTEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMLGV 336 (336) T ss_pred cChhhhccceeecCceeEeccccceeeeeeeccchheeeccC Confidence 999999999999999999999999999999999999998855 No 12 >protein:vir:107732 Length: 379 # NCBI annotation: gp23 # Family: family:all:1653 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024871;genbank:gi:48697513;genbank:GeneID:2948349 Probab=100.00 E-value=8.9e-78 Score=443.00 Aligned_cols=328 Identities=13% Similarity=0.088 Sum_probs=277.3 Q ss_pred Ccc--cc--h-hHHHHhhhhhhhcccccccccchhhhhhhhhhhccCC-------ceeccchhhH-HHHHHHHHHHHHHH Q lcl|NC_021342. 1 MAI--KT--I-DAQTIQGNQWLVHKGYVSRNGDQWVINNTALDAIGNP-------NIMLDADGGI-AFYISQLAGIEATV 67 (354) Q Consensus 1 ~~~--~~--~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~am~a~~~~-------~~~~da~~~~-~fl~~~L~~Id~~v 67 (354) |+. +. + |++.+ ++.|+++++....+ +.....|||+.... .+.+-+++++ .||. 
+++ |.+ T Consensus 19 ~~~~~~~~~~~~~~~l-~~~gi~~~~~~~~~---~~~~~~amd~~~~~~~~~~~~~l~~~~~~g~~~~l~---~~~-p~~ 90 (379) T protein:vir:10 19 MVMDSADVTLDNLKHL-ESYGIHLNGRKNKL---FELMQFAMDSNDIGPIPTPLSPLSPVSIPGLIQFLQ---NWL-PGH 90 (379) T ss_pred hhhccccccHHHHHHH-HhcCccccchhhhh---hhhhhhhhccccccccccccCccccccccchHHHHH---hhc-chH Confidence 222 22 2 33333 57999998665544 34445699997322 3333344444 7886 566 899 Q ss_pred HHhhhhcccchhhccccCCCCCceeEEEEEeeccccceeEecCCCcccceeeeccceeEEEEEEEEeeEeecHHHHHHHH Q lcl|NC_021342. 68 YETPYGDITYRFDVPMAANIPEYADTWMYRSYDGVTMGKFIGANGQDLPRVAQSAQMHTVPLGYAGNECHYTLDEMRKSA 147 (354) Q Consensus 68 ~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~v~~~~~~~~~pv~~~~~~~~~~~~El~~a~ 147 (354) +++..+++++.+|||+.+.++|++++++|.++|.+|++++|+|++++ |+++++++++++++++++.+|+|+++|+++|+ T Consensus 91 i~~~tap~~a~~l~pv~t~g~W~~~~~~~~v~e~~G~A~~ygd~~d~-pl~d~~~~~~~r~v~~~~~g~~yg~~El~~Aa 169 (379) T protein:vir:10 91 VRILTAVREADEFLGLSTVGQWDDEQIVQRVLEGLGTAQPYTDGGNM-ALMSWTPTFETRTVVRFEAGLQVAPLEEARSS 169 (379) T ss_pred HHHHhhhhhhhhhcccccCCCceeeeEEEeeeeeeeeeEEeccccCC-CeeeeeeeeeeeeeEEEEEEEeecHHHHHHHH Confidence 99999999999999999999999999999999999999999998665 99999999999999999999999999999999 Q ss_pred HhCCCcchHHHHHHHHHHHHHhhheeeee--ehhhCceeeeecCCccccc-------ccccccccCHHHHHHHHHHHHHH Q lcl|NC_021342. 148 AMNMPIDAEQARLAFRGAEEHSQSVAYFG--DASRGMYGLFNNPNVTLSS-------ATKDYKTMNGQELFNMLNAPIFS 218 (354) Q Consensus 148 ~~g~~ld~~k~~aA~~~~a~~~n~~~f~G--~~~~gi~GLlN~p~~~~~~-------~~~~W~~~T~~ei~~di~~~~~~ 218 (354) ++|++|+++|+.+|++++++++|+++||| ++++++|||||||+++... 
+.++|++||++||++||++++++ T Consensus 170 ~~g~~l~~~Ka~aA~~ale~~~N~i~f~G~~d~~~~~yGllNdP~l~a~~t~atg~~~~t~Wa~kT~~eI~~Di~~~~~~ 249 (379) T protein:vir:10 170 RVQVSSADEKRAMVGEALEVQRNRVAFYGYNDGSGRTFGFLNDPNLPAYVAVPNGAGGSPLWAQKTTLEIIADLRNGLTA 249 (379) T ss_pred HhCCChHHHHHHHHHHHHHHhhceEEEEeecCCCcceEEEEeCCCCcccccccCCcccccccccCCHHHHHHHHHHHHHH Confidence 99999999999999999999999999999 5789999999999997421 23569999999999999999999 Q ss_pred HHHHhCCccc----ccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeeeeeeeccccccccccC Q lcl|NC_021342. 219 VINLSRRFHV----PNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQLDAAELAANGVSNS 294 (354) Q Consensus 219 l~~~s~g~~~----p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~~g~g~~ 294 (354) ++.+|+|.+. |++|+|||+.+.+|+++ ..+++|+++||++|++ +++|+.+++|+.+ +++ T Consensus 250 l~~qs~g~~~~~~~~~tL~LP~~~~~~L~~~---n~~g~Tvl~~lk~n~P-------nl~i~t~pEL~~a-------ggg 312 (379) T protein:vir:10 250 LQVQSMGRIKSNKTPITIGIPNAYENYITTP---TELGYSVAQYMRESYP-------NVTFVSAPELNDA-------NGG 312 (379) T ss_pred HHHhhCCeecccccceeEEecHHHHHhhccc---cccCccHHHHHHHhcC-------CcEEEEccccccc-------CCC Confidence 9999999864 55999999999999865 4678999999999966 4688999998643 444 Q ss_pred cccEEEEEEc-------CcceEEEeeCchhhhccccccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 295 NKPRYMVYDK-------SDRNLAMANPIPFRMLAPQMASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 295 g~d~~v~y~~-------~~~~~~~~vp~~~~~~~~~~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) ++++++.++. 
+++.+.+++|++|++||+|+++++|++||++|||||+||||+||+|+|=| T Consensus 313 ~~~~~~~~~~~~~~~t~~~~~~~~~~p~k~~~l~ve~~~~~~~~~~~~rt~Gv~ir~P~Ai~~~~G~ 379 (379) T protein:vir:10 313 SSAIYYYADAVENNGTDDGRTWLQVVPTKMFTLGVEKKIKGYAEGYTNATAGAMLKRPFATYRQTGA 379 (379) T ss_pred ccEEEEEeeccCCCccCCcceEEEecchhhhhccceecCceeEeccccceeeeeeecchhhheecCC Confidence 5555555543 34578999999999999999999999999999999999999999999999 No 13 >protein:vir:99576 Length: 388 # NCBI annotation: hypothetical protein # Family: family:all:1653 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039801;genbank:gi:126011051;genbank:GeneID:4818271 Probab=100.00 E-value=4.1e-76 Score=433.90 Aligned_cols=334 Identities=10% Similarity=0.005 Sum_probs=282.2 Q ss_pred Cccc----chhHHHH--hhhhhhhcccccccccchhhhh----hhhhhhccCCceeccchhhHHHHHHHHHHHHHHHHHh Q lcl|NC_021342. 1 MAIK----TIDAQTI--QGNQWLVHKGYVSRNGDQWVIN----NTALDAIGNPNIMLDADGGIAFYISQLAGIEATVYET 70 (354) Q Consensus 1 ~~~~----~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~----~~am~a~~~~~~~~da~~~~~fl~~~L~~Id~~v~e~ 70 (354) |+-+ ++|...+ +++.|+++++...+...+.... -.||||+..+. .+.++. .+.+..|++|||++|++ T Consensus 21 ~~~~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~a~da~~~~~-~t~~~~--gip~~~~~~~~p~~~~~ 97 (388) T protein:vir:99 21 MANGKADYRLTDMAVRELKKFGLVFDHATVKRQIELLHEGGVATQAFDSAYVAP-TTQASI--PTPIQFLQQWLPGFVKV 97 (388) T ss_pred hhcCCcceeeechhhHhhhhcceeccCccchhhhhhhhhhhhhhcccCcccccc-cccCcc--cHHHHHhhhhccceeee Confidence 3332 3666666 6889999998766655444332 35888764333 344443 35777789999999999 Q ss_pred hhhcccchhhccccCCCCCceeEEEEEeeccccceeEecCCCcccceeeeccceeEEEEEEEEeeEeecHHHHHHHHHhC Q lcl|NC_021342. 
71 PYGDITYRFDVPMAANIPEYADTWMYRSYDGVTMGKFIGANGQDLPRVAQSAQMHTVPLGYAGNECHYTLDEMRKSAAMN 150 (354) Q Consensus 71 ~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~v~~~~~~~~~pv~~~~~~~~~~~~El~~a~~~g 150 (354) .++++++.+|||+.+.++|.+++++|.++|.+|++++|+|++| +|+++++++++++++++++.+|+|+++|+++|+++| T Consensus 98 ~~~p~~~~~l~pv~t~g~W~~~~~~f~v~e~~G~A~~ygd~~D-~Pl~d~~~~~~~r~v~~~~~g~~yg~~El~~A~~~g 176 (388) T protein:vir:99 98 LTSARKIDEILGVKTVGSWEDQEIVQGIVEPAGTAMEYGDLTN-IPLSSWNVNFERRTIVRGEMGIQVGLLEEGRASAMR 176 (388) T ss_pred eechhhhhhhccccccCCccceeEEEeeeecceeEEEeecccC-CCceeccceeeeeeEEEEEeeeeecHHHHHHHHhhC Confidence 9999999999999998877788999999999999999999865 599999999999999999999999999999999999 Q ss_pred CCcchHHHHHHHHHHHHHhhheeeeeehh---hCceeeeecCCcccc------cccccccccCHHHHHHHHHHHHHHHHH Q lcl|NC_021342. 151 MPIDAEQARLAFRGAEEHSQSVAYFGDAS---RGMYGLFNNPNVTLS------SATKDYKTMNGQELFNMLNAPIFSVIN 221 (354) Q Consensus 151 ~~ld~~k~~aA~~~~a~~~n~~~f~G~~~---~gi~GLlN~p~~~~~------~~~~~W~~~T~~ei~~di~~~~~~l~~ 221 (354) ++|+++|+.+|++++++++|+++|||+.+ .++|||||||+++.. .++++|++||++||++||++++++|+. T Consensus 177 ~~l~~~Ka~AA~~ale~~~N~i~f~G~~g~~~~~~yGllNdP~l~a~v~at~~~~~~~Wa~kT~~eI~~Di~~~~~~i~~ 256 (388) T protein:vir:99 177 INSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPAIASTTPGGWVSGGANAFQGIVGDLRLMLITLRV 256 (388) T ss_pred CCcHHHHHHHHHHHHHhhhceEEEEeecCCCccceEEEeeCCCcccccccccCCcCcccccCCHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999764 489999999998753 234579999999999999999999999 Q ss_pred HhCCcccc----cEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeeeeeeeccccccccccCccc Q lcl|NC_021342. 222 LSRRFHVP----NTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQLDAAELAANGVSNSNKP 297 (354) Q Consensus 222 ~s~g~~~p----~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~~g~g~~g~d 297 (354) +|+|.+.| .+|+|||+.+.+|+++ ..+++|+++||++|++ ++.|+.+++|+.+. 
+.+|.+ T Consensus 257 qs~g~~~~~~~~~tL~LP~~~~~~Ls~~---n~~g~Tvl~~lk~n~P-------nl~i~t~pEl~~a~------~tgg~~ 320 (388) T protein:vir:99 257 QSEDNIDPEDVDITLVLPMNKVDMLSVV---TDLGISVRDWLKQTYP-------RVRVMSAPELQGGN------PDDGKD 320 (388) T ss_pred hcCCeeeecccceEEEechHHHHhcccc---CcCCccHHHHHHHhcC-------CcEEEEeccccccc------ccCCce Confidence 99998765 4899999999999754 4568999999999865 57888998886542 234556 Q ss_pred EEEEEEcC-----------cceEEEeeCchhhhccccccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 298 RYMVYDKS-----------DRNLAMANPIPFRMLAPQMASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 298 ~~v~y~~~-----------~~~~~~~vp~~~~~~~~~~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) .++.|.++ .+.+.+.+|++|++||+|+++++|++||++|||||+||||+||+|+|== T Consensus 321 ~~~~~~~~~~~~~~~~~~~~~t~~~~~p~~~~~l~vq~~~~~~~~~~~~rt~Gv~ir~P~Ai~~~~GI 388 (388) T protein:vir:99 321 IAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVTLGVEKRVKNYVEAYSNATAGVMLKRPWAVVRLIGL 388 (388) T ss_pred eEEEEecccccccccCccCcceeEEecccccccccceecCceeEeccccceeeeEEeccchhheeccC Confidence 67766543 4468888999999999999999999999999999999999999998855 No 14 >protein:vir:96079 Length: 382 # NCBI annotation: hypothetical protein ORF023 # Family: family:all:1653 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294440;genbank:gi:149408337;genbank:GeneID:5237198 Probab=100.00 E-value=6.7e-74 Score=421.76 Aligned_cols=336 Identities=10% Similarity=0.028 Sum_probs=277.6 Q ss_pred Cccc--chhHHHHhhhhhhhcccccccc------cchhhhhhhhhhhccCCceeccchhhHHHHHHHHHHHHHHHHHhhh Q lcl|NC_021342. 1 MAIK--TIDAQTIQGNQWLVHKGYVSRN------GDQWVINNTALDAIGNPNIMLDADGGIAFYISQLAGIEATVYETPY 72 (354) Q Consensus 1 ~~~~--~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~am~a~~~~~~~~da~~~~~fl~~~L~~Id~~v~e~~~ 72 (354) |-.+ +.+.=.=+++.|+++.+.+... 
...+.....||||......++ ++.+ ..+..|+++||+++++++ T Consensus 19 ~~~~~~~~~~~~~l~~~gi~~~~~~~~~~~~~~~~~~~~~~~~amDa~~~~~~t~-~~~g--~p~~~l~~~~p~~~~~~~ 95 (382) T protein:vir:96 19 FDLKNVTHEAVAALGRIGLVFDHAVVQDQIKALAKAGAFRSGSAMDSNFTAPVTT-PSIP--TPIQFLQTWLPGFVKVMT 95 (382) T ss_pred hhhhcccHHHHHHHhccccccCcccchhHhhhhhhhhhhhhhcccccccCCcccc-CCcc--HHHHHHhhhhhhhhhhhh Confidence 1112 2232233477999998876433 122334457999874333333 3333 356667999999999999 Q ss_pred hcccchhhccccCCCCCceeEEEEEeeccccceeEecCCCcccceeeeccceeEEEEEEEEeeEeecHHHHHHHHHhCCC Q lcl|NC_021342. 73 GDITYRFDVPMAANIPEYADTWMYRSYDGVTMGKFIGANGQDLPRVAQSAQMHTVPLGYAGNECHYTLDEMRKSAAMNMP 152 (354) Q Consensus 73 ~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~v~~~~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ 152 (354) +++.++++||+.+.++|.+++++|.++|.+|+|++|+|++|+ |+++++++++++++++++.+|+|+.+|+.+|+++|++ T Consensus 96 ~p~~~~~l~pv~t~g~W~~~t~ty~~~e~~G~A~~ygd~~D~-Pl~d~~~~~~~r~v~~~~~g~~yg~lE~~rAa~~~~~ 174 (382) T protein:vir:96 96 AARKIDEIIGIDTVGSWEDQEIVQGIVEPAGTAVEYGDHTNI-PLTSWNANFERRTIVRGELGLLVGTLEEGRASAIRLN 174 (382) T ss_pred hhhhhhhhccccccCCccceEEEEeeeecccceEEeecccCC-CccccccceeEEEEEEEEEeeeecHHHHHHHHhhCCC Confidence 999999999999987776799999999999999999998765 9999999999999999999999999999999999999 Q ss_pred cchHHHHHHHHHHHHHhhheeeeee---hhhCceeeeecCCcccc--cccccccccCHHHHHHHHHHHHHHHHHHhCCcc Q lcl|NC_021342. 153 IDAEQARLAFRGAEEHSQSVAYFGD---ASRGMYGLFNNPNVTLS--SATKDYKTMNGQELFNMLNAPIFSVINLSRRFH 227 (354) Q Consensus 153 ld~~k~~aA~~~~a~~~n~~~f~G~---~~~gi~GLlN~p~~~~~--~~~~~W~~~T~~ei~~di~~~~~~l~~~s~g~~ 227 (354) |..+|+.+|++++++++|+++|||+ .+.|+|||||||++++. 
.++++|++||++||++||++++++|+.+|+|.+ T Consensus 175 l~~~Ka~aA~~ale~~~N~i~f~G~~~g~~~~~yGllNdP~l~a~~t~a~~~Wa~kT~~eI~~Di~~l~~~i~~qt~G~~ 254 (382) T protein:vir:96 175 SAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPPFQTPPSQGWATADWAGIIGDIREAVRQLRIQSQDQI 254 (382) T ss_pred cHHHHHHHHHHHHHHhhceEEEEeeecCcCcceEEEEeCCCcccccccCCCCcccccHHHHHHHHHHHHHHHHhccCCee Confidence 9999999999999999999999997 34789999999999853 456789999999999999999999999999988 Q ss_pred c----ccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeeeeeeeccccccccccCcccEEEEEE Q lcl|NC_021342. 228 V----PNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQLDAAELAANGVSNSNKPRYMVYD 303 (354) Q Consensus 228 ~----p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~~g~g~~g~d~~v~y~ 303 (354) . |.+|+|||+.|.+|+++ ..+++|+++||++|+| +++|+.+++|+.+.. .|.+++++|+.|. T Consensus 255 ~~~~~~~~L~LP~~~~~~Ls~~---n~~g~Tvl~~lk~n~P-------nl~i~t~peL~~a~~----~g~g~~~~~~~~~ 320 (382) T protein:vir:96 255 DPKAEKITMALATSKVDYLSVT---TPYGISVSDWIEQTYP-------KMRIVSAPELSGVQM----QGKTPEDALVLFV 320 (382) T ss_pred eecccceEEeechHHHhhcccc---CccCccHHHHHHHhcC-------CcEEEEccccccccC----CCccceeEEEEec Confidence 6 45899999999999754 4678999999999865 468899999876532 2335788999987 Q ss_pred cCc-----------ceEEEeeCchhhhccccccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 304 KSD-----------RNLAMANPIPFRMLAPQMASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 304 ~~~-----------~~~~~~vp~~~~~~~~~~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) .+. 
..+.+.+|..++.+++|++.++|++||+++||||+||||++|+|+|== T Consensus 321 ~e~~~~~~~s~~~p~~f~q~~p~~~~~l~ve~~~~~~~~~~s~~t~Gv~i~~P~ai~~~~GI 382 (382) T protein:vir:96 321 EEVDASVDGSTDGGSVFSQLVQSKFITLGVEKRAKSYVEDFSNGTAGALCKRPWAVVRYLGI 382 (382) T ss_pred chhhhhcccccccCcceeccccceeeeccceeecceeEeccccceeeeEEEcchhhhhccCC Confidence 763 344555566667789999999999999999999999999999998855 No 15 >protein:vir:105778 Length: 358 # NCBI annotation: gp9 # Family: family:all:10995 # MgeID: mge:1501 # MgeName: ES18 # Cross-refs: genbank:acc:YP_224147;genbank:gi:62362222;genbank:GeneID:3342531 Probab=99.67 E-value=4.3e-19 Score=121.27 Aligned_cols=325 Identities=11% Similarity=0.128 Sum_probs=206.5 Q ss_pred Ccc-cch--hHHHHhhhhhh---hcccccccccchhhhhhhhhhhccCCceeccchhhHHHHHHHHHHHHHHHHHhhhhc Q lcl|NC_021342. 1 MAI-KTI--DAQTIQGNQWL---VHKGYVSRNGDQWVINNTALDAIGNPNIMLDADGGIAFYISQLAGIEATVYETPYGD 74 (354) Q Consensus 1 ~~~-~~~--~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~am~a~~~~~~~~da~~~~~fl~~~L~~Id~~v~e~~~~~ 74 (354) |-. |.. .-+.+ +.||- +.+...+.-..++-+.+.|--+ ...+.. ++...|...-|.++|.++.+...++ T Consensus 1 ~~f~K~~~an~~~~-~~qw~~L~~~Rna~n~~~~a~maan~a~~~--~~~~~~--NAv~~v~~D~wr~~D~~~~q~fr~e 75 (358) T protein:vir:10 1 MYFSKETLATNSRL-GGHWNELWANRNMWNAQHDAMIAANRSNMT--PEWLAV--NAVGGFTRDFWAEIDRQVLQLRDQE 75 (358) T ss_pred CeechhhhhhHHHH-HHHHHHHHHHHHHhhhhhhhHHhhhHHHhh--hhhhee--cccccCCHHHHHHHhhhhhhhcccc Confidence 111 110 11111 12222 1111111100010011110000 011111 2223344445678898888866664 Q ss_pred --c-cchhhccccCCCCCceeEEEEEeecc-ccceeE-e-cCCCcccceeeeccceeEEEEEEEEeeEeecHHHHHHHHH Q lcl|NC_021342. 75 --I-TYRFDVPMAANIPEYADTWMYRSYDG-VTMGKF-I-GANGQDLPRVAQSAQMHTVPLGYAGNECHYTLDEMRKSAA 148 (354) Q Consensus 75 --l-~~r~~v~v~~~~~~~~~~~~~~~~~~-~G~a~~-~-~~~~~dip~v~~~~~~~~~pv~~~~~~~~~~~~El~~a~~ 148 (354) + ..-+|+++.++++.+.....|.+... .|++.. 
+ |..+.++..+.++.+ --||+.+..+|+.+|||++..+- T Consensus 76 ~~~~l~NDLm~ls~sv~Igktv~~y~~~gd~~~~v~~SmsGQ~~~~lD~~~y~~d--GtpiPIfdsg~~f~WR~~~~~~~ 153 (358) T protein:vir:10 76 VGMEIVNDLIGVQTVLPVGKTAKLYNVIGDIADDVSVSIDGQAPFSFDHTEYASD--GDPIPVFTAGYGVNWRHAAGLNS 153 (358) T ss_pred hhHHHHhhhhhccccccHHHHHHHHhhhcCCCceEEEEecccCcccccceeeecc--CCEeeeeccCccccccchhhcCc Confidence 3 34567899999999988888876655 776653 3 334444555665544 44666667888888899999999 Q ss_pred hCCCcchHHHHHHHHHHHHHhhheeeeeehh-----hCceeeeecCCcccc-------cccccccccCHHHHHHHH-HHH Q lcl|NC_021342. 149 MNMPIDAEQARLAFRGAEEHSQSVAYFGDAS-----RGMYGLFNNPNVTLS-------SATKDYKTMNGQELFNML-NAP 215 (354) Q Consensus 149 ~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~-----~gi~GLlN~p~~~~~-------~~~~~W~~~T~~ei~~di-~~~ 215 (354) .|+++.++.+++..+++.++.-+.+|+|+.+ +-.+||-|||++... ...-|++++|+++++..+ .++ T Consensus 154 ~g~d~~~daQ~~~~~kv~~~~vdy~lNG~~~I~v~g~t~~Glrn~~n~~qv~l~~~s~g~NiDlttat~~a~~~~f~~~l 233 (358) T protein:vir:10 154 LGIDLVLDSQMAKMRKFNQKRVNYYLNGDPNIQVQSYPAQGIKNHRNTKKINLGSGSGGANIDLTTADMTALFAFFGKGA 233 (358) T ss_pred cccchhHHHHHHHHHHHHHHHHhhhhccCCceeecCcccccccCCcceeEEEeccCCCcceeeeccCCHHHHHHHHHHHH Confidence 9999999999999999999999999999765 456999999997622 223479999999888888 556 Q ss_pred HHHHHHHhCCcccccEEEeCHHHHHHHhhcccCCCC-CchHHHHHHhhCcccccccccceeeeeeeeeeccccccccccC Q lcl|NC_021342. 216 IFSVINLSRRFHVPNTALMFPDLWNQANNQLMTGYT-DRTVMQHFMEANSYTLLTGNELDIQIRFQLDAAELAANGVSNS 294 (354) Q Consensus 216 ~~~l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~-~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~~g~g~~ 294 (354) +.++... +....-.+++++|+.+..+.+.+...++ .-|||+++++-... -+|.+.+.|. 
T Consensus 234 ~~~~~~~-N~~~~~~~~~vs~ei~~n~~r~Y~~~~~~~gTIl~~vl~~~~v-------a~I~~~~~Ls------------ 293 (358) T protein:vir:10 234 FGTLARA-NKVAQYDVMWVSPEIWANLAQPYVVNGVVSGNVLNAVLPFAPV-------REIRQTFALS------------ 293 (358) T ss_pred HHHHHhh-cccceeeEEEEcHHHHhhhhcccccccccchhhHHHhhcccCc-------ccccccccCC------------ Confidence 7777654 5566668999999999999877765543 55999999985321 2455555442 Q ss_pred cccEEEEEEcCcceEEEeeCchhhhcccccc--CceeEEeeeeeeeeEEEECCce----eEeeecC Q lcl|NC_021342. 295 NKPRYMVYDKSDRNLAMANPIPFRMLAPQMA--SLGITVPAEYKISGTEFRYPLC----AAYVDMA 354 (354) Q Consensus 295 g~d~~v~y~~~~~~~~~~vp~~~~~~~~~~~--~l~~~~~~~~~~gGv~i~~P~a----i~y~D~~ 354 (354) .+.+++|.+..+++.-.+.||+-..|.-.. +-+|.+..+++. |++||.-.. ++|.--- T Consensus 294 -gNeii~~~~~~~vi~plvG~~~gt~~~pR~~p~ddY~f~vwsA~-glqik~D~~Gks~Vv~~~~~ 357 (358) T protein:vir:10 294 -GNEFIAYVRRQDIISPLVGMAVGVVPLPRPLPNVNYNFQIMSAE-GLQITADDQGLSGVVYGANL 357 (358) T ss_pred -CccEEEEEeCCceeeeeecceeeeecCCCCCCCcchhhhhhhhh-ceeeeeccccceeeEeeccc Confidence 246899999999999999999877664322 236777778776 577664321 1110000 No 16 >protein:vir:94673 Length: 419 # NCBI annotation: major capsid protein # Family: family:all:585 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579208;genbank:gi:93007444;genbank:GeneID:5076792 Probab=98.94 E-value=7.9e-10 Score=70.50 Aligned_cols=322 Identities=11% Similarity=0.025 Sum_probs=163.5 Q ss_pred CcccchhHHHHhhhhhh------hcc---cccccccchhhhh---hhhhhhccCCcee-------------ccc-----h Q lcl|NC_021342. 1 MAIKTIDAQTIQGNQWL------VHK---GYVSRNGDQWVIN---NTALDAIGNPNIM-------------LDA-----D 50 (354) Q Consensus 1 ~~~~~~~~~~~~~~~~~------~~~---~~~~~~~~~~~~~---~~am~a~~~~~~~-------------~da-----~ 50 (354) -.++.+.. ........ ..+ .........+... ...+......... .+. . 
T Consensus 51 ~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 129 (419) T protein:vir:94 51 ARAALLRT-APPAPKGPADGGTPLTPAEAGTFRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTIT 129 (419) T ss_pred HHHHHHHH-HHHHHHHHhhhhccccccccccccchhhhhhhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhcccccccccc Confidence 00000000 00000000 000 0000000000000 0000000000000 000 0 Q ss_pred hhHHHHHHHHHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEe--------eccccceeEecCCCcccceeeecc Q lcl|NC_021342. 51 GGIAFYISQLAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRS--------YDGVTMGKFIGANGQDLPRVAQSA 122 (354) Q Consensus 51 ~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~--------~~~~G~a~~~~~~~~dip~v~~~~ 122 (354) .++..+.. +.+...+..........+.++.+....+ ..+.|.. ....+.+.|++.++. +|..+... T Consensus 130 ~~~~~~~p--~~~~~~i~~~~~~~~~i~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~-~~~~~~~~ 203 (419) T protein:vir:94 130 NPNVPHLP--QLVPGIVPTTPDLPLLVADLLDQQNADY---NVLEYIRDTSGTAGAGSTWNKAAVVPEGTA-KPQSTLSF 203 (419) T ss_pred CCcccccc--hhhhHHHHHHHhhhhhhhhcceeeeccC---CceeeeeeccccccccccCcccceecCCcc-ccccccce Confidence 01111221 2334445555555666666666543222 1222221 222345667776544 77777788 Q ss_pred ceeEEEEEEEEeeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehhhCceeeeecCCcccccccccccc Q lcl|NC_021342. 123 QMHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDASRGMYGLFNNPNVTLSSATKDYKT 202 (354) Q Consensus 123 ~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~gi~GLlN~p~~~~~~~~~~W~~ 202 (354) +......+.++..+.+|.+=++.+. .+..--....+++++..+|+.+++|+......|+++.+++........+.. 
T Consensus 204 ~~i~~~~~k~~~~~~is~ell~d~~----~l~~~i~~~la~a~~~~~d~aii~G~G~~~p~Gi~~~~~~~~~~~~~~~~~ 279 (419) T protein:vir:94 204 DTITTTLKTVAHWLPITRQAADDNS----QLMGYIQGRLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKPTAP 279 (419) T ss_pred eeEEeeeeeEEEeehhhHHHHHhHH----HHHHHHHHHHHHHHHHHHHHHHHhccCcccccceecccccccccccccccc Confidence 8888999999998899876555332 477777788999999999999999999888999999999877666656666 Q ss_pred cCHHHHHHHHHHHHHHHHHHhCCcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeeeeee Q lcl|NC_021342. 203 MNGQELFNMLNAPIFSVINLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQLD 282 (354) Q Consensus 203 ~T~~ei~~di~~~~~~l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~ 282 (354) .|....+++|.+++..+... + ..+..++|+|+.|..|....-+. ++. ++...+. ..+.+-.|...|.+. T Consensus 280 ~t~~~~~~~l~~~~~~~~~~--~-~~~~~~v~n~~~~~~l~~~k~~~-~~~----~~~~~~~---~~~~~~~l~G~pV~~ 348 (419) T protein:vir:94 280 ATDEPPLVDIRRAKTVAEIA--G-FPPDGVVVHPQDWESIELDQAPG-SGV----FRVIANV---QGEATPRIWGLNVVS 348 (419) T ss_pred cccchhHHHHHHHHHhhhhc--c-CCCCEEEEcHHHHHHHHHHhhcC-CCc----eeecCCc---ccCCCccccceeeEE Confidence 77888899999999998752 3 35678999999999987543222 221 1111111 122223343334433 Q ss_pred eccccccccccCcccEEEEEEcCcceEEEeeCchhhhcccc-cc----CceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 283 AAELAANGVSNSNKPRYMVYDKSDRNLAMANPIPFRMLAPQ-MA----SLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 283 ~~~~~~~g~g~~g~d~~v~y~~~~~~~~~~vp~~~~~~~~~-~~----~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) ...... + ..++.+-+ +.+.+..-+.++..... .. 
--...+.++.+++ +.+++|.|++++.++ T Consensus 349 ~~~~~~-~-------~~~~gd~~-~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~r~~~r~d-~~v~~~~a~~~~~~~ 415 (419) T protein:vir:94 349 TVAIAQ-G-------TALVGGFR-QGATLWSRQGITVLMTDSHADFFTANTLVILAEFRAN-LAVYQPKAFVRVTFA 415 (419) T ss_pred cCCCCC-c-------cEEEeecc-ceEEEEEecceEEEEeccccchhhcCcEEEEEEEeec-cEEeccccEEEEEec Confidence 332211 1 01111111 11111111122222111 11 1124456677775 667889999999999 No 17 >protein:vir:9574 Length: 300 # NCBI annotation: gp40 # Family: family:all:966 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862879;genbank:gi:32469471;genbank:GeneID:1461316 Probab=98.93 E-value=4.2e-10 Score=72.01 Aligned_cols=282 Identities=9% Similarity=-0.057 Sum_probs=162.7 Q ss_pred eeccchhhHHHHHHHHHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeeccccceeEecCCCcccceeeeccce Q lcl|NC_021342. 45 IMLDADGGIAFYISQLAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYDGVTMGKFIGANGQDLPRVAQSAQM 124 (354) Q Consensus 45 ~~~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~v~~~~~~ 124 (354) |-..+++++.++. ..+.+.+++...+.-..+++.++. +.+.+ ...+.+....+.+.|++.. ..+|..+...+. T Consensus 1 ma~~t~~~G~lip---~~~~~~ii~~l~~~s~i~~l~~~~-~~~~~--~~~~p~~~~~~~a~wv~Eg-~~~~~s~~~f~~ 73 (300) T protein:vir:95 1 MSEAQLSKGNLFN---PELVTKVINKVKGHSSIAKLSPQK-PIPFN--GQREFVFDFDSDIDIVAEN-GKKTHGGVSLDP 73 (300) T ss_pred CcccccCCcceec---hhhHHHHHHHHHhhhhhhhhccee-eccCC--ceEEEEEecCcceEEeeCC-ccccccccccee Confidence 2222334444444 445677888777777777776653 22223 2455566666788898765 568888888888 Q ss_pred eEEEEEEEEeeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeeh-----hhCceeeeecCCccccccccc Q lcl|NC_021342. 125 HTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDA-----SRGMYGLFNNPNVTLSSATKD 199 (354) Q Consensus 125 ~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~-----~~gi~GLlN~p~~~~~~~~~~ 199 (354) ...+.+.++.-..+|.+=+.+..-...++...-....++++++.+|+.+|+|+. 
..++.|..+.++.....+..+ T Consensus 74 v~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~l~~aia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~ 153 (300) T protein:vir:95 74 VTIVPLKVEYGARVSDEFLHASEEAKVDMLTDFVEGFSKKLARGLDIMSIHGINPRTKQASTIIGDNCFDKKVTQTVPFK 153 (300) T ss_pred eEeeeEEEEEeehhhHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCCCCCcccccccccccccceeeccc Confidence 888888988888887653322223345677778888999999999999999952 234566666555443333222 Q ss_pred ccccCHHHHHHHHHHHHHHHHHHhCCcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeee Q lcl|NC_021342. 200 YKTMNGQELFNMLNAPIFSVINLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRF 279 (354) Q Consensus 200 W~~~T~~ei~~di~~~~~~l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~ 279 (354) . ...+++|.+++..+... ...|..++|+|..+..|.+.. +..|..++.-. ...+.+.++--.| T Consensus 154 ~-----~~~~~~i~~~~~~~~~~---~~~~~~~vmn~~~~~~L~~lk--d~~G~~i~~~~-------~~~~~~~~l~G~P 216 (300) T protein:vir:95 154 D-----TNPDESMEDAVGMIDGS---ERDITGAILDPIFTTALSKMK--NAEGGKLYPEL-------AWGGVPDAINGLA 216 (300) T ss_pred c-----cchHHHHHHHHHHhhhc---CCCccEEEECHHHHHHHHHhh--ccCCCeeccCc-------cccCCCceeccee Confidence 1 22357888888877542 245678999999999986533 44443332111 1123333444444 Q ss_pred eeeeccccccccccCcccEEEEEEcCcceEEEeeCc--hhhhccc-ccc----Cc----eeEEeeeeeeeeEEEECCcee Q lcl|NC_021342. 280 QLDAAELAANGVSNSNKPRYMVYDKSDRNLAMANPI--PFRMLAP-QMA----SL----GITVPAEYKISGTEFRYPLCA 348 (354) Q Consensus 280 ~L~~~~~~~~g~g~~g~d~~v~y~~~~~~~~~~vp~--~~~~~~~-~~~----~l----~~~~~~~~~~gGv~i~~P~ai 348 (354) .+.+..... ..++.++.+++-+.+ +.+.+.+-. .+.+.+- ... ++ ..-+.++.|+ |+.+++|.|+ T Consensus 217 v~~s~~v~~--~~~~~~~~~~~GDf~-~~~~~~~~~~~~~~v~~~~~~d~~~~~~f~~~~v~~r~~~r~-d~~v~~~~a~ 292 (300) T protein:vir:95 217 VDKNRTVSY--SQTDPKNTAIVGDFE-TMFKWGYAKEVPMEIIKYGDPDNSGRDLKGYNQIYIRCEAYI-GWGIMDAASF 292 (300) T ss_pred eEEecCCCC--CCCCCccEEEEeecc-ceEEEEEecccEEEEeeccCCCCcchhhhhcCcEEEEEEEee-cceeecccce Confidence 443333221 112233433333322 111111111 1222111 111 11 2556677787 5788889999 Q ss_pred EeeecC Q lcl|NC_021342. 
349 AYVDMA 354 (354) Q Consensus 349 ~y~D~~ 354 (354) +++--+ T Consensus 293 ~~l~~~ 298 (300) T protein:vir:95 293 ARIVKT 298 (300) T ss_pred EEEecC Confidence 998877 No 18 >protein:vir:8187 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817980;genbank:gi:29566414;genbank:GeneID:2700968 Probab=98.89 E-value=6.3e-10 Score=71.03 Aligned_cols=287 Identities=10% Similarity=0.011 Sum_probs=156.6 Q ss_pred hhhccCCceeccchhhHHHHHHHHHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeeccccceeEecCCCcccc Q lcl|NC_021342. 37 LDAIGNPNIMLDADGGIAFYISQLAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYDGVTMGKFIGANGQDLP 116 (354) Q Consensus 37 m~a~~~~~~~~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip 116 (354) |-.... + .++.. +.+...+++...+.-..+++..+. +.+.+ ...+......+.+.|++.. ..+| T Consensus 1 mat~~~---------g-g~lvP--~~~~~~ii~~~~~~s~i~~~~~~i-~~~~~--~~~~p~~~~~~~a~wv~Eg-~~~~ 64 (311) T protein:vir:81 1 MVALAT---------G-TFQLP--KHLVPGVWQKAQGQSVLARLSMAE-PQEFG--EQQYMTLTAPPRGEVVGEG-AQKS 64 (311) T ss_pred CceecC---------C-ceEcc--hhHHHHHHHHHHhcchhhhhccee-ecCCC--ceEEEEEeCCceeEEeecC-cccc Confidence 222211 2 23332 445677888777777777877653 23323 3455666677788888754 5578 Q ss_pred eeeeccceeEEEEEEEEeeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeeh---hhCceeeeecCCccc Q lcl|NC_021342. 117 RVAQSAQMHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDA---SRGMYGLFNNPNVTL 193 (354) Q Consensus 117 ~v~~~~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~---~~gi~GLlN~p~~~~ 193 (354) ..+...+......+.++.-..+|.+=|+...-...++...-....++++++.+|+.+++|+. +.+..|+++...-. 
T Consensus 65 ~~~~~f~~v~l~~~kl~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~a~l~G~~~~~~~~~~gi~~~~~~~- 143 (311) T protein:vir:81 65 ESTATFAPVTAIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDT- 143 (311) T ss_pred cccceeeEEEEeeEEEEEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHhhhccccCCCCccccccccccccc- Confidence 88888888888888888777776543332223345677788888999999999999999964 33455666542111 Q ss_pred ccccccccccCHHHHHHHHHHHHHHHHHHhCCcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccc Q lcl|NC_021342. 194 SSATKDYKTMNGQELFNMLNAPIFSVINLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNEL 273 (354) Q Consensus 194 ~~~~~~W~~~T~~ei~~di~~~~~~l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l 273 (354) .......+.+......+|.+++..+.. ....|..++|+|..+..|.+-. +..+.-++.-.. ..+.+- T Consensus 144 -~~~~~~~~~~~~~~~~~i~~~~~~~~~---~~~~~~~~vmn~~~~~~l~~lk--d~~G~~l~~~~~-------~~~~~~ 210 (311) T protein:vir:81 144 -TNIVELTTGTSATPDLAVEAAVGLVLG---DNLSPDGVALDNTFSFMLATQR--DSQGRKLYPELG-------FGTDVA 210 (311) T ss_pred -ceeeeecccccchHHHHHHHHHHHhhh---cCCCceEEEEcHHHHHHHHhhh--ccCCCeeecCcc-------ccCCCc Confidence 111112222233345677777777653 2346678999999999996422 333322221000 011222 Q ss_pred eeeeeeeeeecccccc----------ccccCcccEEEEEEcCcceEEEeeCchhhhccc----cccC----ceeEEeeee Q lcl|NC_021342. 274 DIQIRFQLDAAELAAN----------GVSNSNKPRYMVYDKSDRNLAMANPIPFRMLAP----QMAS----LGITVPAEY 335 (354) Q Consensus 274 ~I~~~~~L~~~~~~~~----------g~g~~g~d~~v~y~~~~~~~~~~vp~~~~~~~~----~~~~----l~~~~~~~~ 335 (354) ++...|.+....+... .....++.+++..+.+.=.+.+.-.+.+...+. +..+ =...+.+.. T Consensus 211 tl~G~Pv~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~gDfs~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~r~~~ 290 (311) T protein:vir:81 211 SFAGLNAAVSDTVRGGPEAVTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEV 290 (311) T ss_pred eecceeEEecccccccccccccccchhcccCCccEEEEEecccEEEEEeccceEEEeccCCCCcchhhhhcCcEEEEEEE Confidence 2322332222211110 011223444444444321222211222222211 1111 124566778 Q ss_pred eeeeEEEECCceeEeeecC Q lcl|NC_021342. 
336 KISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 336 ~~gGv~i~~P~ai~y~D~~ 354 (354) |++ ..+.+|.|++++--| T Consensus 291 r~d-~~v~~~~a~~~l~~a 308 (311) T protein:vir:81 291 VYG-IGIMSTDAFAVVRDA 308 (311) T ss_pred Eec-cEeecccceEEEEee Confidence 874 788899999999888 No 19 >protein:vir:101650 Length: 497 # NCBI annotation: gp13 # Family: family:all:585 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654768;genbank:gi:109302766;genbank:GeneID:4156084 Probab=98.89 E-value=9.6e-10 Score=70.03 Aligned_cols=327 Identities=12% Similarity=0.052 Sum_probs=163.6 Q ss_pred CcccchhHHHHhh---------hhhhh--cccccccccchhh-h----------hhhhhhhccCCceeccchhhHHHHHH Q lcl|NC_021342. 1 MAIKTIDAQTIQG---------NQWLV--HKGYVSRNGDQWV-I----------NNTALDAIGNPNIMLDADGGIAFYIS 58 (354) Q Consensus 1 ~~~~~~~~~~~~~---------~~~~~--~~~~~~~~~~~~~-~----------~~~am~a~~~~~~~~da~~~~~fl~~ 58 (354) -..+..+-...+. ..+.. ..+.......... . ...+......-....+ +.+.+++. T Consensus 87 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~gg~~vp 164 (497) T protein:vir:10 87 QIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGST--GTFAPGIL 164 (497) T ss_pred hHHHHHHHHHhhhHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhHHHHHhhhcccC--cccccccc Confidence 0000000000000 00000 0000000000000 0 0000000000111122 22333333 Q ss_pred HHHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeec-cccceeEecCCCcccceeeeccceeEEEEEEEEeeEe Q lcl|NC_021342. 59 QLAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYD-GVTMGKFIGANGQDLPRVAQSAQMHTVPLGYAGNECH 137 (354) Q Consensus 59 ~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~-~~G~a~~~~~~~~dip~v~~~~~~~~~pv~~~~~~~~ 137 (354) +.+.+.+++...+....+.++++..-.+ + ++.+.... ..+.+.|++.+ ..+|..+...+......+.++.-.. 
T Consensus 165 --~~~~~~ii~~~~~~~~i~~l~~~~~~~~-~--~~~~~~~~~~~~~a~wv~E~-~~~~~s~~~f~~i~~~~~k~a~~~~ 238 (497) T protein:vir:10 165 --PTFLPGIVEQLFYELSLADLISSRPVTS-P--NLSYLTESAAHNNAAAVAEA-GTYPFSSEEFARVYEQVGKVANALT 238 (497) T ss_pred --hhhhHHHHHHHHhhhhHHhhccccccCC-C--ceEEEEEcCCCCcceeeccC-cccccccccceeeEeeeeeeEeecH Confidence 5566788998888888888877543322 2 34444432 34677888765 4578888888888899999988877 Q ss_pred ecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehhhCceeeeecCCccccccccccc---------------- Q lcl|NC_021342. 138 YTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDASRGMYGLFNNPNVTLSSATKDYK---------------- 201 (354) Q Consensus 138 ~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~gi~GLlN~p~~~~~~~~~~W~---------------- 201 (354) +|.+ |.+-. . .++.--....++++++.+|+-+++|+...+..|+++.++.........+. T Consensus 239 iS~e-ll~d~--~-~l~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 314 (497) T protein:vir:10 239 ITDE-GLRDA--P-ELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADG 314 (497) T ss_pred hHHH-HHHhH--H-HHHHHHHHHHHHHHHHHHHHHhhcCCCcccccccccccccccccccccchhhhhhhhhhhhhhccc Confidence 7764 43322 2 37788888899999999999999999887889999988754322211110 Q ss_pred ---------------------------------ccCHHHHHHHHHHHHHHHHHHhCCcccccEEEeCHHHHHHHhhcccC Q lcl|NC_021342. 202 ---------------------------------TMNGQELFNMLNAPIFSVINLSRRFHVPNTALMFPDLWNQANNQLMT 248 (354) Q Consensus 202 ---------------------------------~~T~~ei~~di~~~~~~l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~ 248 (354) ..+....+.++..++..+.. .+...|..++|+|..|..|.+- - T Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~vmn~~~~~~l~~l--k 390 (497) T protein:vir:10 315 TNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQL--TLFQTPNAVVMNPRDWELLRLT--K 390 (497) T ss_pred ccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhh--hcccCCCeEEEchHHHHHHHHh--h Confidence 01233445566666666654 3556778999999999988542 2 Q ss_pred CCCCchHHHHHHhhCc---ccccccccceeeeeeeeeecccccccc--ccCcccEEEEEEcCcceEEEeeCchhhhcccc Q lcl|NC_021342. 
249 GYTDRTVMQHFMEANS---YTLLTGNELDIQIRFQLDAAELAANGV--SNSNKPRYMVYDKSDRNLAMANPIPFRMLAPQ 323 (354) Q Consensus 249 ~~~~~Tvl~~l~~n~~---~~~~~g~~l~I~~~~~L~~~~~~~~g~--g~~g~d~~v~y~~~~~~~~~~vp~~~~~~~~~ 323 (354) +..|.-++ .... .....+.+-++...|...+........ |+-+.-...++++ ..+.+.+-. ..... T Consensus 391 d~~G~~i~----~~~~~~~~~~~~~~~~~l~G~pV~~t~~~~~~~~~~Gd~~~~~~~i~~r--~~~~v~~~~---~~~~~ 461 (497) T protein:vir:10 391 DANGQYMG----GNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARR--EGVTMQMTN---SNGTD 461 (497) T ss_pred cCCCceec----cCcccccccccccCCceeeceeeEecCCCCCCceEEeecccceEEEEEe--cccEEEeec---ccchh Confidence 44443222 1100 000011111233333333332221110 1111111111221 122222110 00011 Q ss_pred c-cCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 324 M-ASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 324 ~-~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) . ++ ...+.++.|++ ..|++|.||+++++. T Consensus 462 f~~n-~v~~r~~~r~~-~~v~~p~A~~~l~~~ 491 (497) T protein:vir:10 462 FVDG-KVTVRAEERLG-LLVYRPSAFQLIQLK 491 (497) T ss_pred hhcC-cEEEEEEEeec-ceeeccccEEEEEec Confidence 1 22 45677788886 588899999999999 No 20 >protein:vir:7855 Length: 497 # NCBI annotation: gp12 # Family: family:all:585 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817462;genbank:gi:29565891;genbank:GeneID:1259081 Probab=98.89 E-value=9.6e-10 Score=70.03 Aligned_cols=327 Identities=12% Similarity=0.052 Sum_probs=163.6 Q ss_pred CcccchhHHHHhh---------hhhhh--cccccccccchhh-h----------hhhhhhhccCCceeccchhhHHHHHH Q lcl|NC_021342. 1 MAIKTIDAQTIQG---------NQWLV--HKGYVSRNGDQWV-I----------NNTALDAIGNPNIMLDADGGIAFYIS 58 (354) Q Consensus 1 ~~~~~~~~~~~~~---------~~~~~--~~~~~~~~~~~~~-~----------~~~am~a~~~~~~~~da~~~~~fl~~ 58 (354) -..+..+-...+. ..+.. ..+.......... . ...+......-....+ +.+.+++. 
T Consensus 87 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~gg~~vp 164 (497) T protein:vir:78 87 QIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGST--GTFAPGIL 164 (497) T ss_pred hHHHHHHHHHhhhHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhHHHHHhhhcccC--cccccccc Confidence 0000000000000 00000 0000000000000 0 0000000000111122 22333333 Q ss_pred HHHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeec-cccceeEecCCCcccceeeeccceeEEEEEEEEeeEe Q lcl|NC_021342. 59 QLAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYD-GVTMGKFIGANGQDLPRVAQSAQMHTVPLGYAGNECH 137 (354) Q Consensus 59 ~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~-~~G~a~~~~~~~~dip~v~~~~~~~~~pv~~~~~~~~ 137 (354) +.+.+.+++...+....+.++++..-.+ + ++.+.... ..+.+.|++.+ ..+|..+...+......+.++.-.. T Consensus 165 --~~~~~~ii~~~~~~~~i~~l~~~~~~~~-~--~~~~~~~~~~~~~a~wv~E~-~~~~~s~~~f~~i~~~~~k~a~~~~ 238 (497) T protein:vir:78 165 --PTFLPGIVEQLFYELSLADLISSRPVTS-P--NLSYLTESAAHNNAAAVAEA-GTYPFSSEEFARVYEQVGKVANALT 238 (497) T ss_pred --hhhhHHHHHHHHhhhhHHhhccccccCC-C--ceEEEEEcCCCCcceeeccC-cccccccccceeeEeeeeeeEeecH Confidence 5566788998888888888877543322 2 34444432 34677888765 4578888888888899999988877 Q ss_pred ecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehhhCceeeeecCCccccccccccc---------------- Q lcl|NC_021342. 138 YTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDASRGMYGLFNNPNVTLSSATKDYK---------------- 201 (354) Q Consensus 138 ~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~gi~GLlN~p~~~~~~~~~~W~---------------- 201 (354) +|.+ |.+-. . .++.--....++++++.+|+-+++|+...+..|+++.++.........+. 
T Consensus 239 iS~e-ll~d~--~-~l~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 314 (497) T protein:vir:78 239 ITDE-GLRDA--P-ELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADG 314 (497) T ss_pred hHHH-HHHhH--H-HHHHHHHHHHHHHHHHHHHHHhhcCCCcccccccccccccccccccccchhhhhhhhhhhhhhccc Confidence 7764 43322 2 37788888899999999999999999887889999988754322211110 Q ss_pred ---------------------------------ccCHHHHHHHHHHHHHHHHHHhCCcccccEEEeCHHHHHHHhhcccC Q lcl|NC_021342. 202 ---------------------------------TMNGQELFNMLNAPIFSVINLSRRFHVPNTALMFPDLWNQANNQLMT 248 (354) Q Consensus 202 ---------------------------------~~T~~ei~~di~~~~~~l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~ 248 (354) ..+....+.++..++..+.. .+...|..++|+|..|..|.+- - T Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~vmn~~~~~~l~~l--k 390 (497) T protein:vir:78 315 TNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQL--TLFQTPNAVVMNPRDWELLRLT--K 390 (497) T ss_pred ccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhh--hcccCCCeEEEchHHHHHHHHh--h Confidence 01233445566666666654 3556778999999999988542 2 Q ss_pred CCCCchHHHHHHhhCc---ccccccccceeeeeeeeeecccccccc--ccCcccEEEEEEcCcceEEEeeCchhhhcccc Q lcl|NC_021342. 249 GYTDRTVMQHFMEANS---YTLLTGNELDIQIRFQLDAAELAANGV--SNSNKPRYMVYDKSDRNLAMANPIPFRMLAPQ 323 (354) Q Consensus 249 ~~~~~Tvl~~l~~n~~---~~~~~g~~l~I~~~~~L~~~~~~~~g~--g~~g~d~~v~y~~~~~~~~~~vp~~~~~~~~~ 323 (354) +..|.-++ .... .....+.+-++...|...+........ |+-+.-...++++ ..+.+.+-. ..... T Consensus 391 d~~G~~i~----~~~~~~~~~~~~~~~~~l~G~pV~~t~~~~~~~~~~Gd~~~~~~~i~~r--~~~~v~~~~---~~~~~ 461 (497) T protein:vir:78 391 DANGQYMG----GNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARR--EGVTMQMTN---SNGTD 461 (497) T ss_pred cCCCceec----cCcccccccccccCCceeeceeeEecCCCCCCceEEeecccceEEEEEe--cccEEEeec---ccchh Confidence 44443222 1100 000011111233333333332221110 1111111111221 122222110 00011 Q ss_pred c-cCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 
324 M-ASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 324 ~-~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) . ++ ...+.++.|++ ..|++|.||+++++. T Consensus 462 f~~n-~v~~r~~~r~~-~~v~~p~A~~~l~~~ 491 (497) T protein:vir:78 462 FVDG-KVTVRAEERLG-LLVYRPSAFQLIQLK 491 (497) T ss_pred hhcC-cEEEEEEEeec-ceeeccccEEEEEec Confidence 1 22 45677788886 588899999999999 No 21 >protein:vir:99920 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655524;genbank:gi:109392294;genbank:GeneID:4157089 Probab=98.86 E-value=5e-10 Score=71.60 Aligned_cols=289 Identities=7% Similarity=-0.079 Sum_probs=159.0 Q ss_pred ceeccchhhHHHHHHHHHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeeccccceeEecCCCcccceeeeccc Q lcl|NC_021342. 44 NIMLDADGGIAFYISQLAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYDGVTMGKFIGANGQDLPRVAQSAQ 123 (354) Q Consensus 44 ~~~~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~v~~~~~ 123 (354) =++++++ ++..+. +.+..++++...+.-..+++..+. +.+.+ ...+......+.+.|++.. ..+|..+...+ T Consensus 1 Mat~tt~-~g~~vP---~~~~~~ii~~~~~~s~l~~~~~~i-~~~~~--~~~~p~~~~~~~a~wv~Eg-~~~~~~~~~f~ 72 (311) T protein:vir:99 1 MATFGTG-NLKNLP---RNIADGMVKDVVQGSTVAVLSARK-PQRFG--NEDIITFNGRPKAEFVGEG-QQKSSTTGEFD 72 (311) T ss_pred CceecCC-Cceecc---HHHHHHHHHHHHhhchhhhhccee-eccCC--ceEEEEEeCCceeEEeecC-cccccccceee Confidence 0122222 222233 345567888777777777776553 33333 3455566667788998765 45888888888 Q ss_pred eeEEEEEEEEeeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeeh---hhCceeeeecCCcccccccccc Q lcl|NC_021342. 124 MHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDA---SRGMYGLFNNPNVTLSSATKDY 200 (354) Q Consensus 124 ~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~---~~gi~GLlN~p~~~~~~~~~~W 200 (354) ......+.++.-+.+|.+=++.......++...-....++++++.+|+.+|+|+. +.+..|+.+..+.......... 
T Consensus 73 ~v~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~g~~~g~~~~g~~~~~~~~~~~~~~~~ 152 (311) T protein:vir:99 73 FVTSTPKKAQVTMRFNEEVQWADEDYQLGVLQTLSEAGAEALARALDLGLYHRINPLTGTVIPGWSNYLGAASKRVELTA 152 (311) T ss_pred EEEEeeEEEEEeehhhHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccCccccccccccccccceeeccc Confidence 8888888888888887653333334456788888899999999999999999975 3445565554433322222222 Q ss_pred cccCHHHHHHHHHHHHHHHHHHhCCcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeeee Q lcl|NC_021342. 201 KTMNGQELFNMLNAPIFSVINLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQ 280 (354) Q Consensus 201 ~~~T~~ei~~di~~~~~~l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~ 280 (354) .+......||.+++..+... +....++.++|+|..+..|.+-. +..|.-+++ ..+. .+.+-.+...|. T Consensus 153 --~~~~~~~~~i~~~~~~~~~~-~~~~~~~~~vmn~~~~~~L~~lk--d~~G~~l~~----~~~~---~~~~~~l~G~Pv 220 (311) T protein:vir:99 153 --DTIANPDLAIEAAVGLLVAN-GHPTPVNGLALHPSIAWGLSTAR--YTDGRKKFP----ELGL---GIGVSSFEGIDA 220 (311) T ss_pred --cccchhHHHHHHHHHHHhhh-ccCCCccEEEEcHHHHHHHHhhh--ccCCCeeec----Cccc---CCCCceecceee Confidence 22333456777777766543 22345677999999999996532 333322211 1000 111112222232 Q ss_pred eeeccccc--------cccccCcccEEEEEEcCcceEEEeeCchhhhcccc---cc---Cc----eeEEeeeeeeeeEEE Q lcl|NC_021342. 281 LDAAELAA--------NGVSNSNKPRYMVYDKSDRNLAMANPIPFRMLAPQ---MA---SL----GITVPAEYKISGTEF 342 (354) Q Consensus 281 L~~~~~~~--------~g~g~~g~d~~v~y~~~~~~~~~~vp~~~~~~~~~---~~---~l----~~~~~~~~~~gGv~i 342 (354) .....+.. .....+.++.+++-+.+ +.+.+.+-..+++.-.. .. ++ -.-+.++.|+++. + T Consensus 221 ~~s~~i~~~~~~~~~~~~~~~~~~~~~~~Gdf~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~r~~~r~d~~-v 298 (311) T protein:vir:99 221 SVSDTVNGGDEADPDDEDLDAARAVRGIVGDFA-NGIHWGVQRDIPVELIKYGDPDGQGDLKRHNQIALRLEIVYGWY-V 298 (311) T ss_pred EeecccccccccccccchhhccCcceEEEeecc-ccEEEEEecCceEEEeecCCCCcchhhhhcCcEEEEEEEeecce-e Confidence 22221110 00111223333322222 22333333332221111 11 11 2456778999875 6 Q ss_pred ECCceeEeeecC Q lcl|NC_021342. 
343 RYPLCAAYVDMA 354 (354) Q Consensus 343 ~~P~ai~y~D~~ 354 (354) ++|.+++..|-+ T Consensus 299 ~~~~~v~~~~~~ 310 (311) T protein:vir:99 299 FTDRFVVIENAV 310 (311) T ss_pred cChhHeeeeccc Confidence 789888888888 No 22 >protein:vir:9759 Length: 303 # NCBI annotation: putative structural protein # Family: family:all:966 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795521;genbank:gi:28876283;genbank:GeneID:1257824 Probab=98.85 E-value=1.8e-09 Score=68.49 Aligned_cols=285 Identities=10% Similarity=-0.084 Sum_probs=157.1 Q ss_pred eeccchhhHHHHHHHHHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeeccccceeEecCCCcccceeeeccce Q lcl|NC_021342. 45 IMLDADGGIAFYISQLAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYDGVTMGKFIGANGQDLPRVAQSAQM 124 (354) Q Consensus 45 ~~~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~v~~~~~~ 124 (354) |-++++++ ++.. +.+...+++...+.-..+++.++. +.+.+ +..+......+.+.|++.. ..+|..+...+. T Consensus 1 m~t~t~gg--~liP--~~~~~~ii~~l~~~s~i~~l~~~~-~~~~~--~~~ip~~~~~~~a~wv~E~-~~~~~s~~~f~~ 72 (303) T protein:vir:97 1 MGTETSKA--SLFD--KHLVSDLINKVKGHSSLAKLSSQK-PIPFN--GSKEFTFTLDSDIDVVAEN-GKKTHGGLSLEP 72 (303) T ss_pred CcccCCCC--eEcc--hhHHHHHHHHHHhhchhhhhccee-ecCCC--ceEEEEEecCcceEEeecC-ccccccccceee Confidence 33333222 3333 445677888777777788877654 33333 3455566667788999865 557888888888 Q ss_pred eEEEEEEEEeeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehhh-C----ceeeeecCCccccccccc Q lcl|NC_021342. 125 HTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDASR-G----MYGLFNNPNVTLSSATKD 199 (354) Q Consensus 125 ~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~-g----i~GLlN~p~~~~~~~~~~ 199 (354) ...+.+.++.-+.+|.+=|........++...-....++++++.+|+.+++|+... | ..|..+..+...... . 
T Consensus 73 v~l~~~kl~~~~~iS~ell~~~~d~~~~l~~~i~~~la~a~~~~ld~a~l~G~~~~~g~~~~~~~~~~~~~~~~~~~-~- 150 (303) T protein:vir:97 73 VTIVPIKVEYGARLSDEFLYATEEEKIDILKAFNEGFAKKLARGIDLMAMHGINPRTKKASDVIGTNHFDSKVTQVV-K- 150 (303) T ss_pred EEeeeEEEEEeehhhHHHhhcCccchHHHHHHHHHHHHHHHHHHHHhhhhcccccCCcccccccccccccccccccc-c- Confidence 88899999988888765333333345577788899999999999999999996432 2 122222112111110 0 Q ss_pred ccccCHHHHHHHHHHHHHHHHHHhCCcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeee Q lcl|NC_021342. 200 YKTMNGQELFNMLNAPIFSVINLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRF 279 (354) Q Consensus 200 W~~~T~~ei~~di~~~~~~l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~ 279 (354) ..+.+..++||.+++..+.. ....|..++|+|..+..|.+.. +..+.-++ . +.....+.+-.|...| T Consensus 151 --~~~~~~~~~~i~~~~~~~~~---~~~~~~~~vmn~~~~~~L~~lk--d~~g~~~~----~--~~~~~~~~~~~l~G~P 217 (303) T protein:vir:97 151 --FTESEDADANIEAAVNLIQG---AEGVVTGLAMDTEFSTALAKVT--NGEMGPKM----Y--PELAWGANPDSINGLK 217 (303) T ss_pred --cccccchHHHHHHHHHHHhh---cCCCccEEEEcHHHHHHHHHhh--ccCCCeEE----e--cCccCCCCCceeccee Confidence 11233457899999988764 2356678999999999886432 32221111 0 0001112222344444 Q ss_pred eeeeccccccccccCcccEEEEEEcC-------cceEEEeeCchhhhcc--c--cccCceeEEeeeeeeeeEEEECCcee Q lcl|NC_021342. 280 QLDAAELAANGVSNSNKPRYMVYDKS-------DRNLAMANPIPFRMLA--P--QMASLGITVPAEYKISGTEFRYPLCA 348 (354) Q Consensus 280 ~L~~~~~~~~g~g~~g~d~~v~y~~~-------~~~~~~~vp~~~~~~~--~--~~~~l~~~~~~~~~~gGv~i~~P~ai 348 (354) ...+......+....+++..++-+.+ .+.+++.+-....... . -.++ ..-+.++.|++ ..+++|.|+ T Consensus 218 v~~s~~v~~~~~~~~~~~~~~~Gdf~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~n-~~~~r~~~r~~-~~v~~p~af 295 (303) T protein:vir:97 218 SSVNTTVGAGADEAESKDLVIIGDFESMFKWGYAKQIPMEIIKYGDPDNSGKDLKGYN-QIYLRAEAYIG-WGILDAKSF 295 (303) T ss_pred eEEecccCCccccCCCccEEEEeeccccEEEEEecCcEEEEeeccCCCCcchhhhhcC-cEEEEEEEEec-cEeecccce Confidence 44443332222222223332222221 1222222211000000 0 0112 13455677774 778889999 Q ss_pred EeeecC Q lcl|NC_021342. 
349 AYVDMA 354 (354) Q Consensus 349 ~y~D~~ 354 (354) +++-=| T Consensus 296 ~~l~~~ 301 (303) T protein:vir:97 296 ARVTKG 301 (303) T ss_pred EEeeCC Confidence 998888 No 23 >protein:vir:10364 Length: 390 # NCBI annotation: head protein; major capsid subunit precursor # Family: family:all:585 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858956;genbank:gi:32128421;genbank:GeneID:2648357 Probab=98.80 E-value=5.8e-09 Score=65.74 Aligned_cols=303 Identities=13% Similarity=0.060 Sum_probs=161.4 Q ss_pred Cc--------ccchhHHHHhhhhhhhcccccccccchhhhhhhhhhhccCCceeccchhhHHHHHHHHHHHHHHHHHhhh Q lcl|NC_021342. 1 MA--------IKTIDAQTIQGNQWLVHKGYVSRNGDQWVINNTALDAIGNPNIMLDADGGIAFYISQLAGIEATVYETPY 72 (354) Q Consensus 1 ~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~am~a~~~~~~~~da~~~~~fl~~~L~~Id~~v~e~~~ 72 (354) .. .+.-+++...+... ...........+......+..+.+++.++..+ +-+.+++... T Consensus 73 ~~~~~~~~~~~~~~~~~~~~~~~~-----------~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~---~~~~ii~~~~ 138 (390) T protein:vir:10 73 VQHVSVGDLFVASEQFQASAGRWN-----------DRSARATMNIKAALNTASTDAAGSAGALTTPN---RLPGFITQPD 138 (390) T ss_pred ccccchhhhhhhhHHHHHHHHhhh-----------hhhhhhhhHHHHHHHhhhcccccccccccchh---HHHHHHHHHH Confidence 00 01111111110000 00000000111111111122223344455432 2356777777 Q ss_pred hcccchhhccccCCCCCceeEEEEEeecc-ccceeEecCCCcccceeeeccceeEEEEEEEEeeEeecHHHHHHHHHhCC Q lcl|NC_021342. 73 GDITYRFDVPMAANIPEYADTWMYRSYDG-VTMGKFIGANGQDLPRVAQSAQMHTVPLGYAGNECHYTLDEMRKSAAMNM 151 (354) Q Consensus 73 ~~l~~r~~v~v~~~~~~~~~~~~~~~~~~-~G~a~~~~~~~~dip~v~~~~~~~~~pv~~~~~~~~~~~~El~~a~~~g~ 151 (354) .....+.++.+.+ .+.+ ++.+..... .+.+.+++.++ .+|..+...+........++..+.+|.+ +-... . 
T Consensus 139 ~~~~l~~~~~~~~-~~~~--~~~~~~~~~~~~~a~~v~Eg~-~~~~~~~~~~~i~~~~~k~~~~~~is~e-ll~d~---~ 210 (390) T protein:vir:10 139 ARLTVRDLIGSGR-TDSA--LIEYVQETGFVNNAAIVAEGA-LKPESSLKFAKKTDTTHVIAHTMKATRQ-ILSDA---P 210 (390) T ss_pred hhchhhhhcceee-ccCC--ceEEEEEecCCcceeeecCCc-cccccccceeEEEEeeEEEEEeehhhHH-HHHhH---H Confidence 7777777776543 2222 344444443 46777876654 4788888888888899999988888875 43322 2 Q ss_pred CcchHHHHHHHHHHHHHhhheeeeeehhh-CceeeeecCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCccccc Q lcl|NC_021342. 152 PIDAEQARLAFRGAEEHSQSVAYFGDASR-GMYGLFNNPNVTLSSATKDYKTMNGQELFNMLNAPIFSVINLSRRFHVPN 230 (354) Q Consensus 152 ~ld~~k~~aA~~~~a~~~n~~~f~G~~~~-gi~GLlN~p~~~~~~~~~~W~~~T~~ei~~di~~~~~~l~~~s~g~~~p~ 230 (354) ++..--....+++++..+|+.+++|+... ...|++|.++....+.. .+....++++.+++.++... ...+. T Consensus 211 ~l~~~i~~~l~~~~~~~~~~~il~G~G~~~~p~Gi~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~l~~~---~~~~~ 282 (390) T protein:vir:10 211 QLASYMNNRLIRGLKVKEDAEILRGTGANDGLLGLIPQATTYAAPTT-----IAGATRVDQLRLAMLQASLA---EYPAS 282 (390) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcCCCCcccccccccccccccccc-----ccccchHHHHHHHHHhhccc---cCCCC Confidence 57778888899999999999999997543 47899998876543222 11223467788888777642 34567 Q ss_pred EEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeeeeeeeccccccccccCcccEEEEEEcCcceEE Q lcl|NC_021342. 231 TALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQLDAAELAANGVSNSNKPRYMVYDKSDRNLA 310 (354) Q Consensus 231 ~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~~g~g~~g~d~~v~y~~~~~~~~ 310 (354) .++|+|+.|..|.+.. +..|.-++. . .. .+.+-.+...|...+..... + ..++.+.+ +.+. 
T Consensus 283 ~~v~n~~~~~~L~~lk--d~~g~~l~~----~-~~---~~~~~~l~G~pv~~~~~~p~-~-------~~~~gdf~-~~~~ 343 (390) T protein:vir:10 283 GIVINPIDWAAIELAK--DANNQYLIG----N-AR---GTLTPTLWGLPVVATQAMAP-G-------EFLVGAFD-LAAQ 343 (390) T ss_pred EEEEcHHHHHHHHHhh--cCCCceeec----C-Cc---CcCCceecceeeEEcCCCCC-C-------cEEEEecc-ceEE Confidence 8999999999987533 444432221 1 11 11122344444444333211 1 11111211 1122 Q ss_pred EeeCchhhhcc----c-cccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 311 MANPIPFRMLA----P-QMASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 311 ~~vp~~~~~~~----~-~~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) +.....++..- . -.++ ...+.+..|++ +.+++|.|++++++| T Consensus 344 ~~~~~~~~i~~~~~~~~~~~~-~~~~r~~~r~d-~~v~~~~a~~~~~~a 390 (390) T protein:vir:10 344 IFDQWDARVEIGYVNDDFQRN-MVTVLAEERLA-LVVYRPEALISGSFA 390 (390) T ss_pred EEEecceEEEEeecccccccC-cEEEEEEEeec-cEEeccccEEEEEeC Confidence 21112222111 0 1122 34555778875 689999999999999 No 24 >protein:vir:80684 Length: 315 # NCBI annotation: gp6 # Family: family:all:966 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285582;genbank:gi:148727088;genbank:GeneID:5247055 Probab=98.80 E-value=1.6e-09 Score=68.75 Aligned_cols=289 Identities=11% Similarity=-0.022 Sum_probs=152.8 Q ss_pred eeccchhhHHHHHHHHHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeeccccceeEecCCCcccceeeeccce Q lcl|NC_021342. 45 IMLDADGGIAFYISQLAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYDGVTMGKFIGANGQDLPRVAQSAQM 124 (354) Q Consensus 45 ~~~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~v~~~~~~ 124 (354) |...+.+.+.++.. +.+..++++.....-..|++..+.. .+ ...+.+.+....+.+.|++.. ..+|..+...+. 
T Consensus 1 Ma~~~~~~gg~~vP--~~~~~~ii~~l~~~s~i~~l~~~i~-~~--~~~~~ip~~~~~~~a~wv~Eg-~~~~~s~~~f~~ 74 (315) T protein:vir:80 1 MADDFLSAGKLELP--GSMIGAVRDRAIDSGVLAKLSPEQP-TI--FGPVKGAVFSGVPRAKIVGEG-EVKPSASVDVSA 74 (315) T ss_pred CCCCcCCcCceEcc--hHHHHHHHHHHHhhchhhhhcceee-cC--CCceEEEEEeCCcceEEeeCC-ccccccccceee Confidence 22333333333333 4556778887777777777655432 22 224566667777888899875 457888888888 Q ss_pred eEEEEEEEEeeEeecHHHHHHHHHhC-CCcchHHHHHHHHHHHHHhhheeeeeehhh---CceeeeecCCcccccccccc Q lcl|NC_021342. 125 HTVPLGYAGNECHYTLDEMRKSAAMN-MPIDAEQARLAFRGAEEHSQSVAYFGDASR---GMYGLFNNPNVTLSSATKDY 200 (354) Q Consensus 125 ~~~pv~~~~~~~~~~~~El~~a~~~g-~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~---gi~GLlN~p~~~~~~~~~~W 200 (354) .....+.++.-..+|.+=++.....- -.|...-.+..++++++.+|+.+|+|+... +..|+.+.-+... . T Consensus 75 v~l~~~kl~~~~~iS~ell~~s~~~~~~~l~~~i~~~la~ai~~~~d~a~~~G~~~~~~~~~~~~~~~~~~~~-----~- 148 (315) T protein:vir:80 75 FTAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKAASAVHTSLNKTK-----N- 148 (315) T ss_pred eEeeeeeEEeeehhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHhhheeeccCCCCCcccccccccccccc-----c- Confidence 88888888877777654332221111 125566678889999999999999996432 3334333211110 0 Q ss_pred cccCHHHHHHHHHHHHHHHHHHhCCcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeeee Q lcl|NC_021342. 201 KTMNGQELFNMLNAPIFSVINLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQ 280 (354) Q Consensus 201 ~~~T~~ei~~di~~~~~~l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~ 280 (354) ........+.||.+++..+... +...+...+|+|..+..|.+-......+ +--.++.. .. ..|.+-++...|. 
T Consensus 149 ~~~~~~~~~~d~~~~~~~~~~~--~~~~~~~~imn~~~~~~L~~l~~~~g~~-~~g~~~~~-~~---~~g~~~tl~G~PV 221 (315) T protein:vir:80 149 IVDATDSATADLVKAVGLIAGA--GLQVPNGVALDPAFSFALSTEVYPKGSP-LAGQPMYP-AA---GFAGLDNWRGLNV 221 (315) T ss_pred eeeccccchHHHHHHHHHHhhc--cCccceEEEEcHHHHHHHHHHhhccCCc-cccccccc-cc---ccCCCceecceee Confidence 1112334568888888777542 3445668999999999986543221111 11111110 00 0122223444444 Q ss_pred eeecccccc-ccccCcccEEEEEEcCc------ceEEEeeCchhhhccccccCc----eeEEeeeeeeeeEEEECCceeE Q lcl|NC_021342. 281 LDAAELAAN-GVSNSNKPRYMVYDKSD------RNLAMANPIPFRMLAPQMASL----GITVPAEYKISGTEFRYPLCAA 349 (354) Q Consensus 281 L~~~~~~~~-g~g~~g~d~~v~y~~~~------~~~~~~vp~~~~~~~~~~~~l----~~~~~~~~~~gGv~i~~P~ai~ 349 (354) +.+...... ..+...+..++.-+.+. +.+.+.+-..-.. .....++ ...+.++.|+ |..+++|.|++ T Consensus 222 ~~~~~~~~~~~~~~~~~~~~~~GDfs~~~~g~~~~~~i~i~~~~~~-~~~~~~~~~~~~v~~r~~~r~-~~~v~~~~a~~ 299 (315) T protein:vir:80 222 GASSTVSGAPEMSPASGVKAIVGDFSRVHWGFQRNFPIELIEYGDP-DQTGRDLKGHNEVMVRAEAVL-YVAIESLDSFA 299 (315) T ss_pred EecCcCCcccccccccccEEEEeecccEEEEEecCeeEEEeccccc-cCcccchhhcCcEEEEEEEEe-cceeecccceE Confidence 433332211 11112222333223221 2222222110000 0001111 2556677887 58899999999 Q ss_pred eeecC Q lcl|NC_021342. 350 YVDMA 354 (354) Q Consensus 350 y~D~~ 354 (354) ++..+ T Consensus 300 ~l~~~ 304 (315) T protein:vir:80 300 VVKEK 304 (315) T ss_pred EEeec Confidence 99877 No 25 >protein:vir:94142 Length: 304 # NCBI annotation: ORF013 # Family: family:all:507 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240234;genbank:gi:66395898;genbank:GeneID:5133311 Probab=98.78 E-value=2.6e-09 Score=67.64 Aligned_cols=284 Identities=11% Similarity=0.015 Sum_probs=155.4 Q ss_pred hhhhhccCCceeccchhhHHHHHHHHHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeeccccceeEecCCCcc Q lcl|NC_021342. 35 TALDAIGNPNIMLDADGGIAFYISQLAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYDGVTMGKFIGANGQD 114 (354) Q Consensus 35 ~am~a~~~~~~~~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~d 114 (354) -|-....... 
+++.+.++..+. +.+.+.+++...+....+++..+.. .+. ....+......+.+.+++... . T Consensus 1 ma~~~~~~~~-~~~t~~gg~lip---~~~~~~ii~~~~~~~~l~~~~~~~~-~~~--~~~~ip~~~~~~~a~~v~E~~-~ 72 (304) T protein:vir:94 1 MATPTYTPGN-VILSDFKNGVIP---AEQGTLIMKDIMANSAIMKLAKNEP-MTA--QKKKFTYLAKGVGAYWVSETE-R 72 (304) T ss_pred Cccccccccc-ccccCCCceecc---hhHHHHHHHHHHhccchhhhcceee-ccC--CceEEEEEeCCcceEEeecCc-c Confidence 1111111111 122223333333 3455678887777777777766543 222 234555666677788887654 5 Q ss_pred cceeeeccceeEEEEEEEEeeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehhhCceeeeecCCcccc Q lcl|NC_021342. 115 LPRVAQSAQMHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDASRGMYGLFNNPNVTLS 194 (354) Q Consensus 115 ip~v~~~~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~gi~GLlN~p~~~~~ 194 (354) +|..+...+........++..+.++.+=++.+ ..++...-....++++++.+|+.+++|+...+-.|.+....++.. T Consensus 73 ~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds---~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~~~~~~~~~~~ 149 (304) T protein:vir:94 73 IQTSKPEYAQAEMEAKKIGVIIPLSKEFLKWT---AKDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTSTSGKPLVEGA 149 (304) T ss_pred cccccceeeEEEEEEEEEEEeehhhHHHHhcc---hHHHHHHHHHHHHHHHHHHHHhhheeccCCCcccccccccccccc Confidence 78888888888888899998888876444433 467888888889999999999999999876554554444333322 Q ss_pred cccccccccCHHHHHHHHHHHHHHHHHHhCCcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccce Q lcl|NC_021342. 195 SATKDYKTMNGQELFNMLNAPIFSVINLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELD 274 (354) Q Consensus 195 ~~~~~W~~~T~~ei~~di~~~~~~l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~ 274 (354) ....... .+....++||.+++.++... ...+..++|+|+.|..|.+.. +..+.-++ ..+. 
...-|.|+ T Consensus 150 ~~~~~~~-~~~~~~~~~i~~~~~~l~~~---~~~~~~~v~~~~~~~~L~~lk--d~~G~~l~----~~~~-~~l~G~PV- 217 (304) T protein:vir:94 150 EEKGNVV-TDTNNLYVDLSALMATIEDE---ELDPNGVLTTRSFRSKMRNAL--DANDRPLF----DANG-NEIMGLPL- 217 (304) T ss_pred ccccccc-ccccchHHHHHHHHHHhhhc---cCCcCEEEEcHHHHHHHHHhh--ccCCcEee----cCCC-ccccceee- Confidence 2111111 12334588999998888642 345668999999999996533 33332211 1111 11223332 Q ss_pred eeeeeeeeeccccccccccCcccEEEEEEcCcceEEEeeCchhhhc----------ccc-cc----C-c---eeEEeeee Q lcl|NC_021342. 275 IQIRFQLDAAELAANGVSNSNKPRYMVYDKSDRNLAMANPIPFRML----------APQ-MA----S-L---GITVPAEY 335 (354) Q Consensus 275 I~~~~~L~~~~~~~~g~g~~g~d~~v~y~~~~~~~~~~vp~~~~~~----------~~~-~~----~-l---~~~~~~~~ 335 (354) ........ ..++. .+++- |.+.+.+..-..++.- .-+ .. + . ...+.++. T Consensus 218 ------~~~~~~~~----~~~~~-~~~~g-d~~~~~~~~~~~~~i~~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~ 285 (304) T protein:vir:94 218 ------SYTGADVY----DKKKS-LALMG-DWDYARYGILQGIEYAISEDATLTTLQASDASGQPVSLFERDMFALRATM 285 (304) T ss_pred ------EEeccccc----CCCCc-EEEEE-ehhhEEEEEecceEEEEeecceeeeecccccCccchhhhhcCcEEEEEEE Confidence 22221111 01111 12221 1122212111111110 000 00 0 1 24556778 Q ss_pred eeeeEEEECCceeEeeecC Q lcl|NC_021342. 336 KISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 336 ~~gGv~i~~P~ai~y~D~~ 354 (354) |++ ..+++|.|++.+--| T Consensus 286 r~~-~~v~~~~a~~~l~~a 303 (304) T protein:vir:94 286 HIA-YMNVKPEAFATLKPT 303 (304) T ss_pred Eec-cEeecccceEEEEec Confidence 885 667779999999999 No 26 >protein:vir:105905 Length: 304 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004375;genbank:gi:122891830;genbank:GeneID:4712376 Probab=98.78 E-value=2.6e-09 Score=67.64 Aligned_cols=284 Identities=11% Similarity=0.015 Sum_probs=155.4 Q ss_pred hhhhhccCCceeccchhhHHHHHHHHHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeeccccceeEecCCCcc Q lcl|NC_021342. 
35 TALDAIGNPNIMLDADGGIAFYISQLAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYDGVTMGKFIGANGQD 114 (354) Q Consensus 35 ~am~a~~~~~~~~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~d 114 (354) -|-....... +++.+.++..+. +.+.+.+++...+....+++..+.. .+. ....+......+.+.+++... . T Consensus 1 ma~~~~~~~~-~~~t~~gg~lip---~~~~~~ii~~~~~~~~l~~~~~~~~-~~~--~~~~ip~~~~~~~a~~v~E~~-~ 72 (304) T protein:vir:10 1 MATPTYTPGN-VILSDFKNGVIP---AEQGTLIMKDIMANSAIMKLAKNEP-MTA--QKKKFTYLAKGVGAYWVSETE-R 72 (304) T ss_pred Cccccccccc-ccccCCCceecc---hhHHHHHHHHHHhccchhhhcceee-ccC--CceEEEEEeCCcceEEeecCc-c Confidence 1111111111 122223333333 3455678887777777777766543 222 234555666677788887654 5 Q ss_pred cceeeeccceeEEEEEEEEeeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehhhCceeeeecCCcccc Q lcl|NC_021342. 115 LPRVAQSAQMHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDASRGMYGLFNNPNVTLS 194 (354) Q Consensus 115 ip~v~~~~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~gi~GLlN~p~~~~~ 194 (354) +|..+...+........++..+.++.+=++.+ ..++...-....++++++.+|+.+++|+...+-.|.+....++.. T Consensus 73 ~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds---~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~~~~~~~~~~~ 149 (304) T protein:vir:10 73 IQTSKPEYAQAEMEAKKIGVIIPLSKEFLKWT---AKDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTSTSGKPLVEGA 149 (304) T ss_pred cccccceeeEEEEEEEEEEEeehhhHHHHhcc---hHHHHHHHHHHHHHHHHHHHHhhheeccCCCcccccccccccccc Confidence 78888888888888899998888876444433 467888888889999999999999999876554554444333322 Q ss_pred cccccccccCHHHHHHHHHHHHHHHHHHhCCcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccce Q lcl|NC_021342. 195 SATKDYKTMNGQELFNMLNAPIFSVINLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELD 274 (354) Q Consensus 195 ~~~~~W~~~T~~ei~~di~~~~~~l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~ 274 (354) ....... .+....++||.+++.++... ...+..++|+|+.|..|.+.. +..+.-++ ..+. 
...-|.|+ T Consensus 150 ~~~~~~~-~~~~~~~~~i~~~~~~l~~~---~~~~~~~v~~~~~~~~L~~lk--d~~G~~l~----~~~~-~~l~G~PV- 217 (304) T protein:vir:10 150 EEKGNVV-TDTNNLYVDLSALMATIEDE---ELDPNGVLTTRSFRSKMRNAL--DANDRPLF----DANG-NEIMGLPL- 217 (304) T ss_pred ccccccc-ccccchHHHHHHHHHHhhhc---cCCcCEEEEcHHHHHHHHHhh--ccCCcEee----cCCC-ccccceee- Confidence 2111111 12334588999998888642 345668999999999996533 33332211 1111 11223332 Q ss_pred eeeeeeeeeccccccccccCcccEEEEEEcCcceEEEeeCchhhhc----------ccc-cc----C-c---eeEEeeee Q lcl|NC_021342. 275 IQIRFQLDAAELAANGVSNSNKPRYMVYDKSDRNLAMANPIPFRML----------APQ-MA----S-L---GITVPAEY 335 (354) Q Consensus 275 I~~~~~L~~~~~~~~g~g~~g~d~~v~y~~~~~~~~~~vp~~~~~~----------~~~-~~----~-l---~~~~~~~~ 335 (354) ........ ..++. .+++- |.+.+.+..-..++.- .-+ .. + . ...+.++. T Consensus 218 ------~~~~~~~~----~~~~~-~~~~g-d~~~~~~~~~~~~~i~~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~ 285 (304) T protein:vir:10 218 ------SYTGADVY----DKKKS-LALMG-DWDYARYGILQGIEYAISEDATLTTLQASDASGQPVSLFERDMFALRATM 285 (304) T ss_pred ------EEeccccc----CCCCc-EEEEE-ehhhEEEEEecceEEEEeecceeeeecccccCccchhhhhcCcEEEEEEE Confidence 22221111 01111 12221 1122212111111110 000 00 0 1 24556778 Q ss_pred eeeeEEEECCceeEeeecC Q lcl|NC_021342. 336 KISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 336 ~~gGv~i~~P~ai~y~D~~ 354 (354) |++ ..+++|.|++.+--| T Consensus 286 r~~-~~v~~~~a~~~l~~a 303 (304) T protein:vir:10 286 HIA-YMNVKPEAFATLKPT 303 (304) T ss_pred Eec-cEeecccceEEEEec Confidence 885 667779999999999 No 27 >protein:vir:104256 Length: 458 # NCBI annotation: major head protein precursor # Family: family:all:27070 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006977;genbank:gi:46401878;genbank:GeneID:2777673 Probab=98.78 E-value=4.5e-09 Score=66.33 Aligned_cols=322 Identities=7% Similarity=-0.069 Sum_probs=158.6 Q ss_pred Ccccchh------HHHHhhh---hhhhcccccccccchhhhhhhhhhhccCCceeccchhhHHHHHHHHHHHHHHHHHhh Q lcl|NC_021342. 
1 MAIKTID------AQTIQGN---QWLVHKGYVSRNGDQWVINNTALDAIGNPNIMLDADGGIAFYISQLAGIEATVYETP 71 (354) Q Consensus 1 ~~~~~~~------~~~~~~~---~~~~~~~~~~~~~~~~~~~~~am~a~~~~~~~~da~~~~~fl~~~L~~Id~~v~e~~ 71 (354) ...+..+ .+..+.. ..+...+.. +-.....++.+. ...+++ +.+..+.. +.+.+.+++.. T Consensus 118 ~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~-----~~~~~~~~~~a~--~~~~~~--~~g~~~ip--~~~~~~ii~~~ 186 (458) T protein:vir:10 118 SVAKALYGTQENFEDEVEKLVLLSYVMEKGVF-----ETEHGQRHLKAV--NQSSSV--EVSSESYE--TIFSQRIIRDL 186 (458) T ss_pred hhhccchhhhhhHHHHHHHHHHHHHHHhhccc-----hhhhhhhhhhhh--hhcccC--ccccceeh--hhHhHHHHHHH Confidence 0000000 0000000 000000000 000000011111 111111 12233333 55677788877 Q ss_pred hhcccchhhccccCCCCCceeEEEEEeeccccceeEecCCCcccc------eeeeccceeEEEEEEEEeeEeecHHHHHH Q lcl|NC_021342. 72 YGDITYRFDVPMAANIPEYADTWMYRSYDGVTMGKFIGANGQDLP------RVAQSAQMHTVPLGYAGNECHYTLDEMRK 145 (354) Q Consensus 72 ~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip------~v~~~~~~~~~pv~~~~~~~~~~~~El~~ 145 (354) ......+.+..+. +.+.+ ...+.+....+.+.|++.+.. .| ..+...+......+.++..+.+|..=|.. T Consensus 187 ~~~~~l~~~~~~~-~~~~~--~~~~~~~~~~~~a~~v~e~~~-~~~~~~~~~~~~~~~~i~~~~~k~~~~v~is~ell~d 262 (458) T protein:vir:10 187 QKELVVGALFEEL-PMSSK--ILTMLVEPDAGKATWVAASTY-GTDTTTGEEVKGALKEIHFSTYKLAAKSFITDETEED 262 (458) T ss_pred HhhhhHHhhccee-ecCCc--ceEEEEecCCcceeecccccc-cccccccccccccceeeEeeeeeEEeeehhhHHHHhc Confidence 7777777776543 22222 334444444566677654422 22 12233455666777777777777653333 Q ss_pred HHHhCCCcchHHHHHHHHHHHHHhhheeeeeehhhCceeeeecCCcccccccccccccCHH-HHHHHHHHHHHHHHHHhC Q lcl|NC_021342. 146 SAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDASRGMYGLFNNPNVTLSSATKDYKTMNGQ-ELFNMLNAPIFSVINLSR 224 (354) Q Consensus 146 a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~gi~GLlN~p~~~~~~~~~~W~~~T~~-ei~~di~~~~~~l~~~s~ 224 (354) + ..++..--....+.++...+|+-+++|+......|++|+++....+....++...+. --+++|.+++..+... 
T Consensus 263 s---~~~~~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~-- 337 (458) T protein:vir:10 263 A---IFSLLPLLRKRLIEAHAVSIEEAFMTGDGSGKPKGLLTLASEDSAKVVTEAKADGSVLVTAKTISKLRRKLGRH-- 337 (458) T ss_pred c---hHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCccceeeecccccccceeecccccccccccHHHHHHHHHhhhhh-- Confidence 2 346777788889999999999999999977778999999886544333222221111 1257777787777542 Q ss_pred CcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeeeeeeeccccccccccCcccEEEEEEc Q lcl|NC_021342. 225 RFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQLDAAELAANGVSNSNKPRYMVYDK 304 (354) Q Consensus 225 g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~~g~g~~g~d~~v~y~~ 304 (354) + ..+..++|+|..|..|..-. +..|.-++..-..+. ...|.+.++...|.+....... +++.++ +++-. T Consensus 338 ~-~~~~~~v~~~~~~~~l~~lk--d~~G~~i~~~~~~~~---~~~~~~~~l~G~pv~~~~~~p~---~~~~~~--~~~~~ 406 (458) T protein:vir:10 338 G-LKLSKLVLIVSMDAYYDLLE--DEEWQDVAQVGNDSV---KLQGQVGRIYGLPVVVSEYFPA---KANSAE--FAVIV 406 (458) T ss_pred h-cCCCEEEEcHHHHHHHHhhc--ccCCceeeccccccc---cccCcCceecceeeEEcccccc---ccCCcc--eEEEE Confidence 2 34678999999999886532 333321211111111 1223333444445444433211 111222 12211 Q ss_pred CcceEEEeeCchhhhccccccCc-eeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 305 SDRNLAMANPIPFRMLAPQMASL-GITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 305 ~~~~~~~~vp~~~~~~~~~~~~l-~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) -.+.+.+..-..++...-.+-.. 
...+....|+ |..+++|.++++...| T Consensus 407 f~~~~~~~~~~~~~v~~d~~~~~~~~~~~~~~r~-~~~v~~~~a~v~~~~a 456 (458) T protein:vir:10 407 YKDNFVMPRQRAVTVERERQAGKQRDAYYVTQRV-NLQRYFANGVVSGTYA 456 (458) T ss_pred ecccEEEEEeeceEEEeecccCCCceEEEEEEEe-cceEecccceEEEeec Confidence 22333333222333322111111 2455667776 5889999999999999 No 28 >protein:vir:41 Length: 299 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463467;swissprot:trembl:q9t1b7;genbank:gi:16798789;uniprot:Q9T1B7;genbank:GeneID:922353 Probab=98.78 E-value=4.7e-09 Score=66.25 Aligned_cols=277 Identities=7% Similarity=-0.050 Sum_probs=160.3 Q ss_pred ccCCceeccc-hhhHHHHHHHHHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeeccccceeEecCCCccccee Q lcl|NC_021342. 40 IGNPNIMLDA-DGGIAFYISQLAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYDGVTMGKFIGANGQDLPRV 118 (354) Q Consensus 40 ~~~~~~~~da-~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~v 118 (354) ++..+..... +.++..+. +.+..++++.....-..+++..+. +.+.+...+ ..... ..+.+++. +..+|.. T Consensus 1 ~g~~a~~~~~~~~~~~~iP---~~~~~~ii~~~~~~s~l~~~~~~~-~~~~~~~~~--~~~~~-~~a~~v~E-~~~~~~~ 72 (299) T protein:vir:41 1 MGFNPDTTTMQSAKTGSIP---INISEQIITGVKNGSAAMKLAKAV-PMTKPEEEF--TFMSG-VGAFWVDE-AERIQTS 72 (299) T ss_pred CCcCCCcccccCCCceecc---hhHHHHHHHHHHhcchhhhhceee-ecCCCcEEE--EEEcC-Cceeeeec-Ccccccc Confidence 3322222222 12222222 456677888777777777776653 333333333 33333 45778865 4558888 Q ss_pred eeccceeEEEEEEEEeeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehhhCceeeeecCCcccccccc Q lcl|NC_021342. 119 AQSAQMHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDASRGMYGLFNNPNVTLSSATK 198 (354) Q Consensus 119 ~~~~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~gi~GLlN~p~~~~~~~~~ 198 (354) +...+........++.-+.++.+=++.+ ..++...-....++++++.+|+.+++|+....-.|+++........+.. 
T Consensus 73 ~~~f~~v~l~~~k~~~~~~is~ell~ds---~~~~~~~i~~~l~~a~~~~~d~a~l~G~g~~~~~gil~~~~~~~~~~~~ 149 (299) T protein:vir:41 73 KPTFTKAKMRSKKMGVIIPTTKENLNYS---VTNFFSLMQAEIVEAFYKKFDQAVFTGVESPYNWNILKSATDASNLVEE 149 (299) T ss_pred ccceeEEEEeeEEEEEeehhhHHHHhcC---HHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccccccccccccceeecc Confidence 8888888899999999999987555433 3578888899999999999999999998776667888765432222211 Q ss_pred cccccCHHHHHHHHHHHHHHHHHHhCCcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeee Q lcl|NC_021342. 199 DYKTMNGQELFNMLNAPIFSVINLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIR 278 (354) Q Consensus 199 ~W~~~T~~ei~~di~~~~~~l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~ 278 (354) . ..-++||.+++.++... ...+..++|+|..|..|.+.. +..+.-++ ...+ ..+. -.+... T Consensus 150 ~------~~~~~~l~~~~~~l~~~---~~~~~~~v~n~~~~~~L~~lk--d~~G~~l~----~~~~---~~~~-~~l~G~ 210 (299) T protein:vir:41 150 T------ANKYDDLNEAIGLIEAE---DLEPNGIATIRKQRVKYRSTK--DGNGMPIF----NTAT---SNGV-DDVLGL 210 (299) T ss_pred c------cccHHHHHHHHHhhhcc---cCCcCEEEEcHHHHHHHHHhh--ccCCceee----cCCc---CCCC-ceecce Confidence 1 12268888998887642 245678999999999997633 33332221 1111 0111 134444 Q ss_pred eeeeeccccccccccCcccEEEEEEcCcceEEEeeCchhhhcccc------------------ccCceeEEeeeeeeeeE Q lcl|NC_021342. 279 FQLDAAELAANGVSNSNKPRYMVYDKSDRNLAMANPIPFRMLAPQ------------------MASLGITVPAEYKISGT 340 (354) Q Consensus 279 ~~L~~~~~~~~g~g~~g~d~~v~y~~~~~~~~~~vp~~~~~~~~~------------------~~~l~~~~~~~~~~gGv 340 (354) |...+..... + +.+..+++-+ ...+.+..-..++..... .++ ...+.++.++ |. T Consensus 211 PV~~~~~~~~-~----~~~~~~~~gd-fs~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~r~~~~~-d~ 282 (299) T protein:vir:41 211 PIAYTPKYTF-G----DKDISELVGD-WNQAYYGILRGVEYEILTEATLTTVADETGKPLNLAERD-MAAIKATFEV-GF 282 (299) T ss_pred eeEEecccCC-C----CCceEEEEEe-cccEEEEEecCcEEEEeecccccccccccccchhhhhcC-cEEEEEEEEe-cc Confidence 4444443321 1 1111222211 121222222222221110 112 2456778888 57 Q ss_pred EEECCceeEeeecC Q lcl|NC_021342. 
341 EFRYPLCAAYVDMA 354 (354) Q Consensus 341 ~i~~P~ai~y~D~~ 354 (354) .+++|.|++.+-.+ T Consensus 283 ~v~~~~A~~~l~~~ 296 (299) T protein:vir:41 283 MVVKDEAFSAVQPK 296 (299) T ss_pred EEecccceEEEEec Confidence 78889999999888 No 29 >protein:vir:1638 Length: 298 # NCBI annotation: Structural protein # Family: family:all:966 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695059;genbank:gi:23455750;genbank:GeneID:955469 Probab=98.77 E-value=2.7e-09 Score=67.57 Aligned_cols=280 Identities=9% Similarity=-0.020 Sum_probs=160.0 Q ss_pred eeccchhhHHHHHHHHHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeeccccceeEecCCCcccceeeeccce Q lcl|NC_021342. 45 IMLDADGGIAFYISQLAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYDGVTMGKFIGANGQDLPRVAQSAQM 124 (354) Q Consensus 45 ~~~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~v~~~~~~ 124 (354) |.++ ++..+. +.+.+++++...+.-..+++.++.. .+.+. ..+......+.+.|++.. ..+|..+...+. T Consensus 1 ma~~---gG~lvp---~~~~~~ii~~~~~~s~i~~l~~~~~-~~~~~--~~ip~~~~~~~a~~v~E~-~~~~~~~~~f~~ 70 (298) T protein:vir:16 1 MVLN---KGTLFD---PTLVTDLISKVAGKSSIARLSAQKP-IPFNG--EKVFTFTMDSEIDVVAES-GKKTHGGVTLAP 70 (298) T ss_pred Cccc---Ccceec---hhHHHHHHHHHHhhhhhhhhcceee-ccCCc--eEEEEEecCcceEEecCC-ccccccccceeE Confidence 2221 222222 3345667777777777777776442 22222 345556667888998765 568888888888 Q ss_pred eEEEEEEEEeeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeeh-----hhCceeeeecCCccccccccc Q lcl|NC_021342. 125 HTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDA-----SRGMYGLFNNPNVTLSSATKD 199 (354) Q Consensus 125 ~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~-----~~gi~GLlN~p~~~~~~~~~~ 199 (354) .....+.++.-..+|.+=|........++...-+...++++++.+|+.+|+|.. ..++.|+....+....... 
T Consensus 71 v~l~~~k~a~~~~iS~ell~~s~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~-- 148 (298) T protein:vir:16 71 QTMVPIKVEYGARISDEFMYASDEEKINILQEFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVE-- 148 (298) T ss_pred EEEeeeeEEEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCCcccccccccccccccccccc-- Confidence 888899998888887655554444456777788889999999999999999953 1233444333332211111 Q ss_pred ccccCHHHHHHHHHHHHHHHHHHhCCcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeee Q lcl|NC_021342. 200 YKTMNGQELFNMLNAPIFSVINLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRF 279 (354) Q Consensus 200 W~~~T~~ei~~di~~~~~~l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~ 279 (354) .. ......+.||.+++.++... ...+..++|+|..+..|.+. -+..+.-++.-. ...+.+-++...| T Consensus 149 ~~-~~~~~~~~~i~~~~~~~~~~---~~~~~~~vmn~~~~~~l~~l--kd~~G~~i~~~~-------~~~~~~~~l~G~P 215 (298) T protein:vir:16 149 AP-RGIADPNGAIENAVELLTGV---DADVTGIAINPSFRSALAKQ--KDLQDNALFPEL-------KWGATPDTINGLP 215 (298) T ss_pred cc-cccccHHHHHHHHHHHhhhc---CCCccEEEEcHHHHHHHHHh--hccCCCeeecCc-------ccCCCCceeccee Confidence 11 11234578999999888652 24566899999999998653 244443332111 1123333444444 Q ss_pred eeeeccccccccccCcccEEEEEEcCcceEEEeeCchh--hhccc-cc---------cCceeEEeeeeeeeeEEEECCce Q lcl|NC_021342. 280 QLDAAELAANGVSNSNKPRYMVYDKSDRNLAMANPIPF--RMLAP-QM---------ASLGITVPAEYKISGTEFRYPLC 347 (354) Q Consensus 280 ~L~~~~~~~~g~g~~g~d~~v~y~~~~~~~~~~vp~~~--~~~~~-~~---------~~l~~~~~~~~~~gGv~i~~P~a 347 (354) ......... ...++++.+++-+.+ +.+.+.+...+ ...+. .. ++ -.-+.++.|+ |..+++|.| T Consensus 216 V~~~~~v~~--~~~~~~~~~~~GDfs-~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~-~v~~ra~~r~-d~~v~~~~a 290 (298) T protein:vir:16 216 VDVNKTVSD--MSLTQRDRAIIGDFA-NGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYN-QVYIRAELFL-GWGILDATK 290 (298) T ss_pred eEEeccccc--ccCCCccEEEEeecc-ceEEEEEecCceEEEeeccCCcCcchhhhhcC-cEEEEEEEEE-ccEeecccc Confidence 444333222 122344554443332 21222222221 11111 00 11 1335567776 588999999 Q ss_pred eEeeecC Q lcl|NC_021342. 
348 AAYVDMA 354 (354) Q Consensus 348 i~y~D~~ 354 (354) ++++--| T Consensus 291 ~~~l~~a 297 (298) T protein:vir:16 291 FARVTEA 297 (298) T ss_pred eEEEeec Confidence 9999999 No 30 >protein:vir:95763 Length: 297 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950590;genbank:gi:119953785;genbank:GeneID:5076833 Probab=98.76 E-value=4.6e-09 Score=66.29 Aligned_cols=277 Identities=6% Similarity=-0.076 Sum_probs=157.2 Q ss_pred hhhccCCce-eccchhhHHHHHHHHHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeeccccceeEecCCCccc Q lcl|NC_021342. 37 LDAIGNPNI-MLDADGGIAFYISQLAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYDGVTMGKFIGANGQDL 115 (354) Q Consensus 37 m~a~~~~~~-~~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~di 115 (354) |....-.+. .+.+..++..+- +.+..++++.....-..+++.++..-.+.+ ...+........+.+++.. ..+ T Consensus 1 m~~~~~~~~~~~~t~~~~~lvP---~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~--~~~~~~~~~~~~a~~v~Eg-~~~ 74 (297) T protein:vir:95 1 MTVQTFNPENVLVSQKKDGTLH---KEFTDIIMKEVAQNSLVMQLGQYQEMEGEQ--EKTVYVQTDGISAYWVNET-EKI 74 (297) T ss_pred CCccccccccccccCCCcceec---hhHHHHHHHHHHhhchhhhhcceeecCCCc--cEEEEEEcCCceeEEeecC-ccc Confidence 444321111 111223333333 455577788777777777777664322222 2334445556677888765 458 Q ss_pred ceeeeccceeEEEEEEEEeeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehhhCceeeeecCCccccc Q lcl|NC_021342. 116 PRVAQSAQMHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDASRGMYGLFNNPNVTLSS 195 (354) Q Consensus 116 p~v~~~~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~gi~GLlN~p~~~~~~ 195 (354) |..+...+........++....++.+-++.+. .++...-....++++++.+|+.+++|+...+-.|+++........ 
T Consensus 75 ~~~~~~f~~v~l~~~k~~~~~~is~ell~ds~---~~l~~~i~~~la~ai~~~~d~a~l~G~g~~~~~gi~~~~~~~~~~ 151 (297) T protein:vir:95 75 KTDKPEVVPVTLKAHKLGIILVTSREALNYTW---KKFFEDMKPQIVEAFYKKIDEAGLLGHDTPFANSVAKAAKDANKV 151 (297) T ss_pred cccccceeEEEEeeEEEEEeehhhHHHHhcCH---HHHHHHHHHHHHHHHHHHHHHHHhcccCCccccccccccccccee Confidence 88888888888999999999998876565443 468888889999999999999999998877778888765432211 Q ss_pred ccccccccCHHHHHHHHHHHHHHHHHHhCCcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCccccccccccee Q lcl|NC_021342. 196 ATKDYKTMNGQELFNMLNAPIFSVINLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDI 275 (354) Q Consensus 196 ~~~~W~~~T~~ei~~di~~~~~~l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I 275 (354) . .+. - -++||.+++.++... + ..+..++|+|..+..|.+-. +..+.-++ . .......|.| T Consensus 152 ~-~~~--~----t~~~i~~~~~~l~~~--~-~~~~~~v~~~~~~~~L~~l~--d~~G~~i~----~-~~~~~l~G~P--- 211 (297) T protein:vir:95 152 I-GGP--I----NYDNILKLQDALYDA--D-VEPNAFVSKIQNRSALREAR--DGNKVSIY----D-KAANTIDGIT--- 211 (297) T ss_pred c-ccc--c----CHHHHHHHHHHhhhc--c-CCcCEEEEcHHHHHHHHHhh--ccCCceee----c-CCCCccccee--- Confidence 1 111 1 257788888888653 2 34678999999999996532 33332111 1 1111112222 Q ss_pred eeeeeeeeccccccccccCcccEEEEEEcCc------ceEEEeeCchhhhcc------c----cccCceeEEeeeeeeee Q lcl|NC_021342. 276 QIRFQLDAAELAANGVSNSNKPRYMVYDKSD------RNLAMANPIPFRMLA------P----QMASLGITVPAEYKISG 339 (354) Q Consensus 276 ~~~~~L~~~~~~~~g~g~~g~d~~v~y~~~~------~~~~~~vp~~~~~~~------~----~~~~l~~~~~~~~~~gG 339 (354) ...... ....++..+.-+.+. +.+.+.+-.+..... . -.++ ...+.+.++++ T Consensus 212 ----v~~~~~------~~~~~~~~~~gd~s~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~r~~~~~d- 279 (297) T protein:vir:95 212 ----TVDLKS------ARFEKGDLLAGDFDNLIYGVPYNITYKISEEGQISTITNADGTPINLFEQE-MIAIRATMDIA- 279 (297) T ss_pred ----eEeecC------CCCCCceEEEEecccEEEEEecCeEEEEeeccccccccccCccchhhhhcC-cEEEEEEEEec- Confidence 211111 001111222222111 112222211111100 0 0112 24566677774 Q ss_pred EEEECCceeEeeecC Q lcl|NC_021342. 
340 TEFRYPLCAAYVDMA 354 (354) Q Consensus 340 v~i~~P~ai~y~D~~ 354 (354) ..+.+|.|++.+=.| T Consensus 280 ~~v~~~~a~~~l~~a 294 (297) T protein:vir:95 280 VMITKTDAFAKLTPA 294 (297) T ss_pred cEeecccceEEEeec Confidence 778889999999999 No 31 >protein:vir:1433 Length: 435 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536362;genbank:gi:17975167;genbank:GeneID:929171 Probab=98.74 E-value=1.3e-08 Score=63.84 Aligned_cols=326 Identities=11% Similarity=0.012 Sum_probs=165.1 Q ss_pred Ccc-----------cchhHHHHhhhhhh---hccc-------------cc-------ccccc--hhhhh-hhhhhhccCC Q lcl|NC_021342. 1 MAI-----------KTIDAQTIQGNQWL---VHKG-------------YV-------SRNGD--QWVIN-NTALDAIGNP 43 (354) Q Consensus 1 ~~~-----------~~~~~~~~~~~~~~---~~~~-------------~~-------~~~~~--~~~~~-~~am~a~~~~ 43 (354) -.. +....+........ ++.+ +. ..... ..... ....... . T Consensus 53 ~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~ 130 (435) T protein:vir:14 53 ERAEAAERMAAAAAVPVDPNPTAVAAPAAAPVHAQPKALEVKGAKMARMVRALAAARGDAQLASKLAIERGFGEEVA--M 130 (435) T ss_pred HHHHHHHHHHHhhcccccchhhhhhhccccccccccchhhhhHHHHHHHHHHHHhhcchhhHHHHHHHhhhhhhhhh--h Confidence 000 00000000000000 0000 00 00000 00000 0000000 1 Q ss_pred ceeccchhhHHHHHHHHHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeeccccceeEecCCCcccceeeeccc Q lcl|NC_021342. 44 NIMLDADGGIAFYISQLAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYDGVTMGKFIGANGQDLPRVAQSAQ 123 (354) Q Consensus 44 ~~~~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~v~~~~~ 123 (354) .++......+.+++. +.+...+++...+....+.+..-..+...+ .+.+......+.+.|++.. 
..+|..+...+ T Consensus 131 ~~~~~t~~~gg~~vP--~~~~~~ii~~l~~~~~i~~~~~~~~~~~~~--~~~~p~~~~~~~a~~v~E~-~~~~~~~~~f~ 205 (435) T protein:vir:14 131 SLNTLSPGAGGVLVP--ENLSSEVIELLRPKSVVRKLGARTLPLSNG--NITIPRLKGGAIVGYIGAD-TDIPTTQQQFD 205 (435) T ss_pred hcccCCcCCCccccc--hhHHHHHHHHHhhhchhhhhcceeeecCCC--ceEEEEEeCCcceeeeccC-cccccccccee Confidence 111112222334444 456677888776666665542211122222 3455666666777787664 45788887778 Q ss_pred eeEEEEEEEEeeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehh-hCceeeeecCCcccccccccccc Q lcl|NC_021342. 124 MHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDAS-RGMYGLFNNPNVTLSSATKDYKT 202 (354) Q Consensus 124 ~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~-~gi~GLlN~p~~~~~~~~~~W~~ 202 (354) ......+.++..+.+|.+=|+.+ ..+.++..--....++++.+.+|+.+++|+.. ....|+++....+......++ T Consensus 206 ~i~~~~~k~~~~~~iS~ell~ds-~~~~~l~~~i~~~l~~ai~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~-- 282 (435) T protein:vir:14 206 DLKLTAKKMAALVPIANDLIKYA-GVNPNVDQIVVGDLTAAIGAREDKAFIRDDGTANTPKGLRFWALPSNVITASDA-- 282 (435) T ss_pred EEEeeeEEEEEeehhhHHHHHhh-ccCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccceeecccccceeccccc-- Confidence 88888899888888875444333 12334777778889999999999999999865 357899987765544333443 Q ss_pred cCHHHHHHHHHHHHHHHHHHhCCcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeeeeee Q lcl|NC_021342. 203 MNGQELFNMLNAPIFSVINLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQLD 282 (354) Q Consensus 203 ~T~~ei~~di~~~~~~l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~ 282 (354) .|.+.+..++.+++..+..... ...+..++|+|..|..|.... +..|.-++. . ..+. .+...|... 
T Consensus 283 ~~~~~~~~~~~~l~~~~~~~~~-~~~~~~~v~n~~~~~~L~~lk--d~~G~~l~~---~------~~~g--~l~G~Pv~~ 348 (435) T protein:vir:14 283 STLQKIETDLGKVILALENADA-NLTQPGWIMAPRTFRFLEGLR--DGNGNKVYP---E------LANG--MLKGYPVGK 348 (435) T ss_pred cchhhHHHHHHHHHHHhhhccc-cccCCEEEEcHHHHHHHHHhh--ccCCceecc---C------CCCC--eeecceeEe Confidence 3466778899999888875422 234567999999999986533 444432221 1 0111 223333333 Q ss_pred eccccccccccCcccEEEEEEcCcceEEEeeCchhhhcccc---------------ccCceeEEeeeeeeeeEEEECCce Q lcl|NC_021342. 283 AAELAANGVSNSNKPRYMVYDKSDRNLAMANPIPFRMLAPQ---------------MASLGITVPAEYKISGTEFRYPLC 347 (354) Q Consensus 283 ~~~~~~~g~g~~g~d~~v~y~~~~~~~~~~vp~~~~~~~~~---------------~~~l~~~~~~~~~~gGv~i~~P~a 347 (354) ...... ..+.+++...++|-+=.+.+ +..-.+++..-.. .++ ...+.++.|++ ..+++|.| T Consensus 349 ~~~~p~-~~~~~~~~~~i~~gd~s~~~-i~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~-~~~~r~~~r~d-~~~~~~~a 424 (435) T protein:vir:14 349 TTQVPI-NLGETGKESEIYFTDFGDVF-IGEEETLEIDYSKEATYKDADGHMVSAFQRD-QTLIRVIAKND-FGPRHVES 424 (435) T ss_pred eccccc-cccCCCccceEEEeecccEE-EEEecccEEEEeccccccccccchhhhhhcC-hhheeeeeeeC-ceeecccc Confidence 322211 11222222223332212222 3322333221110 112 14556778875 58999999 Q ss_pred eEeeecC Q lcl|NC_021342. 348 AAYVDMA 354 (354) Q Consensus 348 i~y~D~~ 354 (354) ++++.=+ T Consensus 425 ~~~l~~~ 431 (435) T protein:vir:14 425 IAVLAGV 431 (435) T ss_pred eEEEecC Confidence 9999888 No 32 >protein:vir:97053 Length: 390 # NCBI annotation: putative head protein # Family: family:all:585 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453565;genbank:gi:84662600;genbank:GeneID:5142468 Probab=98.74 E-value=7.8e-09 Score=65.04 Aligned_cols=314 Identities=13% Similarity=0.069 Sum_probs=158.0 Q ss_pred CcccchhHH--HHh----h-----hhhhhccccc--------------ccccchhhhhhhhhhhccCCceeccchhhHHH Q lcl|NC_021342. 
1 MAIKTIDAQ--TIQ----G-----NQWLVHKGYV--------------SRNGDQWVINNTALDAIGNPNIMLDADGGIAF 55 (354) Q Consensus 1 ~~~~~~~~~--~~~----~-----~~~~~~~~~~--------------~~~~~~~~~~~~am~a~~~~~~~~da~~~~~f 55 (354) -.+..++.+ .++ . ......+... ............-..+......+..+.+++.. T Consensus 45 ~e~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~l 124 (390) T protein:vir:97 45 ATVGNLSAEVQAARQRVAELEGNGAGGDVQHVSVGDMFVASEQFQASTGRWNDRSARATMNIKAALNTASTDAAGSAGAL 124 (390) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcccccccccccchhhhhhhHHHHHHHHHhhhhhhhhhhHHHHHHHhhhcccccccccc Confidence 001111100 000 0 0000000000 00000000000000000011111112233333 Q ss_pred HHHHHHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeecc-ccceeEecCCCcccceeeeccceeEEEEEEEEe Q lcl|NC_021342. 56 YISQLAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYDG-VTMGKFIGANGQDLPRVAQSAQMHTVPLGYAGN 134 (354) Q Consensus 56 l~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~-~G~a~~~~~~~~dip~v~~~~~~~~~pv~~~~~ 134 (354) +. +.+.+.+++.....-..+.++++.. .+.+ .+.+..... .+.+.+++.+ ..+|..+...+........++. T Consensus 125 ip---~~~~~~ii~~~~~~~~i~~~~~~~~-~~~~--~~~~~~~~~~~~~a~~v~Eg-~~~~~~~~~~~~i~~~~~k~~~ 197 (390) T protein:vir:97 125 TT---PNRLPGFITPPDARLTVRDLIGSGR-TDSA--LIEYVQETGFVNNAAIVAEG-ALKPESSLKFAKKTDTTHVIAH 197 (390) T ss_pred cc---hhhhHHHHHHHhhhhhhHhhcceee-ccCC--ceEEEEEecCCcceeeecCC-ccccccccceeEEEEeeeeEEE Confidence 33 2334567777777777777766542 2222 344444433 4677888754 4578888888888888999988 Q ss_pred eEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehhh-CceeeeecCCcccccccccccccCHHHHHHHHH Q lcl|NC_021342. 135 ECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDASR-GMYGLFNNPNVTLSSATKDYKTMNGQELFNMLN 213 (354) Q Consensus 135 ~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~-gi~GLlN~p~~~~~~~~~~W~~~T~~ei~~di~ 213 (354) -..++.+ +-... .++..--....++++++.+|+.+|+|+... ...|++|.++....... .+....+++|. 
T Consensus 198 ~~~is~e-ll~ds---~~l~~~i~~~la~a~~~~~d~a~l~G~g~~~~p~Gi~~~~~~~~~~~~-----~~~~~~~d~~~ 268 (390) T protein:vir:97 198 TMKATRQ-ILSDA---PQLASYMNNRLIRGLKVKEDAEILRGTGANDGLLGLIPQATTYAAPTT-----IAGATRVDQLR 268 (390) T ss_pred eehhhHH-HHHhH---HHHHHHHHHHHHHHHHHHHHHHHhhcCCCCccccceeecccccccccc-----ccccchHHHHH Confidence 8888774 43322 257777788899999999999999997644 47899998875443221 22344467888 Q ss_pred HHHHHHHHHhCCcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeeeeeeecccccccccc Q lcl|NC_021342. 214 APIFSVINLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQLDAAELAANGVSN 293 (354) Q Consensus 214 ~~~~~l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~~g~g~ 293 (354) +++..+.. ....+..++|+|..|..|.+-. +..|.-++. . . ..+.+-.+...|.+.+..... + T Consensus 269 ~~~~~~~~---~~~~~~~~v~n~~~~~~L~~lk--d~~G~~l~~----~-~---~~~~~~~l~G~pV~~~~~~~~-~--- 331 (390) T protein:vir:97 269 LAMLQASL---AEYPASGIVINPIDWAAIELAK--DANNQYLIG----N-A---RGTLTPTLWGLPVVATQAMAP-G--- 331 (390) T ss_pred HHHHhhcc---ccCCCCEEEEcHHHHHHHHHhh--cCCCceeec----C-c---cCCCCceecceeeEEcCCCCC-C--- Confidence 88877754 2345678999999999997533 444432211 0 0 011112233333333332211 1 Q ss_pred CcccEEEEEEcCcceEEEeeCchhhhccc----c-ccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 294 SNKPRYMVYDKSDRNLAMANPIPFRMLAP----Q-MASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 294 ~g~d~~v~y~~~~~~~~~~vp~~~~~~~~----~-~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) ..++.+.+ +.+.+...+.++.... . 
.+++ ..+.++.++ |..+++|.|++++++| T Consensus 332 ----~~~~gd~~-~~~~~~~~~~~~i~~~~~~~~f~~~~-~~~r~~~r~-d~~v~~~~a~v~~~~a 390 (390) T protein:vir:97 332 ----EFLVGAFD-LAAQIFDQWDARVEIGYVNDDFQRNM-VTVLAEERL-ALVVYRPEALITGSFA 390 (390) T ss_pred ----cEEEEecc-ceEEEEEecceEEEEeecccccccCc-EEEEEEEee-ccEEeccccEEEEEeC Confidence 11111211 1122222222222111 0 1222 334556666 5789999999999999 No 33 >protein:vir:104085 Length: 320 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655596;genbank:gi:109392467;genbank:GeneID:4156953 Probab=98.71 E-value=4.5e-09 Score=66.37 Aligned_cols=294 Identities=7% Similarity=-0.058 Sum_probs=153.3 Q ss_pred ccccchhhhhhhhhhhccCCceeccchhhHHHHHHHHHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeecccc Q lcl|NC_021342. 24 SRNGDQWVINNTALDAIGNPNIMLDADGGIAFYISQLAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYDGVT 103 (354) Q Consensus 24 ~~~~~~~~~~~~am~a~~~~~~~~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~G 103 (354) +.-+..+..+..+|-- +.+.++++ ++. ..+-+++++........++++++.. .+ ..+..+.+....+ T Consensus 1 ~~~~~~~~~~~~~~~~------t~~~~~~~-~ip---~~~~~~ii~~~~~~s~l~~~~~~~~-~~--~~~~~~p~~~~~~ 67 (320) T protein:vir:10 1 MAAGTAFQVDHAQIAQ------TGDTMFKG-YLE---PEQAKDYFAEAEKTSIVQQFAQKVP-MG--TTGQKIPHWIGDV 67 (320) T ss_pred CCCCccCCHHHHHhhc------cccccccc-ccc---HHHHHHHHHHHHhccchhhhcceee-cc--CCceEEEEEeCCc Confidence 2222223333322221 12222232 344 3345677777777777777776543 22 2234555666677 Q ss_pred ceeEecCCCcccceeeeccceeEEEEEEEEeeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehhh--- Q lcl|NC_021342. 104 MGKFIGANGQDLPRVAQSAQMHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDASR--- 180 (354) Q Consensus 104 ~a~~~~~~~~dip~v~~~~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~--- 180 (354) .+.|++.. 
..+|..+...+....+.+.++..+.+|.+=|+.+ ..++...-....++++++.+|+.+|+|+..- T Consensus 68 ~a~~v~E~-~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds---~~~l~~~i~~~l~~a~a~~~d~a~l~G~g~~~~~ 143 (320) T protein:vir:10 68 SAQWIGEG-DMKPITKGNMTSQNIAPHKIATIFVASAETVRAN---PANYLGTMRTKVATAFAMAFDSAALNGTDSPFPT 143 (320) T ss_pred ceEEecCC-ccccccccceeEEEEeeEEEEEeehhhHHHHhcC---hHHHHHHHHHHHHHHHHHHHHHHhhcccCCCCCc Confidence 78888764 5589888888899999999999999987655543 3578888888999999999999999998743 Q ss_pred CceeeeecCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHH Q lcl|NC_021342. 181 GMYGLFNNPNVTLSSATKDYKTMNGQELFNMLNAPIFSVINLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFM 260 (354) Q Consensus 181 gi~GLlN~p~~~~~~~~~~W~~~T~~ei~~di~~~~~~l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~ 260 (354) ++.|.++..++... ....++..+. .-.++.+++..+.. ....+..++++|+.|..|.+.. +..+..++.-.. T Consensus 144 ~~~~~~~~~~~~~~-~~~~~~~~~~--~~~~~~~~~~~~~~---~~~~~~~~v~n~~~~~~L~~lk--d~~G~~l~~~~~ 215 (320) T protein:vir:10 144 YLAQTTKSVSLADP-GGATASDLTA--YDAVAVNGLSLLVN---AKKKWTHTLLDDIVEPILNGAK--DKNGRPLFIEST 215 (320) T ss_pred ccccccccccceec-cccccccccc--HHHHHHHHHhhhhc---ccCCCcEEEEcHHHHHHHHHhh--ccCCceeecccc Confidence 33333332222111 1111111111 12334445444432 3456789999999999996533 333322211000 Q ss_pred hhCcccccccccceeeeeeeeeeccccccccccCcccEEEEEEcCcceEEEeeCchhhhccc------------------ Q lcl|NC_021342. 261 EANSYTLLTGNELDIQIRFQLDAAELAANGVSNSNKPRYMVYDKSDRNLAMANPIPFRMLAP------------------ 322 (354) Q Consensus 261 ~n~~~~~~~g~~l~I~~~~~L~~~~~~~~g~g~~g~d~~v~y~~~~~~~~~~vp~~~~~~~~------------------ 322 (354) .........+ ..+...|....... ..++.. ++|- |...+-+.....++.... 
T Consensus 216 ~~~~~~~~~~--~~i~g~pv~~~~~~------~~~~~~-~~~g-d~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~ 285 (320) T protein:vir:10 216 YTDENSPFRA--GRIVSRPTILSDHV------ADGTTV-GYMG-DFRNVIWGQVGGLSFDVTDQATLNLGTPTEPNFVSL 285 (320) T ss_pred ccCccccccC--ceeeeeeeEecCCC------CCCceE-EEEe-ecceEEEEEecCeEEEEeecceeeeccccccccchh Confidence 0000000111 12333333332211 112211 1121 111121222222211100 Q ss_pred cccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 323 QMASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 323 ~~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) -.++ ...+.++.++ ++.+.+|.|++++.-+ T Consensus 286 f~~~-~~~~r~~~~~-d~~v~~~~a~~~l~~~ 315 (320) T protein:vir:10 286 WQHN-LVAVRVEAEY-AFHNNDKDAFVKLTNV 315 (320) T ss_pred hhcC-cEEEEEEEee-ccEEecccceEEEEec Confidence 0112 2345566776 5888999999998855 No 34 >protein:vir:81070 Length: 390 # NCBI annotation: p09 # Family: family:all:585 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285679;genbank:gi:148727187;genbank:GeneID:5247115 Probab=98.70 E-value=1.2e-08 Score=64.04 Aligned_cols=314 Identities=12% Similarity=0.041 Sum_probs=162.4 Q ss_pred CcccchhHH--HHh-----hhhhh----hcccccccc--cchhh------------hhhhhhhhccCCceeccchhhHHH Q lcl|NC_021342. 1 MAIKTIDAQ--TIQ-----GNQWL----VHKGYVSRN--GDQWV------------INNTALDAIGNPNIMLDADGGIAF 55 (354) Q Consensus 1 ~~~~~~~~~--~~~-----~~~~~----~~~~~~~~~--~~~~~------------~~~~am~a~~~~~~~~da~~~~~f 55 (354) -.+..++.+ .++ .+... ......... ..+.. ....-+.+......+....+++.+ T Consensus 45 ~e~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ 124 (390) T protein:vir:81 45 ATVGNLSAEVQAARQRVAELEGNGAGGDVQHVSVGDMFVASEQFQASAGRWNDRSARATMNIKAALNTASTDAAGSAGAL 124 (390) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcccccccccccchhhhhhhHHHHHHHHHHhhhhhhhhhHHHHHHHhhccccccCCcce Confidence 011111110 000 00000 000000000 00000 000011111111111222334445 Q ss_pred HHHHHHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeec-cccceeEecCCCcccceeeeccceeEEEEEEEEe Q lcl|NC_021342. 
56 YISQLAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYD-GVTMGKFIGANGQDLPRVAQSAQMHTVPLGYAGN 134 (354) Q Consensus 56 l~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~-~~G~a~~~~~~~~dip~v~~~~~~~~~pv~~~~~ 134 (354) +.. .+.+.+++........+.++.+.. .+.+ .+.+.... ..+.+.+++.+ ..+|..+...+.....+..++. T Consensus 125 ~~~---~~~~~ii~~~~~~~~l~~~~~~~~-~~~~--~~~~~~~~~~~~~a~~v~Eg-~~~~~~~~~~~~i~~~~~k~~~ 197 (390) T protein:vir:81 125 TTP---NRLPGFITPPDARLTVRDLIGSGR-TDSA--LIEYVQETGFVNNAAIVAEG-ALKPESSLKFAKKTDTTHVIAH 197 (390) T ss_pred ech---hhhHHHHHHHhhhhhhhhhcceee-ccCC--ceEEEEEecCCcceeeecCC-cccccccceeeEEEEeeeEEEE Confidence 543 233567777777777777776542 2222 33444443 34677888765 4588888888888999999999 Q ss_pred eEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehhh-CceeeeecCCcccccccccccccCHHHHHHHHH Q lcl|NC_021342. 135 ECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDASR-GMYGLFNNPNVTLSSATKDYKTMNGQELFNMLN 213 (354) Q Consensus 135 ~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~-gi~GLlN~p~~~~~~~~~~W~~~T~~ei~~di~ 213 (354) .+.+|.+ +.... ..+..--....++++++.+|+.+++|+... ...|+++..+....+... +....+++|. T Consensus 198 ~~~is~e-ll~d~---~~~~~~i~~~l~~~~~~~~d~a~l~G~g~~~~~~Gi~~~~~~~~~~~~~-----~~~~~~~~~~ 268 (390) T protein:vir:81 198 TMKATRQ-ILSDA---PQLASYMNNRLIRGLKVKEDAEILRGTGANDGLLGLIPQATTYAAPTTI-----AGATRVDQLR 268 (390) T ss_pred eehhhHH-HHHhH---HHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCcccceeeccccccccccc-----ccchhHHHHH Confidence 8888875 43322 257788888899999999999999998653 489999988765433221 1223367888 Q ss_pred HHHHHHHHHhCCcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeeeeeeecccccccccc Q lcl|NC_021342. 214 APIFSVINLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQLDAAELAANGVSN 293 (354) Q Consensus 214 ~~~~~l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~~g~g~ 293 (354) +++.++... ...+..++|+|+.|..|.+.. +..|.-++. . + ..+.+-.+...|...+..... 
+ T Consensus 269 ~~~~~~~~~---~~~~~~~v~~~~~~~~l~~lk--d~~G~~l~~----~-~---~~~~~~~l~G~pv~~~~~~p~-~--- 331 (390) T protein:vir:81 269 LAMLQASLA---EYNPSGIVINPIDWAAIELAK--DANNQYLIG----N-A---RGTLTPTLWGLPVVATQAMAP-G--- 331 (390) T ss_pred HHHHhhccc---cCCCCEEEEcHHHHHHHHHhh--cCCCceeec----C-c---ccccCceecceeeEEcCCCCC-C--- Confidence 888877642 345678999999999887532 444432221 1 1 111122333444443332211 1 Q ss_pred CcccEEEEEEcCcceEEEeeCchhhhccc-c----ccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 294 SNKPRYMVYDKSDRNLAMANPIPFRMLAP-Q----MASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 294 ~g~d~~v~y~~~~~~~~~~vp~~~~~~~~-~----~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) .+++.+.+ +.+.+..-..+++... + .++ ...+.++.|++ ..++.|.|++++.+| T Consensus 332 ----~~~~gd~~-~~~~~~~~~~~~v~~~~~~~~~~~~-~v~~r~~~r~d-~~v~~~~a~v~~t~a 390 (390) T protein:vir:81 332 ----EFLVGAFD-LAAQIFDQWDARVEIGYVGEDFQRN-MITVLAEERLA-LVVYRPEALISGSFA 390 (390) T ss_pred ----cEEEEehh-ceEEEEEecceEEEEecccchhhcC-cEEEEEEEeec-cEEecccceEEEEeC Confidence 11222211 1122222122222111 1 122 23456778875 689999999999999 No 35 >protein:vir:7771 Length: 330 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817605;genbank:gi:29566035;genbank:GeneID:1259229 Probab=98.68 E-value=9.3e-09 Score=64.62 Aligned_cols=294 Identities=10% Similarity=-0.031 Sum_probs=163.8 Q ss_pred hhhhhccCCceeccchhhHHHHHHHHHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeeccccceeEecCCCcc Q lcl|NC_021342. 35 TALDAIGNPNIMLDADGGIAFYISQLAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYDGVTMGKFIGANGQD 114 (354) Q Consensus 35 ~am~a~~~~~~~~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~d 114 (354) .|.+-......+.. ++++.++..+ +-.++++...+....++++++.. .+. ..+.+.+....+.+.+++.. .. 
T Consensus 1 m~~~~~~a~~~~~t-~~~g~~i~~~---~~~~ii~~~~~~s~l~~~~~~~~-~~~--~~~~~p~~~~~~~a~~v~Eg-~~ 72 (330) T protein:vir:77 1 MAGSTVPSTQVALT-GDFSAFLTPE---QSQDYFAEIEKTSIVQRIARKVP-MGP--TGISIPHWTGAVSASWTGEA-ER 72 (330) T ss_pred Ccccccchhhcccc-CCCcceechh---HHHHHHHHHHhccchhhhcceee-ccC--CceEEEEEcCCcceeEecCC-Cc Confidence 11111000111111 2234445432 22456777777777778776543 222 23456666667788888754 56 Q ss_pred cceeeeccceeEEEEEEEEeeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehh-hCceeeeecCCccc Q lcl|NC_021342. 115 LPRVAQSAQMHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDAS-RGMYGLFNNPNVTL 193 (354) Q Consensus 115 ip~v~~~~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~-~gi~GLlN~p~~~~ 193 (354) +|..+...+......+.++.-..++.+=|+. ...++...-....++++++.+|+.+|+|+.. .+..|+++...... T Consensus 73 ~~~~~~~f~~i~~~~~k~~~~~~is~ell~d---s~~~~~~~i~~~l~~ai~~~~~~~~l~G~g~~~~~~g~~~~~~~~~ 149 (330) T protein:vir:77 73 KPITKGSFGKQELEPVKITTIFAESAEVVRL---NPLNYLNTMRTKIAEAIALKFDAAAIHGIDKPSAFKGYLAETTKVV 149 (330) T ss_pred cccccceeeEEEEeEEEEEEeehhhHHHHhc---chHHHHHHHHHHHHHHHHHHHHHHhhcccCCCCccccccccccccc Confidence 8888888888888889998888887754443 3457888889999999999999999999764 46679988764322 Q ss_pred ccc---cccccccCHHHHHHHHHHHHHHHHHHhCCcccccEEEeCHHHHHHHhhcccCCCCCchHHHH-HHhhCcccccc Q lcl|NC_021342. 194 SSA---TKDYKTMNGQELFNMLNAPIFSVINLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQH-FMEANSYTLLT 269 (354) Q Consensus 194 ~~~---~~~W~~~T~~ei~~di~~~~~~l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~-l~~n~~~~~~~ 269 (354) ... ..+. +.+....++||.+++..+... ...+..++|+|..|..|.+-. +..+.-++.- +....+. . 
T Consensus 150 ~~~~~~~~~~-~~~~~~~~~~l~~~~~~~~~~---~~~~~~~vmn~~~~~~l~~lk--d~~G~~l~~~~~~~~~~~---~ 220 (330) T protein:vir:77 150 SLADTNLTTA-SGPQGNAYLAVNNALSLLVNS---GKKWTGTLLDNVTEPILNTAV--DGNGRPLFVESTYTEQVG---A 220 (330) T ss_pred eeeccccccc-ccccchhHHHHHHHHHhhhhc---CCCccEEEEcHHHHHHHHHHh--ccCCceeecCcccccccc---c Confidence 111 1111 233556788999998888653 234568999999999886532 3333222110 0000000 0 Q ss_pred cccceeeeeeeeeeccccccccccCcccEEEEEEcCcceEEEeeCchhhhc--cc--------------------cccCc Q lcl|NC_021342. 270 GNELDIQIRFQLDAAELAANGVSNSNKPRYMVYDKSDRNLAMANPIPFRML--AP--------------------QMASL 327 (354) Q Consensus 270 g~~l~I~~~~~L~~~~~~~~g~g~~g~d~~v~y~~~~~~~~~~vp~~~~~~--~~--------------------~~~~l 327 (354) ..+.++...|......... + .++++..++.-+.+. +.+.....++.. .- -.++ T Consensus 221 ~~~~~l~G~PV~~~~~~p~-~-~~~~~~~~~~gd~s~--~~i~~~~~~~i~~~~e~~~~~~~~~~~~~~~~~~~~f~~~- 295 (330) T protein:vir:77 221 IREGRILGRPTYVADNVVN-G-TVGNRVVGVMGDFSQ--VIWGQIGGLSFDVTDQATLDFGEEQGGVWVPKLISLWQHN- 295 (330) T ss_pred cCCceecceeeEEeccccC-C-CCCCccEEEEEecce--EEEEEecCcEEEEeecceeeecccccccccccccchhhcC- Confidence 1112333334333333221 1 112233333223222 112211221111 00 1122 Q ss_pred eeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 328 GITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 328 ~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) ...+.++.|++ +.+++|.|++++..+ T Consensus 296 ~~~~r~~~r~d-~~v~~~~a~~~i~~~ 321 (330) T protein:vir:77 296 MVAVRCEAEFA-FMVNDKDAFVKLTDQ 321 (330) T ss_pred cEEEEEEEEec-cEEecccceEEEEec Confidence 25677888886 666889999999988 No 36 >protein:vir:81227 Length: 413 # NCBI annotation: gp6, major capsid protein # Family: family:all:585 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456736;genbank:gi:157168379;hssp:P49861;interpro:IPR006444;uniprot:Q9MBJ9;genbank:GeneID:5580350 Probab=98.67 E-value=2.9e-08 Score=61.94 Aligned_cols=320 Identities=10% Similarity=0.003 Sum_probs=155.2 Q ss_pred Ccccchh-------HHHHhhhhhhhcccccccccchhh-hhhhhhhhccCCceeccchhhHHHHHHHHHHHHHHHHHhhh Q lcl|NC_021342. 
1 MAIKTID-------AQTIQGNQWLVHKGYVSRNGDQWV-INNTALDAIGNPNIMLDADGGIAFYISQLAGIEATVYETPY 72 (354) Q Consensus 1 ~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~-~~~~am~a~~~~~~~~da~~~~~fl~~~L~~Id~~v~e~~~ 72 (354) ...+++. ........+..+..... .+.. ....++.... .....+.+++ .+.. +.+.+.+++... T Consensus 72 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~-~~~~~~~~~~--~~vp--~~~~~~ii~~~~ 143 (413) T protein:vir:81 72 EGYKSIGEFFAKRAGDQIKQQAGGAQLNYSV---GEYVAPRVKAASDPA-STATLTDEFQ--GGYG--TTWNRNIIYRRR 143 (413) T ss_pred hhhhhhhhhhhhhhhhHHHHHHHHHHhhhhh---hhhhhhHHHhhhhhh-hhcccccccc--cccc--hhhHHHHHHHHh Confidence 0000000 00000000000000000 0000 0000111100 0111111222 2222 556788999888 Q ss_pred hcccchhhccccCCCCCceeEEEEEeec--cccceeEecCCCcccceeee-ccceeEEEEEEEEeeEeecHHHHHHHHHh Q lcl|NC_021342. 73 GDITYRFDVPMAANIPEYADTWMYRSYD--GVTMGKFIGANGQDLPRVAQ-SAQMHTVPLGYAGNECHYTLDEMRKSAAM 149 (354) Q Consensus 73 ~~l~~r~~v~v~~~~~~~~~~~~~~~~~--~~G~a~~~~~~~~dip~v~~-~~~~~~~pv~~~~~~~~~~~~El~~a~~~ 149 (354) +....++++++..-.+ ..-.+...... ..+.+.+++.+ ..+|..+. ..+....+.+.++..+.+|.+=|+.+ T Consensus 144 ~~~~l~~~~~~~~~~~-~~~~~~~~~~~~~~~~~a~~v~Eg-~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds--- 218 (413) T protein:vir:81 144 EKLVVADLMDNLTMTN-TTIKYLMEKANRVVEGGFKTVAEG-GKKPYMRFADFDIVTESLSKIAGLTKITDEMIEDY--- 218 (413) T ss_pred hhhhHHhhcceeeccC-CceeEEEeccccccccccceecCc-ccccccCcccceeeEeeeeeEEEeehhhHHHHHHH--- Confidence 8888888877543222 22222222211 23456777654 33565553 56778888888888888887544333 Q ss_pred CCCcchHHHHHHHHHHHHHhhheeeeeehhh-CceeeeecCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCccc Q lcl|NC_021342. 150 NMPIDAEQARLAFRGAEEHSQSVAYFGDASR-GMYGLFNNPNVTLSSATKDYKTMNGQELFNMLNAPIFSVINLSRRFHV 228 (354) Q Consensus 150 g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~-gi~GLlN~p~~~~~~~~~~W~~~T~~ei~~di~~~~~~l~~~s~g~~~ 228 (354) . .|..--....++++++.+|+.+++|+... ...||++.+++...... +.+..++++.+++..+... +... 
T Consensus 219 ~-~l~~~i~~~la~~~~~~~d~~~l~G~G~~~~~~Gi~~~~~~~~~~~~------~~~~~~~~i~~~~~~~~~~--~~~~ 289 (413) T protein:vir:81 219 D-FLVSYINARLLEELAIEEERQLLLGDGTGNNLTGLLKRDGIQTLAVS------NKDELADSIYKAMTNISLA--TPFQ 289 (413) T ss_pred H-HHHHHHHHHHHHHHHHHHHHHHhccCCCCCccccccccccccccccc------ccchhHHHHHHHHHHhhhh--ccCC Confidence 2 37777777889999999999999997543 45799999887654332 2344577888887776543 3345 Q ss_pred ccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeeeeeeeccccccccccCcccEEEEEEcCcce Q lcl|NC_021342. 229 PNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQLDAAELAANGVSNSNKPRYMVYDKSDRN 308 (354) Q Consensus 229 p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~~g~g~~g~d~~v~y~~~~~~ 308 (354) +..++|+|..|..|.+-. +..|.-++.-...........+.+-++...|...+.... .+ ..++.+.+ +. T Consensus 290 ~~~~vmn~~~~~~l~~lk--d~~G~~l~~~~~~~~~~~~~~~~~~~l~G~pv~~s~~~~-~~-------~~~~gd~~-~~ 358 (413) T protein:vir:81 290 ADALVINPLDYQELRLAK--DANGQYYGGGVFQGQYGSGGIMLDPAPWGLRTVQSQVVP-VG-------KPVVGAFR-SA 358 (413) T ss_pred CcEEEEcHHHHHHHHHhh--ccCCceeccccccccccccccccCceecceeeEEcCCCC-cc-------cEEEEecc-cE Confidence 678999999999986533 333322211000000000000111122233333322211 11 11111111 12 Q ss_pred EEEeeCchhhh--cccc---ccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 309 LAMANPIPFRM--LAPQ---MASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 309 ~~~~vp~~~~~--~~~~---~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) +.+.....++. ..-. 
...-...+.++.+++ +.+++|.++++++++ T Consensus 359 ~~~~~~~~~~v~~~~~~~~~~~~~~~~~r~~~r~d-~~~~~~~a~~~l~~~ 408 (413) T protein:vir:81 359 ASVLRKGGVRIDSTNTNVDDFENNLITVRAEERVG-LMVTFPEAIVQLDVA 408 (413) T ss_pred EEEEEecceEEEEeccccchhhcCcEEEEEEEeec-cEEecccceEEEEec Confidence 22222222222 1111 111134666777875 677899999999999 No 37 >protein:vir:94771 Length: 298 # NCBI annotation: major head protein # Family: family:all:966 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996706;genbank:gi:45597421;genbank:GeneID:2769044 Probab=98.67 E-value=9.7e-09 Score=64.52 Aligned_cols=280 Identities=8% Similarity=-0.038 Sum_probs=157.3 Q ss_pred eeccchhhHHHHHHHHHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeeccccceeEecCCCcccceeeeccce Q lcl|NC_021342. 45 IMLDADGGIAFYISQLAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYDGVTMGKFIGANGQDLPRVAQSAQM 124 (354) Q Consensus 45 ~~~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~v~~~~~~ 124 (354) |++++ +. ++. +.+.+++++...+.-..+++.++.. .+.+ ...+......+.+.|++.+ ..+|..+...+. T Consensus 1 ma~~g---G~-lip--~~~~~~ii~~~~~~s~i~~~~~~~~-~~~~--~~~~p~~~~~~~a~~v~Eg-~~~~~~~~~f~~ 70 (298) T protein:vir:94 1 MVLNK---GT-LFD--PELVTDLISKVAGKSSIARLSAQKP-IPFN--GEKVFTFTMDSEIDVVAES-GKKTHGGVTLAP 70 (298) T ss_pred Ceecc---cc-ccC--hhHHHHHHHHHHhhchhhhhcceee-ccCC--ceEEEEEecCcceEEeeCC-ccccccccceeE Confidence 44432 22 222 3455677787777777777776542 2323 3455566667788898765 568888888888 Q ss_pred eEEEEEEEEeeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehh-h----CceeeeecCCccccccccc Q lcl|NC_021342. 125 HTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDAS-R----GMYGLFNNPNVTLSSATKD 199 (354) Q Consensus 125 ~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~-~----gi~GLlN~p~~~~~~~~~~ 199 (354) .....+.++.-..+|.+=|+...-...++...-+...++++++.+|+.+++|... . ...|..+..+...... . 
T Consensus 71 v~l~~~k~~~~~~iS~ell~~~~~~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~--~ 148 (298) T protein:vir:94 71 QTMVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKV--E 148 (298) T ss_pred EEEeeeEEEEeeehhHHHhccCCccHHHHHHHHHHHHHHHHHHHHHHHhhcccccCCCccccccccccccccccccc--c Confidence 8888888888888776544333233446777888899999999999999999432 1 1222222122111000 0 Q ss_pred ccccCHHHHHHHHHHHHHHHHHHhCCcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeee Q lcl|NC_021342. 200 YKTMNGQELFNMLNAPIFSVINLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRF 279 (354) Q Consensus 200 W~~~T~~ei~~di~~~~~~l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~ 279 (354) .......+++||.+++.++... ...+..++|+|..+..|.+.. +..|.-++. .. ...+.+-++...| T Consensus 149 -~~~~~~~~~~~i~~~~~~~~~~---~~~~~~~vmn~~~~~~l~~lk--d~~G~~l~~----~~---~~~~~~~tl~G~P 215 (298) T protein:vir:94 149 -APRGIADPNGAIENAVELLTGV---DADVTGIAINPSFRSALAKQK--DLQGNALFP----EL---KWGATPDTINGLP 215 (298) T ss_pred -cccccccHHHHHHHHHHhhhhc---CCCccEEEEcHHHHHHHHHhh--ccCCCeeec----Cc---ccCCCCceeccee Confidence 1122345678999999988653 345678999999999996532 333322211 10 1123333454455 Q ss_pred eeeeccccccccccCcccEEEEEEcCcceEEEeeCch--hhhccc-c---------ccCceeEEeeeeeeeeEEEECCce Q lcl|NC_021342. 280 QLDAAELAANGVSNSNKPRYMVYDKSDRNLAMANPIP--FRMLAP-Q---------MASLGITVPAEYKISGTEFRYPLC 347 (354) Q Consensus 280 ~L~~~~~~~~g~g~~g~d~~v~y~~~~~~~~~~vp~~--~~~~~~-~---------~~~l~~~~~~~~~~gGv~i~~P~a 347 (354) .+....+.. ...++++.+++-+.+ +.+.+.+-.. +...+- + .++ ..-+.++.|+ |+.+++|.| T Consensus 216 V~~~~~v~~--~~~~~~~~~~~Gdfs-~~~~~~~~~~~~~~~~~~~~~d~~~~~~f~~~-~v~~r~~~r~-~~~~~~~~a 290 (298) T protein:vir:94 216 VDVNKTVSD--MSLTQRDRAIIGDFA-NGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYN-QVYIRAELFL-GWGILDATK 290 (298) T ss_pred eEEeccccc--ccCCCccEEEEeecc-ceEEEEEecCceEEEeecCCCcCcchhhhhcC-cEEEEEEEEe-ccEeecccc Confidence 544443321 122334444433322 1121212122 212111 0 111 1235567776 578889999 Q ss_pred eEeeecC Q lcl|NC_021342. 
348 AAYVDMA 354 (354) Q Consensus 348 i~y~D~~ 354 (354) ++++--+ T Consensus 291 ~~~l~~~ 297 (298) T protein:vir:94 291 FARVTEA 297 (298) T ss_pred eEEEEec Confidence 9999888 No 38 >protein:vir:80376 Length: 435 # NCBI annotation: gp6, major capsid head protein # Family: family:all:21 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111085;genbank:gi:134288639;genbank:GeneID:4960624 Probab=98.66 E-value=2e-08 Score=62.82 Aligned_cols=328 Identities=11% Similarity=0.013 Sum_probs=162.6 Q ss_pred Ccccchh---------HHHHhhhhh-hhcccc--c--ccccch-----hhhhhhhhhhcc-------------------C Q lcl|NC_021342. 1 MAIKTID---------AQTIQGNQW-LVHKGY--V--SRNGDQ-----WVINNTALDAIG-------------------N 42 (354) Q Consensus 1 ~~~~~~~---------~~~~~~~~~-~~~~~~--~--~~~~~~-----~~~~~~am~a~~-------------------~ 42 (354) -.|+.+. +..++.+.+ ..+++. + ....++ +..-..++.... . T Consensus 50 ~~i~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 129 (435) T protein:vir:80 50 AQIERAEAAERMAAAAAVPVDPNPAAVTASAAAPVYAQPKAPEVKGAKMARMVRALAAARGDAQLASKLAIERGFGEEVA 129 (435) T ss_pred HHHHHHHHHHHHHHhhcccccchhhhhccccccccccccchhhhhHHHHHHHHHHHHhccchhHHHHHHHHhhhhhhhhh Confidence 0111000 000000000 000000 0 000000 000000000000 0 Q ss_pred CceeccchhhHHHHHHHHHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeeccccceeEecCCCcccceeeecc Q lcl|NC_021342. 43 PNIMLDADGGIAFYISQLAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYDGVTMGKFIGANGQDLPRVAQSA 122 (354) Q Consensus 43 ~~~~~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~v~~~~ 122 (354) ..++......+.+++. +.+...+++...+....+.+-...-+...+ .+.+......+.+.|++.. ..+|..+... 
T Consensus 130 ~~~~~~~~~~gg~lvP--~~~~~~ii~~l~~~~~i~~~~~~~v~~~~~--~~~~p~~~~~~~a~~v~E~-~~~~~~~~~f 204 (435) T protein:vir:80 130 MSLNTLSPGAGGVLVP--ENLSSEVIELLRPKSVVRKLGARTLPLSNG--NITIPRLKGGAIVGYIGAD-TDIPTTQQQF 204 (435) T ss_pred hhhcccCCCCCccccc--hhHHHHHHHHHhhhchhhhccceeeecCCC--ceEEEEEeCCcceeeeccC-ccccccccce Confidence 0011111122334443 445667777666555555542111122222 3455566666777787665 4578888888 Q ss_pred ceeEEEEEEEEeeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehh-hCceeeeecCCccccccccccc Q lcl|NC_021342. 123 QMHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDAS-RGMYGLFNNPNVTLSSATKDYK 201 (354) Q Consensus 123 ~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~-~gi~GLlN~p~~~~~~~~~~W~ 201 (354) +......+.++..+.+|..=|+.+ ..+-++..--....+.++++.+++.+|+|+.. ....|++++...........+ T Consensus 205 ~~i~~~~~k~~~~~~is~ell~ds-~~~~~l~~~i~~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~- 282 (435) T protein:vir:80 205 DDLKLTAKKMAALVPIANDLIKYA-GVNPNVDQIVVGDLTAAIGAREDKAFIRDDGTANTPKGLRFWALPGNVITASDG- 282 (435) T ss_pred eeEEEeeEEEEEeehhhHHHHHhh-cccHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCcccceeecccccceeecccc- Confidence 888889999998888876544433 22445777788889999999999999999764 357899998765443333333 Q ss_pred ccCHHHHHHHHHHHHHHHHHHhCCcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeeeee Q lcl|NC_021342. 202 TMNGQELFNMLNAPIFSVINLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQL 281 (354) Q Consensus 202 ~~T~~ei~~di~~~~~~l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L 281 (354) .+.+.+..|+.+++..+..... ...+..++|+|..+..|.... +..|.-++.-+ .+. ++...|.. 
T Consensus 283 -~~~~~~~~d~~~~~~~~~~~~~-~~~~~~~vmn~~~~~~L~~lk--d~~G~~l~~~~---------~~~--~l~G~pv~ 347 (435) T protein:vir:80 283 -STLQKIETDLGKAILALENADA-NLTQPGWIMAPRTFRFLEGLR--DGNGNKVYPEL---------ANG--MLKGYPVG 347 (435) T ss_pred -cchhhHHHHHHHHHHHhhcccc-ccccCEEEEcHHHHHHHHhhh--ccCCceeccCC---------CCC--eEeeeeeE Confidence 3466777899999888865422 234567899999999986533 44443332100 111 23333333 Q ss_pred eeccccccccccCcccEEEEEEcCcceEEEeeCchhhhccc-c--------------ccCceeEEeeeeeeeeEEEECCc Q lcl|NC_021342. 282 DAAELAANGVSNSNKPRYMVYDKSDRNLAMANPIPFRMLAP-Q--------------MASLGITVPAEYKISGTEFRYPL 346 (354) Q Consensus 282 ~~~~~~~~g~g~~g~d~~v~y~~~~~~~~~~vp~~~~~~~~-~--------------~~~l~~~~~~~~~~gGv~i~~P~ 346 (354) ....... ..+.++....++|-+ ...+-+..-..+++... + .+| ...+.+..++ ++.+++|. T Consensus 348 ~~~~~p~-~~~~~~~~~~i~~gd-~s~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~f~~n-~~~~r~~~r~-d~~~~~~~ 423 (435) T protein:vir:80 348 KTTQVPI-NLGEAGKESEIYFTD-FGDVFIGEEETLEIDYSKEATYKDADGHMVSAFQRD-QTLIRVIAKN-DFGPRHVE 423 (435) T ss_pred Eeccccc-cccCCCCcceEEEEE-cccEEEEeecceEEEEeccccccccccchhhhhhcC-cceeeeeeee-CcEeeccc Confidence 3222211 112222222233321 11111221122211110 0 122 2455677777 58899999 Q ss_pred eeEeeecC Q lcl|NC_021342. 347 CAAYVDMA 354 (354) Q Consensus 347 ai~y~D~~ 354 (354) |++++.=+ T Consensus 424 a~~~l~~~ 431 (435) T protein:vir:80 424 SIAVLSGV 431 (435) T ss_pred ceEEEecc Confidence 99998777 No 39 >protein:vir:4339 Length: 395 # NCBI annotation: major head protein # Family: family:all:585 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061502;genbank:gi:9635591;genbank:GeneID:1262860 Probab=98.65 E-value=2.7e-08 Score=62.09 Aligned_cols=314 Identities=10% Similarity=0.088 Sum_probs=162.5 Q ss_pred CcccchhHHHHh-------hhhhhhcccccccccchhhhh--------------hhhhhhccCCceeccchhhHHHHHHH Q lcl|NC_021342. 
1 MAIKTIDAQTIQ-------GNQWLVHKGYVSRNGDQWVIN--------------NTALDAIGNPNIMLDADGGIAFYISQ 59 (354) Q Consensus 1 ~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~--------------~~am~a~~~~~~~~da~~~~~fl~~~ 59 (354) ..++.++.+.-+ ...--..+............. ...+.. ..++....+++..+. T Consensus 52 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~g~~vp-- 126 (395) T protein:vir:43 52 TAQGELQARLSAAEQAMLANEKRDGGEEAPKTAGQMVAESLKEQGVTSSLRGSHRVSMPR---SAITSIDGSGGALVA-- 126 (395) T ss_pred HHHHHHHHHHHHHHHHHHhhhccccccchhhhHHHHHHHHHHHHHHHHHhhhhhhhhhhh---hhhcccCCCCccccc-- Confidence 122222221111 000000011110000000000 000000 111111122333333 Q ss_pred HHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeec-cccceeEecCCCcccceeeeccceeEEEEEEEEeeEee Q lcl|NC_021342. 60 LAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYD-GVTMGKFIGANGQDLPRVAQSAQMHTVPLGYAGNECHY 138 (354) Q Consensus 60 L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~-~~G~a~~~~~~~~dip~v~~~~~~~~~pv~~~~~~~~~ 138 (354) ..+.+.|++........+.++++..-.+ .++.+.... ..+.+.+++.++ .+|..+...+......+.++..+.+ T Consensus 127 -~~~~~~ii~~~~~~~~l~~l~~~~~~~~---~~~~~~~~~~~~~~a~~v~E~~-~~~~~~~~~~~i~~~~~k~~~~~~i 201 (395) T protein:vir:43 127 -PDRRPGVVAAPQRRLTIRDLVAPGTTES---NSVEYVRETGFVNNAAPVSEGT-QKPYSDLTFELENAPVRTIAHLFKA 201 (395) T ss_pred -hhhHHHHHHHHHhhhhHHhhccceecCC---CceEEEEEecCCCceeeecCCc-cccccccceeEEEEeeeeEEEeehh Confidence 3345678888888877888777553322 234444443 346778887654 5788888888899999999999999 Q ss_pred cHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehhh-CceeeeecCCcccccccccccccCHHHHHHHHHHHHH Q lcl|NC_021342. 139 TLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDASR-GMYGLFNNPNVTLSSATKDYKTMNGQELFNMLNAPIF 217 (354) Q Consensus 139 ~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~-gi~GLlN~p~~~~~~~~~~W~~~T~~ei~~di~~~~~ 217 (354) +.+ +-... . .+..--....+++++..+|+.+++|+... ...|+++..++....... ..+.+..+++|.+++. 
T Consensus 202 s~e-ll~d~--~-~l~~~v~~~la~a~~~~~d~~~l~G~g~~~~~~Gi~~~~~~~~~~~~~---~~~~~~~~~~i~~~~~ 274 (395) T protein:vir:43 202 SRQ-ILDDA--S-ALQSYIDARARYGLMLVEECQLLYGNGTGANLHGIIPQAQAYAPPSGV---VVTAEQRIDRIRLAIL 274 (395) T ss_pred hHH-HHHhH--H-HHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccccccccccccccccc---ccccchhHHHHHHHHH Confidence 865 43322 2 47777788899999999999999997543 347999988765443322 1334567889999988 Q ss_pred HHHHHhCCcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeeeeeeeccccccccccCc-c Q lcl|NC_021342. 218 SVINLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQLDAAELAANGVSNSN-K 296 (354) Q Consensus 218 ~l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~~g~g~~g-~ 296 (354) ++... ...+..++|+|..|..|.... +..|.-++. .+. .+.+-.+...|.+.........+--+. + T Consensus 275 ~~~~~---~~~~~~~vmn~~~~~~l~~lk--d~~G~~i~~-----~~~---~~~~~~l~G~pVv~~~~~~~~~~~~gd~~ 341 (395) T protein:vir:43 275 QAQLA---EFPASGIVLNPIDWALIELNK--DAENRYIIG-----SPQ---NGTTPTLWRLPVVETQAITQDEFLTGAFS 341 (395) T ss_pred hhccc---cCCCcEEEEcHHHHHHHHHhh--ccCCceecc-----ccc---cCCCceecceeeEEcCCCCCCcEEEEecc Confidence 87542 234678999999999986543 434432221 111 111222333333333322111000011 1 Q ss_pred cEEEEEEcCcceEEEeeCchhhhcccccc-Cc---eeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 297 PRYMVYDKSDRNLAMANPIPFRMLAPQMA-SL---GITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 297 d~~v~y~~~~~~~~~~vp~~~~~~~~~~~-~l---~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) ....++++.. +.+. +.. 
+.+ .+ .+.+.++.++ ++.+++|.|+++++++ T Consensus 342 ~~~~~~~~~~--~~i~------~~~-~~~~~f~~~~~~~r~~~r~-d~~v~~~~a~~~~~~t 393 (395) T protein:vir:43 342 LGAQIFDRMD--IEVL------VST-ENDKDFENNMVTIRAEERL-AFAVYRPEAFVTGSLT 393 (395) T ss_pred ceEEEEEecc--eEEE------Eec-cccchhhcCcEEEEEEEee-ccEEecccceEEEEec Confidence 1122222211 1111 111 111 11 2344456666 5778999999999999 No 40 >protein:vir:78223 Length: 333 # NCBI annotation: Putative major head protein # Family: family:all:966 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491666;genbank:gi:157786490;genbank:GeneID:5625701 Probab=98.62 E-value=2.2e-08 Score=62.57 Aligned_cols=302 Identities=11% Similarity=-0.028 Sum_probs=163.2 Q ss_pred hhhhhhh-hhhhccCCceeccchhhHHHHHHHHHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeeccccceeE Q lcl|NC_021342. 29 QWVINNT-ALDAIGNPNIMLDADGGIAFYISQLAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYDGVTMGKF 107 (354) Q Consensus 29 ~~~~~~~-am~a~~~~~~~~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~G~a~~ 107 (354) =..++.+ ++-+-.++.-.+.+. ... +.. +.+-.++++...+.-..+++..+. +.+.+ ...+.+......+.| T Consensus 1 ~a~l~el~~~~~~~~~~g~~~~~-~~~-liP--~~~~~~ii~~l~~~s~l~~~~~~~-~~~~~--~~~~p~~~~~~~a~~ 73 (333) T protein:vir:78 1 MATLNELLPNSAGSNHQGRLAHV-PSD-LLP--KEIVGPIFDKAQESSLVLRMGEQI-PISYG--ETIIPTTVKRPEVGQ 73 (333) T ss_pred CchhHHhhhhcccccccCceecC-Ccc-ccc--hhHHHHHHHHHHhhchhhhhccee-eccCC--ceEEEEEeCCceeEe Confidence 0112222 121111111111111 111 222 455677888877777778777653 33333 234444554555555 Q ss_pred ecCC-------CcccceeeeccceeEEEEEEEEeeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehh- Q lcl|NC_021342. 108 IGAN-------GQDLPRVAQSAQMHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDAS- 179 (354) Q Consensus 108 ~~~~-------~~dip~v~~~~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~- 179 (354) ++.. +..+|..+...+......+.++.-..+|.+=++.+ ..++..--....++++++.+|+-+|+|+.. 
T Consensus 74 v~eg~~~~~~e~~~~~~~~~~f~~i~l~~~kl~~~~~is~ell~~s---~~~~~~~i~~~la~ai~~~~d~~~l~G~g~~ 150 (333) T protein:vir:78 74 VGVGTSNEQREGGLKPLSGTAWDTRSVSPIKLATIVTVSEEFARMN---PSGLYTKLQGDLAYAIGRGIDLAVFHGKSPL 150 (333) T ss_pred ecCcccccccccccccccccceeEEEEeeEEEEEeehhhHHHHhcC---HHHHHHHHHHHHHHHHHHHHHHHHhcccCCC Confidence 4432 24467777777788888889888888877444333 346778888889999999999999999864 Q ss_pred --hCceeeeecCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCcccccEEEeCHHHHHHHhhcc-cCCCCCchHH Q lcl|NC_021342. 180 --RGMYGLFNNPNVTLSSATKDYKTMNGQELFNMLNAPIFSVINLSRRFHVPNTALMFPDLWNQANNQL-MTGYTDRTVM 256 (354) Q Consensus 180 --~gi~GLlN~p~~~~~~~~~~W~~~T~~ei~~di~~~~~~l~~~s~g~~~p~~L~l~p~~~~~L~~~~-~~~~~~~Tvl 256 (354) .+..|+++..++...+.. .....+.+..+++|.+++..+.. ++...+..++|+|..|..|.+-. ..+..+.-++ T Consensus 151 ~~~~~~g~~~~~~~~~~~~~-~~~~~~~~~~~~~i~~~~~~~~~--~~~~~~~~~vmn~~~~~~L~~~~~~~d~~G~~i~ 227 (333) T protein:vir:78 151 TGSALQGIDTDNVIANTTNV-DYLQETGDPLLDRLLDGYDLVSA--NTDVEFNGWAVDPRFRAHLLRAQAYRDANGNVDP 227 (333) T ss_pred CCcccccccccccccccccc-cccccccchhHHHHHHHHHhhcc--ccccCceEEEEcchHHHHHHHHhhhcCCCCceee Confidence 567788887765433221 11222344457888888887754 34556778999999998875422 2233333222 Q ss_pred HHHHhhCcccccccccceeeeeeeeeecccccc-ccccCcccEEEEEEcCcceEEEeeCchhhh--ccc----c------ Q lcl|NC_021342. 257 QHFMEANSYTLLTGNELDIQIRFQLDAAELAAN-GVSNSNKPRYMVYDKSDRNLAMANPIPFRM--LAP----Q------ 323 (354) Q Consensus 257 ~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~~-g~g~~g~d~~v~y~~~~~~~~~~vp~~~~~--~~~----~------ 323 (354) ... ...+.+-++...|...+..+... +.+..++..+++-+.+. +.+.....++. .+- . T Consensus 228 ~~~-------~~~~~~~~l~G~Pv~~~~~i~~~~~~~~~~~~~~~~gD~~~--~~~g~~~~~~i~~~~~~~~~~~~~~~~ 298 (333) T protein:vir:78 228 SRI-------NLAAQTGDVLGLPAQFGRAVGGDLGAAVDSKTRIIGGDFSQ--LKFGFADEIRIKMSDTATLTDSGSATV 298 (333) T ss_pred cCc-------cccCCCceeeceeeEEccccCCCccccCCCccEEEEEeccc--EEEEEeeccEEEEecccccccccccee Confidence 111 11233344555555544433211 12222333333333322 22222222222 110 0 Q ss_pred ---ccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 
324 ---MASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 324 ---~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) .++ ...+.++.+++ +.+++|.|++++=-+ T Consensus 299 ~~~~~~-~v~~r~~~r~d-~~v~~~~a~~~l~~~ 330 (333) T protein:vir:78 299 SMWQTN-QIAILIEVTFG-WLLGDKQAFVKFVDD 330 (333) T ss_pred ehhhcC-cEEEEEEEEEc-cEEecccceEEEecc Confidence 011 13356677774 777999999998777 No 41 >protein:vir:98339 Length: 415 # NCBI annotation: putative capsid protein # Family: family:all:21 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918931;genbank:gi:119443693;genbank:GeneID:4594501 Probab=98.62 E-value=2.3e-08 Score=62.41 Aligned_cols=321 Identities=6% Similarity=-0.043 Sum_probs=151.6 Q ss_pred CcccchhHH-----------------------------------HHhhhhhhhcccccccccchhhhh--h-hhhhhccC Q lcl|NC_021342. 1 MAIKTIDAQ-----------------------------------TIQGNQWLVHKGYVSRNGDQWVIN--N-TALDAIGN 42 (354) Q Consensus 1 ~~~~~~~~~-----------------------------------~~~~~~~~~~~~~~~~~~~~~~~~--~-~am~a~~~ 42 (354) -.++.++.| .........+.+.....+.+...- . ...... T Consensus 42 ~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-- 119 (415) T protein:vir:98 42 QEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDI-- 119 (415) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccchhhhHHHHHHHHHHhhhhhhhhhHHHHHHHHHHHHhhhhhh-- Confidence 000000000 000000010111111111111000 0 000000 Q ss_pred CceeccchhhHHHHHHHHHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeeccccceeEecCCCcccceee-ec Q lcl|NC_021342. 43 PNIMLDADGGIAFYISQLAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYDGVTMGKFIGANGQDLPRVA-QS 121 (354) Q Consensus 43 ~~~~~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~v~-~~ 121 (354) ....+..+ ++.+++. +.+.+.+++........+.++.+.. ++.+...+.+......+.+.+++.+++ +|-.+ .. 
T Consensus 120 ~~~~~~~~-~gg~~iP--~~~~~~ii~~~~~~~~l~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~v~E~~~-~~~~~~~~ 194 (415) T protein:vir:98 120 QGGSLKTD-SGFVVIP--EEIVTDILKLKEVEFNLDKYVTVKR-VTNGSGKYPVVRQSEVAALEKVEELEE-NPELAVKP 194 (415) T ss_pred hhcccccc-ccccccc--hHHHHHHHHHHHhhhhhhhheeeee-ccCCceeEEEEeecCCccceeeccccc-cCcccccc Confidence 11111111 2334444 4667778887777777777766532 222333444444445556677765543 55443 45 Q ss_pred cceeEEEEEEEEeeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehhh-CceeeeecCCcccccccccc Q lcl|NC_021342. 122 AQMHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDASR-GMYGLFNNPNVTLSSATKDY 200 (354) Q Consensus 122 ~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~-gi~GLlN~p~~~~~~~~~~W 200 (354) .+.....++.++.-+.+|..=++. ...++..--....++++++.+|+.+++|+... +..++.+.......... T Consensus 195 ~~~v~~~~~k~~~~~~iS~ell~d---s~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~~~~~--- 268 (415) T protein:vir:98 195 FFQLAYDINTHRGYFRISREAIED---AKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEV--- 268 (415) T ss_pred eeeEEeeeeeeEeeehhhHHHHhh---chHHHHHHHHHHHHHHHHHHHHHHHhhccccCcccccccccccccccccc--- Confidence 677788888888888887653332 34567778888899999999999999997543 22233222221111111 Q ss_pred cccCHHHHHHHHHHHHHHHHHHhCCcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeeee Q lcl|NC_021342. 201 KTMNGQELFNMLNAPIFSVINLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQ 280 (354) Q Consensus 201 ~~~T~~ei~~di~~~~~~l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~ 280 (354) +...-+++|.+++.++... ...+..++|+|+.|..|.+- -+..|. ||-..++ ..|.+-.|...|. 
T Consensus 269 ---~~~~~~~~i~~~~~~~~~~---~~~~~~~v~n~~~~~~l~~l--kd~~G~----~l~~~~~---~~~~~~~l~G~pV 333 (415) T protein:vir:98 269 ---KKAKSLDDIKDAINLNVKP---NYEHNVAIVSQTMFAKLDKM--KDKLGN----YLIQPDV---KEKTQQRLLGAKI 333 (415) T ss_pred ---ccccchhHHHHHHHhhhhh---ccCCCEEEEcHHHHHHHHHh--hccCCc----eeeccCc---CCCCCceecceee Confidence 1112267788888777542 24567899999999999653 233332 2211111 1222223333332 Q ss_pred eeeccccccccccCcccEEEEEEcCcceEEEeeCchhhhccccccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 281 LDAAELAANGVSNSNKPRYMVYDKSDRNLAMANPIPFRMLAPQMASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 281 L~~~~~~~~g~g~~g~d~~v~y~~~~~~~~~~vp~~~~~~~~~~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) ......-. ++.+...++.-+-+ +.+.+..-..+++.............++.|+ +..+.+|.|++++++. T Consensus 334 ~~~~~~~~---~~~~~~~~~~Gd~~-~~~~~~~~~~~~v~~~~~~~~~~~~~~~~r~-d~~v~~~~a~~~~~~~ 402 (415) T protein:vir:98 334 EILPDEVL---GQKGNNTLIIGNLK-DAIVLFDRSQYQASWTDYMHFGECLMIAVRQ-DCRILDYKSAIVIEYD 402 (415) T ss_pred EEeccccc---CCCCccEEEEEehh-ccEEEEeecceEEEEeccccCceEEEEEEEe-ccEEeccccEEEEEEe Confidence 22222111 12222222222212 2122222223333222222223344566777 4777889999999999 No 42 >protein:vir:79987 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430002;genbank:gi:156604057;genbank:GeneID:5525447 Probab=98.62 E-value=2.3e-08 Score=62.41 Aligned_cols=321 Identities=6% Similarity=-0.043 Sum_probs=151.6 Q ss_pred CcccchhHH-----------------------------------HHhhhhhhhcccccccccchhhhh--h-hhhhhccC Q lcl|NC_021342. 1 MAIKTIDAQ-----------------------------------TIQGNQWLVHKGYVSRNGDQWVIN--N-TALDAIGN 42 (354) Q Consensus 1 ~~~~~~~~~-----------------------------------~~~~~~~~~~~~~~~~~~~~~~~~--~-~am~a~~~ 42 (354) -.++.++.| .........+.+.....+.+...- . ...... 
T Consensus 42 ~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-- 119 (415) T protein:vir:79 42 QEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDI-- 119 (415) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccchhhhHHHHHHHHHHhhhhhhhhhHHHHHHHHHHHHhhhhhh-- Confidence 000000000 000000010111111111111000 0 000000 Q ss_pred CceeccchhhHHHHHHHHHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeeccccceeEecCCCcccceee-ec Q lcl|NC_021342. 43 PNIMLDADGGIAFYISQLAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYDGVTMGKFIGANGQDLPRVA-QS 121 (354) Q Consensus 43 ~~~~~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~v~-~~ 121 (354) ....+..+ ++.+++. +.+.+.+++........+.++.+.. ++.+...+.+......+.+.+++.+++ +|-.+ .. T Consensus 120 ~~~~~~~~-~gg~~iP--~~~~~~ii~~~~~~~~l~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~v~E~~~-~~~~~~~~ 194 (415) T protein:vir:79 120 QGGSLKTD-SGFVVIP--EEIVTDILKLKEVEFNLDKYVTVKR-VTNGSGKYPVVRQSEVAALEKVEELEE-NPELAVKP 194 (415) T ss_pred hhcccccc-ccccccc--hHHHHHHHHHHHhhhhhhhheeeee-ccCCceeEEEEeecCCccceeeccccc-cCcccccc Confidence 11111111 2334444 4667778887777777777766532 222333444444445556677765543 55443 45 Q ss_pred cceeEEEEEEEEeeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehhh-CceeeeecCCcccccccccc Q lcl|NC_021342. 122 AQMHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDASR-GMYGLFNNPNVTLSSATKDY 200 (354) Q Consensus 122 ~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~-gi~GLlN~p~~~~~~~~~~W 200 (354) .+.....++.++.-+.+|..=++. ...++..--....++++++.+|+.+++|+... +..++.+.......... 
T Consensus 195 ~~~v~~~~~k~~~~~~iS~ell~d---s~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~~~~~--- 268 (415) T protein:vir:79 195 FFQLAYDINTHRGYFRISREAIED---AKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEV--- 268 (415) T ss_pred eeeEEeeeeeeEeeehhhHHHHhh---chHHHHHHHHHHHHHHHHHHHHHHHhhccccCcccccccccccccccccc--- Confidence 677788888888888887653332 34567778888899999999999999997543 22233222221111111 Q ss_pred cccCHHHHHHHHHHHHHHHHHHhCCcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeeee Q lcl|NC_021342. 201 KTMNGQELFNMLNAPIFSVINLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQ 280 (354) Q Consensus 201 ~~~T~~ei~~di~~~~~~l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~ 280 (354) +...-+++|.+++.++... ...+..++|+|+.|..|.+- -+..|. ||-..++ ..|.+-.|...|. T Consensus 269 ---~~~~~~~~i~~~~~~~~~~---~~~~~~~v~n~~~~~~l~~l--kd~~G~----~l~~~~~---~~~~~~~l~G~pV 333 (415) T protein:vir:79 269 ---KKAKSLDDIKDAINLNVKP---NYEHNVAIVSQTMFAKLDKM--KDKLGN----YLIQPDV---KEKTQQRLLGAKI 333 (415) T ss_pred ---ccccchhHHHHHHHhhhhh---ccCCCEEEEcHHHHHHHHHh--hccCCc----eeeccCc---CCCCCceecceee Confidence 1112267788888777542 24567899999999999653 233332 2211111 1222223333332 Q ss_pred eeeccccccccccCcccEEEEEEcCcceEEEeeCchhhhccccccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 281 LDAAELAANGVSNSNKPRYMVYDKSDRNLAMANPIPFRMLAPQMASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 281 L~~~~~~~~g~g~~g~d~~v~y~~~~~~~~~~vp~~~~~~~~~~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) ......-. ++.+...++.-+-+ +.+.+..-..+++.............++.|+ +..+.+|.|++++++. 
T Consensus 334 ~~~~~~~~---~~~~~~~~~~Gd~~-~~~~~~~~~~~~v~~~~~~~~~~~~~~~~r~-d~~v~~~~a~~~~~~~ 402 (415) T protein:vir:79 334 EILPDEVL---GQKGNNTLIIGNLK-DAIVLFDRSQYQASWTDYMHFGECLMIAVRQ-DCRILDYKSAIVIEYD 402 (415) T ss_pred EEeccccc---CCCCccEEEEEehh-ccEEEEeecceEEEEeccccCceEEEEEEEe-ccEEeccccEEEEEEe Confidence 22222111 12222222222212 2122222223333222222223344566777 4777889999999999 No 43 >protein:vir:81100 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429874;genbank:gi:156603927;genbank:GeneID:5525320 Probab=98.62 E-value=2.3e-08 Score=62.41 Aligned_cols=321 Identities=6% Similarity=-0.043 Sum_probs=151.6 Q ss_pred CcccchhHH-----------------------------------HHhhhhhhhcccccccccchhhhh--h-hhhhhccC Q lcl|NC_021342. 1 MAIKTIDAQ-----------------------------------TIQGNQWLVHKGYVSRNGDQWVIN--N-TALDAIGN 42 (354) Q Consensus 1 ~~~~~~~~~-----------------------------------~~~~~~~~~~~~~~~~~~~~~~~~--~-~am~a~~~ 42 (354) -.++.++.| .........+.+.....+.+...- . ...... T Consensus 42 ~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-- 119 (415) T protein:vir:81 42 QEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDI-- 119 (415) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccchhhhHHHHHHHHHHhhhhhhhhhHHHHHHHHHHHHhhhhhh-- Confidence 000000000 000000010111111111111000 0 000000 Q ss_pred CceeccchhhHHHHHHHHHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeeccccceeEecCCCcccceee-ec Q lcl|NC_021342. 43 PNIMLDADGGIAFYISQLAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYDGVTMGKFIGANGQDLPRVA-QS 121 (354) Q Consensus 43 ~~~~~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~v~-~~ 121 (354) ....+..+ ++.+++. +.+.+.+++........+.++.+.. ++.+...+.+......+.+.+++.+++ +|-.+ .. 
T Consensus 120 ~~~~~~~~-~gg~~iP--~~~~~~ii~~~~~~~~l~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~v~E~~~-~~~~~~~~ 194 (415) T protein:vir:81 120 QGGSLKTD-SGFVVIP--EEIVTDILKLKEVEFNLDKYVTVKR-VTNGSGKYPVVRQSEVAALEKVEELEE-NPELAVKP 194 (415) T ss_pred hhcccccc-ccccccc--hHHHHHHHHHHHhhhhhhhheeeee-ccCCceeEEEEeecCCccceeeccccc-cCcccccc Confidence 11111111 2334444 4667778887777777777766532 222333444444445556677765543 55443 45 Q ss_pred cceeEEEEEEEEeeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehhh-CceeeeecCCcccccccccc Q lcl|NC_021342. 122 AQMHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDASR-GMYGLFNNPNVTLSSATKDY 200 (354) Q Consensus 122 ~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~-gi~GLlN~p~~~~~~~~~~W 200 (354) .+.....++.++.-+.+|..=++. ...++..--....++++++.+|+.+++|+... +..++.+.......... T Consensus 195 ~~~v~~~~~k~~~~~~iS~ell~d---s~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~~~~~--- 268 (415) T protein:vir:81 195 FFQLAYDINTHRGYFRISREAIED---AKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEV--- 268 (415) T ss_pred eeeEEeeeeeeEeeehhhHHHHhh---chHHHHHHHHHHHHHHHHHHHHHHHhhccccCcccccccccccccccccc--- Confidence 677788888888888887653332 34567778888899999999999999997543 22233222221111111 Q ss_pred cccCHHHHHHHHHHHHHHHHHHhCCcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeeee Q lcl|NC_021342. 201 KTMNGQELFNMLNAPIFSVINLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQ 280 (354) Q Consensus 201 ~~~T~~ei~~di~~~~~~l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~ 280 (354) +...-+++|.+++.++... ...+..++|+|+.|..|.+- -+..|. ||-..++ ..|.+-.|...|. 
T Consensus 269 ---~~~~~~~~i~~~~~~~~~~---~~~~~~~v~n~~~~~~l~~l--kd~~G~----~l~~~~~---~~~~~~~l~G~pV 333 (415) T protein:vir:81 269 ---KKAKSLDDIKDAINLNVKP---NYEHNVAIVSQTMFAKLDKM--KDKLGN----YLIQPDV---KEKTQQRLLGAKI 333 (415) T ss_pred ---ccccchhHHHHHHHhhhhh---ccCCCEEEEcHHHHHHHHHh--hccCCc----eeeccCc---CCCCCceecceee Confidence 1112267788888777542 24567899999999999653 233332 2211111 1222223333332 Q ss_pred eeeccccccccccCcccEEEEEEcCcceEEEeeCchhhhccccccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 281 LDAAELAANGVSNSNKPRYMVYDKSDRNLAMANPIPFRMLAPQMASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 281 L~~~~~~~~g~g~~g~d~~v~y~~~~~~~~~~vp~~~~~~~~~~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) ......-. ++.+...++.-+-+ +.+.+..-..+++.............++.|+ +..+.+|.|++++++. T Consensus 334 ~~~~~~~~---~~~~~~~~~~Gd~~-~~~~~~~~~~~~v~~~~~~~~~~~~~~~~r~-d~~v~~~~a~~~~~~~ 402 (415) T protein:vir:81 334 EILPDEVL---GQKGNNTLIIGNLK-DAIVLFDRSQYQASWTDYMHFGECLMIAVRQ-DCRILDYKSAIVIEYD 402 (415) T ss_pred EEeccccc---CCCCccEEEEEehh-ccEEEEeecceEEEEeccccCceEEEEEEEe-ccEEeccccEEEEEEe Confidence 22222111 12222222222212 2122222223333222222223344566777 4777889999999999 No 44 >protein:vir:78523 Length: 338 # NCBI annotation: Putative head structural protein # Family: family:all:507 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491585;genbank:gi:157786408;genbank:GeneID:5625675 Probab=98.61 E-value=3.9e-08 Score=61.19 Aligned_cols=304 Identities=11% Similarity=-0.062 Sum_probs=161.1 Q ss_pred hhhhhhhhhhhccCCceeccchhhHHHHHHHHHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeec------cc Q lcl|NC_021342. 29 QWVINNTALDAIGNPNIMLDADGGIAFYISQLAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYD------GV 102 (354) Q Consensus 29 ~~~~~~~am~a~~~~~~~~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~------~~ 102 (354) -..++...-.+.+.+.-..-.......+. +.+-.++++...+.-..+++.++. +.+.+...+.....+ .. 
T Consensus 1 ~~~~~e~~~~~~~~~~~~~~~~~~~~liP---~~~~~~ii~~~~~~s~l~~l~~~~-~~~~~~~~ip~~~~~~~a~~v~~ 76 (338) T protein:vir:78 1 MATLNELAPNTAGSNHQGRLAHVPSDLLP---KEIVGPIFDKAQESSLVLRLGENI-PISYGETIIPTTVKRPEVGQVGV 76 (338) T ss_pred CcchHHhhhhhcccccccceecccccccc---hHHHHHHHHHHHhhchhhhhccee-eccCCceEEEEEecCccceeecc Confidence 11122221111111110000011122333 455677888888888888887764 344333333332222 12 Q ss_pred cceeEecCCCcccceeeeccceeEEEEEEEEeeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehh--- Q lcl|NC_021342. 103 TMGKFIGANGQDLPRVAQSAQMHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDAS--- 179 (354) Q Consensus 103 G~a~~~~~~~~dip~v~~~~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~--- 179 (354) +.+.+++. +..+|..+...+......+.++.-..++.+=++. ...++..--....++++++.+|+.+++|+.. T Consensus 77 ~~~~~~~E-g~~~~~~~~~f~~v~l~~~k~~~~~~is~ell~d---s~~~~~~~i~~~la~a~~~~~d~~~l~G~g~~~~ 152 (338) T protein:vir:78 77 GTSNEQRE-GGTKPLSGTAWDTRSVAPIKLATIVTVSEEFARM---NPSGLYTKLQADLAYAIGRGIDLAVFHGKSPLTG 152 (338) T ss_pred cccccccc-cccccccccceeEEEEEEEEEEEeehhhHHHHhc---CHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcc Confidence 33444444 3446777777777788888888888887643333 2356777788889999999999999999864 Q ss_pred hCceeeeecCCcccccc-cccccccCHHHHHHHHHHHHHHHHHHhCCcccccEEEeCHHHHHHHhhcc-cCCCCCchHHH Q lcl|NC_021342. 180 RGMYGLFNNPNVTLSSA-TKDYKTMNGQELFNMLNAPIFSVINLSRRFHVPNTALMFPDLWNQANNQL-MTGYTDRTVMQ 257 (354) Q Consensus 180 ~gi~GLlN~p~~~~~~~-~~~W~~~T~~ei~~di~~~~~~l~~~s~g~~~p~~L~l~p~~~~~L~~~~-~~~~~~~Tvl~ 257 (354) .+..|++++......+. ...+ ......++++.+++..+... ....+..++|+|..+..|..-+ ..+..+.-++. 
T Consensus 153 ~~~~gi~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~m~~~~~~~L~~~~~l~d~~g~~l~~ 228 (338) T protein:vir:78 153 SALQGIDTNNVIVNTTNVDYLQ--TGTTPLLDRFLDGYDLVSAN--TDVDFNGWAADPRYRARLLRSQAYRDANGNVDPT 228 (338) T ss_pred cccccccccccccccccccccc--ccchhhHHHHHHHHHHhhhh--ccccceEEEEchHHHHHHHHHhhhccCCCceeec Confidence 45677777666533222 2222 22456688888888877543 3345678999999998885422 22333322211 Q ss_pred HHHhhCcccccccccceeeeeeeeeecccccc-ccccCcccEEEEEEcCcceEEEeeCchhhhccc-------------c Q lcl|NC_021342. 258 HFMEANSYTLLTGNELDIQIRFQLDAAELAAN-GVSNSNKPRYMVYDKSDRNLAMANPIPFRMLAP-------------Q 323 (354) Q Consensus 258 ~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~~-g~g~~g~d~~v~y~~~~~~~~~~vp~~~~~~~~-------------~ 323 (354) -....+.+.+|...|...+..+... +...+.+..+++-+.+ .+.+.....++.... + T Consensus 229 -------~~~~~~~~~~l~G~PV~~~~~ip~~~~~~~~~~~~~~~gdfs--~~~~~~~~~~~i~~~~~~~~~~~~~~~~~ 299 (338) T protein:vir:78 229 -------RINLAASAGDLLGLPVQFGKAVGGDLGAATDSKVRVVGGDFS--QLKYGFADEIRVKMSDTATLTDNTSPTPQ 299 (338) T ss_pred -------ccccCCCCceeeeeeEEEccccCccccccCCcccEEEEEecc--eEEEEeecccEEEEeeccccccccccccc Confidence 0112344445555555544433221 1112222222222222 122222222221110 1 Q ss_pred ccCc----eeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 
324 MASL----GITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 324 ~~~l----~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) ..++ ...+.++.|+ |..+.+|.|++++--+ T Consensus 300 ~~~~~~~~~~~~r~~~r~-d~~v~~~~a~~~l~~~ 333 (338) T protein:vir:78 300 TVSMWQTNQIAILIEVTF-GWLLGDKQAFVKFVDD 333 (338) T ss_pred chhhhhcCcEEEEEEEEe-ccEeecccceEEEecc Confidence 1111 1345667777 5788999999998777 No 45 >protein:vir:4700 Length: 415 # NCBI annotation: phi PVL ORF 7 homologue # Family: family:all:21 # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061632;genbank:gi:9635719;genbank:GeneID:1262976 Probab=98.60 E-value=1.9e-08 Score=62.91 Aligned_cols=322 Identities=6% Similarity=-0.020 Sum_probs=153.2 Q ss_pred CcccchhHH--HHh--------------------------hhhhhhcccccccccch---hhhhhhhhhhccCCceeccc Q lcl|NC_021342. 1 MAIKTIDAQ--TIQ--------------------------GNQWLVHKGYVSRNGDQ---WVINNTALDAIGNPNIMLDA 49 (354) Q Consensus 1 ~~~~~~~~~--~~~--------------------------~~~~~~~~~~~~~~~~~---~~~~~~am~a~~~~~~~~da 49 (354) ..|+.++.+ .++ ......+.+.......+ |.........+ ....+.. T Consensus 49 ~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~t 126 (415) T protein:vir:47 49 SQIQEKQEELDKLKEKDRTSENNQQSVEVNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDI--QGGSLKT 126 (415) T ss_pred HHHHHHHHHHHHHHHHHHhhhhcccccccchhhhhHHHHHHHHHHHhhhhhhhhHHHHHHHHHHHhhhhhh--hhccccc Confidence 001111000 000 00000000000000000 00000000000 0011111 Q ss_pred hhhHHHHHHHHHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeeccccceeEecCCCcccceee-eccceeEEE Q lcl|NC_021342. 50 DGGIAFYISQLAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYDGVTMGKFIGANGQDLPRVA-QSAQMHTVP 128 (354) Q Consensus 50 ~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~v~-~~~~~~~~p 128 (354) +. +..++. +.+.+.+++........+.++.+.. ...+...+.+......+.+.+++.++. +|-.+ ...+..... 
T Consensus 127 ~~-g~~~iP--~~~~~~ii~~~~~~~~l~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~v~Eg~~-~~~~~~~~~~~v~~~ 201 (415) T protein:vir:47 127 DS-GFVVIP--EEIVTDILKLKEVEFNLDKYVTVKR-VTNGSGKYPVVRQSEVAALEKVEELEE-NPELAVKPFFQLAYD 201 (415) T ss_pred cC-Cccccc--HHHHHHHHHHHHhhhhhhhhcceee-ccCCceeEEEEEecCCcceeecccccc-cccccccceeeEEee Confidence 12 223333 5667778888888888878766432 222223333334444556667765544 56443 456778888 Q ss_pred EEEEEeeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehhhCceeeeecCCcccccccccccccCHHHH Q lcl|NC_021342. 129 LGYAGNECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDASRGMYGLFNNPNVTLSSATKDYKTMNGQEL 208 (354) Q Consensus 129 v~~~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~gi~GLlN~p~~~~~~~~~~W~~~T~~ei 208 (354) .+.++..+.+|..=++. ...++..--....++++++.+|+.+++|+......+........ ...+. .+...- T Consensus 202 ~~k~~~~~~iS~ell~d---s~~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~~~~~~~~~~----~~~~~-~~~~~~ 273 (415) T protein:vir:47 202 INTHRGYFRISREAIED---AKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKE----GKKLE-VKKAKS 273 (415) T ss_pred eeeeEeeehhhHHHHhh---chHHHHHHHHHHHHHHHHHHHHHHHhhccccCCccccccccccc----cceec-cccccc Confidence 88888888887654433 34577788889999999999999999997543333322221111 11111 111122 Q ss_pred HHHHHHHHHHHHHHhCCcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeeeeeeeccccc Q lcl|NC_021342. 209 FNMLNAPIFSVINLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQLDAAELAA 288 (354) Q Consensus 209 ~~di~~~~~~l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~ 288 (354) ++||.+++.++... ...+..++|+|+.|..|.+.. +..|.-++ ..++ .++.+-.|...|......... 
T Consensus 274 ~~~i~~~~~~~~~~---~~~~~~~v~n~~~~~~L~~lk--d~~G~~i~----~~~~---~~~~~~~l~G~pV~~~~~~~~ 341 (415) T protein:vir:47 274 LDDIKDAINLNVKP---NYEHNVAIVSQTMFAKLDKMK--DKLGNYLI----QPDV---KEKTQQRLLGAKIEILPDEVL 341 (415) T ss_pred hHHHHHHHHhhhhh---ccCCCEEEEcHHHHHHHHHhh--ccCCCeee----ccCc---CCCCCccccceeeEEeccccc Confidence 67788888877653 235678999999999996532 43443221 1111 122222333333322222111 Q ss_pred cccccCcccEEEEEEcCcceEEEeeCchhhhccccccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 289 NGVSNSNKPRYMVYDKSDRNLAMANPIPFRMLAPQMASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 289 ~g~g~~g~d~~v~y~~~~~~~~~~vp~~~~~~~~~~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) ++.++..++.-+.+ +.+.+..-+.+++.............++.|+ ++.+.+|.|+++++++ T Consensus 342 ---~~~~~~~~~~gd~~-~~~~~~~~~~~~v~~~~~~~~~~~~~~~~r~-d~~v~~~~a~~~~~~~ 402 (415) T protein:vir:47 342 ---GQKGNNTLIIGNLK-DAIVLFDRSQYQASWTDYMHFGECLMIAVRQ-DCRILDYKSAIVIEYD 402 (415) T ss_pred ---cCCCccEEEEEehh-ccEEEEeecceEEEeeccccCceEEEEEEEe-ccEEeccccEEEEEee Confidence 22222222222222 2232322233333222222223344567777 5778899999999998 No 46 >protein:vir:4600 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058445;genbank:gi:9635171;genbank:GeneID:1262708 Probab=98.60 E-value=1.9e-08 Score=62.91 Aligned_cols=322 Identities=6% Similarity=-0.020 Sum_probs=153.2 Q ss_pred CcccchhHH--HHh--------------------------hhhhhhcccccccccch---hhhhhhhhhhccCCceeccc Q lcl|NC_021342. 1 MAIKTIDAQ--TIQ--------------------------GNQWLVHKGYVSRNGDQ---WVINNTALDAIGNPNIMLDA 49 (354) Q Consensus 1 ~~~~~~~~~--~~~--------------------------~~~~~~~~~~~~~~~~~---~~~~~~am~a~~~~~~~~da 49 (354) ..|+.++.+ .++ ......+.+.......+ |.........+ ....+.. 
T Consensus 49 ~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~t 126 (415) T protein:vir:46 49 SQIQEKQEELDKLKEKDRTSENNQQSVEVNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDI--QGGSLKT 126 (415) T ss_pred HHHHHHHHHHHHHHHHHHhhhhcccccccchhhhhHHHHHHHHHHHhhhhhhhhHHHHHHHHHHHhhhhhh--hhccccc Confidence 001111000 000 00000000000000000 00000000000 0011111 Q ss_pred hhhHHHHHHHHHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeeccccceeEecCCCcccceee-eccceeEEE Q lcl|NC_021342. 50 DGGIAFYISQLAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYDGVTMGKFIGANGQDLPRVA-QSAQMHTVP 128 (354) Q Consensus 50 ~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~v~-~~~~~~~~p 128 (354) +. +..++. +.+.+.+++........+.++.+.. ...+...+.+......+.+.+++.++. +|-.+ ...+..... T Consensus 127 ~~-g~~~iP--~~~~~~ii~~~~~~~~l~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~v~Eg~~-~~~~~~~~~~~v~~~ 201 (415) T protein:vir:46 127 DS-GFVVIP--EEIVTDILKLKEVEFNLDKYVTVKR-VTNGSGKYPVVRQSEVAALEKVEELEE-NPELAVKPFFQLAYD 201 (415) T ss_pred cC-Cccccc--HHHHHHHHHHHHhhhhhhhhcceee-ccCCceeEEEEEecCCcceeecccccc-cccccccceeeEEee Confidence 12 223333 5667778888888888878766432 222223333334444556667765544 56443 456778888 Q ss_pred EEEEEeeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehhhCceeeeecCCcccccccccccccCHHHH Q lcl|NC_021342. 129 LGYAGNECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDASRGMYGLFNNPNVTLSSATKDYKTMNGQEL 208 (354) Q Consensus 129 v~~~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~gi~GLlN~p~~~~~~~~~~W~~~T~~ei 208 (354) .+.++..+.+|..=++. ...++..--....++++++.+|+.+++|+......+........ ...+. 
.+...- T Consensus 202 ~~k~~~~~~iS~ell~d---s~~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~~~~~~~~~~----~~~~~-~~~~~~ 273 (415) T protein:vir:46 202 INTHRGYFRISREAIED---AKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKE----GKKLE-VKKAKS 273 (415) T ss_pred eeeeEeeehhhHHHHhh---chHHHHHHHHHHHHHHHHHHHHHHHhhccccCCccccccccccc----cceec-cccccc Confidence 88888888887654433 34577788889999999999999999997543333322221111 11111 111122 Q ss_pred HHHHHHHHHHHHHHhCCcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeeeeeeeccccc Q lcl|NC_021342. 209 FNMLNAPIFSVINLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQLDAAELAA 288 (354) Q Consensus 209 ~~di~~~~~~l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~ 288 (354) ++||.+++.++... ...+..++|+|+.|..|.+.. +..|.-++ ..++ .++.+-.|...|......... T Consensus 274 ~~~i~~~~~~~~~~---~~~~~~~v~n~~~~~~L~~lk--d~~G~~i~----~~~~---~~~~~~~l~G~pV~~~~~~~~ 341 (415) T protein:vir:46 274 LDDIKDAINLNVKP---NYEHNVAIVSQTMFAKLDKMK--DKLGNYLI----QPDV---KEKTQQRLLGAKIEILPDEVL 341 (415) T ss_pred hHHHHHHHHhhhhh---ccCCCEEEEcHHHHHHHHHhh--ccCCCeee----ccCc---CCCCCccccceeeEEeccccc Confidence 67788888877653 235678999999999996532 43443221 1111 122222333333322222111 Q ss_pred cccccCcccEEEEEEcCcceEEEeeCchhhhccccccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 
289 NGVSNSNKPRYMVYDKSDRNLAMANPIPFRMLAPQMASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 289 ~g~g~~g~d~~v~y~~~~~~~~~~vp~~~~~~~~~~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) ++.++..++.-+.+ +.+.+..-+.+++.............++.|+ ++.+.+|.|+++++++ T Consensus 342 ---~~~~~~~~~~gd~~-~~~~~~~~~~~~v~~~~~~~~~~~~~~~~r~-d~~v~~~~a~~~~~~~ 402 (415) T protein:vir:46 342 ---GQKGNNTLIIGNLK-DAIVLFDRSQYQASWTDYMHFGECLMIAVRQ-DCRILDYKSAIVIEYD 402 (415) T ss_pred ---cCCCccEEEEEehh-ccEEEEeecceEEEeeccccCceEEEEEEEe-ccEEeccccEEEEEee Confidence 22222222222222 2232322233333222222223344567777 5778899999999998 No 47 >protein:vir:96223 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239571;genbank:gi:66395304;genbank:GeneID:5132771 Probab=98.59 E-value=4.4e-08 Score=60.89 Aligned_cols=294 Identities=7% Similarity=-0.040 Sum_probs=154.2 Q ss_pred ccchhHHHHhhhhhhhcccccccccchhhhhhhhhhhccCCceeccchhhHHHHHHHHHHHHHHHHHhhhhcccchhhcc Q lcl|NC_021342. 3 IKTIDAQTIQGNQWLVHKGYVSRNGDQWVINNTALDAIGNPNIMLDADGGIAFYISQLAGIEATVYETPYGDITYRFDVP 82 (354) Q Consensus 3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~am~a~~~~~~~~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~ 82 (354) ||.-+-.... -..|......++.. ...-++.++.++..+. +.+-.++++.....-..+++++ T Consensus 1 ~~~~~~~~~~--------------~~~f~~~~~~~~~~-~a~~~~~~~~~~~lip---~~~~~~ii~~~~~~s~l~~l~~ 62 (324) T protein:vir:96 1 MEQTQKLKLN--------------LQHFASNNVKPQVF-NPDNVMMHEKKDGTLL---NDFTTPILQEVMENSKIMQLGK 62 (324) T ss_pred CCcchhhhHH--------------HHHHHHhhhhhhhc-ccccccccCCCcceec---hhHHHHHHHHHHhhchhhhhcc Confidence 2211111110 01111111222221 1111222223333343 3344666666666666677665 Q ss_pred ccCCCCCceeEEEEEeeccccceeEecCCCcccceeeeccceeEEEEEEEEeeEeecHHHHHHHHHhCCCcchHHHHHHH Q lcl|NC_021342. 83 MAANIPEYADTWMYRSYDGVTMGKFIGANGQDLPRVAQSAQMHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQARLAF 162 (354) Q Consensus 83 v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~v~~~~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~ 162 (354) +.. .+. .++.+......+.+.+++.. 
..+|..+...+........++....++.+=++.+ ..++...-....+ T Consensus 63 ~~~-~~~--~~~~~p~~~~~~~a~~v~Eg-~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds---~~~l~~~i~~~l~ 135 (324) T protein:vir:96 63 YEP-MEG--TEKKFTFWADKPGAYWVGEG-QKIETSKATWVNATMRAFKLGVILPVTKEFLNYT---YSQFFEEMKPMIA 135 (324) T ss_pred eee-ccC--CceEEEEEecCcceeeecCC-ccccccccceeEEEEEeEEEEEeehhhHHHHhcc---hHHHHHHHHHHHH Confidence 543 222 23556666667788888765 5578888888888899999998888887555543 3568888888999 Q ss_pred HHHHHHhhheeeeeehhhCc-eeeeecCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCcccccEEEeCHHHHHH Q lcl|NC_021342. 163 RGAEEHSQSVAYFGDASRGM-YGLFNNPNVTLSSATKDYKTMNGQELFNMLNAPIFSVINLSRRFHVPNTALMFPDLWNQ 241 (354) Q Consensus 163 ~~~a~~~n~~~f~G~~~~gi-~GLlN~p~~~~~~~~~~W~~~T~~ei~~di~~~~~~l~~~s~g~~~p~~L~l~p~~~~~ 241 (354) +++++.+|+.+|+|+...+. .|+++.... ...+... ..-+++|.+++.++... ...+..++++|..+.. T Consensus 136 ~aia~~~d~~~l~G~g~~~~~~~~~~~~~~-----~~~~~~~--~~~~~~i~~~~~~i~~~---~~~~~~~i~n~~~~~~ 205 (324) T protein:vir:96 136 EAFYKKFDEAGILNQGNNPFGKSIAQSIKK-----TNKVIKG--DFTQDNIIDLEALLEDD---ELEANAFISKTQNRSL 205 (324) T ss_pred HHHHHHHHHHhhhcCCCCCcCccccccccc-----cceeccc--ccchHHHHHHHHhhhhc---cCCCCEEEEcHHHHHH Confidence 99999999999999764332 344332221 1111111 11257788888877542 3456789999999999 Q ss_pred HhhcccCCCCCchHHHHHHhhCcccccccccceeeeeeeeeeccccccccccCcccEEEEEEcCcceEEEeeCchhhhcc Q lcl|NC_021342. 242 ANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQLDAAELAANGVSNSNKPRYMVYDKSDRNLAMANPIPFRMLA 321 (354) Q Consensus 242 L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~~g~g~~g~d~~v~y~~~~~~~~~~vp~~~~~~~ 321 (354) |.+.. +..+.-++ .. +.+-++...|...... ...++..++.-+ ...+-+....+++.-. 
T Consensus 206 L~~lk--d~~G~~~~----~~-------~~~~~l~G~PV~~~~~------~~~~~~~~~~gd--~s~~~~~~~~~~~i~~ 264 (324) T protein:vir:96 206 LRKIV--DPETKERI----YD-------RNSDSLDGLPVVNLKS------SNLKRGELITGD--FDKLIYGIPQLIEYKI 264 (324) T ss_pred HHHhh--CCCCCeee----cC-------CCCCcccceeeEeecC------CCCCcceEEEEe--cceEEEEEecCcEEEE Confidence 87532 33332211 11 1111222222211111 111122222222 1122222223322211 Q ss_pred c------------------cccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 322 P------------------QMASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 322 ~------------------~~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) . -.++ ...+.+..++ |+.+.+|.|++++-.| T Consensus 265 ~~~~~~~~~~~~~~~~~~~~~~n-~v~~r~~~r~-d~~v~~~~a~~~l~~a 313 (324) T protein:vir:96 265 DETAQLSTVKNEDGTPVNLFEQD-MVALRATMHV-ALHIADDKAFAKLVPA 313 (324) T ss_pred eecccccccccccccchhhhhcC-cEEEEEEEEe-ccEEecccceEEEecc Confidence 0 0112 2455667777 4668889999999999 No 48 >protein:vir:2504 Length: 305 # NCBI annotation: major capsid subunit gp9 # Family: family:all:507 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569745;genbank:gi:18496895;genbank:GeneID:932268 Probab=98.57 E-value=4.3e-08 Score=60.98 Aligned_cols=275 Identities=10% Similarity=0.024 Sum_probs=147.8 Q ss_pred hhhccCCceeccchhhHHHHHHHHHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeeccccceeEecCCCc--- Q lcl|NC_021342. 37 LDAIGNPNIMLDADGGIAFYISQLAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYDGVTMGKFIGANGQ--- 113 (354) Q Consensus 37 m~a~~~~~~~~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~--- 113 (354) |-. ++.+ .++...- +.+.+.+++...+.-..+++..+.. .+.+ +..+......+.+.|++.... 
T Consensus 1 ma~------~t~~-~gg~liP---~~~~~~Ii~~~~~~s~l~~l~~~~~-~~~~--~~~~p~~~~~~~a~wv~E~~~~~~ 67 (305) T protein:vir:25 1 MAD------ISRA-EVASLIQ---EAYSDTLLAAAKQGSTVLSAFQNVN-MGTK--TTHLPVLATLPEADWVGESATDPK 67 (305) T ss_pred CCC------ccCC-ccceecC---HHHHHHHHHHHHhhchhhhhcceee-ccCC--cEEEEEEeCCcceEEeeccccccc Confidence 211 1111 2233333 4456778888887777777776543 2222 345555556677888866543 Q ss_pred -ccceeeeccceeEEEEEEEEeeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehhh---CceeeeecC Q lcl|NC_021342. 114 -DLPRVAQSAQMHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDASR---GMYGLFNNP 189 (354) Q Consensus 114 -dip~v~~~~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~---gi~GLlN~p 189 (354) ++|..+...+......+.++....++.+=++. ...++..--.+..++++++.+|+.+|+|+..- +..+.++.. T Consensus 68 ~~~~~s~~~f~~i~~~~~k~~~~~~is~ell~d---s~~~~~~~i~~~l~~~~a~~~d~a~~~G~g~~~~~~~~~~~~~~ 144 (305) T protein:vir:25 68 GVKPTSKVTWANRTLVAEEIAVIIPVHENVIDD---ATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAA 144 (305) T ss_pred ccccccccceeeEEeeeEEEEEeehhhHHHHhc---chHHHHHHHHHHHHHHHHHHHhhhheeccCCCCCcccccccccc Confidence 36666677777888888888888887744433 34568888889999999999999999997642 222222221 Q ss_pred Ccccccccccccc-cCHHHHHHHHHHHHHHHHHHhCCcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCccccc Q lcl|NC_021342. 190 NVTLSSATKDYKT-MNGQELFNMLNAPIFSVINLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLL 268 (354) Q Consensus 190 ~~~~~~~~~~W~~-~T~~ei~~di~~~~~~l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~ 268 (354) .... .....+.. .+..++++++..+...+.. ....+..++|+|..|..|.+. .+..+.-++ ..+ .. 
T Consensus 145 ~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~v~~~~~~~~l~~l--kd~~G~~i~----~~~---~l 211 (305) T protein:vir:25 145 VTAG-QAVEVVGGVANESDIVGATNRAAKAVAS---AGWAPDTLLSSLALRYEVANI--RDANGNPVF----RDD---SF 211 (305) T ss_pred cccc-ccccccccchhhhHHHHHHHHHHHhhhh---cccccceeEecHHHHHHHHHh--hccCCceee----cCC---cc Confidence 1111 11111211 2234556666666655533 234556799999999998643 244443221 111 12 Q ss_pred ccccceeeeeeeeeeccccccccccCcccEEEEEEcCcceEEEeeCchhhhc--------cc--ccc---CceeEEeeee Q lcl|NC_021342. 269 TGNELDIQIRFQLDAAELAANGVSNSNKPRYMVYDKSDRNLAMANPIPFRML--------AP--QMA---SLGITVPAEY 335 (354) Q Consensus 269 ~g~~l~I~~~~~L~~~~~~~~g~g~~g~d~~v~y~~~~~~~~~~vp~~~~~~--------~~--~~~---~l~~~~~~~~ 335 (354) .|.|..+. .... ...++..++.-+ .+.+.+.....++.. .. +.. .=.+.+.++. T Consensus 212 ~G~Pv~~~-------~~~~----~~~~~~~~~~gd--~s~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~R~~~ 278 (305) T protein:vir:25 212 AGFRTFFN-------RNGA----WDADAAIEVIAD--SSRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKA 278 (305) T ss_pred cccceEEc-------CccC----CCCCccEEEEEe--cceEEEEEecCeEEEEeeeeeeecCCceeeeeecCcEEEEEEE Confidence 34443221 1110 011111111112 222222222222110 00 110 1124566788 Q ss_pred eeeeEEEECCceeEeeecC Q lcl|NC_021342. 336 KISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 336 ~~gGv~i~~P~ai~y~D~~ 354 (354) |+| +.+.+|.++++++.. T Consensus 279 r~~-~~v~~p~a~v~~~~~ 296 (305) T protein:vir:25 279 RFA-YVLGVSATAQGANKT 296 (305) T ss_pred eec-ceeeCcccEEEEccc Confidence 885 678999999999997 No 49 >protein:vir:8102 Length: 543 # NCBI annotation: gp6 # Family: family:all:21 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817683;genbank:gi:29566114;genbank:GeneID:1259308 Probab=98.57 E-value=4.3e-08 Score=60.95 Aligned_cols=320 Identities=11% Similarity=0.046 Sum_probs=154.7 Q ss_pred CcccchhHHHHhhhhhhhc---cc-------ccccccchhhh--hhhhhhhccCCceeccchhhHHHHHHHHHHHHHHHH Q lcl|NC_021342. 
1 MAIKTIDAQTIQGNQWLVH---KG-------YVSRNGDQWVI--NNTALDAIGNPNIMLDADGGIAFYISQLAGIEATVY 68 (354) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~---~~-------~~~~~~~~~~~--~~~am~a~~~~~~~~da~~~~~fl~~~L~~Id~~v~ 68 (354) -..+.+|-+.-..+..... +. .+......... ..-++...... ..++++ ++ +++. +.+.++++ T Consensus 197 ~~~~~~d~~e~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~l~~~e~~~~~~~~~~-~~t~~~-gg-~lip--~~~~~~ii 271 (543) T protein:vir:81 197 KIIERFDDEDSTLARQCLATSSPAYLRAWSKMARNPHAAILTEEEKRAINEVRAM-GLTKAD-GG-YLVP--FQLDPTVI 271 (543) T ss_pred HHHHHHHHHHHHHhhhhhhhhhhhhhhHHHHHHHhhHHHHhhhhhhhhhhhhhhc-cccccc-Cc-ccCc--hhhhhHHH Confidence 0011111111110000000 00 00000000000 00111111101 122222 22 2322 23444444 Q ss_pred Hhhh-hcccchhhccccCCCCCceeEEEEEeeccccceeEecCCCcccceeeeccceeEEEEEEEEeeEeecHHHHHHHH Q lcl|NC_021342. 69 ETPY-GDITYRFDVPMAANIPEYADTWMYRSYDGVTMGKFIGANGQDLPRVAQSAQMHTVPLGYAGNECHYTLDEMRKSA 147 (354) Q Consensus 69 e~~~-~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~v~~~~~~~~~pv~~~~~~~~~~~~El~~a~ 147 (354) .... ..-..+.+..+... .+ .+.+.+....+.+.|++.+ ..+|..+...+........++..+.+|.. +... T Consensus 272 ~~~~~~~~~l~~~~~~~~~--~g--~~~~~~~~~~~~a~~v~Eg-~~~~~~~~~~~~i~~~~~k~~~~~~is~e-ll~d- 344 (543) T protein:vir:81 272 ITSNGSLNDIRRFARQVVA--TG--DVWHGVSSAAVQWSWDAEF-EEVSDDSPEFGQPEIPVKKAQGFVPISIE-ALQD- 344 (543) T ss_pred HHHHhhhchhhhhcccccC--Cc--ceEEEEecCCcceeecccC-ccccccccccceeeeeeeeeEeeehhhHH-HHhc- Confidence 3333 32334555544321 22 2344455566777888765 44788888888889999999999999884 4332 Q ss_pred HhCCCcchHHHHHHHHHHHHHhhheeeeeehhh-CceeeeecCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCc Q lcl|NC_021342. 148 AMNMPIDAEQARLAFRGAEEHSQSVAYFGDASR-GMYGLFNNPNVTLSSATKDYKTMNGQELFNMLNAPIFSVINLSRRF 226 (354) Q Consensus 148 ~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~-gi~GLlN~p~~~~~~~~~~W~~~T~~ei~~di~~~~~~l~~~s~g~ 226 (354) ..++...-....+++++..+|+.+|+|+... ...|+++.+........ ...+..-.++|+.+++..+.. .. 
T Consensus 345 --~~~~~~~i~~~l~~~~~~~~d~ail~G~Gt~~~p~Gi~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~l~~---~~ 416 (543) T protein:vir:81 345 --EANVTETVALLFAEGKDELEAVTLTTGTGQGNQPTGIVTALAGTAAEIA---PVTAETFALADVYAVYEQLAA---RH 416 (543) T ss_pred --cHHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccccchhhccccccccc---ccccccccHHHHHHHHHhhhc---cc Confidence 2378888888899999999999999998643 57899987664322211 112223346888888887753 22 Q ss_pred ccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeeeeeeeccccccc--cccCcccEEEEEEc Q lcl|NC_021342. 227 HVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQLDAAELAANG--VSNSNKPRYMVYDK 304 (354) Q Consensus 227 ~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~~g--~g~~g~d~~v~y~~ 304 (354) .....++|+|..|..|.+.. +..|.=++. ++ ..|.+-.|...|.+......... ....+.. .|+|- T Consensus 417 ~~~~~~v~n~~~~~~l~~lk--d~~G~~l~~-----~~---~~g~~~~l~G~pv~~~~~~~~~~~~~~~~~~~-~i~~g- 484 (543) T protein:vir:81 417 RRQGAWLANNLIYNKIRQFD--TQGGAGLWT-----TI---GNGEPSQLLGRPVGEAEAMDANWNTSASADNF-VLLYG- 484 (543) T ss_pred cCCcEEEEcHHHHHHHHHhh--cCCCceecc-----Cc---CCCCCccccceeeEEeccccccccccccCCcc-eEEEe- Confidence 23347999999999997543 333321111 01 11222233333433333321111 1111222 23331 Q ss_pred CcceEEEeeCchhhh--cccc------ccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 305 SDRNLAMANPIPFRM--LAPQ------MASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 305 ~~~~~~~~vp~~~~~--~~~~------~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) |...+.+..-..++. 
.+-- .++ .+.+..+.+++ +.+++|.|++++.++ T Consensus 485 d~~~~~i~~~~~~~i~~~~~~~~~~~~~~~-~~~~~~~~r~d-~~v~~~~A~~~l~~~ 540 (543) T protein:vir:81 485 NFQNYVIADRIGMTVEFIPHLFGTNRRPNG-SRGWFAYYRMG-ADVVNPNAFRLLNVE 540 (543) T ss_pred eccceeEEeecccEEEEeccccccchhhcC-ceEEEEEEeec-cEeecccceEEEEec Confidence 122333332223222 1110 111 23445566775 567889999999999 No 50 >protein:vir:1886 Length: 385 # NCBI annotation: major capsid subunit precursor # Family: family:all:585 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037666;genbank:gi:9634124;genbank:GeneID:1262513 Probab=98.56 E-value=6.2e-08 Score=60.10 Aligned_cols=314 Identities=10% Similarity=0.033 Sum_probs=165.3 Q ss_pred Ccc-------cchhHH--HHh--hhhhhhcccccccccchhhhhh-hhhhhcc--------CCceeccchhhHHHHHHHH Q lcl|NC_021342. 1 MAI-------KTIDAQ--TIQ--GNQWLVHKGYVSRNGDQWVINN-TALDAIG--------NPNIMLDADGGIAFYISQL 60 (354) Q Consensus 1 ~~~-------~~~~~~--~~~--~~~~~~~~~~~~~~~~~~~~~~-~am~a~~--------~~~~~~da~~~~~fl~~~L 60 (354) -.+ +.++.+ .++ ...+...+...+.....+.... -.++... ...+...++.++.++. T Consensus 41 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~--- 117 (385) T protein:vir:18 41 SDLMKVQEELTKSGTRLFDLEQKLASGAENPGEKKSFSERAAEELIKSWDGKQGTFGAKTFNKSLGSDADSAGSLIQ--- 117 (385) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhccccccchhhhhHHHHHHHHHHHHHHhhccchhhHHHhhhccccccCCceec--- Confidence 000 000000 000 0000000000000000000000 0000000 0112222334444554 Q ss_pred HHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeecc-ccceeEecCCCcccceeeeccceeEEEEEEEEeeEeec Q lcl|NC_021342. 61 AGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYDG-VTMGKFIGANGQDLPRVAQSAQMHTVPLGYAGNECHYT 139 (354) Q Consensus 61 ~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~-~G~a~~~~~~~~dip~v~~~~~~~~~pv~~~~~~~~~~ 139 (354) ..+.+.+++........+.++++.. .+. .++.+..... 
.+.+.+++.+ ..+|..+...+........++..+.++ T Consensus 118 ~~~~~~ii~~~~~~~~l~~~~~~~~-~~~--~~~~~~~~~~~~~~a~~v~E~-~~~~~~~~~~~~~~~~~~k~~~~~~is 193 (385) T protein:vir:18 118 PMQIPGIIMPGLRRLTIRDLLAQGR-TSS--NALEYVREEVFTNNADVVAEK-ALKPESDITFSKQTANVKTIAHWVQAS 193 (385) T ss_pred chhhhHHHHHhhhccchhhhcceec-ccC--cceEEEEEecCCcceeeeccC-ccccccccceeEEEEeeeeEEEeehhh Confidence 3456678888888888888887653 222 2345555543 4567777664 558888888888899999999999988 Q ss_pred HHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehhh-CceeeeecCCcccccccccccccCHHHHHHHHHHHHHH Q lcl|NC_021342. 140 LDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDASR-GMYGLFNNPNVTLSSATKDYKTMNGQELFNMLNAPIFS 218 (354) Q Consensus 140 ~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~-gi~GLlN~p~~~~~~~~~~W~~~T~~ei~~di~~~~~~ 218 (354) . |+..-. ..+...-....+++++..+|+.+++|+... ...|+++.++....+... +.+..+++|.+++.+ T Consensus 194 ~-ell~d~---~~l~~~i~~~la~a~~~~~d~~~l~G~g~~~~~~Gi~~~~~~~~~~~~~-----~~~~~~d~i~~~~~~ 264 (385) T protein:vir:18 194 R-QVMDDA---PMLQSYINNRLMYGLALKEEGQLLNGDGTGDNLEGLNKVATAYDTSLNA-----TGDTRADIIAHAIYQ 264 (385) T ss_pred H-HHHhhH---HHHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccccccccccccccccc-----cccchHHHHHHHHHh Confidence 6 443322 247777788889999999999999997543 457999988765433221 233357888888888 Q ss_pred HHHHhCCcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeeeeeeeccccccccccCcccE Q lcl|NC_021342. 219 VINLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQLDAAELAANGVSNSNKPR 298 (354) Q Consensus 219 l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~~g~g~~g~d~ 298 (354) +... ...+..++|+|..|..|..-. +..|.-++. ++. .+.+-.+...|.+.+..... + . 
T Consensus 265 l~~~---~~~~~~~~~~~~~~~~l~~lk--d~~G~~l~~-----~~~---~~~~~~l~G~pV~~~~~~p~-~-------~ 323 (385) T protein:vir:18 265 VTES---EFSASGIVLNPRDWHNIALLK--DNEGRYIFG-----GPQ---AFTSNIMWGLPVVPTKAQAA-G-------T 323 (385) T ss_pred hccc---cCCCCEEEEcHHHHHHHHHhh--cCCCceecc-----Ccc---cCCCceecceeeEEcCcCCC-C-------c Confidence 7542 345678999999999986533 444433321 111 12222333344443332211 1 1 Q ss_pred EEEEEcCcceEEEeeCchhhhccc-cc-----cCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 299 YMVYDKSDRNLAMANPIPFRMLAP-QM-----ASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 299 ~v~y~~~~~~~~~~vp~~~~~~~~-~~-----~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) .++.+.+ +.+.+..-..++.... +. ++ .+.+.++.|++ +.+++|.++++++++ T Consensus 324 ~~~gd~~-~~~~~~~~~~~~v~~~~~~~~~~~~~-~~~~~~~~r~~-~~v~~~~a~~~~~~~ 382 (385) T protein:vir:18 324 FTVGGFD-MASQVWDRMDATVEVSREDRDNFVKN-MLTILCEERLA-LAHYRPTAIIKGTFS 382 (385) T ss_pred EEEeecc-cEEEEEEecceEEEEeccccchhhcC-cEEEEEEEeec-cEEecccceEEEEec Confidence 1222211 2222222222222111 11 22 34556777876 667899999999999 No 51 >protein:vir:191 Length: 385 # NCBI annotation: major head subunit precursor # Family: family:all:585 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037701;genbank:gi:9634158;genbank:GeneID:1262530 Probab=98.56 E-value=6.2e-08 Score=60.10 Aligned_cols=314 Identities=10% Similarity=0.033 Sum_probs=165.3 Q ss_pred Ccc-------cchhHH--HHh--hhhhhhcccccccccchhhhhh-hhhhhcc--------CCceeccchhhHHHHHHHH Q lcl|NC_021342. 1 MAI-------KTIDAQ--TIQ--GNQWLVHKGYVSRNGDQWVINN-TALDAIG--------NPNIMLDADGGIAFYISQL 60 (354) Q Consensus 1 ~~~-------~~~~~~--~~~--~~~~~~~~~~~~~~~~~~~~~~-~am~a~~--------~~~~~~da~~~~~fl~~~L 60 (354) -.+ +.++.+ .++ ...+...+...+.....+.... -.++... ...+...++.++.++. 
T Consensus 41 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~--- 117 (385) T protein:vir:19 41 SDLMKVQEELTKSGTRLFDLEQKLASGAENPGEKKSFSERAAEELIKSWDGKQGTFGAKTFNKSLGSDADSAGSLIQ--- 117 (385) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhccccccchhhhhHHHHHHHHHHHHHHhhccchhhHHHhhhccccccCCceec--- Confidence 000 000000 000 0000000000000000000000 0000000 0112222334444554 Q ss_pred HHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeecc-ccceeEecCCCcccceeeeccceeEEEEEEEEeeEeec Q lcl|NC_021342. 61 AGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYDG-VTMGKFIGANGQDLPRVAQSAQMHTVPLGYAGNECHYT 139 (354) Q Consensus 61 ~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~-~G~a~~~~~~~~dip~v~~~~~~~~~pv~~~~~~~~~~ 139 (354) ..+.+.+++........+.++++.. .+. .++.+..... .+.+.+++.+ ..+|..+...+........++..+.++ T Consensus 118 ~~~~~~ii~~~~~~~~l~~~~~~~~-~~~--~~~~~~~~~~~~~~a~~v~E~-~~~~~~~~~~~~~~~~~~k~~~~~~is 193 (385) T protein:vir:19 118 PMQIPGIIMPGLRRLTIRDLLAQGR-TSS--NALEYVREEVFTNNADVVAEK-ALKPESDITFSKQTANVKTIAHWVQAS 193 (385) T ss_pred chhhhHHHHHhhhccchhhhcceec-ccC--cceEEEEEecCCcceeeeccC-ccccccccceeEEEEeeeeEEEeehhh Confidence 3456678888888888888887653 222 2345555543 4567777664 558888888888899999999999988 Q ss_pred HHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehhh-CceeeeecCCcccccccccccccCHHHHHHHHHHHHHH Q lcl|NC_021342. 140 LDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDASR-GMYGLFNNPNVTLSSATKDYKTMNGQELFNMLNAPIFS 218 (354) Q Consensus 140 ~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~-gi~GLlN~p~~~~~~~~~~W~~~T~~ei~~di~~~~~~ 218 (354) . |+..-. ..+...-....+++++..+|+.+++|+... ...|+++.++....+... 
+.+..+++|.+++.+ T Consensus 194 ~-ell~d~---~~l~~~i~~~la~a~~~~~d~~~l~G~g~~~~~~Gi~~~~~~~~~~~~~-----~~~~~~d~i~~~~~~ 264 (385) T protein:vir:19 194 R-QVMDDA---PMLQSYINNRLMYGLALKEEGQLLNGDGTGDNLEGLNKVATAYDTSLNA-----TGDTRADIIAHAIYQ 264 (385) T ss_pred H-HHHhhH---HHHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccccccccccccccccc-----cccchHHHHHHHHHh Confidence 6 443322 247777788889999999999999997543 457999988765433221 233357888888888 Q ss_pred HHHHhCCcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeeeeeeeccccccccccCcccE Q lcl|NC_021342. 219 VINLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQLDAAELAANGVSNSNKPR 298 (354) Q Consensus 219 l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~~g~g~~g~d~ 298 (354) +... ...+..++|+|..|..|..-. +..|.-++. ++. .+.+-.+...|.+.+..... + . T Consensus 265 l~~~---~~~~~~~~~~~~~~~~l~~lk--d~~G~~l~~-----~~~---~~~~~~l~G~pV~~~~~~p~-~-------~ 323 (385) T protein:vir:19 265 VTES---EFSASGIVLNPRDWHNIALLK--DNEGRYIFG-----GPQ---AFTSNIMWGLPVVPTKAQAA-G-------T 323 (385) T ss_pred hccc---cCCCCEEEEcHHHHHHHHHhh--cCCCceecc-----Ccc---cCCCceecceeeEEcCcCCC-C-------c Confidence 7542 345678999999999986533 444433321 111 12222333344443332211 1 1 Q ss_pred EEEEEcCcceEEEeeCchhhhccc-cc-----cCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 299 YMVYDKSDRNLAMANPIPFRMLAP-QM-----ASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 299 ~v~y~~~~~~~~~~vp~~~~~~~~-~~-----~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) .++.+.+ +.+.+..-..++.... +. 
++ .+.+.++.|++ +.+++|.++++++++ T Consensus 324 ~~~gd~~-~~~~~~~~~~~~v~~~~~~~~~~~~~-~~~~~~~~r~~-~~v~~~~a~~~~~~~ 382 (385) T protein:vir:19 324 FTVGGFD-MASQVWDRMDATVEVSREDRDNFVKN-MLTILCEERLA-LAHYRPTAIIKGTFS 382 (385) T ss_pred EEEeecc-cEEEEEEecceEEEEeccccchhhcC-cEEEEEEEeec-cEEecccceEEEEec Confidence 1222211 2222222222222111 11 22 34556777876 667899999999999 No 52 >protein:vir:2430 Length: 318 # NCBI annotation: major head subunit # Family: family:all:507 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046832;genbank:gi:9630400;genbank:GeneID:1261582 Probab=98.56 E-value=4.7e-08 Score=60.77 Aligned_cols=292 Identities=8% Similarity=-0.030 Sum_probs=155.8 Q ss_pred ccccchhhhhhhhhhhccCCceeccchhhHHHHHHHHHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeecccc Q lcl|NC_021342. 24 SRNGDQWVINNTALDAIGNPNIMLDADGGIAFYISQLAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYDGVT 103 (354) Q Consensus 24 ~~~~~~~~~~~~am~a~~~~~~~~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~G 103 (354) ...+.++.....+|-..+ +.+++ ..+- +.+..++++...+.-..+++..+.. .+. .+..+.+....+ T Consensus 1 ~~~~~~~~~e~~~~~~~~------~~~~~-~~ip---~~~~~~ii~~~~~~~~l~~~~~~~~-~~~--~~~~ip~~~~~~ 67 (318) T protein:vir:24 1 MAAGTAFAVDHAQIAQTG------DTMFK-GYLE---PEQAKDYFAEAEKTSIVQQFAQKVP-MGT--TGQKIPHWVGDV 67 (318) T ss_pred CCCCCCCCHHHHHhhccc------Ccccc-eeec---hhHHHHHHHHHHhhchhhhhcceee-ccC--CceEEEEEeCCc Confidence 344455555555443321 12222 2333 3445667776666666677765432 222 234555566677 Q ss_pred ceeEecCCCcccceeeeccceeEEEEEEEEeeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehhhCce Q lcl|NC_021342. 104 MGKFIGANGQDLPRVAQSAQMHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDASRGMY 183 (354) Q Consensus 104 ~a~~~~~~~~dip~v~~~~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~gi~ 183 (354) .+.+++.. ..+|..+...+........++....+|.+=|+.+ ..++...-....++++++.+|+.+++|+..-.-. 
T Consensus 68 ~a~~v~Eg-~~~~~~~~~f~~i~~~~~k~~~~~~iS~e~l~ds---~~~~~~~i~~~l~~~~~~~~d~a~l~G~g~~~~~ 143 (318) T protein:vir:24 68 SAQWIGEG-DMKPITKGNMTSQTIAPHKIATIFVASAETVRAN---PANYLGTMRTKVATAFAMAFDGAAMHGTDSPFPT 143 (318) T ss_pred ceEEecCC-ccccccccceeEEEEeeEEEEEeehhhHHHhhcC---hHHHHHHHHHHHHHHHHHHHHHhhhcccCCCCCc Confidence 88888764 5588888888888888898888888877544533 3568888888999999999999999998654445 Q ss_pred eeeecCCc-ccccccccccccCHHHHHHHHHHHHHHHHHHhCCcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhh Q lcl|NC_021342. 184 GLFNNPNV-TLSSATKDYKTMNGQELFNMLNAPIFSVINLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEA 262 (354) Q Consensus 184 GLlN~p~~-~~~~~~~~W~~~T~~ei~~di~~~~~~l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n 262 (354) |+++.... +....... ......++.+++..+.. ....+..++|+|+.|..|.+.. +..+..++.-...+ T Consensus 144 ~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~---~~~~~~~~v~n~~~~~~L~~lk--d~~G~~l~~~~~~~ 213 (318) T protein:vir:24 144 YIGQTTKAISIADTTGA-----TTVYDQVAVNGLSLLVN---DGKKWTHTLLDDITEPILNGAK--DQNGRPLFIESTYG 213 (318) T ss_pred ccccccccccccccccc-----cchHHHHHHHHHHhhcc---ccCCCCEEEEcHHHHHHHHHhh--ccCCceeecCcccc Confidence 55554321 11111111 11112344555555432 3345678999999999997533 33343322111111 Q ss_pred CcccccccccceeeeeeeeeeccccccccccCcccEEEEEEcCcceEEEeeCchhhhccc-------------c-----c Q lcl|NC_021342. 263 NSYTLLTGNELDIQIRFQLDAAELAANGVSNSNKPRYMVYDKSDRNLAMANPIPFRMLAP-------------Q-----M 324 (354) Q Consensus 263 ~~~~~~~g~~l~I~~~~~L~~~~~~~~g~g~~g~d~~v~y~~~~~~~~~~vp~~~~~~~~-------------~-----~ 324 (354) +......+.+ +...|....... ..++..++.-+. ..+-+....+++.... . . T Consensus 214 ~~~~~~~~~~--i~g~pv~~~~~~------~~~~~~~~~gdf--s~~~~~~~~~l~i~~~~~~~~~~~~~~~~~~~~~f~ 283 (318) T protein:vir:24 214 EAASPFRSGR--IVARPTILSDHV------VEGTTVGFMGDF--SQLIWGQIGGLSFDVTDQATLNLGTVESPNFVSLWQ 283 (318) T ss_pred CccccccCce--EEEEeeEEeCCC------CCCccEEEEeec--ceEEEEEecCeEEEEeeccceeccccccccchhhhh Confidence 1111111112 222222222211 122332222221 2222222222221110 0 1 Q ss_pred cCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 
325 ASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 325 ~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) ++ ...+.+..++ ++.+.+|.|++++-.+ T Consensus 284 ~~-~~~~r~~~r~-d~~v~~~~a~~~i~~~ 311 (318) T protein:vir:24 284 HN-LVAVRVEAEY-AFHCNDAEAFVALTNV 311 (318) T ss_pred cC-cEEEEEEEEE-ccEEecccceEEEEee Confidence 11 2456677887 4777999999998887 No 53 >protein:vir:9410 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803388;genbank:gi:29028700;genbank:GeneID:1258136 Probab=98.56 E-value=1.8e-08 Score=63.03 Aligned_cols=320 Identities=6% Similarity=-0.048 Sum_probs=151.8 Q ss_pred CcccchhH-----------------------------------HHHhhhhhhhcccccccccchhhh--hh--hhhhhcc Q lcl|NC_021342. 1 MAIKTIDA-----------------------------------QTIQGNQWLVHKGYVSRNGDQWVI--NN--TALDAIG 41 (354) Q Consensus 1 ~~~~~~~~-----------------------------------~~~~~~~~~~~~~~~~~~~~~~~~--~~--~am~a~~ 41 (354) --++.|+. .....+......+.......+... +. ....+. T Consensus 42 ~ei~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~- 120 (415) T protein:vir:94 42 QEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEASTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQ- 120 (415) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccchhhHHHHHHHHHHHhhhhhhhhhHHHHHHHHHHhhhhhhhh- Confidence 00000000 000000000000000000001000 00 000000 Q ss_pred CCceeccchhhHHHHHHHHHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeeccccceeEecCCCcccceee-e Q lcl|NC_021342. 42 NPNIMLDADGGIAFYISQLAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYDGVTMGKFIGANGQDLPRVA-Q 120 (354) Q Consensus 42 ~~~~~~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~v~-~ 120 (354) ...++. +.+.+++. +.+.+.+++........+.++.+.. .+.+...+.+......+.+.+++.++. +|-.+ . 
T Consensus 121 --~~~~~~-~~g~~~iP--~~~~~~ii~~~~~~~~l~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~v~Eg~~-~~~~~~~ 193 (415) T protein:vir:94 121 --GGSLKT-DSGFVVIP--EEIVTDILKLKEVEFNLDKYVTVKR-VTNGSGKYPVVRQSEVAALEKVEELEE-NPELAVK 193 (415) T ss_pred --hhcccc-ccccccCc--HHHHHHHHHHHHhhhhhhhhcceee-ccCCceeEEEEeecCCccceecccccc-ccccccc Confidence 000111 11223333 5667788888888888888776542 233333445555555566777765543 55433 4 Q ss_pred ccceeEEEEEEEEeeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehhhCce-eeeecCCccccccccc Q lcl|NC_021342. 121 SAQMHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDASRGMY-GLFNNPNVTLSSATKD 199 (354) Q Consensus 121 ~~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~gi~-GLlN~p~~~~~~~~~~ 199 (354) ..+.....++.++.-+.+|.+=++. ...++..--....++++++.+|+.+++|+....-. ++......... T Consensus 194 ~~~~i~~~~~k~~~~~~is~ell~d---s~~~~~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~~----- 265 (415) T protein:vir:94 194 PFFQLAYDINTHRGYFRISREAIED---AKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKK----- 265 (415) T ss_pred cceeeEeeheeeeeechhhHHHHhh---chHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccccccccccccc----- Confidence 5677888888888888887653332 34577788888899999999999999997643222 22221111111 Q ss_pred ccccCHHHHHHHHHHHHHHHHHHhCCcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeee Q lcl|NC_021342. 200 YKTMNGQELFNMLNAPIFSVINLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRF 279 (354) Q Consensus 200 W~~~T~~ei~~di~~~~~~l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~ 279 (354) +... ...-+++|.+++.++... ...+..++|+|+.|..|.... 
+..|.-++ ..++ .++.+-.|...| T Consensus 266 ~~~~-~~~~~~~i~~~~~~~~~~---~~~~~~~vmn~~~~~~l~~lk--d~~G~~l~----~~~~---~~~~~~~l~G~p 332 (415) T protein:vir:94 266 LEVK-KAKSLDDIKDAINLNVKP---NYEHNVAIVSQTMFAKLDKMK--DKLGNYLI----QPDV---KEKTQQRLLGAK 332 (415) T ss_pred cccc-cccchHHHHHHHHhhhhh---ccCCCEEEEcHHHHHHHHHhh--ccCCCeee----ccCc---CCCCCceeccee Confidence 1111 112267788888877542 235779999999999997532 44443221 1111 122222333333 Q ss_pred eeeeccccccccccCcccEEEEEEcCcceEEEeeCchhhhccccccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 280 QLDAAELAANGVSNSNKPRYMVYDKSDRNLAMANPIPFRMLAPQMASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 280 ~L~~~~~~~~g~g~~g~d~~v~y~~~~~~~~~~vp~~~~~~~~~~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) .......-. ++.++..+++-+.. +.+.+..-..+++..............+.|+ ++.+.+|.|++++++. T Consensus 333 V~~~~~~~~---~~~~~~~i~~gd~~-~~~~~~~~~~~~v~~~~~~~~~~~~r~~~r~-d~~~~~~~a~~~~~~~ 402 (415) T protein:vir:94 333 IEILPDEVL---GQKGNNTLIIGNLK-DAIVLFDRSQYQASWTDYMHFGECLMIAVRQ-DCRILDYKSAIVIEYD 402 (415) T ss_pred eEEeccccc---CCCCccEEEEEehh-ccEEEEeecceEEEEeccccCceEEEEEEEe-ccEEeccccEEEEEEe Confidence 222222111 11222222222212 2222222233333222222222334456776 5777889999999998 No 54 >protein:vir:103955 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873992;genbank:gi:118430767;genbank:GeneID:4525449 Probab=98.54 E-value=7.7e-08 Score=59.57 Aligned_cols=294 Identities=7% Similarity=-0.028 Sum_probs=155.1 Q ss_pred ccchhHHHHhhhhhhhcccccccccchhhhhhhhhhhccCCceeccchhhHHHHHHHHHHHHHHHHHhhhhcccchhhcc Q lcl|NC_021342. 3 IKTIDAQTIQGNQWLVHKGYVSRNGDQWVINNTALDAIGNPNIMLDADGGIAFYISQLAGIEATVYETPYGDITYRFDVP 82 (354) Q Consensus 3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~am~a~~~~~~~~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~ 82 (354) +|.-+-....- ..|..+..+++.-. 
...++..+.++..+- +.+...+++.....-..+++++ T Consensus 1 ~~~~~~~~~~~--------------~~f~~~~~~~~~~~-a~~~~~~~~~~~liP---~~~~~~ii~~~~~~s~l~~~~~ 62 (324) T protein:vir:10 1 MEQTQKLKLNL--------------QHFASNNVKPQVFN-PDNVMMHEKKDGTLL---NDFTTPILQEVMENSKIMQLGK 62 (324) T ss_pred CCCchHHHHHH--------------HHHHHHhhccceec-ccceeccCCCcceec---hhHHHHHHHHHHhhchhhhhcc Confidence 22221111111 11111111221111 111122222223333 3445667776666666777766 Q ss_pred ccCCCCCceeEEEEEeeccccceeEecCCCcccceeeeccceeEEEEEEEEeeEeecHHHHHHHHHhCCCcchHHHHHHH Q lcl|NC_021342. 83 MAANIPEYADTWMYRSYDGVTMGKFIGANGQDLPRVAQSAQMHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQARLAF 162 (354) Q Consensus 83 v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~v~~~~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~ 162 (354) +.. .+. .++.+.+.+..+.+.+++.+ ..+|..+...+........++....+|.+-++.+ ..++...-....+ T Consensus 63 ~~~-~~~--~~~~~p~~~~~~~a~~v~Eg-~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds---~~~l~~~i~~~l~ 135 (324) T protein:vir:10 63 YEP-MEG--TEKKFTFWADKPGAYWVGEG-QKIETSKATWVNATMRAFKLGVILPVTKEFLNYT---YSQFFEEMKPMIA 135 (324) T ss_pred eee-ccC--CceEEEEEeCCcceeEeccC-ccccccccceeEEEEeeEEEEEeehhhHHHHhcc---hHHHHHHHHHHHH Confidence 543 222 23556666677888998765 5578888888888889999998888887655544 3467888888899 Q ss_pred HHHHHHhhheeeeeehhhC-ceeeeecCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCcccccEEEeCHHHHHH Q lcl|NC_021342. 163 RGAEEHSQSVAYFGDASRG-MYGLFNNPNVTLSSATKDYKTMNGQELFNMLNAPIFSVINLSRRFHVPNTALMFPDLWNQ 241 (354) Q Consensus 163 ~~~a~~~n~~~f~G~~~~g-i~GLlN~p~~~~~~~~~~W~~~T~~ei~~di~~~~~~l~~~s~g~~~p~~L~l~p~~~~~ 241 (354) +++++.+|+.+++|+...+ -.|+++......... +...-+++|.+++..+.. ....+..++|+|..|.. 
T Consensus 136 ~ai~~~~d~a~l~G~g~~~~~~~i~~~~~~~~~~~-------~~~~t~~~i~~~~~~l~~---~~~~~~~~v~n~~~~~~ 205 (324) T protein:vir:10 136 EAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVI-------KGDFTQDNIIDLEALLED---DELEANAFISKTQNRSL 205 (324) T ss_pred HHHHHHHHHHhhhcCCCCccCccccccccccceec-------cccCCHHHHHHHHHhhhh---ccCCCCEEEEcHHHHHH Confidence 9999999999999975432 345554332211111 111226788888888754 23456789999999999 Q ss_pred HhhcccCCCCCchHHHHHHhhCcccccccccceeeeeeeeeeccccccccccCcccEEEEEEcCcceEEEeeCchhhhc- Q lcl|NC_021342. 242 ANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQLDAAELAANGVSNSNKPRYMVYDKSDRNLAMANPIPFRML- 320 (354) Q Consensus 242 L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~~g~g~~g~d~~v~y~~~~~~~~~~vp~~~~~~- 320 (354) |.+-. +..+.-++ . .+.+-++...|...... ...++..+++-+ ...+.+....+++.- T Consensus 206 L~~l~--d~~g~~~~---~--------~~~~~~l~G~PV~~~~~------~~~~~~~~~~gd--~~~~~~~~~~~~~i~~ 264 (324) T protein:vir:10 206 LRKIV--DPETKERI---Y--------DRNSDTLDGLPVVNLKS------SNLKRGELITGD--FDKLIYGIPQLIEYKI 264 (324) T ss_pred HHHhh--ccCCceee---c--------CCCCccccceeEEeecC------CCCCcceEEEEe--cccEEEEEecCcEEEE Confidence 87533 32232111 0 11111222222221111 111222222212 122222222222211 Q ss_pred -------ccc----------ccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 321 -------APQ----------MASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 321 -------~~~----------~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) ... 
.++ ...+.++.+++ ..+.+|.|++++..+ T Consensus 265 ~~~~~~~~~~~~~~~~~~~~~~~-~~~~r~~~r~d-~~v~~~~A~~~l~~a 313 (324) T protein:vir:10 265 DETAQLSTVKNEDGTPVNLFEQD-MVALRATMHVA-LHIADDKAFAKLVPA 313 (324) T ss_pred eecccccccccccccchhhhhcC-cEEEEEEEEEc-cEEecccceEEEEec Confidence 000 011 24556677775 556679999999998 No 55 >protein:vir:100135 Length: 418 # NCBI annotation: gp5 # Family: family:all:585 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945035;genbank:gi:38707895;genbank:GeneID:2744182 Probab=98.53 E-value=8e-08 Score=59.51 Aligned_cols=314 Identities=10% Similarity=0.110 Sum_probs=156.9 Q ss_pred CcccchhHHHHhhh--hhhhcccccc--------------cc--cchhhhhhhhhhhccCCceeccchhhHHHHHHHHHH Q lcl|NC_021342. 1 MAIKTIDAQTIQGN--QWLVHKGYVS--------------RN--GDQWVINNTALDAIGNPNIMLDADGGIAFYISQLAG 62 (354) Q Consensus 1 ~~~~~~~~~~~~~~--~~~~~~~~~~--------------~~--~~~~~~~~~am~a~~~~~~~~da~~~~~fl~~~L~~ 62 (354) -.++.++.+..... .....+.... .. ...+........... .....+...++..+. +. T Consensus 75 ~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~g~lvp---~~ 150 (418) T protein:vir:10 75 ARLLEAEQKLARGGGSAELETPKTLGQLVTESEEMKGMDGSARKSVRVRVDRKSIMNVP-ATVGSGVSGSNSLVV---AD 150 (418) T ss_pred HHHHHHHHHHhhcccccccchhhhhhHHhhhHHHHHHHHHHHhhhhhhhhHHHHHHHhh-hhccCCCCCCccccc---hh Confidence 00000000000000 0000000000 00 000000000000000 011111122333333 45 Q ss_pred HHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeecc-ccceeEecCCCcccceeeeccceeEEEEEEEEeeEeecHH Q lcl|NC_021342. 63 IEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYDG-VTMGKFIGANGQDLPRVAQSAQMHTVPLGYAGNECHYTLD 141 (354) Q Consensus 63 Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~-~G~a~~~~~~~~dip~v~~~~~~~~~pv~~~~~~~~~~~~ 141 (354) +.+.+++........+.++++.. .+.+ ++.+..... .+.+.|++.+ ..+|..+...+......+.++..+.+|.. 
T Consensus 151 ~~~~ii~~~~~~~~l~~~~~~~~-~~~~--~~~~~~~~~~~~~a~~v~E~-~~~~~~~~~f~~v~~~~~k~~~~~~is~e 226 (418) T protein:vir:10 151 RQAGIIAPPQRKMTIRDLLMPGQ-TSSS--SIEYTVETGFTNNAAAVAEG-AQKPTSDLKFNLKNQPVRTIAHLFKASRQ 226 (418) T ss_pred HHHHHHHHHhhhhhHHhhcceee-ccCC--ceeEEEEecCCCceeeeccC-ccccccccceeeEEEeeeeEEEeehhhHH Confidence 56678887777777777776543 2222 234444333 4566777665 44788888888888888998888888865 Q ss_pred HHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehhh-CceeeeecCCcccccccccccccCHHHHHHHHHHHHHHHH Q lcl|NC_021342. 142 EMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDASR-GMYGLFNNPNVTLSSATKDYKTMNGQELFNMLNAPIFSVI 220 (354) Q Consensus 142 El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~-gi~GLlN~p~~~~~~~~~~W~~~T~~ei~~di~~~~~~l~ 220 (354) =|+.+ . ++..--....++++++.+|+.+|+|+..- ...|+++..+....+.+. +...-+++|.+++..+. T Consensus 227 ll~ds---~-~l~~~i~~~l~~a~~~~~d~a~l~G~g~~~~p~Gi~~~~~~~~~~~~~-----~~~~~~~~i~~~~~~~~ 297 (418) T protein:vir:10 227 ILDDA---P-ALQSYIDGRARYGLQLTEEGQILKGDGTGANILGILPQASAFMPSITL-----ANATPIDKIRLALLQAV 297 (418) T ss_pred HHHhH---H-HHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccccccc-----cccccHHHHHHHHHhhc Confidence 44322 2 57777888889999999999999997654 478999988765433221 11123677777777765 Q ss_pred HHhCCcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeeeeeeeccccccccccCcccEEE Q lcl|NC_021342. 221 NLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQLDAAELAANGVSNSNKPRYM 300 (354) Q Consensus 221 ~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~~g~g~~g~d~~v 300 (354) . ....+..++|+|..|..|.+.. +..|.-++. ++. .+.+-.|...|.+.+..... +. 
.+ T Consensus 298 ~---~~~~~~~~v~n~~~~~~L~~lk--d~~G~~i~~-----~~~---~~~~~~l~G~pV~~~~~~p~-~~-------~~ 356 (418) T protein:vir:10 298 L---AEFPATGIVLNPIDWASIELTK--DSQGRYIVG-----NPV---NGTTPRLWNLPVVETQAMTA-NE-------FL 356 (418) T ss_pred c---ccCCCCEEEEcHHHHHHHHHhh--cCCCceecc-----ccc---cCCCceecceeeEEcCCCCC-Cc-------EE Confidence 3 2345568999999999987543 434432221 111 12222333334433332211 10 11 Q ss_pred EEEcCcceEEEeeCchhhhccc-ccc----CceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 301 VYDKSDRNLAMANPIPFRMLAP-QMA----SLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 301 ~y~~~~~~~~~~vp~~~~~~~~-~~~----~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) +-+.+ +.+.+..-+.++...- +.+ .-...+.++.+++ +.+++|.|+++++++ T Consensus 357 ~gd~s-~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~~~d-~~~~~~~a~~~~~~~ 413 (418) T protein:vir:10 357 VGAFS-MAAQIFDRMEIEVLLSTENVDDFEKNMVSIRAEERLA-LAVYRPESFVTGALV 413 (418) T ss_pred Eeecc-ceEEEEEecceEEEEecccchhhhcCceEEEEEEeec-cEEecccceEEEEec Confidence 11111 1122221122222111 111 1123555677776 569999999999999 No 56 >protein:vir:8420 Length: 477 # NCBI annotation: gp15 # Family: family:all:21 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818316;genbank:gi:29566752;genbank:GeneID:1260033 Probab=98.52 E-value=2.2e-08 Score=62.53 Aligned_cols=336 Identities=11% Similarity=0.062 Sum_probs=159.1 Q ss_pred Ccc--------cchhHHHHh---------------hhhhhhcccccccccchhhhhhhhhhhccCCceeccchhhHHHHH Q lcl|NC_021342. 1 MAI--------KTIDAQTIQ---------------GNQWLVHKGYVSRNGDQWVINNTALDAIGNPNIMLDADGGIAFYI 57 (354) Q Consensus 1 ~~~--------~~~~~~~~~---------------~~~~~~~~~~~~~~~~~~~~~~~am~a~~~~~~~~da~~~~~fl~ 57 (354) ... ...+..... .+...-......... +.......+.. ..+++.....++.... 
T Consensus 93 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~--~~~~~~~~~~gg~lv~ 169 (477) T protein:vir:84 93 ATVEVNEALTYEKGNGQSYFRDLAMQTVGMADEPAKERLRRHMVDVESDK-EIRKIAKVGEE--YRDLDRNGGTGGYAVP 169 (477) T ss_pred cccccccchhhhhhHHHHHHHHHHHHHhhhhhhHHHHHHHHHHhhhhhhh-hHHHHHHhhhh--hccccccCCCcceeec Confidence 000 000000000 000000000000000 00000001111 1122222222332222 Q ss_pred HHHHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeecccc-ceeEecCCC----cccceeeeccceeEEEEEEE Q lcl|NC_021342. 58 SQLAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYDGVT-MGKFIGANG----QDLPRVAQSAQMHTVPLGYA 132 (354) Q Consensus 58 ~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~G-~a~~~~~~~----~dip~v~~~~~~~~~pv~~~ 132 (354) . +.+...+++...+....++++... +++.....+.+...+..+ .+.+.+.++ ...|..+...+....+.+.+ T Consensus 170 ~--~~~~~~ii~~l~~~~~i~~~~~~~-~~~~~~~~~~ip~~~~~~~~a~~~~Eg~~~~~~~~~~s~~~f~~i~~~~~k~ 246 (477) T protein:vir:84 170 P--LWMMNRFIELARAGRTYANLCPTE-PLPGGTSSINIPKILTGTSTAIQAADNAALTAPSAHEVDLTDGFVQANVKTI 246 (477) T ss_pred c--chhHHHHHHHhhhcchHHHhhcee-eecCCcceeEEEEEecCcceeeeeccCcccccccccccccceeeEEEeeeeE Confidence 2 334456777776666666665543 222233334444443222 234555432 34566666777778888888 Q ss_pred EeeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehh-hCceeeeecCCcccccc---cccccccCHHHH Q lcl|NC_021342. 133 GNECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDAS-RGMYGLFNNPNVTLSSA---TKDYKTMNGQEL 208 (354) Q Consensus 133 ~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~-~gi~GLlN~p~~~~~~~---~~~W~~~T~~ei 208 (354) +.-+.+|.+=|+.+ ..++..--....+.+++..+|..+++|+.. ....||+|.+++...+. +.+|.. .+.. 
T Consensus 247 ~~~~~iS~ell~ds---~~~l~~~i~~~l~~~~~~~~d~~~l~G~Gt~~~p~Gi~~~~~~~~~~~~~~~~t~~~--~~~~ 321 (477) T protein:vir:84 247 AGQQGIAIQLLDQA---AVSVDEFVFRDLAADYANKLNVQVISGTGSNNQVVGVRATAGITQVTATSAGSALEK--HQII 321 (477) T ss_pred EeeeHHHHHHHhcc---chhHHHHHHHHHHHHHHHHHHHHHhccCCCCCccceeeeccccccccccccccchhh--HHHH Confidence 87777765444443 457888888899999999999999999864 45799999998764333 334433 4456 Q ss_pred HHHHHHHHHHHHHHhCCcccccEEEeCHHHHHHHhhcccCCCCCchHHH--------HHHhhCcccccccccceeeeeee Q lcl|NC_021342. 209 FNMLNAPIFSVINLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQ--------HFMEANSYTLLTGNELDIQIRFQ 280 (354) Q Consensus 209 ~~di~~~~~~l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~--------~l~~n~~~~~~~g~~l~I~~~~~ 280 (354) +.+|.+++..+.. ++...+...+|+|..|..|.+-. +..+.-+++ ...... ....+.+-.+...|. T Consensus 322 ~~~i~~~~~~~~~--~~~~~~~~~v~~~~~~~~l~~lk--d~~G~~l~~~~~~~~~~~~~~~~--~~~~~~~~~l~G~pV 395 (477) T protein:vir:84 322 YQKIADAIQRVHT--SRFLEPEVIVMHPRRWASFHAIF--AGDDRPLIVPSGPGFNNLGVLTE--VASQRVVGQMHGLPV 395 (477) T ss_pred HHHHHHHHhhccc--cccCCccEEEEcHHHHHHHHHhh--ccCCCeeeecCcccccccccccc--cccccccchhcccce Confidence 6777777666543 23445667999999999886522 333321110 000000 000112223333344 Q ss_pred eeeccccccccccCcccEEEEEEcCcceEEEeeCchhhhcccccc-CceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 281 LDAAELAANGVSNSNKPRYMVYDKSDRNLAMANPIPFRMLAPQMA-SLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 281 L~~~~~~~~g~g~~g~d~~v~y~~~~~~~~~~vp~~~~~~~~~~~-~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) +.+...- .+.+.++....++|-+-.+.+-..-.+.+...+--+. 
.....+.........-+|+|.|++.+-.+ T Consensus 396 v~s~~~p-~~~~~~~d~~~i~~gd~~~~~i~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~r~~~afv~~t~~ 469 (477) T protein:vir:84 396 VTDPTLP-TTLGTGTDQDVIHVLRASDLALFESSVRMRALQETRAENLSVLLQVYGYLAFTAARFPQSVVEIGGT 469 (477) T ss_pred EecCccc-ccccccCCcceEEEEEeceEEEEeeceeEEeccccccccceeeeeehhhhhhhhhccccceEEeecc Confidence 4443322 1223322223344433344443333333333333222 22222222333334567889999988777 No 57 >protein:vir:9309 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803287;genbank:gi:29028597;genbank:GeneID:1258044 Probab=98.48 E-value=1.2e-07 Score=58.54 Aligned_cols=291 Identities=8% Similarity=-0.024 Sum_probs=156.6 Q ss_pred CcccchhHHHHhhhhhhhcccccccccchhhhhhhhhhhccCCceeccchhhHHHHHHHHHHHHHHHHHhhhhcccchhh Q lcl|NC_021342. 1 MAIKTIDAQTIQGNQWLVHKGYVSRNGDQWVINNTALDAIGNPNIMLDADGGIAFYISQLAGIEATVYETPYGDITYRFD 80 (354) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~am~a~~~~~~~~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~ 80 (354) |.....+++ .|..+...++.-. ..-++..+.++..+- +.+...+++.....-..+++ T Consensus 4 ~~~~~~~~~-------------------~f~~~~~~~~~~~-a~~~~~~~~~~~liP---~~~~~~ii~~~~~~s~l~~l 60 (324) T protein:vir:93 4 TQKLKLNLQ-------------------HFASNNVKPQVFN-PDNVMMHEKKDGTLL---NDFTTPILQEVMENSKIMQL 60 (324) T ss_pred hHHHHHHHH-------------------HHHHhhhhhhhcc-cccccccCCCcceec---hhHHHHHHHHHHhhchhhhh Confidence 111111111 1222223333221 111122222233333 34456677766666667776 Q ss_pred ccccCCCCCceeEEEEEeeccccceeEecCCCcccceeeeccceeEEEEEEEEeeEeecHHHHHHHHHhCCCcchHHHHH Q lcl|NC_021342. 81 VPMAANIPEYADTWMYRSYDGVTMGKFIGANGQDLPRVAQSAQMHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQARL 160 (354) Q Consensus 81 v~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~v~~~~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~a 160 (354) ..+.. .+. ..+.+.+....+.+.+++.+ ..+|..+...+........++..+.+|.+=++.+ ..++...-... 
T Consensus 61 ~~~~~-~~~--~~~~ip~~~~~~~a~~v~Eg-~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds---~~~l~~~i~~~ 133 (324) T protein:vir:93 61 GKYEP-MEG--TEKKFTFWADKPGAYWVGEG-QKIETSKATWVNATMRAFKLGVILPVTKEFLNYT---YSQFFEEMKPM 133 (324) T ss_pred cceee-ccC--CceEEEEEecCcceeeecCC-ccccccccceeEEEEEeEEEEEeehhhHHHHhcc---hHHHHHHHHHH Confidence 65542 222 23456666677788888764 5588888888888889999998888887555544 24677888888 Q ss_pred HHHHHHHHhhheeeeeehhhC-ceeeeecCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCcccccEEEeCHHHH Q lcl|NC_021342. 161 AFRGAEEHSQSVAYFGDASRG-MYGLFNNPNVTLSSATKDYKTMNGQELFNMLNAPIFSVINLSRRFHVPNTALMFPDLW 239 (354) Q Consensus 161 A~~~~a~~~n~~~f~G~~~~g-i~GLlN~p~~~~~~~~~~W~~~T~~ei~~di~~~~~~l~~~s~g~~~p~~L~l~p~~~ 239 (354) .++++++.+|+.+++|+...+ ..|+++.......... ...-++||.+++.++... ...+..++++|+.| T Consensus 134 l~~aia~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~-------~~~~~~~i~~~~~~l~~~---~~~~~~~v~n~~~~ 203 (324) T protein:vir:93 134 IAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIK-------GDFTQDNIIDLEALLEDD---ELEANAFISKTQNR 203 (324) T ss_pred HHHHHHHHHHHHHhcCCCCCCcCccccccccccceecc-------ccccHHHHHHHHHhhhhc---cCCCCEEEEcHHHH Confidence 999999999999999975432 2455544332211111 112267888888887652 34567899999999 Q ss_pred HHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeeeeeeeccccccccccCcccEEEEEEcCcceEEEeeCchhhh Q lcl|NC_021342. 240 NQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQLDAAELAANGVSNSNKPRYMVYDKSDRNLAMANPIPFRM 319 (354) Q Consensus 240 ~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~~g~g~~g~d~~v~y~~~~~~~~~~vp~~~~~ 319 (354) ..|.+.. +..+.-++ .........|.| ...... ...++..+++-+ ...+.+....+++. 
T Consensus 204 ~~L~~l~--d~~G~~~~----~~~~~~~l~G~P-------Vv~~~~------~~~~~~~i~~gd--fs~~~~~~~~~~~i 262 (324) T protein:vir:93 204 SLLRKIV--DPETKERI----YDRNSDSLDGLP-------VVNLKS------SNLKRGELITGD--FDKLIYGIPQLIEY 262 (324) T ss_pred HHHHHhh--CCCCCeee----cCCCCCccccee-------eEeecC------CCCCcceEEEEe--cceEEEEEecCcEE Confidence 9996532 33332111 111111122332 221111 111222222222 22222222222222 Q ss_pred cccc------------------ccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 320 LAPQ------------------MASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 320 ~~~~------------------~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) -... .++ ...+.+..++ |+.+.+|.|++++..| T Consensus 263 ~~~~~~~~~~~~~~~~~~~~~f~~n-~~~~r~~~r~-d~~v~~~~a~~~l~~a 313 (324) T protein:vir:93 263 KIDETAQLSTVKNEDGTPVNLFEQD-MVALRATMHV-ALHIADDKAFAKLVPA 313 (324) T ss_pred EEeecccccccccccccchhhhhcC-cEEEEEEEEe-ccEEecccceEEEecc Confidence 1110 112 2456677777 4678889999999988 No 58 >protein:vir:99749 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004307;genbank:gi:122891761;genbank:GeneID:4712304 Probab=98.47 E-value=1.7e-07 Score=57.70 Aligned_cols=294 Identities=7% Similarity=-0.037 Sum_probs=155.6 Q ss_pred ccchhHHHHhhhhhhhcccccccccchhhhhhhhhhhccCCceeccchhhHHHHHHHHHHHHHHHHHhhhhcccchhhcc Q lcl|NC_021342. 3 IKTIDAQTIQGNQWLVHKGYVSRNGDQWVINNTALDAIGNPNIMLDADGGIAFYISQLAGIEATVYETPYGDITYRFDVP 82 (354) Q Consensus 3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~am~a~~~~~~~~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~ 82 (354) .|.-+- .+..-..|.....+++.-. ...++..+.++..+- +.+...+++.....-..++++. 
T Consensus 1 ~~k~~~--------------~~~~~~~~~~~~~~~~~~~-a~~~~~~~~~~~lip---~~~~~~ii~~~~~~s~l~~~~~ 62 (324) T protein:vir:99 1 MEQTQK--------------LKLNLQHFASNNVKPQVFN-PDNVMMHEKKDGTLL---NDFTTPILQEVMENSKIMRLGK 62 (324) T ss_pred CCCchH--------------hhHHHHHHHHHhhhhhhcc-ccceeccCCCcceec---hhHHHHHHHHHHhhchhhhhcc Confidence 111111 1101111111111222111 111222222223333 3445677777666666777766 Q ss_pred ccCCCCCceeEEEEEeeccccceeEecCCCcccceeeeccceeEEEEEEEEeeEeecHHHHHHHHHhCCCcchHHHHHHH Q lcl|NC_021342. 83 MAANIPEYADTWMYRSYDGVTMGKFIGANGQDLPRVAQSAQMHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQARLAF 162 (354) Q Consensus 83 v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~v~~~~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~ 162 (354) +.. .+.+ ++.+......+.+.|++.. ..+|..+...+........++....+|.+-++.+. .++...-....+ T Consensus 63 ~~~-~~~~--~~~~p~~~~~~~a~~v~Eg-~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~---~~l~~~i~~~l~ 135 (324) T protein:vir:99 63 YEP-MEGT--EKKFTFWADKPGAYWVGEG-QKIETSKATWVNATMRAFKLGVILPVTKEFLNYTY---SQFFEEMKPMIA 135 (324) T ss_pred eee-ccCC--ceEEEEEecCcceeEeccC-ccccccccceeEEEEeeEEEEEeehhhHHHHhcch---HHHHHHHHHHHH Confidence 543 2222 3556666667788898764 55888888888889999999998898876555543 467788888899 Q ss_pred HHHHHHhhheeeeeehhhC-ceeeeecCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCcccccEEEeCHHHHHH Q lcl|NC_021342. 163 RGAEEHSQSVAYFGDASRG-MYGLFNNPNVTLSSATKDYKTMNGQELFNMLNAPIFSVINLSRRFHVPNTALMFPDLWNQ 241 (354) Q Consensus 163 ~~~a~~~n~~~f~G~~~~g-i~GLlN~p~~~~~~~~~~W~~~T~~ei~~di~~~~~~l~~~s~g~~~p~~L~l~p~~~~~ 241 (354) +++++.+|+.+++|+...+ ..|+++........+. ...-+++|.+++.+|.. ....+..++++|..|.. 
T Consensus 136 ~ai~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~-------~~~~~~~i~~~~~~l~~---~~~~~~~~v~n~~~~~~ 205 (324) T protein:vir:99 136 EAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIK-------GDFTQDNIIDLEALLED---DELEANAFISKTQNRSL 205 (324) T ss_pred HHHHHHHHHHhhhcCCCCccCccccccccccceecc-------ccCCHHHHHHHHHhhhh---ccCCCCEEEEcHHHHHH Confidence 9999999999999976542 2455543332111111 11226788888888754 23456689999999999 Q ss_pred HhhcccCCCCCchHHHHHHhhCcccccccccceeeeeeeeeeccccccccccCcccEEEEEEcCcceEEEeeCchhhhcc Q lcl|NC_021342. 242 ANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQLDAAELAANGVSNSNKPRYMVYDKSDRNLAMANPIPFRMLA 321 (354) Q Consensus 242 L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~~g~g~~g~d~~v~y~~~~~~~~~~vp~~~~~~~ 321 (354) |.+-. +..+.-++ . .+.+-++...|....... ..++..+++-+ ...+.+....+++.-. T Consensus 206 L~~l~--d~~g~~~~---~--------~~~~~~l~G~PVv~~~~~------~~~~~~~i~gd--~~~~~~~~~~~~~i~~ 264 (324) T protein:vir:99 206 LRKIV--DPETKERI---Y--------DRNSDTLDGLPVVNLKSS------NLKRGELITGD--FDKLIYGIPQLIEYKI 264 (324) T ss_pred HHHhh--cCCCceee---c--------CCCCccccceeEEeecCC------CCCcceEEEEe--cccEEEEEecCcEEEE Confidence 86532 32332111 0 111112222222211111 11222222212 2222232223222211 Q ss_pred --------cc----------ccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 322 --------PQ----------MASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 322 --------~~----------~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) .. 
.++ ...+.++.+++ +.+.+|.|++++..+ T Consensus 265 ~~~~~~~~~~~~~~~~~~~f~~~-~~~~r~~~r~d-~~v~~~~a~~~lt~a 313 (324) T protein:vir:99 265 DETAQLSTVKNEDGTPVNLFEQD-MVALRATMHVA-LHIADDKAFAKLVPA 313 (324) T ss_pred eecccccccccccccchhhhhcC-cEEEEEEEEEc-cEEecccceEEEEec Confidence 00 011 24556677775 666689999999999 No 59 >protein:vir:4226 Length: 326 # NCBI annotation: observed 35.2Kd protein # Family: family:all:507 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039681;swissprot:sw:q05223;genbank:gi:9625447;uniprot:Q05223;genbank:GeneID:2942929 Probab=98.46 E-value=1e-07 Score=58.87 Aligned_cols=306 Identities=8% Similarity=-0.085 Sum_probs=151.8 Q ss_pred ccccchhhhhhhhhhhccCCceeccchhhHHHHHHHHHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeecccc Q lcl|NC_021342. 24 SRNGDQWVINNTALDAIGNPNIMLDADGGIAFYISQLAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYDGVT 103 (354) Q Consensus 24 ~~~~~~~~~~~~am~a~~~~~~~~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~G 103 (354) +..+++-.......+. ..++++..++.+..+. +.+-.++++...+.-..+++..+.. .+ .....+.+....+ T Consensus 1 ~~~~~~r~~~~~~~~e--~~a~~~~~~~~g~~ip---~~~~~~ii~~~~~~s~i~~~~~~~~-~~--~~~~~~p~~~~~~ 72 (326) T protein:vir:42 1 MAVNPDRTTPFLGVND--PKVAQTGDSMFEGYLE---PEQAQDYFAEAEKISIVQQFAQKIP-MG--TTGQKIPHWTGDV 72 (326) T ss_pred CCCCccchhhhcCcch--hhheeccccCCcceec---hhhHHHHHHHHHhcchhhhhcceee-cc--CCceEEEEEeCCc Confidence 1111111111111111 1122222222333444 3344667777777777777665432 22 2234555666677 Q ss_pred ceeEecCCCcccceeeeccceeEEEEEEEEeeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehhhCce Q lcl|NC_021342. 104 MGKFIGANGQDLPRVAQSAQMHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDASRGMY 183 (354) Q Consensus 104 ~a~~~~~~~~dip~v~~~~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~gi~ 183 (354) .+.+++. +..+|..+...+......+.++..+.+|.+=++.+ ..++..--....+++++..+|+.+|+|+...+-. 
T Consensus 73 ~a~~v~E-g~~~~~~~~~f~~i~~~~~k~~~~v~iS~ell~~s---~~~~~~~i~~~l~~a~~~~~d~a~l~G~gs~~p~ 148 (326) T protein:vir:42 73 SASWIGE-GDMKPITKGNMTSQTIAPHKIATIFVASAETVRAN---PANYLGTMRTKVATAFAMAFDNAAINGTDSPFPT 148 (326) T ss_pred ceEEecC-CccccccccceeEEEEeeEEEEEeehhhHHHHhcC---HHHHHHHHHHHHHHHHHHHHHHHhhcccCCCccc Confidence 7888865 46689888888888899999998888887544433 4578888888899999999999999998866667 Q ss_pred eeeecCCcccc-cccccccccCHHHHHHH--HHHHHHHHHHHhCCcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHH Q lcl|NC_021342. 184 GLFNNPNVTLS-SATKDYKTMNGQELFNM--LNAPIFSVINLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFM 260 (354) Q Consensus 184 GLlN~p~~~~~-~~~~~W~~~T~~ei~~d--i~~~~~~l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~ 260 (354) |+++.+..... .......+ .+-...| +..++..+. ........++|+|..+..|.+-. +..+.-++.--. T Consensus 149 gi~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~---~~~~~~a~~v~n~~~~~~L~~lk--d~~G~~l~~~~~ 221 (326) T protein:vir:42 149 FLAQTTKEVSLVDPDGTGSN--ADLTVYDAVAVNALSLLV---NAGKKWTHTLLDDITEPILNGAK--DKSGRPLFIEST 221 (326) T ss_pred cccccccccceeeccccccc--ccchhHHHHHHHHHhhhh---hhccCccEEEEeHHHHHHHHHhh--ccCCceeecccc Confidence 88876653221 11112211 1112222 233333332 22334567999999999997532 333321111000 Q ss_pred hhCcccccccccceeeeeeeeeecccccccc-c-cCcccEEEEEEcCcceEEEeeCchhh--hcccc---c-----cCce Q lcl|NC_021342. 261 EANSYTLLTGNELDIQIRFQLDAAELAANGV-S-NSNKPRYMVYDKSDRNLAMANPIPFR--MLAPQ---M-----ASLG 328 (354) Q Consensus 261 ~n~~~~~~~g~~l~I~~~~~L~~~~~~~~g~-g-~~g~d~~v~y~~~~~~~~~~vp~~~~--~~~~~---~-----~~l~ 328 (354) .+.......+ ..+...|............ . .+.-..++... .+-+.+.+-.... ....+ + ++ . 
T Consensus 222 ~~~~~~~~~~--~~l~G~pv~~~~~~~~~~~~~~~Gd~s~~~~~~--~~~~~v~~~~e~~~~~~~~~~~~~~~~~~~d-~ 296 (326) T protein:vir:42 222 YTEENSPFRL--GRIVARPTILSDHVASGTVVGYQGDFRQLVWGQ--VGGLSFDVTDQATLNLGTPQAPNFVSLWQHN-L 296 (326) T ss_pred ccCccccccC--ceeeeeeEEEcCCCCCCceEEEEeecceEEEEE--ecceEEEEeecceeeecccccccchhhhhcC-c Confidence 0000000000 1122223222222111000 0 00111111111 1222222211111 10000 0 11 2 Q ss_pred eEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 329 ITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 329 ~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) ..+.+..++ ++.+.+|.|++++... T Consensus 297 ~~~r~~~~~-d~~v~~~~a~~~l~~~ 321 (326) T protein:vir:42 297 VAVRVEAEY-AFHCNDKDAFVKLTNV 321 (326) T ss_pred EEEEEEEEe-ccEEecccceEEEeec Confidence 455677777 5788999999998777 No 60 >protein:vir:97148 Length: 324 # NCBI annotation: ORF010 # Family: family:all:507 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239726;genbank:gi:66394880;genbank:GeneID:5130881 Probab=98.45 E-value=2.3e-07 Score=56.99 Aligned_cols=296 Identities=6% Similarity=-0.053 Sum_probs=154.0 Q ss_pred hhcccccccccchhhhhhhhhhhccCCceeccchhhHHHHHHHHHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEE Q lcl|NC_021342. 17 LVHKGYVSRNGDQWVINNTALDAIGNPNIMLDADGGIAFYISQLAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMY 96 (354) Q Consensus 17 ~~~~~~~~~~~~~~~~~~~am~a~~~~~~~~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~ 96 (354) ...++..+..--.|.......+.. .....+.++.++..+- +.+...+++........+.++.+.. .+ ..++.+ T Consensus 1 ~~~~~~~~~~~~~f~~~~~~~~~~-~a~~~~~~~~~~~~iP---~~~~~~ii~~~~~~s~l~~~~~~~~-~~--~~~~~i 73 (324) T protein:vir:97 1 MEQTQKLKLNLQHFASNNVKPQVF-NPDNVMMHEKKDGTLM---NEFTTPILQEVMENSKIMQLGKYEP-ME--GTEKKF 73 (324) T ss_pred CccchhHHHHHHHHHHhhhhhhhh-ccccccccCCCcceec---hhHHHHHHHHHHhhcchhhhcceee-cc--CCceEE Confidence 111111110000111111111110 0000111222333333 3445667777777777777765542 22 234566 Q ss_pred EeeccccceeEecCCCcccceeeeccceeEEEEEEEEeeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeee Q lcl|NC_021342. 
97 RSYDGVTMGKFIGANGQDLPRVAQSAQMHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFG 176 (354) Q Consensus 97 ~~~~~~G~a~~~~~~~~dip~v~~~~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G 176 (354) ......+.+.|++.+. .+|..+...+........++.-..+|.+=++.+ ..++...-....++++++.+|+.+++| T Consensus 74 p~~~~~~~a~~v~Eg~-~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds---~~~l~~~i~~~l~~aia~~~d~a~l~G 149 (324) T protein:vir:97 74 TFWADKPGAYWVGEGQ-KIETSKATWVNATMRAFKLGVILPVTKEFLNYT---YSQFFEEMKPMIAEAFYKKFDEAGILN 149 (324) T ss_pred EEEecCcceeEeccCc-cccccccceeEEEEeeEEEEEeehhhHHHHhcc---hHHHHHHHHHHHHHHHHHHHHHHhhcc Confidence 6666778889998764 588888888888899999998888887545443 357888888999999999999999999 Q ss_pred ehhhC-ceeeeecCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCcccccEEEeCHHHHHHHhhcccCCCCCchH Q lcl|NC_021342. 177 DASRG-MYGLFNNPNVTLSSATKDYKTMNGQELFNMLNAPIFSVINLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRTV 255 (354) Q Consensus 177 ~~~~g-i~GLlN~p~~~~~~~~~~W~~~T~~ei~~di~~~~~~l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tv 255 (354) +...+ ..|+++..........+ ..-+++|.+++.++... ...+.+++|+|..|..|.+.. +..+..+ T Consensus 150 ~g~~~~~~gi~~~~~~~~~~~~~-------~~~~~~i~~~~~~l~~~---~~~~~~~v~n~~~~~~L~~lk--d~~g~~~ 217 (324) T protein:vir:97 150 QGNNPFGKSIAQSIEKTNKVIKG-------DFTQDNIIDLEALLEDD---ELEANAFISKTQNRSLLRKIV--DPETKER 217 (324) T ss_pred CCCCccCccccccccccceeccc-------cCCHHHHHHHHHhhhhc---cCCCCEEEEcHHHHHHHHHhh--cCCCcee Confidence 76542 35566544322211111 11257788888887642 345678999999999987532 3333221 Q ss_pred HHHHHhhCcccccccccceeeeeeeeeeccccccccccCcccEEEEEEcC------cceEEEeeCchhhhccc-----c- Q lcl|NC_021342. 256 MQHFMEANSYTLLTGNELDIQIRFQLDAAELAANGVSNSNKPRYMVYDKS------DRNLAMANPIPFRMLAP-----Q- 323 (354) Q Consensus 256 l~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~~g~g~~g~d~~v~y~~~------~~~~~~~vp~~~~~~~~-----~- 323 (354) +. ........|.| ........ .++..++.-+.+ .+.+.+.+-........ . 
T Consensus 218 ~~----~~~~~tl~G~P-------V~~~~~~~------~~~~~~~~gd~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~ 280 (324) T protein:vir:97 218 IY----DRNSDTLDGLP-------VVNLKSSN------LKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTP 280 (324) T ss_pred ec----CCCCcccccee-------eEeecCCC------CCcceEEEEecccEEEEEecCcEEEEeecccccccccccccc Confidence 10 00111122333 22111110 111111111111 11122222111110000 0 Q ss_pred ----ccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 324 ----MASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 324 ----~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) .++ ...+.+.++++ +.+.+|.|++++..+ T Consensus 281 ~~~f~~d-~~~~r~~~r~d-~~v~~~~a~~~l~~~ 313 (324) T protein:vir:97 281 VNLFEQD-MVALRATMHVA-LHIADDKAFAKLVPA 313 (324) T ss_pred hhhhhcC-cEEEEEEEEec-cEEecccceEEEEec Confidence 011 24555677775 556679999999999 No 61 >protein:vir:100247 Length: 425 # NCBI annotation: gp76 # Family: family:all:21 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355412;genbank:gi:77864702;genbank:GeneID:3725969 Probab=98.45 E-value=5.1e-08 Score=60.56 Aligned_cols=325 Identities=10% Similarity=0.059 Sum_probs=163.5 Q ss_pred Ccccchh--------------------HHHHh-hhhhhhcccccccccchhhhhhhhhhhcc-----CCceeccchhhHH Q lcl|NC_021342. 1 MAIKTID--------------------AQTIQ-GNQWLVHKGYVSRNGDQWVINNTALDAIG-----NPNIMLDADGGIA 54 (354) Q Consensus 1 ~~~~~~~--------------------~~~~~-~~~~~~~~~~~~~~~~~~~~~~~am~a~~-----~~~~~~da~~~~~ 54 (354) -.++++| -.... ...-....+.......++. -|+.... ...++....+.+. T Consensus 63 ~~~~~~e~~~~~~~~~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~af~~~l~~~e~~~al~~~t~~~gG 139 (425) T protein:vir:10 63 AGLPTSDALAKVDKVSADLEALQAAVDEANIKIAAAQMGANGVKPLRDPEYT---EAFKAHVKRGDVQAALNKGEDSEGG 139 (425) T ss_pred hhhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccHHHH---HHHHHHhhhhhhHHHhhcCcCCCCc Confidence 0000000 00000 0000000000000111110 0111100 0011111122334 Q ss_pred HHHHHHHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeeccccceeEecCCCcccceeee-ccceeEEEEEEEE Q lcl|NC_021342. 
55 FYISQLAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYDGVTMGKFIGANGQDLPRVAQ-SAQMHTVPLGYAG 133 (354) Q Consensus 55 fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~v~~-~~~~~~~pv~~~~ 133 (354) +++. +.+.+.+++.....-..+++..+.. .+.+. ..+.+....+.+.|++.+ ..+|-.+. ..+......+.++ T Consensus 140 ~lvP--~~~~~~ii~~~~~~s~l~~l~~~~~-~~~~~--~~~~~~~~~~~a~wv~E~-~~~~~~~~~~f~~v~~~~~k~~ 213 (425) T protein:vir:10 140 YLTP--IEWDRTITNKLVLISPMRQLCRVQP-VSKAG--FSKLFNMGGTTSGWVGEA-SQRPQTNAATFQPLSFASGEIY 213 (425) T ss_pred eecc--HhHHHHHHHHHHhhhhhhhhceeee-ccCCc--eEEEEEcCCcceeeeccc-cccccccccccceeeeeheeeE Confidence 4554 5667788888877777777765432 22222 333334445566777665 44565553 5677777888887 Q ss_pred eeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehhhCceeeeecCCcccccccccc---c----ccCHH Q lcl|NC_021342. 134 NECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDASRGMYGLFNNPNVTLSSATKDY---K----TMNGQ 206 (354) Q Consensus 134 ~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~gi~GLlN~p~~~~~~~~~~W---~----~~T~~ 206 (354) .-..+|.+=|+ ....++...-....++++++.+|+.+++|+......|++|.+..........| . ..+.. T Consensus 214 ~~i~iS~ell~---ds~~~l~~~i~~~la~ai~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~ 290 (425) T protein:vir:10 214 ANPAATQQILD---DAEIDLESWLATEVQTEFAKQEGKAFLAGDGTNKPNGLLTYIAGGANAAKHPFGAIEVVNSGAAAD 290 (425) T ss_pred eehHhHHHHHh---cchhHHHHHHHHHHHHHHHHHHHhhhhcccCCCCcceeeecccccccccccccccccccccccccc Confidence 77777664443 33567888888999999999999999999887788999998875443322211 1 11223 Q ss_pred HHHHHHHHHHHHHHHHhCCcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeeeeeeeccc Q lcl|NC_021342. 207 ELFNMLNAPIFSVINLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQLDAAEL 286 (354) Q Consensus 207 ei~~di~~~~~~l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~ 286 (354) --+++|.+++..|... ....-.++|+|..|..|..- .+..|.-++ ..+. ..|.+-.|...|.+..... 
T Consensus 291 ~~~d~l~~l~~~l~~~---~~~~a~~vmn~~~~~~L~~l--kD~~G~~l~----~~~~---~~g~~~~l~G~PV~~~~~~ 358 (425) T protein:vir:10 291 ITSDGIIDLVYDLPSA---FTGNARFAMNRNTQRQVRKL--KDGQGNYLW----QPSY---VAGQPATLAGYPVTEVPDM 358 (425) T ss_pred ccHHHHHHHHhhhhhh---hccCCEEEEchHHHHHHHHh--hcCCCceee----ccCc---cCCCCceecceeeEEecCc Confidence 3467777787776532 23445789999999998753 244443221 1111 1233334444444444432 Q ss_pred cccccccCcccEEEEEEcCcceEEEeeCchhhhccccccC-ceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 287 AANGVSNSNKPRYMVYDKSDRNLAMANPIPFRMLAPQMAS-LGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 287 ~~~g~g~~g~d~~v~y~~~~~~~~~~vp~~~~~~~~~~~~-l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) ... + .+.+.++.-+.+ +.+.+..-..++.+.-.+-. -...+..+.|++ ..+.+|.|++.+-++ T Consensus 359 p~~--~-~~~~~i~~Gd~~-~~~~i~~~~~~~v~~d~~~~~~~~~~~~~~r~d-~~v~~~~A~~~l~~~ 422 (425) T protein:vir:10 359 PDV--A-ANSTPILFGDFQ-QTYLIIDRIGVRVLRDPYTAKPYVLFYTTKRVG-GGLLNPEPMRAMKVA 422 (425) T ss_pred CCc--c-CCccEEEEEehh-ccEEEEEecceEEEecccccCCcEEEEEEEEec-cEeecccceEEEEee Confidence 222 2 223332222222 22222222223332211111 124555677775 667779999999999 No 62 >protein:vir:4456 Length: 401 # NCBI annotation: Major capsid protein precursor # Family: family:all:21 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700379;genbank:gi:23505451;genbank:GeneID:955658 Probab=98.39 E-value=2.5e-08 Score=62.23 Aligned_cols=326 Identities=7% Similarity=0.045 Sum_probs=164.2 Q ss_pred CcccchhHHHHhhhhhhhcccccccccchhhh---hh------hhhhhccCCceeccchhhHHHHHHHHHHHHHHHHHhh Q lcl|NC_021342. 1 MAIKTIDAQTIQGNQWLVHKGYVSRNGDQWVI---NN------TALDAIGNPNIMLDADGGIAFYISQLAGIEATVYETP 71 (354) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~------~am~a~~~~~~~~da~~~~~fl~~~L~~Id~~v~e~~ 71 (354) -+++.++.+....+.-... .-.....+... .. ..+...-..++....++.+.+++. +.+.+.|++.. 
T Consensus 56 ~~~~~~~~~~~~~~~~~~~--~~~~~~~e~~~a~~~~lr~~~~~~~~~~e~~a~~~~~~~~GG~~iP--~~~~~~ii~~~ 131 (401) T protein:vir:44 56 NLKSDLEKELLELKRPARG--AQNKVAAEHKDAFVGFLRKGREDGLRDLERKALQVGTDEDGGYAVP--EELDRSILSLL 131 (401) T ss_pred HHHHHHHHHHHHhhccccc--cccchhHHHHHHHHHHHhhhhhhhhHHHHHHHhhcCCCCCCceecc--HhHHHHHHHHH Confidence 1222223222221111000 00000000000 00 000000000111111222334444 56677888877 Q ss_pred hhcccchhhccccCCCCCceeEEEEEeeccccceeEecCCCcccceee-eccceeEEEEEEEEeeEeecHHHHHHHHHhC Q lcl|NC_021342. 72 YGDITYRFDVPMAANIPEYADTWMYRSYDGVTMGKFIGANGQDLPRVA-QSAQMHTVPLGYAGNECHYTLDEMRKSAAMN 150 (354) Q Consensus 72 ~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~v~-~~~~~~~~pv~~~~~~~~~~~~El~~a~~~g 150 (354) ......+.+..+.. .+.+ ...+.+......+.|.+... ..|..+ ...+......+.++.-+.+|..=|+.+ . T Consensus 132 ~~~~~l~~~~~~~~-~~~~--~~~~~~~~~~~~a~wv~E~~-~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds---~ 204 (401) T protein:vir:44 132 KDEVVMRQEATVIT-VGGS--DYKKLVNLGGTASGWVGETD-TRSQTATSRLGLIEPFMGEIYGNPQATQKMLDDA---F 204 (401) T ss_pred Hhhhhhhhhceeee-cCCC--ceEEEEecCCccceeecccc-ccCccccccceeeeeehhheeeehhhhHHHHhcc---h Confidence 77777777665432 2222 23444444445566765543 345444 246666777777877777776544433 4 Q ss_pred CCcchHHHHHHHHHHHHHhhheeeeeehhhCceeeeecCCcccccccccccc------cC-HHHHHHHHHHHHHHHHHHh Q lcl|NC_021342. 151 MPIDAEQARLAFRGAEEHSQSVAYFGDASRGMYGLFNNPNVTLSSATKDYKT------MN-GQELFNMLNAPIFSVINLS 223 (354) Q Consensus 151 ~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~gi~GLlN~p~~~~~~~~~~W~~------~T-~~ei~~di~~~~~~l~~~s 223 (354) .++...-....+.++++.+|..+++|+......|+|+.+.....+....|.. .+ ..--+++|.+++..|... 
T Consensus 205 ~~l~~~i~~~la~ai~~~~~~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~d~i~~~~~~l~~~- 283 (401) T protein:vir:44 205 FNVEAWINSELATEFAEQEEIAFTTGDGTKKPKGFLAYESTEESDKARAFGKLQHIVSGEATAVTADAIIKLIYTLRKA- 283 (401) T ss_pred HHHHHHHHHHHHHHHHHHHHhhhhccCCCCccceeeccccccccccccccccccccccccccccCHHHHHHHHHhcchh- Confidence 5788888888999999999999999998777899999888654332222211 11 112267888888877542 Q ss_pred CCcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeeeeeeeccccccccccCcccEEEEEE Q lcl|NC_021342. 224 RRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQLDAAELAANGVSNSNKPRYMVYD 303 (354) Q Consensus 224 ~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~~g~g~~g~d~~v~y~ 303 (354) + ...-.++|++..|..|..- -+..|.-++. .+ ...|.+-.|...|.+........+ .+.+. |+|- T Consensus 284 -~-~~~a~~v~n~~~~~~L~~l--kd~~G~~l~~----~~---~~~g~~~~l~G~PVv~~~~~p~~~---~~~~~-i~~G 348 (401) T protein:vir:44 284 -H-RTGAKFMMNNNSLFAIRLL--KDTEGNYLWR----PG---LELGQPSSLAGYGIAENEQMPDIA---ADAKA-IAFG 348 (401) T ss_pred -h-hcCCEEEEcHHHHHHHHHh--hccCCceeec----CC---cCCCCCceecceeeEEecCcCCcc---CCccE-EEEe Confidence 2 2234689999999999653 2444432211 00 012334345555555444332221 12222 3332 Q ss_pred cCcceEEEeeCchhhhccccc-cCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 304 KSDRNLAMANPIPFRMLAPQM-ASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 304 ~~~~~~~~~vp~~~~~~~~~~-~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) +=.+.+.+.--+.++.+--.. 
..-...+.++.|++ +.+..|.|++.+.++ T Consensus 349 d~~~~~~i~~~~~~~~~~~~~~~~~~v~~~a~~r~d-~~~~~~~a~~~l~~~ 399 (401) T protein:vir:44 349 NFKRGYTIVDRIGTRILRDPYTNKPFVGFYTTKRTG-GMLVDSQAIKLLKIA 399 (401) T ss_pred ehhccEEEEEecceEEeeeccccCCcEEEEEEEEec-cEEecccceEEEEee Confidence 112323222222333321111 11234566778886 556669999999999 No 63 >protein:vir:108211 Length: 318 # NCBI annotation: gp9 # Family: family:all:6420 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552338;genbank:gi:160700658;genbank:GeneID:5758931 Probab=98.39 E-value=1.5e-08 Score=63.44 Aligned_cols=280 Identities=10% Similarity=0.044 Sum_probs=158.2 Q ss_pred hhhccCCceeccchhhHHHHHHHH----HHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeeccc---cceeEec Q lcl|NC_021342. 37 LDAIGNPNIMLDADGGIAFYISQL----AGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYDGV---TMGKFIG 109 (354) Q Consensus 37 m~a~~~~~~~~da~~~~~fl~~~L----~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~---G~a~~~~ 109 (354) |-+ |.-.+.+..++....++| +.|+.++.+.....+.+..||--.+ .....++.|....+. |.+..+. T Consensus 1 ~~~---~~~i~s~~~~~~itv~~ll~~P~~I~~~i~e~~~~~~iad~lf~~~~--a~~~~~v~f~~~~p~~~~~d~e~Va 75 (318) T protein:vir:10 1 MTA---PTGIVSVSDGPAITVRELVGNPLWIPTALKKMMVNQFISESLFRNGG--ANPNGVVAYNEGNPSFLEDDVADVA 75 (318) T ss_pred CCC---CCcceeeecCCceehHHhhCCchhHHHHHHHHHhccchhhhhhhccc--ccccceeEEEecccccccCcHhhcc Confidence 322 332333334456666666 6788899998888888888885321 223335555544433 5666565 Q ss_pred CCCcccceeeeccceeEE-EEEEEEeeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehhhCceeeeec Q lcl|NC_021342. 110 ANGQDLPRVAQSAQMHTV-PLGYAGNECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDASRGMYGLFNN 188 (354) Q Consensus 110 ~~~~dip~v~~~~~~~~~-pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~gi~GLlN~ 188 (354) .+++ +|.++...+..++ ....++.++++|.+.+. 
+.+.+.-.+....++++..++.|+.++ ..|.+ T Consensus 76 EggE-iP~~~~~~G~~~ia~~~K~G~~~~vS~Em~~---~n~~~~v~r~~~~l~Nti~r~~d~~a~---------dal~s 142 (318) T protein:vir:10 76 EFGE-IPVSAGARGLPRTAFAVKKALGVRVSKEMID---ENRVGAVNDQMLQLRNTFIRANDRSAK---------ALLQS 142 (318) T ss_pred Cccc-ccccCCCCCchhhhhhehhccceeccHHHHh---hcChhHHHHHHHHHHHHHHHHHHHHHH---------HHHhc Confidence 5544 7888877765555 55689999999875544 346677788888888888888877744 34666 Q ss_pred CCcccccccccccccCHHHHHHHHHHHHHHHH-------------HHhCCcccccEEEeCHHHHHHHhhcccCCCCCchH Q lcl|NC_021342. 189 PNVTLSSATKDYKTMNGQELFNMLNAPIFSVI-------------NLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRTV 255 (354) Q Consensus 189 p~~~~~~~~~~W~~~T~~ei~~di~~~~~~l~-------------~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tv 255 (354) ++++...++..|.+. .....|+..+...+. ...+.-+.|++|+|+|..|..|.+- ..+ T Consensus 143 a~t~~~~~s~~w~~~--~~~~~d~~~A~e~v~~a~~~~~~a~~~~~~~~~GY~pdtIVlhP~~~~~l~~n-------~~~ 213 (318) T protein:vir:10 143 PIVPTLAVPTAWDNG--GKVRTDIAIAIEQISTAAPTAYPAGVGSSDEYFGFIPDTIVMHYALLPILMDN-------ENF 213 (318) T ss_pred cccccccCCcCCCCc--ccccccchhhhhhhhhhhhhhhhhhhhhhhhccCccceeeEECHHHHHHHhcc-------hhh Confidence 666666667777752 212233333332221 0112346799999999999999642 122 Q ss_pred HHHHHhh-Cccccccc-c---cceeeeeeeeeeccccccccccCcccEEEEEEc-CcceEEEeeCchhhhccccc----- Q lcl|NC_021342. 256 MQHFMEA-NSYTLLTG-N---ELDIQIRFQLDAAELAANGVSNSNKPRYMVYDK-SDRNLAMANPIPFRMLAPQM----- 324 (354) Q Consensus 256 l~~l~~n-~~~~~~~g-~---~l~I~~~~~L~~~~~~~~g~g~~g~d~~v~y~~-~~~~~~~~vp~~~~~~~~~~----- 324 (354) .++...| ++.....+ . +-.+-...++....+- .++..+.++ ..-.+. 
-++|++..+..+ T Consensus 214 ~~~y~~~a~~~~~~~~~tg~~~g~~lGl~vi~s~~~p--------~~~alvlq~g~vG~~~--d~~pl~~t~~~~egg~~ 283 (318) T protein:vir:10 214 MKVYERNANYVSTAPDWTGNFPGSVMGLNVIRSRTFP--------IDRVLIMERGTVGFYS--DTRPLQFTALYPEGNGP 283 (318) T ss_pred hhhhhccchhhhhcccccccccceeeceEEeecCccC--------CCeeEEEecCCcceee--ccccceeeecccCCCCC Confidence 2333322 22111111 0 0011112222222111 123334332 222222 345555443322 Q ss_pred ---cCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 325 ---ASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 325 ---~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) .+..|...+...+ ..-|.+|+|++..-== T Consensus 284 ~g~~~~s~~~~~~~~~-~~~V~~PkA~~~itgi 315 (318) T protein:vir:10 284 NGGPTESYRADASHKR-ALAVDQPKAALWLTGI 315 (318) T ss_pred CCCcchhhheehheee-eeeeeCcceeEEEeec Confidence 5678888887776 4889999999986533 No 64 >protein:vir:78830 Length: 324 # NCBI annotation: major head protein # Family: family:all:507 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285361;genbank:gi:148717889;genbank:GeneID:5246961 Probab=98.36 E-value=5.2e-07 Score=55.03 Aligned_cols=294 Identities=8% Similarity=-0.038 Sum_probs=155.6 Q ss_pred hhcccccccccchhhhhhhhhhhccCCceeccchhhHHHHHHHHHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEE Q lcl|NC_021342. 17 LVHKGYVSRNGDQWVINNTALDAIGNPNIMLDADGGIAFYISQLAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMY 96 (354) Q Consensus 17 ~~~~~~~~~~~~~~~~~~~am~a~~~~~~~~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~ 96 (354) -..++..+..-..|..+......- .....+..+.++..+- +.+...+++.....-..+.++++.. .+. 
.++.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~-~a~~~~~~~~~~~~iP---~~~~~~ii~~~~~~s~l~~l~~~~~-~~~--~~~~~ 73 (324) T protein:vir:78 1 MEQTQKLKLNLQHFASNNVKPQVF-NPDNVMMHEKKDGTLM---NEFTTPILQEVMENSKIMQLGKYEP-MEG--TEKKF 73 (324) T ss_pred CCcchhhhHHHHHHHHHhhhhhhh-ccccccccCcCccccc---hhHHHHHHHHHHhhchhhhhcceee-ccC--CceEE Confidence 111111111111111111110000 0111122222333333 3455677777777777777766543 222 23556 Q ss_pred EeeccccceeEecCCCcccceeeeccceeEEEEEEEEeeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeee Q lcl|NC_021342. 97 RSYDGVTMGKFIGANGQDLPRVAQSAQMHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFG 176 (354) Q Consensus 97 ~~~~~~G~a~~~~~~~~dip~v~~~~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G 176 (354) .+....+.+.+++.. ..+|..+...+........++....++.+=++.+ ..++...-....++++++.+|+.+|+| T Consensus 74 p~~~~~~~a~~v~Eg-~~~~~~~~~~~~v~~~~~k~~~~~~is~ell~ds---~~~l~~~i~~~la~ai~~~~d~a~l~G 149 (324) T protein:vir:78 74 TFWADKPGAYWVGEG-QKIETSKATWVNATMRAFKLGVILPVTKEFLNYT---YSQFFEEMKPMIAEAFYKKFDEAGILN 149 (324) T ss_pred EEEecCcceeEecCC-ccccccccceeEEEEeeEEEEEeehhhHHHHhcc---hHHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 666677788898774 5588888888888999999998888887555543 356888888899999999999999999 Q ss_pred ehhhC-ceeeeecCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCcccccEEEeCHHHHHHHhhcccCCCCCchH Q lcl|NC_021342. 177 DASRG-MYGLFNNPNVTLSSATKDYKTMNGQELFNMLNAPIFSVINLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRTV 255 (354) Q Consensus 177 ~~~~g-i~GLlN~p~~~~~~~~~~W~~~T~~ei~~di~~~~~~l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tv 255 (354) +...+ -.|+++..+....... ...-+++|.+++.++... ...+..++|+|+.|..|.+.. 
+..+..+ T Consensus 150 ~g~~~~~~gi~~~~~~~~~~~~-------~~~t~~~i~~~~~~l~~~---~~~~~~~vmn~~~~~~L~~l~--d~~G~~~ 217 (324) T protein:vir:78 150 QGNNPFGKSIAQSIEKTNKVIK-------GDFTQDNIIDLEALLEDD---ELEANAFISKTQNRSLLRKIV--DPETKER 217 (324) T ss_pred CCCCCcCccccccccccceecc-------ccccHHHHHHHHHhhhhc---cCCCCEEEEcHHHHHHHHHhh--ccCCCee Confidence 75432 3455554442221111 112367888888877542 345678999999999986532 3333221 Q ss_pred HHHHHhhCcccccccccceeeeeeeeeeccccccccccCcccEEEEEEcCcceEEEeeCchhhhccc------------- Q lcl|NC_021342. 256 MQHFMEANSYTLLTGNELDIQIRFQLDAAELAANGVSNSNKPRYMVYDKSDRNLAMANPIPFRMLAP------------- 322 (354) Q Consensus 256 l~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~~g~g~~g~d~~v~y~~~~~~~~~~vp~~~~~~~~------------- 322 (354) + .. +.+-++...|...... ...++..++.-+. ..+-+.....++.-.. T Consensus 218 ~----~~-------~~~~~l~G~PV~~~~~------~~~~~~~~~~gd~--~~~~~g~~~~~~i~~~~~~~~~~~~~~~~ 278 (324) T protein:vir:78 218 I----YD-------RNSDSLDGLPVVNLKS------SNLKRGELITGDF--DKLIYGIPQLIEYKIDETAQLSTVKNEDG 278 (324) T ss_pred e----cC-------CCCCcccceeeEeeCC------CCCCcceEEEEec--ceEEEEEecCcEEEEeecccccccccccc Confidence 1 11 1222222222222111 0112222222221 1122222222211100 Q ss_pred c-----ccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 323 Q-----MASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 323 ~-----~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) . .++ ...+.+..++ |+.+.+|.|++++-.| T Consensus 279 ~~~~~f~~d-~~~~r~~~r~-d~~v~~~~A~~~l~~a 313 (324) T protein:vir:78 279 TPVNLFEQD-MVALRATMHV-ALHIADDKAFAKLVPA 313 (324) T ss_pred cchhhhhcC-cEEEEEEEEE-ccEEecccceEEEecc Confidence 0 011 2455567777 5777779999999988 No 65 >protein:vir:96392 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239648;genbank:gi:66395381;genbank:GeneID:5132868 Probab=98.36 E-value=5.2e-07 Score=55.03 Aligned_cols=294 Identities=8% Similarity=-0.038 Sum_probs=155.6 Q ss_pred hhcccccccccchhhhhhhhhhhccCCceeccchhhHHHHHHHHHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEE Q lcl|NC_021342. 
17 LVHKGYVSRNGDQWVINNTALDAIGNPNIMLDADGGIAFYISQLAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMY 96 (354) Q Consensus 17 ~~~~~~~~~~~~~~~~~~~am~a~~~~~~~~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~ 96 (354) -..++..+..-..|..+......- .....+..+.++..+- +.+...+++.....-..+.++++.. .+. .++.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~-~a~~~~~~~~~~~~iP---~~~~~~ii~~~~~~s~l~~l~~~~~-~~~--~~~~~ 73 (324) T protein:vir:96 1 MEQTQKLKLNLQHFASNNVKPQVF-NPDNVMMHEKKDGTLM---NEFTTPILQEVMENSKIMQLGKYEP-MEG--TEKKF 73 (324) T ss_pred CCcchhhhHHHHHHHHHhhhhhhh-ccccccccCcCccccc---hhHHHHHHHHHHhhchhhhhcceee-ccC--CceEE Confidence 111111111111111111110000 0111122222333333 3455677777777777777766543 222 23556 Q ss_pred EeeccccceeEecCCCcccceeeeccceeEEEEEEEEeeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeee Q lcl|NC_021342. 97 RSYDGVTMGKFIGANGQDLPRVAQSAQMHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFG 176 (354) Q Consensus 97 ~~~~~~G~a~~~~~~~~dip~v~~~~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G 176 (354) .+....+.+.+++.. ..+|..+...+........++....++.+=++.+ ..++...-....++++++.+|+.+|+| T Consensus 74 p~~~~~~~a~~v~Eg-~~~~~~~~~~~~v~~~~~k~~~~~~is~ell~ds---~~~l~~~i~~~la~ai~~~~d~a~l~G 149 (324) T protein:vir:96 74 TFWADKPGAYWVGEG-QKIETSKATWVNATMRAFKLGVILPVTKEFLNYT---YSQFFEEMKPMIAEAFYKKFDEAGILN 149 (324) T ss_pred EEEecCcceeEecCC-ccccccccceeEEEEeeEEEEEeehhhHHHHhcc---hHHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 666677788898774 5588888888888999999998888887555543 356888888899999999999999999 Q ss_pred ehhhC-ceeeeecCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCcccccEEEeCHHHHHHHhhcccCCCCCchH Q lcl|NC_021342. 177 DASRG-MYGLFNNPNVTLSSATKDYKTMNGQELFNMLNAPIFSVINLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRTV 255 (354) Q Consensus 177 ~~~~g-i~GLlN~p~~~~~~~~~~W~~~T~~ei~~di~~~~~~l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tv 255 (354) +...+ -.|+++..+....... ...-+++|.+++.++... ...+..++|+|+.|..|.+.. 
+..+..+ T Consensus 150 ~g~~~~~~gi~~~~~~~~~~~~-------~~~t~~~i~~~~~~l~~~---~~~~~~~vmn~~~~~~L~~l~--d~~G~~~ 217 (324) T protein:vir:96 150 QGNNPFGKSIAQSIEKTNKVIK-------GDFTQDNIIDLEALLEDD---ELEANAFISKTQNRSLLRKIV--DPETKER 217 (324) T ss_pred CCCCCcCccccccccccceecc-------ccccHHHHHHHHHhhhhc---cCCCCEEEEcHHHHHHHHHhh--ccCCCee Confidence 75432 3455554442221111 112367888888877542 345678999999999986532 3333221 Q ss_pred HHHHHhhCcccccccccceeeeeeeeeeccccccccccCcccEEEEEEcCcceEEEeeCchhhhccc------------- Q lcl|NC_021342. 256 MQHFMEANSYTLLTGNELDIQIRFQLDAAELAANGVSNSNKPRYMVYDKSDRNLAMANPIPFRMLAP------------- 322 (354) Q Consensus 256 l~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~~g~g~~g~d~~v~y~~~~~~~~~~vp~~~~~~~~------------- 322 (354) + .. +.+-++...|...... ...++..++.-+. ..+-+.....++.-.. T Consensus 218 ~----~~-------~~~~~l~G~PV~~~~~------~~~~~~~~~~gd~--~~~~~g~~~~~~i~~~~~~~~~~~~~~~~ 278 (324) T protein:vir:96 218 I----YD-------RNSDSLDGLPVVNLKS------SNLKRGELITGDF--DKLIYGIPQLIEYKIDETAQLSTVKNEDG 278 (324) T ss_pred e----cC-------CCCCcccceeeEeeCC------CCCCcceEEEEec--ceEEEEEecCcEEEEeecccccccccccc Confidence 1 11 1222222222222111 0112222222221 1122222222211100 Q ss_pred c-----ccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 323 Q-----MASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 323 ~-----~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) . 
.++ ...+.+..++ |+.+.+|.|++++-.| T Consensus 279 ~~~~~f~~d-~~~~r~~~r~-d~~v~~~~A~~~l~~a 313 (324) T protein:vir:96 279 TPVNLFEQD-MVALRATMHV-ALHIADDKAFAKLVPA 313 (324) T ss_pred cchhhhhcC-cEEEEEEEEE-ccEEecccceEEEecc Confidence 0 011 2455567777 5777779999999988 No 66 >protein:vir:5739 Length: 366 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892050;genbank:gi:33770513;interpro:IPR006444;uniprot:Q7Y410;genbank:GeneID:1732928 Probab=98.36 E-value=4.4e-07 Score=55.41 Aligned_cols=316 Identities=10% Similarity=0.033 Sum_probs=156.1 Q ss_pred Ccc------------------------cchhHHH----Hhhhhhhhcccccccccchhhhhhhhhhhcc----CCceecc Q lcl|NC_021342. 1 MAI------------------------KTIDAQT----IQGNQWLVHKGYVSRNGDQWVINNTALDAIG----NPNIMLD 48 (354) Q Consensus 1 ~~~------------------------~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~am~a~~----~~~~~~d 48 (354) ||- |-..|.+ +.+..|-... .. ..|...-+ .-++.+. T Consensus 1 ~a~~~a~~~~~~~~~~~~~~~~~~~~~kg~~~~~~~~a~a~~~g~~~~--------a~---~~a~~~~~~~~~~~a~~~~ 69 (366) T protein:vir:57 1 MAAAVAVPVKAHSVAPGIIIKEELQQYKGAGMTRMVMSIAAGKGNLAD--------AA---KFAATELGDTGLSMAISTA 69 (366) T ss_pred CcccccccccccccccccccccccccccchhHHHHHHHHHhcccchhH--------HH---HHHHHhhcchhhhhhcccc Confidence 111 1111111 1111111000 00 00111000 0112223 Q ss_pred chhhHHHHHHHHHHHHHHHHHhhhhcccchhh-ccccCCCCCceeEEEEEeeccccceeEecCCCcccceeeeccceeEE Q lcl|NC_021342. 49 ADGGIAFYISQLAGIEATVYETPYGDITYRFD-VPMAANIPEYADTWMYRSYDGVTMGKFIGANGQDLPRVAQSAQMHTV 127 (354) Q Consensus 49 a~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~-v~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~v~~~~~~~~~ 127 (354) ++++ .+++. +.+..++++..+.....|.+ ..+. +...+ .+.+......+.+.|++.. .++|..+...+.... 
T Consensus 70 ~~~G-g~lvP--~~~~~~ii~~l~~~s~l~~lg~~~v-~~~~g--~~~~p~~t~~~~a~wv~E~-~~~~~s~~~f~~i~~ 142 (366) T protein:vir:57 70 AGSG-GALIP--QNMQNEVIELLRDRTVVRILGARSI-PLPNG--NLSMPRLSGGATAGYVGEG-KDVVATGATFDDVKL 142 (366) T ss_pred ccCC-ccccc--hhHHHHHHHHHhhhcchhhhceeee-ecCCC--ceEEEEEeCCcceeeeccC-ccccccccceeEEEE Confidence 3333 34433 44566788776666555554 2111 12222 3555566666677887765 558888888888889 Q ss_pred EEEEEEeeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehh-hCceeeeecCCcccccccccccccCHH Q lcl|NC_021342. 128 PLGYAGNECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDAS-RGMYGLFNNPNVTLSSATKDYKTMNGQ 206 (354) Q Consensus 128 pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~-~gi~GLlN~p~~~~~~~~~~W~~~T~~ 206 (354) +.+.++.-..+|.+=|+.+ ..+++.--....++++++.+|+.+++|+.. ..-.|++|..+.........=...+.. T Consensus 143 ~~~k~~~~~~iS~ell~ds---~~~~~~~i~~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~t~~~~~ 219 (366) T protein:vir:57 143 SAKTMIALVPVSNQLIGRA---GFNVEQLLLGDILSAIATREDKAFLRDDGTGDTPKGMKAVATAANRLVAWTGTAINLT 219 (366) T ss_pred eeEEEEEeehhhHHHHhhh---hHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccceeeccccccceeeccccccchh Confidence 9999998888875444433 346778888889999999999999999863 467899998775432221111122333 Q ss_pred HHHHHHHHHHHHHHHHhCCcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeeeeeeeccc Q lcl|NC_021342. 207 ELFNMLNAPIFSVINLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQLDAAEL 286 (354) Q Consensus 207 ei~~di~~~~~~l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~ 286 (354) .+..++..+...... .+........+|+|..|..|.+.+ +..|..++.-+. 
.| .+...|...+..+ T Consensus 220 ~~~~~~~~~~~~~~~-~~~~~~~a~~vmn~~~~~~L~~lk--d~~G~~l~~~~~--------~g---~l~G~Pvv~s~~i 285 (366) T protein:vir:57 220 TIDEYLDSLILKHMD-SNSNMIRCGWGLSNRTYMTLFGLR--DGNGNKVYPEMS--------QG---ILKGYPIQRTSAI 285 (366) T ss_pred hHHHHHHHHHHhhhc-cccccccCEEEecHHHHHHHHhhh--ccCCceeccCCC--------CC---eecceeeEEcccc Confidence 333333333222221 122233457899999999987643 444443331110 01 1222222222222 Q ss_pred cccccccCcccEEEEEEcCcceEEEeeCchhhhccc----------cccCc----eeEEeeeeeeeeEEEECCceeEeee Q lcl|NC_021342. 287 AANGVSNSNKPRYMVYDKSDRNLAMANPIPFRMLAP----------QMASL----GITVPAEYKISGTEFRYPLCAAYVD 352 (354) Q Consensus 287 ~~~g~g~~g~d~~v~y~~~~~~~~~~vp~~~~~~~~----------~~~~l----~~~~~~~~~~gGv~i~~P~ai~y~D 352 (354) -. ..+..+...-++|- |.+.+-+..-..++.... +..++ ...+.++.++ ++.+++|.|++++- T Consensus 286 p~-~~~~~~~~~~i~~g-dfs~~~i~~~~~i~i~~~~ea~~~~~~g~~~~~f~~~~~~iR~~~~~-d~~v~~~~a~~~lt 362 (366) T protein:vir:57 286 PA-NLGDDGNESEIYFC-DFNDVVIGEDGMMKVDFSTEATYKDADGQLVSAFARNQSLIRVVTEH-DIGFRHPEGLVLGT 362 (366) T ss_pred cc-ccccCCCccEEEEE-ecceEEEEEecceEEEEeeccccccccccchhhhhcCceeEEeeeee-CcEeeccccEEEEe Confidence 11 11211111123332 222222222222222110 01111 2456677777 47889999999988 Q ss_pred cC Q lcl|NC_021342. 353 MA 354 (354) Q Consensus 353 ~~ 354 (354) =+ T Consensus 363 ~~ 364 (366) T protein:vir:57 363 GV 364 (366) T ss_pred cc Confidence 77 No 67 >protein:vir:485 Length: 407 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543092;swissprot:trembl:q8w627;genbank:gi:18249904;uniprot:Q8W627;genbank:GeneID:929693 Probab=98.33 E-value=1.9e-07 Score=57.49 Aligned_cols=319 Identities=9% Similarity=0.078 Sum_probs=162.5 Q ss_pred Ccccc--------------hhHHHHhhhhhhhcccccccccchhh--------------hhhhhhhhccCCceeccchhh Q lcl|NC_021342. 
1 MAIKT--------------IDAQTIQGNQWLVHKGYVSRNGDQWV--------------INNTALDAIGNPNIMLDADGG 52 (354) Q Consensus 1 ~~~~~--------------~~~~~~~~~~~~~~~~~~~~~~~~~~--------------~~~~am~a~~~~~~~~da~~~ 52 (354) -.++. ++.+....+...- +......+++. +...-..+. ...++++ T Consensus 41 ~~~e~~~~~~~~~e~~~~~~~~~~~~~~~~~~--~~~~~~~~e~~~a~~~~l~~g~~~~~~~~e~~a~---~~~t~~~-- 113 (407) T protein:vir:48 41 GEVETLNGKLAELENLKSDLEAELAEVKRPAG--GTQNKVASEHKEAFIGFMRKGREDGLRELERKAL---QVGNDED-- 113 (407) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhcccc--ccccchhhHHHHHHHHHHhccchhhhhHHHHHhh---hcccCCC-- Confidence 00111 1111111010100 00000001100 000000111 1112222 Q ss_pred HHHHHHHHHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeeccccceeEecCCCcccceee-eccceeEEEEEE Q lcl|NC_021342. 53 IAFYISQLAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYDGVTMGKFIGANGQDLPRVA-QSAQMHTVPLGY 131 (354) Q Consensus 53 ~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~v~-~~~~~~~~pv~~ 131 (354) +.+++. +.+.++|++........+.+..+.+ .+ ...+.+.+......+.|++... .+|-.+ ...+.....++. T Consensus 114 gG~~iP--~~~~~~I~~~~~~~~~l~~~~~~~~-~~--~~~~~~~~~~~~~~a~~v~E~~-~~~~~~~~~f~~i~~~~~k 187 (407) T protein:vir:48 114 GGYAIP--EELDRTILTLLKDEVVMRQEATVIT-LG--GSDYKKLVNLGGTTSGWVGETD-ARPETATSKLGLIEPFMGE 187 (407) T ss_pred Cccccc--HhHHHHHHHHHHhhhhhhhhceeee-cC--CCceEEEEecCCcceeeecccc-cccccccccceeEEeeeee Confidence 334444 5678888888777777777665432 22 2234454555556677776543 355443 356677788888 Q ss_pred EEeeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehhhCceeeeecCCcccccccccccc------cC- Q lcl|NC_021342. 132 AGNECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDASRGMYGLFNNPNVTLSSATKDYKT------MN- 204 (354) Q Consensus 132 ~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~gi~GLlN~p~~~~~~~~~~W~~------~T- 204 (354) ++.-+.+|.+=|+.+ ..++..--....+++++..+|+.+++|+......|+|+++..........|.. 
.+ T Consensus 188 ~~~~~~iS~ell~ds---~~~l~~~i~~~l~~~i~~~~~~a~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~ 264 (407) T protein:vir:48 188 IYGNPQATQKMLDDA---FFNVEDWINSELALEFAEQEEIAFTSGDGSKKPKGFLAYESTDEDDKTRAFGKLQHIASGAA 264 (407) T ss_pred eEeehhhHHHHHhcc---hHHHHHHHHHHHHHHHHHHHHhhhhccCCCCccceeeecccccccccccccccccccccccc Confidence 888778876654443 45688888888999999999999999998878899999887654332222211 11 Q ss_pred HHHHHHHHHHHHHHHHHHhCCcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeeeeeeec Q lcl|NC_021342. 205 GQELFNMLNAPIFSVINLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQLDAA 284 (354) Q Consensus 205 ~~ei~~di~~~~~~l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~ 284 (354) ..--++||.+++..|... +.. .-.+++++..|..|.+ +-+..|.-++ ..+ ...|.+-.+...|.+... T Consensus 265 ~~~~~d~i~~l~~~l~~~--~~~-~a~~v~n~~~~~~L~~--lkD~~Gr~l~----~~~---~~~g~~~~l~G~PV~~~~ 332 (407) T protein:vir:48 265 SGVTADAIIKLIYTLRKA--HRS-GAKFMMNNSSLFAIRL--LKDNDGNYLW----RPG---IELGQPSSLAGYGIVENE 332 (407) T ss_pred cccChHHHHHHHHhhchh--hhc-CCEEEEcHHHHHHHHH--hhccCCceee----ccC---cCCCCCceecceeeEEec Confidence 122267788888877542 222 2368999999999864 2244443221 111 112333344444444444 Q ss_pred cccccccccCcccEEEEE-EcCcceEEEeeCchhhhccccc--cCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 285 ELAANGVSNSNKPRYMVY-DKSDRNLAMANPIPFRMLAPQM--ASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 285 ~~~~~g~g~~g~d~~v~y-~~~~~~~~~~vp~~~~~~~~~~--~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) ..... + .+++. 
|+| +.+ +.+.+.--+.++.+--.+ ++ ...+.++.|++ +.+..|.|++++.++ T Consensus 333 ~~p~~--~-~~~~~-i~~Gd~~-~~~~i~~~~~~~i~~d~~~~~~-~~~~~~~~r~d-~~v~~~~a~~~l~~~ 398 (407) T protein:vir:48 333 QMPDI--A-ADAKA-IAFGNFK-RGYTIVDRIGTRILRDPYTNKP-FVGFYTTKRTG-GMLVDSQAIKLMKIG 398 (407) T ss_pred CcCCc--c-CCccE-EEEEecc-ccEEEEEeeceEEEeeccccCC-cEEEEEEEEec-cEEecccceEEEEee Confidence 32221 2 22232 233 222 212211112222222111 22 24556778886 567779999999999 No 68 >protein:vir:105038 Length: 428 # NCBI annotation: major capsid head protein precursor # Family: family:all:21 # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006586;genbank:gi:46402092;genbank:GeneID:2777903 Probab=98.30 E-value=9.7e-07 Score=53.55 Aligned_cols=327 Identities=10% Similarity=0.008 Sum_probs=154.0 Q ss_pred CcccchhHH-----HHhhh--------h------hhhcc-cccccccchhhhh---------------hhhhhhc----c Q lcl|NC_021342. 1 MAIKTIDAQ-----TIQGN--------Q------WLVHK-GYVSRNGDQWVIN---------------NTALDAI----G 41 (354) Q Consensus 1 ~~~~~~~~~-----~~~~~--------~------~~~~~-~~~~~~~~~~~~~---------------~~am~a~----~ 41 (354) --++.||.+ +.+.. . ..+.+ ......+..+... ..+.... . T Consensus 44 ~e~~~l~~~i~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 123 (428) T protein:vir:10 44 QQFTDISAKMDRMEATERAAALVAKPVKATQHGPAVIVKAEPKQYTGAGMTRMVMSIAAAQGNLQDAAKFASDELNDQSV 123 (428) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhhhchhhccccccccccchhhhHHHHHHHHHHHHhhhhHHHHHHHhhhhhhhhhH Confidence 122222211 00000 0 00000 0000000000000 0000000 0 Q ss_pred CCceeccchhhHHHHHHHHHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeeccccceeEecCCCcccceeeec Q lcl|NC_021342. 42 NPNIMLDADGGIAFYISQLAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYDGVTMGKFIGANGQDLPRVAQS 121 (354) Q Consensus 42 ~~~~~~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~v~~~ 121 (354) .-.+...+..+ .+++. +.+.+++++........+++..-.-+...+. +.+......+.+.|++.+ ..+|..+.. 
T Consensus 124 ~~~~~~~~~~g-g~liP--~~~~~~ii~~l~~~~~l~~~~~~~~~~~~g~--~~~p~~~~~~~a~~v~Eg-~~~~~~~~~ 197 (428) T protein:vir:10 124 SMAISTAAGSG-GVLIP--QNIHSEVIELLRDRTIVRKLGARSIPLPNGN--MSLPRLAGGATASYTGEN-QDAKVSEAR 197 (428) T ss_pred hhhhcccccCC-ccccc--hhHHHHHHHHHhhhchhhhhcceeeecCCcc--eEEEEEeCCcceeeeccC-ccccccccc Confidence 00112222222 34433 4455677777776666666522111222232 344455555677888665 557877777 Q ss_pred cceeEEEEEEEEeeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehh-hCceeeeecCCcccccccccc Q lcl|NC_021342. 122 AQMHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDAS-RGMYGLFNNPNVTLSSATKDY 200 (354) Q Consensus 122 ~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~-~gi~GLlN~p~~~~~~~~~~W 200 (354) .+........++.-+.+|.+=|+.+ ..++..--....++++...+|+.+++|+.. ....|++|............- T Consensus 198 f~~i~~~~~k~~~~v~is~ell~ds---~~~l~~~i~~~l~~ai~~~~d~~~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~ 274 (428) T protein:vir:10 198 FDDVKLTAKTMIAMVPISNALIGRA---GFNVEQLVLQDILTAISVREDKAFMRDDGTGDTPIGMKARATQWNRLLPWAA 274 (428) T ss_pred eeeEEeeeEEEEEeehhhHHHHhhh---hHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccccccccc Confidence 7888888888888888887655544 346777788889999999999999999864 356799987654322111111 Q ss_pred -cccCHHHHHHHHHHHHHHHHHHhCCcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeee Q lcl|NC_021342. 201 -KTMNGQELFNMLNAPIFSVINLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRF 279 (354) Q Consensus 201 -~~~T~~ei~~di~~~~~~l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~ 279 (354) +..+.+. ++...+.+................+|+|..|..|.... +..|.-++. ... 
.| .|...| T Consensus 275 ~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lk--d~~G~~i~~---~~~-----~g---~l~G~p 340 (428) T protein:vir:10 275 DAAVNLDT-IDTYLDSIILMSMDGNSNMISSGWGMSNRTYMKLFGLR--DGNGNKVYP---EMA-----QG---MLKGYP 340 (428) T ss_pred cccccHHH-HHHHHHHHHHhhhccccccccCEEEEcHHHHHHHHHhh--ccCCceecc---CCC-----CC---eeecee Confidence 1122222 22222222222111112234467899999999886532 444433321 100 11 233333 Q ss_pred eeeeccccccccccCcccEEEEEEcCcceEEEeeCchhhhcccc---------------ccCceeEEeeeeeeeeEEEEC Q lcl|NC_021342. 280 QLDAAELAANGVSNSNKPRYMVYDKSDRNLAMANPIPFRMLAPQ---------------MASLGITVPAEYKISGTEFRY 344 (354) Q Consensus 280 ~L~~~~~~~~g~g~~g~d~~v~y~~~~~~~~~~vp~~~~~~~~~---------------~~~l~~~~~~~~~~gGv~i~~ 344 (354) .+....... +.+.+++...++|- |...+-+..-..++..... .++ ...+.++.++ ++.+++ T Consensus 341 v~~~~~~p~-~~~~~~~~~~i~~g-d~s~~~i~~~~~i~i~~~~~~~~~~~~~~~~~~f~~~-~~~~R~~~r~-d~~v~~ 416 (428) T protein:vir:10 341 IQRTSAIPA-NLGEGGKESEIYFA-DFNDVVIGEDGNMKVDFSKEASYIDTDGKLVSAFSRN-QSLIRVVTEH-DIGFRH 416 (428) T ss_pred eEEeccccc-cccCCCccceEEEE-ecceEEEEEecceEEEeecccccccccccccchhhcc-hhheeeeeee-Cceeec Confidence 333322211 12222323233332 2222223332333322111 011 2345677887 589999 Q ss_pred CceeEeeecC Q lcl|NC_021342. 345 PLCAAYVDMA 354 (354) Q Consensus 345 P~ai~y~D~~ 354 (354) |.|++++.=. T Consensus 417 p~a~~~~t~~ 426 (428) T protein:vir:10 417 PEGLVLGTGV 426 (428) T ss_pred cceEEEEecc Confidence 9999997655 No 69 >protein:vir:102119 Length: 404 # NCBI annotation: phage major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699941;genbank:gi:110804052;genbank:GeneID:4206662 Probab=98.28 E-value=8.1e-08 Score=59.46 Aligned_cols=315 Identities=8% Similarity=0.045 Sum_probs=151.3 Q ss_pred Ccccch----h-------------------------------HHHHh---hhhhhhcccccccccchhhhhhhhhhhccC Q lcl|NC_021342. 
1 MAIKTI----D-------------------------------AQTIQ---GNQWLVHKGYVSRNGDQWVINNTALDAIGN 42 (354) Q Consensus 1 ~~~~~~----~-------------------------------~~~~~---~~~~~~~~~~~~~~~~~~~~~~~am~a~~~ 42 (354) --+..| + +.... ...++-.. .... +........++ T Consensus 38 ~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~----~~~~~~e~~a~-- 110 (404) T protein:vir:10 38 NEIDILQAKIEAQKRKENIENNFNEDNVKSLNTGKEENVIYNGALFVRAIADNLLKQK-NQRG----LNLSEKEINAI-- 110 (404) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccchhhHHHHHHHHHHHHHHHHHHH-Hhhh----hcchhhHHhhh-- Confidence 000000 0 00000 00000000 0000 00000011111 Q ss_pred CceeccchhhHHHHHHHHHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeeccccceeEecCCCccccee--ee Q lcl|NC_021342. 43 PNIMLDADGGIAFYISQLAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYDGVTMGKFIGANGQDLPRV--AQ 120 (354) Q Consensus 43 ~~~~~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~v--~~ 120 (354) +...++.+.+++. +.+.+++++........+.++++.. ++...-.+.+........+.+++.+.. .|.. +. T Consensus 111 ---~~~~~~~gg~~vP--~~~~~~ii~~~~~~~~l~~l~~~~~-~~~~~g~~~~~~~~~~~~~~~v~e~~~-~~~~~~~~ 183 (404) T protein:vir:10 111 ---SENIDEDGGYAVP--EDIQTKINTRLKDTTDLYNMVDYEP-VFTRSGSRTYEKRSKQKPMKPLSENQQ-IPTNGDNG 183 (404) T ss_pred ---ccccCCCCceeec--hhHHHHHHHHHhhhhhHhhhhceee-ccCCccceEEEEecCCcceeecccccc-cccccccc Confidence 1111222334444 5566778887777777777766542 111112344444445556666655433 3432 34 Q ss_pred ccceeEEEEEEEEeeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehh-hCceeeeecCCccccccccc Q lcl|NC_021342. 121 SAQMHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDAS-RGMYGLFNNPNVTLSSATKD 199 (354) Q Consensus 121 ~~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~-~gi~GLlN~p~~~~~~~~~~ 199 (354) ..+........++.-+.+|.+=|+. ...++..--....+++++..+|+.+++|+.. 
....|+++..+..+.+.++ T Consensus 184 ~f~~i~~~~~k~~~~~~iS~ell~d---s~~~l~~~i~~~la~~~~~~~~~~il~G~g~~~~~~gi~~~~~~~~~~~~~- 259 (404) T protein:vir:10 184 KLERFNFKLKDLADFMSIPNDLLKF---ADKSLEDWIINWFVDKVRITRNAEILYGAGGDEHATGIMTANKFKKITLPK- 259 (404) T ss_pred ceeeeEeeheeeEeeehhhHHHHhh---cHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCcccceeeccccceeeccc- Confidence 4556667777788777787643332 3346777788889999999999999999774 3567888887765443322 Q ss_pred ccccCHHHHHHHHHHHHHHHHHHhCCcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeee Q lcl|NC_021342. 200 YKTMNGQELFNMLNAPIFSVINLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRF 279 (354) Q Consensus 200 W~~~T~~ei~~di~~~~~~l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~ 279 (354) .+ -++++..+++.... .+....-.++|+|..|..|.+.. +..|.-++. .++ ..+.+-.+...| T Consensus 260 ---~~---~~~~~~~~~~~~l~--~~~~~~~~~v~n~~~~~~L~~lk--d~~G~~l~~----~~~---~~~~~~~l~G~P 322 (404) T protein:vir:10 260 ---SP---ALKDFKKCKNVELL--NVFKATSSWIVNQDGFNYLDSLE--DKTGRPYLQ----PDP---KDPTQYRFLGLP 322 (404) T ss_pred ---cc---cHHHHHHHHHhhhh--ccccCCCEEEEcHHHHHHHHHhh--ccCCceeec----cCc---CCCCCcccccee Confidence 11 14566666553222 23344457899999999986532 333322211 000 112222222222 Q ss_pred eeeeccccccccccCcccEEEEEEcCcceEEEeeCchhhhcc--cccc---CceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 280 QLDAAELAANGVSNSNKPRYMVYDKSDRNLAMANPIPFRMLA--PQMA---SLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 280 ~L~~~~~~~~g~g~~g~d~~v~y~~~~~~~~~~vp~~~~~~~--~~~~---~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) .......... ++.+... ++|-+=.+.+.+..-..++... .... 
.-...+.++.++ |+.+++|.+++.+.++ T Consensus 323 V~~~~~~~~~--~~~~~~~-~~~gd~s~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~-d~~v~~~~a~~~~~~~ 398 (404) T protein:vir:10 323 VIELPNDLLL--STESAIP-VLLGDTKEAYKYVSDGAYELATTNIGAGAFETNTTKARIIMRI-DGNVKDSEALLIAEIP 398 (404) T ss_pred eEEecccccC--CCCCccE-EEEEeccccEEEEEecceEEEEeccccchhhcCceEEEEEEee-ccEEecccceEEEEee Confidence 2211111111 1222233 3332222223332222222211 1111 112446667777 4789999999999999 No 70 >protein:vir:96762 Length: 632 # NCBI annotation: putative phage-related protein # Family: family:all:21 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039818;genbank:gi:126010917;genbank:GeneID:5076272 Probab=98.22 E-value=2.7e-07 Score=56.61 Aligned_cols=316 Identities=9% Similarity=-0.019 Sum_probs=151.0 Q ss_pred CcccchhH-----------HHHhhhh---hhhcccccccccchhhhhh--------------hhhhhccCCceeccchhh Q lcl|NC_021342. 1 MAIKTIDA-----------QTIQGNQ---WLVHKGYVSRNGDQWVINN--------------TALDAIGNPNIMLDADGG 52 (354) Q Consensus 1 ~~~~~~~~-----------~~~~~~~---~~~~~~~~~~~~~~~~~~~--------------~am~a~~~~~~~~da~~~ 52 (354) .....++. +...-.. ....... ....-+..... ..++.....++...+++. T Consensus 286 ~~~~~~~~~~~i~~~~re~~~~~l~rai~a~a~~~~-~~a~~~~e~a~~~a~~~G~~arg~~~~~~~l~~ra~~~~t~~~ 364 (632) T protein:vir:96 286 PGKPAIHSARDLGIQHKELQQYSLMRAINAAATGDW-SKAGFEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGK 364 (632) T ss_pred hhhhhhhhhhhhhhhHHHHHHHHHHHHHHhhhccch-hhhhhhhHHHHHHHHhhhhhhhhhhhhHHHHHHhhhhcccccc Confidence 00000000 0000000 0000000 00000000000 001100001111111222 Q ss_pred HHHHHHHHHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeeccccceeEecCCCcccceeeeccceeEEEEEEE Q lcl|NC_021342. 53 IAFYISQLAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYDGVTMGKFIGANGQDLPRVAQSAQMHTVPLGYA 132 (354) Q Consensus 53 ~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~v~~~~~~~~~pv~~~ 132 (354) +.+++.. +.+...+++..++....+++-.-.-++..+ .+.+......+.+.|++.. 
..+|..+...+........+ T Consensus 365 gg~lvp~-~~~~~~iie~lr~~s~i~~l~~~~~~~~~g--~~~ip~~~~~~~a~wv~E~-~~~~~s~~~f~~i~l~~~k~ 440 (632) T protein:vir:96 365 GGELVAT-ELLSEEFIDILRNKAIIGQMGARMLPGLVG--DVDIPKKTSGANFYWIGED-EDVQDSDFDFTTLSFSPKTI 440 (632) T ss_pred ccccccc-ccchHHHHHHHhhcchhhhhcceEeecCCc--ceEEEEEeCCceeEeecCC-ccccccccceeeEEeeeeEE Confidence 3344431 223456777766665555541111122222 3455555556667777654 44777777777888888888 Q ss_pred EeeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehh-hCceeeeecCCcccccccccccccCHHHHHHH Q lcl|NC_021342. 133 GNECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDAS-RGMYGLFNNPNVTLSSATKDYKTMNGQELFNM 211 (354) Q Consensus 133 ~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~-~gi~GLlN~p~~~~~~~~~~W~~~T~~ei~~d 211 (354) +..+.+|.+=|..+ ..+++.--......+++..+|+.+++|+.. ....|++|..+++..+..+. ..+ +++ T Consensus 441 ~~~v~iS~ell~ds---~~~~~~~i~~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~--~~~----~~~ 511 (632) T protein:vir:96 441 AGAVPVTRKLRKQS---SIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYPAG--GVD----WAS 511 (632) T ss_pred EEehhhHHHHHhcc---chHHHHHHHHHHHHHHHHHHHHHhhcccCCCCccceeeecccccceecccc--cCC----HHH Confidence 88777765544433 456777777889999999999999999863 34689999888765432221 111 456 Q ss_pred HHHHHHHHHHHhCCcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeeeeeeecccccccc Q lcl|NC_021342. 212 LNAPIFSVINLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQLDAAELAANGV 291 (354) Q Consensus 212 i~~~~~~l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~~g~ 291 (354) |.++..++... +....+...+++|..+..|......+..+.-++ ..+ ...|.|..+ +........ 
T Consensus 512 i~~~~~~i~~~-~~~~~~~~~~~~~~~~~~l~~~~l~d~~G~~i~----~~~---~l~G~pv~~-------s~~ip~~~~ 576 (632) T protein:vir:96 512 VVDMETKISTF-NADAGRLAYLTSVTQRGAAKKAQVFDNTGERIW----QNN---EVNGYRAEA-------SNQIPADTW 576 (632) T ss_pred HHHHHHHHhhc-ccccCccEEEEchhHHHHHHHHhccCCCCceee----cCC---eecccceEe-------ccccccCcE Confidence 77777666543 222334568899988877765444454443222 222 223444222 111100000 Q ss_pred ccCcccEEEEEEcCcceEEEeeCchhhhccc-cccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 292 SNSNKPRYMVYDKSDRNLAMANPIPFRMLAP-QMASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 292 g~~g~d~~v~y~~~~~~~~~~vp~~~~~~~~-~~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) -.+.-..+++... .-+.+ ...+- ....-...+.++.++ ++-+++|.+++++-.+ T Consensus 577 ~~gd~s~~~i~~~--~~~~i------~~~~~~~~~~~~v~~~~~~~~-d~~v~~~~af~~~k~~ 631 (632) T protein:vir:96 577 IFGDWSQIVIAMW--GVLDL------KVDPYTKAASDGLVLRVFQDV-DAGVRRKEAFCIAKKG 631 (632) T ss_pred EEeecceEEEEEe--cceEE------EEccccccccCceEEEEEeec-Cceeechhhhhheeec Confidence 0011011111111 11222 11111 111223455567776 5789999999999999 No 71 >protein:vir:2344 Length: 397 # NCBI annotation: gp14 # Family: family:all:507 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075281;genbank:gi:12657868;genbank:GeneID:920118 Probab=98.20 E-value=5.4e-07 Score=54.95 Aligned_cols=288 Identities=9% Similarity=-0.019 Sum_probs=150.3 Q ss_pred ccccchhhhhhhhhhhccCCceeccchhhHHHHHHHHHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeecccc Q lcl|NC_021342. 24 SRNGDQWVINNTALDAIGNPNIMLDADGGIAFYISQLAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYDGVT 103 (354) Q Consensus 24 ~~~~~~~~~~~~am~a~~~~~~~~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~G 103 (354) +-++++ ..+|-.. .+.+. +.++..++ -.++++.....-..+++..+.. .+. .+..+....... 
T Consensus 1 ~g~~~e----~~~~~~~------~t~~~-~g~l~~~~---~~~ii~~l~~~s~i~~l~~~~~-~~~--~~~~ip~~~~~~ 63 (397) T protein:vir:23 1 MGFSAD----HSQIAQT------KDTMF-TGYLDPVQ---AKDYFAEAEKTSIVQRVAQKIP-MGA--TGIVIPHWTGDV 63 (397) T ss_pred CCcCHH----HHHHhhc------cCCCC-ccccchhH---HHHHHHHHHhccchhhhcceee-ccC--CceEEEEEcCCc Confidence 333333 2222211 11122 22344322 2345555556666666665432 232 235556666677 Q ss_pred ceeEecCCCcccceeeeccceeEEEEEEEEeeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehh-hCc Q lcl|NC_021342. 104 MGKFIGANGQDLPRVAQSAQMHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDAS-RGM 182 (354) Q Consensus 104 ~a~~~~~~~~dip~v~~~~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~-~gi 182 (354) .+.|++.. ..+|..+...+......+.++..+.++.+=++.+ ..++...-....++++++.+|+.+++|+.. .++ T Consensus 64 ~a~wv~Eg-~~~~~s~~~f~~v~l~~~k~~~~v~iS~ell~ds---~~~l~~~i~~~l~~aia~~~d~a~l~G~gt~~~~ 139 (397) T protein:vir:23 64 SAQWIGEG-DMKPITKGNMTKRDVHPAKIATIFVASAETVRAN---PANYLGTMRTKVATAIAMAFDNAALHGTNAPSAF 139 (397) T ss_pred ceEEecCC-ccccccccceeEEEEeeEEEEEeehhhHHHHhcc---hHHHHHHHHHHHHHHHHHHHHHHHhhcccCCccc Confidence 78888664 5588888888888889999998888876655543 357888889999999999999999999764 345 Q ss_pred eeeeecCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhh Q lcl|NC_021342. 183 YGLFNNPNVTLSSATKDYKTMNGQELFNMLNAPIFSVINLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEA 262 (354) Q Consensus 183 ~GLlN~p~~~~~~~~~~W~~~T~~ei~~di~~~~~~l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n 262 (354) .|+.+..+......+. -..+++.+++.++... ...+..++|+|..|..|.+.. 
+..+..++.--..+ T Consensus 140 ~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~l~~~---~~~~a~~vmn~~~~~~L~~lk--d~~G~~i~~~~~~~ 206 (397) T protein:vir:23 140 QGYLDQSNKTQSISPN--------AYQGLGVSGLTKLVTD---GKKWTHTLLDDTVEPVLNGSV--DANGRPLFVESTYE 206 (397) T ss_pred ccccccccceeeeccc--------chhHHHHHHHHhhhhc---ccCCCEEEEcHHHHHHHHHhh--ccCCceeecccccc Confidence 5665554432222111 1234555666666542 234578999999999987532 33333221100000 Q ss_pred CcccccccccceeeeeeeeeeccccccccccCcccEEEEEEcC------cceEEEeeCchhhhcc-c----cccCc---- Q lcl|NC_021342. 263 NSYTLLTGNELDIQIRFQLDAAELAANGVSNSNKPRYMVYDKS------DRNLAMANPIPFRMLA-P----QMASL---- 327 (354) Q Consensus 263 ~~~~~~~g~~l~I~~~~~L~~~~~~~~g~g~~g~d~~v~y~~~------~~~~~~~vp~~~~~~~-~----~~~~l---- 327 (354) .. ...+.+-++...|........ .++..++.-+.+ .+.+.+.+-......- . .+.++ T Consensus 207 ~~--~~~~~~~tl~G~Pv~~s~~~~------~g~~~~~~gDfs~~~i~~~~~i~i~~~~e~~~~~~~~~~~~~~~lf~~d 278 (397) T protein:vir:23 207 SL--TTPFREGRILGRPTILSDHVA------EGDVVGYAGDFSQIIWGQVGGLSFDVTDQATLNLGSQESPNFVSLWQHN 278 (397) T ss_pred cc--cccccCceeeeeeEEEeCCCC------CCceEEEEeecceEEEEEEeceEEEEeeeeeeeeccccccceeeeeecc Confidence 00 000011123333333322211 111111111111 1112222211111100 0 01111 Q ss_pred eeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 328 GITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 328 ~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) ...+.++.|++ +.+++|.++++++.. T Consensus 279 ~v~~ra~~r~d-~~v~~~~a~~~~~~~ 304 (397) T protein:vir:23 279 LVAVRVEAEYG-LLINDVNAFVKLTFD 304 (397) T ss_pred ceeEEEEeeec-cceecccceEEEeec Confidence 13455667774 789999999999997 No 72 >protein:vir:9643 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795405;genbank:gi:28876178;genbank:GeneID:1257724 Probab=98.15 E-value=3.3e-06 Score=50.64 Aligned_cols=311 Identities=7% Similarity=-0.015 Sum_probs=149.7 Q ss_pred Ccccch-----hHHHHhhhhhhhcccccccccchhhhhhhhhhhccCCceeccchhhHHHHHHHHHHHHHHHHHhhhhcc Q lcl|NC_021342. 
1 MAIKTI-----DAQTIQGNQWLVHKGYVSRNGDQWVINNTALDAIGNPNIMLDADGGIAFYISQLAGIEATVYETPYGDI 75 (354) Q Consensus 1 ~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~am~a~~~~~~~~da~~~~~fl~~~L~~Id~~v~e~~~~~l 75 (354) .-++.+ +-+..+.+.-+...+....++.+... +.++.... ..+ +++.+++. +.+..+|++.....- T Consensus 37 ~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~lt~ee~~---~~~~~~~~--~~~--~~gg~lvP--~~~~~~I~~~l~~~s 107 (377) T protein:vir:96 37 AAFTTMGDEILAKNEEEMERMFDLRDKNRELTAEEIK---FFNDIDKN--VGG--KDKFKLLP--EETMVQVFDDLVAEH 107 (377) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCcccCHHHHH---HHHHHHhc--CCC--CCCceecC--HHHHHHHHHHHHhhh Confidence 000111 11112222222222333333333222 22221111 122 33334444 345556666555444 Q ss_pred cchhhccccCCCCCceeEEEEEeeccccceeEecCCCcccc-eeeeccceeEEEEEEEEeeEeecHHHHHHHHHhCCCcc Q lcl|NC_021342. 76 TYRFDVPMAANIPEYADTWMYRSYDGVTMGKFIGANGQDLP-RVAQSAQMHTVPLGYAGNECHYTLDEMRKSAAMNMPID 154 (354) Q Consensus 76 ~~r~~v~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip-~v~~~~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld 154 (354) ..|.+..+..- + +.. .....+..+.+.|++..+ .++ ..+...+....+.+.++.-..++.+=|+. ...+++ T Consensus 108 ~i~~~~~v~~~-~-~~~--~i~~~~~~~~a~wv~e~~-~~~~~~~~~f~~i~l~~~kl~~~~~is~~ll~d---s~~~le 179 (377) T protein:vir:96 108 PLLKVINFKNT-S-LRL--KALTAETSGTAVWGDIFG-EIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKF---GPKWLK 179 (377) T ss_pred hhhhhceeEec-C-Cce--EEEEecCCcceeEeeccc-ccccccCccceeEeeeeeeEEeechhhHHHhhc---chhhHH Confidence 44455444322 2 112 233445667788876543 343 34566777888888888777776554443 355788 Q ss_pred hHHHHHHHHHHHHHhhheeeeeehhhCceeeeecCCccccccccc---------------ccccCHHHHHHHHHHHHHHH Q lcl|NC_021342. 155 AEQARLAFRGAEEHSQSVAYFGDASRGMYGLFNNPNVTLSSATKD---------------YKTMNGQELFNMLNAPIFSV 219 (354) Q Consensus 155 ~~k~~aA~~~~a~~~n~~~f~G~~~~gi~GLlN~p~~~~~~~~~~---------------W~~~T~~ei~~di~~~~~~l 219 (354) .--....+++++..+++-+++|+....-.|+|+++.......... 
....+++.+.+.+..+...+ T Consensus 180 ~~i~~~l~~~~~~~~~~a~i~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~ 259 (377) T protein:vir:96 180 QFITEQLKEAIAVALELAIVKGNGLLQPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHL 259 (377) T ss_pred HHHHHHHHHHHHHHHhhceEeccCCCcceeeeeccccccccccccccccceeeccccccccccCChhHHHHHHHHHHHhh Confidence 889999999999999999999998888899999887543221111 11234555555444444444 Q ss_pred HHHhCC----cccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeeeeeeeccccccccccCc Q lcl|NC_021342. 220 INLSRR----FHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQLDAAELAANGVSNSN 295 (354) Q Consensus 220 ~~~s~g----~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~~g~g~~g 295 (354) .....+ ..+.-.++|+|..+..+...+... -.++.+...-|.|+.+. .+... .. + T Consensus 260 ~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~~----------~~~G~~~~~l~~p~~v~-----~s~~~-p~-----~ 318 (377) T protein:vir:96 260 SVNDKKHPLKIAGQVKLLLNPEDRWTLEAKFTSR----------NQFGEYVTVLPHGITIL-----ESLAV-ET-----G 318 (377) T ss_pred ccccccccccccCceEEEEchhhHHhcccccccc----------CCCCCceeccCCCceEE-----ecCCC-Cc-----c Confidence 322111 112345889998876653222110 01111111112232222 11110 00 1 Q ss_pred ccEEEEEEcCcceEEEeeCchhhhccc-cccCc--eeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 296 KPRYMVYDKSDRNLAMANPIPFRMLAP-QMASL--GITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 296 ~d~~v~y~~~~~~~~~~vp~~~~~~~~-~~~~l--~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) + ++..+.+. +.+..-..++.-.. +.... 
...+....|++| .++.|.|++.+|++ T Consensus 319 ~--i~fgdf~~--Y~i~~r~~~~i~~~~~~~~~~d~~~f~~~~r~dG-~~~d~~a~~vl~l~ 375 (377) T protein:vir:96 319 K--AIAFVANR--YDAFMATASTIEEYDQTFAMEDLQLYLTKNYFYG-KAKDNHTAALLTLA 375 (377) T ss_pred c--EEEEEcCc--EEEEEecccEEEeehhhhhhcCCeEEEEEEEEcC-EEecCCcEEEEEEe Confidence 1 12112211 22222222222111 22221 234556777754 56899999999999 No 73 >protein:vir:1328 Length: 392 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047927;swissprot:trembl:q9zwv6;genbank:gi:9631145;uniprot:Q9ZWV6;genbank:GeneID:2715889 Probab=98.11 E-value=3.4e-06 Score=50.55 Aligned_cols=319 Identities=8% Similarity=-0.014 Sum_probs=154.1 Q ss_pred CcccchhHH---H---Hhhhhhh------hcccccccccchhhhhhhhhhh-----------ccCCceeccchhhHHHHH Q lcl|NC_021342. 1 MAIKTIDAQ---T---IQGNQWL------VHKGYVSRNGDQWVINNTALDA-----------IGNPNIMLDADGGIAFYI 57 (354) Q Consensus 1 ~~~~~~~~~---~---~~~~~~~------~~~~~~~~~~~~~~~~~~am~a-----------~~~~~~~~da~~~~~fl~ 57 (354) ..++.||.+ . ++...-. ..++...............+.+ .......+.+ +++.++. T Consensus 45 ~e~~~l~~~i~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~g~~~~~~~~~~~~~~~~~t~~-~~g~~~~ 123 (392) T protein:vir:13 45 TAVADFDGRIKRGIDAIKATDAVTSLLSGLQGSGSGAQRSADHDDDAVLRAGNLGEARSFEFAPEKRDGTKA-GNPNVLS 123 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhcccCCcccchhhhhhHHHHHHHhccchhhhHHHHhhhhhhccccc-CCCcccc Confidence 222333221 0 0000000 0000000000000000000000 0000001111 1223333 Q ss_pred HHHHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeeccccceeEecCCCcccceeeeccceeEEEEEEEEeeEe Q lcl|NC_021342. 58 SQLAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYDGVTMGKFIGANGQDLPRVAQSAQMHTVPLGYAGNECH 137 (354) Q Consensus 58 ~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~v~~~~~~~~~pv~~~~~~~~ 137 (354) . +.+.+.+.+........+.+..+.... ....+.+......+.+.|++.+ ..+|..+...+......+.++.-.. 
T Consensus 124 ~--~~~~~~i~~~~~~~~~l~~~~~~~~~~--~~~~~~~~~~~~~~~a~~v~E~-~~~~~~~~~f~~v~~~~~k~~~~~~ 198 (392) T protein:vir:13 124 R--TLYGQLIAQAVERSAIMRGGASTFTTS--DANPMDFTVITGRATAGIVGET-AEIPESYPATTQRSMGGFKYGFASV 198 (392) T ss_pred c--cchHHHHHHHHhhhhhhhhcceeeecC--CCceeEEEEEcCCcceeeeccc-ccccccccceeeEEeeeeeEEeeeh Confidence 2 222333333323222333333322111 1223455566667778888665 4578888888888888899888888 Q ss_pred ecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehhhCceeeeecCCcccccccccccccCHHHHHHHHHHHHH Q lcl|NC_021342. 138 YTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDASRGMYGLFNNPNVTLSSATKDYKTMNGQELFNMLNAPIF 217 (354) Q Consensus 138 ~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~gi~GLlN~p~~~~~~~~~~W~~~T~~ei~~di~~~~~ 217 (354) +|.+=|+.+ ..++..--....+.++++.+|..+++|+....-.|+++.+...... ..|.+.+ .-.+++|.+++. T Consensus 199 iS~ell~ds---~~~l~~~i~~~l~~~i~~~~d~~~l~G~Gt~~p~Gil~~~~~~~~~--~~~~~~~-~~~~d~l~~~~~ 272 (392) T protein:vir:13 199 VSYEFATDQ---VLDLVGFLVSDAGPAIGDAMGRHFLTGTGTGQPRGILTDATGANAA--FGEADAD-SKVSDALIDLFH 272 (392) T ss_pred hHHHHHhcc---hHHHHHHHHHHHHHHHHHHHHHHHhcccCCcccccccccccccccc--ccccccc-cccHHHHHHHHH Confidence 876655533 4467777888899999999999999998776778999887643222 2222211 122567777777 Q ss_pred HHHHHhCCcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeeeeeeeccccccccccCccc Q lcl|NC_021342. 218 SVINLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQLDAAELAANGVSNSNKP 297 (354) Q Consensus 218 ~l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~~g~g~~g~d 297 (354) .|... .-.+-..+|+|..+..|..- -+..|.-++ ..++ ..|.+-.+...|.+....... 
+ T Consensus 273 ~l~~~---~~~~a~~v~n~~~~~~l~~l--kd~~G~~l~----~~~~---~~g~~~~l~G~Pv~~~~~~~~--------~ 332 (392) T protein:vir:13 273 EVPSA---YRKNAKFVVNDLRAAQMRKL--KDANGQYLW----QSAL---TVGAPDTFNGKVVETDDGMPA--------D 332 (392) T ss_pred hhhhh---hhcCCEEEEcHHHHHHHHHh--hccCCceee----cCCc---CCCCCceecceeeEEcCCCCC--------C Confidence 66432 22345689999999988653 244443221 1111 122222333344443332211 1 Q ss_pred EEEEEEcCcceEEEeeCchhhhccc-cc--cCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 298 RYMVYDKSDRNLAMANPIPFRMLAP-QM--ASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 298 ~~v~y~~~~~~~~~~vp~~~~~~~~-~~--~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) .++.-+. +.+.+..-..++.... +. ..-...+.++.|++ +.+.+|.|++.+.++ T Consensus 333 ~i~~Gdf--~~~~i~~~~~~~i~~~~~~~~~~~~~~~r~~~r~d-~~~~~~~A~~~~~~~ 389 (392) T protein:vir:13 333 KVLFADL--SKYRVRFAGSLRVDRSVDAKFSTDQIVYRFLQRAD-GLLVDARGAKVLTVT 389 (392) T ss_pred cEEEeec--cceeEEeecceEEEeeccccccCCcEEEEEEEEec-cEEecccceEEEEee Confidence 1221121 2222322233333211 11 11234566788886 558999999999999 No 74 >protein:vir:101607 Length: 379 # NCBI annotation: major capsid protein precursor # Family: family:all:585 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112497;genbank:gi:53793597;uniprot:Q5ZGF6;genbank:GeneID:3101715 Probab=98.10 E-value=4.6e-06 Score=49.83 Aligned_cols=305 Identities=12% Similarity=0.017 Sum_probs=147.1 Q ss_pred CcccchhHHH--Hh---hhhhhhcccccccccchhhhh---------h--hhhhhccCCceeccchhhHHHHHHHHHHHH Q lcl|NC_021342. 1 MAIKTIDAQT--IQ---GNQWLVHKGYVSRNGDQWVIN---------N--TALDAIGNPNIMLDADGGIAFYISQLAGIE 64 (354) Q Consensus 1 ~~~~~~~~~~--~~---~~~~~~~~~~~~~~~~~~~~~---------~--~am~a~~~~~~~~da~~~~~fl~~~L~~Id 64 (354) ..+..|..+. ++ .+... .+.........+... . ..+.+. ...++..+.++.+. +.+. 
T Consensus 52 ~~~~~l~~~~~~~e~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~ip----~~~~ 124 (379) T protein:vir:10 52 SDMAALQAHADKLDVKLKEKAK-SEDKSDSLVKSITENFNDIKEVRNGKSIQVKAV--GDMTLPVNLTGAQP----KDYN 124 (379) T ss_pred HHHHHHHHHHHHHHHHHHhccc-ccccchhHHHHHHHHHHhHHHHHhhhhhhhhhh--cccccCCCCccccc----hhhh Confidence 0111111110 00 01110 011111000111000 0 111221 22233333333322 4455 Q ss_pred HHHHHhhhhcccchhhccccCCCCCceeEEEEEeecccc--ceeEecCCCcccceeeeccceeEEEEEEEEeeEeecHHH Q lcl|NC_021342. 65 ATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYDGVT--MGKFIGANGQDLPRVAQSAQMHTVPLGYAGNECHYTLDE 142 (354) Q Consensus 65 ~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~G--~a~~~~~~~~dip~v~~~~~~~~~pv~~~~~~~~~~~~E 142 (354) ..+++........+.++.+.+- ...++.|......+ .+.+++. +..+|..+...+.....+..++.-+.+|.+= T Consensus 125 ~~ii~~~~~~~~i~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~v~E-g~~~~~~~~~f~~i~~~~~k~~~~~~iS~el 200 (379) T protein:vir:10 125 FDVVLNPSQMLNVSDIVGAVSI---SGGTYTFVRENGAGEGAIGAQVE-GATKGQKDYDISMIDVNTDFIAGFTRYSKKM 200 (379) T ss_pred hHHHHhHHhhhhHHhhceeeec---cCCceEEEEeecCCCcccccccC-CccccccccceeeeEeeeeeEEeeehhhHHH Confidence 6677777777777777655432 22234444443332 3344544 4567888888888889999999888888654 Q ss_pred HHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehhhCceeeeecCCcccccccccccccCHHHHHHHHHHHHHHHHHH Q lcl|NC_021342. 143 MRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDASRGMYGLFNNPNVTLSSATKDYKTMNGQELFNMLNAPIFSVINL 222 (354) Q Consensus 143 l~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~gi~GLlN~p~~~~~~~~~~W~~~T~~ei~~di~~~~~~l~~~ 222 (354) |+.+. .+..--....+++++..+|.-++.|....+..+.+. .++ ..-+++|.+++.++.. 
T Consensus 201 l~D~~----~l~~~i~~~la~~~~~~~~~~~~~g~~~~~~~~~~~---------~~~------~~~~d~i~~~~~~~~~- 260 (379) T protein:vir:10 201 ANNLP----FLTSFIPNALRRDYAKAENAAFNAVLAANATASTEI---------ITN------KNKVEMLINEIAKQEN- 260 (379) T ss_pred HhhHH----HHHHHHHHHHHHHHHHHHHHHHhccccccccccccc---------ccC------cccHHHHHHHHHhhhh- Confidence 44332 366666677788888999988877765443332211 111 1114677777777654 Q ss_pred hCCcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeeeeeeeccccccccccCcccEEEEE Q lcl|NC_021342. 223 SRRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQLDAAELAANGVSNSNKPRYMVY 302 (354) Q Consensus 223 s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~~g~g~~g~d~~v~y 302 (354) ....+..++|+|..|..|.+.. +..|. |+-.-+ .....|.+..+...|.+.+..... |+ +++- T Consensus 261 --~~~~~~~~vmn~~~~~~l~~lk--d~~G~----~l~~~~-~~~~~~~~~~l~G~pvv~s~~~~a------g~--~~~g 323 (379) T protein:vir:10 261 --LDFPVTAIVLRPTDYYDILVTQ--KSVGA----GYGLPG-VVTQDNGVLRINGIPLFRATWLAA------NK--YYVG 323 (379) T ss_pred --ccCCCCEEEEcHHHHHHHHHhh--ccCCc----eeccCC-ccCCCCCcceecceeeEecCCCCC------Cc--eEEe Confidence 2345678999999999986533 33332 221111 111223333444445554443221 11 1211 Q ss_pred EcCcceEEEe--eCchhhhccc-cccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 303 DKSDRNLAMA--NPIPFRMLAP-QMASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 303 ~~~~~~~~~~--vp~~~~~~~~-~~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) +.+.-.+... +...+..... 
....-...+.++.|+ |+.+++|.|+++++++ T Consensus 324 df~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~R~-~~~v~~p~a~v~~~~~ 377 (379) T protein:vir:10 324 DWTRVTKVTTEGLSLEFSEVEGTNFVKNNITARIEAQV-ALAVEQPAALIFGDFT 377 (379) T ss_pred ecccEEEEEEeceEEEEeecccccccCCcEEEEEEEEe-ccEEecCccEEEEEec Confidence 2111001000 1111111111 011113566678888 4788899999999999 No 75 >protein:vir:4856 Length: 293 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049396;genbank:gi:9632424;genbank:GeneID:1258532 Probab=98.03 E-value=1.6e-06 Score=52.33 Aligned_cols=272 Identities=7% Similarity=-0.044 Sum_probs=142.0 Q ss_pred hhhhccCCceeccchhhHHHHHHHHHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeec-cccceeEecCCCcc Q lcl|NC_021342. 36 ALDAIGNPNIMLDADGGIAFYISQLAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYD-GVTMGKFIGANGQD 114 (354) Q Consensus 36 am~a~~~~~~~~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~-~~G~a~~~~~~~~d 114 (354) -++++ .....+.+.++.. +.+.+.+++........+++..+.. .......+.+.... ..+.+.+++.... T Consensus 1 ~l~~~-----~~~t~~~gg~liP--~~~~~~Ii~~~~~~~~l~~~~~~~~-~~~~~g~~~~~~~~~~~~~a~~v~Eg~~- 71 (293) T protein:vir:48 1 MLDSK-----TDHSGSDAGLTIP--QDIRTAINTLVRQYDSLQEYVNVEN-VTTLTGSRVYEKWTDITGLANIDDEAGK- 71 (293) T ss_pred Cceee-----cccccCcCceEec--hhHHHHHHHHHHhhhhhhhhceeee-ccCCcceEEEEeecCCCcceeeecCCcc- Confidence 12221 1111122333333 5666778888777777777755432 22222234444443 3466788876544 Q ss_pred ccee-eeccceeEEEEEEEEeeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehhhCceeeeecCCccc Q lcl|NC_021342. 115 LPRV-AQSAQMHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDASRGMYGLFNNPNVTL 193 (354) Q Consensus 115 ip~v-~~~~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~gi~GLlN~p~~~~ 193 (354) +|-. ....+......+.++..+.+|.+=++.+ ..++...-....+++++..+|+-++.|...... 
T Consensus 72 ~~~~~~~~~~~i~l~~~k~~~~~~iS~ell~ds---~~~l~~~i~~~la~~~~~~~~~~i~~g~~~~~~----------- 137 (293) T protein:vir:48 72 IADIDDPKLSLIKYTIKRYAGISTVTNSLLADS---AENILAWLSGWIAKKVVVTRNKAILGVVDKLPT----------- 137 (293) T ss_pred cccccccceeEEEEeeeEEEEeehhhHHHHhhh---hHHHHHHHHHHHHHHHHHHHHhHHhhccccccc----------- Confidence 5543 3567777888888888888876555443 457888888889999999999999887543210 Q ss_pred ccccccccccCHHHHHHHHHHHHHHHHHHhCCcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccc Q lcl|NC_021342. 194 SSATKDYKTMNGQELFNMLNAPIFSVINLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNEL 273 (354) Q Consensus 194 ~~~~~~W~~~T~~ei~~di~~~~~~l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l 273 (354) .. ...+ ++||.+++.++... + .....++|+|+.|..|.+-. +..+.-++ ..++ ..+.+- T Consensus 138 ---~~--~~~~----~d~i~~~~~~l~~~--~-~~~a~~vmn~~~~~~L~~lk--d~~g~~l~----~~~~---~~~~~~ 196 (293) T protein:vir:48 138 ---KP--TLTK----WDDIIDLEAKVDPA--I-KQTSFFLTNTSGFTALKKVK--NALGDYLM----ERDV---KSPTGY 196 (293) T ss_pred ---cc--cccC----HHHHHHHHHhhhhh--h-cCCCEEEEcHHHHHHHHHhh--ccCCceEe----ecCc---CCCCCc Confidence 00 1112 46777777777542 2 23457999999999986532 33332211 1111 112222 Q ss_pred eeeeeeeeeeccccccccccCcccEEEEEEcCcceEEEeeCchhhh--ccc---cccCceeEEeeeeeeeeEEEECCcee Q lcl|NC_021342. 274 DIQIRFQLDAAELAANGVSNSNKPRYMVYDKSDRNLAMANPIPFRM--LAP---QMASLGITVPAEYKISGTEFRYPLCA 348 (354) Q Consensus 274 ~I~~~~~L~~~~~~~~g~g~~g~d~~v~y~~~~~~~~~~vp~~~~~--~~~---~~~~l~~~~~~~~~~gGv~i~~P~ai 348 (354) .|...|........... ...++..+ +|-+=.+.+.+..-..++. ... ....=...+.+.+|++ +.+++|.|+ T Consensus 197 ~l~G~Pv~~~~~~~~~~-~~~~~~~~-~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d-~~~~~~~a~ 273 (293) T protein:vir:48 197 SIAGFAVKEISDRWLPN-ASSGVMPL-YFGDLKQAVTLFDRQQMSLLSTNIGGGAFETDTTKVRVIDRFD-VVATDTEAF 273 (293) T ss_pred eecceeeEEecccccCC-ccCCceEE-EEEeccceEEEEEecceEEEEecccchhhhcCeEEEEEEEeeC-cEEecccce Confidence 33222222221111111 11222222 2221122222221122221 111 0111134566778876 567889999 Q ss_pred EeeecC Q lcl|NC_021342. 
349 AYVDMA 354 (354) Q Consensus 349 ~y~D~~ 354 (354) +.+.++ T Consensus 274 ~~l~~~ 279 (293) T protein:vir:48 274 VPASFK 279 (293) T ss_pred EEEEee Confidence 999998 No 76 >protein:vir:4092 Length: 390 # NCBI annotation: major capsid protein a # Family: family:all:635 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510986;swissprot:trembl:q8w604;genbank:gi:17488508;uniprot:Q8W604;genbank:GeneID:1260361 Probab=98.01 E-value=6.9e-06 Score=48.88 Aligned_cols=312 Identities=6% Similarity=-0.062 Sum_probs=146.8 Q ss_pred Cc-------ccch--hHHHHhhhhhhhcccccccccchhhhhhhhhhhccCCceeccchhhHHHHHHHHHHHHHHHHHhh Q lcl|NC_021342. 1 MA-------IKTI--DAQTIQGNQWLVHKGYVSRNGDQWVINNTALDAIGNPNIMLDADGGIAFYISQLAGIEATVYETP 71 (354) Q Consensus 1 ~~-------~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~am~a~~~~~~~~da~~~~~fl~~~L~~Id~~v~e~~ 71 (354) |. ++.. .+..-..+......+....+.++... ++++.... ..++ .+..+.. +.+...+++.. T Consensus 38 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~r~---~~~~~~~~--~~~~--~gg~lvP--~~~~~~I~~~~ 108 (390) T protein:vir:40 38 MAEQIQNNIIAQARKEVNREMNDNNVLASRGANALTSDESK---YYNEVIAG--NGFA--GVTALLP--PTVFERVFEDL 108 (390) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCchhccHHHHH---HHHHHHhc--cCcc--cCccccc--HHHHHHHHHHH Confidence 00 0000 00000011111112222223222211 22221111 1122 2223333 45556677766 Q ss_pred hhcccchhhccccCCCCCceeEEEEEeeccccceeEecCCCcccc-eeeeccceeEEEEEEEEeeEeecHHHHHHHHHhC Q lcl|NC_021342. 72 YGDITYRFDVPMAANIPEYADTWMYRSYDGVTMGKFIGANGQDLP-RVAQSAQMHTVPLGYAGNECHYTLDEMRKSAAMN 150 (354) Q Consensus 72 ~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip-~v~~~~~~~~~pv~~~~~~~~~~~~El~~a~~~g 150 (354) ...-..+.++.+.. .+.+ ...+......+.+.|++..+. +| ..+...+......+.++.-+.+|..=|+.+ . 
T Consensus 109 ~~~s~i~~~~~~~~-~~~~--~~~i~~~~~~~~a~~~~E~~~-~~~~~~~~f~~i~l~~~k~~~~i~iS~ell~ds---~ 181 (390) T protein:vir:40 109 TVEHPLLSKINFVN-TTAT--TEWIISVGDVATAWWGPLCAE-IKEVLDNGFDKIQTGMYKLSAYIPVCNAMLDLG---P 181 (390) T ss_pred Hhhhhhhhhceeee-cCCc--eeEEEEEcCCcceeeeccccc-cCccccccceeeEeeeeeEEEeehhhHHHHhcc---h Confidence 66655666655432 2222 233445556677888765433 44 345667778888888888888876555544 4 Q ss_pred CCcchHHHHHHHHHHHHHhhheeeeeehhhCceeeeecCCccccccc--ccccccCHHHHHHHHHHHHHHHHHHhCCccc Q lcl|NC_021342. 151 MPIDAEQARLAFRGAEEHSQSVAYFGDASRGMYGLFNNPNVTLSSAT--KDYKTMNGQELFNMLNAPIFSVINLSRRFHV 228 (354) Q Consensus 151 ~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~gi~GLlN~p~~~~~~~~--~~W~~~T~~ei~~di~~~~~~l~~~s~g~~~ 228 (354) .++..--....+++++..+|+-+++|+....-.|++|.++....... ....+-|...+.+.+..+...+......... T Consensus 182 ~~l~~~i~~~la~~i~~~~~~a~l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~ 261 (390) T protein:vir:40 182 SWLDQYVRTILGEAMALGLEAGIVNGSGKDQPIGMMRDLNNVTAGEHPVKTATPLTDLTPATLATKVMLPLTDNGKKSVS 261 (390) T ss_pred HHHHHHHHHHHHHHHHHHHHhhhhcccCCCccceeeeccccccccccccccccccchhhHHHHHHHHHHHhhcchhhhhc Confidence 46888888999999999999999999876666899998764322111 1111122333333333333333222211223 Q ss_pred ccEEEeCHHHHH-HHhh-cccCCCCCchHHHHHHhhCcccccccccceeeeeeeeeeccccccccccCcccEEEEEEcCc Q lcl|NC_021342. 229 PNTALMFPDLWN-QANN-QLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQLDAAELAANGVSNSNKPRYMVYDKSD 306 (354) Q Consensus 229 p~~L~l~p~~~~-~L~~-~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~~g~g~~g~d~~v~y~~~~ 306 (354) --.++|+|..+. +|.. +...+..|. |+...-+ -| .|.+....... + . ++|- |. 
T Consensus 262 ~a~~i~n~~t~~~~l~~~~~~~d~~G~----~v~~~~~----~g-------~pvv~~~~~p~------~--~-i~~G-d~ 316 (390) T protein:vir:40 262 DAILVINPADYWSKIYAATSYMTPQGV----WVTGILP----VP-------LEIVQSVAVPV------G--K-AVAG-RA 316 (390) T ss_pred CceEEEcchhHHHHHHHHhhccCCCCc----cccccCC----Cc-------eeEEEcCCCCC------C--c-EEEE-ee Confidence 345788887643 3321 112223332 1211100 12 22222221111 1 1 2221 11 Q ss_pred ceEEEeeCchhhhccc-ccc--CceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 307 RNLAMANPIPFRMLAP-QMA--SLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 307 ~~~~~~vp~~~~~~~~-~~~--~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) ..+.+..-..+++... +.. .-...+..+.|++ ..++.|.|++.+.|+ T Consensus 317 s~~~i~~~~~~~v~~~~~~~f~~~~~~~r~~~r~d-g~v~~~~A~~~l~~~ 366 (390) T protein:vir:40 317 KDYFMGIGSEQVIRTSTEYRLLDDETLYYAKQYAN-GRPKDNSSFLVFDIT 366 (390) T ss_pred ceEEEEeecceEEEecchhhhhcCcEEEEEEEEeC-CEEecccceEEEEee Confidence 1122222233332211 221 2235666788875 567779999999999 No 77 >protein:vir:6212 Length: 434 # NCBI annotation: prohead protease # Family: family:all:21 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852592;genbank:gi:31415852;genbank:GeneID:1489210 Probab=98.00 E-value=1.3e-06 Score=52.89 Aligned_cols=319 Identities=8% Similarity=-0.002 Sum_probs=160.1 Q ss_pred Ccccc-hhHHHH----hhh---h----hhhcccccccccchhhhhhhhhhhc--------cCCceeccchhhHHHHHHHH Q lcl|NC_021342. 1 MAIKT-IDAQTI----QGN---Q----WLVHKGYVSRNGDQWVINNTALDAI--------GNPNIMLDADGGIAFYISQL 60 (354) Q Consensus 1 ~~~~~-~~~~~~----~~~---~----~~~~~~~~~~~~~~~~~~~~am~a~--------~~~~~~~da~~~~~fl~~~L 60 (354) ...+. ...+.+ ++. . +...++.......++. .|.... -..++++. .+.+.|++. 
T Consensus 82 ~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~e~r---~a~~~~l~~~~~~~e~~a~~~~-t~~GG~lvP-- 155 (434) T protein:vir:62 82 PTAKENPNEKTELSEEQRSAISASIAAALSTKGHRTNKETEIR---SVFANYIVGNIDEKEARALGLV-TGNGSVTIP-- 155 (434) T ss_pred hhhhcchhhhHHHHHHHHHHHHHHHHhhhhhccccchHHHHHH---HHHHHHhccccchhhhhhhccc-ccccceecc-- Confidence 00000 000000 000 0 0000000000000000 010000 00011111 122445655 Q ss_pred HHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeeccccceeEe--cCCCcccceeeeccceeEEEEEEEEeeEee Q lcl|NC_021342. 61 AGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYDGVTMGKFI--GANGQDLPRVAQSAQMHTVPLGYAGNECHY 138 (354) Q Consensus 61 ~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~G~a~~~--~~~~~dip~v~~~~~~~~~pv~~~~~~~~~ 138 (354) +.+.+.|++........+.+..+....+ .+.+.+....+.+.+. .....++|..+...+......+.++.-+.+ T Consensus 156 ~~~~~~Ii~~l~~~~~i~~~~~~~~~~~----~~~~p~~~~~~~a~~~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~i 231 (434) T protein:vir:62 156 DFLSKEIITYAQEENFLRRLGTGVKTKE----NIKYPVLVKKAEAQGHKNERTNNEMPETDIEFDEIELSPTEFDALATV 231 (434) T ss_pred hhhHHHHHHhhhhhhhhhhhcceeccCC----ceEEEEEecCCcccceecccccccccccccceeeEEeeheeeEeehhh Confidence 5577788887777777777765532221 2344444444444443 233456777777778888888888887777 Q ss_pred cHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehhhC-ceeeeecCCcccccccccccccCHHHHHHHHHHHHH Q lcl|NC_021342. 139 TLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDASRG-MYGLFNNPNVTLSSATKDYKTMNGQELFNMLNAPIF 217 (354) Q Consensus 139 ~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~g-i~GLlN~p~~~~~~~~~~W~~~T~~ei~~di~~~~~ 217 (354) |.+=|+.+ ..++..--....+.++...+|+.+++|+...+ ..|+++.++++..+.. ...+++|.+++. 
T Consensus 232 S~ell~ds---~~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~~~~g~~~~~~~~~~~~~--------~~~~d~l~~l~~ 300 (434) T protein:vir:62 232 TKKLLART---GLPIEQIVMDELKKAYVRKETQYMVNGDEANNINDGALAKKAVEFKTDE--------KNLYDALVKMKN 300 (434) T ss_pred HHHHHhcc---hHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccceeecccccccccc--------cchhhHHHHHHh Confidence 76544433 45788888888999999999999999987554 4577877776443221 123677778877 Q ss_pred HHHHHhCCcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeeeeeeeccccccccccCccc Q lcl|NC_021342. 218 SVINLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQLDAAELAANGVSNSNKP 297 (354) Q Consensus 218 ~l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~~g~g~~g~d 297 (354) ++... +. ..-..+|+|..|..|.+- -+..|.-+++ - ......|.+-+|...|......... +.++.. T Consensus 301 ~l~~~--~~-~~a~~v~n~~~~~~L~~l--kd~~G~~l~~----~-~~~~~~g~~~tl~G~pV~~~~~~~~---~~~~~~ 367 (434) T protein:vir:62 301 TPVKE--VR-KKARWVLNTAALTKIETM--KTDDGFPLLR----P-FNQAEGGIGYTLLGFPVEEEDAIDI---PDSPDT 367 (434) T ss_pred hcchh--hh-cCCEEEEcHHHHHHHHHh--hccCCCEeec----c-CCCccCCCCceecceeeEEecCccC---ccCCCc Confidence 77542 21 223579999999998652 2433322211 0 0011234444455555544443322 222222 Q ss_pred EEEEE-EcCcceE-EEee-Cchhhhccccc-cCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 298 RYMVY-DKSDRNL-AMAN-PIPFRMLAPQM-ASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 298 ~~v~y-~~~~~~~-~~~v-p~~~~~~~~~~-~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) ..|+| +.+ +.+ .... ++.++.+..-. 
..-..-+.++.|+.|-.|+.|.+++-+=+- T Consensus 368 ~~i~~Gdfs-~~~i~~~~g~~~i~~~~~~~~~~~~v~~~~~~r~Dgk~i~~~~~~~~~~~~ 427 (434) T protein:vir:62 368 PVFYFGDFS-KFYIQDVIGSLEVQKLVELFSRTNRVGFRIWNLLDAQLIHSPFEVPVYKYV 427 (434) T ss_pred eEEEEeecc-ceEEEEeeceeEEEeehhhhcccCceEEEEEeeecceeecCcccceEEEEE Confidence 33433 322 222 1111 12222222111 222344777899988889889888765332 No 78 >protein:vir:1268 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690760;genbank:gi:22855000;genbank:GeneID:955203 Probab=97.99 E-value=1.8e-06 Score=52.04 Aligned_cols=305 Identities=9% Similarity=-0.021 Sum_probs=144.4 Q ss_pred CcccchhHHHHhhhhhhhcccccccccchhhhhhhhhhhccCCceeccchhhHHHHHHHHHHHHHHHHHhhhhcccchhh Q lcl|NC_021342. 1 MAIKTIDAQTIQGNQWLVHKGYVSRNGDQWVINNTALDAIGNPNIMLDADGGIAFYISQLAGIEATVYETPYGDITYRFD 80 (354) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~am~a~~~~~~~~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~ 80 (354) ..-+....+. ..+.+.-..+.. ...++. .. ........++.....+.+.+++. +.+.+.+++........+.+ T Consensus 84 ~~~~~~~~~~-~~~a~~~~~~~~-~~~~~~-~~--~~~~~~~~a~~~~~~~~gg~lvP--~~~~~~ii~~~~~~~~l~~~ 156 (397) T protein:vir:12 84 GQGNEERQQQ-YSKAFLKGLRGK-RLTDEE-RD--LLDSPEFRAMSGINDEDGGILIP--EDIGRQIHEFKRQFEPLEQY 156 (397) T ss_pred cchhhHHHHH-HHHHHHHHHhcc-CCcHHH-HH--HHhhhhhhhccccccccCcccCc--hhHHHHHHHhhhhhhhHHhh Confidence 0000000000 001000000000 000010 00 00000001111111122334444 56677788887777777777 Q ss_pred ccccCCCCCceeEEEEEeeccccceeEecCCCcccceee-eccceeEEEEEEEEeeEeecHHHHHHHHHhCCCcchHHHH Q lcl|NC_021342. 81 VPMAANIPEYADTWMYRSYDGVTMGKFIGANGQDLPRVA-QSAQMHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQAR 159 (354) Q Consensus 81 v~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~v~-~~~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~ 159 (354) +++.. .+...-.+.+......+.+.+++.++. +|-.+ ...+........++....+|..=++. ...++..--.. 
T Consensus 157 ~~~~~-~~~~~~~~~~~~~~~~~~a~~v~Eg~~-~~~~~~~~~~~v~~~~~k~~~~~~is~e~l~d---s~~~l~~~i~~ 231 (397) T protein:vir:12 157 VTVEP-VTTRSGTRLLEKNADMVPFSPVEELGN-LPEIDQPRFTKVSYSIIDYGGIMTLSNSMLND---SDQAIMTYVAK 231 (397) T ss_pred cceee-ccCCceeEEEEEecCCcceeeeccccc-ccccccccceeEEeeheeeEeeehhhHHHHhh---chHHHHHHHHH Confidence 65432 111112344444445566778877654 45333 45677788888888877777654433 34567777888 Q ss_pred HHHHHHHHHhhheeeeeehhhCceeeeecCCcccccccccccccCHHHHHHHHHHHHH-HHHHHhCCcccccEEEeCHHH Q lcl|NC_021342. 160 LAFRGAEEHSQSVAYFGDASRGMYGLFNNPNVTLSSATKDYKTMNGQELFNMLNAPIF-SVINLSRRFHVPNTALMFPDL 238 (354) Q Consensus 160 aA~~~~a~~~n~~~f~G~~~~gi~GLlN~p~~~~~~~~~~W~~~T~~ei~~di~~~~~-~l~~~s~g~~~p~~L~l~p~~ 238 (354) ..++++++.+|+.+++|+....-.|++ + +++|.+++. .+.. .......++++|.. T Consensus 232 ~l~~~~~~~~d~~il~G~g~~~~~g~~-----------------~----~~~i~~~~~~~l~~---~~~~~a~~~~n~~~ 287 (397) T protein:vir:12 232 WFAKKSVVTRNNLILAAIASLKKVDID-----------------G----LDGIKKALNVTLDP---MVAPGSIVLTNQDG 287 (397) T ss_pred HHHHHHHHHHHHHHHhccccccccccc-----------------c----HHHHHHHHhhccch---hhhCCCEEEEcHHH Confidence 899999999999999997653322221 1 344555443 2221 22334579999999 Q ss_pred HHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeeeeeeeccccccccccCcccEEEEEEcCcceEEEeeCchhh Q lcl|NC_021342. 239 WNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQLDAAELAANGVSNSNKPRYMVYDKSDRNLAMANPIPFR 318 (354) Q Consensus 239 ~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~~g~g~~g~d~~v~y~~~~~~~~~~vp~~~~ 318 (354) |..|.+- .+..|. |+-..++ ..|.+-.+...|.+...... .+.+ .++.. 
+++-+=.+.+.+..-..++ T Consensus 288 ~~~L~~l--kd~~G~----~l~~~~~---~~g~~~~l~G~pv~~~~~~~-~~~~-~~~~~-~~~gd~~~~~~~~~~~~~~ 355 (397) T protein:vir:12 288 YDWLDTL--KDGTGR----YLLQPDP---TNPTKKLLDGRPVVPFTNRV-LKTQ-KGKAP-LIIGNLKEAIVLFDREQQS 355 (397) T ss_pred HHHHHHh--hccCCc----eeecccc---cCCCCccccceeeEEecccc-cccC-CCccE-EEEEehhceEEEEeecceE Confidence 9998653 233332 2211111 12333334333333222111 1111 12222 3332212333333222322 Q ss_pred hccc--cc---cCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 319 MLAP--QM---ASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 319 ~~~~--~~---~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) +.-. .. ..=...+.++.+++ ..+++|.|+++++++ T Consensus 356 i~~~~~~~~~f~~~~~~~r~~~r~d-~~~~~~~a~~~~~~t 395 (397) T protein:vir:12 356 IASTDTGAGAFETNSTKVRGIERED-VRKWDEDAVVFGQIT 395 (397) T ss_pred EEEeccccchhhcCceEEEEEEeec-cEEecccceEEEEEe Confidence 2111 11 11135666788875 577999999999999 No 79 >protein:vir:4197 Length: 314 # NCBI annotation: putative structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071822;genbank:gi:11863105;genbank:GeneID:1257607 Probab=97.98 E-value=6.2e-06 Score=49.13 Aligned_cols=297 Identities=12% Similarity=0.021 Sum_probs=152.6 Q ss_pred cccccchhhhhhhhhhhccCCceeccchhhHHHHHHHHHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeec-c Q lcl|NC_021342. 23 VSRNGDQWVINNTALDAIGNPNIMLDADGGIAFYISQLAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYD-G 101 (354) Q Consensus 23 ~~~~~~~~~~~~~am~a~~~~~~~~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~-~ 101 (354) +.-++. +.+. ...++++ +.++..|..+- .+ ++++.....-..|++..+....++....+...... . 
T Consensus 1 ~~~~~~-------~~~~--~k~it~~-d~~gG~L~P~~--~~-~~i~~l~e~s~i~~~a~vi~t~~s~~~~i~~i~~g~~ 67 (314) T protein:vir:41 1 MDFLNK-------PFQI--TPKIDVP-DLGKGILAVQR--FG-EFVREVRENSAIIKDARVLNALKSYEVDISRISLGVE 67 (314) T ss_pred Cchhhh-------HHHh--hcccccc-cCCCceeChHH--HH-HHHHHHHhccchhhheeeecccCccceeecccccCcc Confidence 222222 2222 2333332 33344566532 23 45565555556666666544334333322211111 1 Q ss_pred c-cceeEecCCCcccceeeeccceeEEEEEEEEeeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehh- Q lcl|NC_021342. 102 V-TMGKFIGANGQDLPRVAQSAQMHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDAS- 179 (354) Q Consensus 102 ~-G~a~~~~~~~~dip~v~~~~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~- 179 (354) . ..+.+.+ .....|..+...+......+.+...+.++.+-|+... .|.++...-....++.+...+..+.|+|+.. T Consensus 68 ~~~~~~~~~-~~~~~~~~~~tf~~~~l~~~kl~~~v~is~e~L~D~a-~~~~le~~i~~~~Ae~~g~~~~~~~~nGdg~~ 145 (314) T protein:vir:41 68 LEPGRNTSG-TKVAPTADEVTVSTNTLEMKELVTKVVLEDEALEDNI-EQSAFEQTITSLLASGVTYDLECFFLHADSSL 145 (314) T ss_pred ccccccccc-CCccCCcccccccceeeeeEEEEEeecccHHHHHhhh-chhhHHHHHHHHHHHHHHHHHHHHhhccccCC Confidence 1 1112222 2233456666677777788888888888888887765 5678988889999999999999999999864 Q ss_pred -------hCceeeeecCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCc-c-cccEEEeCHHHHHHHhhcccCCC Q lcl|NC_021342. 180 -------RGMYGLFNNPNVTLSSATKDYKTMNGQELFNMLNAPIFSVINLSRRF-H-VPNTALMFPDLWNQANNQLMTGY 250 (354) Q Consensus 180 -------~gi~GLlN~p~~~~~~~~~~W~~~T~~ei~~di~~~~~~l~~~s~g~-~-~p~~L~l~p~~~~~L~~~~~~~~ 250 (354) ....|+|+.........++ -..+.+++ .+.+++..|... +. . 
+....+|++..+..+.+..-..+ T Consensus 146 ~s~~~~~~~p~G~l~~a~~~~~~~~~-~~~~~~~~---~~~~l~~sl~~~--yr~~~~~~~~~m~~~t~~~~r~~l~~~~ 219 (314) T protein:vir:41 146 TTGRELYRINDGWMKLAGNQYTDAEP-EDENWPLN---LFDGMMDELDTR--YLQLKPRMKFYVSNEIYNGYRKQLLVRE 219 (314) T ss_pred cCcccchhcchhhhhhcccceeecCc-cccccHHH---HHHHHHHhcCch--hhcCCCceEEEecHHHHHHHHHHHhccC Confidence 2456888765433222111 01122333 344444444321 21 1 23468889888776543211111 Q ss_pred CCchHHHHHHhhCcccccccccceeeeeeeeeeccccccccccCcccEEEEEEcCcceEEEeeCchhhhccccc-cCcee Q lcl|NC_021342. 251 TDRTVMQHFMEANSYTLLTGNELDIQIRFQLDAAELAANGVSNSNKPRYMVYDKSDRNLAMANPIPFRMLAPQM-ASLGI 329 (354) Q Consensus 251 ~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~~g~g~~g~d~~v~y~~~~~~~~~~vp~~~~~~~~~~-~~l~~ 329 (354) ++ +.+.. ...+.+..+...|..........+. + +. .|++ -|++++...+...++..+-.. +.-.+ T Consensus 220 ~~--l~~~~-------~~~~~~~~l~G~PV~~~~~~~~~~~--~-~~-~i~f-gd~~nlv~~~~~~ir~~~~~~a~~~~~ 285 (314) T protein:vir:41 220 TG--LGDSA-------LIGATGLQYDGIPIQYVPALDALGD--D-KA-RALL-TVPTNLVYGFWRNIRIEPKRDAAMRRT 285 (314) T ss_pred Cc--ccchh-------hhCCCCceecceeeEecccccccCC--C-Cc-eEEE-echhheEEEeeceeEEeecccCcCCeE Confidence 11 11111 1235566666566555554433222 2 22 2333 357788777777777765432 22345 Q ss_pred EEeeeeeeeeEEEECC-ceeEeeecC Q lcl|NC_021342. 330 TVPAEYKISGTEFRYP-LCAAYVDMA 354 (354) Q Consensus 330 ~~~~~~~~gGv~i~~P-~ai~y~D~~ 354 (354) .+-...|++....-.+ -+++++.-| T Consensus 286 ~~~~~~r~d~~~~~~~aa~~~~~~~~ 311 (314) T protein:vir:41 286 EYIASLRADCNYEDENAAVAAVIDMS 311 (314) T ss_pred EEEEEEEeceEEEEcCcEEEEEeecc Confidence 5555566653332232 233446666 No 80 >protein:vir:1025 Length: 408 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076679;genbank:gi:13095788;genbank:GeneID:920362 Probab=97.97 E-value=8.6e-07 Score=53.85 Aligned_cols=312 Identities=7% Similarity=0.005 Sum_probs=142.5 Q ss_pred CcccchhHHHHhh----hhhhhcccccccc---cchhhhhhh-hhh-----------hccCCceeccchhhHHHHHHHHH Q lcl|NC_021342. 
1 MAIKTIDAQTIQG----NQWLVHKGYVSRN---GDQWVINNT-ALD-----------AIGNPNIMLDADGGIAFYISQLA 61 (354) Q Consensus 1 ~~~~~~~~~~~~~----~~~~~~~~~~~~~---~~~~~~~~~-am~-----------a~~~~~~~~da~~~~~fl~~~L~ 61 (354) -.++.+..|.-.. ..+. ........ ......... ++. .....++..-..+.+.+++. + T Consensus 54 ~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP--~ 130 (408) T protein:vir:10 54 VRRDALREQLVEAQAEQVVNM-REEEKGPLNKSENELKDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIP--Q 130 (408) T ss_pred HHHHHHHHHHHHHHHHHHhcc-ccccccccccchhhhHHHHHHHHHHHhhcchhhhhhhhhhhhhcccccCCceecc--H Confidence 0011111110000 0000 00000000 000000000 000 00001111111222344444 5 Q ss_pred HHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEee-ccccceeEecCCCcccceee-eccceeEEEEEEEEeeEeec Q lcl|NC_021342. 62 GIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSY-DGVTMGKFIGANGQDLPRVA-QSAQMHTVPLGYAGNECHYT 139 (354) Q Consensus 62 ~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~-~~~G~a~~~~~~~~dip~v~-~~~~~~~~pv~~~~~~~~~~ 139 (354) .+.+.|++........+.++.+.. .+.....+.+... +..+.+.+++.++. +|-.+ ...+......+.++..+.+| T Consensus 131 ~~~~~Ii~~~~~~~~l~~~~~~~~-~~~~~~~~~~~~~~~~~~~a~~v~E~~~-~~~~~~~~~~~i~~~~~k~~~~~~iS 208 (408) T protein:vir:10 131 DIRTMINTLVRQYDSLQQYVRVES-VSTSNGSRVYEKWTDVTPLTVMDAEDGK-IPDLDNPQLTIIKYLIKRYAGIITAT 208 (408) T ss_pred hHHHHHHHHHHhhchhhhhcceee-ccCCcceEEEeeccccccceeeecCccc-cccccCcceeeEEeeeeeEEeeehhH Confidence 566788888887777777755432 1111122222222 34466778776543 55443 45677888888888888887 Q ss_pred HHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehhhCceeeeecCCcccccccccccccCHHHHHHHHHHHHHHH Q lcl|NC_021342. 140 LDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDASRGMYGLFNNPNVTLSSATKDYKTMNGQELFNMLNAPIFSV 219 (354) Q Consensus 140 ~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~gi~GLlN~p~~~~~~~~~~W~~~T~~ei~~di~~~~~~l 219 (354) ..=|+. ...++..--....++++...+|+-++.|+.... +..++ .+ ++||..++... 
T Consensus 209 ~ell~d---s~~~l~~~i~~~l~~~~~~~~~~~il~g~g~~~----------~~~~~------~~----~~~l~~~~~~~ 265 (408) T protein:vir:10 209 NTSLKD---TAENILAWLSSWIAKKVVVTRNQAIIEVMKAAP----------KKPTI------AK----FDDVITMINTA 265 (408) T ss_pred HHHHhh---chHHHHHHHHHHHHHHHHHHHHHHHhhcccccc----------ccccc------cc----HHHHHHHHHHh Confidence 654443 345677778888999999999999999876421 11110 12 34444444322 Q ss_pred HHHhCCcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeeeeeeeccccccccccCcccEE Q lcl|NC_021342. 220 INLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQLDAAELAANGVSNSNKPRY 299 (354) Q Consensus 220 ~~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~~g~g~~g~d~~ 299 (354) ... +....-.++|+|..|..|.+- -+..|.-+++ .++ .++.+-.+...|......-... ..+.++. . T Consensus 266 ~~~--~~~~~a~~v~n~~~~~~l~~l--kd~~G~~i~~----~~~---~~~~~~~l~G~PV~~~~~~~~~-~~~~~~~-~ 332 (408) T protein:vir:10 266 VDP--AIIATSSLLTNQSGLNKLALV--KTAEGKYLLE----PDP---TKPNSYLIKGKQVIVVADRWLP-NTGSTVY-P 332 (408) T ss_pred hhh--hhccCCEEEEcHHHHHHHHHh--hccCCceEec----cCc---CCCCCceecceeeEEecccccC-ccCCCce-E Confidence 111 223334789999999998753 3444443322 111 1122223332222222111111 1111222 2 Q ss_pred EEEEcCcceEEEeeCchhhhccc-ccc--C--ceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 300 MVYDKSDRNLAMANPIPFRMLAP-QMA--S--LGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 300 v~y~~~~~~~~~~vp~~~~~~~~-~~~--~--l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) ++|-+=.+.+.+..-..++.... +.. . 
-...+.++.|++ +.+.+|.++++++++ T Consensus 333 i~~gd~~~~~~~~~~~~~~v~~~~~~~~~f~~~~~~~r~~~r~d-~~v~~~~a~~~~~~~ 391 (408) T protein:vir:10 333 LYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFD-VKATDSEALVAGSFS 391 (408) T ss_pred EEEEehhccEEEEEecceEEEEcccccchhhcCceEEEEEEeec-cEEeccccEEEEEee Confidence 23322122233322233332221 111 1 134566778875 677889999999998 No 81 >protein:vir:3991 Length: 404 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116499;genbank:gi:14251132;genbank:GeneID:921252 Probab=97.97 E-value=1e-06 Score=53.46 Aligned_cols=313 Identities=8% Similarity=-0.001 Sum_probs=144.8 Q ss_pred CcccchhHHHHh---hhhhhhcccccccc---cchhhhhh-hhh-----------hhccCCceeccchhhHHHHHHHHHH Q lcl|NC_021342. 1 MAIKTIDAQTIQ---GNQWLVHKGYVSRN---GDQWVINN-TAL-----------DAIGNPNIMLDADGGIAFYISQLAG 62 (354) Q Consensus 1 ~~~~~~~~~~~~---~~~~~~~~~~~~~~---~~~~~~~~-~am-----------~a~~~~~~~~da~~~~~fl~~~L~~ 62 (354) -.++.++.|.-+ ....-..+...... ..+....+ -++ .+....++.....+.+.++.. +. T Consensus 54 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~a~~~~t~~~gg~~iP--~~ 131 (404) T protein:vir:39 54 VRRDALREQLVEAQAEQVVNMREEEKGPLNKSEYELKDKFVKEFVNMVRNPMAFLNTVSSKTETSGSDSAAGLTIP--QD 131 (404) T ss_pred HHHHHHHHHHHHHHHHHHhccccccccccccchhhhHHHHHHHHHHHHhcchhhhhhhhhhhhhcccccCCceecc--HH Confidence 000000000000 00000000000000 00000000 000 000001111111222334444 56 Q ss_pred HHHHHHHhhhhcccchhhccccCCCCCceeEEEEEee-ccccceeEecCCCcccce-eeeccceeEEEEEEEEeeEeecH Q lcl|NC_021342. 63 IEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSY-DGVTMGKFIGANGQDLPR-VAQSAQMHTVPLGYAGNECHYTL 140 (354) Q Consensus 63 Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~-~~~G~a~~~~~~~~dip~-v~~~~~~~~~pv~~~~~~~~~~~ 140 (354) +.+.+++........+.++.+. +.+....++.+... +..+.+.+++..+. +|- .....+.....+..++..+.+|. 
T Consensus 132 ~~~~ii~~~~~~~~l~~~~~~~-~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~-~~~~~~~~f~~i~~~~~k~~~~~~iS~ 209 (404) T protein:vir:39 132 IRTMINTLVRQYDSLQQYVRVE-SVSTSNGSRVYEKWTDVTPLTVMDAEDGK-IPDLDNPRLTIIKYLIKRYAGIITATN 209 (404) T ss_pred HHHHHHHHHHhhhhHHhhccee-eccCCcceEEEEeecCCccceeeecCccc-cccccccceeeEEeeeeeEEeeehhHH Confidence 6677888777777787776653 23323333433333 34466778776544 553 34567788888999888888876 Q ss_pred HHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehhhCceeeeecCCcccccccccccccCHHHHHHHHHHHHHHHH Q lcl|NC_021342. 141 DEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDASRGMYGLFNNPNVTLSSATKDYKTMNGQELFNMLNAPIFSVI 220 (354) Q Consensus 141 ~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~gi~GLlN~p~~~~~~~~~~W~~~T~~ei~~di~~~~~~l~ 220 (354) .=++.+ ..++..--....++++.+.+|+.+++|+.... +.. .. .+ +++|.+++.... T Consensus 210 ell~ds---~~~l~~~i~~~l~~~~~~~~d~~il~g~g~~~----------~~~-~~-----~~----~~~i~~~~~~~~ 266 (404) T protein:vir:39 210 TLLKDT---AENILAWLSSWIAKKVVVTRNQAIIAAMGTVP----------KKP-TI-----AK----FDDVITMINTSV 266 (404) T ss_pred HHHhhc---hHHHHHHHHHHHHHHHHHHHHHHHHhcccccc----------ccc-cc-----cc----HHHHHHHHHHhh Confidence 544332 45677788888999999999999999975421 110 01 12 344554444322 Q ss_pred HHhCCcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeeeeeeeccccccccccCcccEEE Q lcl|NC_021342. 221 NLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQLDAAELAANGVSNSNKPRYM 300 (354) Q Consensus 221 ~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~~g~g~~g~d~~v 300 (354) .. .....-.++|+|..|..|..- .+..|.-++ ..++. .+.+-.|...|........ 
.+..+.+...++ T Consensus 267 ~~--~~~~~a~~v~n~~~~~~L~~l--kd~~G~~l~----~~~~~---~~~~~~l~G~pV~~~~~~~-~~~~~~~~~~~~ 334 (404) T protein:vir:39 267 DP--AIIATSSLLTNQSGLNKLALV--KTAEGKYLL----EPDPT---KPNSYLIKGKKVIVVADRW-LPNSGSTVYPLY 334 (404) T ss_pred hh--hhccCCEEEEcHHHHHHHHHh--hccCCceee----ccCcC---CCCcceecceeEEEecccc-cCccCCCccEEE Confidence 21 222334699999999999753 233343222 11111 1222223222222221111 111122222333 Q ss_pred EEEcCcceEEEeeCchhhhcccc-c--c--CceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 301 VYDKSDRNLAMANPIPFRMLAPQ-M--A--SLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 301 ~y~~~~~~~~~~vp~~~~~~~~~-~--~--~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) +.+.. +.+.+..-..++..... . . .-...+.++.|++ +.+++|.|++.+.+. T Consensus 335 ~gd~~-~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d-~~~~~~~a~~~~~~~ 391 (404) T protein:vir:39 335 YGDMS-QAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFD-VKTTDSEALVAGSFT 391 (404) T ss_pred EEecc-ccEEEEeecceEEEEeccchhhhhhceeeEEEEeeec-cEEecccceEEEEee Confidence 33322 33333322333322111 1 1 1134566778875 788999999999988 No 82 >protein:vir:102082 Length: 392 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512315;genbank:gi:89152484;genbank:GeneID:3953075 Probab=97.93 E-value=4.9e-06 Score=49.70 Aligned_cols=307 Identities=7% Similarity=0.014 Sum_probs=140.3 Q ss_pred CcccchhHHHHhhhhhhhcc----------------ccccc--ccch---hhhhhhhhhhccCCceeccchhhHHHHHHH Q lcl|NC_021342. 1 MAIKTIDAQTIQGNQWLVHK----------------GYVSR--NGDQ---WVINNTALDAIGNPNIMLDADGGIAFYISQ 59 (354) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~----------------~~~~~--~~~~---~~~~~~am~a~~~~~~~~da~~~~~fl~~~ 59 (354) -..+.++............+ .+++. ..++ +........++ ...++ +.+.+++. 
T Consensus 46 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~---~~~t~--~~gg~~vP- 119 (392) T protein:vir:10 46 DLQRSLDEAETEERNNGREVETRNVDGEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAM---SGLTG--EDGGLVIP- 119 (392) T ss_pred HHHHHHHHHHHHHhhccccccccCccchHHHHHHHHHHHhcccccHHHHHHHhhhhhhhhc---ccccc--CCCceecc- Confidence 00111111111110000000 00000 0000 00000000010 00111 22334443 Q ss_pred HHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeeccccceeEecCCCcccceee-eccceeEEEEEEEEeeEee Q lcl|NC_021342. 60 LAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYDGVTMGKFIGANGQDLPRVA-QSAQMHTVPLGYAGNECHY 138 (354) Q Consensus 60 L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~v~-~~~~~~~~pv~~~~~~~~~ 138 (354) +.+.+.+++.....-..+.++.+. +.+...-.+.+......+.+.|++.++. +|-.+ ...+......+.++..+.+ T Consensus 120 -~~~~~~ii~~~~~~s~l~~~~~~~-~~~~~~~~~~~~~~~~~~~a~~v~E~~~-~~~~~~~~~~~v~l~~~k~~~~~~i 196 (392) T protein:vir:10 120 -QDIQTQINELARSFDALEQYVTVE-PVRTRSGSRVLEKNSDMIPFAEITEMGE-IPETDNPKFSNVQYAVKDRAGILPL 196 (392) T ss_pred -hhHHHHHHHHHHhhhhhhhhceee-eccCCceeEEEEeecCCccceeeccccc-ccccccccceeEEeeeeeEEEeehh Confidence 456677888777777776665542 1221222333444444556778776544 45333 4567778888888888888 Q ss_pred cHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehhhCceeeeecCCcccccccccccccCHHHHHHHHHHHHHH Q lcl|NC_021342. 139 TLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDASRGMYGLFNNPNVTLSSATKDYKTMNGQELFNMLNAPIFS 218 (354) Q Consensus 139 ~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~gi~GLlN~p~~~~~~~~~~W~~~T~~ei~~di~~~~~~ 218 (354) |..=|+.+ ..+|..--....++++++.+|..+++|+......|. .+ +++|.++++. 
T Consensus 197 S~ell~ds---~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~~~~-----------------~~----~d~i~~~~~~ 252 (392) T protein:vir:10 197 SRSLLQDS---DQNILKYVTKWLGKKSKVTRNVLILGVIEKLTKQAI-----------------KS----LDDIKDVLNV 252 (392) T ss_pred hHHHHhhh---HHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCc-----------------cC----HHHHHHHHHH Confidence 87655543 356788888889999999999999988765322111 11 2444444432 Q ss_pred HHHHhCCcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceee---eeeeeeeccccccccccCc Q lcl|NC_021342. 219 VINLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQ---IRFQLDAAELAANGVSNSN 295 (354) Q Consensus 219 l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~---~~~~L~~~~~~~~g~g~~g 295 (354) .... .....-.++|+|+.|..|.+-. +..|.-++ ..++ ..+.+-.|. ++...........+. ..+ T Consensus 253 ~l~~--~~~~~a~~vm~~~~~~~L~~lk--d~~G~~l~----~~~~---~~~~~~tllG~~~v~~~~~~~~~~~~~-~~~ 320 (392) T protein:vir:10 253 KLDP--AISPNAILLTNQDGFNYLDKLK--DKDGKYIL----QSDP---TQKNKKLFAGTNPVVVVSNRFLKSKGT-TAK 320 (392) T ss_pred hhhh--hhccCCEEEEcHHHHHHHHHhh--ccCCCeEe----ecCc---cCCccccccCcccEEEecccccCCCcc-cCC Confidence 2221 2223356999999999996532 33332111 1110 011111121 111111111111111 122 Q ss_pred ccEEEEEEcCcceEEEee--Cchhhhcccc---ccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 296 KPRYMVYDKSDRNLAMAN--PIPFRMLAPQ---MASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 296 ~d~~v~y~~~~~~~~~~v--p~~~~~~~~~---~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) ... ++|-+=.+.+.+.. .+.+.+.+.. 
...-...+.++.|++ +.+++|.+|+.+.++ T Consensus 321 ~~~-~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d-~~v~~~~a~~~l~~~ 382 (392) T protein:vir:10 321 KAP-LIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDD-VQMWDNEAAVYGEID 382 (392) T ss_pred ceE-EEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeec-cEEecccceEEEEec Confidence 222 33321112222222 2222222211 111134577888886 688899999999998 No 83 >protein:vir:102873 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338137;genbank:gi:77020198;genbank:GeneID:3703782 Probab=97.93 E-value=4.9e-06 Score=49.70 Aligned_cols=307 Identities=7% Similarity=0.014 Sum_probs=140.3 Q ss_pred CcccchhHHHHhhhhhhhcc----------------ccccc--ccch---hhhhhhhhhhccCCceeccchhhHHHHHHH Q lcl|NC_021342. 1 MAIKTIDAQTIQGNQWLVHK----------------GYVSR--NGDQ---WVINNTALDAIGNPNIMLDADGGIAFYISQ 59 (354) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~----------------~~~~~--~~~~---~~~~~~am~a~~~~~~~~da~~~~~fl~~~ 59 (354) -..+.++............+ .+++. ..++ +........++ ...++ +.+.+++. T Consensus 46 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~---~~~t~--~~gg~~vP- 119 (392) T protein:vir:10 46 DLQRSLDEAETEERNNGREVETRNVDGEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAM---SGLTG--EDGGLVIP- 119 (392) T ss_pred HHHHHHHHHHHHHhhccccccccCccchHHHHHHHHHHHhcccccHHHHHHHhhhhhhhhc---ccccc--CCCceecc- Confidence 00111111111110000000 00000 0000 00000000010 00111 22334443 Q ss_pred HHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeeccccceeEecCCCcccceee-eccceeEEEEEEEEeeEee Q lcl|NC_021342. 60 LAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYDGVTMGKFIGANGQDLPRVA-QSAQMHTVPLGYAGNECHY 138 (354) Q Consensus 60 L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~v~-~~~~~~~~pv~~~~~~~~~ 138 (354) +.+.+.+++.....-..+.++.+. +.+...-.+.+......+.+.|++.++. 
+|-.+ ...+......+.++..+.+ T Consensus 120 -~~~~~~ii~~~~~~s~l~~~~~~~-~~~~~~~~~~~~~~~~~~~a~~v~E~~~-~~~~~~~~~~~v~l~~~k~~~~~~i 196 (392) T protein:vir:10 120 -QDIQTQINELARSFDALEQYVTVE-PVRTRSGSRVLEKNSDMIPFAEITEMGE-IPETDNPKFSNVQYAVKDRAGILPL 196 (392) T ss_pred -hhHHHHHHHHHHhhhhhhhhceee-eccCCceeEEEEeecCCccceeeccccc-ccccccccceeEEeeeeeEEEeehh Confidence 456677888777777776665542 1221222333444444556778776544 45333 4567778888888888888 Q ss_pred cHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehhhCceeeeecCCcccccccccccccCHHHHHHHHHHHHHH Q lcl|NC_021342. 139 TLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDASRGMYGLFNNPNVTLSSATKDYKTMNGQELFNMLNAPIFS 218 (354) Q Consensus 139 ~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~gi~GLlN~p~~~~~~~~~~W~~~T~~ei~~di~~~~~~ 218 (354) |..=|+.+ ..+|..--....++++++.+|..+++|+......|. .+ +++|.++++. T Consensus 197 S~ell~ds---~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~~~~-----------------~~----~d~i~~~~~~ 252 (392) T protein:vir:10 197 SRSLLQDS---DQNILKYVTKWLGKKSKVTRNVLILGVIEKLTKQAI-----------------KS----LDDIKDVLNV 252 (392) T ss_pred hHHHHhhh---HHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCc-----------------cC----HHHHHHHHHH Confidence 87655543 356788888889999999999999988765322111 11 2444444432 Q ss_pred HHHHhCCcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceee---eeeeeeeccccccccccCc Q lcl|NC_021342. 219 VINLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQ---IRFQLDAAELAANGVSNSN 295 (354) Q Consensus 219 l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~---~~~~L~~~~~~~~g~g~~g 295 (354) .... .....-.++|+|+.|..|.+-. +..|.-++ ..++ ..+.+-.|. ++...........+. 
..+ T Consensus 253 ~l~~--~~~~~a~~vm~~~~~~~L~~lk--d~~G~~l~----~~~~---~~~~~~tllG~~~v~~~~~~~~~~~~~-~~~ 320 (392) T protein:vir:10 253 KLDP--AISPNAILLTNQDGFNYLDKLK--DKDGKYIL----QSDP---TQKNKKLFAGTNPVVVVSNRFLKSKGT-TAK 320 (392) T ss_pred hhhh--hhccCCEEEEcHHHHHHHHHhh--ccCCCeEe----ecCc---cCCccccccCcccEEEecccccCCCcc-cCC Confidence 2221 2223356999999999996532 33332111 1110 011111121 111111111111111 122 Q ss_pred ccEEEEEEcCcceEEEee--Cchhhhcccc---ccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 296 KPRYMVYDKSDRNLAMAN--PIPFRMLAPQ---MASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 296 ~d~~v~y~~~~~~~~~~v--p~~~~~~~~~---~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) ... ++|-+=.+.+.+.. .+.+.+.+.. ...-...+.++.|++ +.+++|.+|+.+.++ T Consensus 321 ~~~-~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d-~~v~~~~a~~~l~~~ 382 (392) T protein:vir:10 321 KAP-LIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDD-VQMWDNEAAVYGEID 382 (392) T ss_pred ceE-EEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeec-cEEecccceEEEEec Confidence 222 33321112222222 2222222211 111134577888886 688899999999998 No 84 >protein:vir:105004 Length: 392 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459969;genbank:gi:85701384;genbank:GeneID:3882145 Probab=97.93 E-value=4.9e-06 Score=49.70 Aligned_cols=307 Identities=7% Similarity=0.014 Sum_probs=140.3 Q ss_pred CcccchhHHHHhhhhhhhcc----------------ccccc--ccch---hhhhhhhhhhccCCceeccchhhHHHHHHH Q lcl|NC_021342. 1 MAIKTIDAQTIQGNQWLVHK----------------GYVSR--NGDQ---WVINNTALDAIGNPNIMLDADGGIAFYISQ 59 (354) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~----------------~~~~~--~~~~---~~~~~~am~a~~~~~~~~da~~~~~fl~~~ 59 (354) -..+.++............+ .+++. ..++ +........++ ...++ +.+.+++. 
T Consensus 46 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~---~~~t~--~~gg~~vP- 119 (392) T protein:vir:10 46 DLQRSLDEAETEERNNGREVETRNVDGEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAM---SGLTG--EDGGLVIP- 119 (392) T ss_pred HHHHHHHHHHHHHhhccccccccCccchHHHHHHHHHHHhcccccHHHHHHHhhhhhhhhc---ccccc--CCCceecc- Confidence 00111111111110000000 00000 0000 00000000010 00111 22334443 Q ss_pred HHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeeccccceeEecCCCcccceee-eccceeEEEEEEEEeeEee Q lcl|NC_021342. 60 LAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYDGVTMGKFIGANGQDLPRVA-QSAQMHTVPLGYAGNECHY 138 (354) Q Consensus 60 L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~v~-~~~~~~~~pv~~~~~~~~~ 138 (354) +.+.+.+++.....-..+.++.+. +.+...-.+.+......+.+.|++.++. +|-.+ ...+......+.++..+.+ T Consensus 120 -~~~~~~ii~~~~~~s~l~~~~~~~-~~~~~~~~~~~~~~~~~~~a~~v~E~~~-~~~~~~~~~~~v~l~~~k~~~~~~i 196 (392) T protein:vir:10 120 -QDIQTQINELARSFDALEQYVTVE-PVRTRSGSRVLEKNSDMIPFAEITEMGE-IPETDNPKFSNVQYAVKDRAGILPL 196 (392) T ss_pred -hhHHHHHHHHHHhhhhhhhhceee-eccCCceeEEEEeecCCccceeeccccc-ccccccccceeEEeeeeeEEEeehh Confidence 456677888777777776665542 1221222333444444556778776544 45333 4567778888888888888 Q ss_pred cHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehhhCceeeeecCCcccccccccccccCHHHHHHHHHHHHHH Q lcl|NC_021342. 139 TLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDASRGMYGLFNNPNVTLSSATKDYKTMNGQELFNMLNAPIFS 218 (354) Q Consensus 139 ~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~gi~GLlN~p~~~~~~~~~~W~~~T~~ei~~di~~~~~~ 218 (354) |..=|+.+ ..+|..--....++++++.+|..+++|+......|. .+ +++|.++++. 
T Consensus 197 S~ell~ds---~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~~~~-----------------~~----~d~i~~~~~~ 252 (392) T protein:vir:10 197 SRSLLQDS---DQNILKYVTKWLGKKSKVTRNVLILGVIEKLTKQAI-----------------KS----LDDIKDVLNV 252 (392) T ss_pred hHHHHhhh---HHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCc-----------------cC----HHHHHHHHHH Confidence 87655543 356788888889999999999999988765322111 11 2444444432 Q ss_pred HHHHhCCcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceee---eeeeeeeccccccccccCc Q lcl|NC_021342. 219 VINLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQ---IRFQLDAAELAANGVSNSN 295 (354) Q Consensus 219 l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~---~~~~L~~~~~~~~g~g~~g 295 (354) .... .....-.++|+|+.|..|.+-. +..|.-++ ..++ ..+.+-.|. ++...........+. ..+ T Consensus 253 ~l~~--~~~~~a~~vm~~~~~~~L~~lk--d~~G~~l~----~~~~---~~~~~~tllG~~~v~~~~~~~~~~~~~-~~~ 320 (392) T protein:vir:10 253 KLDP--AISPNAILLTNQDGFNYLDKLK--DKDGKYIL----QSDP---TQKNKKLFAGTNPVVVVSNRFLKSKGT-TAK 320 (392) T ss_pred hhhh--hhccCCEEEEcHHHHHHHHHhh--ccCCCeEe----ecCc---cCCccccccCcccEEEecccccCCCcc-cCC Confidence 2221 2223356999999999996532 33332111 1110 011111121 111111111111111 122 Q ss_pred ccEEEEEEcCcceEEEee--Cchhhhcccc---ccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 296 KPRYMVYDKSDRNLAMAN--PIPFRMLAPQ---MASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 296 ~d~~v~y~~~~~~~~~~v--p~~~~~~~~~---~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) ... ++|-+=.+.+.+.. .+.+.+.+.. 
...-...+.++.|++ +.+++|.+|+.+.++ T Consensus 321 ~~~-~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d-~~v~~~~a~~~l~~~ 382 (392) T protein:vir:10 321 KAP-LIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDD-VQMWDNEAAVYGEID 382 (392) T ss_pred ceE-EEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeec-cEEecccceEEEEec Confidence 222 33321112222222 2222222211 111134577888886 688899999999998 No 85 >protein:vir:107593 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338188;genbank:gi:77020144;genbank:GeneID:3703724 Probab=97.93 E-value=4.9e-06 Score=49.70 Aligned_cols=307 Identities=7% Similarity=0.014 Sum_probs=140.3 Q ss_pred CcccchhHHHHhhhhhhhcc----------------ccccc--ccch---hhhhhhhhhhccCCceeccchhhHHHHHHH Q lcl|NC_021342. 1 MAIKTIDAQTIQGNQWLVHK----------------GYVSR--NGDQ---WVINNTALDAIGNPNIMLDADGGIAFYISQ 59 (354) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~----------------~~~~~--~~~~---~~~~~~am~a~~~~~~~~da~~~~~fl~~~ 59 (354) -..+.++............+ .+++. ..++ +........++ ...++ +.+.+++. T Consensus 46 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~---~~~t~--~~gg~~vP- 119 (392) T protein:vir:10 46 DLQRSLDEAETEERNNGREVETRNVDGEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAM---SGLTG--EDGGLVIP- 119 (392) T ss_pred HHHHHHHHHHHHHhhccccccccCccchHHHHHHHHHHHhcccccHHHHHHHhhhhhhhhc---ccccc--CCCceecc- Confidence 00111111111110000000 00000 0000 00000000010 00111 22334443 Q ss_pred HHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeeccccceeEecCCCcccceee-eccceeEEEEEEEEeeEee Q lcl|NC_021342. 60 LAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYDGVTMGKFIGANGQDLPRVA-QSAQMHTVPLGYAGNECHY 138 (354) Q Consensus 60 L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~v~-~~~~~~~~pv~~~~~~~~~ 138 (354) +.+.+.+++.....-..+.++.+. +.+...-.+.+......+.+.|++.++. 
+|-.+ ...+......+.++..+.+ T Consensus 120 -~~~~~~ii~~~~~~s~l~~~~~~~-~~~~~~~~~~~~~~~~~~~a~~v~E~~~-~~~~~~~~~~~v~l~~~k~~~~~~i 196 (392) T protein:vir:10 120 -QDIQTQINELARSFDALEQYVTVE-PVRTRSGSRVLEKNSDMIPFAEITEMGE-IPETDNPKFSNVQYAVKDRAGILPL 196 (392) T ss_pred -hhHHHHHHHHHHhhhhhhhhceee-eccCCceeEEEEeecCCccceeeccccc-ccccccccceeEEeeeeeEEEeehh Confidence 456677888777777776665542 1221222333444444556778776544 45333 4567778888888888888 Q ss_pred cHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehhhCceeeeecCCcccccccccccccCHHHHHHHHHHHHHH Q lcl|NC_021342. 139 TLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDASRGMYGLFNNPNVTLSSATKDYKTMNGQELFNMLNAPIFS 218 (354) Q Consensus 139 ~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~gi~GLlN~p~~~~~~~~~~W~~~T~~ei~~di~~~~~~ 218 (354) |..=|+.+ ..+|..--....++++++.+|..+++|+......|. .+ +++|.++++. T Consensus 197 S~ell~ds---~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~~~~-----------------~~----~d~i~~~~~~ 252 (392) T protein:vir:10 197 SRSLLQDS---DQNILKYVTKWLGKKSKVTRNVLILGVIEKLTKQAI-----------------KS----LDDIKDVLNV 252 (392) T ss_pred hHHHHhhh---HHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCc-----------------cC----HHHHHHHHHH Confidence 87655543 356788888889999999999999988765322111 11 2444444432 Q ss_pred HHHHhCCcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceee---eeeeeeeccccccccccCc Q lcl|NC_021342. 219 VINLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQ---IRFQLDAAELAANGVSNSN 295 (354) Q Consensus 219 l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~---~~~~L~~~~~~~~g~g~~g 295 (354) .... .....-.++|+|+.|..|.+-. +..|.-++ ..++ ..+.+-.|. ++...........+. 
..+ T Consensus 253 ~l~~--~~~~~a~~vm~~~~~~~L~~lk--d~~G~~l~----~~~~---~~~~~~tllG~~~v~~~~~~~~~~~~~-~~~ 320 (392) T protein:vir:10 253 KLDP--AISPNAILLTNQDGFNYLDKLK--DKDGKYIL----QSDP---TQKNKKLFAGTNPVVVVSNRFLKSKGT-TAK 320 (392) T ss_pred hhhh--hhccCCEEEEcHHHHHHHHHhh--ccCCCeEe----ecCc---cCCccccccCcccEEEecccccCCCcc-cCC Confidence 2221 2223356999999999996532 33332111 1110 011111121 111111111111111 122 Q ss_pred ccEEEEEEcCcceEEEee--Cchhhhcccc---ccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 296 KPRYMVYDKSDRNLAMAN--PIPFRMLAPQ---MASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 296 ~d~~v~y~~~~~~~~~~v--p~~~~~~~~~---~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) ... ++|-+=.+.+.+.. .+.+.+.+.. ...-...+.++.|++ +.+++|.+|+.+.++ T Consensus 321 ~~~-~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d-~~v~~~~a~~~l~~~ 382 (392) T protein:vir:10 321 KAP-LIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDD-VQMWDNEAAVYGEID 382 (392) T ss_pred ceE-EEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeec-cEEecccceEEEEec Confidence 222 33321112222222 2222222211 111134577888886 688899999999998 No 86 >protein:vir:6242 Length: 390 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813696;swissprot:trembl:q859c1;genbank:gi:29366756;interpro:IPR006444;uniprot:Q859C1;genbank:GeneID:1258897 Probab=97.91 E-value=3.9e-06 Score=50.22 Aligned_cols=317 Identities=8% Similarity=-0.025 Sum_probs=150.2 Q ss_pred CcccchhH--------------HHHh--hhhhhhc--ccccccccchhh----hh-hhhhhhccCCceeccchhhHHHHH Q lcl|NC_021342. 1 MAIKTIDA--------------QTIQ--GNQWLVH--KGYVSRNGDQWV----IN-NTALDAIGNPNIMLDADGGIAFYI 57 (354) Q Consensus 1 ~~~~~~~~--------------~~~~--~~~~~~~--~~~~~~~~~~~~----~~-~~am~a~~~~~~~~da~~~~~fl~ 57 (354) ..++.||. +.-. +..+... ........+.+- .. ..+..........+.+.+ +.++. 
T Consensus 45 ~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~r~~~~~~~~~~~t~~~~-g~~~~ 123 (390) T protein:vir:62 45 TAVSDYDARIKRGIEAIKAIDPVTSLLSGLQGSGSGAQRSADVDDDATLRAGNLGEARSFEFAPEKRDGTKAGN-PNVLS 123 (390) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccchhhcchHHHHHHhhhhhhhhHHHHhhhhhhcccccCC-Ccccc Confidence 11222222 1111 0000000 000000000000 00 000000000001112222 22232 Q ss_pred HHHHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeeccccceeEecCCCcccceeeeccceeEEEEEEEEeeEe Q lcl|NC_021342. 58 SQLAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYDGVTMGKFIGANGQDLPRVAQSAQMHTVPLGYAGNECH 137 (354) Q Consensus 58 ~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~v~~~~~~~~~pv~~~~~~~~ 137 (354) . +.....+.+........|.+..+..-.+ ...+.+......+.+.|++.. ..+|-.+...+........++.-+. T Consensus 124 ~--~~~~~~i~~~~~~~~~l~~~~~~~~~~~--~~~~~~p~~~~~~~a~wv~E~-~~~~~~~~~f~~i~~~~~k~~~~~~ 198 (390) T protein:vir:62 124 R--TLYGQLIAQAVERSAIMRGGATTFTTSD--ANPLDFTVITGRSSASIVGET-AEIPESYPATAQRSMGGFKYGFASV 198 (390) T ss_pred c--cchHHHHHHHHhhhhhhhhcceeeecCC--CceeEEEEEcCCcceeeeccc-ccccccccceeeeEeeeeeEEeehH Confidence 2 2233334444433333444443322111 123556667777788888754 4578888888888889999998888 Q ss_pred ecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehhhCceeeeecCCcccccccccccccCHHHHHHHHHHHHH Q lcl|NC_021342. 138 YTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDASRGMYGLFNNPNVTLSSATKDYKTMNGQELFNMLNAPIF 217 (354) Q Consensus 138 ~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~gi~GLlN~p~~~~~~~~~~W~~~T~~ei~~di~~~~~ 217 (354) +|.+=|+.+ ..++..--....+.+++..+|+-+++|+.. -.|++|+++....+......+ .-.+++|.+++. 
T Consensus 199 iS~ell~ds---~~~l~~~i~~~l~~~i~~~~d~~~l~G~G~--p~Gi~~~~~~~~~~~~~~~~~---~~~~~~l~~~~~ 270 (390) T protein:vir:62 199 VSYEFATDQ---VLDLVGFLVSDAGPAIGDAMGRHFITGTGQ--PRGILTDASPATATFLATDTD---SKVSDALIDLFH 270 (390) T ss_pred HHHHHHhhh---hHHHHHHHHHHHHHHHHHHHHhhhhccCCc--cccccccccccccceeccccc---ccchHHHHHHHH Confidence 876655543 456778888889999999999999999863 368999876543322222211 122566777777 Q ss_pred HHHHHhCCcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeeeeeeeccccccccccCccc Q lcl|NC_021342. 218 SVINLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQLDAAELAANGVSNSNKP 297 (354) Q Consensus 218 ~l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~~g~g~~g~d 297 (354) +|... +. ..-..+|+|+.|..|.+-. +..+. ||-.-++ ..|.+-.+...|.+....... + T Consensus 271 ~l~~~--~~-~~a~~vmn~~~~~~L~~lk--d~~g~----~l~~~~~---~~g~~~~l~G~Pv~~~~~~p~--------~ 330 (390) T protein:vir:62 271 EVPSA--YR-ANAKYVVNDLRAAQMRKLK--DANGQ----YLWQSGL---TVGAPSLFNGKVVETDDGMPA--------D 330 (390) T ss_pred hhhhh--hh-cCCEEEEchHHHHHHHHhh--ccCCC----eeecCCc---CCCccceecccceEEecCCCC--------c Confidence 66432 22 1236899999999986532 33332 1111000 112222232223332222111 1 Q ss_pred EEEEEEcCcceEEEeeCchhhhccc-cc--cCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 298 RYMVYDKSDRNLAMANPIPFRMLAP-QM--ASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 298 ~~v~y~~~~~~~~~~vp~~~~~~~~-~~--~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) . |+|- |-..+.+..-.+++.... +. 
..-...+..+.|++ ..+..|.|++.+.++ T Consensus 331 ~-i~~g-d~s~~~i~~~~~~~v~~~~~~~~~~~~~~~~~~~r~d-~~~~~~~A~~~l~~~ 387 (390) T protein:vir:62 331 K-ILFA-DLSKYRVRFAGSLRVDRSVDAKFSTDQIVYRFLQRAD-GLLVDARGAKVLTVT 387 (390) T ss_pred c-EEEe-eccceeEEeecceEEEeeccccccCCcEEEEEEEEeC-cEeechhheEEEEee Confidence 1 2221 111111111122222111 11 11134556778886 579999999999999 No 87 >protein:vir:98635 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039923;genbank:gi:126011098;genbank:GeneID:4818471 Probab=97.90 E-value=9.1e-06 Score=48.22 Aligned_cols=312 Identities=8% Similarity=-0.037 Sum_probs=139.0 Q ss_pred Ccc-----------------cch-----hHHHHh-hhhhhhcccccccccchhhhhhhhhhhccCCceeccchhhHHHHH Q lcl|NC_021342. 1 MAI-----------------KTI-----DAQTIQ-GNQWLVHKGYVSRNGDQWVINNTALDAIGNPNIMLDADGGIAFYI 57 (354) Q Consensus 1 ~~~-----------------~~~-----~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~am~a~~~~~~~~da~~~~~fl~ 57 (354) -++ +.+ .....+ .+.+...+ .-..++.+... +++..... ..+ +.+.+++ T Consensus 20 ~~~~~~~~~ee~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~-~~~~lt~ee~~---~~~~~~~~--~~~--~~gg~~v 91 (377) T protein:vir:98 20 AKISAGATSEEQEKLFEAAFTTMGDEILAKNEEEMERMFDLRD-KNRELTAEEIK---FFNDIDKN--VGG--KDKFKLL 91 (377) T ss_pred HHHHhhhhhHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHhcc-CCcccCHHHHH---HHHHHHhc--cCC--CCCcccc Confidence 000 000 000011 22222111 11222222111 22221111 122 2233333 Q ss_pred HHHHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeeccccceeEecCCCcccceeeeccceeEEEEEEEEeeEe Q lcl|NC_021342. 58 SQLAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYDGVTMGKFIGANGQDLPRVAQSAQMHTVPLGYAGNECH 137 (354) Q Consensus 58 ~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~v~~~~~~~~~pv~~~~~~~~ 137 (354) . +.+..+|++.....-..|.++.+..-.+ . ..+...+..+.+.|++..+.--+..+...+....+.+.++.-.. 
T Consensus 92 P--~~~~~~I~~~l~~~s~i~~~~~v~~~~~-~---~~~~~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~a~~~ 165 (377) T protein:vir:98 92 P--EETMVQVFDDLVAEHPLLKVINFKNTSL-R---LKALTAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVV 165 (377) T ss_pred C--HHHHHHHHHHHHHhhhhhhheeeEecCc-c---eEEEEecCCcceeEeecccccCcccCccceeEeecceeEEeeec Confidence 3 4455566665554444455544332222 1 23445566777888765433222345556777788888887777 Q ss_pred ecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehhhCceeeeecCCcccccccccccccCHH---HHHHHHHH Q lcl|NC_021342. 138 YTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDASRGMYGLFNNPNVTLSSATKDYKTMNGQ---ELFNMLNA 214 (354) Q Consensus 138 ~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~gi~GLlN~p~~~~~~~~~~W~~~T~~---ei~~di~~ 214 (354) +|.+=|+.+ ..++..--....++++++.+++-+++|+...+-.|||+++...+....+.+...+.. +.+.++.. T Consensus 166 is~elL~ds---~~~ie~~i~~~la~~~a~~~~~a~i~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~ 242 (377) T protein:vir:98 166 IPKDALKFG---PKWIKQFITEQLKEAIAVALELAIVKGDGLLQPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSD 242 (377) T ss_pred ccHHhhhcc---HhHHHHHHHHHHHHHHHHHHhhceEeccCCCcceeeeecccccccccccccccccccchhhhHhhhhh Confidence 765544433 447888888899999999999999999988888999998754332222111111111 11111110 Q ss_pred ------------HHHHHH----HHhCCcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeee Q lcl|NC_021342. 215 ------------PIFSVI----NLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIR 278 (354) Q Consensus 215 ------------~~~~l~----~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~ 278 (354) +++.+. ..-+...+.+.++++|..+..+.-.+...+ .++.+...-|.|+.+.. 
T Consensus 243 ~~~~~~~~~a~~~m~~~t~~~~~klkd~~G~~i~~~n~~~~~~~~p~~~~~~----------~~G~~~t~lg~p~~vv~- 311 (377) T protein:vir:98 243 LTPDNAPKKLVPVMKHLSVNDKKRPLKIAGQVKLILNPEDRWALEAQFTSRN----------QFGEYVTVLPHGITILE- 311 (377) T ss_pred hchhHHHHHHHHHHHHHHHHHHhhhhccCCceEEEecccchhhccccccccC----------CCCccccccCCCceEEe- Confidence 111110 001123455667777776655431110000 11111111122322221 Q ss_pred eeeeeccccccccccCcccEEEEEEcCcceEEEeeCchhhhccccccCc--eeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 279 FQLDAAELAANGVSNSNKPRYMVYDKSDRNLAMANPIPFRMLAPQMASL--GITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 279 ~~L~~~~~~~~g~g~~g~d~~v~y~~~~~~~~~~vp~~~~~~~~~~~~l--~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) +.......+--+.-.++++.++. -+++..-. +.... ...+....|.+ -.++.|.|++.++|+ T Consensus 312 ----s~~~p~~~i~fgdf~~Y~i~~r~--~~~i~~~~-------~~~~~~d~~~f~~~~r~d-g~~~~~~a~~vl~i~ 375 (377) T protein:vir:98 312 ----SLAVETGKAIAFVANRYDAFMAT--ASTIEEYD-------QTFAMEDLQLYLTKNYFY-GKAKDNHTAALLTLA 375 (377) T ss_pred ----cCCCCcccEEEEEecceeEEeec--ceEEEeec-------hhhhhcCceEEEEEEEEc-CEEeccCcEEEEEEe Confidence 11100000000111123333322 12221111 11111 23455566765 488999999999999 No 88 >protein:vir:7409 Length: 408 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839926;genbank:gi:30089896;genbank:GeneID:1260683 Probab=97.83 E-value=2.6e-06 Score=51.19 Aligned_cols=312 Identities=10% Similarity=-0.011 Sum_probs=142.8 Q ss_pred Ccccchh---HH--HHh--hhhhhhcc--cccccccchhhhhhh-h-----------hhhccCCceeccchhhHHHHHHH Q lcl|NC_021342. 1 MAIKTID---AQ--TIQ--GNQWLVHK--GYVSRNGDQWVINNT-A-----------LDAIGNPNIMLDADGGIAFYISQ 59 (354) Q Consensus 1 ~~~~~~~---~~--~~~--~~~~~~~~--~~~~~~~~~~~~~~~-a-----------m~a~~~~~~~~da~~~~~fl~~~ 59 (354) -..+.+| .| ..+ ........ ........+...... + .......++.....+.+.+++. 
T Consensus 51 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~vP- 129 (408) T protein:vir:74 51 NEKVRRDALREQLVEAQAEQVVNMREEEKGPLNKSENELKDKFVKDFVNMVRNPMAFLNTVSSKTETSGSDSAAGLTIP- 129 (408) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhccccccccccchhhhhHHHHHHHHHHHHhcchhhhhhhhhhhhcccccCCCceeec- Confidence 0000001 00 000 00000000 000000000000000 0 0000011111112222334444 Q ss_pred HHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeecccc-ceeEecCCCcccce-eeeccceeEEEEEEEEeeEe Q lcl|NC_021342. 60 LAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYDGVT-MGKFIGANGQDLPR-VAQSAQMHTVPLGYAGNECH 137 (354) Q Consensus 60 L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~G-~a~~~~~~~~dip~-v~~~~~~~~~pv~~~~~~~~ 137 (354) +.+.+.|++........+.++.+.. .+.....+.+......+ .+.+++.. ..+|- .+...+......+.++.... T Consensus 130 -~~~~~~Ii~~~~~~~~l~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~v~E~-~~~~~~~~~~~~~i~~~~~k~~~~~~ 206 (408) T protein:vir:74 130 -QDIRTMINTLVRQYDSLQQYVRVES-VSTSSGSRVYEKWTDVTPLKAMDEED-GKIPDLDNPRLTIIKYLIKRYAGIIT 206 (408) T ss_pred -hhHhhHHHHHHhhhcchhhhcceee-ccCCcceEEEEeecCCcccccccccc-cccccccccceeeEEeeeeeEEeeeh Confidence 5667788888888877888766432 22233344444444443 33455444 34553 34667888888899888888 Q ss_pred ecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehhhCceeeeecCCcccccccccccccCHHHHHHHHHHHHH Q lcl|NC_021342. 138 YTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDASRGMYGLFNNPNVTLSSATKDYKTMNGQELFNMLNAPIF 217 (354) Q Consensus 138 ~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~gi~GLlN~p~~~~~~~~~~W~~~T~~ei~~di~~~~~ 217 (354) +|..=++. ...++..--....++++...+|+.+++|+....-. . .. .+. ++|.+++. 
T Consensus 207 iS~ell~d---s~~~l~~~i~~~l~~~~~~~~d~~il~G~G~~~~~----------~-~~-----~~~----~~i~~~~~ 263 (408) T protein:vir:74 207 ATNTLLKD---TAENILAWLSSWIAKKVVVTRNQAIIAAMGTVPKK----------P-TI-----ANF----DDVITMIN 263 (408) T ss_pred hHHHHHhh---chHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc----------c-cc-----ccH----HHHHHHHH Confidence 87654443 34567788888899999999999999996532110 0 01 123 34444443 Q ss_pred -HHHHHhCCcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeeeeeeeccccccccccCcc Q lcl|NC_021342. 218 -SVINLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQLDAAELAANGVSNSNK 296 (354) Q Consensus 218 -~l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~~g~g~~g~ 296 (354) .+.. .....-.++|+|..|..|.+-. +..|.-++ ..++ ..|.+-.|...|........... .+.++ T Consensus 264 ~~l~~---~~~~~a~~v~n~~~~~~l~~lk--d~~G~~l~----~~~~---~~~~~~~l~G~pV~~~~~~~~~~-~~~~~ 330 (408) T protein:vir:74 264 TSVDP---AIIATSSLLTNQSGLNKLALVK--TAEGKYLL----EPDP---TKPNSYLIKGKQVIVVADRWLPN-SGSTV 330 (408) T ss_pred Hhhhh---hhcCCCEEEEcHHHHHHHHHhh--cCCCceEe----ccCc---CCCCCceecceeeEEecCccccc-ccCCc Confidence 2221 2223346899999999997533 33343222 1111 11222233333332222111111 11222 Q ss_pred cEEEEEEcCcceEEEee--Cchhhhccccc-c--CceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 297 PRYMVYDKSDRNLAMAN--PIPFRMLAPQM-A--SLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 297 d~~v~y~~~~~~~~~~v--p~~~~~~~~~~-~--~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) ..++..+.. +.+.+.. .+.+.+.+-.. . 
.-...+.++.|++| .+++|.|++.++++ T Consensus 331 ~~i~~gd~~-~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~-~~~~~~a~~~~~~~ 391 (408) T protein:vir:74 331 YPLYYGDMS-QAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDV-KATDSEALVAGSFT 391 (408) T ss_pred ceEEEEehh-ccEEEEEecceEEEEeccccchhhcceeeEEEEEeeCc-EEecccceEEEEee Confidence 222322322 2222221 12222222111 1 12356677888865 68889999999997 No 89 >protein:vir:4953 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049929;genbank:gi:9632900;genbank:GeneID:1262076 Probab=97.81 E-value=5e-06 Score=49.67 Aligned_cols=312 Identities=6% Similarity=-0.057 Sum_probs=145.8 Q ss_pred CcccchhHH---HHhhhhhhhcccccccc---cchhhhh-hhhhhhcc-CC------ceeccchhhHHHHHHHHHHHHHH Q lcl|NC_021342. 1 MAIKTIDAQ---TIQGNQWLVHKGYVSRN---GDQWVIN-NTALDAIG-NP------NIMLDADGGIAFYISQLAGIEAT 66 (354) Q Consensus 1 ~~~~~~~~~---~~~~~~~~~~~~~~~~~---~~~~~~~-~~am~a~~-~~------~~~~da~~~~~fl~~~L~~Id~~ 66 (354) ..++.++.+ .......-..+...... ..+.... .-++.... .. .+.....+.+.+++. +.+.+. T Consensus 51 ~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~t~~~gg~~vP--~~~~~~ 128 (397) T protein:vir:49 51 MKRDMFKEQYTEARANEVANMSEEEKKPLTKSEEEVKAGFVKDFKNLVRGRYQNLLDSKTDASGSDAGLTIP--QDIQTA 128 (397) T ss_pred HHHHHHHHHHHHHHHHhhhccccccccccccchhHHHHHHHHHHHHHHhcchhHHHHHhhccccccCccccc--HhHHHH Confidence 000000000 00000000001000000 0000000 00111100 00 001111122334444 456677 Q ss_pred HHHhhhhcccchhhccccCCCCCceeEEEEEee-ccccceeEecCCCcccce-eeeccceeEEEEEEEEeeEeecHHHHH Q lcl|NC_021342. 67 VYETPYGDITYRFDVPMAANIPEYADTWMYRSY-DGVTMGKFIGANGQDLPR-VAQSAQMHTVPLGYAGNECHYTLDEMR 144 (354) Q Consensus 67 v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~-~~~G~a~~~~~~~~dip~-v~~~~~~~~~pv~~~~~~~~~~~~El~ 144 (354) |++........++++.+.. .+.....+.+... +..+.+.+++.++. 
+|- .....+.....++.++.-+.+|..=++ T Consensus 129 ii~~~~~~~~l~~~~~~~~-~~~~~~~~~~~~~~~~~~~a~~v~E~~~-~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ 206 (397) T protein:vir:49 129 IHTLVSQYDSLQEYVNVEN-VTTLTGSRVYEKWTDITGLANIDDEAGK-IADVDDPKLSLIKYTIKRYAGISTVTNSLLA 206 (397) T ss_pred HHHHHHhhhhHHhhhceee-cccCccceEEEeeccCCcceeeecCccc-cccccccceeeEEeeeeeEEeeehhHHHHHh Confidence 8888777777777765432 2212223334333 34567888876544 553 345677888888998888887755443 Q ss_pred HHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehhhCceeeeecCCcccccccccccccCHHHHHHHHHHHHHHHHHHhC Q lcl|NC_021342. 145 KSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDASRGMYGLFNNPNVTLSSATKDYKTMNGQELFNMLNAPIFSVINLSR 224 (354) Q Consensus 145 ~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~gi~GLlN~p~~~~~~~~~~W~~~T~~ei~~di~~~~~~l~~~s~ 224 (354) .+ ..++..--....++++++.+|+-+++|+......+ ...+ +++|.+++.++... T Consensus 207 ds---~~~l~~~i~~~l~~~~~~~~d~ai~~G~g~~~~~~-----------~~~~---------~d~i~~~~~~l~~~-- 261 (397) T protein:vir:49 207 DS---AENILAWLSGWIAKKVVVTRNKAILEAIAALPTKP-----------TLTK---------WDDIIDLEAKVDPA-- 261 (397) T ss_pred hh---HHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc-----------cccc---------HHHHHHHHHhhhhh-- Confidence 33 35677778888999999999999999976432111 0111 45677777777653 Q ss_pred CcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeeeeeeeccccccccccCcccEEEEEEc Q lcl|NC_021342. 225 RFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQLDAAELAANGVSNSNKPRYMVYDK 304 (354) Q Consensus 225 g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~~g~g~~g~d~~v~y~~ 304 (354) ......++|+|..|..|.+- .+..|.-++ ..++. .+.+-.|...|......-. ...+..+.. 
.++|-+ T Consensus 262 -~~~~a~~vmn~~~~~~l~~l--kd~~G~~l~----~~~~~---~~~~~~l~G~PV~~~~~~~-~~~~~~~~~-~i~~gd 329 (397) T protein:vir:49 262 -IKQTSFFLTNTSGFTALKKV--KNALGDYLM----ERDVK---SPTGYSIDGFAVKEVADRW-LANGTGGAM-PLYFGD 329 (397) T ss_pred -hcCCCEEEEcHHHHHHHHHh--hcCCCceee----ccCcC---CCCCceecceeeEEecccc-cccccCCce-eEEEee Confidence 23446899999999999653 244443222 11111 1222223222222111100 011112222 233322 Q ss_pred CcceEEEeeCchhh--hcccc---ccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 305 SDRNLAMANPIPFR--MLAPQ---MASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 305 ~~~~~~~~vp~~~~--~~~~~---~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) =.+.+.+..-..++ +.+.. ...-...+.++.|++ +.+++|.+++.+.++ T Consensus 330 ~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d-~~~~~~~a~~~~~~~ 383 (397) T protein:vir:49 330 LKQAVTLFDRQHMSLLSTNIGGGAFETDTTKVRVIDRFD-VVATDTEAFVPASFK 383 (397) T ss_pred ccceEEEEeecceEEEEeccccchhhcCceeEEEEeeeC-cEEecccceEEEEee Confidence 12223322212222 21111 111124455677775 688999999999999 No 90 >protein:vir:102655 Length: 322 # NCBI annotation: Hypothetical protein # Family: family:all:6384 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_052979;genbank:gi:50282923;genbank:GeneID:2948122 Probab=97.80 E-value=1.6e-05 Score=46.88 Aligned_cols=301 Identities=8% Similarity=-0.043 Sum_probs=140.4 Q ss_pred hhhhhccCCceeccchhhHHHHHHHHHHHHHHHHHhhhhcccchhhccccCCCCCce--eEEEEEeecccccee---Eec Q lcl|NC_021342. 35 TALDAIGNPNIMLDADGGIAFYISQLAGIEATVYETPYGDITYRFDVPMAANIPEYA--DTWMYRSYDGVTMGK---FIG 109 (354) Q Consensus 35 ~am~a~~~~~~~~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~--~~~~~~~~~~~G~a~---~~~ 109 (354) .++.++.+....|..+-.-+|..+-...+. .+++.....|... +-..+...... .++.-..+..+|+.. 
..+ T Consensus 1 ~~~~~~~~~~~~Ms~~i~~~fv~qy~~~v~-~~~qq~~s~L~~t--V~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (322) T protein:vir:10 1 MKLNAIMSMLPLIAGDIDQAFVQTYETTLR-ILSQQKSAKLKQY--CQHKNESSESHNWETLASMDPDAVKRKRSRQQSA 77 (322) T ss_pred CcccceeeeeeeeechhhhHHHHHHHHHHH-HHHHHhhhhhhcc--cccccccccccceeeccccccccccccccccccc Confidence 355565555444444322355532222222 2333333333322 22222222111 111111111223222 233 Q ss_pred CCCcccceeeeccceeEEEEEEEEeeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehhhCceeeeecC Q lcl|NC_021342. 110 ANGQDLPRVAQSAQMHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDASRGMYGLFNNP 189 (354) Q Consensus 110 ~~~~dip~v~~~~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~gi~GLlN~p 189 (354) +..-|.|..+....+....+..+..++.+ .+++.++ +..++...-.+++..+++++.|++++.|--+....|. + T Consensus 78 d~~~dtp~~~~~~~~r~~~~~d~~~~~~V--Dd~D~~k-~~~D~~~~~~~~~a~AL~R~~D~~I~~a~~g~a~~~~---~ 151 (322) T protein:vir:10 78 DGTYPTPVNNKPFAKRRTNVDTYDTGHVV--EQEDISQ-MLLDPNSALITSQAYAMARKTDDLIIAGAWKPASIKG---T 151 (322) T ss_pred CcccCCCccccccceEEEeecccccceec--chHHHHH-hhcCchHHHHHHHHHHhhhHHHHHHHhhhhccccccc---c Confidence 44446777666666666777777766655 4555553 4567778888899999999999988865322221221 1 Q ss_pred Cccccccccc-ccccCHHHHHHHHHHHHHHHHHHhCCcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCccccc Q lcl|NC_021342. 190 NVTLSSATKD-YKTMNGQELFNMLNAPIFSVINLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLL 268 (354) Q Consensus 190 ~~~~~~~~~~-W~~~T~~ei~~di~~~~~~l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~ 268 (354) +.++....+. -...+..--++.|.++...|.+..---.++..++++|+.|..|+.-. ..++ -+|.-.+-. .. 
T Consensus 152 gt~v~~~ss~~i~~g~~g~t~~kl~~a~~~l~~~dvp~d~~R~~vv~p~~~~~LL~d~--~~ts---~D~~~~~~l--~~ 224 (322) T protein:vir:10 152 GQPVEFLATQEIGDGTKPISFDYVTEITERFLENEIEPEVSKVIVIGPTQARKLLQIT--EATS---ADYTSAMDL--QS 224 (322) T ss_pred ccccccCCCcccccCccchhHHHHHHHHHHHHhcCCCCCCCeEEEeCHHHHHHHhcch--hhhh---hhcccchhh--hh Confidence 1111000000 00011122255677777777664322223457999999999998521 2221 233211111 01 Q ss_pred ccccceeeeeeeeeeccccc----------cccccCcccEEEEEEcCcceEEEeeCchhhhccccc--cCceeEEeeeee Q lcl|NC_021342. 269 TGNELDIQIRFQLDAAELAA----------NGVSNSNKPRYMVYDKSDRNLAMANPIPFRMLAPQM--ASLGITVPAEYK 336 (354) Q Consensus 269 ~g~~l~I~~~~~L~~~~~~~----------~g~g~~g~d~~v~y~~~~~~~~~~vp~~~~~~~~~~--~~l~~~~~~~~~ 336 (354) +|..-.+-...|+....+.. .+...+.+-..++|.++ -+.+..-.+++.--.+- +...+.+..... T Consensus 225 ~G~ig~~lGf~~i~s~~lp~~~~t~~~~~~~~~~~~~~~~~~a~~k~--Av~~a~~~dv~~~i~~~~~~~~a~~I~~~~~ 302 (322) T protein:vir:10 225 KGIITNWMGYTWIVSTRLDKFDPTQWGMAAEDGPQGDEIWCIAMTDM--ALGYHSCKDIWTKVAEDPSASFAWRIYSAFT 302 (322) T ss_pred cCeeeeeeeEEEEEeccCCccccccccccccCCCCccceeEEEEecC--ceeEEEeeeeeEEeeccCCcchhhhhhhhhh Confidence 12222233333333332211 11112233445677654 34444433333311221 222345666666 Q ss_pred eeeEEEECCceeEeeecC Q lcl|NC_021342. 337 ISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 337 ~gGv~i~~P~ai~y~D~~ 354 (354) +|.+++ +|..++.+|.- T Consensus 303 ~Ga~ri-~~~gVv~i~~~ 319 (322) T protein:vir:10 303 ADCVRV-EDEHIFKLRLK 319 (322) T ss_pred hCceEe-ccCcEEEEEEe Confidence 665555 88999998888 No 91 >protein:vir:4511 Length: 409 # NCBI annotation: capsid # Family: family:all:21 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599037;genbank:gi:19548995;genbank:GeneID:935211 Probab=97.80 E-value=1.9e-05 Score=46.51 Aligned_cols=317 Identities=12% Similarity=-0.005 Sum_probs=148.2 Q ss_pred Ccccch------hHHHHhhhhhhhc----c-------------------cccccccchhhhhhhhhhhccCCceeccchh Q lcl|NC_021342. 
1 MAIKTI------DAQTIQGNQWLVH----K-------------------GYVSRNGDQWVINNTALDAIGNPNIMLDADG 51 (354) Q Consensus 1 ~~~~~~------~~~~~~~~~~~~~----~-------------------~~~~~~~~~~~~~~~am~a~~~~~~~~da~~ 51 (354) -.||.. +...+..+..... + +.......+....-....+. ..+++++ T Consensus 49 ~~i~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~l~~~~~~~~~~e~~~~~~~~a~---~~~~~~~- 124 (409) T protein:vir:45 49 ERIAREEELRRQDQAYIESNEEEQRQNLDPENNSQQDEKRAQVFDKWMRHGASELTSEERKALRELRAQ---GVAQDEK- 124 (409) T ss_pred HHHHHHHHHHHHHHHHHhhhhhhhcccCCCCCcchhhHHHHHHHHHHHHhhhhhccHHHHHHHHHHhhc---cCccCcC- Confidence 001100 0111111110000 0 00000111111100112221 1123322 Q ss_pred hHHHHHHHHHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeeccc-cceeEecCCCcccceeeeccceeEEEEE Q lcl|NC_021342. 52 GIAFYISQLAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYDGV-TMGKFIGANGQDLPRVAQSAQMHTVPLG 130 (354) Q Consensus 52 ~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~-G~a~~~~~~~~dip~v~~~~~~~~~pv~ 130 (354) +.+++. +.+...|++........+.+..+..-.+ + ..+.+...+.. ..+.+++... .+|..+..........+ T Consensus 125 -gg~liP--~~~~~~ii~~~~~~~~l~~~~~~~~~~~-~-~~~~~~~~~~~~~~~~~v~E~~-~~~~~~~~f~~~~l~~~ 198 (409) T protein:vir:45 125 -GGYTVP--ETFLAKVVEKMKSYGGIASVAQILTTSD-G-RTMEWATADGTSEVGVLLGENE-EAGEEDTDFGMGSLGAL 198 (409) T ss_pred -Cceecc--HhHHHHHHHHHHhhhhhhhhceeeecCC-C-ceEEEEeeccCccccccccccc-cccccccccceeeeeee Confidence 334444 4456778887777777777655432212 1 22333344333 3445665543 35666666655555444 Q ss_pred EEE-eeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehh---hCceeeeecCCcccccccccccccCHH Q lcl|NC_021342. 131 YAG-NECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDAS---RGMYGLFNNPNVTLSSATKDYKTMNGQ 206 (354) Q Consensus 131 ~~~-~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~---~gi~GLlN~p~~~~~~~~~~W~~~T~~ 206 (354) ... .-..+|.+=|+.+ ..++..--....+.++...+|+.+++|+.. 
.+..|+++.+.....+..++ ..| T Consensus 199 k~~~~~i~is~ell~ds---~~~l~~~i~~~la~a~~~~~~~a~l~G~G~~~~~~p~Gil~~~~~~~~~~~~~--~~~-- 271 (409) T protein:vir:45 199 KMTSKIIRVSNELLQDS---AIDMEAYLARRIAERIGRGEARYLIQGTGAGTPKQPKGLAASVTGTTQTAAAN--AVK-- 271 (409) T ss_pred eeeeeehhhhHHHHhcc---HHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCccccceeeecccccccccccc--ccc-- Confidence 443 2334555444333 346777778888999999999999999864 36789998876432222111 112 Q ss_pred HHHHHHHHHHHHHHHHhCCccccc-EEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeeeeeeecc Q lcl|NC_021342. 207 ELFNMLNAPIFSVINLSRRFHVPN-TALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQLDAAE 285 (354) Q Consensus 207 ei~~di~~~~~~l~~~s~g~~~p~-~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~ 285 (354) ++||.+++..|... +...+. .+++++..|..|..- .+..|.-++ ...+ ..|.+-.+...|.+.... T Consensus 272 --~d~i~~l~~~l~~~--~~~~a~~~~~~n~~~~~~l~~l--kd~~G~~i~----~~~~---~~~~~~~l~G~PV~~~~~ 338 (409) T protein:vir:45 272 --WQEILALKHSIDPA--YRRGPKFRLAFNDNTLKLISEM--EDGQGRPLW----LPDI---VGVAPASVLNVPYVIDQE 338 (409) T ss_pred --hHHHHHHHHhhhhh--hccCCeEEEEECHHHHHHHHHh--hcCCCceee----ccCc---CCCCCceecceeeEEecC Confidence 46677777777542 333333 467899999888642 244443221 1111 123333444444444443 Q ss_pred ccccccccCcccEEEEE-EcCcceEEEeeCchhhh--ccccc-cCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 286 LAANGVSNSNKPRYMVY-DKSDRNLAMANPIPFRM--LAPQM-ASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 286 ~~~~g~g~~g~d~~v~y-~~~~~~~~~~vp~~~~~--~~~~~-~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) .... + .+.+. |+| +.+. .+ +..-..++. 
+...+ ..-...+.+..|++ ..+..|.|++.+.++ T Consensus 339 ~p~~--~-~~~~~-i~~Gd~~~-~~-i~~~~~~~~~~~~d~~~~~~~~~~~~~~r~d-~~~~~~~A~~~l~~k 404 (409) T protein:vir:45 339 IDDI--G-AGKKF-MFCGDFDR-FI-IRRVRYMILKRLVERYAEYDQTGFLAFHRFD-CILEDTSAIKALVGK 404 (409) T ss_pred cCCc--c-CCccE-EEEeehhh-hh-eeeccceEEEEeecccccCCcEEEEEEEEec-cEeechhheEEEEec Confidence 2211 1 22332 333 3221 11 111111111 11111 11234456677875 569999999999997 No 92 >protein:vir:93616 Length: 645 # NCBI annotation: putative major head protein/prohead protease # Family: family:all:21 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449293;genbank:gi:157166041;goa:Q6H9U8;interpro:IPR006433;uniprot:Q6H9U8;genbank:GeneID:5580438 Probab=97.78 E-value=2e-05 Score=46.35 Aligned_cols=319 Identities=9% Similarity=-0.085 Sum_probs=151.6 Q ss_pred CcccchhHHH----Hhhhhhhhcc---cccccccchhhhhhhhhhhccCCceeccchhhHHHHHHHHHHHHHHHHHhhhh Q lcl|NC_021342. 1 MAIKTIDAQT----IQGNQWLVHK---GYVSRNGDQWVINNTALDAIGNPNIMLDADGGIAFYISQLAGIEATVYETPYG 73 (354) Q Consensus 1 ~~~~~~~~~~----~~~~~~~~~~---~~~~~~~~~~~~~~~am~a~~~~~~~~da~~~~~fl~~~L~~Id~~v~e~~~~ 73 (354) -..|..+|-. +....|-... -.-..++.+..... ...+......++++.+++.++.. +.+...+++..++ T Consensus 288 ~~~kg~~f~~~~~al~~~~g~~~~a~e~a~~~~~~~~~~~~-~~~~a~~~~~~~~~~~~Gg~~vp--~~~~~~ii~~l~~ 364 (645) T protein:vir:93 288 KLDKGIGFARFAKSLAAAKGVRSEALEVARRQYPDDSRLHH-VLKSAVGAGTTTDPQWAGSLSEY--QEYAQDFIDYLRP 364 (645) T ss_pred hhhhhhhHHHHHHHHHhcccchhHHHHHHHhhcccchhhhh-hhhhhhhccccccccccCCccCc--hhhHHHHHHhhhh Confidence 0111111111 1111111000 00000111111111 11222223445666666666665 4455677777777 Q ss_pred cccchhhccccCCCCCce-eEEEEEeeccccceeEecCCCcccceeeeccceeEEEEEEEEeeEeecHHHHHHHHHhCCC Q lcl|NC_021342. 74 DITYRFDVPMAANIPEYA-DTWMYRSYDGVTMGKFIGANGQDLPRVAQSAQMHTVPLGYAGNECHYTLDEMRKSAAMNMP 152 (354) Q Consensus 74 ~l~~r~~v~v~~~~~~~~-~~~~~~~~~~~G~a~~~~~~~~dip~v~~~~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ 152 (354) ....+++-....+..... -.+.......-+.+.|++. 
+..+|..+...+......+.++.-..+|.+=|+.+ ..+ T Consensus 365 ~svv~~l~~~~~~~~~~~~~~~~ip~~t~~~~a~wv~E-g~~~~~s~~~f~~v~l~~~kla~~~~iS~ell~ds---~~~ 440 (645) T protein:vir:93 365 QTIIGRFGQGGIPALRQVPFNIRVHAQVSGGAAGWVGE-GKTKPLTKFDFESITFSHAKVSAIAVLTEELIRFS---SPA 440 (645) T ss_pred hhhHHhhccccccccccccCceeeeeeecCcceEEecc-CccccccccceeEEEEeeEEEEEeehhHHHHHhhc---hHH Confidence 666666533211111110 0123334444566788865 45588888888888888888887777765434433 456 Q ss_pred cchHHHHHHHHHHHHHhhheeeeeehhhC----ceeeeecCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCccc Q lcl|NC_021342. 153 IDAEQARLAFRGAEEHSQSVAYFGDASRG----MYGLFNNPNVTLSSATKDYKTMNGQELFNMLNAPIFSVINLSRRFHV 228 (354) Q Consensus 153 ld~~k~~aA~~~~a~~~n~~~f~G~~~~g----i~GLlN~p~~~~~~~~~~W~~~T~~ei~~di~~~~~~l~~~s~g~~~ 228 (354) ++.--....+++++..+|+.+|+|+...+ -.|++|.. +.. ++......|+..++.++... +... T Consensus 441 ~~~~i~~~l~~aia~~~d~a~l~g~g~~~~~~~p~gi~~~~--~~~--------~~~~~~~~d~~~~~~~~~~a--~~~~ 508 (645) T protein:vir:93 441 ADALVRNALAEAVVARLDTDFVDPKKAAVADVSPASITHDV--KGT--------ASSGNPDADAEAAFGQFVAA--NLQP 508 (645) T ss_pred HHHHHHHHHHHHHHHHHHHHhhcCCCcccCCccccceeccc--ccc--------ccccchHHHHHHHHHHHHhc--CCCc Confidence 77777888999999999999999865421 24454421 111 11122346788888877654 3332 Q ss_pred c-cEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeeeeeeeccccccccccCcccEEEEEEcCcc Q lcl|NC_021342. 229 P-NTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQLDAAELAANGVSNSNKPRYMVYDKSDR 307 (354) Q Consensus 229 p-~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~~g~g~~g~d~~v~y~~~~~ 307 (354) + -..+|+|..+..|.+.. +..|.-+. +.....+ -++...|.+.+......-....-++..++-. . 
T Consensus 509 ~~a~~vmn~~~~~~L~~lk--d~~G~~~~-------~~~~~~~--~tL~G~PV~~s~~vp~~~~~gd~s~~~ig~~---~ 574 (645) T protein:vir:93 509 TGAVWLMSSTNALALSMRK--NALGQKEY-------PDMTLLG--GSFQGLPVIVSQYVGDQLVLVNAPDIYLADD---G 574 (645) T ss_pred cccEEEEcHHHHHHHHhcc--ccCCceee-------cCCCCCC--ceeeceeeEEeccCCcceeEeccccEEEEEe---c Confidence 3 25789999999997643 32232111 0001111 1233333333322211000001112221111 1 Q ss_pred eEEEeeCchhhhc------------------cccccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 308 NLAMANPIPFRML------------------APQMASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 308 ~~~~~vp~~~~~~------------------~~~~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) .+.+.+...-+.. -.-.++ -.-+.++++++ ..+++|.|++++.=+ T Consensus 575 ~v~i~~s~~a~~~~~~~~~~~~~~~~~~~~v~lf~~d-~vaira~~r~d-~~~~~p~a~~~lt~~ 637 (645) T protein:vir:93 575 GVAVDMSREASLEMQSEPTGDSTTPSPVELVSMFQTG-SVAIRAERWIN-WRRRRTAAVAVITGV 637 (645) T ss_pred ceEEEeecceeEEEeecccccccccccccchhHhhcC-ceEEEEEEEEc-ceeeCccceEEEecc Confidence 1222211111100 000122 24566788875 778999999998866 No 93 >protein:vir:4830 Length: 397 # NCBI annotation: MPL-7201 # Family: family:all:21 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038327;genbank:gi:9634653;genbank:GeneID:1262632 Probab=97.76 E-value=6.4e-06 Score=49.06 Aligned_cols=310 Identities=7% Similarity=-0.056 Sum_probs=142.5 Q ss_pred CcccchhHHHHh------hhhhhhcccccccccchhhhhh-hhhhhcc---------CCceeccchhhHHHHHHHHHHHH Q lcl|NC_021342. 1 MAIKTIDAQTIQ------GNQWLVHKGYVSRNGDQWVINN-TALDAIG---------NPNIMLDADGGIAFYISQLAGIE 64 (354) Q Consensus 1 ~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~-~am~a~~---------~~~~~~da~~~~~fl~~~L~~Id 64 (354) ..++.++.+.-. .+..-...+............. -++.... .....+.+ .++ +++. +.+. 
T Consensus 51 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~-~gg-~~iP--~~~~ 126 (397) T protein:vir:48 51 MKRDMFKEQYTEARANEVVNMSEEEKKPLTKSEEEVKAGFVKDFKNLVRGRYQNLLDSKTDASGS-DAG-LTIP--QDIQ 126 (397) T ss_pred HHHHHHHHHHHHHHHhhhhhhhhhccccccchhhHHHHHHHHHHHHHHhhhhhHHHHHhhccCCc-ccc-cccc--HHHH Confidence 000011000000 0000000000000000000000 0110000 00111111 223 3332 4556 Q ss_pred HHHHHhhhhcccchhhccccCCCCCceeEEEEEe-eccccceeEecCCCccccee-eeccceeEEEEEEEEeeEeecHHH Q lcl|NC_021342. 65 ATVYETPYGDITYRFDVPMAANIPEYADTWMYRS-YDGVTMGKFIGANGQDLPRV-AQSAQMHTVPLGYAGNECHYTLDE 142 (354) Q Consensus 65 ~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~-~~~~G~a~~~~~~~~dip~v-~~~~~~~~~pv~~~~~~~~~~~~E 142 (354) +.|++........+.++.+.. .+.....+.+.. .+..+.+.+++... .+|.. ....+........++..+.+|..= T Consensus 127 ~~ii~~~~~~~~l~~~~~~~~-~~~~~~~~~~~~~~~~~~~a~~v~E~~-~~~~~~~~~~~~v~~~~~k~~~~~~iS~el 204 (397) T protein:vir:48 127 TAIHTLVRQYDSLQEYVNVEN-VTTLTGSRVYEKWADITGLAKLDDEAG-SIGTNDDPKLYPIRYAIKRYAGISTVTNSL 204 (397) T ss_pred HHHHHHHHHHHHHHhhhceee-ccCCcceEEEEeecCCCcceeeecccc-ccccccccceeeEEeeheeeeeehhhHHHH Confidence 778887777777777765432 222222233332 23445677776543 35544 345677777888888888887654 Q ss_pred HHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehhhCceeeeecCCcccccccccccccCHHHHHHHHHHHHHHHHHH Q lcl|NC_021342. 143 MRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDASRGMYGLFNNPNVTLSSATKDYKTMNGQELFNMLNAPIFSVINL 222 (354) Q Consensus 143 l~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~gi~GLlN~p~~~~~~~~~~W~~~T~~ei~~di~~~~~~l~~~ 222 (354) |+.+ ..++..--....++++++.+|+.+++|+...+..| . ..+ +++|.+++.+|... 
T Consensus 205 l~ds---~~~l~~~v~~~l~~~~~~~~d~~il~G~g~~~~~~----------~-~~~---------~d~i~~~~~~l~~~ 261 (397) T protein:vir:48 205 LADS---AENILAWLSGWIAKKVVVTRNKAILEAIATLPTKP----------T-LTK---------WDDIIDLQAKVDPA 261 (397) T ss_pred Hhhc---hHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc----------c-ccc---------HHHHHHHHHHhhhh Confidence 4433 45677788888999999999999999975532211 0 111 35666677776542 Q ss_pred hCCcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeeeeeeeccccccccccCcccEEEEE Q lcl|NC_021342. 223 SRRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQLDAAELAANGVSNSNKPRYMVY 302 (354) Q Consensus 223 s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~~g~g~~g~d~~v~y 302 (354) ...+..++++|..|..|.+.. +..|.-++.--..++.-..+.|.|+.+.....+ ..+..+...++.- T Consensus 262 ---~~~~a~~v~n~~~~~~L~~lk--d~~G~~i~~~~~~~~~~~~l~G~PV~~~~~~~~--------~~~~~~~~~~~~g 328 (397) T protein:vir:48 262 ---IKQTSFFLTNTSGFTALKKVK--NAFGDYLMERDVKSPTGYSIDGFAVKEVADRWL--------ANASSGAMPLYFG 328 (397) T ss_pred ---hcCCCEEEECHHHHHHHHHhh--cCCCceeeccCcCCCCCceeccceeEEeccccc--------CCcCCCceEEEEE Confidence 234578999999999986532 333332211000111111223333322211111 1122233333322 Q ss_pred EcCcceEEEeeCchhhh--ccc---cccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 303 DKSDRNLAMANPIPFRM--LAP---QMASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 303 ~~~~~~~~~~vp~~~~~--~~~---~~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) +.. +.+.+..-..++. ... 
....-...+.++.|++ +.+++|.+++.+.++ T Consensus 329 d~~-~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d-~~~~~~~a~~~~~~~ 383 (397) T protein:vir:48 329 DLK-QAVTLFDRQQMSLLSTNIGGGAFETDTTKIRVIDRFD-VVATDTESFVPASFK 383 (397) T ss_pred ecc-ceEEEEeecceEEEEeccchhhhhcCceeEEEEeeec-cEEecccceEEEEec Confidence 222 2222222122221 111 1111134556777775 577899999999998 No 94 >protein:vir:81160 Length: 371 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285811;genbank:gi:148747732;genbank:GeneID:5247203 Probab=97.75 E-value=1.3e-05 Score=47.30 Aligned_cols=313 Identities=6% Similarity=-0.004 Sum_probs=144.8 Q ss_pred CcccchhHHHHhh--------hhhhhcccccccccchhhhhhhhh-h---hccCCceeccchhhHHHHHHHHHHHHHHHH Q lcl|NC_021342. 1 MAIKTIDAQTIQG--------NQWLVHKGYVSRNGDQWVINNTAL-D---AIGNPNIMLDADGGIAFYISQLAGIEATVY 68 (354) Q Consensus 1 ~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~am-~---a~~~~~~~~da~~~~~fl~~~L~~Id~~v~ 68 (354) --|+.++.+ |.. ...+-.+...............+. + .-...+++....+.+.+++. +.+.+.++ T Consensus 36 ~ei~~l~~~-i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~a~~~~t~~~gg~~vP--~~~~~~ii 112 (371) T protein:vir:81 36 EEIVALQEK-FDVAKELYEEQKQTIEDKEPLKPTVQVKENEVEAFVNHIRTRFRNAMSEGSNQDGGYTVP--QDIQTRIN 112 (371) T ss_pred HHHHHHHHH-HHHHHHHHHHHHHhhccccccccchhhHHHHHHHHHHHHHHHHHHhhccCCCccCceeec--HhHHHHHH Confidence 111111111 000 000000000000000000000000 0 00001111111122333333 45667888 Q ss_pred HhhhhcccchhhccccCCCCCceeEEEEEeeccccceeEecCCCcccce-eeeccceeEEEEEEEEeeEeecHHHHHHHH Q lcl|NC_021342. 69 ETPYGDITYRFDVPMAANIPEYADTWMYRSYDGVTMGKFIGANGQDLPR-VAQSAQMHTVPLGYAGNECHYTLDEMRKSA 147 (354) Q Consensus 69 e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~-v~~~~~~~~~pv~~~~~~~~~~~~El~~a~ 147 (354) +........+.++++.. 
.+.....+.+......+.+.+++.++ .+|- .+...+......+.++.-+.+|..=++.+ T Consensus 113 ~~~~~~s~i~~~~~~~~-~~~~~~~~~~~~~~~~~~a~~v~Eg~-~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds- 189 (371) T protein:vir:81 113 ELRESKDALQNLITVEP-VTTLSGSRVFKKRSQQTGFVEVAEGA-AIGEKATPQFTLLQYQVKKYAGFFRVTNELLNDS- 189 (371) T ss_pred HHHHhhhhhhhhceeee-ccCCceeEEEEeecCCcceeeecccc-ccccccccceeeEEeeeeEEEEeehhhHHHHhhh- Confidence 88888888888776532 23233334444555556778887654 3553 44677888888899988888887655544 Q ss_pred HhCCCcchHHHHHHHHHHHHHhhheeeeeehhhCceeeeecCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCcc Q lcl|NC_021342. 148 AMNMPIDAEQARLAFRGAEEHSQSVAYFGDASRGMYGLFNNPNVTLSSATKDYKTMNGQELFNMLNAPIFSVINLSRRFH 227 (354) Q Consensus 148 ~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~gi~GLlN~p~~~~~~~~~~W~~~T~~ei~~di~~~~~~l~~~s~g~~ 227 (354) ..+|..--....++++++.+|+.+++|+....-.|. .+. +++..++...... ... T Consensus 190 --~~~l~~~i~~~l~~a~~~~~~~~i~~g~g~~~~~~~-----------------~~~----~~i~~~~~~~l~~--~~~ 244 (371) T protein:vir:81 190 --TEAIVNTLVRWIGDESRVTRNGLIINVLNTKAKTAI-----------------ADL----DGLKQIINVQLDP--VFR 244 (371) T ss_pred --hHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc-----------------ccH----HHHHHHHHhhcch--hhh Confidence 246777788888999999999999999764322121 112 3333333211111 122 Q ss_pred cccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeeeeeeecccc--cccc-ccCcccEEEEEEc Q lcl|NC_021342. 228 VPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQLDAAELA--ANGV-SNSNKPRYMVYDK 304 (354) Q Consensus 228 ~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~--~~g~-g~~g~d~~v~y~~ 304 (354) ..-.++|+|..|..|.+.. +..+. ||-..++ ..+.+-.+...|.......- .... 
+.+.....++|-+ T Consensus 245 ~~a~~vmn~~~~~~L~~lk--d~~g~----~l~~~~~---~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~~~~~i~~Gd 315 (371) T protein:vir:81 245 STSSVIVNQDAFNWLDTLK--DQNGQ----YLLQPSI---SSPTGRQLLGLPVVIVSNKVLANRVDGGTGAQFAPIIVGD 315 (371) T ss_pred cCCEEEEcHHHHHHHHHhh--ccCCC----eeeeccc---CCCCCceecceeEEEecccccCccccccccCCcceEEEEe Confidence 3457999999999986532 33332 1211111 11222223223332222111 0000 1111111223221 Q ss_pred CcceEEEeeCchhhhcccccc-----CceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 305 SDRNLAMANPIPFRMLAPQMA-----SLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 305 ~~~~~~~~vp~~~~~~~~~~~-----~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) =.+.+.+.....++....... .-...+.++.|++ ..+++|.+++.++++ T Consensus 316 ~~~~~~~~~~~~~~i~~~~~~~~~f~~~~v~~~~~~r~d-~~~~~~~a~~~~~~~ 369 (371) T protein:vir:81 316 LKEAVVMFDRQRTEIMSSNVAMDAFETDATLWRAIERMD-VKMRDDEAFVFGEVQ 369 (371) T ss_pred hhceEEEEeecceEEEEeccccchhhcCceEEEEEEeec-cEEecccceEEEEEe Confidence 112233322223222221111 1134666777775 678889999999999 No 95 >protein:vir:3613 Length: 272 # NCBI annotation: MHP # Family: family:all:522 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112699;genbank:gi:13786567;genbank:GeneID:921035 Probab=97.71 E-value=2e-05 Score=46.36 Aligned_cols=268 Identities=6% Similarity=-0.040 Sum_probs=133.1 Q ss_pred cCCceeccchhhHHHHHHHHHHHHHHHHHhhhhcccchhhccccCCCCC-ceeEEEEEeeccccceeEecCCCcccceee Q lcl|NC_021342. 41 GNPNIMLDADGGIAFYISQLAGIEATVYETPYGDITYRFDVPMAANIPE-YADTWMYRSYDGVTMGKFIGANGQDLPRVA 119 (354) Q Consensus 41 ~~~~~~~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~-~~~~~~~~~~~~~G~a~~~~~~~~dip~v~ 119 (354) +....|..+|- +..| -+.+-+.+.....+....+..+...+.. ...++.++.+...|.+..++++ ++++... 
T Consensus 1 ma~~~T~~~d~----iiPe--v~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~~~gda~~~~eg-~~i~~~~ 73 (272) T protein:vir:36 1 MSKQKTTLADL----VNPE--VLAPIVSYELNKALRFAPLAQVDTTLQGQPGNTLKFPAFTYIGDAADVAEG-GEISLDK 73 (272) T ss_pred CCCcceehhhh----hchH--HHHHHHHHHHHhhhhhccccccccccccCCCCEEEEeeeccCccccccCCC-CccChhh Confidence 22334444442 2221 1122233334444555555544443221 2446778888888999887765 4688888 Q ss_pred eccceeEEEEEEEEeeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehhhCceeeeecCCccccccccc Q lcl|NC_021342. 120 QSAQMHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDASRGMYGLFNNPNVTLSSATKD 199 (354) Q Consensus 120 ~~~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~gi~GLlN~p~~~~~~~~~~ 199 (354) .+.+.....+.+.+.+|. ..|++..+ .+-++-..-...++..+++..|+-++-.- .| .+ ... + T Consensus 74 lt~~~~~~~i~~~~k~~~--vtD~~~~~-~~~d~~~~~~~~~a~~~a~~~d~~i~~~l-----~~---~~---~~~-~-- 136 (272) T protein:vir:36 74 IGTTTKSVTIKKAAKGTE--ITDEAALS-GYGDPIGESNKQLGLSLANKVDDDLLSAA-----KT---TS---QTV-S-- 136 (272) T ss_pred cCCcceeEeeehhhcccc--ccHHHHhh-ccchHHHHHHHHHHHHHHHHHHHHHHHHh-----cc---cc---ccc-c-- Confidence 888888888887666555 46666665 44556666777777788888887654211 11 01 011 1 Q ss_pred ccccCHHHHHHHHHHHHHHHHHHhCCcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeee Q lcl|NC_021342. 200 YKTMNGQELFNMLNAPIFSVINLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRF 279 (354) Q Consensus 200 W~~~T~~ei~~di~~~~~~l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~ 279 (354) +.--+++|.+++..+-.. ...+..++++|..|..|.+...-........+-+..|+.+. 
.+...+ T Consensus 137 -----~~~~~d~i~~A~~~lgd~---~~~~~~ivv~p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig-------~~~G~~ 201 (272) T protein:vir:36 137 -----TKANVDGVQAALDIFNDE---DAQAYVLIVNPKDAAKIRKDANAKNIGSEVGANALINGTYA-------DVLGAQ 201 (272) T ss_pred -----ccccHHHHHHHHHHhhhc---CCCceEEEEcHHHHHHHhcccccccccccccccceeeeccc-------eecCee Confidence 111246777787777543 23578999999999998642211111100001111121111 122222 Q ss_pred eeeeccccccccccCcccEEEEEEcCcceEEEeeCchhhhccc-cccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 280 QLDAAELAANGVSNSNKPRYMVYDKSDRNLAMANPIPFRMLAP-QMASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 280 ~L~~~~~~~~g~g~~g~d~~v~y~~~~~~~~~~vp~~~~~~~~-~~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) .+....+. . ++.....|-..+.-+......+++.-.- ......-.+..... .|+-+.+|.+++.+-.+ T Consensus 202 Vv~s~~~p-~-----~~~~~~~~~~~~gA~~~~~~~~~~vE~~R~~~~~~d~i~~~~~-y~~~v~~~~~vv~~t~~ 270 (272) T protein:vir:36 202 IVRSKKLA-E-----GSALMFKIVSNSPALKLVLKRGVQVETDRDIVTKTTVITADEH-YAAYLYDLTKVVNITFT 270 (272) T ss_pred EEEeCCCC-C-----CceeEEEEEecccceeeeecCCcccccccchhhcCcEEEEEEE-EEEEEEcCccEEEEeec Confidence 22222211 1 1111122211222232222233222111 11112223333333 47999999999999988 No 96 >protein:vir:78640 Length: 352 # NCBI annotation: phage capsid # Family: family:all:658 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429943;genbank:gi:156603997;genbank:GeneID:5525386 Probab=97.70 E-value=7.2e-06 Score=48.78 Aligned_cols=301 Identities=7% Similarity=-0.059 Sum_probs=146.8 Q ss_pred CcccchhHHHHhhhhhhhcc------------------cccccccchhhhhhhhhhhc-cCCceeccchhhHHHHHHHHH Q lcl|NC_021342. 1 MAIKTIDAQTIQGNQWLVHK------------------GYVSRNGDQWVINNTALDAI-GNPNIMLDADGGIAFYISQLA 61 (354) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~------------------~~~~~~~~~~~~~~~am~a~-~~~~~~~da~~~~~fl~~~L~ 61 (354) -.++.+++..-....-...+ ++....+ .+.. ..+... ...+++..+.+++.|++. 
+ T Consensus 23 ~~~d~~e~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~-~~~~--~~~~~~~~~~al~~~~~~~gG~lIP--~ 97 (352) T protein:vir:78 23 RQVQDIEEKEKAKVKDKGEAYQSLNDNEKLVKAKAEFYRHAILPN-EFEK--PSMEAQRLLHALPTGNDSGGDKLLP--K 97 (352) T ss_pred HHHHHHHHHHHHHhhhccccccccchhhhHHHHHHHHHHHHhhhh-HHHH--HHhhHHHHHHHhccCCCCCCceecc--H Confidence 01111111111110000000 0000000 0000 000000 001122223344556665 5 Q ss_pred HHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeeccccceeEecCCCcccceeeeccceeEEEEEEEEeeEeecHH Q lcl|NC_021342. 62 GIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYDGVTMGKFIGANGQDLPRVAQSAQMHTVPLGYAGNECHYTLD 141 (354) Q Consensus 62 ~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~v~~~~~~~~~pv~~~~~~~~~~~~ 141 (354) .+.++|++........|.+..+.+..+ ..+. .+....+.+.|++.. ..+|..+...+......+.++.-+.+|.+ T Consensus 98 ~~~~~Ii~~l~~~s~l~~~~~v~~~~~---~~~p-~~~~~~~~a~~v~E~-~~~~~~~~~f~~v~~~~~k~~~~i~is~e 172 (352) T protein:vir:78 98 TLSKEIVSEPFAKNQLREKARLTNIKG---LEIP-RVSYTLDDDDFITDV-ETAKELKLKGDTVKFTTNKFKVFAAISDT 172 (352) T ss_pred hHHHHHHHHHHhhcchhhheeeEecCC---ceEE-EEecCCCcccccccc-cccccccccceeeeecceeEEeechhhHH Confidence 667788887777777788777654322 1221 222234567787654 44777777788888888888888888776 Q ss_pred HHHHHHHhCCCcchHHHHHHHHHHHHHhhheee-eeehhhCceeeeecCCcccccccccccccCHHHHHHHHHHHHHHHH Q lcl|NC_021342. 142 EMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAY-FGDASRGMYGLFNNPNVTLSSATKDYKTMNGQELFNMLNAPIFSVI 220 (354) Q Consensus 142 El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f-~G~~~~gi~GLlN~p~~~~~~~~~~W~~~T~~ei~~di~~~~~~l~ 220 (354) =|+.+ ..++..--....++++...+++.+| .|+....-.|.++++++...+.. ..+++|.+++.+|. 
T Consensus 173 ll~Ds---~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~~~~~g~l~~~~~~~~t~~---------~~~d~i~~~~~~l~ 240 (352) T protein:vir:78 173 VIHGS---DVDLVNWVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSVKEVEGA---------NMYDAIINALADLH 240 (352) T ss_pred HHhhh---hHHHHHHHHHHHHHHHHHHHHHhhhhcCCCCcccccceecccccccccc---------chHHHHHHHHhccC Confidence 55443 3467766777777778777777665 55544445788888776543322 22577777777764 Q ss_pred HHhCCcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeeeeeeeccccccccccCcccEEE Q lcl|NC_021342. 221 NLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQLDAAELAANGVSNSNKPRYM 300 (354) Q Consensus 221 ~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~~g~g~~g~d~~v 300 (354) .. +. ..-..+|++..|..|.+.. +..+..++ ...+. .+-|.|+.+ +.+.. + ++ T Consensus 241 ~~--~~-~~a~~~mn~~t~~~l~~~~--~~~~~~~~----~~~~~-~llG~PV~~-------~~~~~---------~-~~ 293 (352) T protein:vir:78 241 ED--YR-DNATIYMRYADYVKIISVL--SNGTTNFF----DTPAE-KVFGKPVVF-------TDAAV---------K-PI 293 (352) T ss_pred hh--hh-cCCEEEEehHHHHHHHHHH--hccCCccc----ccCCc-cccccceEE-------ecCCC---------c-ee Confidence 42 21 1246889998887776543 22232222 11111 122333322 21100 0 11 Q ss_pred EEEcCcceEEEeeCchhhhcc-ccccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 301 VYDKSDRNLAMANPIPFRMLA-PQMASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 301 ~y~~~~~~~~~~vp~~~~~~~-~~~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) +-+.+.=.+.. ..+.+-. -+...-...+.+.+|++|. 
+.+|.|++.+.++ T Consensus 294 ~Gdf~~~~~~~---~~~~~~~~~~~~~g~~~f~~~~r~Dg~-~~~~eA~~~l~~~ 344 (352) T protein:vir:78 294 VGDFNYFGINY---DGTTYDTDKDVKKGEYLFVLTAWYDQQ-RTLDSAFRIAKAK 344 (352) T ss_pred Eeehhhhhhhh---hhheeeeeccccCCeeEEEEEeeeCce-eechhheEEEEee Confidence 11111000000 0011100 0111223556677888765 6779999999999 No 97 >protein:vir:4997 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049971;genbank:gi:9632943;genbank:GeneID:1262106 Probab=97.70 E-value=7.2e-06 Score=48.79 Aligned_cols=310 Identities=7% Similarity=-0.018 Sum_probs=144.0 Q ss_pred CcccchhHH--HHhhhh----hhhcccccccccchhhhhh-hhhhhc---------cCCceeccchhhHHHHHHHHHHHH Q lcl|NC_021342. 1 MAIKTIDAQ--TIQGNQ----WLVHKGYVSRNGDQWVINN-TALDAI---------GNPNIMLDADGGIAFYISQLAGIE 64 (354) Q Consensus 1 ~~~~~~~~~--~~~~~~----~~~~~~~~~~~~~~~~~~~-~am~a~---------~~~~~~~da~~~~~fl~~~L~~Id 64 (354) ..++.++.+ .....+ ....++............. -++... ......+.+ .+.+++. +.+. T Consensus 51 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~t~~--~gg~~iP--~~~~ 126 (397) T protein:vir:49 51 MKRDLFKEQYTEARANEVANMSEEEKKPLTKNEEEVKANFVKDFKNLVRGRYQNLLDSKTDGSGS--DAGLTIP--QDIR 126 (397) T ss_pred HHHHHHHHHHHHHHHhhhhcccccccccccchhhHHHHHHHHHHHHHhhcchhhHHHhhhccCCc--cCcceec--HHHH Confidence 000001000 000000 0000000000000000000 000000 000111122 2334433 4455 Q ss_pred HHHHHhhhhcccchhhccccCCCCCceeEEEEEeec-cccceeEecCCCcccceee-eccceeEEEEEEEEeeEeecHHH Q lcl|NC_021342. 65 ATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYD-GVTMGKFIGANGQDLPRVA-QSAQMHTVPLGYAGNECHYTLDE 142 (354) Q Consensus 65 ~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~-~~G~a~~~~~~~~dip~v~-~~~~~~~~pv~~~~~~~~~~~~E 142 (354) ..+++........+++..+.. .+.....+.+.... ..+.+.+++..+. 
+|..+ ...+......+.++.-+.+|..= T Consensus 127 ~~ii~~~~~~~~l~~~~~~~~-~~~~~~~~~~~~~~~~~~~a~~v~E~~~-~~~~~~~~~~~v~~~~~k~~~~~~iS~el 204 (397) T protein:vir:49 127 TAINTLVRQFDSLQEYVNVEN-VTTLTGSRVYEKWADITGLAKLDDEGGQ-IGQNDDPKLSLIRYAIKRYAGISTVTNSL 204 (397) T ss_pred HHHHHHHHhhhhHhhhcceee-ccCCcceEEEEeeccCCcceeeeccccc-cccccccceeeeEeeeeeeEeehhhHHHH Confidence 677777777777777665432 22233334444443 3466777765433 55444 34567777888888777777644 Q ss_pred HHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehhhCceeeeecCCcccccccccccccCHHHHHHHHHHHHHHHHHH Q lcl|NC_021342. 143 MRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDASRGMYGLFNNPNVTLSSATKDYKTMNGQELFNMLNAPIFSVINL 222 (354) Q Consensus 143 l~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~gi~GLlN~p~~~~~~~~~~W~~~T~~ei~~di~~~~~~l~~~ 222 (354) |+. ...++..--....++++++.+|+-+++|+.... +..+ ..+ +++|.+++.++... T Consensus 205 l~d---s~~~l~~~i~~~l~~~~~~~~d~ail~G~g~~~----------~~~~-~~~---------~d~i~~~~~~l~~~ 261 (397) T protein:vir:49 205 LAD---SAENILAWLSGWIAKKVVVTRNKAILEAIGTLP----------NKPT-LAK---------WDDIIDLQAKVDPA 261 (397) T ss_pred Hhh---hhHHHHHHHHHHHHHHHHHHHHHHHHhcccccc----------cccc-ccC---------HHHHHHHHHhhhhh Confidence 433 345777888888999999999999999975421 1111 111 45677777777542 Q ss_pred hCCcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeeeeeeeccccccccccCcccEEEEE Q lcl|NC_021342. 223 SRRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQLDAAELAANGVSNSNKPRYMVY 302 (354) Q Consensus 223 s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~~g~g~~g~d~~v~y 302 (354) ...+..++|+|..|..|.+-. +..|.-++ ..++ ..|.+-.|...|......... ..+.+++. 
.++| T Consensus 262 ---~~~~a~~v~n~~~~~~l~~lk--d~~g~~l~----~~~~---~~g~~~~l~G~pV~~~~~~~~-~~~~~~~~-~~~~ 327 (397) T protein:vir:49 262 ---IKQTSLFLTNTSGFTALKKVK--NAMGDYLM----ERDV---KSPTGYSIDGFVVKEISDRFL-PNGTGGAM-PLYF 327 (397) T ss_pred ---hcCCCEEEEcHHHHHHHHHhh--ccCCceee----cccc---cCCCCceecceeeEEeccccc-ccccCCce-eEEE Confidence 345678999999999997533 33332221 1011 112222333333222111111 11112222 2333 Q ss_pred EcCcceEEEeeCchhhhcccc-----ccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 303 DKSDRNLAMANPIPFRMLAPQ-----MASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 303 ~~~~~~~~~~vp~~~~~~~~~-----~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) -+=.+.+.+..-..+++.--. ...-...+.++.|++| .+++|.|++++.++ T Consensus 328 gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~d~-~~~~~~a~~~~~~~ 383 (397) T protein:vir:49 328 GDLKQAVTLFDRQHLSLLSTNIGGGAFETDTTKVRVIDRFDV-VSTDTEAFVPASFK 383 (397) T ss_pred eeccceEEEEeecccEEEEeccccchhhcCeeeEEEEEeecc-EEecccceEEEEec Confidence 221222222221222221111 1112345667888865 57889999999998 No 98 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=97.69 E-value=2.8e-05 Score=45.56 Aligned_cols=263 Identities=7% Similarity=-0.048 Sum_probs=138.1 Q ss_pred ccCCceeccchhhHHHHHHHHHHHHHHHHHhhhhcccchhhccccCC--CCCceeEEEEEeeccccceeEecCCCcccce Q lcl|NC_021342. 40 IGNPNIMLDADGGIAFYISQLAGIEATVYETPYGDITYRFDVPMAAN--IPEYADTWMYRSYDGVTMGKFIGANGQDLPR 117 (354) Q Consensus 40 ~~~~~~~~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~--~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~ 117 (354) |.+.. |..++ .++. +.+.+.+.+.....+....+.-+... +.+| .++....+...|.+.+++.+ +++|. 
T Consensus 1 MA~~~-T~~~~----~~iP--ev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G-~tv~iP~~~~~~~a~~v~eg-~~i~~ 71 (272) T protein:vir:30 1 MAVGT-TKMAQ----MLDP--EVLADMIDAEVGKAIRFAPLAEVDTTLEGQPG-TTLTVPKWDYIGDAEDVAEG-EAIPM 71 (272) T ss_pred CCCcc-ccchh----eech--HHHHHHHHHHHHHHhhhhccccccccccCCCC-CEEEEEEecCCCCcccccCC-Ccccc Confidence 21111 22222 2222 22233344444444444454443322 1222 36777777778999998865 67898 Q ss_pred eeeccceeEEEEEEEEeeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehhhCceeeeecCCccccccc Q lcl|NC_021342. 118 VAQSAQMHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDASRGMYGLFNNPNVTLSSAT 197 (354) Q Consensus 118 v~~~~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~gi~GLlN~p~~~~~~~~ 197 (354) .+...+.....+..++..+.++..+.+. ...++...-.+.+.+.+++..|+.++.-- .|- + ... T Consensus 72 ~~~~~~~~~~~~~~~~~~~~itd~~~~~---s~~d~~~~~~~~~~~~~a~~~d~~i~~~~-----~~a---~-----~~~ 135 (272) T protein:vir:30 72 TQLGFKKTTMTIKKAGKGVEITDEAILS---GYGDPVGQAAKQIVEAIDHKVDADVLDAL-----SKS---T-----QTV 135 (272) T ss_pred cccccceEEEEeeeeeeeeeecHHHHhh---ccccHHHHHHHHHHHHHHHHHHHHHHHHh-----ccc---c-----ccc Confidence 9998888888999988888887665444 35578888888899999999888766311 111 0 001 Q ss_pred ccccccCHHHHHHHHHHHHHHHHHHhCCcccccEEEeCHHHHHHHhhcccCCCCCch-HHHHHHhhCcccccccccceee Q lcl|NC_021342. 198 KDYKTMNGQELFNMLNAPIFSVINLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRT-VMQHFMEANSYTLLTGNELDIQ 276 (354) Q Consensus 198 ~~W~~~T~~ei~~di~~~~~~l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~T-vl~~l~~n~~~~~~~g~~l~I~ 276 (354) +.. .| +++|.+++..+... + ..+..++|+|..|..|.+.......+.+ ...-+..+ |..-++. 
T Consensus 136 ~~~--~t----~d~i~da~~~l~~~--~-~~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~-------g~ig~i~ 199 (272) T protein:vir:30 136 EAT--AT----VDGVSKALDIFNDE--D-DAETVIVMNPADASTLRLDAAKEWLGATEVGANRVVS-------GVYGEVL 199 (272) T ss_pred ccc--cC----HHHHHHHHHHHhcc--C-CCccEEEEcHHHHHHHHHhcccccccccccccccccc-------ccchhhc Confidence 111 11 56777787777543 2 4567899999999988642211111100 00001111 2111233 Q ss_pred eeeeeeeccccccccccCcccEEEEEEcCcceEEEeeCchhhhccc-cccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 277 IRFQLDAAELAANGVSNSNKPRYMVYDKSDRNLAMANPIPFRMLAP-QMASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 277 ~~~~L~~~~~~~~g~g~~g~d~~v~y~~~~~~~~~~vp~~~~~~~~-~~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) ..+.+....... +..+++ ++..+.+..-.+.+.-.- +.......+....++ |+.+.+|.+++.+-++ T Consensus 200 G~~Vi~s~~~p~--------~t~~~~--~~~a~~~~~~~~~~ve~~r~~~~~~~~i~~~~~~-~~~v~~~~~vv~~t~~ 267 (272) T protein:vir:30 200 GVQIVRSRKCPK--------GTAYMV--RKGALRIMLKRNTMVETDRDITKAINQIVANKHY-GVYLYKAEKAVKITLK 267 (272) T ss_pred CeeEEEcCCCCc--------ceEEEE--cCCeEEEEecCCceeeeccccccceeEEEEEEEE-EEEEEcCCceEEEEec Confidence 333333333211 111222 223333333223222111 112233444444555 5889999999999888 No 99 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=97.69 E-value=2.8e-05 Score=45.56 Aligned_cols=263 Identities=7% Similarity=-0.048 Sum_probs=138.1 Q ss_pred ccCCceeccchhhHHHHHHHHHHHHHHHHHhhhhcccchhhccccCC--CCCceeEEEEEeeccccceeEecCCCcccce Q lcl|NC_021342. 40 IGNPNIMLDADGGIAFYISQLAGIEATVYETPYGDITYRFDVPMAAN--IPEYADTWMYRSYDGVTMGKFIGANGQDLPR 117 (354) Q Consensus 40 ~~~~~~~~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~--~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~ 117 (354) |.+.. |..++ .++. +.+.+.+.+.....+....+.-+... +.+| .++....+...|.+.+++.+ +++|. 
T Consensus 1 MA~~~-T~~~~----~~iP--ev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G-~tv~iP~~~~~~~a~~v~eg-~~i~~ 71 (272) T protein:vir:98 1 MAVGT-TKMAQ----MLDP--EVLADMIDAEVGKAIRFAPLAEVDTTLEGQPG-TTLTVPKWDYIGDAEDVAEG-EAIPM 71 (272) T ss_pred CCCcc-ccchh----eech--HHHHHHHHHHHHHHhhhhccccccccccCCCC-CEEEEEEecCCCCcccccCC-Ccccc Confidence 21111 22222 2222 22233344444444444454443322 1222 36777777778999998865 67898 Q ss_pred eeeccceeEEEEEEEEeeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehhhCceeeeecCCccccccc Q lcl|NC_021342. 118 VAQSAQMHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDASRGMYGLFNNPNVTLSSAT 197 (354) Q Consensus 118 v~~~~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~gi~GLlN~p~~~~~~~~ 197 (354) .+...+.....+..++..+.++..+.+. ...++...-.+.+.+.+++..|+.++.-- .|- + ... T Consensus 72 ~~~~~~~~~~~~~~~~~~~~itd~~~~~---s~~d~~~~~~~~~~~~~a~~~d~~i~~~~-----~~a---~-----~~~ 135 (272) T protein:vir:98 72 TQLGFKKTTMTIKKAGKGVEITDEAILS---GYGDPVGQAAKQIVEAIDHKVDADVLDAL-----SKS---T-----QTV 135 (272) T ss_pred cccccceEEEEeeeeeeeeeecHHHHhh---ccccHHHHHHHHHHHHHHHHHHHHHHHHh-----ccc---c-----ccc Confidence 9998888888999988888887665444 35578888888899999999888766311 111 0 001 Q ss_pred ccccccCHHHHHHHHHHHHHHHHHHhCCcccccEEEeCHHHHHHHhhcccCCCCCch-HHHHHHhhCcccccccccceee Q lcl|NC_021342. 198 KDYKTMNGQELFNMLNAPIFSVINLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRT-VMQHFMEANSYTLLTGNELDIQ 276 (354) Q Consensus 198 ~~W~~~T~~ei~~di~~~~~~l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~T-vl~~l~~n~~~~~~~g~~l~I~ 276 (354) +.. .| +++|.+++..+... + ..+..++|+|..|..|.+.......+.+ ...-+..+ |..-++. 
T Consensus 136 ~~~--~t----~d~i~da~~~l~~~--~-~~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~-------g~ig~i~ 199 (272) T protein:vir:98 136 EAT--AT----VDGVSKALDIFNDE--D-DAETVIVMNPADASTLRLDAAKEWLGATEVGANRVVS-------GVYGEVL 199 (272) T ss_pred ccc--cC----HHHHHHHHHHHhcc--C-CCccEEEEcHHHHHHHHHhcccccccccccccccccc-------ccchhhc Confidence 111 11 56777787777543 2 4567899999999988642211111100 00001111 2111233 Q ss_pred eeeeeeeccccccccccCcccEEEEEEcCcceEEEeeCchhhhccc-cccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 277 IRFQLDAAELAANGVSNSNKPRYMVYDKSDRNLAMANPIPFRMLAP-QMASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 277 ~~~~L~~~~~~~~g~g~~g~d~~v~y~~~~~~~~~~vp~~~~~~~~-~~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) ..+.+....... +..+++ ++..+.+..-.+.+.-.- +.......+....++ |+.+.+|.+++.+-++ T Consensus 200 G~~Vi~s~~~p~--------~t~~~~--~~~a~~~~~~~~~~ve~~r~~~~~~~~i~~~~~~-~~~v~~~~~vv~~t~~ 267 (272) T protein:vir:98 200 GVQIVRSRKCPK--------GTAYMV--RKGALRIMLKRNTMVETDRDITKAINQIVANKHY-GVYLYKAEKAVKITLK 267 (272) T ss_pred CeeEEEcCCCCc--------ceEEEE--cCCeEEEEecCCceeeeccccccceeEEEEEEEE-EEEEEcCCceEEEEec Confidence 333333333211 111222 223333333223222111 112233444444555 5889999999999888 No 100 >protein:vir:4159 Length: 315 # NCBI annotation: structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046968;genbank:gi:9630538;genbank:GeneID:1261712 Probab=97.69 E-value=1.8e-05 Score=46.58 Aligned_cols=303 Identities=12% Similarity=0.038 Sum_probs=140.8 Q ss_pred ccchhHHHHhhhhhhhcccccccccchhhhhhhhhhhccCCceeccchhhHHHHHH-HHHHHHHHHHHhhhhcccchhhc Q lcl|NC_021342. 3 IKTIDAQTIQGNQWLVHKGYVSRNGDQWVINNTALDAIGNPNIMLDADGGIAFYIS-QLAGIEATVYETPYGDITYRFDV 81 (354) Q Consensus 3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~am~a~~~~~~~~da~~~~~fl~~-~L~~Id~~v~e~~~~~l~~r~~v 81 (354) |-||| .|+ ++..+. +..+ +++ ++.++.++.. ++.++-..+.| .-..|++. 
T Consensus 1 ~~~~~--------------~~~-~~~~~~----~~k~-----~t~-~d~~Gg~l~P~~~~~~i~~~~e----~s~~l~~~ 51 (315) T protein:vir:41 1 MLTIE--------------DIR-GGKPFE----IVPK-----IDV-PDLGRGVLSVDRFGEFVKAVRD----SAVIIPEA 51 (315) T ss_pred Ccccc--------------hhh-cCChhh----hhhh-----cCC-cCCCCceechHHHHHHHHHHHh----hhhhhhhc Confidence 22222 222 111111 1122 222 1223334443 23333233333 33344554 Q ss_pred cccCCCCCceeEEEEE-eeccccc-eeEecCCCcccceeeeccceeEEEEEEEEeeEeecHHHHHHHHHhCCCcchHHHH Q lcl|NC_021342. 82 PMAANIPEYADTWMYR-SYDGVTM-GKFIGANGQDLPRVAQSAQMHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQAR 159 (354) Q Consensus 82 ~v~~~~~~~~~~~~~~-~~~~~G~-a~~~~~~~~dip~v~~~~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~ 159 (354) .+....+.....+.-. ....+.. +.+.+. ....+..+...+....+...+..-...+.+-|+.+. .|.++...... T Consensus 52 ~vi~~~~~~~~~i~~~g~~~~~~~g~~~~~~-~~~~~~~~~~f~~~~l~~~~l~~~~~it~elL~D~~-~~~~~e~~l~~ 129 (315) T protein:vir:41 52 RIDNALKSYEKDISRLSLVLDVGPGRDETGQ-KLAPPESTAEVKTNTLYMREMVTKVVIHEDAIEDNI-EGKAFEQKIVT 129 (315) T ss_pred eeeeccccccccccccccCcccccccccccC-cCCCCCCccccceeeeceeeeeeeccccHHHHHhhh-ccccHHHHHHH Confidence 4433322222211000 0001111 112222 222344445556666677777777778777777653 46789999999 Q ss_pred HHHHHHHHHhhheeeeeehhh------CceeeeecCCccccccccccccc-CHHHHHHHHHHHHHHHHHHhCCcccccEE Q lcl|NC_021342. 160 LAFRGAEEHSQSVAYFGDASR------GMYGLFNNPNVTLSSATKDYKTM-NGQELFNMLNAPIFSVINLSRRFHVPNTA 232 (354) Q Consensus 160 aA~~~~a~~~n~~~f~G~~~~------gi~GLlN~p~~~~~~~~~~W~~~-T~~ei~~di~~~~~~l~~~s~g~~~p~~L 232 (354) ..+++.++.++...|+|+... ...|+|+..+..+.....+++.. .+.+.+.|+..++..-..+. .+.-.. 
T Consensus 130 ~~a~~~a~~~~~~~~nGdg~s~~p~~~~~~G~l~~a~~~~~~~~~~~~a~~~~~d~l~~l~~sl~~~yr~~---~~~~~~ 206 (315) T protein:vir:41 130 LLGEGISYVLEKYYLHGDTSSSDPLLRMSDGWLKLASEKLTESDVDPEAEDWPMNLFDTMIESLPTPYRNN---LPNMKF 206 (315) T ss_pred HHHHHHHHHHHHHhhccCCcCcCccccccccceecccccccccccccccccccHHHHHHHHHhcChHHhhc---CCceEE Confidence 999999999999999998753 45688887664443333444432 13333444333332222110 122468 Q ss_pred EeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeeeeeeeccccccccccCcccEEEEEEcCcceEEEe Q lcl|NC_021342. 233 LMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQLDAAELAANGVSNSNKPRYMVYDKSDRNLAMA 312 (354) Q Consensus 233 ~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~~g~g~~g~d~~v~y~~~~~~~~~~ 312 (354) +|+++.+..+.+.. ++.+.-+++-. ...|.+.++...|....+.....+.+ +. .|++. |.+++... T Consensus 207 imn~~t~~~~rklk--~~~g~~lw~~~-------~~~g~~~tl~G~PV~~~~~m~~~~~~---~~-~ilf~-d~~nl~~~ 272 (315) T protein:vir:41 207 YVTWDIYRAYRDAL--KGRETGLGDQA-------LTGANSILYDGRPVQYVPALEALNDG---KS-RALFV-VPTQLVYG 272 (315) T ss_pred EEcHHHHHHHHHHh--ccCCCccccch-------hhcCCCceecccceEecccccccCCC---Cc-cEEEe-cccceEEE Confidence 89998887664322 22222222111 12345556655555444443332221 11 24443 35555555 Q ss_pred eCchhhhccccc-cCceeEEeeeeeeeeE-EEECCceeEeeec Q lcl|NC_021342. 313 NPIPFRMLAPQM-ASLGITVPAEYKISGT-EFRYPLCAAYVDM 353 (354) Q Consensus 313 vp~~~~~~~~~~-~~l~~~~~~~~~~gGv-~i~~P~ai~y~D~ 353 (354) +-..++..+-.. 
+.-.+.+-...|+++- .+..-.++..+-| T Consensus 273 ~~~~i~i~~~~~a~~~~~~~~~~~r~d~~~~~~~~~a~~~~~v 315 (315) T protein:vir:41 273 FWRNIKVVPDYDAEMRLTKYVASLRTDNHYEDEEGAVSATITV 315 (315) T ss_pred eccccEEEeeecCCCCceEEEEEEEeceeEEeccceeEeeeeC Confidence 555555544322 2223444445666553 3344344555666 No 101 >protein:vir:9509 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835556;genbank:gi:30043951;genbank:GeneID:1260537 Probab=97.61 E-value=3.8e-05 Score=44.84 Aligned_cols=309 Identities=9% Similarity=-0.071 Sum_probs=149.7 Q ss_pred Ccccchh-----------------HHHHh----------------------hhhhhhcccccccccchhhhhhhhhhhcc Q lcl|NC_021342. 1 MAIKTID-----------------AQTIQ----------------------GNQWLVHKGYVSRNGDQWVINNTALDAIG 41 (354) Q Consensus 1 ~~~~~~~-----------------~~~~~----------------------~~~~~~~~~~~~~~~~~~~~~~~am~a~~ 41 (354) |.||..+ .+..+ .+..+...+....++.+... +..+. T Consensus 1 m~ik~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~lt~~e~~---~~~~~- 76 (381) T protein:vir:95 1 MTINLSETFANAKNEFINAVNNGEPQERQNELYGDMINQLFEETKLQAKAEAERVSSLPKSAQSLSANQRS---FFMDI- 76 (381) T ss_pred CchhhHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHhccCcccccHHHHH---HHHHH- Confidence 2222211 11110 01111111111122211111 11111 Q ss_pred CCceeccchhhHHHHHHHHHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeeccccceeEecCCCcccc-eeee Q lcl|NC_021342. 42 NPNIMLDADGGIAFYISQLAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYDGVTMGKFIGANGQDLP-RVAQ 120 (354) Q Consensus 42 ~~~~~~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip-~v~~ 120 (354) ....++.+.+++. +.+..+|++.....-..|++..+.. .+.. ......+..+.+.|++..+ .++ ..+. 
T Consensus 77 ----~~~~~~~gg~lvP--~~~~~~I~~~l~~~s~i~~~~~v~~-~~~~---~~i~~~~~~~~a~w~~e~~-~~~~~~~~ 145 (381) T protein:vir:95 77 ----NKNVNYKEEKLLP--EETIDRIFEDLTTNHPLLADLGIKN-AGLR---LKFLKSETSGVAVWGKIYG-EIKGQLDA 145 (381) T ss_pred ----hcccCCCCceecC--HHHHHHHHHHHHhhccceeheeeEe-cCcc---eEEEEecCCcceeeecccc-cccccccc Confidence 1111222335554 5666778877766666677766543 2221 2334455677788876543 243 3345 Q ss_pred ccceeEEEEEEEEeeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehhhCceeeeecCCccccccc--- Q lcl|NC_021342. 121 SAQMHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDASRGMYGLFNNPNVTLSSAT--- 197 (354) Q Consensus 121 ~~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~gi~GLlN~p~~~~~~~~--- 197 (354) ..+....+.+.++.-..+|.+=|+.+ ..+++.--....+++++..+++-+++|+...+-.|+|++++......+ T Consensus 146 ~f~~i~l~~~kl~~~~~is~elL~Ds---~~~ie~~i~~~la~~~a~~~~~a~i~G~G~~qP~Gil~~~~~~~~~~~g~~ 222 (381) T protein:vir:95 146 AFSEETAIQNKLTAFVVLPKDLNDFG---PAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAY 222 (381) T ss_pred cceeeeecceeEEeechhhHHHhhcC---HHHHHHHHHHHHHHHHHHHhhheeEeccCCCCceeeeeccCcccccccccc Confidence 56777788888887777765544433 447888888899999999999999999988888999998653221111 Q ss_pred ------ccccccCHHHHHHHHHHHHHHHHHHhCCcc----cccEEEeCHHHHHHHhhccc-CCCCCchHHHHHHhhCccc Q lcl|NC_021342. 198 ------KDYKTMNGQELFNMLNAPIFSVINLSRRFH----VPNTALMFPDLWNQANNQLM-TGYTDRTVMQHFMEANSYT 266 (354) Q Consensus 198 ------~~W~~~T~~ei~~di~~~~~~l~~~s~g~~----~p~~L~l~p~~~~~L~~~~~-~~~~~~Tvl~~l~~n~~~~ 266 (354) ..+...++...++.+..++..+....++.. +-..++|+|..+..+..... .+..| .+. 
T Consensus 223 ~~~~~~~t~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l~~~~~~~~~~G-----------~~v 291 (381) T protein:vir:95 223 PEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANG-----------VYV 291 (381) T ss_pred cccccccccccccchhhHHHHHHHHHhhccccccccccccCceEEEEccccHHhhccccccCCCCC-----------cee Confidence 122223344445555555555533222222 22357899988776642211 11111 111 Q ss_pred ccccccceeeeeeeeeeccccccccccCcccEEEEEEcCcceEEEeeCchhhhccc-cccCc--eeEEeeeeeeeeEEEE Q lcl|NC_021342. 267 LLTGNELDIQIRFQLDAAELAANGVSNSNKPRYMVYDKSDRNLAMANPIPFRMLAP-QMASL--GITVPAEYKISGTEFR 343 (354) Q Consensus 267 ~~~g~~l~I~~~~~L~~~~~~~~g~g~~g~d~~v~y~~~~~~~~~~vp~~~~~~~~-~~~~l--~~~~~~~~~~gGv~i~ 343 (354) ..-+.+..|.. +... .. + -++..+.+. +.+..-..+++-.. +...+ ...+....|.+ ..++ T Consensus 292 ~~l~~g~~vv~-----s~~~-p~-----~--~iifgDfs~--Y~i~~r~~~~i~~~~~~~~~~d~~~f~a~~r~d-g~~~ 355 (381) T protein:vir:95 292 TALPFNLNVIE-----STVQ-EA-----G--KVLTYVKGL--YDGYLAGGINVQKFKETLALDDMDLYTAKQFAY-GKAK 355 (381) T ss_pred ecCCCCceEEe-----cCCC-Cc-----C--cEEEEeccc--EEEEEecccEEEeechhHhhcCCeEEEEEEEEc-CEEe Confidence 11111222221 1111 00 1 122222221 22222222222111 11111 23455677765 5678 Q ss_pred CCceeEeeecC Q lcl|NC_021342. 344 YPLCAAYVDMA 354 (354) Q Consensus 344 ~P~ai~y~D~~ 354 (354) .|.|++++|++ T Consensus 356 ~~~A~~v~~l~ 366 (381) T protein:vir:95 356 DNKVAAVWKLD 366 (381) T ss_pred cCceEEEEEEE Confidence 99999999999 No 102 >protein:vir:101291 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908831;genbank:gi:118725095;genbank:GeneID:4555862 Probab=97.61 E-value=3.8e-05 Score=44.84 Aligned_cols=309 Identities=9% Similarity=-0.071 Sum_probs=149.7 Q ss_pred Ccccchh-----------------HHHHh----------------------hhhhhhcccccccccchhhhhhhhhhhcc Q lcl|NC_021342. 
1 MAIKTID-----------------AQTIQ----------------------GNQWLVHKGYVSRNGDQWVINNTALDAIG 41 (354) Q Consensus 1 ~~~~~~~-----------------~~~~~----------------------~~~~~~~~~~~~~~~~~~~~~~~am~a~~ 41 (354) |.||..+ .+..+ .+..+...+....++.+... +..+. T Consensus 1 m~ik~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~lt~~e~~---~~~~~- 76 (381) T protein:vir:10 1 MTINLSETFANAKNEFINAVNNGEPQERQNELYGDMINQLFEETKLQAKAEAERVSSLPKSAQSLSANQRS---FFMDI- 76 (381) T ss_pred CchhhHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHhccCcccccHHHHH---HHHHH- Confidence 2222211 11110 01111111111122211111 11111 Q ss_pred CCceeccchhhHHHHHHHHHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeeccccceeEecCCCcccc-eeee Q lcl|NC_021342. 42 NPNIMLDADGGIAFYISQLAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYDGVTMGKFIGANGQDLP-RVAQ 120 (354) Q Consensus 42 ~~~~~~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip-~v~~ 120 (354) ....++.+.+++. +.+..+|++.....-..|++..+.. .+.. ......+..+.+.|++..+ .++ ..+. T Consensus 77 ----~~~~~~~gg~lvP--~~~~~~I~~~l~~~s~i~~~~~v~~-~~~~---~~i~~~~~~~~a~w~~e~~-~~~~~~~~ 145 (381) T protein:vir:10 77 ----NKNVNYKEEKLLP--EETIDRIFEDLTTNHPLLADLGIKN-AGLR---LKFLKSETSGVAVWGKIYG-EIKGQLDA 145 (381) T ss_pred ----hcccCCCCceecC--HHHHHHHHHHHHhhccceeheeeEe-cCcc---eEEEEecCCcceeeecccc-cccccccc Confidence 1111222335554 5666778877766666677766543 2221 2334455677788876543 243 3345 Q ss_pred ccceeEEEEEEEEeeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehhhCceeeeecCCccccccc--- Q lcl|NC_021342. 
121 SAQMHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDASRGMYGLFNNPNVTLSSAT--- 197 (354) Q Consensus 121 ~~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~gi~GLlN~p~~~~~~~~--- 197 (354) ..+....+.+.++.-..+|.+=|+.+ ..+++.--....+++++..+++-+++|+...+-.|+|++++......+ T Consensus 146 ~f~~i~l~~~kl~~~~~is~elL~Ds---~~~ie~~i~~~la~~~a~~~~~a~i~G~G~~qP~Gil~~~~~~~~~~~g~~ 222 (381) T protein:vir:10 146 AFSEETAIQNKLTAFVVLPKDLNDFG---PAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAY 222 (381) T ss_pred cceeeeecceeEEeechhhHHHhhcC---HHHHHHHHHHHHHHHHHHHhhheeEeccCCCCceeeeeccCcccccccccc Confidence 56777788888887777765544433 447888888899999999999999999988888999998653221111 Q ss_pred ------ccccccCHHHHHHHHHHHHHHHHHHhCCcc----cccEEEeCHHHHHHHhhccc-CCCCCchHHHHHHhhCccc Q lcl|NC_021342. 198 ------KDYKTMNGQELFNMLNAPIFSVINLSRRFH----VPNTALMFPDLWNQANNQLM-TGYTDRTVMQHFMEANSYT 266 (354) Q Consensus 198 ------~~W~~~T~~ei~~di~~~~~~l~~~s~g~~----~p~~L~l~p~~~~~L~~~~~-~~~~~~Tvl~~l~~n~~~~ 266 (354) ..+...++...++.+..++..+....++.. +-..++|+|..+..+..... .+..| .+. T Consensus 223 ~~~~~~~t~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l~~~~~~~~~~G-----------~~v 291 (381) T protein:vir:10 223 PEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANG-----------VYV 291 (381) T ss_pred cccccccccccccchhhHHHHHHHHHhhccccccccccccCceEEEEccccHHhhccccccCCCCC-----------cee Confidence 122223344445555555555533222222 22357899988776642211 11111 111 Q ss_pred ccccccceeeeeeeeeeccccccccccCcccEEEEEEcCcceEEEeeCchhhhccc-cccCc--eeEEeeeeeeeeEEEE Q lcl|NC_021342. 267 LLTGNELDIQIRFQLDAAELAANGVSNSNKPRYMVYDKSDRNLAMANPIPFRMLAP-QMASL--GITVPAEYKISGTEFR 343 (354) Q Consensus 267 ~~~g~~l~I~~~~~L~~~~~~~~g~g~~g~d~~v~y~~~~~~~~~~vp~~~~~~~~-~~~~l--~~~~~~~~~~gGv~i~ 343 (354) ..-+.+..|.. +... .. + -++..+.+. +.+..-..+++-.. 
+...+ ...+....|.+ ..++ T Consensus 292 ~~l~~g~~vv~-----s~~~-p~-----~--~iifgDfs~--Y~i~~r~~~~i~~~~~~~~~~d~~~f~a~~r~d-g~~~ 355 (381) T protein:vir:10 292 TALPFNLNVIE-----STVQ-EA-----G--KVLTYVKGL--YDGYLAGGINVQKFKETLALDDMDLYTAKQFAY-GKAK 355 (381) T ss_pred ecCCCCceEEe-----cCCC-Cc-----C--cEEEEeccc--EEEEEecccEEEeechhHhhcCCeEEEEEEEEc-CEEe Confidence 11111222221 1111 00 1 122222221 22222222222111 11111 23455677765 5678 Q ss_pred CCceeEeeecC Q lcl|NC_021342. 344 YPLCAAYVDMA 354 (354) Q Consensus 344 ~P~ai~y~D~~ 354 (354) .|.|++++|++ T Consensus 356 ~~~A~~v~~l~ 366 (381) T protein:vir:10 356 DNKVAAVWKLD 366 (381) T ss_pred cCceEEEEEEE Confidence 99999999999 No 103 >protein:vir:78350 Length: 383 # NCBI annotation: Cps # Family: family:all:635 # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468644;genbank:gi:157325222;genbank:GeneID:5601696 Probab=97.57 E-value=4.2e-05 Score=44.59 Aligned_cols=309 Identities=9% Similarity=0.001 Sum_probs=141.4 Q ss_pred CcccchhHH---------HHhhhhhhhcccccccccchhhhhhhhhhhccCCceeccchhhHHHHHHHHHHHHHHHHHhh Q lcl|NC_021342. 1 MAIKTIDAQ---------TIQGNQWLVHKGYVSRNGDQWVINNTALDAIGNPNIMLDADGGIAFYISQLAGIEATVYETP 71 (354) Q Consensus 1 ~~~~~~~~~---------~~~~~~~~~~~~~~~~~~~~~~~~~~am~a~~~~~~~~da~~~~~fl~~~L~~Id~~v~e~~ 71 (354) ..++.++.+ .-..+..+..++....++.+... ++++.... +.+.+.+++. +.+..+|++.. T Consensus 38 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~lt~~e~~---~~~~~~~~-----~~~~gg~lvP--~~~~~~I~~~l 107 (383) T protein:vir:78 38 EMVDAMAADIMEQAKKEARQEADAYISASRTDKNITNEEIK---FFNDINKE-----VGYKEETLLP--QTVVDEIFEDL 107 (383) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCChhhhhHHHHH---HHHHHhcc-----CCCCCccccC--HHHHHHHHHHH Confidence 001111100 00122233333444444433222 22332111 1233345554 45566777766 Q ss_pred hhcccchhhccccCCCCCceeEEEEEeeccccceeEecCCCcccc-eeeeccceeEEEEEEEEeeEeecHHHHHHHHHhC Q lcl|NC_021342. 
72 YGDITYRFDVPMAANIPEYADTWMYRSYDGVTMGKFIGANGQDLP-RVAQSAQMHTVPLGYAGNECHYTLDEMRKSAAMN 150 (354) Q Consensus 72 ~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip-~v~~~~~~~~~pv~~~~~~~~~~~~El~~a~~~g 150 (354) ...-..|.++.+.. .+.. ..+...+..+.+.|.+..+. ++ ..+...+....+.+.++.-..++.+=|+-+ . T Consensus 108 ~~~s~l~~~~~v~~-~~~~---~~i~~~~~~~~a~w~~e~~~-~~~~~~~~f~~i~l~~~kl~~~i~is~ell~Ds---~ 179 (383) T protein:vir:78 108 TTEHPFLASIGMRT-TGLR---TKFLKSETSGVAVWGKIFGE-IKGQLDATFSDEESIQNKLTAFVVVPKDLEKFG---P 179 (383) T ss_pred HhhccceeeeeeEe-cCCc---eEEEEEcCCcceEEeecccc-cccccCcceeeEeecceeeEeeccchHHHhhcc---H Confidence 55555566655432 2221 23445556677777665432 33 345556777788888887777765444433 4 Q ss_pred CCcchHHHHHHHHHHHHHhhheeeeeehhhCceeeeecCCcccccc---cccccccCHHHHHHHHHHHHHH---HHHHh- Q lcl|NC_021342. 151 MPIDAEQARLAFRGAEEHSQSVAYFGDASRGMYGLFNNPNVTLSSA---TKDYKTMNGQELFNMLNAPIFS---VINLS- 223 (354) Q Consensus 151 ~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~gi~GLlN~p~~~~~~~---~~~W~~~T~~ei~~di~~~~~~---l~~~s- 223 (354) .+++.--....+++++..+|+.+++|+...+-.|++++.+...... ..+|. ++..--+.|+..++.. +.+.- T Consensus 180 ~~ie~~i~~~l~~~~a~~~~~a~i~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~l~~~~~~~~ 258 (383) T protein:vir:78 180 AWVKRFVVTQIEEAFAVALESAYIVGDGNDKPIGLNRKVGKGSTVVDGVYAEKA-ATGTLTFANPKTTVNELTDVYKYHS 258 (383) T ss_pred HHHHHHHHHHHHHHHHHHHhhheEeccCCCCceeeeeccCCccccccccccccc-ccchhhhhhhHHHHHHHHHHHhccc Confidence 4788888999999999999999999998778899998765322111 11222 1111112233222222 22110 Q ss_pred ---CCc----ccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeeeeeeeccccccccccCcc Q lcl|NC_021342. 224 ---RRF----HVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQLDAAELAANGVSNSNK 296 (354) Q Consensus 224 ---~g~----~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~~g~g~~g~ 296 (354) ++. .+..+.+++|..|..+.-.... ...++.+...-+.++.|. ++.... . 
+ T Consensus 259 ~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~----------~~~~G~~~t~l~~~~~iv-----~s~~~p-~-----~- 316 (383) T protein:vir:78 259 VKENGHPLNVAGKVTLLVNPTDAWDVKKQYTS----------LNANGVYVTALPFNLNII-----ESLFVP-E-----K- 316 (383) T ss_pred hhcccchhhhcCceEEEEcCcchhhhccchhc----------cCCCCceeeecCCCceEE-----ecCCCC-c-----c- Confidence 110 1123466777554433211100 001111111112222221 111110 0 1 Q ss_pred cEEEEEEcCcceEEEeeCchhhhccc-cccCc--eeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 297 PRYMVYDKSDRNLAMANPIPFRMLAP-QMASL--GITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 297 d~~v~y~~~~~~~~~~vp~~~~~~~~-~~~~l--~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) .++..+.+. +.+..-..++.-.- +.... ...+....|++| .++.|.|++++||+ T Consensus 317 -~iifgdfs~--Y~i~~r~~~~i~~~~~~~f~~d~~~f~~~~r~dG-~~~~~~A~~vl~~~ 373 (383) T protein:vir:78 317 -KAISYVAER--YDALIGGPLDIGTYDQTLAIEDLNLYAAKQFAYG-KAKDDKAAAVWTLN 373 (383) T ss_pred -cEEEeeccc--eEEEecccceEEecchhhhhcCceEEEEEEEEcC-EEecCCeEEEEEEE Confidence 011111111 22222222222111 11111 234555677765 78899999999999 No 104 >protein:vir:95963 Length: 395 # NCBI annotation: ORF009 # Family: family:all:635 # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239802;genbank:gi:66395459;genbank:GeneID:5132880 Probab=97.50 E-value=5.6e-05 Score=43.91 Aligned_cols=305 Identities=9% Similarity=-0.049 Sum_probs=143.1 Q ss_pred Ccccchh---------------------------HHHHhhhhhhhcccccccccchhhhhhhhhhhccCCceeccchhhH Q lcl|NC_021342. 1 MAIKTID---------------------------AQTIQGNQWLVHKGYVSRNGDQWVINNTALDAIGNPNIMLDADGGI 53 (354) Q Consensus 1 ~~~~~~~---------------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~am~a~~~~~~~~da~~~~ 53 (354) .+++..+ ++.-..+..+...+..+.++.+...-.-++.. 
...+.+ T Consensus 23 ~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~r~~~~l~~ee~~~~~~~~~--------~t~~~g 94 (395) T protein:vir:95 23 NLVQNGASDEEQSKAFGAMFDALSNDLQEEITAEINNRVVDNGILAKRSQDPLTSEERKFFNDINY--------DVGYTD 94 (395) T ss_pred HHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccccchHHHHHHHHHhh--------ccCCCC Confidence 0000000 00001111111222222232222111112211 112223 Q ss_pred HHHHHHHHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeeccccceeEecCCCcccceeeeccceeEEEEEEEE Q lcl|NC_021342. 54 AFYISQLAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYDGVTMGKFIGANGQDLPRVAQSAQMHTVPLGYAG 133 (354) Q Consensus 54 ~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~v~~~~~~~~~pv~~~~ 133 (354) .+++. +.+..+|++.....-..|.+..+..-.+ . ..+...+..+.+.|......--+..+...+......+.++ T Consensus 95 G~liP--~~~~~~Ii~~l~~~s~i~~~~~v~~~~~--~--~~i~~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~ 168 (395) T protein:vir:95 95 EKILP--ETVVERVFDDLQKDHPLLSKINFQNAGI--K--TRVIKADPAGQAVWGKVFGEIKGQLDAAFREENFTQYKLT 168 (395) T ss_pred ceecc--HHHHHHHHHHHHhhhhhhhhceeEecCC--c--eEEEEecCCcceEEeecccccCccccccceeeeeceeeEE Confidence 34444 5566777777666666667665543222 1 2344556677777765433222344566677777888888 Q ss_pred eeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehhh--CceeeeecCCcccccccccccccCHHHHHHH Q lcl|NC_021342. 134 NECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDASR--GMYGLFNNPNVTLSSATKDYKTMNGQELFNM 211 (354) Q Consensus 134 ~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~--gi~GLlN~p~~~~~~~~~~W~~~T~~ei~~d 211 (354) .-..+|.+=|+. ...+++.--....+++++..+|+-+++|+... .=.|+|++...... 
...|...+.....++ T Consensus 169 ~~~~iS~ell~d---s~~~ie~~i~~~la~~ia~~~~~a~i~G~G~~~~qP~Gil~~~~~~~~--~~~~~~~~~~~t~~~ 243 (395) T protein:vir:95 169 CFVVLPDDLSTF---GPAWIERFVRTQIQEAISVALESAIINGGGAAKTQPVGLMKDVNTNSG--AVTDKASSGTLTFAD 243 (395) T ss_pred EeecccHHHHhc---chhHHHHHHHHHHHHHHHHHHhhheeeccCCCCcCceeeeeccccccc--ccccccccchhhhhh Confidence 777776544433 35578888999999999999999999998654 24799998664322 222222222122222 Q ss_pred HHHHHHHHHH----H---hCC----cccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceee--ee Q lcl|NC_021342. 212 LNAPIFSVIN----L---SRR----FHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQ--IR 278 (354) Q Consensus 212 i~~~~~~l~~----~---s~g----~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~--~~ 278 (354) +...+..+.. . .++ ...--+.+|+|..+..+...+.- ++ ..|.+.++. ++ T Consensus 244 ~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~mn~~t~~~~~g~~~~------------~~-----~~G~~~~~lg~g~ 306 (395) T protein:vir:95 244 ADTTILELNDVLKNLSVDEKGKELKIDGKVALVVNPRDSWDVQARYTY------------LT-----ANGGFVTVLPYNV 306 (395) T ss_pred hHhhHHHHHHHHHhhccccccchhhhcCceEEEEcchhhhhcCCccee------------cc-----CCCcceeccCCcc Confidence 2222222211 1 111 12234678888776655322110 00 123332221 22 Q ss_pred eeeeeccccccccccCcccEEEEE-EcCcceEEEee--CchhhhccccccC--ceeEEeeeeeeeeEEEECCceeEeeec Q lcl|NC_021342. 279 FQLDAAELAANGVSNSNKPRYMVY-DKSDRNLAMAN--PIPFRMLAPQMAS--LGITVPAEYKISGTEFRYPLCAAYVDM 353 (354) Q Consensus 279 ~~L~~~~~~~~g~g~~g~d~~v~y-~~~~~~~~~~v--p~~~~~~~~~~~~--l~~~~~~~~~~gGv~i~~P~ai~y~D~ 353 (354) +.+.+..... ++ ++| +.+ + +.+.. .+.+..+. +... -...+....|++ -.++.|.|+++++| T Consensus 307 ~v~~~~~~p~------~~---i~fgdfs-~-y~i~~r~~~~i~~~~-~~~~~~d~~~f~~~~r~d-g~~~~~~A~~~l~i 373 (395) T protein:vir:95 307 TIITSEFVPE------GK---LVAFVTD-R-YNAVRGGGLTVKKFD-QTLALEDAVLFTAKTFAY-GQPDDNKASAVYDL 373 (395) T ss_pred eEEEcCCCCC------Cc---EEEEecc-c-EEEEEecceEEEecc-chhhhCCcEEEEEEEEEC-CEEeccccEEEEEe Confidence 2222222111 11 222 211 1 11111 11222221 1111 134566677875 67789999999999 Q ss_pred C Q lcl|NC_021342. 
354 A 354 (354) Q Consensus 354 ~ 354 (354) . T Consensus 374 ~ 374 (395) T protein:vir:95 374 K 374 (395) T ss_pred e Confidence 9 No 105 >protein:vir:80930 Length: 278 # NCBI annotation: Cps # Family: family:all:522 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468392;genbank:gi:157324966;genbank:GeneID:5601363 Probab=97.48 E-value=5.8e-05 Score=43.79 Aligned_cols=272 Identities=7% Similarity=-0.060 Sum_probs=139.3 Q ss_pred ccCCceeccchhhHHHHHHHHHHHHHHHHHhhhhcccchhhccccCCCC-CceeEEEEEeeccccceeEecCCCccccee Q lcl|NC_021342. 40 IGNPNIMLDADGGIAFYISQLAGIEATVYETPYGDITYRFDVPMAANIP-EYADTWMYRSYDGVTMGKFIGANGQDLPRV 118 (354) Q Consensus 40 ~~~~~~~~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~-~~~~~~~~~~~~~~G~a~~~~~~~~dip~v 118 (354) |. ...|.-+| .| .. +.+.+.|.+...+.+....+..+...+. ....++....+...|.++.+.++ ++++.. T Consensus 1 Ma-~~~T~~~~---~i-iP--ev~s~~v~~~~~~~~v~~~~~~~~~~l~g~~G~tv~ip~~~~~g~a~~~~~g-~~i~~~ 72 (278) T protein:vir:80 1 MA-DLTTKLAN---LI-DP--EVMGPMISAKLPKAIKFGKIAPIDNSLEGQPGSEITVPKYKYIGDAQDVAEG-AAIDYS 72 (278) T ss_pred CC-Ccceehhh---ee-cH--HHHHHHHHHHHHHhhhhcccceecccccCCCCCEEEEeeeccCCcceeecCC-CcCccc Confidence 21 11222222 12 22 2233444444444455555544333221 12356777788878988887765 568878 Q ss_pred eeccceeEEEEEEEEeeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehhhCceeeeecCCcccccccc Q lcl|NC_021342. 119 AQSAQMHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDASRGMYGLFNNPNVTLSSATK 198 (354) Q Consensus 119 ~~~~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~gi~GLlN~p~~~~~~~~~ 198 (354) +...+.....+...+.+|. ..|+++.+ .+.++-......++..+++..|+.++..-.+ ..+. .+. 
T Consensus 73 ~lt~~~~~~~i~~~~~a~~--v~D~~~~~-~~~d~~~~~~~~~a~~~a~~~d~~l~~~l~~-----a~~~-------~~~ 137 (278) T protein:vir:80 73 ALETESVKHGIKKAGKGVK--LTDESVLS-GYGDPVEEAQKQIRMAIASKVDNDILEEALT-----TTLE-------VKG 137 (278) T ss_pred ccccceeeEeeehhhcccc--ccHHHHhh-ccccHHHHHHHHHHHHHHHHHHHHHHHHHhc-----cccc-------ccc Confidence 8888888888877666554 56666554 4667778888999999999999877743222 1111 111 Q ss_pred cccccCHHHHHHHHHHHHHHHHHHhCCcccccEEEeCHHHHHHHhhcccCCCCCchHH-HHHHhhCcccccccccceeee Q lcl|NC_021342. 199 DYKTMNGQELFNMLNAPIFSVINLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRTVM-QHFMEANSYTLLTGNELDIQI 277 (354) Q Consensus 199 ~W~~~T~~ei~~di~~~~~~l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl-~~l~~n~~~~~~~g~~l~I~~ 277 (354) .....+.+..++.+.++..++... +...+..|+++|..|..|.+.........+-+ +=+..+ |.--++.. T Consensus 138 ~~t~~~~~~~~~~~~da~~~l~~~--~~~~~~~ivv~p~~~~~L~k~~~~~~~~~~~~g~~~~~~-------G~ig~~~G 208 (278) T protein:vir:80 138 AINIGLIDKIENTFTDAPDAIEDE--SITTTGVLFLNYKDTAKLREEAAGSWTKASQLGDDLLVK-------GAFGELLG 208 (278) T ss_pred ccccchhhhHHHHHHHHHHhhccc--CCCcccEEEECHHHHHHHHhhhhhhccccccccccceee-------ccceeecc Confidence 111223455667777777766442 33445579999999998864311111111000 001111 21112222 Q ss_pred eeeeeeccccccccccCcccEEEEEEcCcceEEEeeCchhhhccc-cccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 278 RFQLDAAELAANGVSNSNKPRYMVYDKSDRNLAMANPIPFRMLAP-QMASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 278 ~~~L~~~~~~~~g~g~~g~d~~v~y~~~~~~~~~~vp~~~~~~~~-~~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) .+.+....+. .+ ..+++. +.-+.+....+.+.-.- ..+.....+..... -|+-+.+|.+++.+-.. 
T Consensus 209 ~~Vi~s~~~p------~~--t~~l~~--~gAi~~~~~~~~~vE~~Rd~~~~~d~i~~~~~-yg~~v~~~~~~v~it~~ 275 (278) T protein:vir:80 209 WEIVRTKKLA------DG--NALAVK--AGALKTFLKRNLLAESGRDMDHKLTKFNADQH-YAVALVDETKAVKVVPV 275 (278) T ss_pred eeEEEcCCCC------cc--eEEEEe--ccceeeeecCCcccccccchhhccceeeeeeE-EEEEEEcCcceEEEeec Confidence 2333332221 11 122222 22343333333332111 11112233333333 37999999999999888 No 106 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=97.32 E-value=4.4e-05 Score=44.47 Aligned_cols=295 Identities=9% Similarity=-0.037 Sum_probs=135.9 Q ss_pred Ccc--------------------------------------cchhHHHHhhhhhhhcccccccccchhhhhhhhhhhccC Q lcl|NC_021342. 1 MAI--------------------------------------KTIDAQTIQGNQWLVHKGYVSRNGDQWVINNTALDAIGN 42 (354) Q Consensus 1 ~~~--------------------------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~am~a~~~ 42 (354) ..+ ...+......... +..............++.+ T Consensus 63 e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~--- 135 (400) T protein:vir:38 63 EKRDLYEAALKGNEQSSGKKPDHPEEHSYRDALNAYLHTRGRNTDGVNFEKTDV----GTFAVLRAVPTDASDAVNA--- 135 (400) T ss_pred HHHHHHHHHHHHHhhcccccccchhhhhHHHHHHHHHhhHHHHHHHHHHHHHHH----HHHhhhhhhhHHHHHHHhh--- Confidence 000 0001101000000 0000000000000011111 Q ss_pred CceeccchhhHHHHHHHHHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeec-cccceeEecCCCcccc-eeee Q lcl|NC_021342. 43 PNIMLDADGGIAFYISQLAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYD-GVTMGKFIGANGQDLP-RVAQ 120 (354) Q Consensus 43 ~~~~~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~-~~G~a~~~~~~~~dip-~v~~ 120 (354) .+..++ +.+++. +.+.+.+++.....-..+.++++..- ...+..+.+.. ..+.+.+++..+. .| ..+. 
T Consensus 136 --~~~~~~--gg~~vP--~~~~~~ii~~~~~~~~l~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~E~~~-~~~~~~~ 205 (400) T protein:vir:38 136 --GVKAAD--AASTIP--ETISNTPQRELQTVVDLKPFTNVFQA---STQKGTYPTVANATTKMVTVAELEK-NPAMAKP 205 (400) T ss_pred --cccccC--Cccccc--HHHHHHHHHHHHhhhhhhhcceeEec---cCcceEEEEEecCCCcccccccccc-ccccccc Confidence 112222 234444 55677788777776667776664321 12233444443 4566777766544 34 3345 Q ss_pred ccceeEEEEEEEEeeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehhhCceeeeecCCcccccccccc Q lcl|NC_021342. 121 SAQMHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDASRGMYGLFNNPNVTLSSATKDY 200 (354) Q Consensus 121 ~~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~gi~GLlN~p~~~~~~~~~~W 200 (354) ..+........++.-+.+|..=|+ ....++..--....++++...+|..+++|.......| . T Consensus 206 ~f~~i~~~~~k~~~~~~is~ell~---ds~~~~~~~i~~~l~~~~~~~~~~~i~~~~~~~~~~~------------~--- 267 (400) T protein:vir:38 206 EFKPVNWSVETYRQALPVSQESID---DSAIDLVGLIAQNGQQIKVNTTNGAVATLLKGFTAKT------------I--- 267 (400) T ss_pred cceeeEeehhheeeehhhHHHHHh---hhHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccc------------c--- Confidence 666777777788877777664333 2344677777888888999999999988865321110 0 Q ss_pred cccCHHHHHHHHHHHHHHHHHHhCCcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeeee Q lcl|NC_021342. 201 KTMNGQELFNMLNAPIFSVINLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQ 280 (354) Q Consensus 201 ~~~T~~ei~~di~~~~~~l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~ 280 (354) .+ +++|.+++...... .....++|+|..|..|..- .+..|.-++.- ++ ..+.+-.+...|. 
T Consensus 268 --~~----~~~~~~~~~~~~~~----~~~a~~v~~~~~~~~l~~l--kd~~G~~i~~~----~~---~~~~~~~l~G~pv 328 (400) T protein:vir:38 268 --SS----VDDLKHINNVDLDP----AYSRVIIASQSFYNFLDTV--KDGNGRYLLQD----SI---LTPSGKSVLGMPI 328 (400) T ss_pred --cc----HHHHHHHHHhhhhh----hhCcEEEEcHHHHHHHHHh--hccCCCeeeec----Cc---CCCCcccccccee Confidence 12 23444444332221 1135799999999998753 23334322210 00 1122222333333 Q ss_pred eeeccccccccccCcccEEEEEEcCcceEEEeeCchhhhccccccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 281 LDAAELAANGVSNSNKPRYMVYDKSDRNLAMANPIPFRMLAPQMASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 281 L~~~~~~~~g~g~~g~d~~v~y~~~~~~~~~~vp~~~~~~~~~~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) +....... +..|.. .++|-+=.+.+.+..-..++............+.++.|++ +.+..|.+|+++.++ T Consensus 329 ~~~~~~~~---~~~g~~-~~~~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~d-~~~~~~~a~~~l~~~ 397 (400) T protein:vir:38 329 AVVSDDTL---GAAGEA-HAFLGDIKRAILFANRADFMVRWVDDQIYGQFLQAGMRFG-VSVADEKAGYFLTYT 397 (400) T ss_pred EEeccccc---CCCCce-EEEEEeccccEEEEeecceEEEEecccccceeEEEEEEec-cEEecccceEEEEee Confidence 22222111 122222 2333211222222222223322222222233456778885 556679999999999 No 107 >protein:vir:80128 Length: 466 # NCBI annotation: Phage capsid protein # Family: family:all:635 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425603;genbank:gi:155042936;genbank:GeneID:5469556 Probab=97.31 E-value=9.9e-05 Score=42.54 Aligned_cols=320 Identities=9% Similarity=0.013 Sum_probs=136.7 Q ss_pred CcccchhHH---------HHhhhhhhhcc-------cccc----------cccchhhhhhh-hhhhccCCceeccchhhH Q lcl|NC_021342. 1 MAIKTIDAQ---------TIQGNQWLVHK-------GYVS----------RNGDQWVINNT-ALDAIGNPNIMLDADGGI 53 (354) Q Consensus 1 ~~~~~~~~~---------~~~~~~~~~~~-------~~~~----------~~~~~~~~~~~-am~a~~~~~~~~da~~~~ 53 (354) --++.+..+ .......-... ...+ ...++...... .-+... 
...+.+++ T Consensus 82 ~el~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~g~ 157 (466) T protein:vir:80 82 NELEQLNNKEPKNNSEPAQVSGARTQQFVGGETRMKGFFRNMPYEQRAALIARSEVKEFLAQVRTLAQ----QKRAVSGA 157 (466) T ss_pred HHHHHHHHhhhccCchhHHHHhhhhhHHhhHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHhh----hhhhhccc Confidence 000000000 00000000000 0000 00000000000 000000 00112222 Q ss_pred HHHHHHHHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeeccccceeEecCCCcccceeeeccceeEEEEEEEE Q lcl|NC_021342. 54 AFYISQLAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYDGVTMGKFIGANGQDLPRVAQSAQMHTVPLGYAG 133 (354) Q Consensus 54 ~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~v~~~~~~~~~pv~~~~ 133 (354) ..+.. +.+-+.+++........+..+.+..-.+ ...+.+....+.+.|.+.. .++|..+..++.....++.++ T Consensus 158 ~~~vP--~~~~~~i~~~l~~~~~l~~~~~v~~~~g----~~~~~~~~~~~~a~wv~E~-~~~~~~~~~f~~i~~~~~k~~ 230 (466) T protein:vir:80 158 ELTIP--DVMLELLRDNMHRYSKLISKVRLRPLKG----TARQNIAGAIPEGVWTEAV-ANLNELSLSFSQIEVDGYKVG 230 (466) T ss_pred ccccc--HHHHHHHHHhhhhhhhhhhheeeeecCc----eeEeeeecCCcceeecccc-cccccccccccceeecceeee Confidence 23333 2344445554444333444443322111 1222333344556776654 457877877888888999988 Q ss_pred eeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehhhCceeeeecCCccccccc-----ccccccC---- Q lcl|NC_021342. 134 NECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDASRGMYGLFNNPNVTLSSAT-----KDYKTMN---- 204 (354) Q Consensus 134 ~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~gi~GLlN~p~~~~~~~~-----~~W~~~T---- 204 (354) .-+.+|..=|+.+ ..++..--....+.+++..+|+.+++|+....-.|+||..+....... 
+.+...+ T Consensus 231 ~~~~iS~ell~ds---~~~l~~~i~~~la~~~~~~~~~ail~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~ 307 (466) T protein:vir:80 231 GFIPIPNSTLEDS---DLNLADEILDAIGQAIGFALDKAILYGTGTKMPVGIVTRLAQTTQPPNWGTKAPAWTNLSTTNL 307 (466) T ss_pred eehhhhHHHHhcc---hHHHHHHHHHHHHHHHHHHHhhheeeccCCCCcceeeecccccccccccccccccccccchhhh Confidence 8888877665544 447888888899999999999999999887777899998654322111 1121111 Q ss_pred ---------HHHHHHHHHHHHHHHHHHhCCcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCccccccccccee Q lcl|NC_021342. 205 ---------GQELFNMLNAPIFSVINLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDI 275 (354) Q Consensus 205 ---------~~ei~~di~~~~~~l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I 275 (354) +...+.++...+..+.. +........++++..+..|.........+- .++-.. .++.+ | T Consensus 308 ~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~w~~~~~~~~~l~~~~~~~~~~g---~~~~~~-----~~~~~--i 375 (466) T protein:vir:80 308 LKIDPTGKSAEEFFSELVLKLSKARA--NYSNGMKFWAMSSNTHAVLMSKAITFNSAG---ALVASL-----NNTMP--I 375 (466) T ss_pred hhhhhhccchhhHHHHHHHHHHhhhc--cccCCceeEEecchhHHHhhcccccccCCc---cccccC-----CCccc--c Confidence 11222232222222111 122233345677787777754332211110 011000 01111 1 Q ss_pred eeeeeeeeccccccccccCcccEEEEEEcCcceEEEeeCchhhhccccccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 
276 QIRFQLDAAELAANGVSNSNKPRYMVYDKSDRNLAMANPIPFRMLAPQMASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 276 ~~~~~L~~~~~~~~g~g~~g~d~~v~y~~~~~~~~~~vp~~~~~~~~~~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) ...|...+........-.+....++++++ .-+.+..-....+ .++ ...+.++.|++ ..+++|.|++++|++ T Consensus 376 ~G~pvv~s~~~~~~~~~~g~~~~y~i~~r--~~~~i~~~~~~~f----~~d-~~~~r~~~r~d-g~~~~~~afv~~~~~ 446 (466) T protein:vir:80 376 VGGDIVILDFIPDNDIIGGYGSLYLLAER--ADIKLAQSEHVRF----IED-QTVFKGTARYD-GKPVFGEGFVAVNIA 446 (466) T ss_pred cccceeecCccCccceeeeccccEEEEee--cceEEEechhhhh----hcC-cEEEEEEEEEc-cEEeccCceEEEEec Confidence 11111111111000000011111222222 1222222111111 122 24566788875 566899999999999 No 108 >protein:vir:95376 Length: 425 # NCBI annotation: phage major capsid protein # Family: family:all:635 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764476;genbank:gi:115334630;genbank:GeneID:5179263 Probab=97.19 E-value=0.00014 Score=41.80 Aligned_cols=312 Identities=10% Similarity=-0.028 Sum_probs=143.2 Q ss_pred Ccccchh--------------HHHH--------hhhhhhhcc-----------cccccccchhhhhhhhhhhccCCceec Q lcl|NC_021342. 1 MAIKTID--------------AQTI--------QGNQWLVHK-----------GYVSRNGDQWVINNTALDAIGNPNIML 47 (354) Q Consensus 1 ~~~~~~~--------------~~~~--------~~~~~~~~~-----------~~~~~~~~~~~~~~~am~a~~~~~~~~ 47 (354) -.++.++ .+.. .++++-+.. ........+......+.. +... T Consensus 68 ~~~~~le~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~ 142 (425) T protein:vir:95 68 EKKSKLEGEIAQLEDELEQINSKQPSNQSRQKMQGSKGDVVEMNRLQVREMLKTGEYYKRSEVVEFYEKFR-----NLRA 142 (425) T ss_pred HHHHHHHHHHHHHHHHHHHhhhhccchhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhHHHHHHHHHH-----hhcc Confidence 0011111 0000 000000000 000000000000000000 1111 Q ss_pred cchhhHHHHHHHHHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeeccccceeEecCCCcccceeee-ccceeE Q lcl|NC_021342. 
48 DADGGIAFYISQLAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYDGVTMGKFIGANGQDLPRVAQ-SAQMHT 126 (354) Q Consensus 48 da~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~v~~-~~~~~~ 126 (354) . +++.+++. +.+.+.+++........+.++.+.. .+ +. ..+.+....+.+.|++.++. +|..+. ..+... T Consensus 143 ~--~~gg~~vP--~~~~~~Ii~~l~~~~~i~~~~~~~~-~~-g~--~~ip~~~~~~~a~~v~E~~~-~~~~~~~~f~~i~ 213 (425) T protein:vir:95 143 V--AGGELTIP--EVVVNRIMDIMGDYTTLYPLVDKIR-VK-GT--TRILVDTDTSPATWIEQSGA-LPTGDVGTIASID 213 (425) T ss_pred c--ccCceecc--HHHHHHHHHHHHhhhhHHHhhceee-cC-ce--eEEEEecCCccccccccccc-cccccccccceee Confidence 1 22334444 4466677776666666666665432 22 22 23445556677888876544 566664 367777 Q ss_pred EEEEEEEeeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehhh--CceeeeecCCcccccccccccccC Q lcl|NC_021342. 127 VPLGYAGNECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDASR--GMYGLFNNPNVTLSSATKDYKTMN 204 (354) Q Consensus 127 ~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~--gi~GLlN~p~~~~~~~~~~W~~~T 204 (354) ...+.++.-+.+|..=|+.+ ..+++.--....+.+++..+|+-+++|+... .-.|+++..... ...+..+.+ T Consensus 214 l~~~k~~~~~~iS~ell~ds---~~~l~~~i~~~l~~~i~~~~d~~il~G~G~~~~~p~Gil~~~~~~--~~~~~~~~~- 287 (425) T protein:vir:95 214 FDGFKVGKVTFVDNYLLQDS---IINLDDYVTKKIARAIAKALDLAIVKGTGAANKQPLGIIPSLPPE--NQVTVEADN- 287 (425) T ss_pred eeheeeeeeehhhHHHHhcc---HHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccceeecccccc--ccccccccc- Confidence 78888887777876645444 3367788888899999999999999998642 346888753321 111122221 Q ss_pred HHHHHHHHHHHHHHHHHHhCCcccccEEEeCHHHH-HHHhhc-ccCCCCCchHHHHHHhhCcccccccccceeeeeeeee Q lcl|NC_021342. 205 GQELFNMLNAPIFSVINLSRRFHVPNTALMFPDLW-NQANNQ-LMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQLD 282 (354) Q Consensus 205 ~~ei~~di~~~~~~l~~~s~g~~~p~~L~l~p~~~-~~L~~~-~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~ 282 (354) ..++++.+++..+.... ........+|++..| ..|..- ..-+..|. ||-.- ..+..-++...|.+. 
T Consensus 288 --~~~~~~~~~~~~~~~~~-~~~~~~~~v~~~~~~~~~l~~l~~~kd~~g~----~i~~~-----~~~~~~~l~G~pvv~ 355 (425) T protein:vir:95 288 --NLLKNLVKQIGLIDTGD-DSVGEIVAVMKRSTYYNRLVEFSIQVDSNGN----VVGKL-----PNLRTPDLLGLRVVF 355 (425) T ss_pred --chHHHHHHHHHhhhhhc-cccCceEEEEeChHHHHHHHHHHhhcCCCCc----eeecc-----CCCCCccccceeeEE Confidence 23567777776664321 112233567777754 434321 11233332 22110 011122333344443 Q ss_pred eccccccccccCcccEEEEEEcCcceEEEeeCchhhhcccccc--CceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 283 AAELAANGVSNSNKPRYMVYDKSDRNLAMANPIPFRMLAPQMA--SLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 283 ~~~~~~~g~g~~g~d~~v~y~~~~~~~~~~vp~~~~~~~~~~~--~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) +.........-+.-..+++.+ ...+.+.+ .. +.. .-...+.++.|+. ..+++|.|+++++|. T Consensus 356 ~~~~~~~~i~~Gd~~~~~~~~--~~~~~i~~------~~-~~~f~~~~~~~~~~~r~d-~~~~~~~a~~~~~i~ 419 (425) T protein:vir:95 356 NNFLDDDTVLFGEFEQYTLVE--RENITIDS------ST-HVKFTEDQTAFRGKGRFD-GKPVKPEAFVLVTIT 419 (425) T ss_pred cCcCCCccEEEEecccEEEEe--ecceEEEe------ec-ccccccCceEEEEEEeeC-cEeecccceEEEEec Confidence 332211110000000111111 11122221 11 111 1134455566765 688999999999999 No 109 >protein:vir:3845 Length: 395 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050151;swissprot:trembl:q9t1f6;genbank:gi:9633043;uniprot:Q9T1F6;genbank:GeneID:1262163 Probab=97.07 E-value=0.00013 Score=41.97 Aligned_cols=309 Identities=7% Similarity=-0.041 Sum_probs=138.8 Q ss_pred CcccchhHHH------Hh-----hhhhhhcccccccccchhhhh-------hhhhhhccCCceeccchhhHHHHHHHHHH Q lcl|NC_021342. 1 MAIKTIDAQT------IQ-----GNQWLVHKGYVSRNGDQWVIN-------NTALDAIGNPNIMLDADGGIAFYISQLAG 62 (354) Q Consensus 1 ~~~~~~~~~~------~~-----~~~~~~~~~~~~~~~~~~~~~-------~~am~a~~~~~~~~da~~~~~fl~~~L~~ 62 (354) -.|+.+.... .. .+..-..+............. .......... .+..+ +++ +++. +. 
T Consensus 48 ~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~-~gg-~~vP--~~ 122 (395) T protein:vir:38 48 ASLKNAKMAQELAKSAYEDARANLNAEPVNKKPLPVKDGKPDAQAMKNQFVKDFKNLVTSG-TTGTG-NAG-LTIP--ED 122 (395) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhhhhccccccccchhhhhHHHHHHHHHHHHHHHHHHhhc-cCccC-CCc-eecc--hh Confidence 0000000000 00 000000000000000000000 0001111111 11111 222 3333 45 Q ss_pred HHHHHHHhhhhcccchhhccccCCCCCceeEEEEEee-ccccceeEecCCCccccee-eeccceeEEEEEEEEeeEeecH Q lcl|NC_021342. 63 IEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSY-DGVTMGKFIGANGQDLPRV-AQSAQMHTVPLGYAGNECHYTL 140 (354) Q Consensus 63 Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~-~~~G~a~~~~~~~~dip~v-~~~~~~~~~pv~~~~~~~~~~~ 140 (354) +.+.|++........+.+..+. +.......+.+... +..+.+.+++.. ..+|-. ....+......+.++..+.+|. T Consensus 123 ~~~~ii~~~~~~~~l~~~~~~~-~~~~~~~~~~~~~~~~~~~~a~~v~E~-~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ 200 (395) T protein:vir:38 123 IQLQIRTLTRSFTSLESLANVE-NVTTSHGSRVYEKLADITPLKDLDDES-ALIGDNDDPELTVVKYLIHRYAGITTVTN 200 (395) T ss_pred HhhHHHHHHHhhcchhhhccee-eccCCcceEEEEeeccCCccccccccc-cccccccccceeeEEeeeeeeEeehhhHH Confidence 5677888877777777775543 22222223333333 233455666554 335533 3456677778888887777765 Q ss_pred HHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehhhCceeeeecCCcccccccccccccCHHHHHHHHHHHHHHHH Q lcl|NC_021342. 141 DEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDASRGMYGLFNNPNVTLSSATKDYKTMNGQELFNMLNAPIFSVI 220 (354) Q Consensus 141 ~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~gi~GLlN~p~~~~~~~~~~W~~~T~~ei~~di~~~~~~l~ 220 (354) .=++ ....++..--....+++++..+|+-+++|+....-. .. . .+ +++|.+++.... 
T Consensus 201 ell~---ds~~~l~~~i~~~la~~~~~~~~~~il~g~g~~~~~----------~~-~-----~~----~~~i~~~~~~~l 257 (395) T protein:vir:38 201 TLLK---DTVDNIIQWLVNWAAKKDVVTRNAKILEVMGKAPKK----------PT-I-----SQ----FDNIKDLENNTL 257 (395) T ss_pred HHHh---hhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc----------cc-c-----cc----HHHHHHHHHHhh Confidence 4333 234567778888899999999999999997643210 00 0 11 234444443222 Q ss_pred HHhCCcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeeeeeeeccccccccccCcccEEE Q lcl|NC_021342. 221 NLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQLDAAELAANGVSNSNKPRYM 300 (354) Q Consensus 221 ~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~~g~g~~g~d~~v 300 (354) .. .....-.++|+|..|..|.+-. +..|.-++. ..+ ..+.+-.|...|.+........ +..+... + T Consensus 258 ~~--~~~~~a~~v~n~~~~~~L~~lk--d~~G~~l~~----~~~---~~~~~~~l~G~pV~~~~~~~~~--~~~~~~~-i 323 (395) T protein:vir:38 258 DP--AIESTSSFITNQSGYNILSKVK--DADGRYLMQ----PDV---TSPDKYLIDGKPVIRIADKWLP--DVSGSHP-L 323 (395) T ss_pred hh--hhcCCCEEEEcHHHHHHHHHhh--ccCCceeec----cCc---CCCCcceeccceeEEecccccC--cCCCcce-E Confidence 21 2223356899999999997533 333432211 111 1233333433333333221111 1122222 3 Q ss_pred EEEcCcceEEEeeCchhhhcccc-c----cCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 301 VYDKSDRNLAMANPIPFRMLAPQ-M----ASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 301 ~y~~~~~~~~~~vp~~~~~~~~~-~----~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) +|-+-.+.+.+..-..++..-.. . ..-.+.+.++.|++ +.+.+|.+++++++. 
T Consensus 324 ~~gd~~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d-~~~~~~~a~~~~~~~ 381 (395) T protein:vir:38 324 YFGDLKQGITLFDRQQMQIDTTNVGAGSFEHDTTKLRFIDRFD-VQLIDDGAFAAASFK 381 (395) T ss_pred EEEeccccEEEEEecceEEEEeccccchhhcCceEEEEEEeec-cEEecccceEEEEee Confidence 33222233333322332221111 1 11134566778875 677789999999999 No 110 >protein:vir:100172 Length: 394 # NCBI annotation: putative major head protein # Family: family:all:21 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025031;genbank:gi:48697264;genbank:GeneID:2948270 Probab=97.07 E-value=0.00018 Score=41.07 Aligned_cols=310 Identities=7% Similarity=-0.016 Sum_probs=140.5 Q ss_pred CcccchhHHH-Hh--hhhhhhcc--ccccccc-------chhh---hh-hhhhhhccCCceeccchhhHHHHHHHHHHHH Q lcl|NC_021342. 1 MAIKTIDAQT-IQ--GNQWLVHK--GYVSRNG-------DQWV---IN-NTALDAIGNPNIMLDADGGIAFYISQLAGIE 64 (354) Q Consensus 1 ~~~~~~~~~~-~~--~~~~~~~~--~~~~~~~-------~~~~---~~-~~am~a~~~~~~~~da~~~~~fl~~~L~~Id 64 (354) -.|+.++.+. .. ....+... ....... ..+. .. -...+.+.+ ..+++ .+.+++. +.+. T Consensus 55 ~~i~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~--~~t~~--~gg~~vP--~~~~ 128 (394) T protein:vir:10 55 DQIKDLEAENKANSDPDKPVDNAQPNGTDLKKKPIDAKKKAINDFIHSHGKVIDNAAG--HVTST--EAGVLIP--EEII 128 (394) T ss_pred HHHHHHHHHHHhhcchhhhhhhhcccccchhhhHHHHHHHHHHHHHhccchhhhhhhc--ccccc--cCceecc--HHHH Confidence 1111110000 00 00000000 0000000 0000 00 000111111 11222 2334444 5667 Q ss_pred HHHHHhhhhcccchhhccccCCCCCceeEEEEEeec-cccceeEecCCCcccce-eeeccceeEEEEEEEEeeEeecHHH Q lcl|NC_021342. 65 ATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYD-GVTMGKFIGANGQDLPR-VAQSAQMHTVPLGYAGNECHYTLDE 142 (354) Q Consensus 65 ~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~-~~G~a~~~~~~~~dip~-v~~~~~~~~~pv~~~~~~~~~~~~E 142 (354) +.|++........+.++.+.. .+.. +..+.... ..+.+.+++..+. 
.|- .+...+.....++.++.-..+|.+= T Consensus 129 ~~ii~~~~~~~~l~~~~~~~~-~~~~--~~~~~~~~~~~~~~~~~~E~~~-~~~~~~~~~~~v~l~~~k~~~~~~iS~el 204 (394) T protein:vir:10 129 YDPTAEVNSVVDLSTLVTKTP-VTTP--KGTYPILKRATDRFSSVAELAE-NPALAEPEFEQVDWSVSTYRGAIPLSEEA 204 (394) T ss_pred HHHHHHHHhhhhhhhhceeee-ccCC--ceEEEEEecCCCcccccccccc-ccccccccceeEEeeeeeeEeeehhHHHH Confidence 788888887777777766432 2212 23343333 3466677766544 453 3456677788888888878887765 Q ss_pred HHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehhhCceeeeecCCcccccccccccccCHHHHHHHHHHHHHHHHHH Q lcl|NC_021342. 143 MRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDASRGMYGLFNNPNVTLSSATKDYKTMNGQELFNMLNAPIFSVINL 222 (354) Q Consensus 143 l~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~gi~GLlN~p~~~~~~~~~~W~~~T~~ei~~di~~~~~~l~~~ 222 (354) |+.+ ..++..--....+++++..+|+-+++|..... +....+ ..+ +++|.+++...... T Consensus 205 l~ds---~~~l~~~i~~~la~~~~~~~~~~il~g~g~~~----------~~~~~~----~~~----~d~l~~~~~~~~~~ 263 (394) T protein:vir:10 205 IADS---AVDLTSLVGQSINEKSVNTYNAMIAPVLQSFT----------AKATTT----DTL----VDSLKHILNVDLDP 263 (394) T ss_pred Hhhh---hHHHHHHHHHHHHHHHHHHHHHHHhhcccccc----------cccccc----ccc----HHHHHHHHHhhhhh Confidence 5544 34677778888889999999999988865311 111000 112 34455444333221 Q ss_pred hCCcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeeeeeeeccccccccccCcccEEEEE Q lcl|NC_021342. 223 SRRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQLDAAELAANGVSNSNKPRYMVY 302 (354) Q Consensus 223 s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~~g~g~~g~d~~v~y 302 (354) .+ ...++|+|+.|..|..- .+..|.-++.--..+ ....+.+-.+...|.......... .. 
.+ +-.++| T Consensus 264 -~~---~a~~vmn~~~~~~l~~l--kd~~G~~i~~~~~~~---~~~~~~~~~L~G~PV~~~~~~~~~-~~-~~-~~~i~~ 331 (394) T protein:vir:10 264 -AY---SRALVVTQSLFNTLDTL--KDKNGRYLLHDASDS---ITDGTAKGTVLGVPVYVVGDALLG-SA-AG-DQKAFV 331 (394) T ss_pred -hc---cCEEEecHHHHHHHHHh--hccCCCeeeeccccc---cccCCcccccccceeEEecccccC-CC-CC-ceEEEE Confidence 11 24799999999999753 244443221100000 001122223333333222211111 11 12 222333 Q ss_pred EcCcceEEEeeCchhhhccccccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 303 DKSDRNLAMANPIPFRMLAPQMASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 303 ~~~~~~~~~~vp~~~~~~~~~~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) -+=.+.+.+..-..++............+..+.|++ +.+++|.+|+++.++ T Consensus 332 gd~s~~~~~~~~~~~~v~~~~~~~~~~~~~~~~r~d-~~~~~~~ai~~~~~~ 382 (394) T protein:vir:10 332 GDLKRGVLFADRQQVTLAWEDSKIYGRYLGAAFRFG-VKQADSNAGYFVTNT 382 (394) T ss_pred eeccccEEEEeecceEEEEecccccceeEEEEEEec-cEEeccccEEEEEee Confidence 221222333322333333222222233455677886 567779999999988 No 111 >protein:vir:100632 Length: 381 # NCBI annotation: 77ORF006 # Family: family:all:635 # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958606;genbank:gi:41189521;genbank:GeneID:2743778 Probab=97.06 E-value=0.00019 Score=40.98 Aligned_cols=308 Identities=9% Similarity=-0.088 Sum_probs=142.7 Q ss_pred Ccccchh-----------------HHHHh-----------------------hhhhhhcccccccccchhhhhhhhhhhc Q lcl|NC_021342. 1 MAIKTID-----------------AQTIQ-----------------------GNQWLVHKGYVSRNGDQWVINNTALDAI 40 (354) Q Consensus 1 ~~~~~~~-----------------~~~~~-----------------------~~~~~~~~~~~~~~~~~~~~~~~am~a~ 40 (354) |+||-.+ ....+ .+.+. ..+....+..+-.. 
..++ T Consensus 1 m~~kl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~-~~~~~~~l~~~e~~---~~~~- 75 (381) T protein:vir:10 1 MTINLSETFANAKNEFINAVNNGEPQERQNELYGDMINQLFEETKLQAKAEAERVSS-LPKSAQTLSANQRN---FFMD- 75 (381) T ss_pred CchhHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHH-hcccccccCHHHHH---HHHH- Confidence 3333111 00000 01111 11222222222111 1111 Q ss_pred cCCceeccchhhHHHHHHHHHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeeccccceeEecCCCcccc-eee Q lcl|NC_021342. 41 GNPNIMLDADGGIAFYISQLAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYDGVTMGKFIGANGQDLP-RVA 119 (354) Q Consensus 41 ~~~~~~~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip-~v~ 119 (354) ++.+.++.+.+++. +.+..+|++.....-..|.+..+.. .+.. ......+..|.+.|.+..+ .++ ..+ T Consensus 76 ----~~~~t~~~Gg~lvP--~~~~~~I~~~l~~~spir~~a~v~~-~~~~---~~i~~~~~~~~a~W~~e~~-~~~~~~~ 144 (381) T protein:vir:10 76 ----INKSVGYKEEKLLP--EETIDRIFEDLTTNHPLLADLGIKN-AGLR---LKFLKSETSGVAVWGKIYG-EIKGQLD 144 (381) T ss_pred ----HhhcCCCCCceecC--HHHHHHHHHHHHhhcceeeeeeeEe-cCcc---eEEEeecCCcceEEeeccc-ccccccC Confidence 11122233345554 5566777776665555556555432 2211 2233445667777765433 233 345 Q ss_pred eccceeEEEEEEEEeeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehhhCceeeeecCCccc--ccc- Q lcl|NC_021342. 120 QSAQMHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDASRGMYGLFNNPNVTL--SSA- 196 (354) Q Consensus 120 ~~~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~gi~GLlN~p~~~~--~~~- 196 (354) ...+....+.+.++.-...+.+=|+-+ ..+++.--....+++++..+++-+++|+...+-.|||++.+-.. ... 
T Consensus 145 ~~f~~i~l~~~kl~a~i~is~elL~Ds---~~~le~~i~~~la~~~a~~~~~afi~GdG~~qP~Gil~~~~~~~~~~~g~ 221 (381) T protein:vir:10 145 AAFSEETAIQNKLTAFVVLPKDLNDFG---PAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTDGA 221 (381) T ss_pred ccceeEeecceeEEeeccccHHHHhcc---HHHHHHHHHHHHHHHHHHHhhceeEecccCCCceeeeecCCccccccccc Confidence 566777888888887777776555444 45788888899999999999999999998888899998754221 111 Q ss_pred cccc------cccCHHHHHHHHHHHHHHHHHHhCCcc----cccEEEeCHHHHHHHhhcc-cCCCCCchHHHHHHhhCcc Q lcl|NC_021342. 197 TKDY------KTMNGQELFNMLNAPIFSVINLSRRFH----VPNTALMFPDLWNQANNQL-MTGYTDRTVMQHFMEANSY 265 (354) Q Consensus 197 ~~~W------~~~T~~ei~~di~~~~~~l~~~s~g~~----~p~~L~l~p~~~~~L~~~~-~~~~~~~Tvl~~l~~n~~~ 265 (354) .+++ ...++...++.+...+..+...-.+.. .-.+++|+|..+..+.... ..+..|. |+ T Consensus 222 ~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~vmn~~t~~~l~~~~~~~~~~G~----~v------ 291 (381) T protein:vir:10 222 YPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGV----YV------ 291 (381) T ss_pred cccccccccccccchhhHHHHHHHHHHhhhhhhccccccccCceEEEEchhhHHhhccccccCCCCCc----ee------ Confidence 1111 111222333333333333322111111 1236789998877764221 1111111 11 Q ss_pred cccccccceeeeeeeeeeccccccccccCcccEEEEEEcCcceEEEeeCchhhhccc-cccCc--eeEEeeeeeeeeEEE Q lcl|NC_021342. 266 TLLTGNELDIQIRFQLDAAELAANGVSNSNKPRYMVYDKSDRNLAMANPIPFRMLAP-QMASL--GITVPAEYKISGTEF 342 (354) Q Consensus 266 ~~~~g~~l~I~~~~~L~~~~~~~~g~g~~g~d~~v~y~~~~~~~~~~vp~~~~~~~~-~~~~l--~~~~~~~~~~gGv~i 342 (354) ..-+.+..|. ++.... . ++ ++..+.+. +.+..-+.+++-.. +...+ ...+....|.+ -.+ T Consensus 292 -~~lp~g~~vv-----~~~~~p-~-----~~--i~fGDfs~--Y~i~~r~~~~i~~~~~~~~~~d~~~f~a~~r~d-G~~ 354 (381) T protein:vir:10 292 -TALPFNLNVI-----ESTVQE-A-----GK--VLTYVKGL--YDGYLAGGINVQKFKETLALDDMDLYTAKQFAY-GKA 354 (381) T ss_pred -ecCCCCceeE-----EcCCCC-c-----Cc--EEEEEccc--EEEEEecccEEEeechhhhhcCceEEEEEEEEc-CEE Confidence 0001111121 111110 1 11 22223222 22222222221111 11111 23455566665 567 Q ss_pred ECCceeEeeecC Q lcl|NC_021342. 
343 RYPLCAAYVDMA 354 (354) Q Consensus 343 ~~P~ai~y~D~~ 354 (354) +.|.|++++|++ T Consensus 355 ~~~~A~~v~~l~ 366 (381) T protein:vir:10 355 KDNKVAAVWKLD 366 (381) T ss_pred ecCCcEEEEEEe Confidence 899999999998 No 112 >protein:vir:93881 Length: 387 # NCBI annotation: ORF011 # Family: family:all:658 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239938;genbank:gi:66395599;genbank:GeneID:5130947 Probab=96.99 E-value=0.00016 Score=41.35 Aligned_cols=303 Identities=7% Similarity=-0.077 Sum_probs=138.7 Q ss_pred CcccchhH--HHHhhh--hhhhcccccccccc-----------hhhhhh--------hhhhhc-cCCceeccchhhHHHH Q lcl|NC_021342. 1 MAIKTIDA--QTIQGN--QWLVHKGYVSRNGD-----------QWVINN--------TALDAI-GNPNIMLDADGGIAFY 56 (354) Q Consensus 1 ~~~~~~~~--~~~~~~--~~~~~~~~~~~~~~-----------~~~~~~--------~am~a~-~~~~~~~da~~~~~fl 56 (354) ..+..+.. ..++.. .-+... ....... ++.... ..+.+. ...+++..+++++.++ T Consensus 51 ~~~~~l~~~~~~~e~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~al~~~t~s~gG~~ 129 (387) T protein:vir:93 51 QRFNIVERQVKDIEEKEKAKVKDT-GEAYQSLNDHEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKL 129 (387) T ss_pred HHHHHHHHHHHHHHHHHHHhhhhc-cccCCCcchhhHHHHHHHHHHHHHhhhhhhhhhhhhhHHHHHhhccCcCCCCcee Confidence 00000000 000000 000000 0000000 000000 000000 0011122222333444 Q ss_pred HHHHHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeeccccceeEecCCCcccceeeeccceeEEEEEEEEeeE Q lcl|NC_021342. 57 ISQLAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYDGVTMGKFIGANGQDLPRVAQSAQMHTVPLGYAGNEC 136 (354) Q Consensus 57 ~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~v~~~~~~~~~pv~~~~~~~ 136 (354) +. +.+.+.|++.....-..|.+..+.+-.+ .++.. .....+.+.|++... 
..|..+...+......+.++.-+ T Consensus 130 IP--~~~~~~Ii~~~~~~~~l~~~~~v~~~~~---~~~p~-~~~~~~~a~~v~E~~-~~~~~~~~f~~v~~~~~k~~~~~ 202 (387) T protein:vir:93 130 LP--KTLSKEIVSEPFAKNQLREKARLTNIKG---LEIPR-VSYTLDDDDFITDVE-TAKELKLKGDTVKFTTNKFKVFA 202 (387) T ss_pred ec--hhHHHHHHHHHHhhchhhhheeeeecCC---ceEEE-EeecCCccccccCcc-cccccccccceeeeeheeeeeec Confidence 44 4556677777666666677766543222 22211 222345567776544 35666777777778888888877 Q ss_pred eecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeee-eehhhCceeeeecCCcccccccccccccCHHHHHHHHHHH Q lcl|NC_021342. 137 HYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYF-GDASRGMYGLFNNPNVTLSSATKDYKTMNGQELFNMLNAP 215 (354) Q Consensus 137 ~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~-G~~~~gi~GLlN~p~~~~~~~~~~W~~~T~~ei~~di~~~ 215 (354) .+|.+=|+- ...++..--....++++...+++.+|. |+....-.|+++++++...+. ...+++|.++ T Consensus 203 ~iS~ell~D---s~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~p~g~l~~~~~~~v~~---------~~~~d~i~~~ 270 (387) T protein:vir:93 203 AISDTVIHG---SDVDLVNWVENALQSGLAAKERKDALAVSPKSGLDHMSFYNGSVKEVEG---------ADMYDAIINA 270 (387) T ss_pred hhhHHHHhh---hHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeeccccccccc---------cchHHHHHHH Confidence 887554433 344677777788888888888887664 443334578888777654322 2235677778 Q ss_pred HHHHHHHhCCcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeeeeeeeccccccccccCc Q lcl|NC_021342. 216 IFSVINLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQLDAAELAANGVSNSN 295 (354) Q Consensus 216 ~~~l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~~g~g~~g 295 (354) +.+|... +. ..-..+|++..|..+.+.. .++.+ .+ + .|.+-+|...|.+.+.+... 
T Consensus 271 ~~~l~~~--~~-~~a~~~mn~~t~~~~~~~~-~d~~~-~~---~---------~~~~~~llG~PV~~~~~~~~------- 326 (387) T protein:vir:93 271 LADLHED--YR-DNATIYMRYADYVKIISVL-SNGTT-NF---F---------DTPAEKVFGKPVVFTDAAVK------- 326 (387) T ss_pred HhccChh--hh-cCCEEEEechHHHHHHHHH-hcCCC-cc---c---------ccCCccccccceEEecCCCc------- Confidence 7776543 21 1235788888776654432 22222 11 1 12222233333333221100 Q ss_pred ccEEEEEEcCcceEEEeeCchhhhcc-ccccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 296 KPRYMVYDKSDRNLAMANPIPFRMLA-PQMASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 296 ~d~~v~y~~~~~~~~~~vp~~~~~~~-~~~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) +++-+.+.-++ .. ..+...+ .+...-.+.+-+..|++|. +.+|.|++++.+. T Consensus 327 ---~~~GDf~~~~~-~~--~~~~~~~~~~~~~~~~~~~~~~r~d~~-v~~~eA~~~l~~k 379 (387) T protein:vir:93 327 ---PIVGDFNYFGI-NY--DGTTYDTDKDVKKGEYLFVLTAWYDQQ-RTLDSAFRIAKAK 379 (387) T ss_pred ---eeeeehhhhhe-eh--hhheeeecccccCCceeEEEEeeeCce-eechhheEEEEee Confidence 11111110000 00 1111111 1122223445566788765 5679999999997 No 113 >protein:vir:93742 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240459;genbank:gi:66396126;genbank:GeneID:5133511 Probab=96.95 E-value=0.00024 Score=40.43 Aligned_cols=263 Identities=8% Similarity=0.011 Sum_probs=134.6 Q ss_pred cCCceeccchhhHHHHHHHHHHHHHHHHHhhhhcccchhhccccCCCCC-ceeEEEEEeeccccceeEecCCCcccceee Q lcl|NC_021342. 41 GNPNIMLDADGGIAFYISQLAGIEATVYETPYGDITYRFDVPMAANIPE-YADTWMYRSYDGVTMGKFIGANGQDLPRVA 119 (354) Q Consensus 41 ~~~~~~~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~-~~~~~~~~~~~~~G~a~~~~~~~~dip~v~ 119 (354) +-...|.-+|- +.. +.+.+.+.+.....+....+..+...+.. 
...++.+..+...|.++.+.++ ++++..+ T Consensus 1 ma~~~T~~~~~----iiP--ev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~~~~eg-~~i~~~~ 73 (274) T protein:vir:93 1 MPQGITKTSNQ----IIP--EVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEG-EKIPTDI 73 (274) T ss_pred CCccceehhhe----ech--HHHHHHHHHHHHhhhhhcccccccccccCCCCCEEEEEeeccCCCcccccCC-Ccccccc Confidence 12233333331 122 22223333444444445555544332221 2346778888888999888654 5688888 Q ss_pred eccceeEEEEEEEEeeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehhhCceeeeecCCccccccccc Q lcl|NC_021342. 120 QSAQMHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDASRGMYGLFNNPNVTLSSATKD 199 (354) Q Consensus 120 ~~~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~gi~GLlN~p~~~~~~~~~~ 199 (354) ...+.....+.+.+.+|.+ .|+++++. +.++-......+.+++++..|+.++..-.+. + ..+ .+ T Consensus 74 it~~~~~~~i~~~~~~~~i--~D~~~~~~-~~d~~~~~~~~~~~~~a~~~d~~~~~~~~~a--------~-~~~---~~- 137 (274) T protein:vir:93 74 LETKKREAKIRKIAKGTSI--TDEALLSG-YGDPQGEQVRQHGLAHANKVDNDVLEALMGA--------K-LTV---NA- 137 (274) T ss_pred cccceeEEEeeeecccccc--cHHHHHhh-ccchHHHHHHHHHHHHHHHHHHHHHHHHhcc--------c-ccc---cc- Confidence 8888888888776655555 66666554 4556677778888899999988766322110 0 011 11 Q ss_pred ccccCHHHHHHHHHHHHHHHHHHhCCcccccEEEeCHHHHHHHhhcccC---CCCCchHHHHHHhhCcccccccccceee Q lcl|NC_021342. 200 YKTMNGQELFNMLNAPIFSVINLSRRFHVPNTALMFPDLWNQANNQLMT---GYTDRTVMQHFMEANSYTLLTGNELDIQ 276 (354) Q Consensus 200 W~~~T~~ei~~di~~~~~~l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~---~~~~~Tvl~~l~~n~~~~~~~g~~l~I~ 276 (354) +++. +++|.+++.+|-.. + ..+..|+|+|..|..|.+.... ..++. 
.+-+..++.+-...| T Consensus 138 --~~~~---~d~i~dA~~~l~d~--~-~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~--g~~~~~~G~ig~~~G------ 201 (274) T protein:vir:93 138 --DITK---LNGLQSAIDKFNDE--D-LEPMVLFINPLDAGKLRGDASTNFTRATEL--GDDIIVKGAFGEALG------ 201 (274) T ss_pred --cccC---HHHHHHHHHHhhhc--c-CCccEEEeCHHHHHHHHhhhhhcccccccc--cccceeecccceecC------ Confidence 1111 56677777777543 2 3678999999999999753111 11110 001111222212222 Q ss_pred eeeeeeeccccccccccCcccEEEEEEcCcceEEEeeCchhhhccc-cccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 277 IRFQLDAAELAANGVSNSNKPRYMVYDKSDRNLAMANPIPFRMLAP-QMASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 277 ~~~~L~~~~~~~~g~g~~g~d~~v~y~~~~~~~~~~vp~~~~~~~~-~~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) .+.+...... . +..+++ .+..+.+..-.+.+.-.- ..+...-.+..... .|+-+.+|.+++.+-.+ T Consensus 202 -~~Vi~s~~~p-~-------~t~~l~--~~gai~~~~~~~~~vE~~Rd~~~~~d~i~~~~~-y~~~~~~~~~~v~~t~~ 268 (274) T protein:vir:93 202 -AIIVRTNKLE-A-------GTAILA--KKGAVKLILKRDFFLEVARDASTKTTALYSDKH-YVAYLYDESKAVKITKG 268 (274) T ss_pred -eeEEEcCCCC-c-------ceEEEE--eCCeEEEEecCCcccccccchhhcccEEEEEEE-EEEEEEcCCceEEEeeC Confidence 2222222111 0 112222 234444443333332111 11122333333333 47899999999998887 No 114 >protein:vir:100884 Length: 389 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358764;genbank:gi:78000028;genbank:GeneID:3726155 Probab=96.73 E-value=0.00038 Score=39.34 Aligned_cols=306 Identities=8% Similarity=-0.024 Sum_probs=138.0 Q ss_pred CcccchhHHHHh------hhhhhhcccccccccchhhhhhhh-----------hhhccCCceeccchhhHHHHHHHHHHH Q lcl|NC_021342. 1 MAIKTIDAQTIQ------GNQWLVHKGYVSRNGDQWVINNTA-----------LDAIGNPNIMLDADGGIAFYISQLAGI 63 (354) Q Consensus 1 ~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~a-----------m~a~~~~~~~~da~~~~~fl~~~L~~I 63 (354) -.|+.+....-. ...... ..............-+ +.+. ...++++ +.+++. 
+.+ T Consensus 55 ~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~lr~~~~~~~~~---~~~t~~~--gg~~vP--~~~ 125 (389) T protein:vir:10 55 DQIKALEAEKPAEPKTEPKDDGSK--KGTDLSKKPIDAKKKAINDFIHSHGKVIDAT---SKVTSTE--AGVLIP--EEI 125 (389) T ss_pred HHHHHHHHHHHhhhhccccccccc--cccccchhHHHHHHHHHHHHhhcchhhhhhh---cccccCC--cceeeh--HHH Confidence 111111110000 000000 0000000000000001 1111 0112222 334443 445 Q ss_pred HHHHHHhhhhcccchhhccccCCCCCceeEEEEEeec-cccceeEecCCCcccc-eeeeccceeEEEEEEEEeeEeecHH Q lcl|NC_021342. 64 EATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYD-GVTMGKFIGANGQDLP-RVAQSAQMHTVPLGYAGNECHYTLD 141 (354) Q Consensus 64 d~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~-~~G~a~~~~~~~~dip-~v~~~~~~~~~pv~~~~~~~~~~~~ 141 (354) .+.+++........+.++.+.. .+.. +..+.... ..+.+.+++.++. .| ..+...+.....++.++.-+.+|.. T Consensus 126 ~~~i~~~~~~~~~l~~~~~~~~-~~~~--~~~~~~~~~~~~~~~~~~E~~~-~~~~~~~~~~~i~~~~~k~~~~~~iS~e 201 (389) T protein:vir:10 126 IYDPTAEVNSVVDLSTLVTKTP-VTTP--KGTYPILKRATDRFSSVAELAE-NPKLAEPEFNKVDWSVATYRGAIPLSEE 201 (389) T ss_pred HHHHHHHHHhhhhHHhhcceee-ccCC--eeEEEEEecCCCcccccccccc-ccccccccceeeeeeheeeEeeehhhHH Confidence 6777777777777777665432 2212 23333332 2344455655543 34 3455667778888888888888876 Q ss_pred HHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehhhCceeeeecCCcccccccccccccCHHHHHHHHHHHHHHHHH Q lcl|NC_021342. 142 EMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDASRGMYGLFNNPNVTLSSATKDYKTMNGQELFNMLNAPIFSVIN 221 (354) Q Consensus 142 El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~gi~GLlN~p~~~~~~~~~~W~~~T~~ei~~di~~~~~~l~~ 221 (354) =|+.+ ..++..-.....++++...+|..+..|.......| ..+ ..+ ++++.++++.... 
T Consensus 202 ll~ds---~~~l~~~i~~~la~~~~~~~~~~i~~g~~~~~~~~----------~~~----~~~----~d~l~~~~~~~~~ 260 (389) T protein:vir:10 202 AIADS---AVDLTALVGQSIKEKSVNTYNAMIAPVLQSFTAKK----------TTT----DTL----VDSLKHILNVDLD 260 (389) T ss_pred HHhhh---hHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccc----------ccc----ccc----HHHHHHHHHhhhh Confidence 55543 34677778888899999999999887765421111 000 111 3444444442221 Q ss_pred HhCCcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCc-ccccccccceeeeeeeeeeccccccccccCcccEEE Q lcl|NC_021342. 222 LSRRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANS-YTLLTGNELDIQIRFQLDAAELAANGVSNSNKPRYM 300 (354) Q Consensus 222 ~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~-~~~~~g~~l~I~~~~~L~~~~~~~~g~g~~g~d~~v 300 (354) . .. ...++|+|..|..|.+-. +..|. ||-.... .....+.+..+-..|.+......... .++ +-.+ T Consensus 261 ~--~~--~a~~~~n~~~~~~L~~lk--d~~G~----~i~~~~~~~~~~~~~~~~l~G~pV~~~~~~~~~~--~~~-~~~~ 327 (389) T protein:vir:10 261 P--AY--SRALVVTQSLFNTLDTLK--DKNGR----YLLHDASDSITDGTAKGTILGVPVYVVGDTLLGS--LAG-DQKA 327 (389) T ss_pred h--hh--CcEEEecHHHHHHHHHhh--ccCCC----eeeecCcccccccccccccccceeEEecccccCC--CCC-ceEE Confidence 1 11 247999999999997533 33332 1211100 00011222233333322221111111 111 2223 Q ss_pred EEEcCcceEEEeeCchhhhccccccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 301 VYDKSDRNLAMANPIPFRMLAPQMASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 301 ~y~~~~~~~~~~vp~~~~~~~~~~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) +|-+=.+.+.+...+.++..-.........+....|++|. 
+.+|.|++++.++ T Consensus 328 ~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~r~d~~-~~~~~a~~~~~~~ 380 (389) T protein:vir:10 328 FVGDLKRGVLFTDRQQVTLAWEDSKIYGKYLGAAFRFGVQ-KADSKAGYFVTNT 380 (389) T ss_pred EEeeccccEEEEeecceEEEeeccccccceEEEEEEeccE-EecccceEEEEee Confidence 3321112233333344444333333333445666788755 6889999999998 No 115 >protein:vir:97433 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240749;genbank:gi:66396420;genbank:GeneID:5133789 Probab=96.66 E-value=0.00043 Score=39.05 Aligned_cols=265 Identities=8% Similarity=0.024 Sum_probs=133.0 Q ss_pred cCCceeccchhhHHHHHHHHHHHHHHHHHhhhhcccchhhccccCCCCC-ceeEEEEEeeccccceeEecCCCcccceee Q lcl|NC_021342. 41 GNPNIMLDADGGIAFYISQLAGIEATVYETPYGDITYRFDVPMAANIPE-YADTWMYRSYDGVTMGKFIGANGQDLPRVA 119 (354) Q Consensus 41 ~~~~~~~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~-~~~~~~~~~~~~~G~a~~~~~~~~dip~v~ 119 (354) +..+.|.-+|-- .. +-+.+.+.+.....+....+..+...+.. ...++.++.+...|.++.+.++ ++++..+ T Consensus 1 ma~~~T~~~d~i----iP--ev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~a~~~~~g-~~i~~~~ 73 (274) T protein:vir:97 1 MPQGLTKTSDQI----IP--EVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEG-EKIPTDI 73 (274) T ss_pred CCccceehhhee----ch--HHHHHHHHHhhhhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCC-Ccccccc Confidence 223344444321 22 22223334444444555555544432221 2457888888888999887664 5688888 Q ss_pred eccceeEEEEEEEEeeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehhhCceeeeecCCccccccccc Q lcl|NC_021342. 120 QSAQMHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDASRGMYGLFNNPNVTLSSATKD 199 (354) Q Consensus 120 ~~~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~gi~GLlN~p~~~~~~~~~~ 199 (354) ...+.....+...+.+ |.+.|+++++..+ ++-......++.++++..|+.++.--.. ....... 
T Consensus 74 lt~~~~~~~i~~~~~~--~~i~D~~~~~~~~-dp~~~~~~~~a~a~a~~vd~~~~~~l~~---------a~~~~~~---- 137 (274) T protein:vir:97 74 LETKKREAKIRKIAKG--TSITDEALLSGYG-DPQGEQVRQHGLAHANKVDNDVLEALMG---------AKLTVNA---- 137 (274) T ss_pred cccceeEEEeeeecce--ecccHHHHHhccc-hHHHHHHHHHHHHHHHHHHHHHHHHHhc---------cCccccc---- Confidence 8888888888776655 5556777666544 4456667778888888888766522111 1111110 Q ss_pred ccccCHHHHHHHHHHHHHHHHHHhCCcccccEEEeCHHHHHHHhhcccCCCCCchH-HHHHHhhCcccccccccceeeee Q lcl|NC_021342. 200 YKTMNGQELFNMLNAPIFSVINLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRTV-MQHFMEANSYTLLTGNELDIQIR 278 (354) Q Consensus 200 W~~~T~~ei~~di~~~~~~l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tv-l~~l~~n~~~~~~~g~~l~I~~~ 278 (354) .++. ++.|.++..+|-.. ...+..|+|+|..|..|.+.........|- -+-+..++.+....| . T Consensus 138 --~~~~---~d~i~dA~~~l~d~---~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G-------~ 202 (274) T protein:vir:97 138 --DITK---LNGLQSAIDKFNDE---DLEPMVLFVNPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALG-------A 202 (274) T ss_pred --cccC---HHHHHHHHHHhhcc---CCCceEEEeCHHHHHHHHhhhhhhccccCcccccceeccccceecC-------e Confidence 1111 56777777777543 236789999999999997532110000000 001112222212222 2 Q ss_pred eeeeeccccccccccCcccEEEEEEcCcceEEEeeCchhhhccc-cccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 279 FQLDAAELAANGVSNSNKPRYMVYDKSDRNLAMANPIPFRMLAP-QMASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 279 ~~L~~~~~~~~g~g~~g~d~~v~y~~~~~~~~~~vp~~~~~~~~-~~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) +.+...... . +..+++ .+.-+.+....+.+.-.- .+....-.+-... 
..|+-+.+|..++.+--+ T Consensus 203 ~Vi~s~~~p-~-------~t~~l~--~~gA~~~~~~~~~~vE~~Rd~~~~~d~i~~~~-~y~~~~~~~~~vv~~t~~ 268 (274) T protein:vir:97 203 IIVRTNKLE-A-------GTAILA--KKGAVKLILKRDFFLEVARDASTKTTALYSDK-HYVAYLYDESKAVKITKG 268 (274) T ss_pred eEEEcCCCC-c-------ceEEEE--eCcceEeeecCCceeccccchhhcccEEEEEE-EEEEEEEcCCceEEEecC Confidence 222222110 0 111222 233333333333332111 1111222333333 347899999999887777 No 116 >protein:vir:94494 Length: 274 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240676;genbank:gi:66396348;genbank:GeneID:5133758 Probab=96.66 E-value=0.00043 Score=39.05 Aligned_cols=265 Identities=8% Similarity=0.024 Sum_probs=133.0 Q ss_pred cCCceeccchhhHHHHHHHHHHHHHHHHHhhhhcccchhhccccCCCCC-ceeEEEEEeeccccceeEecCCCcccceee Q lcl|NC_021342. 41 GNPNIMLDADGGIAFYISQLAGIEATVYETPYGDITYRFDVPMAANIPE-YADTWMYRSYDGVTMGKFIGANGQDLPRVA 119 (354) Q Consensus 41 ~~~~~~~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~-~~~~~~~~~~~~~G~a~~~~~~~~dip~v~ 119 (354) +..+.|.-+|-- .. +-+.+.+.+.....+....+..+...+.. ...++.++.+...|.++.+.++ ++++..+ T Consensus 1 ma~~~T~~~d~i----iP--ev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~a~~~~~g-~~i~~~~ 73 (274) T protein:vir:94 1 MPQGLTKTSDQI----IP--EVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEG-EKIPTDI 73 (274) T ss_pred CCccceehhhee----ch--HHHHHHHHHhhhhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCC-Ccccccc Confidence 223344444321 22 22223334444444555555544432221 2457888888888999887664 5688888 Q ss_pred eccceeEEEEEEEEeeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehhhCceeeeecCCccccccccc Q lcl|NC_021342. 120 QSAQMHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDASRGMYGLFNNPNVTLSSATKD 199 (354) Q Consensus 120 ~~~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~gi~GLlN~p~~~~~~~~~~ 199 (354) ...+.....+...+.+ |.+.|+++++..+ ++-......++.++++..|+.++.--.. ....... 
T Consensus 74 lt~~~~~~~i~~~~~~--~~i~D~~~~~~~~-dp~~~~~~~~a~a~a~~vd~~~~~~l~~---------a~~~~~~---- 137 (274) T protein:vir:94 74 LETKKREAKIRKIAKG--TSITDEALLSGYG-DPQGEQVRQHGLAHANKVDNDVLEALMG---------AKLTVNA---- 137 (274) T ss_pred cccceeEEEeeeecce--ecccHHHHHhccc-hHHHHHHHHHHHHHHHHHHHHHHHHHhc---------cCccccc---- Confidence 8888888888776655 5556777666544 4456667778888888888766522111 1111110 Q ss_pred ccccCHHHHHHHHHHHHHHHHHHhCCcccccEEEeCHHHHHHHhhcccCCCCCchH-HHHHHhhCcccccccccceeeee Q lcl|NC_021342. 200 YKTMNGQELFNMLNAPIFSVINLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRTV-MQHFMEANSYTLLTGNELDIQIR 278 (354) Q Consensus 200 W~~~T~~ei~~di~~~~~~l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tv-l~~l~~n~~~~~~~g~~l~I~~~ 278 (354) .++. ++.|.++..+|-.. ...+..|+|+|..|..|.+.........|- -+-+..++.+....| . T Consensus 138 --~~~~---~d~i~dA~~~l~d~---~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G-------~ 202 (274) T protein:vir:94 138 --DITK---LNGLQSAIDKFNDE---DLEPMVLFVNPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALG-------A 202 (274) T ss_pred --cccC---HHHHHHHHHHhhcc---CCCceEEEeCHHHHHHHHhhhhhhccccCcccccceeccccceecC-------e Confidence 1111 56777777777543 236789999999999997532110000000 001112222212222 2 Q ss_pred eeeeeccccccccccCcccEEEEEEcCcceEEEeeCchhhhccc-cccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 279 FQLDAAELAANGVSNSNKPRYMVYDKSDRNLAMANPIPFRMLAP-QMASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 279 ~~L~~~~~~~~g~g~~g~d~~v~y~~~~~~~~~~vp~~~~~~~~-~~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) +.+...... . +..+++ .+.-+.+....+.+.-.- .+....-.+-... 
..|+-+.+|..++.+--+ T Consensus 203 ~Vi~s~~~p-~-------~t~~l~--~~gA~~~~~~~~~~vE~~Rd~~~~~d~i~~~~-~y~~~~~~~~~vv~~t~~ 268 (274) T protein:vir:94 203 IIVRTNKLE-A-------GTAILA--KKGAVKLILKRDFFLEVARDASTKTTALYSDK-HYVAYLYDESKAVKITKG 268 (274) T ss_pred eEEEcCCCC-c-------ceEEEE--eCcceEeeecCCceeccccchhhcccEEEEEE-EEEEEEEcCCceEEEecC Confidence 222222110 0 111222 233333333333332111 1111222333333 347899999999887777 No 117 >protein:vir:96833 Length: 275 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240157;genbank:gi:66395822;genbank:GeneID:5133174 Probab=96.61 E-value=0.00047 Score=38.82 Aligned_cols=266 Identities=6% Similarity=0.007 Sum_probs=130.2 Q ss_pred hhhccCCceeccchhhHHHHHHHHHHHHHHHHHhhhhcccchhhccccCCC-CCceeEEEEEeeccccceeEecCCCccc Q lcl|NC_021342. 37 LDAIGNPNIMLDADGGIAFYISQLAGIEATVYETPYGDITYRFDVPMAANI-PEYADTWMYRSYDGVTMGKFIGANGQDL 115 (354) Q Consensus 37 m~a~~~~~~~~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~-~~~~~~~~~~~~~~~G~a~~~~~~~~di 115 (354) |-. .+.|..+| .+.. +.+-+-|.+.....+....+..+.+.+ +....++..+.+...|.++.+.++ ++| T Consensus 1 ~~~---~~~T~l~d----~i~P--Ev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g-~~i 70 (275) T protein:vir:96 1 MAL---ENMTKLAN----MVNP--EVLAPMMQAELDKKLKFAQFADIDNTLVGQPGNTITFPAFVYSGDAKVVPEG-EEI 70 (275) T ss_pred CCC---cccchhhh----hhch--HHHHHHHHHHHHHhhhhcccceecccccCCCCCEEEeeeeccCCccccccCC-CCc Confidence 211 22344443 2222 222233334444445555555443332 112456788888888999888664 578 Q ss_pred ceeeeccceeEEEEEEEEeeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehhhCceeeeecCCccccc Q lcl|NC_021342. 116 PRVAQSAQMHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDASRGMYGLFNNPNVTLSS 195 (354) Q Consensus 116 p~v~~~~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~gi~GLlN~p~~~~~~ 195 (354) +......+.....+...+.+|.+ .|++..+. +.++-.+....+...+++..|+-++. .+ +....+. 
+ T Consensus 71 ~~~~lt~~~~~~~i~~~~~~~~i--~D~~~~~~-~~d~~~~~~~~~a~~~a~~~d~~ll~---~l------~~a~~~~-~ 137 (275) T protein:vir:96 71 PIDLIETKKRQATIRKIGKGTVL--TDEALLSG-YGDPKGEAVRQHGLAIANKVDNDVLE---AL------QGATLKV-E 137 (275) T ss_pred chhhcccceeeEEeehhcccccc--cHHHHHhh-ccchHHHHHHHHHHHHHHHHHHHHHH---HH------hcccccc-c Confidence 88888888888888777666555 66665544 44555666777888888888876551 11 1111111 1 Q ss_pred ccccccccCHHHHHHHHHHHHHHHHHHhCCcccccEEEeCHHHHHHHhhcccCCCCCchH-HHHHHhhCcccccccccce Q lcl|NC_021342. 196 ATKDYKTMNGQELFNMLNAPIFSVINLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRTV-MQHFMEANSYTLLTGNELD 274 (354) Q Consensus 196 ~~~~W~~~T~~ei~~di~~~~~~l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tv-l~~l~~n~~~~~~~g~~l~ 274 (354) .. ++ -++.|.+++.++-.. ...+..|+++|..+..|.+.........+. -+-+..|+.+. + T Consensus 138 ~~-----~~---~~d~i~dA~~~lgd~---~~~~~~ivv~p~~~~~L~k~~~~~f~~~~~~g~~~~~~G~ig-------~ 199 (275) T protein:vir:96 138 AD-----IT---KLAGLQTAIDKFNDE---DLEPMVLFVNPLDAGKLRASATDNFTRATLLGDNVIVKGAFG-------E 199 (275) T ss_pred cc-----cc---CHHHHHHHHHHhccc---cCCccEEEeCHHHHHHHHhcccccccccccccccceeccccc-------e Confidence 11 11 156677777777432 236789999999999995531100000000 00011122211 2 Q ss_pred eeeeeeeeeccccccccccCcccEEEEEEcCcceEEEeeCchhhhccc-cccCceeEEeeeeeeeeEEEECCceeEeeec Q lcl|NC_021342. 275 IQIRFQLDAAELAANGVSNSNKPRYMVYDKSDRNLAMANPIPFRMLAP-QMASLGITVPAEYKISGTEFRYPLCAAYVDM 353 (354) Q Consensus 275 I~~~~~L~~~~~~~~g~g~~g~d~~v~y~~~~~~~~~~vp~~~~~~~~-~~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~ 353 (354) +...+.+...... . + ..+++. +.-+.+....+.+.-.- ......-.+.... ..|+-+.+|..++.+-. T Consensus 200 ~~G~~Vi~s~~~p-~-----~--t~~i~~--~gA~~~~~~~~~~vE~~Rd~~~~~d~i~~~~-~y~~~~~~~~~vv~~t~ 268 (275) T protein:vir:96 200 ALGAIIVRSNKIK-E-----G--EAILAK--RGAVKLITKRDFFLETERHASHKSTALFSDK-HYVAYLYDESKVVKITK 268 (275) T ss_pred ecCeeEEEeCCCC-c-----c--eEEEEe--ccceeeeecCCcccccccchhhcCcEEEEeE-EEEEEEEcCccEEEEEe Confidence 2222222222111 0 1 122222 22333333222221111 0111122222223 34789999999998866 Q ss_pred C Q lcl|NC_021342. 
354 A 354 (354) Q Consensus 354 ~ 354 (354) . T Consensus 269 ~ 269 (275) T protein:vir:96 269 S 269 (275) T ss_pred c Confidence 6 No 118 >protein:vir:96123 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240078;genbank:gi:66395742;genbank:GeneID:5133103 Probab=96.61 E-value=0.00047 Score=38.81 Aligned_cols=263 Identities=7% Similarity=-0.007 Sum_probs=135.2 Q ss_pred ccCCceeccchhhHHHHHHHHHHHHHHHHHhhhhcccchhhccccCCCC-CceeEEEEEeeccccceeEecCCCccccee Q lcl|NC_021342. 40 IGNPNIMLDADGGIAFYISQLAGIEATVYETPYGDITYRFDVPMAANIP-EYADTWMYRSYDGVTMGKFIGANGQDLPRV 118 (354) Q Consensus 40 ~~~~~~~~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~-~~~~~~~~~~~~~~G~a~~~~~~~~dip~v 118 (354) |. ...|.-+| .+.. +-+-+.+.+.....+....+..+.+.+. ....++.++.+...|.++.+.++ +++|.. T Consensus 1 ma-~~~T~~~d----~i~P--ev~s~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~~~~~g-~~i~~~ 72 (274) T protein:vir:96 1 MA-QGTTKVSN----LIVP--EVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEG-EKIPVD 72 (274) T ss_pred CC-ccccchhh----hhhh--HHHHHHHHHHHHhhhhhcccccccccccCCCCCEEEEEeeccCCCccccCCC-CcCchh Confidence 21 22222222 2222 2222334444455555556555544322 12346788888888999887664 578888 Q ss_pred eeccceeEEEEEEEEeeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehhhCceeeeecCCcccccccc Q lcl|NC_021342. 119 AQSAQMHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDASRGMYGLFNNPNVTLSSATK 198 (354) Q Consensus 119 ~~~~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~gi~GLlN~p~~~~~~~~~ 198 (354) +...+.....+...+.+|.+ .|++..+ .+.++-.+....+...+++..|+.++.--.+ .... ..... 
T Consensus 73 ~it~~~~~~~i~~~~~~~~i--~D~~~~~-~~~d~~~~~~~~~~~~~a~~~d~~i~~~l~~---------a~~~-~~~~~ 139 (274) T protein:vir:96 73 QIGTSKREAKVRKIGKGTEL--TDEAVLS-GFGDPQGEAVRQHGLAIANKVDNDVLEALKG---------ATLT-VEADI 139 (274) T ss_pred hcccceeEEEEEeeeceeee--cHHHHHh-hcchHHHHHHHHHHHHHHHHHHHHHHHHHhc---------CCCC-cCccc Confidence 88888888888776665555 5666655 4556667778888889999998877632211 0000 11111 Q ss_pred cccccCHHHHHHHHHHHHHHHHHHhCCcccccEEEeCHHHHHHHhhccc---CCCCCchHHHHHHhhCccccccccccee Q lcl|NC_021342. 199 DYKTMNGQELFNMLNAPIFSVINLSRRFHVPNTALMFPDLWNQANNQLM---TGYTDRTVMQHFMEANSYTLLTGNELDI 275 (354) Q Consensus 199 ~W~~~T~~ei~~di~~~~~~l~~~s~g~~~p~~L~l~p~~~~~L~~~~~---~~~~~~Tvl~~l~~n~~~~~~~g~~l~I 275 (354) . .++.|.++..+|-.. ...+..|+|+|..|..|.+... ...++. .+=+..++.+....|.+ T Consensus 140 ~--------~~d~i~dA~~~l~d~---~~~~~~ivv~p~~~~~L~k~~~~~f~~~~~~--g~~~~~~g~ig~~~G~~--- 203 (274) T protein:vir:96 140 T--------KLDGLQTAIDKFNDE---DLEPMVLFVNPLDAGGLRTSASDNFTRPTQL--GDNIIVKGAFGEALGAV--- 203 (274) T ss_pred c--------cHHHHHHHHHHhccc---CCCceEEEeCHHHHHHHHhcccccccccccc--cccceeecccceecCee--- Confidence 1 156777777777543 2367899999999999965321 111110 00111222222222222 Q ss_pred eeeeeeeeccccccccccCcccEEEEEEcCcceEEEeeCchhhhccc-cccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 276 QIRFQLDAAELAANGVSNSNKPRYMVYDKSDRNLAMANPIPFRMLAP-QMASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 276 ~~~~~L~~~~~~~~g~g~~g~d~~v~y~~~~~~~~~~vp~~~~~~~~-~~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) .+....+. . +..+++ .+.-+.+....+.+.-.- .+......+..... 
.|+-+.+|.+++.+--+ T Consensus 204 ----Vi~s~~~p-~-------~t~~l~--~~gA~~~~~~~~~~vE~~Rd~~~~~d~i~~~~~-yg~~~~~~~~vv~~t~~ 268 (274) T protein:vir:96 204 ----IVRSNKLN-K-------GEALLA--KKGAVKLITKRDFFLEKDRDASRKSTALYSDKH-YVAYLYDESKVVKITKG 268 (274) T ss_pred ----EEEcCCCC-c-------ceEEEE--eCcceeeeecCCcccccccchhhcccEEEEeeE-EEEEEEcCccEEEEEcC Confidence 12211110 0 111222 223333333333222111 11112233333333 47999999999999888 No 119 >protein:vir:9361 Length: 402 # NCBI annotation: SLT orf 37-like protein # Family: family:all:658 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803339;genbank:gi:29028650;genbank:GeneID:1258088 Probab=96.53 E-value=0.00054 Score=38.51 Aligned_cols=303 Identities=7% Similarity=-0.081 Sum_probs=139.1 Q ss_pred CcccchhHHHHhhhhhhhcccccccccchhh--------h--------hhhhhhhcc-CCceeccchhhHHHHHHHHHHH Q lcl|NC_021342. 1 MAIKTIDAQTIQGNQWLVHKGYVSRNGDQWV--------I--------NNTALDAIG-NPNIMLDADGGIAFYISQLAGI 63 (354) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------~--------~~~am~a~~-~~~~~~da~~~~~fl~~~L~~I 63 (354) -.++.++...=......-.+.. .....+.. . ....+.+.. ..++....++.+.+++. +.+ T Consensus 73 ~~~~~~e~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~~~t~~~GG~lIP--~~~ 149 (402) T protein:vir:93 73 RQVQDIEEKEKAKVKDKGEAYQ-SLSDNEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLP--KTL 149 (402) T ss_pred HHHHHHHHHHHhhhhhccccCC-CCchhHHHHHHHHHHHHHHHhhhhHHHHHHhHHHHHhhhccCCCcCCccccc--hhH Confidence 0111111100000000000000 00000000 0 000011000 01111112223334444 456 Q ss_pred HHHHHHhhhhcccchhhccccCCCCCceeEEEEEeeccccceeEecCCCcccceeeeccceeEEEEEEEEeeEeecHHHH Q lcl|NC_021342. 64 EATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYDGVTMGKFIGANGQDLPRVAQSAQMHTVPLGYAGNECHYTLDEM 143 (354) Q Consensus 64 d~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~v~~~~~~~~~pv~~~~~~~~~~~~El 143 (354) ...|++.....-..|.++.+..-.+ .++.. .....+.+.|++... 
..|..+...+......+.++.-+.+|.+=| T Consensus 150 ~~~Ii~~~~~~~~l~~~~~v~~~~~---~~~p~-~~~~~~~a~~v~Eg~-~~~~~~~~f~~i~~~~~k~~~~i~iS~ell 224 (402) T protein:vir:93 150 SKEIVSEPFAKNQLREKARLTNIKG---LEIPR-VSYTLDDDDFITDVE-TAKELKAKGDTVKFTTNKFKVFAAISDTVI 224 (402) T ss_pred HHHHHHhHHhhhhhhhhceeeecCC---ceeee-eeccCCccccccccc-cccccccccceeeecceeeeeechhhHHHH Confidence 7778877766666677766543222 12211 122345567776644 466667777778888888888778875544 Q ss_pred HHHHHhCCCcchHHHHHHHHHHHHHhhheeee-eehhhCceeeeecCCcccccccccccccCHHHHHHHHHHHHHHHHHH Q lcl|NC_021342. 144 RKSAAMNMPIDAEQARLAFRGAEEHSQSVAYF-GDASRGMYGLFNNPNVTLSSATKDYKTMNGQELFNMLNAPIFSVINL 222 (354) Q Consensus 144 ~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~-G~~~~gi~GLlN~p~~~~~~~~~~W~~~T~~ei~~di~~~~~~l~~~ 222 (354) +-+ ..++..--....++++...+++.+|. |+....-.|+++.++++..+. ...+++|.+++.+|... T Consensus 225 ~Ds---~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~p~g~~~~~~~~~~~~---------~~~~d~l~~~~~~l~~~ 292 (402) T protein:vir:93 225 HGS---DVDLVNWVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSVKEVEG---------ADMYDAIINALADLHED 292 (402) T ss_pred hhh---HHHHHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeeccccccccc---------cchHHHHHHHHhccChh Confidence 433 44567777777788888888776664 444334467887776654322 22367788888777542 Q ss_pred hCCcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeeeeeeeccccccccccCcccEEEEE Q lcl|NC_021342. 223 SRRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQLDAAELAANGVSNSNKPRYMVY 302 (354) Q Consensus 223 s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~~g~g~~g~d~~v~y 302 (354) +. ..-..+|++..|..+.+.. .++ +..++ .+.|-+|...|...+.+.. 
+ +++- T Consensus 293 --y~-~na~~imn~~t~~~~~~~~-~d~-~~~~~------------~~~~~~llG~PV~~t~~~~---------~-i~~G 345 (402) T protein:vir:93 293 --YR-DNATIYMRYADYVKIISVL-SNG-TTNFF------------DTPAEKVFGKPVVFTDAAV---------K-PIVG 345 (402) T ss_pred --hh-cCCEEEEechHHHHHHHHH-hcC-CCccc------------ccCCccccccceEEecCCC---------c-eeee Confidence 21 2235788888776665443 222 22111 1222222222222222110 1 1111 Q ss_pred EcCcceEEEeeCchhhhcc-ccccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 303 DKSDRNLAMANPIPFRMLA-PQMASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 303 ~~~~~~~~~~vp~~~~~~~-~~~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) +...-++ +...+..-+ -+...-...+-+..|++|. +.+|.|++++.|. T Consensus 346 Df~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~r~Dg~-v~~~~A~~~l~ik 394 (402) T protein:vir:93 346 DFNYFGI---NYDGTTYDTDKDVKKGEYLFVLTAWYDQQ-RTLDSAFRIAKAK 394 (402) T ss_pred chhhhhh---hhhhhhhhhhhcccCCceEEEEEEEeCcE-EechhheEEEEee Confidence 1110000 000111100 1112224566678888655 4579999999997 No 120 >protein:vir:96978 Length: 387 # NCBI annotation: ORF009 # Family: family:all:658 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239859;genbank:gi:66395517;genbank:GeneID:5133011 Probab=96.45 E-value=0.00043 Score=39.03 Aligned_cols=303 Identities=8% Similarity=-0.042 Sum_probs=139.4 Q ss_pred CcccchhHHHHhhhhhhhcccccccccchhhhh------hhhhhh----------c-cCCceeccchhhHHHHHHHHHHH Q lcl|NC_021342. 1 MAIKTIDAQTIQGNQWLVHKGYVSRNGDQWVIN------NTALDA----------I-GNPNIMLDADGGIAFYISQLAGI 63 (354) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~am~a----------~-~~~~~~~da~~~~~fl~~~L~~I 63 (354) -.++.++...-.... -..+........+.... ...+.. . ...++...+++++.+++. 
+.+ T Consensus 58 ~~~~~~e~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gG~lIP--~~~ 134 (387) T protein:vir:96 58 RQVQDIEEKEKAKVK-DKGEAYQSLSDNEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLP--KTL 134 (387) T ss_pred HHHHHHHHHHHhhhh-hccccCCCCchhHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhhhccCCCCCCceeec--hhH Confidence 011111111100000 00000000000000000 000000 0 000111112222334444 456 Q ss_pred HHHHHHhhhhcccchhhccccCCCCCceeEEEEEeeccccceeEecCCCcccceeeeccceeEEEEEEEEeeEeecHHHH Q lcl|NC_021342. 64 EATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYDGVTMGKFIGANGQDLPRVAQSAQMHTVPLGYAGNECHYTLDEM 143 (354) Q Consensus 64 d~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~v~~~~~~~~~pv~~~~~~~~~~~~El 143 (354) .++|++.....-..|.++.+..-.+ .++.. .....+.+.|++.. ...|..+...+......+.++.-+.+|.+=| T Consensus 135 ~~~Ii~~~~~~~~l~~~~~~~~~~~---~~~p~-~~~~~~~a~~v~Eg-~~~~~~~~~f~~v~l~~~k~~~~i~iS~ell 209 (387) T protein:vir:96 135 SKEIVSEPFAKNQLREKARLTNIKG---LEIPR-VSYTLDDDDFITDV-ETAKELKAKGDTVKFTTNKFKVFAAISDTVI 209 (387) T ss_pred HHHHHHHHHhhchhhhhceeeecCC---ceeee-eeccCCcccccccc-ccccccccccceeeechheeeeechhhHHHH Confidence 7788887777666777766543222 12221 22234556676654 3466677777777888888888788875544 Q ss_pred HHHHHhCCCcchHHHHHHHHHHHHHhhheeee-eehhhCceeeeecCCcccccccccccccCHHHHHHHHHHHHHHHHHH Q lcl|NC_021342. 144 RKSAAMNMPIDAEQARLAFRGAEEHSQSVAYF-GDASRGMYGLFNNPNVTLSSATKDYKTMNGQELFNMLNAPIFSVINL 222 (354) Q Consensus 144 ~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~-G~~~~gi~GLlN~p~~~~~~~~~~W~~~T~~ei~~di~~~~~~l~~~ 222 (354) +.+ ..++..--....++++...+++.+|. |...-.-.|.++.++++..+. +..+++|.+++..|... 
T Consensus 210 ~ds---~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~~~g~~~~~~~~~~~~---------~~~~d~i~~~~~~l~~~ 277 (387) T protein:vir:96 210 HGS---DVDLVNWVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSVKEVEG---------ADMYDAIINALADLHED 277 (387) T ss_pred hhh---HHHHHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeeccccccccc---------cchHHHHHHHHhccChh Confidence 433 44666667777777888888777664 443334467787777654322 22367788888777543 Q ss_pred hCCcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeeeeeeeccccccccccCcccEEEEE Q lcl|NC_021342. 223 SRRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQLDAAELAANGVSNSNKPRYMVY 302 (354) Q Consensus 223 s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~~g~g~~g~d~~v~y 302 (354) +. ..-..+|++..|..+.+.. .+ .+..++ ...+. .+-|.|+.+ +.+. . + +++- T Consensus 278 --y~-~na~~imn~~t~~~~~~~~-~~-~~~~~~----~~~~~-~llG~PV~~-------~~~~-----~----~-~~~G 330 (387) T protein:vir:96 278 --YR-DNATIYMRYADYVKIISVL-SN-GTTNFF----DTPAE-KVFGKPVVF-------TDAA-----V----K-PIVG 330 (387) T ss_pred --hh-cCCEEEEechHHHHHHHHH-hc-CCCccc----ccCCc-cccccceEE-------ecCC-----C----c-eeee Confidence 11 2236788888877665433 22 222111 11111 122333322 2110 0 0 1111 Q ss_pred EcCcceEEEeeCchhhhccc-cccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 303 DKSDRNLAMANPIPFRMLAP-QMASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 303 ~~~~~~~~~~vp~~~~~~~~-~~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) +.+.-++ ....+...+. +...-.+.+.+..|++| .+++|.|++++.|. 
T Consensus 331 Df~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~r~Dg-~v~~~~A~~~l~~k 379 (387) T protein:vir:96 331 DFNYFGI---NYDGTTYDTDKDVKKGEYLFVLTAWYDQ-QRTLDSAFRIAKAK 379 (387) T ss_pred chhhhhh---hhhhhhheecccccCCceEEEEEEEeCc-EeechhheEEEEee Confidence 1110000 0011111111 11222456667788865 55679999999997 No 121 >protein:vir:2685 Length: 387 # NCBI annotation: hypothetical protein # Family: family:all:658 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075504;genbank:gi:12719433;genbank:GeneID:920169 Probab=96.45 E-value=0.00043 Score=39.03 Aligned_cols=303 Identities=8% Similarity=-0.042 Sum_probs=139.4 Q ss_pred CcccchhHHHHhhhhhhhcccccccccchhhhh------hhhhhh----------c-cCCceeccchhhHHHHHHHHHHH Q lcl|NC_021342. 1 MAIKTIDAQTIQGNQWLVHKGYVSRNGDQWVIN------NTALDA----------I-GNPNIMLDADGGIAFYISQLAGI 63 (354) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~am~a----------~-~~~~~~~da~~~~~fl~~~L~~I 63 (354) -.++.++...-.... -..+........+.... ...+.. . ...++...+++++.+++. +.+ T Consensus 58 ~~~~~~e~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gG~lIP--~~~ 134 (387) T protein:vir:26 58 RQVQDIEEKEKAKVK-DKGEAYQSLSDNEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLP--KTL 134 (387) T ss_pred HHHHHHHHHHHhhhh-hccccCCCCchhHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhhhccCCCCCCceeec--hhH Confidence 011111111100000 00000000000000000 000000 0 000111112222334444 456 Q ss_pred HHHHHHhhhhcccchhhccccCCCCCceeEEEEEeeccccceeEecCCCcccceeeeccceeEEEEEEEEeeEeecHHHH Q lcl|NC_021342. 64 EATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYDGVTMGKFIGANGQDLPRVAQSAQMHTVPLGYAGNECHYTLDEM 143 (354) Q Consensus 64 d~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~v~~~~~~~~~pv~~~~~~~~~~~~El 143 (354) .++|++.....-..|.++.+..-.+ .++.. .....+.+.|++.. 
...|..+...+......+.++.-+.+|.+=| T Consensus 135 ~~~Ii~~~~~~~~l~~~~~~~~~~~---~~~p~-~~~~~~~a~~v~Eg-~~~~~~~~~f~~v~l~~~k~~~~i~iS~ell 209 (387) T protein:vir:26 135 SKEIVSEPFAKNQLREKARLTNIKG---LEIPR-VSYTLDDDDFITDV-ETAKELKAKGDTVKFTTNKFKVFAAISDTVI 209 (387) T ss_pred HHHHHHHHHhhchhhhhceeeecCC---ceeee-eeccCCcccccccc-ccccccccccceeeechheeeeechhhHHHH Confidence 7788887777666777766543222 12221 22234556676654 3466677777777888888888788875544 Q ss_pred HHHHHhCCCcchHHHHHHHHHHHHHhhheeee-eehhhCceeeeecCCcccccccccccccCHHHHHHHHHHHHHHHHHH Q lcl|NC_021342. 144 RKSAAMNMPIDAEQARLAFRGAEEHSQSVAYF-GDASRGMYGLFNNPNVTLSSATKDYKTMNGQELFNMLNAPIFSVINL 222 (354) Q Consensus 144 ~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~-G~~~~gi~GLlN~p~~~~~~~~~~W~~~T~~ei~~di~~~~~~l~~~ 222 (354) +.+ ..++..--....++++...+++.+|. |...-.-.|.++.++++..+. +..+++|.+++..|... T Consensus 210 ~ds---~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~~~g~~~~~~~~~~~~---------~~~~d~i~~~~~~l~~~ 277 (387) T protein:vir:26 210 HGS---DVDLVNWVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSVKEVEG---------ADMYDAIINALADLHED 277 (387) T ss_pred hhh---HHHHHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeeccccccccc---------cchHHHHHHHHhccChh Confidence 433 44666667777777888888777664 443334467787777654322 22367788888777543 Q ss_pred hCCcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeeeeeeeccccccccccCcccEEEEE Q lcl|NC_021342. 223 SRRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQLDAAELAANGVSNSNKPRYMVY 302 (354) Q Consensus 223 s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~~g~g~~g~d~~v~y 302 (354) +. ..-..+|++..|..+.+.. .+ .+..++ ...+. .+-|.|+.+ +.+. . 
+ +++- T Consensus 278 --y~-~na~~imn~~t~~~~~~~~-~~-~~~~~~----~~~~~-~llG~PV~~-------~~~~-----~----~-~~~G 330 (387) T protein:vir:26 278 --YR-DNATIYMRYADYVKIISVL-SN-GTTNFF----DTPAE-KVFGKPVVF-------TDAA-----V----K-PIVG 330 (387) T ss_pred --hh-cCCEEEEechHHHHHHHHH-hc-CCCccc----ccCCc-cccccceEE-------ecCC-----C----c-eeee Confidence 11 2236788888877665433 22 222111 11111 122333322 2110 0 0 1111 Q ss_pred EcCcceEEEeeCchhhhccc-cccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 303 DKSDRNLAMANPIPFRMLAP-QMASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 303 ~~~~~~~~~~vp~~~~~~~~-~~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) +.+.-++ ....+...+. +...-.+.+.+..|++| .+++|.|++++.|. T Consensus 331 Df~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~r~Dg-~v~~~~A~~~l~~k 379 (387) T protein:vir:26 331 DFNYFGI---NYDGTTYDTDKDVKKGEYLFVLTAWYDQ-QRTLDSAFRIAKAK 379 (387) T ss_pred chhhhhh---hhhhhhheecccccCCceEEEEEEEeCc-EeechhheEEEEee Confidence 1110000 0011111111 11222456667788865 55679999999997 No 122 >protein:vir:94424 Length: 387 # NCBI annotation: ORF010 # Family: family:all:658 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240005;genbank:gi:66395666;genbank:GeneID:5133084 Probab=96.45 E-value=0.00043 Score=39.03 Aligned_cols=303 Identities=8% Similarity=-0.042 Sum_probs=139.4 Q ss_pred CcccchhHHHHhhhhhhhcccccccccchhhhh------hhhhhh----------c-cCCceeccchhhHHHHHHHHHHH Q lcl|NC_021342. 1 MAIKTIDAQTIQGNQWLVHKGYVSRNGDQWVIN------NTALDA----------I-GNPNIMLDADGGIAFYISQLAGI 63 (354) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~am~a----------~-~~~~~~~da~~~~~fl~~~L~~I 63 (354) -.++.++...-.... -..+........+.... ...+.. . ...++...+++++.+++. 
+.+ T Consensus 58 ~~~~~~e~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gG~lIP--~~~ 134 (387) T protein:vir:94 58 RQVQDIEEKEKAKVK-DKGEAYQSLSDNEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLP--KTL 134 (387) T ss_pred HHHHHHHHHHHhhhh-hccccCCCCchhHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhhhccCCCCCCceeec--hhH Confidence 011111111100000 00000000000000000 000000 0 000111112222334444 456 Q ss_pred HHHHHHhhhhcccchhhccccCCCCCceeEEEEEeeccccceeEecCCCcccceeeeccceeEEEEEEEEeeEeecHHHH Q lcl|NC_021342. 64 EATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYDGVTMGKFIGANGQDLPRVAQSAQMHTVPLGYAGNECHYTLDEM 143 (354) Q Consensus 64 d~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~v~~~~~~~~~pv~~~~~~~~~~~~El 143 (354) .++|++.....-..|.++.+..-.+ .++.. .....+.+.|++.. ...|..+...+......+.++.-+.+|.+=| T Consensus 135 ~~~Ii~~~~~~~~l~~~~~~~~~~~---~~~p~-~~~~~~~a~~v~Eg-~~~~~~~~~f~~v~l~~~k~~~~i~iS~ell 209 (387) T protein:vir:94 135 SKEIVSEPFAKNQLREKARLTNIKG---LEIPR-VSYTLDDDDFITDV-ETAKELKAKGDTVKFTTNKFKVFAAISDTVI 209 (387) T ss_pred HHHHHHHHHhhchhhhhceeeecCC---ceeee-eeccCCcccccccc-ccccccccccceeeechheeeeechhhHHHH Confidence 7788887777666777766543222 12221 22234556676654 3466677777777888888888788875544 Q ss_pred HHHHHhCCCcchHHHHHHHHHHHHHhhheeee-eehhhCceeeeecCCcccccccccccccCHHHHHHHHHHHHHHHHHH Q lcl|NC_021342. 144 RKSAAMNMPIDAEQARLAFRGAEEHSQSVAYF-GDASRGMYGLFNNPNVTLSSATKDYKTMNGQELFNMLNAPIFSVINL 222 (354) Q Consensus 144 ~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~-G~~~~gi~GLlN~p~~~~~~~~~~W~~~T~~ei~~di~~~~~~l~~~ 222 (354) +.+ ..++..--....++++...+++.+|. |...-.-.|.++.++++..+. +..+++|.+++..|... 
T Consensus 210 ~ds---~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~~~g~~~~~~~~~~~~---------~~~~d~i~~~~~~l~~~ 277 (387) T protein:vir:94 210 HGS---DVDLVNWVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSVKEVEG---------ADMYDAIINALADLHED 277 (387) T ss_pred hhh---HHHHHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeeccccccccc---------cchHHHHHHHHhccChh Confidence 433 44666667777777888888777664 443334467787777654322 22367788888777543 Q ss_pred hCCcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeeeeeeeccccccccccCcccEEEEE Q lcl|NC_021342. 223 SRRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQLDAAELAANGVSNSNKPRYMVY 302 (354) Q Consensus 223 s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~~g~g~~g~d~~v~y 302 (354) +. ..-..+|++..|..+.+.. .+ .+..++ ...+. .+-|.|+.+ +.+. . + +++- T Consensus 278 --y~-~na~~imn~~t~~~~~~~~-~~-~~~~~~----~~~~~-~llG~PV~~-------~~~~-----~----~-~~~G 330 (387) T protein:vir:94 278 --YR-DNATIYMRYADYVKIISVL-SN-GTTNFF----DTPAE-KVFGKPVVF-------TDAA-----V----K-PIVG 330 (387) T ss_pred --hh-cCCEEEEechHHHHHHHHH-hc-CCCccc----ccCCc-cccccceEE-------ecCC-----C----c-eeee Confidence 11 2236788888877665433 22 222111 11111 122333322 2110 0 0 1111 Q ss_pred EcCcceEEEeeCchhhhccc-cccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 303 DKSDRNLAMANPIPFRMLAP-QMASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 303 ~~~~~~~~~~vp~~~~~~~~-~~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) +.+.-++ ....+...+. +...-.+.+.+..|++| .+++|.|++++.|. 
T Consensus 331 Df~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~r~Dg-~v~~~~A~~~l~~k 379 (387) T protein:vir:94 331 DFNYFGI---NYDGTTYDTDKDVKKGEYLFVLTAWYDQ-QRTLDSAFRIAKAK 379 (387) T ss_pred chhhhhh---hhhhhhheecccccCCceEEEEEEEeCc-EeechhheEEEEee Confidence 1110000 0011111111 11222456667788865 55679999999997 No 123 >protein:vir:105334 Length: 276 # NCBI annotation: putative phage major capsid protein # Family: family:all:522 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950669;genbank:gi:119967839;genbank:GeneID:4643213 Probab=96.24 E-value=0.00083 Score=37.47 Aligned_cols=266 Identities=8% Similarity=-0.006 Sum_probs=133.0 Q ss_pred cCCceeccchhhHHHHHHHHHHHHHHHHHhhhhcccchhhccccCCCC-CceeEEEEEeeccccceeEecCCCcccceee Q lcl|NC_021342. 41 GNPNIMLDADGGIAFYISQLAGIEATVYETPYGDITYRFDVPMAANIP-EYADTWMYRSYDGVTMGKFIGANGQDLPRVA 119 (354) Q Consensus 41 ~~~~~~~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~-~~~~~~~~~~~~~~G~a~~~~~~~~dip~v~ 119 (354) +....|..+| .+.. +.+-+-|.+.....+....+..+.+.+. ....++.++.+...|.++.++++ +++|... T Consensus 1 Ma~~~T~l~d----~i~P--ev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~~igda~~~~eg-~~i~~~~ 73 (276) T protein:vir:10 1 MAQGTTTKST----QIVP--EVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFVYSGDATVVPEG-QKIPVDK 73 (276) T ss_pred CCcceeehhh----hhch--HHHHHHHHHHHHhhhhhcccceecccccCCCCCEEEeeeecCCCccccccCC-CccCccc Confidence 1122344433 2222 2222333444444444555554444332 23456788888888999888775 4688888 Q ss_pred eccceeEEEEEEEEeeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehhhCceeeeecCCccccccccc Q lcl|NC_021342. 120 QSAQMHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDASRGMYGLFNNPNVTLSSATKD 199 (354) Q Consensus 120 ~~~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~gi~GLlN~p~~~~~~~~~~ 199 (354) ...+.....+.+.+.+|.+ .|+..... +.++-..-.+.+...+++..|+-++- .+ .. +... 
T Consensus 74 lt~~~~~a~i~~~~k~~~~--tD~a~~~~-~~dp~~~~~~~~~~~~a~~~d~~~~~---~l------~~-------~~~~ 134 (276) T protein:vir:10 74 IETNRREAKIHKIGKGTDI--TDEALLSG-YGDPQGEAVRQHGLAIANKVDNDVLE---AL------RG-------TKLT 134 (276) T ss_pred cccceeeEEeehccccccc--cHHHHHhh-ccchHHHHHHHHHHHHHHHHHHHHHH---HH------hc-------cccc Confidence 8888888888887666665 55555443 45566777777888888888876541 11 10 0111 Q ss_pred ccccCHHHHHHHHHHHHHHHHHHhCCcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeee Q lcl|NC_021342. 200 YKTMNGQELFNMLNAPIFSVINLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRF 279 (354) Q Consensus 200 W~~~T~~ei~~di~~~~~~l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~ 279 (354) ++..+. -++.|.+++..+-.. ...++.++|+|..|..|.+.........+-. .++ ...+|.--++...+ T Consensus 135 ~~~~~~--t~d~i~~A~~~lgd~---~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~----g~~--~~~~G~ig~~~G~~ 203 (276) T protein:vir:10 135 VSADIG--TLAGLEAAIDTFDDE---DLEPMVLFINPKDAGKLRSSASDNFTRATEL----GDN--IIVKGAFGEALGAV 203 (276) T ss_pred cccccc--CHHHHHHHHHHhccc---cCcccEEEEcHHHHHHHHHhccccccccccc----ccc--ceeccccceeccee Confidence 111111 146677777776543 2367899999999999964211111100000 000 00112111222222 Q ss_pred eeeeccccccccccCcccEEEEEEcCcceEEEeeCchhhhccc-cccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 280 QLDAAELAANGVSNSNKPRYMVYDKSDRNLAMANPIPFRMLAP-QMASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 280 ~L~~~~~~~~g~g~~g~d~~v~y~~~~~~~~~~vp~~~~~~~~-~~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) .+...... ....+++ .+.-+.+....+.+.-.- ......-.+-... 
..|+-+.+|..++.+-.+ T Consensus 204 Vi~s~~~p--------~~t~~l~--~~gAi~~~~~~~~~vE~dRd~~~~~d~i~~~~-~y~~~~~~~~~vv~~t~~ 268 (276) T protein:vir:10 204 IVRSKKLD--------EGEAILA--KRGAVKLITKRDFFLETDRDPSTKTTALYSDK-HYVAYLYDESKAVKVTKG 268 (276) T ss_pred EEEcCCCC--------cceEEEE--eccceeeeecCCceeecccchhhcccEEEEee-EEEEEEEcCcceEEEecC Confidence 22222110 0112222 223333333333322111 0111222333333 347999999999999888 No 124 >protein:vir:80213 Length: 334 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522884;genbank:gi:158345177;genbank:GeneID:5687476 Probab=96.06 E-value=0.0011 Score=36.90 Aligned_cols=300 Identities=9% Similarity=0.036 Sum_probs=125.0 Q ss_pred hhhccCCceeccc----hhhH-HHHHHHHHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeeccccceeEec-C Q lcl|NC_021342. 37 LDAIGNPNIMLDA----DGGI-AFYISQLAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYDGVTMGKFIG-A 110 (354) Q Consensus 37 m~a~~~~~~~~da----~~~~-~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~G~a~~~~-~ 110 (354) |--+.+...+..+ .+.. .|+..-.-+|+... .+.-..+.++.+.+-. +..++.+.. .|..++.. . T Consensus 1 m~~~~~~~~t~~~~~~~~~~~~l~le~~~geV~~af----~~~s~~~~~~~~r~i~--~G~s~~~~~---iG~~~~~~~~ 71 (334) T protein:vir:80 1 MTYPAANTHTRPGWGGANSDVSLHIEEHLGLVDASF----MYSSKFASWMNVRSLR--GTNQLRVDR---VGASTIAGRK 71 (334) T ss_pred CCCCcCCCccccccccccchheehhhhhhhHHHHHH----HHhhhhhccceeeecc--ccceEEEee---ecceeeeeec Confidence 2222112222211 1112 34422223444443 2223333444433211 133444443 34444321 1 Q ss_pred CCcccceeeeccceeEEEEEEEEeeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeeh-------hhCce Q lcl|NC_021342. 111 NGQDLPRVAQSAQMHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDA-------SRGMY 183 (354) Q Consensus 111 ~~~dip~v~~~~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~-------~~gi~ 183 (354) .+..+..-...-++..+.|-.. .-++.-+.+++.++ ...++-..-.+.+..++++..|+.++--.. 
....+ T Consensus 72 ~g~~l~~~~~~~~~~~l~ID~~-l~~~~~VddiD~~q-~~~D~rse~~~~~G~aLA~~~D~~~~~~l~kaa~~~~~~~~~ 149 (334) T protein:vir:80 72 AGEELVVQKNVSDKLNLTVDTV-LYARHFFDKFDEWT-SNLDVRKETAREDGIALARQYDQACIIQLQKCGDFLAPAHLK 149 (334) T ss_pred CCCCCCCCCcccCceEEEEeee-eehhhhHhhHHHHh-cCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccc Confidence 1122221111122333333221 12233457777775 456777888888999999999997763311 11111 Q ss_pred eeeecCCcccc--cccccccccCHHHHHHHHHHHHHHHHHHhCCcc--cccEEEeCHHHHHHHhhcccCCCCCchHHHHH Q lcl|NC_021342. 184 GLFNNPNVTLS--SATKDYKTMNGQELFNMLNAPIFSVINLSRRFH--VPNTALMFPDLWNQANNQLMTGYTDRTVMQHF 259 (354) Q Consensus 184 GLlN~p~~~~~--~~~~~W~~~T~~ei~~di~~~~~~l~~~s~g~~--~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l 259 (354) .-+.+.+.... +..+.=...+++.+.+-+..+...|.++.---+ ....++|+|..|..|..-. .-.+ .+|. T Consensus 150 ~~~~~G~~~~~~~~g~~~~~~~~~~~l~~a~~~a~~~L~e~dvp~~~~~~R~~vv~P~~y~~Ll~~~--r~~n---~d~~ 224 (334) T protein:vir:80 150 PAFHDGILLPSTISGLAADAAADADVLVAAHRQGVEAMVFRDLGDQLMSEGVTLLDPVIFSFLLEHD--RLMN---VEFG 224 (334) T ss_pred ccccCCcceeecccccccchhhhHHHHHHHHHHHHHHHHhcCCCCCcCCceEEEeChHHHHHHhccc--cccc---ceec Confidence 11111111100 111111234588889989889888887522111 2368999999999987521 0000 0111 Q ss_pred HhhCcccccccccceeeeeeeeeecccccc---ccccCc---------ccEEEEEEcCcceEEEeeCchhhhcc-ccccC Q lcl|NC_021342. 260 MEANSYTLLTGNELDIQIRFQLDAAELAAN---GVSNSN---------KPRYMVYDKSDRNLAMANPIPFRMLA-PQMAS 326 (354) Q Consensus 260 ~~n~~~~~~~g~~l~I~~~~~L~~~~~~~~---g~g~~g---------~d~~v~y~~~~~~~~~~vp~~~~~~~-~~~~~ 326 (354) -..+......|.-..+-.++.+++..+-.. ....++ +.++.++- .++-+...-.++++.-. -+.+. 
T Consensus 225 ~s~~~~~~~~g~i~~v~G~~V~~Sn~~P~~~~t~~~~g~~~~~~agd~t~~~~~~~-~~~Al~t~~~~~~~~e~~~~~~~ 303 (334) T protein:vir:80 225 AKEGGNSFVGGRIAMLNGVRVVETPRFPQSAITANALGADFNVTDAEVRRKMITFI-PSMALISAQVHPVSAQFWEEKKD 303 (334) T ss_pred cccccccccceeEEEEeceEEEeecCCCCccccccccccccccccccccceEEEEE-eCceEEEEEEeecceeeeechhh Confidence 110000011222222222333332222110 000010 11222221 12222222223322111 01223 Q ss_pred ceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 327 LGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 327 l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) ..+.+.+.... |+-++||++++.+++. T Consensus 304 ~~d~i~~~~a~-G~g~lRPeaa~vv~~~ 330 (334) T protein:vir:80 304 FGHYLDTFQSY-NIGQRRPDAVAVHDIT 330 (334) T ss_pred HHHHHHHHHHc-CCceeccceEEEEEEe Confidence 34444444443 7999999999999999 No 125 >protein:vir:96262 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240311;genbank:gi:66395978;genbank:GeneID:5133339 Probab=96.05 E-value=0.0011 Score=36.86 Aligned_cols=265 Identities=7% Similarity=-0.007 Sum_probs=130.2 Q ss_pred cCCceeccchhhHHHHHHHHHHHHHHHHHhhhhcccchhhccccCCCC-CceeEEEEEeeccccceeEecCCCcccceee Q lcl|NC_021342. 41 GNPNIMLDADGGIAFYISQLAGIEATVYETPYGDITYRFDVPMAANIP-EYADTWMYRSYDGVTMGKFIGANGQDLPRVA 119 (354) Q Consensus 41 ~~~~~~~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~-~~~~~~~~~~~~~~G~a~~~~~~~~dip~v~ 119 (354) +..+.|.=+| +... +-+.+.+.+.....+....+..+.+... ....++..+.+...|.++.+.++ ++++... T Consensus 1 m~~~~T~l~d----~i~P--ev~~~~v~~~~~~~l~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g-~~i~~~~ 73 (274) T protein:vir:96 1 MAQGMTKLTN----QIVP--EVLAPMMQAELEKKLRFASFAEIDNTLVGQPGDTLTFPAFIYSGDAKVVAEG-EKIPTDI 73 (274) T ss_pred CCcceeehhh----eech--HHHHHHHHHHHHhhhhccccceecccccCCCCCEEEeeeecCCCccccccCC-Cccchhh Confidence 2233333333 2222 1222233344445555555544443221 12367788888888999887664 5788888 Q ss_pred eccceeEEEEEEEEeeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehhhCceeeeecCCccccccccc Q lcl|NC_021342. 
120 QSAQMHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDASRGMYGLFNNPNVTLSSATKD 199 (354) Q Consensus 120 ~~~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~gi~GLlN~p~~~~~~~~~~ 199 (354) ...+.....+.+.+.+|.+ .|++..+. +.++-......+...+++..|+.++ ..++ + . ... T Consensus 74 lt~~~~~~~i~~~~~a~~i--~D~~~~~~-~~d~~~~~~~~~~~~~a~~vd~~i~---~~l~--~----a-------~~~ 134 (274) T protein:vir:96 74 LETKKREAKIRKIAKGTSI--SDEALLSG-YGDPQGEQVRQHGLAHANKVDDDVL---EALK--S----A-------KLT 134 (274) T ss_pred cccceeEEEeeeeecceee--hHHHHhhc-cchHHHHHHHHHHHHHHHHHHHHHH---HHHh--c----c-------ccc Confidence 8888888888776655555 67766654 4455566777788888888877654 1111 0 0 001 Q ss_pred ccccCHHHHHHHHHHHHHHHHHHhCCcccccEEEeCHHHHHHHhhcccCCCCCchH-HHHHHhhCcccccccccceeeee Q lcl|NC_021342. 200 YKTMNGQELFNMLNAPIFSVINLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRTV-MQHFMEANSYTLLTGNELDIQIR 278 (354) Q Consensus 200 W~~~T~~ei~~di~~~~~~l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tv-l~~l~~n~~~~~~~g~~l~I~~~ 278 (354) ++.++. -++.|.+++.+|-.. ...+..|+|+|..|..|.+...-....-|- .+=+..|+.+-...| . T Consensus 135 ~~~~~~--~~d~i~~A~~~lgd~---~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G-------~ 202 (274) T protein:vir:96 135 VEADIT--KLTGLQTAIDKFNDE---DLEPMVLFISPLDAGKLRGDATTNFTRATELGDDVIVKGAFGEALG-------A 202 (274) T ss_pred cccccc--CHHHHHHHHHHhccc---cccccEEEeCHHHHHHHHhhccccccccccccccceeccccceecC-------e Confidence 111111 156677777776543 246789999999999997632100000000 000111222211122 2 Q ss_pred eeeeeccccccccccCcccEEEEEEcCcceEEEeeCchhhhccc-cccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 279 FQLDAAELAANGVSNSNKPRYMVYDKSDRNLAMANPIPFRMLAP-QMASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 279 ~~L~~~~~~~~g~g~~g~d~~v~y~~~~~~~~~~vp~~~~~~~~-~~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) +.+....+. . ...+++. +.-+......+++.-.- .+....-.+-.. ...|+-+.+|..++.+--. 
T Consensus 203 ~Vi~s~~~~-~-------~t~~l~~--~gA~~~~~~~~~~vE~~Rd~~~~~d~i~~~-~~y~~~~~~~~~~v~~tk~ 268 (274) T protein:vir:96 203 VIVRSNKLE-A-------GTAILAK--KGAVKLITKRDFFLETDRDPSTKTTALYSD-KHYVAYLYDESKAVKITKG 268 (274) T ss_pred EEEEeCCCC-C-------ceEEEEe--ccceeeeecCCcccccccccccccCEEEEe-EEEEEEEEcCCcEEEEEcC Confidence 222222111 0 0112221 22222222223221111 011122222223 3458999999999998877 No 126 >protein:vir:95898 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240385;genbank:gi:66396054;genbank:GeneID:5133409 Probab=96.05 E-value=0.0011 Score=36.86 Aligned_cols=265 Identities=7% Similarity=-0.007 Sum_probs=130.2 Q ss_pred cCCceeccchhhHHHHHHHHHHHHHHHHHhhhhcccchhhccccCCCC-CceeEEEEEeeccccceeEecCCCcccceee Q lcl|NC_021342. 41 GNPNIMLDADGGIAFYISQLAGIEATVYETPYGDITYRFDVPMAANIP-EYADTWMYRSYDGVTMGKFIGANGQDLPRVA 119 (354) Q Consensus 41 ~~~~~~~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~-~~~~~~~~~~~~~~G~a~~~~~~~~dip~v~ 119 (354) +..+.|.=+| +... +-+.+.+.+.....+....+..+.+... ....++..+.+...|.++.+.++ ++++... T Consensus 1 m~~~~T~l~d----~i~P--ev~~~~v~~~~~~~l~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g-~~i~~~~ 73 (274) T protein:vir:95 1 MAQGMTKLTN----QIVP--EVLAPMMQAELEKKLRFASFAEIDNTLVGQPGDTLTFPAFIYSGDAKVVAEG-EKIPTDI 73 (274) T ss_pred CCcceeehhh----eech--HHHHHHHHHHHHhhhhccccceecccccCCCCCEEEeeeecCCCccccccCC-Cccchhh Confidence 2233333333 2222 1222233344445555555544443221 12367788888888999887664 5788888 Q ss_pred eccceeEEEEEEEEeeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehhhCceeeeecCCccccccccc Q lcl|NC_021342. 120 QSAQMHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDASRGMYGLFNNPNVTLSSATKD 199 (354) Q Consensus 120 ~~~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~gi~GLlN~p~~~~~~~~~~ 199 (354) ...+.....+.+.+.+|.+ .|++..+. +.++-......+...+++..|+.++ ..++ + . ... 
T Consensus 74 lt~~~~~~~i~~~~~a~~i--~D~~~~~~-~~d~~~~~~~~~~~~~a~~vd~~i~---~~l~--~----a-------~~~ 134 (274) T protein:vir:95 74 LETKKREAKIRKIAKGTSI--SDEALLSG-YGDPQGEQVRQHGLAHANKVDDDVL---EALK--S----A-------KLT 134 (274) T ss_pred cccceeEEEeeeeecceee--hHHHHhhc-cchHHHHHHHHHHHHHHHHHHHHHH---HHHh--c----c-------ccc Confidence 8888888888776655555 67766654 4455566777788888888877654 1111 0 0 001 Q ss_pred ccccCHHHHHHHHHHHHHHHHHHhCCcccccEEEeCHHHHHHHhhcccCCCCCchH-HHHHHhhCcccccccccceeeee Q lcl|NC_021342. 200 YKTMNGQELFNMLNAPIFSVINLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRTV-MQHFMEANSYTLLTGNELDIQIR 278 (354) Q Consensus 200 W~~~T~~ei~~di~~~~~~l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tv-l~~l~~n~~~~~~~g~~l~I~~~ 278 (354) ++.++. -++.|.+++.+|-.. ...+..|+|+|..|..|.+...-....-|- .+=+..|+.+-...| . T Consensus 135 ~~~~~~--~~d~i~~A~~~lgd~---~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G-------~ 202 (274) T protein:vir:95 135 VEADIT--KLTGLQTAIDKFNDE---DLEPMVLFISPLDAGKLRGDATTNFTRATELGDDVIVKGAFGEALG-------A 202 (274) T ss_pred cccccc--CHHHHHHHHHHhccc---cccccEEEeCHHHHHHHHhhccccccccccccccceeccccceecC-------e Confidence 111111 156677777776543 246789999999999997632100000000 000111222211122 2 Q ss_pred eeeeeccccccccccCcccEEEEEEcCcceEEEeeCchhhhccc-cccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 279 FQLDAAELAANGVSNSNKPRYMVYDKSDRNLAMANPIPFRMLAP-QMASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 279 ~~L~~~~~~~~g~g~~g~d~~v~y~~~~~~~~~~vp~~~~~~~~-~~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) +.+....+. . ...+++. +.-+......+++.-.- .+....-.+-.. ...|+-+.+|..++.+--. 
T Consensus 203 ~Vi~s~~~~-~-------~t~~l~~--~gA~~~~~~~~~~vE~~Rd~~~~~d~i~~~-~~y~~~~~~~~~~v~~tk~ 268 (274) T protein:vir:95 203 VIVRSNKLE-A-------GTAILAK--KGAVKLITKRDFFLETDRDPSTKTTALYSD-KHYVAYLYDESKAVKITKG 268 (274) T ss_pred EEEEeCCCC-C-------ceEEEEe--ccceeeeecCCcccccccccccccCEEEEe-EEEEEEEEcCCcEEEEEcC Confidence 222222111 0 0112221 22222222223221111 011122222223 3458999999999998877 No 127 >protein:vir:9704 Length: 394 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795466;genbank:gi:28876225;genbank:GeneID:1257769 Probab=95.92 E-value=0.0013 Score=36.48 Aligned_cols=299 Identities=8% Similarity=0.009 Sum_probs=127.3 Q ss_pred CcccchhHH------HHh----------------------hhhhhhccccc------ccccchhhhh---hhhhhhccCC Q lcl|NC_021342. 1 MAIKTIDAQ------TIQ----------------------GNQWLVHKGYV------SRNGDQWVIN---NTALDAIGNP 43 (354) Q Consensus 1 ~~~~~~~~~------~~~----------------------~~~~~~~~~~~------~~~~~~~~~~---~~am~a~~~~ 43 (354) -.|+.++.+ .++ .+..+..+... .....+.... ....... . T Consensus 51 ~ei~~l~~~~~~~e~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~ 128 (394) T protein:vir:97 51 ANLVEAENDLKLYESSVEVGGAENIGGKEVTQEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQ--K 128 (394) T ss_pred HHHHHHHHHHHHHHHHhhhhccccccccccchhhHHHHHHHHHHHHHHHHHhhhhhhhhhHHHHHHHHHhhhhhhhh--c Confidence 001100000 000 00000000000 0000000000 0000000 0 Q ss_pred ceeccchhhHHHHHHHHHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeec-cccceeEecCCCcccce-eeec Q lcl|NC_021342. 44 NIMLDADGGIAFYISQLAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYD-GVTMGKFIGANGQDLPR-VAQS 121 (354) Q Consensus 44 ~~~~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~-~~G~a~~~~~~~~dip~-v~~~ 121 (354) ...+. +.+.+++. +.+.+.|++........+.++.+.. .+.+ +..+.... ..+.+.+++.++. .|- .+.. 
T Consensus 129 ~~~t~--~~gg~liP--~~~~~~ii~~~~~~~~l~~~~~~~~-~~~~--~~~~~~~~~~~~~~~~v~E~~~-~~~~~~~~ 200 (394) T protein:vir:97 129 DGIKK--ENAKPVSS--EEILYTPAREVKTVVDLKPFTTVYQ-AKKA--SGKYPVLQRATTKMVTVAELEK-NPALAKPD 200 (394) T ss_pred ccccc--ccccccCh--HHHHHHHHHHhhhhhhhhhhceeee-ccCc--ceEEEEEecCCCccceeccccc-cccccccc Confidence 00111 12334443 4566778887777777777665432 2222 22333333 3345667766543 453 3345 Q ss_pred cceeEEEEEEEEeeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehhhCceeeeecCCccccccccccc Q lcl|NC_021342. 122 AQMHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDASRGMYGLFNNPNVTLSSATKDYK 201 (354) Q Consensus 122 ~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~gi~GLlN~p~~~~~~~~~~W~ 201 (354) .+........++.-+.+|..=|+.+ ..++..--....++++...+|+.+++|..... +.. T Consensus 201 ~~~v~l~~~k~~~~i~is~ell~ds---~~~~~~~i~~~la~~~~~~~~~~i~~g~~~~~---------------~~~-- 260 (394) T protein:vir:97 201 FKDVAWNIDTYRGAIPLSQESIDDA---DVDLVGIVSESISQIKVNTTNDAIAKVLKSFT---------------TKT-- 260 (394) T ss_pred ceeEEeehhheeeehhhHHHHHhhh---hHHHHHHHHHHHHHHHHHHHHHHHhhcccccc---------------ccc-- Confidence 6677777777777777765433322 34577777777888888999988887743210 111 Q ss_pred ccCHHHHHHHHHHHHHHHHHHhCCcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeeeee Q lcl|NC_021342. 202 TMNGQELFNMLNAPIFSVINLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQL 281 (354) Q Consensus 202 ~~T~~ei~~di~~~~~~l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L 281 (354) ..+ +++|.++++...... ..-.++|+|..|..|..- .+..|.-++. 
-++ ..|.+-.+...|.+ T Consensus 261 ~~~----~~~~~~~~~~~~~~~----~~a~~v~n~~~~~~l~~l--kd~~G~~i~~----~~~---~~~~~~~l~G~pv~ 323 (394) T protein:vir:97 261 VKN----LDEIKALLNGGFDPA----YNVSLIVSQSFYQTLDTL--KDGNGRYLLQ----DDI---TAVSGKVLLGKPVF 323 (394) T ss_pred ccc----HHHHHHHHHhhhhhh----hCCEEEEcHHHHHHHHHh--hccCCCeeee----cCc---CCCCCceeccceeE Confidence 112 345555554433211 124699999999998653 2433432211 011 01112122222222 Q ss_pred eeccccccccccCcccEEEEEEcCcceEEEeeCchhhhccccccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 282 DAAELAANGVSNSNKPRYMVYDKSDRNLAMANPIPFRMLAPQMASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 282 ~~~~~~~~g~g~~g~d~~v~y~~~~~~~~~~vp~~~~~~~~~~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) ..... .. +.+.+++-+.+ +.+.+..-..++............+.++.|++ +.+.+|.+|+.+++. T Consensus 324 ~~~~~-~~-----~~~~~~~gd~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~d-~~v~~~~a~~~~~~~ 388 (394) T protein:vir:97 324 VLSDE-VL-----GANKAFIGDFK-RGVLFADRKDLGLRWADNEIYGQYLQAVLRFG-VSKVDDKAGYYVTFT 388 (394) T ss_pred Eeccc-cc-----CCccEEEeecc-ccEEEEEecceEEEEecccccceeEEEEEEEc-cEEecccceEEEEec Confidence 11110 01 11111111211 11111111112211111111122356778885 577799999999999 No 128 >protein:vir:962 Length: 397 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076616;genbank:gi:13095724;genbank:GeneID:920264 Probab=95.50 E-value=0.002 Score=35.44 Aligned_cols=304 Identities=7% Similarity=-0.015 Sum_probs=125.5 Q ss_pred CcccchhHHH--Hhh-hhhhhccc-------------ccccccchhhhhhhhhh----hcc--CCceeccchhhHHHHHH Q lcl|NC_021342. 1 MAIKTIDAQT--IQG-NQWLVHKG-------------YVSRNGDQWVINNTALD----AIG--NPNIMLDADGGIAFYIS 58 (354) Q Consensus 1 ~~~~~~~~~~--~~~-~~~~~~~~-------------~~~~~~~~~~~~~~am~----a~~--~~~~~~da~~~~~fl~~ 58 (354) -.|+.+..+. ++. ........ ..............++. ... ....... ..+.+++. 
T Consensus 68 ~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~vp 145 (397) T protein:vir:96 68 EKIAELQKEKQDLEDELAKAADPTDQKPKDGEKRKMKKFKVTEEELAEKRSAINAFVKSKGAEKRDGFTS--VEGGALIP 145 (397) T ss_pred HHHHHHHHHHHHHHHHHHhhhhhhhhhhHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHhhhhhhhhcccc--cccccchh Confidence 1111111000 000 00000000 00000000000000000 000 0000111 11222222 Q ss_pred HHHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeec-cccceeEecCCCcccc-eeeeccceeEEEEEEEEeeE Q lcl|NC_021342. 59 QLAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYD-GVTMGKFIGANGQDLP-RVAQSAQMHTVPLGYAGNEC 136 (354) Q Consensus 59 ~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~-~~G~a~~~~~~~~dip-~v~~~~~~~~~pv~~~~~~~ 136 (354) +.+.+.+++ .......+..+.+. +-...+..+.... ..+.+.+++..+. .| ..+...+.....+..++.-. T Consensus 146 --~~~~~~i~~-~~~~~~l~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~E~~~-~~~~~~~~~~~i~~~~~~~~~~~ 218 (397) T protein:vir:96 146 --QELLQPQLE-PKDIVDLSKYVRSV---PVNSASGKFPVISKSGSKMATVQQLEK-NPQLANPKMVEIDYSVATRRGYI 218 (397) T ss_pred --HHHHHHHHH-hhhhhhHHHhhhhc---cccccceeEEEEeccCCcccccccccc-ccccccccccceeecHhHhhcch Confidence 344555555 23333334444332 1112223333332 2344455555444 34 34455666666677766666 Q ss_pred eecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehhhCceeeeecCCcccccccccccccCHHHHHHHHHHHH Q lcl|NC_021342. 137 HYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDASRGMYGLFNNPNVTLSSATKDYKTMNGQELFNMLNAPI 216 (354) Q Consensus 137 ~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~gi~GLlN~p~~~~~~~~~~W~~~T~~ei~~di~~~~ 216 (354) .+|.+=|+.+ ..++..--....++++...+|.-++.|.....-.|. 
.| +++|.+++ T Consensus 219 ~~s~ell~ds---~~~l~~~i~~~l~~~~~~~~~~~i~~g~g~~~~~~~-----------------~~----~d~~~~~~ 274 (397) T protein:vir:96 219 PISQEMIDDA---SYDVTGLIADEIQDQSLNTKNADIAAVLKTATAKSV-----------------VG----VDGLKDLI 274 (397) T ss_pred hhHHHHHhhh---HHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc-----------------cc----hHHHHHHH Confidence 6665544433 335666677778888889999988887553221110 12 34455555 Q ss_pred HHHHHHhCCcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeeeeeeeccccccccccCcc Q lcl|NC_021342. 217 FSVINLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQLDAAELAANGVSNSNK 296 (354) Q Consensus 217 ~~l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~~g~g~~g~ 296 (354) ...... +. .-.++|+|+.|..|..- .+..|.-++. -++ ..+.+-.+...|......... +. ..++ T Consensus 275 ~~~~~~--~~--~a~~v~n~~~~~~l~~l--kd~~G~~~~~----~~~---~~~~~~~l~G~pv~~~~~~~~-~~-~~~~ 339 (397) T protein:vir:96 275 NKEIKK--VY--DVKLFISASMYSELDKL--KDKNGRYLLQ----DSI---TAASGKQLLGKEVVVLDDDVI-GK-SVGN 339 (397) T ss_pred HHhhhh--hc--CcEEEEcHHHHHHHHHh--hccCCCeEec----cCc---cCCCcccccccceEEeccccc-CC-CCCc Confidence 443322 11 24799999999999653 3444432221 000 112222333333322221111 11 1223 Q ss_pred cEEEEEEcCcceEEEeeCchhhhccccccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 297 PRYMVYDKSDRNLAMANPIPFRMLAPQMASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 297 d~~v~y~~~~~~~~~~vp~~~~~~~~~~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) ..+++-+.+ +.+.+..-+.++............+..+.|++ ..+++|.+++++-+. 
T Consensus 340 ~~~~~gd~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~d-~~~~~~~a~~~~~~~ 395 (397) T protein:vir:96 340 VVGFIGDAK-AFASFFDRKQVSVSWVDNNIYGQLLAGIIRYD-VKATDKKAGFYVTFT 395 (397) T ss_pred eEEEEeehh-cceEeEeecceEEEEecccccceeEEEEEEEc-cEEecccceEEEEee Confidence 333322322 22223333334433322222234456678886 577899999999877 No 129 >protein:vir:739 Length: 231 # NCBI annotation: major structural protein 4 # Family: family:all:522 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108716;genbank:gi:13487838;genbank:GeneID:920884 Probab=95.40 E-value=0.0021 Score=35.23 Aligned_cols=228 Identities=9% Similarity=0.009 Sum_probs=118.4 Q ss_pred cCCCCCceeEEEEEeeccccceeEecCCCcccceeeeccceeEEEEEEEEeeEeecHHHHHHHHHhCCCcchHHHHHHHH Q lcl|NC_021342. 84 AANIPEYADTWMYRSYDGVTMGKFIGANGQDLPRVAQSAQMHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQARLAFR 163 (354) Q Consensus 84 ~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~v~~~~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~ 163 (354) +.....| .++.++.+ .|.++.++.+ +.+|......+.....+.+.+.+|+++ |++.....|-|+ .+.....+. T Consensus 1 ~~~~~~G-dtit~P~~--iGda~~v~eG-~~i~~~~l~~t~~~atIk~~gk~~~it--D~a~l~~~gDp~-~ea~~Q~~~ 73 (231) T protein:vir:73 1 ENGINLA-NLCEYPND--IGDAADVAEG-GEISLDKIGTTTKSVTIKKAAKGTEIT--DEAALSGYGDPI-GESNKQLGL 73 (231) T ss_pred CccccCC-ceEEeccc--ccchhhhcCC-CcCChhhccccceeeeEeeeccceeee--HHHHhhccCchH-HHHHHHHHH Confidence 3344433 34666655 7888888776 448888888889999999988888884 555555566565 555566666 Q ss_pred HHHHHhhheeeeeehhhCceeeeecCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCcccccEEEeCHHHHHHHh Q lcl|NC_021342. 164 GAEEHSQSVAYFGDASRGMYGLFNNPNVTLSSATKDYKTMNGQELFNMLNAPIFSVINLSRRFHVPNTALMFPDLWNQAN 243 (354) Q Consensus 164 ~~a~~~n~~~f~G~~~~gi~GLlN~p~~~~~~~~~~W~~~T~~ei~~di~~~~~~l~~~s~g~~~p~~L~l~p~~~~~L~ 243 (354) .++.+.|.=++ +.+. ++.|+.+++. -++.|++++..+-.. 
...|+.++++|..+..|- T Consensus 74 ~iA~kvD~di~---~~~~---------------~a~l~~~~~~-t~d~i~~A~~~fgde---~~~~~vivv~p~~~~~Lr 131 (231) T protein:vir:73 74 SLANKVDDDLL---KAAK---------------TTSQTVSTKA-NVDGVQAALDIFNDE---DAQAYVLIVNPKDAAKIR 131 (231) T ss_pred HHHHhhhHHHH---Hhhc---------------cccccccccc-cHHHHHHHHHHhccc---cccceEEEEcchHHHhhh Confidence 77666666333 1110 1112222221 267788888887543 357889999999999883 Q ss_pred hcccCCCCCchHHHHHHhhCcccccccccceeeeeeeeeeccccccccccCcccEEEEEEcCcceEEEeeCchhhhccc- Q lcl|NC_021342. 244 NQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQLDAAELAANGVSNSNKPRYMVYDKSDRNLAMANPIPFRMLAP- 322 (354) Q Consensus 244 ~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~~g~g~~g~d~~v~y~~~~~~~~~~vp~~~~~~~~- 322 (354) + ..+.... -.... +++ ..+|.-.++..++.+.+.... . ++....-|...+.=+.+..-.+++.-+- T Consensus 132 k--~~~~~~~--~~~~g-~~i--~~~G~iG~i~G~~Vi~S~~~~-~-----~~~~~~~~i~~~gAl~~~~k~~~~vEtdR 198 (231) T protein:vir:73 132 K--DANAKNI--GSEVG-ANA--LINGTYADVLGAQIVRSKKLA-E-----GSALMFKIVSNSPALKLVLKRGVQVETDR 198 (231) T ss_pred h--ccchhhh--hhhhc-cce--eeecccceEcceEEEEcCCCC-C-----CceeeeeEEeeccceeeeecccceeeccc Confidence 2 1111110 00000 111 123333333333333332221 1 1111111211122233322222221111 Q ss_pred cccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 323 QMASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 323 ~~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) ........+-.. 
...+|-+++|..++.+-++ T Consensus 199 d~~~k~~~i~~~-~~y~v~l~~~~~vv~~t~~ 229 (231) T protein:vir:73 199 DIVTKTTVITAD-EHYAAYLYDLTKVVNITFT 229 (231) T ss_pred cccccccEEEEe-EEEEEEEEcCccEEEEEee Confidence 011122233222 3357999999999999999 No 130 >protein:vir:94576 Length: 347 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919012;genbank:gi:119637776;genbank:GeneID:5179336 Probab=95.22 E-value=0.0025 Score=34.85 Aligned_cols=294 Identities=11% Similarity=0.033 Sum_probs=125.6 Q ss_pred ccccchhhhhhhhhhhccCCceecc-------chhhHHHHHHHHHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEE Q lcl|NC_021342. 24 SRNGDQWVINNTALDAIGNPNIMLD-------ADGGIAFYISQLAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMY 96 (354) Q Consensus 24 ~~~~~~~~~~~~am~a~~~~~~~~d-------a~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~ 96 (354) +.+ .++++.+.+. +|.--.|+..-.-+|+.+.- +.-..+.++.+.+ +- +..++.+ T Consensus 1 ma~------------~~~~~~~~t~~g~~~~~~d~~al~ie~~~geV~~~f~----~~s~~~~~~~~rt-i~-~G~sv~~ 62 (347) T protein:vir:94 1 MAN------------MNGGQQMGKDQGKGMSAGDKLALFLKVFGGEVLTAFT----RTSVTMNKHLVRS-IQ-SGKSAQF 62 (347) T ss_pred CCc------------cccccccccccccCCcccchHHHHHHHHhHHHHHHHH----HHHhhhhhhhhee-cc-ccceEEe Confidence 100 1111212111 11111355333334444443 2233444444432 11 2344444 Q ss_pred EeeccccceeEec-CCCccc--ceeeeccceeEEEEEEEEeeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhhee Q lcl|NC_021342. 97 RSYDGVTMGKFIG-ANGQDL--PRVAQSAQMHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVA 173 (354) Q Consensus 97 ~~~~~~G~a~~~~-~~~~di--p~v~~~~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~ 173 (354) . ..|..+... ..++++ |..+....+..+.|-.. 
.-+..-+.+++.++ +..++-.+-...+..++++..|+.+ T Consensus 63 ~---~iG~~~~~~~~~G~~l~~~~~~~~~~e~~ltID~~-~y~~~~VddiD~~q-~~~D~rs~~~~~~g~ALA~~~D~~i 137 (347) T protein:vir:94 63 P---VLGRTKAAYLQPGENLDDKRKDMKHTEKTINIDGL-LTADVLIYDIEDAM-NHYDVRSEYTAQLGESLAMAADGAV 137 (347) T ss_pred e---eccceeEeeeecCcCCCCCcCCccccceEEEEcch-hhhhhhhhhHHHHh-cCcchHHHHHHHHHHHHHHHHHHHH Confidence 4 345544321 112222 22233333333333221 12334457888875 5667778888889999999999877 Q ss_pred ee----e-----ehhhCceeeeecCCccccccccccc--ccCHHHHHHHHHHHHHHHHHHhCCcc-cccEEEeCHHHHHH Q lcl|NC_021342. 174 YF----G-----DASRGMYGLFNNPNVTLSSATKDYK--TMNGQELFNMLNAPIFSVINLSRRFH-VPNTALMFPDLWNQ 241 (354) Q Consensus 174 f~----G-----~~~~gi~GLlN~p~~~~~~~~~~W~--~~T~~ei~~di~~~~~~l~~~s~g~~-~p~~L~l~p~~~~~ 241 (354) +- + -......|..-.-.+......+.+. .++++.+++.|.++..+|.+. .+. .+..++++|..|.. T Consensus 138 ~~~l~~~a~~~~~~~~~~~g~~~~~~v~i~~~~~~~~~~~~~~~~~~d~i~~a~~~Lde~--dVP~~~R~~vv~P~~y~~ 215 (347) T protein:vir:94 138 LAEMAKLCNLPTANNENIAGLGKAHVLEVGDQATLQGDQVKLGQAIIAQLTLARAKLTGN--YVPSSDRVFYTTPDNYSA 215 (347) T ss_pred HHHHHHhhccccccccccccCCcceeEeeeccccccccccccHHHHHHHHHHHHHHhhhc--CCCCCCCEEEeChHHHHH Confidence 52 1 1111122221111111111111111 346888999999999888765 332 35789999999998 Q ss_pred HhhcccCCCCCc-hHHHHHHhhCcccccccccceeeeeeeeeeccccc-----cccc----------------------c Q lcl|NC_021342. 242 ANNQLMTGYTDR-TVMQHFMEANSYTLLTGNELDIQIRFQLDAAELAA-----NGVS----------------------N 293 (354) Q Consensus 242 L~~~~~~~~~~~-Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~-----~g~g----------------------~ 293 (354) |.+.......+. ++.. +. +|....+--.+.+++..+-. 
...+ + T Consensus 216 LLk~~~~~~~~~~~~~~-~~--------~G~V~~v~G~~V~~Sn~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~d 286 (347) T protein:vir:94 216 ILAALMPNAANYQALID-PS--------TGSIRNVMGFEVIEVPHLTAGGAGDNRAEEGVAPTNQKHAFPDTASGDTRVA 286 (347) T ss_pred HHHhhcccccccccccc-cc--------cceeEEeeceEEEEcCccccccCccccccccccccccccccccccccccccc Confidence 875332222221 2111 11 12222222222222222211 0001 0 Q ss_pred CcccEEEEEEcCcceEEEeeCchhhhc-cccccCceeEEeeeeeeeeEEEECCceeEe--eecC Q lcl|NC_021342. 294 SNKPRYMVYDKSDRNLAMANPIPFRML-APQMASLGITVPAEYKISGTEFRYPLCAAY--VDMA 354 (354) Q Consensus 294 ~g~d~~v~y~~~~~~~~~~vp~~~~~~-~~~~~~l~~~~~~~~~~gGv~i~~P~ai~y--~D~~ 354 (354) -++...+++.++ -+...-.++++.- .-+.+...+.+.+.... |+-++||++.+- ..=| T Consensus 287 ~~~~~~l~~~~~--A~~tv~~~~~~~e~~~~~~~~~~~i~~~~a~-G~g~~rPe~a~~i~~~~a 347 (347) T protein:vir:94 287 LDNVVGLFNHRS--AVGTVKLKDMALERARRANFQADQIIAKYAM-GHGGLRPEACGALVFKKA 347 (347) T ss_pred ccceEEEEechh--hhhhhhhcccceeeeechhhhhhhhhhhhhh-cCcccccceeEEEEecCC Confidence 011112222211 1111111121110 01222334555555555 689999998874 4444 No 131 >protein:vir:8843 Length: 317 # NCBI annotation: major head protein # Family: family:all:3919 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775251;genbank:gi:27476049;genbank:GeneID:2700597 Probab=95.06 E-value=0.0029 Score=34.53 Aligned_cols=282 Identities=11% Similarity=0.025 Sum_probs=135.1 Q ss_pred ccCCcee---ccchhhHHHHHHHHHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeeccccceeEecCCCcccc Q lcl|NC_021342. 40 IGNPNIM---LDADGGIAFYISQLAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYDGVTMGKFIGANGQDLP 116 (354) Q Consensus 40 ~~~~~~~---~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip 116 (354) |..|..+ .++. .+=+-+...|+..--.+.-.-.++. 
+.......+.+...+....++.....+.|.| T Consensus 1 ma~~~~~~~t~~~~-------g~~~dl~~~I~~isp~dTPf~S~i~---~~~a~~~~~~W~~d~l~~~~~~~~~EG~da~ 70 (317) T protein:vir:88 1 MATPTNAVSTVEIN-------GKREDLIDIIYNIAPYDTPFMSAIG---KGVATAITHEWQTDELRQPGKNTRVEGEDAT 70 (317) T ss_pred CCccccceEeeeee-------eeeechhhhheecCCccCcceeeec---CceecccEEEEEeeecCCccccccccCcccc Confidence 2223322 2221 1112222333332222222222332 2223333344443333332222222223333 Q ss_pred eeeeccc---eeEEEEEEEEeeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehh---------hCcee Q lcl|NC_021342. 117 RVAQSAQ---MHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDAS---------RGMYG 184 (354) Q Consensus 117 ~v~~~~~---~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~---------~gi~G 184 (354) ....... .-..+|++-...++.+.+-...+. .-+........+...+.+.+++..++|.+. ..+-| T Consensus 71 ~~~~~~r~~~~N~tQIf~k~v~VSgTa~av~~~G--~~~ela~q~~kk~~EikrdmE~~li~g~~a~~~~~~t~~r~~~G 148 (317) T protein:vir:88 71 IKAGSFTTMLNNYCQISDETLQVTGTADRVKKAG--RKNELAYQLAKKSKELKLDMEYALVGAPQAKVQRNTTTPGQMAN 148 (317) T ss_pred cccccCCEEeccEEEEEEeEEEEeehhhhhhhcC--ccchhHHHHHHHHHHHHHHHHHHHhcCeeeccCCCCccchhhhh Confidence 2222211 123467777777777666654442 245566666777888888899999998642 23456 Q ss_pred eeec---------CCc-ccccccccccccCHHHH-HHHHHHHHHHHHHHhCCcccccEEEeCHHHHHHHhhcccCCCCCc Q lcl|NC_021342. 185 LFNN---------PNV-TLSSATKDYKTMNGQEL-FNMLNAPIFSVINLSRRFHVPNTALMFPDLWNQANNQLMTGYTDR 253 (354) Q Consensus 185 LlN~---------p~~-~~~~~~~~W~~~T~~ei-~~di~~~~~~l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~ 253 (354) |++. +|. ++...+..|...|+..+ -++|++++.++|.. | -.|..+.++|..-..|+.-. .++... 
T Consensus 149 l~~~i~t~~~~~~~g~~~~~~~~~~~t~~t~~~lte~~l~~~l~~i~~~--G-g~~~~i~v~a~~k~~i~~~~-~~~~~~ 224 (317) T protein:vir:88 149 IFAYYKTNGSLGANGVAPVGDGSNTGTAGDLRLLTEDMLLNASESIWRN--G-GQANSIQTSSSIKKAISKNM-KGRATE 224 (317) T ss_pred HHHHhccCceeccCccccccCCCccccccccccccHHHHHHHHHHHHhc--C-CCCCEEEeChHHHHHHHHHh-cCCcee Confidence 6543 111 11222334544433322 25688999999985 3 26788999998877776421 111110 Q ss_pred hHHHHHHhhCcccc-----cccccceeeeeeeeeeccccccccccCcccEEEEEEcCcceEEEeeCchhhhccccccCce Q lcl|NC_021342. 254 TVMQHFMEANSYTL-----LTGNELDIQIRFQLDAAELAANGVSNSNKPRYMVYDKSDRNLAMANPIPFRMLAPQMASLG 328 (354) Q Consensus 254 Tvl~~l~~n~~~~~-----~~g~~l~I~~~~~L~~~~~~~~g~g~~g~d~~v~y~~~~~~~~~~vp~~~~~~~~~~~~l~ 328 (354) +...-.++-.... ..+-.+.|...+++ ..+.++++ |++.+++.+-.|+..-+.-.-+-. T Consensus 225 -i~~~~~~~~~g~~v~~~~tdfG~v~ii~~r~l-------------p~~~~~~~--D~~~~~l~~Lr~~~~e~laKtGd~ 288 (317) T protein:vir:88 225 -ITLDASDNRIAQTVDVYESDFGKYTIRANRWF-------------HENTLFVF--DPKMHSLCYLRPFFQHELAKTGDS 288 (317) T ss_pred -EEEcccCeEEEEEEEEEEeCCeEEEEEeCCCC-------------CCCeEEEE--cccccceeecccceeeccCCCccc Confidence 0000000000000 00111223333322 12445554 588898888777766555555544 Q ss_pred eEEeeeeeeeeEEEECCceeEe-eecC Q lcl|NC_021342. 329 ITVPAEYKISGTEFRYPLCAAY-VDMA 354 (354) Q Consensus 329 ~~~~~~~~~gGv~i~~P~ai~y-~D~~ 354 (354) -+.-.+.. .|++++-|.+.+. .|++ T Consensus 289 ~k~~i~~E-~tLe~~N~~a~a~i~~l~ 314 (317) T protein:vir:88 289 EKRQLLVE-YTFRVNNEKSGALIRDVV 314 (317) T ss_pred ceeEEEEE-EEEEEcCccceeEEEEec Confidence 44444444 4799999999988 4555 No 132 >protein:vir:1239 Length: 274 # NCBI annotation: similar to phage B1 major head protein # Family: family:all:522 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510938;genbank:gi:17426272;genbank:GeneID:927376 Probab=94.97 E-value=0.0031 Score=34.37 Aligned_cols=262 Identities=8% Similarity=0.019 Sum_probs=128.3 Q ss_pred cCCceeccchhhHHHHHHHHHHHHHHHHHhhhhcccchhhccccCCCC-CceeEEEEEeeccccceeEecCCCcccceee Q lcl|NC_021342. 
41 GNPNIMLDADGGIAFYISQLAGIEATVYETPYGDITYRFDVPMAANIP-EYADTWMYRSYDGVTMGKFIGANGQDLPRVA 119 (354) Q Consensus 41 ~~~~~~~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~-~~~~~~~~~~~~~~G~a~~~~~~~~dip~v~ 119 (354) +..+.|.-+|-- .. +.+.+.|.+.....+....+..+...+. -...++..+.+...|.++.+.++ ++++... T Consensus 1 ma~~~T~l~d~i----iP--ev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~ig~a~~~~~g-~~i~~~~ 73 (274) T protein:vir:12 1 MAQGLTKTSNQI----IP--EVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEG-EKIPTDI 73 (274) T ss_pred CCcceeehhhhh----ch--HHHHHHHHHHHHhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCC-Cccchhh Confidence 224445554422 22 2222233333444455555555444322 13556778888888999888664 5788888 Q ss_pred eccceeEEEEEEEEeeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehhhCceeeeecCCccccccccc Q lcl|NC_021342. 120 QSAQMHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDASRGMYGLFNNPNVTLSSATKD 199 (354) Q Consensus 120 ~~~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~gi~GLlN~p~~~~~~~~~~ 199 (354) ...+.....+.+.+.+|. +.|++..+..+ ++-......+...+++..|+-++.--.. ...+. .. T Consensus 74 lt~~~~~~~i~~~~~~~~--i~D~~~~~~~~-d~~~~~~~q~~~~~a~~vd~~~l~~~~~---------a~~~~---~~- 137 (274) T protein:vir:12 74 LETKKREAKIRKIAKGTS--ITDEALLSGYG-DPQGEQVRQHGLAHANKVDNDVLEALMG---------AKLTV---NA- 137 (274) T ss_pred cccceeeEEeeeecceee--ecHHHHHhccc-chHHHHHHHHHHHHHHHHHHHHHHHHhc---------ccccc---cc- Confidence 888888888887665544 46666666544 4445666777778888877755421110 00011 00 Q ss_pred ccccCHHHHHHHHHHHHHHHHHHhCCcccccEEEeCHHHHHHHhhcc---cCCCCCchHHHHHHhhCcccccccccceee Q lcl|NC_021342. 200 YKTMNGQELFNMLNAPIFSVINLSRRFHVPNTALMFPDLWNQANNQL---MTGYTDRTVMQHFMEANSYTLLTGNELDIQ 276 (354) Q Consensus 200 W~~~T~~ei~~di~~~~~~l~~~s~g~~~p~~L~l~p~~~~~L~~~~---~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~ 276 (354) +++. ++.|.+++.+|-.. ...+..|+|+|..+..|.+.. +...+... .. +..++.+-...|. 
T Consensus 138 --~a~~---~d~i~dA~~~lgd~---~~~~~~ivv~p~~~~~L~k~~~~~fv~~s~~g-~~-~~~~G~ig~~~G~----- 202 (274) T protein:vir:12 138 --DITK---LNGLQSAIDKFNDE---DLEPMVLFINPLDAGKLRGDASTNFTRATELG-DD-IIVKGAFGEALGA----- 202 (274) T ss_pred --cccC---HHHHHHHHHHhccc---cccccEEEeCHHHHHHHHhhhhhhcccccccc-cc-ceecccceeecCe----- Confidence 1111 56677777776443 236789999999999997632 11111100 01 1112221111221 Q ss_pred eeeeeeeccccccccccCcccEEEEEEcCcceEEEeeCchhhhc--cccccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 277 IRFQLDAAELAANGVSNSNKPRYMVYDKSDRNLAMANPIPFRML--APQMASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 277 ~~~~L~~~~~~~~g~g~~g~d~~v~y~~~~~~~~~~vp~~~~~~--~~~~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) +.+...... . + ..+++ .+.-+.+....+.+.- -.+.+ ..-.+-.. ...|+-+.+|..++.+--+ T Consensus 203 --~Vi~s~~~p-~-----~--t~~l~--~~gA~~~~~~~~~~vE~~Rd~~~-~~d~i~~~-~~y~~~~~~~~~vv~~t~~ 268 (274) T protein:vir:12 203 --IIVRSNKLE-A-----G--TAILA--KKGAVKLILKRDFFLEVARDAST-KTTALYSD-KHYVAYLYDESKAVKITKG 268 (274) T ss_pred --eEEEeCCCC-c-----c--eEEEE--eccceeeeecCCceeccccchhh-cccEEEee-eEEEEEEEcCCceEEEEcC Confidence 122221110 0 0 01111 1222222222222211 11111 12222222 2347888899998888777 No 133 >protein:vir:99675 Length: 324 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249589;genbank:gi:68299740;genbank:GeneID:3799990 Probab=94.71 E-value=0.0037 Score=33.92 Aligned_cols=254 Identities=10% Similarity=0.021 Sum_probs=113.9 Q ss_pred hccccCCCCCceeEEEEEeeccccceeEecC-CCcccce--eeeccceeEEEEEEEEeeEeecHHHHHHHHHhCCCcchH Q lcl|NC_021342. 80 DVPMAANIPEYADTWMYRSYDGVTMGKFIGA-NGQDLPR--VAQSAQMHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAE 156 (354) Q Consensus 80 ~v~v~~~~~~~~~~~~~~~~~~~G~a~~~~~-~~~dip~--v~~~~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~ 156 (354) ++ ..+-. ..++.+. ..|+.++..- .+++|.. -+..-.+..+.|=.. 
.-+..-+.+++.++ +..++-.+ T Consensus 1 ~v---r~i~~-g~s~~~~---~iG~~~~~~~~~G~~l~~~~~~~~~~e~~itID~~-l~~~~~VdDiD~~q-a~~Dlr~e 71 (324) T protein:vir:99 1 MT---RTITS-GKSAQFP---VMGRTKARYLKQGQSLDDGREDIKHTEKVITIDGL-LTTDVLIYDIEDAM-NHYDVRSE 71 (324) T ss_pred Ce---eeeec-CceEEEe---eeeeeEeccccCCCCcCCCcCCcCcccEEEEecch-hhhhhhhhhHHHHh-cCccchhH Confidence 22 22221 2233322 3355543211 1222211 112222333322111 12334456788776 55678888 Q ss_pred HHHHHHHHHHHHhhheeee---e----ehhhCceeeeecCCccc--ccccccccccCHHHHHHHHHHHHHHHHHHhCCcc Q lcl|NC_021342. 157 QARLAFRGAEEHSQSVAYF---G----DASRGMYGLFNNPNVTL--SSATKDYKTMNGQELFNMLNAPIFSVINLSRRFH 227 (354) Q Consensus 157 k~~aA~~~~a~~~n~~~f~---G----~~~~gi~GLlN~p~~~~--~~~~~~W~~~T~~ei~~di~~~~~~l~~~s~g~~ 227 (354) -.+.+..++++..|+.+|- + .+...-.+.....+... ...+..=...+++.+++.|.++..+|.++.--.. T Consensus 72 ~s~~~G~aLA~~~Dq~i~~~~a~~~~~~a~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~dai~~a~~~Lde~~VP~~ 151 (324) T protein:vir:99 72 YSTQMGEALAMAADVANYAEMAKLVNSRKETTNENIEGLGAASLVKITGKKEDPAKYGTQVIQALTYARAAFAKKYIPAG 151 (324) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhcccccccCCcccCCccceecccccccccccCHHHHHHHHHHHHHHHhhcCCCCC Confidence 8899999999999987762 1 11111111111111111 1111111235688999999999999987532222 Q ss_pred cccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeeeeeeeccccccc----------------- Q lcl|NC_021342. 228 VPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQLDAAELAANG----------------- 290 (354) Q Consensus 228 ~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~~g----------------- 290 (354) ...++|+|..|..|...+.... ..|.-.+ . ..+|....+--.+.+++..+-... 
T Consensus 152 -gR~~vv~P~~y~~Ll~~~~~~~-----~~~~~~~-~--~~~G~V~~i~Gf~V~~Sn~lp~~~~t~~~~a~~~~~~~~~~ 222 (324) T protein:vir:99 152 -DRTFYTDPDTYSAILAALMPNA-----ANYAALI-D--PETGNIRNVMGFEVVETPHMTAQMVTNPTDAFDGTGHIFPA 222 (324) T ss_pred -CCEEEeChHHHHHHhhcccccc-----ccccccc-c--eecceEEEEeceEEEecCCcccccccccccccccccccccc Confidence 2579999999998865432211 1221111 1 122333333333333333321100 Q ss_pred ----------cccCcccEEEEEEcCcc-eEEEeeCchhhhccccccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 291 ----------VSNSNKPRYMVYDKSDR-NLAMANPIPFRMLAPQMASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 291 ----------~g~~g~d~~v~y~~~~~-~~~~~vp~~~~~~~~~~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) .+..++-+.+++..+-- +++. ++...+....+.+ ..+.+...... |..+.||++++.+... T Consensus 223 ~~~~~~~~ky~~d~~~~~gl~~~~~a~~tv~~-~~~~~e~~~~~~~-~~d~i~~~~a~-G~~~lRPe~a~~v~l~ 294 (324) T protein:vir:99 223 TGDSTTTGKMTVGADNVVGLFVHRSAVATLKL-KDMALERARRPEY-QADQIIAKYAM-GHGGLRPEAVGAIIFE 294 (324) T ss_pred ccccccccccccccCceeEEEEehhheEEEee-ecceecceechhh-HHHhhhhhhhh-cCcccccceEEEEEEc Confidence 01111222233322211 1111 1111222222222 33444444444 7888899999888765 No 134 >protein:vir:1383 Length: 421 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612835;genbank:gi:20065969;genbank:GeneID:935826 Probab=94.69 E-value=0.0037 Score=33.90 Aligned_cols=307 Identities=8% Similarity=-0.008 Sum_probs=134.0 Q ss_pred Cccc-------chhHHHHhh-hhh--hhcccccccccch-hhhh-hhhhh-hcc-------CCceeccchhhHHHHHHHH Q lcl|NC_021342. 1 MAIK-------TIDAQTIQG-NQW--LVHKGYVSRNGDQ-WVIN-NTALD-AIG-------NPNIMLDADGGIAFYISQL 60 (354) Q Consensus 1 ~~~~-------~~~~~~~~~-~~~--~~~~~~~~~~~~~-~~~~-~~am~-a~~-------~~~~~~da~~~~~fl~~~L 60 (354) -.|+ .+..+.-+. +.- .-.+......... .... ..++. +.. .-+..+.++ +.+++. 
T Consensus 52 ~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ra~~t~~~--gg~liP-- 127 (421) T protein:vir:13 52 ARMEIIEEEIESVMTAIDEERKNTNFTGGRVIINGDSKEEKRSLQLSAMSKTIRGIQLSEEERDIMSSTN--NGAVIP-- 127 (421) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhhcccccccccccchhHHHHHHHHHHHHHhhhccchhHHHhhccccCC--cceecc-- Confidence 1111 111110000 000 0000000000000 0000 00100 000 001112222 233444 Q ss_pred HHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeeccccceeEecCCCcccceeeeccceeEEEEEEEEeeEeecH Q lcl|NC_021342. 61 AGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYDGVTMGKFIGANGQDLPRVAQSAQMHTVPLGYAGNECHYTL 140 (354) Q Consensus 61 ~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~v~~~~~~~~~pv~~~~~~~~~~~ 140 (354) +.+.+.|++........+.++.+. +.+.+.-.+.+........+.+.+.+ ..+|..+...+.....++.++.-+.+|. T Consensus 128 ~~~~~~Ii~~~~~~~~l~~l~~~~-~~~~~~~~~~~~~~~~~~~~~~~~E~-~~~~~s~~~f~~i~~~~~k~~~~v~iS~ 205 (421) T protein:vir:13 128 QEFVNEFEKLKEGYPSLKEHCHVI-PVNRNAGKMPVRAGASVDKLANLAKD-TELVKAMLKTQPMAYDIDDYGLLAPIDN 205 (421) T ss_pred hhhHHHHHHHHHhhhhhhhhceee-eccCCceEEEEeecCCccceeecccc-ccccccccceeEEEeeeeeeEeehhhhH Confidence 455667777777766777776543 23333333333333333333444443 4467767777777788888888777765 Q ss_pred HHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehhhCceeeeecCCcccccccccccccCHHHHHHHHHHHHHHHH Q lcl|NC_021342. 141 DEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDASRGMYGLFNNPNVTLSSATKDYKTMNGQELFNMLNAPIFSVI 220 (354) Q Consensus 141 ~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~gi~GLlN~p~~~~~~~~~~W~~~T~~ei~~di~~~~~~l~ 220 (354) .=|+.+ ..++..--....++++..++|.-+.. ...|+++.++. . + +++|.++++++. 
T Consensus 206 ell~ds---~~~l~~~i~~~la~~~~~~~~~~i~~-----~~~g~~~~~~~------~-----~----~d~i~~~~~~l~ 262 (421) T protein:vir:13 206 SLLEDS---EINFLEFVNEEFAEFAVNTENAEIVK-----QAKAVLAEETI------N-----D----YAGLVKTINSLV 262 (421) T ss_pred HHHhhh---HHHHHHHHHHHHHHHHHHHhhhhHhh-----hhhhccccccc------c-----c----hHHHHHHHHHhh Confidence 544333 33555556666677777777755432 23455433321 1 1 467777887775 Q ss_pred HHhCCcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeeeeeeeccccccccccCcccEEE Q lcl|NC_021342. 221 NLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQLDAAELAANGVSNSNKPRYM 300 (354) Q Consensus 221 ~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~~g~g~~g~d~~v 300 (354) .. ...+..++|+|..|..|.+. -+..|.=++.-+ ..+.+..+...|......... ++++.. .+ T Consensus 263 ~~---~~~~a~~v~n~~~~~~l~~l--kd~~G~~i~~~~--------~~~~~~tl~G~pV~~~~~~~~---~~~~~~-~~ 325 (421) T protein:vir:13 263 PN---ARKRAIIVTNSDGRAYLDGL--MDKQGRPLLKEL--------SDGGDLVFKGRPVIELEESIF---DVGDET-KF 325 (421) T ss_pred hh---hcCCCEEEEcHHHHHHHHHh--hcCCCceeecCc--------CCCCCceecceeeEEeccccc---cCCCce-EE Confidence 42 23456899999999999753 244443222111 112223343333333332211 112222 23 Q ss_pred EEEcCcceEEEeeCchhhhcccccc---CceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 301 VYDKSDRNLAMANPIPFRMLAPQMA---SLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 301 ~y~~~~~~~~~~vp~~~~~~~~~~~---~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) +|-+-.+.+.+.....++....... .-.+.+.++.|++|. +..|.++..+-+. 
T Consensus 326 ~~gd~~~~~~~~~~~~~~v~~~~~~~f~~~~~~~r~~~r~d~~-~~~~~a~~~~~~~ 381 (421) T protein:vir:13 326 IVSDFKTLIKFMDRKQYLIDQSKEAGYTKNETIARIIERFDVN-SPLDKSSDAEKIR 381 (421) T ss_pred EEEeccccEEEEEecceEEEeecccccccCeeEEEEEeeecce-eecchhhheeeec Confidence 3332223344433344443322111 112456667777544 4445554444333 No 135 >protein:vir:8885 Length: 347 # NCBI annotation: major capsid protein A # Family: family:all:975 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813774;genbank:gi:29366729;genbank:GeneID:1258837 Probab=94.52 E-value=0.0042 Score=33.63 Aligned_cols=303 Identities=10% Similarity=-0.001 Sum_probs=126.9 Q ss_pred ccccchhhhhhhhhhhccCCceeccchhhHHHHHHHHHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeecccc Q lcl|NC_021342. 24 SRNGDQWVINNTALDAIGNPNIMLDADGGIAFYISQLAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYDGVT 103 (354) Q Consensus 24 ~~~~~~~~~~~~am~a~~~~~~~~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~G 103 (354) |.+... ..+++.+ .+.+.. .+|.--.|+..-.-+++...-+ .-..+.++.+.+ +- +..++.+. ..| T Consensus 1 ~a~~~~--~~~~~~~-~g~~~~--~~d~~al~ie~~~geV~~~f~~----~s~~~~~~~~r~-i~-~G~sv~~~---~iG 66 (347) T protein:vir:88 1 MANATG--GQQIGAN-QGKGQS--AADKLALFLKVFGGEVLTAFVR----RSVTMDKHMVRT-IQ-NGKSASFP---VMG 66 (347) T ss_pred CCCccc--chhhhcc-CCCCcc--ccchHHHHHHHHHHHHHHHHHH----Hhhhhhcccccc-cc-CcceEEEe---eec Confidence 111110 0001111 011111 1221123443333445443332 234455555433 21 23445444 334 Q ss_pred ceeEec-CCCccc--ceeeeccceeEEEEEEEEeeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeee--- Q lcl|NC_021342. 104 MGKFIG-ANGQDL--PRVAQSAQMHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGD--- 177 (354) Q Consensus 104 ~a~~~~-~~~~di--p~v~~~~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~--- 177 (354) ..+... 
..++++ |..+..-.+..+.|-.+- -+..-+.+++.++ ...++-.+-.+.+..++++..|+.++--- T Consensus 67 ~~~~~~~~~g~~l~~~~~~~~~~~~~i~ID~~~-y~~~~Vdd~D~~q-~~~D~r~~~~~~~g~aLA~~~D~~i~~~l~~~ 144 (347) T protein:vir:88 67 RTKGYYLAPGENLDDKRKDIKHSEKVIQIDGLL-TSDVLIYDIEDAM-NHYDVRAEYSAQLGEALAIAADGAVLAEMAKL 144 (347) T ss_pred ceeeeeeccccCCCCCCCCCccceEEEEEechh-hhhhhhhhHHHHh-hcCCchHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 433211 112222 222333344444443321 2233456777775 45667777788899999999999876221 Q ss_pred ------hhhCceeeeecCCcccccccccc--cccCHHHHHHHHHHHHHHHHHHhCCcc-cccEEEeCHHHHHHHhhcccC Q lcl|NC_021342. 178 ------ASRGMYGLFNNPNVTLSSATKDY--KTMNGQELFNMLNAPIFSVINLSRRFH-VPNTALMFPDLWNQANNQLMT 248 (354) Q Consensus 178 ------~~~gi~GLlN~p~~~~~~~~~~W--~~~T~~ei~~di~~~~~~l~~~s~g~~-~p~~L~l~p~~~~~L~~~~~~ 248 (354) ....+.|+-....++..+ +.+- ..++++.+++.|.++...|.++ .+. ....++|+|..|..|...... T Consensus 145 a~~~~~~~~~~~g~~~~~~~~~~~-~~~~~~~~~~~~~~~~~i~~a~~~Lde~--~VP~~gR~~vv~P~~y~~Ll~~~~~ 221 (347) T protein:vir:88 145 CNLPAASNENIAGLGQAVVLNIGA-AADLVDVEARGKAILKGLTLARARLTKN--YVPAGDRRFYCAPEDYSAILSALMP 221 (347) T ss_pred hccccccccccCCccccccccccc-cccccchhhhHHHHHHHHHHHHHHHhhc--CCCCCCCEEEeCHHHHHHHhcchhh Confidence 112233432111111111 1111 2345677888999998888764 332 346899999999988754322 Q ss_pred CCCCchHHHHHHhhCcccccccccceeeeeeeeeecccccccc-------ccCccc--------EEEEEEcCcceEEEee Q lcl|NC_021342. 249 GYTDRTVMQHFMEANSYTLLTGNELDIQIRFQLDAAELAANGV-------SNSNKP--------RYMVYDKSDRNLAMAN 313 (354) Q Consensus 249 ~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~~g~-------g~~g~d--------~~v~y~~~~~~~~~~v 313 (354) ... +|.-..... +|....+--...+++..+-.... +..++. 
..--|.-+..+..-.+ T Consensus 222 ~~~-----~~~~~~~~~---~G~vg~i~G~~V~~s~nlp~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~d~~~~~~l~ 293 (347) T protein:vir:88 222 NAA-----NYAALIDPE---TGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLF 293 (347) T ss_pred hhh-----hhccccchh---cceeeeeccceEEEeecccccccccccccccccccccccccccccccccccccCcEEEEE Confidence 111 121111110 12111222222222222110000 000000 0000111122111111 Q ss_pred Cchhhhccc-----------cccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 314 PIPFRMLAP-----------QMASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 314 p~~~~~~~~-----------~~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) -.+.-...+ .++...+.+.+.... |+-+.||++++.+... T Consensus 294 ~~~~a~g~v~~~d~~~e~~r~~~~~~d~i~~~~~~-G~~~~rPe~a~~~~~~ 344 (347) T protein:vir:88 294 NHRSAVGTVKLKDMALERARRPEFQADQIIGKYAM-GHGGLRPEAAGALVFT 344 (347) T ss_pred echhhhhheecccceeeeeechhhHHHHhhhhhhh-cCceeccceEEEEEeC Confidence 111111111 122334556666665 6999999999887776 No 136 >protein:vir:97255 Length: 310 # NCBI annotation: hypothetical protein ORF017 # Family: family:all:1120 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294525;genbank:gi:149408246;genbank:GeneID:5237120 Probab=94.49 E-value=0.0043 Score=33.57 Aligned_cols=279 Identities=12% Similarity=-0.001 Sum_probs=130.5 Q ss_pred CCceeccchhhHHHHHHHHHHHHHHHHHhhhhcccchhhcc---ccCCCCCceeEEEEEeecc---ccceeEecCCC-cc Q lcl|NC_021342. 42 NPNIMLDADGGIAFYISQLAGIEATVYETPYGDITYRFDVP---MAANIPEYADTWMYRSYDG---VTMGKFIGANG-QD 114 (354) Q Consensus 42 ~~~~~~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~---v~~~~~~~~~~~~~~~~~~---~G~a~~~~~~~-~d 114 (354) -|+++.. ++ ..+.. +.+...|+|.....-..-+.+| +++. ++.|..... ++...+-..+. 
.+ T Consensus 1 mpaltLa-ea--~k~~~--d~l~~~ViE~~~~~s~lL~~LpF~~veg~------~~~ynR~~~~~~~~~~~v~~~~~~~g 69 (310) T protein:vir:97 1 MASVTLA-ES--AKLAQ--DELVAGVIENIITVNRMFDVLPFDSIEGN------SLAYNRENVLGDVIMAGVGTTFSGAG 69 (310) T ss_pred CcccchH-HH--hhcCc--chHHHHHHHHHhccchHHHhCCcccccCC------cceeeEeeccCCcccccccccccCCC Confidence 0222222 21 12222 4455677776654444445555 3322 233332222 22222111111 11 Q ss_pred cceeeeccceeEEEEEEEEeeEeecHHHHHHHHHhCCCcc--hHHHHHHHHHHHHHhhheeeeeehh-hCceeeeecCC- Q lcl|NC_021342. 115 LPRVAQSAQMHTVPLGYAGNECHYTLDEMRKSAAMNMPID--AEQARLAFRGAEEHSQSVAYFGDAS-RGMYGLFNNPN- 190 (354) Q Consensus 115 ip~v~~~~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld--~~k~~aA~~~~a~~~n~~~f~G~~~-~gi~GLlN~p~- 190 (354) .|......+.....+..++..+++..+-.+. ..+-+.+ ....+...+++.++.++-.+|||.. ..++||+..-. T Consensus 70 ~~~~~~t~~~~~~~L~i~~g~~~Vd~~i~dl--~~~~~~dq~~~Ql~~~iea~~~~~e~~lINGD~a~n~F~GL~~~~~~ 147 (310) T protein:vir:97 70 AGKAAATFTKVNSNLTTIMGDAEVNGLIQAT--RSGDGNDQTAVQIASKAKSAGRKYQDQLINGNGAGNEFAGLIQLCAS 147 (310) T ss_pred ccccccccceeeeeeeeeeehhhhhhHHHhh--hcCChHHHHHHHHHHHHHHHHHHHHHHhhccccCCCcccchhhcCCc Confidence 2222233344455566665555443221111 1233333 3456667788899999999999873 46779876421 Q ss_pred cccccccccccccCHHHHHHHHHHHHHHHHHHhCCcccccEEEeCHHH---HHHHhhcccC-CCCCchHHHHHHhhCccc Q lcl|NC_021342. 191 VTLSSATKDYKTMNGQELFNMLNAPIFSVINLSRRFHVPNTALMFPDL---WNQANNQLMT-GYTDRTVMQHFMEANSYT 266 (354) Q Consensus 191 ~~~~~~~~~W~~~T~~ei~~di~~~~~~l~~~s~g~~~p~~L~l~p~~---~~~L~~~~~~-~~~~~Tvl~~l~~n~~~~ 266 (354) -.....++.=..-| ++|+.++++.+|.. --.|..|+++|.. +.-+.|.-.. ..+..++.. 
T Consensus 148 ~q~i~~~~~gg~~t----~d~LDeLl~~v~~~---~g~p~~~l~~~~~~r~i~A~~R~~~~~g~~~~~~~~--------- 211 (310) T protein:vir:97 148 GQKATTGATGSAIS----FAILDELMDLVVDK---DGQVDYLTMHARTLRSYKALLRALGGASINEVVELP--------- 211 (310) T ss_pred cceeecCCCCCCCC----HHHHHHHHHHHhcC---CCCCCEEEecHHHHHHHHHHHHHhcCCCCCCccccC--------- Confidence 11111111111123 47888888888752 2368899999964 5555542110 112222211 Q ss_pred cccccc-ceeeeeeeeeecccccc--ccccCcccEEEEEEcCcc-----eEEEeeC----chhhhcc-ccccC-ceeEEe Q lcl|NC_021342. 267 LLTGNE-LDIQIRFQLDAAELAAN--GVSNSNKPRYMVYDKSDR-----NLAMANP----IPFRMLA-PQMAS-LGITVP 332 (354) Q Consensus 267 ~~~g~~-l~I~~~~~L~~~~~~~~--g~g~~g~d~~v~y~~~~~-----~~~~~vp----~~~~~~~-~~~~~-l~~~~~ 332 (354) .|++ +...-+|.+.+..+... ....+|+..+.+..-..+ +..++.+ ...++.. .+.+. ..|.+. T Consensus 212 --~G~~v~~~~GiPi~~~d~ip~~~~~~~~~gtTsIya~r~Ge~~~~~Gv~Gl~~~~~~glsVr~~G~~~~~~v~~~~V~ 289 (310) T protein:vir:97 212 --SGAEVPAYSGTPIFRNDYIPTNQTKGGTTGCTTIFAGTLDDGSRTHGIAGLTATQAAGIQVVDVGESEDSDEHIWRVK 289 (310) T ss_pred --CCCEEeeeCCeEEEEeCccCCCccccccCCceeEEEEeeCccccccceeccccCCccceeEEeCCcccCCcceeEEEE Confidence 1222 33444445544443222 112345555666555543 3333321 2233333 23333 455554 Q ss_pred eeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 333 AEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 333 ~~~~~gGv~i~~P~ai~y~D~~ 354 (354) .+ -|+-++-|.|++.++-- T Consensus 290 ~Y---~~~av~~~~A~a~L~~V 308 (310) T protein:vir:97 290 WY---CGLALFSEKGLACADGI 308 (310) T ss_pred Ee---eeEEEecccceeeeccc Confidence 44 37888999999886555 No 137 >protein:vir:95107 Length: 270 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240822;genbank:gi:66394683;genbank:GeneID:5133901 Probab=94.38 E-value=0.0046 Score=33.41 Aligned_cols=260 Identities=8% Similarity=-0.029 Sum_probs=126.6 Q ss_pred CceeccchhhHHHHHHHHHHHHHHHHHhhhhcccchhhccccCCCCC-ceeEEEEEeeccccceeEecCCCcccceeeec Q lcl|NC_021342. 
43 PNIMLDADGGIAFYISQLAGIEATVYETPYGDITYRFDVPMAANIPE-YADTWMYRSYDGVTMGKFIGANGQDLPRVAQS 121 (354) Q Consensus 43 ~~~~~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~-~~~~~~~~~~~~~G~a~~~~~~~~dip~v~~~ 121 (354) =+-|.-+|- ... +-+-+-|.+.....+....+..+.+.+.. .-.++.++.+...|.++.+.++ ++|+..... T Consensus 1 Ma~T~~~d~----I~P--ev~~~~V~e~~~~~~~~~~~~~~d~~L~g~~G~ti~~P~~~~igdae~~~eg-~~i~~~~lt 73 (270) T protein:vir:95 1 MTQTKKANL----INP--EVLANVVSAQMQNAIRFTPYAVTDDTLVGQPGDTITRPKYAYIGAAEDLQEG-VAMDTTQMS 73 (270) T ss_pred CCceehhhh----cch--HHHHHHHHHHHHhHHhhccccccccccCCCCCCEEEeeeecCCCccccccCC-Cccchhhcc Confidence 000111221 111 11111222222222333344444333221 3456788888889999988875 468888888 Q ss_pred cceeEEEEEEEEeeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehhhCceeeeecCCccccccccccc Q lcl|NC_021342. 122 AQMHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDASRGMYGLFNNPNVTLSSATKDYK 201 (354) Q Consensus 122 ~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~gi~GLlN~p~~~~~~~~~~W~ 201 (354) .+.....+.+.+.+|.+ -|++.....|=| -.+-....+..++++.|+.++ +.+ .|- .|+ T Consensus 74 ~~~~~a~i~~~gk~~~i--tD~a~~~~~~dp-~~~~~~q~a~~~a~~~d~~li---~~l--~~a-------------~~~ 132 (270) T protein:vir:95 74 MTTTKVTVKETGKAVEV--TQTAIITNVNGT-LQEASRQLAMSLADKVEIDYI---AEL--NKS-------------KQT 132 (270) T ss_pred cchheeeeehhhCccee--cHHHHhhhccch-HHHHHHHHHHHHHHHHHHHHH---HHh--ccc-------------ccc Confidence 88888888887766665 666665544544 455566677778777776553 111 111 111 Q ss_pred ccCHHHHHHHHHHHHHHHHHHhCCcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeeeee Q lcl|NC_021342. 202 TMNGQELFNMLNAPIFSVINLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQL 281 (354) Q Consensus 202 ~~T~~ei~~di~~~~~~l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L 281 (354) . +..--.++|++++..+-. ....++.++|+|..+..|.+...-..... -+-+..|+.+....|.+ .. 
T Consensus 133 ~-~~~~t~~~~~dA~~~lgd---~~~~~~~i~vhs~~~~~Lrk~~~~~~~~~--~~~~~~~G~ig~~~G~~-------Vi 199 (270) T protein:vir:95 133 A-TVSADATGILDAIEVFNS---ENDEDYVLYVNPKDYNKLVKSLFKVGGNV--QDRAISKGDLVEIVGVS-------DI 199 (270) T ss_pred c-ccccCHHHHHHHHHHhcc---ccCCCcEEEEcHHHHHHHHhhhccccccc--ccchhcccccceeccee-------EE Confidence 0 001114667777766633 23567899999999999854321111110 11112223222222221 11 Q ss_pred eeccccccccccCcccEEEEEEcCcceEEEeeCchhhhccccc--cCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 282 DAAELAANGVSNSNKPRYMVYDKSDRNLAMANPIPFRMLAPQM--ASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 282 ~~~~~~~~g~g~~g~d~~v~y~~~~~~~~~~vp~~~~~~~~~~--~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) .... .-..+ -.|-..+.-+.+....+.+. ..+. .-....+-.. +..||.+..|..++.+..+ T Consensus 200 v~s~-----~~~~~----~~~l~~~gAi~~~~~~~~~v-EtdRd~~~~~d~i~~~-~~y~v~~~~~skvv~~t~~ 263 (270) T protein:vir:95 200 VKSK-----RVSEN----TAFLQRYGAMEIVNKKKPEA-YTDFDILKRTHLLSTN-YHYSVNLKDETGVVKVTFK 263 (270) T ss_pred EeCC-----CCCce----eEEEEeccceeeeecCCcee-eeccchhhcccEEEee-eEEEEEEEccceEEEEEec Confidence 1000 00001 11222344444444444332 1111 1112222222 3357999999999998887 No 138 >protein:vir:78739 Length: 332 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285448;genbank:gi:148724482;genbank:GeneID:5220210 Probab=93.95 E-value=0.0059 Score=32.82 Aligned_cols=297 Identities=8% Similarity=-0.036 Sum_probs=129.1 Q ss_pred ccccchhhhhhhhhhhccCCceeccchhhHHHHHHHHHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeecccc Q lcl|NC_021342. 24 SRNGDQWVINNTALDAIGNPNIMLDADGGIAFYISQLAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYDGVT 103 (354) Q Consensus 24 ~~~~~~~~~~~~am~a~~~~~~~~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~G 103 (354) +..=.+....+.+=... .+..-|++ --.|+..-.-+|+.+.- ..-..+.++.+.+-. +..++.+... 
| T Consensus 1 ~~~~~~~~~~~~~~~~~--~~~~~d~~-~al~le~~~geV~~~f~----~~s~~~~~~~~r~i~--~G~tv~i~~i---g 68 (332) T protein:vir:78 1 MTTLSNFSLPNQANGGA--RNADYDVR-YATALKLFSGEVFTAFN----NASIFKGLVRSYDLR--GGKSKQFMFT---G 68 (332) T ss_pred CcccccccCCccccCCc--cccccccc-hhhhhhhhhhhHHHHHH----HHhhhhhcccccccc--ccceEEEEec---c Confidence 11111122222221111 11112222 12344222234444443 222334444432211 2344544433 4 Q ss_pred ceeE--ecCCCccc-ceeeeccceeEEEEEEEEeeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeee----e Q lcl|NC_021342. 104 MGKF--IGANGQDL-PRVAQSAQMHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYF----G 176 (354) Q Consensus 104 ~a~~--~~~~~~di-p~v~~~~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~----G 176 (354) ..++ +..+ .++ |..+++-.+....+-. ..-+..-+.+++.++ ...++-.+-.+.+..++++..|+.++- + T Consensus 69 ~~~~~~~~~g-~~l~~~~~~~~~~~~l~ID~-~ky~~~~VddiD~~q-~~~dl~~~~~~~~g~aLA~~~D~~i~~~l~~a 145 (332) T protein:vir:78 69 KLSAGYHTPG-TPIVGDAGIKANEKTLVMDD-LLVSSQFVYSLDEIF-SQYSTRAEVSKQIGEALATHYDERIARVLAKA 145 (332) T ss_pred ceeEeeecCC-CCCCCCCCCCCceEEEEEeh-hhhhHHHHHhHHHHh-cCcchHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 4433 2222 222 2223333344433322 123445567888886 456788888899999999999987762 1 Q ss_pred -ehhhCceeeeecCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCcc-cccEEEeCHHHHHHHhhc---cc-C-C Q lcl|NC_021342. 177 -DASRGMYGLFNNPNVTLSSATKDYKTMNGQELFNMLNAPIFSVINLSRRFH-VPNTALMFPDLWNQANNQ---LM-T-G 249 (354) Q Consensus 177 -~~~~gi~GLlN~p~~~~~~~~~~W~~~T~~ei~~di~~~~~~l~~~s~g~~-~p~~L~l~p~~~~~L~~~---~~-~-~ 249 (354) .......|. |+-.....+.. .+.+++.+++-|.++..+|.++ .+- .-..++|+|..|..|.+. 
++ + + T Consensus 146 a~~~~~~~~~---~g~~~~~~~~~-~~~~~~~~~~~i~~a~~~Lde~--~VP~~gR~~vv~P~~y~~Ll~~~d~~~~n~~ 219 (332) T protein:vir:78 146 SAEASPVTGE---PGGFHVNIGAG-NTNDAQAIVDGFFEAAAVLDER--SAPQEGRVAVLSPRQYYSLISSVDTNILNRE 219 (332) T ss_pred hcccCccccc---ccccccccCCc-cccCHHHHHHHHHHHHHHHhhc--CCCccCCEEEeCHHHHHHHHhhcCceeeeee Confidence 111222221 11111111111 2346889999999999998775 331 114688999999998751 11 1 1 Q ss_pred CCCchHHHHHHhhCcccccccc-cceeeeeeeeeeccccccc------cc----------cCcccEEEEEEcCcceEEEe Q lcl|NC_021342. 250 YTDRTVMQHFMEANSYTLLTGN-ELDIQIRFQLDAAELAANG------VS----------NSNKPRYMVYDKSDRNLAMA 312 (354) Q Consensus 250 ~~~~Tvl~~l~~n~~~~~~~g~-~l~I~~~~~L~~~~~~~~g------~g----------~~g~d~~v~y~~~~~~~~~~ 312 (354) ..+. +... .+|. --.+-..+.+++..+-... .+ +-++-..++|. ++-+... T Consensus 220 ~~~~--------~~~~--~~g~~i~~i~G~~V~~Sn~lp~~~g~~~~~~~~~~~~n~~~~~~~~~~~~~~h--~~a~~~v 287 (332) T protein:vir:78 220 IGNS--------QGDM--NSGKGLYSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFH--REAAGCI 287 (332) T ss_pred cccc--------ccce--ecceeeeEEeeeEEEecCccccCcccccccccccccccccccccccceEEeec--ccceeee Confidence 1110 0000 0010 0111222223333221100 00 00111223332 3334333 Q ss_pred eCchhhhccc----cccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 313 NPIPFRMLAP----QMASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 313 vp~~~~~~~~----~~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) ..++++.--. ..+.....+...... 
|+-+.||.+++.+==| T Consensus 288 ~~~~~~~~~t~~~~~~~~~~d~i~~~~~~-G~~v~rPe~~v~l~~a 332 (332) T protein:vir:78 288 QSVAPTIQTTSGDFNVQYQGDLIVGKLAM-GCGSLRTSVAGSFQAA 332 (332) T ss_pred eeeccchhhhhcccchhhhHhhhhhhhhh-cCceecccceEEEeeC Confidence 3344322111 112223344444444 6899999999999888 No 139 >protein:vir:10450 Length: 344 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848297;genbank:gi:30387487;genbank:GeneID:1733971 Probab=93.78 E-value=0.0064 Score=32.61 Aligned_cols=296 Identities=10% Similarity=0.027 Sum_probs=126.2 Q ss_pred ccccchhhhhhhhhhhccCCceecc-------chhhH-HHHHHHHHHHHHHHHHhhhhcccchhhccccCCCCCceeEEE Q lcl|NC_021342. 24 SRNGDQWVINNTALDAIGNPNIMLD-------ADGGI-AFYISQLAGIEATVYETPYGDITYRFDVPMAANIPEYADTWM 95 (354) Q Consensus 24 ~~~~~~~~~~~~am~a~~~~~~~~d-------a~~~~-~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~ 95 (354) +.+. ++++...+. +.+.. .|+..-.-+|+...-+ .-..+.++.+.+ +- +..++. T Consensus 1 ma~~------------~~~~~~n~~~~~~~~~~~~~~al~ie~~~geV~~~f~~----~s~~~~~~~~r~-i~-~g~s~~ 62 (344) T protein:vir:10 1 MANM------------TGGQQLGTNQGKDVMAAGDKLALFLKVFGGEVLTAFAR----TSVTTSRHMVRS-IS-SGKSAQ 62 (344) T ss_pred Cccc------------cccccCCcccCCccCCccchhHHHHHHHHHHHHHHHHH----Hhhhcccceeee-ec-ccceEE Confidence 1110 011111111 11222 3443333455554433 223344444332 11 234444 Q ss_pred EEeeccccceeEec-CCCccccee--eeccceeEEEEEEEEeeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhhe Q lcl|NC_021342. 96 YRSYDGVTMGKFIG-ANGQDLPRV--AQSAQMHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSV 172 (354) Q Consensus 96 ~~~~~~~G~a~~~~-~~~~dip~v--~~~~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~ 172 (354) +. ..|..++-. ..+++++.. +..-.+..+-+-. ..-+..-+.+++.++ ...++..+-.+.+..++++..|+. 
T Consensus 63 ~~---~iG~~~~~~~~~G~~l~~t~~~~~~~e~~l~ID~-~~y~~~~VdDiD~~q-~~~D~r~~~~~~~G~aLA~~~D~~ 137 (344) T protein:vir:10 63 FP---VLGRTQAAYLAPGENLDDIRKDIKHTEKVITIDG-LLTADVLIYDIEDAM-NHYDVRSEYTSQLGESLAMAADGA 137 (344) T ss_pred EE---eeceeEEEeeecCCCCCCCCCCcccceEEEEEcc-hhhhhhhhhhHHHHh-cCcchHHHHHHHHHHHHHHHHHHH Confidence 44 334444321 112333321 1222232232211 012344557888885 566787888888999999999987 Q ss_pred eee----ee-----hhhCceeeeecCCcccccccccc--cccCHHHHHHHHHHHHHHHHHHhCCcc-cccEEEeCHHHHH Q lcl|NC_021342. 173 AYF----GD-----ASRGMYGLFNNPNVTLSSATKDY--KTMNGQELFNMLNAPIFSVINLSRRFH-VPNTALMFPDLWN 240 (354) Q Consensus 173 ~f~----G~-----~~~gi~GLlN~p~~~~~~~~~~W--~~~T~~ei~~di~~~~~~l~~~s~g~~-~p~~L~l~p~~~~ 240 (354) ++- +. ......|+-..-.+.....+..- ...+++.+++.|.++...|.++ .+- ....++|+|..|. T Consensus 138 i~~~la~~a~~~~~~~~~~~g~~~~~~~~~~~~~~~~t~~~~~~~~~~~~i~~a~~~Lde~--~VP~~gR~~vv~P~~y~ 215 (344) T protein:vir:10 138 VLAEIAGLCNVESQYNENITGLGTATVIETTQDKTTLTDQVALGKEIIAALTKARAALTKN--YVPSSDRVFYCDPDSYS 215 (344) T ss_pred HHHHHHhhhccccccccccccccccceeecccccccccchhhhHHHHHHHHHHHHHHHhhc--CCCccCCEEEeChHHHH Confidence 752 11 11222332111111111111111 1245678888899999888775 221 1257889999999 Q ss_pred HHhhcccCCCCCchHHHHHHhhCcccccccccceeeeeeeeeeccccccc-----cccCcc----------cEEEEEEcC Q lcl|NC_021342. 241 QANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQLDAAELAANG-----VSNSNK----------PRYMVYDKS 305 (354) Q Consensus 241 ~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~~g-----~g~~g~----------d~~v~y~~~ 305 (354) .|..-..-. --+|.-.+ . ..+|....+-..+.+++..+-... -+.+|. ...+.+++. 
T Consensus 216 ~Ll~~~~~~-----~~~~~~~~-~--~~~G~V~~v~G~~V~~Sn~lp~~~~~~~~~~~tg~~~~~~~~~~~~~~~~~s~~ 287 (344) T protein:vir:10 216 AILAALMPN-----AANYAALI-D--PEKGSIRNVMGFEVVEVPHLTAGGAGTSREGTTGQKHAFPATKSGNDKVAKDNV 287 (344) T ss_pred HHhhccccc-----cccccccc-c--eeeeEEEEEeceEEEeccccccccCCcccccccCccccccCCcccceeeeccee Confidence 987532110 00111111 1 112333333333333333321100 000110 111111111 Q ss_pred ------cceEEEeeCchh--hhccccccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 306 ------DRNLAMANPIPF--RMLAPQMASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 306 ------~~~~~~~vp~~~--~~~~~~~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) |+-+-...-+++ +.... .+...+.+.+.... |.-+.||++++.+..+ T Consensus 288 ~~l~~h~~A~~~v~~~~~~~e~~r~-~~~~~d~i~g~~~~-G~~vlRPe~a~~v~~~ 342 (344) T protein:vir:10 288 IGLFMHRSAVGTVKLRDLALERARR-ANFQADQIIAKYAM-GHGGLRPEAAGAVVFK 342 (344) T ss_pred EEEeechhhhhhhhhccceeecccc-hhHHHHHHHHHhhc-ccceecccceEEEEee Confidence 110001111111 11111 22234455555555 6999999999999999 No 140 >protein:vir:99888 Length: 309 # NCBI annotation: capsid protein # Family: family:all:908 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164075;genbank:gi:56692607;genbank:GeneID:3192616 Probab=93.48 E-value=0.0074 Score=32.27 Aligned_cols=265 Identities=12% Similarity=0.051 Sum_probs=112.2 Q ss_pred ccCCceeccchhhHHHHHHHHHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeeccccc--ee-EecCCCcccc Q lcl|NC_021342. 40 IGNPNIMLDADGGIAFYISQLAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYDGVTM--GK-FIGANGQDLP 116 (354) Q Consensus 40 ~~~~~~~~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~G~--a~-~~~~~~~dip 116 (354) +.+..+..|.. | +.+= +.-.-+++.+.++||...-...+.+-..|...+..-. .. 
-.+...+ T Consensus 1 ~~~~~~~~dp~-----L----T~~A---~gy~n~~~Ia~~l~P~vpV~~~~~~~~~f~~~e~F~~~~t~r~~~~~~~--- 65 (309) T protein:vir:99 1 MSNAPFPIDPE-----L----TAIA---IAYRNGRMISDEVLPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPN--- 65 (309) T ss_pred CCCCCcCcCHh-----H----HHHH---hhccChhhhhhhcCCccccCccccceeeechhhcccccchhhccCCCcc--- Confidence 33444444421 1 1111 1112233667777776532223222222222111000 00 0111112 Q ss_pred eeeeccceeEEEEEEEEeeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhh----heeeeeehhhCceeeeecC-C- Q lcl|NC_021342. 117 RVAQSAQMHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQ----SVAYFGDASRGMYGLFNNP-N- 190 (354) Q Consensus 117 ~v~~~~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n----~~~f~G~~~~gi~GLlN~p-~- 190 (354) .++.........+...+....+..+|+..|. .+.++.....+.+...+...++ ++++.-. |.| + T Consensus 66 ~v~~~~~~~~~~~~~~~L~~~i~~~~~~~a~-~~~d~~~~Av~~l~~~i~l~rE~~~A~lv~~~a---------~y~~~~ 135 (309) T protein:vir:99 66 EVEFSATDETGSTEDHGLDAPVPQADIDNAP-TNYNPLGHATEQTTNLILLDREARTSKLVFSPN---------SYAAGN 135 (309) T ss_pred eEeecccCceeeecccceeecCCchhhhhcc-CCCCHHHHHHHHHHHHHHHHHHHHHHHHhcChh---------hcCCCc Confidence 3445555555556665665666667776663 3566666666655555554443 2322211 111 1 Q ss_pred cccccccccccccCHHHHHHHHHHHHHHHHHHhCCcccccEEEeCHHHHHHHhhc-----ccC---CCCCchHHHHHHhh Q lcl|NC_021342. 191 VTLSSATKDYKTMNGQELFNMLNAPIFSVINLSRRFHVPNTALMFPDLWNQANNQ-----LMT---GYTDRTVMQHFMEA 262 (354) Q Consensus 191 ~~~~~~~~~W~~~T~~ei~~di~~~~~~l~~~s~g~~~p~~L~l~p~~~~~L~~~-----~~~---~~~~~Tvl~~l~~n 262 (354) ..+.+.+..|++.++ +++.||.++..++ | ..|++++|+...|..|.+- ++. 
...+.--.+.|++- T Consensus 136 k~~Lsgt~~wsd~~S-DPi~~i~~~~~~~-----g-~~PN~~vlg~~~~~~l~~hp~i~~~ik~~~~~~g~it~~~la~l 208 (309) T protein:vir:99 136 KTTLSGADQWSDPTS-NPLPVITDALDSV-----I-LRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMAFLQEL 208 (309) T ss_pred eEEecCccccCCCCC-CcHHHHHHHHHhh-----C-CCcceEEechHHHHHHhhCHHHHHHhcCCCccccccCHHHHHHH Confidence 112233446988665 4688999887665 3 4899999999999887541 111 11111223444432 Q ss_pred Ccccccccccceeeeeeeeeeccccc-ccccc-------CcccEEEEEEcC-cceEEEeeCchhhhccc-c---ccCcee Q lcl|NC_021342. 263 NSYTLLTGNELDIQIRFQLDAAELAA-NGVSN-------SNKPRYMVYDKS-DRNLAMANPIPFRMLAP-Q---MASLGI 329 (354) Q Consensus 263 ~~~~~~~g~~l~I~~~~~L~~~~~~~-~g~g~-------~g~d~~v~y~~~-~~~~~~~vp~~~~~~~~-~---~~~l~~ 329 (354) ++++.+.. +..... +..|. -|.+..++|... .+.+. . .++... + ...-.+ T Consensus 209 ----------~~ve~V~v--g~a~~n~a~~g~~~~~~~iwg~~~~L~y~~~~~~~~~----~-ps~G~t~~~~~r~~g~~ 271 (309) T protein:vir:99 209 ----------LELDAIYI--GEARLNIARPGQNPNLIRAWGPHASFIYRDRLADTRN----G-TTFGLTAQWGDRVSGSI 271 (309) T ss_pred ----------hCcceEEe--ecceeeccccccccccccccCCcEEEEEcCCCCCCcc----c-ccccceeecccccCCce Confidence 11211111 111110 00011 134445555432 11211 0 111111 1 122235 Q ss_pred EEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 330 TVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 330 ~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) ..|++..-||-.||--. +++--|. T Consensus 272 ~d~~~~~~g~~~vr~~~-~~k~~i~ 295 (309) T protein:vir:99 272 ADPNIGLRGGQRVRVGE-SVKELVT 295 (309) T ss_pred eeeeeccCCceEEEEec-cccchhc Confidence 66666666664433211 1111111 No 141 >protein:vir:2201 Length: 345 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041998;swissprot:sw:p19726;genbank:gi:9627469;goa:P19726;uniprot:P19726;genbank:GeneID:1261026 Probab=93.45 E-value=0.0075 Score=32.23 Aligned_cols=292 Identities=9% Similarity=0.035 Sum_probs=125.6 Q ss_pred hhhccC-Cceecc-------chhhH-HHHHHHHHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeeccccceeE Q lcl|NC_021342. 
37 LDAIGN-PNIMLD-------ADGGI-AFYISQLAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYDGVTMGKF 107 (354) Q Consensus 37 m~a~~~-~~~~~d-------a~~~~-~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~G~a~~ 107 (354) |-.+.+ ....++ +++.. .|+..-.-+++...-+. =..+.++.+. .+- +..++.+.. .|..+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~al~le~f~geV~~~f~~~----s~~~~~~~~r-~i~-~gks~~~~~---iG~~~~ 71 (345) T protein:vir:22 1 MASMTGGQQMGTNQGKGVVAAGDKLALFLKVFGGEVLTAFART----SVTTSRHMVR-SIS-SGKSAQFPV---LGRTQA 71 (345) T ss_pred CcccccchhcccccccccccCCchhHHHHHHHhHHHHHHHHHH----hhhcccceee-ecc-ccceEEEee---ecceEE Confidence 211111 111111 12222 34432233555544332 2233444432 111 234454443 344443 Q ss_pred --ecCCCccccee--eeccceeEEEEEEEEeeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeee----e-e- Q lcl|NC_021342. 108 --IGANGQDLPRV--AQSAQMHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYF----G-D- 177 (354) Q Consensus 108 --~~~~~~dip~v--~~~~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~----G-~- 177 (354) +..+ +.++.. +....+.++.+-.. .-+..-+.+++.++ ...++-.+-.+.+..++++..|+.++- + . T Consensus 72 ~~~~~G-~~l~~~~~~~~~~e~~ltID~~-~y~~~~VddiD~~q-~~~D~r~~~s~~~G~aLA~~~D~~i~~~l~k~a~~ 148 (345) T protein:vir:22 72 AYLAPG-ENLDDKRKDIKHTEKVITIDGL-LTADVLIYDIEDAM-NHYDVRSEYTSQLGESLAMAADGAVLAEIAGLCNV 148 (345) T ss_pred EeeecC-CCCCCCCCCcccceEEEEecch-hhhhhhHhhHHHHh-cCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc Confidence 2222 222111 11222322222111 11234456888875 466777778888999999999987762 1 0 Q ss_pred --h-hhCceeeeecCCccccccccccc--ccCHHHHHHHHHHHHHHHHHHhCCcc-cccEEEeCHHHHHHHhhcccCCCC Q lcl|NC_021342. 178 --A-SRGMYGLFNNPNVTLSSATKDYK--TMNGQELFNMLNAPIFSVINLSRRFH-VPNTALMFPDLWNQANNQLMTGYT 251 (354) Q Consensus 178 --~-~~gi~GLlN~p~~~~~~~~~~W~--~~T~~ei~~di~~~~~~l~~~s~g~~-~p~~L~l~p~~~~~L~~~~~~~~~ 251 (354) + .....|+-+-..+.....+.++. .++++.+++.|.++..+|.++ .+- .-..++|+|..|..|..-..-.. 
T Consensus 149 ~~~~~~~~~~~~~~~~~~~~~~g~~~t~~~~~~~~~~~ai~~a~~~Lde~--~VP~~~R~~vv~P~~y~~Ll~~~~~~~- 225 (345) T protein:vir:22 149 ESKYNENIEGLGTATVIETTQNKAALTDQVALGKEIIAALTKARAALTKN--YVPAADRVFYCDPDSYSAILAALMPNA- 225 (345) T ss_pred cccccccccccccccccccccccccccccccCHHHHHHHHHHHHHHhhhc--CCCccCCEEEeChHHHHHHhccccccc- Confidence 0 01112222211111222222222 245788999999998888764 221 12579999999999875321110 Q ss_pred CchHHHHHHhhCcccccccccceeeeeeeeeeccccc------------------ccccc------CcccEEEEEEcCcc Q lcl|NC_021342. 252 DRTVMQHFMEANSYTLLTGNELDIQIRFQLDAAELAA------------------NGVSN------SNKPRYMVYDKSDR 307 (354) Q Consensus 252 ~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~------------------~g~g~------~g~d~~v~y~~~~~ 307 (354) -+|.-.+.. ..|....+-..+.+++..+.. .+.|. .++-++++|.++ T Consensus 226 ----~~~~~~~~~---~~G~V~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~l~~h~~-- 296 (345) T protein:vir:22 226 ----ANYAALIDP---EKGSIRNVMGFEVVEVPHLTAGGAGTAREGTTGQKHVFPANKGEGNVKVAKDNVIGLFMHRS-- 296 (345) T ss_pred ----ccccccccc---ccceEEEEeceEEEecccccccccCccccCcccccccccccccceeeeeccCceEEEEEehh-- Confidence 111111111 123333333333333222210 00000 011233444322 Q ss_pred eEEEeeCch--hhhccccccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 308 NLAMANPIP--FRMLAPQMASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 308 ~~~~~vp~~--~~~~~~~~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) -+...-.++ .+... +.+...+.+.+.... 
|+-+.||++++.+..- T Consensus 297 A~~~v~~~~~~~e~~r-~~~~~~d~I~~~~a~-G~~vlRPeaa~~i~~~ 343 (345) T protein:vir:22 297 AVGTVKLRDLALERAR-RANFQADQIIAKYAM-GHGGLRPEAAGAVVFK 343 (345) T ss_pred heeeeeeecceeeeee-chhHHHHHHHHHHhc-CCcccccceeEEEEEe Confidence 111111122 12222 122334455555554 6999999999887666 No 142 >protein:vir:102823 Length: 470 # NCBI annotation: major structural protein # Family: family:all:2450 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874086;genbank:gi:118197693;genbank:GeneID:4496015 Probab=93.29 E-value=0.0012 Score=36.58 Aligned_cols=292 Identities=14% Similarity=0.080 Sum_probs=124.8 Q ss_pred cc-cccccccchhhhhhhhhhhccCCceeccchhhHHHHHHHHHHHHHHHHHhhh--hcccchhhccccCCCCCceeEEE Q lcl|NC_021342. 19 HK-GYVSRNGDQWVINNTALDAIGNPNIMLDADGGIAFYISQLAGIEATVYETPY--GDITYRFDVPMAANIPEYADTWM 95 (354) Q Consensus 19 ~~-~~~~~~~~~~~~~~~am~a~~~~~~~~da~~~~~fl~~~L~~Id~~v~e~~~--~~l~~r~~v~v~~~~~~~~~~~~ 95 (354) +| ++..-+ |+. ..-|+.|++.- +.+ +.+ +.+|+++...-. .+++.-.-++ ..+.......|+ T Consensus 1 ~~~~~~~~~-~~a--~~~al~~a~~~--------g~A-lR~--EsLd~~l~~lt~~~~~ftf~~~i~-k~~a~STV~ey~ 65 (470) T protein:vir:10 1 MPYEHLKHL-DEA--TLKALNAAGQV--------AES-LER--EDLEPEVTQLNVLDTPLTDLLSKN-AVKAKAYEHEYN 65 (470) T ss_pred CChhHhhhh-hHH--HHHHHHHhhhc--------chh-hhh--hhhccceeEeeecCccchhhhhcC-CchhhhHhhhhh Confidence 22 111111 121 12255554211 123 222 555666554222 2233322232 223333333332 Q ss_pred --EEeeccccceeEecCCCcccceeeeccceeEEEEEEEEeeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhhee Q lcl|NC_021342. 96 --YRSYDGVTMGKFIGANGQDLPRVAQSAQMHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVA 173 (354) Q Consensus 96 --~~~~~~~G~a~~~~~~~~dip~v~~~~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~ 173 (354) +......|.+. ++ .....+..|.+..|++..+..++.....+...+...+-.=.++.....+.|-..+++...... 
T Consensus 66 ~~~~rhG~~g~s~-~~-E~~l~~~~d~~~~Rr~v~~K~l~~~~~VT~~a~~~~~n~v~d~~~~~~~dai~~ia~tiE~a~ 143 (470) T protein:vir:10 66 VVTARHDKIGYAA-FR-EGGLPRTVEVNVVRRRIRPMLVGHRITVTELATRTTQNGVMQIDELVKREKMIAVANEFEYLA 143 (470) T ss_pred hhcccccccccee-ec-ccccCccCCCceEEEEEEEEEEeecchhhhhhhhhhhccccchHHHHHHHHHHHHHHHHHhhh Confidence 22223344432 22 334445667788888889999998888877766555444458888888899999999999999 Q ss_pred eeeehhhC-----------ceeeee--cCCc--ccccccccccccCHHHHHHHHHHHHHHHHHHhCCcccccEEEeCHHH Q lcl|NC_021342. 174 YFGDASRG-----------MYGLFN--NPNV--TLSSATKDYKTMNGQELFNMLNAPIFSVINLSRRFHVPNTALMFPDL 238 (354) Q Consensus 174 f~G~~~~g-----------i~GLlN--~p~~--~~~~~~~~W~~~T~~ei~~di~~~~~~l~~~s~g~~~p~~L~l~p~~ 238 (354) ||||+.+. ..||.| +++- .+..+.+.- .. .+.|+++-..+. .++++-.|+-+.||+.. T Consensus 144 FyGDs~l~s~~~g~~~gleFDGl~~lId~~~~~NViDarG~~----Ls--~~~L~~aa~~I~-~~~~fGt~TD~~lp~~v 216 (470) T protein:vir:10 144 FYGDNLLGDDVPGSPNNLQQDGIINIIKRGAPQNVLDAGGRP----LS--IDLLWEAESRVV-STQAFANPTAVFISYVD 216 (470) T ss_pred hhhccccccccCcccCceeccchhhhccCCCCccccccCCCC----cc--HHHHHHHHhhhc-ccccccChhhhccchhH Confidence 99988652 344422 1110 111111110 10 245555544443 35678889999999999 Q ss_pred HHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeeeeeeeccccccccccCcccEEEEEEc----Cc-----ceE Q lcl|NC_021342. 239 WNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQLDAAELAANGVSNSNKPRYMVYDK----SD-----RNL 309 (354) Q Consensus 239 ~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~~g~g~~g~d~~v~y~~----~~-----~~~ 309 (354) ...|....++..- .+..+|......|.++. .+.++. |.-.-++..+.+. 
+| ++- T Consensus 217 ka~f~~~~~~~qR------v~~~~N~~~~~~G~~v~-----~f~sa~------G~I~L~~s~~m~~~~k~~p~~l~~~v~ 279 (470) T protein:vir:10 217 KLNLQASFYQISR------VMTTADRRAGLLGADAQ-----SYIGVR------GEHSLYPSQFLGDFHKFNPARFGAEVG 279 (470) T ss_pred HHHHHHhhcCceE------EEEecCCCceeeeeecc-----ceeeee------eeeeecccccccchhhcCcccCCcccC Confidence 9988764332110 11111211111222211 000000 0000000011110 00 000 Q ss_pred EEeeC---------chhhhccccccCc--------eeEEeeeeeeeeEEEECCcee-EeeecC Q lcl|NC_021342. 310 AMANP---------IPFRMLAPQMASL--------GITVPAEYKISGTEFRYPLCA-AYVDMA 354 (354) Q Consensus 310 ~~~vp---------~~~~~~~~~~~~l--------~~~~~~~~~~gGv~i~~P~ai-~y~D~~ 354 (354) .+.-| .+...++.+-+.. .|......+.|-- +|.++ ++.|.. T Consensus 280 ~~aAP~~~~tv~~t~~~~a~~~~sk~g~~~~~~v~sy~y~v~~~~gds---~s~~v~vt~t~~ 339 (470) T protein:vir:10 280 DFAAPSNSWTVSTTDNFVTLPYNSGLGDPANTTVYSYAFKAANFYGES---AAKYIDVYIDST 339 (470) T ss_pred CcccCceeEEeecCCCceeecccCCCCcccCcceeEEEEEEEEecCCC---CcceEEEEEeee Confidence 01111 1222222222211 1222222222211 12222 222222 No 143 >protein:vir:3158 Length: 321 # NCBI annotation: capsid protein gpE # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665929;genbank:gi:22091115;genbank:GeneID:951342 Probab=93.22 E-value=0.0083 Score=31.98 Aligned_cols=294 Identities=10% Similarity=-0.018 Sum_probs=128.7 Q ss_pred CcccchhHHHHhhhhhhhcccccccccchhhhhhhhhhhccCCcee-ccchhhHHHHHHHHHHHHHHHHHhhhhcccchh Q lcl|NC_021342. 1 MAIKTIDAQTIQGNQWLVHKGYVSRNGDQWVINNTALDAIGNPNIM-LDADGGIAFYISQLAGIEATVYETPYGDITYRF 79 (354) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~am~a~~~~~~~-~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~ 79 (354) |+-|+++.. ++ . ..++ ..++ .|++++...-....+.|-..+.+. 
.+-+...+ T Consensus 1 ~~~k~~~~~-l~----------------~------~~~~---~~~~~~~~~~g~~v~~~~~~~l~~~i~e~-s~~l~~i~ 53 (321) T protein:vir:31 1 MASRTINND-LS----------------R------ITEK---NALTVDDLDAGGTLPDPLWDEFWTDMIEE-TPLLDAIR 53 (321) T ss_pred CchHHHHHH-HH----------------H------HHHh---ccccccccCCcceeCHHHHHHHHHHHHHh-hhhhhhce Confidence 655555431 00 0 0000 0111 122222111112223333334432 22233334 Q ss_pred hccccCCCCCceeEEEEEeeccccceeEecC-CCcccceeeeccceeEEEEEEEEeeEeecHHHHHHHHHhCCCcchHHH Q lcl|NC_021342. 80 DVPMAANIPEYADTWMYRSYDGVTMGKFIGA-NGQDLPRVAQSAQMHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQA 158 (354) Q Consensus 80 ~v~v~~~~~~~~~~~~~~~~~~~G~a~~~~~-~~~dip~v~~~~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~ 158 (354) ++++....+ .+. .....+.+-+.+. .....+..+...+..............+++.-|+..+ .+.++...-. T Consensus 54 v~~v~~~~~----~i~--~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~k~~~~~~it~e~L~d~a-~~~d~e~~i~ 126 (321) T protein:vir:31 54 TETVGAKKT----RIP--TLNIGERHRRPQDEGEWNENESDVSTGTIDISTEKATVAWDLPREVVQENP-EGEALADRIL 126 (321) T ss_pred eeeccCcce----eee--eeccCCcccccccccccccccccceeeeeeeeeEEEEeehhccHHHHHhhh-cchhHHHHHH Confidence 444432211 111 1111111112221 1112233344455666777887777778777776543 3567888888 Q ss_pred HHHHHHHHHHhhheeeeeehhhCc------eeeeecCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCccccc-E Q lcl|NC_021342. 159 RLAFRGAEEHSQSVAYFGDASRGM------YGLFNNPNVTLSSATKDYKTMNGQELFNMLNAPIFSVINLSRRFHVPN-T 231 (354) Q Consensus 159 ~aA~~~~a~~~n~~~f~G~~~~gi------~GLlN~p~~~~~~~~~~W~~~T~~ei~~di~~~~~~l~~~s~g~~~p~-~ 231 (354) ...+++++..+++++|+|+....- .|+++...-... ..++...+. -++.+.+++..|-.. +...+. . 
T Consensus 127 ~~ia~~~a~~~~~~~~nGd~~~~~~~~~~n~G~l~~a~~~~~--~~~~~~~~~--~~d~l~~l~~~l~~~--yr~~~~~v 200 (321) T protein:vir:31 127 NLMTDAWSADVEDLAANGDEDAEDSFENQNDGFITVAEGDVE--TIDAADDIL--DNDLVIRTIAGLDSK--YRARMNPA 200 (321) T ss_pred HHHHHHHHHHHHhheeeccccCCCcccccchhhhhhhccccc--ccccccccc--CHHHHHHHHHhccHh--HhcCCCeE Confidence 999999999999999999865443 466654321111 111211111 123455555555332 333343 5 Q ss_pred EEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeeeeeeeccccccccccCcccEEEEEEcCcceEEE Q lcl|NC_021342. 232 ALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQLDAAELAANGVSNSNKPRYMVYDKSDRNLAM 311 (354) Q Consensus 232 L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~~g~g~~g~d~~v~y~~~~~~~~~ 311 (354) .+|+++.+..+.......+++ +++-...+ +.+.++...|.+..+..- . +.+++ -+.+++.+ T Consensus 201 ~im~~~~~~~~~~~l~~~~~~--~~~~~l~~-------~~~~tl~G~pvv~~~~mP-~-------~~il~--t~~~nl~~ 261 (321) T protein:vir:31 201 LIVSEDQLLSYHYTLTDRDTP--LGDNVIMG-------EADVNPFSFPIIGSGLWP-D-------DKAMF--TDPQNLIY 261 (321) T ss_pred EEechHHHHHHHHHHhcCCCc--cccchhhc-------cccccccceeEEEcCCCC-C-------CcEEE--eccccEEE Confidence 679988876554333222222 11111111 222233333333333221 1 11222 23555544 Q ss_pred eeCchhhh--ccc--cc--cCceeEEeeeeeeeeEEEECCceeEeee-cC Q lcl|NC_021342. 312 ANPIPFRM--LAP--QM--ASLGITVPAEYKISGTEFRYPLCAAYVD-MA 354 (354) Q Consensus 312 ~vp~~~~~--~~~--~~--~~l~~~~~~~~~~gGv~i~~P~ai~y~D-~~ 354 (354) .+-...+. ..- +. +...+..-++..+ +..|..+.+++.+. 
|- T Consensus 262 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~ve~~~a~a~~~~i~ 310 (321) T protein:vir:31 262 ALYRDLEIDVLTESDKVSERDLHARYFMRGDD-DFAIENTEAVVLAEGLG 310 (321) T ss_pred EEeeccEEEEeecCccccccceeeEeeeeeec-ceeEeccccEEEEecCC Confidence 44333322 111 11 2234444444455 46677788888766 44 No 144 >protein:vir:6324 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877471;genbank:gi:33300843;uniprot:Q7Y2D3;genbank:GeneID:1482613 Probab=92.18 E-value=0.012 Score=31.02 Aligned_cols=294 Identities=9% Similarity=-0.001 Sum_probs=133.0 Q ss_pred cccccchhhhhhhhhhhccCCceeccchhhHHHHHHHHHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeeccc Q lcl|NC_021342. 23 VSRNGDQWVINNTALDAIGNPNIMLDADGGIAFYISQLAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYDGV 102 (354) Q Consensus 23 ~~~~~~~~~~~~~am~a~~~~~~~~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~ 102 (354) +.+.++-... +. .+..- +. -.|+..-.-+|+... .+.-..+.++.+.+-. +..++.++. . T Consensus 1 ms~~~~~tr~------~~--~~s~~--d~-al~le~f~geV~~af----~~~s~~~~~~~~rti~--~g~s~~~~~---i 60 (335) T protein:vir:63 1 MSFLNDLTRP------NY--AGKNA--DV-DIHLEEHLGIVDKHF----AYTSKFAPLMNIRDLR--GSNVVRLDR---L 60 (335) T ss_pred CCCcccchhh------hc--ccccc--hh-heehhhhhhhHHHHH----Hhhhhhccccceeeec--cceeEEEee---e Confidence 2222111000 00 11011 11 244432223444433 2233344444443321 233444433 3 Q ss_pred cceeEe----cCCCcccceeeeccceeEEEE--EEEEeeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeee- Q lcl|NC_021342. 103 TMGKFI----GANGQDLPRVAQSAQMHTVPL--GYAGNECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYF- 175 (354) Q Consensus 103 G~a~~~----~~~~~dip~v~~~~~~~~~pv--~~~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~- 175 (354) |..+.. |..-++.|... ++..+.| ..+.. .-+.+++.++ ...++-.+-....-.++++..|+.+|. 
T Consensus 61 G~~~~~~~~pG~~l~~~~~~~---~k~~itVD~ll~a~---~~I~dlDe~~-~~yDvRse~s~e~G~aLA~~~D~~~~~~ 133 (335) T protein:vir:63 61 GNVEAKGRRAGEELERSRVVN---DKWNLTVDTLLYLR---HQFDHQDEWT-QSFDMRKEVAELDGQELARKFDQACLIQ 133 (335) T ss_pred eeeeeecccCCcCcCCCCccc---cceEEEecceeech---hhhhhHHHHh-cCchhHHHHHHHHHHHHHHHHHHHHHHH Confidence 444433 22222223211 2222222 12222 2245666664 455666777777889999999997761 Q ss_pred -----ee-hhhCceeeeecCCccc-ccccccccccCHHHHHHHHHHHHHHHHHHhCCcc----cccEEEeCHHHHHHHhh Q lcl|NC_021342. 176 -----GD-ASRGMYGLFNNPNVTL-SSATKDYKTMNGQELFNMLNAPIFSVINLSRRFH----VPNTALMFPDLWNQANN 244 (354) Q Consensus 176 -----G~-~~~gi~GLlN~p~~~~-~~~~~~W~~~T~~ei~~di~~~~~~l~~~s~g~~----~p~~L~l~p~~~~~L~~ 244 (354) +. +....+|-++ +|+.. ...++.=+...++.+.+-+..+..+|.++ .+- ....++|+|..|..|.. T Consensus 134 i~~aa~~~a~~~~~~~~~-~G~~~~~~~tg~~~~~~~~~l~~a~~~a~~~L~e~--dVP~~~~~dr~~vv~P~~y~~Ll~ 210 (335) T protein:vir:63 134 VIKAAAMDAPVDLEDAFS-PGVLEKLDLTGLTAKQAADKIVRMHRRVVETFIDR--DLGDAVYSEGLTPMSPRVFSLLLE 210 (335) T ss_pred HHhhccccCccccCCCcC-CCcceeeeeccCcccccHHHHHHHHHHHHHHHHhc--cCCCcccCceEEEeChHHHHHHhc Confidence 11 2223333333 23221 11111112235888888888888888865 332 23689999999999875 Q ss_pred cccCCCCCchHHHHHHhhCcccccccccceeeeeeeeeecccccccccc-------------CcccEEEEEEcCcceEEE Q lcl|NC_021342. 245 QLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQLDAAELAANGVSN-------------SNKPRYMVYDKSDRNLAM 311 (354) Q Consensus 245 ~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~~g~g~-------------~g~d~~v~y~~~~~~~~~ 311 (354) - ..-.+. +|...+.......|....+-.++.+++..+- .+.++ ..+.++.++- .++-+.. 
T Consensus 211 ~--~~l~n~---~~~~s~~~~~~~~g~v~~v~Gv~V~~sn~lP-~~~~t~~~lg~a~n~~~~d~~~~~~~~~-~~~Al~t 283 (335) T protein:vir:63 211 H--DKLMNV---EYQATGATNDYVKSRVAILNGVKVLETPRFA-TKAIAAHPLGRHFNVSAEESERQIALFL-PSKTLIT 283 (335) T ss_pred c--cccccc---ccccccccccccCceeEEeeceEEEeeccCC-CCCcccccccccCCccccccceeEEEEE-ecceEEE Confidence 2 111111 2322222111223444555555555555441 11111 0112233222 2233333 Q ss_pred eeCchhhhc-cccccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 312 ANPIPFRML-APQMASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 312 ~vp~~~~~~-~~~~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) ...++++.- .-+.+...+.+.+.... |+-++||.+++-+... T Consensus 284 ~~~~~vt~e~~~~~~~~~~~i~~~~a~-G~g~lRPe~a~~i~~t 326 (335) T protein:vir:63 284 AQVAPVQAKLWEDNEKFSWVLDTFQMY-NIGARRPDTAGAIELK 326 (335) T ss_pred EEEeecccceeeccchhhHHhHHHHHc-CCcccccceEEEEEEc Confidence 333443321 11233445666666655 6999999999999988 No 145 >protein:vir:78935 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522824;genbank:gi:158345059;genbank:GeneID:5687425 Probab=91.24 E-value=0.017 Score=30.32 Aligned_cols=294 Identities=9% Similarity=-0.001 Sum_probs=130.1 Q ss_pred cccccchhhhhhhhhhhccCCceeccchhhHHHHHHHHHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeeccc Q lcl|NC_021342. 23 VSRNGDQWVINNTALDAIGNPNIMLDADGGIAFYISQLAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYDGV 102 (354) Q Consensus 23 ~~~~~~~~~~~~~am~a~~~~~~~~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~ 102 (354) +.+.++- .. .+ .++..- |. -.|+..-.-+|+++. .+.-..+.++.+.+-. +..++.++ .. 
T Consensus 1 ms~~~~~-t~-----~~--~~~s~~--d~-al~le~f~geV~~af----~~~s~~~~~~~~rti~--~g~s~~~~---~i 60 (335) T protein:vir:78 1 MSFLNDL-TR-----PN--YAGKNA--DV-DIHLEEHLGIVDKHF----AYTSKFAPLMNIRDLR--GSNVVRLD---RL 60 (335) T ss_pred CCccccc-cc-----cc--cccccc--hh-hhhhhhhhhHHHHHH----HHhhhhccccceeeec--cceeEEEe---ee Confidence 2211100 00 00 011111 11 244432233444443 3333344444443221 23444444 33 Q ss_pred cceeEe----cCCCcccceeeeccceeEEEE--EEEEeeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeee- Q lcl|NC_021342. 103 TMGKFI----GANGQDLPRVAQSAQMHTVPL--GYAGNECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYF- 175 (354) Q Consensus 103 G~a~~~----~~~~~dip~v~~~~~~~~~pv--~~~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~- 175 (354) |..+.. |..-+..|... ++..+.| ..+. +.-+.+++.++ ...++-.+-.+.+..++++..|+.++. T Consensus 61 G~~~~~~~~pG~~l~~~~~~~---~k~~itID~ll~a---~~~VddlDe~~-~~yDvR~e~s~~~G~aLA~~~Dq~~~~~ 133 (335) T protein:vir:78 61 GNVEAKGRRAGEELERSRVVN---DKWNLTVDTLLYL---RHQFDHQDEWT-QSFDMRKEVAELDGQELARKFDQACLIQ 133 (335) T ss_pred eeeeecccccCcccCCCCccc---CCeEEEecceeec---hhhHhhHHHhh-cCchhHHHHHHHHHHHHHHHHHHHHHHH Confidence 555432 22222223221 2222222 1111 22246666664 456777777788889999999997761 Q ss_pred -----ee-hhhCceeeeecCCccc-ccccccccccCHHHHHHHHHHHHHHHHHHhCCccc--c--cEEEeCHHHHHHHhh Q lcl|NC_021342. 176 -----GD-ASRGMYGLFNNPNVTL-SSATKDYKTMNGQELFNMLNAPIFSVINLSRRFHV--P--NTALMFPDLWNQANN 244 (354) Q Consensus 176 -----G~-~~~gi~GLlN~p~~~~-~~~~~~W~~~T~~ei~~di~~~~~~l~~~s~g~~~--p--~~L~l~p~~~~~L~~ 244 (354) +. +....++-++ ||... ....+.=.+.+++.+.+-+.++..++.+. ..-. + ..++|+|..|..|.. 
T Consensus 134 l~~aa~~~a~~~~~~~~~-~G~~~~~~~tg~~~~~~~~~l~~a~~~a~~~l~ek--dvP~~~~~~rv~vv~P~~y~~Ll~ 210 (335) T protein:vir:78 134 VIKAAAMDAPVDLEDAFS-PGVLEKLDLTGLTAKEAAEKIVRMHRRVVETFIER--DLGDAVYSEGLTPMSPRVFSLLLE 210 (335) T ss_pred HHhhcccccccccCCCcC-CCcceeeeeccccccccHHHHHHHHHHHHHHHHhc--cCCCCCCCccEEEeChHHHHHHhc Confidence 11 1112222222 22221 11111112345788888888888888754 2211 1 468999999999875 Q ss_pred cccCCCCCchHHHHHHhhCcccccccccceeeeeeeeeeccccccccccC-------------cccEEEEEEcCcceEEE Q lcl|NC_021342. 245 QLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQLDAAELAANGVSNS-------------NKPRYMVYDKSDRNLAM 311 (354) Q Consensus 245 ~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~~g~g~~-------------g~d~~v~y~~~~~~~~~ 311 (354) - ..-.+. +|...+.......|....+-.++.+++..+-. +.+++ -+.++.++ ..++-+.- T Consensus 211 ~--~~l~n~---~~~~s~~~~~~~~g~v~~v~Gv~V~~Sn~lP~-~~~t~~~lg~a~n~~~~d~~~~~~~~-~~~~Al~t 283 (335) T protein:vir:78 211 H--DKLMSV---EYQATGATNDYVKSRVAILNGVKVLETPRFAT-KAISAHPLGRHFNVSAEEAERQIALF-LPSKTLIT 283 (335) T ss_pred c--cccccc---cccccccccccccceeEEeeceEEEeeccCCC-CCCccccccccCCcccccccceEEEE-EecceEEE Confidence 2 011111 23222221112234444444444455444321 11110 11223333 23332222 Q ss_pred eeCchhhhc-cccccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 312 ANPIPFRML-APQMASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 312 ~vp~~~~~~-~~~~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) ...++++.- .-+.+...+.+.+.... |+-++||++++-+... 
T Consensus 284 ~~~~~~~~e~~~~~~~~~~~i~~~~a~-G~g~lRPe~a~~i~~t 326 (335) T protein:vir:78 284 AQVAPVQAKLWEDHDQFSWVLDTFQMY-NIGARRPDTAGAIELK 326 (335) T ss_pred EEEEecccceeeccchhhHhhhHHHHc-CCcccCcceEEEEEec Confidence 222333221 11233345666666654 6999999999999988 No 146 >protein:vir:1541 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052109;swissprot:trembl:q9t107;genbank:gi:9634035;uniprot:Q9T107;genbank:GeneID:1262383 Probab=89.91 E-value=0.024 Score=29.50 Aligned_cols=302 Identities=12% Similarity=0.034 Sum_probs=123.2 Q ss_pred CcccchhHHHHhhhhhhhcccccccccchhhhhhhhhhhccCCceeccchhhHHHHHHHHHHHHHHHHHhhhhcccchhh Q lcl|NC_021342. 1 MAIKTIDAQTIQGNQWLVHKGYVSRNGDQWVINNTALDAIGNPNIMLDADGGIAFYISQLAGIEATVYETPYGDITYRFD 80 (354) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~am~a~~~~~~~~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~ 80 (354) || ++ ++....+. ..+..++.-| .-..|+..-...++... ...-..+.+ T Consensus 1 ma------~~----------~~~~~~~t----------~~~~~~~~~~--~~a~~ie~f~g~V~~~f----~~~s~~~~~ 48 (347) T protein:vir:15 1 MA------NI----------QGGQQIGT----------NQGKGQSAAD--KLALFLKVFGGEVLTAF----ARTSVTMPR 48 (347) T ss_pred CC------cc----------ccCCcccc----------ccccCCCcch--HHHHHHHHHHHHHHHHH----HHhhhhhhc Confidence 11 00 00000000 0000111111 11234433333444433 223344555 Q ss_pred ccccCCCCCceeEEEEEeeccccceeE--ecCCCcccce--eeeccceeEEEEEEEEeeEeecHHHHHHHHHhCCCcchH Q lcl|NC_021342. 81 VPMAANIPEYADTWMYRSYDGVTMGKF--IGANGQDLPR--VAQSAQMHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAE 156 (354) Q Consensus 81 v~v~~~~~~~~~~~~~~~~~~~G~a~~--~~~~~~dip~--v~~~~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~ 156 (354) +.+.+ .- +..++.+... |..++ +.. +.+++. 
.+..-.+..+.|-.+ .-+..-+.+++.++ ...++-.+ T Consensus 49 ~~~~~-~~-~G~sv~i~~i---g~~t~~~~~~-g~~l~~~~~~~~~~e~~ltID~~-~~~~~~VddlD~~q-~~~D~~~~ 120 (347) T protein:vir:15 49 HMLRS-IA-SGKSAQFPVI---GRTKAAYLKP-GENLDDKRKDIKHTEKVIHIDGL-LTADVLIYDIEDAM-NHYDVRAE 120 (347) T ss_pred ccccc-cc-ccceeEeeec---cceeeeeecc-CCCCCCCCCCCccceEEEEechh-hhhhHHhhhHHHHh-cCCcchHH Confidence 54432 11 2334444433 33332 222 122221 112223333333322 12233457788776 55678788 Q ss_pred HHHHHHHHHHHHhhheeee-----eehh---hCceeeeecCCccc-ccc-ccccc--ccCHHHHHHHHHHHHHHHHHHhC Q lcl|NC_021342. 157 QARLAFRGAEEHSQSVAYF-----GDAS---RGMYGLFNNPNVTL-SSA-TKDYK--TMNGQELFNMLNAPIFSVINLSR 224 (354) Q Consensus 157 k~~aA~~~~a~~~n~~~f~-----G~~~---~gi~GLlN~p~~~~-~~~-~~~W~--~~T~~ei~~di~~~~~~l~~~s~ 224 (354) -.+.+..++++..|+.++- .+.. ....+.+..+++.. .+. +++.. ..+++.|++-|.++..+|.++ T Consensus 121 ~~~~~g~aLA~~~D~~i~~~l~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~d~~~~a~~~Lde~-- 198 (347) T protein:vir:15 121 YTAQLGESLAMAADGAVLAELAGLVNLPDASNENIEGLGKPTVLTLVKPTTGDLTDPVELGKAIIAQLTIARASLTKN-- 198 (347) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccCccccccccccccccchhhhhHHHHHHHHHHHHHHHHhhc-- Confidence 8889999999999988862 1111 11111111111111 111 11222 124667788888888888765 Q ss_pred Cc-ccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeeeeeeecccccccc-------ccCcc Q lcl|NC_021342. 225 RF-HVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQLDAAELAANGV-------SNSNK 296 (354) Q Consensus 225 g~-~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~~g~-------g~~g~ 296 (354) .+ .....++|+|..|..|....-. .+ -+|.-. . ...+|....+-..+++++..+-.... 
..+.+ T Consensus 199 ~VP~~gR~~vv~P~~y~~LL~~~~~--~~---~d~~~~-~--~~~~G~Vg~i~G~~V~~Sn~lp~~~~t~~~~~~~~g~~ 270 (347) T protein:vir:15 199 YVPAADRTFYTTPDNYSAILAALMP--NA---ANYQAL-I--DHERGTIRNVMGFEVVEVPHLTAGGAGDTREDAPADQK 270 (347) T ss_pred CCCccCCEEEeCHHHHHHHhccccc--cc---cccccc-c--cccceEEEEEeceEEEeccccccccccccccccccccc Confidence 33 1236799999999999764211 11 011100 0 01133333444444444444321110 00000 Q ss_pred ---------cEEEEEEc------CcceEEEeeCch--hhhccccccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 297 ---------PRYMVYDK------SDRNLAMANPIP--FRMLAPQMASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 297 ---------d~~v~y~~------~~~~~~~~vp~~--~~~~~~~~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) ....+|++ .++-+.....++ ++....+.+- ...+...... |+-+.||.+++-+-.- T Consensus 271 ~~~~~~~~~~~~~~f~~~~~l~~h~~A~g~v~~~~~~~e~~~~~~~~-~d~i~~~~~~-G~~vlrP~~av~~~~~ 343 (347) T protein:vir:15 271 HAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKDLALERARRANYQ-ADQIIAKYAM-GHGGLRPEAAGAIVLP 343 (347) T ss_pred ccccccccceeeeccccceeeeeccceeeeeEeeceeeeecccchhh-hhhhehhhhc-CCceeccccEEEEecC Confidence 01111111 111111111122 2222222222 2333334444 8999999998877443 No 147 >protein:vir:94711 Length: 347 # NCBI annotation: capsid # Family: family:all:975 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338120;genbank:gi:77118198;genbank:GeneID:3707734 Probab=89.07 E-value=0.029 Score=29.05 Aligned_cols=301 Identities=11% Similarity=0.043 Sum_probs=124.2 Q ss_pred CcccchhHHHHhhhhhhhcccccccccchhhhhhhhhhhccCCceeccchhhHHHHHHHHHHHHHHHHHhhhhcccchhh Q lcl|NC_021342. 1 MAIKTIDAQTIQGNQWLVHKGYVSRNGDQWVINNTALDAIGNPNIMLDADGGIAFYISQLAGIEATVYETPYGDITYRFD 80 (354) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~am~a~~~~~~~~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~ 80 (354) || ... ...+..+- +..+..-|+ --.|+..-.-+++... 
...-..+.+ T Consensus 1 m~-----------------------~~~---~~~~~t~~-g~~~~~~d~--~al~ik~f~~eV~~~f----~~~s~~~~~ 47 (347) T protein:vir:94 1 MA-----------------------NVP---GQKIGTDQ-GKGKSSSDA--LALFLKVFAGEVLTAF----TRRSVTADK 47 (347) T ss_pred CC-----------------------CCC---cccccccc-ccCCccccH--HHHHHHHHhHHHHHHH----HHHHhhhcc Confidence 11 000 00011111 111111111 1234432222333322 222223344 Q ss_pred ccccCCCCCceeEEEEEeeccccceeE--ecCCCccccee--eeccceeEEEEEEEEeeEeecHHHHHHHHHhCCCcchH Q lcl|NC_021342. 81 VPMAANIPEYADTWMYRSYDGVTMGKF--IGANGQDLPRV--AQSAQMHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAE 156 (354) Q Consensus 81 v~v~~~~~~~~~~~~~~~~~~~G~a~~--~~~~~~dip~v--~~~~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~ 156 (354) +.+.+ +- +..++.+.. .|..++ +.. +++++.. +..-.+..+.+-.+- -+..-+.+++.++ ...++..+ T Consensus 48 ~~~r~-i~-~G~sv~i~~---iG~~tv~~~t~-G~~l~~~~~~~~~~e~~itID~~~-~~~~~VddiD~~q-~~~D~~~~ 119 (347) T protein:vir:94 48 HIVRT-IQ-NGKSAQFPV---MGRTSGVYLAP-GERLSDKRKGIKHTEKVITIDGLL-TADVMIFDIEDAM-NHYDVAGE 119 (347) T ss_pred ccccc-cc-ccceEEEec---ccceeeeeecC-CCCcCCCCCCCCcceEEEEecchh-hhhHHhhhHHHHh-cCcchHHH Confidence 43332 11 234444433 344433 211 1222111 122223333332221 1233446777775 56677788 Q ss_pred HHHHHHHHHHHHhhheeee---------eehhhCceeeeecCCc-ccccccccc-cccCHHHHHHHHHHHHHHHHHHhCC Q lcl|NC_021342. 157 QARLAFRGAEEHSQSVAYF---------GDASRGMYGLFNNPNV-TLSSATKDY-KTMNGQELFNMLNAPIFSVINLSRR 225 (354) Q Consensus 157 k~~aA~~~~a~~~n~~~f~---------G~~~~gi~GLlN~p~~-~~~~~~~~W-~~~T~~ei~~di~~~~~~l~~~s~g 225 (354) -.+.+..++++..|+.++. +.+.....|+-. +++ +..+.+..- ..++++.+++.|.++..+|.+. . 
T Consensus 120 ~~~~~g~aLa~~~D~~i~~~~~~~aa~~~~~~~~~~g~~~-~s~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~--~ 196 (347) T protein:vir:94 120 YSNQLGEALAIAADGAVLAEMAILCNLPAASNENIAGLGT-ASVLEVGKKADLDTPAKLGEAIIGQLTIARAKLTSN--Y 196 (347) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhccccccccccCCCcc-cceeeccccccccchhhhHHHHHHHHHHHHHHHhhc--C Confidence 8889999999999987752 122222334321 221 111111111 1245778888888888888764 2 Q ss_pred cc-cccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeeeeeeeccccccccc----------cC Q lcl|NC_021342. 226 FH-VPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQLDAAELAANGVS----------NS 294 (354) Q Consensus 226 ~~-~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~~g~g----------~~ 294 (354) +. ....++|+|..|..|...+. .+..+|..... ..+|....+-..+.+++..+-..+.+ .. T Consensus 197 VP~~~R~~vv~P~~~~~Ll~~~~-----~~~~~~~~~~~---~~~G~Vg~i~G~~V~~Sn~lp~~~~t~~~~~~~~~~~a 268 (347) T protein:vir:94 197 VPAGDRYFYTTPDNYSAILAALM-----PNAANYAALID---PETGNIRNVMGFVVVEVPHLVQGGAGETRGDDGITIAS 268 (347) T ss_pred CCCCCcEEEeCHHHHHHHhccch-----hhhhhcccccc---ccccceEEEeceEEEecCcccccccccccccCcceecC Confidence 21 23589999999998875321 11122222211 12333334444444444433211110 11 Q ss_pred cccEEEE------EEcCcc-eEEEe---------eCchhhhcc-ccccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 295 NKPRYMV------YDKSDR-NLAMA---------NPIPFRMLA-PQMASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 295 g~d~~v~------y~~~~~-~~~~~---------vp~~~~~~~-~~~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) |.+..+. |..+-+ .+-+. --++++.-. -..+...+.+.+.... 
|.-+.||++++.+..+ T Consensus 269 G~~~~~~~~~~~~~~~~~~~~~~l~~h~~A~~~v~~~~~~~e~~r~~~~~~d~i~~~~~~-G~~~~rP~~a~~~~~~ 344 (347) T protein:vir:94 269 GQKHAFPATASSDVKVTMDNVVGLFSHRSAVGTVKLRDLALERDRDVDAQGDLIVGKYAM-GHGGLRPEAAGALVFS 344 (347) T ss_pred cccccccccchhhhcccccceeEEEeehhhhhhhhcccccccchhchhhHHHHhhhhhhh-cCcccccceeEEEEec Confidence 1111110 110000 01111 011111110 0112234455555554 6999999999877666 No 148 >protein:vir:94622 Length: 341 # NCBI annotation: PfWMP4_37 # Family: family:all:2203 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762667;genbank:gi:115304375;genbank:GeneID:5142322 Probab=88.67 E-value=0.031 Score=28.86 Aligned_cols=292 Identities=11% Similarity=-0.002 Sum_probs=125.0 Q ss_pred hhh-hhccCCceeccchhhHHHHHHHHHHHHHHHHHhhhhcccchhhccccC-CCCCceeEEEEEeeccccceeEecCCC Q lcl|NC_021342. 35 TAL-DAIGNPNIMLDADGGIAFYISQLAGIEATVYETPYGDITYRFDVPMAA-NIPEYADTWMYRSYDGVTMGKFIGANG 112 (354) Q Consensus 35 ~am-~a~~~~~~~~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~-~~~~~~~~~~~~~~~~~G~a~~~~~~~ 112 (354) .|| +-..++++++- ..-.|.. +-....+.+...+.+..+.++.-.. ++.. ..++.+.... ...++.+.. + T Consensus 1 ~~~~~~~~~~~~~t~--~v~~fip---ei~s~~i~~~l~~~~v~~~~~~d~~~~~~~-Gdtv~ip~~g-~~~~~d~~~-~ 72 (341) T protein:vir:94 1 MALGNTITGPSINTQ--RGQQFIP---EQWLSEVQMFRKAKMLDTSVVKTWGAQVKK-GDTFHVPRIS-ELGVEDKAT-D 72 (341) T ss_pred Ccchhhhccccccch--hHHHHHH---HHHHHHHHHHHHhhcchhhccccccccccC-CceEEEeccC-cceeeeecC-C Confidence 000 00111221111 1113432 3345566666777777777664221 1111 3456665442 344554432 2 Q ss_pred cccceeeeccceeEEEE-EEEEeeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehhhCceeeeecCCc Q lcl|NC_021342. 113 QDLPRVAQSAQMHTVPL-GYAGNECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDASRGMYGLFNNPNV 191 (354) Q Consensus 113 ~dip~v~~~~~~~~~pv-~~~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~gi~GLlN~p~~ 191 (354) ..++.-+.........+ .....++.++ +++..+ ...++-.+-...+.+++++..|+.++--.+....... 
++ T Consensus 73 ~~i~~~~~~~~~~~itiD~~~~~~~~i~--d~d~~~-~~~d~~~~~~~~~~~aLA~~~D~~i~~~~a~~~~~~~---~~- 145 (341) T protein:vir:94 73 VPVGVQPVNDTDFVITVDTDRTTAVALD--DLLEIQ-ASYDLRAPYLEAMGYALAKDMTGSILGLRAAVQNTAS---QN- 145 (341) T ss_pred CccccccccCceEEEEEeeeeecceeec--hHHHHh-hccchHHHHHHHHHHHHHHHHHHHHHHHhhhcccccc---Cc- Confidence 34555555555555666 2234555554 555544 3557777888888999999988877632222111110 11 Q ss_pred cccccccccc-ccCHH-HHHHHHHHHHHHHHHHhCCcc-cccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCccccc Q lcl|NC_021342. 192 TLSSATKDYK-TMNGQ-ELFNMLNAPIFSVINLSRRFH-VPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLL 268 (354) Q Consensus 192 ~~~~~~~~W~-~~T~~-ei~~di~~~~~~l~~~s~g~~-~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~ 268 (354) +.. ..+.. +.++. -.++.|.++...|.+. +.- ....++|+|..|..|.+- ...+-.++.-.+. .. T Consensus 146 ~~~--~~~~~~t~~~~~~~~~~i~~a~~~Lde~--~VP~~gR~lvv~P~~~~~Ll~~-----~~~~~~~~~g~~~---l~ 213 (341) T protein:vir:94 146 VFS--SSNGAITGNGQAFSFAVFLAARRLLLEA--DVPEEKIVLLISPGQESALFTI-----PQFISKDFINNAP---IA 213 (341) T ss_pred ccc--CccccccCchhhhhHHHHHHHHHHHhhc--CCCccCCEEEeCHHHHHHHhhc-----hhhhhhhccccch---hh Confidence 000 11111 11122 2356677777777553 321 235799999999999642 1111112221111 11 Q ss_pred ccccceeeeeeeeeecccccccc-----------------------------ccCcccEEEEEEcCc-ceEEEeeCchhh Q lcl|NC_021342. 269 TGNELDIQIRFQLDAAELAANGV-----------------------------SNSNKPRYMVYDKSD-RNLAMANPIPFR 318 (354) Q Consensus 269 ~g~~l~I~~~~~L~~~~~~~~g~-----------------------------g~~g~d~~v~y~~~~-~~~~~~vp~~~~ 318 (354) +|....+....++++..+-.... +..+.-+.+++.++. -.+++.-|.-+. T Consensus 214 ~G~ig~i~G~~V~~Sn~lp~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~gl~~~~~av~~~k~~~~~~~~ 293 (341) T protein:vir:94 214 QGQIGSLMGVRVIRTSLIGNNSATGWRNGAPTIAPAEATPGFTGSRYLPKQDSFTSLPATFTGNSRPVHTAVMCHMDWAA 293 (341) T ss_pred eeeeeeEeceEEEEeccccccccccccccccceecccccccccccccccccccccccEEEEEEecccccceeeecchhhh Confidence 23333333333333332211100 001111122222211 112222222222 Q ss_pred hccccc---------cCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 
319 MLAPQM---------ASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 319 ~~~~~~---------~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) ...+|. +.....+.....+ |+-+.||.+++.+=-+ T Consensus 294 ~~~~~~~~~~~~~~~~~~~~~i~~~~~~-G~~~lrp~~~v~~~~~ 337 (341) T protein:vir:94 294 AVVSKAPRVTQSFENREQVWLMVGRQAY-GARLYRPLHAVNIHTT 337 (341) T ss_pred ccccccccccccchhhhhhhhhhhhhhh-cccccCcceeEEEecC Confidence 222221 1112222233333 6888888888766555 No 149 >protein:vir:96666 Length: 462 # NCBI annotation: ORF016 # Family: family:all:2450 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238545;genbank:gi:66391271;genbank:GeneID:5130448 Probab=87.79 E-value=0.036 Score=28.47 Aligned_cols=313 Identities=14% Similarity=0.122 Sum_probs=137.4 Q ss_pred CcccchhHHHHhhhhhhhcccccccccchhhhhhhhhhhccCCceec-cchh-hHHHHHHHHHHHHHHHHHhhh--hccc Q lcl|NC_021342. 1 MAIKTIDAQTIQGNQWLVHKGYVSRNGDQWVINNTALDAIGNPNIML-DADG-GIAFYISQLAGIEATVYETPY--GDIT 76 (354) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~am~a~~~~~~~~-da~~-~~~fl~~~L~~Id~~v~e~~~--~~l~ 76 (354) |-+.+ .+-. -....++++. ...|.+...+.=++ |.+- ++++.. +.+|+.+...-+ .+++ T Consensus 1 ~~~~~----------~~~~--~~~~~~~~~~--e~~~KS~~tg~g~~p~~q~~~gAlR~---esL~~~i~~Lt~~~~~~~ 63 (462) T protein:vir:96 1 MHKDT----------NLTA--EQNKYADKFQ--EEVMKSYQTGYGITPDTQVDAGALRR---EILDDQITMLTWTQDDLI 63 (462) T ss_pred Ccccc----------ccch--hhhhhhchhh--HHHHHHHhcCCCcCCccccccchhhh---hhhhhhhheeeecccchh Confidence 22111 0000 0111222221 22344433333222 2222 345555 445555544222 3344 Q ss_pred chhhccccCCCCCceeEEEEE-eeccccceeEecCCCcccceeeeccceeEEEEEEEEeeEeecHHHHHHHHHhC-CCcc Q lcl|NC_021342. 77 YRFDVPMAANIPEYADTWMYR-SYDGVTMGKFIGANGQDLPRVAQSAQMHTVPLGYAGNECHYTLDEMRKSAAMN-MPID 154 (354) Q Consensus 77 ~r~~v~v~~~~~~~~~~~~~~-~~~~~G~a~~~~~~~~dip~v~~~~~~~~~pv~~~~~~~~~~~~El~~a~~~g-~~ld 154 (354) .-+-++- .+.......|... ....+|.+.+++.. ...+..+.++.|++..+..++..-..++..-. ..+ .+.. 
T Consensus 64 ~~~~i~k-~~a~sTv~~y~~~~~~G~~g~~~f~~E~-g~~~~~d~~~~R~~~~~k~l~~t~~vsi~~tl---~n~~~d~~ 138 (462) T protein:vir:96 64 FYREISR-RPAQSTVQKYDVYLRHGNVGHSRFVREV-GVAPVSDPNIRQKTVEMKYVSDTKNLSIASTL---VNNIQDPM 138 (462) T ss_pred hhhhcCC-chhhhhhhhheeeeccCccccccccccc-cccccCCCceEEEEEEEEEEeeeeeechhhhh---ccchhhHH Confidence 4333432 2333333333322 33445666665554 33577888899999999999988888765433 122 2444 Q ss_pred hHHHHHHHHHHHHHhhheeeeeehhhCceee---eecCCcccccccccccccCHHHH-HHHHHHHHHHHHHHhCCccccc Q lcl|NC_021342. 155 AEQARLAFRGAEEHSQSVAYFGDASRGMYGL---FNNPNVTLSSATKDYKTMNGQEL-FNMLNAPIFSVINLSRRFHVPN 230 (354) Q Consensus 155 ~~k~~aA~~~~a~~~n~~~f~G~~~~gi~GL---lN~p~~~~~~~~~~W~~~T~~ei-~~di~~~~~~l~~~s~g~~~p~ 230 (354) ....+.|...+++......||||+.+.=.+- |+.-|+...-.+.+--++-++.. .+.|+.+-.. .+.++-.|+ T Consensus 139 ~~~~~dai~~~a~tiE~a~Fygds~l~~~~~~~gleFDGl~~lI~~~NViDarG~~Ls~~~ln~aa~~---i~~~fGt~T 215 (462) T protein:vir:96 139 QILTEDAIAVVAKTIEWASFYGDASLTADPTGQGLEFDGLAKLIDKDNVIDAKGESLTETLLNRSAVL---IGKSFGTAT 215 (462) T ss_pred HHHHHHHHHHHHHHHHHHHhhhhcccCCCccccccchhhhhhhcCCCceeecCCCCccHHHHhhhhhh---cccccCChh Confidence 7777788889999999999999887543111 33333211111111001111111 2344444322 245778899 Q ss_pred EEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCccccccccccee-eeeeeeeeccccccccccCcccEEEEEEcCcceE Q lcl|NC_021342. 231 TALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDI-QIRFQLDAAELAANGVSNSNKPRYMVYDKSDRNL 309 (354) Q Consensus 231 ~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I-~~~~~L~~~~~~~~g~g~~g~d~~v~y~~~~~~~ 309 (354) -+.||......|.+..++... .+...|......|.++.- ...... 
+...+---.+.+- +.+.+...+ T Consensus 216 D~~~p~~v~a~f~~~~l~~qr------v~~~~n~g~~~~G~~v~~f~s~~G~----I~L~~s~~m~~~~--i~~~~~~~~ 283 (462) T protein:vir:96 216 DAYMPIGVHADFVNSVLGRQM------QLMQDNSGNVNAGYNVQGFYSSRGF----IKLHGSTVMENEL--ILDESLQPL 283 (462) T ss_pred heecchHHHHHHHHhhcCceE------EEEcCCCCceeeeeeccceeeeeee----eeeCCceecCccc--ccccccccC Confidence 999999999988754332211 111122211122222111 011000 0000000000000 000000000 Q ss_pred EEeeCchhhhccc-------------cccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 310 AMANPIPFRMLAP-------------QMASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 310 ~~~vp~~~~~~~~-------------~~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) -.+|.|-+.... ......|++...+.-|-- .|..++..+++ T Consensus 284 -p~ap~~~~vsaTv~t~~~g~f~~~~d~~~y~Y~V~avs~dgeS---~PS~~VtaTva 337 (462) T protein:vir:96 284 -PNAPQPATVKATVETGKKGLFTDEHDRAELTYKVVVNSDDAQS---APSEAVTATVN 337 (462) T ss_pred -CCCCCCCceeEEEEeCCCCCCCCccCceeEEEEEEEECCCCcc---ccceeeEeeee Confidence 012333222111 122345666666654322 46777777777 No 150 >protein:vir:102605 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655002;genbank:gi:109392192;genbank:GeneID:4157227 Probab=84.94 E-value=0.057 Score=27.42 Aligned_cols=264 Identities=9% Similarity=0.076 Sum_probs=120.6 Q ss_pred eeccchhhHHHHHHHHHHHHHHHHHhhhhcccchhhcccc--CCCCCceeEEEEEeeccccceeEecCCCcccceeeecc Q lcl|NC_021342. 45 IMLDADGGIAFYISQLAGIEATVYETPYGDITYRFDVPMA--ANIPEYADTWMYRSYDGVTMGKFIGANGQDLPRVAQSA 122 (354) Q Consensus 45 ~~~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~--~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~v~~~~ 122 (354) |-. -.|+. +.+...+.+...+.+....++... ..+..| .++.+......+.+... ..+..++.-+.+. 
T Consensus 1 MA~-----~~~~p---e~~~~~v~~~~~~~lv~~~l~~~~~~~~~~~G-dtv~ip~~~~~~~~d~~-~~~~~~~~~~~~~ 70 (273) T protein:vir:10 1 MAF-----NNFIP---ELWSDMLLEEWTAQTVFANLVNREYEGTASKG-NVVHIAGVVAPTVKDYK-AAGRQTSADAISD 70 (273) T ss_pred Ccc-----hhhhH---HHHHHHHHHHHHhhhccchhhccccccccccC-ceEEEeecccccccccc-cCCCccCcccccc Confidence 101 01333 333445555555666666665432 123333 46666665444433211 1112122233333 Q ss_pred ceeEEEEEE-EEeeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehhhCceeeeecCCccccccccccc Q lcl|NC_021342. 123 QMHTVPLGY-AGNECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDASRGMYGLFNNPNVTLSSATKDYK 201 (354) Q Consensus 123 ~~~~~pv~~-~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~gi~GLlN~p~~~~~~~~~~W~ 201 (354) +.....+-. ...++.++ +++..+.. .++. +-.+.+..+++...|+.++- ++..-+.+. ..+ + T Consensus 71 ~~~~~tid~~~~~~~~i~--d~d~~~~~-~~~~-~~~~~~~~alA~~vD~~i~~---------~~~~a~~~~--~~~--~ 133 (273) T protein:vir:10 71 TGVDLLIDQEKSIDFLVD--DIDRVQVA-GSLE-AYTRAGATALATDTDKFIAD---------MLVDNGTAL--TGS--A 133 (273) T ss_pred ceEEEEEeeeeecceEee--cHHHhhhh-ccHH-HHHHHHHHHHHHHHHHHHHH---------HHhcccccc--ccc--c Confidence 444444432 35555554 55555443 3563 35566777888888776551 110000000 011 1 Q ss_pred ccCHHHHHHHHHHHHHHHHHHhCCc-ccccEEEeCHHHHHHHhhc--ccCCCCCchHHHHHHhhCcccccccccceeeee Q lcl|NC_021342. 202 TMNGQELFNMLNAPIFSVINLSRRF-HVPNTALMFPDLWNQANNQ--LMTGYTDRTVMQHFMEANSYTLLTGNELDIQIR 278 (354) Q Consensus 202 ~~T~~ei~~di~~~~~~l~~~s~g~-~~p~~L~l~p~~~~~L~~~--~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~ 278 (354) .-++..+++.|.++..+|.+. +. .....|+++|..|..|.+. .+.. .+..-..+. -.+|..-.+... 
T Consensus 134 ~~~~~~~~~~i~~a~~~ld~~--~vP~~~R~lvv~p~~~~~L~~~~~~~~~------~~~~~~~~~--l~~G~ig~i~G~ 203 (273) T protein:vir:10 134 PTDADDAFDLIAKALKELTKA--NVPNVGRVVVVNAEMAFWLRSSGSKLTS------ADTSGDAAG--LRAGTIGNLLGA 203 (273) T ss_pred ccchhHHHHHHHHHHHHhhhc--CCCcCCCEEEECHHHHHHHhcchhhhhh------hhccccccc--eeeeeeeEEece Confidence 234677899999998888654 32 1235799999999988642 1110 000000010 112333334444 Q ss_pred eeeeeccccccccccCcccEEEEEEcCcceEEEeeC-chhhhccccccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 279 FQLDAAELAANGVSNSNKPRYMVYDKSDRNLAMANP-IPFRMLAPQMASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 279 ~~L~~~~~~~~g~g~~g~d~~v~y~~~~~~~~~~vp-~~~~~~~~~~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) .+.++..+-. ++...++++.++. +.+..- ..+..+..+ +.....+...... |+-+.||.+++.+=-+ T Consensus 204 ~v~~s~~lp~-----~~~~~~~~~~~~A--~~~a~q~~~~e~~r~~-~~~~~~v~~~~~y-g~~v~~~~~~~~l~~~ 271 (273) T protein:vir:10 204 RIVESNNLRD-----TDDEQFVAFHPSA--AAYVSQIDTVEALRDQ-DSFSDRIRALHVY-GGKVVRPTGVVVFNKT 271 (273) T ss_pred EEEEeccccc-----CCccEEEEEeccc--eeeeeeeehhhcccCC-Ccceeeeeeeeee-eeeEeccceEEEEecc Confidence 4444433211 1112245554332 111110 011112222 2224444444443 6888899999987766 No 151 >protein:vir:105822 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655767;genbank:gi:109522090;genbank:GeneID:4157630 Probab=84.94 E-value=0.057 Score=27.42 Aligned_cols=264 Identities=9% Similarity=0.076 Sum_probs=120.6 Q ss_pred eeccchhhHHHHHHHHHHHHHHHHHhhhhcccchhhcccc--CCCCCceeEEEEEeeccccceeEecCCCcccceeeecc Q lcl|NC_021342. 45 IMLDADGGIAFYISQLAGIEATVYETPYGDITYRFDVPMA--ANIPEYADTWMYRSYDGVTMGKFIGANGQDLPRVAQSA 122 (354) Q Consensus 45 ~~~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~--~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~v~~~~ 122 (354) |-. -.|+. +.+...+.+...+.+....++... ..+..| .++.+......+.+... ..+..++.-+.+. 
T Consensus 1 MA~-----~~~~p---e~~~~~v~~~~~~~lv~~~l~~~~~~~~~~~G-dtv~ip~~~~~~~~d~~-~~~~~~~~~~~~~ 70 (273) T protein:vir:10 1 MAF-----NNFIP---ELWSDMLLEEWTAQTVFANLVNREYEGTASKG-NVVHIAGVVAPTVKDYK-AAGRQTSADAISD 70 (273) T ss_pred Ccc-----hhhhH---HHHHHHHHHHHHhhhccchhhccccccccccC-ceEEEeecccccccccc-cCCCccCcccccc Confidence 101 01333 333445555555666666665432 123333 46666665444433211 1112122233333 Q ss_pred ceeEEEEEE-EEeeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehhhCceeeeecCCccccccccccc Q lcl|NC_021342. 123 QMHTVPLGY-AGNECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDASRGMYGLFNNPNVTLSSATKDYK 201 (354) Q Consensus 123 ~~~~~pv~~-~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~gi~GLlN~p~~~~~~~~~~W~ 201 (354) +.....+-. ...++.++ +++..+.. .++. +-.+.+..+++...|+.++- ++..-+.+. ..+ + T Consensus 71 ~~~~~tid~~~~~~~~i~--d~d~~~~~-~~~~-~~~~~~~~alA~~vD~~i~~---------~~~~a~~~~--~~~--~ 133 (273) T protein:vir:10 71 TGVDLLIDQEKSIDFLVD--DIDRVQVA-GSLE-AYTRAGATALATDTDKFIAD---------MLVDNGTAL--TGS--A 133 (273) T ss_pred ceEEEEEeeeeecceEee--cHHHhhhh-ccHH-HHHHHHHHHHHHHHHHHHHH---------HHhcccccc--ccc--c Confidence 444444432 35555554 55555443 3563 35566777888888776551 110000000 011 1 Q ss_pred ccCHHHHHHHHHHHHHHHHHHhCCc-ccccEEEeCHHHHHHHhhc--ccCCCCCchHHHHHHhhCcccccccccceeeee Q lcl|NC_021342. 202 TMNGQELFNMLNAPIFSVINLSRRF-HVPNTALMFPDLWNQANNQ--LMTGYTDRTVMQHFMEANSYTLLTGNELDIQIR 278 (354) Q Consensus 202 ~~T~~ei~~di~~~~~~l~~~s~g~-~~p~~L~l~p~~~~~L~~~--~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~ 278 (354) .-++..+++.|.++..+|.+. +. .....|+++|..|..|.+. .+.. .+..-..+. -.+|..-.+... 
T Consensus 134 ~~~~~~~~~~i~~a~~~ld~~--~vP~~~R~lvv~p~~~~~L~~~~~~~~~------~~~~~~~~~--l~~G~ig~i~G~ 203 (273) T protein:vir:10 134 PTDADDAFDLIAKALKELTKA--NVPNVGRVVVVNAEMAFWLRSSGSKLTS------ADTSGDAAG--LRAGTIGNLLGA 203 (273) T ss_pred ccchhHHHHHHHHHHHHhhhc--CCCcCCCEEEECHHHHHHHhcchhhhhh------hhccccccc--eeeeeeeEEece Confidence 234677899999998888654 32 1235799999999988642 1110 000000010 112333334444 Q ss_pred eeeeeccccccccccCcccEEEEEEcCcceEEEeeC-chhhhccccccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 279 FQLDAAELAANGVSNSNKPRYMVYDKSDRNLAMANP-IPFRMLAPQMASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 279 ~~L~~~~~~~~g~g~~g~d~~v~y~~~~~~~~~~vp-~~~~~~~~~~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) .+.++..+-. ++...++++.++. +.+..- ..+..+..+ +.....+...... |+-+.||.+++.+=-+ T Consensus 204 ~v~~s~~lp~-----~~~~~~~~~~~~A--~~~a~q~~~~e~~r~~-~~~~~~v~~~~~y-g~~v~~~~~~~~l~~~ 271 (273) T protein:vir:10 204 RIVESNNLRD-----TDDEQFVAFHPSA--AAYVSQIDTVEALRDQ-DSFSDRIRALHVY-GGKVVRPTGVVVFNKT 271 (273) T ss_pred EEEEeccccc-----CCccEEEEEeccc--eeeeeeeehhhcccCC-Ccceeeeeeeeee-eeeEeccceEEEEecc Confidence 4444433211 1112245554332 111110 011112222 2224444444443 6888899999987766 No 152 >protein:vir:7990 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817344;genbank:gi:29565772;genbank:GeneID:1258978 Probab=83.51 E-value=0.068 Score=26.99 Aligned_cols=267 Identities=9% Similarity=0.050 Sum_probs=122.4 Q ss_pred ccCCceeccchhhHHHHHHHHHHHHHHHHHhhhhcccchhhccccCCC-CCceeEEEEEeeccccceeEecCCCccccee Q lcl|NC_021342. 40 IGNPNIMLDADGGIAFYISQLAGIEATVYETPYGDITYRFDVPMAANI-PEYADTWMYRSYDGVTMGKFIGANGQDLPRV 118 (354) Q Consensus 40 ~~~~~~~~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~-~~~~~~~~~~~~~~~G~a~~~~~~~~dip~v 118 (354) |.+. .|+. +.+...+.+.....+....++....+. +.-..++.+......+.+..... 
+..++.- T Consensus 1 MA~~----------~~~p---ei~~~~v~~~~~~~lv~~~l~~~~~~~~~~~GdTv~ip~~~~~~~~d~~~~-~~~~~~~ 66 (273) T protein:vir:79 1 MAFN----------NFIP---ELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAA-GRQTSAD 66 (273) T ss_pred Ccch----------hhhH---HHHHHHHHHHHHhhccchhhhhccccccccCCcEEEEeecCcccccccccC-CCccCcc Confidence 1110 1333 344555666666666666665332211 11123677766554443332221 1223334 Q ss_pred eeccceeEEEEEE-EEeeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehhhCceeeeecCCccccccc Q lcl|NC_021342. 119 AQSAQMHTVPLGY-AGNECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDASRGMYGLFNNPNVTLSSAT 197 (354) Q Consensus 119 ~~~~~~~~~pv~~-~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~gi~GLlN~p~~~~~~~~ 197 (354) +.+.+.....+-. ...++.++ +++..+ ...++. +-...+..++++..|+.++ +++..-+.. ... T Consensus 67 ~~~~~~~~~tid~~~~~~~~i~--d~d~~~-~~~~~~-~~~~~~~~ala~~vD~~i~---------~~~~~a~~~--~~~ 131 (273) T protein:vir:79 67 AISDTGVDLLIDQEKSIDFLVD--DIDRVQ-VAGSLE-AYTRAGATALATDTDKFIA---------DMLVDNGTA--LTG 131 (273) T ss_pred ccccceEEEEEeeecccceeec--cHHHHh-hcccHH-HHHHHHHHHHHHHHHHHHH---------HHHhhcccc--ccc Confidence 4445555666644 35566665 444443 344664 4556677788888877543 111000000 001 Q ss_pred ccccccCHHHHHHHHHHHHHHHHHHhCCc-ccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceee Q lcl|NC_021342. 198 KDYKTMNGQELFNMLNAPIFSVINLSRRF-HVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQ 276 (354) Q Consensus 198 ~~W~~~T~~ei~~di~~~~~~l~~~s~g~-~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~ 276 (354) + +.-++..+++.|.++..+|.+. +. .....|+++|..|..|.+.. . ..+-.++.-.++. -.+|..-.+. 
T Consensus 132 ~--~~~~~~~~~~~i~~a~~~ld~~--~vP~~~R~lvv~p~~~~~Ll~~~---~-~~~~~~~~~~~~~--l~~G~ig~~~ 201 (273) T protein:vir:79 132 S--APSDADDAFDLIASALKELTKA--NVPNVGRVVVVNAEMAFWLRSSG---S-KLTSADTSGDAAG--LRAGTIGNLL 201 (273) T ss_pred c--cccchhhHHHHHHHHHHHhhhc--cCCccCcEEEECHHHHHHHhhch---h-hhhhhhhcccccc--eeeeEeeEEe Confidence 1 1124667788888888887654 32 12358999999999886421 0 0000011101111 1123333333 Q ss_pred eeeeeeeccccccccccCcccEEEEEEcCcceEEEeeC-chhhhccccccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 277 IRFQLDAAELAANGVSNSNKPRYMVYDKSDRNLAMANP-IPFRMLAPQMASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 277 ~~~~L~~~~~~~~g~g~~g~d~~v~y~~~~~~~~~~vp-~~~~~~~~~~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) ...+.++..+-.. .+ ...+++.++. +.+..- ..+..+.. ++.....+...... |+.+.+|.+++.+--+ T Consensus 202 G~~i~~s~~lp~~----~~-~~~~a~~~~A--~~~a~~~~~~e~~r~-~~~~~~~v~~~~~y-g~~v~~p~~vv~~~~~ 271 (273) T protein:vir:79 202 GARIVESNNLRDT----DD-EQFVAFHPSA--AAYVSQIDTVEALRD-QDSFSDRIRALHVY-GGKVVRPTGVVVFNKT 271 (273) T ss_pred ceEEEeccccccc----Cc-eEEEEEeccc--eeeeeehhhhhcccC-cccceeeeeeeeee-eeEEecCceEEEEecc Confidence 4444443332111 11 1234444332 111110 01111111 22234444444444 6888899999998877 No 153 >protein:vir:100057 Length: 375 # NCBI annotation: T7-like capsid protein # Family: family:all:975 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214206;genbank:gi:61806429;genbank:GeneID:3294737 Probab=82.37 E-value=0.077 Score=26.67 Aligned_cols=301 Identities=10% Similarity=0.048 Sum_probs=125.3 Q ss_pred cccccchhhhhhhhhhhcc-CCceec-----cchhhHHHHHHHH-HHHHHHHHHhhhhcccchhhccccCCCCCceeEEE Q lcl|NC_021342. 23 VSRNGDQWVINNTALDAIG-NPNIML-----DADGGIAFYISQL-AGIEATVYETPYGDITYRFDVPMAANIPEYADTWM 95 (354) Q Consensus 23 ~~~~~~~~~~~~~am~a~~-~~~~~~-----da~~~~~fl~~~L-~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~ 95 (354) +.+ ++++.. .++..+ .+++..+.+.+++ -+++...- ..-..+.++.+.+-. +..++. 
T Consensus 1 ~~~----------~~~~~~~~~n~~t~~~~~~~~~~~al~le~f~geV~~~f~----~~si~~~~~~~rti~--~Gksv~ 64 (375) T protein:vir:10 1 MAN----------ANQVALGRSNLSTGTGYGGATDKYALYLKLFSGEMFKGFQ----HETIARDLVTKRTLK--NGKSLQ 64 (375) T ss_pred Ccc----------ccccccCccccCCccccccccchHHHHHHHHhHHHHHHHH----HHHhhhccccccccc--cCceEE Confidence 110 111111 011111 1223333444333 34444433 223344444433211 233444 Q ss_pred EEeeccccceeE--ecCCC--cccceeeeccceeEEEEEEEEeeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhh Q lcl|NC_021342. 96 YRSYDGVTMGKF--IGANG--QDLPRVAQSAQMHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQS 171 (354) Q Consensus 96 ~~~~~~~G~a~~--~~~~~--~dip~v~~~~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~ 171 (354) +. ..|..++ +..+. ++-|..+....+..+.+-.. .-+..-+.+++.++ ...++-.+-.+.+..++++..|+ T Consensus 65 f~---~iG~~t~~~~t~G~~i~~~~~~d~~~te~~l~ID~~-~y~~~~VdDiD~aq-a~~Dlr~e~s~~~G~aLA~~~D~ 139 (375) T protein:vir:10 65 FI---YTGRMTSSFHTPGTPILGNADKAPPVAEKTIVMDDL-LISSAFVYDLDETL-AHYELRGEISKKIGYALAEKYDR 139 (375) T ss_pred EE---eeeeeEEeeecCCcCcCCccccCCCCCceEEEecch-hhhhhhHhhHHHHh-cCchhHHHHHHHHHHHHHHHHHH Confidence 33 3344443 22211 22233333333333333221 12344557888875 56677788888899999999999 Q ss_pred eeee----e-ehhhCce--eeeecCCcccc---cccccccccCHHHHHHHHHHHHHHHHHHhCCcccccEEEeCHHHHHH Q lcl|NC_021342. 172 VAYF----G-DASRGMY--GLFNNPNVTLS---SATKDYKTMNGQELFNMLNAPIFSVINLSRRFHVPNTALMFPDLWNQ 241 (354) Q Consensus 172 ~~f~----G-~~~~gi~--GLlN~p~~~~~---~~~~~W~~~T~~ei~~di~~~~~~l~~~s~g~~~p~~L~l~p~~~~~ 241 (354) .++- | .....+. ..+. |+.... +....=...|++.+++.|.++..+|.++.--.. ...++|+|..|.. 
T Consensus 140 ~i~~~l~kaa~~~~p~~~~~~~~-~Gg~~i~~~sg~~~~~~~ta~~~~~ai~~a~~~Lde~~VP~~-~R~~vv~P~~y~~ 217 (375) T protein:vir:10 140 LIFRSITRGARSASPVSATNFVE-PGGTQIRVGSGTNESDAFTASALVNAFYDAAAAMDEKGVSSQ-GRCAVLNPRQYYA 217 (375) T ss_pred HHHHHHHHhhhhccccccccccc-cCcceeeeccccccccccCHHHHHHHHHHHHHHHhhcCCCCC-CCEEEeChHHHHH Confidence 8862 1 1111110 0000 111111 111111235799999999999999987522112 2468899999988 Q ss_pred HhhcccCC-CCCchHHHHHHhhCcccccccccceeeeeeeeeecccccc------------------------------- Q lcl|NC_021342. 242 ANNQLMTG-YTDRTVMQHFMEANSYTLLTGNELDIQIRFQLDAAELAAN------------------------------- 289 (354) Q Consensus 242 L~~~~~~~-~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~~------------------------------- 289 (354) |..-.-.+ ..+. +|. .++.. ..|....|.-++.+++..+-.. T Consensus 218 Ll~~~d~~~~~n~---d~~-~~~~~--~~g~v~~i~Gv~V~~Sn~lP~~~~~~~~~g~~~~~~a~~~~~~~~~~~~~~~~ 291 (375) T protein:vir:10 218 LIQDIGSNGLVNR---DVQ-GSALQ--SGNGVIEIAGIHIYKSMNIPFLGKYGVKYGGTTGETSPGNLGSHIGPTPENAN 291 (375) T ss_pred HHhcCCccceeee---ccc-cccee--ccceEEEEeceEEEEeccccccccccccccccccccchhhhhccccccCCcce Confidence 86311000 0000 110 00000 0111111111111111111000 Q ss_pred --------ccccC---cccEEEEEEcCcc-eEEEeeCchhhhcc--ccccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 290 --------GVSNS---NKPRYMVYDKSDR-NLAMANPIPFRMLA--PQMASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 290 --------g~g~~---g~d~~v~y~~~~~-~~~~~vp~~~~~~~--~~~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) -.++. ++...+++.++.= ++++ +..-.++.. -+++-..+.+-..... 
|..+.||++++-+..+ T Consensus 292 ~~~g~~~~y~~d~~~~~~~~~~~~~~~A~g~v~~-~~~~~~~~~~~~~~~~q~~~i~~~~a~-G~~~lrp~~av~l~~~ 368 (375) T protein:vir:10 292 ATGGVNNDYGTNAELGAKSCGLIFQKEAAGVVEA-IGPQVQVTNGDVSVIYQGDVILGRMAM-GADYLNPAAAVELYIG 368 (375) T ss_pred eeccccccccccccccCceEEEEEchhheeeeee-eccccccccchhhheeeeeeeeeeeee-ccCccCceeEEEEecC Confidence 00011 2333444432211 1111 111112211 1233334444444444 6889999999888776 No 154 >protein:vir:97397 Length: 517 # NCBI annotation: major capsid protein # Family: family:all:11745 # MgeID: mge:1675 # MgeName: Q54 # Cross-refs: genbank:acc:YP_762590;genbank:gi:115304291;genbank:GeneID:5130600 Probab=81.77 E-value=0.083 Score=26.52 Aligned_cols=311 Identities=11% Similarity=-0.025 Sum_probs=107.3 Q ss_pred CcccchhHHHHhhhhhhh-----cccccccccchhhhhh--hhhhhc--cCC----ceec--cchhhHHHHHHHHHHHHH Q lcl|NC_021342. 1 MAIKTIDAQTIQGNQWLV-----HKGYVSRNGDQWVINN--TALDAI--GNP----NIML--DADGGIAFYISQLAGIEA 65 (354) Q Consensus 1 ~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~--~am~a~--~~~----~~~~--da~~~~~fl~~~L~~Id~ 65 (354) -.++.++++.-+ .+-.. .+......+.++.... ....+. ... .... +....+.++.. ..+-. T Consensus 181 e~~~~l~a~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p--~~~~~ 257 (517) T protein:vir:97 181 KTVSELAANLMK-QRESEKILGVEALKVTPEATEFLKTREAEVAYMSASLTKDPKAAWTAELKERGISGMPAP--AGILK 257 (517) T ss_pred hhhhhhhhhHHH-HHHhhhhcccccccccchhhHHHHHHHHHHHHHHhcccccccceeeeecccccccccccc--hHHHH Confidence 122222222111 11000 0111111111111100 000000 000 0000 00111111111 11222 Q ss_pred HHHHhhhhcccchhhccccCCCCCceeEEEEEeeccccceeEecCCCcccceeeeccceeEEEEEEEEeeEeecHHHHHH Q lcl|NC_021342. 66 TVYETPYGDITYRFDVPMAANIPEYADTWMYRSYDGVTMGKFIGANGQDLPRVAQSAQMHTVPLGYAGNECHYTLDEMRK 145 (354) Q Consensus 66 ~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~v~~~~~~~~~pv~~~~~~~~~~~~El~~ 145 (354) .+..........++++++.. ++ ...+ ..-...+.+.+... +...|..+...+..+.++..++.-+..|.+-|+. 
T Consensus 258 ~i~~~~~~~~~i~~~~~~~~-i~--~~~~--~~~~~~~~a~~~~e-G~~kp~s~~tf~~~~~~~~~ia~~~~~S~qll~D 331 (517) T protein:vir:97 258 RIQDAVNDEGSLLPFIRHEN-LP--TLVV--GGDNALTQGTGHTT-GTDKTESNITLQTRVLTPQYVYKYIKLPKIVMNS 331 (517) T ss_pred HHHHhhhhhccceeeeeecc-cc--ceee--ecccccceeeeeec-CCcccccccceeeEEeeHhhhhhhhhhhHHHHHH Confidence 22222222223344444321 11 1111 00111122233332 2335656666666677777776666666665555 Q ss_pred HHHhCCC-cchHHHHHHHHHHHHHhhheeeeeehh-hCceeeeecCCcccccccccccccCHHHHHHHHHHHHHHHHHHh Q lcl|NC_021342. 146 SAAMNMP-IDAEQARLAFRGAEEHSQSVAYFGDAS-RGMYGLFNNPNVTLSSATKDYKTMNGQELFNMLNAPIFSVINLS 223 (354) Q Consensus 146 a~~~g~~-ld~~k~~aA~~~~a~~~n~~~f~G~~~-~gi~GLlN~p~~~~~~~~~~W~~~T~~ei~~di~~~~~~l~~~s 223 (354) +..--.+ |..--....+..+++.|++-+++|+.. .+..|+++..+.... .+.. .+.+..+++..|..++. . T Consensus 332 s~~dd~~~l~s~i~~~l~~~l~~~ee~a~l~GdGtg~~~~gi~~~a~~~~~-~~~~-~~~~~~d~i~~l~~a~~----~- 404 (517) T protein:vir:97 332 NATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGVSETQIYPVVGDAWA-TNVT-GTTNIQELLEKLSVATP----K- 404 (517) T ss_pred hhhccHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCccccccccccccccc-cccc-ccchHHHHHHHHHHHhh----h- Confidence 4321111 445556678889999999999999863 334455543221100 0000 01112222222222221 1 Q ss_pred CCcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeee----eeeeeeccccccccccCcccEE Q lcl|NC_021342. 224 RRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQI----RFQLDAAELAANGVSNSNKPRY 299 (354) Q Consensus 224 ~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~----~~~L~~~~~~~~g~g~~g~d~~ 299 (354) ...-.++|+|..|..|.+.. +..|.=++.=... .+.+..+.. .+.+....... + ...+-. 
T Consensus 405 ---a~~a~~vmn~~t~~~I~klK--D~~G~Yl~~~~~~-------~~~~~~l~G~~~~~~~~~~~~~~~---~-~~~~y~ 468 (517) T protein:vir:97 405 ---AADSTLVIHRNDLAAIRFLK--DKNGNYVFPVGVS-------NQTIATHFGFNRLVQSVAVDEKTA---V-SLSGYV 468 (517) T ss_pred ---ccCCEEEECHHHHHHHHHhh--cCCCCeeccCcCC-------cccccccCCccccccccccCceeE---e-eccccE Confidence 11246999999999986533 4333222110000 111111110 11111000000 0 000111 Q ss_pred EEEEcCcceEEEeeCchhhhccccccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 300 MVYDKSDRNLAMANPIPFRMLAPQMASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 300 v~y~~~~~~~~~~vp~~~~~~~~~~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) ++-...-+.++ +|.+- . | ...+-...++|| -|+.|+++++.-.. T Consensus 469 i~~~~g~~~~~-----~fd~~---~-n-~~~f~~~~~~~g-~i~~~~r~a~~~~~ 512 (517) T protein:vir:97 469 TNGSRGMEFEQ-----GTILV---E-N-NKEYLFEMPISG-SLEYKGTTAYGTYT 512 (517) T ss_pred EEeecceeeee-----eeecc---c-C-ceeEeeeeeecc-ccccccceEEEEEc Confidence 11111111111 11110 0 1 111222345554 56666666665444 No 155 >protein:vir:6378 Length: 346 # NCBI annotation: capsid protein E # Family: family:all:1021 # MgeID: mge:133 # MgeName: BcepNazgul # Cross-refs: genbank:acc:NP_918991;genbank:gi:34610166;genbank:GeneID:2559600 Probab=81.53 E-value=0.085 Score=26.46 Aligned_cols=282 Identities=10% Similarity=0.042 Sum_probs=115.3 Q ss_pred ccchhhHHHHHHHHHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeeccc-cceeEecCCCcccceeeecccee Q lcl|NC_021342. 47 LDADGGIAFYISQLAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYDGV-TMGKFIGANGQDLPRVAQSAQMH 125 (354) Q Consensus 47 ~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~-G~a~~~~~~~~dip~v~~~~~~~ 125 (354) +| .|..++|+..=++ ....++-...+||-.. ...+.++.+...... ..+..++......+.-....+-. 
T Consensus 1 ~d-----~f~~~~l~~~i~~---~p~~~~l~~~~fp~~~--~~~t~~i~i~~~~g~~~la~~v~~~~~~~~~~~~g~~~~ 70 (346) T protein:vir:63 1 ME-----IFDTLTLAGVIQS---GPALSMYWQGFYPNEI--TFDTDEILFDLVFKDKKLAPFVAPNVQGRVIAARGYTTK 70 (346) T ss_pred CC-----ccCHHHHHHHHHh---cCCccchhhhcCcccc--ccccceEEEEEecCceeeeeeecCCCCcceecccceeee Confidence 33 4666666433222 2334455666776432 233445555544432 22334444433333322223333 Q ss_pred EEEEEEEEeeEeecHHHHHHHHHhC------CCcch-------HHHHHHHHHHHHHhhhe----eeeee---hhhCceee Q lcl|NC_021342. 126 TVPLGYAGNECHYTLDEMRKSAAMN------MPIDA-------EQARLAFRGAEEHSQSV----AYFGD---ASRGMYGL 185 (354) Q Consensus 126 ~~pv~~~~~~~~~~~~El~~a~~~g------~~ld~-------~k~~aA~~~~a~~~n~~----~f~G~---~~~gi~GL 185 (354) ......+.....++..|+...+... .+... ++....++.++..++.+ +..|. .+.++.-. T Consensus 71 ~~~~p~i~~~~~i~~~d~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~i~~~~E~m~~~al~~gki~~~g~~~~~~ 150 (346) T protein:vir:63 71 TFRPAYVKPKDVINPNRTLKRRAGEQPIIGGMSLQERFQAVVADSQLEQRQRIENRIEWMCAMATIYGYVDVVGEAFPMQ 150 (346) T ss_pred EeecCccCccceeCHHHHHHHhhhhhhccCCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCEEEeeCCceeEE Confidence 4455666777788888886543322 21111 22223333333333322 22331 11111111 Q ss_pred eecCCcc-----cccccccccccCHHHHHHHHHHHHHHHHHHhCCcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHH Q lcl|NC_021342. 186 FNNPNVT-----LSSATKDYKTMNGQELFNMLNAPIFSVINLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFM 260 (354) Q Consensus 186 lN~p~~~-----~~~~~~~W~~~T~~ei~~di~~~~~~l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~ 260 (354) .=+=|++ ..+.+..|++.++ .++.||.+.+..+...+ -..|.+++|+++.|..|.+ +..+.+.+. 
T Consensus 151 ~vdfg~~~~~~~~lt~~~~W~~~~a-dp~~di~~~~~~~~~~~--g~~~~~~i~~~~~~~~l~~-------~~~v~~~~~ 220 (346) T protein:vir:63 151 RVDFGRDPALTVQLTGGAAWDQATS-DPLGNIQTMRTTAWKKS--NSTITRLTMGLDAWSLFSQ-------KPAVVELLN 220 (346) T ss_pred EEeeCCCccceeeecccccCCCCCC-CHHHHHHHHHHHHHHcc--CCceEEEEECHHHHHHHhc-------CHHHHHHHh Confidence 1011222 2344567987655 57999999998887643 3468899999999998853 112333322 Q ss_pred hh-------------------------Ccccccccccceeeeeeeeeecccccccccc--CcccEEEEEEcCc-ceEEEe Q lcl|NC_021342. 261 EA-------------------------NSYTLLTGNELDIQIRFQLDAAELAANGVSN--SNKPRYMVYDKSD-RNLAMA 312 (354) Q Consensus 261 ~n-------------------------~~~~~~~g~~l~I~~~~~L~~~~~~~~g~g~--~g~d~~v~y~~~~-~~~~~~ 312 (354) .+ +.+... ..++|..... .+.+..|... --.|.++.+.... -.+... T Consensus 221 ~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~--~gi~i~~y~~---~y~d~~G~~~~~ip~~~v~~~p~~~~g~~~yg 295 (346) T protein:vir:63 221 LFYKGSTSDFNRSRLDDGSPVQYQGTIGGYNGM--GTLELYTYHD---TYTGDDNTEQEILGSYDVVGTGPGLQGTQCFG 295 (346) T ss_pred hhccccccccchhhcccchhhhhhhhHhhhhcc--CCeEEEEecc---EEEcCCCceeccccCCeEEEEecCCcceEEEe Confidence 10 000000 0111111110 0000111000 0012222222111 111111 Q ss_pred eCchhh-------hcc---ccccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 313 NPIPFR-------MLA---PQMASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 313 vp~~~~-------~~~---~~~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) .+.++. 
+.+ .+.......+-..++ .=..+.+|.++..+.+- T Consensus 296 ~~~d~~~~~~~~~~~~~~~~~~dp~~~~~~~~s~-plPv~~~p~~~~~~~V~ 346 (346) T protein:vir:63 296 AIMDFKNGLVPTRMFPKMWEEEDPSVAMLMTQSA-PLMVPAQPNASFRMTVK 346 (346) T ss_pred eccccccCcccceeeeEEEEecCCCEEEEEEeee-ccceecCCCcEEEEEeC Confidence 111110 000 011122222222222 11335677777777777 No 156 >protein:vir:105645 Length: 400 # NCBI annotation: putative major capsid protein # Family: family:all:2806 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425009;genbank:gi:83571757;uniprot:Q2WC43;genbank:GeneID:3837286 Probab=81.00 E-value=0.09 Score=26.33 Aligned_cols=291 Identities=8% Similarity=-0.030 Sum_probs=132.1 Q ss_pred cccccchhhhhhhhhhhccCCceeccc-----hhhHHHHHHHHHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEE Q lcl|NC_021342. 23 VSRNGDQWVINNTALDAIGNPNIMLDA-----DGGIAFYISQLAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYR 97 (354) Q Consensus 23 ~~~~~~~~~~~~~am~a~~~~~~~~da-----~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~ 97 (354) +.+. ...+..+ +.--.|+..-.-++++..-+. =..+.++.+.+ +. +..|+.+. T Consensus 1 Ms~~----------------n~~t~p~~~gsg~~~aL~Le~f~GeV~taF~~~----si~~~~~~vRt-I~-~gkS~qf~ 58 (400) T protein:vir:10 1 MSTP----------------NNLTNVAVSASGEVDSLLIEKFNGKVNEQYLKG----ENIMSYFDVQT-VT-GTNTVSNK 58 (400) T ss_pred CCCC----------------ccccccccccccchhhhHHhHhcchHHHHHHHH----hhhcccceeee-ec-ccceEEEE Confidence 1111 1111111 112245544444555555321 12223333332 11 22334333 Q ss_pred eeccccceeEec-CCCcccceeeeccceeEEEE--EEEEeeEeecHHHHHHHHHhCCC-cchHHHHHHHHHHHHHhhhee Q lcl|NC_021342. 98 SYDGVTMGKFIG-ANGQDLPRVAQSAQMHTVPL--GYAGNECHYTLDEMRKSAAMNMP-IDAEQARLAFRGAEEHSQSVA 173 (354) Q Consensus 98 ~~~~~G~a~~~~-~~~~dip~v~~~~~~~~~pv--~~~~~~~~~~~~El~~a~~~g~~-ld~~k~~aA~~~~a~~~n~~~ 173 (354) . .|+.+.-. 
..+..+-.....-++..+.| ..+..-+-| +|+.++ .-.+ +..+-....-.+++++.|+.+ T Consensus 59 ~---lG~s~a~y~~pG~~ldg~~~~~dk~~ItIDtLL~a~~~V~---dlDd~q-~~yD~vRse~s~e~G~ALA~~~Dq~i 131 (400) T protein:vir:10 59 Y---LGETELQVLAPGQSPAATSTQADKNQLVIDATVIARNTVA---HLHDVQ-GDIDSLKPKLATNQAKQLKKMEDEML 131 (400) T ss_pred E---eeeeEEeeecCCCCcCCCCcccCcEEEEeCceeeecchhh---hHHHHh-hccccccHHHHHHHHHHHHHHHHHHH Confidence 2 34433211 11111111112222222222 333333334 555553 4455 566666777788888888855 Q ss_pred ee-----eeh----hhCceeeeecCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCcccccEEEeCHHHHHHHhh Q lcl|NC_021342. 174 YF-----GDA----SRGMYGLFNNPNVTLSSATKDYKTMNGQELFNMLNAPIFSVINLSRRFHVPNTALMFPDLWNQANN 244 (354) Q Consensus 174 f~-----G~~----~~gi~GLlN~p~~~~~~~~~~W~~~T~~ei~~di~~~~~~l~~~s~g~~~p~~L~l~p~~~~~L~~ 244 (354) +- |.. ..+..|...++.....+....=...+++++...|.++..+|.++.-= .....+++||+.|..|.. T Consensus 132 iq~i~~a~~a~t~~~~~~~~g~~~g~s~~v~~~~~~~~~~~~~l~~A~~~A~~~LdEkdVP-~~d~vvl~pp~~Ys~Ll~ 210 (400) T protein:vir:10 132 IQQMLLGGIANTQAKRTNPRVKGHGFSVNVEVNEGEALVNPQYVMAAVEFALEQQLEQEVD-ISDVAILMPWRYFNVLRD 210 (400) T ss_pred HHHHHHhcccccccccccCCccccccceeecccccccccCHHHHHHHHHHHHHHHHhcCCC-ccceEEEcCHHHHHHHHh Confidence 41 211 11222222222211111112222346889999999999998764211 223588889999987764 Q ss_pred c--ccCCCCCchHHHHHHhhCcccccccccceeeeeeeeeeccccc------------cc-------cccCcccEEEEEE Q lcl|NC_021342. 245 Q--LMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQLDAAELAA------------NG-------VSNSNKPRYMVYD 303 (354) Q Consensus 245 ~--~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~------------~g-------~g~~g~d~~v~y~ 303 (354) - .++- +|.-.++ ..-..|..+.+-.++.+++..+-. ++ .++-.+-++++|. 
T Consensus 211 ~dkLvnr-------df~~s~~-g~~~~g~v~~v~Gv~Iv~Sn~lP~~a~~~~~~~lS~a~~G~~y~~t~d~s~~~av~F~ 282 (400) T protein:vir:10 211 ADRIVDK-------SYTISQS-GATIQGFVLSSYNCPVIPSNRFPKYSQGQKHHLLSNEDNGYRYDPIAEMNGAIAVLFT 282 (400) T ss_pred CCcccch-------hccccCC-CccccceEEEEeceEEEeeCcCCcccCcccccccccCCCCccCCccccccceeEEEEe Confidence 2 1211 1211111 011234444555555555544410 00 1233456777876 Q ss_pred cCcceEEEeeCchhhhc-cccccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 304 KSDRNLAMANPIPFRML-APQMASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 304 ~~~~~~~~~vp~~~~~~-~~~~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) ++.=-+ .-.+|++.- --+.+...|.+.+.... |+..+||.|++-+-.+ T Consensus 283 ~sAv~t--vk~~~lt~~~~~d~r~~~~~id~~~a~-G~g~~RPeaa~vv~~~ 331 (400) T protein:vir:10 283 ADALLV--GRSIDVIGDIFYEKKEKTYYIDTFMSE-GAIPDRWEAVSVVTTK 331 (400) T ss_pred hhheEE--EEeeccccccccchhhHHHHHHHHHHh-CCcccchhheEEEEec Confidence 552222 212333331 22455566666677766 6999999999988777 No 157 >protein:vir:1084 Length: 437 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076738;genbank:gi:13095848;genbank:GeneID:920418 Probab=78.26 E-value=0.12 Score=25.71 Aligned_cols=306 Identities=8% Similarity=-0.069 Sum_probs=118.4 Q ss_pred CcccchhHH------HHhhhhhhh----cccc---cccccchhhhhh-hhhhhccCCceeccchhhHHHHHHHHHHHHHH Q lcl|NC_021342. 1 MAIKTIDAQ------TIQGNQWLV----HKGY---VSRNGDQWVINN-TALDAIGNPNIMLDADGGIAFYISQLAGIEAT 66 (354) Q Consensus 1 ~~~~~~~~~------~~~~~~~~~----~~~~---~~~~~~~~~~~~-~am~a~~~~~~~~da~~~~~fl~~~L~~Id~~ 66 (354) ..-+.++.+ .-....+.. .... .......+...- ...... .....+++ ++ +++. +.+... 
T Consensus 102 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~--~~~~~~~~-~g-~lvp--~~~~~~ 175 (437) T protein:vir:10 102 ETKSEAEKDKKTVKDEEKRDAGGLQDMKLKVGGEIADKKVTAFADYLKTGEVRD--VTGIALKD-GK-VIIP--ETILTP 175 (437) T ss_pred HHHHHHHHHHHHHHHHHHHhHHHHhHHHHHHHHHHHHhhhhhhHHHHHhhhhhh--hhhccccc-cc-ccch--HHHHHH Confidence 000000000 000000000 0000 000000000000 000000 11112222 22 3332 223333 Q ss_pred HHHhhhhcccchhhccccCCCCCceeEEEEEeec-cccceeEecCCCcccce-eeeccceeEEEEEEEEeeEeecHHHHH Q lcl|NC_021342. 67 VYETPYGDITYRFDVPMAANIPEYADTWMYRSYD-GVTMGKFIGANGQDLPR-VAQSAQMHTVPLGYAGNECHYTLDEMR 144 (354) Q Consensus 67 v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~-~~G~a~~~~~~~~dip~-v~~~~~~~~~pv~~~~~~~~~~~~El~ 144 (354) +.+ ....-..+.++.+.. .. ..+..+.... ..+.+.+++..+. +|- .+...+......+.++.-+.+|..=|+ T Consensus 176 i~~-~~~~~~l~~~~~~~~-~~--~~~~~~~~~~~~~~~~~~~~e~~~-~~e~~~~~~~~v~~~~~k~~~~~~is~ell~ 250 (437) T protein:vir:10 176 EKE-VHQFPRLGSLVRTES-VT--TTTGKLPIFNNSTDLLTAHTEYGQ-TTKNATPVITPILWDLKTYTGGYVFSQELIS 250 (437) T ss_pred HHH-hhhhhhhhhcceeEe-ec--cCceeeEEeecccccccccccccc-ccccccccceeeeeehhheeeehhhhHHHHh Confidence 333 223333444444321 11 1123333332 2344555554433 443 234556667777777777777765444 Q ss_pred HHHHhCCCcchHHHHHHHHHHHHHhhheeeeeehhhCceeeeecCCcccccccccccccCHHHHHHHHHHHHH-HHHHHh Q lcl|NC_021342. 145 KSAAMNMPIDAEQARLAFRGAEEHSQSVAYFGDASRGMYGLFNNPNVTLSSATKDYKTMNGQELFNMLNAPIF-SVINLS 223 (354) Q Consensus 145 ~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f~G~~~~gi~GLlN~p~~~~~~~~~~W~~~T~~ei~~di~~~~~-~l~~~s 223 (354) .+ ..++..--....+.++...+|.-+++|+... .+..+++ .+. +|+.+++. .+.. 
T Consensus 251 ds---~~~~~~~i~~~l~~~~~~~~~~~i~~g~g~~----------~~~~~~~-----~~~----~~~~~~~~~~l~~-- 306 (437) T protein:vir:10 251 DS---SYDWQAELQSRLIELRDNTDDSLIITALTDG----------IKKTTST-----YLL----GDLKKVLNVTLKP-- 306 (437) T ss_pred hh---HHHHHHHHHHHHHHHHHHHHHHHHhhhhccc----------ccccccc-----cch----hhHHHHHHhhhhh-- Confidence 33 3467777777888999999999999996431 1111111 122 33333333 2221 Q ss_pred CCcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeeeeeeeccccccccccCcccEEEEEE Q lcl|NC_021342. 224 RRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQLDAAELAANGVSNSNKPRYMVYD 303 (354) Q Consensus 224 ~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~~g~g~~g~d~~v~y~ 303 (354) .....-..+|+|..|..|..- .+..|.-++. .++ ..|.+-.+...|......-.... ++.|+. .++|- T Consensus 307 -~~~~~~~~~~~~~~~~~l~~l--kd~~g~~~~~----~~~---~~~~~~~l~G~pv~~~~~~~~~~-~~~~~~-~~~~g 374 (437) T protein:vir:10 307 -QDSAAASIVMSQSAYNLFDMA--TDAMGRPLLQ----PNV---TAATGYTLLGKTVVIVDDKLFPS-ASAGDV-NIVVA 374 (437) T ss_pred -hhhcCCEEEEcHHHHHHHHHh--hccCCCeeec----cCc---cCCCCcccccceeEEecccccCC-cCCCce-EEEEe Confidence 122223689999999998653 2444432221 110 11222223222222221100111 112222 23332 Q ss_pred cCcceEEEeeCchhhhccc-cccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 304 KSDRNLAMANPIPFRMLAP-QMASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 304 ~~~~~~~~~vp~~~~~~~~-~~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) +=.+.+.+..-+.+++.-. 
........+....|+ ++.+..|.|++++-.- T Consensus 375 d~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~~~r~-d~~~~~~~a~~~l~~~ 425 (437) T protein:vir:10 375 PLKKAVINFKLTEITGQFQDTYDIWYKQLGIFLRQ-NVVQASKDLIVNLTGK 425 (437) T ss_pred eccccEEEEeeeceEEEEecccccccceeeEEEEE-ccEEecccceEEEEee Confidence 2123232322223332111 111222334455677 4566689999987643 No 158 >protein:vir:107882 Length: 307 # NCBI annotation: gp34 # Family: family:all:908 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024707;genbank:gi:48696944;genbank:GeneID:2845970 Probab=76.46 E-value=0.14 Score=25.35 Aligned_cols=274 Identities=9% Similarity=0.027 Sum_probs=104.1 Q ss_pred hhhccCCceeccchhhHHHHHHHHHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeeccccceeEe------cC Q lcl|NC_021342. 37 LDAIGNPNIMLDADGGIAFYISQLAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYDGVTMGKFI------GA 110 (354) Q Consensus 37 m~a~~~~~~~~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~G~a~~~------~~ 110 (354) |-.. +..+..|. .|+.+= +.-.-+.+.+.+++|...-...+ ..|..++.-. .... .. T Consensus 1 m~~~-~~~~~~dp---------~LT~~A---~gy~n~~~ia~~l~P~vpv~~~~---~k~~~f~~ea-F~~~~t~r~~~~ 63 (307) T protein:vir:10 1 MGRL-SKLRIVDP---------VLTNLA---IGYTNAEFIGQSLMPVVEVEKEG---GKIPKFGKES-FRLYKTERALRA 63 (307) T ss_pred CCCC-CCCcccCh---------hHHHHH---HhhcchhhhhhhcCCcccccccc---cceeeECccc-ccchhhhcccCC Confidence 2211 22333331 111111 11112346777778765333332 2233332111 0000 00 Q ss_pred CCcccceeeecc-ceeEEEEEEEEeeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHh----hheeeeeehhhCceee Q lcl|NC_021342. 111 NGQDLPRVAQSA-QMHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHS----QSVAYFGDASRGMYGL 185 (354) Q Consensus 111 ~~~dip~v~~~~-~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~----n~~~f~G~~~~gi~GL 185 (354) ..+ +++... +.....+...+..+-... +..+....++.....+.+...+...+ -+++|.... 
|+ T Consensus 64 ~~~---~v~~~~~~~~~~~~~~~~L~~~id~---r~~~~~~~~~~~~av~~l~d~I~l~~E~~~A~l~~~~~~----y~- 132 (307) T protein:vir:10 64 RSN---RMNPEDLGSIDIVLDEHDLEYPIDY---REDQESAFPLEQAAVQTATEAIQLRREKMVADLAQNPNS----YA- 132 (307) T ss_pred Ccc---eeecccccccccccccccccccCCh---hhcCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCccc----cC- Confidence 111 111110 111111222111111211 22334455565555555555444333 344443211 11 Q ss_pred eecCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcc Q lcl|NC_021342. 186 FNNPNVTLSSATKDYKTMNGQELFNMLNAPIFSVINLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSY 265 (354) Q Consensus 186 lN~p~~~~~~~~~~W~~~T~~ei~~di~~~~~~l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~ 265 (354) ..+..+.+.+..|+++++ +++.||.+.+.++... +...|++++|+.+.|..|.+- ..+++.++-... T Consensus 133 --~~~k~tLsGt~~Wsd~~s-DPi~di~~~~~ai~~~--~g~~Pn~~vlg~~a~~al~~h-------p~i~e~lk~~~~- 199 (307) T protein:vir:10 133 --GGNKKQLSATEKFTAAGS-DPVGVIEDGKEAIRTK--IGRRPNTMVIGASAYKTLKAH-------PQLIEKIKYSMK- 199 (307) T ss_pred --CCceEEeccccccCCCCC-CcHHHHHHHHHHHHhh--hCCccceEEeCHHHHHHHhcC-------HHHHHHhCCccc- Confidence 111112334557988765 5699999999999875 346899999999999988641 134444432110 Q ss_pred ccccccc-----ceeeeeeeeeeccccccc--cccCcccEEEEEEcCc-ceEEEeeCchhhhcc-ccccCceeEEeeeee Q lcl|NC_021342. 266 TLLTGNE-----LDIQIRFQLDAAELAANG--VSNSNKPRYMVYDKSD-RNLAMANPIPFRMLA-PQMASLGITVPAEYK 336 (354) Q Consensus 266 ~~~~g~~-----l~I~~~~~L~~~~~~~~g--~g~~g~d~~v~y~~~~-~~~~~~vp~~~~~~~-~~~~~l~~~~~~~~~ 336 (354) ....+. ++++.+...++-.-...+ .-.-|.+..++|.... ..-.--+-+| .+.. .+.++..+..++++ T Consensus 200 -g~it~~~la~ll~v~~i~vg~a~~~~~~~~~~~iw~~~~vl~yv~~~~~~~~~~~~ep-sfGyT~~~~g~~~~d~~~~- 276 (307) T protein:vir:10 200 -GIVTVDLLKEIFEVENIAVGEAIYADDKDRFTDIWGANIVLAYVPLQRGGQQRTPYEP-SYGYTLRKKGNPVVDTRIE- 276 (307) T ss_pred -cccCHHHHHHHhCceeEEEeeeeeeccCCccceeCCCceEEEecccccCCCCCccccc-ccceeEEEcCCeEeeceec- Confidence 000000 111111111110000000 0001334444553210 0000000011 1111 13345555555555 Q ss_pred eeeEEEECCc-----eeEeeecC Q lcl|NC_021342. 
337 ISGTEFRYPL-----CAAYVDMA 354 (354) Q Consensus 337 ~gGv~i~~P~-----ai~y~D~~ 354 (354) -+|+++.|-. -++.-|.. T Consensus 277 ~~~~~~~r~~~~~~~~i~~~~~G 299 (307) T protein:vir:10 277 DGKLELVRSTDIFRPYLLGADAG 299 (307) T ss_pred CCceeEEeccccccceeeccccc Confidence 3566555433 23333322 No 159 >protein:vir:79078 Length: 307 # NCBI annotation: gp8 # Family: family:all:908 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111208;genbank:gi:134288798;genbank:GeneID:4960752 Probab=75.28 E-value=0.15 Score=25.12 Aligned_cols=276 Identities=10% Similarity=0.018 Sum_probs=98.8 Q ss_pred hhhccCCceeccchhhHHHHHHHHHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeecccc----ceeE-ecCC Q lcl|NC_021342. 37 LDAIGNPNIMLDADGGIAFYISQLAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYDGVT----MGKF-IGAN 111 (354) Q Consensus 37 m~a~~~~~~~~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~G----~a~~-~~~~ 111 (354) |-.. +..+..|- .|+.+= +.-.-+++.+.++||...-... ++.|..++... ..+. .+.. T Consensus 1 m~~~-~~~~~~dp---------~LT~~A---~gy~n~~~Iad~lfP~vpV~~~---~~k~~~f~~e~f~~~~t~ra~~~~ 64 (307) T protein:vir:79 1 MGRL-SKLRIVDP---------VLTNLA---IGYTNAEFIGQTLMPVVEVEKE---GGKIPKFGKESFRLYQTERALRAK 64 (307) T ss_pred CCCC-CCCcccCH---------HHHHHH---hhccchhhhhhhcCCccccccc---ccceeeeccccccccccccccCCC Confidence 2211 22223331 111111 1112345677777776532222 23333332111 0000 0111 Q ss_pred CcccceeeeccceeEEEEEEEEeeEeecHHHHHHHHHhCCCcchHHHHHHHHH----HHHHhhheeeeeehhhCceeeee Q lcl|NC_021342. 112 GQDLPRVAQSAQMHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQARLAFRG----AEEHSQSVAYFGDASRGMYGLFN 187 (354) Q Consensus 112 ~~dip~v~~~~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~----~a~~~n~~~f~G~~~~gi~GLlN 187 (354) ++.+... ..+.....+...+...-. +. +.....+.++.....+..... .+..--+++|.+.. | . 
T Consensus 65 ~~~v~~~--~~~~~~~~~~~~~l~~~i--d~-r~~~~~~~~~~~~Av~~l~d~I~l~~E~~~A~l~~~~~~----y---~ 132 (307) T protein:vir:79 65 SNRMNPE--DIDSVDVNLDEHDLEYPI--DY-REDQESAFPLEQAAVQTATDAIQLRREKMIADLSQNPSS----Y---A 132 (307) T ss_pred cceeeee--ccccccccccccchhhcc--cc-hhcCCCCCCHHHHHHHHHHHHHHhHHHHHHHHHhccccc----c---C Confidence 1111110 111111122221111111 11 112223444444433333322 33333344444321 1 1 Q ss_pred cCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccc Q lcl|NC_021342. 188 NPNVTLSSATKDYKTMNGQELFNMLNAPIFSVINLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTL 267 (354) Q Consensus 188 ~p~~~~~~~~~~W~~~T~~ei~~di~~~~~~l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~ 267 (354) ..+.-+.+.+..|+++++ +++.||.+.+.++... +...|++++|++..|..|.+ +..+++.|+-.+. . T Consensus 133 ~~~k~tLsgt~~Wsd~~s-DPi~di~~~~~ai~~~--~g~~Pn~~vlg~~a~~~l~~-------h~~i~~~lk~~~~--g 200 (307) T protein:vir:79 133 AGNKKQLSATEKFTAANS-DPVGVIEDGKEAIRTK--IGRRPNTMVIGASAYKTLKA-------HPQLIEKIKYSMK--G 200 (307) T ss_pred CCceEEEccCcccCCCCC-CcHHHHHHHHHHHHHh--hCCccceEEeCHHHHHHHhc-------CHHHHHHhcCccc--c Confidence 112222334456988765 5699999999999875 34689999999999998864 1133333322110 0 Q ss_pred cccc-----cceeeeeeeeeeccccccc--cccCcccEEEEEEcC-cceEEEeeCchhhhcc-ccccCceeEEeeeeeee Q lcl|NC_021342. 268 LTGN-----ELDIQIRFQLDAAELAANG--VSNSNKPRYMVYDKS-DRNLAMANPIPFRMLA-PQMASLGITVPAEYKIS 338 (354) Q Consensus 268 ~~g~-----~l~I~~~~~L~~~~~~~~g--~g~~g~d~~v~y~~~-~~~~~~~vp~~~~~~~-~~~~~l~~~~~~~~~~g 338 (354) ...+ -++++.+...++-+-...+ .-.-|.+..++|... +.+-.-.+-+| .+.. .+.++.-...++++ -+ T Consensus 201 ~it~~~la~l~~v~~V~vg~a~y~~~~~~~~~iw~~~~~l~y~~~~~~~~~~~~~~p-s~Gyt~~~~g~~~~d~~~~-~~ 278 (307) T protein:vir:79 201 IVTVDLLKEIFEVENIAVGEAIYADDKDRFTDIWGANIVLAYVPLQRGGQQRTPYEP-SYGYTLRKKGNPVVDTRIE-DG 278 (307) T ss_pred ccCHHHHHHHhCceeEEEeeeeeecccccchhcCCCceEEEecccccCCCCCccccc-ccceeEEecCceEEecccC-CC Confidence 0000 0122222221111100110 001133445555321 11110001011 1111 12223323333333 34 Q ss_pred eEEEECCce-----eEeeecC Q lcl|NC_021342. 
339 GTEFRYPLC-----AAYVDMA 354 (354) Q Consensus 339 Gv~i~~P~a-----i~y~D~~ 354 (354) |+++.|-.- ++.-|.. T Consensus 279 ~~~~vrv~~~~~~~i~~~~~G 299 (307) T protein:vir:79 279 KLELVRATDIFRPYLLGADAG 299 (307) T ss_pred ceeEEeecccccceeeccccc Confidence 555443222 2333322 No 160 >protein:vir:63741 Length: 468 # NCBI annotation: Cps # Family: family:all:2450 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547622;genbank:GeneID:3783474 Probab=72.10 E-value=0.19 Score=24.57 Aligned_cols=297 Identities=14% Similarity=0.070 Sum_probs=129.0 Q ss_pred ccccc-----ccccchhhhhhhhhhhccCCc-eeccchh-hHHHHHHHHHHHHHHHHHhhh--hcccchhhccccCCCCC Q lcl|NC_021342. 19 HKGYV-----SRNGDQWVINNTALDAIGNPN-IMLDADG-GIAFYISQLAGIEATVYETPY--GDITYRFDVPMAANIPE 89 (354) Q Consensus 19 ~~~~~-----~~~~~~~~~~~~am~a~~~~~-~~~da~~-~~~fl~~~L~~Id~~v~e~~~--~~l~~r~~v~v~~~~~~ 89 (354) +|.+. ...+.+. ....+|.+...+. .+-|.+- ++++.. +.+|+++....+ .+++.-+-++- .+... T Consensus 1 ~~~~~~~~~~~~~~~~~-~~e~~~Ks~~agy~~~p~~q~~~~AlR~---EsL~~~i~~L~~~~~~f~~~~di~k-~~a~s 75 (468) T protein:vir:63 1 MPKNNKEEEVKEVNLNS-VQEDALKSFTTGYGITPDTQTDAGALRR---EFLDDQISMLTWTENDLTFYKDIAK-KPATS 75 (468) T ss_pred CCCCcchhhccccChhH-HHHHHHHHHHcCcccCCccccCcchhhh---hhhhhhhheeeecccchhhhhhccc-chhhh Confidence 33221 1111111 1133555543332 2233332 335555 455565544322 23333333331 12222 Q ss_pred ceeEEEEE-eeccccceeEecCCCcccceeeeccceeEEEEEEEEeeEeecHHHHHHHHHhCC-CcchHHHHHHHHHHHH Q lcl|NC_021342. 90 YADTWMYR-SYDGVTMGKFIGANGQDLPRVAQSAQMHTVPLGYAGNECHYTLDEMRKSAAMNM-PIDAEQARLAFRGAEE 167 (354) Q Consensus 90 ~~~~~~~~-~~~~~G~a~~~~~~~~dip~v~~~~~~~~~pv~~~~~~~~~~~~El~~a~~~g~-~ld~~k~~aA~~~~a~ 167 (354) ....|+.. ....+|.+...+.. ...+..+.++.|++..+..++..-..|+.--. 
..++ +......+.|...+++ T Consensus 76 tv~~y~~~~~~G~~g~~~f~~E~-g~~~~~~~~~~r~~~~~k~l~~~~~vs~~~~l---~n~i~d~~~~~~~~ai~~~a~ 151 (468) T protein:vir:63 76 TVAKYDVYMQHGKVGHTRFTREI-GVAPVSDPNIRQKTVNMKFASDTKNISIAAGL---VNNIQDPMQILTDDAIVNIAK 151 (468) T ss_pred hhhhheeeeccCccccccccccc-cccccCCCceEEEEEEeeeeeeeeeehhhhhh---hcchhhHHHHHHHHHHHHHHH Confidence 33333322 33445666655544 33567788889999999988887777653222 1222 4447777788889999 Q ss_pred Hhhheeeeeehhh----------Cceeeee--cCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCcccccEEEeC Q lcl|NC_021342. 168 HSQSVAYFGDASR----------GMYGLFN--NPNVTLSSATKDYKTMNGQELFNMLNAPIFSVINLSRRFHVPNTALMF 235 (354) Q Consensus 168 ~~n~~~f~G~~~~----------gi~GLlN--~p~~~~~~~~~~W~~~T~~ei~~di~~~~~~l~~~s~g~~~p~~L~l~ 235 (354) ......||||+.+ ..-||++ +|. .+..+.+... + -++|+++-..+ +.|+-.|.-+.|| T Consensus 152 tiE~a~FyGds~l~~s~~~~~glqfDGi~~li~~e-nviDa~G~~l--s----~~~lneaa~~i---~~gfG~~td~~~~ 221 (468) T protein:vir:63 152 TIEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQD-NVHDARGASL--T----ESLLNQAAVMI---SKGYGTPTDAYMP 221 (468) T ss_pred HHHHHhhhcccccccCCCccccccccceeEEecCC-ceeccCCCcc--C----HHHHHHHhhhc---cccccChhhhhcc Confidence 9999999998765 3345543 232 2233333322 2 24556554332 2367789999999 Q ss_pred HHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeeeeeeeccccccccccCcccEEEEEEcCcceEEEeeCc Q lcl|NC_021342. 236 PDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQLDAAELAANGVSNSNKPRYMVYDKSDRNLAMANPI 315 (354) Q Consensus 236 p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~~g~g~~g~d~~v~y~~~~~~~~~~vp~ 315 (354) +.....|....+.+..-+ +. .|......|.++. ...++....+ -..-.+.+ +...+...++. 
T Consensus 222 ~~v~a~~~~~~L~~q~~v-----~~-~n~~~~~~G~~v~-----g~~sa~G~I~------l~gs~il~-~~~~l~~~~~~ 283 (468) T protein:vir:63 222 VGVQADFVNQQLSKQTQL-----VR-DNGNNVSVGFNIQ-----GFHSARGFIK------LHGSTVME-NEQILDERILA 283 (468) T ss_pred hhHHhhhhhhhcCceEEE-----Ec-CCCCceeeeeccc-----ceecceeeee------ecCceeec-cccCCCccccc Confidence 999988855444332211 11 1122233343321 1111110000 00011111 11222111100 Q ss_pred hhhhccccccCc---eeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 316 PFRMLAPQMASL---GITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 316 ~~~~~~~~~~~l---~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) .- .+|.+-.+ ..........+|..--|-++++.+|=. T Consensus 284 ~~--~Apsp~~vsaT~~~~~~g~~~~~~~a~y~Y~v~~vs~~ 323 (468) T protein:vir:63 284 LP--TAPQPAKVTATQEAGKKGQFRAEDLAAHEYKVVVSSDD 323 (468) T ss_pred cc--ccccCCccceeeecccCCcccCCCcceEEEEEEEECCC Confidence 00 01111010 000000111112111133344444433 No 161 >protein:vir:94933 Length: 330 # NCBI annotation: putative phage structural protein # Family: family:all:1120 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239278;genbank:gi:66392060;genbank:GeneID:5076578 Probab=70.10 E-value=0.21 Score=24.26 Aligned_cols=304 Identities=11% Similarity=-0.037 Sum_probs=132.8 Q ss_pred CcccchhHHHHhhhhhhhcccccccccchhhhhhhhhhhccCCceeccchhhHHHHHHHHHHHHHHHHHhhhhcccchhh Q lcl|NC_021342. 1 MAIKTIDAQTIQGNQWLVHKGYVSRNGDQWVINNTALDAIGNPNIMLDADGGIAFYISQLAGIEATVYETPYGDITYRFD 80 (354) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~am~a~~~~~~~~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~ 80 (354) .-|-+-| -..+-.+=.+-...+||-+. +.|++ ..+.. 
+.+...|+|...+.-...++ T Consensus 2 ~~~~~~~-------------~~~~~~~~~~~~p~l~m~al------TLaea--~~l~~--d~~~~~VIE~l~~~s~iL~~ 58 (330) T protein:vir:94 2 VRICTPP-------------LRGRWRTLTHQFPELKMPTV------TLAES--AKLSQ--DHLVSGLIETIVEVNPLYEM 58 (330) T ss_pred ceecCCc-------------cccceeehhccccccchhhh------hhhHH--hhcCc--hhhHHHHHHhhhccchHHhh Confidence 1111111 11111111122234566653 23332 23333 45667888877766666676 Q ss_pred ccccCCCCCceeEEEEEeeccccceeEecCCCcccceeeeccceeEEEEEEEEeeEeecHHHHHHHHHhCCC--cchHHH Q lcl|NC_021342. 81 VPMAANIPEYADTWMYRSYDGVTMGKFIGANGQDLPRVAQSAQMHTVPLGYAGNECHYTLDEMRKSAAMNMP--IDAEQA 158 (354) Q Consensus 81 v~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~v~~~~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~--ld~~k~ 158 (354) +|... ...+ .+.|......+.+.+..-+...-|--.......+..+..++..++.+.+ -+...|-+ ...... T Consensus 59 lpf~~-ve~~--~~~~~r~~~lp~a~~r~~n~~~~~~~~~Tf~q~t~~l~~l~~~~~Vd~~---iadl~g~~~d~~~~q~ 132 (330) T protein:vir:94 59 MPFTE-IEGN--ALAYNRENVLGDVQFLAVGGTITAKNPATFTKVTSELTTLIGDAEVNGL---IQATRSDFMDQTSVQV 132 (330) T ss_pred ccccc-ccCC--cceeeeeecCCcceeeeccccccccCcceeeeeeechhhhhhhHHHHHH---HHHhcCCHHHHHHHHH Confidence 66321 1111 1333333333444443222111110011112222334444433333222 22234443 344555 Q ss_pred HHHHHHHHHHhhheeeeeehh-hCceeeeecCC-cccccccccccccCHHHHHHHHHHHHHHHHHHhCCcccccEEEeCH Q lcl|NC_021342. 159 RLAFRGAEEHSQSVAYFGDAS-RGMYGLFNNPN-VTLSSATKDYKTMNGQELFNMLNAPIFSVINLSRRFHVPNTALMFP 236 (354) Q Consensus 159 ~aA~~~~a~~~n~~~f~G~~~-~gi~GLlN~p~-~~~~~~~~~W~~~T~~ei~~di~~~~~~l~~~s~g~~~p~~L~l~p 236 (354) ....+++.+++.+-.+|||.. .++.||+..-. -.....++.=..-| ++|+.+++..++.. . -.|..|+++. 
T Consensus 133 ~~~ieal~~~~e~~linGDs~~~~F~GL~~~~~~~q~i~tg~~gg~~T----~d~LDeLl~~v~~~-~--g~~~~~l~n~ 205 (330) T protein:vir:94 133 ASKAKSIGRQYQASMITGDGTGNSFQGMMGLVAASQTISAGANGGTLT----FELLDQLLDLVKDK-D--GQVDYLMSSF 205 (330) T ss_pred HHHHHHHHHHHHHHhhccCCCCccccchhhcCCcccEEecCCCCCCCC----HHHHHHHHHHhcCC-C--CCCcEEEech Confidence 667778999999999999765 46779865221 11121111111223 47788888888642 1 2588899877 Q ss_pred HHHHHHhhc-ccCCCCC---chHHHHHHhhCccccccccc-ceeeeeeeeeeccccc-ccc-ccCcccEEEEEEcC---- Q lcl|NC_021342. 237 DLWNQANNQ-LMTGYTD---RTVMQHFMEANSYTLLTGNE-LDIQIRFQLDAAELAA-NGV-SNSNKPRYMVYDKS---- 305 (354) Q Consensus 237 ~~~~~L~~~-~~~~~~~---~Tvl~~l~~n~~~~~~~g~~-l~I~~~~~L~~~~~~~-~g~-g~~g~d~~v~y~~~---- 305 (354) .....+..- |-....+ .++.. -|++ +...-+|.+.++.+.. .+. ..+|+..+++..-. T Consensus 206 a~~r~I~a~~R~~~~~~v~~~~~~~-----------~G~~v~~~~GvPi~~~d~ip~~~~~~~~~~ttsIyav~~G~~~~ 274 (330) T protein:vir:94 206 AMRRKYFSLLRALGGAAIGEVMTLP-----------SGRQIPTYRGVPWFVNDFIPSNMTQGTATNATAIFAGTFDDGSN 274 (330) T ss_pred hHHHHHHHHHHhccCCCCCCccccc-----------CCCEEeeeCCeEEEecccccCCCCcccCCCceeEEEEeeccccc Confidence 655544321 1011111 12111 1222 2333344333332211 111 22344444444432 Q ss_pred -cceEEEeeCc----hhhhcc-ccccC-ceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 306 -DRNLAMANPI----PFRMLA-PQMAS-LGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 306 -~~~~~~~vp~----~~~~~~-~~~~~-l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) .-+..++-+. ..++.. .+.+. 
..|.+..+ -|+-++-|.|++.++-- T Consensus 275 ~qgV~Gl~~~g~~glsVr~~G~~~~k~v~~~~v~~y---~~~av~~~~a~~~L~~V 327 (330) T protein:vir:94 275 KYGIAGLTARGSAGLRVQNVGAKENADETITRVKMY---CGFANFSQLGLAAIKGL 327 (330) T ss_pred ccceEeecCCCCCcceeeeCCCccccceeeEEEEEe---eeeEEechhheeeeccc Confidence 1345554322 122322 22332 34555443 36778888888886544 No 162 >protein:vir:80491 Length: 467 # NCBI annotation: Cps # Family: family:all:2450 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468466;genbank:gi:157325041;genbank:GeneID:5601449 Probab=69.44 E-value=0.22 Score=24.16 Aligned_cols=301 Identities=15% Similarity=0.082 Sum_probs=127.2 Q ss_pred CcccchhHHHHhhhhhhhcccccccccchhhhhhhhhhhcc-CCceeccchh-hHHHHHHHHHHHHHHHHHhhh--hccc Q lcl|NC_021342. 1 MAIKTIDAQTIQGNQWLVHKGYVSRNGDQWVINNTALDAIG-NPNIMLDADG-GIAFYISQLAGIEATVYETPY--GDIT 76 (354) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~am~a~~-~~~~~~da~~-~~~fl~~~L~~Id~~v~e~~~--~~l~ 76 (354) |--+ |--=++|.+-... .. ..|.+.. +-+.+-|.+- ++++.. +.+++.+....+ .+++ T Consensus 1 ~~~~---------~~~~~~~~n~~~~-----~e-~~~Ks~~agy~~~p~tq~~~~AlR~---EsL~~~i~~Lt~~~~~f~ 62 (467) T protein:vir:80 1 MPKN---------NKEEVKEVNLNSV-----QE-DALKSFTTGYGITPDTQTDAGALRR---EFLDDQISMLTWTENDLT 62 (467) T ss_pred CCCc---------chhhhhhcccccC-----HH-HHHHHHHcccccCCccccCcchhhh---hhhhhhhheeeccccchh Confidence 1100 1111112211111 11 1333322 2233333332 335555 445555544322 3333 Q ss_pred chhhccccCCCCCceeEEEEE-eeccccceeEecCCCcccceeeeccceeEEEEEEEEeeEeecHHHHHHHHHhCC-Ccc Q lcl|NC_021342. 77 YRFDVPMAANIPEYADTWMYR-SYDGVTMGKFIGANGQDLPRVAQSAQMHTVPLGYAGNECHYTLDEMRKSAAMNM-PID 154 (354) Q Consensus 77 ~r~~v~v~~~~~~~~~~~~~~-~~~~~G~a~~~~~~~~dip~v~~~~~~~~~pv~~~~~~~~~~~~El~~a~~~g~-~ld 154 (354) .-+-++- .+.......|+.. ....+|.+...+.. ...+..+.++.|++..+..++..-..|+.--. ..++ +.. 
T Consensus 63 ~~~di~k-~~a~stv~~y~~~~~~G~~g~~~f~~E~-g~~~~~~~~~~r~~~~~k~l~~~~~vs~~~~l---~n~i~d~~ 137 (467) T protein:vir:80 63 FYKDIAK-KPATSTVAKYDVYMQHGKVGHTRFTREI-GVAPVSDPNIRQKTVNMKFASDTKNISIAAGL---VNNIQDPM 137 (467) T ss_pred hhhhccc-chhhhhhhhheeeeccCccccccccccc-cccccCCCceEEEEEEeeeeeeeeeehhhhhh---hcchhhHH Confidence 3333331 1222233333322 33445666655544 33567788888999999888887777653222 1222 444 Q ss_pred hHHHHHHHHHHHHHhhheeeeeehhh----------Cceeeee--cCCcccccccccccccCHHHHHHHHHHHHHHHHHH Q lcl|NC_021342. 155 AEQARLAFRGAEEHSQSVAYFGDASR----------GMYGLFN--NPNVTLSSATKDYKTMNGQELFNMLNAPIFSVINL 222 (354) Q Consensus 155 ~~k~~aA~~~~a~~~n~~~f~G~~~~----------gi~GLlN--~p~~~~~~~~~~W~~~T~~ei~~di~~~~~~l~~~ 222 (354) ....+.|...+++......||||+.+ ..-||++ +|. .+..+.+... + -++|+++-..+ T Consensus 138 ~~~~~~ai~~~a~tiE~a~FyGds~l~~s~~~~~glqfDGi~~li~~e-nviDa~G~~l--s----~~~lneaa~~i--- 207 (467) T protein:vir:80 138 QILTDDAIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQD-NVHDARGASL--T----ESLLNQAAVMI--- 207 (467) T ss_pred HHHHHHHHHHHHHHHHHHhhhcccccccCCCccccccccceeEEecCC-ceeccCCCcc--C----HHHHHHHhhhc--- Confidence 77777888899999999999998765 3345543 232 2233333322 2 24556554332 Q ss_pred hCCcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeeeeeeeccccccccccCcccEEEEE Q lcl|NC_021342. 223 SRRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQLDAAELAANGVSNSNKPRYMVY 302 (354) Q Consensus 223 s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~~g~g~~g~d~~v~y 302 (354) +.|+-.|.-+.||+.....|....+.+..-+ +. .|......|.++. ...++....+ -..-.+. 
T Consensus 208 ~~gfG~~td~~~p~~v~a~~~~~~L~~q~~v-----~~-~n~~~~~~G~~v~-----g~~sa~G~I~------l~gs~il 270 (467) T protein:vir:80 208 SKGYGTPTDAYMPVGVQADFVNQQLSKQTQL-----VR-DNGNNVSVGFNIQ-----GFHSARGFIK------LHGSTVM 270 (467) T ss_pred cccccChhhhhcchhHHhhhhhhhcCceEEE-----Ec-CCCCceeeeeccc-----ceecceeeee------ecCceee Confidence 2367789999999999988855444332211 11 1122233343321 1111110000 0001111 Q ss_pred EcCcceEEEeeCchhhhccccccCc---eeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 303 DKSDRNLAMANPIPFRMLAPQMASL---GITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 303 ~~~~~~~~~~vp~~~~~~~~~~~~l---~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) + +...+...++..- .+|.+-.+ ..........+|..--|-++++.+|=. T Consensus 271 ~-~~~~l~~~~~~~~--~Apsp~~vsaT~~~~~~g~~~~~~~a~y~Y~v~~vs~~ 322 (467) T protein:vir:80 271 E-NEQILDERILALP--TAPQPAKVTATQEAGKKGQFRAEDLAAHEYKVVVSSDD 322 (467) T ss_pred c-cccCCCccccccc--ccccCCccceeeecccCCcccCCCcceEEEEEEEECCC Confidence 1 1122211110000 01111010 000000111112111133344444433 No 163 >protein:vir:94800 Length: 319 # NCBI annotation: ORF012 # Family: family:all:701 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240536;genbank:gi:66396203;genbank:GeneID:5133580 Probab=67.08 E-value=0.26 Score=23.81 Aligned_cols=288 Identities=10% Similarity=0.010 Sum_probs=120.1 Q ss_pred cccchhHHHHhhhhhhhcccccccccchhhhhhhhhhhccCCceeccchhhHHHHHH-HHHHHHHHHHHhhh-hcccchh Q lcl|NC_021342. 2 AIKTIDAQTIQGNQWLVHKGYVSRNGDQWVINNTALDAIGNPNIMLDADGGIAFYIS-QLAGIEATVYETPY-GDITYRF 79 (354) Q Consensus 2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~am~a~~~~~~~~da~~~~~fl~~-~L~~Id~~v~e~~~-~~l~~r~ 79 (354) --|+| ..--|. ++ +-+++-... .. 
++--.-|++ ....+|+......+ .++..-+ T Consensus 1 ~~~~~-----~~~~~~-~~--------------~~~~~~~~~--~~--~~nt~~l~~k~~~~LD~~~~~~~~s~~~~~N~ 56 (319) T protein:vir:94 1 MNKTI-----KNATGM-LK--------------LNLQHFANK--SV--EPGQTLLKNKHVGILERVTAVNAYSTPALISN 56 (319) T ss_pred CCccc-----ccccce-eE--------------eehhhhhcc--CC--CcchHHHHHHHHHHHHHHHHHhhhhhhcccCc Confidence 00100 000000 00 000100000 00 111111221 11233332222111 1121111 Q ss_pred hccccCCCCCceeEEEEEeeccccceeEecCCCcccceeeeccceeEEEEEEEEeeEeecHHHHHHHHHhC-CCcchHHH Q lcl|NC_021342. 80 DVPMAANIPEYADTWMYRSYDGVTMGKFIGANGQDLPRVAQSAQMHTVPLGYAGNECHYTLDEMRKSAAMN-MPIDAEQA 158 (354) Q Consensus 80 ~v~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~v~~~~~~~~~pv~~~~~~~~~~~~El~~a~~~g-~~ld~~k~ 158 (354) -+ -+-+..++.....+..|-.. |.- .++...-+++..+.+..+-+ ..+|.+.+.+++..+..+ ........ T Consensus 57 ~~-----e~~gg~tVkIp~i~~~gl~D-Y~R-~~g~~~g~vt~~~~t~tidq-dR~~~F~VD~~D~~Etn~~l~a~~i~~ 128 (319) T protein:vir:94 57 DA-----IFMEGRSFTVMKGDTTELKD-YKR-NATNEFDHPKIEETTYFLDQ-EKYWGRFVDALDRKDTEGNIDINYVVA 128 (319) T ss_pred ce-----EeccCcEEEEeeeccccccc-ccC-CCCcccCCcccceeEEEeec-ccccccccchhhHhhhhchhhHHHHHH Confidence 11 11256678877777666432 211 12233334455566655544 677888888888887532 22222334 Q ss_pred HHHHHHHHHHhhheeeeeehhhCceeeeecCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCcccccEEEeCHHH Q lcl|NC_021342. 159 RLAFRGAEEHSQSVAYFGDASRGMYGLFNNPNVTLSSATKDYKTMNGQELFNMLNAPIFSVINLSRRFHVPNTALMFPDL 238 (354) Q Consensus 159 ~aA~~~~a~~~n~~~f~G~~~~gi~GLlN~p~~~~~~~~~~W~~~T~~ei~~di~~~~~~l~~~s~g~~~p~~L~l~p~~ 238 (354) +.++..+.-..|...|--..... | . ... .+.|++.+++.|.++..+|.+. ++.....|+++|.. 
T Consensus 129 ~~~~~~v~PEiDay~~skla~~a--~-------~--~~~---~~~t~~n~y~~i~~a~~~Lde~--~VP~~Rvl~Vtp~~ 192 (319) T protein:vir:94 129 RQGAEVVAPYLDNLRFATLARNK--A-------K--HLT---VGTGSDAQYDAVLDVSVELDEI--KAPENRVLFVSPTF 192 (319) T ss_pred HHHHHHhhhhhhHHHHHHHHhhc--c-------c--ccc---cccCHHHHHHHHHHHHHHHHhc--CCCCCcEEEeCHHH Confidence 44555555555655443322211 0 0 001 1246788999999999999875 44445789999999 Q ss_pred HHHHhhc-ccCCCCCchHHHHHHhhCcccccccccceeeeeeeeeeccccccccccCcccEEEEEEcCcceEEEeeCchh Q lcl|NC_021342. 239 WNQANNQ-LMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQLDAAELAANGVSNSNKPRYMVYDKSDRNLAMANPIPF 317 (354) Q Consensus 239 ~~~L~~~-~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~~g~g~~g~d~~v~y~~~~~~~~~~vp~~~ 317 (354) |..|..- ++....+ +.+-... +|..-.+..++.++.+.... -+.+.+++...-.-..... ..+ T Consensus 193 ~~~L~~~~~f~~~~~--~~~~~~~-------~g~Vg~idG~~Vi~vps~~~-----k~in~i~~h~~A~~~~~k~--~~~ 256 (319) T protein:vir:94 193 YKGIKKFVIALPQGD--TRQQVLG-------KGVQGELDGFVIVKVPTKLL-----QGLQAIAVVGEVLASPIQA--DLA 256 (319) T ss_pred HHHHHhhhhhhcccc--cccccee-------eeeceeecCeEEEEeccccc-----ccceEEEEcCCeeeeeeee--eee Confidence 9999542 1111111 1111111 12211222222222221111 1223333332111111111 112 Q ss_pred hhccccccCceeEEeeeeeeeeEEEECCcee-EeeecC Q lcl|NC_021342. 318 RMLAPQMASLGITVPAEYKISGTEFRYPLCA-AYVDMA 354 (354) Q Consensus 318 ~~~~~~~~~l~~~~~~~~~~gGv~i~~P~ai-~y~D~~ 354 (354) +.+.+.++...|.+.+. .+.|+.|.+|... +|+... 
T Consensus 257 ~~~~p~~~~~a~~v~gr-~y~d~~V~~~k~~~Iy~~~~ 293 (319) T protein:vir:94 257 KTNSNIPGMFGTLAEQL-LYTGAFVPEHLQKYIFTIGG 293 (319) T ss_pred eccCCCccccceeeeee-eeeeeEEeccccceEEEeec Confidence 22222233335666654 4678999998832 355333 No 164 >protein:vir:97331 Length: 319 # NCBI annotation: ORF011 # Family: family:all:701 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240611;genbank:gi:66396278;genbank:GeneID:5133687 Probab=67.08 E-value=0.26 Score=23.81 Aligned_cols=288 Identities=10% Similarity=0.010 Sum_probs=120.1 Q ss_pred cccchhHHHHhhhhhhhcccccccccchhhhhhhhhhhccCCceeccchhhHHHHHH-HHHHHHHHHHHhhh-hcccchh Q lcl|NC_021342. 2 AIKTIDAQTIQGNQWLVHKGYVSRNGDQWVINNTALDAIGNPNIMLDADGGIAFYIS-QLAGIEATVYETPY-GDITYRF 79 (354) Q Consensus 2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~am~a~~~~~~~~da~~~~~fl~~-~L~~Id~~v~e~~~-~~l~~r~ 79 (354) --|+| ..--|. ++ +-+++-... .. ++--.-|++ ....+|+......+ .++..-+ T Consensus 1 ~~~~~-----~~~~~~-~~--------------~~~~~~~~~--~~--~~nt~~l~~k~~~~LD~~~~~~~~s~~~~~N~ 56 (319) T protein:vir:97 1 MNKTI-----KNATGM-LK--------------LNLQHFANK--SV--EPGQTLLKNKHVGILERVTAVNAYSTPALISN 56 (319) T ss_pred CCccc-----ccccce-eE--------------eehhhhhcc--CC--CcchHHHHHHHHHHHHHHHHHhhhhhhcccCc Confidence 00100 000000 00 000100000 00 111111221 11233332222111 1121111 Q ss_pred hccccCCCCCceeEEEEEeeccccceeEecCCCcccceeeeccceeEEEEEEEEeeEeecHHHHHHHHHhC-CCcchHHH Q lcl|NC_021342. 80 DVPMAANIPEYADTWMYRSYDGVTMGKFIGANGQDLPRVAQSAQMHTVPLGYAGNECHYTLDEMRKSAAMN-MPIDAEQA 158 (354) Q Consensus 80 ~v~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~v~~~~~~~~~pv~~~~~~~~~~~~El~~a~~~g-~~ld~~k~ 158 (354) -+ -+-+..++.....+..|-.. |.- .++...-+++..+.+..+-+ ..+|.+.+.+++..+..+ ........ 
T Consensus 57 ~~-----e~~gg~tVkIp~i~~~gl~D-Y~R-~~g~~~g~vt~~~~t~tidq-dR~~~F~VD~~D~~Etn~~l~a~~i~~ 128 (319) T protein:vir:97 57 DA-----IFMEGRSFTVMKGDTTELKD-YKR-NATNEFDHPKIEETTYFLDQ-EKYWGRFVDALDRKDTEGNIDINYVVA 128 (319) T ss_pred ce-----EeccCcEEEEeeeccccccc-ccC-CCCcccCCcccceeEEEeec-ccccccccchhhHhhhhchhhHHHHHH Confidence 11 11256678877777666432 211 12233334455566655544 677888888888887532 22222334 Q ss_pred HHHHHHHHHHhhheeeeeehhhCceeeeecCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCcccccEEEeCHHH Q lcl|NC_021342. 159 RLAFRGAEEHSQSVAYFGDASRGMYGLFNNPNVTLSSATKDYKTMNGQELFNMLNAPIFSVINLSRRFHVPNTALMFPDL 238 (354) Q Consensus 159 ~aA~~~~a~~~n~~~f~G~~~~gi~GLlN~p~~~~~~~~~~W~~~T~~ei~~di~~~~~~l~~~s~g~~~p~~L~l~p~~ 238 (354) +.++..+.-..|...|--..... | . ... .+.|++.+++.|.++..+|.+. ++.....|+++|.. T Consensus 129 ~~~~~~v~PEiDay~~skla~~a--~-------~--~~~---~~~t~~n~y~~i~~a~~~Lde~--~VP~~Rvl~Vtp~~ 192 (319) T protein:vir:97 129 RQGAEVVAPYLDNLRFATLARNK--A-------K--HLT---VGTGSDAQYDAVLDVSVELDEI--KAPENRVLFVSPTF 192 (319) T ss_pred HHHHHHhhhhhhHHHHHHHHhhc--c-------c--ccc---cccCHHHHHHHHHHHHHHHHhc--CCCCCcEEEeCHHH Confidence 44555555555655443322211 0 0 001 1246788999999999999875 44445789999999 Q ss_pred HHHHhhc-ccCCCCCchHHHHHHhhCcccccccccceeeeeeeeeeccccccccccCcccEEEEEEcCcceEEEeeCchh Q lcl|NC_021342. 239 WNQANNQ-LMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQLDAAELAANGVSNSNKPRYMVYDKSDRNLAMANPIPF 317 (354) Q Consensus 239 ~~~L~~~-~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~~g~g~~g~d~~v~y~~~~~~~~~~vp~~~ 317 (354) |..|..- ++....+ +.+-... +|..-.+..++.++.+.... -+.+.+++...-.-..... 
..+ T Consensus 193 ~~~L~~~~~f~~~~~--~~~~~~~-------~g~Vg~idG~~Vi~vps~~~-----k~in~i~~h~~A~~~~~k~--~~~ 256 (319) T protein:vir:97 193 YKGIKKFVIALPQGD--TRQQVLG-------KGVQGELDGFVIVKVPTKLL-----QGLQAIAVVGEVLASPIQA--DLA 256 (319) T ss_pred HHHHHhhhhhhcccc--cccccee-------eeeceeecCeEEEEeccccc-----ccceEEEEcCCeeeeeeee--eee Confidence 9999542 1111111 1111111 12211222222222221111 1223333332111111111 112 Q ss_pred hhccccccCceeEEeeeeeeeeEEEECCcee-EeeecC Q lcl|NC_021342. 318 RMLAPQMASLGITVPAEYKISGTEFRYPLCA-AYVDMA 354 (354) Q Consensus 318 ~~~~~~~~~l~~~~~~~~~~gGv~i~~P~ai-~y~D~~ 354 (354) +.+.+.++...|.+.+. .+.|+.|.+|... +|+... T Consensus 257 ~~~~p~~~~~a~~v~gr-~y~d~~V~~~k~~~Iy~~~~ 293 (319) T protein:vir:97 257 KTNSNIPGMFGTLAEQL-LYTGAFVPEHLQKYIFTIGG 293 (319) T ss_pred eccCCCccccceeeeee-eeeeeEEeccccceEEEeec Confidence 22222233335666654 4678999998832 355333 No 165 >protein:vir:98480 Length: 348 # NCBI annotation: ORFp38 # Family: family:all:1083 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958280;genbank:gi:41057254;uniprot:Q38595;genbank:GeneID:2732864 Probab=63.74 E-value=0.31 Score=23.36 Aligned_cols=278 Identities=11% Similarity=0.028 Sum_probs=111.0 Q ss_pred hhhccCCceeccchhhHHHHHHHHHHHHHHHH-HhhhhcccchhhccccCCCCCceeEEEEEeeccc-cc---eeEecCC Q lcl|NC_021342. 37 LDAIGNPNIMLDADGGIAFYISQLAGIEATVY-ETPYGDITYRFDVPMAANIPEYADTWMYRSYDGV-TM---GKFIGAN 111 (354) Q Consensus 37 m~a~~~~~~~~da~~~~~fl~~~L~~Id~~v~-e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~-G~---a~~~~~~ 111 (354) |. .-+..| .|-.++|+.+=.++. +.....+-..++||... . .++.+...... +. +.+.+.. 
T Consensus 1 M~----~~~~~d-----~~~~~~l~~~i~~~~~~~~~~~~l~~~~fp~~~-~----~~~~~~~~~~~~~~~~~a~~~~~~ 66 (348) T protein:vir:98 1 MS----WTLDTE-----FIEPTQLTGLIREALRDLQVNRFRLARWLPNVD-V----DDITFEFLRGGGGLAETASYRSWD 66 (348) T ss_pred Cc----chhhhh-----ccCHHHHHHHHHHHhhccCcchhhHHhcCCCcc-c----cceEEEEEeccCCceeeeeeecCC Confidence 11 111111 122344443323222 22333466678888532 1 12333332221 11 2333333 Q ss_pred Ccccceee-eccceeEEEEEEEEeeEeecHHHHHHHHHhCCC----cchH----HHHHHHHHHHHHhhheeeeee---hh Q lcl|NC_021342. 112 GQDLPRVA-QSAQMHTVPLGYAGNECHYTLDEMRKSAAMNMP----IDAE----QARLAFRGAEEHSQSVAYFGD---AS 179 (354) Q Consensus 112 ~~dip~v~-~~~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~----ld~~----k~~aA~~~~a~~~n~~~f~G~---~~ 179 (354) +. .|... ...+..+..+..++..+.++..|+...++.... .-.+ ...+.++..+...-++.+.|- .+ T Consensus 67 ~~-~~~~~r~g~~~~~~~~~~i~~~~~i~~~d~~~~~~~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~qal~~Gki~~~g 145 (348) T protein:vir:98 67 TE-SKIGRREGLAKVMGELPPISEKIPLNEYDRLRLRKLSRDEALPFIARDAQRLARNIGARFEVARGSALVNATVPVTE 145 (348) T ss_pred Cc-cceeecccceeeeeeccccccccccCHHHHHHhcCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCeEEEec Confidence 22 23222 234455666777778888888887664321110 0011 122222223333335555551 11 Q ss_pred hCceee-eecCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCcccccEEEeCHHHHHHHhhcccCCCCCchHHHH Q lcl|NC_021342. 180 RGMYGL-FNNPNVTLSSATKDYKTMNGQELFNMLNAPIFSVINLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQH 258 (354) Q Consensus 180 ~gi~GL-lN~p~~~~~~~~~~W~~~T~~ei~~di~~~~~~l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~ 258 (354) .+. .+ +..|.-...++++.|++.....++.||.+.+..+...+ | ..|..++|++..|..|.+ +..+.+. 
T Consensus 146 ~~~-~vDyg~~~~~~~t~~~~Ws~~~~adp~~di~~~~~~~~~~~-G-~~p~~~vm~~~~~~~l~~-------~~~i~~~ 215 (348) T protein:vir:98 146 LQQ-TVDFGRIGSHSVVAAVLWSVHATATPISDLESWVATYEDTN-G-QSPGVILMPKAAVSHMRQ-------CEEVIRQ 215 (348) T ss_pred Cce-EEccccCcccccccccccCCCCCCCHHHHHHHHHHHHHHcc-C-CcceEEEeCHHHHHHHhc-------CHHHHHH Confidence 111 11 22233223455678964333357899999998887643 3 468999999999998853 1122222 Q ss_pred HHhhCc--------------ccccccccceeeeeeeeeeccccccccccCcccEEEEEEcCcceEEEeeCch-------- Q lcl|NC_021342. 259 FMEANS--------------YTLLTGNELDIQIRFQLDAAELAANGVSNSNKPRYMVYDKSDRNLAMANPIP-------- 316 (354) Q Consensus 259 l~~n~~--------------~~~~~g~~l~I~~~~~L~~~~~~~~g~g~~g~d~~v~y~~~~~~~~~~vp~~-------- 316 (354) +.-.+. +....|.+ .|....+ .+.-.| ...+++ |+..-..+|.. T Consensus 216 ~~~~~~~~~~~~~~~~~~~~~~~~~g~~-~i~~~d~----~~~~~g----~~~~~~-----p~~~i~l~p~~~~~~~~~~ 281 (348) T protein:vir:98 216 VFPLAPSGTAPMVSVEQLNTVLSSMGLP-PIEVYDA----KVAVDG----VSTRIT-----PANAIALLPEPGATDAAQP 281 (348) T ss_pred HhccCccccccccCHHHHHHHHHhhCCe-EEEEeee----EEEcCC----ceecee-----cCCeEEEEecCCccccccc Confidence 211000 00001111 1211111 011111 111110 11111111110 Q ss_pred ----hhhcc--cc-----------------------ccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 317 ----FRMLA--PQ-----------------------MASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 317 ----~~~~~--~~-----------------------~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) .+... 
+| .+.....+...++ .=..+.+|.++..++|= T Consensus 282 ~~~G~t~~G~~~e~~~~~~~~~~~~~~~i~~~~~~~~dP~~~~~~~~s~-~lPv~~~~~~~~~a~Vl 347 (348) T protein:vir:98 282 TELGATLLGTTAESLEDDYALAPGEQPGIVAATWKTKDPVRLWTHAAAV-GIPVLREPNLTFKAQVL 347 (348) T ss_pred ccccceecccchhhhccccccceeccCceeeeeeeecCCcEEEEEEeee-eeccccCCCcEEEEEEe Confidence 00000 00 0111111112222 11335677777777776 No 166 >protein:vir:3364 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523335;genbank:gi:17570826;genbank:GeneID:927448 Probab=61.67 E-value=0.35 Score=23.09 Aligned_cols=302 Identities=10% Similarity=0.015 Sum_probs=124.3 Q ss_pred ccccchhhhhhhhh-hhccCCceeccchhhHHHHHHHHHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeeccc Q lcl|NC_021342. 24 SRNGDQWVINNTAL-DAIGNPNIMLDADGGIAFYISQLAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYDGV 102 (354) Q Consensus 24 ~~~~~~~~~~~~am-~a~~~~~~~~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~ 102 (354) +.++.-- . .| -..+..+..-|++ -.|+....-+|+...-+ .-..+.++.+.+-. +..++.+.. . T Consensus 1 ~~~~~~~---~-~~~t~~g~~~~~~~~~--al~ie~~~g~V~~~f~~----~s~~~~~v~~r~~~--~G~sv~i~~---i 65 (347) T protein:vir:33 1 MANIQGG---Q-QIGTNQGKGQSAADKL--ALFLKVFGGEVLTAFAR----TSVTMPRHMLRSIA--SGKSAQFPV---I 65 (347) T ss_pred CCCCccC---c-ccccccccCCcccchH--HHHHHHHHHHHHHHHHH----HHhhhhhhcccccc--ccceeEeee---c Confidence 1000000 0 00 0001111122222 13553223344443332 22344444443211 233444433 3 Q ss_pred cceeE--ecCCCcccce--eeeccceeEEEEEEEEeeEeecHHHHHHHHHhCCCcchHHHHHHHHHHHHHhhheee---- Q lcl|NC_021342. 103 TMGKF--IGANGQDLPR--VAQSAQMHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQARLAFRGAEEHSQSVAY---- 174 (354) Q Consensus 103 G~a~~--~~~~~~dip~--v~~~~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~aA~~~~a~~~n~~~f---- 174 (354) |..++ +.. +.+++. 
.+..-.+.++.+-.+- -+..-+.+++.++ +..++-.+-.+.+..++++..|+.++ T Consensus 66 G~~t~~~~~~-g~~l~~~~~~~~~~e~~ltiD~~~-y~~~~VddiD~~q-~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~ 142 (347) T protein:vir:33 66 GRTKAAYLKP-GENLDDKRKDIKHTEKVIHIDGLL-TADVLIYDIEDAM-NHYDVRAEYTAQLGESLAMAADGAVLAELA 142 (347) T ss_pred cceeeeeecC-CCCCCCCCCCCccceEEEEechhh-hhhHHHhhHHHHh-cCCchhHHHHHHHHHHHHHHHHHHHHHHHH Confidence 33332 221 122221 1122223333322211 1123356777776 45667777888899999999999886 Q ss_pred -eeeh---hhCceeeeecCCccc---cccccccc-ccCHHHHHHHHHHHHHHHHHHhCCcccccEEEeCHHHHHHHhhcc Q lcl|NC_021342. 175 -FGDA---SRGMYGLFNNPNVTL---SSATKDYK-TMNGQELFNMLNAPIFSVINLSRRFHVPNTALMFPDLWNQANNQL 246 (354) Q Consensus 175 -~G~~---~~gi~GLlN~p~~~~---~~~~~~W~-~~T~~ei~~di~~~~~~l~~~s~g~~~p~~L~l~p~~~~~L~~~~ 246 (354) .+.. .....+.+..+.... .+.++.|. .++++.|++.|.++..+|.++.--. ....++|+|..|..|..-. T Consensus 143 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~tg~~~d~~~~a~~i~~~i~~a~~~Lde~~VP~-~gR~~vv~P~~y~~Ll~~~ 221 (347) T protein:vir:33 143 GLVNLPDGSNENIEGLGKPTVLTLVKPTTGSLTDPVELGKAIIAQLTIARASLTKNYVPA-ADRTFYTTPDNYSAILAAL 221 (347) T ss_pred HhhhhhcccccccccccccccccccccccccccchhhhHHHHHHHHHHHHHHHhhcCCCc-cCcEEEeCHHHHHHHhccc Confidence 2211 111122222222111 11222232 2467889999999999998752211 2357999999999987522 Q ss_pred cCCCCCchHHHHHHhhCcccccccccceeeeeeeeeecccccccc---------c-------cCcccEEEEEEcC----- Q lcl|NC_021342. 247 MTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQLDAAELAANGV---------S-------NSNKPRYMVYDKS----- 305 (354) Q Consensus 247 ~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~~g~---------g-------~~g~d~~v~y~~~----- 305 (354) . .+-.+|.-... ...|....+-..+++++..+-..+. | +.+..+..++++. 
T Consensus 222 ~-----~~~~d~~~~~~---~~~G~V~~i~G~~V~~Sn~lp~~~~~~~~~~~~ag~~~~~~~~~~~~~~~a~~~~~gl~~ 293 (347) T protein:vir:33 222 M-----PNAANYQALLD---PERGTIRNVMGFEVVEVPHLTAGGAGDTREDAPADQKHAFPATSSTTVKVALDNVVGLFQ 293 (347) T ss_pred c-----ccccccccccc---cccceeEEEeceeEEEecccccCccccccccccccccccccCCcccceeccccceeeeee Confidence 1 11112211000 1123333344444444443321110 0 0011111111110 Q ss_pred -cceEEEeeCc--hhhhccccccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 306 -DRNLAMANPI--PFRMLAPQMASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 306 -~~~~~~~vp~--~~~~~~~~~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) ++-+...-.+ .++....+.+ ....+...... |+-+.||.+++-+-.- T Consensus 294 h~~A~g~v~~~~~~~e~~r~~~~-~~d~i~~~~~~-G~~vlrP~~av~i~~~ 343 (347) T protein:vir:33 294 HRSAVGTVKLKDLALERARRANY-QADQIIAKYAM-GHGGLRPEAAGAIVLP 343 (347) T ss_pred cchhheeeeeeceeeeeccchhh-hhHhhhhhhhc-CCceecccceEEEecC Confidence 1111111111 1222222222 23334444444 8999999998877444 No 167 >protein:vir:97031 Length: 402 # NCBI annotation: 31 # Family: family:all:2806 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654132;genbank:gi:108862016;genbank:GeneID:5075980 Probab=60.31 E-value=0.37 Score=22.92 Aligned_cols=297 Identities=8% Similarity=0.012 Sum_probs=126.3 Q ss_pred cccccchhhhhhhhhhhccCCceeccchhhHHHHHHHHHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeeccc Q lcl|NC_021342. 23 VSRNGDQWVINNTALDAIGNPNIMLDADGGIAFYISQLAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYDGV 102 (354) Q Consensus 23 ~~~~~~~~~~~~~am~a~~~~~~~~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~ 102 (354) +.+. .. ...|+-.-.++.--.|+..-.-++++..-+. =..+.++.+.+ +- +..|+.+.. 
+ T Consensus 1 Ms~~-n~----------~t~~~~~~s~~~~al~le~f~geV~taF~~~----si~~~~~~vrt-i~-~GkS~qf~~---i 60 (402) T protein:vir:97 1 MSTP-NT----------LTNVAVSASGEVDSLLIEKFNGKVNEQYLKG----ENILSYFDVQT-VT-GTNTVSNKY---L 60 (402) T ss_pred CCCc-cc----------ccccccccccchhhhhhhhhhhhHHHHHHHH----HhhcCcceeee-ec-ccceEEEEE---E Confidence 1110 00 0011111111222345544445666655331 11223333322 22 233443333 3 Q ss_pred cceeEec-CCCcccceeeeccceeEEEE--EEEEeeEeecHHHHHHHHHhCCC-cchHHHHHHHHHHHHHhhheeee--- Q lcl|NC_021342. 103 TMGKFIG-ANGQDLPRVAQSAQMHTVPL--GYAGNECHYTLDEMRKSAAMNMP-IDAEQARLAFRGAEEHSQSVAYF--- 175 (354) Q Consensus 103 G~a~~~~-~~~~dip~v~~~~~~~~~pv--~~~~~~~~~~~~El~~a~~~g~~-ld~~k~~aA~~~~a~~~n~~~f~--- 175 (354) |..++-. ..+..+-.....-++..+.| ..+...| +.+++.++ ..++ +..+-...+..++++..|+.++- T Consensus 61 G~~~a~y~~~G~~ldg~~~~~~k~~ItID~lL~a~~~---V~diDeaq-~~yD~vRse~s~e~G~ALA~~~Dq~ii~~i~ 136 (402) T protein:vir:97 61 GETELQVLAPGQSPNATPTQADKNQLVIDTTVIARNT---VAHIHDVQ-GDIDSLKPKLAMNQAKQLKRLEDQMAIQQML 136 (402) T ss_pred eeeEEeeeccccccCCCCcccccEEEEeCceeechhh---hhhHHHHH-hcccchhHHHHHHHHHHHHHHHHHHHHHHHH Confidence 4443311 00111111111112222222 1122222 35666654 4555 56667778889999999996642 Q ss_pred --eehhh----Cceeeeec-CCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCcccccEEEeCHHHHHHHhhcccC Q lcl|NC_021342. 176 --GDASR----GMYGLFNN-PNVTLSSATKDYKTMNGQELFNMLNAPIFSVINLSRRFHVPNTALMFPDLWNQANNQLMT 248 (354) Q Consensus 176 --G~~~~----gi~GLlN~-p~~~~~~~~~~W~~~T~~ei~~di~~~~~~l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~ 248 (354) |-... +..+...+ .+.+.. .+..=...+++.+.+-|.++..+|.++.-=... 
..++|+|..|..|..- + T Consensus 137 ~aa~a~t~~~~~~~~~~~~g~s~~~~-~t~~~a~~~~~~l~~ai~~a~~~LdEkdVP~~d-Rv~vv~P~~y~~Ll~~--~ 212 (402) T protein:vir:97 137 LGGIANTKAERNKPRVKGHGFSINVN-VTESEALANPQYVMAAVEYALEQQLEQEVDISD-VAIMMPWKFFNALRDA--D 212 (402) T ss_pred HhhccccccccccCcccccccccccc-cccchhhcCHHHHHHHHHHHHHHHHhcCCCccc-cEEEeChHHHHHHhhc--c Confidence 11111 11111111 111111 111112347888999999999888764211112 4899999999988752 0 Q ss_pred CCCCchHHHHHHhhCcccccccccceeeeeeeeeeccccc------------ccc-------ccCcccEEEEEEcCcceE Q lcl|NC_021342. 249 GYTDRTVMQHFMEANSYTLLTGNELDIQIRFQLDAAELAA------------NGV-------SNSNKPRYMVYDKSDRNL 309 (354) Q Consensus 249 ~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~------------~g~-------g~~g~d~~v~y~~~~~~~ 309 (354) .-.+. +|...+.. .-..|..+.+--++.+++..+-. .+. ++..+-++++|.+ +-+ T Consensus 213 rl~n~---d~~~~~~g-~~~~G~v~~v~Gv~Vv~SnnlP~~a~~it~~~ls~a~~G~~y~~t~d~t~~~~~~f~~--~Av 286 (402) T protein:vir:97 213 RIVDK---TYTISQSG-ATINGFVLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTS--DAL 286 (402) T ss_pred cccch---hhccccCC-ccccceeEEEeceEEEecCccccccccccccccccCCCCccCCcCcccceeEEEEEec--ceE Confidence 00111 22211110 01123333333333333333210 110 1223446777754 322 Q ss_pred EEeeCchhhhc-cccccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 310 AMANPIPFRML-APQMASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 310 ~~~vp~~~~~~-~~~~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) .-.-.++++.- --+.+...|.+.+.... 
|+-.+||.++.-+-+- T Consensus 287 ~tvk~~~vT~~~~~d~r~~~~~id~~~a~-G~g~~RPeaa~vv~~~ 331 (402) T protein:vir:97 287 LVGRTIEVTGDIFYEKKEKTYYIDTFMAE-GAIPDRWEAVSVVTTK 331 (402) T ss_pred EEEEeeccccchhhchhHHHHHHHHHHHh-CCcccCccceEEEEEe Confidence 22223444332 22445555666666665 6999999998877222 No 168 >protein:vir:7019 Length: 401 # NCBI annotation: major capsid protein # Family: family:all:2806 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853592;genbank:gi:31711674;genbank:GeneID:1481800 Probab=47.82 E-value=0.69 Score=21.47 Aligned_cols=291 Identities=8% Similarity=-0.009 Sum_probs=125.0 Q ss_pred cccccchhhhhhhhhhhccCCceeccchhhHHHHHHHHHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeeccc Q lcl|NC_021342. 23 VSRNGDQWVINNTALDAIGNPNIMLDADGGIAFYISQLAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYDGV 102 (354) Q Consensus 23 ~~~~~~~~~~~~~am~a~~~~~~~~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~ 102 (354) +.+.++. .. -+.+ + -++.--.|+..-.-++++..-+. =..+.++.+.+ +. +..|+.++. . T Consensus 1 Ms~~n~~-t~-----~~~~-~----sg~~~al~Le~f~GeV~taF~~~----si~~~~~~vRt-i~-~gkS~qf~~---~ 60 (401) T protein:vir:70 1 MSTPNNL-TN-----VAVS-A----SGEVDSLLIEKFNGKVNEQYLKG----ENIMSYFDVQT-VT-GTNTVSNKY---L 60 (401) T ss_pred CCCCccc-cc-----cccc-c----ccchhHhHHhHhcchHHHHHHHH----hhhcccceeee-ec-ccceEEEEE---e Confidence 1111000 00 0000 0 01112245544444555555322 11223333332 11 223333332 3 Q ss_pred cceeEec-CCCcccceeeeccceeEEEE--EEEEeeEeecHHHHHHHHHhCCC-cchHHHHHHHHHHHHHhhheeeeeeh Q lcl|NC_021342. 103 TMGKFIG-ANGQDLPRVAQSAQMHTVPL--GYAGNECHYTLDEMRKSAAMNMP-IDAEQARLAFRGAEEHSQSVAYFGDA 178 (354) Q Consensus 103 G~a~~~~-~~~~dip~v~~~~~~~~~pv--~~~~~~~~~~~~El~~a~~~g~~-ld~~k~~aA~~~~a~~~n~~~f~G~~ 178 (354) |+.+.-. ..+..+-.....-++..+.| ..+. +.-+.+|+.++ .-.+ +..+-....-.++++..|+.++-=.. 
T Consensus 61 G~s~~~~~~pG~~ld~~~~~~dK~~ItID~lL~a---~~~V~dlDe~q-~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~ 136 (401) T protein:vir:70 61 GETELQVLAPGQSPAATSTQADKNQLVIDATVIA---RNTVAHLHDVQ-GDIDSLKPKLATNQAKQLKRMEDEMLIQQMM 136 (401) T ss_pred eeeEeeeecCCCCcCCCCcccccEEEEeCceeeh---hhhhhhHHHHH-hcccccchHHHHHHHHHHHHHHHHHHHHHHH Confidence 4433211 01111111111122222222 2222 22234555554 3444 45566666777888877775521111 Q ss_pred hhCceeeee------cCC------cccccccccccccCHHHHHHHHHHHHHHHHHHhCCc-ccccEEEeCHHHHHHHhhc Q lcl|NC_021342. 179 SRGMYGLFN------NPN------VTLSSATKDYKTMNGQELFNMLNAPIFSVINLSRRF-HVPNTALMFPDLWNQANNQ 245 (354) Q Consensus 179 ~~gi~GLlN------~p~------~~~~~~~~~W~~~T~~ei~~di~~~~~~l~~~s~g~-~~p~~L~l~p~~~~~L~~~ 245 (354) . -|+-| .|. .-..+...+=...+++++.+.|.++..+|.++ .+ .....+++||.-|..|... T Consensus 137 ~---aa~ana~~~~~~p~~~~~G~~i~v~~~~~~~~~~~~~l~~ai~dA~~~LdEk--dVP~~r~vvl~pp~~Ys~Ll~~ 211 (401) T protein:vir:70 137 L---GGIANTQAKRTNPRVKGHGFSINVEVAEGEALVNPQYVMAAVEFALEQQLEQ--EVDISDVAILMPWRYFNVLRDA 211 (401) T ss_pred H---hccccccccccCCCcCCCceEEeccccccccccCHHHHHHHHHHHHHHHHhc--CCCccceEEEcCHHHHHHHHhc Confidence 1 11111 010 00111122222357899999999999998875 32 2346788889998777653 Q ss_pred --ccCCCCCchHHHHHHh-hCcccccccccceeeeeeeeeeccccc-----------------ccc--ccCcccEEEEEE Q lcl|NC_021342. 246 --LMTGYTDRTVMQHFME-ANSYTLLTGNELDIQIRFQLDAAELAA-----------------NGV--SNSNKPRYMVYD 303 (354) Q Consensus 246 --~~~~~~~~Tvl~~l~~-n~~~~~~~g~~l~I~~~~~L~~~~~~~-----------------~g~--g~~g~d~~v~y~ 303 (354) .++ .+|-.. ++. -..|..+.+--++.+++..+-. ... ++-.+-++++|. T Consensus 212 d~L~n-------rd~~~s~~g~--~~~G~v~~vaGv~Vv~SnnlP~~a~~it~~~ls~a~~G~~y~~~~d~s~~~~v~f~ 282 (401) T protein:vir:70 212 DRIVD-------KTYTISQSGA--TIQGFTLSSYNCPVIPSNRFPKYSQGQTHHLLSNEDNGYRYDPLPAMNGAIAVLFT 282 (401) T ss_pred Ccccc-------hhhccccCCc--cccceEEEEeceEEEeeccccccccccccccccccCCCccCCCCccccceeEEEEe Confidence 111 122111 111 1233333444444444433311 000 233455777776 Q ss_pred cCcceEEEeeCchhhhc-cccccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 
304 KSDRNLAMANPIPFRML-APQMASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 304 ~~~~~~~~~vp~~~~~~-~~~~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) ++.=-. .-.+|++.- --+.+...|.+.+.... |+-.+||.|++-+-.+ T Consensus 283 ~~Av~t--vk~~~lt~~~~~d~r~~~~~id~~~a~-g~g~~RPeaa~vv~~k 331 (401) T protein:vir:70 283 ADALLV--GRSIDVTGDIFYEKKEKTYYIDTFMAE-GAIPDRWEAVSVVTTK 331 (401) T ss_pred hhheEE--EEeeccccchhhhhhhhHHHHHHHHHh-CCcccchhheEEEeec Confidence 552222 222333321 12445566666677766 6999999999887544 No 169 >protein:vir:79928 Length: 393 # NCBI annotation: major head protein # Family: family:all:30335 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429616;genbank:gi:156564106;genbank:GeneID:5525693 Probab=44.98 E-value=0.79 Score=21.16 Aligned_cols=320 Identities=12% Similarity=0.091 Sum_probs=132.2 Q ss_pred CcccchhHHHHhhhhhhh--cccccccc-cchhhhhhhhhhhccCCceeccchhhHHHHHHHHHHHHHHHHHhhhhcccc Q lcl|NC_021342. 1 MAIKTIDAQTIQGNQWLV--HKGYVSRN-GDQWVINNTALDAIGNPNIMLDADGGIAFYISQLAGIEATVYETPYGDITY 77 (354) Q Consensus 1 ~~~~~~~~~~~~~~~~~~--~~~~~~~~-~~~~~~~~~am~a~~~~~~~~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~ 77 (354) -.....|++.++=|+-=+ |-++..-. ++-. +-+ .+..-+.+.+++ .+++. .-|...+.|.+.|-.-. T Consensus 32 et~~e~~~~~~~~~~~e~el~E~f~Kmm~G~~p-----~~e-V~~~e~mtt~~a--~IliP--~vis~v~~Eaaepl~~~ 101 (393) T protein:vir:79 32 ETLAEADANKLALNEEETQILESFAKMMEGETP-----TNE-VNLREFMATPSA--QILIP--RVIVGTMREAAEPLYIG 101 (393) T ss_pred hhhhhhhhhhhhcchhHHHHHHHHHHHhcCCCc-----hhh-eehhhhhcCCCc--ceech--hhhhhhhhhcccchhHH Confidence 122233444443222111 11111100 0000 000 001111111122 22322 33444445533333223 Q ss_pred hhhccccCCCCCceeEEEEEeeccccceeEecCCCcccceeeec---cceeEEEEEEEEeeEeecHHHHHHHHHhCCCcc Q lcl|NC_021342. 78 RFDVPMAANIPEYADTWMYRSYDGVTMGKFIGANGQDLPRVAQS---AQMHTVPLGYAGNECHYTLDEMRKSAAMNMPID 154 (354) Q Consensus 78 r~~v~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~v~~~---~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld 154 (354) .+++.-.. +..| ++..|..+. +=++.-++++.. +|..+.+ .+.....+-..+....||.+=+. 
..|.++- T Consensus 102 ~kl~qk~~-L~~G-rsm~F~~~g-~~Ra~~IgEGgE-~~~~sld~~T~dsv~~~~gK~G~~Ia~SqEmIs---DSg~Dvi 174 (393) T protein:vir:79 102 TKMLQKIR-LKSG-QSMIFPSIG-IMRAYDVAEGQE-IPEDSIDWQTHESPEIRVGKSGIRLRFTDEMIS---DSQWDLM 174 (393) T ss_pred HHHHHHHh-hhcC-cceeccchh-eeeecccccccc-ccccchhhhcCCceeEEechhhhhhhhHHHHhh---cchHHHH Confidence 33332110 1000 111111110 111122222211 3333332 23444556666677777655444 4578888 Q ss_pred hHHHHHHHHHHHHHhhheeeeeehhhCceeeeecCCcccc-cccccccc-cCHHHHHHHHHHHHHHHHHHhCCcccccEE Q lcl|NC_021342. 155 AEQARLAFRGAEEHSQSVAYFGDASRGMYGLFNNPNVTLS-SATKDYKT-MNGQELFNMLNAPIFSVINLSRRFHVPNTA 232 (354) Q Consensus 155 ~~k~~aA~~~~a~~~n~~~f~G~~~~gi~GLlN~p~~~~~-~~~~~W~~-~T~~ei~~di~~~~~~l~~~s~g~~~p~~L 232 (354) .-...+|-|+++++-+..+|+|.+..|.+-+=+-+.-+.. ..+-+..+ -.+.=.++||.+++-++.. .+..|.+| T Consensus 175 n~~l~aA~RaMaRkKee~a~n~fk~~ghtvfDa~st~t~ahptGr~~~~~qNGTlSleDllDm~~av~~---~hyt~svi 251 (393) T protein:vir:79 175 SMMIKQAGRAMGRHKEQKAYHQFRSHGHTVFDNYSTNKLAHTTGLDKNGVQNDTFSAEDFLDLIIAVMA---NEYTPSDL 251 (393) T ss_pred HHHHHHHHHHHHhhhHHHHHhhhhcccceeeeccccCccceeecCCccccccccccHHHHHHHHHHHhc---ccCCcceE Confidence 8899999999999999999999998876433222222211 11111111 1122246788888766643 46789999 Q ss_pred EeCHHHHHHHhhcc------cCCCCCchHHHHHHhhCcccc------cccc---cceeeeeeeeeeccccccccccCccc Q lcl|NC_021342. 233 LMFPDLWNQANNQL------MTGYTDRTVMQHFMEANSYTL------LTGN---ELDIQIRFQLDAAELAANGVSNSNKP 297 (354) Q Consensus 233 ~l~p~~~~~L~~~~------~~~~~~~Tvl~~l~~n~~~~~------~~g~---~l~I~~~~~L~~~~~~~~g~g~~g~d 297 (354) +|.|-+|+.+.+.. .+..+ .|-.+..+-+. +.|+ +++|.-.|.+.-. - ... 
T Consensus 252 ~MHPLAWnv~AKna~me~~~~na~g-----N~~~~~~~ts~algp~~i~~~~~~nlnv~~sPfvp~d---~------k~~ 317 (393) T protein:vir:79 252 MMHPLAWTVFAKNELMGSLQANPYG-----NYPAKGAPSSMALGPDSIQGRLPFNFNVNLSPFIPLD---K------KSR 317 (393) T ss_pred EEcCchhhhhhhhhhhcceeecccc-----ccCccccchhhhhchhhhccccccceeEEEecccccc---c------ccc Confidence 99999999876531 11000 00000000000 1112 3444444433211 0 123 Q ss_pred EEEEEEcCcceEEEeeCch-hhhccccccCc-eeEEeeeeeeeeEEEECCceeEe-eecC Q lcl|NC_021342. 298 RYMVYDKSDRNLAMANPIP-FRMLAPQMASL-GITVPAEYKISGTEFRYPLCAAY-VDMA 354 (354) Q Consensus 298 ~~v~y~~~~~~~~~~vp~~-~~~~~~~~~~l-~~~~~~~~~~gGv~i~~P~ai~y-~D~~ 354 (354) |+=.|.-|..++...+.-+ ++.-.-+-+.- -..++..+|.|==++---.+|+. ..|. T Consensus 318 rFd~~~Vd~NnvgvlLV~D~i~tdq~ddk~rdiq~iKl~ERYG~gvLn~gkaiavakNI~ 377 (393) T protein:vir:79 318 RFDVYAVDRNNVGVLLVRDDLKTDQWDEKARGLQNIKMIERYGIGILNEGKAIAVAKNIS 377 (393) T ss_pred eeeEEEeecCCceEEEEecCcceeccccccccceeeeeeeeeceeeeeCCceEEEEecce Confidence 5555555555555444222 21111111111 24555566654213333344432 4444 No 170 >protein:vir:4074 Length: 480 # NCBI annotation: major capsid (head) protein # Family: family:all:11745 # MgeID: mge:85 # MgeName: c2 # Cross-refs: genbank:acc:NP_043553;genbank:gi:9628687;genbank:GeneID:1261180 Probab=33.80 E-value=1.3 Score=19.90 Aligned_cols=302 Identities=11% Similarity=0.026 Sum_probs=97.1 Q ss_pred CcccchhHHHHhhh-----hhh-hcc----cccccccch--hhhhhhhhhhccCCceeccchhhHHHHHHHHHHHHHHHH Q lcl|NC_021342. 1 MAIKTIDAQTIQGN-----QWL-VHK----GYVSRNGDQ--WVINNTALDAIGNPNIMLDADGGIAFYISQLAGIEATVY 68 (354) Q Consensus 1 ~~~~~~~~~~~~~~-----~~~-~~~----~~~~~~~~~--~~~~~~am~a~~~~~~~~da~~~~~fl~~~L~~Id~~v~ 68 (354) .+.+. ++.+.+. .-+ .++ ........+ ......+..+... +.....|+..--+..+.... 
T Consensus 145 e~~~~--~~el~akl~el~k~~ee~k~~~~~~~~~~~~~~~~~~e~r~~~~~~~------~~~e~~~~~~~~~~~~~~~~ 216 (480) T protein:vir:40 145 EAGVK--VRELEAKVEELNKEREELKKEREASIPSEKPEDAERKFMRELGSKMA------EMPEQGFLREFANGADLNVV 216 (480) T ss_pred hhhhh--hhhHHHHHHHHHhHHHHHhhhhhhhccccchhhhhhHHHHHHHHHhc------cchhhhhhhhhhhhcccccc Confidence 11110 0111100 000 000 000000000 0000000000000 00000111100000000000 Q ss_pred Hh-hhhcccchhhccccCCCCCceeEEEEEeeccccceeEecCCCcc-cceeeecc--ceeEEEEEEEEeeEeecHHHHH Q lcl|NC_021342. 69 ET-PYGDITYRFDVPMAANIPEYADTWMYRSYDGVTMGKFIGANGQD-LPRVAQSA--QMHTVPLGYAGNECHYTLDEMR 144 (354) Q Consensus 69 e~-~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~d-ip~v~~~~--~~~~~pv~~~~~~~~~~~~El~ 144 (354) .. -..+-...+.+.+... ...+.+ ....+...+.++ .....+.. ....-+......-+.+...++. T Consensus 217 ~~~~~~~~~~~~~~~~~~~----~~~~~~------~~~~~~~~g~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~v~~l~ 286 (480) T protein:vir:40 217 NSLGSITSKYARKSGIYDG----AMKARF------QGLTLAEDGVDDTFISGTFKAGTDKNKSQTATKRSLRPQMAEAYL 286 (480) T ss_pred ccccccccchhhheeechh----hhhhhh------hcceeeeccccceeeeeeeecccccccccccccchhhHHHHHHHH Confidence 00 0000000000000000 000000 000000000000 00000000 0000000000000111112222 Q ss_pred ---HHHHhCC----CcchHHHHHHHHHHHHHhhheeeeeeh--hhCceeeeecCCccccccccccc-ccCHHHHHHHHHH Q lcl|NC_021342. 145 ---KSAAMNM----PIDAEQARLAFRGAEEHSQSVAYFGDA--SRGMYGLFNNPNVTLSSATKDYK-TMNGQELFNMLNA 214 (354) Q Consensus 145 ---~a~~~g~----~ld~~k~~aA~~~~a~~~n~~~f~G~~--~~gi~GLlN~p~~~~~~~~~~W~-~~T~~ei~~di~~ 214 (354) .+.+.-+ +|..--....+..+...|++-+++|+. ..+..|+.+. ...|+ ..++++.+.++.. 
T Consensus 287 ~~~k~t~~lLDDa~~l~~~i~~~l~~~~~~~ee~a~l~G~g~g~~~~~g~~~~--------~~~~~~~~~~~d~id~L~~ 358 (480) T protein:vir:40 287 QMDKATVRGVNDSGALSEYVMSEMVNRVIQKVEYNMILGSVDGSNGFYGLKTA--------TDGWTKQIEYTDLFEGITD 358 (480) T ss_pred HhHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccccceee--------cccccccchhHHHHHHHHH Confidence 2222211 254556677888999999999999953 3344444322 11232 2456666666666 Q ss_pred HHHHHHHHhCCcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeeeeeeeccccccc---c Q lcl|NC_021342. 215 PIFSVINLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQLDAAELAANG---V 291 (354) Q Consensus 215 ~~~~l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~~g---~ 291 (354) ++.+... ..+.+++|+|..|..|-+. -+..|. ||-+.. ...|.+-.+...|.+++......+ + T Consensus 359 al~~~y~-----~~a~~~vmn~~t~~~I~kl--KD~~G~----Yi~q~~---~~~~~~~~llG~pvv~~~~~~~~~~~~~ 424 (480) T protein:vir:40 359 AVAECSI-----SDAITIVMSPQTFAELRKA--KGTDGH----SRFNEL---ATKEQIAQSFGAVNLETRVWMPKDEVAV 424 (480) T ss_pred hhhHHhh-----CCCCEEEECHHHHHHHHHh--hcCCCC----eeccCc---ccccCcceecccceeeeeccccCCccee Confidence 6544322 2344799999999988543 355453 332211 112333333333333322111111 1 Q ss_pred ccCcccEEEEEEcCcceEEEeeCchhhhccccccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 292 SNSNKPRYMVYDKSDRNLAMANPIPFRMLAPQMASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 292 g~~g~d~~v~y~~~~~~~~~~vp~~~~~~~~~~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) + ++.....+.+++-++++-+ .+++. .-.+-++.++||.. 
++|.++.|+-+- T Consensus 425 ~-~~~~~~~~~d~~~~~~~~~---~~~~~-------~~~~~~e~~v~g~~-~~~~~~~~~~~~ 475 (480) T protein:vir:40 425 Y-NHDEYVLIGDLNVENYNDF---DLRYN-------VEQWLSETLVGGSI-RGKNRSAYLKKK 475 (480) T ss_pred e-eCCccEEEEecccceeccc---ccccc-------hhhhhhhhhhceee-EccccEEEEEec Confidence 2 2223344445543332211 11111 12344577887655 999999997666 No 171 >protein:vir:4902 Length: 348 # NCBI annotation: gp348 # Family: family:all:1083 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056680;genbank:gi:9635015;genbank:GeneID:1262657 Probab=31.23 E-value=1.5 Score=19.60 Aligned_cols=288 Identities=10% Similarity=0.028 Sum_probs=108.8 Q ss_pred CCceeccchhhHHHHHHHHHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEeecccc-ceeEecCCCcccceeee Q lcl|NC_021342. 42 NPNIMLDADGGIAFYISQLAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRSYDGVT-MGKFIGANGQDLPRVAQ 120 (354) Q Consensus 42 ~~~~~~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~~~~~G-~a~~~~~~~~dip~v~~ 120 (354) =+.+ .| .|..++|+..-+.+- .....+-...+||.. +.. ..+...+....... .|.+++..+.....-.. T Consensus 1 M~~l-~d-----~f~~~~l~~~v~~~~-~~~~~~l~~~~Fp~~-~~~-~~~~~~~~~~~~~~~~a~~v~~~~~~~~~~r~ 71 (348) T protein:vir:49 1 MGLI-YD-----KVTASNIAGYFNALQ-ENVDSTLGESIFPAR-KQL-GTKLSYITGASGQSVALKAAAFDTNVTVRDRV 71 (348) T ss_pred Ccch-hh-----hcCHHHHHHHHHhcc-ccchhhhHhhcCCCc-ccc-CceeEEEEeecCceeeeeeecCCCCcceeccc Confidence 0110 01 133333322111111 112233345667742 211 12222222222222 22344444333222233 Q ss_pred ccceeEEEEEEEEeeEeecHHHHHHHHHhCCCcchHHHH---------------HHHHHHHHHhhheeeee---ehhhCc Q lcl|NC_021342. 121 SAQMHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQAR---------------LAFRGAEEHSQSVAYFG---DASRGM 182 (354) Q Consensus 121 ~~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k~~---------------aA~~~~a~~~n~~~f~G---~~~~gi 182 (354) ..+..+..+..+.....++..|+...+...-+-....+. ..++..+...-++.+.| ..+.|. 
T Consensus 72 ~~~~~~~~~p~i~~~~~i~~~d~~~l~~~~~~~~~~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~qal~~Gki~i~~~g~ 151 (348) T protein:vir:49 72 SAEMHDEQMPFFKEAMLVKENDRQQLNLVKDSGNAALVNTIVAGIFNDNLTLVNGARARLEAMRMQVLATGKIAFTSDGV 151 (348) T ss_pred ceeeeeeecCccccccccCHHHHHHHHHHhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCeEEEecCCc Confidence 345556677778888888887765554443222211111 22333333333444555 112121 Q ss_pred -eee-eecCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCcccccEEEeCHHHHHHHhh-----cccC----CCC Q lcl|NC_021342. 183 -YGL-FNNPNVTLSSATKDYKTMNGQELFNMLNAPIFSVINLSRRFHVPNTALMFPDLWNQANN-----QLMT----GYT 251 (354) Q Consensus 183 -~GL-lN~p~~~~~~~~~~W~~~T~~ei~~di~~~~~~l~~~s~g~~~p~~L~l~p~~~~~L~~-----~~~~----~~~ 251 (354) +.+ +..|.-...++++.|++.++ +++.||.+.+..+.+ + |. .|.+++|+++.|..|.+ ..+. ... T Consensus 152 ~~~vdyg~~~~~~~t~~~~W~~~~a-dp~~di~~~~~~~~~-~-G~-~~~~ii~~~~~~~~l~~~~~v~~~~~~~~~~~~ 227 (348) T protein:vir:49 152 NKDIDYGVKPDHKKQVSKSWAEPGA-TPLADLEDAIETARE-L-GL-NPERAVMNAKTFGLIRKAASTVKVIKPLAGDGS 227 (348) T ss_pred eEEEeecCCcccceeeeeccCCCCC-CHHHHHHHHHHHHHh-c-CC-cccEEEeCHHHHHHHhcCHHHHHHhhccCcccc Confidence 110 11122122334567998665 589999999877754 3 64 79999999999998854 1111 111 Q ss_pred Cc---hHHHHHHhhCcccccccccceeeeeeeeeecccccccccc--CcccEEEEEEcCc-ceEEEeeC-ch-------- Q lcl|NC_021342. 252 DR---TVMQHFMEANSYTLLTGNELDIQIRFQLDAAELAANGVSN--SNKPRYMVYDKSD-RNLAMANP-IP-------- 316 (354) Q Consensus 252 ~~---Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~~g~g~--~g~d~~v~y~~~~-~~~~~~vp-~~-------- 316 (354) .+ .+.+++... .|. .|.....-.. ...|... --.+.++....+. -......+ +. T Consensus 228 ~i~~~~~~~~~~~~------~g~--~i~~y~~~y~---d~dG~~~~~~p~~~v~l~~~~~~G~~~yg~~~e~~~~~~~~~ 296 (348) T protein:vir:49 228 SVTKAELDNYIADN------FGV--TVVLENGTYR---NEKGEVSKFFPDGHLTLIPNGPLGNTVFGTTPEESDLFADNT 296 (348) T ss_pred cccHHHHHHHHHhh------cCc--eEEEEeeEEE---ecCCcEeeeecCCeEEEecCCCcceeEEecChhhhhhccccc Confidence 11 223333222 122 2222111110 0111000 0001111111100 00001000 00 Q ss_pred ----hhhcc--------ccccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 
317 ----FRMLA--------PQMASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 317 ----~~~~~--------~~~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) ++..+ .+.+.....+...++ .=-.+.+|.++..+++- T Consensus 297 ~~~~~~~~~~~~~~~~~~~~dP~~~~~~~~s~-~lPv~~~~~~~~~a~Vl 345 (348) T protein:vir:49 297 VNADVEIVDNGIAVTTTKTTDPVNVQTKVSMV-ALPSFERLDDVYMLTVI 345 (348) T ss_pred cccceeecCCeEEEeeeecCCCceEEEEEeee-ccccccCCCcEEEEEEe Confidence 00000 000000111111111 11235677777777766 No 172 >protein:vir:95603 Length: 463 # NCBI annotation: ORF016 # Family: family:all:2450 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240903;genbank:gi:66394965;genbank:GeneID:5132544 Probab=30.22 E-value=1.6 Score=19.47 Aligned_cols=275 Identities=13% Similarity=0.112 Sum_probs=122.4 Q ss_pred CcccchhHHHHhhhhhhhcccccccccchhhhhhhhhhhccCCceec-cchh-hHHHHHHHHHHHHHHHHHhhh--hccc Q lcl|NC_021342. 1 MAIKTIDAQTIQGNQWLVHKGYVSRNGDQWVINNTALDAIGNPNIML-DADG-GIAFYISQLAGIEATVYETPY--GDIT 76 (354) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~am~a~~~~~~~~-da~~-~~~fl~~~L~~Id~~v~e~~~--~~l~ 76 (354) |--|.-|+|+.-.++ ++.+ .|.+...+.=++ |.+- ++++.. +.+|+.+....+ .+++ T Consensus 3 ~~~~~~~~~~~~~~~----------~~e~------~~KS~~tg~g~~p~~q~~~~AlR~---EsL~~~i~~Lt~~~~~f~ 63 (463) T protein:vir:95 3 IEKNLSDVQQKYADQ----------FQED------VVKSFQTGYGITPDTQIDAGALRR---EILDDQITMLTWTNEDLI 63 (463) T ss_pred cccccchHHHHHHhh----------hhHH------HHHHhhcCCccCCccccCcchhhh---hhhhhhhheeeecccchh Confidence 333445555433222 2111 122222222112 2222 335555 444555543222 3444 Q ss_pred chhhccccCCCCCceeEEEEE-eeccccceeEecCCCcccceeeeccceeEEEEEEEEeeEeecHHHHHHHHHhCCCcch Q lcl|NC_021342. 77 YRFDVPMAANIPEYADTWMYR-SYDGVTMGKFIGANGQDLPRVAQSAQMHTVPLGYAGNECHYTLDEMRKSAAMNMPIDA 155 (354) Q Consensus 77 ~r~~v~v~~~~~~~~~~~~~~-~~~~~G~a~~~~~~~~dip~v~~~~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~ 155 (354) .-+-++- .+.......|+.. ....+|.+..++.. ...+..+.++.|++..+..++.....|...-. +-...+... 
T Consensus 64 ~~~~i~k-~~a~STV~~y~~~~~~G~~g~~~f~~E~-g~~~~~d~~~~Rr~~~~K~l~~~~~VS~~~~l--~n~~~d~~~ 139 (463) T protein:vir:95 64 FYRDISR-RPAQSTVVKYDQYLRHGNVGHSRFVKEI-GVAPVSDPNIRQKTVSMKYVSDTKNMSIASGL--VNNIADPSQ 139 (463) T ss_pred hhhhcCC-chhhhhhhhheeeeccCccccccccccc-cccccCCCceEEEEEEeeeeehhhhhhhHHHh--hcccccHHH Confidence 4343432 2333333333332 33445666665554 33567788888898888888877776554433 333557777 Q ss_pred HHHHHHHHHHHHHhhheeeeeehhhCc---------eeeee--cCCcccccccccccccCHHHHHHHHHHHHHHHHHHhC Q lcl|NC_021342. 156 EQARLAFRGAEEHSQSVAYFGDASRGM---------YGLFN--NPNVTLSSATKDYKTMNGQELFNMLNAPIFSVINLSR 224 (354) Q Consensus 156 ~k~~aA~~~~a~~~n~~~f~G~~~~gi---------~GLlN--~p~~~~~~~~~~W~~~T~~ei~~di~~~~~~l~~~s~ 224 (354) ...+.|...+++......||||+.+.= .||.| +|. .+..+.+.- ..+ +.|+++-..+ +. T Consensus 140 ~~~~dai~~ia~tiE~a~FyGds~l~~~~~~~gleFDGl~~lId~e-nviDarG~~----Ls~--~~ln~Aa~~i---~~ 209 (463) T protein:vir:95 140 ILTEDAIAVVAKTIEWASFYGDASLTSEVEGEGLEFDGLAKLIDKN-NVINAKGNQ----LTE--KHLNEAAVRI---GK 209 (463) T ss_pred HHHHHHHHHHHHHHHHHHhhhhhccCCCcCccccchhhhhhhcCCC-CeeecCCCc----ccH--HHHhhhhhhh---hc Confidence 788889999999999999999886533 33322 111 111222111 111 3355443333 44 Q ss_pred CcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeeeeeeeccccccccccCcccEEEEEEc Q lcl|NC_021342. 225 RFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQLDAAELAANGVSNSNKPRYMVYDK 304 (354) Q Consensus 225 g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~~g~g~~g~d~~v~y~~ 304 (354) ++-.|+-+.||......|.+..++... .+...|......|.++. . |-. T Consensus 210 ~fGt~TD~~lp~~vka~f~~~~l~~qr------v~~~~N~~~~~~G~~v~-----~---------------------f~s 257 (463) T protein:vir:95 210 GFGTATDAYMPIGVHADFVNSILGRQM------QLMQDNSGNVNTGYSVN-----G---------------------FYS 257 (463) T ss_pred ccCChhheecchHHHHHHHHHhcCceE------EEEcCCCCceeeeeecc-----c---------------------eee Confidence 778899999999999999754332211 11111111111111100 0 111 Q ss_pred CcceEEEeeCchhhhccccccCceeEEeeeeeeeeEEEECCce----eEeeecC Q lcl|NC_021342. 
305 SDRNLAMANPIPFRMLAPQMASLGITVPAEYKISGTEFRYPLC----AAYVDMA 354 (354) Q Consensus 305 ~~~~~~~~vp~~~~~~~~~~~~l~~~~~~~~~~gGv~i~~P~a----i~y~D~~ 354 (354) ..-.++++--.-+. . +..+.-. + -.-|.+ .+-.++. T Consensus 258 ~~G~I~L~~s~~m~-~---~~il~~~-----~-----~~~p~ap~~~~~tatv~ 297 (463) T protein:vir:95 258 SRGFIKLHGSTVME-N---ELILDES-----L-----QPLPNAPQPAKVTATVE 297 (463) T ss_pred eeeeeeeCCceecC-C---cccccch-----h-----hcCCCCccCceeEEEEe Confidence 11222222111010 0 0011100 0 011111 0111221 No 173 >protein:vir:99311 Length: 463 # NCBI annotation: putative capsid protein # Family: family:all:2450 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024474;genbank:gi:48696433;genbank:GeneID:2948039 Probab=30.22 E-value=1.6 Score=19.47 Aligned_cols=275 Identities=13% Similarity=0.112 Sum_probs=122.4 Q ss_pred CcccchhHHHHhhhhhhhcccccccccchhhhhhhhhhhccCCceec-cchh-hHHHHHHHHHHHHHHHHHhhh--hccc Q lcl|NC_021342. 1 MAIKTIDAQTIQGNQWLVHKGYVSRNGDQWVINNTALDAIGNPNIML-DADG-GIAFYISQLAGIEATVYETPY--GDIT 76 (354) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~am~a~~~~~~~~-da~~-~~~fl~~~L~~Id~~v~e~~~--~~l~ 76 (354) |--|.-|+|+.-.++ ++.+ .|.+...+.=++ |.+- ++++.. +.+|+.+....+ .+++ T Consensus 3 ~~~~~~~~~~~~~~~----------~~e~------~~KS~~tg~g~~p~~q~~~~AlR~---EsL~~~i~~Lt~~~~~f~ 63 (463) T protein:vir:99 3 IEKNLSDVQQKYADQ----------FQED------VVKSFQTGYGITPDTQIDAGALRR---EILDDQITMLTWTNEDLI 63 (463) T ss_pred cccccchHHHHHHhh----------hhHH------HHHHhhcCCccCCccccCcchhhh---hhhhhhhheeeecccchh Confidence 333445555433222 2111 122222222112 2222 335555 444555543222 3444 Q ss_pred chhhccccCCCCCceeEEEEE-eeccccceeEecCCCcccceeeeccceeEEEEEEEEeeEeecHHHHHHHHHhCCCcch Q lcl|NC_021342. 77 YRFDVPMAANIPEYADTWMYR-SYDGVTMGKFIGANGQDLPRVAQSAQMHTVPLGYAGNECHYTLDEMRKSAAMNMPIDA 155 (354) Q Consensus 77 ~r~~v~v~~~~~~~~~~~~~~-~~~~~G~a~~~~~~~~dip~v~~~~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~ 155 (354) .-+-++- .+.......|+.. ....+|.+..++.. ...+..+.++.|++..+..++.....|...-. +-...+... 
T Consensus 64 ~~~~i~k-~~a~STV~~y~~~~~~G~~g~~~f~~E~-g~~~~~d~~~~Rr~~~~K~l~~~~~VS~~~~l--~n~~~d~~~ 139 (463) T protein:vir:99 64 FYRDISR-RPAQSTVVKYDQYLRHGNVGHSRFVKEI-GVAPVSDPNIRQKTVSMKYVSDTKNMSIASGL--VNNIADPSQ 139 (463) T ss_pred hhhhcCC-chhhhhhhhheeeeccCccccccccccc-cccccCCCceEEEEEEeeeeehhhhhhhHHHh--hcccccHHH Confidence 4343432 2333333333332 33445666665554 33567788888898888888877776554433 333557777 Q ss_pred HHHHHHHHHHHHHhhheeeeeehhhCc---------eeeee--cCCcccccccccccccCHHHHHHHHHHHHHHHHHHhC Q lcl|NC_021342. 156 EQARLAFRGAEEHSQSVAYFGDASRGM---------YGLFN--NPNVTLSSATKDYKTMNGQELFNMLNAPIFSVINLSR 224 (354) Q Consensus 156 ~k~~aA~~~~a~~~n~~~f~G~~~~gi---------~GLlN--~p~~~~~~~~~~W~~~T~~ei~~di~~~~~~l~~~s~ 224 (354) ...+.|...+++......||||+.+.= .||.| +|. .+..+.+.- ..+ +.|+++-..+ +. T Consensus 140 ~~~~dai~~ia~tiE~a~FyGds~l~~~~~~~gleFDGl~~lId~e-nviDarG~~----Ls~--~~ln~Aa~~i---~~ 209 (463) T protein:vir:99 140 ILTEDAIAVVAKTIEWASFYGDASLTSEVEGEGLEFDGLAKLIDKN-NVINAKGNQ----LTE--KHLNEAAVRI---GK 209 (463) T ss_pred HHHHHHHHHHHHHHHHHHhhhhhccCCCcCccccchhhhhhhcCCC-CeeecCCCc----ccH--HHHhhhhhhh---hc Confidence 788889999999999999999886533 33322 111 111222111 111 3355443333 44 Q ss_pred CcccccEEEeCHHHHHHHhhcccCCCCCchHHHHHHhhCcccccccccceeeeeeeeeeccccccccccCcccEEEEEEc Q lcl|NC_021342. 225 RFHVPNTALMFPDLWNQANNQLMTGYTDRTVMQHFMEANSYTLLTGNELDIQIRFQLDAAELAANGVSNSNKPRYMVYDK 304 (354) Q Consensus 225 g~~~p~~L~l~p~~~~~L~~~~~~~~~~~Tvl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~~g~g~~g~d~~v~y~~ 304 (354) ++-.|+-+.||......|.+..++... .+...|......|.++. . |-. T Consensus 210 ~fGt~TD~~lp~~vka~f~~~~l~~qr------v~~~~N~~~~~~G~~v~-----~---------------------f~s 257 (463) T protein:vir:99 210 GFGTATDAYMPIGVHADFVNSILGRQM------QLMQDNSGNVNTGYSVN-----G---------------------FYS 257 (463) T ss_pred ccCChhheecchHHHHHHHHHhcCceE------EEEcCCCCceeeeeecc-----c---------------------eee Confidence 778899999999999999754332211 11111111111111100 0 111 Q ss_pred CcceEEEeeCchhhhccccccCceeEEeeeeeeeeEEEECCce----eEeeecC Q lcl|NC_021342. 
305 SDRNLAMANPIPFRMLAPQMASLGITVPAEYKISGTEFRYPLC----AAYVDMA 354 (354) Q Consensus 305 ~~~~~~~~vp~~~~~~~~~~~~l~~~~~~~~~~gGv~i~~P~a----i~y~D~~ 354 (354) ..-.++++--.-+. . +..+.-. + -.-|.+ .+-.++. T Consensus 258 ~~G~I~L~~s~~m~-~---~~il~~~-----~-----~~~p~ap~~~~~tatv~ 297 (463) T protein:vir:99 258 SRGFIKLHGSTVME-N---ELILDES-----L-----QPLPNAPQPAKVTATVE 297 (463) T ss_pred eeeeeeeCCceecC-C---cccccch-----h-----hcCCCCccCceeEEEEe Confidence 11222222111010 0 0011100 0 011111 0111221 No 174 >protein:vir:106590 Length: 349 # NCBI annotation: putative major head protein # Family: family:all:1083 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958585;genbank:gi:41179245;genbank:GeneID:2717126 Probab=24.23 E-value=2.2 Score=18.70 Aligned_cols=275 Identities=11% Similarity=0.034 Sum_probs=99.2 Q ss_pred ccCCceeccch----hhH-HHHHHHHHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEe-eccccc-eeEecCCC Q lcl|NC_021342. 40 IGNPNIMLDAD----GGI-AFYISQLAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRS-YDGVTM-GKFIGANG 112 (354) Q Consensus 40 ~~~~~~~~da~----~~~-~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~-~~~~G~-a~~~~~~~ 112 (354) +-+.-++++-. .-. .|..++|..+-+ +...+.+-+.++||... .. .. .+.+.. ...... +.+++-.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~d~~~~~~l~~~~~---~~~~~~~l~~~~Fp~~~-~~-~~-~~~~~~~~~~~~~~a~~v~~~~ 74 (349) T protein:vir:10 1 MKNQKLQLDLQRFATPILDMFSQNTVLDYTR---NRQYPEMLGDTLFPAVK-VP-TL-EVDILKAGSRVPTIASVSAFDA 74 (349) T ss_pred CCcchhhHHHHHHHHHhhcccCHHHHHHHHH---hcCcchhhHhhcCCccc-cc-cc-eeEEEeeccCcceeeeeecCCC Confidence 22343444321 111 223333322222 22334555677887432 11 11 122211 111121 23333332 Q ss_pred cccceeeeccceeEEEEEEEEeeEeecHHHHHHHHHhCCCcchHH--------HHHHHHHHHH----Hhhheeeee---e Q lcl|NC_021342. 113 QDLPRVAQSAQMHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQ--------ARLAFRGAEE----HSQSVAYFG---D 177 (354) Q Consensus 113 ~dip~v~~~~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k--------~~aA~~~~a~----~~n~~~f~G---~ 177 (354) . .|..+-.....+..+..+...+.++..|+......+.+-.... ....++.+.. ..-++.+.| . 
T Consensus 75 ~-~~~~~r~~~~~~~~~p~ik~~~~i~e~dl~~~~~~~~~~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~q~l~~Gki~~ 153 (349) T protein:vir:10 75 E-AEIGTREASKMTAELAYVKRKMQITEEMLIKLQSPRNTAEENYLKQYVFDDIDAMVQAVKARGEKMTMEMFATGKITD 153 (349) T ss_pred C-cceecccceeEEeeccccccccccCHHHHHHHhhccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCeeEE Confidence 2 2333333333444556677778899899887766554422111 1112222222 223445555 1 Q ss_pred hhhCce---eeeecCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCcccccEEEeCHHHHHHHhhcccCCCCCch Q lcl|NC_021342. 178 ASRGMY---GLFNNPNVTLSSATKDYKTMNGQELFNMLNAPIFSVINLSRRFHVPNTALMFPDLWNQANNQLMTGYTDRT 254 (354) Q Consensus 178 ~~~gi~---GLlN~p~~~~~~~~~~W~~~T~~ei~~di~~~~~~l~~~s~g~~~p~~L~l~p~~~~~L~~~~~~~~~~~T 254 (354) .+-|+. |. ...+..+.+.++.|++.++ .+++||.+.+..+ | ..|..++|++..|..|.+ +.. T Consensus 154 ~~~g~~vD~g~-~~~~~~~lt~~~~Ws~~~a-dpi~Di~~~~~~~-----g-~~p~~~vm~~~~~~~l~~-------~~~ 218 (349) T protein:vir:10 154 KKNGIAIDYGV-PKKHQETLSGTKTWDKSDA-SIIDNLQDWSDSL-----D-VTPTRALTSKKVLRILMR-------STE 218 (349) T ss_pred cCCcEEEeccc-CccceeEecCcccCCCCCC-CHHHHHHHHHHHh-----C-CCccEEEeCHHHHHHHhc-------CHH Confidence 111211 11 0111112345667988654 5788998776443 3 368999999999998853 112 Q ss_pred HHHHHHhhCc-----------c-cccccccceeeeeeeeeeccccccccccCcccEEEEEEcCcceEEEeeCchhh-hcc Q lcl|NC_021342. 255 VMQHFMEANS-----------Y-TLLTGNELDIQIRFQLDAAELAANGVSNSNKPRYMVYDKSDRNLAMANPIPFR-MLA 321 (354) Q Consensus 255 vl~~l~~n~~-----------~-~~~~g~~l~I~~~~~L~~~~~~~~g~g~~g~d~~v~y~~~~~~~~~~vp~~~~-~~~ 321 (354) +.+.+..++. + ....|. .|.....-... ..+.+.+ +-.-.+|...- ++| T Consensus 219 i~~~~~~~~~~~~~~~~~~~~~l~~~~~~--~i~~yd~~y~d-~~~~~~~---------------t~~~~~p~~~v~l~~ 280 (349) T protein:vir:10 219 IKEAIFGKDTGRVVGQADLDQWMTAQGLP--IIRAYDGKYRD-EDSRGNL---------------TTNSYFPEDRIVLFN 280 (349) T ss_pred HHHHhcccccccccCHHHHHHHHHhcCCc--eEEEEeeEEEe-ecCCCce---------------eecccccCCeEEEec Confidence 2222211110 0 000111 11111110000 0000000 11112222211 112 Q ss_pred ccccC-ceeEEe-----------eeeeee-eEEEE---CCceeEeeecC Q lcl|NC_021342. 
322 PQMAS-LGITVP-----------AEYKIS-GTEFR---YPLCAAYVDMA 354 (354) Q Consensus 322 ~~~~~-l~~~~~-----------~~~~~g-Gv~i~---~P~ai~y~D~~ 354 (354) ....+ ..|-.. .....+ |..++ ...-..+.=++ T Consensus 281 ~~~~G~~~yG~~~e~~~~~~g~~~~~~~~~~~~~~~~~~~dP~~~~~~~ 329 (349) T protein:vir:10 281 DEVPGQKIYGPTPEENRLISSNAQVSNVGNIMAKIYETSEDPIGTWILA 329 (349) T ss_pred CCCceeEEeeccchhhhhcccccceeeccceEEEeeeecCCCceEEEEE Confidence 11111 111000 000111 11111 11111111111 No 175 >protein:vir:2736 Length: 348 # NCBI annotation: putative structural protein # Family: family:all:1083 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695109;genbank:gi:23455878;genbank:GeneID:955608 Probab=21.92 E-value=2.5 Score=18.38 Aligned_cols=287 Identities=11% Similarity=0.030 Sum_probs=106.8 Q ss_pred CCceeccchhhHHHHHHHHHHHHHHHHHhhhhcccchhhccccCCCCCceeEEEEEe-eccccc-eeEecCCCcccceee Q lcl|NC_021342. 42 NPNIMLDADGGIAFYISQLAGIEATVYETPYGDITYRFDVPMAANIPEYADTWMYRS-YDGVTM-GKFIGANGQDLPRVA 119 (354) Q Consensus 42 ~~~~~~da~~~~~fl~~~L~~Id~~v~e~~~~~l~~r~~v~v~~~~~~~~~~~~~~~-~~~~G~-a~~~~~~~~dip~v~ 119 (354) =+++ .-.|...+|+..=.++ ......+-...+||.. +.. ...+.+.. ...... +.+++..+.....-. T Consensus 1 M~~i------~d~f~~~~l~~~v~~~-~~~~~~~l~~~~Fp~~-~~~--~~~~~~~~~~~~~~~~a~~v~~~~~~~~~~r 70 (348) T protein:vir:27 1 MGLI------YDKVTASNIAGYFNAL-QENVSSTLGESIFPAR-KQL--GTKLSYIKGASGQSVALKAAAFDTNVTIRDR 70 (348) T ss_pred Ccch------hhhcCHHHHHHHHHhc-cchhhhhhHhhcCCCc-ccc--ceeEEEEeeccCceeEeeeecCCCCcceecc Confidence 0110 0123333333211111 1122233344667732 211 11122111 111111 233433333211222 Q ss_pred eccceeEEEEEEEEeeEeecHHHHHHHHHhCCCcchHH-----------H----HHHHHHHHHHhhheeeee---ehhhC Q lcl|NC_021342. 120 QSAQMHTVPLGYAGNECHYTLDEMRKSAAMNMPIDAEQ-----------A----RLAFRGAEEHSQSVAYFG---DASRG 181 (354) Q Consensus 120 ~~~~~~~~pv~~~~~~~~~~~~El~~a~~~g~~ld~~k-----------~----~aA~~~~a~~~n~~~f~G---~~~~g 181 (354) ...+..+..+..+.....++..|+...+...-...... . 
...++..+...-++.+.| ..+.| T Consensus 71 ~~~~~~~~~~p~i~~~~~i~~~d~~~~~~~~~~~~~~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~~al~~Gki~i~~~~ 150 (348) T protein:vir:27 71 VSAEMHDEQMPFFKEAMLVKENDRQQLNLVKDSGNAVLVNTIVAGIFNDNLTLVNGARARLEAMRMQVLATGKIAFTSDG 150 (348) T ss_pred cceeeeeeecCccccccccCHHHHHHHHHhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCeeEEecCC Confidence 23345566777777778888777655433322211111 1 112222333333455555 12222 Q ss_pred ceee--eecCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCcccccEEEeCHHHHHHHhh-----cccC----CC Q lcl|NC_021342. 182 MYGL--FNNPNVTLSSATKDYKTMNGQELFNMLNAPIFSVINLSRRFHVPNTALMFPDLWNQANN-----QLMT----GY 250 (354) Q Consensus 182 i~GL--lN~p~~~~~~~~~~W~~~T~~ei~~di~~~~~~l~~~s~g~~~p~~L~l~p~~~~~L~~-----~~~~----~~ 250 (354) ..=- ++.|.-...++++.|++.++ .++.||.+....+.+ .|. .|..++|+++.|..|.+ ..+. .. T Consensus 151 ~~~~vdfg~~~~~~~t~~~~W~~~~a-dp~~di~~~~~~~~~--~G~-~~~~ii~~~~~~~~l~~~~~v~~~~~~~~~~~ 226 (348) T protein:vir:27 151 VNKDIDYGVKPDHKKQVSKSWAEPGA-TPLADLEDAIETARE--LGL-NPERAVMNAKTFGLIRKAASTVKVIKPLAGDG 226 (348) T ss_pred eeEEEeecCCcccceeeeeccCCCCC-CHHHHHHHHHHHHHh--cCC-cccEEEECHHHHHHHhcCHHHHHHhcccCccc Confidence 2101 12222122334567998665 578999999877753 364 89999999999998854 1111 11 Q ss_pred CCch---HHHHHHhhCcccccccccceeeeeeeeeecccccccccc--CcccEEEEEEcCc-ceEEEeeC-chhhhc--- Q lcl|NC_021342. 251 TDRT---VMQHFMEANSYTLLTGNELDIQIRFQLDAAELAANGVSN--SNKPRYMVYDKSD-RNLAMANP-IPFRML--- 320 (354) Q Consensus 251 ~~~T---vl~~l~~n~~~~~~~g~~l~I~~~~~L~~~~~~~~g~g~--~g~d~~v~y~~~~-~~~~~~vp-~~~~~~--- 320 (354) ..++ +.+|+... .|. .|.....-.. ...|... --.+.++....+. -......+ +..... 
T Consensus 227 ~~i~~~~~~~~~~~~------~g~--~i~~yd~~y~---d~~G~~~~~~p~~~vvl~~~~~~G~~~yG~~~e~~~~~~~~ 295 (348) T protein:vir:27 227 SAVTKAELENYIADN------FGV--SIVLENGTYR---NDKGEVSKFYPDGHLTLIPNGPLGNTVFGTTPEESDLFADN 295 (348) T ss_pred cccCHHHHHHHHHhh------cCc--eEEEEeeEEE---cCCCcCcccccCCeEEEEcCCcceeEEeccCcchhhhhhcc Confidence 1111 22222221 222 2222111111 1111100 0011222211110 01111100 000000 Q ss_pred ---------c--------ccccCceeEEeeeeeeeeEEEECCceeEeeecC Q lcl|NC_021342. 321 ---------A--------PQMASLGITVPAEYKISGTEFRYPLCAAYVDMA 354 (354) Q Consensus 321 ---------~--------~~~~~l~~~~~~~~~~gGv~i~~P~ai~y~D~~ 354 (354) + .+.+.....+...+ ..=-.+.+|.++...++- T Consensus 296 ~~~~~~~~~~~~~~~~~~~~~dP~~~~~~~~s-~~lPv~~~~~~~~~a~Vl 345 (348) T protein:vir:27 296 TVNAEVEIVDNGIAVTTTKTTDPVNVQTKVSM-VALPSFERLDDVYMLTVI 345 (348) T ss_pred ccccceeeeCCeeEEEeeecCCCceEEEEEee-eeeccccCCCcEEEEEEe Confidence 0 00000111111111 112335667777777766 Done!