Query lcl|NC_021309.1_cdsid_YP_008051948.1 [gene=12] [protein=major capsid protein] [protein_id=YP_008051948.1] [location=7089..8582] Match_columns 497 No_of_seqs 156 out of 925 Neff 10.2 Searched_HMMs 1612 Date Thu Nov 7 17:30:06 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_12 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_12_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:7855 Length: 497 # 100.0 4.8E-97 3E-100 548.6 50.4 497 1-497 1-497 (497) 2 protein:vir:101650 Length: 497 100.0 4.8E-97 3E-100 548.6 50.4 497 1-497 1-497 (497) 3 protein:vir:100135 Length: 418 100.0 1.7E-70 1.1E-73 403.1 43.1 411 1-496 4-418 (418) 4 protein:vir:4339 Length: 395 # 100.0 1.4E-68 8.9E-72 392.5 41.4 394 4-493 1-395 (395) 5 protein:vir:81227 Length: 413 100.0 4.9E-67 3.1E-70 384.1 43.2 406 14-496 1-413 (413) 6 protein:vir:10364 Length: 390 100.0 1.4E-67 9E-71 387.0 39.1 388 11-491 1-390 (390) 7 protein:vir:100247 Length: 425 100.0 3.3E-67 2.1E-70 385.1 39.6 404 1-494 1-425 (425) 8 protein:vir:81070 Length: 390 100.0 3.3E-67 2E-70 385.1 38.6 388 18-491 1-390 (390) 9 protein:vir:1886 Length: 385 # 100.0 9E-67 5.6E-70 382.7 40.2 384 1-494 1-385 (385) 10 protein:vir:191 Length: 385 # 100.0 9E-67 5.6E-70 382.7 40.2 384 1-494 1-385 (385) 11 protein:vir:97053 Length: 390 100.0 1.2E-66 7.5E-70 382.0 38.5 388 1-491 1-390 (390) 12 protein:vir:485 Length: 407 # 100.0 2.8E-66 1.7E-69 380.0 38.8 397 1-497 1-404 (407) 13 protein:vir:1328 Length: 392 # 100.0 2.3E-66 1.5E-69 380.4 37.2 387 1-494 1-392 (392) 14 protein:vir:101607 Length: 379 100.0 1.9E-65 1.2E-68 375.4 40.1 378 1-493 1-379 (379) 15 protein:vir:4456 Length: 401 # 100.0 2.1E-65 1.3E-68 375.1 37.6 393 1-493 1-401 (401) 16 protein:vir:6242 Length: 390 # 100.0 2.4E-65 1.5E-68 374.8 36.2 385 1-494 1-390 (390) 17 protein:vir:95376 Length: 425 100.0 5.5E-64 3.4E-67 367.4 42.8 414 5-497 1-425 (425) 18 protein:vir:94673 Length: 419 100.0 5.7E-64 3.5E-67 367.3 41.0 410 1-495 1-419 (419) 19 protein:vir:105038 Length: 428 100.0 2.6E-64 1.6E-67 369.2 37.8 404 1-493 1-428 (428) 20 protein:vir:104256 Length: 458 100.0 6.1E-62 3.8E-65 356.2 45.2 432 1-493 1-458 (458) 21 protein:vir:8420 Length: 477 # 100.0 1.2E-62 7.7E-66 360.0 39.9 433 7-497 1-475 (477) 22 protein:vir:1433 Length: 435 # 100.0 6.3E-63 3.9E-66 361.6 36.8 402 16-495 1-435 (435) 23 protein:vir:4511 Length: 409 # 100.0 5.4E-63 3.4E-66 362.0 35.6 397 1-496 1-409 (409) 24 protein:vir:80376 Length: 435 100.0 1.1E-62 6.8E-66 360.3 37.0 405 16-495 1-435 (435) 25 protein:vir:8102 Length: 543 # 100.0 7E-61 4.4E-64 350.4 45.4 429 1-494 40-543 (543) 26 protein:vir:4953 Length: 397 # 100.0 6.7E-62 4.2E-65 356.0 37.6 378 1-497 1-389 (397) 27 protein:vir:3870 Length: 400 # 100.0 2.9E-61 1.8E-64 352.4 40.5 392 5-494 1-400 (400) 28 protein:vir:1268 Length: 397 # 100.0 1.4E-61 8.8E-65 354.2 38.3 385 1-493 1-397 (397) 29 protein:vir:81160 Length: 371 100.0 1.1E-61 6.5E-65 354.9 36.9 355 1-493 1-371 (371) 30 protein:vir:6212 Length: 434 # 100.0 2E-61 1.2E-64 353.4 38.3 415 5-497 1-434 (434) 31 protein:vir:1025 Length: 408 # 100.0 2.4E-61 1.5E-64 352.9 37.6 385 1-497 1-397 (408) 32 protein:vir:3991 Length: 404 # 100.0 3.9E-61 2.4E-64 351.7 37.9 386 1-497 1-397 (404) 33 protein:vir:7409 Length: 408 # 100.0 6.9E-61 4.3E-64 350.4 38.8 385 1-497 1-397 (408) 34 protein:vir:102119 Length: 404 100.0 6.4E-61 4E-64 350.6 38.5 391 14-497 1-404 (404) 35 protein:vir:4600 Length: 415 # 100.0 1.4E-60 8.9E-64 348.7 40.0 400 1-497 1-408 (415) 36 protein:vir:4700 Length: 415 # 100.0 1.4E-60 8.9E-64 348.7 40.0 400 1-497 1-408 (415) 37 protein:vir:4997 Length: 397 # 100.0 1.3E-60 8.1E-64 348.9 38.6 378 1-497 1-389 (397) 38 protein:vir:80128 Length: 466 100.0 4.9E-60 3.1E-63 345.7 41.6 442 1-497 1-452 (466) 39 protein:vir:9410 Length: 415 # 100.0 4.5E-60 2.8E-63 346.0 40.3 399 1-497 1-408 (415) 40 protein:vir:3845 Length: 395 # 100.0 4.6E-60 2.9E-63 345.9 39.5 376 1-497 1-387 (395) 41 protein:vir:98339 Length: 415 100.0 9.8E-60 6.1E-63 344.1 41.1 399 1-497 1-408 (415) 42 protein:vir:79987 Length: 415 100.0 9.8E-60 6.1E-63 344.1 41.1 399 1-497 1-408 (415) 43 protein:vir:81100 Length: 415 100.0 9.8E-60 6.1E-63 344.1 41.1 399 1-497 1-408 (415) 44 protein:vir:1084 Length: 437 # 100.0 5E-59 3.1E-62 340.2 44.0 410 1-497 1-431 (437) 45 protein:vir:4830 Length: 397 # 100.0 5.1E-60 3.2E-63 345.6 37.5 378 1-497 1-389 (397) 46 protein:vir:9704 Length: 394 # 100.0 2E-59 1.3E-62 342.4 39.5 389 1-497 1-394 (394) 47 protein:vir:102873 Length: 392 100.0 1E-59 6.5E-63 343.9 37.8 374 14-497 1-388 (392) 48 protein:vir:102082 Length: 392 100.0 1E-59 6.5E-63 343.9 37.8 374 14-497 1-388 (392) 49 protein:vir:107593 Length: 392 100.0 1E-59 6.5E-63 343.9 37.8 374 14-497 1-388 (392) 50 protein:vir:105004 Length: 392 100.0 1E-59 6.5E-63 343.9 37.8 374 14-497 1-388 (392) 51 protein:vir:962 Length: 397 # 100.0 1E-58 6.2E-62 338.6 40.1 389 4-493 1-397 (397) 52 protein:vir:100172 Length: 394 100.0 4E-59 2.5E-62 340.8 37.5 379 1-497 1-388 (394) 53 protein:vir:1383 Length: 421 # 100.0 5.9E-59 3.7E-62 339.8 37.5 379 5-497 1-396 (421) 54 protein:vir:100884 Length: 389 100.0 6E-59 3.7E-62 339.8 37.2 377 15-497 1-386 (389) 55 protein:vir:4092 Length: 390 # 100.0 4.4E-59 2.7E-62 340.5 36.2 366 15-497 1-372 (390) 56 protein:vir:98635 Length: 377 100.0 1.1E-58 6.5E-62 338.4 29.7 365 1-493 1-377 (377) 57 protein:vir:4226 Length: 326 # 100.0 8E-59 5E-62 339.1 27.4 311 123-496 1-326 (326) 58 protein:vir:7771 Length: 330 # 100.0 1.6E-58 1E-61 337.4 28.4 303 143-497 1-327 (330) 59 protein:vir:93616 Length: 645 100.0 2.4E-57 1.5E-60 331.0 33.6 426 1-497 155-643 (645) 60 protein:vir:41 Length: 299 # N 100.0 2.8E-58 1.8E-61 336.1 27.4 282 146-494 1-299 (299) 61 protein:vir:9361 Length: 402 # 100.0 1.9E-57 1.2E-60 331.5 31.6 383 1-497 16-400 (402) 62 protein:vir:96762 Length: 632 100.0 9.4E-57 5.8E-60 327.7 35.2 413 1-492 185-632 (632) 63 protein:vir:104085 Length: 320 100.0 6.5E-58 4E-61 334.1 28.0 305 138-496 1-320 (320) 64 protein:vir:94424 Length: 387 100.0 1.3E-56 7.9E-60 327.0 34.4 383 1-497 1-385 (387) 65 protein:vir:2685 Length: 387 # 100.0 1.3E-56 7.9E-60 327.0 34.4 383 1-497 1-385 (387) 66 protein:vir:96978 Length: 387 100.0 1.3E-56 7.9E-60 327.0 34.4 383 1-497 1-385 (387) 67 protein:vir:5739 Length: 366 # 100.0 6.3E-58 3.9E-61 334.2 27.2 346 77-493 1-366 (366) 68 protein:vir:2430 Length: 318 # 100.0 1.7E-57 1E-60 331.9 27.9 302 138-497 1-317 (318) 69 protein:vir:93881 Length: 387 100.0 3.3E-56 2E-59 324.8 35.0 382 1-497 1-385 (387) 70 protein:vir:9574 Length: 300 # 100.0 1.9E-57 1.2E-60 331.5 27.8 279 152-493 1-300 (300) 71 protein:vir:8187 Length: 311 # 100.0 2.6E-57 1.6E-60 330.8 27.9 281 153-494 1-311 (311) 72 protein:vir:80684 Length: 315 100.0 2.6E-57 1.6E-60 330.8 27.4 287 151-497 1-310 (315) 73 protein:vir:105905 Length: 304 100.0 4.5E-57 2.8E-60 329.5 27.1 285 143-492 1-304 (304) 74 protein:vir:94142 Length: 304 100.0 4.5E-57 2.8E-60 329.5 27.1 285 143-492 1-304 (304) 75 protein:vir:2344 Length: 397 # 100.0 6E-57 3.7E-60 328.8 27.2 295 142-497 1-310 (397) 76 protein:vir:97148 Length: 324 100.0 2.6E-56 1.6E-59 325.3 29.7 303 108-497 1-319 (324) 77 protein:vir:96392 Length: 324 100.0 5.2E-56 3.3E-59 323.6 29.6 303 117-497 1-320 (324) 78 protein:vir:78830 Length: 324 100.0 5.2E-56 3.3E-59 323.6 29.6 303 117-497 1-320 (324) 79 protein:vir:9759 Length: 303 # 100.0 2.4E-56 1.5E-59 325.5 27.6 285 153-493 1-303 (303) 80 protein:vir:9309 Length: 324 # 100.0 1E-55 6.5E-59 322.0 29.9 303 117-497 1-320 (324) 81 protein:vir:1638 Length: 298 # 100.0 4.5E-56 2.8E-59 324.0 27.8 280 155-492 1-298 (298) 82 protein:vir:78523 Length: 338 100.0 8.7E-56 5.4E-59 322.4 27.7 303 136-496 1-338 (338) 83 protein:vir:78223 Length: 333 100.0 1.6E-55 1E-58 320.9 28.0 299 136-494 1-333 (333) 84 protein:vir:9643 Length: 377 # 100.0 6.6E-55 4.1E-58 317.6 30.8 363 1-493 1-377 (377) 85 protein:vir:99749 Length: 324 100.0 3.9E-55 2.4E-58 318.9 29.5 303 108-497 1-319 (324) 86 protein:vir:78640 Length: 352 100.0 1.4E-54 8.9E-58 315.8 31.3 348 40-497 1-350 (352) 87 protein:vir:100632 Length: 381 100.0 3.8E-55 2.3E-58 319.0 28.1 364 35-497 1-377 (381) 88 protein:vir:94771 Length: 298 100.0 3.8E-55 2.4E-58 318.9 27.6 277 155-492 1-298 (298) 89 protein:vir:4856 Length: 293 # 100.0 4E-55 2.5E-58 318.8 27.7 274 147-497 1-285 (293) 90 protein:vir:103955 Length: 324 100.0 8.3E-55 5.2E-58 317.1 29.2 303 108-497 1-320 (324) 91 protein:vir:95963 Length: 395 100.0 1.1E-53 6.5E-57 311.0 34.8 374 1-497 1-380 (395) 92 protein:vir:95763 Length: 297 100.0 4.6E-55 2.8E-58 318.5 27.3 281 143-494 1-297 (297) 93 protein:vir:96223 Length: 324 100.0 1.1E-54 7E-58 316.3 29.2 303 108-497 1-319 (324) 94 protein:vir:99920 Length: 311 100.0 4.6E-55 2.9E-58 318.5 26.9 282 151-493 1-311 (311) 95 protein:vir:101291 Length: 381 100.0 2.6E-54 1.6E-57 314.4 29.2 359 40-497 1-374 (381) 96 protein:vir:9509 Length: 381 # 100.0 2.6E-54 1.6E-57 314.4 29.2 359 40-497 1-374 (381) 97 protein:vir:2504 Length: 305 # 100.0 1.6E-54 1E-57 315.5 26.5 284 151-497 1-302 (305) 98 protein:vir:78350 Length: 383 100.0 1.1E-52 6.8E-56 305.5 30.0 372 1-497 1-379 (383) 99 protein:vir:97397 Length: 517 100.0 7.9E-41 4.9E-44 240.4 29.7 390 1-496 120-517 (517) 100 protein:vir:4197 Length: 314 # 100.0 1.5E-41 9.2E-45 244.4 25.1 292 139-496 1-314 (314) 101 protein:vir:4159 Length: 315 # 100.0 3.3E-41 2.1E-44 242.5 23.2 295 135-492 1-315 (315) 102 protein:vir:4074 Length: 480 # 100.0 8.3E-38 5.2E-41 223.9 21.5 362 1-496 109-480 (480) 103 protein:vir:3158 Length: 321 # 100.0 6.9E-36 4.3E-39 213.4 25.9 308 133-497 1-315 (321) 104 protein:vir:9820 Length: 272 # 100.0 9.9E-30 6.2E-33 179.6 23.8 266 151-496 1-272 (272) 105 protein:vir:3033 Length: 272 # 100.0 9.9E-30 6.2E-33 179.6 23.8 266 151-496 1-272 (272) 106 protein:vir:3613 Length: 272 # 99.8 1.6E-20 9.8E-24 129.1 20.2 264 151-493 1-272 (272) 107 protein:vir:93742 Length: 274 99.8 5.5E-20 3.4E-23 126.2 22.2 265 151-497 1-271 (274) 108 protein:vir:94933 Length: 330 99.8 1.8E-20 1.1E-23 128.9 17.9 308 117-494 1-330 (330) 109 protein:vir:80930 Length: 278 99.7 8.5E-19 5.3E-22 119.7 21.4 271 151-494 1-278 (278) 110 protein:vir:96833 Length: 275 99.7 1E-18 6.3E-22 119.2 20.6 269 151-497 1-275 (275) 111 protein:vir:105334 Length: 276 99.7 2.8E-18 1.7E-21 116.8 20.8 268 151-497 1-274 (276) 112 protein:vir:96123 Length: 274 99.7 1.2E-17 7.4E-21 113.4 21.7 268 151-497 1-274 (274) 113 protein:vir:97433 Length: 274 99.7 4.2E-17 2.6E-20 110.4 22.2 268 151-497 1-274 (274) 114 protein:vir:94494 Length: 274 99.7 4.2E-17 2.6E-20 110.4 22.2 268 151-497 1-274 (274) 115 protein:vir:1239 Length: 274 # 99.6 1.3E-16 8E-20 107.7 20.9 265 151-497 1-271 (274) 116 protein:vir:96262 Length: 274 99.6 2.5E-16 1.5E-19 106.2 21.2 265 151-497 1-271 (274) 117 protein:vir:95898 Length: 274 99.6 2.5E-16 1.5E-19 106.2 21.2 265 151-497 1-271 (274) 118 protein:vir:95107 Length: 270 99.5 2.1E-14 1.3E-17 95.6 20.0 260 151-497 1-267 (270) 119 protein:vir:97255 Length: 310 99.4 1.1E-13 6.9E-17 91.6 22.3 282 151-493 1-310 (310) 120 protein:vir:8324 Length: 410 # 99.4 1.7E-14 1.1E-17 96.0 17.5 388 1-491 1-410 (410) 121 protein:vir:79928 Length: 393 99.4 8.1E-14 5E-17 92.4 20.5 354 43-497 1-385 (393) 122 protein:vir:108211 Length: 318 99.4 1.9E-14 1.1E-17 95.9 16.9 296 148-494 1-318 (318) 123 protein:vir:99424 Length: 360 99.4 5.8E-13 3.6E-16 87.7 23.1 327 118-496 1-360 (360) 124 protein:vir:739 Length: 231 # 99.4 7.1E-14 4.4E-17 92.7 16.8 222 186-493 1-231 (231) 125 protein:vir:7990 Length: 273 # 99.2 1.5E-11 9.1E-15 80.0 19.1 261 151-493 1-273 (273) 126 protein:vir:93858 Length: 400 99.1 8.1E-12 5E-15 81.4 17.4 381 1-491 10-400 (400) 127 protein:vir:102605 Length: 273 99.1 4.2E-11 2.6E-14 77.5 19.3 260 151-493 1-273 (273) 128 protein:vir:105822 Length: 273 99.1 4.2E-11 2.6E-14 77.5 19.3 260 151-493 1-273 (273) 129 protein:vir:2201 Length: 345 # 99.0 7.6E-11 4.7E-14 76.1 15.2 296 139-493 1-345 (345) 130 protein:vir:3364 Length: 347 # 98.9 9E-11 5.6E-14 75.7 13.8 299 139-495 1-347 (347) 131 protein:vir:1541 Length: 347 # 98.9 1.8E-10 1.1E-13 74.0 15.3 303 139-495 1-347 (347) 132 protein:vir:80213 Length: 334 98.9 2.7E-10 1.7E-13 73.0 15.7 299 142-495 1-334 (334) 133 protein:vir:8885 Length: 347 # 98.9 3.4E-10 2.1E-13 72.5 15.8 295 139-494 1-347 (347) 134 protein:vir:103285 Length: 296 98.9 1.3E-09 8.1E-13 69.3 18.8 281 151-494 1-296 (296) 135 protein:vir:94576 Length: 347 98.9 5.5E-10 3.4E-13 71.3 16.7 295 139-493 1-347 (347) 136 protein:vir:100057 Length: 375 98.8 1.5E-09 9.3E-13 69.0 18.6 311 135-497 1-374 (375) 137 protein:vir:95318 Length: 328 98.8 5.1E-11 3.2E-14 77.0 10.1 281 146-435 1-328 (328) 138 protein:vir:78739 Length: 332 98.8 5.2E-10 3.2E-13 71.5 14.6 291 136-491 1-332 (332) 139 protein:vir:107687 Length: 319 98.8 4.5E-09 2.8E-12 66.3 19.5 303 117-491 1-319 (319) 140 protein:vir:103323 Length: 364 98.8 2.4E-09 1.5E-12 67.9 17.9 303 142-497 1-343 (364) 141 protein:vir:10450 Length: 344 98.8 1.2E-09 7.4E-13 69.5 15.4 296 139-493 1-344 (344) 142 protein:vir:80068 Length: 301 98.7 8.4E-09 5.2E-12 64.9 19.4 283 154-491 1-301 (301) 143 protein:vir:78935 Length: 335 98.7 7.6E-09 4.7E-12 65.1 18.4 293 142-497 1-332 (335) 144 protein:vir:94711 Length: 347 98.7 1.9E-09 1.2E-12 68.4 14.1 290 139-494 1-347 (347) 145 protein:vir:94622 Length: 341 98.7 4.4E-09 2.7E-12 66.4 16.1 287 146-495 1-341 (341) 146 protein:vir:103759 Length: 330 98.6 4.1E-10 2.6E-13 72.0 9.7 279 148-435 1-330 (330) 147 protein:vir:6324 Length: 335 # 98.6 1.3E-08 8.3E-12 63.7 17.9 294 142-497 1-332 (335) 148 protein:vir:105645 Length: 400 98.6 9.1E-09 5.7E-12 64.7 14.9 300 142-497 1-337 (400) 149 protein:vir:7324 Length: 335 # 98.5 1.5E-09 9.2E-13 69.0 10.0 278 151-436 1-335 (335) 150 protein:vir:107388 Length: 331 98.5 5.2E-09 3.2E-12 66.0 12.6 276 151-435 1-331 (331) 151 protein:vir:98525 Length: 331 98.5 5.2E-09 3.2E-12 66.0 12.6 276 151-435 1-331 (331) 152 protein:vir:107826 Length: 331 98.5 5.2E-09 3.2E-12 66.0 12.6 276 151-435 1-331 (331) 153 protein:vir:79642 Length: 329 98.5 7E-08 4.4E-11 59.8 18.5 311 113-494 1-329 (329) 154 protein:vir:104342 Length: 314 98.5 5.5E-08 3.4E-11 60.4 17.7 298 116-494 1-314 (314) 155 protein:vir:5974 Length: 324 # 98.5 1.3E-07 8.2E-11 58.3 19.0 272 151-497 1-294 (324) 156 protein:vir:97031 Length: 402 98.5 9.7E-09 6E-12 64.5 12.7 296 142-497 1-337 (402) 157 protein:vir:99675 Length: 324 98.4 9.7E-09 6E-12 64.5 11.8 253 186-497 1-301 (324) 158 protein:vir:7019 Length: 401 # 98.4 8.6E-09 5.4E-12 64.8 10.9 306 142-497 1-342 (401) 159 protein:vir:9927 Length: 295 # 98.4 1.7E-07 1.1E-10 57.7 17.7 264 151-497 1-292 (295) 160 protein:vir:8843 Length: 317 # 98.4 2.3E-07 1.4E-10 57.0 18.0 301 148-495 1-317 (317) 161 protein:vir:9875 Length: 296 # 98.3 1.8E-07 1.1E-10 57.6 16.6 273 142-494 1-296 (296) 162 protein:vir:1583 Length: 351 # 98.3 5.4E-07 3.4E-10 54.9 19.0 277 151-497 1-295 (351) 163 protein:vir:102944 Length: 330 98.2 1.9E-06 1.2E-09 51.9 19.7 280 151-497 1-297 (330) 164 protein:vir:80180 Length: 381 98.1 1.3E-06 7.8E-10 52.9 16.7 294 129-497 1-316 (381) 165 protein:vir:106647 Length: 303 97.8 4E-06 2.5E-09 50.2 15.4 274 146-497 1-300 (303) 166 protein:vir:79548 Length: 652 97.8 7.6E-06 4.7E-09 48.7 16.8 425 1-490 174-652 (652) 167 protein:vir:3136 Length: 322 # 97.8 2.7E-06 1.7E-09 51.1 13.8 282 151-497 1-322 (322) 168 protein:vir:102655 Length: 322 97.7 6.6E-06 4.1E-09 49.0 14.9 287 146-494 1-322 (322) 169 protein:vir:5255 Length: 304 # 97.5 2.2E-05 1.4E-08 46.1 15.0 282 157-490 1-304 (304) 170 protein:vir:101557 Length: 336 97.4 4.1E-05 2.5E-08 44.7 15.8 314 98-491 1-336 (336) 171 protein:vir:3643 Length: 336 # 97.3 5.5E-05 3.4E-08 43.9 15.1 314 98-491 1-336 (336) 172 protein:vir:78558 Length: 336 97.3 8.8E-05 5.5E-08 42.8 15.7 314 99-491 1-336 (336) 173 protein:vir:94070 Length: 339 97.1 0.00019 1.2E-07 41.0 16.0 316 94-491 1-339 (339) 174 protein:vir:107732 Length: 379 97.0 3.6E-05 2.2E-08 44.9 11.7 337 77-491 1-379 (379) 175 protein:vir:95512 Length: 693 97.0 0.00023 1.4E-07 40.6 16.0 428 1-491 220-693 (693) 176 protein:vir:106734 Length: 336 96.7 0.00034 2.1E-07 39.6 14.5 314 99-491 1-336 (336) 177 protein:vir:1829 Length: 355 # 96.6 0.00046 2.8E-07 38.9 19.1 327 118-497 1-354 (355) 178 protein:vir:98566 Length: 355 96.4 0.00062 3.9E-07 38.2 19.0 324 130-497 1-353 (355) 179 protein:vir:78777 Length: 358 96.3 0.00077 4.8E-07 37.7 17.2 324 114-497 1-352 (358) 180 protein:vir:99075 Length: 392 96.3 0.00081 5E-07 37.5 18.1 269 151-497 1-317 (392) 181 protein:vir:104011 Length: 337 96.2 0.00087 5.4E-07 37.4 19.7 324 118-496 1-337 (337) 182 protein:vir:1153 Length: 338 # 96.2 0.00089 5.5E-07 37.3 17.8 320 130-495 1-338 (338) 183 protein:vir:78186 Length: 337 96.2 0.00089 5.5E-07 37.3 18.8 324 118-496 1-337 (337) 184 protein:vir:79171 Length: 337 96.2 0.00091 5.6E-07 37.3 19.6 324 118-496 1-337 (337) 185 protein:vir:5694 Length: 357 # 96.2 0.00093 5.7E-07 37.2 18.1 324 130-497 1-346 (357) 186 protein:vir:6061 Length: 357 # 96.0 0.0011 7.1E-07 36.7 18.1 324 130-497 1-352 (357) 187 protein:vir:79157 Length: 339 96.0 0.0012 7.3E-07 36.6 19.2 324 118-497 1-339 (339) 188 protein:vir:2016 Length: 357 # 95.8 0.0015 9.2E-07 36.1 18.1 324 130-497 1-352 (357) 189 protein:vir:100331 Length: 342 95.2 0.0027 1.6E-06 34.7 18.9 323 118-497 1-342 (342) 190 protein:vir:1781 Length: 221 # 95.0 0.0023 1.4E-06 35.0 12.0 190 233-497 1-206 (221) 191 protein:vir:96079 Length: 382 95.0 0.0019 1.2E-06 35.5 11.4 338 77-491 1-382 (382) 192 protein:vir:98856 Length: 343 94.5 0.0041 2.6E-06 33.7 17.7 317 130-497 1-337 (343) 193 protein:vir:95131 Length: 325 94.0 0.0056 3.5E-06 32.9 19.3 273 152-497 1-295 (325) 194 protein:vir:99576 Length: 388 93.7 0.0025 1.6E-06 34.9 9.4 344 77-491 1-388 (388) 195 protein:vir:270 Length: 341 # 93.6 0.0068 4.2E-06 32.5 16.9 319 115-497 1-338 (341) 196 protein:vir:108303 Length: 418 92.3 0.012 7.5E-06 31.1 18.6 262 155-497 1-287 (418) 197 protein:vir:80446 Length: 367 91.9 0.014 8.4E-06 30.8 14.0 292 151-497 1-339 (367) 198 protein:vir:96792 Length: 315 91.8 0.014 8.7E-06 30.7 18.1 265 151-497 1-282 (315) 199 protein:vir:103886 Length: 302 90.4 0.021 1.3E-05 29.8 14.6 285 151-497 1-302 (302) 200 protein:vir:93966 Length: 400 89.2 0.028 1.7E-05 29.1 16.1 378 1-491 10-400 (400) 201 protein:vir:348 Length: 321 # 88.7 0.031 1.9E-05 28.9 13.0 298 112-491 1-321 (321) 202 protein:vir:3746 Length: 336 # 88.5 0.032 2E-05 28.8 17.4 312 127-497 1-334 (336) 203 protein:vir:95603 Length: 463 87.7 0.037 2.3E-05 28.4 14.4 309 106-497 1-354 (463) 204 protein:vir:99311 Length: 463 87.7 0.037 2.3E-05 28.4 14.4 309 106-497 1-354 (463) 205 protein:vir:94800 Length: 319 87.6 0.037 2.3E-05 28.4 20.4 282 98-497 1-298 (319) 206 protein:vir:97331 Length: 319 87.6 0.037 2.3E-05 28.4 20.4 282 98-497 1-298 (319) 207 protein:vir:94989 Length: 349 87.5 0.038 2.4E-05 28.4 18.5 278 151-497 1-319 (349) 208 protein:vir:78387 Length: 349 87.5 0.038 2.4E-05 28.3 17.8 278 151-497 1-315 (349) 209 protein:vir:3783 Length: 336 # 86.5 0.045 2.8E-05 28.0 17.3 312 127-497 1-334 (336) 210 protein:vir:1663 Length: 393 # 85.8 0.05 3.1E-05 27.7 15.0 380 1-491 1-393 (393) 211 protein:vir:4700 Length: 415 # 85.3 0.054 3.4E-05 27.5 19.9 388 1-471 7-415 (415) 212 protein:vir:4600 Length: 415 # 85.3 0.054 3.4E-05 27.5 19.9 388 1-471 7-415 (415) 213 protein:vir:95875 Length: 401 83.3 0.069 4.3E-05 26.9 16.4 310 142-494 1-401 (401) 214 protein:vir:3525 Length: 423 # 77.3 0.13 7.8E-05 25.5 17.8 268 151-497 1-308 (423) 215 protein:vir:78148 Length: 123 74.0 0.1 6.5E-05 26.0 7.0 106 377-493 1-123 (123) 216 protein:vir:107120 Length: 329 72.2 0.19 0.00012 24.6 20.9 293 110-497 1-310 (329) 217 protein:vir:105374 Length: 423 71.8 0.19 0.00012 24.5 17.8 275 151-497 1-332 (423) 218 protein:vir:174 Length: 423 # 71.5 0.19 0.00012 24.5 18.1 274 151-497 1-308 (423) 219 protein:vir:96666 Length: 462 58.0 0.42 0.00026 22.6 16.3 317 106-497 1-343 (462) 220 protein:vir:102823 Length: 470 45.9 0.75 0.00047 21.3 13.1 292 108-497 1-369 (470) 221 protein:vir:8846 Length: 705 # 36.9 1.1 0.00071 20.3 17.0 141 1-150 544-705 (705) 222 protein:vir:4339 Length: 395 # 31.0 1.5 0.00095 19.6 18.5 371 1-457 1-395 (395) 223 protein:vir:97397 Length: 517 30.2 1.6 0.00099 19.5 16.7 384 1-486 124-517 (517) 224 protein:vir:9410 Length: 415 # 29.2 1.7 0.001 19.3 20.0 388 1-471 7-415 (415) 225 protein:vir:4456 Length: 401 # 23.8 2.3 0.0014 18.6 18.8 378 1-484 5-401 (401) 226 protein:vir:100851 Length: 514 23.6 2.3 0.0014 18.6 11.3 329 92-497 1-386 (514) 227 protein:vir:103463 Length: 521 22.1 2.5 0.0015 18.4 19.1 366 48-497 1-508 (521) 228 protein:vir:8846 Length: 705 # 21.1 2.6 0.0016 18.3 15.8 127 1-136 576-705 (705) No 1 >protein:vir:7855 Length: 497 # NCBI annotation: gp12 # Family: family:all:585 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817462;genbank:gi:29565891;genbank:GeneID:1259081 Probab=100.00 E-value=4.8e-97 Score=548.64 Aligned_cols=497 Identities=100% Similarity=1.396 Sum_probs=437.7 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021309. 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) Q Consensus 1 ~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~a~~~~~~~~~~~~ 80 (497) ||+...++++.+++.++++++.++..+..+|+++.+.++.+++++...+++..++..+..+++++++++++.++.++.+. T Consensus 1 ~~~~~~l~~~~~~~~~~~~~~~~~~~~~~aE~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~ 80 (497) T protein:vir:78 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) T ss_pred CCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999998888888888888888899999999988888877 Q ss_pred HHHHHHHHHHHHHhhhhhhHHHHhHhhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccccc Q lcl|NC_021309. 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFA 160 (497) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 160 (497) +.....+............+..+.............................+.+..+...............++++.+| T Consensus 81 e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg 160 (497) T protein:vir:78 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFA 160 (497) T ss_pred HhhhhhhHHHHHHHHHhhhHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhHHHHHhhhcccCcccc Confidence 66554444444444444334433333333333333333333333333344445555555555555566667778888899 Q ss_pred cccchhhhHHHHHHHHhhhhHHhhcceeecCCCceEEEEEcCCCccceecccccccccccccceeeEeeeeeEEeeehhh Q lcl|NC_021309. 161 PGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTIT 240 (497) Q Consensus 161 ~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~wv~Eg~~~~~s~~~f~~i~~~~~kla~~~~iS 240 (497) ++|||++..+||+.+++.++|+++|+++++++++++||+++++++.++||+|++.+|+++++|++|++.+||++++++|| T Consensus 161 ~~vp~~~~~~ii~~~~~~~~i~~l~~~~~~~~~~~~~~~~~~~~~~a~wv~E~~~~~~s~~~f~~i~~~~~k~a~~~~iS 240 (497) T protein:vir:78 161 PGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTIT 240 (497) T ss_pred cccchhhhHHHHHHHHhhhhHHhhccccccCCCceEEEEEcCCCCcceeeccCcccccccccceeeEeeeeeeEeecHhH Confidence 99999999999999999999999999999999999999998877889999999999999999999999999999999999 Q ss_pred HHHHhhHHHHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccchhhhhhhHHHHHHhhhhhcchhhh Q lcl|NC_021309. 241 DEGLRDAPELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFV 320 (497) Q Consensus 241 ~ell~d~~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 320 (497) +|||+|+++|++||.++|++++++++|.+||+|+|+++|.||++.++..+.........................+.... T Consensus 241 ~ell~d~~~l~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 320 (497) T protein:vir:78 241 DEGLRDAPELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFV 320 (497) T ss_pred HHHHHhHHHHHHHHHHHHHHHHHHHHHHHhhcCCCcccccccccccccccccccccchhhhhhhhhhhhhhcccccchhh Confidence 99999999999999999999999999999999999999999999999998888888888888888888888888888899 Q ss_pred hhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHHHHHHHhhhcCceeccC Q lcl|NC_021309. 321 GQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGN 400 (497) Q Consensus 321 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~i~~~ 400 (497) +...+..+.+.....+.....+......++..+....++.++..+...++..+++|+||+.+|..|+++||++|||||++ T Consensus 321 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~i~~~ 400 (497) T protein:vir:78 321 GQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGN 400 (497) T ss_pred hhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcccCCCeEEEchHHHHHHHHhhcCCCceeccC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccccccccccccccceEecCCCCcCceEEEeeccceEEEEeecccEEEeecccchhhhcCceEEEEEEeecceee Q lcl|NC_021309. 401 FFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVY 480 (497) Q Consensus 401 ~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~ 480 (497) +.++..+.+....++|||+||++++.||+++++||||++++|.+++|.+++|+++++..++|++|+|+||++.|+||.|+ T Consensus 401 ~~~~~~~~~~~~~~~l~G~pV~~t~~~~~~~~~~Gd~~~~~~~i~~r~~~~v~~~~~~~~~f~~n~v~~r~~~r~~~~v~ 480 (497) T protein:vir:78 401 FFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVY 480 (497) T ss_pred cccccccccccCCceeeceeeEecCCCCCCceEEeecccceEEEEEecccEEEeecccchhhhcCcEEEEEEEeecceee Confidence 99999999999999999999999999999999999999999999999999999999988899999999999999999999 Q ss_pred cccceEEEEeeCCCCCC Q lcl|NC_021309. 481 RPSAFQLIQLKKGATGS 497 (497) Q Consensus 481 ~~~a~~~l~~~~~a~~~ 497 (497) +|+||++|+++++++|| T Consensus 481 ~p~A~~~l~~~~~~~~~ 497 (497) T protein:vir:78 481 RPSAFQLIQLKKGATGS 497 (497) T ss_pred ccccEEEEEecCCccCC Confidence 99999999999999999 No 2 >protein:vir:101650 Length: 497 # NCBI annotation: gp13 # Family: family:all:585 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654768;genbank:gi:109302766;genbank:GeneID:4156084 Probab=100.00 E-value=4.8e-97 Score=548.64 Aligned_cols=497 Identities=100% Similarity=1.396 Sum_probs=437.7 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021309. 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) Q Consensus 1 ~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~a~~~~~~~~~~~~ 80 (497) ||+...++++.+++.++++++.++..+..+|+++.+.++.+++++...+++..++..+..+++++++++++.++.++.+. T Consensus 1 ~~~~~~l~~~~~~~~~~~~~~~~~~~~~~aE~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~ 80 (497) T protein:vir:10 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) T ss_pred CCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999998888888888888888899999999988888877 Q ss_pred HHHHHHHHHHHHHhhhhhhHHHHhHhhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccccc Q lcl|NC_021309. 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFA 160 (497) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 160 (497) +.....+............+..+.............................+.+..+...............++++.+| T Consensus 81 e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg 160 (497) T protein:vir:10 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFA 160 (497) T ss_pred HhhhhhhHHHHHHHHHhhhHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhHHHHHhhhcccCcccc Confidence 66554444444444444334433333333333333333333333333344445555555555555566667778888899 Q ss_pred cccchhhhHHHHHHHHhhhhHHhhcceeecCCCceEEEEEcCCCccceecccccccccccccceeeEeeeeeEEeeehhh Q lcl|NC_021309. 161 PGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTIT 240 (497) Q Consensus 161 ~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~wv~Eg~~~~~s~~~f~~i~~~~~kla~~~~iS 240 (497) ++|||++..+||+.+++.++|+++|+++++++++++||+++++++.++||+|++.+|+++++|++|++.+||++++++|| T Consensus 161 ~~vp~~~~~~ii~~~~~~~~i~~l~~~~~~~~~~~~~~~~~~~~~~a~wv~E~~~~~~s~~~f~~i~~~~~k~a~~~~iS 240 (497) T protein:vir:10 161 PGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTIT 240 (497) T ss_pred cccchhhhHHHHHHHHhhhhHHhhccccccCCCceEEEEEcCCCCcceeeccCcccccccccceeeEeeeeeeEeecHhH Confidence 99999999999999999999999999999999999999998877889999999999999999999999999999999999 Q ss_pred HHHHhhHHHHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccchhhhhhhHHHHHHhhhhhcchhhh Q lcl|NC_021309. 241 DEGLRDAPELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFV 320 (497) Q Consensus 241 ~ell~d~~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 320 (497) +|||+|+++|++||.++|++++++++|.+||+|+|+++|.||++.++..+.........................+.... T Consensus 241 ~ell~d~~~l~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 320 (497) T protein:vir:10 241 DEGLRDAPELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFV 320 (497) T ss_pred HHHHHhHHHHHHHHHHHHHHHHHHHHHHHhhcCCCcccccccccccccccccccccchhhhhhhhhhhhhhcccccchhh Confidence 99999999999999999999999999999999999999999999999998888888888888888888888888888899 Q ss_pred hhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHHHHHHHhhhcCceeccC Q lcl|NC_021309. 321 GQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGN 400 (497) Q Consensus 321 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~i~~~ 400 (497) +...+..+.+.....+.....+......++..+....++.++..+...++..+++|+||+.+|..|+++||++|||||++ T Consensus 321 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~i~~~ 400 (497) T protein:vir:10 321 GQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGN 400 (497) T ss_pred hhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcccCCCeEEEchHHHHHHHHhhcCCCceeccC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccccccccccccccceEecCCCCcCceEEEeeccceEEEEeecccEEEeecccchhhhcCceEEEEEEeecceee Q lcl|NC_021309. 401 FFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVY 480 (497) Q Consensus 401 ~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~ 480 (497) +.++..+.+....++|||+||++++.||+++++||||++++|.+++|.+++|+++++..++|++|+|+||++.|+||.|+ T Consensus 401 ~~~~~~~~~~~~~~~l~G~pV~~t~~~~~~~~~~Gd~~~~~~~i~~r~~~~v~~~~~~~~~f~~n~v~~r~~~r~~~~v~ 480 (497) T protein:vir:10 401 FFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVY 480 (497) T ss_pred cccccccccccCCceeeceeeEecCCCCCCceEEeecccceEEEEEecccEEEeecccchhhhcCcEEEEEEEeecceee Confidence 99999999999999999999999999999999999999999999999999999999988899999999999999999999 Q ss_pred cccceEEEEeeCCCCCC Q lcl|NC_021309. 481 RPSAFQLIQLKKGATGS 497 (497) Q Consensus 481 ~~~a~~~l~~~~~a~~~ 497 (497) +|+||++|+++++++|| T Consensus 481 ~p~A~~~l~~~~~~~~~ 497 (497) T protein:vir:10 481 RPSAFQLIQLKKGATGS 497 (497) T ss_pred ccccEEEEEecCCccCC Confidence 99999999999999999 No 3 >protein:vir:100135 Length: 418 # NCBI annotation: gp5 # Family: family:all:585 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945035;genbank:gi:38707895;genbank:GeneID:2744182 Probab=100.00 E-value=1.7e-70 Score=403.06 Aligned_cols=411 Identities=25% Similarity=0.322 Sum_probs=281.4 Q ss_pred CchHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021309. 1 MPSTAQLEAQGRQ---LAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDI 77 (497) Q Consensus 1 ~~~~a~~~~~~~~---~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~a~~~~~~~~~ 77 (497) |+.+.++++..+. +.++++++++...+...+.++.+++...+.+. ..+...+..++++++..+.+.++.++ T Consensus 4 ~~~~~~~~~~~~~~~el~~~~~e~~~~l~~~~~e~~~~~e~~~~e~~~------~~~~~~e~~~~~~~l~~~~~~l~~~~ 77 (418) T protein:vir:10 4 MNEPRQFGRKSGGDSHPEQVLETVTKELKRIGDEVKSAGEKALAEAKR------AGDLGVETKATVDELLIKQGELQARL 77 (418) T ss_pred chhHHHHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh------hhhhhHHHHHHHHHHHHHHHHHHHHH Confidence 8888888764432 22223333332222222222222221111111 11223344445555555555555444 Q ss_pred HHHHHHHHHHHHHHHHhhhhhhHHHHhHhhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhcccccc Q lcl|NC_021309. 78 PEVEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTG 157 (497) Q Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (497) .+.+............. ..+ .............. . ..........................+++ T Consensus 78 ~~~e~~~~~~~~~~~~~------~~~-----~~~~~~~~~~~~~~---~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 141 (418) T protein:vir:10 78 LEAEQKLARGGGSAELE------TPK-----TLGQLVTESEEMKG---M--DGSARKSVRVRVDRKSIMNVPATVGSGVS 141 (418) T ss_pred HHHHHHHhhcccccccc------hhh-----hhhHHhhhHHHHHH---H--HHHHhhhhhhhhHHHHHHHhhhhccCCCC Confidence 43332211100000000 000 00000000000000 0 00000000011111111222233334455 Q ss_pred ccccccchhhhHHHHHHHHhhhhHHhhcceeecCCCceEEEEEcCCCccceecccccccccccccceeeEeeeeeEEeee Q lcl|NC_021309. 158 TFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANAL 237 (497) Q Consensus 158 ~~g~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~wv~Eg~~~~~s~~~f~~i~~~~~kla~~~ 237 (497) .+|++||+++..+||+.+++.++|+++|++++++++++++|++++.++.+.|++|++.+|+++++|++|++.++++++++ T Consensus 142 ~~g~lvp~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~f~~v~~~~~k~~~~~ 221 (418) T protein:vir:10 142 GSNSLVVADRQAGIIAPPQRKMTIRDLLMPGQTSSSSIEYTVETGFTNNAAAVAEGAQKPTSDLKFNLKNQPVRTIAHLF 221 (418) T ss_pred CCccccchhHHHHHHHHHhhhhhHHhhcceeeccCCceeEEEEecCCCceeeeccCccccccccceeeEEEeeeeEEEee Confidence 66778888899999999999999999999999999999999998877889999999999999999999999999999999 Q ss_pred hhhHHHHhhHHHHHHHHHHHHHHHHHHHHHhhhhcccCccc-cccccccccccccccccchhhhhhhHHHHHHhhhhhcc Q lcl|NC_021309. 238 TITDEGLRDAPELFNFVQGRLLEGIQRKEEVQLLAGGGYPG-VNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTN 316 (497) Q Consensus 238 ~iS~ell~d~~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~-p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 316 (497) +||+++|+|+++|++||+++|+++++.++|.+||+|+|+++ |.||++.++..+...... T Consensus 222 ~is~ell~ds~~l~~~i~~~l~~a~~~~~d~a~l~G~g~~~~p~Gi~~~~~~~~~~~~~~-------------------- 281 (418) T protein:vir:10 222 KASRQILDDAPALQSYIDGRARYGLQLTEEGQILKGDGTGANILGILPQASAFMPSITLA-------------------- 281 (418) T ss_pred hhhHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccccccccccccccccccc-------------------- Confidence 99999999999999999999999999999999999999875 999999876554332211 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHHHHHHHhhhcCce Q lcl|NC_021309. 317 GAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQY 396 (497) Q Consensus 317 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~ 396 (497) ....++++..++..+...+ ..+++|+||+.+|..|++++|++|+| T Consensus 282 ----------------------------------~~~~~~~i~~~~~~~~~~~-~~~~~~v~n~~~~~~L~~lkd~~G~~ 326 (418) T protein:vir:10 282 ----------------------------------NATPIDKIRLALLQAVLAE-FPATGIVLNPIDWASIELTKDSQGRY 326 (418) T ss_pred ----------------------------------ccccHHHHHHHHHhhcccc-CCCCEEEEcHHHHHHHHHhhcCCCce Confidence 0111344555555555544 45567999999999999999999999 Q ss_pred eccCcccccccccccccccccccceEecCCCCcCceEEEeeccceEEEEeecccEEEeecccchhhhcCceEEEEEEeec Q lcl|NC_021309. 397 MGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLG 476 (497) Q Consensus 397 i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~ 476 (497) ||+++..+ ..++|+|+||++++.||.++++||||++ +|.++++.+++|+++++.+.+|++|++.||++.|+| T Consensus 327 i~~~~~~~-------~~~~l~G~pV~~~~~~p~~~~~~gd~s~-~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~~~d 398 (418) T protein:vir:10 327 IVGNPVNG-------TTPRLWNLPVVETQAMTANEFLVGAFSM-AAQIFDRMEIEVLLSTENVDDFEKNMVSIRAEERLA 398 (418) T ss_pred eccccccC-------CCceecceeeEEcCCCCCCcEEEeeccc-eEEEEEecceEEEEecccchhhhcCceEEEEEEeec Confidence 99654322 3468999999999999999999999998 478999999999999998889999999999999999 Q ss_pred ceeecccceEEEEeeCCCCC Q lcl|NC_021309. 477 LLVYRPSAFQLIQLKKGATG 496 (497) Q Consensus 477 ~~v~~~~a~~~l~~~~~a~~ 496 (497) |.+++|+||+++++++++.| T Consensus 399 ~~~~~~~a~~~~~~~~~~~g 418 (418) T protein:vir:10 399 LAVYRPESFVTGALVEQAGG 418 (418) T ss_pred cEEecccceEEEEeccCCCC Confidence 99999999999999999999 No 4 >protein:vir:4339 Length: 395 # NCBI annotation: major head protein # Family: family:all:585 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061502;genbank:gi:9635591;genbank:GeneID:1262860 Probab=100.00 E-value=1.4e-68 Score=392.51 Aligned_cols=394 Identities=26% Similarity=0.368 Sum_probs=272.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021309. 4 TAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEVEVR 83 (497) Q Consensus 4 ~a~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~a~~~~~~~~~~~~~~~ 83 (497) +.+++++++++..+++++... .++..+...++++... +..+++.++++++..+.+.++.++.+.+.. T Consensus 1 m~~~~k~l~el~~~~~~~~~~-------~~~~~e~~~~~~~~~~------~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~ 67 (395) T protein:vir:43 1 MSDFEKQIGELNASLKQVGDQ-------IKSQAEQVNTQIANFG------EMNKETRAKVDELLTAQGELQARLSAAEQA 67 (395) T ss_pred ChhHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHh------hhhHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 223333444443333333322 2222222112221111 112333344444444444444433322221 Q ss_pred HHHHHHHHHHhhhhhhHHHHhHhhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhcccccccccccc Q lcl|NC_021309. 84 NLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGI 163 (497) Q Consensus 84 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v 163 (497) ............. .+... .........+ ......+. ............++++++|+++ T Consensus 68 ~~~~~~~~~~~~~-----~~~~~-------~~~~~~~~~~-----~~~~~~~~-----~~~~~~~~~~~~~~~~~~g~~v 125 (395) T protein:vir:43 68 MLANEKRDGGEEA-----PKTAG-------QMVAESLKEQ-----GVTSSLRG-----SHRVSMPRSAITSIDGSGGALV 125 (395) T ss_pred HHhhhccccccch-----hhhHH-------HHHHHHHHHH-----HHHHHhhh-----hhhhhhhhhhhcccCCCCcccc Confidence 1110000000000 00000 0000000000 00000000 0001111223345566778899 Q ss_pred chhhhHHHHHHHHhhhhHHhhcceeecCCCceEEEEEcCCCccceecccccccccccccceeeEeeeeeEEeeehhhHHH Q lcl|NC_021309. 164 LPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTITDEG 243 (497) Q Consensus 164 ~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~wv~Eg~~~~~s~~~f~~i~~~~~kla~~~~iS~el 243 (497) ||++..+||+.+++.++|+++|++++++++.++||+.++.++.+.|++|++.+|+++++|++|++.++|++++++||+++ T Consensus 126 p~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~i~~~~~k~~~~~~is~el 205 (395) T protein:vir:43 126 APDRRPGVVAAPQRRLTIRDLVAPGTTESNSVEYVRETGFVNNAAPVSEGTQKPYSDLTFELENAPVRTIAHLFKASRQI 205 (395) T ss_pred chhhHHHHHHHHHhhhhHHhhccceecCCCceEEEEEecCCCceeeecCCccccccccceeEEEEeeeeEEEeehhhHHH Confidence 99999999999999999999999999999999999998877889999999999999999999999999999999999999 Q ss_pred HhhHHHHHHHHHHHHHHHHHHHHHhhhhcccCccc-cccccccccccccccccchhhhhhhHHHHHHhhhhhcchhhhhh Q lcl|NC_021309. 244 LRDAPELFNFVQGRLLEGIQRKEEVQLLAGGGYPG-VNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQ 322 (497) Q Consensus 244 l~d~~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~-p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 322 (497) |+|+++|++||.++|+++++.++|.+||+|+|+++ |.||++..+..+...... T Consensus 206 l~d~~~l~~~v~~~la~a~~~~~d~~~l~G~g~~~~~~Gi~~~~~~~~~~~~~~-------------------------- 259 (395) T protein:vir:43 206 LDDASALQSYIDARARYGLMLVEECQLLYGNGTGANLHGIIPQAQAYAPPSGVV-------------------------- 259 (395) T ss_pred HHhHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccccccc-------------------------- Confidence 99999999999999999999999999999999876 589998776544332211 Q ss_pred hhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHHHHHHHhhhcCceeccCcc Q lcl|NC_021309. 323 DTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFF 402 (497) Q Consensus 323 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~i~~~~~ 402 (497) ......++++..++..+...++ ++.+|+||+.+|..|+++||++|+|||+++. T Consensus 260 --------------------------~~~~~~~~~i~~~~~~~~~~~~-~~~~~vmn~~~~~~l~~lkd~~G~~i~~~~~ 312 (395) T protein:vir:43 260 --------------------------VTAEQRIDRIRLAILQAQLAEF-PASGIVLNPIDWALIELNKDAENRYIIGSPQ 312 (395) T ss_pred --------------------------cccchhHHHHHHHHHhhccccC-CCcEEEEcHHHHHHHHHhhccCCceeccccc Confidence 1112234566666667766654 4568999999999999999999999997643 Q ss_pred cccccccccccccccccceEecCCCCcCceEEEeeccceEEEEeecccEEEeecccchhhhcCceEEEEEEeecceeecc Q lcl|NC_021309. 403 GNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRP 482 (497) Q Consensus 403 ~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~ 482 (497) .+ ..++|+|+||+++++||.++++||||++ +|.+++|.+++|+++++.+.+|++|++.||++.|+||.|++| T Consensus 313 ~~-------~~~~l~G~pVv~~~~~~~~~~~~gd~~~-~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~ 384 (395) T protein:vir:43 313 NG-------TTPTLWRLPVVETQAITQDEFLTGAFSL-GAQIFDRMDIEVLVSTENDKDFENNMVTIRAEERLAFAVYRP 384 (395) T ss_pred cC-------CCceecceeeEEcCCCCCCcEEEEeccc-eEEEEEecceEEEEeccccchhhcCcEEEEEEEeeccEEecc Confidence 22 3468999999999999999999999998 577889999999999998889999999999999999999999 Q ss_pred cceEEEEeeCC Q lcl|NC_021309. 483 SAFQLIQLKKG 493 (497) Q Consensus 483 ~a~~~l~~~~~ 493 (497) +||++|+++++ T Consensus 385 ~a~~~~~~taa 395 (395) T protein:vir:43 385 EAFVTGSLTAS 395 (395) T ss_pred cceEEEEeccC Confidence 99999999999 No 5 >protein:vir:81227 Length: 413 # NCBI annotation: gp6, major capsid protein # Family: family:all:585 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456736;genbank:gi:157168379;hssp:P49861;interpro:IPR006444;uniprot:Q9MBJ9;genbank:GeneID:5580350 Probab=100.00 E-value=4.9e-67 Score=384.09 Aligned_cols=406 Identities=29% Similarity=0.398 Sum_probs=266.2 Q ss_pred HHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021309. 14 LAKSIKDINADETK-TAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEVEVRNLKQIRKHL 92 (497) Q Consensus 14 ~~~~~~~~~~~~~~-~~~e~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~ 92 (497) |+++.++...+..+ +.+|+++.++++....+..+. +..++..+....+.++.......... ..... T Consensus 1 ~~ke~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~----------~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~ 67 (413) T protein:vir:81 1 MVKEAGDAPTNAQVAEIAEVKSMVEQFKADEDAKRE----------RAKSVKANQDFLRELQEATAGSVDSE---KSGEL 67 (413) T ss_pred ChhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----------HHHHHHHHHHHHHHHHHHHHhHHhHH---HhhhH Confidence 55555554443222 222333333222221111110 01111111111111111000000000 00000 Q ss_pred HhhhhhhHHHHhHhhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccccccccchhhhHHHH Q lcl|NC_021309. 93 ARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIV 172 (497) Q Consensus 93 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~p~~~~~ii 172 (497) ..... .................... ...........+ ................++++.+|+++|+++..+|| T Consensus 68 ~~~~~---~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~ii 139 (413) T protein:vir:81 68 TRKGE---GYKSIGEFFAKRAGDQIKQQ-AGGAQLNYSVGE----YVAPRVKAASDPASTATLTDEFQGGYGTTWNRNII 139 (413) T ss_pred hhhhh---hhhhhhhhhhhhhhhHHHHH-HHHHHhhhhhhh----hhhhHHHhhhhhhhhcccccccccccchhhHHHHH Confidence 00000 00000000000000000000 000000000000 00000000111122334455677788888999999 Q ss_pred HHHHhhhhHHhhcceeecCCCceEEEEEcCCC---ccceeccccccccccc-ccceeeEeeeeeEEeeehhhHHHHhhHH Q lcl|NC_021309. 173 EQLFYELSLADLISSRPVTSPNLSYLTESAAH---NNAAAVAEAGTYPFSS-EEFARVYEQVGKVANALTITDEGLRDAP 248 (497) Q Consensus 173 ~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~---~~a~wv~Eg~~~~~s~-~~f~~i~~~~~kla~~~~iS~ell~d~~ 248 (497) +.+++.++|+++|++++++++.++||+.++.. ..++|++||+.+|+++ ++|++|++.++|++++++||+|||+|++ T Consensus 140 ~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~ 219 (413) T protein:vir:81 140 YRRREKLVVADLMDNLTMTNTTIKYLMEKANRVVEGGFKTVAEGGKKPYMRFADFDIVTESLSKIAGLTKITDEMIEDYD 219 (413) T ss_pred HHHhhhhhHHhhcceeeccCCceeEEEeccccccccccceecCcccccccCcccceeeEeeeeeEEEeehhhHHHHHHHH Confidence 99999999999999999999999999987642 4589999999999987 6899999999999999999999999998 Q ss_pred HHHHHHHHHHHHHHHHHHHhhhhcccCccc-cccccccccccccccccchhhhhhhHHHHHHhhhhhcchhhhhhhhhhh Q lcl|NC_021309. 249 ELFNFVQGRLLEGIQRKEEVQLLAGGGYPG-VNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVAS 327 (497) Q Consensus 249 ~l~~~i~~~la~~~~~~~d~~~l~G~G~~~-p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 327 (497) .|++||+++|+++++.++|.+||+|+|+++ |.||++.++..+...... T Consensus 220 ~l~~~i~~~la~~~~~~~d~~~l~G~G~~~~~~Gi~~~~~~~~~~~~~~------------------------------- 268 (413) T protein:vir:81 220 FLVSYINARLLEELAIEEERQLLLGDGTGNNLTGLLKRDGIQTLAVSNK------------------------------- 268 (413) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccCCCCCccccccccccccccccccc------------------------------- Confidence 899999999999999999999999999986 589998876654433221 Q ss_pred hhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHHHHHHHhhhcCceeccCccccccc Q lcl|NC_021309. 328 LKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYG 407 (497) Q Consensus 328 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~ 407 (497) ....+.+..++..+....+..+++|+||+.+|..|+++||++|||||.++.....+ T Consensus 269 ------------------------~~~~~~i~~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~ 324 (413) T protein:vir:81 269 ------------------------DELADSIYKAMTNISLATPFQADALVINPLDYQELRLAKDANGQYYGGGVFQGQYG 324 (413) T ss_pred ------------------------chhHHHHHHHHHHhhhhccCCCcEEEEcHHHHHHHHHhhccCCceecccccccccc Confidence 11234455555555555566677899999999999999999999999887665544 Q ss_pred ccc-cccccccccceEecCCCCcCceEEEeeccceEEEEeecccEEEeecccchhhhcCceEEEEEEeecceeecccceE Q lcl|NC_021309. 408 NPV-NGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQ 486 (497) Q Consensus 408 ~~~-~~~~~l~G~Pvv~~~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~a~~ 486 (497) .+. ...++|||+||++++++|+++++||||++ +|.+++|.+++++++++.+++|++|++.||+++|+|+.+++|+||+ T Consensus 325 ~~~~~~~~~l~G~pv~~s~~~~~~~~~~gd~~~-~~~~~~~~~~~v~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~ 403 (413) T protein:vir:81 325 SGGIMLDPAPWGLRTVQSQVVPVGKPVVGAFRS-AASVLRKGGVRIDSTNTNVDDFENNLITVRAEERVGLMVTFPEAIV 403 (413) T ss_pred ccccccCceecceeeEEcCCCCcccEEEEeccc-EEEEEEecceEEEEeccccchhhcCcEEEEEEEeeccEEecccceE Confidence 332 24468999999999999999999999998 5789999999999999988899999999999999999999999999 Q ss_pred EEEeeCCCCC Q lcl|NC_021309. 487 LIQLKKGATG 496 (497) Q Consensus 487 ~l~~~~~a~~ 496 (497) +|+++++++= T Consensus 404 ~l~~~~~~~p 413 (413) T protein:vir:81 404 QLDVAEVVTP 413 (413) T ss_pred EEEecCCCCC Confidence 9999887777 No 6 >protein:vir:10364 Length: 390 # NCBI annotation: head protein; major capsid subunit precursor # Family: family:all:585 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858956;genbank:gi:32128421;genbank:GeneID:2648357 Probab=100.00 E-value=1.4e-67 Score=387.02 Aligned_cols=388 Identities=28% Similarity=0.394 Sum_probs=259.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-hHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021309. 11 GRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQA-EVEAHERAQEMLKSLGGADAAKDGLDNDIPEVEVRNLKQIR 89 (497) Q Consensus 11 ~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~-~~~~~e~~~e~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~ 89 (497) +.++.++ ++++ +.++++.+..+.++...... ..+.....+++.++++++++++++++.++.+.+..... T Consensus 1 m~e~~~~---l~~~----~~~~~~~~~~~~e~~~~~~~~~~e~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~--- 70 (390) T protein:vir:10 1 MTDITSK---LEAT----LANVTDSLRAFGERAVRDGELNASARSKVDELFATVGNLSAEVQAARQRVAELEGNGAG--- 70 (390) T ss_pred ChHHHHH---HHHH----HHHHHHHHHHHHHHHHhhcccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccc--- Confidence 1111111 1111 11112222111111111000 01111222333333333333333333322221111000 Q ss_pred HHHHhhhhhhHHHHhHhhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccccccccchhhhH Q lcl|NC_021309. 90 KHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLP 169 (497) Q Consensus 90 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~p~~~~ 169 (497) .....+.. ........... ...... ..... ...............++++.+|+++||++.. T Consensus 71 --------~~~~~~~~-----~~~~~~~~~~~--~~~~~~--~~~~~--~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~ 131 (390) T protein:vir:10 71 --------GDVQHVSV-----GDLFVASEQFQ--ASAGRW--NDRSA--RATMNIKAALNTASTDAAGSAGALTTPNRLP 131 (390) T ss_pred --------ccccccch-----hhhhhhhHHHH--HHHHhh--hhhhh--hhhhHHHHHHHhhhcccccccccccchhHHH Confidence 00000000 00000000000 000000 00000 0000111122233445566778899999999 Q ss_pred HHHHHHHhhhhHHhhcceeecCCCceEEEEEcCCCccceecccccccccccccceeeEeeeeeEEeeehhhHHHHhhHHH Q lcl|NC_021309. 170 GIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTITDEGLRDAPE 249 (497) Q Consensus 170 ~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~wv~Eg~~~~~s~~~f~~i~~~~~kla~~~~iS~ell~d~~~ 249 (497) +||+.+++.++|+++|++++++++.++||++++.++.+.|++|++.+|+++++|++|++.+++++++++||++||+|+++ T Consensus 132 ~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~~~ 211 (390) T protein:vir:10 132 GFITQPDARLTVRDLIGSGRTDSALIEYVQETGFVNNAAIVAEGALKPESSLKFAKKTDTTHVIAHTMKATRQILSDAPQ 211 (390) T ss_pred HHHHHHHhhchhhhhcceeeccCCceEEEEEecCCcceeeecCCccccccccceeEEEEeeEEEEEeehhhHHHHHhHHH Confidence 99999999999999999999999999999999877889999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHhhhhcccCccc-cccccccccccccccccchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhh Q lcl|NC_021309. 250 LFNFVQGRLLEGIQRKEEVQLLAGGGYPG-VNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASL 328 (497) Q Consensus 250 l~~~i~~~la~~~~~~~d~~~l~G~G~~~-p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 328 (497) +++||.++|++++++++|.+||+|+|+++ |.||++.++......... T Consensus 212 l~~~i~~~l~~~~~~~~~~~il~G~G~~~~p~Gi~~~~~~~~~~~~~~-------------------------------- 259 (390) T protein:vir:10 212 LASYMNNRLIRGLKVKEDAEILRGTGANDGLLGLIPQATTYAAPTTIA-------------------------------- 259 (390) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhcCCCCcccccccccccccccccccc-------------------------------- Confidence 99999999999999999999999999875 999998765433221111 Q ss_pred hhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHHHHHHHhhhcCceeccCcccccccc Q lcl|NC_021309. 329 KYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGN 408 (497) Q Consensus 329 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~ 408 (497) .....+.+..++..+...+ .++++|+||+.+|..|+++||++|+|||+++... T Consensus 260 ----------------------~~~~~~~~~~~~~~l~~~~-~~~~~~v~n~~~~~~L~~lkd~~g~~l~~~~~~~---- 312 (390) T protein:vir:10 260 ----------------------GATRVDQLRLAMLQASLAE-YPASGIVINPIDWAAIELAKDANNQYLIGNARGT---- 312 (390) T ss_pred ----------------------ccchHHHHHHHHHhhcccc-CCCCEEEEcHHHHHHHHHhhcCCCceeecCCcCc---- Confidence 0112344555566665554 4567899999999999999999999999876432 Q ss_pred cccccccccccceEecCCCCcCceEEEeeccceEEEEeecccEEEeecccchhhhcCceEEEEEEeecceeecccceEEE Q lcl|NC_021309. 409 PVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLI 488 (497) Q Consensus 409 ~~~~~~~l~G~Pvv~~~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~a~~~l 488 (497) ..++|+|+||++++.||+++++||||++ +|.+++|.+++|+++++. .+|++|++.||++.|+||.|++|+||+++ T Consensus 313 ---~~~~l~G~pv~~~~~~p~~~~~~gdf~~-~~~~~~~~~~~i~~~~~~-~~~~~~~~~~r~~~r~d~~v~~~~a~~~~ 387 (390) T protein:vir:10 313 ---LTPTLWGLPVVATQAMAPGEFLVGAFDL-AAQIFDQWDARVEIGYVN-DDFQRNMVTVLAEERLALVVYRPEALISG 387 (390) T ss_pred ---CCceecceeeEEcCCCCCCcEEEEeccc-eEEEEEecceEEEEeecc-cccccCcEEEEEEEeeccEEeccccEEEE Confidence 3468999999999999999999999998 477899999999998753 56999999999999999999999999999 Q ss_pred Eee Q lcl|NC_021309. 489 QLK 491 (497) Q Consensus 489 ~~~ 491 (497) ++. T Consensus 388 ~~a 390 (390) T protein:vir:10 388 SFA 390 (390) T ss_pred EeC Confidence 999 No 7 >protein:vir:100247 Length: 425 # NCBI annotation: gp76 # Family: family:all:21 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355412;genbank:gi:77864702;genbank:GeneID:3725969 Probab=100.00 E-value=3.3e-67 Score=385.05 Aligned_cols=404 Identities=19% Similarity=0.225 Sum_probs=267.6 Q ss_pred Cch-------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhHH-------HHHHHHHHHHHHH Q lcl|NC_021309. 1 MPS-------TAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAH-------ERAQEMLKSLGGA 66 (497) Q Consensus 1 ~~~-------~a~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~-------e~~~e~~~~~~~~ 66 (497) |-. ++.|+....++.+.+.+++++..+++.++. +++..+++..+.+.+.. ....+...+++.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~l~e~ra~~~~e~~~l~---~~~~~~~~~~k~~~~~~~~~~~~~~~~~e~~~~~~~~ 77 (425) T protein:vir:10 1 MSKKLLIAVLTAALTGPVGAVPRGIISVRAEGPTEVKALI---ENLQKAFHDFKAEHTKQLDAVKAGLPTSDALAKVDKV 77 (425) T ss_pred CchhHHHHhhHHHhhhhhhhhhHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHhhhccHHHHHHHHHH Confidence 322 334444555555666666665554443333 33222222222111110 0011111222222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHhHhhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhh Q lcl|NC_021309. 67 DAAKDGLDNDIPEVEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPA 146 (497) Q Consensus 67 ~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 146 (497) +.+++.++..+.+.... ....... ...... . ...+.+..+....+... T Consensus 78 ~~ei~~~~~~~~~~~~~--------~~~~~~~----------~~~~~~---~-----------~~~~~~~af~~~l~~~e 125 (425) T protein:vir:10 78 SADLEALQAAVDEANIK--------IAAAQMG----------ANGVKP---L-----------RDPEYTEAFKAHVKRGD 125 (425) T ss_pred HHHHHHHHHHHHHHHHH--------HHhhhcc----------cccccc---c-----------ccHHHHHHHHHHhhhhh Confidence 22222222211111000 0000000 000000 0 00000111111111111 Q ss_pred hhhhhccccccccccccchhhhHHHHHHHHhhhhHHhhcceeecCCCceEEEEEcCCCccceeccccccccccc-cccee Q lcl|NC_021309. 147 AIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSS-EEFAR 225 (497) Q Consensus 147 ~~~~~~~~~~~~~g~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~wv~Eg~~~~~s~-~~f~~ 225 (497) .......++++.+|.+||+++..+|++.++..++|+++|++++++++.+++|+.++. ..++|++|++.+|+++ ++|++ T Consensus 126 ~~~al~~~t~~~gG~lvP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~~~~~~~-~~a~wv~E~~~~~~~~~~~f~~ 204 (425) T protein:vir:10 126 VQAALNKGEDSEGGYLTPIEWDRTITNKLVLISPMRQLCRVQPVSKAGFSKLFNMGG-TTSGWVGEASQRPQTNAATFQP 204 (425) T ss_pred hHHHhhcCcCCCCceeccHhHHHHHHHHHHhhhhhhhhceeeeccCCceEEEEEcCC-cceeeeccccccccccccccce Confidence 222333445566677788888899999999999999999999999999999998874 6899999999999876 79999 Q ss_pred eEeeeeeEEeeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccchhhhhhhH Q lcl|NC_021309. 226 VYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSAT 304 (497) Q Consensus 226 i~~~~~kla~~~~iS~ell~d~-~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~ 304 (497) |++.++|++++++||+|+|+|+ ++|++||.++|+++++.++|.+||+|+|+++|.||++..+..+.......... T Consensus 205 v~~~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~ai~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~---- 280 (425) T protein:vir:10 205 LSFASGEIYANPAATQQILDDAEIDLESWLATEVQTEFAKQEGKAFLAGDGTNKPNGLLTYIAGGANAAKHPFGAI---- 280 (425) T ss_pred eeeeheeeEeehHhHHHHHhcchhHHHHHHHHHHHHHHHHHHHhhhhcccCCCCcceeeecccccccccccccccc---- Confidence 9999999999999999999997 79999999999999999999999999999999999998764433322111000 Q ss_pred HHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHH Q lcl|NC_021309. 305 VSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWE 384 (497) Q Consensus 305 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~ 384 (497) ............++++.+.+..+...|+. +.+|+||+.+|. T Consensus 281 --------------------------------------~~~~~~~~~~~~~d~l~~l~~~l~~~~~~-~a~~vmn~~~~~ 321 (425) T protein:vir:10 281 --------------------------------------EVVNSGAAADITSDGIIDLVYDLPSAFTG-NARFAMNRNTQR 321 (425) T ss_pred --------------------------------------ccccccccccccHHHHHHHHhhhhhhhcc-CCEEEEchHHHH Confidence 00001111223356677777787776665 457999999999 Q ss_pred HHHHHhhhcCceeccCcccccccccccccccccccceEecCCCCc-----CceEEEeeccceEEEEeecccEEEeecccc Q lcl|NC_021309. 385 LLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPL-----GTILVGHFAPSVIQTARREGVTMQMTNSNG 459 (497) Q Consensus 385 ~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~-----~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~ 459 (497) .|+++||++|||||+++.... .+.+|+|+||++++.||. ..++||||++ +|.+++|.++++.++++ T Consensus 322 ~L~~lkD~~G~~l~~~~~~~g------~~~~l~G~PV~~~~~~p~~~~~~~~i~~Gd~~~-~~~i~~~~~~~v~~d~~-- 392 (425) T protein:vir:10 322 QVRKLKDGQGNYLWQPSYVAG------QPATLAGYPVTEVPDMPDVAANSTPILFGDFQQ-TYLIIDRIGVRVLRDPY-- 392 (425) T ss_pred HHHHhhcCCCceeeccCccCC------CCceecceeeEEecCcCCccCCccEEEEEehhc-cEEEEEecceEEEeccc-- Confidence 999999999999998764432 346899999999999994 3378999998 58899999998876553 Q ss_pred hhhhcCceEEEEEEeecceeecccceEEEEeeCCC Q lcl|NC_021309. 460 TDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGA 494 (497) Q Consensus 460 ~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~~a 494 (497) |.+|++.||++.|+|+.|++|+||++++++++= T Consensus 393 --~~~~~~~~~~~~r~d~~v~~~~A~~~l~~~as~ 425 (425) T protein:vir:10 393 --TAKPYVLFYTTKRVGGGLLNPEPMRAMKVAASE 425 (425) T ss_pred --ccCCcEEEEEEEEeccEeecccceEEEEeeccC Confidence 678999999999999999999999999998877 No 8 >protein:vir:81070 Length: 390 # NCBI annotation: p09 # Family: family:all:585 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285679;genbank:gi:148727187;genbank:GeneID:5247115 Probab=100.00 E-value=3.3e-67 Score=385.07 Aligned_cols=388 Identities=27% Similarity=0.399 Sum_probs=262.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-hHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh Q lcl|NC_021309. 18 IKDINADETKTAAEKKEALAKIEPDFKAHQA-EVEAHERAQEMLKSLGGADAAKDGLDNDIPEVEVRNLKQIRKHLARAV 96 (497) Q Consensus 18 ~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~-~~~~~e~~~e~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 96 (497) +.++..+..+++.++++.+..+.++.+...+ ..+..+..+++.++++++++++++++..+.+.+..... .. T Consensus 1 m~~l~~~l~~~~~~~~~~~~~~~e~~~~~~~~~~e~~~~~~~l~~e~~~l~~~i~~~e~~~~~~~~~~~~--------~~ 72 (390) T protein:vir:81 1 MTDITSKLEATLANVTDSLRAFGERAVRDGELNASARSKVDELFATVGNLSAEVQAARQRVAELEGNGAG--------GD 72 (390) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccc--------cc Confidence 1111111111222222222222222111111 11112223333344444444444333332222111000 00 Q ss_pred hhhHHHHhHhhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccccccccchhhhHHHHHHHH Q lcl|NC_021309. 97 IMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLF 176 (497) Q Consensus 97 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~p~~~~~ii~~~~ 176 (497) . ..+.... ....... .+......... .. . ..............++++.+|+++||++..+||+.++ T Consensus 73 ~---~~~~~~~-----~~~~~~~--~~~~~~~~~~~--~~-~-~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ii~~~~ 138 (390) T protein:vir:81 73 V---QHVSVGD-----MFVASEQ--FQASAGRWNDR--SA-R-ATMNIKAALNTASTDAAGSAGALTTPNRLPGFITPPD 138 (390) T ss_pred c---ccccchh-----hhhhhHH--HHHHHHHHhhh--hh-h-hhhHHHHHHHhhccccccCCcceechhhhHHHHHHHh Confidence 0 0000000 0000000 00000000000 00 0 0001111122233456677888999999999999999 Q ss_pred hhhhHHhhcceeecCCCceEEEEEcCCCccceecccccccccccccceeeEeeeeeEEeeehhhHHHHhhHHHHHHHHHH Q lcl|NC_021309. 177 YELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTITDEGLRDAPELFNFVQG 256 (497) Q Consensus 177 ~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~wv~Eg~~~~~s~~~f~~i~~~~~kla~~~~iS~ell~d~~~l~~~i~~ 256 (497) ..++|+++|++++++++.++||+.++..+.+.|++|++.+|+++++|++|++.+++++++++||+|+|+|++++++||.+ T Consensus 139 ~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~~~~~~~i~~ 218 (390) T protein:vir:81 139 ARLTVRDLIGSGRTDSALIEYVQETGFVNNAAIVAEGALKPESSLKFAKKTDTTHVIAHTMKATRQILSDAPQLASYMNN 218 (390) T ss_pred hhhhhhhhcceeeccCCceEEEEEecCCcceeeecCCcccccccceeeEEEEeeeEEEEeehhhHHHHHhHHHHHHHHHH Confidence 99999999999999999999999988777899999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHhhhhcccCccc-cccccccccccccccccchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhh Q lcl|NC_021309. 257 RLLEGIQRKEEVQLLAGGGYPG-VNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVT 335 (497) Q Consensus 257 ~la~~~~~~~d~~~l~G~G~~~-p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 335 (497) +|++++++++|.+||+|+|+++ |.||++.++......... T Consensus 219 ~l~~~~~~~~d~a~l~G~g~~~~~~Gi~~~~~~~~~~~~~~--------------------------------------- 259 (390) T protein:vir:81 219 RLIRGLKVKEDAEILRGTGANDGLLGLIPQATTYAAPTTIA--------------------------------------- 259 (390) T ss_pred HHHHHHHHHHHHHHHhcCCCCCcccceeecccccccccccc--------------------------------------- Confidence 9999999999999999999876 999998766433222111 Q ss_pred hhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHHHHHHHhhhcCceeccCccccccccccccccc Q lcl|NC_021309. 336 GAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKN 415 (497) Q Consensus 336 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~ 415 (497) .....+++..++..+...++ .+++|+||+.+|..|+++||++|+|||+++... ..++ T Consensus 260 ---------------~~~~~~~~~~~~~~~~~~~~-~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~-------~~~~ 316 (390) T protein:vir:81 260 ---------------GATRVDQLRLAMLQASLAEY-NPSGIVINPIDWAAIELAKDANNQYLIGNARGT-------LTPT 316 (390) T ss_pred ---------------cchhHHHHHHHHHhhccccC-CCCEEEEcHHHHHHHHHhhcCCCceeecCcccc-------cCce Confidence 11123455566666666554 556899999999999999999999999875432 3468 Q ss_pred ccccceEecCCCCcCceEEEeeccceEEEEeecccEEEeecccchhhhcCceEEEEEEeecceeecccceEEEEee Q lcl|NC_021309. 416 IWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLK 491 (497) Q Consensus 416 l~G~Pvv~~~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~~ 491 (497) |+|+||++++.+|+++++||||++ +|.+++|.+++|+++++. .+|++|++.||++.|+|+.|++|+||+++++. T Consensus 317 l~G~pv~~~~~~p~~~~~~gd~~~-~~~~~~~~~~~v~~~~~~-~~~~~~~v~~r~~~r~d~~v~~~~a~v~~t~a 390 (390) T protein:vir:81 317 LWGLPVVATQAMAPGEFLVGAFDL-AAQIFDQWDARVEIGYVG-EDFQRNMITVLAEERLALVVYRPEALISGSFA 390 (390) T ss_pred ecceeeEEcCCCCCCcEEEEehhc-eEEEEEecceEEEEeccc-chhhcCcEEEEEEEeeccEEecccceEEEEeC Confidence 999999999999999999999998 577899999999998864 46999999999999999999999999999999 No 9 >protein:vir:1886 Length: 385 # NCBI annotation: major capsid subunit precursor # Family: family:all:585 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037666;genbank:gi:9634124;genbank:GeneID:1262513 Probab=100.00 E-value=9e-67 Score=382.66 Aligned_cols=384 Identities=23% Similarity=0.330 Sum_probs=271.4 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021309. 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) Q Consensus 1 ~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~a~~~~~~~~~~~~ 80 (497) |..+.+++++.+++.++++++..+...+++++.+....+.++ +.++.++++.+..++.+. T Consensus 1 M~~l~el~~~~~~~~~e~~~l~~~~~~e~~~~~~~~~~l~~~--------------------~~~~~~~~~~~~~~~~~~ 60 (385) T protein:vir:18 1 MSELALIQKAIEESQQKMTQLFDAQKAEIESTGQVSKQLQSD--------------------LMKVQEELTKSGTRLFDL 60 (385) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------------------HHHHHHHHHHHHHHHHHH Confidence 777666666666666555555443332222222222222111 112222222211111111 Q ss_pred HHHHHHHHHHHHHhhhhhhHHHHhHhhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccccc Q lcl|NC_021309. 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFA 160 (497) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 160 (497) +... ........ ... ......... .. ......... ............+++.+| T Consensus 61 ~~~~--------~~~~~~~~---~~~--~~~~~~~~~----------~~---~~~~~~~~~-~~~~~~~~~~~~~~~~~g 113 (385) T protein:vir:18 61 EQKL--------ASGAENPG---EKK--SFSERAAEE----------LI---KSWDGKQGT-FGAKTFNKSLGSDADSAG 113 (385) T ss_pred HHHh--------hccccccc---hhh--hhHHHHHHH----------HH---HHHHHhhcc-chhhHHHhhhccccccCC Confidence 1000 00000000 000 000000000 00 000000000 001111223334556678 Q ss_pred cccchhhhHHHHHHHHhhhhHHhhcceeecCCCceEEEEEcCCCccceecccccccccccccceeeEeeeeeEEeeehhh Q lcl|NC_021309. 161 PGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTIT 240 (497) Q Consensus 161 ~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~wv~Eg~~~~~s~~~f~~i~~~~~kla~~~~iS 240 (497) +++||++...||+.++..++|+++|++++++++.++||+.++..+.+.|++|++.+|+++++|+++++.+++++++++|| T Consensus 114 ~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~~~~~k~~~~~~is 193 (385) T protein:vir:18 114 SLIQPMQIPGIIMPGLRRLTIRDLLAQGRTSSNALEYVREEVFTNNADVVAEKALKPESDITFSKQTANVKTIAHWVQAS 193 (385) T ss_pred ceecchhhhHHHHHhhhccchhhhcceecccCcceEEEEEecCCcceeeeccCccccccccceeEEEEeeeeEEEeehhh Confidence 89999999999999999999999999999999999999998877889999999999999999999999999999999999 Q ss_pred HHHHhhHHHHHHHHHHHHHHHHHHHHHhhhhcccCccc-cccccccccccccccccchhhhhhhHHHHHHhhhhhcchhh Q lcl|NC_021309. 241 DEGLRDAPELFNFVQGRLLEGIQRKEEVQLLAGGGYPG-VNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAF 319 (497) Q Consensus 241 ~ell~d~~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~-p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 319 (497) +|+|+|++++++||.++|+++++.++|.+||+|+|+++ |.||++.++..+..... T Consensus 194 ~ell~d~~~l~~~i~~~la~a~~~~~d~~~l~G~g~~~~~~Gi~~~~~~~~~~~~~------------------------ 249 (385) T protein:vir:18 194 RQVMDDAPMLQSYINNRLMYGLALKEEGQLLNGDGTGDNLEGLNKVATAYDTSLNA------------------------ 249 (385) T ss_pred HHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccccccccccccccccc------------------------ Confidence 99999999999999999999999999999999999886 68998876543322111 Q ss_pred hhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHHHHHHHhhhcCceecc Q lcl|NC_021309. 320 VGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGG 399 (497) Q Consensus 320 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~i~~ 399 (497) .....++.+.+++..+...+ ..+++|+||+.+|..|+++||++|||||+ T Consensus 250 ------------------------------~~~~~~d~i~~~~~~l~~~~-~~~~~~~~~~~~~~~l~~lkd~~G~~l~~ 298 (385) T protein:vir:18 250 ------------------------------TGDTRADIIAHAIYQVTESE-FSASGIVLNPRDWHNIALLKDNEGRYIFG 298 (385) T ss_pred ------------------------------cccchHHHHHHHHHhhcccc-CCCCEEEEcHHHHHHHHHhhcCCCceecc Confidence 11122455666666666554 45568999999999999999999999997 Q ss_pred CcccccccccccccccccccceEecCCCCcCceEEEeeccceEEEEeecccEEEeecccchhhhcCceEEEEEEeeccee Q lcl|NC_021309. 400 NFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLV 479 (497) Q Consensus 400 ~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v 479 (497) ++..+ ...+|+|+||++++++|+++++||||++ +|.++++.+++|+++++..++|++|++.||+++|+||.| T Consensus 299 ~~~~~-------~~~~l~G~pV~~~~~~p~~~~~~gd~~~-~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~r~~~~v 370 (385) T protein:vir:18 299 GPQAF-------TSNIMWGLPVVPTKAQAAGTFTVGGFDM-ASQVWDRMDATVEVSREDRDNFVKNMLTILCEERLALAH 370 (385) T ss_pred CcccC-------CCceecceeeEEcCcCCCCcEEEeeccc-EEEEEEecceEEEEeccccchhhcCcEEEEEEEeeccEE Confidence 65432 3468999999999999999999999998 578999999999999998889999999999999999999 Q ss_pred ecccceEEEEeeCCC Q lcl|NC_021309. 480 YRPSAFQLIQLKKGA 494 (497) Q Consensus 480 ~~~~a~~~l~~~~~a 494 (497) ++|+||+++++++++ T Consensus 371 ~~~~a~~~~~~~aa~ 385 (385) T protein:vir:18 371 YRPTAIIKGTFSSGS 385 (385) T ss_pred ecccceEEEEeccCC Confidence 999999999999999 No 10 >protein:vir:191 Length: 385 # NCBI annotation: major head subunit precursor # Family: family:all:585 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037701;genbank:gi:9634158;genbank:GeneID:1262530 Probab=100.00 E-value=9e-67 Score=382.66 Aligned_cols=384 Identities=23% Similarity=0.330 Sum_probs=271.4 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021309. 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) Q Consensus 1 ~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~a~~~~~~~~~~~~ 80 (497) |..+.+++++.+++.++++++..+...+++++.+....+.++ +.++.++++.+..++.+. T Consensus 1 M~~l~el~~~~~~~~~e~~~l~~~~~~e~~~~~~~~~~l~~~--------------------~~~~~~~~~~~~~~~~~~ 60 (385) T protein:vir:19 1 MSELALIQKAIEESQQKMTQLFDAQKAEIESTGQVSKQLQSD--------------------LMKVQEELTKSGTRLFDL 60 (385) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------------------HHHHHHHHHHHHHHHHHH Confidence 777666666666666555555443332222222222222111 112222222211111111 Q ss_pred HHHHHHHHHHHHHhhhhhhHHHHhHhhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccccc Q lcl|NC_021309. 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFA 160 (497) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 160 (497) +... ........ ... ......... .. ......... ............+++.+| T Consensus 61 ~~~~--------~~~~~~~~---~~~--~~~~~~~~~----------~~---~~~~~~~~~-~~~~~~~~~~~~~~~~~g 113 (385) T protein:vir:19 61 EQKL--------ASGAENPG---EKK--SFSERAAEE----------LI---KSWDGKQGT-FGAKTFNKSLGSDADSAG 113 (385) T ss_pred HHHh--------hccccccc---hhh--hhHHHHHHH----------HH---HHHHHhhcc-chhhHHHhhhccccccCC Confidence 1000 00000000 000 000000000 00 000000000 001111223334556678 Q ss_pred cccchhhhHHHHHHHHhhhhHHhhcceeecCCCceEEEEEcCCCccceecccccccccccccceeeEeeeeeEEeeehhh Q lcl|NC_021309. 161 PGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTIT 240 (497) Q Consensus 161 ~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~wv~Eg~~~~~s~~~f~~i~~~~~kla~~~~iS 240 (497) +++||++...||+.++..++|+++|++++++++.++||+.++..+.+.|++|++.+|+++++|+++++.+++++++++|| T Consensus 114 ~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~~~~~k~~~~~~is 193 (385) T protein:vir:19 114 SLIQPMQIPGIIMPGLRRLTIRDLLAQGRTSSNALEYVREEVFTNNADVVAEKALKPESDITFSKQTANVKTIAHWVQAS 193 (385) T ss_pred ceecchhhhHHHHHhhhccchhhhcceecccCcceEEEEEecCCcceeeeccCccccccccceeEEEEeeeeEEEeehhh Confidence 89999999999999999999999999999999999999998877889999999999999999999999999999999999 Q ss_pred HHHHhhHHHHHHHHHHHHHHHHHHHHHhhhhcccCccc-cccccccccccccccccchhhhhhhHHHHHHhhhhhcchhh Q lcl|NC_021309. 241 DEGLRDAPELFNFVQGRLLEGIQRKEEVQLLAGGGYPG-VNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAF 319 (497) Q Consensus 241 ~ell~d~~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~-p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 319 (497) +|+|+|++++++||.++|+++++.++|.+||+|+|+++ |.||++.++..+..... T Consensus 194 ~ell~d~~~l~~~i~~~la~a~~~~~d~~~l~G~g~~~~~~Gi~~~~~~~~~~~~~------------------------ 249 (385) T protein:vir:19 194 RQVMDDAPMLQSYINNRLMYGLALKEEGQLLNGDGTGDNLEGLNKVATAYDTSLNA------------------------ 249 (385) T ss_pred HHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccccccccccccccccc------------------------ Confidence 99999999999999999999999999999999999886 68998876543322111 Q ss_pred hhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHHHHHHHhhhcCceecc Q lcl|NC_021309. 320 VGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGG 399 (497) Q Consensus 320 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~i~~ 399 (497) .....++.+.+++..+...+ ..+++|+||+.+|..|+++||++|||||+ T Consensus 250 ------------------------------~~~~~~d~i~~~~~~l~~~~-~~~~~~~~~~~~~~~l~~lkd~~G~~l~~ 298 (385) T protein:vir:19 250 ------------------------------TGDTRADIIAHAIYQVTESE-FSASGIVLNPRDWHNIALLKDNEGRYIFG 298 (385) T ss_pred ------------------------------cccchHHHHHHHHHhhcccc-CCCCEEEEcHHHHHHHHHhhcCCCceecc Confidence 11122455666666666554 45568999999999999999999999997 Q ss_pred CcccccccccccccccccccceEecCCCCcCceEEEeeccceEEEEeecccEEEeecccchhhhcCceEEEEEEeeccee Q lcl|NC_021309. 400 NFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLV 479 (497) Q Consensus 400 ~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v 479 (497) ++..+ ...+|+|+||++++++|+++++||||++ +|.++++.+++|+++++..++|++|++.||+++|+||.| T Consensus 299 ~~~~~-------~~~~l~G~pV~~~~~~p~~~~~~gd~~~-~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~r~~~~v 370 (385) T protein:vir:19 299 GPQAF-------TSNIMWGLPVVPTKAQAAGTFTVGGFDM-ASQVWDRMDATVEVSREDRDNFVKNMLTILCEERLALAH 370 (385) T ss_pred CcccC-------CCceecceeeEEcCcCCCCcEEEeeccc-EEEEEEecceEEEEeccccchhhcCcEEEEEEEeeccEE Confidence 65432 3468999999999999999999999998 578999999999999998889999999999999999999 Q ss_pred ecccceEEEEeeCCC Q lcl|NC_021309. 480 YRPSAFQLIQLKKGA 494 (497) Q Consensus 480 ~~~~a~~~l~~~~~a 494 (497) ++|+||+++++++++ T Consensus 371 ~~~~a~~~~~~~aa~ 385 (385) T protein:vir:19 371 YRPTAIIKGTFSSGS 385 (385) T ss_pred ecccceEEEEeccCC Confidence 999999999999999 No 11 >protein:vir:97053 Length: 390 # NCBI annotation: putative head protein # Family: family:all:585 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453565;genbank:gi:84662600;genbank:GeneID:5142468 Probab=100.00 E-value=1.2e-66 Score=381.96 Aligned_cols=388 Identities=27% Similarity=0.398 Sum_probs=259.6 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-hHhHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021309. 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQA-EVEAHERAQEMLKSLGGADAAKDGLDNDIPE 79 (497) Q Consensus 1 ~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~-~~~~~e~~~e~~~~~~~~~a~~~~~~~~~~~ 79 (497) |.. +.+++++ ++.++.+.+..+.++...... ..+..++.+++.++++++++++++++.+... T Consensus 1 m~~----------~~~~l~~-------~~~~~~~~~~~~~e~~~~~~~~~~e~~~~~~~~~~e~~~l~~~i~~~e~~~~~ 63 (390) T protein:vir:97 1 MTD----------ITAKLEA-------TLANVTDSLKAFGERAVRDGELNASARSKVDELFATVGNLSAEVQAARQRVAE 63 (390) T ss_pred ChH----------HHHHHHH-------HHHHHHHHHHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 111 1111111 111112222121111111000 0111122233333333444444333332222 Q ss_pred HHHHHHHHHHHHHHhhhhhhHHHHhHhhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhcccccccc Q lcl|NC_021309. 80 VEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTF 159 (497) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (497) .+.... .. ....+........ ........ ........ ...............++++.+ T Consensus 64 ~~~~~~-------~~----~~~~~~~~~~~~~-----~~~~~~~~----~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~ 121 (390) T protein:vir:97 64 LEGNGA-------GG----DVQHVSVGDMFVA-----SEQFQAST----GRWNDRSA--RATMNIKAALNTASTDAAGSA 121 (390) T ss_pred HHhccc-------cc----ccccccchhhhhh-----hHHHHHHH----HHhhhhhh--hhhhHHHHHHHhhhccccccc Confidence 111100 00 0000000000000 00000000 00000000 000011112223444566778 Q ss_pred ccccchhhhHHHHHHHHhhhhHHhhcceeecCCCceEEEEEcCCCccceecccccccccccccceeeEeeeeeEEeeehh Q lcl|NC_021309. 160 APGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTI 239 (497) Q Consensus 160 g~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~wv~Eg~~~~~s~~~f~~i~~~~~kla~~~~i 239 (497) |+++||++..+||+.+++.++|+++|++++++++.++||+.++.++.+.|++||+.+|+++++|++|++.+++++++++| T Consensus 122 g~lip~~~~~~ii~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~i 201 (390) T protein:vir:97 122 GALTTPNRLPGFITPPDARLTVRDLIGSGRTDSALIEYVQETGFVNNAAIVAEGALKPESSLKFAKKTDTTHVIAHTMKA 201 (390) T ss_pred ccccchhhhHHHHHHHhhhhhhHhhcceeeccCCceEEEEEecCCcceeeecCCccccccccceeEEEEeeeeEEEeehh Confidence 88999999999999999999999999999999999999999887788999999999999999999999999999999999 Q ss_pred hHHHHhhHHHHHHHHHHHHHHHHHHHHHhhhhcccCccc-cccccccccccccccccchhhhhhhHHHHHHhhhhhcchh Q lcl|NC_021309. 240 TDEGLRDAPELFNFVQGRLLEGIQRKEEVQLLAGGGYPG-VNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGA 318 (497) Q Consensus 240 S~ell~d~~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~-p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 318 (497) |+|+|+|+++|++||.++|++++++++|.+||+|+|+++ |.||++.++..+..... T Consensus 202 s~ell~ds~~l~~~i~~~la~a~~~~~d~a~l~G~g~~~~p~Gi~~~~~~~~~~~~~----------------------- 258 (390) T protein:vir:97 202 TRQILSDAPQLASYMNNRLIRGLKVKEDAEILRGTGANDGLLGLIPQATTYAAPTTI----------------------- 258 (390) T ss_pred hHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCccccceeeccccccccccc----------------------- Confidence 999999999999999999999999999999999999876 99999876544322111 Q ss_pred hhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHHHHHHHhhhcCceec Q lcl|NC_021309. 319 FVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMG 398 (497) Q Consensus 319 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~i~ 398 (497) ......+.+..++..+...++ .+++|+||+.+|..|+++||++|+||| T Consensus 259 -------------------------------~~~~~~d~~~~~~~~~~~~~~-~~~~~v~n~~~~~~L~~lkd~~G~~l~ 306 (390) T protein:vir:97 259 -------------------------------AGATRVDQLRLAMLQASLAEY-PASGIVINPIDWAAIELAKDANNQYLI 306 (390) T ss_pred -------------------------------cccchHHHHHHHHHhhccccC-CCCEEEEcHHHHHHHHHhhcCCCceee Confidence 011123445555566655554 456899999999999999999999999 Q ss_pred cCcccccccccccccccccccceEecCCCCcCceEEEeeccceEEEEeecccEEEeecccchhhhcCceEEEEEEeecce Q lcl|NC_021309. 399 GNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLL 478 (497) Q Consensus 399 ~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~ 478 (497) +++... ..++|+|+||++++++|+++++||||++ +|.+++|.++++.++++. ++|++|++.||++.|+||. T Consensus 307 ~~~~~~-------~~~~l~G~pV~~~~~~~~~~~~~gd~~~-~~~~~~~~~~~i~~~~~~-~~f~~~~~~~r~~~r~d~~ 377 (390) T protein:vir:97 307 GNARGT-------LTPTLWGLPVVATQAMAPGEFLVGAFDL-AAQIFDQWDARVEIGYVN-DDFQRNMVTVLAEERLALV 377 (390) T ss_pred cCccCC-------CCceecceeeEEcCCCCCCcEEEEeccc-eEEEEEecceEEEEeecc-cccccCcEEEEEEEeeccE Confidence 875322 3468999999999999999999999998 477899999999998754 4699999999999999999 Q ss_pred eecccceEEEEee Q lcl|NC_021309. 479 VYRPSAFQLIQLK 491 (497) Q Consensus 479 v~~~~a~~~l~~~ 491 (497) |++|+||+++++. T Consensus 378 v~~~~a~v~~~~a 390 (390) T protein:vir:97 378 VYRPEALITGSFA 390 (390) T ss_pred EeccccEEEEEeC Confidence 9999999999999 No 12 >protein:vir:485 Length: 407 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543092;swissprot:trembl:q8w627;genbank:gi:18249904;uniprot:Q8W627;genbank:GeneID:929693 Probab=100.00 E-value=2.8e-66 Score=379.98 Aligned_cols=397 Identities=16% Similarity=0.149 Sum_probs=254.6 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021309. 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) Q Consensus 1 ~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~a~~~~~~~~~~~~ 80 (497) |=.+-++.++..++.+.+. ++++...+.. ++.++++.++..+++.++.++.+. T Consensus 1 l~~~k~l~~~i~e~~~~~~-----------~~k~~~~~~~----------------~~~e~~~~~l~~~~e~~~~~~~~~ 53 (407) T protein:vir:48 1 MADVKDVEQVAQELQRKFD-----------DFKEKNDKRI----------------DAIEQEKGKLAGEVETLNGKLAEL 53 (407) T ss_pred CchHHHHHHHHHHHHHHHH-----------HHHHHHHHHH----------------HHHHHHHHHHHHHHHHHHHHHHHH Confidence 2222222222111111111 1111111111 111122222222222222222222 Q ss_pred HHHHHHHHHHHHHhhhhhhHHHHhHhhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccccc Q lcl|NC_021309. 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFA 160 (497) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 160 (497) +..... .....................+ ....+. ...+...... ...........++.+.+| T Consensus 54 e~~~~~-~~~~~~~~~~~~~~~~~~~~~e------------~~~a~~----~~l~~g~~~~-~~~~e~~a~~~~t~~~gG 115 (407) T protein:vir:48 54 ENLKSD-LEAELAEVKRPAGGTQNKVASE------------HKEAFI----GFMRKGREDG-LRELERKALQVGNDEDGG 115 (407) T ss_pred HHHHHH-HHHHHHHhhccccccccchhhH------------HHHHHH----HHHhccchhh-hhHHHHHhhhcccCCCCc Confidence 111000 0000000000000000000000 000000 0000000000 001111223334445566 Q ss_pred cccchhhhHHHHHHHHhhhhHHhhcceeecCCCceEEEEEcCCCccceecccccccccc-cccceeeEeeeeeEEeeehh Q lcl|NC_021309. 161 PGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFS-SEEFARVYEQVGKVANALTI 239 (497) Q Consensus 161 ~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~wv~Eg~~~~~s-~~~f~~i~~~~~kla~~~~i 239 (497) .+||+++..+|++.++..++|+++|++++++++.+.+|+.+++ ..++|++|++.+|++ .++|++|++.++|++++++| T Consensus 116 ~~iP~~~~~~I~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~-~~a~~v~E~~~~~~~~~~~f~~i~~~~~k~~~~~~i 194 (407) T protein:vir:48 116 YAIPEELDRTILTLLKDEVVMRQEATVITLGGSDYKKLVNLGG-TTSGWVGETDARPETATSKLGLIEPFMGEIYGNPQA 194 (407) T ss_pred ccccHhHHHHHHHHHHhhhhhhhhceeeecCCCceEEEEecCC-cceeeecccccccccccccceeEEeeeeeeEeehhh Confidence 6777788999999999999999999999999999999998774 679999999999976 58999999999999999999 Q ss_pred hHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccchhhhhhhHHHHHHhhhhhcchh Q lcl|NC_021309. 240 TDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGA 318 (497) Q Consensus 240 S~ell~d~-~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 318 (497) |+|||+|+ +++++||.++|+++++.++|.+|++|+|+++|.||++..+.............. T Consensus 195 S~ell~ds~~~l~~~i~~~l~~~i~~~~~~a~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~----------------- 257 (407) T protein:vir:48 195 TQKMLDDAFFNVEDWINSELALEFAEQEEIAFTSGDGSKKPKGFLAYESTDEDDKTRAFGKLQ----------------- 257 (407) T ss_pred HHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhccCCCCccceeeeccccccccccccccccc----------------- Confidence 99999997 589999999999999999999999999999999999876644332221110000 Q ss_pred hhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHHHHHHHhhhcCceec Q lcl|NC_021309. 319 FVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMG 398 (497) Q Consensus 319 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~i~ 398 (497) ...........++++.+.+..+...|+. +.+|+||+.+|..|+++||++||||| T Consensus 258 -------------------------~~~~~~~~~~~~d~i~~l~~~l~~~~~~-~a~~v~n~~~~~~L~~lkD~~Gr~l~ 311 (407) T protein:vir:48 258 -------------------------HIASGAASGVTADAIIKLIYTLRKAHRS-GAKFMMNNSSLFAIRLLKDNDGNYLW 311 (407) T ss_pred -------------------------ccccccccccChHHHHHHHHhhchhhhc-CCEEEEcHHHHHHHHHhhccCCceee Confidence 0001111122356777778888777654 45799999999999999999999999 Q ss_pred cCcccccccccccccccccccceEecCCCCcC-----ceEEEeeccceEEEEeecccEEEeecccchhhhcCceEEEEEE Q lcl|NC_021309. 399 GNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLG-----TILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEE 473 (497) Q Consensus 399 ~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~-----~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~ 473 (497) +++.... .+.+|||+||++++.||.. .++||||++ +|.+++|.+++|..++ +|.+|++.||++. T Consensus 312 ~~~~~~g------~~~~l~G~PV~~~~~~p~~~~~~~~i~~Gd~~~-~~~i~~~~~~~i~~d~----~~~~~~~~~~~~~ 380 (407) T protein:vir:48 312 RPGIELG------QPSSLAGYGIVENEQMPDIAADAKAIAFGNFKR-GYTIVDRIGTRILRDP----YTNKPFVGFYTTK 380 (407) T ss_pred ccCcCCC------CCceecceeeEEecCcCCccCCccEEEEEeccc-cEEEEEeeceEEEeec----cccCCcEEEEEEE Confidence 8865432 3458999999999999952 268899998 5889999999998764 3678999999999 Q ss_pred eecceeecccceEEEEeeCCCCCC Q lcl|NC_021309. 474 RLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 474 r~~~~v~~~~a~~~l~~~~~a~~~ 497 (497) |+|++|++|+||++|+++++++.+ T Consensus 381 r~d~~v~~~~a~~~l~~~aa~~~~ 404 (407) T protein:vir:48 381 RTGGMLVDSQAIKLMKIGAATRQK 404 (407) T ss_pred EeccEEecccceEEEEeeccCCCC Confidence 999999999999999999999988 No 13 >protein:vir:1328 Length: 392 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047927;swissprot:trembl:q9zwv6;genbank:gi:9631145;uniprot:Q9ZWV6;genbank:GeneID:2715889 Probab=100.00 E-value=2.3e-66 Score=380.39 Aligned_cols=387 Identities=14% Similarity=0.097 Sum_probs=252.3 Q ss_pred CchHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021309. 1 MPSTA--QLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIP 78 (497) Q Consensus 1 ~~~~a--~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~a~~~~~~~~~~ 78 (497) ||.+- ++.++..++.+++ ++..++ . +..+..++..+++++++.+++.++.+++ T Consensus 1 m~~~~l~~l~e~r~~~~~e~--------------~~l~~~----~-------~~~~~~~e~~~~~~~l~~e~~~l~~~i~ 55 (392) T protein:vir:13 1 MDATTLSANFEARERATAEL--------------RSLTDE----F-------AGKEMTAEAREKEERLLTAVADFDGRIK 55 (392) T ss_pred CCHHHHHHHHHHHHHHHHHH--------------HHHHHH----h-------hcccccHHHHHHHHHHHHHHHHHHHHHH Confidence 66542 1222222222222 211111 1 1111112222333344444444444332 Q ss_pred HHHHHHHHHHHHHHHhhhhhhHHHHhHhhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccc Q lcl|NC_021309. 79 EVEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGT 158 (497) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (497) +......... ............. . ...... ........ +.......+..........++++. T Consensus 56 ~~~e~~~~~~-~~~~~~~~~~~~~---~--~~~~~~----~~~~~~~~--------r~g~~~~~~~~~~~~~~~~~t~~~ 117 (392) T protein:vir:13 56 RGIDAIKATD-AVTSLLSGLQGSG---S--GAQRSA----DHDDDAVL--------RAGNLGEARSFEFAPEKRDGTKAG 117 (392) T ss_pred HHHHHHHHHH-HHHHHhcccCCcc---c--chhhhh----hHHHHHHH--------hccchhhhHHHHhhhhhhcccccC Confidence 2111000000 0000000000000 0 000000 00000000 000000111111112223344555 Q ss_pred cccccchhhhHHHHHHHHhh-hhHHhhcceeecCCC-ceEEEEEcCCCccceecccccccccccccceeeEeeeeeEEee Q lcl|NC_021309. 159 FAPGILPTFLPGIVEQLFYE-LSLADLISSRPVTSP-NLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANA 236 (497) Q Consensus 159 ~g~~v~p~~~~~ii~~~~~~-~~l~~~~~~~~~~~~-~~~~p~~~~~~~~a~wv~Eg~~~~~s~~~f~~i~~~~~kla~~ 236 (497) +|+++||++..++|..+... +.++.++++++++++ .+.+|+.++ .+.++||+|++.+|+++++|++|+++++|++++ T Consensus 118 ~g~~~~~~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~-~~~a~~v~E~~~~~~~~~~f~~v~~~~~k~~~~ 196 (392) T protein:vir:13 118 NPNVLSRTLYGQLIAQAVERSAIMRGGASTFTTSDANPMDFTVITG-RATAGIVGETAEIPESYPATTQRSMGGFKYGFA 196 (392) T ss_pred CCccccccchHHHHHHHHhhhhhhhhcceeeecCCCceeEEEEEcC-CcceeeecccccccccccceeeEEeeeeeEEee Confidence 67788998888877766555 467888999988654 588999887 578999999999999999999999999999999 Q ss_pred ehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccchhhhhhhHHHHHHhhhhhc Q lcl|NC_021309. 237 LTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGT 315 (497) Q Consensus 237 ~~iS~ell~d~-~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 315 (497) ++||+|+|+|+ ++|++||.++|+++++.++|.+||+|+|+++|.||++..+..+....... T Consensus 197 ~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~l~G~Gt~~p~Gil~~~~~~~~~~~~~~------------------ 258 (392) T protein:vir:13 197 SVVSYEFATDQVLDLVGFLVSDAGPAIGDAMGRHFLTGTGTGQPRGILTDATGANAAFGEAD------------------ 258 (392) T ss_pred ehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhcccCCccccccccccccccccccccc------------------ Confidence 99999999987 58999999999999999999999999999999999987654433221110 Q ss_pred chhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHHHHHHHhhhcCc Q lcl|NC_021309. 316 NGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQ 395 (497) Q Consensus 316 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~ 395 (497) .....++.+.+.+..+...++. +.+|+||+.+|..|+++||++|+ T Consensus 259 ----------------------------------~~~~~~d~l~~~~~~l~~~~~~-~a~~v~n~~~~~~l~~lkd~~G~ 303 (392) T protein:vir:13 259 ----------------------------------ADSKVSDALIDLFHEVPSAYRK-NAKFVVNDLRAAQMRKLKDANGQ 303 (392) T ss_pred ----------------------------------cccccHHHHHHHHHhhhhhhhc-CCEEEEcHHHHHHHHHhhccCCc Confidence 0011234555666666655544 55799999999999999999999 Q ss_pred eeccCcccccccccccccccccccceEecCCCCcCceEEEeeccceEEEEeecccEEEeecccchhhhcCceEEEEEEee Q lcl|NC_021309. 396 YMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERL 475 (497) Q Consensus 396 ~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~ 475 (497) |||+++.... .+.+|+|+||++++.+|+++++||||++ |.++++.+++++++.+. +|.+|++.||++.|+ T Consensus 304 ~l~~~~~~~g------~~~~l~G~Pv~~~~~~~~~~i~~Gdf~~--~~i~~~~~~~i~~~~~~--~~~~~~~~~r~~~r~ 373 (392) T protein:vir:13 304 YLWQSALTVG------APDTFNGKVVETDDGMPADKVLFADLSK--YRVRFAGSLRVDRSVDA--KFSTDQIVYRFLQRA 373 (392) T ss_pred eeecCCcCCC------CCceecceeeEEcCCCCCCcEEEeeccc--eeEEeecceEEEeeccc--cccCCcEEEEEEEEe Confidence 9998765432 3458999999999999999999999997 77899999999987654 599999999999999 Q ss_pred cceeecccceEEEEeeCCC Q lcl|NC_021309. 476 GLLVYRPSAFQLIQLKKGA 494 (497) Q Consensus 476 ~~~v~~~~a~~~l~~~~~a 494 (497) |++|++|+||+.++++++| T Consensus 374 d~~~~~~~A~~~~~~~~aa 392 (392) T protein:vir:13 374 DGLLVDARGAKVLTVTPAA 392 (392) T ss_pred ccEEecccceEEEEeeccC Confidence 9999999999999999999 No 14 >protein:vir:101607 Length: 379 # NCBI annotation: major capsid protein precursor # Family: family:all:585 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112497;genbank:gi:53793597;uniprot:Q5ZGF6;genbank:GeneID:3101715 Probab=100.00 E-value=1.9e-65 Score=375.45 Aligned_cols=378 Identities=13% Similarity=0.168 Sum_probs=253.1 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021309. 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) Q Consensus 1 ~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~a~~~~~~~~~~~~ 80 (497) |. ..+++++++++..+++...... .++.+......+. +...+..++.++++++.+.++.++.+. T Consensus 1 m~-~~e~~~~~~~~~~~l~~~~~~~--------------~~e~~~~~e~~~~-~~~~~~~~~~~e~~~~~~~l~~~~~~~ 64 (379) T protein:vir:10 1 ME-ALEIKVALEAIKGQVDSKSSAQ--------------ALEVKGLIEALEA-KMTSEKDLAVNELKSDMAALQAHADKL 64 (379) T ss_pred CC-HHHHHHHHHHHHHHHHHHHHHH--------------HHHHHHHHHHHHh-HhhHHHHHHHHHHHHHHHHHHHHHHHH Confidence 54 4444444444444443333222 1221111111110 011111223334444444443333322 Q ss_pred HHHHHHHHHHHHHhhhhhhHHHHhHhhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccccc Q lcl|NC_021309. 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFA 160 (497) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 160 (497) +...... ........ .......... ....+.+.. ...........++++.++ T Consensus 65 e~~~~~~--------~~~~~~~~-----~~~~~~~~~~----------~~~~~~~~~-----~~~~~~~~~~~~~~~~~~ 116 (379) T protein:vir:10 65 DVKLKEK--------AKSEDKSD-----SLVKSITENF----------NDIKEVRNG-----KSIQVKAVGDMTLPVNLT 116 (379) T ss_pred HHHHHhc--------ccccccch-----hHHHHHHHHH----------HhHHHHHhh-----hhhhhhhhcccccCCCCc Confidence 2111000 00000000 0000000000 000000000 000011112223344455 Q ss_pred cccchhhhHHHHHHHHhhhhHHhhcceeecCCCceEEEEEcCCC-ccceecccccccccccccceeeEeeeeeEEeeehh Q lcl|NC_021309. 161 PGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAH-NNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTI 239 (497) Q Consensus 161 ~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~-~~a~wv~Eg~~~~~s~~~f~~i~~~~~kla~~~~i 239 (497) +.+|+++..+|++.++..++|+++|+++++++++++||+.++.+ +.+.|++||+.+|+++++|++|+++++|++++++| T Consensus 117 ~~ip~~~~~~ii~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~~~i 196 (379) T protein:vir:10 117 GAQPKDYNFDVVLNPSQMLNVSDIVGAVSISGGTYTFVRENGAGEGAIGAQVEGATKGQKDYDISMIDVNTDFIAGFTRY 196 (379) T ss_pred cccchhhhhHHHHhHHhhhhHHhhceeeeccCCceEEEEeecCCCcccccccCCccccccccceeeeEeeeeeEEeeehh Confidence 67888899999999999999999999999999999999998643 45689999999999999999999999999999999 Q ss_pred hHHHHhhHHHHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccchhhhhhhHHHHHHhhhhhcchhh Q lcl|NC_021309. 240 TDEGLRDAPELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAF 319 (497) Q Consensus 240 S~ell~d~~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 319 (497) |+|||+|+++|++||.++|+++++.++|.+|+.|.|++.+.+.....+ T Consensus 197 S~ell~D~~~l~~~i~~~la~~~~~~~~~~~~~g~~~~~~~~~~~~~~-------------------------------- 244 (379) T protein:vir:10 197 SKKMANNLPFLTSFIPNALRRDYAKAENAAFNAVLAANATASTEIITN-------------------------------- 244 (379) T ss_pred hHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccC-------------------------------- Confidence 999999999999999999999999999999999988754333222110 Q ss_pred hhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHHHHHHHhhhcCceecc Q lcl|NC_021309. 320 VGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGG 399 (497) Q Consensus 320 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~i~~ 399 (497) ...++++..++..+..++ .++++|+||+.+|..|+++||++|+|+|+ T Consensus 245 --------------------------------~~~~d~i~~~~~~~~~~~-~~~~~~vmn~~~~~~l~~lkd~~G~~l~~ 291 (379) T protein:vir:10 245 --------------------------------KNKVEMLINEIAKQENLD-FPVTAIVLRPTDYYDILVTQKSVGAGYGL 291 (379) T ss_pred --------------------------------cccHHHHHHHHHhhhhcc-CCCCEEEEcHHHHHHHHHhhccCCceecc Confidence 001234555555555554 45668999999999999999999999998 Q ss_pred CcccccccccccccccccccceEecCCCCcCceEEEeeccceEEEEeecccEEEeecccchhhhcCceEEEEEEeeccee Q lcl|NC_021309. 400 NFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLV 479 (497) Q Consensus 400 ~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v 479 (497) ++.....+ .+++|||+||++++.||+++++||||++ +.+.+|++++|+++++..++|++|+|.||+++|+|+.| T Consensus 292 ~~~~~~~~----~~~~l~G~pvv~s~~~~ag~~~~gdf~~--~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~R~~~~v 365 (379) T protein:vir:10 292 PGVVTQDN----GVLRINGIPLFRATWLAANKYYVGDWTR--VTKVTTEGLSLEFSEVEGTNFVKNNITARIEAQVALAV 365 (379) T ss_pred CCccCCCC----CcceecceeeEecCCCCCCceEEeeccc--EEEEEEeceEEEEeecccccccCCcEEEEEEEEeccEE Confidence 76544332 3468999999999999999999999998 44667899999999988889999999999999999999 Q ss_pred ecccceEEEEeeCC Q lcl|NC_021309. 480 YRPSAFQLIQLKKG 493 (497) Q Consensus 480 ~~~~a~~~l~~~~~ 493 (497) +||+||+++++++. T Consensus 366 ~~p~a~v~~~~~~~ 379 (379) T protein:vir:10 366 EQPAALIFGDFTAV 379 (379) T ss_pred ecCccEEEEEecCC Confidence 99999999999999 No 15 >protein:vir:4456 Length: 401 # NCBI annotation: Major capsid protein precursor # Family: family:all:21 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700379;genbank:gi:23505451;genbank:GeneID:955658 Probab=100.00 E-value=2.1e-65 Score=375.14 Aligned_cols=393 Identities=16% Similarity=0.142 Sum_probs=253.4 Q ss_pred CchH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021309. 1 MPST-AQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPE 79 (497) Q Consensus 1 ~~~~-a~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~a~~~~~~~~~~~ 79 (497) |.-. -++++...++.+..+++++ ..++..++++... .++..+++.+++++++++..+.+ T Consensus 1 m~~~lk~l~~~~~el~~~~~~~k~-----------~~~~~~~~~e~~~---------~~l~~~~~~l~~~~~~~~~~~~~ 60 (401) T protein:vir:44 1 MAVDIKDVEQVAQELQQKFDDFKA-----------KNDKRVEAIEQEK---------GKLAGQVETLNGKLSELENLKSD 60 (401) T ss_pred CCccHHHHHHHHHHHHHHHHHHHH-----------HHHHHHHHHHHHH---------HHHHHHHHHHHHHHHHHHHHHHH Confidence 3322 2222222222222222221 1111111111111 11122222233333333222222 Q ss_pred HHHHHHHHHHHHHHhhhhhhHHHHhHhhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhcccccccc Q lcl|NC_021309. 80 VEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTF 159 (497) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (497) .+.......+......... . .+....+... .+........ .........++.+.+ T Consensus 61 ~~~~~~~~~~~~~~~~~~~-----~---~e~~~a~~~~----------------lr~~~~~~~~-~~e~~a~~~~~~~~G 115 (401) T protein:vir:44 61 LEKELLELKRPARGAQNKV-----A---AEHKDAFVGF----------------LRKGREDGLR-DLERKALQVGTDEDG 115 (401) T ss_pred HHHHHHHhhccccccccch-----h---HHHHHHHHHH----------------HhhhhhhhhH-HHHHHHhhcCCCCCC Confidence 2111100000000000000 0 0000000000 0000000000 011122333444556 Q ss_pred ccccchhhhHHHHHHHHhhhhHHhhcceeecCCCceEEEEEcCCCccceecccccccccc-cccceeeEeeeeeEEeeeh Q lcl|NC_021309. 160 APGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFS-SEEFARVYEQVGKVANALT 238 (497) Q Consensus 160 g~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~wv~Eg~~~~~s-~~~f~~i~~~~~kla~~~~ 238 (497) |.+||+++..+|++.++..++|+++|++++++++.+.+|+.+++ ..+.|++|++.+|.+ .++|++|++.+||++++++ T Consensus 116 G~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~-~~a~wv~E~~~~~~~~~~~~~~v~~~~~k~~~~~~ 194 (401) T protein:vir:44 116 GYAVPEELDRSILSLLKDEVVMRQEATVITVGGSDYKKLVNLGG-TASGWVGETDTRSQTATSRLGLIEPFMGEIYGNPQ 194 (401) T ss_pred ceeccHhHHHHHHHHHHhhhhhhhhceeeecCCCceEEEEecCC-ccceeeccccccCccccccceeeeeehhheeeehh Confidence 66777788999999999999999999999999999999998774 578999999999875 5899999999999999999 Q ss_pred hhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccchhhhhhhHHHHHHhhhhhcch Q lcl|NC_021309. 239 ITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNG 317 (497) Q Consensus 239 iS~ell~d~-~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 317 (497) ||+|+|+|+ ++|++||.++|+++++.++|.+||+|+|+++|.||++..+..+............. T Consensus 195 iS~ell~ds~~~l~~~i~~~la~ai~~~~~~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~-------------- 260 (401) T protein:vir:44 195 ATQKMLDDAFFNVEAWINSELATEFAEQEEIAFTTGDGTKKPKGFLAYESTEESDKARAFGKLQHI-------------- 260 (401) T ss_pred hhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhccCCCCccceeecccccccccccccccccccc-------------- Confidence 999999997 58999999999999999999999999999999999988776544332221110000 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHHHHHHHhhhcCcee Q lcl|NC_021309. 318 AFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYM 397 (497) Q Consensus 318 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~i 397 (497) .........++++.+++..+...++. +.+|+||+.+|..|+++||++|||| T Consensus 261 ----------------------------~t~~~~~~~~d~i~~~~~~l~~~~~~-~a~~v~n~~~~~~L~~lkd~~G~~l 311 (401) T protein:vir:44 261 ----------------------------VSGEATAVTADAIIKLIYTLRKAHRT-GAKFMMNNNSLFAIRLLKDTEGNYL 311 (401) T ss_pred ----------------------------ccccccccCHHHHHHHHHhcchhhhc-CCEEEEcHHHHHHHHHhhccCCcee Confidence 00011112246677777777766654 4579999999999999999999999 Q ss_pred ccCcccccccccccccccccccceEecCCCCcCc-----eEEEeeccceEEEEeecccEEEeecccchhhhcCceEEEEE Q lcl|NC_021309. 398 GGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGT-----ILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAE 472 (497) Q Consensus 398 ~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~-----~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~ 472 (497) |+++...+ .+.+|+|+||++++.||... ++||||++ +|.+++|.++++.+++ +|.+|++.||++ T Consensus 312 ~~~~~~~g------~~~~l~G~PVv~~~~~p~~~~~~~~i~~Gd~~~-~~~i~~~~~~~~~~~~----~~~~~~v~~~a~ 380 (401) T protein:vir:44 312 WRPGLELG------QPSSLAGYGIAENEQMPDIAADAKAIAFGNFKR-GYTIVDRIGTRILRDP----YTNKPFVGFYTT 380 (401) T ss_pred ecCCcCCC------CCceecceeeEEecCcCCccCCccEEEEeehhc-cEEEEEecceEEeeec----cccCCcEEEEEE Confidence 98765432 34589999999999998522 68899998 5789999999988654 377999999999 Q ss_pred EeecceeecccceEEEEeeCC Q lcl|NC_021309. 473 ERLGLLVYRPSAFQLIQLKKG 493 (497) Q Consensus 473 ~r~~~~v~~~~a~~~l~~~~~ 493 (497) .|+|+.|++|+||++|+++++ T Consensus 381 ~r~d~~~~~~~a~~~l~~~aa 401 (401) T protein:vir:44 381 KRTGGMLVDSQAIKLLKIAAA 401 (401) T ss_pred EEeccEEecccceEEEEeecC Confidence 999999999999999999999 No 16 >protein:vir:6242 Length: 390 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813696;swissprot:trembl:q859c1;genbank:gi:29366756;interpro:IPR006444;uniprot:Q859C1;genbank:GeneID:1258897 Probab=100.00 E-value=2.4e-65 Score=374.84 Aligned_cols=385 Identities=16% Similarity=0.132 Sum_probs=251.3 Q ss_pred CchHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021309. 1 MPSTAQL--EAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIP 78 (497) Q Consensus 1 ~~~~a~~--~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~a~~~~~~~~~~ 78 (497) ||.+.-. .++...+.++++.+. + .....+..+|..+++++++++++.++.+++ T Consensus 1 m~~~~l~~l~e~r~~~~~e~~~L~--------------~-----------~~~~~~lt~e~~~~~~~l~~e~~~l~~~i~ 55 (390) T protein:vir:62 1 MDATTLSANFEARERATAELRTLT--------------D-----------EFAGKEMTDEAREKEERLITAVSDYDARIK 55 (390) T ss_pred CChhHHHHHHHHHHHHHHHHHHHH--------------H-----------HhhcccccHHHHHHHHHHHHHHHHHHHHHH Confidence 6654322 111111111111111 1 111111222333445555555555555443 Q ss_pred HHHHHHHHHHHHHHHhhhhhhHHHHhHhhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccc Q lcl|NC_021309. 79 EVEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGT 158 (497) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (497) +....... .. .. ....... .......... ........ .+.......+..........++++. T Consensus 56 ~~~~~~~~-~~-~~--~~~~~~~----~~~~~~~~~~--~~~~~~~~--------~r~~~~~~~r~~~~~~~~~~~t~~~ 117 (390) T protein:vir:62 56 RGIEAIKA-ID-PV--TSLLSGL----QGSGSGAQRS--ADVDDDAT--------LRAGNLGEARSFEFAPEKRDGTKAG 117 (390) T ss_pred HHHHHHHH-HH-HH--HHHHhhc----ccccccchhh--cchHHHHH--------HhhhhhhhhHHHHhhhhhhcccccC Confidence 22111100 00 00 0000000 0000000000 00000000 0000000111111111222345556 Q ss_pred cccccchhhhHHHHHH-HHhhhhHHhhcceeecCCC-ceEEEEEcCCCccceecccccccccccccceeeEeeeeeEEee Q lcl|NC_021309. 159 FAPGILPTFLPGIVEQ-LFYELSLADLISSRPVTSP-NLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANA 236 (497) Q Consensus 159 ~g~~v~p~~~~~ii~~-~~~~~~l~~~~~~~~~~~~-~~~~p~~~~~~~~a~wv~Eg~~~~~s~~~f~~i~~~~~kla~~ 236 (497) +|+++||++..++|.. ++..+.++++|++++++++ .+.+|+.++ .+.+.|++|++.+|+++++|++|++++||++++ T Consensus 118 ~g~~~~~~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~p~~~~-~~~a~wv~E~~~~~~~~~~f~~i~~~~~k~~~~ 196 (390) T protein:vir:62 118 NPNVLSRTLYGQLIAQAVERSAIMRGGATTFTTSDANPLDFTVITG-RSSASIVGETAEIPESYPATAQRSMGGFKYGFA 196 (390) T ss_pred CCccccccchHHHHHHHHhhhhhhhhcceeeecCCCceeEEEEEcC-CcceeeecccccccccccceeeeEeeeeeEEee Confidence 6788888888776655 4555678889999999764 589999887 467999999999999999999999999999999 Q ss_pred ehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccchhhhhhhHHHHHHhhhhhc Q lcl|NC_021309. 237 LTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGT 315 (497) Q Consensus 237 ~~iS~ell~d~-~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 315 (497) ++||+|+|+|+ +++++||+++|+++++.++|.+|++|+| +|.||++............. T Consensus 197 ~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~l~G~G--~p~Gi~~~~~~~~~~~~~~~------------------ 256 (390) T protein:vir:62 197 SVVSYEFATDQVLDLVGFLVSDAGPAIGDAMGRHFITGTG--QPRGILTDASPATATFLATD------------------ 256 (390) T ss_pred hHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhhhhccCC--ccccccccccccccceeccc------------------ Confidence 99999999998 5899999999999999999999999987 58999987654433221110 Q ss_pred chhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHHHHHHHhhhcCc Q lcl|NC_021309. 316 NGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQ 395 (497) Q Consensus 316 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~ 395 (497) .....++++...+..+...|.. ..+|+||+.+|..|++|||++|| T Consensus 257 ----------------------------------~~~~~~~~l~~~~~~l~~~~~~-~a~~vmn~~~~~~L~~lkd~~g~ 301 (390) T protein:vir:62 257 ----------------------------------TDSKVSDALIDLFHEVPSAYRA-NAKYVVNDLRAAQMRKLKDANGQ 301 (390) T ss_pred ----------------------------------ccccchHHHHHHHHhhhhhhhc-CCEEEEchHHHHHHHHhhccCCC Confidence 0111234455555566655543 44799999999999999999999 Q ss_pred eeccCcccccccccccccccccccceEecCCCCcCceEEEeeccceEEEEeecccEEEeecccchhhhcCceEEEEEEee Q lcl|NC_021309. 396 YMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERL 475 (497) Q Consensus 396 ~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~ 475 (497) |||++..... .+.+|+|+||++++.+|++.++||||++ |.++++.++++.++.+. +|.+|++.||++.|+ T Consensus 302 ~l~~~~~~~g------~~~~l~G~Pv~~~~~~p~~~i~~gd~s~--~~i~~~~~~~v~~~~~~--~~~~~~~~~~~~~r~ 371 (390) T protein:vir:62 302 YLWQSGLTVG------APSLFNGKVVETDDGMPADKILFADLSK--YRVRFAGSLRVDRSVDA--KFSTDQIVYRFLQRA 371 (390) T ss_pred eeecCCcCCC------ccceecccceEEecCCCCccEEEeeccc--eeEEeecceEEEeeccc--cccCCcEEEEEEEEe Confidence 9998865432 3458999999999999999999999997 67899999999998754 599999999999999 Q ss_pred cceeecccceEEEEeeCCC Q lcl|NC_021309. 476 GLLVYRPSAFQLIQLKKGA 494 (497) Q Consensus 476 ~~~v~~~~a~~~l~~~~~a 494 (497) |++|++|+||+.|+++++| T Consensus 372 d~~~~~~~A~~~l~~~~~a 390 (390) T protein:vir:62 372 DGLLVDARGAKVLTVTPGA 390 (390) T ss_pred CcEeechhheEEEEeecCC Confidence 9999999999999999999 No 17 >protein:vir:95376 Length: 425 # NCBI annotation: phage major capsid protein # Family: family:all:635 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764476;genbank:gi:115334630;genbank:GeneID:5179263 Probab=100.00 E-value=5.5e-64 Score=367.41 Aligned_cols=414 Identities=14% Similarity=0.173 Sum_probs=251.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021309. 5 AQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEVEVRN 84 (497) Q Consensus 5 a~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~a~~~~~~~~~~~~~~~~ 84 (497) +.|+.. .+.++++...+...+.+. .++.+.+...+++....+....++...+..+++.++++.+.++......+... T Consensus 1 ~~~~~~--~~~~el~~~~~~l~el~~-~~~el~~~~~el~~~~e~ak~eee~~~l~~ei~~le~e~~~l~~~~~~le~~~ 77 (425) T protein:vir:95 1 MALRQL--MLTKKIEQRKAALDELVK-REQELQAKAAELEQAIEEAQTEEEVSAVEEEVAKLEDERNELNEKKSKLEGEI 77 (425) T ss_pred CchHHH--HHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 444432 244444444443322111 11111111111111111111111112222233333333333333333222111 Q ss_pred HHHHHHHHHhhhh--hhHHHHhHhhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccccccc Q lcl|NC_021309. 85 LKQIRKHLARAVI--MNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPG 162 (497) Q Consensus 85 ~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ 162 (497) . +.......... .................. .............. ...... ........ ....+++.+|.+ T Consensus 78 ~-~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~--~~~~~~-~~~~~~~~-~~~~~~~~gg~~ 149 (425) T protein:vir:95 78 A-QLEDELEQINSKQPSNQSRQKMQGSKGDVVE---MNRLQVREMLKTGE--YYKRSE-VVEFYEKF-RNLRAVAGGELT 149 (425) T ss_pred H-HHHHHHHHhhhhccchhhhhhhhhhhhhHHH---HHHHHHHHHHhhhh--hhhhhH-HHHHHHHH-HhhcccccCcee Confidence 0 00001000000 000000000000000000 00000000000000 000000 00000111 111233445556 Q ss_pred cchhhhHHHHHHHHhhhhHHhhcceeecCCCceEEEEEcCCCccceeccccccccccc-ccceeeEeeeeeEEeeehhhH Q lcl|NC_021309. 163 ILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSS-EEFARVYEQVGKVANALTITD 241 (497) Q Consensus 163 v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~wv~Eg~~~~~s~-~~f~~i~~~~~kla~~~~iS~ 241 (497) ||+++...|++.++..++|+++|+++++++ ...+|+.++ .+.++|++|++.+|+++ ++|++|++++++++++++||+ T Consensus 150 vP~~~~~~Ii~~l~~~~~i~~~~~~~~~~g-~~~ip~~~~-~~~a~~v~E~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ 227 (425) T protein:vir:95 150 IPEVVVNRIMDIMGDYTTLYPLVDKIRVKG-TTRILVDTD-TSPATWIEQSGALPTGDVGTIASIDFDGFKVGKVTFVDN 227 (425) T ss_pred ccHHHHHHHHHHHHhhhhHHHhhceeecCc-eeEEEEecC-CccccccccccccccccccccceeeeeheeeeeeehhhH Confidence 666788889999999999999999999875 479999876 57899999999999887 799999999999999999999 Q ss_pred HHHhhHH-HHHHHHHHHHHHHHHHHHHhhhhcccCcc--ccccccccccccccccccchhhhhhhHHHHHHhhhhhcchh Q lcl|NC_021309. 242 EGLRDAP-ELFNFVQGRLLEGIQRKEEVQLLAGGGYP--GVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGA 318 (497) Q Consensus 242 ell~d~~-~l~~~i~~~la~~~~~~~d~~~l~G~G~~--~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 318 (497) |||+|++ +|++||.++|+++++.++|.+||+|+|++ +|.||++..+........ T Consensus 228 ell~ds~~~l~~~i~~~l~~~i~~~~d~~il~G~G~~~~~p~Gil~~~~~~~~~~~~----------------------- 284 (425) T protein:vir:95 228 YLLQDSIINLDDYVTKKIARAIAKALDLAIVKGTGAANKQPLGIIPSLPPENQVTVE----------------------- 284 (425) T ss_pred HHHhccHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccceeecccccccccccc----------------------- Confidence 9999984 89999999999999999999999999965 799999764432211100 Q ss_pred hhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccC-CceEEechhHH----HHHHHHhhhc Q lcl|NC_021309. 319 FVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQT-PNAVVMNPRDW----ELLRLTKDAN 393 (497) Q Consensus 319 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~n~~~~----~~l~~lkd~~ 393 (497) .....++++...+..+...+... ..+|+||+.+| ..++++||++ T Consensus 285 -------------------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~l~~l~~~kd~~ 333 (425) T protein:vir:95 285 -------------------------------ADNNLLKNLVKQIGLIDTGDDSVGEIVAVMKRSTYYNRLVEFSIQVDSN 333 (425) T ss_pred -------------------------------cccchHHHHHHHHHhhhhhccccCceEEEEeChHHHHHHHHHHhhcCCC Confidence 00111234444444444444433 44699999885 3567889999 Q ss_pred CceeccCcccccccccccccccccccceEecCCCCcCceEEEeeccceEEEEeecccEEEeecccchhhhcCceEEEEEE Q lcl|NC_021309. 394 GQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEE 473 (497) Q Consensus 394 G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~ 473 (497) |||||+++.. ..++|||+|||+++.+|.++++||||++ |.+++|.+++|.++++. +|.+|++.||++. T Consensus 334 g~~i~~~~~~--------~~~~l~G~pvv~~~~~~~~~i~~Gd~~~--~~~~~~~~~~i~~~~~~--~f~~~~~~~~~~~ 401 (425) T protein:vir:95 334 GNVVGKLPNL--------RTPDLLGLRVVFNNFLDDDTVLFGEFEQ--YTLVERENITIDSSTHV--KFTEDQTAFRGKG 401 (425) T ss_pred CceeeccCCC--------CCccccceeeEEcCcCCCccEEEEeccc--EEEEeecceEEEeeccc--ccccCceEEEEEE Confidence 9999985422 3468999999999999999999999997 67889999999999865 5999999999999 Q ss_pred eecceeecccceEEEEeeCCCCCC Q lcl|NC_021309. 474 RLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 474 r~~~~v~~~~a~~~l~~~~~a~~~ 497 (497) |+||++++|+||++++++++..|- T Consensus 402 r~d~~~~~~~a~~~~~i~~~~~g~ 425 (425) T protein:vir:95 402 RFDGKPVKPEAFVLVTITDPVQGA 425 (425) T ss_pred eeCcEeecccceEEEEecCcCCCC Confidence 999999999999999999999999 No 18 >protein:vir:94673 Length: 419 # NCBI annotation: major capsid protein # Family: family:all:585 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579208;genbank:gi:93007444;genbank:GeneID:5076792 Probab=100.00 E-value=5.7e-64 Score=367.32 Aligned_cols=410 Identities=24% Similarity=0.317 Sum_probs=267.7 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021309. 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) Q Consensus 1 ~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~a~~~~~~~~~~~~ 80 (497) ||.+.++++...++.+......+ ...+..++.+.+..+.++++. + .+++..++..+++..+..+...... T Consensus 1 m~~~~~lee~~a~l~~~~~~~~~-~~~~~~~~~~e~~~~~~~~~~---~------~~~~~~~~~~~~~~~~~~~~~~~~~ 70 (419) T protein:vir:94 1 MPPTPTLEEQRAALLARLDDTSL-TTEQVQEIVAEARGLADALQA---E------SDRAAARAALLRTAPPAPKGPADGG 70 (419) T ss_pred CCHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHH---H------HHHHHHHHHHHHHHHHHHHHHhhhh Confidence 99999988766555443332222 111111222222222222211 1 1111111111111111111111000 Q ss_pred HHHHHHHHHHHHHhhhhhhHHHHhHhhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccccc Q lcl|NC_021309. 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFA 160 (497) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 160 (497) ... ........+. .................... ......................++...++ T Consensus 71 ~~~-----------~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 132 (419) T protein:vir:94 71 TPL-----------TPAEAGTFRS-----LAQRFADSDGLREYRARDKR--GQFQVEMRDIDPNRLLSRDAPAGTITNPN 132 (419) T ss_pred ccc-----------cccccccccc-----hhhhhhhHHHHHHHHHhhhh--hhhhHHHHHHHHHHhhccccccccccCCc Confidence 000 0000000000 00000000000000000000 00000000000111111222334455677 Q ss_pred cccchhhhHHHH-HHHHhhhhHHhhcceeecCCCceEEEEEcCCC-------ccceecccccccccccccceeeEeeeee Q lcl|NC_021309. 161 PGILPTFLPGIV-EQLFYELSLADLISSRPVTSPNLSYLTESAAH-------NNAAAVAEAGTYPFSSEEFARVYEQVGK 232 (497) Q Consensus 161 ~~v~p~~~~~ii-~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~-------~~a~wv~Eg~~~~~s~~~f~~i~~~~~k 232 (497) ..++|+...+++ ..+...+.|+++|++++++++.+.||++++.+ +.++|++||+.+|+++++|++|++.+++ T Consensus 133 ~~~~p~~~~~~i~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k 212 (419) T protein:vir:94 133 VPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKT 212 (419) T ss_pred ccccchhhhHHHHHHHhhhhhhhhcceeeeccCCceeeeeeccccccccccCcccceecCCccccccccceeeEEeeeee Confidence 788888777655 55566678999999999999999999986542 4578999999999999999999999999 Q ss_pred EEeeehhhHHHHhhHHHHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccchhhhhhhHHHHHHhhh Q lcl|NC_021309. 233 VANALTITDEGLRDAPELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPA 312 (497) Q Consensus 233 la~~~~iS~ell~d~~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 312 (497) ++++++||+|+|+|+++|++||.++|+++++.++|.+||+|+|+++|.||++.++..+....... T Consensus 213 ~~~~~~is~ell~d~~~l~~~i~~~la~a~~~~~d~aii~G~G~~~p~Gi~~~~~~~~~~~~~~~--------------- 277 (419) T protein:vir:94 213 VAHWLPITRQAADDNSQLMGYIQGRLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKPT--------------- 277 (419) T ss_pred EEEeehhhHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhccCcccccceecccccccccccccc--------------- Confidence 99999999999999999999999999999999999999999999999999998765443322110 Q ss_pred hhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHHHHHHHhhh Q lcl|NC_021309. 313 DGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDA 392 (497) Q Consensus 313 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~ 392 (497) ...+....++++.+++..+..+++ .+++|+||+.+|..|+++||+ T Consensus 278 ----------------------------------~~~t~~~~~~~l~~~~~~~~~~~~-~~~~~v~n~~~~~~l~~~k~~ 322 (419) T protein:vir:94 278 ----------------------------------APATDEPPLVDIRRAKTVAEIAGF-PPDGVVVHPQDWESIELDQAP 322 (419) T ss_pred ----------------------------------cccccchhHHHHHHHHHhhhhccC-CCCEEEEcHHHHHHHHHHhhc Confidence 111222345677778888877665 456899999999999999998 Q ss_pred cCc-eeccCcccccccccccccccccccceEecCCCCcCceEEEeeccceEEEEeecccEEEeecccchhhhcCceEEEE Q lcl|NC_021309. 393 NGQ-YMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRA 471 (497) Q Consensus 393 ~G~-~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~ 471 (497) +|+ |+++++..+. ...+|+|+||++++++|+++++||||++ +|.+++|.+++++++++.+++|++|++.||+ T Consensus 323 ~~~~~~~~~~~~~~------~~~~l~G~pV~~~~~~~~~~~~~gd~~~-~~~~~~~~~~~v~~~~~~~~~~~~~~~~~r~ 395 (419) T protein:vir:94 323 GSGVFRVIANVQGE------ATPRIWGLNVVSTVAIAQGTALVGGFRQ-GATLWSRQGITVLMTDSHADFFTANTLVILA 395 (419) T ss_pred CCCceeecCCcccC------CCccccceeeEEcCCCCCccEEEeeccc-eEEEEEecceEEEEeccccchhhcCcEEEEE Confidence 665 5666544322 3468999999999999999999999998 4678999999999999988899999999999 Q ss_pred EEeecceeecccceEEEEeeCCCC Q lcl|NC_021309. 472 EERLGLLVYRPSAFQLIQLKKGAT 495 (497) Q Consensus 472 ~~r~~~~v~~~~a~~~l~~~~~a~ 495 (497) +.|+|+.|++|+||++++++++.+ T Consensus 396 ~~r~d~~v~~~~a~~~~~~~aa~~ 419 (419) T protein:vir:94 396 EFRANLAVYQPKAFVRVTFAAATT 419 (419) T ss_pred EEeeccEEeccccEEEEEeccCCC Confidence 999999999999999999999999 No 19 >protein:vir:105038 Length: 428 # NCBI annotation: major capsid head protein precursor # Family: family:all:21 # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006586;genbank:gi:46402092;genbank:GeneID:2777903 Probab=100.00 E-value=2.6e-64 Score=369.19 Aligned_cols=404 Identities=17% Similarity=0.164 Sum_probs=252.7 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHH--HH-HHHHHHHHHHHHHHHHHHHhHhHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021309. 1 MPSTAQLEAQGRQLAKSIKDINADETK--TA-AEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDI 77 (497) Q Consensus 1 ~~~~a~~~~~~~~~~~~~~~~~~~~~~--~~-~e~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~a~~~~~~~~~ 77 (497) ||.+.+|+++..++.++++.+..+..+ .+ ++..+.+.++..++++ ++++++.++... T Consensus 1 M~kl~~L~e~r~~l~~~~~~l~~~~~e~~~lt~ee~~~~~~l~~e~~~--------------------l~~~i~~~e~~e 60 (428) T protein:vir:10 1 MPQIEELRRQRAGINEQIQALATIEATNGTLTAEQLTEFAGLQQQFTD--------------------ISAKMDRMEATE 60 (428) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHH--------------------HHHHHHHHHHHH Confidence 999999998888777766655532110 00 0111222222222222 222222211110 Q ss_pred HHHHHHHHHHHHHHHH-hhhhhhHHHHhHhhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccc Q lcl|NC_021309. 78 PEVEVRNLKQIRKHLA-RAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGST 156 (497) Q Consensus 78 ~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (497) +... ........... .......+.+. ..+..+..... .......... .. ..................+.+ T Consensus 61 ~~~~-~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~--~~~~~~~~~~-~~-~~~~~~~~~~~~~~~~~~~~~ 131 (428) T protein:vir:10 61 RAAA-LVAKPVKATQHGPAVIVKAEPKQ----YTGAGMTRMVM--SIAAAQGNLQ-DA-AKFASDELNDQSVSMAISTAA 131 (428) T ss_pred HHHH-HHhhhhhchhhccccccccccch----hhhHHHHHHHH--HHHHhhhhHH-HH-HHHhhhhhhhhhHhhhhcccc Confidence 0000 00000000000 00000000000 00000000000 0000000000 00 000000001111122223344 Q ss_pred cccccccchhhhHHHHHHHHhhhhHHhh-cceeecCCCceEEEEEcCCCccceecccccccccccccceeeEeeeeeEEe Q lcl|NC_021309. 157 GTFAPGILPTFLPGIVEQLFYELSLADL-ISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVAN 235 (497) Q Consensus 157 ~~~g~~v~p~~~~~ii~~~~~~~~l~~~-~~~~~~~~~~~~~p~~~~~~~~a~wv~Eg~~~~~s~~~f~~i~~~~~kla~ 235 (497) +++|.+||+++..+||+.+++.++|+++ +++++++++.++||+.++ .+.++|++|++.+|+++++|++|++.++++++ T Consensus 132 ~~gg~liP~~~~~~ii~~l~~~~~l~~~~~~~~~~~~g~~~~p~~~~-~~~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~ 210 (428) T protein:vir:10 132 GSGGVLIPQNIHSEVIELLRDRTIVRKLGARSIPLPNGNMSLPRLAG-GATASYTGENQDAKVSEARFDDVKLTAKTMIA 210 (428) T ss_pred cCCccccchhHHHHHHHHHhhhchhhhhcceeeecCCcceEEEEEeC-CcceeeeccCccccccccceeeEEeeeEEEEE Confidence 5566667777889999999999999998 778898888899999987 46899999999999999999999999999999 Q ss_pred eehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhcccCcc-ccccccccccccccccccchhhhhhhHHHHHHhhhh Q lcl|NC_021309. 236 ALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYP-GVNGLLQRSTGFTASSASSLFGATSATVSNVKFPAD 313 (497) Q Consensus 236 ~~~iS~ell~d~-~~l~~~i~~~la~~~~~~~d~~~l~G~G~~-~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 313 (497) +++||+|||+|+ ++|++||.++|++++++++|.+||+|+|++ +|.||++.++................. T Consensus 211 ~v~is~ell~ds~~~l~~~i~~~l~~ai~~~~d~~~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~--------- 281 (428) T protein:vir:10 211 MVPISNALIGRAGFNVEQLVLQDILTAISVREDKAFMRDDGTGDTPIGMKARATQWNRLLPWAADAAVNLD--------- 281 (428) T ss_pred eehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccccccccccccccHH--------- Confidence 999999999986 789999999999999999999999999986 799999977543322211100000000 Q ss_pred hcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHHHHHHHhhhc Q lcl|NC_021309. 314 GTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDAN 393 (497) Q Consensus 314 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~ 393 (497) . .+............... +..+.+|+||+.+|..|+++||++ T Consensus 282 ------------------------------------~-~~~~~~~~~~~~~~~~~-~~~~~~~v~n~~~~~~L~~lkd~~ 323 (428) T protein:vir:10 282 ------------------------------------T-IDTYLDSIILMSMDGNS-NMISSGWGMSNRTYMKLFGLRDGN 323 (428) T ss_pred ------------------------------------H-HHHHHHHHHHhhhcccc-ccccCEEEEcHHHHHHHHHhhccC Confidence 0 00000111111111112 233467999999999999999999 Q ss_pred CceeccCcccccccccccccccccccceEecCCCCcC--------ceEEEeeccceEEEEeecccEEEeecccc------ Q lcl|NC_021309. 394 GQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLG--------TILVGHFAPSVIQTARREGVTMQMTNSNG------ 459 (497) Q Consensus 394 G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~--------~~~~gd~~~~~~~i~~r~~~~i~~~~~~~------ 459 (497) |+|+|++.. ..+|+|+||++++.+|.+ .++||||++ |.++++.+++|+++++.. T Consensus 324 G~~i~~~~~----------~g~l~G~pv~~~~~~p~~~~~~~~~~~i~~gd~s~--~~i~~~~~i~i~~~~~~~~~~~~~ 391 (428) T protein:vir:10 324 GNKVYPEMA----------QGMLKGYPIQRTSAIPANLGEGGKESEIYFADFND--VVIGEDGNMKVDFSKEASYIDTDG 391 (428) T ss_pred CceeccCCC----------CCeeeceeeEEeccccccccCCCccceEEEEecce--EEEEEecceEEEeecccccccccc Confidence 999997532 237999999999999864 379999997 668899999999998753 Q ss_pred ---hhhhcCceEEEEEEeecceeecccceEEEEeeCC Q lcl|NC_021309. 460 ---TDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKG 493 (497) Q Consensus 460 ---~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~~ 493 (497) ++|++|++.||++.|+||.|++|+||+.++-..= T Consensus 392 ~~~~~f~~~~~~~R~~~r~d~~v~~p~a~~~~t~~~~ 428 (428) T protein:vir:10 392 KLVSAFSRNQSLIRVVTEHDIGFRHPEGLVLGTGVLF 428 (428) T ss_pred cccchhhcchhheeeeeeeCceeeccceEEEEeccCC Confidence 5799999999999999999999999999987777 No 20 >protein:vir:104256 Length: 458 # NCBI annotation: major head protein precursor # Family: family:all:27070 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006977;genbank:gi:46401878;genbank:GeneID:2777673 Probab=100.00 E-value=6.1e-62 Score=356.20 Aligned_cols=432 Identities=17% Similarity=0.125 Sum_probs=251.6 Q ss_pred Cc-hHHHHHHHH--HHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhHhHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021309. 1 MP-STAQLEAQG--RQLAKSIKDINADET-KTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDND 76 (497) Q Consensus 1 ~~-~~a~~~~~~--~~~~~~~~~~~~~~~-~~~~e~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~a~~~~~~~~ 76 (497) |- .+-++.+++ .++.+..+.+..... +++.+++. +....++.... ++.. ....|.++++.+..++++.+.++ T Consensus 1 ~~~~~~~~~~e~~~~e~a~~~~~~~~~~k~~e~~~~~k--e~~~~~l~~~~-e~~~-k~~~E~~~~le~~~ee~k~l~ee 76 (458) T protein:vir:10 1 MTIDINKLKEELGLGDLAKSLEGLTAAQKAQEAERMRK--EQEEKELARMN-DLVS-KAVGEDRKRLEEALELVKSLDEK 76 (458) T ss_pred CccchhhhhhhhchhhHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHH-HHHH-HHHHHHHHHHHHHHHHHHHHHHH Confidence 11 111111111 111111111100000 00000000 00000000000 0000 00112222222222333332222 Q ss_pred HHHHHHHHHHHHHHHH---Hh---hhh---hhHHHHhH-hhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhh Q lcl|NC_021309. 77 IPEVEVRNLKQIRKHL---AR---AVI---MNPELKNA-TSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPA 146 (497) Q Consensus 77 ~~~~~~~~~~~~~~~~---~~---~~~---~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 146 (497) ..+.........++.. .. ... .....+.. .............................+........... T Consensus 77 ~~~~~~~~a~~~e~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~ 156 (458) T protein:vir:10 77 SKKSNELFAQTVEKQQETIVGLQDEIKSLLTAREGRSFVGDSVAKALYGTQENFEDEVEKLVLLSYVMEKGVFETEHGQR 156 (458) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhccchhhhhhHHHHHHHHHHHHHHHhhccchhhhhhh Confidence 2211111100000000 00 000 00000000 00000000000000000000000000000000000011111 Q ss_pred hh-hhhccccccccccccchhhhHHHHHHHHhhhhHHhhcceeecCCCceEEEEEcCCCccceecccccccccc------ Q lcl|NC_021309. 147 AI-GQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFS------ 219 (497) Q Consensus 147 ~~-~~~~~~~~~~~g~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~wv~Eg~~~~~s------ 219 (497) .. .....++.+.+|.++|+++..+|++.+++.++|+++|+++|++++...+|+.++ .+.+.|++|++.+|++ T Consensus 157 ~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~-~~~a~~v~e~~~~~~~~~~~~~ 235 (458) T protein:vir:10 157 HLKAVNQSSSVEVSSESYETIFSQRIIRDLQKELVVGALFEELPMSSKILTMLVEPD-AGKATWVAASTYGTDTTTGEEV 235 (458) T ss_pred hhhhhhhcccCccccceehhhHhHHHHHHHHhhhhHHhhcceeecCCcceEEEEecC-Ccceeecccccccccccccccc Confidence 11 112223344566778888999999999999999999999999999999999876 4679999999888754 Q ss_pred cccceeeEeeeeeEEeeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccchh Q lcl|NC_021309. 220 SEEFARVYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLF 298 (497) Q Consensus 220 ~~~f~~i~~~~~kla~~~~iS~ell~d~-~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~ 298 (497) +++|++|++.++|++++++||+|+|+|+ ++|++||.++|+++++.++|.+||+|+|+++|.||++.++..+........ T Consensus 236 ~~~~~~i~~~~~k~~~~v~is~ell~ds~~~~~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~ 315 (458) T protein:vir:10 236 KGALKEIHFSTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMTGDGSGKPKGLLTLASEDSAKVVTEAK 315 (458) T ss_pred cccceeeEeeeeeEEeeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCccceeeecccccccceeeccc Confidence 6789999999999999999999999997 689999999999999999999999999999999999987654433221110 Q ss_pred hhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEe Q lcl|NC_021309. 299 GATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVM 378 (497) Q Consensus 299 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 378 (497) . .......++++.++++.+...++ .+.+|+| T Consensus 316 ~------------------------------------------------~~~~~~~~~~i~~~~~~l~~~~~-~~~~~v~ 346 (458) T protein:vir:10 316 A------------------------------------------------DGSVLVTAKTISKLRRKLGRHGL-KLSKLVL 346 (458) T ss_pred c------------------------------------------------cccccccHHHHHHHHHhhhhhhc-CCCEEEE Confidence 0 00111124566667777766655 4567999 Q ss_pred chhHHHHHHHHhhhcCceeccCcccccccccccccccccccceEecCCCCcC----ceEEEeeccceEEEEeecccEEEe Q lcl|NC_021309. 379 NPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLG----TILVGHFAPSVIQTARREGVTMQM 454 (497) Q Consensus 379 n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~----~~~~gd~~~~~~~i~~r~~~~i~~ 454 (497) |+.+|..|+++||++|+|||++..... .....+.+|||+||+++++||.+ .++||||+. +|.+++|.+++|++ T Consensus 347 ~~~~~~~l~~lkd~~G~~i~~~~~~~~--~~~~~~~~l~G~pv~~~~~~p~~~~~~~~~~~~f~~-~~~~~~~~~~~v~~ 423 (458) T protein:vir:10 347 IVSMDAYYDLLEDEEWQDVAQVGNDSV--KLQGQVGRIYGLPVVVSEYFPAKANSAEFAVIVYKD-NFVMPRQRAVTVER 423 (458) T ss_pred cHHHHHHHHhhcccCCceeeccccccc--cccCcCceecceeeEEccccccccCCcceEEEEecc-cEEEEEeeceEEEe Confidence 999999999999999999998755432 22334568999999999999974 479999987 57899999999987 Q ss_pred ecccchhhhcCceEEEEEEeecceeecccceEEEEeeCC Q lcl|NC_021309. 455 TNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKG 493 (497) Q Consensus 455 ~~~~~~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~~ 493 (497) +++ +.+|+|.||++.|+|+.|++|+|||+++++++ T Consensus 424 d~~----~~~~~~~~~~~~r~~~~v~~~~a~v~~~~aa~ 458 (458) T protein:vir:10 424 ERQ----AGKQRDAYYVTQRVNLQRYFANGVVSGTYAAS 458 (458) T ss_pred ecc----cCCCceEEEEEEEecceEecccceEEEeeccC Confidence 654 56899999999999999999999999998888 No 21 >protein:vir:8420 Length: 477 # NCBI annotation: gp15 # Family: family:all:21 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818316;genbank:gi:29566752;genbank:GeneID:1260033 Probab=100.00 E-value=1.2e-62 Score=359.99 Aligned_cols=433 Identities=15% Similarity=0.109 Sum_probs=272.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021309. 7 LEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEVEVRNLK 86 (497) Q Consensus 7 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~a~~~~~~~~~~~~~~~~~~ 86 (497) ++.+.++|...+.+++.+.. ++.+.++.+.++.+...+.....+..+++.+++++++++++.++..+++.+..... T Consensus 1 ~~k~~eem~~~i~eL~e~r~----~l~~e~~~l~d~ak~e~~~~~~~~e~~e~~a~~~el~~ei~~le~~~~~~~~~~~~ 76 (477) T protein:vir:84 1 MEKHLEELRALRAAAVEAVA----TLKAERQAIADGAKAEERAALSADETAEFRAKSASIKAELDKVEDLDEQIRELESE 76 (477) T ss_pred CchHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 55555555555554444332 23333333334443333332233344555666666766666655444333221111 Q ss_pred HHHHHH--Hhh------hhhhHHHHhHhhhhhhhhhhhhHHHHHHhhhHHHHHHHH------HHHH--Hhhhhhhhhhhh Q lcl|NC_021309. 87 QIRKHL--ARA------VIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAEL------MGAF--ADGETAPAAIGQ 150 (497) Q Consensus 87 ~~~~~~--~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~--~~~~~~~~~~~~ 150 (497) ...... ... ............ ........................+. .... ............ T Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 155 (477) T protein:vir:84 77 IERSGKLEAETKTVRKATVEVNEALTYEK-GNGQSYFRDLAMQTVGMADEPAKERLRRHMVDVESDKEIRKIAKVGEEYR 155 (477) T ss_pred HHHhhcchhhhhhhcccccccccchhhhh-hHHHHHHHHHHHHHhhhhhhHHHHHHHHHHhhhhhhhhHHHHHHhhhhhc Confidence 000000 000 000000000000 00000000000000000000000000 0000 000011112223 Q ss_pred hccccccccccccchhhh-HHHHHHHHhhhhHHhhcceeecCC--CceEEEEEcCCCccceeccccc-----cccccccc Q lcl|NC_021309. 151 NPFGSTGTFAPGILPTFL-PGIVEQLFYELSLADLISSRPVTS--PNLSYLTESAAHNNAAAVAEAG-----TYPFSSEE 222 (497) Q Consensus 151 ~~~~~~~~~g~~v~p~~~-~~ii~~~~~~~~l~~~~~~~~~~~--~~~~~p~~~~~~~~a~wv~Eg~-----~~~~s~~~ 222 (497) ...++++++|++|||+++ ..|++.+++.++|+++|+++++++ +++.||+.+++...++|++||+ .+|+++++ T Consensus 156 ~~~~~~~~gg~lv~~~~~~~~ii~~l~~~~~i~~~~~~~~~~~~~~~~~ip~~~~~~~~a~~~~Eg~~~~~~~~~~s~~~ 235 (477) T protein:vir:84 156 DLDRNGGTGGYAVPPLWMMNRFIELARAGRTYANLCPTEPLPGGTSSINIPKILTGTSTAIQAADNAALTAPSAHEVDLT 235 (477) T ss_pred cccccCCCcceeeccchhHHHHHHHhhhcchHHHhhceeeecCCcceeEEEEEecCcceeeeeccCcccccccccccccc Confidence 334556678888999975 569999999999999999998765 4689999887777788999986 45788999 Q ss_pred ceeeEeeeeeEEeeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhcccCc-cccccccccccccccccccchhhh Q lcl|NC_021309. 223 FARVYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGY-PGVNGLLQRSTGFTASSASSLFGA 300 (497) Q Consensus 223 f~~i~~~~~kla~~~~iS~ell~d~-~~l~~~i~~~la~~~~~~~d~~~l~G~G~-~~p~Gi~~~~~~~~~~~~~~~~~~ 300 (497) |++|++++||++++++||+|||+|+ +++++||.++|+++++.++|.+||+|+|+ ++|.||++.++......+... T Consensus 236 f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~~l~G~Gt~~~p~Gi~~~~~~~~~~~~~~~--- 312 (477) T protein:vir:84 236 DGFVQANVKTIAGQQGIAIQLLDQAAVSVDEFVFRDLAADYANKLNVQVISGTGSNNQVVGVRATAGITQVTATSAG--- 312 (477) T ss_pred eeeEEEeeeeEEeeeHHHHHHHhccchhHHHHHHHHHHHHHHHHHHHHHhccCCCCCccceeeeccccccccccccc--- Confidence 9999999999999999999999996 69999999999999999999999999997 479999998765433322110 Q ss_pred hhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEech Q lcl|NC_021309. 301 TSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNP 380 (497) Q Consensus 301 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~ 380 (497) ..........+.+.+++..+...++.++++|+||+ T Consensus 313 ---------------------------------------------~t~~~~~~~~~~i~~~~~~~~~~~~~~~~~~v~~~ 347 (477) T protein:vir:84 313 ---------------------------------------------SALEKHQIIYQKIADAIQRVHTSRFLEPEVIVMHP 347 (477) T ss_pred ---------------------------------------------cchhhHHHHHHHHHHHHhhccccccCCccEEEEcH Confidence 01111223345667777777788888889999999 Q ss_pred hHHHHHHHHhhhcCceeccCccccccc-------ccccccccccccceEecCCCCcC--------ceEEEeeccceEEEE Q lcl|NC_021309. 381 RDWELLRLTKDANGQYMGGNFFGNAYG-------NPVNGGKNIWGVPVVTTPLIPLG--------TILVGHFAPSVIQTA 445 (497) Q Consensus 381 ~~~~~l~~lkd~~G~~i~~~~~~~~~~-------~~~~~~~~l~G~Pvv~~~~~~~~--------~~~~gd~~~~~~~i~ 445 (497) .+|..|+++||++|||||+|....... ......++|||+||++++.||++ .++||||++ +.++ T Consensus 348 ~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~~~~~~~~~l~G~pVv~s~~~p~~~~~~~d~~~i~~gd~~~--~~i~ 425 (477) T protein:vir:84 348 RRWASFHAIFAGDDRPLIVPSGPGFNNLGVLTEVASQRVVGQMHGLPVVTDPTLPTTLGTGTDQDVIHVLRASD--LALF 425 (477) T ss_pred HHHHHHHHhhccCCCeeeecCcccccccccccccccccccchhcccceEecCcccccccccCCcceEEEEEece--EEEE Confidence 999999999999999999976443221 11223458999999999999964 379999987 4555 Q ss_pred eecccEEEeecccchhhhcCceEEEEEEeeccee-ecccceEEEEeeCCCCCC Q lcl|NC_021309. 446 RREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLV-YRPSAFQLIQLKKGATGS 497 (497) Q Consensus 446 ~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v-~~~~a~~~l~~~~~a~~~ 497 (497) + .++++.++++.. +.++++.|++..++++.. ++|+||+.++.++.+.-+ T Consensus 426 ~-~~~~~~~~~~~~--~~~~~~~~~v~~~~~~~~~r~~~afv~~t~~~~~~~~ 475 (477) T protein:vir:84 426 E-SSVRMRALQETR--AENLSVLLQVYGYLAFTAARFPQSVVEIGGTALTAPT 475 (477) T ss_pred e-eceeEEeccccc--cccceeeeeehhhhhhhhhccccceEEeecccccccc Confidence 4 578888887754 557888898888788754 569999999999988888 No 22 >protein:vir:1433 Length: 435 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536362;genbank:gi:17975167;genbank:GeneID:929171 Probab=100.00 E-value=6.3e-63 Score=361.59 Aligned_cols=402 Identities=14% Similarity=0.156 Sum_probs=248.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HH Q lcl|NC_021309. 16 KSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEVEVRNLKQIRKH--LA 93 (497) Q Consensus 16 ~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~--~~ 93 (497) +.+++++++..+..+++++.. +.. ...+.+. ++..+++++++++++.++.++++.+.......... .. T Consensus 1 M~i~eL~e~r~~~~~~~~~l~----~~~-~e~~~lt-----~ee~~~~~~l~~ei~~l~~~I~~~e~~~~~~~~~~~~~~ 70 (435) T protein:vir:14 1 MNVNELRRERAAVNQRVQALA----QIE-VGGTALS-----VEQQAEFDQLSSKFSELTAQIERAEAAERMAAAAAVPVD 70 (435) T ss_pred CCHHHHHHHHHHHHHHHHHHH----HHH-hccCCCC-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccc Confidence 333333332222222222211 110 1111111 12223444455555555544443332111100000 00 Q ss_pred hhh--hhhHH---HHh--HhhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhh-hhccccccccccccch Q lcl|NC_021309. 94 RAV--IMNPE---LKN--ATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIG-QNPFGSTGTFAPGILP 165 (497) Q Consensus 94 ~~~--~~~~~---~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~g~~v~p 165 (497) ... ..... ... .........+....+ .......................... ....++++.+|.+||+ T Consensus 71 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~ 146 (435) T protein:vir:14 71 PNPTAVAAPAAAPVHAQPKALEVKGAKMARMVR----ALAAARGDAQLASKLAIERGFGEEVAMSLNTLSPGAGGVLVPE 146 (435) T ss_pred chhhhhhhccccccccccchhhhhHHHHHHHHH----HHHhhcchhhHHHHHHHhhhhhhhhhhhcccCCcCCCccccch Confidence 000 00000 000 000000000000000 00000000000000000000011111 2223344556667777 Q ss_pred hhhHHHHHHHHhhhhHHhh-cceeecCCCceEEEEEcCCCccceecccccccccccccceeeEeeeeeEEeeehhhHHHH Q lcl|NC_021309. 166 TFLPGIVEQLFYELSLADL-ISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTITDEGL 244 (497) Q Consensus 166 ~~~~~ii~~~~~~~~l~~~-~~~~~~~~~~~~~p~~~~~~~~a~wv~Eg~~~~~s~~~f~~i~~~~~kla~~~~iS~ell 244 (497) ++..+||+.+++.++|+++ +++++++++.+.||+.++ .+.++|++|++.+|+++++|++|++.++|++++++||+||| T Consensus 147 ~~~~~ii~~l~~~~~i~~~~~~~~~~~~~~~~~p~~~~-~~~a~~v~E~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell 225 (435) T protein:vir:14 147 NLSSEVIELLRPKSVVRKLGARTLPLSNGNITIPRLKG-GAIVGYIGADTDIPTTQQQFDDLKLTAKKMAALVPIANDLI 225 (435) T ss_pred hHHHHHHHHHhhhchhhhhcceeeecCCCceEEEEEeC-CcceeeeccCccccccccceeEEEeeeEEEEEeehhhHHHH Confidence 7888999999999999997 788999888999999987 46799999999999999999999999999999999999999 Q ss_pred hhH---HHHHHHHHHHHHHHHHHHHHhhhhcccCcc-ccccccccccccccccccchhhhhhhHHHHHHhhhhhcchhhh Q lcl|NC_021309. 245 RDA---PELFNFVQGRLLEGIQRKEEVQLLAGGGYP-GVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFV 320 (497) Q Consensus 245 ~d~---~~l~~~i~~~la~~~~~~~d~~~l~G~G~~-~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 320 (497) +|+ ++|++||.++|++++++++|.+|++|+|++ +|.||++.+....+........ T Consensus 226 ~ds~~~~~l~~~i~~~l~~ai~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~--------------------- 284 (435) T protein:vir:14 226 KYAGVNPNVDQIVVGDLTAAIGAREDKAFIRDDGTANTPKGLRFWALPSNVITASDAST--------------------- 284 (435) T ss_pred HhhccCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccceeecccccceeccccccc--------------------- Confidence 997 469999999999999999999999999985 6999988654433222211100 Q ss_pred hhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhh-hccCCceEEechhHHHHHHHHhhhcCceecc Q lcl|NC_021309. 321 GQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLT-LFQTPNAVVMNPRDWELLRLTKDANGQYMGG 399 (497) Q Consensus 321 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~n~~~~~~l~~lkd~~G~~i~~ 399 (497) ......++...+..+... .+..+.+|+||+.+|..|+++||++|+|||+ T Consensus 285 ------------------------------~~~~~~~~~~l~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~ 334 (435) T protein:vir:14 285 ------------------------------LQKIETDLGKVILALENADANLTQPGWIMAPRTFRFLEGLRDGNGNKVYP 334 (435) T ss_pred ------------------------------hhhHHHHHHHHHHHhhhccccccCCEEEEcHHHHHHHHHhhccCCceecc Confidence 001112222222222222 1234567999999999999999999999995 Q ss_pred CcccccccccccccccccccceEecCCCCcC--------ceEEEeeccceEEEEeecccEEEeecccc---------hhh Q lcl|NC_021309. 400 NFFGNAYGNPVNGGKNIWGVPVVTTPLIPLG--------TILVGHFAPSVIQTARREGVTMQMTNSNG---------TDF 462 (497) Q Consensus 400 ~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~--------~~~~gd~~~~~~~i~~r~~~~i~~~~~~~---------~~f 462 (497) .. ...+|+|+||++++.||.+ .++||||++ |.+++|.++++.++++.. ++| T Consensus 335 ~~----------~~g~l~G~Pv~~~~~~p~~~~~~~~~~~i~~gd~s~--~~i~~~~~~~~~~~~~~~~~~~~~~~~~~f 402 (435) T protein:vir:14 335 EL----------ANGMLKGYPVGKTTQVPINLGETGKESEIYFTDFGD--VFIGEEETLEIDYSKEATYKDADGHMVSAF 402 (435) T ss_pred CC----------CCCeeecceeEeeccccccccCCCccceEEEeeccc--EEEEEecccEEEEeccccccccccchhhhh Confidence 42 1347999999999999863 489999998 558999999999998754 569 Q ss_pred hcCceEEEEEEeecceeecccceEEEEeeCCCC Q lcl|NC_021309. 463 VDGKVTVRAEERLGLLVYRPSAFQLIQLKKGAT 495 (497) Q Consensus 463 ~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~~a~ 495 (497) ++|++.||+++|+||+|++|+||++|+-.+.-. T Consensus 403 ~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~~~~ 435 (435) T protein:vir:14 403 QRDQTLIRVIAKNDFGPRHVESIAVLAGVAWGA 435 (435) T ss_pred hcChhheeeeeeeCceeecccceEEEecCCCCC Confidence 999999999999999999999999887665544 No 23 >protein:vir:4511 Length: 409 # NCBI annotation: capsid # Family: family:all:21 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599037;genbank:gi:19548995;genbank:GeneID:935211 Probab=100.00 E-value=5.4e-63 Score=361.95 Aligned_cols=397 Identities=16% Similarity=0.142 Sum_probs=254.2 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021309. 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) Q Consensus 1 ~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~a~~~~~~~~~~~~ 80 (497) |= +-+|.++..++.++++.+ +++. +.....+|..+++++++++++.++.+++.. T Consensus 1 M~-l~eL~e~r~~l~~e~~~l--------------~~k~-----------~~~~~t~e~~~~~~~~~~e~~~l~~~i~~~ 54 (409) T protein:vir:45 1 MK-LHELKQKRNTIATDMRAL--------------NEKI-----------GDNAWTEEQRTEWNKAKSELEALDERIARE 54 (409) T ss_pred CC-HHHHHHHHHHHHHHHHHH--------------HHHh-----------hcCCCCHHHHHHHHHHHHHHHHHHHHHHHH Confidence 22 222222222222222211 1110 000112233334444444444444444333 Q ss_pred HHHHHHHHHHHHHhhhhhhHHHHhHhhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccccc Q lcl|NC_021309. 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFA 160 (497) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 160 (497) +......... ..... ............. .................... .................++.+.+| T Consensus 55 e~~~~~~~~~-~~~~~---~~~~~~~~~~~~~-~~~~~~~~a~~~~l~~~~~~---~~~~e~~~~~~~~a~~~~~~~~gg 126 (409) T protein:vir:45 55 EELRRQDQAY-IESNE---EEQRQNLDPENNS-QQDEKRAQVFDKWMRHGASE---LTSEERKALRELRAQGVAQDEKGG 126 (409) T ss_pred HHHHHHHHHH-Hhhhh---hhhcccCCCCCcc-hhhHHHHHHHHHHHHhhhhh---ccHHHHHHHHHHhhccCccCcCCc Confidence 2211110000 00000 0000000000000 00000000000000000000 000111111122233334445566 Q ss_pred cccchhhhHHHHHHHHhhhhHHhhcceeecCCCc-eEEEEEcCCCccceecccccccccccccceeeEeeeeeEE-eeeh Q lcl|NC_021309. 161 PGILPTFLPGIVEQLFYELSLADLISSRPVTSPN-LSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVA-NALT 238 (497) Q Consensus 161 ~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~-~~~p~~~~~~~~a~wv~Eg~~~~~s~~~f~~i~~~~~kla-~~~~ 238 (497) .+||+++..+|++.+++.++|+++|++++++++. ..+|+.++....+.|++|++.+|+++++|+++++.++|++ ++++ T Consensus 127 ~liP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~f~~~~l~~~k~~~~~i~ 206 (409) T protein:vir:45 127 YTVPETFLAKVVEKMKSYGGIASVAQILTTSDGRTMEWATADGTSEVGVLLGENEEAGEEDTDFGMGSLGALKMTSKIIR 206 (409) T ss_pred eeccHhHHHHHHHHHHhhhhhhhhceeeecCCCceEEEEeeccCccccccccccccccccccccceeeeeeeeeeeeehh Confidence 7778888899999999999999999999998764 5566666545567899999999999999999999999986 5789 Q ss_pred hhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhcccCcc---ccccccccccccccccccchhhhhhhHHHHHHhhhhh Q lcl|NC_021309. 239 ITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYP---GVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADG 314 (497) Q Consensus 239 iS~ell~d~-~~l~~~i~~~la~~~~~~~d~~~l~G~G~~---~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 314 (497) ||+|||+|+ ++|++||.++|+++++.++|.+||+|+|++ +|.||++..+........ T Consensus 207 is~ell~ds~~~l~~~i~~~la~a~~~~~~~a~l~G~G~~~~~~p~Gil~~~~~~~~~~~~------------------- 267 (409) T protein:vir:45 207 VSNELLQDSAIDMEAYLARRIAERIGRGEARYLIQGTGAGTPKQPKGLAASVTGTTQTAAA------------------- 267 (409) T ss_pred hhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCccccceeeeccccccccccc------------------- Confidence 999999997 699999999999999999999999999976 699998876543222111 Q ss_pred cchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCce-EEechhHHHHHHHHhhhc Q lcl|NC_021309. 315 TNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNA-VVMNPRDWELLRLTKDAN 393 (497) Q Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~n~~~~~~l~~lkd~~ 393 (497) ....++++..++..+...++.++.+ |++|+.+|..|++|||++ T Consensus 268 ------------------------------------~~~~~d~i~~l~~~l~~~~~~~a~~~~~~n~~~~~~l~~lkd~~ 311 (409) T protein:vir:45 268 ------------------------------------NAVKWQEILALKHSIDPAYRRGPKFRLAFNDNTLKLISEMEDGQ 311 (409) T ss_pred ------------------------------------cccchHHHHHHHHhhhhhhccCCeEEEEECHHHHHHHHHhhcCC Confidence 0111355667777777777777664 688999999999999999 Q ss_pred CceeccCcccccccccccccccccccceEecCCCCc-----CceEEEeeccceEEEEeecccEEEeecccchhhhcCceE Q lcl|NC_021309. 394 GQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPL-----GTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVT 468 (497) Q Consensus 394 G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~-----~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~ 468 (497) |||||+++.... .+.+|||+||+++++||. ..++||||++ |.++++.+++++++... +|.+|++. T Consensus 312 G~~i~~~~~~~~------~~~~l~G~PV~~~~~~p~~~~~~~~i~~Gd~~~--~~i~~~~~~~~~~~~d~--~~~~~~~~ 381 (409) T protein:vir:45 312 GRPLWLPDIVGV------APASVLNVPYVIDQEIDDIGAGKKFMFCGDFDR--FIIRRVRYMILKRLVER--YAEYDQTG 381 (409) T ss_pred CceeeccCcCCC------CCceecceeeEEecCcCCccCCccEEEEeehhh--hheeeccceEEEEeecc--cccCCcEE Confidence 999998765442 346899999999999985 3378899997 56788999999887643 57899999 Q ss_pred EEEEEeecceeecccceEEEEeeCCCCC Q lcl|NC_021309. 469 VRAEERLGLLVYRPSAFQLIQLKKGATG 496 (497) Q Consensus 469 ~r~~~r~~~~v~~~~a~~~l~~~~~a~~ 496 (497) ||++.|+|+.|++|+||+.+++++++-| T Consensus 382 ~~~~~r~d~~~~~~~A~~~l~~k~s~~~ 409 (409) T protein:vir:45 382 FLAFHRFDCILEDTSAIKALVGKGSVGG 409 (409) T ss_pred EEEEEEeccEeechhheEEEEeccCCCC Confidence 9999999999999999999999999999 No 24 >protein:vir:80376 Length: 435 # NCBI annotation: gp6, major capsid head protein # Family: family:all:21 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111085;genbank:gi:134288639;genbank:GeneID:4960624 Probab=100.00 E-value=1.1e-62 Score=360.27 Aligned_cols=405 Identities=13% Similarity=0.135 Sum_probs=248.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-- Q lcl|NC_021309. 16 KSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEVEVRNLKQIRKHLA-- 93 (497) Q Consensus 16 ~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~-- 93 (497) +.+++++++..+..+++++.. +. +...+.+. ++..+++++++++++.++.++++.+............ T Consensus 1 M~l~eL~~~r~~~~~~~~~l~----~~-~~e~~~l~-----~ee~~~~~~l~~ei~~l~~~i~~~e~~e~~~~~~~~~~~ 70 (435) T protein:vir:80 1 MNVNELRRERAAVNQRVQALA----QI-EVGGTALS-----VEQQAEFDQLSSKFNELTAQIERAEAAERMAAAAAVPVD 70 (435) T ss_pred CCHHHHHHHHHHHHHHHHHHH----HH-HhccCCCC-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccc Confidence 333333332222222222211 11 11111221 1222334444444444444444333211110000000 Q ss_pred h--hhhhhHHHHhHhhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHh--hhhhhhhhhh-hccccccccccccchhhh Q lcl|NC_021309. 94 R--AVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFAD--GETAPAAIGQ-NPFGSTGTFAPGILPTFL 168 (497) Q Consensus 94 ~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~-~~~~~~~~~g~~v~p~~~ 168 (497) . ..................... ........................ .......... ...++.+.+|.+||+++. T Consensus 71 ~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~lvP~~~~ 149 (435) T protein:vir:80 71 PNPAAVTASAAAPVYAQPKAPEVK-GAKMARMVRALAAARGDAQLASKLAIERGFGEEVAMSLNTLSPGAGGVLVPENLS 149 (435) T ss_pred chhhhhccccccccccccchhhhh-HHHHHHHHHHHHhccchhHHHHHHHHhhhhhhhhhhhhcccCCCCCccccchhHH Confidence 0 000000000000000000000 000000000000000000000000 0001111111 222334455566677788 Q ss_pred HHHHHHHHhhhhHHhh-cceeecCCCceEEEEEcCCCccceecccccccccccccceeeEeeeeeEEeeehhhHHHHhhH Q lcl|NC_021309. 169 PGIVEQLFYELSLADL-ISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTITDEGLRDA 247 (497) Q Consensus 169 ~~ii~~~~~~~~l~~~-~~~~~~~~~~~~~p~~~~~~~~a~wv~Eg~~~~~s~~~f~~i~~~~~kla~~~~iS~ell~d~ 247 (497) .+||+.+++.++|+++ +++++++++.+.||+.++ .+.+.|++|++.+|+++++|++|++.++|++++++||+|+|+|+ T Consensus 150 ~~ii~~l~~~~~i~~~~~~~v~~~~~~~~~p~~~~-~~~a~~v~E~~~~~~~~~~f~~i~~~~~k~~~~~~is~ell~ds 228 (435) T protein:vir:80 150 SEVIELLRPKSVVRKLGARTLPLSNGNITIPRLKG-GAIVGYIGADTDIPTTQQQFDDLKLTAKKMAALVPIANDLIKYA 228 (435) T ss_pred HHHHHHHhhhchhhhccceeeecCCCceEEEEEeC-CcceeeeccCccccccccceeeEEEeeEEEEEeehhhHHHHHhh Confidence 8999999999999998 789999999999999987 46799999999999999999999999999999999999999986 Q ss_pred ---HHHHHHHHHHHHHHHHHHHHhhhhcccCcc-ccccccccccccccccccchhhhhhhHHHHHHhhhhhcchhhhhhh Q lcl|NC_021309. 248 ---PELFNFVQGRLLEGIQRKEEVQLLAGGGYP-GVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQD 323 (497) Q Consensus 248 ---~~l~~~i~~~la~~~~~~~d~~~l~G~G~~-~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 323 (497) ++|++||.++|+++++.++|.+|++|+|++ +|.||++.....+......... T Consensus 229 ~~~~~l~~~i~~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~------------------------ 284 (435) T protein:vir:80 229 GVNPNVDQIVVGDLTAAIGAREDKAFIRDDGTANTPKGLRFWALPGNVITASDGST------------------------ 284 (435) T ss_pred cccHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCcccceeecccccceeecccccc------------------------ Confidence 479999999999999999999999999975 6999998775544332221100 Q ss_pred hhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhh-hccCCceEEechhHHHHHHHHhhhcCceeccCcc Q lcl|NC_021309. 324 TVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLT-LFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFF 402 (497) Q Consensus 324 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~n~~~~~~l~~lkd~~G~~i~~~~~ 402 (497) ......++..++..+... .+..+.+|+||+.+|..|+++||++|+|+|+.. T Consensus 285 ---------------------------~~~~~~d~~~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~G~~l~~~~- 336 (435) T protein:vir:80 285 ---------------------------LQKIETDLGKAILALENADANLTQPGWIMAPRTFRFLEGLRDGNGNKVYPEL- 336 (435) T ss_pred ---------------------------hhhHHHHHHHHHHHhhccccccccCEEEEcHHHHHHHHhhhccCCceeccCC- Confidence 001111222332222222 133456899999999999999999999999532 Q ss_pred cccccccccccccccccceEecCCCCcC--------ceEEEeeccceEEEEeecccEEEeecccc---------hhhhcC Q lcl|NC_021309. 403 GNAYGNPVNGGKNIWGVPVVTTPLIPLG--------TILVGHFAPSVIQTARREGVTMQMTNSNG---------TDFVDG 465 (497) Q Consensus 403 ~~~~~~~~~~~~~l~G~Pvv~~~~~~~~--------~~~~gd~~~~~~~i~~r~~~~i~~~~~~~---------~~f~~~ 465 (497) ...+|+|+||++++.||.+ .++||||++ |.+++|.+++|+++++.. ++|++| T Consensus 337 ---------~~~~l~G~pv~~~~~~p~~~~~~~~~~~i~~gd~s~--~~i~~~~~~~i~~~~~~~~~~~~~~~~~~f~~n 405 (435) T protein:vir:80 337 ---------ANGMLKGYPVGKTTQVPINLGEAGKESEIYFTDFGD--VFIGEEETLEIDYSKEATYKDADGHMVSAFQRD 405 (435) T ss_pred ---------CCCeEeeeeeEEeccccccccCCCCcceEEEEEccc--EEEEeecceEEEEeccccccccccchhhhhhcC Confidence 1247999999999999863 489999998 558899999999998764 569999 Q ss_pred ceEEEEEEeecceeecccceEEEEeeCCCC Q lcl|NC_021309. 466 KVTVRAEERLGLLVYRPSAFQLIQLKKGAT 495 (497) Q Consensus 466 ~v~~r~~~r~~~~v~~~~a~~~l~~~~~a~ 495 (497) ++.||++.|+||.|++|+||++|+-..-.. T Consensus 406 ~~~~r~~~r~d~~~~~~~a~~~l~~~~~~~ 435 (435) T protein:vir:80 406 QTLIRVIAKNDFGPRHVESIAVLSGVAWGA 435 (435) T ss_pred cceeeeeeeeCcEeecccceEEEeccCCCC Confidence 999999999999999999999998777666 No 25 >protein:vir:8102 Length: 543 # NCBI annotation: gp6 # Family: family:all:21 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817683;genbank:gi:29566114;genbank:GeneID:1259308 Probab=100.00 E-value=7e-61 Score=350.37 Aligned_cols=429 Identities=15% Similarity=0.082 Sum_probs=246.4 Q ss_pred CchH--HHHHHHHHHHHHHHHHHHH------HHHHHHHHHHHHHHHHHHHHHHHHHhHhHHHHHHHH------------- Q lcl|NC_021309. 1 MPST--AQLEAQGRQLAKSIKDINA------DETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEM------------- 59 (497) Q Consensus 1 ~~~~--a~~~~~~~~~~~~~~~~~~------~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~e~~~e~------------- 59 (497) -|+. +++.++++++.+++.++.. ++..++.+++..++.+.++++..+.. +..+...+. T Consensus 40 ~~~~~~~~~~~~~~e~~~~~e~l~~~~~~~~~e~~~~~~~~~e~~el~~~~~~l~~~-e~~~~~~e~~~~~~~~~~~~~~ 118 (543) T protein:vir:81 40 APTLTYSQARNRADEVHARMEQIAELDKPTDEENEEFRALGAEFDSLVNHMSRLERA-AELARVRSTHEQIGKPQSGGQR 118 (543) T ss_pred hhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHH Confidence 2222 2223333332222222211 11122222222222222222211100 000000000 Q ss_pred ------------------------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHhHhhhh Q lcl|NC_021309. 60 ------------------------------LKSLGGADAAKDGLDNDIPEVEVRNLKQIRKHLARAVIMNPELKNATSFE 109 (497) Q Consensus 60 ------------------------------~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 109 (497) ..+.....++.+.+.++..........+...................... T Consensus 119 e~r~e~~a~~~~~~~~~~~~~~~~~~l~e~~~~~~~~~~e~k~~~e~~~~e~~e~~~~~~~~~e~l~~~~e~~~~~~~~~ 198 (543) T protein:vir:81 119 RMRVEAGSSQGGRGDYDRDAILEPDSIEDCRFRDPWNLSEMRTFGRDAEEVKGELRARALSAIEKMQGASDNVRAAATKI 198 (543) T ss_pred HhhhhhhhHHHhhHHHHHhhhccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 00000000000000000000000000000000000000000000000000 Q ss_pred hhhhhhhhHHHHH-----HhhhHHHHHHH-HHHH----HHhhhhhhhhhhhhccccccccccccchhhhHHHH-HHHHhh Q lcl|NC_021309. 110 KGTKFDVSFNVSA-----KAADPGTAAAE-LMGA----FADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIV-EQLFYE 178 (497) Q Consensus 110 ~~~~~~~~~~~~~-----~~~~~~~~~~~-~~~~----~~~~~~~~~~~~~~~~~~~~~~g~~v~p~~~~~ii-~~~~~~ 178 (497) ............. ........... .+.. .................+++++|.+||+++..++| ..++.. T Consensus 199 ~~~~d~~e~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~l~~~e~~~~~~~~~~~~t~~~gg~lip~~~~~~ii~~~~~~~ 278 (543) T protein:vir:81 199 IERFDDEDSTLARQCLATSSPAYLRAWSKMARNPHAAILTEEEKRAINEVRAMGLTKADGGYLVPFQLDPTVIITSNGSL 278 (543) T ss_pred HHHHHHHHHHHhhhhhhhhhhhhhhHHHHHHHhhHHHHhhhhhhhhhhhhhhcccccccCcccCchhhhhHHHHHHHhhh Confidence 0000000000000 00000000000 0000 00000000111122223455666777778888876 556677 Q ss_pred hhHHhhcceeecCCCceEEEEEcCCCccceecccccccccccccceeeEeeeeeEEeeehhhHHHHhhHHHHHHHHHHHH Q lcl|NC_021309. 179 LSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTITDEGLRDAPELFNFVQGRL 258 (497) Q Consensus 179 ~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~wv~Eg~~~~~s~~~f~~i~~~~~kla~~~~iS~ell~d~~~l~~~i~~~l 258 (497) ++|+.++++.+++ +.+.+|+.++ .+.++||+||+.+|+++++|++|++.+++++++++||+++|+|++++.+||.+.| T Consensus 279 ~~l~~~~~~~~~~-g~~~~~~~~~-~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~~~~~~~i~~~l 356 (543) T protein:vir:81 279 NDIRRFARQVVAT-GDVWHGVSSA-AVQWSWDAEFEEVSDDSPEFGQPEIPVKKAQGFVPISIEALQDEANVTETVALLF 356 (543) T ss_pred chhhhhcccccCC-cceEEEEecC-CcceeecccCccccccccccceeeeeeeeeEeeehhhHHHHhccHHHHHHHHHHH Confidence 8899999887664 5688999876 5789999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhhhhcccCcc-ccccccccccccccccccchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhh Q lcl|NC_021309. 259 LEGIQRKEEVQLLAGGGYP-GVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGA 337 (497) Q Consensus 259 a~~~~~~~d~~~l~G~G~~-~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 337 (497) +++++.++|.+||+|+|++ +|.||++..+..+...... T Consensus 357 ~~~~~~~~d~ail~G~Gt~~~p~Gi~~~~~~~~~~~~~~----------------------------------------- 395 (543) T protein:vir:81 357 AEGKDELEAVTLTTGTGQGNQPTGIVTALAGTAAEIAPV----------------------------------------- 395 (543) T ss_pred HHHHHHHHHHHHhccCCCCcccccchhhccccccccccc----------------------------------------- Confidence 9999999999999999986 7999988754332211110 Q ss_pred hhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHHHHHHHhhhcCceeccCccccccccccccccccc Q lcl|NC_021309. 338 AGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIW 417 (497) Q Consensus 338 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~ 417 (497) ......++++..++..+...++ ...+|+||+.+|..|+++||++|+|||.++..+ .+++|+ T Consensus 396 -----------~~~~~~~~~~~~~~~~l~~~~~-~~~~~v~n~~~~~~l~~lkd~~G~~l~~~~~~g-------~~~~l~ 456 (543) T protein:vir:81 396 -----------TAETFALADVYAVYEQLAARHR-RQGAWLANNLIYNKIRQFDTQGGAGLWTTIGNG-------EPSQLL 456 (543) T ss_pred -----------ccccccHHHHHHHHHhhhcccc-CCcEEEEcHHHHHHHHHhhcCCCceeccCcCCC-------CCcccc Confidence 0111223555666666665554 345799999999999999999999999875432 245899 Q ss_pred ccceEecCCCCcCc----------eEEEeeccceEEEEeecccEEEeecccc--hhhhcCceEEEEEEeecceeecccce Q lcl|NC_021309. 418 GVPVVTTPLIPLGT----------ILVGHFAPSVIQTARREGVTMQMTNSNG--TDFVDGKVTVRAEERLGLLVYRPSAF 485 (497) Q Consensus 418 G~Pvv~~~~~~~~~----------~~~gd~~~~~~~i~~r~~~~i~~~~~~~--~~f~~~~v~~r~~~r~~~~v~~~~a~ 485 (497) |+||+++++||.+. ++||||+. |.++++.+++|.++++.. ++|.+|+++||++.|+||.|.+|+|| T Consensus 457 G~pv~~~~~~~~~~~~~~~~~~~~i~~gd~~~--~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~r~d~~v~~~~A~ 534 (543) T protein:vir:81 457 GRPVGEAEAMDANWNTSASADNFVLLYGNFQN--YVIADRIGMTVEFIPHLFGTNRRPNGSRGWFAYYRMGADVVNPNAF 534 (543) T ss_pred ceeeEEeccccccccccccCCcceEEEeeccc--eeEEeecccEEEEeccccccchhhcCceEEEEEEeeccEeecccce Confidence 99999999998653 78999985 778999999999988743 46789999999999999999999999 Q ss_pred EEEEeeCCC Q lcl|NC_021309. 486 QLIQLKKGA 494 (497) Q Consensus 486 ~~l~~~~~a 494 (497) ++|+++++| T Consensus 535 ~~l~~~~~a 543 (543) T protein:vir:81 535 RLLNVETAS 543 (543) T ss_pred EEEEecccC Confidence 999999999 No 26 >protein:vir:4953 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049929;genbank:gi:9632900;genbank:GeneID:1262076 Probab=100.00 E-value=6.7e-62 Score=355.96 Aligned_cols=378 Identities=16% Similarity=0.160 Sum_probs=249.1 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021309. 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) Q Consensus 1 ~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~a~~~~~~~~~~~~ 80 (497) |...-++..+..++.++++.+.+.... ......... ++.+++.++++.++++++.+...+.+. T Consensus 1 Mk~~~el~~~~~~~~~~~~~l~~~~~~----~~~~~~~~~-------------ee~~~~~~~i~~~~~~~e~~~~~~~~~ 63 (397) T protein:vir:49 1 MKTSNELHDLWVAQGDKVENLNEKLNV----AMLDDSVSA-------------EELQAIKNERDTAKMKRDMFKEQYTEA 63 (397) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHH----HHhhhhcCH-------------HHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 665555555444444444333322111 100000000 111222223333333333333222211 Q ss_pred HHHHHHHHHHHHHhhhhhhHHHHhHhhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccccc Q lcl|NC_021309. 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFA 160 (497) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 160 (497) ............ ........ .. ....... . ......... .........++++.+| T Consensus 64 ~~~~~~~~~~~~---------~~~~~~~~---~~-----~~~~~~~--~-----~~~~l~~~~-~~~~~~~~~~t~~~gg 118 (397) T protein:vir:49 64 RANEVANMSEEE---------KKPLTKSE---EE-----VKAGFVK--D-----FKNLVRGRY-QNLLDSKTDASGSDAG 118 (397) T ss_pred HHHhhhcccccc---------ccccccch---hH-----HHHHHHH--H-----HHHHHhcch-hHHHHHhhccccccCc Confidence 111100000000 00000000 00 0000000 0 000000000 0011122233444556 Q ss_pred cccchhhhHHHHHHHHhhhhHHhhcceeecCCC--ceEEEEEcCCCccceeccccccccc-ccccceeeEeeeeeEEeee Q lcl|NC_021309. 161 PGILPTFLPGIVEQLFYELSLADLISSRPVTSP--NLSYLTESAAHNNAAAVAEAGTYPF-SSEEFARVYEQVGKVANAL 237 (497) Q Consensus 161 ~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~--~~~~p~~~~~~~~a~wv~Eg~~~~~-s~~~f~~i~~~~~kla~~~ 237 (497) .+||+++..+|++.+++.++|+++|++++++++ ++.||+.....+.+.|++|++.+|+ ++++|++|+++++++++++ T Consensus 119 ~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~i~~~~~k~~~~~ 198 (397) T protein:vir:49 119 LTIPQDIQTAIHTLVSQYDSLQEYVNVENVTTLTGSRVYEKWTDITGLANIDDEAGKIADVDDPKLSLIKYTIKRYAGIS 198 (397) T ss_pred ccccHhHHHHHHHHHHhhhhHHhhhceeecccCccceEEEeeccCCcceeeecCccccccccccceeeEEeeeeeEEeee Confidence 667777889999999999999999999998764 4567777666678999999999996 6899999999999999999 Q ss_pred hhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccchhhhhhhHHHHHHhhhhhcc Q lcl|NC_021309. 238 TITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTN 316 (497) Q Consensus 238 ~iS~ell~d~-~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 316 (497) +||+|||+|+ +++++||.++|++++++++|.+|++|+|++.+.+... T Consensus 199 ~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~ai~~G~g~~~~~~~~~-------------------------------- 246 (397) T protein:vir:49 199 TVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILEAIAALPTKPTLT-------------------------------- 246 (397) T ss_pred hhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccc-------------------------------- Confidence 9999999997 6899999999999999999999999999876432110 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHHHHHHHhhhcCce Q lcl|NC_021309. 317 GAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQY 396 (497) Q Consensus 317 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~ 396 (497) .++++..++..+...++ .+.+|+||+.+|..|+++||++||| T Consensus 247 -------------------------------------~~d~i~~~~~~l~~~~~-~~a~~vmn~~~~~~l~~lkd~~G~~ 288 (397) T protein:vir:49 247 -------------------------------------KWDDIIDLEAKVDPAIK-QTSFFLTNTSGFTALKKVKNALGDY 288 (397) T ss_pred -------------------------------------cHHHHHHHHHhhhhhhc-CCCEEEEcHHHHHHHHHhhcCCCce Confidence 02445556666666654 4568999999999999999999999 Q ss_pred eccCcccccccccccccccccccceEecC--CCCcC-----ceEEEeeccceEEEEeecccEEEeecccchhhhcCceEE Q lcl|NC_021309. 397 MGGNFFGNAYGNPVNGGKNIWGVPVVTTP--LIPLG-----TILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTV 469 (497) Q Consensus 397 i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~--~~~~~-----~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~ 469 (497) ||+++... ....+|+|+||++++ .+|.+ .++||||++ +|.+++|.+++++++++.+++|.+|++.| T Consensus 289 l~~~~~~~------~~~~~l~G~PV~~~~~~~~~~~~~~~~~i~~gd~~~-~~~~~~~~~~~i~~~~~~~~~~~~~~~~~ 361 (397) T protein:vir:49 289 LMERDVKS------PTGYSIDGFAVKEVADRWLANGTGGAMPLYFGDLKQ-AVTLFDRQHMSLLSTNIGGGAFETDTTKV 361 (397) T ss_pred eeccCcCC------CCCceecceeeEEecccccccccCCceeEEEeeccc-eEEEEeecceEEEEeccccchhhcCceeE Confidence 99876543 234689999998754 34443 389999998 58899999999999999888999999999 Q ss_pred EEEEeecceeecccceEEEEeeCCCCCC Q lcl|NC_021309. 470 RAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 470 r~~~r~~~~v~~~~a~~~l~~~~~a~~~ 497 (497) |++.|+|+.+++|+||++++++++++.. T Consensus 362 r~~~r~d~~~~~~~a~~~~~~~~~~~~~ 389 (397) T protein:vir:49 362 RVIDRFDVVATDTEAFVPASFKAIADQK 389 (397) T ss_pred EEEeeeCcEEecccceEEEEeecccCCC Confidence 9999999999999999999999988877 No 27 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=100.00 E-value=2.9e-61 Score=352.44 Aligned_cols=392 Identities=14% Similarity=0.102 Sum_probs=257.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021309. 5 AQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEVEVRN 84 (497) Q Consensus 5 a~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~a~~~~~~~~~~~~~~~~ 84 (497) +.++++++++.++++++++...+..+++++.+++ .+.. +...+..++.+++++++.+++.++.++...+... T Consensus 1 ~~l~e~i~e~~~~l~el~~~~~~~~~e~r~~~e~----~~~~----~~~~~~~e~~~~~~~l~~ei~~l~e~~~~~~~~~ 72 (400) T protein:vir:38 1 MTLDEKLAAVKKQLDEKRSALPAMKTELRSLLEG----EDSE----ENLKKAEGVRAKYDKAGKEIKDLEEKRDLYEAAL 72 (400) T ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----hccc----hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 7788888888887777766554444444433322 1111 1112334555566666666666655554433222 Q ss_pred HHHHHHHHHhhhhhhHHHHhHhhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccc-ccccccccc Q lcl|NC_021309. 85 LKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFG-STGTFAPGI 163 (497) Q Consensus 85 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~g~~v 163 (497) ...... .... ........... ..................... .....................+ .++.+|.+| T Consensus 73 ~~~~~~-~~~~-~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~v 146 (400) T protein:vir:38 73 KGNEQS-SGKK-PDHPEEHSYRD--ALNAYLHTRGRNTDGVNFEKT--DVGTFAVLRAVPTDASDAVNAGVKAADAASTI 146 (400) T ss_pred HHHhhc-cccc-ccchhhhhHHH--HHHHHHhhHHHHHHHHHHHHH--HHHHHhhhhhhhHHHHHHHhhcccccCCcccc Confidence 111100 0000 00000000000 000000000000000000000 0000000000011111122222 334456667 Q ss_pred chhhhHHHHHHHHhhhhHHhhcceeecCCCceEEEEEcCCCccceeccccccccc-ccccceeeEeeeeeEEeeehhhHH Q lcl|NC_021309. 164 LPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPF-SSEEFARVYEQVGKVANALTITDE 242 (497) Q Consensus 164 ~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~wv~Eg~~~~~-s~~~f~~i~~~~~kla~~~~iS~e 242 (497) |+++...|++.++..++|+++|+++++++++++||+.+..++.+.|++|++..|+ ++++|++|++.+++++++++||+| T Consensus 147 P~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~i~~~~~k~~~~~~is~e 226 (400) T protein:vir:38 147 PETISNTPQRELQTVVDLKPFTNVFQASTQKGTYPTVANATTKMVTVAELEKNPAMAKPEFKPVNWSVETYRQALPVSQE 226 (400) T ss_pred cHHHHHHHHHHHHhhhhhhhcceeEeccCcceEEEEEecCCCccccccccccccccccccceeeEeehhheeeehhhHHH Confidence 7778899999999999999999999999999999998876778999999999986 689999999999999999999999 Q ss_pred HHhhH-HHHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccchhhhhhhHHHHHHhhhhhcchhhhh Q lcl|NC_021309. 243 GLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVG 321 (497) Q Consensus 243 ll~d~-~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 321 (497) ||+|+ +++++||.++|++++..++|.+|++|+|++.+.|+.+. T Consensus 227 ll~ds~~~~~~~i~~~l~~~~~~~~~~~i~~~~~~~~~~~~~~~------------------------------------ 270 (400) T protein:vir:38 227 SIDDSAIDLVGLIAQNGQQIKVNTTNGAVATLLKGFTAKTISSV------------------------------------ 270 (400) T ss_pred HHhhhHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccccH------------------------------------ Confidence 99997 68999999999999999999999999998765544221 Q ss_pred hhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHHHHHHHhhhcCceeccCc Q lcl|NC_021309. 322 QDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGNF 401 (497) Q Consensus 322 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~i~~~~ 401 (497) +++.+.+....... . ..+|+||+.+|..|+++||++|+|||+++ T Consensus 271 ----------------------------------~~~~~~~~~~~~~~-~-~a~~v~~~~~~~~l~~lkd~~G~~i~~~~ 314 (400) T protein:vir:38 271 ----------------------------------DDLKHINNVDLDPA-Y-SRVIIASQSFYNFLDTVKDGNGRYLLQDS 314 (400) T ss_pred ----------------------------------HHHHHHHHhhhhhh-h-CcEEEEcHHHHHHHHHhhccCCCeeeecC Confidence 01111111111111 1 35799999999999999999999999876 Q ss_pred ccccccccccccccccccceEecCCCCcCc-----eEEEeeccceEEEEeecccEEEeecccchhhhcCceEEEEEEeec Q lcl|NC_021309. 402 FGNAYGNPVNGGKNIWGVPVVTTPLIPLGT-----ILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLG 476 (497) Q Consensus 402 ~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~-----~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~ 476 (497) ..+. .+++|+|+||++++.+|.+. ++||||++ +|.+++|.+++++++++.. +...||+.+|+| T Consensus 315 ~~~~------~~~~l~G~pv~~~~~~~~~~~g~~~~~~gd~s~-~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~r~d 382 (400) T protein:vir:38 315 ILTP------SGKSVLGMPIAVVSDDTLGAAGEAHAFLGDIKR-AILFANRADFMVRWVDDQI-----YGQFLQAGMRFG 382 (400) T ss_pred cCCC------CccccccceeEEecccccCCCCceEEEEEeccc-cEEEEeecceEEEEecccc-----cceeEEEEEEec Confidence 5432 34689999999999988543 79999999 4889999999999987543 346899999999 Q ss_pred ceeecccceEEEEeeCCC Q lcl|NC_021309. 477 LLVYRPSAFQLIQLKKGA 494 (497) Q Consensus 477 ~~v~~~~a~~~l~~~~~a 494 (497) +.|++|+||++|+++++| T Consensus 383 ~~~~~~~a~~~l~~~~~a 400 (400) T protein:vir:38 383 VSVADEKAGYFLTYTPKA 400 (400) T ss_pred cEEecccceEEEEeecCC Confidence 999999999999999999 No 28 >protein:vir:1268 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690760;genbank:gi:22855000;genbank:GeneID:955203 Probab=100.00 E-value=1.4e-61 Score=354.17 Aligned_cols=385 Identities=16% Similarity=0.119 Sum_probs=239.2 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021309. 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEK-KEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPE 79 (497) Q Consensus 1 ~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~e~-~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~a~~~~~~~~~~~ 79 (497) ||-.+..+ ++++.++++++.++..+.+.+. .+..+++.++++.+.++++..++..+.. .+........+.. T Consensus 1 ~~~~m~k~--l~el~~~~~~~~~~~~~~~~~~~~ee~~~~~~e~~~l~~~i~~~~~~~~~~------~~~~~~~~~~~~~ 72 (397) T protein:vir:12 1 MPMQMSKK--EIALRQQFTEKKQQADKALQEGNTDEARALLDEVKQLKNQIELMTEGRSLD------VPDLPGGVNFVPE 72 (397) T ss_pred CCCcHHHH--HHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHH------HHHHHHHhhhhhh Confidence 77544332 3334444433333222211111 1122222222222222222111110000 0000000000000 Q ss_pred HHHHHHHHHHHHHHhhhhhhHHHHhHhhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhcccccccc Q lcl|NC_021309. 80 VEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTF 159 (497) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (497) ... ........ ..............+. .......... ................++++.+ T Consensus 73 ~~~-------~~~~~~~~--~~~~~~~~~~~~~a~~-------~~~~~~~~~~-----~~~~~~~~~~~~a~~~~~~~~g 131 (397) T protein:vir:12 73 QER-------NPEGQRSQ--GQGNEERQQQYSKAFL-------KGLRGKRLTD-----EERDLLDSPEFRAMSGINDEDG 131 (397) T ss_pred hhh-------hhcccccc--cchhhHHHHHHHHHHH-------HHHhccCCcH-----HHHHHHhhhhhhhccccccccC Confidence 000 00000000 0000000000000000 0000000000 0000111111222333344556 Q ss_pred ccccchhhhHHHHHHHHhhhhHHhhcceeecCCC--ceEEEEEcCCCccceeccccccccc-ccccceeeEeeeeeEEee Q lcl|NC_021309. 160 APGILPTFLPGIVEQLFYELSLADLISSRPVTSP--NLSYLTESAAHNNAAAVAEAGTYPF-SSEEFARVYEQVGKVANA 236 (497) Q Consensus 160 g~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~--~~~~p~~~~~~~~a~wv~Eg~~~~~-s~~~f~~i~~~~~kla~~ 236 (497) |.+||+++...||+.+++.++|+++|++++++++ .+.+|+.++ .+.++|++|++.+|+ +.++|++|++.++|++++ T Consensus 132 g~lvP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~-~~~a~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~~ 210 (397) T protein:vir:12 132 GILIPEDIGRQIHEFKRQFEPLEQYVTVEPVTTRSGTRLLEKNAD-MVPFSPVEELGNLPEIDQPRFTKVSYSIIDYGGI 210 (397) T ss_pred cccCchhHHHHHHHhhhhhhhHHhhcceeeccCCceeEEEEEecC-CcceeeecccccccccccccceeEEeeheeeEee Confidence 6667777889999999999999999999999865 455666665 457999999999997 579999999999999999 Q ss_pred ehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccchhhhhhhHHHHHHhhhhhc Q lcl|NC_021309. 237 LTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGT 315 (497) Q Consensus 237 ~~iS~ell~d~-~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 315 (497) ++||+|+++|+ ++|++||.+.|++++++++|.+|++|+|+++|.|+.+.. T Consensus 211 ~~is~e~l~ds~~~l~~~i~~~l~~~~~~~~d~~il~G~g~~~~~g~~~~~----------------------------- 261 (397) T protein:vir:12 211 MTLSNSMLNDSDQAIMTYVAKWFAKKSVVTRNNLILAAIASLKKVDIDGLD----------------------------- 261 (397) T ss_pred ehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccHH----------------------------- Confidence 99999999997 589999999999999999999999999999988875421 Q ss_pred chhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHH-HhhhhhhccCCceEEechhHHHHHHHHhhhcC Q lcl|NC_021309. 316 NGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAF-VDIQLTLFQTPNAVVMNPRDWELLRLTKDANG 394 (497) Q Consensus 316 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G 394 (497) ++..++ ..+... +..+.+|+||+.+|..|+++||++| T Consensus 262 -----------------------------------------~i~~~~~~~l~~~-~~~~a~~~~n~~~~~~L~~lkd~~G 299 (397) T protein:vir:12 262 -----------------------------------------GIKKALNVTLDPM-VAPGSIVLTNQDGYDWLDTLKDGTG 299 (397) T ss_pred -----------------------------------------HHHHHHhhccchh-hhCCCEEEEcHHHHHHHHHhhccCC Confidence 111111 122222 3345679999999999999999999 Q ss_pred ceeccCcccccccccccccccccccceEecCC-CCc-----CceEEEeeccceEEEEeecccEEEeecccchhhhcCceE Q lcl|NC_021309. 395 QYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPL-IPL-----GTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVT 468 (497) Q Consensus 395 ~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~-~~~-----~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~ 468 (497) +|+|++....+ .+.+|||+||++++. +|. ..++||||++ +|.+++|.+++|+++++...+|++|++. T Consensus 300 ~~l~~~~~~~g------~~~~l~G~pv~~~~~~~~~~~~~~~~~~~gd~~~-~~~~~~~~~~~i~~~~~~~~~f~~~~~~ 372 (397) T protein:vir:12 300 RYLLQPDPTNP------TKKLLDGRPVVPFTNRVLKTQKGKAPLIIGNLKE-AIVLFDREQQSIASTDTGAGAFETNSTK 372 (397) T ss_pred ceeecccccCC------CCccccceeeEEecccccccCCCccEEEEEehhc-eEEEEeecceEEEEeccccchhhcCceE Confidence 99998765332 346899999987665 342 2289999998 5778999999999999988899999999 Q ss_pred EEEEEeecceeecccceEEEEeeCC Q lcl|NC_021309. 469 VRAEERLGLLVYRPSAFQLIQLKKG 493 (497) Q Consensus 469 ~r~~~r~~~~v~~~~a~~~l~~~~~ 493 (497) ||+++|+|+.+++|+||+++++++- T Consensus 373 ~r~~~r~d~~~~~~~a~~~~~~t~~ 397 (397) T protein:vir:12 373 VRGIEREDVRKWDEDAVVFGQITVE 397 (397) T ss_pred EEEEEeeccEEecccceEEEEEeeC Confidence 9999999999999999999999988 No 29 >protein:vir:81160 Length: 371 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285811;genbank:gi:148747732;genbank:GeneID:5247203 Probab=100.00 E-value=1.1e-61 Score=354.88 Aligned_cols=355 Identities=18% Similarity=0.209 Sum_probs=236.7 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021309. 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) Q Consensus 1 ~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~a~~~~~~~~~~~~ 80 (497) |. ++++++.++.....+|++....+ ++ .+++++++++++.++.++... T Consensus 1 M~-------------k~l~~l~e~~~~~~~e~~~~~~~---------------~~----~e~~~~~~~ei~~l~~~i~~~ 48 (371) T protein:vir:81 1 MP-------------KELRELLEQINNKKEEARKLLAE---------------NK----IEEAKKLKEEIVALQEKFDVA 48 (371) T ss_pred Cc-------------HHHHHHHHHHHHHHHHHHHHhhH---------------HH----HHHHHHHHHHHHHHHHHHHHH Confidence 33 22222222211111111111100 00 011222233333333333222 Q ss_pred HHHHHHHHHHHHHhhhhhhHHHHhHhhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccccc Q lcl|NC_021309. 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFA 160 (497) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 160 (497) +....... +........... . ... .+.+..+....+. ........++++.+| T Consensus 49 ~~~~~~~~-~~~~~~~~~~~~-------------~-~~~------------~~~~~~~~~~l~~-~~~~a~~~~t~~~gg 100 (371) T protein:vir:81 49 KELYEEQK-QTIEDKEPLKPT-------------V-QVK------------ENEVEAFVNHIRT-RFRNAMSEGSNQDGG 100 (371) T ss_pred HHHHHHHH-Hhhccccccccc-------------h-hhH------------HHHHHHHHHHHHH-HHHHhhccCCCccCc Confidence 11110000 000000000000 0 000 0001111111110 011223334555667 Q ss_pred cccchhhhHHHHHHHHhhhhHHhhcceeecCCCceEEEEEcC-CCccceeccccccccc-ccccceeeEeeeeeEEeeeh Q lcl|NC_021309. 161 PGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESA-AHNNAAAVAEAGTYPF-SSEEFARVYEQVGKVANALT 238 (497) Q Consensus 161 ~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~-~~~~a~wv~Eg~~~~~-s~~~f~~i~~~~~kla~~~~ 238 (497) .+||+++..+|++.+++.++|+++++++++++++..++.... ..+.++|++||+.+|+ ++++|++|+++++|++++++ T Consensus 101 ~~vP~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~i~~~~~k~~~~~~ 180 (371) T protein:vir:81 101 YTVPQDIQTRINELRESKDALQNLITVEPVTTLSGSRVFKKRSQQTGFVEVAEGAAIGEKATPQFTLLQYQVKKYAGFFR 180 (371) T ss_pred eeecHhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCcceeeeccccccccccccceeeEEeeeeEEEEeeh Confidence 777888899999999999999999999999887666544332 2357899999999986 67999999999999999999 Q ss_pred hhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccchhhhhhhHHHHHHhhhhhcch Q lcl|NC_021309. 239 ITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNG 317 (497) Q Consensus 239 iS~ell~d~-~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 317 (497) ||+|+|+|+ ++|++||.+.|++++++++|.+|++|+|++.|.|+.+... T Consensus 181 iS~ell~ds~~~l~~~i~~~l~~a~~~~~~~~i~~g~g~~~~~~~~~~~~------------------------------ 230 (371) T protein:vir:81 181 VTNELLNDSTEAIVNTLVRWIGDESRVTRNGLIINVLNTKAKTAIADLDG------------------------------ 230 (371) T ss_pred hhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccHHH------------------------------ Confidence 999999997 6899999999999999999999999999988877643211 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHH-HhhhhhhccCCceEEechhHHHHHHHHhhhcCce Q lcl|NC_021309. 318 AFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAF-VDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQY 396 (497) Q Consensus 318 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~ 396 (497) +...+ ..+... +..+.+|+||+.+|..|+++||++|+| T Consensus 231 ----------------------------------------i~~~~~~~l~~~-~~~~a~~vmn~~~~~~L~~lkd~~g~~ 269 (371) T protein:vir:81 231 ----------------------------------------LKQIINVQLDPV-FRSTSSVIVNQDAFNWLDTLKDQNGQY 269 (371) T ss_pred ----------------------------------------HHHHHHhhcchh-hhcCCEEEEcHHHHHHHHHhhccCCCe Confidence 11111 112222 234567999999999999999999999 Q ss_pred eccCcccccccccccccccccccceEecCCCCcC------------ceEEEeeccceEEEEeecccEEEeecccchhhhc Q lcl|NC_021309. 397 MGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLG------------TILVGHFAPSVIQTARREGVTMQMTNSNGTDFVD 464 (497) Q Consensus 397 i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~------------~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~ 464 (497) ||++..... .+++|+|+||++++.+|.+ .++||||++ +|.+++|.+++|+++++..++|++ T Consensus 270 l~~~~~~~~------~~~~l~G~pV~~~~~~~~~~~~~~~~~~~~~~i~~Gd~~~-~~~~~~~~~~~i~~~~~~~~~f~~ 342 (371) T protein:vir:81 270 LLQPSISSP------TGRQLLGLPVVIVSNKVLANRVDGGTGAQFAPIIVGDLKE-AVVMFDRQRTEIMSSNVAMDAFET 342 (371) T ss_pred eeecccCCC------CCceecceeEEEecccccCccccccccCCcceEEEEehhc-eEEEEeecceEEEEeccccchhhc Confidence 998765432 3568999999999998743 379999998 588999999999999998889999 Q ss_pred CceEEEEEEeecceeecccceEEEEeeCC Q lcl|NC_021309. 465 GKVTVRAEERLGLLVYRPSAFQLIQLKKG 493 (497) Q Consensus 465 ~~v~~r~~~r~~~~v~~~~a~~~l~~~~~ 493 (497) |++.||++.|+|+.+++|+||++++++++ T Consensus 343 ~~v~~~~~~r~d~~~~~~~a~~~~~~~~A 371 (371) T protein:vir:81 343 DATLWRAIERMDVKMRDDEAFVFGEVQLA 371 (371) T ss_pred CceEEEEEEeeccEEecccceEEEEEecC Confidence 99999999999999999999999999998 No 30 >protein:vir:6212 Length: 434 # NCBI annotation: prohead protease # Family: family:all:21 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852592;genbank:gi:31415852;genbank:GeneID:1489210 Probab=100.00 E-value=2e-61 Score=353.41 Aligned_cols=415 Identities=15% Similarity=0.090 Sum_probs=243.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021309. 5 AQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEVEVRN 84 (497) Q Consensus 5 a~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~a~~~~~~~~~~~~~~~~ 84 (497) +++++.++.+.+..++..+.....+. ...-..++++..+ ++.+++.++++.+.++++.++.......... T Consensus 1 M~l~el~~~~~~~~~~~~a~l~~~~~----~~~~~~ee~~~~~------~e~~~l~~~~~~l~~~i~~le~~~~~~~~~~ 70 (434) T protein:vir:62 1 MNLKEILNASLTRTKSRLAELQGKVE----KNEVRSEELAAVK------AEVEQLTKEIQTISEELAKLEEKEKEEDPAK 70 (434) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHh----ccCccHHHHHHHH------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 44554444444333333222111110 0000001111101 1112222333333333333222211111000 Q ss_pred HHHHHHHH--HhhhhhhHHHHhHhhhhhhhhhhhhHHHHHH-hhhHHHHHHHHHHHHHhhhhhh--hhhhhhcccccccc Q lcl|NC_021309. 85 LKQIRKHL--ARAVIMNPELKNATSFEKGTKFDVSFNVSAK-AADPGTAAAELMGAFADGETAP--AAIGQNPFGSTGTF 159 (497) Q Consensus 85 ~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~ 159 (497) ........ ................+.............. .........+.+..+....... .........+++.+ T Consensus 71 ~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~e~r~a~~~~l~~~~~~~e~~a~~~~t~~G 150 (434) T protein:vir:62 71 KKDDDPEKKEDPTAKENPNEKTELSEEQRSAISASIAAALSTKGHRTNKETEIRSVFANYIVGNIDEKEARALGLVTGNG 150 (434) T ss_pred hhcchhhhhcchhhhcchhhhHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHHHHHHhccccchhhhhhhccccccc Confidence 00000000 0000000000000000000000000000000 0000011112222222111110 11112222344556 Q ss_pred ccccchhhhHHHHHHHHhhhhHHhhcceeecCCCceEEEEEcCCCccceec---ccccccccccccceeeEeeeeeEEee Q lcl|NC_021309. 160 APGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAV---AEAGTYPFSSEEFARVYEQVGKVANA 236 (497) Q Consensus 160 g~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~wv---~Eg~~~~~s~~~f~~i~~~~~kla~~ 236 (497) |.+||+++..+|++.+++.++|+++|++++++++ ++||+.+.. +.+.|+ +|++..|.++++|++|++.+||++++ T Consensus 151 G~lvP~~~~~~Ii~~l~~~~~i~~~~~~~~~~~~-~~~p~~~~~-~~a~~~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~ 228 (434) T protein:vir:62 151 SVTIPDFLSKEIITYAQEENFLRRLGTGVKTKEN-IKYPVLVKK-AEAQGHKNERTNNEMPETDIEFDEIELSPTEFDAL 228 (434) T ss_pred ceecchhhHHHHHHhhhhhhhhhhhcceeccCCc-eEEEEEecC-CcccceecccccccccccccceeeEEeeheeeEee Confidence 6667777888999999999999999999988754 789988753 345554 66888999999999999999999999 Q ss_pred ehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhcccCccc-cccccccccccccccccchhhhhhhHHHHHHhhhhh Q lcl|NC_021309. 237 LTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPG-VNGLLQRSTGFTASSASSLFGATSATVSNVKFPADG 314 (497) Q Consensus 237 ~~iS~ell~d~-~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~-p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 314 (497) ++||+|||+|+ ++|++||.++|+++++.++|.+||+|+|+++ |.|+++..+..... T Consensus 229 ~~iS~ell~ds~~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~~~~g~~~~~~~~~~~---------------------- 286 (434) T protein:vir:62 229 ATVTKKLLARTGLPIEQIVMDELKKAYVRKETQYMVNGDEANNINDGALAKKAVEFKT---------------------- 286 (434) T ss_pred hhhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccceeecccccccc---------------------- Confidence 99999999998 5899999999999999999999999999887 55665543321100 Q ss_pred cchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHHHHHHHhhhcC Q lcl|NC_021309. 315 TNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANG 394 (497) Q Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G 394 (497) .....++++.+.+..+...++. +.+|+||+.+|..|+++||++| T Consensus 287 -----------------------------------~~~~~~d~l~~l~~~l~~~~~~-~a~~v~n~~~~~~L~~lkd~~G 330 (434) T protein:vir:62 287 -----------------------------------DEKNLYDALVKMKNTPVKEVRK-KARWVLNTAALTKIETMKTDDG 330 (434) T ss_pred -----------------------------------cccchhhHHHHHHhhcchhhhc-CCEEEEcHHHHHHHHHhhccCC Confidence 1112245666777777776654 4479999999999999999999 Q ss_pred ceeccCcccccccccccccccccccceEecCCCCcCc------eEEEeeccceEEEEeecc-cEEEeecccchhhhcCce Q lcl|NC_021309. 395 QYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGT------ILVGHFAPSVIQTARREG-VTMQMTNSNGTDFVDGKV 467 (497) Q Consensus 395 ~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~------~~~gd~~~~~~~i~~r~~-~~i~~~~~~~~~f~~~~v 467 (497) ||||++......| .+++|+|+||++++.+|.+. ++||||++ |.|++|.+ ++|+++.+. +|.+|+| T Consensus 331 ~~l~~~~~~~~~g----~~~tl~G~pV~~~~~~~~~~~~~~~~i~~Gdfs~--~~i~~~~g~~~i~~~~~~--~~~~~~v 402 (434) T protein:vir:62 331 FPLLRPFNQAEGG----IGYTLLGFPVEEEDAIDIPDSPDTPVFYFGDFSK--FYIQDVIGSLEVQKLVEL--FSRTNRV 402 (434) T ss_pred CEeeccCCCccCC----CCceecceeeEEecCccCccCCCceEEEEeeccc--eEEEEeeceeEEEeehhh--hcccCce Confidence 9999875433222 34689999999999998644 78999997 45777764 678877654 5889999 Q ss_pred EEEEEEeecceeec-ccceEEEEee-CCCCCC Q lcl|NC_021309. 468 TVRAEERLGLLVYR-PSAFQLIQLK-KGATGS 497 (497) Q Consensus 468 ~~r~~~r~~~~v~~-~~a~~~l~~~-~~a~~~ 497 (497) +||++.|+|+++++ |.++..+++. ++|+|+ T Consensus 403 ~~~~~~r~Dgk~i~~~~~~~~~~~~~~~~~~~ 434 (434) T protein:vir:62 403 GFRIWNLLDAQLIHSPFEVPVYKYVLKAPTGA 434 (434) T ss_pred EEEEEeeecceeecCcccceEEEEEeccCCCC Confidence 99999999999886 8887777666 555555 No 31 >protein:vir:1025 Length: 408 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076679;genbank:gi:13095788;genbank:GeneID:920362 Probab=100.00 E-value=2.4e-61 Score=352.91 Aligned_cols=385 Identities=13% Similarity=0.123 Sum_probs=246.4 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021309. 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) Q Consensus 1 ~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~a~~~~~~~~~~~~ 80 (497) |.-.+.+++... +++++.. +++++++.+.+..+. .... .++..++.++++.+.++++.+..++.+. T Consensus 1 m~~~m~l~el~~----~~~~~~~----~~~~~~~~~~~~~~~---~~~~---~ee~~~~~~~~~~~~~~~~~~~~~~~~~ 66 (408) T protein:vir:10 1 MGVKLTVNQLNE----AWIASGD----KVTDFNDQINMALND---DNFS---AEAMSELKNKRDNEKVRRDALREQLVEA 66 (408) T ss_pred CCccccHHHHHH----HHHHHHH----HHHHHHHHHHHHhhc---cccc---HHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 666665543221 1111111 111222221111110 0001 1112233333444444444444333322 Q ss_pred HHHHHHHHHHHHHhhhhhhHHHHhHhhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccccc Q lcl|NC_021309. 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFA 160 (497) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 160 (497) +.......... ....... ....... .....+ ....+.. ..............++.+.+| T Consensus 67 ~~~~~~~~~~~----~~~~~~~------~~~~~~~-----~~~~~~----~~~~~~~--~~~~~~~~~~a~~~~t~~~gg 125 (408) T protein:vir:10 67 QAEQVVNMREE----EKGPLNK------SENELKD-----KFVKDF----VNMVRNP--MAFMNTVSSKTETSGSDSAAG 125 (408) T ss_pred HHHHHhccccc----ccccccc------chhhhHH-----HHHHHH----HHHhhcc--hhhhhhhhhhhhhcccccCCc Confidence 22111100000 0000000 0000000 000000 0000000 000111122233444555667 Q ss_pred cccchhhhHHHHHHHHhhhhHHhhcceeecCCCceE--EEEEcCCCccceecccccccccc-cccceeeEeeeeeEEeee Q lcl|NC_021309. 161 PGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLS--YLTESAAHNNAAAVAEAGTYPFS-SEEFARVYEQVGKVANAL 237 (497) Q Consensus 161 ~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~--~p~~~~~~~~a~wv~Eg~~~~~s-~~~f~~i~~~~~kla~~~ 237 (497) .+||+++..+||+.+++.++|+++|+++++++++.. +++..+..+.+.|++|++.+|++ .++|++|++.++++++++ T Consensus 126 ~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~i~~~~~k~~~~~ 205 (408) T protein:vir:10 126 LTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGII 205 (408) T ss_pred eeccHhHHHHHHHHHHhhchhhhhcceeeccCCcceEEEeeccccccceeeecCccccccccCcceeeEEeeeeeEEeee Confidence 777778889999999999999999999999876554 45555555678999999999975 699999999999999999 Q ss_pred hhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccchhhhhhhHHHHHHhhhhhcc Q lcl|NC_021309. 238 TITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTN 316 (497) Q Consensus 238 ~iS~ell~d~-~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 316 (497) +||+|||+|+ ++|++||.++|+++++.++|.+|++|+|++.+.+-.. T Consensus 206 ~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~~~~~~~~~-------------------------------- 253 (408) T protein:vir:10 206 TATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVMKAAPKKPTIA-------------------------------- 253 (408) T ss_pred hhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccc-------------------------------- Confidence 9999999997 5899999999999999999999999999875431100 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHH-HhhhhhhccCCceEEechhHHHHHHHHhhhcCc Q lcl|NC_021309. 317 GAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAF-VDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQ 395 (497) Q Consensus 317 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~ 395 (497) .++++..++ ..+...++ .+..|+||+.+|..|+++||++|+ T Consensus 254 -------------------------------------~~~~l~~~~~~~~~~~~~-~~a~~v~n~~~~~~l~~lkd~~G~ 295 (408) T protein:vir:10 254 -------------------------------------KFDDVITMINTAVDPAII-ATSSLLTNQSGLNKLALVKTAEGK 295 (408) T ss_pred -------------------------------------cHHHHHHHHHHhhhhhhc-cCCEEEEcHHHHHHHHHhhccCCc Confidence 012333333 23333333 445799999999999999999999 Q ss_pred eeccCcccccccccccccccccccceEecC--CCCcCc-----eEEEeeccceEEEEeecccEEEeecccchhhhcCceE Q lcl|NC_021309. 396 YMGGNFFGNAYGNPVNGGKNIWGVPVVTTP--LIPLGT-----ILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVT 468 (497) Q Consensus 396 ~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~--~~~~~~-----~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~ 468 (497) |||+++.... .+.+|+|+||++++ .+|... ++||||++ +|.+++|.+++|+++++.+..|.+|++. T Consensus 296 ~i~~~~~~~~------~~~~l~G~PV~~~~~~~~~~~~~~~~~i~~gd~~~-~~~~~~~~~~~v~~~~~~~~~f~~~~~~ 368 (408) T protein:vir:10 296 YLLEPDPTKP------NSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQ-AITLFDRENMSLLPTNIGAGAFETDTTK 368 (408) T ss_pred eEeccCcCCC------CCceecceeeEEecccccCccCCCceEEEEEehhc-cEEEEEecceEEEEcccccchhhcCceE Confidence 9998764432 34689999999855 466533 79999998 5889999999999999988899999999 Q ss_pred EEEEEeecceeecccceEEEEeeCCCCCC Q lcl|NC_021309. 469 VRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 469 ~r~~~r~~~~v~~~~a~~~l~~~~~a~~~ 497 (497) ||++.|+|+.|++|+||+++++++++..+ T Consensus 369 ~r~~~r~d~~v~~~~a~~~~~~~~~~~~~ 397 (408) T protein:vir:10 369 IRVIDRFDVKATDSEALVAGSFSAIADQV 397 (408) T ss_pred EEEEEeeccEEeccccEEEEEeeccccCC Confidence 99999999999999999999999987766 No 32 >protein:vir:3991 Length: 404 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116499;genbank:gi:14251132;genbank:GeneID:921252 Probab=100.00 E-value=3.9e-61 Score=351.75 Aligned_cols=386 Identities=14% Similarity=0.107 Sum_probs=249.9 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021309. 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) Q Consensus 1 ~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~a~~~~~~~~~~~~ 80 (497) |+-.+.+++ +.+++.++.+... ++.+++.......+ . ..+...++.+++++++.+.+.+..++.+. T Consensus 1 ~~~~m~l~e----l~~~~~~~~~~~~----~~~~~~~~~~~~~~---~---~~ee~~~~~~~~~~~~~~~~~~~~~~~~~ 66 (404) T protein:vir:39 1 MGVKLTVNQ----LNEAWIASGDKVT----DFNDQINMALNDDN---F---SAEAMSELKNKRDNEKVRRDALREQLVEA 66 (404) T ss_pred CChHHHHHH----HHHHHHHHHHHHH----HHHHHHHHHhcccc---c---cHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 777766653 2222222222211 11111111111100 0 11122344444555555555555444433 Q ss_pred HHHHHHHHHHHHHhhhhhhHHHHhHhhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccccc Q lcl|NC_021309. 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFA 160 (497) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 160 (497) +........... ...... . ..... ......+ ....+.. .. ............++++.+| T Consensus 67 ~~~~~~~~~~~~----~~~~~~---------~-~~~~~-~~~~~~~----~~~~~~~-~~-~~~~~e~~a~~~~t~~~gg 125 (404) T protein:vir:39 67 QAEQVVNMREEE----KGPLNK---------S-EYELK-DKFVKEF----VNMVRNP-MA-FLNTVSSKTETSGSDSAAG 125 (404) T ss_pred HHHHHhcccccc----cccccc---------c-hhhhH-HHHHHHH----HHHHhcc-hh-hhhhhhhhhhhcccccCCc Confidence 322111100000 000000 0 00000 0000000 0000000 00 0011112223334455566 Q ss_pred cccchhhhHHHHHHHHhhhhHHhhcceeecCCCceE--EEEEcCCCccceeccccccccc-ccccceeeEeeeeeEEeee Q lcl|NC_021309. 161 PGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLS--YLTESAAHNNAAAVAEAGTYPF-SSEEFARVYEQVGKVANAL 237 (497) Q Consensus 161 ~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~--~p~~~~~~~~a~wv~Eg~~~~~-s~~~f~~i~~~~~kla~~~ 237 (497) .+||+++..+|++.+++.++|+++|++++++++... +++..+..+.+.|++|++.+|+ ++++|++|+++++++++++ T Consensus 126 ~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~i~~~~~k~~~~~ 205 (404) T protein:vir:39 126 LTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPRLTIIKYLIKRYAGII 205 (404) T ss_pred eeccHHHHHHHHHHHHhhhhHHhhcceeeccCCcceEEEEeecCCccceeeecCccccccccccceeeEEeeeeeEEeee Confidence 677778889999999999999999999999876554 5555555677899999999997 6799999999999999999 Q ss_pred hhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccchhhhhhhHHHHHHhhhhhcc Q lcl|NC_021309. 238 TITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTN 316 (497) Q Consensus 238 ~iS~ell~d~-~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 316 (497) +||+|+++|+ ++|++||.++|+++++.++|.+|++|+|++.+.+.... T Consensus 206 ~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~il~g~g~~~~~~~~~~------------------------------- 254 (404) T protein:vir:39 206 TATNTLLKDTAENILAWLSSWIAKKVVVTRNQAIIAAMGTVPKKPTIAK------------------------------- 254 (404) T ss_pred hhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccc------------------------------- Confidence 9999999987 68999999999999999999999999998765442110 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHHHHHHHhhhcCce Q lcl|NC_021309. 317 GAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQY 396 (497) Q Consensus 317 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~ 396 (497) ++++..++.......+..+.+|+||+.+|..|+++||++||| T Consensus 255 --------------------------------------~~~i~~~~~~~~~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~ 296 (404) T protein:vir:39 255 --------------------------------------FDDVITMINTSVDPAIIATSSLLTNQSGLNKLALVKTAEGKY 296 (404) T ss_pred --------------------------------------HHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCce Confidence 111222222112223334567999999999999999999999 Q ss_pred eccCcccccccccccccccccccceEecCC--CCcC-----ceEEEeeccceEEEEeecccEEEeecccchhhhcCceEE Q lcl|NC_021309. 397 MGGNFFGNAYGNPVNGGKNIWGVPVVTTPL--IPLG-----TILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTV 469 (497) Q Consensus 397 i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~--~~~~-----~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~ 469 (497) ||++..... ...+|+|+||++++. +|.. .+++|||++ +|.+++|.+++++++++..++|.+|++.| T Consensus 297 l~~~~~~~~------~~~~l~G~pV~~~~~~~~~~~~~~~~~~~~gd~~~-~~~~~~~~~~~i~~~~~~~~~~~~~~~~~ 369 (404) T protein:vir:39 297 LLEPDPTKP------NSYLIKGKKVIVVADRWLPNSGSTVYPLYYGDMSQ-AITLFDRENMSLLPTNIGAGAFETDTTKI 369 (404) T ss_pred eeccCcCCC------CcceecceeEEEecccccCccCCCccEEEEEeccc-cEEEEeecceEEEEeccchhhhhhceeeE Confidence 998765432 346899999998654 4432 389999998 57889999999999999888999999999 Q ss_pred EEEEeecceeecccceEEEEeeCCCCCC Q lcl|NC_021309. 470 RAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 470 r~~~r~~~~v~~~~a~~~l~~~~~a~~~ 497 (497) |++.|+|+.+++|+||+++++++++.++ T Consensus 370 r~~~r~d~~~~~~~a~~~~~~~~~a~~~ 397 (404) T protein:vir:39 370 RVIDRFDVKTTDSEALVAGSFTAIADQV 397 (404) T ss_pred EEEeeeccEEecccceEEEEeeccccCC Confidence 9999999999999999999999988766 No 33 >protein:vir:7409 Length: 408 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839926;genbank:gi:30089896;genbank:GeneID:1260683 Probab=100.00 E-value=6.9e-61 Score=350.41 Aligned_cols=385 Identities=14% Similarity=0.117 Sum_probs=250.0 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021309. 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) Q Consensus 1 ~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~a~~~~~~~~~~~~ 80 (497) |+..+.+++.. +...++.+... ++++++....+.. ... .+...++.+++++++++++.++.++.+. T Consensus 1 m~~~m~i~el~-~~~~~~~~~~~-------~~~~e~~~~~~~~---~~~---~e~i~e~~~~~~~~~~~~~~~~~~~~~~ 66 (408) T protein:vir:74 1 MGVKLTVNQLN-EAWIASGDKVT-------DFNDQINMALNDD---NFS---AEAMSELKNKRDNEKVRRDALREQLVEA 66 (408) T ss_pred CChhhhHHHHH-HHHHHHHHHHH-------HHHHHHHHHHhhh---ccc---HHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 88877775322 22222222111 1222211111111 111 1122344444555555555555444333 Q ss_pred HHHHHHHHHHHHHhhhhhhHHHHhHhhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccccc Q lcl|NC_021309. 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFA 160 (497) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 160 (497) +.......... ....... .......... ..+ ....+. . ..............++...+| T Consensus 67 ~~~~~~~~~~~----~~~~~~~------~~~~~~~~~~-----~~~----~~~~~~-~-~~~~~~~~~~a~~~~~~~~gg 125 (408) T protein:vir:74 67 QAEQVVNMREE----EKGPLNK------SENELKDKFV-----KDF----VNMVRN-P-MAFLNTVSSKTETSGSDSAAG 125 (408) T ss_pred HHHHHhhcccc----ccccccc------hhhhhHHHHH-----HHH----HHHHhc-c-hhhhhhhhhhhhcccccCCCc Confidence 32211110000 0000000 0000000000 000 000000 0 000111122223334555567 Q ss_pred cccchhhhHHHHHHHHhhhhHHhhcceeecCCCc--eEEEEEcCCCccceeccccccccc-ccccceeeEeeeeeEEeee Q lcl|NC_021309. 161 PGILPTFLPGIVEQLFYELSLADLISSRPVTSPN--LSYLTESAAHNNAAAVAEAGTYPF-SSEEFARVYEQVGKVANAL 237 (497) Q Consensus 161 ~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~--~~~p~~~~~~~~a~wv~Eg~~~~~-s~~~f~~i~~~~~kla~~~ 237 (497) .+||+++..+||+.+++.++|+++|+++++++++ +.+++..+....+.|++|++.+|+ ++++|++|++.++|+++++ T Consensus 126 ~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~i~~~~~k~~~~~ 205 (408) T protein:vir:74 126 LTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSSGSRVYEKWTDVTPLKAMDEEDGKIPDLDNPRLTIIKYLIKRYAGII 205 (408) T ss_pred eeechhHhhHHHHHHhhhcchhhhcceeeccCCcceEEEEeecCCcccccccccccccccccccceeeEEeeeeeEEeee Confidence 7777788899999999999999999999998765 456666665567789999999997 6799999999999999999 Q ss_pred hhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccchhhhhhhHHHHHHhhhhhcc Q lcl|NC_021309. 238 TITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTN 316 (497) Q Consensus 238 ~iS~ell~d~-~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 316 (497) +||+|+|+|+ ++|++||.++|+++++.++|.+||+|+|++.+.+.... T Consensus 206 ~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~il~G~G~~~~~~~~~~------------------------------- 254 (408) T protein:vir:74 206 TATNTLLKDTAENILAWLSSWIAKKVVVTRNQAIIAAMGTVPKKPTIAN------------------------------- 254 (408) T ss_pred hhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccc------------------------------- Confidence 9999999997 58999999999999999999999999999875432110 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHH-HhhhhhhccCCceEEechhHHHHHHHHhhhcCc Q lcl|NC_021309. 317 GAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAF-VDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQ 395 (497) Q Consensus 317 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~ 395 (497) ++++...+ ..+...++ ...+|+||+.+|..|+++||++|+ T Consensus 255 --------------------------------------~~~i~~~~~~~l~~~~~-~~a~~v~n~~~~~~l~~lkd~~G~ 295 (408) T protein:vir:74 255 --------------------------------------FDDVITMINTSVDPAII-ATSSLLTNQSGLNKLALVKTAEGK 295 (408) T ss_pred --------------------------------------HHHHHHHHHHhhhhhhc-CCCEEEEcHHHHHHHHHhhcCCCc Confidence 11222222 23333343 455799999999999999999999 Q ss_pred eeccCcccccccccccccccccccceEecC--CCCc-----CceEEEeeccceEEEEeecccEEEeecccchhhhcCceE Q lcl|NC_021309. 396 YMGGNFFGNAYGNPVNGGKNIWGVPVVTTP--LIPL-----GTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVT 468 (497) Q Consensus 396 ~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~--~~~~-----~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~ 468 (497) |||+++.... .+.+|+|+||++++ .+|. ..++||||++ +|.+++|.+++++++++.++.|.+|++. T Consensus 296 ~l~~~~~~~~------~~~~l~G~pV~~~~~~~~~~~~~~~~~i~~gd~~~-~~~~~~~~~~~i~~~~~~~~~f~~~~~~ 368 (408) T protein:vir:74 296 YLLEPDPTKP------NSYLIKGKQVIVVADRWLPNSGSTVYPLYYGDMSQ-AITLFDRENMSLLPTNIGAGAFETDTTK 368 (408) T ss_pred eEeccCcCCC------CCceecceeeEEecCcccccccCCcceEEEEehhc-cEEEEEecceEEEEeccccchhhcceee Confidence 9998765432 34689999998865 4553 2379999998 5789999999999999988899999999 Q ss_pred EEEEEeecceeecccceEEEEeeCCCCCC Q lcl|NC_021309. 469 VRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 469 ~r~~~r~~~~v~~~~a~~~l~~~~~a~~~ 497 (497) ||++.|+||.+++|+||+++++++++++- T Consensus 369 ~r~~~r~d~~~~~~~a~~~~~~~~~~~~~ 397 (408) T protein:vir:74 369 IRVIDRFDVKATDSEALVAGSFTAIADQV 397 (408) T ss_pred EEEEEeeCcEEecccceEEEEeecccCCC Confidence 99999999999999999999998776654 No 34 >protein:vir:102119 Length: 404 # NCBI annotation: phage major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699941;genbank:gi:110804052;genbank:GeneID:4206662 Probab=100.00 E-value=6.4e-61 Score=350.60 Aligned_cols=391 Identities=14% Similarity=0.139 Sum_probs=247.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021309. 14 LAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEVEVRNLKQIRKHLA 93 (497) Q Consensus 14 ~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~ 93 (497) |.++++++..+..+...+++..+.+... . .++.+.+.++++.++++++...... +.+........+. T Consensus 1 M~k~l~el~~~~~~~~~e~~~~~~~~~~-------~---~ee~~~~~~e~~~l~~~i~~~~~~~-~~~~~~~~~~~~~-- 67 (404) T protein:vir:10 1 MSKELRELLNQLDSKNKELNSLLNKDGV-------T---AEELNKTSNEIDILQAKIEAQKRKE-NIENNFNEDNVKS-- 67 (404) T ss_pred CcHHHHHHHHHHHHHHHHHHHHHhhcCC-------C---HHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHhhhhccc-- Confidence 3333333333322222222222211000 0 0111122222333333322211110 0000000000000 Q ss_pred hhhhhhHHHHhHhhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhh-hhhhhhhhhccccccccccccchhhhHHHH Q lcl|NC_021309. 94 RAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGE-TAPAAIGQNPFGSTGTFAPGILPTFLPGIV 172 (497) Q Consensus 94 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~g~~v~p~~~~~ii 172 (497) .. .............+..... ....+....... ...........++++.+|.+||+++..+|+ T Consensus 68 --~~---~~~~~~~~~~~~~~~~~~~-----------~~~~~~~~~~~~~~~~~e~~a~~~~~~~~gg~~vP~~~~~~ii 131 (404) T protein:vir:10 68 --LN---TGKEENVIYNGALFVRAIA-----------DNLLKQKNQRGLNLSEKEINAISENIDEDGGYAVPEDIQTKIN 131 (404) T ss_pred --cc---cccchhhHHHHHHHHHHHH-----------HHHHHHHHhhhhcchhhHHhhhccccCCCCceeechhHHHHHH Confidence 00 0000000000000000000 000000000000 111122223334455566667778889999 Q ss_pred HHHHhhhhHHhhcceeecCC--CceEEEEEcCCCccceecccccccccc--cccceeeEeeeeeEEeeehhhHHHHhhH- Q lcl|NC_021309. 173 EQLFYELSLADLISSRPVTS--PNLSYLTESAAHNNAAAVAEAGTYPFS--SEEFARVYEQVGKVANALTITDEGLRDA- 247 (497) Q Consensus 173 ~~~~~~~~l~~~~~~~~~~~--~~~~~p~~~~~~~~a~wv~Eg~~~~~s--~~~f~~i~~~~~kla~~~~iS~ell~d~- 247 (497) +.++..++|+++|+++++++ +.+.||+.++ ...++|++|++.+|.+ +++|++|+++++|++++++||+|||+|+ T Consensus 132 ~~~~~~~~l~~l~~~~~~~~~~g~~~~~~~~~-~~~~~~v~e~~~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~ 210 (404) T protein:vir:10 132 TRLKDTTDLYNMVDYEPVFTRSGSRTYEKRSK-QKPMKPLSENQQIPTNGDNGKLERFNFKLKDLADFMSIPNDLLKFAD 210 (404) T ss_pred HHHhhhhhHhhhhceeeccCCccceEEEEecC-CcceeeccccccccccccccceeeeEeeheeeEeeehhhHHHHhhcH Confidence 99999999999999999875 4567888766 4679999999999875 6899999999999999999999999997 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhhhcccCccc-cccccccccccccccccchhhhhhhHHHHHHhhhhhcchhhhhhhhhh Q lcl|NC_021309. 248 PELFNFVQGRLLEGIQRKEEVQLLAGGGYPG-VNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVA 326 (497) Q Consensus 248 ~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~-p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 326 (497) ++|++||.++|++++++++|.+||+|+|+++ |.||++..+..+....... T Consensus 211 ~~l~~~i~~~la~~~~~~~~~~il~G~g~~~~~~gi~~~~~~~~~~~~~~~----------------------------- 261 (404) T protein:vir:10 211 KSLEDWIINWFVDKVRITRNAEILYGAGGDEHATGIMTANKFKKITLPKSP----------------------------- 261 (404) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCcccceeeccccceeeccccc----------------------------- Confidence 5899999999999999999999999999875 8888877665443332211 Q ss_pred hhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHHHHHHHhhhcCceeccCcccccc Q lcl|NC_021309. 327 SLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAY 406 (497) Q Consensus 327 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~i~~~~~~~~~ 406 (497) .++++...+.....+.+..+.+|+||+.+|..|+++||++|||+|.++..+. T Consensus 262 ---------------------------~~~~~~~~~~~~l~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~- 313 (404) T protein:vir:10 262 ---------------------------ALKDFKKCKNVELLNVFKATSSWIVNQDGFNYLDSLEDKTGRPYLQPDPKDP- 313 (404) T ss_pred ---------------------------cHHHHHHHHHhhhhccccCCCEEEEcHHHHHHHHHhhccCCceeeccCcCCC- Confidence 0122222232222233444567999999999999999999999998765432 Q ss_pred cccccccccccccceEe-cCCCCcCc-----eEEEeeccceEEEEeecccEEEeecccchhhhcCceEEEEEEeecceee Q lcl|NC_021309. 407 GNPVNGGKNIWGVPVVT-TPLIPLGT-----ILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVY 480 (497) Q Consensus 407 ~~~~~~~~~l~G~Pvv~-~~~~~~~~-----~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~ 480 (497) .+++|||+||++ ++.++.++ ++||||++ +|.+++|.+++|+++++...+|++|++.||+++|+|+.|. T Consensus 314 -----~~~~l~G~PV~~~~~~~~~~~~~~~~~~~gd~s~-~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~d~~v~ 387 (404) T protein:vir:10 314 -----TQYRFLGLPVIELPNDLLLSTESAIPVLLGDTKE-AYKYVSDGAYELATTNIGAGAFETNTTKARIIMRIDGNVK 387 (404) T ss_pred -----CCccccceeeEEecccccCCCCCccEEEEEeccc-cEEEEEecceEEEEeccccchhhcCceEEEEEEeeccEEe Confidence 346899999985 45555433 79999998 5789999999999999888889999999999999999999 Q ss_pred cccceEEEEeeCCCCCC Q lcl|NC_021309. 481 RPSAFQLIQLKKGATGS 497 (497) Q Consensus 481 ~~~a~~~l~~~~~a~~~ 497 (497) +|+||++++++++|+-. T Consensus 388 ~~~a~~~~~~~~aa~~~ 404 (404) T protein:vir:10 388 DSEALLIAEIPVESVQA 404 (404) T ss_pred cccceEEEEeecccCCC Confidence 99999999999999888 No 35 >protein:vir:4600 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058445;genbank:gi:9635171;genbank:GeneID:1262708 Probab=100.00 E-value=1.4e-60 Score=348.65 Aligned_cols=400 Identities=12% Similarity=0.094 Sum_probs=249.9 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021309. 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) Q Consensus 1 ~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~a~~~~~~~~~~~~ 80 (497) |=..-++.+++.+ +++++.....+++..... +..+..+++.+++++++++++.++..+.+. T Consensus 1 mk~~~em~~~l~e------------------l~~~~~~~~~e~~~~~~~-~~~e~~~~~~~ev~~l~~~i~~~~~~~~~~ 61 (415) T protein:vir:46 1 MKTKEELQSEISD------------------IKRQIDLKVKYATRALNN-DELEKAEKLEQEITDLRSQIQEKQEELDKL 61 (415) T ss_pred CchHHHHHHHHHH------------------HHHHHHHHHHHHHHHhch-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2222222222221 222111111111110000 111112233333444444444333333222 Q ss_pred HHHHHHHHHHHHHhhhhhhHHHHhHhhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccccc Q lcl|NC_021309. 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFA 160 (497) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 160 (497) +......... .........+...... ...................+.+.......... .....+++.+| T Consensus 62 ~~~~~~~~~~---~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~t~~g~ 130 (415) T protein:vir:46 62 KEKDRTSENN---QQSVEVNEARTYRNQA-------NINDLGISIQNTKVTSQEVRDFTEYLETRNDI-QGGSLKTDSGF 130 (415) T ss_pred HHHHHhhhhc---ccccccchhhhhHHHH-------HHHHHHHhhhhhhhhHHHHHHHHHHHhhhhhh-hhccccccCCc Confidence 2111000000 0000000000000000 00000000001111111222222222221111 12222334455 Q ss_pred cccchhhhHHHHHHHHhhhhHHhhcceeecCCCceEEEEEcC-CCccceeccccccccc-ccccceeeEeeeeeEEeeeh Q lcl|NC_021309. 161 PGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESA-AHNNAAAVAEAGTYPF-SSEEFARVYEQVGKVANALT 238 (497) Q Consensus 161 ~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~-~~~~a~wv~Eg~~~~~-s~~~f~~i~~~~~kla~~~~ 238 (497) .+||+++...|++.+++.++|+++|++++++++...||+... ....++|++|++.+|+ +.++|++|++.+++++++++ T Consensus 131 ~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~~~~ 210 (415) T protein:vir:46 131 VVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFR 210 (415) T ss_pred ccccHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEEecCCcceeecccccccccccccceeeEEeeeeeeEeeeh Confidence 667777889999999999999999999999988877776542 2357899999999997 57999999999999999999 Q ss_pred hhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccchhhhhhhHHHHHHhhhhhcch Q lcl|NC_021309. 239 ITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNG 317 (497) Q Consensus 239 iS~ell~d~-~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 317 (497) ||+|||+|+ .+|++||+++|++++++++|.+|++|+|++.+.++........... T Consensus 211 iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~~~~~~~~~~~~~~------------------------ 266 (415) T protein:vir:46 211 ISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKL------------------------ 266 (415) T ss_pred hhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCCcccccccccccccee------------------------ Confidence 999999997 5799999999999999999999999999987666544322111100 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHHHHHHHhhhcCcee Q lcl|NC_021309. 318 AFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYM 397 (497) Q Consensus 318 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~i 397 (497) .......++++.+++..+...++. +++|+||+.+|..|+++||++|+|| T Consensus 267 ------------------------------~~~~~~~~~~i~~~~~~~~~~~~~-~~~~v~n~~~~~~L~~lkd~~G~~i 315 (415) T protein:vir:46 267 ------------------------------EVKKAKSLDDIKDAINLNVKPNYE-HNVAIVSQTMFAKLDKMKDKLGNYL 315 (415) T ss_pred ------------------------------ccccccchHHHHHHHHhhhhhccC-CCEEEEcHHHHHHHHHhhccCCCee Confidence 011112245566666677666654 5689999999999999999999999 Q ss_pred ccCcccccccccccccccccccceEecCCCCcCc-----eEEEeeccceEEEEeecccEEEeecccchhhhcCceEEEEE Q lcl|NC_021309. 398 GGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGT-----ILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAE 472 (497) Q Consensus 398 ~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~-----~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~ 472 (497) |+++..+. .+.+|||+||++++.+|.+. ++||||++ +|.+++|.++++++++ |.++++.+|++ T Consensus 316 ~~~~~~~~------~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~-~~~~~~~~~~~v~~~~-----~~~~~~~~~~~ 383 (415) T protein:vir:46 316 IQPDVKEK------TQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKD-AIVLFDRSQYQASWTD-----YMHFGECLMIA 383 (415) T ss_pred eccCcCCC------CCccccceeeEEeccccccCCCccEEEEEehhc-cEEEEeecceEEEeec-----cccCceEEEEE Confidence 98765432 34689999999999998543 79999998 4778999999999875 56778999999 Q ss_pred EeecceeecccceEEEEeeCCCCCC Q lcl|NC_021309. 473 ERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 473 ~r~~~~v~~~~a~~~l~~~~~a~~~ 497 (497) +|+|+.|++|+||+++++++++.|. T Consensus 384 ~r~d~~v~~~~a~~~~~~~~~~~~~ 408 (415) T protein:vir:46 384 VRQDCRILDYKSAIVIEYDDSERGE 408 (415) T ss_pred EEeccEEeccccEEEEEeeccCCCC Confidence 9999999999999999999999998 No 36 >protein:vir:4700 Length: 415 # NCBI annotation: phi PVL ORF 7 homologue # Family: family:all:21 # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061632;genbank:gi:9635719;genbank:GeneID:1262976 Probab=100.00 E-value=1.4e-60 Score=348.65 Aligned_cols=400 Identities=12% Similarity=0.094 Sum_probs=249.9 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021309. 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) Q Consensus 1 ~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~a~~~~~~~~~~~~ 80 (497) |=..-++.+++.+ +++++.....+++..... +..+..+++.+++++++++++.++..+.+. T Consensus 1 mk~~~em~~~l~e------------------l~~~~~~~~~e~~~~~~~-~~~e~~~~~~~ev~~l~~~i~~~~~~~~~~ 61 (415) T protein:vir:47 1 MKTKEELQSEISD------------------IKRQIDLKVKYATRALNN-DELEKAEKLEQEITDLRSQIQEKQEELDKL 61 (415) T ss_pred CchHHHHHHHHHH------------------HHHHHHHHHHHHHHHhch-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2222222222221 222111111111110000 111112233333444444444333333222 Q ss_pred HHHHHHHHHHHHHhhhhhhHHHHhHhhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccccc Q lcl|NC_021309. 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFA 160 (497) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 160 (497) +......... .........+...... ...................+.+.......... .....+++.+| T Consensus 62 ~~~~~~~~~~---~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~t~~g~ 130 (415) T protein:vir:47 62 KEKDRTSENN---QQSVEVNEARTYRNQA-------NINDLGISIQNTKVTSQEVRDFTEYLETRNDI-QGGSLKTDSGF 130 (415) T ss_pred HHHHHhhhhc---ccccccchhhhhHHHH-------HHHHHHHhhhhhhhhHHHHHHHHHHHhhhhhh-hhccccccCCc Confidence 2111000000 0000000000000000 00000000001111111222222222221111 12222334455 Q ss_pred cccchhhhHHHHHHHHhhhhHHhhcceeecCCCceEEEEEcC-CCccceeccccccccc-ccccceeeEeeeeeEEeeeh Q lcl|NC_021309. 161 PGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESA-AHNNAAAVAEAGTYPF-SSEEFARVYEQVGKVANALT 238 (497) Q Consensus 161 ~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~-~~~~a~wv~Eg~~~~~-s~~~f~~i~~~~~kla~~~~ 238 (497) .+||+++...|++.+++.++|+++|++++++++...||+... ....++|++|++.+|+ +.++|++|++.+++++++++ T Consensus 131 ~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~~~~ 210 (415) T protein:vir:47 131 VVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFR 210 (415) T ss_pred ccccHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEEecCCcceeecccccccccccccceeeEEeeeeeeEeeeh Confidence 667777889999999999999999999999988877776542 2357899999999997 57999999999999999999 Q ss_pred hhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccchhhhhhhHHHHHHhhhhhcch Q lcl|NC_021309. 239 ITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNG 317 (497) Q Consensus 239 iS~ell~d~-~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 317 (497) ||+|||+|+ .+|++||+++|++++++++|.+|++|+|++.+.++........... T Consensus 211 iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~~~~~~~~~~~~~~------------------------ 266 (415) T protein:vir:47 211 ISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKL------------------------ 266 (415) T ss_pred hhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCCcccccccccccccee------------------------ Confidence 999999997 5799999999999999999999999999987666544322111100 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHHHHHHHhhhcCcee Q lcl|NC_021309. 318 AFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYM 397 (497) Q Consensus 318 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~i 397 (497) .......++++.+++..+...++. +++|+||+.+|..|+++||++|+|| T Consensus 267 ------------------------------~~~~~~~~~~i~~~~~~~~~~~~~-~~~~v~n~~~~~~L~~lkd~~G~~i 315 (415) T protein:vir:47 267 ------------------------------EVKKAKSLDDIKDAINLNVKPNYE-HNVAIVSQTMFAKLDKMKDKLGNYL 315 (415) T ss_pred ------------------------------ccccccchHHHHHHHHhhhhhccC-CCEEEEcHHHHHHHHHhhccCCCee Confidence 011112245566666677666654 5689999999999999999999999 Q ss_pred ccCcccccccccccccccccccceEecCCCCcCc-----eEEEeeccceEEEEeecccEEEeecccchhhhcCceEEEEE Q lcl|NC_021309. 398 GGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGT-----ILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAE 472 (497) Q Consensus 398 ~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~-----~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~ 472 (497) |+++..+. .+.+|||+||++++.+|.+. ++||||++ +|.+++|.++++++++ |.++++.+|++ T Consensus 316 ~~~~~~~~------~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~-~~~~~~~~~~~v~~~~-----~~~~~~~~~~~ 383 (415) T protein:vir:47 316 IQPDVKEK------TQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKD-AIVLFDRSQYQASWTD-----YMHFGECLMIA 383 (415) T ss_pred eccCcCCC------CCccccceeeEEeccccccCCCccEEEEEehhc-cEEEEeecceEEEeec-----cccCceEEEEE Confidence 98765432 34689999999999998543 79999998 4778999999999875 56778999999 Q ss_pred EeecceeecccceEEEEeeCCCCCC Q lcl|NC_021309. 473 ERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 473 ~r~~~~v~~~~a~~~l~~~~~a~~~ 497 (497) +|+|+.|++|+||+++++++++.|. T Consensus 384 ~r~d~~v~~~~a~~~~~~~~~~~~~ 408 (415) T protein:vir:47 384 VRQDCRILDYKSAIVIEYDDSERGE 408 (415) T ss_pred EEeccEEeccccEEEEEeeccCCCC Confidence 9999999999999999999999998 No 37 >protein:vir:4997 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049971;genbank:gi:9632943;genbank:GeneID:1262106 Probab=100.00 E-value=1.3e-60 Score=348.88 Aligned_cols=378 Identities=15% Similarity=0.156 Sum_probs=251.3 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021309. 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) Q Consensus 1 ~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~a~~~~~~~~~~~~ 80 (497) |...-+|.++..++.++++++.+.. .+.+.......+++ .++.++++.++++.+.+...+... T Consensus 1 Mk~~~eL~~~~~~~~~~~~~l~~~~----~~~~~~~~~~~ee~-------------~~l~~ei~~~~~~~~~~~~~~~~~ 63 (397) T protein:vir:49 1 MKTSNELHDLWIAQGDKVENLNEKL----NVAMLDDSVSAEEL-------------QAIKNERDTAKMKRDLFKEQYTEA 63 (397) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHH----HHHHhcchhhHHHH-------------HHHHHHHHHHHHHHHHHHHHHHHH Confidence 7766666666665555555543322 11111111111111 222222233333332222222111 Q ss_pred HHHHHHHHHHHHHhhhhhhHHHHhHhhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccccc Q lcl|NC_021309. 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFA 160 (497) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 160 (497) +........ ..... ..... .. ..............++.. .........++++.+| T Consensus 64 ~~~~~~~~~-~~~~~--------~~~~~-----~~-----~~~~~~~~~~~~~l~~~~------~~~~~~~~~~t~~~gg 118 (397) T protein:vir:49 64 RANEVANMS-EEEKK--------PLTKN-----EE-----EVKANFVKDFKNLVRGRY------QNLLDSKTDGSGSDAG 118 (397) T ss_pred HHhhhhccc-ccccc--------cccch-----hh-----HHHHHHHHHHHHHhhcch------hhHHHhhhccCCccCc Confidence 111000000 00000 00000 00 000000000000000000 0111223334445566 Q ss_pred cccchhhhHHHHHHHHhhhhHHhhcceeecCCCc--eEEEEEcCCCccceeccccccccccc-ccceeeEeeeeeEEeee Q lcl|NC_021309. 161 PGILPTFLPGIVEQLFYELSLADLISSRPVTSPN--LSYLTESAAHNNAAAVAEAGTYPFSS-EEFARVYEQVGKVANAL 237 (497) Q Consensus 161 ~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~--~~~p~~~~~~~~a~wv~Eg~~~~~s~-~~f~~i~~~~~kla~~~ 237 (497) .+||+++..+|++.+++.++|+++|+++++++++ +.||+..+..+.+.|++|++.+|+++ ++|++|+++++++++++ T Consensus 119 ~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~ 198 (397) T protein:vir:49 119 LTIPQDIRTAINTLVRQFDSLQEYVNVENVTTLTGSRVYEKWADITGLAKLDDEGGQIGQNDDPKLSLIRYAIKRYAGIS 198 (397) T ss_pred ceecHHHHHHHHHHHHhhhhHhhhcceeeccCCcceEEEEeeccCCcceeeeccccccccccccceeeeEeeeeeeEeeh Confidence 6677778899999999999999999999988764 55666666567899999999999875 89999999999999999 Q ss_pred hhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccchhhhhhhHHHHHHhhhhhcc Q lcl|NC_021309. 238 TITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTN 316 (497) Q Consensus 238 ~iS~ell~d~-~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 316 (497) +||+|+|+|+ .+|++||.++|+++++.++|.+|++|+|++.|.+... T Consensus 199 ~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~ail~G~g~~~~~~~~~-------------------------------- 246 (397) T protein:vir:49 199 TVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILEAIGTLPNKPTLA-------------------------------- 246 (397) T ss_pred hhHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccc-------------------------------- Confidence 9999999997 4799999999999999999999999999876432110 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHHHHHHHhhhcCce Q lcl|NC_021309. 317 GAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQY 396 (497) Q Consensus 317 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~ 396 (497) .++++.+++..+...++ .+.+|+||+.+|..|+++||++|+| T Consensus 247 -------------------------------------~~d~i~~~~~~l~~~~~-~~a~~v~n~~~~~~l~~lkd~~g~~ 288 (397) T protein:vir:49 247 -------------------------------------KWDDIIDLQAKVDPAIK-QTSLFLTNTSGFTALKKVKNAMGDY 288 (397) T ss_pred -------------------------------------CHHHHHHHHHhhhhhhc-CCCEEEEcHHHHHHHHHhhccCCce Confidence 02345555666665554 4568999999999999999999999 Q ss_pred eccCcccccccccccccccccccceEecC--CCCc-----CceEEEeeccceEEEEeecccEEEeecccchhhhcCceEE Q lcl|NC_021309. 397 MGGNFFGNAYGNPVNGGKNIWGVPVVTTP--LIPL-----GTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTV 469 (497) Q Consensus 397 i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~--~~~~-----~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~ 469 (497) ||.+++..+ ...+|+|+||++++ .+|. ..++||||++ +|.+++|.+++|+++++.+++|++|++.| T Consensus 289 l~~~~~~~g------~~~~l~G~pV~~~~~~~~~~~~~~~~~~~~gd~~~-~~~~~~~~~~~i~~~~~~~~~~~~~~~~~ 361 (397) T protein:vir:49 289 LMERDVKSP------TGYSIDGFVVKEISDRFLPNGTGGAMPLYFGDLKQ-AVTLFDRQHLSLLSTNIGGGAFETDTTKV 361 (397) T ss_pred eecccccCC------CCceecceeeEEecccccccccCCceeEEEeeccc-eEEEEeecccEEEEeccccchhhcCeeeE Confidence 998755432 34689999998754 4453 3479999998 58899999999999999888999999999 Q ss_pred EEEEeecceeecccceEEEEeeCCCCCC Q lcl|NC_021309. 470 RAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 470 r~~~r~~~~v~~~~a~~~l~~~~~a~~~ 497 (497) |++.|+|+.+++|+||++++++++++.. T Consensus 362 ~~~~r~d~~~~~~~a~~~~~~~~~~~~~ 389 (397) T protein:vir:49 362 RVIDRFDVVSTDTEAFVPASFKAIADQK 389 (397) T ss_pred EEEEeeccEEecccceEEEEeccccccc Confidence 9999999999999999999999988865 No 38 >protein:vir:80128 Length: 466 # NCBI annotation: Phage capsid protein # Family: family:all:635 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425603;genbank:gi:155042936;genbank:GeneID:5469556 Probab=100.00 E-value=4.9e-60 Score=345.72 Aligned_cols=442 Identities=12% Similarity=0.082 Sum_probs=250.8 Q ss_pred Cc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh--Hh-HHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021309. 1 MP-STAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAE--VE-AHERAQEMLKSLGGADAAKDGLDND 76 (497) Q Consensus 1 ~~-~~a~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~--~~-~~e~~~e~~~~~~~~~a~~~~~~~~ 76 (497) |- -..+|..+.+++..++.++..... +++.+...+...++..+.+ +. ..+..+.+.++..++++.+..++.+ T Consensus 1 ~~~~~~~l~~~~~~~~~~l~el~e~~~----~l~k~~~el~~~l~ea~~~ee~~~~ee~i~~l~~~~~el~e~~~~l~~e 76 (466) T protein:vir:80 1 MALRQLMLAKKIEQRKAALAELLEQEK----ALQKRSEELEAAIDEANTDEEIAVVEDEINKLEGEKTELEEKKSKLEGE 76 (466) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11 111123333333333333332211 1111111111111110100 00 0111222223333333333333333 Q ss_pred HHHHHHHHHHHHHHHHHhhhhhhHHHHhH-hhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhcccc Q lcl|NC_021309. 77 IPEVEVRNLKQIRKHLARAVIMNPELKNA-TSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGS 155 (497) Q Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 155 (497) +.+++.+... .................. .................. ....+.....+......... ......... T Consensus 77 i~~le~el~e-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~ 152 (466) T protein:vir:80 77 IKELENELEQ-LNNKEPKNNSEPAQVSGARTQQFVGGETRMKGFFRNM-PYEQRAALIARSEVKEFLAQ--VRTLAQQKR 152 (466) T ss_pred HHHHHHHHHH-HHHhhhccCchhHHHHhhhhhHHhhHHHHHHHHHHhh-hhhhHHHHHHHHHHHHHHHH--HHHHhhhhh Confidence 3332221100 000000000000000000 000000000000000000 00000000000000000000 000111112 Q ss_pred ccccccc-cchhhhHHHHHHHHhhhhHHhhcceeecCCCceEEEEEcCCCccceecccccccccccccceeeEeeeeeEE Q lcl|NC_021309. 156 TGTFAPG-ILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVA 234 (497) Q Consensus 156 ~~~~g~~-v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~wv~Eg~~~~~s~~~f~~i~~~~~kla 234 (497) +.++|++ ||.++...|++.++..++|+++|++.++++. .++|+.+. .+.+.|++|++.+|+++|+|++|++.+|+++ T Consensus 153 ~~~g~~~~vP~~~~~~i~~~l~~~~~l~~~~~v~~~~g~-~~~~~~~~-~~~a~wv~E~~~~~~~~~~f~~i~~~~~k~~ 230 (466) T protein:vir:80 153 AVSGAELTIPDVMLELLRDNMHRYSKLISKVRLRPLKGT-ARQNIAGA-IPEGVWTEAVANLNELSLSFSQIEVDGYKVG 230 (466) T ss_pred hhccccccccHHHHHHHHHhhhhhhhhhhheeeeecCce-eEeeeecC-Ccceeecccccccccccccccceeecceeee Confidence 2234444 4555677899999999999999999998765 68888765 4678999999999999999999999999999 Q ss_pred eeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccchhhhhhhHHHHHHhhhh Q lcl|NC_021309. 235 NALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPAD 313 (497) Q Consensus 235 ~~~~iS~ell~d~-~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 313 (497) ++++||+|||+|+ +++++||+.+|+++++.++|.+||+|+|+++|.||++..+..+...................... T Consensus 231 ~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~ail~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~- 309 (466) T protein:vir:80 231 GFIPIPNSTLEDSDLNLADEILDAIGQAIGFALDKAILYGTGTKMPVGIVTRLAQTTQPPNWGTKAPAWTNLSTTNLLK- 309 (466) T ss_pred eehhhhHHHHhcchHHHHHHHHHHHHHHHHHHHhhheeeccCCCCcceeeecccccccccccccccccccccchhhhhh- Confidence 9999999999998 58999999999999999999999999999999999988655444433221111111100000000 Q ss_pred hcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHHHHHHHh--- Q lcl|NC_021309. 314 GTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTK--- 390 (497) Q Consensus 314 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lk--- 390 (497) ...........+......+......+..+...|+||+.++..|..++ T Consensus 310 ------------------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~l~~~~~~~ 359 (466) T protein:vir:80 310 ------------------------------IDPTGKSAEEFFSELVLKLSKARANYSNGMKFWAMSSNTHAVLMSKAITF 359 (466) T ss_pred ------------------------------hhhhccchhhHHHHHHHHHHhhhccccCCceeEEecchhHHHhhcccccc Confidence 00000011111222333334445555666677999999999998887 Q ss_pred hhcCceeccCcccccccccccccccccccceEecCCCCcCceEEEeeccceEEEEeecccEEEeecccchhhhcCceEEE Q lcl|NC_021309. 391 DANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVR 470 (497) Q Consensus 391 d~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r 470 (497) +++|.|++.+.. ...|+|+||+++++||.+++++|||+. |.+++|.+++|..+++. .|.+|++.|| T Consensus 360 ~~~g~~~~~~~~----------~~~i~G~pvv~s~~~~~~~~~~g~~~~--y~i~~r~~~~i~~~~~~--~f~~d~~~~r 425 (466) T protein:vir:80 360 NSAGALVASLNN----------TMPIVGGDIVILDFIPDNDIIGGYGSL--YLLAERADIKLAQSEHV--RFIEDQTVFK 425 (466) T ss_pred cCCccccccCCC----------cccccccceeecCccCccceeeecccc--EEEEeecceEEEechhh--hhhcCcEEEE Confidence 677877765421 235899999999999999999999997 77999999999998754 5999999999 Q ss_pred EEEeecceeecccceEEEEeeCCCCCC Q lcl|NC_021309. 471 AEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 471 ~~~r~~~~v~~~~a~~~l~~~~~a~~~ 497 (497) +++|+||+|++|+||++++++....++ T Consensus 426 ~~~r~dg~~~~~~afv~~~~~~~~~~~ 452 (466) T protein:vir:80 426 GTARYDGKPVFGEGFVAVNIANANPTT 452 (466) T ss_pred EEEEEccEEeccCceEEEEecCCCccc Confidence 999999999999999999999987777 No 39 >protein:vir:9410 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803388;genbank:gi:29028700;genbank:GeneID:1258136 Probab=100.00 E-value=4.5e-60 Score=345.97 Aligned_cols=399 Identities=12% Similarity=0.105 Sum_probs=251.6 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021309. 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) Q Consensus 1 ~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~a~~~~~~~~~~~~ 80 (497) |=...+++++++++.+++.++.....+.+.+. ..++.+.+..+++.++++++.+...+.+. T Consensus 1 mk~~~el~~~l~el~~~~~~~~~~~~~~~~~~-------------------~~e~~~~~~~ei~~l~~~i~~~~~~~~~~ 61 (415) T protein:vir:94 1 MKTKEELQSEISDIKRQIDLKVKYATRALNND-------------------ELEKAEKLEQEITDLRSQIQEKQEELDKL 61 (415) T ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHhchh-------------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 44444444444333333333322221111110 01111222223333333333333322222 Q ss_pred HHHHHHHHHHHHHhhhhhhHHHHhHhhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccccc Q lcl|NC_021309. 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFA 160 (497) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 160 (497) +......................... ..................++.+......... ......+++.+| T Consensus 62 ~~~~~~~~~~~~~~~~~~~~~~~~~~----------~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~-~~~~~~~~~~g~ 130 (415) T protein:vir:94 62 KEKDGTSENNQQSVEVNEASTYRNQA----------NINDLGISIQNTKVTSQEVRDFTEYLETRND-IQGGSLKTDSGF 130 (415) T ss_pred HHHHHhhhhccccccccchhhHHHHH----------HHHHHHhhhhhhhhhHHHHHHHHHHhhhhhh-hhhhcccccccc Confidence 21111000000000000000000000 0000000000001111111222222222111 122223344566 Q ss_pred cccchhhhHHHHHHHHhhhhHHhhcceeecCCCceEEEE--EcCCCccceeccccccccc-ccccceeeEeeeeeEEeee Q lcl|NC_021309. 161 PGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLT--ESAAHNNAAAVAEAGTYPF-SSEEFARVYEQVGKVANAL 237 (497) Q Consensus 161 ~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~--~~~~~~~a~wv~Eg~~~~~-s~~~f~~i~~~~~kla~~~ 237 (497) .+||+++...|++.+++.++|+++|++++++++...+|. .++ .+.+.|++|++.+|+ +.++|++|++.++++++++ T Consensus 131 ~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~v~Eg~~~~~~~~~~~~~i~~~~~k~~~~~ 209 (415) T protein:vir:94 131 VVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSE-VAALEKVEELEENPELAVKPFFQLAYDINTHRGYF 209 (415) T ss_pred ccCcHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEeecC-CccceeccccccccccccccceeeEeeheeeeeec Confidence 667777889999999999999999999999877666554 444 457899999999996 5689999999999999999 Q ss_pred hhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccchhhhhhhHHHHHHhhhhhcc Q lcl|NC_021309. 238 TITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTN 316 (497) Q Consensus 238 ~iS~ell~d~-~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 316 (497) +||+|||+|+ ++|++||.++|+++++.++|.+|++|+|++.+.++............ T Consensus 210 ~is~ell~ds~~~~~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~~~~---------------------- 267 (415) T protein:vir:94 210 RISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLE---------------------- 267 (415) T ss_pred hhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccccccccccccccc---------------------- Confidence 9999999987 57999999999999999999999999999876665443221111000 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHHHHHHHhhhcCce Q lcl|NC_021309. 317 GAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQY 396 (497) Q Consensus 317 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~ 396 (497) ......++++.+++..+..+++ .+++|+||+.+|..|+++||++|+| T Consensus 268 --------------------------------~~~~~~~~~i~~~~~~~~~~~~-~~~~~vmn~~~~~~l~~lkd~~G~~ 314 (415) T protein:vir:94 268 --------------------------------VKKAKSLDDIKDAINLNVKPNY-EHNVAIVSQTMFAKLDKMKDKLGNY 314 (415) T ss_pred --------------------------------cccccchHHHHHHHHhhhhhcc-CCCEEEEcHHHHHHHHHhhccCCCe Confidence 0111224556666666666655 4668999999999999999999999 Q ss_pred eccCcccccccccccccccccccceEecCCCCcCc-----eEEEeeccceEEEEeecccEEEeecccchhhhcCceEEEE Q lcl|NC_021309. 397 MGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGT-----ILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRA 471 (497) Q Consensus 397 i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~-----~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~ 471 (497) ||.+.+.+. ...+|||+||++++.+|.++ ++||||++ +|.+++|.++++++++ |..+++.+|+ T Consensus 315 l~~~~~~~~------~~~~l~G~pV~~~~~~~~~~~~~~~i~~gd~~~-~~~~~~~~~~~v~~~~-----~~~~~~~~r~ 382 (415) T protein:vir:94 315 LIQPDVKEK------TQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKD-AIVLFDRSQYQASWTD-----YMHFGECLMI 382 (415) T ss_pred eeccCcCCC------CCceecceeeEEecccccCCCCccEEEEEehhc-cEEEEeecceEEEEec-----cccCceEEEE Confidence 998765432 34689999999999999654 79999998 4778999999999875 5677899999 Q ss_pred EEeecceeecccceEEEEeeCCCCCC Q lcl|NC_021309. 472 EERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 472 ~~r~~~~v~~~~a~~~l~~~~~a~~~ 497 (497) ++|+|+.|++|+||++++++++++|+ T Consensus 383 ~~r~d~~~~~~~a~~~~~~~~~~~~~ 408 (415) T protein:vir:94 383 AVRQDCRILDYKSAIVIEYDDSERGE 408 (415) T ss_pred EEEeccEEeccccEEEEEEeccCCCC Confidence 99999999999999999999999999 No 40 >protein:vir:3845 Length: 395 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050151;swissprot:trembl:q9t1f6;genbank:gi:9633043;uniprot:Q9T1F6;genbank:GeneID:1262163 Probab=100.00 E-value=4.6e-60 Score=345.87 Aligned_cols=376 Identities=16% Similarity=0.149 Sum_probs=236.8 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021309. 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) Q Consensus 1 ~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~a~~~~~~~~~~~~ 80 (497) |-- -+|+++..++.++++++. +++.+...+.+. +...+...+++.++++++.++...... T Consensus 1 M~~-~eL~~~~~~~~~~~~~l~-----------e~~~~~~~~~~~--------~~~~~~~ee~~~l~~~i~~~~~~~~~~ 60 (395) T protein:vir:38 1 MNI-NQLKDAFDMAGQKVQDLE-----------DKRAQFAIDLGN--------DASSHSVDDINKLNASLKNAKMAQELA 60 (395) T ss_pred CCH-HHHHHHHHHHHHHHHHHH-----------HHHHHHHHHHhh--------hHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 321 112222222222222222 111111111000 000011111222222222222211110 Q ss_pred HHHHHHHHHHHHHhhhhhhHHHHhHhhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccccc Q lcl|NC_021309. 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFA 160 (497) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 160 (497) +.. ............ ......... ..... .......+.... .........+++.+| T Consensus 61 ~~~-~~~~~~~~~~~~------~~~~~~~~~----------~~~~~----~~~~~~~~~~~~---~~~~~~~~~~~~~gg 116 (395) T protein:vir:38 61 KSA-YEDARANLNAEP------VNKKPLPVK----------DGKPD----AQAMKNQFVKDF---KNLVTSGTTGTGNAG 116 (395) T ss_pred HHH-HHHHHhhhhhcc------ccccccchh----------hhhHH----HHHHHHHHHHHH---HHHHhhccCccCCCc Confidence 000 000000000000 000000000 00000 000111111111 111222334455677 Q ss_pred cccchhhhHHHHHHHHhhhhHHhhcceeecCCCceE--EEEEcCCCccceecccccccccc-cccceeeEeeeeeEEeee Q lcl|NC_021309. 161 PGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLS--YLTESAAHNNAAAVAEAGTYPFS-SEEFARVYEQVGKVANAL 237 (497) Q Consensus 161 ~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~--~p~~~~~~~~a~wv~Eg~~~~~s-~~~f~~i~~~~~kla~~~ 237 (497) .+||+++..+||+.+++.++|+++|++++++++... +++..+..+.++|++|++.+|++ +++|++|++.++|+++++ T Consensus 117 ~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~f~~v~~~~~k~~~~~ 196 (395) T protein:vir:38 117 LTIPEDIQLQIRTLTRSFTSLESLANVENVTTSHGSRVYEKLADITPLKDLDDESALIGDNDDPELTVVKYLIHRYAGIT 196 (395) T ss_pred eecchhHhhHHHHHHHhhcchhhhcceeeccCCcceEEEEeeccCCccccccccccccccccccceeeEEeeeeeeEeeh Confidence 778888889999999999999999999998876544 45555555678999999999976 599999999999999999 Q ss_pred hhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccchhhhhhhHHHHHHhhhhhcc Q lcl|NC_021309. 238 TITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTN 316 (497) Q Consensus 238 ~iS~ell~d~-~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 316 (497) +||+||++|+ ++|++||.++|+++++.++|.+|++|+|++.+.+.... T Consensus 197 ~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~il~g~g~~~~~~~~~~------------------------------- 245 (395) T protein:vir:38 197 TVTNTLLKDTVDNIIQWLVNWAAKKDVVTRNAKILEVMGKAPKKPTISQ------------------------------- 245 (395) T ss_pred hhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccc------------------------------- Confidence 9999999997 58999999999999999999999999998754321100 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHH-hhhhhhccCCceEEechhHHHHHHHHhhhcCc Q lcl|NC_021309. 317 GAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFV-DIQLTLFQTPNAVVMNPRDWELLRLTKDANGQ 395 (497) Q Consensus 317 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~ 395 (497) ++++..++. .+.. .+....+|+||+.+|..|+++||++|+ T Consensus 246 --------------------------------------~~~i~~~~~~~l~~-~~~~~a~~v~n~~~~~~L~~lkd~~G~ 286 (395) T protein:vir:38 246 --------------------------------------FDNIKDLENNTLDP-AIESTSSFITNQSGYNILSKVKDADGR 286 (395) T ss_pred --------------------------------------HHHHHHHHHHhhhh-hhcCCCEEEEcHHHHHHHHHhhccCCc Confidence 011222222 2222 233456799999999999999999999 Q ss_pred eeccCcccccccccccccccccccceEecCCCCc------CceEEEeeccceEEEEeecccEEEeecccchhhhcCceEE Q lcl|NC_021309. 396 YMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPL------GTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTV 469 (497) Q Consensus 396 ~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~------~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~ 469 (497) |||++.... ....+|+|+||++++.++. ..++||||++ +|.+++|.+++|+++++.+.+|++|++.| T Consensus 287 ~l~~~~~~~------~~~~~l~G~pV~~~~~~~~~~~~~~~~i~~gd~~~-~~~i~~~~~~~i~~~~~~~~~~~~~~~~~ 359 (395) T protein:vir:38 287 YLMQPDVTS------PDKYLIDGKPVIRIADKWLPDVSGSHPLYFGDLKQ-GITLFDRQQMQIDTTNVGAGSFEHDTTKL 359 (395) T ss_pred eeeccCcCC------CCcceeccceeEEecccccCcCCCcceEEEEeccc-cEEEEEecceEEEEeccccchhhcCceEE Confidence 999876443 2346899999999886542 2379999998 58899999999999999888899999999 Q ss_pred EEEEeecceeecccceEEEEeeCCCCCC Q lcl|NC_021309. 470 RAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 470 r~~~r~~~~v~~~~a~~~l~~~~~a~~~ 497 (497) |++.|+|+.|.+|+||++++++++++.+ T Consensus 360 r~~~r~d~~~~~~~a~~~~~~~~~~~~~ 387 (395) T protein:vir:38 360 RFIDRFDVQLIDDGAFAAASFKTVANQA 387 (395) T ss_pred EEEEeeccEEecccceEEEEeecccCCC Confidence 9999999999999999999999887776 No 41 >protein:vir:98339 Length: 415 # NCBI annotation: putative capsid protein # Family: family:all:21 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918931;genbank:gi:119443693;genbank:GeneID:4594501 Probab=100.00 E-value=9.8e-60 Score=344.09 Aligned_cols=399 Identities=12% Similarity=0.098 Sum_probs=251.4 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021309. 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) Q Consensus 1 ~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~a~~~~~~~~~~~~ 80 (497) |=...++++++.++.+++.....+..+.+.+.. .++.+++.++++.++++++.++.++.+. T Consensus 1 mk~~~el~~~l~el~~~~~~~~~e~~~~l~~~~-------------------~~~~~~~~~e~~~l~~~i~~~~~~~~~~ 61 (415) T protein:vir:98 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDE-------------------LEKAEKLEQEITDLRSQIQEKQEELDKL 61 (415) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHhchHH-------------------HHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 666666666555555544443332222211100 0111122222223333333222222211 Q ss_pred HHHHHHHHHHHHHhhhhhhHHHHhHhhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccccc Q lcl|NC_021309. 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFA 160 (497) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 160 (497) +........ .......... +... .....................++.+......... ......+++.+| T Consensus 62 ~~~~~~~~~-~~~~~~~~~~--~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~gg 130 (415) T protein:vir:98 62 KEKDGTSEN-NQQSVEVNEA--RTYR-------NQANINDLGISIQNTKVTSQEVRDFTEYLETRND-IQGGSLKTDSGF 130 (415) T ss_pred HHHHhhhhh-cccccccchh--hhHH-------HHHHHHHHhhhhhhhhhHHHHHHHHHHHHhhhhh-hhhccccccccc Confidence 111000000 0000000000 0000 0000000000000001111111222222211111 111222333445 Q ss_pred cccchhhhHHHHHHHHhhhhHHhhcceeecCCCceEEEE--EcCCCccceeccccccccc-ccccceeeEeeeeeEEeee Q lcl|NC_021309. 161 PGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLT--ESAAHNNAAAVAEAGTYPF-SSEEFARVYEQVGKVANAL 237 (497) Q Consensus 161 ~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~--~~~~~~~a~wv~Eg~~~~~-s~~~f~~i~~~~~kla~~~ 237 (497) .+||+++...|++.+++.++|+++|++++++++++.+|. .++ ...++|++|++.+|+ +.++|++|++.++++++++ T Consensus 131 ~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~ 209 (415) T protein:vir:98 131 VVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSE-VAALEKVEELEENPELAVKPFFQLAYDINTHRGYF 209 (415) T ss_pred cccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEeecC-CccceeeccccccCcccccceeeEEeeeeeeEeee Confidence 556667888999999999999999999999877666554 443 457899999999996 4689999999999999999 Q ss_pred hhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccchhhhhhhHHHHHHhhhhhcc Q lcl|NC_021309. 238 TITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTN 316 (497) Q Consensus 238 ~iS~ell~d~-~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 316 (497) +||+||++|+ ++|++||.++|+++++.++|.+|++|+|++.+.+............ T Consensus 210 ~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~~~----------------------- 266 (415) T protein:vir:98 210 RISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKL----------------------- 266 (415) T ss_pred hhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCcccccccccccccccc----------------------- Confidence 9999999987 5899999999999999999999999999887665543322111100 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHHHHHHHhhhcCce Q lcl|NC_021309. 317 GAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQY 396 (497) Q Consensus 317 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~ 396 (497) .......++++.+++..+...++ .+++|+||+.+|..|+++||++||| T Consensus 267 -------------------------------~~~~~~~~~~i~~~~~~~~~~~~-~~~~~v~n~~~~~~l~~lkd~~G~~ 314 (415) T protein:vir:98 267 -------------------------------EVKKAKSLDDIKDAINLNVKPNY-EHNVAIVSQTMFAKLDKMKDKLGNY 314 (415) T ss_pred -------------------------------ccccccchhHHHHHHHhhhhhcc-CCCEEEEcHHHHHHHHHhhccCCce Confidence 01111224566666666666554 4668999999999999999999999 Q ss_pred eccCcccccccccccccccccccceEecCCCCcCc-----eEEEeeccceEEEEeecccEEEeecccchhhhcCceEEEE Q lcl|NC_021309. 397 MGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGT-----ILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRA 471 (497) Q Consensus 397 i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~-----~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~ 471 (497) ||.+++.+. .+.+|+|+||++++.+|.++ ++||||++ +|.+++|.++++++++ |..+++.+|+ T Consensus 315 l~~~~~~~~------~~~~l~G~pV~~~~~~~~~~~~~~~~~~Gd~~~-~~~~~~~~~~~v~~~~-----~~~~~~~~~~ 382 (415) T protein:vir:98 315 LIQPDVKEK------TQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKD-AIVLFDRSQYQASWTD-----YMHFGECLMI 382 (415) T ss_pred eeccCcCCC------CCceecceeeEEecccccCCCCccEEEEEehhc-cEEEEeecceEEEEec-----cccCceEEEE Confidence 998765432 34689999999999998654 79999998 4779999999999875 4567789999 Q ss_pred EEeecceeecccceEEEEeeCCCCCC Q lcl|NC_021309. 472 EERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 472 ~~r~~~~v~~~~a~~~l~~~~~a~~~ 497 (497) .+|+|+.|++|+||+++++++++.|+ T Consensus 383 ~~r~d~~v~~~~a~~~~~~~~~~~~~ 408 (415) T protein:vir:98 383 AVRQDCRILDYKSAIVIEYDDSERGE 408 (415) T ss_pred EEEeccEEeccccEEEEEEeccCCCC Confidence 99999999999999999999999999 No 42 >protein:vir:79987 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430002;genbank:gi:156604057;genbank:GeneID:5525447 Probab=100.00 E-value=9.8e-60 Score=344.09 Aligned_cols=399 Identities=12% Similarity=0.098 Sum_probs=251.4 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021309. 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) Q Consensus 1 ~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~a~~~~~~~~~~~~ 80 (497) |=...++++++.++.+++.....+..+.+.+.. .++.+++.++++.++++++.++.++.+. T Consensus 1 mk~~~el~~~l~el~~~~~~~~~e~~~~l~~~~-------------------~~~~~~~~~e~~~l~~~i~~~~~~~~~~ 61 (415) T protein:vir:79 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDE-------------------LEKAEKLEQEITDLRSQIQEKQEELDKL 61 (415) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHhchHH-------------------HHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 666666666555555544443332222211100 0111122222223333333222222211 Q ss_pred HHHHHHHHHHHHHhhhhhhHHHHhHhhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccccc Q lcl|NC_021309. 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFA 160 (497) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 160 (497) +........ .......... +... .....................++.+......... ......+++.+| T Consensus 62 ~~~~~~~~~-~~~~~~~~~~--~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~gg 130 (415) T protein:vir:79 62 KEKDGTSEN-NQQSVEVNEA--RTYR-------NQANINDLGISIQNTKVTSQEVRDFTEYLETRND-IQGGSLKTDSGF 130 (415) T ss_pred HHHHhhhhh-cccccccchh--hhHH-------HHHHHHHHhhhhhhhhhHHHHHHHHHHHHhhhhh-hhhccccccccc Confidence 111000000 0000000000 0000 0000000000000001111111222222211111 111222333445 Q ss_pred cccchhhhHHHHHHHHhhhhHHhhcceeecCCCceEEEE--EcCCCccceeccccccccc-ccccceeeEeeeeeEEeee Q lcl|NC_021309. 161 PGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLT--ESAAHNNAAAVAEAGTYPF-SSEEFARVYEQVGKVANAL 237 (497) Q Consensus 161 ~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~--~~~~~~~a~wv~Eg~~~~~-s~~~f~~i~~~~~kla~~~ 237 (497) .+||+++...|++.+++.++|+++|++++++++++.+|. .++ ...++|++|++.+|+ +.++|++|++.++++++++ T Consensus 131 ~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~ 209 (415) T protein:vir:79 131 VVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSE-VAALEKVEELEENPELAVKPFFQLAYDINTHRGYF 209 (415) T ss_pred cccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEeecC-CccceeeccccccCcccccceeeEEeeeeeeEeee Confidence 556667888999999999999999999999877666554 443 457899999999996 4689999999999999999 Q ss_pred hhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccchhhhhhhHHHHHHhhhhhcc Q lcl|NC_021309. 238 TITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTN 316 (497) Q Consensus 238 ~iS~ell~d~-~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 316 (497) +||+||++|+ ++|++||.++|+++++.++|.+|++|+|++.+.+............ T Consensus 210 ~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~~~----------------------- 266 (415) T protein:vir:79 210 RISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKL----------------------- 266 (415) T ss_pred hhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCcccccccccccccccc----------------------- Confidence 9999999987 5899999999999999999999999999887665543322111100 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHHHHHHHhhhcCce Q lcl|NC_021309. 317 GAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQY 396 (497) Q Consensus 317 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~ 396 (497) .......++++.+++..+...++ .+++|+||+.+|..|+++||++||| T Consensus 267 -------------------------------~~~~~~~~~~i~~~~~~~~~~~~-~~~~~v~n~~~~~~l~~lkd~~G~~ 314 (415) T protein:vir:79 267 -------------------------------EVKKAKSLDDIKDAINLNVKPNY-EHNVAIVSQTMFAKLDKMKDKLGNY 314 (415) T ss_pred -------------------------------ccccccchhHHHHHHHhhhhhcc-CCCEEEEcHHHHHHHHHhhccCCce Confidence 01111224566666666666554 4668999999999999999999999 Q ss_pred eccCcccccccccccccccccccceEecCCCCcCc-----eEEEeeccceEEEEeecccEEEeecccchhhhcCceEEEE Q lcl|NC_021309. 397 MGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGT-----ILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRA 471 (497) Q Consensus 397 i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~-----~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~ 471 (497) ||.+++.+. .+.+|+|+||++++.+|.++ ++||||++ +|.+++|.++++++++ |..+++.+|+ T Consensus 315 l~~~~~~~~------~~~~l~G~pV~~~~~~~~~~~~~~~~~~Gd~~~-~~~~~~~~~~~v~~~~-----~~~~~~~~~~ 382 (415) T protein:vir:79 315 LIQPDVKEK------TQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKD-AIVLFDRSQYQASWTD-----YMHFGECLMI 382 (415) T ss_pred eeccCcCCC------CCceecceeeEEecccccCCCCccEEEEEehhc-cEEEEeecceEEEEec-----cccCceEEEE Confidence 998765432 34689999999999998654 79999998 4779999999999875 4567789999 Q ss_pred EEeecceeecccceEEEEeeCCCCCC Q lcl|NC_021309. 472 EERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 472 ~~r~~~~v~~~~a~~~l~~~~~a~~~ 497 (497) .+|+|+.|++|+||+++++++++.|+ T Consensus 383 ~~r~d~~v~~~~a~~~~~~~~~~~~~ 408 (415) T protein:vir:79 383 AVRQDCRILDYKSAIVIEYDDSERGE 408 (415) T ss_pred EEEeccEEeccccEEEEEEeccCCCC Confidence 99999999999999999999999999 No 43 >protein:vir:81100 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429874;genbank:gi:156603927;genbank:GeneID:5525320 Probab=100.00 E-value=9.8e-60 Score=344.09 Aligned_cols=399 Identities=12% Similarity=0.098 Sum_probs=251.4 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021309. 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) Q Consensus 1 ~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~a~~~~~~~~~~~~ 80 (497) |=...++++++.++.+++.....+..+.+.+.. .++.+++.++++.++++++.++.++.+. T Consensus 1 mk~~~el~~~l~el~~~~~~~~~e~~~~l~~~~-------------------~~~~~~~~~e~~~l~~~i~~~~~~~~~~ 61 (415) T protein:vir:81 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDE-------------------LEKAEKLEQEITDLRSQIQEKQEELDKL 61 (415) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHhchHH-------------------HHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 666666666555555544443332222211100 0111122222223333333222222211 Q ss_pred HHHHHHHHHHHHHhhhhhhHHHHhHhhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccccc Q lcl|NC_021309. 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFA 160 (497) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 160 (497) +........ .......... +... .....................++.+......... ......+++.+| T Consensus 62 ~~~~~~~~~-~~~~~~~~~~--~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~gg 130 (415) T protein:vir:81 62 KEKDGTSEN-NQQSVEVNEA--RTYR-------NQANINDLGISIQNTKVTSQEVRDFTEYLETRND-IQGGSLKTDSGF 130 (415) T ss_pred HHHHhhhhh-cccccccchh--hhHH-------HHHHHHHHhhhhhhhhhHHHHHHHHHHHHhhhhh-hhhccccccccc Confidence 111000000 0000000000 0000 0000000000000001111111222222211111 111222333445 Q ss_pred cccchhhhHHHHHHHHhhhhHHhhcceeecCCCceEEEE--EcCCCccceeccccccccc-ccccceeeEeeeeeEEeee Q lcl|NC_021309. 161 PGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLT--ESAAHNNAAAVAEAGTYPF-SSEEFARVYEQVGKVANAL 237 (497) Q Consensus 161 ~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~--~~~~~~~a~wv~Eg~~~~~-s~~~f~~i~~~~~kla~~~ 237 (497) .+||+++...|++.+++.++|+++|++++++++++.+|. .++ ...++|++|++.+|+ +.++|++|++.++++++++ T Consensus 131 ~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~ 209 (415) T protein:vir:81 131 VVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSE-VAALEKVEELEENPELAVKPFFQLAYDINTHRGYF 209 (415) T ss_pred cccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEeecC-CccceeeccccccCcccccceeeEEeeeeeeEeee Confidence 556667888999999999999999999999877666554 443 457899999999996 4689999999999999999 Q ss_pred hhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccchhhhhhhHHHHHHhhhhhcc Q lcl|NC_021309. 238 TITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTN 316 (497) Q Consensus 238 ~iS~ell~d~-~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 316 (497) +||+||++|+ ++|++||.++|+++++.++|.+|++|+|++.+.+............ T Consensus 210 ~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~~~----------------------- 266 (415) T protein:vir:81 210 RISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKL----------------------- 266 (415) T ss_pred hhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCcccccccccccccccc----------------------- Confidence 9999999987 5899999999999999999999999999887665543322111100 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHHHHHHHhhhcCce Q lcl|NC_021309. 317 GAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQY 396 (497) Q Consensus 317 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~ 396 (497) .......++++.+++..+...++ .+++|+||+.+|..|+++||++||| T Consensus 267 -------------------------------~~~~~~~~~~i~~~~~~~~~~~~-~~~~~v~n~~~~~~l~~lkd~~G~~ 314 (415) T protein:vir:81 267 -------------------------------EVKKAKSLDDIKDAINLNVKPNY-EHNVAIVSQTMFAKLDKMKDKLGNY 314 (415) T ss_pred -------------------------------ccccccchhHHHHHHHhhhhhcc-CCCEEEEcHHHHHHHHHhhccCCce Confidence 01111224566666666666554 4668999999999999999999999 Q ss_pred eccCcccccccccccccccccccceEecCCCCcCc-----eEEEeeccceEEEEeecccEEEeecccchhhhcCceEEEE Q lcl|NC_021309. 397 MGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGT-----ILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRA 471 (497) Q Consensus 397 i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~-----~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~ 471 (497) ||.+++.+. .+.+|+|+||++++.+|.++ ++||||++ +|.+++|.++++++++ |..+++.+|+ T Consensus 315 l~~~~~~~~------~~~~l~G~pV~~~~~~~~~~~~~~~~~~Gd~~~-~~~~~~~~~~~v~~~~-----~~~~~~~~~~ 382 (415) T protein:vir:81 315 LIQPDVKEK------TQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKD-AIVLFDRSQYQASWTD-----YMHFGECLMI 382 (415) T ss_pred eeccCcCCC------CCceecceeeEEecccccCCCCccEEEEEehhc-cEEEEeecceEEEEec-----cccCceEEEE Confidence 998765432 34689999999999998654 79999998 4779999999999875 4567789999 Q ss_pred EEeecceeecccceEEEEeeCCCCCC Q lcl|NC_021309. 472 EERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 472 ~~r~~~~v~~~~a~~~l~~~~~a~~~ 497 (497) .+|+|+.|++|+||+++++++++.|+ T Consensus 383 ~~r~d~~v~~~~a~~~~~~~~~~~~~ 408 (415) T protein:vir:81 383 AVRQDCRILDYKSAIVIEYDDSERGE 408 (415) T ss_pred EEEeccEEeccccEEEEEEeccCCCC Confidence 99999999999999999999999999 No 44 >protein:vir:1084 Length: 437 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076738;genbank:gi:13095848;genbank:GeneID:920418 Probab=100.00 E-value=5e-59 Score=340.19 Aligned_cols=410 Identities=13% Similarity=0.070 Sum_probs=237.9 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhH-HHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021309. 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEA-HERAQEMLKSLGGADAAKDGLDNDIPE 79 (497) Q Consensus 1 ~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~-~e~~~e~~~~~~~~~a~~~~~~~~~~~ 79 (497) |= +.+++++++++..+++++..+ ++++.+...+..++++....++.. .++.+++.++++++.........+... T Consensus 1 Mk-i~elk~el~~~~~el~~~~~e----lr~~~~~~~~~~~el~~~~~e~~~~~~ei~el~~~l~~~~~~~~~~~e~~~~ 75 (437) T protein:vir:10 1 MK-IEKLKKDLATKTAELNTKKAE----IRSFTESEDKTIDEVKAGMTEIKEKEDEIKEIRSNIEVLEQASALKVEEKRD 75 (437) T ss_pred CC-HHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 53 455555555555554444332 222222222222222222222111 111223333333222222211111111 Q ss_pred HHHHHHHHHHH-----HHHhhhhhhHHHHhHhhhhhhhhhhhh----HHHHHHh-hhHHHHHHHHHHHHHhhhhhhhhhh Q lcl|NC_021309. 80 VEVRNLKQIRK-----HLARAVIMNPELKNATSFEKGTKFDVS----FNVSAKA-ADPGTAAAELMGAFADGETAPAAIG 149 (497) Q Consensus 80 ~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~ 149 (497) .......+... ........................... ....... .............+.... ...... T Consensus 76 ~~~~~~~e~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~e~~ 154 (437) T protein:vir:10 76 DSDLVAPELEENSADNEEDDPEKLKTETKSEAEKDKKTVKDEEKRDAGGLQDMKLKVGGEIADKKVTAFADYL-KTGEVR 154 (437) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHhHHHHHHHHHHHHhhhhhhHHHH-Hhhhhh Confidence 00000000000 000000000000000000000000000 0000000 000000000001111111 111111 Q ss_pred hhccccccccccccchhhhHHHHHHHHhhhhHHhhcceeecCCCceEEEEEcCCCccceeccccccccc-ccccceeeEe Q lcl|NC_021309. 150 QNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPF-SSEEFARVYE 228 (497) Q Consensus 150 ~~~~~~~~~~g~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~wv~Eg~~~~~-s~~~f~~i~~ 228 (497) ....++.+.+|. ++|+....+|..+...+.|+++|++++++++.+.+|+.....+.++|++|++..|+ ++++|++|++ T Consensus 155 ~~~~~~~~~~g~-lvp~~~~~~i~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~e~~~~~~~~v~~ 233 (437) T protein:vir:10 155 DVTGIALKDGKV-IIPETILTPEKEVHQFPRLGSLVRTESVTTTTGKLPIFNNSTDLLTAHTEYGQTTKNATPVITPILW 233 (437) T ss_pred hhhhcccccccc-cchHHHHHHHHHhhhhhhhhhcceeEeeccCceeeEEeeccccccccccccccccccccccceeeee Confidence 222233344444 44544455666778888999999999999999999998776778999999999996 6799999999 Q ss_pred eeeeEEeeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccchhhhhhhHHHH Q lcl|NC_021309. 229 QVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSN 307 (497) Q Consensus 229 ~~~kla~~~~iS~ell~d~-~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~ 307 (497) .+++++++++||+|||+|+ ++|++||.++|+++++.++|.+|++|+|++.|.+..... T Consensus 234 ~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~~~~~~~~~~i~~g~g~~~~~~~~~~~--------------------- 292 (437) T protein:vir:10 234 DLKTYTGGYVFSQELISDSSYDWQAELQSRLIELRDNTDDSLIITALTDGIKKTTSTYL--------------------- 292 (437) T ss_pred ehhheeeehhhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccccc--------------------- Confidence 9999999999999999997 479999999999999999999999999987654321110 Q ss_pred HHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHH-hhhhhhccCCceEEechhHHHHH Q lcl|NC_021309. 308 VKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFV-DIQLTLFQTPNAVVMNPRDWELL 386 (497) Q Consensus 308 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~n~~~~~~l 386 (497) .+++.+.+. .+...|+ .+.+|+||+.+|..| T Consensus 293 -----------------------------------------------~~~~~~~~~~~l~~~~~-~~~~~~~~~~~~~~l 324 (437) T protein:vir:10 293 -----------------------------------------------LGDLKKVLNVTLKPQDS-AAASIVMSQSAYNLF 324 (437) T ss_pred -----------------------------------------------hhhHHHHHHhhhhhhhh-cCCEEEEcHHHHHHH Confidence 011122221 2333333 445799999999999 Q ss_pred HHHhhhcCceeccCcccccccccccccccccccceEecCCC--CcC---c--eEEEeeccceEEEEeecccEEEeecccc Q lcl|NC_021309. 387 RLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLI--PLG---T--ILVGHFAPSVIQTARREGVTMQMTNSNG 459 (497) Q Consensus 387 ~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~--~~~---~--~~~gd~~~~~~~i~~r~~~~i~~~~~~~ 459 (497) +++||++|+|||++++..+ .+++|||+||++++++ |.. + ++||||++ +|.+++|.++++.+++. T Consensus 325 ~~lkd~~g~~~~~~~~~~~------~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~-~~~~~~r~~~~~~~~~~-- 395 (437) T protein:vir:10 325 DMATDAMGRPLLQPNVTAA------TGYTLLGKTVVIVDDKLFPSASAGDVNIVVAPLKK-AVINFKLTEITGQFQDT-- 395 (437) T ss_pred HHhhccCCCeeeccCccCC------CCcccccceeEEecccccCCcCCCceEEEEeeccc-cEEEEeeeceEEEEecc-- Confidence 9999999999998765432 3468999999998764 432 2 79999998 57899999999998753 Q ss_pred hhhhcCceEEEEEEeecceeecccceEEEEeeCCCCCC Q lcl|NC_021309. 460 TDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 460 ~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~~a~~~ 497 (497) |..+.+.+|+.+|+||.|++|+||++|+.+.++..+ T Consensus 396 --~~~~~~~~~~~~r~d~~~~~~~a~~~l~~~~~~~~~ 431 (437) T protein:vir:10 396 --YDIWYKQLGIFLRQNVVQASKDLIVNLTGKLKAVTV 431 (437) T ss_pred --cccccceeeEEEEEccEEecccceEEEEeecccccc Confidence 455667899999999999999999999988666666 No 45 >protein:vir:4830 Length: 397 # NCBI annotation: MPL-7201 # Family: family:all:21 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038327;genbank:gi:9634653;genbank:GeneID:1262632 Probab=100.00 E-value=5.1e-60 Score=345.65 Aligned_cols=378 Identities=14% Similarity=0.143 Sum_probs=247.8 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021309. 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) Q Consensus 1 ~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~a~~~~~~~~~~~~ 80 (497) |...-++++.+.++.++++++..... ++....... .++.+++.++++.+.++++.++...... T Consensus 1 Mk~~~el~~~~~~~~~~i~~~~~~~~----~~~~~~~~~-------------~ee~~~l~~ei~~~~~~~~~~~~~~~~~ 63 (397) T protein:vir:48 1 MKTSNELHDLWVAQGDKVENLNEKLN----VAMLDDSVT-------------AEELQAIKNERDTAKMKRDMFKEQYTEA 63 (397) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHH----Hhhcchhhh-------------HHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 66665555555444444443332211 111000000 1111222223333333333322222111 Q ss_pred HHHHHHHHHHHHHhhhhhhHHHHhHhhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccccc Q lcl|NC_021309. 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFA 160 (497) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 160 (497) ........ ... ........ . .. ..... ......+...... ........++++.+| T Consensus 64 ~~~~~~~~--~~~----------~~~~~~~~--~---~~--~~~~~-----~~~~~~~~~~~~~-~~~~~~~~~t~~~gg 118 (397) T protein:vir:48 64 RANEVVNM--SEE----------EKKPLTKS--E---EE--VKAGF-----VKDFKNLVRGRYQ-NLLDSKTDASGSDAG 118 (397) T ss_pred HHhhhhhh--hhh----------ccccccch--h---hH--HHHHH-----HHHHHHHHhhhhh-HHHHHhhccCCcccc Confidence 11100000 000 00000000 0 00 00000 0000000010000 011112233444566 Q ss_pred cccchhhhHHHHHHHHhhhhHHhhcceeecCCCceEEE--EEcCCCccceecccccccccc-cccceeeEeeeeeEEeee Q lcl|NC_021309. 161 PGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYL--TESAAHNNAAAVAEAGTYPFS-SEEFARVYEQVGKVANAL 237 (497) Q Consensus 161 ~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p--~~~~~~~~a~wv~Eg~~~~~s-~~~f~~i~~~~~kla~~~ 237 (497) .+||+++..+||+.+++.++|+++|+++++++++..+| +..+..+.++|++|++.+|++ +++|++|+++++++++++ T Consensus 119 ~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~ 198 (397) T protein:vir:48 119 LTIPQDIQTAIHTLVRQYDSLQEYVNVENVTTLTGSRVYEKWADITGLAKLDDEAGSIGTNDDPKLYPIRYAIKRYAGIS 198 (397) T ss_pred ccccHHHHHHHHHHHHHHHHHHhhhceeeccCCcceEEEEeecCCCcceeeeccccccccccccceeeEEeeheeeeeeh Confidence 77788899999999999999999999999988766555 444555678999999999987 699999999999999999 Q ss_pred hhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccchhhhhhhHHHHHHhhhhhcc Q lcl|NC_021309. 238 TITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTN 316 (497) Q Consensus 238 ~iS~ell~d~-~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 316 (497) +||+|+|+|+ .++++||.++|+++++.++|.+|++|+|++.+.+... T Consensus 199 ~iS~ell~ds~~~l~~~v~~~l~~~~~~~~d~~il~G~g~~~~~~~~~-------------------------------- 246 (397) T protein:vir:48 199 TVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILEAIATLPTKPTLT-------------------------------- 246 (397) T ss_pred hhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccc-------------------------------- Confidence 9999999987 5899999999999999999999999999876433111 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHHHHHHHhhhcCce Q lcl|NC_021309. 317 GAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQY 396 (497) Q Consensus 317 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~ 396 (497) .++++.+++..+...++ ++.+|+||+.+|..|+++||++|+| T Consensus 247 -------------------------------------~~d~i~~~~~~l~~~~~-~~a~~v~n~~~~~~L~~lkd~~G~~ 288 (397) T protein:vir:48 247 -------------------------------------KWDDIIDLQAKVDPAIK-QTSFFLTNTSGFTALKKVKNAFGDY 288 (397) T ss_pred -------------------------------------cHHHHHHHHHHhhhhhc-CCCEEEECHHHHHHHHHhhcCCCce Confidence 02344455556655544 5578999999999999999999999 Q ss_pred eccCcccccccccccccccccccceEecC--CCC-----cCceEEEeeccceEEEEeecccEEEeecccchhhhcCceEE Q lcl|NC_021309. 397 MGGNFFGNAYGNPVNGGKNIWGVPVVTTP--LIP-----LGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTV 469 (497) Q Consensus 397 i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~--~~~-----~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~ 469 (497) ||+++.... ...+|+|+||++++ .+| ...++||||++ +|.+++|.+++++++++.+.+|.+|++.| T Consensus 289 i~~~~~~~~------~~~~l~G~PV~~~~~~~~~~~~~~~~~~~~gd~~~-~~~~~~~~~~~i~~~~~~~~~~~~~~~~~ 361 (397) T protein:vir:48 289 LMERDVKSP------TGYSIDGFAVKEVADRWLANASSGAMPLYFGDLKQ-AVTLFDRQQMSLLSTNIGGGAFETDTTKI 361 (397) T ss_pred eeccCcCCC------CCceeccceeEEecccccCCcCCCceEEEEEeccc-eEEEEeecceEEEEeccchhhhhcCceeE Confidence 998765432 34689999998754 344 33479999998 57899999999999999888999999999 Q ss_pred EEEEeecceeecccceEEEEeeCCCCCC Q lcl|NC_021309. 470 RAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 470 r~~~r~~~~v~~~~a~~~l~~~~~a~~~ 497 (497) |+++|+|+.+++|+||++++++++++.+ T Consensus 362 r~~~r~d~~~~~~~a~~~~~~~~~~~~~ 389 (397) T protein:vir:48 362 RVIDRFDVVATDTESFVPASFKAIADQK 389 (397) T ss_pred EEEeeeccEEecccceEEEEecccccCC Confidence 9999999999999999999999998888 No 46 >protein:vir:9704 Length: 394 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795466;genbank:gi:28876225;genbank:GeneID:1257769 Probab=100.00 E-value=2e-59 Score=342.37 Aligned_cols=389 Identities=14% Similarity=0.103 Sum_probs=238.8 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021309. 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) Q Consensus 1 ~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~a~~~~~~~~~~~~ 80 (497) |- ++ ++++++++ +.++++.+....++.+... +.+..++.+++.++++.+++++++++.++... T Consensus 1 M~---------~~---~l~el~~~----l~e~~~~i~~~~~e~~~~~-~~~~~~~~~~l~~eie~l~~ei~~l~~~~~~~ 63 (394) T protein:vir:97 1 MF---------EE---KIKEIKAT----IADLNNTIVTKTAQVKNAL-ESDDLEAARSIKAEVEQAKANLVEAENDLKLY 63 (394) T ss_pred Cc---------HH---HHHHHHHH----HHHHHHHHHHHHHHHHHhh-chhhHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 21 11 12222211 1122222222222211110 01112223444455555556665555544433 Q ss_pred HHHHHHHHHHHHHhhhhhhHHHHhHhhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccc-ccc Q lcl|NC_021309. 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGST-GTF 159 (497) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~ 159 (497) +.....+............. . ......+.............. ........................+.+ +.+ T Consensus 64 e~~~e~~~~~~~~~~~~~~~-~-----~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~g 136 (394) T protein:vir:97 64 ESSVEVGGAENIGGKEVTQE-E-----KTYRESVNDFIRSKGKIVNDS-LRFEGKDEVLMPINETTPVEPQKDGIKKENA 136 (394) T ss_pred HHHhhhhccccccccccchh-h-----HHHHHHHHHHHHHHHHHhhhh-hhhhhHHHHHHHHHhhhhhhhhccccccccc Confidence 32211111000000000000 0 000000000000000000000 000000111111111111122222333 345 Q ss_pred ccccchhhhHHHHHHHHhhhhHHhhcceeecCCCceEEEEEcCCCccceeccccccccc-ccccceeeEeeeeeEEeeeh Q lcl|NC_021309. 160 APGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPF-SSEEFARVYEQVGKVANALT 238 (497) Q Consensus 160 g~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~wv~Eg~~~~~-s~~~f~~i~~~~~kla~~~~ 238 (497) |.+||+++...|++.+++.++|+++|++++++++++.+|+.+..++.++|++|++.+|+ ++++|++|++.++|++++++ T Consensus 137 g~liP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~i~ 216 (394) T protein:vir:97 137 KPVSSEEILYTPAREVKTVVDLKPFTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIP 216 (394) T ss_pred cccChHHHHHHHHHHhhhhhhhhhhceeeeccCcceEEEEEecCCCccceecccccccccccccceeEEeehhheeeehh Confidence 55667778889999999999999999999999999999998876778999999999997 67999999999999999999 Q ss_pred hhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccchhhhhhhHHHHHHhhhhhcch Q lcl|NC_021309. 239 ITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNG 317 (497) Q Consensus 239 iS~ell~d~-~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 317 (497) ||+|||+|+ +++++||.++|++++++++|.+|++|.+++.+.+..+ T Consensus 217 is~ell~ds~~~~~~~i~~~la~~~~~~~~~~i~~g~~~~~~~~~~~--------------------------------- 263 (394) T protein:vir:97 217 LSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVLKSFTTKTVKN--------------------------------- 263 (394) T ss_pred hHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccc--------------------------------- Confidence 999999997 5899999999999999999999999987765433211 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHHHHHHHhhhcCcee Q lcl|NC_021309. 318 AFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYM 397 (497) Q Consensus 318 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~i 397 (497) ++++..++...... +. +..|+||+.+|..|+++||++|||| T Consensus 264 -------------------------------------~~~~~~~~~~~~~~-~~-~a~~v~n~~~~~~l~~lkd~~G~~i 304 (394) T protein:vir:97 264 -------------------------------------LDEIKALLNGGFDP-AY-NVSLIVSQSFYQTLDTLKDGNGRYL 304 (394) T ss_pred -------------------------------------HHHHHHHHHhhhhh-hh-CCEEEEcHHHHHHHHHhhccCCCee Confidence 01111111111111 11 2469999999999999999999999 Q ss_pred ccCcccccccccccccccccccceEecC--CCCcCceEEEeeccceEEEEeecccEEEeecccchhhhcCceEEEEEEee Q lcl|NC_021309. 398 GGNFFGNAYGNPVNGGKNIWGVPVVTTP--LIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERL 475 (497) Q Consensus 398 ~~~~~~~~~~~~~~~~~~l~G~Pvv~~~--~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~ 475 (497) |++++.+. .+.+|||+||++++ .++.++++||||++ +|.+++|.+++++++++. .+...||+++|+ T Consensus 305 ~~~~~~~~------~~~~l~G~pv~~~~~~~~~~~~~~~gd~~~-~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~r~ 372 (394) T protein:vir:97 305 LQDDITAV------SGKVLLGKPVFVLSDEVLGANKAFIGDFKR-GVLFADRKDLGLRWADNE-----IYGQYLQAVLRF 372 (394) T ss_pred eecCcCCC------CCceeccceeEEecccccCCccEEEeeccc-cEEEEEecceEEEEeccc-----ccceeEEEEEEE Confidence 98765432 34689999999854 56777899999998 478999999999987643 345789999999 Q ss_pred cceeecccceEEEEeeCCCCCC Q lcl|NC_021309. 476 GLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 476 ~~~v~~~~a~~~l~~~~~a~~~ 497 (497) |+.|.+|+||++++++++++-= T Consensus 373 d~~v~~~~a~~~~~~~~~~~p~ 394 (394) T protein:vir:97 373 GVSKVDDKAGYYVTFTPEPLPL 394 (394) T ss_pred ccEEecccceEEEEecccccCC Confidence 9999999999999997666555 No 47 >protein:vir:102873 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338137;genbank:gi:77020198;genbank:GeneID:3703782 Probab=100.00 E-value=1e-59 Score=343.95 Aligned_cols=374 Identities=15% Similarity=0.121 Sum_probs=234.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021309. 14 LAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEVEVRNLKQIRKHLA 93 (497) Q Consensus 14 ~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~ 93 (497) |.++|++++++..+...++++.+.+ +..++.+++.++++.++++++..+.. .+.+..... . T Consensus 1 M~k~l~el~~~~~~~~~e~~~~~~~------------~~~~e~~~~~~e~~~l~~~i~~~~~~-~~~~~~~~~------~ 61 (392) T protein:vir:10 1 MSKELRELLAKLEGKKEEVRSLMGE------------DKVAEAEQMMEEVRSLQKKIDLQRSL-DEAETEERN------N 61 (392) T ss_pred CcHHHHHHHHHHHHHHHHHHHHhhH------------HHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHhh------c Confidence 3333433333322222222222111 00011112222223333333221110 000000000 0 Q ss_pred hhhhhhHHHHhHhhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccccccccchhhhHHHHH Q lcl|NC_021309. 94 RAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVE 173 (497) Q Consensus 94 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~p~~~~~ii~ 173 (497) .......... ...+....+.. ..... .....................++++++|.+||+++..+|++ T Consensus 62 ~~~~~~~~~~--~~~~~~~~~~~-------~l~~~----~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~ 128 (392) T protein:vir:10 62 GREVETRNVD--GEMEYRDVFMK-------ALRNK----PLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINE 128 (392) T ss_pred cccccccCcc--chHHHHHHHHH-------HHhcc----cccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHH Confidence 0000000000 00000000000 00000 00000000000111112233344455666777788899999 Q ss_pred HHHhhhhHHhhcceeecCCCceE--EEEEcCCCccceecccccccccc-cccceeeEeeeeeEEeeehhhHHHHhhH-HH Q lcl|NC_021309. 174 QLFYELSLADLISSRPVTSPNLS--YLTESAAHNNAAAVAEAGTYPFS-SEEFARVYEQVGKVANALTITDEGLRDA-PE 249 (497) Q Consensus 174 ~~~~~~~l~~~~~~~~~~~~~~~--~p~~~~~~~~a~wv~Eg~~~~~s-~~~f~~i~~~~~kla~~~~iS~ell~d~-~~ 249 (497) .+++.++|+++|+++++++++.. +|+.++ .+.++|++|++.+|++ .++|++|++.++|++++++||+|||+|+ ++ T Consensus 129 ~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~-~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~ 207 (392) T protein:vir:10 129 LARSFDALEQYVTVEPVRTRSGSRVLEKNSD-MIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQN 207 (392) T ss_pred HHHhhhhhhhhceeeeccCCceeEEEEeecC-CccceeecccccccccccccceeEEeeeeeEEEeehhhHHHHhhhHHH Confidence 99999999999999999876654 555554 4578999999999976 6999999999999999999999999987 68 Q ss_pred HHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhh Q lcl|NC_021309. 250 LFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLK 329 (497) Q Consensus 250 l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 329 (497) |++||.+.|++++++++|.+|++|+|++.+.|+.+ T Consensus 208 l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~~~~~~--------------------------------------------- 242 (392) T protein:vir:10 208 ILKYVTKWLGKKSKVTRNVLILGVIEKLTKQAIKS--------------------------------------------- 242 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhccccccccCccC--------------------------------------------- Confidence 99999999999999999999999999876544321 Q ss_pred hhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHHHHHHHhhhcCceeccCccccccccc Q lcl|NC_021309. 330 YGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNP 409 (497) Q Consensus 330 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~ 409 (497) ++++.+++.....+.+..+..|+||+.+|..|+++||++|||||+++.... T Consensus 243 -------------------------~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~---- 293 (392) T protein:vir:10 243 -------------------------LDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQK---- 293 (392) T ss_pred -------------------------HHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCCeEeecCccCC---- Confidence 112222222112233445567999999999999999999999998765432 Q ss_pred ccccccccccceEe-cCCC-C------cC--ceEEEeeccceEEEEeecccEEEeecccchhhhcCceEEEEEEeeccee Q lcl|NC_021309. 410 VNGGKNIWGVPVVT-TPLI-P------LG--TILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLV 479 (497) Q Consensus 410 ~~~~~~l~G~Pvv~-~~~~-~------~~--~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v 479 (497) ..++|+|+|+|+ ++.+ + .+ .++||||++ +|.+++|.+++++++++.+.+|++|++.||++.|+||.| T Consensus 294 --~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~-~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v 370 (392) T protein:vir:10 294 --NKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKE-AIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQM 370 (392) T ss_pred --ccccccCcccEEEecccccCCCcccCCceEEEEEehhc-eEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEE Confidence 346899987665 3222 1 12 278999998 588999999999999998889999999999999999999 Q ss_pred ecccceEEEEeeCCCCCC Q lcl|NC_021309. 480 YRPSAFQLIQLKKGATGS 497 (497) Q Consensus 480 ~~~~a~~~l~~~~~a~~~ 497 (497) ++|+||+++++++++... T Consensus 371 ~~~~a~~~l~~~~~a~~~ 388 (392) T protein:vir:10 371 WDNEAAVYGEIDLSAPVE 388 (392) T ss_pred ecccceEEEEeccccccc Confidence 999999999998877766 No 48 >protein:vir:102082 Length: 392 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512315;genbank:gi:89152484;genbank:GeneID:3953075 Probab=100.00 E-value=1e-59 Score=343.95 Aligned_cols=374 Identities=15% Similarity=0.121 Sum_probs=234.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021309. 14 LAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEVEVRNLKQIRKHLA 93 (497) Q Consensus 14 ~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~ 93 (497) |.++|++++++..+...++++.+.+ +..++.+++.++++.++++++..+.. .+.+..... . T Consensus 1 M~k~l~el~~~~~~~~~e~~~~~~~------------~~~~e~~~~~~e~~~l~~~i~~~~~~-~~~~~~~~~------~ 61 (392) T protein:vir:10 1 MSKELRELLAKLEGKKEEVRSLMGE------------DKVAEAEQMMEEVRSLQKKIDLQRSL-DEAETEERN------N 61 (392) T ss_pred CcHHHHHHHHHHHHHHHHHHHHhhH------------HHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHhh------c Confidence 3333433333322222222222111 00011112222223333333221110 000000000 0 Q ss_pred hhhhhhHHHHhHhhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccccccccchhhhHHHHH Q lcl|NC_021309. 94 RAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVE 173 (497) Q Consensus 94 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~p~~~~~ii~ 173 (497) .......... ...+....+.. ..... .....................++++++|.+||+++..+|++ T Consensus 62 ~~~~~~~~~~--~~~~~~~~~~~-------~l~~~----~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~ 128 (392) T protein:vir:10 62 GREVETRNVD--GEMEYRDVFMK-------ALRNK----PLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINE 128 (392) T ss_pred cccccccCcc--chHHHHHHHHH-------HHhcc----cccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHH Confidence 0000000000 00000000000 00000 00000000000111112233344455666777788899999 Q ss_pred HHHhhhhHHhhcceeecCCCceE--EEEEcCCCccceecccccccccc-cccceeeEeeeeeEEeeehhhHHHHhhH-HH Q lcl|NC_021309. 174 QLFYELSLADLISSRPVTSPNLS--YLTESAAHNNAAAVAEAGTYPFS-SEEFARVYEQVGKVANALTITDEGLRDA-PE 249 (497) Q Consensus 174 ~~~~~~~l~~~~~~~~~~~~~~~--~p~~~~~~~~a~wv~Eg~~~~~s-~~~f~~i~~~~~kla~~~~iS~ell~d~-~~ 249 (497) .+++.++|+++|+++++++++.. +|+.++ .+.++|++|++.+|++ .++|++|++.++|++++++||+|||+|+ ++ T Consensus 129 ~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~-~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~ 207 (392) T protein:vir:10 129 LARSFDALEQYVTVEPVRTRSGSRVLEKNSD-MIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQN 207 (392) T ss_pred HHHhhhhhhhhceeeeccCCceeEEEEeecC-CccceeecccccccccccccceeEEeeeeeEEEeehhhHHHHhhhHHH Confidence 99999999999999999876654 555554 4578999999999976 6999999999999999999999999987 68 Q ss_pred HHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhh Q lcl|NC_021309. 250 LFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLK 329 (497) Q Consensus 250 l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 329 (497) |++||.+.|++++++++|.+|++|+|++.+.|+.+ T Consensus 208 l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~~~~~~--------------------------------------------- 242 (392) T protein:vir:10 208 ILKYVTKWLGKKSKVTRNVLILGVIEKLTKQAIKS--------------------------------------------- 242 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhccccccccCccC--------------------------------------------- Confidence 99999999999999999999999999876544321 Q ss_pred hhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHHHHHHHhhhcCceeccCccccccccc Q lcl|NC_021309. 330 YGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNP 409 (497) Q Consensus 330 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~ 409 (497) ++++.+++.....+.+..+..|+||+.+|..|+++||++|||||+++.... T Consensus 243 -------------------------~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~---- 293 (392) T protein:vir:10 243 -------------------------LDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQK---- 293 (392) T ss_pred -------------------------HHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCCeEeecCccCC---- Confidence 112222222112233445567999999999999999999999998765432 Q ss_pred ccccccccccceEe-cCCC-C------cC--ceEEEeeccceEEEEeecccEEEeecccchhhhcCceEEEEEEeeccee Q lcl|NC_021309. 410 VNGGKNIWGVPVVT-TPLI-P------LG--TILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLV 479 (497) Q Consensus 410 ~~~~~~l~G~Pvv~-~~~~-~------~~--~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v 479 (497) ..++|+|+|+|+ ++.+ + .+ .++||||++ +|.+++|.+++++++++.+.+|++|++.||++.|+||.| T Consensus 294 --~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~-~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v 370 (392) T protein:vir:10 294 --NKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKE-AIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQM 370 (392) T ss_pred --ccccccCcccEEEecccccCCCcccCCceEEEEEehhc-eEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEE Confidence 346899987665 3222 1 12 278999998 588999999999999998889999999999999999999 Q ss_pred ecccceEEEEeeCCCCCC Q lcl|NC_021309. 480 YRPSAFQLIQLKKGATGS 497 (497) Q Consensus 480 ~~~~a~~~l~~~~~a~~~ 497 (497) ++|+||+++++++++... T Consensus 371 ~~~~a~~~l~~~~~a~~~ 388 (392) T protein:vir:10 371 WDNEAAVYGEIDLSAPVE 388 (392) T ss_pred ecccceEEEEeccccccc Confidence 999999999998877766 No 49 >protein:vir:107593 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338188;genbank:gi:77020144;genbank:GeneID:3703724 Probab=100.00 E-value=1e-59 Score=343.95 Aligned_cols=374 Identities=15% Similarity=0.121 Sum_probs=234.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021309. 14 LAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEVEVRNLKQIRKHLA 93 (497) Q Consensus 14 ~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~ 93 (497) |.++|++++++..+...++++.+.+ +..++.+++.++++.++++++..+.. .+.+..... . T Consensus 1 M~k~l~el~~~~~~~~~e~~~~~~~------------~~~~e~~~~~~e~~~l~~~i~~~~~~-~~~~~~~~~------~ 61 (392) T protein:vir:10 1 MSKELRELLAKLEGKKEEVRSLMGE------------DKVAEAEQMMEEVRSLQKKIDLQRSL-DEAETEERN------N 61 (392) T ss_pred CcHHHHHHHHHHHHHHHHHHHHhhH------------HHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHhh------c Confidence 3333433333322222222222111 00011112222223333333221110 000000000 0 Q ss_pred hhhhhhHHHHhHhhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccccccccchhhhHHHHH Q lcl|NC_021309. 94 RAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVE 173 (497) Q Consensus 94 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~p~~~~~ii~ 173 (497) .......... ...+....+.. ..... .....................++++++|.+||+++..+|++ T Consensus 62 ~~~~~~~~~~--~~~~~~~~~~~-------~l~~~----~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~ 128 (392) T protein:vir:10 62 GREVETRNVD--GEMEYRDVFMK-------ALRNK----PLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINE 128 (392) T ss_pred cccccccCcc--chHHHHHHHHH-------HHhcc----cccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHH Confidence 0000000000 00000000000 00000 00000000000111112233344455666777788899999 Q ss_pred HHHhhhhHHhhcceeecCCCceE--EEEEcCCCccceecccccccccc-cccceeeEeeeeeEEeeehhhHHHHhhH-HH Q lcl|NC_021309. 174 QLFYELSLADLISSRPVTSPNLS--YLTESAAHNNAAAVAEAGTYPFS-SEEFARVYEQVGKVANALTITDEGLRDA-PE 249 (497) Q Consensus 174 ~~~~~~~l~~~~~~~~~~~~~~~--~p~~~~~~~~a~wv~Eg~~~~~s-~~~f~~i~~~~~kla~~~~iS~ell~d~-~~ 249 (497) .+++.++|+++|+++++++++.. +|+.++ .+.++|++|++.+|++ .++|++|++.++|++++++||+|||+|+ ++ T Consensus 129 ~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~-~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~ 207 (392) T protein:vir:10 129 LARSFDALEQYVTVEPVRTRSGSRVLEKNSD-MIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQN 207 (392) T ss_pred HHHhhhhhhhhceeeeccCCceeEEEEeecC-CccceeecccccccccccccceeEEeeeeeEEEeehhhHHHHhhhHHH Confidence 99999999999999999876654 555554 4578999999999976 6999999999999999999999999987 68 Q ss_pred HHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhh Q lcl|NC_021309. 250 LFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLK 329 (497) Q Consensus 250 l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 329 (497) |++||.+.|++++++++|.+|++|+|++.+.|+.+ T Consensus 208 l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~~~~~~--------------------------------------------- 242 (392) T protein:vir:10 208 ILKYVTKWLGKKSKVTRNVLILGVIEKLTKQAIKS--------------------------------------------- 242 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhccccccccCccC--------------------------------------------- Confidence 99999999999999999999999999876544321 Q ss_pred hhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHHHHHHHhhhcCceeccCccccccccc Q lcl|NC_021309. 330 YGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNP 409 (497) Q Consensus 330 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~ 409 (497) ++++.+++.....+.+..+..|+||+.+|..|+++||++|||||+++.... T Consensus 243 -------------------------~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~---- 293 (392) T protein:vir:10 243 -------------------------LDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQK---- 293 (392) T ss_pred -------------------------HHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCCeEeecCccCC---- Confidence 112222222112233445567999999999999999999999998765432 Q ss_pred ccccccccccceEe-cCCC-C------cC--ceEEEeeccceEEEEeecccEEEeecccchhhhcCceEEEEEEeeccee Q lcl|NC_021309. 410 VNGGKNIWGVPVVT-TPLI-P------LG--TILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLV 479 (497) Q Consensus 410 ~~~~~~l~G~Pvv~-~~~~-~------~~--~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v 479 (497) ..++|+|+|+|+ ++.+ + .+ .++||||++ +|.+++|.+++++++++.+.+|++|++.||++.|+||.| T Consensus 294 --~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~-~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v 370 (392) T protein:vir:10 294 --NKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKE-AIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQM 370 (392) T ss_pred --ccccccCcccEEEecccccCCCcccCCceEEEEEehhc-eEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEE Confidence 346899987665 3222 1 12 278999998 588999999999999998889999999999999999999 Q ss_pred ecccceEEEEeeCCCCCC Q lcl|NC_021309. 480 YRPSAFQLIQLKKGATGS 497 (497) Q Consensus 480 ~~~~a~~~l~~~~~a~~~ 497 (497) ++|+||+++++++++... T Consensus 371 ~~~~a~~~l~~~~~a~~~ 388 (392) T protein:vir:10 371 WDNEAAVYGEIDLSAPVE 388 (392) T ss_pred ecccceEEEEeccccccc Confidence 999999999998877766 No 50 >protein:vir:105004 Length: 392 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459969;genbank:gi:85701384;genbank:GeneID:3882145 Probab=100.00 E-value=1e-59 Score=343.95 Aligned_cols=374 Identities=15% Similarity=0.121 Sum_probs=234.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021309. 14 LAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEVEVRNLKQIRKHLA 93 (497) Q Consensus 14 ~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~ 93 (497) |.++|++++++..+...++++.+.+ +..++.+++.++++.++++++..+.. .+.+..... . T Consensus 1 M~k~l~el~~~~~~~~~e~~~~~~~------------~~~~e~~~~~~e~~~l~~~i~~~~~~-~~~~~~~~~------~ 61 (392) T protein:vir:10 1 MSKELRELLAKLEGKKEEVRSLMGE------------DKVAEAEQMMEEVRSLQKKIDLQRSL-DEAETEERN------N 61 (392) T ss_pred CcHHHHHHHHHHHHHHHHHHHHhhH------------HHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHhh------c Confidence 3333433333322222222222111 00011112222223333333221110 000000000 0 Q ss_pred hhhhhhHHHHhHhhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccccccccchhhhHHHHH Q lcl|NC_021309. 94 RAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVE 173 (497) Q Consensus 94 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~p~~~~~ii~ 173 (497) .......... ...+....+.. ..... .....................++++++|.+||+++..+|++ T Consensus 62 ~~~~~~~~~~--~~~~~~~~~~~-------~l~~~----~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~ 128 (392) T protein:vir:10 62 GREVETRNVD--GEMEYRDVFMK-------ALRNK----PLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINE 128 (392) T ss_pred cccccccCcc--chHHHHHHHHH-------HHhcc----cccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHH Confidence 0000000000 00000000000 00000 00000000000111112233344455666777788899999 Q ss_pred HHHhhhhHHhhcceeecCCCceE--EEEEcCCCccceecccccccccc-cccceeeEeeeeeEEeeehhhHHHHhhH-HH Q lcl|NC_021309. 174 QLFYELSLADLISSRPVTSPNLS--YLTESAAHNNAAAVAEAGTYPFS-SEEFARVYEQVGKVANALTITDEGLRDA-PE 249 (497) Q Consensus 174 ~~~~~~~l~~~~~~~~~~~~~~~--~p~~~~~~~~a~wv~Eg~~~~~s-~~~f~~i~~~~~kla~~~~iS~ell~d~-~~ 249 (497) .+++.++|+++|+++++++++.. +|+.++ .+.++|++|++.+|++ .++|++|++.++|++++++||+|||+|+ ++ T Consensus 129 ~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~-~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~ 207 (392) T protein:vir:10 129 LARSFDALEQYVTVEPVRTRSGSRVLEKNSD-MIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQN 207 (392) T ss_pred HHHhhhhhhhhceeeeccCCceeEEEEeecC-CccceeecccccccccccccceeEEeeeeeEEEeehhhHHHHhhhHHH Confidence 99999999999999999876654 555554 4578999999999976 6999999999999999999999999987 68 Q ss_pred HHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhh Q lcl|NC_021309. 250 LFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLK 329 (497) Q Consensus 250 l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 329 (497) |++||.+.|++++++++|.+|++|+|++.+.|+.+ T Consensus 208 l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~~~~~~--------------------------------------------- 242 (392) T protein:vir:10 208 ILKYVTKWLGKKSKVTRNVLILGVIEKLTKQAIKS--------------------------------------------- 242 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhccccccccCccC--------------------------------------------- Confidence 99999999999999999999999999876544321 Q ss_pred hhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHHHHHHHhhhcCceeccCccccccccc Q lcl|NC_021309. 330 YGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNP 409 (497) Q Consensus 330 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~ 409 (497) ++++.+++.....+.+..+..|+||+.+|..|+++||++|||||+++.... T Consensus 243 -------------------------~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~---- 293 (392) T protein:vir:10 243 -------------------------LDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQK---- 293 (392) T ss_pred -------------------------HHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCCeEeecCccCC---- Confidence 112222222112233445567999999999999999999999998765432 Q ss_pred ccccccccccceEe-cCCC-C------cC--ceEEEeeccceEEEEeecccEEEeecccchhhhcCceEEEEEEeeccee Q lcl|NC_021309. 410 VNGGKNIWGVPVVT-TPLI-P------LG--TILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLV 479 (497) Q Consensus 410 ~~~~~~l~G~Pvv~-~~~~-~------~~--~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v 479 (497) ..++|+|+|+|+ ++.+ + .+ .++||||++ +|.+++|.+++++++++.+.+|++|++.||++.|+||.| T Consensus 294 --~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~-~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v 370 (392) T protein:vir:10 294 --NKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKE-AIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQM 370 (392) T ss_pred --ccccccCcccEEEecccccCCCcccCCceEEEEEehhc-eEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEE Confidence 346899987665 3222 1 12 278999998 588999999999999998889999999999999999999 Q ss_pred ecccceEEEEeeCCCCCC Q lcl|NC_021309. 480 YRPSAFQLIQLKKGATGS 497 (497) Q Consensus 480 ~~~~a~~~l~~~~~a~~~ 497 (497) ++|+||+++++++++... T Consensus 371 ~~~~a~~~l~~~~~a~~~ 388 (392) T protein:vir:10 371 WDNEAAVYGEIDLSAPVE 388 (392) T ss_pred ecccceEEEEeccccccc Confidence 999999999998877766 No 51 >protein:vir:962 Length: 397 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076616;genbank:gi:13095724;genbank:GeneID:920264 Probab=100.00 E-value=1e-58 Score=338.56 Aligned_cols=389 Identities=16% Similarity=0.142 Sum_probs=241.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021309. 4 TAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEVEVR 83 (497) Q Consensus 4 ~a~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~a~~~~~~~~~~~~~~~ 83 (497) ++.-+. .+.+++++++++. +++.++++.+.+...+++....+....+...++.+++++++.+++.++.++.+.+.. T Consensus 1 m~~k~~---~l~~~~~el~~~l-~eL~e~~~~l~~~~~el~~~~ee~~~~e~~~~~~~~~~~l~~~i~~l~~~i~~~~~~ 76 (397) T protein:vir:96 1 MALKQL---ILNKQIKERSSEI-DKLLSQRSDLEKQENDLERALEEAKTDEEISTVSDSADDLEKQVKDLDEKIAELQKE 76 (397) T ss_pred CcHHHH---HHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 222221 2333344433321 222222222222222222111111112233444455555555555555555443322 Q ss_pred HHHHHHHHHHhhhhhhHHHHhHhhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhcccccccccccc Q lcl|NC_021309. 84 NLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGI 163 (497) Q Consensus 84 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v 163 (497) ... .............. ...... ........ .........+..+....... ............+|..+ T Consensus 77 ~~~-l~~~~~~~~~~~~~--~~~~~~--~~~~~~~~------~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~v 144 (397) T protein:vir:96 77 KQD-LEDELAKAADPTDQ--KPKDGE--KRKMKKFK------VTEEELAEKRSAINAFVKSK-GAEKRDGFTSVEGGALI 144 (397) T ss_pred HHH-HHHHHHhhhhhhhh--hhHHHH--HHHHHHHh------hhhHHHHHHHHHHHHHHHhh-hhhhhhcccccccccch Confidence 111 00000000000000 000000 00000000 00000111111111111111 11122223445556666 Q ss_pred chhhhHHHHHHHHhhhhHHhhcceeecCCCceEEEEEcCCCccceeccccccccc-ccccceeeEeeeeeEEeeehhhHH Q lcl|NC_021309. 164 LPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPF-SSEEFARVYEQVGKVANALTITDE 242 (497) Q Consensus 164 ~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~wv~Eg~~~~~-s~~~f~~i~~~~~kla~~~~iS~e 242 (497) |+++...|++ +....+++++|++++++++++.+|.....+..++|++|++..|+ ++++|++|+++++++++++++|++ T Consensus 145 p~~~~~~i~~-~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~i~~~~~~~~~~~~~s~e 223 (397) T protein:vir:96 145 PQELLQPQLE-PKDIVDLSKYVRSVPVNSASGKFPVISKSGSKMATVQQLEKNPQLANPKMVEIDYSVATRRGYIPISQE 223 (397) T ss_pred hHHHHHHHHH-hhhhhhHHHhhhhccccccceeEEEEeccCCccccccccccccccccccccceeecHhHhhcchhhHHH Confidence 7777777876 57778999999999999999999988776678899999999996 689999999999999999999999 Q ss_pred HHhhH-HHHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccchhhhhhhHHHHHHhhhhhcchhhhh Q lcl|NC_021309. 243 GLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVG 321 (497) Q Consensus 243 ll~d~-~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 321 (497) ||+|+ +++++||.+.|+++++.+++.+|++|+|++.|.|+.+. T Consensus 224 ll~ds~~~l~~~i~~~l~~~~~~~~~~~i~~g~g~~~~~~~~~~------------------------------------ 267 (397) T protein:vir:96 224 MIDDASYDVTGLIADEIQDQSLNTKNADIAAVLKTATAKSVVGV------------------------------------ 267 (397) T ss_pred HHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccch------------------------------------ Confidence 99997 57999999999999999999999999998877664321 Q ss_pred hhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHHHHHHHhhhcCceeccCc Q lcl|NC_021309. 322 QDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGNF 401 (497) Q Consensus 322 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~i~~~~ 401 (497) +++...+......++ +.+|+||+.+|..|+++||++|||||+++ T Consensus 268 ----------------------------------d~~~~~~~~~~~~~~--~a~~v~n~~~~~~l~~lkd~~G~~~~~~~ 311 (397) T protein:vir:96 268 ----------------------------------DGLKDLINKEIKKVY--DVKLFISASMYSELDKLKDKNGRYLLQDS 311 (397) T ss_pred ----------------------------------HHHHHHHHHhhhhhc--CcEEEEcHHHHHHHHHhhccCCCeEeccC Confidence 112222222222222 35799999999999999999999999876 Q ss_pred ccccccccccccccccccceEecCCCCcC------ceEEEeeccceEEEEeecccEEEeecccchhhhcCceEEEEEEee Q lcl|NC_021309. 402 FGNAYGNPVNGGKNIWGVPVVTTPLIPLG------TILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERL 475 (497) Q Consensus 402 ~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~------~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~ 475 (497) +.+. .+++|||+||++++.+..+ .++||||++ +|.+++|.++++.++++.. +.+.+|+++|+ T Consensus 312 ~~~~------~~~~l~G~pv~~~~~~~~~~~~~~~~~~~gd~~~-~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~r~ 379 (397) T protein:vir:96 312 ITAA------SGKQLLGKEVVVLDDDVIGKSVGNVVGFIGDAKA-FASFFDRKQVSVSWVDNNI-----YGQLLAGIIRY 379 (397) T ss_pred ccCC------CcccccccceEEecccccCCCCCceEEEEeehhc-ceEeEeecceEEEEecccc-----cceeEEEEEEE Confidence 5442 3468999999876654322 379999999 4789999999999887532 35789999999 Q ss_pred cceeecccceEEEEeeCC Q lcl|NC_021309. 476 GLLVYRPSAFQLIQLKKG 493 (497) Q Consensus 476 ~~~v~~~~a~~~l~~~~~ 493 (497) ||.|++|+||++|+++++ T Consensus 380 d~~~~~~~a~~~~~~~~a 397 (397) T protein:vir:96 380 DVKATDKKAGFYVTFTIG 397 (397) T ss_pred ccEEecccceEEEEeecC Confidence 999999999999999999 No 52 >protein:vir:100172 Length: 394 # NCBI annotation: putative major head protein # Family: family:all:21 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025031;genbank:gi:48697264;genbank:GeneID:2948270 Probab=100.00 E-value=4e-59 Score=340.77 Aligned_cols=379 Identities=17% Similarity=0.181 Sum_probs=239.9 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021309. 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) Q Consensus 1 ~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~a~~~~~~~~~~~~ 80 (497) |-.+.++.+++++. +.++++.+.....+.+ . ..++..++.++++.++++++.++.++++. T Consensus 1 M~~l~~l~~~~~~~--------------~~e~~~~~~~~~~~~~---~---~~ee~~~~~~~~~~~~~~~~~l~~~i~~~ 60 (394) T protein:vir:10 1 MDKLQTLFNEVSAK--------------CADLNAQLNAKLQDEN---A---SVDDFQKIKDDLTAAKARRDAINDQIKDL 60 (394) T ss_pred ChHHHHHHHHHHHH--------------HHHHHHHHHHHHhhhh---c---cHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 33333322222222 2222222221111100 0 01112233333444444444444444333 Q ss_pred HHHHHHHHHHHHHhhhhhhHHHHhHhhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccccc Q lcl|NC_021309. 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFA 160 (497) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 160 (497) +.......... .... ...+ ........... .. ......+...... ........++++.+| T Consensus 61 e~~~~~~~~~~--~~~~--~~~~-----~~~~~~~~~~~-~~---------~~~~~~~l~~~~~-~~~~~~~~~t~~~gg 120 (394) T protein:vir:10 61 EAENKANSDPD--KPVD--NAQP-----NGTDLKKKPID-AK---------KKAINDFIHSHGK-VIDNAAGHVTSTEAG 120 (394) T ss_pred HHHHHhhcchh--hhhh--hhcc-----cccchhhhHHH-HH---------HHHHHHHHhccch-hhhhhhcccccccCc Confidence 22211000000 0000 0000 00000000000 00 0000111111111 111122334445566 Q ss_pred cccchhhhHHHHHHHHhhhhHHhhcceeecCCCceEEEEEcCCCccceeccccccccc-ccccceeeEeeeeeEEeeehh Q lcl|NC_021309. 161 PGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPF-SSEEFARVYEQVGKVANALTI 239 (497) Q Consensus 161 ~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~wv~Eg~~~~~-s~~~f~~i~~~~~kla~~~~i 239 (497) .+||+++..+|++.+++.++|+++|++++++++++.||.....++.+.|++|++..|+ ++++|++|++.+++++++++| T Consensus 121 ~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v~l~~~k~~~~~~i 200 (394) T protein:vir:10 121 VLIPEEIIYDPTAEVNSVVDLSTLVTKTPVTTPKGTYPILKRATDRFSSVAELAENPALAEPEFEQVDWSVSTYRGAIPL 200 (394) T ss_pred eeccHHHHHHHHHHHHhhhhhhhhceeeeccCCceEEEEEecCCCccccccccccccccccccceeEEeeeeeeEeeehh Confidence 6677788899999999999999999999999999999998876678899999999996 689999999999999999999 Q ss_pred hHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccchhhhhhhHHHHHHhhhhhcchh Q lcl|NC_021309. 240 TDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGA 318 (497) Q Consensus 240 S~ell~d~-~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 318 (497) |+|||+|+ ++|++||.++|+++++.++|.+|++|+|++.+.++.+..+ T Consensus 201 S~ell~ds~~~l~~~i~~~la~~~~~~~~~~il~g~g~~~~~~~~~~~~------------------------------- 249 (394) T protein:vir:10 201 SEEAIADSAVDLTSLVGQSINEKSVNTYNAMIAPVLQSFTAKATTTDTL------------------------------- 249 (394) T ss_pred HHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccccc------------------------------- Confidence 99999997 6899999999999999999999999999876655432111 Q ss_pred hhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHH-hhhhhhccCCceEEechhHHHHHHHHhhhcCcee Q lcl|NC_021309. 319 FVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFV-DIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYM 397 (497) Q Consensus 319 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~i 397 (497) .+.+.+.+. .+... + ..+|+||+.+|..|+++||++|||| T Consensus 250 ------------------------------------~d~l~~~~~~~~~~~-~--~a~~vmn~~~~~~l~~lkd~~G~~i 290 (394) T protein:vir:10 250 ------------------------------------VDSLKHILNVDLDPA-Y--SRALVVTQSLFNTLDTLKDKNGRYL 290 (394) T ss_pred ------------------------------------HHHHHHHHHhhhhhh-c--cCEEEecHHHHHHHHHhhccCCCee Confidence 011112111 11111 1 3579999999999999999999999 Q ss_pred ccCcccccccccccccccccccceEecCCC--CcC----ceEEEeeccceEEEEeecccEEEeecccchhhhcCceEEEE Q lcl|NC_021309. 398 GGNFFGNAYGNPVNGGKNIWGVPVVTTPLI--PLG----TILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRA 471 (497) Q Consensus 398 ~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~--~~~----~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~ 471 (497) |++....... ...+.+|||+||++++.. |.+ .++||||++ +|.++++.+++++++++.. |. +.+|+ T Consensus 291 ~~~~~~~~~~--~~~~~~L~G~PV~~~~~~~~~~~~~~~~i~~gd~s~-~~~~~~~~~~~v~~~~~~~--~~---~~~~~ 362 (394) T protein:vir:10 291 LHDASDSITD--GTAKGTVLGVPVYVVGDALLGSAAGDQKAFVGDLKR-GVLFADRQQVTLAWEDSKI--YG---RYLGA 362 (394) T ss_pred eecccccccc--CCcccccccceeEEecccccCCCCCceEEEEeeccc-cEEEEeecceEEEEecccc--cc---eeEEE Confidence 9987654321 233468999999886643 322 279999999 5779999999999987543 54 46899 Q ss_pred EEeecceeecccceEEEEeeCCCCCC Q lcl|NC_021309. 472 EERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 472 ~~r~~~~v~~~~a~~~l~~~~~a~~~ 497 (497) .+|+|+.|++|+||++++++++++|+ T Consensus 363 ~~r~d~~~~~~~ai~~~~~~~~~~~~ 388 (394) T protein:vir:10 363 AFRFGVKQADSNAGYFVTNTDAASGS 388 (394) T ss_pred EEEeccEEeccccEEEEEeecccCCC Confidence 99999999999999999999999999 No 53 >protein:vir:1383 Length: 421 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612835;genbank:gi:20065969;genbank:GeneID:935826 Probab=100.00 E-value=5.9e-59 Score=339.81 Aligned_cols=379 Identities=13% Similarity=0.090 Sum_probs=242.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021309. 5 AQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEVEVRN 84 (497) Q Consensus 5 a~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~a~~~~~~~~~~~~~~~~ 84 (497) +.+++.++++.++++++.++ +..+.++++....+.+ .++.+++.+++++++++++.++.++....... T Consensus 1 Mn~~e~lkel~~~~~el~~~-----------~~~~~~~~~~~~~e~~-~~e~~~~~~e~~~l~~~i~~~~~~~~~~~~~~ 68 (421) T protein:vir:13 1 MNLFERLKELRAKKKELEEK-----------RCGIVEEIRSLAKEKK-EEEARSKALEREKIEARMEIIEEEIESVMTAI 68 (421) T ss_pred CCHHHHHHHHHHHHHHHHHH-----------HHHHHHHHHHHhhccc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 34444443333333333332 2222222222111111 11122333344444444444433332222111 Q ss_pred HHHHHHHHHhhhhhhHHHHhHhhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhh-hhcccccccccccc Q lcl|NC_021309. 85 LKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIG-QNPFGSTGTFAPGI 163 (497) Q Consensus 85 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~g~~v 163 (497) ..... .. ...... . .... .. .. .. .....+.+....+...... .....+++.+|.+| T Consensus 69 ~~~~~-~~----~~~~~~-~---~~~~--~~--~~--~~-------~~~~~~~~~~~~~~~~~~~~~ra~~t~~~gg~li 126 (421) T protein:vir:13 69 DEERK-NT----NFTGGR-V---IING--DS--KE--EK-------RSLQLSAMSKTIRGIQLSEEERDIMSSTNNGAVI 126 (421) T ss_pred HHHHh-hh----cccccc-c---cccc--ch--hH--HH-------HHHHHHHHHHhhhccchhHHHhhccccCCcceec Confidence 00000 00 000000 0 0000 00 00 00 0000111111111111001 11122344556677 Q ss_pred chhhhHHHHHHHHhhhhHHhhcceeecCCCceEEEEEcCCC-ccceecccccccccccccceeeEeeeeeEEeeehhhHH Q lcl|NC_021309. 164 LPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAH-NNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTITDE 242 (497) Q Consensus 164 ~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~-~~a~wv~Eg~~~~~s~~~f~~i~~~~~kla~~~~iS~e 242 (497) |+++..+|++.+++.++|+++|+++++++++++||+.+... ..++|++|++.+|.++++|++|++.+++++++++||+| T Consensus 127 P~~~~~~Ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~s~~~f~~i~~~~~k~~~~v~iS~e 206 (421) T protein:vir:13 127 PQEFVNEFEKLKEGYPSLKEHCHVIPVNRNAGKMPVRAGASVDKLANLAKDTELVKAMLKTQPMAYDIDDYGLLAPIDNS 206 (421) T ss_pred chhhHHHHHHHHHhhhhhhhhceeeeccCCceEEEEeecCCccceeeccccccccccccceeEEEeeeeeeEeehhhhHH Confidence 77788999999999999999999999999999999887643 34678999999999999999999999999999999999 Q ss_pred HHhhH-HHHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccchhhhhhhHHHHHHhhhhhcchhhhh Q lcl|NC_021309. 243 GLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVG 321 (497) Q Consensus 243 ll~d~-~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 321 (497) ||+|+ ++|++||.++|++++..++|.+++ +.|.|+++.++.. T Consensus 207 ll~ds~~~l~~~i~~~la~~~~~~~~~~i~-----~~~~g~~~~~~~~-------------------------------- 249 (421) T protein:vir:13 207 LLEDSEINFLEFVNEEFAEFAVNTENAEIV-----KQAKAVLAEETIN-------------------------------- 249 (421) T ss_pred HHhhhHHHHHHHHHHHHHHHHHHHhhhhHh-----hhhhhcccccccc-------------------------------- Confidence 99998 579999999999999999987766 4577776543321 Q ss_pred hhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHHHHHHHhhhcCceeccCc Q lcl|NC_021309. 322 QDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGNF 401 (497) Q Consensus 322 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~i~~~~ 401 (497) .++++.+++..+...++ .+.+|+||+.+|..|+++||++|||||+++ T Consensus 250 --------------------------------~~d~i~~~~~~l~~~~~-~~a~~v~n~~~~~~l~~lkd~~G~~i~~~~ 296 (421) T protein:vir:13 250 --------------------------------DYAGLVKTINSLVPNAR-KRAIIVTNSDGRAYLDGLMDKQGRPLLKEL 296 (421) T ss_pred --------------------------------chHHHHHHHHHhhhhhc-CCCEEEEcHHHHHHHHHhhcCCCceeecCc Confidence 12455566666666554 456899999999999999999999999874 Q ss_pred ccccccccccccccccccceEecCCCCcCc-----eEEEeeccceEEEEeecccEEEeecccchhhhcCceEEEEEEeec Q lcl|NC_021309. 402 FGNAYGNPVNGGKNIWGVPVVTTPLIPLGT-----ILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLG 476 (497) Q Consensus 402 ~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~-----~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~ 476 (497) ..+ .+++|||+||++++++|.+. ++||||++ +|.+++|.+++|+++++. +|.+|++.||++.|+| T Consensus 297 ~~~-------~~~tl~G~pV~~~~~~~~~~~~~~~~~~gd~~~-~~~~~~~~~~~v~~~~~~--~f~~~~~~~r~~~r~d 366 (421) T protein:vir:13 297 SDG-------GDLVFKGRPVIELEESIFDVGDETKFIVSDFKT-LIKFMDRKQYLIDQSKEA--GYTKNETIARIIERFD 366 (421) T ss_pred CCC-------CCceecceeeEEeccccccCCCceEEEEEeccc-cEEEEEecceEEEeeccc--ccccCeeEEEEEeeec Confidence 332 34689999999999998543 79999998 588999999999999875 5999999999999999 Q ss_pred ceeecccceEEEEeeC---------CCCCC Q lcl|NC_021309. 477 LLVYRPSAFQLIQLKK---------GATGS 497 (497) Q Consensus 477 ~~v~~~~a~~~l~~~~---------~a~~~ 497 (497) +.+++|+||+.+.... ++++| T Consensus 367 ~~~~~~~a~~~~~~~~~~a~v~~~~~~~~~ 396 (421) T protein:vir:13 367 VNSPLDKSSDAEKIRKFGVIVKLQEVLKSS 396 (421) T ss_pred ceeecchhhheeeecccceeeccccccCCC Confidence 9999999976543332 22222 No 54 >protein:vir:100884 Length: 389 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358764;genbank:gi:78000028;genbank:GeneID:3726155 Probab=100.00 E-value=6e-59 Score=339.79 Aligned_cols=377 Identities=17% Similarity=0.161 Sum_probs=240.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_021309. 15 AKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEVEVRNLKQIRKHLAR 94 (497) Q Consensus 15 ~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~ 94 (497) .+++++...+..+.+.++++.+++...+.+. ..++..++.+++++++++++++++++...+....... . T Consensus 1 meeL~~~~~~~~~~~~e~~~~l~~~~~~~~~------~~e~~~~l~~ei~~~~~~~~~l~~~~~~~~~~~~~~~-----~ 69 (389) T protein:vir:10 1 MDKLQTLFNDVSAKCADLNAQLNAKLQDENA------SVDDFQKIKDDLTAAKARRDAINDQIKALEAEKPAEP-----K 69 (389) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHhHhh------hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh-----h Confidence 1222222222222233333333222111110 1112223333444445555555444444332211100 0 Q ss_pred hhhhhHHHHhHhhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccccccccchhhhHHHHHH Q lcl|NC_021309. 95 AVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQ 174 (497) Q Consensus 95 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~p~~~~~ii~~ 174 (497) .......... .. ... .. .... ..+....+.+... ........++++.+|.+||+++..+|++. T Consensus 70 ~~~~~~~~~~---~~--~~~---~~--~~~~-----~~~~~~~~lr~~~--~~~~~~~~~t~~~gg~~vP~~~~~~i~~~ 132 (389) T protein:vir:10 70 TEPKDDGSKK---GT--DLS---KK--PIDA-----KKKAINDFIHSHG--KVIDATSKVTSTEAGVLIPEEIIYDPTAE 132 (389) T ss_pred cccccccccc---cc--ccc---hh--HHHH-----HHHHHHHHhhcch--hhhhhhcccccCCcceeehHHHHHHHHHH Confidence 0000000000 00 000 00 0000 0000011111110 11112233445556667777888999999 Q ss_pred HHhhhhHHhhcceeecCCCceEEEEEcCCCccceeccccccccc-ccccceeeEeeeeeEEeeehhhHHHHhhH-HHHHH Q lcl|NC_021309. 175 LFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPF-SSEEFARVYEQVGKVANALTITDEGLRDA-PELFN 252 (497) Q Consensus 175 ~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~wv~Eg~~~~~-s~~~f~~i~~~~~kla~~~~iS~ell~d~-~~l~~ 252 (497) +++.++|+++|++++++++++.||+.+..++.+.|++|++.+|+ ++++|++|++.+++++++++||+|+|+|+ ++|++ T Consensus 133 ~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~ 212 (389) T protein:vir:10 133 VNSVVDLSTLVTKTPVTTPKGTYPILKRATDRFSSVAELAENPKLAEPEFNKVDWSVATYRGAIPLSEEAIADSAVDLTA 212 (389) T ss_pred HHhhhhHHhhcceeeccCCeeEEEEEecCCCccccccccccccccccccceeeeeeheeeEeeehhhHHHHhhhhHHHHH Confidence 99999999999999999999999999877777899999999985 79999999999999999999999999997 58999 Q ss_pred HHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhh Q lcl|NC_021309. 253 FVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGR 332 (497) Q Consensus 253 ~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 332 (497) ||.+.|++++++++|.+|++|.|++.+.|.....+ T Consensus 213 ~i~~~la~~~~~~~~~~i~~g~~~~~~~~~~~~~~--------------------------------------------- 247 (389) T protein:vir:10 213 LVGQSIKEKSVNTYNAMIAPVLQSFTAKKTTTDTL--------------------------------------------- 247 (389) T ss_pred HHHHHHHHHHHHHHHHHHhhhhccccccccccccc--------------------------------------------- Confidence 99999999999999999999988765544322110 Q ss_pred hhhhhhhcccccccccchhhhhhhHHHHHHH-hhhhhhccCCceEEechhHHHHHHHHhhhcCceeccCccccccccccc Q lcl|NC_021309. 333 VVTGAAGSGSGVAGSYPTAAEIAENVFDAFV-DIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVN 411 (497) Q Consensus 333 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~ 411 (497) .+.+.+.+. .+...+ .++|+||+.+|..|+++||++|||||+++..... ... T Consensus 248 ----------------------~d~l~~~~~~~~~~~~---~a~~~~n~~~~~~L~~lkd~~G~~i~~~~~~~~~--~~~ 300 (389) T protein:vir:10 248 ----------------------VDSLKHILNVDLDPAY---SRALVVTQSLFNTLDTLKDKNGRYLLHDASDSIT--DGT 300 (389) T ss_pred ----------------------HHHHHHHHHhhhhhhh---CcEEEecHHHHHHHHHhhccCCCeeeecCccccc--ccc Confidence 111222211 111111 3579999999999999999999999987654322 223 Q ss_pred ccccccccceEecCCC-CcC-----ceEEEeeccceEEEEeecccEEEeecccchhhhcCceEEEEEEeecceeecccce Q lcl|NC_021309. 412 GGKNIWGVPVVTTPLI-PLG-----TILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAF 485 (497) Q Consensus 412 ~~~~l~G~Pvv~~~~~-~~~-----~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~a~ 485 (497) ...+|||+||++++.. +.. .++||||++ +|.+++|++++|.++++.. |. ..+|+..|+|+.|++|+|| T Consensus 301 ~~~~l~G~pV~~~~~~~~~~~~~~~~~~~gd~~~-~~~~~~~~~~~i~~~~~~~--~~---~~~~~~~r~d~~~~~~~a~ 374 (389) T protein:vir:10 301 AKGTILGVPVYVVGDTLLGSLAGDQKAFVGDLKR-GVLFTDRQQVTLAWEDSKI--YG---KYLGAAFRFGVQKADSKAG 374 (389) T ss_pred cccccccceeEEecccccCCCCCceEEEEeeccc-cEEEEeecceEEEeecccc--cc---ceEEEEEEeccEEecccce Confidence 4568999999875543 322 279999998 5789999999999987543 44 5789999999999999999 Q ss_pred EEEEeeCCCCCC Q lcl|NC_021309. 486 QLIQLKKGATGS 497 (497) Q Consensus 486 ~~l~~~~~a~~~ 497 (497) +++++++++.++ T Consensus 375 ~~~~~~~~~~~~ 386 (389) T protein:vir:10 375 YFVTNTDVPGSA 386 (389) T ss_pred EEEEeeccCCCC Confidence 999999999888 No 55 >protein:vir:4092 Length: 390 # NCBI annotation: major capsid protein a # Family: family:all:635 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510986;swissprot:trembl:q8w604;genbank:gi:17488508;uniprot:Q8W604;genbank:GeneID:1260361 Probab=100.00 E-value=4.4e-59 Score=340.51 Aligned_cols=366 Identities=14% Similarity=0.147 Sum_probs=230.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_021309. 15 AKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEVEVRNLKQIRKHLAR 94 (497) Q Consensus 15 ~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~ 94 (497) .++++++..+. .++++++ .+.++......+..+.++++ .+.+..+.. .+.+..... T Consensus 1 ik~L~e~~~e~----~e~~~~~----------~~~~~~~~~~~e~~~~~~~~---~~~~~~~~~-------~~~~~~~~~ 56 (390) T protein:vir:40 1 MNNLDKKDSET----LNISTAF----------LNAIKEGATEAEQVTAFTNM---AEQIQNNII-------AQARKEVNR 56 (390) T ss_pred CchHHHHHHHH----HHHHHHH----------HHHHhhhhhHHHHHHHHHHH---HHHHHHHHH-------HHHHHHHHH Confidence 11122211111 1111111 00011000000111111111 111110000 000000000 Q ss_pred hhhhhHHHHhHhhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccccccccchhhhHHHHHH Q lcl|NC_021309. 95 AVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQ 174 (497) Q Consensus 95 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~p~~~~~ii~~ 174 (497) ......... .+... ....+.+..+.. ....++++.+|++||+++..+|++. T Consensus 57 ------~~~~~~~~~--~~~~~------------~l~~~~r~~~~~---------~~~~~~~~~gg~lvP~~~~~~I~~~ 107 (390) T protein:vir:40 57 ------EMNDNNVLA--SRGAN------------ALTSDESKYYNE---------VIAGNGFAGVTALLPPTVFERVFED 107 (390) T ss_pred ------HHHHHHHHH--hcCch------------hccHHHHHHHHH---------HHhccCcccCcccccHHHHHHHHHH Confidence 000000000 00000 000000111100 1112344556777888899999999 Q ss_pred HHhhhhHHhhcceeecCCCceEEEEEcCCCccceecccccccc-cccccceeeEeeeeeEEeeehhhHHHHhhHH-HHHH Q lcl|NC_021309. 175 LFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYP-FSSEEFARVYEQVGKVANALTITDEGLRDAP-ELFN 252 (497) Q Consensus 175 ~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~wv~Eg~~~~-~s~~~f~~i~~~~~kla~~~~iS~ell~d~~-~l~~ 252 (497) ++..++|+++|++++++++...+|+.++ .+.+.|++|++..+ .++++|++|++++|+++++++||+|||+|++ +|++ T Consensus 108 ~~~~s~i~~~~~~~~~~~~~~~i~~~~~-~~~a~~~~E~~~~~~~~~~~f~~i~l~~~k~~~~i~iS~ell~ds~~~l~~ 186 (390) T protein:vir:40 108 LTVEHPLLSKINFVNTTATTEWIISVGD-VATAWWGPLCAEIKEVLDNGFDKIQTGMYKLSAYIPVCNAMLDLGPSWLDQ 186 (390) T ss_pred HHhhhhhhhhceeeecCCceeEEEEEcC-CcceeeeccccccCccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHH Confidence 9999999999999999999999999876 57899999998876 5789999999999999999999999999985 7999 Q ss_pred HHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhh Q lcl|NC_021309. 253 FVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGR 332 (497) Q Consensus 253 ~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 332 (497) ||+++|+++++.++|.+|++|+|+++|.||++..+..+............... T Consensus 187 ~i~~~la~~i~~~~~~a~l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~t~~--------------------------- 239 (390) T protein:vir:40 187 YVRTILGEAMALGLEAGIVNGSGKDQPIGMMRDLNNVTAGEHPVKTATPLTDL--------------------------- 239 (390) T ss_pred HHHHHHHHHHHHHHHhhhhcccCCCccceeeeccccccccccccccccccchh--------------------------- Confidence 99999999999999999999999999999998765444333222111100000 Q ss_pred hhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHH----HHHHHHhhhcCceeccCcccccccc Q lcl|NC_021309. 333 VVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDW----ELLRLTKDANGQYMGGNFFGNAYGN 408 (497) Q Consensus 333 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~----~~l~~lkd~~G~~i~~~~~~~~~~~ 408 (497) ........+...+... ........+|+||+.++ ..+++++|++|+|+|.. T Consensus 240 -----------------~~~~~~~~l~~~~~~~-~~~~~~~a~~i~n~~t~~~~l~~~~~~~d~~G~~v~~~-------- 293 (390) T protein:vir:40 240 -----------------TPATLATKVMLPLTDN-GKKSVSDAILVINPADYWSKIYAATSYMTPQGVWVTGI-------- 293 (390) T ss_pred -----------------hHHHHHHHHHHHhhcc-hhhhhcCceEEEcchhHHHHHHHHhhccCCCCcccccc-------- Confidence 0000111111111111 11123356799999884 35558999999999853 Q ss_pred cccccccccccceEecCCCCcCceEEEeeccceEEEEeecccEEEeecccchhhhcCceEEEEEEeecceeecccceEEE Q lcl|NC_021309. 409 PVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLI 488 (497) Q Consensus 409 ~~~~~~~l~G~Pvv~~~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~a~~~l 488 (497) .++|+|||+++.||+++++||||++ |.+++|.+++|+++++. +|.+|++.||++.|+|++|++|+||++| T Consensus 294 ------~~~g~pvv~~~~~p~~~i~~Gd~s~--~~i~~~~~~~v~~~~~~--~f~~~~~~~r~~~r~dg~v~~~~A~~~l 363 (390) T protein:vir:40 294 ------LPVPLEIVQSVAVPVGKAVAGRAKD--YFMGIGSEQVIRTSTEY--RLLDDETLYYAKQYANGRPKDNSSFLVF 363 (390) T ss_pred ------CCCceeEEEcCCCCCCcEEEEeece--EEEEeecceEEEecchh--hhhcCcEEEEEEEEeCCEEecccceEEE Confidence 2469999999999999999999997 67899999999998755 5999999999999999999999999999 Q ss_pred EeeCCCCCC Q lcl|NC_021309. 489 QLKKGATGS 497 (497) Q Consensus 489 ~~~~~a~~~ 497 (497) ++++.+... T Consensus 364 ~~~~~~~~~ 372 (390) T protein:vir:40 364 DITGLEGSP 372 (390) T ss_pred EeeccCCCC Confidence 999987331 No 56 >protein:vir:98635 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039923;genbank:gi:126011098;genbank:GeneID:4818471 Probab=100.00 E-value=1.1e-58 Score=338.44 Aligned_cols=365 Identities=14% Similarity=0.032 Sum_probs=236.7 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021309. 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) Q Consensus 1 ~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~a~~~~~~~~~~~~ 80 (497) |+-..+..++.++..++ +.+.++..... ++..+.++ +..+.+.+++... T Consensus 1 M~i~~k~~~~~~~~~~~---------------------l~~~~~~~~~~-------ee~~~~~~---~~~~~~~~~~~~~ 49 (377) T protein:vir:98 1 MAINLKELPKYREAVAE---------------------LSAKISAGATS-------EEQEKLFE---AAFTTMGDEILAK 49 (377) T ss_pred CCCcHHHHHHHHHHHHH---------------------HHHHHHhhhhh-------HHHHHHHH---HHHHhHHHHHHHH Confidence 44332222111111111 11111100000 00000011 1111111111100 Q ss_pred HHHHHHHHHHHHHhhhhhhHHHHhHhhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccccc Q lcl|NC_021309. 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFA 160 (497) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 160 (497) .... .+.. ... ....... ..+.+..+.. ....++.+.+| T Consensus 50 ~~~e---~~~~-----------~~~---~~~~~~l---------------t~ee~~~~~~---------~~~~~~~~~gg 88 (377) T protein:vir:98 50 NEEE---MERM-----------FDL---RDKNREL---------------TAEEIKFFND---------IDKNVGGKDKF 88 (377) T ss_pred HHHH---HHHH-----------HHh---ccCCccc---------------CHHHHHHHHH---------HHhccCCCCCc Confidence 0000 0000 000 0000000 0000111100 11223455667 Q ss_pred cccchhhhHHHHHHHHhhhhHHhhcceeecCCCceEEEEEcCCCccceecccccccc-cccccceeeEeeeeeEEeeehh Q lcl|NC_021309. 161 PGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYP-FSSEEFARVYEQVGKVANALTI 239 (497) Q Consensus 161 ~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~wv~Eg~~~~-~s~~~f~~i~~~~~kla~~~~i 239 (497) ++||+++...|++.+...++|+++|++.+++++ .++|+.++ .+.+.|++|++..+ +++++|++|++.+||++++++| T Consensus 89 ~~vP~~~~~~I~~~l~~~s~i~~~~~v~~~~~~-~~~~~~~~-~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~a~~~i 166 (377) T protein:vir:98 89 KLLPEETMVQVFDDLVAEHPLLKVINFKNTSLR-LKALTAET-SGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVI 166 (377) T ss_pred cccCHHHHHHHHHHHHHhhhhhhheeeEecCcc-eEEEEecC-CcceeEeecccccCcccCccceeEeecceeEEeeecc Confidence 778888999999999999999999999998765 79999876 57899999987765 6799999999999999999999 Q ss_pred hHHHHhhHH-HHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccchhhhhhhHHHHHHhhhhhcchh Q lcl|NC_021309. 240 TDEGLRDAP-ELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGA 318 (497) Q Consensus 240 S~ell~d~~-~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 318 (497) |+|||+|++ ++++||+++|++++++++|.+|++|+|+++|.||++..+..+............ T Consensus 167 s~elL~ds~~~ie~~i~~~la~~~a~~~~~a~i~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~---------------- 230 (377) T protein:vir:98 167 PKDALKFGPKWIKQFITEQLKEAIAVALELAIVKGDGLLQPVGLLKDLSQPTVDQSTGRDITTY---------------- 230 (377) T ss_pred cHHhhhccHhHHHHHHHHHHHHHHHHHHhhceEeccCCCcceeeeecccccccccccccccccc---------------- Confidence 999999975 899999999999999999999999999999999998765443332211100000 Q ss_pred hhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHHHHHHHhhhcCceec Q lcl|NC_021309. 319 FVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMG 398 (497) Q Consensus 319 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~i~ 398 (497) ....+.+.+....+...++. ..+|+||+.++..++++||.+|+|+| T Consensus 231 ---------------------------------~~~~~~~~~l~~~~~~~~~~-~a~~~m~~~t~~~~~klkd~~G~~i~ 276 (377) T protein:vir:98 231 ---------------------------------KTDKEAIADLSDLTPDNAPK-KLVPVMKHLSVNDKKRPLKIAGQVKL 276 (377) T ss_pred ---------------------------------cchhhhHhhhhhhchhHHHH-HHHHHHHHHHHHHHhhhhccCCceEE Confidence 00012223333334444433 34699999999999999999999999 Q ss_pred cCccccc--------ccccccccccccccc--eEecCCCCcCceEEEeeccceEEEEeecccEEEeecccchhhhcCceE Q lcl|NC_021309. 399 GNFFGNA--------YGNPVNGGKNIWGVP--VVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVT 468 (497) Q Consensus 399 ~~~~~~~--------~~~~~~~~~~l~G~P--vv~~~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~ 468 (497) ..++.+. ...+.+...+++|+| |+.++.+|+++++||||++ |.|++|.+++|+.+++. .|.+|++. T Consensus 277 ~~n~~~~~~~~p~~~~~~~~G~~~t~lg~p~~vv~s~~~p~~~i~fgdf~~--Y~i~~r~~~~i~~~~~~--~~~~d~~~ 352 (377) T protein:vir:98 277 ILNPEDRWALEAQFTSRNQFGEYVTVLPHGITILESLAVETGKAIAFVANR--YDAFMATASTIEEYDQT--FAMEDLQL 352 (377) T ss_pred EecccchhhccccccccCCCCccccccCCCceEEecCCCCcccEEEEEecc--eeEEeecceEEEeechh--hhhcCceE Confidence 6433221 111223344788888 5789999999999999998 88999999999988754 59999999 Q ss_pred EEEEEeecceeecccceEEEEeeCC Q lcl|NC_021309. 469 VRAEERLGLLVYRPSAFQLIQLKKG 493 (497) Q Consensus 469 ~r~~~r~~~~v~~~~a~~~l~~~~~ 493 (497) ||++.|+||++++|+||++|+++.. T Consensus 353 f~~~~r~dg~~~~~~a~~vl~i~~~ 377 (377) T protein:vir:98 353 YLTKNYFYGKAKDNHTAALLTLAGG 377 (377) T ss_pred EEEEEEEcCEEeccCcEEEEEEecC Confidence 9999999999999999999999988 No 57 >protein:vir:4226 Length: 326 # NCBI annotation: observed 35.2Kd protein # Family: family:all:507 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039681;swissprot:sw:q05223;genbank:gi:9625447;uniprot:Q05223;genbank:GeneID:2942929 Probab=100.00 E-value=8e-59 Score=339.09 Aligned_cols=311 Identities=11% Similarity=0.078 Sum_probs=238.6 Q ss_pred HhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccccccccchhhhHHHHHHHHhhhhHHhhcceeecCCCceEEEEEcC Q lcl|NC_021309. 123 KAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESA 202 (497) Q Consensus 123 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~ 202 (497) ....+.+ . ..+.. ....+....+++++|++||+++..+||+.+++.++|+++|++++++++.+++|+.++ T Consensus 1 ~~~~~~r------~---~~~~~-~~e~~a~~~~~~~~g~~ip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~p~~~~ 70 (326) T protein:vir:42 1 MAVNPDR------T---TPFLG-VNDPKVAQTGDSMFEGYLEPEQAQDYFAEAEKISIVQQFAQKIPMGTTGQKIPHWTG 70 (326) T ss_pred CCCCccc------h---hhhcC-cchhhheeccccCCcceechhhHHHHHHHHHhcchhhhhcceeeccCCceEEEEEeC Confidence 0000000 0 00000 112223334556667889999999999999999999999999999999999999987 Q ss_pred CCccceecccccccccccccceeeEeeeeeEEeeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhcccCcccccc Q lcl|NC_021309. 203 AHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNG 281 (497) Q Consensus 203 ~~~~a~wv~Eg~~~~~s~~~f~~i~~~~~kla~~~~iS~ell~d~-~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~G 281 (497) .+.++|++|++.+|+++++|++|++.++|++++++||+|+|+|+ +++++||.++|++++++++|+++|+|+|+++|.| T Consensus 71 -~~~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~v~iS~ell~~s~~~~~~~i~~~l~~a~~~~~d~a~l~G~gs~~p~g 149 (326) T protein:vir:42 71 -DVSASWIGEGDMKPITKGNMTSQTIAPHKIATIFVASAETVRANPANYLGTMRTKVATAFAMAFDNAAINGTDSPFPTF 149 (326) T ss_pred -CcceEEecCCccccccccceeEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccc Confidence 46899999999999999999999999999999999999999986 6899999999999999999999999999999999 Q ss_pred ccccccccccccccchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHH Q lcl|NC_021309. 282 LLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDA 361 (497) Q Consensus 282 i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 361 (497) |++..+................ ...+..... T Consensus 150 i~~~~~~~~~~~~~~~~~~~~~-------------------------------------------------~~~~~~~~~ 180 (326) T protein:vir:42 150 LAQTTKEVSLVDPDGTGSNADL-------------------------------------------------TVYDAVAVN 180 (326) T ss_pred ccccccccceeecccccccccc-------------------------------------------------hhHHHHHHH Confidence 9887654333222111000000 000111111 Q ss_pred HHhhhhhhccCCceEEechhHHHHHHHHhhhcCceeccCcccccccccccccccccccceEecCCCCcCce--EEEeecc Q lcl|NC_021309. 362 FVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTI--LVGHFAP 439 (497) Q Consensus 362 ~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~~--~~gd~~~ 439 (497) ........+....+|+||+.+|..|+++||++|+|||++........ .....+++|+||++++++|+++. ++|||++ T Consensus 181 ~~~~~~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~-~~~~~~l~G~pv~~~~~~~~~~~~~~~Gd~s~ 259 (326) T protein:vir:42 181 ALSLLVNAGKKWTHTLLDDITEPILNGAKDKSGRPLFIESTYTEENS-PFRLGRIVARPTILSDHVASGTVVGYQGDFRQ 259 (326) T ss_pred HHhhhhhhccCccEEEEeHHHHHHHHHhhccCCceeeccccccCccc-cccCceeeeeeEEEcCCCCCCceEEEEeecce Confidence 11222344555678999999999999999999999998754432211 12345899999999999999874 6799998 Q ss_pred ceEEEEeecccEEEeecccc------------hhhhcCceEEEEEEeecceeecccceEEEEeeCCCCC Q lcl|NC_021309. 440 SVIQTARREGVTMQMTNSNG------------TDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGATG 496 (497) Q Consensus 440 ~~~~i~~r~~~~i~~~~~~~------------~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~~a~~ 496 (497) +.++++.+++|+++++.. ++|++|++.||+++|+||+|.+|+||++|+.++++++ T Consensus 260 --~~~~~~~~~~v~~~~e~~~~~~~~~~~~~~~~~~~d~~~~r~~~~~d~~v~~~~a~~~l~~~~~~~~ 326 (326) T protein:vir:42 260 --LVWGQVGGLSFDVTDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYAFHCNDKDAFVKLTNVDATEA 326 (326) T ss_pred --EEEEEecceEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEecccceEEEeeccccCC Confidence 447889999998877543 4599999999999999999999999999999999999 No 58 >protein:vir:7771 Length: 330 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817605;genbank:gi:29566035;genbank:GeneID:1259229 Probab=100.00 E-value=1.6e-58 Score=337.40 Aligned_cols=303 Identities=16% Similarity=0.161 Sum_probs=240.1 Q ss_pred hhhhhhhhhccccccccccccchhhhHHHHHHHHhhhhHHhhcceeecCCCceEEEEEcCCCccceeccccccccccccc Q lcl|NC_021309. 143 TAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEE 222 (497) Q Consensus 143 ~~~~~~~~~~~~~~~~~g~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~wv~Eg~~~~~s~~~ 222 (497) ............+++++|+++||++..+|++.+++.++|++++++++++++.++||+.++ .+.+.|++|++.+|+++++ T Consensus 1 m~~~~~~a~~~~~t~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~-~~~a~~v~Eg~~~~~~~~~ 79 (330) T protein:vir:77 1 MAGSTVPSTQVALTGDFSAFLTPEQSQDYFAEIEKTSIVQRIARKVPMGPTGISIPHWTG-AVSASWTGEAERKPITKGS 79 (330) T ss_pred CcccccchhhccccCCCcceechhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEcC-CcceeEecCCCccccccce Confidence 111222233345567788899999999999999999999999999999999999999987 4679999999999999999 Q ss_pred ceeeEeeeeeEEeeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhcccCccc-cccccccccccccccccchhhh Q lcl|NC_021309. 223 FARVYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPG-VNGLLQRSTGFTASSASSLFGA 300 (497) Q Consensus 223 f~~i~~~~~kla~~~~iS~ell~d~-~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~-p~Gi~~~~~~~~~~~~~~~~~~ 300 (497) |++|++.++|++++++||+|||+|+ ++++++|.++|+++++.++|.+||+|+|+++ |.|+++............. T Consensus 80 f~~i~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~ai~~~~~~~~l~G~g~~~~~~g~~~~~~~~~~~~~~~~--- 156 (330) T protein:vir:77 80 FGKQELEPVKITTIFAESAEVVRLNPLNYLNTMRTKIAEAIALKFDAAAIHGIDKPSAFKGYLAETTKVVSLADTNL--- 156 (330) T ss_pred eeEEEEeEEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCCccccccccccccceeecccc--- Confidence 9999999999999999999999987 6899999999999999999999999999875 7788877643322211110 Q ss_pred hhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEech Q lcl|NC_021309. 301 TSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNP 380 (497) Q Consensus 301 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~ 380 (497) ........+.++++..++..+...+ .+..+|+||+ T Consensus 157 --------------------------------------------~~~~~~~~~~~~~l~~~~~~~~~~~-~~~~~~vmn~ 191 (330) T protein:vir:77 157 --------------------------------------------TTASGPQGNAYLAVNNALSLLVNSG-KKWTGTLLDN 191 (330) T ss_pred --------------------------------------------cccccccchhHHHHHHHHHhhhhcC-CCccEEEEcH Confidence 0001112223455555555555554 4556899999 Q ss_pred hHHHHHHHHhhhcCceeccCcccccccccccccccccccceEecCCCCcCc------eEEEeeccceEEEEeecccEEEe Q lcl|NC_021309. 381 RDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGT------ILVGHFAPSVIQTARREGVTMQM 454 (497) Q Consensus 381 ~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~------~~~gd~~~~~~~i~~r~~~~i~~ 454 (497) .+|..|+++||++|||||++....... ......+|+|+||++++.+|.++ +++|||++ +.++++.+++|++ T Consensus 192 ~~~~~l~~lkd~~G~~l~~~~~~~~~~-~~~~~~~l~G~PV~~~~~~p~~~~~~~~~~~~gd~s~--~~i~~~~~~~i~~ 268 (330) T protein:vir:77 192 VTEPILNTAVDGNGRPLFVESTYTEQV-GAIREGRILGRPTYVADNVVNGTVGNRVVGVMGDFSQ--VIWGQIGGLSFDV 268 (330) T ss_pred HHHHHHHHHhccCCceeecCccccccc-cccCCceecceeeEEeccccCCCCCCccEEEEEecce--EEEEEecCcEEEE Confidence 999999999999999999875543321 12245689999999999999765 78999998 5588999999998 Q ss_pred eccc----------------chhhhcCceEEEEEEeecceeecccceEEEEeeCCCCCC Q lcl|NC_021309. 455 TNSN----------------GTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 455 ~~~~----------------~~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~~a~~~ 497 (497) +++. .++|++|++.||++.|+|+.|++|+||++|+.+++-+=- T Consensus 269 ~~e~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~i~~~~~~~~~ 327 (330) T protein:vir:77 269 TDQATLDFGEEQGGVWVPKLISLWQHNMVAVRCEAEFAFMVNDKDAFVKLTDQVAGTDP 327 (330) T ss_pred eecceeeecccccccccccccchhhcCcEEEEEEEEeccEEecccceEEEEeccCCcCC Confidence 7753 256999999999999999999999999999877722211 No 59 >protein:vir:93616 Length: 645 # NCBI annotation: putative major head protein/prohead protease # Family: family:all:21 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449293;genbank:gi:157166041;goa:Q6H9U8;interpro:IPR006433;uniprot:Q6H9U8;genbank:GeneID:5580438 Probab=100.00 E-value=2.4e-57 Score=331.02 Aligned_cols=426 Identities=15% Similarity=0.130 Sum_probs=237.3 Q ss_pred CchH----HHHHHHHHHHH------HHHHHHH------HHHHHHHHHHHHHHHHHHHHHHHHHHhH--hHHHHHHHHHHH Q lcl|NC_021309. 1 MPST----AQLEAQGRQLA------KSIKDIN------ADETKTAAEKKEALAKIEPDFKAHQAEV--EAHERAQEMLKS 62 (497) Q Consensus 1 ~~~~----a~~~~~~~~~~------~~~~~~~------~~~~~~~~e~~~~~~~~~~~~~~~~~~~--~~~e~~~e~~~~ 62 (497) |... ...+....... ...++.. -...+.+.++++...++.+++++..... +......+..++ T Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~e~i~~l~~~ra~~~~~~~~l~~~a~~~g~~l~aee~~~ 234 (645) T protein:vir:93 155 YDRQFSAASGNRKPVVKIASSAGAAAQSTTVFHKEKTIMNIGEQIKSFENKRAALAASLEEVMTKAAEEGRTLDVEEEEH 234 (645) T ss_pred ccchhhhhhhhhcchhhhhhhhcchhhccccccccccccchhhhhhhhhHHHHHHHHHhhhhhhhHhhhccccCHHHHHH Confidence 1100 00000000000 0000000 0001112222222222222222211111 111112344466 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HhhhhhhHH-------HHhHhhhhhhhhhhhhHHHHHHhh-hHHHHHH Q lcl|NC_021309. 63 LGGADAAKDGLDNDIPEVEVRNLKQIRKHL--ARAVIMNPE-------LKNATSFEKGTKFDVSFNVSAKAA-DPGTAAA 132 (497) Q Consensus 63 ~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~ 132 (497) ++.+.++++.++.++++.+........... ......... ........++..+........... ....... T Consensus 235 ~d~l~aei~~l~~~i~r~e~~e~~~a~~a~pv~~~~~~~~~~~~~~~~~~~~~~~~kg~~f~~~~~al~~~~g~~~~a~e 314 (645) T protein:vir:93 235 YDNTAAEIRQVDAHLKRLRELEAGKAATAQPVKQAGNGNVAAVASAPVIRVEQKLDKGIGFARFAKSLAAAKGVRSEALE 314 (645) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccchhhhhhhhhHHHHHHHHHhcccchhHHHH Confidence 777777777777776655433221111100 000000000 000000011111111000000000 0000000 Q ss_pred HHHHHHHh-hhhh---hhhhhhhccccccccccccch-hhhHHHHHHHHhhhhHHhhcceeecC----CCceEEEEEcCC Q lcl|NC_021309. 133 ELMGAFAD-GETA---PAAIGQNPFGSTGTFAPGILP-TFLPGIVEQLFYELSLADLISSRPVT----SPNLSYLTESAA 203 (497) Q Consensus 133 ~~~~~~~~-~~~~---~~~~~~~~~~~~~~~g~~v~p-~~~~~ii~~~~~~~~l~~~~~~~~~~----~~~~~~p~~~~~ 203 (497) ..+..... .... .........++++.+|++++| ++..+||+.+++.+++++++.....+ .+.+++|+++++ T Consensus 315 ~a~~~~~~~~~~~~~~~~a~~~~~~~~~~~~Gg~~vp~~~~~~ii~~l~~~svv~~l~~~~~~~~~~~~~~~~ip~~t~~ 394 (645) T protein:vir:93 315 VARRQYPDDSRLHHVLKSAVGAGTTTDPQWAGSLSEYQEYAQDFIDYLRPQTIIGRFGQGGIPALRQVPFNIRVHAQVSG 394 (645) T ss_pred HHHhhcccchhhhhhhhhhhhccccccccccCCccCchhhHHHHHHhhhhhhhHHhhccccccccccccCceeeeeeecC Confidence 00000000 0000 111111122223334555555 56788999999999999987654332 235789999874 Q ss_pred CccceecccccccccccccceeeEeeeeeEEeeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhcccCcc----c Q lcl|NC_021309. 204 HNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYP----G 278 (497) Q Consensus 204 ~~~a~wv~Eg~~~~~s~~~f~~i~~~~~kla~~~~iS~ell~d~-~~l~~~i~~~la~~~~~~~d~~~l~G~G~~----~ 278 (497) +.++||+|++.+|+++++|++|++++|||+++++||+|||+|+ +++++||+++|+++++.++|.+||+|+|++ . T Consensus 395 -~~a~wv~Eg~~~~~s~~~f~~v~l~~~kla~~~~iS~ell~ds~~~~~~~i~~~l~~aia~~~d~a~l~g~g~~~~~~~ 473 (645) T protein:vir:93 395 -GAAGWVGEGKTKPLTKFDFESITFSHAKVSAIAVLTEELIRFSSPAADALVRNALAEAVVARLDTDFVDPKKAAVADVS 473 (645) T ss_pred -cceEEeccCccccccccceeEEEEeeEEEEEeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHhhcCCCcccCCcc Confidence 6899999999999999999999999999999999999999986 799999999999999999999999998764 3 Q ss_pred cccccccccccccccccchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHH Q lcl|NC_021309. 279 VNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENV 358 (497) Q Consensus 279 p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 358 (497) |.|+++.......... ...++ T Consensus 474 p~gi~~~~~~~~~~~~-----------------------------------------------------------~~~d~ 494 (645) T protein:vir:93 474 PASITHDVKGTASSGN-----------------------------------------------------------PDADA 494 (645) T ss_pred ccceeccccccccccc-----------------------------------------------------------hHHHH Confidence 7776553221110000 00111 Q ss_pred HHHHHhhhhh-hccCCceEEechhHHHHHHHHhhhcCceeccCcccccccccccccccccccceEecCCCCcCceEEEee Q lcl|NC_021309. 359 FDAFVDIQLT-LFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHF 437 (497) Q Consensus 359 ~~~~~~~~~~-~~~~~~~~~~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~~~~gd~ 437 (497) ...+..+..+ ......+|+||+.++..|+++||++|+|+|... .. ..++|+|+||++++++|.+ +++||| T Consensus 495 ~~~~~~~~~a~~~~~~a~~vmn~~~~~~L~~lkd~~G~~~~~~~-~~-------~~~tL~G~PV~~s~~vp~~-~~~gd~ 565 (645) T protein:vir:93 495 EAAFGQFVAANLQPTGAVWLMSSTNALALSMRKNALGQKEYPDM-TL-------LGGSFQGLPVIVSQYVGDQ-LVLVNA 565 (645) T ss_pred HHHHHHHHhcCCCccccEEEEcHHHHHHHHhccccCCceeecCC-CC-------CCceeeceeeEEeccCCcc-eeEecc Confidence 1112222221 223345799999999999999999999998432 11 2348999999999999864 678999 Q ss_pred ccceEEEEeecccEEEeecccc--------------------hhhhcCceEEEEEEeecceeecccceEEEEeeCCCCCC Q lcl|NC_021309. 438 APSVIQTARREGVTMQMTNSNG--------------------TDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 438 ~~~~~~i~~r~~~~i~~~~~~~--------------------~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~~a~~~ 497 (497) ++ +.++++.++.|.++++.. ++|++|||+||+++|+||.++||+||++|+-.+==.+| T Consensus 566 s~--~~ig~~~~v~i~~s~~a~~~~~~~~~~~~~~~~~~~~v~lf~~d~vaira~~r~d~~~~~p~a~~~lt~~~~g~~~ 643 (645) T protein:vir:93 566 PD--IYLADDGGVAVDMSREASLEMQSEPTGDSTTPSPVELVSMFQTGSVAIRAERWINWRRRRTAAVAVITGVNYGSAS 643 (645) T ss_pred cc--EEEEEecceEEEeecceeEEEeecccccccccccccchhHhhcCceEEEEEEEEcceeeCccceEEEecccCCccc Confidence 97 456777888777655432 46999999999999999999999999999755433333 No 60 >protein:vir:41 Length: 299 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463467;swissprot:trembl:q9t1b7;genbank:gi:16798789;uniprot:Q9T1B7;genbank:GeneID:922353 Probab=100.00 E-value=2.8e-58 Score=336.08 Aligned_cols=282 Identities=20% Similarity=0.242 Sum_probs=238.1 Q ss_pred hhhhhhccccccccccccchhhhHHHHHHHHhhhhHHhhcceeecCCCceEEEEEcCCCccceeccccccccccccccee Q lcl|NC_021309. 146 AAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFAR 225 (497) Q Consensus 146 ~~~~~~~~~~~~~~g~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~wv~Eg~~~~~s~~~f~~ 225 (497) .........+++.+|++||+++..+||+.+++.++|+++|+++|++++...+|+.++ ..++|++|++.+|+++++|++ T Consensus 1 ~g~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~--~~a~~v~E~~~~~~~~~~f~~ 78 (299) T protein:vir:41 1 MGFNPDTTTMQSAKTGSIPINISEQIITGVKNGSAAMKLAKAVPMTKPEEEFTFMSG--VGAFWVDEAERIQTSKPTFTK 78 (299) T ss_pred CCcCCCcccccCCCceecchhHHHHHHHHHHhcchhhhhceeeecCCCcEEEEEEcC--CceeeeecCccccccccceeE Confidence 222233344556677888888999999999999999999999999999999998764 568999999999999999999 Q ss_pred eEeeeeeEEeeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccchhhhhhhH Q lcl|NC_021309. 226 VYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSAT 304 (497) Q Consensus 226 i~~~~~kla~~~~iS~ell~d~-~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~ 304 (497) |++.++|++++++||+|+++|+ ++++++|.+.|++++++++|.++++|+|+++|.||++............ T Consensus 79 v~l~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~g~~~~~gil~~~~~~~~~~~~~-------- 150 (299) T protein:vir:41 79 AKMRSKKMGVIIPTTKENLNYSVTNFFSLMQAEIVEAFYKKFDQAVFTGVESPYNWNILKSATDASNLVEET-------- 150 (299) T ss_pred EEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccccccccccccceeeccc-------- Confidence 9999999999999999999987 6899999999999999999999999999999999988654332221110 Q ss_pred HHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHH Q lcl|NC_021309. 305 VSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWE 384 (497) Q Consensus 305 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~ 384 (497) ...++++.+++..+...++ .+.+|+||+.+|. T Consensus 151 -----------------------------------------------~~~~~~l~~~~~~l~~~~~-~~~~~v~n~~~~~ 182 (299) T protein:vir:41 151 -----------------------------------------------ANKYDDLNEAIGLIEAEDL-EPNGIATIRKQRV 182 (299) T ss_pred -----------------------------------------------cccHHHHHHHHHhhhcccC-CcCEEEEcHHHHH Confidence 1123556666677766665 4567999999999 Q ss_pred HHHHHhhhcCceeccCcccccccccccccccccccceEecCCCCcCc----eEEEeeccceEEEEeecccEEEeecccc- Q lcl|NC_021309. 385 LLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGT----ILVGHFAPSVIQTARREGVTMQMTNSNG- 459 (497) Q Consensus 385 ~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~----~~~gd~~~~~~~i~~r~~~~i~~~~~~~- 459 (497) .|+++||++|+|||++...+ +.++|+|+||++++.+|.++ ++||||++ +.+++|++++++++++.. T Consensus 183 ~L~~lkd~~G~~l~~~~~~~-------~~~~l~G~PV~~~~~~~~~~~~~~~~~gdfs~--~~i~~~~~~~i~~~~~~~~ 253 (299) T protein:vir:41 183 KYRSTKDGNGMPIFNTATSN-------GVDDVLGLPIAYTPKYTFGDKDISELVGDWNQ--AYYGILRGVEYEILTEATL 253 (299) T ss_pred HHHHhhccCCceeecCCcCC-------CCceecceeeEEecccCCCCCceEEEEEeccc--EEEEEecCcEEEEeecccc Confidence 99999999999999876543 23589999999999999887 89999997 568999999999887542 Q ss_pred -----------hhhhcCceEEEEEEeecceeecccceEEEEeeCCC Q lcl|NC_021309. 460 -----------TDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGA 494 (497) Q Consensus 460 -----------~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~~a 494 (497) ++|++|++.||++.|+||++++|+||++|+.+++- T Consensus 254 ~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~A~~~l~~~aa~ 299 (299) T protein:vir:41 254 TTVADETGKPLNLAERDMAAIKATFEVGFMVVKDEAFSAVQPKAGN 299 (299) T ss_pred cccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeccCC Confidence 35999999999999999999999999999998887 No 61 >protein:vir:9361 Length: 402 # NCBI annotation: SLT orf 37-like protein # Family: family:all:658 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803339;genbank:gi:29028650;genbank:GeneID:1258088 Probab=100.00 E-value=1.9e-57 Score=331.54 Aligned_cols=383 Identities=14% Similarity=0.101 Sum_probs=242.0 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021309. 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) Q Consensus 1 ~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~a~~~~~~~~~~~~ 80 (497) |+.+.++++++.++.++++++..+ +.++........+++++...+ .+++.+++++++++++.++.+.+.. T Consensus 16 mk~l~el~~~~~e~~~~~~~~~~e----l~~~~~~~~~~~ee~~~~~~~------~~~l~~~~~~l~~~~~~~e~~~~~~ 85 (402) T protein:vir:93 16 MPTLYELKQSLGMIGQQLKNKNDE----LSQKATDPNIDMEDIKQLETE------KAGLQQRFNIVERQVQDIEEKEKAK 85 (402) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHH----HHHHHhccCcCHHHHHHHHHH------HHHHHHHHHHHHHHHHHHHHHHHhh Confidence 888888887776666555554432 222222111111122111111 1222233333333333322221110 Q ss_pred HHHHHHHHHHHHHhhhhhhHHHHhHhhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccccc Q lcl|NC_021309. 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFA 160 (497) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 160 (497) ... .... .......... ...+. ...+.......... ..............++++.+| T Consensus 86 ~~~--------~~~~--~~~~~~~~~~---~~~~~-------~~~r~~~~~~~~~~---~~~~~~~~~~a~~~~t~~~GG 142 (402) T protein:vir:93 86 VKD--------KGEA--YQSLSDNEKM---VKAKA-------EFYRHAILPNEFEK---PSMEAQRLLHALPTGNDSGGD 142 (402) T ss_pred hhh--------cccc--CCCCchhHHH---HHHHH-------HHHHHHHhhhhHHH---HHHhHHHHHhhhccCCCcCCc Confidence 000 0000 0000000000 00000 00000000000000 001111112223334445566 Q ss_pred cccchhhhHHHHHHHHhhhhHHhhcceeecCCCceEEEEEcCCCccceecccccccccccccceeeEeeeeeEEeeehhh Q lcl|NC_021309. 161 PGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTIT 240 (497) Q Consensus 161 ~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~wv~Eg~~~~~s~~~f~~i~~~~~kla~~~~iS 240 (497) ++||+++..+||+.++..++|+++|+++++++ ..+|+.+.....+.|++|++..|+++++|++|++.+|+++++++|| T Consensus 143 ~lIP~~~~~~Ii~~~~~~~~l~~~~~v~~~~~--~~~p~~~~~~~~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~i~iS 220 (402) T protein:vir:93 143 KLLPKTLSKEIVSEPFAKNQLREKARLTNIKG--LEIPRVSYTLDDDDFITDVETAKELKAKGDTVKFTTNKFKVFAAIS 220 (402) T ss_pred cccchhHHHHHHHhHHhhhhhhhhceeeecCC--ceeeeeeccCCccccccccccccccccccceeeecceeeeeechhh Confidence 67777788999999999999999999998875 4577766545678999999999999999999999999999999999 Q ss_pred HHHHhhH-HHHHHHHHHHHHHHHHHHHHh-hhhcccCccccccccccccccccccccchhhhhhhHHHHHHhhhhhcchh Q lcl|NC_021309. 241 DEGLRDA-PELFNFVQGRLLEGIQRKEEV-QLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGA 318 (497) Q Consensus 241 ~ell~d~-~~l~~~i~~~la~~~~~~~d~-~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 318 (497) +|||+|+ +++++||.++|+++++.+++. .|..|+|+++|.|+++..+...++. T Consensus 221 ~ell~Ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~p~g~~~~~~~~~~~~------------------------- 275 (402) T protein:vir:93 221 DTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSVKEVEG------------------------- 275 (402) T ss_pred HHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeeccccccccc------------------------- Confidence 9999997 689999999999999999765 5778899999999987654332211 Q ss_pred hhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHHHHHHHhhhcCceec Q lcl|NC_021309. 319 FVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMG 398 (497) Q Consensus 319 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~i~ 398 (497) ...++.+.++++.+...|+.+ .+|+||+.+|..+..+++..|+|+| T Consensus 276 ---------------------------------~~~~d~l~~~~~~l~~~y~~n-a~~imn~~t~~~~~~~~~d~~~~~~ 321 (402) T protein:vir:93 276 ---------------------------------ADMYDAIINALADLHEDYRDN-ATIYMRYADYVKIISVLSNGTTNFF 321 (402) T ss_pred ---------------------------------cchHHHHHHHHhccChhhhcC-CEEEEechHHHHHHHHHhcCCCccc Confidence 011356667777777776654 4699999999888777777777877 Q ss_pred cCcccccccccccccccccccceEecCCCCcCceEEEeeccceEEEEeecccEEEeecccchhhhcCceEEEEEEeecce Q lcl|NC_021309. 399 GNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLL 478 (497) Q Consensus 399 ~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~ 478 (497) .+. +.+|+|+||++++.++ +++||||+++ |.+++ ++.+...+. ..++++.||+..|+|++ T Consensus 322 ~~~-----------~~~llG~PV~~t~~~~--~i~~GDf~~~-~~~~~--~~~~~~~~~----~~~~~~~~~~~~r~Dg~ 381 (402) T protein:vir:93 322 DTP-----------AEKVFGKPVVFTDAAV--KPIVGDFNYF-GINYD--GTTYDTDKD----VKKGEYLFVLTAWYDQQ 381 (402) T ss_pred ccC-----------CccccccceEEecCCC--ceeeechhhh-hhhhh--hhhhhhhhc----ccCCceEEEEEEEeCcE Confidence 532 3479999999999875 5899999984 54544 344444332 23589999999999999 Q ss_pred eecccceEEEEeeCCCCCC Q lcl|NC_021309. 479 VYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 479 v~~~~a~~~l~~~~~a~~~ 497 (497) |++|+||+.|++++++..+ T Consensus 382 v~~~~A~~~l~ik~~~~~~ 400 (402) T protein:vir:93 382 RTLDSAFRIAKAKENTGPL 400 (402) T ss_pred EechhheEEEEeecCCCCC Confidence 9999999999999887666 No 62 >protein:vir:96762 Length: 632 # NCBI annotation: putative phage-related protein # Family: family:all:21 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039818;genbank:gi:126010917;genbank:GeneID:5076272 Probab=100.00 E-value=9.4e-57 Score=327.74 Aligned_cols=413 Identities=14% Similarity=0.117 Sum_probs=236.5 Q ss_pred CchHHHHHHHHHHHHHHHH------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhHHHHHHHHH---HHHHH Q lcl|NC_021309. 1 MPSTAQLEAQGRQLAKSIK------------DINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEML---KSLGG 65 (497) Q Consensus 1 ~~~~a~~~~~~~~~~~~~~------------~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~e~~~e~~---~~~~~ 65 (497) ||...+..+...+...+.+ ...+.....+.+.+++...+....+.+.. .+..++.. ..+++ T Consensus 185 ~~~~~~~~~~~~~~~~~~r~~~~~a~~~~~~~~~a~~~~~~~~E~~r~~eI~~l~~~~~~----~~~~~~ai~~g~sld~ 260 (632) T protein:vir:96 185 MPDKDKQTQTAGSQQTETRGAETGAKNPAPAASGANENDILSRERTRISEITAIGQQFSQ----RSLAQEAIQKGHTVDQ 260 (632) T ss_pred ccchhhhhhccccccccccchhhcccccchhhhhhhhhhhhhhhHHHHHHHHHHHHHhhh----hhhHHHHHhccccHHH Confidence 3322211110000000000 00000000011111111111111111110 00000000 01111 Q ss_pred HHHHHH-HHHHHHHHHHHHHHH-------HHHH--HHHhhhh---hhHHHHhHhhhhhhhhhhhhHHHHHHhhhHHHHHH Q lcl|NC_021309. 66 ADAAKD-GLDNDIPEVEVRNLK-------QIRK--HLARAVI---MNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAA 132 (497) Q Consensus 66 ~~a~~~-~~~~~~~~~~~~~~~-------~~~~--~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 132 (497) .+++.. .+............. .... ....... .....+.......+..................... T Consensus 261 ~ra~~ld~l~~~~~a~~~~~~a~~~~~~~~~~~~~~i~~~~re~~~~~l~rai~a~a~~~~~~a~~~~e~a~~~a~~~G~ 340 (632) T protein:vir:96 261 FRALVLERMNPGQPGNFEKPGAGDLPGKPAIHSARDLGIQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGK 340 (632) T ss_pred HHHHHHHHHhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHHhhhccchhhhhhhhHHHHHHHHhhhh Confidence 111110 000000000000000 0000 0000000 00000000000000000000000000000000000 Q ss_pred HHHHHHHhhhhhhhhhhhhccccccccccccchhhh-HHHHHHHHhhhhHHhh-cceeecCCCceEEEEEcCCCccceec Q lcl|NC_021309. 133 ELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFL-PGIVEQLFYELSLADL-ISSRPVTSPNLSYLTESAAHNNAAAV 210 (497) Q Consensus 133 ~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~p~~~-~~ii~~~~~~~~l~~~-~~~~~~~~~~~~~p~~~~~~~~a~wv 210 (497) ..++.... ...........++++++|++||+++. .+||+.+++.++++++ +++++..++.++||+++++ +.++|| T Consensus 341 ~arg~~~~--~~~l~~ra~~~~t~~~gg~lvp~~~~~~~iie~lr~~s~i~~l~~~~~~~~~g~~~ip~~~~~-~~a~wv 417 (632) T protein:vir:96 341 EARGFYMP--HEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKTSG-ANFYWI 417 (632) T ss_pred hhhhhhhh--HHHHHHhhhhcccccccccccccccchHHHHHHHhhcchhhhhcceEeecCCcceEEEEEeCC-ceeEee Confidence 00110000 00111223334455667778888865 6799999999999998 7888998889999999974 689999 Q ss_pred ccccccccccccceeeEeeeeeEEeeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhcccCc-cccccccccccc Q lcl|NC_021309. 211 AEAGTYPFSSEEFARVYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGY-PGVNGLLQRSTG 288 (497) Q Consensus 211 ~Eg~~~~~s~~~f~~i~~~~~kla~~~~iS~ell~d~-~~l~~~i~~~la~~~~~~~d~~~l~G~G~-~~p~Gi~~~~~~ 288 (497) +|++.+|+++++|++|+++++|++++++||+|||+|+ ++++++|+++|.++++.++|.++|+|+|+ ++|.||++.+++ T Consensus 418 ~E~~~~~~s~~~f~~i~l~~~k~~~~v~iS~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~ 497 (632) T protein:vir:96 418 GEDEDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGV 497 (632) T ss_pred cCCccccccccceeeEEeeeeEEEEehhhHHHHHhccchHHHHHHHHHHHHHHHHHHHHHhhcccCCCCccceeeecccc Confidence 9999999999999999999999999999999999876 79999999999999999999999999996 579999998765 Q ss_pred cccccccchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhh Q lcl|NC_021309. 289 FTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLT 368 (497) Q Consensus 289 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 368 (497) .+........ .+..+..+...+... T Consensus 498 ~~~~~~~~~~-------------------------------------------------------~~~~i~~~~~~i~~~ 522 (632) T protein:vir:96 498 PALTYPAGGV-------------------------------------------------------DWASVVDMETKISTF 522 (632) T ss_pred cceecccccC-------------------------------------------------------CHHHHHHHHHHHhhc Confidence 4433221110 011222222333333 Q ss_pred hc-cCCceEEechhHHHHHHH--HhhhcCceeccCcccccccccccccccccccceEecCCCCcCceEEEeeccceEEEE Q lcl|NC_021309. 369 LF-QTPNAVVMNPRDWELLRL--TKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTA 445 (497) Q Consensus 369 ~~-~~~~~~~~n~~~~~~l~~--lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~~~~gd~~~~~~~i~ 445 (497) +. .+..+|+||+..+..+.+ ++|++|+|||.+ .+|+|+||++++.+|.++++||||++ |.++ T Consensus 523 ~~~~~~~~~~~~~~~~~~l~~~~l~d~~G~~i~~~-------------~~l~G~pv~~s~~ip~~~~~~gd~s~--~~i~ 587 (632) T protein:vir:96 523 NADAGRLAYLTSVTQRGAAKKAQVFDNTGERIWQN-------------NEVNGYRAEASNQIPADTWIFGDWSQ--IVIA 587 (632) T ss_pred ccccCccEEEEchhHHHHHHHHhccCCCCceeecC-------------CeecccceEeccccccCcEEEeecce--EEEE Confidence 22 234579999988777765 779999999964 36899999999999999999999998 5688 Q ss_pred eecccEEEeecccchhhhcCceEEEEEEeecceeecccceEEEEeeC Q lcl|NC_021309. 446 RREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKK 492 (497) Q Consensus 446 ~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~ 492 (497) ++.+++|.++++.. |.+|++.||++.|+|++|++|++|++++..+ T Consensus 588 ~~~~~~i~~~~~~~--~~~~~v~~~~~~~~d~~v~~~~af~~~k~~A 632 (632) T protein:vir:96 588 MWGVLDLKVDPYTK--AASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) T ss_pred EecceEEEEccccc--cccCceEEEEEeecCceeechhhhhheeecC Confidence 89999999998764 8899999999999999999999999999998 No 63 >protein:vir:104085 Length: 320 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655596;genbank:gi:109392467;genbank:GeneID:4156953 Probab=100.00 E-value=6.5e-58 Score=334.10 Aligned_cols=305 Identities=12% Similarity=0.079 Sum_probs=239.6 Q ss_pred HHhhhhhhhhhhhhccccccccccccchhhhHHHHHHHHhhhhHHhhcceeecCCCceEEEEEcCCCccceecccccccc Q lcl|NC_021309. 138 FADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYP 217 (497) Q Consensus 138 ~~~~~~~~~~~~~~~~~~~~~~g~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~wv~Eg~~~~ 217 (497) ...+............++++.+|++|||++..+||+.+++.++|+++|+++++++++++||+.++ .+.+.|++|++.+| T Consensus 1 ~~~~~~~~~~~~~~~~t~~~~~~~~ip~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~-~~~a~~v~E~~~~~ 79 (320) T protein:vir:10 1 MAAGTAFQVDHAQIAQTGDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVPMGTTGQKIPHWIG-DVSAQWIGEGDMKP 79 (320) T ss_pred CCCCccCCHHHHHhhccccccccccccHHHHHHHHHHHHhccchhhhcceeeccCCceEEEEEeC-CcceEEecCCcccc Confidence 00111111223334555667778899999999999999999999999999999999999999886 46799999999999 Q ss_pred cccccceeeEeeeeeEEeeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccc Q lcl|NC_021309. 218 FSSEEFARVYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASS 296 (497) Q Consensus 218 ~s~~~f~~i~~~~~kla~~~~iS~ell~d~-~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~ 296 (497) +++++|+++++.++|++++++||+|+|+|+ ++++++|.+.|++++++++|++||+|+|++.|.++.......+...... T Consensus 80 ~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~l~~a~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~ 159 (320) T protein:vir:10 80 ITKGNMTSQNIAPHKIATIFVASAETVRANPANYLGTMRTKVATAFAMAFDSAALNGTDSPFPTYLAQTTKSVSLADPGG 159 (320) T ss_pred ccccceeEEEEeeEEEEEeehhhHHHHhcChHHHHHHHHHHHHHHHHHHHHHHhhcccCCCCCcccccccccccceeccc Confidence 999999999999999999999999999986 6899999999999999999999999999998888766544333222111 Q ss_pred hhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceE Q lcl|NC_021309. 297 LFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAV 376 (497) Q Consensus 297 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 376 (497) ...... ....+.+...+..+.. .+..+.+| T Consensus 160 ~~~~~~-------------------------------------------------~~~~~~~~~~~~~~~~-~~~~~~~~ 189 (320) T protein:vir:10 160 ATASDL-------------------------------------------------TAYDAVAVNGLSLLVN-AKKKWTHT 189 (320) T ss_pred cccccc-------------------------------------------------ccHHHHHHHHHhhhhc-ccCCCcEE Confidence 100000 0001122233333333 34556789 Q ss_pred EechhHHHHHHHHhhhcCceeccCcccccccccccccccccccceEecCCCCcCc--eEEEeeccceEEEEeecccEEEe Q lcl|NC_021309. 377 VMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGT--ILVGHFAPSVIQTARREGVTMQM 454 (497) Q Consensus 377 ~~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~--~~~gd~~~~~~~i~~r~~~~i~~ 454 (497) +||+.+|..|+++||++|+|+|++........ .....+++|+||++++.+|.++ ++||||++ +.++++.++++++ T Consensus 190 v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~-~~~~~~i~g~pv~~~~~~~~~~~~~~~gd~~~--~~~~~~~~~~i~~ 266 (320) T protein:vir:10 190 LLDDIVEPILNGAKDKNGRPLFIESTYTDENS-PFRAGRIVSRPTILSDHVADGTTVGYMGDFRN--VIWGQVGGLSFDV 266 (320) T ss_pred EEcHHHHHHHHHhhccCCceeeccccccCccc-cccCceeeeeeeEecCCCCCCceEEEEeecce--EEEEEecCeEEEE Confidence 99999999999999999999998755443322 2234579999999999999987 57899997 5588999999998 Q ss_pred ecccc------------hhhhcCceEEEEEEeecceeecccceEEEEeeCCCCC Q lcl|NC_021309. 455 TNSNG------------TDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGATG 496 (497) Q Consensus 455 ~~~~~------------~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~~a~~ 496 (497) +++.. ++|++|++.||+++|+|+.|++|+||++|+..+++.+ T Consensus 267 ~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~~~d~~v~~~~a~~~l~~~~ap~~ 320 (320) T protein:vir:10 267 TDQATLNLGTPTEPNFVSLWQHNLVAVRVEAEYAFHNNDKDAFVKLTNVVTPDA 320 (320) T ss_pred eecceeeeccccccccchhhhcCcEEEEEEEeeccEEecccceEEEEeccCCCC Confidence 87643 5699999999999999999999999999998888777 No 64 >protein:vir:94424 Length: 387 # NCBI annotation: ORF010 # Family: family:all:658 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240005;genbank:gi:66395666;genbank:GeneID:5133084 Probab=100.00 E-value=1.3e-56 Score=327.01 Aligned_cols=383 Identities=14% Similarity=0.092 Sum_probs=245.8 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021309. 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) Q Consensus 1 ~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~a~~~~~~~~~~~~ 80 (497) |+.+.++++++.++.++++++..+. .++........+++++...+ .+++.+++++++.+++.++.+.... T Consensus 1 Mk~l~el~~~~~~~~~~~~~~~~el----~e~~~~~~~~~eei~~~~~~------~~~l~~~~~~l~~~~~~~e~~~~~~ 70 (387) T protein:vir:94 1 MPTLYELKQSLGMIGQQLKNKNDEL----SQKATDPNIDMEDIKQLETE------KAGLQQRFNIVERQVQDIEEKEKAK 70 (387) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHH----HHHHhccCcCHHHHHHHHHH------HHHHHHHHHHHHHHHHHHHHHHHhh Confidence 9999999988877777666665432 22222111111112111111 1222233333333333322211110 Q ss_pred HHHHHHHHHHHHHhhhhhhHHHHhHhhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccccc Q lcl|NC_021309. 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFA 160 (497) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 160 (497) .. ....... ........ ...+. ...+............ ............++++.+| T Consensus 71 ~~--------~~~~~~~--~~~~~~~~---~~~~~-------~~~r~~~~~~~~~~~~---~~~~~~~~a~~~~~~~~gG 127 (387) T protein:vir:94 71 VK--------DKGEAYQ--SLSDNEKM---VKAKA-------EFYRHAILPNEFEKPS---MEAQRLLHALPTGNDSGGD 127 (387) T ss_pred hh--------hccccCC--CCchhHHH---HHHHH-------HHHHHHHhhhhHHHHH---HHHHHHHhhhccCCCCCCc Confidence 00 0000000 00000000 00000 0000000000000000 0111111222334445556 Q ss_pred cccchhhhHHHHHHHHhhhhHHhhcceeecCCCceEEEEEcCCCccceecccccccccccccceeeEeeeeeEEeeehhh Q lcl|NC_021309. 161 PGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTIT 240 (497) Q Consensus 161 ~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~wv~Eg~~~~~s~~~f~~i~~~~~kla~~~~iS 240 (497) .+||+++..+||+.++..++|+++|+++++++. .+|+.+.....++|++|++..++++++|++|++.+++++++++|| T Consensus 128 ~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~--~~p~~~~~~~~a~~v~Eg~~~~~~~~~f~~v~l~~~k~~~~i~iS 205 (387) T protein:vir:94 128 KLLPKTLSKEIVSEPFAKNQLREKARLTNIKGL--EIPRVSYTLDDDDFITDVETAKELKAKGDTVKFTTNKFKVFAAIS 205 (387) T ss_pred eeechhHHHHHHHHHHhhchhhhhceeeecCCc--eeeeeeccCCccccccccccccccccccceeeechheeeeechhh Confidence 667777889999999999999999999888754 577766555679999999999999999999999999999999999 Q ss_pred HHHHhhH-HHHHHHHHHHHHHHHHHHHHh-hhhcccCccccccccccccccccccccchhhhhhhHHHHHHhhhhhcchh Q lcl|NC_021309. 241 DEGLRDA-PELFNFVQGRLLEGIQRKEEV-QLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGA 318 (497) Q Consensus 241 ~ell~d~-~~l~~~i~~~la~~~~~~~d~-~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 318 (497) +|||+|+ ++|++||.++|+++++.+++. .|..|+|+++|.|+++..+...++. T Consensus 206 ~ell~ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~~~g~~~~~~~~~~~~------------------------- 260 (387) T protein:vir:94 206 DTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSVKEVEG------------------------- 260 (387) T ss_pred HHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeeccccccccc------------------------- Confidence 9999997 689999999999999999764 5778889999999987654332111 Q ss_pred hhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHHHHHHHhhhcCceec Q lcl|NC_021309. 319 FVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMG 398 (497) Q Consensus 319 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~i~ 398 (497) ...++.+.++++.+...|+.+ .+|+||+.+|..+..+++..|+|+| T Consensus 261 ---------------------------------~~~~d~i~~~~~~l~~~y~~n-a~~imn~~t~~~~~~~~~~~~~~~~ 306 (387) T protein:vir:94 261 ---------------------------------ADMYDAIINALADLHEDYRDN-ATIYMRYADYVKIISVLSNGTTNFF 306 (387) T ss_pred ---------------------------------cchHHHHHHHHhccChhhhcC-CEEEEechHHHHHHHHHhcCCCccc Confidence 012456777777777777654 4799999999988877777888887 Q ss_pred cCcccccccccccccccccccceEecCCCCcCceEEEeeccceEEEEeecccEEEeecccchhhhcCceEEEEEEeecce Q lcl|NC_021309. 399 GNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLL 478 (497) Q Consensus 399 ~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~ 478 (497) .+. +.+|+|+||++++.++ +++||||+++ |.++ .++.+...++ ..+|++.|++..|+|++ T Consensus 307 ~~~-----------~~~llG~PV~~~~~~~--~~~~GDf~~~-~~~~--~~~~~~~~~~----~~~~~~~~~~~~r~Dg~ 366 (387) T protein:vir:94 307 DTP-----------AEKVFGKPVVFTDAAV--KPIVGDFNYF-GINY--DGTTYDTDKD----VKKGEYLFVLTAWYDQQ 366 (387) T ss_pred ccC-----------CccccccceEEecCCC--ceeeechhhh-hhhh--hhhhheeccc----ccCCceEEEEEEEeCcE Confidence 532 3479999999999875 5899999984 4443 3455554443 23689999999999999 Q ss_pred eecccceEEEEeeCCCCCC Q lcl|NC_021309. 479 VYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 479 v~~~~a~~~l~~~~~a~~~ 497 (497) |++|+||+++++++++..+ T Consensus 367 v~~~~A~~~l~~ka~~~~~ 385 (387) T protein:vir:94 367 RTLDSAFRIAKAKENTGPL 385 (387) T ss_pred eechhheEEEEeecCCCCC Confidence 9999999999999887777 No 65 >protein:vir:2685 Length: 387 # NCBI annotation: hypothetical protein # Family: family:all:658 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075504;genbank:gi:12719433;genbank:GeneID:920169 Probab=100.00 E-value=1.3e-56 Score=327.01 Aligned_cols=383 Identities=14% Similarity=0.092 Sum_probs=245.8 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021309. 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) Q Consensus 1 ~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~a~~~~~~~~~~~~ 80 (497) |+.+.++++++.++.++++++..+. .++........+++++...+ .+++.+++++++.+++.++.+.... T Consensus 1 Mk~l~el~~~~~~~~~~~~~~~~el----~e~~~~~~~~~eei~~~~~~------~~~l~~~~~~l~~~~~~~e~~~~~~ 70 (387) T protein:vir:26 1 MPTLYELKQSLGMIGQQLKNKNDEL----SQKATDPNIDMEDIKQLETE------KAGLQQRFNIVERQVQDIEEKEKAK 70 (387) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHH----HHHHhccCcCHHHHHHHHHH------HHHHHHHHHHHHHHHHHHHHHHHhh Confidence 9999999988877777666665432 22222111111112111111 1222233333333333322211110 Q ss_pred HHHHHHHHHHHHHhhhhhhHHHHhHhhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccccc Q lcl|NC_021309. 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFA 160 (497) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 160 (497) .. ....... ........ ...+. ...+............ ............++++.+| T Consensus 71 ~~--------~~~~~~~--~~~~~~~~---~~~~~-------~~~r~~~~~~~~~~~~---~~~~~~~~a~~~~~~~~gG 127 (387) T protein:vir:26 71 VK--------DKGEAYQ--SLSDNEKM---VKAKA-------EFYRHAILPNEFEKPS---MEAQRLLHALPTGNDSGGD 127 (387) T ss_pred hh--------hccccCC--CCchhHHH---HHHHH-------HHHHHHHhhhhHHHHH---HHHHHHHhhhccCCCCCCc Confidence 00 0000000 00000000 00000 0000000000000000 0111111222334445556 Q ss_pred cccchhhhHHHHHHHHhhhhHHhhcceeecCCCceEEEEEcCCCccceecccccccccccccceeeEeeeeeEEeeehhh Q lcl|NC_021309. 161 PGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTIT 240 (497) Q Consensus 161 ~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~wv~Eg~~~~~s~~~f~~i~~~~~kla~~~~iS 240 (497) .+||+++..+||+.++..++|+++|+++++++. .+|+.+.....++|++|++..++++++|++|++.+++++++++|| T Consensus 128 ~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~--~~p~~~~~~~~a~~v~Eg~~~~~~~~~f~~v~l~~~k~~~~i~iS 205 (387) T protein:vir:26 128 KLLPKTLSKEIVSEPFAKNQLREKARLTNIKGL--EIPRVSYTLDDDDFITDVETAKELKAKGDTVKFTTNKFKVFAAIS 205 (387) T ss_pred eeechhHHHHHHHHHHhhchhhhhceeeecCCc--eeeeeeccCCccccccccccccccccccceeeechheeeeechhh Confidence 667777889999999999999999999888754 577766555679999999999999999999999999999999999 Q ss_pred HHHHhhH-HHHHHHHHHHHHHHHHHHHHh-hhhcccCccccccccccccccccccccchhhhhhhHHHHHHhhhhhcchh Q lcl|NC_021309. 241 DEGLRDA-PELFNFVQGRLLEGIQRKEEV-QLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGA 318 (497) Q Consensus 241 ~ell~d~-~~l~~~i~~~la~~~~~~~d~-~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 318 (497) +|||+|+ ++|++||.++|+++++.+++. .|..|+|+++|.|+++..+...++. T Consensus 206 ~ell~ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~~~g~~~~~~~~~~~~------------------------- 260 (387) T protein:vir:26 206 DTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSVKEVEG------------------------- 260 (387) T ss_pred HHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeeccccccccc------------------------- Confidence 9999997 689999999999999999764 5778889999999987654332111 Q ss_pred hhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHHHHHHHhhhcCceec Q lcl|NC_021309. 319 FVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMG 398 (497) Q Consensus 319 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~i~ 398 (497) ...++.+.++++.+...|+.+ .+|+||+.+|..+..+++..|+|+| T Consensus 261 ---------------------------------~~~~d~i~~~~~~l~~~y~~n-a~~imn~~t~~~~~~~~~~~~~~~~ 306 (387) T protein:vir:26 261 ---------------------------------ADMYDAIINALADLHEDYRDN-ATIYMRYADYVKIISVLSNGTTNFF 306 (387) T ss_pred ---------------------------------cchHHHHHHHHhccChhhhcC-CEEEEechHHHHHHHHHhcCCCccc Confidence 012456777777777777654 4799999999988877777888887 Q ss_pred cCcccccccccccccccccccceEecCCCCcCceEEEeeccceEEEEeecccEEEeecccchhhhcCceEEEEEEeecce Q lcl|NC_021309. 399 GNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLL 478 (497) Q Consensus 399 ~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~ 478 (497) .+. +.+|+|+||++++.++ +++||||+++ |.++ .++.+...++ ..+|++.|++..|+|++ T Consensus 307 ~~~-----------~~~llG~PV~~~~~~~--~~~~GDf~~~-~~~~--~~~~~~~~~~----~~~~~~~~~~~~r~Dg~ 366 (387) T protein:vir:26 307 DTP-----------AEKVFGKPVVFTDAAV--KPIVGDFNYF-GINY--DGTTYDTDKD----VKKGEYLFVLTAWYDQQ 366 (387) T ss_pred ccC-----------CccccccceEEecCCC--ceeeechhhh-hhhh--hhhhheeccc----ccCCceEEEEEEEeCcE Confidence 532 3479999999999875 5899999984 4443 3455554443 23689999999999999 Q ss_pred eecccceEEEEeeCCCCCC Q lcl|NC_021309. 479 VYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 479 v~~~~a~~~l~~~~~a~~~ 497 (497) |++|+||+++++++++..+ T Consensus 367 v~~~~A~~~l~~ka~~~~~ 385 (387) T protein:vir:26 367 RTLDSAFRIAKAKENTGPL 385 (387) T ss_pred eechhheEEEEeecCCCCC Confidence 9999999999999887777 No 66 >protein:vir:96978 Length: 387 # NCBI annotation: ORF009 # Family: family:all:658 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239859;genbank:gi:66395517;genbank:GeneID:5133011 Probab=100.00 E-value=1.3e-56 Score=327.01 Aligned_cols=383 Identities=14% Similarity=0.092 Sum_probs=245.8 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021309. 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) Q Consensus 1 ~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~a~~~~~~~~~~~~ 80 (497) |+.+.++++++.++.++++++..+. .++........+++++...+ .+++.+++++++.+++.++.+.... T Consensus 1 Mk~l~el~~~~~~~~~~~~~~~~el----~e~~~~~~~~~eei~~~~~~------~~~l~~~~~~l~~~~~~~e~~~~~~ 70 (387) T protein:vir:96 1 MPTLYELKQSLGMIGQQLKNKNDEL----SQKATDPNIDMEDIKQLETE------KAGLQQRFNIVERQVQDIEEKEKAK 70 (387) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHH----HHHHhccCcCHHHHHHHHHH------HHHHHHHHHHHHHHHHHHHHHHHhh Confidence 9999999988877777666665432 22222111111112111111 1222233333333333322211110 Q ss_pred HHHHHHHHHHHHHhhhhhhHHHHhHhhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccccc Q lcl|NC_021309. 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFA 160 (497) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 160 (497) .. ....... ........ ...+. ...+............ ............++++.+| T Consensus 71 ~~--------~~~~~~~--~~~~~~~~---~~~~~-------~~~r~~~~~~~~~~~~---~~~~~~~~a~~~~~~~~gG 127 (387) T protein:vir:96 71 VK--------DKGEAYQ--SLSDNEKM---VKAKA-------EFYRHAILPNEFEKPS---MEAQRLLHALPTGNDSGGD 127 (387) T ss_pred hh--------hccccCC--CCchhHHH---HHHHH-------HHHHHHHhhhhHHHHH---HHHHHHHhhhccCCCCCCc Confidence 00 0000000 00000000 00000 0000000000000000 0111111222334445556 Q ss_pred cccchhhhHHHHHHHHhhhhHHhhcceeecCCCceEEEEEcCCCccceecccccccccccccceeeEeeeeeEEeeehhh Q lcl|NC_021309. 161 PGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTIT 240 (497) Q Consensus 161 ~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~wv~Eg~~~~~s~~~f~~i~~~~~kla~~~~iS 240 (497) .+||+++..+||+.++..++|+++|+++++++. .+|+.+.....++|++|++..++++++|++|++.+++++++++|| T Consensus 128 ~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~--~~p~~~~~~~~a~~v~Eg~~~~~~~~~f~~v~l~~~k~~~~i~iS 205 (387) T protein:vir:96 128 KLLPKTLSKEIVSEPFAKNQLREKARLTNIKGL--EIPRVSYTLDDDDFITDVETAKELKAKGDTVKFTTNKFKVFAAIS 205 (387) T ss_pred eeechhHHHHHHHHHHhhchhhhhceeeecCCc--eeeeeeccCCccccccccccccccccccceeeechheeeeechhh Confidence 667777889999999999999999999888754 577766555679999999999999999999999999999999999 Q ss_pred HHHHhhH-HHHHHHHHHHHHHHHHHHHHh-hhhcccCccccccccccccccccccccchhhhhhhHHHHHHhhhhhcchh Q lcl|NC_021309. 241 DEGLRDA-PELFNFVQGRLLEGIQRKEEV-QLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGA 318 (497) Q Consensus 241 ~ell~d~-~~l~~~i~~~la~~~~~~~d~-~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 318 (497) +|||+|+ ++|++||.++|+++++.+++. .|..|+|+++|.|+++..+...++. T Consensus 206 ~ell~ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~~~g~~~~~~~~~~~~------------------------- 260 (387) T protein:vir:96 206 DTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSVKEVEG------------------------- 260 (387) T ss_pred HHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeeccccccccc------------------------- Confidence 9999997 689999999999999999764 5778889999999987654332111 Q ss_pred hhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHHHHHHHhhhcCceec Q lcl|NC_021309. 319 FVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMG 398 (497) Q Consensus 319 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~i~ 398 (497) ...++.+.++++.+...|+.+ .+|+||+.+|..+..+++..|+|+| T Consensus 261 ---------------------------------~~~~d~i~~~~~~l~~~y~~n-a~~imn~~t~~~~~~~~~~~~~~~~ 306 (387) T protein:vir:96 261 ---------------------------------ADMYDAIINALADLHEDYRDN-ATIYMRYADYVKIISVLSNGTTNFF 306 (387) T ss_pred ---------------------------------cchHHHHHHHHhccChhhhcC-CEEEEechHHHHHHHHHhcCCCccc Confidence 012456777777777777654 4799999999988877777888887 Q ss_pred cCcccccccccccccccccccceEecCCCCcCceEEEeeccceEEEEeecccEEEeecccchhhhcCceEEEEEEeecce Q lcl|NC_021309. 399 GNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLL 478 (497) Q Consensus 399 ~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~ 478 (497) .+. +.+|+|+||++++.++ +++||||+++ |.++ .++.+...++ ..+|++.|++..|+|++ T Consensus 307 ~~~-----------~~~llG~PV~~~~~~~--~~~~GDf~~~-~~~~--~~~~~~~~~~----~~~~~~~~~~~~r~Dg~ 366 (387) T protein:vir:96 307 DTP-----------AEKVFGKPVVFTDAAV--KPIVGDFNYF-GINY--DGTTYDTDKD----VKKGEYLFVLTAWYDQQ 366 (387) T ss_pred ccC-----------CccccccceEEecCCC--ceeeechhhh-hhhh--hhhhheeccc----ccCCceEEEEEEEeCcE Confidence 532 3479999999999875 5899999984 4443 3455554443 23689999999999999 Q ss_pred eecccceEEEEeeCCCCCC Q lcl|NC_021309. 479 VYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 479 v~~~~a~~~l~~~~~a~~~ 497 (497) |++|+||+++++++++..+ T Consensus 367 v~~~~A~~~l~~ka~~~~~ 385 (387) T protein:vir:96 367 RTLDSAFRIAKAKENTGPL 385 (387) T ss_pred eechhheEEEEeecCCCCC Confidence 9999999999999887777 No 67 >protein:vir:5739 Length: 366 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892050;genbank:gi:33770513;interpro:IPR006444;uniprot:Q7Y410;genbank:GeneID:1732928 Probab=100.00 E-value=6.3e-58 Score=334.18 Aligned_cols=346 Identities=16% Similarity=0.110 Sum_probs=229.8 Q ss_pred HHHHHHHHHHHHHHHHHhhhhhhHHHHhHhhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccc Q lcl|NC_021309. 77 IPEVEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGST 156 (497) Q Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (497) ++...+...... +.........+.+ ..++-.+...... .....+......+...... ... .......+++ T Consensus 1 ~a~~~a~~~~~~--~~~~~~~~~~~~~----~~kg~~~~~~~~a--~a~~~g~~~~a~~~a~~~~-~~~-~~~~a~~~~~ 70 (366) T protein:vir:57 1 MAAAVAVPVKAH--SVAPGIIIKEELQ----QYKGAGMTRMVMS--IAAGKGNLADAAKFAATEL-GDT-GLSMAISTAA 70 (366) T ss_pred Cccccccccccc--ccccccccccccc----cccchhHHHHHHH--HHhcccchhHHHHHHHHhh-cch-hhhhhccccc Confidence 111111000000 0000000000000 0001001100000 0000000000000000000 000 0111222344 Q ss_pred cccccccchhhhHHHHHHHHhhhhHHhh-cceeecCCCceEEEEEcCCCccceecccccccccccccceeeEeeeeeEEe Q lcl|NC_021309. 157 GTFAPGILPTFLPGIVEQLFYELSLADL-ISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVAN 235 (497) Q Consensus 157 ~~~g~~v~p~~~~~ii~~~~~~~~l~~~-~~~~~~~~~~~~~p~~~~~~~~a~wv~Eg~~~~~s~~~f~~i~~~~~kla~ 235 (497) +++|.+||+++..+||+.+++.++|+.+ ++++++.++.++||++++ .+.++|++|++.+|+++++|++|++.++|+++ T Consensus 71 ~~Gg~lvP~~~~~~ii~~l~~~s~l~~lg~~~v~~~~g~~~~p~~t~-~~~a~wv~E~~~~~~s~~~f~~i~~~~~k~~~ 149 (366) T protein:vir:57 71 GSGGALIPQNMQNEVIELLRDRTVVRILGARSIPLPNGNLSMPRLSG-GATAGYVGEGKDVVATGATFDDVKLSAKTMIA 149 (366) T ss_pred cCCccccchhHHHHHHHHHhhhcchhhhceeeeecCCCceEEEEEeC-CcceeeeccCccccccccceeEEEEeeEEEEE Confidence 5666677778888999999999999998 899999888999999987 47899999999999999999999999999999 Q ss_pred eehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhcccCcc-ccccccccccccccccccchhhhhhhHHHHHHhhhh Q lcl|NC_021309. 236 ALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYP-GVNGLLQRSTGFTASSASSLFGATSATVSNVKFPAD 313 (497) Q Consensus 236 ~~~iS~ell~d~-~~l~~~i~~~la~~~~~~~d~~~l~G~G~~-~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 313 (497) +++||+|||+|+ ++++++|+++|++++++++|.+||+|+|++ +|.||++..+..+............. T Consensus 150 ~~~iS~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~t~~~~~---------- 219 (366) T protein:vir:57 150 LVPVSNQLIGRAGFNVEQLLLGDILSAIATREDKAFLRDDGTGDTPKGMKAVATAANRLVAWTGTAINLT---------- 219 (366) T ss_pred eehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccceeeccccccceeeccccccchh---------- Confidence 999999999987 699999999999999999999999999975 79999988765433322110000000 Q ss_pred hcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHHHHHHHhhhc Q lcl|NC_021309. 314 GTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDAN 393 (497) Q Consensus 314 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~ 393 (497) .... ..+....... ....+.....|+||+.+|..|+++||++ T Consensus 220 ------------------------------------~~~~-~~~~~~~~~~-~~~~~~~~a~~vmn~~~~~~L~~lkd~~ 261 (366) T protein:vir:57 220 ------------------------------------TIDE-YLDSLILKHM-DSNSNMIRCGWGLSNRTYMTLFGLRDGN 261 (366) T ss_pred ------------------------------------hHHH-HHHHHHHhhh-ccccccccCEEEecHHHHHHHHhhhccC Confidence 0000 0011111111 1222344667999999999999999999 Q ss_pred CceeccCcccccccccccccccccccceEecCCCCcC--------ceEEEeeccceEEEEeecccEEEeecccc------ Q lcl|NC_021309. 394 GQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLG--------TILVGHFAPSVIQTARREGVTMQMTNSNG------ 459 (497) Q Consensus 394 G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~--------~~~~gd~~~~~~~i~~r~~~~i~~~~~~~------ 459 (497) |+|+|++.. ..+|+|+||++++.||.+ .++||||++ |.++++.+++|+++++.. T Consensus 262 G~~l~~~~~----------~g~l~G~Pvv~s~~ip~~~~~~~~~~~i~~gdfs~--~~i~~~~~i~i~~~~ea~~~~~~g 329 (366) T protein:vir:57 262 GNKVYPEMS----------QGILKGYPIQRTSAIPANLGDDGNESEIYFCDFND--VVIGEDGMMKVDFSTEATYKDADG 329 (366) T ss_pred CceeccCCC----------CCeecceeeEEccccccccccCCCccEEEEEecce--EEEEEecceEEEEeeccccccccc Confidence 999996431 247999999999999963 378999997 668999999999887632 Q ss_pred ---hhhhcCceEEEEEEeecceeecccceEEEEeeCC Q lcl|NC_021309. 460 ---TDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKG 493 (497) Q Consensus 460 ---~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~~ 493 (497) ++|++|++.+|+++|+||.|+||+||++|+-..= T Consensus 330 ~~~~~f~~~~~~iR~~~~~d~~v~~~~a~~~lt~~~~ 366 (366) T protein:vir:57 330 QLVSAFARNQSLIRVVTEHDIGFRHPEGLVLGTGVIW 366 (366) T ss_pred cchhhhhcCceeEEeeeeeCcEeeccccEEEEecccC Confidence 5799999999999999999999999999987766 No 68 >protein:vir:2430 Length: 318 # NCBI annotation: major head subunit # Family: family:all:507 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046832;genbank:gi:9630400;genbank:GeneID:1261582 Probab=100.00 E-value=1.7e-57 Score=331.87 Aligned_cols=302 Identities=12% Similarity=0.088 Sum_probs=240.7 Q ss_pred HHhhhhhhhhhhhhccccccccccccchhhhHHHHHHHHhhhhHHhhcceeecCCCceEEEEEcCCCccceecccccccc Q lcl|NC_021309. 138 FADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYP 217 (497) Q Consensus 138 ~~~~~~~~~~~~~~~~~~~~~~g~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~wv~Eg~~~~ 217 (497) ..++.........+..++++.+|++||+++..+||+.+++.++|+++|++++++++.++||++++ .+.++|++|++.+| T Consensus 1 ~~~~~~~~~e~~~~~~~~~~~~~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ip~~~~-~~~a~~v~Eg~~~~ 79 (318) T protein:vir:24 1 MAAGTAFAVDHAQIAQTGDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVPMGTTGQKIPHWVG-DVSAQWIGEGDMKP 79 (318) T ss_pred CCCCCCCCHHHHHhhcccCcccceeechhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEeC-CcceEEecCCcccc Confidence 00000011122233445566677889999999999999999999999999999999999999987 56899999999999 Q ss_pred cccccceeeEeeeeeEEeeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccc Q lcl|NC_021309. 218 FSSEEFARVYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASS 296 (497) Q Consensus 218 ~s~~~f~~i~~~~~kla~~~~iS~ell~d~-~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~ 296 (497) +++++|++|++.+||++++++||+|+|+|+ ++++++|.++|+++++.++|.++|+|+|+++|.|++............. T Consensus 80 ~~~~~f~~i~~~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~~~~~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~ 159 (318) T protein:vir:24 80 ITKGNMTSQTIAPHKIATIFVASAETVRANPANYLGTMRTKVATAFAMAFDGAAMHGTDSPFPTYIGQTTKAISIADTTG 159 (318) T ss_pred ccccceeEEEEeeEEEEEeehhhHHHhhcChHHHHHHHHHHHHHHHHHHHHHhhhcccCCCCCccccccccccccccccc Confidence 999999999999999999999999999986 6899999999999999999999999999999999887654433222211 Q ss_pred hhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceE Q lcl|NC_021309. 297 LFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAV 376 (497) Q Consensus 297 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 376 (497) .. ....+.+...+..+.. .+..+.+| T Consensus 160 ~~-----------------------------------------------------~~~~~~~~~~~~~~~~-~~~~~~~~ 185 (318) T protein:vir:24 160 AT-----------------------------------------------------TVYDQVAVNGLSLLVN-DGKKWTHT 185 (318) T ss_pred cc-----------------------------------------------------chHHHHHHHHHHhhcc-ccCCCCEE Confidence 00 0001122223333333 34455689 Q ss_pred EechhHHHHHHHHhhhcCceeccCcccccccccccccccccccceEecCCCCcCc--eEEEeeccceEEEEeecccEEEe Q lcl|NC_021309. 377 VMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGT--ILVGHFAPSVIQTARREGVTMQM 454 (497) Q Consensus 377 ~~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~--~~~gd~~~~~~~i~~r~~~~i~~ 454 (497) +||+.+|..|+++||++|+|||++....... ......+++|+||++++.+|.++ +++|||++ +.++++.++++++ T Consensus 186 v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~-~~~~~~~i~g~pv~~~~~~~~~~~~~~~gdfs~--~~~~~~~~l~i~~ 262 (318) T protein:vir:24 186 LLDDITEPILNGAKDQNGRPLFIESTYGEAA-SPFRSGRIVARPTILSDHVVEGTTVGFMGDFSQ--LIWGQIGGLSFDV 262 (318) T ss_pred EEcHHHHHHHHHhhccCCceeecCccccCcc-ccccCceEEEEeeEEeCCCCCCccEEEEeecce--EEEEEecCeEEEE Confidence 9999999999999999999999986554332 22234579999999999999876 57899997 5578899999998 Q ss_pred ecccc------------hhhhcCceEEEEEEeecceeecccceEEEEeeCCCCCC Q lcl|NC_021309. 455 TNSNG------------TDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 455 ~~~~~------------~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~~a~~~ 497 (497) +++.. ++|++|++.+|+++|+|+.|++|+||++|+..+++.|+ T Consensus 263 ~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~i~~~~a~~~~ 317 (318) T protein:vir:24 263 TDQATLNLGTVESPNFVSLWQHNLVAVRVEAEYAFHCNDAEAFVALTNVVSGGGE 317 (318) T ss_pred eeccceeccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEEeeccCCCC Confidence 77643 56999999999999999999999999999999999988 No 69 >protein:vir:93881 Length: 387 # NCBI annotation: ORF011 # Family: family:all:658 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239938;genbank:gi:66395599;genbank:GeneID:5130947 Probab=100.00 E-value=3.3e-56 Score=324.77 Aligned_cols=382 Identities=14% Similarity=0.134 Sum_probs=241.5 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021309. 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) Q Consensus 1 ~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~a~~~~~~~~~~~~ 80 (497) |+...++++.+.++.++++.+.++..+ +........++++ ++.++++.++++++.++.++.+. T Consensus 1 Mk~l~el~~~~~e~~~~~~~~~~~~~~----~~~~~~~~~ee~~-------------~~~~~~~~l~~~~~~l~~~~~~~ 63 (387) T protein:vir:93 1 MPTLYELKQSLGMIGQQLKNKNDELSQ----KATDPNIDMEDIK-------------QLETEKAGLQQRFNIVERQVKDI 63 (387) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHH----HHhccCcCHHHHH-------------HHHHHHHHHHHHHHHHHHHHHHH Confidence 999999988887777766666553222 2211111111111 11122222333333333322222 Q ss_pred HHHHHHHHHHHHHhhhhhhHHHHhHhhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccccc Q lcl|NC_021309. 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFA 160 (497) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 160 (497) +.......... .. ........... ...+. ...+........ ................++++.+| T Consensus 64 e~~~~~~~~~~-~~--~~~~~~~~~~~---~~~~~-------~~~r~~~~~~~~---~~~~~~~~~~~~al~~~t~s~gG 127 (387) T protein:vir:93 64 EEKEKAKVKDT-GE--AYQSLNDHEKM---VKAKA-------EFYRHAILPNEF---EKPSMEAQRLLHALPTGNDSGGD 127 (387) T ss_pred HHHHHHhhhhc-cc--cCCCcchhhHH---HHHHH-------HHHHHHhhhhhh---hhhhhhhHHHHHhhccCcCCCCc Confidence 21110000000 00 00000000000 00000 000000000000 00111111122233334445556 Q ss_pred cccchhhhHHHHHHHHhhhhHHhhcceeecCCCceEEEEEcCCCccceecccccccccccccceeeEeeeeeEEeeehhh Q lcl|NC_021309. 161 PGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTIT 240 (497) Q Consensus 161 ~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~wv~Eg~~~~~s~~~f~~i~~~~~kla~~~~iS 240 (497) ++||+++..+||+.+++.++|+++|+++++++ ..+|+.......+.|++|++..++++++|++|++.+++++++++|| T Consensus 128 ~~IP~~~~~~Ii~~~~~~~~l~~~~~v~~~~~--~~~p~~~~~~~~a~~v~E~~~~~~~~~~f~~v~~~~~k~~~~~~iS 205 (387) T protein:vir:93 128 KLLPKTLSKEIVSEPFAKNQLREKARLTNIKG--LEIPRVSYTLDDDDFITDVETAKELKLKGDTVKFTTNKFKVFAAIS 205 (387) T ss_pred eeechhHHHHHHHHHHhhchhhhheeeeecCC--ceEEEEeecCCccccccCcccccccccccceeeeeheeeeeechhh Confidence 66777788999999999999999999998875 4577766545678999999999999999999999999999999999 Q ss_pred HHHHhhH-HHHHHHHHHHHHHHHHHHHHh-hhhcccCccccccccccccccccccccchhhhhhhHHHHHHhhhhhcchh Q lcl|NC_021309. 241 DEGLRDA-PELFNFVQGRLLEGIQRKEEV-QLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGA 318 (497) Q Consensus 241 ~ell~d~-~~l~~~i~~~la~~~~~~~d~-~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 318 (497) +|||+|+ ++|++||.++|+++++.+++. .|.+|+|+++|.|++...+...++. T Consensus 206 ~ell~Ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~p~g~l~~~~~~~v~~------------------------- 260 (387) T protein:vir:93 206 DTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKSGLDHMSFYNGSVKEVEG------------------------- 260 (387) T ss_pred HHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeeccccccccc------------------------- Confidence 9999997 689999999999999999765 5778899999999987654322111 Q ss_pred hhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHHHHH-HHhhhcCcee Q lcl|NC_021309. 319 FVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLR-LTKDANGQYM 397 (497) Q Consensus 319 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~-~lkd~~G~~i 397 (497) ...++.+.++++.+...|+.+ .+|+||+.+|..+. +++|++|+|+ T Consensus 261 ---------------------------------~~~~d~i~~~~~~l~~~~~~~-a~~~mn~~t~~~~~~~~~d~~~~~~ 306 (387) T protein:vir:93 261 ---------------------------------ADMYDAIINALADLHEDYRDN-ATIYMRYADYVKIISVLSNGTTNFF 306 (387) T ss_pred ---------------------------------cchHHHHHHHHhccChhhhcC-CEEEEechHHHHHHHHHhcCCCccc Confidence 011356677777777777654 47999999987765 5556555544 Q ss_pred ccCcccccccccccccccccccceEecCCCCcCceEEEeeccceEEEEeecccEEEeecccchhhhcCceEEEEEEeecc Q lcl|NC_021309. 398 GGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGL 477 (497) Q Consensus 398 ~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~ 477 (497) +.. +.+|+|+||++++.++ .++||||+++ |.+ +.++.+...++ +.+++++|++..|+|+ T Consensus 307 ~~~------------~~~llG~PV~~~~~~~--~~~~GDf~~~-~~~--~~~~~~~~~~~----~~~~~~~~~~~~r~d~ 365 (387) T protein:vir:93 307 DTP------------AEKVFGKPVVFTDAAV--KPIVGDFNYF-GIN--YDGTTYDTDKD----VKKGEYLFVLTAWYDQ 365 (387) T ss_pred ccC------------CccccccceEEecCCC--ceeeeehhhh-hee--hhhheeeeccc----ccCCceeEEEEeeeCc Confidence 321 2479999999999876 5799999984 433 34565555443 4578999999999999 Q ss_pred eeecccceEEEEeeCCCCCC Q lcl|NC_021309. 478 LVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 478 ~v~~~~a~~~l~~~~~a~~~ 497 (497) +|++|+||+.+++++++.++ T Consensus 366 ~v~~~eA~~~l~~k~~~~~~ 385 (387) T protein:vir:93 366 QRTLDSAFRIAKAKENTGSL 385 (387) T ss_pred eeechhheEEEEeecCCCCC Confidence 99999999999999888777 No 70 >protein:vir:9574 Length: 300 # NCBI annotation: gp40 # Family: family:all:966 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862879;genbank:gi:32469471;genbank:GeneID:1461316 Probab=100.00 E-value=1.9e-57 Score=331.55 Aligned_cols=279 Identities=15% Similarity=0.119 Sum_probs=224.7 Q ss_pred ccccccccccccchhhhHHHHHHHHhhhhHHhhcceeecCCCceEEEEEcCCCccceecccccccccccccceeeEeeee Q lcl|NC_021309. 152 PFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVG 231 (497) Q Consensus 152 ~~~~~~~~g~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~wv~Eg~~~~~s~~~f~~i~~~~~ 231 (497) ...+++++|++|||++..+||+.+++.++|+++|++++++++.+++|+.++ .+.++||+|++.+|+++++|+++++.+| T Consensus 1 ma~~t~~~G~lip~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~~p~~~~-~~~a~wv~Eg~~~~~s~~~f~~v~l~~~ 79 (300) T protein:vir:95 1 MSEAQLSKGNLFNPELVTKVINKVKGHSSIAKLSPQKPIPFNGQREFVFDF-DSDIDIVAENGKKTHGGVSLDPVTIVPL 79 (300) T ss_pred CcccccCCcceechhhHHHHHHHHHhhhhhhhhcceeeccCCceEEEEEec-CcceEEeeCCcccccccccceeeEeeeE Confidence 445556678899999999999999999999999999999999999999987 4789999999999999999999999999 Q ss_pred eEEeeehhhHHHHh---h-HHHHHHHHHHHHHHHHHHHHHhhhhcccCcc--c---cccccccccccccccccchhhhhh Q lcl|NC_021309. 232 KVANALTITDEGLR---D-APELFNFVQGRLLEGIQRKEEVQLLAGGGYP--G---VNGLLQRSTGFTASSASSLFGATS 302 (497) Q Consensus 232 kla~~~~iS~ell~---d-~~~l~~~i~~~la~~~~~~~d~~~l~G~G~~--~---p~Gi~~~~~~~~~~~~~~~~~~~~ 302 (497) |++++++||+|||+ | .++++++|.++|++++++++|.++++|++.+ . +.|.....+..+.... T Consensus 80 k~~~~~~iS~ell~~~~d~~~~l~~~i~~~l~~aia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~-------- 151 (300) T protein:vir:95 80 KVEYGARVSDEFLHASEEAKVDMLTDFVEGFSKKLARGLDIMSIHGINPRTKQASTIIGDNCFDKKVTQTVP-------- 151 (300) T ss_pred EEEEeehhhHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCCCCCcccccccccccccceeec-------- Confidence 99999999999994 3 4789999999999999999999999996533 2 2222222211111000 Q ss_pred hHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhH Q lcl|NC_021309. 303 ATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRD 382 (497) Q Consensus 303 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~ 382 (497) .+.....+.+..++..+...+ .++++|+||+.+ T Consensus 152 ----------------------------------------------~~~~~~~~~i~~~~~~~~~~~-~~~~~~vmn~~~ 184 (300) T protein:vir:95 152 ----------------------------------------------FKDTNPDESMEDAVGMIDGSE-RDITGAILDPIF 184 (300) T ss_pred ----------------------------------------------ccccchHHHHHHHHHHhhhcC-CCccEEEECHHH Confidence 001112344555555554444 456689999999 Q ss_pred HHHHHHHhhhcCceeccCcccccccccccccccccccceEecCCCCcCc------eEEEeeccceEEEEeecccEEEeec Q lcl|NC_021309. 383 WELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGT------ILVGHFAPSVIQTARREGVTMQMTN 456 (497) Q Consensus 383 ~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~------~~~gd~~~~~~~i~~r~~~~i~~~~ 456 (497) +..|+++||++|||||.+...+. ...+|||+||++++.+|.+. +++|||++ ++.++.|+++++++++ T Consensus 185 ~~~L~~lkd~~G~~i~~~~~~~~------~~~~l~G~Pv~~s~~v~~~~~~~~~~~~~GDf~~-~~~~~~~~~~~~~v~~ 257 (300) T protein:vir:95 185 TTALSKMKNAEGGKLYPELAWGG------VPDAINGLAVDKNRTVSYSQTDPKNTAIVGDFET-MFKWGYAKEVPMEIIK 257 (300) T ss_pred HHHHHHhhccCCCeeccCccccC------CCceecceeeEEecCCCCCCCCCccEEEEeeccc-eEEEEEecccEEEEee Confidence 99999999999999997654321 34689999999999998643 68899998 4667789999999987 Q ss_pred ccc------hhhhcCceEEEEEEeecceeecccceEEEEeeCC Q lcl|NC_021309. 457 SNG------TDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKG 493 (497) Q Consensus 457 ~~~------~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~~ 493 (497) +.. ++|++|++.||+++|+||.|++|+||++|+.++. T Consensus 258 ~~~~d~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~g 300 (300) T protein:vir:95 258 YGDPDNSGRDLKGYNQIYIRCEAYIGWGIMDAASFARIVKTGG 300 (300) T ss_pred ccCCCCcchhhhhcCcEEEEEEEeecceeecccceEEEecCCC Confidence 643 3699999999999999999999999999988887 No 71 >protein:vir:8187 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817980;genbank:gi:29566414;genbank:GeneID:2700968 Probab=100.00 E-value=2.6e-57 Score=330.78 Aligned_cols=281 Identities=15% Similarity=0.134 Sum_probs=224.7 Q ss_pred cccccccccccchhhhHHHHHHHHhhhhHHhhcceeecCCCceEEEEEcCCCccceecccccccccccccceeeEeeeee Q lcl|NC_021309. 153 FGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGK 232 (497) Q Consensus 153 ~~~~~~~g~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~wv~Eg~~~~~s~~~f~~i~~~~~k 232 (497) .-+.+++|.+||+++..+||+.+++.++|+++|++++++++.++||+.++ .+.++|++|++.+|+++++|+++++.++| T Consensus 1 mat~~~gg~lvP~~~~~~ii~~~~~~s~i~~~~~~i~~~~~~~~~p~~~~-~~~a~wv~Eg~~~~~~~~~f~~v~l~~~k 79 (311) T protein:vir:81 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTA-PPRGEVVGEGAQKSESTATFAPVTAIPRK 79 (311) T ss_pred CceecCCceEcchhHHHHHHHHHHhcchhhhhcceeecCCCceEEEEEeC-CceeEEeecCcccccccceeeEEEEeeEE Confidence 23444567788888999999999999999999999999999999999987 46899999999999999999999999999 Q ss_pred EEeeehhhHHHHh---h-HHHHHHHHHHHHHHHHHHHHHhhhhcccCcc---ccccccccccccccccccchhhhhhhHH Q lcl|NC_021309. 233 VANALTITDEGLR---D-APELFNFVQGRLLEGIQRKEEVQLLAGGGYP---GVNGLLQRSTGFTASSASSLFGATSATV 305 (497) Q Consensus 233 la~~~~iS~ell~---d-~~~l~~~i~~~la~~~~~~~d~~~l~G~G~~---~p~Gi~~~~~~~~~~~~~~~~~~~~~~~ 305 (497) ++++++||+|||+ | ..+|+++|.+++++++++++|.++++|++.+ .+.|+.+.....+........ T Consensus 80 l~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~a~l~G~~~~~~~~~~gi~~~~~~~~~~~~~~~~------- 152 (311) T protein:vir:81 80 VQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTG------- 152 (311) T ss_pred EEEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHhhhccccCCCCcccccccccccccceeeeeccc------- Confidence 9999999999995 3 3579999999999999999999999997543 366776654322111111000 Q ss_pred HHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHHH Q lcl|NC_021309. 306 SNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWEL 385 (497) Q Consensus 306 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~ 385 (497) .... .+..+..+......++.++++|+||+.+|.. T Consensus 153 --------------------------------------------~~~~-~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~ 187 (311) T protein:vir:81 153 --------------------------------------------TSAT-PDLAVEAAVGLVLGDNLSPDGVALDNTFSFM 187 (311) T ss_pred --------------------------------------------ccch-HHHHHHHHHHHhhhcCCCceEEEEcHHHHHH Confidence 0000 1111222222334455677789999999999 Q ss_pred HHHHhhhcCceeccCcccccccccccccccccccceEecCCCCcC------------------ceEEEeeccceEEEEee Q lcl|NC_021309. 386 LRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLG------------------TILVGHFAPSVIQTARR 447 (497) Q Consensus 386 l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~------------------~~~~gd~~~~~~~i~~r 447 (497) |+++||++|+|+|++..... ...+|+|+||++++.||.+ .+++|||++ |.+..+ T Consensus 188 l~~lkd~~G~~l~~~~~~~~------~~~tl~G~Pv~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~gDfs~--~~i~~~ 259 (311) T protein:vir:81 188 LATQRDSQGRKLYPELGFGT------DVASFAGLNAAVSDTVRGGPEAVTASTGVYRTTNPNVKAIAGDFSA--FRWGVQ 259 (311) T ss_pred HHhhhccCCCeeecCccccC------CCceecceeEEecccccccccccccccchhcccCCccEEEEEeccc--EEEEEe Confidence 99999999999998754332 3468999999999999853 268999998 567778 Q ss_pred cccEEEeecccc-----hhhhcCceEEEEEEeecceeecccceEEEEeeCCC Q lcl|NC_021309. 448 EGVTMQMTNSNG-----TDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGA 494 (497) Q Consensus 448 ~~~~i~~~~~~~-----~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~~a 494 (497) .+++++++++.. ++|++|+|.||++.|+|+.|++|+||++|+....| T Consensus 260 ~~~~~~~~~~~~~~~~~~~~~~~~v~~r~~~r~d~~v~~~~a~~~l~~a~~~ 311 (311) T protein:vir:81 260 VSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) T ss_pred ccceEEEeccCCCCcchhhhhcCcEEEEEEEEeccEeecccceEEEEeeccC Confidence 999999987642 46999999999999999999999999999999888 No 72 >protein:vir:80684 Length: 315 # NCBI annotation: gp6 # Family: family:all:966 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285582;genbank:gi:148727088;genbank:GeneID:5247055 Probab=100.00 E-value=2.6e-57 Score=330.85 Aligned_cols=287 Identities=16% Similarity=0.109 Sum_probs=225.8 Q ss_pred hccccccccccccchhhhHHHHHHHHhhhhHHhhcceeecCCCceEEEEEcCCCccceecccccccccccccceeeEeee Q lcl|NC_021309. 151 NPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQV 230 (497) Q Consensus 151 ~~~~~~~~~g~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~wv~Eg~~~~~s~~~f~~i~~~~ 230 (497) |..+++.++|++||+++..+||+.+++.++|+++|++++++++.++||++++ .+.|+|++|++.+|+++++|+++++.+ T Consensus 1 Ma~~~~~~gg~~vP~~~~~~ii~~l~~~s~i~~l~~~i~~~~~~~~ip~~~~-~~~a~wv~Eg~~~~~s~~~f~~v~l~~ 79 (315) T protein:vir:80 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSG-VPRAKIVGEGEVKPSASVDVSAFTAQP 79 (315) T ss_pred CCCCcCCcCceEcchHHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeC-CcceEEeeCCccccccccceeeeEeee Confidence 6667778888889999999999999999999999999999999999999987 468999999999999999999999999 Q ss_pred eeEEeeehhhHHHHhhH-HH----HHHHHHHHHHHHHHHHHHhhhhcccCccc---cccccccccccccccccchhhhhh Q lcl|NC_021309. 231 GKVANALTITDEGLRDA-PE----LFNFVQGRLLEGIQRKEEVQLLAGGGYPG---VNGLLQRSTGFTASSASSLFGATS 302 (497) Q Consensus 231 ~kla~~~~iS~ell~d~-~~----l~~~i~~~la~~~~~~~d~~~l~G~G~~~---p~Gi~~~~~~~~~~~~~~~~~~~~ 302 (497) ||++++++||+||++++ .+ |+++|.++|++++++++|.++++|+|.+. +.|+.+.....+... T Consensus 80 ~kl~~~~~iS~ell~~s~~~~~~~l~~~i~~~la~ai~~~~d~a~~~G~~~~~~~~~~~~~~~~~~~~~~~--------- 150 (315) T protein:vir:80 80 IKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKAASAVHTSLNKTKNIV--------- 150 (315) T ss_pred eeEEeeehhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHhhheeeccCCCCCcccccccccccccccee--------- Confidence 99999999999999765 22 88999999999999999999999987432 333332211110000 Q ss_pred hHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhH Q lcl|NC_021309. 303 ATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRD 382 (497) Q Consensus 303 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~ 382 (497) ........++..++..+....+...++|+||+.+ T Consensus 151 ----------------------------------------------~~~~~~~~d~~~~~~~~~~~~~~~~~~~imn~~~ 184 (315) T protein:vir:80 151 ----------------------------------------------DATDSATADLVKAVGLIAGAGLQVPNGVALDPAF 184 (315) T ss_pred ----------------------------------------------eccccchHHHHHHHHHHhhccCccceEEEEcHHH Confidence 0001112334444444444444455689999999 Q ss_pred HHHHHHHhhhcCceeccCcccccccccccccccccccceEecCCCCcC---------ceEEEeeccceEEEEeecccEEE Q lcl|NC_021309. 383 WELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLG---------TILVGHFAPSVIQTARREGVTMQ 453 (497) Q Consensus 383 ~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~---------~~~~gd~~~~~~~i~~r~~~~i~ 453 (497) +..|+++||.+|++++..+... .....++.+|+|+||+++++||.+ .++||||++ +.++.+.+++++ T Consensus 185 ~~~L~~l~~~~g~~~~g~~~~~--~~~~g~~~tl~G~PV~~~~~~~~~~~~~~~~~~~~~~GDfs~--~~~g~~~~~~i~ 260 (315) T protein:vir:80 185 SFALSTEVYPKGSPLAGQPMYP--AAGFAGLDNWRGLNVGASSTVSGAPEMSPASGVKAIVGDFSR--VHWGFQRNFPIE 260 (315) T ss_pred HHHHHHHhhccCCccccccccc--ccccCCCceecceeeEecCcCCcccccccccccEEEEeeccc--EEEEEecCeeEE Confidence 9999999988877655432211 111223468999999999999864 268899998 556778899999 Q ss_pred eecccc------hhhhcCceEEEEEEeecceeecccceEEEEeeCCCCCC Q lcl|NC_021309. 454 MTNSNG------TDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 454 ~~~~~~------~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~~a~~~ 497 (497) ++++.. ++|++|++.||+++|+||+|++|+||++|+.+++++.+ T Consensus 261 i~~~~~~~~~~~~~~~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~a~~~~ 310 (315) T protein:vir:80 261 LIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKPN 310 (315) T ss_pred EeccccccCcccchhhcCcEEEEEEEEecceeecccceEEEeeccCCCCC Confidence 877632 46999999999999999999999999999999998888 No 73 >protein:vir:105905 Length: 304 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004375;genbank:gi:122891830;genbank:GeneID:4712376 Probab=100.00 E-value=4.5e-57 Score=329.51 Aligned_cols=285 Identities=15% Similarity=0.154 Sum_probs=233.5 Q ss_pred hhhhhhhhhccccccccccccchhhhHHHHHHHHhhhhHHhhcceeecCCCceEEEEEcCCCccceeccccccccccccc Q lcl|NC_021309. 143 TAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEE 222 (497) Q Consensus 143 ~~~~~~~~~~~~~~~~~g~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~wv~Eg~~~~~s~~~ 222 (497) ...........++++.+|++||+++..+|++.+++.++|+++|++++++++.++||+.++ .+.+.|++|++.+|+++++ T Consensus 1 ma~~~~~~~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ip~~~~-~~~a~~v~E~~~~~~~~~~ 79 (304) T protein:vir:10 1 MATPTYTPGNVILSDFKNGVIPAEQGTLIMKDIMANSAIMKLAKNEPMTAQKKKFTYLAK-GVGAYWVSETERIQTSKPE 79 (304) T ss_pred CcccccccccccccCCCceecchhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEeC-CcceEEeecCcccccccce Confidence 222222334445566677788888889999999999999999999999999999999986 4689999999999999999 Q ss_pred ceeeEeeeeeEEeeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccchhhhh Q lcl|NC_021309. 223 FARVYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGAT 301 (497) Q Consensus 223 f~~i~~~~~kla~~~~iS~ell~d~-~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~ 301 (497) |++|+++++|++++++||+|+|+|+ .+|++||.++|++++++++|.++++|+|+++|.|+.............. T Consensus 80 ~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~----- 154 (304) T protein:vir:10 80 YAQAEMEAKKIGVIIPLSKEFLKWTAKDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTSTSGKPLVEGAEEKGN----- 154 (304) T ss_pred eeEEEEEEEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhheeccCCCccccccccccccccccccc----- Confidence 9999999999999999999999987 5899999999999999999999999999998887655432211111000 Q ss_pred hhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechh Q lcl|NC_021309. 302 SATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPR 381 (497) Q Consensus 302 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~ 381 (497) ........++++..++..+...++. +.+|+||+. T Consensus 155 ---------------------------------------------~~~~~~~~~~~i~~~~~~l~~~~~~-~~~~v~~~~ 188 (304) T protein:vir:10 155 ---------------------------------------------VVTDTNNLYVDLSALMATIEDEELD-PNGVLTTRS 188 (304) T ss_pred ---------------------------------------------ccccccchHHHHHHHHHHhhhccCC-cCEEEEcHH Confidence 0001122345666777777666544 557999999 Q ss_pred HHHHHHHHhhhcCceeccCcccccccccccccccccccceEecCCCCcC----ceEEEeeccceEEEEeecccEEEeecc Q lcl|NC_021309. 382 DWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLG----TILVGHFAPSVIQTARREGVTMQMTNS 457 (497) Q Consensus 382 ~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~----~~~~gd~~~~~~~i~~r~~~~i~~~~~ 457 (497) +|..|+++||++|||+|++. ..+|+|+||++++.+|.+ .+++|||++ +.+++|++++++++++ T Consensus 189 ~~~~L~~lkd~~G~~l~~~~-----------~~~l~G~PV~~~~~~~~~~~~~~~~~gd~~~--~~~~~~~~~~i~~~~e 255 (304) T protein:vir:10 189 FRSKMRNALDANDRPLFDAN-----------GNEIMGLPLSYTGADVYDKKKSLALMGDWDY--ARYGILQGIEYAISED 255 (304) T ss_pred HHHHHHHhhccCCcEeecCC-----------CccccceeeEEecccccCCCCcEEEEEehhh--EEEEEecceEEEEeec Confidence 99999999999999999763 247999999999999854 489999997 5688999999998876 Q ss_pred cc--------------hhhhcCceEEEEEEeecceeecccceEEEEeeC Q lcl|NC_021309. 458 NG--------------TDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKK 492 (497) Q Consensus 458 ~~--------------~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~ 492 (497) .. ++|++||+.||+++|+|+.|++|+||++|+... T Consensus 256 ~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~~a~~~l~~a~ 304 (304) T protein:vir:10 256 ATLTTLQASDASGQPVSLFERDMFALRATMHIAYMNVKPEAFATLKPTE 304 (304) T ss_pred ceeeeecccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 43 469999999999999999999999999999988 No 74 >protein:vir:94142 Length: 304 # NCBI annotation: ORF013 # Family: family:all:507 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240234;genbank:gi:66395898;genbank:GeneID:5133311 Probab=100.00 E-value=4.5e-57 Score=329.51 Aligned_cols=285 Identities=15% Similarity=0.154 Sum_probs=233.5 Q ss_pred hhhhhhhhhccccccccccccchhhhHHHHHHHHhhhhHHhhcceeecCCCceEEEEEcCCCccceeccccccccccccc Q lcl|NC_021309. 143 TAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEE 222 (497) Q Consensus 143 ~~~~~~~~~~~~~~~~~g~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~wv~Eg~~~~~s~~~ 222 (497) ...........++++.+|++||+++..+|++.+++.++|+++|++++++++.++||+.++ .+.+.|++|++.+|+++++ T Consensus 1 ma~~~~~~~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ip~~~~-~~~a~~v~E~~~~~~~~~~ 79 (304) T protein:vir:94 1 MATPTYTPGNVILSDFKNGVIPAEQGTLIMKDIMANSAIMKLAKNEPMTAQKKKFTYLAK-GVGAYWVSETERIQTSKPE 79 (304) T ss_pred CcccccccccccccCCCceecchhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEeC-CcceEEeecCcccccccce Confidence 222222334445566677788888889999999999999999999999999999999986 4689999999999999999 Q ss_pred ceeeEeeeeeEEeeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccchhhhh Q lcl|NC_021309. 223 FARVYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGAT 301 (497) Q Consensus 223 f~~i~~~~~kla~~~~iS~ell~d~-~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~ 301 (497) |++|+++++|++++++||+|+|+|+ .+|++||.++|++++++++|.++++|+|+++|.|+.............. T Consensus 80 ~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~----- 154 (304) T protein:vir:94 80 YAQAEMEAKKIGVIIPLSKEFLKWTAKDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTSTSGKPLVEGAEEKGN----- 154 (304) T ss_pred eeEEEEEEEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhheeccCCCccccccccccccccccccc----- Confidence 9999999999999999999999987 5899999999999999999999999999998887655432211111000 Q ss_pred hhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechh Q lcl|NC_021309. 302 SATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPR 381 (497) Q Consensus 302 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~ 381 (497) ........++++..++..+...++. +.+|+||+. T Consensus 155 ---------------------------------------------~~~~~~~~~~~i~~~~~~l~~~~~~-~~~~v~~~~ 188 (304) T protein:vir:94 155 ---------------------------------------------VVTDTNNLYVDLSALMATIEDEELD-PNGVLTTRS 188 (304) T ss_pred ---------------------------------------------ccccccchHHHHHHHHHHhhhccCC-cCEEEEcHH Confidence 0001122345666777777666544 557999999 Q ss_pred HHHHHHHHhhhcCceeccCcccccccccccccccccccceEecCCCCcC----ceEEEeeccceEEEEeecccEEEeecc Q lcl|NC_021309. 382 DWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLG----TILVGHFAPSVIQTARREGVTMQMTNS 457 (497) Q Consensus 382 ~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~----~~~~gd~~~~~~~i~~r~~~~i~~~~~ 457 (497) +|..|+++||++|||+|++. ..+|+|+||++++.+|.+ .+++|||++ +.+++|++++++++++ T Consensus 189 ~~~~L~~lkd~~G~~l~~~~-----------~~~l~G~PV~~~~~~~~~~~~~~~~~gd~~~--~~~~~~~~~~i~~~~e 255 (304) T protein:vir:94 189 FRSKMRNALDANDRPLFDAN-----------GNEIMGLPLSYTGADVYDKKKSLALMGDWDY--ARYGILQGIEYAISED 255 (304) T ss_pred HHHHHHHhhccCCcEeecCC-----------CccccceeeEEecccccCCCCcEEEEEehhh--EEEEEecceEEEEeec Confidence 99999999999999999763 247999999999999854 489999997 5688999999998876 Q ss_pred cc--------------hhhhcCceEEEEEEeecceeecccceEEEEeeC Q lcl|NC_021309. 458 NG--------------TDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKK 492 (497) Q Consensus 458 ~~--------------~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~ 492 (497) .. ++|++||+.||+++|+|+.|++|+||++|+... T Consensus 256 ~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~~a~~~l~~a~ 304 (304) T protein:vir:94 256 ATLTTLQASDASGQPVSLFERDMFALRATMHIAYMNVKPEAFATLKPTE 304 (304) T ss_pred ceeeeecccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 43 469999999999999999999999999999988 No 75 >protein:vir:2344 Length: 397 # NCBI annotation: gp14 # Family: family:all:507 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075281;genbank:gi:12657868;genbank:GeneID:920118 Probab=100.00 E-value=6e-57 Score=328.82 Aligned_cols=295 Identities=15% Similarity=0.127 Sum_probs=233.1 Q ss_pred hhhhhhhhhhccccccccccccchhhhHHHHHHHHhhhhHHhhcceeecCCCceEEEEEcCCCccceecccccccccccc Q lcl|NC_021309. 142 ETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSE 221 (497) Q Consensus 142 ~~~~~~~~~~~~~~~~~~g~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~wv~Eg~~~~~s~~ 221 (497) +........+...+++.+|++|||++..+||+.+++.++|++++++++++++.++||++++ .+.+.|++|++.+|++++ T Consensus 1 ~g~~~e~~~~~~~~t~~~~g~l~~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~ip~~~~-~~~a~wv~Eg~~~~~s~~ 79 (397) T protein:vir:23 1 MGFSADHSQIAQTKDTMFTGYLDPVQAKDYFAEAEKTSIVQRVAQKIPMGATGIVIPHWTG-DVSAQWIGEGDMKPITKG 79 (397) T ss_pred CCcCHHHHHHhhccCCCCccccchhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEcC-CcceEEecCCcccccccc Confidence 1122233344455667778899999999999999999999999999999999999999987 567999999999999999 Q ss_pred cceeeEeeeeeEEeeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccchhhh Q lcl|NC_021309. 222 EFARVYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGA 300 (497) Q Consensus 222 ~f~~i~~~~~kla~~~~iS~ell~d~-~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~ 300 (497) +|++|++.+||++++++||+|||+|+ ++++++|+++|++++++++|.++|+|+|++++.+.+......+...... T Consensus 80 ~f~~v~l~~~k~~~~v~iS~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~G~gt~~~~~~~~~~~~~~~~~~~~---- 155 (397) T protein:vir:23 80 NMTKRDVHPAKIATIFVASAETVRANPANYLGTMRTKVATAIAMAFDNAALHGTNAPSAFQGYLDQSNKTQSISPN---- 155 (397) T ss_pred ceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcccCCcccccccccccceeeeccc---- Confidence 99999999999999999999999987 6899999999999999999999999999876444333222111111100 Q ss_pred hhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEech Q lcl|NC_021309. 301 TSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNP 380 (497) Q Consensus 301 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~ 380 (497) ...+.+..+...+...++ ..++|+||+ T Consensus 156 ----------------------------------------------------~~~~~~~~~~~~l~~~~~-~~a~~vmn~ 182 (397) T protein:vir:23 156 ----------------------------------------------------AYQGLGVSGLTKLVTDGK-KWTHTLLDD 182 (397) T ss_pred ----------------------------------------------------chhHHHHHHHHhhhhccc-CCCEEEEcH Confidence 011223333344444444 456899999 Q ss_pred hHHHHHHHHhhhcCceeccCcccccccccccccccccccceEecCCCCcCce--EEEeeccceEEEEeecccEEEeeccc Q lcl|NC_021309. 381 RDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTI--LVGHFAPSVIQTARREGVTMQMTNSN 458 (497) Q Consensus 381 ~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~~--~~gd~~~~~~~i~~r~~~~i~~~~~~ 458 (497) .++..|+++||++|||||++......... ....+|+|+||++++++|+++. ++|||++ +.++++++++++++++. T Consensus 183 ~~~~~L~~lkd~~G~~i~~~~~~~~~~~~-~~~~tl~G~Pv~~s~~~~~g~~~~~~gDfs~--~~i~~~~~i~i~~~~e~ 259 (397) T protein:vir:23 183 TVEPVLNGSVDANGRPLFVESTYESLTTP-FREGRILGRPTILSDHVAEGDVVGYAGDFSQ--IIWGQVGGLSFDVTDQA 259 (397) T ss_pred HHHHHHHHhhccCCceeeccccccccccc-ccCceeeeeeEEEeCCCCCCceEEEEeecce--EEEEEEeceEEEEeeee Confidence 99999999999999999998655433221 2345799999999999998874 7899997 45788999999887653 Q ss_pred ------------chhhhcCceEEEEEEeecceeecccceEEEEeeCCCCCC Q lcl|NC_021309. 459 ------------GTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 459 ------------~~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~~a~~~ 497 (497) .++|++||+.||+++|+|++|++|+||++++........ T Consensus 260 ~~~~~~~~~~~~~~lf~~d~v~~ra~~r~d~~v~~~~a~~~~~~~~~~~~~ 310 (397) T protein:vir:23 260 TLNLGSQESPNFVSLWQHNLVAVRVEAEYGLLINDVNAFVKLTFDPVLTTY 310 (397) T ss_pred eeeeccccccceeeeeeccceeEEEEeeeccceecccceEEEeecccccee Confidence 256999999999999999999999999999986554333 No 76 >protein:vir:97148 Length: 324 # NCBI annotation: ORF010 # Family: family:all:507 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239726;genbank:gi:66394880;genbank:GeneID:5130881 Probab=100.00 E-value=2.6e-56 Score=325.33 Aligned_cols=303 Identities=15% Similarity=0.148 Sum_probs=233.4 Q ss_pred hhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccccccccchhhhHHHHHHHHhhhhHHhhcce Q lcl|NC_021309. 108 FEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISS 187 (497) Q Consensus 108 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~p~~~~~ii~~~~~~~~l~~~~~~ 187 (497) +++. ++. ....+.+.................++.+|++||+++..+|++.+++.++|+++|++ T Consensus 1 ~~~~------~~~-----------~~~~~~f~~~~~~~~~~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~~ 63 (324) T protein:vir:97 1 MEQT------QKL-----------KLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKY 63 (324) T ss_pred Cccc------hhH-----------HHHHHHHHHhhhhhhhhccccccccCCCcceechhHHHHHHHHHHhhcchhhhcce Confidence 0000 000 00001111111111111222233445567778888899999999999999999999 Q ss_pred eecCCCceEEEEEcCCCccceecccccccccccccceeeEeeeeeEEeeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHH Q lcl|NC_021309. 188 RPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKE 266 (497) Q Consensus 188 ~~~~~~~~~~p~~~~~~~~a~wv~Eg~~~~~s~~~f~~i~~~~~kla~~~~iS~ell~d~-~~l~~~i~~~la~~~~~~~ 266 (497) +++++++++||+.++ .+.+.|++|++.+|+++++|+++++.++|++++++||+|+|+|+ ++++++|.++|++++++++ T Consensus 64 ~~~~~~~~~ip~~~~-~~~a~~v~Eg~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~l~~aia~~~ 142 (324) T protein:vir:97 64 EPMEGTEKKFTFWAD-KPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKF 142 (324) T ss_pred eeccCCceEEEEEec-CcceeEeccCccccccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHH Confidence 999999999999986 56899999999999999999999999999999999999999987 6899999999999999999 Q ss_pred HhhhhcccCcc-ccccccccccccccccccchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccc Q lcl|NC_021309. 267 EVQLLAGGGYP-GVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVA 345 (497) Q Consensus 267 d~~~l~G~G~~-~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 345 (497) |.++|+|+|++ .|.|+++.....+.... T Consensus 143 d~a~l~G~g~~~~~~gi~~~~~~~~~~~~--------------------------------------------------- 171 (324) T protein:vir:97 143 DEAGILNQGNNPFGKSIAQSIEKTNKVIK--------------------------------------------------- 171 (324) T ss_pred HHHhhccCCCCccCccccccccccceecc--------------------------------------------------- Confidence 99999999976 57777765432221110 Q ss_pred cccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHHHHHHHhhhcCceeccCcccccccccccccccccccceEecC Q lcl|NC_021309. 346 GSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTP 425 (497) Q Consensus 346 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~ 425 (497) ....++++..++..+...++ .+.+|+||+.+|..|+++||++|+|+|.+. ...+|+|+||++++ T Consensus 172 -----~~~~~~~i~~~~~~l~~~~~-~~~~~v~n~~~~~~L~~lkd~~g~~~~~~~----------~~~tl~G~PV~~~~ 235 (324) T protein:vir:97 172 -----GDFTQDNIIDLEALLEDDEL-EANAFISKTQNRSLLRKIVDPETKERIYDR----------NSDTLDGLPVVNLK 235 (324) T ss_pred -----ccCCHHHHHHHHHhhhhccC-CCCEEEEcHHHHHHHHHhhcCCCceeecCC----------CCccccceeeEeec Confidence 01123455566666666554 456899999999999999999999998643 23579999999988 Q ss_pred CCC--cCceEEEeeccceEEEEeecccEEEeecccc------------hhhhcCceEEEEEEeecceeecccceEEEEee Q lcl|NC_021309. 426 LIP--LGTILVGHFAPSVIQTARREGVTMQMTNSNG------------TDFVDGKVTVRAEERLGLLVYRPSAFQLIQLK 491 (497) Q Consensus 426 ~~~--~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~------------~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~~ 491 (497) ..+ .+.+++|||++ +.++++.+++|+++++.. ++|++|++.||+++|+|+.|++|+||++|+.. T Consensus 236 ~~~~~~~~~~~gd~~~--~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~a~~~l~~~ 313 (324) T protein:vir:97 236 SSNLKRGELITGDFDK--LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPA 313 (324) T ss_pred CCCCCcceEEEEeccc--EEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEec Confidence 755 55689999997 557889999999987642 56999999999999999999999999999998 Q ss_pred CCCCCC Q lcl|NC_021309. 492 KGATGS 497 (497) Q Consensus 492 ~~a~~~ 497 (497) .+.+-. T Consensus 314 ~~~~~~ 319 (324) T protein:vir:97 314 DKKTDS 319 (324) T ss_pred cCCCCC Confidence 776644 No 77 >protein:vir:96392 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239648;genbank:gi:66395381;genbank:GeneID:5132868 Probab=100.00 E-value=5.2e-56 Score=323.65 Aligned_cols=303 Identities=15% Similarity=0.128 Sum_probs=231.9 Q ss_pred hHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccccccccchhhhHHHHHHHHhhhhHHhhcceeecCCCceE Q lcl|NC_021309. 117 SFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLS 196 (497) Q Consensus 117 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ 196 (497) +.+... .....+.+.................++.+|++||+++..+||+.+++.++|++++++++++++.++ T Consensus 1 ~~~~~~--------~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~ 72 (324) T protein:vir:96 1 MEQTQK--------LKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKK 72 (324) T ss_pred CCcchh--------hhHHHHHHHHHhhhhhhhccccccccCcCccccchhHHHHHHHHHHhhchhhhhcceeeccCCceE Confidence 000000 000011122111111111222334455667788888999999999999999999999999999999 Q ss_pred EEEEcCCCccceecccccccccccccceeeEeeeeeEEeeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhcccC Q lcl|NC_021309. 197 YLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGG 275 (497) Q Consensus 197 ~p~~~~~~~~a~wv~Eg~~~~~s~~~f~~i~~~~~kla~~~~iS~ell~d~-~~l~~~i~~~la~~~~~~~d~~~l~G~G 275 (497) ||+.++ .+.++|++|++.+|+++++|+++++.++|++++++||+|+|+|+ +++++||.++|++++++++|.++|+|+| T Consensus 73 ~p~~~~-~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~G~g 151 (324) T protein:vir:96 73 FTFWAD-KPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQG 151 (324) T ss_pred EEEEec-CcceeEecCCccccccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhccCC Confidence 999987 46899999999999999999999999999999999999999987 6899999999999999999999999999 Q ss_pred cc-ccccccccccccccccccchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhh Q lcl|NC_021309. 276 YP-GVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEI 354 (497) Q Consensus 276 ~~-~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 354 (497) ++ .|.|+.+..+........ ... T Consensus 152 ~~~~~~gi~~~~~~~~~~~~~--------------------------------------------------------~~t 175 (324) T protein:vir:96 152 NNPFGKSIAQSIEKTNKVIKG--------------------------------------------------------DFT 175 (324) T ss_pred CCCcCccccccccccceeccc--------------------------------------------------------ccc Confidence 76 477776654332211110 112 Q ss_pred hhHHHHHHHhhhhhhccCCceEEechhHHHHHHHHhhhcCceeccCcccccccccccccccccccceEecCCCC--cCce Q lcl|NC_021309. 355 AENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIP--LGTI 432 (497) Q Consensus 355 ~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~--~~~~ 432 (497) ++++..++..+...+ ..+++|+||+.+|..|+++||++|+|++.+. ...+|+|+||++++.++ .+.+ T Consensus 176 ~~~i~~~~~~l~~~~-~~~~~~vmn~~~~~~L~~l~d~~G~~~~~~~----------~~~~l~G~PV~~~~~~~~~~~~~ 244 (324) T protein:vir:96 176 QDNIIDLEALLEDDE-LEANAFISKTQNRSLLRKIVDPETKERIYDR----------NSDSLDGLPVVNLKSSNLKRGEL 244 (324) T ss_pred HHHHHHHHHhhhhcc-CCCCEEEEcHHHHHHHHHhhccCCCeeecCC----------CCCcccceeeEeeCCCCCCcceE Confidence 345556666665554 4566899999999999999999999998642 23579999999987755 5568 Q ss_pred EEEeeccceEEEEeecccEEEeecccc------------hhhhcCceEEEEEEeecceeecccceEEEEeeCCCC-CC Q lcl|NC_021309. 433 LVGHFAPSVIQTARREGVTMQMTNSNG------------TDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGAT-GS 497 (497) Q Consensus 433 ~~gd~~~~~~~i~~r~~~~i~~~~~~~------------~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~~a~-~~ 497 (497) ++|||++ +.++++++++++++++.. ++|++|++.||+++|+||.|.+|+||++|+.....+ .+ T Consensus 245 ~~gd~~~--~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~A~~~l~~a~~~~~~~ 320 (324) T protein:vir:96 245 ITGDFDK--LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDSV 320 (324) T ss_pred EEEecce--EEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEecccccCCCC Confidence 9999997 558889999999987643 569999999999999999999999999998754433 33 No 78 >protein:vir:78830 Length: 324 # NCBI annotation: major head protein # Family: family:all:507 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285361;genbank:gi:148717889;genbank:GeneID:5246961 Probab=100.00 E-value=5.2e-56 Score=323.65 Aligned_cols=303 Identities=15% Similarity=0.128 Sum_probs=231.9 Q ss_pred hHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccccccccchhhhHHHHHHHHhhhhHHhhcceeecCCCceE Q lcl|NC_021309. 117 SFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLS 196 (497) Q Consensus 117 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ 196 (497) +.+... .....+.+.................++.+|++||+++..+||+.+++.++|++++++++++++.++ T Consensus 1 ~~~~~~--------~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~ 72 (324) T protein:vir:78 1 MEQTQK--------LKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKK 72 (324) T ss_pred CCcchh--------hhHHHHHHHHHhhhhhhhccccccccCcCccccchhHHHHHHHHHHhhchhhhhcceeeccCCceE Confidence 000000 000011122111111111222334455667788888999999999999999999999999999999 Q ss_pred EEEEcCCCccceecccccccccccccceeeEeeeeeEEeeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhcccC Q lcl|NC_021309. 197 YLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGG 275 (497) Q Consensus 197 ~p~~~~~~~~a~wv~Eg~~~~~s~~~f~~i~~~~~kla~~~~iS~ell~d~-~~l~~~i~~~la~~~~~~~d~~~l~G~G 275 (497) ||+.++ .+.++|++|++.+|+++++|+++++.++|++++++||+|+|+|+ +++++||.++|++++++++|.++|+|+| T Consensus 73 ~p~~~~-~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~G~g 151 (324) T protein:vir:78 73 FTFWAD-KPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQG 151 (324) T ss_pred EEEEec-CcceeEecCCccccccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhccCC Confidence 999987 46899999999999999999999999999999999999999987 6899999999999999999999999999 Q ss_pred cc-ccccccccccccccccccchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhh Q lcl|NC_021309. 276 YP-GVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEI 354 (497) Q Consensus 276 ~~-~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 354 (497) ++ .|.|+.+..+........ ... T Consensus 152 ~~~~~~gi~~~~~~~~~~~~~--------------------------------------------------------~~t 175 (324) T protein:vir:78 152 NNPFGKSIAQSIEKTNKVIKG--------------------------------------------------------DFT 175 (324) T ss_pred CCCcCccccccccccceeccc--------------------------------------------------------ccc Confidence 76 477776654332211110 112 Q ss_pred hhHHHHHHHhhhhhhccCCceEEechhHHHHHHHHhhhcCceeccCcccccccccccccccccccceEecCCCC--cCce Q lcl|NC_021309. 355 AENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIP--LGTI 432 (497) Q Consensus 355 ~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~--~~~~ 432 (497) ++++..++..+...+ ..+++|+||+.+|..|+++||++|+|++.+. ...+|+|+||++++.++ .+.+ T Consensus 176 ~~~i~~~~~~l~~~~-~~~~~~vmn~~~~~~L~~l~d~~G~~~~~~~----------~~~~l~G~PV~~~~~~~~~~~~~ 244 (324) T protein:vir:78 176 QDNIIDLEALLEDDE-LEANAFISKTQNRSLLRKIVDPETKERIYDR----------NSDSLDGLPVVNLKSSNLKRGEL 244 (324) T ss_pred HHHHHHHHHhhhhcc-CCCCEEEEcHHHHHHHHHhhccCCCeeecCC----------CCCcccceeeEeeCCCCCCcceE Confidence 345556666665554 4566899999999999999999999998642 23579999999987755 5568 Q ss_pred EEEeeccceEEEEeecccEEEeecccc------------hhhhcCceEEEEEEeecceeecccceEEEEeeCCCC-CC Q lcl|NC_021309. 433 LVGHFAPSVIQTARREGVTMQMTNSNG------------TDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGAT-GS 497 (497) Q Consensus 433 ~~gd~~~~~~~i~~r~~~~i~~~~~~~------------~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~~a~-~~ 497 (497) ++|||++ +.++++++++++++++.. ++|++|++.||+++|+||.|.+|+||++|+.....+ .+ T Consensus 245 ~~gd~~~--~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~A~~~l~~a~~~~~~~ 320 (324) T protein:vir:78 245 ITGDFDK--LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDSV 320 (324) T ss_pred EEEecce--EEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEecccccCCCC Confidence 9999997 558889999999987643 569999999999999999999999999998754433 33 No 79 >protein:vir:9759 Length: 303 # NCBI annotation: putative structural protein # Family: family:all:966 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795521;genbank:gi:28876283;genbank:GeneID:1257824 Probab=100.00 E-value=2.4e-56 Score=325.47 Aligned_cols=285 Identities=14% Similarity=0.098 Sum_probs=224.8 Q ss_pred cccccccccccchhhhHHHHHHHHhhhhHHhhcceeecCCCceEEEEEcCCCccceecccccccccccccceeeEeeeee Q lcl|NC_021309. 153 FGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGK 232 (497) Q Consensus 153 ~~~~~~~g~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~wv~Eg~~~~~s~~~f~~i~~~~~k 232 (497) .++.+++|++||+++..+||+.+++.++|+++|++++++++.++||+.++ .+.++|++|++.+|+++++|+++++.+|| T Consensus 1 m~t~t~gg~liP~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~ip~~~~-~~~a~wv~E~~~~~~s~~~f~~v~l~~~k 79 (303) T protein:vir:97 1 MGTETSKASLFDKHLVSDLINKVKGHSSLAKLSSQKPIPFNGSKEFTFTL-DSDIDVVAENGKKTHGGLSLEPVTIVPIK 79 (303) T ss_pred CcccCCCCeEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEec-CcceEEeecCccccccccceeeEEeeeEE Confidence 34556678889999999999999999999999999999999999999987 46899999999999999999999999999 Q ss_pred EEeeehhhHHHHh----hHHHHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccchhhhhhhHHHHH Q lcl|NC_021309. 233 VANALTITDEGLR----DAPELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNV 308 (497) Q Consensus 233 la~~~~iS~ell~----d~~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~ 308 (497) +++++++|+|||+ +.++|+++|.+++++++++++|.++++|++++...+............ T Consensus 80 l~~~~~iS~ell~~~~d~~~~l~~~i~~~la~a~~~~ld~a~l~G~~~~~g~~~~~~~~~~~~~~--------------- 144 (303) T protein:vir:97 80 VEYGARLSDEFLYATEEEKIDILKAFNEGFAKKLARGIDLMAMHGINPRTKKASDVIGTNHFDSK--------------- 144 (303) T ss_pred EEEeehhhHHHhhcCccchHHHHHHHHHHHHHHHHHHHHhhhhcccccCCccccccccccccccc--------------- Confidence 9999999999993 346899999999999999999999999976433222111110000000 Q ss_pred HhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHHHHHH Q lcl|NC_021309. 309 KFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRL 388 (497) Q Consensus 309 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~ 388 (497) ..............+++..++..+...+ ..+++|+||+.++..|++ T Consensus 145 ---------------------------------~~~~~~~~~~~~~~~~i~~~~~~~~~~~-~~~~~~vmn~~~~~~L~~ 190 (303) T protein:vir:97 145 ---------------------------------VTQVVKFTESEDADANIEAAVNLIQGAE-GVVTGLAMDTEFSTALAK 190 (303) T ss_pred ---------------------------------cccccccccccchHHHHHHHHHHHhhcC-CCccEEEEcHHHHHHHHH Confidence 0000000111122345555555555443 455679999999999999 Q ss_pred HhhhcCceeccCcccccccccccccccccccceEecCCCCcC--------ceEEEeeccceEEEEeecccEEEeecccc- Q lcl|NC_021309. 389 TKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLG--------TILVGHFAPSVIQTARREGVTMQMTNSNG- 459 (497) Q Consensus 389 lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~--------~~~~gd~~~~~~~i~~r~~~~i~~~~~~~- 459 (497) +||++|+|+|.+...... ...+|+|+||++++.||.. .++||||+.. |.++.|.+++++++++.. T Consensus 191 lkd~~g~~~~~~~~~~~~-----~~~~l~G~Pv~~s~~v~~~~~~~~~~~~~~~Gdf~~~-~~~~~~~~~~~~~~~~~~~ 264 (303) T protein:vir:97 191 VTNGEMGPKMYPELAWGA-----NPDSINGLKSSVNTTVGAGADEAESKDLVIIGDFESM-FKWGYAKQIPMEIIKYGDP 264 (303) T ss_pred hhccCCCeEEecCccCCC-----CCceecceeeEEecccCCccccCCCccEEEEeecccc-EEEEEecCcEEEEeeccCC Confidence 999999999987644322 2358999999999999853 2789999873 668889999999987643 Q ss_pred -----hhhhcCceEEEEEEeecceeecccceEEEEeeCC Q lcl|NC_021309. 460 -----TDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKG 493 (497) Q Consensus 460 -----~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~~ 493 (497) ++|++|++.||++.|+|++|++|+||++|+-.++ T Consensus 265 d~~~~~~~~~n~~~~r~~~r~~~~v~~p~af~~l~~~~~ 303 (303) T protein:vir:97 265 DNSGKDLKGYNQIYLRAEAYIGWGILDAKSFARVTKGEV 303 (303) T ss_pred CCcchhhhhcCcEEEEEEEEeccEeecccceEEeeCCCC Confidence 4699999999999999999999999999999999 No 80 >protein:vir:9309 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803287;genbank:gi:29028597;genbank:GeneID:1258044 Probab=100.00 E-value=1e-55 Score=322.02 Aligned_cols=303 Identities=15% Similarity=0.138 Sum_probs=232.0 Q ss_pred hHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccccccccchhhhHHHHHHHHhhhhHHhhcceeecCCCceE Q lcl|NC_021309. 117 SFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLS 196 (497) Q Consensus 117 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ 196 (497) +++.... ... .+.+................+++.++++||+++..+|++.+++.++|+++|++++++++.++ T Consensus 1 ~~~~~~~-------~~~-~~~f~~~~~~~~~~~a~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~ 72 (324) T protein:vir:93 1 MEQTQKL-------KLN-LQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKK 72 (324) T ss_pred CchhHHH-------HHH-HHHHHHhhhhhhhcccccccccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCCceE Confidence 0000000 000 01111111111122222334455567789999999999999999999999999999999999 Q ss_pred EEEEcCCCccceecccccccccccccceeeEeeeeeEEeeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhcccC Q lcl|NC_021309. 197 YLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGG 275 (497) Q Consensus 197 ~p~~~~~~~~a~wv~Eg~~~~~s~~~f~~i~~~~~kla~~~~iS~ell~d~-~~l~~~i~~~la~~~~~~~d~~~l~G~G 275 (497) ||+.++ .+.++|++|++.+|+++++|++|++.++|++++++||+|||+|+ ++++++|+++|++++++++|.++|+|+| T Consensus 73 ip~~~~-~~~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~G~g 151 (324) T protein:vir:93 73 FTFWAD-KPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQG 151 (324) T ss_pred EEEEec-CcceeeecCCccccccccceeEEEEEeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhcCCC Confidence 999987 46899999999999999999999999999999999999999987 6899999999999999999999999999 Q ss_pred cc-ccccccccccccccccccchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhh Q lcl|NC_021309. 276 YP-GVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEI 354 (497) Q Consensus 276 ~~-~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 354 (497) ++ .|.|+++.......... .... T Consensus 152 ~~~~~~~~~~~~~~~~~~~~--------------------------------------------------------~~~~ 175 (324) T protein:vir:93 152 NNPFGKSIAQSIEKTNKVIK--------------------------------------------------------GDFT 175 (324) T ss_pred CCCcCccccccccccceecc--------------------------------------------------------cccc Confidence 76 47777665432211110 0112 Q ss_pred hhHHHHHHHhhhhhhccCCceEEechhHHHHHHHHhhhcCceeccCcccccccccccccccccccceEecCCCC--cCce Q lcl|NC_021309. 355 AENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIP--LGTI 432 (497) Q Consensus 355 ~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~--~~~~ 432 (497) ++++..++..+...++ .+.+|+||+.+|..|++++|++|+|++.+. ...+|+|+||+.++..+ .+.+ T Consensus 176 ~~~i~~~~~~l~~~~~-~~~~~v~n~~~~~~L~~l~d~~G~~~~~~~----------~~~~l~G~PVv~~~~~~~~~~~i 244 (324) T protein:vir:93 176 QDNIIDLEALLEDDEL-EANAFISKTQNRSLLRKIVDPETKERIYDR----------NSDSLDGLPVVNLKSSNLKRGEL 244 (324) T ss_pred HHHHHHHHHhhhhccC-CCCEEEEcHHHHHHHHHhhCCCCCeeecCC----------CCCcccceeeEeecCCCCCcceE Confidence 3556666666666654 455899999999999999999999998643 23579999999977644 5668 Q ss_pred EEEeeccceEEEEeecccEEEeecccc------------hhhhcCceEEEEEEeecceeecccceEEEEeeCCCC-CC Q lcl|NC_021309. 433 LVGHFAPSVIQTARREGVTMQMTNSNG------------TDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGAT-GS 497 (497) Q Consensus 433 ~~gd~~~~~~~i~~r~~~~i~~~~~~~------------~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~~a~-~~ 497 (497) ++|||++ +.++++++++|+++++.. ++|++|++.||+++|+||.|++|+||++|+...+.+ .| T Consensus 245 ~~gdfs~--~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~n~~~~r~~~r~d~~v~~~~a~~~l~~a~~~~~~~ 320 (324) T protein:vir:93 245 ITGDFDK--LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDSV 320 (324) T ss_pred EEEecce--EEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEecccccCCCC Confidence 9999997 457889999999988743 569999999999999999999999999987554433 22 No 81 >protein:vir:1638 Length: 298 # NCBI annotation: Structural protein # Family: family:all:966 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695059;genbank:gi:23455750;genbank:GeneID:955469 Probab=100.00 E-value=4.5e-56 Score=323.99 Aligned_cols=280 Identities=15% Similarity=0.109 Sum_probs=222.8 Q ss_pred cccccccccchhhhHHHHHHHHhhhhHHhhcceeecCCCceEEEEEcCCCccceecccccccccccccceeeEeeeeeEE Q lcl|NC_021309. 155 STGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVA 234 (497) Q Consensus 155 ~~~~~g~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~wv~Eg~~~~~s~~~f~~i~~~~~kla 234 (497) ...++|.++||++..+||+.+++.++|+++|++++++++.+++|+.++ .+.++|++|++.+|+++++|+++++.+||++ T Consensus 1 ma~~gG~lvp~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~-~~~a~~v~E~~~~~~~~~~f~~v~l~~~k~a 79 (298) T protein:vir:16 1 MVLNKGTLFDPTLVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTM-DSEIDVVAESGKKTHGGVTLAPQTMVPIKVE 79 (298) T ss_pred CcccCcceechhHHHHHHHHHHhhhhhhhhcceeeccCCceEEEEEec-CcceEEecCCccccccccceeEEEEeeeeEE Confidence 445667889999999999999999999999999999999899999887 4789999999999999999999999999999 Q ss_pred eeehhhHHHHh---h-HHHHHHHHHHHHHHHHHHHHHhhhhcccC--ccccccccccccccccccccchhhhhhhHHHHH Q lcl|NC_021309. 235 NALTITDEGLR---D-APELFNFVQGRLLEGIQRKEEVQLLAGGG--YPGVNGLLQRSTGFTASSASSLFGATSATVSNV 308 (497) Q Consensus 235 ~~~~iS~ell~---d-~~~l~~~i~~~la~~~~~~~d~~~l~G~G--~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~ 308 (497) ++++||+|||+ | ..+|++||+++|++++++++|.++++|++ ++.+.++............ T Consensus 80 ~~~~iS~ell~~s~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~-------------- 145 (298) T protein:vir:16 80 YGARISDEFMYASDEEKINILQEFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQ-------------- 145 (298) T ss_pred EeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCCccccccccccccccccc-------------- Confidence 99999999995 3 35899999999999999999999999964 3444443332111100000 Q ss_pred HhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHHHHHH Q lcl|NC_021309. 309 KFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRL 388 (497) Q Consensus 309 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~ 388 (497) ............+++..++..+...+ .++.+|+||+.+|..|++ T Consensus 146 -----------------------------------~~~~~~~~~~~~~~i~~~~~~~~~~~-~~~~~~vmn~~~~~~l~~ 189 (298) T protein:vir:16 146 -----------------------------------KVEAPRGIADPNGAIENAVELLTGVD-ADVTGIAINPSFRSALAK 189 (298) T ss_pred -----------------------------------ccccccccccHHHHHHHHHHHhhhcC-CCccEEEEcHHHHHHHHH Confidence 00000011112345555555555544 445689999999999999 Q ss_pred HhhhcCceeccCcccccccccccccccccccceEecCCCCcC------ceEEEeeccceEEEEeecccEEEeecccc--- Q lcl|NC_021309. 389 TKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLG------TILVGHFAPSVIQTARREGVTMQMTNSNG--- 459 (497) Q Consensus 389 lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~------~~~~gd~~~~~~~i~~r~~~~i~~~~~~~--- 459 (497) +||++|||+|++..... .+.+|+|+||++++.+|.+ .+++|||++ ++.++.|.+++++++++.. T Consensus 190 lkd~~G~~i~~~~~~~~------~~~~l~G~PV~~~~~v~~~~~~~~~~~~~GDfs~-~~~~~~~~~~~~~~~~~~~~~~ 262 (298) T protein:vir:16 190 QKDLQDNALFPELKWGA------TPDTINGLPVDVNKTVSDMSLTQRDRAIIGDFAN-GFKWGYAKEVPLEVIQYGDPDN 262 (298) T ss_pred hhccCCCeeecCcccCC------CCceecceeeEEecccccccCCCccEEEEeeccc-eEEEEEecCceEEEeeccCCcC Confidence 99999999998754332 2458999999999999863 478899998 4667889999999987632 Q ss_pred ---hhhhcCceEEEEEEeecceeecccceEEEEeeC Q lcl|NC_021309. 460 ---TDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKK 492 (497) Q Consensus 460 ---~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~ 492 (497) ++|++||+.||++.|+||+|++|+||++|+-.+ T Consensus 263 ~~~~~f~~~~v~~ra~~r~d~~v~~~~a~~~l~~at 298 (298) T protein:vir:16 263 SGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) T ss_pred cchhhhhcCcEEEEEEEEEccEeecccceEEEeecC Confidence 469999999999999999999999999998877 No 82 >protein:vir:78523 Length: 338 # NCBI annotation: Putative head structural protein # Family: family:all:507 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491585;genbank:gi:157786408;genbank:GeneID:5625675 Probab=100.00 E-value=8.7e-56 Score=322.43 Aligned_cols=303 Identities=15% Similarity=0.138 Sum_probs=231.9 Q ss_pred HHHHhhhhhhhhhhhhccccccccccccchhhhHHHHHHHHhhhhHHhhcceeecCCCceEEEEEcCC-------Cccce Q lcl|NC_021309. 136 GAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAA-------HNNAA 208 (497) Q Consensus 136 ~~~~~~~~~~~~~~~~~~~~~~~~g~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~-------~~~a~ 208 (497) -......+..........+.++.++.+||+++..+||+.+++.++|+++|++++++++.++||+.+.. ...+. T Consensus 1 ~~~~~e~~~~~~~~~~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~~a~~v~~~~~~ 80 (338) T protein:vir:78 1 MATLNELAPNTAGSNHQGRLAHVPSDLLPKEIVGPIFDKAQESSLVLRLGENIPISYGETIIPTTVKRPEVGQVGVGTSN 80 (338) T ss_pred CcchHHhhhhhcccccccceecccccccchHHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCccceeecccccc Confidence 00000000000000111122334556788889999999999999999999999999999999998752 24567 Q ss_pred ecccccccccccccceeeEeeeeeEEeeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhcccCccc---cccccc Q lcl|NC_021309. 209 AVAEAGTYPFSSEEFARVYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPG---VNGLLQ 284 (497) Q Consensus 209 wv~Eg~~~~~s~~~f~~i~~~~~kla~~~~iS~ell~d~-~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~---p~Gi~~ 284 (497) |++|++.+|+++++|++|++.++|++++++||+|+|+|+ +++++||+++|++++++++|.+||+|+|++. |.||.+ T Consensus 81 ~~~Eg~~~~~~~~~f~~v~l~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~a~~~~~d~~~l~G~g~~~~~~~~gi~~ 160 (338) T protein:vir:78 81 EQREGGTKPLSGTAWDTRSVAPIKLATIVTVSEEFARMNPSGLYTKLQADLAYAIGRGIDLAVFHGKSPLTGSALQGIDT 160 (338) T ss_pred cccccccccccccceeEEEEEEEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccc Confidence 889999999999999999999999999999999999987 6899999999999999999999999999754 666665 Q ss_pred cccccccccccchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHh Q lcl|NC_021309. 285 RSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVD 364 (497) Q Consensus 285 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 364 (497) ............. ........+.+..+... T Consensus 161 ~~~~~~~~~~~~~--------------------------------------------------~~~~~~~~~~~~~~~~~ 190 (338) T protein:vir:78 161 NNVIVNTTNVDYL--------------------------------------------------QTGTTPLLDRFLDGYDL 190 (338) T ss_pred ccccccccccccc--------------------------------------------------cccchhhHHHHHHHHHH Confidence 4433221111100 00011223455556566 Q ss_pred hhhhhccCCceEEechhHHHHH---HHHhhhcCceeccCcccccccccccccccccccceEecCCCCcC---------ce Q lcl|NC_021309. 365 IQLTLFQTPNAVVMNPRDWELL---RLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLG---------TI 432 (497) Q Consensus 365 ~~~~~~~~~~~~~~n~~~~~~l---~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~---------~~ 432 (497) +........++|+||+.++..| ++++|++|+|+|.+.... ..+.+|+|+||++++.||.+ .+ T Consensus 191 ~~~~~~~~~~~~~m~~~~~~~L~~~~~l~d~~g~~l~~~~~~~------~~~~~l~G~PV~~~~~ip~~~~~~~~~~~~~ 264 (338) T protein:vir:78 191 VSANTDVDFNGWAADPRYRARLLRSQAYRDANGNVDPTRINLA------ASAGDLLGLPVQFGKAVGGDLGAATDSKVRV 264 (338) T ss_pred hhhhccccceEEEEchHHHHHHHHHhhhccCCCceeecccccC------CCCceeeeeeEEEccccCccccccCCcccEE Confidence 6666667788999999998776 457899999999875433 23568999999999999852 37 Q ss_pred EEEeeccceEEEEeecccEEEeecccc------------hhhhcCceEEEEEEeecceeecccceEEEEeeCCCCC Q lcl|NC_021309. 433 LVGHFAPSVIQTARREGVTMQMTNSNG------------TDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGATG 496 (497) Q Consensus 433 ~~gd~~~~~~~i~~r~~~~i~~~~~~~------------~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~~a~~ 496 (497) ++|||++ |.++++.+++|+++++.. ++|++|++.+|++.|+||+|+||+||++|+-.+++.+ T Consensus 265 ~~gdfs~--~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~ 338 (338) T protein:vir:78 265 VGGDFSQ--LKYGFADEIRVKMSDTATLTDNTSPTPQTVSMWQTNQIAILIEVTFGWLLGDKQAFVKFVDDEDPDA 338 (338) T ss_pred EEEecce--EEEEeecccEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEeecccceEEEecccCCCC Confidence 8999997 668999999999987642 5799999999999999999999999999999888888 No 83 >protein:vir:78223 Length: 333 # NCBI annotation: Putative major head protein # Family: family:all:966 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491666;genbank:gi:157786490;genbank:GeneID:5625701 Probab=100.00 E-value=1.6e-55 Score=320.95 Aligned_cols=299 Identities=15% Similarity=0.143 Sum_probs=229.6 Q ss_pred HHHHhhhhhhhhhhhhcc-ccccccccccchhhhHHHHHHHHhhhhHHhhcceeecCCCceEEEEEcCCCccceecccc- Q lcl|NC_021309. 136 GAFADGETAPAAIGQNPF-GSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEA- 213 (497) Q Consensus 136 ~~~~~~~~~~~~~~~~~~-~~~~~~g~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~wv~Eg- 213 (497) -+.....+.. ....... +.++.++.+||+++..+|++.+++.++|+++|++++++++.+.+|+.++ .+.++|++|+ T Consensus 1 ~a~l~el~~~-~~~~~~~g~~~~~~~~liP~~~~~~ii~~l~~~s~l~~~~~~~~~~~~~~~~p~~~~-~~~a~~v~eg~ 78 (333) T protein:vir:78 1 MATLNELLPN-SAGSNHQGRLAHVPSDLLPKEIVGPIFDKAQESSLVLRMGEQIPISYGETIIPTTVK-RPEVGQVGVGT 78 (333) T ss_pred CchhHHhhhh-cccccccCceecCCccccchhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEeC-CceeEeecCcc Confidence 0000000000 0001111 1223344578888999999999999999999999999999999999987 4567777665 Q ss_pred -------cccccccccceeeEeeeeeEEeeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhcccCccc---cccc Q lcl|NC_021309. 214 -------GTYPFSSEEFARVYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPG---VNGL 282 (497) Q Consensus 214 -------~~~~~s~~~f~~i~~~~~kla~~~~iS~ell~d~-~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~---p~Gi 282 (497) +.+|+++++|++|++++||++++++||+|+++|+ +++++||+++|++++++++|.++|+|+|+++ |.|+ T Consensus 79 ~~~~~e~~~~~~~~~~f~~i~l~~~kl~~~~~is~ell~~s~~~~~~~i~~~la~ai~~~~d~~~l~G~g~~~~~~~~g~ 158 (333) T protein:vir:78 79 SNEQREGGLKPLSGTAWDTRSVSPIKLATIVTVSEEFARMNPSGLYTKLQGDLAYAIGRGIDLAVFHGKSPLTGSALQGI 158 (333) T ss_pred cccccccccccccccceeEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCCCcccccc Confidence 5678899999999999999999999999999976 6899999999999999999999999999865 4455 Q ss_pred cccccccccccccchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHH Q lcl|NC_021309. 283 LQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAF 362 (497) Q Consensus 283 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 362 (497) .+.....+.+.... ........++.+..++ T Consensus 159 ~~~~~~~~~~~~~~--------------------------------------------------~~~~~~~~~~~i~~~~ 188 (333) T protein:vir:78 159 DTDNVIANTTNVDY--------------------------------------------------LQETGDPLLDRLLDGY 188 (333) T ss_pred cccccccccccccc--------------------------------------------------cccccchhHHHHHHHH Confidence 44433222211100 0011122345677777 Q ss_pred HhhhhhhccCCceEEechhHHHHHHH---HhhhcCceeccCcccccccccccccccccccceEecCCCCcC--------- Q lcl|NC_021309. 363 VDIQLTLFQTPNAVVMNPRDWELLRL---TKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLG--------- 430 (497) Q Consensus 363 ~~~~~~~~~~~~~~~~n~~~~~~l~~---lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~--------- 430 (497) ..+...++.++++|+|||.+|..|++ ++|++|+|+|.+..... .+.+|+|+||++++++|.+ T Consensus 189 ~~~~~~~~~~~~~~vmn~~~~~~L~~~~~~~d~~G~~i~~~~~~~~------~~~~l~G~Pv~~~~~i~~~~~~~~~~~~ 262 (333) T protein:vir:78 189 DLVSANTDVEFNGWAVDPRFRAHLLRAQAYRDANGNVDPSRINLAA------QTGDVLGLPAQFGRAVGGDLGAAVDSKT 262 (333) T ss_pred HhhccccccCceEEEEcchHHHHHHHHhhhcCCCCceeecCccccC------CCceeeceeeEEccccCCCccccCCCcc Confidence 77777778888899999999987755 67999999998754332 3468999999999999965 Q ss_pred ceEEEeeccceEEEEeecccEEEeecccc---------hhhhcCceEEEEEEeecceeecccceEEEEeeCCC Q lcl|NC_021309. 431 TILVGHFAPSVIQTARREGVTMQMTNSNG---------TDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGA 494 (497) Q Consensus 431 ~~~~gd~~~~~~~i~~r~~~~i~~~~~~~---------~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~~a 494 (497) .+++|||++ |.+++|.+++|+++++.. ++|++|++.||+++|+||.|++|+||++|+..+++ T Consensus 263 ~~~~gD~~~--~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~a~ 333 (333) T protein:vir:78 263 RIIGGDFSQ--LKFGFADEIRIKMSDTATLTDSGSATVSMWQTNQIAILIEVTFGWLLGDKQAFVKFVDDEQP 333 (333) T ss_pred EEEEEeccc--EEEEEeeccEEEEeccccccccccceeehhhcCcEEEEEEEEEccEEecccceEEEeccCCC Confidence 389999998 568899999999988742 57999999999999999999999999999988888 No 84 >protein:vir:9643 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795405;genbank:gi:28876178;genbank:GeneID:1257724 Probab=100.00 E-value=6.6e-55 Score=317.62 Aligned_cols=363 Identities=15% Similarity=0.111 Sum_probs=224.1 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021309. 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) Q Consensus 1 ~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~a~~~~~~~~~~~~ 80 (497) |+...+-.++ ++++..++.+.++..... ++..+.++ +..+.+..++... T Consensus 1 M~i~~~~~~~---------------------~~e~~~~l~~~~~~~~~~-------e~~~~~~~---~~~~~~~~~~~~~ 49 (377) T protein:vir:96 1 MAINLKELPK---------------------YREAVAELSAKISAGATP-------EEQEKLFE---AAFTTMGDEILAK 49 (377) T ss_pred CCccHHHHHH---------------------HHHHHHHHHHHHhhcccH-------HHHHHHHH---HHHHHHHHHHHHH Confidence 4332221111 111111111111110000 00001111 1111111111110 Q ss_pred HHHHHHHHHHHHHhhhhhhHHHHhHhhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccccc Q lcl|NC_021309. 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFA 160 (497) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 160 (497) .... .+... .... ..... ..+.+..+.. ....++.+.+| T Consensus 50 ~~~e---~~~~~-----------~~~~---~~~~l---------------t~ee~~~~~~---------~~~~~~~~~gg 88 (377) T protein:vir:96 50 NEEE---MERMF-----------DLRD---KNREL---------------TAEEIKFFND---------IDKNVGGKDKF 88 (377) T ss_pred HHHH---HHHHH-----------Hhcc---CCccc---------------CHHHHHHHHH---------HHhcCCCCCCc Confidence 0000 00000 0000 00000 0000111100 01123445556 Q ss_pred cccchhhhHHHHHHHHhhhhHHhhcceeecCCCceEEEEEcCCCccceecccccccc-cccccceeeEeeeeeEEeeehh Q lcl|NC_021309. 161 PGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYP-FSSEEFARVYEQVGKVANALTI 239 (497) Q Consensus 161 ~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~wv~Eg~~~~-~s~~~f~~i~~~~~kla~~~~i 239 (497) ++||+++...|++.+...++|+++|+++++++ ..++|+.++ .+.++|++|++..+ +++++|++|++.+||++++++| T Consensus 89 ~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~~-~~~i~~~~~-~~~a~wv~e~~~~~~~~~~~f~~i~l~~~kl~~~~~i 166 (377) T protein:vir:96 89 KLLPEETMVQVFDDLVAEHPLLKVINFKNTSL-RLKALTAET-SGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVI 166 (377) T ss_pred eecCHHHHHHHHHHHHhhhhhhhhceeEecCC-ceEEEEecC-CcceeEeecccccccccCccceeEeeeeeeEEeechh Confidence 67777889999999999999999999999865 478998776 57899999988765 5799999999999999999999 Q ss_pred hHHHHhhHH-HHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccchhhhhhhHHHHHHhhhhhcchh Q lcl|NC_021309. 240 TDEGLRDAP-ELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGA 318 (497) Q Consensus 240 S~ell~d~~-~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 318 (497) |++||+|++ ++++||+++|+++++.++|.+|++|+|+++|.||++..+..+................... + T Consensus 167 s~~ll~ds~~~le~~i~~~l~~~~~~~~~~a~i~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~----~---- 238 (377) T protein:vir:96 167 PKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQPVGLLKDLSQPTVDQSTGRDITTYKTDKEAI----A---- 238 (377) T ss_pred hHHHhhcchhhHHHHHHHHHHHHHHHHHhhceEeccCCCcceeeeeccccccccccccccccceeeccccc----c---- Confidence 999999985 8999999999999999999999999999999999998765554433222111110000000 0 Q ss_pred hhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhc----------cCCceEEechhHHHHHHH Q lcl|NC_021309. 319 FVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLF----------QTPNAVVMNPRDWELLRL 388 (497) Q Consensus 319 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------~~~~~~~~n~~~~~~l~~ 388 (497) ..... ..+.+.+.+..+...+. ....+|+||+.++..+ T Consensus 239 --------------------------~~~~~----~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~~-- 286 (377) T protein:vir:96 239 --------------------------DLSDL----DPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTL-- 286 (377) T ss_pred --------------------------ccccC----ChhHHHHHHHHHHHhhccccccccccccCceEEEEchhhHHhc-- Confidence 00000 01122222222222222 2234699999987644 Q ss_pred HhhhcCceeccCcccccccccccccccccccc--eEecCCCCcCceEEEeeccceEEEEeecccEEEeecccchhhhcCc Q lcl|NC_021309. 389 TKDANGQYMGGNFFGNAYGNPVNGGKNIWGVP--VVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGK 466 (497) Q Consensus 389 lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~P--vv~~~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~ 466 (497) .|+|.|++. .|. ..+++|+| |+.++.+|+++++||||++ |.|++|.+++|+.+++. .|.+|+ T Consensus 287 ----~~~~~~~~~----~G~----~~~~l~~p~~v~~s~~~p~~~i~fgdf~~--Y~i~~r~~~~i~~~~~~--~~~~d~ 350 (377) T protein:vir:96 287 ----EAKFTSRNQ----FGE----YVTVLPHGITILESLAVETGKAIAFVANR--YDAFMATASTIEEYDQT--FAMEDL 350 (377) T ss_pred ----cccccccCC----CCC----ceeccCCCceEEecCCCCcccEEEEEcCc--EEEEEecccEEEeehhh--hhhcCC Confidence 477777752 122 23677776 5789999999999999998 88999999999998764 599999 Q ss_pred eEEEEEEeecceeecccceEEEEeeCC Q lcl|NC_021309. 467 VTVRAEERLGLLVYRPSAFQLIQLKKG 493 (497) Q Consensus 467 v~~r~~~r~~~~v~~~~a~~~l~~~~~ 493 (497) +.||+..|+||++++|+||++|+++-. T Consensus 351 ~~f~~~~r~dG~~~d~~a~~vl~l~~~ 377 (377) T protein:vir:96 351 QLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) T ss_pred eEEEEEEEEcCEEecCCcEEEEEEecC Confidence 999999999999999999999999988 No 85 >protein:vir:99749 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004307;genbank:gi:122891761;genbank:GeneID:4712304 Probab=100.00 E-value=3.9e-55 Score=318.88 Aligned_cols=303 Identities=16% Similarity=0.151 Sum_probs=233.3 Q ss_pred hhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccccccccchhhhHHHHHHHHhhhhHHhhcce Q lcl|NC_021309. 108 FEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISS 187 (497) Q Consensus 108 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~p~~~~~ii~~~~~~~~l~~~~~~ 187 (497) +++. .+.+ ...+.+................++..+|++||+++..+|++.+++.++|+++|++ T Consensus 1 ~~k~------~~~~-----------~~~~~~~~~~~~~~~~~a~~~~~~~~~~~lip~~~~~~ii~~~~~~s~l~~~~~~ 63 (324) T protein:vir:99 1 MEQT------QKLK-----------LNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMRLGKY 63 (324) T ss_pred CCCc------hHhh-----------HHHHHHHHHhhhhhhccccceeccCCCcceechhHHHHHHHHHHhhchhhhhcce Confidence 0000 0000 0001111111111111222233445556788989999999999999999999999 Q ss_pred eecCCCceEEEEEcCCCccceecccccccccccccceeeEeeeeeEEeeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHH Q lcl|NC_021309. 188 RPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKE 266 (497) Q Consensus 188 ~~~~~~~~~~p~~~~~~~~a~wv~Eg~~~~~s~~~f~~i~~~~~kla~~~~iS~ell~d~-~~l~~~i~~~la~~~~~~~ 266 (497) ++++++.+.||+.++ .+.+.|++|++.+|+++++|+++++.++|++++++||+|||+|+ +++++||.+.|++++++++ T Consensus 64 ~~~~~~~~~~p~~~~-~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~ai~~~~ 142 (324) T protein:vir:99 64 EPMEGTEKKFTFWAD-KPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKF 142 (324) T ss_pred eeccCCceEEEEEec-CcceeEeccCccccccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHH Confidence 999999999999886 57899999999999999999999999999999999999999997 6899999999999999999 Q ss_pred HhhhhcccCcc-ccccccccccccccccccchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccc Q lcl|NC_021309. 267 EVQLLAGGGYP-GVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVA 345 (497) Q Consensus 267 d~~~l~G~G~~-~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 345 (497) |.++|+|+|++ .|.|+++.....+.... T Consensus 143 d~~~l~G~g~~~~~~~~~~~~~~~~~~~~--------------------------------------------------- 171 (324) T protein:vir:99 143 DEAGILNQGNNPFGKSIAQSIEKTNKVIK--------------------------------------------------- 171 (324) T ss_pred HHHhhhcCCCCccCccccccccccceecc--------------------------------------------------- Confidence 99999999976 47777664332211110 Q ss_pred cccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHHHHHHHhhhcCceeccCcccccccccccccccccccceEecC Q lcl|NC_021309. 346 GSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTP 425 (497) Q Consensus 346 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~ 425 (497) ....++++..++..+...+ ..+++|+||+.+|..|++++|++|+|+|.+. .+.+|+|+||+.++ T Consensus 172 -----~~~~~~~i~~~~~~l~~~~-~~~~~~v~n~~~~~~L~~l~d~~g~~~~~~~----------~~~~l~G~PVv~~~ 235 (324) T protein:vir:99 172 -----GDFTQDNIIDLEALLEDDE-LEANAFISKTQNRSLLRKIVDPETKERIYDR----------NSDTLDGLPVVNLK 235 (324) T ss_pred -----ccCCHHHHHHHHHhhhhcc-CCCCEEEEcHHHHHHHHHhhcCCCceeecCC----------CCccccceeEEeec Confidence 0112355666666666554 4455899999999999999999999998642 23579999999998 Q ss_pred CCCc--CceEEEeeccceEEEEeecccEEEeecccc------------hhhhcCceEEEEEEeecceeecccceEEEEee Q lcl|NC_021309. 426 LIPL--GTILVGHFAPSVIQTARREGVTMQMTNSNG------------TDFVDGKVTVRAEERLGLLVYRPSAFQLIQLK 491 (497) Q Consensus 426 ~~~~--~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~------------~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~~ 491 (497) .++. +.+++|||++ +.++++.+++|+++++.. ++|++|++.||++.|+|+.|.+|+||++|+.. T Consensus 236 ~~~~~~~~~i~gd~~~--~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~lt~a 313 (324) T protein:vir:99 236 SSNLKRGELITGDFDK--LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPA 313 (324) T ss_pred CCCCCcceEEEEeccc--EEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEEec Confidence 8775 4589999998 557889999999987643 46999999999999999999999999999887 Q ss_pred CCCCCC Q lcl|NC_021309. 492 KGATGS 497 (497) Q Consensus 492 ~~a~~~ 497 (497) .+.+.. T Consensus 314 ~~~~~~ 319 (324) T protein:vir:99 314 DKKTDS 319 (324) T ss_pred cCCCCC Confidence 766664 No 86 >protein:vir:78640 Length: 352 # NCBI annotation: phage capsid # Family: family:all:658 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429943;genbank:gi:156603997;genbank:GeneID:5525386 Probab=100.00 E-value=1.4e-54 Score=315.76 Aligned_cols=348 Identities=14% Similarity=0.094 Sum_probs=223.7 Q ss_pred HHHHHHHHHhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHhHhhhhhhhhhhhhHH Q lcl|NC_021309. 40 EPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEVEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFN 119 (497) Q Consensus 40 ~~~~~~~~~~~~~~e~~~e~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 119 (497) ++++++...++ +++.++++.++++++.++.+.+...... ... .. ......+... .+. T Consensus 1 ~eei~~l~~~~------~~l~~~~~~l~~~~d~~e~e~~~~~~~~----~~~--~~-~~~~~~~~~~------~~~---- 57 (352) T protein:vir:78 1 MEDIKQLETEK------AGLQQRFNIVERQVQDIEEKEKAKVKDK----GEA--YQ-SLNDNEKLVK------AKA---- 57 (352) T ss_pred ChhHHHHHHHH------HHHHHHHHHHHHHHHHHHHHHHHHhhhc----ccc--cc-ccchhhhHHH------HHH---- Confidence 11111111111 1111222222222222221111000000 000 00 0000000000 000 Q ss_pred HHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccccccccchhhhHHHHHHHHhhhhHHhhcceeecCCCceEEEE Q lcl|NC_021309. 120 VSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLT 199 (497) Q Consensus 120 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~ 199 (497) ...+.............. ..........++++.+|.+||+++..+||+.++..++|+++|+++++++ ..+|+ T Consensus 58 ---~~~r~~~~~~~~~~~~~~---~~~~~~al~~~~~~~gG~lIP~~~~~~Ii~~l~~~s~l~~~~~v~~~~~--~~~p~ 129 (352) T protein:vir:78 58 ---EFYRHAILPNEFEKPSME---AQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKG--LEIPR 129 (352) T ss_pred ---HHHHHHhhhhHHHHHHhh---HHHHHHHhccCCCCCCceeccHhHHHHHHHHHHhhcchhhheeeEecCC--ceEEE Confidence 000000000000000000 0011112222344445566666788999999999999999999988765 46788 Q ss_pred EcCCCccceecccccccccccccceeeEeeeeeEEeeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHh-hhhcccCcc Q lcl|NC_021309. 200 ESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEV-QLLAGGGYP 277 (497) Q Consensus 200 ~~~~~~~a~wv~Eg~~~~~s~~~f~~i~~~~~kla~~~~iS~ell~d~-~~l~~~i~~~la~~~~~~~d~-~~l~G~G~~ 277 (497) .+...+.+.||+|++.+|+++++|++|++.+|+++++++||+|||+|+ ++|++||.++|+++++.+++. .|.+|+|++ T Consensus 130 ~~~~~~~a~~v~E~~~~~~~~~~f~~v~~~~~k~~~~i~is~ell~Ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~~ 209 (352) T protein:vir:78 130 VSYTLDDDDFITDVETAKELKLKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKSG 209 (352) T ss_pred EecCCCcccccccccccccccccceeeeecceeEEeechhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhhhhcCCCCc Confidence 776556899999999999999999999999999999999999999996 699999999999999998655 667888999 Q ss_pred ccccccccccccccccccchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhH Q lcl|NC_021309. 278 GVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAEN 357 (497) Q Consensus 278 ~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 357 (497) +|.|+++..+...++.. ..++. T Consensus 210 ~~~g~l~~~~~~~~t~~----------------------------------------------------------~~~d~ 231 (352) T protein:vir:78 210 LEHMSFYNGSVKEVEGA----------------------------------------------------------NMYDA 231 (352) T ss_pred ccccceecccccccccc----------------------------------------------------------chHHH Confidence 99998876553321110 11356 Q ss_pred HHHHHHhhhhhhccCCceEEechhHHHHHHHHhhhcCceeccCcccccccccccccccccccceEecCCCCcCceEEEee Q lcl|NC_021309. 358 VFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHF 437 (497) Q Consensus 358 ~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~~~~gd~ 437 (497) +.+++..+...|+.+ .+|+||+.++..+.+++|.+|+|+|... +.+|+|+||++++.++ +++|||| T Consensus 232 i~~~~~~l~~~~~~~-a~~~mn~~t~~~l~~~~~~~~~~~~~~~-----------~~~llG~PV~~~~~~~--~~~~Gdf 297 (352) T protein:vir:78 232 IINALADLHEDYRDN-ATIYMRYADYVKIISVLSNGTTNFFDTP-----------AEKVFGKPVVFTDAAV--KPIVGDF 297 (352) T ss_pred HHHHHhccChhhhcC-CEEEEehHHHHHHHHHHhccCCcccccC-----------CccccccceEEecCCC--ceeEeeh Confidence 666777777776654 5799999999999999999999998642 2479999999999765 5899999 Q ss_pred ccceEEEEeecccEEEeecccchhhhcCceEEEEEEeecceeecccceEEEEeeCCCCCC Q lcl|NC_021309. 438 APSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 438 ~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~~a~~~ 497 (497) +++ |.. +.++.++..+. ..++++.|++..|+|++|++|+||+.+++++++... T Consensus 298 ~~~-~~~--~~~~~~~~~~~----~~~g~~~f~~~~r~Dg~~~~~eA~~~l~~~a~~~~~ 350 (352) T protein:vir:78 298 NYF-GIN--YDGTTYDTDKD----VKKGEYLFVLTAWYDQQRTLDSAFRIAKAKESTGSL 350 (352) T ss_pred hhh-hhh--hhhheeeeecc----ccCCeeEEEEEeeeCceeechhheEEEEeecccCCC Confidence 984 333 34555554433 347899999999999999999999999999998888 No 87 >protein:vir:100632 Length: 381 # NCBI annotation: 77ORF006 # Family: family:all:635 # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958606;genbank:gi:41189521;genbank:GeneID:2743778 Probab=100.00 E-value=3.8e-55 Score=318.96 Aligned_cols=364 Identities=13% Similarity=0.060 Sum_probs=222.2 Q ss_pred HHHHHHHHHHHHHHhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHhHhhhhhhhhh Q lcl|NC_021309. 35 ALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEVEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKF 114 (497) Q Consensus 35 ~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 114 (497) .--++.++++ +..+++...+++.+ ....+.+.. + .............. ..+.... . T Consensus 1 m~~kl~~~~~---------~~~~~~~~~~~~~~--~~~~~~~~~--~-~~~~~~~~~~~~~~--~~e~~~~--------~ 56 (381) T protein:vir:10 1 MTINLSETFA---------NAKNEFINAVNNGE--PQERQNELY--G-DMINQLFEETKLQA--KAEAERV--------S 56 (381) T ss_pred CchhHHHHHH---------HHHHHHHHHHHhhh--HHHHHHHHH--H-HHHHhhhhhHHHHH--HHHHHHH--------H Confidence 1111111111 11111111111100 000000000 0 00000000000000 0000000 0 Q ss_pred hhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccccccccchhhhHHHHHHHHhhhhHHhhcceeecCCCc Q lcl|NC_021309. 115 DVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPN 194 (497) Q Consensus 115 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~ 194 (497) . ..........+.+ ........++...+|++||+++...|++.+...++||++|+++++++ . T Consensus 57 ~-------~~~~~~~l~~~e~----------~~~~~~~~~t~~~Gg~lvP~~~~~~I~~~l~~~spir~~a~v~~~~~-~ 118 (381) T protein:vir:10 57 S-------LPKSAQTLSANQR----------NFFMDINKSVGYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL-R 118 (381) T ss_pred H-------hcccccccCHHHH----------HHHHHHhhcCCCCCceecCHHHHHHHHHHHHhhcceeeeeeeEecCc-c Confidence 0 0000000000000 00112233455566777888899999999999999999999999865 4 Q ss_pred eEEEEEcCCCccceeccccccc-ccccccceeeEeeeeeEEeeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhc Q lcl|NC_021309. 195 LSYLTESAAHNNAAAVAEAGTY-PFSSEEFARVYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLA 272 (497) Q Consensus 195 ~~~p~~~~~~~~a~wv~Eg~~~-~~s~~~f~~i~~~~~kla~~~~iS~ell~d~-~~l~~~i~~~la~~~~~~~d~~~l~ 272 (497) ..+|+.++ .+.+.|++|++.. ++++|+|++|++.+||++++++||++||+|+ .+|++||+.+|+++++.++|.+|++ T Consensus 119 ~~i~~~~~-~~~a~W~~e~~~~~~~~~~~f~~i~l~~~kl~a~i~is~elL~Ds~~~le~~i~~~la~~~a~~~~~afi~ 197 (381) T protein:vir:10 119 LKFLKSET-SGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLK 197 (381) T ss_pred eEEEeecC-CcceEEeecccccccccCccceeEeecceeEEeeccccHHHHhccHHHHHHHHHHHHHHHHHHHhhceeEe Confidence 68898876 5689999998775 4678999999999999999999999999997 5899999999999999999999999 Q ss_pred ccCccccccccccccccccccccchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhh Q lcl|NC_021309. 273 GGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAA 352 (497) Q Consensus 273 G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 352 (497) |+|+++|.||++......................... .....+.. T Consensus 198 GdG~~qP~Gil~~~~~~~~~~~g~~~~~~~~~~~t~~----------~~~~~~~~------------------------- 242 (381) T protein:vir:10 198 GTGKDQPIGLNRQVQKGVSVTDGAYPEKEEQGTLTFA----------NPRATVNE------------------------- 242 (381) T ss_pred cccCCCceeeeecCCcccccccccccccccccccccc----------chhhHHHH------------------------- Confidence 9999999999975432221111100000000000000 00000000 Q ss_pred hhhhHHHHHH---HhhhhhhccCCceEEechhHHHHHHHHh---hhcCceeccCcccccccccccccccccccceEecCC Q lcl|NC_021309. 353 EIAENVFDAF---VDIQLTLFQTPNAVVMNPRDWELLRLTK---DANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPL 426 (497) Q Consensus 353 ~~~~~~~~~~---~~~~~~~~~~~~~~~~n~~~~~~l~~lk---d~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~ 426 (497) ...++... .......+..+..|+||+.++..++.++ +++|+|+|..+ +|.||++++. T Consensus 243 --l~~~~~~~~~~~~~~~~~~~~~~~~vmn~~t~~~l~~~~~~~~~~G~~v~~lp---------------~g~~vv~~~~ 305 (381) T protein:vir:10 243 --LTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTALP---------------FNLNVIESTV 305 (381) T ss_pred --HHHHHHhhhhhhccccccccCceEEEEchhhHHhhccccccCCCCCceeecCC---------------CCceeEEcCC Confidence 00000000 0111122334557999999999888655 88999988632 3778999999 Q ss_pred CCcCceEEEeeccceEEEEeecccEEEeecccchhhhcCceEEEEEEeecceeecccceEEEEee-----CCCCCC Q lcl|NC_021309. 427 IPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLK-----KGATGS 497 (497) Q Consensus 427 ~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~~-----~~a~~~ 497 (497) ||+++++||||++ |.|++|.+++|+++++. .|.+|+++||+..|+||++++|+||++++++ .+.+.+ T Consensus 306 ~p~~~i~fGDfs~--Y~i~~r~~~~i~~~~~~--~~~~d~~~f~a~~r~dG~~~~~~A~~v~~l~~~~~~~~~~~~ 377 (381) T protein:vir:10 306 QEAGKVLTYVKGL--YDGYLAGGINVQKFKET--LALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEDT 377 (381) T ss_pred CCcCcEEEEEccc--EEEEEecccEEEeechh--hhhcCceEEEEEEEEcCEEecCCcEEEEEEeecCCccccccc Confidence 9999999999997 78999999999998864 5999999999999999999999999999986 334444 No 88 >protein:vir:94771 Length: 298 # NCBI annotation: major head protein # Family: family:all:966 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996706;genbank:gi:45597421;genbank:GeneID:2769044 Probab=100.00 E-value=3.8e-55 Score=318.93 Aligned_cols=277 Identities=16% Similarity=0.126 Sum_probs=221.0 Q ss_pred cccccccccchhhhHHHHHHHHhhhhHHhhcceeecCCCceEEEEEcCCCccceecccccccccccccceeeEeeeeeEE Q lcl|NC_021309. 155 STGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVA 234 (497) Q Consensus 155 ~~~~~g~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~wv~Eg~~~~~s~~~f~~i~~~~~kla 234 (497) .+.++|.+||+++..+||+.+++.++|+++|++++++++.++||+.++ .+.++|++|++.+|+++++|+++++.++|++ T Consensus 1 ma~~gG~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~p~~~~-~~~a~~v~Eg~~~~~~~~~f~~v~l~~~k~~ 79 (298) T protein:vir:94 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTM-DSEIDVVAESGKKTHGGVTLAPQTMVPIKVE 79 (298) T ss_pred CeeccccccChhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEec-CcceEEeeCCccccccccceeEEEEeeeEEE Confidence 334567889999999999999999999999999999999999999987 4679999999999999999999999999999 Q ss_pred eeehhhHHHHhh----HHHHHHHHHHHHHHHHHHHHHhhhhcccC--ccc---cccccccccccccccccchhhhhhhHH Q lcl|NC_021309. 235 NALTITDEGLRD----APELFNFVQGRLLEGIQRKEEVQLLAGGG--YPG---VNGLLQRSTGFTASSASSLFGATSATV 305 (497) Q Consensus 235 ~~~~iS~ell~d----~~~l~~~i~~~la~~~~~~~d~~~l~G~G--~~~---p~Gi~~~~~~~~~~~~~~~~~~~~~~~ 305 (497) ++++||+|+|++ ..+|+++|+++|++++++++|.++++|++ ++. +.|+.......+. T Consensus 80 ~~~~iS~ell~~~~~~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~-------------- 145 (298) T protein:vir:94 80 YGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQ-------------- 145 (298) T ss_pred EeeehhHHHhccCCccHHHHHHHHHHHHHHHHHHHHHHHhhcccccCCCccccccccccccccccc-------------- Confidence 999999999952 35799999999999999999999999953 222 1111111110000 Q ss_pred HHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHHH Q lcl|NC_021309. 306 SNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWEL 385 (497) Q Consensus 306 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~ 385 (497) ............+++..++..+...+ .++.+|+||+.+|.. T Consensus 146 --------------------------------------~~~~~~~~~~~~~~i~~~~~~~~~~~-~~~~~~vmn~~~~~~ 186 (298) T protein:vir:94 146 --------------------------------------KVEAPRGIADPNGAIENAVELLTGVD-ADVTGIAINPSFRSA 186 (298) T ss_pred --------------------------------------ccccccccccHHHHHHHHHHhhhhcC-CCccEEEEcHHHHHH Confidence 00000111223345666666665554 445689999999999 Q ss_pred HHHHhhhcCceeccCcccccccccccccccccccceEecCCCCcC------ceEEEeeccceEEEEeecccEEEeecccc Q lcl|NC_021309. 386 LRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLG------TILVGHFAPSVIQTARREGVTMQMTNSNG 459 (497) Q Consensus 386 l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~------~~~~gd~~~~~~~i~~r~~~~i~~~~~~~ 459 (497) |+++||++|||+|++...+. .+.+|||+||++++.+|.+ .+++|||++. +.++.|.++++++.++.. T Consensus 187 l~~lkd~~G~~l~~~~~~~~------~~~tl~G~PV~~~~~v~~~~~~~~~~~~~Gdfs~~-~~~~~~~~~~~~~~~~~~ 259 (298) T protein:vir:94 187 LAKQKDLQGNALFPELKWGA------TPDTINGLPVDVNKTVSDMSLTQRDRAIIGDFANG-FKWGYAKEVPLEVIQYGD 259 (298) T ss_pred HHHhhccCCCeeecCcccCC------CCceecceeeEEecccccccCCCccEEEEeeccce-EEEEEecCceEEEeecCC Confidence 99999999999998754432 3468999999999999853 4789999984 667788999999877532 Q ss_pred ------hhhhcCceEEEEEEeecceeecccceEEEEeeC Q lcl|NC_021309. 460 ------TDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKK 492 (497) Q Consensus 460 ------~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~ 492 (497) ++|++|++.||++.|+||.+.||+||++|+-.+ T Consensus 260 ~d~~~~~~f~~~~v~~r~~~r~~~~~~~~~a~~~l~~~t 298 (298) T protein:vir:94 260 PDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) T ss_pred CcCcchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 479999999999999999999999999998888 No 89 >protein:vir:4856 Length: 293 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049396;genbank:gi:9632424;genbank:GeneID:1258532 Probab=100.00 E-value=4e-55 Score=318.81 Aligned_cols=274 Identities=16% Similarity=0.170 Sum_probs=223.2 Q ss_pred hhhhhccccccccccccchhhhHHHHHHHHhhhhHHhhcceeecCCC--ceEEEEEcCCCccceeccccccccc-ccccc Q lcl|NC_021309. 147 AIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSP--NLSYLTESAAHNNAAAVAEAGTYPF-SSEEF 223 (497) Q Consensus 147 ~~~~~~~~~~~~~g~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~--~~~~p~~~~~~~~a~wv~Eg~~~~~-s~~~f 223 (497) -...+..++++.+|.+||+++..+|++.+++.++|+++|++++++++ .+.+|+.....+.+.|++|++.+|+ ++++| T Consensus 1 ~l~~~~~~t~~~gg~liP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~ 80 (293) T protein:vir:48 1 MLDSKTDHSGSDAGLTIPQDIRTAINTLVRQYDSLQEYVNVENVTTLTGSRVYEKWTDITGLANIDDEAGKIADIDDPKL 80 (293) T ss_pred CceeecccccCcCceEechhHHHHHHHHHHhhhhhhhhceeeeccCCcceEEEEeecCCCcceeeecCCcccccccccce Confidence 22233334444556667778889999999999999999999998765 4557777665678999999999997 57999 Q ss_pred eeeEeeeeeEEeeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccchhhhhh Q lcl|NC_021309. 224 ARVYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATS 302 (497) Q Consensus 224 ~~i~~~~~kla~~~~iS~ell~d~-~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~ 302 (497) ++|++.+||++++++||+|+++|+ .+|++||.++|++++++++|.+|++|.|++.+.+ T Consensus 81 ~~i~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~g~~~~~~~~--------------------- 139 (293) T protein:vir:48 81 SLIKYTIKRYAGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILGVVDKLPTKP--------------------- 139 (293) T ss_pred eEEEEeeeEEEEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHhHHhhccccccccc--------------------- Confidence 999999999999999999999997 6899999999999999999999999987642110 Q ss_pred hHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhH Q lcl|NC_021309. 303 ATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRD 382 (497) Q Consensus 303 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~ 382 (497) ....++++.+++..+...++ ...+|+||+.+ T Consensus 140 ------------------------------------------------~~~~~d~i~~~~~~l~~~~~-~~a~~vmn~~~ 170 (293) T protein:vir:48 140 ------------------------------------------------TLTKWDDIIDLEAKVDPAIK-QTSFFLTNTSG 170 (293) T ss_pred ------------------------------------------------cccCHHHHHHHHHhhhhhhc-CCCEEEEcHHH Confidence 00013456666666666655 45579999999 Q ss_pred HHHHHHHhhhcCceeccCcccccccccccccccccccceEecCC--CCcC-----ceEEEeeccceEEEEeecccEEEee Q lcl|NC_021309. 383 WELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPL--IPLG-----TILVGHFAPSVIQTARREGVTMQMT 455 (497) Q Consensus 383 ~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~--~~~~-----~~~~gd~~~~~~~i~~r~~~~i~~~ 455 (497) |..|+++||++|||||+++.... ...+|+|+||++++. +|.. .++||||++ +|.+++|.+++++++ T Consensus 171 ~~~L~~lkd~~g~~l~~~~~~~~------~~~~l~G~Pv~~~~~~~~~~~~~~~~~~~~gd~~~-~~~~~~~~~~~i~~~ 243 (293) T protein:vir:48 171 FTALKKVKNALGDYLMERDVKSP------TGYSIAGFAVKEISDRWLPNASSGVMPLYFGDLKQ-AVTLFDRQQMSLLST 243 (293) T ss_pred HHHHHHhhccCCceEeecCcCCC------CCceecceeeEEecccccCCccCCceEEEEEeccc-eEEEEEecceEEEEe Confidence 99999999999999999865432 346899999987543 4432 379999998 578999999999999 Q ss_pred cccchhhhcCceEEEEEEeecceeecccceEEEEeeCCCCCC Q lcl|NC_021309. 456 NSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 456 ~~~~~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~~a~~~ 497 (497) ++.+++|++|++.||++.|+|+.+++|+||++++++++++.- T Consensus 244 ~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~~~~~~ 285 (293) T protein:vir:48 244 NIGGGAFETDTTKVRVIDRFDVVATDTEAFVPASFKAIADQK 285 (293) T ss_pred cccchhhhcCeEEEEEEEeeCcEEecccceEEEEeeccccCC Confidence 998889999999999999999999999999999988866544 No 90 >protein:vir:103955 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873992;genbank:gi:118430767;genbank:GeneID:4525449 Probab=100.00 E-value=8.3e-55 Score=317.07 Aligned_cols=303 Identities=16% Similarity=0.153 Sum_probs=232.0 Q ss_pred hhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccccccccchhhhHHHHHHHHhhhhHHhhcce Q lcl|NC_021309. 108 FEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISS 187 (497) Q Consensus 108 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~p~~~~~ii~~~~~~~~l~~~~~~ 187 (497) +++. .+.+ ...+.+................++..+|++||+++..+|++.+++.++|+++|++ T Consensus 1 ~~~~------~~~~-----------~~~~~f~~~~~~~~~~~a~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~~~~~ 63 (324) T protein:vir:10 1 MEQT------QKLK-----------LNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKY 63 (324) T ss_pred CCCc------hHHH-----------HHHHHHHHHhhccceecccceeccCCCcceechhHHHHHHHHHHhhchhhhhcce Confidence 0000 0000 0001111111111111122233445556788999999999999999999999999 Q ss_pred eecCCCceEEEEEcCCCccceecccccccccccccceeeEeeeeeEEeeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHH Q lcl|NC_021309. 188 RPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKE 266 (497) Q Consensus 188 ~~~~~~~~~~p~~~~~~~~a~wv~Eg~~~~~s~~~f~~i~~~~~kla~~~~iS~ell~d~-~~l~~~i~~~la~~~~~~~ 266 (497) ++++++.++||+.++ .+.+.|++|++.+|+++++|+++++.++|++++++||+|+|+|+ +++++||.+.|++++++++ T Consensus 64 ~~~~~~~~~~p~~~~-~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~ai~~~~ 142 (324) T protein:vir:10 64 EPMEGTEKKFTFWAD-KPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKF 142 (324) T ss_pred eeccCCceEEEEEeC-CcceeEeccCccccccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHH Confidence 999999999999876 57899999999999999999999999999999999999999987 6899999999999999999 Q ss_pred HhhhhcccCcc-ccccccccccccccccccchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccc Q lcl|NC_021309. 267 EVQLLAGGGYP-GVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVA 345 (497) Q Consensus 267 d~~~l~G~G~~-~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 345 (497) |.++|+|+|++ .|.|+++.....+.... T Consensus 143 d~a~l~G~g~~~~~~~i~~~~~~~~~~~~--------------------------------------------------- 171 (324) T protein:vir:10 143 DEAGILNQGNNPFGKSIAQSIEKTNKVIK--------------------------------------------------- 171 (324) T ss_pred HHHhhhcCCCCccCccccccccccceecc--------------------------------------------------- Confidence 99999999976 47777664332211110 Q ss_pred cccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHHHHHHHhhhcCceeccCcccccccccccccccccccceEecC Q lcl|NC_021309. 346 GSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTP 425 (497) Q Consensus 346 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~ 425 (497) ....++++..++..+...++ .+++|+||+.+|..|++++|++|+|+|.+. ...+|+|+||++++ T Consensus 172 -----~~~t~~~i~~~~~~l~~~~~-~~~~~v~n~~~~~~L~~l~d~~g~~~~~~~----------~~~~l~G~PV~~~~ 235 (324) T protein:vir:10 172 -----GDFTQDNIIDLEALLEDDEL-EANAFISKTQNRSLLRKIVDPETKERIYDR----------NSDTLDGLPVVNLK 235 (324) T ss_pred -----ccCCHHHHHHHHHhhhhccC-CCCEEEEcHHHHHHHHHhhccCCceeecCC----------CCccccceeEEeec Confidence 01123556666666666554 456899999999999999999999998642 23579999999988 Q ss_pred CCCc--CceEEEeeccceEEEEeecccEEEeecccc------------hhhhcCceEEEEEEeecceeecccceEEEEee Q lcl|NC_021309. 426 LIPL--GTILVGHFAPSVIQTARREGVTMQMTNSNG------------TDFVDGKVTVRAEERLGLLVYRPSAFQLIQLK 491 (497) Q Consensus 426 ~~~~--~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~------------~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~~ 491 (497) .++. +.+++|||++ +.++++.+++|+++++.. ++|++|++.||+++|+|+.|.+|+||++|+.. T Consensus 236 ~~~~~~~~~~~gd~~~--~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~A~~~l~~a 313 (324) T protein:vir:10 236 SSNLKRGELITGDFDK--LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPA 313 (324) T ss_pred CCCCCcceEEEEeccc--EEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEEec Confidence 7664 5589999997 457889999999987642 56999999999999999999999999999887 Q ss_pred CCCCC-C Q lcl|NC_021309. 492 KGATG-S 497 (497) Q Consensus 492 ~~a~~-~ 497 (497) ++.+- + T Consensus 314 ~~~~~~~ 320 (324) T protein:vir:10 314 DKKTDSV 320 (324) T ss_pred cCCCCCC Confidence 66653 3 No 91 >protein:vir:95963 Length: 395 # NCBI annotation: ORF009 # Family: family:all:635 # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239802;genbank:gi:66395459;genbank:GeneID:5132880 Probab=100.00 E-value=1.1e-53 Score=311.02 Aligned_cols=374 Identities=15% Similarity=0.104 Sum_probs=227.3 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021309. 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) Q Consensus 1 ~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~a~~~~~~~~~~~~ 80 (497) |...+++.++...+.+..+++... .++....+++... +.+.++.+..++... T Consensus 1 mt~~~~~~e~~~~~~e~~~~~~~~-~~~~~~~e~~~~~---------------------------~~~~~~~~~~~~~~~ 52 (395) T protein:vir:95 1 MADMKQNNVKLKNYHEHKKQFANL-VQNGASDEEQSKA---------------------------FGAMFDALSNDLQEE 52 (395) T ss_pred ChhHHHHHHHHHHHHHHHHHHHHH-HhhhhhHHHHHHH---------------------------HHHHHHHHHHHHHHH Confidence 777666666555443222222110 0000000000000 001111111111000 Q ss_pred HHHHHHHHHHHHHhhhhhhHHHHhHhhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccccc Q lcl|NC_021309. 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFA 160 (497) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 160 (497) ... +.+... .........+... . ..+.+. +. .....++.+.+| T Consensus 53 ~~~---e~~~~~---------~~~~~~~~r~~~~---l------------~~ee~~-~~---------~~~~~~t~~~gG 95 (395) T protein:vir:95 53 ITA---EINNRV---------VDNGILAKRSQDP---L------------TSEERK-FF---------NDINYDVGYTDE 95 (395) T ss_pred HHH---HHHHHH---------HHHHHHhhcCccc---c------------chHHHH-HH---------HHHhhccCCCCc Confidence 000 000000 0000000000000 0 000000 00 111224455667 Q ss_pred cccchhhhHHHHHHHHhhhhHHhhcceeecCCCceEEEEEcCCCccceeccccccc-ccccccceeeEeeeeeEEeeehh Q lcl|NC_021309. 161 PGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTY-PFSSEEFARVYEQVGKVANALTI 239 (497) Q Consensus 161 ~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~wv~Eg~~~-~~s~~~f~~i~~~~~kla~~~~i 239 (497) ++||+++..+|++.++..++|+++|+++++++. ..+|+.++ .+.+.|++|++.. ++++++|++|++.+|+++++++| T Consensus 96 ~liP~~~~~~Ii~~l~~~s~i~~~~~v~~~~~~-~~i~~~~~-~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~~~i 173 (395) T protein:vir:95 96 KILPETVVERVFDDLQKDHPLLSKINFQNAGIK-TRVIKADP-AGQAVWGKVFGEIKGQLDAAFREENFTQYKLTCFVVL 173 (395) T ss_pred eeccHHHHHHHHHHHHhhhhhhhhceeEecCCc-eEEEEecC-CcceEEeecccccCccccccceeeeeceeeEEEeecc Confidence 778888899999999999999999999998764 68999776 5789999886665 57899999999999999999999 Q ss_pred hHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhcccCcc--ccccccccccccccccccchhhhhhhHHHHHHhhhhhcc Q lcl|NC_021309. 240 TDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYP--GVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTN 316 (497) Q Consensus 240 S~ell~d~-~~l~~~i~~~la~~~~~~~d~~~l~G~G~~--~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 316 (497) |+|||+|+ ++|++||+++|+++++.++|++|++|+|++ +|.||++.....+.......................... T Consensus 174 S~ell~ds~~~ie~~i~~~la~~ia~~~~~a~i~G~G~~~~qP~Gil~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~ 253 (395) T protein:vir:95 174 PDDLSTFGPAWIERFVRTQIQEAISVALESAIINGGGAAKTQPVGLMKDVNTNSGAVTDKASSGTLTFADADTTILELND 253 (395) T ss_pred cHHHHhcchhHHHHHHHHHHHHHHHHHHhhheeeccCCCCcCceeeeecccccccccccccccchhhhhhhHhhHHHHHH Confidence 99999998 589999999999999999999999999986 599999876544333222111111110000000000000 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHHHHHHHhhhcCce Q lcl|NC_021309. 317 GAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQY 396 (497) Q Consensus 317 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~ 396 (497) ...... .........+.....|+||+.++. |..|+| T Consensus 254 -~~~~~~-------------------------------------~~~~~~~~~~~~~~~~~mn~~t~~------~~~g~~ 289 (395) T protein:vir:95 254 -VLKNLS-------------------------------------VDEKGKELKIDGKVALVVNPRDSW------DVQARY 289 (395) T ss_pred -HHHhhc-------------------------------------cccccchhhhcCceEEEEcchhhh------hcCCcc Confidence 000000 000001112233446999998864 667999 Q ss_pred eccCcccccccccccccccc--cccceEecCCCCcCceEEEeeccceEEEEeecccEEEeecccchhhhcCceEEEEEEe Q lcl|NC_021309. 397 MGGNFFGNAYGNPVNGGKNI--WGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEER 474 (497) Q Consensus 397 i~~~~~~~~~~~~~~~~~~l--~G~Pvv~~~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r 474 (497) +|++.. |. +.++ +|+||++++.||+++++||||++ |.|++|.+++|+++++. .|.+|++.||+..| T Consensus 290 ~~~~~~----G~----~~~~lg~g~~v~~~~~~p~~~i~fgdfs~--y~i~~r~~~~i~~~~~~--~~~~d~~~f~~~~r 357 (395) T protein:vir:95 290 TYLTAN----GG----FVTVLPYNVTIITSEFVPEGKLVAFVTDR--YNAVRGGGLTVKKFDQT--LALEDAVLFTAKTF 357 (395) T ss_pred eeccCC----Cc----ceeccCCcceEEEcCCCCCCcEEEEeccc--EEEEEecceEEEeccch--hhhCCcEEEEEEEE Confidence 998732 22 2245 46778999999999999999997 78999999999998764 59999999999999 Q ss_pred ecceeecccceEEEEeeCCCCCC Q lcl|NC_021309. 475 LGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 475 ~~~~v~~~~a~~~l~~~~~a~~~ 497 (497) +|++|++|+||++|+++..-..- T Consensus 358 ~dg~~~~~~A~~~l~i~~~~~~~ 380 (395) T protein:vir:95 358 AYGQPDDNKASAVYDLKVASAPR 380 (395) T ss_pred ECCEEeccccEEEEEeeccCCCC Confidence 99999999999999987322211 No 92 >protein:vir:95763 Length: 297 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950590;genbank:gi:119953785;genbank:GeneID:5076833 Probab=100.00 E-value=4.6e-55 Score=318.48 Aligned_cols=281 Identities=15% Similarity=0.150 Sum_probs=231.2 Q ss_pred hhhhhhhhhccccccccccccchhhhHHHHHHHHhhhhHHhhcceeecCCCc-eEEEEEcCCCccceecccccccccccc Q lcl|NC_021309. 143 TAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPN-LSYLTESAAHNNAAAVAEAGTYPFSSE 221 (497) Q Consensus 143 ~~~~~~~~~~~~~~~~~g~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~-~~~p~~~~~~~~a~wv~Eg~~~~~s~~ 221 (497) ............+++++|++||+++..+|++.+++.++|+++|++++++++. ..+|+.++ .+.++|++|++.+|++++ T Consensus 1 m~~~~~~~~~~~~t~~~~~lvP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~-~~~a~~v~Eg~~~~~~~~ 79 (297) T protein:vir:95 1 MTVQTFNPENVLVSQKKDGTLHKEFTDIIMKEVAQNSLVMQLGQYQEMEGEQEKTVYVQTD-GISAYWVNETEKIKTDKP 79 (297) T ss_pred CCccccccccccccCCCcceechhHHHHHHHHHHhhchhhhhcceeecCCCccEEEEEEcC-CceeEEeecCcccccccc Confidence 1112222333445667778899999999999999999999999999998764 56777665 568999999999999999 Q ss_pred cceeeEeeeeeEEeeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccchhhh Q lcl|NC_021309. 222 EFARVYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGA 300 (497) Q Consensus 222 ~f~~i~~~~~kla~~~~iS~ell~d~-~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~ 300 (497) +|++|++.++|++++++||+|+|+|+ +++++||+++|+++++.++|.++|+|+|++.|.|+++..+........ T Consensus 80 ~f~~v~l~~~k~~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~G~g~~~~~gi~~~~~~~~~~~~~----- 154 (297) T protein:vir:95 80 EVVPVTLKAHKLGIILVTSREALNYTWKKFFEDMKPQIVEAFYKKIDEAGLLGHDTPFANSVAKAAKDANKVIGG----- 154 (297) T ss_pred ceeEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhcccCCcccccccccccccceeccc----- Confidence 99999999999999999999999987 689999999999999999999999999999999998765432211110 Q ss_pred hhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEech Q lcl|NC_021309. 301 TSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNP 380 (497) Q Consensus 301 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~ 380 (497) ...++++..++..+...++ ..++|+||+ T Consensus 155 ---------------------------------------------------~~t~~~i~~~~~~l~~~~~-~~~~~v~~~ 182 (297) T protein:vir:95 155 ---------------------------------------------------PINYDNILKLQDALYDADV-EPNAFVSKI 182 (297) T ss_pred ---------------------------------------------------ccCHHHHHHHHHHhhhccC-CcCEEEEcH Confidence 0113455566666665554 456899999 Q ss_pred hHHHHHHHHhhhcCceeccCcccccccccccccccccccceEecCC--CCcCceEEEeeccceEEEEeecccEEEeeccc Q lcl|NC_021309. 381 RDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPL--IPLGTILVGHFAPSVIQTARREGVTMQMTNSN 458 (497) Q Consensus 381 ~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~--~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~ 458 (497) .+|..|++++|++|+|+|++. ..+|+|+||+.+.. ++.+++++|||++ +.++++.+++++++++. T Consensus 183 ~~~~~L~~l~d~~G~~i~~~~-----------~~~l~G~Pv~~~~~~~~~~~~~~~gd~s~--~~~~~~~~~~i~~~~~~ 249 (297) T protein:vir:95 183 QNRSALREARDGNKVSIYDKA-----------ANTIDGITTVDLKSARFEKGDLLAGDFDN--LIYGVPYNITYKISEEG 249 (297) T ss_pred HHHHHHHHhhccCCceeecCC-----------CCcccceeeEeecCCCCCCceEEEEeccc--EEEEEecCeEEEEeecc Confidence 999999999999999999753 24699999997665 5577899999997 55788999999998765 Q ss_pred c------------hhhhcCceEEEEEEeecceeecccceEEEEeeCCC Q lcl|NC_021309. 459 G------------TDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGA 494 (497) Q Consensus 459 ~------------~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~~a 494 (497) . ++|++|++.||+++|+||+|++|+||++|+..++. T Consensus 250 ~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~at~~ 297 (297) T protein:vir:95 250 QISTITNADGTPINLFEQEMIAIRATMDIAVMITKTDAFAKLTPAERV 297 (297) T ss_pred ccccccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEeecCCC Confidence 3 56999999999999999999999999999988888 No 93 >protein:vir:96223 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239571;genbank:gi:66395304;genbank:GeneID:5132771 Probab=100.00 E-value=1.1e-54 Score=316.34 Aligned_cols=303 Identities=16% Similarity=0.146 Sum_probs=228.6 Q ss_pred hhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccccccccchhhhHHHHHHHHhhhhHHhhcce Q lcl|NC_021309. 108 FEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISS 187 (497) Q Consensus 108 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~p~~~~~ii~~~~~~~~l~~~~~~ 187 (497) +++. .+.+ ...+.+.................+..+|++||+++..+|++.+++.++|++++++ T Consensus 1 ~~~~------~~~~-----------~~~~~f~~~~~~~~~~~a~~~~~~~~~~~lip~~~~~~ii~~~~~~s~l~~l~~~ 63 (324) T protein:vir:96 1 MEQT------QKLK-----------LNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKY 63 (324) T ss_pred CCcc------hhhh-----------HHHHHHHHhhhhhhhcccccccccCCCcceechhHHHHHHHHHHhhchhhhhcce Confidence 0000 0000 0001111111111111111222334566788889999999999999999999999 Q ss_pred eecCCCceEEEEEcCCCccceecccccccccccccceeeEeeeeeEEeeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHH Q lcl|NC_021309. 188 RPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKE 266 (497) Q Consensus 188 ~~~~~~~~~~p~~~~~~~~a~wv~Eg~~~~~s~~~f~~i~~~~~kla~~~~iS~ell~d~-~~l~~~i~~~la~~~~~~~ 266 (497) ++++++.++||++++ .+.+.|++|++.+|+++++|+++++.++|++++++||+|||+|+ ++++++|.++|++++++++ T Consensus 64 ~~~~~~~~~~p~~~~-~~~a~~v~Eg~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~l~~aia~~~ 142 (324) T protein:vir:96 64 EPMEGTEKKFTFWAD-KPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKF 142 (324) T ss_pred eeccCCceEEEEEec-CcceeeecCCccccccccceeEEEEEeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHH Confidence 999999999999987 46899999999999999999999999999999999999999987 7899999999999999999 Q ss_pred HhhhhcccCcc-ccccccccccccccccccchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccc Q lcl|NC_021309. 267 EVQLLAGGGYP-GVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVA 345 (497) Q Consensus 267 d~~~l~G~G~~-~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 345 (497) |.++|+|+|++ .|.|+.+.....+.... T Consensus 143 d~~~l~G~g~~~~~~~~~~~~~~~~~~~~--------------------------------------------------- 171 (324) T protein:vir:96 143 DEAGILNQGNNPFGKSIAQSIKKTNKVIK--------------------------------------------------- 171 (324) T ss_pred HHHhhhcCCCCCcCccccccccccceecc--------------------------------------------------- Confidence 99999999976 46666554322111100 Q ss_pred cccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHHHHHHHhhhcCceeccCcccccccccccccccccccceEecC Q lcl|NC_021309. 346 GSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTP 425 (497) Q Consensus 346 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~ 425 (497) ....++++..++..+...+ .++++|+||+.+|..|+++||++|+|++++. ...+|+|+||++++ T Consensus 172 -----~~~~~~~i~~~~~~i~~~~-~~~~~~i~n~~~~~~L~~lkd~~G~~~~~~~----------~~~~l~G~PV~~~~ 235 (324) T protein:vir:96 172 -----GDFTQDNIIDLEALLEDDE-LEANAFISKTQNRSLLRKIVDPETKERIYDR----------NSDSLDGLPVVNLK 235 (324) T ss_pred -----cccchHHHHHHHHhhhhcc-CCCCEEEEcHHHHHHHHHhhCCCCCeeecCC----------CCCcccceeeEeec Confidence 0112345556666665554 4566899999999999999999999998642 23579999999977 Q ss_pred CCC--cCceEEEeeccceEEEEeecccEEEeecccc------------hhhhcCceEEEEEEeecceeecccceEEEEee Q lcl|NC_021309. 426 LIP--LGTILVGHFAPSVIQTARREGVTMQMTNSNG------------TDFVDGKVTVRAEERLGLLVYRPSAFQLIQLK 491 (497) Q Consensus 426 ~~~--~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~------------~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~~ 491 (497) ..+ .+.+++|||++ +.++++.+++|+++++.. ++|++|++.||+++|+||.|++|+||++|+.. T Consensus 236 ~~~~~~~~~~~gd~s~--~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~n~v~~r~~~r~d~~v~~~~a~~~l~~a 313 (324) T protein:vir:96 236 SSNLKRGELITGDFDK--LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPA 313 (324) T ss_pred CCCCCcceEEEEecce--EEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEecc Confidence 655 45689999997 557889999999987643 57999999999999999999999999998865 Q ss_pred CCCCCC Q lcl|NC_021309. 492 KGATGS 497 (497) Q Consensus 492 ~~a~~~ 497 (497) ...+-. T Consensus 314 ~~~~~~ 319 (324) T protein:vir:96 314 DKRTDS 319 (324) T ss_pred cccCCC Confidence 544333 No 94 >protein:vir:99920 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655524;genbank:gi:109392294;genbank:GeneID:4157089 Probab=100.00 E-value=4.6e-55 Score=318.45 Aligned_cols=282 Identities=15% Similarity=0.076 Sum_probs=217.8 Q ss_pred hccccccccccccchhhhHHHHHHHHhhhhHHhhcceeecCCCceEEEEEcCCCccceecccccccccccccceeeEeee Q lcl|NC_021309. 151 NPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQV 230 (497) Q Consensus 151 ~~~~~~~~~g~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~wv~Eg~~~~~s~~~f~~i~~~~ 230 (497) +. +.++++|++||+++..+|++.+++.++|+++|++++++++..+||+.++. +.++||+|++.+|+++++|+++++.+ T Consensus 1 Ma-t~tt~~g~~vP~~~~~~ii~~~~~~s~l~~~~~~i~~~~~~~~~p~~~~~-~~a~wv~Eg~~~~~~~~~f~~v~l~~ 78 (311) T protein:vir:99 1 MA-TFGTGNLKNLPRNIADGMVKDVVQGSTVAVLSARKPQRFGNEDIITFNGR-PKAEFVGEGQQKSSTTGEFDFVTSTP 78 (311) T ss_pred Cc-eecCCCceeccHHHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEeCC-ceeEEeecCcccccccceeeEEEEee Confidence 33 45566778888889999999999999999999999999988999999874 68999999999999999999999999 Q ss_pred eeEEeeehhhHHHHh---h-HHHHHHHHHHHHHHHHHHHHHhhhhcccCcccccccccccccc---ccccccchhhhhhh Q lcl|NC_021309. 231 GKVANALTITDEGLR---D-APELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGF---TASSASSLFGATSA 303 (497) Q Consensus 231 ~kla~~~~iS~ell~---d-~~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~---~~~~~~~~~~~~~~ 303 (497) ||++++++||+|||+ | ..+|++||+++|++++++++|.++|+|+|++.+.++....... +....... T Consensus 79 ~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~g~~~g~~~~g~~~~~~~~~~~~~~~~------ 152 (311) T protein:vir:99 79 KKAQVTMRFNEEVQWADEDYQLGVLQTLSEAGAEALARALDLGLYHRINPLTGTVIPGWSNYLGAASKRVELTA------ 152 (311) T ss_pred EEEEEeehhhHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccCccccccccccccccceeeccc------ Confidence 999999999999994 4 3689999999999999999999999999977655443322211 11110000 Q ss_pred HHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhh-hccCCceEEechhH Q lcl|NC_021309. 304 TVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLT-LFQTPNAVVMNPRD 382 (497) Q Consensus 304 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~n~~~ 382 (497) ........++..++..+... .....++|+||+.+ T Consensus 153 ---------------------------------------------~~~~~~~~~i~~~~~~~~~~~~~~~~~~~vmn~~~ 187 (311) T protein:vir:99 153 ---------------------------------------------DTIANPDLAIEAAVGLLVANGHPTPVNGLALHPSI 187 (311) T ss_pred ---------------------------------------------cccchhHHHHHHHHHHHhhhccCCCccEEEEcHHH Confidence 00000111122222222211 23345679999999 Q ss_pred HHHHHHHhhhcCceeccCcccccccccccccccccccceEecCCCCcC----------------ceEEEeeccceEEEEe Q lcl|NC_021309. 383 WELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLG----------------TILVGHFAPSVIQTAR 446 (497) Q Consensus 383 ~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~----------------~~~~gd~~~~~~~i~~ 446 (497) |..|+++||++|||||++...+. ...+|+|+||++++.+|.+ .+++|||++ .+.+.. T Consensus 188 ~~~L~~lkd~~G~~l~~~~~~~~------~~~~l~G~Pv~~s~~i~~~~~~~~~~~~~~~~~~~~~~~Gdf~~-~~~~~~ 260 (311) T protein:vir:99 188 AWGLSTARYTDGRKKFPELGLGI------GVSSFEGIDASVSDTVNGGDEADPDDEDLDAARAVRGIVGDFAN-GIHWGV 260 (311) T ss_pred HHHHHhhhccCCCeeecCcccCC------CCceecceeeEeecccccccccccccchhhccCcceEEEeeccc-cEEEEE Confidence 99999999999999998765442 2458999999999988732 257899998 467888 Q ss_pred ecccEEEeecccc-----hhhhcCceEEEEEEeecceeecccceEEEEeeCC Q lcl|NC_021309. 447 REGVTMQMTNSNG-----TDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKG 493 (497) Q Consensus 447 r~~~~i~~~~~~~-----~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~~ 493 (497) |.+++++++++.. ++|++||+.||++.|+||.|+|| +|++++-.++ T Consensus 261 ~~~~~~~~~~~~~~~~~~~~~~~d~~~~r~~~r~d~~v~~~-~~v~~~~~~A 311 (311) T protein:vir:99 261 QRDIPVELIKYGDPDGQGDLKRHNQIALRLEIVYGWYVFTD-RFVVIENAVA 311 (311) T ss_pred ecCceEEEeecCCCCcchhhhhcCcEEEEEEEeecceecCh-hHeeeecccC Confidence 9999999887643 56999999999999999999997 5666665555 No 95 >protein:vir:101291 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908831;genbank:gi:118725095;genbank:GeneID:4555862 Probab=100.00 E-value=2.6e-54 Score=314.39 Aligned_cols=359 Identities=12% Similarity=0.063 Sum_probs=222.6 Q ss_pred HHHHHHHHHhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhh--HHHHhHhhhhhhhhhhhh Q lcl|NC_021309. 40 EPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEVEVRNLKQIRKHLARAVIMN--PELKNATSFEKGTKFDVS 117 (497) Q Consensus 40 ~~~~~~~~~~~~~~e~~~e~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~ 117 (497) |. ++......+ ...++...++..... .... ........ ......... .+.+..... .+. T Consensus 1 m~-ik~~~~~~~---~~~e~~~~~~~~~~~--~~~~---~~~~~~~~----~~~~~~~~~~~~e~~~~~~~---~~~--- 61 (381) T protein:vir:10 1 MT-INLSETFAN---AKNEFINAVNNGEPQ--ERQN---ELYGDMIN----QLFEETKLQAKAEAERVSSL---PKS--- 61 (381) T ss_pred Cc-hhhHHHHHH---HHHHHHHHHhhhhhh--HHHH---HHHHHHHH----hhhhhHHHHHHHHHHHHHHh---ccC--- Confidence 11 111111111 111111111110000 0000 00000000 000000000 000000000 000 Q ss_pred HHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccccccccchhhhHHHHHHHHhhhhHHhhcceeecCCCceEE Q lcl|NC_021309. 118 FNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSY 197 (497) Q Consensus 118 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~ 197 (497) ......+.+ ........++++.+|++||+++..+|++.+...++|+++|++.+++++ ..+ T Consensus 62 ---------~~~lt~~e~----------~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~~~-~~i 121 (381) T protein:vir:10 62 ---------AQSLSANQR----------SFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGLR-LKF 121 (381) T ss_pred ---------cccccHHHH----------HHHHHHhcccCCCCceecCHHHHHHHHHHHHhhccceeheeeEecCcc-eEE Confidence 000000000 011122334455667778888999999999999999999999998765 689 Q ss_pred EEEcCCCccceecccccccc-cccccceeeEeeeeeEEeeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhcccC Q lcl|NC_021309. 198 LTESAAHNNAAAVAEAGTYP-FSSEEFARVYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGG 275 (497) Q Consensus 198 p~~~~~~~~a~wv~Eg~~~~-~s~~~f~~i~~~~~kla~~~~iS~ell~d~-~~l~~~i~~~la~~~~~~~d~~~l~G~G 275 (497) |+.++ .+.++|++|++..+ +++++|++|++.+|||+++++||++||+|+ .+|++||+.+|+++++.++|.+|++|+| T Consensus 122 ~~~~~-~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~~~is~elL~Ds~~~ie~~i~~~la~~~a~~~~~a~i~G~G 200 (381) T protein:vir:10 122 LKSET-SGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTG 200 (381) T ss_pred EEecC-CcceeeecccccccccccccceeeeecceeEEeechhhHHHhhcCHHHHHHHHHHHHHHHHHHHhhheeEeccC Confidence 99876 57899999988765 568999999999999999999999999997 4899999999999999999999999999 Q ss_pred ccccccccccccccccccccchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhh Q lcl|NC_021309. 276 YPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIA 355 (497) Q Consensus 276 ~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 355 (497) +++|.||++..+................... ........+ T Consensus 201 ~~qP~Gil~~~~~~~~~~~g~~~~~~~~~t~----------------------------------------t~~~~~~~~ 240 (381) T protein:vir:10 201 KDQPIGLNRQVQKGVSVTEGAYPEKEEQGTL----------------------------------------TFANPRATV 240 (381) T ss_pred CCCceeeeeccCccccccccccccccccccc----------------------------------------ccccchhhH Confidence 9999999986543221111100000000000 000000001 Q ss_pred hHHHHHHHhhh------hhhccCCceEEechhHHHHHHHHh---hhcCceeccCcccccccccccccccccccceEecCC Q lcl|NC_021309. 356 ENVFDAFVDIQ------LTLFQTPNAVVMNPRDWELLRLTK---DANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPL 426 (497) Q Consensus 356 ~~~~~~~~~~~------~~~~~~~~~~~~n~~~~~~l~~lk---d~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~ 426 (497) +.+...+..+. ...+.....|+||+.++..++.++ +++|+|+|..+ +|.||+.++. T Consensus 241 ~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l~~~~~~~~~~G~~v~~l~---------------~g~~vv~s~~ 305 (381) T protein:vir:10 241 NELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTALP---------------FNLNVIESTV 305 (381) T ss_pred HHHHHHHHhhccccccccccccCceEEEEccccHHhhccccccCCCCCceeecCC---------------CCceEEecCC Confidence 11111111111 112233446999999999988766 67798887532 3678999999 Q ss_pred CCcCceEEEeeccceEEEEeecccEEEeecccchhhhcCceEEEEEEeecceeecccceEEEEeeCCC--CCC Q lcl|NC_021309. 427 IPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGA--TGS 497 (497) Q Consensus 427 ~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~~a--~~~ 497 (497) ||+++++||||++ |.|++|.+++|+.+++. .|.+|+++||+..|+||++++|+||++++++... +.+ T Consensus 306 ~p~~~iifgDfs~--Y~i~~r~~~~i~~~~~~--~~~~d~~~f~a~~r~dg~~~~~~A~~v~~l~~~~~~~~~ 374 (381) T protein:vir:10 306 QEAGKVLTYVKGL--YDGYLAGGINVQKFKET--LALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPAL 374 (381) T ss_pred CCcCcEEEEeccc--EEEEEecccEEEeechh--HhhcCCeEEEEEEEEcCEEecCceEEEEEEEecCCCcCc Confidence 9999999999997 88999999999998865 5999999999999999999999999998877643 222 No 96 >protein:vir:9509 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835556;genbank:gi:30043951;genbank:GeneID:1260537 Probab=100.00 E-value=2.6e-54 Score=314.39 Aligned_cols=359 Identities=12% Similarity=0.063 Sum_probs=222.6 Q ss_pred HHHHHHHHHhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhh--HHHHhHhhhhhhhhhhhh Q lcl|NC_021309. 40 EPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEVEVRNLKQIRKHLARAVIMN--PELKNATSFEKGTKFDVS 117 (497) Q Consensus 40 ~~~~~~~~~~~~~~e~~~e~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~ 117 (497) |. ++......+ ...++...++..... .... ........ ......... .+.+..... .+. T Consensus 1 m~-ik~~~~~~~---~~~e~~~~~~~~~~~--~~~~---~~~~~~~~----~~~~~~~~~~~~e~~~~~~~---~~~--- 61 (381) T protein:vir:95 1 MT-INLSETFAN---AKNEFINAVNNGEPQ--ERQN---ELYGDMIN----QLFEETKLQAKAEAERVSSL---PKS--- 61 (381) T ss_pred Cc-hhhHHHHHH---HHHHHHHHHhhhhhh--HHHH---HHHHHHHH----hhhhhHHHHHHHHHHHHHHh---ccC--- Confidence 11 111111111 111111111110000 0000 00000000 000000000 000000000 000 Q ss_pred HHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccccccccchhhhHHHHHHHHhhhhHHhhcceeecCCCceEE Q lcl|NC_021309. 118 FNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSY 197 (497) Q Consensus 118 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~ 197 (497) ......+.+ ........++++.+|++||+++..+|++.+...++|+++|++.+++++ ..+ T Consensus 62 ---------~~~lt~~e~----------~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~~~-~~i 121 (381) T protein:vir:95 62 ---------AQSLSANQR----------SFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGLR-LKF 121 (381) T ss_pred ---------cccccHHHH----------HHHHHHhcccCCCCceecCHHHHHHHHHHHHhhccceeheeeEecCcc-eEE Confidence 000000000 011122334455667778888999999999999999999999998765 689 Q ss_pred EEEcCCCccceecccccccc-cccccceeeEeeeeeEEeeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhcccC Q lcl|NC_021309. 198 LTESAAHNNAAAVAEAGTYP-FSSEEFARVYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGG 275 (497) Q Consensus 198 p~~~~~~~~a~wv~Eg~~~~-~s~~~f~~i~~~~~kla~~~~iS~ell~d~-~~l~~~i~~~la~~~~~~~d~~~l~G~G 275 (497) |+.++ .+.++|++|++..+ +++++|++|++.+|||+++++||++||+|+ .+|++||+.+|+++++.++|.+|++|+| T Consensus 122 ~~~~~-~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~~~is~elL~Ds~~~ie~~i~~~la~~~a~~~~~a~i~G~G 200 (381) T protein:vir:95 122 LKSET-SGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTG 200 (381) T ss_pred EEecC-CcceeeecccccccccccccceeeeecceeEEeechhhHHHhhcCHHHHHHHHHHHHHHHHHHHhhheeEeccC Confidence 99876 57899999988765 568999999999999999999999999997 4899999999999999999999999999 Q ss_pred ccccccccccccccccccccchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhh Q lcl|NC_021309. 276 YPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIA 355 (497) Q Consensus 276 ~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 355 (497) +++|.||++..+................... ........+ T Consensus 201 ~~qP~Gil~~~~~~~~~~~g~~~~~~~~~t~----------------------------------------t~~~~~~~~ 240 (381) T protein:vir:95 201 KDQPIGLNRQVQKGVSVTEGAYPEKEEQGTL----------------------------------------TFANPRATV 240 (381) T ss_pred CCCceeeeeccCccccccccccccccccccc----------------------------------------ccccchhhH Confidence 9999999986543221111100000000000 000000001 Q ss_pred hHHHHHHHhhh------hhhccCCceEEechhHHHHHHHHh---hhcCceeccCcccccccccccccccccccceEecCC Q lcl|NC_021309. 356 ENVFDAFVDIQ------LTLFQTPNAVVMNPRDWELLRLTK---DANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPL 426 (497) Q Consensus 356 ~~~~~~~~~~~------~~~~~~~~~~~~n~~~~~~l~~lk---d~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~ 426 (497) +.+...+..+. ...+.....|+||+.++..++.++ +++|+|+|..+ +|.||+.++. T Consensus 241 ~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l~~~~~~~~~~G~~v~~l~---------------~g~~vv~s~~ 305 (381) T protein:vir:95 241 NELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTALP---------------FNLNVIESTV 305 (381) T ss_pred HHHHHHHHhhccccccccccccCceEEEEccccHHhhccccccCCCCCceeecCC---------------CCceEEecCC Confidence 11111111111 112233446999999999988766 67798887532 3678999999 Q ss_pred CCcCceEEEeeccceEEEEeecccEEEeecccchhhhcCceEEEEEEeecceeecccceEEEEeeCCC--CCC Q lcl|NC_021309. 427 IPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGA--TGS 497 (497) Q Consensus 427 ~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~~a--~~~ 497 (497) ||+++++||||++ |.|++|.+++|+.+++. .|.+|+++||+..|+||++++|+||++++++... +.+ T Consensus 306 ~p~~~iifgDfs~--Y~i~~r~~~~i~~~~~~--~~~~d~~~f~a~~r~dg~~~~~~A~~v~~l~~~~~~~~~ 374 (381) T protein:vir:95 306 QEAGKVLTYVKGL--YDGYLAGGINVQKFKET--LALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPAL 374 (381) T ss_pred CCcCcEEEEeccc--EEEEEecccEEEeechh--HhhcCCeEEEEEEEEcCEEecCceEEEEEEEecCCCcCc Confidence 9999999999997 88999999999998865 5999999999999999999999999998877643 222 No 97 >protein:vir:2504 Length: 305 # NCBI annotation: major capsid subunit gp9 # Family: family:all:507 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569745;genbank:gi:18496895;genbank:GeneID:932268 Probab=100.00 E-value=1.6e-54 Score=315.46 Aligned_cols=284 Identities=12% Similarity=0.045 Sum_probs=221.4 Q ss_pred hccccccccccccchhhhHHHHHHHHhhhhHHhhcceeecCCCceEEEEEcCCCccceecccccc-----ccccccccee Q lcl|NC_021309. 151 NPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGT-----YPFSSEEFAR 225 (497) Q Consensus 151 ~~~~~~~~~g~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~wv~Eg~~-----~~~s~~~f~~ 225 (497) ++.++++.+|++||+++..+|++.+++.++|++++++++++++.++||+.++ .+.+.|++|++. +|.++++|++ T Consensus 1 ma~~t~~~gg~liP~~~~~~Ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~-~~~a~wv~E~~~~~~~~~~~s~~~f~~ 79 (305) T protein:vir:25 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLAT-LPEADWVGESATDPKGVKPTSKVTWAN 79 (305) T ss_pred CCCccCCccceecCHHHHHHHHHHHHhhchhhhhcceeeccCCcEEEEEEeC-CcceEEeecccccccccccccccceee Confidence 6777777788888889999999999999999999999999999999999987 468999999986 4567899999 Q ss_pred eEeeeeeEEeeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccchhhhhhhH Q lcl|NC_021309. 226 VYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSAT 304 (497) Q Consensus 226 i~~~~~kla~~~~iS~ell~d~-~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~ 304 (497) |++.+||++++++||+||++|+ +++++||+++|++++++++|.+|++|+|++.+.+.....+......... T Consensus 80 i~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~~~a~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~-------- 151 (305) T protein:vir:25 80 RTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAV-------- 151 (305) T ss_pred EEeeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHhhhheeccCCCCCccccccccccccccccc-------- Confidence 9999999999999999999987 6899999999999999999999999998754332221111100000000 Q ss_pred HHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHH Q lcl|NC_021309. 305 VSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWE 384 (497) Q Consensus 305 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~ 384 (497) ..........+..+.+..+...+.. .+...+.|+||+.+|. T Consensus 152 --------------------------------------~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~v~~~~~~~ 192 (305) T protein:vir:25 152 --------------------------------------EVVGGVANESDIVGATNRAAKAVAS-AGWAPDTLLSSLALRY 192 (305) T ss_pred --------------------------------------cccccchhhhHHHHHHHHHHHhhhh-cccccceeEecHHHHH Confidence 0000001111122333333333322 2344556999999999 Q ss_pred HHHHHhhhcCceeccCcccccccccccccccccccceEecCCCCcC----ceEEEeeccceEEEEeecccEEEeeccc-- Q lcl|NC_021309. 385 LLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLG----TILVGHFAPSVIQTARREGVTMQMTNSN-- 458 (497) Q Consensus 385 ~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~----~~~~gd~~~~~~~i~~r~~~~i~~~~~~-- 458 (497) .|+++||++|||+|++ .+|+|+||++++.+|.+ .+++|||++ |.++++.+++++++++. T Consensus 193 ~l~~lkd~~G~~i~~~-------------~~l~G~Pv~~~~~~~~~~~~~~~~~gd~s~--~~i~~~~~~~i~~~~~~~~ 257 (305) T protein:vir:25 193 EVANIRDANGNPVFRD-------------DSFAGFRTFFNRNGAWDADAAIEVIADSSR--VKIGVRQDITVKFLDQATL 257 (305) T ss_pred HHHHhhccCCceeecC-------------CcccccceEEcCccCCCCCccEEEEEecce--EEEEEecCeEEEEeeeeee Confidence 9999999999999975 26999999999998753 589999997 66889999999887753 Q ss_pred ------chhhhcCceEEEEEEeecceeecccceEEEEeeCCCCCC Q lcl|NC_021309. 459 ------GTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 459 ------~~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~~a~~~ 497 (497) .++|++|++.+|++.|+||.|+||+||++++....+.=+ T Consensus 258 ~~~~~~~~~~~~~~~~~R~~~r~~~~v~~p~a~v~~~~~~~~~~~ 302 (305) T protein:vir:25 258 GTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVVA 302 (305) T ss_pred ecCCceeeeeecCcEEEEEEEeecceeeCcccEEEEccccccccC Confidence 246999999999999999999999999999986554322 No 98 >protein:vir:78350 Length: 383 # NCBI annotation: Cps # Family: family:all:635 # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468644;genbank:gi:157325222;genbank:GeneID:5601696 Probab=100.00 E-value=1.1e-52 Score=305.45 Aligned_cols=372 Identities=13% Similarity=0.030 Sum_probs=212.1 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021309. 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) Q Consensus 1 ~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~a~~~~~~~~~~~~ 80 (497) |+ .+++++.+++.++.+.+. +.++... ..++..+.++ +..+.+..++... T Consensus 1 M~--~kl~~~~~~~~e~~~~l~------------------~~~~~~~-------~~~~~~~~~~---~~~~~~~~~~~~~ 50 (383) T protein:vir:78 1 MT--IKLKNNLANYEEKRTAFV------------------NAVKNED-------TQEIQNKAYV---EMVDAMAADIMEQ 50 (383) T ss_pred Cc--hhHHHHHHHHHHHHHHHH------------------HHHhccC-------hHHHHHHHHH---HHHHHHHHHHHHH Confidence 43 222222222211111111 1111000 0000000011 1111111111000 Q ss_pred HHHHHHHHHHHHHhhhhhhHHHHhHhhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccccc Q lcl|NC_021309. 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFA 160 (497) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 160 (497) . +... +. ..........+.. ....+.++.+ .....++++.+| T Consensus 51 ~-------~~~~-~~-----~~~~~~~~~~g~~---------------~lt~~e~~~~----------~~~~~~~~~~gg 92 (383) T protein:vir:78 51 A-------KKEA-RQ-----EADAYISASRTDK---------------NITNEEIKFF----------NDINKEVGYKEE 92 (383) T ss_pred H-------HHHH-HH-----HHHHHHHhcCChh---------------hhhHHHHHHH----------HHHhccCCCCCc Confidence 0 0000 00 0000000000000 0000000000 112234556677 Q ss_pred cccchhhhHHHHHHHHhhhhHHhhcceeecCCCceEEEEEcCCCccceecccccccc-cccccceeeEeeeeeEEeeehh Q lcl|NC_021309. 161 PGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYP-FSSEEFARVYEQVGKVANALTI 239 (497) Q Consensus 161 ~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~wv~Eg~~~~-~s~~~f~~i~~~~~kla~~~~i 239 (497) ++||+++...|++.+...++|+++|++.+++++ .++|+.++ .+.+.|++|++..+ +++++|+++++.+|+++++++| T Consensus 93 ~lvP~~~~~~I~~~l~~~s~l~~~~~v~~~~~~-~~i~~~~~-~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~i~i 170 (383) T protein:vir:78 93 TLLPQTVVDEIFEDLTTEHPFLASIGMRTTGLR-TKFLKSET-SGVAVWGKIFGEIKGQLDATFSDEESIQNKLTAFVVV 170 (383) T ss_pred cccCHHHHHHHHHHHHhhccceeeeeeEecCCc-eEEEEEcC-CcceEEeecccccccccCcceeeEeecceeeEeeccc Confidence 788888999999999999999999999998776 68999876 46899999987764 6799999999999999999999 Q ss_pred hHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccchhhhhhhHHHHHHhhhhhcchh Q lcl|NC_021309. 240 TDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGA 318 (497) Q Consensus 240 S~ell~d~-~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 318 (497) |+|||+|+ .+|++||.++++++++.++|.+|++|+|+++|.||++.....+.................... . T Consensus 171 s~ell~Ds~~~ie~~i~~~l~~~~a~~~~~a~i~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~ 243 (383) T protein:vir:78 171 PKDLEKFGPAWVKRFVVTQIEEAFAVALESAYIVGDGNDKPIGLNRKVGKGSTVVDGVYAEKAATGTLTFAN-------P 243 (383) T ss_pred hHHHhhccHHHHHHHHHHHHHHHHHHHHhhheEeccCCCCceeeeeccCCcccccccccccccccchhhhhh-------h Confidence 99999997 589999999999999999999999999999999999754322211110000000000000000 0 Q ss_pred hhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHHHH---HHHhhhcCc Q lcl|NC_021309. 319 FVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELL---RLTKDANGQ 395 (497) Q Consensus 319 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l---~~lkd~~G~ 395 (497) ......+...... .. .+.............|++|+.++..+ ...++++|+ T Consensus 244 ~~~~~~l~~~~~~---------~~------------------~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~~G~ 296 (383) T protein:vir:78 244 KTTVNELTDVYKY---------HS------------------VKENGHPLNVAGKVTLLVNPTDAWDVKKQYTSLNANGV 296 (383) T ss_pred HHHHHHHHHHHhc---------cc------------------hhcccchhhhcCceEEEEcCcchhhhccchhccCCCCc Confidence 0000000000000 00 00000000001112477776653322 122233343 Q ss_pred eeccCcccccccccccccccccccc--eEecCCCCcCceEEEeeccceEEEEeecccEEEeecccchhhhcCceEEEEEE Q lcl|NC_021309. 396 YMGGNFFGNAYGNPVNGGKNIWGVP--VVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEE 473 (497) Q Consensus 396 ~i~~~~~~~~~~~~~~~~~~l~G~P--vv~~~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~ 473 (497) | .+++|+| |+.++.||+++++||||++ |.|++|.+++|+.+++. .|.+|++.||+.. T Consensus 297 ~-----------------~t~l~~~~~iv~s~~~p~~~iifgdfs~--Y~i~~r~~~~i~~~~~~--~f~~d~~~f~~~~ 355 (383) T protein:vir:78 297 Y-----------------VTALPFNLNIIESLFVPEKKAISYVAER--YDALIGGPLDIGTYDQT--LAIEDLNLYAAKQ 355 (383) T ss_pred e-----------------eeecCCCceEEecCCCCcccEEEeeccc--eEEEecccceEEecchh--hhhcCceEEEEEE Confidence 3 2455555 6789999999999999998 78999999999988754 5999999999999 Q ss_pred eecceeecccceEEEEeeCCCCCC Q lcl|NC_021309. 474 RLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 474 r~~~~v~~~~a~~~l~~~~~a~~~ 497 (497) |+|+++++|+||++++++-..+.. T Consensus 356 r~dG~~~~~~A~~vl~~~~~~~~~ 379 (383) T protein:vir:78 356 FAYGKAKDDKAAAVWTLNINPAEQ 379 (383) T ss_pred EEcCEEecCCeEEEEEEEecCCCC Confidence 999999999999999998766666 No 99 >protein:vir:97397 Length: 517 # NCBI annotation: major capsid protein # Family: family:all:11745 # MgeID: mge:1675 # MgeName: Q54 # Cross-refs: genbank:acc:YP_762590;genbank:gi:115304291;genbank:GeneID:5130600 Probab=100.00 E-value=7.9e-41 Score=240.43 Aligned_cols=390 Identities=12% Similarity=0.074 Sum_probs=209.8 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhHHHHHHHHHHHHHHHHHHHHHHHH-HHHH Q lcl|NC_021309. 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDN-DIPE 79 (497) Q Consensus 1 ~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~a~~~~~~~-~~~~ 79 (497) .|-+|.-.++-. .+++....+...+.+..+.+ .+ ..+..++..++.++++++..+...... .... T Consensus 120 v~~pa~~~a~I~----~vke~~~~e~~~~~~~~a~~---ee-------~~e~~~k~~el~a~l~~~~~~~~~~~~e~~~~ 185 (517) T protein:vir:97 120 TPNPSNKNAVVT----YFREEKKKEENKMTFDQNLM---QE-------LLDAKKLAADLNAKLKERENGGDNAALKTVSE 185 (517) T ss_pred cchhhhhhhhhh----hhhhhhhhhhhhhhhhhhhh---hh-------hhhhhhhHHHHHHHHHHHHHHHHHHHHhhhhh Confidence 333333222211 12222221111111111111 00 011111222222222222222111110 0000 Q ss_pred HHHHHHHHHHHHHHhhhhhhHHHHhHhhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhcccccccc Q lcl|NC_021309. 80 VEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTF 159 (497) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (497) ++..... ..+... .................... ..........................+.+ T Consensus 186 l~a~~~~-----~~~~~~---------~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 247 (517) T protein:vir:97 186 LAANLMK-----QRESEK---------ILGVEALKVTPEATEFLKTR----EAEVAYMSASLTKDPKAAWTAELKERGIS 247 (517) T ss_pred hhhhHHH-----HHHhhh---------hcccccccccchhhHHHHHH----HHHHHHHHhcccccccceeeeeccccccc Confidence 0000000 000000 00000000000000000000 00000000000000000011111223445 Q ss_pred ccccchhhhHHHHHHHHhhhhHHhhcceeecCCCceEEEEEcCCCccceecccccccccccccceeeEeeeeeEEeeehh Q lcl|NC_021309. 160 APGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTI 239 (497) Q Consensus 160 g~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~wv~Eg~~~~~s~~~f~~i~~~~~kla~~~~i 239 (497) |..+|+.+...+...+...+++++++++.++. ...+|..++ ...+.|+.||+.+|+++++|+++++.++++++++++ T Consensus 248 ~~~~p~~~~~~i~~~~~~~~~i~~~~~~~~i~--~~~~~~~~~-~~~a~~~~eG~~kp~s~~tf~~~~~~~~~ia~~~~~ 324 (517) T protein:vir:97 248 GMPAPAGILKRIQDAVNDEGSLLPFIRHENLP--TLVVGGDNA-LTQGTGHTTGTDKTESNITLQTRVLTPQYVYKYIKL 324 (517) T ss_pred ccccchHHHHHHHHhhhhhccceeeeeecccc--ceeeecccc-cceeeeeecCCcccccccceeeEEeeHhhhhhhhhh Confidence 66677778888888888888888877765443 345666554 346789999999999999999999999999999999 Q ss_pred hHHHHhhH-----HHHHHHHHHHHHHHHHHHHHhhhhcccCcc-ccccccccccccccccccchhhhhhhHHHHHHhhhh Q lcl|NC_021309. 240 TDEGLRDA-----PELFNFVQGRLLEGIQRKEEVQLLAGGGYP-GVNGLLQRSTGFTASSASSLFGATSATVSNVKFPAD 313 (497) Q Consensus 240 S~ell~d~-----~~l~~~i~~~la~~~~~~~d~~~l~G~G~~-~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 313 (497) |+|||+|+ +.|++||.++|+++++.+++.+||+|+|++ .+.|+++............ T Consensus 325 S~qll~Ds~~dd~~~l~s~i~~~l~~~l~~~ee~a~l~GdGtg~~~~gi~~~a~~~~~~~~~~----------------- 387 (517) T protein:vir:97 325 PKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGVSETQIYPVVGDAWATNVTG----------------- 387 (517) T ss_pred hHHHHHHhhhccHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCcccccccccccccccccccc----------------- Confidence 99999875 339999999999999999999999999987 4667766542111100000 Q ss_pred hcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhc-cCCceEEechhHHHHHHHHhhh Q lcl|NC_021309. 314 GTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLF-QTPNAVVMNPRDWELLRLTKDA 392 (497) Q Consensus 314 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~n~~~~~~l~~lkd~ 392 (497) .+...+++ ..+..++. ..+..|+||+.+|..|+++||+ T Consensus 388 --------------------------------------~~~~~d~i---~~l~~a~~~a~~a~~vmn~~t~~~I~klKD~ 426 (517) T protein:vir:97 388 --------------------------------------TTNIQELL---EKLSVATPKAADSTLVIHRNDLAAIRFLKDK 426 (517) T ss_pred --------------------------------------cchHHHHH---HHHHHHhhhccCCEEEECHHHHHHHHHhhcC Confidence 00001111 11111111 1345799999999999999999 Q ss_pred cCceeccCcccccccccccccccccccceEecCCCCcCceEEEeeccceEEEEeecccEEEeecccchhhhcCceEEEEE Q lcl|NC_021309. 393 NGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAE 472 (497) Q Consensus 393 ~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~ 472 (497) +|||||++...+. ...+++|..-+. +.++.+...++.++. |.++++.++.+.-+ .++..|+..|+.+ T Consensus 427 ~G~Yl~~~~~~~~------~~~~l~G~~~~~-~~~~~~~~~~~~~~~--y~i~~~~g~~~~~~----fd~~~n~~~f~~~ 493 (517) T protein:vir:97 427 NGNYVFPVGVSNQ------TIATHFGFNRLV-QSVAVDEKTAVSLSG--YVTNGSRGMEFEQG----TILVENNKEYLFE 493 (517) T ss_pred CCCeeccCcCCcc------cccccCCccccc-cccccCceeEeeccc--cEEEeecceeeeee----eecccCceeEeee Confidence 9999998754433 235677742222 334445555565553 67788887765322 2256789999999 Q ss_pred EeecceeecccceEEEEeeCCCCC Q lcl|NC_021309. 473 ERLGLLVYRPSAFQLIQLKKGATG 496 (497) Q Consensus 473 ~r~~~~v~~~~a~~~l~~~~~a~~ 496 (497) +|.++.|+.|++|+++.+..++.| T Consensus 494 ~~~~g~i~~~~r~a~~~~~p~~~~ 517 (517) T protein:vir:97 494 MPISGSLEYKGTTAYGTYTPPVAG 517 (517) T ss_pred eeeccccccccceEEEEEcCCCCC Confidence 999999999999999999999999 No 100 >protein:vir:4197 Length: 314 # NCBI annotation: putative structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071822;genbank:gi:11863105;genbank:GeneID:1257607 Probab=100.00 E-value=1.5e-41 Score=244.43 Aligned_cols=292 Identities=9% Similarity=-0.004 Sum_probs=222.8 Q ss_pred HhhhhhhhhhhhhccccccccccccchhhhHHHHHHHHhhhhHHhhcceeec-CCCceEEEEEcCCC---ccceeccccc Q lcl|NC_021309. 139 ADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPV-TSPNLSYLTESAAH---NNAAAVAEAG 214 (497) Q Consensus 139 ~~~~~~~~~~~~~~~~~~~~~g~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~-~~~~~~~p~~~~~~---~~a~wv~Eg~ 214 (497) ++..+......... +.++.+|+++.|+....+|+.+.+.++++++++++++ +++...+|+...+. +.+.|.+|.+ T Consensus 1 ~~~~~~~~~~~k~i-t~~d~~gG~L~P~~~~~~i~~l~e~s~i~~~a~vi~t~~s~~~~i~~i~~g~~~~~~~~~~~~~~ 79 (314) T protein:vir:41 1 MDFLNKPFQITPKI-DVPDLGKGILAVQRFGEFVREVRENSAIIKDARVLNALKSYEVDISRISLGVELEPGRNTSGTKV 79 (314) T ss_pred CchhhhHHHhhccc-ccccCCCceeChHHHHHHHHHHHhccchhhheeeecccCccceeecccccCcccccccccccCCc Confidence 12222222222222 3344456677777677899999999999999999864 66778888865322 2356777888 Q ss_pred ccccccccceeeEeeeeeEEeeehhhHHHHhhHH---HHHHHHHHHHHHHHHHHHHhhhhcccCcc--------cccccc Q lcl|NC_021309. 215 TYPFSSEEFARVYEQVGKVANALTITDEGLRDAP---ELFNFVQGRLLEGIQRKEEVQLLAGGGYP--------GVNGLL 283 (497) Q Consensus 215 ~~~~s~~~f~~i~~~~~kla~~~~iS~ell~d~~---~l~~~i~~~la~~~~~~~d~~~l~G~G~~--------~p~Gi~ 283 (497) ..++++++|+++++.+|++...+.||+|+|+|++ +|+++|...++++++..++..+++|+|+. +|.|++ T Consensus 80 ~~~~~~~tf~~~~l~~~kl~~~v~is~e~L~D~a~~~~le~~i~~~~Ae~~g~~~~~~~~nGdg~~~s~~~~~~~p~G~l 159 (314) T protein:vir:41 80 APTADEVTVSTNTLEMKELVTKVVLEDEALEDNIEQSAFEQTITSLLASGVTYDLECFFLHADSSLTTGRELYRINDGWM 159 (314) T ss_pred cCCcccccccceeeeeEEEEEeecccHHHHHhhhchhhHHHHHHHHHHHHHHHHHHHHhhccccCCcCcccchhcchhhh Confidence 8899999999999999999999999999999984 79999999999999999999999999952 577887 Q ss_pred ccccccccccccchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHH Q lcl|NC_021309. 284 QRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFV 363 (497) Q Consensus 284 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 363 (497) +.++...+.... .+.....+.+.+.+. T Consensus 160 ~~a~~~~~~~~~-----------------------------------------------------~~~~~~~~~~~~l~~ 186 (314) T protein:vir:41 160 KLAGNQYTDAEP-----------------------------------------------------EDENWPLNLFDGMMD 186 (314) T ss_pred hhcccceeecCc-----------------------------------------------------cccccHHHHHHHHHH Confidence 754322111100 001112345667777 Q ss_pred hhhhhhccCCc--eEEechhHHHHHHHHhhhcCceeccCcccccccccccccccccccceEecCCCC-----cCceEEEe Q lcl|NC_021309. 364 DIQLTLFQTPN--AVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIP-----LGTILVGH 436 (497) Q Consensus 364 ~~~~~~~~~~~--~~~~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~-----~~~~~~gd 436 (497) .++..|+.+.. +|+||+.++..++++++..|+|+|.+.... ..+.+|+|+||+.++.|| ++.++||| T Consensus 187 sl~~~yr~~~~~~~~~m~~~t~~~~r~~l~~~~~~l~~~~~~~------~~~~~l~G~PV~~~~~~~~~~~~~~~i~fgd 260 (314) T protein:vir:41 187 ELDTRYLQLKPRMKFYVSNEIYNGYRKQLLVRETGLGDSALIG------ATGLQYDGIPIQYVPALDALGDDKARALLTV 260 (314) T ss_pred hcCchhhcCCCceEEEecHHHHHHHHHHHhccCCcccchhhhC------CCCceecceeeEecccccccCCCCceEEEec Confidence 88888876543 799999999999999999999999875433 234579999999999885 46789999 Q ss_pred eccceEEEEeecccEEEeecccchhhhcCceEEEEEEeecceeecccceEEEEeeCCCCC Q lcl|NC_021309. 437 FAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGATG 496 (497) Q Consensus 437 ~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~~a~~ 496 (497) |+. +.++.+..+++..... -.++++.|.+..|+|+.+.+++|.++..+..+..| T Consensus 261 ~~n--lv~~~~~~ir~~~~~~----a~~~~~~~~~~~r~d~~~~~~~aa~~~~~~~~~~~ 314 (314) T protein:vir:41 261 PTN--LVYGFWRNIRIEPKRD----AAMRRTEYIASLRADCNYEDENAAVAAVIDMSSGG 314 (314) T ss_pred hhh--eEEEeeceeEEeeccc----CcCCeEEEEEEEEeceEEEEcCcEEEEEeeccCCC Confidence 997 4456666776665543 35789999999999999999999999999999999 No 101 >protein:vir:4159 Length: 315 # NCBI annotation: structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046968;genbank:gi:9630538;genbank:GeneID:1261712 Probab=100.00 E-value=3.3e-41 Score=242.51 Aligned_cols=295 Identities=10% Similarity=-0.061 Sum_probs=212.3 Q ss_pred HHHHHhhhhhhhhhhhhccccccccccccchhhhHHHHHHHHhhhhHHhhcceeec-CCCceEEEEEcCC---Cccceec Q lcl|NC_021309. 135 MGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPV-TSPNLSYLTESAA---HNNAAAV 210 (497) Q Consensus 135 ~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~-~~~~~~~p~~~~~---~~~a~wv 210 (497) .-...+-+...........+.++.+|++++|+....+|+.+.+.++++++|++++. ++....+++...+ .....|. T Consensus 1 ~~~~~~~~~~~~~~~~k~~t~~d~~Gg~l~P~~~~~~i~~~~e~s~~l~~~~vi~~~~~~~~~i~~~g~~~~~~~g~~~~ 80 (315) T protein:vir:41 1 MLTIEDIRGGKPFEIVPKIDVPDLGRGVLSVDRFGEFVKAVRDSAVIIPEARIDNALKSYEKDISRLSLVLDVGPGRDET 80 (315) T ss_pred CcccchhhcCChhhhhhhcCCcCCCCceechHHHHHHHHHHHhhhhhhhhceeeeccccccccccccccCcccccccccc Confidence 00000000011111112334455678888898888899999999999999998754 4444455543211 1235688 Q ss_pred ccccccccccccceeeEeeeeeEEeeehhhHHHHhhHH---HHHHHHHHHHHHHHHHHHHhhhhcccCc------ccccc Q lcl|NC_021309. 211 AEAGTYPFSSEEFARVYEQVGKVANALTITDEGLRDAP---ELFNFVQGRLLEGIQRKEEVQLLAGGGY------PGVNG 281 (497) Q Consensus 211 ~Eg~~~~~s~~~f~~i~~~~~kla~~~~iS~ell~d~~---~l~~~i~~~la~~~~~~~d~~~l~G~G~------~~p~G 281 (497) +|.+..++++++|+++++.++++.+.+.||+++|+|++ +++++|...+++++++.++.++++|+|+ ++|.| T Consensus 81 ~~~~~~~~~~~~f~~~~l~~~~l~~~~~it~elL~D~~~~~~~e~~l~~~~a~~~a~~~~~~~~nGdg~s~~p~~~~~~G 160 (315) T protein:vir:41 81 GQKLAPPESTAEVKTNTLYMREMVTKVVIHEDAIEDNIEGKAFEQKIVTLLGEGISYVLEKYYLHGDTSSSDPLLRMSDG 160 (315) T ss_pred cCcCCCCCCccccceeeeceeeeeeeccccHHHHHhhhccccHHHHHHHHHHHHHHHHHHHHhhccCCcCcCcccccccc Confidence 89889999999999999999999999999999999974 8999999999999999999999999985 35678 Q ss_pred ccccccccccccccchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHH Q lcl|NC_021309. 282 LLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDA 361 (497) Q Consensus 282 i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 361 (497) +++.++......... ........+.+.+. T Consensus 161 ~l~~a~~~~~~~~~~---------------------------------------------------~~a~~~~~d~l~~l 189 (315) T protein:vir:41 161 WLKLASEKLTESDVD---------------------------------------------------PEAEDWPMNLFDTM 189 (315) T ss_pred ceecccccccccccc---------------------------------------------------cccccccHHHHHHH Confidence 887654321111000 00001112456667 Q ss_pred HHhhhhhhccCC--ceEEechhHHHHHHHHhhhcCceeccCcccccccccccccccccccceEecCCCC-----cCceEE Q lcl|NC_021309. 362 FVDIQLTLFQTP--NAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIP-----LGTILV 434 (497) Q Consensus 362 ~~~~~~~~~~~~--~~~~~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~-----~~~~~~ 434 (497) +..++..|+.++ .+|+||+.++..++++||++|+|+|++..... .+.+|+|+||+.++.|| .+.++| T Consensus 190 ~~sl~~~yr~~~~~~~~imn~~t~~~~rklk~~~g~~lw~~~~~~g------~~~tl~G~PV~~~~~m~~~~~~~~~ilf 263 (315) T protein:vir:41 190 IESLPTPYRNNLPNMKFYVTWDIYRAYRDALKGRETGLGDQALTGA------NSILYDGRPVQYVPALEALNDGKSRALF 263 (315) T ss_pred HHhcChHHhhcCCceEEEEcHHHHHHHHHHhccCCCccccchhhcC------CCceecccceEecccccccCCCCccEEE Confidence 778888887654 37999999999999999999999998764432 35689999999999986 466899 Q ss_pred EeeccceEEEEeecccEEEeecccchhhhcCceEEEEEEeecceeecccceEEEEeeC Q lcl|NC_021309. 435 GHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKK 492 (497) Q Consensus 435 gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~ 492 (497) |||+. |.++.+.+++++.+... .++.+.|.+..|+|+.+.++++.+...++- T Consensus 264 ~d~~n--l~~~~~~~i~i~~~~~a----~~~~~~~~~~~r~d~~~~~~~~~a~~~~~v 315 (315) T protein:vir:41 264 VVPTQ--LVYGFWRNIKVVPDYDA----EMRLTKYVASLRTDNHYEDEEGAVSATITV 315 (315) T ss_pred ecccc--eEEEeccccEEEeeecC----CCCceEEEEEEEeceeEEeccceeEeeeeC Confidence 99997 55677888888876543 367899999999999888777633333333 No 102 >protein:vir:4074 Length: 480 # NCBI annotation: major capsid (head) protein # Family: family:all:11745 # MgeID: mge:85 # MgeName: c2 # Cross-refs: genbank:acc:NP_043553;genbank:gi:9628687;genbank:GeneID:1261180 Probab=100.00 E-value=8.3e-38 Score=223.86 Aligned_cols=362 Identities=13% Similarity=0.082 Sum_probs=188.7 Q ss_pred CchHHHHHHHHHHHHHHHHHHHH-HHHH-HHHHHHHHHHHHHHHHHHHHHhHhHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021309. 1 MPSTAQLEAQGRQLAKSIKDINA-DETK-TAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIP 78 (497) Q Consensus 1 ~~~~a~~~~~~~~~~~~~~~~~~-~~~~-~~~e~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~a~~~~~~~~~~ 78 (497) +|+-...+-+. +|.... ++.. ...+.++. .....+.+....++.++++++..+.+.+... T Consensus 109 ~pa~~~a~v~~------vks~~~~~e~~~~~~e~~e~----------~~e~~e~~~~~~el~akl~el~k~~ee~k~~-- 170 (480) T protein:vir:40 109 LPSNKGAKVTK------VREENKGEQEQMGANETQEI----------MKQAIEAGVKVRELEAKVEELNKEREELKKE-- 170 (480) T ss_pred cccchhhhhhh------hhhhhhhhhhhhhhHHHHHH----------HHhhhhhhhhhhhHHHHHHHHHhHHHHHhhh-- Confidence 66644333211 111100 0000 00000000 0111111112222223333322221111100 Q ss_pred HHHHHHHHHHHHHHHhhhhhhHHHHhHhhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccc Q lcl|NC_021309. 79 EVEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGT 158 (497) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (497) ...... .. ........ .. +....... .. ....+.... .......... T Consensus 171 -----~~~~~~-------~~--~~~~~~~~-----e~-----r~~~~~~~--~~-~e~~~~~~~------~~~~~~~~~~ 217 (480) T protein:vir:40 171 -----REASIP-------SE--KPEDAERK-----FM-----RELGSKMA--EM-PEQGFLREF------ANGADLNVVN 217 (480) T ss_pred -----hhhhcc-------cc--chhhhhhH-----HH-----HHHHHHhc--cc-hhhhhhhhh------hhhccccccc Confidence 000000 00 00000000 00 00000000 00 000000000 0011112223 Q ss_pred cccccchhhhHHHHHHHHhhhhHHhhcceeecCCCceEEEEEcCCCccceecccccccccc--cccceeeEee---eeeE Q lcl|NC_021309. 159 FAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFS--SEEFARVYEQ---VGKV 233 (497) Q Consensus 159 ~g~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~wv~Eg~~~~~s--~~~f~~i~~~---~~kl 233 (497) .++.++|++...+........++...++... .+ .....|++|+...+.. ..++....+. .+++ T Consensus 218 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----------~g-~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~v~~l 285 (480) T protein:vir:40 218 SLGSITSKYARKSGIYDGAMKARFQGLTLAE-----------DG-VDDTFISGTFKAGTDKNKSQTATKRSLRPQMAEAY 285 (480) T ss_pred cccccccchhhheeechhhhhhhhhcceeee-----------cc-ccceeeeeeeecccccccccccccchhhHHHHHHH Confidence 3445666554443333333333333332211 12 2245677776554432 1234444444 4788 Q ss_pred EeeehhhHHHHhhHHHHHHHHHHHHHHHHHHHHHhhhhcccC--ccccccccccccccccccccchhhhhhhHHHHHHhh Q lcl|NC_021309. 234 ANALTITDEGLRDAPELFNFVQGRLLEGIQRKEEVQLLAGGG--YPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFP 311 (497) Q Consensus 234 a~~~~iS~ell~d~~~l~~~i~~~la~~~~~~~d~~~l~G~G--~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 311 (497) +++..+|+++|+|+++|++||.++|++.++.+++.+||+|+| ++.+.|+.+.....+.. T Consensus 286 ~~~~k~t~~lLDDa~~l~~~i~~~l~~~~~~~ee~a~l~G~g~g~~~~~g~~~~~~~~~~~------------------- 346 (480) T protein:vir:40 286 LQMDKATVRGVNDSGALSEYVMSEMVNRVIQKVEYNMILGSVDGSNGFYGLKTATDGWTKQ------------------- 346 (480) T ss_pred HHhHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccccceeeccccccc------------------- Confidence 888899999999999999999999999999999999999955 44577664432110000 Q ss_pred hhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHHHHHHHhh Q lcl|NC_021309. 312 ADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKD 391 (497) Q Consensus 312 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd 391 (497) .... +.+...+..+...|+.++..|+||+.+|..|++||| T Consensus 347 -------------------------------------~~~~---d~id~L~~al~~~y~~~a~~~vmn~~t~~~I~klKD 386 (480) T protein:vir:40 347 -------------------------------------IEYT---DLFEGITDAVAECSISDAITIVMSPQTFAELRKAKG 386 (480) T ss_pred -------------------------------------chhH---HHHHHHHHhhhHHhhCCCCEEEECHHHHHHHHHhhc Confidence 0001 222234455667777777679999999999999999 Q ss_pred hcCceeccCcccccccccccccccccccceEec-CCCCcCceEEEeeccceEEEEeecccEEEeecccchhhhcCceEEE Q lcl|NC_021309. 392 ANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTT-PLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVR 470 (497) Q Consensus 392 ~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~-~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r 470 (497) ++|||||+|..... .+++|||+|||++ ..+|.+...+|.++. ++.++||. ++. .....+..|+..|+ T Consensus 387 ~~G~Yi~q~~~~~~------~~~~llG~pvv~~~~~~~~~~~~~~~~~~-~~~~~d~~-~~~----~~~~~~~~~~~~~~ 454 (480) T protein:vir:40 387 TDGHSRFNELATKE------QIAQSFGAVNLETRVWMPKDEVAVYNHDE-YVLIGDLN-VEN----YNDFDLRYNVEQWL 454 (480) T ss_pred CCCCeeccCccccc------CcceecccceeeeeccccCCcceeeeCCc-cEEEEecc-cce----ecccccccchhhhh Confidence 99999999755432 4578999998765 567888888888886 68888863 322 22234568999999 Q ss_pred EEEeecceeecccceEEEEeeCCCCC Q lcl|NC_021309. 471 AEERLGLLVYRPSAFQLIQLKKGATG 496 (497) Q Consensus 471 ~~~r~~~~v~~~~a~~~l~~~~~a~~ 496 (497) ++.|+++.|..|+||++++++..=-= T Consensus 455 ~e~~v~g~~~~~~~~~~~~~~~~~~~ 480 (480) T protein:vir:40 455 SETLVGGSIRGKNRSAYLKKKGSLGV 480 (480) T ss_pred hhhhhceeeEccccEEEEEeccCcCC Confidence 99999999999999999987532111 No 103 >protein:vir:3158 Length: 321 # NCBI annotation: capsid protein gpE # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665929;genbank:gi:22091115;genbank:GeneID:951342 Probab=100.00 E-value=6.9e-36 Score=213.36 Aligned_cols=308 Identities=9% Similarity=-0.005 Sum_probs=211.5 Q ss_pred HHHHHHHhhhhhhhhhhhhccccccccccccchhhhHHHHHHHHhhhhHHhhcceeecCCCceEEEEEcCCCccceeccc Q lcl|NC_021309. 133 ELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAE 212 (497) Q Consensus 133 ~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~wv~E 212 (497) -.++.+.. .............++..+|++|||++...+++.+.+.++++++++++++++....+|....+ +.+.|+++ T Consensus 1 ~~~k~~~~-~l~~~~~~~~~~~~~~~~g~~v~~~~~~~l~~~i~e~s~~l~~i~v~~v~~~~~~i~~~~~~-~~~~~~~~ 78 (321) T protein:vir:31 1 MASRTINN-DLSRITEKNALTVDDLDAGGTLPDPLWDEFWTDMIEETPLLDAIRTETVGAKKTRIPTLNIG-ERHRRPQD 78 (321) T ss_pred CchHHHHH-HHHHHHHhccccccccCCcceeCHHHHHHHHHHHHHhhhhhhhceeeeccCcceeeeeeccC-Cccccccc Confidence 00011111 11111222233334556778999999999999999999999999999999998999987653 45677763 Q ss_pred -c-cccccccccceeeEeeeeeEEeeehhhHHHHhhH---HHHHHHHHHHHHHHHHHHHHhhhhcccCcccccccccccc Q lcl|NC_021309. 213 -A-GTYPFSSEEFARVYEQVGKVANALTITDEGLRDA---PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRST 287 (497) Q Consensus 213 -g-~~~~~s~~~f~~i~~~~~kla~~~~iS~ell~d~---~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~ 287 (497) + ...+.++|+|+++++.++++.+.++||+++|+|+ ++++++|.+.++++++..++..+++|+|+..|.++....| T Consensus 79 e~~~~~~~~~~~~~~~~~~~~k~~~~~~it~e~L~d~a~~~d~e~~i~~~ia~~~a~~~~~~~~nGd~~~~~~~~~~n~G 158 (321) T protein:vir:31 79 EGEWNENESDVSTGTIDISTEKATVAWDLPREVVQENPEGEALADRILNLMTDAWSADVEDLAANGDEDAEDSFENQNDG 158 (321) T ss_pred ccccccccccceeeeeeeeeEEEEeehhccHHHHHhhhcchhHHHHHHHHHHHHHHHHHHhheeeccccCCCcccccchh Confidence 3 3455688999999999999999999999999986 4899999999999999999999999999877643322211 Q ss_pred ccccccccchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhh Q lcl|NC_021309. 288 GFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQL 367 (497) Q Consensus 288 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 367 (497) ......... ............+.+.+.+..++. T Consensus 159 ~l~~a~~~~-----------------------------------------------~~~~~~~~~~~~d~l~~l~~~l~~ 191 (321) T protein:vir:31 159 FITVAEGDV-----------------------------------------------ETIDAADDILDNDLVIRTIAGLDS 191 (321) T ss_pred hhhhhcccc-----------------------------------------------ccccccccccCHHHHHHHHHhccH Confidence 111000000 000000001113456667777888 Q ss_pred hhccCCc-eEEechhHHHHHHHHhhhcCceeccCcccccccccccccccccccceEecCCCCcCceEEEeeccceEEEEe Q lcl|NC_021309. 368 TLFQTPN-AVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTAR 446 (497) Q Consensus 368 ~~~~~~~-~~~~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~~~~gd~~~~~~~i~~ 446 (497) .|+.+++ +|+||+.++..+++.....+.++|.+...+ ....+|+|+||+.++.||.+.++++||+... ++. T Consensus 192 ~yr~~~~~v~im~~~~~~~~~~~l~~~~~~~~~~~l~~------~~~~tl~G~pvv~~~~mP~~~il~t~~~nl~--~~~ 263 (321) T protein:vir:31 192 KYRARMNPALIVSEDQLLSYHYTLTDRDTPLGDNVIMG------EADVNPFSFPIIGSGLWPDDKAMFTDPQNLI--YAL 263 (321) T ss_pred hHhcCCCeEEEechHHHHHHHHHHhcCCCccccchhhc------cccccccceeEEEcCCCCCCcEEEeccccEE--EEE Confidence 8876655 699999999888764444445777654332 1345799999999999999999999999854 445 Q ss_pred ecccEEEeecccch-hhhcCceEEEEEEeecceeecccceEEEEeeCCCCCC Q lcl|NC_021309. 447 REGVTMQMTNSNGT-DFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 447 r~~~~i~~~~~~~~-~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~~a~~~ 497 (497) +.+++++....... .+.++.+.+....++|+.|.+++|++.++-...+.-. T Consensus 264 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ve~~~a~a~~~~i~~~~~~ 315 (321) T protein:vir:31 264 YRDLEIDVLTESDKVSERDLHARYFMRGDDDFAIENTEAVVLAEGLGDPLEH 315 (321) T ss_pred eeccEEEEeecCccccccceeeEeeeeeecceeEeccccEEEEecCCcchhc Confidence 66777777655321 1223445555666799999999999999854443322 No 104 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=99.95 E-value=9.9e-30 Score=179.59 Aligned_cols=266 Identities=20% Similarity=0.222 Sum_probs=190.0 Q ss_pred hccccccccccccchhhh-HHHHHHHHhhhhHHhhccee----ecCCCceEEEEEcCCCccceeccccccccccccccee Q lcl|NC_021309. 151 NPFGSTGTFAPGILPTFL-PGIVEQLFYELSLADLISSR----PVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFAR 225 (497) Q Consensus 151 ~~~~~~~~~g~~v~p~~~-~~ii~~~~~~~~l~~~~~~~----~~~~~~~~~p~~~~~~~~a~wv~Eg~~~~~s~~~f~~ 225 (497) |+.+.+ ..+..++|+.. .-+++.+...+.+.+++.+. ...|+.+++|+.+. .+.+.|++||+.+|.++++|+. T Consensus 1 MA~~~T-~~~~~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~~-~~~a~~v~eg~~i~~~~~~~~~ 78 (272) T protein:vir:98 1 MAVGTT-KMAQMLDPEVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWDY-IGDAEDVAEGEAIPMTQLGFKK 78 (272) T ss_pred CCCccc-cchheechHHHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEecC-CCCcccccCCCcccccccccce Confidence 443333 34556777644 55667777777777776553 23456799999875 5689999999999999999999 Q ss_pred eEeeeeeEEeeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccchhhhhhhH Q lcl|NC_021309. 226 VYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSAT 304 (497) Q Consensus 226 i~~~~~kla~~~~iS~ell~d~-~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~ 304 (497) +++.+++++..+++|++++.++ +++.+++.+.+++++++++|..++..-.. +.. ... T Consensus 79 ~~~~~~~~~~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~---------a~~---~~~---------- 136 (272) T protein:vir:98 79 TTMTIKKAGKGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSK---------STQ---TVE---------- 136 (272) T ss_pred EEEEeeeeeeeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhcc---------ccc---ccc---------- Confidence 9999999999999999998765 79999999999999999999998853110 000 000 Q ss_pred HHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHH Q lcl|NC_021309. 305 VSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWE 384 (497) Q Consensus 305 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~ 384 (497) .....+.+.++...+...+ ....+|++||.++. T Consensus 137 ----------------------------------------------~~~t~d~i~da~~~l~~~~-~~~~~~vv~p~~~~ 169 (272) T protein:vir:98 137 ----------------------------------------------ATATVDGVSKALDIFNDED-DAETVIVMNPADAS 169 (272) T ss_pred ----------------------------------------------cccCHHHHHHHHHHHhccC-CCccEEEEcHHHHH Confidence 0001223334444443332 44568999999999 Q ss_pred HHHHHhhhcCceeccCcccccccccccccccccccceEecCCCCcCceEEEeeccceEEEEeecccEEEeecccchhhhc Q lcl|NC_021309. 385 LLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVD 464 (497) Q Consensus 385 ~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~ 464 (497) .|++.+..+. +-...... .....+..++++|+||++++.+|.+++|+.+.. ++.++.+.+++++.++.. .+ T Consensus 170 ~L~k~~~~~~--~~~~~~~~-~~~~~g~ig~i~G~~Vi~s~~~p~~t~~~~~~~--a~~~~~~~~~~ve~~r~~----~~ 240 (272) T protein:vir:98 170 TLRLDAAKEW--LGATEVGA-NRVVSGVYGEVLGVQIVRSRKCPKGTAYMVRKG--ALRIMLKRNTMVETDRDI----TK 240 (272) T ss_pred HHHHhccccc--cccccccc-cccccccchhhcCeeEEEcCCCCcceEEEEcCC--eEEEEecCCceeeecccc----cc Confidence 9987643221 11000000 001111235799999999999999999886655 577777888888876543 35 Q ss_pred CceEEEEEEeecceeecccceEEEEeeCCCCC Q lcl|NC_021309. 465 GKVTVRAEERLGLLVYRPSAFQLIQLKKGATG 496 (497) Q Consensus 465 ~~v~~r~~~r~~~~v~~~~a~~~l~~~~~a~~ 496 (497) +...+++..|++++|.+|++|++++++++++- T Consensus 241 ~~~~i~~~~~~~~~v~~~~~vv~~t~~~a~~~ 272 (272) T protein:vir:98 241 AINQIVANKHYGVYLYKAEKAVKITLKDAAKK 272 (272) T ss_pred ceeEEEEEEEEEEEEEcCCceEEEEecccccC Confidence 67899999999999999999999999999988 No 105 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=99.95 E-value=9.9e-30 Score=179.59 Aligned_cols=266 Identities=20% Similarity=0.222 Sum_probs=190.0 Q ss_pred hccccccccccccchhhh-HHHHHHHHhhhhHHhhccee----ecCCCceEEEEEcCCCccceeccccccccccccccee Q lcl|NC_021309. 151 NPFGSTGTFAPGILPTFL-PGIVEQLFYELSLADLISSR----PVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFAR 225 (497) Q Consensus 151 ~~~~~~~~~g~~v~p~~~-~~ii~~~~~~~~l~~~~~~~----~~~~~~~~~p~~~~~~~~a~wv~Eg~~~~~s~~~f~~ 225 (497) |+.+.+ ..+..++|+.. .-+++.+...+.+.+++.+. ...|+.+++|+.+. .+.+.|++||+.+|.++++|+. T Consensus 1 MA~~~T-~~~~~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~~-~~~a~~v~eg~~i~~~~~~~~~ 78 (272) T protein:vir:30 1 MAVGTT-KMAQMLDPEVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWDY-IGDAEDVAEGEAIPMTQLGFKK 78 (272) T ss_pred CCCccc-cchheechHHHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEecC-CCCcccccCCCcccccccccce Confidence 443333 34556777644 55667777777777776553 23456799999875 5689999999999999999999 Q ss_pred eEeeeeeEEeeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccchhhhhhhH Q lcl|NC_021309. 226 VYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSAT 304 (497) Q Consensus 226 i~~~~~kla~~~~iS~ell~d~-~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~ 304 (497) +++.+++++..+++|++++.++ +++.+++.+.+++++++++|..++..-.. +.. ... T Consensus 79 ~~~~~~~~~~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~---------a~~---~~~---------- 136 (272) T protein:vir:30 79 TTMTIKKAGKGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSK---------STQ---TVE---------- 136 (272) T ss_pred EEEEeeeeeeeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhcc---------ccc---ccc---------- Confidence 9999999999999999998765 79999999999999999999998853110 000 000 Q ss_pred HHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHH Q lcl|NC_021309. 305 VSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWE 384 (497) Q Consensus 305 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~ 384 (497) .....+.+.++...+...+ ....+|++||.++. T Consensus 137 ----------------------------------------------~~~t~d~i~da~~~l~~~~-~~~~~~vv~p~~~~ 169 (272) T protein:vir:30 137 ----------------------------------------------ATATVDGVSKALDIFNDED-DAETVIVMNPADAS 169 (272) T ss_pred ----------------------------------------------cccCHHHHHHHHHHHhccC-CCccEEEEcHHHHH Confidence 0001223334444443332 44568999999999 Q ss_pred HHHHHhhhcCceeccCcccccccccccccccccccceEecCCCCcCceEEEeeccceEEEEeecccEEEeecccchhhhc Q lcl|NC_021309. 385 LLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVD 464 (497) Q Consensus 385 ~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~ 464 (497) .|++.+..+. +-...... .....+..++++|+||++++.+|.+++|+.+.. ++.++.+.+++++.++.. .+ T Consensus 170 ~L~k~~~~~~--~~~~~~~~-~~~~~g~ig~i~G~~Vi~s~~~p~~t~~~~~~~--a~~~~~~~~~~ve~~r~~----~~ 240 (272) T protein:vir:30 170 TLRLDAAKEW--LGATEVGA-NRVVSGVYGEVLGVQIVRSRKCPKGTAYMVRKG--ALRIMLKRNTMVETDRDI----TK 240 (272) T ss_pred HHHHhccccc--cccccccc-cccccccchhhcCeeEEEcCCCCcceEEEEcCC--eEEEEecCCceeeecccc----cc Confidence 9987643221 11000000 001111235799999999999999999886655 577777888888876543 35 Q ss_pred CceEEEEEEeecceeecccceEEEEeeCCCCC Q lcl|NC_021309. 465 GKVTVRAEERLGLLVYRPSAFQLIQLKKGATG 496 (497) Q Consensus 465 ~~v~~r~~~r~~~~v~~~~a~~~l~~~~~a~~ 496 (497) +...+++..|++++|.+|++|++++++++++- T Consensus 241 ~~~~i~~~~~~~~~v~~~~~vv~~t~~~a~~~ 272 (272) T protein:vir:30 241 AINQIVANKHYGVYLYKAEKAVKITLKDAAKK 272 (272) T ss_pred ceeEEEEEEEEEEEEEcCCceEEEEecccccC Confidence 67899999999999999999999999999988 No 106 >protein:vir:3613 Length: 272 # NCBI annotation: MHP # Family: family:all:522 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112699;genbank:gi:13786567;genbank:GeneID:921035 Probab=99.79 E-value=1.6e-20 Score=129.14 Aligned_cols=264 Identities=17% Similarity=0.142 Sum_probs=176.8 Q ss_pred hccccccccccccchhhhHH-HHHHHHhhhhHHhhcceeec----CCCceEEEEEcCCCccceeccccccccccccccee Q lcl|NC_021309. 151 NPFGSTGTFAPGILPTFLPG-IVEQLFYELSLADLISSRPV----TSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFAR 225 (497) Q Consensus 151 ~~~~~~~~~g~~v~p~~~~~-ii~~~~~~~~l~~~~~~~~~----~~~~~~~p~~~~~~~~a~wv~Eg~~~~~s~~~f~~ 225 (497) |+.+ .+.-..+|+||...+ +.+.+.....+.+++.+-+. .|+.+++|.... .+.+.++.||++++..+.++++ T Consensus 1 ma~~-~T~~~d~iiPev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~~-~gda~~~~eg~~i~~~~lt~~~ 78 (272) T protein:vir:36 1 MSKQ-KTTLADLVNPEVLAPIVSYELNKALRFAPLAQVDTTLQGQPGNTLKFPAFTY-IGDAADVAEGGEISLDKIGTTT 78 (272) T ss_pred CCCc-ceehhhhhchHHHHHHHHHHHHhhhhhccccccccccccCCCCEEEEeeecc-CccccccCCCCccChhhcCCcc Confidence 3333 334456777875555 55777766677777765442 356799999875 3678899999999999999999 Q ss_pred eEeeeeeEEeeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccchhhhhhhH Q lcl|NC_021309. 226 VYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSAT 304 (497) Q Consensus 226 i~~~~~kla~~~~iS~ell~d~-~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~ 304 (497) .++..++.+....++++....+ .++.+.+.+++++.+++++|+.++..-. | .... .+ T Consensus 79 ~~~~i~~~~k~~~vtD~~~~~~~~d~~~~~~~~~a~~~a~~~d~~i~~~l~-----~----~~~~-~~------------ 136 (272) T protein:vir:36 79 KSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLANKVDDDLLSAAK-----T----TSQT-VS------------ 136 (272) T ss_pred eeEeeehhhccccccHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhc-----c----cccc-cc------------ Confidence 9999999988899999876554 6899999999999999999998874311 0 0000 00 Q ss_pred HHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHH Q lcl|NC_021309. 305 VSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWE 384 (497) Q Consensus 305 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~ 384 (497) .....+.+.++...+.... ....++++||.++. T Consensus 137 ----------------------------------------------~~~~~d~i~~A~~~lgd~~-~~~~~ivv~p~~~~ 169 (272) T protein:vir:36 137 ----------------------------------------------TKANVDGVQAALDIFNDED-AQAYVLIVNPKDAA 169 (272) T ss_pred ----------------------------------------------ccccHHHHHHHHHHhhhcC-CCceEEEEcHHHHH Confidence 0001122223333222222 23568999999999 Q ss_pred HHHHHhhhcCceeccCcccccccccccccccccccceEecCCCCcCceEEEe--eccceEEEEeecccEEEeecccchhh Q lcl|NC_021309. 385 LLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGH--FAPSVIQTARREGVTMQMTNSNGTDF 462 (497) Q Consensus 385 ~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~~~~gd--~~~~~~~i~~r~~~~i~~~~~~~~~f 462 (497) .|++...-. +. +...+......+.-++++|+||++++.+|.++.++.. |...++.++..++++++..+... T Consensus 170 ~L~k~~~~~--~~--~~~~~~~~~~~G~ig~~~G~~Vv~s~~~p~~~~~~~~~~~~~gA~~~~~~~~~~vE~~R~~~--- 242 (272) T protein:vir:36 170 KIRKDANAK--NI--GSEVGANALINGTYADVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDIV--- 242 (272) T ss_pred HHhcccccc--cc--cccccccceeeeccceecCeeEEEeCCCCCCceeEEEEEecccceeeeecCCcccccccchh--- Confidence 887543221 11 1111110011122357999999999999998864322 33446666777777777655432 Q ss_pred hcCceEEEEEEeecceeecccceEEEEeeCC Q lcl|NC_021309. 463 VDGKVTVRAEERLGLLVYRPSAFQLIQLKKG 493 (497) Q Consensus 463 ~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~~ 493 (497) +....+++..+++.+|.+|+++++++++-+ T Consensus 243 -~~~d~i~~~~~y~~~v~~~~~vv~~t~~g~ 272 (272) T protein:vir:36 243 -TKTTVITADEHYAAYLYDLTKVVNITFTGV 272 (272) T ss_pred -hcCcEEEEEEEEEEEEEcCccEEEEeecCC Confidence 345678889999999999999999999999 No 107 >protein:vir:93742 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240459;genbank:gi:66396126;genbank:GeneID:5133511 Probab=99.78 E-value=5.5e-20 Score=126.17 Aligned_cols=265 Identities=17% Similarity=0.163 Sum_probs=179.3 Q ss_pred hccccccccccccchhhh-HHHHHHHHhhhhHHhhcceeec----CCCceEEEEEcCCCccceeccccccccccccccee Q lcl|NC_021309. 151 NPFGSTGTFAPGILPTFL-PGIVEQLFYELSLADLISSRPV----TSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFAR 225 (497) Q Consensus 151 ~~~~~~~~~g~~v~p~~~-~~ii~~~~~~~~l~~~~~~~~~----~~~~~~~p~~~~~~~~a~wv~Eg~~~~~s~~~f~~ 225 (497) |+.+ .+.-+..++|+.. .-+.+.+.....+.+++.+... .|+.+++|+.+. .+.+.|+.||+.++.++.+++. T Consensus 1 ma~~-~T~~~~~iiPev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~ip~~~~-~g~~~~~~eg~~i~~~~it~~~ 78 (274) T protein:vir:93 1 MPQG-ITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVY-SGDAQVVAEGEKIPTDILETKK 78 (274) T ss_pred CCcc-ceehhheechHHHHHHHHHHHHhhhhhcccccccccccCCCCCEEEEEeecc-CCCcccccCCCcccccccccce Confidence 3333 3344567777744 5567777777777777765432 245789999874 4578899999999999999999 Q ss_pred eEeeeeeEEeeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccchhhhhhhH Q lcl|NC_021309. 226 VYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSAT 304 (497) Q Consensus 226 i~~~~~kla~~~~iS~ell~d~-~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~ 304 (497) .++..++.+....++++....+ .++.+.+.+.+++++++++|..++..-...... ... T Consensus 79 ~~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~~d~~~~~~~~~a~~~----------~~~----------- 137 (274) T protein:vir:93 79 REAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLT----------VNA----------- 137 (274) T ss_pred eEEEeeeecccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhccccc----------ccc----------- Confidence 9999999888899999987654 688899999999999999999988542111000 000 Q ss_pred HHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHH Q lcl|NC_021309. 305 VSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWE 384 (497) Q Consensus 305 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~ 384 (497) .....+.+.++...+.... ....++++||..+. T Consensus 138 ----------------------------------------------~~~~~d~i~dA~~~l~d~~-~~~~~ivv~p~~~~ 170 (274) T protein:vir:93 138 ----------------------------------------------DITKLNGLQSAIDKFNDED-LEPMVLFINPLDAG 170 (274) T ss_pred ----------------------------------------------cccCHHHHHHHHHHhhhcc-CCccEEEeCHHHHH Confidence 0000122333333333222 24567999999999 Q ss_pred HHHHHhhhcCceeccCcccccccccccccccccccceEecCCCCcCceEEEeeccceEEEEeecccEEEeecccchhhhc Q lcl|NC_021309. 385 LLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVD 464 (497) Q Consensus 385 ~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~ 464 (497) .|++ |..-+++-.+..+.. ....+..+++.|+||++++.+|.++.++.+... +.++....++++..+.. .+ T Consensus 171 ~L~k--~~~~~f~~~s~~g~~-~~~~G~ig~~~G~~Vi~s~~~p~~t~~l~~~ga--i~~~~~~~~~vE~~Rd~----~~ 241 (274) T protein:vir:93 171 KLRG--DASTNFTRATELGDD-IIVKGAFGEALGAIIVRTNKLEAGTAILAKKGA--VKLILKRDFFLEVARDA----ST 241 (274) T ss_pred HHHh--hhhhccccccccccc-ceeecccceecCeeEEEcCCCCcceEEEEeCCe--EEEEecCCcccccccch----hh Confidence 9974 332223222211111 111223457999999999999999988876554 55666777777665543 23 Q ss_pred CceEEEEEEeecceeecccceEEEEeeCCCCCC Q lcl|NC_021309. 465 GKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 465 ~~v~~r~~~r~~~~v~~~~a~~~l~~~~~a~~~ 497 (497) ....+++..+++.++.+|++++++++ +.|| T Consensus 242 ~~d~i~~~~~y~~~~~~~~~~v~~t~---~~~s 271 (274) T protein:vir:93 242 KTTALYSDKHYVAYLYDESKAVKITK---GSGS 271 (274) T ss_pred cccEEEEEEEEEEEEEcCCceEEEee---Cccc Confidence 46788999999999999999999885 4455 No 108 >protein:vir:94933 Length: 330 # NCBI annotation: putative phage structural protein # Family: family:all:1120 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239278;genbank:gi:66392060;genbank:GeneID:5076578 Probab=99.77 E-value=1.8e-20 Score=128.88 Aligned_cols=308 Identities=14% Similarity=0.139 Sum_probs=198.1 Q ss_pred hHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccccccccchhhhHHHHHHHHhhhhHHhhcceeecCCCceE Q lcl|NC_021309. 117 SFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLS 196 (497) Q Consensus 117 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ 196 (497) .. .. .....+..+..-...+ -...+...+-...+.+.+.+....||+.+.+.+.|++.+++..+.++.+. T Consensus 1 ~~---~~------~~~~~~~~~~~~~~~~-p~l~m~alTLaea~~l~~d~~~~~VIE~l~~~s~iL~~lpf~~ve~~~~~ 70 (330) T protein:vir:94 1 MV---RI------CTPPLRGRWRTLTHQF-PELKMPTVTLAESAKLSQDHLVSGLIETIVEVNPLYEMMPFTEIEGNALA 70 (330) T ss_pred Cc---ee------cCCccccceeehhccc-cccchhhhhhhHHhhcCchhhHHHHHHhhhccchHHhhcccccccCCcce Confidence 00 00 0000000010000000 00112222222334455556778899999999999999999999999999 Q ss_pred EEEEcCCCccceecccccccccccc-cceeeEeeeeeEEeeehhhHHHHh--hH-HHHHHHHHHHHHHHHHHHHHhhhhc Q lcl|NC_021309. 197 YLTESAAHNNAAAVAEAGTYPFSSE-EFARVYEQVGKVANALTITDEGLR--DA-PELFNFVQGRLLEGIQRKEEVQLLA 272 (497) Q Consensus 197 ~p~~~~~~~~a~wv~Eg~~~~~s~~-~f~~i~~~~~kla~~~~iS~ell~--d~-~~l~~~i~~~la~~~~~~~d~~~l~ 272 (497) |++.+. -+.+.|...++..+.+.+ +|.+++.+++.+.+.+.|++++.+ .+ .+...+-.....++++.+.+.+++| T Consensus 71 ~~r~~~-lp~a~~r~~n~~~~~~~~~Tf~q~t~~l~~l~~~~~Vd~~iadl~g~~~d~~~~q~~~~ieal~~~~e~~lin 149 (330) T protein:vir:94 71 YNRENV-LGDVQFLAVGGTITAKNPATFTKVTSELTTLIGDAEVNGLIQATRSDFMDQTSVQVASKAKSIGRQYQASMIT 149 (330) T ss_pred eeeeec-CCcceeeeccccccccCcceeeeeeechhhhhhhHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHhhc Confidence 999887 478999999988887654 899999999999999999999965 34 4688888888999999999999999 Q ss_pred ccCcc-ccccccccccccccccccchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchh Q lcl|NC_021309. 273 GGGYP-GVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTA 351 (497) Q Consensus 273 G~G~~-~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 351 (497) |+.++ ++.||+........-.+. ..+ ..++ T Consensus 150 GDs~~~~F~GL~~~~~~~q~i~tg--------------------------------------------~~g----g~~T- 180 (330) T protein:vir:94 150 GDGTGNSFQGMMGLVAASQTISAG--------------------------------------------ANG----GTLT- 180 (330) T ss_pred cCCCCccccchhhcCCcccEEecC--------------------------------------------CCC----CCCC- Confidence 98764 577887644221110000 000 0000 Q ss_pred hhhhhHHHHHHHhhhhhhccCCceEEechhHHHHHHHHhhhcCceeccCcccccccccccccccccccceEecCCCCcC- Q lcl|NC_021309. 352 AEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLG- 430 (497) Q Consensus 352 ~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~- 430 (497) -+.+|.++.. +. .....+.+|+||++.+..|+.+.+..|+|-..+......|.++ .++.|+|++.++.+|.+ T Consensus 181 ~d~LDeLl~~---v~-~~~g~~~~~l~n~a~~r~I~a~~R~~~~~~v~~~~~~~~G~~v---~~~~GvPi~~~d~ip~~~ 253 (330) T protein:vir:94 181 FELLDQLLDL---VK-DKDGQVDYLMSSFAMRRKYFSLLRALGGAAIGEVMTLPSGRQI---PTYRGVPWFVNDFIPSNM 253 (330) T ss_pred HHHHHHHHHH---hc-CCCCCCcEEEechhHHHHHHHHHHhccCCCCCCcccccCCCEE---eeeCCeEEEecccccCCC Confidence 0112222222 21 1223467899999999999999999998877666555555544 35779999999999863 Q ss_pred ---------ceEEEeec-----cceEEEEe--ecccEEEeecccchhhhcCceEEEEEEeecceeecccceEEEEeeCCC Q lcl|NC_021309. 431 ---------TILVGHFA-----PSVIQTAR--REGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGA 494 (497) Q Consensus 431 ---------~~~~gd~~-----~~~~~i~~--r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~~a 494 (497) .+|+..|. ++...+.- ..++++..-.+. =.++.+.+|++++++.+|.+|.|+.+|+-...= T Consensus 254 ~~~~~~~ttsIyav~~G~~~~~qgV~Gl~~~g~~glsVr~~G~~---~~k~v~~~~v~~y~~~av~~~~a~~~L~~V~~g 330 (330) T protein:vir:94 254 TQGTATNATAIFAGTFDDGSNKYGIAGLTARGSAGLRVQNVGAK---ENADETITRVKMYCGFANFSQLGLAAIKGLIPG 330 (330) T ss_pred CcccCCCceeEEEEeecccccccceEeecCCCCCcceeeeCCCc---cccceeeEEEEEeeeeEEechhheeeeccccCC Confidence 24555543 22122211 134444332111 135678899999999999999999988655443 No 109 >protein:vir:80930 Length: 278 # NCBI annotation: Cps # Family: family:all:522 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468392;genbank:gi:157324966;genbank:GeneID:5601363 Probab=99.73 E-value=8.5e-19 Score=119.65 Aligned_cols=271 Identities=15% Similarity=0.119 Sum_probs=176.7 Q ss_pred hccccccccccccchh-hhHHHHHHHHhhhhHHhhcceeec----CCCceEEEEEcCCCccceeccccccccccccccee Q lcl|NC_021309. 151 NPFGSTGTFAPGILPT-FLPGIVEQLFYELSLADLISSRPV----TSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFAR 225 (497) Q Consensus 151 ~~~~~~~~~g~~v~p~-~~~~ii~~~~~~~~l~~~~~~~~~----~~~~~~~p~~~~~~~~a~wv~Eg~~~~~s~~~f~~ 225 (497) |+.. ++.-+..++|| |..-+.+.+.+...+.+++..... .++.+++|+... .+.+.++.|++.++..+.++++ T Consensus 1 Ma~~-~T~~~~~iiPev~s~~v~~~~~~~~v~~~~~~~~~~l~g~~G~tv~ip~~~~-~g~a~~~~~g~~i~~~~lt~~~ 78 (278) T protein:vir:80 1 MADL-TTKLANLIDPEVMGPMISAKLPKAIKFGKIAPIDNSLEGQPGSEITVPKYKY-IGDAQDVAEGAAIDYSALETES 78 (278) T ss_pred CCCc-ceehhheecHHHHHHHHHHHHHHhhhhcccceecccccCCCCCEEEEeeecc-CCcceeecCCCcCcccccccce Confidence 3332 23335677777 455577777777677777654332 355789999874 3567899999999999999999 Q ss_pred eEeeeeeEEeeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhccc-Cccccccccccccccccccccchhhhhhh Q lcl|NC_021309. 226 VYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGG-GYPGVNGLLQRSTGFTASSASSLFGATSA 303 (497) Q Consensus 226 i~~~~~kla~~~~iS~ell~d~-~~l~~~i~~~la~~~~~~~d~~~l~G~-G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~ 303 (497) .++..++.+..+.++++....+ .++.+.+.+.+++++++++|..++..- |... +..+..+ T Consensus 79 ~~~~i~~~~~a~~v~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~l~~~l~~a~~-----~~~~~~t------------- 140 (278) T protein:vir:80 79 VKHGIKKAGKGVKLTDESVLSGYGDPVEEAQKQIRMAIASKVDNDILEEALTTTL-----EVKGAIN------------- 140 (278) T ss_pred eeEeeehhhccccccHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHHhcccc-----ccccccc------------- Confidence 9999999888889999876554 688999999999999999999887531 1100 0000000 Q ss_pred HHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHH Q lcl|NC_021309. 304 TVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDW 383 (497) Q Consensus 304 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~ 383 (497) ........+.+.++...+..........+++||..+ T Consensus 141 --------------------------------------------~~~~~~~~~~~~da~~~l~~~~~~~~~~ivv~p~~~ 176 (278) T protein:vir:80 141 --------------------------------------------IGLIDKIENTFTDAPDAIEDESITTTGVLFLNYKDT 176 (278) T ss_pred --------------------------------------------cchhhhHHHHHHHHHHhhcccCCCcccEEEECHHHH Confidence 000000111222222222222223344688999999 Q ss_pred HHHHHHhhhcCceeccCcccccccccccccccccccceEecCCCCcCceEEEeeccceEEEEeecccEEEeecccchhhh Q lcl|NC_021309. 384 ELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFV 463 (497) Q Consensus 384 ~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~ 463 (497) ..|++.. .-+++-.+..... ....+.-+++.|+||++++.+|.++.|+..-. ++..+....++++..+... T Consensus 177 ~~L~k~~--~~~~~~~~~~g~~-~~~~G~ig~~~G~~Vi~s~~~p~~t~~l~~~g--Ai~~~~~~~~~vE~~Rd~~---- 247 (278) T protein:vir:80 177 AKLREEA--AGSWTKASQLGDD-LLVKGAFGELLGWEIVRTKKLADGNALAVKAG--ALKTFLKRNLLAESGRDMD---- 247 (278) T ss_pred HHHHhhh--hhhcccccccccc-ceeeccceeecceeEEEcCCCCcceEEEEecc--ceeeeecCCcccccccchh---- Confidence 8887643 2223221111111 11122345799999999999999998775433 5666667777776655332 Q ss_pred cCceEEEEEEeecceeecccceEEEEeeCCC Q lcl|NC_021309. 464 DGKVTVRAEERLGLLVYRPSAFQLIQLKKGA 494 (497) Q Consensus 464 ~~~v~~r~~~r~~~~v~~~~a~~~l~~~~~a 494 (497) +....+++..+++.++++|+++++++..+.- T Consensus 248 ~~~d~i~~~~~yg~~v~~~~~~v~it~~a~~ 278 (278) T protein:vir:80 248 HKLTKFNADQHYAVALVDETKAVKVVPVAGN 278 (278) T ss_pred hccceeeeeeEEEEEEEcCcceEEEeeccCC Confidence 3456888889999999999999999988777 No 110 >protein:vir:96833 Length: 275 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240157;genbank:gi:66395822;genbank:GeneID:5133174 Probab=99.72 E-value=1e-18 Score=119.24 Aligned_cols=269 Identities=13% Similarity=0.125 Sum_probs=182.2 Q ss_pred hccccccccccccchhhhHH-HHHHHHhhhhHHhhcceeec----CCCceEEEEEcCCCccceeccccccccccccccee Q lcl|NC_021309. 151 NPFGSTGTFAPGILPTFLPG-IVEQLFYELSLADLISSRPV----TSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFAR 225 (497) Q Consensus 151 ~~~~~~~~~g~~v~p~~~~~-ii~~~~~~~~l~~~~~~~~~----~~~~~~~p~~~~~~~~a~wv~Eg~~~~~s~~~f~~ 225 (497) +.....+.-..+|+||.... +.+.+.....+.+++.+-+. .|+.+++|.... .+.+.++.||+.++..+.+++. T Consensus 1 ~~~~~~T~l~d~i~PEv~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~iP~~~~-ig~a~~~~~g~~i~~~~lt~~~ 79 (275) T protein:vir:96 1 MALENMTKLANMVNPEVLAPMMQAELDKKLKFAQFADIDNTLVGQPGNTITFPAFVY-SGDAKVVPEGEEIPIDLIETKK 79 (275) T ss_pred CCCcccchhhhhhchHHHHHHHHHHHHHhhhhcccceecccccCCCCCEEEeeeecc-CCccccccCCCCcchhhcccce Confidence 44444455556888886655 66777777777787765443 355799999875 4578899999999999999999 Q ss_pred eEeeeeeEEeeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccchhhhhhhH Q lcl|NC_021309. 226 VYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSAT 304 (497) Q Consensus 226 i~~~~~kla~~~~iS~ell~d~-~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~ 304 (497) .++..++.+..+.++++....+ .++.+.+.+.++.++++++|+.++.--++. +.. ... T Consensus 80 ~~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~ll~~l~~a---------~~~-~~~----------- 138 (275) T protein:vir:96 80 RQATIRKIGKGTVLTDEALLSGYGDPKGEAVRQHGLAIANKVDNDVLEALQGA---------TLK-VEA----------- 138 (275) T ss_pred eeEEeehhcccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcc---------ccc-ccc----------- Confidence 9999999988899999986554 577888889999999999999887421110 000 000 Q ss_pred HHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHH Q lcl|NC_021309. 305 VSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWE 384 (497) Q Consensus 305 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~ 384 (497) .....+.+.++...+.... ....++++||..+. T Consensus 139 ----------------------------------------------~~~~~d~i~dA~~~lgd~~-~~~~~ivv~p~~~~ 171 (275) T protein:vir:96 139 ----------------------------------------------DITKLAGLQTAIDKFNDED-LEPMVLFVNPLDAG 171 (275) T ss_pred ----------------------------------------------cccCHHHHHHHHHHhcccc-CCccEEEeCHHHHH Confidence 0001222333333332222 24567999999999 Q ss_pred HHHHHhhhcCceeccCcccccccccccccccccccceEecCCCCcCceEEEeeccceEEEEeecccEEEeecccchhhhc Q lcl|NC_021309. 385 LLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVD 464 (497) Q Consensus 385 ~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~ 464 (497) .|++..+ -+++-.+..+.. ....+.-+++.|++||+++.+|.++.|+.. +.++.++...+++++..+... + T Consensus 172 ~L~k~~~--~~f~~~~~~g~~-~~~~G~ig~~~G~~Vi~s~~~p~~t~~i~~--~gA~~~~~~~~~~vE~~Rd~~----~ 242 (275) T protein:vir:96 172 KLRASAT--DNFTRATLLGDN-VIVKGAFGEALGAIIVRSNKIKEGEAILAK--RGAVKLITKRDFFLETERHAS----H 242 (275) T ss_pred HHHhccc--cccccccccccc-ceeccccceecCeeEEEeCCCCcceEEEEe--ccceeeeecCCcccccccchh----h Confidence 9977532 122222111111 111223457999999999999999987643 445777777777777665432 3 Q ss_pred CceEEEEEEeecceeecccceEEEEeeCCCCCC Q lcl|NC_021309. 465 GKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 465 ~~v~~r~~~r~~~~v~~~~a~~~l~~~~~a~~~ 497 (497) ....+++..+++.++++|+++++++++.+--|- T Consensus 243 ~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~~~~ 275 (275) T protein:vir:96 243 KSTALFSDKHYVAYLYDESKVVKITKSASGLGV 275 (275) T ss_pred cCcEEEEeEEEEEEEEcCccEEEEEecccccCC Confidence 467788889999999999999999887766666 No 111 >protein:vir:105334 Length: 276 # NCBI annotation: putative phage major capsid protein # Family: family:all:522 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950669;genbank:gi:119967839;genbank:GeneID:4643213 Probab=99.70 E-value=2.8e-18 Score=116.80 Aligned_cols=268 Identities=16% Similarity=0.160 Sum_probs=180.6 Q ss_pred hccccccccccccchhhhHH-HHHHHHhhhhHHhhcceee----cCCCceEEEEEcCCCccceeccccccccccccccee Q lcl|NC_021309. 151 NPFGSTGTFAPGILPTFLPG-IVEQLFYELSLADLISSRP----VTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFAR 225 (497) Q Consensus 151 ~~~~~~~~~g~~v~p~~~~~-ii~~~~~~~~l~~~~~~~~----~~~~~~~~p~~~~~~~~a~wv~Eg~~~~~s~~~f~~ 225 (497) |+.+ .+.-..+|.||.... +.+.+...+.+.+++.+-+ ..++.+++|.... .+.+.++.||++++..+.++++ T Consensus 1 Ma~~-~T~l~d~i~Pev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~~-igda~~~~eg~~i~~~~lt~~~ 78 (276) T protein:vir:10 1 MAQG-TTTKSTQIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFVY-SGDATVVPEGQKIPVDKIETNR 78 (276) T ss_pred CCcc-eeehhhhhchHHHHHHHHHHHHhhhhhcccceecccccCCCCCEEEeeeecC-CCccccccCCCccCccccccce Confidence 3333 334456788886554 6677777777777776543 3466899999865 3678899999999999999999 Q ss_pred eEeeeeeEEeeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccchhhhhhhH Q lcl|NC_021309. 226 VYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSAT 304 (497) Q Consensus 226 i~~~~~kla~~~~iS~ell~d~-~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~ 304 (497) .+...++.+....++++....+ .+..+.+.+.++..+++++|+.++.-- ...+.. ... T Consensus 79 ~~a~i~~~~k~~~~tD~a~~~~~~dp~~~~~~~~~~~~a~~~d~~~~~~l---------~~~~~~-~~~----------- 137 (276) T protein:vir:10 79 REAKIHKIGKGTDITDEALLSGYGDPQGEAVRQHGLAIANKVDNDVLEAL---------RGTKLT-VSA----------- 137 (276) T ss_pred eeEEeehccccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHH---------hccccc-ccc----------- Confidence 9999999988899999987655 678888999999999999999877310 000000 000 Q ss_pred HHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHH Q lcl|NC_021309. 305 VSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWE 384 (497) Q Consensus 305 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~ 384 (497) .....+.+.++...+.... ....++++||..+. T Consensus 138 ----------------------------------------------~~~t~d~i~~A~~~lgd~~-~~~~~ivv~p~~~~ 170 (276) T protein:vir:10 138 ----------------------------------------------DIGTLAGLEAAIDTFDDED-LEPMVLFINPKDAG 170 (276) T ss_pred ----------------------------------------------cccCHHHHHHHHHHhcccc-CcccEEEEcHHHHH Confidence 0000122223333322222 24567899999999 Q ss_pred HHHHHhhhcCceeccCcccccccccccccccccccceEecCCCCcCceEEEeeccceEEEEeecccEEEeecccchhhhc Q lcl|NC_021309. 385 LLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVD 464 (497) Q Consensus 385 ~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~ 464 (497) .|+++.+.+ ++-.+..... ....+.-.++.|+|||+++.+|.++.|+.. +.++.++...+++++.++... + T Consensus 171 ~L~k~~~~~--f~~~s~~g~~-~~~~G~ig~~~G~~Vi~s~~~p~~t~~l~~--~gAi~~~~~~~~~vE~dRd~~----~ 241 (276) T protein:vir:10 171 KLRSSASDN--FTRATELGDN-IIVKGAFGEALGAVIVRSKKLDEGEAILAK--RGAVKLITKRDFFLETDRDPS----T 241 (276) T ss_pred HHHHhcccc--cccccccccc-ceeccccceecceeEEEcCCCCcceEEEEe--ccceeeeecCCceeecccchh----h Confidence 998754322 2222211111 111222457899999999999999987644 446777777888887766432 3 Q ss_pred CceEEEEEEeecceeecccceEEEEeeCCCCCC Q lcl|NC_021309. 465 GKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 465 ~~v~~r~~~r~~~~v~~~~a~~~l~~~~~a~~~ 497 (497) ....+++...++.++.+|++++++++.+.+.-| T Consensus 242 ~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~~~~ 274 (276) T protein:vir:10 242 KTTALYSDKHYVAYLYDESKAVKVTKGAGTTDS 274 (276) T ss_pred cccEEEEeeEEEEEEEcCcceEEEecCCcCCcC Confidence 467778889999999999999999866533333 No 112 >protein:vir:96123 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240078;genbank:gi:66395742;genbank:GeneID:5133103 Probab=99.68 E-value=1.2e-17 Score=113.35 Aligned_cols=268 Identities=14% Similarity=0.119 Sum_probs=177.6 Q ss_pred hccccccccccccchhhhH-HHHHHHHhhhhHHhhcceeec----CCCceEEEEEcCCCccceeccccccccccccccee Q lcl|NC_021309. 151 NPFGSTGTFAPGILPTFLP-GIVEQLFYELSLADLISSRPV----TSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFAR 225 (497) Q Consensus 151 ~~~~~~~~~g~~v~p~~~~-~ii~~~~~~~~l~~~~~~~~~----~~~~~~~p~~~~~~~~a~wv~Eg~~~~~s~~~f~~ 225 (497) ++... +.-+.+++|+... -+.+.+.....+.++++.-+. .|+.+++|+... .+.+..+.||+.++..+.+++. T Consensus 1 ma~~~-T~~~d~i~Pev~s~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~ip~~~~-~g~~~~~~~g~~i~~~~it~~~ 78 (274) T protein:vir:96 1 MAQGT-TKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTY-SGDAQVIAEGEKIPVDQIGTSK 78 (274) T ss_pred CCccc-cchhhhhhhHHHHHHHHHHHHhhhhhcccccccccccCCCCCEEEEEeecc-CCCccccCCCCcCchhhcccce Confidence 33333 3445678887555 466666666666676655321 356799999863 4567788999999999999999 Q ss_pred eEeeeeeEEeeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccchhhhhhhH Q lcl|NC_021309. 226 VYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSAT 304 (497) Q Consensus 226 i~~~~~kla~~~~iS~ell~d~-~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~ 304 (497) .++..++.+..+.++++....+ .++.+.+.+.+++++++++|..++..-.. ++.. .. T Consensus 79 ~~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~~d~~i~~~l~~---------a~~~-~~------------ 136 (274) T protein:vir:96 79 REAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKG---------ATLT-VE------------ 136 (274) T ss_pred eEEEEEeeeceeeecHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHHHHhc---------CCCC-cC------------ Confidence 9999999887889999886554 57888999999999999999988743110 0000 00 Q ss_pred HHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHH Q lcl|NC_021309. 305 VSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWE 384 (497) Q Consensus 305 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~ 384 (497) ......+.+.++...+.... ....++++||..+. T Consensus 137 ---------------------------------------------~~~~~~d~i~dA~~~l~d~~-~~~~~ivv~p~~~~ 170 (274) T protein:vir:96 137 ---------------------------------------------ADITKLDGLQTAIDKFNDED-LEPMVLFVNPLDAG 170 (274) T ss_pred ---------------------------------------------cccccHHHHHHHHHHhcccC-CCceEEEeCHHHHH Confidence 00001223333333333222 24567899999999 Q ss_pred HHHHHhhhcCceeccCcccccccccccccccccccceEecCCCCcCceEEEeeccceEEEEeecccEEEeecccchhhhc Q lcl|NC_021309. 385 LLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVD 464 (497) Q Consensus 385 ~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~ 464 (497) .|++... .+++-....+. .....+.-+++.|++|++++.+|.++.|+..-. ++.++...+++++..+.. .+ T Consensus 171 ~L~k~~~--~~f~~~~~~g~-~~~~~g~ig~~~G~~Vi~s~~~p~~t~~l~~~g--A~~~~~~~~~~vE~~Rd~----~~ 241 (274) T protein:vir:96 171 GLRTSAS--DNFTRPTQLGD-NIIVKGAFGEALGAVIVRSNKLNKGEALLAKKG--AVKLITKRDFFLEKDRDA----SR 241 (274) T ss_pred HHHhccc--ccccccccccc-cceeecccceecCeeEEEcCCCCcceEEEEeCc--ceeeeecCCcccccccch----hh Confidence 9987532 22322211111 111122345789999999999999998775544 466666777777655533 23 Q ss_pred CceEEEEEEeecceeecccceEEEEeeCCCCCC Q lcl|NC_021309. 465 GKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 465 ~~v~~r~~~r~~~~v~~~~a~~~l~~~~~a~~~ 497 (497) ....+++..+++.++++|+++++++..++=+-- T Consensus 242 ~~d~i~~~~~yg~~~~~~~~vv~~t~~~~~~~~ 274 (274) T protein:vir:96 242 KSTALYSDKHYVAYLYDESKVVKITKGAGDEVM 274 (274) T ss_pred cccEEEEeeEEEEEEEcCccEEEEEcCcccccC Confidence 467888889999999999999999866554444 No 113 >protein:vir:97433 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240749;genbank:gi:66396420;genbank:GeneID:5133789 Probab=99.66 E-value=4.2e-17 Score=110.39 Aligned_cols=268 Identities=17% Similarity=0.142 Sum_probs=176.9 Q ss_pred hccccccccccccchhhh-HHHHHHHHhhhhHHhhcceeec----CCCceEEEEEcCCCccceeccccccccccccccee Q lcl|NC_021309. 151 NPFGSTGTFAPGILPTFL-PGIVEQLFYELSLADLISSRPV----TSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFAR 225 (497) Q Consensus 151 ~~~~~~~~~g~~v~p~~~-~~ii~~~~~~~~l~~~~~~~~~----~~~~~~~p~~~~~~~~a~wv~Eg~~~~~s~~~f~~ 225 (497) |..+ .+.-+.+|+||.. .-+.+.+.....+.+++.+-.. .++.+++|.... .+.+..+.||+.++..+.+++. T Consensus 1 ma~~-~T~~~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~-~g~a~~~~~g~~i~~~~lt~~~ 78 (274) T protein:vir:97 1 MPQG-LTKTSDQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVY-SGDAQVVAEGEKIPTDILETKK 78 (274) T ss_pred CCcc-ceehhheechHHHHHHHHHhhhhhhhhcccceecccccCCCCCEEEEeeecC-CCccccccCCCcccccccccce Confidence 3332 3344567888755 4566777666666677655432 356789999864 4567889999999999999999 Q ss_pred eEeeeeeEEeeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccchhhhhhhH Q lcl|NC_021309. 226 VYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSAT 304 (497) Q Consensus 226 i~~~~~kla~~~~iS~ell~d~-~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~ 304 (497) .++..++.+....++++....+ .++.+.+.+.+++++++++|+.++.--.+. +.. ... T Consensus 79 ~~~~i~~~~~~~~i~D~~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a---------~~~-~~~----------- 137 (274) T protein:vir:97 79 REAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGA---------KLT-VNA----------- 137 (274) T ss_pred eEEEeeeecceecccHHHHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhcc---------Ccc-ccc----------- Confidence 9999999887889999876544 678888999999999999999887431110 000 000 Q ss_pred HHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHH Q lcl|NC_021309. 305 VSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWE 384 (497) Q Consensus 305 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~ 384 (497) .....+.+.++...+.... ....++++||..+. T Consensus 138 ----------------------------------------------~~~~~d~i~dA~~~l~d~~-~~~~~ivv~p~~~~ 170 (274) T protein:vir:97 138 ----------------------------------------------DITKLNGLQSAIDKFNDED-LEPMVLFVNPLDAG 170 (274) T ss_pred ----------------------------------------------cccCHHHHHHHHHHhhccC-CCceEEEeCHHHHH Confidence 0000122333333333222 24567899999999 Q ss_pred HHHHHhhhcCceeccCcccccccccccccccccccceEecCCCCcCceEEEeeccceEEEEeecccEEEeecccchhhhc Q lcl|NC_021309. 385 LLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVD 464 (497) Q Consensus 385 ~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~ 464 (497) .|++ |..-+++-.+..+.. ....+..+++.|++|++++.+|.++.++..-. ++.++...++.++..+... + T Consensus 171 ~L~k--~~~~~f~~~s~~g~~-~~~~G~ig~~~G~~Vi~s~~~p~~t~~l~~~g--A~~~~~~~~~~vE~~Rd~~----~ 241 (274) T protein:vir:97 171 KLRG--DASTNFTRATELGDD-IIVKGAFGEALGAIIVRTNKLEAGTAILAKKG--AVKLILKRDFFLEVARDAS----T 241 (274) T ss_pred HHHh--hhhhhccccCccccc-ceeccccceecCeeEEEcCCCCcceEEEEeCc--ceEeeecCCceeccccchh----h Confidence 9874 433333322221111 11122345789999999999999998876544 5666777777777665432 3 Q ss_pred CceEEEEEEeecceeecccceEEEEeeCCCCCC Q lcl|NC_021309. 465 GKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 465 ~~v~~r~~~r~~~~v~~~~a~~~l~~~~~a~~~ 497 (497) ....+++..+++.++.+|++++++++..+..-- T Consensus 242 ~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~~~~ 274 (274) T protein:vir:97 242 KTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) T ss_pred cccEEEEEEEEEEEEEcCCceEEEecCcccccC Confidence 456788889999999999999999854433222 No 114 >protein:vir:94494 Length: 274 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240676;genbank:gi:66396348;genbank:GeneID:5133758 Probab=99.66 E-value=4.2e-17 Score=110.39 Aligned_cols=268 Identities=17% Similarity=0.142 Sum_probs=176.9 Q ss_pred hccccccccccccchhhh-HHHHHHHHhhhhHHhhcceeec----CCCceEEEEEcCCCccceeccccccccccccccee Q lcl|NC_021309. 151 NPFGSTGTFAPGILPTFL-PGIVEQLFYELSLADLISSRPV----TSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFAR 225 (497) Q Consensus 151 ~~~~~~~~~g~~v~p~~~-~~ii~~~~~~~~l~~~~~~~~~----~~~~~~~p~~~~~~~~a~wv~Eg~~~~~s~~~f~~ 225 (497) |..+ .+.-+.+|+||.. .-+.+.+.....+.+++.+-.. .++.+++|.... .+.+..+.||+.++..+.+++. T Consensus 1 ma~~-~T~~~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~-~g~a~~~~~g~~i~~~~lt~~~ 78 (274) T protein:vir:94 1 MPQG-LTKTSDQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVY-SGDAQVVAEGEKIPTDILETKK 78 (274) T ss_pred CCcc-ceehhheechHHHHHHHHHhhhhhhhhcccceecccccCCCCCEEEEeeecC-CCccccccCCCcccccccccce Confidence 3332 3344567888755 4566777666666677655432 356789999864 4567889999999999999999 Q ss_pred eEeeeeeEEeeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccchhhhhhhH Q lcl|NC_021309. 226 VYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSAT 304 (497) Q Consensus 226 i~~~~~kla~~~~iS~ell~d~-~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~ 304 (497) .++..++.+....++++....+ .++.+.+.+.+++++++++|+.++.--.+. +.. ... T Consensus 79 ~~~~i~~~~~~~~i~D~~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a---------~~~-~~~----------- 137 (274) T protein:vir:94 79 REAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGA---------KLT-VNA----------- 137 (274) T ss_pred eEEEeeeecceecccHHHHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhcc---------Ccc-ccc----------- Confidence 9999999887889999876544 678888999999999999999887431110 000 000 Q ss_pred HHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHH Q lcl|NC_021309. 305 VSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWE 384 (497) Q Consensus 305 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~ 384 (497) .....+.+.++...+.... ....++++||..+. T Consensus 138 ----------------------------------------------~~~~~d~i~dA~~~l~d~~-~~~~~ivv~p~~~~ 170 (274) T protein:vir:94 138 ----------------------------------------------DITKLNGLQSAIDKFNDED-LEPMVLFVNPLDAG 170 (274) T ss_pred ----------------------------------------------cccCHHHHHHHHHHhhccC-CCceEEEeCHHHHH Confidence 0000122333333333222 24567899999999 Q ss_pred HHHHHhhhcCceeccCcccccccccccccccccccceEecCCCCcCceEEEeeccceEEEEeecccEEEeecccchhhhc Q lcl|NC_021309. 385 LLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVD 464 (497) Q Consensus 385 ~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~ 464 (497) .|++ |..-+++-.+..+.. ....+..+++.|++|++++.+|.++.++..-. ++.++...++.++..+... + T Consensus 171 ~L~k--~~~~~f~~~s~~g~~-~~~~G~ig~~~G~~Vi~s~~~p~~t~~l~~~g--A~~~~~~~~~~vE~~Rd~~----~ 241 (274) T protein:vir:94 171 KLRG--DASTNFTRATELGDD-IIVKGAFGEALGAIIVRTNKLEAGTAILAKKG--AVKLILKRDFFLEVARDAS----T 241 (274) T ss_pred HHHh--hhhhhccccCccccc-ceeccccceecCeeEEEcCCCCcceEEEEeCc--ceEeeecCCceeccccchh----h Confidence 9874 433333322221111 11122345789999999999999998876544 5666777777777665432 3 Q ss_pred CceEEEEEEeecceeecccceEEEEeeCCCCCC Q lcl|NC_021309. 465 GKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 465 ~~v~~r~~~r~~~~v~~~~a~~~l~~~~~a~~~ 497 (497) ....+++..+++.++.+|++++++++..+..-- T Consensus 242 ~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~~~~ 274 (274) T protein:vir:94 242 KTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) T ss_pred cccEEEEEEEEEEEEEcCCceEEEecCcccccC Confidence 456788889999999999999999854433222 No 115 >protein:vir:1239 Length: 274 # NCBI annotation: similar to phage B1 major head protein # Family: family:all:522 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510938;genbank:gi:17426272;genbank:GeneID:927376 Probab=99.62 E-value=1.3e-16 Score=107.71 Aligned_cols=265 Identities=16% Similarity=0.128 Sum_probs=173.4 Q ss_pred hccccccccccccchhhhH-HHHHHHHhhhhHHhhcceee----cCCCceEEEEEcCCCccceeccccccccccccccee Q lcl|NC_021309. 151 NPFGSTGTFAPGILPTFLP-GIVEQLFYELSLADLISSRP----VTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFAR 225 (497) Q Consensus 151 ~~~~~~~~~g~~v~p~~~~-~ii~~~~~~~~l~~~~~~~~----~~~~~~~~p~~~~~~~~a~wv~Eg~~~~~s~~~f~~ 225 (497) +..+ .+.-..+|+||... -+.+.+.....+.+++.+-. ..|+.+++|.... .+.+..+.||+.++..+.+.+. T Consensus 1 ma~~-~T~l~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~-ig~a~~~~~g~~i~~~~lt~~~ 78 (274) T protein:vir:12 1 MAQG-LTKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVY-SGDAQVVAEGEKIPTDILETKK 78 (274) T ss_pred CCcc-eeehhhhhchHHHHHHHHHHHHhhhhhcccceecccccCCCCCEEEEeeecC-CCccccccCCCccchhhcccce Confidence 3322 33445678887554 46666666666666665532 2466889999874 4567889999999999999999 Q ss_pred eEeeeeeEEeeehhhHHHHh-hHHHHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccchhhhhhhH Q lcl|NC_021309. 226 VYEQVGKVANALTITDEGLR-DAPELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSAT 304 (497) Q Consensus 226 i~~~~~kla~~~~iS~ell~-d~~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~ 304 (497) .++..++.+....++++... ...++.+.+.+.++++++.++|+.++.--.+.. . +... T Consensus 79 ~~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~q~~~~~a~~vd~~~l~~~~~a~---------~-~~~~----------- 137 (274) T protein:vir:12 79 REAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAK---------L-TVNA----------- 137 (274) T ss_pred eeEEeeeecceeeecHHHHHhcccchHHHHHHHHHHHHHHHHHHHHHHHHhccc---------c-cccc----------- Confidence 99999998888999997654 345778889999999999999998874321100 0 0000 Q ss_pred HHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHH Q lcl|NC_021309. 305 VSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWE 384 (497) Q Consensus 305 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~ 384 (497) .....+.+.++...+.... ....++++||..+. T Consensus 138 ----------------------------------------------~a~~~d~i~dA~~~lgd~~-~~~~~ivv~p~~~~ 170 (274) T protein:vir:12 138 ----------------------------------------------DITKLNGLQSAIDKFNDED-LEPMVLFINPLDAG 170 (274) T ss_pred ----------------------------------------------cccCHHHHHHHHHHhcccc-ccccEEEeCHHHHH Confidence 0001223333333333222 24567899999998 Q ss_pred HHHHHhhhcCceeccCcccccccccccccccccccceEecCCCCcCceEEEeeccceEEEEeecccEEEeecccchhhhc Q lcl|NC_021309. 385 LLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVD 464 (497) Q Consensus 385 ~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~ 464 (497) .|++. ..-+++-....+. .....+.-.++.|+||++++.+|.++.|+.. +.++..+....++++..+... + T Consensus 171 ~L~k~--~~~~fv~~s~~g~-~~~~~G~ig~~~G~~Vi~s~~~p~~t~~l~~--~gA~~~~~~~~~~vE~~Rd~~----~ 241 (274) T protein:vir:12 171 KLRGD--ASTNFTRATELGD-DIIVKGAFGEALGAIIVRSNKLEAGTAILAK--KGAVKLILKRDFFLEVARDAS----T 241 (274) T ss_pred HHHhh--hhhhccccccccc-cceecccceeecCeeEEEeCCCCcceEEEEe--ccceeeeecCCceeccccchh----h Confidence 88753 2222322211111 1111222347899999999999999876543 445667777788887766432 3 Q ss_pred CceEEEEEEeecceeecccceEEEEeeCCCCCC Q lcl|NC_021309. 465 GKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 465 ~~v~~r~~~r~~~~v~~~~a~~~l~~~~~a~~~ 497 (497) ....+++..+++.++++|+++++++. ++|| T Consensus 242 ~~d~i~~~~~y~~~~~~~~~vv~~t~---~~~~ 271 (274) T protein:vir:12 242 KTTALYSDKHYVAYLYDESKAVKITK---GSGS 271 (274) T ss_pred cccEEEeeeEEEEEEEcCCceEEEEc---CCcc Confidence 45688888999999999999999984 4555 No 116 >protein:vir:96262 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240311;genbank:gi:66395978;genbank:GeneID:5133339 Probab=99.61 E-value=2.5e-16 Score=106.15 Aligned_cols=265 Identities=16% Similarity=0.150 Sum_probs=173.9 Q ss_pred hccccccccccccchhhh-HHHHHHHHhhhhHHhhcceee----cCCCceEEEEEcCCCccceeccccccccccccccee Q lcl|NC_021309. 151 NPFGSTGTFAPGILPTFL-PGIVEQLFYELSLADLISSRP----VTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFAR 225 (497) Q Consensus 151 ~~~~~~~~~g~~v~p~~~-~~ii~~~~~~~~l~~~~~~~~----~~~~~~~~p~~~~~~~~a~wv~Eg~~~~~s~~~f~~ 225 (497) +... .+.-..+|+||.. +-+.+.+.....+.+++.+-+ ..|+.+++|.... .+.+..+.||+.++..+.+.+. T Consensus 1 m~~~-~T~l~d~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l~g~~G~tv~iP~~~~-ig~a~~~~~g~~i~~~~lt~~~ 78 (274) T protein:vir:96 1 MAQG-MTKLTNQIVPEVLAPMMQAELEKKLRFASFAEIDNTLVGQPGDTLTFPAFIY-SGDAKVVAEGEKIPTDILETKK 78 (274) T ss_pred CCcc-eeehhheechHHHHHHHHHHHHhhhhccccceecccccCCCCCEEEeeeecC-CCccccccCCCccchhhcccce Confidence 3332 3344567878755 456677766666667655433 2366899999874 3567889999999999999999 Q ss_pred eEeeeeeEEeeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccchhhhhhhH Q lcl|NC_021309. 226 VYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSAT 304 (497) Q Consensus 226 i~~~~~kla~~~~iS~ell~d~-~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~ 304 (497) .++..++.+-.+.++++....+ .++.+.+.+.++++++.++|+.++.--.+... +... T Consensus 79 ~~~~i~~~~~a~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~~----------~~~~----------- 137 (274) T protein:vir:96 79 REAKIRKIAKGTSISDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAKL----------TVEA----------- 137 (274) T ss_pred eEEEeeeeecceeehHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhcccc----------cccc----------- Confidence 9999999877789998875544 57888899999999999999988742211100 0000 Q ss_pred HHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHH Q lcl|NC_021309. 305 VSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWE 384 (497) Q Consensus 305 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~ 384 (497) .....+.+.++...+.... ....++++||..+. T Consensus 138 ----------------------------------------------~~~~~d~i~~A~~~lgd~~-~~~~~ivv~p~~~~ 170 (274) T protein:vir:96 138 ----------------------------------------------DITKLTGLQTAIDKFNDED-LEPMVLFISPLDAG 170 (274) T ss_pred ----------------------------------------------cccCHHHHHHHHHHhcccc-ccccEEEeCHHHHH Confidence 0000122223333322222 23557899999999 Q ss_pred HHHHHhhhcCceeccCcccccccccccccccccccceEecCCCCcCceEEEeeccceEEEEeecccEEEeecccchhhhc Q lcl|NC_021309. 385 LLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVD 464 (497) Q Consensus 385 ~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~ 464 (497) .|++ +..-+++-....+. .....+.-+++.|++|++++.+|.++.++.. +.++..+....++++..+.. .+ T Consensus 171 ~L~k--~~~~~f~~~s~~g~-~~~~~G~ig~~~G~~Vi~s~~~~~~t~~l~~--~gA~~~~~~~~~~vE~~Rd~----~~ 241 (274) T protein:vir:96 171 KLRG--DATTNFTRATELGD-DVIVKGAFGEALGAVIVRSNKLEAGTAILAK--KGAVKLITKRDFFLETDRDP----ST 241 (274) T ss_pred HHHh--hccccccccccccc-cceeccccceecCeEEEEeCCCCCceEEEEe--ccceeeeecCCccccccccc----cc Confidence 8875 33223332221111 1111223457899999999999998876643 34566666777777766543 24 Q ss_pred CceEEEEEEeecceeecccceEEEEeeCCCCCC Q lcl|NC_021309. 465 GKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 465 ~~v~~r~~~r~~~~v~~~~a~~~l~~~~~a~~~ 497 (497) ....+++..+++.++++|+++++++ ..+|| T Consensus 242 ~~d~i~~~~~y~~~~~~~~~~v~~t---k~~~~ 271 (274) T protein:vir:96 242 KTTALYSDKHYVAYLYDESKAVKIT---KGSGS 271 (274) T ss_pred ccCEEEEeEEEEEEEEcCCcEEEEE---cCCcc Confidence 5678888899999999999999988 56777 No 117 >protein:vir:95898 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240385;genbank:gi:66396054;genbank:GeneID:5133409 Probab=99.61 E-value=2.5e-16 Score=106.15 Aligned_cols=265 Identities=16% Similarity=0.150 Sum_probs=173.9 Q ss_pred hccccccccccccchhhh-HHHHHHHHhhhhHHhhcceee----cCCCceEEEEEcCCCccceeccccccccccccccee Q lcl|NC_021309. 151 NPFGSTGTFAPGILPTFL-PGIVEQLFYELSLADLISSRP----VTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFAR 225 (497) Q Consensus 151 ~~~~~~~~~g~~v~p~~~-~~ii~~~~~~~~l~~~~~~~~----~~~~~~~~p~~~~~~~~a~wv~Eg~~~~~s~~~f~~ 225 (497) +... .+.-..+|+||.. +-+.+.+.....+.+++.+-+ ..|+.+++|.... .+.+..+.||+.++..+.+.+. T Consensus 1 m~~~-~T~l~d~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l~g~~G~tv~iP~~~~-ig~a~~~~~g~~i~~~~lt~~~ 78 (274) T protein:vir:95 1 MAQG-MTKLTNQIVPEVLAPMMQAELEKKLRFASFAEIDNTLVGQPGDTLTFPAFIY-SGDAKVVAEGEKIPTDILETKK 78 (274) T ss_pred CCcc-eeehhheechHHHHHHHHHHHHhhhhccccceecccccCCCCCEEEeeeecC-CCccccccCCCccchhhcccce Confidence 3332 3344567878755 456677766666667655433 2366899999874 3567889999999999999999 Q ss_pred eEeeeeeEEeeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccchhhhhhhH Q lcl|NC_021309. 226 VYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSAT 304 (497) Q Consensus 226 i~~~~~kla~~~~iS~ell~d~-~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~ 304 (497) .++..++.+-.+.++++....+ .++.+.+.+.++++++.++|+.++.--.+... +... T Consensus 79 ~~~~i~~~~~a~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~~----------~~~~----------- 137 (274) T protein:vir:95 79 REAKIRKIAKGTSISDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAKL----------TVEA----------- 137 (274) T ss_pred eEEEeeeeecceeehHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhcccc----------cccc----------- Confidence 9999999877789998875544 57888899999999999999988742211100 0000 Q ss_pred HHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHH Q lcl|NC_021309. 305 VSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWE 384 (497) Q Consensus 305 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~ 384 (497) .....+.+.++...+.... ....++++||..+. T Consensus 138 ----------------------------------------------~~~~~d~i~~A~~~lgd~~-~~~~~ivv~p~~~~ 170 (274) T protein:vir:95 138 ----------------------------------------------DITKLTGLQTAIDKFNDED-LEPMVLFISPLDAG 170 (274) T ss_pred ----------------------------------------------cccCHHHHHHHHHHhcccc-ccccEEEeCHHHHH Confidence 0000122223333322222 23557899999999 Q ss_pred HHHHHhhhcCceeccCcccccccccccccccccccceEecCCCCcCceEEEeeccceEEEEeecccEEEeecccchhhhc Q lcl|NC_021309. 385 LLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVD 464 (497) Q Consensus 385 ~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~ 464 (497) .|++ +..-+++-....+. .....+.-+++.|++|++++.+|.++.++.. +.++..+....++++..+.. .+ T Consensus 171 ~L~k--~~~~~f~~~s~~g~-~~~~~G~ig~~~G~~Vi~s~~~~~~t~~l~~--~gA~~~~~~~~~~vE~~Rd~----~~ 241 (274) T protein:vir:95 171 KLRG--DATTNFTRATELGD-DVIVKGAFGEALGAVIVRSNKLEAGTAILAK--KGAVKLITKRDFFLETDRDP----ST 241 (274) T ss_pred HHHh--hccccccccccccc-cceeccccceecCeEEEEeCCCCCceEEEEe--ccceeeeecCCccccccccc----cc Confidence 8875 33223332221111 1111223457899999999999998876643 34566666777777766543 24 Q ss_pred CceEEEEEEeecceeecccceEEEEeeCCCCCC Q lcl|NC_021309. 465 GKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 465 ~~v~~r~~~r~~~~v~~~~a~~~l~~~~~a~~~ 497 (497) ....+++..+++.++++|+++++++ ..+|| T Consensus 242 ~~d~i~~~~~y~~~~~~~~~~v~~t---k~~~~ 271 (274) T protein:vir:95 242 KTTALYSDKHYVAYLYDESKAVKIT---KGSGS 271 (274) T ss_pred ccCEEEEeEEEEEEEEcCCcEEEEE---cCCcc Confidence 5678888899999999999999988 56777 No 118 >protein:vir:95107 Length: 270 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240822;genbank:gi:66394683;genbank:GeneID:5133901 Probab=99.46 E-value=2.1e-14 Score=95.55 Aligned_cols=260 Identities=13% Similarity=0.084 Sum_probs=168.2 Q ss_pred hccccccccccccchhhhHH-HHHHHHhhhhHHhhcceeec----CCCceEEEEEcCCCccceeccccccccccccccee Q lcl|NC_021309. 151 NPFGSTGTFAPGILPTFLPG-IVEQLFYELSLADLISSRPV----TSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFAR 225 (497) Q Consensus 151 ~~~~~~~~~g~~v~p~~~~~-ii~~~~~~~~l~~~~~~~~~----~~~~~~~p~~~~~~~~a~wv~Eg~~~~~s~~~f~~ 225 (497) |+.+ .-..+|+||.... +.+.+.....+.+++.+-+. .|..+++|.... .+.+.-+.||++++..+.++++ T Consensus 1 Ma~T---~~~d~I~Pev~~~~V~e~~~~~~~~~~~~~~d~~L~g~~G~ti~~P~~~~-igdae~~~eg~~i~~~~lt~~~ 76 (270) T protein:vir:95 1 MTQT---KKANLINPEVLANVVSAQMQNAIRFTPYAVTDDTLVGQPGDTITRPKYAY-IGAAEDLQEGVAMDTTQMSMTT 76 (270) T ss_pred CCce---ehhhhcchHHHHHHHHHHHHhHHhhccccccccccCCCCCCEEEeeeecC-CCccccccCCCccchhhcccch Confidence 3322 2234788886665 55666666667777765432 456789999874 5677889999999999999999 Q ss_pred eEeeeeeEEeeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccchhhhhhhH Q lcl|NC_021309. 226 VYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSAT 304 (497) Q Consensus 226 i~~~~~kla~~~~iS~ell~d~-~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~ 304 (497) -....++.+-...++++....+ .+....+.+.++..+++++|+.++.- .+|.....+ .. T Consensus 77 ~~a~i~~~gk~~~itD~a~~~~~~dp~~~~~~q~a~~~a~~~d~~li~~-----l~~a~~~~~-------~~-------- 136 (270) T protein:vir:95 77 TKVTVKETGKAVEVTQTAIITNVNGTLQEASRQLAMSLADKVEIDYIAE-----LNKSKQTAT-------VS-------- 136 (270) T ss_pred heeeeehhhCcceecHHHHhhhccchHHHHHHHHHHHHHHHHHHHHHHH-----hcccccccc-------cc-------- Confidence 9999999988899999976554 45677788889999999999887621 111100000 00 Q ss_pred HHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHH Q lcl|NC_021309. 305 VSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWE 384 (497) Q Consensus 305 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~ 384 (497) ...+.+.++...+... .....++++||.++. T Consensus 137 ------------------------------------------------~t~~~~~dA~~~lgd~-~~~~~~i~vhs~~~~ 167 (270) T protein:vir:95 137 ------------------------------------------------ADATGILDAIEVFNSE-NDEDYVLYVNPKDYN 167 (270) T ss_pred ------------------------------------------------cCHHHHHHHHHHhccc-cCCCcEEEEcHHHHH Confidence 0001112222222111 234567999999999 Q ss_pred HHHHHhhhcCceeccCcccccccccccccccccccceEecCCCC-cCceEEEeeccceEEEEeecccEEEeecccchhhh Q lcl|NC_021309. 385 LLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIP-LGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFV 463 (497) Q Consensus 385 ~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~-~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~ 463 (497) .|++...-. .. ..+......+.-+++.|++||+++.+| .++.|+ |...++.++...++.++.++... T Consensus 168 ~Lrk~~~~~----~~--~~~~~~~~~G~ig~~~G~~Viv~s~~~~~~~~~l--~~~gAi~~~~~~~~~vEtdRd~~---- 235 (270) T protein:vir:95 168 KLVKSLFKV----GG--NVQDRAISKGDLVEIVGVSDIVKSKRVSENTAFL--QRYGAMEIVNKKKPEAYTDFDIL---- 235 (270) T ss_pred HHHhhhccc----cc--ccccchhcccccceecceeEEEeCCCCCceeEEE--EeccceeeeecCCceeeeccchh---- Confidence 998643111 10 011111111234578999998877665 455554 34556778887888877666432 Q ss_pred cCceEEEEEEeecceeecccceEEEEeeCCCCCC Q lcl|NC_021309. 464 DGKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 464 ~~~v~~r~~~r~~~~v~~~~a~~~l~~~~~a~~~ 497 (497) +....+.+..+++.++.+|+.+++++++ ..|| T Consensus 236 ~~~d~i~~~~~y~v~~~~~skvv~~t~~--~a~~ 267 (270) T protein:vir:95 236 KRTHLLSTNYHYSVNLKDETGVVKVTFK--PSGS 267 (270) T ss_pred hcccEEEeeeEEEEEEEccceEEEEEec--CCCC Confidence 3456778889999999999999999985 4455 No 119 >protein:vir:97255 Length: 310 # NCBI annotation: hypothetical protein ORF017 # Family: family:all:1120 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294525;genbank:gi:149408246;genbank:GeneID:5237120 Probab=99.44 E-value=1.1e-13 Score=91.62 Aligned_cols=282 Identities=13% Similarity=0.153 Sum_probs=174.1 Q ss_pred hccccccccccccchhhhHHHHHHHHhhhhHHhhcceeecCCCceEEEEEcCCCccc-----eeccccccccccccccee Q lcl|NC_021309. 151 NPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNA-----AAVAEAGTYPFSSEEFAR 225 (497) Q Consensus 151 ~~~~~~~~~g~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a-----~wv~Eg~~~~~s~~~f~~ 225 (497) +...+-...+-..+......||+.+...+.|++.++..++.++.+.|.+..... .+ .|-.-....+.+..+|++ T Consensus 1 mpaltLaea~k~~~d~l~~~ViE~~~~~s~lL~~LpF~~veg~~~~ynR~~~~~-~~~~~~v~~~~~~~g~~~~~~t~~~ 79 (310) T protein:vir:97 1 MASVTLAESAKLAQDELVAGVIENIITVNRMFDVLPFDSIEGNSLAYNRENVLG-DVIMAGVGTTFSGAGAGKAAATFTK 79 (310) T ss_pred CcccchHHHhhcCcchHHHHHHHHHhccchHHHhCCcccccCCcceeeEeeccC-CcccccccccccCCCccccccccce Confidence 221222222333444567889999999999999999999999999999886532 22 232223444678899999 Q ss_pred eEeeeeeEEeeehhhHHHHhh--H-H-HHHHHHHHHHHHHHHHHHHhhhhcccCccc-cccccccccccccccccchhhh Q lcl|NC_021309. 226 VYEQVGKVANALTITDEGLRD--A-P-ELFNFVQGRLLEGIQRKEEVQLLAGGGYPG-VNGLLQRSTGFTASSASSLFGA 300 (497) Q Consensus 226 i~~~~~kla~~~~iS~ell~d--~-~-~l~~~i~~~la~~~~~~~d~~~l~G~G~~~-p~Gi~~~~~~~~~~~~~~~~~~ 300 (497) ++...+.+.+.+.|.+.+.+- + + +...+=.....++++.+.+..++||+.++. +.|++........-.+. T Consensus 80 ~~~~L~i~~g~~~Vd~~i~dl~~~~~~dq~~~Ql~~~iea~~~~~e~~lINGD~a~n~F~GL~~~~~~~q~i~~~----- 154 (310) T protein:vir:97 80 VNSNLTTIMGDAEVNGLIQATRSGDGNDQTAVQIASKAKSAGRKYQDQLINGNGAGNEFAGLIQLCASGQKATTG----- 154 (310) T ss_pred eeeeeeeeeehhhhhhHHHhhhcCChHHHHHHHHHHHHHHHHHHHHHHhhccccCCCcccchhhcCCccceeecC----- Confidence 999999999999999876542 2 3 444444455678999999999999998654 55877654321100000 Q ss_pred hhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEech Q lcl|NC_021309. 301 TSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNP 380 (497) Q Consensus 301 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~ 380 (497) ..++. ++ -+.+|-++.. + ......++++++|+ T Consensus 155 ---------------------------------------~~gg~----~t-~d~LDeLl~~---v-~~~~g~p~~~l~~~ 186 (310) T protein:vir:97 155 ---------------------------------------ATGSA----IS-FAILDELMDL---V-VDKDGQVDYLTMHA 186 (310) T ss_pred ---------------------------------------CCCCC----CC-HHHHHHHHHH---H-hcCCCCCCEEEecH Confidence 00000 01 0112222222 1 11233567899999 Q ss_pred hHHHHHHHHh-hhcCceeccCcccccccccccccccccccceEecCCCCcC----------ceEEEeecc-----ceEEE Q lcl|NC_021309. 381 RDWELLRLTK-DANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLG----------TILVGHFAP-----SVIQT 444 (497) Q Consensus 381 ~~~~~l~~lk-d~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~----------~~~~gd~~~-----~~~~i 444 (497) .+..+|+-+. ..+++.++... ....|.++ .++.|+|++.++.+|.+ .+|..-|.. +...+ T Consensus 187 ~~~r~i~A~~R~~~~~g~~~~~-~~~~G~~v---~~~~GiPi~~~d~ip~~~~~~~~~gtTsIya~r~Ge~~~~~Gv~Gl 262 (310) T protein:vir:97 187 RTLRSYKALLRALGGASINEVV-ELPSGAEV---PAYSGTPIFRNDYIPTNQTKGGTTGCTTIFAGTLDDGSRTHGIAGL 262 (310) T ss_pred HHHHHHHHHHHHhcCCCCCCcc-ccCCCCEE---eeeCCeEEEEeCccCCCccccccCCceeEEEEeeCccccccceecc Confidence 9988887554 44445554432 22344443 36789999999999853 144444332 22222 Q ss_pred Ee--ecccEEEeecccchhhhcCceEEEEEEeecceeecccceEEEEeeCC Q lcl|NC_021309. 445 AR--REGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKG 493 (497) Q Consensus 445 ~~--r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~~ 493 (497) .. ..++++..-.+.. .++-..+|++++++.+|..|.|+.+|.-..- T Consensus 263 ~~~~~~glsVr~~G~~~---~~~v~~~~V~~Y~~~av~~~~A~a~L~~V~~ 310 (310) T protein:vir:97 263 TATQAAGIQVVDVGESE---DSDEHIWRVKWYCGLALFSEKGLACADGITN 310 (310) T ss_pred ccCCccceeEEeCCccc---CCcceeEEEEEeeeEEEecccceeeeccccC Confidence 11 2234444322111 3567889999999999999999999976655 No 120 >protein:vir:8324 Length: 410 # NCBI annotation: gp41 # Family: family:all:30827 # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817892;genbank:gi:29566325;genbank:GeneID:1259520 Probab=99.43 E-value=1.7e-14 Score=96.05 Aligned_cols=388 Identities=11% Similarity=0.081 Sum_probs=180.6 Q ss_pred CchHHHHHHHHH-HHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHHHHHhHhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021309. 1 MPSTAQLEAQGR-QLAKSIKDIN-------ADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDG 72 (497) Q Consensus 1 ~~~~a~~~~~~~-~~~~~~~~~~-------~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~a~~~~ 72 (497) |---.+.+-+-. +..+.|+++. +=+...++..|+...+...+.+......+.. -|...++..+..+.+. T Consensus 1 ~~n~t~a~d~~~RR~~~~L~~~EvSvv~~PAY~nA~vt~vRe~e~~~~~e~~~~~e~~en~---~e~~~~~~~~~~E~Rs 77 (410) T protein:vir:83 1 MGNATTASDEYIRRLENELREKESLVRGIYDRANASNRDVNEEEGQMVAECRGRMEQIKNQ---MEQAQEVNRIAFETRS 77 (410) T ss_pred CCCcccchhhHHHHHHHHhhhhheeeeccccccccccccchhhhccccccccCcccchhhh---hHHHHHHHHHHHHHHH Confidence 543333332222 2222222221 1111112222333222222222111111111 1222223333333333 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHhHhhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhc Q lcl|NC_021309. 73 LDNDIPEVEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNP 152 (497) Q Consensus 73 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 152 (497) +...+...-.. .-+......-+.+. ........-........+ ....... ..... T Consensus 78 ~~~~i~~~~~~-------~r~~p~~~~veyRS--------aGE~lkal~~~~~Gd~~A----~~~~e~~------r~a~~ 132 (410) T protein:vir:83 78 KGQAVDAAISA-------MRGSPVGTEVEYRS--------AGEYMLDMWNSAQGNASA----ADRLEVY------ARAAD 132 (410) T ss_pred HHHHHHhhhcc-------CcCCCCCCCccccc--------HHHHHHHHhccCCchHHH----HHHHHHH------HHhhc Confidence 33222110000 00000000001111 000000000000000000 0000000 01122 Q ss_pred cccccccccccchhhhHHHHHHHHhhhhHHhhcceeecCCCceEEEEEcCCCccc-------eeccccccccccccccee Q lcl|NC_021309. 153 FGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNA-------AAVAEAGTYPFSSEEFAR 225 (497) Q Consensus 153 ~~~~~~~g~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a-------~wv~Eg~~~~~s~~~f~~ 225 (497) ...++.-.++|+|+++.+.|+.+.+..+|..++...|..+.+++||..+.. ... ..-.||+..+..+.+|+. T Consensus 133 ~~~Tgd~~~~i~~~~v~d~i~li~q~r~i~slf~tLP~~g~T~eY~v~t~~-~tV~~q~~~~kqa~EGd~L~~gKl~~~t 211 (410) T protein:vir:83 133 HQKTGDLQGVIPDPIVGPVIDFIDSARPLVSTLGTLPLNNATFYRPIVSQR-PAVGLQGVAGGASDEKTELDSQKMVIDR 211 (410) T ss_pred cCcccccccccchhHhhhHHHHHhhccchhhhhhhCCCCCCeeEEeeeccc-ccccccccccccccccccccccceeeee Confidence 234444456788889999999999999999999899999999999887653 332 224589999999999999 Q ss_pred eEeeeeeEEeeehhhHHHHhh-HHHHHHHHHHHHHHHHHHHHHhh---hhcccCccccccccccccccccccccchhhhh Q lcl|NC_021309. 226 VYEQVGKVANALTITDEGLRD-APELFNFVQGRLLEGIQRKEEVQ---LLAGGGYPGVNGLLQRSTGFTASSASSLFGAT 301 (497) Q Consensus 226 i~~~~~kla~~~~iS~ell~d-~~~l~~~i~~~la~~~~~~~d~~---~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~ 301 (497) .+...++++++..+||+.++- +....+...+-|..+.+.+-+.+ +|+++=++ ....+ T Consensus 212 ~tA~ikTyGGyt~LSRQ~IERs~v~~L~~~lraL~~AYA~atea~vra~L~~t~t~--------~~a~~----------- 272 (410) T protein:vir:83 212 LTVNAKTLGGYVNVSRQAIDFSSPSALDLVVNGLGQQYAIETEALVGAALASTSTG--------AVGYG----------- 272 (410) T ss_pred ccceeehhcCcccccceeeecCChhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh--------hhhhh----------- Confidence 999999999999999999974 45555666666766666555543 33322110 00000 Q ss_pred hhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhh-hccCCceEEech Q lcl|NC_021309. 302 SATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLT-LFQTPNAVVMNP 380 (497) Q Consensus 302 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~n~ 380 (497) ...+. .+...+.++...+..+ ....-..+.++| T Consensus 273 -------~~Tad---------------------------------------~~~~~i~da~~~v~da~~~~~~~~i~vS~ 306 (410) T protein:vir:83 273 -------NATAD---------------------------------------NVASAIWQAAGAVYTAVKGMGRLVIAIAP 306 (410) T ss_pred -------hccHH---------------------------------------HHHHHHHHHHHHHhhhhccceeeeEEech Confidence 00000 0000000000000000 011112234444 Q ss_pred hHHHHHHHHhhhcCceeccCccccccccc--ccccccccccceEecCCCCcCceEEEeeccceEEEEeecccEEEeeccc Q lcl|NC_021309. 381 RDWELLRLTKDANGQYMGGNFFGNAYGNP--VNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSN 458 (497) Q Consensus 381 ~~~~~l~~lkd~~G~~i~~~~~~~~~~~~--~~~~~~l~G~Pvv~~~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~ 458 (497) ..+..+..+- .++++.|..-.+ ....+ ..-...|+|+||++.+..++++++|.|-. ++..++...-.++..+.+ T Consensus 307 DVl~~~~~~f-~~~~~~~~dt~G-fg~~~lg~gi~G~~~~ipVvm~~~a~AgTA~f~~~~--Ai~~~eS~~gp~qL~d~~ 382 (410) T protein:vir:83 307 DVLGDFGPLF-APVNPTNAHSTG-FEAGRFGQGVMGSISGIPVVMSAALGSGDAYLFSTA--AIECFEQRVGTLQVVEPS 382 (410) T ss_pred hhhhhcccee-eccCCCCccccc-ccccccccchhhhhcccceEEecCCCcCeeeEeccc--eeeeeecCCceeEeeCCc Confidence 4433322221 111122211100 00001 11245689999999999999999997654 688888776556666554 Q ss_pred chhhhcCceEEEEEEeecceeecccceEEEEee Q lcl|NC_021309. 459 GTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLK 491 (497) Q Consensus 459 ~~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~~ 491 (497) -...+++.- .++.+.+..+.+++=|..+ T Consensus 383 i~nLt~~yS-----gY~a~a~~~~~gliPv~g~ 410 (410) T protein:vir:83 383 VFGLQVAYA-----GYFSTLVVNEDAIVPLVGS 410 (410) T ss_pred hhhhhhhhe-----eeeeeccccccceeeeccC Confidence 433333332 6678899999999866554 No 121 >protein:vir:79928 Length: 393 # NCBI annotation: major head protein # Family: family:all:30335 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429616;genbank:gi:156564106;genbank:GeneID:5525693 Probab=99.42 E-value=8.1e-14 Score=92.37 Aligned_cols=354 Identities=16% Similarity=0.156 Sum_probs=179.8 Q ss_pred HHHHHHhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHhHhhhhhhhhhhhhHHHHH Q lcl|NC_021309. 43 FKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEVEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSA 122 (497) Q Consensus 43 ~~~~~~~~~~~e~~~e~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 122 (497) ++.-.+++...--..-..+++.+ +..++++.+... +.+ .++.+. ....... T Consensus 1 ~~~~~~~~~~~~~~~~~~~e~k~-------lr~~me~~et~~---------e~~--------~~~~~~-----~~~e~el 51 (393) T protein:vir:79 1 MENWLKQLKESGFTETQVQEQKS-------LRTRMERGETLA---------EAD--------ANKLAL-----NEEETQI 51 (393) T ss_pred CchHHHHHHhccCchhHHHHHHH-------HHHHhhhhhhhh---------hhh--------hhhhhc-----chhHHHH Confidence 11111111000000111111111 111111111000 000 000000 0000000 Q ss_pred HhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccccccccchhhhHHHH-HHHHhhhhHHhhcceeec-CCCceEEEEE Q lcl|NC_021309. 123 KAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIV-EQLFYELSLADLISSRPV-TSPNLSYLTE 200 (497) Q Consensus 123 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~p~~~~~ii-~~~~~~~~l~~~~~~~~~-~~~~~~~p~~ 200 (497) ...+ . ....+. ......+...--++..|..++|..+..++ +...+-.....++..+.. .|.+..+|.. T Consensus 52 ~E~f-----~----Kmm~G~-~p~~eV~~~e~mtt~~a~IliP~vis~v~~Eaaepl~~~~kl~qk~~L~~Grsm~F~~~ 121 (393) T protein:vir:79 52 LESF-----A----KMMEGE-TPTNEVNLREFMATPSAQILIPRVIVGTMREAAEPLYIGTKMLQKIRLKSGQSMIFPSI 121 (393) T ss_pred HHHH-----H----HHhcCC-CchhheehhhhhcCCCcceechhhhhhhhhhcccchhHHHHHHHHHhhhcCcceeccch Confidence 0000 0 000000 01111111111234445677776655544 444444455566666666 3445555543 Q ss_pred cCCCccceecccccccccc---cccceeeEeeeeeEEeeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhcccCc Q lcl|NC_021309. 201 SAAHNNAAAVAEAGTYPFS---SEEFARVYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGY 276 (497) Q Consensus 201 ~~~~~~a~wv~Eg~~~~~s---~~~f~~i~~~~~kla~~~~iS~ell~d~-~~l~~~i~~~la~~~~~~~d~~~l~G~G~ 276 (497) . .--+.-|+||+..|+. ..+|+.|+++..|.+..+.+|+||+.|+ .++.++.-....++++++.+...+++.-+ T Consensus 122 g--~~Ra~~IgEGgE~~~~sld~~T~dsv~~~~gK~G~~Ia~SqEmIsDSg~Dvin~~l~aA~RaMaRkKee~a~n~fk~ 199 (393) T protein:vir:79 122 G--IMRAYDVAEGQEIPEDSIDWQTHESPEIRVGKSGIRLRFTDEMISDSQWDLMSMMIKQAGRAMGRHKEQKAYHQFRS 199 (393) T ss_pred h--eeeeccccccccccccchhhhcCCceeEEechhhhhhhhHHHHhhcchHHHHHHHHHHHHHHHHhhhHHHHHhhhhc Confidence 2 2346679999999874 3689999999999999999999999998 58999999999999999999999998765 Q ss_pred cc---cccccccccccccccccchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhh Q lcl|NC_021309. 277 PG---VNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAE 353 (497) Q Consensus 277 ~~---p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 353 (497) ++ ..++.+.+.+- ..|.......+..- T Consensus 200 ~ghtvfDa~st~t~ah--------------------------------------------------ptGr~~~~~qNGTl 229 (393) T protein:vir:79 200 HGHTVFDNYSTNKLAH--------------------------------------------------TTGLDKNGVQNDTF 229 (393) T ss_pred ccceeeeccccCccce--------------------------------------------------eecCCccccccccc Confidence 43 23322221110 01111111122223 Q ss_pred hhhHHHHHHHhhhhhhccCCceEEechhHHHHHHHHhhhcCceeccCcccccccccccccccccc-----------cceE Q lcl|NC_021309. 354 IAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWG-----------VPVV 422 (497) Q Consensus 354 ~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G-----------~Pvv 422 (497) ..+++.+...++.++. .++++++|||--|+.+.+-.-=.+-|. .+++ -++..+..+....| +-|+ T Consensus 230 SleDllDm~~av~~~h-yt~svi~MHPLAWnv~AKna~me~~~~--na~g-N~~~~~~~ts~algp~~i~~~~~~nlnv~ 305 (393) T protein:vir:79 230 SAEDFLDLIIAVMANE-YTPSDLMMHPLAWTVFAKNELMGSLQA--NPYG-NYPAKGAPSSMALGPDSIQGRLPFNFNVN 305 (393) T ss_pred cHHHHHHHHHHHhccc-CCcceEEEcCchhhhhhhhhhhcceee--cccc-ccCccccchhhhhchhhhccccccceeEE Confidence 3567778877777665 467889999999999986532222221 1111 12222233334444 6789 Q ss_pred ecCCCCcCceEEEeeccceEEEEeecccEEE-------eecccchhhhcCceEEEEEEeecceeecc-cceEE---EEee Q lcl|NC_021309. 423 TTPLIPLGTILVGHFAPSVIQTARREGVTMQ-------MTNSNGTDFVDGKVTVRAEERLGLLVYRP-SAFQL---IQLK 491 (497) Q Consensus 423 ~~~~~~~~~~~~gd~~~~~~~i~~r~~~~i~-------~~~~~~~~f~~~~v~~r~~~r~~~~v~~~-~a~~~---l~~~ 491 (497) +++.+|-++.- .++.|..+++....|. ++..+ +=..|...++..+|+|+.|++. .||.. +++. T Consensus 306 ~sPfvp~d~k~----~rFd~~~Vd~NnvgvlLV~D~i~tdq~d--dk~rdiq~iKl~ERYG~gvLn~gkaiavakNI~~~ 379 (393) T protein:vir:79 306 LSPFIPLDKKS----RRFDVYAVDRNNVGVLLVRDDLKTDQWD--EKARGLQNIKMIERYGIGILNEGKAIAVAKNISMD 379 (393) T ss_pred Eeccccccccc----ceeeEEEeecCCceEEEEecCcceeccc--cccccceeeeeeeeeceeeeeCCceEEEEecceee Confidence 99999854320 1112333344444332 22221 2347889999999999999986 44443 4444 Q ss_pred CCCCCC Q lcl|NC_021309. 492 KGATGS 497 (497) Q Consensus 492 ~~a~~~ 497 (497) ..-... T Consensus 380 k~y~~P 385 (393) T protein:vir:79 380 KSYAEP 385 (393) T ss_pred cccccc Confidence 443333 No 122 >protein:vir:108211 Length: 318 # NCBI annotation: gp9 # Family: family:all:6420 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552338;genbank:gi:160700658;genbank:GeneID:5758931 Probab=99.42 E-value=1.9e-14 Score=95.87 Aligned_cols=296 Identities=13% Similarity=0.068 Sum_probs=171.8 Q ss_pred hhh-hccccccccccccc------hhhhHHHHHHHHhhhhHHhh-ccee-ecCCCceEEEEEcC--CCccceeccccccc Q lcl|NC_021309. 148 IGQ-NPFGSTGTFAPGIL------PTFLPGIVEQLFYELSLADL-ISSR-PVTSPNLSYLTESA--AHNNAAAVAEAGTY 216 (497) Q Consensus 148 ~~~-~~~~~~~~~g~~v~------p~~~~~ii~~~~~~~~l~~~-~~~~-~~~~~~~~~p~~~~--~~~~a~wv~Eg~~~ 216 (497) ... ....++..++.+-+ |++++.-+..+.....|.+. .+.+ ..+++.+.|-+... ..+.+.-|+|++.+ T Consensus 1 ~~~~~~i~s~~~~~~itv~~ll~~P~~I~~~i~e~~~~~~iad~lf~~~~a~~~~~v~f~~~~p~~~~~d~e~VaEggEi 80 (318) T protein:vir:10 1 MTAPTGIVSVSDGPAITVRELVGNPLWIPTALKKMMVNQFISESLFRNGGANPNGVVAYNEGNPSFLEDDVADVAEFGEI 80 (318) T ss_pred CCCCCcceeeecCCceehHHhhCCchhHHHHHHHHHhccchhhhhhhcccccccceeEEEecccccccCcHhhccCcccc Confidence 011 11112223333322 55555544444455555443 3433 33455566644332 13567789999999 Q ss_pred ccccccceeeEe-eeeeEEeeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccc Q lcl|NC_021309. 217 PFSSEEFARVYE-QVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSA 294 (497) Q Consensus 217 ~~s~~~f~~i~~-~~~kla~~~~iS~ell~d~-~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~ 294 (497) |.+.+.++.-.+ ..+|.+.-+.||+|++..+ .+..+.....++..+.+..|...+.- +..+...+.+.. T Consensus 81 P~~~~~~G~~~ia~~~K~G~~~~vS~Em~~~n~~~~v~r~~~~l~Nti~r~~d~~a~da---------l~sa~t~~~~~s 151 (318) T protein:vir:10 81 PVSAGARGLPRTAFAVKKALGVRVSKEMIDENRVGAVNDQMLQLRNTFIRANDRSAKAL---------LQSPIVPTLAVP 151 (318) T ss_pred cccCCCCCchhhhhhehhccceeccHHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHH---------HhccccccccCC Confidence 999999988877 5579999999999999876 67777778889999999999876632 111111111111 Q ss_pred cchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCc Q lcl|NC_021309. 295 SSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPN 374 (497) Q Consensus 295 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 374 (497) ..+............+ .. .+ .....+............+++.++ T Consensus 152 ~~w~~~~~~~~d~~~A--------~e---~v-------------------------~~a~~~~~~a~~~~~~~~~GY~pd 195 (318) T protein:vir:10 152 TAWDNGGKVRTDIAIA--------IE---QI-------------------------STAAPTAYPAGVGSSDEYFGFIPD 195 (318) T ss_pred cCCCCcccccccchhh--------hh---hh-------------------------hhhhhhhhhhhhhhhhhccCccce Confidence 1111000000000000 00 00 000001111112233356788899 Q ss_pred eEEechhHHHHH------HHHhhhcCceeccCcccccccccccccccccccceEecCCCCcCceEEEeeccceEEEEeec Q lcl|NC_021309. 375 AVVMNPRDWELL------RLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARRE 448 (497) Q Consensus 375 ~~~~n~~~~~~l------~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~~~~gd~~~~~~~i~~r~ 448 (497) .++|||.+|..| +.+-..++.+++...... ..-+..++|+-|+.++.+|.+++++.+-.... .+.|.. T Consensus 196 tIVlhP~~~~~l~~n~~~~~~y~~~a~~~~~~~~~t-----g~~~g~~lGl~vi~s~~~p~~~alvlq~g~vG-~~~d~~ 269 (318) T protein:vir:10 196 TIVMHYALLPILMDNENFMKVYERNANYVSTAPDWT-----GNFPGSVMGLNVIRSRTFPIDRVLIMERGTVG-FYSDTR 269 (318) T ss_pred eeEECHHHHHHHhcchhhhhhhhccchhhhhccccc-----ccccceeeceEEeecCccCCCeeEEEecCCcc-eeeccc Confidence 999999999999 444444555555432211 11134689999999999999999887765543 344556 Q ss_pred ccEEEeeccc--chhhhcC-ceEEEEEEeecceeecccceEEEEeeCCC Q lcl|NC_021309. 449 GVTMQMTNSN--GTDFVDG-KVTVRAEERLGLLVYRPSAFQLIQLKKGA 494 (497) Q Consensus 449 ~~~i~~~~~~--~~~f~~~-~v~~r~~~r~~~~v~~~~a~~~l~~~~~a 494 (497) +++..--... ..++..+ ...+|+..+....|.+|.|+++|+---++ T Consensus 270 pl~~t~~~~egg~~~g~~~~s~~~~~~~~~~~~V~~PkA~~~itgi~~~ 318 (318) T protein:vir:10 270 PLQFTALYPEGNGPNGGPTESYRADASHKRALAVDQPKAALWLTGIVTP 318 (318) T ss_pred cceeeecccCCCCCCCCcchhhheehheeeeeeeeCcceeEEEeeccCC Confidence 6655433321 2234433 47788999999999999999999887777 No 123 >protein:vir:99424 Length: 360 # NCBI annotation: hypothetical protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919080;genbank:gi:119757038;genbank:GeneID:4606077 Probab=99.39 E-value=5.8e-13 Score=87.68 Aligned_cols=327 Identities=9% Similarity=0.078 Sum_probs=174.7 Q ss_pred HHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccccccccchhhhHHHHHHHHhhhhHHhhcceeecCCCceEE Q lcl|NC_021309. 118 FNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSY 197 (497) Q Consensus 118 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~ 197 (497) .. . ...... ..+. ........ ..+...-++++++|+....+++.+...+++++.++++++.+....+ T Consensus 1 ~~--~-----~~~~~~---~~n~-~~~~i~k~--~it~~~l~~g~L~p~~a~~Fl~~v~~~t~iL~~~r~~~~~s~~~ei 67 (360) T protein:vir:99 1 MS--S-----NSTIDS---VRNQ-NMNSLSQK--DIGLAELDGFQLPVDVTEEFLERMQKGVQILGMADTMTLARLEMEV 67 (360) T ss_pred Cc--c-----hhHHHH---Hhhh-HHHHHHhh--hccccccCceeecHHHHHHHHHHHhhccchhhhcceeecccccccc Confidence 00 0 000000 0000 00111111 1222334578999999999999999999999999999999888888 Q ss_pred EEEcCCCccceeccccccccc-ccccceeeEe-eeeeEEeeehhhHHHHhhHH-----HHHHHHHHHHHHHHHHHHHhhh Q lcl|NC_021309. 198 LTESAAHNNAAAVAEAGTYPF-SSEEFARVYE-QVGKVANALTITDEGLRDAP-----ELFNFVQGRLLEGIQRKEEVQL 270 (497) Q Consensus 198 p~~~~~~~~a~wv~Eg~~~~~-s~~~f~~i~~-~~~kla~~~~iS~ell~d~~-----~l~~~i~~~la~~~~~~~d~~~ 270 (497) ++...+...-.--.|+++.+. ++.+...+.+ ..+++.....+..+-++++. .+++.|.+.++++++.-++.-. T Consensus 68 ~kig~G~r~~r~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~~i~~~~~~~n~~~~~~~f~~~i~~~~ae~~~~Dle~l~ 147 (360) T protein:vir:99 68 PQFGVPRLSGHTRDEEGSRTENSEAESGSVKFNATDKSYYILVEPKRDALKNTHYGPDQFGDYIVDQFIERYGNDLGLMG 147 (360) T ss_pred cccccceeeccccccCCCCCcCCcCccccCccccccceeeEeechHHHHHhhhhcccchhHHHHHHHHHHHHHHHHHHHH Confidence 775432211111223333222 4455555555 34555666677777776653 3669999999999999999999 Q ss_pred hcccCccc--------------cccccccccccccccccchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhh- Q lcl|NC_021309. 271 LAGGGYPG--------------VNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVT- 335 (497) Q Consensus 271 l~G~G~~~--------------p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~- 335 (497) ++|+.... -.|++..+...... +.. .......... T Consensus 148 ~~g~~ds~d~~~~~~~d~fl~~~dGwlKka~~~~~~--------------------------id~----a~d~t~~~~~~ 197 (360) T protein:vir:99 148 IRAGASSGNLQSIGGAAELDNTFKGWIARAEGDAQS--------------------------VDD----AGDSTRIGLED 197 (360) T ss_pred hhccchhcccccCcccchhhhhhHHHHHHhhcccch--------------------------hhc----ccccccccccc Confidence 98875421 00111110000000 000 0000000000 Q ss_pred ---hhhhccccccc--ccchhhhhhhHHH-HHHHhhhhhhccCC---ceEEechhHHHHHHHHhhhcCceeccCcccccc Q lcl|NC_021309. 336 ---GAAGSGSGVAG--SYPTAAEIAENVF-DAFVDIQLTLFQTP---NAVVMNPRDWELLRLTKDANGQYMGGNFFGNAY 406 (497) Q Consensus 336 ---~~~~~~~~~~~--~~~~~~~~~~~~~-~~~~~~~~~~~~~~---~~~~~n~~~~~~l~~lkd~~G~~i~~~~~~~~~ 406 (497) ......+.... ...........++ ..+..++..|+.+. .+|++++.+....+.....=...+.-.... T Consensus 198 ~~~~~~~~~~~~~~~~g~~~~~~~~~~lf~~~~~~Lp~kyr~~~~~~~~~~~s~~~~~~yr~~L~~R~t~LGd~~l~--- 274 (360) T protein:vir:99 198 TATADADSMPSIANTDGSGNPQPVDTSLFNETIQTLDSRYRESDAYSPVLMTSPNQVQSYTMSLTEREDPLGSAVIF--- 274 (360) T ss_pred ccccccccchhhhccccccccccchHHHHHHHHHhcchhhhcCcccceEEEccCchHHHHHHHHhccCcccchhhee--- Confidence 00000000000 0000011122233 44455555554432 279999888666654432211122111000 Q ss_pred cccccccccccccceEecCCCCcCceEEEeeccceEEEEeecccEEEeecccchhhhcCc--eEEEEEEeecceeecccc Q lcl|NC_021309. 407 GNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGK--VTVRAEERLGLLVYRPSA 484 (497) Q Consensus 407 ~~~~~~~~~l~G~Pvv~~~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~--v~~r~~~r~~~~v~~~~a 484 (497) .....+..|+||+..+.+|.+.+++-+++...|.+ ..+++|..+.+... ..+.- +..-....+|+.+.+++| T Consensus 275 ---g~~~~~~~Gipi~~v~~~pd~~~mlT~p~NLi~g~--~~~iri~~~~e~~~-~~~~~~~~~~~~~~~~D~~iee~~A 348 (360) T protein:vir:99 275 ---GDSDITPFSYDLVGVNGFPDEYMMFTDPNNLAFGL--YEEMELDQSTDTDK-VHEQRLHSRNWLEGQFDFQIKEQQA 348 (360) T ss_pred ---cccccccceeeeEEcCCCCCCceEEeccCceeEEe--eeeeEEeecccchh-hhhhceeeeEEEEEEeeEEEEeccc Confidence 01223578999999999999999999998755444 46777766554321 22222 333345679999999999 Q ss_pred eEEEEeeCCCCC Q lcl|NC_021309. 485 FQLIQLKKGATG 496 (497) Q Consensus 485 ~~~l~~~~~a~~ 496 (497) ++.++-...+++ T Consensus 349 v~~vt~~~~~~~ 360 (360) T protein:vir:99 349 GVLVTDLETPTA 360 (360) T ss_pred EEEEecCCCCCC Confidence 999999999999 No 124 >protein:vir:739 Length: 231 # NCBI annotation: major structural protein 4 # Family: family:all:522 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108716;genbank:gi:13487838;genbank:GeneID:920884 Probab=99.36 E-value=7.1e-14 Score=92.65 Aligned_cols=222 Identities=16% Similarity=0.155 Sum_probs=155.2 Q ss_pred ceeecCCCceEEEEEcCCCccceecccccccccccccceeeEeeeeeEEeeehhhHHHHhhH-HHHHHHHHHHHHHHHHH Q lcl|NC_021309. 186 SSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQR 264 (497) Q Consensus 186 ~~~~~~~~~~~~p~~~~~~~~a~wv~Eg~~~~~s~~~f~~i~~~~~kla~~~~iS~ell~d~-~~l~~~i~~~la~~~~~ 264 (497) .-..-.|+.+++|.. .+.+.-++||+.++..+.++++-+...++.+-.+.|+++..-.+ .+......+.++.++++ T Consensus 1 ~~~~~~Gdtit~P~~---iGda~~v~eG~~i~~~~l~~t~~~atIk~~gk~~~itD~a~l~~~gDp~~ea~~Q~~~~iA~ 77 (231) T protein:vir:73 1 ENGINLANLCEYPND---IGDAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLAN 77 (231) T ss_pred CccccCCceEEeccc---ccchhhhcCCCcCChhhccccceeeeEeeeccceeeeHHHHhhccCchHHHHHHHHHHHHHH Confidence 111223567889976 34678899999999999999999999999988899999976554 46778899999999999 Q ss_pred HHHhhhhcccCccccccccccccccccccccchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhccccc Q lcl|NC_021309. 265 KEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGV 344 (497) Q Consensus 265 ~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 344 (497) ++|..++.--.+. ... . .. T Consensus 78 kvD~di~~~~~~a---------~l~-~--~~------------------------------------------------- 96 (231) T protein:vir:73 78 KVDDDLLKAAKTT---------SQT-V--ST------------------------------------------------- 96 (231) T ss_pred hhhHHHHHhhccc---------ccc-c--cc------------------------------------------------- Confidence 9999987321100 000 0 00 Q ss_pred ccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHHHHHHHhhhc------CceeccCcccccccccccccccccc Q lcl|NC_021309. 345 AGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDAN------GQYMGGNFFGNAYGNPVNGGKNIWG 418 (497) Q Consensus 345 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~------G~~i~~~~~~~~~~~~~~~~~~l~G 418 (497) ....+.+.+++..+... ...+.++++||.++..|++..+.+ |..++. .+.-..+.| T Consensus 97 -------~~t~d~i~~A~~~fgde-~~~~~vivv~p~~~~~Lrk~~~~~~~~~~~g~~i~~----------~G~iG~i~G 158 (231) T protein:vir:73 97 -------KANVDGVQAALDIFNDE-DAQAYVLIVNPKDAAKIRKDANAKNIGSEVGANALI----------NGTYADVLG 158 (231) T ss_pred -------cccHHHHHHHHHHhccc-cccceEEEEcchHHHhhhhccchhhhhhhhccceee----------ecccceEcc Confidence 00011222222222222 234567899999999998854332 222221 123347899 Q ss_pred cceEecCCCCcCceEEEe--eccceEEEEeecccEEEeecccchhhhcCceEEEEEEeecceeecccceEEEEeeCC Q lcl|NC_021309. 419 VPVVTTPLIPLGTILVGH--FAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKG 493 (497) Q Consensus 419 ~Pvv~~~~~~~~~~~~gd--~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~~ 493 (497) +||++|+.+|.++.+..- +...++.++...+++|+.++.. .+....+++.+.++..+.+|+.+++++++-+ T Consensus 159 ~~Vi~S~~~~~~~~~~~~~i~~~gAl~~~~k~~~~vEtdRd~----~~k~~~i~~~~~y~v~l~~~~~vv~~t~~g~ 231 (231) T protein:vir:73 159 AQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDI----VTKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) T ss_pred eEEEEcCCCCCCceeeeeEEeeccceeeeecccceeeccccc----cccccEEEEeEEEEEEEEcCccEEEEEeecC Confidence 999999999998876433 2456788888888888876643 2456888899999999999999999999999 No 125 >protein:vir:7990 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817344;genbank:gi:29565772;genbank:GeneID:1258978 Probab=99.16 E-value=1.5e-11 Score=79.98 Aligned_cols=261 Identities=14% Similarity=0.082 Sum_probs=146.6 Q ss_pred hccccccccccccchhhhHHHHHHHHhhhhHHhhcce----eecCCCceEEEEEcCCCccceecccccccccccccceee Q lcl|NC_021309. 151 NPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISS----RPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARV 226 (497) Q Consensus 151 ~~~~~~~~~g~~v~p~~~~~ii~~~~~~~~l~~~~~~----~~~~~~~~~~p~~~~~~~~a~wv~Eg~~~~~s~~~f~~i 226 (497) |+. -.++|..|...+++.+...+.+.++++. ....++++++|+... ...+.+..++...+..+.+.+.+ T Consensus 1 MA~------~~~~pei~~~~v~~~~~~~lv~~~l~~~~~~~~~~~GdTv~ip~~~~-~~~~d~~~~~~~~~~~~~~~~~~ 73 (273) T protein:vir:79 1 MAF------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVA-PTVKDYKAAGRQTSADAISDTGV 73 (273) T ss_pred Ccc------hhhhHHHHHHHHHHHHHhhccchhhhhccccccccCCcEEEEeecCc-ccccccccCCCccCccccccceE Confidence 211 1134455677788899888887777643 223456899998654 23455778888887778888888 Q ss_pred EeeeeeEEe-eehhhH-HHHhhHHHHHHHHHHHHHHHHHHHHHhhhhc---ccCccccccccccccccccccccchhhhh Q lcl|NC_021309. 227 YEQVGKVAN-ALTITD-EGLRDAPELFNFVQGRLLEGIQRKEEVQLLA---GGGYPGVNGLLQRSTGFTASSASSLFGAT 301 (497) Q Consensus 227 ~~~~~kla~-~~~iS~-ell~d~~~l~~~i~~~la~~~~~~~d~~~l~---G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~ 301 (497) ++...+... -+.|++ +..++..++.+++ +.+.++++.++|+.++. +.++....+ .. T Consensus 74 ~~tid~~~~~~~~i~d~d~~~~~~~~~~~~-~~~~~ala~~vD~~i~~~~~~a~~~~~~~-----------~~------- 134 (273) T protein:vir:79 74 DLLIDQEKSIDFLVDDIDRVQVAGSLEAYT-RAGATALATDTDKFIADMLVDNGTALTGS-----------AP------- 134 (273) T ss_pred EEEEeeecccceeeccHHHHhhcccHHHHH-HHHHHHHHHHHHHHHHHHHhhcccccccc-----------cc------- Confidence 888876533 356665 3445555687754 56788999999986652 111100000 00 Q ss_pred hhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhc-cCCceEEech Q lcl|NC_021309. 302 SATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLF-QTPNAVVMNP 380 (497) Q Consensus 302 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~n~ 380 (497) .+.....+.+..+...+....- ..+.+++++| T Consensus 135 -----------------------------------------------~~~~~~~~~i~~a~~~ld~~~vP~~~R~lvv~p 167 (273) T protein:vir:79 135 -----------------------------------------------SDADDAFDLIASALKELTKANVPNVGRVVVVNA 167 (273) T ss_pred -----------------------------------------------cchhhHHHHHHHHHHHhhhccCCccCcEEEECH Confidence 0000011223333322222221 1234689999 Q ss_pred hHHHHHHHHhhhcCc-eeccCcccccccccccccccccccceEecCCCCcCce-EEEeeccceEEEEeecccEEEeeccc Q lcl|NC_021309. 381 RDWELLRLTKDANGQ-YMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTI-LVGHFAPSVIQTARREGVTMQMTNSN 458 (497) Q Consensus 381 ~~~~~l~~lkd~~G~-~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~~-~~gd~~~~~~~i~~r~~~~i~~~~~~ 458 (497) ..+..|.+..+.-.+ ..... ......+...++.|++|+.++.+|.++. .+..+...++....+ ...++..+.. T Consensus 168 ~~~~~Ll~~~~~~~~~~~~~~----~~~l~~G~ig~~~G~~i~~s~~lp~~~~~~~~a~~~~A~~~a~~-~~~~e~~r~~ 242 (273) T protein:vir:79 168 EMAFWLRSSGSKLTSADTSGD----AAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYVSQ-IDTVEALRDQ 242 (273) T ss_pred HHHHHHhhchhhhhhhhhccc----ccceeeeEeeEEeceEEEecccccccCceEEEEEeccceeeeee-hhhhhcccCc Confidence 999888764321111 11110 0011112234799999999999997542 122333333433322 2233333322 Q ss_pred chhhhcCceEEEEEEeecceeecccceEEEEeeCC Q lcl|NC_021309. 459 GTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKG 493 (497) Q Consensus 459 ~~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~~ 493 (497) ..| -..+++.+.+|..|+||++++.++.+.+ T Consensus 243 -~~~---~~~v~~~~~yg~~v~~p~~vv~~~~~g~ 273 (273) T protein:vir:79 243 -DSF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred -ccc---eeeeeeeeeeeeEEecCceEEEEeccCC Confidence 223 4678889999999999999998765544 No 126 >protein:vir:93858 Length: 400 # NCBI annotation: putative structural protein # Family: family:all:2417 # MgeID: mge:1479 # MgeName: 712 # Cross-refs: genbank:acc:YP_764266;genbank:gi:115315579;genbank:GeneID:5141552 Probab=99.15 E-value=8.1e-12 Score=81.39 Aligned_cols=381 Identities=15% Similarity=0.116 Sum_probs=185.5 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHhHhHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021309. 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAH--QAEVEAHERAQEMLKSLGGADAAKDGLDNDIP 78 (497) Q Consensus 1 ~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~--~~~~~~~e~~~e~~~~~~~~~a~~~~~~~~~~ 78 (497) -|++.+-.+++. ++++.+..+..++..+ +..++...+++|+..-+.++..++...+.++- T Consensus 10 k~~~~ek~~~~~------------------~~~e~~~~lks~~~g~~~~~~~~~~~k~~el~kT~Sel~~ei~k~e~eln 71 (400) T protein:vir:93 10 KPDLIEKQNRLA------------------ELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELN 71 (400) T ss_pred cchHHHHHHHHh------------------hhhhhhhhhhhhhhccchhhhhhhchhHHHHHHHHHHhHHHHHHHhhhhh Confidence 233333222222 2222222222222111 11223333445555555555555444333221 Q ss_pred HHHHHHHHHHHHHHHhhhhhhHHHHhHhhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccc Q lcl|NC_021309. 79 EVEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGT 158 (497) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (497) .+.+...+..+. .+-.+..+ ......+.... .....+.+..|... ..+...+.++. T Consensus 72 -------~~~E~~Kgk~~m-tefLkT~~------A~~~fa~~l~~----nsg~sd~knaW~A~------l~E~gvt~td~ 127 (400) T protein:vir:93 72 -------AQEEKPKGKDKM-TNFIESQN------AVTEFFDVLKK----NSGKSEIKNAWSAK------LAENGVTITDT 127 (400) T ss_pred -------hhhhhcccchhH-HHhhhhHH------HHHHHHHHHHh----hcCCcchhhhhhhh------hhhcccccCCc Confidence 000000000000 00000000 00000000000 00000111222111 11111111222 Q ss_pred cccccchhhhHHHHHHHHhhhhHHhhcceeecCCCceEEEEEcCCCcccee-cccccccccccccceeeEeeeeeEEeee Q lcl|NC_021309. 159 FAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAA-VAEAGTYPFSSEEFARVYEQVGKVANAL 237 (497) Q Consensus 159 ~g~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~w-v~Eg~~~~~s~~~f~~i~~~~~kla~~~ 237 (497) . ..+|.-.+..|-..++...++++..++..+++--+..+-++. .-+| .--|+++.++..+|..-++.|.-++.+. T Consensus 128 n-~iLP~~il~aIq~al~~~~~~~~f~~v~n~p~l~V~~~~dt~---~qa~gHk~G~~K~eq~~tl~~rtL~P~~VYk~~ 203 (400) T protein:vir:93 128 T-FQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSFDSA---NEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQ 203 (400) T ss_pred h-hhcchHHHHHHHHhhhccCCcccceeeecCCceeeecchhhh---cccceeccCCcccceeeeeeeeccCHHHHHHHh Confidence 1 133444556666667777788887777666332222222221 2355 4567889999999999999998888877 Q ss_pred hhhHHHHhhH---HHHHHHHHHHHHHHHHHH-HHhhhhcccCccccccccccccccccccccchhhhhhhHHHHHHhhhh Q lcl|NC_021309. 238 TITDEGLRDA---PELFNFVQGRLLEGIQRK-EEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPAD 313 (497) Q Consensus 238 ~iS~ell~d~---~~l~~~i~~~la~~~~~~-~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 313 (497) .+.+-..++. ..|..||.++|...+-.+ .+++++-|+|++.+.++..-+.+..+...+. T Consensus 204 ~la~~~~~~~~tygaL~nYVm~EL~q~vI~k~Ve~Aii~GdG~Ngf~~~dk~t~Ik~I~~dt~----------------- 266 (400) T protein:vir:93 204 SLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITT----------------- 266 (400) T ss_pred hhhhhhhhccccHHHHHHHHHHHHHHHHHHHHhhhheeecccccccCCCcchhhhhhhhhhhh----------------- Confidence 7755444432 468999999999999965 7999999999886655533221111100000 Q ss_pred hcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHHHHHHHhhhc Q lcl|NC_021309. 314 GTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDAN 393 (497) Q Consensus 314 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~ 393 (497) . ... .+. --+.+++..+.....+......-+++.|..|..|++|+|++ T Consensus 267 --k---------------t~~------a~~---------~~~qdl~E~~~d~~~~~aad~~~Iv~s~d~~A~L~~lk~a~ 314 (400) T protein:vir:93 267 --K---------------AKS------AGK---------TPFADAIEEAVDFVRPTAGRRYLIVKAEDRKALLDELRQAT 314 (400) T ss_pred --h---------------hhh------cCC---------ccHHHHHHHHHhhhhhccCCceeEEeccchHHHHHHhcCCc Confidence 0 000 000 00122333333344444455556899999999999999999 Q ss_pred CceeccCccccccccccccccccccc-ceEecCCCCcC--ceEEEeeccceEEEEeecccEEEeecccchhhhcCceEEE Q lcl|NC_021309. 394 GQYMGGNFFGNAYGNPVNGGKNIWGV-PVVTTPLIPLG--TILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVR 470 (497) Q Consensus 394 G~~i~~~~~~~~~~~~~~~~~~l~G~-Pvv~~~~~~~~--~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r 470 (497) |++.|.-.... .+..+=+|+ ..|+...+|.. .+++ | + ++.+. -.+++ +.....|..|+-.+. T Consensus 315 ~~a~f~~~n~d------~~IA~~fGv~~Lv~~Tr~~~~kp~V~V-D--e-k~~i~-~~~~~----t~~sf~~~tNs~~il 379 (400) T protein:vir:93 315 ANANVRIKNDD------TEIASEVGVDEIIVYTGSKALKPTVLV-D--Q-KYHID-MQDLT----KVDAFEWKTNSNMIL 379 (400) T ss_pred ceeeeeecccc------chhhhhcccceeeeeccCCCCCceeee-e--h-hhhcc-ccCce----eccceeeeeccceEE Confidence 99998532222 122233342 23344555443 3333 3 2 23332 12221 122233667777888 Q ss_pred EEEeecceeecccceEEEEee Q lcl|NC_021309. 471 AEERLGLLVYRPSAFQLIQLK 491 (497) Q Consensus 471 ~~~r~~~~v~~~~a~~~l~~~ 491 (497) .+..++|.+.-|++-++++.+ T Consensus 380 vetlv~Gsi~~~N~~ay~~v~ 400 (400) T protein:vir:93 380 VETLTSGHVETYNAGAVITVS 400 (400) T ss_pred eeeeeccceecccceeeEeeC Confidence 889999999999999999988 No 127 >protein:vir:102605 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655002;genbank:gi:109392192;genbank:GeneID:4157227 Probab=99.10 E-value=4.2e-11 Score=77.49 Aligned_cols=260 Identities=14% Similarity=0.081 Sum_probs=143.5 Q ss_pred hccccccccccccchhhhHHHHHHHHhhhhHHhhccee----ecCCCceEEEEEcCCCccceecccccccccccccceee Q lcl|NC_021309. 151 NPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSR----PVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARV 226 (497) Q Consensus 151 ~~~~~~~~~g~~v~p~~~~~ii~~~~~~~~l~~~~~~~----~~~~~~~~~p~~~~~~~~a~wv~Eg~~~~~s~~~f~~i 226 (497) |+. -.++|..|...+++.+.+.+.+..++..- ...++++++|+.... ..+.+..++...+..+.+.+.+ T Consensus 1 MA~------~~~~pe~~~~~v~~~~~~~lv~~~l~~~~~~~~~~~Gdtv~ip~~~~~-~~~d~~~~~~~~~~~~~~~~~~ 73 (273) T protein:vir:10 1 MAF------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAP-TVKDYKAAGRQTSADAISDTGV 73 (273) T ss_pred Ccc------hhhhHHHHHHHHHHHHHhhhccchhhccccccccccCceEEEeecccc-cccccccCCCccCccccccceE Confidence 111 12344456777888998888877776431 223568899986542 2345677777776667777777 Q ss_pred EeeeeeEE-eeehhhH-HHHhhHHHHHHHHHHHHHHHHHHHHHhhhhc---ccCccccccccccccccccccccchhhhh Q lcl|NC_021309. 227 YEQVGKVA-NALTITD-EGLRDAPELFNFVQGRLLEGIQRKEEVQLLA---GGGYPGVNGLLQRSTGFTASSASSLFGAT 301 (497) Q Consensus 227 ~~~~~kla-~~~~iS~-ell~d~~~l~~~i~~~la~~~~~~~d~~~l~---G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~ 301 (497) ++...+.. .-+.|++ +..+...+++++ .+...++++.++|..++. +.+... +.. . T Consensus 74 ~~tid~~~~~~~~i~d~d~~~~~~~~~~~-~~~~~~alA~~vD~~i~~~~~~a~~~~-----------~~~--~------ 133 (273) T protein:vir:10 74 DLLIDQEKSIDFLVDDIDRVQVAGSLEAY-TRAGATALATDTDKFIADMLVDNGTAL-----------TGS--A------ 133 (273) T ss_pred EEEEeeeeecceEeecHHHhhhhccHHHH-HHHHHHHHHHHHHHHHHHHHhcccccc-----------ccc--c------ Confidence 77775543 2345665 344455567775 456788999999987652 111000 000 0 Q ss_pred hhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhc-cCCceEEech Q lcl|NC_021309. 302 SATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLF-QTPNAVVMNP 380 (497) Q Consensus 302 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~n~ 380 (497) ..+....++.+..+...+....- ..+.+++++| T Consensus 134 ----------------------------------------------~~~~~~~~~~i~~a~~~ld~~~vP~~~R~lvv~p 167 (273) T protein:vir:10 134 ----------------------------------------------PTDADDAFDLIAKALKELTKANVPNVGRVVVVNA 167 (273) T ss_pred ----------------------------------------------ccchhHHHHHHHHHHHHhhhcCCCcCCCEEEECH Confidence 00001112333333333332222 2345689999 Q ss_pred hHHHHHHHHhhhcCceeccCcccccccccccccccccccceEecCCCCcCc---eEEEeeccceEEEEeecccEEEeecc Q lcl|NC_021309. 381 RDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGT---ILVGHFAPSVIQTARREGVTMQMTNS 457 (497) Q Consensus 381 ~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~---~~~gd~~~~~~~i~~r~~~~i~~~~~ 457 (497) ..+..|++..+--.+.-... .......+...++.|++|+.++.+|.++ ++.+- ..++....+ ...++.... T Consensus 168 ~~~~~L~~~~~~~~~~~~~~---~~~~l~~G~ig~i~G~~v~~s~~lp~~~~~~~~~~~--~~A~~~a~q-~~~~e~~r~ 241 (273) T protein:vir:10 168 EMAFWLRSSGSKLTSADTSG---DAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFH--PSAAAYVSQ-IDTVEALRD 241 (273) T ss_pred HHHHHHhcchhhhhhhhccc---cccceeeeeeeEEeceEEEEecccccCCccEEEEEe--ccceeeeee-eehhhcccC Confidence 99998876432111100000 0001111223579999999999999754 33333 333433322 123332222 Q ss_pred cchhhhcCceEEEEEEeecceeecccceEEEEeeCC Q lcl|NC_021309. 458 NGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKG 493 (497) Q Consensus 458 ~~~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~~ 493 (497) . ..| -..+++.+.+|..|+||++++.++.+.+ T Consensus 242 ~-~~~---~~~v~~~~~yg~~v~~~~~~~~l~~~g~ 273 (273) T protein:vir:10 242 Q-DSF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred C-Ccc---eeeeeeeeeeeeeEeccceEEEEeccCC Confidence 1 223 3568888999999999999998765544 No 128 >protein:vir:105822 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655767;genbank:gi:109522090;genbank:GeneID:4157630 Probab=99.10 E-value=4.2e-11 Score=77.49 Aligned_cols=260 Identities=14% Similarity=0.081 Sum_probs=143.5 Q ss_pred hccccccccccccchhhhHHHHHHHHhhhhHHhhccee----ecCCCceEEEEEcCCCccceecccccccccccccceee Q lcl|NC_021309. 151 NPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSR----PVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARV 226 (497) Q Consensus 151 ~~~~~~~~~g~~v~p~~~~~ii~~~~~~~~l~~~~~~~----~~~~~~~~~p~~~~~~~~a~wv~Eg~~~~~s~~~f~~i 226 (497) |+. -.++|..|...+++.+.+.+.+..++..- ...++++++|+.... ..+.+..++...+..+.+.+.+ T Consensus 1 MA~------~~~~pe~~~~~v~~~~~~~lv~~~l~~~~~~~~~~~Gdtv~ip~~~~~-~~~d~~~~~~~~~~~~~~~~~~ 73 (273) T protein:vir:10 1 MAF------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAP-TVKDYKAAGRQTSADAISDTGV 73 (273) T ss_pred Ccc------hhhhHHHHHHHHHHHHHhhhccchhhccccccccccCceEEEeecccc-cccccccCCCccCccccccceE Confidence 111 12344456777888998888877776431 223568899986542 2345677777776667777777 Q ss_pred EeeeeeEE-eeehhhH-HHHhhHHHHHHHHHHHHHHHHHHHHHhhhhc---ccCccccccccccccccccccccchhhhh Q lcl|NC_021309. 227 YEQVGKVA-NALTITD-EGLRDAPELFNFVQGRLLEGIQRKEEVQLLA---GGGYPGVNGLLQRSTGFTASSASSLFGAT 301 (497) Q Consensus 227 ~~~~~kla-~~~~iS~-ell~d~~~l~~~i~~~la~~~~~~~d~~~l~---G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~ 301 (497) ++...+.. .-+.|++ +..+...+++++ .+...++++.++|..++. +.+... +.. . T Consensus 74 ~~tid~~~~~~~~i~d~d~~~~~~~~~~~-~~~~~~alA~~vD~~i~~~~~~a~~~~-----------~~~--~------ 133 (273) T protein:vir:10 74 DLLIDQEKSIDFLVDDIDRVQVAGSLEAY-TRAGATALATDTDKFIADMLVDNGTAL-----------TGS--A------ 133 (273) T ss_pred EEEEeeeeecceEeecHHHhhhhccHHHH-HHHHHHHHHHHHHHHHHHHHhcccccc-----------ccc--c------ Confidence 77775543 2345665 344455567775 456788999999987652 111000 000 0 Q ss_pred hhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhc-cCCceEEech Q lcl|NC_021309. 302 SATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLF-QTPNAVVMNP 380 (497) Q Consensus 302 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~n~ 380 (497) ..+....++.+..+...+....- ..+.+++++| T Consensus 134 ----------------------------------------------~~~~~~~~~~i~~a~~~ld~~~vP~~~R~lvv~p 167 (273) T protein:vir:10 134 ----------------------------------------------PTDADDAFDLIAKALKELTKANVPNVGRVVVVNA 167 (273) T ss_pred ----------------------------------------------ccchhHHHHHHHHHHHHhhhcCCCcCCCEEEECH Confidence 00001112333333333332222 2345689999 Q ss_pred hHHHHHHHHhhhcCceeccCcccccccccccccccccccceEecCCCCcCc---eEEEeeccceEEEEeecccEEEeecc Q lcl|NC_021309. 381 RDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGT---ILVGHFAPSVIQTARREGVTMQMTNS 457 (497) Q Consensus 381 ~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~---~~~gd~~~~~~~i~~r~~~~i~~~~~ 457 (497) ..+..|++..+--.+.-... .......+...++.|++|+.++.+|.++ ++.+- ..++....+ ...++.... T Consensus 168 ~~~~~L~~~~~~~~~~~~~~---~~~~l~~G~ig~i~G~~v~~s~~lp~~~~~~~~~~~--~~A~~~a~q-~~~~e~~r~ 241 (273) T protein:vir:10 168 EMAFWLRSSGSKLTSADTSG---DAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFH--PSAAAYVSQ-IDTVEALRD 241 (273) T ss_pred HHHHHHhcchhhhhhhhccc---cccceeeeeeeEEeceEEEEecccccCCccEEEEEe--ccceeeeee-eehhhcccC Confidence 99998876432111100000 0001111223579999999999999754 33333 333433322 123332222 Q ss_pred cchhhhcCceEEEEEEeecceeecccceEEEEeeCC Q lcl|NC_021309. 458 NGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKG 493 (497) Q Consensus 458 ~~~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~~ 493 (497) . ..| -..+++.+.+|..|+||++++.++.+.+ T Consensus 242 ~-~~~---~~~v~~~~~yg~~v~~~~~~~~l~~~g~ 273 (273) T protein:vir:10 242 Q-DSF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred C-Ccc---eeeeeeeeeeeeeEeccceEEEEeccCC Confidence 1 223 3568888999999999999998765544 No 129 >protein:vir:2201 Length: 345 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041998;swissprot:sw:p19726;genbank:gi:9627469;goa:P19726;uniprot:P19726;genbank:GeneID:1261026 Probab=98.95 E-value=7.6e-11 Score=76.06 Aligned_cols=296 Identities=13% Similarity=0.082 Sum_probs=155.3 Q ss_pred Hhhhhh---hhhhhhhccccccccccccchhhhHHHHHHHHhhhhHHhhcceeecC-CCceEEEEEcCCCccceeccccc Q lcl|NC_021309. 139 ADGETA---PAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVT-SPNLSYLTESAAHNNAAAVAEAG 214 (497) Q Consensus 139 ~~~~~~---~~~~~~~~~~~~~~~g~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~~p~~~~~~~~a~wv~Eg~ 214 (497) +..... ............+..=.+.+..|..++.+.....+.++++.++.++. ++++++|+. + ..++.....|+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~al~le~f~geV~~~f~~~s~~~~~~~~r~i~~gks~~~~~i-G-~~~~~~~~~G~ 78 (345) T protein:vir:22 1 MASMTGGQQMGTNQGKGVVAAGDKLALFLKVFGGEVLTAFARTSVTTSRHMVRSISSGKSAQFPVL-G-RTQAAYLAPGE 78 (345) T ss_pred CcccccchhcccccccccccCCchhHHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEEeee-c-ceEEEeeecCC Confidence 000000 00000000011111113456778889999999999999999999887 567889986 3 45677888787 Q ss_pred ccccc--cccceeeEee--eeeEEeeehhhHHHHhhHHHHHHHHHHHHHHHHHHHHHhhhhccc----C-----cccccc Q lcl|NC_021309. 215 TYPFS--SEEFARVYEQ--VGKVANALTITDEGLRDAPELFNFVQGRLLEGIQRKEEVQLLAGG----G-----YPGVNG 281 (497) Q Consensus 215 ~~~~s--~~~f~~i~~~--~~kla~~~~iS~ell~d~~~l~~~i~~~la~~~~~~~d~~~l~G~----G-----~~~p~G 281 (497) ....+ ++..++.++. ..+++.+..-.-+=.+...++.+.+.++..+++++..|+.++.-- . +..|.| T Consensus 79 ~l~~~~~~~~~~e~~ltID~~~y~~~~VddiD~~q~~~D~r~~~s~~~G~aLA~~~D~~i~~~l~k~a~~~~~~~~~~~~ 158 (345) T protein:vir:22 79 NLDDKRKDIKHTEKVITIDGLLTADVLIYDIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAGLCNVESKYNENIEG 158 (345) T ss_pred CCCCCCCCcccceEEEEecchhhhhhhHhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccc Confidence 76543 5777884444 333333221111112233478999999999999999999887311 0 011221 Q ss_pred ccccccccccccccchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHH Q lcl|NC_021309. 282 LLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDA 361 (497) Q Consensus 282 i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 361 (497) +-............ ...........+++.++++ T Consensus 159 ~~~~~~~~~~~~g~-----------------------------------------------~~t~~~~~~~~~~~ai~~a 191 (345) T protein:vir:22 159 LGTATVIETTQNKA-----------------------------------------------ALTDQVALGKEIIAALTKA 191 (345) T ss_pred cccccccccccccc-----------------------------------------------cccccccCHHHHHHHHHHH Confidence 11110000000000 0000001112233444444 Q ss_pred HHhhhhhhcc-CCceEEechhHHHHHHHHhhhc-CceeccCcccccccccccccccccccceEecCCCCcCc-------- Q lcl|NC_021309. 362 FVDIQLTLFQ-TPNAVVMNPRDWELLRLTKDAN-GQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGT-------- 431 (497) Q Consensus 362 ~~~~~~~~~~-~~~~~~~n~~~~~~l~~lkd~~-G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~-------- 431 (497) ...+....-- ...+.+++|..|..|..-+.-+ ..|.... ....+...+++|+||+.++.+|.+. T Consensus 192 ~~~Lde~~VP~~~R~~vv~P~~y~~Ll~~~~~~~~~~~~~~------~~~~G~V~~i~G~~V~~sn~lp~~~~~~~~~~~ 265 (345) T protein:vir:22 192 RAALTKNYVPAADRVFYCDPDSYSAILAALMPNAANYAALI------DPEKGSIRNVMGFEVVEVPHLTAGGAGTAREGT 265 (345) T ss_pred HHHhhhcCCCccCCEEEeChHHHHHHhcccccccccccccc------ccccceEEEEeceEEEecccccccccCccccCc Confidence 4444332222 2345789999998775433221 2222111 1111224578999999999887421 Q ss_pred ---------------eEEE-------eeccceEEEEeecccEEEeecccchhhhcCceEEEEEEeecceeecccceEEEE Q lcl|NC_021309. 432 ---------------ILVG-------HFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQ 489 (497) Q Consensus 432 ---------------~~~g-------d~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~ 489 (497) ...+ -|.+.++..+...+++++..... ..|. ..+++..=+|-+|+||++.+.|+ T Consensus 266 ~~~~~~~~~~~g~~~~~~~~~~~~~l~~h~~A~~~v~~~~~~~e~~r~~-~~~~---d~I~~~~a~G~~vlRPeaa~~i~ 341 (345) T protein:vir:22 266 TGQKHVFPANKGEGNVKVAKDNVIGLFMHRSAVGTVKLRDLALERARRA-NFQA---DQIIAKYAMGHGGLRPEAAGAVV 341 (345) T ss_pred ccccccccccccceeeeeccCceEEEEEehhheeeeeeecceeeeeech-hHHH---HHHHHHHhcCCcccccceeEEEE Confidence 0001 11222333444445556665533 2332 36677788999999999999998 Q ss_pred eeCC Q lcl|NC_021309. 490 LKKG 493 (497) Q Consensus 490 ~~~~ 493 (497) ++-- T Consensus 342 ~~~~ 345 (345) T protein:vir:22 342 FKVE 345 (345) T ss_pred EeeC Confidence 8877 No 130 >protein:vir:3364 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523335;genbank:gi:17570826;genbank:GeneID:927448 Probab=98.90 E-value=9e-11 Score=75.65 Aligned_cols=299 Identities=13% Similarity=0.075 Sum_probs=155.2 Q ss_pred Hhhhhhhhhhhhhcccccccccc---ccchhhhHHHHHHHHhhhhHHhhcceeecC-CCceEEEEEcCCCccceeccccc Q lcl|NC_021309. 139 ADGETAPAAIGQNPFGSTGTFAP---GILPTFLPGIVEQLFYELSLADLISSRPVT-SPNLSYLTESAAHNNAAAVAEAG 214 (497) Q Consensus 139 ~~~~~~~~~~~~~~~~~~~~~g~---~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~~p~~~~~~~~a~wv~Eg~ 214 (497) +.+....... ....+..+..|. +.+..|..++.......+.++++.++.++. ++++.+|+... .++.....|+ T Consensus 1 ~~~~~~~~~~-~t~~g~~~~~~~~~al~ie~~~g~V~~~f~~~s~~~~~v~~r~~~~G~sv~i~~iG~--~t~~~~~~g~ 77 (347) T protein:vir:33 1 MANIQGGQQI-GTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTMPRHMLRSIASGKSAQFPVIGR--TKAAYLKPGE 77 (347) T ss_pred CCCCccCccc-ccccccCCcccchHHHHHHHHHHHHHHHHHHHHhhhhhhccccccccceeEeeeccc--eeeeeecCCC Confidence 0000000000 001111112222 345678888888888889999998887765 56888998643 4566666676 Q ss_pred cccc--ccccceeeEeeeeeEEee-ehhhHH-HHhhHHHHHHHHHHHHHHHHHHHHHhhhhc-----ccCccccccc--- Q lcl|NC_021309. 215 TYPF--SSEEFARVYEQVGKVANA-LTITDE-GLRDAPELFNFVQGRLLEGIQRKEEVQLLA-----GGGYPGVNGL--- 282 (497) Q Consensus 215 ~~~~--s~~~f~~i~~~~~kla~~-~~iS~e-ll~d~~~l~~~i~~~la~~~~~~~d~~~l~-----G~G~~~p~Gi--- 282 (497) ..+. .+++.++.++...++-.+ ..|.+- =++...++.+.+.++..+++++..|+.++. +.....+.+. T Consensus 78 ~l~~~~~~~~~~e~~ltiD~~~y~~~~VddiD~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~~~~~~~~~~ 157 (347) T protein:vir:33 78 NLDDKRKDIKHTEKVIHIDGLLTADVLIYDIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAGLVNLPDGSNENIEG 157 (347) T ss_pred CCCCCCCCCccceEEEEechhhhhhHHHhhHHHHhcCCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccc Confidence 6544 345666655544332211 122111 122224788889999999999999998862 1111111110 Q ss_pred cccccccccccccchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHH Q lcl|NC_021309. 283 LQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAF 362 (497) Q Consensus 283 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 362 (497) +...+...... ...+...........+++.++++. T Consensus 158 ~~~~~~~~~~~---------------------------------------------~~tg~~~d~~~~a~~i~~~i~~a~ 192 (347) T protein:vir:33 158 LGKPTVLTLVK---------------------------------------------PTTGSLTDPVELGKAIIAQLTIAR 192 (347) T ss_pred ccccccccccc---------------------------------------------cccccccchhhhHHHHHHHHHHHH Confidence 00000000000 000000000011223345555555 Q ss_pred Hhhhhhhc-cCCceEEechhHHHHHHHHhhh-cCceeccCcccccccccccccccccccceEecCCCCcCce-------- Q lcl|NC_021309. 363 VDIQLTLF-QTPNAVVMNPRDWELLRLTKDA-NGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTI-------- 432 (497) Q Consensus 363 ~~~~~~~~-~~~~~~~~n~~~~~~l~~lkd~-~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~~-------- 432 (497) ..+....- ....+.+++|..|..|.+...- +..|... .....+...+++|++|+.++.+|.+.+ T Consensus 193 ~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~d~~~~------~~~~~G~V~~i~G~~V~~Sn~lp~~~~~~~~~~~~ 266 (347) T protein:vir:33 193 ASLTKNYVPAADRTFYTTPDNYSAILAALMPNAANYQAL------LDPERGTIRNVMGFEVVEVPHLTAGGAGDTREDAP 266 (347) T ss_pred HHHhhcCCCccCcEEEeCHHHHHHHhccccccccccccc------cccccceeEEEeceeEEEecccccCcccccccccc Confidence 55544433 2345678999998888654321 1223211 111222345789999999999986421 Q ss_pred --------------EEEeeccc--------eEEEEeecccEEEeecccchhhhcCceEEEEEEeecceeecccceEEEEe Q lcl|NC_021309. 433 --------------LVGHFAPS--------VIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQL 490 (497) Q Consensus 433 --------------~~gd~~~~--------~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~ 490 (497) +-++|+.. ++..+.-.+++++..... .+| .-.+++...+|.+++||++.+.+.+ T Consensus 267 ag~~~~~~~~~~~~~~~a~~~~~gl~~h~~A~g~v~~~~~~~e~~r~~-~~~---~d~i~~~~~~G~~vlrP~~av~i~~ 342 (347) T protein:vir:33 267 ADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKDLALERARRA-NYQ---ADQIIAKYAMGHGGLRPEAAGAIVL 342 (347) T ss_pred ccccccccCCcccceeccccceeeeeecchhheeeeeeceeeeeccch-hhh---hHhhhhhhhcCCceecccceEEEec Confidence 11222221 111222333344444432 222 2556777888999999999999999 Q ss_pred eCCCC Q lcl|NC_021309. 491 KKGAT 495 (497) Q Consensus 491 ~~~a~ 495 (497) +..++ T Consensus 343 ~~~~~ 347 (347) T protein:vir:33 343 PKVSE 347 (347) T ss_pred CCCCC Confidence 99999 No 131 >protein:vir:1541 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052109;swissprot:trembl:q9t107;genbank:gi:9634035;uniprot:Q9T107;genbank:GeneID:1262383 Probab=98.90 E-value=1.8e-10 Score=73.96 Aligned_cols=303 Identities=12% Similarity=0.061 Sum_probs=151.3 Q ss_pred Hhhhhhhhhhhhhcccccccccc---ccchhhhHHHHHHHHhhhhHHhhcceeecC-CCceEEEEEcCCCccceeccccc Q lcl|NC_021309. 139 ADGETAPAAIGQNPFGSTGTFAP---GILPTFLPGIVEQLFYELSLADLISSRPVT-SPNLSYLTESAAHNNAAAVAEAG 214 (497) Q Consensus 139 ~~~~~~~~~~~~~~~~~~~~~g~---~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~~p~~~~~~~~a~wv~Eg~ 214 (497) +.......... ...+..+..+. +.+..|..+++......+.++++.++.++. ++++.+|+... .++.....|. T Consensus 1 ma~~~~~~~~~-t~~~~~~~~~~~~a~~ie~f~g~V~~~f~~~s~~~~~~~~~~~~~G~sv~i~~ig~--~t~~~~~~g~ 77 (347) T protein:vir:15 1 MANIQGGQQIG-TNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTMPRHMLRSIASGKSAQFPVIGR--TKAAYLKPGE 77 (347) T ss_pred CCccccCCccc-cccccCCCcchHHHHHHHHHHHHHHHHHHHhhhhhhccccccccccceeEeeeccc--eeeeeeccCC Confidence 00000000000 00011111111 234567778888888888899998887765 56888998643 4566666676 Q ss_pred cccc--ccccceeeEeeeeeEEe-eehhhHH-HHhhHHHHHHHHHHHHHHHHHHHHHhhhhccc--Ccc-cc--cccccc Q lcl|NC_021309. 215 TYPF--SSEEFARVYEQVGKVAN-ALTITDE-GLRDAPELFNFVQGRLLEGIQRKEEVQLLAGG--GYP-GV--NGLLQR 285 (497) Q Consensus 215 ~~~~--s~~~f~~i~~~~~kla~-~~~iS~e-ll~d~~~l~~~i~~~la~~~~~~~d~~~l~G~--G~~-~p--~Gi~~~ 285 (497) ..+. .+++.++.++..-++-. -..|.+- -.+...++.+.+.++..+++++..|+.++.-- +.. .+ ...... T Consensus 78 ~l~~~~~~~~~~e~~ltID~~~~~~~~VddlD~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~~~~~~~~~~ 157 (347) T protein:vir:15 78 NLDDKRKDIKHTEKVIHIDGLLTADVLIYDIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAGLVNLPDASNENIEG 157 (347) T ss_pred CCCCCCCCCccceEEEEechhhhhhHHhhhHHHHhcCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccc Confidence 6544 34567775555443321 1222111 12223478888999999999999999887210 000 00 000000 Q ss_pred ccccccccccchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhh Q lcl|NC_021309. 286 STGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDI 365 (497) Q Consensus 286 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 365 (497) .+....... .....+...........+++.++++...+ T Consensus 158 ~g~~~~~~~------------------------------------------~~~~~~~~~~~~~~~~~i~d~~~~a~~~L 195 (347) T protein:vir:15 158 LGKPTVLTL------------------------------------------VKPTTGDLTDPVELGKAIIAQLTIARASL 195 (347) T ss_pred cCccccccc------------------------------------------cccccccchhhhhHHHHHHHHHHHHHHHH Confidence 000000000 00000000000011122334444444333 Q ss_pred hhhhc-cCCceEEechhHHHHHHHHhhhcCceeccCcccccccccccccccccccceEecCCCCcCce------------ Q lcl|NC_021309. 366 QLTLF-QTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTI------------ 432 (497) Q Consensus 366 ~~~~~-~~~~~~~~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~~------------ 432 (497) ....- ....+.+++|..|..|.+-.+-.. ....... ....+...+++|++|+.++.+|.+.+ T Consensus 196 de~~VP~~gR~~vv~P~~y~~LL~~~~~~~----~d~~~~~-~~~~G~Vg~i~G~~V~~Sn~lp~~~~t~~~~~~~~g~~ 270 (347) T protein:vir:15 196 TKNYVPAADRTFYTTPDNYSAILAALMPNA----ANYQALI-DHERGTIRNVMGFEVVEVPHLTAGGAGDTREDAPADQK 270 (347) T ss_pred hhcCCCccCCEEEeCHHHHHHHhccccccc----ccccccc-cccceEEEEEeceEEEeccccccccccccccccccccc Confidence 33322 233456889999888865543221 1111100 11122235789999999999984321 Q ss_pred ----------EEEeeccc--------eEEEEeecccEEEeecccchhhhcCceEEEEEEeecceeecccceEEEEeeCCC Q lcl|NC_021309. 433 ----------LVGHFAPS--------VIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGA 494 (497) Q Consensus 433 ----------~~gd~~~~--------~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~~a 494 (497) +-++|+.. ++..+.-++++++...... +| .-.+++...+|.+++||++.+.+.++..+ T Consensus 271 ~~~~~~~~~~~~~~f~~~~~l~~h~~A~g~v~~~~~~~e~~~~~~-~~---~d~i~~~~~~G~~vlrP~~av~~~~~~~~ 346 (347) T protein:vir:15 271 HAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKDLALERARRAN-YQ---ADQIIAKYAMGHGGLRPEAAGAIVLPKVS 346 (347) T ss_pred ccccccccceeeeccccceeeeeccceeeeeEeeceeeeecccch-hh---hhhhehhhhcCCceeccccEEEEecCCCC Confidence 11222221 1223333344455444322 22 35567777889999999999999999999 Q ss_pred C Q lcl|NC_021309. 495 T 495 (497) Q Consensus 495 ~ 495 (497) + T Consensus 347 ~ 347 (347) T protein:vir:15 347 E 347 (347) T ss_pred C Confidence 9 No 132 >protein:vir:80213 Length: 334 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522884;genbank:gi:158345177;genbank:GeneID:5687476 Probab=98.88 E-value=2.7e-10 Score=73.05 Aligned_cols=299 Identities=13% Similarity=0.073 Sum_probs=155.2 Q ss_pred hhhhhhhhhhccccccccc--cccchhhhHHHHHHHHhhhhHHhhcceeecC-CCceEEEEEcCCCccceeccccccccc Q lcl|NC_021309. 142 ETAPAAIGQNPFGSTGTFA--PGILPTFLPGIVEQLFYELSLADLISSRPVT-SPNLSYLTESAAHNNAAAVAEAGTYPF 218 (497) Q Consensus 142 ~~~~~~~~~~~~~~~~~~g--~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~~p~~~~~~~~a~wv~Eg~~~~~ 218 (497) +..............++++ .+.+.+|..++.......+.++++.++.++. ++++.+|+.. ..++....-|+.... T Consensus 1 m~~~~~~~~t~~~~~~~~~~~~l~le~~~geV~~af~~~s~~~~~~~~r~i~~G~s~~~~~iG--~~~~~~~~~g~~l~~ 78 (334) T protein:vir:80 1 MTYPAANTHTRPGWGGANSDVSLHIEEHLGLVDASFMYSSKFASWMNVRSLRGTNQLRVDRVG--ASTIAGRKAGEELVV 78 (334) T ss_pred CCCCcCCCccccccccccchheehhhhhhhHHHHHHHHhhhhhccceeeeccccceEEEeeec--ceeeeeecCCCCCCC Confidence 1110000000101112222 2344778889999999999999999999887 5688999863 456777777877777 Q ss_pred ccccceeeEeeeeeEE-eeehhhHH-HHhhHHHHHHHHHHHHHHHHHHHHHhhhh----cccCccccccccccc--cccc Q lcl|NC_021309. 219 SSEEFARVYEQVGKVA-NALTITDE-GLRDAPELFNFVQGRLLEGIQRKEEVQLL----AGGGYPGVNGLLQRS--TGFT 290 (497) Q Consensus 219 s~~~f~~i~~~~~kla-~~~~iS~e-ll~d~~~l~~~i~~~la~~~~~~~d~~~l----~G~G~~~p~Gi~~~~--~~~~ 290 (497) ...+.++.++....+- ....|.+- =.+...++.+.+.+++.+++++..|++++ .+.....|.+.-... +..+ T Consensus 79 ~~~~~~~~~l~ID~~l~~~~~VddiD~~q~~~D~rse~~~~~G~aLA~~~D~~~~~~l~kaa~~~~~~~~~~~~~~G~~~ 158 (334) T protein:vir:80 79 QKNVSDKLNLTVDTVLYARHFFDKFDEWTSNLDVRKETAREDGIALARQYDQACIIQLQKCGDFLAPAHLKPAFHDGILL 158 (334) T ss_pred CCcccCceEEEEeeeeehhhhHhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccccCCcce Confidence 7777788777766532 22233221 12223479999999999999999999865 222211111110000 0000 Q ss_pred cccccchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhc Q lcl|NC_021309. 291 ASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLF 370 (497) Q Consensus 291 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 370 (497) ....... ......+...+...++.+...+....- T Consensus 159 ~~~~~g~----------------------------------------------~~~~~~~~~~l~~a~~~a~~~L~e~dv 192 (334) T protein:vir:80 159 PSTISGL----------------------------------------------AADAAADADVLVAAHRQGVEAMVFRDL 192 (334) T ss_pred eeccccc----------------------------------------------ccchhhhHHHHHHHHHHHHHHHHhcCC Confidence 0000000 000000011111222222222222211 Q ss_pred ----cCCceEEechhHHHHHHHHhhhcCc-eeccCcccccccccccccccccccceEecCCCCcCc-----------eEE Q lcl|NC_021309. 371 ----QTPNAVVMNPRDWELLRLTKDANGQ-YMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGT-----------ILV 434 (497) Q Consensus 371 ----~~~~~~~~n~~~~~~l~~lkd~~G~-~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~-----------~~~ 434 (497) ....+.+++|..|..|..-..-..+ |... .+..........++.|+||+.|+.+|... .+- T Consensus 193 p~~~~~~R~~vv~P~~y~~Ll~~~r~~n~d~~~s---~~~~~~~~g~i~~v~G~~V~~Sn~~P~~~~t~~~~g~~~~~~a 269 (334) T protein:vir:80 193 GDQLMSEGVTLLDPVIFSFLLEHDRLMNVEFGAK---EGGNSFVGGRIAMLNGVRVVETPRFPQSAITANALGADFNVTD 269 (334) T ss_pred CCCcCCceEEEeChHHHHHHhcccccccceeccc---cccccccceeEEEEeceEEEeecCCCCcccccccccccccccc Confidence 1234679999999988764221111 1000 00111222234578999999999999542 355 Q ss_pred EeeccceEEEEeecc--------cEEEeecccchhhhcCceEEEEEEeecceeecccceEEEEeeCCCC Q lcl|NC_021309. 435 GHFAPSVIQTARREG--------VTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGAT 495 (497) Q Consensus 435 gd~~~~~~~i~~r~~--------~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~~a~ 495 (497) |||+.....++-+.- ++.+..... ..|. -.+.+..=+|-+++||++++.++++-+-. T Consensus 270 gd~t~~~~~~~~~~Al~t~~~~~~~~e~~~~~-~~~~---d~i~~~~a~G~g~lRPeaa~vv~~~~~~~ 334 (334) T protein:vir:80 270 AEVRRKMITFIPSMALISAQVHPVSAQFWEEK-KDFG---HYLDTFQSYNIGQRRPDAVAVHDITVTNP 334 (334) T ss_pred ccccceEEEEEeCceEEEEEEeecceeeeech-hhHH---HHHHHHHHcCCceeccceEEEEEEeeecC Confidence 676654433333322 222322211 1121 12233355789999999988888877666 No 133 >protein:vir:8885 Length: 347 # NCBI annotation: major capsid protein A # Family: family:all:975 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813774;genbank:gi:29366729;genbank:GeneID:1258837 Probab=98.87 E-value=3.4e-10 Score=72.51 Aligned_cols=295 Identities=13% Similarity=0.063 Sum_probs=156.5 Q ss_pred Hhhhhhhhhhhhhccccccccc---cccchhhhHHHHHHHHhhhhHHhhcceeecC-CCceEEEEEcCCCccceeccccc Q lcl|NC_021309. 139 ADGETAPAAIGQNPFGSTGTFA---PGILPTFLPGIVEQLFYELSLADLISSRPVT-SPNLSYLTESAAHNNAAAVAEAG 214 (497) Q Consensus 139 ~~~~~~~~~~~~~~~~~~~~~g---~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~~p~~~~~~~~a~wv~Eg~ 214 (497) +.+....... ....+..++.+ .+.+.+|..+++......+.++++.++.++. ++++.+|+.- ..++.....|. T Consensus 1 ~a~~~~~~~~-~~~~g~~~~~~d~~al~ie~~~geV~~~f~~~s~~~~~~~~r~i~~G~sv~~~~iG--~~~~~~~~~g~ 77 (347) T protein:vir:88 1 MANATGGQQI-GANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVMG--RTKGYYLAPGE 77 (347) T ss_pred CCCcccchhh-hccCCCCccccchHHHHHHHHHHHHHHHHHHHhhhhhccccccccCcceEEEeeec--ceeeeeecccc Confidence 0000000000 01111111112 2345778888998888888999999988865 5678899754 34566777776 Q ss_pred cccc--ccccceeeEeeeeeEEe-eehhhHH-HHhhHHHHHHHHHHHHHHHHHHHHHhhhhcc----cCc-----ccccc Q lcl|NC_021309. 215 TYPF--SSEEFARVYEQVGKVAN-ALTITDE-GLRDAPELFNFVQGRLLEGIQRKEEVQLLAG----GGY-----PGVNG 281 (497) Q Consensus 215 ~~~~--s~~~f~~i~~~~~kla~-~~~iS~e-ll~d~~~l~~~i~~~la~~~~~~~d~~~l~G----~G~-----~~p~G 281 (497) .... .++..+++++...++-. -..|.+. -.+...++.+.+.++..+++++..|+.++.- ... +.+.| T Consensus 78 ~l~~~~~~~~~~~~~i~ID~~~y~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLA~~~D~~i~~~l~~~a~~~~~~~~~~~g 157 (347) T protein:vir:88 78 NLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAG 157 (347) T ss_pred CCCCCCCCCccceEEEEEechhhhhhhhhhHHHHhhcCCchHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccCC Confidence 6544 35678887776665422 2233322 2233347888899999999999999988632 110 00111 Q ss_pred ccccccccccccccchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHH Q lcl|NC_021309. 282 LLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDA 361 (497) Q Consensus 282 i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 361 (497) +-.... ...... +.......+...+++.++++ T Consensus 158 ~~~~~~-~~~~~~-----------------------------------------------~~~~~~~~~~~~~~~~i~~a 189 (347) T protein:vir:88 158 LGQAVV-LNIGAA-----------------------------------------------ADLVDVEARGKAILKGLTLA 189 (347) T ss_pred cccccc-cccccc-----------------------------------------------ccccchhhhHHHHHHHHHHH Confidence 100000 000000 00000000111123344444 Q ss_pred HHhhhhhhc-cCCceEEechhHHHHHHHHhh-hcCceeccCcccccccccccccccccccceEecCCCCcCc-------- Q lcl|NC_021309. 362 FVDIQLTLF-QTPNAVVMNPRDWELLRLTKD-ANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGT-------- 431 (497) Q Consensus 362 ~~~~~~~~~-~~~~~~~~n~~~~~~l~~lkd-~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~-------- 431 (497) ...+....- ....+++++|..|..|.+... ....|.-... .......+++|++|+.++.+|.+. T Consensus 190 ~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~~~~~~~~------~~~G~vg~i~G~~V~~s~nlp~~~~~~~~~~~ 263 (347) T protein:vir:88 190 RARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALID------PETGNIRNVMGFEVIEVPHLTVGGAGDNNPAD 263 (347) T ss_pred HHHHhhcCCCCCCCEEEeCHHHHHHHhcchhhhhhhhccccc------hhcceeeeeccceEEEeecccccccccccccc Confidence 433332221 124467899999888765332 2222322111 112234578999999999998421 Q ss_pred -----------------eEEEeeccceEEEEe--------ecccEEEeecccchhhhcCceEEEEEEeecceeecccceE Q lcl|NC_021309. 432 -----------------ILVGHFAPSVIQTAR--------REGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQ 486 (497) Q Consensus 432 -----------------~~~gd~~~~~~~i~~--------r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~a~~ 486 (497) -+-+|++.....++- -.++.++..... ..| ...+++..-+|.+++||++.+ T Consensus 264 ~~~~t~~~~~~~~~~~~~~~~d~~~~~~l~~~~~a~g~v~~~d~~~e~~r~~-~~~---~d~i~~~~~~G~~~~rPe~a~ 339 (347) T protein:vir:88 264 GVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARRP-EFQ---ADQIIGKYAMGHGGLRPEAAG 339 (347) T ss_pred cccccccccccccccccccccccCcEEEEEechhhhhheecccceeeeeech-hhH---HHHhhhhhhhcCceeccceEE Confidence 133445443222222 223344444322 223 246788899999999999999 Q ss_pred EEEeeCCC Q lcl|NC_021309. 487 LIQLKKGA 494 (497) Q Consensus 487 ~l~~~~~a 494 (497) .++++++| T Consensus 340 ~~~~~~a~ 347 (347) T protein:vir:88 340 ALVFTPAA 347 (347) T ss_pred EEEeCCCC Confidence 99999999 No 134 >protein:vir:103285 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277465;genbank:gi:71834107;genbank:GeneID:3562396 Probab=98.86 E-value=1.3e-09 Score=69.29 Aligned_cols=281 Identities=12% Similarity=0.071 Sum_probs=165.4 Q ss_pred hccccccccccccchh---hhHHHHHHHHhhhhHHhhcceeecCC---CceEEEEEcCCCccceecccc-cccccccccc Q lcl|NC_021309. 151 NPFGSTGTFAPGILPT---FLPGIVEQLFYELSLADLISSRPVTS---PNLSYLTESAAHNNAAAVAEA-GTYPFSSEEF 223 (497) Q Consensus 151 ~~~~~~~~~g~~v~p~---~~~~ii~~~~~~~~l~~~~~~~~~~~---~~~~~p~~~~~~~~a~wv~Eg-~~~~~s~~~f 223 (497) +...-...+|.++..+ +.+.+++...+....+.++++.+..+ .+++|..... .+.+.|++.+ .+.|..+..+ T Consensus 1 ~~~~~a~~~~~f~~~ql~~id~~v~e~~~~~l~~~~~i~v~~~~~~~~~~~~~~~~~~-~G~a~~~~~~~~dip~v~~~~ 79 (296) T protein:vir:10 1 MGVDKADAAGIWTVKQLTASLNKAYETEYDQNSVVNLFPVSNEIPGYAKYFEYPVFDG-VGIAQIVADYTDDLPLVDALA 79 (296) T ss_pred CcccchhhhHHHHHHHHHHHHHHHHhhhhcccccceecccccCCCCceeEEEeeeeec-cCceeEeCCCccccceeeccc Confidence 1111123445566654 33567777777777777666554322 3566666654 3567887764 5578889999 Q ss_pred eeeEeeeeeEEeeehhhHHHHhhH----HHHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccchhh Q lcl|NC_021309. 224 ARVYEQVGKVANALTITDEGLRDA----PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFG 299 (497) Q Consensus 224 ~~i~~~~~kla~~~~iS~ell~d~----~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~ 299 (497) +......+.++.-+.++.+=|+.+ .+|..--....++++...+|+-+++|+...+..|++|.++.........| T Consensus 80 ~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~ka~aA~~~~~~~~n~~~f~G~~~~g~~GLlN~p~v~~~~~~~~W-- 157 (296) T protein:vir:10 80 TERQGKVFRFGNAFLISIDEIKVGQATGQSLSTRKQSLAFEAHDKLLDKLVWSGSTAHGIPSVFDYPNINNVVSGGSW-- 157 (296) T ss_pred eeEEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeecccccceeEeecCCCccccccCCc-- Confidence 999999999999888886656543 35777778888899999999999999887778999998875432221111 Q ss_pred hhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhh--hhhccCCceEE Q lcl|NC_021309. 300 ATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQ--LTLFQTPNAVV 377 (497) Q Consensus 300 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~ 377 (497) .+....++++..++..+. +.+...++.++ T Consensus 158 -------------------------------------------------~~~t~i~~Di~~~~~~l~~~s~g~~~p~~l~ 188 (296) T protein:vir:10 158 -------------------------------------------------SQPTTAVSDITSLLDIIETSTNGQHRATHLL 188 (296) T ss_pred -------------------------------------------------cCHHHHHHHHHHHHHHHHHhhCceecceeEE Confidence 111133455555555443 33556677888 Q ss_pred echhHHHHHHHHhhhcCceeccCcccccccccccccccccccceEecCCCCc-CceEEEeeccceEEEEeecccEEEeec Q lcl|NC_021309. 378 MNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPL-GTILVGHFAPSVIQTARREGVTMQMTN 456 (497) Q Consensus 378 ~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~-~~~~~gd~~~~~~~i~~r~~~~i~~~~ 456 (497) ++|..+..|.......|.-++.-.. ......+|.+.|...+..... +..++.+-+.-++.+..-+.++ ... T Consensus 189 L~p~~~~~L~~~~~~~~~t~l~~ik------~~~~~l~i~~~~~l~~a~~~g~~~~v~~~~~~~~~~~~v~~~~~--~~~ 260 (296) T protein:vir:10 189 LPTTARRIMQNLVPGTSVSYGEFFR------QNNSGVTVEFVQYLNDYNGTGTSAAIAYEKDPNNMAIEIPEATN--ALP 260 (296) T ss_pred eCHHHHHHHhhccCCCCccHHHHHH------HhcCCceEEEeeeeccCCCCcceEEEEEEcCCceEEEEcCccee--eec Confidence 8998888886655555533322111 011122333334333222111 1123333333333332222222 221 Q ss_pred ccchhhhcCceEEEEEEeec-ceeecccceEEEEeeCCC Q lcl|NC_021309. 457 SNGTDFVDGKVTVRAEERLG-LLVYRPSAFQLIQLKKGA 494 (497) Q Consensus 457 ~~~~~f~~~~v~~r~~~r~~-~~v~~~~a~~~l~~~~~a 494 (497) .. ...=...++...|.+ ..+.+|.||++++.-+=| T Consensus 261 ~e---~~~l~~~~~~~~~~~Gv~i~~P~ai~~~dGI~~~ 296 (296) T protein:vir:10 261 AQ---PKDLHFKIPVTSKATGLIVYRPLTMAVMKGITFA 296 (296) T ss_pred cc---ccCceEEEeeEeeEEEEEEECCceeEEEeeeecC Confidence 11 111136778889996 677789999999777766 No 135 >protein:vir:94576 Length: 347 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919012;genbank:gi:119637776;genbank:GeneID:5179336 Probab=98.86 E-value=5.5e-10 Score=71.35 Aligned_cols=295 Identities=13% Similarity=0.103 Sum_probs=158.7 Q ss_pred Hhhhhhhhhhhhhcccccccccc---ccchhhhHHHHHHHHhhhhHHhhcceeecC-CCceEEEEEcCCCccceeccccc Q lcl|NC_021309. 139 ADGETAPAAIGQNPFGSTGTFAP---GILPTFLPGIVEQLFYELSLADLISSRPVT-SPNLSYLTESAAHNNAAAVAEAG 214 (497) Q Consensus 139 ~~~~~~~~~~~~~~~~~~~~~g~---~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~~p~~~~~~~~a~wv~Eg~ 214 (497) +.+........ ...+..++.|. +.+..|..++.+.....+.++++.++.++. ++++.+|+.. ..++.....|+ T Consensus 1 ma~~~~~~~~~-t~~g~~~~~~d~~al~ie~~~geV~~~f~~~s~~~~~~~~rti~~G~sv~~~~iG--~~~~~~~~~G~ 77 (347) T protein:vir:94 1 MANMNGGQQMG-KDQGKGMSAGDKLALFLKVFGGEVLTAFTRTSVTMNKHLVRSIQSGKSAQFPVLG--RTKAAYLQPGE 77 (347) T ss_pred CCccccccccc-cccccCCcccchHHHHHHHHhHHHHHHHHHHHhhhhhhhheeccccceEEeeecc--ceeEeeeecCc Confidence 00000000000 01111112222 355778889999999999999999988865 5688999753 35678888888 Q ss_pred cccc--ccccceeeEeeeeeEE-eeehhhH-HHHhhHHHHHHHHHHHHHHHHHHHHHhhhhc----ccCcc----c-ccc Q lcl|NC_021309. 215 TYPF--SSEEFARVYEQVGKVA-NALTITD-EGLRDAPELFNFVQGRLLEGIQRKEEVQLLA----GGGYP----G-VNG 281 (497) Q Consensus 215 ~~~~--s~~~f~~i~~~~~kla-~~~~iS~-ell~d~~~l~~~i~~~la~~~~~~~d~~~l~----G~G~~----~-p~G 281 (497) .... .+++.++.++...++- ....|.+ +=.+...++.+.+.++..+++++..|+.++. +.... . +.| T Consensus 78 ~l~~~~~~~~~~e~~ltID~~~y~~~~VddiD~~q~~~D~rs~~~~~~g~ALA~~~D~~i~~~l~~~a~~~~~~~~~~~g 157 (347) T protein:vir:94 78 NLDDKRKDMKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTAQLGESLAMAADGAVLAEMAKLCNLPTANNENIAG 157 (347) T ss_pred CCCCCcCCccccceEEEEcchhhhhhhhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccc Confidence 7754 4678888777665542 1222321 1122234788999999999999999998862 11110 0 011 Q ss_pred ccccccccccccccchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHH Q lcl|NC_021309. 282 LLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDA 361 (497) Q Consensus 282 i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 361 (497) ... ..++...... ............+++.++++ T Consensus 158 ~~~---~~~v~i~~~~--------------------------------------------~~~~~~~~~~~~~~d~i~~a 190 (347) T protein:vir:94 158 LGK---AHVLEVGDQA--------------------------------------------TLQGDQVKLGQAIIAQLTLA 190 (347) T ss_pred CCc---ceeEeeeccc--------------------------------------------cccccccccHHHHHHHHHHH Confidence 000 0000000000 00000001112223444444 Q ss_pred HHhhhhhhcc-CCceEEechhHHHHHHHHhhhc-CceeccCcccccccccccccccccccceEecCCCCcCc-------- Q lcl|NC_021309. 362 FVDIQLTLFQ-TPNAVVMNPRDWELLRLTKDAN-GQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGT-------- 431 (497) Q Consensus 362 ~~~~~~~~~~-~~~~~~~n~~~~~~l~~lkd~~-G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~-------- 431 (497) ...+....-- .+..++++|..|..|.+..+.+ +.+-.. .+.......++.|+||+.++.+|... T Consensus 191 ~~~Lde~dVP~~~R~~vv~P~~y~~LLk~~~~~~~~~~~~------~~~~~G~V~~v~G~~V~~Sn~~p~~~~~~~~~~~ 264 (347) T protein:vir:94 191 RAKLTGNYVPSSDRVFYTTPDNYSAILAALMPNAANYQAL------IDPSTGSIRNVMGFEVIEVPHLTAGGAGDNRAEE 264 (347) T ss_pred HHHhhhcCCCCCCCEEEeChHHHHHHHHhhcccccccccc------cccccceeEEeeceEEEEcCccccccCccccccc Confidence 4444333222 2334577899998887643222 222111 11122345578999999999998421 Q ss_pred -----------------eEEEeeccceEEEEee--------cccEEEeecccchhhhcCceEEEEEEeecceeecccceE Q lcl|NC_021309. 432 -----------------ILVGHFAPSVIQTARR--------EGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQ 486 (497) Q Consensus 432 -----------------~~~gd~~~~~~~i~~r--------~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~a~~ 486 (497) -|=+||+.....++-+ .++++++.... .++ ...+.+..=+|-.++||++.+ T Consensus 265 ~~~~~~~~~~~~~~~~~~y~~d~~~~~~l~~~~~A~~tv~~~~~~~e~~~~~--~~~--~~~i~~~~a~G~g~~rPe~a~ 340 (347) T protein:vir:94 265 GVAPTNQKHAFPDTASGDTRVALDNVVGLFNHRSAVGTVKLKDMALERARRA--NFQ--ADQIIAKYAMGHGGLRPEACG 340 (347) T ss_pred ccccccccccccccccccccccccceEEEEechhhhhhhhhcccceeeeech--hhh--hhhhhhhhhhcCcccccceeE Confidence 1334555433333322 33445554322 223 235667788899999999999 Q ss_pred EEEeeCC Q lcl|NC_021309. 487 LIQLKKG 493 (497) Q Consensus 487 ~l~~~~~ 493 (497) .+.++++ T Consensus 341 ~i~~~~a 347 (347) T protein:vir:94 341 ALVFKKA 347 (347) T ss_pred EEEecCC Confidence 9999999 No 136 >protein:vir:100057 Length: 375 # NCBI annotation: T7-like capsid protein # Family: family:all:975 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214206;genbank:gi:61806429;genbank:GeneID:3294737 Probab=98.84 E-value=1.5e-09 Score=68.96 Aligned_cols=311 Identities=13% Similarity=0.074 Sum_probs=156.6 Q ss_pred HHHHHhhhhhh-hhhhhhccccccccccccchhhhHHHHHHHHhhhhHHhhcceeecC-CCceEEEEEcCCCccceeccc Q lcl|NC_021309. 135 MGAFADGETAP-AAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVT-SPNLSYLTESAAHNNAAAVAE 212 (497) Q Consensus 135 ~~~~~~~~~~~-~~~~~~~~~~~~~~g~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~~p~~~~~~~~a~wv~E 212 (497) ..-..-.+... ..-.+..-+..+..=.+.+..|..++.......+.++++.++.++. ++++++|+.- ..++....- T Consensus 1 ~~~~~~~~~~~~n~~t~~~~~~~~~~~al~le~f~geV~~~f~~~si~~~~~~~rti~~Gksv~f~~iG--~~t~~~~t~ 78 (375) T protein:vir:10 1 MANANQVALGRSNLSTGTGYGGATDKYALYLKLFSGEMFKGFQHETIARDLVTKRTLKNGKSLQFIYTG--RMTSSFHTP 78 (375) T ss_pred CccccccccCccccCCccccccccchHHHHHHHHhHHHHHHHHHHHhhhccccccccccCceEEEEeee--eeEEeeecC Confidence 00000000000 0000000001111113445678888999999999999999988886 5688899863 345555544 Q ss_pred cccc---ccccccceeeEe--eeeeEEeeehhhHHHHhhHHHHHHHHHHHHHHHHHHHHHhhhhc----ccCcccccccc Q lcl|NC_021309. 213 AGTY---PFSSEEFARVYE--QVGKVANALTITDEGLRDAPELFNFVQGRLLEGIQRKEEVQLLA----GGGYPGVNGLL 283 (497) Q Consensus 213 g~~~---~~s~~~f~~i~~--~~~kla~~~~iS~ell~d~~~l~~~i~~~la~~~~~~~d~~~l~----G~G~~~p~Gi~ 283 (497) |+.. +..++...+.++ .-.++..+..-.-+=.+...++.+.+.++..+++++..|+.++. +.....|.+.- T Consensus 79 G~~i~~~~~~d~~~te~~l~ID~~~y~~~~VdDiD~aqa~~Dlr~e~s~~~G~aLA~~~D~~i~~~l~kaa~~~~p~~~~ 158 (375) T protein:vir:10 79 GTPILGNADKAPPVAEKTIVMDDLLISSAFVYDLDETLAHYELRGEISKKIGYALAEKYDRLIFRSITRGARSASPVSAT 158 (375) T ss_pred CcCcCCccccCCCCCceEEEecchhhhhhhHhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccc Confidence 5443 233444444334 33333322111111122234789999999999999999998762 22111111100 Q ss_pred ccc--cccccccccchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHH Q lcl|NC_021309. 284 QRS--TGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDA 361 (497) Q Consensus 284 ~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 361 (497) +.. +...+... .+.......+...+++.++++ T Consensus 159 ~~~~~Gg~~i~~~----------------------------------------------sg~~~~~~~ta~~~~~ai~~a 192 (375) T protein:vir:10 159 NFVEPGGTQIRVG----------------------------------------------SGTNESDAFTASALVNAFYDA 192 (375) T ss_pred cccccCcceeeec----------------------------------------------cccccccccCHHHHHHHHHHH Confidence 000 00000000 000000011122334445555 Q ss_pred HHhhhhhhcc-CCceEEechhHHHHHHHHhhhcCceeccCcccccccccccccccccccceEecCCCCcCc--------- Q lcl|NC_021309. 362 FVDIQLTLFQ-TPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGT--------- 431 (497) Q Consensus 362 ~~~~~~~~~~-~~~~~~~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~--------- 431 (497) ...+....-- ...+.+++|..|..|.+-+|.+ .+......+...........++|++|+.++.+|... T Consensus 193 ~~~Lde~~VP~~~R~~vv~P~~y~~Ll~~~d~~--~~~n~d~~~~~~~~~g~v~~i~Gv~V~~Sn~lP~~~~~~~~~g~~ 270 (375) T protein:vir:10 193 AAAMDEKGVSSQGRCAVLNPRQYYALIQDIGSN--GLVNRDVQGSALQSGNGVIEIAGIHIYKSMNIPFLGKYGVKYGGT 270 (375) T ss_pred HHHHhhcCCCCCCCEEEeChHHHHHHHhcCCcc--ceeeecccccceeccceEEEEeceEEEEecccccccccccccccc Confidence 4444433322 2345789999998887766544 222222222212223334578999999999999422 Q ss_pred ----------------------------eEEEee---c--------cceEEEEeecccEEEeeccc-chhhhcCceEEEE Q lcl|NC_021309. 432 ----------------------------ILVGHF---A--------PSVIQTARREGVTMQMTNSN-GTDFVDGKVTVRA 471 (497) Q Consensus 432 ----------------------------~~~gd~---~--------~~~~~i~~r~~~~i~~~~~~-~~~f~~~~v~~r~ 471 (497) .|-+|| + ..++..+.-.+++++++... ...++ ...+.+ T Consensus 271 ~~~~a~~~~~~~~~~~~~~~~~~~g~~~~y~~d~~~~~~~~~~~~~~~A~g~v~~~~~~~~~~~~~~~~~~q--~~~i~~ 348 (375) T protein:vir:10 271 TGETSPGNLGSHIGPTPENANATGGVNNDYGTNAELGAKSCGLIFQKEAAGVVEAIGPQVQVTNGDVSVIYQ--GDVILG 348 (375) T ss_pred ccccchhhhhccccccCCcceeeccccccccccccccCceEEEEEchhheeeeeeeccccccccchhhheee--eeeeee Confidence 133344 2 22233333355556654311 11222 356677 Q ss_pred EEeecceeecccceEEEEeeCCCCCC Q lcl|NC_021309. 472 EERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 472 ~~r~~~~v~~~~a~~~l~~~~~a~~~ 497 (497) ..-+|-.+.||++.+.|+..+++..- T Consensus 349 ~~a~G~~~lrp~~av~l~~~~~~~~~ 374 (375) T protein:vir:10 349 RMAMGADYLNPAAAVELYIGATAPSA 374 (375) T ss_pred eeeeccCccCceeEEEEecCcCcccc Confidence 88899999999999988887666666 No 137 >protein:vir:95318 Length: 328 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512264;genbank:gi:89152431;genbank:GeneID:3952987 Probab=98.83 E-value=5.1e-11 Score=77.01 Aligned_cols=281 Identities=14% Similarity=0.028 Sum_probs=153.3 Q ss_pred hhhhhhcccccccccccc-chhhhHHHHHHHHhhhhHHhhcceeecC-CCceEEEEEcCCCccceecccccccccccccc Q lcl|NC_021309. 146 AAIGQNPFGSTGTFAPGI-LPTFLPGIVEQLFYELSLADLISSRPVT-SPNLSYLTESAAHNNAAAVAEAGTYPFSSEEF 223 (497) Q Consensus 146 ~~~~~~~~~~~~~~g~~v-~p~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~~p~~~~~~~~a~wv~Eg~~~~~s~~~f 223 (497) +..-.+...+-......+ +......||+.+.+.++|++.+++.... +....+.++++- +.++|..=++..+.++.++ T Consensus 1 m~~~~~~~~TL~e~Akr~~~d~~~~~VIE~l~~~n~IL~~lpf~e~n~gt~~~~~v~~~L-P~~~fR~lN~g~~~s~~tt 79 (328) T protein:vir:95 1 MAVKGLTALTLADWGKRVDPNGKVDKIIELLGQTNPILQDMPFVEGNLPTGHRTTIRSGL-PSATWRLLNYGVQPSKSTT 79 (328) T ss_pred CCccccccccHHHHHhhhCcchhHHHHHHHHhccchhHhhcceeecccCCcceeeEeecc-CCceeeecCCccCccccee Confidence 000000000000001112 2235668999999999999999999886 445778888874 7899999999999999999 Q ss_pred eeeEeeeeeEEeeehhhHHHHhhH---HHHHHHHHHHHHHHHHHHHHhhhhcccCcccccccccccccccccc---c--- Q lcl|NC_021309. 224 ARVYEQVGKVANALTITDEGLRDA---PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASS---A--- 294 (497) Q Consensus 224 ~~i~~~~~kla~~~~iS~ell~d~---~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~---~--- 294 (497) .+++..++-+.+.+.|.+.+.+.. .++...-.....++++.++...|++|+.+..|.++.....-.+... + T Consensus 80 ~q~t~~l~ilgg~~eVDr~la~~~Gn~~~~ra~q~~~~~ka~~~~~~~~~iyGdsa~~p~~F~GL~~R~~~~s~~~a~qi 159 (328) T protein:vir:95 80 VQVTDSVGMLETYAEVDKSLADLNGNTAEFRLSEDRAFIEAMNQQMAQTLFYGDSSVNPQQFMGLSSRYSSLSAGNAQNI 159 (328) T ss_pred EEEEEEEEEEecceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCccCChhhhcchhhhcCccccccccce Confidence 999999999999999999998754 3455555566889999999999999998777666643332111100 0 Q ss_pred --cchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhh---------------------------------hhhhhhhhhhh Q lcl|NC_021309. 295 --SSLFGATSATVSNVKFPADGTNGAFVGQDTVASL---------------------------------KYGRVVTGAAG 339 (497) Q Consensus 295 --~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------------------------------~~~~~~~~~~~ 339 (497) ....+......... .........+++......+ +..+...-... T Consensus 160 idaGgtg~~~TSi~~v-~~g~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~g~~y~~y~~~~~w~~Gl~i~d~r~vvrI~N 238 (328) T protein:vir:95 160 IDAGGTGTDNTSIWLV-VWGENTVHGIFPKGKKAGIQMEDKGQVTLEDANGGKYEGYRTHYKWDNGLALRDWRYVVRIAN 238 (328) T ss_pred eecccCCCCceEEEEE-EEcCCeEEEecccccccCceeeecCceeeecCCCCeeeEEEEEEEeeeeeEEcCcccEEEEec Confidence 00000000000000 0000000011100000000 00000000000 Q ss_pred cccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHHHHHHHh-hhcCceeccCcccccccccccccccccc Q lcl|NC_021309. 340 SGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTK-DANGQYMGGNFFGNAYGNPVNGGKNIWG 418 (497) Q Consensus 340 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lk-d~~G~~i~~~~~~~~~~~~~~~~~~l~G 418 (497) .............+..+.+..++..++ +......+|++|..-...|++.. ++..-++-.....+. ..-.++| T Consensus 239 Id~~~l~~~~~~~~l~~lm~~a~~~ip-~~~~~~~~~y~n~~v~~~L~~q~~~~~n~~~~~~~~~g~------~~t~~~g 311 (328) T protein:vir:95 239 IDVSNLSEPSSAANIAKLMVKALHRIP-NRGMGRPVFYMNRTVGQALDLQSLEKTSLAISVKETEGE------WWTSFRG 311 (328) T ss_pred CcccccccccChhhHHHHHHHHHHHhc-cCCCCcceeehhHHHHHHHHHHHhcCcceeeeeeccCCc------ceeEECC Confidence 000000011123344555666666544 34444567999999999999874 433333332221111 1234789 Q ss_pred cceEecCCCCcCceEEE Q lcl|NC_021309. 419 VPVVTTPLIPLGTILVG 435 (497) Q Consensus 419 ~Pvv~~~~~~~~~~~~g 435 (497) +||..++++-.+...+. T Consensus 312 ipir~~dai~~tE~~vv 328 (328) T protein:vir:95 312 VPIRETDALLETEARVV 328 (328) T ss_pred eEEEEEeeeecCccccC Confidence 99998888765442221 No 138 >protein:vir:78739 Length: 332 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285448;genbank:gi:148724482;genbank:GeneID:5220210 Probab=98.80 E-value=5.2e-10 Score=71.50 Aligned_cols=291 Identities=12% Similarity=0.034 Sum_probs=151.0 Q ss_pred HHHHhhhhhhhhhhhhccccccccc----cccchhhhHHHHHHHHhhhhHHhhcceeecC-CCceEEEEEcCCCccceec Q lcl|NC_021309. 136 GAFADGETAPAAIGQNPFGSTGTFA----PGILPTFLPGIVEQLFYELSLADLISSRPVT-SPNLSYLTESAAHNNAAAV 210 (497) Q Consensus 136 ~~~~~~~~~~~~~~~~~~~~~~~~g----~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~~p~~~~~~~~a~wv 210 (497) -.+..... .......+..++.| .+.+..|..++++.....+.++++.++.++. ++++.+|+.. ..++... T Consensus 1 ~~~~~~~~---~~~~~~~~~~~~~~d~~~al~le~~~geV~~~f~~~s~~~~~~~~r~i~~G~tv~i~~ig--~~~~~~~ 75 (332) T protein:vir:78 1 MTTLSNFS---LPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFTG--KLSAGYH 75 (332) T ss_pred Cccccccc---CCccccCCccccccccchhhhhhhhhhhHHHHHHHHhhhhhccccccccccceEEEEecc--ceeEeee Confidence 00000000 00011111112222 2556788899999999999999999888776 5688999864 3455555 Q ss_pred ccccccc-cccccceeeEeeeeeEE-eeehhhHHHHhh---HHHHHHHHHHHHHHHHHHHHHhhhhc----ccCcccccc Q lcl|NC_021309. 211 AEAGTYP-FSSEEFARVYEQVGKVA-NALTITDEGLRD---APELFNFVQGRLLEGIQRKEEVQLLA----GGGYPGVNG 281 (497) Q Consensus 211 ~Eg~~~~-~s~~~f~~i~~~~~kla-~~~~iS~ell~d---~~~l~~~i~~~la~~~~~~~d~~~l~----G~G~~~p~G 281 (497) ..|.... ..+++-+++++..-+.- .-..|.+ +++ ..++.+.+.++..+++++..|+.++. +.....|.+ T Consensus 76 ~~g~~l~~~~~~~~~~~~l~ID~~ky~~~~Vdd--iD~~q~~~dl~~~~~~~~g~aLA~~~D~~i~~~l~~aa~~~~~~~ 153 (332) T protein:vir:78 76 TPGTPIVGDAGIKANEKTLVMDDLLVSSQFVYS--LDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVT 153 (332) T ss_pred cCCCCCCCCCCCCCceEEEEEehhhhhHHHHHh--HHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccCccc Confidence 5555543 33456566665554421 1122221 222 24788999999999999999988762 111111111 Q ss_pred ccccccccccccccchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHH Q lcl|NC_021309. 282 LLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDA 361 (497) Q Consensus 282 i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 361 (497) ..........+. ....+...+.+.++++ T Consensus 154 ~~~g~~~~~~~~----------------------------------------------------~~~~~~~~~~~~i~~a 181 (332) T protein:vir:78 154 GEPGGFHVNIGA----------------------------------------------------GNTNDAQAIVDGFFEA 181 (332) T ss_pred ccccccccccCC----------------------------------------------------ccccCHHHHHHHHHHH Confidence 000000000000 0001122234445555 Q ss_pred HHhhhhhhccC-CceEEechhHHHHHHHHhhhcCceeccCccccccccccc--ccccccccceEecCCCCcCc------- Q lcl|NC_021309. 362 FVDIQLTLFQT-PNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVN--GGKNIWGVPVVTTPLIPLGT------- 431 (497) Q Consensus 362 ~~~~~~~~~~~-~~~~~~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~--~~~~l~G~Pvv~~~~~~~~~------- 431 (497) ...+....--. ...++++|..|..|.+.+|. +.+-.. ..+..+.... ....++|++|+.++.+|... T Consensus 182 ~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~d~--~~~n~~-~~~~~~~~~~g~~i~~i~G~~V~~Sn~lp~~~g~~~~~~ 258 (332) T protein:vir:78 182 AAVLDERSAPQEGRVAVLSPRQYYSLISSVDT--NILNRE-IGNSQGDMNSGKGLYSIAGIRILKSNNLAGLYGQDLSSA 258 (332) T ss_pred HHHHhhcCCCccCCEEEeCHHHHHHHHhhcCc--eeeeee-ccccccceecceeeeEEeeeEEEecCccccCcccccccc Confidence 55554443322 23467799888888765442 111110 1111111111 13578999999999999532 Q ss_pred -------eEEEeeccceEEEEeec--------ccEEEeec--ccchhhhcCceEEEEEEeecceeecccceEEEEee Q lcl|NC_021309. 432 -------ILVGHFAPSVIQTARRE--------GVTMQMTN--SNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLK 491 (497) Q Consensus 432 -------~~~gd~~~~~~~i~~r~--------~~~i~~~~--~~~~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~~ 491 (497) .+-|||+...-.++-+. ++++++.. .....| .-.+++...+|.+++||++++.|+-. T Consensus 259 ~~~~~~n~~~~~~~~~~~~~~h~~a~~~v~~~~~~~~~t~~~~~~~~~---~d~i~~~~~~G~~v~rPe~~v~l~~a 332 (332) T protein:vir:78 259 AVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQ---GDLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) T ss_pred cccccccccccccccceEEeecccceeeeeeeccchhhhhcccchhhh---HhhhhhhhhhcCceecccceEEEeeC Confidence 24455554333333222 23333322 112222 24667778899999999999988766 No 139 >protein:vir:107687 Length: 319 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003898;genbank:gi:45686314;genbank:GeneID:2773027 Probab=98.79 E-value=4.5e-09 Score=66.35 Aligned_cols=303 Identities=13% Similarity=0.045 Sum_probs=165.6 Q ss_pred hHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccccccccchh---hhHHHHHHHHhhhhHHhhcceeecCC- Q lcl|NC_021309. 117 SFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPT---FLPGIVEQLFYELSLADLISSRPVTS- 192 (497) Q Consensus 117 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~p~---~~~~ii~~~~~~~~l~~~~~~~~~~~- 192 (497) ......... +......... ... .......+.|.++..+ +.+.+++...+....+.++++.+..+ T Consensus 1 ~~~~~~~~~-------~~~~~~~~~~--~~~---~~~da~~~~g~~~~~ql~~id~~v~e~~~~~l~~~~~i~v~~~~~~ 68 (319) T protein:vir:10 1 MTTKKFDEA-------DKSNVEMYLI--QAG---VKQDAAATMGIWTAQELHRIKSQSYEEDYPVGSALRVFPVTTELSP 68 (319) T ss_pred CCCcchhHH-------hhHHHHHHHh--hcc---chhhhhhhhhhHHHHHHHHHHHHHHhhhhcceechhhcccccCCCC Confidence 000000000 0000000000 000 0011112234444433 33467888888877777777653332 Q ss_pred --CceEEEEEcCCCccceecccc-cccccccccceeeEeeeeeEEeeehhhHHHHhhH----HHHHHHHHHHHHHHHHHH Q lcl|NC_021309. 193 --PNLSYLTESAAHNNAAAVAEA-GTYPFSSEEFARVYEQVGKVANALTITDEGLRDA----PELFNFVQGRLLEGIQRK 265 (497) Q Consensus 193 --~~~~~p~~~~~~~~a~wv~Eg-~~~~~s~~~f~~i~~~~~kla~~~~iS~ell~d~----~~l~~~i~~~la~~~~~~ 265 (497) .++.|..... .+.+.|++.+ .+.|..+..++......+.++.-+.++.+=|+.+ .+|..--....++++..+ T Consensus 69 ~~~~~~~~~~~~-~G~a~~~~d~~~dip~v~~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~ 147 (319) T protein:vir:10 69 TDKTFEYMTFDK-VGTAQIIADYTDDLPLVDALGTSEFGKVFRLGNAYLISIDEIKAGQATGRPLSTRKASACQLAHDQL 147 (319) T ss_pred ceEEEEeeeecc-ccceeeecCccccccceeccceeeEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHh Confidence 3566666655 4678898764 4578888889999999999988888876555433 357777788889999999 Q ss_pred HHhhhhcccCccccccccccccccccccccchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccc Q lcl|NC_021309. 266 EEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVA 345 (497) Q Consensus 266 ~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 345 (497) +|+-+++|+...+..|++|.++....+.+.+... T Consensus 148 ~n~i~f~G~~~~g~~GLlN~p~~~~~~~~~~~~~---------------------------------------------- 181 (319) T protein:vir:10 148 VNRLVFKGSAPHKIVSVFNHPNITKITSGKWIDV---------------------------------------------- 181 (319) T ss_pred hceEEEeecccccceeEEeCCCceeeecCCCCCc---------------------------------------------- Confidence 9999999988778899999987654332221110 Q ss_pred cccchhhhhhhHHHHHHHhhhh--hhccCCceEEechhHHHHHHHHhhhcCceeccCcccccccccccccccccccceEe Q lcl|NC_021309. 346 GSYPTAAEIAENVFDAFVDIQL--TLFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVT 423 (497) Q Consensus 346 ~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~ 423 (497) ...+...+++++..++..+.. .+...+..++++|..|..|.......|.-++.-... ....-+|.+.|... T Consensus 182 -~t~t~~~i~~di~~~~~~l~~~s~g~~~p~~L~L~p~~~~~L~~~~~~~~~t~l~~lk~------~~~~l~I~~~pel~ 254 (319) T protein:vir:10 182 -STMKPETAEAELTQAIETIETITRGQHRATNILIPPSMRKVLAIRMPETTMSYLDYFKS------QNSGIEIDSIAELE 254 (319) T ss_pred -cccCHHHHHHHHHHHHHHHHHhcCceeeceEEEecHHHHHhhhcccCCCCeeHHHHHHH------hcCCceEEEeeeec Confidence 011233445566666655543 255567789999999998875555555433332111 11112344444433 Q ss_pred cCCCCc-CceEEEeeccceEEEEeecccEEEeecccchhhhcC-ceEEEEEEeecce-eecccceEEEEee Q lcl|NC_021309. 424 TPLIPL-GTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDG-KVTVRAEERLGLL-VYRPSAFQLIQLK 491 (497) Q Consensus 424 ~~~~~~-~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~-~v~~r~~~r~~~~-v~~~~a~~~l~~~ 491 (497) ...... +..++...++-++.+..-+.++. .... .++ ...+....|.++. +.+|.||++++.. T Consensus 255 ~ag~~g~~~~v~y~~~~~~~~~~v~~~~~~--~~~e----~~~l~~~~~~~~r~~Gv~i~~P~ai~~~dGI 319 (319) T protein:vir:10 255 DIDGAGTKGVLVYEKNPMNMSIEIPEAFNM--LPAQ----PKDLHFKVPCTSKCTGLTIYRPMTIVLITGV 319 (319) T ss_pred ccCCCcceEEEEEecCCceEEEecCcceee--eeee----ecCceEEEeeeeeeEEEEEEccceeEeeecC Confidence 222111 11233333333333322122222 1110 012 2455667777644 5579999999988 No 140 >protein:vir:103323 Length: 364 # NCBI annotation: major capsid-like protein # Family: family:all:2806 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039668;genbank:gi:125999997;genbank:GeneID:4818399 Probab=98.78 E-value=2.4e-09 Score=67.87 Aligned_cols=303 Identities=11% Similarity=0.068 Sum_probs=150.0 Q ss_pred hhhhhhhhhhccccccccccccchhhhHHHHHHHHhhhhHHhhcceeecCC-CceEEEEEcCCCccceeccccccccccc Q lcl|NC_021309. 142 ETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTS-PNLSYLTESAAHNNAAAVAEAGTYPFSS 220 (497) Q Consensus 142 ~~~~~~~~~~~~~~~~~~g~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~-~~~~~p~~~~~~~~a~wv~Eg~~~~~s~ 220 (497) +...........+.++..=.+...+|..++.+.....+.++++..+.++.+ +++++|+.- ..++....-|+..--.. T Consensus 1 ms~~n~~t~~~~~~~~~~~al~le~f~geV~taf~~~s~~~~~~~~rti~~gkS~q~~~iG--~~~~~~~~~G~~ld~~~ 78 (364) T protein:vir:10 1 MSNPNVLTQPAVSASGEVDSLLIEKFNNRVHEQYLKGENLLQWFDVQEVVGTNSVSNKYIG--ETELQVLSPGKSPDASP 78 (364) T ss_pred CCCcccccccccccccchhhhhhhhhhhhHHHHHHHHHhhcCcceeeeecccceEEeeeee--eeEEeeeccCcccCCCC Confidence 111111111111222222234456788889999999999999999988865 588999863 34555555555544455 Q ss_pred ccceeeEeeeeeEEee-ehhhH-HHHhhHHH-HHHHHHHHHHHHHHHHHHhhhhcc---cCccccccccccccccccccc Q lcl|NC_021309. 221 EEFARVYEQVGKVANA-LTITD-EGLRDAPE-LFNFVQGRLLEGIQRKEEVQLLAG---GGYPGVNGLLQRSTGFTASSA 294 (497) Q Consensus 221 ~~f~~i~~~~~kla~~-~~iS~-ell~d~~~-l~~~i~~~la~~~~~~~d~~~l~G---~G~~~p~Gi~~~~~~~~~~~~ 294 (497) +..++.++..-++--. ..|.+ +=.++..+ +.+.+..++.+++++..|+.++.- .+.....+....+... ..+ T Consensus 79 ~~~~k~~itID~ll~a~~~V~diDe~q~~~D~vR~e~s~e~G~ALA~~~Dq~i~~~v~~aa~a~~~~~~~~~~~~--~~g 156 (364) T protein:vir:10 79 TEFDKNRLVVDTTVIARNTVAHFHDVQNDIDGLKSKLSVNQAKKLKKMEDSMVIQQLVLGGISNTEAIRKNPRVA--GHG 156 (364) T ss_pred cccCcEEEEecceeeechhhhhHHHHhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccCCccc--CCc Confidence 6667766665543311 12211 11233345 678888889999999999987520 0000000000000000 000 Q ss_pred cchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhc-cCC Q lcl|NC_021309. 295 SSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLF-QTP 373 (497) Q Consensus 295 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~ 373 (497) .. ....+...........+.+.++++...+....- ... T Consensus 157 ~~-----------------------------------------i~~~~~a~~~~~~~~~l~~ai~~a~~~LdEkdVP~~~ 195 (364) T protein:vir:10 157 FS-----------------------------------------IHIVGLASSFLTSPQYMMAAIEMAMEQQTEQEVDTSE 195 (364) T ss_pred ce-----------------------------------------eeecccCcchhhhHHHHHHHHHHHHHHHhhcCCCccc Confidence 00 000000000011112223334444433332222 123 Q ss_pred ceEEechhHHHHHHHHhhhcCceeccCcc-cccccccccccccccccceEecCCCCcCc--------------------- Q lcl|NC_021309. 374 NAVVMNPRDWELLRLTKDANGQYMGGNFF-GNAYGNPVNGGKNIWGVPVVTTPLIPLGT--------------------- 431 (497) Q Consensus 374 ~~~~~n~~~~~~l~~lkd~~G~~i~~~~~-~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~--------------------- 431 (497) -+.+++|..|..|.+- .+.+-.... .+..+........++|+||+.++.+|... T Consensus 196 R~~vv~P~~y~~Ll~~----~~lvn~d~~~~~~~~~~~G~v~~v~Gv~Vv~Sn~lP~~~~~~~~t~~~t~h~ls~~~~g~ 271 (364) T protein:vir:10 196 LCGLMPWTAFNCLRDA----DRIVDKSYTIAASDNTVDGFVLKSWNTPIVPSNRFPKLSDNTEGTGNTKHHKLSNAGNGN 271 (364) T ss_pred cEEEeChHHHHHHhcC----CccccccccccCCCccccceeEEEeceEEEeccccccccccccccccccccccccccCCc Confidence 4678999999888762 122211100 01111222234568999999999998410 Q ss_pred e--EEEeeccceEEEEee--------cccEEEeecccchhhhcCceEEEEEEeecceeecccceEEEEeeCCCCCC Q lcl|NC_021309. 432 I--LVGHFAPSVIQTARR--------EGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 432 ~--~~gd~~~~~~~i~~r--------~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~~a~~~ 497 (497) . ..+||+.....++-+ .+++.++..... .| ...+.+..=+|-.++||++++.++..+++.-- T Consensus 272 ~y~v~~d~~~~~~~~f~~~Al~tv~~~~~t~e~~~~~~-~~---~~~ida~~a~G~g~lRPeaa~~i~~~~~~~~~ 343 (364) T protein:vir:10 272 RYDVTAGQTSAQAVLFTQDALLVGRTISITGDIFYEKK-EK---TWYIDTFLAEGAIPDRWEAVAVVTAADTAELA 343 (364) T ss_pred ccccccccceeEEEEEecceEEEEEEecceeeeeeccc-ee---eeeeeeehcccCcccCccceEEEEecCCCCCc Confidence 0 125554433333333 455555554322 11 23344566699999999999988655544332 No 141 >protein:vir:10450 Length: 344 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848297;genbank:gi:30387487;genbank:GeneID:1733971 Probab=98.76 E-value=1.2e-09 Score=69.50 Aligned_cols=296 Identities=13% Similarity=0.076 Sum_probs=150.9 Q ss_pred Hhhhhhh---hhhhhhccccccccccccchhhhHHHHHHHHhhhhHHhhcceeecC-CCceEEEEEcCCCccceeccccc Q lcl|NC_021309. 139 ADGETAP---AAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVT-SPNLSYLTESAAHNNAAAVAEAG 214 (497) Q Consensus 139 ~~~~~~~---~~~~~~~~~~~~~~g~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~~p~~~~~~~~a~wv~Eg~ 214 (497) +...... .........+.+..=.+.+..|..++.+.....+.++++.++.++. ++++++|+. + ..++.....|+ T Consensus 1 ma~~~~~~~~n~~~~~~~~~~~~~~al~ie~~~geV~~~f~~~s~~~~~~~~r~i~~g~s~~~~~i-G-~~~~~~~~~G~ 78 (344) T protein:vir:10 1 MANMTGGQQLGTNQGKDVMAAGDKLALFLKVFGGEVLTAFARTSVTTSRHMVRSISSGKSAQFPVL-G-RTQAAYLAPGE 78 (344) T ss_pred CccccccccCCcccCCccCCccchhHHHHHHHHHHHHHHHHHHhhhcccceeeeecccceEEEEee-c-eeEEEeeecCC Confidence 0000000 0000001111112222345778888999999999999999999887 567889986 2 45677777787 Q ss_pred ccccc--cccceeeEeeeeeEEe-eehhhH-HHHhhHHHHHHHHHHHHHHHHHHHHHhhhhc----ccCcc-----cccc Q lcl|NC_021309. 215 TYPFS--SEEFARVYEQVGKVAN-ALTITD-EGLRDAPELFNFVQGRLLEGIQRKEEVQLLA----GGGYP-----GVNG 281 (497) Q Consensus 215 ~~~~s--~~~f~~i~~~~~kla~-~~~iS~-ell~d~~~l~~~i~~~la~~~~~~~d~~~l~----G~G~~-----~p~G 281 (497) ..+.+ ++.-+++++..-++-- ...|.+ +=.+...++.+.+.++..+++++..|+.++. +.... .|.| T Consensus 79 ~l~~t~~~~~~~e~~l~ID~~~y~~~~VdDiD~~q~~~D~r~~~~~~~G~aLA~~~D~~i~~~la~~a~~~~~~~~~~~g 158 (344) T protein:vir:10 79 NLDDIRKDIKHTEKVITIDGLLTADVLIYDIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAGLCNVESQYNENITG 158 (344) T ss_pred CCCCCCCCcccceEEEEEcchhhhhhhhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccc Confidence 77654 4667776665544221 122221 1122234788999999999999999988752 11111 1111 Q ss_pred ccccccccccccccchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHH Q lcl|NC_021309. 282 LLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDA 361 (497) Q Consensus 282 i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 361 (497) .-.............. .........+++.++++ T Consensus 159 ~~~~~~~~~~~~~~~~-----------------------------------------------t~~~~~~~~~~~~i~~a 191 (344) T protein:vir:10 159 LGTATVIETTQDKTTL-----------------------------------------------TDQVALGKEIIAALTKA 191 (344) T ss_pred ccccceeecccccccc-----------------------------------------------cchhhhHHHHHHHHHHH Confidence 1111000000000000 00000001122333333 Q ss_pred HHhhhhhhcc-CCceEEechhHHHHHHHHhhhc-CceeccCcccccccccccccccccccceEecCCCCcCc-------- Q lcl|NC_021309. 362 FVDIQLTLFQ-TPNAVVMNPRDWELLRLTKDAN-GQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGT-------- 431 (497) Q Consensus 362 ~~~~~~~~~~-~~~~~~~n~~~~~~l~~lkd~~-G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~-------- 431 (497) ...+....-- ...+.+++|..|..|..-+.-+ +.|. +......+...+++|+||+.++.+|.+. T Consensus 192 ~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~~~~------~~~~~~~G~V~~v~G~~V~~Sn~lp~~~~~~~~~~~ 265 (344) T protein:vir:10 192 RAALTKNYVPSSDRVFYCDPDSYSAILAALMPNAANYA------ALIDPEKGSIRNVMGFEVVEVPHLTAGGAGTSREGT 265 (344) T ss_pred HHHHhhcCCCccCCEEEeChHHHHHHhhcccccccccc------cccceeeeEEEEEeceEEEeccccccccCCcccccc Confidence 3333322221 2335688999988775432211 1121 1111111223468999999999998431 Q ss_pred -------------eEEEeeccceEE--------EEeecccEEEeecccchhhhcCceEEEEEEeecceeecccceEEEEe Q lcl|NC_021309. 432 -------------ILVGHFAPSVIQ--------TARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQL 490 (497) Q Consensus 432 -------------~~~gd~~~~~~~--------i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~ 490 (497) .+.++|+...-. .+...+++++..... ..|. ..+++..=+|-+++||++...+++ T Consensus 266 tg~~~~~~~~~~~~~~~~~s~~~~l~~h~~A~~~v~~~~~~~e~~r~~-~~~~---d~i~g~~~~G~~vlRPe~a~~v~~ 341 (344) T protein:vir:10 266 TGQKHAFPATKSGNDKVAKDNVIGLFMHRSAVGTVKLRDLALERARRA-NFQA---DQIIAKYAMGHGGLRPEAAGAVVF 341 (344) T ss_pred cCccccccCCcccceeeecceeEEEeechhhhhhhhhccceeecccch-hHHH---HHHHHHhhcccceecccceEEEEe Confidence 112244432111 222233344544432 2333 356677889999999999988888 Q ss_pred eCC Q lcl|NC_021309. 491 KKG 493 (497) Q Consensus 491 ~~~ 493 (497) ++- T Consensus 342 ~~~ 344 (344) T protein:vir:10 342 KTK 344 (344) T ss_pred ecC Confidence 877 No 142 >protein:vir:80068 Length: 301 # NCBI annotation: gp8 # Family: family:all:463 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468712;genbank:gi:157325292;genbank:GeneID:5601759 Probab=98.73 E-value=8.4e-09 Score=64.85 Aligned_cols=283 Identities=15% Similarity=0.078 Sum_probs=165.6 Q ss_pred ccccccccccchh---hhHHHHHHHHhhhhHHhhcceeecC---CCceEEEEEcCCCccceeccccc-ccccccccceee Q lcl|NC_021309. 154 GSTGTFAPGILPT---FLPGIVEQLFYELSLADLISSRPVT---SPNLSYLTESAAHNNAAAVAEAG-TYPFSSEEFARV 226 (497) Q Consensus 154 ~~~~~~g~~v~p~---~~~~ii~~~~~~~~l~~~~~~~~~~---~~~~~~p~~~~~~~~a~wv~Eg~-~~~~s~~~f~~i 226 (497) -.+.+.|.+...+ +.+.+++.+.+....++++++.+.. ...+.|...+. .+.+.|.+.++ +.|..+..++.. T Consensus 1 ~~~~~~g~f~~~~l~~id~~v~e~~~~~l~~r~l~~v~~~~~~~~~~~~~~~~~~-~G~~~~~~~~~~dip~~~~~~~~~ 79 (301) T protein:vir:80 1 MQGKITATIEARDLQAIDNVIYEPKQEELTARSVFPQKFDVNEGAESYSFDVMTR-SGAAKIIANGADDLPLVDVDMVRK 79 (301) T ss_pred CCccccchhhHHHHHHHHHHHHHhhhhhhhhhhhcccccCCCCceEEEEEeeecc-ceeEEEecCcccccccccccceeE Confidence 2223334444443 3457888888888888877664332 23556666544 45778887754 478888889999 Q ss_pred EeeeeeEEeeehhhHHHHhhH----HHHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccchhhhhh Q lcl|NC_021309. 227 YEQVGKVANALTITDEGLRDA----PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATS 302 (497) Q Consensus 227 ~~~~~kla~~~~iS~ell~d~----~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~ 302 (497) ....+.++.-+.++.+=|+.+ .+|..--....++++..++|+-+++|+..-+..|++|.++.......... T Consensus 80 ~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~f~G~~~~g~~GLlN~p~~~~~~~~~~~----- 154 (301) T protein:vir:80 80 SVPIYSIGIGLSYTIQDLRAARMQGTTVDAAKATTVRRAIAEKENSIAFRGEKKYAIKGAFEATGIQIDVSPTTG----- 154 (301) T ss_pred EEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEeeecccccceeeecCCCcccccccCcc----- Confidence 999999998888876655433 35777788888999999999999999988788999999875443322110 Q ss_pred hHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhh--hccCCceEEech Q lcl|NC_021309. 303 ATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLT--LFQTPNAVVMNP 380 (497) Q Consensus 303 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~n~ 380 (497) ..+.......+...+++++..++..+... +...+..++++| T Consensus 155 -------------------------------------~~~~~~w~~~t~~ei~~di~~~~~~l~~~s~g~~~p~~L~L~p 197 (301) T protein:vir:80 155 -------------------------------------VGNVSKWEKKTAEQIIDEIGEAHTKITVLPGYGTASLKLCLPP 197 (301) T ss_pred -------------------------------------cccccccccCCHHHHHHHHHHHHHHHHHhcCceecccEEEecH Confidence 00111112234555667777777776433 445667899999 Q ss_pred hHHHHHHHHh--hhcCceeccCcccccccccccccccccccceEecCCCCcCc-eEEEeeccceEEEEeecccEEEeecc Q lcl|NC_021309. 381 RDWELLRLTK--DANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGT-ILVGHFAPSVIQTARREGVTMQMTNS 457 (497) Q Consensus 381 ~~~~~l~~lk--d~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~-~~~gd~~~~~~~i~~r~~~~i~~~~~ 457 (497) ..+..|.... +..|.-++.-.... ....+|.+.|..........+ .++-.-++-.+.+.. .+.+..... T Consensus 198 ~~~~~L~~~~~~~~~~~tvl~~l~~~------~~~~~I~~~p~L~~~g~~g~~~~v~~~~~~d~~~~~v--~~~~~~~~~ 269 (301) T protein:vir:80 198 KQFELINKKRYSNEDSRSVLKVLQDN------AWFSAIVRVPDLAGMGTAGSDSFAVIHDSNETAELII--PMDITRHPE 269 (301) T ss_pred HHHHhhhhccccCCCCeeHHHHHHHH------cCcceEEEcceeccCCCCcccEEEEEecCCcEEEEEe--cCceeeecc Confidence 9999997543 44454333321100 111234444433322211111 122111121222211 122222111 Q ss_pred cchhhhcCc-eEEEEEEeec-ceeecccceEEEEee Q lcl|NC_021309. 458 NGTDFVDGK-VTVRAEERLG-LLVYRPSAFQLIQLK 491 (497) Q Consensus 458 ~~~~f~~~~-v~~r~~~r~~-~~v~~~~a~~~l~~~ 491 (497) -.+++ ..+-...|++ ..+.+|.||++++.- T Consensus 270 ----e~~~~~~~~~~~~r~~Gv~i~~P~ai~~~~GI 301 (301) T protein:vir:80 270 ----EYSFPRTKVPFEERTAGVVVRFPAAIVRVDGI 301 (301) T ss_pred ----eecCceeEeeeeeeeEEEEEEccceEEEEecC Confidence 12332 3344567775 466779999999988 No 143 >protein:vir:78935 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522824;genbank:gi:158345059;genbank:GeneID:5687425 Probab=98.71 E-value=7.6e-09 Score=65.09 Aligned_cols=293 Identities=12% Similarity=0.028 Sum_probs=158.1 Q ss_pred hhhhhhhhhhccccccccccccchhhhHHHHHHHHhhhhHHhhcceeecCC-CceEEEEEcCCCccceeccccccccccc Q lcl|NC_021309. 142 ETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTS-PNLSYLTESAAHNNAAAVAEAGTYPFSS 220 (497) Q Consensus 142 ~~~~~~~~~~~~~~~~~~g~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~-~~~~~p~~~~~~~~a~wv~Eg~~~~~s~ 220 (497) +.......+...+..++.-.+.+..|..++.+.....+.++++.++.++.+ +++++|+. + ..++.+..-|+....+. T Consensus 1 ms~~~~~t~~~~~~s~~d~al~le~f~geV~~af~~~s~~~~~~~~rti~~g~s~~~~~i-G-~~~~~~~~pG~~l~~~~ 78 (335) T protein:vir:78 1 MSFLNDLTRPNYAGKNADVDIHLEEHLGIVDKHFAYTSKFAPLMNIRDLRGSNVVRLDRL-G-NVEAKGRRAGEELERSR 78 (335) T ss_pred CCccccccccccccccchhhhhhhhhhhHHHHHHHHhhhhccccceeeeccceeEEEeee-e-eeeecccccCcccCCCC Confidence 111111222222333333356678899999999999999999999998864 58899975 3 34667776676666666 Q ss_pred ccceeeEeeeeeEEeeehhhHHHH---hhH---HHHHHHHHHHHHHHHHHHHHhhhh----cccCccccccc---ccccc Q lcl|NC_021309. 221 EEFARVYEQVGKVANALTITDEGL---RDA---PELFNFVQGRLLEGIQRKEEVQLL----AGGGYPGVNGL---LQRST 287 (497) Q Consensus 221 ~~f~~i~~~~~kla~~~~iS~ell---~d~---~~l~~~i~~~la~~~~~~~d~~~l----~G~G~~~p~Gi---~~~~~ 287 (497) +..++..+..-++- +++.++ ++. .++.+.+..++.+++++..|+.++ .+.....|..+ ++... T Consensus 79 ~~~~k~~itID~ll----~a~~~VddlDe~~~~yDvR~e~s~~~G~aLA~~~Dq~~~~~l~~aa~~~a~~~~~~~~~~G~ 154 (335) T protein:vir:78 79 VVNDKWNLTVDTLL----YLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDLEDAFSPGV 154 (335) T ss_pred cccCCeEEEeccee----echhhHhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCCcCCCc Confidence 67777777665543 344443 332 478999999999999999999765 23222111111 11000 Q ss_pred ccccccccchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhh Q lcl|NC_021309. 288 GFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQL 367 (497) Q Consensus 288 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 367 (497) ..+...... ....+...+.+.++.+...+.. T Consensus 155 ~~~~~~tg~-------------------------------------------------~~~~~~~~l~~a~~~a~~~l~e 185 (335) T protein:vir:78 155 LEKLDLTGL-------------------------------------------------TAKEAAEKIVRMHRRVVETFIE 185 (335) T ss_pred ceeeeeccc-------------------------------------------------cccccHHHHHHHHHHHHHHHHh Confidence 000000000 0000111122222332222221 Q ss_pred hhc----cCCceEEechhHHHHHHHHhhhcCceeccCcc--cccccccccccccccccceEecCCCCcCc---------- Q lcl|NC_021309. 368 TLF----QTPNAVVMNPRDWELLRLTKDANGQYMGGNFF--GNAYGNPVNGGKNIWGVPVVTTPLIPLGT---------- 431 (497) Q Consensus 368 ~~~----~~~~~~~~n~~~~~~l~~lkd~~G~~i~~~~~--~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~---------- 431 (497) ..- ...-+.+++|..|..|..-.. .+-.... .+..+........++|+||+.++.+|.+. T Consensus 186 kdvP~~~~~~rv~vv~P~~y~~Ll~~~~----l~n~~~~~s~~~~~~~~g~v~~v~Gv~V~~Sn~lP~~~~t~~~lg~a~ 261 (335) T protein:vir:78 186 RDLGDAVYSEGLTPMSPRVFSLLLEHDK----LMSVEYQATGATNDYVKSRVAILNGVKVLETPRFATKAISAHPLGRHF 261 (335) T ss_pred ccCCCCCCCccEEEeChHHHHHHhcccc----cccccccccccccccccceeEEeeceEEEeeccCCCCCCccccccccC Confidence 111 112468999999999886432 1111100 01112233445679999999999999542 Q ss_pred -eEEEeeccceEEEEe--------ecccEEEeecccchhhhcCceEEEEEEeecceeecccceEEEEeeCCCCCC Q lcl|NC_021309. 432 -ILVGHFAPSVIQTAR--------REGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 432 -~~~gd~~~~~~~i~~--------r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~~a~~~ 497 (497) .+-+||+.....++- -.++..++..... .|. ..+.+..=+|-.++||++.+.++++-.-.=+ T Consensus 262 n~~~~d~~~~~~~~~~~~Al~t~~~~~~~~e~~~~~~-~~~---~~i~~~~a~G~g~lRPe~a~~i~~tg~~~~~ 332 (335) T protein:vir:78 262 NVSAEEAERQIALFLPSKTLITAQVAPVQAKLWEDHD-QFS---WVLDTFQMYNIGARRPDTAGAIELKGIEAFD 332 (335) T ss_pred CcccccccceEEEEEecceEEEEEEEecccceeeccc-hhh---HhhhHHHHcCCcccCcceEEEEEecCCCccc Confidence 233444432222222 2223333332221 232 2344445589999999999999876554444 No 144 >protein:vir:94711 Length: 347 # NCBI annotation: capsid # Family: family:all:975 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338120;genbank:gi:77118198;genbank:GeneID:3707734 Probab=98.67 E-value=1.9e-09 Score=68.41 Aligned_cols=290 Identities=11% Similarity=0.055 Sum_probs=145.1 Q ss_pred Hhhhhhhhhhhhhccccccccc---cccchhhhHHHHHHHHhhhhHHhhcceeecC-CCceEEEEEcCCCccceeccccc Q lcl|NC_021309. 139 ADGETAPAAIGQNPFGSTGTFA---PGILPTFLPGIVEQLFYELSLADLISSRPVT-SPNLSYLTESAAHNNAAAVAEAG 214 (497) Q Consensus 139 ~~~~~~~~~~~~~~~~~~~~~g---~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~~p~~~~~~~~a~wv~Eg~ 214 (497) +... ....-....+..++.| .+.+.+|..+++......+.++++.++.++. ++++.+|+.. ..++.....|+ T Consensus 1 m~~~--~~~~~~t~~g~~~~~~d~~al~ik~f~~eV~~~f~~~s~~~~~~~~r~i~~G~sv~i~~iG--~~tv~~~t~G~ 76 (347) T protein:vir:94 1 MANV--PGQKIGTDQGKGKSSSDALALFLKVFAGEVLTAFTRRSVTADKHIVRTIQNGKSAQFPVMG--RTSGVYLAPGE 76 (347) T ss_pred CCCC--CccccccccccCCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccccceEEEeccc--ceeeeeecCCC Confidence 0000 0000000011111121 2345678888888888888889999888875 5678899863 45666666666 Q ss_pred ccccc--cccceeeEeeeeeEEeeehhhHHHHh------hHHHHHHHHHHHHHHHHHHHHHhhhhcc----cC-cc---- Q lcl|NC_021309. 215 TYPFS--SEEFARVYEQVGKVANALTITDEGLR------DAPELFNFVQGRLLEGIQRKEEVQLLAG----GG-YP---- 277 (497) Q Consensus 215 ~~~~s--~~~f~~i~~~~~kla~~~~iS~ell~------d~~~l~~~i~~~la~~~~~~~d~~~l~G----~G-~~---- 277 (497) ..+.+ ..+-+++++...++- +++.+++ ...++.+.+.++..+++++..|+.++.- .. ++ T Consensus 77 ~l~~~~~~~~~~e~~itID~~~----~~~~~VddiD~~q~~~D~~~~~~~~~g~aLa~~~D~~i~~~~~~~aa~~~~~~~ 152 (347) T protein:vir:94 77 RLSDKRKGIKHTEKVITIDGLL----TADVMIFDIEDAMNHYDVAGEYSNQLGEALAIAADGAVLAEMAILCNLPAASNE 152 (347) T ss_pred CcCCCCCCCCcceEEEEecchh----hhhHHhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc Confidence 65443 344555444433321 2333332 2247888899999999999999988621 11 11 Q ss_pred ccccccccccccccccccchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhH Q lcl|NC_021309. 278 GVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAEN 357 (497) Q Consensus 278 ~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 357 (497) .+.|+-..........+.. ..........++. T Consensus 153 ~~~g~~~~s~~~~~~~~~~------------------------------------------------~~~~~~~~~~~~~ 184 (347) T protein:vir:94 153 NIAGLGTASVLEVGKKADL------------------------------------------------DTPAKLGEAIIGQ 184 (347) T ss_pred ccCCCcccceeeccccccc------------------------------------------------cchhhhHHHHHHH Confidence 1111111000000000000 0000001111233 Q ss_pred HHHHHHhhhhhhc-cCCceEEechhHHHHHHHHhhhcC-ceeccCcccccccccccccccccccceEecCCCCcCc---- Q lcl|NC_021309. 358 VFDAFVDIQLTLF-QTPNAVVMNPRDWELLRLTKDANG-QYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGT---- 431 (497) Q Consensus 358 ~~~~~~~~~~~~~-~~~~~~~~n~~~~~~l~~lkd~~G-~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~---- 431 (497) ++.+...+....- ....+.+++|..|..|..-++-+. .|.-. .....+...+++|++|+.++.+|.+. T Consensus 185 i~~a~~~Lde~~VP~~~R~~vv~P~~~~~Ll~~~~~~~~~~~~~------~~~~~G~Vg~i~G~~V~~Sn~lp~~~~t~~ 258 (347) T protein:vir:94 185 LTIARAKLTSNYVPAGDRYFYTTPDNYSAILAALMPNAANYAAL------IDPETGNIRNVMGFVVVEVPHLVQGGAGET 258 (347) T ss_pred HHHHHHHHhhcCCCCCCcEEEeCHHHHHHHhccchhhhhhcccc------ccccccceEEEeceEEEecCcccccccccc Confidence 3333333332221 123467899998877654332111 11111 11111233579999999999998421 Q ss_pred --------------e--------EEEeeccceEEEE--------eecccEEEeecccchhhhcCceEEEEEEeecceeec Q lcl|NC_021309. 432 --------------I--------LVGHFAPSVIQTA--------RREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYR 481 (497) Q Consensus 432 --------------~--------~~gd~~~~~~~i~--------~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~ 481 (497) . +-+||+...-.++ ...+++++..... ..| ...+++..-+|.+++| T Consensus 259 ~~~~~~~~~aG~~~~~~~~~~~~~~~~~~~~~~l~~h~~A~~~v~~~~~~~e~~r~~-~~~---~d~i~~~~~~G~~~~r 334 (347) T protein:vir:94 259 RGDDGITIASGQKHAFPATASSDVKVTMDNVVGLFSHRSAVGTVKLRDLALERDRDV-DAQ---GDLIVGKYAMGHGGLR 334 (347) T ss_pred cccCcceecCcccccccccchhhhcccccceeEEEeehhhhhhhhcccccccchhch-hhH---HHHhhhhhhhcCcccc Confidence 0 2233333222221 1222333433221 223 2477888999999999 Q ss_pred ccceEEEEeeCCC Q lcl|NC_021309. 482 PSAFQLIQLKKGA 494 (497) Q Consensus 482 ~~a~~~l~~~~~a 494 (497) |++.+.|+++++- T Consensus 335 P~~a~~~~~~~A~ 347 (347) T protein:vir:94 335 PEAAGALVFSPAE 347 (347) T ss_pred cceeEEEEecCCC Confidence 9999999998444 No 145 >protein:vir:94622 Length: 341 # NCBI annotation: PfWMP4_37 # Family: family:all:2203 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762667;genbank:gi:115304375;genbank:GeneID:5142322 Probab=98.67 E-value=4.4e-09 Score=66.38 Aligned_cols=287 Identities=13% Similarity=0.011 Sum_probs=145.5 Q ss_pred hhhhhhccc---cccccccccchhhhHHHHHHHHhhhhHHhhcceee---cCCCceEEEEEcCCCccceecccccccccc Q lcl|NC_021309. 146 AAIGQNPFG---STGTFAPGILPTFLPGIVEQLFYELSLADLISSRP---VTSPNLSYLTESAAHNNAAAVAEAGTYPFS 219 (497) Q Consensus 146 ~~~~~~~~~---~~~~~g~~v~p~~~~~ii~~~~~~~~l~~~~~~~~---~~~~~~~~p~~~~~~~~a~wv~Eg~~~~~s 219 (497) ++..+...+ ++..-..++|..|...+++.+.+.+.+.++++-.+ ..++++.+|+.. .+.+.-..++...+.. T Consensus 1 ~~~~~~~~~~~~~t~~v~~fipei~s~~i~~~l~~~~v~~~~~~d~~~~~~~Gdtv~ip~~g--~~~~~d~~~~~~i~~~ 78 (341) T protein:vir:94 1 MALGNTITGPSINTQRGQQFIPEQWLSEVQMFRKAKMLDTSVVKTWGAQVKKGDTFHVPRIS--ELGVEDKATDVPVGVQ 78 (341) T ss_pred CcchhhhccccccchhHHHHHHHHHHHHHHHHHHhhcchhhccccccccccCCceEEEeccC--cceeeeecCCCccccc Confidence 111111111 11112235555577888888988888888775443 335689999753 4556666778777777 Q ss_pred cccceeeEeeeeeE-EeeehhhHH-HHhhHHHHHHHHHHHHHHHHHHHHHhhhhccc--Cccccccc-cccccccccccc Q lcl|NC_021309. 220 SEEFARVYEQVGKV-ANALTITDE-GLRDAPELFNFVQGRLLEGIQRKEEVQLLAGG--GYPGVNGL-LQRSTGFTASSA 294 (497) Q Consensus 220 ~~~f~~i~~~~~kl-a~~~~iS~e-ll~d~~~l~~~i~~~la~~~~~~~d~~~l~G~--G~~~p~Gi-~~~~~~~~~~~~ 294 (497) +.+-.++++...+. +.-+.|++. ..+...++.+.+.+...+++++++|+.++.-- +++.+.+. ....... T Consensus 79 ~~~~~~~~itiD~~~~~~~~i~d~d~~~~~~d~~~~~~~~~~~aLA~~~D~~i~~~~a~~~~~~~~~~~~~~~~~----- 153 (341) T protein:vir:94 79 PVNDTDFVITVDTDRTTAVALDDLLEIQASYDLRAPYLEAMGYALAKDMTGSILGLRAAVQNTASQNVFSSSNGA----- 153 (341) T ss_pred cccCceEEEEEeeeeecceeechHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCccccCcccc----- Confidence 77777777777444 344666664 44566788888889999999999998876321 01110000 0000000 Q ss_pred cchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhc-cCC Q lcl|NC_021309. 295 SSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLF-QTP 373 (497) Q Consensus 295 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~ 373 (497) .........++.++.+...+....- ... T Consensus 154 ---------------------------------------------------~t~~~~~~~~~~i~~a~~~Lde~~VP~~g 182 (341) T protein:vir:94 154 ---------------------------------------------------ITGNGQAFSFAVFLAARRLLLEADVPEEK 182 (341) T ss_pred ---------------------------------------------------ccCchhhhhHHHHHHHHHHHhhcCCCccC Confidence 0000000111222222222222211 123 Q ss_pred ceEEechhHHHHHHHHhhhcCceeccCcccccccccccccccccccceEecCCCCcCceEE------------------- Q lcl|NC_021309. 374 NAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILV------------------- 434 (497) Q Consensus 374 ~~~~~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~~~~------------------- 434 (497) ..++++|..+..|.+...-.. .... +......+...++.|++|+.++.+|.++... T Consensus 183 R~lvv~P~~~~~Ll~~~~~~~----~~~~-g~~~l~~G~ig~i~G~~V~~Sn~lp~~~~~~~~~~~~~~~~~~~~~~i~~ 257 (341) T protein:vir:94 183 IVLLISPGQESALFTIPQFIS----KDFI-NNAPIAQGQIGSLMGVRVIRTSLIGNNSATGWRNGAPTIAPAEATPGFTG 257 (341) T ss_pred CEEEeCHHHHHHHhhchhhhh----hhcc-ccchhheeeeeeEeceEEEEeccccccccccccccccceecccccccccc Confidence 457889999998865321111 1100 0011112223479999999999999654210 Q ss_pred --------EeeccceEEEEeeccc-EEEe-e-----------cccchhhh--cCceEEEEEEeecceeecccceEEEEee Q lcl|NC_021309. 435 --------GHFAPSVIQTARREGV-TMQM-T-----------NSNGTDFV--DGKVTVRAEERLGLLVYRPSAFQLIQLK 491 (497) Q Consensus 435 --------gd~~~~~~~i~~r~~~-~i~~-~-----------~~~~~~f~--~~~v~~r~~~r~~~~v~~~~a~~~l~~~ 491 (497) +|++...-.++-+.-+ ++++ . ......|. +-...+++..=+|.+|+||++.+-|... T Consensus 258 ~~~~~~~~~~~~~~~gl~~~~~av~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~G~~~lrp~~~v~~~~~ 337 (341) T protein:vir:94 258 SRYLPKQDSFTSLPATFTGNSRPVHTAVMCHMDWAAAVVSKAPRVTQSFENREQVWLMVGRQAYGARLYRPLHAVNIHTT 337 (341) T ss_pred cccccccccccccEEEEEEecccccceeeecchhhhccccccccccccchhhhhhhhhhhhhhhcccccCcceeEEEecC Confidence 1111100000000000 0000 0 00001111 1123455666789999999999888776 Q ss_pred CCCC Q lcl|NC_021309. 492 KGAT 495 (497) Q Consensus 492 ~~a~ 495 (497) +... T Consensus 338 ~~~~ 341 (341) T protein:vir:94 338 GDTV 341 (341) T ss_pred cCCC Confidence 6666 No 146 >protein:vir:103759 Length: 330 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024928;genbank:gi:48697198;genbank:GeneID:2846083 Probab=98.65 E-value=4.1e-10 Score=72.03 Aligned_cols=279 Identities=14% Similarity=0.022 Sum_probs=150.8 Q ss_pred hhhhccccc--cccccccch-hhhHHHHHHHHhhhhHHhhcceeecCCCc-eEEEEEcCCCccceecccccccccccccc Q lcl|NC_021309. 148 IGQNPFGST--GTFAPGILP-TFLPGIVEQLFYELSLADLISSRPVTSPN-LSYLTESAAHNNAAAVAEAGTYPFSSEEF 223 (497) Q Consensus 148 ~~~~~~~~~--~~~g~~v~p-~~~~~ii~~~~~~~~l~~~~~~~~~~~~~-~~~p~~~~~~~~a~wv~Eg~~~~~s~~~f 223 (497) ......+.. ......+.| ..+..||+.+.+.+.|++.+++....+.. ....++++ -+.++|..=++..+.++.++ T Consensus 1 m~~~~~~a~TL~e~AKr~~~d~~~~~IIE~l~~tn~IL~~lpf~e~N~~tg~~t~vrt~-LP~~~fR~lN~g~~~s~~tt 79 (330) T protein:vir:10 1 MATLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTG-LPTPTWRKLYGGVLPNKSST 79 (330) T ss_pred CCcCCCCcccHHHHHhhcCcchhHHHHHHHHhcCchHHhhcchhhccCCcccceeEEee-cCCchhhhcCCccccccceE Confidence 000000000 000111222 35567999999999999999987643322 12334444 36789999999999999999 Q ss_pred eeeEeeeeeEEeeehhhHHHHhhH---HHHHHHHHHHHHHHHHHHHHhhhhcccCcccccccccccccccccc------- Q lcl|NC_021309. 224 ARVYEQVGKVANALTITDEGLRDA---PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASS------- 293 (497) Q Consensus 224 ~~i~~~~~kla~~~~iS~ell~d~---~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~------- 293 (497) .+++..++-+.++..|.+.+.+.+ .++...-.....++++..+...|++|+.+..|.++...+.-.+... T Consensus 80 ~qvt~~l~ilgg~~eVDr~la~~~Gn~a~~ra~e~~~~ikam~q~~~~~~iyGD~a~~p~~F~GL~kR~~~~ta~~~~qv 159 (330) T protein:vir:10 80 AQVTDNCGMLEAYAEVDKALADLNGNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAENKDNV 159 (330) T ss_pred EEEEEEeEEecchhhhhhHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCChhhccchhhhcCCCCCCchhhe Confidence 999999999999999999998753 3456667777899999999999999998777776664433221100 Q ss_pred ccc-hhhhhhhHHHHHHhhhhhcchhhhhhh---------------------------------hhhhh--hhhhhhhhh Q lcl|NC_021309. 294 ASS-LFGATSATVSNVKFPADGTNGAFVGQD---------------------------------TVASL--KYGRVVTGA 337 (497) Q Consensus 294 ~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~---------------------------------~~~~~--~~~~~~~~~ 337 (497) ..+ ........... ..........+++.. +...+ +..+...-. T Consensus 160 IdaGGtG~~~TSi~~-v~wg~~~~~giyPkG~kaGl~~~d~g~~~~~~~dg~gg~y~~~~~~~~w~~Gl~i~d~r~vvRI 238 (330) T protein:vir:10 160 IDAGGTGSDNASAWL-VVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVARV 238 (330) T ss_pred eeccccccCceEEEE-EEEcCCeEEEEcccCccccceeeeccceeeecccCCCCceeEEeeeeeeeeeeEEeCcccEEEE Confidence 000 00000000000 000000000000000 00000 000000000 Q ss_pred hhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHHHHHHH-hhhcCceeccCcccccccccccccccc Q lcl|NC_021309. 338 AGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLT-KDANGQYMGGNFFGNAYGNPVNGGKNI 416 (497) Q Consensus 338 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~l-kd~~G~~i~~~~~~~~~~~~~~~~~~l 416 (497) ...............++.+.++.+...++ +.+....+|++|..-...|++. .++..-.+-...+. |.+ .-.+ T Consensus 239 ~NIdvs~l~~~~~~~~li~lm~~A~~~ip-~~~~g~~~~y~n~~v~~~L~~q~~~k~n~~l~~~~~~---g~~---~t~~ 311 (330) T protein:vir:10 239 CNIDVSDLATSANAQALIKYMIMAAERIP-QLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVS---GER---VMTF 311 (330) T ss_pred eecccccCCCCccHHHHHHHHHHHHHhcc-CCCCCcceeeechHHHHHHHHHHhhcccceeeeeecC---Cee---eEEE Confidence 01111111112233345666667666554 3444456799999999999986 44443333222211 111 1357 Q ss_pred cccceEecCCCCcCceEEE Q lcl|NC_021309. 417 WGVPVVTTPLIPLGTILVG 435 (497) Q Consensus 417 ~G~Pvv~~~~~~~~~~~~g 435 (497) +|+||..++++-.+...+. T Consensus 312 ~gipir~~Dail~tE~~vv 330 (330) T protein:vir:10 312 DGIPVQRTDALLNTESRVV 330 (330) T ss_pred CCeEEEEEeeeecCccccC Confidence 8999999888765542221 No 147 >protein:vir:6324 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877471;genbank:gi:33300843;uniprot:Q7Y2D3;genbank:GeneID:1482613 Probab=98.64 E-value=1.3e-08 Score=63.74 Aligned_cols=294 Identities=12% Similarity=0.023 Sum_probs=158.2 Q ss_pred hhhhhhhhhhccccccccccccchhhhHHHHHHHHhhhhHHhhcceeecCC-CceEEEEEcCCCccceeccccccccccc Q lcl|NC_021309. 142 ETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTS-PNLSYLTESAAHNNAAAVAEAGTYPFSS 220 (497) Q Consensus 142 ~~~~~~~~~~~~~~~~~~g~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~-~~~~~p~~~~~~~~a~wv~Eg~~~~~s~ 220 (497) +.......+...+..++.-.+.+.+|..++.+.....+.++++.++.++.+ +++++|+. + ..++....-|+....+. T Consensus 1 ms~~~~~tr~~~~~s~~d~al~le~f~geV~~af~~~s~~~~~~~~rti~~g~s~~~~~i-G-~~~~~~~~pG~~l~~~~ 78 (335) T protein:vir:63 1 MSFLNDLTRPNYAGKNADVDIHLEEHLGIVDKHFAYTSKFAPLMNIRDLRGSNVVRLDRL-G-NVEAKGRRAGEELERSR 78 (335) T ss_pred CCCcccchhhhcccccchhheehhhhhhhHHHHHHhhhhhccccceeeeccceeEEEeee-e-eeeeecccCCcCcCCCC Confidence 111111122222233333346678899999999999999999999998865 57889986 2 45677777777766666 Q ss_pred ccceeeEeeeeeEEeeehhhHHHH---hhH---HHHHHHHHHHHHHHHHHHHHhhhh----cccCccccccc---ccccc Q lcl|NC_021309. 221 EEFARVYEQVGKVANALTITDEGL---RDA---PELFNFVQGRLLEGIQRKEEVQLL----AGGGYPGVNGL---LQRST 287 (497) Q Consensus 221 ~~f~~i~~~~~kla~~~~iS~ell---~d~---~~l~~~i~~~la~~~~~~~d~~~l----~G~G~~~p~Gi---~~~~~ 287 (497) +..++.++..-++- +++.++ ++. .++.+.+..++.+++++..|+.++ .+.....|.++ ++... T Consensus 79 ~~~~k~~itVD~ll----~a~~~I~dlDe~~~~yDvRse~s~e~G~aLA~~~D~~~~~~i~~aa~~~a~~~~~~~~~~G~ 154 (335) T protein:vir:63 79 VVNDKWNLTVDTLL----YLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDLEDAFSPGV 154 (335) T ss_pred ccccceEEEeccee----echhhhhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccCCCcCCCc Confidence 67777777666543 333333 332 479999999999999999999765 33332111111 11000 Q ss_pred ccccccccchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhh Q lcl|NC_021309. 288 GFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQL 367 (497) Q Consensus 288 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 367 (497) ......+.. ....+...+...++.+...+.. T Consensus 155 ~~~~~~tg~-------------------------------------------------~~~~~~~~l~~a~~~a~~~L~e 185 (335) T protein:vir:63 155 LEKLDLTGL-------------------------------------------------TAKQAADKIVRMHRRVVETFID 185 (335) T ss_pred ceeeeeccC-------------------------------------------------cccccHHHHHHHHHHHHHHHHh Confidence 000000000 0000111112233333333332 Q ss_pred hhc----cCCceEEechhHHHHHHHHhhhcCc-eeccCcccccccccccccccccccceEecCCCCcCc----------- Q lcl|NC_021309. 368 TLF----QTPNAVVMNPRDWELLRLTKDANGQ-YMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGT----------- 431 (497) Q Consensus 368 ~~~----~~~~~~~~n~~~~~~l~~lkd~~G~-~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~----------- 431 (497) ..- ...-+.+++|..|..|..-+.--.+ |.. ..+...........++|+||+.++.+|.+. T Consensus 186 ~dVP~~~~~dr~~vv~P~~y~~Ll~~~~l~n~~~~~---s~~~~~~~~g~v~~v~Gv~V~~sn~lP~~~~t~~~lg~a~n 262 (335) T protein:vir:63 186 RDLGDAVYSEGLTPMSPRVFSLLLEHDKLMNVEYQA---TGATNDYVKSRVAILNGVKVLETPRFATKAIAAHPLGRHFN 262 (335) T ss_pred ccCCCcccCceEEEeChHHHHHHhcccccccccccc---ccccccccCceeEEeeceEEEeeccCCCCCcccccccccCC Confidence 221 1224679999999988764321111 110 001112233445679999999999999432 Q ss_pred eEEEeeccceEEEEe--------ecccEEEeecccchhhhcCceEEEEEEeecceeecccceEEEEeeCCCCCC Q lcl|NC_021309. 432 ILVGHFAPSVIQTAR--------REGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 432 ~~~gd~~~~~~~i~~--------r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~~a~~~ 497 (497) .+-|||......++- -.+++.++..+.. .|. ..+.+..=+|-.++||++++.++++-.-.=+ T Consensus 263 ~~~~d~~~~~~~~~~~~Al~t~~~~~vt~e~~~~~~-~~~---~~i~~~~a~G~g~lRPe~a~~i~~tg~~~~~ 332 (335) T protein:vir:63 263 VSAEESERQIALFLPSKTLITAQVAPVQAKLWEDNE-KFS---WVLDTFQMYNIGARRPDTAGAIELKGIGAFD 332 (335) T ss_pred ccccccceeEEEEEecceEEEEEEeecccceeeccc-hhh---HHhHHHHHcCCcccccceEEEEEEcCCCcee Confidence 344566443322222 2223333322221 222 2334445589999999999999874433222 No 148 >protein:vir:105645 Length: 400 # NCBI annotation: putative major capsid protein # Family: family:all:2806 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425009;genbank:gi:83571757;uniprot:Q2WC43;genbank:GeneID:3837286 Probab=98.56 E-value=9.1e-09 Score=64.66 Aligned_cols=300 Identities=11% Similarity=0.089 Sum_probs=150.3 Q ss_pred hhhhhhhhhhccccccccccccchhhhHHHHHHHHhhhhHHhhcceeecCC-CceEEEEEcCCCccceeccccccccccc Q lcl|NC_021309. 142 ETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTS-PNLSYLTESAAHNNAAAVAEAGTYPFSS 220 (497) Q Consensus 142 ~~~~~~~~~~~~~~~~~~g~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~-~~~~~p~~~~~~~~a~wv~Eg~~~~~s~ 220 (497) +...........+.++..-.+.+..|..++.......+.++++..++++.+ +++++|+. + ..++.+..-|+..--+. T Consensus 1 Ms~~n~~t~p~~~gsg~~~aL~Le~f~GeV~taF~~~si~~~~~~vRtI~~gkS~qf~~l-G-~s~a~y~~pG~~ldg~~ 78 (400) T protein:vir:10 1 MSTPNNLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENIMSYFDVQTVTGTNTVSNKYL-G-ETELQVLAPGQSPAATS 78 (400) T ss_pred CCCCccccccccccccchhhhHHhHhcchHHHHHHHHhhhcccceeeeecccceEEEEEe-e-eeEEeeecCCCCcCCCC Confidence 111111112222233333345667788889999999999999999999875 57889986 2 45677777777755455 Q ss_pred ccceeeEeeeeeEEee-ehhhH-HHHhhHHH-HHHHHHHHHHHHHHHHHHhhhhc----cc--Ccc----cccccccccc Q lcl|NC_021309. 221 EEFARVYEQVGKVANA-LTITD-EGLRDAPE-LFNFVQGRLLEGIQRKEEVQLLA----GG--GYP----GVNGLLQRST 287 (497) Q Consensus 221 ~~f~~i~~~~~kla~~-~~iS~-ell~d~~~-l~~~i~~~la~~~~~~~d~~~l~----G~--G~~----~p~Gi~~~~~ 287 (497) +..++..+..-.+-.. ..|.+ +=.++..+ +.+.+..++.+++++..|+.++. +. -+. .|.|+-.... T Consensus 79 ~~~dk~~ItIDtLL~a~~~V~dlDd~q~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~a~~a~t~~~~~~~~g~~~g~s 158 (400) T protein:vir:10 79 TQADKNQLVIDATVIARNTVAHLHDVQGDIDSLKPKLATNQAKQLKKMEDEMLIQQMLLGGIANTQAKRTNPRVKGHGFS 158 (400) T ss_pred cccCcEEEEeCceeeecchhhhHHHHhhccccccHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCCccccccc Confidence 6677766665554321 22211 11122345 67888888999999999987652 10 011 1222221111 Q ss_pred ccccccccchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhh Q lcl|NC_021309. 288 GFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQL 367 (497) Q Consensus 288 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 367 (497) ......... ...+...+...++.+...+.. T Consensus 159 ~~v~~~~~~--------------------------------------------------~~~~~~~l~~A~~~A~~~LdE 188 (400) T protein:vir:10 159 VNVEVNEGE--------------------------------------------------ALVNPQYVMAAVEFALEQQLE 188 (400) T ss_pred eeecccccc--------------------------------------------------cccCHHHHHHHHHHHHHHHHh Confidence 100000000 000001111222333333222 Q ss_pred hhccCCceEEechhHHHHHHHHhhhcCceeccCccccc-cc-ccccccccccccceEecCCCCcCc-------------- Q lcl|NC_021309. 368 TLFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNA-YG-NPVNGGKNIWGVPVVTTPLIPLGT-------------- 431 (497) Q Consensus 368 ~~~~~~~~~~~n~~~~~~l~~lkd~~G~~i~~~~~~~~-~~-~~~~~~~~l~G~Pvv~~~~~~~~~-------------- 431 (497) ..-......+++|..|..+.+ +.+ +|+.-.+... .+ ........++|+||+.++.+|... T Consensus 189 kdVP~~d~vvl~pp~~Ys~Ll--~~d--kLvnrdf~~s~~g~~~~g~v~~v~Gv~Iv~Sn~lP~~a~~~~~~~lS~a~~G 264 (400) T protein:vir:10 189 QEVDISDVAILMPWRYFNVLR--DAD--RIVDKSYTISQSGATIQGFVLSSYNCPVIPSNRFPKYSQGQKHHLLSNEDNG 264 (400) T ss_pred cCCCccceEEEcCHHHHHHHH--hCC--cccchhccccCCCccccceEEEEeceEEEeeCcCCcccCcccccccccCCCC Confidence 221122344455444433332 222 2332222211 11 112223468999999999998421 Q ss_pred -e--EEEeeccceEEEEeecccE-EEeecccchhhh---cCceEEEEEEeecceeecccceEEEEeeCCCCCC Q lcl|NC_021309. 432 -I--LVGHFAPSVIQTARREGVT-MQMTNSNGTDFV---DGKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 432 -~--~~gd~~~~~~~i~~r~~~~-i~~~~~~~~~f~---~~~v~~r~~~r~~~~v~~~~a~~~l~~~~~a~~~ 497 (497) . +-|||+.....++-+.-+- ++.-+-....|. +-...+-+.+=+|-.++||+++..++.+-.++.- T Consensus 265 ~~y~~t~d~s~~~av~F~~sAv~tvk~~~lt~~~~~d~r~~~~~id~~~a~G~g~~RPeaa~vv~~~~~~~~~ 337 (400) T protein:vir:10 265 YRYDPIAEMNGAIAVLFTADALLVGRSIDVIGDIFYEKKEKTYYIDTFMSEGAIPDRWEAVSVVTTKRQSTGA 337 (400) T ss_pred ccCCccccccceeEEEEehhheEEEEeeccccccccchhhHHHHHHHHHHhCCcccchhheEEEEecCCcccc Confidence 1 3377766544444433221 221111111111 1123334456678899999999999998777765 No 149 >protein:vir:7324 Length: 335 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848215;genbank:gi:30387386;genbank:GeneID:2641870 Probab=98.54 E-value=1.5e-09 Score=68.98 Aligned_cols=278 Identities=12% Similarity=0.001 Sum_probs=147.3 Q ss_pred hccccccc-----ccccc-chhhhHHHHHHHHhhhhHHhhcceeecCCCc-eEEEEEcCCCccceecccccccccccccc Q lcl|NC_021309. 151 NPFGSTGT-----FAPGI-LPTFLPGIVEQLFYELSLADLISSRPVTSPN-LSYLTESAAHNNAAAVAEAGTYPFSSEEF 223 (497) Q Consensus 151 ~~~~~~~~-----~g~~v-~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~-~~~p~~~~~~~~a~wv~Eg~~~~~s~~~f 223 (497) +.....+. ....+ +......||+.+.+.+.|++.+++....+.. ....++++ -+.++|..=++..+.++.++ T Consensus 1 m~~~~~~a~TL~E~Akr~~~d~~~~~IIE~l~~tneIL~~lpf~e~N~~tg~~~~vrt~-LP~~~fR~lN~g~~~s~~tt 79 (335) T protein:vir:73 1 MALIGQTLPSLLDIYNRTDKNGRIARIVEQLAKTNDILTDAIYVPCNDGSKHKTTIRAG-IPEPVWRRYNQGVQPTKTQT 79 (335) T ss_pred CCcCCCCchhHHHHHhhcCcchhHHHHHHHHhcCchHHhhcchhcccCCcccceeEEEe-cCCchhhhcCCccccccceE Confidence 11111110 01111 1224556999999999999999987644322 12334444 36789999999999999999 Q ss_pred eeeEeeeeeEEeeehhhHHHHhhH---HHHHHHHHHHHHHHHHHHHHhhhhcccCcccccccccccccccccc------- Q lcl|NC_021309. 224 ARVYEQVGKVANALTITDEGLRDA---PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASS------- 293 (497) Q Consensus 224 ~~i~~~~~kla~~~~iS~ell~d~---~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~------- 293 (497) .+++..++-+.++..|.+.+.+.+ .++...-.....+++...+...|++|+.+..|.++...+.-.+... T Consensus 80 ~qvt~~l~ilgg~~eVDr~La~~~Gn~a~~ra~e~~~~ikam~q~~~~~~iyGDsa~~p~~FdGL~kR~~~~st~~a~~a 159 (335) T protein:vir:73 80 VPVTDTTGMLYDLGFVDKALADRSNNAAAFRVSENMGKLQGFNNKVARYSIYGNTDAEPEAFMGLAPRFNTLSTSKAASA 159 (335) T ss_pred EEEEEEEEEecchhhhhHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhccCCcCCChhhccchhhhhcCccccccCcc Confidence 999999999999999999887643 3466666777899999999999999998777777664332221000 Q ss_pred ---ccc-hhhhhhhHHHHHHhhhhhcchhhhhhh-------------------------------hhhhh--hhhhhhhh Q lcl|NC_021309. 294 ---ASS-LFGATSATVSNVKFPADGTNGAFVGQD-------------------------------TVASL--KYGRVVTG 336 (497) Q Consensus 294 ---~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~-------------------------------~~~~~--~~~~~~~~ 336 (497) ..+ ........... ..........+++.. +...+ +..+..-- T Consensus 160 ~~iIdaGGtG~~~TSi~~-v~wg~~~~~giyPkG~kaGl~~~d~g~~~~~d~~G~~y~~~~~~~~w~~Gl~i~d~r~vvR 238 (335) T protein:vir:73 160 ENVFSAGGSGSTNTSIWF-MSWGENTAHMIYPEGMVAGFQHEDLGDDLVSDGNGGQFRAYRDEFKWDIGLSVRDWRSISR 238 (335) T ss_pred cceeeccccccCceEEEE-EEEcCCeeEEEcccCccccceeeeccceeeecCCCCEEeEEEeeeeeeeeeEEeCcccEEE Confidence 000 00000000000 000000000011100 00000 00000000 Q ss_pred hhhccccccc-ccchhhhhhhHHHHHHHh-hhhhhccCCceEEechhHHHHHHHHhhhcCceecc-Cccccccccccccc Q lcl|NC_021309. 337 AAGSGSGVAG-SYPTAAEIAENVFDAFVD-IQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGG-NFFGNAYGNPVNGG 413 (497) Q Consensus 337 ~~~~~~~~~~-~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~i~~-~~~~~~~~~~~~~~ 413 (497) .......... ......++.+.+..++.. ..+..+....+|++|..-...|++..-..++.-+. ..+. |.+ . T Consensus 239 I~NIdvs~l~~d~~~~~~l~~lmi~a~~~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~n~~l~~~~~~---g~~---~ 312 (335) T protein:vir:73 239 ICNIDVTTLTKDASTGADLISMMVDAYYARDVAMLGDGKEVIYANKTIHAWLHKQAMNAKNVNLTIEEYG---GKK---I 312 (335) T ss_pred EeecccccccccccchhhHHhhHHHHHHHHhccCCCCCceEEEechHHHHHHHHHHhccCceeeeeeccC---Cce---e Confidence 0000000000 011223445555566542 33445566678999999999999865444443332 2211 111 1 Q ss_pred ccccccceEecCCCCcCceEEEe Q lcl|NC_021309. 414 KNIWGVPVVTTPLIPLGTILVGH 436 (497) Q Consensus 414 ~~l~G~Pvv~~~~~~~~~~~~gd 436 (497) -.++|+||..++++-.+...+-. T Consensus 313 t~~~gipir~~Dail~tE~~v~~ 335 (335) T protein:vir:73 313 VSFLGIPIRRVDAILNTESAVTA 335 (335) T ss_pred EEECCeEEEEEeeeecCcccccC Confidence 24679999888887655422211 No 150 >protein:vir:107388 Length: 331 # NCBI annotation: Bbp17 # Family: family:all:1903 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958686;genbank:gi:41179378;genbank:GeneID:2717182 Probab=98.52 E-value=5.2e-09 Score=66.00 Aligned_cols=276 Identities=13% Similarity=0.032 Sum_probs=147.6 Q ss_pred hcccccc-----cccccc-chh-hhHHHHHHHHhhhhHHhhcceeecCCCc-eEEEEEcCCCccceeccccccccccccc Q lcl|NC_021309. 151 NPFGSTG-----TFAPGI-LPT-FLPGIVEQLFYELSLADLISSRPVTSPN-LSYLTESAAHNNAAAVAEAGTYPFSSEE 222 (497) Q Consensus 151 ~~~~~~~-----~~g~~v-~p~-~~~~ii~~~~~~~~l~~~~~~~~~~~~~-~~~p~~~~~~~~a~wv~Eg~~~~~s~~~ 222 (497) +.....+ .....+ +.. ....||+.+.+.+.|++.++++....+. ..+.++++ -+.++|..=++..+.++.+ T Consensus 1 m~~~~~~~~TL~e~Ak~~~~~~~l~~~IIE~l~~tn~IL~~lpf~e~N~~t~~~~~vrt~-LP~~~fR~lN~g~~~s~~t 79 (331) T protein:vir:10 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSG-LPTGTWRKLNYGVQPEKSR 79 (331) T ss_pred CCccccCcccHHHHHHhcCcchhHHHHHHHHHhcCchHHhhceeeeccCCccceeeEEec-cCCchhhccCCccCcccce Confidence 1111000 000111 222 3457999999999999999998765443 34566665 4689999999999999999 Q ss_pred ceeeEeeeeeEEeeehhhHHHHhhH---HHHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccc------c Q lcl|NC_021309. 223 FARVYEQVGKVANALTITDEGLRDA---PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTAS------S 293 (497) Q Consensus 223 f~~i~~~~~kla~~~~iS~ell~d~---~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~------~ 293 (497) +.+++..++-+.+.+.|.+.+.+.. .++...-...+.++++..+...|++|+.+..|.++.....-.... . T Consensus 80 t~q~t~~l~ilgg~~eVDk~la~~~Gn~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~~p~~F~GL~kR~~~~~a~~~~q 159 (331) T protein:vir:10 80 TVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAENGQN 159 (331) T ss_pred eEEEEEEEEEeccceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccChhhhccchhhccccccccccc Confidence 9999999999999999999998753 345565667788899999999999999876666664332211100 0 Q ss_pred c-c-chhhhhhhHHHHHHhhhhhcchhhhhhhhhhhh---hhhh---------------------hhhhhhhcc------ Q lcl|NC_021309. 294 A-S-SLFGATSATVSNVKFPADGTNGAFVGQDTVASL---KYGR---------------------VVTGAAGSG------ 341 (497) Q Consensus 294 ~-~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~---------------------~~~~~~~~~------ 341 (497) . . ...+......... .........+++......+ ..+. .......+- T Consensus 160 ~IdaGgtG~~~TSI~~v-~~~~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~G~~y~~y~~~~~w~~Gl~i~d~r~v~ri~ 238 (331) T protein:vir:10 160 IIDAGGTGSDNASIWLT-VWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRIA 238 (331) T ss_pred eeecCCCCCCceEEEEE-EEcCCeeEEecccccccCceEeecCceeeecCCCCeeeEEEEEEEeeeeeEEcCcccEEEEe Confidence 0 0 0000000000000 0000000000000000000 0000 000000000 Q ss_pred cccc----cccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHHHHHHHh-hhcCce-eccCccccccccccccccc Q lcl|NC_021309. 342 SGVA----GSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTK-DANGQY-MGGNFFGNAYGNPVNGGKN 415 (497) Q Consensus 342 ~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lk-d~~G~~-i~~~~~~~~~~~~~~~~~~ 415 (497) .... ....+..+..+.+..+...++ +.+....+|+||..-...|++.. ++..-+ +-...+.+ . ..-. T Consensus 239 NIdvs~l~~~~~~~~dl~~lm~~a~~~ip-~~~~~~~~~y~n~~v~~~L~~q~~~~~~~~~~~~~~~~g---~---~~t~ 311 (331) T protein:vir:10 239 NVDVSELTKNASAGADLIDLMTQAVELIP-NVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAG---K---KVVA 311 (331) T ss_pred ccchhccCCCcchhhhHHHHHHHHHHHhc-ccCCCCeEEEechHHHHHHHHHHhhccceeeeeeeecCC---c---ceeE Confidence 0000 001122334555666665554 33444567999999999999874 332222 22222111 1 1224 Q ss_pred ccccceEecCCCCcCceEEE Q lcl|NC_021309. 416 IWGVPVVTTPLIPLGTILVG 435 (497) Q Consensus 416 l~G~Pvv~~~~~~~~~~~~g 435 (497) ++|+||..++++-.+...+. T Consensus 312 ~~gipir~~dai~~tE~~Vv 331 (331) T protein:vir:10 312 FDGIPCRRTDALLLTEARVV 331 (331) T ss_pred ECCeeEEEeeeeecCccccC Confidence 78999988888765442221 No 151 >protein:vir:98525 Length: 331 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:1903 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996579;genbank:gi:45569510;genbank:GeneID:2767853 Probab=98.52 E-value=5.2e-09 Score=66.00 Aligned_cols=276 Identities=13% Similarity=0.032 Sum_probs=147.6 Q ss_pred hcccccc-----cccccc-chh-hhHHHHHHHHhhhhHHhhcceeecCCCc-eEEEEEcCCCccceeccccccccccccc Q lcl|NC_021309. 151 NPFGSTG-----TFAPGI-LPT-FLPGIVEQLFYELSLADLISSRPVTSPN-LSYLTESAAHNNAAAVAEAGTYPFSSEE 222 (497) Q Consensus 151 ~~~~~~~-----~~g~~v-~p~-~~~~ii~~~~~~~~l~~~~~~~~~~~~~-~~~p~~~~~~~~a~wv~Eg~~~~~s~~~ 222 (497) +.....+ .....+ +.. ....||+.+.+.+.|++.++++....+. ..+.++++ -+.++|..=++..+.++.+ T Consensus 1 m~~~~~~~~TL~e~Ak~~~~~~~l~~~IIE~l~~tn~IL~~lpf~e~N~~t~~~~~vrt~-LP~~~fR~lN~g~~~s~~t 79 (331) T protein:vir:98 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSG-LPTGTWRKLNYGVQPEKSR 79 (331) T ss_pred CCccccCcccHHHHHHhcCcchhHHHHHHHHHhcCchHHhhceeeeccCCccceeeEEec-cCCchhhccCCccCcccce Confidence 1111000 000111 222 3457999999999999999998765443 34566665 4689999999999999999 Q ss_pred ceeeEeeeeeEEeeehhhHHHHhhH---HHHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccc------c Q lcl|NC_021309. 223 FARVYEQVGKVANALTITDEGLRDA---PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTAS------S 293 (497) Q Consensus 223 f~~i~~~~~kla~~~~iS~ell~d~---~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~------~ 293 (497) +.+++..++-+.+.+.|.+.+.+.. .++...-...+.++++..+...|++|+.+..|.++.....-.... . T Consensus 80 t~q~t~~l~ilgg~~eVDk~la~~~Gn~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~~p~~F~GL~kR~~~~~a~~~~q 159 (331) T protein:vir:98 80 TVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAENGQN 159 (331) T ss_pred eEEEEEEEEEeccceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccChhhhccchhhccccccccccc Confidence 9999999999999999999998753 345565667788899999999999999876666664332211100 0 Q ss_pred c-c-chhhhhhhHHHHHHhhhhhcchhhhhhhhhhhh---hhhh---------------------hhhhhhhcc------ Q lcl|NC_021309. 294 A-S-SLFGATSATVSNVKFPADGTNGAFVGQDTVASL---KYGR---------------------VVTGAAGSG------ 341 (497) Q Consensus 294 ~-~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~---------------------~~~~~~~~~------ 341 (497) . . ...+......... .........+++......+ ..+. .......+- T Consensus 160 ~IdaGgtG~~~TSI~~v-~~~~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~G~~y~~y~~~~~w~~Gl~i~d~r~v~ri~ 238 (331) T protein:vir:98 160 IIDAGGTGSDNASIWLT-VWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRIA 238 (331) T ss_pred eeecCCCCCCceEEEEE-EEcCCeeEEecccccccCceEeecCceeeecCCCCeeeEEEEEEEeeeeeEEcCcccEEEEe Confidence 0 0 0000000000000 0000000000000000000 0000 000000000 Q ss_pred cccc----cccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHHHHHHHh-hhcCce-eccCccccccccccccccc Q lcl|NC_021309. 342 SGVA----GSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTK-DANGQY-MGGNFFGNAYGNPVNGGKN 415 (497) Q Consensus 342 ~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lk-d~~G~~-i~~~~~~~~~~~~~~~~~~ 415 (497) .... ....+..+..+.+..+...++ +.+....+|+||..-...|++.. ++..-+ +-...+.+ . ..-. T Consensus 239 NIdvs~l~~~~~~~~dl~~lm~~a~~~ip-~~~~~~~~~y~n~~v~~~L~~q~~~~~~~~~~~~~~~~g---~---~~t~ 311 (331) T protein:vir:98 239 NVDVSELTKNASAGADLIDLMTQAVELIP-NVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAG---K---KVVA 311 (331) T ss_pred ccchhccCCCcchhhhHHHHHHHHHHHhc-ccCCCCeEEEechHHHHHHHHHHhhccceeeeeeeecCC---c---ceeE Confidence 0000 001122334555666665554 33444567999999999999874 332222 22222111 1 1224 Q ss_pred ccccceEecCCCCcCceEEE Q lcl|NC_021309. 416 IWGVPVVTTPLIPLGTILVG 435 (497) Q Consensus 416 l~G~Pvv~~~~~~~~~~~~g 435 (497) ++|+||..++++-.+...+. T Consensus 312 ~~gipir~~dai~~tE~~Vv 331 (331) T protein:vir:98 312 FDGIPCRRTDALLLTEARVV 331 (331) T ss_pred ECCeeEEEeeeeecCccccC Confidence 78999988888765442221 No 152 >protein:vir:107826 Length: 331 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:1903 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996627;genbank:gi:45580761;genbank:GeneID:2767902 Probab=98.52 E-value=5.2e-09 Score=66.00 Aligned_cols=276 Identities=13% Similarity=0.032 Sum_probs=147.6 Q ss_pred hcccccc-----cccccc-chh-hhHHHHHHHHhhhhHHhhcceeecCCCc-eEEEEEcCCCccceeccccccccccccc Q lcl|NC_021309. 151 NPFGSTG-----TFAPGI-LPT-FLPGIVEQLFYELSLADLISSRPVTSPN-LSYLTESAAHNNAAAVAEAGTYPFSSEE 222 (497) Q Consensus 151 ~~~~~~~-----~~g~~v-~p~-~~~~ii~~~~~~~~l~~~~~~~~~~~~~-~~~p~~~~~~~~a~wv~Eg~~~~~s~~~ 222 (497) +.....+ .....+ +.. ....||+.+.+.+.|++.++++....+. ..+.++++ -+.++|..=++..+.++.+ T Consensus 1 m~~~~~~~~TL~e~Ak~~~~~~~l~~~IIE~l~~tn~IL~~lpf~e~N~~t~~~~~vrt~-LP~~~fR~lN~g~~~s~~t 79 (331) T protein:vir:10 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSG-LPTGTWRKLNYGVQPEKSR 79 (331) T ss_pred CCccccCcccHHHHHHhcCcchhHHHHHHHHHhcCchHHhhceeeeccCCccceeeEEec-cCCchhhccCCccCcccce Confidence 1111000 000111 222 3457999999999999999998765443 34566665 4689999999999999999 Q ss_pred ceeeEeeeeeEEeeehhhHHHHhhH---HHHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccc------c Q lcl|NC_021309. 223 FARVYEQVGKVANALTITDEGLRDA---PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTAS------S 293 (497) Q Consensus 223 f~~i~~~~~kla~~~~iS~ell~d~---~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~------~ 293 (497) +.+++..++-+.+.+.|.+.+.+.. .++...-...+.++++..+...|++|+.+..|.++.....-.... . T Consensus 80 t~q~t~~l~ilgg~~eVDk~la~~~Gn~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~~p~~F~GL~kR~~~~~a~~~~q 159 (331) T protein:vir:10 80 TVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAENGQN 159 (331) T ss_pred eEEEEEEEEEeccceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccChhhhccchhhccccccccccc Confidence 9999999999999999999998753 345565667788899999999999999876666664332211100 0 Q ss_pred c-c-chhhhhhhHHHHHHhhhhhcchhhhhhhhhhhh---hhhh---------------------hhhhhhhcc------ Q lcl|NC_021309. 294 A-S-SLFGATSATVSNVKFPADGTNGAFVGQDTVASL---KYGR---------------------VVTGAAGSG------ 341 (497) Q Consensus 294 ~-~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~---------------------~~~~~~~~~------ 341 (497) . . ...+......... .........+++......+ ..+. .......+- T Consensus 160 ~IdaGgtG~~~TSI~~v-~~~~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~G~~y~~y~~~~~w~~Gl~i~d~r~v~ri~ 238 (331) T protein:vir:10 160 IIDAGGTGSDNASIWLT-VWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRIA 238 (331) T ss_pred eeecCCCCCCceEEEEE-EEcCCeeEEecccccccCceEeecCceeeecCCCCeeeEEEEEEEeeeeeEEcCcccEEEEe Confidence 0 0 0000000000000 0000000000000000000 0000 000000000 Q ss_pred cccc----cccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHHHHHHHh-hhcCce-eccCccccccccccccccc Q lcl|NC_021309. 342 SGVA----GSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTK-DANGQY-MGGNFFGNAYGNPVNGGKN 415 (497) Q Consensus 342 ~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lk-d~~G~~-i~~~~~~~~~~~~~~~~~~ 415 (497) .... ....+..+..+.+..+...++ +.+....+|+||..-...|++.. ++..-+ +-...+.+ . ..-. T Consensus 239 NIdvs~l~~~~~~~~dl~~lm~~a~~~ip-~~~~~~~~~y~n~~v~~~L~~q~~~~~~~~~~~~~~~~g---~---~~t~ 311 (331) T protein:vir:10 239 NVDVSELTKNASAGADLIDLMTQAVELIP-NVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAG---K---KVVA 311 (331) T ss_pred ccchhccCCCcchhhhHHHHHHHHHHHhc-ccCCCCeEEEechHHHHHHHHHHhhccceeeeeeeecCC---c---ceeE Confidence 0000 001122334555666665554 33444567999999999999874 332222 22222111 1 1224 Q ss_pred ccccceEecCCCCcCceEEE Q lcl|NC_021309. 416 IWGVPVVTTPLIPLGTILVG 435 (497) Q Consensus 416 l~G~Pvv~~~~~~~~~~~~g 435 (497) ++|+||..++++-.+...+. T Consensus 312 ~~gipir~~dai~~tE~~Vv 331 (331) T protein:vir:10 312 FDGIPCRRTDALLLTEARVV 331 (331) T ss_pred ECCeeEEEeeeeecCccccC Confidence 78999988888765442221 No 153 >protein:vir:79642 Length: 329 # NCBI annotation: HsbB # Family: family:all:463 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285525;genbank:gi:148734508;genbank:GeneID:5220000 Probab=98.51 E-value=7e-08 Score=59.80 Aligned_cols=311 Identities=13% Similarity=0.050 Sum_probs=168.7 Q ss_pred hhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhcccc--ccccccccchh---hhHHHHHHHHhhhhHHhhcce Q lcl|NC_021309. 113 KFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGS--TGTFAPGILPT---FLPGIVEQLFYELSLADLISS 187 (497) Q Consensus 113 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~g~~v~p~---~~~~ii~~~~~~~~l~~~~~~ 187 (497) .+.... .++.+....+...... ...+.... .+..|.++..+ +.+.+++...+....+.++++ T Consensus 1 ~~~~~~-----~~~~~~d~~~~~~~a~--------~~~~~~~~~~~~~~~~f~~~ql~~id~~v~e~~~~~l~~~~~i~i 67 (329) T protein:vir:79 1 MRGNIM-----SKEMKYDEFEANVIAN--------HMQLRGAKNDASDMGIWTSQELHKIKAQAYEKEYPAGSALRVFPV 67 (329) T ss_pred Cccchh-----hhhhccchhhhhhHhh--------hcccccceeccchhhHHHHHHHHHHHHHHHhhhhcccchhhhccc Confidence 000000 0000000000000000 00111111 12223444433 345688888888887777765 Q ss_pred eecC---CCceEEEEEcCCCccceecccc-cccccccccceeeEeeeeeEEeeehhhHHHHhhH----HHHHHHHHHHHH Q lcl|NC_021309. 188 RPVT---SPNLSYLTESAAHNNAAAVAEA-GTYPFSSEEFARVYEQVGKVANALTITDEGLRDA----PELFNFVQGRLL 259 (497) Q Consensus 188 ~~~~---~~~~~~p~~~~~~~~a~wv~Eg-~~~~~s~~~f~~i~~~~~kla~~~~iS~ell~d~----~~l~~~i~~~la 259 (497) .+.. ..++.|..... .+.+.|++.+ .+.|..+..++.-....+.++.-+.++.+=|+.+ .+|..--....+ T Consensus 68 ~~~~~~~~~~~t~~~~~~-~G~a~~~~d~~~dip~vd~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~ 146 (329) T protein:vir:79 68 TSELSDTDKTFEYQTFDK-VGHAKIIADYTDDLSTVDALMTSEFGKVFRLGNAFLISIDEIKAGQRTGKSLSTRKANAAQ 146 (329) T ss_pred ccCCCCceeEEEeeeeec-ceeeeeecCcccccceeecccceeEEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHH Confidence 5432 23566776655 4678888764 5678888888888888888888888876544432 357888888889 Q ss_pred HHHHHHHHhhhhcccCccccccccccccccccccccchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhh Q lcl|NC_021309. 260 EGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAG 339 (497) Q Consensus 260 ~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 339 (497) +++..++|+-+++|++..+..|++|.++..+....++ T Consensus 147 ~~~~~~~n~i~f~G~~~~g~~GLlN~p~v~~~~~~~~------------------------------------------- 183 (329) T protein:vir:79 147 NAHDQLVNHLVFKGSKPHKIISVFEHPNLTTINSAGW------------------------------------------- 183 (329) T ss_pred HHHHHhhccEEEeecccccceeeecCCCccccccCCC------------------------------------------- Confidence 9999999999999998778899999887643222111 Q ss_pred cccccccccchhhhhhhHHHHHHHhhhhh--hccCCceEEechhHHHHHHHHhhhcCceeccCccccccccccccccccc Q lcl|NC_021309. 340 SGSGVAGSYPTAAEIAENVFDAFVDIQLT--LFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIW 417 (497) Q Consensus 340 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~ 417 (497) +.......+...+.+++..++..+... +...+..++++|..+..|.......|.-++.-... ....-+|. T Consensus 184 --~~~~w~~kt~~ei~~di~~~~~~l~~~s~g~~~p~~L~Lpp~~~~~L~~~~~~~~~tvl~~lk~------~~~~l~I~ 255 (329) T protein:vir:79 184 --NNAAGTGKKPETAQDELEQAIEKIETLTNGQHRANMILIPPSMRKVLMVRMPETTMSYLDYFKQ------QNGGITIE 255 (329) T ss_pred --CCccccccCHHHHHHHHHHHHHHHHHhcCceecccEEEecHHHHHHhhcccCCCCccHHHHHHH------hCCCcEEE Confidence 111122334555667777777666544 34556778999988888865444555433321110 11112344 Q ss_pred ccceEecCCCC-cCceEEEeeccceEEEEeecccEEEeecccchhhhcC-ceEEEEEEeecc-eeecccceEEEEeeCCC Q lcl|NC_021309. 418 GVPVVTTPLIP-LGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDG-KVTVRAEERLGL-LVYRPSAFQLIQLKKGA 494 (497) Q Consensus 418 G~Pvv~~~~~~-~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~-~v~~r~~~r~~~-~v~~~~a~~~l~~~~~a 494 (497) +.|-..+.... .+-+++.+.++-.+.+..-+.++.. .... ++ ...+....|.++ .+++|.||++++.-.+- T Consensus 256 ~~~el~~ag~~g~~~~v~y~~~~~~~~~~vp~~~~~l--~~q~----~~~~~~v~~~~r~~Gv~i~~P~ai~~~dGI~~~ 329 (329) T protein:vir:79 256 SISELEDIDGAGTKAALVYEKDPMNMSIEIPEAFNML--TAQP----KDLHFKVPCTSKCTGLTIYRPLTLVLIKGLVVG 329 (329) T ss_pred EcccccccCCCCceEEEEEecCCceEEEecCcceeee--ecee----cCceEEEceeeeEEEEEEECcceeeeeeeeeeC Confidence 44443322211 1223444444433433222222222 1110 12 245556778765 45579999999877665 No 154 >protein:vir:104342 Length: 314 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398971;genbank:gi:81343955;genbank:GeneID:3778874 Probab=98.50 E-value=5.5e-08 Score=60.38 Aligned_cols=298 Identities=12% Similarity=0.104 Sum_probs=160.4 Q ss_pred hhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccccccccchh---hhHHHHHHHHhhhhHHhhcceeecCC Q lcl|NC_021309. 116 VSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPT---FLPGIVEQLFYELSLADLISSRPVTS 192 (497) Q Consensus 116 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~p~---~~~~ii~~~~~~~~l~~~~~~~~~~~ 192 (497) ...+ .. .+... ... ....+..-....+|.++..+ +.+.+++...+...-+.++++.+..+ T Consensus 1 ~~~~--~~-~~~~~--------~~~------~~~~~~~~~~d~~~~fl~~ql~~id~~v~e~~~~~~~~~~~i~v~~~~~ 63 (314) T protein:vir:10 1 MAIK--FD-AEQAK--------ITT------HLEQMGVEKADAAGIWAVSQLTAALNRAYEKEYAENSVVNIFPVTNEIP 63 (314) T ss_pred Cccc--hH-HHHHH--------HHH------HHHhhcccchhhhHHHHHHHHHHHHHHHhhhhccccccceeeccccCCC Confidence 0000 00 00000 000 00011111122334556553 33457777777777666666544322 Q ss_pred ---CceEEEEEcCCCccceecccc-cccccccccceeeEeeeeeEEeeehhhHHHHhhH----HHHHHHHHHHHHHHHHH Q lcl|NC_021309. 193 ---PNLSYLTESAAHNNAAAVAEA-GTYPFSSEEFARVYEQVGKVANALTITDEGLRDA----PELFNFVQGRLLEGIQR 264 (497) Q Consensus 193 ---~~~~~p~~~~~~~~a~wv~Eg-~~~~~s~~~f~~i~~~~~kla~~~~iS~ell~d~----~~l~~~i~~~la~~~~~ 264 (497) .++.|..... .+.+.|++.. .+.|..+..+++.....+.++..+.++.+=|+.+ .+|..--....++++.. T Consensus 64 ~~~et~~~~~~e~-~G~a~~~~d~~~dip~vd~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~ 142 (314) T protein:vir:10 64 GHAKYFEYPEFDG-VGIAQIIADYSDDLPLVDAFMTEKQGKVFRFGNAFLISTDEIKAGAATGQSLSARKQALAFEAHDN 142 (314) T ss_pred CceeEEEeeeecc-ccceeeeCCcccccceeecccceeEEEEEEEEeeEEecHHHHHHHHHhCCChHHHHHHHHHHHHHH Confidence 3566766654 4678898775 4488889899999999999999888875544432 35788888888999999 Q ss_pred HHHhhhhcccCccccccccccccccccccccchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhccccc Q lcl|NC_021309. 265 KEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGV 344 (497) Q Consensus 265 ~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 344 (497) .+|+-+++|+...+..|++|.++.........| T Consensus 143 ~~n~i~f~G~~~~g~~GLlN~p~v~~~~~~~~W----------------------------------------------- 175 (314) T protein:vir:10 143 LLDKLVWSGSAPHGIVSVFDQPNINNVVATPNW----------------------------------------------- 175 (314) T ss_pred hhceEEEeecccccceeEeecCCCccccCCCCc----------------------------------------------- Confidence 999999999887788999998875322221111 Q ss_pred ccccchhhhhhhHHHHHHHhhhhh--hccCCceEEechhHHHHHHHHhhhcCceeccCcccccccccccccccccccceE Q lcl|NC_021309. 345 AGSYPTAAEIAENVFDAFVDIQLT--LFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVV 422 (497) Q Consensus 345 ~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv 422 (497) .+...+++++..++..+... +...++.+++.|..+..|...-+..|.-++.-.. .....-+|.+.|.. T Consensus 176 ----aT~~ei~~Di~~~~~~l~~~s~g~~~p~~l~Lpp~~~~~L~~~~~~~~~tvl~~l~------~n~~~l~I~~~~el 245 (314) T protein:vir:10 176 ----SVPQNAIDDVTAMIDAVESSTQGLHHVTDILLPASARRVMQGLVPQTNLSYGELFT------RNNPGLTIRFLQFL 245 (314) T ss_pred ----ccHHHHHHHHHHHHHHHHHhcCccccceeEEecHHHHHhhcccccCCCccHHHHHH------HhCCCcEEEEcccc Confidence 12234455666666555533 4455667888888777664433333332221100 01112245555554 Q ss_pred ecCCCCcCc-eEEEeeccceEEEEeecccEEEeecccchhhhcC-ceEEEEEEeec-ceeecccceEEEEeeCCC Q lcl|NC_021309. 423 TTPLIPLGT-ILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDG-KVTVRAEERLG-LLVYRPSAFQLIQLKKGA 494 (497) Q Consensus 423 ~~~~~~~~~-~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~-~v~~r~~~r~~-~~v~~~~a~~~l~~~~~a 494 (497) .+......+ .++-+-++-.+.+..-+.++ ...... ++ .+.+....|.+ ..+++|.||++++..+=| T Consensus 246 ~~ag~~g~~~~v~y~~~~~~~~~~vp~~~~--~l~~e~----~~~~~~~~~~~r~~Gv~i~~P~ai~~~dGI~~~ 314 (314) T protein:vir:10 246 DNYDGAGGKAALAFEKSPLNMSIEIPEVTN--VLPAQP----KDLHFRYPVTSKATGLIVYRPLTMAVIKGITFA 314 (314) T ss_pred cccCCCcceEEEEEecCCcEEEEecCccce--eeccee----cCceEEEcceeeeEEEEEECcceeEeeeeeecC Confidence 433322222 12222222222221112222 111100 11 24445567775 556679999999888777 No 155 >protein:vir:5974 Length: 324 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690674;genbank:geneid:6329212;genbank:gi:22855068;goa:Q38582;uniprot:Q38582;genbank:GeneID:955303 Probab=98.47 E-value=1.3e-07 Score=58.30 Aligned_cols=272 Identities=10% Similarity=0.041 Sum_probs=143.2 Q ss_pred hccccccccccccchhhhHHHH-HHHHhhhhHHh---------hccee--ecCCCceEEEEEcCCCccceeccccccccc Q lcl|NC_021309. 151 NPFGSTGTFAPGILPTFLPGIV-EQLFYELSLAD---------LISSR--PVTSPNLSYLTESAAHNNAAAVAEAGTYPF 218 (497) Q Consensus 151 ~~~~~~~~~g~~v~p~~~~~ii-~~~~~~~~l~~---------~~~~~--~~~~~~~~~p~~~~~~~~a~wv~Eg~~~~~ 218 (497) |+ ++.-..+|.||.....+ +...+.+.+.+ +.... ..+|+.+++|....-++.+.-+.|+.+++. T Consensus 1 MA---~T~lsd~i~peVf~~yv~~~~~~~~~l~qSg~i~~~a~i~~~l~~~~~G~~i~~P~~~~l~Gd~~~v~~~~~i~~ 77 (324) T protein:vir:59 1 MA---YTKISDVIVPELFNPYVINTTTQLSAFFQSGIAATDDELNALAKKAGGGSTLNMPYWNDLDGDSQVLNDTDDLVP 77 (324) T ss_pred CC---ceeeeceechhHHHHHHHhhhHHHHHHhhcccccccHHHHHHhhccCCCCEEEecccccCCCcccccCCCcccch Confidence 33 12334578888666644 33333333322 11222 234667889988765567778899999998 Q ss_pred ccccceeeEeeeeeEEeeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccch Q lcl|NC_021309. 219 SSEEFARVYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSL 297 (497) Q Consensus 219 s~~~f~~i~~~~~kla~~~~iS~ell~d~-~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~ 297 (497) .+.+.++-....+..+.-..++++...-+ .+....+.++++..+.+..+..+|.- ..|++......+..... T Consensus 78 ~~l~t~~~~a~i~~~~k~~~~tD~a~~~sg~dp~~~i~~q~a~~~~~~~~~~lia~-----l~g~~~~~~~~~~~~dv-- 150 (324) T protein:vir:59 78 QKINAGQDKAVLILRGNAWSSHDLAATLSGSDPMQAIGSRVAAYWAREMQKIVFAE-----LAGVFSNDDMKDNKLDI-- 150 (324) T ss_pred hhcccceeeEEEEeecCceeehhhhhhhccchHHHHHHHHHHHHHHHHHHHHHHHH-----HHHhhhccccccceeee-- Confidence 88888887777777666666776653322 35667789999999999888887632 12222221110000000 Q ss_pred hhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEE Q lcl|NC_021309. 298 FGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVV 377 (497) Q Consensus 298 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 377 (497) ...... ....+.+.++....-.. ...-.+|+ T Consensus 151 ---------------sa~~~~---------------------------------~~s~~~l~~A~~~~GD~-~~~~~~iv 181 (324) T protein:vir:59 151 ---------------SGTADG---------------------------------IYSAETFVDASYKLGDH-ESLLTAIG 181 (324) T ss_pred ---------------eccccc---------------------------------eecHHHHHHHHHHhCCc-ccCcEEEE Confidence 000000 00001122222221111 12345799 Q ss_pred echhHHHHHHHHhhhcCceeccCcccccccccccccccccccceEecCCCCcC-------ceEEEeeccceEEEEe-ecc Q lcl|NC_021309. 378 MNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLG-------TILVGHFAPSVIQTAR-REG 449 (497) Q Consensus 378 ~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~-------~~~~gd~~~~~~~i~~-r~~ 449 (497) ||+.++..|++..-.+ |+.... + ...-++++|+|||+++.||.. .+..--|...++...+ +.. T Consensus 182 mhS~v~~~L~~~~li~--~~~~s~--~-----~~~i~~~~G~~VivdD~~p~~~~~~~~~~y~s~l~~~GAi~~~~~~~~ 252 (324) T protein:vir:59 182 MHSATMASAVKQDLIE--FVKDSQ--S-----GIRFPTYMNKRVIVDDSMPVETLEDGTKVFTSYLFGAGALGYAEGQPE 252 (324) T ss_pred EchHHHHHHHHhhhhh--hccccc--c-----CceeeeecccEEEEeCCCCccccCCCCceEEEEEEecCeEEEeecCCC Confidence 9999999999764221 222111 0 112357899999999999853 1111123344455544 334 Q ss_pred cEEEeecccchhhhcCceEEEEEEeecceeecccceEEEEeeC-CCCCC Q lcl|NC_021309. 450 VTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKK-GATGS 497 (497) Q Consensus 450 ~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~-~a~~~ 497 (497) +.++.++.. .++...+....++. +||..+.+-.-.. ....| T Consensus 253 v~vE~dRd~----~~g~~~l~~r~~~~---~~p~G~s~~~~~~~~~sPt 294 (324) T protein:vir:59 253 VPTETARNA----LGSQDILINRKHFV---LHPRGVKFTENAMAGTTPT 294 (324) T ss_pred cceecccCc----cccceEEEEeeEEE---eEeeeEEecccccCCCCCC Confidence 555554432 35667777777754 5666654432111 11111 No 156 >protein:vir:97031 Length: 402 # NCBI annotation: 31 # Family: family:all:2806 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654132;genbank:gi:108862016;genbank:GeneID:5075980 Probab=98.46 E-value=9.7e-09 Score=64.51 Aligned_cols=296 Identities=11% Similarity=0.059 Sum_probs=147.0 Q ss_pred hhhhhhhhhhccccccccccccchhhhHHHHHHHHhhhhHHhhcceeecCC-CceEEEEEcCCCccceeccccccccccc Q lcl|NC_021309. 142 ETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTS-PNLSYLTESAAHNNAAAVAEAGTYPFSS 220 (497) Q Consensus 142 ~~~~~~~~~~~~~~~~~~g~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~-~~~~~p~~~~~~~~a~wv~Eg~~~~~s~ 220 (497) +...........+.++..-.+.+.+|..++.+.....+.++++.++.++.+ +++++|+.- ..++.+..-|+..--+. T Consensus 1 Ms~~n~~t~~~~~~s~~~~al~le~f~geV~taF~~~si~~~~~~vrti~~GkS~qf~~iG--~~~a~y~~~G~~ldg~~ 78 (402) T protein:vir:97 1 MSTPNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYLG--ETELQVLAPGQSPNATP 78 (402) T ss_pred CCCcccccccccccccchhhhhhhhhhhhHHHHHHHHHhhcCcceeeeecccceEEEEEEe--eeEEeeeccccccCCCC Confidence 111111111111222222234456788888899989999999999988865 578999862 34555655555544455 Q ss_pred ccceeeEeeeeeEEeeehhhHHHH------hhHHH-HHHHHHHHHHHHHHHHHHhhhhc---ccC------cc-cccccc Q lcl|NC_021309. 221 EEFARVYEQVGKVANALTITDEGL------RDAPE-LFNFVQGRLLEGIQRKEEVQLLA---GGG------YP-GVNGLL 283 (497) Q Consensus 221 ~~f~~i~~~~~kla~~~~iS~ell------~d~~~-l~~~i~~~la~~~~~~~d~~~l~---G~G------~~-~p~Gi~ 283 (497) +..++..+..-.+- +++.++ ++..+ +.+.+..++.+++++..|+.++. -.+ .+ .|.+.- T Consensus 79 ~~~~k~~ItID~lL----~a~~~V~diDeaq~~yD~vRse~s~e~G~ALA~~~Dq~ii~~i~~aa~a~t~~~~~~~~~~~ 154 (402) T protein:vir:97 79 TQADKNQLVIDTTV----IARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKG 154 (402) T ss_pred cccccEEEEeCcee----echhhhhhHHHHHhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccCcccc Confidence 66677666555433 233332 23345 67888889999999999997752 000 00 011110 Q ss_pred ccccccccccccchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHH Q lcl|NC_021309. 284 QRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFV 363 (497) Q Consensus 284 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 363 (497) ......+ .+.......+...+.+.++++.. T Consensus 155 ~g~s~~~--------------------------------------------------~~t~~~a~~~~~~l~~ai~~a~~ 184 (402) T protein:vir:97 155 HGFSINV--------------------------------------------------NVTESEALANPQYVMAAVEYALE 184 (402) T ss_pred ccccccc--------------------------------------------------ccccchhhcCHHHHHHHHHHHHH Confidence 0000000 00000001112222333444443 Q ss_pred hhhhhhc-cCCceEEechhHHHHHHHHhhhcCceeccCcc-cccccccccccccccccceEecCCCCcCc---------- Q lcl|NC_021309. 364 DIQLTLF-QTPNAVVMNPRDWELLRLTKDANGQYMGGNFF-GNAYGNPVNGGKNIWGVPVVTTPLIPLGT---------- 431 (497) Q Consensus 364 ~~~~~~~-~~~~~~~~n~~~~~~l~~lkd~~G~~i~~~~~-~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~---------- 431 (497) .+....- ...-+.+++|..|..|.+-.+ .+-.... .+...........++|+||+.++.+|... T Consensus 185 ~LdEkdVP~~dRv~vv~P~~y~~Ll~~~r----l~n~d~~~~~~g~~~~G~v~~v~Gv~Vv~SnnlP~~a~~it~~~ls~ 260 (402) T protein:vir:97 185 QQLEQEVDISDVAIMMPWKFFNALRDADR----IVDKTYTISQSGATINGFVLSSYNCPVIPSNRFPTFAQDQAHHLLSN 260 (402) T ss_pred HHHhcCCCccccEEEeChHHHHHHhhccc----ccchhhccccCCccccceeEEEeceEEEecCcccccccccccccccc Confidence 3332111 223468999999988876322 1111000 00111222234569999999999998521 Q ss_pred -----e--EEEeeccceEEEEeeccc-EEEeecccchhhhcC---ceEEEEEEeecceeecccceEEEEeeCCCCCC Q lcl|NC_021309. 432 -----I--LVGHFAPSVIQTARREGV-TMQMTNSNGTDFVDG---KVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 432 -----~--~~gd~~~~~~~i~~r~~~-~i~~~~~~~~~f~~~---~v~~r~~~r~~~~v~~~~a~~~l~~~~~a~~~ 497 (497) . +-|||+.....++-+.-+ +++.-+-..+.|... ...+-+.+=+|-.++||++...+.++.-++.- T Consensus 261 a~~G~~y~~t~d~t~~~~~~f~~~Av~tvk~~~vT~~~~~d~r~~~~~id~~~a~G~g~~RPeaa~vv~~~~~~t~~ 337 (402) T protein:vir:97 261 EDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEKTYYIDTFMAEGAIPDRWEAVSVVTTKRDATTG 337 (402) T ss_pred CCCCccCCcCcccceeEEEEEecceEEEEEeeccccchhhchhHHHHHHHHHHHhCCcccCccceEEEEEecccccc Confidence 1 226666444444433222 111111111111100 12233445578889999999999888744332 No 157 >protein:vir:99675 Length: 324 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249589;genbank:gi:68299740;genbank:GeneID:3799990 Probab=98.42 E-value=9.7e-09 Score=64.52 Aligned_cols=253 Identities=11% Similarity=0.063 Sum_probs=126.8 Q ss_pred ceeecC-CCceEEEEEcCCCccceeccccccccc--cccccee--eEeeeeeEEeeehhhHHHHhhHHHHHHHHHHHHHH Q lcl|NC_021309. 186 SSRPVT-SPNLSYLTESAAHNNAAAVAEAGTYPF--SSEEFAR--VYEQVGKVANALTITDEGLRDAPELFNFVQGRLLE 260 (497) Q Consensus 186 ~~~~~~-~~~~~~p~~~~~~~~a~wv~Eg~~~~~--s~~~f~~--i~~~~~kla~~~~iS~ell~d~~~l~~~i~~~la~ 260 (497) -++++. ++++++|+.- ..++....-|+.... .++.-++ |++...++..+..-.-+=.+...++.+...++..+ T Consensus 1 ~vr~i~~g~s~~~~~iG--~~~~~~~~~G~~l~~~~~~~~~~e~~itID~~l~~~~~VdDiD~~qa~~Dlr~e~s~~~G~ 78 (324) T protein:vir:99 1 MTRTITSGKSAQFPVMG--RTKARYLKQGQSLDDGREDIKHTEKVITIDGLLTTDVLIYDIEDAMNHYDVRSEYSTQMGE 78 (324) T ss_pred CeeeeecCceEEEeeee--eeEeccccCCCCcCCCcCCcCcccEEEEecchhhhhhhhhhHHHHhcCccchhHHHHHHHH Confidence 233443 5688999862 345666665555432 3344444 34444333332221111222334789999999999 Q ss_pred HHHHHHHhhhhcc----c--Cccc-cccccccccccccccccchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhh Q lcl|NC_021309. 261 GIQRKEEVQLLAG----G--GYPG-VNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRV 333 (497) Q Consensus 261 ~~~~~~d~~~l~G----~--G~~~-p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 333 (497) ++++..|+.++.- . .+.. ..++....+..... T Consensus 79 aLA~~~Dq~i~~~~a~~~~~~a~~~~~~~~~~g~~~~~~----------------------------------------- 117 (324) T protein:vir:99 79 ALAMAADVANYAEMAKLVNSRKETTNENIEGLGAASLVK----------------------------------------- 117 (324) T ss_pred HHHHHHHHHHHHHHHHhhhcccccccCCcccCCccceec----------------------------------------- Confidence 9999999887521 0 0000 00000000000000 Q ss_pred hhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhcc-CCceEEechhHHHHHHHHhh-hcCceeccCccccccccccc Q lcl|NC_021309. 334 VTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQ-TPNAVVMNPRDWELLRLTKD-ANGQYMGGNFFGNAYGNPVN 411 (497) Q Consensus 334 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~n~~~~~~l~~lkd-~~G~~i~~~~~~~~~~~~~~ 411 (497) ..+...........+++.++++...+....-- ...+.+++|..+..|..-+. ..+.|.... ....+ T Consensus 118 ------~~~~~~~~~~~~~~~~dai~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~~~~~~~------~~~~G 185 (324) T protein:vir:99 118 ------ITGKKEDPAKYGTQVIQALTYARAAFAKKYIPAGDRTFYTDPDTYSAILAALMPNAANYAALI------DPETG 185 (324) T ss_pred ------ccccccccccCHHHHHHHHHHHHHHHhhcCCCCCCCEEEeChHHHHHHhhccccccccccccc------ceecc Confidence 00000000111223345555555555443332 23456889998876643321 122232211 12223 Q ss_pred ccccccccceEecCCCCcCce-------------------------EEEeeccce--------EEEEeecccEEEeeccc Q lcl|NC_021309. 412 GGKNIWGVPVVTTPLIPLGTI-------------------------LVGHFAPSV--------IQTARREGVTMQMTNSN 458 (497) Q Consensus 412 ~~~~l~G~Pvv~~~~~~~~~~-------------------------~~gd~~~~~--------~~i~~r~~~~i~~~~~~ 458 (497) ...+++|++|+.++.+|...+ |-+|++... +..+....++++..... T Consensus 186 ~V~~i~Gf~V~~Sn~lp~~~~t~~~~a~~~~~~~~~~~~~~~~~~ky~~d~~~~~gl~~~~~a~~tv~~~~~~~e~~~~~ 265 (324) T protein:vir:99 186 NIRNVMGFEVVETPHMTAQMVTNPTDAFDGTGHIFPATGDSTTTGKMTVGADNVVGLFVHRSAVATLKLKDMALERARRP 265 (324) T ss_pred eEEEEeceEEEecCCccccccccccccccccccccccccccccccccccccCceeEEEEehhheEEEeeecceecceech Confidence 345789999999999995321 223333221 22222233344444322 Q ss_pred chhhhcCceEEEEEEeecceeecccceEEEEeeCCCC-CC Q lcl|NC_021309. 459 GTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGAT-GS 497 (497) Q Consensus 459 ~~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~~a~-~~ 497 (497) ..| ...+++..-+|-.++||++.+.+++.+.++ |. T Consensus 266 -~~~---~d~i~~~~a~G~~~lRPe~a~~v~l~~~~~~~~ 301 (324) T protein:vir:99 266 -EYQ---ADQIIAKYAMGHGGLRPEAVGAIIFEDGETPAV 301 (324) T ss_pred -hhH---HHhhhhhhhhcCcccccceEEEEEEccCccccc Confidence 222 356677788899999999999999887765 22 No 158 >protein:vir:7019 Length: 401 # NCBI annotation: major capsid protein # Family: family:all:2806 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853592;genbank:gi:31711674;genbank:GeneID:1481800 Probab=98.39 E-value=8.6e-09 Score=64.79 Aligned_cols=306 Identities=11% Similarity=0.090 Sum_probs=150.6 Q ss_pred hhhhhhhhhhccccccccccccchhhhHHHHHHHHhhhhHHhhcceeecCC-CceEEEEEcCCCccceeccccccccccc Q lcl|NC_021309. 142 ETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTS-PNLSYLTESAAHNNAAAVAEAGTYPFSS 220 (497) Q Consensus 142 ~~~~~~~~~~~~~~~~~~g~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~-~~~~~p~~~~~~~~a~wv~Eg~~~~~s~ 220 (497) +...........+.++..=.+.+..|..++.......+.++++..++++.+ +++++|+. + ..++....-|+..--+. T Consensus 1 Ms~~n~~t~~~~~~sg~~~al~Le~f~GeV~taF~~~si~~~~~~vRti~~gkS~qf~~~-G-~s~~~~~~pG~~ld~~~ 78 (401) T protein:vir:70 1 MSTPNNLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENIMSYFDVQTVTGTNTVSNKYL-G-ETELQVLAPGQSPAATS 78 (401) T ss_pred CCCCccccccccccccchhHhHHhHhcchHHHHHHHHhhhcccceeeeecccceEEEEEe-e-eeEeeeecCCCCcCCCC Confidence 111111112222222333345567788888899999999999999999875 57889986 2 34666666666654456 Q ss_pred ccceeeEeeeeeEEeeehhhHHH---Hhh---HHH-HHHHHHHHHHHHHHHHHHhhhhcccCcccccccccccccccccc Q lcl|NC_021309. 221 EEFARVYEQVGKVANALTITDEG---LRD---APE-LFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASS 293 (497) Q Consensus 221 ~~f~~i~~~~~kla~~~~iS~el---l~d---~~~-l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~ 293 (497) +..++..+..-++- +++-+ |++ ..+ +.+.+..++.+++++..|+.++.-- ...|+-+.......+. T Consensus 79 ~~~dK~~ItID~lL----~a~~~V~dlDe~q~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i---~~aa~ana~~~~~~p~ 151 (401) T protein:vir:70 79 TQADKNQLVIDATV----IARNTVAHLHDVQGDIDSLKPKLATNQAKQLKRMEDEMLIQQM---MLGGIANTQAKRTNPR 151 (401) T ss_pred cccccEEEEeCcee----ehhhhhhhHHHHHhcccccchHHHHHHHHHHHHHHHHHHHHHH---HHhccccccccccCCC Confidence 67777666555433 22222 222 235 6788889999999999998764210 0001100000000000 Q ss_pred ccchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCC Q lcl|NC_021309. 294 ASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTP 373 (497) Q Consensus 294 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 373 (497) .... .......+.......+...+.+.++++...+....-... T Consensus 152 ~~~~-------------------------------------G~~i~v~~~~~~~~~~~~~l~~ai~dA~~~LdEkdVP~~ 194 (401) T protein:vir:70 152 VKGH-------------------------------------GFSINVEVAEGEALVNPQYVMAAVEFALEQQLEQEVDIS 194 (401) T ss_pred cCCC-------------------------------------ceEEeccccccccccCHHHHHHHHHHHHHHHHhcCCCcc Confidence 0000 000000000000111222344455555555554443334 Q ss_pred ceEEechhHHHHHHHHhhhcCceeccCccc-cccc-ccccccccccccceEecCCCCcCc---------------e--EE Q lcl|NC_021309. 374 NAVVMNPRDWELLRLTKDANGQYMGGNFFG-NAYG-NPVNGGKNIWGVPVVTTPLIPLGT---------------I--LV 434 (497) Q Consensus 374 ~~~~~n~~~~~~l~~lkd~~G~~i~~~~~~-~~~~-~~~~~~~~l~G~Pvv~~~~~~~~~---------------~--~~ 434 (497) ...+++|..|..+.+-.| +|+.-.+. .+.+ ........++|+||+.++.+|.+. . +- T Consensus 195 r~vvl~pp~~Ys~Ll~~d----~L~nrd~~~s~~g~~~~G~v~~vaGv~Vv~SnnlP~~a~~it~~~ls~a~~G~~y~~~ 270 (401) T protein:vir:70 195 DVAILMPWRYFNVLRDAD----RIVDKTYTISQSGATIQGFTLSSYNCPVIPSNRFPKYSQGQTHHLLSNEDNGYRYDPL 270 (401) T ss_pred ceEEEcCHHHHHHHHhcC----cccchhhccccCCccccceEEEEeceEEEeeccccccccccccccccccCCCccCCCC Confidence 566666666554443333 22221111 1111 122223468999999999998521 1 22 Q ss_pred EeeccceEEEEeecccE-EEeecccchhhh---cCceEEEEEEeecceeecccceEEEEeeCC-----CCCC Q lcl|NC_021309. 435 GHFAPSVIQTARREGVT-MQMTNSNGTDFV---DGKVTVRAEERLGLLVYRPSAFQLIQLKKG-----ATGS 497 (497) Q Consensus 435 gd~~~~~~~i~~r~~~~-i~~~~~~~~~f~---~~~v~~r~~~r~~~~v~~~~a~~~l~~~~~-----a~~~ 497 (497) |||+.....++-+.-+- ++.-+-....|. +-...+-+..=+|-.++||++...++.+-+ +.|+ T Consensus 271 ~d~s~~~~v~f~~~Av~tvk~~~lt~~~~~d~r~~~~~id~~~a~g~g~~RPeaa~vv~~k~~~~~~~~~~~ 342 (401) T protein:vir:70 271 PAMNGAIAVLFTADALLVGRSIDVTGDIFYEKKEKTYYIDTFMAEGAIPDRWEAVSVVTTKRNTTTGAVEGT 342 (401) T ss_pred ccccceeEEEEehhheEEEEeeccccchhhhhhhhHHHHHHHHHhCCcccchhheEEEeecCcccccccccC Confidence 67765444444333221 121111111111 001223355667889999999988865554 2233 No 159 >protein:vir:9927 Length: 295 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795689;genbank:gi:28876459;genbank:GeneID:1258000 Probab=98.38 E-value=1.7e-07 Score=57.67 Aligned_cols=264 Identities=14% Similarity=0.085 Sum_probs=136.0 Q ss_pred hccccccccccccchh---hhHHHHHHHHhhhhHHhhcceeecCC-CceEEEEEcCCCccceeccccccccccccccee- Q lcl|NC_021309. 151 NPFGSTGTFAPGILPT---FLPGIVEQLFYELSLADLISSRPVTS-PNLSYLTESAAHNNAAAVAEAGTYPFSSEEFAR- 225 (497) Q Consensus 151 ~~~~~~~~~g~~v~p~---~~~~ii~~~~~~~~l~~~~~~~~~~~-~~~~~p~~~~~~~~a~wv~Eg~~~~~s~~~f~~- 225 (497) |+.........+.+|+ ++..+-+.+..-..++...+..|++. ..+++|+... .+.+.-|+||+.+|-++.+... T Consensus 1 mAe~nlt~~~dL~~~~sidfv~~f~~~i~~L~~~Lgi~r~~p~a~G~tIt~pK~~~-tgda~dVaEGe~Iplskvt~~~~ 79 (295) T protein:vir:99 1 MAEKNLNTMADLGDIKSIDFVNKFSKNINDLLKLLGVTRRETLTNDLKIQTYKWEV-TLDQTDPGEGETIPLSKVTRTKD 79 (295) T ss_pred CCCcccccHhhccCceeehhhHHhhhhHHHHHHHhccccccccccCCeEEeeeeee-ecccccccCCcccchhhheeeee Confidence 2222222222233343 33333222222233444447788875 5789999775 4678889999999999998764 Q ss_pred --eEeeeeeEEeeehhhHHHHhhH--HHHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccchhhhh Q lcl|NC_021309. 226 --VYEQVGKVANALTITDEGLRDA--PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGAT 301 (497) Q Consensus 226 --i~~~~~kla~~~~iS~ell~d~--~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~ 301 (497) .++..+|++.- +|.|.++.+ .+-...-.+.|..+++.++|+.|+.--.++. .+.. ...... T Consensus 80 ~t~t~kikK~rK~--tTdEAIqlsGygdpvgead~qL~~~ia~kId~D~~~~lktat----------~t~t-g~~lq~-- 144 (295) T protein:vir:99 80 KDYTVKWFKKRRA--TTAEAIARHGAARAITEADKRIMRELQNGIKDAFFTFLKTKP----------TKVK-GVGLQK-- 144 (295) T ss_pred eeeEEEeeeeccc--ccHHHHHhcCCCchhHHHHHHHHHHHHHhhhHHHHHHhccCc----------eeee-hhhHHH-- Confidence 67777777764 499998644 3566788889999999999999885322110 0000 000000 Q ss_pred hhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechh Q lcl|NC_021309. 302 SATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPR 381 (497) Q Consensus 302 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~ 381 (497) .....+..+ +.++ .....+.+.++||. T Consensus 145 ------------------a~a~~~~al---------------------------~~f~--------Ee~~~~~V~FVnP~ 171 (295) T protein:vir:99 145 ------------------ALSASWAKL---------------------------ATFN--------EFEGSPLVSFVSPL 171 (295) T ss_pred ------------------HHHHhhhhh---------------------------hhcc--------cccCCceEEEEehH Confidence 000000000 0000 11223558899999 Q ss_pred HHHHHHHHhhhcCceeccCcccccccccccccccccccc-eEecCCCCcCceEEEeeccceEEEEe-e-cccEEEeeccc Q lcl|NC_021309. 382 DWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVP-VVTTPLIPLGTILVGHFAPSVIQTAR-R-EGVTMQMTNSN 458 (497) Q Consensus 382 ~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~P-vv~~~~~~~~~~~~gd~~~~~~~i~~-r-~~~~i~~~~~~ 458 (497) |...+++-..-+ |+.. ...|..-. -.++|.- ||.+..+|.|++|.=--....+.-.+ + .++. . T Consensus 172 D~a~yl~~A~~~----~~~a--~~fG~~~L--~nfLG~q~II~S~kv~~G~~~aT~~~Ni~~ay~~~~~g~l~------~ 237 (295) T protein:vir:99 172 DVANYLGDTKVG----ADAS--NVFGMTLL--KNFLGMQNVIVMPSVPEGKIYSTAVENLVFASLNVKGGDLG------G 237 (295) T ss_pred HHHHHHhccccc----cchh--hhhhhhhh--hhhhccceEEEcccCCCceEEEeeccceEEEEecCCchhhh------h Confidence 998887543221 1110 00000000 1378987 89999999998653211110000000 0 0010 0 Q ss_pred chhhhcCceEEEEEEe-------------ecce---eecccceEEEEeeCCCCCC Q lcl|NC_021309. 459 GTDFVDGKVTVRAEER-------------LGLL---VYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 459 ~~~f~~~~v~~r~~~r-------------~~~~---v~~~~a~~~l~~~~~a~~~ 497 (497) ..+|..|.+++.+..+ +.+. +-++++|++.++.+.++.- T Consensus 238 ~f~~~~D~tglIg~~h~~~~~~~t~et~~~~~~~lfpE~~dgiv~~tI~~~~~~~ 292 (295) T protein:vir:99 238 LFADFTDETGLIAAARNRQLSNLTYESVFFGANVLFAEIPEGVVEATIEAAAVPG 292 (295) T ss_pred hhhhccCcccceEEEeccccceeeehhhhHhHHHhcccccceEEEEEEecCcCCC Confidence 1122334444433322 1222 3356799999996655444 No 160 >protein:vir:8843 Length: 317 # NCBI annotation: major head protein # Family: family:all:3919 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775251;genbank:gi:27476049;genbank:GeneID:2700597 Probab=98.36 E-value=2.3e-07 Score=56.98 Aligned_cols=301 Identities=10% Similarity=0.032 Sum_probs=150.9 Q ss_pred hhhhccccccccccccchhhhHHHHHHHHhhhhHHhhcceeecCCCceEEEEEcCCCccceecccccccccccccceeeE Q lcl|NC_021309. 148 IGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVY 227 (497) Q Consensus 148 ~~~~~~~~~~~~g~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~wv~Eg~~~~~s~~~f~~i~ 227 (497) ......+.++......-+++...|+..-....|+..++...+.++...+|....-.++...-..||.+.+.......... T Consensus 1 ma~~~~~~~t~~~~g~~~dl~~~I~~isp~dTPf~S~i~~~~a~~~~~~W~~d~l~~~~~~~~~EG~da~~~~~~~r~~~ 80 (317) T protein:vir:88 1 MATPTNAVSTVEINGKREDLIDIIYNIAPYDTPFMSAIGKGVATAITHEWQTDELRQPGKNTRVEGEDATIKAGSFTTML 80 (317) T ss_pred CCccccceEeeeeeeeeechhhhheecCCccCcceeeecCceecccEEEEEeeecCCccccccccCcccccccccCCEEe Confidence 11112222222222333445555555556667898888888887778899887765555455678887766442222111 Q ss_pred eeeee-EEeeehhhHHHHhhH----HHHHHHHHHHHHHHHHHHHHhhhhcccCc---------ccccccccccccccccc Q lcl|NC_021309. 228 EQVGK-VANALTITDEGLRDA----PELFNFVQGRLLEGIQRKEEVQLLAGGGY---------PGVNGLLQRSTGFTASS 293 (497) Q Consensus 228 ~~~~k-la~~~~iS~ell~d~----~~l~~~i~~~la~~~~~~~d~~~l~G~G~---------~~p~Gi~~~~~~~~~~~ 293 (497) -.... +.-.+.||.-+..-+ .+..+|=..+-...+.+-++.++++|.-. .+..||++.-....... T Consensus 81 ~N~tQIf~k~v~VSgTa~av~~~G~~~ela~q~~kk~~EikrdmE~~li~g~~a~~~~~~t~~r~~~Gl~~~i~t~~~~~ 160 (317) T protein:vir:88 81 NNYCQISDETLQVTGTADRVKKAGRKNELAYQLAKKSKELKLDMEYALVGAPQAKVQRNTTTPGQMANIFAYYKTNGSLG 160 (317) T ss_pred ccEEEEEEeEEEEeehhhhhhhcCccchhHHHHHHHHHHHHHHHHHHHhcCeeeccCCCCccchhhhhHHHHhccCceec Confidence 11111 112233443332211 13223322333445778888999988511 12222222110000000 Q ss_pred ccchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhh-hhhHHHHHHHhhhhhhccC Q lcl|NC_021309. 294 ASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAE-IAENVFDAFVDIQLTLFQT 372 (497) Q Consensus 294 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~ 372 (497) ..+. .....+.......+... .-+.+.+++..+=.++ .. T Consensus 161 ------------------~~g~---------------------~~~~~~~~~~t~~t~~~lte~~l~~~l~~i~~~G-g~ 200 (317) T protein:vir:88 161 ------------------ANGV---------------------APVGDGSNTGTAGDLRLLTEDMLLNASESIWRNG-GQ 200 (317) T ss_pred ------------------cCcc---------------------ccccCCCccccccccccccHHHHHHHHHHHHhcC-CC Confidence 0000 00000000000000001 1123333333333334 34 Q ss_pred CceEEechhHHHHHHHHhhhcCceeccCcccccccccccccccccc-cceEecCCCCcCceEEEeeccceEEEEeecccE Q lcl|NC_021309. 373 PNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWG-VPVVTTPLIPLGTILVGHFAPSVIQTARREGVT 451 (497) Q Consensus 373 ~~~~~~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G-~Pvv~~~~~~~~~~~~gd~~~~~~~i~~r~~~~ 451 (497) ++.+++|+..-..|..+....+.++..+......|..+...-+=+| +.+|.+.+||++++++.|++...+... | ++. T Consensus 201 ~~~i~v~a~~k~~i~~~~~~~~~~i~~~~~~~~~g~~v~~~~tdfG~v~ii~~r~lp~~~~~~~D~~~~~l~~L-r-~~~ 278 (317) T protein:vir:88 201 ANSIQTSSSIKKAISKNMKGRATEITLDASDNRIAQTVDVYESDFGKYTIRANRWFHENTLFVFDPKMHSLCYL-R-PFF 278 (317) T ss_pred CCEEEeChHHHHHHHHHhcCCceeEEEcccCeEEEEEEEEEEeCCeEEEEEeCCCCCCCeEEEEcccccceeec-c-cce Confidence 5567899999999988855455565433222222222211112233 678999999999999999998655443 3 333 Q ss_pred EEeecccchhhhcCceEEEEEEeecceeecccceEEEEeeCCCC Q lcl|NC_021309. 452 MQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGAT 495 (497) Q Consensus 452 i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~~a~ 495 (497) .+--...+ +.........+++.+..|.|+.++...++.- T Consensus 279 ~e~laKtG-----d~~k~~i~~E~tLe~~N~~a~a~i~~l~~~~ 317 (317) T protein:vir:88 279 QHELAKTG-----DSEKRQLLVEYTFRVNNEKSGALIRDVVAQL 317 (317) T ss_pred eeccCCCc-----ccceeEEEEEEEEEEcCccceeEEEEecccC Confidence 33333333 3445666689999999999999987665555 No 161 >protein:vir:9875 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795637;genbank:gi:28876404;genbank:GeneID:1257935 Probab=98.32 E-value=1.8e-07 Score=57.56 Aligned_cols=273 Identities=14% Similarity=0.095 Sum_probs=133.0 Q ss_pred hhhhhhhhhhccccccccccccchhhhHHHHHHHHhhhhHHhhcceeecCCC-ce-EEEEEcCCCccceecccccccccc Q lcl|NC_021309. 142 ETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSP-NL-SYLTESAAHNNAAAVAEAGTYPFS 219 (497) Q Consensus 142 ~~~~~~~~~~~~~~~~~~g~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~-~~-~~p~~~~~~~~a~wv~Eg~~~~~s 219 (497) ........+...+....-+...--+|...+-..+..-.-++...+..|++.+ .+ .||..+. .+.+.-|+||+.+|-+ T Consensus 1 ~~~~~~~~e~nlt~~~dl~~~~siDf~~~f~~~i~~L~~~LGv~r~~pla~GstIkt~k~~~y-~gda~dVaEGe~Ipls 79 (296) T protein:vir:98 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDV-TLAEGNVPEGEVIPLS 79 (296) T ss_pred CCCccccCcCCCcchhhhhhhhhhhhHHHHhhhHHHHHHHhhhcccccccCCCEEeeccceee-eeccccccCCcccchh Confidence 0000000111111122222222223444443333333334444577888765 56 5565554 4678889999999999 Q ss_pred ccccee---eEeeeeeEEeeehhhHHHHhhH--HHHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccc Q lcl|NC_021309. 220 SEEFAR---VYEQVGKVANALTITDEGLRDA--PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSA 294 (497) Q Consensus 220 ~~~f~~---i~~~~~kla~~~~iS~ell~d~--~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~ 294 (497) +.+... .++..+|++--+ |.|.++.+ .+-...-.+.|..+++.++++.++.--.++ +..+.... T Consensus 80 kvt~~~~~t~t~~ikK~rK~t--TdEAIqlsGyg~aVgetd~qL~~~iq~kId~d~~t~Lkta---------T~t~~~t~ 148 (296) T protein:vir:98 80 KVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTG---------TGTQDALG 148 (296) T ss_pred hheeeecceEEEEeecccccc--CHHHHHhhcCCchhHHHHHHHHHHHHHhhhHHHHHHHhcc---------cceeeech Confidence 998764 777777777664 99998644 356677888899999999999988432111 00000000 Q ss_pred cchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCc Q lcl|NC_021309. 295 SSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPN 374 (497) Q Consensus 295 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 374 (497) .....+.. .. +....+.+. -...... T Consensus 149 ~~lQ~Ala--------------------~~------------------------------~~~l~~~fe----ded~~~~ 174 (296) T protein:vir:98 149 AGLQGALA--------------------SA------------------------------WGKLQVLFE----DYGSERA 174 (296) T ss_pred hhHHHHHH--------------------HH------------------------------hhhhhhhcc----ccCCCce Confidence 00000000 00 000000000 0112356 Q ss_pred eEEechhHHHHHHHHhhhcCceeccCcccccccccccccccccccceEecCCCCcCceEEEeeccceEEEEeecccEEEe Q lcl|NC_021309. 375 AVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQM 454 (497) Q Consensus 375 ~~~~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~ 454 (497) +.++||.|...++ ++++ |-.. ...|..-. -.++|.-+|.+..+|.|++|.---....+.-.+-.+- +. T Consensus 175 V~FVnP~D~a~yl--g~a~---it~q---t~fG~tyl--~nfLG~~II~S~kV~~G~~~~T~~~Ni~~ay~~~~~~--~l 242 (296) T protein:vir:98 175 IVFANSLDVAEYI--AKAG---ITTQ---TAFGLTYL--VDFTGTVIISTNDVTKGEIWATVPENIIFAYINPNNS--EL 242 (296) T ss_pred EEEEehHHHHHHh--cCCc---cchh---heechhhh--hhccccEEEEcCcCCCceEEEeeecceEEEeeccccc--ch Confidence 7899999987755 3332 1110 01110000 1277888899999999987643222111111110000 00 Q ss_pred ecccchhhhcCceEEEEEEe-------------ecce---eecccceEEEEeeCCC Q lcl|NC_021309. 455 TNSNGTDFVDGKVTVRAEER-------------LGLL---VYRPSAFQLIQLKKGA 494 (497) Q Consensus 455 ~~~~~~~f~~~~v~~r~~~r-------------~~~~---v~~~~a~~~l~~~~~a 494 (497) +. ...|..|.+++.+..+ +.+. +-++++|++.+++++. T Consensus 243 ~~--~f~~~~d~tglIGv~h~~~~~~~t~eT~~~~~~~lfpE~~dgiv~~tI~~~~ 296 (296) T protein:vir:98 243 AK--EFNLYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKVTLTPGV 296 (296) T ss_pred hh--hhccccccccceEEEeccccceeeehhHhHhHHHhcccccceEEEEEecCCC Confidence 00 0112233344333322 1222 3356799999997777 No 162 >protein:vir:1583 Length: 351 # NCBI annotation: minor capsid protein # Family: family:all:1522 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695165;swissprot:trembl:o03966;genbank:gi:23455804;uniprot:O03966;genbank:GeneID:955561 Probab=98.31 E-value=5.4e-07 Score=54.93 Aligned_cols=277 Identities=10% Similarity=0.011 Sum_probs=138.3 Q ss_pred hccccccccccccchhhhHHHH-HHHHhhhhHHh---------hcceeecCCCceEEEEEcCCCccceeccccccccccc Q lcl|NC_021309. 151 NPFGSTGTFAPGILPTFLPGIV-EQLFYELSLAD---------LISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSS 220 (497) Q Consensus 151 ~~~~~~~~~g~~v~p~~~~~ii-~~~~~~~~l~~---------~~~~~~~~~~~~~~p~~~~~~~~a~wv~Eg~~~~~s~ 220 (497) |+. +.-..+|.||.....+ +...+.+.+++ +.....-+++.+++|....-++.+.-+.|+..++..+ T Consensus 1 MA~---T~lsd~i~PEvf~~yv~~~~~~~~~l~qSG~i~~~~~l~~~~~~~G~~it~P~~~~l~Gd~~~~~~~~~i~~~k 77 (351) T protein:vir:15 1 MAE---THLSDLIVPEVFGNYVVNQIIKTNRFVQSGILTPDPDLGPHLLEAGTRITVPFLNDLTGDPDNWTDSDDIDVNN 77 (351) T ss_pred CCc---eeeeeeechhHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhcCCCEEEecccccCCCcccccCCCcccchhe Confidence 332 2334578888766655 33333333332 1111223466788998765445677889999988888 Q ss_pred ccceeeEeeeeeEEeeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccchhh Q lcl|NC_021309. 221 EEFARVYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFG 299 (497) Q Consensus 221 ~~f~~i~~~~~kla~~~~iS~ell~d~-~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~ 299 (497) .+-++-....+..+--..++++...-+ .+....|.++++..+.+..+..+|.- ..|++........... T Consensus 78 itt~~~~a~i~~~~kg~~~tD~a~~~sg~dp~~~i~~q~a~~w~~~~q~~lla~-----l~gv~~~~~~~~~~~~----- 147 (351) T protein:vir:15 78 LTSGKQQGIKFYQTKAYGYTDLGTMISGAPVQETIGNRFAAFWQRADQKTLLSV-----LKGVMGVTKIANSKVY----- 147 (351) T ss_pred ecccceeEEEEeeccceehhhhhHhhccchHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHhhchhhccccee----- Confidence 877776666666665566666543322 35667789999998888888877641 1222211110000000 Q ss_pred hhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEec Q lcl|NC_021309. 300 ATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMN 379 (497) Q Consensus 300 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n 379 (497) .... ..........+.+.++.....-.....-.+|+|| T Consensus 148 ---------------------------------------d~t~---~~~~~~~is~~~l~~A~~~~GD~~~~~~~~ivmh 185 (351) T protein:vir:15 148 ---------------------------------------DQTK---VSPSEPMFGAKGFTGAIGLMGDLQDTAFGAIAVN 185 (351) T ss_pred ---------------------------------------cccc---ccccccccCHHHHHHHHHHhccccccceEEEEEC Confidence 0000 0000000111233333333322212223579999 Q ss_pred hhHHHHHHHHhhhcCceeccCcccccccccccccccccccceEecCCCCcC-------ceEEEeeccceEEEEeecccEE Q lcl|NC_021309. 380 PRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLG-------TILVGHFAPSVIQTARREGVTM 452 (497) Q Consensus 380 ~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~-------~~~~gd~~~~~~~i~~r~~~~i 452 (497) +..+..|++..--+ |+-.. .+ ...-+++.|+|||+++.+|.. .+..--|..+++...++ +..+ T Consensus 186 S~v~~~L~~~~li~--~~~~s--~~-----~~~i~t~~G~~VivdD~~p~~~~~~~~~~ytsyl~~~GAi~~~~~-~~~v 255 (351) T protein:vir:15 186 SATYSLMKVQGLIE--TIQPQ--NG-----ATPFEAYNGLRIVLDDDIEIDLTDKTKPVSTSYIFAPGAVRYSTN-MRST 255 (351) T ss_pred hHHHHHHHhhhhhh--hcccc--cc-----CcccceecceEEEEcCCCccccCCCCCceeEEEEEecceeeeecC-CcCc Confidence 99999998654111 11100 00 112357899999999999842 11111123333443332 3334 Q ss_pred EeecccchhhhcCceEEEEEEeecceeecccceEEEEeeCCCCCC Q lcl|NC_021309. 453 QMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 453 ~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~~a~~~ 497 (497) ++.+.... ..++-.+....++ ++||-.+.+-.-..+..|. T Consensus 256 e~~rd~~~--~~g~d~l~~r~~~---~~hp~G~s~~~~~~~~~~~ 295 (351) T protein:vir:15 256 ETKYDPLI--NGGQDVIVQKRVG---TIHVAGTSIKASFSPSKAS 295 (351) T ss_pred ceeecccC--CCCceEEEEeeee---eeeeeeeeecccccccCcC Confidence 44333221 1244444444443 5778777653222212111 No 163 >protein:vir:102944 Length: 330 # NCBI annotation: major head protein # Family: family:all:1522 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945286;genbank:gi:39653721;uniprot:Q708M6;genbank:GeneID:2672858 Probab=98.20 E-value=1.9e-06 Score=51.90 Aligned_cols=280 Identities=11% Similarity=0.015 Sum_probs=138.8 Q ss_pred hccccccccccccchhhhHHHH-HHHHhhhhHHh---------hcceeecCCCceEEEEEcCCCccceeccccc-ccccc Q lcl|NC_021309. 151 NPFGSTGTFAPGILPTFLPGIV-EQLFYELSLAD---------LISSRPVTSPNLSYLTESAAHNNAAAVAEAG-TYPFS 219 (497) Q Consensus 151 ~~~~~~~~~g~~v~p~~~~~ii-~~~~~~~~l~~---------~~~~~~~~~~~~~~p~~~~~~~~a~wv~Eg~-~~~~s 219 (497) |+. .++.-..+|.||.....+ +.+.+.+.+++ +......+++.+++|....-++.+.-+.|++ .++.. T Consensus 1 Ma~-~~T~l~d~i~pevf~~yv~~~~~~~~~l~qSG~i~~~~~i~~~~~~~G~~i~~P~~~~l~G~~~~~~dg~~~i~~~ 79 (330) T protein:vir:10 1 MAN-ELTKILDTITPQQYNAYMQQYTAAKSAFVQSGIAVSDERVSKNITSGGLLVNMPFWNDLTGDSEVLGNGDKALETG 79 (330) T ss_pred CCC-CceEeeeeechhHHHHHHHHHhHHhhhhhhcccccccHHHHHHhhcCCCEEEecccccCCCcccccCCCccccchh Confidence 332 224445678888666544 33333333322 1112223567889999875556677788885 57777 Q ss_pred cccceeeEeeeeeEEeeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccchh Q lcl|NC_021309. 220 SEEFARVYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLF 298 (497) Q Consensus 220 ~~~f~~i~~~~~kla~~~~iS~ell~d~-~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~ 298 (497) +.+-++-....++.+--..++++..-.+ .+-...+.++++..+.+..+..+|.- ..|+++........... T Consensus 80 ki~t~~~~a~i~~~~k~~~~tD~a~~~~g~dp~~~i~~q~a~~w~~~~q~~lla~-----l~gvf~~~~~~~~~~~~--- 151 (330) T protein:vir:10 80 KITAGADIACVLYRGRGWAANELTGVVAGSDPVRAILNRIGAYWLREDQKALIAT-----LNGIFATGTAGEKGALE--- 151 (330) T ss_pred hcccceeEEEEEeecceeeehhhhhhhcchhHHHHHHHHHHHHhhhhHHHHHHHH-----HHhhhhhhhcccchhhh--- Confidence 7777777777777666666666653322 45566788888888888777776632 23333322111000000 Q ss_pred hhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEe Q lcl|NC_021309. 299 GATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVM 378 (497) Q Consensus 299 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 378 (497) ............. ...+.+.++.....-. ...-.+|+| T Consensus 152 -------~~~~~~~~~~~a~----------------------------------~s~~~l~~A~~~~GD~-~~~~~~ivm 189 (330) T protein:vir:10 152 -------ETHVSDQSKASTG----------------------------------IDAGMVLDAKQLLGDS-ADQVTAIAM 189 (330) T ss_pred -------hhheecccccccc----------------------------------cCHHHHHHHHHHhccc-cccceEEEE Confidence 0000000000000 0011222222222111 123457999 Q ss_pred chhHHHHHHHHhhhcCceeccCcccccccccccccccccccceEecCCCCcCc--eEEEeeccceEEEEeecc---cEEE Q lcl|NC_021309. 379 NPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGT--ILVGHFAPSVIQTARREG---VTMQ 453 (497) Q Consensus 379 n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~--~~~gd~~~~~~~i~~r~~---~~i~ 453 (497) |+.++..|++..--+ |+-... + ...-++++|++||+++.+|... +..--|..+++.+.+... ..++ T Consensus 190 hS~v~~~L~~~~li~--~~~~s~--~-----~~~i~~~~G~~VivdD~~p~~~~~yt~yl~~~GAi~~~~~~~~~~v~~E 260 (330) T protein:vir:10 190 HSAVYTKLQKDNLIQ--YIQPTT--A-----TINIPTYLGYRVIIDDGIAPTGDIYTSYLFRTGSIGLNTGNPSGLTTFE 260 (330) T ss_pred cHHHHHHHHHhhhhh--hhcccc--c-----CcccccccceEEEEeCCCCCCCCceeEEEEecCceeeecccCCcccccc Confidence 999999998753111 111110 0 1123578999999999999533 111123444555543211 2333 Q ss_pred eecccchhhhcCceEEEEEEeecceeecccceEEEEeeCCCCCC Q lcl|NC_021309. 454 MTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 454 ~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~~a~~~ 497 (497) .++. ..+++..+....++ ++||-.|.+-.-.....|. T Consensus 261 tdRd----~~~g~~~l~~r~~~---~~hp~G~s~~~~~~~~~~~ 297 (330) T protein:vir:10 261 TSRE----AAKGNDMIYTRRAL---VMHPYGVKWTGAEVDAGNI 297 (330) T ss_pred ccCC----ccccceEEEEeeEE---EeeeeeeeecccccccCcC Confidence 3332 22455555555553 4667666544322111111 No 164 >protein:vir:80180 Length: 381 # NCBI annotation: capsid protein # Family: family:all:2203 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285797;genbank:gi:148747831;genbank:GeneID:5220456 Probab=98.09 E-value=1.3e-06 Score=52.93 Aligned_cols=294 Identities=11% Similarity=-0.029 Sum_probs=143.1 Q ss_pred HHHHHHHHHHHhhhhhhhhhhhhcccccc-ccccccchhhhHHHHHHHHhhhhHHhhcceeec---CCCceEEEEEcCCC Q lcl|NC_021309. 129 TAAAELMGAFADGETAPAAIGQNPFGSTG-TFAPGILPTFLPGIVEQLFYELSLADLISSRPV---TSPNLSYLTESAAH 204 (497) Q Consensus 129 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~g~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~---~~~~~~~p~~~~~~ 204 (497) .+ ..... -..+..+-.. .--.++|..|...+++.+.+.+.+..+++.... .+.++++|+.. . T Consensus 1 ~~------~~~~~------~~~~~~~~~~t~~~~fiPev~s~~v~~~l~~~lv~~~l~~~~~~~~~~GdTV~ip~~g--~ 66 (381) T protein:vir:80 1 MA------TIQGT------GGYKGSAVDLSNVQVFIPEVWSSEVRMFRDQKFAALEATKKIPFEGKKGDLIHIPNIS--R 66 (381) T ss_pred Cc------eeccc------ccccCcccchhhHHhhhhHHHHHHHHHHHHHhhhhhhccccccceeecCceEEeeccC--c Confidence 00 00000 0001111111 112345555777888898888888887655332 35678899854 4 Q ss_pred ccceecccccccccccccceeeEeeeeeEEe-eehhhHH-HHhhHHHHHHHHHHHHHHHHHHHHHhhhhcccC--ccccc Q lcl|NC_021309. 205 NNAAAVAEAGTYPFSSEEFARVYEQVGKVAN-ALTITDE-GLRDAPELFNFVQGRLLEGIQRKEEVQLLAGGG--YPGVN 280 (497) Q Consensus 205 ~~a~wv~Eg~~~~~s~~~f~~i~~~~~kla~-~~~iS~e-ll~d~~~l~~~i~~~la~~~~~~~d~~~l~G~G--~~~p~ 280 (497) +.+....++...+..+.+.+++++...+.-. -..|++. ..+...++.+.+.+.+..+++++.|+.++.--. ...+. T Consensus 67 ~~a~d~~~g~~i~~~~~~~~~~~itID~~~~~~~~Idd~D~~~~~~D~~~~~~~~~~~aLA~~~D~~i~~~~~~~~~~~~ 146 (381) T protein:vir:80 67 AAVYDKQPQTPVNLQARTDSEFTFTVTKYKESSFMIEDIVNTQASYTLRQYYTKEAGYALARDMDNFALAHRAVINAFPS 146 (381) T ss_pred ceeeeecCCCcccccccCCceEEEEEeeeeecceeechHHHHhhccChHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc Confidence 5677888888887778888887777755433 3667654 345556888999999999999999998873210 00001 Q ss_pred cc--cccccccccccccchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHH Q lcl|NC_021309. 281 GL--LQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENV 358 (497) Q Consensus 281 Gi--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 358 (497) +. -...+....... ...........++.+ T Consensus 147 ~~~~t~~~~i~~~~~~-------------------------------------------------~~~t~~~~~~t~~~i 177 (381) T protein:vir:80 147 QRIYSYDTTLGDGTVN-------------------------------------------------AHLTGTPAPLTYAAL 177 (381) T ss_pred cccccccccccccccc-------------------------------------------------cccccchhhHHHHHH Confidence 00 000000000000 000000011122333 Q ss_pred HHHHHhhhhhhc-cCCceEEechhHHHHHHHHhhhcC-ceeccCcccccccccccccccccccceEecCCCCcCceE--E Q lcl|NC_021309. 359 FDAFVDIQLTLF-QTPNAVVMNPRDWELLRLTKDANG-QYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTIL--V 434 (497) Q Consensus 359 ~~~~~~~~~~~~-~~~~~~~~n~~~~~~l~~lkd~~G-~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~~~--~ 434 (497) +.+...+....- ..+.+++++|..+..|.+...-.. .|... .....+...++.|++|+.++.+|.+.+. . T Consensus 178 ~~a~~~Lde~~VP~egR~lvv~P~~~~~Ll~~~~~~~ad~~~~------~~l~~G~Ig~i~G~~Vv~Sn~lp~~~~t~~~ 251 (381) T protein:vir:80 178 LLAKQKLDEADVPQEGRIVMVSPAQYIDLLSINQFISVDFSQV------KPVTSGVVGTILGMEVIVTTQIGINSLTGYV 251 (381) T ss_pred HHHHHHHhhcCCCcCCcEEEeCHHHHHHHhhchhhhhhhhccc------hhhhceeeeEEcceEEEeeccccccccccee Confidence 344333333221 123468999999998875422111 11111 1111222357999999999999975431 1 Q ss_pred EeeccceEEEEeecccEEEeecccchhhhcCceEEEEEEeecceeecc-cceEEEE---eeCCCC----CC Q lcl|NC_021309. 435 GHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRP-SAFQLIQ---LKKGAT----GS 497 (497) Q Consensus 435 gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~-~a~~~l~---~~~~a~----~~ 497 (497) ..+. .-.. ....+.-+.+.+ +|..+-..++.....|..+... ..+-... .+.... |+ T Consensus 252 ~~ag---ap~~--~~~~~~~~~~~g-~~s~~a~av~~~k~yd~~~~~~~~~~~~~~g~~~~~~~~~~~~~~ 316 (381) T protein:vir:80 252 NGQG---APTQ--PTPGVLGSPYLP-DQAGTANVVNTGSASDLAVSLSYFGLPVFSGAGATAADGGQTLGS 316 (381) T ss_pred eecc---cccc--cccccccccccc-ccccceeeeeeeeeeceeeeeeeccceeeecceeeecCCCceeee Confidence 1110 0000 001112222322 3555556666666677766432 2222111 111111 11 No 165 >protein:vir:106647 Length: 303 # NCBI annotation: ORF011 # Family: family:all:1178 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239493;genbank:gi:66395226;genbank:GeneID:4555801 Probab=97.84 E-value=4e-06 Score=50.18 Aligned_cols=274 Identities=14% Similarity=0.113 Sum_probs=130.3 Q ss_pred hhhhhhccccccccccccchhhhHHHHHHHHhhhhHHhhcceeecCCC-ceEEEEEc--CCCccceeccccccccccccc Q lcl|NC_021309. 146 AAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSP-NLSYLTES--AAHNNAAAVAEAGTYPFSSEE 222 (497) Q Consensus 146 ~~~~~~~~~~~~~~g~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~-~~~~p~~~--~~~~~a~wv~Eg~~~~~s~~~ 222 (497) +...... .....-+...--+|...+-..+..-.-++...+..|+..+ .++.++.. .....+.-|+||+.+|-++.+ T Consensus 1 M~~e~nl-~~~~dL~~a~siDF~~~f~~~i~~L~~~LGv~r~~pla~Gt~iktyK~~~~~y~gda~dVaEGe~Iplskvt 79 (303) T protein:vir:10 1 MSAENNL-INVEALGKAKSIDFANKLGVGLNKLFEALAIQNKIPMNVGSALKQYRFKVEDSEKPNGDVAEGDVIPLTKVT 79 (303) T ss_pred CCCCcCC-cchhhcccceeehhhhhhhhhHHHHHHHhhhhccccccCCceeeeeeeeceeeccccccccCCcccchhhhe Confidence 1111111 1111112222234444443333333344444577777654 45444432 124567889999999999988 Q ss_pred ce---eeEeeeeeEEeeehhhHHHHhhH--HHHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccc-cccccccc Q lcl|NC_021309. 223 FA---RVYEQVGKVANALTITDEGLRDA--PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTG-FTASSASS 296 (497) Q Consensus 223 f~---~i~~~~~kla~~~~iS~ell~d~--~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~-~~~~~~~~ 296 (497) .. ..++..+|++--+ |.|.++.+ .+-...-.+.|..+++.++++.|+.---++. .+.... .+-..... T Consensus 80 ~~~~~t~~~~~kK~rK~t--TdEAIqlsGyg~aVgetd~qL~~~Iq~kIdnd~~~~lktaT----~t~~~t~~t~~s~~g 153 (303) T protein:vir:10 80 REQVDITELQFAKYRKST--SAEAIQAHGYDLAINQTDNEMIKYVQKKFRAKFFETLKSAI----ENGKRTNKTKLSAEN 153 (303) T ss_pred eeecceEEEEeecccccc--cHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhhcc----cccccccceeecHHH Confidence 64 5788888888754 99998644 3566777888888899999988874211100 000000 00000000 Q ss_pred hhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceE Q lcl|NC_021309. 297 LFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAV 376 (497) Q Consensus 297 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 376 (497) . ...+.... .. ++.... .....+. T Consensus 154 l---------------------------q~Al~~~~----------------~k----l~~~~e---------d~~~~V~ 177 (303) T protein:vir:10 154 L---------------------------QGALSKGR----------------AN----LSVLLD---------DEITPIA 177 (303) T ss_pred H---------------------------HHHHHhhh----------------hh----cccccc---------ccccEEE Confidence 0 00000000 00 000000 0112378 Q ss_pred EechhHHHHHHHHhhhcCceeccCcccccccccccccccccccceEecCCCCcCceEEEeeccceE-EEEeecccEEEee Q lcl|NC_021309. 377 VMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVI-QTARREGVTMQMT 455 (497) Q Consensus 377 ~~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~~~~gd~~~~~~-~i~~r~~~~i~~~ 455 (497) ++||.|...++. +++ ++. .....|-.-. -.++|.-||.+..+|.|++|.=--....+ .+..+.++. T Consensus 178 FvNP~Daa~yl~--~A~---i~~--~~t~fG~n~L--~nfLG~~II~S~kv~~G~~~~T~~~Ni~~ay~~~~g~l~---- 244 (303) T protein:vir:10 178 FVNPNDTAEYLA--NGF---INS--TGAQFGVNLL--TPYVGVKIVEFADVPQGEVWMTVAENLNVAYANPRGELS---- 244 (303) T ss_pred EEchHHHHHHhh--cCC---cch--hhhhhhhhhh--hhhhcceEEEeccCCCceEEEeeccceEEEEecCchhhh---- Confidence 999999998864 332 110 0001111000 13789889999999999875422111111 111111111 Q ss_pred cccchhhhcCceEEEEEEe-------------ecce---eecccceEEEEeeCCCCCC Q lcl|NC_021309. 456 NSNGTDFVDGKVTVRAEER-------------LGLL---VYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 456 ~~~~~~f~~~~v~~r~~~r-------------~~~~---v~~~~a~~~l~~~~~a~~~ 497 (497) ..-.|+.|.+++.+..+ +.+. +-++++|++.++++.=.+- T Consensus 245 --~~f~~t~D~tglIGv~h~~~~~~~t~eT~~~~~~~lfpE~~dgiv~~ti~~~e~~~ 300 (303) T protein:vir:10 245 --RAFAFATDATGFVGVLHDIQPQRLTSDTIYASAISMFPENIDAVIKVTIKKDEAGE 300 (303) T ss_pred --hhhhhccccccceEEEeccccceeeehhHhHhHHHhcccccceEEEEEEeccccCC Confidence 11123344444444332 1222 3356789999996543222 No 166 >protein:vir:79548 Length: 652 # NCBI annotation: putative protease/scaffold protein # Family: family:all:62 # ACLAME annotation(s): go:0008236 - serine-type peptidase activity; phi:0000017 - phage prohead/capsid assembly # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272518;genbank:gi:148609387;genbank:GeneID:5204384 Probab=97.84 E-value=7.6e-06 Score=48.66 Aligned_cols=425 Identities=12% Similarity=0.095 Sum_probs=153.5 Q ss_pred CchH-HHHHHHHHHHHHHHHHH--------------------------------HHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021309. 1 MPST-AQLEAQGRQLAKSIKDI--------------------------------NADETKTAAEKKEALAKIEPDFKAHQ 47 (497) Q Consensus 1 ~~~~-a~~~~~~~~~~~~~~~~--------------------------------~~~~~~~~~e~~~~~~~~~~~~~~~~ 47 (497) |... -...++.+.+.+.++.+ .+...+.+.+.++++..+.+.+..+. T Consensus 174 ~a~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~v~d~EPa~~~~pvqAaAP~~De~airAq~~aeeraRi~~I~~l~a~Fg 253 (652) T protein:vir:79 174 MACIQSKRTEEFKKMPDSIRNMITPPRNSAPRVQDDEPAASRTPVQAAAPVVDENSIRAQVLAEQKARVNGINDLFAMFG 253 (652) T ss_pred hhhhhhhhhhhhhhhHHHHHHHhcccccccccccccccccccccccccCCcCchhHHHHHHHHHHHHHHHHHHHHHHhhc Confidence 1110 00000111111111110 00001112223333322222222111 Q ss_pred HhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHhhhhhhHHHHhHhhhhhh---hhhhhhHHHHH Q lcl|NC_021309. 48 AEVEAHERAQEMLKSLGGADAAKDGLDNDIPEVEVRNL--KQIRKHLARAVIMNPELKNATSFEKG---TKFDVSFNVSA 122 (497) Q Consensus 48 ~~~~~~e~~~e~~~~~~~~~a~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~ 122 (497) ..++... .+.+...--.++...+.+-+.+.+...... ....-..............+.-...+ ........ .. T Consensus 254 gr~~~l~-~~~l~d~~~s~e~ar~~il~~l~~~~~p~~~~~~~~~~~~~g~~~~d~~~~aL~~R~g~~~~~~~~~~~-g~ 331 (652) T protein:vir:79 254 GRYQTLQ-AQCLADPECSLEQAREKLLNEMGRESTPSNKNTPAHIYAGNGNFVGDGIRQALMARAGFEKTERDNVYN-GM 331 (652) T ss_pred cccchHH-HHHhhccCCCHHHHHHHHHHHHHhhcCCCCCCcceeEeeccchhhHHHHHHHHHhhcCCcccccCcccc-Cc Confidence 0000000 000000000011111111111111000000 00000000000000000000000000 00000000 00 Q ss_pred HhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccccccccchhhhHHHHHHHHhhh-hHHhhcceeecCCC-ceEEEEE Q lcl|NC_021309. 123 KAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYEL-SLADLISSRPVTSP-NLSYLTE 200 (497) Q Consensus 123 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~p~~~~~ii~~~~~~~-~l~~~~~~~~~~~~-~~~~p~~ 200 (497) ......+.....++....+............-+++.++.++-...-..+.+...... +.+..|...+++-- .-+..+. T Consensus 332 ~L~elAr~~L~~~G~~~~~~~~~~~v~~A~~hsTsDFp~IL~~~~nk~l~~~y~~a~~t~~~~~~~~~~~DFk~~~~~~l 411 (652) T protein:vir:79 332 TLREYARMSLTERGIGVSSYNPMQMVGAAFTHSTSDFGNILLDVANKAILQGWEDAPETYEQWTRKGQLSDFKIAHRVGM 411 (652) T ss_pred cHHHHHHHHHHhhccCCCCCCHHHHHHHHhhcCcchHHHHHHHHHHHHHHHHHhhhHHHHHHHhccCCCccccccceeec Confidence 000111111112221111111111112222123333332222222223444443333 56666777666532 2233343 Q ss_pred cCCCccceecccccccccccccceeeEeeeeeEEeeehhhHHHH-hhHHHHHHHHHHHHHHHHHHHHHhhh---hcccCc Q lcl|NC_021309. 201 SAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTITDEGL-RDAPELFNFVQGRLLEGIQRKEEVQL---LAGGGY 276 (497) Q Consensus 201 ~~~~~~a~wv~Eg~~~~~s~~~f~~i~~~~~kla~~~~iS~ell-~d~~~l~~~i~~~la~~~~~~~d~~~---l~G~G~ 276 (497) +.-+..--|.|++........=+..++...+++.++.||||++ .|-..+-.-|-..+.++.++.+++.+ |.++.. T Consensus 412 -g~~~~L~~V~E~gEyk~~t~~e~~e~~~l~tyG~~~~iTRqaiINDDL~a~~~ip~~~g~aA~~~~~~~vy~~l~~Np~ 490 (652) T protein:vir:79 412 -GGFSALRQVREGAEYKYVTTGDKQATIALATYGELFSITRQAIINDDLNMLTDVPMKLGRAAKSTIADLVYAILTSNPK 490 (652) T ss_pred -CCCCCccccCCCCccceeeecCccceeeeecccCeeeeehheeeccchhHHHHHHHHHHHHHHHHHHHHHHHHHhcCcc Confidence 3346677889999988776666777899999999999999986 67777777777888888887777543 333211 Q ss_pred c--ccccccccccccccccccchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhh Q lcl|NC_021309. 277 P--GVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEI 354 (497) Q Consensus 277 ~--~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 354 (497) - --+.+|.-+.........+ .....+...+.....-. T Consensus 491 ~~~DGk~LF~hA~H~Nl~~~aa-----------------------~~~~~l~~ar~aM~~Qk------------------ 529 (652) T protein:vir:79 491 ISTDNVSLFDKAKHANVLESAA-----------------------MDVASLDKARQLMRVQK------------------ 529 (652) T ss_pred cccCCceeeccccccccccccc-----------------------CCHHHHHHHHHHHHHhc------------------ Confidence 0 0111221111100000000 00000010100000000 Q ss_pred hhHHHHHHHhhhhhhccCCceEEechhHHHHHHHHhhhcCceeccCcccccccccccccccccc-cceEecCCCCcC--- Q lcl|NC_021309. 355 AENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWG-VPVVTTPLIPLG--- 430 (497) Q Consensus 355 ~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G-~Pvv~~~~~~~~--- 430 (497) .........|..|++.+.-....+++..+.. +-..+...+..+| +.| ..||+++.+..+ T Consensus 530 ---------~g~~~l~i~P~~llvp~~le~~a~~ll~s~~--v~~a~~~~~~~Np------~~~~~~~i~eprL~~~s~~ 592 (652) T protein:vir:79 530 ---------EGERHLNIRPAFVLVPTAMESVANQVIRSSS--VKGADINAGIINP------VKDFATVIAEPRLDDNSQT 592 (652) T ss_pred ---------cCCccccccccEEEecchhHHHHHHHhccCC--Ccccccccccccc------cccccccccccccCCCCcc Confidence 0001122345566666665555555432221 1100001111111 223 255666666432 Q ss_pred ceEEEeeccc-eEEE---EeecccEEEeecccchhhhcCceEEEEEEeecceeecccceEEEEe Q lcl|NC_021309. 431 TILVGHFAPS-VIQT---ARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQL 490 (497) Q Consensus 431 ~~~~gd~~~~-~~~i---~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~ 490 (497) ..|+++-... .+.+ .-.++..|+. .. .|..+-+.+++...+|.+++|--.+++.+- T Consensus 593 ~wylaa~~~~dtiev~yL~G~~~P~ie~--~~--gf~~dG~~~kvrlD~G~~~iD~RG~~k~t~ 652 (652) T protein:vir:79 593 TFYLAASKGSDTIEVAYLNGVDTPYIDQ--ME--GFSVDGVTTKVRIDAGVAPVDHRGLVKCTA 652 (652) T ss_pred cEEEecCCCCCeEEEEEecCCCCCeeee--cC--CCCcceEEEEEEEeccCceeeccceeeecC Confidence 2333332210 1111 1122333432 22 399999999999999999999999987776 No 167 >protein:vir:3136 Length: 322 # NCBI annotation: hypothetical protein # Family: family:all:11728 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640318;genbank:gi:21234405;genbank:GeneID:956058 Probab=97.80 E-value=2.7e-06 Score=51.11 Aligned_cols=282 Identities=11% Similarity=0.038 Sum_probs=139.2 Q ss_pred hccccccccc-cccch-hhhHHHHHHHHhhhhHHhhcceeec-CCCceEEEEEcCCCccceecccccccccccccce--e Q lcl|NC_021309. 151 NPFGSTGTFA-PGILP-TFLPGIVEQLFYELSLADLISSRPV-TSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFA--R 225 (497) Q Consensus 151 ~~~~~~~~~g-~~v~p-~~~~~ii~~~~~~~~l~~~~~~~~~-~~~~~~~p~~~~~~~~a~wv~Eg~~~~~s~~~f~--~ 225 (497) +..+..++.+ .++.| .|...|...+.+.+....+.++... .|+++.||.... ++..=..+++...--..+-. . T Consensus 1 ~~~~n~ts~~qafi~~EiWsa~il~~l~~~Lv~~~~~~~~d~g~GDtV~InsIg~--~tV~dY~~~~~i~~d~ltt~~~~ 78 (322) T protein:vir:31 1 MSTGNNTSNTQALIVSEIWADEIEDILHEKLLDVNIARVVDFPDGDKLTIPSVGT--PVVRSRPEQGDFTFDNLDTGEIS 78 (322) T ss_pred CCCCCCcccceEEeehhhhHHHHHHHhhhhhhhhhhhcccccCCCCeEEeccccc--cccccccCCCCcccccCCCceEE Confidence 3444333333 34545 5778888777777665555554332 467888888653 33333334444332233333 3 Q ss_pred eEeeeeeEEeeehhhHHHHhhHHHHHHHHHHHHHHHHHHHHHhhhhc--ccCc------cccccccccccccccccccch Q lcl|NC_021309. 226 VYEQVGKVANALTITDEGLRDAPELFNFVQGRLLEGIQRKEEVQLLA--GGGY------PGVNGLLQRSTGFTASSASSL 297 (497) Q Consensus 226 i~~~~~kla~~~~iS~ell~d~~~l~~~i~~~la~~~~~~~d~~~l~--G~G~------~~p~Gi~~~~~~~~~~~~~~~ 297 (497) +.+...|+.++. |+++..|++.+|.+...+..+++++...|+.+.. -+|. +.|.-+-..+..... T Consensus 79 l~IDq~KYfaf~-VdDD~~Qa~~dl~~~~~~~aa~ala~~~D~fva~lL~~gA~~~~~~~~p~vin~~~~~iv~------ 151 (322) T protein:vir:31 79 IILRDEVYAGNA-ISKKLRQDSRWISNVGAMLPAEQARAIMERYQTDLLALGNAQFAGQNDPNVINGVPHRFVG------ 151 (322) T ss_pred EEEehhhhhccc-cchhHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccCCcceecCCccceec------ Confidence 445555566554 7777778888999999999999999988887631 0111 001100000000000 Q ss_pred hhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccC-CceE Q lcl|NC_021309. 298 FGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQT-PNAV 376 (497) Q Consensus 298 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~ 376 (497) ........++.+.++...+....--. +.+. T Consensus 152 -------------------------------------------------~gt~~~~ay~~lv~l~~kLdkanVP~~gR~v 182 (322) T protein:vir:31 152 -------------------------------------------------TGTDQTMDVTDFSRVNYVMTQSKMPMGGMIG 182 (322) T ss_pred -------------------------------------------------cCCCchhhHHHHHHHHHHhccccCCCCCeEE Confidence 00000111233333333333222222 2345 Q ss_pred EechhHHHHHHHH-----hhhcCceeccCcccccccccccccccccccceEecCCCCcCc--eE---------EEeeccc Q lcl|NC_021309. 377 VMNPRDWELLRLT-----KDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGT--IL---------VGHFAPS 440 (497) Q Consensus 377 ~~n~~~~~~l~~l-----kd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~--~~---------~gd~~~~ 440 (497) +++|.-...|..+ --.++|+......+...|.. ...++.|.-|++|+.++.++ ++ .|-++- T Consensus 183 VV~P~~~~~L~~i~~~~~l~~D~rf~~i~~sG~a~g~~--~Vg~~~GF~V~~SN~l~~~~~~i~aG~d~~~t~ag~~n~- 259 (322) T protein:vir:31 183 IIDPSVAHHLETITNISNISNNPRWEGIVESGIAPDMQ--FVRSVYGIDLFVSNLLADANETINAGGDARSTTAGKCNM- 259 (322) T ss_pred EeCchhhhhhhhhhhhhhhhccccccccccccchhhHH--HHHHHhceeeeeeccccccccccccCcccccccceeecc- Confidence 6667766655332 23344432211111111111 13578999999999997543 11 122221 Q ss_pred eEEEEeeccc-------EE---EeecccchhhhcCceEEEEEEeecceeecccceEEEEeeCCCCCC Q lcl|NC_021309. 441 VIQTARREGV-------TM---QMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 441 ~~~i~~r~~~-------~i---~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~~a~~~ 497 (497) +..+.|-+-. ++ +.... .+ +.--.+|+..|.|.++.+|+..+.|.-.+...-- T Consensus 260 f~~~~~~~~~~~~~~~~~l~~~e~~r~-~~---~~~d~~~~~~~~g~g~~r~e~l~~~~a~~~~~~~ 322 (322) T protein:vir:31 260 FMNVSDMGLLPFVVAWKEMPTTKSFID-DY---NDDLNTATTARWGNGLVRDENLVCVLANADKVTF 322 (322) T ss_pred cccccchhhhhhhhHhhhhhhhhcccC-cc---ccccceeeeeeecceeecccceEEEEeccccccC Confidence 1111111110 01 11111 11 2235678899999999999999877655443333 No 168 >protein:vir:102655 Length: 322 # NCBI annotation: Hypothetical protein # Family: family:all:6384 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_052979;genbank:gi:50282923;genbank:GeneID:2948122 Probab=97.73 E-value=6.6e-06 Score=48.97 Aligned_cols=287 Identities=10% Similarity=0.016 Sum_probs=139.2 Q ss_pred hhhhhhcccc---ccccccccchhhhHHHHHHHHhh-hhHHhhcceeecCCCceEEEEEcCCCccceeccccc------- Q lcl|NC_021309. 146 AAIGQNPFGS---TGTFAPGILPTFLPGIVEQLFYE-LSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAG------- 214 (497) Q Consensus 146 ~~~~~~~~~~---~~~~g~~v~p~~~~~ii~~~~~~-~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~wv~Eg~------- 214 (497) +......++. ++.-...-+.+|..++.-.+.+. +-|++.++..+-.+++..+-...+ ..+.-++++. T Consensus 1 ~~~~~~~~~~~~Ms~~i~~~fv~qy~~~v~~~~qq~~s~L~~tV~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~d 78 (322) T protein:vir:10 1 MKLNAIMSMLPLIAGDIDQAFVQTYETTLRILSQQKSAKLKQYCQHKNESSESHNWETLAS--MDPDAVKRKRSRQQSAD 78 (322) T ss_pred CcccceeeeeeeeechhhhHHHHHHHHHHHHHHHHhhhhhhcccccccccccccceeeccc--ccccccccccccccccC Confidence 1111111110 11111122245555554444444 456665554443333322222111 1222222211 Q ss_pred ---ccccccccceeeEeeeeeEEeeehhhHHH-HhhHHHHHHHHHHHHHHHHHHHHHhhhhccc-Ccccccccccccccc Q lcl|NC_021309. 215 ---TYPFSSEEFARVYEQVGKVANALTITDEG-LRDAPELFNFVQGRLLEGIQRKEEVQLLAGG-GYPGVNGLLQRSTGF 289 (497) Q Consensus 215 ---~~~~s~~~f~~i~~~~~kla~~~~iS~el-l~d~~~l~~~i~~~la~~~~~~~d~~~l~G~-G~~~p~Gi~~~~~~~ 289 (497) +.|.....++..............|.+.- ++...+..+...+..+.+++++.|..|+.+- |... .|.-..+ T Consensus 79 ~~~dtp~~~~~~~~r~~~~~d~~~~~~VDd~D~~k~~~D~~~~~~~~~a~AL~R~~D~~I~~a~~g~a~-~~~~gt~--- 154 (322) T protein:vir:10 79 GTYPTPVNNKPFAKRRTNVDTYDTGHVVEQEDISQMLLDPNSALITSQAYAMARKTDDLIIAGAWKPAS-IKGTGQP--- 154 (322) T ss_pred cccCCCccccccceEEEeecccccceecchHHHHHhhcCchHHHHHHHHHHhhhHHHHHHHhhhhcccc-ccccccc--- Confidence 23433333444444333334445666553 4566788888888999999999999887532 1110 0000000 Q ss_pred ccccccchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhh Q lcl|NC_021309. 290 TASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTL 369 (497) Q Consensus 290 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 369 (497) +...+. ......+....++.++.+...+..+. T Consensus 155 -v~~~ss-----------------------------------------------~~i~~g~~g~t~~kl~~a~~~l~~~d 186 (322) T protein:vir:10 155 -VEFLAT-----------------------------------------------QEIGDGTKPISFDYVTEITERFLENE 186 (322) T ss_pred -cccCCC-----------------------------------------------cccccCccchhHHHHHHHHHHHHhcC Confidence 000000 00000011112233444444443333 Q ss_pred ccC--CceEEechhHHHHHHHHhhhc-CceeccCcccccccccccccccccccceEecCCCCcCc--------------- Q lcl|NC_021309. 370 FQT--PNAVVMNPRDWELLRLTKDAN-GQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGT--------------- 431 (497) Q Consensus 370 ~~~--~~~~~~n~~~~~~l~~lkd~~-G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~--------------- 431 (497) -.. ...++++|..|..|.....-. ..|+.... .+..+...+++|+.|+.++.+|... T Consensus 187 vp~d~~R~~vv~p~~~~~LL~d~~~ts~D~~~~~~-----l~~~G~ig~~lGf~~i~s~~lp~~~~t~~~~~~~~~~~~~ 261 (322) T protein:vir:10 187 IEPEVSKVIVIGPTQARKLLQITEATSADYTSAMD-----LQSKGIITNWMGYTWIVSTRLDKFDPTQWGMAAEDGPQGD 261 (322) T ss_pred CCCCCCeEEEeCHHHHHHHhcchhhhhhhcccchh-----hhhcCeeeeeeeEEEEEeccCCccccccccccccCCCCcc Confidence 221 235788899988877543322 22332111 0111124478999999999998321 Q ss_pred -eEEEeeccceEEEEeecccEEEeecccchhhhcCceEEEEEEeecceeecccceEEEEeeCCC Q lcl|NC_021309. 432 -ILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGA 494 (497) Q Consensus 432 -~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~~a 494 (497) ....-+.+.++.+....+++.+++.-.+. .+...+++..-+|..+++|+.|+.+.....- T Consensus 262 ~~~~~a~~k~Av~~a~~~dv~~~i~~~~~~---~~a~~I~~~~~~Ga~ri~~~gVv~i~~~e~~ 322 (322) T protein:vir:10 262 EIWCIAMTDMALGYHSCKDIWTKVAEDPSA---SFAWRIYSAFTADCVRVEDEHIFKLRLKNSL 322 (322) T ss_pred ceeEEEEecCceeEEEeeeeeEEeeccCCc---chhhhhhhhhhhCceEeccCcEEEEEEeccC Confidence 11123445566666666666666443332 2346678889999999999999999997777 No 169 >protein:vir:5255 Length: 304 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852760;genbank:gi:31544035;uniprot:Q7Y5U0;genbank:GeneID:2753552 Probab=97.50 E-value=2.2e-05 Score=46.08 Aligned_cols=282 Identities=12% Similarity=0.052 Sum_probs=146.0 Q ss_pred cccccccchh--hh-HHHHHHHHhhhhHHhhcceeecC---CCceEEEEEcCCCccce--eccc-ccccccccccceeeE Q lcl|NC_021309. 157 GTFAPGILPT--FL-PGIVEQLFYELSLADLISSRPVT---SPNLSYLTESAAHNNAA--AVAE-AGTYPFSSEEFARVY 227 (497) Q Consensus 157 ~~~g~~v~p~--~~-~~ii~~~~~~~~l~~~~~~~~~~---~~~~~~p~~~~~~~~a~--wv~E-g~~~~~s~~~f~~i~ 227 (497) -++..++..+ .+ +.|.+...+....+.++++.+.. -.++.+...+. .+.+. |++- ..+.|..+..+++-. T Consensus 1 ~~~lafl~~qL~~id~~vye~~~~~~~~~~lipv~t~~~~~~~~~~~~~~d~-~G~a~~~~i~~~a~dip~vd~~~~~~~ 79 (304) T protein:vir:52 1 MSLLAYVKNGLTAVSKDIAETKYPEIVFPQFVYVDQQTAVGITEKLHYGADE-HGSLDDGLITVGTSTLDQVEVGFTPTR 79 (304) T ss_pred CchHHHHHHHHHHHhhhhhccccccchhhhhccccCCCCcccceEEEeeeec-cCcccccccCCcCCccceeecccceeE Confidence 1222334433 22 34555555555556665553322 23556655443 24555 8765 467898899999999 Q ss_pred eeeeeEEeeehhhHHHHhhH----HHHHHHHHHHHHHHHHHHHHhhhhcccCc-cccccccccccccccccccchhhhhh Q lcl|NC_021309. 228 EQVGKVANALTITDEGLRDA----PELFNFVQGRLLEGIQRKEEVQLLAGGGY-PGVNGLLQRSTGFTASSASSLFGATS 302 (497) Q Consensus 228 ~~~~kla~~~~iS~ell~d~----~~l~~~i~~~la~~~~~~~d~~~l~G~G~-~~p~Gi~~~~~~~~~~~~~~~~~~~~ 302 (497) ...+.++.-+.+|.+=|+.+ .+|.+-=..-..+++...+|+..++|+-. ....|++|.+.+.......... T Consensus 80 ~~i~~~~~~~~y~~~El~~a~~~g~~l~~~ka~aa~~a~~~~~n~v~~~Gd~~~~g~~GllN~p~v~~~~~~~~~a---- 155 (304) T protein:vir:52 80 SYIVPWAKSVTWTKPELEQGKLLGLALNTAKIMALNKNAQQTLQKVAFLGHAKDSRLTGLLNNKSVEVYAIKGAAQ---- 155 (304) T ss_pred EEEEEEeeeeeecHHHHHHHHHhCCCcHHHHHHHHHHHHHhhhceEEEEeeccccceEEEEeCCCcceeeecCCcc---- Confidence 98888888888876555433 24666666667788999999999999753 3478999998765433221110 Q ss_pred hHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhc--cCCceEEech Q lcl|NC_021309. 303 ATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLF--QTPNAVVMNP 380 (497) Q Consensus 303 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~n~ 380 (497) +......+..++.+++..++..+..... ..++.+++.+ T Consensus 156 ----------------------------------------~~~w~~~T~~eI~~di~~~~~~i~~~s~~~~~p~tl~Lpp 195 (304) T protein:vir:52 156 ----------------------------------------NTKVQAMDFDKAVAFFKEIFLKGMEKTKRIEAPNTFAIDS 195 (304) T ss_pred ----------------------------------------CCccccCCHHHHHHHHHHHHHHHHhccCceecCceEEeCH Confidence 0011222344455565555555543332 3455678888 Q ss_pred hHHHHHHHHhhhc-CceeccCcccccccccccccccccccceEecCCCCcC--ceEEEeeccceEEEEeecccEEEeecc Q lcl|NC_021309. 381 RDWELLRLTKDAN-GQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLG--TILVGHFAPSVIQTARREGVTMQMTNS 457 (497) Q Consensus 381 ~~~~~l~~lkd~~-G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~--~~~~gd~~~~~~~i~~r~~~~i~~~~~ 457 (497) ..+..|....-++ |.=++.-...+... ....+-+|.++|--....-..+ ..++.+-+.-++.+- ..+.+.+... T Consensus 196 ~~~~~l~~~~~~~~~~Tvl~~l~~n~~~-~~g~~l~I~~v~~~~~~~g~~g~~r~vvY~~d~~~~~~~--vP~p~~~l~~ 272 (304) T protein:vir:52 196 LDLAHLALVQRANTDTTALEFLTKHLSA-AAGRQVAIKALPSNYGTRVTDGKTRAMVYVNSKEHVIFD--VPMSPTVLDA 272 (304) T ss_pred HHHHHHhhccCCCCCchHHHHHHHhccc-ccCCcceEEEecccccccCCCCceEEEEEecChhheEEe--cCccccccch Confidence 8887775432222 21111100000000 0000001222221111111111 134444444333331 1222332221 Q ss_pred cchhhhcCce--EEEEEEeecceee-cccceEEEEe Q lcl|NC_021309. 458 NGTDFVDGKV--TVRAEERLGLLVY-RPSAFQLIQL 490 (497) Q Consensus 458 ~~~~f~~~~v--~~r~~~r~~~~v~-~~~a~~~l~~ 490 (497) ..+|.. .+=++.|+++..+ +|.+|++++. T Consensus 273 ----q~~~~~~~~vp~~~r~gGv~v~~P~a~~y~D~ 304 (304) T protein:vir:52 273 ----QPKGLLAFESGLRMAFGGVTFMEPDSALYVDY 304 (304) T ss_pred ----hhcCCceEEecceeeeeeEEEEccceeeeecC Confidence 334543 3446788877655 5999999999 No 170 >protein:vir:101557 Length: 336 # NCBI annotation: gp12 # Family: family:all:1653 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958117;genbank:gi:41057663;genbank:GeneID:2716814 Probab=97.44 E-value=4.1e-05 Score=44.65 Aligned_cols=314 Identities=14% Similarity=0.093 Sum_probs=143.6 Q ss_pred hhHHHHhHhhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccccccccchhhh-----HHHH Q lcl|NC_021309. 98 MNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFL-----PGIV 172 (497) Q Consensus 98 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~p~~~-----~~ii 172 (497) ++ +.......++ ........... ...+ . ......+.......+++ +..-+|.+. +.++ T Consensus 1 ~~-~~~~~~~l~~---~gi~~~~~~~~-----~~~~----~--~~~~~da~d~~~~~~~~--~~~~i~~~l~~~i~p~~~ 63 (336) T protein:vir:10 1 MR-DAQRIQNLAR---AGVILPRSVQN-----VSTP----L--TEYAMDAADLSPHLSST--GSSGIPNYLTTYVDPAVI 63 (336) T ss_pred Cc-hHHHHHHHhh---cCeeecchhhh-----hhhh----H--HHhhhhhhhccCccccC--CCchhHHHHHhhccccee Confidence 00 0000000000 00000000000 0000 0 00000010111111112 223344444 4556 Q ss_pred HHHHhhhhHHhhcceeecCCC---ceEEEEEcCCCccceecccccccccccccceeeEeeeeeEEeeehhh-HHHHhhH- Q lcl|NC_021309. 173 EQLFYELSLADLISSRPVTSP---NLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTIT-DEGLRDA- 247 (497) Q Consensus 173 ~~~~~~~~l~~~~~~~~~~~~---~~~~p~~~~~~~~a~wv~Eg~~~~~s~~~f~~i~~~~~kla~~~~iS-~ell~d~- 247 (497) +.+.+......++++.+++.- .+.++.... .+.+.+.+-+.+.|.++......+-..+.++..+.++ .|+-+-. T Consensus 64 ~~~~~p~~a~~l~pv~t~g~W~~~~~~~~~~e~-~G~a~~ygd~~D~P~~d~~~~~~~~~v~~~~~g~~yg~~El~~A~~ 142 (336) T protein:vir:10 64 DILVAPMKAAELVGESKKGDWTTLVAAFITAEP-TTKVATYGDYSSDGDSGANINYPQRQSYFFQTWTRWGERELEMAGA 142 (336) T ss_pred eehhhhhhhhhhccccccCCccceeEEEeeeec-eeeEEEeeccCCCceeecccceeeeeEEEEEeeeeeCHHHHHHHHH Confidence 666777777777777665432 345555443 3567888888889999877777777788888888888 4454433 Q ss_pred --HHHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccchhhhhhhHHHHHHhhhhhcchhhhhhhhh Q lcl|NC_021309. 248 --PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTV 325 (497) Q Consensus 248 --~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 325 (497) .++.+--+.-.++++..++|.-.++|+...+.-|++|.+............ T Consensus 143 ~g~~l~~~Ka~aA~~ale~~~N~i~~~Gd~~~~~yGllN~P~l~a~~t~~t~~--------------------------- 195 (336) T protein:vir:10 143 GRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPW--------------------------- 195 (336) T ss_pred hCCCcHHHHHHHHHHHHHHhhCcEEEEeccccceEEEEeCCCCccccccCCCc--------------------------- Confidence 367788888888999999999999999888889999987654211111000 Q ss_pred hhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhc-----cCCceEEechhHHHHHHHHhhhcCceeccC Q lcl|NC_021309. 326 ASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLF-----QTPNAVVMNPRDWELLRLTKDANGQYMGGN 400 (497) Q Consensus 326 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~n~~~~~~l~~lkd~~G~~i~~~ 400 (497) ....+...+.+++..++..+..... ..+..+++-+.-+..|.. .+..|.-++.- T Consensus 196 --------------------~~~~t~eei~~Di~~~~~~l~~qs~G~i~~~~~~tL~LP~~~~~~Ls~-~n~~g~Tvl~~ 254 (336) T protein:vir:10 196 --------------------SGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSK-TNQYGLAAAAK 254 (336) T ss_pred --------------------ccccCHHHHHHHHHHHHHHHHHhcCCeecccCcceEEecHHHHHhccC-CCccCccHHHH Confidence 0001112334444444444443222 224445555555444422 12222111110 Q ss_pred cccccccccccccccccccceEecCCCCcCceEEEeeccceEEEEee-cccEEEeecc---cchhhhcCceEEEEEEeec Q lcl|NC_021309. 401 FFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARR-EGVTMQMTNS---NGTDFVDGKVTVRAEERLG 476 (497) Q Consensus 401 ~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~~~~gd~~~~~~~i~~r-~~~~i~~~~~---~~~~f~~~~v~~r~~~r~~ 476 (497) .. .+..++.++..+.+.... |+..+..+.-.+. ....+.+... ..--...-.+.+-+..|.+ T Consensus 255 lk-----------~n~Pnl~i~t~pEl~~a~---G~~~~l~~~~~~~~~t~~~~~p~~~~~l~vq~~~~~~~v~~~~rt~ 320 (336) T protein:vir:10 255 LK-----------DIFPKLEFVTIPEYDTAS---GRLVQLWAPRVEGKDTATCGFTEKMRAHSIERYSSYFRQKKSAGTW 320 (336) T ss_pred HH-----------HhcCccEEEEccccccCC---CceEEEEEEecCCCcceeeecchhhhccceeecCceeEecccccee Confidence 00 011122233333322111 1111101100000 0011111100 0000011135566778888 Q ss_pred ceee-cccceEEEEee Q lcl|NC_021309. 477 LLVY-RPSAFQLIQLK 491 (497) Q Consensus 477 ~~v~-~~~a~~~l~~~ 491 (497) |.++ +|.||++++.. T Consensus 321 Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:10 321 GAVIFRPFAVAQMIGV 336 (336) T ss_pred eeeeeccchheeeecC Confidence 7776 59999999888 No 171 >protein:vir:3643 Length: 336 # NCBI annotation: gp12 # Family: family:all:1653 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705638;genbank:gi:23752323;genbank:GeneID:955719 Probab=97.31 E-value=5.5e-05 Score=43.94 Aligned_cols=314 Identities=13% Similarity=0.076 Sum_probs=143.2 Q ss_pred hhHHHHhHhhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccccccccchhhhH-----HHH Q lcl|NC_021309. 98 MNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLP-----GIV 172 (497) Q Consensus 98 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~p~~~~-----~ii 172 (497) ++ +.......++ ......... .....+ .......+.......+++ +..-+|.++. .++ T Consensus 1 ~~-~~~~~~~l~~---~gi~~~~~~-----~~~~~~------~~~~~~da~d~~~~~~~~--~~~~~~~~l~~~i~p~~~ 63 (336) T protein:vir:36 1 MR-DAQRIQNLAR---AGVILPRSV-----QNVSTP------LTEYAMDAADLSPHLSST--GSSGIPNYLTTYVDPSVI 63 (336) T ss_pred Cc-hHHHHHHHhh---cCeeecchh-----hhhhhH------HHHhhhhhhhccCccccC--CCcchHHHHHHhhccceE Confidence 00 0000000000 000000000 000000 000000000111111111 1222444444 455 Q ss_pred HHHHhhhhHHhhcceeecCCC---ceEEEEEcCCCccceecccccccccccccceeeEeeeeeEEeeehhh-HHHHhhH- Q lcl|NC_021309. 173 EQLFYELSLADLISSRPVTSP---NLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTIT-DEGLRDA- 247 (497) Q Consensus 173 ~~~~~~~~l~~~~~~~~~~~~---~~~~p~~~~~~~~a~wv~Eg~~~~~s~~~f~~i~~~~~kla~~~~iS-~ell~d~- 247 (497) +.+.+......++++.+.+.- .+.++.... .+.+.+.+-+.+.|.++......+-..+.++..+.++ .|+.+-+ T Consensus 64 ~~~~~~~~~~~l~pv~t~g~W~~~~~~~~~~e~-~G~a~~ygd~~D~P~~d~~~~~~~~~v~~~~~g~~yg~~E~~~Aa~ 142 (336) T protein:vir:36 64 DILVAPMKAAELVGESKKGDWTTLVAAFITAEP-TTKVATYGDYSSDGDSGANINYPQRQSYFFQTWTRWGERELEMAGA 142 (336) T ss_pred eeecchhhhhhhccccccCCccceeEEEeeeec-eeeEEEeeccCCCceeecccceeeeeEEEEEeeeeeCHHHHHHHHH Confidence 666666667777777665432 345555443 3567888888888999877777777788888888887 5555433 Q ss_pred --HHHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccchhhhhhhHHHHHHhhhhhcchhhhhhhhh Q lcl|NC_021309. 248 --PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTV 325 (497) Q Consensus 248 --~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 325 (497) .++.+--+.-.++++..++|.-.++|+...+.-|++|.+............ T Consensus 143 ~~~~l~~~Ka~aA~~ale~~~N~i~~~Gd~~~~~yGllNdP~l~a~~t~~t~~--------------------------- 195 (336) T protein:vir:36 143 GRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPW--------------------------- 195 (336) T ss_pred hCCCcHHHHHHHHHHHHHHhhCcEEEEeccccceEEEEecCCCccccccCCCc--------------------------- Confidence 357778888888999999999999999888889999987654211111000 Q ss_pred hhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhc-----cCCceEEechhHHHHHHHHhhhcCceeccC Q lcl|NC_021309. 326 ASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLF-----QTPNAVVMNPRDWELLRLTKDANGQYMGGN 400 (497) Q Consensus 326 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~n~~~~~~l~~lkd~~G~~i~~~ 400 (497) ....+...+.+++..++..+..... ..+..+++-+.-+..|.. .+..|.-++.- T Consensus 196 --------------------~~~~t~~ei~~Di~~~~~~l~~qt~G~i~~~~~~tL~LP~~~~~~Ls~-~n~~g~Tvl~~ 254 (336) T protein:vir:36 196 --------------------SGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSK-TNQYGLAAAAK 254 (336) T ss_pred --------------------ccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccEEEechHHHHhccC-CCccCccHHHH Confidence 0001122334445544444444332 224445555555444432 12222111110 Q ss_pred cccccccccccccccccccceEecCCCCcCceEEEeeccceEEEEee-cccEEEeecc---cchhhhcCceEEEEEEeec Q lcl|NC_021309. 401 FFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARR-EGVTMQMTNS---NGTDFVDGKVTVRAEERLG 476 (497) Q Consensus 401 ~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~~~~gd~~~~~~~i~~r-~~~~i~~~~~---~~~~f~~~~v~~r~~~r~~ 476 (497) .. .+..++.++..+.+.... |+..+..+.-.+. ....+.+... ..--...-.+.+-+..|.+ T Consensus 255 lk-----------~n~Pnl~i~t~pEl~~a~---g~~~~l~~~~~~~~~t~~~~~p~~~~~l~vq~~~~~~~v~~~~rt~ 320 (336) T protein:vir:36 255 LK-----------DIFPKLEFVTIPEYDTAS---GRLVQLWAPRVEGKDTATCGFTEKMRAHSIERYSSYFRQKKSAGTW 320 (336) T ss_pred HH-----------HhcCccEEEEccccccCC---CceEEEEEEecCCCcceeeecchhhhccceeecCceeEecccccee Confidence 00 011122233333322111 1111111100110 0011111100 0000011135566778888 Q ss_pred ceee-cccceEEEEee Q lcl|NC_021309. 477 LLVY-RPSAFQLIQLK 491 (497) Q Consensus 477 ~~v~-~~~a~~~l~~~ 491 (497) |.++ +|.||++++.. T Consensus 321 Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:36 321 GAVIFRPFAVAQMIGV 336 (336) T ss_pred eeeeeccchheeeecC Confidence 7776 59999999888 No 172 >protein:vir:78558 Length: 336 # NCBI annotation: major capsid protein # Family: family:all:1653 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294848;genbank:gi:149882911;genbank:GeneID:5291029 Probab=97.26 E-value=8.8e-05 Score=42.81 Aligned_cols=314 Identities=14% Similarity=0.091 Sum_probs=147.1 Q ss_pred hHHHHhHhhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccccccccchhhh-----HHHHH Q lcl|NC_021309. 99 NPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFL-----PGIVE 173 (497) Q Consensus 99 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~p~~~-----~~ii~ 173 (497) ..+.+......+ ......... .....+.... ...+.......++++ ..-+|.++ +.+++ T Consensus 1 ~~~~~~~~~l~~---~gi~~~~~~-----~~~~~~~~~~------a~da~d~~~~~~t~~--~~g~~~~l~~~i~p~~~~ 64 (336) T protein:vir:78 1 MRDAQRIQNLAR---AGVILPRSV-----KNVSTPLAEY------AMDAADLSPHLSSTG--SSGIPNYLTTYVDPSVID 64 (336) T ss_pred CchHHHHHHHhc---cCeecchhh-----hhhhHHHHHH------HHhhhhhccccccCC--CcchHHHHHHhcccceee Confidence 000000000000 000000000 0000000000 000000111111121 22244443 45566 Q ss_pred HHHhhhhHHhhcceeecCC---CceEEEEEcCCCccceecccccccccccccceeeEeeeeeEEeeehhhHHHHhhH--- Q lcl|NC_021309. 174 QLFYELSLADLISSRPVTS---PNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTITDEGLRDA--- 247 (497) Q Consensus 174 ~~~~~~~l~~~~~~~~~~~---~~~~~p~~~~~~~~a~wv~Eg~~~~~s~~~f~~i~~~~~kla~~~~iS~ell~d~--- 247 (497) .+.+......++++.+++. ..+.|+.... .+.+.+.+-+.+.|..+...+..+-..+.++..+.++.+=+..+ T Consensus 65 ~~~~~~~~~~l~~v~t~g~W~~~~~~~~~~e~-~G~a~~ygd~~D~P~vd~~~~~~~~~v~~~~~g~~yg~~El~~A~~~ 143 (336) T protein:vir:78 65 ILVAPMKAAELVGESKKGDWTTLVAAFITAEP-TTTVATYGDYSSDGDSGTNINYPQRQSYFFQTWTRWGERELEMAGAG 143 (336) T ss_pred ehhhhhhhhhhcccccCCCccccEEEEeeeec-ceeeEEeecccCCCeeecceeeEEEEEEEEEeeeeecHHHHHHHHHh Confidence 6666667777777766543 2456666554 46778888888999999999999999999999898985544322 Q ss_pred -HHHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccchhhhhhhHHHHHHhhhhhcchhhhhhhhhh Q lcl|NC_021309. 248 -PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVA 326 (497) Q Consensus 248 -~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 326 (497) .++.+--+.-.++++..+++.-.++|+...+..|++|.+........... T Consensus 144 g~~l~~~Ka~aA~~ale~~~N~~~~~Gd~~~~~~GllN~P~l~a~~t~~~~----------------------------- 194 (336) T protein:vir:78 144 RVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTP----------------------------- 194 (336) T ss_pred CCCcHHHHHHHHHHHHHHhhCeEEEEeccccceEEEEeCCCCCcccccCcC----------------------------- Confidence 36778888888899999999999999988889999998765422211100 Q ss_pred hhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhc-----cCCceEEechhHHHHHHHHhhhcCceeccCc Q lcl|NC_021309. 327 SLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLF-----QTPNAVVMNPRDWELLRLTKDANGQYMGGNF 401 (497) Q Consensus 327 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~n~~~~~~l~~lkd~~G~~i~~~~ 401 (497) .....+...+.+++..++..+..... ..+..+++-+.-+..|.. .+..|--++.-. T Consensus 195 ------------------~w~~~T~~~I~~Di~~~~~~l~~qt~g~~~~~~~~tL~Lp~~~~~~L~~-~n~~g~tv~~~l 255 (336) T protein:vir:78 195 ------------------WSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSK-TNQYGLSAAAKL 255 (336) T ss_pred ------------------cccccCHHHHHHHHHHHHHHHHHhcCCeeeeccceEEEechHHHHhccC-CCccCccHHHHH Confidence 00111223344555555554443332 112345555555555532 122221111100 Q ss_pred ccccccccccccccccccceEecCCCCcCceEEEeeccceEEEEe-ecccEEEeecc---cchhhhcCceEEEEEEeecc Q lcl|NC_021309. 402 FGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTAR-REGVTMQMTNS---NGTDFVDGKVTVRAEERLGL 477 (497) Q Consensus 402 ~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~~~~gd~~~~~~~i~~-r~~~~i~~~~~---~~~~f~~~~v~~r~~~r~~~ 477 (497) . .+..++.++..+.+.... |+..+.++.-.+ ....++.+... ..--...-.+.+-+..|.+| T Consensus 256 k-----------~n~Pnl~i~t~pel~~Ag---g~~~~~~~~~~~~~~t~~~~~p~~f~~lpvq~~~~~~~v~~~~rt~G 321 (336) T protein:vir:78 256 K-----------EIFPKLEFVTIPEYDTAS---GRLVQLWAPRVEGKDTATCGFTEKMRAHSIERYSSYFRQKKSAGTWG 321 (336) T ss_pred H-----------HhcCccEEEEcccccccC---cceEEEEEeeccCCcceeeecchhhhccceeecCceeEeccccceee Confidence 0 011122233333332111 111110000000 00111111100 00000112355567778877 Q ss_pred eee-cccceEEEEee Q lcl|NC_021309. 478 LVY-RPSAFQLIQLK 491 (497) Q Consensus 478 ~v~-~~~a~~~l~~~ 491 (497) .++ +|.||++++.. T Consensus 322 v~i~~P~ai~~~~GI 336 (336) T protein:vir:78 322 AVIFRPFAVAQMIGV 336 (336) T ss_pred eeeeccchheeeccC Confidence 766 59999999888 No 173 >protein:vir:94070 Length: 339 # NCBI annotation: putative structural protein # Family: family:all:1653 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453625;genbank:gi:84662661;genbank:GeneID:5142580 Probab=97.06 E-value=0.00019 Score=41.00 Aligned_cols=316 Identities=13% Similarity=0.086 Sum_probs=144.2 Q ss_pred hhhhhhHHHHhHhhhhh-hhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccccccccc---hhhh- Q lcl|NC_021309. 94 RAVIMNPELKNATSFEK-GTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGIL---PTFL- 168 (497) Q Consensus 94 ~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~---p~~~- 168 (497) .......+ .....+. +-.+. .. ...... ... . .....+........+... ..|| .+++ T Consensus 1 ~~~~~~~~--~~~~l~~~g~~~~-~~--~~~~~~-----~~~-----~-~~a~d~~~~~~~~~~~~~-~~i~a~~~~~i~ 63 (339) T protein:vir:94 1 MSINNDRT--DIKQLEKVGIIFD-GY--SPKSIS-----SEV-----S-AYAMDAVNLTPTLQTTAN-AGIPAWMTTFVD 63 (339) T ss_pred CceechHH--HHHHHHhhceeec-cc--hhhhcc-----hhh-----H-hhhccccccccccccccc-cchhhhhhhhhc Confidence 00000000 0000000 00000 00 000000 000 0 000000000000111111 1122 2333 Q ss_pred HHHHHHHHhhhhHHhhcceeecCC---CceEEEEEcCCCccceeccccccccccc--ccceeeEeeeeeEEeeehhhHHH Q lcl|NC_021309. 169 PGIVEQLFYELSLADLISSRPVTS---PNLSYLTESAAHNNAAAVAEAGTYPFSS--EEFARVYEQVGKVANALTITDEG 243 (497) Q Consensus 169 ~~ii~~~~~~~~l~~~~~~~~~~~---~~~~~p~~~~~~~~a~wv~Eg~~~~~s~--~~f~~i~~~~~kla~~~~iS~el 243 (497) +.+++.+.+....+.++++.+.+. ..+.|+..+. .+.|.|.+.+.+.|..+ .+|.+.++....++-... ..|+ T Consensus 64 ~~vy~~~~~~~~~~~l~pv~t~g~w~~~t~~y~~~e~-~G~a~~ygd~ad~Pl~~~~v~~~~~~v~~~~~g~~y~-~~E~ 141 (339) T protein:vir:94 64 RRVIDIQLAPMAAAKIFPEVKKGDWTTTYGVFIIAEP-VGQVATYSDWSANGMSKANVNFESRQNYRYQTWTEYG-DLEM 141 (339) T ss_pred hhheeecccccchhhhcccccCCCCcccEEEEeeeec-ccceEEcccccCCCcccccceeeEEeEEEEEEEEeec-HHHH Confidence 456677788888888888877654 3577877665 46788999988888765 556666665555444332 3344 Q ss_pred HhhH---HHHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccchhhhhhhHHHHHHhhhhhcchhhh Q lcl|NC_021309. 244 LRDA---PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFV 320 (497) Q Consensus 244 l~d~---~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 320 (497) -+-. .++.+--....++++...+|+-.++|+...+..|++|.+......... T Consensus 142 ~~A~~~g~~l~~~Ka~aA~~al~~~~N~i~~~Gd~~~~~~GLlN~P~l~~~v~~s------------------------- 196 (339) T protein:vir:94 142 ATYGEAGIDYVARQEISASLVMAKFANSSYLLGVAGIANYGLMNDPSLPAPVAAT------------------------- 196 (339) T ss_pred HHHHhhCCChHHHHHHHHHHHHHHhhceEEeeeecccceEEEEeCCCccccccCC------------------------- Confidence 3322 367788888888899999999999998777789999987654321110 Q ss_pred hhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhc-----cCCceEEechhHHHHHHHHhhhcCc Q lcl|NC_021309. 321 GQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLF-----QTPNAVVMNPRDWELLRLTKDANGQ 395 (497) Q Consensus 321 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~n~~~~~~l~~lkd~~G~ 395 (497) ......+...+.+++..++..+..... ..+..+++.+..+..|... +..|. T Consensus 197 -----------------------~~Wa~kT~~eI~~Di~~~~~~l~~~s~g~~~~~~~~~L~LP~~~~~~L~~~-n~~~~ 252 (339) T protein:vir:94 197 -----------------------VNWATAAPEDIANDVVAMVGRLISQSGGLITGQERMVMALAPSALNNVNRT-NNFGL 252 (339) T ss_pred -----------------------CCcccCCHHHHHHHHHHHHHHHHHhcCCeeeeccCcEEEecHHHHHhcccC-CcCCc Confidence 011122344455666666665544432 1233466666666655432 22222 Q ss_pred eeccCcccccccccccccccccccceEecCCCCcCceEEEeeccceEE-EEeecccEEEeecc---cchhhhcCceEEEE Q lcl|NC_021309. 396 YMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQ-TARREGVTMQMTNS---NGTDFVDGKVTVRA 471 (497) Q Consensus 396 ~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~~~~gd~~~~~~~-i~~r~~~~i~~~~~---~~~~f~~~~v~~r~ 471 (497) -++.-.. ....++.++..+.+.... |+..+.++. +.+.....+.+... ..-....-.+.+-+ T Consensus 253 Tvl~~lk-----------~n~pnl~i~~~~el~~a~---g~~~~~~~~~~~~~~~~~~~~p~~~~~lpvq~~~~~~~v~~ 318 (339) T protein:vir:94 253 SAGAKIA-----------QTYPNIQFVAVPEFDTAS---GRLVQLWVPEVNGQPTGEVAFAEKLRSHSIERYSTTTRQKH 318 (339) T ss_pred cHHHHHH-----------HhcCCcEEEEccccccCC---CceEEEEEEeccCCcceEEEcchhhhccccEEcCceEEecc Confidence 1111000 012233344433332111 111000000 00001111111100 00000111355667 Q ss_pred EEeecceee-cccceEEEEee Q lcl|NC_021309. 472 EERLGLLVY-RPSAFQLIQLK 491 (497) Q Consensus 472 ~~r~~~~v~-~~~a~~~l~~~ 491 (497) ..|.+|.++ +|.||++++.- T Consensus 319 ~~rt~Gv~i~~P~ai~~~~GI 339 (339) T protein:vir:94 319 SGATFGAVIYQPWAVTQELGV 339 (339) T ss_pred eeeeeeEEEEccceeeeeecC Confidence 788666555 69999999988 No 174 >protein:vir:107732 Length: 379 # NCBI annotation: gp23 # Family: family:all:1653 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024871;genbank:gi:48697513;genbank:GeneID:2948349 Probab=97.05 E-value=3.6e-05 Score=44.93 Aligned_cols=337 Identities=14% Similarity=0.066 Sum_probs=144.5 Q ss_pred HHHHHHHHHHHHHHHHHhhhhhhHHHHhHhhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhh-hhhhhhh---- Q lcl|NC_021309. 77 IPEVEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETA-PAAIGQN---- 151 (497) Q Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~---- 151 (497) ++... ....+....... . ......+... ...+. .........+........ ..+.... T Consensus 1 ~~~~~-----~~~~~~~~~~~~-----~-~~~~~~~~~~--~~~~~----l~~~gi~~~~~~~~~~~~~~~amd~~~~~~ 63 (379) T protein:vir:10 1 MPQIS-----KIHSSLNARQMT-----Q-MVMDSADVTL--DNLKH----LESYGIHLNGRKNKLFELMQFAMDSNDIGP 63 (379) T ss_pred CCCcc-----eeeeecCccccc-----h-hhhccccccH--HHHHH----HHhcCccccchhhhhhhhhhhhhccccccc Confidence 00000 000000000000 0 0000000000 00000 000000000000000000 0000000 Q ss_pred -----ccccccccccccch----hhhHHHHHHHHhhhhHHhhcceeecCCC---ceEEEEEcCCCccceecccccccccc Q lcl|NC_021309. 152 -----PFGSTGTFAPGILP----TFLPGIVEQLFYELSLADLISSRPVTSP---NLSYLTESAAHNNAAAVAEAGTYPFS 219 (497) Q Consensus 152 -----~~~~~~~~g~~v~p----~~~~~ii~~~~~~~~l~~~~~~~~~~~~---~~~~p~~~~~~~~a~wv~Eg~~~~~s 219 (497) ...++. +..-+| .+++.+|+.+.......+++++.+.+.- .+.++.... .+.+.+.+-+.+.|.. T Consensus 64 ~~~~~~~l~~~--~~~g~~~~l~~~~p~~i~~~tap~~a~~l~pv~t~g~W~~~~~~~~v~e~-~G~A~~ygd~~d~pl~ 140 (379) T protein:vir:10 64 IPTPLSPLSPV--SIPGLIQFLQNWLPGHVRILTAVREADEFLGLSTVGQWDDEQIVQRVLEG-LGTAQPYTDGGNMALM 140 (379) T ss_pred cccccCccccc--cccchHHHHHhhcchHHHHHhhhhhhhhhcccccCCCceeeeEEEeeeee-eeeeEEeccccCCCee Confidence 000111 111123 3556788888888888888888775432 445555544 3577888888888887 Q ss_pred cccceeeEeeeeeEEeeehhhHH-HHhhH---HHHHHHHHHHHHHHHHHHHHhhhhcccCc--ccccccccccccccccc Q lcl|NC_021309. 220 SEEFARVYEQVGKVANALTITDE-GLRDA---PELFNFVQGRLLEGIQRKEEVQLLAGGGY--PGVNGLLQRSTGFTASS 293 (497) Q Consensus 220 ~~~f~~i~~~~~kla~~~~iS~e-ll~d~---~~l~~~i~~~la~~~~~~~d~~~l~G~G~--~~p~Gi~~~~~~~~~~~ 293 (497) +...+..+-..+.+...+.++.+ +.+-. .+|.+--..-.++++...+|+-.++|.+. .+..|++|.+.+..... T Consensus 141 d~~~~~~~r~v~~~~~g~~yg~~El~~Aa~~g~~l~~~Ka~aA~~ale~~~N~i~f~G~~d~~~~~yGllNdP~l~a~~t 220 (379) T protein:vir:10 141 SWTPTFETRTVVRFEAGLQVAPLEEARSSRVQVSSADEKRAMVGEALEVQRNRVAFYGYNDGSGRTFGFLNDPNLPAYVA 220 (379) T ss_pred eeeeeeeeeeeEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCCCcceEEEEeCCCCccccc Confidence 76666666666667766677644 43322 36888888889999999999999999643 35679999886543211 Q ss_pred ccchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhcc-- Q lcl|NC_021309. 294 ASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQ-- 371 (497) Q Consensus 294 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-- 371 (497) ... +.++.......+...+.+++..++..+...... T Consensus 221 ~at------------------------------------------g~~~~t~Wa~kT~~eI~~Di~~~~~~l~~qs~g~~ 258 (379) T protein:vir:10 221 VPN------------------------------------------GAGGSPLWAQKTTLEIIADLRNGLTALQVQSMGRI 258 (379) T ss_pred ccC------------------------------------------CcccccccccCCHHHHHHHHHHHHHHHHHhhCCee Confidence 100 111112223334455566666666654433221 Q ss_pred ----CCceEEechhHHHHHHHHhhhcCceeccCcccccccccccccccccccceEecCCCCcCceEEEeeccceEEEEee Q lcl|NC_021309. 372 ----TPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARR 447 (497) Q Consensus 372 ----~~~~~~~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~~~~gd~~~~~~~i~~r 447 (497) .+..+++.+.-+..|... +..|.-++.-. ..+..++.++..+.+.... |. +...+.+.++ T Consensus 259 ~~~~~~~tL~LP~~~~~~L~~~-n~~g~Tvl~~l-----------k~n~Pnl~i~t~pEL~~ag---gg-~~~~~~~~~~ 322 (379) T protein:vir:10 259 KSNKTPITIGIPNAYENYITTP-TELGYSVAQYM-----------RESYPNVTFVSAPELNDAN---GG-SSAIYYYADA 322 (379) T ss_pred cccccceeEEecHHHHHhhccc-cccCccHHHHH-----------HHhcCCcEEEEcccccccC---CC-ccEEEEEeec Confidence 122355556655555422 11121111100 0012233344333332110 00 0111222211 Q ss_pred -cccEE-------Eeeccc----chhhhcCceEEEEEEeecceee-cccceEEEEee Q lcl|NC_021309. 448 -EGVTM-------QMTNSN----GTDFVDGKVTVRAEERLGLLVY-RPSAFQLIQLK 491 (497) Q Consensus 448 -~~~~i-------~~~~~~----~~~f~~~~v~~r~~~r~~~~v~-~~~a~~~l~~~ 491 (497) .+... ..-++. .-....-....-+..|.+|.++ +|.||++++.. T Consensus 323 ~~~~~t~~~~~~~~~~p~k~~~l~ve~~~~~~~~~~~~rt~Gv~ir~P~Ai~~~~G~ 379 (379) T protein:vir:10 323 VENNGTDDGRTWLQVVPTKMFTLGVEKKIKGYAEGYTNATAGAMLKRPFATYRQTGA 379 (379) T ss_pred cCCCccCCcceEEEecchhhhhccceecCceeEeccccceeeeeeecchhhheecCC Confidence 11100 000100 0000011234455667666665 59999999887 No 175 >protein:vir:95512 Length: 693 # NCBI annotation: Putative Clp protease # Family: family:all:62 # ACLAME annotation(s): go:0008236 - serine-type peptidase activity; phi:0000017 - phage prohead/capsid assembly # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293349;genbank:gi:148912770;genbank:GeneID:5228164 Probab=96.98 E-value=0.00023 Score=40.58 Aligned_cols=428 Identities=12% Similarity=0.068 Sum_probs=142.6 Q ss_pred CchHHHHHHHHHHHH------------------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhHHHHH Q lcl|NC_021309. 1 MPSTAQLEAQGRQLA------------------------KSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERA 56 (497) Q Consensus 1 ~~~~a~~~~~~~~~~------------------------~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~e~~ 56 (497) ||...+.....+... ....+.++....+..+.++.+..+...++....++.. +.. T Consensus 220 ~p~~l~~~~~~~~~~p~~~~~~PaPTPaaaaPaaP~aaap~~adirA~~~aae~~r~aaI~a~fa~f~~~~a~l~a-~~l 298 (693) T protein:vir:95 220 MPEALKTLLAPRAQTPAAPANTPAPTPASAAPAAPVAAAPTEADIRARILAEESGRRSAITAAFGAFSTGHAELLA-TCL 298 (693) T ss_pred hHHHHHHHHhhhcccccccccCcccCccCCCCCCCccCCCCcchhhHHHHHHHHHHHHHHHHHHHhccCChHHHHH-HHH Confidence 332111100000000 0000111111111112222222222221110000100 000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHH--HHHhhhhhhHHHHhHhhhhhhhhhhhhHH-HH-HHhhhHHHHH Q lcl|NC_021309. 57 QEMLKSLGGADAAKDGLDNDIPEVEVRNLK-QIRK--HLARAVIMNPELKNATSFEKGTKFDVSFN-VS-AKAADPGTAA 131 (497) Q Consensus 57 ~e~~~~~~~~~a~~~~~~~~~~~~~~~~~~-~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~-~~~~~~~~~~ 131 (497) .+..-.++++ .+.+-+.+......... .... ............+++.-...+.......+ .. ....+..+.. T Consensus 299 ~d~~~s~d~a---r~~lL~~l~~~~~p~~~~~~~~~~~~~~g~~~~d~~~~al~~R~g~~~~~~~n~~~g~~L~elAr~~ 375 (693) T protein:vir:95 299 NDMNITVDQA---REKLLAAIGADTQPAAALSAGAHIHAGNGNLVGDSVRASVLARIGRGERQADNAYNGMTLRELARAS 375 (693) T ss_pred hhcCCCHHHH---HHHHHHHHhhccCCCCCcCcCccccCCchhHHHHHHHHHHHHhcCcccccCCccccCCcHHHHHHHH Confidence 0000001111 11111111100000000 0000 00000000000000000000000000000 00 0001111111 Q ss_pred HHHHHHHHhhhhhhhhhhhhccccccccccccchhhhH-HHHHHHHhh-hhHHhhcceeecCC-CceEEEEEcCCCccce Q lcl|NC_021309. 132 AELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLP-GIVEQLFYE-LSLADLISSRPVTS-PNLSYLTESAAHNNAA 208 (497) Q Consensus 132 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~p~~~~-~ii~~~~~~-~~l~~~~~~~~~~~-~~~~~p~~~~~~~~a~ 208 (497) ...++.................-+++.++ .|--.... .+.+-.... .+....|...+++- ...+..+. +.-+... T Consensus 376 L~~rg~~~~~~~~~~~~~~a~~htTSDFp-~IL~~~~nk~l~~~y~~a~~t~~~~~~~~~~~DFk~~~~~~l-g~~~~L~ 453 (693) T protein:vir:95 376 LVDRGIGVASLNAPQMVGLAFTHTSSDFG-LILLDVANKSVLAGWEEAEETFPLWTKSGILTDFKPARRVGL-GEFSSLR 453 (693) T ss_pred HHhcCCccCCCCHHHHHHHHHhcCcchhH-HHHHHHHHHHHHHHHHhhhhHHHHHhccCCCCcccccceeec-CCCCChh Confidence 21222211111111112222222333332 33333232 233333222 34555566555543 12222232 3335566 Q ss_pred ecccccccccccccceeeEeeeeeEEeeehhhHHHH-hhHHHHHHHHHHHHHHHHHHHHHhhhh---cccCccc-ccccc Q lcl|NC_021309. 209 AVAEAGTYPFSSEEFARVYEQVGKVANALTITDEGL-RDAPELFNFVQGRLLEGIQRKEEVQLL---AGGGYPG-VNGLL 283 (497) Q Consensus 209 wv~Eg~~~~~s~~~f~~i~~~~~kla~~~~iS~ell-~d~~~l~~~i~~~la~~~~~~~d~~~l---~G~G~~~-p~Gi~ 283 (497) -|.|++...-....=..-++...+++.++.||||++ .|...+.+-|-..+.++.++.+++.+. .++..-. -+.+| T Consensus 454 ~V~E~gEyk~~t~~e~~e~~~l~tyG~~~~iTRqaiINDDLga~~~ip~~~g~aA~~~~~~~vy~~L~~Np~m~DGk~LF 533 (693) T protein:vir:95 454 QVREGAEYKYVTLGERGEQIILATYGELFSITRQAIINDDLQMLSDIPFKLGQAAKATIGDLVYAVLTGNPAMSDGKTLF 533 (693) T ss_pred hcCCCCceeeeecCCccceeehhhcCCeeeecHHhhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHhcCccccCCccee Confidence 788888876655544456778899999999999986 677777777778888888888776543 2221100 01111 Q ss_pred ccccccccccccchhhhhhhHHHHHHhhhhhcchhhhhh-hhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHH Q lcl|NC_021309. 284 QRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQ-DTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAF 362 (497) Q Consensus 284 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 362 (497) ... ...... .......+... .....+...+.... ...+ T Consensus 534 had-H~Nl~t------------------ga~sals~~sl~~a~~am~~qk~~~~--~~~g-------------------- 572 (693) T protein:vir:95 534 HAD-HSNLLT------------------GAASALSIDSLSKAKTQMATQKAQVE--KGKG-------------------- 572 (693) T ss_pred ecc-cccccc------------------ccccccChHHHHHHHHHHHHhhcchh--ccCC-------------------- Confidence 110 000000 00000000000 00000000000000 0000 Q ss_pred HhhhhhhccCCceEEechhHHHHHHHHhhhcCceeccCcccccccccccccccccc-cceEecCCCCc--Cce--EEEee Q lcl|NC_021309. 363 VDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWG-VPVVTTPLIPL--GTI--LVGHF 437 (497) Q Consensus 363 ~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G-~Pvv~~~~~~~--~~~--~~gd~ 437 (497) ......|..|+..+.-....+.+-.+.--+ .-+...+..+| +.| ..||+++.+.. ++. ++.|- T Consensus 573 ----~~L~i~P~~llvP~~le~~a~~l~~s~~~~--~a~~~~~~~NP------~~~~~~vi~~prL~~~s~~~Wyl~a~~ 640 (693) T protein:vir:95 573 ----RTLNIRPGFVLTPVALEDKANQIINSESVP--GADVNSGIVNP------IRAFAQVIGEPRLDDASATAWYMAAKK 640 (693) T ss_pred ----ceeecccceEEecchHHHHHHHHhcccccc--ccccccccccc------hhccccccccceecCCCCCceEEecCC Confidence 012234555666666555555544332111 00001111111 223 24555666642 222 22222 Q ss_pred ccceEE---EEeecccEEEeecccchhhhcCceEEEEEEeecceeecccceEEEEee Q lcl|NC_021309. 438 APSVIQ---TARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLK 491 (497) Q Consensus 438 ~~~~~~---i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~~ 491 (497) ..-.+. +.-.++..|+. .. .|..|-+.+++...+|.+++|--.+++=... T Consensus 641 ~~dtie~~yL~G~~~P~ie~--~~--gf~~dG~~~kvr~D~G~~~iD~Rg~~kn~GA 693 (693) T protein:vir:95 641 GSDTIEVAYLDGVDTPYLEQ--QE--GFTVDGVASKVRIDAGVAPLDFRGLQKSNGA 693 (693) T ss_pred CCCeEEEEEecCCCCCeEee--cC--CCCcceEEEEEEEeccCceeeccccccCCCC Confidence 110011 11122333333 22 3999999999999999999887776553333 No 176 >protein:vir:106734 Length: 336 # NCBI annotation: gp13 # Family: family:all:1653 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944321;genbank:gi:38638620;genbank:GeneID:2657363 Probab=96.70 E-value=0.00034 Score=39.60 Aligned_cols=314 Identities=11% Similarity=0.058 Sum_probs=142.4 Q ss_pred hHHHHhHhhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccccccccchhhhH-----HHHH Q lcl|NC_021309. 99 NPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLP-----GIVE 173 (497) Q Consensus 99 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~p~~~~-----~ii~ 173 (497) ..+.+......+ ...... + .+....................-.+++..-+|.++. .+++ T Consensus 1 ~~~~~~~~~l~~---~gi~~~--------~-----~~~~~~~~~~~~a~da~d~~~~~~t~~~~g~~~~l~~~i~p~~~~ 64 (336) T protein:vir:10 1 MRDAQRIQNLAR---AGVILP--------R-----SVKNVSTPLAEYAMDAADLSPHLSSTGSSGIPNYLTTYVDPSVID 64 (336) T ss_pred CchHHHHHHHhc---cCeecc--------h-----hhhhhhHHHHHHHHhhhhhccccccCCCcchHHHHHhhcCcceee Confidence 000000000000 000000 0 000000000000000000111111111222444443 4555 Q ss_pred HHHhhhhHHhhcceeecCCC---ceEEEEEcCCCccceecccccccccccccceeeEeeeeeEEeeehhhHHHHh-hH-- Q lcl|NC_021309. 174 QLFYELSLADLISSRPVTSP---NLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTITDEGLR-DA-- 247 (497) Q Consensus 174 ~~~~~~~l~~~~~~~~~~~~---~~~~p~~~~~~~~a~wv~Eg~~~~~s~~~f~~i~~~~~kla~~~~iS~ell~-d~-- 247 (497) .+.+......++++.+.+.. ...++.... .+.+.+.+-..+.|..+...+.-+-..+.++..+.++.+=+. -. T Consensus 65 ~~~~~~~~~~l~~v~t~g~w~~~~~~~~~~e~-~G~a~~ygd~~d~P~~d~~~~~~~~~v~~~~~g~~yg~~El~~A~~~ 143 (336) T protein:vir:10 65 ILVAPMKAAELVGESKKGDWTTLVAAFITAEP-TTKVATYGDYSSDGDSGTNINYPQRQSYFFQTWTRWGERELEMAGAG 143 (336) T ss_pred eeechhchhhhcccccCCCcceeeEEEEeeee-eeeEEEccccCCCcceeeeeeeeeeeEEEEEEEEeeCHHHHHHHHHh Confidence 55555566666666654332 334444443 356777788889999988877778888888888888855443 22 Q ss_pred -HHHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccchhhhhhhHHHHHHhhhhhcchhhhhhhhhh Q lcl|NC_021309. 248 -PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVA 326 (497) Q Consensus 248 -~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 326 (497) .++.+--+.-.++++..+++.-.++|+...+..|++|.+........... T Consensus 144 g~~l~~~Ka~aA~~ale~~~N~~~~~Gd~~~~~~GllN~P~l~a~~t~~~~----------------------------- 194 (336) T protein:vir:10 144 RVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTP----------------------------- 194 (336) T ss_pred CCCcHHHHHHHHHHHHHHhhCeEEEEeecccceEEEeecCCCCcccccCcC----------------------------- Confidence 36778888888889999999999999988889999998765422211110 Q ss_pred hhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhc-----cCCceEEechhHHHHHHHHhhhcCceeccCc Q lcl|NC_021309. 327 SLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLF-----QTPNAVVMNPRDWELLRLTKDANGQYMGGNF 401 (497) Q Consensus 327 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~n~~~~~~l~~lkd~~G~~i~~~~ 401 (497) .....+...+.+++..++..+..... ..+..+++-+.-+..|.. .+..|.-++.-. T Consensus 195 ------------------~w~~~T~~eI~~Di~~~~~~l~~qt~g~i~~~~~~tL~Lp~~~~~~L~~-~n~~g~tv~~~l 255 (336) T protein:vir:10 195 ------------------WSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSK-TNQYGLSAAAKL 255 (336) T ss_pred ------------------cccccCHHHHHHHHHHHHHHHHHhcCCeeeeccceEEEechHHHHhccC-CCccCccHHHHH Confidence 00111223444555555555443332 112345555555555532 122221111100 Q ss_pred ccccccccccccccccccceEecCCCCcCceEEEeeccceEEEEe-ecccEEEeecc---cchhhhcCceEEEEEEeecc Q lcl|NC_021309. 402 FGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTAR-REGVTMQMTNS---NGTDFVDGKVTVRAEERLGL 477 (497) Q Consensus 402 ~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~~~~gd~~~~~~~i~~-r~~~~i~~~~~---~~~~f~~~~v~~r~~~r~~~ 477 (497) . .+..++.++..+.+.... |+..+.++.-.+ ....++.+... ..--...-.+..-+..|.+| T Consensus 256 k-----------~n~Pnl~i~t~pel~~Ag---g~~~~~~~~~~~~~~t~~~~~P~~f~~lpvq~~~~~~~v~~~~rt~G 321 (336) T protein:vir:10 256 K-----------EIFPKLEFVTIPEYDTAS---GRLVQLWAPRVEGKDTATCGFTEKMRAHSIERYSSYFRQKKSAGTWG 321 (336) T ss_pred H-----------HhCCccEEEEcccccccC---CceEEEEEecccCCcceeeecChhhhccceeecCceeEeccccceee Confidence 0 011122333333332111 111110000000 00111111100 00000112355567778877 Q ss_pred eee-cccceEEEEee Q lcl|NC_021309. 478 LVY-RPSAFQLIQLK 491 (497) Q Consensus 478 ~v~-~~~a~~~l~~~ 491 (497) .++ +|.||++++.. T Consensus 322 v~i~rP~ai~~~~GI 336 (336) T protein:vir:10 322 AVIFRPFAVAQMLGV 336 (336) T ss_pred eeeeccchheeeccC Confidence 766 59999999888 No 177 >protein:vir:1829 Length: 355 # NCBI annotation: major capsid protein # Family: family:all:201 # MgeID: mge:324 # MgeName: 186 # Cross-refs: genbank:acc:NP_052253;genbank:gi:9634060;genbank:GeneID:1262428 Probab=96.63 E-value=0.00046 Score=38.90 Aligned_cols=327 Identities=10% Similarity=0.025 Sum_probs=141.9 Q ss_pred HHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccccccccchhhhHHHHHHHHhhhhHHhhcceeecCCCceEE Q lcl|NC_021309. 118 FNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSY 197 (497) Q Consensus 118 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~ 197 (497) .+. ........+.... +..........+....|-|.....+...+.+.+.+++++++++++--.++. T Consensus 1 M~~---------~tr~~~~~y~~~~----A~~ngv~~~~~~~~Fsv~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~ 67 (355) T protein:vir:18 1 MRQ---------ETRFKFNAYLTQL----AKLNGISVDDVSKKFTVEPSVTQTLMNTVQASSAFLQMINILPVAEMKGEK 67 (355) T ss_pred CCh---------HHHHHHHHHHHHH----HHHhCCChhHccceeccCHHHHHHHHHHHHHHHHHhhcCceeccccceeeE Confidence 000 0000000010000 000000000112234566777788888899999999999999988654444 Q ss_pred EEEcCCCccceecc--cc-cccccccccceeeEeeeeeEEeeehhhHHHHhhH---HHHHHHHHHHHHHHHHHHHHhhhh Q lcl|NC_021309. 198 LTESAAHNNAAAVA--EA-GTYPFSSEEFARVYEQVGKVANALTITDEGLRDA---PELFNFVQGRLLEGIQRKEEVQLL 271 (497) Q Consensus 198 p~~~~~~~~a~wv~--Eg-~~~~~s~~~f~~i~~~~~kla~~~~iS~ell~d~---~~l~~~i~~~la~~~~~~~d~~~l 271 (497) .-.....+-++-+. .+ +..|......+.-.+.+++.-.-+.|+.+.|+.. ++++..+++.+.++++.=.-.--+ T Consensus 68 i~lgv~g~iagrtdT~~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~k~~ALD~i~IGf 147 (355) T protein:vir:18 68 IGVGVTGTIASTTDTSGDKERQTADFTALESNKYECNQINFDFHLTYKRLDLWARFQDFQRRIRDAIVQRQALDFIMAGF 147 (355) T ss_pred EeeccCcceeeccccCCCCCcccccccccCCCccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcc Confidence 33321112222221 11 2233333445666677777777778899998864 688899999888877654444444 Q ss_pred cccCccccccccccccccccccccchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhh---------hhhhhccc Q lcl|NC_021309. 272 AGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVV---------TGAAGSGS 342 (497) Q Consensus 272 ~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~ 342 (497) +|+-.....-....+ ..-..+..|+..++..... .+...... T Consensus 148 NG~s~A~~Td~~~nP-----------------------------llqDVNkGWlQ~~Re~ap~rV~~~~~~~~~~~~~~~ 198 (355) T protein:vir:18 148 NGTTRADTSDRVKNP-----------------------------MLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAV 198 (355) T ss_pred cceeeeccCChhhCc-----------------------------CccccchhHHHHHHhcchhhhhccccccccccccce Confidence 553211100000000 0011122222222221110 00000000 Q ss_pred ccccccchhhhhhhHHHHHHHh-hhhhhccCCceEEechhHHHH--HHHHhhhcCceeccCccccccccccccccccccc Q lcl|NC_021309. 343 GVAGSYPTAAEIAENVFDAFVD-IQLTLFQTPNAVVMNPRDWEL--LRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGV 419 (497) Q Consensus 343 ~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~n~~~~~~--l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~ 419 (497) .................++... +...++..+..+++-..++.. -..|....+.| .-...........+|-|+ T Consensus 199 i~~G~~gdy~NLDAlV~d~~~~lI~~~~~~d~dLVvivG~dLla~k~~~l~n~~~~p-----tE~~Aa~~i~s~k~iGGl 273 (355) T protein:vir:18 199 IRVGKNGDYENLDALVMDGTNTLIDEIYQDDPKLVAIVGRKLLADKYFPLVNKQQEN-----TESLAADIIISQKRIGNL 273 (355) T ss_pred eeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhHHHHhHHhhccCCh-----HHHHHHHHHHHHHhhCCc Confidence 0001111222222334455543 456666666644333333222 11222222221 111111112223589999 Q ss_pred ceEecCCCCcCceEEEeeccceEEEEeeccc-EEEeecccchhhhcCceEEEEEEeecceeecccceEEEEeeCCCC--- Q lcl|NC_021309. 420 PVVTTPLIPLGTILVGHFAPSVIQTARREGV-TMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGAT--- 495 (497) Q Consensus 420 Pvv~~~~~~~~~~~~gd~~~~~~~i~~r~~~-~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~~a~--- 495 (497) |.+..+++|.+.+++=-|+- +.|+...+- +-.+-+... +|++--.=..=-++.|-+++.++.+.-.+.+. T Consensus 274 pa~~~PffP~~~~lVT~L~N--LsIY~Q~gs~RR~~~d~p~----r~rie~y~s~Ne~YvVEd~~~~a~ieni~~~~~~~ 347 (355) T protein:vir:18 274 PAVRVPYFPANAVFVTTLEN--LSIYFMDESHRRSIDENPK----KDRVENYESMNIDYVVEAYAAGCLLENITLGDFTA 347 (355) T ss_pred eeEEccccCCCceEEeeccc--cEEEEecCcEEEEEEeccc----cccccchhhhcceeeeeccccEEEEeeeeecCCCC Confidence 99999999999988766664 333333332 112211111 22222222223344444555444443222211 Q ss_pred -----CC Q lcl|NC_021309. 496 -----GS 497 (497) Q Consensus 496 -----~~ 497 (497) |- T Consensus 348 ~~~~~~g 354 (355) T protein:vir:18 348 PAAPEGG 354 (355) T ss_pred cccccCC Confidence 11 No 178 >protein:vir:98566 Length: 355 # NCBI annotation: gp5 # Family: family:all:201 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958060;genbank:gi:41057357;genbank:GeneID:2744237 Probab=96.44 E-value=0.00062 Score=38.16 Aligned_cols=324 Identities=11% Similarity=0.031 Sum_probs=139.6 Q ss_pred HHHHHHHHHHhhhhhhhhhhhhccccc---cccccccchhhhHHHHHHHHhhhhHHhhcceeecCCCceEEEEEcCCCcc Q lcl|NC_021309. 130 AAAELMGAFADGETAPAAIGQNPFGST---GTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNN 206 (497) Q Consensus 130 ~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~g~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~ 206 (497) ...+.+..+.... ... +...+.. .+....|-|.....+...+.+.+.+++++++++++--.++..-.....+- T Consensus 1 M~~~tr~~~~~y~-~~~---A~~ngv~~~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lgv~g~i 76 (355) T protein:vir:98 1 MRPETRFKFNAYL-TRV---AELNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVTGTI 76 (355) T ss_pred CChHHHHHHHHHH-HHH---HHHhCCChhHccceeecCHHHHHHHHHHHHHHHHHhhcCceeccccceeeEeeeccCccc Confidence 0000000010000 000 0011110 11234466777778888889999999999999988654444332211122 Q ss_pred ceecc--c-ccccccccccceeeEeeeeeEEeeehhhHHHHhhH---HHHHHHHHHHHHHHHHHHHHhhhhcccCccccc Q lcl|NC_021309. 207 AAAVA--E-AGTYPFSSEEFARVYEQVGKVANALTITDEGLRDA---PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVN 280 (497) Q Consensus 207 a~wv~--E-g~~~~~s~~~f~~i~~~~~kla~~~~iS~ell~d~---~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~ 280 (497) ++-+. . .+..|.....++.-.+.+++.-.-+.|+.+.|+.. ++++..+++.+.++++.=.-.--++|+-..... T Consensus 77 agrtdT~~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~k~~ALD~i~IGfNG~s~A~~T 156 (355) T protein:vir:98 77 ASTTDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRADTS 156 (355) T ss_pred cccccCCCCCCcccccccccCCCccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceeeeccC Confidence 22211 1 12223333445666677777777778899998864 688999999888877654444444553211100 Q ss_pred cccccccccccccccchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhh---------hhhhhcccccccccchh Q lcl|NC_021309. 281 GLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVV---------TGAAGSGSGVAGSYPTA 351 (497) Q Consensus 281 Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~~ 351 (497) -....+ ..-..+..|+..++..... .+............... T Consensus 157 d~~~nP-----------------------------llqDVNkGWlQ~~Re~ap~~v~~~~~~~~~~~~~~~i~~G~~gdy 207 (355) T protein:vir:98 157 DRTKNT-----------------------------LLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDY 207 (355) T ss_pred ChhhCc-----------------------------CccccchhHHHHHHhcchhhhhhhhcccCccccccceeeCCCCCc Confidence 000000 0011122222222221110 00000000001111122 Q ss_pred hhhhhHHHHHHHh-hhhhhccCCceEEechhHHHHH--HHHhhhcCceeccCcccccccccccccccccccceEecCCCC Q lcl|NC_021309. 352 AEIAENVFDAFVD-IQLTLFQTPNAVVMNPRDWELL--RLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIP 428 (497) Q Consensus 352 ~~~~~~~~~~~~~-~~~~~~~~~~~~~~n~~~~~~l--~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~ 428 (497) ..+....++++.. +...++..+..+++-..++..- ..|......| .-...........+|-|+|.+..+++| T Consensus 208 ~NLDAlV~D~~~~lI~~~~~~d~dLVvivG~dLla~k~~~l~n~~~~p-----tE~~Aa~~i~s~k~iGGlpa~~~PffP 282 (355) T protein:vir:98 208 ENIDALVMDATNNLIDEVYQDDPNLVAIVGRKLLADKYFPLVNKQQEN-----SESLAADIIISQKRIGNLPAVRVPYFP 282 (355) T ss_pred ccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhHHHhhhHhhccCCc-----HHHHHHHHHHHhhhhCCceeEEccccC Confidence 2222334445543 4566666666543333332221 1222222111 110111112223589999999999999 Q ss_pred cCceEEEeeccceEEEEeeccc-EEEeecccchhhhcCceEEEEEEeecceeecccceEEEE---e---eCCC-CCC Q lcl|NC_021309. 429 LGTILVGHFAPSVIQTARREGV-TMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQ---L---KKGA-TGS 497 (497) Q Consensus 429 ~~~~~~gd~~~~~~~i~~r~~~-~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~---~---~~~a-~~~ 497 (497) .+.+++=-|+- +.|+...+- +-.+-+... +|++--.=..=-++.|-+++.++.+. + .+++ .++ T Consensus 283 ~~~~lVT~L~N--LsIY~Q~gs~RR~~~d~p~----r~rie~y~s~Ne~YvVEd~~~~a~ienI~~~~~~~~~~~~~ 353 (355) T protein:vir:98 283 ANAVLVTTLEN--LSIYFMDESHRRSIDENPK----KDRVENYESMNIDYVVEVYAAGCLLENITLGDFTAPAAPES 353 (355) T ss_pred CCceEEeeccc--cEEEEecCcEEEEEEeccc----cccccchhhhcceeeeeccccEEEeeceeeeCCCCCccccc Confidence 99988766664 333333332 111211111 12222222222333444444444332 2 2222 122 No 179 >protein:vir:78777 Length: 358 # NCBI annotation: putative major capsid protein # Family: family:all:201 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285647;genbank:gi:148727153;genbank:GeneID:5220125 Probab=96.30 E-value=0.00077 Score=37.66 Aligned_cols=324 Identities=11% Similarity=0.002 Sum_probs=144.5 Q ss_pred hhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccccccccchhhhHHHHHHHHhhhhHHhhcceeecCCC Q lcl|NC_021309. 114 FDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSP 193 (497) Q Consensus 114 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~ 193 (497) ..+..+. . .......+.... +..........+....|.|.....+...+.+.+.++++++++++.-- T Consensus 1 m~~~M~~--~-------tr~~~~~y~~~~----A~~ngv~~~~~~~~Fsv~p~v~q~L~~~i~ess~FL~~INvv~V~e~ 67 (358) T protein:vir:78 1 MSQTLTV--Q-------AEQRLNKYCDAL----AKAYGIDISKLDKQFSVTGPVETTLRSALLASVEFLGLITCLDVDQI 67 (358) T ss_pred CcccccH--H-------HHHHHHHHHHHH----HHHhCCChhHccceeeeChHHHHHHHHHHHHHHHHhhcCcccccccc Confidence 0000000 0 000000010000 00000000111223457777777888888889999999999998865 Q ss_pred ceEEEEEcCCCccceecccccccccccccceeeEeeeeeEEeeehhhHHHHhhHH------HHHHHHHHHHHHHHHHHHH Q lcl|NC_021309. 194 NLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTITDEGLRDAP------ELFNFVQGRLLEGIQRKEE 267 (497) Q Consensus 194 ~~~~p~~~~~~~~a~wv~Eg~~~~~s~~~f~~i~~~~~kla~~~~iS~ell~d~~------~l~~~i~~~la~~~~~~~d 267 (497) .++........+-++-..- ..|......+.-.+.+++.-.-+.|+.+.|+..+ ++...+++.+.+.++.=.- T Consensus 68 ~Ge~v~lg~~g~iagrt~t--r~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~f~~~~dF~~r~~~~i~~~~ALD~i 145 (358) T protein:vir:78 68 KGQVVQVGVGQLYTGRKKG--GRFKGKVGVDGNTYELTETDSCASLDWATLCTWANAGSEGEFIKLVGEFVNKAFALDML 145 (358) T ss_pred eeeEEeecCCcccceecCC--CccccccccCCCccEEEEeceeeeccHHHHHHHHhCCChhHHHHHHHHHHHHHHhhccc Confidence 5544333221122222221 2333444566666777777667788888887653 6888888888887765443 Q ss_pred hhhhcccCccccccccccccccccccccchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhh--------h Q lcl|NC_021309. 268 VQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAA--------G 339 (497) Q Consensus 268 ~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------~ 339 (497) .--++|+-.....-....+ ..-..+..|+..++......... . T Consensus 146 ~IGfNGts~A~~Td~~~nP-----------------------------llqDVN~GWlQ~~Re~a~~~v~~~~~~~~~i~ 196 (358) T protein:vir:78 146 RVGWNGVSAADDTDPTANP-----------------------------LGQDVNKGWHQLAREWKGGSQIIKAAAGEKIY 196 (358) T ss_pred eecccceeeccCCChhhCc-----------------------------CccccchHHHHHHHhhchhhhhccccccCcee Confidence 4444443211100000000 00111122222222211110000 0 Q ss_pred cccccccccchhhhhhhHHHHHH-HhhhhhhccCCceEEechhHHHHH--HHHhhhcCceeccCcccccccccccccccc Q lcl|NC_021309. 340 SGSGVAGSYPTAAEIAENVFDAF-VDIQLTLFQTPNAVVMNPRDWELL--RLTKDANGQYMGGNFFGNAYGNPVNGGKNI 416 (497) Q Consensus 340 ~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~n~~~~~~l--~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l 416 (497) ...+....+.+. ....++++ ..+...++..+.-+++--.++..- ..|-...+.| .-..... .-..+| T Consensus 197 ig~g~~Gdy~NL---DalV~D~~~~lI~~~~~~d~dLVvivG~dLla~k~~~l~n~~~~p-----TE~~Aa~--~i~k~i 266 (358) T protein:vir:78 197 FDPDGKGEYKTL---DEMASDLINTTIDPLFQQDPRLVVLVGTDLVAAAQAKLYSEATKP-----SEQIAAQ--QLAKSI 266 (358) T ss_pred ecCCCCCccccH---HHHHHHHHhccCChHHhcCCCEEEEEchhhhhHHhhhHhhcCCCc-----HHHHHHH--HHHHHh Confidence 011111122222 22334444 345566666666544443333221 1222222221 1111111 123679 Q ss_pred cccceEecCCCCcCceEEEeeccceEEEEeeccc-EEEeecccchhhhcCceEEEEEEeecceeecccceEEEEeeCCCC Q lcl|NC_021309. 417 WGVPVVTTPLIPLGTILVGHFAPSVIQTARREGV-TMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGAT 495 (497) Q Consensus 417 ~G~Pvv~~~~~~~~~~~~gd~~~~~~~i~~r~~~-~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~~a~ 495 (497) -|+|.+..+++|.+.+++=-|+- +.|+...+- +-.+-+.. .+|++--.=..=-++.|-+++.++.+......- T Consensus 267 GGlpa~~~PfFP~~~ilVT~L~N--LsIY~Q~gs~RR~~~d~p----~r~riE~y~s~Ne~YvVEd~~~~a~iE~i~v~~ 340 (358) T protein:vir:78 267 AGRKAYIPPFFPGKRMVVTTLDN--LHCYTQRGTRKRKADDNQ----DSKSFDNQYWRMEGYALGEHKAYGGFEEADIEI 340 (358) T ss_pred CCCeEEEccccCCCceEEeeccc--cEEEEecCcEEEEEEecc----ccccccchhhhcceeeeeccccEEEEeeeeeee Confidence 99999999999999988766664 333333332 22221111 123333333334456666777766665443211 Q ss_pred C----------C Q lcl|NC_021309. 496 G----------S 497 (497) Q Consensus 496 ~----------~ 497 (497) + + T Consensus 341 ~~~pa~~~~~~~ 352 (358) T protein:vir:78 341 GADPAVLAVEAA 352 (358) T ss_pred CCCCCccccCCc Confidence 1 1 No 180 >protein:vir:99075 Length: 392 # NCBI annotation: gp30 # Family: family:all:10837 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655895;genbank:gi:109521467;genbank:GeneID:4158040 Probab=96.26 E-value=0.00081 Score=37.53 Aligned_cols=269 Identities=16% Similarity=0.100 Sum_probs=112.8 Q ss_pred hccccccccccccchh-hhHHHHHHHHhhhhHHhhccee---ec---CCCceEEEEEcCCCccceec-----cccccccc Q lcl|NC_021309. 151 NPFGSTGTFAPGILPT-FLPGIVEQLFYELSLADLISSR---PV---TSPNLSYLTESAAHNNAAAV-----AEAGTYPF 218 (497) Q Consensus 151 ~~~~~~~~~g~~v~p~-~~~~ii~~~~~~~~l~~~~~~~---~~---~~~~~~~p~~~~~~~~a~wv-----~Eg~~~~~ 218 (497) |+ -.++.|+ +...+++.+++.+++..++..- .. .++.+++|+... ..+.+. +++..... T Consensus 1 Ma-------~~~~~p~~~a~~~l~~l~~~lv~~~lv~~~~~~~~~~~~GdtV~i~~~~~--~~~~~~~~~~~~~~~~~~~ 71 (392) T protein:vir:99 1 MA-------NAFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAP--SRGHTRKLRGAGAERNLTV 71 (392) T ss_pred Cc-------cccccHHHHHHHHHHHHHhhccchhhhccccccccccCCCCeEEEeeccc--ccceeeeccccccCCcccc Confidence 11 1245666 5667899999998887776432 22 255688876533 233332 23344444 Q ss_pred ccccceeeEeeeeeEEe-eehhhHH-HHhhHHHHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccc Q lcl|NC_021309. 219 SSEEFARVYEQVGKVAN-ALTITDE-GLRDAPELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASS 296 (497) Q Consensus 219 s~~~f~~i~~~~~kla~-~~~iS~e-ll~d~~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~ 296 (497) .+.+-+.+++...+... -+.|+++ ..++..++...+.+...++++.++|..++.-- .+.+.+..... T Consensus 72 ~~~~~~~~~~~id~~k~~~~~i~d~e~~~~~~~~~~~~~~~a~~ala~~vd~~i~~~~-~~a~~~~~~~~---------- 140 (392) T protein:vir:99 72 SDFTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMI-VGAPYEAAGAV---------- 140 (392) T ss_pred cccccceEEEEEeeeeecceeechHHHhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHH-hcccccccccc---------- Confidence 45555555555533332 2445544 45666677777777788999999998876310 00000000000 Q ss_pred hhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceE Q lcl|NC_021309. 297 LFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAV 376 (497) Q Consensus 297 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 376 (497) ........++.+..+...+....--...++ T Consensus 141 --------------------------------------------------~~~~~~~~~~~i~~a~~~L~~~~vP~~R~~ 170 (392) T protein:vir:99 141 --------------------------------------------------HEVAPDEFFKGVNGARRALNELYIPQGRVL 170 (392) T ss_pred --------------------------------------------------cccChhhhHHHHHHHHHHHhhcCCCCCCEE Confidence 000001112223333333222222223467 Q ss_pred EechhHHHHHHHHhhhcCceeccCccccccc--ccccccccccccceEecCCCCcCceEEEeeccceEEEEeecc----- Q lcl|NC_021309. 377 VMNPRDWELLRLTKDANGQYMGGNFFGNAYG--NPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREG----- 449 (497) Q Consensus 377 ~~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~--~~~~~~~~l~G~Pvv~~~~~~~~~~~~gd~~~~~~~i~~r~~----- 449 (497) ++.|..+..|.+ |.. ++.....+.... ...+...++.|++|+.+..+|.++.+.+..+. +.+..+.. T Consensus 171 vv~p~~~~~l~~--~~~--~~~~~~~g~~~~~~l~~G~vg~i~G~~v~~s~~~~~~t~~a~~~~a--~~~at~a~v~~~~ 244 (392) T protein:vir:99 171 VVGTAVTEQILN--DDR--FIKYESQGQSAVSALQEARLGRIYGYEIVESTLIPHGDAYLYHPTA--FIMATRAPAPPMG 244 (392) T ss_pred EEcHHHHHHHhc--ccc--eeecccccchhhhhhhcceeeeeeeeEEEeecccccccceeeeccc--ccccccccccccc Confidence 888887777653 311 111111111000 01122347899999999999987765443222 11111110 Q ss_pred ------------cEEEeecccchhhhcCceEEEEEEeecceeec---ccceEE---EEee---------CCCCCC Q lcl|NC_021309. 450 ------------VTMQMTNSNGTDFVDGKVTVRAEERLGLLVYR---PSAFQL---IQLK---------KGATGS 497 (497) Q Consensus 450 ------------~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~---~~a~~~---l~~~---------~~a~~~ 497 (497) +...+.......+..+...+-. -.+..... ..+|.. ++.. ..+..+ T Consensus 245 ~~~~~s~s~~~~v~~~~~~~~~~t~~s~~~~v~~--~~g~~~v~~~~~~~~~~~~~~~~~~~~v~v~~v~~~~~~ 317 (392) T protein:vir:99 245 AVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDT--YFGLKVVEDPNGVGFVRARKIHLIPGSIEVAPEAGANAT 317 (392) T ss_pred ccceeEEecccceecceeecccceeeccccccce--eEEEEEEeeccccceeeeeeeeeecceeeeeeeecccce Confidence 0000000000001111111000 00001110 001100 0000 000000 No 181 >protein:vir:104011 Length: 337 # NCBI annotation: P2 family phage major capsid protein # Family: family:all:201 # MgeID: mge:1665 # MgeName: phi52237 # Cross-refs: genbank:acc:YP_293748;genbank:gi:72537718;genbank:GeneID:3608142 Probab=96.21 E-value=0.00087 Score=37.36 Aligned_cols=324 Identities=12% Similarity=0.034 Sum_probs=150.6 Q ss_pred HHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccccccccchhhhHHHHHHHHhhhhHHhhcceeecCCCceEE Q lcl|NC_021309. 118 FNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSY 197 (497) Q Consensus 118 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~ 197 (497) .+. ........+.... +. .......+....|-|.....+...+.+.+.+++++++++++--.++. T Consensus 1 M~~---------~tr~~~~~y~~~~----A~--~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~ 65 (337) T protein:vir:10 1 MRK---------ETRQAYEKYAAQI----AK--LNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEK 65 (337) T ss_pred CCh---------HHHHHHHHHHHHH----HH--hcChhhhcceeeecHHHHHHHHHHHHHHHHhhccCceeccccceeeE Confidence 000 0000000010100 00 00001112234466777778888888889999999999988654443 Q ss_pred EEEcCCCccceec--ccccccccccccceeeEeeeeeEEeeehhhHHHHhhH---HHHHHHHHHHHHHHHHHHHHhhhhc Q lcl|NC_021309. 198 LTESAAHNNAAAV--AEAGTYPFSSEEFARVYEQVGKVANALTITDEGLRDA---PELFNFVQGRLLEGIQRKEEVQLLA 272 (497) Q Consensus 198 p~~~~~~~~a~wv--~Eg~~~~~s~~~f~~i~~~~~kla~~~~iS~ell~d~---~~l~~~i~~~la~~~~~~~d~~~l~ 272 (497) .-.....+-++-+ +.+...|..-...+.-.+.+++.-.-..|+.+.|+.. ++++..+++.+.++++.=.-.--++ T Consensus 66 v~lg~~g~iagrt~t~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfn 145 (337) T protein:vir:10 66 LGLSVSGPIASRTDTTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWN 145 (337) T ss_pred EeeccCcceeeeecCCCCccccccccccCCCccEEEEeeeeeeccHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhccc Confidence 3322111112111 2223334444556667777777777788999999864 6899999998888776544444445 Q ss_pred ccCccccccccccccccccccccchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhh----hccccccccc Q lcl|NC_021309. 273 GGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAA----GSGSGVAGSY 348 (497) Q Consensus 273 G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~ 348 (497) |+-.....-....+- .-..+..|+..++......-.. .......... T Consensus 146 G~s~A~~Td~~~nPl-----------------------------lqDVNkGWlQ~~Re~ap~rV~~~~~~~~~~i~iG~~ 196 (337) T protein:vir:10 146 GVKAAATTDRQANPL-----------------------------LQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKA 196 (337) T ss_pred ceeeccCCChhhCcC-----------------------------ccccchhHHHHHHhcchhhhhccccccCcceeecCC Confidence 532111100000000 0111222222222210000000 0000001111 Q ss_pred chhhhhhhHHHHHHHh-hhhhhccCCceEEechhHHHHHH--HHhhhcCceeccCcccccccccccccccccccceEecC Q lcl|NC_021309. 349 PTAAEIAENVFDAFVD-IQLTLFQTPNAVVMNPRDWELLR--LTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTP 425 (497) Q Consensus 349 ~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~n~~~~~~l~--~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~ 425 (497) ...........+++.. +...++..+.-+++--.++..-. .+-...+. +.-...........+|-|+|.+..+ T Consensus 197 gdy~nLDalV~D~~~~lI~~~~~~d~~LVvivG~dLladk~~~l~n~~~~-----ptE~~Aa~~i~s~k~iGGlpa~~~P 271 (337) T protein:vir:10 197 GDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYFPIVNATQA-----PTERLAADLIVSQKRIGNLPAVRVP 271 (337) T ss_pred CCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhhHHhhHHhccCCC-----cHHHHHHHHHHHhhhhCCceeEEcc Confidence 1222222334455543 45666666665444333322211 11111111 1110111111223589999999999 Q ss_pred CCCcCceEEEeeccceEEEEeeccc-EEEeecccchhhhcCceEEEEEEeecceeecccceEEEEeeCCCCC Q lcl|NC_021309. 426 LIPLGTILVGHFAPSVIQTARREGV-TMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGATG 496 (497) Q Consensus 426 ~~~~~~~~~gd~~~~~~~i~~r~~~-~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~~a~~ 496 (497) ++|.+.+++=-|+- +.|+...+- +-.+-+.. .+|++--.=..=-++.|-+++.++.++-...+.+ T Consensus 272 ffP~~~~lVT~L~N--LsIY~Q~gs~RR~~~d~p----~r~rie~y~s~Ne~YvVEd~~~~a~ienI~~~~a 337 (337) T protein:vir:10 272 FFPKRALMVTKLSN--LSIYYQEGARRRTLKEVP----ERDRIENYESSNDAYVVEDFGCGCVAENIELAAA 337 (337) T ss_pred ccCCCceEEeechh--cEEEEecCcEEEEEEEcc----ccccccchhhccceeeeeccccEEEEeceeecCC Confidence 99999988776665 333333332 22221111 1334433333445677778888887766666666 No 182 >protein:vir:1153 Length: 338 # NCBI annotation: predicted major capsid protein # Family: family:all:201 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490602;genbank:gi:17313222;genbank:GeneID:927319 Probab=96.19 E-value=0.00089 Score=37.30 Aligned_cols=320 Identities=13% Similarity=0.051 Sum_probs=148.2 Q ss_pred HHHHHHHHHHhhhhhhhhhhhhccccccccccccchhhhHHHHHHHHhhhhHHhhcceeecCCCceEEEEEcCCCcccee Q lcl|NC_021309. 130 AAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAA 209 (497) Q Consensus 130 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~w 209 (497) .....+..+... ....+ ........+....|.|.....+...+.+.+.++++++++++.--.++..-.....+-++- T Consensus 1 M~~~tr~~~~~y-~~~~A--~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~v~lg~~g~iagr 77 (338) T protein:vir:11 1 MRNETRKQFDAY-LAQLA--KLNGVNSAVQTFAVEPSVQQKLEQRIQESSEFLKQINVYGVDELQGEKIGIGVSGTIASR 77 (338) T ss_pred CCHHHHHHHHHH-HHHHH--HHhCCCcccceeeeCHHHHHHHHHHHHHHHHhhccCceecccceeeeEeeeccCcccccc Confidence 000000111000 00000 001111223344577877888889999999999999999988654443332211122222 Q ss_pred cc--cc-cccccccccceeeEeeeeeEEeeehhhHHHHhhH---HHHHHHHHHHHHHHHHHHHHhhhhcccCcccccccc Q lcl|NC_021309. 210 VA--EA-GTYPFSSEEFARVYEQVGKVANALTITDEGLRDA---PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLL 283 (497) Q Consensus 210 v~--Eg-~~~~~s~~~f~~i~~~~~kla~~~~iS~ell~d~---~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~ 283 (497) +. .+ +..|..-...+.-.+.+++.-.-..|+.+.|+.. ++++..+++.+.++++.=.-.--++|+-.....-.. T Consensus 78 tdT~~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~k~~ALD~i~IGfnG~s~A~~Td~~ 157 (338) T protein:vir:11 78 TDTTGDGVRKPRDVSALDNQRYECKHTDFDTAITYAMLDAWAKFPEFQALLRDAILKRQALDRLMIGFNGTSAAATTNRA 157 (338) T ss_pred ccCCCCCccccccccccCCCccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceeeccCCChh Confidence 21 11 1222222245555667777776778899998864 689999999888877654444444553211100000 Q ss_pred ccccccccccccchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhh--------hhhhhcccccccccchhhhhh Q lcl|NC_021309. 284 QRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVV--------TGAAGSGSGVAGSYPTAAEIA 355 (497) Q Consensus 284 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~~~~~~~ 355 (497) ..+- .-..+..|+..++..... .+......+....+.+. . T Consensus 158 ~nPl-----------------------------lqDVNkGWlQ~~Re~ap~rv~~~~~~~~~i~i~~g~~gdy~nL---D 205 (338) T protein:vir:11 158 ANPL-----------------------------LQDVNIGWFQQYRNNAPARVLKEGKTTGKVVVGNGADADYKNL---D 205 (338) T ss_pred hCcC-----------------------------ccccchhHHHHHHhhhhhhhhhcccccceeeecCCCCCccccH---H Confidence 0000 011112222222221100 00000000111122222 2 Q ss_pred hHHHHHHH-hhhhhhccCCceEEechhHHHHHH--HHhhhcCceeccCcccccccccccccccccccceEecCCCCcCce Q lcl|NC_021309. 356 ENVFDAFV-DIQLTLFQTPNAVVMNPRDWELLR--LTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTI 432 (497) Q Consensus 356 ~~~~~~~~-~~~~~~~~~~~~~~~n~~~~~~l~--~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~~ 432 (497) ....+++. .+...++..+.-+++-..++..-. .+-..... +.-...........+|-|+|.+..+++|.+.+ T Consensus 206 alV~d~~~~lI~~~~~~d~dLVvivG~dLladk~~~l~n~~~~-----ptE~~Aa~~~~s~k~iGGlpa~~~PffP~~~~ 280 (338) T protein:vir:11 206 ALVFDVVSSLIDPWHRRDPGLVVILGRELVHDKYFPMVNKDQP-----ATEKIATDLILSQKRMGGLPPVEVPYVPEKGL 280 (338) T ss_pred HHHHHHHhccCChHHhcCCCEEEEEchhhhHHHHhHHHhcCCC-----hHHHHHHHHHHHhhhhCCceeEEccccCCCce Confidence 23344454 345666666654333323322211 12121111 11111111122245899999999999999998 Q ss_pred EEEeeccceEEEEeeccc-EEEeecccchhhhcCceEEEEEEeecceeecccceEEEEeeCCCC Q lcl|NC_021309. 433 LVGHFAPSVIQTARREGV-TMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGAT 495 (497) Q Consensus 433 ~~gd~~~~~~~i~~r~~~-~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~~a~ 495 (497) ++=-|+- +.|+...+- +-.+-+.. .+|++--.=..=-++.|-+++.++.+.-.+.+. T Consensus 281 lVT~L~N--LsIY~Q~gs~RR~~~d~p----~r~rie~y~s~Ne~YvVEd~~~~a~ieni~~~~ 338 (338) T protein:vir:11 281 MVTTLKN--LSLYWQIGGRRRYLKEVP----EKNRIENYESSNDAYVVEDYGLGCLVENIEVAE 338 (338) T ss_pred EEeeccc--cEEEEecCcEEEEEEecc----ccccccchhhhccceeeeccccEEEeecceecC Confidence 8766664 333333332 22221111 133333333344566777888888777666666 No 183 >protein:vir:78186 Length: 337 # NCBI annotation: gp2, phage major capsid protein, P2 family # Family: family:all:201 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111152;genbank:gi:134288735;genbank:GeneID:4960646 Probab=96.19 E-value=0.00089 Score=37.30 Aligned_cols=324 Identities=12% Similarity=0.037 Sum_probs=150.6 Q ss_pred HHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccccccccchhhhHHHHHHHHhhhhHHhhcceeecCCCceEE Q lcl|NC_021309. 118 FNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSY 197 (497) Q Consensus 118 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~ 197 (497) .+. ........+.... + ........+....|-|.....+...+.+.+.+++++++++++.-.++. T Consensus 1 M~~---------~tr~~~~~y~~~~----A--~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~ 65 (337) T protein:vir:78 1 MRK---------ETRQAYEKYAAQI----A--KLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEK 65 (337) T ss_pred CCh---------HHHHHHHHHHHHH----H--HhcChhhhcceeecChHHHHHHHHHHHHHHHHhccCCccccccceeeE Confidence 000 0000000010000 0 000001112233466777788888889999999999999988654443 Q ss_pred EEEcCCCccceec--ccccccccccccceeeEeeeeeEEeeehhhHHHHhhH---HHHHHHHHHHHHHHHHHHHHhhhhc Q lcl|NC_021309. 198 LTESAAHNNAAAV--AEAGTYPFSSEEFARVYEQVGKVANALTITDEGLRDA---PELFNFVQGRLLEGIQRKEEVQLLA 272 (497) Q Consensus 198 p~~~~~~~~a~wv--~Eg~~~~~s~~~f~~i~~~~~kla~~~~iS~ell~d~---~~l~~~i~~~la~~~~~~~d~~~l~ 272 (497) .-.....+-++-. +-+...|..-..++.-...+++.-.-+.|+.+.|+.. +++...+++.+.+.++.=.-.--++ T Consensus 66 v~lg~~g~iagrtdt~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfN 145 (337) T protein:vir:78 66 LGLSVSGPIASRTDTTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWN 145 (337) T ss_pred EecccCcceeeeecCCCcccccccccccCCCccEEEEeceecccCHHHHHHHhcChhHHHHHHHHHHHHHhhccceeccc Confidence 3322111112111 1122333334456666677777766778899998864 6888888888888776544444445 Q ss_pred ccCccccccccccccccccccccchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhh----hccccccccc Q lcl|NC_021309. 273 GGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAA----GSGSGVAGSY 348 (497) Q Consensus 273 G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~ 348 (497) |+-.....-....+- .-..+..|+..++......-.. .......... T Consensus 146 Gts~A~~Td~~~nPl-----------------------------lqDVN~GWlQ~~Re~ap~rVl~~~~~~~~~i~iG~~ 196 (337) T protein:vir:78 146 GVKAAATTDRQANPL-----------------------------LQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLIGKA 196 (337) T ss_pred ceeeccCCChhhCcC-----------------------------ccccchHHHHHHHhcchhhhhccccccCCceeecCC Confidence 432211110000000 0111122222222211000000 0000011111 Q ss_pred chhhhhhhHHHHHHHh-hhhhhccCCceEEechhHHHHHH--HHhhhcCceeccCcccccccccccccccccccceEecC Q lcl|NC_021309. 349 PTAAEIAENVFDAFVD-IQLTLFQTPNAVVMNPRDWELLR--LTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTP 425 (497) Q Consensus 349 ~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~n~~~~~~l~--~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~ 425 (497) ..........++++.. +...++..+.-+++-..++..-. .+-...+.| .-...........+|-|+|.+..+ T Consensus 197 gdy~NLDalV~d~~~~lI~~~~~~d~dLVvivG~dLladk~~~l~n~~~~p-----tE~~Aa~~i~s~k~iGGl~a~~~P 271 (337) T protein:vir:78 197 GDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYFPIVNATQAP-----TERLAADLIVSQKRIGNLPAVRVP 271 (337) T ss_pred CCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhHHHHHHHHhcCCCc-----HHHHHHHHHHHhhhhcCcceEEcc Confidence 2222233344555543 46667766665444433322211 111111111 111111112223589999999999 Q ss_pred CCCcCceEEEeeccceEEEEeeccc-EEEeecccchhhhcCceEEEEEEeecceeecccceEEEEeeCCCCC Q lcl|NC_021309. 426 LIPLGTILVGHFAPSVIQTARREGV-TMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGATG 496 (497) Q Consensus 426 ~~~~~~~~~gd~~~~~~~i~~r~~~-~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~~a~~ 496 (497) ++|.+.+++=-|+- +.|+...+- +-.+-+.. .+|++--.=..=-++.|-+++.++.++-...+.+ T Consensus 272 fFP~~~ilVT~L~N--LsIY~Q~gs~RR~~~d~p----~r~rie~y~s~Ne~YvVEd~~~~a~iEnI~~~~a 337 (337) T protein:vir:78 272 FFPKRALMVTKLSN--LSIYYQEGARRRTLKEVP----ERDRIENYESSNDAYVVEDFGCGCVAENIELAAA 337 (337) T ss_pred ccCCCceEEeechh--cEEEEecCcEEEEEEecc----ccccccchhhccceeeeeccccEEEEeceeecCC Confidence 99999988776665 333333332 22221111 1334433333445677778888887766666666 No 184 >protein:vir:79171 Length: 337 # NCBI annotation: gp2, phage major capsid protein, P2 family # Family: family:all:201 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111033;genbank:gi:134288740;genbank:GeneID:4960690 Probab=96.18 E-value=0.00091 Score=37.26 Aligned_cols=324 Identities=12% Similarity=0.033 Sum_probs=150.1 Q ss_pred HHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccccccccchhhhHHHHHHHHhhhhHHhhcceeecCCCceEE Q lcl|NC_021309. 118 FNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSY 197 (497) Q Consensus 118 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~ 197 (497) .+. ........+.... +. .......+....|-|.....+...+.+.+.+++++++++++--.++. T Consensus 1 M~~---------~tr~~~~~y~~~~----A~--~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~ 65 (337) T protein:vir:79 1 MRK---------ETRQAYEKYAAQI----AK--LNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEK 65 (337) T ss_pred CCh---------HHHHHHHHHHHHH----HH--hcChhhhcceeeecHHHHHHHHHHHHHHHHhhccCceeccccceeeE Confidence 000 0000000011100 00 00001112233466777778888888889999999999988654443 Q ss_pred EEEcCCCccceec--ccccccccccccceeeEeeeeeEEeeehhhHHHHhhH---HHHHHHHHHHHHHHHHHHHHhhhhc Q lcl|NC_021309. 198 LTESAAHNNAAAV--AEAGTYPFSSEEFARVYEQVGKVANALTITDEGLRDA---PELFNFVQGRLLEGIQRKEEVQLLA 272 (497) Q Consensus 198 p~~~~~~~~a~wv--~Eg~~~~~s~~~f~~i~~~~~kla~~~~iS~ell~d~---~~l~~~i~~~la~~~~~~~d~~~l~ 272 (497) .-.....+-++-+ +.+...|..-...+.-.+.+++.-.-..|+.+.|+.. ++++..+++.+.++++.=.-.--++ T Consensus 66 v~lg~~g~iagrt~t~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfn 145 (337) T protein:vir:79 66 LGLSVSGPIASRTDTTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDAWAKFADFQQRIRDVILNQGALDRIMIGWN 145 (337) T ss_pred EeeccCcceeeeecCCCCccccccccccCCCccEEEEeeeeeeccHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhccc Confidence 3322111112111 2223334444556667777777777788999999864 6899999998888776544444445 Q ss_pred ccCccccccccccccccccccccchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhh----hccccccccc Q lcl|NC_021309. 273 GGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAA----GSGSGVAGSY 348 (497) Q Consensus 273 G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~ 348 (497) |+-.....-....+- .-..+..|+..++......-.. .......... T Consensus 146 G~s~A~~Td~~~nPl-----------------------------lqDVNkGWlQ~~Re~ap~rV~~~~~~~~~~i~iG~~ 196 (337) T protein:vir:79 146 GVKAAATTDRQANPL-----------------------------LQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKA 196 (337) T ss_pred ceeeccCCChhhCcC-----------------------------ccccchhHHHHHHhcchhhhhccccccCcceeecCC Confidence 532111100000000 0111222222222210000000 0000011111 Q ss_pred chhhhhhhHHHHHHHh-hhhhhccCCceEEechhHHHHHH--HHhhhcCceeccCcccccccccccccccccccceEecC Q lcl|NC_021309. 349 PTAAEIAENVFDAFVD-IQLTLFQTPNAVVMNPRDWELLR--LTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTP 425 (497) Q Consensus 349 ~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~n~~~~~~l~--~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~ 425 (497) ...........+++.. +...++..+.-+++--.++..-. .+-...+. +.-...........+|-|+|.+..+ T Consensus 197 gdy~nLDalV~D~~~~lI~~~~~~d~~LVvivG~dLladk~~~l~n~~~~-----ptE~~Aa~~i~s~k~iGGlpa~~~P 271 (337) T protein:vir:79 197 GDYENLDALVMDIVSSMIDPWFQEDTGLVAICGRELLHDKYFPIVNATQA-----PTERLAADLIVSQKRIGNLPAVRVP 271 (337) T ss_pred CCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhhHHhhHHhccCCC-----cHHHHHHHHHHHhhhhCCceeEEcc Confidence 1222222334455543 45666666665444333322211 11111111 1110111111223589999999999 Q ss_pred CCCcCceEEEeeccceEEEEeeccc-EEEeecccchhhhcCceEEEEEEeecceeecccceEEEEeeCCCCC Q lcl|NC_021309. 426 LIPLGTILVGHFAPSVIQTARREGV-TMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGATG 496 (497) Q Consensus 426 ~~~~~~~~~gd~~~~~~~i~~r~~~-~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~~a~~ 496 (497) ++|.+.+++=-|+- +.|+...+- +-.+-+.. .+|++--.=..=-++.|-+++.++.++-...+.+ T Consensus 272 ffP~~~~lVT~L~N--LsIY~Q~gs~RR~~~d~p----~r~rie~y~s~Ne~YvVEd~~~~a~ienI~~~~a 337 (337) T protein:vir:79 272 FFPKRALMVTKLSN--LSIYYQEGARRRTLKEVP----ERDRIENYESSNDAYVVEDFGCGCVAENIELAAA 337 (337) T ss_pred ccCCCceEEeechh--cEEEEecCcEEEEEEEcc----ccccccchhhccceeeeeccccEEEEeceeecCC Confidence 99999988776665 333333332 22221111 1334433333445667778888877765555555 No 185 >protein:vir:5694 Length: 357 # NCBI annotation: gpN # Family: family:all:201 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839853;genbank:gi:30065708;genbank:GeneID:1260602 Probab=96.16 E-value=0.00093 Score=37.22 Aligned_cols=324 Identities=12% Similarity=0.020 Sum_probs=144.5 Q ss_pred HHHHHHHHHHhhhhhhhhhhhhccccc---cccccccchhhhHHHHHHHHhhhhHHhhcceeecCCCceEEEEEcCCCcc Q lcl|NC_021309. 130 AAAELMGAFADGETAPAAIGQNPFGST---GTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNN 206 (497) Q Consensus 130 ~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~g~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~ 206 (497) ...+.+..+.... .. .+...+.. .+....|-|.....+...+.+.+.+++++++++++--.++..-.....+- T Consensus 1 M~~~tr~~~~~y~-~~---~A~~ngv~~~d~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lg~~g~i 76 (357) T protein:vir:56 1 MRQETRFKFNAYL-SR---VAELNGIDAGDVSKKFTVEPSVTQTLMNTMQESSDFLTRINIVPVSEMKGEKIGIGVTGSI 76 (357) T ss_pred CChHHHHHHHHHH-HH---HHHHhCCChHHhcceeecCHHHHHHHHHHHHHHHHHhccCCccccccceeeEEecccCccc Confidence 0000000110000 00 00011111 11234466777788888889999999999999988655444333211122 Q ss_pred ceecc--cc-cccccccccceeeEeeeeeEEeeehhhHHHHhhH---HHHHHHHHHHHHHHHHHHHHhhhhcccCccccc Q lcl|NC_021309. 207 AAAVA--EA-GTYPFSSEEFARVYEQVGKVANALTITDEGLRDA---PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVN 280 (497) Q Consensus 207 a~wv~--Eg-~~~~~s~~~f~~i~~~~~kla~~~~iS~ell~d~---~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~ 280 (497) ++-+. -+ ...|..-..++.-...+++.-.-+.|+.+.|+.. +++...+++.+.+.++.=.-.--++|+-..... T Consensus 77 agrtdT~~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~T 156 (357) T protein:vir:56 77 ASTTDTAGGTERQPKDFSKLASNKYECDQINFDFYIRYKTLDLWARYQDFQLRVRNAIIKRQSLDFIMAGFNGVKRAETS 156 (357) T ss_pred cccccCCCCCCcccccccccCCCccEEEEeeecccccHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeeeccC Confidence 22211 11 1122222345666666777666678888988864 678888888888877654444444443211111 Q ss_pred cccccccccccccccchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhh---------hhhhcccccccccchh Q lcl|NC_021309. 281 GLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVT---------GAAGSGSGVAGSYPTA 351 (497) Q Consensus 281 Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~ 351 (497) -....+ ..-..+..|+..++...... +............... T Consensus 157 d~~~nP-----------------------------llqDVN~GWlQ~~Re~ap~rVm~~~~~~~g~~~~~~i~~G~~gdy 207 (357) T protein:vir:56 157 DRSSNP-----------------------------MLQDVAVGWLQKYRNEAPARVMSKVTDEEGHTTSEVIRVGKGGDY 207 (357) T ss_pred ChhhCc-----------------------------CccccchhHHHHHHhhchhhhhccccccCCccccceeeecCCCCc Confidence 000000 01111222222222211100 0000000001111122 Q ss_pred hhhhhHHHHHHHh-hhhhhccCCceEEechhHHHHHH--HHhhhcCceeccCcccccccccccccccccccceEecCCCC Q lcl|NC_021309. 352 AEIAENVFDAFVD-IQLTLFQTPNAVVMNPRDWELLR--LTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIP 428 (497) Q Consensus 352 ~~~~~~~~~~~~~-~~~~~~~~~~~~~~n~~~~~~l~--~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~ 428 (497) ..+....++++.. +...++..+..+++--.++..-. .|....+.| .-...........+|-|+|.+..+++| T Consensus 208 ~NLDalV~D~~~~lI~~~~~~d~dLVvivG~dLla~k~~~l~n~~~~p-----TE~~Aa~~i~s~k~iGGl~a~~~PfFP 282 (357) T protein:vir:56 208 ASLDALVMDATNNLIEPWYQEDPDLVVIVGRQLLADKYFPIVNKEQDN-----SEMLAADVIISQKRIGNLPAVRVPYFP 282 (357) T ss_pred ccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhhhhhhhHhhccCCh-----HHHHHHHHHHHhhhhCCceeEEccccC Confidence 2222334455543 46666766665444433322211 221222211 111111122224589999999999999 Q ss_pred cCceEEEeeccceEEEEeeccc-EEEeecccchhhhcCceEEEEEEeecceeecccceEEEEeeCCCCCC Q lcl|NC_021309. 429 LGTILVGHFAPSVIQTARREGV-TMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 429 ~~~~~~gd~~~~~~~i~~r~~~-~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~~a~~~ 497 (497) .+.+++=-|+- +.|+...+- +-.+-+.. .+|++--.=..=-++.|-+++.++.+.-.+.+.+. T Consensus 283 ~~~llVT~L~N--LsIY~Q~gs~RR~~~d~p----~r~riE~y~s~Ne~YvVEd~~~~a~iE~i~i~~~~ 346 (357) T protein:vir:56 283 ADAMLITKLEN--LSIYYMDDSHRRVIEENP----KLDRVENYESMNIDYVVEDYAAGCLVEKIKVGDFS 346 (357) T ss_pred CCceEEeeccc--cEEEEecCcEEEEEEecc----ccccccchhhhcceeeeeccccEEEeeeeeeccCC Confidence 99988766664 333333332 21221111 12333322223345555666666655444333333 No 186 >protein:vir:6061 Length: 357 # NCBI annotation: gpN # Family: family:all:201 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878202;genbank:gi:33438901;genbank:GeneID:1457736 Probab=96.00 E-value=0.0011 Score=36.71 Aligned_cols=324 Identities=12% Similarity=0.032 Sum_probs=143.0 Q ss_pred HHHHHHHHHHhhhhhhhhhhhhccccc---cccccccchhhhHHHHHHHHhhhhHHhhcceeecCCCceEEEEEcCCCcc Q lcl|NC_021309. 130 AAAELMGAFADGETAPAAIGQNPFGST---GTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNN 206 (497) Q Consensus 130 ~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~g~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~ 206 (497) ...+.+..+.... ... +...+.. .+....|-|.....+...+.+.+.+++++++++++--.++..-.....+- T Consensus 1 M~~~tr~~~~~y~-~~~---A~~ngv~~~d~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lg~~g~i 76 (357) T protein:vir:60 1 MRQETRFKFNAYL-SRV---AELNGIDAGDVSKKFTVEPSVTQTLMNTMQESSDFLTRINIVPVSEMKGEKIGIGVTGSI 76 (357) T ss_pred CChHHHHHHHHHH-HHH---HHHhCCChHHhcceeecCHHHHHHHHHHHHHHHHHhccCCccccccceeeEEecccCccc Confidence 0000000110000 000 0011111 11234466777778888889999999999999988654444332211122 Q ss_pred ceecc--cc-cccccccccceeeEeeeeeEEeeehhhHHHHhhH---HHHHHHHHHHHHHHHHHHHHhhhhcccCccccc Q lcl|NC_021309. 207 AAAVA--EA-GTYPFSSEEFARVYEQVGKVANALTITDEGLRDA---PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVN 280 (497) Q Consensus 207 a~wv~--Eg-~~~~~s~~~f~~i~~~~~kla~~~~iS~ell~d~---~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~ 280 (497) ++-+. -+ ...|..-..++.-...+++.-.-+.|+.+.|+.. +++...+++.+.+.++.=.-.--++|+-..... T Consensus 77 agrtdT~~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~T 156 (357) T protein:vir:60 77 ASTTDTAGGTERQPKDFSKLASNKYECDQINFDFYIRYKTLDLWARYQDFQLRVRNAIIKRQSLDLIMAGFNGVRRAETS 156 (357) T ss_pred ccccccCCCCCcccccccccCCCccEEEEeeeeccccHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeeeccC Confidence 22211 11 1122222355666667777766778889998864 678888888888877654444444443211111 Q ss_pred cccccccccccccccchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhh---------hhhhcccccccccchh Q lcl|NC_021309. 281 GLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVT---------GAAGSGSGVAGSYPTA 351 (497) Q Consensus 281 Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~ 351 (497) -....+ ..-..+..|+..++...... +............... T Consensus 157 d~~~nP-----------------------------llqDVN~GWlQ~~Re~ap~rVm~~~~~~~g~~~~~~i~~G~~gdy 207 (357) T protein:vir:60 157 DRSSNQ-----------------------------MLQDVAVGWLQKYRNEAPARVMSKVTDEEGHTTSEVIRVGKGGDY 207 (357) T ss_pred ChhhCc-----------------------------CccccchhHHHHHHhhchhhhhccccccCCccccceeeecCCCCc Confidence 000000 01111222222222211100 0000000001111122 Q ss_pred hhhhhHHHHHHHh-hhhhhccCCceEEechhHHHHHH--HHhhhcCceeccCcccccccccccccccccccceEecCCCC Q lcl|NC_021309. 352 AEIAENVFDAFVD-IQLTLFQTPNAVVMNPRDWELLR--LTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIP 428 (497) Q Consensus 352 ~~~~~~~~~~~~~-~~~~~~~~~~~~~~n~~~~~~l~--~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~ 428 (497) ..+....++++.. +...++..+..+++--.++..-. .|....+.| .-...........+|-|+|.+..+++| T Consensus 208 ~NLDalV~D~~~~lI~~~~~~d~dLVvivG~dLla~k~~~l~n~~~~p-----TE~~Aa~~i~s~k~iGGl~a~~~PfFP 282 (357) T protein:vir:60 208 ASLDALVMDATNNLIEPWYQEDPDLVVIVGRQLLADKYFPIVNREQDN-----SEMLAADVIISQKRIGNLPAVRVPYFP 282 (357) T ss_pred ccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhhHHhhhHhhcCCCh-----HHHHHHHHHHHhhhhcCcceEEccccC Confidence 2222334455543 46666766665544433322211 111222111 111111112224589999999999999 Q ss_pred cCceEEEeeccceEEEEeeccc-EEEeecccchhhhcCceEEEEEEeecceeecccceEEEEe------eCCCCCC Q lcl|NC_021309. 429 LGTILVGHFAPSVIQTARREGV-TMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQL------KKGATGS 497 (497) Q Consensus 429 ~~~~~~gd~~~~~~~i~~r~~~-~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~------~~~a~~~ 497 (497) .+.+++=-|+- +.|+...+- +-.+-+.. .+|++--.=..=-++.|-+++.++.+.- .+++.+. T Consensus 283 ~~~llVT~L~N--LsIY~Q~gs~RR~~~d~p----~r~riE~y~s~Ne~YvVEd~~~~a~iE~i~~~~~~~pa~~~ 352 (357) T protein:vir:60 283 ADAMLITKLEN--LSIYYMDDSHRRVIEENP----KLDRVENYESMNIDYVVEDYAAGCLVEKIKVGDFSTPAKAT 352 (357) T ss_pred CCceEEeeccc--cEEEEecCcEEEEEEecc----ccccccchhhhcceeeeeccccEEEeeeeeeccCcccccCC Confidence 99988766664 333333332 21221111 1223322222334455555655555542 2233333 No 187 >protein:vir:79157 Length: 339 # NCBI annotation: P2 family phage major capsid protein # Family: family:all:201 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165257;genbank:gi:145708082;genbank:GeneID:5247168 Probab=95.97 E-value=0.0012 Score=36.64 Aligned_cols=324 Identities=10% Similarity=0.071 Sum_probs=148.8 Q ss_pred HHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccccccccchhhhHHHHHHHHhhhhHHhhcceeecCCCceEE Q lcl|NC_021309. 118 FNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSY 197 (497) Q Consensus 118 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~ 197 (497) .+ .........+.... + ........+....|-|.....+...+.+.+.+++++++++++.-.++. T Consensus 1 M~---------~~tr~~~~~y~~~~----A--~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~ 65 (339) T protein:vir:79 1 MR---------NDTRRLFAAYKAAI----A--KLNGVERVDEKFSVAPSVQQKLETKVQESSDFLKSINFYGVPEQEGEK 65 (339) T ss_pred CC---------hHHHHHHHHHHHHH----H--HHhCcccccceeeecHHHHHHHHHHHHHHHHHhccCcccccccceeeE Confidence 00 00000000111100 0 000011122234467777778888889999999999999988654443 Q ss_pred EEEcCCCccceec--ccccccccccccceeeEeeeeeEEeeehhhHHHHhhH---HHHHHHHHHHHHHHHHHHHHhhhhc Q lcl|NC_021309. 198 LTESAAHNNAAAV--AEAGTYPFSSEEFARVYEQVGKVANALTITDEGLRDA---PELFNFVQGRLLEGIQRKEEVQLLA 272 (497) Q Consensus 198 p~~~~~~~~a~wv--~Eg~~~~~s~~~f~~i~~~~~kla~~~~iS~ell~d~---~~l~~~i~~~la~~~~~~~d~~~l~ 272 (497) .-.....+-++-+ .-++..|..-..++.-...+++.-.-+.|+.+.|+.. +++...+++.+.+.++.=.-.--++ T Consensus 66 v~lg~~g~iagrtdt~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfN 145 (339) T protein:vir:79 66 IGLGVSGPVASTTDTTQQDRETSDISTMDGRRYRCEQTNSDTHITYQKLDAWAKFADFQTRIRDAIIKRQALDRIMIGFN 145 (339) T ss_pred EeeccCcceeecccCCCCCcccccccccCCCccEEEEeeeeceecHHHHHHHhcChhHHHHHHHHHHHHHhhccceeccc Confidence 3322111112111 1122223333455666667777766778888998864 6888888888888776544444444 Q ss_pred ccCccccccccccccccccccccchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhh----hhhhcccccc-cc Q lcl|NC_021309. 273 GGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVT----GAAGSGSGVA-GS 347 (497) Q Consensus 273 G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~-~~ 347 (497) |+-.....-....+- .-..+..|+..++...... +......... .. T Consensus 146 Gts~A~~Td~~~nPl-----------------------------lqDVN~GWlQ~~Re~ap~rV~~~g~~~s~~i~~~G~ 196 (339) T protein:vir:79 146 GVSRAATSDRVANPM-----------------------------LQDVNKGWLQNLREQAPQRVMKEGKAAAGKITVGGA 196 (339) T ss_pred ceeeecCCChhhCcC-----------------------------ccccchhHHHHHHhhhhhhhhccceeccceeEeccC Confidence 432211110000000 0111222222222211000 0000000000 11 Q ss_pred cchhhhhhhHHHHHHH-hhhhhhccCCceEEechhHHHH---HHHHhhhcCceeccCcccccccccccccccccccceEe Q lcl|NC_021309. 348 YPTAAEIAENVFDAFV-DIQLTLFQTPNAVVMNPRDWEL---LRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVT 423 (497) Q Consensus 348 ~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~n~~~~~~---l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~ 423 (497) ...........++++. .+...++..+.-+++--.++.. +.++ .....| .-...........+|-|+|.+. T Consensus 197 ggdy~NLDalV~d~~~~lId~~~~~d~dLVvivG~dLla~k~~~l~-n~~~~p-----tE~~Aa~~i~s~k~iGGl~a~~ 270 (339) T protein:vir:79 197 GADYGNLDALVYDITNHLVEPWYAEDPDLVVVCGRNLLSDKYFPLV-NRDRDP-----VQQIAADLIISQKRIGNLPAIR 270 (339) T ss_pred CCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhhhHhhhHh-hcCCCh-----HHHHHHHHHHHhhhhCCceeEE Confidence 1122222233445554 3466667666654443333222 2222 211111 1111111222235899999999 Q ss_pred cCCCCcCceEEEeeccceEEEEeeccc-EEEeecccchhhhcCceEEEEEEeecceeecccceEEEEeeCCCCCC Q lcl|NC_021309. 424 TPLIPLGTILVGHFAPSVIQTARREGV-TMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 424 ~~~~~~~~~~~gd~~~~~~~i~~r~~~-~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~~a~~~ 497 (497) .+++|.+.+++=-|+- +.|+...+- +-.+-+.. .+|++--.=..=-++.|-+++.++.+.=.+.+.|. T Consensus 271 ~PfFP~~~llVT~L~N--LsIY~Q~gs~RR~~~d~p----~r~rie~y~s~Ne~YvVEd~~~~a~iEni~~~~aa 339 (339) T protein:vir:79 271 VPYFPANGLLVTRLDN--LSIYYQEGGRRRTILDNA----KRDRIENYESSNDAYVIEDLACAAMAENIALAAAA 339 (339) T ss_pred ccccCCCceEEeechh--cEEEEecCcEEEEEEecc----ccccccchhhccceeeeeccccEEEeeeeecccCC Confidence 9999999988776665 333333332 22221111 13333333334456677788877777655555555 No 188 >protein:vir:2016 Length: 357 # NCBI annotation: gpN # Family: family:all:201 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046760;genbank:gi:9630331;genbank:GeneID:1261541 Probab=95.77 E-value=0.0015 Score=36.10 Aligned_cols=324 Identities=12% Similarity=0.029 Sum_probs=143.5 Q ss_pred HHHHHHHHHHhhhhhhhhhhhhccccc---cccccccchhhhHHHHHHHHhhhhHHhhcceeecCCCceEEEEEcCCCcc Q lcl|NC_021309. 130 AAAELMGAFADGETAPAAIGQNPFGST---GTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNN 206 (497) Q Consensus 130 ~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~g~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~ 206 (497) ...+.+..+.... .. .+...+.. .+....|-|.....+...+.+.+.+++++++++++--.++..-.....+- T Consensus 1 M~~~tr~~~~~y~-~~---~A~~ngv~~~d~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lg~~g~i 76 (357) T protein:vir:20 1 MRQETRFKFNAYL-SR---VAELNGIDAGDVSKKFTVEPSVTQTLMNTMQESSDFLTRINIVPVSEMKGEKIGIGVTGSI 76 (357) T ss_pred CChHHHHHHHHHH-HH---HHHHhCCChHHhcceeecCHHHHHHHHHHHHHHHHHhccCCccccccceeeEEecccCccc Confidence 0000000110000 00 00011111 11234466777788888889999999999999988654444332211122 Q ss_pred ceecc--cccc-cccccccceeeEeeeeeEEeeehhhHHHHhhH---HHHHHHHHHHHHHHHHHHHHhhhhcccCccccc Q lcl|NC_021309. 207 AAAVA--EAGT-YPFSSEEFARVYEQVGKVANALTITDEGLRDA---PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVN 280 (497) Q Consensus 207 a~wv~--Eg~~-~~~s~~~f~~i~~~~~kla~~~~iS~ell~d~---~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~ 280 (497) ++-+. -+.. .|..-..++.-...+++.-.-+.|+.+.|+.. +++...+++.+.+.++.=.-.--++|+-..... T Consensus 77 agrtdT~~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~T 156 (357) T protein:vir:20 77 ASTTDTAGGTERQPKDFSKLASNKYECDQINFDFYIRYKTLDLWARYQDFQLRIRNAIIKRQSLDFIMAGFNGVKRAETS 156 (357) T ss_pred cccccCCCCCCcccccccccCCCccEEEEeeecccccHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeeeccC Confidence 22111 1111 22222345666666777666678888988864 678888888888877654444444443211111 Q ss_pred cccccccccccccccchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhh---------hhhhcccccccccchh Q lcl|NC_021309. 281 GLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVT---------GAAGSGSGVAGSYPTA 351 (497) Q Consensus 281 Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~ 351 (497) -....+ ..-..+..|+..++...... +............... T Consensus 157 d~~~nP-----------------------------llqDVN~GWlQ~~Re~ap~rVm~~~~~~~g~~~~~~i~~G~~gdy 207 (357) T protein:vir:20 157 DRSSNP-----------------------------MLQDVAVGWLQKYRNEAPARVMSKVTDEEGRTTSEVIRVGKGGDY 207 (357) T ss_pred ChhhCc-----------------------------CccccchhHHHHHHhhchhhhhccccccccccccceeeecCCCCc Confidence 000000 01111222222222211100 0000000001111122 Q ss_pred hhhhhHHHHHHHh-hhhhhccCCceEEechhHHHHHH--HHhhhcCceeccCcccccccccccccccccccceEecCCCC Q lcl|NC_021309. 352 AEIAENVFDAFVD-IQLTLFQTPNAVVMNPRDWELLR--LTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIP 428 (497) Q Consensus 352 ~~~~~~~~~~~~~-~~~~~~~~~~~~~~n~~~~~~l~--~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~ 428 (497) ..+....++++.. +...++..+..+++--.++..-. .|....+.| .-...........+|-|+|.+..+++| T Consensus 208 ~NLDalV~D~~~~lI~~~~~~d~dLVvivG~dLla~k~~~l~n~~~~p-----tE~~Aa~~i~s~k~iGGl~a~~~PfFP 282 (357) T protein:vir:20 208 ASLDALVMDATNNLIEPWYQEDPDLVVIVGRQLLADKYFPIVNKEQDN-----SEMLAADVIISQKRIGNLPAVRVPYFP 282 (357) T ss_pred ccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhhhhhhhHhhccCCh-----HHHHHHHHHHHhhhhCCceeEEccccC Confidence 2222334455543 46666766665444433322211 221222111 111111122224589999999999999 Q ss_pred cCceEEEeeccceEEEEeeccc-EEEeecccchhhhcCceEEEEEEeecceeecccceEEEEeeC------CCCCC Q lcl|NC_021309. 429 LGTILVGHFAPSVIQTARREGV-TMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKK------GATGS 497 (497) Q Consensus 429 ~~~~~~gd~~~~~~~i~~r~~~-~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~------~a~~~ 497 (497) .+.+++=-|+- +.|+...+- +-.+-+.. .+|++--.=..=-++.|-+++.++.+.-.+ ++.+. T Consensus 283 ~~~ilVT~L~N--LsIY~Q~gs~RR~~~d~p----~r~riE~y~s~Ne~YvVEd~~~~a~iE~i~~~~~~~p~~~~ 352 (357) T protein:vir:20 283 ADAMLITKLEN--LSIYYMDDSHRRVIEENP----KLDRVENYESMNIDYVVEDYAAGCLVEKIKVGDFSTPAKAT 352 (357) T ss_pred CCceEEeeccc--cEEEEecCcEEEEEEecc----ccccccchhhhcceeeeeccccEEEeeeeeeccccCCccCC Confidence 99988766664 333333332 21221111 123333222233455566666666554322 22222 No 189 >protein:vir:100331 Length: 342 # NCBI annotation: major capsid protein N # Family: family:all:201 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655472;genbank:gi:109289940;genbank:GeneID:4157374 Probab=95.15 E-value=0.0027 Score=34.71 Aligned_cols=323 Identities=14% Similarity=0.089 Sum_probs=147.4 Q ss_pred HHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccc----cc-cccccchhhhHHHHHHHHhhhhHHhhcceeecCC Q lcl|NC_021309. 118 FNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGST----GT-FAPGILPTFLPGIVEQLFYELSLADLISSRPVTS 192 (497) Q Consensus 118 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~-~g~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~ 192 (497) .+ .+.+..+... . ...+...+.. +. --..|-|.....+...+.+.+.+++++++++++. T Consensus 1 M~------------~~tr~~~~~y-~---~~~A~~ngv~~~~~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e 64 (342) T protein:vir:10 1 MK------------DLTLEKYNAY-L---ARQAELNNLPFNALATGIKFTVQPSVQQKLYEKVRESSDFLKSISFVFVDE 64 (342) T ss_pred CC------------hHHHHHHHHH-H---HHHHHHhCCChhHccccceeecChHHHHHHHHHHHHHHHHhccCccccccc Confidence 00 0000001000 0 0000111111 11 1234667777788888899999999999999886 Q ss_pred CceEEEEEcCCCccceecc---cccccccccccceeeEeeeeeEEeeehhhHHHHhhH---HHHHHHHHHHHHHHHHHHH Q lcl|NC_021309. 193 PNLSYLTESAAHNNAAAVA---EAGTYPFSSEEFARVYEQVGKVANALTITDEGLRDA---PELFNFVQGRLLEGIQRKE 266 (497) Q Consensus 193 ~~~~~p~~~~~~~~a~wv~---Eg~~~~~s~~~f~~i~~~~~kla~~~~iS~ell~d~---~~l~~~i~~~la~~~~~~~ 266 (497) -.++..-.....+-++-+. -+...|..-..++.-...+++.-.-+.|+.+.|+.. +++...+++.+.+.++.=. T Consensus 65 ~~Ge~i~lg~~g~iagrtdT~~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~ 144 (342) T protein:vir:10 65 QTGETLGLDSAHTVASTTDTSGDGERKTTSIAKLVKQTYHCQQINFDTHINYKQLDMWAKFPDFQQKVANVAAKQRKRDL 144 (342) T ss_pred ceeeEEecccCcccccccccCCCCCcccccccccCCCccEEEEeeecccccHHHHHHHhcChhHHHHHHHHHHHHHhhcc Confidence 5444433221112222221 112233333456666777777777778899998864 6888888888888776544 Q ss_pred HhhhhcccCccccccccccccccccccccchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhh---hhhhcccc Q lcl|NC_021309. 267 EVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVT---GAAGSGSG 343 (497) Q Consensus 267 d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~ 343 (497) -.--++|+-.....-....+- .-..+..|+..++...... ........ T Consensus 145 i~IGfNGts~A~~Td~~~nPl-----------------------------lqDVN~GWlQ~~Re~ap~rv~~~~~~~~~i 195 (342) T protein:vir:10 145 IMIGFNGTSRAATSDRNSNPL-----------------------------LQDVAKGWLQKMREDAKERVMNGESTDNQV 195 (342) T ss_pred ceecccceeeccCCChhhCcC-----------------------------ccccchHHHHHHHhhhhhhhcccceeccce Confidence 444445432211110000000 0111122222222111110 00000000 Q ss_pred cccccchhhhhhhHHHHHHHh-hhhhhccCCceEEechhHHHH---HHHHhhhcCceeccCccccccccccccccccccc Q lcl|NC_021309. 344 VAGSYPTAAEIAENVFDAFVD-IQLTLFQTPNAVVMNPRDWEL---LRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGV 419 (497) Q Consensus 344 ~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~n~~~~~~---l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~ 419 (497) ...............++++.. +...++..+.-+++--.++.. +..+.. .+. +.-...........+|-|+ T Consensus 196 ~iG~~gdy~NLDalV~D~~~~lI~~~~~~d~dLVvivG~dLladk~~~l~n~-~~~-----ptE~~Aa~~i~s~k~iGGl 269 (342) T protein:vir:10 196 LVGKGQEYANLDALVMDATEELIDEWHRDDTDLVVITGRKLLADKYFPIVNQ-QNA-----PTEELAADIVISQKRIGGL 269 (342) T ss_pred eecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhHHHHHHHHhc-CCC-----hHHHHHHHHHHhhhhhcCc Confidence 111112222223334455543 466666666654443333222 111211 111 1111111222224589999 Q ss_pred ceEecCCCCcCceEEEeeccceEEEEeeccc-EEEeecccchhhhcCceEEEEEEeecceeecccceEEEEeeCCCCCC Q lcl|NC_021309. 420 PVVTTPLIPLGTILVGHFAPSVIQTARREGV-TMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 420 Pvv~~~~~~~~~~~~gd~~~~~~~i~~r~~~-~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~~a~~~ 497 (497) |.+..+++|.+.+++=-|+- +.|+...+- +-.+-+.. .+|++--.=..=-++.|-+++.++.+.-.+.+..- T Consensus 270 ~a~~~PfFP~~~ilVT~L~N--LsIY~Q~gs~RR~~~d~p----~r~rie~y~s~Ne~YvVEd~~~~a~iE~i~i~~~~ 342 (342) T protein:vir:10 270 KAVRVPFFPANAILITKLEN--LAIYVQEGTTRKHIENVP----KKDRIETYESENIDYVVEDYGCAALIENITLKDKE 342 (342) T ss_pred eeEEccccCCCceEEeeccc--cEEEEecCcEEEEEEecc----ccccccchhhhccceeeeccccEEEeecceecCCC Confidence 99999999999988766664 333333332 21221111 12333333333445566677776666544444434 No 190 >protein:vir:1781 Length: 221 # NCBI annotation: minor capsid protein # Family: family:all:975 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570347;genbank:gi:18640506;genbank:GeneID:932719 Probab=95.01 E-value=0.0023 Score=35.02 Aligned_cols=190 Identities=15% Similarity=0.128 Sum_probs=89.1 Q ss_pred EEeeehhhHHHHhh------HHHHHHHHHHHHHHHHHHHHHhhhhc----ccCccccccccccccccccccccchhhhhh Q lcl|NC_021309. 233 VANALTITDEGLRD------APELFNFVQGRLLEGIQRKEEVQLLA----GGGYPGVNGLLQRSTGFTASSASSLFGATS 302 (497) Q Consensus 233 la~~~~iS~ell~d------~~~l~~~i~~~la~~~~~~~d~~~l~----G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~ 302 (497) +- -.-+|+-++.| ..++.+...+++.++++...|+.++. +..+..|..--+ +..+.... T Consensus 1 iD-~lL~a~~~VdDiD~aqa~~dvr~e~t~e~G~ALA~~~D~~i~~~~~~aA~~~~p~~~~~--~g~~~~~~-------- 69 (221) T protein:vir:17 1 MD-DLLVASQFVYDLDEILAQWNTRSEISKQIGEALAIHYDERIARVLASASIAAAPVTGQD--GGFSVNIG-------- 69 (221) T ss_pred CC-cchhHHHHHHhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCcccccc--cCcceecc-------- Confidence 11 12355555543 24688999999999999999998863 111111100000 00000000 Q ss_pred hHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCc-eEEechh Q lcl|NC_021309. 303 ATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPN-AVVMNPR 381 (497) Q Consensus 303 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~n~~ 381 (497) .....+...+.+.++++...+....--... +++++|. T Consensus 70 ------------------------------------------a~~t~~~~~l~dai~~a~~~LdekdVP~~gR~~vv~P~ 107 (221) T protein:vir:17 70 ------------------------------------------AGNTNNAQAIVDGFFEAAAVLDERSAPMDGRVAVLSPR 107 (221) T ss_pred ------------------------------------------ccccCCHHHHHHHHHHHHHHHhhcCCCCCCCEEEeCcH Confidence 000011122234455555444443333233 4567888 Q ss_pred HHHHHHHHhhh-cCceeccCccccccccccc--ccccccccceEecCCCCc--CceEEEeeccceEEEEeecccEEEeec Q lcl|NC_021309. 382 DWELLRLTKDA-NGQYMGGNFFGNAYGNPVN--GGKNIWGVPVVTTPLIPL--GTILVGHFAPSVIQTARREGVTMQMTN 456 (497) Q Consensus 382 ~~~~l~~lkd~-~G~~i~~~~~~~~~~~~~~--~~~~l~G~Pvv~~~~~~~--~~~~~gd~~~~~~~i~~r~~~~i~~~~ 456 (497) .+..|.+-.|. -.++-+. +..+.... ....+.|++|+.|+.+|. ++-+..+-........+.. . T Consensus 108 ~y~~LL~~~d~~~~n~d~~----~s~g~~~~g~~i~~v~G~~V~~SnnlP~~~gt~~~~~ag~~~~~~~~~~-------~ 176 (221) T protein:vir:17 108 QYYSLISSVDTNILNREIG----NTQGDMNTGKGLYVNAGIRIYKSNVLASLYGTNLVTDPGDATTSGENNG-------S 176 (221) T ss_pred HHHHHHHhcCcceeeeecc----cccccccccceeeeecCcEEEEeccCCcccccccccCCccccccccccc-------c Confidence 88877753221 1112121 11111111 234688999999999996 3322211111000000000 1 Q ss_pred ccchhhhcCceEEEEEEeecceeecccceEEEEeeCCCCCC Q lcl|NC_021309. 457 SNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 457 ~~~~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~~a~~~ 497 (497) +.+ +|. +.+ +.|+||+|+--+++-.+++-. T Consensus 177 yr~-~fs-~~~---------glv~~~~Avgtvkl~~~~~~~ 206 (221) T protein:vir:17 177 YRP-AIT-DRA---------GLVFHKEAADTVEVLLPPSRP 206 (221) T ss_pred ccc-ccc-ceE---------EEEEcchheeeeeeecCCCCC Confidence 111 122 111 678999999888888777665 No 191 >protein:vir:96079 Length: 382 # NCBI annotation: hypothetical protein ORF023 # Family: family:all:1653 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294440;genbank:gi:149408337;genbank:GeneID:5237198 Probab=94.98 E-value=0.0019 Score=35.50 Aligned_cols=338 Identities=12% Similarity=0.059 Sum_probs=127.4 Q ss_pred HHHHHHHHHHHHHHHHHhhhhhhHHHHhHhhhhhhhh----hhhhHHHHHHhhhHHHH-HHHHHHHHHhh--hhhhhhhh Q lcl|NC_021309. 77 IPEVEVRNLKQIRKHLARAVIMNPELKNATSFEKGTK----FDVSFNVSAKAADPGTA-AAELMGAFADG--ETAPAAIG 149 (497) Q Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~-~~~~~~~~~~~--~~~~~~~~ 149 (497) ++... ....++. .+.......... .....+... .+.... ....+. ..+. .....+.. T Consensus 1 ~~~~~-----~~~~~~~--------~~~~~~~~~~~~~~~~~~~l~~~gi--~~~~~~~~~~~~~-~~~~~~~~~~~amD 64 (382) T protein:vir:96 1 MSHIS-----KTHSRLA--------GRHAKPFDLKNVTHEAVAALGRIGL--VFDHAVVQDQIKA-LAKAGAFRSGSAMD 64 (382) T ss_pred CCCcc-----eeeeecC--------CccccchhhhcccHHHHHHHhcccc--ccCcccchhHhhh-hhhhhhhhhhcccc Confidence 00000 0000000 000000000000 000000000 000000 000000 0000 00001111 Q ss_pred hhccccccccccccchh----hhHHHHHHHHhhhhHHhhcceeecCC---CceEEEEEcCCCccceeccccccccccccc Q lcl|NC_021309. 150 QNPFGSTGTFAPGILPT----FLPGIVEQLFYELSLADLISSRPVTS---PNLSYLTESAAHNNAAAVAEAGTYPFSSEE 222 (497) Q Consensus 150 ~~~~~~~~~~g~~v~p~----~~~~ii~~~~~~~~l~~~~~~~~~~~---~~~~~p~~~~~~~~a~wv~Eg~~~~~s~~~ 222 (497) ....+..+.++.-+|-. +.+.+++-+.+......++++.+.+. ..+.|+.... .+.|.+.+-+.+.|..+.. T Consensus 65 a~~~~~~t~~~~g~p~~~l~~~~p~~~~~~~~p~~~~~l~pv~t~g~W~~~t~ty~~~e~-~G~A~~ygd~~D~Pl~d~~ 143 (382) T protein:vir:96 65 SNFTAPVTTPSIPTPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEP-AGTAVEYGDHTNIPLTSWN 143 (382) T ss_pred cccCCccccCCccHHHHHHhhhhhhhhhhhhhhhhhhhhccccccCCccceEEEEeeeec-ccceEEeecccCCCccccc Confidence 11111111222223332 34667888888878888888776543 2456766554 3677888888888887655 Q ss_pred ceeeEeeeeeEEeeehh-hHHHHhhH---HHHHHHHHHHHHHHHHHHHHhhhhcccCc---ccccccccccccccccccc Q lcl|NC_021309. 223 FARVYEQVGKVANALTI-TDEGLRDA---PELFNFVQGRLLEGIQRKEEVQLLAGGGY---PGVNGLLQRSTGFTASSAS 295 (497) Q Consensus 223 f~~i~~~~~kla~~~~i-S~ell~d~---~~l~~~i~~~la~~~~~~~d~~~l~G~G~---~~p~Gi~~~~~~~~~~~~~ 295 (497) .+..+-..+.+.....+ ..|+.+-+ .++.+--+.-.++++...+|+-.++|+-. +..-|++|.+......... T Consensus 144 ~~~~~r~v~~~~~g~~yg~lE~~rAa~~~~~l~~~Ka~aA~~ale~~~N~i~f~G~~~g~~~~~yGllNdP~l~a~~t~a 223 (382) T protein:vir:96 144 ANFERRTIVRGELGLLVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPPFQTPP 223 (382) T ss_pred cceeEEEEEEEEEeeeecHHHHHHHHhhCCCcHHHHHHHHHHHHHHhhceEEEEeeecCcCcceEEEEeCCCcccccccC Confidence 44444444555544555 45666543 35666667778888889999999999632 2466999988643221110 Q ss_pred chhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhc----c Q lcl|NC_021309. 296 SLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLF----Q 371 (497) Q Consensus 296 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~ 371 (497) ... ....+...+.+++..++..+..... . T Consensus 224 ~~~-----------------------------------------------Wa~kT~~eI~~Di~~l~~~i~~qt~G~~~~ 256 (382) T protein:vir:96 224 SQG-----------------------------------------------WATADWAGIIGDIREAVRQLRIQSQDQIDP 256 (382) T ss_pred CCC-----------------------------------------------cccccHHHHHHHHHHHHHHHHhccCCeeee Confidence 000 0112223334444444444433221 0 Q ss_pred -C-CceEEechhHHHHHHHHhhhcCceeccCcccccccccccccccccccceEecCCCCc-CceEEEeeccceEEEEeec Q lcl|NC_021309. 372 -T-PNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPL-GTILVGHFAPSVIQTARRE 448 (497) Q Consensus 372 -~-~~~~~~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~-~~~~~gd~~~~~~~i~~r~ 448 (497) + +..+.+-+.-+..|.. .+..|-=++.-. ..+..++.++..+.+.. +..--|.... .|...+.. T Consensus 257 ~~~~~~L~LP~~~~~~Ls~-~n~~g~Tvl~~l-----------k~n~Pnl~i~t~peL~~a~~~g~g~~~~-~~~~~~e~ 323 (382) T protein:vir:96 257 KAEKITMALATSKVDYLSV-TTPYGISVSDWI-----------EQTYPKMRIVSAPELSGVQMQGKTPEDA-LVLFVEEV 323 (382) T ss_pred cccceEEeechHHHhhccc-cCccCccHHHHH-----------HHhcCCcEEEEccccccccCCCccceeE-EEEecchh Confidence 0 1123344443333321 111111011000 00011222222222210 0000000000 01111100 Q ss_pred ccEEEeecccchhhhc--------------C-ceEEEEEEe-ecceeecccceEEEEee Q lcl|NC_021309. 449 GVTMQMTNSNGTDFVD--------------G-KVTVRAEER-LGLLVYRPSAFQLIQLK 491 (497) Q Consensus 449 ~~~i~~~~~~~~~f~~--------------~-~v~~r~~~r-~~~~v~~~~a~~~l~~~ 491 (497) ...+..+....-.|.+ . .+..-+..| .|..|++|.||++++.- T Consensus 324 ~~~~~~s~~~p~~f~q~~p~~~~~l~ve~~~~~~~~~~s~~t~Gv~i~~P~ai~~~~GI 382 (382) T protein:vir:96 324 DASVDGSTDGGSVFSQLVQSKFITLGVEKRAKSYVEDFSNGTAGALCKRPWAVVRYLGI 382 (382) T ss_pred hhhcccccccCcceeccccceeeeccceeecceeEeccccceeeeEEEcchhhhhccCC Confidence 0000011101111211 0 011112233 45555679999999888 No 192 >protein:vir:98856 Length: 343 # NCBI annotation: hypothetical protein # Family: family:all:201 # MgeID: mge:1495 # MgeName: F108 # Cross-refs: genbank:acc:YP_654732;genbank:gi:109302917;genbank:GeneID:4156061 Probab=94.54 E-value=0.0041 Score=33.65 Aligned_cols=317 Identities=13% Similarity=0.020 Sum_probs=138.7 Q ss_pred HHHHHHHHHHhhhhhhhhhhhhcccc-----ccccccccchhhhHHHHHHHHhhhhHHhhcceeecCCCceEEEEEcCCC Q lcl|NC_021309. 130 AAAELMGAFADGETAPAAIGQNPFGS-----TGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAH 204 (497) Q Consensus 130 ~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~g~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~ 204 (497) ...+.+..+.... ... +...+. ..+.-..|.|.....+...+.+.+.++++++++++..-...+.-..... T Consensus 1 M~~~tr~~~~~y~-~~~---A~~ngv~~~~~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~q~~g~v~~~~~sg 76 (343) T protein:vir:98 1 MNKTAQELFYSLI-GDA---AEYYGANPALALAGKQFSIEAPKESVLLGAIQQRSNFLEKINCVFSERYQRAIDLRSNRK 76 (343) T ss_pred CChHHHHHHHHHH-HHH---HHHhCCccchhccCceeeecHHHHHHHHHHHHHHHHHhhcCceecchhhcceEEEeecCc Confidence 0000000110000 000 001111 1111245677777788888888899999999999864333332211111 Q ss_pred ccceecccccc-cccccccceeeEeeeeeEEeeehhhHHHHhhH---HH-HHHHHHHHHHHHHHHHHHhhhhcccCcccc Q lcl|NC_021309. 205 NNAAAVAEAGT-YPFSSEEFARVYEQVGKVANALTITDEGLRDA---PE-LFNFVQGRLLEGIQRKEEVQLLAGGGYPGV 279 (497) Q Consensus 205 ~~a~wv~Eg~~-~~~s~~~f~~i~~~~~kla~~~~iS~ell~d~---~~-l~~~i~~~la~~~~~~~d~~~l~G~G~~~p 279 (497) ..++-....+. .... ..+.-...+++.-.-+-|+-+.|+.. ++ +...+++.+.+.++.=.-.--++|+-.... T Consensus 77 ~~t~r~~t~~~~~~~~--~~~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~deF~~r~~~~i~~~~ALD~i~IGfNGts~A~~ 154 (343) T protein:vir:98 77 RHYGAHDRRTPIQQRW--TRQVMSMNVSRQIQACLIPWAKLDQWGHLKDKFASLYAEFVQNQIALDMIKIGFYGTSVGTD 154 (343) T ss_pred cccCccccCCCccccc--cCCCCccEEEEeeeeeeccHHHHHHhhcChhHHHHHHHHHHHHHHhhccceecccceeeccC Confidence 11111111000 0000 01111344555555567888888764 56 888888888877654333334444322111 Q ss_pred ccccccccccccccccchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhh----hhc---ccccccccchhh Q lcl|NC_021309. 280 NGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGA----AGS---GSGVAGSYPTAA 352 (497) Q Consensus 280 ~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~---~~~~~~~~~~~~ 352 (497) . + -.. .-..+..|+..++......-. ... ..+....+.+.+ T Consensus 155 T---~----------nPl-------------------lqDVN~GWLQ~~Re~ap~rVm~~~~~~~~~~~~G~ggdy~NLD 202 (343) T protein:vir:98 155 T---S----------DPN-------------------LADVNKGWIQFVRENKATQILTQGATSGEIRLFGEGADYVNLD 202 (343) T ss_pred C---C----------Ccc-------------------hhhcchHHHHHHHhcchhhhhccceeccceeEecCCCCcccHH Confidence 0 0 000 011112222222221110000 000 001111233333 Q ss_pred hhhhHHHHHHHhhhhhhccCCceEEechhHHHHHH--HHhhhcCceeccCcccccccccccccccccccceEecCCCCcC Q lcl|NC_021309. 353 EIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLR--LTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLG 430 (497) Q Consensus 353 ~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~--~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~ 430 (497) ... +++...+...++..+..+++--.++..-. .+-...+++ +.-...........++-|+|.+..+++|.+ T Consensus 203 alV---~D~~~~I~~~~~~d~dLVvivG~dLla~~~~~l~n~~~~~----ptEk~Aa~~~~~~k~iGGl~a~~~PfFP~~ 275 (343) T protein:vir:98 203 ELA---YDLKQGLDARHRDAGDLVFLVGADLVAKEASLVYKGNGLI----ATEKAALNTHDLMKSFGGMPAMIVPNMPPR 275 (343) T ss_pred HHH---HHHHhcCchHHhcCCCEEEEEchhhhhhhhhhhhhhcCCC----hHHHHHHHHHHHHHhhCCCeeEEccccCCC Confidence 333 33445567777777776555444432222 111222211 100001111222357999999999999999 Q ss_pred ceEEEeeccceEEEEeeccc-EEEeecccchhhhcCceEEEEEEeecceeecccceEEEEeeCCCCCC Q lcl|NC_021309. 431 TILVGHFAPSVIQTARREGV-TMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 431 ~~~~gd~~~~~~~i~~r~~~-~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~~a~~~ 497 (497) .+++=-|+- +.|+...+- +-.+-+.. .+|++--.=..=-|+.|-+++.++.+...+.+-+. T Consensus 276 ~llVT~L~N--LsIY~Q~gs~RR~~~d~p----~r~rie~y~s~Ne~YvVEd~~~~a~iE~i~v~~~~ 337 (343) T protein:vir:98 276 AAIVTSLSN--LSIYTQEGSMRRGMKDDD----DKKAVRDSYYRNEAYAVEDCGKFMAVDFTKVKLSS 337 (343) T ss_pred ceEEeeccc--cEEEEecCcEEEEEEecc----ccccccchhhhcceeeeeccccEEEeeeeeeeecC Confidence 988766665 333333332 22221111 13344333334456677788887777554443333 No 193 >protein:vir:95131 Length: 325 # NCBI annotation: hypothetical protein ORF010 # Family: family:all:47 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293417;genbank:gi:148912838;genbank:GeneID:5228206 Probab=94.02 E-value=0.0056 Score=32.92 Aligned_cols=273 Identities=11% Similarity=-0.049 Sum_probs=105.4 Q ss_pred ccccccccccccchhhhHHHHHHHHhhhhHHhhc-------ceeecCCCceEEEEEcCCCc---cceecccccccccccc Q lcl|NC_021309. 152 PFGSTGTFAPGILPTFLPGIVEQLFYELSLADLI-------SSRPVTSPNLSYLTESAAHN---NAAAVAEAGTYPFSSE 221 (497) Q Consensus 152 ~~~~~~~~g~~v~p~~~~~ii~~~~~~~~l~~~~-------~~~~~~~~~~~~p~~~~~~~---~a~wv~Eg~~~~~s~~ 221 (497) ...+.-. +..|....-.++.+.+...+.... ...+..++-+..|-...-.+ ...-+.+.+..+.++. T Consensus 1 m~lsD~~---vfN~~~~~a~~e~~~q~~~~fn~as~gai~l~~~~~~Gd~~~~pf~~~l~g~~~~~~~~~~~~~vt~~ki 77 (325) T protein:vir:95 1 MALSDLA---VYSEYAYSAFSETLRQQVDLFNTATGGAIMLQSAAHQGDFSDVAFFAKVTGGLVRRRNAYGSGTVAEKVL 77 (325) T ss_pred Cchhhhh---hhhhhhhhhhhhhhhhhHhhhhhcccceeEeccccccCceeeccccccccccccccccCCCCceecccee Confidence 0000000 011222233333333333222221 12233344445555432111 1122333333433332 Q ss_pred -cceeeEeeeeeEEeeehh--hHHHHh-hH-HHHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccc Q lcl|NC_021309. 222 -EFARVYEQVGKVANALTI--TDEGLR-DA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASS 296 (497) Q Consensus 222 -~f~~i~~~~~kla~~~~i--S~ell~-d~-~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~ 296 (497) +..++......=.++... +..+.. +. ..+...|.+.+++...+.+-+.++.+-. +.+...+........ T Consensus 78 tt~~~~av~~~r~~g~~~~d~~~~~~g~~~~~~~~~~Ig~~~a~~~~~~~l~~~~~~l~-----~a~~~~~~~v~dis~- 151 (325) T protein:vir:95 78 KHLVDTSVKVAAGTPPVRLDPGQFRWIQQNPEVAGAAMGQQLAVDTMADMLNVGLGSVY-----SALSQVSDVVYDATA- 151 (325) T ss_pred ccccceeeEEecccCcccccHHHHhhcCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----Hhhcccccceeeeec- Confidence 344444443333332222 222221 22 2455566666666554444333332210 011100000000000 Q ss_pred hhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceE Q lcl|NC_021309. 297 LFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAV 376 (497) Q Consensus 297 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 376 (497) .............+.++...+-- ....-..| T Consensus 152 ------------------------------------------------~~~~~~~~~s~~~l~~A~~klGD-~~~~l~~~ 182 (325) T protein:vir:95 152 ------------------------------------------------NTDAADKLPTWNNLNNGQAKFGD-QSSQIAAW 182 (325) T ss_pred ------------------------------------------------ccCcccccccHHHHHHHHHHhcc-cccceeEE Confidence 00000000001122222222211 12223469 Q ss_pred EechhHHHHHHHHhhhcCceeccCcccccccccccccccccccceEecCCCCcCc------eEEEeeccceEEEEeeccc Q lcl|NC_021309. 377 VMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGT------ILVGHFAPSVIQTARREGV 450 (497) Q Consensus 377 ~~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~------~~~gd~~~~~~~i~~r~~~ 450 (497) +||...+..|.+++-.+...++... +. . .-++.+|+|||+++.+|... +..--|.++++.+.+..+. T Consensus 183 ~MHS~v~~~L~~~~L~~~~~~~~~~--g~---~--~i~t~~G~~VIVdD~~p~~~~g~~~~ytty~lg~GAi~~~~~~~~ 255 (325) T protein:vir:95 183 IMHSTPMHKLYGSNLTNGERLFTYG--TV---N--VVRDPFGKLLVMTDSPNLFAAGTPNVYHILGLVPGGVLIGQNNDF 255 (325) T ss_pred EEchHHHHHHHHhhccccccccccC--Cc---c--cccccCCcEEEEeCCCCCCCccCceeEEEEEEecCeEEecCCCCc Confidence 9999999999987665544443321 11 1 22478899999999999532 2122233445555544443 Q ss_pred EEEeecccchhhhcCceEEEEEEeecceeecccceEEEEeeCCCCC-C Q lcl|NC_021309. 451 TMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGATG-S 497 (497) Q Consensus 451 ~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~~a~~-~ 497 (497) .....+.... ..-..++|++. - -++||-.+..- .+..| | T Consensus 256 ~~~~~~~~~~--~~~~~~~~~~~--t-f~lhp~G~sw~---~s~~g~s 295 (325) T protein:vir:95 256 DANEETKNGD--ENIIRTYQAEW--S-YNIGVKGFAWD---KANGGKS 295 (325) T ss_pred cccccccCcc--cceeeeeeeee--e-EEeecceeeee---cccccCC Confidence 3332222211 11223444322 2 36789988772 22222 2 No 194 >protein:vir:99576 Length: 388 # NCBI annotation: hypothetical protein # Family: family:all:1653 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039801;genbank:gi:126011051;genbank:GeneID:4818271 Probab=93.74 E-value=0.0025 Score=34.85 Aligned_cols=344 Identities=12% Similarity=0.030 Sum_probs=135.5 Q ss_pred HHHHHHHHHHHHHHHHHhhhhhhHHHHhHhhhhhhhhhhhhHHHHHHhh---hHHHHHHHHHH-HHHhhhhhhhhhhhhc Q lcl|NC_021309. 77 IPEVEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAA---DPGTAAAELMG-AFADGETAPAAIGQNP 152 (497) Q Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~-~~~~~~~~~~~~~~~~ 152 (497) ++....-. ..+........+..+..... .....+...... ..........- .+........+..... T Consensus 1 ~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~a~da~~ 71 (388) T protein:vir:99 1 MKQLSKVH-----QSLAGRSVRAFDMANGKADY----RLTDMAVRELKKFGLVFDHATVKRQIELLHEGGVATQAFDSAY 71 (388) T ss_pred CCCcccee-----eecCCcccchhhhhcCCcce----eeechhhHhhhhcceeccCccchhhhhhhhhhhhhhcccCccc Confidence 00000000 00000000000000000000 000000000000 00000000000 0000000001111111 Q ss_pred cccccccccccchh----hhHHHHHHHHhhhhHHhhcceeecCCC---ceEEEEEcCCCccceeccccccccccccccee Q lcl|NC_021309. 153 FGSTGTFAPGILPT----FLPGIVEQLFYELSLADLISSRPVTSP---NLSYLTESAAHNNAAAVAEAGTYPFSSEEFAR 225 (497) Q Consensus 153 ~~~~~~~g~~v~p~----~~~~ii~~~~~~~~l~~~~~~~~~~~~---~~~~p~~~~~~~~a~wv~Eg~~~~~s~~~f~~ 225 (497) .+..+.++.-+|-. +.+.+++.+.......+++++.+.+.- .+.++.... .+.+.+.+-+.+.|..+...+. T Consensus 72 ~~~~t~~~~gip~~~~~~~~p~~~~~~~~p~~~~~l~pv~t~g~W~~~~~~f~v~e~-~G~A~~ygd~~D~Pl~d~~~~~ 150 (388) T protein:vir:99 72 VAPTTQASIPTPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVEP-AGTAMEYGDLTNIPLSSWNVNF 150 (388) T ss_pred ccccccCcccHHHHHhhhhccceeeeeechhhhhhhccccccCCccceeEEEeeeec-ceeEEEeecccCCCceecccee Confidence 11112222113322 335677777777777777777765432 455666544 3577788888888887766666 Q ss_pred eEeeeeeEEeeehhhHHHHhhH----HHHHHHHHHHHHHHHHHHHHhhhhcccC-c--cccccccccccccccccccchh Q lcl|NC_021309. 226 VYEQVGKVANALTITDEGLRDA----PELFNFVQGRLLEGIQRKEEVQLLAGGG-Y--PGVNGLLQRSTGFTASSASSLF 298 (497) Q Consensus 226 i~~~~~kla~~~~iS~ell~d~----~~l~~~i~~~la~~~~~~~d~~~l~G~G-~--~~p~Gi~~~~~~~~~~~~~~~~ 298 (497) .+-..+.+...+.++.+=+.-+ .+|.+.-+.-.++++..++|+-.++|.. . .+.-|++|.+......... T Consensus 151 ~~r~v~~~~~g~~yg~~El~~A~~~g~~l~~~Ka~AA~~ale~~~N~i~f~G~~g~~~~~~yGllNdP~l~a~v~at--- 227 (388) T protein:vir:99 151 ERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPAIAST--- 227 (388) T ss_pred eeeeEEEEEeeeeecHHHHHHHHhhCCCcHHHHHHHHHHHHHhhhceEEEEeecCCCccceEEEeeCCCcccccccc--- Confidence 5666666666667765544322 3678888888889999999999999943 2 2577999987643221110 Q ss_pred hhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccC--C--- Q lcl|NC_021309. 299 GATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQT--P--- 373 (497) Q Consensus 299 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~--- 373 (497) ...+.......+...+.+++..++..+....... + T Consensus 228 ----------------------------------------~~~~~~~Wa~kT~~eI~~Di~~~~~~i~~qs~g~~~~~~~ 267 (388) T protein:vir:99 228 ----------------------------------------TPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDV 267 (388) T ss_pred ----------------------------------------cCCcCcccccCCHHHHHHHHHHHHHHHHHhcCCeeeeccc Confidence 0011112222334445556665555554333211 1 Q ss_pred -ceEEechhHHHHHHHHhhhcCceeccCcccccccccccccccccccceEecCCCCcCceEEEeeccceEEEEeecc--- Q lcl|NC_021309. 374 -NAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREG--- 449 (497) Q Consensus 374 -~~~~~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~~~~gd~~~~~~~i~~r~~--- 449 (497) ..+++-+.-+..|... +..|.-++.-. ..+..++.++..+.+..... -| -....+.+.++.+ T Consensus 268 ~~tL~LP~~~~~~Ls~~-n~~g~Tvl~~l-----------k~n~Pnl~i~t~pEl~~a~~-tg-g~~~~~~~~~~~~~~~ 333 (388) T protein:vir:99 268 DITLVLPMNKVDMLSVV-TDLGISVRDWL-----------KQTYPRVRVMSAPELQGGNP-DD-GKDIAYMFLDSVDTAV 333 (388) T ss_pred ceEEEechHHHHhcccc-CcCCccHHHHH-----------HHhcCCcEEEEecccccccc-cC-CceeEEEEeccccccc Confidence 1234444444444311 11111111000 00111222332222210000 00 0000011111000 Q ss_pred ---------------cEEEeecccchhhhcC-ceEEEEEEeecceee-cccceEEEEee Q lcl|NC_021309. 450 ---------------VTMQMTNSNGTDFVDG-KVTVRAEERLGLLVY-RPSAFQLIQLK 491 (497) Q Consensus 450 ---------------~~i~~~~~~~~~f~~~-~v~~r~~~r~~~~v~-~~~a~~~l~~~ 491 (497) +.+..-.. ..++ ....-+..|.+|.++ +|.||++++.- T Consensus 334 ~~~~~~~~t~~~~~p~~~~~l~v----q~~~~~~~~~~~~rt~Gv~ir~P~Ai~~~~GI 388 (388) T protein:vir:99 334 DGSTDGGDTWAQLVQSKFVTLGV----EKRVKNYVEAYSNATAGVMLKRPWAVVRLIGL 388 (388) T ss_pred ccCccCcceeEEecccccccccc----eecCceeEeccccceeeeEEeccchhheeccC Confidence 00000000 0011 233444556665555 69999999888 No 195 >protein:vir:270 Length: 341 # NCBI annotation: putative major capsid protein # Family: family:all:201 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536650;genbank:gi:17975128;genbank:GeneID:929084 Probab=93.65 E-value=0.0068 Score=32.45 Aligned_cols=319 Identities=12% Similarity=0.026 Sum_probs=134.5 Q ss_pred hhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccccccccchhhhHHHHHHHHhhhhHHhhcceeecCCCc Q lcl|NC_021309. 115 DVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPN 194 (497) Q Consensus 115 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~ 194 (497) +.. ..+. ........+..... ........+....|-|.....+...+.+.+.++++++++++..-. T Consensus 1 m~~-~m~~-------~tr~~~~~y~~~~A------~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~ 66 (341) T protein:vir:27 1 MSQ-ILTQ-------SAREYMDNFAQQLA------KSYGVSNVAELFNVSPQLETKLRAAITESAEFLKMITVTTVDQIE 66 (341) T ss_pred Ccc-cccH-------HHHHHHHHHHHHHH------HHcCcccccceEeecHHHHHHHHHHHHhhHHhhhcCcccccccee Confidence 000 0000 00000001111000 000011122234567777788889999999999999999988654 Q ss_pred eEEEEEcCCCccceecccccccccccccceeeEeeeeeEEeeehhhHHHHhhH------HHHHHHHHHHHHHHHHHHHHh Q lcl|NC_021309. 195 LSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTITDEGLRDA------PELFNFVQGRLLEGIQRKEEV 268 (497) Q Consensus 195 ~~~p~~~~~~~~a~wv~Eg~~~~~s~~~f~~i~~~~~kla~~~~iS~ell~d~------~~l~~~i~~~la~~~~~~~d~ 268 (497) ++..-.....+-++-+. .+..|. ++..+.....+++.-.-+.|+.+.|+.. ++++..+++.+.++++.=.-. T Consensus 67 Ge~v~lg~~g~iagrtd-t~R~~r-~~~l~~~~Y~c~qtn~dt~i~y~~lDaWA~~g~~~dF~~r~~~~i~~~~ALD~i~ 144 (341) T protein:vir:27 67 GQVVDVGVSGLYTGRKA-GGRFTK-QVGVGGHKYKLAETDSCAAITWAMLCQWANQGGRDQFMKHLTEFSNQMFALDIMR 144 (341) T ss_pred eeEeecccccceeeccC-CCceec-ccccCCcceEEEEeeeeeeecHHHHHHHHhcCCChHHHHHHHHHHHHHHhhhhhh Confidence 44333221111122111 122222 2356666666666666677888877642 568888888888877654444 Q ss_pred hhhcccCccccccccccccccccccccchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhh--hccccccc Q lcl|NC_021309. 269 QLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAA--GSGSGVAG 346 (497) Q Consensus 269 ~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~ 346 (497) --++|+-.....-....+- .-..+..|+..++......-.. ...++... T Consensus 145 IGfnGts~A~~Td~~anPl-----------------------------lqDVNkGWlQ~~Re~a~~rVl~~~~~~~g~~g 195 (341) T protein:vir:27 145 IGWNGVSAEADTDPSANPL-----------------------------GQDVNEGWIAFVKNRKASQVVDVDVYFDETNG 195 (341) T ss_pred hcccceeeccCCChhhccc-----------------------------ccccchhHHHHHHhhcccceeccceeeccCCC Confidence 4445532111000000000 0011122222222211000000 00011111 Q ss_pred ccchhhhhhhHHHHHHHh-hhhhhccCCceEEechhHHHHHH--HHhhhcCceeccCcccccccccccccccccccceEe Q lcl|NC_021309. 347 SYPTAAEIAENVFDAFVD-IQLTLFQTPNAVVMNPRDWELLR--LTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVT 423 (497) Q Consensus 347 ~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~n~~~~~~l~--~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~ 423 (497) .+.+. .....+++.. +...++..+.-+++-..++..-. .|-..... +........-..+|-|+|.+. T Consensus 196 dy~nL---DAlV~D~~~~lI~~~~~~d~dLVvivG~dLla~k~~~l~n~~~~-------ptE~~Aa~~i~k~iGGlpa~~ 265 (341) T protein:vir:27 196 DYRTL---DAMASDIINNQIHPMFRNDPRLTVFVGSGLIGAAQAKLYDKADK-------PSEQIAAQKLDKTIAGRPAYV 265 (341) T ss_pred ccccH---HHHHHHHHhcccChHHhcCCCEEEEEchhhhhhhhhhhhccCCC-------CHHHHHHHHHHHhhCCCeEEE Confidence 12222 2234444443 45666666654433333322211 11111111 111111111235899999999 Q ss_pred cCCCCcCceEEEeeccceEEEEeecccE-EEeecccchhhhcCceEEEEEEeecceeecccceEEEEeeCCC-------C Q lcl|NC_021309. 424 TPLIPLGTILVGHFAPSVIQTARREGVT-MQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGA-------T 495 (497) Q Consensus 424 ~~~~~~~~~~~gd~~~~~~~i~~r~~~~-i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~~a-------~ 495 (497) .+++|.+.+++=-|+- +.|+...+-. -.+-+... +|++.-. .-++.|-+-.+|..+.++.+- + T Consensus 266 ~PffP~~~~lVT~L~N--LsIY~Q~gs~RR~~~d~p~----r~rie~y---es~YvVEdyg~~~~~~~~~vkl~~~~~~~ 336 (341) T protein:vir:27 266 PPFLPDNAMVVTIPEN--LQVLTQHGTAQRKAKHESD----RKRSKTH---TGAWKVTQWVCWKRSPLTTQKKSTSALNH 336 (341) T ss_pred ccccCCCceEEeeccc--eEEEEecCcEEEEEEeccc----cccccch---hhhheeehhhhhhhccccccccCcccccc Confidence 9999999988766664 3333333322 12211111 1222111 013444444444433333222 2 Q ss_pred CC Q lcl|NC_021309. 496 GS 497 (497) Q Consensus 496 ~~ 497 (497) .| T Consensus 337 ~~ 338 (341) T protein:vir:27 337 RS 338 (341) T ss_pred cc Confidence 22 No 196 >protein:vir:108303 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:1412 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552282;genbank:gi:160700607;genbank:GeneID:5758819 Probab=92.26 E-value=0.012 Score=31.08 Aligned_cols=262 Identities=10% Similarity=-0.015 Sum_probs=111.8 Q ss_pred cccccccccchh-hhHHHHHHHHhhhhHHhhcceee-----cCCCceEEEEEcCCCccceecccccccccccccceeeEe Q lcl|NC_021309. 155 STGTFAPGILPT-FLPGIVEQLFYELSLADLISSRP-----VTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYE 228 (497) Q Consensus 155 ~~~~~g~~v~p~-~~~~ii~~~~~~~~l~~~~~~~~-----~~~~~~~~p~~~~~~~~a~wv~Eg~~~~~s~~~f~~i~~ 228 (497) .......++-|+ +..++++.+++.+++..++..-. -.++.+.+|+..... +.++......+.+-+.+++ T Consensus 1 m~~~~N~~ltp~iia~~~l~~l~~~lV~~~lv~r~y~~e~~~~GDTV~I~vp~~~~-----v~dg~~~~~~~~te~~v~l 75 (418) T protein:vir:10 1 MAVQDNNLLTDDVIAKEALRLLKNNLVMAKCVYRNYEKTFGKVGDTIRLKLPYRVK-----SASGRTLVKQPMVDQTIPF 75 (418) T ss_pred CCccccccccHHHHHHHHHHHHHHhccchhhhcCCCchHHhhCCCEEEEeeCCcee-----ecccCCccccccccceEEE Confidence 112223345455 66789999999998888776522 124678888743221 2233334334444455444 Q ss_pred e--eeeEEeeehhh-HHHHhhHHHHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccchhhhhhhHH Q lcl|NC_021309. 229 Q--VGKVANALTIT-DEGLRDAPELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATV 305 (497) Q Consensus 229 ~--~~kla~~~~iS-~ell~d~~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~ 305 (497) . -+|... +.|+ .|..++..++...+.+...++++..+|..++.- +........+.. T Consensus 76 ~id~~k~~~-~~itD~e~a~~~~d~~~~~l~~A~~aLA~~vD~~ia~l---------~~~a~~~~gt~g----------- 134 (418) T protein:vir:10 76 KIAYQEHVG-LEYTVKDKTLDIMQFSERYLKSGMVQIANQIDRSLALT---------LKKAFHSSGTPG----------- 134 (418) T ss_pred EEecccccc-eeechHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHH---------HhhcccccccCC----------- Confidence 4 344333 4444 445566677777777788899999999887621 110000000000 Q ss_pred HHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhcc-CC-ceEEechhHH Q lcl|NC_021309. 306 SNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQ-TP-NAVVMNPRDW 383 (497) Q Consensus 306 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~-~~~~~n~~~~ 383 (497) .....++++.++...+....-- .+ ...+++|..+ T Consensus 135 --------------------------------------------t~~~~~~~i~~a~~~Ld~~~VP~~G~R~lVv~P~~~ 170 (418) T protein:vir:10 135 --------------------------------------------VRPGAFIDFANAGAKQTTYAVPQDGMRHAVLDPFTC 170 (418) T ss_pred --------------------------------------------cCcchHHHHHHHHHHHHhcCCCCCCceEEEeCHHHH Confidence 0000122333333333222221 12 3457888776 Q ss_pred HHHHHHhhhcCceeccCcccccccccccccccccccceEecCCCCcCc-------e-EEEeeccceEEEEeecccEEEee Q lcl|NC_021309. 384 ELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGT-------I-LVGHFAPSVIQTARREGVTMQMT 455 (497) Q Consensus 384 ~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~-------~-~~gd~~~~~~~i~~r~~~~i~~~ 455 (497) ..|. ++... .+..... ..........++.|+.|+.++.+|..+ . +.|-... +..+.. ...+. T Consensus 171 ~~L~--~~~~~--~~~~~~~-~~~lr~G~IG~i~GF~V~~S~nip~~tag~~~~t~~v~ga~~~-~~~~~~----~~~t~ 240 (418) T protein:vir:10 171 ASLS--DEVTK--LFKESMV-EQAYKMGYRGNVAAYEVYESQNLPKHTVGDHGGTPLVNGTVVN-GDTVGF----DGGTA 240 (418) T ss_pred HHHh--hhccc--ccccccc-chhhheeeeeeeeceEEEEecCCCcccccccccceeeeccccc-ceeEEE----eecce Confidence 5543 34332 2221111 111122234579999999999999532 1 2222111 111110 00000 Q ss_pred cccchhhhcC-ceEEEE---EEeecceee-cccceEEEEee-CCCCCC Q lcl|NC_021309. 456 NSNGTDFVDG-KVTVRA---EERLGLLVY-RPSAFQLIQLK-KGATGS 497 (497) Q Consensus 456 ~~~~~~f~~~-~v~~r~---~~r~~~~v~-~~~a~~~l~~~-~~a~~~ 497 (497) ...+. ...+ .++|-+ .-++...+. ++.-|++..-. +.+.|. T Consensus 241 s~~g~-l~~Gd~~ti~gv~~v~~~t~~~~~~~~~f~V~~~~~~~~~~~ 287 (418) T protein:vir:10 241 STTGF-LKAGDVITFGGVFGVNPQNYETTGLLQEFVVLEDVDTDAGGA 287 (418) T ss_pred eeccc-eeeccEEEECceeecccccccccccceEEEEEeeccccccCc Confidence 00010 0001 111111 000000000 22333322221 111111 No 197 >protein:vir:80446 Length: 367 # NCBI annotation: BcepGomrgp07 # Family: family:all:1522 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210227;genbank:gi:146329919;genbank:GeneID:5123555 Probab=91.93 E-value=0.014 Score=30.82 Aligned_cols=292 Identities=10% Similarity=0.020 Sum_probs=107.7 Q ss_pred hccc-cccccccccchhhhHHHH-HHHHhhhhHHhhc---------ceeecCCCceEEEEEcCCCccceeccccc---cc Q lcl|NC_021309. 151 NPFG-STGTFAPGILPTFLPGIV-EQLFYELSLADLI---------SSRPVTSPNLSYLTESAAHNNAAAVAEAG---TY 216 (497) Q Consensus 151 ~~~~-~~~~~g~~v~p~~~~~ii-~~~~~~~~l~~~~---------~~~~~~~~~~~~p~~~~~~~~a~wv~Eg~---~~ 216 (497) |..- ..+.-..++.||.....+ +...+.+.|++-. .....++..+.+|....-++...-+.+.. .. T Consensus 1 M~~~~~~T~l~Dii~pEvF~~Yv~~~~~e~~~l~qSGiv~~d~~l~~~~~~gG~~v~iPf~~~L~g~~~n~~~d~~~~~~ 80 (367) T protein:vir:80 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNVEA 80 (367) T ss_pred CcchhhhhhhhhccchhhhhHHHhhhhhhhhhhhhcceeecCHHHHHHhhcCCCEEEeeeeccCCCCccccCCCCCcccc Confidence 0000 001111244555333322 2222222222210 01234556778888755444333333322 12 Q ss_pred ccccccce-eeEeeeeeEEe--eehhhHHHHhhHHHHHHHHHHHHHHHHHHHHHhhhhcccCcccccccccccccccccc Q lcl|NC_021309. 217 PFSSEEFA-RVYEQVGKVAN--ALTITDEGLRDAPELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASS 293 (497) Q Consensus 217 ~~s~~~f~-~i~~~~~kla~--~~~iS~ell~d~~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~ 293 (497) +..+.+-+ ++-...+.-.+ ...++..+-- .+....|.++++.-..+.....+|. -..||++.....+... T Consensus 81 t~~kittg~~~a~v~~r~kaw~~~Dla~~lsG--~dpm~~Ia~qva~yW~r~~q~~Lla-----~L~Gvf~~~~a~~~~~ 153 (367) T protein:vir:80 81 PIDGLGSGEMKTTKTWLNKAYGAMDLTAELAG--SNPMTRIRNRFGVYWTRQWQRRIIA-----MAVGVYKSNLAGNFAT 153 (367) T ss_pred cccccccchheeeeehhcccchhhhHHHHhhC--chHHHHHHHHHHHHhhhhhHHHHHH-----HHHHhhccccccchhh Confidence 22333222 22222222122 2234444432 2445566666665544444433332 1234444322211100 Q ss_pred ccchhh---hhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhc Q lcl|NC_021309. 294 ASSLFG---ATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLF 370 (497) Q Consensus 294 ~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 370 (497) ...... +.......+...... ..+ . .........+.++...+-.. . T Consensus 154 ~~~~~~~~a~~~~~~~~~~~Dis~-------------------------~t~-~----~~~~~s~~~~~~A~~~lGD~-~ 202 (367) T protein:vir:80 154 IKTRGRVPAEVLGTAGDMVIDISG-------------------------QTN-P----ADAVFNREAFVDAAFTMGDH-V 202 (367) T ss_pred hhhhhccccccccccCceeeeeec-------------------------cCC-C----ccceecHHHHHHHHHHhccc-c Confidence 000000 000000000000000 000 0 00000011222222111111 1 Q ss_pred cCCceEEechhHHHHHHHHhhhcCceeccCcccccccccccccccccccceEecCCCCcC-----c----eEEEeeccce Q lcl|NC_021309. 371 QTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLG-----T----ILVGHFAPSV 441 (497) Q Consensus 371 ~~~~~~~~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~-----~----~~~gd~~~~~ 441 (497) ..-.+++||+..+..|++++=-+ |+- +.. ....-+++.|++||+++.||.. + ++||. ++ T Consensus 203 ~~l~~i~mHS~V~~~L~~~~li~--~i~--~sd-----~~~~i~ty~G~~VIvDD~~Pv~~~~a~~~yttYlfg~---GA 270 (367) T protein:vir:80 203 GSIAAIAVHSMVYKRMTNNDEIE--FIP--DSK-----GQLTIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGG---AA 270 (367) T ss_pred ccccEEEEchHHHHHHHhccccc--ccc--CCC-----CccccceecceeEEEeCCCcccccCCCceEEEEEEec---ce Confidence 23457899999999998774110 111 111 1123568899999999999942 2 34433 23 Q ss_pred EEEEeec-ccEEEeecccchhhh--cCceEEEEEEeecceeecccceEEEEeeCCC-------CC--------C Q lcl|NC_021309. 442 IQTARRE-GVTMQMTNSNGTDFV--DGKVTVRAEERLGLLVYRPSAFQLIQLKKGA-------TG--------S 497 (497) Q Consensus 442 ~~i~~r~-~~~i~~~~~~~~~f~--~~~v~~r~~~r~~~~v~~~~a~~~l~~~~~a-------~~--------~ 497 (497) +...+-. ...+++.+.. +-. .++-.+.-..| .+.||-.|.+..-.-++ .| | T Consensus 271 i~~~~~~~~~~~E~~Rd~--~~~~~gG~d~L~~Rr~---~~~hP~G~s~~~~~v~~~~~~~~~~~~~~~~~sPt 339 (367) T protein:vir:80 271 FGYADGAPQVPVAVGRRE--LRGNGSGLEYILERKE---WIVHPGGFNWLDADVTIPDNTGSPSGITSGPPAIT 339 (367) T ss_pred eeecccCCccceecccch--hhhcCCceEEEEeeee---EEeecceeeecccccccccccccccccccccCCCC Confidence 3322211 1112333322 111 13334444444 58899988765332111 01 1 No 198 >protein:vir:96792 Length: 315 # NCBI annotation: major capsid protein # Family: family:all:47 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224246;genbank:gi:62362381;genbank:GeneID:3345731 Probab=91.84 E-value=0.014 Score=30.75 Aligned_cols=265 Identities=12% Similarity=0.025 Sum_probs=91.5 Q ss_pred hccccccccccccchhhhHHHHHHHHhhhhHHhhccee-------ecCCCceEEEEE-cCCCccceecccccccccccc- Q lcl|NC_021309. 151 NPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSR-------PVTSPNLSYLTE-SAAHNNAAAVAEAGTYPFSSE- 221 (497) Q Consensus 151 ~~~~~~~~~g~~v~p~~~~~ii~~~~~~~~l~~~~~~~-------~~~~~~~~~p~~-~~~~~~a~wv~Eg~~~~~s~~- 221 (497) +..+..+. =.+.-+....-.++.+.+...+.+.+... ++.++=...+-. .++...-.=+.-.++....+. T Consensus 1 ~~~t~~sd-l~vfn~~~~~a~~e~~~~~~~~Fnaas~Gai~l~~~~~~GDf~~~~ff~i~~~~~~rnv~~~~~~t~~kit 79 (315) T protein:vir:96 1 MATTVNSD-LVIYNDTAQTAYLERNMDNLAVFNENSRAAIGLNSELIEGDLKLRSFYKVGGAIADRDVNSTATVAGTKIA 79 (315) T ss_pred Cceeeecc-eeeehhhhhhhHHhhhHHHHHHhhhhcCCcccccccccccccccccccccccchhhcccCCCccccceecc Confidence 22222111 01122334445566655555554433221 111211111100 010000000111111222221 Q ss_pred cceeeEeeeeeEE-eeehh--hHHHHh---hHH-HHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccc Q lcl|NC_021309. 222 EFARVYEQVGKVA-NALTI--TDEGLR---DAP-ELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSA 294 (497) Q Consensus 222 ~f~~i~~~~~kla-~~~~i--S~ell~---d~~-~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~ 294 (497) +..++.. |++ +.-++ +...+. +.| ....-|...+..++....-...+.|. .+.+...+..... T Consensus 80 ~~~dvaV---k~~~~~~~~~~~~~~~a~~g~dp~~~~~~i~~~~~~~~l~~~l~~~l~~~-----~aai~~~t~~~~~-- 149 (315) T protein:vir:96 80 ADEMVSV---KVPWKYGPYETTEEAFKRRARSPEEFSMLIGQDMADATMAGWIGYALNAL-----QGAIGSNAGMNVS-- 149 (315) T ss_pred cccceeE---EEeecCCchhccHHHHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHhhh-----hhhhccccccccc-- Confidence 1222222 233 33344 233222 223 22233333333333222222222111 0000000000000 Q ss_pred cchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCc Q lcl|NC_021309. 295 SSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPN 374 (497) Q Consensus 295 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 374 (497) . .........+.++...+- -....-. T Consensus 150 -------------------~----------------------------------~~a~~~~~~l~dA~~klG-D~~~~l~ 175 (315) T protein:vir:96 150 -------------------G----------------------------------ELATEGKKVLTKGLRTMG-DKASSIA 175 (315) T ss_pred -------------------c----------------------------------cccccCHHHHHHHHHHhc-ccccCee Confidence 0 000000112222222221 1112234 Q ss_pred eEEechhHHHHHHHHhhhcCceeccCcccccccccccccccccccceEecCCCCcCceEEEeeccceEEEEeecccEEEe Q lcl|NC_021309. 375 AVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQM 454 (497) Q Consensus 375 ~~~~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~ 454 (497) .|+||...+..|.+ +. --..++.. ..+.-....++.+|+||++++.||..+.+ | |.++++.+.+...+.... T Consensus 176 ~~vMHS~v~~~L~~-q~-L~~~~~~~----~~~~~~~~~~~~lGkrViVdD~~P~~~~~-g-l~~GAi~~~~~~~~~~~~ 247 (315) T protein:vir:96 176 IWVMDSTSYFDIVD-EA-IDNKLYEE----AGVVVYGGTPGTLGKPVLVTDQCPATKIF-G-LVAGAVMITESQAPGMRS 247 (315) T ss_pred EEEEchHHHHHHHH-hh-hhhhcccc----cceeEecCcCcccccEEEEECCCCcceee-e-eecceeeecCCCcccccc Confidence 69999999999987 32 11233221 11111112345679999999999986533 2 344455554433322121 Q ss_pred ecccchhhhcCceEEEEEEeecc-eeecccceEEEEeeCCCCCC Q lcl|NC_021309. 455 TNSNGTDFVDGKVTVRAEERLGL-LVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 455 ~~~~~~~f~~~~v~~r~~~r~~~-~v~~~~a~~~l~~~~~a~~~ 497 (497) .... ++-.+....|..| -+++|..|.+- ++..-| T Consensus 248 ~~~~------g~e~l~~~~r~e~tf~l~p~G~sw~---~~~~~s 282 (315) T protein:vir:96 248 YQID------DQENLAIGFRAEGTANVEVLGYKWK---TKTNVN 282 (315) T ss_pred ccCC------CcceeEEEEeeeeEeeeeeeeEEee---cCCCcC Confidence 1111 1222222244444 36788888663 221112 No 199 >protein:vir:103886 Length: 302 # NCBI annotation: putative major head subunit protein # Family: family:all:776 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938242;genbank:gi:38229147;genbank:GeneID:2648201 Probab=90.38 E-value=0.021 Score=29.76 Aligned_cols=285 Identities=10% Similarity=0.040 Sum_probs=116.1 Q ss_pred hccccccccccccchhhhHHHHHHHHhh-hhHHhhcceeecCCCceEEEEEcCCCccceecccccccccccccceeeEee Q lcl|NC_021309. 151 NPFGSTGTFAPGILPTFLPGIVEQLFYE-LSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQ 229 (497) Q Consensus 151 ~~~~~~~~~g~~v~p~~~~~ii~~~~~~-~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~wv~Eg~~~~~s~~~f~~i~~~ 229 (497) +..+. +.=..+-..+...+.+-.... ......|++++.+...-+|........--.|++| .+...+.=...++. T Consensus 1 m~it~--~~l~~l~~~~~~~~~~~y~~a~~~~~~~a~~~~sdf~~~~~~~lg~~p~l~e~~Ge---~~~~~l~~~~~~i~ 75 (302) T protein:vir:10 1 MLINK--QSLNAAFVAIKTIFNNAFAAAPTTWQKIAMEVPSNTSSNDYKWLSTFPKMRRWIGA---KVVKNLKAYKYVVE 75 (302) T ss_pred CcccH--HHHHHHHHHHHHHHHHHHHhhhhhhhceeeecCCCcceeeceecCCCCCccccccc---eeeccccccceeEE Confidence 00000 000000000111111222222 2344556666544444445444332212245444 33444555557789 Q ss_pred eeeEEeeehhhHHHHh-hHHHHHHHHHHHHHHHHHHHHHhhhhc---c-cCcccccc--ccccccccccccccchhhhhh Q lcl|NC_021309. 230 VGKVANALTITDEGLR-DAPELFNFVQGRLLEGIQRKEEVQLLA---G-GGYPGVNG--LLQRSTGFTASSASSLFGATS 302 (497) Q Consensus 230 ~~kla~~~~iS~ell~-d~~~l~~~i~~~la~~~~~~~d~~~l~---G-~G~~~p~G--i~~~~~~~~~~~~~~~~~~~~ 302 (497) .+++...+.|||+.+. |...+..-+-..+.++.++.+|..++. + .+.....| ++... T Consensus 76 ~~~~g~~v~i~R~~i~nDdlg~~~~~~~~~G~aaa~~~~~lv~~~L~~g~~~~~~DG~~fF~~d---------------- 139 (302) T protein:vir:10 76 NEDFEATVEVDRNDIEDDQIGIYSPQAKMAGYSAAQLPDELVYEAVNGAFTKPCFDGQYFIDTD---------------- 139 (302) T ss_pred eecccceecccHHhhcccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCCcccCCcceeccc---------------- Confidence 9999999999999885 667777888888888888888876542 1 11100000 00000 Q ss_pred hHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhh----hhhccCCceEEe Q lcl|NC_021309. 303 ATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQ----LTLFQTPNAVVM 378 (497) Q Consensus 303 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~ 378 (497) +..... ..+..+...............+.....++.... ......|..+++ T Consensus 140 ------------------------H~~g~~-~~~N~g~~~~~~~~~~l~~~~~~aa~~am~~~k~~~G~~L~i~P~~LiV 194 (302) T protein:vir:10 140 ------------------------HPVGDA-SVSNKGTAPLSNASQAAAKAGYGAARTAMKKFKDEEGRSLNVSPNVLLV 194 (302) T ss_pred ------------------------cccccc-ccccccchhhhhcccccchHHHHHHHHHHHHHhhhcccccccCCCEEEe Confidence 000000 000000000000000111111222222222221 223334445555 Q ss_pred chhHHHHHHHHhhhcCceeccCcccccccccccccccccccceEecCCCCcCce--EEEeeccceEEE-EeecccEEEee Q lcl|NC_021309. 379 NPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTI--LVGHFAPSVIQT-ARREGVTMQMT 455 (497) Q Consensus 379 n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~~--~~gd~~~~~~~i-~~r~~~~i~~~ 455 (497) .+.-...-+++-.. +++-. ....+.. .-.-+|+++.+..++. ++.|.+.....+ -.++...+.. T Consensus 195 p~~le~~A~~ll~~-~~~~~------g~~Np~~-----g~~~~vv~p~L~s~~aWyL~a~~~~i~~~~l~g~~~P~~~~- 261 (302) T protein:vir:10 195 GPALEDVAKMLLTN-PKLAD------NTPNPYV-----GTAELVVDGRIESDTAWFLLDTTKPVKPFIFQPRKQPEFVS- 261 (302) T ss_pred cchhHHHHHHHhhc-cccCC------CCcceec-----cceEEEEeeccCCCCceEEEecCCccceEEEcCccccEEEe- Confidence 54444444333211 11100 0011111 1245677888876653 344544322222 2233344443 Q ss_pred cccchhhhcCceEEEEEEeecceeecccc--eEEEEeeCCCCCC Q lcl|NC_021309. 456 NSNGTDFVDGKVTVRAEERLGLLVYRPSA--FQLIQLKKGATGS 497 (497) Q Consensus 456 ~~~~~~f~~~~v~~r~~~r~~~~v~~~~a--~~~l~~~~~a~~~ 497 (497) . ..|..+.+-+|.+..+|..-+---+ |=.+.+....+++ T Consensus 262 -~--~~~~~dgv~~k~~~d~Gvd~R~~~G~~~wq~a~~s~g~~~ 302 (302) T protein:vir:10 262 -Q--VNLDSDDVFNLRKLKFGAEARAAAGYGFWQLAYGSTGTGA 302 (302) T ss_pred -c--cCCCCCceEEEEEEEEeeeeeeecchhhhhhhhccCccCC Confidence 2 2377788888877766642222111 1123334444444 No 200 >protein:vir:93966 Length: 400 # NCBI annotation: structural protein # Family: family:all:2417 # MgeID: mge:1487 # MgeName: jj50 # Cross-refs: genbank:acc:YP_764320;genbank:gi:115315634;genbank:GeneID:5176553 Probab=89.18 E-value=0.028 Score=29.11 Aligned_cols=378 Identities=15% Similarity=0.118 Sum_probs=131.7 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHhHhHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021309. 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAH--QAEVEAHERAQEMLKSLGGADAAKDGLDNDIP 78 (497) Q Consensus 1 ~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~--~~~~~~~e~~~e~~~~~~~~~a~~~~~~~~~~ 78 (497) -|++.+-.+++.. +++.+..+..++..+ +..++...+++|+..-+.++.-++...++++. T Consensus 10 K~~l~EK~~~~a~------------------~~E~~~~LKS~~~G~evknaiedl~K~~EL~~TlS~~~iEI~~~en~LN 71 (400) T protein:vir:93 10 KPDLIEKQNRLAE------------------LKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELN 71 (400) T ss_pred cchHHHHHHHHhh------------------hhhhhhhhhhhhhcchhhhhhhhchhHHHHHHhHhhcchhhhhhhhhhh Confidence 2333333222222 222222222222111 11223333445555555555555544444332 Q ss_pred HHHHHHHHHHHHHHHhhhhhhHHHHhHhhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccc Q lcl|NC_021309. 79 EVEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGT 158 (497) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (497) ..+.. ..++.. +..-.+. ......|...... .....+.+.++..... +...+-++. T Consensus 72 a~~E~-------~KGK~k-Mt~~i~s---q~A~~eF~~vL~~-------N~G~S~~k~AW~A~L~------E~GVtiTD~ 127 (400) T protein:vir:93 72 AQEEK-------PKGKDK-MTNFIES---QNAVTEFFDVLKK-------NSGKSEIKNAWSAKLA------ENGVTITDT 127 (400) T ss_pred hhhhh-------hhhhHH-HHHHHhh---HHHHHHHHHHHhc-------cCCchhhhhhhhhhHh------hcCcceecc Confidence 11111 000000 0000000 0000001000000 0001122222222111 111111222 Q ss_pred cccccchhhhHHHHHHHHhhhhHHhhcceeecCCCceEEEEEcCCCccceecccccccccccccceeeEeeeeeEEeeeh Q lcl|NC_021309. 159 FAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALT 238 (497) Q Consensus 159 ~g~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~wv~Eg~~~~~s~~~f~~i~~~~~kla~~~~ 238 (497) ...+|.-++..|-..+....+++....+..++.--++..-++ .+.|.-.-.|.++.+...+|..-++.+.-+ |+. T Consensus 128 -~~~LP~~lv~sI~~A~~n~n~v~~vfHVT~~~~~~V~~s~~s--~~~Aq~HkdGqTK~eqa~~~~~~Tl~~~~V--Y~~ 202 (400) T protein:vir:93 128 -TFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSFDS--ANEAQVHKDGQTKTEQAATLTIDTLEPVMV--YKL 202 (400) T ss_pred -chhccHHHHHHHHHhhhccCcceeeeeeccchhhhHHhhhhh--hhhhhhhccCCccccceeeeeeechhHHHH--HHH Confidence 123333344455555566666666444433332111111111 124444455677777777777766665433 333 Q ss_pred hh-HHHHh---hH-HHHHHHHHHHHHHHHH-HHHHhhhhcccCccccccccccccccccccccchhhhhhhHHHHHHhhh Q lcl|NC_021309. 239 IT-DEGLR---DA-PELFNFVQGRLLEGIQ-RKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPA 312 (497) Q Consensus 239 iS-~ell~---d~-~~l~~~i~~~la~~~~-~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 312 (497) .| -++.. .+ ..|..||..+|+.++. +..|.+++-|+|++.+..+..-+.+-.+.... T Consensus 203 ~S~Ae~~K~~~~sYsel~N~i~~ELtQ~~vnk~Vd~AlV~GDG~N~f~~~DK~advK~I~~~T----------------- 265 (400) T protein:vir:93 203 QSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKIT----------------- 265 (400) T ss_pred HHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHhhhheecCCCCccchhhHHHHHHHHHHh----------------- Confidence 33 23333 33 4589999999999998 89999999999998655443322111110000 Q ss_pred hhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHH-HHHHHhh Q lcl|NC_021309. 313 DGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWE-LLRLTKD 391 (497) Q Consensus 313 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~-~l~~lkd 391 (497) ...-..+ -.+ ..+.+-.+.--+.+..+ ....++...+.. -|+.|+. T Consensus 266 -----------------------tkaksag-ktp-------fadaieeavdfvrptag--rrylivktedrkalldelrq 312 (400) T protein:vir:93 266 -----------------------TKAKSAG-KTP-------FADAIEEAVDFVRPTAG--RRYLIVKTEDRKALLDELRQ 312 (400) T ss_pred -----------------------hhhhhcC-CCc-------hhHHHHHHHhhhccCCC--ceEEEEeccchHHHHHHHHh Confidence 0000000 000 00111111111111111 011233333322 2334433 Q ss_pred hcCce--eccCcccccccccccccccccccc-eEecCCC-CcCceEEEeeccceEEEEeecccEEEeecccchhhhcCce Q lcl|NC_021309. 392 ANGQY--MGGNFFGNAYGNPVNGGKNIWGVP-VVTTPLI-PLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKV 467 (497) Q Consensus 392 ~~G~~--i~~~~~~~~~~~~~~~~~~l~G~P-vv~~~~~-~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v 467 (497) ++.+. -...+ +. ....-.|+- +++.... .-..-++.|-+ |.|. -+++ ...+..-|..|.- T Consensus 313 atanahvriknd--da------eiasevgvdeiivytgskalkptvlvdqk---yhid-mqdl----tkvdafewktnsn 376 (400) T protein:vir:93 313 ATANAHVRIKND--DA------EIASEVGVDEIIVYTGSKALKPTVLVDQK---YHID-MQDL----TKVDAFEWKTNSN 376 (400) T ss_pred hccccceEeecc--hh------hhhhhcCcceeeeeeccccccceeeeccc---cccc-hhhh----hhhhhheeccCCc Confidence 32211 00000 00 000001111 0111110 00111111211 2221 1111 1112223555555 Q ss_pred EEEEEEeecceeecccceEEEEee Q lcl|NC_021309. 468 TVRAEERLGLLVYRPSAFQLIQLK 491 (497) Q Consensus 468 ~~r~~~r~~~~v~~~~a~~~l~~~ 491 (497) -+.++.-..+.|..-.|=++++.+ T Consensus 377 milvetltsghvetynagavitvs 400 (400) T protein:vir:93 377 MILVETLTSGHVETYNAGAVITVS 400 (400) T ss_pred eEEEeecccCcceeeccceeEeeC Confidence 555556566666554444444544 No 201 >protein:vir:348 Length: 321 # NCBI annotation: major virion structural protein # Family: family:all:3198 # MgeID: mge:9 # MgeName: Mx8 # Cross-refs: genbank:acc:NP_203462;genbank:gi:15320618;genbank:GeneID:921734 Probab=88.65 E-value=0.031 Score=28.85 Aligned_cols=298 Identities=10% Similarity=0.035 Sum_probs=136.8 Q ss_pred hhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccccccccchhhhHHHHHHHHhhhhHHhh----cce Q lcl|NC_021309. 112 TKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADL----ISS 187 (497) Q Consensus 112 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~p~~~~~ii~~~~~~~~l~~~----~~~ 187 (497) -++ .. ..+.... -..+....+...+....+|+.+ ..+ T Consensus 1 mp~---~~-----------lsel~t~-------------------------tl~~rs~~~~D~v~~~n~LL~~L~~kG~~ 41 (321) T protein:vir:34 1 MPF---PN-----------ISDIITT-------------------------TIESRSGVIADNVTKNNAILARLAKRGKP 41 (321) T ss_pred CCC---ch-----------HHHHHHH-------------------------HHHhhcchhhhhhhcccHHHHHHHhcCcc Confidence 000 00 0000000 0011122223333333343332 334 Q ss_pred eecCC-CceEEEEEcCCCccceec-ccccccccccccceeeEeeeeeEEeeehhhH-HHHhhH--HHHHHHHHHH---HH Q lcl|NC_021309. 188 RPVTS-PNLSYLTESAAHNNAAAV-AEAGTYPFSSEEFARVYEQVGKVANALTITD-EGLRDA--PELFNFVQGR---LL 259 (497) Q Consensus 188 ~~~~~-~~~~~p~~~~~~~~a~wv-~Eg~~~~~s~~~f~~i~~~~~kla~~~~iS~-ell~d~--~~l~~~i~~~---la 259 (497) .+.++ .++..|.+.....++.|. ++..-.+.-.-+|.+-++..+.+++-+.||- |+|+.+ -.+-.++... .. T Consensus 42 ~~~~gg~~I~~~l~y~~~s~~~wy~Gyd~l~~~p~d~~~~Aef~wk~aa~~~~isg~e~l~n~g~~~~idll~~~~~~ae 121 (321) T protein:vir:34 42 RLVSGGYTILEELSFSGNSNGGWYSGYDVLPTAPQDVISSAEYALKQYAVPVVISGLEMLQNSGKEAQLDLLEARMNVAE 121 (321) T ss_pred cccCCCeeEEEEEeeccCcceeEEEeeeeeccchhhhccccccchhheeEeeEEehhHHhhccchHHHHHHHHHHHHHHH Confidence 44554 467778887766778885 5554445556689999999999999888874 456544 2344444444 45 Q ss_pred HHHHHHHHhhhhc-ccC--ccccccccccc----cccccccccchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhh Q lcl|NC_021309. 260 EGIQRKEEVQLLA-GGG--YPGVNGLLQRS----TGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGR 332 (497) Q Consensus 260 ~~~~~~~d~~~l~-G~G--~~~p~Gi~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 332 (497) +.+...++..+.. |+| ..+..|+-... +..++..........+.... ...... T Consensus 122 ~t~~n~l~~~l~sdGTa~g~~~i~GL~~lv~~~p~tGtvGGIdra~~~~WRn~~------~d~~~~-------------- 181 (321) T protein:vir:34 122 ATMANDISAALYGDGTAFGGRAINGLDGAVPVDPTVGTYGGINRALWPFWRSQV------EDMAAV-------------- 181 (321) T ss_pred HHHHhhhhHhhhccccccccchhhhhhhhcccCCCCceeccccccchhhhhhhh------hhhhhc-------------- Confidence 5677788877765 554 33455542221 22222222111111111100 000000 Q ss_pred hhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHHHHHHHhhhcCceeccCcccccccccccc Q lcl|NC_021309. 333 VVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNG 412 (497) Q Consensus 333 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~ 412 (497) .+.......+...... ..-....|+.|++...-|...+......-||--... ...|. . T Consensus 182 ----------------~t~~tl~~~m~~~w~~-~~Rg~~~PDlii~~~~~y~~y~~s~q~~qR~~~~~~--a~~Gf---~ 239 (321) T protein:vir:34 182 ----------------ATINTIQPAMTKLWSR-CVRGADMPDLIMSGNDAWTTYSNSLQVLQRFTSAEE--ANLGF---R 239 (321) T ss_pred ----------------ccHHHHHHHHHHHHHh-hccCCCCccEEEechHHHHHHHHhhheeeeeccccc--ccccc---e Confidence 0000000011111111 112345678888888888888876666666544322 22222 2 Q ss_pred cccccccceEecC----CCCcCceEEEeeccceEEEEeecccEEEeecccchhhhcCceEEEEEEeecceeecccceEEE Q lcl|NC_021309. 413 GKNIWGVPVVTTP----LIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLI 488 (497) Q Consensus 413 ~~~l~G~Pvv~~~----~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~a~~~l 488 (497) .-...|.-||.++ .+|+++-||-|-+...++......+.. +.+..-.-+-+|-+.-..-.+-...+-+|.+=.+| T Consensus 240 ~Lky~~~div~D~~~g~~~pan~~yfiNT~yl~~r~h~~~~~~p-i~p~r~~~~NqdA~~q~I~~~GnL~~sn~~~~~vL 318 (321) T protein:vir:34 240 SLKFLSTDVVLDGGIGGFAGANTMYFLNTKYLHFRPHKDRNMVP-LSPSRRAAFNQDAEAQILAWAGNLTCSGAQFQGRL 318 (321) T ss_pred eeeeeeEEEEEeCCCCCCccccceeeeecceEEEEEcCCCceee-cCcccccccchhHHhhhhhhhheeeeecccceeEE Confidence 3356788888887 689999999888865444333333332 22211000112222222222333333344443333 Q ss_pred Eee Q lcl|NC_021309. 489 QLK 491 (497) Q Consensus 489 ~~~ 491 (497) .-. T Consensus 319 ~~~ 321 (321) T protein:vir:34 319 IAE 321 (321) T ss_pred eeC Confidence 332 No 202 >protein:vir:3746 Length: 336 # NCBI annotation: orf15 # Family: family:all:201 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043487;genbank:gi:9628622;genbank:GeneID:1261135 Probab=88.50 E-value=0.032 Score=28.78 Aligned_cols=312 Identities=10% Similarity=0.033 Sum_probs=137.1 Q ss_pred HHHHHHHHHHHHHhhhhhhhhhhhhccccc-----cccccccchhhhHHHHHHHHhhhhHHhhcceeecCCCceEEEEEc Q lcl|NC_021309. 127 PGTAAAELMGAFADGETAPAAIGQNPFGST-----GTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTES 201 (497) Q Consensus 127 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~g~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~ 201 (497) ..+ +....+.. .. +...+.. ...-..|.|.....+...+.+.+.+++++++++++--.++..-.. T Consensus 1 mtr---~~~~~y~~----~~---A~~ngv~~a~~~~~~~Fsv~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg 70 (336) T protein:vir:37 1 MNK---QAYYALAA----AL---AKHFNQPLDSVLRGESFALKAPEAALLGENIQQRSDFLKQINMIQVAHTKGQKLFGA 70 (336) T ss_pred CcH---HHHHHHHH----HH---HHHhCCChhhhccCceeecCHHHHHHHHHHHHHHHHHhhcCceeecccccceEeeec Confidence 000 00000100 00 0011111 111245677778888899999999999999999886444333222 Q ss_pred CCCccceecccccccccccccceeeEeeeeeEEeeehhhHHHHhhHH---HH-HHHHHHHHHHHHHHHHHhhh--hcccC Q lcl|NC_021309. 202 AAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTITDEGLRDAP---EL-FNFVQGRLLEGIQRKEEVQL--LAGGG 275 (497) Q Consensus 202 ~~~~~a~wv~Eg~~~~~s~~~f~~i~~~~~kla~~~~iS~ell~d~~---~l-~~~i~~~la~~~~~~~d~~~--l~G~G 275 (497) ...+-++-.. .+..| .+...+.-.+.+++.-.-+.|+.+.|+..+ +. ...+...+.+.++ +|.-. ++|+- T Consensus 71 ~~g~iagrtd-t~R~~-~~~~l~~~~Y~c~qTn~dt~i~y~~LD~WA~~~df~~~~~~~~~~r~iA--LD~i~IGfnG~s 146 (336) T protein:vir:37 71 TEKGVTGRKQ-TGRNL-ANLDHTQNGFELAETDSGIIVPWALFDSFAIFKDRLVELYSEYFQNQVA--LDILQIGWNGQS 146 (336) T ss_pred cCcccccccC-CCccc-cccCcCCcccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHHHHHHHHh--hchhhhccccee Confidence 1111111111 11222 224556666666666667788999998764 42 2333333344333 44333 33322 Q ss_pred ccccccccccccccccccccchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhh--------cccccccc Q lcl|NC_021309. 276 YPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAG--------SGSGVAGS 347 (497) Q Consensus 276 ~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~ 347 (497) ..... . ....-..+..|+..++......-... ...+.... T Consensus 147 ~A~~T-----------d---------------------nPllqDVNkGWlQ~~Re~a~~~v~~~~~~~~g~i~~~G~~gd 194 (336) T protein:vir:37 147 VADNT-----------T---------------------KADLSDVNKGWLKLLQEQRAANFMTESTKSSGKITIFGDNAD 194 (336) T ss_pred eccCC-----------C---------------------CCcccccchhHHHHHHhccchhhcccccccCCceEEecCCCC Confidence 11000 0 00011112222333322111100000 00111112 Q ss_pred cchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHHHHH--HHhhhcCceeccCcccccccccccccccccccceEecC Q lcl|NC_021309. 348 YPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLR--LTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTP 425 (497) Q Consensus 348 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~--~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~ 425 (497) +.+.+. ...+++..+...++..+.-+++--.++..-. .+-..+|.. |.-...........++-|+|.+..+ T Consensus 195 y~NLDa---lV~D~~~~I~~~~~~d~dLVvivG~dLla~~~~~l~~~~~~~----PtE~~Aa~~~~~~k~iGGlpa~~~P 267 (336) T protein:vir:37 195 YANLDD---LAFDLKQGLDFRHQNRNDLVFLVGADLVSKETKLIQQKHGLT----PTEKAALGSHNLMGSFGGMNAITPP 267 (336) T ss_pred cccHHH---HHHHHHhcCchHHhcCCCeEEEEchhhhhhhhhhhhhhcCCC----HHHHHHHHHHHHHHhhCCceeEEcc Confidence 333333 3444455567777777775544434332222 122222210 0000000011123579999999999 Q ss_pred CCCcCceEEEeeccceEEEEeecccE-EEeecccchhhhcCceEEEEEEeecceeecccceEEEEeeCCCCCC Q lcl|NC_021309. 426 LIPLGTILVGHFAPSVIQTARREGVT-MQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 426 ~~~~~~~~~gd~~~~~~~i~~r~~~~-i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~~a~~~ 497 (497) ++|.+.+++=-|+- +.|+...+-. -.+-+.. .+|++.-.=..=-++.|-+++.++.+...+..-+- T Consensus 268 ffP~~~~lVT~L~N--LsIY~Q~gs~RR~~~d~p----~r~rie~y~s~Ne~YvVEd~~~~a~iE~i~v~~~~ 334 (336) T protein:vir:37 268 NFPARAAAVTTLKN--LSVYTEAESVRRSLRNDE----DKKGLVTSYYRQEGYVVEDLGLMTAIDHTKVKLNG 334 (336) T ss_pred ccCCCceEEeechh--cEEEEecCcEEEEEEEcc----ccccccchhhhcceeeeeccccEEEeeeeeeeecC Confidence 99999988766665 3333333322 1221111 12333333334445666777777766554444433 No 203 >protein:vir:95603 Length: 463 # NCBI annotation: ORF016 # Family: family:all:2450 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240903;genbank:gi:66394965;genbank:GeneID:5132544 Probab=87.70 E-value=0.037 Score=28.43 Aligned_cols=309 Identities=14% Similarity=0.061 Sum_probs=132.2 Q ss_pred hhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhh--hccccccccccccchhhhHHHHHHH---Hhhhh Q lcl|NC_021309. 106 TSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQ--NPFGSTGTFAPGILPTFLPGIVEQL---FYELS 180 (497) Q Consensus 106 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~g~~v~p~~~~~ii~~~---~~~~~ 180 (497) ...++.....+.. . ...+.+.. .++... .....+-.+|+.+--+.+..-|..+ ..... T Consensus 1 ~~~~~~~~~~~~~-----~----------~~~~~e~~--~KS~~tg~g~~p~~q~~~~AlR~EsL~~~i~~Lt~~~~~f~ 63 (463) T protein:vir:95 1 MTIEKNLSDVQQK-----Y----------ADQFQEDV--VKSFQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLI 63 (463) T ss_pred CCcccccchHHHH-----H----------HhhhhHHH--HHHhhcCCccCCccccCcchhhhhhhhhhhheeeecccchh Confidence 0000000000000 0 00000000 000000 0001122345555555443333332 22234 Q ss_pred HHhhcceeecCCCceEEEEEc--CCCccceecccccccccccccceeeEeeeeeEEeeehhhHHH-HhhH-HHHHHHHHH Q lcl|NC_021309. 181 LADLISSRPVTSPNLSYLTES--AAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTITDEG-LRDA-PELFNFVQG 256 (497) Q Consensus 181 l~~~~~~~~~~~~~~~~p~~~--~~~~~a~wv~Eg~~~~~s~~~f~~i~~~~~kla~~~~iS~el-l~d~-~~l~~~i~~ 256 (497) ++.-+...++.+-.-+|-.+. ++.+.+.+++|++..+.+++++.+....+|=|+....+|.-+ |+++ .+.+....+ T Consensus 64 ~~~~i~k~~a~STV~~y~~~~~~G~~g~~~f~~E~g~~~~~d~~~~Rr~~~~K~l~~~~~VS~~~~l~n~~~d~~~~~~~ 143 (463) T protein:vir:95 64 FYRDISRRPAQSTVVKYDQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTE 143 (463) T ss_pred hhhhcCCchhhhhhhhheeeeccCccccccccccccccccCCCceEEEEEEeeeeehhhhhhhHHHhhcccccHHHHHHH Confidence 555566677766544454444 333457889999999999999999999999999888888765 5555 477888888 Q ss_pred HHHHHHHHHHHhhhhcccCcccc---------ccccccccccccccccchhhhhhhHHHHHHhhhhhcchhhhhhhhhhh Q lcl|NC_021309. 257 RLLEGIQRKEEVQLLAGGGYPGV---------NGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVAS 327 (497) Q Consensus 257 ~la~~~~~~~d~~~l~G~G~~~p---------~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 327 (497) +-.-.++..++.+.++|+-.=.| .||.+.-... +. T Consensus 144 dai~~ia~tiE~a~FyGds~l~~~~~~~gleFDGl~~lId~e---------------------------------nv--- 187 (463) T protein:vir:95 144 DAIAVVAKTIEWASFYGDASLTSEVEGEGLEFDGLAKLIDKN---------------------------------NV--- 187 (463) T ss_pred HHHHHHHHHHHHHHhhhhhccCCCcCccccchhhhhhhcCCC---------------------------------Ce--- Confidence 88888999999999999853222 1221111000 00 Q ss_pred hhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHHHHHHHhhhcCceeccCcccc-cc Q lcl|NC_021309. 328 LKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGN-AY 406 (497) Q Consensus 328 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~i~~~~~~~-~~ 406 (497) ....|..+. .+.+..+-.. ....+.+++-++|...+.+.+..-.-.--|.+.+++.+. .. T Consensus 188 ----------iDarG~~Ls--------~~~ln~Aa~~-i~~~fGt~TD~~lp~~vka~f~~~~l~~qrv~~~~N~~~~~~ 248 (463) T protein:vir:95 188 ----------INAKGNQLT--------EKHLNEAAVR-IGKGFGTATDAYMPIGVHADFVNSILGRQMQLMQDNSGNVNT 248 (463) T ss_pred ----------eecCCCccc--------HHHHhhhhhh-hhcccCChhheecchHHHHHHHHHhcCceEEEEcCCCCceee Confidence 000000000 0111111111 223444555667777776666643333333333332221 11 Q ss_pred ccccc-----------ccccccccceEecCCCCcCceEEEeeccceEEEEeecccEEEeecc-cchhh-h--cCceEEEE Q lcl|NC_021309. 407 GNPVN-----------GGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNS-NGTDF-V--DGKVTVRA 471 (497) Q Consensus 407 ~~~~~-----------~~~~l~G~Pvv~~~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~-~~~~f-~--~~~v~~r~ 471 (497) |.+.. .++++++.|-+.....+ ...++|.. ..+++.++.. .+..| . .....|++ T Consensus 249 G~~v~~f~s~~G~I~L~~s~~m~~~~il~~~~~---~~p~ap~~--------~~~tatv~~~~~~~~~~~~~~a~~~Y~v 317 (463) T protein:vir:95 249 GYSVNGFYSSRGFIKLHGSTVMENELILDESLQ---PLPNAPQP--------AKVTATVETKQKGAFENEEDRAGLSYKV 317 (463) T ss_pred eeeccceeeeeeeeeeCCceecCCcccccchhh---cCCCCccC--------ceeEEEEeeccCCCCCCcccccceEEEE Confidence 11111 11111122221111111 00111111 1122233221 11222 1 22345555 Q ss_pred EEeecceeecccceE-----------EEEeeCCCCCC Q lcl|NC_021309. 472 EERLGLLVYRPSAFQ-----------LIQLKKGATGS 497 (497) Q Consensus 472 ~~r~~~~v~~~~a~~-----------~l~~~~~a~~~ 497 (497) ...-+..=-.|..++ +|++.-.+.++ T Consensus 318 v~~s~~geS~pS~ivtaT~a~~~~gv~l~It~~a~~~ 354 (463) T protein:vir:95 318 VVNSDDAQSAPSEEVTATVSNVDDGVKLSINVNAMYQ 354 (463) T ss_pred EEECCCCCcccchheeeeeeeccceEEEEEEecCCcc Confidence 555544444454443 33333333333 No 204 >protein:vir:99311 Length: 463 # NCBI annotation: putative capsid protein # Family: family:all:2450 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024474;genbank:gi:48696433;genbank:GeneID:2948039 Probab=87.70 E-value=0.037 Score=28.43 Aligned_cols=309 Identities=14% Similarity=0.061 Sum_probs=132.2 Q ss_pred hhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhh--hccccccccccccchhhhHHHHHHH---Hhhhh Q lcl|NC_021309. 106 TSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQ--NPFGSTGTFAPGILPTFLPGIVEQL---FYELS 180 (497) Q Consensus 106 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~g~~v~p~~~~~ii~~~---~~~~~ 180 (497) ...++.....+.. . ...+.+.. .++... .....+-.+|+.+--+.+..-|..+ ..... T Consensus 1 ~~~~~~~~~~~~~-----~----------~~~~~e~~--~KS~~tg~g~~p~~q~~~~AlR~EsL~~~i~~Lt~~~~~f~ 63 (463) T protein:vir:99 1 MTIEKNLSDVQQK-----Y----------ADQFQEDV--VKSFQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLI 63 (463) T ss_pred CCcccccchHHHH-----H----------HhhhhHHH--HHHhhcCCccCCccccCcchhhhhhhhhhhheeeecccchh Confidence 0000000000000 0 00000000 000000 0001122345555555443333332 22234 Q ss_pred HHhhcceeecCCCceEEEEEc--CCCccceecccccccccccccceeeEeeeeeEEeeehhhHHH-HhhH-HHHHHHHHH Q lcl|NC_021309. 181 LADLISSRPVTSPNLSYLTES--AAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTITDEG-LRDA-PELFNFVQG 256 (497) Q Consensus 181 l~~~~~~~~~~~~~~~~p~~~--~~~~~a~wv~Eg~~~~~s~~~f~~i~~~~~kla~~~~iS~el-l~d~-~~l~~~i~~ 256 (497) ++.-+...++.+-.-+|-.+. ++.+.+.+++|++..+.+++++.+....+|=|+....+|.-+ |+++ .+.+....+ T Consensus 64 ~~~~i~k~~a~STV~~y~~~~~~G~~g~~~f~~E~g~~~~~d~~~~Rr~~~~K~l~~~~~VS~~~~l~n~~~d~~~~~~~ 143 (463) T protein:vir:99 64 FYRDISRRPAQSTVVKYDQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTE 143 (463) T ss_pred hhhhcCCchhhhhhhhheeeeccCccccccccccccccccCCCceEEEEEEeeeeehhhhhhhHHHhhcccccHHHHHHH Confidence 555566677766544454444 333457889999999999999999999999999888888765 5555 477888888 Q ss_pred HHHHHHHHHHHhhhhcccCcccc---------ccccccccccccccccchhhhhhhHHHHHHhhhhhcchhhhhhhhhhh Q lcl|NC_021309. 257 RLLEGIQRKEEVQLLAGGGYPGV---------NGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVAS 327 (497) Q Consensus 257 ~la~~~~~~~d~~~l~G~G~~~p---------~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 327 (497) +-.-.++..++.+.++|+-.=.| .||.+.-... +. T Consensus 144 dai~~ia~tiE~a~FyGds~l~~~~~~~gleFDGl~~lId~e---------------------------------nv--- 187 (463) T protein:vir:99 144 DAIAVVAKTIEWASFYGDASLTSEVEGEGLEFDGLAKLIDKN---------------------------------NV--- 187 (463) T ss_pred HHHHHHHHHHHHHHhhhhhccCCCcCccccchhhhhhhcCCC---------------------------------Ce--- Confidence 88888999999999999853222 1221111000 00 Q ss_pred hhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHHHHHHHhhhcCceeccCcccc-cc Q lcl|NC_021309. 328 LKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGN-AY 406 (497) Q Consensus 328 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~i~~~~~~~-~~ 406 (497) ....|..+. .+.+..+-.. ....+.+++-++|...+.+.+..-.-.--|.+.+++.+. .. T Consensus 188 ----------iDarG~~Ls--------~~~ln~Aa~~-i~~~fGt~TD~~lp~~vka~f~~~~l~~qrv~~~~N~~~~~~ 248 (463) T protein:vir:99 188 ----------INAKGNQLT--------EKHLNEAAVR-IGKGFGTATDAYMPIGVHADFVNSILGRQMQLMQDNSGNVNT 248 (463) T ss_pred ----------eecCCCccc--------HHHHhhhhhh-hhcccCChhheecchHHHHHHHHHhcCceEEEEcCCCCceee Confidence 000000000 0111111111 223444555667777776666643333333333332221 11 Q ss_pred ccccc-----------ccccccccceEecCCCCcCceEEEeeccceEEEEeecccEEEeecc-cchhh-h--cCceEEEE Q lcl|NC_021309. 407 GNPVN-----------GGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNS-NGTDF-V--DGKVTVRA 471 (497) Q Consensus 407 ~~~~~-----------~~~~l~G~Pvv~~~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~-~~~~f-~--~~~v~~r~ 471 (497) |.+.. .++++++.|-+.....+ ...++|.. ..+++.++.. .+..| . .....|++ T Consensus 249 G~~v~~f~s~~G~I~L~~s~~m~~~~il~~~~~---~~p~ap~~--------~~~tatv~~~~~~~~~~~~~~a~~~Y~v 317 (463) T protein:vir:99 249 GYSVNGFYSSRGFIKLHGSTVMENELILDESLQ---PLPNAPQP--------AKVTATVETKQKGAFENEEDRAGLSYKV 317 (463) T ss_pred eeeccceeeeeeeeeeCCceecCCcccccchhh---cCCCCccC--------ceeEEEEeeccCCCCCCcccccceEEEE Confidence 11111 11111122221111111 00111111 1122233221 11222 1 22345555 Q ss_pred EEeecceeecccceE-----------EEEeeCCCCCC Q lcl|NC_021309. 472 EERLGLLVYRPSAFQ-----------LIQLKKGATGS 497 (497) Q Consensus 472 ~~r~~~~v~~~~a~~-----------~l~~~~~a~~~ 497 (497) ...-+..=-.|..++ +|++.-.+.++ T Consensus 318 v~~s~~geS~pS~ivtaT~a~~~~gv~l~It~~a~~~ 354 (463) T protein:vir:99 318 VVNSDDAQSAPSEEVTATVSNVDDGVKLSINVNAMYQ 354 (463) T ss_pred EEECCCCCcccchheeeeeeeccceEEEEEEecCCcc Confidence 555544444454443 33333333333 No 205 >protein:vir:94800 Length: 319 # NCBI annotation: ORF012 # Family: family:all:701 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240536;genbank:gi:66396203;genbank:GeneID:5133580 Probab=87.63 E-value=0.037 Score=28.40 Aligned_cols=282 Identities=12% Similarity=0.010 Sum_probs=109.8 Q ss_pred hhHHHHhHhhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccccccccchhhhHHHHHHHHh Q lcl|NC_021309. 98 MNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFY 177 (497) Q Consensus 98 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~p~~~~~ii~~~~~ 177 (497) ++...+++.... + .....+ .. -+...+ -..-.+....+++.... T Consensus 1 ~~~~~~~~~~~~-------------------------~----~~~~~~----~~--~~~~~n-t~~l~~k~~~~LD~~~~ 44 (319) T protein:vir:94 1 MNKTIKNATGML-------------------------K----LNLQHF----AN--KSVEPG-QTLLKNKHVGILERVTA 44 (319) T ss_pred CCccccccccee-------------------------E----eehhhh----hc--cCCCcc-hHHHHHHHHHHHHHHHH Confidence 000000000000 0 000000 00 011111 11112222333433333 Q ss_pred hhhHHh--hcc--eeecCCCceEEEEEcCCCccceecccccccccccc--cceeeEeeeeeEEeeeh--hhHHHHhhHHH Q lcl|NC_021309. 178 ELSLAD--LIS--SRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSE--EFARVYEQVGKVANALT--ITDEGLRDAPE 249 (497) Q Consensus 178 ~~~l~~--~~~--~~~~~~~~~~~p~~~~~~~~a~wv~Eg~~~~~s~~--~f~~i~~~~~kla~~~~--iS~ell~d~~~ 249 (497) ...+-. .++ .....++++.||+.+..+...+ ..++.....++ ++...++...+.-.+.. +... +.+.. T Consensus 45 ~~~~s~~~~~N~~~e~~gg~tVkIp~i~~~gl~DY--~R~~g~~~g~vt~~~~t~tidqdR~~~F~VD~~D~~--Etn~~ 120 (319) T protein:vir:94 45 VNAYSTPALISNDAIFMEGRSFTVMKGDTTELKDY--KRNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRK--DTEGN 120 (319) T ss_pred HhhhhhhcccCcceEeccCcEEEEeeecccccccc--cCCCCcccCCcccceeEEEeecccccccccchhhHh--hhhch Confidence 332221 122 3345678899999876332222 11222222233 34444444444333321 1110 11112 Q ss_pred H--HHHHHHHHHHHHHHHHHhhhhc----ccCccccccccccccccccccccchhhhhhhHHHHHHhhhhhcchhhhhhh Q lcl|NC_021309. 250 L--FNFVQGRLLEGIQRKEEVQLLA----GGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQD 323 (497) Q Consensus 250 l--~~~i~~~la~~~~~~~d~~~l~----G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 323 (497) + ...+.......+.-.+|.-.+. +.|.. T Consensus 121 l~a~~i~~~~~~~~v~PEiDay~~skla~~a~~~---------------------------------------------- 154 (319) T protein:vir:94 121 IDINYVVARQGAEVVAPYLDNLRFATLARNKAKH---------------------------------------------- 154 (319) T ss_pred hhHHHHHHHHHHHHhhhhhhHHHHHHHHhhcccc---------------------------------------------- Confidence 2 1222233333333334432221 00000 Q ss_pred hhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHHHHHHHhhhcCceeccCccc Q lcl|NC_021309. 324 TVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFG 403 (497) Q Consensus 324 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~i~~~~~~ 403 (497) .....+....++.+.+++..+....--.+.+++++|..+..|.+-..- ....... T Consensus 155 ---------------------~~~~~t~~n~y~~i~~a~~~Lde~~VP~~Rvl~Vtp~~~~~L~~~~~f----~~~~~~~ 209 (319) T protein:vir:94 155 ---------------------LTVGTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIA----LPQGDTR 209 (319) T ss_pred ---------------------cccccCHHHHHHHHHHHHHHHHhcCCCCCcEEEeCHHHHHHHHhhhhh----hcccccc Confidence 000011223345556666666554433344568888887777543221 1111111 Q ss_pred ccccccccccccccccceEecCC--CCcCceEEEeeccceEEEEeecccEEEeecccchhhhcCceEEEEEEeecceeec Q lcl|NC_021309. 404 NAYGNPVNGGKNIWGVPVVTTPL--IPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYR 481 (497) Q Consensus 404 ~~~~~~~~~~~~l~G~Pvv~~~~--~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~ 481 (497) . .+........|.|+||+.++. +..-.+++|.-+. . +...+--.+++..-....| --.++....+|..|.+ T Consensus 210 ~-~~~~~g~Vg~idG~~Vi~vps~~~k~in~i~~h~~A--~-~~~~k~~~~~~~~p~~~~~---a~~v~gr~y~d~~V~~ 282 (319) T protein:vir:94 210 Q-QVLGKGVQGELDGFVIVKVPTKLLQGLQAIAVVGEV--L-ASPIQADLAKTNSNIPGMF---GTLAEQLLYTGAFVPE 282 (319) T ss_pred c-cceeeeeceeecCeEEEEecccccccceEEEEcCCe--e-eeeeeeeeeeccCCCcccc---ceeeeeeeeeeeEEec Confidence 1 112223345799999987543 3344466665442 2 1111211233222111122 3578888999999999 Q ss_pred ccceEEEEeeCCCCCC Q lcl|NC_021309. 482 PSAFQLIQLKKGATGS 497 (497) Q Consensus 482 ~~a~~~l~~~~~a~~~ 497 (497) |++........+++.+ T Consensus 283 ~k~~~Iy~~~~~~~~~ 298 (319) T protein:vir:94 283 HLQKYIFTIGGTEVAT 298 (319) T ss_pred cccceEEEeecCCccc Confidence 9855444444444444 No 206 >protein:vir:97331 Length: 319 # NCBI annotation: ORF011 # Family: family:all:701 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240611;genbank:gi:66396278;genbank:GeneID:5133687 Probab=87.63 E-value=0.037 Score=28.40 Aligned_cols=282 Identities=12% Similarity=0.010 Sum_probs=109.8 Q ss_pred hhHHHHhHhhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccccccccchhhhHHHHHHHHh Q lcl|NC_021309. 98 MNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFY 177 (497) Q Consensus 98 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~p~~~~~ii~~~~~ 177 (497) ++...+++.... + .....+ .. -+...+ -..-.+....+++.... T Consensus 1 ~~~~~~~~~~~~-------------------------~----~~~~~~----~~--~~~~~n-t~~l~~k~~~~LD~~~~ 44 (319) T protein:vir:97 1 MNKTIKNATGML-------------------------K----LNLQHF----AN--KSVEPG-QTLLKNKHVGILERVTA 44 (319) T ss_pred CCccccccccee-------------------------E----eehhhh----hc--cCCCcc-hHHHHHHHHHHHHHHHH Confidence 000000000000 0 000000 00 011111 11112222333433333 Q ss_pred hhhHHh--hcc--eeecCCCceEEEEEcCCCccceecccccccccccc--cceeeEeeeeeEEeeeh--hhHHHHhhHHH Q lcl|NC_021309. 178 ELSLAD--LIS--SRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSE--EFARVYEQVGKVANALT--ITDEGLRDAPE 249 (497) Q Consensus 178 ~~~l~~--~~~--~~~~~~~~~~~p~~~~~~~~a~wv~Eg~~~~~s~~--~f~~i~~~~~kla~~~~--iS~ell~d~~~ 249 (497) ...+-. .++ .....++++.||+.+..+...+ ..++.....++ ++...++...+.-.+.. +... +.+.. T Consensus 45 ~~~~s~~~~~N~~~e~~gg~tVkIp~i~~~gl~DY--~R~~g~~~g~vt~~~~t~tidqdR~~~F~VD~~D~~--Etn~~ 120 (319) T protein:vir:97 45 VNAYSTPALISNDAIFMEGRSFTVMKGDTTELKDY--KRNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRK--DTEGN 120 (319) T ss_pred HhhhhhhcccCcceEeccCcEEEEeeecccccccc--cCCCCcccCCcccceeEEEeecccccccccchhhHh--hhhch Confidence 332221 122 3345678899999876332222 11222222233 34444444444333321 1110 11112 Q ss_pred H--HHHHHHHHHHHHHHHHHhhhhc----ccCccccccccccccccccccccchhhhhhhHHHHHHhhhhhcchhhhhhh Q lcl|NC_021309. 250 L--FNFVQGRLLEGIQRKEEVQLLA----GGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQD 323 (497) Q Consensus 250 l--~~~i~~~la~~~~~~~d~~~l~----G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 323 (497) + ...+.......+.-.+|.-.+. +.|.. T Consensus 121 l~a~~i~~~~~~~~v~PEiDay~~skla~~a~~~---------------------------------------------- 154 (319) T protein:vir:97 121 IDINYVVARQGAEVVAPYLDNLRFATLARNKAKH---------------------------------------------- 154 (319) T ss_pred hhHHHHHHHHHHHHhhhhhhHHHHHHHHhhcccc---------------------------------------------- Confidence 2 1222233333333334432221 00000 Q ss_pred hhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHHHHHHHhhhcCceeccCccc Q lcl|NC_021309. 324 TVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFG 403 (497) Q Consensus 324 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~i~~~~~~ 403 (497) .....+....++.+.+++..+....--.+.+++++|..+..|.+-..- ....... T Consensus 155 ---------------------~~~~~t~~n~y~~i~~a~~~Lde~~VP~~Rvl~Vtp~~~~~L~~~~~f----~~~~~~~ 209 (319) T protein:vir:97 155 ---------------------LTVGTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIA----LPQGDTR 209 (319) T ss_pred ---------------------cccccCHHHHHHHHHHHHHHHHhcCCCCCcEEEeCHHHHHHHHhhhhh----hcccccc Confidence 000011223345556666666554433344568888887777543221 1111111 Q ss_pred ccccccccccccccccceEecCC--CCcCceEEEeeccceEEEEeecccEEEeecccchhhhcCceEEEEEEeecceeec Q lcl|NC_021309. 404 NAYGNPVNGGKNIWGVPVVTTPL--IPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYR 481 (497) Q Consensus 404 ~~~~~~~~~~~~l~G~Pvv~~~~--~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~ 481 (497) . .+........|.|+||+.++. +..-.+++|.-+. . +...+--.+++..-....| --.++....+|..|.+ T Consensus 210 ~-~~~~~g~Vg~idG~~Vi~vps~~~k~in~i~~h~~A--~-~~~~k~~~~~~~~p~~~~~---a~~v~gr~y~d~~V~~ 282 (319) T protein:vir:97 210 Q-QVLGKGVQGELDGFVIVKVPTKLLQGLQAIAVVGEV--L-ASPIQADLAKTNSNIPGMF---GTLAEQLLYTGAFVPE 282 (319) T ss_pred c-cceeeeeceeecCeEEEEecccccccceEEEEcCCe--e-eeeeeeeeeeccCCCcccc---ceeeeeeeeeeeEEec Confidence 1 112223345799999987543 3344466665442 2 1111211233222111122 3578888999999999 Q ss_pred ccceEEEEeeCCCCCC Q lcl|NC_021309. 482 PSAFQLIQLKKGATGS 497 (497) Q Consensus 482 ~~a~~~l~~~~~a~~~ 497 (497) |++........+++.+ T Consensus 283 ~k~~~Iy~~~~~~~~~ 298 (319) T protein:vir:97 283 HLQKYIFTIGGTEVAT 298 (319) T ss_pred cccceEEEeecCCccc Confidence 9855444444444444 No 207 >protein:vir:94989 Length: 349 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224029;genbank:gi:62327316;genbank:GeneID:5176817 Probab=87.53 E-value=0.038 Score=28.36 Aligned_cols=278 Identities=12% Similarity=0.041 Sum_probs=111.0 Q ss_pred hccccccccccccchh--hhHH-HHHHHHhhhhHHhhcc---------eeecCCCceEEEEEcCCCcc--ceecccc--c Q lcl|NC_021309. 151 NPFGSTGTFAPGILPT--FLPG-IVEQLFYELSLADLIS---------SRPVTSPNLSYLTESAAHNN--AAAVAEA--G 214 (497) Q Consensus 151 ~~~~~~~~~g~~v~p~--~~~~-ii~~~~~~~~l~~~~~---------~~~~~~~~~~~p~~~~~~~~--a~wv~Eg--~ 214 (497) |+. +.-..+++|+ .... +.+...+.+.|++-.- ....++..+++|-...-++. ..+-+.. + T Consensus 1 Ma~---T~l~D~iipe~~vf~~Yv~~~~~e~~~l~qSGii~~d~~l~~~~~~gG~~~~iPf~~~l~g~~e~n~~~dt~~~ 77 (349) T protein:vir:94 1 MAI---TTIGNIVTGNIPVLASYMTEDPVEKTAFFNSGILTPTPYAAEIARGPSNIANLPFWKAIDTSIEPNYSNDVYQD 77 (349) T ss_pred CCc---eEEeeeeccChHHHHHHHHHhHHHhhhhhhccceeccHHHHHHHhcCCCEEEeeeeecCCCCcccccCCCCccc Confidence 221 1123356665 2222 3333333344443111 11234556788876543332 2122211 1 Q ss_pred ccccccc-cceeeEeeeeeEEee--ehhhHHHHhhHHHHHHHHHHHHHHHHHHHHHhhhhcccCcccccccccccccccc Q lcl|NC_021309. 215 TYPFSSE-EFARVYEQVGKVANA--LTITDEGLRDAPELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTA 291 (497) Q Consensus 215 ~~~~s~~-~f~~i~~~~~kla~~--~~iS~ell~d~~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~ 291 (497) ..+.++. ++.++-.....--++ ..++.++--+ +..+.|.++++....+.....+|. -.+|+++.....+. T Consensus 78 ~~t~~kit~~~~~a~~~~r~kaw~~~Dla~~lsG~--dpm~~Ia~~va~yW~r~~q~~Lia-----~L~Gvf~~~~~~~~ 150 (349) T protein:vir:94 78 IATPRAIQTGEMMARVAYLNEGFGQADLTVELTSQ--NPLQSVASRLDNFWQRQAQRRLIA-----TALGLYNDNVSATD 150 (349) T ss_pred ccccccccccceeeeeeeeccccchhHHHHHhhCc--hHHHHHHHHHHHHHhhHHHHHHHH-----HHHhhhcccccccc Confidence 2333332 233333333332232 2344444322 445666676766655554444442 13444443211110 Q ss_pred ccccchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhh-hh- Q lcl|NC_021309. 292 SSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQL-TL- 369 (497) Q Consensus 292 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~- 369 (497) ..... . .+..... . .+......+.++...+.. .. T Consensus 151 ~~~~~--~-------~~~~d~~-~----------------------------------~a~~~~~~~~~A~~~~Gdaa~G 186 (349) T protein:vir:94 151 AYHEQ--N-------DMVVDVS-A----------------------------------TSGFDAGAFIDATQTMGDALMG 186 (349) T ss_pred ccccc--C-------ceeEEec-c----------------------------------cCCCChhhHHHHHHHHHHHhcc Confidence 00000 0 0000000 0 000000111111111111 11 Q ss_pred --ccCCceEEechhHHHHHHHHhhhcCceeccCcccccccccccccccccccceEecCCCCcC------c---eEEEeec Q lcl|NC_021309. 370 --FQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLG------T---ILVGHFA 438 (497) Q Consensus 370 --~~~~~~~~~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~------~---~~~gd~~ 438 (497) ...-..++||+..+..|++++--+ |+ ++ .. ....-++++|++||+++.||.. . ++|| T Consensus 187 d~~~~lt~i~mHS~v~~~L~~~~li~--~i-~~-s~-----~~~~i~ty~G~~VivDD~~Pv~~~g~~~~yttylfg--- 254 (349) T protein:vir:94 187 NGGEVLGAIAMHSFVYAQARKAQLID--FI-RD-AE-----NNTMFATYQGYRVIVDDSMTVVGQDTSRKFISIIFG--- 254 (349) T ss_pred ccccceeEEEEchHHHHHHHhcchhh--hc-cC-cc-----cCcccceecCcEEEEeCCCccccCCCCceEEEEEee--- Confidence 122346899999999998764311 11 10 00 1112467899999999999942 1 2444 Q ss_pred cceEEEEeec-ccEEEeecccchhhhcCceEEEEEEeecceeecccceEEEEeeCCCCC--------C Q lcl|NC_021309. 439 PSVIQTARRE-GVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGATG--------S 497 (497) Q Consensus 439 ~~~~~i~~r~-~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~~a~~--------~ 497 (497) .+++...+-. ...+++.+.....=..++-.+....|+ +.||..|..-.-..+..| | T Consensus 255 ~GAi~~~~~~~~~~~E~~rd~~~g~~~G~d~L~~R~~~---~~hp~G~s~~~a~v~~~~~~~~~~sPt 319 (349) T protein:vir:94 255 QGAIGYGEGNPEMPLEYEREASRANGGGVETLWTRKTW---LLHPFGYSFTSAVITGNGTETIARSAS 319 (349) T ss_pred cceEEeecCCCCcceeeecccccCCcceeEEEEEeeEE---EeeeeeeeecccccCCCccccccCCCC Confidence 3334333322 122344332210001234555555554 678888866542211111 1 No 208 >protein:vir:78387 Length: 349 # NCBI annotation: putative coat protein # Family: family:all:1522 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110837;genbank:gi:134288598;genbank:GeneID:5179650 Probab=87.47 E-value=0.038 Score=28.34 Aligned_cols=278 Identities=12% Similarity=0.024 Sum_probs=110.4 Q ss_pred hccccccccccccchh--hhHH-HHHHHHhhhhHHhhc---------ceeecCCCceEEEEEcCCCcc--ceecccc--c Q lcl|NC_021309. 151 NPFGSTGTFAPGILPT--FLPG-IVEQLFYELSLADLI---------SSRPVTSPNLSYLTESAAHNN--AAAVAEA--G 214 (497) Q Consensus 151 ~~~~~~~~~g~~v~p~--~~~~-ii~~~~~~~~l~~~~---------~~~~~~~~~~~~p~~~~~~~~--a~wv~Eg--~ 214 (497) |+. +.-..+++|+ .... +.+...+.+.|++-. .....++..+++|....-++. ..+...+ + T Consensus 1 Ma~---T~l~D~iipe~~vf~~Yv~~~~~e~~~l~qSGii~~d~~l~~~~~~gG~~~~iPf~~~L~g~~e~nv~~D~~~~ 77 (349) T protein:vir:78 1 MAI---TTIGDIVTGNIPVLASYMTEDPVEKTAFFDSGILTSTPYAAEIANGPSNIANLPFWKAIDTSIEPNYSNDVYQD 77 (349) T ss_pred CCc---eEEeeeeccCHHHHHHHHHHhhHHhhhhhhccceeccHHHHHHhhcCCCEEEeeeeecCCCCcccccCCCCccc Confidence 221 1123356665 2233 333333333433311 111234566788877543332 2222221 2 Q ss_pred cccccc-ccceeeEeeeeeEEee--ehhhHHHHhhHHHHHHHHHHHHHHHHHHHHHhhhhcccCcccccccccccccccc Q lcl|NC_021309. 215 TYPFSS-EEFARVYEQVGKVANA--LTITDEGLRDAPELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTA 291 (497) Q Consensus 215 ~~~~s~-~~f~~i~~~~~kla~~--~~iS~ell~d~~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~ 291 (497) ..+..+ .++.++-.....--++ ..++.++--+ +..+.|.++++....+.....+|. ...|++........ T Consensus 78 ~~t~~kitt~~~~a~~~~r~kaw~~~Dla~~lsG~--dpm~~Ia~~va~yW~r~~q~~Lia-----~L~Gvf~~~~~a~~ 150 (349) T protein:vir:78 78 IATPRAIQTGEMMARVAYLNEGFGQADLTVELTSQ--NPLQSVASRLDNFWQRQAQRRLIA-----TALGLYNDNVSATD 150 (349) T ss_pred ccccccccccceeeeeeeeccccchhHHHHHhhCc--hHHHHHHHHHHHHHhhHHHHHHHH-----HHHHhhcccccccc Confidence 223333 2344444433333333 2344444332 445666676766555544444432 12344432211000 Q ss_pred ccccchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhh--- Q lcl|NC_021309. 292 SSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLT--- 368 (497) Q Consensus 292 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--- 368 (497) .... ...+..... .... .....+.++...+... T Consensus 151 ~~~~---------~~~~t~d~s-~~a~----------------------------------~~~~~~~dA~~~lgda~~G 186 (349) T protein:vir:78 151 AYHE---------QNDMVVDVS-ATLG----------------------------------FDAGAFIDATQTMGDALMG 186 (349) T ss_pred hhhh---------cccceeeec-cccC----------------------------------CChhhhhhhHHHHHHHhcc Confidence 0000 000000000 0000 0001111111111111 Q ss_pred -hccCCceEEechhHHHHHHHHhhhcCceeccCcccccccccccccccccccceEecCCCCcC------c---eEEEeec Q lcl|NC_021309. 369 -LFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLG------T---ILVGHFA 438 (497) Q Consensus 369 -~~~~~~~~~~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~------~---~~~gd~~ 438 (497) ....-.+++||+..+..|++++--+ |+ ++ .. ....-++++|++||+++.||.. . ++|| T Consensus 187 d~~~~lt~i~mHS~v~~~L~~~~li~--~i-~~-s~-----~~~~i~ty~G~~VivDD~~Pv~~~g~~~~yttylfg--- 254 (349) T protein:vir:78 187 NGGEVLGAIAMHSFVYAQARKAQLID--FI-RD-AE-----NNTMFATYQGYRVIVDDSMTVVGQGAQRKFISIIFG--- 254 (349) T ss_pred ccccceeEEEEchHHHHHHHhhhhhh--hc-cC-cc-----cCcccceecCeEEEEeCCCccccCCCCceEEEEEee--- Confidence 1122346899999999998764311 11 10 00 1112467899999999999942 1 2444 Q ss_pred cceEEEEeecc-cEEEeecccchhhhcCceEEEEEEeecceeecccceEEEEeeCCC----CCC Q lcl|NC_021309. 439 PSVIQTARREG-VTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGA----TGS 497 (497) Q Consensus 439 ~~~~~i~~r~~-~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~~a----~~~ 497 (497) .+++...+-.. ..+++.+.....=..++-.+....|+ ++||..|..-.-..+. .++ T Consensus 255 ~GAi~~~~~~~~~~~et~rd~~~g~~~G~d~l~~R~~~---~~hp~G~s~~~a~v~~~~~~~~~ 315 (349) T protein:vir:78 255 QGAIGYGEGNPVMPLEYEREASRANGGGVETLWTRKTW---LLHPFGYRFTSAVITGNGTETIA 315 (349) T ss_pred cceEEEccCCCccceeeecccccCCcceeEEEEEeeEE---EeeeeeeeeccccccCCcccccc Confidence 33444433211 12343332210001245555555555 6678777665422111 111 No 209 >protein:vir:3783 Length: 336 # NCBI annotation: capsid # Family: family:all:201 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536823;genbank:gi:17981832;genbank:GeneID:929211 Probab=86.52 E-value=0.045 Score=27.97 Aligned_cols=312 Identities=10% Similarity=0.028 Sum_probs=136.5 Q ss_pred HHHHHHHHHHHHHhhhhhhhhhhhhccccc-----cccccccchhhhHHHHHHHHhhhhHHhhcceeecCCCceEEEEEc Q lcl|NC_021309. 127 PGTAAAELMGAFADGETAPAAIGQNPFGST-----GTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTES 201 (497) Q Consensus 127 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~g~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~ 201 (497) ..+ +....+.. .. +...+.. -..-..|.|.....+...+.+.+.+++++++++++--.++..-.. T Consensus 1 mtr---~~~~~y~~----~~---A~~ngv~~a~~~~~~~Fsv~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg 70 (336) T protein:vir:37 1 MNK---QAYYALAA----AL---AKHFNQPLDSVLRGESFALKAPEAALLGENIQQRSDFLKGINMVQVAHTKGTKLFGA 70 (336) T ss_pred CcH---HHHHHHHH----HH---HHHhCCChhhhcccceeecCHHHHHHHHHHHHHHHHHhhcCceeecccccceEEeec Confidence 000 00000000 00 0011111 111245677777888888899999999999999885444333222 Q ss_pred CCCccceecccccccccccccceeeEeeeeeEEeeehhhHHHHhhHHHHH----HHHHHHHHHHHHHHHHhhh--hcccC Q lcl|NC_021309. 202 AAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTITDEGLRDAPELF----NFVQGRLLEGIQRKEEVQL--LAGGG 275 (497) Q Consensus 202 ~~~~~a~wv~Eg~~~~~s~~~f~~i~~~~~kla~~~~iS~ell~d~~~l~----~~i~~~la~~~~~~~d~~~--l~G~G 275 (497) ...+-++-..-+. .......+.-.+.+++.-.-+.|+.+.|+..+.+. ..+...+.+.+ ++|.-. ++|+- T Consensus 71 ~~g~iagrtdt~r--~r~~~~l~~~~Y~c~qTn~dt~i~y~~LD~WA~~~d~~~~~~~~~~~r~i--ALD~i~IGfnG~s 146 (336) T protein:vir:37 71 TEKGVTGRKQTGR--NLATLDHSQNGYELSETDSGILVNWSLFDSFAIFKDRLVELYSEYFQNQV--ALDILQIGWNGQS 146 (336) T ss_pred cCcccccccCCCC--CccccCCCCCccEEEEeeeeeeccHHHHHHHhcChhHHHHHHHHHHHHHH--hcchhhhccccee Confidence 1111121111111 11122344455566666666778889998764333 33333333333 344333 33322 Q ss_pred ccccccccccccccccccccchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhh--------cccccccc Q lcl|NC_021309. 276 YPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAG--------SGSGVAGS 347 (497) Q Consensus 276 ~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~ 347 (497) .... .. - ...-..+..|+..++......-... ...+.... T Consensus 147 ~A~~-----------Td--n-------------------PllqDVNkGWlQ~~Re~a~~~v~~~~~~~~g~i~~~G~~gd 194 (336) T protein:vir:37 147 VATN-----------TT--K-------------------TDLSDVNKGWLKLLQEQRAANFMTESTKSSGKITIFGDNAD 194 (336) T ss_pred eccC-----------CC--C-------------------ccccccchhHHHHHHhccchhhcccccccCCceEEecCCCC Confidence 1100 00 0 0001112222222222111100000 00111122 Q ss_pred cchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHHHHH--HHhhhcCceeccCcccccccccccccccccccceEecC Q lcl|NC_021309. 348 YPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLR--LTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTP 425 (497) Q Consensus 348 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~--~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~ 425 (497) +.+.+. ...+++..+...++..+.-+++--.++..-. .+-..+|.. |.-...........+|-|+|.+..+ T Consensus 195 y~NLDa---lV~D~~~~I~~~~~~d~dLVvivG~dLla~~~~~l~~~~~~~----PtE~~Aa~~~~~~k~iGGlpa~~~P 267 (336) T protein:vir:37 195 YANLDD---LAFDLKQGLDFRHQNRNDLVFLVGADLVSKETKLIQQKHGLT----PTEKAALGSHNLMGSFGGMNAITPP 267 (336) T ss_pred cccHHH---HHHHHHhccchHHhcCCCeEEEEchhhhhhhhhhhhhhcCCC----HHHHHHHHHHHHHHhhCCceEEEcc Confidence 333333 2444455567777777775544444432222 122222110 0000000011124579999999999 Q ss_pred CCCcCceEEEeeccceEEEEeecccE-EEeecccchhhhcCceEEEEEEeecceeecccceEEEEeeCCCCCC Q lcl|NC_021309. 426 LIPLGTILVGHFAPSVIQTARREGVT-MQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 426 ~~~~~~~~~gd~~~~~~~i~~r~~~~-i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~~a~~~ 497 (497) ++|.+.+++=-|+- +.|+...+-. -.+-+.. .+|++.-.=..=-++.|-+++.++.+...+..-+- T Consensus 268 ffP~~~~lVT~L~N--LsIY~Q~gs~RR~~~d~p----~r~rie~y~s~Ne~YvVEd~~~~a~iE~i~v~~~~ 334 (336) T protein:vir:37 268 NFPARAAAVTTLKN--LSVYTEAESVRRSLRNDE----DKKGLVTSYYRQEGYVVEDLGLMTAIDHTKVKLNG 334 (336) T ss_pred ccCCCceEEeeccc--cEEEEecCcEEEEEEEcc----ccccccchhhhcceeeeeccccEEEeeeeeeeccc Confidence 99999988766664 3333333322 1221111 13333333334456667778877777665555444 No 210 >protein:vir:1663 Length: 393 # NCBI annotation: unknown # Family: family:all:2417 # MgeID: mge:34 # MgeName: sk1 # Cross-refs: genbank:acc:NP_044952;genbank:gi:9629659;genbank:GeneID:1261309 Probab=85.82 E-value=0.05 Score=27.72 Aligned_cols=380 Identities=15% Similarity=0.119 Sum_probs=131.9 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHhHhHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021309. 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKA--HQAEVEAHERAQEMLKSLGGADAAKDGLDNDIP 78 (497) Q Consensus 1 ~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~--~~~~~~~~e~~~e~~~~~~~~~a~~~~~~~~~~ 78 (497) |-.+--.+.|- . ++|+++-...+..++.. .+..++...+++|+..-+.++.-++...++++. T Consensus 1 mnkpdliekqn--r--------------laelkennvslksqisgfevknaiedl~K~~ELe~TlSe~~iEI~k~en~LN 64 (393) T protein:vir:16 1 MNKPDLIEKQN--R--------------LAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELN 64 (393) T ss_pred CCCcchhhhhh--h--------------hhhhhhcccchhhhccchhhhhhhhhchhHHHHHHhHhhcchhhhhhhhhhh Confidence 43332222211 1 22222222222222221 122233334455655555555555554444432 Q ss_pred HHHHHHHHHHHHHHHhhhhhhHHHHhHhhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccc Q lcl|NC_021309. 79 EVEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGT 158 (497) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (497) ..+.. ..++. .+..-.+. ......|......+ ....+.+.++..... +...+.++. T Consensus 65 ~~eE~-------~KGK~-kMt~~ies---q~A~~eF~~vL~~N-------~G~S~~k~AW~A~L~------E~GVtiTD~ 120 (393) T protein:vir:16 65 AQEEK-------PKGKD-KMTNFIES---QNAVTEFFDVLKKN-------SGKSEIKNAWSAKLA------ENGVTITDT 120 (393) T ss_pred hhhhc-------chhhH-HHHHHHhh---HHHHHHHHHHHhcc-------CCchhhhhhhhhhHh------hcCcceecc Confidence 21110 00000 00000000 00000011000000 001122222222111 111111222 Q ss_pred cccccchhhhHHHHHHHHhhhhHHhhcceeecCCCceEEEEEcCCCccceecccccccccccccceeeEeeeeeEEeeeh Q lcl|NC_021309. 159 FAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALT 238 (497) Q Consensus 159 ~g~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~wv~Eg~~~~~s~~~f~~i~~~~~kla~~~~ 238 (497) ...+|.-++..|-..+....+++....+...+.--++..-++ .+.|.-.-.|.++.+...+|..-++.+.-+ |+. T Consensus 121 -~~~LP~~lv~sI~~A~~n~n~v~~vfHVT~~~~~~V~~s~~s--~~eAq~HkdGqTK~eqa~~~~~~Tl~~~~V--Y~~ 195 (393) T protein:vir:16 121 -TFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSFDS--ANEAQVHKDGQTKTEQAATLTIDTLEPVMV--YKL 195 (393) T ss_pred -chhccHHHHHHHHHhhhccCcceeeeeeccchhhhHHhhhhh--hhhhhhhccCCccccceeeeeeechhHHHH--HHH Confidence 123333344455555556666666444433322111111111 123444455677777777777666665433 333 Q ss_pred hh-HHHHh---hH-HHHHHHHHHHHHHHHH-HHHHhhhhcccCccccccccccccccccccccchhhhhhhHHHHHHhhh Q lcl|NC_021309. 239 IT-DEGLR---DA-PELFNFVQGRLLEGIQ-RKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPA 312 (497) Q Consensus 239 iS-~ell~---d~-~~l~~~i~~~la~~~~-~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 312 (497) .| -++.. .+ ..|..||..+|+.++. +..|.+++-|+|++.+..+..-+.+-.+..... T Consensus 196 ~S~Ae~~K~~~~sYsel~N~i~~ELtQ~~vnk~Vd~AlV~GDG~N~f~~~DK~advK~I~k~Tt---------------- 259 (393) T protein:vir:16 196 QSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITT---------------- 259 (393) T ss_pred HHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHhhhheecCCCCccchhhHHHHHHHHHHhh---------------- Confidence 33 23333 23 4589999999999998 899999999999986554433221111110000 Q ss_pred hhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHH-HHHHHhh Q lcl|NC_021309. 313 DGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWE-LLRLTKD 391 (497) Q Consensus 313 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~-~l~~lkd 391 (497) .....+. .+ ..+.+-.+.--+.+..+ ....++...+.. -|+.|+. T Consensus 260 ------------------kaksagk-------tp-------fadaieeavdfvrptag--rrylivktedrkalldelrq 305 (393) T protein:vir:16 260 ------------------KAKSAGK-------TP-------FADAIEEAVDFVRPTAG--RRYLIVKTEDRKALLDELRQ 305 (393) T ss_pred ------------------hhhhcCC-------Cc-------hhHHHHHHHhhhccCCC--ceEEEEeccchHHHHHHHHh Confidence 0000000 00 00111111111111111 011222222322 2233332 Q ss_pred hcCc--eeccCcccccccccccccccccccc-eEecCCC-CcCceEEEeeccceEEEEeecccEEEeecccchhhhcCce Q lcl|NC_021309. 392 ANGQ--YMGGNFFGNAYGNPVNGGKNIWGVP-VVTTPLI-PLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKV 467 (497) Q Consensus 392 ~~G~--~i~~~~~~~~~~~~~~~~~~l~G~P-vv~~~~~-~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v 467 (497) ++.+ .-...+-. ....-.|+- +++.... .-..-++.|-+ |.|. -+++ ...+..-|..|.- T Consensus 306 atananvriknddt--------eiasevgvdeiivytgskalkptvlvdqk---yhid-mqdl----tkvdafewktnsn 369 (393) T protein:vir:16 306 ATANANVRIKNDDT--------EIASEVGVDEIIVYTGSKALKPTVLVDQK---YHID-MQDL----TKVDAFEWKTNSN 369 (393) T ss_pred hhccCceeeeccch--------hhhhhcCcceeeeeeccccccceeeeccc---cccc-hhhh----hhhhhheeccCCc Confidence 2211 00000000 000001111 0111110 00111111211 2221 1111 1112223555555 Q ss_pred EEEEEEeecceeecccceEEEEee Q lcl|NC_021309. 468 TVRAEERLGLLVYRPSAFQLIQLK 491 (497) Q Consensus 468 ~~r~~~r~~~~v~~~~a~~~l~~~ 491 (497) -+.++.-..+.|..-.|=++++.+ T Consensus 370 milvetltsghvetynagavitvs 393 (393) T protein:vir:16 370 MILVETLTSGHVETYNAGAVITVS 393 (393) T ss_pred eEEEeecccCcceeeccceeEeeC Confidence 555555566666554444444544 No 211 >protein:vir:4700 Length: 415 # NCBI annotation: phi PVL ORF 7 homologue # Family: family:all:21 # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061632;genbank:gi:9635719;genbank:GeneID:1262976 Probab=85.26 E-value=0.054 Score=27.53 Aligned_cols=388 Identities=13% Similarity=0.071 Sum_probs=93.2 Q ss_pred Cch-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021309. 1 MPS-TAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPE 79 (497) Q Consensus 1 ~~~-~a~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~a~~~~~~~~~~~ 79 (497) |-. ..++.++.....+++++...+.. .+..+++.+++++.+.+++..+ ++++++++........... T Consensus 7 m~~~l~el~~~~~~~~~e~~~~~~~~~------~e~~~~~~~ev~~l~~~i~~~~------~~~~~~~~~~~~~~~~~~~ 74 (415) T protein:vir:47 7 LQSEISDIKRQIDLKVKYATRALNNDE------LEKAEKLEQEITDLRSQIQEKQ------EELDKLKEKDRTSENNQQS 74 (415) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhchhh------HHHHHHHHHHHHHHHHHHHHHH------HHHHHHHHHHHhhhhcccc Confidence 554 34444444444444444332211 1112222223332222222111 1111111111110000000 Q ss_pred HHHHHHHHHHHHHHhhhhhhHHHHhHhhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhcccccccc Q lcl|NC_021309. 80 VEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTF 159 (497) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (497) .+. ...+........ ............... ....+... ....... ........++...- T Consensus 75 ~~~--------~~~~~~~~~~~~-~~~~~~~~~~~~~~~---~~~~~~~~--~~~~~~~-------~~~~~~t~~g~~~i 133 (415) T protein:vir:47 75 VEV--------NEARTYRNQANI-NDLGISIQNTKVTSQ---EVRDFTEY--LETRNDI-------QGGSLKTDSGFVVI 133 (415) T ss_pred ccc--------chhhhhHHHHHH-HHHHHhhhhhhhhHH---HHHHHHHH--Hhhhhhh-------hhccccccCCcccc Confidence 000 000000000000 000000000000000 00000000 0000000 00000000110011 Q ss_pred ccccchhhhHH-----HHHHHHhhhhHHhhcceee---cCCC-ceEEEEEcCCCccceecccccccccccccceeeEeee Q lcl|NC_021309. 160 APGILPTFLPG-----IVEQLFYELSLADLISSRP---VTSP-NLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQV 230 (497) Q Consensus 160 g~~v~p~~~~~-----ii~~~~~~~~l~~~~~~~~---~~~~-~~~~p~~~~~~~~a~wv~Eg~~~~~s~~~f~~i~~~~ 230 (497) ...+.++++.. .+..+-..-++......++ .+++ ...+.-+. .-..|.........++.--.+.. T Consensus 134 P~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~Eg------~~~~~~~~~~~~~v~~~~~k~~~ 207 (415) T protein:vir:47 134 PEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEEL------EENPELAVKPFFQLAYDINTHRG 207 (415) T ss_pred cHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEEecCCcceeecccc------cccccccccceeeEEeeeeeeEe Confidence 11222222221 2222211112221111122 1222 22221111 11222222222333333333333 Q ss_pred eeEEeeehhhHHHHhhHHHHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccchhhhhhhHHHHHHh Q lcl|NC_021309. 231 GKVANALTITDEGLRDAPELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKF 310 (497) Q Consensus 231 ~kla~~~~iS~ell~d~~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 310 (497) .-....--+.+.-..-...|...+...+++.+...+-...-.|...+...+........... ................. T Consensus 208 ~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~~~~~~~~~~~~~~~~~-~~~~~~~i~~~~~~~~~ 286 (415) T protein:vir:47 208 YFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVK-KAKSLDDIKDAINLNVK 286 (415) T ss_pred eehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCCccccccccccccceeccc-cccchHHHHHHHHhhhh Confidence 22222112221111112345666666677776666665555554433322222222222222 12222222222222223 Q ss_pred hhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHHHHHHHh Q lcl|NC_021309. 311 PADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTK 390 (497) Q Consensus 311 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lk 390 (497) .....+.++++...+..+...+...+.+.+.+......+ .. +. +..++.++..+ . T Consensus 287 ~~~~~~~~v~n~~~~~~L~~lkd~~G~~i~~~~~~~~~~--~~-----l~------------G~pV~~~~~~~------~ 341 (415) T protein:vir:47 287 PNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQ--QR-----LL------------GAKIEILPDEV------L 341 (415) T ss_pred hccCCCEEEEcHHHHHHHHHhhccCCCeeeccCcCCCCC--cc-----cc------------ceeeEEecccc------c Confidence 333344555666666655554444333332221110000 00 00 00011111000 0 Q ss_pred hhcC--ceeccCcccccccccccccccccccceEecCCCCcCceEEEeeccceEEEEeecccEEE---------eecccc Q lcl|NC_021309. 391 DANG--QYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQ---------MTNSNG 459 (497) Q Consensus 391 d~~G--~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~~~~gd~~~~~~~i~~r~~~~i~---------~~~~~~ 459 (497) .+.| ..++.. +...+.. ..-.|+.|-.+++.-..+. ++...|-+..+. ++.... T Consensus 342 ~~~~~~~~~~gd-~~~~~~~-----~~~~~~~v~~~~~~~~~~~---------~~~~~r~d~~v~~~~a~~~~~~~~~~~ 406 (415) T protein:vir:47 342 GQKGNNTLIIGN-LKDAIVL-----FDRSQYQASWTDYMHFGEC---------LMIAVRQDCRILDYKSAIVIEYDDSER 406 (415) T ss_pred cCCCccEEEEEe-hhccEEE-----EeecceEEEeeccccCceE---------EEEEEEeccEEeccccEEEEEeeccCC Confidence 0001 011111 0000000 0001222222222111111 111122222111 111100 Q ss_pred hhhhcCceEEEE Q lcl|NC_021309. 460 TDFVDGKVTVRA 471 (497) Q Consensus 460 ~~f~~~~v~~r~ 471 (497) ..+..++.+ T Consensus 407 ---~~~~~~~~~ 415 (415) T protein:vir:47 407 ---GEGDLGLEA 415 (415) T ss_pred ---CCCCccCCC Confidence 011111111 No 212 >protein:vir:4600 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058445;genbank:gi:9635171;genbank:GeneID:1262708 Probab=85.26 E-value=0.054 Score=27.53 Aligned_cols=388 Identities=13% Similarity=0.071 Sum_probs=93.2 Q ss_pred Cch-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021309. 1 MPS-TAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPE 79 (497) Q Consensus 1 ~~~-~a~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~a~~~~~~~~~~~ 79 (497) |-. ..++.++.....+++++...+.. .+..+++.+++++.+.+++..+ ++++++++........... T Consensus 7 m~~~l~el~~~~~~~~~e~~~~~~~~~------~e~~~~~~~ev~~l~~~i~~~~------~~~~~~~~~~~~~~~~~~~ 74 (415) T protein:vir:46 7 LQSEISDIKRQIDLKVKYATRALNNDE------LEKAEKLEQEITDLRSQIQEKQ------EELDKLKEKDRTSENNQQS 74 (415) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhchhh------HHHHHHHHHHHHHHHHHHHHHH------HHHHHHHHHHHhhhhcccc Confidence 554 34444444444444444332211 1112222223332222222111 1111111111110000000 Q ss_pred HHHHHHHHHHHHHHhhhhhhHHHHhHhhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhcccccccc Q lcl|NC_021309. 80 VEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTF 159 (497) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (497) .+. ...+........ ............... ....+... ....... ........++...- T Consensus 75 ~~~--------~~~~~~~~~~~~-~~~~~~~~~~~~~~~---~~~~~~~~--~~~~~~~-------~~~~~~t~~g~~~i 133 (415) T protein:vir:46 75 VEV--------NEARTYRNQANI-NDLGISIQNTKVTSQ---EVRDFTEY--LETRNDI-------QGGSLKTDSGFVVI 133 (415) T ss_pred ccc--------chhhhhHHHHHH-HHHHHhhhhhhhhHH---HHHHHHHH--Hhhhhhh-------hhccccccCCcccc Confidence 000 000000000000 000000000000000 00000000 0000000 00000000110011 Q ss_pred ccccchhhhHH-----HHHHHHhhhhHHhhcceee---cCCC-ceEEEEEcCCCccceecccccccccccccceeeEeee Q lcl|NC_021309. 160 APGILPTFLPG-----IVEQLFYELSLADLISSRP---VTSP-NLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQV 230 (497) Q Consensus 160 g~~v~p~~~~~-----ii~~~~~~~~l~~~~~~~~---~~~~-~~~~p~~~~~~~~a~wv~Eg~~~~~s~~~f~~i~~~~ 230 (497) ...+.++++.. .+..+-..-++......++ .+++ ...+.-+. .-..|.........++.--.+.. T Consensus 134 P~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~Eg------~~~~~~~~~~~~~v~~~~~k~~~ 207 (415) T protein:vir:46 134 PEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEEL------EENPELAVKPFFQLAYDINTHRG 207 (415) T ss_pred cHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEEecCCcceeecccc------cccccccccceeeEEeeeeeeEe Confidence 11222222221 2222211112221111122 1222 22221111 11222222222333333333333 Q ss_pred eeEEeeehhhHHHHhhHHHHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccchhhhhhhHHHHHHh Q lcl|NC_021309. 231 GKVANALTITDEGLRDAPELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKF 310 (497) Q Consensus 231 ~kla~~~~iS~ell~d~~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 310 (497) .-....--+.+.-..-...|...+...+++.+...+-...-.|...+...+........... ................. T Consensus 208 ~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~~~~~~~~~~~~~~~~~-~~~~~~~i~~~~~~~~~ 286 (415) T protein:vir:46 208 YFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVK-KAKSLDDIKDAINLNVK 286 (415) T ss_pred eehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCCccccccccccccceeccc-cccchHHHHHHHHhhhh Confidence 22222112221111112345666666677776666665555554433322222222222222 12222222222222223 Q ss_pred hhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHHHHHHHh Q lcl|NC_021309. 311 PADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTK 390 (497) Q Consensus 311 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lk 390 (497) .....+.++++...+..+...+...+.+.+.+......+ .. +. +..++.++..+ . T Consensus 287 ~~~~~~~~v~n~~~~~~L~~lkd~~G~~i~~~~~~~~~~--~~-----l~------------G~pV~~~~~~~------~ 341 (415) T protein:vir:46 287 PNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQ--QR-----LL------------GAKIEILPDEV------L 341 (415) T ss_pred hccCCCEEEEcHHHHHHHHHhhccCCCeeeccCcCCCCC--cc-----cc------------ceeeEEecccc------c Confidence 333344555666666655554444333332221110000 00 00 00011111000 0 Q ss_pred hhcC--ceeccCcccccccccccccccccccceEecCCCCcCceEEEeeccceEEEEeecccEEE---------eecccc Q lcl|NC_021309. 391 DANG--QYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQ---------MTNSNG 459 (497) Q Consensus 391 d~~G--~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~~~~gd~~~~~~~i~~r~~~~i~---------~~~~~~ 459 (497) .+.| ..++.. +...+.. ..-.|+.|-.+++.-..+. ++...|-+..+. ++.... T Consensus 342 ~~~~~~~~~~gd-~~~~~~~-----~~~~~~~v~~~~~~~~~~~---------~~~~~r~d~~v~~~~a~~~~~~~~~~~ 406 (415) T protein:vir:46 342 GQKGNNTLIIGN-LKDAIVL-----FDRSQYQASWTDYMHFGEC---------LMIAVRQDCRILDYKSAIVIEYDDSER 406 (415) T ss_pred cCCCccEEEEEe-hhccEEE-----EeecceEEEeeccccCceE---------EEEEEEeccEEeccccEEEEEeeccCC Confidence 0001 011111 0000000 0001222222222111111 111122222111 111100 Q ss_pred hhhhcCceEEEE Q lcl|NC_021309. 460 TDFVDGKVTVRA 471 (497) Q Consensus 460 ~~f~~~~v~~r~ 471 (497) ..+..++.+ T Consensus 407 ---~~~~~~~~~ 415 (415) T protein:vir:46 407 ---GEGDLGLEA 415 (415) T ss_pred ---CCCCccCCC Confidence 011111111 No 213 >protein:vir:95875 Length: 401 # NCBI annotation: major coat protein # Family: family:all:10944 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950534;genbank:gi:119952248;genbank:GeneID:5075702 Probab=83.33 E-value=0.069 Score=26.94 Aligned_cols=310 Identities=13% Similarity=0.019 Sum_probs=132.8 Q ss_pred hhhhhhh-hhhccccccccccccchh-hhHHHHHHHHhhhhHHhhcceeecCCC---ceEEEEEcCCCccceecccccc- Q lcl|NC_021309. 142 ETAPAAI-GQNPFGSTGTFAPGILPT-FLPGIVEQLFYELSLADLISSRPVTSP---NLSYLTESAAHNNAAAVAEAGT- 215 (497) Q Consensus 142 ~~~~~~~-~~~~~~~~~~~g~~v~p~-~~~~ii~~~~~~~~l~~~~~~~~~~~~---~~~~p~~~~~~~~a~wv~Eg~~- 215 (497) ......- .....+..++.|.-+-.- +....+....+...+.+++.+.+++.+ .+..-+.......-.-..||-+ T Consensus 1 ~~~~~a~~~~~~~s~~g~~~~~~~t~y~~~k~L~~Aa~~lv~~~fA~~~piPkn~GkTIk~r~y~pl~~~~~pl~eGv~a 80 (401) T protein:vir:95 1 MLNYNAPTDGQKSSIDGANSDQMQTFFWLKKAIITARKEQYFMPLASVTNMPKHYGKTIKVYEYVPLLDDRNINDQGIDA 80 (401) T ss_pred CCccCCCcccccccccccccceeeehhhHHHHHhhhhhhhhhhhcccccccccccCCeEEEEecccccccccchhcCCCc Confidence 1111000 011111122222222222 335666666667888889999888754 2333222211111111222221 Q ss_pred ---------------------------------cccccccceeeEeeeeeEEeeehhhHHHHh-hH-HHHHHHHHHHH-H Q lcl|NC_021309. 216 ---------------------------------YPFSSEEFARVYEQVGKVANALTITDEGLR-DA-PELFNFVQGRL-L 259 (497) Q Consensus 216 ---------------------------------~~~s~~~f~~i~~~~~kla~~~~iS~ell~-d~-~~l~~~i~~~l-a 259 (497) ......+-..+..+.++++.+.++|+++.+ +. +.|..-|..+| . T Consensus 81 ~G~~~~~g~~y~~~rdv~~it~~m~~~t~~~~rvn~v~~~~~d~~g~l~qyG~~~e~Td~~~dt~~D~~l~~h~s~ell~ 160 (401) T protein:vir:95 81 SGATIVNGNLYGSSKDIGNITSKLPLLTENGGRVNRVGFTRIAREGSIHKFGFFYEFTQESIDFDSDDGLMEHLSRELMN 160 (401) T ss_pred ccccccCccccccccccceeecccccccccccccccccceeeeeeeeeeeccCccchhhhhhhhhcchHHHHHHHHHHhh Confidence 111233445566789999999999999876 33 45665432233 2 Q ss_pred HHH---HHHHHhhhhcccCcccccccc-ccccccccccccchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhh Q lcl|NC_021309. 260 EGI---QRKEEVQLLAGGGYPGVNGLL-QRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVT 335 (497) Q Consensus 260 ~~~---~~~~d~~~l~G~G~~~p~Gi~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 335 (497) -+. ...+-..+|++.++---.|-. ..++.. .......... ...+..+......+ T Consensus 161 g~~~~t~d~i~~dll~ag~~viyAg~ats~At~~-----~~~~~~t~vt-----------------~~~l~rl~~~L~~n 218 (401) T protein:vir:95 161 GATQITEAVLQKDLLAAAGTVLYAGAATSDATIT-----GEGSTPSVVS-----------------YKNLMRLDQILTEN 218 (401) T ss_pred hhhhhHHHHHHHHHHhhcCeeecCCccceeeecc-----ccccccceec-----------------hhHHHHHHHHHHhc Confidence 222 333445566554321100100 000000 0000000000 00000000000000 Q ss_pred hhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHHHHHHHhhhcCceeccCccc--cccccccccc Q lcl|NC_021309. 336 GAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFG--NAYGNPVNGG 413 (497) Q Consensus 336 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~i~~~~~~--~~~~~~~~~~ 413 (497) ... .- ..++..........-...-+-++++.....|+.++|-.|.+-|.+..- ........+. T Consensus 219 Rap-----------k~----t~~i~~s~~~dTk~i~~s~va~~h~~L~~di~a~~D~~~~~~fi~v~kYa~~~~i~~gEi 283 (401) T protein:vir:95 219 RTP-----------TQ----TTIITGSRMIDTKVIGATRVMYVGSELVPELKAMKDLFGNKAFIETQHYADAGTIMNGEV 283 (401) T ss_pred ccc-----------cc----hhhhhhhhccCccccccceEEEEecCchhHHHHHHHhcCCCCceehhhcCCccccccccc Confidence 000 00 000000000000000112234778888889999999888877765332 2223334456 Q ss_pred ccccccceEecCCCC--------cCc---------------------eEEEeeccceEEEEeeccc------EEEeecc- Q lcl|NC_021309. 414 KNIWGVPVVTTPLIP--------LGT---------------------ILVGHFAPSVIQTARREGV------TMQMTNS- 457 (497) Q Consensus 414 ~~l~G~Pvv~~~~~~--------~~~---------------------~~~gd~~~~~~~i~~r~~~------~i~~~~~- 457 (497) ..|-++++|+++.+- ++. .++|.- +|...+-++. .+.+-.. T Consensus 284 G~i~~vR~i~~p~~~~w~~ag~~a~~~~~~y~~~~~~~gg~~dVyp~lV~G~d---Af~~~~l~g~g~~~~~~~ivk~pG 360 (401) T protein:vir:95 284 GSIDKFRIIQVPEMLHWAGAGAQATGANPGYRTSMVSGQEHYDVYPMLVVGDD---SFTSIGFQTDGKSLKFTVMTKMPG 360 (401) T ss_pred cccCceeEEecccceeecCCcccccccccccccccccCCCcceeeeeeEEccc---cceecccccCCccccceeEeecCC Confidence 678889999888743 110 123322 1222222221 2222211 Q ss_pred ------cchhhhcCceEEEEE-EeecceeecccceEEEEeeCCC Q lcl|NC_021309. 458 ------NGTDFVDGKVTVRAE-ERLGLLVYRPSAFQLIQLKKGA 494 (497) Q Consensus 458 ------~~~~f~~~~v~~r~~-~r~~~~v~~~~a~~~l~~~~~a 494 (497) ...+ ||+++-++ +..++.+++++-++++.-.+.- T Consensus 361 ~~~ad~~DPl---gQ~g~vgwK~~~a~~vL~~e~m~~ies~a~~ 401 (401) T protein:vir:95 361 KETADRNDPY---GETGFSSIKWYYGILVKRPERLALIKTVAPL 401 (401) T ss_pred cCCCCCCCcc---cceehhhhhhhhhhheeccceeEEEEeecCC Confidence 2222 34444443 4567788999999999877777 No 214 >protein:vir:3525 Length: 423 # NCBI annotation: major head protein # Family: family:all:1412 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050985;genbank:gi:9633571;genbank:GeneID:1262318 Probab=77.35 E-value=0.13 Score=25.52 Aligned_cols=268 Identities=10% Similarity=-0.030 Sum_probs=107.8 Q ss_pred hccccccccccccchhhhHHHHHHHHhhhhHHhhcceeec-------CCCceEEEEEcCCCccceecccccccccccccc Q lcl|NC_021309. 151 NPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPV-------TSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEF 223 (497) Q Consensus 151 ~~~~~~~~~g~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~-------~~~~~~~p~~~~~~~~a~wv~Eg~~~~~s~~~f 223 (497) |+ .+-. ..+|..+....++.+++.+++.+++..-.- .++++++++.......-+-.+.+......+.+- T Consensus 1 MA-N~ll---T~iP~iia~~al~~l~~~lV~~~lV~r~y~ge~~~a~~GDTV~I~~p~~~~v~d~~~~~~~~~~~~~~~e 76 (423) T protein:vir:35 1 MA-NNLE---SNISQIVLKKFLPGFMSDIVLCKTVDRQLLSGEINSNTGDSVSFKRPHQFKSERTETGDITGKDKNGLFS 76 (423) T ss_pred Cc-cchh---hhhHHHHHHHHHHHHHhhcccchhcccCCCcccccccCCCEEEEeeCCcceeecccCcCCCCcccccccc Confidence 22 1111 123444667899999999998887765221 255777776543211111111122222233333 Q ss_pred ee--eEeeeeeEEeeehhhHHHHhhHHHHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccchhhhh Q lcl|NC_021309. 224 AR--VYEQVGKVANALTITDEGLRDAPELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGAT 301 (497) Q Consensus 224 ~~--i~~~~~kla~~~~iS~ell~d~~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~ 301 (497) .+ +++..+|...+.-=+.|+.++..+++.++...+ ++++..+|..++..--.+.+. ..+.... T Consensus 77 ~~v~l~id~~k~~a~~v~d~e~~l~i~~~~~~l~~a~-~ala~~vd~~l~~~l~~~a~~----~vgt~~t---------- 141 (423) T protein:vir:35 77 AKATGKVGKYITVAVEWTQIEEALKLNQLDQILSPIH-ERMVTDLETELAHFMMNNGAL----SLGSPNT---------- 141 (423) T ss_pred ceeeEEeccceeccceeCHHHHHhhHHHHHHHHHHHH-HHHHHHHHHHHHHHHhhcccc----ccccccC---------- Confidence 33 444555544443334555556667777777664 678888888876421000000 0000000 Q ss_pred hhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhcc-CCceEEech Q lcl|NC_021309. 302 SATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQ-TPNAVVMNP 380 (497) Q Consensus 302 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~n~ 380 (497) ....++.+.++...+....-- .....+++| T Consensus 142 -------------------------------------------------~~~~~~~i~~a~~~Ld~~~vP~~~R~~Vv~p 172 (423) T protein:vir:35 142 -------------------------------------------------AIKKWADVAQTASFIKDIGIKTGENYAIMDP 172 (423) T ss_pred -------------------------------------------------CcchHHHHHHHHHHHHHhcCCcCCCEEEeCH Confidence 000012222222222222222 123458888 Q ss_pred hHHHHHHHHhhhcCceeccCcccccccccccc-cccccccceEecCCCCcCceEEEeeccc----------eEEEEeecc Q lcl|NC_021309. 381 RDWELLRLTKDANGQYMGGNFFGNAYGNPVNG-GKNIWGVPVVTTPLIPLGTILVGHFAPS----------VIQTARREG 449 (497) Q Consensus 381 ~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~-~~~l~G~Pvv~~~~~~~~~~~~gd~~~~----------~~~i~~r~~ 449 (497) .....|.+ .+.+ +............... ..++.|+.|+.|+.+|..+.. .+... ...+.+... T Consensus 173 ~~~a~Ll~---~~~~-~~~~~~~~~~alr~g~i~G~i~GFdv~~Snnvp~~T~g--t~~~~~~v~~a~~v~~~a~~~~~~ 246 (423) T protein:vir:35 173 WSAQRLAD---AQSG-LHAADQLVRTAWENAQISGNFGGIRALMSNGLASRKQG--DFDGAITVKTAPNVDYLSVKDSYQ 246 (423) T ss_pred HHHHHHhc---cccc-eeccccchhHHHhhccceeeecceEEEEcCCCcccccc--ccccceeecccccccccccccccc Confidence 88777642 1111 1111111111111111 247899999999999964321 11000 001111111 Q ss_pred ----cEEEeecccchhhhcCceEEEEEEeecceeecc------------c--ceEEEEeeCC-CCCC Q lcl|NC_021309. 450 ----VTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRP------------S--AFQLIQLKKG-ATGS 497 (497) Q Consensus 450 ----~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~------------~--a~~~l~~~~~-a~~~ 497 (497) +...+-...+..-..|.++ ..|...++| . =|+++.-..+ +.|. T Consensus 247 ~~~~~~~~~~~~~g~l~~GD~~t-----~aGv~~v~~~t~~~~~~~~t~~~~~~~V~~~~~~~a~g~ 308 (423) T protein:vir:35 247 FTVALTGATPSKTGFLKAGDQLK-----FTSTHWLNQQSKQTLYNGSTAMSFTATVLEETNSTASGD 308 (423) T ss_pred ceeeeeeeeeccCCcEEecceEE-----eeeeeeccccccceeecccCCceeEEEEeccccccccCc Confidence 1111111111111122222 223233222 1 2222211111 1222 No 215 >protein:vir:78148 Length: 123 # NCBI annotation: hypothetical protein # Family: family:all:4955 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294802;genbank:gi:149882823;genbank:GeneID:5309176 Probab=73.99 E-value=0.1 Score=25.97 Aligned_cols=106 Identities=11% Similarity=-0.044 Sum_probs=59.1 Q ss_pred EechhHHHHHHHHh-------hhcCceeccCcccccccccccccccccccceEecCCCCcCceEEEeeccceEEEEe--- Q lcl|NC_021309. 377 VMNPRDWELLRLTK-------DANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTAR--- 446 (497) Q Consensus 377 ~~n~~~~~~l~~lk-------d~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~~~~gd~~~~~~~i~~--- 446 (497) ++....|..+.-.- -.+-++++... -+-+++|+..+.++.+|.++.++.|-.+..- +.| T Consensus 1 vvsdlqfA~~~g~~v~~~aLpRE~aNp~ltG~----------lpV~~~GltWl~tpnlpg~~a~vlDst~lGg-maDE~l 69 (123) T protein:vir:78 1 MLSGAQFAKLIGILVDDKALPREQANIVLTGS----------LPVSAYGLTWVTSRHITGTDPWLFDVEQLGG-MADEKL 69 (123) T ss_pred CcchhhHHHHhcchhcccccccccCCceEecC----------cceeeeceeeeecCCCCCCccceeehhhhcc-cccccc Confidence 11111122221110 11123444321 1225889999999999988876666543221 111 Q ss_pred -------ecccEEEeecccchhhhcCceEEEEEEeecceeecccceEEEEeeCC Q lcl|NC_021309. 447 -------REGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKG 493 (497) Q Consensus 447 -------r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~~ 493 (497) ..+..|++++.-++.=.+|+..+|+..-.--.|..|.|.++|+-.-- T Consensus 70 ~~Pgya~~~~~Gvevkt~Red~~~nD~yriRaRRvTvpiv~EP~Agv~ltg~g~ 123 (123) T protein:vir:78 70 LSPEFAPAGNTGVEASTERAHQGVKDGYLVRGRRNTVAVVTEPMAGVRLTGTGL 123 (123) T ss_pred CCCcccCCCCcceeEEeeccccCCCCceEEeeeecceeEEecCccceEEeeecC Confidence 12334555554443334788999987666666778999999987666 No 216 >protein:vir:107120 Length: 329 # NCBI annotation: conserved phage protein # Family: family:all:701 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950606;genbank:gi:119953686;genbank:GeneID:4643129 Probab=72.22 E-value=0.19 Score=24.60 Aligned_cols=293 Identities=9% Similarity=-0.039 Sum_probs=109.9 Q ss_pred hhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccccccccchh-hhHHHHHHHHhhhhHHh-hcc- Q lcl|NC_021309. 110 KGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPT-FLPGIVEQLFYELSLAD-LIS- 186 (497) Q Consensus 110 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~p~-~~~~ii~~~~~~~~l~~-~~~- 186 (497) ..+.+....+...+... .+....+ .....+ ...+-..+.+.-.+ +...+-+.+...+.-.. +++ T Consensus 1 ~~~~~~~~~~~~~~~~~--~~~~~~~----~~~~~~-------~~~~~~~nt~~l~~k~~~~LD~~~~~~~~s~~~~~N~ 67 (329) T protein:vir:10 1 MDGIFITGVKTMNKEIK--NATGKLK----LNLQHF-------ANKSVEPGDTLLKNKHVGILEKVTAANSYSAPAVISN 67 (329) T ss_pred CCceEEechhhhhhhhh--cccceeE----Eehhhh-------cCCccCCchhHHHHHHHHHHHHHHHhhceeeeeeccc Confidence 00000000000000000 0000000 000000 00000111111112 22223333322221111 122 Q ss_pred -eeecCCCceEEEEEcCCCccceecccccccccc--cccceeeEeeeeeEEeeeh--hhHHHHhhHHHH--HHHHHHHHH Q lcl|NC_021309. 187 -SRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFS--SEEFARVYEQVGKVANALT--ITDEGLRDAPEL--FNFVQGRLL 259 (497) Q Consensus 187 -~~~~~~~~~~~p~~~~~~~~a~wv~Eg~~~~~s--~~~f~~i~~~~~kla~~~~--iS~ell~d~~~l--~~~i~~~la 259 (497) .....++++.||+.+..+.. .+ ..++..... +.++...++...+.-.+.. +..+ +.+..+ ...+..... T Consensus 68 ~~e~~~g~tVkIp~i~~~gl~-DY-~R~~g~~~g~vt~~~~t~tidqdR~~~F~VD~~D~d--Etn~~l~a~~i~~~~~~ 143 (329) T protein:vir:10 68 DAIFMQGRSFTVIKGDVTELK-DY-KRNATNEFDHPQIQETTYFLDQEKYWGRFVDALDRR--DTEGNIDINYVVAKQAS 143 (329) T ss_pred ceeeccCcEEEEeeecccccc-cc-cCCCCccccccccceeEEEeecccceeeecchhhHh--hhhhhhhHHHHHHHHHH Confidence 34456789999998653222 22 212222222 3344445555544444321 1111 111112 222333344 Q ss_pred HHHHHHHHhhhhc---c-cCccccccccccccccccccccchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhh Q lcl|NC_021309. 260 EGIQRKEEVQLLA---G-GGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVT 335 (497) Q Consensus 260 ~~~~~~~d~~~l~---G-~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 335 (497) ..+.-.+|.-.+. + .|+. T Consensus 144 ~~v~pEiDay~~skla~~a~~~---------------------------------------------------------- 165 (329) T protein:vir:10 144 EVVAPYLDNLRFATLARNKAKH---------------------------------------------------------- 165 (329) T ss_pred HHhhhHHHHHHHHHHHhhcccc---------------------------------------------------------- Confidence 4444444433221 0 0000 Q ss_pred hhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHHHHHHHhhhcCceeccCccccccccccccccc Q lcl|NC_021309. 336 GAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKN 415 (497) Q Consensus 336 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~ 415 (497) .....+....++.+.++...+....-..+.+++++|..+..|.+-. +++....... .+.......+ T Consensus 166 ---------~~~~~t~~nay~~i~~a~~~Lde~~vp~~Rvl~VtP~~~~~Lk~~~----~f~~~~~~~~-~~~~~g~Vg~ 231 (329) T protein:vir:10 166 ---------LTVGSGADAQYDAVLDVSVELDEIGAGASRILFVTPKFYKGIKKFV----IELPQGDNRQ-QVLGKGVQGE 231 (329) T ss_pred ---------cccccCHHHHHHHHHHHHHHHHhcCCCCCcEEEeCHHHHHHHHhhh----hhhccccccc-cceeeeeeee Confidence 0000112233455556665665543223345688888877775421 2222211111 1222333457 Q ss_pred ccccceEecCC--CCcCceEEEeeccceEEEEeecccEEEeecccchhhhcCceEEEEEEeecceeecccceEE-EEeeC Q lcl|NC_021309. 416 IWGVPVVTTPL--IPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQL-IQLKK 492 (497) Q Consensus 416 l~G~Pvv~~~~--~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~a~~~-l~~~~ 492 (497) |.|+||+.++. +..-.+++|.-+. ..... +--.++...-....| -..++....+|..|.+|++... ...++ T Consensus 232 idG~~Ii~vps~~~k~in~ii~~~~A--~~~~~-K~~~~~~~~p~~~~~---a~~v~gr~yyd~~V~~~k~~~I~~~~~~ 305 (329) T protein:vir:10 232 LDGFTIVKVPSKMLQGVEAMAVIGEV--MASPI-QANEAKLNSNVPGMF---GTLAEQMLYTGAFVPEHLQKYIFTIGGK 305 (329) T ss_pred ecCeEEEEecCCcccceeEEEEcCCc--eeeee-eeeeeeeeCCCCccc---hheeeeeeeeeeEEEccccCEEEEeccc Confidence 99999987543 3333456655442 21111 111233222111112 3578888999999999985443 34444 Q ss_pred CCCCC Q lcl|NC_021309. 493 GATGS 497 (497) Q Consensus 493 ~a~~~ 497 (497) +.+.+ T Consensus 306 a~~~~ 310 (329) T protein:vir:10 306 EVETN 310 (329) T ss_pred CcccC Confidence 44444 No 217 >protein:vir:105374 Length: 423 # NCBI annotation: gene 5 protein # Family: family:all:1412 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958181;genbank:gi:41057283;genbank:GeneID:2716621 Probab=71.83 E-value=0.19 Score=24.53 Aligned_cols=275 Identities=10% Similarity=-0.076 Sum_probs=105.1 Q ss_pred hccccccccccccchhhhHHHHHHHHhhhhHHhhcceee-----c--CCCceEEEEEcCCCccceecccccccccccccc Q lcl|NC_021309. 151 NPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRP-----V--TSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEF 223 (497) Q Consensus 151 ~~~~~~~~~g~~v~p~~~~~ii~~~~~~~~l~~~~~~~~-----~--~~~~~~~p~~~~~~~~a~wv~Eg~~~~~s~~~f 223 (497) |. .+-.+ .+|..+....++.+++.+++.+++..-. . .++++++++.......-+-...+..+...+.+- T Consensus 1 Ma-N~llT---~~p~iia~~aL~~l~~~lV~~~lVnr~y~~ef~~~k~GDTV~I~~p~~~~~~d~~~~~~~~~~~~dl~e 76 (423) T protein:vir:10 1 MP-NNLDS---NVSQIVLKKFLPGFMSDLVLAKTVDRQLLAGEINSSTGDSVSFKRPHQFSSLRTPTGDISGQNKNNLIS 76 (423) T ss_pred Cc-cchhh---hhHHHHHHHHHHHHHhhcccchhhcccCCCcccccccCCEEEEeeCCceeeeccCCccccccccCcccc Confidence 11 11010 1344466789999999998888776521 1 356777776432211111111222222233333 Q ss_pred e--eeEeeeeeEEeeehhhHHHHhhHHHHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccchhhhh Q lcl|NC_021309. 224 A--RVYEQVGKVANALTITDEGLRDAPELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGAT 301 (497) Q Consensus 224 ~--~i~~~~~kla~~~~iS~ell~d~~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~ 301 (497) + .+++.-+|...+..=+.|+..+-.+++.+++.. .++++..+|..++.-- .+.+......++ +. T Consensus 77 ~~v~l~id~~k~va~~v~d~E~~~~i~~~~~~l~~A-~~aLA~~vd~~ia~~~-~~~~~~~~gt~~--t~---------- 142 (423) T protein:vir:10 77 GKATGRVGNYITVAVEYQQLEEAIKLNQLEEILAPV-RQRIVTDLETELAHFM-MNNGALSLGSPN--TP---------- 142 (423) T ss_pred ceeEEEeeceeeeeeeechHHHhcChhhHHHHHHHH-HHHHHHHHHHHHHHHH-hhccccccccCC--cc---------- Confidence 3 355566665555444556655656677766655 5889999999876320 000000000000 00 Q ss_pred hhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhcc-CCceEEech Q lcl|NC_021309. 302 SATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQ-TPNAVVMNP 380 (497) Q Consensus 302 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~n~ 380 (497) + ..++++.++...+....-- .....+++| T Consensus 143 -----------------------------------------------~---~a~~~i~~a~~~Ld~~~vP~~~R~~Vv~p 172 (423) T protein:vir:10 143 -----------------------------------------------I---TKWSDVAQTASFLKDLGVNEGENYAVMDP 172 (423) T ss_pred -----------------------------------------------c---chHHHHHHHHHHHHhccCCcCCCEEEeCh Confidence 0 0011111111111111111 123568888 Q ss_pred hHHHHHHHHhhhcCceeccCccccccccccccc-ccccccceEecCCCCcCceEEEeec---cceEEE-----EeecccE Q lcl|NC_021309. 381 RDWELLRLTKDANGQYMGGNFFGNAYGNPVNGG-KNIWGVPVVTTPLIPLGTILVGHFA---PSVIQT-----ARREGVT 451 (497) Q Consensus 381 ~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~-~~l~G~Pvv~~~~~~~~~~~~gd~~---~~~~~i-----~~r~~~~ 451 (497) .....|.+-. +.+................ .++.|+.|+.++.+|..+....-.+ .....+ .+..+.+ T Consensus 173 ~~~a~Ll~~~----~~~~~~~~~~~~alr~g~i~G~i~GFdv~~Snnip~~T~gt~~~t~~~~~~~~v~~~a~~~a~~~~ 248 (423) T protein:vir:10 173 WSAQRLADAQ----TGLHASDQLVRTAWENAQIPTNFGGIRALMSNGLASRTQGAFGGTLTVKTQPTVTYNAVKDSYQFT 248 (423) T ss_pred HHHHHHhccc----cceecccccchhhhhhccceeeecceEEEEeCCCccccccccccceeeeecceeccccccccceee Confidence 8777665321 1111111111111111112 4789999999999996432110000 000000 0111111 Q ss_pred EEee----cccchhhhcCceEEEE---EEeecceee------cccceEEEEeeCCCC----------------------- Q lcl|NC_021309. 452 MQMT----NSNGTDFVDGKVTVRA---EERLGLLVY------RPSAFQLIQLKKGAT----------------------- 495 (497) Q Consensus 452 i~~~----~~~~~~f~~~~v~~r~---~~r~~~~v~------~~~a~~~l~~~~~a~----------------------- 495 (497) +.+. +..+..-.-|.++|-+ .-+....|+ ++.-|+++.-..+.. T Consensus 249 ~~~~~~~~~~~~~l~~GD~~t~aGv~~v~~~tk~~~~~~~t~~~~~~~v~a~~~~~~~g~~tv~i~p~~i~~~~~~~~~~ 328 (423) T protein:vir:10 249 VTLTGATASVTGFLKAGDQVKFTNTYWLQQQTKQALYNGATPISFTATVTADANSDSGGDVTVTLSGVPIYDTTNPQYNS 328 (423) T ss_pred eeeeeccccccCceeecceEEecceeeecccccccccccccCcceEEEEEeeeeeccCCceeeeccCccccccCCccccc Confidence 1110 0001000011111111 111111111 111122111111101 Q ss_pred --CC Q lcl|NC_021309. 496 --GS 497 (497) Q Consensus 496 --~~ 497 (497) +| T Consensus 329 v~a~ 332 (423) T protein:vir:10 329 VSRQ 332 (423) T ss_pred cccc Confidence 00 No 218 >protein:vir:174 Length: 423 # NCBI annotation: capsid protein # Family: family:all:1412 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112079;genbank:gi:13559869;genbank:GeneID:920999 Probab=71.51 E-value=0.19 Score=24.48 Aligned_cols=274 Identities=11% Similarity=-0.054 Sum_probs=106.3 Q ss_pred hccccccccccccchhhhHHHHHHHHhhhhHHhhcceee-----c--CCCceEEEEEcCCCccceecccccccccccccc Q lcl|NC_021309. 151 NPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRP-----V--TSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEF 223 (497) Q Consensus 151 ~~~~~~~~~g~~v~p~~~~~ii~~~~~~~~l~~~~~~~~-----~--~~~~~~~p~~~~~~~~a~wv~Eg~~~~~s~~~f 223 (497) |.. +-. ..+|..+....++.+++.+++.+++..-. . .+++++|++.......-+-...+......+.+- T Consensus 1 MaN-~ll---T~ip~iia~~al~~l~~~lV~~~lVnr~y~~e~~~~k~GDTV~I~~p~~~~~~~~~~~~~~~~~~~~l~e 76 (423) T protein:vir:17 1 MPN-NLD---SNVSQIVLKKFLPGFMSDLVLAKTVDRQLLAGEINSSTGDSVSFKRPHQFSSLRTPTGDISGQNKNNLIS 76 (423) T ss_pred Ccc-chh---hhhHHHHHHHHHHHHHhhcccchhhcccCCcchhhcccCCEEEEeeCCcceeecccCcccCCcccCcccc Confidence 211 101 12344466788999999998888776532 1 355777776332111111011111122223322 Q ss_pred e--eeEeeeeeEEeeehhhHHHHhhHHHHHHHHHHHHHHHHHHHHHhhhhccc-Cccccccccccccccccccccchhhh Q lcl|NC_021309. 224 A--RVYEQVGKVANALTITDEGLRDAPELFNFVQGRLLEGIQRKEEVQLLAGG-GYPGVNGLLQRSTGFTASSASSLFGA 300 (497) Q Consensus 224 ~--~i~~~~~kla~~~~iS~ell~d~~~l~~~i~~~la~~~~~~~d~~~l~G~-G~~~p~Gi~~~~~~~~~~~~~~~~~~ 300 (497) + .+++.-+|...+.-=+.|+..+-.+++.+++.. .++++..+|..++.-- +.. +..+ ..++ +. T Consensus 77 ~~v~l~id~~k~va~~v~d~E~~~~i~~~~~~l~~A-~~aLA~~vd~~ia~~~~~~a-~~~~-gt~~--t~--------- 142 (423) T protein:vir:17 77 GKATGRVGNYITVAVEYQQLEEAIKLNQLEEILAPV-RQRIVTDLETELAHFMMNNG-ALSL-GSPN--TP--------- 142 (423) T ss_pred ceeEEEeeceeeeeeeecHHHHhcChhHHHHHHHHH-HHHHHHHHHHHHHHHHhhcc-cccc-ccCC--cc--------- Confidence 2 456666666655544566655666677766665 5889999998876321 100 0000 0000 00 Q ss_pred hhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhcc-CCceEEec Q lcl|NC_021309. 301 TSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQ-TPNAVVMN 379 (497) Q Consensus 301 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~n 379 (497) + ..++++.++...+....-- .....+++ T Consensus 143 ------------------------------------------------~---~a~~~i~~a~~~Ld~~~vP~~~R~~Vv~ 171 (423) T protein:vir:17 143 ------------------------------------------------I---TKWSDVAQTASFLKDLGVNEGENYAVMD 171 (423) T ss_pred ------------------------------------------------c---ccHHHHHHHHHHHHhccCCcCCCEEEeC Confidence 0 0011222222222111111 22356888 Q ss_pred hhHHHHHHHHhhhcCceeccCccccccccccccc-ccccccceEecCCCCcCceE-EEee-----c-cc-eEEEEee--- Q lcl|NC_021309. 380 PRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGG-KNIWGVPVVTTPLIPLGTIL-VGHF-----A-PS-VIQTARR--- 447 (497) Q Consensus 380 ~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~-~~l~G~Pvv~~~~~~~~~~~-~gd~-----~-~~-~~~i~~r--- 447 (497) |.....|.+-. . .++............... .++.|+.|+.|+.+|..+.. ++.. . .. .....+. T Consensus 172 p~~~a~Ll~~~---~-~~~~~~~~~~~alr~g~i~G~i~GFdvy~Snnip~~T~gt~~~t~~~~~~~~v~~~a~~~~~~~ 247 (423) T protein:vir:17 172 PWSAQRLADAQ---T-GLHASDQLVRTAWENAQIPTNFGGIRALMSNGLASRTQGAFGGTLTVKTQPTVTYNAVKDSYQF 247 (423) T ss_pred hHHHHHHhccc---c-ceecccccchHHHhhccceeeecceEEEEeCCCccccccceeceeeecccccccccccccccce Confidence 88777765321 1 111111111111111112 47899999999999964421 1100 0 00 0000000 Q ss_pred -cccEEEeecccchhhhcCceEEEE---EEeecceee------cccceEEEE-eeCCCCCC Q lcl|NC_021309. 448 -EGVTMQMTNSNGTDFVDGKVTVRA---EERLGLLVY------RPSAFQLIQ-LKKGATGS 497 (497) Q Consensus 448 -~~~~i~~~~~~~~~f~~~~v~~r~---~~r~~~~v~------~~~a~~~l~-~~~~a~~~ 497 (497) .++...+....+..-.-|.++|-+ ..+....|+ ++.-|++.. ..+.+.|. T Consensus 248 ~~~~~~~~~~~~g~l~~GD~~t~aGv~~v~~~tk~v~~~~~t~~~~~~~v~~~~~~~a~~~ 308 (423) T protein:vir:17 248 TVTLTGATTSVTGFLKAGDQVKFTNTYWLQQQTKQALYNGATPISFTATVTADANSDSSGD 308 (423) T ss_pred eeeeeeeeeeccCceeecceEEecceeeecccccccccccccccceEEEEEecccccccCc Confidence 001111111111110112222111 111111111 111222111 11111111 No 219 >protein:vir:96666 Length: 462 # NCBI annotation: ORF016 # Family: family:all:2450 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238545;genbank:gi:66391271;genbank:GeneID:5130448 Probab=57.99 E-value=0.42 Score=22.64 Aligned_cols=317 Identities=12% Similarity=0.059 Sum_probs=127.8 Q ss_pred hhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccccccccchhhhHHHHHHHH---hhhhHH Q lcl|NC_021309. 106 TSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLF---YELSLA 182 (497) Q Consensus 106 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~p~~~~~ii~~~~---~~~~l~ 182 (497) ..+++.-... .. .........+..... ..-.....+-.+++.+--|.+..-|..+. ....++ T Consensus 1 ~~~~~~~~~~--~~---------~~~~~~~e~~~KS~~----tg~g~~p~~q~~~gAlR~esL~~~i~~Lt~~~~~~~~~ 65 (462) T protein:vir:96 1 MHKDTNLTAE--QN---------KYADKFQEEVMKSYQ----TGYGITPDTQVDAGALRREILDDQITMLTWTQDDLIFY 65 (462) T ss_pred Cccccccchh--hh---------hhhchhhHHHHHHHh----cCCCcCCccccccchhhhhhhhhhhheeeecccchhhh Confidence 0000000000 00 000000000000000 00000111223345555554433333332 223455 Q ss_pred hhcceeecCCCceEEEEEc--CCCccceecccccccccccccceeeEeeeeeEEeeehhhHHH-HhhH-HHHHHHHHHHH Q lcl|NC_021309. 183 DLISSRPVTSPNLSYLTES--AAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTITDEG-LRDA-PELFNFVQGRL 258 (497) Q Consensus 183 ~~~~~~~~~~~~~~~p~~~--~~~~~a~wv~Eg~~~~~s~~~f~~i~~~~~kla~~~~iS~el-l~d~-~~l~~~i~~~l 258 (497) .-+...++.+-.-+|-.+. +..+.+.++.|++..+.+++++.+.+..+|=++.-..+|-.+ |+.+ .+......++- T Consensus 66 ~~i~k~~a~sTv~~y~~~~~~G~~g~~~f~~E~g~~~~~d~~~~R~~~~~k~l~~t~~vsi~~tl~n~~~d~~~~~~~da 145 (462) T protein:vir:96 66 REISRRPAQSTVQKYDVYLRHGNVGHSRFVREVGVAPVSDPNIRQKTVEMKYVSDTKNLSIASTLVNNIQDPMQILTEDA 145 (462) T ss_pred hhcCCchhhhhhhhheeeeccCccccccccccccccccCCCceEEEEEEEEEEeeeeeechhhhhccchhhHHHHHHHHH Confidence 5566667666544444443 333457899999999999999999999999999987888765 4444 46778888888 Q ss_pred HHHHHHHHHhhhhcccCccccccc---cccccccccccccchhhhhhhHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhh Q lcl|NC_021309. 259 LEGIQRKEEVQLLAGGGYPGVNGL---LQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVT 335 (497) Q Consensus 259 a~~~~~~~d~~~l~G~G~~~p~Gi---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 335 (497) .-.++..++.+.++|+-.=.|.+. +.-.+.. ..+...+.++ T Consensus 146 i~~~a~tiE~a~Fygds~l~~~~~~~gleFDGl~---------------------------~lI~~~NViD--------- 189 (462) T protein:vir:96 146 IAVVAKTIEWASFYGDASLTADPTGQGLEFDGLA---------------------------KLIDKDNVID--------- 189 (462) T ss_pred HHHHHHHHHHHHhhhhcccCCCccccccchhhhh---------------------------hhcCCCceee--------- Confidence 888999999999999863222111 1111100 0000000000 Q ss_pred hhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHHHHHHHhhhcCceeccCcccc-cccccc---- Q lcl|NC_021309. 336 GAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGN-AYGNPV---- 410 (497) Q Consensus 336 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~i~~~~~~~-~~~~~~---- 410 (497) ..|..+ ...++.-........+.+++-++|...+.+.+..-.-.--|.+.+++.+. ..|... T Consensus 190 ----arG~~L---------s~~~ln~aa~~i~~~fGt~TD~~~p~~v~a~f~~~~l~~qrv~~~~n~g~~~~G~~v~~f~ 256 (462) T protein:vir:96 190 ----AKGESL---------TETLLNRSAVLIGKSFGTATDAYMPIGVHADFVNSVLGRQMQLMQDNSGNVNAGYNVQGFY 256 (462) T ss_pred ----cCCCCc---------cHHHHhhhhhhcccccCChhheecchHHHHHHHHhhcCceEEEEcCCCCceeeeeecccee Confidence 000000 01111111122233444455566666666666533322223333322221 111111 Q ss_pred -------cccccccccceEecCCCCcCceEEEeeccceEEEEeecccEEEeecccchhhh----cCceEEEEEEeeccee Q lcl|NC_021309. 411 -------NGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFV----DGKVTVRAEERLGLLV 479 (497) Q Consensus 411 -------~~~~~l~G~Pvv~~~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~----~~~v~~r~~~r~~~~v 479 (497) ..++++++.|-+......... ....-..++..+..-....|. .....|++...-...= T Consensus 257 s~~G~I~L~~s~~m~~~~i~~~~~~~~p-----------~ap~~~~vsaTv~t~~~g~f~~~~d~~~y~Y~V~avs~dge 325 (462) T protein:vir:96 257 SSRGFIKLHGSTVMENELILDESLQPLP-----------NAPQPATVKATVETGKKGLFTDEHDRAELTYKVVVNSDDAQ 325 (462) T ss_pred eeeeeeeeCCceecCcccccccccccCC-----------CCCCCCceeEEEEeCCCCCCCCccCceeEEEEEEEECCCCc Confidence 111222222322211111000 000000000110000001121 1123333333333222 Q ss_pred ecccceEEEEeeCCCCCC Q lcl|NC_021309. 480 YRPSAFQLIQLKKGATGS 497 (497) Q Consensus 480 ~~~~a~~~l~~~~~a~~~ 497 (497) --|..++-.+..++..|. T Consensus 326 S~PS~~VtaTva~~~~gv 343 (462) T protein:vir:96 326 SAPSEAVTATVNNATDGV 343 (462) T ss_pred cccceeeEeeeecccccc Confidence 235555555555544444 No 220 >protein:vir:102823 Length: 470 # NCBI annotation: major structural protein # Family: family:all:2450 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874086;genbank:gi:118197693;genbank:GeneID:4496015 Probab=45.94 E-value=0.75 Score=21.26 Aligned_cols=292 Identities=14% Similarity=0.100 Sum_probs=106.4 Q ss_pred hhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccccccccchhhhHHHHHHH---HhhhhHHhh Q lcl|NC_021309. 108 FEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQL---FYELSLADL 184 (497) Q Consensus 108 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~p~~~~~ii~~~---~~~~~l~~~ 184 (497) +.. +..... +.. . ..+....+..|+.+--+.+..-+..+ .....++.- T Consensus 1 ~~~----------------------~~~~~~-~~a-~-----~~al~~a~~~g~AlR~EsLd~~l~~lt~~~~~ftf~~~ 51 (470) T protein:vir:10 1 MPY----------------------EHLKHL-DEA-T-----LKALNAAGQVAESLEREDLEPEVTQLNVLDTPLTDLLS 51 (470) T ss_pred CCh----------------------hHhhhh-hHH-H-----HHHHHHhhhcchhhhhhhhccceeEeeecCccchhhhh Confidence 000 000000 000 0 00000112222233322222211111 122244444 Q ss_pred cceeecCCCceEEEEEcCCCc--cceecccccccccccccceeeEeeeeeEEeeehhhHHHH---hhH-HHHHHHHHHHH Q lcl|NC_021309. 185 ISSRPVTSPNLSYLTESAAHN--NAAAVAEAGTYPFSSEEFARVYEQVGKVANALTITDEGL---RDA-PELFNFVQGRL 258 (497) Q Consensus 185 ~~~~~~~~~~~~~p~~~~~~~--~a~wv~Eg~~~~~s~~~f~~i~~~~~kla~~~~iS~ell---~d~-~~l~~~i~~~l 258 (497) +...++.+-.-+|-.+.+..+ .-..+.|++..+.+++++.+.+..+|=++.-..||.-.+ +.+ .+++..+.++- T Consensus 52 i~k~~a~STV~ey~~~~~rhG~~g~s~~~E~~l~~~~d~~~~Rr~v~~K~l~~~~~VT~~a~~~~~n~v~d~~~~~~~da 131 (470) T protein:vir:10 52 KNAVKAKAYEHEYNVVTARHDKIGYAAFREGGLPRTVEVNVVRRRIRPMLVGHRITVTELATRTTQNGVMQIDELVKREK 131 (470) T ss_pred cCCchhhhHhhhhhhhccccccccceeecccccCccCCCceEEEEEEEEEEeecchhhhhhhhhhhccccchHHHHHHHH Confidence 555666554444543333212 223468999999999999999999999999889997753 334 37888887777 Q ss_pred HHHHHHHHHhhhhcccC---cc--------ccccccccccccccccccchhhhhhhHHHHHHhhhhhcchhhhhhhhhhh Q lcl|NC_021309. 259 LEGIQRKEEVQLLAGGG---YP--------GVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVAS 327 (497) Q Consensus 259 a~~~~~~~d~~~l~G~G---~~--------~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 327 (497) --.++..++.+.++|+- +. ++.||.+.-...... ....+.+.... T Consensus 132 i~~ia~tiE~a~FyGDs~l~s~~~g~~~gleFDGl~~lId~~~~~---------------NViDarG~~Ls--------- 187 (470) T protein:vir:10 132 MIAVANEFEYLAFYGDNLLGDDVPGSPNNLQQDGIINIIKRGAPQ---------------NVLDAGGRPLS--------- 187 (470) T ss_pred HHHHHHHHHhhhhhhccccccccCcccCceeccchhhhccCCCCc---------------cccccCCCCcc--------- Confidence 88899999999999964 11 233332211000000 00000000000 Q ss_pred hhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhh-hhhccCCceEEechhHHHHHHHHhhhcCceeccCccc-cc Q lcl|NC_021309. 328 LKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQ-LTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFG-NA 405 (497) Q Consensus 328 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~i~~~~~~-~~ 405 (497) .+.+..+...+. ...+.+++-++|...+.+.+..-....-|.+.+++.. .. T Consensus 188 ---------------------------~~~L~~aa~~I~~~~~fGt~TD~~lp~~vka~f~~~~~~~qRv~~~~N~~~~~ 240 (470) T protein:vir:10 188 ---------------------------IDLLWEAESRVVSTQAFANPTAVFISYVDKLNLQASFYQISRVMTTADRRAGL 240 (470) T ss_pred ---------------------------HHHHHHHHhhhcccccccChhhhccchhHHHHHHHhhcCceEEEEecCCCcee Confidence 011111111111 1223333444555555555544444444444332221 11 Q ss_pred ccccccc-----------cccccc-----cceEecCCCCcCceEEEeeccceEEEEeecccEEEeecccc---------- Q lcl|NC_021309. 406 YGNPVNG-----------GKNIWG-----VPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNG---------- 459 (497) Q Consensus 406 ~~~~~~~-----------~~~l~G-----~Pvv~~~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~---------- 459 (497) .|.+... +.++++ .|-+.....+ ++. -.++++.++.... T Consensus 241 ~G~~v~~f~sa~G~I~L~~s~~m~~~~k~~p~~l~~~v~-------~~a--------AP~~~~tv~~t~~~~a~~~~sk~ 305 (470) T protein:vir:10 241 LGADAQSYIGVRGEHSLYPSQFLGDFHKFNPARFGAEVG-------DFA--------APSNSWTVSTTDNFVTLPYNSGL 305 (470) T ss_pred eeeeccceeeeeeeeeecccccccchhhcCcccCCcccC-------Ccc--------cCceeEEeecCCCceeecccCCC Confidence 1111100 001111 0111111000 000 0001111100000 Q ss_pred hhhh-cC--ceEEEEEEeecceeecccceEEEEee--------------------------CCCCCC Q lcl|NC_021309. 460 TDFV-DG--KVTVRAEERLGLLVYRPSAFQLIQLK--------------------------KGATGS 497 (497) Q Consensus 460 ~~f~-~~--~v~~r~~~r~~~~v~~~~a~~~l~~~--------------------------~~a~~~ 497 (497) ..|. ++ ...|.+-.+.|=. .|.++ .++.. .+.+|+ T Consensus 306 g~~~~~~v~sy~y~v~~~~gds--~s~~v-~vt~t~~~v~kgv~ltI~~~~~v~yv~IYRk~~~s~~ 369 (470) T protein:vir:10 306 GDPANTTVYSYAFKAANFYGES--AAKYI-DVYIDSTEAGKGVRFQFHGLVNVKWLDVYRKDPGSQE 369 (470) T ss_pred CcccCcceeEEEEEEEEecCCC--CcceE-EEEEeeehhcceeEEEEecCCCCcEEEEEeecCCCCc Confidence 0011 00 1122222222111 12111 11111 111122 No 221 >protein:vir:8846 Length: 705 # NCBI annotation: putative portal protein # Family: family:all:1548 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775254;genbank:gi:27476052;genbank:GeneID:2700600 Probab=36.88 E-value=1.1 Score=20.25 Aligned_cols=141 Identities=13% Similarity=0.104 Sum_probs=10.1 Q ss_pred CchHH------HHHHHHHHHHHHHHHHHH-H---HHHHHHHHHHHHHH-----------HHHHHHHHHHhHhHHHHHHHH Q lcl|NC_021309. 1 MPSTA------QLEAQGRQLAKSIKDINA-D---ETKTAAEKKEALAK-----------IEPDFKAHQAEVEAHERAQEM 59 (497) Q Consensus 1 ~~~~a------~~~~~~~~~~~~~~~~~~-~---~~~~~~e~~~~~~~-----------~~~~~~~~~~~~~~~e~~~e~ 59 (497) +|... ++....+++.+....... . ....+.+++..... ...+....+...+......++ T Consensus 544 ~~~~~~~~~~~~~~~~~~el~e~~~~k~~~~~~~~~~~~e~~~~~~~~~q~e~~~~~~~~~~q~e~~k~q~e~~~~q~e~ 623 (705) T protein:vir:88 544 GGGLGVLVSEQNLYNILKEVTENAGYKDPDRFWTNPNSPEALQAKAIREQKEAQPKPEDIKAQADAQRAQSDALAKQAEA 623 (705) T ss_pred ccchhhhcChHHHHHHHHHHHHhhhhhhHHHHhhhhhhHHHHHHHHhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11111 111111111111000000 0 00000011000000 000000000000000000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHhHhhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHH Q lcl|NC_021309. 60 LKSLGGADAAKDGLDNDIPEVEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFA 139 (497) Q Consensus 60 ~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 139 (497) ..+..+++.....+ +....+.. ..+.+........ .... .............. ...... ............ T Consensus 624 q~~q~E~q~~q~e~--e~~~~~~~-~~~~e~~~~~a~~---~~~~-~~~e~e~~~~e~e~-~~e~~q-~~~~~~~~~~~~ 694 (705) T protein:vir:88 624 QMKQVEAQIRLAEI--ELKKQEAV-LQQREMALKEAEL---QLER-DRFTWERARNEAEY-HLEATQ-ARAAYIGDGKVP 694 (705) T ss_pred HHHHHHHHHHHHHH--HHHHHHHH-HHHHHHHHHHHHH---HHHH-HHHHHHHHHHHHHH-HHHHHH-HHHHHHHHHhHH Confidence 00000000000000 00000000 0000000000000 0000 00000000000000 000000 000000000000 Q ss_pred hhhhhhhhhhh Q lcl|NC_021309. 140 DGETAPAAIGQ 150 (497) Q Consensus 140 ~~~~~~~~~~~ 150 (497) .........+. T Consensus 695 ~~~k~~~~~rr 705 (705) T protein:vir:88 695 ETKKPTKAVRR 705 (705) T ss_pred HHHHHHHHhcC Confidence 00000011111 No 222 >protein:vir:4339 Length: 395 # NCBI annotation: major head protein # Family: family:all:585 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061502;genbank:gi:9635591;genbank:GeneID:1262860 Probab=30.96 E-value=1.5 Score=19.56 Aligned_cols=371 Identities=12% Similarity=0.047 Sum_probs=93.6 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021309. 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) Q Consensus 1 ~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~a~~~~~~~~~~~~ 80 (497) |....+.-+++.+..+++.+...+....+.+..+......++.++.. +++..+++++++++++.+...... T Consensus 1 m~~~~k~l~el~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~e~~~~~---------~~~~~~~~~~~~~~~~~~~~~~~~ 71 (395) T protein:vir:43 1 MSDFEKQIGELNASLKQVGDQIKSQAEQVNTQIANFGEMNKETRAKV---------DELLTAQGELQARLSAAEQAMLAN 71 (395) T ss_pred ChhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhHHHHHHH---------HHHHHHHHHHHHHHHHHHHHHHhh Confidence 88766554444444334433332332222222222222222222111 111222222222222222222111 Q ss_pred HHHHHH-HHHHHHHhhhhhhHHHHhHhhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhcccccccc Q lcl|NC_021309. 81 EVRNLK-QIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTF 159 (497) Q Consensus 81 ~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (497) +..... ...+............+............ ... +.... .........- T Consensus 72 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------------~~~-~~~~~----------~~~~~~g~~v 125 (395) T protein:vir:43 72 EKRDGGEEAPKTAGQMVAESLKEQGVTSSLRGSHRV---------------SMP-RSAIT----------SIDGSGGALV 125 (395) T ss_pred hccccccchhhhHHHHHHHHHHHHHHHHHhhhhhhh---------------hhh-hhhhc----------ccCCCCcccc Confidence 111100 01111111111111111100000000000 000 00000 0000000000 Q ss_pred ccccchhhhH-----HHHHHHHhhhhHHhhcceeec---CCCceEEEEEcCCCccceecccccccccccccceeeEeeee Q lcl|NC_021309. 160 APGILPTFLP-----GIVEQLFYELSLADLISSRPV---TSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVG 231 (497) Q Consensus 160 g~~v~p~~~~-----~ii~~~~~~~~l~~~~~~~~~---~~~~~~~p~~~~~~~~a~wv~Eg~~~~~s~~~f~~i~~~~~ 231 (497) -..+.++++. ..+..+....++-...-.++. ..+...|.-+. .-..|.. ......++.-..+... T Consensus 126 p~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~a~~v~E~------~~~~~~~-~~~~~i~~~~~k~~~~ 198 (395) T protein:vir:43 126 APDRRPGVVAAPQRRLTIRDLVAPGTTESNSVEYVRETGFVNNAAPVSEG------TQKPYSD-LTFELENAPVRTIAHL 198 (395) T ss_pred chhhHHHHHHHHHhhhhHHhhccceecCCCceEEEEEecCCCceeeecCC------ccccccc-cceeEEEEeeeeEEEe Confidence 0011111111 112222111122111111121 12223332221 1122322 2233344444444333 Q ss_pred eEEeeehhhHHHHhhHHHHHHHHHHHHHHHHHHHHHh---------hhhcccCccccccccccccccccccccchhhhhh Q lcl|NC_021309. 232 KVANALTITDEGLRDAPELFNFVQGRLLEGIQRKEEV---------QLLAGGGYPGVNGLLQRSTGFTASSASSLFGATS 302 (497) Q Consensus 232 kla~~~~iS~ell~d~~~l~~~i~~~la~~~~~~~d~---------~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~ 302 (497) -.-..--+. ..-.-...+...|...++..+...+=. .|+++.+...+. .. ............. T Consensus 199 ~~is~ell~-d~~~l~~~v~~~la~a~~~~~d~~~l~G~g~~~~~~Gi~~~~~~~~~~------~~-~~~~~~~~~~~i~ 270 (395) T protein:vir:43 199 FKASRQILD-DASALQSYIDARARYGLMLVEECQLLYGNGTGANLHGIIPQAQAYAPP------SG-VVVTAEQRIDRIR 270 (395) T ss_pred ehhhHHHHH-hHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccc------cc-cccccchhHHHHH Confidence 222211121 111112345566666666666655542 355544432211 11 1111111111221 Q ss_pred hHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhH Q lcl|NC_021309. 303 ATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRD 382 (497) Q Consensus 303 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~ 382 (497) ...............++++......+...+...+.+.+........+ .+..+ .++.++. T Consensus 271 ~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~--------~l~G~------------pVv~~~~- 329 (395) T protein:vir:43 271 LAILQAQLAEFPASGIVLNPIDWALIELNKDAENRYIIGSPQNGTTP--------TLWRL------------PVVETQA- 329 (395) T ss_pred HHHHhhccccCCCcEEEEcHHHHHHHHHhhccCCceeccccccCCCc--------eecce------------eeEEcCC- Confidence 22222222222233455555555555544433332222111000000 00000 0111110 Q ss_pred HHHHHHHhhhcCceeccCcccccccccccccccccccceEecCCCC----cCceEE-EeeccceEEEEeeccc-EEEeec Q lcl|NC_021309. 383 WELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIP----LGTILV-GHFAPSVIQTARREGV-TMQMTN 456 (497) Q Consensus 383 ~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~----~~~~~~-gd~~~~~~~i~~r~~~-~i~~~~ 456 (497) + ..|..++... ..... ...-.|+.|-+++... .+.+.+ +.. +....+.+-..+ .+.+.. T Consensus 330 ------~--~~~~~~~gd~-~~~~~-----~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~-r~d~~v~~~~a~~~~~~ta 394 (395) T protein:vir:43 330 ------I--TQDEFLTGAF-SLGAQ-----IFDRMDIEVLVSTENDKDFENNMVTIRAEE-RLAFAVYRPEAFVTGSLTA 394 (395) T ss_pred ------C--CCCcEEEEec-cceEE-----EEEecceEEEEeccccchhhcCcEEEEEEE-eeccEEecccceEEEEecc Confidence 0 1122222211 00000 0011255555544321 121111 100 001111111111 122222 Q ss_pred c Q lcl|NC_021309. 457 S 457 (497) Q Consensus 457 ~ 457 (497) . T Consensus 395 a 395 (395) T protein:vir:43 395 S 395 (395) T ss_pred C Confidence 1 No 223 >protein:vir:97397 Length: 517 # NCBI annotation: major capsid protein # Family: family:all:11745 # MgeID: mge:1675 # MgeName: Q54 # Cross-refs: genbank:acc:YP_762590;genbank:gi:115304291;genbank:GeneID:5130600 Probab=30.16 E-value=1.6 Score=19.47 Aligned_cols=384 Identities=10% Similarity=0.037 Sum_probs=88.0 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHhHhHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021309. 1 MPSTAQLEAQGRQLAKSIKDINADE--TKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIP 78 (497) Q Consensus 1 ~~~~a~~~~~~~~~~~~~~~~~~~~--~~~~~e~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~a~~~~~~~~~~ 78 (497) +-.-++...-......+..+..++. .++..+..+...++..+++...+++.. ......+.+.+++...+.... T Consensus 124 a~~~a~I~~vke~~~~e~~~~~~~~a~~ee~~e~~~k~~el~a~l~~~~~~~~~-----~~~e~~~~l~a~~~~~~~~~~ 198 (517) T protein:vir:97 124 SNKNAVVTYFREEKKKEENKMTFDQNLMQELLDAKKLAADLNAKLKERENGGDN-----AALKTVSELAANLMKQRESEK 198 (517) T ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHHHHH-----HHHhhhhhhhhhHHHHHHhhh Confidence 2212221111111111111111110 011111111112222222211111110 011112222222221111100 Q ss_pred HHHHHHHHHHHHHHHhhhhhhHHHHhHhhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccc Q lcl|NC_021309. 79 EVEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGT 158 (497) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (497) ........ ............................ .......................... .. .... T Consensus 199 ~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~p~~~~~~i~-----~~---~~~~- 266 (517) T protein:vir:97 199 ILGVEALK--VTPEATEFLKTREAEVAYMSASLTKDPK-AAWTAELKERGISGMPAPAGILKRIQ-----DA---VNDE- 266 (517) T ss_pred hccccccc--ccchhhHHHHHHHHHHHHHHhccccccc-ceeeeecccccccccccchHHHHHHH-----Hh---hhhh- Confidence 00000000 0000000000000000000000000000 00000000000000000000000000 00 0000 Q ss_pred cccccchhhhHHHHHHHHhhhhHHhhcceeecCCCceEEEEEcCCCccceecccccccccccccceeeEeeeeeEEeeeh Q lcl|NC_021309. 159 FAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALT 238 (497) Q Consensus 159 ~g~~v~p~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~wv~Eg~~~~~s~~~f~~i~~~~~kla~~~~ 238 (497) .. +++.. ...++..+......+..... +...+. ... |. .......++.-.++...--..... T Consensus 267 -~~---------i~~~~-~~~~i~~~~~~~~~~~~~a~-~~~eG~-~kp----~s-~~tf~~~~~~~~~ia~~~~~S~ql 328 (517) T protein:vir:97 267 -GS---------LLPFI-RHENLPTLVVGGDNALTQGT-GHTTGT-DKT----ES-NITLQTRVLTPQYVYKYIKLPKIV 328 (517) T ss_pred -cc---------ceeee-eeccccceeeecccccceee-eeecCC-ccc----cc-ccceeeEEeeHhhhhhhhhhhHHH Confidence 00 00000 00111111001111111111 111111 000 00 000011111111110000000011 Q ss_pred hhHHH---Hh-hHHHHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccchhhhhhhHHHHHHhhhhh Q lcl|NC_021309. 239 ITDEG---LR-DAPELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADG 314 (497) Q Consensus 239 iS~el---l~-d~~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 314 (497) +.+.. .. --..|.+.+...|++.....+=..-=.|..-.....+.+.....+...+............ ...... T Consensus 329 l~Ds~~dd~~~l~s~i~~~l~~~l~~~ee~a~l~GdGtg~~~~gi~~~a~~~~~~~~~~~~~~~d~i~~l~~--a~~~a~ 406 (517) T protein:vir:97 329 MNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGVSETQIYPVVGDAWATNVTGTTNIQELLEKLSV--ATPKAA 406 (517) T ss_pred HHHhhhccHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCcccccccccccccccccccccchHHHHHHHHHH--Hhhhcc Confidence 11110 00 0112333344444444433333211111110000011111111111112222211111111 111112 Q ss_pred cchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHHHHHHHhhhcC Q lcl|NC_021309. 315 TNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANG 394 (497) Q Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G 394 (497) .+.++.+...+..+...+...+.+.+.+......+ T Consensus 407 ~a~~vmn~~t~~~I~klKD~~G~Yl~~~~~~~~~~--------------------------------------------- 441 (517) T protein:vir:97 407 DSTLVIHRNDLAAIRFLKDKNGNYVFPVGVSNQTI--------------------------------------------- 441 (517) T ss_pred CCEEEECHHHHHHHHHhhcCCCCeeccCcCCcccc--------------------------------------------- Confidence 34466777777777777766665554322110000 Q ss_pred ceeccCccccccccccccccccc-ccceEecCCCCcCceEEEeeccceEEEEeecccEEEeecccchhhhcCceE---EE Q lcl|NC_021309. 395 QYMGGNFFGNAYGNPVNGGKNIW-GVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVT---VR 470 (497) Q Consensus 395 ~~i~~~~~~~~~~~~~~~~~~l~-G~Pvv~~~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~---~r 470 (497) .-+| |.... .+.++ |-..+.+ ..+..+++++....+..++|. +....|..+++. +| T Consensus 442 ~~l~--------G~~~~-~~~~~~~~~~~~~---~~~y~i~~~~g~~~~~~fd~~--------~n~~~f~~~~~~~g~i~ 501 (517) T protein:vir:97 442 ATHF--------GFNRL-VQSVAVDEKTAVS---LSGYVTNGSRGMEFEQGTILV--------ENNKEYLFEMPISGSLE 501 (517) T ss_pred cccC--------Ccccc-ccccccCceeEee---ccccEEEeecceeeeeeeecc--------cCceeEeeeeeeccccc Confidence 0000 00000 00111 1001110 112234455443233444432 233457778776 99 Q ss_pred EEEeecceeecccceE Q lcl|NC_021309. 471 AEERLGLLVYRPSAFQ 486 (497) Q Consensus 471 ~~~r~~~~v~~~~a~~ 486 (497) ++.|+.+.|.+|-.-= T Consensus 502 ~~~r~a~~~~~p~~~~ 517 (517) T protein:vir:97 502 YKGTTAYGTYTPPVAG 517 (517) T ss_pred cccceEEEEEcCCCCC Confidence 9999999999985443 No 224 >protein:vir:9410 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803388;genbank:gi:29028700;genbank:GeneID:1258136 Probab=29.17 E-value=1.7 Score=19.34 Aligned_cols=388 Identities=13% Similarity=0.063 Sum_probs=92.7 Q ss_pred Cch-HHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHhHhHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021309. 1 MPS-TAQLEAQGRQLAKSIKDINA-DETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIP 78 (497) Q Consensus 1 ~~~-~a~~~~~~~~~~~~~~~~~~-~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~a~~~~~~~~~~ 78 (497) |-. ..++.++..+..+++++... +..++...+.+.+..+..+++...+. ++++++.......... T Consensus 7 l~~~l~el~~~~~~~~~~~~~~~~~~~~e~~~~~~~ei~~l~~~i~~~~~~-------------~~~~~~~~~~~~~~~~ 73 (415) T protein:vir:94 7 LQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEE-------------LDKLKEKDGTSENNQQ 73 (415) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHH-------------HHHHHHHHHhhhhccc Confidence 544 34555555555555554433 22222233333333332222211111 1111111110000000 Q ss_pred HHHHHHHHHHHHHHHhhhhhhHHHHhHhhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccc Q lcl|NC_021309. 79 EVEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGT 158 (497) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (497) ..+..... ....... ................. ...+... ......... .......++..- T Consensus 74 ~~~~~~~~-~~~~~~~---~~~~~~~~~~~~~~~~e--------~~~~~~~--~~~~~~~~~------~~~~~~~g~~~i 133 (415) T protein:vir:94 74 SVEVNEAS-TYRNQAN---INDLGISIQNTKVTSQE--------VRDFTEY--LETRNDIQG------GSLKTDSGFVVI 133 (415) T ss_pred cccccchh-hHHHHHH---HHHHHhhhhhhhhhHHH--------HHHHHHH--hhhhhhhhh------hccccccccccC Confidence 00000000 0000000 00000000000000000 0000000 000000000 000000011000 Q ss_pred cccccchhhhHH-----HHHHHHhhhhHHhhc---ceeecCC-CceEEEEEcCCCccceecccccccccccccceeeEee Q lcl|NC_021309. 159 FAPGILPTFLPG-----IVEQLFYELSLADLI---SSRPVTS-PNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQ 229 (497) Q Consensus 159 ~g~~v~p~~~~~-----ii~~~~~~~~l~~~~---~~~~~~~-~~~~~p~~~~~~~~a~wv~Eg~~~~~s~~~f~~i~~~ 229 (497) -..+.++++.. .+..+-...++-... ++...++ ....+.-+. .-..|.........++.--.+. T Consensus 134 -P~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~Eg------~~~~~~~~~~~~~i~~~~~k~~ 206 (415) T protein:vir:94 134 -PEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEEL------EENPELAVKPFFQLAYDINTHR 206 (415) T ss_pred -cHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEeecCCccceecccc------ccccccccccceeeEeeheeee Confidence 01122222211 221211111111111 1111112 222222111 1112222212222233222222 Q ss_pred eeeEEeeehhhHHHHhhHHHHHHHHHHHHHHHHHHHHHhhhhcccCccccccccccccccccccccchhhhhhhHHHHHH Q lcl|NC_021309. 230 VGKVANALTITDEGLRDAPELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVK 309 (497) Q Consensus 230 ~~kla~~~~iS~ell~d~~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~ 309 (497) ..-....--+.+.-..-...|...|...++.++..++-...-.|.+.+...+........+... ............... T Consensus 207 ~~~~is~ell~ds~~~~~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~~~~~~~-~~~~~~i~~~~~~~~ 285 (415) T protein:vir:94 207 GYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKK-AKSLDDIKDAINLNV 285 (415) T ss_pred eechhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCcccccccccccccccccccc-ccchHHHHHHHHhhh Confidence 2211111111111111123455666666777777776666556655443333222222222222 222222222222222 Q ss_pred hhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHHHHHHH Q lcl|NC_021309. 310 FPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLT 389 (497) Q Consensus 310 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~l 389 (497) ......+.++++...+..+...+...+.+...+......+ ..++ +..++..+..+ + T Consensus 286 ~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~--~~l~-----------------G~pV~~~~~~~-----~ 341 (415) T protein:vir:94 286 KPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQ--QRLL-----------------GAKIEILPDEV-----L 341 (415) T ss_pred hhccCCCEEEEcHHHHHHHHHhhccCCCeeeccCcCCCCC--ceec-----------------ceeeEEecccc-----c Confidence 2222344455566665555554444333322221110000 0000 00001100000 0 Q ss_pred hhh-cCceeccCcccccccccccccccccccceEecCCCCcCceEEEeeccceEEEEeecccEEE---------eecccc Q lcl|NC_021309. 390 KDA-NGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQ---------MTNSNG 459 (497) Q Consensus 390 kd~-~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~~~~gd~~~~~~~i~~r~~~~i~---------~~~~~~ 459 (497) -+. +...++. ++...... ..-.|+.|-.+++.-..+ .++...|.+..+. .+.... T Consensus 342 ~~~~~~~i~~g-d~~~~~~~-----~~~~~~~v~~~~~~~~~~---------~~r~~~r~d~~~~~~~a~~~~~~~~~~~ 406 (415) T protein:vir:94 342 GQKGNNTLIIG-NLKDAIVL-----FDRSQYQASWTDYMHFGE---------CLMIAVRQDCRILDYKSAIVIEYDDSER 406 (415) T ss_pred CCCCccEEEEE-ehhccEEE-----EeecceEEEEeccccCce---------EEEEEEEeccEEeccccEEEEEEeccCC Confidence 000 0001111 00000000 000122222222221111 1122222222221 111100 Q ss_pred hhhhcCceEEEE Q lcl|NC_021309. 460 TDFVDGKVTVRA 471 (497) Q Consensus 460 ~~f~~~~v~~r~ 471 (497) ..+..++.+ T Consensus 407 ---~~~~~~~~~ 415 (415) T protein:vir:94 407 ---GEGDLGLEA 415 (415) T ss_pred ---CCCccccCC Confidence 011112111 No 225 >protein:vir:4456 Length: 401 # NCBI annotation: Major capsid protein precursor # Family: family:all:21 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700379;genbank:gi:23505451;genbank:GeneID:955658 Probab=23.83 E-value=2.3 Score=18.65 Aligned_cols=378 Identities=11% Similarity=0.050 Sum_probs=84.5 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021309. 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) Q Consensus 1 ~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~a~~~~~~~~~~~~ 80 (497) |-...++.+++....+++++...+..+++++. ..++.++++.....++..+ ...+++.+............ T Consensus 5 lk~l~~~~~el~~~~~~~k~~~~~~~~~~e~~---~~~l~~~~~~l~~~~~~~~------~~~~~~~~~~~~~~~~~~~~ 75 (401) T protein:vir:44 5 IKDVEQVAQELQQKFDDFKAKNDKRVEAIEQE---KGKLAGQVETLNGKLSELE------NLKSDLEKELLELKRPARGA 75 (401) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHH------HHHHHHHHHHHHhhcccccc Confidence 88888888888888888887777654444332 2222333333222222111 11111111111110000000 Q ss_pred HHHHHHHHHHHHHhhhhhhHHHHhHhhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhhhhhhhhccccccccc Q lcl|NC_021309. 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFA 160 (497) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 160 (497) +.....+.++.... ..+... ............ .......++..-. T Consensus 76 ~~~~~~e~~~a~~~------~lr~~~-------~~~~~~~e~~a~---------------------~~~~~~~GG~~iP- 120 (401) T protein:vir:44 76 QNKVAAEHKDAFVG------FLRKGR-------EDGLRDLERKAL---------------------QVGTDEDGGYAVP- 120 (401) T ss_pred ccchhHHHHHHHHH------HHhhhh-------hhhhHHHHHHHh---------------------hcCCCCCCceecc- Confidence 00000000000000 000000 000000000000 0000000000000 Q ss_pred cccchhhhH-----HHHHHHHhhhhHHhhcceeec--CCCceEEEEEcCCCccceecccccccccccccceeeEeeeeeE Q lcl|NC_021309. 161 PGILPTFLP-----GIVEQLFYELSLADLISSRPV--TSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKV 233 (497) Q Consensus 161 ~~v~p~~~~-----~ii~~~~~~~~l~~~~~~~~~--~~~~~~~p~~~~~~~~a~wv~Eg~~~~~s~~~f~~i~~~~~kl 233 (497) ..+.++++. .++..+-...++-......++ ++....|.-+.. -..+.........++.--++...-. T Consensus 121 ~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~a~wv~E~~------~~~~~~~~~~~~v~~~~~k~~~~~~ 194 (401) T protein:vir:44 121 EELDRSILSLLKDEVVMRQEATVITVGGSDYKKLVNLGGTASGWVGETD------TRSQTATSRLGLIEPFMGEIYGNPQ 194 (401) T ss_pred HhHHHHHHHHHHhhhhhhhhceeeecCCCceEEEEecCCccceeecccc------ccCccccccceeeeeehhheeeehh Confidence 001111110 011111111111110001111 112222221111 0111111111122222212211111 Q ss_pred EeeehhhHHHHhhHHHHHHHHHHHHHHHHHHHHH--------hhhhcccCccccccccccccccccc--cccch-hhhhh Q lcl|NC_021309. 234 ANALTITDEGLRDAPELFNFVQGRLLEGIQRKEE--------VQLLAGGGYPGVNGLLQRSTGFTAS--SASSL-FGATS 302 (497) Q Consensus 234 a~~~~iS~ell~d~~~l~~~i~~~la~~~~~~~d--------~~~l~G~G~~~p~Gi~~~~~~~~~~--~~~~~-~~~~~ 302 (497) ...--+.+....=-..+...|...++..+...+= ..|++..+...-.+.........+. ..... ..... T Consensus 195 iS~ell~ds~~~l~~~i~~~la~ai~~~~~~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~d~i~ 274 (401) T protein:vir:44 195 ATQKMLDDAFFNVEAWINSELATEFAEQEEIAFTTGDGTKKPKGFLAYESTEESDKARAFGKLQHIVSGEATAVTADAII 274 (401) T ss_pred hhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhccCCCCccceeeccccccccccccccccccccccccccccCHHHHH Confidence 1111111111111223444444555555544433 2344333221111111111111110 11111 11111 Q ss_pred hHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhH Q lcl|NC_021309. 303 ATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRD 382 (497) Q Consensus 303 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~ 382 (497) .............+.++++...+..+...+...+...+.+......+. .++ +..++.++. T Consensus 275 ~~~~~l~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~g~~~--~l~-----------------G~PVv~~~~- 334 (401) T protein:vir:44 275 KLIYTLRKAHRTGAKFMMNNNSLFAIRLLKDTEGNYLWRPGLELGQPS--SLA-----------------GYGIAENEQ- 334 (401) T ss_pred HHHHhcchhhhcCCEEEEcHHHHHHHHHhhccCCceeecCCcCCCCCc--eec-----------------ceeeEEecC- Confidence 111111111122223455555555555444333322222111100000 000 000011000 Q ss_pred HHHHHHHhhhcCceeccCcccccccccccccccccccceEecCCCCcCceEEEeeccceEEEEeecccEEEeecccchhh Q lcl|NC_021309. 383 WELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDF 462 (497) Q Consensus 383 ~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~Pvv~~~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f 462 (497) +. ..++.+..++-.++...+.. ..-.|+-+..+++...+.+ .+....|.+..+.- ...| T Consensus 335 ---~p-~~~~~~~~i~~Gd~~~~~~i-----~~~~~~~~~~~~~~~~~~v--------~~~a~~r~d~~~~~----~~a~ 393 (401) T protein:vir:44 335 ---MP-DIAADAKAIAFGNFKRGYTI-----VDRIGTRILRDPYTNKPFV--------GFYTTKRTGGMLVD----SQAI 393 (401) T ss_pred ---cC-CccCCccEEEEeehhccEEE-----EEecceEEeeeccccCCcE--------EEEEEEEeccEEec----ccce Confidence 00 00111111111111000000 0001222222222211111 11111111111110 0001 Q ss_pred hcCceEEEEEEeecceeec-ccc Q lcl|NC_021309. 463 VDGKVTVRAEERLGLLVYR-PSA 484 (497) Q Consensus 463 ~~~~v~~r~~~r~~~~v~~-~~a 484 (497) .++. +.| T Consensus 394 ---------------~~l~~~aa 401 (401) T protein:vir:44 394 ---------------KLLKIAAA 401 (401) T ss_pred ---------------EEEEeecC Confidence 1111 111 No 226 >protein:vir:100851 Length: 514 # NCBI annotation: hypothetical protein # Family: family:all:2450 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164744;genbank:gi:56693157;genbank:GeneID:3197484 Probab=23.56 E-value=2.3 Score=18.61 Aligned_cols=329 Identities=13% Similarity=0.047 Sum_probs=121.6 Q ss_pred HHhhhhhhHHHHhHhhhhhhhhhhhhHHHHHHhhhHHHHHHHHHHHHHhhhhhh-hhhhhhccccccccccccchhhhHH Q lcl|NC_021309. 92 LARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAP-AAIGQNPFGSTGTFAPGILPTFLPG 170 (497) Q Consensus 92 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~g~~v~p~~~~~ 170 (497) .. .....+.... ...+. ..+.. .+............-+....- ....-.....+-++|+.+--+.+.. T Consensus 1 ~~----~~~~~~~~~~---~~~~~---~~~~~-~~~~~~~~~~~~~~~~~~~k~a~t~gy~~~~~~~t~gaAlR~EsLd~ 69 (514) T protein:vir:10 1 MY----TQDKTKDIMK---KSFFG---GDRAV-AFDTNKEDILNENLPENVKKSAFTAGHSITPDTQTDGAANRIESLNR 69 (514) T ss_pred CC----ccchhhHHHh---hhhcc---cceee-eecCcHHHHHHHhcchhhhhhhhccccccCCccccCccchhhhhhcc Confidence 00 0000000000 00000 00000 000000000000000000000 0000001111224455555443333 Q ss_pred HHHHH---HhhhhHHhhcceeecCCCceEEEEEc--CCCccceecccccccccccccceeeEeeeeeEEeeehhhHHH-H Q lcl|NC_021309. 171 IVEQL---FYELSLADLISSRPVTSPNLSYLTES--AAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTITDEG-L 244 (497) Q Consensus 171 ii~~~---~~~~~l~~~~~~~~~~~~~~~~p~~~--~~~~~a~wv~Eg~~~~~s~~~f~~i~~~~~kla~~~~iS~el-l 244 (497) -+..+ .....++.-+...++.+-.-+|-.+. ++.+.+.+++|++..+.+++++.+..+.++=+..-..+|..+ | T Consensus 70 ~l~~Lt~~~~~ftf~~~i~k~~a~STV~ey~~~~~~G~~G~~~f~~E~gi~~~~d~~~~rk~~~~k~l~~~~~vS~~~~l 149 (514) T protein:vir:10 70 DLKVTTWGERDFTLYNDIAKQPVDNTVLKYTQYYSHGRTGHSLFQPEIGIGDVNNPNERQRTINIKYIVDTHVTSIALQR 149 (514) T ss_pred ceeEeeecCcchhhhhhcCCchhhHHHhhhhhhcccCcccccccccccccCcCCCcceEEEEEeeeeeeeeeeeeehhhh Confidence 22222 22234455556666665444444433 333456789999999999999999999999998877777665 4 Q ss_pred hhH-HHHHHHHHHHHHHHHHHHHHhhhhcccCcc---------ccccccccccccccccccchhhhhhhHHHHHHhhhhh Q lcl|NC_021309. 245 RDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYP---------GVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADG 314 (497) Q Consensus 245 ~d~-~~l~~~i~~~la~~~~~~~d~~~l~G~G~~---------~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 314 (497) +++ .+......++-.-.++..++.+.++|+-.- ++.||.+.-... T Consensus 150 ~n~i~d~~~~~~~dai~~ia~tiE~a~FyGDs~L~s~~~~~gleFDGl~~lI~~~------------------------- 204 (514) T protein:vir:10 150 ANTIVDSLKVQEYAAISTVIKTDEWAMFYGDADLTSGQKGEGLQFDGLFKLIAPE------------------------- 204 (514) T ss_pred ccchhhHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccCcchhhhHHHhhcCC------------------------- Confidence 444 467777778888889999999999987421 122332221100 Q ss_pred cchhhhhhhhhhhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHHHHHHHhhhcC Q lcl|NC_021309. 315 TNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANG 394 (497) Q Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G 394 (497) +.+ .+ .|..+ ...++.-...+....+.+++-++|...+.+.+..-...-- T Consensus 205 --------NvI-Da------------rG~~L---------s~~~ln~aA~~i~~gfGt~TD~ylp~~vka~f~~~~~~~q 254 (514) T protein:vir:10 205 --------NHI-DL------------RGGRL---------SPAALNMAARKIGEGFGTPTDAYMPIGIKADFVNQHLNGQ 254 (514) T ss_pred --------CeE-ec------------CCCCc---------cHHHHhhhhhhhhcccCChhheeCchHHHHHHhhcccCcc Confidence 000 00 00000 0111111112222334445555666666555544333333 Q ss_pred ceeccCcccc------------cccccccccccccccceEecCCCCcCceEEEeeccceEEEEeecccEEEeecccchhh Q lcl|NC_021309. 395 QYMGGNFFGN------------AYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDF 462 (497) Q Consensus 395 ~~i~~~~~~~------------~~~~~~~~~~~l~G~Pvv~~~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f 462 (497) |-+...+... ..|.-...++++.+.+-......+. +++.. .-..+.+.++...+..| T Consensus 255 RV~~~~n~~~~~~G~~v~~f~s~~G~I~L~gs~im~~~n~L~~~~~~-----~~~Ap------~~~~va~svT~~~~g~~ 323 (514) T protein:vir:10 255 RVMLPGQTGGMTTGLDIDKFLSAHGSIRIQGSTIMDSDNKLDFDRPV-----SPTAP------TAPQLSATVTPDGGGLW 323 (514) T ss_pred eEEeecCccceeeeeeccceeEeccceeecCCeeecccccCccCCcc-----CCcCC------CCCcceEEEecCccccc Confidence 3332222111 0011111111222221111111110 00000 00011122211111111 Q ss_pred h-------cC----------ceEEEEEEeecceeecccce-----------EEEEeeCCCCCC Q lcl|NC_021309. 463 V-------DG----------KVTVRAEERLGLLVYRPSAF-----------QLIQLKKGATGS 497 (497) Q Consensus 463 ~-------~~----------~v~~r~~~r~~~~v~~~~a~-----------~~l~~~~~a~~~ 497 (497) . ++ ...|++...-+..=-.|..+ +.|++..-+-|+ T Consensus 324 ~~ad~t~~~g~~~~~~~~g~~~sYaVv~~n~~GeS~ps~~vtaT~a~~~~~i~ltItp~~~~~ 386 (514) T protein:vir:10 324 HEADKTDSKGEVILNKEVGVEQSYVAVMVSRHGDSRPSLVQTATPTKKDDAITLTITPNAMQN 386 (514) T ss_pred CcccccccccccccccccceeEEEEEEEECCCCcccccceeeeeeeccCceEEEEEEeccCcc Confidence 0 00 11233332222222223333 234444323333 No 227 >protein:vir:103463 Length: 521 # NCBI annotation: major head subunit precursor # Family: family:all:364 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803115;genbank:gi:116326395;genbank:GeneID:4405492 Probab=22.13 E-value=2.5 Score=18.41 Aligned_cols=366 Identities=15% Similarity=0.064 Sum_probs=122.7 Q ss_pred HhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHhHhhhhhhhhhhhhHHHHHHhhhH Q lcl|NC_021309. 48 AEVEAHERAQEMLKSLGGADAAKDGLDNDIPEVEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADP 127 (497) Q Consensus 48 ~~~~~~e~~~e~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 127 (497) -++... +.+.++..-+... +.+ .++.+. +..+...-+.+.+......+... T Consensus 1 ~~~~~~---~~l~~kw~p~l~~-~~~-~~i~~~-------------~~~~~a~~~enq~~~~~~~~~~~----------- 51 (521) T protein:vir:10 1 MTIKTK---AELLNKWKPLLEG-EGL-PEIANS-------------KQAIIAKIFENQEKDFQTAPEYK----------- 51 (521) T ss_pred CCcchh---HHHHHhhhhhhcc-CCC-Cccccc-------------hhhhhhhhhhhhhhhhhhccccc----------- Confidence 011110 1122222221111 000 000000 00000000000000000000000 Q ss_pred HHHHHHHHHHHHhhhhh---hhhhhhhccccccccccccchhhhHHHHHH---HHhhhhHHhhcceeecCCCceEE---- Q lcl|NC_021309. 128 GTAAAELMGAFADGETA---PAAIGQNPFGSTGTFAPGILPTFLPGIVEQ---LFYELSLADLISSRPVTSPNLSY---- 197 (497) Q Consensus 128 ~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~g~~v~p~~~~~ii~~---~~~~~~l~~~~~~~~~~~~~~~~---- 197 (497) ........+.+...... ..-.......+++++. +..+.+.++.+ ..+...-.+++.|.||+++..-| T Consensus 52 ~~~~~~~~~~~l~e~~~~~~~~~~~~~i~es~~t~~---v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMR 128 (521) T protein:vir:10 52 DEKIAQAFGSFLTEAEIGGDHGYNATNIAAGQTSGA---VTQIGPAVMGMVRRAIPNLIAFDICGVQPMNSPTGQVFALR 128 (521) T ss_pred hhHHHHHHhhhhhhhcccCccccccccccccccccc---cccCCchhhhHHHHHHhhhhhhhceeeccCCchhhhheeee Confidence 00000000111000000 0000001111122211 11233444444 44555677899999998864332 Q ss_pred ---EEEcCC-----------Cccceecc---------------------------------------------------- Q lcl|NC_021309. 198 ---LTESAA-----------HNNAAAVA---------------------------------------------------- 211 (497) Q Consensus 198 ---p~~~~~-----------~~~a~wv~---------------------------------------------------- 211 (497) +.+... ..++.|-+ T Consensus 129 srY~~q~~~~~g~eaf~~~~~ada~fSG~~~at~~s~~~~~~~~~~Gd~~~~~~~~~g~~~~~~~~~~t~~~t~~d~~~~ 208 (521) T protein:vir:10 129 AVYGKDPIAAGAKEAFHPMYGPDAMFSGQGAAKKFAALAASTQTTVGDIYTHFFQDTGTVYLQASAQVTISSTADDAAKL 208 (521) T ss_pred eeccCCccccccccccchhccccccccccccccccccccccccccccccccccccccccceecccccccCCCcccccccc Confidence 111100 00011110 Q ss_pred -------------------------c---------ccccccccccceeeEeeeeeEEeeehhhHHHHhh--HH---HHHH Q lcl|NC_021309. 212 -------------------------E---------AGTYPFSSEEFARVYEQVGKVANALTITDEGLRD--AP---ELFN 252 (497) Q Consensus 212 -------------------------E---------g~~~~~s~~~f~~i~~~~~kla~~~~iS~ell~d--~~---~l~~ 252 (497) | +...++...++++++...+.=+-...+|-||.|| +. |.++ T Consensus 209 ~~~~~~~~~~~~~y~~~~GmsTa~aEal~~~g~ss~~~f~EMaFsIeKvtVtAKSRaLKAEYTiELAQDLKAVHGLDAEt 288 (521) T protein:vir:10 209 DAEIKKQMEAGALVEIAEGMATSIAELQESFNGSTDNPWNEMGFRIDKQVIEAKSRQLKAAYSIELAQDLRAVHGMDADA 288 (521) T ss_pred cccccccccccceeecccccchhhHhhhccCCCCccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHH Confidence 0 0112333344455555555555557789999998 32 6889 Q ss_pred HHHHHHHHHHHHHHHhhhhcccCcc---cccccccc----ccccccccccchhhhhhhHHHHHHhhhhhcchhhhhhhhh Q lcl|NC_021309. 253 FVQGRLLEGIQRKEEVQLLAGGGYP---GVNGLLQR----STGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTV 325 (497) Q Consensus 253 ~i~~~la~~~~~~~d~~~l~G~G~~---~p~Gi~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 325 (497) .|.+-|+..|...|++.||.=--.. +..|+... .+.+...... . .....+ .. T Consensus 289 ELaNILSTEImlEINReii~~i~~sa~~~~~g~t~~~~~~~G~~d~~~~~-------------d-----~~~~~~---~~ 347 (521) T protein:vir:10 289 ELSGILATEIMLEINREVVDWINYSAQVGKSGMTLTPGSKAGVFDFQDPI-------------D-----IRGARW---AG 347 (521) T ss_pred HHHHHHHHHHHHHhhHHHhhhhhheeeeeeeeeeeccCccccceeccccc-------------c-----cccchH---HH Confidence 9999999999999999998311000 11111100 0000000000 0 000000 00 Q ss_pred hhhhhhhhhhhhhhcccccccccchhhhhhhHHHHHHHhhhhhhccCCceEEechhHHHHHHHHhhhcCceeccCccccc Q lcl|NC_021309. 326 ASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNA 405 (497) Q Consensus 326 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~i~~~~~~~~ 405 (497) ...+.. .+..-..+....+...+..++.++.++.-...|... |-.+..+..... T Consensus 348 e~~k~L----------------------~~~i~~~an~i~~~T~r~~~n~~i~S~~Va~~L~~~----~~~~~~~~~~~~ 401 (521) T protein:vir:10 348 ESFKAL----------------------LFQIDKEAVEIARQTGRGEGNFIIASRNVVNVLASV----DTGISYAAQGLA 401 (521) T ss_pred HHHHHH----------------------HHHHHHHHHHHHHhcccccceEEEEchHHHHHHhhc----cccccccccccc Confidence 000000 001111122222334556777777777666655532 111122222111 Q ss_pred ccccccc-----ccccc-ccceEecCCCCcCceEEEeeccce----EEEEeecccEEEeecccchhhhcCceEEEEEEee Q lcl|NC_021309. 406 YGNPVNG-----GKNIW-GVPVVTTPLIPLGTILVGHFAPSV----IQTARREGVTMQMTNSNGTDFVDGKVTVRAEERL 475 (497) Q Consensus 406 ~~~~~~~-----~~~l~-G~Pvv~~~~~~~~~~~~gd~~~~~----~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~ 475 (497) .+...+. ...|. |++|.++++.+.+-+++|-=.... +.......+...... +-..|+ =.+-...|. T Consensus 402 ~g~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~-dp~sfq---P~~g~~tRY 477 (521) T protein:vir:10 402 TGFNTDTTKSVFAGVLGGKYRVYIDQYAKQDYFTVGYKGPNEMDAGIYYAPYVALTPLRGS-DPKNFQ---PVMGFKTRY 477 (521) T ss_pred ccccccCCCceEEEEecCceEEEecCCCCcceEEEEEeCCcccccceeecccccccccccc-CCcccc---ceeeeeeee Confidence 1111111 12343 578888999887766655321100 000000111111111 112232 233334676 Q ss_pred cceeecccceE-------EEEe---eCCCCCC Q lcl|NC_021309. 476 GLLVYRPSAFQ-------LIQL---KKGATGS 497 (497) Q Consensus 476 ~~~v~~~~a~~-------~l~~---~~~a~~~ 497 (497) +. ..+|=+-. +|+- ...+.-| T Consensus 478 ~l-~~NP~~~~~~~~~~~~i~~~~~~~~a~~~ 508 (521) T protein:vir:10 478 GI-GINPFAESAAQAPASRIQSGMPSILNSLG 508 (521) T ss_pred ce-eecCcccccCCccceeecccchhhhcccc Confidence 66 33441110 1110 0111111 No 228 >protein:vir:8846 Length: 705 # NCBI annotation: putative portal protein # Family: family:all:1548 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775254;genbank:gi:27476052;genbank:GeneID:2700600 Probab=21.15 E-value=2.6 Score=18.26 Aligned_cols=127 Identities=14% Similarity=0.117 Sum_probs=11.1 Q ss_pred CchHHHHHHHHHHHHHHHHHHHH--HHHHHHHHH-HHHHHHHHHHHHHHHHhHhHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021309. 1 MPSTAQLEAQGRQLAKSIKDINA--DETKTAAEK-KEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDI 77 (497) Q Consensus 1 ~~~~a~~~~~~~~~~~~~~~~~~--~~~~~~~e~-~~~~~~~~~~~~~~~~~~~~~e~~~e~~~~~~~~~a~~~~~~~~~ 77 (497) .+.+..++.+..++...-.+... ...+.-.+. +........+.+...+..+...+..++..+..+..++.+..+... T Consensus 576 ~~~~~~~e~~~~~~~~~q~e~~~~~~~~~~q~e~~k~q~e~~~~q~e~q~~q~E~q~~q~e~e~~~~~~~~~~~e~~~~~ 655 (705) T protein:vir:88 576 WTNPNSPEALQAKAIREQKEAQPKPEDIKAQADAQRAQSDALAKQAEAQMKQVEAQIRLAEIELKKQEAVLQQREMALKE 655 (705) T ss_pred hhhhhhHHHHHHHHhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11111111111111111111100 000000000 000000000001101111111111111111001111111000000 Q ss_pred HHHHHHHHHHHHHHHHhhhhhhHHHHhHhhhhhhhhhhhhHHHHHHhhhHHHHHHHHHH Q lcl|NC_021309. 78 PEVEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMG 136 (497) Q Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 136 (497) ++.+... . .......... .+. ...............+. ..........++ T Consensus 656 a~~~~~~-~--~~e~e~~~~e-~e~-~~e~~q~~~~~~~~~~~----~~~~k~~~~~rr 705 (705) T protein:vir:88 656 AELQLER-D--RFTWERARNE-AEY-HLEATQARAAYIGDGKV----PETKKPTKAVRR 705 (705) T ss_pred HHHHHHH-H--HHHHHHHHHH-HHH-HHHHHHHHHHHHHHHhH----HHHHHHHHHhcC Confidence 0000000 0 0000000000 000 00000000000000000 000000000111 Done!