Query lcl|NC_016164.1_cdsid_YP_004934604.1 [gene=S-CBS1_gp04] [protein=major capsid protein] [protein_id=YP_004934604.1] [location=4251..6761] Match_columns 836 No_of_seqs 310 out of 1564 Neff 9.0 Searched_HMMs 1612 Date Thu Nov 7 12:40:42 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_4 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_4_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:96762 Length: 632 100.0 1E-122 6E-126 689.5 50.0 615 206-836 1-632 (632) 2 protein:vir:93616 Length: 645 100.0 8.7E-98 5E-101 552.7 47.3 585 222-836 1-638 (645) 3 protein:vir:97397 Length: 517 100.0 4.6E-66 2.8E-69 378.8 40.3 505 234-836 1-515 (517) 4 protein:vir:1433 Length: 435 # 100.0 4.3E-59 2.7E-62 340.6 33.7 402 410-836 1-432 (435) 5 protein:vir:80376 Length: 435 100.0 1.3E-58 8.4E-62 337.8 35.8 403 410-836 1-432 (435) 6 protein:vir:105038 Length: 428 100.0 2.3E-58 1.5E-61 336.5 35.4 394 408-836 1-427 (428) 7 protein:vir:100135 Length: 418 100.0 9.8E-57 6.1E-60 327.6 39.6 406 391-836 1-414 (418) 8 protein:vir:95376 Length: 425 100.0 3E-56 1.9E-59 325.0 39.3 409 395-836 1-420 (425) 9 protein:vir:1328 Length: 392 # 100.0 8.8E-57 5.5E-60 327.9 35.9 383 410-836 1-390 (392) 10 protein:vir:485 Length: 407 # 100.0 2.3E-56 1.4E-59 325.6 37.9 375 427-836 1-399 (407) 11 protein:vir:4074 Length: 480 # 100.0 5.3E-57 3.3E-60 329.1 32.9 463 238-836 1-476 (480) 12 protein:vir:100247 Length: 425 100.0 5.8E-56 3.6E-59 323.4 38.2 399 393-836 1-423 (425) 13 protein:vir:6242 Length: 390 # 100.0 2.4E-56 1.5E-59 325.5 35.3 381 410-836 1-388 (390) 14 protein:vir:4456 Length: 401 # 100.0 1.3E-55 7.8E-59 321.6 37.3 376 413-836 1-400 (401) 15 protein:vir:5739 Length: 366 # 100.0 4.1E-57 2.6E-60 329.7 28.4 339 464-836 1-365 (366) 16 protein:vir:7855 Length: 497 # 100.0 5.9E-55 3.6E-58 317.9 40.2 418 383-836 1-492 (497) 17 protein:vir:101650 Length: 497 100.0 5.9E-55 3.6E-58 317.9 40.2 418 383-836 1-492 (497) 18 protein:vir:10364 Length: 390 100.0 4.3E-55 2.7E-58 318.6 37.7 383 408-835 1-390 (390) 19 protein:vir:97053 Length: 390 100.0 3.8E-55 2.4E-58 318.9 37.4 383 415-835 1-390 (390) 20 protein:vir:81070 Length: 390 100.0 4E-55 2.5E-58 318.8 37.3 383 415-835 1-390 (390) 21 protein:vir:104256 Length: 458 100.0 7.3E-54 4.5E-57 311.9 41.3 431 379-836 1-457 (458) 22 protein:vir:4339 Length: 395 # 100.0 6.2E-54 3.8E-57 312.3 39.0 384 412-836 1-394 (395) 23 protein:vir:8102 Length: 543 # 100.0 3.6E-53 2.2E-56 308.1 42.7 467 331-836 1-541 (543) 24 protein:vir:6212 Length: 434 # 100.0 3.5E-54 2.1E-57 313.7 36.3 409 413-836 1-432 (434) 25 protein:vir:81227 Length: 413 100.0 3.4E-53 2.1E-56 308.2 39.0 389 413-836 1-409 (413) 26 protein:vir:4511 Length: 409 # 100.0 6.8E-54 4.2E-57 312.1 35.2 392 410-836 1-405 (409) 27 protein:vir:8420 Length: 477 # 100.0 1E-53 6.5E-57 311.1 36.1 421 399-836 1-470 (477) 28 protein:vir:1886 Length: 385 # 100.0 3.4E-53 2.1E-56 308.2 37.7 374 430-836 1-383 (385) 29 protein:vir:191 Length: 385 # 100.0 3.4E-53 2.1E-56 308.2 37.7 374 430-836 1-383 (385) 30 protein:vir:97148 Length: 324 100.0 6.4E-54 4E-57 312.2 30.0 298 529-836 1-314 (324) 31 protein:vir:96392 Length: 324 100.0 3.2E-53 2E-56 308.4 29.7 298 529-836 1-314 (324) 32 protein:vir:78830 Length: 324 100.0 3.2E-53 2E-56 308.4 29.7 298 529-836 1-314 (324) 33 protein:vir:7771 Length: 330 # 100.0 2.3E-53 1.5E-56 309.1 27.8 281 552-836 1-322 (330) 34 protein:vir:9309 Length: 324 # 100.0 6.5E-53 4E-56 306.7 29.2 298 529-836 1-314 (324) 35 protein:vir:41 Length: 299 # N 100.0 9.1E-53 5.7E-56 305.9 27.7 276 556-836 1-297 (299) 36 protein:vir:105905 Length: 304 100.0 8.2E-53 5.1E-56 306.1 27.4 279 553-836 1-304 (304) 37 protein:vir:94142 Length: 304 100.0 8.2E-53 5.1E-56 306.1 27.4 279 553-836 1-304 (304) 38 protein:vir:99749 Length: 324 100.0 2.3E-52 1.4E-55 303.7 29.7 298 529-836 1-314 (324) 39 protein:vir:102119 Length: 404 100.0 4.3E-51 2.7E-54 296.7 36.5 379 415-836 1-399 (404) 40 protein:vir:4092 Length: 390 # 100.0 1.8E-51 1.1E-54 298.8 34.1 353 439-836 1-374 (390) 41 protein:vir:4700 Length: 415 # 100.0 1.1E-50 6.6E-54 294.5 38.3 387 412-836 1-403 (415) 42 protein:vir:4600 Length: 415 # 100.0 1.1E-50 6.6E-54 294.5 38.3 387 412-836 1-403 (415) 43 protein:vir:94673 Length: 419 100.0 5.5E-51 3.4E-54 296.1 36.7 394 413-836 1-416 (419) 44 protein:vir:81100 Length: 415 100.0 1.1E-50 7.1E-54 294.4 38.2 390 412-836 1-403 (415) 45 protein:vir:98339 Length: 415 100.0 1.1E-50 7.1E-54 294.4 38.2 390 412-836 1-403 (415) 46 protein:vir:79987 Length: 415 100.0 1.1E-50 7.1E-54 294.4 38.2 390 412-836 1-403 (415) 47 protein:vir:9410 Length: 415 # 100.0 1.2E-50 7.2E-54 294.4 37.9 390 412-836 1-403 (415) 48 protein:vir:96223 Length: 324 100.0 2.3E-52 1.4E-55 303.7 28.1 298 517-836 1-314 (324) 49 protein:vir:103955 Length: 324 100.0 4E-52 2.5E-55 302.4 29.4 298 529-836 1-314 (324) 50 protein:vir:80684 Length: 315 100.0 3.3E-52 2E-55 302.8 27.4 274 561-836 1-305 (315) 51 protein:vir:1268 Length: 397 # 100.0 3E-50 1.9E-53 292.1 36.0 378 408-836 1-396 (397) 52 protein:vir:9574 Length: 300 # 100.0 1.3E-51 8.3E-55 299.5 28.0 272 561-836 1-299 (300) 53 protein:vir:80128 Length: 466 100.0 1.1E-49 6.5E-53 289.1 36.6 417 378-836 1-447 (466) 54 protein:vir:95763 Length: 297 100.0 2.8E-51 1.8E-54 297.7 27.9 279 553-836 1-295 (297) 55 protein:vir:8187 Length: 311 # 100.0 3.1E-51 1.9E-54 297.5 28.1 271 563-836 1-309 (311) 56 protein:vir:101607 Length: 379 100.0 2E-49 1.3E-52 287.5 37.3 368 413-836 1-379 (379) 57 protein:vir:4953 Length: 397 # 100.0 1.9E-49 1.2E-52 287.7 36.5 367 412-836 1-384 (397) 58 protein:vir:9759 Length: 303 # 100.0 6.2E-51 3.8E-54 295.9 27.9 271 563-836 1-302 (303) 59 protein:vir:2430 Length: 318 # 100.0 1.5E-50 9.1E-54 293.8 28.3 285 540-836 1-312 (318) 60 protein:vir:81160 Length: 371 100.0 3.2E-49 2E-52 286.4 35.4 347 429-836 1-370 (371) 61 protein:vir:4997 Length: 397 # 100.0 4.9E-49 3E-52 285.5 36.2 367 412-836 1-384 (397) 62 protein:vir:4226 Length: 326 # 100.0 1.4E-50 8.6E-54 293.9 27.7 292 528-836 1-322 (326) 63 protein:vir:2344 Length: 397 # 100.0 2.7E-50 1.6E-53 292.4 27.9 280 547-836 1-305 (397) 64 protein:vir:98635 Length: 377 100.0 3.7E-50 2.3E-53 291.6 27.5 349 446-836 1-376 (377) 65 protein:vir:1025 Length: 408 # 100.0 1.2E-48 7.3E-52 283.3 35.5 372 406-836 1-392 (408) 66 protein:vir:1383 Length: 421 # 100.0 1.4E-48 9E-52 282.9 35.1 370 406-836 1-382 (421) 67 protein:vir:1638 Length: 298 # 100.0 1E-49 6.3E-53 289.2 28.2 269 565-836 1-298 (298) 68 protein:vir:3991 Length: 404 # 100.0 5.2E-48 3.2E-51 279.8 36.8 374 413-836 1-393 (404) 69 protein:vir:107593 Length: 392 100.0 3.3E-48 2E-51 280.9 35.3 358 426-836 1-383 (392) 70 protein:vir:105004 Length: 392 100.0 3.3E-48 2E-51 280.9 35.3 358 426-836 1-383 (392) 71 protein:vir:102873 Length: 392 100.0 3.3E-48 2E-51 280.9 35.3 358 426-836 1-383 (392) 72 protein:vir:102082 Length: 392 100.0 3.3E-48 2E-51 280.9 35.3 358 426-836 1-383 (392) 73 protein:vir:7409 Length: 408 # 100.0 4.9E-48 3.1E-51 279.9 36.1 371 406-836 1-392 (408) 74 protein:vir:4830 Length: 397 # 100.0 6.2E-48 3.8E-51 279.4 36.7 367 412-836 1-385 (397) 75 protein:vir:94771 Length: 298 100.0 1.8E-49 1.1E-52 287.8 27.9 269 565-836 1-298 (298) 76 protein:vir:9643 Length: 377 # 100.0 7.4E-49 4.6E-52 284.4 30.2 344 446-836 1-376 (377) 77 protein:vir:3845 Length: 395 # 100.0 1.4E-47 8.4E-51 277.5 36.1 365 410-836 1-382 (395) 78 protein:vir:100632 Length: 381 100.0 6E-49 3.7E-52 284.9 27.9 343 451-836 1-367 (381) 79 protein:vir:78223 Length: 333 100.0 8.4E-49 5.2E-52 284.2 28.4 289 532-836 1-331 (333) 80 protein:vir:104085 Length: 320 100.0 7.6E-49 4.7E-52 284.4 28.0 285 545-836 1-316 (320) 81 protein:vir:95963 Length: 395 100.0 5.5E-48 3.4E-51 279.7 32.5 352 436-836 1-375 (395) 82 protein:vir:3870 Length: 400 # 100.0 4.3E-47 2.6E-50 274.8 36.8 386 406-836 1-398 (400) 83 protein:vir:9704 Length: 394 # 100.0 4.5E-47 2.8E-50 274.7 36.1 380 410-836 1-389 (394) 84 protein:vir:101291 Length: 381 100.0 2.3E-48 1.4E-51 281.7 28.7 343 451-836 1-367 (381) 85 protein:vir:9509 Length: 381 # 100.0 2.3E-48 1.4E-51 281.7 28.7 343 451-836 1-367 (381) 86 protein:vir:100172 Length: 394 100.0 1E-46 6.2E-50 272.8 35.6 366 412-836 1-383 (394) 87 protein:vir:1084 Length: 437 # 100.0 5.1E-46 3.1E-49 268.9 39.2 396 383-836 1-429 (437) 88 protein:vir:99920 Length: 311 100.0 4.2E-48 2.6E-51 280.3 27.3 273 561-836 1-310 (311) 89 protein:vir:78523 Length: 338 100.0 6.7E-48 4.2E-51 279.2 27.9 289 529-836 1-334 (338) 90 protein:vir:2504 Length: 305 # 100.0 6.7E-48 4.1E-51 279.2 26.6 272 561-836 1-297 (305) 91 protein:vir:93881 Length: 387 100.0 1.6E-46 9.7E-50 271.7 32.5 378 408-836 1-380 (387) 92 protein:vir:94424 Length: 387 100.0 1.2E-46 7.3E-50 272.4 31.5 378 408-836 1-380 (387) 93 protein:vir:2685 Length: 387 # 100.0 1.2E-46 7.3E-50 272.4 31.5 378 408-836 1-380 (387) 94 protein:vir:96978 Length: 387 100.0 1.2E-46 7.3E-50 272.4 31.5 378 408-836 1-380 (387) 95 protein:vir:100884 Length: 389 100.0 6.1E-46 3.8E-49 268.5 35.0 363 415-836 1-381 (389) 96 protein:vir:9361 Length: 402 # 100.0 2.6E-46 1.6E-49 270.5 31.7 389 394-836 1-395 (402) 97 protein:vir:78640 Length: 352 100.0 1.8E-46 1.1E-49 271.3 29.4 343 442-836 1-345 (352) 98 protein:vir:78350 Length: 383 100.0 1.5E-46 9.2E-50 271.8 28.5 348 445-836 1-374 (383) 99 protein:vir:4856 Length: 293 # 100.0 8.3E-47 5.1E-50 273.2 27.0 263 557-836 1-280 (293) 100 protein:vir:962 Length: 397 # 100.0 3.6E-45 2.2E-48 264.3 35.7 383 391-836 1-396 (397) 101 protein:vir:4197 Length: 314 # 100.0 6.6E-38 4.1E-41 224.4 25.2 285 536-836 1-312 (314) 102 protein:vir:4159 Length: 315 # 100.0 3.1E-37 1.9E-40 220.7 23.9 290 541-836 1-315 (315) 103 protein:vir:3158 Length: 321 # 100.0 1.6E-34 9.9E-38 205.9 25.4 291 534-836 1-313 (321) 104 protein:vir:3033 Length: 272 # 100.0 6.2E-31 3.9E-34 186.2 24.9 258 561-836 1-268 (272) 105 protein:vir:9820 Length: 272 # 100.0 6.2E-31 3.9E-34 186.2 24.9 258 561-836 1-268 (272) 106 protein:vir:79548 Length: 652 99.9 5.6E-29 3.5E-32 175.5 25.0 539 233-834 1-652 (652) 107 protein:vir:95512 Length: 693 99.9 4.3E-27 2.6E-30 165.2 27.4 567 205-835 1-693 (693) 108 protein:vir:93742 Length: 274 99.9 1E-24 6.2E-28 152.2 23.0 259 561-836 1-269 (274) 109 protein:vir:96123 Length: 274 99.9 5.3E-23 3.3E-26 142.7 22.6 259 561-836 1-269 (274) 110 protein:vir:3613 Length: 272 # 99.9 4.7E-23 2.9E-26 143.0 21.1 258 561-836 1-271 (272) 111 protein:vir:97433 Length: 274 99.8 2.6E-22 1.6E-25 139.0 23.2 259 561-836 1-269 (274) 112 protein:vir:94494 Length: 274 99.8 2.6E-22 1.6E-25 139.0 23.2 259 561-836 1-269 (274) 113 protein:vir:105334 Length: 276 99.8 1.5E-22 9.6E-26 140.2 22.0 259 561-836 1-269 (276) 114 protein:vir:80930 Length: 278 99.8 2.9E-22 1.8E-25 138.7 22.4 266 561-836 1-276 (278) 115 protein:vir:96833 Length: 275 99.8 5.7E-22 3.6E-25 137.0 21.7 260 560-836 1-270 (275) 116 protein:vir:1239 Length: 274 # 99.8 2.6E-21 1.6E-24 133.5 22.4 259 561-836 1-269 (274) 117 protein:vir:95898 Length: 274 99.8 5.6E-21 3.5E-24 131.6 22.7 259 561-836 1-269 (274) 118 protein:vir:96262 Length: 274 99.8 5.6E-21 3.5E-24 131.6 22.7 259 561-836 1-269 (274) 119 protein:vir:79928 Length: 393 99.8 9.1E-21 5.7E-24 130.4 22.1 342 442-836 1-377 (393) 120 protein:vir:94933 Length: 330 99.8 1.1E-20 7E-24 129.9 20.7 294 513-836 1-328 (330) 121 protein:vir:8324 Length: 410 # 99.8 3.1E-21 1.9E-24 133.0 14.4 383 339-835 1-410 (410) 122 protein:vir:95107 Length: 270 99.7 1E-17 6.2E-21 113.8 21.1 256 561-836 1-264 (270) 123 protein:vir:108211 Length: 318 99.6 2.7E-17 1.7E-20 111.4 18.1 275 549-836 1-316 (318) 124 protein:vir:739 Length: 231 # 99.6 2.8E-17 1.7E-20 111.3 17.4 220 597-836 1-230 (231) 125 protein:vir:97255 Length: 310 99.6 2.2E-16 1.3E-19 106.5 21.7 272 561-836 1-309 (310) 126 protein:vir:7990 Length: 273 # 99.5 3.8E-15 2.4E-18 99.6 20.8 258 561-836 1-272 (273) 127 protein:vir:105822 Length: 273 99.5 6.1E-15 3.8E-18 98.5 21.1 257 567-836 1-272 (273) 128 protein:vir:102605 Length: 273 99.5 6.1E-15 3.8E-18 98.5 21.1 257 567-836 1-272 (273) 129 protein:vir:99424 Length: 360 99.5 9E-15 5.6E-18 97.6 20.9 295 529-836 1-356 (360) 130 protein:vir:103886 Length: 302 99.5 5.5E-15 3.4E-18 98.7 18.5 268 549-835 1-302 (302) 131 protein:vir:94622 Length: 341 99.4 1.3E-14 8.4E-18 96.6 17.6 282 553-836 1-338 (341) 132 protein:vir:80180 Length: 381 99.4 8E-14 5E-17 92.4 19.1 280 538-836 1-304 (381) 133 protein:vir:78739 Length: 332 99.3 1.1E-12 6.9E-16 86.1 18.1 286 536-835 1-332 (332) 134 protein:vir:8885 Length: 347 # 99.2 2.1E-12 1.3E-15 84.7 17.3 285 538-836 1-345 (347) 135 protein:vir:2201 Length: 345 # 99.2 2.5E-12 1.6E-15 84.2 17.7 283 549-836 1-344 (345) 136 protein:vir:94576 Length: 347 99.2 2.1E-12 1.3E-15 84.6 17.2 284 542-835 1-347 (347) 137 protein:vir:80213 Length: 334 99.2 1.3E-12 8.1E-16 85.7 16.1 284 549-836 1-331 (334) 138 protein:vir:93858 Length: 400 99.2 8.2E-12 5.1E-15 81.4 19.7 381 420-835 1-400 (400) 139 protein:vir:3364 Length: 347 # 99.2 2.3E-12 1.4E-15 84.4 16.4 287 542-836 1-345 (347) 140 protein:vir:100057 Length: 375 99.2 2.1E-11 1.3E-14 79.2 20.8 291 534-836 1-369 (375) 141 protein:vir:10450 Length: 344 99.2 5.8E-12 3.6E-15 82.2 17.4 287 542-836 1-343 (344) 142 protein:vir:3136 Length: 322 # 99.1 7.9E-12 4.9E-15 81.4 17.3 270 560-836 1-317 (322) 143 protein:vir:5974 Length: 324 # 99.1 2.9E-11 1.8E-14 78.4 20.2 260 561-836 1-288 (324) 144 protein:vir:94711 Length: 347 99.1 4.5E-12 2.8E-15 82.8 15.1 286 534-836 1-345 (347) 145 protein:vir:1541 Length: 347 # 99.1 1.2E-11 7.7E-15 80.4 16.9 285 542-836 1-342 (347) 146 protein:vir:103323 Length: 364 99.1 1.5E-10 9.5E-14 74.4 22.6 281 549-836 1-338 (364) 147 protein:vir:6324 Length: 335 # 99.0 1.1E-10 6.6E-14 75.3 19.1 284 549-836 1-331 (335) 148 protein:vir:102944 Length: 330 99.0 1.4E-10 8.9E-14 74.5 19.8 268 561-836 1-292 (330) 149 protein:vir:78935 Length: 335 99.0 9.2E-11 5.7E-14 75.6 18.6 282 549-836 1-331 (335) 150 protein:vir:103285 Length: 296 99.0 2.3E-10 1.5E-13 73.4 18.3 272 560-835 1-296 (296) 151 protein:vir:1583 Length: 351 # 98.9 8E-10 5E-13 70.4 20.0 266 561-836 1-290 (351) 152 protein:vir:102655 Length: 322 98.9 1.3E-09 8.1E-13 69.3 19.2 277 540-836 1-322 (322) 153 protein:vir:97031 Length: 402 98.9 1.5E-10 9.4E-14 74.4 14.0 282 549-836 1-338 (402) 154 protein:vir:99675 Length: 324 98.8 6.2E-10 3.9E-13 71.1 15.5 237 595-836 1-301 (324) 155 protein:vir:99075 Length: 392 98.8 2.6E-09 1.6E-12 67.7 18.6 261 567-836 1-306 (392) 156 protein:vir:107687 Length: 319 98.8 2.8E-09 1.7E-12 67.5 18.7 294 513-835 1-319 (319) 157 protein:vir:80068 Length: 301 98.8 3.7E-09 2.3E-12 66.8 19.3 269 563-835 1-301 (301) 158 protein:vir:7019 Length: 401 # 98.8 4.2E-10 2.6E-13 72.0 13.5 283 549-836 1-332 (401) 159 protein:vir:9927 Length: 295 # 98.8 2.5E-09 1.5E-12 67.8 17.0 256 561-836 1-287 (295) 160 protein:vir:105645 Length: 400 98.7 1E-09 6.2E-13 69.9 14.4 282 549-836 1-332 (400) 161 protein:vir:104342 Length: 314 98.7 3.7E-09 2.3E-12 66.8 16.8 290 522-835 1-314 (314) 162 protein:vir:79642 Length: 329 98.6 1.1E-08 6.9E-12 64.2 17.5 300 529-836 1-327 (329) 163 protein:vir:8843 Length: 317 # 98.6 1.5E-08 9.3E-12 63.5 17.4 273 558-836 1-314 (317) 164 protein:vir:108303 Length: 418 98.6 1.2E-07 7.4E-11 58.5 21.0 257 564-836 1-296 (418) 165 protein:vir:106647 Length: 303 98.5 9.4E-09 5.8E-12 64.6 14.4 266 556-836 1-295 (303) 166 protein:vir:95318 Length: 328 98.4 4.2E-08 2.6E-11 61.0 13.8 227 544-783 1-328 (328) 167 protein:vir:9875 Length: 296 # 98.3 1.5E-07 9.5E-11 57.9 15.7 267 545-836 1-296 (296) 168 protein:vir:7324 Length: 335 # 98.1 2.1E-07 1.3E-10 57.2 13.3 229 544-783 1-335 (335) 169 protein:vir:80446 Length: 367 98.1 8.1E-07 5E-10 54.0 16.4 265 559-836 1-319 (367) 170 protein:vir:103759 Length: 330 98.1 5.6E-07 3.5E-10 54.8 14.2 227 544-783 1-330 (330) 171 protein:vir:96792 Length: 315 98.1 4.6E-06 2.9E-09 49.8 19.2 259 561-836 1-280 (315) 172 protein:vir:3525 Length: 423 # 98.1 3.5E-06 2.2E-09 50.5 18.4 257 561-836 1-302 (423) 173 protein:vir:1991 Length: 305 # 98.0 4.6E-07 2.9E-10 55.3 12.9 211 550-804 1-305 (305) 174 protein:vir:97331 Length: 319 98.0 9.1E-06 5.6E-09 48.2 21.4 278 534-836 1-296 (319) 175 protein:vir:94800 Length: 319 98.0 9.1E-06 5.6E-09 48.2 21.4 278 534-836 1-296 (319) 176 protein:vir:98525 Length: 331 98.0 1.6E-06 9.8E-10 52.4 14.9 227 544-783 1-331 (331) 177 protein:vir:107388 Length: 331 98.0 1.6E-06 9.8E-10 52.4 14.9 227 544-783 1-331 (331) 178 protein:vir:107826 Length: 331 98.0 1.6E-06 9.8E-10 52.4 14.9 227 544-783 1-331 (331) 179 protein:vir:105374 Length: 423 97.9 1E-05 6.3E-09 48.0 18.9 261 561-836 1-302 (423) 180 protein:vir:95131 Length: 325 97.9 1E-05 6.4E-09 47.9 18.7 263 561-836 1-291 (325) 181 protein:vir:95451 Length: 313 97.9 3E-06 1.9E-09 50.9 15.2 271 562-836 1-311 (313) 182 protein:vir:174 Length: 423 # 97.8 2E-05 1.2E-08 46.4 20.1 262 561-836 1-317 (423) 183 protein:vir:94989 Length: 349 97.8 2.2E-05 1.4E-08 46.1 20.8 261 561-836 1-306 (349) 184 protein:vir:78387 Length: 349 97.7 2.5E-05 1.6E-08 45.8 19.9 266 561-836 1-306 (349) 185 protein:vir:3643 Length: 336 # 97.7 4.2E-06 2.6E-09 50.0 13.4 297 507-835 1-336 (336) 186 protein:vir:101557 Length: 336 97.6 1.2E-05 7.3E-09 47.6 15.0 297 507-835 1-336 (336) 187 protein:vir:5255 Length: 304 # 97.6 1.2E-05 7.6E-09 47.5 14.8 265 566-834 1-304 (304) 188 protein:vir:78558 Length: 336 97.4 2.4E-05 1.5E-08 45.9 14.2 297 507-835 1-336 (336) 189 protein:vir:107120 Length: 329 97.4 7.9E-05 4.9E-08 43.1 20.2 290 520-836 1-307 (329) 190 protein:vir:94070 Length: 339 97.3 7.5E-05 4.7E-08 43.2 15.5 301 495-835 1-339 (339) 191 protein:vir:105522 Length: 423 97.2 0.00012 7.2E-08 42.1 20.8 259 561-836 1-317 (423) 192 protein:vir:106734 Length: 336 97.0 8.6E-05 5.4E-08 42.9 13.1 297 507-835 1-336 (336) 193 protein:vir:1781 Length: 221 # 96.6 0.00042 2.6E-07 39.1 14.7 177 640-836 1-201 (221) 194 protein:vir:95875 Length: 401 96.4 0.00064 4E-07 38.1 16.4 286 546-836 1-399 (401) 195 protein:vir:96079 Length: 382 95.7 0.00083 5.1E-07 37.5 11.6 324 486-835 1-382 (382) 196 protein:vir:107732 Length: 379 95.6 0.0018 1.1E-06 35.7 14.2 325 486-835 1-379 (379) 197 protein:vir:79008 Length: 299 95.3 0.0024 1.5E-06 35.0 21.2 259 561-836 1-298 (299) 198 protein:vir:270 Length: 341 # 94.0 0.0058 3.6E-06 32.8 14.3 291 513-836 1-334 (341) 199 protein:vir:99576 Length: 388 93.9 0.0012 7.7E-07 36.5 8.0 330 482-835 1-388 (388) 200 protein:vir:1153 Length: 338 # 93.7 0.0067 4.2E-06 32.5 17.2 291 523-836 1-335 (338) 201 protein:vir:99311 Length: 463 92.6 0.011 6.7E-06 31.3 16.1 300 514-836 1-338 (463) 202 protein:vir:95603 Length: 463 92.6 0.011 6.7E-06 31.3 16.1 300 514-836 1-338 (463) 203 protein:vir:99228 Length: 304 92.5 0.0064 3.9E-06 32.6 9.8 210 549-805 1-304 (304) 204 protein:vir:104011 Length: 337 92.0 0.013 8.3E-06 30.9 18.7 289 523-836 1-332 (337) 205 protein:vir:102823 Length: 470 91.9 0.014 8.4E-06 30.8 11.2 287 524-836 1-322 (470) 206 protein:vir:79171 Length: 337 91.7 0.015 9.2E-06 30.6 18.7 289 523-835 1-337 (337) 207 protein:vir:78920 Length: 290 91.2 0.017 1E-05 30.3 21.0 255 561-836 1-290 (290) 208 protein:vir:98566 Length: 355 91.0 0.018 1.1E-05 30.2 17.2 291 523-836 1-340 (355) 209 protein:vir:1829 Length: 355 # 90.9 0.019 1.2E-05 30.1 17.3 291 523-836 1-340 (355) 210 protein:vir:96666 Length: 462 90.2 0.022 1.4E-05 29.6 15.4 302 510-836 1-338 (462) 211 protein:vir:79712 Length: 285 89.8 0.025 1.5E-05 29.4 19.5 254 569-836 1-285 (285) 212 protein:vir:79246 Length: 304 88.2 0.0087 5.4E-06 31.9 6.6 218 549-836 1-234 (304) 213 protein:vir:98856 Length: 343 88.2 0.034 2.1E-05 28.6 16.4 296 523-836 1-340 (343) 214 protein:vir:63741 Length: 468 87.2 0.04 2.5E-05 28.2 11.9 292 514-836 1-317 (468) 215 protein:vir:80835 Length: 464 86.5 0.045 2.8E-05 28.0 10.7 297 499-836 1-329 (464) 216 protein:vir:79157 Length: 339 86.3 0.046 2.9E-05 27.9 16.8 290 523-835 1-339 (339) 217 protein:vir:78777 Length: 358 84.8 0.058 3.6E-05 27.4 16.3 295 513-836 1-337 (358) 218 protein:vir:100331 Length: 342 84.1 0.063 3.9E-05 27.2 16.8 294 523-836 1-337 (342) 219 protein:vir:80491 Length: 467 84.1 0.063 3.9E-05 27.2 13.2 291 514-836 1-316 (467) 220 protein:vir:93966 Length: 400 83.5 0.068 4.2E-05 27.0 13.7 380 420-835 1-400 (400) 221 protein:vir:3746 Length: 336 # 82.8 0.074 4.6E-05 26.8 16.9 290 526-836 1-329 (336) 222 protein:vir:2016 Length: 357 # 82.5 0.076 4.7E-05 26.7 15.9 292 523-836 1-341 (357) 223 protein:vir:348 Length: 321 # 82.1 0.08 4.9E-05 26.6 16.8 275 549-835 1-321 (321) 224 protein:vir:105464 Length: 346 81.0 0.089 5.5E-05 26.3 21.0 259 561-836 1-299 (346) 225 protein:vir:861 Length: 318 # 80.8 0.092 5.7E-05 26.3 11.0 299 509-835 1-318 (318) 226 protein:vir:6061 Length: 357 # 80.8 0.092 5.7E-05 26.3 16.1 292 523-836 1-341 (357) 227 protein:vir:100851 Length: 514 80.6 0.093 5.8E-05 26.2 12.4 318 488-836 1-370 (514) 228 protein:vir:5694 Length: 357 # 80.3 0.096 6E-05 26.2 16.0 292 523-836 1-341 (357) 229 protein:vir:3783 Length: 336 # 74.4 0.16 9.8E-05 25.0 18.1 290 526-836 1-329 (336) 230 protein:vir:78186 Length: 337 73.5 0.17 0.00011 24.8 17.9 289 523-836 1-332 (337) 231 protein:vir:101039 Length: 529 73.5 0.17 0.00011 24.8 14.9 412 383-836 1-520 (529) 232 protein:vir:1663 Length: 393 # 69.6 0.22 0.00014 24.2 14.3 374 413-835 1-393 (393) 233 protein:vir:78090 Length: 302 63.4 0.32 0.0002 23.3 19.2 258 561-836 1-301 (302) 234 protein:vir:101811 Length: 529 58.7 0.41 0.00025 22.7 14.9 408 383-836 1-520 (529) 235 protein:vir:102335 Length: 312 52.8 0.55 0.00034 22.0 20.4 261 561-836 1-309 (312) 236 protein:vir:106998 Length: 468 42.9 0.87 0.00054 20.9 21.1 328 475-836 1-458 (468) 237 protein:vir:79078 Length: 307 42.4 0.88 0.00055 20.9 15.7 265 560-836 1-306 (307) 238 protein:vir:80986 Length: 528 41.5 0.92 0.00057 20.8 19.8 346 439-836 1-500 (528) 239 protein:vir:100603 Length: 529 35.9 1.2 0.00075 20.1 21.7 347 467-836 1-501 (529) 240 protein:vir:107947 Length: 519 35.7 1.2 0.00075 20.1 23.0 345 467-836 1-498 (519) 241 protein:vir:107882 Length: 307 35.0 1.3 0.00078 20.0 16.3 264 560-836 1-306 (307) 242 protein:vir:103463 Length: 521 33.7 1.3 0.00083 19.9 22.2 346 467-836 1-492 (521) 243 protein:vir:99523 Length: 311 32.9 1.4 0.00086 19.8 19.4 266 564-835 1-311 (311) 244 protein:vir:106286 Length: 534 31.7 1.5 0.00092 19.6 20.7 346 467-836 1-513 (534) 245 protein:vir:6901 Length: 522 # 31.2 1.5 0.00094 19.6 20.8 347 467-836 1-501 (522) 246 protein:vir:93696 Length: 364 29.8 1.6 0.001 19.4 15.6 269 561-836 1-360 (364) 247 protein:vir:78148 Length: 123 28.9 1.7 0.0011 19.3 6.9 102 735-836 1-123 (123) 248 protein:vir:2106 Length: 430 # 27.8 1.8 0.0011 19.2 16.1 260 561-836 1-334 (430) 249 protein:vir:6601 Length: 528 # 27.6 1.8 0.0011 19.2 20.2 348 439-836 1-503 (528) 250 protein:vir:7214 Length: 521 # 23.5 2.3 0.0014 18.6 22.3 347 467-836 1-500 (521) No 1 >protein:vir:96762 Length: 632 # NCBI annotation: putative phage-related protein # Family: family:all:21 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039818;genbank:gi:126010917;genbank:GeneID:5076272 Probab=100.00 E-value=9.5e-123 Score=689.54 Aligned_cols=615 Identities=32% Similarity=0.492 Sum_probs=480.5 Q ss_pred CcccccchhHHHhhhccccchhhhhhhhhhcccccceEEEEEecCcccccccCcEEEeccccccchhhhcCCCceEeecC Q lcl|NC_016164. 206 DQNDGRSLMDLRELNSEPLYRSAVVADVARAKEDPEVVEFTFSSEQPVERYFGMEVLSHDPDAMNMSRLNSGAAPWLWNH 285 (836) Q Consensus 206 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~rt~~~~~~~~~~v~~~~~~e~l~~~~~a~~~~~~~~~~~~lL~~H 285 (836) ||+..++.+..|..+..+++|.. .+.+.++|++.|||+++|||++||+||+|+|+|+|++++||+++++++ +||||+| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~r~~-~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~e~l~~~~~~~~~~~~~~~-~~~l~~H 78 (632) T protein:vir:96 1 MPQPTKKTTVLRTIEGRELQREL-RVLSDSIDQEARTVELAASSEYPVPRWFGREILDHSPGAIRMGRLKNG-APLLDSH 78 (632) T ss_pred CCCcCCCCCccccccCceeeeEE-eeeeccccccccEEEEEEecCCccccccCcccccccccccchhhccCC-CeeeccC Confidence 66666888889999887777665 567789999999999999999999999999999999999999999876 7999999 Q ss_pred CCCCcceEEEEEEEecCCEEEEEEEEcCCcccccHHHHHHHHHHhcCccceeeeeeEeeccccccCCC--CeEEEEEEEE Q lcl|NC_016164. 286 NAEVVLGVVERAWMGDDRRGRVRTRWSPNTKIEGSEEYKRRQDWESGTIRNVSFMYSIDAPLDLTSRE--GMALVTAFTP 363 (836) Q Consensus 286 ~~~~~iG~v~~~~~~e~~~~~a~~~f~~~~~~~~~~~~~~~~~v~~G~l~~~SiG~~v~~~~~~~~~~--~~~~~~~~~l 363 (836) |+++|||+|++++++++++|+++++|+.+ ++++++|++||+|+|++|||||+|++|++.+.++ +++++++|+| T Consensus 79 ~~~~~iG~v~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~g~~~~~SiG~~~~~~~~~~~~~~~~~~~~~~~~~ 153 (632) T protein:vir:96 79 SLREQIGVVEEVWLDDDRRLRARVRFSRS-----AKAEELWQDVLDGIRRHISIGYIIHEMVLESSGDQGDTYRVMDWEP 153 (632) T ss_pred CCCCcceEEEEEEEeCCceEEEEEEeCCC-----hhHHHHHHHHhcCcccceeeeeeeeeeeeecCCCCcceEEEEEEEE Confidence 99999999999999767678999999753 5789999999999999999999999998765544 4788999999 Q ss_pred EEEEEEeccCccchhhhhhhhhhhhhhhhhhhhhh--hh-----hhhhhh-----hhhhhhhhhhhhhhhhhhhhhhhhh Q lcl|NC_016164. 364 MEVSAVSIPADHTVGQGRKATSSSGPPGAAAATVA--PL-----SHNDNN-----HMDSSTIDMEAVRAQAAADERSRVA 431 (836) Q Consensus 364 ~EiS~V~~pA~~~a~v~~~~~~~~~~~~~~~~~~~--~~-----~~~~~~-----~~~~~~~~~e~~~~~~~~~~~~~~~ 431 (836) ||||||+|||||+++|.++................ .. ...... ..................++..+.. T Consensus 154 ~EiS~v~~pAd~~a~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~a~~~~~~~~~a~~~~~~~~E~~r~~ 233 (632) T protein:vir:96 154 YEISLISVPADPTVGVGRSIDIGNITIRGAEMPDKDKQTQTAGSQQTETRGAETGAKNPAPAASGANENDILSRERTRIS 233 (632) T ss_pred EEEEEeecCCCCcceeeeeccccccccccccccchhhhhhccccccccccchhhcccccchhhhhhhhhhhhhhhHHHHH Confidence 99999999999999997655432211110000000 00 000000 0000001111122233344556677 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhHHHHHHHHhhhhhhHHHHhhhhhhhhhhhHHHHHHHhh Q lcl|NC_016164. 432 SITSLCREHKADDLAQGLIESGASEADAMRSVLSEIAKRPAAQPATPAAPVRSAQPIAAGGGSADIGLTDKEARSFSFVR 511 (836) Q Consensus 432 ei~al~~~~~l~e~a~eliee~~t~~e~~~~~l~~l~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (836) +|.++.+.+..++.+.++++++.+++++.+..++.+.....+........... ...............+......+.+ T Consensus 234 eI~~l~~~~~~~~~~~~ai~~g~sld~~ra~~ld~l~~~~~a~~~~~~a~~~~--~~~~~~~~~~i~~~~re~~~~~l~r 311 (632) T protein:vir:96 234 EITAIGQQFSQRSLAQEAIQKGHTVDQFRALVLERMNPGQPGNFEKPGAGDLP--GKPAIHSARDLGIQHKELQQYSLMR 311 (632) T ss_pred HHHHHHHHhhhhhhHHHHHhccccHHHHHHHHHHHHhhhhhhhhhhhhhhhhh--hhhhhhhhhhhhhhHHHHHHHHHHH Confidence 88888888888888999999999999888888887765443322211111110 0111111222333344444444444 Q ss_pred hhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhhhhhhhhh-hhhhcccccccccccchhhHHHHHHHHHhhhh Q lcl|NC_016164. 512 AIRAQMMPGDRAAFEAAAFEREVSEATAQRMGVTPRGILAPNDVLHR-DLVVDTASAAGDLVFTDGRPGSFIELLRNRLA 590 (836) Q Consensus 512 a~~a~~~~~~~~~~~~~~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~-a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~ 590 (836) .++....... .......+.+.+.+++.|...++...+...... +..+.+.+++|.+++++.+...|++.+++.++ T Consensus 312 ai~a~a~~~~----~~a~~~~e~a~~~a~~~G~~arg~~~~~~~l~~ra~~~~t~~~gg~lvp~~~~~~~iie~lr~~s~ 387 (632) T protein:vir:96 312 AINAAATGDW----SKAGFEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAI 387 (632) T ss_pred HHHhhhccch----hhhhhhhHHHHHHHHhhhhhhhhhhhhHHHHHHhhhhcccccccccccccccchHHHHHHHhhcch Confidence 4443322211 122233456667777778777776655554443 44444444444444445568899999999999 Q ss_pred hhhhcceeeecCCceEEEEEecCCceeeeeccCcccccccccceeEEeeeeeeeeeehhHHHHHhcchhHHHHHHHHHHH Q lcl|NC_016164. 591 LNTLGVTMLTGLQGPVAIPRQTGAATAYWVAEGGDPTESQPSVDQVALVAKTLGAYTEFSRRLMLQSSIDVEQMVRTELA 670 (836) Q Consensus 591 l~~l~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~it~~~~t~~~~i~ISrelL~ds~~~l~~~i~~~l~ 670 (836) +++++++++++..+.+++|+.++++.++|++|+++++.++++|+++++++++++++++||++||.|+.++++++|.+.|+ T Consensus 388 i~~l~~~~~~~~~g~~~ip~~~~~~~a~wv~E~~~~~~s~~~f~~i~l~~~k~~~~v~iS~ell~ds~~~~~~~i~~~l~ 467 (632) T protein:vir:96 388 IGQMGARMLPGLVGDVDIPKKTSGANFYWIGEDEDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQSSIHVENLIREDLI 467 (632) T ss_pred hhhhcceEeecCCcceEEEEEeCCceeEeecCCccccccccceeeEEeeeeEEEEehhhHHHHHhccchHHHHHHHHHHH Confidence 99998899999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHhhcCCcccccccccccccccccccccchhHHHHHHHHHHHhhhccccCccEEEecHHHHHHHHH--Hh Q lcl|NC_016164. 671 TVIALEIDRAALYGLGSNSQPEGLKFVTGINTENFGATNPTYVELVSMESKVAADNADIGAMSYLTNSTLYGGFKT--TE 748 (836) Q Consensus 671 ~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~aa~~~t~~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~--lk 748 (836) .+++++++.+||+|+|++++|.||++.++++.++.+++.++++++.+++.++..++....+++|+|||.++..|.. ++ T Consensus 468 ~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~~~i~~~~~~i~~~~~~~~~~~~~~~~~~~~~l~~~~l~ 547 (632) T protein:vir:96 468 EGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYPAGGVDWASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQVF 547 (632) T ss_pred HHHHHHHHHHhhcccCCCCccceeeecccccceecccccCCHHHHHHHHHHHhhcccccCccEEEEchhHHHHHHHHhcc Confidence 9999999999999999989999999999988888878888999999999999999888888999999998887776 56 Q ss_pred hccCccccccCCCCeecceeeEeeCccccceEEEEehhceEEEeecceEEEEecccccccCcEEEEEEEEeccEEEcccc Q lcl|NC_016164. 749 KATSTAQFVLEPGGTVNGYNVVRSNQVANGDVFFGVWNQMIMGMWGALDIQVNPYALDKSGSVRVTALQDVDVAVRHPEA 828 (836) Q Consensus 749 d~~g~~~~~~~~~~~l~G~pVv~s~~~~~~~i~~gD~s~~~i~~~~~l~i~~~~~~~~~~~~~~~r~~~r~d~~v~~p~A 828 (836) |.+|+|+| . +++|+|+||++++++|.+.++||||+.|++++++++++.+++++++.+|++.|++++|+|+++++|+| T Consensus 548 d~~G~~i~--~-~~~l~G~pv~~s~~ip~~~~~~gd~s~~~i~~~~~~~i~~~~~~~~~~~~v~~~~~~~~d~~v~~~~a 624 (632) T protein:vir:96 548 DNTGERIW--Q-NNEVNGYRAEASNQIPADTWIFGDWSQIVIAMWGVLDLKVDPYTKAASDGLVLRVFQDVDAGVRRKEA 624 (632) T ss_pred CCCCceee--c-CCeecccceEeccccccCcEEEeecceEEEEEecceEEEEccccccccCceEEEEEeecCceeechhh Confidence 77777665 3 57999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eEEEeecC Q lcl|NC_016164. 829 FCRGNDNL 836 (836) Q Consensus 829 f~~l~~A~ 836 (836) |++++.+= T Consensus 625 f~~~k~~A 632 (632) T protein:vir:96 625 FCIAKKGA 632 (632) T ss_pred hhheeecC Confidence 99999777 No 2 >protein:vir:93616 Length: 645 # NCBI annotation: putative major head protein/prohead protease # Family: family:all:21 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449293;genbank:gi:157166041;goa:Q6H9U8;interpro:IPR006433;uniprot:Q6H9U8;genbank:GeneID:5580438 Probab=100.00 E-value=8.7e-98 Score=552.69 Aligned_cols=585 Identities=16% Similarity=0.164 Sum_probs=377.7 Q ss_pred cccchhhhhhhhhhcccccceEEEEEecCcccccccCcEEEeccccccchhhhcCCCceEeecCCCCCcceEEEEEEEec Q lcl|NC_016164. 222 EPLYRSAVVADVARAKEDPEVVEFTFSSEQPVERYFGMEVLSHDPDAMNMSRLNSGAAPWLWNHNAEVVLGVVERAWMGD 301 (836) Q Consensus 222 ~~~~~~~~~~~~~~~d~~~rt~~~~~~~~~~v~~~~~~e~l~~~~~a~~~~~~~~~~~~lL~~H~~~~~iG~v~~~~~~e 301 (836) |.|+|.+..+.++++|++.|+|++++|+++ ++|+ |+.| .|++++++ +.+||||+||+++|||+|.. . ++ T Consensus 1 m~~~~~~~~~~~k~~~~~~~~~~g~as~~~-~d~~-gd~i---~~~~~~~~----~~~~~l~~H~~~~~iG~~~~-~-~~ 69 (645) T protein:vir:93 1 MTLKRACSLLTVKSFSEDERVITGIASTPS-PDRD-GDIL---EPEGAEFG----SALPFLWQHDHSRPVGQCTV-R-RV 69 (645) T ss_pred CcccceeceeeEEeeecCceEEEEEEecCC-cccc-Ccee---chhhhccc----CCceeeeccCCCCceeEEEE-E-ec Confidence 889999999999999999999999999977 5554 4443 48888754 35899999999999999974 3 23 Q ss_pred CCEEEEEEEEcCCc----ccccHHHHHHHHHHhcCccceeeeeeEeeccccccCCCCeEEEEEEEEEEEEEEeccCccch Q lcl|NC_016164. 302 DRRGRVRTRWSPNT----KIEGSEEYKRRQDWESGTIRNVSFMYSIDAPLDLTSREGMALVTAFTPMEVSAVSIPADHTV 377 (836) Q Consensus 302 ~~~~~a~~~f~~~~----~~~~~~~~~~~~~v~~G~l~~~SiG~~v~~~~~~~~~~~~~~~~~~~l~EiS~V~~pA~~~a 377 (836) +.+|+++.++.... ....++++++|++||+|.|++|||||+|++|++.+. +.+++++|+|||||+|++||||+| T Consensus 70 ~~gl~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~G~~~~~SiG~~~~~~~~~~~--~~~~i~~~~l~EiS~V~~pAn~~a 147 (645) T protein:vir:93 70 SEGLEITATLAKPVPDMPSQLAARLDEAWAAIKTGLVRGLSVGFRPHEYTFLDG--GGLHFLRWELMEVSAVTVPANAEC 147 (645) T ss_pred CCceEEEEEecccccccccchHHHHHHHHHHHhcCcccceeeeeEEeeeeeecC--CCeEEEEEEEEEEeeeccCCCCcc Confidence 45588888775432 123468999999999999999999999999886553 347899999999999999999999 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhh---hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhh Q lcl|NC_016164. 378 GQGRKATSSSGPPGAAAATVAPLS---HNDNNHMDSSTIDMEAVRAQAAADERSRVASITSLCREHKADDLAQGLIESGA 454 (836) Q Consensus 378 ~v~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~ei~al~~~~~l~e~a~eliee~~ 454 (836) .|...+.................. ..............+...............+..++.. .++....+..+++. T Consensus 148 ~v~~~ks~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~e~i~~l~~~ra~~~~--~~~~l~~~a~~~g~ 225 (645) T protein:vir:93 148 TIRTIKSYDRQFSAASGNRKPVVKIASSAGAAAQSTTVFHKEKTIMNIGEQIKSFENKRAALAA--SLEEVMTKAAEEGR 225 (645) T ss_pred hhhhhhhccchhhhhhhhhcchhhhhhhhcchhhccccccccccccchhhhhhhhhHHHHHHHH--HhhhhhhhHhhhcc Confidence 997654322211111111000000 0000000000000000000000000000000000000 01112222222222 Q ss_pred hHHHHHHHHHHHhhhhhhhHHHHHHHHhh-------hhhhHHHHhhh--------hhhhhhhhHHHHHHHhhhhhhhhhh Q lcl|NC_016164. 455 SEADAMRSVLSEIAKRPAAQPATPAAPVR-------SAQPIAAGGGS--------ADIGLTDKEARSFSFVRAIRAQMMP 519 (836) Q Consensus 455 t~~e~~~~~l~~l~~~~~a~~~~~~~~~~-------~~~~~~~~~~~--------~~~~~~~~~~~~~~~~~a~~a~~~~ 519 (836) .+.......++.+..+.........+... ...+....... .......+......+.+..++.... T Consensus 226 ~l~aee~~~~d~l~aei~~l~~~i~r~e~~e~~~a~~a~pv~~~~~~~~~~~~~~~~~~~~~~~~kg~~f~~~~~al~~~ 305 (645) T protein:vir:93 226 TLDVEEEEHYDNTAAEIRQVDAHLKRLRELEAGKAATAQPVKQAGNGNVAAVASAPVIRVEQKLDKGIGFARFAKSLAAA 305 (645) T ss_pred ccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccchhhhhhhhhHHHHHHHHHhc Confidence 22222222222222222222111111100 00000000000 0000000011111111111111000 Q ss_pred hhhhhhhhhhhhHHHHHHHHHHhhhh-hhhhhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhccee Q lcl|NC_016164. 520 GDRAAFEAAAFEREVSEATAQRMGVT-PRGILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTM 598 (836) Q Consensus 520 ~~~~~~~~~~~~~~~a~~~~~~~g~~-~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~ 598 (836) ......+.+.+++.+.. ........... .+..+.+...+|++++|+.+.+.|++.+++.+++++++++. T Consensus 306 ---------~g~~~~a~e~a~~~~~~~~~~~~~~~~a~-~~~~~~~~~~~Gg~~vp~~~~~~ii~~l~~~svv~~l~~~~ 375 (645) T protein:vir:93 306 ---------KGVRSEALEVARRQYPDDSRLHHVLKSAV-GAGTTTDPQWAGSLSEYQEYAQDFIDYLRPQTIIGRFGQGG 375 (645) T ss_pred ---------ccchhHHHHHHHhhcccchhhhhhhhhhh-hccccccccccCCccCchhhHHHHHHhhhhhhhHHhhcccc Confidence 00011112222222111 11111111111 12233344556889999999999999999999999998776 Q ss_pred eecC---CceEEEEEecCCceeeeeccCcccccccccceeEEeeeeeeeeeehhHHHHHhcchhHHHHHHHHHHHHHHHH Q lcl|NC_016164. 599 LTGL---QGPVAIPRQTGAATAYWVAEGGDPTESQPSVDQVALVAKTLGAYTEFSRRLMLQSSIDVEQMVRTELATVIAL 675 (836) Q Consensus 599 ~~~~---~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~it~~~~t~~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~ 675 (836) ++.. .+.+++|+.++++.++|++||+.+++++++|++++++++|++++++||+|||.|+.++++++|...|++++++ T Consensus 376 ~~~~~~~~~~~~ip~~t~~~~a~wv~Eg~~~~~s~~~f~~v~l~~~kla~~~~iS~ell~ds~~~~~~~i~~~l~~aia~ 455 (645) T protein:vir:93 376 IPALRQVPFNIRVHAQVSGGAAGWVGEGKTKPLTKFDFESITFSHAKVSAIAVLTEELIRFSSPAADALVRNALAEAVVA 455 (645) T ss_pred ccccccccCceeeeeeecCcceEEeccCccccccccceeEEEEeeEEEEEeehhHHHHHhhchHHHHHHHHHHHHHHHHH Confidence 6553 3578999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHhhcCCc---ccccccccccccccccccccchhHHHHHHHHHHHhhhccccCccEEEecHHHHHHHHHHhhccC Q lcl|NC_016164. 676 EIDRAALYGLGSN---SQPEGLKFVTGINTENFGATNPTYVELVSMESKVAADNADIGAMSYLTNSTLYGGFKTTEKATS 752 (836) Q Consensus 676 ~~d~~il~G~Gt~---~~p~Gi~~~~~~~~~t~aa~~~t~~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~~g 752 (836) ++|.+||+|+|++ ..|.|+++... +..++..+..++.+++..+..++....+++|+|||.++..|++++|++| T Consensus 456 ~~d~a~l~g~g~~~~~~~p~gi~~~~~----~~~~~~~~~~d~~~~~~~~~~a~~~~~~a~~vmn~~~~~~L~~lkd~~G 531 (645) T protein:vir:93 456 RLDTDFVDPKKAAVADVSPASITHDVK----GTASSGNPDADAEAAFGQFVAANLQPTGAVWLMSSTNALALSMRKNALG 531 (645) T ss_pred HHHHHhhcCCCcccCCccccceecccc----ccccccchHHHHHHHHHHHHhcCCCccccEEEEcHHHHHHHHhccccCC Confidence 9999999998753 46888865321 1122233567889999888888777778899999999999999999999 Q ss_pred cccccc--CCCCeecceeeEeeCccccceEEEEehhceEEEeecceEEEEeccc----------------------cccc Q lcl|NC_016164. 753 TAQFVL--EPGGTVNGYNVVRSNQVANGDVFFGVWNQMIMGMWGALDIQVNPYA----------------------LDKS 808 (836) Q Consensus 753 ~~~~~~--~~~~~l~G~pVv~s~~~~~~~i~~gD~s~~~i~~~~~l~i~~~~~~----------------------~~~~ 808 (836) +++|.. ..+++|+|+||++++++|+ .++||||+.+.+++++++.+..+.+. .|++ T Consensus 532 ~~~~~~~~~~~~tL~G~PV~~s~~vp~-~~~~gd~s~~~ig~~~~v~i~~s~~a~~~~~~~~~~~~~~~~~~~~v~lf~~ 610 (645) T protein:vir:93 532 QKEYPDMTLLGGSFQGLPVIVSQYVGD-QLVLVNAPDIYLADDGGVAVDMSREASLEMQSEPTGDSTTPSPVELVSMFQT 610 (645) T ss_pred ceeecCCCCCCceeeceeeEEeccCCc-ceeEeccccEEEEEecceEEEeecceeEEEeecccccccccccccchhHhhc Confidence 988732 3467999999999999986 47899999999999999988776542 3889 Q ss_pred CcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 809 GSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 809 ~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) |++.||+++|+|+++++|+||++++.+= T Consensus 611 d~vaira~~r~d~~~~~p~a~~~lt~~~ 638 (645) T protein:vir:93 611 GSVAIRAERWINWRRRRTAAVAVITGVN 638 (645) T ss_pred CceEEEEEEEEcceeeCccceEEEeccc Confidence 9999999999999999999999999665 No 3 >protein:vir:97397 Length: 517 # NCBI annotation: major capsid protein # Family: family:all:11745 # MgeID: mge:1675 # MgeName: Q54 # Cross-refs: genbank:acc:YP_762590;genbank:gi:115304291;genbank:GeneID:5130600 Probab=100.00 E-value=4.6e-66 Score=378.80 Aligned_cols=505 Identities=10% Similarity=0.056 Sum_probs=309.5 Q ss_pred hhcccccceEEEEEecCcccccccCcEEEeccccccchhhhcCCCceEeecCCCCCcceEEEEEEEecCCEEEEEEEEcC Q lcl|NC_016164. 234 ARAKEDPEVVEFTFSSEQPVERYFGMEVLSHDPDAMNMSRLNSGAAPWLWNHNAEVVLGVVERAWMGDDRRGRVRTRWSP 313 (836) Q Consensus 234 ~~~d~~~rt~~~~~~~~~~v~~~~~~e~l~~~~~a~~~~~~~~~~~~lL~~H~~~~~iG~v~~~~~~e~~~~~a~~~f~~ 313 (836) -+.+.+.++|+++++....+++++ +. +.|+||+.+...+.++||||+||+++|||++.. ..++| +|+++++|++ T Consensus 1 ~~~~~~~~~~~g~a~~~~~~d~~~-~~---~~~gaf~~~~~~~~~~~~l~~Hd~~~~ig~~~~-~~~~~-Gl~~~~~~~~ 74 (517) T protein:vir:97 1 MSGTFKDGVLIGKLVDYGSIDSYN-TV---FEPGAFDEYVGSEQTFNLDYRHDMQDKLAKFKV-IGRED-GIYIEAKPNN 74 (517) T ss_pred CccccCceEEEEEEEecCCCCCCC-ce---EccchHHHHHhcCCCeEEeecCCCCCceEEEEE-EEecC-ceEEEEeeCc Confidence 334567789999999987776544 43 459999887767778999999999999999864 44566 5999999864 Q ss_pred CcccccHHHHHHHHHHhcCccceeeeeeEeeccccccCCCCeEEEEEEEEEEEEEEeccCccchhhhhhhhhhhhhhhhh Q lcl|NC_016164. 314 NTKIEGSEEYKRRQDWESGTIRNVSFMYSIDAPLDLTSREGMALVTAFTPMEVSAVSIPADHTVGQGRKATSSSGPPGAA 393 (836) Q Consensus 314 ~~~~~~~~~~~~~~~v~~G~l~~~SiG~~v~~~~~~~~~~~~~~~~~~~l~EiS~V~~pA~~~a~v~~~~~~~~~~~~~~ 393 (836) ++.++++|++|++|. +|||||++++.. ..++++++++++|+|||+|++|||+++.|...+.......... T Consensus 75 -----~~~~~~~~~~~~~g~--~~S~gf~~~~~~---~~~~~~~~~~~~l~EvS~v~~pa~~~a~I~~vke~~~~e~~~~ 144 (517) T protein:vir:97 75 -----DIAYKRMKEAIDKGA--GLSVTFQPVEAS---EVDGVAYYKKCILAGGALTPNPSNKNAVVTYFREEKKKEENKM 144 (517) T ss_pred -----hHHHHHHHHHHHcCC--ceEEEEEeeccc---CCCCceEEEEEeeeeeeecchhhhhhhhhhhhhhhhhhhhhhh Confidence 467999999999994 999999998753 2345678899999999999999999999864432211100000 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhh Q lcl|NC_016164. 394 AATVAPLSHNDNNHMDSSTIDMEAVRAQAAADERSRVASITSLCREHKADDLAQGLIESGASEADAMRSVLSEIAKRPAA 473 (836) Q Consensus 394 ~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~ei~al~~~~~l~e~a~eliee~~t~~e~~~~~l~~l~~~~~a 473 (836) .. ........ . ...... .++..++.+......+........+..+... T Consensus 145 ~~-------------------~~a~~ee~-~---e~~~k~---------~el~a~l~~~~~~~~~~~~e~~~~l~a~~~~ 192 (517) T protein:vir:97 145 TF-------------------DQNLMQEL-L---DAKKLA---------ADLNAKLKERENGGDNAALKTVSELAANLMK 192 (517) T ss_pred hh-------------------hhhhhhhh-h---hhhhhH---------HHHHHHHHHHHHHHHHHHHhhhhhhhhhHHH Confidence 00 00000000 0 000000 0000000000000000000000000000000 Q ss_pred HHHHHHHHhhhhhhHHHHhhhhhhhhhhhHHHHHHHhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhhhh Q lcl|NC_016164. 474 QPATPAAPVRSAQPIAAGGGSADIGLTDKEARSFSFVRAIRAQMMPGDRAAFEAAAFEREVSEATAQRMGVTPRGILAPN 553 (836) Q Consensus 474 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~~~~~a~~~~~~~g~~~~g~~~~~ 553 (836) .... ... .... ......... ..... ... ......+..... T Consensus 193 ~~~~-------~~~---------~~~~--------------~~~~~~~~~-~~~~~--~~~--~~~~~~~~~~~~----- 232 (517) T protein:vir:97 193 QRES-------EKI---------LGVE--------------ALKVTPEAT-EFLKT--REA--EVAYMSASLTKD----- 232 (517) T ss_pred HHHh-------hhh---------cccc--------------cccccchhh-HHHHH--HHH--HHHHHHhccccc----- Confidence 0000 000 0000 000000000 00000 000 000000000000 Q ss_pred hhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEecCCceeeeeccCcccccccccc Q lcl|NC_016164. 554 DVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQTGAATAYWVAEGGDPTESQPSV 633 (836) Q Consensus 554 ~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~ 633 (836) ..............++++.|..+...+...+...+++.++.+ ..+ .....++.......+.|+.||+.+|+++++| T Consensus 233 -~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~i~~~~~-~~~--i~~~~~~~~~~~~~a~~~~eG~~kp~s~~tf 308 (517) T protein:vir:97 233 -PKAAWTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIR-HEN--LPTLVVGGDNALTQGTGHTTGTDKTESNITL 308 (517) T ss_pred -ccceeeeecccccccccccchHHHHHHHHhhhhhccceeeee-ecc--ccceeeecccccceeeeeecCCcccccccce Confidence 000000111122345677788777777777777777766532 211 2345566666677788999999999999999 Q ss_pred eeEEeeeeeeeeeehhHHHHHhcchhH----HHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccccccccccc Q lcl|NC_016164. 634 DQVALVAKTLGAYTEFSRRLMLQSSID----VEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTENFGATN 709 (836) Q Consensus 634 ~~it~~~~t~~~~i~ISrelL~ds~~~----l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~aa~~ 709 (836) +++++.++++++++.+|+++|.|+.++ +++||.++|.++++++++.+||+|+|++.++.|+++..+.........+ T Consensus 309 ~~~~~~~~~ia~~~~~S~qll~Ds~~dd~~~l~s~i~~~l~~~l~~~ee~a~l~GdGtg~~~~gi~~~a~~~~~~~~~~~ 388 (517) T protein:vir:97 309 QTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGVSETQIYPVVGDAWATNVTGT 388 (517) T ss_pred eeEEeeHhhhhhhhhhhHHHHHHhhhccHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCccccccccccccccccccccc Confidence 999999999999999999999976655 9999999999999999999999999998778888765443222222222 Q ss_pred hhHHHHHHHHHHHhhhccccCccEEEecHHHHHHHHHHhhccCcccccc----CCCCeecceeeEeeCccccceEEEEeh Q lcl|NC_016164. 710 PTYVELVSMESKVAADNADIGAMSYLTNSTLYGGFKTTEKATSTAQFVL----EPGGTVNGYNVVRSNQVANGDVFFGVW 785 (836) Q Consensus 710 ~t~~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~~g~~~~~~----~~~~~l~G~pVv~s~~~~~~~i~~gD~ 785 (836) .+..+ ++..+..++....++.|+|||.+|..|+++||++|+|+|.. +++.+++|..-+. +.++.+...++++ T Consensus 389 ~~~~d---~i~~l~~a~~~a~~a~~vmn~~t~~~I~klKD~~G~Yl~~~~~~~~~~~~l~G~~~~~-~~~~~~~~~~~~~ 464 (517) T protein:vir:97 389 TNIQE---LLEKLSVATPKAADSTLVIHRNDLAAIRFLKDKNGNYVFPVGVSNQTIATHFGFNRLV-QSVAVDEKTAVSL 464 (517) T ss_pred chHHH---HHHHHHHHhhhccCCEEEECHHHHHHHHHhhcCCCCeeccCcCCcccccccCCccccc-cccccCceeEeec Confidence 33333 44444444444456789999999999999999999998743 2335677743222 3444555666778 Q ss_pred hceEEEeecceEEEEecccccccCcEEEEEEEEeccEEEcccceEEEe--ecC Q lcl|NC_016164. 786 NQMIMGMWGALDIQVNPYALDKSGSVRVTALQDVDVAVRHPEAFCRGN--DNL 836 (836) Q Consensus 786 s~~~i~~~~~l~i~~~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~--~A~ 836 (836) +.|.++.+.++....+ .++.+|+..|+.++|+++.|+.|++|+++. ..+ T Consensus 465 ~~y~i~~~~g~~~~~~--fd~~~n~~~f~~~~~~~g~i~~~~r~a~~~~~p~~ 515 (517) T protein:vir:97 465 SGYVTNGSRGMEFEQG--TILVENNKEYLFEMPISGSLEYKGTTAYGTYTPPV 515 (517) T ss_pred cccEEEeecceeeeee--eecccCceeEeeeeeeccccccccceEEEEEcCCC Confidence 8898888888765443 345678999999999999999999988644 334 No 4 >protein:vir:1433 Length: 435 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536362;genbank:gi:17975167;genbank:GeneID:929171 Probab=100.00 E-value=4.3e-59 Score=340.57 Aligned_cols=402 Identities=18% Similarity=0.226 Sum_probs=257.1 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhh-hhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhHHHHHHHHhhhhhhH Q lcl|NC_016164. 410 SSTIDMEAVRAQAAADERSRVASITSLCREH-KADDLAQGLIESGASEADAMRSVLSEIAKRPAAQPATPAAPVRSAQPI 488 (836) Q Consensus 410 ~~~~~~e~~~~~~~~~~~~~~~ei~al~~~~-~l~e~a~eliee~~t~~e~~~~~l~~l~~~~~a~~~~~~~~~~~~~~~ 488 (836) ++..+..+.+.... .+..++.... +.+. +..+.....+.....++.+..+.................. T Consensus 1 M~i~eL~e~r~~~~-------~~~~~l~~~~~e~~~----lt~ee~~~~~~l~~ei~~l~~~I~~~e~~~~~~~~~~~~~ 69 (435) T protein:vir:14 1 MNVNELRRERAAVN-------QRVQALAQIEVGGTA----LSVEQQAEFDQLSSKFSELTAQIERAEAAERMAAAAAVPV 69 (435) T ss_pred CCHHHHHHHHHHHH-------HHHHHHHHHHhccCC----CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccc Confidence 11111111111111 1111111100 0000 0000011111111111111111111110000000000000 Q ss_pred HHHhhhhhhhhhhhHHHHHHHhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhh--hh--hhhhhhhhhhhhhhhhcc Q lcl|NC_016164. 489 AAGGGSADIGLTDKEARSFSFVRAIRAQMMPGDRAAFEAAAFEREVSEATAQRMGV--TP--RGILAPNDVLHRDLVVDT 564 (836) Q Consensus 489 ~~~~~~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~~~~~a~~~~~~~g~--~~--~g~~~~~~~~~~a~~~~~ 564 (836) . ....... ............ ...............+....+. .. ...............+.. T Consensus 70 ~----~~~~~~~------~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 135 (435) T protein:vir:14 70 D----PNPTAVA------APAAAPVHAQPK----ALEVKGAKMARMVRALAAARGDAQLASKLAIERGFGEEVAMSLNTL 135 (435) T ss_pred c----chhhhhh------hccccccccccc----hhhhhHHHHHHHHHHHHhhcchhhHHHHHHHhhhhhhhhhhhcccC Confidence 0 0000000 000000000000 0000000000000000000000 00 000000001112223344 Q ss_pred cccccccccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEecCCceeeeeccCcccccccccceeEEeeeeeee Q lcl|NC_016164. 565 ASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQTGAATAYWVAEGGDPTESQPSVDQVALVAKTLG 644 (836) Q Consensus 565 ~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~it~~~~t~~ 644 (836) +...|++++|+.+...|++.+++.+++++++++.+++.++.+++|+.++++.+.|++|++.+++++++|+++++.+++++ T Consensus 136 t~~~gg~~vP~~~~~~ii~~l~~~~~i~~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~f~~i~~~~~k~~ 215 (435) T protein:vir:14 136 SPGAGGVLVPENLSSEVIELLRPKSVVRKLGARTLPLSNGNITIPRLKGGAIVGYIGADTDIPTTQQQFDDLKLTAKKMA 215 (435) T ss_pred CcCCCccccchhHHHHHHHHHhhhchhhhhcceeeecCCCceEEEEEeCCcceeeeccCccccccccceeEEEeeeEEEE Confidence 45567788999999999999999999999988888888899999999999999999999999999999999999999999 Q ss_pred eeehhHHHHHhcch--hHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccccccccc----cchhHHHHHHH Q lcl|NC_016164. 645 AYTEFSRRLMLQSS--IDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTENFGA----TNPTYVELVSM 718 (836) Q Consensus 645 ~~i~ISrelL~ds~--~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~aa----~~~t~~~l~~a 718 (836) ++++||+++|.|+. ++++++|...|+++++++++.+||+|+|++++|.||++......+...+ ......++.++ T Consensus 216 ~~~~iS~ell~ds~~~~~l~~~i~~~l~~ai~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~l 295 (435) T protein:vir:14 216 ALVPIANDLIKYAGVNPNVDQIVVGDLTAAIGAREDKAFIRDDGTANTPKGLRFWALPSNVITASDASTLQKIETDLGKV 295 (435) T ss_pred EeehhhHHHHHhhccCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccceeecccccceeccccccchhhHHHHHHHH Confidence 99999999999984 5799999999999999999999999999999999999876554443322 22345678888 Q ss_pred HHHHhhhccccCccEEEecHHHHHHHHHHhhccCccccccCCCCeecceeeEeeCccccc--------eEEEEehhceEE Q lcl|NC_016164. 719 ESKVAADNADIGAMSYLTNSTLYGGFKTTEKATSTAQFVLEPGGTVNGYNVVRSNQVANG--------DVFFGVWNQMIM 790 (836) Q Consensus 719 ~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~~g~~~~~~~~~~~l~G~pVv~s~~~~~~--------~i~~gD~s~~~i 790 (836) +..+...+.+..+++|+|||.+|..|+.++|++|+|+|....+++|+|+||++++.+|.+ .++||||++|.+ T Consensus 296 ~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~g~l~G~Pv~~~~~~p~~~~~~~~~~~i~~gd~s~~~i 375 (435) T protein:vir:14 296 ILALENADANLTQPGWIMAPRTFRFLEGLRDGNGNKVYPELANGMLKGYPVGKTTQVPINLGETGKESEIYFTDFGDVFI 375 (435) T ss_pred HHHhhhccccccCCEEEEcHHHHHHHHHhhccCCceeccCCCCCeeecceeEeeccccccccCCCccceEEEeecccEEE Confidence 888887776777889999999999999999999999987667789999999999999863 589999999999 Q ss_pred EeecceEEEEeccc-----------ccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 791 GMWGALDIQVNPYA-----------LDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 791 ~~~~~l~i~~~~~~-----------~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) ++|+++++.++++. +|.+|++.||+++|+|+++++|+||++++.+= T Consensus 376 ~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~ 432 (435) T protein:vir:14 376 GEEETLEIDYSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFGPRHVESIAVLAGVA 432 (435) T ss_pred EEecccEEEEeccccccccccchhhhhhcChhheeeeeeeCceeecccceEEEecCC Confidence 99999999999874 48899999999999999999999999999777 No 5 >protein:vir:80376 Length: 435 # NCBI annotation: gp6, major capsid head protein # Family: family:all:21 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111085;genbank:gi:134288639;genbank:GeneID:4960624 Probab=100.00 E-value=1.3e-58 Score=337.85 Aligned_cols=403 Identities=18% Similarity=0.227 Sum_probs=256.5 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhHHHHHHHHhhhhhhHH Q lcl|NC_016164. 410 SSTIDMEAVRAQAAADERSRVASITSLCREHKADDLAQGLIESGASEADAMRSVLSEIAKRPAAQPATPAAPVRSAQPIA 489 (836) Q Consensus 410 ~~~~~~e~~~~~~~~~~~~~~~ei~al~~~~~l~e~a~eliee~~t~~e~~~~~l~~l~~~~~a~~~~~~~~~~~~~~~~ 489 (836) ++..+..+.+.... .+..++..... + ...+..+.....+.....++.+..+................... T Consensus 1 M~l~eL~~~r~~~~-------~~~~~l~~~~~-e--~~~l~~ee~~~~~~l~~ei~~l~~~i~~~e~~e~~~~~~~~~~~ 70 (435) T protein:vir:80 1 MNVNELRRERAAVN-------QRVQALAQIEV-G--GTALSVEQQAEFDQLSSKFNELTAQIERAEAAERMAAAAAVPVD 70 (435) T ss_pred CCHHHHHHHHHHHH-------HHHHHHHHHHh-c--cCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccc Confidence 11111111111110 01111110000 0 00000000000011111111111111111100000000000000 Q ss_pred HHhhhhhhhhhhhHHHHHHHhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhh--h--hhhhhhhhhhhhhhccc Q lcl|NC_016164. 490 AGGGSADIGLTDKEARSFSFVRAIRAQMMPGDRAAFEAAAFEREVSEATAQRMGVTPR--G--ILAPNDVLHRDLVVDTA 565 (836) Q Consensus 490 ~~~~~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~~~~~a~~~~~~~g~~~~--g--~~~~~~~~~~a~~~~~~ 565 (836) ....... ........... ...............++....+.... . .............+.++ T Consensus 71 ~~~~~~~----------~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 136 (435) T protein:vir:80 71 PNPAAVT----------ASAAAPVYAQP----KAPEVKGAKMARMVRALAAARGDAQLASKLAIERGFGEEVAMSLNTLS 136 (435) T ss_pred chhhhhc----------ccccccccccc----chhhhhHHHHHHHHHHHHhccchhHHHHHHHHhhhhhhhhhhhhcccC Confidence 0000000 00000000000 00000000000000000000000000 0 00000111122233445 Q ss_pred ccccccccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEecCCceeeeeccCcccccccccceeEEeeeeeeee Q lcl|NC_016164. 566 SAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQTGAATAYWVAEGGDPTESQPSVDQVALVAKTLGA 645 (836) Q Consensus 566 ~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~it~~~~t~~~ 645 (836) ...|++++|+.+.+.|++.+++.++++++.++++++.++.+.+|+.++++.+.|++|++.+++++++|+++++.++++++ T Consensus 137 ~~~gg~lvP~~~~~~ii~~l~~~~~i~~~~~~~v~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~f~~i~~~~~k~~~ 216 (435) T protein:vir:80 137 PGAGGVLVPENLSSEVIELLRPKSVVRKLGARTLPLSNGNITIPRLKGGAIVGYIGADTDIPTTQQQFDDLKLTAKKMAA 216 (435) T ss_pred CCCCccccchhHHHHHHHHHhhhchhhhccceeeecCCCceEEEEEeCCcceeeeccCccccccccceeeEEEeeEEEEE Confidence 55678899999999999999999999999888889989999999999999999999999999999999999999999999 Q ss_pred eehhHHHHHhcch--hHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccccccccccc----hhHHHHHHHH Q lcl|NC_016164. 646 YTEFSRRLMLQSS--IDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTENFGATN----PTYVELVSME 719 (836) Q Consensus 646 ~i~ISrelL~ds~--~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~aa~~----~t~~~l~~a~ 719 (836) +++||+++|.|+. ++++++|.++|+++++++++.+||+|+|++++|+||++..........+.. ..+.++.+++ T Consensus 217 ~~~is~ell~ds~~~~~l~~~i~~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~d~~~~~ 296 (435) T protein:vir:80 217 LVPIANDLIKYAGVNPNVDQIVVGDLTAAIGAREDKAFIRDDGTANTPKGLRFWALPGNVITASDGSTLQKIETDLGKAI 296 (435) T ss_pred eehhhHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCcccceeecccccceeecccccchhhHHHHHHHHH Confidence 9999999999984 479999999999999999999999999999999999987765544333322 2356788888 Q ss_pred HHHhhhccccCccEEEecHHHHHHHHHHhhccCccccccCCCCeecceeeEeeCccccc--------eEEEEehhceEEE Q lcl|NC_016164. 720 SKVAADNADIGAMSYLTNSTLYGGFKTTEKATSTAQFVLEPGGTVNGYNVVRSNQVANG--------DVFFGVWNQMIMG 791 (836) Q Consensus 720 ~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~~g~~~~~~~~~~~l~G~pVv~s~~~~~~--------~i~~gD~s~~~i~ 791 (836) ..+..++.+..+++|+|||.++..|+.++|++|+|.|....+++|+|+||++++.+|.. .++||||++|+++ T Consensus 297 ~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~G~~l~~~~~~~~l~G~pv~~~~~~p~~~~~~~~~~~i~~gd~s~~~i~ 376 (435) T protein:vir:80 297 LALENADANLTQPGWIMAPRTFRFLEGLRDGNGNKVYPELANGMLKGYPVGKTTQVPINLGEAGKESEIYFTDFGDVFIG 376 (435) T ss_pred HHhhccccccccCEEEEcHHHHHHHHhhhccCCceeccCCCCCeEeeeeeEEeccccccccCCCCcceEEEEEcccEEEE Confidence 88887776677889999999999999999999999987667789999999999999853 5899999999999 Q ss_pred eecceEEEEeccc-----------ccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 792 MWGALDIQVNPYA-----------LDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 792 ~~~~l~i~~~~~~-----------~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) +++++++.++++. .|.+|++.||++.|+|+++++|+||++++.+- T Consensus 377 ~~~~~~i~~~~~~~~~~~~~~~~~~f~~n~~~~r~~~r~d~~~~~~~a~~~l~~~~ 432 (435) T protein:vir:80 377 EEETLEIDYSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFGPRHVESIAVLSGVA 432 (435) T ss_pred eecceEEEEeccccccccccchhhhhhcCcceeeeeeeeCcEeecccceEEEeccC Confidence 9999999999875 38899999999999999999999999999766 No 6 >protein:vir:105038 Length: 428 # NCBI annotation: major capsid head protein precursor # Family: family:all:21 # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006586;genbank:gi:46402092;genbank:GeneID:2777903 Probab=100.00 E-value=2.3e-58 Score=336.53 Aligned_cols=394 Identities=20% Similarity=0.292 Sum_probs=248.3 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhHHHHHHHHhhhhhh Q lcl|NC_016164. 408 MDSSTIDMEAVRAQAAADERSRVASITSLCREHKADDLAQGLIESGASEADAMRSVLSEIAKRPAAQPATPAAPVRSAQP 487 (836) Q Consensus 408 ~~~~~~~~e~~~~~~~~~~~~~~~ei~al~~~~~l~e~a~eliee~~t~~e~~~~~l~~l~~~~~a~~~~~~~~~~~~~~ 487 (836) |.. ..+..+.+... ..+..++..... ++ ..+..+.....+.....++.+..+................. T Consensus 1 M~k-l~~L~e~r~~l-------~~~~~~l~~~~~-e~--~~lt~ee~~~~~~l~~e~~~l~~~i~~~e~~e~~~~~~~~~ 69 (428) T protein:vir:10 1 MPQ-IEELRRQRAGI-------NEQIQALATIEA-TN--GTLTAEQLTEFAGLQQQFTDISAKMDRMEATERAAALVAKP 69 (428) T ss_pred Cch-HHHHHHHHHHH-------HHHHHHHHHHHh-cc--CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh Confidence 000 00000000000 011111111000 00 00000000001111111112211111111100000000000 Q ss_pred HHHHhhh-hh-hhhhhhHHHHHHHhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhh-----hhhhhhhhhhhhh Q lcl|NC_016164. 488 IAAGGGS-AD-IGLTDKEARSFSFVRAIRAQMMPGDRAAFEAAAFEREVSEATAQRMGVTPR-----GILAPNDVLHRDL 560 (836) Q Consensus 488 ~~~~~~~-~~-~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~~~~~a~~~~~~~g~~~~-----g~~~~~~~~~~a~ 560 (836) ....... .. ..............+ .........+.... ........ ... T Consensus 70 ~~~~~~~~~~~~~~~~~~~~~~~~~~----------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~ 125 (428) T protein:vir:10 70 VKATQHGPAVIVKAEPKQYTGAGMTR----------------------MVMSIAAAQGNLQDAAKFASDELNDQS--VSM 125 (428) T ss_pred hhchhhccccccccccchhhhHHHHH----------------------HHHHHHHhhhhHHHHHHHhhhhhhhhh--Hhh Confidence 0000000 00 000000000000000 00000000000000 00000111 111 Q ss_pred hhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEecCCceeeeeccCcccccccccceeEEeee Q lcl|NC_016164. 561 VVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQTGAATAYWVAEGGDPTESQPSVDQVALVA 640 (836) Q Consensus 561 ~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~it~~~ 640 (836) ...+..+.|++++|+.+.+.|++.+++.+++++++++++++.++.+.+|+.++++.+.|++||+.+++++++|+++++.+ T Consensus 126 ~~~~~~~~gg~liP~~~~~~ii~~l~~~~~l~~~~~~~~~~~~g~~~~p~~~~~~~a~~v~Eg~~~~~~~~~f~~i~~~~ 205 (428) T protein:vir:10 126 AISTAAGSGGVLIPQNIHSEVIELLRDRTIVRKLGARSIPLPNGNMSLPRLAGGATASYTGENQDAKVSEARFDDVKLTA 205 (428) T ss_pred hhcccccCCccccchhHHHHHHHHHhhhchhhhhcceeeecCCcceEEEEEeCCcceeeeccCccccccccceeeEEeee Confidence 22333446788899999999999999999999998888888899999999999999999999999999999999999999 Q ss_pred eeeeeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccccc---cccccchhHHHHHH Q lcl|NC_016164. 641 KTLGAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTE---NFGATNPTYVELVS 717 (836) Q Consensus 641 ~t~~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~---t~aa~~~t~~~l~~ 717 (836) ++++++++||+++|.|+.++++++|.+.|+++++++++.+||+|+|++++|+||++.+..... +......+++.+.. T Consensus 206 ~k~~~~v~is~ell~ds~~~l~~~i~~~l~~ai~~~~d~~~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~ 285 (428) T protein:vir:10 206 KTMIAMVPISNALIGRAGFNVEQLVLQDILTAISVREDKAFMRDDGTGDTPIGMKARATQWNRLLPWAADAAVNLDTIDT 285 (428) T ss_pred EEEEEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccccccccccccccHHHHHH Confidence 999999999999999999999999999999999999999999999999999999987654322 11222334444333 Q ss_pred HHHHHh----hhccccCccEEEecHHHHHHHHHHhhccCccccccCCCCeecceeeEeeCccccc--------eEEEEeh Q lcl|NC_016164. 718 MESKVA----ADNADIGAMSYLTNSTLYGGFKTTEKATSTAQFVLEPGGTVNGYNVVRSNQVANG--------DVFFGVW 785 (836) Q Consensus 718 a~~~l~----~~~~~~~~~~~vmnp~~~~~L~~lkd~~g~~~~~~~~~~~l~G~pVv~s~~~~~~--------~i~~gD~ 785 (836) ++..+. ..+.+..+++|+|||.++..|+.++|++|+|+|....+++|+|+||++++++|.+ .++|||| T Consensus 286 ~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~i~~~~~~g~l~G~pv~~~~~~p~~~~~~~~~~~i~~gd~ 365 (428) T protein:vir:10 286 YLDSIILMSMDGNSNMISSGWGMSNRTYMKLFGLRDGNGNKVYPEMAQGMLKGYPIQRTSAIPANLGEGGKESEIYFADF 365 (428) T ss_pred HHHHHHHhhhccccccccCEEEEcHHHHHHHHHhhccCCceeccCCCCCeeeceeeEEeccccccccCCCccceEEEEec Confidence 333222 2233445789999999999999999999999987767789999999999999863 4899999 Q ss_pred hceEEEeecceEEEEecc-----------cccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 786 NQMIMGMWGALDIQVNPY-----------ALDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 786 s~~~i~~~~~l~i~~~~~-----------~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) ++|.+++++++++.++++ ..|..|++.||++.|+|+++.+|+||++++..- T Consensus 366 s~~~i~~~~~i~i~~~~~~~~~~~~~~~~~~f~~~~~~~R~~~r~d~~v~~p~a~~~~t~~~ 427 (428) T protein:vir:10 366 NDVVIGEDGNMKVDFSKEASYIDTDGKLVSAFSRNQSLIRVVTEHDIGFRHPEGLVLGTGVL 427 (428) T ss_pred ceEEEEEecceEEEeecccccccccccccchhhcchhheeeeeeeCceeeccceEEEEeccC Confidence 999999999999999887 358999999999999999999999999999888 No 7 >protein:vir:100135 Length: 418 # NCBI annotation: gp5 # Family: family:all:585 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945035;genbank:gi:38707895;genbank:GeneID:2744182 Probab=100.00 E-value=9.8e-57 Score=327.65 Aligned_cols=406 Identities=12% Similarity=0.081 Sum_probs=248.3 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhh Q lcl|NC_016164. 391 GAAAATVAPLSHNDNNHMDSSTIDMEAVRAQAAADERSRVASITSLCREHKADDLAQGLIESGASEADAMRSVLSEIAKR 470 (836) Q Consensus 391 ~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~ei~al~~~~~l~e~a~eliee~~t~~e~~~~~l~~l~~~ 470 (836) ................ ..+.++..............++.++... .....+.............+.+... T Consensus 1 ~~~~~~~~~~~~~~~~-----~~el~~~~~e~~~~l~~~~~e~~~~~e~------~~~e~~~~~~~~~e~~~~~~~l~~~ 69 (418) T protein:vir:10 1 MSHMNEPRQFGRKSGG-----DSHPEQVLETVTKELKRIGDEVKSAGEK------ALAEAKRAGDLGVETKATVDELLIK 69 (418) T ss_pred CCCchhHHHHHHHhcc-----HHHHHHHHHHHHHHHHHHHHHHHHHHHH------HHHHHHhhhhhhHHHHHHHHHHHHH Confidence 0000000000000000 0000000000000000000011110000 0000000000000000000101000 Q ss_pred hhhHHHHHHHHhhhhhhHHHHhhhhhhhhhhhHHHHHHHhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhhh Q lcl|NC_016164. 471 PAAQPATPAAPVRSAQPIAAGGGSADIGLTDKEARSFSFVRAIRAQMMPGDRAAFEAAAFEREVSEATAQRMGVTPRGIL 550 (836) Q Consensus 471 ~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~~~~~a~~~~~~~g~~~~g~~ 550 (836) ............. .... ..... ....+........ .......+............ T Consensus 70 ~~~l~~~~~~~e~------------------~~~~-~~~~~---~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~ 124 (418) T protein:vir:10 70 QGELQARLLEAEQ------------------KLAR-GGGSA---ELETPKTLGQLVT---ESEEMKGMDGSARKSVRVRV 124 (418) T ss_pred HHHHHHHHHHHHH------------------HHhh-ccccc---ccchhhhhhHHhh---hHHHHHHHHHHHhhhhhhhh Confidence 0000000000000 0000 00000 0000000000000 00000111110000000000 Q ss_pred hhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEecC-CceeeeeccCcccccc Q lcl|NC_016164. 551 APNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQTG-AATAYWVAEGGDPTES 629 (836) Q Consensus 551 ~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~~-~~~a~~v~Eg~~~~~~ 629 (836) .............++.+.++.++|+.+...|++.+++.+++++++ ++++..++.+++|+.++ ++.+.|++|+++++++ T Consensus 125 ~~~~~~~~~~~~~~~~~~~g~lvp~~~~~~ii~~~~~~~~l~~~~-~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~ 203 (418) T protein:vir:10 125 DRKSIMNVPATVGSGVSGSNSLVVADRQAGIIAPPQRKMTIRDLL-MPGQTSSSSIEYTVETGFTNNAAAVAEGAQKPTS 203 (418) T ss_pred HHHHHHHhhhhccCCCCCCccccchhHHHHHHHHHhhhhhHHhhc-ceeeccCCceeEEEEecCCCceeeeccCcccccc Confidence 000111112233344556677889999999999999999999995 45566677889998766 5788999999999999 Q ss_pred cccceeEEeeeeeeeeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccccccccc-c Q lcl|NC_016164. 630 QPSVDQVALVAKTLGAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTENFGA-T 708 (836) Q Consensus 630 ~~~~~~it~~~~t~~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~aa-~ 708 (836) +++|+++++.+++++++++||+++|.++ .+++++|.+.|+++++++++.+||+|+|++.+|.||++.++....+.+. + T Consensus 204 ~~~f~~v~~~~~k~~~~~~is~ell~ds-~~l~~~i~~~l~~a~~~~~d~a~l~G~g~~~~p~Gi~~~~~~~~~~~~~~~ 282 (418) T protein:vir:10 204 DLKFNLKNQPVRTIAHLFKASRQILDDA-PALQSYIDGRARYGLQLTEEGQILKGDGTGANILGILPQASAFMPSITLAN 282 (418) T ss_pred ccceeeEEEeeeeEEEeehhhHHHHHhH-HHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccccccccc Confidence 9999999999999999999999999876 5899999999999999999999999999999999999988766655443 3 Q ss_pred chhHHHHHHHHHHHhhhccccCccEEEecHHHHHHHHHHhhccCcccccc---CCCCeecceeeEeeCccccceEEEEeh Q lcl|NC_016164. 709 NPTYVELVSMESKVAADNADIGAMSYLTNSTLYGGFKTTEKATSTAQFVL---EPGGTVNGYNVVRSNQVANGDVFFGVW 785 (836) Q Consensus 709 ~~t~~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~~g~~~~~~---~~~~~l~G~pVv~s~~~~~~~i~~gD~ 785 (836) ..+++++.+++..+...+ ..+++|+|||.+|..|+.++|.+|+++|.. +.+++|+|+||++++.+|.++++|||| T Consensus 283 ~~~~~~i~~~~~~~~~~~--~~~~~~v~n~~~~~~L~~lkd~~G~~i~~~~~~~~~~~l~G~pV~~~~~~p~~~~~~gd~ 360 (418) T protein:vir:10 283 ATPIDKIRLALLQAVLAE--FPATGIVLNPIDWASIELTKDSQGRYIVGNPVNGTTPRLWNLPVVETQAMTANEFLVGAF 360 (418) T ss_pred cccHHHHHHHHHhhcccc--CCCCEEEEcHHHHHHHHHhhcCCCceeccccccCCCceecceeeEEcCCCCCCcEEEeec Confidence 457899999999887765 356789999999999999999999988742 346789999999999999999999999 Q ss_pred hc-eEEEeecceEEEEeccc--ccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 786 NQ-MIMGMWGALDIQVNPYA--LDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 786 s~-~~i~~~~~l~i~~~~~~--~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) ++ |.+++++++++.++++. +|.+|++.||++.|+|+++++|+||++++.+- T Consensus 361 s~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~~~d~~~~~~~a~~~~~~~~ 414 (418) T protein:vir:10 361 SMAAQIFDRMEIEVLLSTENVDDFEKNMVSIRAEERLALAVYRPESFVTGALVE 414 (418) T ss_pred cceEEEEEecceEEEEecccchhhhcCceEEEEEEeeccEEecccceEEEEecc Confidence 97 77899999999998876 48999999999999999999999999988655 No 8 >protein:vir:95376 Length: 425 # NCBI annotation: phage major capsid protein # Family: family:all:635 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764476;genbank:gi:115334630;genbank:GeneID:5179263 Probab=100.00 E-value=3e-56 Score=324.99 Aligned_cols=409 Identities=16% Similarity=0.167 Sum_probs=250.3 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhh--HHHHHHHHHHHhhhhhh Q lcl|NC_016164. 395 ATVAPLSHNDNNHMDSSTIDMEAVRAQAAADERSRVASITSLCREHKADDLAQGLIESGAS--EADAMRSVLSEIAKRPA 472 (836) Q Consensus 395 ~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~ei~al~~~~~l~e~a~eliee~~t--~~e~~~~~l~~l~~~~~ 472 (836) ...+.. ... .+.+... ....+...+..++... ...+. +.+++..+ ........++.+.+... T Consensus 1 ~~~~~~--~~~-------~el~~~~-~~l~el~~~~~el~~~--~~el~----~~~e~ak~eee~~~l~~ei~~le~e~~ 64 (425) T protein:vir:95 1 MALRQL--MLT-------KKIEQRK-AALDELVKREQELQAK--AAELE----QAIEEAQTEEEVSAVEEEVAKLEDERN 64 (425) T ss_pred CchHHH--HHH-------HHHHHHH-HHHHHHHHHHHHHHHH--HHHHH----HHHHHhhhHHHHHHHHHHHHHHHHHHH Confidence 000000 000 0000000 0000000000000000 00000 00000000 00000000111100000 Q ss_pred hHHHHHHHHhhhhhhHHHHhhhhhhhhhhhHHHHHHHhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhhh Q lcl|NC_016164. 473 AQPATPAAPVRSAQPIAAGGGSADIGLTDKEARSFSFVRAIRAQMMPGDRAAFEAAAFEREVSEATAQRMGVTPRGILAP 552 (836) Q Consensus 473 a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~~~~~a~~~~~~~g~~~~g~~~~ 552 (836) ...................... . ....... .... ........... . ................+. . T Consensus 65 ~l~~~~~~le~~~~~~~~~l~~--~--~~~~~~~-~~~~-----~~~~~~~~~~~-~-~~~~~~~~~~~~~~~~~~---~ 129 (425) T protein:vir:95 65 ELNEKKSKLEGEIAQLEDELEQ--I--NSKQPSN-QSRQ-----KMQGSKGDVVE-M-NRLQVREMLKTGEYYKRS---E 129 (425) T ss_pred HHHHHHHHHHHHHHHHHHHHHH--h--hhhccch-hhhh-----hhhhhhhhHHH-H-HHHHHHHHHhhhhhhhhh---H Confidence 0000000000000000000000 0 0000000 0000 00000000000 0 000000011110000000 0 Q ss_pred hhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEecCCceeeeeccCccccccc-c Q lcl|NC_016164. 553 NDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQTGAATAYWVAEGGDPTESQ-P 631 (836) Q Consensus 553 ~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~-~ 631 (836) ...........+..++|++++|+.+.+.|++.+++.+++++++. +++ .++.+.+|+.++.+.+.|++|++++++++ + T Consensus 130 ~~~~~~~~~~~~~~~~gg~~vP~~~~~~Ii~~l~~~~~i~~~~~-~~~-~~g~~~ip~~~~~~~a~~v~E~~~~~~~~~~ 207 (425) T protein:vir:95 130 VVEFYEKFRNLRAVAGGELTIPEVVVNRIMDIMGDYTTLYPLVD-KIR-VKGTTRILVDTDTSPATWIEQSGALPTGDVG 207 (425) T ss_pred HHHHHHHHHhhcccccCceeccHHHHHHHHHHHHhhhhHHHhhc-eee-cCceeEEEEecCCcccccccccccccccccc Confidence 01111112223344668889999999999999999999999954 444 35788999999999999999999999877 6 Q ss_pred cceeEEeeeeeeeeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCc-cccccccccccc-ccccccccc Q lcl|NC_016164. 632 SVDQVALVAKTLGAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSN-SQPEGLKFVTGI-NTENFGATN 709 (836) Q Consensus 632 ~~~~it~~~~t~~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~-~~p~Gi~~~~~~-~~~t~aa~~ 709 (836) +|+++++++++++++++||+++|.|+.++++++|...|+++++++++.+||+|+|++ ++|.||++.... ...+..++. T Consensus 208 ~f~~i~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~il~G~G~~~~~p~Gil~~~~~~~~~~~~~~~ 287 (425) T protein:vir:95 208 TIASIDFDGFKVGKVTFVDNYLLQDSIINLDDYVTKKIARAIAKALDLAIVKGTGAANKQPLGIIPSLPPENQVTVEADN 287 (425) T ss_pred ccceeeeeheeeeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccceeeccccccccccccccc Confidence 899999999999999999999999999999999999999999999999999999984 689999976433 334445667 Q ss_pred hhHHHHHHHHHHHhhhccccCccEEEecHHHH----HHHHHHhhccCccccc--cCCCCeecceeeEeeCccccceEEEE Q lcl|NC_016164. 710 PTYVELVSMESKVAADNADIGAMSYLTNSTLY----GGFKTTEKATSTAQFV--LEPGGTVNGYNVVRSNQVANGDVFFG 783 (836) Q Consensus 710 ~t~~~l~~a~~~l~~~~~~~~~~~~vmnp~~~----~~L~~lkd~~g~~~~~--~~~~~~l~G~pVv~s~~~~~~~i~~g 783 (836) .+++++.+++..+..++....+++|+||+.++ ..|+.++|++|+|++. ....++|+|+||++++.+|++.++|| T Consensus 288 ~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~l~~l~~~kd~~g~~i~~~~~~~~~~l~G~pvv~~~~~~~~~i~~G 367 (425) T protein:vir:95 288 NLLKNLVKQIGLIDTGDDSVGEIVAVMKRSTYYNRLVEFSIQVDSNGNVVGKLPNLRTPDLLGLRVVFNNFLDDDTVLFG 367 (425) T ss_pred chHHHHHHHHHhhhhhccccCceEEEEeChHHHHHHHHHHhhcCCCCceeeccCCCCCccccceeeEEcCcCCCccEEEE Confidence 88999999999988888777888999999874 3467788999998753 33456899999999999999999999 Q ss_pred ehhceEEEeecceEEEEecccccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 784 VWNQMIMGMWGALDIQVNPYALDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 784 D~s~~~i~~~~~l~i~~~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) ||++|.+++++++++.++++.+|.+|++.||++.|+|+++++|+||++++..= T Consensus 368 d~~~~~~~~~~~~~i~~~~~~~f~~~~~~~~~~~r~d~~~~~~~a~~~~~i~~ 420 (425) T protein:vir:95 368 EFEQYTLVERENITIDSSTHVKFTEDQTAFRGKGRFDGKPVKPEAFVLVTITD 420 (425) T ss_pred ecccEEEEeecceEEEeecccccccCceEEEEEEeeCcEeecccceEEEEecC Confidence 99999999999999999999999999999999999999999999999998666 No 9 >protein:vir:1328 Length: 392 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047927;swissprot:trembl:q9zwv6;genbank:gi:9631145;uniprot:Q9ZWV6;genbank:GeneID:2715889 Probab=100.00 E-value=8.8e-57 Score=327.89 Aligned_cols=383 Identities=15% Similarity=0.117 Sum_probs=242.0 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhHHHHHHHHhhhhhhHH Q lcl|NC_016164. 410 SSTIDMEAVRAQAAADERSRVASITSLCREHKADDLAQGLIESGASEADAMRSVLSEIAKRPAAQPATPAAPVRSAQPIA 489 (836) Q Consensus 410 ~~~~~~e~~~~~~~~~~~~~~~ei~al~~~~~l~e~a~eliee~~t~~e~~~~~l~~l~~~~~a~~~~~~~~~~~~~~~~ 489 (836) +..+...+. .+.......+++++.......+...+ .....+.....++.+..+............... T Consensus 1 m~~~~l~~l----~e~r~~~~~e~~~l~~~~~~~~~~~e----~~~~~~~l~~e~~~l~~~i~~~~e~~~~~~~~~---- 68 (392) T protein:vir:13 1 MDATTLSAN----FEARERATAELRSLTDEFAGKEMTAE----AREKEERLLTAVADFDGRIKRGIDAIKATDAVT---- 68 (392) T ss_pred CCHHHHHHH----HHHHHHHHHHHHHHHHHhhcccccHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---- Confidence 111110000 01111111122222221111111111 111111111111111111110000000000000 Q ss_pred HHhhhhhhhhhhhHHHHHHHhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhccccccc Q lcl|NC_016164. 490 AGGGSADIGLTDKEARSFSFVRAIRAQMMPGDRAAFEAAAFEREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTASAAG 569 (836) Q Consensus 490 ~~~~~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g 569 (836) . . .. ............. ... ...+. +.+.... ....... ......+...+| T Consensus 69 ----~-~--~~--------------~~~~~~~~~~~~~---~~~-~~~~~-r~g~~~~-~~~~~~~--~~~~~~t~~~~g 119 (392) T protein:vir:13 69 ----S-L--LS--------------GLQGSGSGAQRSA---DHD-DDAVL-RAGNLGE-ARSFEFA--PEKRDGTKAGNP 119 (392) T ss_pred ----H-H--hc--------------ccCCcccchhhhh---hHH-HHHHH-hccchhh-hHHHHhh--hhhhcccccCCC Confidence 0 0 00 0000000000000 000 00000 0010000 0000000 011122233345 Q ss_pred ccccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEecCCceeeeeccCcccccccccceeEEeeeeeeeeeehh Q lcl|NC_016164. 570 DLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQTGAATAYWVAEGGDPTESQPSVDQVALVAKTLGAYTEF 649 (836) Q Consensus 570 ~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~it~~~~t~~~~i~I 649 (836) ++++|+.....|.+.+...+++++++..+.+...+.+.+|+.++.+.+.|++|++++|+++++|+++++.+++++++++| T Consensus 120 ~~~~~~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~f~~v~~~~~k~~~~~~i 199 (392) T protein:vir:13 120 NVLSRTLYGQLIAQAVERSAIMRGGASTFTTSDANPMDFTVITGRATAGIVGETAEIPESYPATTQRSMGGFKYGFASVV 199 (392) T ss_pred ccccccchHHHHHHHHhhhhhhhhcceeeecCCCceeEEEEEcCCcceeeecccccccccccceeeEEeeeeeEEeeehh Confidence 55555555444445555556667665444444556799999999999999999999999999999999999999999999 Q ss_pred HHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccccc---cccccchhHHHHHHHHHHHhhhc Q lcl|NC_016164. 650 SRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTE---NFGATNPTYVELVSMESKVAADN 726 (836) Q Consensus 650 SrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~---t~aa~~~t~~~l~~a~~~l~~~~ 726 (836) |+++|.|+.++++++|.+.|+++++++++.+||+|+|+ ++|.||++.....+. ++.++.+++++|.+++..|...+ T Consensus 200 S~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~l~G~Gt-~~p~Gil~~~~~~~~~~~~~~~~~~~~d~l~~~~~~l~~~~ 278 (392) T protein:vir:13 200 SYEFATDQVLDLVGFLVSDAGPAIGDAMGRHFLTGTGT-GQPRGILTDATGANAAFGEADADSKVSDALIDLFHEVPSAY 278 (392) T ss_pred HHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhcccCC-ccccccccccccccccccccccccccHHHHHHHHHhhhhhh Confidence 99999999999999999999999999999999999997 589999987654332 23345678999999999998765 Q ss_pred cccCccEEEecHHHHHHHHHHhhccCcccccc----CCCCeecceeeEeeCccccceEEEEehhceEEEeecceEEEEec Q lcl|NC_016164. 727 ADIGAMSYLTNSTLYGGFKTTEKATSTAQFVL----EPGGTVNGYNVVRSNQVANGDVFFGVWNQMIMGMWGALDIQVNP 802 (836) Q Consensus 727 ~~~~~~~~vmnp~~~~~L~~lkd~~g~~~~~~----~~~~~l~G~pVv~s~~~~~~~i~~gD~s~~~i~~~~~l~i~~~~ 802 (836) ..+++|+|||+++..|+.++|++|+|+|.. +.+++|+|+||++++.+|++.++||||++|.++.++++++..+. T Consensus 279 --~~~a~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~g~~~~l~G~Pv~~~~~~~~~~i~~Gdf~~~~i~~~~~~~i~~~~ 356 (392) T protein:vir:13 279 --RKNAKFVVNDLRAAQMRKLKDANGQYLWQSALTVGAPDTFNGKVVETDDGMPADKVLFADLSKYRVRFAGSLRVDRSV 356 (392) T ss_pred --hcCCEEEEcHHHHHHHHHhhccCCceeecCCcCCCCCceecceeeEEcCCCCCCcEEEeeccceeEEeecceEEEeec Confidence 356899999999999999999999998744 33468999999999999999999999999999999999999999 Q ss_pred ccccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 803 YALDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 803 ~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) +.+|.+|++.||++.|+|+++++|+||++++... T Consensus 357 ~~~~~~~~~~~r~~~r~d~~~~~~~A~~~~~~~~ 390 (392) T protein:vir:13 357 DAKFSTDQIVYRFLQRADGLLVDARGAKVLTVTP 390 (392) T ss_pred cccccCCcEEEEEEEEeccEEecccceEEEEeec Confidence 9999999999999999999999999999888766 No 10 >protein:vir:485 Length: 407 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543092;swissprot:trembl:q8w627;genbank:gi:18249904;uniprot:Q8W627;genbank:GeneID:929693 Probab=100.00 E-value=2.3e-56 Score=325.59 Aligned_cols=375 Identities=14% Similarity=0.157 Sum_probs=246.9 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhHHHHHHHHhhhhhhHHHHhhhhhhhhhhhHHHH Q lcl|NC_016164. 427 RSRVASITSLCREHKADDLAQGLIESGASEADAMRSVLSEIAKRPAAQPATPAAPVRSAQPIAAGGGSADIGLTDKEARS 506 (836) Q Consensus 427 ~~~~~ei~al~~~~~l~e~a~eliee~~t~~e~~~~~l~~l~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 506 (836) .....++.++... +....+++.+......+..+.....+................ . ..... T Consensus 1 l~~~k~l~~~i~e--~~~~~~~~k~~~~~~~~~~e~~~~~l~~~~e~~~~~~~~~e~-------~-------~~~~~--- 61 (407) T protein:vir:48 1 MADVKDVEQVAQE--LQRKFDDFKEKNDKRIDAIEQEKGKLAGEVETLNGKLAELEN-------L-------KSDLE--- 61 (407) T ss_pred CchHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------H-------HHHHH--- Confidence 0001111111000 000000000000000000001111111110000000000000 0 00000 Q ss_pred HHHhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHH Q lcl|NC_016164. 507 FSFVRAIRAQMMPGDRAAFEAAAFEREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLR 586 (836) Q Consensus 507 ~~~~~a~~a~~~~~~~~~~~~~~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~ 586 (836) ........+........ ..+...++..... .+........+......++.+.|++++|+.+.+.|++.++ T Consensus 62 ----~~~~~~~~~~~~~~~~~---~~e~~~a~~~~l~---~g~~~~~~~~e~~a~~~~t~~~gG~~iP~~~~~~I~~~~~ 131 (407) T protein:vir:48 62 ----AELAEVKRPAGGTQNKV---ASEHKEAFIGFMR---KGREDGLRELERKALQVGNDEDGGYAIPEELDRTILTLLK 131 (407) T ss_pred ----HHHHHhhccccccccch---hhHHHHHHHHHHh---ccchhhhhHHHHHhhhcccCCCCcccccHhHHHHHHHHHH Confidence 00000000000000000 0011111111110 1111111122233344445567788899999999999999 Q ss_pred hhhhhhhhcceeeecCCceEEEEEecCCceeeeeccCcccccc-cccceeEEeeeeeeeeeehhHHHHHhcchhHHHHHH Q lcl|NC_016164. 587 NRLALNTLGVTMLTGLQGPVAIPRQTGAATAYWVAEGGDPTES-QPSVDQVALVAKTLGAYTEFSRRLMLQSSIDVEQMV 665 (836) Q Consensus 587 ~~~~l~~l~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~-~~~~~~it~~~~t~~~~i~ISrelL~ds~~~l~~~i 665 (836) ..++++++ +++++..++.+.+|+.++++.+.|++|++.++++ .++|+++++.+++++++++||+|+|.|+.++++++| T Consensus 132 ~~~~l~~~-~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i 210 (407) T protein:vir:48 132 DEVVMRQE-ATVITLGGSDYKKLVNLGGTTSGWVGETDARPETATSKLGLIEPFMGEIYGNPQATQKMLDDAFFNVEDWI 210 (407) T ss_pred hhhhhhhh-ceeeecCCCceEEEEecCCcceeeecccccccccccccceeEEeeeeeeEeehhhHHHHHhcchHHHHHHH Confidence 99999998 5567777889999999999999999999999975 479999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHhhcCCccccccccccccccc-------------ccccccchhHHHHHHHHHHHhhhccccCcc Q lcl|NC_016164. 666 RTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINT-------------ENFGATNPTYVELVSMESKVAADNADIGAM 732 (836) Q Consensus 666 ~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~-------------~t~aa~~~t~~~l~~a~~~l~~~~~~~~~~ 732 (836) .+.|+++++++++.+|++|+|+ ++|.||++...... .+..++.+++++|.++++.|..++. .++ T Consensus 211 ~~~l~~~i~~~~~~a~l~G~G~-~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~i~~l~~~l~~~~~--~~a 287 (407) T protein:vir:48 211 NSELALEFAEQEEIAFTSGDGS-KKPKGFLAYESTDEDDKTRAFGKLQHIASGAASGVTADAIIKLIYTLRKAHR--SGA 287 (407) T ss_pred HHHHHHHHHHHHHhhhhccCCC-CccceeeecccccccccccccccccccccccccccChHHHHHHHHhhchhhh--cCC Confidence 9999999999999999999998 68999996654322 2234456789999999999988764 578 Q ss_pred EEEecHHHHHHHHHHhhccCcccccc----CCCCeecceeeEeeCcccc-----ceEEEEehhc-eEEEeecceEEEEec Q lcl|NC_016164. 733 SYLTNSTLYGGFKTTEKATSTAQFVL----EPGGTVNGYNVVRSNQVAN-----GDVFFGVWNQ-MIMGMWGALDIQVNP 802 (836) Q Consensus 733 ~~vmnp~~~~~L~~lkd~~g~~~~~~----~~~~~l~G~pVv~s~~~~~-----~~i~~gD~s~-~~i~~~~~l~i~~~~ 802 (836) +|+||+.+|..|++++|.+|+|+|.+ +.+++|+|+||++++.+|. ..++||||+. |.++++.++++..++ T Consensus 288 ~~v~n~~~~~~L~~lkD~~Gr~l~~~~~~~g~~~~l~G~PV~~~~~~p~~~~~~~~i~~Gd~~~~~~i~~~~~~~i~~d~ 367 (407) T protein:vir:48 288 KFMMNNSSLFAIRLLKDNDGNYLWRPGIELGQPSSLAGYGIVENEQMPDIAADAKAIAFGNFKRGYTIVDRIGTRILRDP 367 (407) T ss_pred EEEEcHHHHHHHHHhhccCCceeeccCcCCCCCceecceeeEEecCcCCccCCccEEEEEeccccEEEEEeeceEEEeec Confidence 99999999999999999999998743 3456899999999999985 2378999985 888999999998876 Q ss_pred ccccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 803 YALDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 803 ~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) + +.+|++.||++.|+|+++++|+||++++.+- T Consensus 368 ~--~~~~~~~~~~~~r~d~~v~~~~a~~~l~~~a 399 (407) T protein:vir:48 368 Y--TNKPFVGFYTTKRTGGMLVDSQAIKLMKIGA 399 (407) T ss_pred c--ccCCcEEEEEEEEeccEEecccceEEEEeec Confidence 5 5789999999999999999999999999877 No 11 >protein:vir:4074 Length: 480 # NCBI annotation: major capsid (head) protein # Family: family:all:11745 # MgeID: mge:85 # MgeName: c2 # Cross-refs: genbank:acc:NP_043553;genbank:gi:9628687;genbank:GeneID:1261180 Probab=100.00 E-value=5.3e-57 Score=329.11 Aligned_cols=463 Identities=11% Similarity=0.025 Sum_probs=258.3 Q ss_pred cccceEEEEEecCcccccccCcEEEeccccccchhhhcCCCceEeecCCCCCcceEEEEEEEecCCEEEEEEEEcCCccc Q lcl|NC_016164. 238 EDPEVVEFTFSSEQPVERYFGMEVLSHDPDAMNMSRLNSGAAPWLWNHNAEVVLGVVERAWMGDDRRGRVRTRWSPNTKI 317 (836) Q Consensus 238 ~~~rt~~~~~~~~~~v~~~~~~e~l~~~~~a~~~~~~~~~~~~lL~~H~~~~~iG~v~~~~~~e~~~~~a~~~f~~~~~~ 317 (836) -+.|||+++++....+++++++ + .++++. +..+||||+|| +|||++... .++++ + . T Consensus 1 ~~~~~~~G~a~~~~~~d~~gd~-~---~~~a~~-----~~~~~~l~~H~--~~iG~~~~~-~~~~~-~-----------~ 56 (480) T protein:vir:40 1 MKVKAVRGIANPLGTIDAHGTV-I---ESIANA-----GDGVDILNRHR--EKIGSGFVH-LEGDN-V-----------I 56 (480) T ss_pred CcceEEEEEEecCCCCCCcchh-e---cccccC-----CcCceeeeeCC--ceeeEEEEe-ecCCC-C-----------c Confidence 6889999999998777766643 3 356653 34689999997 799998653 33332 2 2 Q ss_pred ccHHHHHHHHHHhcCccceeeeeeEeeccccccCCCCeEEEEEEEEEEEEEEeccCccchhhhhhhhhhhhhhhhhhhhh Q lcl|NC_016164. 318 EGSEEYKRRQDWESGTIRNVSFMYSIDAPLDLTSREGMALVTAFTPMEVSAVSIPADHTVGQGRKATSSSGPPGAAAATV 397 (836) Q Consensus 318 ~~~~~~~~~~~v~~G~l~~~SiG~~v~~~~~~~~~~~~~~~~~~~l~EiS~V~~pA~~~a~v~~~~~~~~~~~~~~~~~~ 397 (836) .+++++++|++||+|.|++|||||++.++++. ..++++++++++|+|||+|++|||++|.|...+.......... T Consensus 57 ~t~~~~~~~~~~k~g~~~~~Sigf~~~~~~~~-~~~~~~~~~~~~l~EvS~v~~pa~~~a~v~~vks~~~~~e~~~---- 131 (480) T protein:vir:40 57 LTGYVDEEQYTAEKIEETGLSVGFNANGVKAR-EIDGVGYYKDVTITEVSLTPLPSNKGAKVTKVREENKGEQEQM---- 131 (480) T ss_pred cchhHHHHHHHHHcCCccceeeeeeeeecccc-cCCCeEEEEEEEEEEeEEeecccchhhhhhhhhhhhhhhhhhh---- Confidence 46789999999999999999999999886543 4456688899999999999999999999864332111000000 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhHHHH Q lcl|NC_016164. 398 APLSHNDNNHMDSSTIDMEAVRAQAAADERSRVASITSLCREHKADDLAQGLIESGASEADAMRSVLSEIAKRPAAQPAT 477 (836) Q Consensus 398 ~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~ei~al~~~~~l~e~a~eliee~~t~~e~~~~~l~~l~~~~~a~~~~ 477 (836) ...+..+.. .+ ... ....+.+...+ ++++.......... T Consensus 132 -------------~~~e~~e~~----~e------~~e---~~~~~~el~ak---------------l~el~k~~ee~k~~ 170 (480) T protein:vir:40 132 -------------GANETQEIM----KQ------AIE---AGVKVRELEAK---------------VEELNKEREELKKE 170 (480) T ss_pred -------------hhHHHHHHH----Hh------hhh---hhhhhhhHHHH---------------HHHHHhHHHHHhhh Confidence 000000000 00 000 00000000000 00000000000000 Q ss_pred HHHHhhhhhhHHHHhhhhhhhhhhhHHHHHHHhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhhhhhhhh Q lcl|NC_016164. 478 PAAPVRSAQPIAAGGGSADIGLTDKEARSFSFVRAIRAQMMPGDRAAFEAAAFEREVSEATAQRMGVTPRGILAPNDVLH 557 (836) Q Consensus 478 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~~~~~a~~~~~~~g~~~~g~~~~~~~~~ 557 (836) . ..... ..... ... ....+...... .... ...+.+ .... T Consensus 171 ~--------~~~~~--~~~~~----~~~----~~e~r~~~~~~------~~~~----e~~~~~-------------~~~~ 209 (480) T protein:vir:40 171 R--------EASIP--SEKPE----DAE----RKFMRELGSKM------AEMP----EQGFLR-------------EFAN 209 (480) T ss_pred h--------hhhcc--ccchh----hhh----hHHHHHHHHHh------ccch----hhhhhh-------------hhhh Confidence 0 00000 00000 000 00000000000 0000 000000 0000 Q ss_pred hhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEecCCceeeeeccCcccccc--ccccee Q lcl|NC_016164. 558 RDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQTGAATAYWVAEGGDPTES--QPSVDQ 635 (836) Q Consensus 558 ~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~--~~~~~~ 635 (836) + ........++.++|. +...+........+....+.. ...+.....|++|+...+.. ..++.+ T Consensus 210 ~--~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~------------~~~g~~~~~~~~e~~~~~~~~~~~~~~~ 274 (480) T protein:vir:40 210 G--ADLNVVNSLGSITSK-YARKSGIYDGAMKARFQGLTL------------AEDGVDDTFISGTFKAGTDKNKSQTATK 274 (480) T ss_pred h--ccccccccccccccc-hhhheeechhhhhhhhhccee------------eeccccceeeeeeeeccccccccccccc Confidence 0 011122233344443 333222222222222221110 11223344566655433221 112233 Q ss_pred EEee---eeeeeeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCc-ccccccccccccccccccccchh Q lcl|NC_016164. 636 VALV---AKTLGAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSN-SQPEGLKFVTGINTENFGATNPT 711 (836) Q Consensus 636 it~~---~~t~~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~-~~p~Gi~~~~~~~~~t~aa~~~t 711 (836) ..+. ++++++....|+++|.|+ .++++||..+|++.++++++.+|++|+|++ +.+.|+...... .+..++ T Consensus 275 ~~~~~~~v~~l~~~~k~t~~lLDDa-~~l~~~i~~~l~~~~~~~ee~a~l~G~g~g~~~~~g~~~~~~~-----~~~~~~ 348 (480) T protein:vir:40 275 RSLRPQMAEAYLQMDKATVRGVNDS-GALSEYVMSEMVNRVIQKVEYNMILGSVDGSNGFYGLKTATDG-----WTKQIE 348 (480) T ss_pred chhhHHHHHHHHHhHHHHHHHhhhh-HHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccccceeeccc-----ccccch Confidence 3333 578888889999998765 589999999999999999999999997654 345565433211 112233 Q ss_pred HH-HHHHHHHHHhhhccccCccEEEecHHHHHHHHHHhhccCcccccc----CCCCeecceeeEee-CccccceEEEEeh Q lcl|NC_016164. 712 YV-ELVSMESKVAADNADIGAMSYLTNSTLYGGFKTTEKATSTAQFVL----EPGGTVNGYNVVRS-NQVANGDVFFGVW 785 (836) Q Consensus 712 ~~-~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~~g~~~~~~----~~~~~l~G~pVv~s-~~~~~~~i~~gD~ 785 (836) .+ .|..++.+|...++. +.+.|+|||.+|..|++|||++|+|+|.+ +.+.+|+|+||+++ ..+|.+...++.+ T Consensus 349 ~~d~id~L~~al~~~y~~-~a~~~vmn~~t~~~I~klKD~~G~Yi~q~~~~~~~~~~llG~pvv~~~~~~~~~~~~~~~~ 427 (480) T protein:vir:40 349 YTDLFEGITDAVAECSIS-DAITIVMSPQTFAELRKAKGTDGHSRFNELATKEQIAQSFGAVNLETRVWMPKDEVAVYNH 427 (480) T ss_pred hHHHHHHHHHhhhHHhhC-CCCEEEECHHHHHHHHHhhcCCCCeeccCcccccCcceecccceeeeeccccCCcceeeeC Confidence 33 445688888877654 33469999999999999999999987743 34678999998765 5667666666666 Q ss_pred hc-eEEEeecceEEEEecccccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 786 NQ-MIMGMWGALDIQVNPYALDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 786 s~-~~i~~~~~l~i~~~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) +. +.+++++ ++.. +...+..++..|.++.|+++++..|+||+.+|.=. T Consensus 428 ~~~~~~~d~~-~~~~--~~~~~~~~~~~~~~e~~v~g~~~~~~~~~~~~~~~ 476 (480) T protein:vir:40 428 DEYVLIGDLN-VENY--NDFDLRYNVEQWLSETLVGGSIRGKNRSAYLKKKG 476 (480) T ss_pred CccEEEEecc-ccee--cccccccchhhhhhhhhhceeeEccccEEEEEecc Confidence 54 5667764 4432 22345678899999999999999999998877433 No 12 >protein:vir:100247 Length: 425 # NCBI annotation: gp76 # Family: family:all:21 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355412;genbank:gi:77864702;genbank:GeneID:3725969 Probab=100.00 E-value=5.8e-56 Score=323.42 Aligned_cols=399 Identities=13% Similarity=0.124 Sum_probs=249.3 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhh Q lcl|NC_016164. 393 AAATVAPLSHNDNNHMDSSTIDMEAVRAQAAADERSRVASITSLCREHKADDLAQGLIESGASEADAMRSVLSEIAKRPA 472 (836) Q Consensus 393 ~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~ei~al~~~~~l~e~a~eliee~~t~~e~~~~~l~~l~~~~~ 472 (836) +.......-+......... .......+.......+++++. +++..+.....+.....++++..... T Consensus 1 ~~~~~~~~~~~~~~~~~~~-----~~~~~l~e~ra~~~~e~~~l~---------~~~~~~~~~~k~~~~~~~~~~~~~~~ 66 (425) T protein:vir:10 1 MSKKLLIAVLTAALTGPVG-----AVPRGIISVRAEGPTEVKALI---------ENLQKAFHDFKAEHTKQLDAVKAGLP 66 (425) T ss_pred CchhHHHHhhHHHhhhhhh-----hhhHHHHHHHHHHHHHHHHHH---------HHHHHHHHHHHHHHHHHHHHHHhhhc Confidence 0000000000000000000 000000000000111111111 11111111111111111111111100 Q ss_pred hHHHHHHHHhhhhhhHHHHhhhhhhhhhhhHHHHHHHhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhhh Q lcl|NC_016164. 473 AQPATPAAPVRSAQPIAAGGGSADIGLTDKEARSFSFVRAIRAQMMPGDRAAFEAAAFEREVSEATAQRMGVTPRGILAP 552 (836) Q Consensus 473 a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~~~~~a~~~~~~~g~~~~g~~~~ 552 (836) ...... .................+. ............. .......+....+.... . T Consensus 67 ~~e~~~-----~~~~~~~ei~~~~~~~~~~-------~~~~~~~~~~~~~---~~~~~~~~~~~af~~~l---------~ 122 (425) T protein:vir:10 67 TSDALA-----KVDKVSADLEALQAAVDEA-------NIKIAAAQMGANG---VKPLRDPEYTEAFKAHV---------K 122 (425) T ss_pred cHHHHH-----HHHHHHHHHHHHHHHHHHH-------HHHHHhhhccccc---ccccccHHHHHHHHHHh---------h Confidence 000000 0000000000000000000 0000000000000 00000011111111100 0 Q ss_pred hhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEecCCceeeeeccCccccccc-c Q lcl|NC_016164. 553 NDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQTGAATAYWVAEGGDPTESQ-P 631 (836) Q Consensus 553 ~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~-~ 631 (836) ....++..+.++.+.|++++|+.+...|++.+++.+++++++ ++.+..++.+++|+.++++.+.|++|++.+|.++ + T Consensus 123 -~~e~~~al~~~t~~~gG~lvP~~~~~~ii~~~~~~s~l~~l~-~~~~~~~~~~~~~~~~~~~~a~wv~E~~~~~~~~~~ 200 (425) T protein:vir:10 123 -RGDVQAALNKGEDSEGGYLTPIEWDRTITNKLVLISPMRQLC-RVQPVSKAGFSKLFNMGGTTSGWVGEASQRPQTNAA 200 (425) T ss_pred -hhhhHHHhhcCcCCCCceeccHhHHHHHHHHHHhhhhhhhhc-eeeeccCCceEEEEEcCCcceeeecccccccccccc Confidence 011223334455677888899999999999999999999985 5667777889999999999999999999999875 7 Q ss_pred cceeEEeeeeeeeeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccccc-------- Q lcl|NC_016164. 632 SVDQVALVAKTLGAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTE-------- 703 (836) Q Consensus 632 ~~~~it~~~~t~~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~-------- 703 (836) +|+++++.+++++++++||+++|.|+.++++++|.++|+++++++++.+|++|+|+ ++|.||++.....+. T Consensus 201 ~f~~v~~~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~ai~~~~d~~~l~G~G~-~~p~Gil~~~~~~~~~~~~~~~~ 279 (425) T protein:vir:10 201 TFQPLSFASGEIYANPAATQQILDDAEIDLESWLATEVQTEFAKQEGKAFLAGDGT-NKPNGLLTYIAGGANAAKHPFGA 279 (425) T ss_pred ccceeeeeheeeEeehHhHHHHHhcchhHHHHHHHHHHHHHHHHHHHhhhhcccCC-CCcceeeeccccccccccccccc Confidence 99999999999999999999999999999999999999999999999999999997 589999986543322 Q ss_pred -----cccccchhHHHHHHHHHHHhhhccccCccEEEecHHHHHHHHHHhhccCcccccc----CCCCeecceeeEeeCc Q lcl|NC_016164. 704 -----NFGATNPTYVELVSMESKVAADNADIGAMSYLTNSTLYGGFKTTEKATSTAQFVL----EPGGTVNGYNVVRSNQ 774 (836) Q Consensus 704 -----t~aa~~~t~~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~~g~~~~~~----~~~~~l~G~pVv~s~~ 774 (836) +..++.+++++|.+++..|...+ ..+++|+|||++|..|+.++|++|+|+|.+ +.+++|+|+||++++. T Consensus 280 ~~~~~~~~~~~~~~d~l~~l~~~l~~~~--~~~a~~vmn~~~~~~L~~lkD~~G~~l~~~~~~~g~~~~l~G~PV~~~~~ 357 (425) T protein:vir:10 280 IEVVNSGAAADITSDGIIDLVYDLPSAF--TGNARFAMNRNTQRQVRKLKDGQGNYLWQPSYVAGQPATLAGYPVTEVPD 357 (425) T ss_pred cccccccccccccHHHHHHHHhhhhhhh--ccCCEEEEchHHHHHHHHhhcCCCceeeccCccCCCCceecceeeEEecC Confidence 23445678999999999998776 457899999999999999999999998743 3457899999999999 Q ss_pred ccc-----ceEEEEehhc-eEEEeecceEEEEecccccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 775 VAN-----GDVFFGVWNQ-MIMGMWGALDIQVNPYALDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 775 ~~~-----~~i~~gD~s~-~~i~~~~~l~i~~~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) +|. ..++||||++ |.++++.++++..+++ +.+|++.|+++.|+|+++++|+||++++.+. T Consensus 358 ~p~~~~~~~~i~~Gd~~~~~~i~~~~~~~v~~d~~--~~~~~~~~~~~~r~d~~v~~~~A~~~l~~~a 423 (425) T protein:vir:10 358 MPDVAANSTPILFGDFQQTYLIIDRIGVRVLRDPY--TAKPYVLFYTTKRVGGGLLNPEPMRAMKVAA 423 (425) T ss_pred cCCccCCccEEEEEehhccEEEEEecceEEEeccc--ccCCcEEEEEEEEeccEeecccceEEEEeec Confidence 984 3489999997 7889999998877665 6789999999999999999999999999999 No 13 >protein:vir:6242 Length: 390 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813696;swissprot:trembl:q859c1;genbank:gi:29366756;interpro:IPR006444;uniprot:Q859C1;genbank:GeneID:1258897 Probab=100.00 E-value=2.4e-56 Score=325.55 Aligned_cols=381 Identities=14% Similarity=0.120 Sum_probs=246.5 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhHHHHHHHHhhhhhhHH Q lcl|NC_016164. 410 SSTIDMEAVRAQAAADERSRVASITSLCREHKADDLAQGLIESGASEADAMRSVLSEIAKRPAAQPATPAAPVRSAQPIA 489 (836) Q Consensus 410 ~~~~~~e~~~~~~~~~~~~~~~ei~al~~~~~l~e~a~eliee~~t~~e~~~~~l~~l~~~~~a~~~~~~~~~~~~~~~~ 489 (836) +..+...+.. +.......+++++.....- ..+..+.....+.....++.+.++............. T Consensus 1 m~~~~l~~l~----e~r~~~~~e~~~L~~~~~~----~~lt~e~~~~~~~l~~e~~~l~~~i~~~~~~~~~~~~------ 66 (390) T protein:vir:62 1 MDATTLSANF----EARERATAELRTLTDEFAG----KEMTDEAREKEERLITAVSDYDARIKRGIEAIKAIDP------ 66 (390) T ss_pred CChhHHHHHH----HHHHHHHHHHHHHHHHhhc----ccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------ Confidence 1111111100 0000111122222211110 0111111111111111122221111111000000000 Q ss_pred HHhhhhhhhhhhhHHHHHHHhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhccccccc Q lcl|NC_016164. 490 AGGGSADIGLTDKEARSFSFVRAIRAQMMPGDRAAFEAAAFEREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTASAAG 569 (836) Q Consensus 490 ~~~~~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g 569 (836) . .... ............. . ... ...+. +.+.. +. .............+...+| T Consensus 67 -~---------~~~~------~~~~~~~~~~~~~--~----~~~-~~~~~-r~~~~--~~-~r~~~~~~~~~~~t~~~~g 119 (390) T protein:vir:62 67 -V---------TSLL------SGLQGSGSGAQRS--A----DVD-DDATL-RAGNL--GE-ARSFEFAPEKRDGTKAGNP 119 (390) T ss_pred -H---------HHHH------hhcccccccchhh--c----chH-HHHHH-hhhhh--hh-hHHHHhhhhhhcccccCCC Confidence 0 0000 0000000000000 0 000 00000 00000 00 0000001111122334456 Q ss_pred ccccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEecCCceeeeeccCcccccccccceeEEeeeeeeeeeehh Q lcl|NC_016164. 570 DLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQTGAATAYWVAEGGDPTESQPSVDQVALVAKTLGAYTEF 649 (836) Q Consensus 570 ~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~it~~~~t~~~~i~I 649 (836) ++++|+.....|.+.++..+++++++....+...+.+.+|+.++.+.+.|++|++++|+++++|+++++++++++++++| T Consensus 120 ~~~~~~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~p~~~~~~~a~wv~E~~~~~~~~~~f~~i~~~~~k~~~~~~i 199 (390) T protein:vir:62 120 NVLSRTLYGQLIAQAVERSAIMRGGATTFTTSDANPLDFTVITGRSSASIVGETAEIPESYPATAQRSMGGFKYGFASVV 199 (390) T ss_pred ccccccchHHHHHHHHhhhhhhhhcceeeecCCCceeEEEEEcCCcceeeecccccccccccceeeeEeeeeeEEeehHH Confidence 67777777777777888888888886655444456789999999999999999999999999999999999999999999 Q ss_pred HHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccccc---cccccchhHHHHHHHHHHHhhhc Q lcl|NC_016164. 650 SRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTE---NFGATNPTYVELVSMESKVAADN 726 (836) Q Consensus 650 SrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~---t~aa~~~t~~~l~~a~~~l~~~~ 726 (836) |+|+|.|+.++++++|.+.|+++++.++|.+|++|+| +|.||++....... ...++.+++++|.+++++|...+ T Consensus 200 S~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~l~G~G---~p~Gi~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~l~~~~ 276 (390) T protein:vir:62 200 SYEFATDQVLDLVGFLVSDAGPAIGDAMGRHFITGTG---QPRGILTDASPATATFLATDTDSKVSDALIDLFHEVPSAY 276 (390) T ss_pred HHHHHhhhhHHHHHHHHHHHHHHHHHHHHhhhhccCC---ccccccccccccccceecccccccchHHHHHHHHhhhhhh Confidence 9999999999999999999999999999999999987 58999987654332 23345578999999999998765 Q ss_pred cccCccEEEecHHHHHHHHHHhhccCcccccc----CCCCeecceeeEeeCccccceEEEEehhceEEEeecceEEEEec Q lcl|NC_016164. 727 ADIGAMSYLTNSTLYGGFKTTEKATSTAQFVL----EPGGTVNGYNVVRSNQVANGDVFFGVWNQMIMGMWGALDIQVNP 802 (836) Q Consensus 727 ~~~~~~~~vmnp~~~~~L~~lkd~~g~~~~~~----~~~~~l~G~pVv~s~~~~~~~i~~gD~s~~~i~~~~~l~i~~~~ 802 (836) . .+++|+||++++..|++++|++|+|+|.. +.+.+|+|+||++++.+|++.++||||++|.+++++++++.++. T Consensus 277 ~--~~a~~vmn~~~~~~L~~lkd~~g~~l~~~~~~~g~~~~l~G~Pv~~~~~~p~~~i~~gd~s~~~i~~~~~~~v~~~~ 354 (390) T protein:vir:62 277 R--ANAKYVVNDLRAAQMRKLKDANGQYLWQSGLTVGAPSLFNGKVVETDDGMPADKILFADLSKYRVRFAGSLRVDRSV 354 (390) T ss_pred h--cCCEEEEchHHHHHHHHhhccCCCeeecCCcCCCccceecccceEEecCCCCccEEEeeccceeEEeecceEEEeec Confidence 4 56799999999999999999999988743 34568999999999999999999999999999999999999999 Q ss_pred ccccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 803 YALDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 803 ~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) +.+|.+|++.||++.|+|+++++|+||++++.+= T Consensus 355 ~~~~~~~~~~~~~~~r~d~~~~~~~A~~~l~~~~ 388 (390) T protein:vir:62 355 DAKFSTDQIVYRFLQRADGLLVDARGAKVLTVTP 388 (390) T ss_pred cccccCCcEEEEEEEEeCcEeechhheEEEEeec Confidence 9999999999999999999999999999988444 No 14 >protein:vir:4456 Length: 401 # NCBI annotation: Major capsid protein precursor # Family: family:all:21 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700379;genbank:gi:23505451;genbank:GeneID:955658 Probab=100.00 E-value=1.3e-55 Score=321.58 Aligned_cols=376 Identities=13% Similarity=0.145 Sum_probs=245.3 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhHHHHHHHHhhhhhhHHHHh Q lcl|NC_016164. 413 IDMEAVRAQAAADERSRVASITSLCREHKADDLAQGLIESGASEADAMRSVLSEIAKRPAAQPATPAAPVRSAQPIAAGG 492 (836) Q Consensus 413 ~~~e~~~~~~~~~~~~~~~ei~al~~~~~l~e~a~eliee~~t~~e~~~~~l~~l~~~~~a~~~~~~~~~~~~~~~~~~~ 492 (836) +..+....... ..+....+.++. +..++..++............+.+..+..+....... T Consensus 1 m~~~lk~l~~~--~~el~~~~~~~k------~~~~~~~~~~e~~~~~l~~~~~~l~~~~~~~~~~~~~------------ 60 (401) T protein:vir:44 1 MAVDIKDVEQV--AQELQQKFDDFK------AKNDKRVEAIEQEKGKLAGQVETLNGKLSELENLKSD------------ 60 (401) T ss_pred CCccHHHHHHH--HHHHHHHHHHHH------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------------ Confidence 11110000000 000000000000 0000000000000000000001100000000000000 Q ss_pred hhhhhhhhhhHHHHHHHhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhcccccccccc Q lcl|NC_016164. 493 GSADIGLTDKEARSFSFVRAIRAQMMPGDRAAFEAAAFEREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTASAAGDLV 572 (836) Q Consensus 493 ~~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~~v 572 (836) . .+. ......+........ ..+....+..... .+........+......++.+.|+++ T Consensus 61 ------~-~~~---------~~~~~~~~~~~~~~~---~~e~~~a~~~~lr---~~~~~~~~~~e~~a~~~~~~~~GG~~ 118 (401) T protein:vir:44 61 ------L-EKE---------LLELKRPARGAQNKV---AAEHKDAFVGFLR---KGREDGLRDLERKALQVGTDEDGGYA 118 (401) T ss_pred ------H-HHH---------HHHhhccccccccch---hHHHHHHHHHHHh---hhhhhhhHHHHHHHhhcCCCCCCcee Confidence 0 000 000000000000000 0001111111100 01001111122333444555677889 Q ss_pred cchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEecCCceeeeeccCccccc-ccccceeEEeeeeeeeeeehhHH Q lcl|NC_016164. 573 FTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQTGAATAYWVAEGGDPTE-SQPSVDQVALVAKTLGAYTEFSR 651 (836) Q Consensus 573 vp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~-~~~~~~~it~~~~t~~~~i~ISr 651 (836) +|+.+...|++.+++.+++++++ ++++..++.+.+|+..+++.+.|++|+++++. ..++|+++++.+++++++++||+ T Consensus 119 iP~~~~~~ii~~~~~~~~l~~~~-~~~~~~~~~~~~~~~~~~~~a~wv~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ 197 (401) T protein:vir:44 119 VPEELDRSILSLLKDEVVMRQEA-TVITVGGSDYKKLVNLGGTASGWVGETDTRSQTATSRLGLIEPFMGEIYGNPQATQ 197 (401) T ss_pred ccHhHHHHHHHHHHhhhhhhhhc-eeeecCCCceEEEEecCCccceeeccccccCccccccceeeeeehhheeeehhhhH Confidence 99999999999999999999984 55677778899999999999999999999886 45899999999999999999999 Q ss_pred HHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCccccccccccccccc-------------ccccccchhHHHHHHH Q lcl|NC_016164. 652 RLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINT-------------ENFGATNPTYVELVSM 718 (836) Q Consensus 652 elL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~-------------~t~aa~~~t~~~l~~a 718 (836) ++|.|+.++++++|.+.|+++++++++.+||+|+|+ ++|.||++...... .+..++.++++++.++ T Consensus 198 ell~ds~~~l~~~i~~~la~ai~~~~~~~~l~G~G~-~~p~Gil~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~d~i~~~ 276 (401) T protein:vir:44 198 KMLDDAFFNVEAWINSELATEFAEQEEIAFTTGDGT-KKPKGFLAYESTEESDKARAFGKLQHIVSGEATAVTADAIIKL 276 (401) T ss_pred HHHhcchHHHHHHHHHHHHHHHHHHHHhhhhccCCC-CccceeeccccccccccccccccccccccccccccCHHHHHHH Confidence 999999999999999999999999999999999998 68999997655322 2234456789999999 Q ss_pred HHHHhhhccccCccEEEecHHHHHHHHHHhhccCcccccc----CCCCeecceeeEeeCccccc-----eEEEEehhc-e Q lcl|NC_016164. 719 ESKVAADNADIGAMSYLTNSTLYGGFKTTEKATSTAQFVL----EPGGTVNGYNVVRSNQVANG-----DVFFGVWNQ-M 788 (836) Q Consensus 719 ~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~~g~~~~~~----~~~~~l~G~pVv~s~~~~~~-----~i~~gD~s~-~ 788 (836) ++.|...+. .+++|+||+++|..|+.++|++|+|+|.. +.+++|+|+||++++.+|.. .++||||++ | T Consensus 277 ~~~l~~~~~--~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~g~~~~l~G~PVv~~~~~p~~~~~~~~i~~Gd~~~~~ 354 (401) T protein:vir:44 277 IYTLRKAHR--TGAKFMMNNNSLFAIRLLKDTEGNYLWRPGLELGQPSSLAGYGIAENEQMPDIAADAKAIAFGNFKRGY 354 (401) T ss_pred HHhcchhhh--cCCEEEEcHHHHHHHHHhhccCCceeecCCcCCCCCceecceeeEEecCcCCccCCccEEEEeehhccE Confidence 999987764 57899999999999999999999998743 34568999999999998842 378999986 7 Q ss_pred EEEeecceEEEEecccccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 789 IMGMWGALDIQVNPYALDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 789 ~i~~~~~l~i~~~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) .++++.++++..+++ +.+|++.||++.|+|+++++|+||++++.+- T Consensus 355 ~i~~~~~~~~~~~~~--~~~~~v~~~a~~r~d~~~~~~~a~~~l~~~a 400 (401) T protein:vir:44 355 TIVDRIGTRILRDPY--TNKPFVGFYTTKRTGGMLVDSQAIKLLKIAA 400 (401) T ss_pred EEEEecceEEeeecc--ccCCcEEEEEEEEeccEEecccceEEEEeec Confidence 889999999887765 6789999999999999999999999999777 No 15 >protein:vir:5739 Length: 366 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892050;genbank:gi:33770513;interpro:IPR006444;uniprot:Q7Y410;genbank:GeneID:1732928 Probab=100.00 E-value=4.1e-57 Score=329.69 Aligned_cols=339 Identities=20% Similarity=0.312 Sum_probs=244.5 Q ss_pred HHHhhhhhhhHHHHHHHHhhhhhhHHHHhhhhhhhhhh-hHHHHHHHhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHh Q lcl|NC_016164. 464 LSEIAKRPAAQPATPAAPVRSAQPIAAGGGSADIGLTD-KEARSFSFVRAIRAQMMPGDRAAFEAAAFEREVSEATAQRM 542 (836) Q Consensus 464 l~~l~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~a~~a~~~~~~~~~~~~~~~~~~~a~~~~~~~ 542 (836) +.+. ......... .........+ +........+..++...... . ...+....... T Consensus 1 ~a~~-------~a~~~~~~~--------~~~~~~~~~~~~~~kg~~~~~~~~a~a~~~g--~-------~~~a~~~a~~~ 56 (366) T protein:vir:57 1 MAAA-------VAVPVKAHS--------VAPGIIIKEELQQYKGAGMTRMVMSIAAGKG--N-------LADAAKFAATE 56 (366) T ss_pred Cccc-------ccccccccc--------cccccccccccccccchhHHHHHHHHHhccc--c-------hhHHHHHHHHh Confidence 1100 000000000 0000000000 00011111111111100000 0 00000000000 Q ss_pred hhhhhhhhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEecCCceeeeecc Q lcl|NC_016164. 543 GVTPRGILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQTGAATAYWVAE 622 (836) Q Consensus 543 g~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~~~~~a~~v~E 622 (836) .+ .....++ ..++.+.|++++|+.+.+.|++.+++.+++++++++.+++.++.+++|+.++++.++|++| T Consensus 57 ----~~----~~~~~~a--~~~~~~~Gg~lvP~~~~~~ii~~l~~~s~l~~lg~~~v~~~~g~~~~p~~t~~~~a~wv~E 126 (366) T protein:vir:57 57 ----LG----DTGLSMA--ISTAAGSGGALIPQNMQNEVIELLRDRTVVRILGARSIPLPNGNLSMPRLSGGATAGYVGE 126 (366) T ss_pred ----hc----chhhhhh--ccccccCCccccchhHHHHHHHHHhhhcchhhhceeeeecCCCceEEEEEeCCcceeeecc Confidence 00 0111122 2233446788889999999999999999999998888998899999999999999999999 Q ss_pred CcccccccccceeEEeeeeeeeeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCccccccccccccccc Q lcl|NC_016164. 623 GGDPTESQPSVDQVALVAKTLGAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINT 702 (836) Q Consensus 623 g~~~~~~~~~~~~it~~~~t~~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~ 702 (836) ++++++++++|+++++.+++++++++||+|+|.|+.++++++|.+.|++++++++|.+||+|+|++++|+||++...... T Consensus 127 ~~~~~~s~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~ 206 (366) T protein:vir:57 127 GKDVVATGATFDDVKLSAKTMIALVPVSNQLIGRAGFNVEQLLLGDILSAIATREDKAFLRDDGTGDTPKGMKAVATAAN 206 (366) T ss_pred CccccccccceeEEEEeeEEEEEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccceeecccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999889999998765543 Q ss_pred c--cccccchhHHHHHHHHHHHh----hhccccCccEEEecHHHHHHHHHHhhccCccccccCCCCeecceeeEeeCccc Q lcl|NC_016164. 703 E--NFGATNPTYVELVSMESKVA----ADNADIGAMSYLTNSTLYGGFKTTEKATSTAQFVLEPGGTVNGYNVVRSNQVA 776 (836) Q Consensus 703 ~--t~aa~~~t~~~l~~a~~~l~----~~~~~~~~~~~vmnp~~~~~L~~lkd~~g~~~~~~~~~~~l~G~pVv~s~~~~ 776 (836) . +..++..++.++.+++..+. ..+.+..++.|+|||.++..|++++|++|++.|....+++|+|+||++++++| T Consensus 207 ~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~vmn~~~~~~L~~lkd~~G~~l~~~~~~g~l~G~Pvv~s~~ip 286 (366) T protein:vir:57 207 RLVAWTGTAINLTTIDEYLDSLILKHMDSNSNMIRCGWGLSNRTYMTLFGLRDGNGNKVYPEMSQGILKGYPIQRTSAIP 286 (366) T ss_pred ceeeccccccchhhHHHHHHHHHHhhhccccccccCEEEecHHHHHHHHhhhccCCceeccCCCCCeecceeeEEccccc Confidence 2 22233444444443333332 23344567899999999999999999999999877777899999999999998 Q ss_pred c--------ceEEEEehhceEEEeecceEEEEecc-----------cccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 777 N--------GDVFFGVWNQMIMGMWGALDIQVNPY-----------ALDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 777 ~--------~~i~~gD~s~~~i~~~~~l~i~~~~~-----------~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) + ..++||||++|.+++++++++.++++ ..|++|++.||+++|+|++++||+||++++.+- T Consensus 287 ~~~~~~~~~~~i~~gdfs~~~i~~~~~i~i~~~~ea~~~~~~g~~~~~f~~~~~~iR~~~~~d~~v~~~~a~~~lt~~~ 365 (366) T protein:vir:57 287 ANLGDDGNESEIYFCDFNDVVIGEDGMMKVDFSTEATYKDADGQLVSAFARNQSLIRVVTEHDIGFRHPEGLVLGTGVI 365 (366) T ss_pred cccccCCCccEEEEEecceEEEEEecceEEEEeeccccccccccchhhhhcCceeEEeeeeeCcEeeccccEEEEeccc Confidence 6 24899999999999999999988775 247889999999999999999999999999999 No 16 >protein:vir:7855 Length: 497 # NCBI annotation: gp12 # Family: family:all:585 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817462;genbank:gi:29565891;genbank:GeneID:1259081 Probab=100.00 E-value=5.9e-55 Score=317.89 Aligned_cols=418 Identities=14% Similarity=0.047 Sum_probs=233.7 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhh----hhhhhhhhhhhhhhhhhhhhhhhhhHHH Q lcl|NC_016164. 383 ATSSSGPPGAAAATVAPLSHNDNNHMDSSTIDMEAVRAQAAADERSRV----ASITSLCREHKADDLAQGLIESGASEAD 458 (836) Q Consensus 383 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~----~ei~al~~~~~l~e~a~eliee~~t~~e 458 (836) +..... . ....+..................+.+... .++.++...........+...+.... T Consensus 1 ~~~~~~----------l--~~~~~~~~~~~~~~~~~~~~~~aE~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~-- 66 (497) T protein:vir:78 1 MPSTAQ----------L--EAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGA-- 66 (497) T ss_pred CCcchH----------H--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-- Confidence 000000 0 00000000000000000000000000000 00111110000000000000000000 Q ss_pred HHHHHHHHhhhhhhhHHHHHHHHhhhhhhHHHHhhhhhhhhhhhHHHHHHHhhhhhhhhhhhhhhhhhhhhhhHHHHHHH Q lcl|NC_016164. 459 AMRSVLSEIAKRPAAQPATPAAPVRSAQPIAAGGGSADIGLTDKEARSFSFVRAIRAQMMPGDRAAFEAAAFEREVSEAT 538 (836) Q Consensus 459 ~~~~~l~~l~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~~~~~a~~~ 538 (836) ...++.+............... ................... ........... ............. T Consensus 67 --~a~~~~~~~~~~~~e~~~~~~~-------~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~--~~~~~~~~~~~~~ 131 (497) T protein:vir:78 67 --DAAKDGLDNDIPEVEVRNLKQI-------RKHLARAVIMNPELKNATS----FEKGTKFDVSF--NVSAKAADPGTAA 131 (497) T ss_pred --HHHHHHHHHHHHHHHhhhhhhH-------HHHHHHHHhhhHHHHhhhh----hhhhhhhhhhh--hhhhhhhhhHHHH Confidence 0000001000000000000000 0000000000000000000 00000000000 0000000000000 Q ss_pred HHHhhhhhhhhhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEecC-Ccee Q lcl|NC_016164. 539 AQRMGVTPRGILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQTG-AATA 617 (836) Q Consensus 539 ~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~~-~~~a 617 (836) ....+....+... ....+ ....+.++.|+.++|+.+...|++.+++.+++++++. +++...+.+++|+.++ .+.+ T Consensus 132 ~~~~~~~~~~~~~--~~~~~-~~~~~~~~~gg~~vp~~~~~~ii~~~~~~~~i~~l~~-~~~~~~~~~~~~~~~~~~~~a 207 (497) T protein:vir:78 132 AELMGAFADGETA--PAAIG-QNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLIS-SRPVTSPNLSYLTESAAHNNA 207 (497) T ss_pred HHHHHHHhhhhhh--HHHHH-hhhcccCcccccccchhhhHHHHHHHHhhhhHHhhcc-ccccCCCceEEEEEcCCCCcc Confidence 0000000111111 11111 2223344556677888899999999999999999965 5566667899999876 4689 Q ss_pred eeeccCcccccccccceeEEeeeeeeeeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccccc Q lcl|NC_016164. 618 YWVAEGGDPTESQPSVDQVALVAKTLGAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFV 697 (836) Q Consensus 618 ~~v~Eg~~~~~~~~~~~~it~~~~t~~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~ 697 (836) .||+|++.+|+++++|+++++.+++++++++||+|||.|+ +.++++|.++|++++++++|.+||+|+|++ +|.||++. T Consensus 208 ~wv~E~~~~~~s~~~f~~i~~~~~k~a~~~~iS~ell~d~-~~l~~~i~~~l~~~i~~~~d~~~l~G~G~~-~p~Gil~~ 285 (497) T protein:vir:78 208 AAVAEAGTYPFSSEEFARVYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYP-GVNGLLQR 285 (497) T ss_pred eeeccCcccccccccceeeEeeeeeeEeecHhHHHHHHhH-HHHHHHHHHHHHHHHHHHHHHHhhcCCCcc-cccccccc Confidence 9999999999999999999999999999999999999875 679999999999999999999999999974 79999986 Q ss_pred cccccccccccc-------------------------------------------------------hhHHHHHHHHHHH Q lcl|NC_016164. 698 TGINTENFGATN-------------------------------------------------------PTYVELVSMESKV 722 (836) Q Consensus 698 ~~~~~~t~aa~~-------------------------------------------------------~t~~~l~~a~~~l 722 (836) +.....+..... ....++..++..+ T Consensus 286 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 365 (497) T protein:vir:78 286 STGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDI 365 (497) T ss_pred cccccccccccchhhhhhhhhhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhh Confidence 654332211100 1112334444444 Q ss_pred hhhccccCccEEEecHHHHHHHHHHhhccCccccccC----------CCCeecceeeEeeCccccceEEEEehhc--eEE Q lcl|NC_016164. 723 AADNADIGAMSYLTNSTLYGGFKTTEKATSTAQFVLE----------PGGTVNGYNVVRSNQVANGDVFFGVWNQ--MIM 790 (836) Q Consensus 723 ~~~~~~~~~~~~vmnp~~~~~L~~lkd~~g~~~~~~~----------~~~~l~G~pVv~s~~~~~~~i~~gD~s~--~~i 790 (836) ...+. ..+.+|+|||.+|..|+++||++|+|+|... .+++|||+||+++++||.++++||||++ |.+ T Consensus 366 ~~~~~-~~~~~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~pV~~t~~~~~~~~~~Gd~~~~~~~i 444 (497) T protein:vir:78 366 QLTLF-QTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQT 444 (497) T ss_pred hhhcc-cCCCeEEEchHHHHHHHHhhcCCCceeccCcccccccccccCCceeeceeeEecCCCCCCceEEeecccceEEE Confidence 44432 3456899999999999999999999887432 2348999999999999999999999986 567 Q ss_pred EeecceEEEEecc--cccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 791 GMWGALDIQVNPY--ALDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 791 ~~~~~l~i~~~~~--~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) +++.++++.++++ .+|.+|++.||++.|+|+.|++|+||++++..= T Consensus 445 ~~r~~~~v~~~~~~~~~f~~n~v~~r~~~r~~~~v~~p~A~~~l~~~~ 492 (497) T protein:vir:78 445 ARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKK 492 (497) T ss_pred EEecccEEEeecccchhhhcCcEEEEEEEeecceeeccccEEEEEecC Confidence 8999999999987 459999999999999999999999999999655 No 17 >protein:vir:101650 Length: 497 # NCBI annotation: gp13 # Family: family:all:585 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654768;genbank:gi:109302766;genbank:GeneID:4156084 Probab=100.00 E-value=5.9e-55 Score=317.89 Aligned_cols=418 Identities=14% Similarity=0.047 Sum_probs=233.7 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhh----hhhhhhhhhhhhhhhhhhhhhhhhhHHH Q lcl|NC_016164. 383 ATSSSGPPGAAAATVAPLSHNDNNHMDSSTIDMEAVRAQAAADERSRV----ASITSLCREHKADDLAQGLIESGASEAD 458 (836) Q Consensus 383 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~----~ei~al~~~~~l~e~a~eliee~~t~~e 458 (836) +..... . ....+..................+.+... .++.++...........+...+.... T Consensus 1 ~~~~~~----------l--~~~~~~~~~~~~~~~~~~~~~~aE~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~-- 66 (497) T protein:vir:10 1 MPSTAQ----------L--EAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGA-- 66 (497) T ss_pred CCcchH----------H--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-- Confidence 000000 0 00000000000000000000000000000 00111110000000000000000000 Q ss_pred HHHHHHHHhhhhhhhHHHHHHHHhhhhhhHHHHhhhhhhhhhhhHHHHHHHhhhhhhhhhhhhhhhhhhhhhhHHHHHHH Q lcl|NC_016164. 459 AMRSVLSEIAKRPAAQPATPAAPVRSAQPIAAGGGSADIGLTDKEARSFSFVRAIRAQMMPGDRAAFEAAAFEREVSEAT 538 (836) Q Consensus 459 ~~~~~l~~l~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~~~~~a~~~ 538 (836) ...++.+............... ................... ........... ............. T Consensus 67 --~a~~~~~~~~~~~~e~~~~~~~-------~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~--~~~~~~~~~~~~~ 131 (497) T protein:vir:10 67 --DAAKDGLDNDIPEVEVRNLKQI-------RKHLARAVIMNPELKNATS----FEKGTKFDVSF--NVSAKAADPGTAA 131 (497) T ss_pred --HHHHHHHHHHHHHHHhhhhhhH-------HHHHHHHHhhhHHHHhhhh----hhhhhhhhhhh--hhhhhhhhhHHHH Confidence 0000001000000000000000 0000000000000000000 00000000000 0000000000000 Q ss_pred HHHhhhhhhhhhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEecC-Ccee Q lcl|NC_016164. 539 AQRMGVTPRGILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQTG-AATA 617 (836) Q Consensus 539 ~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~~-~~~a 617 (836) ....+....+... ....+ ....+.++.|+.++|+.+...|++.+++.+++++++. +++...+.+++|+.++ .+.+ T Consensus 132 ~~~~~~~~~~~~~--~~~~~-~~~~~~~~~gg~~vp~~~~~~ii~~~~~~~~i~~l~~-~~~~~~~~~~~~~~~~~~~~a 207 (497) T protein:vir:10 132 AELMGAFADGETA--PAAIG-QNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLIS-SRPVTSPNLSYLTESAAHNNA 207 (497) T ss_pred HHHHHHHhhhhhh--HHHHH-hhhcccCcccccccchhhhHHHHHHHHhhhhHHhhcc-ccccCCCceEEEEEcCCCCcc Confidence 0000000111111 11111 2223344556677888899999999999999999965 5566667899999876 4689 Q ss_pred eeeccCcccccccccceeEEeeeeeeeeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccccc Q lcl|NC_016164. 618 YWVAEGGDPTESQPSVDQVALVAKTLGAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFV 697 (836) Q Consensus 618 ~~v~Eg~~~~~~~~~~~~it~~~~t~~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~ 697 (836) .||+|++.+|+++++|+++++.+++++++++||+|||.|+ +.++++|.++|++++++++|.+||+|+|++ +|.||++. T Consensus 208 ~wv~E~~~~~~s~~~f~~i~~~~~k~a~~~~iS~ell~d~-~~l~~~i~~~l~~~i~~~~d~~~l~G~G~~-~p~Gil~~ 285 (497) T protein:vir:10 208 AAVAEAGTYPFSSEEFARVYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYP-GVNGLLQR 285 (497) T ss_pred eeeccCcccccccccceeeEeeeeeeEeecHhHHHHHHhH-HHHHHHHHHHHHHHHHHHHHHHhhcCCCcc-cccccccc Confidence 9999999999999999999999999999999999999875 679999999999999999999999999974 79999986 Q ss_pred cccccccccccc-------------------------------------------------------hhHHHHHHHHHHH Q lcl|NC_016164. 698 TGINTENFGATN-------------------------------------------------------PTYVELVSMESKV 722 (836) Q Consensus 698 ~~~~~~t~aa~~-------------------------------------------------------~t~~~l~~a~~~l 722 (836) +.....+..... ....++..++..+ T Consensus 286 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 365 (497) T protein:vir:10 286 STGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDI 365 (497) T ss_pred cccccccccccchhhhhhhhhhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhh Confidence 654332211100 1112334444444 Q ss_pred hhhccccCccEEEecHHHHHHHHHHhhccCccccccC----------CCCeecceeeEeeCccccceEEEEehhc--eEE Q lcl|NC_016164. 723 AADNADIGAMSYLTNSTLYGGFKTTEKATSTAQFVLE----------PGGTVNGYNVVRSNQVANGDVFFGVWNQ--MIM 790 (836) Q Consensus 723 ~~~~~~~~~~~~vmnp~~~~~L~~lkd~~g~~~~~~~----------~~~~l~G~pVv~s~~~~~~~i~~gD~s~--~~i 790 (836) ...+. ..+.+|+|||.+|..|+++||++|+|+|... .+++|||+||+++++||.++++||||++ |.+ T Consensus 366 ~~~~~-~~~~~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~pV~~t~~~~~~~~~~Gd~~~~~~~i 444 (497) T protein:vir:10 366 QLTLF-QTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQT 444 (497) T ss_pred hhhcc-cCCCeEEEchHHHHHHHHhhcCCCceeccCcccccccccccCCceeeceeeEecCCCCCCceEEeecccceEEE Confidence 44432 3456899999999999999999999887432 2348999999999999999999999986 567 Q ss_pred EeecceEEEEecc--cccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 791 GMWGALDIQVNPY--ALDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 791 ~~~~~l~i~~~~~--~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) +++.++++.++++ .+|.+|++.||++.|+|+.|++|+||++++..= T Consensus 445 ~~r~~~~v~~~~~~~~~f~~n~v~~r~~~r~~~~v~~p~A~~~l~~~~ 492 (497) T protein:vir:10 445 ARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKK 492 (497) T ss_pred EEecccEEEeecccchhhhcCcEEEEEEEeecceeeccccEEEEEecC Confidence 8999999999987 459999999999999999999999999999655 No 18 >protein:vir:10364 Length: 390 # NCBI annotation: head protein; major capsid subunit precursor # Family: family:all:585 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858956;genbank:gi:32128421;genbank:GeneID:2648357 Probab=100.00 E-value=4.3e-55 Score=318.63 Aligned_cols=383 Identities=17% Similarity=0.140 Sum_probs=245.9 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhHHHHHHHHhhhhhh Q lcl|NC_016164. 408 MDSSTIDMEAVRAQAAADERSRVASITSLCREHKADDLAQGLIESGASEADAMRSVLSEIAKRPAAQPATPAAPVRSAQP 487 (836) Q Consensus 408 ~~~~~~~~e~~~~~~~~~~~~~~~ei~al~~~~~l~e~a~eliee~~t~~e~~~~~l~~l~~~~~a~~~~~~~~~~~~~~ 487 (836) +. +..............++.++....... .++..+.....+.....++.+..+..+............ T Consensus 1 m~-------e~~~~l~~~~~~~~~~~~~~~e~~~~~---~~~~~e~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~-- 68 (390) T protein:vir:10 1 MT-------DITSKLEATLANVTDSLRAFGERAVRD---GELNASARSKVDELFATVGNLSAEVQAARQRVAELEGNG-- 68 (390) T ss_pred Ch-------HHHHHHHHHHHHHHHHHHHHHHHHHhh---cccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc-- Confidence 11 111111111111111111111110000 000000000001111111111111111000000000000 Q ss_pred HHHHhhhhhhhhhhhHHHHHHHhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhccccc Q lcl|NC_016164. 488 IAAGGGSADIGLTDKEARSFSFVRAIRAQMMPGDRAAFEAAAFEREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTASA 567 (836) Q Consensus 488 ~~~~~~~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~ 567 (836) .... ........... .......+.......................+.++.. T Consensus 69 -----~~~~-----------------~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 120 (390) T protein:vir:10 69 -----AGGD-----------------VQHVSVGDLFV------ASEQFQASAGRWNDRSARATMNIKAALNTASTDAAGS 120 (390) T ss_pred -----cccc-----------------ccccchhhhhh------hhHHHHHHHHhhhhhhhhhhhHHHHHHHhhhcccccc Confidence 0000 00000000000 0000001110000000000011111122233444445 Q ss_pred ccccccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEecCC-ceeeeeccCcccccccccceeEEeeeeeeeee Q lcl|NC_016164. 568 AGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQTGA-ATAYWVAEGGDPTESQPSVDQVALVAKTLGAY 646 (836) Q Consensus 568 ~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~~~-~~a~~v~Eg~~~~~~~~~~~~it~~~~t~~~~ 646 (836) +|++++| .+...|++.+++.++|+++ +++++..++.+++|+.++. +.+.|++|++++++++++|+++++.+++++++ T Consensus 121 ~g~~~~~-~~~~~ii~~~~~~~~l~~~-~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~ 198 (390) T protein:vir:10 121 AGALTTP-NRLPGFITQPDARLTVRDL-IGSGRTDSALIEYVQETGFVNNAAIVAEGALKPESSLKFAKKTDTTHVIAHT 198 (390) T ss_pred cccccch-hHHHHHHHHHHhhchhhhh-cceeeccCCceEEEEEecCCcceeeecCCccccccccceeEEEEeeEEEEEe Confidence 5555555 5567899999999999998 4566777788999988764 67999999999999999999999999999999 Q ss_pred ehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCccccccccccccccccccc-ccchhHHHHHHHHHHHhhh Q lcl|NC_016164. 647 TEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTENFG-ATNPTYVELVSMESKVAAD 725 (836) Q Consensus 647 i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~a-a~~~t~~~l~~a~~~l~~~ 725 (836) ++||+++|.|+ +++.++|.+.|+++++++++.+||+|+|++.+|.||++.++....+.. ++...++++.+++..+... T Consensus 199 ~~is~ell~d~-~~l~~~i~~~l~~~~~~~~~~~il~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~ 277 (390) T protein:vir:10 199 MKATRQILSDA-PQLASYMNNRLIRGLKVKEDAEILRGTGANDGLLGLIPQATTYAAPTTIAGATRVDQLRLAMLQASLA 277 (390) T ss_pred ehhhHHHHHhH-HHHHHHHHHHHHHHHHHHHHHHHhhcCCCCccccccccccccccccccccccchHHHHHHHHHhhccc Confidence 99999999876 589999999999999999999999999999999999998776655443 3445688999999999876 Q ss_pred ccccCccEEEecHHHHHHHHHHhhccCcccccc---CCCCeecceeeEeeCccccceEEEEehhc-eEEEeecceEEEEe Q lcl|NC_016164. 726 NADIGAMSYLTNSTLYGGFKTTEKATSTAQFVL---EPGGTVNGYNVVRSNQVANGDVFFGVWNQ-MIMGMWGALDIQVN 801 (836) Q Consensus 726 ~~~~~~~~~vmnp~~~~~L~~lkd~~g~~~~~~---~~~~~l~G~pVv~s~~~~~~~i~~gD~s~-~~i~~~~~l~i~~~ 801 (836) +. .+++|+|||++|..|+.++|++|+|+|.. ..+++|+|+||++++.+|+++++||||++ |.++++.++.+.++ T Consensus 278 ~~--~~~~~v~n~~~~~~L~~lkd~~g~~l~~~~~~~~~~~l~G~pv~~~~~~p~~~~~~gdf~~~~~~~~~~~~~i~~~ 355 (390) T protein:vir:10 278 EY--PASGIVINPIDWAAIELAKDANNQYLIGNARGTLTPTLWGLPVVATQAMAPGEFLVGAFDLAAQIFDQWDARVEIG 355 (390) T ss_pred cC--CCCEEEEcHHHHHHHHHhhcCCCceeecCCcCcCCceecceeeEEcCCCCCCcEEEEeccceEEEEEecceEEEEe Confidence 54 56799999999999999999999988743 33568999999999999999999999986 67889999999987 Q ss_pred cc-cccccCcEEEEEEEEeccEEEcccceEEEeec Q lcl|NC_016164. 802 PY-ALDKSGSVRVTALQDVDVAVRHPEAFCRGNDN 835 (836) Q Consensus 802 ~~-~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A 835 (836) .+ .+|.+|++.||++.|+|+++++|+||++++.| T Consensus 356 ~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~~~~a 390 (390) T protein:vir:10 356 YVNDDFQRNMVTVLAEERLALVVYRPEALISGSFA 390 (390) T ss_pred ecccccccCcEEEEEEEeeccEEeccccEEEEEeC Confidence 75 68999999999999999999999999999999 No 19 >protein:vir:97053 Length: 390 # NCBI annotation: putative head protein # Family: family:all:585 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453565;genbank:gi:84662600;genbank:GeneID:5142468 Probab=100.00 E-value=3.8e-55 Score=318.92 Aligned_cols=383 Identities=16% Similarity=0.122 Sum_probs=248.6 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhHHHHHHHHhhhhhhHHHHhhh Q lcl|NC_016164. 415 MEAVRAQAAADERSRVASITSLCREHKADDLAQGLIESGASEADAMRSVLSEIAKRPAAQPATPAAPVRSAQPIAAGGGS 494 (836) Q Consensus 415 ~e~~~~~~~~~~~~~~~ei~al~~~~~l~e~a~eliee~~t~~e~~~~~l~~l~~~~~a~~~~~~~~~~~~~~~~~~~~~ 494 (836) +................++.++...... .. .........++++..+.........+.............. T Consensus 1 m~~~~~~l~~~~~~~~~~~~~~~e~~~~---------~~-~~~~e~~~~~~~~~~e~~~l~~~i~~~e~~~~~~~~~~~~ 70 (390) T protein:vir:97 1 MTDITAKLEATLANVTDSLKAFGERAVR---------DG-ELNASARSKVDELFATVGNLSAEVQAARQRVAELEGNGAG 70 (390) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHh---------hc-CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccc Confidence 1111111111111111111111111100 00 0000011111111111111111110000000000000000 Q ss_pred hhhhhhhhHHHHHHHhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhcccccccccccc Q lcl|NC_016164. 495 ADIGLTDKEARSFSFVRAIRAQMMPGDRAAFEAAAFEREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTASAAGDLVFT 574 (836) Q Consensus 495 ~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp 574 (836) .. ...... ..... .......+...................... ..+.++.++.++| T Consensus 71 ~~------------------~~~~~~-~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~g~lip 126 (390) T protein:vir:97 71 GD------------------VQHVSV-GDMFV----ASEQFQASTGRWNDRSARATMNIKAALNTA-STDAAGSAGALTT 126 (390) T ss_pred cc------------------cccccc-hhhhh----hhHHHHHHHHHhhhhhhhhhhHHHHHHHhh-hcccccccccccc Confidence 00 000000 00000 000011111111000000111111112222 2333455566667 Q ss_pred hhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEecCC-ceeeeeccCcccccccccceeEEeeeeeeeeeehhHHHH Q lcl|NC_016164. 575 DGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQTGA-ATAYWVAEGGDPTESQPSVDQVALVAKTLGAYTEFSRRL 653 (836) Q Consensus 575 ~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~~~-~~a~~v~Eg~~~~~~~~~~~~it~~~~t~~~~i~ISrel 653 (836) +.+...|++.+++.+++++++ +..+..++.+++|+.++. +.+.|++||+++++++++|+++++.+++++++++||+++ T Consensus 127 ~~~~~~ii~~~~~~~~i~~~~-~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~el 205 (390) T protein:vir:97 127 PNRLPGFITPPDARLTVRDLI-GSGRTDSALIEYVQETGFVNNAAIVAEGALKPESSLKFAKKTDTTHVIAHTMKATRQI 205 (390) T ss_pred hhhhHHHHHHHhhhhhhHhhc-ceeeccCCceEEEEEecCCcceeeecCCccccccccceeEEEEeeeeEEEeehhhHHH Confidence 778889999999999999984 556667788999998764 689999999999999999999999999999999999999 Q ss_pred HhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccccccc-cccchhHHHHHHHHHHHhhhccccCcc Q lcl|NC_016164. 654 MLQSSIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTENF-GATNPTYVELVSMESKVAADNADIGAM 732 (836) Q Consensus 654 L~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~-aa~~~t~~~l~~a~~~l~~~~~~~~~~ 732 (836) |.++ .+++++|.++|+++++++++.+||+|+|+++.|.||++.++...... .++...++++.+++..+...+. .++ T Consensus 206 l~ds-~~l~~~i~~~la~a~~~~~d~a~l~G~g~~~~p~Gi~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~--~~~ 282 (390) T protein:vir:97 206 LSDA-PQLASYMNNRLIRGLKVKEDAEILRGTGANDGLLGLIPQATTYAAPTTIAGATRVDQLRLAMLQASLAEY--PAS 282 (390) T ss_pred HHhH-HHHHHHHHHHHHHHHHHHHHHHHhhcCCCCccccceeeccccccccccccccchHHHHHHHHHhhccccC--CCC Confidence 9876 58999999999999999999999999999999999998877665543 3455678899999999987764 567 Q ss_pred EEEecHHHHHHHHHHhhccCcccccc---CCCCeecceeeEeeCccccceEEEEehhc-eEEEeecceEEEEecc-cccc Q lcl|NC_016164. 733 SYLTNSTLYGGFKTTEKATSTAQFVL---EPGGTVNGYNVVRSNQVANGDVFFGVWNQ-MIMGMWGALDIQVNPY-ALDK 807 (836) Q Consensus 733 ~~vmnp~~~~~L~~lkd~~g~~~~~~---~~~~~l~G~pVv~s~~~~~~~i~~gD~s~-~~i~~~~~l~i~~~~~-~~~~ 807 (836) +|+|||++|..|++++|++|+|+|.. ..+++|+|+||++++.+|+++++||||+. |.++++.++.+.++.+ .+|. T Consensus 283 ~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~f~ 362 (390) T protein:vir:97 283 GIVINPIDWAAIELAKDANNQYLIGNARGTLTPTLWGLPVVATQAMAPGEFLVGAFDLAAQIFDQWDARVEIGYVNDDFQ 362 (390) T ss_pred EEEEcHHHHHHHHHhhcCCCceeecCccCCCCceecceeeEEcCCCCCCcEEEEeccceEEEEEecceEEEEeecccccc Confidence 89999999999999999999988743 33568999999999999999999999986 7789999999998764 6899 Q ss_pred cCcEEEEEEEEeccEEEcccceEEEeec Q lcl|NC_016164. 808 SGSVRVTALQDVDVAVRHPEAFCRGNDN 835 (836) Q Consensus 808 ~~~~~~r~~~r~d~~v~~p~Af~~l~~A 835 (836) +|++.||++.|+|+++++|+||++++-| T Consensus 363 ~~~~~~r~~~r~d~~v~~~~a~v~~~~a 390 (390) T protein:vir:97 363 RNMVTVLAEERLALVVYRPEALITGSFA 390 (390) T ss_pred cCcEEEEEEEeeccEEeccccEEEEEeC Confidence 9999999999999999999999999999 No 20 >protein:vir:81070 Length: 390 # NCBI annotation: p09 # Family: family:all:585 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285679;genbank:gi:148727187;genbank:GeneID:5247115 Probab=100.00 E-value=4e-55 Score=318.83 Aligned_cols=383 Identities=16% Similarity=0.119 Sum_probs=246.4 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhHHHHHHHHhhhhhhHHHHhhh Q lcl|NC_016164. 415 MEAVRAQAAADERSRVASITSLCREHKADDLAQGLIESGASEADAMRSVLSEIAKRPAAQPATPAAPVRSAQPIAAGGGS 494 (836) Q Consensus 415 ~e~~~~~~~~~~~~~~~ei~al~~~~~l~e~a~eliee~~t~~e~~~~~l~~l~~~~~a~~~~~~~~~~~~~~~~~~~~~ 494 (836) +............+...++.++.... ..+.. ..+.....++++..+.........+........... . T Consensus 1 m~~l~~~l~~~~~~~~~~~~~~~e~~---------~~~~~-~~~e~~~~~~~l~~e~~~l~~~i~~~e~~~~~~~~~--~ 68 (390) T protein:vir:81 1 MTDITSKLEATLANVTDSLRAFGERA---------VRDGE-LNASARSKVDELFATVGNLSAEVQAARQRVAELEGN--G 68 (390) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHH---------HhhcC-cCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc--c Confidence 11111111111111111111111100 00000 000011111111111111111111000000000000 0 Q ss_pred hhhhhhhhHHHHHHHhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhcccccccccccc Q lcl|NC_016164. 495 ADIGLTDKEARSFSFVRAIRAQMMPGDRAAFEAAAFEREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTASAAGDLVFT 574 (836) Q Consensus 495 ~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp 574 (836) .. .. ...+ ...... . .....+.+.......................+.++. .++.++| T Consensus 69 ~~---~~---------~~~~---~~~~~~--~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~g~~~~ 126 (390) T protein:vir:81 69 AG---GD---------VQHV---SVGDMF--V----ASEQFQASAGRWNDRSARATMNIKAALNTASTDAAG-SAGALTT 126 (390) T ss_pred cc---cc---------cccc---cchhhh--h----hhHHHHHHHHHHhhhhhhhhhHHHHHHHhhcccccc-CCcceec Confidence 00 00 0000 000000 0 000001111110000000000111111122233333 4444555 Q ss_pred hhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEecCC-ceeeeeccCcccccccccceeEEeeeeeeeeeehhHHHH Q lcl|NC_016164. 575 DGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQTGA-ATAYWVAEGGDPTESQPSVDQVALVAKTLGAYTEFSRRL 653 (836) Q Consensus 575 ~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~~~-~~a~~v~Eg~~~~~~~~~~~~it~~~~t~~~~i~ISrel 653 (836) +.+...|++.+++.+++++++ ++++..++.+++|+.++. +.+.|++||+++++++++|+++++.+++++++++||+++ T Consensus 127 ~~~~~~ii~~~~~~~~l~~~~-~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~el 205 (390) T protein:vir:81 127 PNRLPGFITPPDARLTVRDLI-GSGRTDSALIEYVQETGFVNNAAIVAEGALKPESSLKFAKKTDTTHVIAHTMKATRQI 205 (390) T ss_pred hhhhHHHHHHHhhhhhhhhhc-ceeeccCCceEEEEEecCCcceeeecCCcccccccceeeEEEEeeeEEEEeehhhHHH Confidence 567778999999999999985 456777788999998764 578999999999999999999999999999999999999 Q ss_pred HhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccccccc-cccchhHHHHHHHHHHHhhhccccCcc Q lcl|NC_016164. 654 MLQSSIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTENF-GATNPTYVELVSMESKVAADNADIGAM 732 (836) Q Consensus 654 L~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~-aa~~~t~~~l~~a~~~l~~~~~~~~~~ 732 (836) |.++ ++++++|.+.|+++++++++.+||+|+|++..|.||++.++....+. .++...++++.+++..+...+. .++ T Consensus 206 l~d~-~~~~~~i~~~l~~~~~~~~d~a~l~G~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~ 282 (390) T protein:vir:81 206 LSDA-PQLASYMNNRLIRGLKVKEDAEILRGTGANDGLLGLIPQATTYAAPTTIAGATRVDQLRLAMLQASLAEY--NPS 282 (390) T ss_pred HHhH-HHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCcccceeecccccccccccccchhHHHHHHHHHhhccccC--CCC Confidence 9876 58999999999999999999999999999999999998877665544 3445678999999999987764 567 Q ss_pred EEEecHHHHHHHHHHhhccCcccccc---CCCCeecceeeEeeCccccceEEEEehhc-eEEEeecceEEEEecc-cccc Q lcl|NC_016164. 733 SYLTNSTLYGGFKTTEKATSTAQFVL---EPGGTVNGYNVVRSNQVANGDVFFGVWNQ-MIMGMWGALDIQVNPY-ALDK 807 (836) Q Consensus 733 ~~vmnp~~~~~L~~lkd~~g~~~~~~---~~~~~l~G~pVv~s~~~~~~~i~~gD~s~-~~i~~~~~l~i~~~~~-~~~~ 807 (836) +|+|||++|..|+.++|++|+|+|.. +.+++|+|+||++++.+|+++++||||++ |.+++++++.+.++++ .+|. T Consensus 283 ~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~l~G~pv~~~~~~p~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~ 362 (390) T protein:vir:81 283 GIVINPIDWAAIELAKDANNQYLIGNARGTLTPTLWGLPVVATQAMAPGEFLVGAFDLAAQIFDQWDARVEIGYVGEDFQ 362 (390) T ss_pred EEEEcHHHHHHHHHhhcCCCceeecCcccccCceecceeeEEcCCCCCCcEEEEehhceEEEEEecceEEEEecccchhh Confidence 89999999999999999999988743 33468999999999999999999999997 6789999999998875 6799 Q ss_pred cCcEEEEEEEEeccEEEcccceEEEeec Q lcl|NC_016164. 808 SGSVRVTALQDVDVAVRHPEAFCRGNDN 835 (836) Q Consensus 808 ~~~~~~r~~~r~d~~v~~p~Af~~l~~A 835 (836) +|++.||++.|+|+++++|+||++++-| T Consensus 363 ~~~v~~r~~~r~d~~v~~~~a~v~~t~a 390 (390) T protein:vir:81 363 RNMITVLAEERLALVVYRPEALISGSFA 390 (390) T ss_pred cCcEEEEEEEeeccEEecccceEEEEeC Confidence 9999999999999999999999999999 No 21 >protein:vir:104256 Length: 458 # NCBI annotation: major head protein precursor # Family: family:all:27070 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006977;genbank:gi:46401878;genbank:GeneID:2777673 Probab=100.00 E-value=7.3e-54 Score=311.91 Aligned_cols=431 Identities=13% Similarity=0.110 Sum_probs=244.5 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHH Q lcl|NC_016164. 379 QGRKATSSSGPPGAAAATVAPLSHNDNNHMDSSTIDMEAVRAQAAADERSRVASITSLCREHKADDLAQGLIESGASEAD 458 (836) Q Consensus 379 v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~ei~al~~~~~l~e~a~eliee~~t~~e 458 (836) .+-..+...+. ....+.. ...+ ..+. ..........+... .......+.+...+...+.....+ T Consensus 1 ~~~~~~~~~~e--------~~~~e~a-~~~~----~~~~-~~k~~e~~~~~ke~--~~~~l~~~~e~~~k~~~E~~~~le 64 (458) T protein:vir:10 1 MTIDINKLKEE--------LGLGDLA-KSLE----GLTA-AQKAQEAERMRKEQ--EEKELARMNDLVSKAVGEDRKRLE 64 (458) T ss_pred Cccchhhhhhh--------hchhhHH-HHHH----HHHH-HHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHH Confidence 10000000000 0000000 0000 0000 00000000000000 000000011111111111110000 Q ss_pred HHHHHHHHhhhhhhhHHHHHHHHhhhhhhHHHHhhhhhhhhhhhHHHHHHHhhhhhhhhhhhhhhhhhhhhhhHHHHHHH Q lcl|NC_016164. 459 AMRSVLSEIAKRPAAQPATPAAPVRSAQPIAAGGGSADIGLTDKEARSFSFVRAIRAQMMPGDRAAFEAAAFEREVSEAT 538 (836) Q Consensus 459 ~~~~~l~~l~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~~~~~a~~~ 538 (836) ......+.+.+...+.........+.................................... ................. T Consensus 65 ~~~ee~k~l~ee~~~~~~~~a~~~e~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~-~~~~~~~~~e~~~~~~~- 142 (458) T protein:vir:10 65 EALELVKSLDEKSKKSNELFAQTVEKQQETIVGLQDEIKSLLTAREGRSFVGDSVAKALYG-TQENFEDEVEKLVLLSY- 142 (458) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhccchh-hhhhHHHHHHHHHHHHH- Confidence 0001111111111111000000000000000000000000000000000000000000000 00000000000000000 Q ss_pred HHHhhhhhhhhhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEecCCceee Q lcl|NC_016164. 539 AQRMGVTPRGILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQTGAATAY 618 (836) Q Consensus 539 ~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~~~~~a~ 618 (836) ..+.+. ... ........+....++...|+.++|+.+...|++.+++.+++++++. +++..++.+.+|+.++.+.+. T Consensus 143 ~~~~~~-~~~--~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~-~~~~~~~~~~~~~~~~~~~a~ 218 (458) T protein:vir:10 143 VMEKGV-FET--EHGQRHLKAVNQSSSVEVSSESYETIFSQRIIRDLQKELVVGALFE-ELPMSSKILTMLVEPDAGKAT 218 (458) T ss_pred HHhhcc-chh--hhhhhhhhhhhhcccCccccceehhhHhHHHHHHHHhhhhHHhhcc-eeecCCcceEEEEecCCccee Confidence 000000 000 0000111122233344567888999999999999999999999854 566777888999999999999 Q ss_pred eeccCcccccc------cccceeEEeeeeeeeeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCccccc Q lcl|NC_016164. 619 WVAEGGDPTES------QPSVDQVALVAKTLGAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSNSQPE 692 (836) Q Consensus 619 ~v~Eg~~~~~~------~~~~~~it~~~~t~~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~ 692 (836) |++|++.++++ +++|+++++.+++++++++||+++|.|+.+++.++|.+.|+++++++++.+||+|+|+ ++|+ T Consensus 219 ~v~e~~~~~~~~~~~~~~~~~~~i~~~~~k~~~~v~is~ell~ds~~~~~~~i~~~l~~~i~~~~d~~~l~G~G~-~~p~ 297 (458) T protein:vir:10 219 WVAASTYGTDTTTGEEVKGALKEIHFSTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMTGDGS-GKPK 297 (458) T ss_pred ecccccccccccccccccccceeeEeeeeeEEeeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcCCCC-Cccc Confidence 99999888754 5689999999999999999999999999999999999999999999999999999997 6899 Q ss_pred cccccccccccc-------ccccchhHHHHHHHHHHHhhhccccCccEEEecHHHHHHHHHHhhccCcccccc------- Q lcl|NC_016164. 693 GLKFVTGINTEN-------FGATNPTYVELVSMESKVAADNADIGAMSYLTNSTLYGGFKTTEKATSTAQFVL------- 758 (836) Q Consensus 693 Gi~~~~~~~~~t-------~aa~~~t~~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~~g~~~~~~------- 758 (836) ||++.+...... ...+.+++++|.++++.+...+. .+++|+|||.+|..|..++|++|+|++.. T Consensus 298 Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~--~~~~~v~~~~~~~~l~~lkd~~G~~i~~~~~~~~~~ 375 (458) T protein:vir:10 298 GLLTLASEDSAKVVTEAKADGSVLVTAKTISKLRRKLGRHGL--KLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDSVKL 375 (458) T ss_pred eeeecccccccceeecccccccccccHHHHHHHHHhhhhhhc--CCCEEEEcHHHHHHHHhhcccCCceeeccccccccc Confidence 999876543221 12344689999999999987764 57899999999999999999999987532 Q ss_pred -CCCCeecceeeEeeCccccc----eEEEEehhc-eEEEeecceEEEEecccccccCcEEEEEEEEeccEEEcccceEEE Q lcl|NC_016164. 759 -EPGGTVNGYNVVRSNQVANG----DVFFGVWNQ-MIMGMWGALDIQVNPYALDKSGSVRVTALQDVDVAVRHPEAFCRG 832 (836) Q Consensus 759 -~~~~~l~G~pVv~s~~~~~~----~i~~gD~s~-~~i~~~~~l~i~~~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l 832 (836) +.+++|+|+||++++.||.+ .++||||+. |.++++.++++..+++ +.+|++.|+++.|+|+.+++|+||+++ T Consensus 376 ~~~~~~l~G~pv~~~~~~p~~~~~~~~~~~~f~~~~~~~~~~~~~v~~d~~--~~~~~~~~~~~~r~~~~v~~~~a~v~~ 453 (458) T protein:vir:10 376 QGQVGRIYGLPVVVSEYFPAKANSAEFAVIVYKDNFVMPRQRAVTVERERQ--AGKQRDAYYVTQRVNLQRYFANGVVSG 453 (458) T ss_pred cCcCceecceeeEEccccccccCCcceEEEEecccEEEEEeeceEEEeecc--cCCCceEEEEEEEecceEecccceEEE Confidence 33468999999999999864 589999964 7899999999988776 568999999999999999999999999 Q ss_pred eecC Q lcl|NC_016164. 833 NDNL 836 (836) Q Consensus 833 ~~A~ 836 (836) +.|= T Consensus 454 ~~aa 457 (458) T protein:vir:10 454 TYAA 457 (458) T ss_pred eecc Confidence 9888 No 22 >protein:vir:4339 Length: 395 # NCBI annotation: major head protein # Family: family:all:585 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061502;genbank:gi:9635591;genbank:GeneID:1262860 Probab=100.00 E-value=6.2e-54 Score=312.29 Aligned_cols=384 Identities=16% Similarity=0.139 Sum_probs=245.0 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhHHHHHHHHhhhhhhHHHH Q lcl|NC_016164. 412 TIDMEAVRAQAAADERSRVASITSLCREHKADDLAQGLIESGASEADAMRSVLSEIAKRPAAQPATPAAPVRSAQPIAAG 491 (836) Q Consensus 412 ~~~~e~~~~~~~~~~~~~~~ei~al~~~~~l~e~a~eliee~~t~~e~~~~~l~~l~~~~~a~~~~~~~~~~~~~~~~~~ 491 (836) +.+.++.......+......++.++.. ...+.++......+......+++.............. T Consensus 1 m~~~~k~l~el~~~~~~~~~~~~~~~e------~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~---------- 64 (395) T protein:vir:43 1 MSDFEKQIGELNASLKQVGDQIKSQAE------QVNTQIANFGEMNKETRAKVDELLTAQGELQARLSAA---------- 64 (395) T ss_pred ChhHHHHHHHHHHHHHHHHHHHHHHHH------HHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHH---------- Confidence 111111111111111111111111100 0000000000001111111111111100000000000 Q ss_pred hhhhhhhhhhhHHHHHHHhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhccccccccc Q lcl|NC_016164. 492 GGSADIGLTDKEARSFSFVRAIRAQMMPGDRAAFEAAAFEREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTASAAGDL 571 (836) Q Consensus 492 ~~~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~~ 571 (836) ...... ...... ......... .........+.+.....+. ......+...+.+.. .++. T Consensus 65 -----------~~~~~~--~~~~~~---~~~~~~~~~--~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~-~~g~ 123 (395) T protein:vir:43 65 -----------EQAMLA--NEKRDG---GEEAPKTAG--QMVAESLKEQGVTSSLRGS--HRVSMPRSAITSIDG-SGGA 123 (395) T ss_pred -----------HHHHHh--hhcccc---ccchhhhHH--HHHHHHHHHHHHHHHhhhh--hhhhhhhhhhcccCC-CCcc Confidence 000000 000000 000000000 0000001111111111111 111122223333333 4455 Q ss_pred ccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEecC-CceeeeeccCcccccccccceeEEeeeeeeeeeehhH Q lcl|NC_016164. 572 VFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQTG-AATAYWVAEGGDPTESQPSVDQVALVAKTLGAYTEFS 650 (836) Q Consensus 572 vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~~-~~~a~~v~Eg~~~~~~~~~~~~it~~~~t~~~~i~IS 650 (836) ++|+.+...|++.+++.++|++++. +.+..++.+++|+.++ .+.+.|++|++++++++++|+++++++++++++++|| T Consensus 124 ~vp~~~~~~ii~~~~~~~~l~~l~~-~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~i~~~~~k~~~~~~is 202 (395) T protein:vir:43 124 LVAPDRRPGVVAAPQRRLTIRDLVA-PGTTESNSVEYVRETGFVNNAAPVSEGTQKPYSDLTFELENAPVRTIAHLFKAS 202 (395) T ss_pred ccchhhHHHHHHHHHhhhhHHhhcc-ceecCCCceEEEEEecCCCceeeecCCccccccccceeEEEEeeeeEEEeehhh Confidence 5666788899999999999999955 4455566789998766 4689999999999999999999999999999999999 Q ss_pred HHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCccccccccccccccccccc---ccchhHHHHHHHHHHHhhhcc Q lcl|NC_016164. 651 RRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTENFG---ATNPTYVELVSMESKVAADNA 727 (836) Q Consensus 651 relL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~a---a~~~t~~~l~~a~~~l~~~~~ 727 (836) +++|.++ .++.++|.+.|+++++++++.+||+|+|+++.|.||++..+..+...+ .+...++++.+++..+..++. T Consensus 203 ~ell~d~-~~l~~~v~~~la~a~~~~~d~~~l~G~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~ 281 (395) T protein:vir:43 203 RQILDDA-SALQSYIDARARYGLMLVEECQLLYGNGTGANLHGIIPQAQAYAPPSGVVVTAEQRIDRIRLAILQAQLAEF 281 (395) T ss_pred HHHHHhH-HHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccccccccccchhHHHHHHHHHhhccccC Confidence 9999865 589999999999999999999999999999999999988776554432 334568899999999987764 Q ss_pred ccCccEEEecHHHHHHHHHHhhccCccccc---cCCCCeecceeeEeeCccccceEEEEehhc-eEEEeecceEEEEecc Q lcl|NC_016164. 728 DIGAMSYLTNSTLYGGFKTTEKATSTAQFV---LEPGGTVNGYNVVRSNQVANGDVFFGVWNQ-MIMGMWGALDIQVNPY 803 (836) Q Consensus 728 ~~~~~~~vmnp~~~~~L~~lkd~~g~~~~~---~~~~~~l~G~pVv~s~~~~~~~i~~gD~s~-~~i~~~~~l~i~~~~~ 803 (836) .+++|+|||.+|..|+.++|++|+++|. .+.+++|+|+||++++.+|+++++||||++ |.++++.++.+.++++ T Consensus 282 --~~~~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~~l~G~pVv~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~ 359 (395) T protein:vir:43 282 --PASGIVLNPIDWALIELNKDAENRYIIGSPQNGTTPTLWRLPVVETQAITQDEFLTGAFSLGAQIFDRMDIEVLVSTE 359 (395) T ss_pred --CCcEEEEcHHHHHHHHHhhccCCceeccccccCCCceecceeeEEcCCCCCCcEEEEeccceEEEEEecceEEEEecc Confidence 4678999999999999999999998874 234568999999999999999999999997 6788899999988876 Q ss_pred c--ccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 804 A--LDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 804 ~--~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) . +|.+|++.||++.|+|+++++|+||++++.+- T Consensus 360 ~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~~~~ta 394 (395) T protein:vir:43 360 NDKDFENNMVTIRAEERLAFAVYRPEAFVTGSLTA 394 (395) T ss_pred ccchhhcCcEEEEEEEeeccEEecccceEEEEecc Confidence 4 58999999999999999999999999998777 No 23 >protein:vir:8102 Length: 543 # NCBI annotation: gp6 # Family: family:all:21 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817683;genbank:gi:29566114;genbank:GeneID:1259308 Probab=100.00 E-value=3.6e-53 Score=308.10 Aligned_cols=467 Identities=13% Similarity=0.104 Sum_probs=247.1 Q ss_pred cCccceeeeeeEeeccccccCCCCeEEEEEEEEEEEEEEeccCccchh-----hhhhhhhhh-----------hhhhhhh Q lcl|NC_016164. 331 SGTIRNVSFMYSIDAPLDLTSREGMALVTAFTPMEVSAVSIPADHTVG-----QGRKATSSS-----------GPPGAAA 394 (836) Q Consensus 331 ~G~l~~~SiG~~v~~~~~~~~~~~~~~~~~~~l~EiS~V~~pA~~~a~-----v~~~~~~~~-----------~~~~~~~ 394 (836) -. .+-++|-.|... +++....-. .+..... T Consensus 1 ~~----------------------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 46 (543) T protein:vir:81 1 MN----------------------------------TLDTLPVHPRTGLRAIGMGKRGPIWPVMGASDDHKDDAPTLTYS 46 (543) T ss_pred CC----------------------------------ccccCcCChhHHHHHHHhhccCccchhcccccchhhhhhhhhhh Confidence 01 112333333322 111111100 0000000 Q ss_pred hhhhhhhh----hhhhhhhhhh---------------hhhhhhhhhhh-hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhh Q lcl|NC_016164. 395 ATVAPLSH----NDNNHMDSST---------------IDMEAVRAQAA-ADERSRVASITSLCREHKADDLAQGLIESGA 454 (836) Q Consensus 395 ~~~~~~~~----~~~~~~~~~~---------------~~~e~~~~~~~-~~~~~~~~ei~al~~~~~l~e~a~eliee~~ 454 (836) +....... ..+....... ...+....... .....+..+..+..............++... T Consensus 47 ~~~~~~~e~~~~~e~l~~~~~~~~~e~~~~~~~~~e~~el~~~~~~l~~~e~~~~~~e~~~~~~~~~~~~~~e~r~e~~a 126 (543) T protein:vir:81 47 QARNRADEVHARMEQIAELDKPTDEENEEFRALGAEFDSLVNHMSRLERAAELARVRSTHEQIGKPQSGGQRRMRVEAGS 126 (543) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhh Confidence 00000000 0000000000 00000000000 0000000000000000000000000000000 Q ss_pred hHHHHHHHHHHH---------hhhhhhhHHHHHHHHhhhhhhHHHHhhhhhhhhhhhHHHHHHHhh-hhh---hhhhhhh Q lcl|NC_016164. 455 SEADAMRSVLSE---------IAKRPAAQPATPAAPVRSAQPIAAGGGSADIGLTDKEARSFSFVR-AIR---AQMMPGD 521 (836) Q Consensus 455 t~~e~~~~~l~~---------l~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-a~~---a~~~~~~ 521 (836) ..........+. +..+............+.................+.......... ... ....... T Consensus 127 ~~~~~~~~~~~~~~~~~~l~e~~~~~~~~~~e~k~~~e~~~~e~~e~~~~~~~~~e~l~~~~e~~~~~~~~~~~~~d~~e 206 (543) T protein:vir:81 127 SQGGRGDYDRDAILEPDSIEDCRFRDPWNLSEMRTFGRDAEEVKGELRARALSAIEKMQGASDNVRAAATKIIERFDDED 206 (543) T ss_pred HHHhhHHHHHhhhccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 000000000000 000000000000000000000000000000000000000000000 000 0000000 Q ss_pred hhhh--hhhhhhHHHHHHHHHHhhhhhhhhhhhh--hhhhhhhhhcccccccccccchhhHHHH-HHHHHhhhhhhhhcc Q lcl|NC_016164. 522 RAAF--EAAAFEREVSEATAQRMGVTPRGILAPN--DVLHRDLVVDTASAAGDLVFTDGRPGSF-IELLRNRLALNTLGV 596 (836) Q Consensus 522 ~~~~--~~~~~~~~~a~~~~~~~g~~~~g~~~~~--~~~~~a~~~~~~~~~g~~vvp~~~~~~i-i~~l~~~~~l~~l~~ 596 (836) .... ...........+..+............. ...........+.+.|++++|..+...+ +..++..+++++++. T Consensus 207 ~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~l~~~e~~~~~~~~~~~~t~~~gg~lip~~~~~~ii~~~~~~~~~l~~~~~ 286 (543) T protein:vir:81 207 STLARQCLATSSPAYLRAWSKMARNPHAAILTEEEKRAINEVRAMGLTKADGGYLVPFQLDPTVIITSNGSLNDIRRFAR 286 (543) T ss_pred HHHhhhhhhhhhhhhhhHHHHHHHhhHHHHhhhhhhhhhhhhhhcccccccCcccCchhhhhHHHHHHHhhhchhhhhcc Confidence 0000 0000000001111111100000000000 1111112233445567777887777665 476788889998854 Q ss_pred eeeecCCceEEEEEecCCceeeeeccCcccccccccceeEEeeeeeeeeeehhHHHHHhcchhHHHHHHHHHHHHHHHHH Q lcl|NC_016164. 597 TMLTGLQGPVAIPRQTGAATAYWVAEGGDPTESQPSVDQVALVAKTLGAYTEFSRRLMLQSSIDVEQMVRTELATVIALE 676 (836) Q Consensus 597 ~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~it~~~~t~~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~ 676 (836) . . ..++.+.+|+.++++.+.|++||+.++.++++|+++++.+++++++++||+++|.|+ +++.++|.+.|+++++++ T Consensus 287 ~-~-~~~g~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~-~~~~~~i~~~l~~~~~~~ 363 (543) T protein:vir:81 287 Q-V-VATGDVWHGVSSAAVQWSWDAEFEEVSDDSPEFGQPEIPVKKAQGFVPISIEALQDE-ANVTETVALLFAEGKDEL 363 (543) T ss_pred c-c-cCCcceEEEEecCCcceeecccCccccccccccceeeeeeeeeEeeehhhHHHHhcc-HHHHHHHHHHHHHHHHHH Confidence 3 3 346789999999999999999999999999999999999999999999999999876 799999999999999999 Q ss_pred HHHHHHhhcCCccccccccccccccc---ccccccchhHHHHHHHHHHHhhhccccCccEEEecHHHHHHHHHHhhccCc Q lcl|NC_016164. 677 IDRAALYGLGSNSQPEGLKFVTGINT---ENFGATNPTYVELVSMESKVAADNADIGAMSYLTNSTLYGGFKTTEKATST 753 (836) Q Consensus 677 ~d~~il~G~Gt~~~p~Gi~~~~~~~~---~t~aa~~~t~~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~~g~ 753 (836) ++.+||+|+|++++|.||++...... .+..++.++++++.+++..+...+. .+++|+|||.+|..|+.++|++|+ T Consensus 364 ~d~ail~G~Gt~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~--~~~~~v~n~~~~~~l~~lkd~~G~ 441 (543) T protein:vir:81 364 EAVTLTTGTGQGNQPTGIVTALAGTAAEIAPVTAETFALADVYAVYEQLAARHR--RQGAWLANNLIYNKIRQFDTQGGA 441 (543) T ss_pred HHHHHhccCCCCcccccchhhcccccccccccccccccHHHHHHHHHhhhcccc--CCcEEEEcHHHHHHHHHhhcCCCc Confidence 99999999999999999987654322 2344556789999999999987764 467999999999999999999999 Q ss_pred ccccc---CCCCeecceeeEeeCccccc----------eEEEEehhceEEEeecceEEEEecccc----cccCcEEEEEE Q lcl|NC_016164. 754 AQFVL---EPGGTVNGYNVVRSNQVANG----------DVFFGVWNQMIMGMWGALDIQVNPYAL----DKSGSVRVTAL 816 (836) Q Consensus 754 ~~~~~---~~~~~l~G~pVv~s~~~~~~----------~i~~gD~s~~~i~~~~~l~i~~~~~~~----~~~~~~~~r~~ 816 (836) |+|.+ +.+++|+|+||+++++||.+ .++||||++|.+++++++++.++++.+ |.+|++.|+++ T Consensus 442 ~l~~~~~~g~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~i~~gd~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~ 521 (543) T protein:vir:81 442 GLWTTIGNGEPSQLLGRPVGEAEAMDANWNTSASADNFVLLYGNFQNYVIADRIGMTVEFIPHLFGTNRRPNGSRGWFAY 521 (543) T ss_pred eeccCcCCCCCccccceeeEEeccccccccccccCCcceEEEeeccceeEEeecccEEEEeccccccchhhcCceEEEEE Confidence 98743 34568999999999998854 289999999999999999999998754 56789999999 Q ss_pred EEeccEEEcccceEEEeecC Q lcl|NC_016164. 817 QDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 817 ~r~d~~v~~p~Af~~l~~A~ 836 (836) .|+|+++++|+||++++.+- T Consensus 522 ~r~d~~v~~~~A~~~l~~~~ 541 (543) T protein:vir:81 522 YRMGADVVNPNAFRLLNVET 541 (543) T ss_pred EeeccEeecccceEEEEecc Confidence 99999999999999999888 No 24 >protein:vir:6212 Length: 434 # NCBI annotation: prohead protease # Family: family:all:21 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852592;genbank:gi:31415852;genbank:GeneID:1489210 Probab=100.00 E-value=3.5e-54 Score=313.68 Aligned_cols=409 Identities=13% Similarity=0.162 Sum_probs=239.6 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhHHHHHHHHhhhhhh--HHH Q lcl|NC_016164. 413 IDMEAVRAQAAADERSRVASITSLCREHKADDLAQGLIESGASEADAMRSVLSEIAKRPAAQPATPAAPVRSAQP--IAA 490 (836) Q Consensus 413 ~~~e~~~~~~~~~~~~~~~ei~al~~~~~l~e~a~eliee~~t~~e~~~~~l~~l~~~~~a~~~~~~~~~~~~~~--~~~ 490 (836) +...+..............++.+......+ ..+..+......+.....++.+.+.................. ... T Consensus 1 M~l~el~~~~~~~~~~~~a~l~~~~~~~~~---~~ee~~~~~~e~~~l~~~~~~l~~~i~~le~~~~~~~~~~~~~~~~~ 77 (434) T protein:vir:62 1 MNLKEILNASLTRTKSRLAELQGKVEKNEV---RSEELAAVKAEVEQLTKEIQTISEELAKLEEKEKEEDPAKKKDDDPE 77 (434) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHhccCc---cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcchhh Confidence 000000000000000000111110000000 000000000000000011111111100000000000000000 000 Q ss_pred HhhhhhhhhhhhHHHHHHHhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhcccccccc Q lcl|NC_016164. 491 GGGSADIGLTDKEARSFSFVRAIRAQMMPGDRAAFEAAAFEREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTASAAGD 570 (836) Q Consensus 491 ~~~~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~ 570 (836) ...... ...............+......................+..+.+.....+.. .....++ .+..++.|+ T Consensus 78 ~~~~~~--~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~e~r~a~~~~l~~~~--~~~e~~a--~~~~t~~GG 151 (434) T protein:vir:62 78 KKEDPT--AKENPNEKTELSEEQRSAISASIAAALSTKGHRTNKETEIRSVFANYIVGNI--DEKEARA--LGLVTGNGS 151 (434) T ss_pred hhcchh--hhcchhhhHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHHHHHHhcccc--chhhhhh--hcccccccc Confidence 000000 0000000000000000000000000000000000000111111111111110 0111122 223345678 Q ss_pred cccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEecCCceeeee---ccCcccccccccceeEEeeeeeeeeee Q lcl|NC_016164. 571 LVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQTGAATAYWV---AEGGDPTESQPSVDQVALVAKTLGAYT 647 (836) Q Consensus 571 ~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~~~~~a~~v---~Eg~~~~~~~~~~~~it~~~~t~~~~i 647 (836) +++|+.+.+.|++.+++.+++++++..+ ++ ++.+++|+....+.+.|+ +|++.++.++++|+++++.++++++++ T Consensus 152 ~lvP~~~~~~Ii~~l~~~~~i~~~~~~~-~~-~~~~~~p~~~~~~~a~~~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~ 229 (434) T protein:vir:62 152 VTIPDFLSKEIITYAQEENFLRRLGTGV-KT-KENIKYPVLVKKAEAQGHKNERTNNEMPETDIEFDEIELSPTEFDALA 229 (434) T ss_pred eecchhhHHHHHHhhhhhhhhhhhccee-cc-CCceEEEEEecCCcccceecccccccccccccceeeEEeeheeeEeeh Confidence 8899999999999999999999997654 33 356888888777776665 567788999999999999999999999 Q ss_pred hhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccccccccccchhHHHHHHHHHHHhhhcc Q lcl|NC_016164. 648 EFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTENFGATNPTYVELVSMESKVAADNA 727 (836) Q Consensus 648 ~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~aa~~~t~~~l~~a~~~l~~~~~ 727 (836) +||+++|.|+.++++++|.+.|+++++++++.+||+|+|+++.+.|+++..++... +.+..++++|.+++.++...+. T Consensus 230 ~iS~ell~ds~~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~~~~g~~~~~~~~~~--~~~~~~~d~l~~l~~~l~~~~~ 307 (434) T protein:vir:62 230 TVTKKLLARTGLPIEQIVMDELKKAYVRKETQYMVNGDEANNINDGALAKKAVEFK--TDEKNLYDALVKMKNTPVKEVR 307 (434) T ss_pred hhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccceeeccccccc--ccccchhhHHHHHHhhcchhhh Confidence 99999999999999999999999999999999999999999888999876665433 3345689999999999987764 Q ss_pred ccCccEEEecHHHHHHHHHHhhccCcccccc------CCCCeecceeeEeeCccccce------EEEEehhceEEEeec- Q lcl|NC_016164. 728 DIGAMSYLTNSTLYGGFKTTEKATSTAQFVL------EPGGTVNGYNVVRSNQVANGD------VFFGVWNQMIMGMWG- 794 (836) Q Consensus 728 ~~~~~~~vmnp~~~~~L~~lkd~~g~~~~~~------~~~~~l~G~pVv~s~~~~~~~------i~~gD~s~~~i~~~~- 794 (836) .+++|+|||.+|..|++++|++|+|+|.+ +.+.+|+|+||++++.+|.+. ++||||++|+++++. T Consensus 308 --~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~g~~~tl~G~pV~~~~~~~~~~~~~~~~i~~Gdfs~~~i~~~~g 385 (434) T protein:vir:62 308 --KKARWVLNTAALTKIETMKTDDGFPLLRPFNQAEGGIGYTLLGFPVEEEDAIDIPDSPDTPVFYFGDFSKFYIQDVIG 385 (434) T ss_pred --cCCEEEEcHHHHHHHHHhhccCCCEeeccCCCccCCCCceecceeeEEecCccCccCCCceEEEEeeccceEEEEeec Confidence 57899999999999999999999998743 234589999999999998543 789999999998874 Q ss_pred ceEEEEecccccccCcEEEEEEEEeccEEEc-ccceEEEe----ecC Q lcl|NC_016164. 795 ALDIQVNPYALDKSGSVRVTALQDVDVAVRH-PEAFCRGN----DNL 836 (836) Q Consensus 795 ~l~i~~~~~~~~~~~~~~~r~~~r~d~~v~~-p~Af~~l~----~A~ 836 (836) .+++..+.+.+|.+|+|.||++.|+|+++++ |.++++++ .|. T Consensus 386 ~~~i~~~~~~~~~~~~v~~~~~~r~Dgk~i~~~~~~~~~~~~~~~~~ 432 (434) T protein:vir:62 386 SLEVQKLVELFSRTNRVGFRIWNLLDAQLIHSPFEVPVYKYVLKAPT 432 (434) T ss_pred eeEEEeehhhhcccCceEEEEEeeecceeecCcccceEEEEEeccCC Confidence 6889999999999999999999999999775 88887764 444 No 25 >protein:vir:81227 Length: 413 # NCBI annotation: gp6, major capsid protein # Family: family:all:585 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456736;genbank:gi:157168379;hssp:P49861;interpro:IPR006444;uniprot:Q9MBJ9;genbank:GeneID:5580350 Probab=100.00 E-value=3.4e-53 Score=308.23 Aligned_cols=389 Identities=16% Similarity=0.109 Sum_probs=243.3 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhHHHHHHHHhhhhhhHHHHh Q lcl|NC_016164. 413 IDMEAVRAQAAADERSRVASITSLCREHKADDLAQGLIESGASEADAMRSVLSEIAKRPAAQPATPAAPVRSAQPIAAGG 492 (836) Q Consensus 413 ~~~e~~~~~~~~~~~~~~~ei~al~~~~~l~e~a~eliee~~t~~e~~~~~l~~l~~~~~a~~~~~~~~~~~~~~~~~~~ 492 (836) +-.+. ......+......++.++. ++.....+......+++.................. T Consensus 1 ~~ke~-~~~~~~~~~~~~~e~~~~~-------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------- 59 (413) T protein:vir:81 1 MVKEA-GDAPTNAQVAEIAEVKSMV-------------EQFKADEDAKRERAKSVKANQDFLRELQEATAGSV------- 59 (413) T ss_pred ChhhH-HHHHHHHHHHHHHHHHHHH-------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHH------- Confidence 00000 0000001111111111111 11000000011000100000000000000000000 Q ss_pred hhhhhhhhhhHHHHHHHhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhh-hhhhhhhhhhhhhccccccccc Q lcl|NC_016164. 493 GSADIGLTDKEARSFSFVRAIRAQMMPGDRAAFEAAAFEREVSEATAQRMGVTPRGI-LAPNDVLHRDLVVDTASAAGDL 571 (836) Q Consensus 493 ~~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~~~~~a~~~~~~~g~~~~g~-~~~~~~~~~a~~~~~~~~~g~~ 571 (836) ... ................... ....... ........ ..+...... ..............+..+.++. T Consensus 60 -~~~--~~~~~~~~~~~~~~~~~~~--~~~~~~~-----~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 128 (413) T protein:vir:81 60 -DSE--KSGELTRKGEGYKSIGEFF--AKRAGDQ-----IKQQAGGA-QLNYSVGEYVAPRVKAASDPASTATLTDEFQG 128 (413) T ss_pred -hHH--HhhhHhhhhhhhhhhhhhh--hhhhhhH-----HHHHHHHH-HhhhhhhhhhhhHHHhhhhhhhhccccccccc Confidence 000 0000000000000000000 0000000 00000000 000000000 0000111111223344566777 Q ss_pred ccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEecCC----ceeeeeccCccccccc-ccceeEEeeeeeeeee Q lcl|NC_016164. 572 VFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQTGA----ATAYWVAEGGDPTESQ-PSVDQVALVAKTLGAY 646 (836) Q Consensus 572 vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~~~----~~a~~v~Eg~~~~~~~-~~~~~it~~~~t~~~~ 646 (836) ++|+.+...|++.+++.+++++++. +.+..++.+.+|+.+.. ..+.|++||+++++++ ++|+++++.+++++++ T Consensus 129 ~vp~~~~~~ii~~~~~~~~l~~~~~-~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~i~~~~~k~~~~ 207 (413) T protein:vir:81 129 GYGTTWNRNIIYRRREKLVVADLMD-NLTMTNTTIKYLMEKANRVVEGGFKTVAEGGKKPYMRFADFDIVTESLSKIAGL 207 (413) T ss_pred ccchhhHHHHHHHHhhhhhHHhhcc-eeeccCCceeEEEeccccccccccceecCcccccccCcccceeeEeeeeeEEEe Confidence 8899999999999999999999854 55666677888877653 4579999999999887 6899999999999999 Q ss_pred ehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccccccccccchhHHHHHHHHHHHhhhc Q lcl|NC_016164. 647 TEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTENFGATNPTYVELVSMESKVAADN 726 (836) Q Consensus 647 i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~aa~~~t~~~l~~a~~~l~~~~ 726 (836) ++||+++|.|+. .++++|...|+++++++++.+||+|+|++..|.||++.++..+.+...+...++++.+++..+...+ T Consensus 208 ~~iS~ell~ds~-~l~~~i~~~la~~~~~~~d~~~l~G~G~~~~~~Gi~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~ 286 (413) T protein:vir:81 208 TKITDEMIEDYD-FLVSYINARLLEELAIEEERQLLLGDGTGNNLTGLLKRDGIQTLAVSNKDELADSIYKAMTNISLAT 286 (413) T ss_pred ehhhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhccCCCCCcccccccccccccccccccchhHHHHHHHHHHhhhhc Confidence 999999998774 6999999999999999999999999999988999999998888777666667788888888776655 Q ss_pred cccCccEEEecHHHHHHHHHHhhccCccccccC-----------CCCeecceeeEeeCccccceEEEEehhc-eEEEeec Q lcl|NC_016164. 727 ADIGAMSYLTNSTLYGGFKTTEKATSTAQFVLE-----------PGGTVNGYNVVRSNQVANGDVFFGVWNQ-MIMGMWG 794 (836) Q Consensus 727 ~~~~~~~~vmnp~~~~~L~~lkd~~g~~~~~~~-----------~~~~l~G~pVv~s~~~~~~~i~~gD~s~-~~i~~~~ 794 (836) ... +..|+|||.+|.+|+++||++|+|+|... ..++|+|+||++++++|++.++||||+. |.+++++ T Consensus 287 ~~~-~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~~~l~G~pv~~s~~~~~~~~~~gd~~~~~~~~~~~ 365 (413) T protein:vir:81 287 PFQ-ADALVINPLDYQELRLAKDANGQYYGGGVFQGQYGSGGIMLDPAPWGLRTVQSQVVPVGKPVVGAFRSAASVLRKG 365 (413) T ss_pred cCC-CcEEEEcHHHHHHHHHhhccCCceeccccccccccccccccCceecceeeEEcCCCCcccEEEEecccEEEEEEec Confidence 443 44699999999999999999999887321 2347999999999999999999999996 7788899 Q ss_pred ceEEEEeccc--ccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 795 ALDIQVNPYA--LDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 795 ~l~i~~~~~~--~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) ++++.++++. +|.+|++.||+++|+|+++.+|+||++++.+= T Consensus 366 ~~~v~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~ 409 (413) T protein:vir:81 366 GVRIDSTNTNVDDFENNLITVRAEERVGLMVTFPEAIVQLDVAE 409 (413) T ss_pred ceEEEEeccccchhhcCcEEEEEEEeeccEEecccceEEEEecC Confidence 9999998875 58999999999999999999999999988554 No 26 >protein:vir:4511 Length: 409 # NCBI annotation: capsid # Family: family:all:21 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599037;genbank:gi:19548995;genbank:GeneID:935211 Probab=100.00 E-value=6.8e-54 Score=312.06 Aligned_cols=392 Identities=12% Similarity=0.106 Sum_probs=248.9 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhHHHHHHHHhhhhhhHH Q lcl|NC_016164. 410 SSTIDMEAVRAQAAADERSRVASITSLCREHKADDLAQGLIESGASEADAMRSVLSEIAKRPAAQPATPAAPVRSAQPIA 489 (836) Q Consensus 410 ~~~~~~e~~~~~~~~~~~~~~~ei~al~~~~~l~e~a~eliee~~t~~e~~~~~l~~l~~~~~a~~~~~~~~~~~~~~~~ 489 (836) +...+..+.+.. ...+++++.....-.. +.++.....+.....++.+..+................... T Consensus 1 M~l~eL~e~r~~-------l~~e~~~l~~k~~~~~----~t~e~~~~~~~~~~e~~~l~~~i~~~e~~~~~~~~~~~~~~ 69 (409) T protein:vir:45 1 MKLHELKQKRNT-------IATDMRALNEKIGDNA----WTEEQRTEWNKAKSELEALDERIAREEELRRQDQAYIESNE 69 (409) T ss_pred CCHHHHHHHHHH-------HHHHHHHHHHHhhcCC----CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh Confidence 111111110000 0111111111110000 00011111111111111111111111000000000000000 Q ss_pred HHhhhhhhhhhhhHHHHHHHhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhccccccc Q lcl|NC_016164. 490 AGGGSADIGLTDKEARSFSFVRAIRAQMMPGDRAAFEAAAFEREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTASAAG 569 (836) Q Consensus 490 ~~~~~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g 569 (836) . ..+....+....... .........+++.......... ............++...| T Consensus 70 ~---------------------~~~~~~~~~~~~~~~--~~~~~a~~~~l~~~~~~~~~~e-~~~~~~~~a~~~~~~~~g 125 (409) T protein:vir:45 70 E---------------------EQRQNLDPENNSQQD--EKRAQVFDKWMRHGASELTSEE-RKALRELRAQGVAQDEKG 125 (409) T ss_pred h---------------------hhcccCCCCCcchhh--HHHHHHHHHHHHhhhhhccHHH-HHHHHHHhhccCccCcCC Confidence 0 000000000000000 0000000011110000000000 001111222333445567 Q ss_pred ccccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEecCC-ceeeeeccCcccccccccceeEEeeeeeee-eee Q lcl|NC_016164. 570 DLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQTGA-ATAYWVAEGGDPTESQPSVDQVALVAKTLG-AYT 647 (836) Q Consensus 570 ~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~~~-~~a~~v~Eg~~~~~~~~~~~~it~~~~t~~-~~i 647 (836) ++++|+.+.+.|++.+++.++|++++..+.......+.++...+. ..+.|++|++++++++++|+.+++.+++++ +++ T Consensus 126 g~liP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~f~~~~l~~~k~~~~~i 205 (409) T protein:vir:45 126 GYTVPETFLAKVVEKMKSYGGIASVAQILTTSDGRTMEWATADGTSEVGVLLGENEEAGEEDTDFGMGSLGALKMTSKII 205 (409) T ss_pred ceeccHhHHHHHHHHHHhhhhhhhhceeeecCCCceEEEEeeccCccccccccccccccccccccceeeeeeeeeeeeeh Confidence 888999999999999999999999865554444445666666654 457899999999999999999999999885 678 Q ss_pred hhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCc--ccccccccccccccccccccchhHHHHHHHHHHHhhh Q lcl|NC_016164. 648 EFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSN--SQPEGLKFVTGINTENFGATNPTYVELVSMESKVAAD 725 (836) Q Consensus 648 ~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~--~~p~Gi~~~~~~~~~t~aa~~~t~~~l~~a~~~l~~~ 725 (836) +||+++|.|+.++++++|.+.|+++++++++.+||+|+|++ .+|+||++.......+..++.+++++|.+++..|... T Consensus 206 ~is~ell~ds~~~l~~~i~~~la~a~~~~~~~a~l~G~G~~~~~~p~Gil~~~~~~~~~~~~~~~~~d~i~~l~~~l~~~ 285 (409) T protein:vir:45 206 RVSNELLQDSAIDMEAYLARRIAERIGRGEARYLIQGTGAGTPKQPKGLAASVTGTTQTAAANAVKWQEILALKHSIDPA 285 (409) T ss_pred hhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCccccceeeeccccccccccccccchHHHHHHHHhhhhh Confidence 99999999999999999999999999999999999999874 5799999887776677777788999999999999988 Q ss_pred ccccCccEEEecHHHHHHHHHHhhccCcccccc----CCCCeecceeeEeeCcccc-----ceEEEEehhceEEEeecce Q lcl|NC_016164. 726 NADIGAMSYLTNSTLYGGFKTTEKATSTAQFVL----EPGGTVNGYNVVRSNQVAN-----GDVFFGVWNQMIMGMWGAL 796 (836) Q Consensus 726 ~~~~~~~~~vmnp~~~~~L~~lkd~~g~~~~~~----~~~~~l~G~pVv~s~~~~~-----~~i~~gD~s~~~i~~~~~l 796 (836) +......+|+||+.++..|+.++|++|+|+|.. +.+.+|+|+||++++++|. ..++||||++|.+..++++ T Consensus 286 ~~~~a~~~~~~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~l~G~PV~~~~~~p~~~~~~~~i~~Gd~~~~~i~~~~~~ 365 (409) T protein:vir:45 286 YRRGPKFRLAFNDNTLKLISEMEDGQGRPLWLPDIVGVAPASVLNVPYVIDQEIDDIGAGKKFMFCGDFDRFIIRRVRYM 365 (409) T ss_pred hccCCeEEEEECHHHHHHHHHhhcCCCceeeccCcCCCCCceecceeeEEecCcCCccCCccEEEEeehhhhheeeccce Confidence 765555567889999999999999999998743 3456899999999999985 3478999999999999999 Q ss_pred EEEEecccccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 797 DIQVNPYALDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 797 ~i~~~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) .+.++.+.+|.+|++.||++.|+|+++++|+||++++.+- T Consensus 366 ~~~~~~d~~~~~~~~~~~~~~r~d~~~~~~~A~~~l~~k~ 405 (409) T protein:vir:45 366 ILKRLVERYAEYDQTGFLAFHRFDCILEDTSAIKALVGKG 405 (409) T ss_pred EEEEeecccccCCcEEEEEEEEeccEeechhheEEEEecc Confidence 9998888889999999999999999999999999988655 No 27 >protein:vir:8420 Length: 477 # NCBI annotation: gp15 # Family: family:all:21 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818316;genbank:gi:29566752;genbank:GeneID:1260033 Probab=100.00 E-value=1e-53 Score=311.05 Aligned_cols=421 Identities=14% Similarity=0.090 Sum_probs=237.3 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhh---hhhhhhhhhhhhHHHHHHHHHHHh---hhhhh Q lcl|NC_016164. 399 PLSHNDNNHMDSSTIDMEAVRAQAAADERSRVASITSLCREHKAD---DLAQGLIESGASEADAMRSVLSEI---AKRPA 472 (836) Q Consensus 399 ~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~ei~al~~~~~l~---e~a~eliee~~t~~e~~~~~l~~l---~~~~~ 472 (836) ..++.. .+.....+..+.+. ....++.++......+ ...++...+.....+.....++.+ .++.. T Consensus 1 ~~k~~e--em~~~i~eL~e~r~-------~l~~e~~~l~d~ak~e~~~~~~~~e~~e~~a~~~el~~ei~~le~~~~~~~ 71 (477) T protein:vir:84 1 MEKHLE--ELRALRAAAVEAVA-------TLKAERQAIADGAKAEERAALSADETAEFRAKSASIKAELDKVEDLDEQIR 71 (477) T ss_pred CchHHH--HHHHHHHHHHHHHH-------HHHHHHHHHHHHHHhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 000000 00000000000000 0000111111110000 000000000000001111111111 11111 Q ss_pred hHHHHHHHHhhhhhhHHHHhhhhhhhhhhhHHHHHHHhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhh--hhhhhh Q lcl|NC_016164. 473 AQPATPAAPVRSAQPIAAGGGSADIGLTDKEARSFSFVRAIRAQMMPGDRAAFEAAAFEREVSEATAQRMGV--TPRGIL 550 (836) Q Consensus 473 a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~~~~~a~~~~~~~g~--~~~g~~ 550 (836) .........................................+......... ..........+.... ...... T Consensus 72 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~ 145 (477) T protein:vir:84 72 ELESEIERSGKLEAETKTVRKATVEVNEALTYEKGNGQSYFRDLAMQTVGM------ADEPAKERLRRHMVDVESDKEIR 145 (477) T ss_pred HHHHHHHHhhcchhhhhhhcccccccccchhhhhhHHHHHHHHHHHHHhhh------hhhHHHHHHHHHHhhhhhhhhHH Confidence 110000000000000000000000000000000000000111000000000 000000000000000 000000 Q ss_pred hh-hhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhccee-eecCCceEEEEEecCCc-eeeeeccCcc-- Q lcl|NC_016164. 551 AP-NDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTM-LTGLQGPVAIPRQTGAA-TAYWVAEGGD-- 625 (836) Q Consensus 551 ~~-~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~-~~~~~~~~~~p~~~~~~-~a~~v~Eg~~-- 625 (836) .. .....+...+.+..++|.+++|+.+.+.|++.+++.+++++++..+ ++...+++.+|+..+++ .+.|++||+. T Consensus 146 ~~~~~~~~~~~~~~~~~~gg~lv~~~~~~~~ii~~l~~~~~i~~~~~~~~~~~~~~~~~ip~~~~~~~~a~~~~Eg~~~~ 225 (477) T protein:vir:84 146 KIAKVGEEYRDLDRNGGTGGYAVPPLWMMNRFIELARAGRTYANLCPTEPLPGGTSSINIPKILTGTSTAIQAADNAALT 225 (477) T ss_pred HHHHhhhhhccccccCCCcceeeccchhHHHHHHHhhhcchHHHhhceeeecCCcceeEEEEEecCcceeeeeccCcccc Confidence 11 1112223334444555666666777889999999999999875543 45667789999876655 4678999864 Q ss_pred ---cccccccceeEEeeeeeeeeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCccccccccccccccc Q lcl|NC_016164. 626 ---PTESQPSVDQVALVAKTLGAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINT 702 (836) Q Consensus 626 ---~~~~~~~~~~it~~~~t~~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~ 702 (836) +++++++|+.+++++++++++++||++||.|+.++++++|.++|+++++.++|.+||+|+|++++|.||++.++++. T Consensus 226 ~~~~~~s~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~~l~G~Gt~~~p~Gi~~~~~~~~ 305 (477) T protein:vir:84 226 APSAHEVDLTDGFVQANVKTIAGQQGIAIQLLDQAAVSVDEFVFRDLAADYANKLNVQVISGTGSNNQVVGVRATAGITQ 305 (477) T ss_pred cccccccccceeeEEEeeeeEEeeeHHHHHHHhccchhHHHHHHHHHHHHHHHHHHHHHhccCCCCCccceeeecccccc Confidence 46788999999999999999999999999999999999999999999999999999999999999999999888776 Q ss_pred ccccccchhH-------HHHHHHHHHHhhhccccCccEEEecHHHHHHHHHHhhccCcccccc----------------- Q lcl|NC_016164. 703 ENFGATNPTY-------VELVSMESKVAADNADIGAMSYLTNSTLYGGFKTTEKATSTAQFVL----------------- 758 (836) Q Consensus 703 ~t~aa~~~t~-------~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~~g~~~~~~----------------- 758 (836) ++.+.+..++ .+|.+++..+...+. ....+|+|||.+|..|+.++|.+|+|+|.+ T Consensus 306 ~~~~~~~~t~~~~~~~~~~i~~~~~~~~~~~~-~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~~~~ 384 (477) T protein:vir:84 306 VTATSAGSALEKHQIIYQKIADAIQRVHTSRF-LEPEVIVMHPRRWASFHAIFAGDDRPLIVPSGPGFNNLGVLTEVASQ 384 (477) T ss_pred ccccccccchhhHHHHHHHHHHHHhhcccccc-CCccEEEEcHHHHHHHHHhhccCCCeeeecCcccccccccccccccc Confidence 6655444443 445555555544432 345689999999999999999999987643 Q ss_pred CCCCeecceeeEeeCccccc--------eEEEEehhceEEEeecceEEEEecccccccCcEEEEEEEEeccEEE-cccce Q lcl|NC_016164. 759 EPGGTVNGYNVVRSNQVANG--------DVFFGVWNQMIMGMWGALDIQVNPYALDKSGSVRVTALQDVDVAVR-HPEAF 829 (836) Q Consensus 759 ~~~~~l~G~pVv~s~~~~~~--------~i~~gD~s~~~i~~~~~l~i~~~~~~~~~~~~~~~r~~~r~d~~v~-~p~Af 829 (836) +++++|+|+||++++.+|++ .++||||++++++. .++.+..+++.++.++++.|+++.++++... +|+|| T Consensus 385 ~~~~~l~G~pVv~s~~~p~~~~~~~d~~~i~~gd~~~~~i~~-~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~r~~~af 463 (477) T protein:vir:84 385 RVVGQMHGLPVVTDPTLPTTLGTGTDQDVIHVLRASDLALFE-SSVRMRALQETRAENLSVLLQVYGYLAFTAARFPQSV 463 (477) T ss_pred cccchhcccceEecCcccccccccCCcceEEEEEeceEEEEe-eceeEEeccccccccceeeeeehhhhhhhhhccccce Confidence 23468999999999999964 48999999998887 5789999999999999999999988887554 59999 Q ss_pred EEEeecC Q lcl|NC_016164. 830 CRGNDNL 836 (836) Q Consensus 830 ~~l~~A~ 836 (836) ++++.+= T Consensus 464 v~~t~~~ 470 (477) T protein:vir:84 464 VEIGGTA 470 (477) T ss_pred EEeeccc Confidence 9999544 No 28 >protein:vir:1886 Length: 385 # NCBI annotation: major capsid subunit precursor # Family: family:all:585 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037666;genbank:gi:9634124;genbank:GeneID:1262513 Probab=100.00 E-value=3.4e-53 Score=308.22 Aligned_cols=374 Identities=15% Similarity=0.123 Sum_probs=245.3 Q ss_pred hhhhhhhhhhh-hhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhHHHHHHHHhhhhhhHHHHhhhhhhhhhhhHHHHHH Q lcl|NC_016164. 430 VASITSLCREH-KADDLAQGLIESGASEADAMRSVLSEIAKRPAAQPATPAAPVRSAQPIAAGGGSADIGLTDKEARSFS 508 (836) Q Consensus 430 ~~ei~al~~~~-~l~e~a~eliee~~t~~e~~~~~l~~l~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 508 (836) +.++.++.... ...+...++.++.....+......+.+.....+........... ... ... T Consensus 1 M~~l~el~~~~~~~~~e~~~l~~~~~~e~~~~~~~~~~l~~~~~~~~~~~~~~~~~--------------~~~-~~~--- 62 (385) T protein:vir:18 1 MSELALIQKAIEESQQKMTQLFDAQKAEIESTGQVSKQLQSDLMKVQEELTKSGTR--------------LFD-LEQ--- 62 (385) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------------HHH-HHH--- Confidence 00111111000 00001111111111111111111111111111111000000000 000 000 Q ss_pred HhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhh Q lcl|NC_016164. 509 FVRAIRAQMMPGDRAAFEAAAFEREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNR 588 (836) Q Consensus 509 ~~~a~~a~~~~~~~~~~~~~~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~ 588 (836) ...+...... ... ........++.+... ...+ .......++....+..++|++ +|+.+...|++.+++. T Consensus 63 --~~~~~~~~~~-~~~----~~~~~~~~~~~~~~~-~~~~--~~~~~~~~~~~~~~~~~~g~~-i~~~~~~~ii~~~~~~ 131 (385) T protein:vir:18 63 --KLASGAENPG-EKK----SFSERAAEELIKSWD-GKQG--TFGAKTFNKSLGSDADSAGSL-IQPMQIPGIIMPGLRR 131 (385) T ss_pred --Hhhccccccc-hhh----hhHHHHHHHHHHHHH-Hhhc--cchhhHHHhhhccccccCCce-ecchhhhHHHHHhhhc Confidence 0000000000 000 000011111111110 0000 001112223333444444444 4556778899999999 Q ss_pred hhhhhhcceeeecCCceEEEEEecC-CceeeeeccCcccccccccceeEEeeeeeeeeeehhHHHHHhcchhHHHHHHHH Q lcl|NC_016164. 589 LALNTLGVTMLTGLQGPVAIPRQTG-AATAYWVAEGGDPTESQPSVDQVALVAKTLGAYTEFSRRLMLQSSIDVEQMVRT 667 (836) Q Consensus 589 ~~l~~l~~~~~~~~~~~~~~p~~~~-~~~a~~v~Eg~~~~~~~~~~~~it~~~~t~~~~i~ISrelL~ds~~~l~~~i~~ 667 (836) +++++++.. .+..++.+.+|+..+ .+.+.|++|++++++++++|+++++.+++++++++||+++|.++ ..++++|.. T Consensus 132 ~~l~~~~~~-~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~~~~~k~~~~~~is~ell~d~-~~l~~~i~~ 209 (385) T protein:vir:18 132 LTIRDLLAQ-GRTSSNALEYVREEVFTNNADVVAEKALKPESDITFSKQTANVKTIAHWVQASRQVMDDA-PMLQSYINN 209 (385) T ss_pred cchhhhcce-ecccCcceEEEEEecCCcceeeeccCccccccccceeEEEEeeeeEEEeehhhHHHHhhH-HHHHHHHHH Confidence 999998554 556667899999875 57889999999999999999999999999999999999999865 689999999 Q ss_pred HHHHHHHHHHHHHHHhhcCCccccccccccccccccccc-ccchhHHHHHHHHHHHhhhccccCccEEEecHHHHHHHHH Q lcl|NC_016164. 668 ELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTENFG-ATNPTYVELVSMESKVAADNADIGAMSYLTNSTLYGGFKT 746 (836) Q Consensus 668 ~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~a-a~~~t~~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~ 746 (836) .|+++++++++.+||+|+|++..|.||++.++....+.. ++..++++|.+++..+...+. .+++|+|||++|..|+. T Consensus 210 ~la~a~~~~~d~~~l~G~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~d~i~~~~~~l~~~~~--~~~~~~~~~~~~~~l~~ 287 (385) T protein:vir:18 210 RLMYGLALKEEGQLLNGDGTGDNLEGLNKVATAYDTSLNATGDTRADIIAHAIYQVTESEF--SASGIVLNPRDWHNIAL 287 (385) T ss_pred HHHHHHHHHHHHHHHhccCCCCcccccccccccccccccccccchHHHHHHHHHhhccccC--CCCEEEEcHHHHHHHHH Confidence 999999999999999999999999999988776655443 344678999999999977653 46799999999999999 Q ss_pred HhhccCcccccc---CCCCeecceeeEeeCccccceEEEEehhc-eEEEeecceEEEEecc--cccccCcEEEEEEEEec Q lcl|NC_016164. 747 TEKATSTAQFVL---EPGGTVNGYNVVRSNQVANGDVFFGVWNQ-MIMGMWGALDIQVNPY--ALDKSGSVRVTALQDVD 820 (836) Q Consensus 747 lkd~~g~~~~~~---~~~~~l~G~pVv~s~~~~~~~i~~gD~s~-~~i~~~~~l~i~~~~~--~~~~~~~~~~r~~~r~d 820 (836) ++|++|+++|.. +.+++|+|+||++++.+|++.++||||+. |.++++.++++.++.+ .+|.+|++.||+++|+| T Consensus 288 lkd~~G~~l~~~~~~~~~~~l~G~pV~~~~~~p~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~r~~ 367 (385) T protein:vir:18 288 LKDNEGRYIFGGPQAFTSNIMWGLPVVPTKAQAAGTFTVGGFDMASQVWDRMDATVEVSREDRDNFVKNMLTILCEERLA 367 (385) T ss_pred hhcCCCceeccCcccCCCceecceeeEEcCcCCCCcEEEeecccEEEEEEecceEEEEeccccchhhcCcEEEEEEEeec Confidence 999999988743 45678999999999999999999999986 7889999999888765 45899999999999999 Q ss_pred cEEEcccceEEEeecC Q lcl|NC_016164. 821 VAVRHPEAFCRGNDNL 836 (836) Q Consensus 821 ~~v~~p~Af~~l~~A~ 836 (836) +++++|+||++++.+- T Consensus 368 ~~v~~~~a~~~~~~~a 383 (385) T protein:vir:18 368 LAHYRPTAIIKGTFSS 383 (385) T ss_pred cEEecccceEEEEecc Confidence 9999999999999888 No 29 >protein:vir:191 Length: 385 # NCBI annotation: major head subunit precursor # Family: family:all:585 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037701;genbank:gi:9634158;genbank:GeneID:1262530 Probab=100.00 E-value=3.4e-53 Score=308.22 Aligned_cols=374 Identities=15% Similarity=0.123 Sum_probs=245.3 Q ss_pred hhhhhhhhhhh-hhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhHHHHHHHHhhhhhhHHHHhhhhhhhhhhhHHHHHH Q lcl|NC_016164. 430 VASITSLCREH-KADDLAQGLIESGASEADAMRSVLSEIAKRPAAQPATPAAPVRSAQPIAAGGGSADIGLTDKEARSFS 508 (836) Q Consensus 430 ~~ei~al~~~~-~l~e~a~eliee~~t~~e~~~~~l~~l~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 508 (836) +.++.++.... ...+...++.++.....+......+.+.....+........... ... ... T Consensus 1 M~~l~el~~~~~~~~~e~~~l~~~~~~e~~~~~~~~~~l~~~~~~~~~~~~~~~~~--------------~~~-~~~--- 62 (385) T protein:vir:19 1 MSELALIQKAIEESQQKMTQLFDAQKAEIESTGQVSKQLQSDLMKVQEELTKSGTR--------------LFD-LEQ--- 62 (385) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------------HHH-HHH--- Confidence 00111111000 00001111111111111111111111111111111000000000 000 000 Q ss_pred HhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhh Q lcl|NC_016164. 509 FVRAIRAQMMPGDRAAFEAAAFEREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNR 588 (836) Q Consensus 509 ~~~a~~a~~~~~~~~~~~~~~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~ 588 (836) ...+...... ... ........++.+... ...+ .......++....+..++|++ +|+.+...|++.+++. T Consensus 63 --~~~~~~~~~~-~~~----~~~~~~~~~~~~~~~-~~~~--~~~~~~~~~~~~~~~~~~g~~-i~~~~~~~ii~~~~~~ 131 (385) T protein:vir:19 63 --KLASGAENPG-EKK----SFSERAAEELIKSWD-GKQG--TFGAKTFNKSLGSDADSAGSL-IQPMQIPGIIMPGLRR 131 (385) T ss_pred --Hhhccccccc-hhh----hhHHHHHHHHHHHHH-Hhhc--cchhhHHHhhhccccccCCce-ecchhhhHHHHHhhhc Confidence 0000000000 000 000011111111110 0000 001112223333444444444 4556778899999999 Q ss_pred hhhhhhcceeeecCCceEEEEEecC-CceeeeeccCcccccccccceeEEeeeeeeeeeehhHHHHHhcchhHHHHHHHH Q lcl|NC_016164. 589 LALNTLGVTMLTGLQGPVAIPRQTG-AATAYWVAEGGDPTESQPSVDQVALVAKTLGAYTEFSRRLMLQSSIDVEQMVRT 667 (836) Q Consensus 589 ~~l~~l~~~~~~~~~~~~~~p~~~~-~~~a~~v~Eg~~~~~~~~~~~~it~~~~t~~~~i~ISrelL~ds~~~l~~~i~~ 667 (836) +++++++.. .+..++.+.+|+..+ .+.+.|++|++++++++++|+++++.+++++++++||+++|.++ ..++++|.. T Consensus 132 ~~l~~~~~~-~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~~~~~k~~~~~~is~ell~d~-~~l~~~i~~ 209 (385) T protein:vir:19 132 LTIRDLLAQ-GRTSSNALEYVREEVFTNNADVVAEKALKPESDITFSKQTANVKTIAHWVQASRQVMDDA-PMLQSYINN 209 (385) T ss_pred cchhhhcce-ecccCcceEEEEEecCCcceeeeccCccccccccceeEEEEeeeeEEEeehhhHHHHhhH-HHHHHHHHH Confidence 999998554 556667899999875 57889999999999999999999999999999999999999865 689999999 Q ss_pred HHHHHHHHHHHHHHHhhcCCccccccccccccccccccc-ccchhHHHHHHHHHHHhhhccccCccEEEecHHHHHHHHH Q lcl|NC_016164. 668 ELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTENFG-ATNPTYVELVSMESKVAADNADIGAMSYLTNSTLYGGFKT 746 (836) Q Consensus 668 ~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~a-a~~~t~~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~ 746 (836) .|+++++++++.+||+|+|++..|.||++.++....+.. ++..++++|.+++..+...+. .+++|+|||++|..|+. T Consensus 210 ~la~a~~~~~d~~~l~G~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~d~i~~~~~~l~~~~~--~~~~~~~~~~~~~~l~~ 287 (385) T protein:vir:19 210 RLMYGLALKEEGQLLNGDGTGDNLEGLNKVATAYDTSLNATGDTRADIIAHAIYQVTESEF--SASGIVLNPRDWHNIAL 287 (385) T ss_pred HHHHHHHHHHHHHHHhccCCCCcccccccccccccccccccccchHHHHHHHHHhhccccC--CCCEEEEcHHHHHHHHH Confidence 999999999999999999999999999988776655443 344678999999999977653 46799999999999999 Q ss_pred HhhccCcccccc---CCCCeecceeeEeeCccccceEEEEehhc-eEEEeecceEEEEecc--cccccCcEEEEEEEEec Q lcl|NC_016164. 747 TEKATSTAQFVL---EPGGTVNGYNVVRSNQVANGDVFFGVWNQ-MIMGMWGALDIQVNPY--ALDKSGSVRVTALQDVD 820 (836) Q Consensus 747 lkd~~g~~~~~~---~~~~~l~G~pVv~s~~~~~~~i~~gD~s~-~~i~~~~~l~i~~~~~--~~~~~~~~~~r~~~r~d 820 (836) ++|++|+++|.. +.+++|+|+||++++.+|++.++||||+. |.++++.++++.++.+ .+|.+|++.||+++|+| T Consensus 288 lkd~~G~~l~~~~~~~~~~~l~G~pV~~~~~~p~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~r~~ 367 (385) T protein:vir:19 288 LKDNEGRYIFGGPQAFTSNIMWGLPVVPTKAQAAGTFTVGGFDMASQVWDRMDATVEVSREDRDNFVKNMLTILCEERLA 367 (385) T ss_pred hhcCCCceeccCcccCCCceecceeeEEcCcCCCCcEEEeecccEEEEEEecceEEEEeccccchhhcCcEEEEEEEeec Confidence 999999988743 45678999999999999999999999986 7889999999888765 45899999999999999 Q ss_pred cEEEcccceEEEeecC Q lcl|NC_016164. 821 VAVRHPEAFCRGNDNL 836 (836) Q Consensus 821 ~~v~~p~Af~~l~~A~ 836 (836) +++++|+||++++.+- T Consensus 368 ~~v~~~~a~~~~~~~a 383 (385) T protein:vir:19 368 LAHYRPTAIIKGTFSS 383 (385) T ss_pred cEEecccceEEEEecc Confidence 9999999999999888 No 30 >protein:vir:97148 Length: 324 # NCBI annotation: ORF010 # Family: family:all:507 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239726;genbank:gi:66394880;genbank:GeneID:5130881 Probab=100.00 E-value=6.4e-54 Score=312.21 Aligned_cols=298 Identities=13% Similarity=0.138 Sum_probs=245.5 Q ss_pred hhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEE Q lcl|NC_016164. 529 AFEREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAI 608 (836) Q Consensus 529 ~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~ 608 (836) |......+...+++..... ..........+..+.++.++|+.+.+.|++.+++.+++++++ ++++..++.+++ T Consensus 1 ~~~~~~~~~~~~~f~~~~~------~~~~~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~-~~~~~~~~~~~i 73 (324) T protein:vir:97 1 MEQTQKLKLNLQHFASNNV------KPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLG-KYEPMEGTEKKF 73 (324) T ss_pred CccchhHHHHHHHHHHhhh------hhhhhccccccccCCCcceechhHHHHHHHHHHhhcchhhhc-ceeeccCCceEE Confidence 1111111111111100000 000111122334455777889999999999999999999984 556677788999 Q ss_pred EEecCCceeeeeccCcccccccccceeEEeeeeeeeeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCc Q lcl|NC_016164. 609 PRQTGAATAYWVAEGGDPTESQPSVDQVALVAKTLGAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSN 688 (836) Q Consensus 609 p~~~~~~~a~~v~Eg~~~~~~~~~~~~it~~~~t~~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~ 688 (836) |+.++.+.+.|++|++++++++++|+++++++++++++++||+|+|.++.++++++|.+.|+++++++++.++|+|+|++ T Consensus 74 p~~~~~~~a~~v~Eg~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~G~g~~ 153 (324) T protein:vir:97 74 TFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNN 153 (324) T ss_pred EEEecCcceeEeccCccccccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhccCCCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccccccccccccccchhHHHHHHHHHHHhhhccccCccEEEecHHHHHHHHHHhhccCccccccCCCCeeccee Q lcl|NC_016164. 689 SQPEGLKFVTGINTENFGATNPTYVELVSMESKVAADNADIGAMSYLTNSTLYGGFKTTEKATSTAQFVLEPGGTVNGYN 768 (836) Q Consensus 689 ~~p~Gi~~~~~~~~~t~aa~~~t~~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~~g~~~~~~~~~~~l~G~p 768 (836) +.|.|+++.....+ +...+.+++++|.+++.++..++. .+++|+|||.+|..|++++|++|++.|....+++|+|+| T Consensus 154 ~~~~gi~~~~~~~~-~~~~~~~~~~~i~~~~~~l~~~~~--~~~~~v~n~~~~~~L~~lkd~~g~~~~~~~~~~tl~G~P 230 (324) T protein:vir:97 154 PFGKSIAQSIEKTN-KVIKGDFTQDNIIDLEALLEDDEL--EANAFISKTQNRSLLRKIVDPETKERIYDRNSDTLDGLP 230 (324) T ss_pred ccCccccccccccc-eeccccCCHHHHHHHHHhhhhccC--CCCEEEEcHHHHHHHHHhhcCCCceeecCCCCcccccee Confidence 89999987655433 334466789999999999988764 467899999999999999999999999888889999999 Q ss_pred eEeeCccc--cceEEEEehhceEEEeecceEEEEeccc--------------ccccCcEEEEEEEEeccEEEcccceEEE Q lcl|NC_016164. 769 VVRSNQVA--NGDVFFGVWNQMIMGMWGALDIQVNPYA--------------LDKSGSVRVTALQDVDVAVRHPEAFCRG 832 (836) Q Consensus 769 Vv~s~~~~--~~~i~~gD~s~~~i~~~~~l~i~~~~~~--------------~~~~~~~~~r~~~r~d~~v~~p~Af~~l 832 (836) |++++..+ .+.++||||+++.+++++++++..+++. .|.+|++.||++.|+|+++.+|+||+++ T Consensus 231 V~~~~~~~~~~~~~~~gd~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~a~~~l 310 (324) T protein:vir:97 231 VVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKL 310 (324) T ss_pred eEeecCCCCCcceEEEEecccEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEE Confidence 99988754 5679999999999999999999887753 4899999999999999999999999999 Q ss_pred eecC Q lcl|NC_016164. 833 NDNL 836 (836) Q Consensus 833 ~~A~ 836 (836) +.+- T Consensus 311 ~~~~ 314 (324) T protein:vir:97 311 VPAD 314 (324) T ss_pred Eecc Confidence 9888 No 31 >protein:vir:96392 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239648;genbank:gi:66395381;genbank:GeneID:5132868 Probab=100.00 E-value=3.2e-53 Score=308.40 Aligned_cols=298 Identities=12% Similarity=0.136 Sum_probs=244.7 Q ss_pred hhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEE Q lcl|NC_016164. 529 AFEREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAI 608 (836) Q Consensus 529 ~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~ 608 (836) |......+...+.+-....+ ............+.++.++|+.+...|++.+++.+++++++ ++++..++.+++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~------~~~~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l~-~~~~~~~~~~~~ 73 (324) T protein:vir:96 1 MEQTQKLKLNLQHFASNNVK------PQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLG-KYEPMEGTEKKF 73 (324) T ss_pred CCcchhhhHHHHHHHHHhhh------hhhhccccccccCcCccccchhHHHHHHHHHHhhchhhhhc-ceeeccCCceEE Confidence 10000111111100000000 00111223334456677889999999999999999999985 456777778999 Q ss_pred EEecCCceeeeeccCcccccccccceeEEeeeeeeeeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCc Q lcl|NC_016164. 609 PRQTGAATAYWVAEGGDPTESQPSVDQVALVAKTLGAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSN 688 (836) Q Consensus 609 p~~~~~~~a~~v~Eg~~~~~~~~~~~~it~~~~t~~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~ 688 (836) |+.++.+.+.|++|++++++++++|+++++++++++++++||+|+|.++.++++++|.+.|+++++++++.++|+|+|++ T Consensus 74 p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~G~g~~ 153 (324) T protein:vir:96 74 TFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNN 153 (324) T ss_pred EEEecCcceeEecCCccccccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhccCCCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccccccccccccccchhHHHHHHHHHHHhhhccccCccEEEecHHHHHHHHHHhhccCccccccCCCCeeccee Q lcl|NC_016164. 689 SQPEGLKFVTGINTENFGATNPTYVELVSMESKVAADNADIGAMSYLTNSTLYGGFKTTEKATSTAQFVLEPGGTVNGYN 768 (836) Q Consensus 689 ~~p~Gi~~~~~~~~~t~aa~~~t~~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~~g~~~~~~~~~~~l~G~p 768 (836) +.|.|+++..+.... ...+..++++|.+++.++..++. .+++|+|||++|..|+.++|.+|++.+....+++|+|+| T Consensus 154 ~~~~gi~~~~~~~~~-~~~~~~t~~~i~~~~~~l~~~~~--~~~~~vmn~~~~~~L~~l~d~~G~~~~~~~~~~~l~G~P 230 (324) T protein:vir:96 154 PFGKSIAQSIEKTNK-VIKGDFTQDNIIDLEALLEDDEL--EANAFISKTQNRSLLRKIVDPETKERIYDRNSDSLDGLP 230 (324) T ss_pred CcCccccccccccce-eccccccHHHHHHHHHhhhhccC--CCCEEEEcHHHHHHHHHhhccCCCeeecCCCCCccccee Confidence 899999876554433 33456789999999999988764 466899999999999999999999998888889999999 Q ss_pred eEeeCcc--ccceEEEEehhceEEEeecceEEEEeccc--------------ccccCcEEEEEEEEeccEEEcccceEEE Q lcl|NC_016164. 769 VVRSNQV--ANGDVFFGVWNQMIMGMWGALDIQVNPYA--------------LDKSGSVRVTALQDVDVAVRHPEAFCRG 832 (836) Q Consensus 769 Vv~s~~~--~~~~i~~gD~s~~~i~~~~~l~i~~~~~~--------------~~~~~~~~~r~~~r~d~~v~~p~Af~~l 832 (836) |++++.+ +++.++||||+++.+++++++++..+++. .|++|++.||+++|+|+++.+|+||+++ T Consensus 231 V~~~~~~~~~~~~~~~gd~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~A~~~l 310 (324) T protein:vir:96 231 VVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKL 310 (324) T ss_pred eEeeCCCCCCcceEEEEecceEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecccceEEE Confidence 9998764 45679999999999999999999887763 4899999999999999999999999999 Q ss_pred eecC Q lcl|NC_016164. 833 NDNL 836 (836) Q Consensus 833 ~~A~ 836 (836) +.|- T Consensus 311 ~~a~ 314 (324) T protein:vir:96 311 VPAD 314 (324) T ss_pred eccc Confidence 9888 No 32 >protein:vir:78830 Length: 324 # NCBI annotation: major head protein # Family: family:all:507 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285361;genbank:gi:148717889;genbank:GeneID:5246961 Probab=100.00 E-value=3.2e-53 Score=308.40 Aligned_cols=298 Identities=12% Similarity=0.136 Sum_probs=244.7 Q ss_pred hhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEE Q lcl|NC_016164. 529 AFEREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAI 608 (836) Q Consensus 529 ~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~ 608 (836) |......+...+.+-....+ ............+.++.++|+.+...|++.+++.+++++++ ++++..++.+++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~------~~~~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l~-~~~~~~~~~~~~ 73 (324) T protein:vir:78 1 MEQTQKLKLNLQHFASNNVK------PQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLG-KYEPMEGTEKKF 73 (324) T ss_pred CCcchhhhHHHHHHHHHhhh------hhhhccccccccCcCccccchhHHHHHHHHHHhhchhhhhc-ceeeccCCceEE Confidence 10000111111100000000 00111223334456677889999999999999999999985 456777778999 Q ss_pred EEecCCceeeeeccCcccccccccceeEEeeeeeeeeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCc Q lcl|NC_016164. 609 PRQTGAATAYWVAEGGDPTESQPSVDQVALVAKTLGAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSN 688 (836) Q Consensus 609 p~~~~~~~a~~v~Eg~~~~~~~~~~~~it~~~~t~~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~ 688 (836) |+.++.+.+.|++|++++++++++|+++++++++++++++||+|+|.++.++++++|.+.|+++++++++.++|+|+|++ T Consensus 74 p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~G~g~~ 153 (324) T protein:vir:78 74 TFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNN 153 (324) T ss_pred EEEecCcceeEecCCccccccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhccCCCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccccccccccccccchhHHHHHHHHHHHhhhccccCccEEEecHHHHHHHHHHhhccCccccccCCCCeeccee Q lcl|NC_016164. 689 SQPEGLKFVTGINTENFGATNPTYVELVSMESKVAADNADIGAMSYLTNSTLYGGFKTTEKATSTAQFVLEPGGTVNGYN 768 (836) Q Consensus 689 ~~p~Gi~~~~~~~~~t~aa~~~t~~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~~g~~~~~~~~~~~l~G~p 768 (836) +.|.|+++..+.... ...+..++++|.+++.++..++. .+++|+|||++|..|+.++|.+|++.+....+++|+|+| T Consensus 154 ~~~~gi~~~~~~~~~-~~~~~~t~~~i~~~~~~l~~~~~--~~~~~vmn~~~~~~L~~l~d~~G~~~~~~~~~~~l~G~P 230 (324) T protein:vir:78 154 PFGKSIAQSIEKTNK-VIKGDFTQDNIIDLEALLEDDEL--EANAFISKTQNRSLLRKIVDPETKERIYDRNSDSLDGLP 230 (324) T ss_pred CcCccccccccccce-eccccccHHHHHHHHHhhhhccC--CCCEEEEcHHHHHHHHHhhccCCCeeecCCCCCccccee Confidence 899999876554433 33456789999999999988764 466899999999999999999999998888889999999 Q ss_pred eEeeCcc--ccceEEEEehhceEEEeecceEEEEeccc--------------ccccCcEEEEEEEEeccEEEcccceEEE Q lcl|NC_016164. 769 VVRSNQV--ANGDVFFGVWNQMIMGMWGALDIQVNPYA--------------LDKSGSVRVTALQDVDVAVRHPEAFCRG 832 (836) Q Consensus 769 Vv~s~~~--~~~~i~~gD~s~~~i~~~~~l~i~~~~~~--------------~~~~~~~~~r~~~r~d~~v~~p~Af~~l 832 (836) |++++.+ +++.++||||+++.+++++++++..+++. .|++|++.||+++|+|+++.+|+||+++ T Consensus 231 V~~~~~~~~~~~~~~~gd~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~A~~~l 310 (324) T protein:vir:78 231 VVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKL 310 (324) T ss_pred eEeeCCCCCCcceEEEEecceEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecccceEEE Confidence 9998764 45679999999999999999999887763 4899999999999999999999999999 Q ss_pred eecC Q lcl|NC_016164. 833 NDNL 836 (836) Q Consensus 833 ~~A~ 836 (836) +.|- T Consensus 311 ~~a~ 314 (324) T protein:vir:78 311 VPAD 314 (324) T ss_pred eccc Confidence 9888 No 33 >protein:vir:7771 Length: 330 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817605;genbank:gi:29566035;genbank:GeneID:1259229 Probab=100.00 E-value=2.3e-53 Score=309.13 Aligned_cols=281 Identities=19% Similarity=0.218 Sum_probs=234.1 Q ss_pred hhhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEecCCceeeeeccCcccccccc Q lcl|NC_016164. 552 PNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQTGAATAYWVAEGGDPTESQP 631 (836) Q Consensus 552 ~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~ 631 (836) ......++.....+.++|+++ |+.+.+.|++.+++.+++++++. +++...+.+.+|+.++.+.+.|++|+++++++++ T Consensus 1 m~~~~~~a~~~~~t~~~g~~i-~~~~~~~ii~~~~~~s~l~~~~~-~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~ 78 (330) T protein:vir:77 1 MAGSTVPSTQVALTGDFSAFL-TPEQSQDYFAEIEKTSIVQRIAR-KVPMGPTGISIPHWTGAVSASWTGEAERKPITKG 78 (330) T ss_pred CcccccchhhccccCCCccee-chhHHHHHHHHHHhccchhhhcc-eeeccCCceEEEEEcCCcceeEecCCCccccccc Confidence 011112233344444555555 45567889999999999999955 5667777899999999999999999999999999 Q ss_pred cceeEEeeeeeeeeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCccccccccccccccc-------cc Q lcl|NC_016164. 632 SVDQVALVAKTLGAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINT-------EN 704 (836) Q Consensus 632 ~~~~it~~~~t~~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~-------~t 704 (836) +|+++++++++++++++||+|+|.++.++++++|.++|+++++++++.+||+|+|++..|.|+++...... .+ T Consensus 79 ~f~~i~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~ai~~~~~~~~l~G~g~~~~~~g~~~~~~~~~~~~~~~~~~ 158 (330) T protein:vir:77 79 SFGKQELEPVKITTIFAESAEVVRLNPLNYLNTMRTKIAEAIALKFDAAAIHGIDKPSAFKGYLAETTKVVSLADTNLTT 158 (330) T ss_pred eeeEEEEeEEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCCccccccccccccceeecccccc Confidence 99999999999999999999999999999999999999999999999999999999988999886542211 11 Q ss_pred -ccccchhHHHHHHHHHHHhhhccccCccEEEecHHHHHHHHHHhhccCccccccC---------CCCeecceeeEeeCc Q lcl|NC_016164. 705 -FGATNPTYVELVSMESKVAADNADIGAMSYLTNSTLYGGFKTTEKATSTAQFVLE---------PGGTVNGYNVVRSNQ 774 (836) Q Consensus 705 -~aa~~~t~~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~~g~~~~~~~---------~~~~l~G~pVv~s~~ 774 (836) .+.....++++.+++.++..++. .+++|+|||++|..|+.+||.+|+++|... .+++|+|+||+++++ T Consensus 159 ~~~~~~~~~~~l~~~~~~~~~~~~--~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~l~G~PV~~~~~ 236 (330) T protein:vir:77 159 ASGPQGNAYLAVNNALSLLVNSGK--KWTGTLLDNVTEPILNTAVDGNGRPLFVESTYTEQVGAIREGRILGRPTYVADN 236 (330) T ss_pred cccccchhHHHHHHHHHhhhhcCC--CccEEEEcHHHHHHHHHHhccCCceeecCccccccccccCCceecceeeEEecc Confidence 12233457889999988877653 567899999999999999999999887542 235899999999999 Q ss_pred cccce------EEEEehhceEEEeecceEEEEeccc------------------ccccCcEEEEEEEEeccEEEcccceE Q lcl|NC_016164. 775 VANGD------VFFGVWNQMIMGMWGALDIQVNPYA------------------LDKSGSVRVTALQDVDVAVRHPEAFC 830 (836) Q Consensus 775 ~~~~~------i~~gD~s~~~i~~~~~l~i~~~~~~------------------~~~~~~~~~r~~~r~d~~v~~p~Af~ 830 (836) +|++. ++||||+++.+++++++++..+++. .|.+|++.||++.|+|+++.+|+||+ T Consensus 237 ~p~~~~~~~~~~~~gd~s~~~i~~~~~~~i~~~~e~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~ 316 (330) T protein:vir:77 237 VVNGTVGNRVVGVMGDFSQVIWGQIGGLSFDVTDQATLDFGEEQGGVWVPKLISLWQHNMVAVRCEAEFAFMVNDKDAFV 316 (330) T ss_pred ccCCCCCCccEEEEEecceEEEEEecCcEEEEeecceeeecccccccccccccchhhcCcEEEEEEEEeccEEecccceE Confidence 99754 8999999999999999999877653 38899999999999999999999999 Q ss_pred EEeecC Q lcl|NC_016164. 831 RGNDNL 836 (836) Q Consensus 831 ~l~~A~ 836 (836) +++.+. T Consensus 317 ~i~~~~ 322 (330) T protein:vir:77 317 KLTDQV 322 (330) T ss_pred EEEecc Confidence 999998 No 34 >protein:vir:9309 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803287;genbank:gi:29028597;genbank:GeneID:1258044 Probab=100.00 E-value=6.5e-53 Score=306.69 Aligned_cols=298 Identities=13% Similarity=0.152 Sum_probs=242.9 Q ss_pred hhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEE Q lcl|NC_016164. 529 AFEREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAI 608 (836) Q Consensus 529 ~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~ 608 (836) +......+...+.+ .......... ++ ...+..+.++.++|+.+.+.|++.+++.+++++++ +.++..++.+.+ T Consensus 1 ~~~~~~~~~~~~~f----~~~~~~~~~~-~a-~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~-~~~~~~~~~~~i 73 (324) T protein:vir:93 1 MEQTQKLKLNLQHF----ASNNVKPQVF-NP-DNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLG-KYEPMEGTEKKF 73 (324) T ss_pred CchhHHHHHHHHHH----HHhhhhhhhc-cc-ccccccCCCcceechhHHHHHHHHHHhhchhhhhc-ceeeccCCceEE Confidence 00000001001000 0000000111 11 22233344455778999999999999999999985 556677778999 Q ss_pred EEecCCceeeeeccCcccccccccceeEEeeeeeeeeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCc Q lcl|NC_016164. 609 PRQTGAATAYWVAEGGDPTESQPSVDQVALVAKTLGAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSN 688 (836) Q Consensus 609 p~~~~~~~a~~v~Eg~~~~~~~~~~~~it~~~~t~~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~ 688 (836) |+.++.+.+.|++|++++++++++|+++++.+++++++++||+|+|.++.+++.++|.+.|+++++++++.++|+|+|++ T Consensus 74 p~~~~~~~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~G~g~~ 153 (324) T protein:vir:93 74 TFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNN 153 (324) T ss_pred EEEecCcceeeecCCccccccccceeEEEEEeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhcCCCCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999998 Q ss_pred ccccccccccccccccccccchhHHHHHHHHHHHhhhccccCccEEEecHHHHHHHHHHhhccCccccccCCCCeeccee Q lcl|NC_016164. 689 SQPEGLKFVTGINTENFGATNPTYVELVSMESKVAADNADIGAMSYLTNSTLYGGFKTTEKATSTAQFVLEPGGTVNGYN 768 (836) Q Consensus 689 ~~p~Gi~~~~~~~~~t~aa~~~t~~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~~g~~~~~~~~~~~l~G~p 768 (836) +.|.|+++....... ...+.+++++|.+++.++..++. .+.+|+|||++|..|+.++|++|++++....+++|+|+| T Consensus 154 ~~~~~~~~~~~~~~~-~~~~~~~~~~i~~~~~~l~~~~~--~~~~~v~n~~~~~~L~~l~d~~G~~~~~~~~~~~l~G~P 230 (324) T protein:vir:93 154 PFGKSIAQSIEKTNK-VIKGDFTQDNIIDLEALLEDDEL--EANAFISKTQNRSLLRKIVDPETKERIYDRNSDSLDGLP 230 (324) T ss_pred CcCccccccccccce-eccccccHHHHHHHHHhhhhccC--CCCEEEEcHHHHHHHHHhhCCCCCeeecCCCCCccccee Confidence 889998876554333 33456789999999999988764 456899999999999999999999999888889999999 Q ss_pred eEeeCc--cccceEEEEehhceEEEeecceEEEEeccc--------------ccccCcEEEEEEEEeccEEEcccceEEE Q lcl|NC_016164. 769 VVRSNQ--VANGDVFFGVWNQMIMGMWGALDIQVNPYA--------------LDKSGSVRVTALQDVDVAVRHPEAFCRG 832 (836) Q Consensus 769 Vv~s~~--~~~~~i~~gD~s~~~i~~~~~l~i~~~~~~--------------~~~~~~~~~r~~~r~d~~v~~p~Af~~l 832 (836) |++++. .+.+.+++|||+++.+++++++++..+++. .|++|++.||+++|+|+++.+|+||+++ T Consensus 231 Vv~~~~~~~~~~~i~~gdfs~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~n~~~~r~~~r~d~~v~~~~a~~~l 310 (324) T protein:vir:93 231 VVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKL 310 (324) T ss_pred eEeecCCCCCcceEEEEecceEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEE Confidence 999776 456679999999999999999999988763 3899999999999999999999999999 Q ss_pred eecC Q lcl|NC_016164. 833 NDNL 836 (836) Q Consensus 833 ~~A~ 836 (836) +.|. T Consensus 311 ~~a~ 314 (324) T protein:vir:93 311 VPAD 314 (324) T ss_pred eccc Confidence 9888 No 35 >protein:vir:41 Length: 299 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463467;swissprot:trembl:q9t1b7;genbank:gi:16798789;uniprot:Q9T1B7;genbank:GeneID:922353 Probab=100.00 E-value=9.1e-53 Score=305.88 Aligned_cols=276 Identities=16% Similarity=0.167 Sum_probs=240.8 Q ss_pred hhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEecCCceeeeeccCccccccccccee Q lcl|NC_016164. 556 LHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQTGAATAYWVAEGGDPTESQPSVDQ 635 (836) Q Consensus 556 ~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~ 635 (836) +.......++.+.++.++|+.+...|++.+++.+++++++ ++++...+...+|+.+ ++.+.|++|++++++++++|++ T Consensus 1 ~g~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~-~~~~~~~~~~~~~~~~-~~~a~~v~E~~~~~~~~~~f~~ 78 (299) T protein:vir:41 1 MGFNPDTTTMQSAKTGSIPINISEQIITGVKNGSAAMKLA-KAVPMTKPEEEFTFMS-GVGAFWVDEAERIQTSKPTFTK 78 (299) T ss_pred CCcCCCcccccCCCceecchhHHHHHHHHHHhcchhhhhc-eeeecCCCcEEEEEEc-CCceeeeecCccccccccceeE Confidence 1122223344455667889999999999999999999995 5677777888888775 5779999999999999999999 Q ss_pred EEeeeeeeeeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccccccccccchhHHHH Q lcl|NC_016164. 636 VALVAKTLGAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTENFGATNPTYVEL 715 (836) Q Consensus 636 it~~~~t~~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~aa~~~t~~~l 715 (836) +++.+++++++++||+|+|.++.++++++|.+.|+++++++++.++|+|+|+ ++|.|+++.......+..++..++++| T Consensus 79 v~l~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~g~-~~~~gil~~~~~~~~~~~~~~~~~~~l 157 (299) T protein:vir:41 79 AKMRSKKMGVIIPTTKENLNYSVTNFFSLMQAEIVEAFYKKFDQAVFTGVES-PYNWNILKSATDASNLVEETANKYDDL 157 (299) T ss_pred EEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhhcccC-cccccccccccccceeeccccccHHHH Confidence 9999999999999999999999999999999999999999999999999997 588999987766666666677889999 Q ss_pred HHHHHHHhhhccccCccEEEecHHHHHHHHHHhhccCccccccC---CCCeecceeeEeeCccccce----EEEEehhce Q lcl|NC_016164. 716 VSMESKVAADNADIGAMSYLTNSTLYGGFKTTEKATSTAQFVLE---PGGTVNGYNVVRSNQVANGD----VFFGVWNQM 788 (836) Q Consensus 716 ~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~~g~~~~~~~---~~~~l~G~pVv~s~~~~~~~----i~~gD~s~~ 788 (836) .+++.++..++. .+++|+|||+++.+|++++|.+|+++|.+. ..++|+|+||++++.+|.+. ++||||+++ T Consensus 158 ~~~~~~l~~~~~--~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~gdfs~~ 235 (299) T protein:vir:41 158 NEAIGLIEAEDL--EPNGIATIRKQRVKYRSTKDGNGMPIFNTATSNGVDDVLGLPIAYTPKYTFGDKDISELVGDWNQA 235 (299) T ss_pred HHHHHhhhcccC--CcCEEEEcHHHHHHHHHhhccCCceeecCCcCCCCceecceeeEEecccCCCCCceEEEEEecccE Confidence 999999987664 567899999999999999999999988543 33689999999999999775 899999999 Q ss_pred EEEeecceEEEEecccc--------------cccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 789 IMGMWGALDIQVNPYAL--------------DKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 789 ~i~~~~~l~i~~~~~~~--------------~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) .++.++++++..+++.. |++|++.||++.|+|+++.+|+||++++.+- T Consensus 236 ~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~A~~~l~~~a 297 (299) T protein:vir:41 236 YYGILRGVEYEILTEATLTTVADETGKPLNLAERDMAAIKATFEVGFMVVKDEAFSAVQPKA 297 (299) T ss_pred EEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEecc Confidence 99999999998877543 7899999999999999999999999999888 No 36 >protein:vir:105905 Length: 304 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004375;genbank:gi:122891830;genbank:GeneID:4712376 Probab=100.00 E-value=8.2e-53 Score=306.14 Aligned_cols=279 Identities=16% Similarity=0.167 Sum_probs=238.1 Q ss_pred hhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEecCCceeeeeccCccccccccc Q lcl|NC_016164. 553 NDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQTGAATAYWVAEGGDPTESQPS 632 (836) Q Consensus 553 ~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~ 632 (836) +..........+.++.|+.++|+.+...|++.+++.+++++++ ++++..++.+++|+.++.+.+.|++|++++++++++ T Consensus 1 ma~~~~~~~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~-~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~ 79 (304) T protein:vir:10 1 MATPTYTPGNVILSDFKNGVIPAEQGTLIMKDIMANSAIMKLA-KNEPMTAQKKKFTYLAKGVGAYWVSETERIQTSKPE 79 (304) T ss_pred CcccccccccccccCCCceecchhHHHHHHHHHHhccchhhhc-ceeeccCCceEEEEEeCCcceEEeecCcccccccce Confidence 1111222333445566778899999999999999999999985 556667788999999999999999999999999999 Q ss_pred ceeEEeeeeeeeeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCccccc-----ccccccccccccccc Q lcl|NC_016164. 633 VDQVALVAKTLGAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSNSQPE-----GLKFVTGINTENFGA 707 (836) Q Consensus 633 ~~~it~~~~t~~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~-----Gi~~~~~~~~~t~aa 707 (836) |+++++++++++++++||+|+|.++.++++++|.++|+++++++++.++++|+|++ .|. +++........+... T Consensus 80 ~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~-~~~~~~~~~~~~~~~~~~~~~~~ 158 (304) T protein:vir:10 80 YAQAEMEAKKIGVIIPLSKEFLKWTAKDFFNEVKPLIAEAFYKAFDQAVIFGTKSP-YNTSTSGKPLVEGAEEKGNVVTD 158 (304) T ss_pred eeEEEEEEEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhheeccCCC-ccccccccccccccccccccccc Confidence 99999999999999999999999999999999999999999999999999999974 343 344444444444555 Q ss_pred cchhHHHHHHHHHHHhhhccccCccEEEecHHHHHHHHHHhhccCccccccCCCCeecceeeEeeCcccc----ceEEEE Q lcl|NC_016164. 708 TNPTYVELVSMESKVAADNADIGAMSYLTNSTLYGGFKTTEKATSTAQFVLEPGGTVNGYNVVRSNQVAN----GDVFFG 783 (836) Q Consensus 708 ~~~t~~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~~g~~~~~~~~~~~l~G~pVv~s~~~~~----~~i~~g 783 (836) +..++++|.+++.++..++. .+++|+|||++|..|++++|++|+|.|.. .+++|+|+||++++++|. +.++|| T Consensus 159 ~~~~~~~i~~~~~~l~~~~~--~~~~~v~~~~~~~~L~~lkd~~G~~l~~~-~~~~l~G~PV~~~~~~~~~~~~~~~~~g 235 (304) T protein:vir:10 159 TNNLYVDLSALMATIEDEEL--DPNGVLTTRSFRSKMRNALDANDRPLFDA-NGNEIMGLPLSYTGADVYDKKKSLALMG 235 (304) T ss_pred ccchHHHHHHHHHHhhhccC--CcCEEEEcHHHHHHHHHhhccCCcEeecC-CCccccceeeEEecccccCCCCcEEEEE Confidence 66789999999999987764 46789999999999999999999988744 357999999999999974 358999 Q ss_pred ehhceEEEeecceEEEEeccc----------------ccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 784 VWNQMIMGMWGALDIQVNPYA----------------LDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 784 D~s~~~i~~~~~l~i~~~~~~----------------~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) ||+++.+++++++++..+.+. .|++|++.||+++|+|+++.+|+||+++|.|= T Consensus 236 d~~~~~~~~~~~~~i~~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~~a~~~l~~a~ 304 (304) T protein:vir:10 236 DWDYARYGILQGIEYAISEDATLTTLQASDASGQPVSLFERDMFALRATMHIAYMNVKPEAFATLKPTE 304 (304) T ss_pred ehhhEEEEEecceEEEEeecceeeeecccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 999999999999999887652 48999999999999999999999999999999 No 37 >protein:vir:94142 Length: 304 # NCBI annotation: ORF013 # Family: family:all:507 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240234;genbank:gi:66395898;genbank:GeneID:5133311 Probab=100.00 E-value=8.2e-53 Score=306.14 Aligned_cols=279 Identities=16% Similarity=0.167 Sum_probs=238.1 Q ss_pred hhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEecCCceeeeeccCccccccccc Q lcl|NC_016164. 553 NDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQTGAATAYWVAEGGDPTESQPS 632 (836) Q Consensus 553 ~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~ 632 (836) +..........+.++.|+.++|+.+...|++.+++.+++++++ ++++..++.+++|+.++.+.+.|++|++++++++++ T Consensus 1 ma~~~~~~~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~-~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~ 79 (304) T protein:vir:94 1 MATPTYTPGNVILSDFKNGVIPAEQGTLIMKDIMANSAIMKLA-KNEPMTAQKKKFTYLAKGVGAYWVSETERIQTSKPE 79 (304) T ss_pred CcccccccccccccCCCceecchhHHHHHHHHHHhccchhhhc-ceeeccCCceEEEEEeCCcceEEeecCcccccccce Confidence 1111222333445566778899999999999999999999985 556667788999999999999999999999999999 Q ss_pred ceeEEeeeeeeeeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCccccc-----ccccccccccccccc Q lcl|NC_016164. 633 VDQVALVAKTLGAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSNSQPE-----GLKFVTGINTENFGA 707 (836) Q Consensus 633 ~~~it~~~~t~~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~-----Gi~~~~~~~~~t~aa 707 (836) |+++++++++++++++||+|+|.++.++++++|.++|+++++++++.++++|+|++ .|. +++........+... T Consensus 80 ~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~-~~~~~~~~~~~~~~~~~~~~~~~ 158 (304) T protein:vir:94 80 YAQAEMEAKKIGVIIPLSKEFLKWTAKDFFNEVKPLIAEAFYKAFDQAVIFGTKSP-YNTSTSGKPLVEGAEEKGNVVTD 158 (304) T ss_pred eeEEEEEEEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhheeccCCC-ccccccccccccccccccccccc Confidence 99999999999999999999999999999999999999999999999999999974 343 344444444444555 Q ss_pred cchhHHHHHHHHHHHhhhccccCccEEEecHHHHHHHHHHhhccCccccccCCCCeecceeeEeeCcccc----ceEEEE Q lcl|NC_016164. 708 TNPTYVELVSMESKVAADNADIGAMSYLTNSTLYGGFKTTEKATSTAQFVLEPGGTVNGYNVVRSNQVAN----GDVFFG 783 (836) Q Consensus 708 ~~~t~~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~~g~~~~~~~~~~~l~G~pVv~s~~~~~----~~i~~g 783 (836) +..++++|.+++.++..++. .+++|+|||++|..|++++|++|+|.|.. .+++|+|+||++++++|. +.++|| T Consensus 159 ~~~~~~~i~~~~~~l~~~~~--~~~~~v~~~~~~~~L~~lkd~~G~~l~~~-~~~~l~G~PV~~~~~~~~~~~~~~~~~g 235 (304) T protein:vir:94 159 TNNLYVDLSALMATIEDEEL--DPNGVLTTRSFRSKMRNALDANDRPLFDA-NGNEIMGLPLSYTGADVYDKKKSLALMG 235 (304) T ss_pred ccchHHHHHHHHHHhhhccC--CcCEEEEcHHHHHHHHHhhccCCcEeecC-CCccccceeeEEecccccCCCCcEEEEE Confidence 66789999999999987764 46789999999999999999999988744 357999999999999974 358999 Q ss_pred ehhceEEEeecceEEEEeccc----------------ccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 784 VWNQMIMGMWGALDIQVNPYA----------------LDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 784 D~s~~~i~~~~~l~i~~~~~~----------------~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) ||+++.+++++++++..+.+. .|++|++.||+++|+|+++.+|+||+++|.|= T Consensus 236 d~~~~~~~~~~~~~i~~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~~a~~~l~~a~ 304 (304) T protein:vir:94 236 DWDYARYGILQGIEYAISEDATLTTLQASDASGQPVSLFERDMFALRATMHIAYMNVKPEAFATLKPTE 304 (304) T ss_pred ehhhEEEEEecceEEEEeecceeeeecccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 999999999999999887652 48999999999999999999999999999999 No 38 >protein:vir:99749 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004307;genbank:gi:122891761;genbank:GeneID:4712304 Probab=100.00 E-value=2.3e-52 Score=303.68 Aligned_cols=298 Identities=13% Similarity=0.149 Sum_probs=242.5 Q ss_pred hhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEE Q lcl|NC_016164. 529 AFEREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAI 608 (836) Q Consensus 529 ~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~ 608 (836) |...+..+.-.+++. +........ . .........++.++|+.+.+.|++.+++.+++++++ +.++..++.+.+ T Consensus 1 ~~k~~~~~~~~~~~~----~~~~~~~~~-~-a~~~~~~~~~~~lip~~~~~~ii~~~~~~s~l~~~~-~~~~~~~~~~~~ 73 (324) T protein:vir:99 1 MEQTQKLKLNLQHFA----SNNVKPQVF-N-PDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMRLG-KYEPMEGTEKKF 73 (324) T ss_pred CCCchHhhHHHHHHH----HHhhhhhhc-c-ccceeccCCCcceechhHHHHHHHHHHhhchhhhhc-ceeeccCCceEE Confidence 000000010011100 000000000 1 112222334445778889999999999999999984 556677788999 Q ss_pred EEecCCceeeeeccCcccccccccceeEEeeeeeeeeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCc Q lcl|NC_016164. 609 PRQTGAATAYWVAEGGDPTESQPSVDQVALVAKTLGAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSN 688 (836) Q Consensus 609 p~~~~~~~a~~v~Eg~~~~~~~~~~~~it~~~~t~~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~ 688 (836) |+.++.+.+.|++|++++++++++|+++++.+++++++++||+|+|.|+..+++++|.+.|+++++++++.++|+|+|++ T Consensus 74 p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~ai~~~~d~~~l~G~g~~ 153 (324) T protein:vir:99 74 TFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNN 153 (324) T ss_pred EEEecCcceeEeccCccccccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhhcCCCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccccccccccccccchhHHHHHHHHHHHhhhccccCccEEEecHHHHHHHHHHhhccCccccccCCCCeeccee Q lcl|NC_016164. 689 SQPEGLKFVTGINTENFGATNPTYVELVSMESKVAADNADIGAMSYLTNSTLYGGFKTTEKATSTAQFVLEPGGTVNGYN 768 (836) Q Consensus 689 ~~p~Gi~~~~~~~~~t~aa~~~t~~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~~g~~~~~~~~~~~l~G~p 768 (836) +.|.|+++.....+ +...+.+++++|.+++..|...+. .+++|+|||.+|..|+.++|++|++.|....+++|+|+| T Consensus 154 ~~~~~~~~~~~~~~-~~~~~~~~~~~i~~~~~~l~~~~~--~~~~~v~n~~~~~~L~~l~d~~g~~~~~~~~~~~l~G~P 230 (324) T protein:vir:99 154 PFGKSIAQSIEKTN-KVIKGDFTQDNIIDLEALLEDDEL--EANAFISKTQNRSLLRKIVDPETKERIYDRNSDTLDGLP 230 (324) T ss_pred ccCccccccccccc-eeccccCCHHHHHHHHHhhhhccC--CCCEEEEcHHHHHHHHHhhcCCCceeecCCCCcccccee Confidence 88999887654433 334456789999999999987764 456899999999999999999999999888889999999 Q ss_pred eEeeCccc--cceEEEEehhceEEEeecceEEEEeccc--------------ccccCcEEEEEEEEeccEEEcccceEEE Q lcl|NC_016164. 769 VVRSNQVA--NGDVFFGVWNQMIMGMWGALDIQVNPYA--------------LDKSGSVRVTALQDVDVAVRHPEAFCRG 832 (836) Q Consensus 769 Vv~s~~~~--~~~i~~gD~s~~~i~~~~~l~i~~~~~~--------------~~~~~~~~~r~~~r~d~~v~~p~Af~~l 832 (836) |++++.++ .+.+++|||+++.+++++++++..+++. .|.+|++.||+++|+|+++.+|+||+++ T Consensus 231 Vv~~~~~~~~~~~~i~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l 310 (324) T protein:vir:99 231 VVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKL 310 (324) T ss_pred EEeecCCCCCcceEEEEecccEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecccceEEE Confidence 99998865 4569999999999999999999888763 3889999999999999999999999999 Q ss_pred eecC Q lcl|NC_016164. 833 NDNL 836 (836) Q Consensus 833 ~~A~ 836 (836) +.|- T Consensus 311 t~a~ 314 (324) T protein:vir:99 311 VPAD 314 (324) T ss_pred Eecc Confidence 9988 No 39 >protein:vir:102119 Length: 404 # NCBI annotation: phage major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699941;genbank:gi:110804052;genbank:GeneID:4206662 Probab=100.00 E-value=4.3e-51 Score=296.71 Aligned_cols=379 Identities=11% Similarity=0.144 Sum_probs=243.7 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHH---HHHHHHhhhhhhhHHHHHHHHhhhhhhHHHH Q lcl|NC_016164. 415 MEAVRAQAAADERSRVASITSLCREHKADDLAQGLIESGASEADAM---RSVLSEIAKRPAAQPATPAAPVRSAQPIAAG 491 (836) Q Consensus 415 ~e~~~~~~~~~~~~~~~ei~al~~~~~l~e~a~eliee~~t~~e~~---~~~l~~l~~~~~a~~~~~~~~~~~~~~~~~~ 491 (836) +.+..............+++ .+.++.....+.. ...++.+.+................. T Consensus 1 M~k~l~el~~~~~~~~~e~~-------------~~~~~~~~~~ee~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~----- 62 (404) T protein:vir:10 1 MSKELRELLNQLDSKNKELN-------------SLLNKDGVTAEELNKTSNEIDILQAKIEAQKRKENIENNFNE----- 62 (404) T ss_pred CcHHHHHHHHHHHHHHHHHH-------------HHHhhcCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh----- Confidence 11100011111111111111 1111111100111 11111111111100000000000000 Q ss_pred hhhhhhhhhhhHHHHHHHhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhccccccccc Q lcl|NC_016164. 492 GGSADIGLTDKEARSFSFVRAIRAQMMPGDRAAFEAAAFEREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTASAAGDL 571 (836) Q Consensus 492 ~~~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~~ 571 (836) ... ...... ... .................++. ....+ .......+.....+..+.|+. T Consensus 63 --~~~--------------~~~~~~-~~~-~~~~~~~~~~~~~~~~~~~~--~~~~~--~~~~~~e~~a~~~~~~~~gg~ 120 (404) T protein:vir:10 63 --DNV--------------KSLNTG-KEE-NVIYNGALFVRAIADNLLKQ--KNQRG--LNLSEKEINAISENIDEDGGY 120 (404) T ss_pred --hhc--------------cccccc-cch-hhHHHHHHHHHHHHHHHHHH--HHhhh--hcchhhHHhhhccccCCCCce Confidence 000 000000 000 00000000000000000000 00000 011122233344455567788 Q ss_pred ccchhhHHHHHHHHHhhhhhhhhccee-eecCCceEEEEEecCCceeeeeccCcccccc--cccceeEEeeeeeeeeeeh Q lcl|NC_016164. 572 VFTDGRPGSFIELLRNRLALNTLGVTM-LTGLQGPVAIPRQTGAATAYWVAEGGDPTES--QPSVDQVALVAKTLGAYTE 648 (836) Q Consensus 572 vvp~~~~~~ii~~l~~~~~l~~l~~~~-~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~--~~~~~~it~~~~t~~~~i~ 648 (836) ++|+.+...|++.+++.++|++++... ++...+.+.+|+.++.+.+.|++|++.++.+ +++|+++++++++++++++ T Consensus 121 ~vP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~g~~~~~~~~~~~~~~~v~e~~~~~~~~~~~~f~~i~~~~~k~~~~~~ 200 (404) T protein:vir:10 121 AVPEDIQTKINTRLKDTTDLYNMVDYEPVFTRSGSRTYEKRSKQKPMKPLSENQQIPTNGDNGKLERFNFKLKDLADFMS 200 (404) T ss_pred eechhHHHHHHHHHhhhhhHhhhhceeeccCCccceEEEEecCCcceeeccccccccccccccceeeeEeeheeeEeeeh Confidence 899999999999999999999986544 3345678999999999999999999998874 5899999999999999999 Q ss_pred hHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccccccccccchhHHHHHHHHH-HHhhhcc Q lcl|NC_016164. 649 FSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTENFGATNPTYVELVSMES-KVAADNA 727 (836) Q Consensus 649 ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~aa~~~t~~~l~~a~~-~l~~~~~ 727 (836) ||+++|.|+.++++++|.+.|+++++++++.+|++|+|++..|.||++..++.+.+.+ +..+++++..++. .+... T Consensus 201 iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~il~G~g~~~~~~gi~~~~~~~~~~~~-~~~~~~~~~~~~~~~l~~~-- 277 (404) T protein:vir:10 201 IPNDLLKFADKSLEDWIINWFVDKVRITRNAEILYGAGGDEHATGIMTANKFKKITLP-KSPALKDFKKCKNVELLNV-- 277 (404) T ss_pred hhHHHHhhcHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCcccceeeccccceeecc-ccccHHHHHHHHHhhhhcc-- Confidence 9999999999999999999999999999999999999999999999988777766554 3457888888876 34433 Q ss_pred ccCccEEEecHHHHHHHHHHhhccCcccccc----CCCCeecceeeEee-Ccccc-----ceEEEEehhc-eEEEeecce Q lcl|NC_016164. 728 DIGAMSYLTNSTLYGGFKTTEKATSTAQFVL----EPGGTVNGYNVVRS-NQVAN-----GDVFFGVWNQ-MIMGMWGAL 796 (836) Q Consensus 728 ~~~~~~~vmnp~~~~~L~~lkd~~g~~~~~~----~~~~~l~G~pVv~s-~~~~~-----~~i~~gD~s~-~~i~~~~~l 796 (836) ...+++|+|||++|..|++++|++|+|+|.+ +.+++|+|+||++. +.++. ..++||||++ |.++.++++ T Consensus 278 ~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~~~gd~s~~~~~~~~~~~ 357 (404) T protein:vir:10 278 FKATSSWIVNQDGFNYLDSLEDKTGRPYLQPDPKDPTQYRFLGLPVIELPNDLLLSTESAIPVLLGDTKEAYKYVSDGAY 357 (404) T ss_pred ccCCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCCccccceeeEEecccccCCCCCccEEEEEeccccEEEEEecce Confidence 3467899999999999999999999998753 34568999999854 44443 3489999996 778999999 Q ss_pred EEEEecc--cccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 797 DIQVNPY--ALDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 797 ~i~~~~~--~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) ++.++++ ..|.+|++.||++.|+|+++.+|+||++++.+. T Consensus 358 ~i~~~~~~~~~~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~ 399 (404) T protein:vir:10 358 ELATTNIGAGAFETNTTKARIIMRIDGNVKDSEALLIAEIPV 399 (404) T ss_pred EEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEeec Confidence 9988765 568999999999999999999999999999777 No 40 >protein:vir:4092 Length: 390 # NCBI annotation: major capsid protein a # Family: family:all:635 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510986;swissprot:trembl:q8w604;genbank:gi:17488508;uniprot:Q8W604;genbank:GeneID:1260361 Probab=100.00 E-value=1.8e-51 Score=298.83 Aligned_cols=353 Identities=16% Similarity=0.107 Sum_probs=239.0 Q ss_pred hhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhHHHHHHHHhhhhhhHHHHhhhhhhhhhhhHHHHHHHhhhhhhhhh Q lcl|NC_016164. 439 EHKADDLAQGLIESGASEADAMRSVLSEIAKRPAAQPATPAAPVRSAQPIAAGGGSADIGLTDKEARSFSFVRAIRAQMM 518 (836) Q Consensus 439 ~~~l~e~a~eliee~~t~~e~~~~~l~~l~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~a~~~ 518 (836) ...+++...+. .+......+.+...... ............ .... ... ..... ...... T Consensus 1 ik~L~e~~~e~-------~e~~~~~~~~~~~~~~~-~e~~~~~~~~~~----~~~~-~~~---~~~~~-~~~~~~----- 58 (390) T protein:vir:40 1 MNNLDKKDSET-------LNISTAFLNAIKEGATE-AEQVTAFTNMAE----QIQN-NII---AQARK-EVNREM----- 58 (390) T ss_pred CchHHHHHHHH-------HHHHHHHHHHHhhhhhH-HHHHHHHHHHHH----HHHH-HHH---HHHHH-HHHHHH----- Confidence 33333322222 12222222222111110 000000000000 0000 000 00000 000000 Q ss_pred hhhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhccee Q lcl|NC_016164. 519 PGDRAAFEAAAFEREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTM 598 (836) Q Consensus 519 ~~~~~~~~~~~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~ 598 (836) .........+.. ................++.++|++++|+.+.+.|++.++..++++++ +++ T Consensus 59 ---------------~~~~~~~~~~~~--~l~~~~r~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~~~~~s~i~~~-~~~ 120 (390) T protein:vir:40 59 ---------------NDNNVLASRGAN--ALTSDESKYYNEVIAGNGFAGVTALLPPTVFERVFEDLTVEHPLLSK-INF 120 (390) T ss_pred ---------------HHHHHHHhcCch--hccHHHHHHHHHHHhccCcccCcccccHHHHHHHHHHHHhhhhhhhh-cee Confidence 000000000000 00000000001122334456788999999999999999999999998 567 Q ss_pred eecCCceEEEEEecCCceeeeeccCccccc-ccccceeEEeeeeeeeeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHH Q lcl|NC_016164. 599 LTGLQGPVAIPRQTGAATAYWVAEGGDPTE-SQPSVDQVALVAKTLGAYTEFSRRLMLQSSIDVEQMVRTELATVIALEI 677 (836) Q Consensus 599 ~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~-~~~~~~~it~~~~t~~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~ 677 (836) .+...+...+|+.++.+.+.|++|+++++. ++++|+++++.+++++++++||+++|.|+..+++++|.++|++++++++ T Consensus 121 ~~~~~~~~~i~~~~~~~~a~~~~E~~~~~~~~~~~f~~i~l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~~i~~~~ 200 (390) T protein:vir:40 121 VNTTATTEWIISVGDVATAWWGPLCAEIKEVLDNGFDKIQTGMYKLSAYIPVCNAMLDLGPSWLDQYVRTILGEAMALGL 200 (390) T ss_pred eecCCceeEEEEEcCCcceeeeccccccCccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHH Confidence 777788899999999999999999999874 6899999999999999999999999999999999999999999999999 Q ss_pred HHHHHhhcCCcccccccccccccccc----cccccchhHHHHHHHHHHHhhhcc-----ccCccEEEecHHHH-H---HH Q lcl|NC_016164. 678 DRAALYGLGSNSQPEGLKFVTGINTE----NFGATNPTYVELVSMESKVAADNA-----DIGAMSYLTNSTLY-G---GF 744 (836) Q Consensus 678 d~~il~G~Gt~~~p~Gi~~~~~~~~~----t~aa~~~t~~~l~~a~~~l~~~~~-----~~~~~~~vmnp~~~-~---~L 744 (836) +.+|++|+|+ ++|.||++..+..+. ...+..+++.++..+...+...+. ...+++|+|||.++ . .+ T Consensus 201 ~~a~l~G~G~-~~P~Gil~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~a~~i~n~~t~~~~l~~~ 279 (390) T protein:vir:40 201 EAGIVNGSGK-DQPIGMMRDLNNVTAGEHPVKTATPLTDLTPATLATKVMLPLTDNGKKSVSDAILVINPADYWSKIYAA 279 (390) T ss_pred HhhhhcccCC-CccceeeeccccccccccccccccccchhhHHHHHHHHHHHhhcchhhhhcCceEEEcchhHHHHHHHH Confidence 9999999996 589999986543332 123344667777777766655432 23578899999874 3 44 Q ss_pred HHHhhccCccccccCCCCeecceeeEeeCccccceEEEEehhceEEEeecceEEEEecccccccCcEEEEEEEEeccEEE Q lcl|NC_016164. 745 KTTEKATSTAQFVLEPGGTVNGYNVVRSNQVANGDVFFGVWNQMIMGMWGALDIQVNPYALDKSGSVRVTALQDVDVAVR 824 (836) Q Consensus 745 ~~lkd~~g~~~~~~~~~~~l~G~pVv~s~~~~~~~i~~gD~s~~~i~~~~~l~i~~~~~~~~~~~~~~~r~~~r~d~~v~ 824 (836) ..+++.+|++.+ ...++|+||++++++|+++++||||++|.+++++++++.++++.+|.+|++.||++.|+|++++ T Consensus 280 ~~~~d~~G~~v~----~~~~~g~pvv~~~~~p~~~i~~Gd~s~~~i~~~~~~~v~~~~~~~f~~~~~~~r~~~r~dg~v~ 355 (390) T protein:vir:40 280 TSYMTPQGVWVT----GILPVPLEIVQSVAVPVGKAVAGRAKDYFMGIGSEQVIRTSTEYRLLDDETLYYAKQYANGRPK 355 (390) T ss_pred hhccCCCCcccc----ccCCCceeEEEcCCCCCCcEEEEeeceEEEEeecceEEEecchhhhhcCcEEEEEEEEeCCEEe Confidence 578888888654 2335899999999999999999999999999999999999999999999999999999999999 Q ss_pred cccceEEEee-------cC Q lcl|NC_016164. 825 HPEAFCRGND-------NL 836 (836) Q Consensus 825 ~p~Af~~l~~-------A~ 836 (836) +|+||++++. +| T Consensus 356 ~~~A~~~l~~~~~~~~~~~ 374 (390) T protein:vir:40 356 DNSSFLVFDITGLEGSPAI 374 (390) T ss_pred cccceEEEEeeccCCCCCC Confidence 9999999973 22 No 41 >protein:vir:4700 Length: 415 # NCBI annotation: phi PVL ORF 7 homologue # Family: family:all:21 # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061632;genbank:gi:9635719;genbank:GeneID:1262976 Probab=100.00 E-value=1.1e-50 Score=294.55 Aligned_cols=387 Identities=10% Similarity=0.028 Sum_probs=237.4 Q ss_pred hhhhhhhhh---hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhHHHHHHHHhhhhhhH Q lcl|NC_016164. 412 TIDMEAVRA---QAAADERSRVASITSLCREHKADDLAQGLIESGASEADAMRSVLSEIAKRPAAQPATPAAPVRSAQPI 488 (836) Q Consensus 412 ~~~~e~~~~---~~~~~~~~~~~ei~al~~~~~l~e~a~eliee~~t~~e~~~~~l~~l~~~~~a~~~~~~~~~~~~~~~ 488 (836) .+..++... ...........++..+.... .....+.....++.+..+.................. T Consensus 1 mk~~~em~~~l~el~~~~~~~~~e~~~~~~~~------------~~e~~~~~~~ev~~l~~~i~~~~~~~~~~~~~~~~~ 68 (415) T protein:vir:47 1 MKTKEELQSEISDIKRQIDLKVKYATRALNND------------ELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDRTS 68 (415) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHhchh------------hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 000000000 00000011111111110000 000011111111111111111111110000000000 Q ss_pred HHHhhhhhhhhhhhHHHHHHHhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhcccccc Q lcl|NC_016164. 489 AAGGGSADIGLTDKEARSFSFVRAIRAQMMPGDRAAFEAAAFEREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTASAA 568 (836) Q Consensus 489 ~~~~~~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~ 568 (836) .... .. .................. ..............+.+.. .............++++ T Consensus 69 ~~~~--~~--~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~-----------~~~~~~~~~~~~~~t~~ 128 (415) T protein:vir:47 69 ENNQ--QS--VEVNEARTYRNQANINDL-----GISIQNTKVTSQEVRDFTE-----------YLETRNDIQGGSLKTDS 128 (415) T ss_pred hhcc--cc--cccchhhhhHHHHHHHHH-----HHhhhhhhhhHHHHHHHHH-----------HHhhhhhhhhccccccC Confidence 0000 00 000000000000000000 0000000000000011100 00011111223334557 Q ss_pred cccccchhhHHHHHHHHHhhhhhhhhcceeeecCC--ceEEEEEecCCceeeeeccCccccc-ccccceeEEeeeeeeee Q lcl|NC_016164. 569 GDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQ--GPVAIPRQTGAATAYWVAEGGDPTE-SQPSVDQVALVAKTLGA 645 (836) Q Consensus 569 g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~--~~~~~p~~~~~~~a~~v~Eg~~~~~-~~~~~~~it~~~~t~~~ 645 (836) |+.++|+.+.+.|++.+++.++|++++. +.+..+ +.+.+++.++.+.+.|++|++++++ +.++|+++++.++++++ T Consensus 129 g~~~iP~~~~~~ii~~~~~~~~l~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~ 207 (415) T protein:vir:47 129 GFVVIPEEIVTDILKLKEVEFNLDKYVT-VKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRG 207 (415) T ss_pred CcccccHHHHHHHHHHHHhhhhhhhhcc-eeeccCCceeEEEEEecCCcceeecccccccccccccceeeEEeeeeeeEe Confidence 8889999999999999999999999954 444443 4556666677888999999999997 56899999999999999 Q ss_pred eehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccccccccccchhHHHHHHHHHHHhhh Q lcl|NC_016164. 646 YTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTENFGATNPTYVELVSMESKVAAD 725 (836) Q Consensus 646 ~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~aa~~~t~~~l~~a~~~l~~~ 725 (836) +++||+++|.|+.+++.++|.+.|+++++++++.+|++|+|++..+.++.............+..++++|.+++.++... T Consensus 208 ~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~ 287 (415) T protein:vir:47 208 YFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKP 287 (415) T ss_pred eehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCCccccccccccccceeccccccchHHHHHHHHhhhhh Confidence 99999999999999999999999999999999999999999865555555444444455556668899999999999877 Q ss_pred ccccCccEEEecHHHHHHHHHHhhccCcccccc----CCCCeecceeeEeeCccccc-----eEEEEehhc-eEEEeecc Q lcl|NC_016164. 726 NADIGAMSYLTNSTLYGGFKTTEKATSTAQFVL----EPGGTVNGYNVVRSNQVANG-----DVFFGVWNQ-MIMGMWGA 795 (836) Q Consensus 726 ~~~~~~~~~vmnp~~~~~L~~lkd~~g~~~~~~----~~~~~l~G~pVv~s~~~~~~-----~i~~gD~s~-~~i~~~~~ 795 (836) +. .+++|+|||++|..|+.++|++|+|+|.. +.+++|+|+||++++++|.+ .++||||++ |.++++.+ T Consensus 288 ~~--~~~~~v~n~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~ 365 (415) T protein:vir:47 288 NY--EHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQ 365 (415) T ss_pred cc--CCCEEEEcHHHHHHHHHhhccCCCeeeccCcCCCCCccccceeeEEeccccccCCCccEEEEEehhccEEEEeecc Confidence 64 57899999999999999999999998743 34578999999999998854 389999997 67888999 Q ss_pred eEEEEecccccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 796 LDIQVNPYALDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 796 l~i~~~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) +.+.+++ |.++++.+|+++|+|+++++|+||++++..= T Consensus 366 ~~v~~~~---~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~ 403 (415) T protein:vir:47 366 YQASWTD---YMHFGECLMIAVRQDCRILDYKSAIVIEYDD 403 (415) T ss_pred eEEEeec---cccCceEEEEEEEeccEEeccccEEEEEeec Confidence 9998876 5667789999999999999999999988433 No 42 >protein:vir:4600 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058445;genbank:gi:9635171;genbank:GeneID:1262708 Probab=100.00 E-value=1.1e-50 Score=294.55 Aligned_cols=387 Identities=10% Similarity=0.028 Sum_probs=237.4 Q ss_pred hhhhhhhhh---hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhHHHHHHHHhhhhhhH Q lcl|NC_016164. 412 TIDMEAVRA---QAAADERSRVASITSLCREHKADDLAQGLIESGASEADAMRSVLSEIAKRPAAQPATPAAPVRSAQPI 488 (836) Q Consensus 412 ~~~~e~~~~---~~~~~~~~~~~ei~al~~~~~l~e~a~eliee~~t~~e~~~~~l~~l~~~~~a~~~~~~~~~~~~~~~ 488 (836) .+..++... ...........++..+.... .....+.....++.+..+.................. T Consensus 1 mk~~~em~~~l~el~~~~~~~~~e~~~~~~~~------------~~e~~~~~~~ev~~l~~~i~~~~~~~~~~~~~~~~~ 68 (415) T protein:vir:46 1 MKTKEELQSEISDIKRQIDLKVKYATRALNND------------ELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDRTS 68 (415) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHhchh------------hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 000000000 00000011111111110000 000011111111111111111111110000000000 Q ss_pred HHHhhhhhhhhhhhHHHHHHHhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhcccccc Q lcl|NC_016164. 489 AAGGGSADIGLTDKEARSFSFVRAIRAQMMPGDRAAFEAAAFEREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTASAA 568 (836) Q Consensus 489 ~~~~~~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~ 568 (836) .... .. .................. ..............+.+.. .............++++ T Consensus 69 ~~~~--~~--~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~-----------~~~~~~~~~~~~~~t~~ 128 (415) T protein:vir:46 69 ENNQ--QS--VEVNEARTYRNQANINDL-----GISIQNTKVTSQEVRDFTE-----------YLETRNDIQGGSLKTDS 128 (415) T ss_pred hhcc--cc--cccchhhhhHHHHHHHHH-----HHhhhhhhhhHHHHHHHHH-----------HHhhhhhhhhccccccC Confidence 0000 00 000000000000000000 0000000000000011100 00011111223334557 Q ss_pred cccccchhhHHHHHHHHHhhhhhhhhcceeeecCC--ceEEEEEecCCceeeeeccCccccc-ccccceeEEeeeeeeee Q lcl|NC_016164. 569 GDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQ--GPVAIPRQTGAATAYWVAEGGDPTE-SQPSVDQVALVAKTLGA 645 (836) Q Consensus 569 g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~--~~~~~p~~~~~~~a~~v~Eg~~~~~-~~~~~~~it~~~~t~~~ 645 (836) |+.++|+.+.+.|++.+++.++|++++. +.+..+ +.+.+++.++.+.+.|++|++++++ +.++|+++++.++++++ T Consensus 129 g~~~iP~~~~~~ii~~~~~~~~l~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~ 207 (415) T protein:vir:46 129 GFVVIPEEIVTDILKLKEVEFNLDKYVT-VKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRG 207 (415) T ss_pred CcccccHHHHHHHHHHHHhhhhhhhhcc-eeeccCCceeEEEEEecCCcceeecccccccccccccceeeEEeeeeeeEe Confidence 8889999999999999999999999954 444443 4556666677888999999999997 56899999999999999 Q ss_pred eehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccccccccccchhHHHHHHHHHHHhhh Q lcl|NC_016164. 646 YTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTENFGATNPTYVELVSMESKVAAD 725 (836) Q Consensus 646 ~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~aa~~~t~~~l~~a~~~l~~~ 725 (836) +++||+++|.|+.+++.++|.+.|+++++++++.+|++|+|++..+.++.............+..++++|.+++.++... T Consensus 208 ~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~ 287 (415) T protein:vir:46 208 YFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKP 287 (415) T ss_pred eehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCCccccccccccccceeccccccchHHHHHHHHhhhhh Confidence 99999999999999999999999999999999999999999865555555444444455556668899999999999877 Q ss_pred ccccCccEEEecHHHHHHHHHHhhccCcccccc----CCCCeecceeeEeeCccccc-----eEEEEehhc-eEEEeecc Q lcl|NC_016164. 726 NADIGAMSYLTNSTLYGGFKTTEKATSTAQFVL----EPGGTVNGYNVVRSNQVANG-----DVFFGVWNQ-MIMGMWGA 795 (836) Q Consensus 726 ~~~~~~~~~vmnp~~~~~L~~lkd~~g~~~~~~----~~~~~l~G~pVv~s~~~~~~-----~i~~gD~s~-~~i~~~~~ 795 (836) +. .+++|+|||++|..|+.++|++|+|+|.. +.+++|+|+||++++++|.+ .++||||++ |.++++.+ T Consensus 288 ~~--~~~~~v~n~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~ 365 (415) T protein:vir:46 288 NY--EHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQ 365 (415) T ss_pred cc--CCCEEEEcHHHHHHHHHhhccCCCeeeccCcCCCCCccccceeeEEeccccccCCCccEEEEEehhccEEEEeecc Confidence 64 57899999999999999999999998743 34578999999999998854 389999997 67888999 Q ss_pred eEEEEecccccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 796 LDIQVNPYALDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 796 l~i~~~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) +.+.+++ |.++++.+|+++|+|+++++|+||++++..= T Consensus 366 ~~v~~~~---~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~ 403 (415) T protein:vir:46 366 YQASWTD---YMHFGECLMIAVRQDCRILDYKSAIVIEYDD 403 (415) T ss_pred eEEEeec---cccCceEEEEEEEeccEEeccccEEEEEeec Confidence 9998876 5667789999999999999999999988433 No 43 >protein:vir:94673 Length: 419 # NCBI annotation: major capsid protein # Family: family:all:585 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579208;genbank:gi:93007444;genbank:GeneID:5076792 Probab=100.00 E-value=5.5e-51 Score=296.14 Aligned_cols=394 Identities=19% Similarity=0.183 Sum_probs=241.5 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhHHHHHHHHhhhhhhHHHHh Q lcl|NC_016164. 413 IDMEAVRAQAAADERSRVASITSLCREHKADDLAQGLIESGASEADAMRSVLSEIAKRPAAQPATPAAPVRSAQPIAAGG 492 (836) Q Consensus 413 ~~~e~~~~~~~~~~~~~~~ei~al~~~~~l~e~a~eliee~~t~~e~~~~~l~~l~~~~~a~~~~~~~~~~~~~~~~~~~ 492 (836) ++..+...........+.....+ ..+..++..++.....+......+.+..+................. T Consensus 1 m~~~~~lee~~a~l~~~~~~~~~------~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----- 69 (419) T protein:vir:94 1 MPPTPTLEEQRAALLARLDDTSL------TTEQVQEIVAEARGLADALQAESDRAAARAALLRTAPPAPKGPADG----- 69 (419) T ss_pred CCHHHHHHHHHHHHHHHHHHHHH------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh----- Confidence 11111100000000000000000 0001111111111111111111111111111111000000000000 Q ss_pred hhhhhhhhhhHHHHHHHhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhcccccccccc Q lcl|NC_016164. 493 GSADIGLTDKEARSFSFVRAIRAQMMPGDRAAFEAAAFEREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTASAAGDLV 572 (836) Q Consensus 493 ~~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~~v 572 (836) ........ ................................. ................+...++..+ T Consensus 70 ---~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~ 135 (419) T protein:vir:94 70 ---GTPLTPAE---AGTFRSLAQRFADSDGLREYRARDKRGQFQVEM--------RDIDPNRLLSRDAPAGTITNPNVPH 135 (419) T ss_pred ---hccccccc---cccccchhhhhhhHHHHHHHHHhhhhhhhhHHH--------HHHHHHHhhccccccccccCCcccc Confidence 00000000 000000000000000000000000000000000 0000011111222334445667778 Q ss_pred cchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEecC--------CceeeeeccCcccccccccceeEEeeeeeee Q lcl|NC_016164. 573 FTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQTG--------AATAYWVAEGGDPTESQPSVDQVALVAKTLG 644 (836) Q Consensus 573 vp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~~--------~~~a~~v~Eg~~~~~~~~~~~~it~~~~t~~ 644 (836) +|..+.+.++..++....+++++ ++.+..++.+.+|+.++ .+.+.|++||+.+++++++|+++++.+++++ T Consensus 136 ~p~~~~~~i~~~~~~~~~i~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~ 214 (419) T protein:vir:94 136 LPQLVPGIVPTTPDLPLLVADLL-DQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVA 214 (419) T ss_pred cchhhhHHHHHHHhhhhhhhhcc-eeeeccCCceeeeeeccccccccccCcccceecCCccccccccceeeEEeeeeeEE Confidence 89999988888888888899884 45666777888887654 3457899999999999999999999999999 Q ss_pred eeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccccccc------cccchhHHHHHHH Q lcl|NC_016164. 645 AYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTENF------GATNPTYVELVSM 718 (836) Q Consensus 645 ~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~------aa~~~t~~~l~~a 718 (836) ++++||+++|.|+ .+++++|..+|++++++++|.+||+|+|+ ++|+||++.+++...+. ......+++|.++ T Consensus 215 ~~~~is~ell~d~-~~l~~~i~~~la~a~~~~~d~aii~G~G~-~~p~Gi~~~~~~~~~~~~~~~~~~t~~~~~~~l~~~ 292 (419) T protein:vir:94 215 HWLPITRQAADDN-SQLMGYIQGRLTYGLRFLRDRQLLNGNGS-TEMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRA 292 (419) T ss_pred EeehhhHHHHHhH-HHHHHHHHHHHHHHHHHHHHHHHHhccCc-ccccceecccccccccccccccccccchhHHHHHHH Confidence 9999999999865 68999999999999999999999999997 58999998877654432 2233468999999 Q ss_pred HHHHhhhccccCccEEEecHHHHHHHHHHhhccCcccccc-----CCCCeecceeeEeeCccccceEEEEehhc-eEEEe Q lcl|NC_016164. 719 ESKVAADNADIGAMSYLTNSTLYGGFKTTEKATSTAQFVL-----EPGGTVNGYNVVRSNQVANGDVFFGVWNQ-MIMGM 792 (836) Q Consensus 719 ~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~~g~~~~~~-----~~~~~l~G~pVv~s~~~~~~~i~~gD~s~-~~i~~ 792 (836) ++.+...+. .+++|+|||++|..|..+++.+|++.++. +.+++|+|+||++++++|+++++||||+. |.+++ T Consensus 293 ~~~~~~~~~--~~~~~v~n~~~~~~l~~~k~~~~~~~~~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~gd~~~~~~~~~ 370 (419) T protein:vir:94 293 KTVAEIAGF--PPDGVVVHPQDWESIELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQGTALVGGFRQGATLWS 370 (419) T ss_pred HHhhhhccC--CCCEEEEcHHHHHHHHHHhhcCCCceeecCCcccCCCccccceeeEEcCCCCCccEEEeeccceEEEEE Confidence 999987764 56799999999999999999888765432 33569999999999999999999999997 67888 Q ss_pred ecceEEEEeccc--ccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 793 WGALDIQVNPYA--LDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 793 ~~~l~i~~~~~~--~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) +.++++.++++. +|.+|++.||++.|+|+++++|+||++++.+= T Consensus 371 ~~~~~v~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~~~~~a 416 (419) T protein:vir:94 371 RQGITVLMTDSHADFFTANTLVILAEFRANLAVYQPKAFVRVTFAA 416 (419) T ss_pred ecceEEEEeccccchhhcCcEEEEEEEeeccEEeccccEEEEEecc Confidence 999999998875 49999999999999999999999999998665 No 44 >protein:vir:81100 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429874;genbank:gi:156603927;genbank:GeneID:5525320 Probab=100.00 E-value=1.1e-50 Score=294.38 Aligned_cols=390 Identities=10% Similarity=0.043 Sum_probs=236.9 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhh-hhhhHHHHHHHHHHHhhhhhhhHHHHHHHHhhhhhhHHH Q lcl|NC_016164. 412 TIDMEAVRAQAAADERSRVASITSLCREHKADDLAQGLIE-SGASEADAMRSVLSEIAKRPAAQPATPAAPVRSAQPIAA 490 (836) Q Consensus 412 ~~~~e~~~~~~~~~~~~~~~ei~al~~~~~l~e~a~elie-e~~t~~e~~~~~l~~l~~~~~a~~~~~~~~~~~~~~~~~ 490 (836) ++..++..... .+..++.....+. ...+.. +.....+.....++.+..+.................... T Consensus 1 mk~~~el~~~l-~el~~~~~~~~~e---------~~~~l~~~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~ 70 (415) T protein:vir:81 1 MKTKEELQSEI-SDIKRQIDLKVKY---------ATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSEN 70 (415) T ss_pred CchHHHHHHHH-HHHHHHHHHHHHH---------HHHHhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhh Confidence 00001100000 0000000000000 000000 000000111111111111111111111000000000000 Q ss_pred HhhhhhhhhhhhHHHHHHHhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhcccccccc Q lcl|NC_016164. 491 GGGSADIGLTDKEARSFSFVRAIRAQMMPGDRAAFEAAAFEREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTASAAGD 570 (836) Q Consensus 491 ~~~~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~ 570 (836) ....... ...+..... ...... ...........+..+.+. ..............+.++|+ T Consensus 71 ~~~~~~~-~~~~~~~~~---~~~~~~---------~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~gg 130 (415) T protein:vir:81 71 NQQSVEV-NEARTYRNQ---ANINDL---------GISIQNTKVTSQEVRDFT-------EYLETRNDIQGGSLKTDSGF 130 (415) T ss_pred ccccccc-chhhhHHHH---HHHHHH---------hhhhhhhhhHHHHHHHHH-------HHHhhhhhhhhccccccccc Confidence 0000000 000000000 000000 000000000000000000 00000111112233455688 Q ss_pred cccchhhHHHHHHHHHhhhhhhhhcceee-ecCCceEEEEEecCCceeeeeccCcccccc-cccceeEEeeeeeeeeeeh Q lcl|NC_016164. 571 LVFTDGRPGSFIELLRNRLALNTLGVTML-TGLQGPVAIPRQTGAATAYWVAEGGDPTES-QPSVDQVALVAKTLGAYTE 648 (836) Q Consensus 571 ~vvp~~~~~~ii~~l~~~~~l~~l~~~~~-~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~-~~~~~~it~~~~t~~~~i~ 648 (836) .++|+.+.+.|++.+++.++|++++..+. +...+.+.+++.++...+.|++|++++++. .++|+++++.+++++++++ T Consensus 131 ~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~ 210 (415) T protein:vir:81 131 VVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFR 210 (415) T ss_pred cccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEeecCCccceeeccccccCcccccceeeEEeeeeeeEeeeh Confidence 89999999999999999999999855432 234567888888888999999999999964 6899999999999999999 Q ss_pred hHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccccccccccchhHHHHHHHHHHHhhhccc Q lcl|NC_016164. 649 FSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTENFGATNPTYVELVSMESKVAADNAD 728 (836) Q Consensus 649 ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~aa~~~t~~~l~~a~~~l~~~~~~ 728 (836) ||++||.|+.++++++|.+.|+++++++++.+|++|+|++..+.++.........+...+..+|++|.+++.++...+. T Consensus 211 iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~- 289 (415) T protein:vir:81 211 ISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNY- 289 (415) T ss_pred hhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccccccccccccccccccccchhHHHHHHHhhhhhcc- Confidence 9999999999999999999999999999999999999986544444444444444555566889999999999977653 Q ss_pred cCccEEEecHHHHHHHHHHhhccCccccccC----CCCeecceeeEeeCccccce-----EEEEehhc-eEEEeecceEE Q lcl|NC_016164. 729 IGAMSYLTNSTLYGGFKTTEKATSTAQFVLE----PGGTVNGYNVVRSNQVANGD-----VFFGVWNQ-MIMGMWGALDI 798 (836) Q Consensus 729 ~~~~~~vmnp~~~~~L~~lkd~~g~~~~~~~----~~~~l~G~pVv~s~~~~~~~-----i~~gD~s~-~~i~~~~~l~i 798 (836) .+++|+|||++|..|+.++|++|+|+|.+. .+++|+|+||++++.+|.+. ++||||++ |.++++.++++ T Consensus 290 -~~~~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~v 368 (415) T protein:vir:81 290 -EHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQA 368 (415) T ss_pred -CCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCCceecceeeEEecccccCCCCccEEEEEehhccEEEEeecceEE Confidence 578999999999999999999999987543 35689999999999888543 89999997 66889999999 Q ss_pred EEecccccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 799 QVNPYALDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 799 ~~~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) .++++ .++++.+|+++|+|+++++|+||++++..= T Consensus 369 ~~~~~---~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~ 403 (415) T protein:vir:81 369 SWTDY---MHFGECLMIAVRQDCRILDYKSAIVIEYDD 403 (415) T ss_pred EEecc---ccCceEEEEEEEeccEEeccccEEEEEEec Confidence 98764 556678999999999999999999987433 No 45 >protein:vir:98339 Length: 415 # NCBI annotation: putative capsid protein # Family: family:all:21 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918931;genbank:gi:119443693;genbank:GeneID:4594501 Probab=100.00 E-value=1.1e-50 Score=294.38 Aligned_cols=390 Identities=10% Similarity=0.043 Sum_probs=236.9 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhh-hhhhHHHHHHHHHHHhhhhhhhHHHHHHHHhhhhhhHHH Q lcl|NC_016164. 412 TIDMEAVRAQAAADERSRVASITSLCREHKADDLAQGLIE-SGASEADAMRSVLSEIAKRPAAQPATPAAPVRSAQPIAA 490 (836) Q Consensus 412 ~~~~e~~~~~~~~~~~~~~~ei~al~~~~~l~e~a~elie-e~~t~~e~~~~~l~~l~~~~~a~~~~~~~~~~~~~~~~~ 490 (836) ++..++..... .+..++.....+. ...+.. +.....+.....++.+..+.................... T Consensus 1 mk~~~el~~~l-~el~~~~~~~~~e---------~~~~l~~~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~ 70 (415) T protein:vir:98 1 MKTKEELQSEI-SDIKRQIDLKVKY---------ATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSEN 70 (415) T ss_pred CchHHHHHHHH-HHHHHHHHHHHHH---------HHHHhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhh Confidence 00001100000 0000000000000 000000 000000111111111111111111111000000000000 Q ss_pred HhhhhhhhhhhhHHHHHHHhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhcccccccc Q lcl|NC_016164. 491 GGGSADIGLTDKEARSFSFVRAIRAQMMPGDRAAFEAAAFEREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTASAAGD 570 (836) Q Consensus 491 ~~~~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~ 570 (836) ....... ...+..... ...... ...........+..+.+. ..............+.++|+ T Consensus 71 ~~~~~~~-~~~~~~~~~---~~~~~~---------~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~gg 130 (415) T protein:vir:98 71 NQQSVEV-NEARTYRNQ---ANINDL---------GISIQNTKVTSQEVRDFT-------EYLETRNDIQGGSLKTDSGF 130 (415) T ss_pred ccccccc-chhhhHHHH---HHHHHH---------hhhhhhhhhHHHHHHHHH-------HHHhhhhhhhhccccccccc Confidence 0000000 000000000 000000 000000000000000000 00000111112233455688 Q ss_pred cccchhhHHHHHHHHHhhhhhhhhcceee-ecCCceEEEEEecCCceeeeeccCcccccc-cccceeEEeeeeeeeeeeh Q lcl|NC_016164. 571 LVFTDGRPGSFIELLRNRLALNTLGVTML-TGLQGPVAIPRQTGAATAYWVAEGGDPTES-QPSVDQVALVAKTLGAYTE 648 (836) Q Consensus 571 ~vvp~~~~~~ii~~l~~~~~l~~l~~~~~-~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~-~~~~~~it~~~~t~~~~i~ 648 (836) .++|+.+.+.|++.+++.++|++++..+. +...+.+.+++.++...+.|++|++++++. .++|+++++.+++++++++ T Consensus 131 ~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~ 210 (415) T protein:vir:98 131 VVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFR 210 (415) T ss_pred cccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEeecCCccceeeccccccCcccccceeeEEeeeeeeEeeeh Confidence 89999999999999999999999855432 234567888888888999999999999964 6899999999999999999 Q ss_pred hHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccccccccccchhHHHHHHHHHHHhhhccc Q lcl|NC_016164. 649 FSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTENFGATNPTYVELVSMESKVAADNAD 728 (836) Q Consensus 649 ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~aa~~~t~~~l~~a~~~l~~~~~~ 728 (836) ||++||.|+.++++++|.+.|+++++++++.+|++|+|++..+.++.........+...+..+|++|.+++.++...+. T Consensus 211 iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~- 289 (415) T protein:vir:98 211 ISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNY- 289 (415) T ss_pred hhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccccccccccccccccccccchhHHHHHHHhhhhhcc- Confidence 9999999999999999999999999999999999999986544444444444444555566889999999999977653 Q ss_pred cCccEEEecHHHHHHHHHHhhccCccccccC----CCCeecceeeEeeCccccce-----EEEEehhc-eEEEeecceEE Q lcl|NC_016164. 729 IGAMSYLTNSTLYGGFKTTEKATSTAQFVLE----PGGTVNGYNVVRSNQVANGD-----VFFGVWNQ-MIMGMWGALDI 798 (836) Q Consensus 729 ~~~~~~vmnp~~~~~L~~lkd~~g~~~~~~~----~~~~l~G~pVv~s~~~~~~~-----i~~gD~s~-~~i~~~~~l~i 798 (836) .+++|+|||++|..|+.++|++|+|+|.+. .+++|+|+||++++.+|.+. ++||||++ |.++++.++++ T Consensus 290 -~~~~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~v 368 (415) T protein:vir:98 290 -EHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQA 368 (415) T ss_pred -CCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCCceecceeeEEecccccCCCCccEEEEEehhccEEEEeecceEE Confidence 578999999999999999999999987543 35689999999999888543 89999997 66889999999 Q ss_pred EEecccccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 799 QVNPYALDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 799 ~~~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) .++++ .++++.+|+++|+|+++++|+||++++..= T Consensus 369 ~~~~~---~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~ 403 (415) T protein:vir:98 369 SWTDY---MHFGECLMIAVRQDCRILDYKSAIVIEYDD 403 (415) T ss_pred EEecc---ccCceEEEEEEEeccEEeccccEEEEEEec Confidence 98764 556678999999999999999999987433 No 46 >protein:vir:79987 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430002;genbank:gi:156604057;genbank:GeneID:5525447 Probab=100.00 E-value=1.1e-50 Score=294.38 Aligned_cols=390 Identities=10% Similarity=0.043 Sum_probs=236.9 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhh-hhhhHHHHHHHHHHHhhhhhhhHHHHHHHHhhhhhhHHH Q lcl|NC_016164. 412 TIDMEAVRAQAAADERSRVASITSLCREHKADDLAQGLIE-SGASEADAMRSVLSEIAKRPAAQPATPAAPVRSAQPIAA 490 (836) Q Consensus 412 ~~~~e~~~~~~~~~~~~~~~ei~al~~~~~l~e~a~elie-e~~t~~e~~~~~l~~l~~~~~a~~~~~~~~~~~~~~~~~ 490 (836) ++..++..... .+..++.....+. ...+.. +.....+.....++.+..+.................... T Consensus 1 mk~~~el~~~l-~el~~~~~~~~~e---------~~~~l~~~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~ 70 (415) T protein:vir:79 1 MKTKEELQSEI-SDIKRQIDLKVKY---------ATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSEN 70 (415) T ss_pred CchHHHHHHHH-HHHHHHHHHHHHH---------HHHHhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhh Confidence 00001100000 0000000000000 000000 000000111111111111111111111000000000000 Q ss_pred HhhhhhhhhhhhHHHHHHHhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhcccccccc Q lcl|NC_016164. 491 GGGSADIGLTDKEARSFSFVRAIRAQMMPGDRAAFEAAAFEREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTASAAGD 570 (836) Q Consensus 491 ~~~~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~ 570 (836) ....... ...+..... ...... ...........+..+.+. ..............+.++|+ T Consensus 71 ~~~~~~~-~~~~~~~~~---~~~~~~---------~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~gg 130 (415) T protein:vir:79 71 NQQSVEV-NEARTYRNQ---ANINDL---------GISIQNTKVTSQEVRDFT-------EYLETRNDIQGGSLKTDSGF 130 (415) T ss_pred ccccccc-chhhhHHHH---HHHHHH---------hhhhhhhhhHHHHHHHHH-------HHHhhhhhhhhccccccccc Confidence 0000000 000000000 000000 000000000000000000 00000111112233455688 Q ss_pred cccchhhHHHHHHHHHhhhhhhhhcceee-ecCCceEEEEEecCCceeeeeccCcccccc-cccceeEEeeeeeeeeeeh Q lcl|NC_016164. 571 LVFTDGRPGSFIELLRNRLALNTLGVTML-TGLQGPVAIPRQTGAATAYWVAEGGDPTES-QPSVDQVALVAKTLGAYTE 648 (836) Q Consensus 571 ~vvp~~~~~~ii~~l~~~~~l~~l~~~~~-~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~-~~~~~~it~~~~t~~~~i~ 648 (836) .++|+.+.+.|++.+++.++|++++..+. +...+.+.+++.++...+.|++|++++++. .++|+++++.+++++++++ T Consensus 131 ~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~ 210 (415) T protein:vir:79 131 VVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFR 210 (415) T ss_pred cccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEeecCCccceeeccccccCcccccceeeEEeeeeeeEeeeh Confidence 89999999999999999999999855432 234567888888888999999999999964 6899999999999999999 Q ss_pred hHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccccccccccchhHHHHHHHHHHHhhhccc Q lcl|NC_016164. 649 FSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTENFGATNPTYVELVSMESKVAADNAD 728 (836) Q Consensus 649 ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~aa~~~t~~~l~~a~~~l~~~~~~ 728 (836) ||++||.|+.++++++|.+.|+++++++++.+|++|+|++..+.++.........+...+..+|++|.+++.++...+. T Consensus 211 iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~- 289 (415) T protein:vir:79 211 ISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNY- 289 (415) T ss_pred hhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccccccccccccccccccccchhHHHHHHHhhhhhcc- Confidence 9999999999999999999999999999999999999986544444444444444555566889999999999977653 Q ss_pred cCccEEEecHHHHHHHHHHhhccCccccccC----CCCeecceeeEeeCccccce-----EEEEehhc-eEEEeecceEE Q lcl|NC_016164. 729 IGAMSYLTNSTLYGGFKTTEKATSTAQFVLE----PGGTVNGYNVVRSNQVANGD-----VFFGVWNQ-MIMGMWGALDI 798 (836) Q Consensus 729 ~~~~~~vmnp~~~~~L~~lkd~~g~~~~~~~----~~~~l~G~pVv~s~~~~~~~-----i~~gD~s~-~~i~~~~~l~i 798 (836) .+++|+|||++|..|+.++|++|+|+|.+. .+++|+|+||++++.+|.+. ++||||++ |.++++.++++ T Consensus 290 -~~~~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~v 368 (415) T protein:vir:79 290 -EHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQA 368 (415) T ss_pred -CCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCCceecceeeEEecccccCCCCccEEEEEehhccEEEEeecceEE Confidence 578999999999999999999999987543 35689999999999888543 89999997 66889999999 Q ss_pred EEecccccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 799 QVNPYALDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 799 ~~~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) .++++ .++++.+|+++|+|+++++|+||++++..= T Consensus 369 ~~~~~---~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~ 403 (415) T protein:vir:79 369 SWTDY---MHFGECLMIAVRQDCRILDYKSAIVIEYDD 403 (415) T ss_pred EEecc---ccCceEEEEEEEeccEEeccccEEEEEEec Confidence 98764 556678999999999999999999987433 No 47 >protein:vir:9410 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803388;genbank:gi:29028700;genbank:GeneID:1258136 Probab=100.00 E-value=1.2e-50 Score=294.35 Aligned_cols=390 Identities=10% Similarity=0.029 Sum_probs=236.9 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhh-hhhhHHHHHHHHHHHhhhhhhhHHHHHHHHhhhhhhHHH Q lcl|NC_016164. 412 TIDMEAVRAQAAADERSRVASITSLCREHKADDLAQGLIE-SGASEADAMRSVLSEIAKRPAAQPATPAAPVRSAQPIAA 490 (836) Q Consensus 412 ~~~~e~~~~~~~~~~~~~~~ei~al~~~~~l~e~a~elie-e~~t~~e~~~~~l~~l~~~~~a~~~~~~~~~~~~~~~~~ 490 (836) .+-.++... ...+ ...++.+.... ..++.. +.....+.....++.+..+............+....... T Consensus 1 mk~~~el~~-~l~e---l~~~~~~~~~~------~~~~~~~~~~e~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~~~~~~ 70 (415) T protein:vir:94 1 MKTKEELQS-EISD---IKRQIDLKVKY------ATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSEN 70 (415) T ss_pred CChHHHHHH-HHHH---HHHHHHHHHHH------HHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh Confidence 000000000 0000 00001110000 000000 000000111111111111111111100000000000000 Q ss_pred HhhhhhhhhhhhHHHHHHHhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhcccccccc Q lcl|NC_016164. 491 GGGSADIGLTDKEARSFSFVRAIRAQMMPGDRAAFEAAAFEREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTASAAGD 570 (836) Q Consensus 491 ~~~~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~ 570 (836) ......... ............. ..............+.+.. . .............+++|+ T Consensus 71 ~~~~~~~~~-~~~~~~~~~~~~~------------~~~~~~~~~~~~e~~~~~~---~----~~~~~~~~~~~~~~~~g~ 130 (415) T protein:vir:94 71 NQQSVEVNE-ASTYRNQANINDL------------GISIQNTKVTSQEVRDFTE---Y----LETRNDIQGGSLKTDSGF 130 (415) T ss_pred ccccccccc-hhhHHHHHHHHHH------------HhhhhhhhhhHHHHHHHHH---H----hhhhhhhhhhcccccccc Confidence 000000000 0000000000000 0000000000000000000 0 001111122334456788 Q ss_pred cccchhhHHHHHHHHHhhhhhhhhcceee-ecCCceEEEEEecCCceeeeeccCccccc-ccccceeEEeeeeeeeeeeh Q lcl|NC_016164. 571 LVFTDGRPGSFIELLRNRLALNTLGVTML-TGLQGPVAIPRQTGAATAYWVAEGGDPTE-SQPSVDQVALVAKTLGAYTE 648 (836) Q Consensus 571 ~vvp~~~~~~ii~~l~~~~~l~~l~~~~~-~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~-~~~~~~~it~~~~t~~~~i~ 648 (836) .++|+.+.+.|++.+++.+++++++..+. +...+.+.+++.++.+.+.|++|++++++ +.++|+++++.+++++++++ T Consensus 131 ~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~~~i~~~~~k~~~~~~ 210 (415) T protein:vir:94 131 VVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFR 210 (415) T ss_pred ccCcHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEeecCCccceeccccccccccccccceeeEeeheeeeeech Confidence 89999999999999999999999855433 23445777888888899999999999996 46899999999999999999 Q ss_pred hHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccccccccccchhHHHHHHHHHHHhhhccc Q lcl|NC_016164. 649 FSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTENFGATNPTYVELVSMESKVAADNAD 728 (836) Q Consensus 649 ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~aa~~~t~~~l~~a~~~l~~~~~~ 728 (836) ||+++|.|+.++++++|.++|+++++++++.+|++|+|++..+.++.............+..++++|.++++++...+. T Consensus 211 is~ell~ds~~~~~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~- 289 (415) T protein:vir:94 211 ISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNY- 289 (415) T ss_pred hhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccccccccccccccccccccchHHHHHHHHhhhhhcc- Confidence 9999999999999999999999999999999999999986554444444444444445556789999999999877654 Q ss_pred cCccEEEecHHHHHHHHHHhhccCccccccC----CCCeecceeeEeeCccccce-----EEEEehhc-eEEEeecceEE Q lcl|NC_016164. 729 IGAMSYLTNSTLYGGFKTTEKATSTAQFVLE----PGGTVNGYNVVRSNQVANGD-----VFFGVWNQ-MIMGMWGALDI 798 (836) Q Consensus 729 ~~~~~~vmnp~~~~~L~~lkd~~g~~~~~~~----~~~~l~G~pVv~s~~~~~~~-----i~~gD~s~-~~i~~~~~l~i 798 (836) .+++|+|||++|..|+.++|++|+|+|.+. .+++|+|+||++++++|.+. ++||||++ |.++++.++++ T Consensus 290 -~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~v 368 (415) T protein:vir:94 290 -EHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQA 368 (415) T ss_pred -CCCEEEEcHHHHHHHHHhhccCCCeeeccCcCCCCCceecceeeEEecccccCCCCccEEEEEehhccEEEEeecceEE Confidence 478999999999999999999999987433 35689999999999988654 89999997 67888999999 Q ss_pred EEecccccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 799 QVNPYALDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 799 ~~~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) .+++ |.++++.+|++.|+|+++++|+||++++..= T Consensus 369 ~~~~---~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~ 403 (415) T protein:vir:94 369 SWTD---YMHFGECLMIAVRQDCRILDYKSAIVIEYDD 403 (415) T ss_pred EEec---cccCceEEEEEEEeccEEeccccEEEEEEec Confidence 8876 4667789999999999999999999988433 No 48 >protein:vir:96223 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239571;genbank:gi:66395304;genbank:GeneID:5132771 Probab=100.00 E-value=2.3e-52 Score=303.70 Aligned_cols=298 Identities=13% Similarity=0.156 Sum_probs=241.0 Q ss_pred hhhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcc Q lcl|NC_016164. 517 MMPGDRAAFEAAAFEREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGV 596 (836) Q Consensus 517 ~~~~~~~~~~~~~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~ 596 (836) ....+..... .+.+.... ..+ . ...+ ......+.++.++|+.+.+.|++.+++.+++++++ T Consensus 1 ~~~~~~~~~~--------~~~f~~~~---~~~-----~-~~~a-~~~~~~~~~~~lip~~~~~~ii~~~~~~s~l~~l~- 61 (324) T protein:vir:96 1 MEQTQKLKLN--------LQHFASNN---VKP-----Q-VFNP-DNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLG- 61 (324) T ss_pred CCcchhhhHH--------HHHHHHhh---hhh-----h-hccc-ccccccCCCcceechhHHHHHHHHHHhhchhhhhc- Confidence 0000000000 00010000 000 0 0011 11222234455778889999999999999999985 Q ss_pred eeeecCCceEEEEEecCCceeeeeccCcccccccccceeEEeeeeeeeeeehhHHHHHhcchhHHHHHHHHHHHHHHHHH Q lcl|NC_016164. 597 TMLTGLQGPVAIPRQTGAATAYWVAEGGDPTESQPSVDQVALVAKTLGAYTEFSRRLMLQSSIDVEQMVRTELATVIALE 676 (836) Q Consensus 597 ~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~it~~~~t~~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~ 676 (836) ++++..++.+++|+.++.+.+.|++|++++++++++|+++++.+++++++++||+|+|.++..+++++|.+.|+++++++ T Consensus 62 ~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~l~~aia~~ 141 (324) T protein:vir:96 62 KYEPMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKK 141 (324) T ss_pred ceeeccCCceEEEEEecCcceeeecCCccccccccceeEEEEEeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHH Confidence 45667777899999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHhhcCCcccccccccccccccccccccchhHHHHHHHHHHHhhhccccCccEEEecHHHHHHHHHHhhccCcccc Q lcl|NC_016164. 677 IDRAALYGLGSNSQPEGLKFVTGINTENFGATNPTYVELVSMESKVAADNADIGAMSYLTNSTLYGGFKTTEKATSTAQF 756 (836) Q Consensus 677 ~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~aa~~~t~~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~~g~~~~ 756 (836) +|.++|+|+|++..|.|+++...... +...+.+++++|.+++.++..++. .+++|+|||++|..|+.++|++|++.| T Consensus 142 ~d~~~l~G~g~~~~~~~~~~~~~~~~-~~~~~~~~~~~i~~~~~~i~~~~~--~~~~~i~n~~~~~~L~~lkd~~G~~~~ 218 (324) T protein:vir:96 142 FDEAGILNQGNNPFGKSIAQSIKKTN-KVIKGDFTQDNIIDLEALLEDDEL--EANAFISKTQNRSLLRKIVDPETKERI 218 (324) T ss_pred HHHHhhhcCCCCCcCccccccccccc-eecccccchHHHHHHHHhhhhccC--CCCEEEEcHHHHHHHHHhhCCCCCeee Confidence 99999999999989999887655433 333456789999999999987754 466899999999999999999999999 Q ss_pred ccCCCCeecceeeEeeCcc--ccceEEEEehhceEEEeecceEEEEeccc--------------ccccCcEEEEEEEEec Q lcl|NC_016164. 757 VLEPGGTVNGYNVVRSNQV--ANGDVFFGVWNQMIMGMWGALDIQVNPYA--------------LDKSGSVRVTALQDVD 820 (836) Q Consensus 757 ~~~~~~~l~G~pVv~s~~~--~~~~i~~gD~s~~~i~~~~~l~i~~~~~~--------------~~~~~~~~~r~~~r~d 820 (836) ....+++|+|+||++++.. +.+.++||||+++.+++++++++..+++. .|.+|++.||+++|+| T Consensus 219 ~~~~~~~l~G~PV~~~~~~~~~~~~~~~gd~s~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~n~v~~r~~~r~d 298 (324) T protein:vir:96 219 YDRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVA 298 (324) T ss_pred cCCCCCcccceeeEeecCCCCCcceEEEEecceEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEec Confidence 8888899999999997764 45679999999999999999999887763 4889999999999999 Q ss_pred cEEEcccceEEEeecC Q lcl|NC_016164. 821 VAVRHPEAFCRGNDNL 836 (836) Q Consensus 821 ~~v~~p~Af~~l~~A~ 836 (836) +++.+|+||++++.|. T Consensus 299 ~~v~~~~a~~~l~~a~ 314 (324) T protein:vir:96 299 LHIADDKAFAKLVPAD 314 (324) T ss_pred cEEecccceEEEeccc Confidence 9999999999999998 No 49 >protein:vir:103955 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873992;genbank:gi:118430767;genbank:GeneID:4525449 Probab=100.00 E-value=4e-52 Score=302.35 Aligned_cols=298 Identities=13% Similarity=0.153 Sum_probs=242.4 Q ss_pred hhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEE Q lcl|NC_016164. 529 AFEREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAI 608 (836) Q Consensus 529 ~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~ 608 (836) +...+..+.-++++..... ..... .+ ......+.++.++|..+.+.|++.+++.+++++++ ++++..++.+.+ T Consensus 1 ~~~~~~~~~~~~~f~~~~~----~~~~~-~a-~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~~~-~~~~~~~~~~~~ 73 (324) T protein:vir:10 1 MEQTQKLKLNLQHFASNNV----KPQVF-NP-DNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLG-KYEPMEGTEKKF 73 (324) T ss_pred CCCchHHHHHHHHHHHHhh----cccee-cc-cceeccCCCcceechhHHHHHHHHHHhhchhhhhc-ceeeccCCceEE Confidence 0000000100111100000 00000 11 12222334445778889999999999999999985 556677788999 Q ss_pred EEecCCceeeeeccCcccccccccceeEEeeeeeeeeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCc Q lcl|NC_016164. 609 PRQTGAATAYWVAEGGDPTESQPSVDQVALVAKTLGAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSN 688 (836) Q Consensus 609 p~~~~~~~a~~v~Eg~~~~~~~~~~~~it~~~~t~~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~ 688 (836) |+.++.+.+.|++|++++++++++|+++++.+++++++++||+|+|.|+..+++++|.+.|+++++++++.++|+|+|++ T Consensus 74 p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~ai~~~~d~a~l~G~g~~ 153 (324) T protein:vir:10 74 TFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNN 153 (324) T ss_pred EEEeCCcceeEeccCccccccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhhcCCCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccccccccccccccchhHHHHHHHHHHHhhhccccCccEEEecHHHHHHHHHHhhccCccccccCCCCeeccee Q lcl|NC_016164. 689 SQPEGLKFVTGINTENFGATNPTYVELVSMESKVAADNADIGAMSYLTNSTLYGGFKTTEKATSTAQFVLEPGGTVNGYN 768 (836) Q Consensus 689 ~~p~Gi~~~~~~~~~t~aa~~~t~~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~~g~~~~~~~~~~~l~G~p 768 (836) ..|.|+++.....+ +...+.+++++|.+++.++..++. .+++|+|||.+|..|++++|++|++.|....+++|+|+| T Consensus 154 ~~~~~i~~~~~~~~-~~~~~~~t~~~i~~~~~~l~~~~~--~~~~~v~n~~~~~~L~~l~d~~g~~~~~~~~~~~l~G~P 230 (324) T protein:vir:10 154 PFGKSIAQSIEKTN-KVIKGDFTQDNIIDLEALLEDDEL--EANAFISKTQNRSLLRKIVDPETKERIYDRNSDTLDGLP 230 (324) T ss_pred ccCccccccccccc-eeccccCCHHHHHHHHHhhhhccC--CCCEEEEcHHHHHHHHHhhccCCceeecCCCCcccccee Confidence 89999987654433 333456889999999999988764 456899999999999999999999999888889999999 Q ss_pred eEeeCccc--cceEEEEehhceEEEeecceEEEEeccc--------------ccccCcEEEEEEEEeccEEEcccceEEE Q lcl|NC_016164. 769 VVRSNQVA--NGDVFFGVWNQMIMGMWGALDIQVNPYA--------------LDKSGSVRVTALQDVDVAVRHPEAFCRG 832 (836) Q Consensus 769 Vv~s~~~~--~~~i~~gD~s~~~i~~~~~l~i~~~~~~--------------~~~~~~~~~r~~~r~d~~v~~p~Af~~l 832 (836) |++++.++ ++.+++|||+++.+++++++++..+++. .|++|++.||+++|+|+++.+|+||+++ T Consensus 231 V~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~A~~~l 310 (324) T protein:vir:10 231 VVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKL 310 (324) T ss_pred EEeecCCCCCcceEEEEecccEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecccceEEE Confidence 99988755 5569999999999999999999887763 3889999999999999999999999999 Q ss_pred eecC Q lcl|NC_016164. 833 NDNL 836 (836) Q Consensus 833 ~~A~ 836 (836) +.|- T Consensus 311 ~~a~ 314 (324) T protein:vir:10 311 VPAD 314 (324) T ss_pred Eecc Confidence 9888 No 50 >protein:vir:80684 Length: 315 # NCBI annotation: gp6 # Family: family:all:966 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285582;genbank:gi:148727088;genbank:GeneID:5247055 Probab=100.00 E-value=3.3e-52 Score=302.83 Aligned_cols=274 Identities=16% Similarity=0.171 Sum_probs=229.6 Q ss_pred hhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEecCCceeeeeccCcccccccccceeEEeee Q lcl|NC_016164. 561 VVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQTGAATAYWVAEGGDPTESQPSVDQVALVA 640 (836) Q Consensus 561 ~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~it~~~ 640 (836) +..+..+.|++++|+.+...|++.+++.+++++++ ++++...+.+++|+.++.+.++|++|++++++++++|+++++.+ T Consensus 1 Ma~~~~~~gg~~vP~~~~~~ii~~l~~~s~i~~l~-~~i~~~~~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~f~~v~l~~ 79 (315) T protein:vir:80 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLS-PEQPTIFGPVKGAVFSGVPRAKIVGEGEVKPSASVDVSAFTAQP 79 (315) T ss_pred CCCCcCCcCceEcchHHHHHHHHHHHhhchhhhhc-ceeecCCCceEEEEEeCCcceEEeeCCccccccccceeeeEeee Confidence 44556667888999999999999999999999995 45667778899999999999999999999999999999999999 Q ss_pred eeeeeeehhHHHHHhcchhH----HHHHHHHHHHHHHHHHHHHHHHhhcCC--cccccccccccccccccccccchhHHH Q lcl|NC_016164. 641 KTLGAYTEFSRRLMLQSSID----VEQMVRTELATVIALEIDRAALYGLGS--NSQPEGLKFVTGINTENFGATNPTYVE 714 (836) Q Consensus 641 ~t~~~~i~ISrelL~ds~~~----l~~~i~~~l~~a~a~~~d~~il~G~Gt--~~~p~Gi~~~~~~~~~t~aa~~~t~~~ 714 (836) ++++++++||+|+|.++..+ ++++|.+.|++++++++|.++|+|++. +..+.|+.+...........+...+++ T Consensus 80 ~kl~~~~~iS~ell~~s~~~~~~~l~~~i~~~la~ai~~~~d~a~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d 159 (315) T protein:vir:80 80 IKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKAASAVHTSLNKTKNIVDATDSATAD 159 (315) T ss_pred eeEEeeehhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHhhheeeccCCCCCccccccccccccccceeeccccchHH Confidence 99999999999999887665 779999999999999999999999763 334555554433333333344456889 Q ss_pred HHHHHHHHhhhccccCccEEEecHHHHHHHHHHhhccCccc---ccc-----CCCCeecceeeEeeCccccc-------- Q lcl|NC_016164. 715 LVSMESKVAADNADIGAMSYLTNSTLYGGFKTTEKATSTAQ---FVL-----EPGGTVNGYNVVRSNQVANG-------- 778 (836) Q Consensus 715 l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~~g~~~---~~~-----~~~~~l~G~pVv~s~~~~~~-------- 778 (836) +.+++.++...+.. .+.+|+|||+++..|+++++.+|++. +++ +.+++|+|+||+++++||.+ T Consensus 160 ~~~~~~~~~~~~~~-~~~~~imn~~~~~~L~~l~~~~g~~~~g~~~~~~~~~g~~~tl~G~PV~~~~~~~~~~~~~~~~~ 238 (315) T protein:vir:80 160 LVKAVGLIAGAGLQ-VPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMSPASG 238 (315) T ss_pred HHHHHHHHhhccCc-cceEEEEcHHHHHHHHHHhhccCCcccccccccccccCCCceecceeeEecCcCCcccccccccc Confidence 99999888765433 34579999999999999998877643 222 23468999999999999853 Q ss_pred -eEEEEehhceEEEeecceEEEEeccc--------ccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 779 -DVFFGVWNQMIMGMWGALDIQVNPYA--------LDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 779 -~i~~gD~s~~~i~~~~~l~i~~~~~~--------~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) .++||||+++.++.++++++.++++. .|++|++.||++.|+|+++.+|+||++++.+. T Consensus 239 ~~~~~GDfs~~~~g~~~~~~i~i~~~~~~~~~~~~~~~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~ 305 (315) T protein:vir:80 239 VKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKA 305 (315) T ss_pred cEEEEeecccEEEEEecCeeEEEeccccccCcccchhhcCcEEEEEEEEecceeecccceEEEeecc Confidence 47899999999999999999887763 48999999999999999999999999999988 No 51 >protein:vir:1268 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690760;genbank:gi:22855000;genbank:GeneID:955203 Probab=100.00 E-value=3e-50 Score=292.07 Aligned_cols=378 Identities=13% Similarity=0.084 Sum_probs=230.7 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhHHHHHHHHhhhhhh Q lcl|NC_016164. 408 MDSSTIDMEAVRAQAAADERSRVASITSLCREHKADDLAQGLIESGASEADAMRSVLSEIAKRPAAQPATPAAPVRSAQP 487 (836) Q Consensus 408 ~~~~~~~~e~~~~~~~~~~~~~~~ei~al~~~~~l~e~a~eliee~~t~~e~~~~~l~~l~~~~~a~~~~~~~~~~~~~~ 487 (836) +++..... ......+......+++++......++ .+++..+...+ ...++.+.+............... .. T Consensus 1 ~~~~m~k~---l~el~~~~~~~~~~~~~~~~~~~~ee-~~~~~~e~~~l----~~~i~~~~~~~~~~~~~~~~~~~~-~~ 71 (397) T protein:vir:12 1 MPMQMSKK---EIALRQQFTEKKQQADKALQEGNTDE-ARALLDEVKQL----KNQIELMTEGRSLDVPDLPGGVNF-VP 71 (397) T ss_pred CCCcHHHH---HHHHHHHHHHHHHHHHHHhhhhhHHH-HHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHhhh-hh Confidence 11111000 00000011111111111111111100 00111111110 001111100000000000000000 00 Q ss_pred HHHHhhhhhhhhhhhHHHHHHHhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHh-hhhhhh-hhhhhhhhhhhhhhccc Q lcl|NC_016164. 488 IAAGGGSADIGLTDKEARSFSFVRAIRAQMMPGDRAAFEAAAFEREVSEATAQRM-GVTPRG-ILAPNDVLHRDLVVDTA 565 (836) Q Consensus 488 ~~~~~~~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~~~~~a~~~~~~~-g~~~~g-~~~~~~~~~~a~~~~~~ 565 (836) .... ...... . ..........+...++.+.. +..... ...............++ T Consensus 72 ~~~~--------------~~~~~~------~----~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~ 127 (397) T protein:vir:12 72 EQER--------------NPEGQR------S----QGQGNEERQQQYSKAFLKGLRGKRLTDEERDLLDSPEFRAMSGIN 127 (397) T ss_pred hhhh--------------hhcccc------c----ccchhhHHHHHHHHHHHHHHhccCCcHHHHHHHhhhhhhhccccc Confidence 0000 000000 0 00000000001111111110 000000 00000111122334455 Q ss_pred ccccccccchhhHHHHHHHHHhhhhhhhhccee-eecCCceEEEEEecCCceeeeeccCccccc-ccccceeEEeeeeee Q lcl|NC_016164. 566 SAAGDLVFTDGRPGSFIELLRNRLALNTLGVTM-LTGLQGPVAIPRQTGAATAYWVAEGGDPTE-SQPSVDQVALVAKTL 643 (836) Q Consensus 566 ~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~-~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~-~~~~~~~it~~~~t~ 643 (836) .+.|+.++|+.+...|++.+++.+++++++... ++...+.+.+++.++.+.+.|++||+++++ +.++|+++++.++++ T Consensus 128 ~~~gg~lvP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~~v~~~~~k~ 207 (397) T protein:vir:12 128 DEDGGILIPEDIGRQIHEFKRQFEPLEQYVTVEPVTTRSGTRLLEKNADMVPFSPVEELGNLPEIDQPRFTKVSYSIIDY 207 (397) T ss_pred cccCcccCchhHHHHHHHhhhhhhhHHhhcceeeccCCceeEEEEEecCCcceeeecccccccccccccceeEEeeheee Confidence 667888899999999999999999999986544 234467888999999999999999999996 568999999999999 Q ss_pred eeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccccccccccchhHHHHHHHHH-HH Q lcl|NC_016164. 644 GAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTENFGATNPTYVELVSMES-KV 722 (836) Q Consensus 644 ~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~aa~~~t~~~l~~a~~-~l 722 (836) +++++||+++|.|+.++++++|...|+++++++++.+|++|+|+ ++|.|+ .+++++.+++. .+ T Consensus 208 ~~~~~is~e~l~ds~~~l~~~i~~~l~~~~~~~~d~~il~G~g~-~~~~g~---------------~~~~~i~~~~~~~l 271 (397) T protein:vir:12 208 GGIMTLSNSMLNDSDQAIMTYVAKWFAKKSVVTRNNLILAAIAS-LKKVDI---------------DGLDGIKKALNVTL 271 (397) T ss_pred EeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHhcccc-cccccc---------------ccHHHHHHHHhhcc Confidence 99999999999999999999999999999999999999999997 456554 34677887664 56 Q ss_pred hhhccccCccEEEecHHHHHHHHHHhhccCcccccc----CCCCeecceeeEeeCcc-c-----cceEEEEehhc-eEEE Q lcl|NC_016164. 723 AADNADIGAMSYLTNSTLYGGFKTTEKATSTAQFVL----EPGGTVNGYNVVRSNQV-A-----NGDVFFGVWNQ-MIMG 791 (836) Q Consensus 723 ~~~~~~~~~~~~vmnp~~~~~L~~lkd~~g~~~~~~----~~~~~l~G~pVv~s~~~-~-----~~~i~~gD~s~-~~i~ 791 (836) ...+ ..+++|+|||.+|.+|+.++|++|+|+|.. +.+++|+|+||++++++ + ...++||||++ |.++ T Consensus 272 ~~~~--~~~a~~~~n~~~~~~L~~lkd~~G~~l~~~~~~~g~~~~l~G~pv~~~~~~~~~~~~~~~~~~~gd~~~~~~~~ 349 (397) T protein:vir:12 272 DPMV--APGSIVLTNQDGYDWLDTLKDGTGRYLLQPDPTNPTKKLLDGRPVVPFTNRVLKTQKGKAPLIIGNLKEAIVLF 349 (397) T ss_pred chhh--hCCCEEEEcHHHHHHHHHhhccCCceeecccccCCCCccccceeeEEecccccccCCCccEEEEEehhceEEEE Confidence 5444 467899999999999999999999988743 34568999999877653 3 22389999997 5678 Q ss_pred eecceEEEEecc--cccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 792 MWGALDIQVNPY--ALDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 792 ~~~~l~i~~~~~--~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) ++.++.+.++++ ..|.+|++.||++.|+|+++++|+||++++.++ T Consensus 350 ~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~t~ 396 (397) T protein:vir:12 350 DREQQSIASTDTGAGAFETNSTKVRGIEREDVRKWDEDAVVFGQITV 396 (397) T ss_pred eecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEEee Confidence 899999888764 568999999999999999999999999999999 No 52 >protein:vir:9574 Length: 300 # NCBI annotation: gp40 # Family: family:all:966 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862879;genbank:gi:32469471;genbank:GeneID:1461316 Probab=100.00 E-value=1.3e-51 Score=299.50 Aligned_cols=272 Identities=12% Similarity=0.100 Sum_probs=228.8 Q ss_pred hhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEecCCceeeeeccCcccccccccceeEEeee Q lcl|NC_016164. 561 VVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQTGAATAYWVAEGGDPTESQPSVDQVALVA 640 (836) Q Consensus 561 ~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~it~~~ 640 (836) +..++.+ ++.++|+.+...|++.+++.+++++++. +.+...+.+.+|+.++.+.++|++|++++++++++|+++++++ T Consensus 1 ma~~t~~-~G~lip~~~~~~ii~~l~~~s~i~~l~~-~~~~~~~~~~~p~~~~~~~a~wv~Eg~~~~~s~~~f~~v~l~~ 78 (300) T protein:vir:95 1 MSEAQLS-KGNLFNPELVTKVINKVKGHSSIAKLSP-QKPIPFNGQREFVFDFDSDIDIVAENGKKTHGGVSLDPVTIVP 78 (300) T ss_pred CcccccC-CcceechhhHHHHHHHHHhhhhhhhhcc-eeeccCCceEEEEEecCcceEEeeCCcccccccccceeeEeee Confidence 3333334 4456677789999999999999999854 5566667899999999999999999999999999999999999 Q ss_pred eeeeeeehhHHHHHh---cchhHHHHHHHHHHHHHHHHHHHHHHHhhc----CCcccccccccccccccccc-cccchhH Q lcl|NC_016164. 641 KTLGAYTEFSRRLML---QSSIDVEQMVRTELATVIALEIDRAALYGL----GSNSQPEGLKFVTGINTENF-GATNPTY 712 (836) Q Consensus 641 ~t~~~~i~ISrelL~---ds~~~l~~~i~~~l~~a~a~~~d~~il~G~----Gt~~~p~Gi~~~~~~~~~t~-aa~~~t~ 712 (836) ++++++++||+|+|. ++.++++++|.++|+++++++++.++|+|+ |++..+.|....++....+. +.+..++ T Consensus 79 ~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~l~~aia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (300) T protein:vir:95 79 LKVEYGARVSDEFLHASEEAKVDMLTDFVEGFSKKLARGLDIMSIHGINPRTKQASTIIGDNCFDKKVTQTVPFKDTNPD 158 (300) T ss_pred EEEEEeehhhHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCCCCCcccccccccccccceeecccccchH Confidence 999999999999994 567899999999999999999999999994 44455566655554444333 3355678 Q ss_pred HHHHHHHHHHhhhccccCccEEEecHHHHHHHHHHhhccCcccccc----CCCCeecceeeEeeCccccce------EEE Q lcl|NC_016164. 713 VELVSMESKVAADNADIGAMSYLTNSTLYGGFKTTEKATSTAQFVL----EPGGTVNGYNVVRSNQVANGD------VFF 782 (836) Q Consensus 713 ~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~~g~~~~~~----~~~~~l~G~pVv~s~~~~~~~------i~~ 782 (836) ++|.+++..+...+. .+++|+|||+++.+|+++||++|+++|.. +.+++|+|+||++++++|.+. +++ T Consensus 159 ~~i~~~~~~~~~~~~--~~~~~vmn~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~~~~~ 236 (300) T protein:vir:95 159 ESMEDAVGMIDGSER--DITGAILDPIFTTALSKMKNAEGGKLYPELAWGGVPDAINGLAVDKNRTVSYSQTDPKNTAIV 236 (300) T ss_pred HHHHHHHHHhhhcCC--CccEEEECHHHHHHHHHhhccCCCeeccCccccCCCceecceeeEEecCCCCCCCCCccEEEE Confidence 999999998877654 45689999999999999999999998743 346799999999999998543 788 Q ss_pred Eehhce-EEEeecceEEEEeccc--------ccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 783 GVWNQM-IMGMWGALDIQVNPYA--------LDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 783 gD~s~~-~i~~~~~l~i~~~~~~--------~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) |||+++ .++.+.++++.++++. +|++|++.||+++|+|+++.+|+||++++.+= T Consensus 237 GDf~~~~~~~~~~~~~~~v~~~~~~d~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~ 299 (300) T protein:vir:95 237 GDFETMFKWGYAKEVPMEIIKYGDPDNSGRDLKGYNQIYIRCEAYIGWGIMDAASFARIVKTG 299 (300) T ss_pred eeccceEEEEEecccEEEEeeccCCCCcchhhhhcCcEEEEEEEeecceeecccceEEEecCC Confidence 999975 5888999999988764 49999999999999999999999999999888 No 53 >protein:vir:80128 Length: 466 # NCBI annotation: Phage capsid protein # Family: family:all:635 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425603;genbank:gi:155042936;genbank:GeneID:5469556 Probab=100.00 E-value=1.1e-49 Score=289.10 Aligned_cols=417 Identities=13% Similarity=0.091 Sum_probs=228.2 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhh---hhhhhhhhhh Q lcl|NC_016164. 378 GQGRKATSSSGPPGAAAATVAPLSHNDNNHMDSSTIDMEAVRAQAAADERSRVASITSLCREHKADD---LAQGLIESGA 454 (836) Q Consensus 378 ~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~ei~al~~~~~l~e---~a~eliee~~ 454 (836) ...+... .. ........+. .+..+... ....+..+..........++ ..++.++... T Consensus 1 ~~~~~~~-----l~---------~~~~~~~~~l--~el~e~~~----~l~k~~~el~~~l~ea~~~ee~~~~ee~i~~l~ 60 (466) T protein:vir:80 1 MALRQLM-----LA---------KKIEQRKAAL--AELLEQEK----ALQKRSEELEAAIDEANTDEEIAVVEDEINKLE 60 (466) T ss_pred CchHHHH-----HH---------HHHHHHHHHH--HHHHHHHH----HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHH Confidence 0000000 00 0000000000 00000000 00000000000000000000 0000000000 Q ss_pred hHHHHHHHHHHHhhhhhhhHHHHHHHHhhhhhhHHHHhhhhhhhhhhhHHHHHHHhhhhhhhhhhhhhhhhhhhhhhHHH Q lcl|NC_016164. 455 SEADAMRSVLSEIAKRPAAQPATPAAPVRSAQPIAAGGGSADIGLTDKEARSFSFVRAIRAQMMPGDRAAFEAAAFEREV 534 (836) Q Consensus 455 t~~e~~~~~l~~l~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~~~~~ 534 (836) .......+....+..+.............. ....... ............... ......... .... T Consensus 61 ~~~~el~e~~~~l~~ei~~le~el~e~~~~----~~~~~~~------~~~~~~~~~~~~~~~---~~~~~~~~~--~~~~ 125 (466) T protein:vir:80 61 GEKTELEEKKSKLEGEIKELENELEQLNNK----EPKNNSE------PAQVSGARTQQFVGG---ETRMKGFFR--NMPY 125 (466) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----hhccCch------hHHHHhhhhhHHhhH---HHHHHHHHH--hhhh Confidence 000000000000000000000000000000 0000000 000000000000000 000000000 0000 Q ss_pred HHHHHHHhhhhhhhhhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEecCC Q lcl|NC_016164. 535 SEATAQRMGVTPRGILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQTGA 614 (836) Q Consensus 535 a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~~~ 614 (836) ....+........... ... ..........+++++++|+.+...|++.+++.+++++++. +.+ .++..++++.+.. T Consensus 126 ~~~~~~~~~~~~~~~~--~~~-~~~~~~~~~~~g~~~~vP~~~~~~i~~~l~~~~~l~~~~~-v~~-~~g~~~~~~~~~~ 200 (466) T protein:vir:80 126 EQRAALIARSEVKEFL--AQV-RTLAQQKRAVSGAELTIPDVMLELLRDNMHRYSKLISKVR-LRP-LKGTARQNIAGAI 200 (466) T ss_pred hhHHHHHHHHHHHHHH--HHH-HHHhhhhhhhccccccccHHHHHHHHHhhhhhhhhhhhee-eee-cCceeEeeeecCC Confidence 0000000000000000 000 1111222234566788999999999999999999999854 333 3467788888888 Q ss_pred ceeeeeccCcccccccccceeEEeeeeeeeeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCccccccc Q lcl|NC_016164. 615 ATAYWVAEGGDPTESQPSVDQVALVAKTLGAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGL 694 (836) Q Consensus 615 ~~a~~v~Eg~~~~~~~~~~~~it~~~~t~~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi 694 (836) +.+.|++|++++++++++|+++++.+++++++++||++||.|+.++++++|..+|+++++.+++.+||+|+|+ ++|.|| T Consensus 201 ~~a~wv~E~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~ail~G~G~-~~P~Gi 279 (466) T protein:vir:80 201 PEGVWTEAVANLNELSLSFSQIEVDGYKVGGFIPIPNSTLEDSDLNLADEILDAIGQAIGFALDKAILYGTGT-KMPVGI 279 (466) T ss_pred cceeecccccccccccccccceeecceeeeeehhhhHHHHhcchHHHHHHHHHHHHHHHHHHHhhheeeccCC-CCccee Confidence 9999999999999999999999999999999999999999999999999999999999999999999999997 579999 Q ss_pred ccccccccccccc-------cchhH-----------------HHHHHHHHHHhhhccccCccEEEecHHHHHHHHHHhhc Q lcl|NC_016164. 695 KFVTGINTENFGA-------TNPTY-----------------VELVSMESKVAADNADIGAMSYLTNSTLYGGFKTTEKA 750 (836) Q Consensus 695 ~~~~~~~~~t~aa-------~~~t~-----------------~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~ 750 (836) ++..+..+..... ..++. .++..++..+... ...+..+|+||+.++..|..++.. T Consensus 280 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~w~~~~~~~~~l~~~~~~ 358 (466) T protein:vir:80 280 VTRLAQTTQPPNWGTKAPAWTNLSTTNLLKIDPTGKSAEEFFSELVLKLSKARAN-YSNGMKFWAMSSNTHAVLMSKAIT 358 (466) T ss_pred eecccccccccccccccccccccchhhhhhhhhhccchhhHHHHHHHHHHhhhcc-ccCCceeEEecchhHHHhhccccc Confidence 9765433221110 11111 1222222222322 334567899999999999888743 Q ss_pred -cCccccccCCC--CeecceeeEeeCccccceEEEEehhceEEEeecceEEEEecccccccCcEEEEEEEEeccEEEccc Q lcl|NC_016164. 751 -TSTAQFVLEPG--GTVNGYNVVRSNQVANGDVFFGVWNQMIMGMWGALDIQVNPYALDKSGSVRVTALQDVDVAVRHPE 827 (836) Q Consensus 751 -~g~~~~~~~~~--~~l~G~pVv~s~~~~~~~i~~gD~s~~~i~~~~~l~i~~~~~~~~~~~~~~~r~~~r~d~~v~~p~ 827 (836) ++++.+++.++ ..|+|+||+++++||.+.++||||+.|.++++.++++..+++..|.+|++.||+.+|+|+++++|+ T Consensus 359 ~~~~g~~~~~~~~~~~i~G~pvv~s~~~~~~~~~~g~~~~y~i~~r~~~~i~~~~~~~f~~d~~~~r~~~r~dg~~~~~~ 438 (466) T protein:vir:80 359 FNSAGALVASLNNTMPIVGGDIVILDFIPDNDIIGGYGSLYLLAERADIKLAQSEHVRFIEDQTVFKGTARYDGKPVFGE 438 (466) T ss_pred ccCCccccccCCCcccccccceeecCccCccceeeeccccEEEEeecceEEEechhhhhhcCcEEEEEEEEEccEEeccC Confidence 33444544433 358999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceEEEeecC Q lcl|NC_016164. 828 AFCRGNDNL 836 (836) Q Consensus 828 Af~~l~~A~ 836 (836) ||++++.+= T Consensus 439 afv~~~~~~ 447 (466) T protein:vir:80 439 GFVAVNIAN 447 (466) T ss_pred ceEEEEecC Confidence 999997443 No 54 >protein:vir:95763 Length: 297 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950590;genbank:gi:119953785;genbank:GeneID:5076833 Probab=100.00 E-value=2.8e-51 Score=297.70 Aligned_cols=279 Identities=15% Similarity=0.186 Sum_probs=237.9 Q ss_pred hhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEecCCceeeeeccCccccccccc Q lcl|NC_016164. 553 NDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQTGAATAYWVAEGGDPTESQPS 632 (836) Q Consensus 553 ~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~ 632 (836) +..........+.++.++.++|+.+.+.|++.+++.+++++++.+..........+|+..+.+.+.|++|++++++++++ T Consensus 1 m~~~~~~~~~~~~t~~~~~lvP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~ 80 (297) T protein:vir:95 1 MTVQTFNPENVLVSQKKDGTLHKEFTDIIMKEVAQNSLVMQLGQYQEMEGEQEKTVYVQTDGISAYWVNETEKIKTDKPE 80 (297) T ss_pred CCccccccccccccCCCcceechhHHHHHHHHHHhhchhhhhcceeecCCCccEEEEEEcCCceeEEeecCccccccccc Confidence 11111112223334556668899999999999999999999976655555567889999999999999999999999999 Q ss_pred ceeEEeeeeeeeeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccccccccccchhH Q lcl|NC_016164. 633 VDQVALVAKTLGAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTENFGATNPTY 712 (836) Q Consensus 633 ~~~it~~~~t~~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~aa~~~t~ 712 (836) |+++++++++++++++||+|+|.|+..+++++|.+.|+++++++++.++|+|+|+ +.|.|+++...... +..++.+++ T Consensus 81 f~~v~l~~~k~~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~G~g~-~~~~gi~~~~~~~~-~~~~~~~t~ 158 (297) T protein:vir:95 81 VVPVTLKAHKLGIILVTSREALNYTWKKFFEDMKPQIVEAFYKKIDEAGLLGHDT-PFANSVAKAAKDAN-KVIGGPINY 158 (297) T ss_pred eeEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhcccCC-cccccccccccccc-eecccccCH Confidence 9999999999999999999999999999999999999999999999999999996 57899987655433 344566899 Q ss_pred HHHHHHHHHHhhhccccCccEEEecHHHHHHHHHHhhccCccccccCCCCeecceeeEeeCc--cccceEEEEehhceEE Q lcl|NC_016164. 713 VELVSMESKVAADNADIGAMSYLTNSTLYGGFKTTEKATSTAQFVLEPGGTVNGYNVVRSNQ--VANGDVFFGVWNQMIM 790 (836) Q Consensus 713 ~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~~g~~~~~~~~~~~l~G~pVv~s~~--~~~~~i~~gD~s~~~i 790 (836) ++|.+++.++..++. .+++|+|||+++..|++++|.+|+++|. ..+++|+|+||+.+.. ++++.++||||+++.+ T Consensus 159 ~~i~~~~~~l~~~~~--~~~~~v~~~~~~~~L~~l~d~~G~~i~~-~~~~~l~G~Pv~~~~~~~~~~~~~~~gd~s~~~~ 235 (297) T protein:vir:95 159 DNILKLQDALYDADV--EPNAFVSKIQNRSALREARDGNKVSIYD-KAANTIDGITTVDLKSARFEKGDLLAGDFDNLIY 235 (297) T ss_pred HHHHHHHHHhhhccC--CcCEEEEcHHHHHHHHHhhccCCceeec-CCCCcccceeeEeecCCCCCCceEEEEecccEEE Confidence 999999999988764 4679999999999999999999998774 3467999999998654 5678899999999999 Q ss_pred EeecceEEEEeccc--------------ccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 791 GMWGALDIQVNPYA--------------LDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 791 ~~~~~l~i~~~~~~--------------~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) +.++++++..+++. .|++|++.||++.|+|+++.+|+||++|+.|= T Consensus 236 ~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~at 295 (297) T protein:vir:95 236 GVPYNITYKISEEGQISTITNADGTPINLFEQEMIAIRATMDIAVMITKTDAFAKLTPAE 295 (297) T ss_pred EEecCeEEEEeeccccccccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEeecC Confidence 99999999887764 38999999999999999999999999999998 No 55 >protein:vir:8187 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817980;genbank:gi:29566414;genbank:GeneID:2700968 Probab=100.00 E-value=3.1e-51 Score=297.47 Aligned_cols=271 Identities=14% Similarity=0.163 Sum_probs=226.7 Q ss_pred cccccccccccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEecCCceeeeeccCcccccccccceeEEeeeee Q lcl|NC_016164. 563 DTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQTGAATAYWVAEGGDPTESQPSVDQVALVAKT 642 (836) Q Consensus 563 ~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~it~~~~t 642 (836) -.+.+.|++++|+.+.+.|++.+++.+++++++. +++...+.+++|+.++++.++|++||+++++++++|+++++.+++ T Consensus 1 mat~~~gg~lvP~~~~~~ii~~~~~~s~i~~~~~-~i~~~~~~~~~p~~~~~~~a~wv~Eg~~~~~~~~~f~~v~l~~~k 79 (311) T protein:vir:81 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSM-AEPQEFGEQQYMTLTAPPRGEVVGEGAQKSESTATFAPVTAIPRK 79 (311) T ss_pred CceecCCceEcchhHHHHHHHHHHhcchhhhhcc-eeecCCCceEEEEEeCCceeEEeecCcccccccceeeEEEEeeEE Confidence 2234557789999999999999999999999965 567777889999999999999999999999999999999999999 Q ss_pred eeeeehhHHHHHh---cchhHHHHHHHHHHHHHHHHHHHHHHHhhcC--Ccccccccccccc--cccc--cccccchhHH Q lcl|NC_016164. 643 LGAYTEFSRRLML---QSSIDVEQMVRTELATVIALEIDRAALYGLG--SNSQPEGLKFVTG--INTE--NFGATNPTYV 713 (836) Q Consensus 643 ~~~~i~ISrelL~---ds~~~l~~~i~~~l~~a~a~~~d~~il~G~G--t~~~p~Gi~~~~~--~~~~--t~aa~~~t~~ 713 (836) ++++++||+|+|. ++..+++++|.++|+++++++++.++++|++ ++..+.|+++... .+.. +.......+. T Consensus 80 l~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~a~l~G~~~~~~~~~~gi~~~~~~~~~~~~~~~~~~~~~~~ 159 (311) T protein:vir:81 80 VQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSATPDL 159 (311) T ss_pred EEEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHhhhccccCCCCcccccccccccccceeeeecccccchHHH Confidence 9999999999995 5567899999999999999999999999974 4445667765422 1222 2222233456 Q ss_pred HHHHHHHHHhhhccccCccEEEecHHHHHHHHHHhhccCcccccc----CCCCeecceeeEeeCccccc----------- Q lcl|NC_016164. 714 ELVSMESKVAADNADIGAMSYLTNSTLYGGFKTTEKATSTAQFVL----EPGGTVNGYNVVRSNQVANG----------- 778 (836) Q Consensus 714 ~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~~g~~~~~~----~~~~~l~G~pVv~s~~~~~~----------- 778 (836) ++.+++..+...+ ..+.+|+|||.++..|+++||++|++.|.. +.+++|+|+||++++.+|.+ T Consensus 160 ~i~~~~~~~~~~~--~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~tl~G~Pv~~~~~i~~~~~~~~~~~~~~ 237 (311) T protein:vir:81 160 AVEAAVGLVLGDN--LSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTASTGVY 237 (311) T ss_pred HHHHHHHHhhhcC--CCceEEEEcHHHHHHHHhhhccCCCeeecCccccCCCceecceeEEecccccccccccccccchh Confidence 6777777775543 344579999999999999999999998754 34679999999999988743 Q ss_pred -------eEEEEehhceEEEeecceEEEEeccc-------ccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 779 -------DVFFGVWNQMIMGMWGALDIQVNPYA-------LDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 779 -------~i~~gD~s~~~i~~~~~l~i~~~~~~-------~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) .++||||++|.++.++++++..+++. .|.+|++.||++.|+|++|++|+||++++.|+ T Consensus 238 ~~~~~~~~~~~gDfs~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~r~~~r~d~~v~~~~a~~~l~~a~ 309 (311) T protein:vir:81 238 RTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDAD 309 (311) T ss_pred cccCCccEEEEEecccEEEEEeccceEEEeccCCCCcchhhhhcCcEEEEEEEEeccEeecccceEEEEeec Confidence 36899999999999999999888763 48999999999999999999999999999999 No 56 >protein:vir:101607 Length: 379 # NCBI annotation: major capsid protein precursor # Family: family:all:585 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112497;genbank:gi:53793597;uniprot:Q5ZGF6;genbank:GeneID:3101715 Probab=100.00 E-value=2e-49 Score=287.51 Aligned_cols=368 Identities=11% Similarity=0.084 Sum_probs=226.5 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhHHHHHHHHhhhhhhHHHHh Q lcl|NC_016164. 413 IDMEAVRAQAAADERSRVASITSLCREHKADDLAQGLIESGASEADAMRSVLSEIAKRPAAQPATPAAPVRSAQPIAAGG 492 (836) Q Consensus 413 ~~~e~~~~~~~~~~~~~~~ei~al~~~~~l~e~a~eliee~~t~~e~~~~~l~~l~~~~~a~~~~~~~~~~~~~~~~~~~ 492 (836) ++. .+......++.+.. +....+. .++.....+.+.............. . T Consensus 1 m~~--------~e~~~~~~~~~~~l-----~~~~~~~-------~~e~~~~~e~~~~~~~~~~~~~~~e----------~ 50 (379) T protein:vir:10 1 MEA--------LEIKVALEAIKGQV-----DSKSSAQ-------ALEVKGLIEALEAKMTSEKDLAVNE----------L 50 (379) T ss_pred CCH--------HHHHHHHHHHHHHH-----HHHHHHH-------HHHHHHHHHHHHhHhhHHHHHHHHH----------H Confidence 100 00000001111000 0000000 0001111111110000000000000 0 Q ss_pred hhhhhhhhhhHHHHHHHhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhcccccccccc Q lcl|NC_016164. 493 GSADIGLTDKEARSFSFVRAIRAQMMPGDRAAFEAAAFEREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTASAAGDLV 572 (836) Q Consensus 493 ~~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~~v 572 (836) .........+... ............. ....... ........ ..+ .+ .............+.++.++.+ T Consensus 51 ~~~~~~l~~~~~~-~e~~~~~~~~~~~-~~~~~~~--~~~~~~~~-~~~----~~---~~~~~~~~~~~~~~~~~~~~~~ 118 (379) T protein:vir:10 51 KSDMAALQAHADK-LDVKLKEKAKSED-KSDSLVK--SITENFND-IKE----VR---NGKSIQVKAVGDMTLPVNLTGA 118 (379) T ss_pred HHHHHHHHHHHHH-HHHHHHhcccccc-cchhHHH--HHHHHHHh-HHH----HH---hhhhhhhhhhcccccCCCCccc Confidence 0000000000000 0000000000000 0000000 00000000 000 00 0000111112222334444456 Q ss_pred cchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEecC--CceeeeeccCcccccccccceeEEeeeeeeeeeehhH Q lcl|NC_016164. 573 FTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQTG--AATAYWVAEGGDPTESQPSVDQVALVAKTLGAYTEFS 650 (836) Q Consensus 573 vp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~~--~~~a~~v~Eg~~~~~~~~~~~~it~~~~t~~~~i~IS 650 (836) +|+.+...|++.++..+++++++ ++++..++.+.+|+.++ .+.+.|++||+.+|+++++|+++++++++|+++++|| T Consensus 119 ip~~~~~~ii~~~~~~~~i~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~~~iS 197 (379) T protein:vir:10 119 QPKDYNFDVVLNPSQMLNVSDIV-GAVSISGGTYTFVRENGAGEGAIGAQVEGATKGQKDYDISMIDVNTDFIAGFTRYS 197 (379) T ss_pred cchhhhhHHHHhHHhhhhHHhhc-eeeeccCCceEEEEeecCCCcccccccCCccccccccceeeeEeeeeeEEeeehhh Confidence 78889999999999999999985 56677778899998765 3567889999999999999999999999999999999 Q ss_pred HHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccccccccccchhHHHHHHHHHHHhhhccccC Q lcl|NC_016164. 651 RRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTENFGATNPTYVELVSMESKVAADNADIG 730 (836) Q Consensus 651 relL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~aa~~~t~~~l~~a~~~l~~~~~~~~ 730 (836) +++|.|+ +.+.++|.+.|+++++.+++.+|+.|+|+++ +.+. ....+..+++++.++++.+...+. . T Consensus 198 ~ell~D~-~~l~~~i~~~la~~~~~~~~~~~~~g~~~~~-~~~~---------~~~~~~~~~d~i~~~~~~~~~~~~--~ 264 (379) T protein:vir:10 198 KKMANNL-PFLTSFIPNALRRDYAKAENAAFNAVLAANA-TAST---------EIITNKNKVEMLINEIAKQENLDF--P 264 (379) T ss_pred HHHHhhH-HHHHHHHHHHHHHHHHHHHHHHHhccccccc-cccc---------ccccCcccHHHHHHHHHhhhhccC--C Confidence 9999876 5799999999999999999999999988642 1111 112233457889999998877653 4 Q ss_pred ccEEEecHHHHHHHHHHhhccCccccccC------CCCeecceeeEeeCccccceEEEEehhceEEEeecceEEEEecc- Q lcl|NC_016164. 731 AMSYLTNSTLYGGFKTTEKATSTAQFVLE------PGGTVNGYNVVRSNQVANGDVFFGVWNQMIMGMWGALDIQVNPY- 803 (836) Q Consensus 731 ~~~~vmnp~~~~~L~~lkd~~g~~~~~~~------~~~~l~G~pVv~s~~~~~~~i~~gD~s~~~i~~~~~l~i~~~~~- 803 (836) +.+|+|||.+|..|+++||++|+|++... .+.+|+|+||++++.+|+++++||||+.+.+.++.++.+..+.+ T Consensus 265 ~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~l~G~pvv~s~~~~ag~~~~gdf~~~~~~~~~~~~i~~~~~~ 344 (379) T protein:vir:10 265 VTAIVLRPTDYYDILVTQKSVGAGYGLPGVVTQDNGVLRINGIPLFRATWLAANKYYVGDWTRVTKVTTEGLSLEFSEVE 344 (379) T ss_pred CCEEEEcHHHHHHHHHhhccCCceeccCCccCCCCCcceecceeeEecCCCCCCceEEeecccEEEEEEeceEEEEeecc Confidence 66899999999999999999999877432 22489999999999999999999999999888888877776654 Q ss_pred -cccccCcEEEEEEEEeccEEEcccceEEEee-cC Q lcl|NC_016164. 804 -ALDKSGSVRVTALQDVDVAVRHPEAFCRGND-NL 836 (836) Q Consensus 804 -~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~-A~ 836 (836) .+|.+|++.||++.|+|++|++|+|||+++- || T Consensus 345 ~~~f~~~~~~~r~~~R~~~~v~~p~a~v~~~~~~~ 379 (379) T protein:vir:10 345 GTNFVKNNITARIEAQVALAVEQPAALIFGDFTAV 379 (379) T ss_pred cccccCCcEEEEEEEEeccEEecCccEEEEEecCC Confidence 4699999999999999999999999999995 45 No 57 >protein:vir:4953 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049929;genbank:gi:9632900;genbank:GeneID:1262076 Probab=100.00 E-value=1.9e-49 Score=287.73 Aligned_cols=367 Identities=14% Similarity=0.169 Sum_probs=230.1 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhHHHHHHHHhhhhhhHHHH Q lcl|NC_016164. 412 TIDMEAVRAQAAADERSRVASITSLCREHKADDLAQGLIESGASEADAMRSVLSEIAKRPAAQPATPAAPVRSAQPIAAG 491 (836) Q Consensus 412 ~~~~e~~~~~~~~~~~~~~~ei~al~~~~~l~e~a~eliee~~t~~e~~~~~l~~l~~~~~a~~~~~~~~~~~~~~~~~~ 491 (836) ++..++.+ ....+. ..++.++.... .....++..+. +....+.+++ .................... T Consensus 1 Mk~~~el~-~~~~~~---~~~~~~l~~~~-----~~~~~~~~~~~-ee~~~~~~~i----~~~~~~~e~~~~~~~~~~~~ 66 (397) T protein:vir:49 1 MKTSNELH-DLWVAQ---GDKVENLNEKL-----NVAMLDDSVSA-EELQAIKNER----DTAKMKRDMFKEQYTEARAN 66 (397) T ss_pred CchHHHHH-HHHHHH---HHHHHHHHHHH-----HHHHhhhhcCH-HHHHHHHHHH----HHHHHHHHHHHHHHHHHHHH Confidence 11111110 111111 11111111000 00000111110 0111111111 11000000000000000000 Q ss_pred hhhhhhhhhhhHHHHHHHhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhccccccccc Q lcl|NC_016164. 492 GGSADIGLTDKEARSFSFVRAIRAQMMPGDRAAFEAAAFEREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTASAAGDL 571 (836) Q Consensus 492 ~~~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~~ 571 (836) .. .......+ ......... ......+++... ..+. ........+.++++.|++ T Consensus 67 -----------~~--~~~~~~~~---~~~~~~~~~---~~~~~~~~~~~~----l~~~----~~~~~~~~~~~t~~~gg~ 119 (397) T protein:vir:49 67 -----------EV--ANMSEEEK---KPLTKSEEE---VKAGFVKDFKNL----VRGR----YQNLLDSKTDASGSDAGL 119 (397) T ss_pred -----------hh--hccccccc---cccccchhH---HHHHHHHHHHHH----Hhcc----hhHHHHHhhccccccCcc Confidence 00 00000000 000000000 000111111111 1110 111122233445567888 Q ss_pred ccchhhHHHHHHHHHhhhhhhhhccee-eecCCceEEEEEecC-CceeeeeccCccccc-ccccceeEEeeeeeeeeeeh Q lcl|NC_016164. 572 VFTDGRPGSFIELLRNRLALNTLGVTM-LTGLQGPVAIPRQTG-AATAYWVAEGGDPTE-SQPSVDQVALVAKTLGAYTE 648 (836) Q Consensus 572 vvp~~~~~~ii~~l~~~~~l~~l~~~~-~~~~~~~~~~p~~~~-~~~a~~v~Eg~~~~~-~~~~~~~it~~~~t~~~~i~ 648 (836) ++|+.+...|++.+++.++|++++... ++...+.+.+++... .+.+.|++|++++++ ++++|+++++++++++++++ T Consensus 120 ~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~i~~~~~k~~~~~~ 199 (397) T protein:vir:49 120 TIPQDIQTAIHTLVSQYDSLQEYVNVENVTTLTGSRVYEKWTDITGLANIDDEAGKIADVDDPKLSLIKYTIKRYAGIST 199 (397) T ss_pred cccHhHHHHHHHHHHhhhhHHhhhceeecccCccceEEEeeccCCcceeeecCccccccccccceeeEEeeeeeEEeeeh Confidence 899999999999999999999985443 334566777777654 467899999999986 67999999999999999999 Q ss_pred hHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccccccccccchhHHHHHHHHHHHhhhccc Q lcl|NC_016164. 649 FSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTENFGATNPTYVELVSMESKVAADNAD 728 (836) Q Consensus 649 ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~aa~~~t~~~l~~a~~~l~~~~~~ 728 (836) ||++||.|+.++++++|.+.|+++++++++.+|++|+|++..+. +..++++|.++++++..++. T Consensus 200 iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~ai~~G~g~~~~~~---------------~~~~~d~i~~~~~~l~~~~~- 263 (397) T protein:vir:49 200 VTNSLLADSAENILAWLSGWIAKKVVVTRNKAILEAIAALPTKP---------------TLTKWDDIIDLEAKVDPAIK- 263 (397) T ss_pred hHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc---------------ccccHHHHHHHHHhhhhhhc- Confidence 99999999999999999999999999999999999999764332 22468899999999988764 Q ss_pred cCccEEEecHHHHHHHHHHhhccCcccccc----CCCCeecceeeEeeCc--ccc-----ceEEEEehhc-eEEEeecce Q lcl|NC_016164. 729 IGAMSYLTNSTLYGGFKTTEKATSTAQFVL----EPGGTVNGYNVVRSNQ--VAN-----GDVFFGVWNQ-MIMGMWGAL 796 (836) Q Consensus 729 ~~~~~~vmnp~~~~~L~~lkd~~g~~~~~~----~~~~~l~G~pVv~s~~--~~~-----~~i~~gD~s~-~~i~~~~~l 796 (836) .+++|+|||.+|..|+.++|++|+|+|.. +.+++|+|+||++++. +|. ..++||||++ |.+++++++ T Consensus 264 -~~a~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~ 342 (397) T protein:vir:49 264 -QTSFFLTNTSGFTALKKVKNALGDYLMERDVKSPTGYSIDGFAVKEVADRWLANGTGGAMPLYFGDLKQAVTLFDRQHM 342 (397) T ss_pred -CCCEEEEcHHHHHHHHHhhcCCCceeeccCcCCCCCceecceeeEEecccccccccCCceeEEEeeccceEEEEeecce Confidence 56899999999999999999999998743 3356899999987543 333 3489999997 678999999 Q ss_pred EEEEeccc--ccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 797 DIQVNPYA--LDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 797 ~i~~~~~~--~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) ++..+++. +|.+|++.||++.|+|+++++|+||++++.+= T Consensus 343 ~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~ 384 (397) T protein:vir:49 343 SLLSTNIGGGAFETDTTKVRVIDRFDVVATDTEAFVPASFKA 384 (397) T ss_pred EEEEeccccchhhcCceeEEEEeeeCcEEecccceEEEEeec Confidence 99999875 69999999999999999999999999999443 No 58 >protein:vir:9759 Length: 303 # NCBI annotation: putative structural protein # Family: family:all:966 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795521;genbank:gi:28876283;genbank:GeneID:1257824 Probab=100.00 E-value=6.2e-51 Score=295.85 Aligned_cols=271 Identities=14% Similarity=0.146 Sum_probs=228.8 Q ss_pred cccccccccccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEecCCceeeeeccCcccccccccceeEEeeeee Q lcl|NC_016164. 563 DTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQTGAATAYWVAEGGDPTESQPSVDQVALVAKT 642 (836) Q Consensus 563 ~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~it~~~~t 642 (836) -.+.+.|+.++|+.+...|++.+++.+++++++ ++.+..++...+|+.++++.+.|++|++++|+++++|+++++++++ T Consensus 1 m~t~t~gg~liP~~~~~~ii~~l~~~s~i~~l~-~~~~~~~~~~~ip~~~~~~~a~wv~E~~~~~~s~~~f~~v~l~~~k 79 (303) T protein:vir:97 1 MGTETSKASLFDKHLVSDLINKVKGHSSLAKLS-SQKPIPFNGSKEFTFTLDSDIDVVAENGKKTHGGLSLEPVTIVPIK 79 (303) T ss_pred CcccCCCCeEcchhHHHHHHHHHHhhchhhhhc-ceeecCCCceEEEEEecCcceEEeecCccccccccceeeEEeeeEE Confidence 223345677889999999999999999999995 4556777889999999999999999999999999999999999999 Q ss_pred eeeeehhHHHHHh---cchhHHHHHHHHHHHHHHHHHHHHHHHhhcCC----cccccccc--cccccccccccccchhHH Q lcl|NC_016164. 643 LGAYTEFSRRLML---QSSIDVEQMVRTELATVIALEIDRAALYGLGS----NSQPEGLK--FVTGINTENFGATNPTYV 713 (836) Q Consensus 643 ~~~~i~ISrelL~---ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt----~~~p~Gi~--~~~~~~~~t~aa~~~t~~ 713 (836) +++++++|+|+|. ++.++++++|.++|++++++++|.++++|+++ +..+.|+. ........+.+++..+++ T Consensus 80 l~~~~~iS~ell~~~~d~~~~l~~~i~~~la~a~~~~ld~a~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (303) T protein:vir:97 80 VEYGARLSDEFLYATEEEKIDILKAFNEGFAKKLARGIDLMAMHGINPRTKKASDVIGTNHFDSKVTQVVKFTESEDADA 159 (303) T ss_pred EEEeehhhHHHhhcCccchHHHHHHHHHHHHHHHHHHHHhhhhcccccCCccccccccccccccccccccccccccchHH Confidence 9999999999994 66789999999999999999999999999643 33334433 222333344445567899 Q ss_pred HHHHHHHHHhhhccccCccEEEecHHHHHHHHHHhhccCccccccC-----CCCeecceeeEeeCccccc--------eE Q lcl|NC_016164. 714 ELVSMESKVAADNADIGAMSYLTNSTLYGGFKTTEKATSTAQFVLE-----PGGTVNGYNVVRSNQVANG--------DV 780 (836) Q Consensus 714 ~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~~g~~~~~~~-----~~~~l~G~pVv~s~~~~~~--------~i 780 (836) +|.+++..+...+ ..+..|+|||+++.+|+++||++|+++|.+. .+++|+|+||++++++|.. .+ T Consensus 160 ~i~~~~~~~~~~~--~~~~~~vmn~~~~~~L~~lkd~~g~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~~~~~ 237 (303) T protein:vir:97 160 NIEAAVNLIQGAE--GVVTGLAMDTEFSTALAKVTNGEMGPKMYPELAWGANPDSINGLKSSVNTTVGAGADEAESKDLV 237 (303) T ss_pred HHHHHHHHHhhcC--CCccEEEEcHHHHHHHHHhhccCCCeEEecCccCCCCCceecceeeEEecccCCccccCCCccEE Confidence 9999999887654 3456899999999999999999999887442 3468999999999999853 38 Q ss_pred EEEehh-ceEEEeecceEEEEeccc--------ccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 781 FFGVWN-QMIMGMWGALDIQVNPYA--------LDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 781 ~~gD~s-~~~i~~~~~l~i~~~~~~--------~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) +||||+ .|.++.+.++++..+++. +|.+|++.||++.|+|+++++|+||+++|+|= T Consensus 238 ~~Gdf~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~n~~~~r~~~r~~~~v~~p~af~~l~~~~ 302 (303) T protein:vir:97 238 IIGDFESMFKWGYAKQIPMEIIKYGDPDNSGKDLKGYNQIYLRAEAYIGWGILDAKSFARVTKGE 302 (303) T ss_pred EEeeccccEEEEEecCcEEEEeeccCCCCcchhhhhcCcEEEEEEEEeccEeecccceEEeeCCC Confidence 999995 578999999999888764 48999999999999999999999999999998 No 59 >protein:vir:2430 Length: 318 # NCBI annotation: major head subunit # Family: family:all:507 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046832;genbank:gi:9630400;genbank:GeneID:1261582 Probab=100.00 E-value=1.5e-50 Score=293.78 Aligned_cols=285 Identities=17% Similarity=0.161 Sum_probs=231.7 Q ss_pred HHhhhhhhhhhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEecCCceeee Q lcl|NC_016164. 540 QRMGVTPRGILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQTGAATAYW 619 (836) Q Consensus 540 ~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~~~~~a~~ 619 (836) -+.|. . .....+.....++ +.++.++|+.+...|++.+++.+++++++ ++++...+.+.+|+.++.+.+.| T Consensus 1 ~~~~~----~---~~~e~~~~~~~~~-~~~~~~ip~~~~~~ii~~~~~~~~l~~~~-~~~~~~~~~~~ip~~~~~~~a~~ 71 (318) T protein:vir:24 1 MAAGT----A---FAVDHAQIAQTGD-TMFKGYLEPEQAKDYFAEAEKTSIVQQFA-QKVPMGTTGQKIPHWVGDVSAQW 71 (318) T ss_pred CCCCC----C---CCHHHHHhhcccC-cccceeechhHHHHHHHHHHhhchhhhhc-ceeeccCCceEEEEEeCCcceEE Confidence 01110 0 1112222233333 34445678889999999999999999995 45677778899999999999999 Q ss_pred eccCcccccccccceeEEeeeeeeeeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccc Q lcl|NC_016164. 620 VAEGGDPTESQPSVDQVALVAKTLGAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTG 699 (836) Q Consensus 620 v~Eg~~~~~~~~~~~~it~~~~t~~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~ 699 (836) ++|++++++++++|+++++++++++++++||+|+|.|+.++++++|.+.|++++++++|.++|+|+|+ +.|.|+++... T Consensus 72 v~Eg~~~~~~~~~f~~i~~~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~~~~~~~d~a~l~G~g~-~~~~~~~~~~~ 150 (318) T protein:vir:24 72 IGEGDMKPITKGNMTSQTIAPHKIATIFVASAETVRANPANYLGTMRTKVATAFAMAFDGAAMHGTDS-PFPTYIGQTTK 150 (318) T ss_pred ecCCccccccccceeEEEEeeEEEEEeehhhHHHhhcChHHHHHHHHHHHHHHHHHHHHHhhhcccCC-CCCcccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999996 57888887654 Q ss_pred cccccccc--cchhHHHHHHHHHHHhhhccccCccEEEecHHHHHHHHHHhhccCccccccCC---------CCeeccee Q lcl|NC_016164. 700 INTENFGA--TNPTYVELVSMESKVAADNADIGAMSYLTNSTLYGGFKTTEKATSTAQFVLEP---------GGTVNGYN 768 (836) Q Consensus 700 ~~~~t~aa--~~~t~~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~~g~~~~~~~~---------~~~l~G~p 768 (836) ....+... .....+++.+++..+... +..+.+|+|||++|..|+.+||++|+++|.... +++++|+| T Consensus 151 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~~~~~~i~g~p 228 (318) T protein:vir:24 151 AISIADTTGATTVYDQVAVNGLSLLVND--GKKWTHTLLDDITEPILNGAKDQNGRPLFIESTYGEAASPFRSGRIVARP 228 (318) T ss_pred cccccccccccchHHHHHHHHHHhhccc--cCCCCEEEEcHHHHHHHHHhhccCCceeecCccccCccccccCceEEEEe Confidence 43333222 223344566777766554 456789999999999999999999999875432 24799999 Q ss_pred eEeeCccccce--EEEEehhceEEEeecceEEEEeccc--------------ccccCcEEEEEEEEeccEEEcccceEEE Q lcl|NC_016164. 769 VVRSNQVANGD--VFFGVWNQMIMGMWGALDIQVNPYA--------------LDKSGSVRVTALQDVDVAVRHPEAFCRG 832 (836) Q Consensus 769 Vv~s~~~~~~~--i~~gD~s~~~i~~~~~l~i~~~~~~--------------~~~~~~~~~r~~~r~d~~v~~p~Af~~l 832 (836) |++++.++.++ ++||||+++.+++++++.+..+++. .|++|++.||+++|+|+++.+|+||+++ T Consensus 229 v~~~~~~~~~~~~~~~gdfs~~~~~~~~~l~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~i 308 (318) T protein:vir:24 229 TILSDHVVEGTTVGFMGDFSQLIWGQIGGLSFDVTDQATLNLGTVESPNFVSLWQHNLVAVRVEAEYAFHCNDAEAFVAL 308 (318) T ss_pred eEEeCCCCCCccEEEEeecceEEEEEecCeEEEEeeccceeccccccccchhhhhcCcEEEEEEEEEccEEecccceEEE Confidence 99999999765 5899999999999999999877653 3899999999999999999999999999 Q ss_pred eecC Q lcl|NC_016164. 833 NDNL 836 (836) Q Consensus 833 ~~A~ 836 (836) +.+. T Consensus 309 ~~~~ 312 (318) T protein:vir:24 309 TNVV 312 (318) T ss_pred Eeec Confidence 9988 No 60 >protein:vir:81160 Length: 371 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285811;genbank:gi:148747732;genbank:GeneID:5247203 Probab=100.00 E-value=3.2e-49 Score=286.45 Aligned_cols=347 Identities=14% Similarity=0.146 Sum_probs=229.7 Q ss_pred hhhhhhhhhhhhh-hhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhHHHHHHHHhhhhhhHHHHhhhhhhhhhhhHHHHH Q lcl|NC_016164. 429 RVASITSLCREHK-ADDLAQGLIESGASEADAMRSVLSEIAKRPAAQPATPAAPVRSAQPIAAGGGSADIGLTDKEARSF 507 (836) Q Consensus 429 ~~~ei~al~~~~~-l~e~a~eliee~~t~~e~~~~~l~~l~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 507 (836) -.+++.++..... ..+....+..+. ..+..+.+..+................ .. T Consensus 1 M~k~l~~l~e~~~~~~~e~~~~~~~~------~~e~~~~~~~ei~~l~~~i~~~~~~~~---------------~~---- 55 (371) T protein:vir:81 1 MPKELRELLEQINNKKEEARKLLAEN------KIEEAKKLKEEIVALQEKFDVAKELYE---------------EQ---- 55 (371) T ss_pred CcHHHHHHHHHHHHHHHHHHHHhhHH------HHHHHHHHHHHHHHHHHHHHHHHHHHH---------------HH---- Confidence 0012222211110 001111111110 000111111111111111100000000 00 Q ss_pred HHhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHh Q lcl|NC_016164. 508 SFVRAIRAQMMPGDRAAFEAAAFEREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRN 587 (836) Q Consensus 508 ~~~~a~~a~~~~~~~~~~~~~~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~ 587 (836) ....... ........ ........+.+.. + ...+...+.++.+.|++++|+.+...|++.+++ T Consensus 56 --~~~~~~~-~~~~~~~~----~~~~~~~~~~~~l----~-------~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~ 117 (371) T protein:vir:81 56 --KQTIEDK-EPLKPTVQ----VKENEVEAFVNHI----R-------TRFRNAMSEGSNQDGGYTVPQDIQTRINELRES 117 (371) T ss_pred --HHhhccc-cccccchh----hHHHHHHHHHHHH----H-------HHHHHhhccCCCccCceeecHhHHHHHHHHHHh Confidence 0000000 00000000 0001111111111 0 011223334455668888999999999999999 Q ss_pred hhhhhhhccee-eecCCceEEEEEecCCceeeeeccCccccc-ccccceeEEeeeeeeeeeehhHHHHHhcchhHHHHHH Q lcl|NC_016164. 588 RLALNTLGVTM-LTGLQGPVAIPRQTGAATAYWVAEGGDPTE-SQPSVDQVALVAKTLGAYTEFSRRLMLQSSIDVEQMV 665 (836) Q Consensus 588 ~~~l~~l~~~~-~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~-~~~~~~~it~~~~t~~~~i~ISrelL~ds~~~l~~~i 665 (836) .+++++++..+ +++..+.+.+++..+.+.+.|++||+++++ ++++|+++++++++++++++||+++|.|+.++++++| T Consensus 118 ~s~i~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i 197 (371) T protein:vir:81 118 KDALQNLITVEPVTTLSGSRVFKKRSQQTGFVEVAEGAAIGEKATPQFTLLQYQVKKYAGFFRVTNELLNDSTEAIVNTL 197 (371) T ss_pred hhhhhhhceeeeccCCceeEEEEeecCCcceeeeccccccccccccceeeEEeeeeEEEEeehhhHHHHhhhhHHHHHHH Confidence 99999986543 233456777888888889999999999986 6799999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHhhcCCcccccccccccccccccccccchhHHHHHHHHH-HHhhhccccCccEEEecHHHHHHH Q lcl|NC_016164. 666 RTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTENFGATNPTYVELVSMES-KVAADNADIGAMSYLTNSTLYGGF 744 (836) Q Consensus 666 ~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~aa~~~t~~~l~~a~~-~l~~~~~~~~~~~~vmnp~~~~~L 744 (836) .+.|+++++++++.+|++|+|++ .|.|+ .+++++..++. .+...+ ..+++|+|||++|..| T Consensus 198 ~~~l~~a~~~~~~~~i~~g~g~~-~~~~~---------------~~~~~i~~~~~~~l~~~~--~~~a~~vmn~~~~~~L 259 (371) T protein:vir:81 198 VRWIGDESRVTRNGLIINVLNTK-AKTAI---------------ADLDGLKQIINVQLDPVF--RSTSSVIVNQDAFNWL 259 (371) T ss_pred HHHHHHHHHHHHHHHHHhhcccc-ccccc---------------ccHHHHHHHHHhhcchhh--hcCCEEEEcHHHHHHH Confidence 99999999999999999999974 34333 35666776664 454444 4578999999999999 Q ss_pred HHHhhccCcccccc----CCCCeecceeeEeeCcccc------------ceEEEEehhc-eEEEeecceEEEEeccc--c Q lcl|NC_016164. 745 KTTEKATSTAQFVL----EPGGTVNGYNVVRSNQVAN------------GDVFFGVWNQ-MIMGMWGALDIQVNPYA--L 805 (836) Q Consensus 745 ~~lkd~~g~~~~~~----~~~~~l~G~pVv~s~~~~~------------~~i~~gD~s~-~~i~~~~~l~i~~~~~~--~ 805 (836) ++++|++|+|+|.. +.+++|+|+||++++.+|. ..++||||+. |.++++.++++.++++. . T Consensus 260 ~~lkd~~g~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~~~~~i~~Gd~~~~~~~~~~~~~~i~~~~~~~~~ 339 (371) T protein:vir:81 260 DTLKDQNGQYLLQPSISSPTGRQLLGLPVVIVSNKVLANRVDGGTGAQFAPIIVGDLKEAVVMFDRQRTEIMSSNVAMDA 339 (371) T ss_pred HHhhccCCCeeeecccCCCCCceecceeEEEecccccCccccccccCCcceEEEEehhceEEEEeecceEEEEeccccch Confidence 99999999998743 3457999999999998873 2489999997 67889999999999875 5 Q ss_pred cccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 806 DKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 806 ~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) |.+|++.||++.|+|+++++|+||++++.+. T Consensus 340 f~~~~v~~~~~~r~d~~~~~~~a~~~~~~~~ 370 (371) T protein:vir:81 340 FETDATLWRAIERMDVKMRDDEAFVFGEVQL 370 (371) T ss_pred hhcCceEEEEEEeeccEEecccceEEEEEec Confidence 8899999999999999999999999999888 No 61 >protein:vir:4997 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049971;genbank:gi:9632943;genbank:GeneID:1262106 Probab=100.00 E-value=4.9e-49 Score=285.45 Aligned_cols=367 Identities=14% Similarity=0.164 Sum_probs=230.2 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhHHHHHHHHhhhhhhHHHH Q lcl|NC_016164. 412 TIDMEAVRAQAAADERSRVASITSLCREHKADDLAQGLIESGASEADAMRSVLSEIAKRPAAQPATPAAPVRSAQPIAAG 491 (836) Q Consensus 412 ~~~~e~~~~~~~~~~~~~~~ei~al~~~~~l~e~a~eliee~~t~~e~~~~~l~~l~~~~~a~~~~~~~~~~~~~~~~~~ 491 (836) ++..+.. .....+......++.+.. ++...+.....+ .++.+..+.................. T Consensus 1 Mk~~~eL-~~~~~~~~~~~~~l~~~~---------~~~~~~~~~~~e----e~~~l~~ei~~~~~~~~~~~~~~~~~--- 63 (397) T protein:vir:49 1 MKTSNEL-HDLWIAQGDKVENLNEKL---------NVAMLDDSVSAE----ELQAIKNERDTAKMKRDLFKEQYTEA--- 63 (397) T ss_pred CchHHHH-HHHHHHHHHHHHHHHHHH---------HHHHhcchhhHH----HHHHHHHHHHHHHHHHHHHHHHHHHH--- Confidence 1111111 000111111111111000 000000000000 11111111111000000000000000 Q ss_pred hhhhhhhhhhhHHHHHHHhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhccccccccc Q lcl|NC_016164. 492 GGSADIGLTDKEARSFSFVRAIRAQMMPGDRAAFEAAAFEREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTASAAGDL 571 (836) Q Consensus 492 ~~~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~~ 571 (836) .... ...... ....+... ..........+.+....++. ........+.++++.|++ T Consensus 64 -~~~~----------~~~~~~--~~~~~~~~-------~~~~~~~~~~~~~~~~l~~~----~~~~~~~~~~~t~~~gg~ 119 (397) T protein:vir:49 64 -RANE----------VANMSE--EEKKPLTK-------NEEEVKANFVKDFKNLVRGR----YQNLLDSKTDGSGSDAGL 119 (397) T ss_pred -HHhh----------hhcccc--cccccccc-------hhhHHHHHHHHHHHHHhhcc----hhhHHHhhhccCCccCcc Confidence 0000 000000 00000000 00000011111111111110 111222334455667888 Q ss_pred ccchhhHHHHHHHHHhhhhhhhhccee-eecCCceEEEEEecC-CceeeeeccCccccccc-ccceeEEeeeeeeeeeeh Q lcl|NC_016164. 572 VFTDGRPGSFIELLRNRLALNTLGVTM-LTGLQGPVAIPRQTG-AATAYWVAEGGDPTESQ-PSVDQVALVAKTLGAYTE 648 (836) Q Consensus 572 vvp~~~~~~ii~~l~~~~~l~~l~~~~-~~~~~~~~~~p~~~~-~~~a~~v~Eg~~~~~~~-~~~~~it~~~~t~~~~i~ 648 (836) ++|+.+...|++.+++.++|++++... ++...+.+.+++... .+.+.|++|++++++++ ++|+++++++++++++++ T Consensus 120 ~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~ 199 (397) T protein:vir:49 120 TIPQDIRTAINTLVRQFDSLQEYVNVENVTTLTGSRVYEKWADITGLAKLDDEGGQIGQNDDPKLSLIRYAIKRYAGIST 199 (397) T ss_pred eecHHHHHHHHHHHHhhhhHhhhcceeeccCCcceEEEEeeccCCcceeeeccccccccccccceeeeEeeeeeeEeehh Confidence 999999999999999999999985433 334456777877754 46789999999998765 799999999999999999 Q ss_pred hHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccccccccccchhHHHHHHHHHHHhhhccc Q lcl|NC_016164. 649 FSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTENFGATNPTYVELVSMESKVAADNAD 728 (836) Q Consensus 649 ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~aa~~~t~~~l~~a~~~l~~~~~~ 728 (836) ||+++|.|+..+++++|.+.|+++++++++.+|++|+|++... ++.+++++|.+++.++...+ T Consensus 200 iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~ail~G~g~~~~~---------------~~~~~~d~i~~~~~~l~~~~-- 262 (397) T protein:vir:49 200 VTNSLLADSAENILAWLSGWIAKKVVVTRNKAILEAIGTLPNK---------------PTLAKWDDIIDLQAKVDPAI-- 262 (397) T ss_pred hHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc---------------ccccCHHHHHHHHHhhhhhh-- Confidence 9999999999999999999999999999999999999975321 23357899999999998775 Q ss_pred cCccEEEecHHHHHHHHHHhhccCcccccc----CCCCeecceeeEeeC--cccc-----ceEEEEehhc-eEEEeecce Q lcl|NC_016164. 729 IGAMSYLTNSTLYGGFKTTEKATSTAQFVL----EPGGTVNGYNVVRSN--QVAN-----GDVFFGVWNQ-MIMGMWGAL 796 (836) Q Consensus 729 ~~~~~~vmnp~~~~~L~~lkd~~g~~~~~~----~~~~~l~G~pVv~s~--~~~~-----~~i~~gD~s~-~~i~~~~~l 796 (836) ..+++|+|||.+|..|++++|++|+|+|.. +.+++|+|+||++++ .+|. ..++||||+. |.+++++++ T Consensus 263 ~~~a~~v~n~~~~~~l~~lkd~~g~~l~~~~~~~g~~~~l~G~pV~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~ 342 (397) T protein:vir:49 263 KQTSLFLTNTSGFTALKKVKNAMGDYLMERDVKSPTGYSIDGFVVKEISDRFLPNGTGGAMPLYFGDLKQAVTLFDRQHL 342 (397) T ss_pred cCCCEEEEcHHHHHHHHHhhccCCceeecccccCCCCceecceeeEEecccccccccCCceeEEEeeccceEEEEeeccc Confidence 457899999999999999999999998743 345689999998754 3442 3589999996 778999999 Q ss_pred EEEEeccc--ccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 797 DIQVNPYA--LDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 797 ~i~~~~~~--~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) ++..+++. +|.+|++.||++.|+|+++++|+||++++.+- T Consensus 343 ~i~~~~~~~~~~~~~~~~~~~~~r~d~~~~~~~a~~~~~~~~ 384 (397) T protein:vir:49 343 SLLSTNIGGGAFETDTTKVRVIDRFDVVSTDTEAFVPASFKA 384 (397) T ss_pred EEEEeccccchhhcCeeeEEEEEeeccEEecccceEEEEecc Confidence 99999875 58999999999999999999999999998444 No 62 >protein:vir:4226 Length: 326 # NCBI annotation: observed 35.2Kd protein # Family: family:all:507 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039681;swissprot:sw:q05223;genbank:gi:9625447;uniprot:Q05223;genbank:GeneID:2942929 Probab=100.00 E-value=1.4e-50 Score=293.93 Aligned_cols=292 Identities=16% Similarity=0.146 Sum_probs=228.0 Q ss_pred hhhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEE Q lcl|NC_016164. 528 AAFEREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVA 607 (836) Q Consensus 528 ~~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~ 607 (836) .+.....+.+ .....+.+..+.++.+.|+ ++|+.+.+.|++.+++.+++++++ ++++...+.++ T Consensus 1 ~~~~~~r~~~--------------~~~~~e~~a~~~~~~~~g~-~ip~~~~~~ii~~~~~~s~i~~~~-~~~~~~~~~~~ 64 (326) T protein:vir:42 1 MAVNPDRTTP--------------FLGVNDPKVAQTGDSMFEG-YLEPEQAQDYFAEAEKISIVQQFA-QKIPMGTTGQK 64 (326) T ss_pred CCCCccchhh--------------hcCcchhhheeccccCCcc-eechhhHHHHHHHHHhcchhhhhc-ceeeccCCceE Confidence 0000000000 0011112222233334444 467778899999999999999985 45666777899 Q ss_pred EEEecCCceeeeeccCcccccccccceeEEeeeeeeeeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCC Q lcl|NC_016164. 608 IPRQTGAATAYWVAEGGDPTESQPSVDQVALVAKTLGAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGS 687 (836) Q Consensus 608 ~p~~~~~~~a~~v~Eg~~~~~~~~~~~~it~~~~t~~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt 687 (836) +|+.++.+.+.|++|++++++++++|+++++++++++++++||+|+|.++..+++++|.++|++++++++|.++|+|+|+ T Consensus 65 ~p~~~~~~~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~v~iS~ell~~s~~~~~~~i~~~l~~a~~~~~d~a~l~G~gs 144 (326) T protein:vir:42 65 IPHWTGDVSASWIGEGDMKPITKGNMTSQTIAPHKIATIFVASAETVRANPANYLGTMRTKVATAFAMAFDNAAINGTDS 144 (326) T ss_pred EEEEeCCcceEEecCCccccccccceeEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHhhcccCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999996 Q ss_pred ccccccccccccccccc-----ccccchhHHHHHHHHHHHhhhccccCccEEEecHHHHHHHHHHhhccCccccccC--- Q lcl|NC_016164. 688 NSQPEGLKFVTGINTEN-----FGATNPTYVELVSMESKVAADNADIGAMSYLTNSTLYGGFKTTEKATSTAQFVLE--- 759 (836) Q Consensus 688 ~~~p~Gi~~~~~~~~~t-----~aa~~~t~~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~~g~~~~~~~--- 759 (836) ++|.|+++........ ...+..++.++..+.......+....+++|+|||.++..|+++||++|+++|... T Consensus 145 -~~p~gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~ 223 (326) T protein:vir:42 145 -PFPTFLAQTTKEVSLVDPDGTGSNADLTVYDAVAVNALSLLVNAGKKWTHTLLDDITEPILNGAKDKSGRPLFIESTYT 223 (326) T ss_pred -CccccccccccccceeecccccccccchhHHHHHHHHHhhhhhhccCccEEEEeHHHHHHHHHhhccCCceeecccccc Confidence 5799988664432211 1222334444443333333344456678999999999999999999999887542 Q ss_pred ------CCCeecceeeEeeCccccce--EEEEehhceEEEeecceEEEEecccc--------------cccCcEEEEEEE Q lcl|NC_016164. 760 ------PGGTVNGYNVVRSNQVANGD--VFFGVWNQMIMGMWGALDIQVNPYAL--------------DKSGSVRVTALQ 817 (836) Q Consensus 760 ------~~~~l~G~pVv~s~~~~~~~--i~~gD~s~~~i~~~~~l~i~~~~~~~--------------~~~~~~~~r~~~ 817 (836) ..++++|+||++++++|+++ ++||||++++++.++++++..+.+.. |.+|++.||+++ T Consensus 224 ~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~Gd~s~~~~~~~~~~~v~~~~e~~~~~~~~~~~~~~~~~~~d~~~~r~~~ 303 (326) T protein:vir:42 224 EENSPFRLGRIVARPTILSDHVASGTVVGYQGDFRQLVWGQVGGLSFDVTDQATLNLGTPQAPNFVSLWQHNLVAVRVEA 303 (326) T ss_pred CccccccCceeeeeeEEEcCCCCCCceEEEEeecceEEEEEecceEEEEeecceeeecccccccchhhhhcCcEEEEEEE Confidence 13479999999999999886 47899999999999999998776533 888999999999 Q ss_pred EeccEEEcccceEEEeecC Q lcl|NC_016164. 818 DVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 818 r~d~~v~~p~Af~~l~~A~ 836 (836) |+|+++.+|+||++|+.+. T Consensus 304 ~~d~~v~~~~a~~~l~~~~ 322 (326) T protein:vir:42 304 EYAFHCNDKDAFVKLTNVD 322 (326) T ss_pred EeccEEecccceEEEeecc Confidence 9999999999999999888 No 63 >protein:vir:2344 Length: 397 # NCBI annotation: gp14 # Family: family:all:507 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075281;genbank:gi:12657868;genbank:GeneID:920118 Probab=100.00 E-value=2.7e-50 Score=292.38 Aligned_cols=280 Identities=19% Similarity=0.206 Sum_probs=231.9 Q ss_pred hhhhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEecCCceeeeeccCccc Q lcl|NC_016164. 547 RGILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQTGAATAYWVAEGGDP 626 (836) Q Consensus 547 ~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~ 626 (836) -|. ....+.....+.+.++++++|+ +...+++.+++.+++++++ +++++.++.+++|+.++.+.+.|++|++++ T Consensus 1 ~g~----~~e~~~~~~~~t~~~~g~l~~~-~~~~ii~~l~~~s~i~~l~-~~~~~~~~~~~ip~~~~~~~a~wv~Eg~~~ 74 (397) T protein:vir:23 1 MGF----SADHSQIAQTKDTMFTGYLDPV-QAKDYFAEAEKTSIVQRVA-QKIPMGATGIVIPHWTGDVSAQWIGEGDMK 74 (397) T ss_pred CCc----CHHHHHHhhccCCCCccccchh-HHHHHHHHHHhccchhhhc-ceeeccCCceEEEEEcCCcceEEecCCccc Confidence 111 1122223334444555666554 6778999999999999985 456667778999999999999999999999 Q ss_pred ccccccceeEEeeeeeeeeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCccccccccccccccccccc Q lcl|NC_016164. 627 TESQPSVDQVALVAKTLGAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTENFG 706 (836) Q Consensus 627 ~~~~~~~~~it~~~~t~~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~a 706 (836) ++++++|+++++.+++++++++||+|+|.++.++++++|.++|++++++++|.++|+|+|++..+.++.+........ T Consensus 75 ~~s~~~f~~v~l~~~k~~~~v~iS~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~G~gt~~~~~~~~~~~~~~~~~-- 152 (397) T protein:vir:23 75 PITKGNMTKRDVHPAKIATIFVASAETVRANPANYLGTMRTKVATAIAMAFDNAALHGTNAPSAFQGYLDQSNKTQSI-- 152 (397) T ss_pred cccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcccCCcccccccccccceeee-- Confidence 999999999999999999999999999999999999999999999999999999999999876666666554433322 Q ss_pred ccchhHHHHHHHHHHHhhhccccCccEEEecHHHHHHHHHHhhccCccccccCC---------CCeecceeeEeeCcccc Q lcl|NC_016164. 707 ATNPTYVELVSMESKVAADNADIGAMSYLTNSTLYGGFKTTEKATSTAQFVLEP---------GGTVNGYNVVRSNQVAN 777 (836) Q Consensus 707 a~~~t~~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~~g~~~~~~~~---------~~~l~G~pVv~s~~~~~ 777 (836) .+...++++.++...+..++. .+++|+||++++..|+.+||++|+++|.... +++|+|+||++++++|+ T Consensus 153 ~~~~~~~~~~~~~~~l~~~~~--~~a~~vmn~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~~~~~tl~G~Pv~~s~~~~~ 230 (397) T protein:vir:23 153 SPNAYQGLGVSGLTKLVTDGK--KWTHTLLDDTVEPVLNGSVDANGRPLFVESTYESLTTPFREGRILGRPTILSDHVAE 230 (397) T ss_pred cccchhHHHHHHHHhhhhccc--CCCEEEEcHHHHHHHHHhhccCCceeecccccccccccccCceeeeeeEEEeCCCCC Confidence 334567788888888877654 5689999999999999999999999875432 35899999999999998 Q ss_pred ce--EEEEehhceEEEeecceEEEEeccc--------------ccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 778 GD--VFFGVWNQMIMGMWGALDIQVNPYA--------------LDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 778 ~~--i~~gD~s~~~i~~~~~l~i~~~~~~--------------~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) +. ++||||+++++++++++.+..+++. .|++|++.||+++|+|+++++|+||++++.+. T Consensus 231 g~~~~~~gDfs~~~i~~~~~i~i~~~~e~~~~~~~~~~~~~~~lf~~d~v~~ra~~r~d~~v~~~~a~~~~~~~~ 305 (397) T protein:vir:23 231 GDVVGYAGDFSQIIWGQVGGLSFDVTDQATLNLGSQESPNFVSLWQHNLVAVRVEAEYGLLINDVNAFVKLTFDP 305 (397) T ss_pred CceEEEEeecceEEEEEEeceEEEEeeeeeeeeccccccceeeeeeccceeEEEEeeeccceecccceEEEeecc Confidence 76 4799999999999999998887653 48999999999999999999999999999766 No 64 >protein:vir:98635 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039923;genbank:gi:126011098;genbank:GeneID:4818471 Probab=100.00 E-value=3.7e-50 Score=291.62 Aligned_cols=349 Identities=11% Similarity=0.015 Sum_probs=239.9 Q ss_pred hhhhhhhhhhHHHHHHHHHHHhhhhhhhHHHHHHHHhhhhhhHHHHhhhhhhhhhhhHHHHHHHhhhhhhhhhhhhhhhh Q lcl|NC_016164. 446 AQGLIESGASEADAMRSVLSEIAKRPAAQPATPAAPVRSAQPIAAGGGSADIGLTDKEARSFSFVRAIRAQMMPGDRAAF 525 (836) Q Consensus 446 a~eliee~~t~~e~~~~~l~~l~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~ 525 (836) +.-..++.....+...+..+.+..+... . +.. ...... ................. T Consensus 1 M~i~~k~~~~~~~~~~~l~~~~~~~~~~-e-------e~~-~~~~~~-~~~~~~~~~~~~~~e~~--------------- 55 (377) T protein:vir:98 1 MAINLKELPKYREAVAELSAKISAGATS-E-------EQE-KLFEAA-FTTMGDEILAKNEEEME--------------- 55 (377) T ss_pred CCCcHHHHHHHHHHHHHHHHHHHhhhhh-H-------HHH-HHHHHH-HHhHHHHHHHHHHHHHH--------------- Confidence 2111111111111111111111110000 0 000 000000 00000000000000000 Q ss_pred hhhhhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeecCCce Q lcl|NC_016164. 526 EAAAFEREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGP 605 (836) Q Consensus 526 ~~~~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~ 605 (836) .....+.+. ..........-......+..++|++++|+.+...|++.+...+++++++. +.+. ++. T Consensus 56 ----------~~~~~~~~~--~~lt~ee~~~~~~~~~~~~~~~gg~~vP~~~~~~I~~~l~~~s~i~~~~~-v~~~-~~~ 121 (377) T protein:vir:98 56 ----------RMFDLRDKN--RELTAEEIKFFNDIDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVIN-FKNT-SLR 121 (377) T ss_pred ----------HHHHhccCC--cccCHHHHHHHHHHHhccCCCCCccccCHHHHHHHHHHHHHhhhhhhhee-eEec-Ccc Confidence 000000000 00000000000112234556678889999999999999999999999854 4444 467 Q ss_pred EEEEEecCCceeeeeccCcccc-cccccceeEEeeeeeeeeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_016164. 606 VAIPRQTGAATAYWVAEGGDPT-ESQPSVDQVALVAKTLGAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYG 684 (836) Q Consensus 606 ~~~p~~~~~~~a~~v~Eg~~~~-~~~~~~~~it~~~~t~~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G 684 (836) +.+|+.++.+.+.|++|+++.+ .++++|+++++.+++++++++||++||.|+..++++||.+.|+++++++++.+|++| T Consensus 122 ~~~~~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~a~~~is~elL~ds~~~ie~~i~~~la~~~a~~~~~a~i~G 201 (377) T protein:vir:98 122 LKALTAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWIKQFITEQLKEAIAVALELAIVKG 201 (377) T ss_pred eEEEEecCCcceeEeecccccCcccCccceeEeecceeEEeeecccHHhhhccHhHHHHHHHHHHHHHHHHHHhhceEec Confidence 8999999999999999998875 578999999999999999999999999999999999999999999999999999999 Q ss_pred cCCccccccccccccccccccc------ccchhHHHHHHHHHHHhhhccccCccEEEecHHHHHHHHHHhhccCcccccc Q lcl|NC_016164. 685 LGSNSQPEGLKFVTGINTENFG------ATNPTYVELVSMESKVAADNADIGAMSYLTNSTLYGGFKTTEKATSTAQFVL 758 (836) Q Consensus 685 ~Gt~~~p~Gi~~~~~~~~~t~a------a~~~t~~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~~g~~~~~~ 758 (836) +|+ ++|.||++..+..++... +.....+.+.++...+...++ .+++|+||+.++..+++++|.+|++.++. T Consensus 202 ~G~-~qP~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~--~~a~~~m~~~t~~~~~klkd~~G~~i~~~ 278 (377) T protein:vir:98 202 DGL-LQPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLTPDNAP--KKLVPVMKHLSVNDKKRPLKIAGQVKLIL 278 (377) T ss_pred cCC-CcceeeeecccccccccccccccccccchhhhHhhhhhhchhHHH--HHHHHHHHHHHHHHHhhhhccCCceEEEe Confidence 997 589999986543332211 112234567777777776653 46789999999999999999999987631 Q ss_pred ------------------CCCCeeccee--eEeeCccccceEEEEehhceEEEeecceEEEEecccccccCcEEEEEEEE Q lcl|NC_016164. 759 ------------------EPGGTVNGYN--VVRSNQVANGDVFFGVWNQMIMGMWGALDIQVNPYALDKSGSVRVTALQD 818 (836) Q Consensus 759 ------------------~~~~~l~G~p--Vv~s~~~~~~~i~~gD~s~~~i~~~~~l~i~~~~~~~~~~~~~~~r~~~r 818 (836) +.+.+++|+| |+.++++|+++++||||++|.+++++++++..+++..|.+|++.|+++.| T Consensus 279 n~~~~~~~~p~~~~~~~~G~~~t~lg~p~~vv~s~~~p~~~i~fgdf~~Y~i~~r~~~~i~~~~~~~~~~d~~~f~~~~r 358 (377) T protein:vir:98 279 NPEDRWALEAQFTSRNQFGEYVTVLPHGITILESLAVETGKAIAFVANRYDAFMATASTIEEYDQTFAMEDLQLYLTKNY 358 (377) T ss_pred cccchhhccccccccCCCCccccccCCCceEEecCCCCcccEEEEEecceeEEeecceEEEeechhhhhcCceEEEEEEE Confidence 2234788888 67889999999999999999999999999999999999999999999999 Q ss_pred eccEEEcccceEEEeecC Q lcl|NC_016164. 819 VDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 819 ~d~~v~~p~Af~~l~~A~ 836 (836) +|+++++++||++++.+- T Consensus 359 ~dg~~~~~~a~~vl~i~~ 376 (377) T protein:vir:98 359 FYGKAKDNHTAALLTLAG 376 (377) T ss_pred EcCEEeccCcEEEEEEec Confidence 999999999999999999 No 65 >protein:vir:1025 Length: 408 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076679;genbank:gi:13095788;genbank:GeneID:920362 Probab=100.00 E-value=1.2e-48 Score=283.34 Aligned_cols=372 Identities=11% Similarity=0.123 Sum_probs=227.9 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHH--HHHHHHHHhhhhhhhHHHHHHHHhh Q lcl|NC_016164. 406 NHMDSSTIDMEAVRAQAAADERSRVASITSLCREHKADDLAQGLIESGASEAD--AMRSVLSEIAKRPAAQPATPAAPVR 483 (836) Q Consensus 406 ~~~~~~~~~~e~~~~~~~~~~~~~~~ei~al~~~~~l~e~a~eliee~~t~~e--~~~~~l~~l~~~~~a~~~~~~~~~~ 483 (836) +...+...+.......... ++.++... +.+ ++.++..+..+ .....++.+..+............. T Consensus 1 m~~~m~l~el~~~~~~~~~-------~~~~~~~~--~~~---~~~~~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~ 68 (408) T protein:vir:10 1 MGVKLTVNQLNEAWIASGD-------KVTDFNDQ--INM---ALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQA 68 (408) T ss_pred CCccccHHHHHHHHHHHHH-------HHHHHHHH--HHH---HhhcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2222222222111111111 11111100 000 00000001000 0000111111110000000000000 Q ss_pred hhhhHHHHhhhhhhhhhhhHHHHHHHhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhc Q lcl|NC_016164. 484 SAQPIAAGGGSADIGLTDKEARSFSFVRAIRAQMMPGDRAAFEAAAFEREVSEATAQRMGVTPRGILAPNDVLHRDLVVD 563 (836) Q Consensus 484 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~ 563 (836) .. ........+ .+... ... .......+++.... ................. T Consensus 69 ------------------~~--~~~~~~~~~---~~~~~-~~~--~~~~~~~~~~~~~~----~~~~~~~~~~~~~a~~~ 118 (408) T protein:vir:10 69 ------------------EQ--VVNMREEEK---GPLNK-SEN--ELKDKFVKDFVNMV----RNPMAFMNTVSSKTETS 118 (408) T ss_pred ------------------HH--Hhccccccc---ccccc-chh--hhHHHHHHHHHHHh----hcchhhhhhhhhhhhhc Confidence 00 000000000 00000 000 00011111221111 11111111112223344 Q ss_pred ccccccccccchhhHHHHHHHHHhhhhhhhhcceee-ecCCceEEEEEecC-CceeeeeccCcccccc-cccceeEEeee Q lcl|NC_016164. 564 TASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTML-TGLQGPVAIPRQTG-AATAYWVAEGGDPTES-QPSVDQVALVA 640 (836) Q Consensus 564 ~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~-~~~~~~~~~p~~~~-~~~a~~v~Eg~~~~~~-~~~~~~it~~~ 640 (836) ++.+.|++++|+.+...|++.+++.+++++++..+. +...+.+.+++..+ .+.+.|++|++++++. .++|+++++.+ T Consensus 119 ~t~~~gg~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~i~~~~ 198 (408) T protein:vir:10 119 GSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLI 198 (408) T ss_pred ccccCCceeccHhHHHHHHHHHHhhchhhhhcceeeccCCcceEEEeeccccccceeeecCccccccccCcceeeEEeee Confidence 455667888999999999999999999999865433 23456677776654 4678999999999975 58999999999 Q ss_pred eeeeeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccccccccccchhHHHHHHHHH Q lcl|NC_016164. 641 KTLGAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTENFGATNPTYVELVSMES 720 (836) Q Consensus 641 ~t~~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~aa~~~t~~~l~~a~~ 720 (836) ++++++++||++||.|+.+++.++|.+.|+++++++++.+|++|+|++... .+..+++++.+++. T Consensus 199 ~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~~~~~---------------~~~~~~~~l~~~~~ 263 (408) T protein:vir:10 199 KRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVMKAAPKK---------------PTIAKFDDVITMIN 263 (408) T ss_pred eeEEeeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc---------------cccccHHHHHHHHH Confidence 999999999999999999999999999999999999999999999975321 12346788888764 Q ss_pred -HHhhhccccCccEEEecHHHHHHHHHHhhccCcccccc----CCCCeecceeeEeeC--ccccc-----eEEEEehhc- Q lcl|NC_016164. 721 -KVAADNADIGAMSYLTNSTLYGGFKTTEKATSTAQFVL----EPGGTVNGYNVVRSN--QVANG-----DVFFGVWNQ- 787 (836) Q Consensus 721 -~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~~g~~~~~~----~~~~~l~G~pVv~s~--~~~~~-----~i~~gD~s~- 787 (836) .+...+ ..+++|+|||.+|..|+.++|++|+|+|.+ +.+++|+|+||++++ .+|.. .++||||+. T Consensus 264 ~~~~~~~--~~~a~~v~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~i~~gd~~~~ 341 (408) T protein:vir:10 264 TAVDPAI--IATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQA 341 (408) T ss_pred Hhhhhhh--ccCCEEEEcHHHHHHHHHhhccCCceEeccCcCCCCCceecceeeEEecccccCccCCCceEEEEEehhcc Confidence 565554 456899999999999999999999998743 235689999999865 34542 389999997 Q ss_pred eEEEeecceEEEEecc--cccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 788 MIMGMWGALDIQVNPY--ALDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 788 ~~i~~~~~l~i~~~~~--~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) |.+++++++++.++++ ..|.+|++.||++.|+|+++++|+||++++.+= T Consensus 342 ~~~~~~~~~~v~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~~~~~~ 392 (408) T protein:vir:10 342 ITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFSA 392 (408) T ss_pred EEEEEecceEEEEcccccchhhcCceEEEEEEeeccEEeccccEEEEEeec Confidence 6789999999999886 458999999999999999999999999999443 No 66 >protein:vir:1383 Length: 421 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612835;genbank:gi:20065969;genbank:GeneID:935826 Probab=100.00 E-value=1.4e-48 Score=282.86 Aligned_cols=370 Identities=12% Similarity=0.052 Sum_probs=230.4 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhh-hHHHHHHHHHHHhhhhhhhHHHHHHHHhhh Q lcl|NC_016164. 406 NHMDSSTIDMEAVRAQAAADERSRVASITSLCREHKADDLAQGLIESGA-SEADAMRSVLSEIAKRPAAQPATPAAPVRS 484 (836) Q Consensus 406 ~~~~~~~~~~e~~~~~~~~~~~~~~~ei~al~~~~~l~e~a~eliee~~-t~~e~~~~~l~~l~~~~~a~~~~~~~~~~~ 484 (836) +..+....+........... ..++ .+.++...++.. ...+.....++.+..+.............. T Consensus 1 Mn~~e~lkel~~~~~el~~~-------~~~~------~~~~~~~~~e~~~~e~~~~~~e~~~l~~~i~~~~~~~~~~~~~ 67 (421) T protein:vir:13 1 MNLFERLKELRAKKKELEEK-------RCGI------VEEIRSLAKEKKEEEARSKALEREKIEARMEIIEEEIESVMTA 67 (421) T ss_pred CCHHHHHHHHHHHHHHHHHH-------HHHH------HHHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11111111111111111000 0000 011111111110 001111111111111111111110000000 Q ss_pred hhhHHHHhhhhhhhhhhhHHHHHHHhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhcc Q lcl|NC_016164. 485 AQPIAAGGGSADIGLTDKEARSFSFVRAIRAQMMPGDRAAFEAAAFEREVSEATAQRMGVTPRGILAPNDVLHRDLVVDT 564 (836) Q Consensus 485 ~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~ 564 (836) . ....+... .............. ........++.+ ...+.. .....++ .. T Consensus 68 ~------------~~~~~~~~-----~~~~~~~~~~~~~~----~~~~~~~~~~~~----~~~~~~--~~~~~ra---~~ 117 (421) T protein:vir:13 68 I------------DEERKNTN-----FTGGRVIINGDSKE----EKRSLQLSAMSK----TIRGIQ--LSEEERD---IM 117 (421) T ss_pred H------------HHHHhhhc-----ccccccccccchhH----HHHHHHHHHHHH----hhhccc--hhHHHhh---cc Confidence 0 00000000 00000000000000 000011111111 111111 1111222 22 Q ss_pred cccccccccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEecCC--ceeeeeccCcccccccccceeEEeeeee Q lcl|NC_016164. 565 ASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQTGA--ATAYWVAEGGDPTESQPSVDQVALVAKT 642 (836) Q Consensus 565 ~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~~~--~~a~~v~Eg~~~~~~~~~~~~it~~~~t 642 (836) +.+.|++++|+.+...|++.+++.+++++++ ++.++.++.+.+|+.... ..+.|++|+++++.++++|+++++.+++ T Consensus 118 t~~~gg~liP~~~~~~Ii~~~~~~~~l~~l~-~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~s~~~f~~i~~~~~k 196 (421) T protein:vir:13 118 SSTNNGAVIPQEFVNEFEKLKEGYPSLKEHC-HVIPVNRNAGKMPVRAGASVDKLANLAKDTELVKAMLKTQPMAYDIDD 196 (421) T ss_pred ccCCcceecchhhHHHHHHHHHhhhhhhhhc-eeeeccCCceEEEEeecCCccceeeccccccccccccceeEEEeeeee Confidence 3455778899999999999999999999985 556666666777766654 4467799999999999999999999999 Q ss_pred eeeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccccccccccchhHHHHHHHHHHH Q lcl|NC_016164. 643 LGAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTENFGATNPTYVELVSMESKV 722 (836) Q Consensus 643 ~~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~aa~~~t~~~l~~a~~~l 722 (836) ++++++||+++|.|+.++++++|.++|++++.++++..+++ .|+|+++.++ ..++++|.+++.++ T Consensus 197 ~~~~v~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~------~~~g~~~~~~---------~~~~d~i~~~~~~l 261 (421) T protein:vir:13 197 YGLLAPIDNSLLEDSEINFLEFVNEEFAEFAVNTENAEIVK------QAKAVLAEET---------INDYAGLVKTINSL 261 (421) T ss_pred eEeehhhhHHHHhhhHHHHHHHHHHHHHHHHHHHhhhhHhh------hhhhcccccc---------ccchHHHHHHHHHh Confidence 99999999999999999999999999999999999988763 5677765432 24689999999999 Q ss_pred hhhccccCccEEEecHHHHHHHHHHhhccCcccccc---CCCCeecceeeEeeCccccc-----eEEEEehhc-eEEEee Q lcl|NC_016164. 723 AADNADIGAMSYLTNSTLYGGFKTTEKATSTAQFVL---EPGGTVNGYNVVRSNQVANG-----DVFFGVWNQ-MIMGMW 793 (836) Q Consensus 723 ~~~~~~~~~~~~vmnp~~~~~L~~lkd~~g~~~~~~---~~~~~l~G~pVv~s~~~~~~-----~i~~gD~s~-~~i~~~ 793 (836) ..++. .+++|+|||.+|..|++++|++|+|+|.. +.+++|+|+||++++++|.+ .++||||++ |.++++ T Consensus 262 ~~~~~--~~a~~v~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~tl~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~ 339 (421) T protein:vir:13 262 VPNAR--KRAIIVTNSDGRAYLDGLMDKQGRPLLKELSDGGDLVFKGRPVIELEESIFDVGDETKFIVSDFKTLIKFMDR 339 (421) T ss_pred hhhhc--CCCEEEEcHHHHHHHHHhhcCCCceeecCcCCCCCceecceeeEEeccccccCCCceEEEEEeccccEEEEEe Confidence 87764 56899999999999999999999998743 33568999999999998754 379999997 778999 Q ss_pred cceEEEEecccccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 794 GALDIQVNPYALDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 794 ~~l~i~~~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) +++++.++++.+|.+|++.||++.|+|+++++++||+.++.+= T Consensus 340 ~~~~v~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~ 382 (421) T protein:vir:13 340 KQYLIDQSKEAGYTKNETIARIIERFDVNSPLDKSSDAEKIRK 382 (421) T ss_pred cceEEEeecccccccCeeEEEEEeeecceeecchhhheeeecc Confidence 9999999999999999999999999999999999976654332 No 67 >protein:vir:1638 Length: 298 # NCBI annotation: Structural protein # Family: family:all:966 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695059;genbank:gi:23455750;genbank:GeneID:955469 Probab=100.00 E-value=1e-49 Score=289.18 Aligned_cols=269 Identities=15% Similarity=0.123 Sum_probs=222.3 Q ss_pred cccccccccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEecCCceeeeeccCcccccccccceeEEeeeeeee Q lcl|NC_016164. 565 ASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQTGAATAYWVAEGGDPTESQPSVDQVALVAKTLG 644 (836) Q Consensus 565 ~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~it~~~~t~~ 644 (836) -...|+.++|+.+...|++.+++.+++++++. +++..++.+++|+.++.+.++|++|++++++++++|+++++++++++ T Consensus 1 ma~~gG~lvp~~~~~~ii~~~~~~s~i~~l~~-~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~f~~v~l~~~k~a 79 (298) T protein:vir:16 1 MVLNKGTLFDPTLVTDLISKVAGKSSIARLSA-QKPIPFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQTMVPIKVE 79 (298) T ss_pred CcccCcceechhHHHHHHHHHHhhhhhhhhcc-eeeccCCceEEEEEecCcceEEecCCccccccccceeEEEEeeeeEE Confidence 22345567788889999999999999999965 55666778999999999999999999999999999999999999999 Q ss_pred eeehhHHHHHh---cchhHHHHHHHHHHHHHHHHHHHHHHHhhcC----Ccccccccccccccccc---cccccchhHHH Q lcl|NC_016164. 645 AYTEFSRRLML---QSSIDVEQMVRTELATVIALEIDRAALYGLG----SNSQPEGLKFVTGINTE---NFGATNPTYVE 714 (836) Q Consensus 645 ~~i~ISrelL~---ds~~~l~~~i~~~l~~a~a~~~d~~il~G~G----t~~~p~Gi~~~~~~~~~---t~aa~~~t~~~ 714 (836) ++++||+|+|. ++..+++++|.++|+++++++++.++++|.+ +...+.|+....+.... ........+++ T Consensus 80 ~~~~iS~ell~~s~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (298) T protein:vir:16 80 YGARISDEFMYASDEEKINILQEFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPNGA 159 (298) T ss_pred EeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCCcccccccccccccccccccccccccccHHHH Confidence 99999999995 4567899999999999999999999999953 33334443333222111 12222334778 Q ss_pred HHHHHHHHhhhccccCccEEEecHHHHHHHHHHhhccCcccccc----CCCCeecceeeEeeCccccc------eEEEEe Q lcl|NC_016164. 715 LVSMESKVAADNADIGAMSYLTNSTLYGGFKTTEKATSTAQFVL----EPGGTVNGYNVVRSNQVANG------DVFFGV 784 (836) Q Consensus 715 l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~~g~~~~~~----~~~~~l~G~pVv~s~~~~~~------~i~~gD 784 (836) +.+++.++..++. .+.+|+|||+++..|++++|.+|+|+|.. +.+++|+|+||++++.+|.+ .+++|| T Consensus 160 i~~~~~~~~~~~~--~~~~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~l~G~PV~~~~~v~~~~~~~~~~~~~GD 237 (298) T protein:vir:16 160 IENAVELLTGVDA--DVTGIAINPSFRSALAKQKDLQDNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQRDRAIIGD 237 (298) T ss_pred HHHHHHHhhhcCC--CccEEEEcHHHHHHHHHhhccCCCeeecCcccCCCCceecceeeEEecccccccCCCccEEEEee Confidence 9999998887654 45689999999999999999999998754 34579999999999999853 588999 Q ss_pred hhce-EEEeecceEEEEeccc--------ccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 785 WNQM-IMGMWGALDIQVNPYA--------LDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 785 ~s~~-~i~~~~~l~i~~~~~~--------~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) |+++ .++.+.++++.++++. +|++|++.||+++|+|+++++|+||++++.|= T Consensus 238 fs~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~ra~~r~d~~v~~~~a~~~l~~at 298 (298) T protein:vir:16 238 FANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) T ss_pred ccceEEEEEecCceEEEeeccCCcCcchhhhhcCcEEEEEEEEEccEeecccceEEEeecC Confidence 9975 5788999999887652 48999999999999999999999999999999 No 68 >protein:vir:3991 Length: 404 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116499;genbank:gi:14251132;genbank:GeneID:921252 Probab=100.00 E-value=5.2e-48 Score=279.81 Aligned_cols=374 Identities=12% Similarity=0.134 Sum_probs=229.2 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhHHHHHHHHhhhhhhHHHHh Q lcl|NC_016164. 413 IDMEAVRAQAAADERSRVASITSLCREHKADDLAQGLIESGASEADAMRSVLSEIAKRPAAQPATPAAPVRSAQPIAAGG 492 (836) Q Consensus 413 ~~~e~~~~~~~~~~~~~~~ei~al~~~~~l~e~a~eliee~~t~~e~~~~~l~~l~~~~~a~~~~~~~~~~~~~~~~~~~ 492 (836) +...+..............++.++ .+...+...+.....+...+....+... .............. T Consensus 1 ~~~~m~l~el~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~ee~~~~~~~~~~~----~~~~~~~~~~~~~~---- 66 (404) T protein:vir:39 1 MGVKLTVNQLNEAWIASGDKVTDF------NDQINMALNDDNFSAEAMSELKNKRDNE----KVRRDALREQLVEA---- 66 (404) T ss_pred CChHHHHHHHHHHHHHHHHHHHHH------HHHHHHHhccccccHHHHHHHHHHHHHH----HHHHHHHHHHHHHH---- Confidence 111111111111111111111111 1111111111111111111111111110 00000000000000 Q ss_pred hhhhhhhhhhHHHHHHHhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhcccccccccc Q lcl|NC_016164. 493 GSADIGLTDKEARSFSFVRAIRAQMMPGDRAAFEAAAFEREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTASAAGDLV 572 (836) Q Consensus 493 ~~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~~v 572 (836) ....... ........... ..........+++...... +....... .......++++.|+++ T Consensus 67 ----------~~~~~~~---~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~---~~~~~~~~-e~~a~~~~t~~~gg~~ 127 (404) T protein:vir:39 67 ----------QAEQVVN---MREEEKGPLNK--SEYELKDKFVKEFVNMVRN---PMAFLNTV-SSKTETSGSDSAAGLT 127 (404) T ss_pred ----------HHHHHhc---ccccccccccc--chhhhHHHHHHHHHHHHhc---chhhhhhh-hhhhhhcccccCCcee Confidence 0000000 00000000000 0000011111222211110 11111122 2233344556677889 Q ss_pred cchhhHHHHHHHHHhhhhhhhhccee-eecCCceEEEEEecC-CceeeeeccCccccc-ccccceeEEeeeeeeeeeehh Q lcl|NC_016164. 573 FTDGRPGSFIELLRNRLALNTLGVTM-LTGLQGPVAIPRQTG-AATAYWVAEGGDPTE-SQPSVDQVALVAKTLGAYTEF 649 (836) Q Consensus 573 vp~~~~~~ii~~l~~~~~l~~l~~~~-~~~~~~~~~~p~~~~-~~~a~~v~Eg~~~~~-~~~~~~~it~~~~t~~~~i~I 649 (836) +|+.+...|++.+++.++|++++... ++...+.+.+++..+ .+.+.|++|++++++ +.++|+++++++++++++++| T Consensus 128 iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~i~~~~~k~~~~~~i 207 (404) T protein:vir:39 128 IPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPRLTIIKYLIKRYAGIITA 207 (404) T ss_pred ccHHHHHHHHHHHHhhhhHHhhcceeeccCCcceEEEEeecCCccceeeecCccccccccccceeeEEeeeeeEEeeehh Confidence 99999999999999999999986443 223345666666654 467899999999997 579999999999999999999 Q ss_pred HHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccccccccccchhHHHHHHHHH-HHhhhccc Q lcl|NC_016164. 650 SRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTENFGATNPTYVELVSMES-KVAADNAD 728 (836) Q Consensus 650 SrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~aa~~~t~~~l~~a~~-~l~~~~~~ 728 (836) |+++|.|+.++++++|.+.|+++++++++.+|++|+|++. +. ++..+++++.+++. .+...+ T Consensus 208 S~ell~ds~~~l~~~i~~~l~~~~~~~~d~~il~g~g~~~-~~--------------~~~~~~~~i~~~~~~~~~~~~-- 270 (404) T protein:vir:39 208 TNTLLKDTAENILAWLSSWIAKKVVVTRNQAIIAAMGTVP-KK--------------PTIAKFDDVITMINTSVDPAI-- 270 (404) T ss_pred HHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHhcccccc-cc--------------cccccHHHHHHHHHHhhhhhh-- Confidence 9999999999999999999999999999999999999753 21 12235778888766 444443 Q ss_pred cCccEEEecHHHHHHHHHHhhccCcccccc----CCCCeecceeeEeeCc--ccc-----ceEEEEehhc-eEEEeecce Q lcl|NC_016164. 729 IGAMSYLTNSTLYGGFKTTEKATSTAQFVL----EPGGTVNGYNVVRSNQ--VAN-----GDVFFGVWNQ-MIMGMWGAL 796 (836) Q Consensus 729 ~~~~~~vmnp~~~~~L~~lkd~~g~~~~~~----~~~~~l~G~pVv~s~~--~~~-----~~i~~gD~s~-~~i~~~~~l 796 (836) ..+++|+|||.+|..|+.++|++|+|+|.. +.+++|+|+||++++. +|. ..++||||++ |.+++++++ T Consensus 271 ~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~ 350 (404) T protein:vir:39 271 IATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYLIKGKKVIVVADRWLPNSGSTVYPLYYGDMSQAITLFDRENM 350 (404) T ss_pred ccCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCcceecceeEEEecccccCccCCCccEEEEEeccccEEEEeecce Confidence 356799999999999999999999998743 3456899999998754 342 2489999997 678999999 Q ss_pred EEEEeccc--ccccCcEEEEEEEEeccEEEcccceEEEeec-C Q lcl|NC_016164. 797 DIQVNPYA--LDKSGSVRVTALQDVDVAVRHPEAFCRGNDN-L 836 (836) Q Consensus 797 ~i~~~~~~--~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A-~ 836 (836) ++.++++. .|.+|++.||++.|+|+++++|+||++++.. + T Consensus 351 ~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~ 393 (404) T protein:vir:39 351 SLLPTNIGAGAFETDTTKIRVIDRFDVKTTDSEALVAGSFTAI 393 (404) T ss_pred EEEEeccchhhhhhceeeEEEEeeeccEEecccceEEEEeecc Confidence 99999876 6899999999999999999999999999944 4 No 69 >protein:vir:107593 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338188;genbank:gi:77020144;genbank:GeneID:3703724 Probab=100.00 E-value=3.3e-48 Score=280.90 Aligned_cols=358 Identities=15% Similarity=0.122 Sum_probs=226.2 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhh-hHHHHHHHHHHHhhhhhhhHHHHHHHHhhhhhhHHHHhhhhhhhhhhhHH Q lcl|NC_016164. 426 ERSRVASITSLCREHKADDLAQGLIESGA-SEADAMRSVLSEIAKRPAAQPATPAAPVRSAQPIAAGGGSADIGLTDKEA 504 (836) Q Consensus 426 ~~~~~~ei~al~~~~~l~e~a~eliee~~-t~~e~~~~~l~~l~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 504 (836) ......++++.. ..+.+....+.++.. ...+.....++.+.++.......... .. T Consensus 1 M~k~l~el~~~~--~~~~~e~~~~~~~~~~~e~~~~~~e~~~l~~~i~~~~~~~~~----------------------~~ 56 (392) T protein:vir:10 1 MSKELRELLAKL--EGKKEEVRSLMGEDKVAEAEQMMEEVRSLQKKIDLQRSLDEA----------------------ET 56 (392) T ss_pred CcHHHHHHHHHH--HHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----------------------HH Confidence 111111111110 000011111111111 11111111111111111100000000 00 Q ss_pred HHHHHhhhhhhh-hhhhhhhhhhhhhhhHHHHHHHHHHh---hhhhhhhhhhhhhhhhhhhhcccccccccccchhhHHH Q lcl|NC_016164. 505 RSFSFVRAIRAQ-MMPGDRAAFEAAAFEREVSEATAQRM---GVTPRGILAPNDVLHRDLVVDTASAAGDLVFTDGRPGS 580 (836) Q Consensus 505 ~~~~~~~a~~a~-~~~~~~~~~~~~~~~~~~a~~~~~~~---g~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ 580 (836) . ........ ...... ..+....+.+.. .....................++.+.|++++|+.+... T Consensus 57 ~---~~~~~~~~~~~~~~~--------~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ 125 (392) T protein:vir:10 57 E---ERNNGREVETRNVDG--------EMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQ 125 (392) T ss_pred H---HhhccccccccCccc--------hHHHHHHHHHHHhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHH Confidence 0 00000000 000000 000011111100 00000000001111223334445567888889999999 Q ss_pred HHHHHHhhhhhhhhccee-eecCCceEEEEEecCCceeeeeccCcccccc-cccceeEEeeeeeeeeeehhHHHHHhcch Q lcl|NC_016164. 581 FIELLRNRLALNTLGVTM-LTGLQGPVAIPRQTGAATAYWVAEGGDPTES-QPSVDQVALVAKTLGAYTEFSRRLMLQSS 658 (836) Q Consensus 581 ii~~l~~~~~l~~l~~~~-~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~-~~~~~~it~~~~t~~~~i~ISrelL~ds~ 658 (836) |++.+++.++|++++... +++..+.+.+++.++.+.+.|++|+++++++ .++|+++++.+++++++++||+++|.|+. T Consensus 126 ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~ 205 (392) T protein:vir:10 126 INELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSD 205 (392) T ss_pred HHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCccceeecccccccccccccceeEEeeeeeEEEeehhhHHHHhhhH Confidence 999999999999986543 3344567888888888999999999999975 68999999999999999999999999999 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccccccccccchhHHHHHHHHH-HHhhhccccCccEEEec Q lcl|NC_016164. 659 IDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTENFGATNPTYVELVSMES-KVAADNADIGAMSYLTN 737 (836) Q Consensus 659 ~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~aa~~~t~~~l~~a~~-~l~~~~~~~~~~~~vmn 737 (836) +++.++|.+.|+++++++++.+|++|+|++.. .+..+++++.+++. .+...+ ..+++|+|| T Consensus 206 ~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~----------------~~~~~~d~i~~~~~~~l~~~~--~~~a~~vm~ 267 (392) T protein:vir:10 206 QNILKYVTKWLGKKSKVTRNVLILGVIEKLTK----------------QAIKSLDDIKDVLNVKLDPAI--SPNAILLTN 267 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhccccccc----------------cCccCHHHHHHHHHHhhhhhh--ccCCEEEEc Confidence 99999999999999999999999999986431 12346788888774 555544 467899999 Q ss_pred HHHHHHHHHHhhccCcccccc----CCCCeecceeeEe-eCcc-c--------cceEEEEehhc-eEEEeecceEEEEec Q lcl|NC_016164. 738 STLYGGFKTTEKATSTAQFVL----EPGGTVNGYNVVR-SNQV-A--------NGDVFFGVWNQ-MIMGMWGALDIQVNP 802 (836) Q Consensus 738 p~~~~~L~~lkd~~g~~~~~~----~~~~~l~G~pVv~-s~~~-~--------~~~i~~gD~s~-~~i~~~~~l~i~~~~ 802 (836) |.+|..|+++||++|+|+|.. +.+++|+|+|+++ ++.+ + ...++||||+. |.++++.++++.+++ T Consensus 268 ~~~~~~L~~lkd~~G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~ 347 (392) T protein:vir:10 268 QDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTD 347 (392) T ss_pred HHHHHHHHHhhccCCCeEeecCccCCccccccCcccEEEecccccCCCcccCCceEEEEEehhceEEEEeecceEEEEec Confidence 999999999999999998743 3456899987655 3222 1 22379999997 678999999999998 Q ss_pred cc--ccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 803 YA--LDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 803 ~~--~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) +. +|.+|++.||++.|+|+++++|+||++++.+. T Consensus 348 ~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~ 383 (392) T protein:vir:10 348 VGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDL 383 (392) T ss_pred cccchhhcCceEEEEEEeeccEEecccceEEEEecc Confidence 64 68999999999999999999999999998766 No 70 >protein:vir:105004 Length: 392 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459969;genbank:gi:85701384;genbank:GeneID:3882145 Probab=100.00 E-value=3.3e-48 Score=280.90 Aligned_cols=358 Identities=15% Similarity=0.122 Sum_probs=226.2 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhh-hHHHHHHHHHHHhhhhhhhHHHHHHHHhhhhhhHHHHhhhhhhhhhhhHH Q lcl|NC_016164. 426 ERSRVASITSLCREHKADDLAQGLIESGA-SEADAMRSVLSEIAKRPAAQPATPAAPVRSAQPIAAGGGSADIGLTDKEA 504 (836) Q Consensus 426 ~~~~~~ei~al~~~~~l~e~a~eliee~~-t~~e~~~~~l~~l~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 504 (836) ......++++.. ..+.+....+.++.. ...+.....++.+.++.......... .. T Consensus 1 M~k~l~el~~~~--~~~~~e~~~~~~~~~~~e~~~~~~e~~~l~~~i~~~~~~~~~----------------------~~ 56 (392) T protein:vir:10 1 MSKELRELLAKL--EGKKEEVRSLMGEDKVAEAEQMMEEVRSLQKKIDLQRSLDEA----------------------ET 56 (392) T ss_pred CcHHHHHHHHHH--HHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----------------------HH Confidence 111111111110 000011111111111 11111111111111111100000000 00 Q ss_pred HHHHHhhhhhhh-hhhhhhhhhhhhhhhHHHHHHHHHHh---hhhhhhhhhhhhhhhhhhhhcccccccccccchhhHHH Q lcl|NC_016164. 505 RSFSFVRAIRAQ-MMPGDRAAFEAAAFEREVSEATAQRM---GVTPRGILAPNDVLHRDLVVDTASAAGDLVFTDGRPGS 580 (836) Q Consensus 505 ~~~~~~~a~~a~-~~~~~~~~~~~~~~~~~~a~~~~~~~---g~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ 580 (836) . ........ ...... ..+....+.+.. .....................++.+.|++++|+.+... T Consensus 57 ~---~~~~~~~~~~~~~~~--------~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ 125 (392) T protein:vir:10 57 E---ERNNGREVETRNVDG--------EMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQ 125 (392) T ss_pred H---HhhccccccccCccc--------hHHHHHHHHHHHhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHH Confidence 0 00000000 000000 000011111100 00000000001111223334445567888889999999 Q ss_pred HHHHHHhhhhhhhhccee-eecCCceEEEEEecCCceeeeeccCcccccc-cccceeEEeeeeeeeeeehhHHHHHhcch Q lcl|NC_016164. 581 FIELLRNRLALNTLGVTM-LTGLQGPVAIPRQTGAATAYWVAEGGDPTES-QPSVDQVALVAKTLGAYTEFSRRLMLQSS 658 (836) Q Consensus 581 ii~~l~~~~~l~~l~~~~-~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~-~~~~~~it~~~~t~~~~i~ISrelL~ds~ 658 (836) |++.+++.++|++++... +++..+.+.+++.++.+.+.|++|+++++++ .++|+++++.+++++++++||+++|.|+. T Consensus 126 ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~ 205 (392) T protein:vir:10 126 INELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSD 205 (392) T ss_pred HHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCccceeecccccccccccccceeEEeeeeeEEEeehhhHHHHhhhH Confidence 999999999999986543 3344567888888888999999999999975 68999999999999999999999999999 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccccccccccchhHHHHHHHHH-HHhhhccccCccEEEec Q lcl|NC_016164. 659 IDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTENFGATNPTYVELVSMES-KVAADNADIGAMSYLTN 737 (836) Q Consensus 659 ~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~aa~~~t~~~l~~a~~-~l~~~~~~~~~~~~vmn 737 (836) +++.++|.+.|+++++++++.+|++|+|++.. .+..+++++.+++. .+...+ ..+++|+|| T Consensus 206 ~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~----------------~~~~~~d~i~~~~~~~l~~~~--~~~a~~vm~ 267 (392) T protein:vir:10 206 QNILKYVTKWLGKKSKVTRNVLILGVIEKLTK----------------QAIKSLDDIKDVLNVKLDPAI--SPNAILLTN 267 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhccccccc----------------cCccCHHHHHHHHHHhhhhhh--ccCCEEEEc Confidence 99999999999999999999999999986431 12346788888774 555544 467899999 Q ss_pred HHHHHHHHHHhhccCcccccc----CCCCeecceeeEe-eCcc-c--------cceEEEEehhc-eEEEeecceEEEEec Q lcl|NC_016164. 738 STLYGGFKTTEKATSTAQFVL----EPGGTVNGYNVVR-SNQV-A--------NGDVFFGVWNQ-MIMGMWGALDIQVNP 802 (836) Q Consensus 738 p~~~~~L~~lkd~~g~~~~~~----~~~~~l~G~pVv~-s~~~-~--------~~~i~~gD~s~-~~i~~~~~l~i~~~~ 802 (836) |.+|..|+++||++|+|+|.. +.+++|+|+|+++ ++.+ + ...++||||+. |.++++.++++.+++ T Consensus 268 ~~~~~~L~~lkd~~G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~ 347 (392) T protein:vir:10 268 QDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTD 347 (392) T ss_pred HHHHHHHHHhhccCCCeEeecCccCCccccccCcccEEEecccccCCCcccCCceEEEEEehhceEEEEeecceEEEEec Confidence 999999999999999998743 3456899987655 3222 1 22379999997 678999999999998 Q ss_pred cc--ccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 803 YA--LDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 803 ~~--~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) +. +|.+|++.||++.|+|+++++|+||++++.+. T Consensus 348 ~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~ 383 (392) T protein:vir:10 348 VGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDL 383 (392) T ss_pred cccchhhcCceEEEEEEeeccEEecccceEEEEecc Confidence 64 68999999999999999999999999998766 No 71 >protein:vir:102873 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338137;genbank:gi:77020198;genbank:GeneID:3703782 Probab=100.00 E-value=3.3e-48 Score=280.90 Aligned_cols=358 Identities=15% Similarity=0.122 Sum_probs=226.2 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhh-hHHHHHHHHHHHhhhhhhhHHHHHHHHhhhhhhHHHHhhhhhhhhhhhHH Q lcl|NC_016164. 426 ERSRVASITSLCREHKADDLAQGLIESGA-SEADAMRSVLSEIAKRPAAQPATPAAPVRSAQPIAAGGGSADIGLTDKEA 504 (836) Q Consensus 426 ~~~~~~ei~al~~~~~l~e~a~eliee~~-t~~e~~~~~l~~l~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 504 (836) ......++++.. ..+.+....+.++.. ...+.....++.+.++.......... .. T Consensus 1 M~k~l~el~~~~--~~~~~e~~~~~~~~~~~e~~~~~~e~~~l~~~i~~~~~~~~~----------------------~~ 56 (392) T protein:vir:10 1 MSKELRELLAKL--EGKKEEVRSLMGEDKVAEAEQMMEEVRSLQKKIDLQRSLDEA----------------------ET 56 (392) T ss_pred CcHHHHHHHHHH--HHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----------------------HH Confidence 111111111110 000011111111111 11111111111111111100000000 00 Q ss_pred HHHHHhhhhhhh-hhhhhhhhhhhhhhhHHHHHHHHHHh---hhhhhhhhhhhhhhhhhhhhcccccccccccchhhHHH Q lcl|NC_016164. 505 RSFSFVRAIRAQ-MMPGDRAAFEAAAFEREVSEATAQRM---GVTPRGILAPNDVLHRDLVVDTASAAGDLVFTDGRPGS 580 (836) Q Consensus 505 ~~~~~~~a~~a~-~~~~~~~~~~~~~~~~~~a~~~~~~~---g~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ 580 (836) . ........ ...... ..+....+.+.. .....................++.+.|++++|+.+... T Consensus 57 ~---~~~~~~~~~~~~~~~--------~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ 125 (392) T protein:vir:10 57 E---ERNNGREVETRNVDG--------EMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQ 125 (392) T ss_pred H---HhhccccccccCccc--------hHHHHHHHHHHHhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHH Confidence 0 00000000 000000 000011111100 00000000001111223334445567888889999999 Q ss_pred HHHHHHhhhhhhhhccee-eecCCceEEEEEecCCceeeeeccCcccccc-cccceeEEeeeeeeeeeehhHHHHHhcch Q lcl|NC_016164. 581 FIELLRNRLALNTLGVTM-LTGLQGPVAIPRQTGAATAYWVAEGGDPTES-QPSVDQVALVAKTLGAYTEFSRRLMLQSS 658 (836) Q Consensus 581 ii~~l~~~~~l~~l~~~~-~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~-~~~~~~it~~~~t~~~~i~ISrelL~ds~ 658 (836) |++.+++.++|++++... +++..+.+.+++.++.+.+.|++|+++++++ .++|+++++.+++++++++||+++|.|+. T Consensus 126 ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~ 205 (392) T protein:vir:10 126 INELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSD 205 (392) T ss_pred HHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCccceeecccccccccccccceeEEeeeeeEEEeehhhHHHHhhhH Confidence 999999999999986543 3344567888888888999999999999975 68999999999999999999999999999 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccccccccccchhHHHHHHHHH-HHhhhccccCccEEEec Q lcl|NC_016164. 659 IDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTENFGATNPTYVELVSMES-KVAADNADIGAMSYLTN 737 (836) Q Consensus 659 ~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~aa~~~t~~~l~~a~~-~l~~~~~~~~~~~~vmn 737 (836) +++.++|.+.|+++++++++.+|++|+|++.. .+..+++++.+++. .+...+ ..+++|+|| T Consensus 206 ~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~----------------~~~~~~d~i~~~~~~~l~~~~--~~~a~~vm~ 267 (392) T protein:vir:10 206 QNILKYVTKWLGKKSKVTRNVLILGVIEKLTK----------------QAIKSLDDIKDVLNVKLDPAI--SPNAILLTN 267 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhccccccc----------------cCccCHHHHHHHHHHhhhhhh--ccCCEEEEc Confidence 99999999999999999999999999986431 12346788888774 555544 467899999 Q ss_pred HHHHHHHHHHhhccCcccccc----CCCCeecceeeEe-eCcc-c--------cceEEEEehhc-eEEEeecceEEEEec Q lcl|NC_016164. 738 STLYGGFKTTEKATSTAQFVL----EPGGTVNGYNVVR-SNQV-A--------NGDVFFGVWNQ-MIMGMWGALDIQVNP 802 (836) Q Consensus 738 p~~~~~L~~lkd~~g~~~~~~----~~~~~l~G~pVv~-s~~~-~--------~~~i~~gD~s~-~~i~~~~~l~i~~~~ 802 (836) |.+|..|+++||++|+|+|.. +.+++|+|+|+++ ++.+ + ...++||||+. |.++++.++++.+++ T Consensus 268 ~~~~~~L~~lkd~~G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~ 347 (392) T protein:vir:10 268 QDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTD 347 (392) T ss_pred HHHHHHHHHhhccCCCeEeecCccCCccccccCcccEEEecccccCCCcccCCceEEEEEehhceEEEEeecceEEEEec Confidence 999999999999999998743 3456899987655 3222 1 22379999997 678999999999998 Q ss_pred cc--ccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 803 YA--LDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 803 ~~--~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) +. +|.+|++.||++.|+|+++++|+||++++.+. T Consensus 348 ~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~ 383 (392) T protein:vir:10 348 VGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDL 383 (392) T ss_pred cccchhhcCceEEEEEEeeccEEecccceEEEEecc Confidence 64 68999999999999999999999999998766 No 72 >protein:vir:102082 Length: 392 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512315;genbank:gi:89152484;genbank:GeneID:3953075 Probab=100.00 E-value=3.3e-48 Score=280.90 Aligned_cols=358 Identities=15% Similarity=0.122 Sum_probs=226.2 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhh-hHHHHHHHHHHHhhhhhhhHHHHHHHHhhhhhhHHHHhhhhhhhhhhhHH Q lcl|NC_016164. 426 ERSRVASITSLCREHKADDLAQGLIESGA-SEADAMRSVLSEIAKRPAAQPATPAAPVRSAQPIAAGGGSADIGLTDKEA 504 (836) Q Consensus 426 ~~~~~~ei~al~~~~~l~e~a~eliee~~-t~~e~~~~~l~~l~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 504 (836) ......++++.. ..+.+....+.++.. ...+.....++.+.++.......... .. T Consensus 1 M~k~l~el~~~~--~~~~~e~~~~~~~~~~~e~~~~~~e~~~l~~~i~~~~~~~~~----------------------~~ 56 (392) T protein:vir:10 1 MSKELRELLAKL--EGKKEEVRSLMGEDKVAEAEQMMEEVRSLQKKIDLQRSLDEA----------------------ET 56 (392) T ss_pred CcHHHHHHHHHH--HHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----------------------HH Confidence 111111111110 000011111111111 11111111111111111100000000 00 Q ss_pred HHHHHhhhhhhh-hhhhhhhhhhhhhhhHHHHHHHHHHh---hhhhhhhhhhhhhhhhhhhhcccccccccccchhhHHH Q lcl|NC_016164. 505 RSFSFVRAIRAQ-MMPGDRAAFEAAAFEREVSEATAQRM---GVTPRGILAPNDVLHRDLVVDTASAAGDLVFTDGRPGS 580 (836) Q Consensus 505 ~~~~~~~a~~a~-~~~~~~~~~~~~~~~~~~a~~~~~~~---g~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ 580 (836) . ........ ...... ..+....+.+.. .....................++.+.|++++|+.+... T Consensus 57 ~---~~~~~~~~~~~~~~~--------~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ 125 (392) T protein:vir:10 57 E---ERNNGREVETRNVDG--------EMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQ 125 (392) T ss_pred H---HhhccccccccCccc--------hHHHHHHHHHHHhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHH Confidence 0 00000000 000000 000011111100 00000000001111223334445567888889999999 Q ss_pred HHHHHHhhhhhhhhccee-eecCCceEEEEEecCCceeeeeccCcccccc-cccceeEEeeeeeeeeeehhHHHHHhcch Q lcl|NC_016164. 581 FIELLRNRLALNTLGVTM-LTGLQGPVAIPRQTGAATAYWVAEGGDPTES-QPSVDQVALVAKTLGAYTEFSRRLMLQSS 658 (836) Q Consensus 581 ii~~l~~~~~l~~l~~~~-~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~-~~~~~~it~~~~t~~~~i~ISrelL~ds~ 658 (836) |++.+++.++|++++... +++..+.+.+++.++.+.+.|++|+++++++ .++|+++++.+++++++++||+++|.|+. T Consensus 126 ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~ 205 (392) T protein:vir:10 126 INELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSD 205 (392) T ss_pred HHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCccceeecccccccccccccceeEEeeeeeEEEeehhhHHHHhhhH Confidence 999999999999986543 3344567888888888999999999999975 68999999999999999999999999999 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccccccccccchhHHHHHHHHH-HHhhhccccCccEEEec Q lcl|NC_016164. 659 IDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTENFGATNPTYVELVSMES-KVAADNADIGAMSYLTN 737 (836) Q Consensus 659 ~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~aa~~~t~~~l~~a~~-~l~~~~~~~~~~~~vmn 737 (836) +++.++|.+.|+++++++++.+|++|+|++.. .+..+++++.+++. .+...+ ..+++|+|| T Consensus 206 ~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~----------------~~~~~~d~i~~~~~~~l~~~~--~~~a~~vm~ 267 (392) T protein:vir:10 206 QNILKYVTKWLGKKSKVTRNVLILGVIEKLTK----------------QAIKSLDDIKDVLNVKLDPAI--SPNAILLTN 267 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhccccccc----------------cCccCHHHHHHHHHHhhhhhh--ccCCEEEEc Confidence 99999999999999999999999999986431 12346788888774 555544 467899999 Q ss_pred HHHHHHHHHHhhccCcccccc----CCCCeecceeeEe-eCcc-c--------cceEEEEehhc-eEEEeecceEEEEec Q lcl|NC_016164. 738 STLYGGFKTTEKATSTAQFVL----EPGGTVNGYNVVR-SNQV-A--------NGDVFFGVWNQ-MIMGMWGALDIQVNP 802 (836) Q Consensus 738 p~~~~~L~~lkd~~g~~~~~~----~~~~~l~G~pVv~-s~~~-~--------~~~i~~gD~s~-~~i~~~~~l~i~~~~ 802 (836) |.+|..|+++||++|+|+|.. +.+++|+|+|+++ ++.+ + ...++||||+. |.++++.++++.+++ T Consensus 268 ~~~~~~L~~lkd~~G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~ 347 (392) T protein:vir:10 268 QDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTD 347 (392) T ss_pred HHHHHHHHHhhccCCCeEeecCccCCccccccCcccEEEecccccCCCcccCCceEEEEEehhceEEEEeecceEEEEec Confidence 999999999999999998743 3456899987655 3222 1 22379999997 678999999999998 Q ss_pred cc--ccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 803 YA--LDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 803 ~~--~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) +. +|.+|++.||++.|+|+++++|+||++++.+. T Consensus 348 ~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~ 383 (392) T protein:vir:10 348 VGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDL 383 (392) T ss_pred cccchhhcCceEEEEEEeeccEEecccceEEEEecc Confidence 64 68999999999999999999999999998766 No 73 >protein:vir:7409 Length: 408 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839926;genbank:gi:30089896;genbank:GeneID:1260683 Probab=100.00 E-value=4.9e-48 Score=279.94 Aligned_cols=371 Identities=12% Similarity=0.128 Sum_probs=229.6 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHH---HHHHhhhhhhhHHHHHHHHh Q lcl|NC_016164. 406 NHMDSSTIDMEAVRAQAAADERSRVASITSLCREHKADDLAQGLIESGASEADAMRS---VLSEIAKRPAAQPATPAAPV 482 (836) Q Consensus 406 ~~~~~~~~~~e~~~~~~~~~~~~~~~ei~al~~~~~l~e~a~eliee~~t~~e~~~~---~l~~l~~~~~a~~~~~~~~~ 482 (836) +...+...++................++.++ .++.....+...+ .++.+..+............ T Consensus 1 m~~~m~i~el~~~~~~~~~~~~~~~~e~~~~-------------~~~~~~~~e~i~e~~~~~~~~~~~~~~~~~~~~~~~ 67 (408) T protein:vir:74 1 MGVKLTVNQLNEAWIASGDKVTDFNDQINMA-------------LNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQ 67 (408) T ss_pred CChhhhHHHHHHHHHHHHHHHHHHHHHHHHH-------------HhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2222222221111111111111111111110 0000000000111 11111111110000000000 Q ss_pred hhhhhHHHHhhhhhhhhhhhHHHHHHHhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhh Q lcl|NC_016164. 483 RSAQPIAAGGGSADIGLTDKEARSFSFVRAIRAQMMPGDRAAFEAAAFEREVSEATAQRMGVTPRGILAPNDVLHRDLVV 562 (836) Q Consensus 483 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~ 562 (836) . .......... ..+.... .........+++... .+............... T Consensus 68 ~---------------------~~~~~~~~~~--~~~~~~~---~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~a~~ 117 (408) T protein:vir:74 68 A---------------------EQVVNMREEE--KGPLNKS---ENELKDKFVKDFVNM----VRNPMAFLNTVSSKTET 117 (408) T ss_pred H---------------------HHHhhccccc--cccccch---hhhhHHHHHHHHHHH----Hhcchhhhhhhhhhhhc Confidence 0 0000000000 0000000 000001111111111 11111111122222334 Q ss_pred cccccccccccchhhHHHHHHHHHhhhhhhhhccee-eecCCceEEEEEecC-CceeeeeccCccccc-ccccceeEEee Q lcl|NC_016164. 563 DTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTM-LTGLQGPVAIPRQTG-AATAYWVAEGGDPTE-SQPSVDQVALV 639 (836) Q Consensus 563 ~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~-~~~~~~~~~~p~~~~-~~~a~~v~Eg~~~~~-~~~~~~~it~~ 639 (836) .+....|++++|+.+...|++.+++.++|++++... ++...+.+.+++..+ ++.+.|++|++++++ ++++|++++++ T Consensus 118 ~~~~~~gg~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~i~~~ 197 (408) T protein:vir:74 118 SGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSSGSRVYEKWTDVTPLKAMDEEDGKIPDLDNPRLTIIKYL 197 (408) T ss_pred ccccCCCceeechhHhhHHHHHHhhhcchhhhcceeeccCCcceEEEEeecCCcccccccccccccccccccceeeEEee Confidence 445566788899999999999999999999986543 234456777777665 456789999999997 56999999999 Q ss_pred eeeeeeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccccccccccchhHHHHHHHH Q lcl|NC_016164. 640 AKTLGAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTENFGATNPTYVELVSME 719 (836) Q Consensus 640 ~~t~~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~aa~~~t~~~l~~a~ 719 (836) +++++++++||+++|.|+.++++++|.+.|+++++++++.+|++|+|++... ++..+++++..++ T Consensus 198 ~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~il~G~G~~~~~---------------~~~~~~~~i~~~~ 262 (408) T protein:vir:74 198 IKRYAGIITATNTLLKDTAENILAWLSSWIAKKVVVTRNQAIIAAMGTVPKK---------------PTIANFDDVITMI 262 (408) T ss_pred eeeEEeeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc---------------cccccHHHHHHHH Confidence 9999999999999999999999999999999999999999999999975422 1234678888876 Q ss_pred H-HHhhhccccCccEEEecHHHHHHHHHHhhccCcccccc----CCCCeecceeeEeeCc--ccc-----ceEEEEehhc Q lcl|NC_016164. 720 S-KVAADNADIGAMSYLTNSTLYGGFKTTEKATSTAQFVL----EPGGTVNGYNVVRSNQ--VAN-----GDVFFGVWNQ 787 (836) Q Consensus 720 ~-~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~~g~~~~~~----~~~~~l~G~pVv~s~~--~~~-----~~i~~gD~s~ 787 (836) . .+...+ ..+++|+|||.+|..|++++|++|+|+|.. +.+++|+|+||+++++ +|. ..++||||+. T Consensus 263 ~~~l~~~~--~~~a~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~i~~gd~~~ 340 (408) T protein:vir:74 263 NTSVDPAI--IATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPNSGSTVYPLYYGDMSQ 340 (408) T ss_pred HHhhhhhh--cCCCEEEEcHHHHHHHHHhhcCCCceEeccCcCCCCCceecceeeEEecCcccccccCCcceEEEEehhc Confidence 4 666555 357899999999999999999999998743 3456999999998653 442 3489999997 Q ss_pred -eEEEeecceEEEEeccc--ccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 788 -MIMGMWGALDIQVNPYA--LDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 788 -~~i~~~~~l~i~~~~~~--~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) |.+++++++++.++++. .|.+|++.||++.|+|+++++|+||++++.+= T Consensus 341 ~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~ 392 (408) T protein:vir:74 341 AITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFTA 392 (408) T ss_pred cEEEEEecceEEEEeccccchhhcceeeEEEEEeeCcEEecccceEEEEeec Confidence 67899999999999874 58999999999999999999999999999533 No 74 >protein:vir:4830 Length: 397 # NCBI annotation: MPL-7201 # Family: family:all:21 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038327;genbank:gi:9634653;genbank:GeneID:1262632 Probab=100.00 E-value=6.2e-48 Score=279.39 Aligned_cols=367 Identities=13% Similarity=0.150 Sum_probs=226.8 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhHHHHHHHHhhhhhhHHHH Q lcl|NC_016164. 412 TIDMEAVRAQAAADERSRVASITSLCREHKADDLAQGLIESGASEADAMRSVLSEIAKRPAAQPATPAAPVRSAQPIAAG 491 (836) Q Consensus 412 ~~~~e~~~~~~~~~~~~~~~ei~al~~~~~l~e~a~eliee~~t~~e~~~~~l~~l~~~~~a~~~~~~~~~~~~~~~~~~ 491 (836) +...++ ......+......++.+ .......+ .....+.++.+...................... T Consensus 1 Mk~~~e-l~~~~~~~~~~i~~~~~---------~~~~~~~~----~~~~~ee~~~l~~ei~~~~~~~~~~~~~~~~~~-- 64 (397) T protein:vir:48 1 MKTSNE-LHDLWVAQGDKVENLNE---------KLNVAMLD----DSVTAEELQAIKNERDTAKMKRDMFKEQYTEAR-- 64 (397) T ss_pred CchHHH-HHHHHHHHHHHHHHHHH---------HHHHhhcc----hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-- Confidence 000000 00111111111111110 00000000 000001111111111110000000000000000 Q ss_pred hhhhhhhhhhhHHHHHHHhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhccccccccc Q lcl|NC_016164. 492 GGSADIGLTDKEARSFSFVRAIRAQMMPGDRAAFEAAAFEREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTASAAGDL 571 (836) Q Consensus 492 ~~~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~~ 571 (836) .. ....... . ...+.... ......+..+.+.....+. ..........++++.|++ T Consensus 65 --~~-------~~~~~~~----~-~~~~~~~~-------~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~t~~~gg~ 119 (397) T protein:vir:48 65 --AN-------EVVNMSE----E-EKKPLTKS-------EEEVKAGFVKDFKNLVRGR----YQNLLDSKTDASGSDAGL 119 (397) T ss_pred --Hh-------hhhhhhh----h-ccccccch-------hhHHHHHHHHHHHHHHhhh----hhHHHHHhhccCCccccc Confidence 00 0000000 0 00000000 0000011111111111111 001111223344556788 Q ss_pred ccchhhHHHHHHHHHhhhhhhhhccee-eecCCceEEEEEec-CCceeeeeccCcccccc-cccceeEEeeeeeeeeeeh Q lcl|NC_016164. 572 VFTDGRPGSFIELLRNRLALNTLGVTM-LTGLQGPVAIPRQT-GAATAYWVAEGGDPTES-QPSVDQVALVAKTLGAYTE 648 (836) Q Consensus 572 vvp~~~~~~ii~~l~~~~~l~~l~~~~-~~~~~~~~~~p~~~-~~~~a~~v~Eg~~~~~~-~~~~~~it~~~~t~~~~i~ 648 (836) ++|+.+...|++.+++.++|++++... +++..+.+.++... ..+.+.|++|++.++++ +++|+++++++++++++++ T Consensus 120 ~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~ 199 (397) T protein:vir:48 120 TIPQDIQTAIHTLVRQYDSLQEYVNVENVTTLTGSRVYEKWADITGLAKLDDEAGSIGTNDDPKLYPIRYAIKRYAGIST 199 (397) T ss_pred cccHHHHHHHHHHHHHHHHHHhhhceeeccCCcceEEEEeecCCCcceeeeccccccccccccceeeEEeeheeeeeehh Confidence 899999999999999999999986544 33444555665544 45678999999999976 5899999999999999999 Q ss_pred hHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccccccccccchhHHHHHHHHHHHhhhccc Q lcl|NC_016164. 649 FSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTENFGATNPTYVELVSMESKVAADNAD 728 (836) Q Consensus 649 ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~aa~~~t~~~l~~a~~~l~~~~~~ 728 (836) ||+++|.++.++++++|.+.|+++++++++.+|++|+|++..+ ++..++++|.+++.+|...+. T Consensus 200 iS~ell~ds~~~l~~~v~~~l~~~~~~~~d~~il~G~g~~~~~---------------~~~~~~d~i~~~~~~l~~~~~- 263 (397) T protein:vir:48 200 VTNSLLADSAENILAWLSGWIAKKVVVTRNKAILEAIATLPTK---------------PTLTKWDDIIDLQAKVDPAIK- 263 (397) T ss_pred hHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc---------------cccccHHHHHHHHHHhhhhhc- Confidence 9999999999999999999999999999999999999975432 233478899999999987754 Q ss_pred cCccEEEecHHHHHHHHHHhhccCcccccc----CCCCeecceeeEeeCc--cc-----cceEEEEehhc-eEEEeecce Q lcl|NC_016164. 729 IGAMSYLTNSTLYGGFKTTEKATSTAQFVL----EPGGTVNGYNVVRSNQ--VA-----NGDVFFGVWNQ-MIMGMWGAL 796 (836) Q Consensus 729 ~~~~~~vmnp~~~~~L~~lkd~~g~~~~~~----~~~~~l~G~pVv~s~~--~~-----~~~i~~gD~s~-~~i~~~~~l 796 (836) .+++|+|||.+|..|+.++|++|+|+|.. +.+++|+|+||++++. ++ ...++||||+. |.+++++++ T Consensus 264 -~~a~~v~n~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~ 342 (397) T protein:vir:48 264 -QTSFFLTNTSGFTALKKVKNAFGDYLMERDVKSPTGYSIDGFAVKEVADRWLANASSGAMPLYFGDLKQAVTLFDRQQM 342 (397) T ss_pred -CCCEEEECHHHHHHHHHhhcCCCceeeccCcCCCCCceeccceeEEecccccCCcCCCceEEEEEeccceEEEEeecce Confidence 57899999999999999999999998743 3456899999987543 32 34589999997 578999999 Q ss_pred EEEEeccc--ccccCcEEEEEEEEeccEEEcccceEEEee-cC Q lcl|NC_016164. 797 DIQVNPYA--LDKSGSVRVTALQDVDVAVRHPEAFCRGND-NL 836 (836) Q Consensus 797 ~i~~~~~~--~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~-A~ 836 (836) ++..+++. +|.+|++.||++.|+|+++++|+||++++. +. T Consensus 343 ~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~ 385 (397) T protein:vir:48 343 SLLSTNIGGGAFETDTTKIRVIDRFDVVATDTESFVPASFKAI 385 (397) T ss_pred EEEEeccchhhhhcCceeEEEEeeeccEEecccceEEEEeccc Confidence 99998864 699999999999999999999999999883 33 No 75 >protein:vir:94771 Length: 298 # NCBI annotation: major head protein # Family: family:all:966 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996706;genbank:gi:45597421;genbank:GeneID:2769044 Probab=100.00 E-value=1.8e-49 Score=287.81 Aligned_cols=269 Identities=16% Similarity=0.135 Sum_probs=224.0 Q ss_pred cccccccccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEecCCceeeeeccCcccccccccceeEEeeeeeee Q lcl|NC_016164. 565 ASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQTGAATAYWVAEGGDPTESQPSVDQVALVAKTLG 644 (836) Q Consensus 565 ~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~it~~~~t~~ 644 (836) -...|+.++|+.+...|++.+++.+++++++ +..+...+.+++|+.++.+.+.|++|++++++++++|+++++.+++++ T Consensus 1 ma~~gG~lip~~~~~~ii~~~~~~s~i~~~~-~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~f~~v~l~~~k~~ 79 (298) T protein:vir:94 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLS-AQKPIPFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQTMVPIKVE 79 (298) T ss_pred CeeccccccChhHHHHHHHHHHhhchhhhhc-ceeeccCCceEEEEEecCcceEEeeCCccccccccceeEEEEeeeEEE Confidence 2234567788899999999999999999995 556666778999999999999999999999999999999999999999 Q ss_pred eeehhHHHHHh---cchhHHHHHHHHHHHHHHHHHHHHHHHhhc----CCccccccccccccccc---ccccccchhHHH Q lcl|NC_016164. 645 AYTEFSRRLML---QSSIDVEQMVRTELATVIALEIDRAALYGL----GSNSQPEGLKFVTGINT---ENFGATNPTYVE 714 (836) Q Consensus 645 ~~i~ISrelL~---ds~~~l~~~i~~~l~~a~a~~~d~~il~G~----Gt~~~p~Gi~~~~~~~~---~t~aa~~~t~~~ 714 (836) ++++||+|+|. ++..+++++|.++|+++++++++.++|+|. |++..+.|+....+..+ .........+++ T Consensus 80 ~~~~iS~ell~~~~~~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (298) T protein:vir:94 80 YGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPNGA 159 (298) T ss_pred EeeehhHHHhccCCccHHHHHHHHHHHHHHHHHHHHHHHhhcccccCCCcccccccccccccccccccccccccccHHHH Confidence 99999999996 455789999999999999999999999984 33333444332222211 122233345789 Q ss_pred HHHHHHHHhhhccccCccEEEecHHHHHHHHHHhhccCcccccc----CCCCeecceeeEeeCccccc------eEEEEe Q lcl|NC_016164. 715 LVSMESKVAADNADIGAMSYLTNSTLYGGFKTTEKATSTAQFVL----EPGGTVNGYNVVRSNQVANG------DVFFGV 784 (836) Q Consensus 715 l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~~g~~~~~~----~~~~~l~G~pVv~s~~~~~~------~i~~gD 784 (836) +.+++.++..++. .+.+|+|||++|.+|++++|.+|+++|.. +.+++|+|+||++++.+|.+ .+++|| T Consensus 160 i~~~~~~~~~~~~--~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~tl~G~PV~~~~~v~~~~~~~~~~~~~Gd 237 (298) T protein:vir:94 160 IENAVELLTGVDA--DVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQRDRAIIGD 237 (298) T ss_pred HHHHHHhhhhcCC--CccEEEEcHHHHHHHHHhhccCCCeeecCcccCCCCceecceeeEEecccccccCCCccEEEEee Confidence 9999999987754 46689999999999999999999998743 34579999999999999853 589999 Q ss_pred hhce-EEEeecceEEEEeccc--------ccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 785 WNQM-IMGMWGALDIQVNPYA--------LDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 785 ~s~~-~i~~~~~l~i~~~~~~--------~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) |++. .++.++++++.++++. +|.+|++.||++.|+|+++.+|+||++++.|= T Consensus 238 fs~~~~~~~~~~~~~~~~~~~~~d~~~~~~f~~~~v~~r~~~r~~~~~~~~~a~~~l~~~t 298 (298) T protein:vir:94 238 FANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) T ss_pred ccceEEEEEecCceEEEeecCCCcCcchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 9885 5889999999887753 58999999999999999999999999999999 No 76 >protein:vir:9643 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795405;genbank:gi:28876178;genbank:GeneID:1257724 Probab=100.00 E-value=7.4e-49 Score=284.44 Aligned_cols=344 Identities=13% Similarity=0.001 Sum_probs=228.9 Q ss_pred hhhhhhhhhhHHHHHHHHHHHhhhhhhhHHHHHHHHhhhhhhHHHHhhhhhhhhhhhHHHHHHHhhhhhhhhhhhhhhhh Q lcl|NC_016164. 446 AQGLIESGASEADAMRSVLSEIAKRPAAQPATPAAPVRSAQPIAAGGGSADIGLTDKEARSFSFVRAIRAQMMPGDRAAF 525 (836) Q Consensus 446 a~eliee~~t~~e~~~~~l~~l~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~ 525 (836) +.-..++.....+.....++.+...... ....+... ..... ......... ... T Consensus 1 M~i~~~~~~~~~e~~~~l~~~~~~~~~~-e~~~~~~~----~~~~~-~~~~~~~~~----~~e----------------- 53 (377) T protein:vir:96 1 MAINLKELPKYREAVAELSAKISAGATP-EEQEKLFE----AAFTT-MGDEILAKN----EEE----------------- 53 (377) T ss_pred CCccHHHHHHHHHHHHHHHHHHhhcccH-HHHHHHHH----HHHHH-HHHHHHHHH----HHH----------------- Confidence 2222222222222222222222111000 00000000 00000 000000000 000 Q ss_pred hhhhhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeecCCce Q lcl|NC_016164. 526 EAAAFEREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGP 605 (836) Q Consensus 526 ~~~~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~ 605 (836) .......+.+. ......+...-.+....++.++|++++|+.+...|++.+...+++++++. +.+. ++. T Consensus 54 --------~~~~~~~~~~~--~~lt~ee~~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~-v~~~-~~~ 121 (377) T protein:vir:96 54 --------MERMFDLRDKN--RELTAEEIKFFNDIDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVIN-FKNT-SLR 121 (377) T ss_pred --------HHHHHHhccCC--cccCHHHHHHHHHHHhcCCCCCCceecCHHHHHHHHHHHHhhhhhhhhce-eEec-CCc Confidence 00000000000 00000000001122344566788899999999999999999999999854 4444 467 Q ss_pred EEEEEecCCceeeeeccCcccc-cccccceeEEeeeeeeeeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_016164. 606 VAIPRQTGAATAYWVAEGGDPT-ESQPSVDQVALVAKTLGAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYG 684 (836) Q Consensus 606 ~~~p~~~~~~~a~~v~Eg~~~~-~~~~~~~~it~~~~t~~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G 684 (836) ..+|+.++.+.+.|++|+++.+ .++++|+++++.+++++++++||++||.|+.+++++||...|+++++++++.+|++| T Consensus 122 ~~i~~~~~~~~a~wv~e~~~~~~~~~~~f~~i~l~~~kl~~~~~is~~ll~ds~~~le~~i~~~l~~~~~~~~~~a~i~G 201 (377) T protein:vir:96 122 LKALTAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKG 201 (377) T ss_pred eEEEEecCCcceeEeecccccccccCccceeEeeeeeeEEeechhhHHHhhcchhhHHHHHHHHHHHHHHHHHhhceEec Confidence 8899999999999999998876 578999999999999999999999999999999999999999999999999999999 Q ss_pred cCCccccccccccccccccccc-----------------ccchhHHHHHHHHHHHhhhcc---------ccCccEEEecH Q lcl|NC_016164. 685 LGSNSQPEGLKFVTGINTENFG-----------------ATNPTYVELVSMESKVAADNA---------DIGAMSYLTNS 738 (836) Q Consensus 685 ~Gt~~~p~Gi~~~~~~~~~t~a-----------------a~~~t~~~l~~a~~~l~~~~~---------~~~~~~~vmnp 738 (836) +|+ ++|.||++.......... ...++.+.+.++++.|...+. ..++++|+||| T Consensus 202 ~G~-~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~a~~~mn~ 280 (377) T protein:vir:96 202 NGL-LQPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNP 280 (377) T ss_pred cCC-CcceeeeeccccccccccccccccceeeccccccccccCChhHHHHHHHHHHHhhccccccccccccCceEEEEch Confidence 997 589999986543322111 112334556666666655442 23567899999 Q ss_pred HHHHHHH---HHhhccCccccccCCCCeeccee--eEeeCccccceEEEEehhceEEEeecceEEEEecccccccCcEEE Q lcl|NC_016164. 739 TLYGGFK---TTEKATSTAQFVLEPGGTVNGYN--VVRSNQVANGDVFFGVWNQMIMGMWGALDIQVNPYALDKSGSVRV 813 (836) Q Consensus 739 ~~~~~L~---~lkd~~g~~~~~~~~~~~l~G~p--Vv~s~~~~~~~i~~gD~s~~~i~~~~~l~i~~~~~~~~~~~~~~~ 813 (836) .++..+. ..++++| .+.+++|+| |+.++.+|++.++||||++|.+++++++++..+++.+|.+|++.| T Consensus 281 ~t~~~~~~~~~~~~~~G-------~~~~~l~~p~~v~~s~~~p~~~i~fgdf~~Y~i~~r~~~~i~~~~~~~~~~d~~~f 353 (377) T protein:vir:96 281 EDRWTLEAKFTSRNQFG-------EYVTVLPHGITILESLAVETGKAIAFVANRYDAFMATASTIEEYDQTFAMEDLQLY 353 (377) T ss_pred hhHHhccccccccCCCC-------CceeccCCCceEEecCCCCcccEEEEEcCcEEEEEecccEEEeehhhhhhcCCeEE Confidence 9986552 1222222 334677776 677899999999999999999999999999999999999999999 Q ss_pred EEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 814 TALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 814 r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) |+.+|+|+++++++||++++-++ T Consensus 354 ~~~~r~dG~~~d~~a~~vl~l~~ 376 (377) T protein:vir:96 354 LTKNYFYGKAKDNHTAALLTLAG 376 (377) T ss_pred EEEEEEcCEEecCCcEEEEEEec Confidence 99999999999999999999999 No 77 >protein:vir:3845 Length: 395 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050151;swissprot:trembl:q9t1f6;genbank:gi:9633043;uniprot:Q9T1F6;genbank:GeneID:1262163 Probab=100.00 E-value=1.4e-47 Score=277.53 Aligned_cols=365 Identities=10% Similarity=0.066 Sum_probs=225.2 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhHHHHHHHHhhhhhhHH Q lcl|NC_016164. 410 SSTIDMEAVRAQAAADERSRVASITSLCREHKADDLAQGLIESGASEADAMRSVLSEIAKRPAAQPATPAAPVRSAQPIA 489 (836) Q Consensus 410 ~~~~~~e~~~~~~~~~~~~~~~ei~al~~~~~l~e~a~eliee~~t~~e~~~~~l~~l~~~~~a~~~~~~~~~~~~~~~~ 489 (836) ++..++....... .+...++.+..+....++.. .......+.++.+.+............... T Consensus 1 M~~~eL~~~~~~~----~~~~~~l~e~~~~~~~~~~~--------~~~~~~~ee~~~l~~~i~~~~~~~~~~~~~----- 63 (395) T protein:vir:38 1 MNINQLKDAFDMA----GQKVQDLEDKRAQFAIDLGN--------DASSHSVDDINKLNASLKNAKMAQELAKSA----- 63 (395) T ss_pred CCHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHhh--------hHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----- Confidence 1111111111111 11111111111110000000 000000000111111000000000000000 Q ss_pred HHhhhhhhhhhhhHHHHHHHhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhccccccc Q lcl|NC_016164. 490 AGGGSADIGLTDKEARSFSFVRAIRAQMMPGDRAAFEAAAFEREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTASAAG 569 (836) Q Consensus 490 ~~~~~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g 569 (836) . . ......... ...+..................+.+. .........+..+.| T Consensus 64 --~-~---------~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~-------------~~~~~~~~~~~~~~g 115 (395) T protein:vir:38 64 --Y-E---------DARANLNAE---PVNKKPLPVKDGKPDAQAMKNQFVKD-------------FKNLVTSGTTGTGNA 115 (395) T ss_pred --H-H---------HHHhhhhhc---cccccccchhhhhHHHHHHHHHHHHH-------------HHHHHhhccCccCCC Confidence 0 0 000000000 00000000000000011111111110 001111223344567 Q ss_pred ccccchhhHHHHHHHHHhhhhhhhhccee-eecCCceEEEEEecC-CceeeeeccCcccccc-cccceeEEeeeeeeeee Q lcl|NC_016164. 570 DLVFTDGRPGSFIELLRNRLALNTLGVTM-LTGLQGPVAIPRQTG-AATAYWVAEGGDPTES-QPSVDQVALVAKTLGAY 646 (836) Q Consensus 570 ~~vvp~~~~~~ii~~l~~~~~l~~l~~~~-~~~~~~~~~~p~~~~-~~~a~~v~Eg~~~~~~-~~~~~~it~~~~t~~~~ 646 (836) +.++|+.+...|++.+++.+++++++..+ ++...+.+.+++... .+.+.|++|+++++++ +++|+++++++++++++ T Consensus 116 g~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~f~~v~~~~~k~~~~ 195 (395) T protein:vir:38 116 GLTIPEDIQLQIRTLTRSFTSLESLANVENVTTSHGSRVYEKLADITPLKDLDDESALIGDNDDPELTVVKYLIHRYAGI 195 (395) T ss_pred ceecchhHhhHHHHHHHhhcchhhhcceeeccCCcceEEEEeeccCCccccccccccccccccccceeeEEeeeeeeEee Confidence 88899999999999999999999996543 334556777776655 4667899999999975 58999999999999999 Q ss_pred ehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccccccccccchhHHHHHHHHH-HHhhh Q lcl|NC_016164. 647 TEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTENFGATNPTYVELVSMES-KVAAD 725 (836) Q Consensus 647 i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~aa~~~t~~~l~~a~~-~l~~~ 725 (836) ++||+++|.|+.++++++|.+.|+++++++++.+|++|+|++..+.| ..+++++.+++. .+... T Consensus 196 ~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~il~g~g~~~~~~~---------------~~~~~~i~~~~~~~l~~~ 260 (395) T protein:vir:38 196 TTVTNTLLKDTVDNIIQWLVNWAAKKDVVTRNAKILEVMGKAPKKPT---------------ISQFDNIKDLENNTLDPA 260 (395) T ss_pred hhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc---------------cccHHHHHHHHHHhhhhh Confidence 99999999999999999999999999999999999999997543322 235677777775 44444 Q ss_pred ccccCccEEEecHHHHHHHHHHhhccCcccccc----CCCCeecceeeEeeCcccc------ceEEEEehhc-eEEEeec Q lcl|NC_016164. 726 NADIGAMSYLTNSTLYGGFKTTEKATSTAQFVL----EPGGTVNGYNVVRSNQVAN------GDVFFGVWNQ-MIMGMWG 794 (836) Q Consensus 726 ~~~~~~~~~vmnp~~~~~L~~lkd~~g~~~~~~----~~~~~l~G~pVv~s~~~~~------~~i~~gD~s~-~~i~~~~ 794 (836) + ..+++|+|||.+|..|+.++|++|+|+|.. +.+++|+|+||+++++++. ..++||||++ |.++++. T Consensus 261 ~--~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~i~~gd~~~~~~i~~~~ 338 (395) T protein:vir:38 261 I--ESTSSFITNQSGYNILSKVKDADGRYLMQPDVTSPDKYLIDGKPVIRIADKWLPDVSGSHPLYFGDLKQGITLFDRQ 338 (395) T ss_pred h--cCCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCcceeccceeEEecccccCcCCCcceEEEEeccccEEEEEec Confidence 3 457899999999999999999999998743 3456899999999886542 3489999997 7889999 Q ss_pred ceEEEEecc--cccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 795 ALDIQVNPY--ALDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 795 ~l~i~~~~~--~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) ++++.++++ .+|.+|++.||++.|+|+++.+|+||++++..- T Consensus 339 ~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~ 382 (395) T protein:vir:38 339 QMQIDTTNVGAGSFEHDTTKLRFIDRFDVQLIDDGAFAAASFKT 382 (395) T ss_pred ceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEeec Confidence 999998875 569999999999999999999999999999665 No 78 >protein:vir:100632 Length: 381 # NCBI annotation: 77ORF006 # Family: family:all:635 # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958606;genbank:gi:41189521;genbank:GeneID:2743778 Probab=100.00 E-value=6e-49 Score=284.94 Aligned_cols=343 Identities=13% Similarity=0.065 Sum_probs=224.1 Q ss_pred hhhhhHHHHHHHHHHHhhhhhhhHHHHHHHHhhhhhhHHHHhhhhhhhhhhhHHHHHHHhhhhhhhhhhhhhhhhhhhhh Q lcl|NC_016164. 451 ESGASEADAMRSVLSEIAKRPAAQPATPAAPVRSAQPIAAGGGSADIGLTDKEARSFSFVRAIRAQMMPGDRAAFEAAAF 530 (836) Q Consensus 451 ee~~t~~e~~~~~l~~l~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~ 530 (836) ... ...+++.+...+...... ... ...... .... .. ............. T Consensus 1 m~~--------kl~~~~~~~~~~~~~~~~----~~~-~~~~~~----~~~~------~~---~~~~~~~~~~~~~----- 49 (381) T protein:vir:10 1 MTI--------NLSETFANAKNEFINAVN----NGE-PQERQN----ELYG------DM---INQLFEETKLQAK----- 49 (381) T ss_pred Cch--------hHHHHHHHHHHHHHHHHH----hhh-HHHHHH----HHHH------HH---HHhhhhhHHHHHH----- Confidence 000 001111110000000000 000 000000 0000 00 0000000000000 Q ss_pred hHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEE Q lcl|NC_016164. 531 EREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPR 610 (836) Q Consensus 531 ~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~ 610 (836) .+.......+.+.. ..... ..........++.+.|++++|+.+.+.|++.+...+++++++. +.+. ++..++|+ T Consensus 50 -~e~~~~~~~~~~~~--~l~~~-e~~~~~~~~~~t~~~Gg~lvP~~~~~~I~~~l~~~spir~~a~-v~~~-~~~~~i~~ 123 (381) T protein:vir:10 50 -AEAERVSSLPKSAQ--TLSAN-QRNFFMDINKSVGYKEEKLLPEETIDRIFEDLTTNHPLLADLG-IKNA-GLRLKFLK 123 (381) T ss_pred -HHHHHHHHhccccc--ccCHH-HHHHHHHHhhcCCCCCceecCHHHHHHHHHHHHhhcceeeeee-eEec-CcceEEEe Confidence 00011111111110 01111 1111112334455678889999999999999999999999854 4554 46788999 Q ss_pred ecCCceeeeeccCcccc-cccccceeEEeeeeeeeeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcc Q lcl|NC_016164. 611 QTGAATAYWVAEGGDPT-ESQPSVDQVALVAKTLGAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSNS 689 (836) Q Consensus 611 ~~~~~~a~~v~Eg~~~~-~~~~~~~~it~~~~t~~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~ 689 (836) .++.+.+.|++|+++.+ ..+++|+++++.+++++++++||++||.|+..++++||...|+++++++++.+|++|+|+ + T Consensus 124 ~~~~~~a~W~~e~~~~~~~~~~~f~~i~l~~~kl~a~i~is~elL~Ds~~~le~~i~~~la~~~a~~~~~afi~GdG~-~ 202 (381) T protein:vir:10 124 SETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGK-D 202 (381) T ss_pred ecCCcceEEeecccccccccCccceeEeecceeEEeeccccHHHHhccHHHHHHHHHHHHHHHHHHHhhceeEecccC-C Confidence 99999999999988865 668999999999999999999999999999999999999999999999999999999997 5 Q ss_pred cccccccccccc-ccc-------ccccchhHHHHHHHHHHHhh-------h-----ccccCccEEEecHHHHHHHHHHh- Q lcl|NC_016164. 690 QPEGLKFVTGIN-TEN-------FGATNPTYVELVSMESKVAA-------D-----NADIGAMSYLTNSTLYGGFKTTE- 748 (836) Q Consensus 690 ~p~Gi~~~~~~~-~~t-------~aa~~~t~~~l~~a~~~l~~-------~-----~~~~~~~~~vmnp~~~~~L~~lk- 748 (836) +|.||++..... ..+ .+.+.+++.++..+...+.. . ..+.+++.|+|||.++..|+.++ T Consensus 203 qP~Gil~~~~~~~~~~~g~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~vmn~~t~~~l~~~~~ 282 (381) T protein:vir:10 203 QPIGLNRQVQKGVSVTDGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYT 282 (381) T ss_pred CceeeeecCCccccccccccccccccccccccchhhHHHHHHHHHHhhhhhhccccccccCceEEEEchhhHHhhccccc Confidence 899998643221 111 11122333333333322211 1 12345778999999998887544 Q ss_pred --hccCccccccCCCCeecceeeEeeCccccceEEEEehhceEEEeecceEEEEecccccccCcEEEEEEEEeccEEEcc Q lcl|NC_016164. 749 --KATSTAQFVLEPGGTVNGYNVVRSNQVANGDVFFGVWNQMIMGMWGALDIQVNPYALDKSGSVRVTALQDVDVAVRHP 826 (836) Q Consensus 749 --d~~g~~~~~~~~~~~l~G~pVv~s~~~~~~~i~~gD~s~~~i~~~~~l~i~~~~~~~~~~~~~~~r~~~r~d~~v~~p 826 (836) +++|+ |++..+ +|+||++++.||+++++||||++|.++++.++++..+++.+|.+|++.||++.|+|++++++ T Consensus 283 ~~~~~G~--~v~~lp---~g~~vv~~~~~p~~~i~fGDfs~Y~i~~r~~~~i~~~~~~~~~~d~~~f~a~~r~dG~~~~~ 357 (381) T protein:vir:10 283 HLNANGV--YVTALP---FNLNVIESTVQEAGKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDN 357 (381) T ss_pred cCCCCCc--eeecCC---CCceeEEcCCCCcCcEEEEEcccEEEEEecccEEEeechhhhhcCceEEEEEEEEcCEEecC Confidence 55554 444433 58899999999999999999999999999999999999999999999999999999999999 Q ss_pred cceEEEeecC Q lcl|NC_016164. 827 EAFCRGNDNL 836 (836) Q Consensus 827 ~Af~~l~~A~ 836 (836) +||++++... T Consensus 358 ~A~~v~~l~~ 367 (381) T protein:vir:10 358 KVAAVWKLDL 367 (381) T ss_pred CcEEEEEEee Confidence 9999988776 No 79 >protein:vir:78223 Length: 333 # NCBI annotation: Putative major head protein # Family: family:all:966 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491666;genbank:gi:157786490;genbank:GeneID:5625701 Probab=100.00 E-value=8.4e-49 Score=284.16 Aligned_cols=289 Identities=14% Similarity=0.119 Sum_probs=229.0 Q ss_pred HHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEe Q lcl|NC_016164. 532 REVSEATAQRMGVTPRGILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQ 611 (836) Q Consensus 532 ~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~ 611 (836) ....+++... .. .........+.++.++|+.+.+.|++.+++.+++++++. +++..++.+++|+. T Consensus 1 ~a~l~el~~~----~~----------~~~~~g~~~~~~~~liP~~~~~~ii~~l~~~s~l~~~~~-~~~~~~~~~~~p~~ 65 (333) T protein:vir:78 1 MATLNELLPN----SA----------GSNHQGRLAHVPSDLLPKEIVGPIFDKAQESSLVLRMGE-QIPISYGETIIPTT 65 (333) T ss_pred CchhHHhhhh----cc----------cccccCceecCCccccchhHHHHHHHHHHhhchhhhhcc-eeeccCCceEEEEE Confidence 0000000000 00 000011112234448899999999999999999999954 56677788999999 Q ss_pred cCCceeeeeccC--------cccccccccceeEEeeeeeeeeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_016164. 612 TGAATAYWVAEG--------GDPTESQPSVDQVALVAKTLGAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALY 683 (836) Q Consensus 612 ~~~~~a~~v~Eg--------~~~~~~~~~~~~it~~~~t~~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~ 683 (836) ++.+.+.|++|+ +.++.++++|+++++++++++++++||+|+|.++..+++++|.++|++++++++|.++|+ T Consensus 66 ~~~~~a~~v~eg~~~~~~e~~~~~~~~~~f~~i~l~~~kl~~~~~is~ell~~s~~~~~~~i~~~la~ai~~~~d~~~l~ 145 (333) T protein:vir:78 66 VKRPEVGQVGVGTSNEQREGGLKPLSGTAWDTRSVSPIKLATIVTVSEEFARMNPSGLYTKLQGDLAYAIGRGIDLAVFH 145 (333) T ss_pred eCCceeEeecCcccccccccccccccccceeEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 999998888776 456788999999999999999999999999999999999999999999999999999999 Q ss_pred hcCCc--cccccccccccccccc-----ccccchhHHHHHHHHHHHhhhccccCccEEEecHHHHHHHH---HHhhccCc Q lcl|NC_016164. 684 GLGSN--SQPEGLKFVTGINTEN-----FGATNPTYVELVSMESKVAADNADIGAMSYLTNSTLYGGFK---TTEKATST 753 (836) Q Consensus 684 G~Gt~--~~p~Gi~~~~~~~~~t-----~aa~~~t~~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~---~lkd~~g~ 753 (836) |+|++ ..|.|+.+.....+.+ ...+.+++++|.+++..+..++ ....+.|+|||.+|..|. .++|.+|+ T Consensus 146 G~g~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~-~~~~~~~vmn~~~~~~L~~~~~~~d~~G~ 224 (333) T protein:vir:78 146 GKSPLTGSALQGIDTDNVIANTTNVDYLQETGDPLLDRLLDGYDLVSANT-DVEFNGWAVDPRFRAHLLRAQAYRDANGN 224 (333) T ss_pred ccCCCCCcccccccccccccccccccccccccchhHHHHHHHHHhhcccc-ccCceEEEEcchHHHHHHHHhhhcCCCCc Confidence 99864 3567776654433222 2334567899999998887654 345668999999987765 46788999 Q ss_pred ccccc----CCCCeecceeeEeeCccccc---------eEEEEehhceEEEeecceEEEEeccc-----------ccccC Q lcl|NC_016164. 754 AQFVL----EPGGTVNGYNVVRSNQVANG---------DVFFGVWNQMIMGMWGALDIQVNPYA-----------LDKSG 809 (836) Q Consensus 754 ~~~~~----~~~~~l~G~pVv~s~~~~~~---------~i~~gD~s~~~i~~~~~l~i~~~~~~-----------~~~~~ 809 (836) ++|.. +.+++|+|+||++++++|.+ .++||||+.|.+++++++++..+++. .|.+| T Consensus 225 ~i~~~~~~~~~~~~l~G~Pv~~~~~i~~~~~~~~~~~~~~~~gD~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~~ 304 (333) T protein:vir:78 225 VDPSRINLAAQTGDVLGLPAQFGRAVGGDLGAAVDSKTRIIGGDFSQLKFGFADEIRIKMSDTATLTDSGSATVSMWQTN 304 (333) T ss_pred eeecCccccCCCceeeceeeEEccccCCCccccCCCccEEEEEecccEEEEEeeccEEEEeccccccccccceeehhhcC Confidence 88743 34579999999999999854 58999999999999999999998873 58999 Q ss_pred cEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 810 SVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 810 ~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) ++.||+++|+|+++++|+||++++.|= T Consensus 305 ~v~~r~~~r~d~~v~~~~a~~~l~~~~ 331 (333) T protein:vir:78 305 QIAILIEVTFGWLLGDKQAFVKFVDDE 331 (333) T ss_pred cEEEEEEEEEccEEecccceEEEeccC Confidence 999999999999999999999999888 No 80 >protein:vir:104085 Length: 320 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655596;genbank:gi:109392467;genbank:GeneID:4156953 Probab=100.00 E-value=7.6e-49 Score=284.40 Aligned_cols=285 Identities=17% Similarity=0.171 Sum_probs=224.8 Q ss_pred hhhhhhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEecCCceeeeeccCc Q lcl|NC_016164. 545 TPRGILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQTGAATAYWVAEGG 624 (836) Q Consensus 545 ~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~ 624 (836) ...+... ....++....++ +.++.++|+.+.+.+++.+++.++++++++ +++..++.+++|+..+.+.+.|++|++ T Consensus 1 ~~~~~~~--~~~~~~~~~t~~-~~~~~~ip~~~~~~ii~~~~~~s~l~~~~~-~~~~~~~~~~~p~~~~~~~a~~v~E~~ 76 (320) T protein:vir:10 1 MAAGTAF--QVDHAQIAQTGD-TMFKGYLEPEQAKDYFAEAEKTSIVQQFAQ-KVPMGTTGQKIPHWIGDVSAQWIGEGD 76 (320) T ss_pred CCCCccC--CHHHHHhhcccc-ccccccccHHHHHHHHHHHHhccchhhhcc-eeeccCCceEEEEEeCCcceEEecCCc Confidence 1111110 112222223333 333445677788999999999999999854 556667789999999999999999999 Q ss_pred ccccccccceeEEeeeeeeeeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccccc- Q lcl|NC_016164. 625 DPTESQPSVDQVALVAKTLGAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTE- 703 (836) Q Consensus 625 ~~~~~~~~~~~it~~~~t~~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~- 703 (836) ++|+++++|++++++++|++++++||+|+|.++.++++++|.+.|++++++++|.+||+|+|+ +.|.++......... T Consensus 77 ~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~l~~a~a~~~d~a~l~G~g~-~~~~~~~~~~~~~~~~ 155 (320) T protein:vir:10 77 MKPITKGNMTSQNIAPHKIATIFVASAETVRANPANYLGTMRTKVATAFAMAFDSAALNGTDS-PFPTYLAQTTKSVSLA 155 (320) T ss_pred cccccccceeEEEEeeEEEEEeehhhHHHHhcChHHHHHHHHHHHHHHHHHHHHHHhhcccCC-CCCcccccccccccce Confidence 999999999999999999999999999999999999999999999999999999999999996 455555433221111 Q ss_pred cc---cccchh--HHHHHHHHHHHhhhccccCccEEEecHHHHHHHHHHhhccCccccccC---------CCCeecceee Q lcl|NC_016164. 704 NF---GATNPT--YVELVSMESKVAADNADIGAMSYLTNSTLYGGFKTTEKATSTAQFVLE---------PGGTVNGYNV 769 (836) Q Consensus 704 t~---aa~~~t--~~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~~g~~~~~~~---------~~~~l~G~pV 769 (836) .. ..+.++ .+.+.++...+... +..+++|+|||++|.+|+++||++|+++|... .+++++|+|| T Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~~~~~~i~g~pv 233 (320) T protein:vir:10 156 DPGGATASDLTAYDAVAVNGLSLLVNA--KKKWTHTLLDDIVEPILNGAKDKNGRPLFIESTYTDENSPFRAGRIVSRPT 233 (320) T ss_pred ecccccccccccHHHHHHHHHhhhhcc--cCCCcEEEEcHHHHHHHHHhhccCCceeeccccccCccccccCceeeeeee Confidence 11 111121 12455666666544 35678999999999999999999999987532 1357999999 Q ss_pred EeeCccccce--EEEEehhceEEEeecceEEEEeccc--------------ccccCcEEEEEEEEeccEEEcccceEEEe Q lcl|NC_016164. 770 VRSNQVANGD--VFFGVWNQMIMGMWGALDIQVNPYA--------------LDKSGSVRVTALQDVDVAVRHPEAFCRGN 833 (836) Q Consensus 770 v~s~~~~~~~--i~~gD~s~~~i~~~~~l~i~~~~~~--------------~~~~~~~~~r~~~r~d~~v~~p~Af~~l~ 833 (836) ++++.+|.++ ++||||+++.+++++++++..+++. .|++|++.||+++|+|+++.+|+||++++ T Consensus 234 ~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~~~d~~v~~~~a~~~l~ 313 (320) T protein:vir:10 234 ILSDHVADGTTVGYMGDFRNVIWGQVGGLSFDVTDQATLNLGTPTEPNFVSLWQHNLVAVRVEAEYAFHNNDKDAFVKLT 313 (320) T ss_pred EecCCCCCCceEEEEeecceEEEEEecCeEEEEeecceeeeccccccccchhhhcCcEEEEEEEeeccEEecccceEEEE Confidence 9999999886 5799999999999999999887663 38899999999999999999999999999 Q ss_pred ecC Q lcl|NC_016164. 834 DNL 836 (836) Q Consensus 834 ~A~ 836 (836) .+. T Consensus 314 ~~~ 316 (320) T protein:vir:10 314 NVV 316 (320) T ss_pred ecc Confidence 888 No 81 >protein:vir:95963 Length: 395 # NCBI annotation: ORF009 # Family: family:all:635 # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239802;genbank:gi:66395459;genbank:GeneID:5132880 Probab=100.00 E-value=5.5e-48 Score=279.66 Aligned_cols=352 Identities=12% Similarity=0.062 Sum_probs=225.9 Q ss_pred hhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhHHHHHHHHhhhhhhHHHHhhhhhhhhhhhHHHHHHHhhhhhh Q lcl|NC_016164. 436 LCREHKADDLAQGLIESGASEADAMRSVLSEIAKRPAAQPATPAAPVRSAQPIAAGGGSADIGLTDKEARSFSFVRAIRA 515 (836) Q Consensus 436 l~~~~~l~e~a~eliee~~t~~e~~~~~l~~l~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~a 515 (836) +.... +..++.....+.+..+.+.+... ...........+..... .......... .. T Consensus 1 mt~~~-------~~~e~~~~~~e~~~~~~~~~~~~-~~~e~~~~~~~~~~~~~-----~~~~~~~~~~--------e~-- 57 (395) T protein:vir:95 1 MADMK-------QNNVKLKNYHEHKKQFANLVQNG-ASDEEQSKAFGAMFDAL-----SNDLQEEITA--------EI-- 57 (395) T ss_pred ChhHH-------HHHHHHHHHHHHHHHHHHHHhhh-hhHHHHHHHHHHHHHHH-----HHHHHHHHHH--------HH-- Confidence 11111 11111122222222221111111 00000000000000000 0000000000 00 Q ss_pred hhhhhhhhhhhhhhhhHHHHHHHH-HHhhhhhhhhhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhh Q lcl|NC_016164. 516 QMMPGDRAAFEAAAFEREVSEATA-QRMGVTPRGILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTL 594 (836) Q Consensus 516 ~~~~~~~~~~~~~~~~~~~a~~~~-~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l 594 (836) ......... ...+ ........... .......+.+.|++++|+.+.+.|++.++..++++++ T Consensus 58 ---------------~~~~~~~~~~~~r~--~~~l~~ee~~~-~~~~~~~t~~~gG~liP~~~~~~Ii~~l~~~s~i~~~ 119 (395) T protein:vir:95 58 ---------------NNRVVDNGILAKRS--QDPLTSEERKF-FNDINYDVGYTDEKILPETVVERVFDDLQKDHPLLSK 119 (395) T ss_pred ---------------HHHHHHHHHHhhcC--ccccchHHHHH-HHHHhhccCCCCceeccHHHHHHHHHHHHhhhhhhhh Confidence 000000000 0000 00000000000 1112334556788899999999999999999999999 Q ss_pred cceeeecCCceEEEEEecCCceeeeeccCcccc-cccccceeEEeeeeeeeeeehhHHHHHhcchhHHHHHHHHHHHHHH Q lcl|NC_016164. 595 GVTMLTGLQGPVAIPRQTGAATAYWVAEGGDPT-ESQPSVDQVALVAKTLGAYTEFSRRLMLQSSIDVEQMVRTELATVI 673 (836) Q Consensus 595 ~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~-~~~~~~~~it~~~~t~~~~i~ISrelL~ds~~~l~~~i~~~l~~a~ 673 (836) +. +.+. ++..++|+.++.+.+.|+.|.++.+ .++++|+++++.+++++++++||++||.|+..+++++|.+.|++++ T Consensus 120 ~~-v~~~-~~~~~i~~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~~~iS~ell~ds~~~ie~~i~~~la~~i 197 (395) T protein:vir:95 120 IN-FQNA-GIKTRVIKADPAGQAVWGKVFGEIKGQLDAAFREENFTQYKLTCFVVLPDDLSTFGPAWIERFVRTQIQEAI 197 (395) T ss_pred ce-eEec-CCceEEEEecCCcceEEeecccccCccccccceeeeeceeeEEEeecccHHHHhcchhHHHHHHHHHHHHHH Confidence 54 4444 4578999999999999999877764 6789999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhhcCCc-cccccccccccccccc----ccccchhHHHHHHHHHHHhhh------------ccccCccEEEe Q lcl|NC_016164. 674 ALEIDRAALYGLGSN-SQPEGLKFVTGINTEN----FGATNPTYVELVSMESKVAAD------------NADIGAMSYLT 736 (836) Q Consensus 674 a~~~d~~il~G~Gt~-~~p~Gi~~~~~~~~~t----~aa~~~t~~~l~~a~~~l~~~------------~~~~~~~~~vm 736 (836) +.+++.+|++|+|++ ++|.||++........ ..++.++++++..+...+... ....++..|+| T Consensus 198 a~~~~~a~i~G~G~~~~qP~Gil~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~m 277 (395) T protein:vir:95 198 SVALESAIINGGGAAKTQPVGLMKDVNTNSGAVTDKASSGTLTFADADTTILELNDVLKNLSVDEKGKELKIDGKVALVV 277 (395) T ss_pred HHHHhhheeeccCCCCcCceeeeecccccccccccccccchhhhhhhHhhHHHHHHHHHhhccccccchhhhcCceEEEE Confidence 999999999999987 4799999865433221 112233445444444333221 12345678999 Q ss_pred cHHHHHHHHHHhhccCcccccc--CCCCeec--ceeeEeeCccccceEEEEehhceEEEeecceEEEEecccccccCcEE Q lcl|NC_016164. 737 NSTLYGGFKTTEKATSTAQFVL--EPGGTVN--GYNVVRSNQVANGDVFFGVWNQMIMGMWGALDIQVNPYALDKSGSVR 812 (836) Q Consensus 737 np~~~~~L~~lkd~~g~~~~~~--~~~~~l~--G~pVv~s~~~~~~~i~~gD~s~~~i~~~~~l~i~~~~~~~~~~~~~~ 812 (836) ||.++. +..|++.+.+ +.+.+++ |+||++++.||+++++||||++|.+++++++++..+++.+|.+|++. T Consensus 278 n~~t~~------~~~g~~~~~~~~G~~~~~lg~g~~v~~~~~~p~~~i~fgdfs~y~i~~r~~~~i~~~~~~~~~~d~~~ 351 (395) T protein:vir:95 278 NPRDSW------DVQARYTYLTANGGFVTVLPYNVTIITSEFVPEGKLVAFVTDRYNAVRGGGLTVKKFDQTLALEDAVL 351 (395) T ss_pred cchhhh------hcCCcceeccCCCcceeccCCcceEEEcCCCCCCcEEEEecccEEEEEecceEEEeccchhhhCCcEE Confidence 999875 3345555433 2334565 56689999999999999999999999999999999999999999999 Q ss_pred EEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 813 VTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 813 ~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) ||+..|+|+++++++||++++..+ T Consensus 352 f~~~~r~dg~~~~~~A~~~l~i~~ 375 (395) T protein:vir:95 352 FTAKTFAYGQPDDNKASAVYDLKV 375 (395) T ss_pred EEEEEEECCEEeccccEEEEEeec Confidence 999999999999999999999877 No 82 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=100.00 E-value=4.3e-47 Score=274.80 Aligned_cols=386 Identities=10% Similarity=0.043 Sum_probs=223.6 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhHHHHHHHHhhhh Q lcl|NC_016164. 406 NHMDSSTIDMEAVRAQAAADERSRVASITSLCREHKADDLAQGLIESGASEADAMRSVLSEIAKRPAAQPATPAAPVRSA 485 (836) Q Consensus 406 ~~~~~~~~~~e~~~~~~~~~~~~~~~ei~al~~~~~l~e~a~eliee~~t~~e~~~~~l~~l~~~~~a~~~~~~~~~~~~ 485 (836) +.......+.++..............+++++......++.. ... +..+..++.+..+............... T Consensus 1 ~~l~e~i~e~~~~l~el~~~~~~~~~e~r~~~e~~~~~~~~-~~~-------~e~~~~~~~l~~ei~~l~e~~~~~~~~~ 72 (400) T protein:vir:38 1 MTLDEKLAAVKKQLDEKRSALPAMKTELRSLLEGEDSEENL-KKA-------EGVRAKYDKAGKEIKDLEEKRDLYEAAL 72 (400) T ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccchHH-HHH-------HHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 00000000000000001111111111111111111000000 000 0001111111111111111100000000 Q ss_pred hhHHHHhhhhhhhhhhhHHHHHHHhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhccc Q lcl|NC_016164. 486 QPIAAGGGSADIGLTDKEARSFSFVRAIRAQMMPGDRAAFEAAAFEREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTA 565 (836) Q Consensus 486 ~~~~~~~~~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~ 565 (836) .............. ................ ....... .......+. .................... T Consensus 73 ~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~-----------~~~~~~~-~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~ 138 (400) T protein:vir:38 73 KGNEQSSGKKPDHP-EEHSYRDALNAYLHTR-----------GRNTDGV-NFEKTDVGT-FAVLRAVPTDASDAVNAGVK 138 (400) T ss_pred HHHhhcccccccch-hhhhHHHHHHHHHhhH-----------HHHHHHH-HHHHHHHHH-HhhhhhhhHHHHHHHhhccc Confidence 00000000000000 0000000000000000 0000000 000000000 00001111111222333345 Q ss_pred ccccccccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEec-CCceeeeeccCccccc-ccccceeEEeeeeee Q lcl|NC_016164. 566 SAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQT-GAATAYWVAEGGDPTE-SQPSVDQVALVAKTL 643 (836) Q Consensus 566 ~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~-~~~~a~~v~Eg~~~~~-~~~~~~~it~~~~t~ 643 (836) ++.|++++|+.+...|++.+++.++++++ +++.+...+...+|... .++.+.|++|+++++. ++++|+++++.++++ T Consensus 139 ~~~gg~~vP~~~~~~ii~~~~~~~~l~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~i~~~~~k~ 217 (400) T protein:vir:38 139 AADAASTIPETISNTPQRELQTVVDLKPF-TNVFQASTQKGTYPTVANATTKMVTVAELEKNPAMAKPEFKPVNWSVETY 217 (400) T ss_pred ccCCcccccHHHHHHHHHHHHhhhhhhhc-ceeEeccCcceEEEEEecCCCccccccccccccccccccceeeEeehhhe Confidence 66688899999999999999999999998 44556666677777765 4467899999999986 679999999999999 Q ss_pred eeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccccccccccchhHHHHHHHHHHHh Q lcl|NC_016164. 644 GAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTENFGATNPTYVELVSMESKVA 723 (836) Q Consensus 644 ~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~aa~~~t~~~l~~a~~~l~ 723 (836) +++++||++||.|+.++++++|.+.|+++++.+++.+|++|+|+. .+. +..+++++.+++.... T Consensus 218 ~~~~~is~ell~ds~~~~~~~i~~~l~~~~~~~~~~~i~~~~~~~-~~~---------------~~~~~~~~~~~~~~~~ 281 (400) T protein:vir:38 218 RQALPVSQESIDDSAIDLVGLIAQNGQQIKVNTTNGAVATLLKGF-TAK---------------TISSVDDLKHINNVDL 281 (400) T ss_pred eeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhhhhccccc-ccc---------------ccccHHHHHHHHHhhh Confidence 999999999999999999999999999999999999999998853 221 2235777777776543 Q ss_pred hhccccCccEEEecHHHHHHHHHHhhccCcccccc----CCCCeecceeeEeeCccccc-----eEEEEehhc-eEEEee Q lcl|NC_016164. 724 ADNADIGAMSYLTNSTLYGGFKTTEKATSTAQFVL----EPGGTVNGYNVVRSNQVANG-----DVFFGVWNQ-MIMGMW 793 (836) Q Consensus 724 ~~~~~~~~~~~vmnp~~~~~L~~lkd~~g~~~~~~----~~~~~l~G~pVv~s~~~~~~-----~i~~gD~s~-~~i~~~ 793 (836) ... .+++|+|||.+|..|+.++|++|+|+|.. +.+++|+|+||++++.+|.+ .++||||++ |.++++ T Consensus 282 ~~~---~~a~~v~~~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~l~G~pv~~~~~~~~~~~g~~~~~~gd~s~~~~~~~~ 358 (400) T protein:vir:38 282 DPA---YSRVIIASQSFYNFLDTVKDGNGRYLLQDSILTPSGKSVLGMPIAVVSDDTLGAAGEAHAFLGDIKRAILFANR 358 (400) T ss_pred hhh---hCcEEEEcHHHHHHHHHhhccCCCeeeecCcCCCCccccccceeEEecccccCCCCceEEEEEeccccEEEEee Confidence 322 35799999999999999999999998843 34568999999999988743 379999997 678889 Q ss_pred cceEEEEecccccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 794 GALDIQVNPYALDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 794 ~~l~i~~~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) .++++.++++..|. ..||+++|+|+++.+|+||++++.+- T Consensus 359 ~~~~~~~~~~~~~~---~~~~~~~r~d~~~~~~~a~~~l~~~~ 398 (400) T protein:vir:38 359 ADFMVRWVDDQIYG---QFLQAGMRFGVSVADEKAGYFLTYTP 398 (400) T ss_pred cceEEEEecccccc---eeEEEEEEeccEEecccceEEEEeec Confidence 99999998876654 57999999999999999999999665 No 83 >protein:vir:9704 Length: 394 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795466;genbank:gi:28876225;genbank:GeneID:1257769 Probab=100.00 E-value=4.5e-47 Score=274.67 Aligned_cols=380 Identities=11% Similarity=0.038 Sum_probs=220.9 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhHHHHHHHHhhhhhhHH Q lcl|NC_016164. 410 SSTIDMEAVRAQAAADERSRVASITSLCREHKADDLAQGLIESGASEADAMRSVLSEIAKRPAAQPATPAAPVRSAQPIA 489 (836) Q Consensus 410 ~~~~~~e~~~~~~~~~~~~~~~ei~al~~~~~l~e~a~eliee~~t~~e~~~~~l~~l~~~~~a~~~~~~~~~~~~~~~~ 489 (836) +.............. ...++.+... +....+.++.....+.....++.+.++................... T Consensus 1 M~~~~l~el~~~l~e----~~~~i~~~~~-----e~~~~~~~~~~~~~~~l~~eie~l~~ei~~l~~~~~~~e~~~e~~~ 71 (394) T protein:vir:97 1 MFEEKIKEIKATIAD----LNNTIVTKTA-----QVKNALESDDLEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGG 71 (394) T ss_pred CcHHHHHHHHHHHHH----HHHHHHHHHH-----HHHHhhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhc Confidence 000000110000000 0000000000 0000000000000001111111111111111110000000000000 Q ss_pred HHhhhhhhhhhhhHHHHHHHhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhccccccc Q lcl|NC_016164. 490 AGGGSADIGLTDKEARSFSFVRAIRAQMMPGDRAAFEAAAFEREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTASAAG 569 (836) Q Consensus 490 ~~~~~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g 569 (836) ........................+.......... ......+.. +.. .............+...| T Consensus 72 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~-~~~--------~~~~~~~~~~~~~t~~~g 136 (394) T protein:vir:97 72 AENIGGKEVTQEEKTYRESVNDFIRSKGKIVNDSL------RFEGKDEVL-MPI--------NETTPVEPQKDGIKKENA 136 (394) T ss_pred cccccccccchhhHHHHHHHHHHHHHHHHHhhhhh------hhhhHHHHH-HHH--------Hhhhhhhhhccccccccc Confidence 00000000000000000000011110000000000 000000000 000 000111122233455668 Q ss_pred ccccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEec-CCceeeeeccCccccc-ccccceeEEeeeeeeeeee Q lcl|NC_016164. 570 DLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQT-GAATAYWVAEGGDPTE-SQPSVDQVALVAKTLGAYT 647 (836) Q Consensus 570 ~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~-~~~~a~~v~Eg~~~~~-~~~~~~~it~~~~t~~~~i 647 (836) ++++|+.+...|++.+++.+++++++ ++++...+...+|... +++.+.|++|++++++ ++++|+.+++.++++++++ T Consensus 137 g~liP~~~~~~ii~~~~~~~~l~~~~-~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~i 215 (394) T protein:vir:97 137 KPVSSEEILYTPAREVKTVVDLKPFT-TVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAI 215 (394) T ss_pred cccChHHHHHHHHHHhhhhhhhhhhc-eeeeccCcceEEEEEecCCCccceecccccccccccccceeEEeehhheeeeh Confidence 88999999999999999999999984 4566666777888765 4567899999999996 6799999999999999999 Q ss_pred hhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccccccccccchhHHHHHHHHHHHhhhcc Q lcl|NC_016164. 648 EFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTENFGATNPTYVELVSMESKVAADNA 727 (836) Q Consensus 648 ~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~aa~~~t~~~l~~a~~~l~~~~~ 727 (836) +||++||.|+.++++++|.+.|+++++++++.+|++|.+++. + .+..+++++.+++...... T Consensus 216 ~is~ell~ds~~~~~~~i~~~la~~~~~~~~~~i~~g~~~~~-~---------------~~~~~~~~~~~~~~~~~~~-- 277 (394) T protein:vir:97 216 PLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVLKSFT-T---------------KTVKNLDEIKALLNGGFDP-- 277 (394) T ss_pred hhHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhhcccccc-c---------------cccccHHHHHHHHHhhhhh-- Confidence 999999999999999999999999999999999999887531 1 2234678888888765432 Q ss_pred ccCccEEEecHHHHHHHHHHhhccCcccccc----CCCCeecceeeEeeCc--cccceEEEEehhc-eEEEeecceEEEE Q lcl|NC_016164. 728 DIGAMSYLTNSTLYGGFKTTEKATSTAQFVL----EPGGTVNGYNVVRSNQ--VANGDVFFGVWNQ-MIMGMWGALDIQV 800 (836) Q Consensus 728 ~~~~~~~vmnp~~~~~L~~lkd~~g~~~~~~----~~~~~l~G~pVv~s~~--~~~~~i~~gD~s~-~~i~~~~~l~i~~ 800 (836) ..++.|+|||++|..|+.++|++|+|+|.. +.+++|+|+||++++. ++.+.++||||++ |.++.+.++++.+ T Consensus 278 -~~~a~~v~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~~~~ 356 (394) T protein:vir:97 278 -AYNVSLIVSQSFYQTLDTLKDGNGRYLLQDDITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFADRKDLGLRW 356 (394) T ss_pred -hhCCEEEEcHHHHHHHHHhhccCCCeeeecCcCCCCCceeccceeEEecccccCCccEEEeeccccEEEEEecceEEEE Confidence 235789999999999999999999998743 3356899999999554 6677899999987 6789999999998 Q ss_pred ecccccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 801 NPYALDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 801 ~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) +++..+ ...||+++|+|+++.+|+||++++..- T Consensus 357 ~~~~~~---~~~~~~~~r~d~~v~~~~a~~~~~~~~ 389 (394) T protein:vir:97 357 ADNEIY---GQYLQAVLRFGVSKVDDKAGYYVTFTP 389 (394) T ss_pred eccccc---ceeEEEEEEEccEEecccceEEEEecc Confidence 876554 457999999999999999999888544 No 84 >protein:vir:101291 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908831;genbank:gi:118725095;genbank:GeneID:4555862 Probab=100.00 E-value=2.3e-48 Score=281.73 Aligned_cols=343 Identities=14% Similarity=0.056 Sum_probs=224.9 Q ss_pred hhhhhHHHHHHHHHHHhhhhhhhHHHHHHHHhhhhhhHHHHhhhhhhhhhhhHHHHHHHhhhhhhhhhhhhhhhhhhhhh Q lcl|NC_016164. 451 ESGASEADAMRSVLSEIAKRPAAQPATPAAPVRSAQPIAAGGGSADIGLTDKEARSFSFVRAIRAQMMPGDRAAFEAAAF 530 (836) Q Consensus 451 ee~~t~~e~~~~~l~~l~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~ 530 (836) .... ..+...+...+........... ........ .... .......... T Consensus 1 m~ik--------~~~~~~~~~~e~~~~~~~~~~~--~~~~~~~~-------------~~~~---~~~~~~~~~~------ 48 (381) T protein:vir:10 1 MTIN--------LSETFANAKNEFINAVNNGEPQ--ERQNELYG-------------DMIN---QLFEETKLQA------ 48 (381) T ss_pred Cchh--------hHHHHHHHHHHHHHHHhhhhhh--HHHHHHHH-------------HHHH---hhhhhHHHHH------ Confidence 0000 0111110000000000000000 00000000 0000 0000000000 Q ss_pred hHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEE Q lcl|NC_016164. 531 EREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPR 610 (836) Q Consensus 531 ~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~ 610 (836) ..+.........+. ..... ...........++.+.|++++|+.+.+.|++.+++.+++++++. +.+. ++...+|+ T Consensus 49 ~~e~~~~~~~~~~~--~~lt~-~e~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~-v~~~-~~~~~i~~ 123 (381) T protein:vir:10 49 KAEAERVSSLPKSA--QSLSA-NQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLG-IKNA-GLRLKFLK 123 (381) T ss_pred HHHHHHHHHhccCc--ccccH-HHHHHHHHHhcccCCCCceecCHHHHHHHHHHHHhhccceehee-eEec-CcceEEEE Confidence 00000011100000 00000 01111112233455678899999999999999999999999854 4444 46789999 Q ss_pred ecCCceeeeeccCcccc-cccccceeEEeeeeeeeeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcc Q lcl|NC_016164. 611 QTGAATAYWVAEGGDPT-ESQPSVDQVALVAKTLGAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSNS 689 (836) Q Consensus 611 ~~~~~~a~~v~Eg~~~~-~~~~~~~~it~~~~t~~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~ 689 (836) .++.+.+.|++|+++.+ .++++|+++++.+++++++++||++||.|+..++++||...|+++++.+++.+|++|+|+ + T Consensus 124 ~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~~~is~elL~Ds~~~ie~~i~~~la~~~a~~~~~a~i~G~G~-~ 202 (381) T protein:vir:10 124 SETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGK-D 202 (381) T ss_pred ecCCcceeeecccccccccccccceeeeecceeEEeechhhHHHhhcCHHHHHHHHHHHHHHHHHHHhhheeEeccCC-C Confidence 99999999999998876 568999999999999999999999999999999999999999999999999999999997 5 Q ss_pred cccccccccccc-cccc-------cccch-------hHHHHHHHHHHHhhhc-----cccCccEEEecHHHHHHHHHHhh Q lcl|NC_016164. 690 QPEGLKFVTGIN-TENF-------GATNP-------TYVELVSMESKVAADN-----ADIGAMSYLTNSTLYGGFKTTEK 749 (836) Q Consensus 690 ~p~Gi~~~~~~~-~~t~-------aa~~~-------t~~~l~~a~~~l~~~~-----~~~~~~~~vmnp~~~~~L~~lkd 749 (836) +|.||++..... ..+. +.+.+ .++.|..++..+...+ .+.++++|+|||.++..|..+++ T Consensus 203 qP~Gil~~~~~~~~~~~g~~~~~~~~~t~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l~~~~~ 282 (381) T protein:vir:10 203 QPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYT 282 (381) T ss_pred CceeeeeccCcccccccccccccccccccccccchhhHHHHHHHHHhhccccccccccccCceEEEEccccHHhhccccc Confidence 899998753321 1110 11112 2344555555443321 23467889999999988876553 Q ss_pred ---ccCccccccCCCCeecceeeEeeCccccceEEEEehhceEEEeecceEEEEecccccccCcEEEEEEEEeccEEEcc Q lcl|NC_016164. 750 ---ATSTAQFVLEPGGTVNGYNVVRSNQVANGDVFFGVWNQMIMGMWGALDIQVNPYALDKSGSVRVTALQDVDVAVRHP 826 (836) Q Consensus 750 ---~~g~~~~~~~~~~~l~G~pVv~s~~~~~~~i~~gD~s~~~i~~~~~l~i~~~~~~~~~~~~~~~r~~~r~d~~v~~p 826 (836) ++|+ |++..+ +|++|+.++.||+++++||||++|.+++++++++..+++.+|.+|++.||++.|+|++++++ T Consensus 283 ~~~~~G~--~v~~l~---~g~~vv~s~~~p~~~iifgDfs~Y~i~~r~~~~i~~~~~~~~~~d~~~f~a~~r~dg~~~~~ 357 (381) T protein:vir:10 283 HLNANGV--YVTALP---FNLNVIESTVQEAGKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDN 357 (381) T ss_pred cCCCCCc--eeecCC---CCceEEecCCCCcCcEEEEecccEEEEEecccEEEeechhHhhcCCeEEEEEEEEcCEEecC Confidence 3443 444332 47789999999999999999999999999999999999999999999999999999999999 Q ss_pred cceEEEeecC Q lcl|NC_016164. 827 EAFCRGNDNL 836 (836) Q Consensus 827 ~Af~~l~~A~ 836 (836) +||++++-.. T Consensus 358 ~A~~v~~l~~ 367 (381) T protein:vir:10 358 KVAAVWKLDL 367 (381) T ss_pred ceEEEEEEEe Confidence 9999999777 No 85 >protein:vir:9509 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835556;genbank:gi:30043951;genbank:GeneID:1260537 Probab=100.00 E-value=2.3e-48 Score=281.73 Aligned_cols=343 Identities=14% Similarity=0.056 Sum_probs=224.9 Q ss_pred hhhhhHHHHHHHHHHHhhhhhhhHHHHHHHHhhhhhhHHHHhhhhhhhhhhhHHHHHHHhhhhhhhhhhhhhhhhhhhhh Q lcl|NC_016164. 451 ESGASEADAMRSVLSEIAKRPAAQPATPAAPVRSAQPIAAGGGSADIGLTDKEARSFSFVRAIRAQMMPGDRAAFEAAAF 530 (836) Q Consensus 451 ee~~t~~e~~~~~l~~l~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~ 530 (836) .... ..+...+...+........... ........ .... .......... T Consensus 1 m~ik--------~~~~~~~~~~e~~~~~~~~~~~--~~~~~~~~-------------~~~~---~~~~~~~~~~------ 48 (381) T protein:vir:95 1 MTIN--------LSETFANAKNEFINAVNNGEPQ--ERQNELYG-------------DMIN---QLFEETKLQA------ 48 (381) T ss_pred Cchh--------hHHHHHHHHHHHHHHHhhhhhh--HHHHHHHH-------------HHHH---hhhhhHHHHH------ Confidence 0000 0111110000000000000000 00000000 0000 0000000000 Q ss_pred hHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEE Q lcl|NC_016164. 531 EREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPR 610 (836) Q Consensus 531 ~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~ 610 (836) ..+.........+. ..... ...........++.+.|++++|+.+.+.|++.+++.+++++++. +.+. ++...+|+ T Consensus 49 ~~e~~~~~~~~~~~--~~lt~-~e~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~-v~~~-~~~~~i~~ 123 (381) T protein:vir:95 49 KAEAERVSSLPKSA--QSLSA-NQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLG-IKNA-GLRLKFLK 123 (381) T ss_pred HHHHHHHHHhccCc--ccccH-HHHHHHHHHhcccCCCCceecCHHHHHHHHHHHHhhccceehee-eEec-CcceEEEE Confidence 00000011100000 00000 01111112233455678899999999999999999999999854 4444 46789999 Q ss_pred ecCCceeeeeccCcccc-cccccceeEEeeeeeeeeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcc Q lcl|NC_016164. 611 QTGAATAYWVAEGGDPT-ESQPSVDQVALVAKTLGAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSNS 689 (836) Q Consensus 611 ~~~~~~a~~v~Eg~~~~-~~~~~~~~it~~~~t~~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~ 689 (836) .++.+.+.|++|+++.+ .++++|+++++.+++++++++||++||.|+..++++||...|+++++.+++.+|++|+|+ + T Consensus 124 ~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~~~is~elL~Ds~~~ie~~i~~~la~~~a~~~~~a~i~G~G~-~ 202 (381) T protein:vir:95 124 SETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGK-D 202 (381) T ss_pred ecCCcceeeecccccccccccccceeeeecceeEEeechhhHHHhhcCHHHHHHHHHHHHHHHHHHHhhheeEeccCC-C Confidence 99999999999998876 568999999999999999999999999999999999999999999999999999999997 5 Q ss_pred cccccccccccc-cccc-------cccch-------hHHHHHHHHHHHhhhc-----cccCccEEEecHHHHHHHHHHhh Q lcl|NC_016164. 690 QPEGLKFVTGIN-TENF-------GATNP-------TYVELVSMESKVAADN-----ADIGAMSYLTNSTLYGGFKTTEK 749 (836) Q Consensus 690 ~p~Gi~~~~~~~-~~t~-------aa~~~-------t~~~l~~a~~~l~~~~-----~~~~~~~~vmnp~~~~~L~~lkd 749 (836) +|.||++..... ..+. +.+.+ .++.|..++..+...+ .+.++++|+|||.++..|..+++ T Consensus 203 qP~Gil~~~~~~~~~~~g~~~~~~~~~t~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l~~~~~ 282 (381) T protein:vir:95 203 QPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYT 282 (381) T ss_pred CceeeeeccCcccccccccccccccccccccccchhhHHHHHHHHHhhccccccccccccCceEEEEccccHHhhccccc Confidence 899998753321 1110 11112 2344555555443321 23467889999999988876553 Q ss_pred ---ccCccccccCCCCeecceeeEeeCccccceEEEEehhceEEEeecceEEEEecccccccCcEEEEEEEEeccEEEcc Q lcl|NC_016164. 750 ---ATSTAQFVLEPGGTVNGYNVVRSNQVANGDVFFGVWNQMIMGMWGALDIQVNPYALDKSGSVRVTALQDVDVAVRHP 826 (836) Q Consensus 750 ---~~g~~~~~~~~~~~l~G~pVv~s~~~~~~~i~~gD~s~~~i~~~~~l~i~~~~~~~~~~~~~~~r~~~r~d~~v~~p 826 (836) ++|+ |++..+ +|++|+.++.||+++++||||++|.+++++++++..+++.+|.+|++.||++.|+|++++++ T Consensus 283 ~~~~~G~--~v~~l~---~g~~vv~s~~~p~~~iifgDfs~Y~i~~r~~~~i~~~~~~~~~~d~~~f~a~~r~dg~~~~~ 357 (381) T protein:vir:95 283 HLNANGV--YVTALP---FNLNVIESTVQEAGKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDN 357 (381) T ss_pred cCCCCCc--eeecCC---CCceEEecCCCCcCcEEEEecccEEEEEecccEEEeechhHhhcCCeEEEEEEEEcCEEecC Confidence 3443 444332 47789999999999999999999999999999999999999999999999999999999999 Q ss_pred cceEEEeecC Q lcl|NC_016164. 827 EAFCRGNDNL 836 (836) Q Consensus 827 ~Af~~l~~A~ 836 (836) +||++++-.. T Consensus 358 ~A~~v~~l~~ 367 (381) T protein:vir:95 358 KVAAVWKLDL 367 (381) T ss_pred ceEEEEEEEe Confidence 9999999777 No 86 >protein:vir:100172 Length: 394 # NCBI annotation: putative major head protein # Family: family:all:21 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025031;genbank:gi:48697264;genbank:GeneID:2948270 Probab=100.00 E-value=1e-46 Score=272.78 Aligned_cols=366 Identities=11% Similarity=0.047 Sum_probs=217.0 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhHHHHHHHHhhhhhhHHHH Q lcl|NC_016164. 412 TIDMEAVRAQAAADERSRVASITSLCREHKADDLAQGLIESGASEADAMRSVLSEIAKRPAAQPATPAAPVRSAQPIAAG 491 (836) Q Consensus 412 ~~~~e~~~~~~~~~~~~~~~ei~al~~~~~l~e~a~eliee~~t~~e~~~~~l~~l~~~~~a~~~~~~~~~~~~~~~~~~ 491 (836) +.+..... .+......++++..... ..+. .......+.....++.+..+............... T Consensus 1 M~~l~~l~----~~~~~~~~e~~~~~~~~-----~~~~-~~~~ee~~~~~~~~~~~~~~~~~l~~~i~~~e~~~------ 64 (394) T protein:vir:10 1 MDKLQTLF----NEVSAKCADLNAQLNAK-----LQDE-NASVDDFQKIKDDLTAAKARRDAINDQIKDLEAEN------ 64 (394) T ss_pred ChHHHHHH----HHHHHHHHHHHHHHHHH-----Hhhh-hccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------ Confidence 00001000 01111111111111000 0000 00000000001111111111111110000000000 Q ss_pred hhhhhhhhhhhHHHHHHHhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhccccccccc Q lcl|NC_016164. 492 GGSADIGLTDKEARSFSFVRAIRAQMMPGDRAAFEAAAFEREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTASAAGDL 571 (836) Q Consensus 492 ~~~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~~ 571 (836) ........ ....... ..............+++.... .+. ..... ......+++.|++ T Consensus 65 --~~~~~~~~------~~~~~~~-------~~~~~~~~~~~~~~~~~~~~l----~~~---~~~~~-~~~~~~t~~~gg~ 121 (394) T protein:vir:10 65 --KANSDPDK------PVDNAQP-------NGTDLKKKPIDAKKKAINDFI----HSH---GKVID-NAAGHVTSTEAGV 121 (394) T ss_pred --Hhhcchhh------hhhhhcc-------cccchhhhHHHHHHHHHHHHH----hcc---chhhh-hhhcccccccCce Confidence 00000000 0000000 000000000000111111110 010 01111 2233345567788 Q ss_pred ccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEecC-CceeeeeccCccccc-ccccceeEEeeeeeeeeeehh Q lcl|NC_016164. 572 VFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQTG-AATAYWVAEGGDPTE-SQPSVDQVALVAKTLGAYTEF 649 (836) Q Consensus 572 vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~~-~~~a~~v~Eg~~~~~-~~~~~~~it~~~~t~~~~i~I 649 (836) ++|+.+...|++.+++.++|++++. +.+..++...+|.... ...+.|++|++++++ ++++|+++++.+++++++++| T Consensus 122 ~vP~~~~~~ii~~~~~~~~l~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v~l~~~k~~~~~~i 200 (394) T protein:vir:10 122 LIPEEIIYDPTAEVNSVVDLSTLVT-KTPVTTPKGTYPILKRATDRFSSVAELAENPALAEPEFEQVDWSVSTYRGAIPL 200 (394) T ss_pred eccHHHHHHHHHHHHhhhhhhhhce-eeeccCCceEEEEEecCCCccccccccccccccccccceeEEeeeeeeEeeehh Confidence 8999999999999999999999854 4455555566665553 467899999999996 679999999999999999999 Q ss_pred HHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccccccccccchhHHHHHHHHHHHhhhcccc Q lcl|NC_016164. 650 SRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTENFGATNPTYVELVSMESKVAADNADI 729 (836) Q Consensus 650 SrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~aa~~~t~~~l~~a~~~l~~~~~~~ 729 (836) |+++|.|+.+++.++|.+.|+++++++++.+|++|+|++ .+.++ .+..++++|.+++....... T Consensus 201 S~ell~ds~~~l~~~i~~~la~~~~~~~~~~il~g~g~~-~~~~~------------~~~~~~d~l~~~~~~~~~~~--- 264 (394) T protein:vir:10 201 SEEAIADSAVDLTSLVGQSINEKSVNTYNAMIAPVLQSF-TAKAT------------TTDTLVDSLKHILNVDLDPA--- 264 (394) T ss_pred HHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhhccccc-ccccc------------cccccHHHHHHHHHhhhhhh--- Confidence 999999999999999999999999999999999999863 23222 23346778888776443332 Q ss_pred CccEEEecHHHHHHHHHHhhccCccccccC--------CCCeecceeeEeeCcc--cc----ceEEEEehhc-eEEEeec Q lcl|NC_016164. 730 GAMSYLTNSTLYGGFKTTEKATSTAQFVLE--------PGGTVNGYNVVRSNQV--AN----GDVFFGVWNQ-MIMGMWG 794 (836) Q Consensus 730 ~~~~~vmnp~~~~~L~~lkd~~g~~~~~~~--------~~~~l~G~pVv~s~~~--~~----~~i~~gD~s~-~~i~~~~ 794 (836) .+++|+|||++|..|+.++|++|+|+|... .+++|+|+||++++.. +. ..++||||++ |.++++. T Consensus 265 ~~a~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~L~G~PV~~~~~~~~~~~~~~~~i~~gd~s~~~~~~~~~ 344 (394) T protein:vir:10 265 YSRALVVTQSLFNTLDTLKDKNGRYLLHDASDSITDGTAKGTVLGVPVYVVGDALLGSAAGDQKAFVGDLKRGVLFADRQ 344 (394) T ss_pred ccCEEEecHHHHHHHHHhhccCCCeeeeccccccccCCcccccccceeEEecccccCCCCCceEEEEeeccccEEEEeec Confidence 247899999999999999999999987432 2358999999987653 22 2389999997 6788899 Q ss_pred ceEEEEecccccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 795 ALDIQVNPYALDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 795 ~l~i~~~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) ++++.++++..|.+ .|+++.|+|+++++|+||++++.+= T Consensus 345 ~~~v~~~~~~~~~~---~~~~~~r~d~~~~~~~ai~~~~~~~ 383 (394) T protein:vir:10 345 QVTLAWEDSKIYGR---YLGAAFRFGVKQADSNAGYFVTNTD 383 (394) T ss_pred ceEEEEecccccce---eEEEEEEeccEEeccccEEEEEeec Confidence 99999988877654 5899999999999999999987544 No 87 >protein:vir:1084 Length: 437 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076738;genbank:gi:13095848;genbank:GeneID:920418 Probab=100.00 E-value=5.1e-46 Score=268.91 Aligned_cols=396 Identities=11% Similarity=0.036 Sum_probs=212.6 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhh------------hhh---hhhh Q lcl|NC_016164. 383 ATSSSGPPGAAAATVAPLSHNDNNHMDSSTIDMEAVRAQAAADERSRVASITSLCREH------------KAD---DLAQ 447 (836) Q Consensus 383 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~ei~al~~~~------------~l~---e~a~ 447 (836) ++.. +..+.......+......+++++.... .++ +... T Consensus 1 Mki~---------------------------elk~el~~~~~el~~~~~elr~~~~~~~~~~~el~~~~~e~~~~~~ei~ 53 (437) T protein:vir:10 1 MKIE---------------------------KLKKDLATKTAELNTKKAEIRSFTESEDKTIDEVKAGMTEIKEKEDEIK 53 (437) T ss_pred CCHH---------------------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 0000 000000000000000000000000000 000 0000 Q ss_pred hhhhhhhhHHHHHHHHHHHhhhhhhhHHHHHHHHhhhhhhHHHHhhhhhhhhhhhHHHHHHHhhhhhhhhhhhhhhhhhh Q lcl|NC_016164. 448 GLIESGASEADAMRSVLSEIAKRPAAQPATPAAPVRSAQPIAAGGGSADIGLTDKEARSFSFVRAIRAQMMPGDRAAFEA 527 (836) Q Consensus 448 eliee~~t~~e~~~~~l~~l~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~ 527 (836) ++..............++................... .............. ............. ....... T Consensus 54 el~~~l~~~~~~~~~~~e~~~~~~~~~~~e~~~~~~~-------~e~~~~~~~~~~~~-~~~~~~~~~~~~~-~~~~~~~ 124 (437) T protein:vir:10 54 EIRSNIEVLEQASALKVEEKRDDSDLVAPELEENSAD-------NEEDDPEKLKTETK-SEAEKDKKTVKDE-EKRDAGG 124 (437) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHH-HHHHHHHHHHHHH-HHHhHHH Confidence 0000000000000000000000000000000000000 00000000000000 0000000000000 0000000 Q ss_pred hhhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEE Q lcl|NC_016164. 528 AAFEREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVA 607 (836) Q Consensus 528 ~~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~ 607 (836) . ......................................|++++|..+...+. .++..+.+++++ ++.+...+... T Consensus 125 ~--~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~g~lvp~~~~~~i~-~~~~~~~l~~~~-~~~~~~~~~~~ 200 (437) T protein:vir:10 125 L--QDMKLKVGGEIADKKVTAFADYLKTGEVRDVTGIALKDGKVIIPETILTPEK-EVHQFPRLGSLV-RTESVTTTTGK 200 (437) T ss_pred H--hHHHHHHHHHHHHhhhhhhHHHHHhhhhhhhhhcccccccccchHHHHHHHH-Hhhhhhhhhhcc-eeEeeccCcee Confidence 0 0000000000000000000000011111223334456677788998887665 457777888874 44555566677 Q ss_pred EEEec-CCceeeeeccCccccc-ccccceeEEeeeeeeeeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_016164. 608 IPRQT-GAATAYWVAEGGDPTE-SQPSVDQVALVAKTLGAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGL 685 (836) Q Consensus 608 ~p~~~-~~~~a~~v~Eg~~~~~-~~~~~~~it~~~~t~~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~ 685 (836) +|... .++.+.|++|++..++ ++++|+++++.+++++++++||+++|.|+.+++.++|.+.|+++++.+++.+|++|+ T Consensus 201 ~~~~~~~~~~~~~~~e~~~~~e~~~~~~~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~~~~~~~~~~i~~g~ 280 (437) T protein:vir:10 201 LPIFNNSTDLLTAHTEYGQTTKNATPVITPILWDLKTYTGGYVFSQELISDSSYDWQAELQSRLIELRDNTDDSLIITAL 280 (437) T ss_pred eEEeeccccccccccccccccccccccceeeeeehhheeeehhhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhh Confidence 77764 4567899999999986 568999999999999999999999999999999999999999999999999999999 Q ss_pred CCcccccccccccccccccccccchhHHHHHHHHH-HHhhhccccCccEEEecHHHHHHHHHHhhccCcccccc----CC Q lcl|NC_016164. 686 GSNSQPEGLKFVTGINTENFGATNPTYVELVSMES-KVAADNADIGAMSYLTNSTLYGGFKTTEKATSTAQFVL----EP 760 (836) Q Consensus 686 Gt~~~p~Gi~~~~~~~~~t~aa~~~t~~~l~~a~~-~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~~g~~~~~~----~~ 760 (836) |++ .+. ..+..+++++.+++. .+...+ ..+++|+|||++|..|+.++|++|+|+|.+ +. T Consensus 281 g~~-~~~-------------~~~~~~~~~~~~~~~~~l~~~~--~~~~~~~~~~~~~~~l~~lkd~~g~~~~~~~~~~~~ 344 (437) T protein:vir:10 281 TDG-IKK-------------TTSTYLLGDLKKVLNVTLKPQD--SAAASIVMSQSAYNLFDMATDAMGRPLLQPNVTAAT 344 (437) T ss_pred ccc-ccc-------------cccccchhhHHHHHHhhhhhhh--hcCCEEEEcHHHHHHHHHhhccCCCeeeccCccCCC Confidence 863 221 112234566667664 555554 356799999999999999999999998853 34 Q ss_pred CCeecceeeEeeCcc--ccc-----eEEEEehhc-eEEEeecceEEEEecccccccCcEEEEEEEEeccEEEcccceEEE Q lcl|NC_016164. 761 GGTVNGYNVVRSNQV--ANG-----DVFFGVWNQ-MIMGMWGALDIQVNPYALDKSGSVRVTALQDVDVAVRHPEAFCRG 832 (836) Q Consensus 761 ~~~l~G~pVv~s~~~--~~~-----~i~~gD~s~-~~i~~~~~l~i~~~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l 832 (836) +++|+|+||++++++ |.. .++||||+. |.++++.++.+.++++ +..+.+.+++.+|+|+++++|+||+++ T Consensus 345 ~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~r~~~~~~~~~~--~~~~~~~~~~~~r~d~~~~~~~a~~~l 422 (437) T protein:vir:10 345 GYTLLGKTVVIVDDKLFPSASAGDVNIVVAPLKKAVINFKLTEITGQFQDT--YDIWYKQLGIFLRQNVVQASKDLIVNL 422 (437) T ss_pred CcccccceeEEecccccCCcCCCceEEEEeeccccEEEEeeeceEEEEecc--cccccceeeEEEEEccEEecccceEEE Confidence 568999999997654 422 389999996 6688899999987764 556778999999999999999999998 Q ss_pred e---ecC Q lcl|NC_016164. 833 N---DNL 836 (836) Q Consensus 833 ~---~A~ 836 (836) + .|+ T Consensus 423 ~~~~~~~ 429 (437) T protein:vir:10 423 TGKLKAV 429 (437) T ss_pred Eeecccc Confidence 8 333 No 88 >protein:vir:99920 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655524;genbank:gi:109392294;genbank:GeneID:4157089 Probab=100.00 E-value=4.2e-48 Score=280.32 Aligned_cols=273 Identities=12% Similarity=0.111 Sum_probs=222.3 Q ss_pred hhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEecCCceeeeeccCcccccccccceeEEeee Q lcl|NC_016164. 561 VVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQTGAATAYWVAEGGDPTESQPSVDQVALVA 640 (836) Q Consensus 561 ~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~it~~~ 640 (836) +. +.+++++.++|+.+.+.|++.+++.+++++++.+ ++...+.+++|+.++.+.+.|++|++++++++++|+++++.+ T Consensus 1 Ma-t~tt~~g~~vP~~~~~~ii~~~~~~s~l~~~~~~-i~~~~~~~~~p~~~~~~~a~wv~Eg~~~~~~~~~f~~v~l~~ 78 (311) T protein:vir:99 1 MA-TFGTGNLKNLPRNIADGMVKDVVQGSTVAVLSAR-KPQRFGNEDIITFNGRPKAEFVGEGQQKSSTTGEFDFVTSTP 78 (311) T ss_pred Cc-eecCCCceeccHHHHHHHHHHHHhhchhhhhcce-eeccCCceEEEEEeCCceeEEeecCcccccccceeeEEEEee Confidence 22 3345566788999999999999999999999655 556667899999999999999999999999999999999999 Q ss_pred eeeeeeehhHHHHHh---cchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCc--ccccccccccc--ccccc--ccccchh Q lcl|NC_016164. 641 KTLGAYTEFSRRLML---QSSIDVEQMVRTELATVIALEIDRAALYGLGSN--SQPEGLKFVTG--INTEN--FGATNPT 711 (836) Q Consensus 641 ~t~~~~i~ISrelL~---ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~--~~p~Gi~~~~~--~~~~t--~aa~~~t 711 (836) ++++++++||+|+|. ++..+++++|.++|++++++++|.++|+|+|++ ..+.|+.+... .+.++ ....... T Consensus 79 ~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~g~~~g~~~~g~~~~~~~~~~~~~~~~~~~~~~ 158 (311) T protein:vir:99 79 KKAQVTMRFNEEVQWADEDYQLGVLQTLSEAGAEALARALDLGLYHRINPLTGTVIPGWSNYLGAASKRVELTADTIANP 158 (311) T ss_pred EEEEEeehhhHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccCccccccccccccccceeeccccccchh Confidence 999999999999995 567899999999999999999999999998853 33455443321 12222 2222334 Q ss_pred HHHHHHHHHHHhhhccccCccEEEecHHHHHHHHHHhhccCcccccc----CCCCeecceeeEeeCccccc--------- Q lcl|NC_016164. 712 YVELVSMESKVAADNADIGAMSYLTNSTLYGGFKTTEKATSTAQFVL----EPGGTVNGYNVVRSNQVANG--------- 778 (836) Q Consensus 712 ~~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~~g~~~~~~----~~~~~l~G~pVv~s~~~~~~--------- 778 (836) +.++.+++..+..++.......|+|||.++..|++++|.+|+|+|.. +.+++|+|+||++++.+|.+ T Consensus 159 ~~~i~~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~Pv~~s~~i~~~~~~~~~~~~ 238 (311) T protein:vir:99 159 DLAIEAAVGLLVANGHPTPVNGLALHPSIAWGLSTARYTDGRKKFPELGLGIGVSSFEGIDASVSDTVNGGDEADPDDED 238 (311) T ss_pred HHHHHHHHHHHhhhccCCCccEEEEcHHHHHHHHhhhccCCCeeecCcccCCCCceecceeeEeecccccccccccccch Confidence 67788888887777666666779999999999999999999998754 23468999999999887632 Q ss_pred -------eEEEEehhc-eEEEeecceEEEEeccc-------ccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 779 -------DVFFGVWNQ-MIMGMWGALDIQVNPYA-------LDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 779 -------~i~~gD~s~-~~i~~~~~l~i~~~~~~-------~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) .+++|||++ +.++.+.++++..+++. .|++|++.||++.|+|+++.+| +|++++++. T Consensus 239 ~~~~~~~~~~~Gdf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~r~~~r~d~~v~~~-~~v~~~~~~ 310 (311) T protein:vir:99 239 LDAARAVRGIVGDFANGIHWGVQRDIPVELIKYGDPDGQGDLKRHNQIALRLEIVYGWYVFTD-RFVVIENAV 310 (311) T ss_pred hhccCcceEEEeeccccEEEEEecCceEEEeecCCCCcchhhhhcCcEEEEEEEeecceecCh-hHeeeeccc Confidence 357899987 55788888888887653 4899999999999999999986 688888888 No 89 >protein:vir:78523 Length: 338 # NCBI annotation: Putative head structural protein # Family: family:all:507 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491585;genbank:gi:157786408;genbank:GeneID:5625675 Probab=100.00 E-value=6.7e-48 Score=279.19 Aligned_cols=289 Identities=15% Similarity=0.137 Sum_probs=225.9 Q ss_pred hhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEE Q lcl|NC_016164. 529 AFEREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAI 608 (836) Q Consensus 529 ~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~ 608 (836) +. ...++. ....+ ........+.++.++|+.+.+.|++.+++.+++++++. +.+..++.+++ T Consensus 1 ~~---~~~e~~----~~~~~----------~~~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~-~~~~~~~~~~i 62 (338) T protein:vir:78 1 MA---TLNELA----PNTAG----------SNHQGRLAHVPSDLLPKEIVGPIFDKAQESSLVLRLGE-NIPISYGETII 62 (338) T ss_pred Cc---chHHhh----hhhcc----------cccccceecccccccchHHHHHHHHHHHhhchhhhhcc-eeeccCCceEE Confidence 00 000000 00000 00011122334558999999999999999999999964 56777888999 Q ss_pred EEecCCc--------eeeeeccCcccccccccceeEEeeeeeeeeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_016164. 609 PRQTGAA--------TAYWVAEGGDPTESQPSVDQVALVAKTLGAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRA 680 (836) Q Consensus 609 p~~~~~~--------~a~~v~Eg~~~~~~~~~~~~it~~~~t~~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~ 680 (836) |+.+..+ .+.|++|++++++++++|+++++++++++++++||+|+|.++.++++++|.+.|+++++++++.+ T Consensus 63 p~~~~~~~a~~v~~~~~~~~~Eg~~~~~~~~~f~~v~l~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~a~~~~~d~~ 142 (338) T protein:vir:78 63 PTTVKRPEVGQVGVGTSNEQREGGTKPLSGTAWDTRSVAPIKLATIVTVSEEFARMNPSGLYTKLQADLAYAIGRGIDLA 142 (338) T ss_pred EEEecCccceeecccccccccccccccccccceeEEEEEEEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHH Confidence 9987654 45567899999999999999999999999999999999999999999999999999999999999 Q ss_pred HHhhcCCc--ccccccccccccccccc-----cccchhHHHHHHHHHHHhhhccccCccEEEecHHHHHHHH---HHhhc Q lcl|NC_016164. 681 ALYGLGSN--SQPEGLKFVTGINTENF-----GATNPTYVELVSMESKVAADNADIGAMSYLTNSTLYGGFK---TTEKA 750 (836) Q Consensus 681 il~G~Gt~--~~p~Gi~~~~~~~~~t~-----aa~~~t~~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~---~lkd~ 750 (836) ||+|+|++ ..|.|+.+.......+. ......++++.++...+... ......+|+|||.++..|. .++|. T Consensus 143 ~l~G~g~~~~~~~~gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~m~~~~~~~L~~~~~l~d~ 221 (338) T protein:vir:78 143 VFHGKSPLTGSALQGIDTNNVIVNTTNVDYLQTGTTPLLDRFLDGYDLVSAN-TDVDFNGWAADPRYRARLLRSQAYRDA 221 (338) T ss_pred hhcccCCCccccccccccccccccccccccccccchhhHHHHHHHHHHhhhh-ccccceEEEEchHHHHHHHHHhhhccC Confidence 99999863 46788876544332221 22334678888888777543 3445678999999987764 57789 Q ss_pred cCcccccc----CCCCeecceeeEeeCcccc---------ceEEEEehhceEEEeecceEEEEeccc------------- Q lcl|NC_016164. 751 TSTAQFVL----EPGGTVNGYNVVRSNQVAN---------GDVFFGVWNQMIMGMWGALDIQVNPYA------------- 804 (836) Q Consensus 751 ~g~~~~~~----~~~~~l~G~pVv~s~~~~~---------~~i~~gD~s~~~i~~~~~l~i~~~~~~------------- 804 (836) +|+++|.. +.+++|+|+||++++++|. ..++||||+.|.+++++++.+.++++. T Consensus 222 ~g~~l~~~~~~~~~~~~l~G~PV~~~~~ip~~~~~~~~~~~~~~~gdfs~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~ 301 (338) T protein:vir:78 222 NGNVDPTRINLAASAGDLLGLPVQFGKAVGGDLGAATDSKVRVVGGDFSQLKYGFADEIRVKMSDTATLTDNTSPTPQTV 301 (338) T ss_pred CCceeecccccCCCCceeeeeeEEEccccCccccccCCcccEEEEEecceEEEEeecccEEEEeecccccccccccccch Confidence 99988743 3467999999999999984 348999999999999999999988763 Q ss_pred -ccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 805 -LDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 805 -~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) .|.+|++.||++.|+|+++.+|+||+++++|= T Consensus 302 ~~~~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~ 334 (338) T protein:vir:78 302 SMWQTNQIAILIEVTFGWLLGDKQAFVKFVDDE 334 (338) T ss_pred hhhhcCcEEEEEEEEeccEeecccceEEEeccc Confidence 48899999999999999999999999999887 No 90 >protein:vir:2504 Length: 305 # NCBI annotation: major capsid subunit gp9 # Family: family:all:507 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569745;genbank:gi:18496895;genbank:GeneID:932268 Probab=100.00 E-value=6.7e-48 Score=279.22 Aligned_cols=272 Identities=12% Similarity=0.108 Sum_probs=220.3 Q ss_pred hhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEecCCceeeeeccCccc-----cccccccee Q lcl|NC_016164. 561 VVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQTGAATAYWVAEGGDP-----TESQPSVDQ 635 (836) Q Consensus 561 ~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~-----~~~~~~~~~ 635 (836) +..++.+.|+.++|+.+...|++.+++.+++++++ ++++..++.+.+|+.++.+.+.|++|++.. +.++++|++ T Consensus 1 ma~~t~~~gg~liP~~~~~~Ii~~~~~~s~l~~l~-~~~~~~~~~~~~p~~~~~~~a~wv~E~~~~~~~~~~~s~~~f~~ 79 (305) T protein:vir:25 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAF-QNVNMGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVTWAN 79 (305) T ss_pred CCCccCCccceecCHHHHHHHHHHHHhhchhhhhc-ceeeccCCcEEEEEEeCCcceEEeecccccccccccccccceee Confidence 55556667788889999999999999999999995 566777788999999999999999999864 556889999 Q ss_pred EEeeeeeeeeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcc--ccccccccccc--ccccccccchh Q lcl|NC_016164. 636 VALVAKTLGAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSNS--QPEGLKFVTGI--NTENFGATNPT 711 (836) Q Consensus 636 it~~~~t~~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~--~p~Gi~~~~~~--~~~t~aa~~~t 711 (836) ++++++|++++++||+|+|.++.++++++|.+.|+++++++++.+||+|+|++. .+.++.+.... .......+... T Consensus 80 i~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~~~a~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (305) T protein:vir:25 80 RTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGGVAN 159 (305) T ss_pred EEeeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHhhhheeccCCCCCccccccccccccccccccccccchh Confidence 999999999999999999999999999999999999999999999999998632 12223222211 12222233344 Q ss_pred HHHHHHHHHHHhhhcc--ccCccEEEecHHHHHHHHHHhhccCccccccCCCCeecceeeEeeCcccc----ceEEEEeh Q lcl|NC_016164. 712 YVELVSMESKVAADNA--DIGAMSYLTNSTLYGGFKTTEKATSTAQFVLEPGGTVNGYNVVRSNQVAN----GDVFFGVW 785 (836) Q Consensus 712 ~~~l~~a~~~l~~~~~--~~~~~~~vmnp~~~~~L~~lkd~~g~~~~~~~~~~~l~G~pVv~s~~~~~----~~i~~gD~ 785 (836) +.++..++..+..... ......|+|||.++..|++++|++|++.|. +++|+|+||++++.+|. +.++|||| T Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~i~~---~~~l~G~Pv~~~~~~~~~~~~~~~~~gd~ 236 (305) T protein:vir:25 160 ESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFR---DDSFAGFRTFFNRNGAWDADAAIEVIADS 236 (305) T ss_pred hhHHHHHHHHHHHhhhhcccccceeEecHHHHHHHHHhhccCCceeec---CCcccccceEEcCccCCCCCccEEEEEec Confidence 5555555544433321 123346999999999999999999998773 46899999999999874 46899999 Q ss_pred hceEEEeecceEEEEeccc----------ccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 786 NQMIMGMWGALDIQVNPYA----------LDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 786 s~~~i~~~~~l~i~~~~~~----------~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) ++|.+++++++++..+++. .|++|++.+|++.|+|+++.+|+||++++..= T Consensus 237 s~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~R~~~r~~~~v~~p~a~v~~~~~~ 297 (305) T protein:vir:25 237 SRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTP 297 (305) T ss_pred ceEEEEEecCeEEEEeeeeeeecCCceeeeeecCcEEEEEEEeecceeeCcccEEEEcccc Confidence 9999999999999887753 58899999999999999999999999999752 No 91 >protein:vir:93881 Length: 387 # NCBI annotation: ORF011 # Family: family:all:658 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239938;genbank:gi:66395599;genbank:GeneID:5130947 Probab=100.00 E-value=1.6e-46 Score=271.70 Aligned_cols=378 Identities=11% Similarity=0.060 Sum_probs=221.8 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhHHHHHHHHhhhhhh Q lcl|NC_016164. 408 MDSSTIDMEAVRAQAAADERSRVASITSLCREHKADDLAQGLIESGASEADAMRSVLSEIAKRPAAQPATPAAPVRSAQP 487 (836) Q Consensus 408 ~~~~~~~~e~~~~~~~~~~~~~~~ei~al~~~~~l~e~a~eliee~~t~~e~~~~~l~~l~~~~~a~~~~~~~~~~~~~~ 487 (836) |+. ..+..........+......++.++....... . ++ .......++.+..+................ T Consensus 1 Mk~-l~el~~~~~e~~~~~~~~~~~~~~~~~~~~~~---~---ee----~~~~~~~~~~l~~~~~~l~~~~~~~e~~~~- 68 (387) T protein:vir:93 1 MPT-LYELKQSLGMIGQQLKNKNDELSQKATDPNID---M---ED----IKQLETEKAGLQQRFNIVERQVKDIEEKEK- 68 (387) T ss_pred Cch-HHHHHHHHHHHHHHHHHHHHHHHHHHhccCcC---H---HH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH- Confidence 000 00000000011111111111111110000000 0 00 000111111111111111100000000000 Q ss_pred HHHHhhhhhhhhhhhHHHHHHHhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhccccc Q lcl|NC_016164. 488 IAAGGGSADIGLTDKEARSFSFVRAIRAQMMPGDRAAFEAAAFEREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTASA 567 (836) Q Consensus 488 ~~~~~~~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~ 567 (836) .... .................. . ....+... .+...... .......++..+.++.+ T Consensus 69 ------~~~~-------------~~~~~~~~~~~~~~~~~~-~-~~~~r~~~--~~~~~~~~-~~~~~~~~~al~~~t~s 124 (387) T protein:vir:93 69 ------AKVK-------------DTGEAYQSLNDHEKMVKA-K-AEFYRHAI--LPNEFEKP-SMEAQRLLHALPTGNDS 124 (387) T ss_pred ------Hhhh-------------hccccCCCcchhhHHHHH-H-HHHHHHHh--hhhhhhhh-hhhhHHHHHhhccCcCC Confidence 0000 000000000000000000 0 00000000 00000000 11112233445556667 Q ss_pred ccccccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEec-CCceeeeeccCcccccccccceeEEeeeeeeeee Q lcl|NC_016164. 568 AGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQT-GAATAYWVAEGGDPTESQPSVDQVALVAKTLGAY 646 (836) Q Consensus 568 ~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~-~~~~a~~v~Eg~~~~~~~~~~~~it~~~~t~~~~ 646 (836) .|++++|+.+...|++.+++.++|++++.. .+..+ ..+|+.. +...+.|++|++..++++++|+++++.+++++++ T Consensus 125 ~gG~~IP~~~~~~Ii~~~~~~~~l~~~~~v-~~~~~--~~~p~~~~~~~~a~~v~E~~~~~~~~~~f~~v~~~~~k~~~~ 201 (387) T protein:vir:93 125 GGDKLLPKTLSKEIVSEPFAKNQLREKARL-TNIKG--LEIPRVSYTLDDDDFITDVETAKELKLKGDTVKFTTNKFKVF 201 (387) T ss_pred CCceeechhHHHHHHHHHHhhchhhhheee-eecCC--ceEEEEeecCCccccccCcccccccccccceeeeeheeeeee Confidence 788999999999999999999999998654 44333 4566644 4577899999999999999999999999999999 Q ss_pred ehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHH-HHhhcCCcccccccccccccccccccccchhHHHHHHHHHHHhhh Q lcl|NC_016164. 647 TEFSRRLMLQSSIDVEQMVRTELATVIALEIDRA-ALYGLGSNSQPEGLKFVTGINTENFGATNPTYVELVSMESKVAAD 725 (836) Q Consensus 647 i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~-il~G~Gt~~~p~Gi~~~~~~~~~t~aa~~~t~~~l~~a~~~l~~~ 725 (836) ++||++||.|+.++++++|.+.|+++++++++.. |.+|+|+ ++|.|++...++..++ +..++++|.++++.+..+ T Consensus 202 ~~iS~ell~Ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~-g~p~g~l~~~~~~~v~---~~~~~d~i~~~~~~l~~~ 277 (387) T protein:vir:93 202 AAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKS-GLDHMSFYNGSVKEVE---GADMYDAIINALADLHED 277 (387) T ss_pred chhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCc-cccceeeecccccccc---ccchHHHHHHHHhccChh Confidence 9999999999999999999999999999998765 5567775 5789988766554433 334689999999999888 Q ss_pred ccccCccEEEecHHHHHHHHHHhhccCccccccCCCCeecceeeEeeCccccceEEEEehhceEEEeecceEEEEecccc Q lcl|NC_016164. 726 NADIGAMSYLTNSTLYGGFKTTEKATSTAQFVLEPGGTVNGYNVVRSNQVANGDVFFGVWNQMIMGMWGALDIQVNPYAL 805 (836) Q Consensus 726 ~~~~~~~~~vmnp~~~~~L~~lkd~~g~~~~~~~~~~~l~G~pVv~s~~~~~~~i~~gD~s~~~i~~~~~l~i~~~~~~~ 805 (836) |. .+++|+||+.++..+..+++.+|++ |..+.+.+|+|+||++++.++ .++||||++|++.. .++.+. ++.. T Consensus 278 ~~--~~a~~~mn~~t~~~~~~~~~d~~~~-~~~~~~~~llG~PV~~~~~~~--~~~~GDf~~~~~~~-~~~~~~--~~~~ 349 (387) T protein:vir:93 278 YR--DNATIYMRYADYVKIISVLSNGTTN-FFDTPAEKVFGKPVVFTDAAV--KPIVGDFNYFGINY-DGTTYD--TDKD 349 (387) T ss_pred hh--cCCEEEEechHHHHHHHHHhcCCCc-ccccCCccccccceEEecCCC--ceeeeehhhhheeh-hhheee--eccc Confidence 75 4679999999987765444433333 446778899999999999865 58999999987653 444444 4455 Q ss_pred cccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 806 DKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 806 ~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) +.++++.|+++.|+|+++++|+||++++..- T Consensus 350 ~~~~~~~~~~~~r~d~~v~~~eA~~~l~~k~ 380 (387) T protein:vir:93 350 VKKGEYLFVLTAWYDQQRTLDSAFRIAKAKE 380 (387) T ss_pred ccCCceeEEEEeeeCceeechhheEEEEeec Confidence 6789999999999999999999999887544 No 92 >protein:vir:94424 Length: 387 # NCBI annotation: ORF010 # Family: family:all:658 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240005;genbank:gi:66395666;genbank:GeneID:5133084 Probab=100.00 E-value=1.2e-46 Score=272.38 Aligned_cols=378 Identities=11% Similarity=0.069 Sum_probs=222.2 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhHHHHHHHHhhhhhh Q lcl|NC_016164. 408 MDSSTIDMEAVRAQAAADERSRVASITSLCREHKADDLAQGLIESGASEADAMRSVLSEIAKRPAAQPATPAAPVRSAQP 487 (836) Q Consensus 408 ~~~~~~~~e~~~~~~~~~~~~~~~ei~al~~~~~l~e~a~eliee~~t~~e~~~~~l~~l~~~~~a~~~~~~~~~~~~~~ 487 (836) |+. ..+.................++.++....... . ++.. .....++.+..+................. T Consensus 1 Mk~-l~el~~~~~~~~~~~~~~~~el~e~~~~~~~~--~----eei~----~~~~~~~~l~~~~~~l~~~~~~~e~~~~~ 69 (387) T protein:vir:94 1 MPT-LYELKQSLGMIGQQLKNKNDELSQKATDPNID--M----EDIK----QLETEKAGLQQRFNIVERQVQDIEEKEKA 69 (387) T ss_pred Cch-HHHHHHHHHHHHHHHHHHHHHHHHHHhccCcC--H----HHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 000 00000000000000000011111110000000 0 0000 00011111111111100000000000000 Q ss_pred HHHHhhhhhhhhhhhHHHHHHHhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhccccc Q lcl|NC_016164. 488 IAAGGGSADIGLTDKEARSFSFVRAIRAQMMPGDRAAFEAAAFEREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTASA 567 (836) Q Consensus 488 ~~~~~~~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~ 567 (836) . ... ................. .....+... .+...... .......+.....++.+ T Consensus 70 ~-----~~~---------------~~~~~~~~~~~~~~~~~--~~~~~r~~~--~~~~~~~~-~~~~~~~~~a~~~~~~~ 124 (387) T protein:vir:94 70 K-----VKD---------------KGEAYQSLSDNEKMVKA--KAEFYRHAI--LPNEFEKP-SMEAQRLLHALPTGNDS 124 (387) T ss_pred h-----hhh---------------ccccCCCCchhHHHHHH--HHHHHHHHH--hhhhHHHH-HHHHHHHHhhhccCCCC Confidence 0 000 00000000000000000 000000000 00000000 01111222333445566 Q ss_pred ccccccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEec-CCceeeeeccCcccccccccceeEEeeeeeeeee Q lcl|NC_016164. 568 AGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQT-GAATAYWVAEGGDPTESQPSVDQVALVAKTLGAY 646 (836) Q Consensus 568 ~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~-~~~~a~~v~Eg~~~~~~~~~~~~it~~~~t~~~~ 646 (836) .|++++|+.+...|++.+++.+++++++. +.+..+ ..+|+.. ....+.|++|++.+++++++|+++++.+++++++ T Consensus 125 ~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~-~~~~~~--~~~p~~~~~~~~a~~v~Eg~~~~~~~~~f~~v~l~~~k~~~~ 201 (387) T protein:vir:94 125 GGDKLLPKTLSKEIVSEPFAKNQLREKAR-LTNIKG--LEIPRVSYTLDDDDFITDVETAKELKAKGDTVKFTTNKFKVF 201 (387) T ss_pred CCceeechhHHHHHHHHHHhhchhhhhce-eeecCC--ceeeeeeccCCccccccccccccccccccceeeechheeeee Confidence 78899999999999999999999999865 344333 4556544 4577899999999999999999999999999999 Q ss_pred ehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHH-HHhhcCCcccccccccccccccccccccchhHHHHHHHHHHHhhh Q lcl|NC_016164. 647 TEFSRRLMLQSSIDVEQMVRTELATVIALEIDRA-ALYGLGSNSQPEGLKFVTGINTENFGATNPTYVELVSMESKVAAD 725 (836) Q Consensus 647 i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~-il~G~Gt~~~p~Gi~~~~~~~~~t~aa~~~t~~~l~~a~~~l~~~ 725 (836) ++||+|||.|+.++++++|.++|+++++++++.. |..|+|+ ++|.|+++..++..++ +..++++|.++++.+..+ T Consensus 202 i~iS~ell~ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~-g~~~g~~~~~~~~~~~---~~~~~d~i~~~~~~l~~~ 277 (387) T protein:vir:94 202 AAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKS-GLEHMSFYNGSVKEVE---GADMYDAIINALADLHED 277 (387) T ss_pred chhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCc-cccceeeecccccccc---ccchHHHHHHHHhccChh Confidence 9999999999999999999999999999997654 5566765 5789988776655443 344689999999999887 Q ss_pred ccccCccEEEecHHHHHHHHHHhhccCccccccCCCCeecceeeEeeCccccceEEEEehhceEEEeecceEEEEecccc Q lcl|NC_016164. 726 NADIGAMSYLTNSTLYGGFKTTEKATSTAQFVLEPGGTVNGYNVVRSNQVANGDVFFGVWNQMIMGMWGALDIQVNPYAL 805 (836) Q Consensus 726 ~~~~~~~~~vmnp~~~~~L~~lkd~~g~~~~~~~~~~~l~G~pVv~s~~~~~~~i~~gD~s~~~i~~~~~l~i~~~~~~~ 805 (836) |. .+++|+||+.++..+..+++..|++++ .+.+.+|+|+||++++.++ +++||||++|++.. .++.+. .+.. T Consensus 278 y~--~na~~imn~~t~~~~~~~~~~~~~~~~-~~~~~~llG~PV~~~~~~~--~~~~GDf~~~~~~~-~~~~~~--~~~~ 349 (387) T protein:vir:94 278 YR--DNATIYMRYADYVKIISVLSNGTTNFF-DTPAEKVFGKPVVFTDAAV--KPIVGDFNYFGINY-DGTTYD--TDKD 349 (387) T ss_pred hh--cCCEEEEechHHHHHHHHHhcCCCccc-ccCCccccccceEEecCCC--ceeeechhhhhhhh-hhhhhe--eccc Confidence 64 467999999998887766666666554 5677899999999999865 58999999876654 344443 3344 Q ss_pred cccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 806 DKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 806 ~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) ...|++.|+++.|+|+++++|+||++++..- T Consensus 350 ~~~~~~~~~~~~r~Dg~v~~~~A~~~l~~ka 380 (387) T protein:vir:94 350 VKKGEYLFVLTAWYDQQRTLDSAFRIAKAKE 380 (387) T ss_pred ccCCceEEEEEEEeCcEeechhheEEEEeec Confidence 5679999999999999999999999998755 No 93 >protein:vir:2685 Length: 387 # NCBI annotation: hypothetical protein # Family: family:all:658 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075504;genbank:gi:12719433;genbank:GeneID:920169 Probab=100.00 E-value=1.2e-46 Score=272.38 Aligned_cols=378 Identities=11% Similarity=0.069 Sum_probs=222.2 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhHHHHHHHHhhhhhh Q lcl|NC_016164. 408 MDSSTIDMEAVRAQAAADERSRVASITSLCREHKADDLAQGLIESGASEADAMRSVLSEIAKRPAAQPATPAAPVRSAQP 487 (836) Q Consensus 408 ~~~~~~~~e~~~~~~~~~~~~~~~ei~al~~~~~l~e~a~eliee~~t~~e~~~~~l~~l~~~~~a~~~~~~~~~~~~~~ 487 (836) |+. ..+.................++.++....... . ++.. .....++.+..+................. T Consensus 1 Mk~-l~el~~~~~~~~~~~~~~~~el~e~~~~~~~~--~----eei~----~~~~~~~~l~~~~~~l~~~~~~~e~~~~~ 69 (387) T protein:vir:26 1 MPT-LYELKQSLGMIGQQLKNKNDELSQKATDPNID--M----EDIK----QLETEKAGLQQRFNIVERQVQDIEEKEKA 69 (387) T ss_pred Cch-HHHHHHHHHHHHHHHHHHHHHHHHHHhccCcC--H----HHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 000 00000000000000000011111110000000 0 0000 00011111111111100000000000000 Q ss_pred HHHHhhhhhhhhhhhHHHHHHHhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhccccc Q lcl|NC_016164. 488 IAAGGGSADIGLTDKEARSFSFVRAIRAQMMPGDRAAFEAAAFEREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTASA 567 (836) Q Consensus 488 ~~~~~~~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~ 567 (836) . ... ................. .....+... .+...... .......+.....++.+ T Consensus 70 ~-----~~~---------------~~~~~~~~~~~~~~~~~--~~~~~r~~~--~~~~~~~~-~~~~~~~~~a~~~~~~~ 124 (387) T protein:vir:26 70 K-----VKD---------------KGEAYQSLSDNEKMVKA--KAEFYRHAI--LPNEFEKP-SMEAQRLLHALPTGNDS 124 (387) T ss_pred h-----hhh---------------ccccCCCCchhHHHHHH--HHHHHHHHH--hhhhHHHH-HHHHHHHHhhhccCCCC Confidence 0 000 00000000000000000 000000000 00000000 01111222333445566 Q ss_pred ccccccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEec-CCceeeeeccCcccccccccceeEEeeeeeeeee Q lcl|NC_016164. 568 AGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQT-GAATAYWVAEGGDPTESQPSVDQVALVAKTLGAY 646 (836) Q Consensus 568 ~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~-~~~~a~~v~Eg~~~~~~~~~~~~it~~~~t~~~~ 646 (836) .|++++|+.+...|++.+++.+++++++. +.+..+ ..+|+.. ....+.|++|++.+++++++|+++++.+++++++ T Consensus 125 ~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~-~~~~~~--~~~p~~~~~~~~a~~v~Eg~~~~~~~~~f~~v~l~~~k~~~~ 201 (387) T protein:vir:26 125 GGDKLLPKTLSKEIVSEPFAKNQLREKAR-LTNIKG--LEIPRVSYTLDDDDFITDVETAKELKAKGDTVKFTTNKFKVF 201 (387) T ss_pred CCceeechhHHHHHHHHHHhhchhhhhce-eeecCC--ceeeeeeccCCccccccccccccccccccceeeechheeeee Confidence 78899999999999999999999999865 344333 4556544 4577899999999999999999999999999999 Q ss_pred ehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHH-HHhhcCCcccccccccccccccccccccchhHHHHHHHHHHHhhh Q lcl|NC_016164. 647 TEFSRRLMLQSSIDVEQMVRTELATVIALEIDRA-ALYGLGSNSQPEGLKFVTGINTENFGATNPTYVELVSMESKVAAD 725 (836) Q Consensus 647 i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~-il~G~Gt~~~p~Gi~~~~~~~~~t~aa~~~t~~~l~~a~~~l~~~ 725 (836) ++||+|||.|+.++++++|.++|+++++++++.. |..|+|+ ++|.|+++..++..++ +..++++|.++++.+..+ T Consensus 202 i~iS~ell~ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~-g~~~g~~~~~~~~~~~---~~~~~d~i~~~~~~l~~~ 277 (387) T protein:vir:26 202 AAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKS-GLEHMSFYNGSVKEVE---GADMYDAIINALADLHED 277 (387) T ss_pred chhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCc-cccceeeecccccccc---ccchHHHHHHHHhccChh Confidence 9999999999999999999999999999997654 5566765 5789988776655443 344689999999999887 Q ss_pred ccccCccEEEecHHHHHHHHHHhhccCccccccCCCCeecceeeEeeCccccceEEEEehhceEEEeecceEEEEecccc Q lcl|NC_016164. 726 NADIGAMSYLTNSTLYGGFKTTEKATSTAQFVLEPGGTVNGYNVVRSNQVANGDVFFGVWNQMIMGMWGALDIQVNPYAL 805 (836) Q Consensus 726 ~~~~~~~~~vmnp~~~~~L~~lkd~~g~~~~~~~~~~~l~G~pVv~s~~~~~~~i~~gD~s~~~i~~~~~l~i~~~~~~~ 805 (836) |. .+++|+||+.++..+..+++..|++++ .+.+.+|+|+||++++.++ +++||||++|++.. .++.+. .+.. T Consensus 278 y~--~na~~imn~~t~~~~~~~~~~~~~~~~-~~~~~~llG~PV~~~~~~~--~~~~GDf~~~~~~~-~~~~~~--~~~~ 349 (387) T protein:vir:26 278 YR--DNATIYMRYADYVKIISVLSNGTTNFF-DTPAEKVFGKPVVFTDAAV--KPIVGDFNYFGINY-DGTTYD--TDKD 349 (387) T ss_pred hh--cCCEEEEechHHHHHHHHHhcCCCccc-ccCCccccccceEEecCCC--ceeeechhhhhhhh-hhhhhe--eccc Confidence 64 467999999998887766666666554 5677899999999999865 58999999876654 344443 3344 Q ss_pred cccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 806 DKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 806 ~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) ...|++.|+++.|+|+++++|+||++++..- T Consensus 350 ~~~~~~~~~~~~r~Dg~v~~~~A~~~l~~ka 380 (387) T protein:vir:26 350 VKKGEYLFVLTAWYDQQRTLDSAFRIAKAKE 380 (387) T ss_pred ccCCceEEEEEEEeCcEeechhheEEEEeec Confidence 5679999999999999999999999998755 No 94 >protein:vir:96978 Length: 387 # NCBI annotation: ORF009 # Family: family:all:658 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239859;genbank:gi:66395517;genbank:GeneID:5133011 Probab=100.00 E-value=1.2e-46 Score=272.38 Aligned_cols=378 Identities=11% Similarity=0.069 Sum_probs=222.2 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhHHHHHHHHhhhhhh Q lcl|NC_016164. 408 MDSSTIDMEAVRAQAAADERSRVASITSLCREHKADDLAQGLIESGASEADAMRSVLSEIAKRPAAQPATPAAPVRSAQP 487 (836) Q Consensus 408 ~~~~~~~~e~~~~~~~~~~~~~~~ei~al~~~~~l~e~a~eliee~~t~~e~~~~~l~~l~~~~~a~~~~~~~~~~~~~~ 487 (836) |+. ..+.................++.++....... . ++.. .....++.+..+................. T Consensus 1 Mk~-l~el~~~~~~~~~~~~~~~~el~e~~~~~~~~--~----eei~----~~~~~~~~l~~~~~~l~~~~~~~e~~~~~ 69 (387) T protein:vir:96 1 MPT-LYELKQSLGMIGQQLKNKNDELSQKATDPNID--M----EDIK----QLETEKAGLQQRFNIVERQVQDIEEKEKA 69 (387) T ss_pred Cch-HHHHHHHHHHHHHHHHHHHHHHHHHHhccCcC--H----HHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 000 00000000000000000011111110000000 0 0000 00011111111111100000000000000 Q ss_pred HHHHhhhhhhhhhhhHHHHHHHhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhccccc Q lcl|NC_016164. 488 IAAGGGSADIGLTDKEARSFSFVRAIRAQMMPGDRAAFEAAAFEREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTASA 567 (836) Q Consensus 488 ~~~~~~~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~ 567 (836) . ... ................. .....+... .+...... .......+.....++.+ T Consensus 70 ~-----~~~---------------~~~~~~~~~~~~~~~~~--~~~~~r~~~--~~~~~~~~-~~~~~~~~~a~~~~~~~ 124 (387) T protein:vir:96 70 K-----VKD---------------KGEAYQSLSDNEKMVKA--KAEFYRHAI--LPNEFEKP-SMEAQRLLHALPTGNDS 124 (387) T ss_pred h-----hhh---------------ccccCCCCchhHHHHHH--HHHHHHHHH--hhhhHHHH-HHHHHHHHhhhccCCCC Confidence 0 000 00000000000000000 000000000 00000000 01111222333445566 Q ss_pred ccccccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEec-CCceeeeeccCcccccccccceeEEeeeeeeeee Q lcl|NC_016164. 568 AGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQT-GAATAYWVAEGGDPTESQPSVDQVALVAKTLGAY 646 (836) Q Consensus 568 ~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~-~~~~a~~v~Eg~~~~~~~~~~~~it~~~~t~~~~ 646 (836) .|++++|+.+...|++.+++.+++++++. +.+..+ ..+|+.. ....+.|++|++.+++++++|+++++.+++++++ T Consensus 125 ~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~-~~~~~~--~~~p~~~~~~~~a~~v~Eg~~~~~~~~~f~~v~l~~~k~~~~ 201 (387) T protein:vir:96 125 GGDKLLPKTLSKEIVSEPFAKNQLREKAR-LTNIKG--LEIPRVSYTLDDDDFITDVETAKELKAKGDTVKFTTNKFKVF 201 (387) T ss_pred CCceeechhHHHHHHHHHHhhchhhhhce-eeecCC--ceeeeeeccCCccccccccccccccccccceeeechheeeee Confidence 78899999999999999999999999865 344333 4556544 4577899999999999999999999999999999 Q ss_pred ehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHH-HHhhcCCcccccccccccccccccccccchhHHHHHHHHHHHhhh Q lcl|NC_016164. 647 TEFSRRLMLQSSIDVEQMVRTELATVIALEIDRA-ALYGLGSNSQPEGLKFVTGINTENFGATNPTYVELVSMESKVAAD 725 (836) Q Consensus 647 i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~-il~G~Gt~~~p~Gi~~~~~~~~~t~aa~~~t~~~l~~a~~~l~~~ 725 (836) ++||+|||.|+.++++++|.++|+++++++++.. |..|+|+ ++|.|+++..++..++ +..++++|.++++.+..+ T Consensus 202 i~iS~ell~ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~-g~~~g~~~~~~~~~~~---~~~~~d~i~~~~~~l~~~ 277 (387) T protein:vir:96 202 AAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKS-GLEHMSFYNGSVKEVE---GADMYDAIINALADLHED 277 (387) T ss_pred chhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCc-cccceeeecccccccc---ccchHHHHHHHHhccChh Confidence 9999999999999999999999999999997654 5566765 5789988776655443 344689999999999887 Q ss_pred ccccCccEEEecHHHHHHHHHHhhccCccccccCCCCeecceeeEeeCccccceEEEEehhceEEEeecceEEEEecccc Q lcl|NC_016164. 726 NADIGAMSYLTNSTLYGGFKTTEKATSTAQFVLEPGGTVNGYNVVRSNQVANGDVFFGVWNQMIMGMWGALDIQVNPYAL 805 (836) Q Consensus 726 ~~~~~~~~~vmnp~~~~~L~~lkd~~g~~~~~~~~~~~l~G~pVv~s~~~~~~~i~~gD~s~~~i~~~~~l~i~~~~~~~ 805 (836) |. .+++|+||+.++..+..+++..|++++ .+.+.+|+|+||++++.++ +++||||++|++.. .++.+. .+.. T Consensus 278 y~--~na~~imn~~t~~~~~~~~~~~~~~~~-~~~~~~llG~PV~~~~~~~--~~~~GDf~~~~~~~-~~~~~~--~~~~ 349 (387) T protein:vir:96 278 YR--DNATIYMRYADYVKIISVLSNGTTNFF-DTPAEKVFGKPVVFTDAAV--KPIVGDFNYFGINY-DGTTYD--TDKD 349 (387) T ss_pred hh--cCCEEEEechHHHHHHHHHhcCCCccc-ccCCccccccceEEecCCC--ceeeechhhhhhhh-hhhhhe--eccc Confidence 64 467999999998887766666666554 5677899999999999865 58999999876654 344443 3344 Q ss_pred cccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 806 DKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 806 ~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) ...|++.|+++.|+|+++++|+||++++..- T Consensus 350 ~~~~~~~~~~~~r~Dg~v~~~~A~~~l~~ka 380 (387) T protein:vir:96 350 VKKGEYLFVLTAWYDQQRTLDSAFRIAKAKE 380 (387) T ss_pred ccCCceEEEEEEEeCcEeechhheEEEEeec Confidence 5679999999999999999999999998755 No 95 >protein:vir:100884 Length: 389 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358764;genbank:gi:78000028;genbank:GeneID:3726155 Probab=100.00 E-value=6.1e-46 Score=268.45 Aligned_cols=363 Identities=12% Similarity=0.063 Sum_probs=217.4 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhHHHHHHHHhhhhhhHHHHhhh Q lcl|NC_016164. 415 MEAVRAQAAADERSRVASITSLCREHKADDLAQGLIESGASEADAMRSVLSEIAKRPAAQPATPAAPVRSAQPIAAGGGS 494 (836) Q Consensus 415 ~e~~~~~~~~~~~~~~~ei~al~~~~~l~e~a~eliee~~t~~e~~~~~l~~l~~~~~a~~~~~~~~~~~~~~~~~~~~~ 494 (836) +++... ...+......++++.......++ ....++ .+.....++.+..+..+........ ...... T Consensus 1 meeL~~-~~~~~~~~~~e~~~~l~~~~~~~--~~~~e~----~~~l~~ei~~~~~~~~~l~~~~~~~-------~~~~~~ 66 (389) T protein:vir:10 1 MDKLQT-LFNDVSAKCADLNAQLNAKLQDE--NASVDD----FQKIKDDLTAAKARRDAINDQIKAL-------EAEKPA 66 (389) T ss_pred ChHHHH-HHHHHHHHHHHHHHHHHHHHHhH--hhhHHH----HHHHHHHHHHHHHHHHHHHHHHHHH-------HHHHHh Confidence 111110 00111111111111110000000 000000 0000001111111111100000000 000000 Q ss_pred hhhhhhhhHHHHHHHhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhcccccccccccc Q lcl|NC_016164. 495 ADIGLTDKEARSFSFVRAIRAQMMPGDRAAFEAAAFEREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTASAAGDLVFT 574 (836) Q Consensus 495 ~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp 574 (836) ................ ..... .......+... .++ ..........++.+.|++++| T Consensus 67 --------~~~~~~~~~~~~~~~~----~~~~~---~~~~~~~~~~~----lr~-----~~~~~~~~~~~t~~~gg~~vP 122 (389) T protein:vir:10 67 --------EPKTEPKDDGSKKGTD----LSKKP---IDAKKKAINDF----IHS-----HGKVIDATSKVTSTEAGVLIP 122 (389) T ss_pred --------hhhccccccccccccc----cchhH---HHHHHHHHHHH----hhc-----chhhhhhhcccccCCcceeeh Confidence 0000000000000000 00000 00000111110 000 111222334455567788899 Q ss_pred hhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEecC-CceeeeeccCccccc-ccccceeEEeeeeeeeeeehhHHH Q lcl|NC_016164. 575 DGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQTG-AATAYWVAEGGDPTE-SQPSVDQVALVAKTLGAYTEFSRR 652 (836) Q Consensus 575 ~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~~-~~~a~~v~Eg~~~~~-~~~~~~~it~~~~t~~~~i~ISre 652 (836) +.+...|++.+++.+++++++ ++.+..++...+|.... ...+.|++|+++++. ++++|+++++.+++++++++||++ T Consensus 123 ~~~~~~i~~~~~~~~~l~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~e 201 (389) T protein:vir:10 123 EEIIYDPTAEVNSVVDLSTLV-TKTPVTTPKGTYPILKRATDRFSSVAELAENPKLAEPEFNKVDWSVATYRGAIPLSEE 201 (389) T ss_pred HHHHHHHHHHHHhhhhHHhhc-ceeeccCCeeEEEEEecCCCccccccccccccccccccceeeeeeheeeEeeehhhHH Confidence 999999999999999999984 55566666677776654 456689999999985 789999999999999999999999 Q ss_pred HHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccccccccccchhHHHHHHHHHH-HhhhccccCc Q lcl|NC_016164. 653 LMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTENFGATNPTYVELVSMESK-VAADNADIGA 731 (836) Q Consensus 653 lL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~aa~~~t~~~l~~a~~~-l~~~~~~~~~ 731 (836) +|.|+.++++++|.+.|+++++++++.+|++|+|++ .+.| .++..+++++.+++.. +...+ + T Consensus 202 ll~ds~~~l~~~i~~~la~~~~~~~~~~i~~g~~~~-~~~~------------~~~~~~~d~l~~~~~~~~~~~~----~ 264 (389) T protein:vir:10 202 AIADSAVDLTALVGQSIKEKSVNTYNAMIAPVLQSF-TAKK------------TTTDTLVDSLKHILNVDLDPAY----S 264 (389) T ss_pred HHhhhhHHHHHHHHHHHHHHHHHHHHHHHhhhhccc-cccc------------ccccccHHHHHHHHHhhhhhhh----C Confidence 999999999999999999999999999999998753 2221 2234567888887763 33222 4 Q ss_pred cEEEecHHHHHHHHHHhhccCccccccC--------CCCeecceeeEeeCcc-ccc-----eEEEEehhc-eEEEeecce Q lcl|NC_016164. 732 MSYLTNSTLYGGFKTTEKATSTAQFVLE--------PGGTVNGYNVVRSNQV-ANG-----DVFFGVWNQ-MIMGMWGAL 796 (836) Q Consensus 732 ~~~vmnp~~~~~L~~lkd~~g~~~~~~~--------~~~~l~G~pVv~s~~~-~~~-----~i~~gD~s~-~~i~~~~~l 796 (836) ++|+|||.+|..|+.++|++|+|+|... .+++|+|+||+++++. +.. .++||||++ |.+++++++ T Consensus 265 a~~~~n~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~ 344 (389) T protein:vir:10 265 RALVVTQSLFNTLDTLKDKNGRYLLHDASDSITDGTAKGTILGVPVYVVGDTLLGSLAGDQKAFVGDLKRGVLFTDRQQV 344 (389) T ss_pred cEEEecHHHHHHHHHhhccCCCeeeecCcccccccccccccccceeEEecccccCCCCCceEEEEeeccccEEEEeecce Confidence 7899999999999999999999987432 1248999999876543 221 389999997 789999999 Q ss_pred EEEEecccccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 797 DIQVNPYALDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 797 ~i~~~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) ++.++++..|.+ .+|+++|+|+++++|+||++++.+= T Consensus 345 ~i~~~~~~~~~~---~~~~~~r~d~~~~~~~a~~~~~~~~ 381 (389) T protein:vir:10 345 TLAWEDSKIYGK---YLGAAFRFGVQKADSKAGYFVTNTD 381 (389) T ss_pred EEEeeccccccc---eEEEEEEeccEEecccceEEEEeec Confidence 999998877654 6899999999999999999988553 No 96 >protein:vir:9361 Length: 402 # NCBI annotation: SLT orf 37-like protein # Family: family:all:658 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803339;genbank:gi:29028650;genbank:GeneID:1258088 Probab=100.00 E-value=2.6e-46 Score=270.52 Aligned_cols=389 Identities=10% Similarity=0.033 Sum_probs=221.4 Q ss_pred hhhhhhhhhhhhhhhhhhh-hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhH---HHHHHHHHHHhhh Q lcl|NC_016164. 394 AATVAPLSHNDNNHMDSST-IDMEAVRAQAAADERSRVASITSLCREHKADDLAQGLIESGASE---ADAMRSVLSEIAK 469 (836) Q Consensus 394 ~~~~~~~~~~~~~~~~~~~-~~~e~~~~~~~~~~~~~~~ei~al~~~~~l~e~a~eliee~~t~---~e~~~~~l~~l~~ 469 (836) ....+...+. ....++.. .+..... .+......++.+. ..+...++... .......++.+.. T Consensus 1 ~~~~~~~~~~-~~g~~mk~l~el~~~~----~e~~~~~~~~~~e---------l~~~~~~~~~~~ee~~~~~~~~~~l~~ 66 (402) T protein:vir:93 1 MRNFKNDNEL-LGGNEMPTLYELKQSL----GMIGQQLKNKNDE---------LSQKATDPNIDMEDIKQLETEKAGLQQ 66 (402) T ss_pred Ccchhhhhhc-CCCCCChHHHHHHHHH----HHHHHHHHHHHHH---------HHHHHhccCcCHHHHHHHHHHHHHHHH Confidence 0000000000 00000000 0000000 0001111111100 00000000000 0000111111111 Q ss_pred hhhhHHHHHHHHhhhhhhHHHHhhhhhhhhhhhHHHHHHHhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhh Q lcl|NC_016164. 470 RPAAQPATPAAPVRSAQPIAAGGGSADIGLTDKEARSFSFVRAIRAQMMPGDRAAFEAAAFEREVSEATAQRMGVTPRGI 549 (836) Q Consensus 470 ~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~~~~~a~~~~~~~g~~~~g~ 549 (836) +.................. ..... ............... .....+... .+...... T Consensus 67 ~~~~l~~~~~~~e~~~~~~-----~~~~~---------------~~~~~~~~~~~~~~~--~~~~~r~~~--~~~~~~~~ 122 (402) T protein:vir:93 67 RFNIVERQVQDIEEKEKAK-----VKDKG---------------EAYQSLSDNEKMVKA--KAEFYRHAI--LPNEFEKP 122 (402) T ss_pred HHHHHHHHHHHHHHHHHhh-----hhhcc---------------ccCCCCchhHHHHHH--HHHHHHHHH--hhhhHHHH Confidence 1111100000000000000 00000 000000000000000 000000000 00000000 Q ss_pred hhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEec-CCceeeeeccCccccc Q lcl|NC_016164. 550 LAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQT-GAATAYWVAEGGDPTE 628 (836) Q Consensus 550 ~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~-~~~~a~~v~Eg~~~~~ 628 (836) .............++.+.|++++|+.+...|++.++..+++++++.. .+.. ...+|+.. +...+.|++|++.++. T Consensus 123 -~~~~~~~~~a~~~~t~~~GG~lIP~~~~~~Ii~~~~~~~~l~~~~~v-~~~~--~~~~p~~~~~~~~a~~v~Eg~~~~~ 198 (402) T protein:vir:93 123 -SMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARL-TNIK--GLEIPRVSYTLDDDDFITDVETAKE 198 (402) T ss_pred -HHhHHHHHhhhccCCCcCCccccchhHHHHHHHhHHhhhhhhhhcee-eecC--CceeeeeeccCCccccccccccccc Confidence 01111222334445566778899999999999999999999998654 3433 34556554 4567899999999999 Q ss_pred ccccceeEEeeeeeeeeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHH-HHhhcCCcccccccccccccccccccc Q lcl|NC_016164. 629 SQPSVDQVALVAKTLGAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRA-ALYGLGSNSQPEGLKFVTGINTENFGA 707 (836) Q Consensus 629 ~~~~~~~it~~~~t~~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~-il~G~Gt~~~p~Gi~~~~~~~~~t~aa 707 (836) ++++|+++++.+++++++++||++||.|+.++++++|.++|+++++++++.. |..|+|+ ++|.|++...++..++ T Consensus 199 ~~~~f~~i~~~~~k~~~~i~iS~ell~Ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~-g~p~g~~~~~~~~~~~--- 274 (402) T protein:vir:93 199 LKAKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKS-GLEHMSFYNGSVKEVE--- 274 (402) T ss_pred cccccceeeecceeeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCc-cccceeeecccccccc--- Confidence 9999999999999999999999999999999999999999999999998664 5567775 5899998776655443 Q ss_pred cchhHHHHHHHHHHHhhhccccCccEEEecHHHHHHHHHHhhccCccccccCCCCeecceeeEeeCccccceEEEEehhc Q lcl|NC_016164. 708 TNPTYVELVSMESKVAADNADIGAMSYLTNSTLYGGFKTTEKATSTAQFVLEPGGTVNGYNVVRSNQVANGDVFFGVWNQ 787 (836) Q Consensus 708 ~~~t~~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~~g~~~~~~~~~~~l~G~pVv~s~~~~~~~i~~gD~s~ 787 (836) +..++++|.+++++|...|. .+++|+||+.++..+..+++..|+++ ..+.+++|+|+||++++.++ +++||||++ T Consensus 275 ~~~~~d~l~~~~~~l~~~y~--~na~~imn~~t~~~~~~~~~d~~~~~-~~~~~~~llG~PV~~t~~~~--~i~~GDf~~ 349 (402) T protein:vir:93 275 GADMYDAIINALADLHEDYR--DNATIYMRYADYVKIISVLSNGTTNF-FDTPAEKVFGKPVVFTDAAV--KPIVGDFNY 349 (402) T ss_pred ccchHHHHHHHHhccChhhh--cCCEEEEechHHHHHHHHHhcCCCcc-cccCCccccccceEEecCCC--ceeeechhh Confidence 33468999999999988764 57799999999888766666555544 45678899999999999865 589999998 Q ss_pred eEEEeecceEEEEecccccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 788 MIMGMWGALDIQVNPYALDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 788 ~~i~~~~~l~i~~~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) |++... ++.+ +.+....+|++.|++..|+|+++++|+||++++..- T Consensus 350 ~~~~~~-~~~~--~~~~~~~~~~~~~~~~~r~Dg~v~~~~A~~~l~ik~ 395 (402) T protein:vir:93 350 FGINYD-GTTY--DTDKDVKKGEYLFVLTAWYDQQRTLDSAFRIAKAKE 395 (402) T ss_pred hhhhhh-hhhh--hhhhcccCCceEEEEEEEeCcEEechhheEEEEeec Confidence 765433 3333 333445569999999999999999999999888644 No 97 >protein:vir:78640 Length: 352 # NCBI annotation: phage capsid # Family: family:all:658 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429943;genbank:gi:156603997;genbank:GeneID:5525386 Probab=100.00 E-value=1.8e-46 Score=271.33 Aligned_cols=343 Identities=12% Similarity=0.070 Sum_probs=221.7 Q ss_pred hhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhHHHHHHHHhhhhhhHHHHhhhhhhhhhhhHHHHHHHhhhhhhhhhhhh Q lcl|NC_016164. 442 ADDLAQGLIESGASEADAMRSVLSEIAKRPAAQPATPAAPVRSAQPIAAGGGSADIGLTDKEARSFSFVRAIRAQMMPGD 521 (836) Q Consensus 442 l~e~a~eliee~~t~~e~~~~~l~~l~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~a~~~~~~ 521 (836) +++. .++.++...+.+ ..+.+..+......... ...... ..... .... T Consensus 1 ~eei-~~l~~~~~~l~~----~~~~l~~~~d~~e~e~~----~~~~~~---~~~~~--------------------~~~~ 48 (352) T protein:vir:78 1 MEDI-KQLETEKAGLQQ----RFNIVERQVQDIEEKEK----AKVKDK---GEAYQ--------------------SLND 48 (352) T ss_pred ChhH-HHHHHHHHHHHH----HHHHHHHHHHHHHHHHH----HHhhhc---ccccc--------------------ccch Confidence 2111 111111111111 11111100000000000 000000 00000 0000 Q ss_pred hhhhhhhhhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeec Q lcl|NC_016164. 522 RAAFEAAAFEREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTG 601 (836) Q Consensus 522 ~~~~~~~~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~ 601 (836) ....... ..+..+..... ...... ...........+.++.+.|++++|+.+...|++.++..++|++++. +.+. T Consensus 49 ~~~~~~~--~~~~~r~~~~~--~~~~~~-~~~~~~~~~al~~~~~~~gG~lIP~~~~~~Ii~~l~~~s~l~~~~~-v~~~ 122 (352) T protein:vir:78 49 NEKLVKA--KAEFYRHAILP--NEFEKP-SMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKAR-LTNI 122 (352) T ss_pred hhhHHHH--HHHHHHHHhhh--hHHHHH-HhhHHHHHHHhccCCCCCCceeccHhHHHHHHHHHHhhcchhhhee-eEec Confidence 0000000 00000000000 000000 0011112223344566778889999999999999999999999865 4443 Q ss_pred CCceEEEEEec-CCceeeeeccCcccccccccceeEEeeeeeeeeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHH- Q lcl|NC_016164. 602 LQGPVAIPRQT-GAATAYWVAEGGDPTESQPSVDQVALVAKTLGAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDR- 679 (836) Q Consensus 602 ~~~~~~~p~~~-~~~~a~~v~Eg~~~~~~~~~~~~it~~~~t~~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~- 679 (836) .+ ..+|+.. +.+.+.|++|++.+++++++|+++++.+++++++++||++||.|+.++++++|.++|+++++++++. T Consensus 123 ~~--~~~p~~~~~~~~a~~v~E~~~~~~~~~~f~~v~~~~~k~~~~i~is~ell~Ds~~~l~~~i~~~la~~~~~~e~~~ 200 (352) T protein:vir:78 123 KG--LEIPRVSYTLDDDDFITDVETAKELKLKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKD 200 (352) T ss_pred CC--ceEEEEecCCCcccccccccccccccccceeeeecceeEEeechhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHh Confidence 32 4556654 4567999999999999999999999999999999999999999999999999999999999998666 Q ss_pred HHHhhcCCcccccccccccccccccccccchhHHHHHHHHHHHhhhccccCccEEEecHHHHHHHHHHhhccCccccccC Q lcl|NC_016164. 680 AALYGLGSNSQPEGLKFVTGINTENFGATNPTYVELVSMESKVAADNADIGAMSYLTNSTLYGGFKTTEKATSTAQFVLE 759 (836) Q Consensus 680 ~il~G~Gt~~~p~Gi~~~~~~~~~t~aa~~~t~~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~~g~~~~~~~ 759 (836) .|.+|+|+ ++|.|++...++..++. ..++++|.+++..|...+. .+++|+||+.++..|..+++.+|++++ .+ T Consensus 201 ~~~~g~g~-~~~~g~l~~~~~~~~t~---~~~~d~i~~~~~~l~~~~~--~~a~~~mn~~t~~~l~~~~~~~~~~~~-~~ 273 (352) T protein:vir:78 201 ALAVSPKS-GLEHMSFYNGSVKEVEG---ANMYDAIINALADLHEDYR--DNATIYMRYADYVKIISVLSNGTTNFF-DT 273 (352) T ss_pred hhhcCCCC-cccccceeccccccccc---cchHHHHHHHHhccChhhh--cCCEEEEehHHHHHHHHHHhccCCccc-cc Confidence 45577776 57888887766655443 2358999999999988764 468999999999999988888777765 56 Q ss_pred CCCeecceeeEeeCccccceEEEEehhceEEEeecceEEEEecccccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 760 PGGTVNGYNVVRSNQVANGDVFFGVWNQMIMGMWGALDIQVNPYALDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 760 ~~~~l~G~pVv~s~~~~~~~i~~gD~s~~~i~~~~~l~i~~~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) .+.+|+|+||++++.++ .++||||+.|++.. .++.+ +.+....+|++.|++..|+|+++++|+||++++.+= T Consensus 274 ~~~~llG~PV~~~~~~~--~~~~Gdf~~~~~~~-~~~~~--~~~~~~~~g~~~f~~~~r~Dg~~~~~eA~~~l~~~a 345 (352) T protein:vir:78 274 PAEKVFGKPVVFTDAAV--KPIVGDFNYFGINY-DGTTY--DTDKDVKKGEYLFVLTAWYDQQRTLDSAFRIAKAKE 345 (352) T ss_pred CCccccccceEEecCCC--ceeEeehhhhhhhh-hhhee--eeeccccCCeeEEEEEeeeCceeechhheEEEEeec Confidence 67899999999999764 58999999887653 34433 344455689999999999999999999999998655 No 98 >protein:vir:78350 Length: 383 # NCBI annotation: Cps # Family: family:all:635 # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468644;genbank:gi:157325222;genbank:GeneID:5601696 Probab=100.00 E-value=1.5e-46 Score=271.82 Aligned_cols=348 Identities=16% Similarity=0.094 Sum_probs=223.2 Q ss_pred hhhhhhhhhhhHHHHHHHHHHHhhhhhhhHHHHHHHHhhhhhhHHHHhhhhhhhhhhhHHHHHHHhhhhhhhhhhhhhhh Q lcl|NC_016164. 445 LAQGLIESGASEADAMRSVLSEIAKRPAAQPATPAAPVRSAQPIAAGGGSADIGLTDKEARSFSFVRAIRAQMMPGDRAA 524 (836) Q Consensus 445 ~a~eliee~~t~~e~~~~~l~~l~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~ 524 (836) ...++.+......+......+........ ........+..... ........+ ...+ T Consensus 1 M~~kl~~~~~~~~e~~~~l~~~~~~~~~~-~~~~~~~~~~~~~~-----~~~~~~~~~--------~~~~---------- 56 (383) T protein:vir:78 1 MTIKLKNNLANYEEKRTAFVNAVKNEDTQ-EIQNKAYVEMVDAM-----AADIMEQAK--------KEAR---------- 56 (383) T ss_pred CchhHHHHHHHHHHHHHHHHHHHhccChH-HHHHHHHHHHHHHH-----HHHHHHHHH--------HHHH---------- Confidence 22222222222333333322222111100 00000000000000 000000000 0000 Q ss_pred hhhhhhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeecCCc Q lcl|NC_016164. 525 FEAAAFEREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQG 604 (836) Q Consensus 525 ~~~~~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~ 604 (836) .........+.+... ....... ........+++.|++++|+.+.+.|++.++..+++++++ ++.+. ++ T Consensus 57 -------~~~~~~~~~~~g~~~--lt~~e~~-~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~l~~~~-~v~~~-~~ 124 (383) T protein:vir:78 57 -------QEADAYISASRTDKN--ITNEEIK-FFNDINKEVGYKEETLLPQTVVDEIFEDLTTEHPFLASI-GMRTT-GL 124 (383) T ss_pred -------HHHHHHHHhcCChhh--hhHHHHH-HHHHHhccCCCCCccccCHHHHHHHHHHHHhhccceeee-eeEec-CC Confidence 000000001111000 0000000 011223445667889999999999999999999999985 45554 45 Q ss_pred eEEEEEecCCceeeeeccCcccc-cccccceeEEeeeeeeeeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_016164. 605 PVAIPRQTGAATAYWVAEGGDPT-ESQPSVDQVALVAKTLGAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALY 683 (836) Q Consensus 605 ~~~~p~~~~~~~a~~v~Eg~~~~-~~~~~~~~it~~~~t~~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~ 683 (836) ..++|+.++.+.+.|++|+++.+ .++++|+++++.+++++++++||++||.|+..+++++|.+.|+++++++++.+|++ T Consensus 125 ~~~i~~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~i~is~ell~Ds~~~ie~~i~~~l~~~~a~~~~~a~i~ 204 (383) T protein:vir:78 125 RTKFLKSETSGVAVWGKIFGEIKGQLDATFSDEESIQNKLTAFVVVPKDLEKFGPAWVKRFVVTQIEEAFAVALESAYIV 204 (383) T ss_pred ceEEEEEcCCcceEEeecccccccccCcceeeEeecceeeEeeccchHHHhhccHHHHHHHHHHHHHHHHHHHHhhheEe Confidence 68999999999999999988875 57899999999999999999999999999999999999999999999999999999 Q ss_pred hcCCccccccccccccccc-cc-------ccccchhHHHHHHHHHHHhhhc------------cccCccEEEecHHHHHH Q lcl|NC_016164. 684 GLGSNSQPEGLKFVTGINT-EN-------FGATNPTYVELVSMESKVAADN------------ADIGAMSYLTNSTLYGG 743 (836) Q Consensus 684 G~Gt~~~p~Gi~~~~~~~~-~t-------~aa~~~t~~~l~~a~~~l~~~~------------~~~~~~~~vmnp~~~~~ 743 (836) |+|+ ++|.||++..+... .+ .+.+.++++++..+...+..-. ...++.+|+|||.++.. T Consensus 205 G~G~-~qP~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~ 283 (383) T protein:vir:78 205 GDGN-DKPIGLNRKVGKGSTVVDGVYAEKAATGTLTFANPKTTVNELTDVYKYHSVKENGHPLNVAGKVTLLVNPTDAWD 283 (383) T ss_pred ccCC-CCceeeeeccCCcccccccccccccccchhhhhhhHHHHHHHHHHHhccchhcccchhhhcCceEEEEcCcchhh Confidence 9996 68999987433211 11 1123344555555544443211 11245678888876543 Q ss_pred HH---HHhhccCccccccCCCCeeccee--eEeeCccccceEEEEehhceEEEeecceEEEEecccccccCcEEEEEEEE Q lcl|NC_016164. 744 FK---TTEKATSTAQFVLEPGGTVNGYN--VVRSNQVANGDVFFGVWNQMIMGMWGALDIQVNPYALDKSGSVRVTALQD 818 (836) Q Consensus 744 L~---~lkd~~g~~~~~~~~~~~l~G~p--Vv~s~~~~~~~i~~gD~s~~~i~~~~~l~i~~~~~~~~~~~~~~~r~~~r 818 (836) +. ...+.+|.+ .+++|+| |+.++.+|+++++||||++|.+++++++++..+++.+|.+|++.||+..| T Consensus 284 ~~~~~~~~~~~G~~-------~t~l~~~~~iv~s~~~p~~~iifgdfs~Y~i~~r~~~~i~~~~~~~f~~d~~~f~~~~r 356 (383) T protein:vir:78 284 VKKQYTSLNANGVY-------VTALPFNLNIIESLFVPEKKAISYVAERYDALIGGPLDIGTYDQTLAIEDLNLYAAKQF 356 (383) T ss_pred hccchhccCCCCce-------eeecCCCceEEecCCCCcccEEEeeccceEEEecccceEEecchhhhhcCceEEEEEEE Confidence 22 122333332 2455555 77899999999999999999999999999999999999999999999999 Q ss_pred eccEEEcccceEEEeecC Q lcl|NC_016164. 819 VDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 819 ~d~~v~~p~Af~~l~~A~ 836 (836) +|+++++++||++++.++ T Consensus 357 ~dG~~~~~~A~~vl~~~~ 374 (383) T protein:vir:78 357 AYGKAKDDKAAAVWTLNI 374 (383) T ss_pred EcCEEecCCeEEEEEEEe Confidence 999999999999999999 No 99 >protein:vir:4856 Length: 293 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049396;genbank:gi:9632424;genbank:GeneID:1258532 Probab=100.00 E-value=8.3e-47 Score=273.23 Aligned_cols=263 Identities=16% Similarity=0.206 Sum_probs=219.4 Q ss_pred hhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhccee-eecCCceEEEEEecC-CceeeeeccCccccc-ccccc Q lcl|NC_016164. 557 HRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTM-LTGLQGPVAIPRQTG-AATAYWVAEGGDPTE-SQPSV 633 (836) Q Consensus 557 ~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~-~~~~~~~~~~p~~~~-~~~a~~v~Eg~~~~~-~~~~~ 633 (836) .....+.++.+.|++++|+.+...|++.+++.+++++++..+ ++...+.+.+|+... .+.+.|++|++++++ ++++| T Consensus 1 ~l~~~~~~t~~~gg~liP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~ 80 (293) T protein:vir:48 1 MLDSKTDHSGSDAGLTIPQDIRTAINTLVRQYDSLQEYVNVENVTTLTGSRVYEKWTDITGLANIDDEAGKIADIDDPKL 80 (293) T ss_pred CceeecccccCcCceEechhHHHHHHHHHHhhhhhhhhceeeeccCCcceEEEEeecCCCcceeeecCCcccccccccce Confidence 122234445567788899999999999999999999985433 233456788887754 577999999999997 57999 Q ss_pred eeEEeeeeeeeeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccccccccccchhHH Q lcl|NC_016164. 634 DQVALVAKTLGAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTENFGATNPTYV 713 (836) Q Consensus 634 ~~it~~~~t~~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~aa~~~t~~ 713 (836) +++++++++++++++||+|+|.|+.++++++|.++|+++++++++..|++|+|++.. .++..+++ T Consensus 81 ~~i~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~g~~~~~~---------------~~~~~~~d 145 (293) T protein:vir:48 81 SLIKYTIKRYAGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILGVVDKLPT---------------KPTLTKWD 145 (293) T ss_pred eEEEEeeeEEEEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHhHHhhccccccc---------------cccccCHH Confidence 999999999999999999999999999999999999999999999999999875321 23456899 Q ss_pred HHHHHHHHHhhhccccCccEEEecHHHHHHHHHHhhccCcccccc----CCCCeecceeeEeeCc--ccc-----ceEEE Q lcl|NC_016164. 714 ELVSMESKVAADNADIGAMSYLTNSTLYGGFKTTEKATSTAQFVL----EPGGTVNGYNVVRSNQ--VAN-----GDVFF 782 (836) Q Consensus 714 ~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~~g~~~~~~----~~~~~l~G~pVv~s~~--~~~-----~~i~~ 782 (836) +|.+++.++..++. .+++|+|||++|..|++++|.+|+++|.. +.+++|+|+||++++. +|. ..++| T Consensus 146 ~i~~~~~~l~~~~~--~~a~~vmn~~~~~~L~~lkd~~g~~l~~~~~~~~~~~~l~G~Pv~~~~~~~~~~~~~~~~~~~~ 223 (293) T protein:vir:48 146 DIIDLEAKVDPAIK--QTSFFLTNTSGFTALKKVKNALGDYLMERDVKSPTGYSIAGFAVKEISDRWLPNASSGVMPLYF 223 (293) T ss_pred HHHHHHHhhhhhhc--CCCEEEEcHHHHHHHHHhhccCCceEeecCcCCCCCceecceeeEEecccccCCccCCceEEEE Confidence 99999999987754 57899999999999999999999998753 3456899999987543 332 24899 Q ss_pred Eehhc-eEEEeecceEEEEecc--cccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 783 GVWNQ-MIMGMWGALDIQVNPY--ALDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 783 gD~s~-~~i~~~~~l~i~~~~~--~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) |||++ |.+++++++++..+++ .+|.+|++.||++.|+|+++++|+||++++.+- T Consensus 224 gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~ 280 (293) T protein:vir:48 224 GDLKQAVTLFDRQQMSLLSTNIGGGAFETDTTKVRVIDRFDVVATDTEAFVPASFKA 280 (293) T ss_pred EeccceEEEEEecceEEEEecccchhhhcCeEEEEEEEeeCcEEecccceEEEEeec Confidence 99997 6788999999999875 468999999999999999999999999888444 No 100 >protein:vir:962 Length: 397 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076616;genbank:gi:13095724;genbank:GeneID:920264 Probab=100.00 E-value=3.6e-45 Score=264.26 Aligned_cols=383 Identities=10% Similarity=0.038 Sum_probs=212.7 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhh Q lcl|NC_016164. 391 GAAAATVAPLSHNDNNHMDSSTIDMEAVRAQAAADERSRVASITSLCREHKADDLAQGLIESGASEADAMRSVLSEIAKR 470 (836) Q Consensus 391 ~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~ei~al~~~~~l~e~a~eliee~~t~~e~~~~~l~~l~~~ 470 (836) .... ... .............+.... .........++.........++ + ........+.+... T Consensus 1 m~~k-~~~--l~~~~~el~~~l~eL~e~----~~~l~~~~~el~~~~ee~~~~e-------~----~~~~~~~~~~l~~~ 62 (397) T protein:vir:96 1 MALK-QLI--LNKQIKERSSEIDKLLSQ----RSDLEKQENDLERALEEAKTDE-------E----ISTVSDSADDLEKQ 62 (397) T ss_pred CcHH-HHH--HHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHhhhhHH-------H----HHHHHHHHHHHHHH Confidence 0000 000 000000000000000000 0000000001000000000000 0 00001111111111 Q ss_pred hhhHHHHHHHHhhhhhhHHHHhhhhhhhhhhhHHHHHHHhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhhh Q lcl|NC_016164. 471 PAAQPATPAAPVRSAQPIAAGGGSADIGLTDKEARSFSFVRAIRAQMMPGDRAAFEAAAFEREVSEATAQRMGVTPRGIL 550 (836) Q Consensus 471 ~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~~~~~a~~~~~~~g~~~~g~~ 550 (836) ........................... ................. ............... T Consensus 63 i~~l~~~i~~~~~~~~~l~~~~~~~~~--~~~~~~~~~~~~~~~~~------------~~~~~~~~~~~~~~~------- 121 (397) T protein:vir:96 63 VKDLDEKIAELQKEKQDLEDELAKAAD--PTDQKPKDGEKRKMKKF------------KVTEEELAEKRSAIN------- 121 (397) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhh--hhhhhhHHHHHHHHHHH------------hhhhHHHHHHHHHHH------- Confidence 111111111000000000000000000 00000000000000000 000000000000000 Q ss_pred hhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEec-CCceeeeeccCccccc- Q lcl|NC_016164. 551 APNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQT-GAATAYWVAEGGDPTE- 628 (836) Q Consensus 551 ~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~-~~~~a~~v~Eg~~~~~- 628 (836) ..................++.++|+.+...+++ ++....+++++.. .+...+...+|... ++..+.|++|++..++ T Consensus 122 ~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~-~~~~~~l~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~ 199 (397) T protein:vir:96 122 AFVKSKGAEKRDGFTSVEGGALIPQELLQPQLE-PKDIVDLSKYVRS-VPVNSASGKFPVISKSGSKMATVQQLEKNPQL 199 (397) T ss_pred HHHHhhhhhhhhcccccccccchhHHHHHHHHH-hhhhhhHHHhhhh-ccccccceeEEEEeccCCcccccccccccccc Confidence 000001111223344566778889999998887 4666777776543 44444455555443 3567889999999986 Q ss_pred ccccceeEEeeeeeeeeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCccccccccccccccccccccc Q lcl|NC_016164. 629 SQPSVDQVALVAKTLGAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTENFGAT 708 (836) Q Consensus 629 ~~~~~~~it~~~~t~~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~aa~ 708 (836) +.++|++++++++++++++++|+++|.|+.++++++|.+.|+++++++++.+|++|+|++ .|. + T Consensus 200 ~~~~~~~i~~~~~~~~~~~~~s~ell~ds~~~l~~~i~~~l~~~~~~~~~~~i~~g~g~~-~~~---------------~ 263 (397) T protein:vir:96 200 ANPKMVEIDYSVATRRGYIPISQEMIDDASYDVTGLIADEIQDQSLNTKNADIAAVLKTA-TAK---------------S 263 (397) T ss_pred ccccccceeecHhHhhcchhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccc-ccc---------------c Confidence 689999999999999999999999999999999999999999999999999999999863 232 2 Q ss_pred chhHHHHHHHHHHHhhhccccCccEEEecHHHHHHHHHHhhccCcccccc----CCCCeecceeeEeeCcccc------c Q lcl|NC_016164. 709 NPTYVELVSMESKVAADNADIGAMSYLTNSTLYGGFKTTEKATSTAQFVL----EPGGTVNGYNVVRSNQVAN------G 778 (836) Q Consensus 709 ~~t~~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~~g~~~~~~----~~~~~l~G~pVv~s~~~~~------~ 778 (836) ..++++|.++++.+...+ .+++|+|||.+|..|+.++|++|+|+|.. +.+++|+|+||++++.+.. . T Consensus 264 ~~~~d~~~~~~~~~~~~~---~~a~~v~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~ 340 (397) T protein:vir:96 264 VVGVDGLKDLINKEIKKV---YDVKLFISASMYSELDKLKDKNGRYLLQDSITAASGKQLLGKEVVVLDDDVIGKSVGNV 340 (397) T ss_pred ccchHHHHHHHHHhhhhh---cCcEEEEcHHHHHHHHHhhccCCCeEeccCccCCCcccccccceEEecccccCCCCCce Confidence 356888888887654443 35789999999999999999999998743 3456899999998765432 2 Q ss_pred eEEEEehhc-eEEEeecceEEEEecccccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 779 DVFFGVWNQ-MIMGMWGALDIQVNPYALDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 779 ~i~~gD~s~-~~i~~~~~l~i~~~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) .++||||++ |.+++++++.+.++++..| .+.+|+++|+|+++++|+||++++..+ T Consensus 341 ~~~~gd~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~r~d~~~~~~~a~~~~~~~~ 396 (397) T protein:vir:96 341 VGFIGDAKAFASFFDRKQVSVSWVDNNIY---GQLLAGIIRYDVKATDKKAGFYVTFTI 396 (397) T ss_pred EEEEeehhcceEeEeecceEEEEeccccc---ceeEEEEEEEccEEecccceEEEEeec Confidence 489999997 6789999999999887655 457999999999999999999998766 No 101 >protein:vir:4197 Length: 314 # NCBI annotation: putative structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071822;genbank:gi:11863105;genbank:GeneID:1257607 Probab=100.00 E-value=6.6e-38 Score=224.44 Aligned_cols=285 Identities=13% Similarity=0.085 Sum_probs=218.5 Q ss_pred HHHHHHhhhhhhhhhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEecCCc Q lcl|NC_016164. 536 EATAQRMGVTPRGILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQTGAA 615 (836) Q Consensus 536 ~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~~~~ 615 (836) -+++++.. ......+ ....+|++++|+.+. .+++.+++.+++++++.++.+.......+++.+.+. T Consensus 1 ~~~~~~~~------------~~~k~it-~~d~~gG~L~P~~~~-~~i~~l~e~s~i~~~a~vi~t~~s~~~~i~~i~~g~ 66 (314) T protein:vir:41 1 MDFLNKPF------------QITPKID-VPDLGKGILAVQRFG-EFVREVRENSAIIKDARVLNALKSYEVDISRISLGV 66 (314) T ss_pred CchhhhHH------------Hhhcccc-cccCCCceeChHHHH-HHHHHHHhccchhhheeeecccCccceeecccccCc Confidence 11111111 1111122 234467889998875 699999999999999765555566778888765432 Q ss_pred ----eeeeeccCcccccccccceeEEeeeeeeeeeehhHHHHHhcchh--HHHHHHHHHHHHHHHHHHHHHHHhhcCCc- Q lcl|NC_016164. 616 ----TAYWVAEGGDPTESQPSVDQVALVAKTLGAYTEFSRRLMLQSSI--DVEQMVRTELATVIALEIDRAALYGLGSN- 688 (836) Q Consensus 616 ----~a~~v~Eg~~~~~~~~~~~~it~~~~t~~~~i~ISrelL~ds~~--~l~~~i~~~l~~a~a~~~d~~il~G~Gt~- 688 (836) ...|.+|..+.++++++|+++.+.++++...+.||+++|.|+.. +++++|...|++++++.++..+++|+|+. T Consensus 67 ~~~~~~~~~~~~~~~~~~~~tf~~~~l~~~kl~~~v~is~e~L~D~a~~~~le~~i~~~~Ae~~g~~~~~~~~nGdg~~~ 146 (314) T protein:vir:41 67 ELEPGRNTSGTKVAPTADEVTVSTNTLEMKELVTKVVLEDEALEDNIEQSAFEQTITSLLASGVTYDLECFFLHADSSLT 146 (314) T ss_pred ccccccccccCCccCCcccccccceeeeeEEEEEeecccHHHHHhhhchhhHHHHHHHHHHHHHHHHHHHHhhccccCCc Confidence 23456677778899999999999999999999999999999865 89999999999999999999999999863 Q ss_pred ------ccccccccccccccc--cccccchhHHHHHHHHHHHhhhcccc-CccEEEecHHHHHHHHHHhhccCcccc--- Q lcl|NC_016164. 689 ------SQPEGLKFVTGINTE--NFGATNPTYVELVSMESKVAADNADI-GAMSYLTNSTLYGGFKTTEKATSTAQF--- 756 (836) Q Consensus 689 ------~~p~Gi~~~~~~~~~--t~aa~~~t~~~l~~a~~~l~~~~~~~-~~~~~vmnp~~~~~L~~lkd~~g~~~~--- 756 (836) ++|+|++..++.... +.++...+.+.+.+++.+|...|+.. ++.+|+||+.++.+++.+++.++++.. T Consensus 147 s~~~~~~~p~G~l~~a~~~~~~~~~~~~~~~~~~~~~l~~sl~~~yr~~~~~~~~~m~~~t~~~~r~~l~~~~~~l~~~~ 226 (314) T protein:vir:41 147 TGRELYRINDGWMKLAGNQYTDAEPEDENWPLNLFDGMMDELDTRYLQLKPRMKFYVSNEIYNGYRKQLLVRETGLGDSA 226 (314) T ss_pred CcccchhcchhhhhhcccceeecCccccccHHHHHHHHHHhcCchhhcCCCceEEEecHHHHHHHHHHHhccCCcccchh Confidence 378899876544333 23334456777889999999988643 468999999999999999998887753 Q ss_pred -ccCCCCeecceeeEeeCccc-----cceEEEEehhceEEEeecceEEEEecccccccCcEEEEEEEEeccEEEcccceE Q lcl|NC_016164. 757 -VLEPGGTVNGYNVVRSNQVA-----NGDVFFGVWNQMIMGMWGALDIQVNPYALDKSGSVRVTALQDVDVAVRHPEAFC 830 (836) Q Consensus 757 -~~~~~~~l~G~pVv~s~~~~-----~~~i~~gD~s~~~i~~~~~l~i~~~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~ 830 (836) ..+.+.+|+|+||+.++.+| +..++||||+++.++.+..+.+. ++.+..++++.|.+..|+|+.+.++.|.+ T Consensus 227 ~~~~~~~~l~G~PV~~~~~~~~~~~~~~~i~fgd~~nlv~~~~~~ir~~--~~~~a~~~~~~~~~~~r~d~~~~~~~aa~ 304 (314) T protein:vir:41 227 LIGATGLQYDGIPIQYVPALDALGDDKARALLTVPTNLVYGFWRNIRIE--PKRDAAMRRTEYIASLRADCNYEDENAAV 304 (314) T ss_pred hhCCCCceecceeeEecccccccCCCCceEEEechhheEEEeeceeEEe--ecccCcCCeEEEEEEEEeceEEEEcCcEE Confidence 33456689999999998874 57899999999988777666554 55566889999999999999999886665 Q ss_pred E--EeecC Q lcl|NC_016164. 831 R--GNDNL 836 (836) Q Consensus 831 ~--l~~A~ 836 (836) + +++|= T Consensus 305 ~~~~~~~~ 312 (314) T protein:vir:41 305 AAVIDMSS 312 (314) T ss_pred EEEeeccC Confidence 4 34444 No 102 >protein:vir:4159 Length: 315 # NCBI annotation: structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046968;genbank:gi:9630538;genbank:GeneID:1261712 Probab=100.00 E-value=3.1e-37 Score=220.70 Aligned_cols=290 Identities=12% Similarity=0.071 Sum_probs=215.0 Q ss_pred HhhhhhhhhhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEecCC----ce Q lcl|NC_016164. 541 RMGVTPRGILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQTGA----AT 616 (836) Q Consensus 541 ~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~~~----~~ 616 (836) .+-...-....+... ..+ .+ ....+|++++|+... .+++.+.+.+++++++..+.++......++....+ .. T Consensus 1 ~~~~~~~~~~~~~~~-~k~-~t-~~d~~Gg~l~P~~~~-~~i~~~~e~s~~l~~~~vi~~~~~~~~~i~~~g~~~~~~~g 76 (315) T protein:vir:41 1 MLTIEDIRGGKPFEI-VPK-ID-VPDLGRGVLSVDRFG-EFVKAVRDSAVIIPEARIDNALKSYEKDISRLSLVLDVGPG 76 (315) T ss_pred CcccchhhcCChhhh-hhh-cC-CcCCCCceechHHHH-HHHHHHHhhhhhhhhceeeeccccccccccccccCcccccc Confidence 000000000001111 111 22 334468888888765 58899999999999976555554444445443322 12 Q ss_pred eeeeccCcccccccccceeEEeeeeeeeeeehhHHHHHhcch--hHHHHHHHHHHHHHHHHHHHHHHHhhcCCc-----c Q lcl|NC_016164. 617 AYWVAEGGDPTESQPSVDQVALVAKTLGAYTEFSRRLMLQSS--IDVEQMVRTELATVIALEIDRAALYGLGSN-----S 689 (836) Q Consensus 617 a~~v~Eg~~~~~~~~~~~~it~~~~t~~~~i~ISrelL~ds~--~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~-----~ 689 (836) ..|.+|.++.++++++|+++.+.++++...+.||+++|.|+. ++++++|..++++++++.++.++++|+|+. + T Consensus 77 ~~~~~~~~~~~~~~~~f~~~~l~~~~l~~~~~it~elL~D~~~~~~~e~~l~~~~a~~~a~~~~~~~~nGdg~s~~p~~~ 156 (315) T protein:vir:41 77 RDETGQKLAPPESTAEVKTNTLYMREMVTKVVIHEDAIEDNIEGKAFEQKIVTLLGEGISYVLEKYYLHGDTSSSDPLLR 156 (315) T ss_pred cccccCcCCCCCCccccceeeeceeeeeeeccccHHHHHhhhccccHHHHHHHHHHHHHHHHHHHHhhccCCcCcCcccc Confidence 357778888899999999999999999999999999999985 489999999999999999999999999875 3 Q ss_pred ccccccccccccc----ccccccchhHHHHHHHHHHHhhhcccc-CccEEEecHHHHHHHHHHhhccCcccc----ccCC Q lcl|NC_016164. 690 QPEGLKFVTGINT----ENFGATNPTYVELVSMESKVAADNADI-GAMSYLTNSTLYGGFKTTEKATSTAQF----VLEP 760 (836) Q Consensus 690 ~p~Gi~~~~~~~~----~t~aa~~~t~~~l~~a~~~l~~~~~~~-~~~~~vmnp~~~~~L~~lkd~~g~~~~----~~~~ 760 (836) +|+|++..+.... .+..+...+.+.|.+++.+|...|+.. .+++|+||+.++..++.+++.+|++++ ..+. T Consensus 157 ~~~G~l~~a~~~~~~~~~~~~a~~~~~d~l~~l~~sl~~~yr~~~~~~~~imn~~t~~~~rklk~~~g~~lw~~~~~~g~ 236 (315) T protein:vir:41 157 MSDGWLKLASEKLTESDVDPEAEDWPMNLFDTMIESLPTPYRNNLPNMKFYVTWDIYRAYRDALKGRETGLGDQALTGAN 236 (315) T ss_pred ccccceecccccccccccccccccccHHHHHHHHHhcChHHhhcCCceEEEEcHHHHHHHHHHhccCCCccccchhhcCC Confidence 5589987554332 222334456778999999999988654 468999999999999999999998765 2345 Q ss_pred CCeecceeeEeeCccc-----cceEEEEehhceEEEeecceEEEEecccccccCcEEEEEEEEeccEEEcccceEEEeec Q lcl|NC_016164. 761 GGTVNGYNVVRSNQVA-----NGDVFFGVWNQMIMGMWGALDIQVNPYALDKSGSVRVTALQDVDVAVRHPEAFCRGNDN 835 (836) Q Consensus 761 ~~~l~G~pVv~s~~~~-----~~~i~~gD~s~~~i~~~~~l~i~~~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A 835 (836) +.+|+|+||+..+.|| ++.++||||++|.++++.++.+..+++ ..++.+.|.+..|+|+.+.++++.++...- T Consensus 237 ~~tl~G~PV~~~~~m~~~~~~~~~ilf~d~~nl~~~~~~~i~i~~~~~--a~~~~~~~~~~~r~d~~~~~~~~~a~~~~~ 314 (315) T protein:vir:41 237 SILYDGRPVQYVPALEALNDGKSRALFVVPTQLVYGFWRNIKVVPDYD--AEMRLTKYVASLRTDNHYEDEEGAVSATIT 314 (315) T ss_pred CceecccceEecccccccCCCCccEEEecccceEEEeccccEEEeeec--CCCCceEEEEEEEeceeEEeccceeEeeee Confidence 6799999999999885 567999999999999998888776654 566889999999999988887764443333 Q ss_pred C Q lcl|NC_016164. 836 L 836 (836) Q Consensus 836 ~ 836 (836) | T Consensus 315 v 315 (315) T protein:vir:41 315 V 315 (315) T ss_pred C Confidence 3 No 103 >protein:vir:3158 Length: 321 # NCBI annotation: capsid protein gpE # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665929;genbank:gi:22091115;genbank:GeneID:951342 Probab=100.00 E-value=1.6e-34 Score=205.88 Aligned_cols=291 Identities=11% Similarity=0.066 Sum_probs=215.4 Q ss_pred HHHHHHHHhhhhhhhhhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEecC Q lcl|NC_016164. 534 VSEATAQRMGVTPRGILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQTG 613 (836) Q Consensus 534 ~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~~ 613 (836) +++...... .....+....+.. ...+++++|+.+...+++.+.+.+++++++ ++++.......++.... T Consensus 1 ~~~k~~~~~---------l~~~~~~~~~~~~-~~~~g~~v~~~~~~~l~~~i~e~s~~l~~i-~v~~v~~~~~~i~~~~~ 69 (321) T protein:vir:31 1 MASRTINND---------LSRITEKNALTVD-DLDAGGTLPDPLWDEFWTDMIEETPLLDAI-RTETVGAKKTRIPTLNI 69 (321) T ss_pred CchHHHHHH---------HHHHHHhcccccc-ccCCcceeCHHHHHHHHHHHHHhhhhhhhc-eeeeccCcceeeeeecc Confidence 111111100 0111222222223 334455666677888999999999999984 45666667777888776 Q ss_pred Cceeeeec-cC-cccccccccceeEEeeeeeeeeeehhHHHHHhcch--hHHHHHHHHHHHHHHHHHHHHHHHhhcCCcc Q lcl|NC_016164. 614 AATAYWVA-EG-GDPTESQPSVDQVALVAKTLGAYTEFSRRLMLQSS--IDVEQMVRTELATVIALEIDRAALYGLGSNS 689 (836) Q Consensus 614 ~~~a~~v~-Eg-~~~~~~~~~~~~it~~~~t~~~~i~ISrelL~ds~--~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~ 689 (836) +....|++ |+ +..+.++++|+++++.++++...+.||+++|.|+. ++++++|...|++++++.++..+++|+|++. T Consensus 70 ~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~k~~~~~~it~e~L~d~a~~~d~e~~i~~~ia~~~a~~~~~~~~nGd~~~~ 149 (321) T protein:vir:31 70 GERHRRPQDEGEWNENESDVSTGTIDISTEKATVAWDLPREVVQENPEGEALADRILNLMTDAWSADVEDLAANGDEDAE 149 (321) T ss_pred CCcccccccccccccccccceeeeeeeeeEEEEeehhccHHHHHhhhcchhHHHHHHHHHHHHHHHHHHhheeeccccCC Confidence 66677776 33 34556789999999999999999999999999875 5899999999999999999999999999765 Q ss_pred cc-----ccccccc--ccccccccccchhHHHHHHHHHHHhhhccccCccEEEecHHHHHHHHH-HhhccCcccc----c Q lcl|NC_016164. 690 QP-----EGLKFVT--GINTENFGATNPTYVELVSMESKVAADNADIGAMSYLTNSTLYGGFKT-TEKATSTAQF----V 757 (836) Q Consensus 690 ~p-----~Gi~~~~--~~~~~t~aa~~~t~~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~-lkd~~g~~~~----~ 757 (836) .| +|++... +......+++.++++.+.+++..|...|++.++.+|+||++++..+.. +++.+ .+.+ . T Consensus 150 ~~~~~~n~G~l~~a~~~~~~~~~~~~~~~~d~l~~l~~~l~~~yr~~~~~v~im~~~~~~~~~~~l~~~~-~~~~~~~l~ 228 (321) T protein:vir:31 150 DSFENQNDGFITVAEGDVETIDAADDILDNDLVIRTIAGLDSKYRARMNPALIVSEDQLLSYHYTLTDRD-TPLGDNVIM 228 (321) T ss_pred CcccccchhhhhhhccccccccccccccCHHHHHHHHHhccHhHhcCCCeEEEechHHHHHHHHHHhcCC-Cccccchhh Confidence 44 6887643 333444555667889999999999999887778899999999887765 55544 3432 2 Q ss_pred cCCCCeecceeeEeeCccccceEEEEehhceEEEeecceEEEEecccccc---cCcEEEEEEEEeccEEEcccceEEEee Q lcl|NC_016164. 758 LEPGGTVNGYNVVRSNQVANGDVFFGVWNQMIMGMWGALDIQVNPYALDK---SGSVRVTALQDVDVAVRHPEAFCRGND 834 (836) Q Consensus 758 ~~~~~~l~G~pVv~s~~~~~~~i~~gD~s~~~i~~~~~l~i~~~~~~~~~---~~~~~~r~~~r~d~~v~~p~Af~~l~~ 834 (836) .+.+.+|+|+||+.++.||++.++|+||+++.++.+.++.+......... .+.+......++|+.|-+++|++.+++ T Consensus 229 ~~~~~tl~G~pvv~~~~mP~~~il~t~~~nl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ve~~~a~a~~~~ 308 (321) T protein:vir:31 229 GEADVNPFSFPIIGSGLWPDDKAMFTDPQNLIYALYRDLEIDVLTESDKVSERDLHARYFMRGDDDFAIENTEAVVLAEG 308 (321) T ss_pred ccccccccceeEEEcCCCCCCcEEEeccccEEEEEeeccEEEEeecCccccccceeeEeeeeeecceeEeccccEEEEec Confidence 33456899999999999999999999999999999998888776654432 334444556678999999999999884 Q ss_pred ---cC Q lcl|NC_016164. 835 ---NL 836 (836) Q Consensus 835 ---A~ 836 (836) ++ T Consensus 309 i~~~~ 313 (321) T protein:vir:31 309 LGDPL 313 (321) T ss_pred CCcch Confidence 22 No 104 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=99.96 E-value=6.2e-31 Score=186.19 Aligned_cols=258 Identities=12% Similarity=0.087 Sum_probs=208.7 Q ss_pred hhcccccccccccchhhHHHHHHHHHhhhhhhhhccee--eecC-CceEEEEEecCCceeeeeccCcccccccccceeEE Q lcl|NC_016164. 561 VVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTM--LTGL-QGPVAIPRQTGAATAYWVAEGGDPTESQPSVDQVA 637 (836) Q Consensus 561 ~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~--~~~~-~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~it 637 (836) +..+.+..+..++|+.++..+++.+.....+.++.... +.+. +..+++|+....+.+.|++||+.++.++++++.++ T Consensus 1 MA~~~T~~~~~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~a~~v~eg~~i~~~~~~~~~~~ 80 (272) T protein:vir:30 1 MAVGTTKMAQMLDPEVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWDYIGDAEDVAEGEAIPMTQLGFKKTT 80 (272) T ss_pred CCCccccchheechHHHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEecCCCCcccccCCCcccccccccceEE Confidence 34344566788999999999999999988888775532 2222 34699999988889999999999999999999999 Q ss_pred eeeeeeeeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccccccccccchhHHHHHH Q lcl|NC_016164. 638 LVAKTLGAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTENFGATNPTYVELVS 717 (836) Q Consensus 638 ~~~~t~~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~aa~~~t~~~l~~ 717 (836) +.+++++..+.+|++++.++.+++.+.+.+.+++++++.++..++....... ....+..+++.|.+ T Consensus 81 ~~~~~~~~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a~--------------~~~~~~~t~d~i~d 146 (272) T protein:vir:30 81 MTIKKAGKGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSKST--------------QTVEATATVDGVSK 146 (272) T ss_pred EEeeeeeeeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhcccc--------------cccccccCHHHHHH Confidence 9999999999999999999999999999999999999999999986432110 01123357889999 Q ss_pred HHHHHhhhccccCccEEEecHHHHHHHHHHhhcc--C-----ccccccCCCCeecceeeEeeCccccceEEEEehhceEE Q lcl|NC_016164. 718 MESKVAADNADIGAMSYLTNSTLYGGFKTTEKAT--S-----TAQFVLEPGGTVNGYNVVRSNQVANGDVFFGVWNQMIM 790 (836) Q Consensus 718 a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~~--g-----~~~~~~~~~~~l~G~pVv~s~~~~~~~i~~gD~s~~~i 790 (836) ++..|...+ .....|+|||.++..|+..+..+ + ......+.-++++|+||++|+.+|.+++|+.+...+.+ T Consensus 147 a~~~l~~~~--~~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~g~ig~i~G~~Vi~s~~~p~~t~~~~~~~a~~~ 224 (272) T protein:vir:30 147 ALDIFNDED--DAETVIVMNPADASTLRLDAAKEWLGATEVGANRVVSGVYGEVLGVQIVRSRKCPKGTAYMVRKGALRI 224 (272) T ss_pred HHHHHhccC--CCccEEEEcHHHHHHHHHhccccccccccccccccccccchhhcCeeEEEcCCCCcceEEEEcCCeEEE Confidence 999887665 34578999999999987654221 1 11223344568999999999999999999999898888 Q ss_pred EeecceEEEEecccccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 791 GMWGALDIQVNPYALDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 791 ~~~~~l~i~~~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) +.+.++++..+.+ ..++...+++..++++++.+|+++++++.+- T Consensus 225 ~~~~~~~ve~~r~--~~~~~~~i~~~~~~~~~v~~~~~vv~~t~~~ 268 (272) T protein:vir:30 225 MLKRNTMVETDRD--ITKAINQIVANKHYGVYLYKAEKAVKITLKD 268 (272) T ss_pred EecCCceeeeccc--cccceeEEEEEEEEEEEEEcCCceEEEEecc Confidence 8888888776654 4567899999999999999999999999888 No 105 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=99.96 E-value=6.2e-31 Score=186.19 Aligned_cols=258 Identities=12% Similarity=0.087 Sum_probs=208.7 Q ss_pred hhcccccccccccchhhHHHHHHHHHhhhhhhhhccee--eecC-CceEEEEEecCCceeeeeccCcccccccccceeEE Q lcl|NC_016164. 561 VVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTM--LTGL-QGPVAIPRQTGAATAYWVAEGGDPTESQPSVDQVA 637 (836) Q Consensus 561 ~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~--~~~~-~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~it 637 (836) +..+.+..+..++|+.++..+++.+.....+.++.... +.+. +..+++|+....+.+.|++||+.++.++++++.++ T Consensus 1 MA~~~T~~~~~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~a~~v~eg~~i~~~~~~~~~~~ 80 (272) T protein:vir:98 1 MAVGTTKMAQMLDPEVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWDYIGDAEDVAEGEAIPMTQLGFKKTT 80 (272) T ss_pred CCCccccchheechHHHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEecCCCCcccccCCCcccccccccceEE Confidence 34344566788999999999999999988888775532 2222 34699999988889999999999999999999999 Q ss_pred eeeeeeeeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccccccccccchhHHHHHH Q lcl|NC_016164. 638 LVAKTLGAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTENFGATNPTYVELVS 717 (836) Q Consensus 638 ~~~~t~~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~aa~~~t~~~l~~ 717 (836) +.+++++..+.+|++++.++.+++.+.+.+.+++++++.++..++....... ....+..+++.|.+ T Consensus 81 ~~~~~~~~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a~--------------~~~~~~~t~d~i~d 146 (272) T protein:vir:98 81 MTIKKAGKGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSKST--------------QTVEATATVDGVSK 146 (272) T ss_pred EEeeeeeeeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhcccc--------------cccccccCHHHHHH Confidence 9999999999999999999999999999999999999999999986432110 01123357889999 Q ss_pred HHHHHhhhccccCccEEEecHHHHHHHHHHhhcc--C-----ccccccCCCCeecceeeEeeCccccceEEEEehhceEE Q lcl|NC_016164. 718 MESKVAADNADIGAMSYLTNSTLYGGFKTTEKAT--S-----TAQFVLEPGGTVNGYNVVRSNQVANGDVFFGVWNQMIM 790 (836) Q Consensus 718 a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~~--g-----~~~~~~~~~~~l~G~pVv~s~~~~~~~i~~gD~s~~~i 790 (836) ++..|...+ .....|+|||.++..|+..+..+ + ......+.-++++|+||++|+.+|.+++|+.+...+.+ T Consensus 147 a~~~l~~~~--~~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~g~ig~i~G~~Vi~s~~~p~~t~~~~~~~a~~~ 224 (272) T protein:vir:98 147 ALDIFNDED--DAETVIVMNPADASTLRLDAAKEWLGATEVGANRVVSGVYGEVLGVQIVRSRKCPKGTAYMVRKGALRI 224 (272) T ss_pred HHHHHhccC--CCccEEEEcHHHHHHHHHhccccccccccccccccccccchhhcCeeEEEcCCCCcceEEEEcCCeEEE Confidence 999887665 34578999999999987654221 1 11223344568999999999999999999999898888 Q ss_pred EeecceEEEEecccccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 791 GMWGALDIQVNPYALDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 791 ~~~~~l~i~~~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) +.+.++++..+.+ ..++...+++..++++++.+|+++++++.+- T Consensus 225 ~~~~~~~ve~~r~--~~~~~~~i~~~~~~~~~v~~~~~vv~~t~~~ 268 (272) T protein:vir:98 225 MLKRNTMVETDRD--ITKAINQIVANKHYGVYLYKAEKAVKITLKD 268 (272) T ss_pred EecCCceeeeccc--cccceeEEEEEEEEEEEEEcCCceEEEEecc Confidence 8888888776654 4567899999999999999999999999888 No 106 >protein:vir:79548 Length: 652 # NCBI annotation: putative protease/scaffold protein # Family: family:all:62 # ACLAME annotation(s): go:0008236 - serine-type peptidase activity; phi:0000017 - phage prohead/capsid assembly # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272518;genbank:gi:148609387;genbank:GeneID:5204384 Probab=99.95 E-value=5.6e-29 Score=175.46 Aligned_cols=539 Identities=12% Similarity=0.069 Sum_probs=289.8 Q ss_pred hhhcccccceEEEEEecCcccccccCcEEEeccccccchhhhcCCCceEeecCCCCCcceEEEEEEEecCCEEEEEEEEc Q lcl|NC_016164. 233 VARAKEDPEVVEFTFSSEQPVERYFGMEVLSHDPDAMNMSRLNSGAAPWLWNHNAEVVLGVVERAWMGDDRRGRVRTRWS 312 (836) Q Consensus 233 ~~~~d~~~rt~~~~~~~~~~v~~~~~~e~l~~~~~a~~~~~~~~~~~~lL~~H~~~~~iG~v~~~~~~e~~~~~a~~~f~ 312 (836) .++-++....|.++- + +- +++ +++..|- +. |...+- +.. ..+++...| T Consensus 1 ~~a~~~~~aei~iy~--~--Ig-~w~-----vta~~~~-~~--------L~~l~~---~~~-I~i~INSpG--------- 48 (652) T protein:vir:79 1 MQAGHQSDADIYIYD--E--IG-FWG-----VTAKQFI-SD--------LNALGD---ITH-INLHINSPG--------- 48 (652) T ss_pred CCCCCCCCceEEEEe--e--cc-ccc-----CCHHHHH-HH--------HHhcCC---Cce-EEEEEeCCC--------- Confidence 333333333333221 0 00 111 2222221 00 111110 000 112222111 Q ss_pred CCcccccHHHHHHHHHHhc--CccceeeeeeEeeccccccCCCCeEEEE---EEEEEEEEEEeccCccchhhh------- Q lcl|NC_016164. 313 PNTKIEGSEEYKRRQDWES--GTIRNVSFMYSIDAPLDLTSREGMALVT---AFTPMEVSAVSIPADHTVGQG------- 380 (836) Q Consensus 313 ~~~~~~~~~~~~~~~~v~~--G~l~~~SiG~~v~~~~~~~~~~~~~~~~---~~~l~EiS~V~~pA~~~a~v~------- 380 (836) ...-.+..||..+|. +.+..+-.|+-.---+..-..+..+.+- .+-++..|- ++.-++.-. T Consensus 49 ----G~V~~G~aIyn~lk~~~~~v~~~i~G~AAS~ASvIa~ag~~~~m~~~a~lMIH~p~~---~~~G~a~dl~~~a~~L 121 (652) T protein:vir:79 49 ----GDVFEGIAIFNALKTHGASITVYVDGVAASMASVIAMVGNPVIMPENTFMMIHKPFG---FTGGDAEDMRTYADLL 121 (652) T ss_pred ----CChhHHHHHHHHHhhcCCCeEEEEeehhhhHHHHHHhcCCeEEcCCCceEEEEcccc---ccccCHHHHHHHHHHH Confidence 122234456666654 3333333332110000000011100000 011111111 111111100 Q ss_pred --------------------------------------------------------hhhhhhhhhhhhhhhhhhhhhhhh Q lcl|NC_016164. 381 --------------------------------------------------------RKATSSSGPPGAAAATVAPLSHND 404 (836) Q Consensus 381 --------------------------------------------------------~~~~~~~~~~~~~~~~~~~~~~~~ 404 (836) .+... ..........+...... T Consensus 122 ~~~~~~i~~~Ya~ktG~~~e~i~~~m~~etwlta~EA~e~Gf~D~i~~~~~~~a~~~~~~~--~~~~~~p~~~~~~~~~~ 199 (652) T protein:vir:79 122 DKVEAVLLPAYAQKTGKTTDEIAAMLADETWMSGAECLAQGFADQVTPAVKAMACIQSKRT--EEFKKMPDSIRNMITPP 199 (652) T ss_pred HHHHHHHHHHHHHhhCCCHHHHHHHHhhhcCCCHHHHHhcCCcccccchhhhhhhhhhhhh--hhhhhhHHHHHHHhccc Confidence 00000 00000000000000000 Q ss_pred hh------hhhh-----------hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhh--hhhh-hhhhhhhhhhHHHHHHHHH Q lcl|NC_016164. 405 NN------HMDS-----------STIDMEAVRAQAAADERSRVASITSLCREHK--ADDL-AQGLIESGASEADAMRSVL 464 (836) Q Consensus 405 ~~------~~~~-----------~~~~~e~~~~~~~~~~~~~~~ei~al~~~~~--l~e~-a~eliee~~t~~e~~~~~l 464 (836) .. ..++ ...+..........+.+.+...|+++...++ ..++ ++.+++.+.+.++++..++ T Consensus 200 ~~~~~~v~d~EPa~~~~pvqAaAP~~De~airAq~~aeeraRi~~I~~l~a~Fggr~~~l~~~~l~d~~~s~e~ar~~il 279 (652) T protein:vir:79 200 RNSAPRVQDDEPAASRTPVQAAAPVVDENSIRAQVLAEQKARVNGINDLFAMFGGRYQTLQAQCLADPECSLEQAREKLL 279 (652) T ss_pred ccccccccccccccccccccccCCcCchhHHHHHHHHHHHHHHHHHHHHHHhhccccchHHHHHhhccCCCHHHHHHHHH Confidence 00 0000 0001112223344556677778888877764 2333 4557788889999999999 Q ss_pred HHhhhhhhhHHHHHHHHhhhhhhHHHHhhhhhhhhhhhHHHHHHHhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhh Q lcl|NC_016164. 465 SEIAKRPAAQPATPAAPVRSAQPIAAGGGSADIGLTDKEARSFSFVRAIRAQMMPGDRAAFEAAAFEREVSEATAQRMGV 544 (836) Q Consensus 465 ~~l~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~~~~~a~~~~~~~g~ 544 (836) +++.....-..... + ........... +......+ ..|......+..+.+.++.+.+++++++.+.|. T Consensus 280 ~~l~~~~~p~~~~~--------~--~~~~~~~g~~~-~d~~~~aL--~~R~g~~~~~~~~~~~g~~L~elAr~~L~~~G~ 346 (652) T protein:vir:79 280 NEMGRESTPSNKNT--------P--AHIYAGNGNFV-GDGIRQAL--MARAGFEKTERDNVYNGMTLREYARMSLTERGI 346 (652) T ss_pred HHHHhhcCCCCCCc--------c--eeEeeccchhh-HHHHHHHH--HhhcCCcccccCccccCccHHHHHHHHHHhhcc Confidence 99854322111000 0 00000000111 11111111 122233344555778899999999999999998 Q ss_pred hhhhhhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhhh-----hhhhhcceeeecCCceEEEEEecCCceeee Q lcl|NC_016164. 545 TPRGILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRL-----ALNTLGVTMLTGLQGPVAIPRQTGAATAYW 619 (836) Q Consensus 545 ~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~-----~l~~l~~~~~~~~~~~~~~p~~~~~~~a~~ 619 (836) ...+.. +.....++. ++++++ +|..+.+.+.+.|++.+ .+.+++.+....+++..+..+.++.+.+.. T Consensus 347 ~~~~~~-~~~~v~~A~-~hsTsD-----Fp~IL~~~~nk~l~~~y~~a~~t~~~~~~~~~~~DFk~~~~~~lg~~~~L~~ 419 (652) T protein:vir:79 347 GVSSYN-PMQMVGAAF-THSTSD-----FGNILLDVANKAILQGWEDAPETYEQWTRKGQLSDFKIAHRVGMGGFSALRQ 419 (652) T ss_pred CCCCCC-HHHHHHHHh-hcCcch-----HHHHHHHHHHHHHHHHHhhhHHHHHHHhccCCCccccccceeecCCCCCccc Confidence 888764 334444443 233222 35556666666655543 567777777788999999999999999999 Q ss_pred eccCcccccccccceeEEeeeeeeeeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcc----cccccc Q lcl|NC_016164. 620 VAEGGDPTESQPSVDQVALVAKTLGAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSNS----QPEGLK 695 (836) Q Consensus 620 v~Eg~~~~~~~~~~~~it~~~~t~~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~----~p~Gi~ 695 (836) |.|+++++.++...+..++++.|||+++.||||+|+||++++++.|...|+++.++++++.++.-.-+|+ ..+.|| T Consensus 420 V~E~gEyk~~t~~e~~e~~~l~tyG~~~~iTRqaiINDDL~a~~~ip~~~g~aA~~~~~~~vy~~l~~Np~~~~DGk~LF 499 (652) T protein:vir:79 420 VREGAEYKYVTTGDKQATIALATYGELFSITRQAIINDDLNMLTDVPMKLGRAAKSTIADLVYAILTSNPKISTDNVSLF 499 (652) T ss_pred cCCCCccceeeecCccceeeeecccCeeeeehheeeccchhHHHHHHHHHHHHHHHHHHHHHHHHHhcCcccccCCceee Confidence 9999999999999999999999999999999999999999999999999999999999997765443332 345688 Q ss_pred -cccccccccccccchhHHHHHHHHHHHhhhcc-----ccCccEEEecHHHHHHHHHHhhccCccc--cccCCCCeecce Q lcl|NC_016164. 696 -FVTGINTENFGATNPTYVELVSMESKVAADNA-----DIGAMSYLTNSTLYGGFKTTEKATSTAQ--FVLEPGGTVNGY 767 (836) Q Consensus 696 -~~~~~~~~t~aa~~~t~~~l~~a~~~l~~~~~-----~~~~~~~vmnp~~~~~L~~lkd~~g~~~--~~~~~~~~l~G~ 767 (836) |+.|.|..+. ++++.+.|.+++..|..+.. +..+..|+..|........+..+...+. ...+..+.+.|+ T Consensus 500 ~hA~H~Nl~~~--aa~~~~~l~~ar~aM~~Qk~g~~~l~i~P~~llvp~~le~~a~~ll~s~~v~~a~~~~~~~Np~~~~ 577 (652) T protein:vir:79 500 DKAKHANVLES--AAMDVASLDKARQLMRVQKEGERHLNIRPAFVLVPTAMESVANQVIRSSSVKGADINAGIINPVKDF 577 (652) T ss_pred ccccccccccc--ccCCHHHHHHHHHHHHHhccCCccccccccEEEecchhHHHHHHHhccCCCcccccccccccccccc Confidence 6677676543 35788889999988888752 2345566666666555555432221110 011112334453 Q ss_pred -eeEeeCcccc---ceEEEEehhc---eEEEeecceE-EEEecccccccCcEEEEEEEEeccEEEcccceEEEee Q lcl|NC_016164. 768 -NVVRSNQVAN---GDVFFGVWNQ---MIMGMWGALD-IQVNPYALDKSGSVRVTALQDVDVAVRHPEAFCRGND 834 (836) Q Consensus 768 -pVv~s~~~~~---~~i~~gD~s~---~~i~~~~~l~-i~~~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~ 834 (836) .||+.+.+.. ..+|+++... +.+++-.|.+ ...+....|..+.+.|+++.+|+++++|-.++++.+. T Consensus 578 ~~~i~eprL~~~s~~~wylaa~~~~dtiev~yL~G~~~P~ie~~~gf~~dG~~~kvrlD~G~~~iD~RG~~k~t~ 652 (652) T protein:vir:79 578 ATVIAEPRLDDNSQTTFYLAASKGSDTIEVAYLNGVDTPYIDQMEGFSVDGVTTKVRIDAGVAPVDHRGLVKCTA 652 (652) T ss_pred cccccccccCCCCcccEEEecCCCCCeEEEEEecCCCCCeeeecCCCCcceEEEEEEEeccCceeeccceeeecC Confidence 6777887754 3477776654 4444433322 2223345799999999999999999999999999999 No 107 >protein:vir:95512 Length: 693 # NCBI annotation: Putative Clp protease # Family: family:all:62 # ACLAME annotation(s): go:0008236 - serine-type peptidase activity; phi:0000017 - phage prohead/capsid assembly # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293349;genbank:gi:148912770;genbank:GeneID:5228164 Probab=99.93 E-value=4.3e-27 Score=165.16 Aligned_cols=567 Identities=12% Similarity=0.026 Sum_probs=284.1 Q ss_pred cCcccccchhHHHhhhccccch--------hhhhhhhhhcccccceEEEEEecCcccccccCcEEE--eccccccchhhh Q lcl|NC_016164. 205 MDQNDGRSLMDLRELNSEPLYR--------SAVVADVARAKEDPEVVEFTFSSEQPVERYFGMEVL--SHDPDAMNMSRL 274 (836) Q Consensus 205 ~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~d~~~rt~~~~~~~~~~v~~~~~~e~l--~~~~~a~~~~~~ 274 (836) |-. ..+-.-+...+||.. ...-+.++...++. .++++ |.+|= .+++..|.- T Consensus 1 ~~~----~~~~~~~~~~~p~~~~~~~~~~~~~~w~~i~~~~~~~--~ei~i----------y~~Ig~wgita~~f~~--- 61 (693) T protein:vir:95 1 MGS----HQTLIHKNLMLPMAAALTEANAPHESWYSIKAAGRGV--AEVLL----------YDEIGVWGITALQFAR--- 61 (693) T ss_pred CCC----CcceeehhhhhcccccccCCCCCCCcceeeeecCCCe--eEEEE----------eecccccccCHHHHHH--- Confidence 110 000000111111100 00011122211111 11111 11110 011111100 Q ss_pred cCCCceEeecCCCCCcceEEEEEEEecCCEEEEEEEEcCCcccccHHHHHHHHHHhc--CccceeeeeeEeeccccccCC Q lcl|NC_016164. 275 NSGAAPWLWNHNAEVVLGVVERAWMGDDRRGRVRTRWSPNTKIEGSEEYKRRQDWES--GTIRNVSFMYSIDAPLDLTSR 352 (836) Q Consensus 275 ~~~~~~lL~~H~~~~~iG~v~~~~~~e~~~~~a~~~f~~~~~~~~~~~~~~~~~v~~--G~l~~~SiG~~v~~~~~~~~~ 352 (836) -|-.. |- .. ...+++. ++..+.-.+..||..+|. +.+.-.-.|+-.---+..-.. T Consensus 62 ------~L~~~------~d--------~~--~I~v~IN-SpGGdV~~G~aIyn~Lk~~~~~Vtv~vdGlAASaASvIama 118 (693) T protein:vir:95 62 ------DLKAM------GD--------LT--KINLHIH-SPGGDVFEGTAIYNLLRNHPASVDVYIDGLAASMASVIAMA 118 (693) T ss_pred ------HHHhc------CC--------Cc--eeEEEEE-CCCCchhhHHHHHHHHhhcCCCeEEEEeehhhhHHHHHHhc Confidence 01111 00 00 1122221 122233446667777776 444443333311100111111 Q ss_pred CCeEEEE---EEEEEEEEEEeccCccchhhhh-hhhh---------------hhhh------------------------ Q lcl|NC_016164. 353 EGMALVT---AFTPMEVSAVSIPADHTVGQGR-KATS---------------SSGP------------------------ 389 (836) Q Consensus 353 ~~~~~~~---~~~l~EiS~V~~pA~~~a~v~~-~~~~---------------~~~~------------------------ 389 (836) +..+.+- .+-++..+-+..+ ++.-.+ .... .... T Consensus 119 gd~i~m~~~a~~MIH~p~~~~~G---na~dl~~~a~~L~~~~~~i~~~Y~~ktG~~~e~i~~~m~~etwlta~EAve~Gf 195 (693) T protein:vir:95 119 GDTIYMPENAMMMVHKPWGIQGG---DADDMRRYAELLDKVEDTLVMAYANKTGKSADDIKALLKEETWMNGREAVAAGF 195 (693) T ss_pred CCeEEecCCCeEEEEcccccccc---CHHHHHHHHHHHHHHHHHHHHHHHHhhCCCHHHHHHHHhhhcCCCHHHHHhccc Confidence 1111111 0112222222211 111000 0000 0000 Q ss_pred ----hhhhhh----hhhhhhhhhhhhh--------------hhh-------h----------hhh-hhhhhhhhhhhhhh Q lcl|NC_016164. 390 ----PGAAAA----TVAPLSHNDNNHM--------------DSS-------T----------IDM-EAVRAQAAADERSR 429 (836) Q Consensus 390 ----~~~~~~----~~~~~~~~~~~~~--------------~~~-------~----------~~~-e~~~~~~~~~~~~~ 429 (836) ...... ...........+. ... . .+. ...+......+..+ T Consensus 196 ~Dei~e~~~~~a~~~~~~~~~~~~~p~~l~~~~~~~~~~p~~~~~~PaPTPaaaaPaaP~aaap~~adirA~~~aae~~r 275 (693) T protein:vir:95 196 ADQLTEPLQAAAHLSSKRMQEFAHMPEALKTLLAPRAQTPAAPANTPAPTPASAAPAAPVAAAPTEADIRARILAEESGR 275 (693) T ss_pred hhhhhhhhHHHHhhHHHHHHHhhchHHHHHHHHhhhcccccccccCcccCccCCCCCCCccCCCCcchhhHHHHHHHHHH Confidence 000000 0000000000000 000 0 000 00011112223334 Q ss_pred hhhhhhhhhhhhh---hhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhHHHHHHHHhhhhhhHHHHhhhhhhhhhhhHHHH Q lcl|NC_016164. 430 VASITSLCREHKA---DDLAQGLIESGASEADAMRSVLSEIAKRPAAQPATPAAPVRSAQPIAAGGGSADIGLTDKEARS 506 (836) Q Consensus 430 ~~ei~al~~~~~l---~e~a~eliee~~t~~e~~~~~l~~l~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 506 (836) ...|.++...... +..++-+.+.+.+.++++..+|+.+.....-....... ............ .... T Consensus 276 ~aaI~a~fa~f~~~~a~l~a~~l~d~~~s~d~ar~~lL~~l~~~~~p~~~~~~~---------~~~~~~~g~~~~-d~~~ 345 (693) T protein:vir:95 276 RSAITAAFGAFSTGHAELLATCLNDMNITVDQAREKLLAAIGADTQPAAALSAG---------AHIHAGNGNLVG-DSVR 345 (693) T ss_pred HHHHHHHHHhccCChHHHHHHHHhhcCCCHHHHHHHHHHHHhhccCCCCCcCcC---------ccccCCchhHHH-HHHH Confidence 4445554433221 22344445667888888888888876432211100000 000000000000 1111 Q ss_pred HHHhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHH Q lcl|NC_016164. 507 FSFVRAIRAQMMPGDRAAFEAAAFEREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLR 586 (836) Q Consensus 507 ~~~~~a~~a~~~~~~~~~~~~~~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~ 586 (836) .++...... ......+.+.++.+.+++++++.+.|....++... ....++. ++++++ +|..+.+.+.+.++ T Consensus 346 ~al~~R~g~--~~~~~~n~~~g~~L~elAr~~L~~rg~~~~~~~~~-~~~~~a~-~htTSD-----Fp~IL~~~~nk~l~ 416 (693) T protein:vir:95 346 ASVLARIGR--GERQADNAYNGMTLRELARASLVDRGIGVASLNAP-QMVGLAF-THTSSD-----FGLILLDVANKSVL 416 (693) T ss_pred HHHHHhcCc--ccccCCccccCCcHHHHHHHHHHhcCCccCCCCHH-HHHHHHH-hcCcch-----hHHHHHHHHHHHHH Confidence 112211111 11222344778999999999999999888776543 3333333 233222 35556665655555 Q ss_pred hh-----hhhhhhcceeeecCCceEEEEEecCCceeeeeccCcccccccccceeEEeeeeeeeeeehhHHHHHhcchhHH Q lcl|NC_016164. 587 NR-----LALNTLGVTMLTGLQGPVAIPRQTGAATAYWVAEGGDPTESQPSVDQVALVAKTLGAYTEFSRRLMLQSSIDV 661 (836) Q Consensus 587 ~~-----~~l~~l~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~it~~~~t~~~~i~ISrelL~ds~~~l 661 (836) .. ..+.+++.+....+++..+....++.+.+..|.|+++++.++......++.+.|||+++.||||+|+||++++ T Consensus 417 ~~y~~a~~t~~~~~~~~~~~DFk~~~~~~lg~~~~L~~V~E~gEyk~~t~~e~~e~~~l~tyG~~~~iTRqaiINDDLga 496 (693) T protein:vir:95 417 AGWEEAEETFPLWTKSGILTDFKPARRVGLGEFSSLRQVREGAEYKYVTLGERGEQIILATYGELFSITRQAIINDDLQM 496 (693) T ss_pred HHHHhhhhHHHHHhccCCCCcccccceeecCCCCChhhcCCCCceeeeecCCccceeehhhcCCeeeecHHhhhccchHH Confidence 53 3566777777778999999989999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhcCCc---ccccccccccccccccccccchhHHHHHHHHHHHhhhcc----------c Q lcl|NC_016164. 662 EQMVRTELATVIALEIDRAALYGLGSN---SQPEGLKFVTGINTENFGATNPTYVELVSMESKVAADNA----------D 728 (836) Q Consensus 662 ~~~i~~~l~~a~a~~~d~~il~G~Gt~---~~p~Gi~~~~~~~~~t~aa~~~t~~~l~~a~~~l~~~~~----------~ 728 (836) ++.|...|+++.++++++.++.-...| ...+.|||++|.|..+.+++.++.+.+.+++.+|..+.. + T Consensus 497 ~~~ip~~~g~aA~~~~~~~vy~~L~~Np~m~DGk~LFhadH~Nl~tga~sals~~sl~~a~~am~~qk~~~~~~~g~~L~ 576 (693) T protein:vir:95 497 LSDIPFKLGQAAKATIGDLVYAVLTGNPAMSDGKTLFHADHSNLLTGAASALSIDSLSKAKTQMATQKAQVEKGKGRTLN 576 (693) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCccccCCcceeeccccccccccccccChHHHHHHHHHHHHhhcchhccCCceee Confidence 999999999999999998776443322 345779999999988877788899999999998877652 2 Q ss_pred cCccEEEecHHHHHHHHHHhhccCccc--cccCCCCeecce-eeEeeCccc---cceEEE-Eehhc--eEEEeecceE-E Q lcl|NC_016164. 729 IGAMSYLTNSTLYGGFKTTEKATSTAQ--FVLEPGGTVNGY-NVVRSNQVA---NGDVFF-GVWNQ--MIMGMWGALD-I 798 (836) Q Consensus 729 ~~~~~~vmnp~~~~~L~~lkd~~g~~~--~~~~~~~~l~G~-pVv~s~~~~---~~~i~~-gD~s~--~~i~~~~~l~-i 798 (836) ..+..|+..+........+..+...+. ...+..+.+.|+ .||+.+.+. ...||+ +|... +.+++-.|.+ . T Consensus 577 i~P~~llvP~~le~~a~~l~~s~~~~~a~~~~~~~NP~~~~~~vi~~prL~~~s~~~Wyl~a~~~~dtie~~yL~G~~~P 656 (693) T protein:vir:95 577 IRPGFVLTPVALEDKANQIINSESVPGADVNSGIVNPIRAFAQVIGEPRLDDASATAWYMAAKKGSDTIEVAYLDGVDTP 656 (693) T ss_pred cccceEEecchHHHHHHHHhccccccccccccccccchhccccccccceecCCCCCceEEecCCCCCeEEEEEecCCCCC Confidence 345566666666655555543221110 011112235554 577777774 345665 44432 4444444432 2 Q ss_pred EEecccccccCcEEEEEEEEeccEEEcccceEEEeec Q lcl|NC_016164. 799 QVNPYALDKSGSVRVTALQDVDVAVRHPEAFCRGNDN 835 (836) Q Consensus 799 ~~~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A 835 (836) .......|..+.+.|+++.+|+++++|-.++++-.-| T Consensus 657 ~ie~~~gf~~dG~~~kvr~D~G~~~iD~Rg~~kn~GA 693 (693) T protein:vir:95 657 YLEQQEGFTVDGVASKVRIDAGVAPLDFRGLQKSNGA 693 (693) T ss_pred eEeecCCCCcceEEEEEEEeccCceeeccccccCCCC Confidence 2333457999999999999999999999999998888 No 108 >protein:vir:93742 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240459;genbank:gi:66396126;genbank:GeneID:5133511 Probab=99.89 E-value=1e-24 Score=152.17 Aligned_cols=259 Identities=13% Similarity=0.085 Sum_probs=205.0 Q ss_pred hhcccccccccccchhhHHHHHHHHHhhhhhhhhcceee--ecC-CceEEEEEecCCceeeeeccCcccccccccceeEE Q lcl|NC_016164. 561 VVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTML--TGL-QGPVAIPRQTGAATAYWVAEGGDPTESQPSVDQVA 637 (836) Q Consensus 561 ~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~--~~~-~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~it 637 (836) +....+..+..++|+.+...+.+.+.....+.+++.... .+. +..+++|+....+.+.++.||..++.++++.+..+ T Consensus 1 ma~~~T~~~~~iiPev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~~~~eg~~i~~~~it~~~~~ 80 (274) T protein:vir:93 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKRE 80 (274) T ss_pred CCccceehhheechHHHHHHHHHHHHhhhhhcccccccccccCCCCCEEEEEeeccCCCcccccCCCcccccccccceeE Confidence 344455667889999999999999988888888765432 222 33789999887778889999999999999999999 Q ss_pred eeeeeeeeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccccccccccchhHHHHHH Q lcl|NC_016164. 638 LVAKTLGAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTENFGATNPTYVELVS 717 (836) Q Consensus 638 ~~~~t~~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~aa~~~t~~~l~~ 717 (836) +.+++++..+.++++....+..++...+.+.++.++++.++..++....++. .+..+..++++.+.+ T Consensus 81 ~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~~d~~~~~~~~~a~-------------~~~~~~~~~~d~i~d 147 (274) T protein:vir:93 81 AKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAK-------------LTVNADITKLNGLQS 147 (274) T ss_pred EEeeeecccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhccc-------------ccccccccCHHHHHH Confidence 9999999999999999888888899999999999999999999887553321 112234467899999 Q ss_pred HHHHHhhhccccCccEEEecHHHHHHHHHHh--h---cc--CccccccCCCCeecceeeEeeCccccceEEEEehhceEE Q lcl|NC_016164. 718 MESKVAADNADIGAMSYLTNSTLYGGFKTTE--K---AT--STAQFVLEPGGTVNGYNVVRSNQVANGDVFFGVWNQMIM 790 (836) Q Consensus 718 a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lk--d---~~--g~~~~~~~~~~~l~G~pVv~s~~~~~~~i~~gD~s~~~i 790 (836) ++.+|..++. ...+++|||..+..|++.. . .. |.+....+.-++++|++|++++.+|.++.|+.+...+.+ T Consensus 148 A~~~l~d~~~--~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~p~~t~~l~~~gai~~ 225 (274) T protein:vir:93 148 AIDKFNDEDL--EPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAGTAILAKKGAVKL 225 (274) T ss_pred HHHHhhhccC--CccEEEeCHHHHHHHHhhhhhcccccccccccceeecccceecCeeEEEcCCCCcceEEEEeCCeEEE Confidence 9998876543 4568999999999887532 1 11 122233445678999999999999999999999888888 Q ss_pred EeecceEEEEecccccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 791 GMWGALDIQVNPYALDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 791 ~~~~~l~i~~~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) +...++.+..+.. ..+....+++..++++++.+|+++++++.|- T Consensus 226 ~~~~~~~vE~~Rd--~~~~~d~i~~~~~y~~~~~~~~~~v~~t~~~ 269 (274) T protein:vir:93 226 ILKRDFFLEVARD--ASTKTTALYSDKHYVAYLYDESKAVKITKGS 269 (274) T ss_pred EecCCcccccccc--hhhcccEEEEEEEEEEEEEcCCceEEEeeCc Confidence 7777777655543 3446679999999999999999999999888 No 109 >protein:vir:96123 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240078;genbank:gi:66395742;genbank:GeneID:5133103 Probab=99.86 E-value=5.3e-23 Score=142.71 Aligned_cols=259 Identities=15% Similarity=0.125 Sum_probs=200.6 Q ss_pred hhcccccccccccchhhHHHHHHHHHhhhhhhhhcceee--ec-CCceEEEEEecCCceeeeeccCcccccccccceeEE Q lcl|NC_016164. 561 VVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTML--TG-LQGPVAIPRQTGAATAYWVAEGGDPTESQPSVDQVA 637 (836) Q Consensus 561 ~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~--~~-~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~it 637 (836) +....+..+.+++|+.++..+.+.+.....+.+++.... .+ .+..+++|+....+.+..+.||..++..+++.+..+ T Consensus 1 ma~~~T~~~d~i~Pev~s~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~~~~~g~~i~~~~it~~~~~ 80 (274) T protein:vir:96 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQIGTSKRE 80 (274) T ss_pred CCccccchhhhhhhHHHHHHHHHHHHhhhhhcccccccccccCCCCCEEEEEeeccCCCccccCCCCcCchhhcccceeE Confidence 333334557889999999999999888877777654321 11 134789999876677888999999999999999999 Q ss_pred eeeeeeeeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccccccccccchhHHHHHH Q lcl|NC_016164. 638 LVAKTLGAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTENFGATNPTYVELVS 717 (836) Q Consensus 638 ~~~~t~~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~aa~~~t~~~l~~ 717 (836) +.+++++..+.++.+....+..++...+.++++.++++.++..+++..... ..+..+..++++.|.+ T Consensus 81 ~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~~d~~i~~~l~~a-------------~~~~~~~~~~~d~i~d 147 (274) T protein:vir:96 81 AKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGA-------------TLTVEADITKLDGLQT 147 (274) T ss_pred EEEEeeeceeeecHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHHHHhcC-------------CCCcCcccccHHHHHH Confidence 999999999999999888887788999999999999999999888654221 1112234467899999 Q ss_pred HHHHHhhhccccCccEEEecHHHHHHHHHHhh--c-----cCccccccCCCCeecceeeEeeCccccceEEEEehhceEE Q lcl|NC_016164. 718 MESKVAADNADIGAMSYLTNSTLYGGFKTTEK--A-----TSTAQFVLEPGGTVNGYNVVRSNQVANGDVFFGVWNQMIM 790 (836) Q Consensus 718 a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd--~-----~g~~~~~~~~~~~l~G~pVv~s~~~~~~~i~~gD~s~~~i 790 (836) +...|..++. ....++|||..+..|++... . .|.+....+.-++++|++|++++.+|.++.|+.....+.+ T Consensus 148 A~~~l~d~~~--~~~~ivv~p~~~~~L~k~~~~~f~~~~~~g~~~~~~g~ig~~~G~~Vi~s~~~p~~t~~l~~~gA~~~ 225 (274) T protein:vir:96 148 AIDKFNDEDL--EPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKGEALLAKKGAVKL 225 (274) T ss_pred HHHHhcccCC--CceEEEeCHHHHHHHHhcccccccccccccccceeecccceecCeeEEEcCCCCcceEEEEeCcceee Confidence 9998876543 45789999999999876531 1 1122333445678999999999999999988887777777 Q ss_pred EeecceEEEEecccccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 791 GMWGALDIQVNPYALDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 791 ~~~~~l~i~~~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) +...++.+..+. ...+....+++..++++++++|+++++++.|- T Consensus 226 ~~~~~~~vE~~R--d~~~~~d~i~~~~~yg~~~~~~~~vv~~t~~~ 269 (274) T protein:vir:96 226 ITKRDFFLEKDR--DASRKSTALYSDKHYVAYLYDESKVVKITKGA 269 (274) T ss_pred eecCCccccccc--chhhcccEEEEeeEEEEEEEcCccEEEEEcCc Confidence 777776665443 34456778999999999999999999999998 No 110 >protein:vir:3613 Length: 272 # NCBI annotation: MHP # Family: family:all:522 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112699;genbank:gi:13786567;genbank:GeneID:921035 Probab=99.86 E-value=4.7e-23 Score=143.01 Aligned_cols=258 Identities=15% Similarity=0.090 Sum_probs=195.4 Q ss_pred hhcccccccccccchhhHHHHHHHHHhhhhhhhhcceee--ec-CCceEEEEEecCCceeeeeccCcccccccccceeEE Q lcl|NC_016164. 561 VVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTML--TG-LQGPVAIPRQTGAATAYWVAEGGDPTESQPSVDQVA 637 (836) Q Consensus 561 ~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~--~~-~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~it 637 (836) +..+.+....+++|+.+...+.+.+.....+.+++.... .+ .+..+++|.......+.++.||.+++..+++.++.+ T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~~~gda~~~~eg~~i~~~~lt~~~~~ 80 (272) T protein:vir:36 1 MSKQKTTLADLVNPEVLAPIVSYELNKALRFAPLAQVDTTLQGQPGNTLKFPAFTYIGDAADVAEGGEISLDKIGTTTKS 80 (272) T ss_pred CCCcceehhhhhchHHHHHHHHHHHHhhhhhccccccccccccCCCCEEEEeeeccCccccccCCCCccChhhcCCccee Confidence 333345567789999999999999888888888764322 22 244789999887777889999999999999999999 Q ss_pred eeeeeeeeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccccccccccchhHHHHHH Q lcl|NC_016164. 638 LVAKTLGAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTENFGATNPTYVELVS 717 (836) Q Consensus 638 ~~~~t~~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~aa~~~t~~~l~~ 717 (836) +.+++++..+.++++....+..++...+.++++.++++.++..++....+.. ...+..++++.|.+ T Consensus 81 ~~i~~~~k~~~vtD~~~~~~~~d~~~~~~~~~a~~~a~~~d~~i~~~l~~~~--------------~~~~~~~~~d~i~~ 146 (272) T protein:vir:36 81 VTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLANKVDDDLLSAAKTTS--------------QTVSTKANVDGVQA 146 (272) T ss_pred EeeehhhccccccHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcccc--------------ccccccccHHHHHH Confidence 9999999999999998888888899999999999999999998886542110 01123457889999 Q ss_pred HHHHHhhhccccCccEEEecHHHHHHHHHHhhcc------CccccccCCCCeecceeeEeeCccccceEEEEe----hhc Q lcl|NC_016164. 718 MESKVAADNADIGAMSYLTNSTLYGGFKTTEKAT------STAQFVLEPGGTVNGYNVVRSNQVANGDVFFGV----WNQ 787 (836) Q Consensus 718 a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~~------g~~~~~~~~~~~l~G~pVv~s~~~~~~~i~~gD----~s~ 787 (836) ++..|...+. ...+++|||..+..|++..... |...+.++.-++++|++|++|+.+|.++.++.. ... T Consensus 147 A~~~lgd~~~--~~~~ivv~p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~~~p~~~~~~~~~~~~~gA 224 (272) T protein:vir:36 147 ALDIFNDEDA--QAYVLIVNPKDAAKIRKDANAKNIGSEVGANALINGTYADVLGAQIVRSKKLAEGSALMFKIVSNSPA 224 (272) T ss_pred HHHHhhhcCC--CceEEEEcHHHHHHHhcccccccccccccccceeeeccceecCeeEEEeCCCCCCceeEEEEEecccc Confidence 9998877653 3568999999999987654322 222233444578999999999999988754322 233 Q ss_pred eEEEeecceEEEEecccccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 788 MIMGMWGALDIQVNPYALDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 788 ~~i~~~~~l~i~~~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) +.++...++.++.+++ .......+++..++++++++|+++++++.+= T Consensus 225 ~~~~~~~~~~vE~~R~--~~~~~d~i~~~~~y~~~v~~~~~vv~~t~~g 271 (272) T protein:vir:36 225 LKLVLKRGVQVETDRD--IVTKTTVITADEHYAAYLYDLTKVVNITFTG 271 (272) T ss_pred eeeeecCCcccccccc--hhhcCcEEEEEEEEEEEEEcCccEEEEeecC Confidence 4445556666554443 3445678999999999999999999999776 No 111 >protein:vir:97433 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240749;genbank:gi:66396420;genbank:GeneID:5133789 Probab=99.85 E-value=2.6e-22 Score=138.96 Aligned_cols=259 Identities=13% Similarity=0.084 Sum_probs=202.5 Q ss_pred hhcccccccccccchhhHHHHHHHHHhhhhhhhhcceee--ec-CCceEEEEEecCCceeeeeccCcccccccccceeEE Q lcl|NC_016164. 561 VVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTML--TG-LQGPVAIPRQTGAATAYWVAEGGDPTESQPSVDQVA 637 (836) Q Consensus 561 ~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~--~~-~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~it 637 (836) +....+....+++|+.+...+.+.+.....+.+++.... .+ .+..+++|.......+..+.||..++..+++.+..+ T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~a~~~~~g~~i~~~~lt~~~~~ 80 (274) T protein:vir:97 1 MPQGLTKTSDQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKRE 80 (274) T ss_pred CCccceehhheechHHHHHHHHHhhhhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCCCcccccccccceeE Confidence 333445667889999999999999988887777754421 22 245799998876677888999999999999999999 Q ss_pred eeeeeeeeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccccccccccchhHHHHHH Q lcl|NC_016164. 638 LVAKTLGAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTENFGATNPTYVELVS 717 (836) Q Consensus 638 ~~~~t~~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~aa~~~t~~~l~~ 717 (836) +.+++.+..+.++.+....+..+....+.++++.++++.++..++.-..++. .+..+..++++.+.+ T Consensus 81 ~~i~~~~~~~~i~D~~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~-------------~~~~~~~~~~d~i~d 147 (274) T protein:vir:97 81 AKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAK-------------LTVNADITKLNGLQS 147 (274) T ss_pred EEeeeecceecccHHHHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccC-------------ccccccccCHHHHHH Confidence 9999999999999998888777889999999999999999998886543221 111234467899999 Q ss_pred HHHHHhhhccccCccEEEecHHHHHHHHHHh--h----c-cCccccccCCCCeecceeeEeeCccccceEEEEehhceEE Q lcl|NC_016164. 718 MESKVAADNADIGAMSYLTNSTLYGGFKTTE--K----A-TSTAQFVLEPGGTVNGYNVVRSNQVANGDVFFGVWNQMIM 790 (836) Q Consensus 718 a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lk--d----~-~g~~~~~~~~~~~l~G~pVv~s~~~~~~~i~~gD~s~~~i 790 (836) +...|..++. ....++|||..+..|++.. . + .|.+...++.-++++|++|++++.+|.++.|+.....+.+ T Consensus 148 A~~~l~d~~~--~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~p~~t~~l~~~gA~~~ 225 (274) T protein:vir:97 148 AIDKFNDEDL--EPMVLFVNPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAGTAILAKKGAVKL 225 (274) T ss_pred HHHHhhccCC--CceEEEeCHHHHHHHHhhhhhhccccCcccccceeccccceecCeeEEEcCCCCcceEEEEeCcceEe Confidence 9998876543 4578999999999887531 1 1 1222334455678999999999999999998888888888 Q ss_pred EeecceEEEEecccccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 791 GMWGALDIQVNPYALDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 791 ~~~~~l~i~~~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) +...++.+..++.. ......+++..++++++.+|.++++++.+- T Consensus 226 ~~~~~~~vE~~Rd~--~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~ 269 (274) T protein:vir:97 226 ILKRDFFLEVARDA--STKTTALYSDKHYVAYLYDESKAVKITKGS 269 (274) T ss_pred eecCCceeccccch--hhcccEEEEEEEEEEEEEcCCceEEEecCc Confidence 77777776655543 445678899999999999999999999888 No 112 >protein:vir:94494 Length: 274 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240676;genbank:gi:66396348;genbank:GeneID:5133758 Probab=99.85 E-value=2.6e-22 Score=138.96 Aligned_cols=259 Identities=13% Similarity=0.084 Sum_probs=202.5 Q ss_pred hhcccccccccccchhhHHHHHHHHHhhhhhhhhcceee--ec-CCceEEEEEecCCceeeeeccCcccccccccceeEE Q lcl|NC_016164. 561 VVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTML--TG-LQGPVAIPRQTGAATAYWVAEGGDPTESQPSVDQVA 637 (836) Q Consensus 561 ~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~--~~-~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~it 637 (836) +....+....+++|+.+...+.+.+.....+.+++.... .+ .+..+++|.......+..+.||..++..+++.+..+ T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~a~~~~~g~~i~~~~lt~~~~~ 80 (274) T protein:vir:94 1 MPQGLTKTSDQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKRE 80 (274) T ss_pred CCccceehhheechHHHHHHHHHhhhhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCCCcccccccccceeE Confidence 333445667889999999999999988887777754421 22 245799998876677888999999999999999999 Q ss_pred eeeeeeeeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccccccccccchhHHHHHH Q lcl|NC_016164. 638 LVAKTLGAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTENFGATNPTYVELVS 717 (836) Q Consensus 638 ~~~~t~~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~aa~~~t~~~l~~ 717 (836) +.+++.+..+.++.+....+..+....+.++++.++++.++..++.-..++. .+..+..++++.+.+ T Consensus 81 ~~i~~~~~~~~i~D~~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~-------------~~~~~~~~~~d~i~d 147 (274) T protein:vir:94 81 AKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAK-------------LTVNADITKLNGLQS 147 (274) T ss_pred EEeeeecceecccHHHHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccC-------------ccccccccCHHHHHH Confidence 9999999999999998888777889999999999999999998886543221 111234467899999 Q ss_pred HHHHHhhhccccCccEEEecHHHHHHHHHHh--h----c-cCccccccCCCCeecceeeEeeCccccceEEEEehhceEE Q lcl|NC_016164. 718 MESKVAADNADIGAMSYLTNSTLYGGFKTTE--K----A-TSTAQFVLEPGGTVNGYNVVRSNQVANGDVFFGVWNQMIM 790 (836) Q Consensus 718 a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lk--d----~-~g~~~~~~~~~~~l~G~pVv~s~~~~~~~i~~gD~s~~~i 790 (836) +...|..++. ....++|||..+..|++.. . + .|.+...++.-++++|++|++++.+|.++.|+.....+.+ T Consensus 148 A~~~l~d~~~--~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~p~~t~~l~~~gA~~~ 225 (274) T protein:vir:94 148 AIDKFNDEDL--EPMVLFVNPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAGTAILAKKGAVKL 225 (274) T ss_pred HHHHhhccCC--CceEEEeCHHHHHHHHhhhhhhccccCcccccceeccccceecCeeEEEcCCCCcceEEEEeCcceEe Confidence 9998876543 4578999999999887531 1 1 1222334455678999999999999999998888888888 Q ss_pred EeecceEEEEecccccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 791 GMWGALDIQVNPYALDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 791 ~~~~~l~i~~~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) +...++.+..++.. ......+++..++++++.+|.++++++.+- T Consensus 226 ~~~~~~~vE~~Rd~--~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~ 269 (274) T protein:vir:94 226 ILKRDFFLEVARDA--STKTTALYSDKHYVAYLYDESKAVKITKGS 269 (274) T ss_pred eecCCceeccccch--hhcccEEEEEEEEEEEEEcCCceEEEecCc Confidence 77777776655543 445678899999999999999999999888 No 113 >protein:vir:105334 Length: 276 # NCBI annotation: putative phage major capsid protein # Family: family:all:522 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950669;genbank:gi:119967839;genbank:GeneID:4643213 Probab=99.85 E-value=1.5e-22 Score=140.16 Aligned_cols=259 Identities=12% Similarity=0.110 Sum_probs=203.3 Q ss_pred hhcccccccccccchhhHHHHHHHHHhhhhhhhhccee--eec-CCceEEEEEecCCceeeeeccCcccccccccceeEE Q lcl|NC_016164. 561 VVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTM--LTG-LQGPVAIPRQTGAATAYWVAEGGDPTESQPSVDQVA 637 (836) Q Consensus 561 ~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~--~~~-~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~it 637 (836) +....+.-..+++|+.+...+.+.+.....+.+++... +.+ .+..+++|.......+..+.||.+++..+++.++.+ T Consensus 1 Ma~~~T~l~d~i~Pev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~~igda~~~~eg~~i~~~~lt~~~~~ 80 (276) T protein:vir:10 1 MAQGTTTKSTQIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFVYSGDATVVPEGQKIPVDKIETNRRE 80 (276) T ss_pred CCcceeehhhhhchHHHHHHHHHHHHhhhhhcccceecccccCCCCCEEEeeeecCCCccccccCCCccCccccccceee Confidence 33334456778999999999999999988888886532 222 345799999877778889999999999999999999 Q ss_pred eeeeeeeeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccccccccccchhHHHHHH Q lcl|NC_016164. 638 LVAKTLGAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTENFGATNPTYVELVS 717 (836) Q Consensus 638 ~~~~t~~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~aa~~~t~~~l~~ 717 (836) ..+++++..+.+|++....+..+....+.++++.++++.++..++.-..+. ..+..+..++++.+.+ T Consensus 81 a~i~~~~k~~~~tD~a~~~~~~dp~~~~~~~~~~~~a~~~d~~~~~~l~~~-------------~~~~~~~~~t~d~i~~ 147 (276) T protein:vir:10 81 AKIHKIGKGTDITDEALLSGYGDPQGEAVRQHGLAIANKVDNDVLEALRGT-------------KLTVSADIGTLAGLEA 147 (276) T ss_pred EEeehccccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcc-------------cccccccccCHHHHHH Confidence 999999999999999988887888999999999999999999887532211 1122344568899999 Q ss_pred HHHHHhhhccccCccEEEecHHHHHHHHHHhhcc-------CccccccCCCCeecceeeEeeCccccceEEEEehhceEE Q lcl|NC_016164. 718 MESKVAADNADIGAMSYLTNSTLYGGFKTTEKAT-------STAQFVLEPGGTVNGYNVVRSNQVANGDVFFGVWNQMIM 790 (836) Q Consensus 718 a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~~-------g~~~~~~~~~~~l~G~pVv~s~~~~~~~i~~gD~s~~~i 790 (836) +...|..++. ...+++|||..+..|+++.... |.....++.-++++|++|++++.+|.++.|+.....+.+ T Consensus 148 A~~~lgd~~~--~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~p~~t~~l~~~gAi~~ 225 (276) T protein:vir:10 148 AIDTFDDEDL--EPMVLFINPKDAGKLRSSASDNFTRATELGDNIIVKGAFGEALGAVIVRSKKLDEGEAILAKRGAVKL 225 (276) T ss_pred HHHHhccccC--cccEEEEcHHHHHHHHHhccccccccccccccceeccccceecceeEEEcCCCCcceEEEEeccceee Confidence 9998876542 4568999999999997653211 122233444578999999999999999988877777777 Q ss_pred EeecceEEEEecccccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 791 GMWGALDIQVNPYALDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 791 ~~~~~l~i~~~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) +...++.++.++.. .+....+++...+++++.+|..+++++.|= T Consensus 226 ~~~~~~~vE~dRd~--~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~ 269 (276) T protein:vir:10 226 ITKRDFFLETDRDP--STKTTALYSDKHYVAYLYDESKAVKVTKGA 269 (276) T ss_pred eecCCceeecccch--hhcccEEEEeeEEEEEEEcCcceEEEecCC Confidence 77777777666543 445678899999999999999999999877 No 114 >protein:vir:80930 Length: 278 # NCBI annotation: Cps # Family: family:all:522 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468392;genbank:gi:157324966;genbank:GeneID:5601363 Probab=99.84 E-value=2.9e-22 Score=138.67 Aligned_cols=266 Identities=14% Similarity=0.098 Sum_probs=198.4 Q ss_pred hhcccccccccccchhhHHHHHHHHHhhhhhhhhcceee--ecC-CceEEEEEecCCceeeeeccCcccccccccceeEE Q lcl|NC_016164. 561 VVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTML--TGL-QGPVAIPRQTGAATAYWVAEGGDPTESQPSVDQVA 637 (836) Q Consensus 561 ~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~--~~~-~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~it 637 (836) +....+..+..++|+.+...+.+.+.....+.+++.... .+. +..+++|+......+.++.||+.++..+++.+..+ T Consensus 1 Ma~~~T~~~~~iiPev~s~~v~~~~~~~~v~~~~~~~~~~l~g~~G~tv~ip~~~~~g~a~~~~~g~~i~~~~lt~~~~~ 80 (278) T protein:vir:80 1 MADLTTKLANLIDPEVMGPMISAKLPKAIKFGKIAPIDNSLEGQPGSEITVPKYKYIGDAQDVAEGAAIDYSALETESVK 80 (278) T ss_pred CCCcceehhheecHHHHHHHHHHHHHHhhhhcccceecccccCCCCCEEEEeeeccCCcceeecCCCcCcccccccceee Confidence 222234457789999999999999988888877754322 222 34788999876677889999999999999999999 Q ss_pred eeeeeeeeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccccccccccchhHHHHHH Q lcl|NC_016164. 638 LVAKTLGAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTENFGATNPTYVELVS 717 (836) Q Consensus 638 ~~~~t~~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~aa~~~t~~~l~~ 717 (836) +.+++++..+.++++....+..++...+.++++.++++.++..+++..-+... ...+. .+.......++.+.+ T Consensus 81 ~~i~~~~~a~~v~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~l~~~l~~a~~-----~~~~~--~t~~~~~~~~~~~~d 153 (278) T protein:vir:80 81 HGIKKAGKGVKLTDESVLSGYGDPVEEAQKQIRMAIASKVDNDILEEALTTTL-----EVKGA--INIGLIDKIENTFTD 153 (278) T ss_pred EeeehhhccccccHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHHhcccc-----ccccc--cccchhhhHHHHHHH Confidence 99999999999999988888888999999999999999999988865421110 01111 111122234667778 Q ss_pred HHHHHhhhccccCccEEEecHHHHHHHHHHhhcc-------CccccccCCCCeecceeeEeeCccccceEEEEehhceEE Q lcl|NC_016164. 718 MESKVAADNADIGAMSYLTNSTLYGGFKTTEKAT-------STAQFVLEPGGTVNGYNVVRSNQVANGDVFFGVWNQMIM 790 (836) Q Consensus 718 a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~~-------g~~~~~~~~~~~l~G~pVv~s~~~~~~~i~~gD~s~~~i 790 (836) +..++..++... ..+++|||..+..|++....+ |.+...++.-++++|++|++|+.+|.++.|+.....+.+ T Consensus 154 a~~~l~~~~~~~-~~~ivv~p~~~~~L~k~~~~~~~~~~~~g~~~~~~G~ig~~~G~~Vi~s~~~p~~t~~l~~~gAi~~ 232 (278) T protein:vir:80 154 APDAIEDESITT-TGVLFLNYKDTAKLREEAAGSWTKASQLGDDLLVKGAFGELLGWEIVRTKKLADGNALAVKAGALKT 232 (278) T ss_pred HHHhhcccCCCc-ccEEEECHHHHHHHHhhhhhhccccccccccceeeccceeecceeEEEcCCCCcceEEEEeccceee Confidence 877777666543 346889999999887653211 233334455678999999999999999988877777767 Q ss_pred EeecceEEEEecccccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 791 GMWGALDIQVNPYALDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 791 ~~~~~l~i~~~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) +...++.+..++ ...+....+++..++++++++|+++++++.+= T Consensus 233 ~~~~~~~vE~~R--d~~~~~d~i~~~~~yg~~v~~~~~~v~it~~a 276 (278) T protein:vir:80 233 FLKRNLLAESGR--DMDHKLTKFNADQHYAVALVDETKAVKVVPVA 276 (278) T ss_pred eecCCccccccc--chhhccceeeeeeEEEEEEEcCcceEEEeecc Confidence 776776665544 34456678999999999999999999999888 No 115 >protein:vir:96833 Length: 275 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240157;genbank:gi:66395822;genbank:GeneID:5133174 Probab=99.83 E-value=5.7e-22 Score=137.03 Aligned_cols=260 Identities=13% Similarity=0.124 Sum_probs=198.4 Q ss_pred hhhcccccccccccchhhHHHHHHHHHhhhhhhhhccee--eec-CCceEEEEEecCCceeeeeccCcccccccccceeE Q lcl|NC_016164. 560 LVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTM--LTG-LQGPVAIPRQTGAATAYWVAEGGDPTESQPSVDQV 636 (836) Q Consensus 560 ~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~--~~~-~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~i 636 (836) +...+.+....+++|+.+...+.+.+.....+.+++... +.+ .+..+++|.....+.+..+.||..++..+++.+.. T Consensus 1 ~~~~~~T~l~d~i~PEv~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~~~lt~~~~ 80 (275) T protein:vir:96 1 MALENMTKLANMVNPEVLAPMMQAELDKKLKFAQFADIDNTLVGQPGNTITFPAFVYSGDAKVVPEGEEIPIDLIETKKR 80 (275) T ss_pred CCCcccchhhhhhchHHHHHHHHHHHHHhhhhcccceecccccCCCCCEEEeeeeccCCccccccCCCCcchhhccccee Confidence 111122445668899999999999999888888875432 222 23478999988777888999999999999999999 Q ss_pred EeeeeeeeeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccccccccccchhHHHHH Q lcl|NC_016164. 637 ALVAKTLGAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTENFGATNPTYVELV 716 (836) Q Consensus 637 t~~~~t~~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~aa~~~t~~~l~ 716 (836) +..+++++..+.++++....+..+....+.++++.++++.++..++.-.++.. .+..+..++++.|. T Consensus 81 ~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~ll~~l~~a~-------------~~~~~~~~~~d~i~ 147 (275) T protein:vir:96 81 QATIRKIGKGTVLTDEALLSGYGDPKGEAVRQHGLAIANKVDNDVLEALQGAT-------------LKVEADITKLAGLQ 147 (275) T ss_pred eEEeehhcccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhccc-------------ccccccccCHHHHH Confidence 99999999999999998877767788999999999999999998886543211 11233456799999 Q ss_pred HHHHHHhhhccccCccEEEecHHHHHHHHHHhhc-------cCccccccCCCCeecceeeEeeCccccceEEEEehhceE Q lcl|NC_016164. 717 SMESKVAADNADIGAMSYLTNSTLYGGFKTTEKA-------TSTAQFVLEPGGTVNGYNVVRSNQVANGDVFFGVWNQMI 789 (836) Q Consensus 717 ~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~-------~g~~~~~~~~~~~l~G~pVv~s~~~~~~~i~~gD~s~~~ 789 (836) +++..|..+. .....++|||..+..|++.... .|.....++.-++++|++|++|+.+|.++.|+.....+. T Consensus 148 dA~~~lgd~~--~~~~~ivv~p~~~~~L~k~~~~~f~~~~~~g~~~~~~G~ig~~~G~~Vi~s~~~p~~t~~i~~~gA~~ 225 (275) T protein:vir:96 148 TAIDKFNDED--LEPMVLFVNPLDAGKLRASATDNFTRATLLGDNVIVKGAFGEALGAIIVRSNKIKEGEAILAKRGAVK 225 (275) T ss_pred HHHHHhcccc--CCccEEEeCHHHHHHHHhcccccccccccccccceeccccceecCeeEEEeCCCCcceEEEEecccee Confidence 9999886543 2456899999999998765311 122233445557899999999999999988777666676 Q ss_pred EEeecceEEEEecccccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 790 MGMWGALDIQVNPYALDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 790 i~~~~~l~i~~~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) ++...++.++.++. ..+....+++..++++++++|+++++++..= T Consensus 226 ~~~~~~~~vE~~Rd--~~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~ 270 (275) T protein:vir:96 226 LITKRDFFLETERH--ASHKSTALFSDKHYVAYLYDESKVVKITKSA 270 (275) T ss_pred eeecCCcccccccc--hhhcCcEEEEeEEEEEEEEcCccEEEEEecc Confidence 77767766655543 3456678899999999999999999988544 No 116 >protein:vir:1239 Length: 274 # NCBI annotation: similar to phage B1 major head protein # Family: family:all:522 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510938;genbank:gi:17426272;genbank:GeneID:927376 Probab=99.82 E-value=2.6e-21 Score=133.47 Aligned_cols=259 Identities=13% Similarity=0.090 Sum_probs=198.7 Q ss_pred hhcccccccccccchhhHHHHHHHHHhhhhhhhhcceee--ec-CCceEEEEEecCCceeeeeccCcccccccccceeEE Q lcl|NC_016164. 561 VVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTML--TG-LQGPVAIPRQTGAATAYWVAEGGDPTESQPSVDQVA 637 (836) Q Consensus 561 ~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~--~~-~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~it 637 (836) +....+.-..+++|+.+...+.+.+.....+.+++.+-. .+ .+..+++|.....+.+..+.||..++..+++.+..+ T Consensus 1 ma~~~T~l~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~~~lt~~~~~ 80 (274) T protein:vir:12 1 MAQGLTKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKRE 80 (274) T ss_pred CCcceeehhhhhchHHHHHHHHHHHHhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCCCccchhhcccceee Confidence 333344567789999999999998888877777754321 22 245888998876677888999999999999999999 Q ss_pred eeeeeeeeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccccccccccchhHHHHHH Q lcl|NC_016164. 638 LVAKTLGAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTENFGATNPTYVELVS 717 (836) Q Consensus 638 ~~~~t~~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~aa~~~t~~~l~~ 717 (836) ..+++.+..+.++++....+..+....+.++++.++++.++..++.-..++ ..+..+..++++.+.+ T Consensus 81 ~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~q~~~~~a~~vd~~~l~~~~~a-------------~~~~~~~a~~~d~i~d 147 (274) T protein:vir:12 81 AKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGA-------------KLTVNADITKLNGLQS 147 (274) T ss_pred EEeeeecceeeecHHHHHhcccchHHHHHHHHHHHHHHHHHHHHHHHHhcc-------------cccccccccCHHHHHH Confidence 999999999999998877776778899999999999999999888654321 1112334568999999 Q ss_pred HHHHHhhhccccCccEEEecHHHHHHHHHHh--h----cc-CccccccCCCCeecceeeEeeCccccceEEEEehhceEE Q lcl|NC_016164. 718 MESKVAADNADIGAMSYLTNSTLYGGFKTTE--K----AT-STAQFVLEPGGTVNGYNVVRSNQVANGDVFFGVWNQMIM 790 (836) Q Consensus 718 a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lk--d----~~-g~~~~~~~~~~~l~G~pVv~s~~~~~~~i~~gD~s~~~i 790 (836) +...|..++ ....+++|||..+..|++.. + .+ |.....++.-++++|++|++++.+|.++.|+.....+.+ T Consensus 148 A~~~lgd~~--~~~~~ivv~p~~~~~L~k~~~~~fv~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~p~~t~~l~~~gA~~~ 225 (274) T protein:vir:12 148 AIDKFNDED--LEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRSNKLEAGTAILAKKGAVKL 225 (274) T ss_pred HHHHhcccc--ccccEEEeCHHHHHHHHhhhhhhccccccccccceecccceeecCeeEEEeCCCCcceEEEEeccceee Confidence 999886654 24568999999999887632 1 11 222334455578999999999999998877666666666 Q ss_pred EeecceEEEEecccccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 791 GMWGALDIQVNPYALDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 791 ~~~~~l~i~~~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) +...++.++.++.. ......+++..++++++++|+.+++++.+- T Consensus 226 ~~~~~~~vE~~Rd~--~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~ 269 (274) T protein:vir:12 226 ILKRDFFLEVARDA--STKTTALYSDKHYVAYLYDESKAVKITKGS 269 (274) T ss_pred eecCCceeccccch--hhcccEEEeeeEEEEEEEcCCceEEEEcCC Confidence 66677776655543 345678999999999999999999999888 No 117 >protein:vir:95898 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240385;genbank:gi:66396054;genbank:GeneID:5133409 Probab=99.81 E-value=5.6e-21 Score=131.59 Aligned_cols=259 Identities=12% Similarity=0.082 Sum_probs=198.3 Q ss_pred hhcccccccccccchhhHHHHHHHHHhhhhhhhhccee--eec-CCceEEEEEecCCceeeeeccCcccccccccceeEE Q lcl|NC_016164. 561 VVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTM--LTG-LQGPVAIPRQTGAATAYWVAEGGDPTESQPSVDQVA 637 (836) Q Consensus 561 ~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~--~~~-~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~it 637 (836) +....+.-..+++|+.+...+.+.+.....+.+++..- +.+ .+..+++|.....+.+..+.||..++..+++.+..+ T Consensus 1 m~~~~T~l~d~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~~~lt~~~~~ 80 (274) T protein:vir:95 1 MAQGMTKLTNQIVPEVLAPMMQAELEKKLRFASFAEIDNTLVGQPGDTLTFPAFIYSGDAKVVAEGEKIPTDILETKKRE 80 (274) T ss_pred CCcceeehhheechHHHHHHHHHHHHhhhhccccceecccccCCCCCEEEeeeecCCCccccccCCCccchhhcccceeE Confidence 33334456778899999999999988887777774322 222 245889998877677888999999999999999999 Q ss_pred eeeeeeeeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccccccccccchhHHHHHH Q lcl|NC_016164. 638 LVAKTLGAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTENFGATNPTYVELVS 717 (836) Q Consensus 638 ~~~~t~~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~aa~~~t~~~l~~ 717 (836) +.+++++..+.++++....+..++...+.++++.++++.++..++.-..+.. .+..+.+++++.+.+ T Consensus 81 ~~i~~~~~a~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~-------------~~~~~~~~~~d~i~~ 147 (274) T protein:vir:95 81 AKIRKIAKGTSISDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAK-------------LTVEADITKLTGLQT 147 (274) T ss_pred EEeeeeecceeehHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccc-------------ccccccccCHHHHHH Confidence 9999999999999998887777889999999999999999998886443211 112234567999999 Q ss_pred HHHHHhhhccccCccEEEecHHHHHHHHHHh--h----cc-CccccccCCCCeecceeeEeeCccccceEEEEehhceEE Q lcl|NC_016164. 718 MESKVAADNADIGAMSYLTNSTLYGGFKTTE--K----AT-STAQFVLEPGGTVNGYNVVRSNQVANGDVFFGVWNQMIM 790 (836) Q Consensus 718 a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lk--d----~~-g~~~~~~~~~~~l~G~pVv~s~~~~~~~i~~gD~s~~~i 790 (836) +...|...+ ....+++|||..+..|++.. + .+ |.....++.-++++|++|++|+.+|.++.|+.....+.+ T Consensus 148 A~~~lgd~~--~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~~~~t~~l~~~gA~~~ 225 (274) T protein:vir:95 148 AIDKFNDED--LEPMVLFISPLDAGKLRGDATTNFTRATELGDDVIVKGAFGEALGAVIVRSNKLEAGTAILAKKGAVKL 225 (274) T ss_pred HHHHhcccc--ccccEEEeCHHHHHHHHhhccccccccccccccceeccccceecCeEEEEeCCCCCceEEEEeccceee Confidence 999886554 24568999999999987642 1 11 223334555678999999999999998877666666666 Q ss_pred EeecceEEEEecccccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 791 GMWGALDIQVNPYALDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 791 ~~~~~l~i~~~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) +...++.++.++ +..+....+++..++++++++|+++++++.+= T Consensus 226 ~~~~~~~vE~~R--d~~~~~d~i~~~~~y~~~~~~~~~~v~~tk~~ 269 (274) T protein:vir:95 226 ITKRDFFLETDR--DPSTKTTALYSDKHYVAYLYDESKAVKITKGS 269 (274) T ss_pred eecCCccccccc--ccccccCEEEEeEEEEEEEEcCCcEEEEEcCC Confidence 666776665544 44557788999999999999999999999776 No 118 >protein:vir:96262 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240311;genbank:gi:66395978;genbank:GeneID:5133339 Probab=99.81 E-value=5.6e-21 Score=131.59 Aligned_cols=259 Identities=12% Similarity=0.082 Sum_probs=198.3 Q ss_pred hhcccccccccccchhhHHHHHHHHHhhhhhhhhccee--eec-CCceEEEEEecCCceeeeeccCcccccccccceeEE Q lcl|NC_016164. 561 VVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTM--LTG-LQGPVAIPRQTGAATAYWVAEGGDPTESQPSVDQVA 637 (836) Q Consensus 561 ~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~--~~~-~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~it 637 (836) +....+.-..+++|+.+...+.+.+.....+.+++..- +.+ .+..+++|.....+.+..+.||..++..+++.+..+ T Consensus 1 m~~~~T~l~d~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~~~lt~~~~~ 80 (274) T protein:vir:96 1 MAQGMTKLTNQIVPEVLAPMMQAELEKKLRFASFAEIDNTLVGQPGDTLTFPAFIYSGDAKVVAEGEKIPTDILETKKRE 80 (274) T ss_pred CCcceeehhheechHHHHHHHHHHHHhhhhccccceecccccCCCCCEEEeeeecCCCccccccCCCccchhhcccceeE Confidence 33334456778899999999999988887777774322 222 245889998877677888999999999999999999 Q ss_pred eeeeeeeeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccccccccccchhHHHHHH Q lcl|NC_016164. 638 LVAKTLGAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTENFGATNPTYVELVS 717 (836) Q Consensus 638 ~~~~t~~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~aa~~~t~~~l~~ 717 (836) +.+++++..+.++++....+..++...+.++++.++++.++..++.-..+.. .+..+.+++++.+.+ T Consensus 81 ~~i~~~~~a~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~-------------~~~~~~~~~~d~i~~ 147 (274) T protein:vir:96 81 AKIRKIAKGTSISDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAK-------------LTVEADITKLTGLQT 147 (274) T ss_pred EEeeeeecceeehHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccc-------------ccccccccCHHHHHH Confidence 9999999999999998887777889999999999999999998886443211 112234567999999 Q ss_pred HHHHHhhhccccCccEEEecHHHHHHHHHHh--h----cc-CccccccCCCCeecceeeEeeCccccceEEEEehhceEE Q lcl|NC_016164. 718 MESKVAADNADIGAMSYLTNSTLYGGFKTTE--K----AT-STAQFVLEPGGTVNGYNVVRSNQVANGDVFFGVWNQMIM 790 (836) Q Consensus 718 a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lk--d----~~-g~~~~~~~~~~~l~G~pVv~s~~~~~~~i~~gD~s~~~i 790 (836) +...|...+ ....+++|||..+..|++.. + .+ |.....++.-++++|++|++|+.+|.++.|+.....+.+ T Consensus 148 A~~~lgd~~--~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~~~~t~~l~~~gA~~~ 225 (274) T protein:vir:96 148 AIDKFNDED--LEPMVLFISPLDAGKLRGDATTNFTRATELGDDVIVKGAFGEALGAVIVRSNKLEAGTAILAKKGAVKL 225 (274) T ss_pred HHHHhcccc--ccccEEEeCHHHHHHHHhhccccccccccccccceeccccceecCeEEEEeCCCCCceEEEEeccceee Confidence 999886554 24568999999999987642 1 11 223334555678999999999999998877666666666 Q ss_pred EeecceEEEEecccccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 791 GMWGALDIQVNPYALDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 791 ~~~~~l~i~~~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) +...++.++.++ +..+....+++..++++++++|+++++++.+= T Consensus 226 ~~~~~~~vE~~R--d~~~~~d~i~~~~~y~~~~~~~~~~v~~tk~~ 269 (274) T protein:vir:96 226 ITKRDFFLETDR--DPSTKTTALYSDKHYVAYLYDESKAVKITKGS 269 (274) T ss_pred eecCCccccccc--ccccccCEEEEeEEEEEEEEcCCcEEEEEcCC Confidence 666776665544 44557788999999999999999999999776 No 119 >protein:vir:79928 Length: 393 # NCBI annotation: major head protein # Family: family:all:30335 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429616;genbank:gi:156564106;genbank:GeneID:5525693 Probab=99.80 E-value=9.1e-21 Score=130.44 Aligned_cols=342 Identities=15% Similarity=0.185 Sum_probs=203.5 Q ss_pred hhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhHHHHHHHHhhhhhhHHHHhhhhhhhhhhhHHHHHHHhhhhhhhhhhhh Q lcl|NC_016164. 442 ADDLAQGLIESGASEADAMRSVLSEIAKRPAAQPATPAAPVRSAQPIAAGGGSADIGLTDKEARSFSFVRAIRAQMMPGD 521 (836) Q Consensus 442 l~e~a~eliee~~t~~e~~~~~l~~l~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~a~~~~~~ 521 (836) ++....++.+.+-++.+. ..++.+..+....... .+......... T Consensus 1 ~~~~~~~~~~~~~~~~~~--~e~k~lr~~me~~et~------------~e~~~~~~~~~--------------------- 45 (393) T protein:vir:79 1 MENWLKQLKESGFTETQV--QEQKSLRTRMERGETL------------AEADANKLALN--------------------- 45 (393) T ss_pred CchHHHHHHhccCchhHH--HHHHHHHHHhhhhhhh------------hhhhhhhhhcc--------------------- Confidence 222222222222221111 1111111111100000 00000000000 Q ss_pred hhhhhhhhhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeec Q lcl|NC_016164. 522 RAAFEAAAFEREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTG 601 (836) Q Consensus 522 ~~~~~~~~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~ 601 (836) ....+....+++. +.|..-......+.. -++..+..++|..+++.+.+...+-+...++...+... T Consensus 46 -------~~e~el~E~f~Km----m~G~~p~~eV~~~e~---mtt~~a~IliP~vis~v~~Eaaepl~~~~kl~qk~~L~ 111 (393) T protein:vir:79 46 -------EEETQILESFAKM----MEGETPTNEVNLREF---MATPSAQILIPRVIVGTMREAAEPLYIGTKMLQKIRLK 111 (393) T ss_pred -------hhHHHHHHHHHHH----hcCCCchhheehhhh---hcCCCcceechhhhhhhhhhcccchhHHHHHHHHHhhh Confidence 0011111222221 122211111111111 23356788999999999998777777766665554443 Q ss_pred CCceEEEEEecCCceeeeeccCccccccc---ccceeEEeeeeeeeeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHH Q lcl|NC_016164. 602 LQGPVAIPRQTGAATAYWVAEGGDPTESQ---PSVDQVALVAKTLGAYTEFSRRLMLQSSIDVEQMVRTELATVIALEID 678 (836) Q Consensus 602 ~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~---~~~~~it~~~~t~~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d 678 (836) .+....++.. +.-.++-++||+++++.+ .+++.++++.+++|..+.+|+||+.||..++.++....+++++++..+ T Consensus 112 ~Grsm~F~~~-g~~Ra~~IgEGgE~~~~sld~~T~dsv~~~~gK~G~~Ia~SqEmIsDSg~Dvin~~l~aA~RaMaRkKe 190 (393) T protein:vir:79 112 SGQSMIFPSI-GIMRAYDVAEGQEIPEDSIDWQTHESPEIRVGKSGIRLRFTDEMISDSQWDLMSMMIKQAGRAMGRHKE 190 (393) T ss_pred cCcceeccch-heeeeccccccccccccchhhhcCCceeEEechhhhhhhhHHHHhhcchHHHHHHHHHHHHHHHHhhhH Confidence 4444555444 355678899999999865 468999999999999999999999999999999999999999999999 Q ss_pred HHHHhhcCCccc--ccccccc-----cccccccccccchhHHHHHHHHHHHhhhccccCccEEEecHHHHHHHHHHhhcc Q lcl|NC_016164. 679 RAALYGLGSNSQ--PEGLKFV-----TGINTENFGATNPTYVELVSMESKVAADNADIGAMSYLTNSTLYGGFKTTEKAT 751 (836) Q Consensus 679 ~~il~G~Gt~~~--p~Gi~~~-----~~~~~~t~aa~~~t~~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~~ 751 (836) ..+|++.-++++ ..++.+. .|-...+-..++++.+||.++..++.... ..+.+++|||-.|+.+.+-.--. T Consensus 191 e~a~n~fk~~ghtvfDa~st~t~ahptGr~~~~~qNGTlSleDllDm~~av~~~h--yt~svi~MHPLAWnv~AKna~me 268 (393) T protein:vir:79 191 QKAYHQFRSHGHTVFDNYSTNKLAHTTGLDKNGVQNDTFSAEDFLDLIIAVMANE--YTPSDLMMHPLAWTVFAKNELMG 268 (393) T ss_pred HHHHhhhhcccceeeeccccCccceeecCCccccccccccHHHHHHHHHHHhccc--CCcceEEEcCchhhhhhhhhhhc Confidence 999999877766 4554332 22222234457789999999999887664 46789999999998876543222 Q ss_pred Ccc-----cccc--CC------CCee-----cceeeEeeCccccc------eEEEEehhceEEEeecceEEEEecccccc Q lcl|NC_016164. 752 STA-----QFVL--EP------GGTV-----NGYNVVRSNQVANG------DVFFGVWNQMIMGMWGALDIQVNPYALDK 807 (836) Q Consensus 752 g~~-----~~~~--~~------~~~l-----~G~pVv~s~~~~~~------~i~~gD~s~~~i~~~~~l~i~~~~~~~~~ 807 (836) +.+ .|.. -+ |..| +.+.|++|+.+|-. +++..|-+..-+....+ +|.++...+-. T Consensus 269 ~~~~na~gN~~~~~~~ts~algp~~i~~~~~~nlnv~~sPfvp~d~k~~rFd~~~Vd~NnvgvlLV~D-~i~tdq~ddk~ 347 (393) T protein:vir:79 269 SLQANPYGNYPAKGAPSSMALGPDSIQGRLPFNFNVNLSPFIPLDKKSRRFDVYAVDRNNVGVLLVRD-DLKTDQWDEKA 347 (393) T ss_pred ceeeccccccCccccchhhhhchhhhccccccceeEEEecccccccccceeeEEEeecCCceEEEEec-Ccceecccccc Confidence 211 1110 01 1112 23689999999843 24555555443322222 45566666667 Q ss_pred cCcEEEEEEEEeccEEEcc-cceEEEeecC Q lcl|NC_016164. 808 SGSVRVTALQDVDVAVRHP-EAFCRGNDNL 836 (836) Q Consensus 808 ~~~~~~r~~~r~d~~v~~p-~Af~~l~~A~ 836 (836) .|.+.++...|+|++|++. +|+++.++=- T Consensus 348 rdiq~iKl~ERYG~gvLn~gkaiavakNI~ 377 (393) T protein:vir:79 348 RGLQNIKMIERYGIGILNEGKAIAVAKNIS 377 (393) T ss_pred ccceeeeeeeeeceeeeeCCceEEEEecce Confidence 7889999999999999997 6777666433 No 120 >protein:vir:94933 Length: 330 # NCBI annotation: putative phage structural protein # Family: family:all:1120 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239278;genbank:gi:66392060;genbank:GeneID:5076578 Probab=99.79 E-value=1.1e-20 Score=129.93 Aligned_cols=294 Identities=14% Similarity=0.091 Sum_probs=203.6 Q ss_pred hhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhh--hhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhh Q lcl|NC_016164. 513 IRAQMMPGDRAAFEAAAFEREVSEATAQRMGVTPRGI--LAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLA 590 (836) Q Consensus 513 ~~a~~~~~~~~~~~~~~~~~~~a~~~~~~~g~~~~g~--~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~ 590 (836) +.....| ..++. .....+-+.+..+.+. .....+.|......+++.+.+.+. T Consensus 1 ~~~~~~~-------------------------~~~~~~~~~~~~~p~l~m~alTL-aea~~l~~d~~~~~VIE~l~~~s~ 54 (330) T protein:vir:94 1 MVRICTP-------------------------PLRGRWRTLTHQFPELKMPTVTL-AESAKLSQDHLVSGLIETIVEVNP 54 (330) T ss_pred CceecCC-------------------------ccccceeehhccccccchhhhhh-hHHhhcCchhhHHHHHHhhhccch Confidence 0000000 00000 0011111122222222 223455678888899999999988 Q ss_pred hhhhcceeeecCCceEEEEEecCCceeeeeccCccccccc-ccceeEEeeeeeeeeeehhHHHH--HhcchhHHHHHHHH Q lcl|NC_016164. 591 LNTLGVTMLTGLQGPVAIPRQTGAATAYWVAEGGDPTESQ-PSVDQVALVAKTLGAYTEFSRRL--MLQSSIDVEQMVRT 667 (836) Q Consensus 591 l~~l~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~-~~~~~it~~~~t~~~~i~ISrel--L~ds~~~l~~~i~~ 667 (836) ++++.. +....++.+.+.+.+.-+.+.|...++.++.+. .+|.+++..++.+++.+.|.+.+ |.++..+...+-.+ T Consensus 55 iL~~lp-f~~ve~~~~~~~r~~~lp~a~~r~~n~~~~~~~~~Tf~q~t~~l~~l~~~~~Vd~~iadl~g~~~d~~~~q~~ 133 (330) T protein:vir:94 55 LYEMMP-FTEIEGNALAYNRENVLGDVQFLAVGGTITAKNPATFTKVTSELTTLIGDAEVNGLIQATRSDFMDQTSVQVA 133 (330) T ss_pred HHhhcc-cccccCCcceeeeeecCCcceeeeccccccccCcceeeeeeechhhhhhhHHHHHHHHHhcCCHHHHHHHHHH Confidence 887644 223345567888888889999999888888765 57999999999999999999998 45556678888889 Q ss_pred HHHHHHHHHHHHHHHhhcCCcccccccccc-cccccccc--cccchhHHHHHHHHHHHhhhccccCccEEEecHHHHHHH Q lcl|NC_016164. 668 ELATVIALEIDRAALYGLGSNSQPEGLKFV-TGINTENF--GATNPTYVELVSMESKVAADNADIGAMSYLTNSTLYGGF 744 (836) Q Consensus 668 ~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~-~~~~~~t~--aa~~~t~~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L 744 (836) ...++++++.+.++|||+.+++++.||++. .+.+.+.+ .++.+|.+++-.++..+.... ..+.+|+||++...+| T Consensus 134 ~~ieal~~~~e~~linGDs~~~~F~GL~~~~~~~q~i~tg~~gg~~T~d~LDeLl~~v~~~~--g~~~~~l~n~a~~r~I 211 (330) T protein:vir:94 134 SKAKSIGRQYQASMITGDGTGNSFQGMMGLVAASQTISAGANGGTLTFELLDQLLDLVKDKD--GQVDYLMSSFAMRRKY 211 (330) T ss_pred HHHHHHHHHHHHHhhccCCCCccccchhhcCCcccEEecCCCCCCCCHHHHHHHHHHhcCCC--CCCcEEEechhHHHHH Confidence 999999999999999999888888999864 34444443 346789999999988875432 2467899999999999 Q ss_pred HHHhhccCcccccc----CCCC---eecceeeEeeCccccc----------eEEEEehh-----ceEEEee----cceEE Q lcl|NC_016164. 745 KTTEKATSTAQFVL----EPGG---TVNGYNVVRSNQVANG----------DVFFGVWN-----QMIMGMW----GALDI 798 (836) Q Consensus 745 ~~lkd~~g~~~~~~----~~~~---~l~G~pVv~s~~~~~~----------~i~~gD~s-----~~~i~~~----~~l~i 798 (836) +.+....|++.... .-+. ++.|+|++.++.+|.+ .||+..|. +..++.. .|+.+ T Consensus 212 ~a~~R~~~~~~v~~~~~~~~G~~v~~~~GvPi~~~d~ip~~~~~~~~~~ttsIyav~~G~~~~~qgV~Gl~~~g~~glsV 291 (330) T protein:vir:94 212 FSLLRALGGAAIGEVMTLPSGRQIPTYRGVPWFVNDFIPSNMTQGTATNATAIFAGTFDDGSNKYGIAGLTARGSAGLRV 291 (330) T ss_pred HHHHHhccCCCCCCcccccCCCEEeeeCCeEEEecccccCCCCcccCCCceeEEEEeecccccccceEeecCCCCCccee Confidence 99988777654322 1122 4679999999888753 46665553 3444443 24444 Q ss_pred EEecccccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 799 QVNPYALDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 799 ~~~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) .+....-+++...+++.+++++++..|.|+.+|++=- T Consensus 292 -r~~G~~~~k~v~~~~v~~y~~~av~~~~a~~~L~~V~ 328 (330) T protein:vir:94 292 -QNVGAKENADETITRVKMYCGFANFSQLGLAAIKGLI 328 (330) T ss_pred -eeCCCccccceeeEEEEEeeeeEEechhheeeecccc Confidence 2222223567789999999999999999999998766 No 121 >protein:vir:8324 Length: 410 # NCBI annotation: gp41 # Family: family:all:30827 # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817892;genbank:gi:29566325;genbank:GeneID:1259520 Probab=99.77 E-value=3.1e-21 Score=133.02 Aligned_cols=383 Identities=16% Similarity=0.091 Sum_probs=202.2 Q ss_pred eeeEeeccccccCCCC-eEE-EEEEEEEEEEEEeccCccchhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhh Q lcl|NC_016164. 339 FMYSIDAPLDLTSREG-MAL-VTAFTPMEVSAVSIPADHTVGQGRKATSSSGPPGAAAATVAPLSHNDNNHMDSSTIDME 416 (836) Q Consensus 339 iG~~v~~~~~~~~~~~-~~~-~~~~~l~EiS~V~~pA~~~a~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e 416 (836) +|- .+...++ ++| ++...+.|+|+|++|||.+|.+........+... +.. ..++..... T Consensus 1 ~~n------~t~a~d~~~RR~~~~L~~~EvSvv~~PAY~nA~vt~vRe~e~~~~~----------e~~-~~~e~~en~-- 61 (410) T protein:vir:83 1 MGN------ATTASDEYIRRLENELREKESLVRGIYDRANASNRDVNEEEGQMVA----------ECR-GRMEQIKNQ-- 61 (410) T ss_pred CCC------cccchhhHHHHHHHHhhhhheeeeccccccccccccchhhhccccc----------ccc-Ccccchhhh-- Confidence 221 1112222 222 3335567999999999999988532211000000 000 000000000 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhHHHHHHHHhhhhhhHHHHhhhhh Q lcl|NC_016164. 417 AVRAQAAADERSRVASITSLCREHKADDLAQGLIESGASEADAMRSVLSEIAKRPAAQPATPAAPVRSAQPIAAGGGSAD 496 (836) Q Consensus 417 ~~~~~~~~~~~~~~~ei~al~~~~~l~e~a~eliee~~t~~e~~~~~l~~l~~~~~a~~~~~~~~~~~~~~~~~~~~~~~ 496 (836) .+.+.+ -..+..+.+....++....... T Consensus 62 --------------------------~e~~~~----~~~~~~E~Rs~~~~i~~~~~~~---------------------- 89 (410) T protein:vir:83 62 --------------------------MEQAQE----VNRIAFETRSKGQAVDAAISAM---------------------- 89 (410) T ss_pred --------------------------hHHHHH----HHHHHHHHHHHHHHHHhhhccC---------------------- Confidence 000000 0000000011111110000000 Q ss_pred hhhhhhHHHHHHHhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhhhh---hhhhhhhhhccccccccccc Q lcl|NC_016164. 497 IGLTDKEARSFSFVRAIRAQMMPGDRAAFEAAAFEREVSEATAQRMGVTPRGILAPN---DVLHRDLVVDTASAAGDLVF 573 (836) Q Consensus 497 ~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~~~~~a~~~~~~~g~~~~g~~~~~---~~~~~a~~~~~~~~~g~~vv 573 (836) + ..+.....+++ .+.++++.+-....|..... ....++.....+.+..+ ++ T Consensus 90 -----------------r--~~p~~~~veyR------SaGE~lkal~~~~~Gd~~A~~~~e~~r~a~~~~~Tgd~~~-~i 143 (410) T protein:vir:83 90 -----------------R--GSPVGTEVEYR------SAGEYMLDMWNSAQGNASAADRLEVYARAADHQKTGDLQG-VI 143 (410) T ss_pred -----------------c--CCCCCCCcccc------cHHHHHHHHhccCCchHHHHHHHHHHHHhhccCccccccc-cc Confidence 0 00000000000 11111111111111111111 11112222333333333 34 Q ss_pred chhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEecCCceee-------eeccCcccccccccceeEEeeeeeeeee Q lcl|NC_016164. 574 TDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQTGAATAY-------WVAEGGDPTESQPSVDQVALVAKTLGAY 646 (836) Q Consensus 574 p~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~~~~~a~-------~v~Eg~~~~~~~~~~~~it~~~~t~~~~ 646 (836) |+.+.+..++++.+..++..+..+ +|..+..+.++..+..+... .-.||+..+.++++|+..+..+++||++ T Consensus 144 ~~~~v~d~i~li~q~r~i~slf~t-LP~~g~T~eY~v~t~~~tV~~q~~~~kqa~EGd~L~~gKl~~~t~tA~ikTyGGy 222 (410) T protein:vir:83 144 PDPIVGPVIDFIDSARPLVSTLGT-LPLNNATFYRPIVSQRPAVGLQGVAGGASDEKTELDSQKMVIDRLTVNAKTLGGY 222 (410) T ss_pred chhHhhhHHHHHhhccchhhhhhh-CCCCCCeeEEeeecccccccccccccccccccccccccceeeeeccceeehhcCc Confidence 455777788999998888887665 66666778887776655442 2459999999999999999999999999 Q ss_pred ehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHH---HHhhcCCcccccccccccccccccccccchhHHHHHHHHHHHh Q lcl|NC_016164. 647 TEFSRRLMLQSSIDVEQMVRTELATVIALEIDRA---ALYGLGSNSQPEGLKFVTGINTENFGATNPTYVELVSMESKVA 723 (836) Q Consensus 647 i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~---il~G~Gt~~~p~Gi~~~~~~~~~t~aa~~~t~~~l~~a~~~l~ 723 (836) ..+|||.|+.|..++.+...+.|..+++..-+.+ +|+.+-+. .. +....++. . -..-|.++..++. T Consensus 223 t~LSRQ~IERs~v~~L~~~lraL~~AYA~atea~vra~L~~t~t~-----~~---a~~~~Tad--~-~~~~i~da~~~v~ 291 (410) T protein:vir:83 223 VNVSRQAIDFSSPSALDLVVNGLGQQYAIETEALVGAALASTSTG-----AV---GYGNATAD--N-VASAIWQAAGAVY 291 (410) T ss_pred ccccceeeecCChhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh-----hh---hhhhccHH--H-HHHHHHHHHHHHh Confidence 9999999999999999999999999999887764 45443221 10 11111110 0 0011222333333 Q ss_pred hhccccCccEEEecHHHHHHHHHH-hhccCccc----c-c----cCCCCeecceeeEeeCccccceEEEEehhceEEEee Q lcl|NC_016164. 724 ADNADIGAMSYLTNSTLYGGFKTT-EKATSTAQ----F-V----LEPGGTVNGYNVVRSNQVANGDVFFGVWNQMIMGMW 793 (836) Q Consensus 724 ~~~~~~~~~~~vmnp~~~~~L~~l-kd~~g~~~----~-~----~~~~~~l~G~pVv~s~~~~~~~i~~gD~s~~~i~~~ 793 (836) .+-.+.+-..+.++|+.+..+..+ ++.+|... | + .+-.|.+++.||++.+..++++++|-|...+..+.. T Consensus 292 da~~~~~~~~i~vS~DVl~~~~~~f~~~~~~~~dt~Gfg~~~lg~gi~G~~~~ipVvm~~~a~AgTA~f~~~~Ai~~~eS 371 (410) T protein:vir:83 292 TAVKGMGRLVIAIAPDVLGDFGPLFAPVNPTNAHSTGFEAGRFGQGVMGSISGIPVVMSAALGSGDAYLFSTAAIECFEQ 371 (410) T ss_pred hhhccceeeeEEechhhhhhccceeeccCCCCcccccccccccccchhhhhcccceEEecCCCcCeeeEeccceeeeeec Confidence 332244556789999998666543 33333211 1 1 223467899999999999999999999999988887 Q ss_pred cc--eEEEEecccccccCcEEEEEEEEeccEEEcccceEEEeec Q lcl|NC_016164. 794 GA--LDIQVNPYALDKSGSVRVTALQDVDVAVRHPEAFCRGNDN 835 (836) Q Consensus 794 ~~--l~i~~~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A 835 (836) ++ +.+.-.+-+... ..|- .+|.+++..+.+++-+.-. T Consensus 372 ~~gp~qL~d~~i~nLt---~~yS--gY~a~a~~~~~gliPv~g~ 410 (410) T protein:vir:83 372 RVGTLQVVEPSVFGLQ---VAYA--GYFSTLVVNEDAIVPLVGS 410 (410) T ss_pred CCceeEeeCCchhhhh---hhhe--eeeeeccccccceeeeccC Confidence 74 444433322222 2333 6778889999999887777 No 122 >protein:vir:95107 Length: 270 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240822;genbank:gi:66394683;genbank:GeneID:5133901 Probab=99.68 E-value=1e-17 Score=113.79 Aligned_cols=256 Identities=11% Similarity=0.072 Sum_probs=189.9 Q ss_pred hhcccccccccccchhhHHHHHHHHHhhhhhhhhccee--eec-CCceEEEEEecCCceeeeeccCcccccccccceeEE Q lcl|NC_016164. 561 VVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTM--LTG-LQGPVAIPRQTGAATAYWVAEGGDPTESQPSVDQVA 637 (836) Q Consensus 561 ~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~--~~~-~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~it 637 (836) ++ .+....+++|+.+...+.+.+.....+.+++..- +.+ .+..+++|.....+.+..+.||..++..+++.++.. T Consensus 1 Ma--~T~~~d~I~Pev~~~~V~e~~~~~~~~~~~~~~d~~L~g~~G~ti~~P~~~~igdae~~~eg~~i~~~~lt~~~~~ 78 (270) T protein:vir:95 1 MT--QTKKANLINPEVLANVVSAQMQNAIRFTPYAVTDDTLVGQPGDTITRPKYAYIGAAEDLQEGVAMDTTQMSMTTTK 78 (270) T ss_pred CC--ceehhhhcchHHHHHHHHHHHHhHHhhccccccccccCCCCCCEEEeeeecCCCccccccCCCccchhhcccchhe Confidence 12 1234567899999999999988888888876532 222 345889999887778888999999999999999999 Q ss_pred eeeeeeeeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccccccccccchhHHHHHH Q lcl|NC_016164. 638 LVAKTLGAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTENFGATNPTYVELVS 717 (836) Q Consensus 638 ~~~~t~~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~aa~~~t~~~l~~ 717 (836) ..+++++..+.++.+....+..+....+.++++.+++++++..++.-.. |.... ....+++++|.+ T Consensus 79 a~i~~~gk~~~itD~a~~~~~~dp~~~~~~q~a~~~a~~~d~~li~~l~------~a~~~--------~~~~~t~~~~~d 144 (270) T protein:vir:95 79 VTVKETGKAVEVTQTAIITNVNGTLQEASRQLAMSLADKVEIDYIAELN------KSKQT--------ATVSADATGILD 144 (270) T ss_pred eeeehhhCcceecHHHHhhhccchHHHHHHHHHHHHHHHHHHHHHHHhc------ccccc--------cccccCHHHHHH Confidence 9999999999999998776555667889999999999999998874321 11100 123467889999 Q ss_pred HHHHHhhhccccCccEEEecHHHHHHHHHHhhc----cCccccccCCCCeecceeeEeeCccc-cceEEEEehhceEEEe Q lcl|NC_016164. 718 MESKVAADNADIGAMSYLTNSTLYGGFKTTEKA----TSTAQFVLEPGGTVNGYNVVRSNQVA-NGDVFFGVWNQMIMGM 792 (836) Q Consensus 718 a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~----~g~~~~~~~~~~~l~G~pVv~s~~~~-~~~i~~gD~s~~~i~~ 792 (836) ++..+.... ....+++|||.++..|++.... .+.....++.-++++|++|++++.++ .+..|+.....+.++. T Consensus 145 A~~~lgd~~--~~~~~i~vhs~~~~~Lrk~~~~~~~~~~~~~~~~G~ig~~~G~~Viv~s~~~~~~~~~l~~~gAi~~~~ 222 (270) T protein:vir:95 145 AIEVFNSEN--DEDYVLYVNPKDYNKLVKSLFKVGGNVQDRAISKGDLVEIVGVSDIVKSKRVSENTAFLQRYGAMEIVN 222 (270) T ss_pred HHHHhcccc--CCCcEEEEcHHHHHHHHhhhcccccccccchhcccccceecceeEEEeCCCCCceeEEEEeccceeeee Confidence 998886543 2356899999999998753311 12223344556789999998866554 5566666666677777 Q ss_pred ecceEEEEecccccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 793 WGALDIQVNPYALDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 793 ~~~l~i~~~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) ..++.++.++.. ......+.+..++.+++.+|..+++++.+= T Consensus 223 ~~~~~vEtdRd~--~~~~d~i~~~~~y~v~~~~~skvv~~t~~~ 264 (270) T protein:vir:95 223 KKKPEAYTDFDI--LKRTHLLSTNYHYSVNLKDETGVVKVTFKP 264 (270) T ss_pred cCCceeeeccch--hhcccEEEeeeEEEEEEEccceEEEEEecC Confidence 777776665543 446678889999999999999999988655 No 123 >protein:vir:108211 Length: 318 # NCBI annotation: gp9 # Family: family:all:6420 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552338;genbank:gi:160700658;genbank:GeneID:5758931 Probab=99.63 E-value=2.7e-17 Score=111.38 Aligned_cols=275 Identities=16% Similarity=0.061 Sum_probs=168.4 Q ss_pred hhhhhhhhhhhhhhccccccccc-----c-cchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEecCC---ceeee Q lcl|NC_016164. 549 ILAPNDVLHRDLVVDTASAAGDL-----V-FTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQTGA---ATAYW 619 (836) Q Consensus 549 ~~~~~~~~~~a~~~~~~~~~g~~-----v-vp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~~~---~~a~~ 619 (836) +..+.. -.....++.+ + -|+.+...+.+++......-.|..+.-....+.+.+.+.... ....- T Consensus 1 ~~~~~~-------i~s~~~~~~itv~~ll~~P~~I~~~i~e~~~~~~iad~lf~~~~a~~~~~v~f~~~~p~~~~~d~e~ 73 (318) T protein:vir:10 1 MTAPTG-------IVSVSDGPAITVRELVGNPLWIPTALKKMMVNQFISESLFRNGGANPNGVVAYNEGNPSFLEDDVAD 73 (318) T ss_pred CCCCCc-------ceeeecCCceehHHhhCCchhHHHHHHHHHhccchhhhhhhcccccccceeEEEecccccccCcHhh Confidence 111100 0011111222 1 256666667777666555555545444445666666554432 46677 Q ss_pred eccCcccccccccceeEEe-eeeeeeeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCccccccccccc Q lcl|NC_016164. 620 VAEGGDPTESQPSVDQVAL-VAKTLGAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVT 698 (836) Q Consensus 620 v~Eg~~~~~~~~~~~~it~-~~~t~~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~ 698 (836) |.|++++|...++++...+ ..+|+|..+.||+|++.....++......++++++++..|+.++.-.-+...|. +- .+ T Consensus 74 VaEggEiP~~~~~~G~~~ia~~~K~G~~~~vS~Em~~~n~~~~v~r~~~~l~Nti~r~~d~~a~dal~sa~t~~-~~-~s 151 (318) T protein:vir:10 74 VAEFGEIPVSAGARGLPRTAFAVKKALGVRVSKEMIDENRVGAVNDQMLQLRNTFIRANDRSAKALLQSPIVPT-LA-VP 151 (318) T ss_pred ccCcccccccCCCCCchhhhhhehhccceeccHHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccc-cc-CC Confidence 8999999999998877665 557999999999999999999999999999999999999998775331111110 00 00 Q ss_pred ccccccccccchhHHHHHHHHH--------------HHhhhccccCccEEEecHHHHHHHH------HHhhccCccccc- Q lcl|NC_016164. 699 GINTENFGATNPTYVELVSMES--------------KVAADNADIGAMSYLTNSTLYGGFK------TTEKATSTAQFV- 757 (836) Q Consensus 699 ~~~~~t~aa~~~t~~~l~~a~~--------------~l~~~~~~~~~~~~vmnp~~~~~L~------~lkd~~g~~~~~- 757 (836) + ++........++..++. .....+-...+..++|||.+|..|. .+...++.+.+. T Consensus 152 ~----~w~~~~~~~~d~~~A~e~v~~a~~~~~~a~~~~~~~~~GY~pdtIVlhP~~~~~l~~n~~~~~~y~~~a~~~~~~ 227 (318) T protein:vir:10 152 T----AWDNGGKVRTDIAIAIEQISTAAPTAYPAGVGSSDEYFGFIPDTIVMHYALLPILMDNENFMKVYERNANYVSTA 227 (318) T ss_pred c----CCCCcccccccchhhhhhhhhhhhhhhhhhhhhhhhccCccceeeEECHHHHHHHhcchhhhhhhhccchhhhhc Confidence 0 00000000011111111 1111122334678999999999884 333334443331 Q ss_pred ----cCCCCeecceeeEeeCccccceEEEEehhce-EEEeecceEEEEec-c----cccccCcEEEEEEEEeccEEEccc Q lcl|NC_016164. 758 ----LEPGGTVNGYNVVRSNQVANGDVFFGVWNQM-IMGMWGALDIQVNP-Y----ALDKSGSVRVTALQDVDVAVRHPE 827 (836) Q Consensus 758 ----~~~~~~l~G~pVv~s~~~~~~~i~~gD~s~~-~i~~~~~l~i~~~~-~----~~~~~~~~~~r~~~r~d~~v~~p~ 827 (836) -.-+++++|+.|+.++.+|.+.+|+.+-..+ .+.+-.+++...-. + ..-.+..+.+++......+|.+|+ T Consensus 228 ~~~tg~~~g~~lGl~vi~s~~~p~~~alvlq~g~vG~~~d~~pl~~t~~~~egg~~~g~~~~s~~~~~~~~~~~~V~~Pk 307 (318) T protein:vir:10 228 PDWTGNFPGSVMGLNVIRSRTFPIDRVLIMERGTVGFYSDTRPLQFTALYPEGNGPNGGPTESYRADASHKRALAVDQPK 307 (318) T ss_pred ccccccccceeeceEEeecCccCCCeeEEEecCCcceeeccccceeeecccCCCCCCCCcchhhheehheeeeeeeeCcc Confidence 1225678999999999999999988875443 33444444432211 1 112345578899999999999999 Q ss_pred ceEEEeecC Q lcl|NC_016164. 828 AFCRGNDNL 836 (836) Q Consensus 828 Af~~l~~A~ 836 (836) |+++||.=+ T Consensus 308 A~~~itgi~ 316 (318) T protein:vir:10 308 AALWLTGIV 316 (318) T ss_pred eeEEEeecc Confidence 999999888 No 124 >protein:vir:739 Length: 231 # NCBI annotation: major structural protein 4 # Family: family:all:522 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108716;genbank:gi:13487838;genbank:GeneID:920884 Probab=99.62 E-value=2.8e-17 Score=111.33 Aligned_cols=220 Identities=14% Similarity=0.085 Sum_probs=169.5 Q ss_pred eeeecCCceEEEEEecCCceeeeeccCcccccccccceeEEeeeeeeeeeehhHHHHHhcchhHHHHHHHHHHHHHHHHH Q lcl|NC_016164. 597 TMLTGLQGPVAIPRQTGAATAYWVAEGGDPTESQPSVDQVALVAKTLGAYTEFSRRLMLQSSIDVEQMVRTELATVIALE 676 (836) Q Consensus 597 ~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~it~~~~t~~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~ 676 (836) .--...+..+++|.. ...+..+.||.+++...++.++.+.++++++..+.|+++.......+......++++.+++++ T Consensus 1 ~~~~~~Gdtit~P~~--iGda~~v~eG~~i~~~~l~~t~~~atIk~~gk~~~itD~a~l~~~gDp~~ea~~Q~~~~iA~k 78 (231) T protein:vir:73 1 ENGINLANLCEYPND--IGDAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLANK 78 (231) T ss_pred CccccCCceEEeccc--ccchhhhcCCCcCChhhccccceeeeEeeeccceeeeHHHHhhccCchHHHHHHHHHHHHHHh Confidence 111123457888865 457788999999999999999999999999999999999888766677899999999999999 Q ss_pred HHHHHHhhcCCcccccccccccccccccccccchhHHHHHHHHHHHhhhccccCccEEEecHHHHHHHHHHhhcc----- Q lcl|NC_016164. 677 IDRAALYGLGSNSQPEGLKFVTGINTENFGATNPTYVELVSMESKVAADNADIGAMSYLTNSTLYGGFKTTEKAT----- 751 (836) Q Consensus 677 ~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~aa~~~t~~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~~----- 751 (836) +|..++.-..+. ..+ .+..++++.|.++...+.... ....+++|||..+..|++..+.. T Consensus 79 vD~di~~~~~~a-------------~l~-~~~~~t~d~i~~A~~~fgde~--~~~~vivv~p~~~~~Lrk~~~~~~~~~~ 142 (231) T protein:vir:73 79 VDDDLLKAAKTT-------------SQT-VSTKANVDGVQAALDIFNDED--AQAYVLIVNPKDAAKIRKDANAKNIGSE 142 (231) T ss_pred hhHHHHHhhccc-------------ccc-ccccccHHHHHHHHHHhcccc--ccceEEEEcchHHHhhhhccchhhhhhh Confidence 999888532211 011 124578999999999987664 34568999999999998754432 Q ss_pred -CccccccCCCCeecceeeEeeCccccceEEEEe----hhceEEEeecceEEEEecccccccCcEEEEEEEEeccEEEcc Q lcl|NC_016164. 752 -STAQFVLEPGGTVNGYNVVRSNQVANGDVFFGV----WNQMIMGMWGALDIQVNPYALDKSGSVRVTALQDVDVAVRHP 826 (836) Q Consensus 752 -g~~~~~~~~~~~l~G~pVv~s~~~~~~~i~~gD----~s~~~i~~~~~l~i~~~~~~~~~~~~~~~r~~~r~d~~v~~p 826 (836) |...+..+.-++++|++|++|+.+|.+..++.. ...+.++...++.+..+++ .......+++...+.+++.+| T Consensus 143 ~g~~i~~~G~iG~i~G~~Vi~S~~~~~~~~~~~~~i~~~gAl~~~~k~~~~vEtdRd--~~~k~~~i~~~~~y~v~l~~~ 220 (231) T protein:vir:73 143 VGANALINGTYADVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRD--IVTKTTVITADEHYAAYLYDL 220 (231) T ss_pred hccceeeecccceEcceEEEEcCCCCCCceeeeeEEeeccceeeeecccceeecccc--ccccccEEEEeEEEEEEEEcC Confidence 334455666789999999999999998765433 3456667777777776654 455668899999999999999 Q ss_pred cceEEEeecC Q lcl|NC_016164. 827 EAFCRGNDNL 836 (836) Q Consensus 827 ~Af~~l~~A~ 836 (836) +.+|+++.+= T Consensus 221 ~~vv~~t~~g 230 (231) T protein:vir:73 221 TKVVNITFTG 230 (231) T ss_pred ccEEEEEeec Confidence 9999999776 No 125 >protein:vir:97255 Length: 310 # NCBI annotation: hypothetical protein ORF017 # Family: family:all:1120 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294525;genbank:gi:149408246;genbank:GeneID:5237120 Probab=99.62 E-value=2.2e-16 Score=106.45 Aligned_cols=272 Identities=11% Similarity=0.068 Sum_probs=176.0 Q ss_pred hhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEecCCceeeeecc-----Cccccccccccee Q lcl|NC_016164. 561 VVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQTGAATAYWVAE-----GGDPTESQPSVDQ 635 (836) Q Consensus 561 ~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~~~~~a~~v~E-----g~~~~~~~~~~~~ 635 (836) +...+..-...+.+..+...+++.+...+.+.++.. +.....+.+.+.+...-+.+.+.+- +...+.+..+|++ T Consensus 1 mpaltLaea~k~~~d~l~~~ViE~~~~~s~lL~~Lp-F~~veg~~~~ynR~~~~~~~~~~~v~~~~~~~g~~~~~~t~~~ 79 (310) T protein:vir:97 1 MASVTLAESAKLAQDELVAGVIENIITVNRMFDVLP-FDSIEGNSLAYNRENVLGDVIMAGVGTTFSGAGAGKAAATFTK 79 (310) T ss_pred CcccchHHHhhcCcchHHHHHHHHHhccchHHHhCC-cccccCCcceeeEeeccCCcccccccccccCCCccccccccce Confidence 000011111234456677788999988888776533 2222344566666654333333222 2334567788999 Q ss_pred EEeeeeeeeeeehhHHHHHh--cc-hhHHHHHHHHHHHHHHHHHHHHHHHhhcCCccccccccccc-ccccccc--cccc Q lcl|NC_016164. 636 VALVAKTLGAYTEFSRRLML--QS-SIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVT-GINTENF--GATN 709 (836) Q Consensus 636 it~~~~t~~~~i~ISrelL~--ds-~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~-~~~~~t~--aa~~ 709 (836) .++.++.+++.+.|.+.+.. ++ ..+....=.+...++++++.+..|+||+.+++.+.||++.. +...+.. .++. T Consensus 80 ~~~~L~i~~g~~~Vd~~i~dl~~~~~~dq~~~Ql~~~iea~~~~~e~~lINGD~a~n~F~GL~~~~~~~q~i~~~~~gg~ 159 (310) T protein:vir:97 80 VNSNLTTIMGDAEVNGLIQATRSGDGNDQTAVQIASKAKSAGRKYQDQLINGNGAGNEFAGLIQLCASGQKATTGATGSA 159 (310) T ss_pred eeeeeeeeeehhhhhhHHHhhhcCChHHHHHHHHHHHHHHHHHHHHHHhhccccCCCcccchhhcCCccceeecCCCCCC Confidence 99999999999999986432 23 33444444667779999999999999999877778998763 3344433 3467 Q ss_pred hhHHHHHHHHHHHhhhccccCccEEEecHHHHHHHHHHhhc-cCcccccc--CCC----CeecceeeEeeCccccc---- Q lcl|NC_016164. 710 PTYVELVSMESKVAADNADIGAMSYLTNSTLYGGFKTTEKA-TSTAQFVL--EPG----GTVNGYNVVRSNQVANG---- 778 (836) Q Consensus 710 ~t~~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~-~g~~~~~~--~~~----~~l~G~pVv~s~~~~~~---- 778 (836) +|.++|-.++..+... ...+..|+|||+++.+|+.+... ++++.|.. ... .++.|+|++.++.+|.+ T Consensus 160 ~t~d~LDeLl~~v~~~--~g~p~~~l~~~~~~r~i~A~~R~~~~~g~~~~~~~~~G~~v~~~~GiPi~~~d~ip~~~~~~ 237 (310) T protein:vir:97 160 ISFAILDELMDLVVDK--DGQVDYLTMHARTLRSYKALLRALGGASINEVVELPSGAEVPAYSGTPIFRNDYIPTNQTKG 237 (310) T ss_pred CCHHHHHHHHHHHhcC--CCCCCEEEecHHHHHHHHHHHHHhcCCCCCCccccCCCCEEeeeCCeEEEEeCccCCCcccc Confidence 8899999988887432 23467899999998888876543 33444422 111 25789999999988853 Q ss_pred ------eEEEEehh-----ceEEEee----cceEEEEecccccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 779 ------DVFFGVWN-----QMIMGMW----GALDIQVNPYALDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 779 ------~i~~gD~s-----~~~i~~~----~~l~i~~~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) .||+.-|. +-.++.. .++.+..-. .--+++...+++.+++++++..|.|+.+|.+=+ T Consensus 238 ~~~gtTsIya~r~Ge~~~~~Gv~Gl~~~~~~glsVr~~G-~~~~~~v~~~~V~~Y~~~av~~~~A~a~L~~V~ 309 (310) T protein:vir:97 238 GTTGCTTIFAGTLDDGSRTHGIAGLTATQAAGIQVVDVG-ESEDSDEHIWRVKWYCGLALFSEKGLACADGIT 309 (310) T ss_pred ccCCceeEEEEeeCccccccceeccccCCccceeEEeCC-cccCCcceeEEEEEeeeEEEecccceeeecccc Confidence 35554443 2223322 233332211 112557788999999999999999999999877 No 126 >protein:vir:7990 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817344;genbank:gi:29565772;genbank:GeneID:1258978 Probab=99.53 E-value=3.8e-15 Score=99.63 Aligned_cols=258 Identities=14% Similarity=0.037 Sum_probs=166.0 Q ss_pred hhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeec---CCceEEEEEecCCceeeeeccCcccccccccceeEE Q lcl|NC_016164. 561 VVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTG---LQGPVAIPRQTGAATAYWVAEGGDPTESQPSVDQVA 637 (836) Q Consensus 561 ~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~---~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~it 637 (836) +. -..++|+.+...+++.++....+.++..+.... .+..+++|+...........+++.++...++.+.++ T Consensus 1 MA------~~~~~pei~~~~v~~~~~~~lv~~~l~~~~~~~~~~~GdTv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~ 74 (273) T protein:vir:79 1 MA------FNNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVD 74 (273) T ss_pred Cc------chhhhHHHHHHHHHHHHHhhccchhhhhccccccccCCcEEEEeecCcccccccccCCCccCccccccceEE Confidence 11 122469999999999999998888775443222 234799998776666667788888888888888888 Q ss_pred eeeeee-eeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccccccccccchhHHHHH Q lcl|NC_016164. 638 LVAKTL-GAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTENFGATNPTYVELV 716 (836) Q Consensus 638 ~~~~t~-~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~aa~~~t~~~l~ 716 (836) +++.+. +.-+.|+..-...+..++.+ +.+.++.++++++|..++.-....... ....+.......++.|. T Consensus 75 ~tid~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~vD~~i~~~~~~a~~~--------~~~~~~~~~~~~~~~i~ 145 (273) T protein:vir:79 75 LLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADMLVDNGTA--------LTGSAPSDADDAFDLIA 145 (273) T ss_pred EEEeeecccceeeccHHHHhhcccHHH-HHHHHHHHHHHHHHHHHHHHHhhcccc--------cccccccchhhHHHHHH Confidence 888664 44556665333344556765 667789999999998765322111100 00011111223467899 Q ss_pred HHHHHHhhhccccCccEEEecHHHHHHHHHHhh----cc--Cc-cccccCCCCeecceeeEeeCccccce---EEEEehh Q lcl|NC_016164. 717 SMESKVAADNADIGAMSYLTNSTLYGGFKTTEK----AT--ST-AQFVLEPGGTVNGYNVVRSNQVANGD---VFFGVWN 786 (836) Q Consensus 717 ~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd----~~--g~-~~~~~~~~~~l~G~pVv~s~~~~~~~---i~~gD~s 786 (836) ++...|..++....+-.++++|..+..|....+ .. |. ..+..+.-++|.|++|+.|+.+|.+. ++.+-.+ T Consensus 146 ~a~~~ld~~~vP~~~R~lvv~p~~~~~Ll~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~i~~s~~lp~~~~~~~~a~~~~ 225 (273) T protein:vir:79 146 SALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPS 225 (273) T ss_pred HHHHHhhhccCCccCcEEEECHHHHHHHhhchhhhhhhhhcccccceeeeEeeEEeceEEEecccccccCceEEEEEecc Confidence 999988888776566789999999988754322 11 11 22334556789999999999999654 3333333 Q ss_pred ceEEEeecceEEEEecccccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 787 QMIMGMWGALDIQVNPYALDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 787 ~~~i~~~~~l~i~~~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) .+....+ ...+.. +..-..-...+++.+.+|+++++|+++++++.+= T Consensus 226 A~~~a~~-~~~~e~--~r~~~~~~~~v~~~~~yg~~v~~p~~vv~~~~~g 272 (273) T protein:vir:79 226 AAAYVSQ-IDTVEA--LRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTG 272 (273) T ss_pred ceeeeee-hhhhhc--ccCcccceeeeeeeeeeeeEEecCceEEEEeccC Confidence 3332221 112211 1111222457888999999999999999998666 No 127 >protein:vir:105822 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655767;genbank:gi:109522090;genbank:GeneID:4157630 Probab=99.52 E-value=6.1e-15 Score=98.52 Aligned_cols=257 Identities=14% Similarity=0.028 Sum_probs=166.0 Q ss_pred cccccccchhhHHHHHHHHHhhhhhhhhcceeee---cCCceEEEEEecCCceeeeeccCcccccccccceeEEeeeeee Q lcl|NC_016164. 567 AAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLT---GLQGPVAIPRQTGAATAYWVAEGGDPTESQPSVDQVALVAKTL 643 (836) Q Consensus 567 ~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~---~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~it~~~~t~ 643 (836) -+-...+|+.+...+++.++....+.++..+... ..+..+.+|+...........+++.+.....+.+.+++++.+. T Consensus 1 MA~~~~~pe~~~~~v~~~~~~~lv~~~l~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~tid~~ 80 (273) T protein:vir:10 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLLIDQE 80 (273) T ss_pred CcchhhhHHHHHHHHHHHHHhhhccchhhccccccccccCceEEEeecccccccccccCCCccCccccccceEEEEEeee Confidence 1112346899999999999998888887554322 2235788888776665666777877777777778888877654 Q ss_pred -eeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccccccccccchhHHHHHHHHHHH Q lcl|NC_016164. 644 -GAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTENFGATNPTYVELVSMESKV 722 (836) Q Consensus 644 -~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~aa~~~t~~~l~~a~~~l 722 (836) +.-+.|+..-...+..++.+ +.+.++.+++.++|..++.-....... ....+.....-.++.|.++...| T Consensus 81 ~~~~~~i~d~d~~~~~~~~~~-~~~~~~~alA~~vD~~i~~~~~~a~~~--------~~~~~~~~~~~~~~~i~~a~~~l 151 (273) T protein:vir:10 81 KSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADMLVDNGTA--------LTGSAPTDADDAFDLIAKALKEL 151 (273) T ss_pred eecceEeecHHHhhhhccHHH-HHHHHHHHHHHHHHHHHHHHHhccccc--------cccccccchhHHHHHHHHHHHHh Confidence 44455665333334456766 677789999999998776422111100 00011111223578899999999 Q ss_pred hhhccccCccEEEecHHHHHHHHHHh----hcc--C-ccccccCCCCeecceeeEeeCccccce---EEEEehhceEEEe Q lcl|NC_016164. 723 AADNADIGAMSYLTNSTLYGGFKTTE----KAT--S-TAQFVLEPGGTVNGYNVVRSNQVANGD---VFFGVWNQMIMGM 792 (836) Q Consensus 723 ~~~~~~~~~~~~vmnp~~~~~L~~lk----d~~--g-~~~~~~~~~~~l~G~pVv~s~~~~~~~---i~~gD~s~~~i~~ 792 (836) ..++....+-.++++|..+..|.... ... | ...+..+.-++|.|++|+.|+.+|.+. ++++..+.+.... T Consensus 152 d~~~vP~~~R~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~i~G~~v~~s~~lp~~~~~~~~~~~~~A~~~a~ 231 (273) T protein:vir:10 152 TKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYVS 231 (273) T ss_pred hhcCCCcCCCEEEECHHHHHHHhcchhhhhhhhccccccceeeeeeeEEeceEEEEecccccCCccEEEEEeccceeeee Confidence 88887666778999999999886532 211 1 122345556789999999999999653 4555544443332 Q ss_pred ec-ceEEEEecccccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 793 WG-ALDIQVNPYALDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 793 ~~-~l~i~~~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) .. .++...++ .+=...+++.+.+|+++++|++++.++.+= T Consensus 232 q~~~~e~~r~~----~~~~~~v~~~~~yg~~v~~~~~~~~l~~~g 272 (273) T protein:vir:10 232 QIDTVEALRDQ----DSFSDRIRALHVYGGKVVRPTGVVVFNKTG 272 (273) T ss_pred eeehhhcccCC----CcceeeeeeeeeeeeeEeccceEEEEeccC Confidence 11 12222222 122457888999999999999999998666 No 128 >protein:vir:102605 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655002;genbank:gi:109392192;genbank:GeneID:4157227 Probab=99.52 E-value=6.1e-15 Score=98.52 Aligned_cols=257 Identities=14% Similarity=0.028 Sum_probs=166.0 Q ss_pred cccccccchhhHHHHHHHHHhhhhhhhhcceeee---cCCceEEEEEecCCceeeeeccCcccccccccceeEEeeeeee Q lcl|NC_016164. 567 AAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLT---GLQGPVAIPRQTGAATAYWVAEGGDPTESQPSVDQVALVAKTL 643 (836) Q Consensus 567 ~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~---~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~it~~~~t~ 643 (836) -+-...+|+.+...+++.++....+.++..+... ..+..+.+|+...........+++.+.....+.+.+++++.+. T Consensus 1 MA~~~~~pe~~~~~v~~~~~~~lv~~~l~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~tid~~ 80 (273) T protein:vir:10 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLLIDQE 80 (273) T ss_pred CcchhhhHHHHHHHHHHHHHhhhccchhhccccccccccCceEEEeecccccccccccCCCccCccccccceEEEEEeee Confidence 1112346899999999999998888887554322 2235788888776665666777877777777778888877654 Q ss_pred -eeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccccccccccchhHHHHHHHHHHH Q lcl|NC_016164. 644 -GAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTENFGATNPTYVELVSMESKV 722 (836) Q Consensus 644 -~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~aa~~~t~~~l~~a~~~l 722 (836) +.-+.|+..-...+..++.+ +.+.++.+++.++|..++.-....... ....+.....-.++.|.++...| T Consensus 81 ~~~~~~i~d~d~~~~~~~~~~-~~~~~~~alA~~vD~~i~~~~~~a~~~--------~~~~~~~~~~~~~~~i~~a~~~l 151 (273) T protein:vir:10 81 KSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADMLVDNGTA--------LTGSAPTDADDAFDLIAKALKEL 151 (273) T ss_pred eecceEeecHHHhhhhccHHH-HHHHHHHHHHHHHHHHHHHHHhccccc--------cccccccchhHHHHHHHHHHHHh Confidence 44455665333334456766 677789999999998776422111100 00011111223578899999999 Q ss_pred hhhccccCccEEEecHHHHHHHHHHh----hcc--C-ccccccCCCCeecceeeEeeCccccce---EEEEehhceEEEe Q lcl|NC_016164. 723 AADNADIGAMSYLTNSTLYGGFKTTE----KAT--S-TAQFVLEPGGTVNGYNVVRSNQVANGD---VFFGVWNQMIMGM 792 (836) Q Consensus 723 ~~~~~~~~~~~~vmnp~~~~~L~~lk----d~~--g-~~~~~~~~~~~l~G~pVv~s~~~~~~~---i~~gD~s~~~i~~ 792 (836) ..++....+-.++++|..+..|.... ... | ...+..+.-++|.|++|+.|+.+|.+. ++++..+.+.... T Consensus 152 d~~~vP~~~R~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~i~G~~v~~s~~lp~~~~~~~~~~~~~A~~~a~ 231 (273) T protein:vir:10 152 TKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYVS 231 (273) T ss_pred hhcCCCcCCCEEEECHHHHHHHhcchhhhhhhhccccccceeeeeeeEEeceEEEEecccccCCccEEEEEeccceeeee Confidence 88887666778999999999886532 211 1 122345556789999999999999653 4555544443332 Q ss_pred ec-ceEEEEecccccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 793 WG-ALDIQVNPYALDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 793 ~~-~l~i~~~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) .. .++...++ .+=...+++.+.+|+++++|++++.++.+= T Consensus 232 q~~~~e~~r~~----~~~~~~v~~~~~yg~~v~~~~~~~~l~~~g 272 (273) T protein:vir:10 232 QIDTVEALRDQ----DSFSDRIRALHVYGGKVVRPTGVVVFNKTG 272 (273) T ss_pred eeehhhcccCC----CcceeeeeeeeeeeeeEeccceEEEEeccC Confidence 11 12222222 122457888999999999999999998666 No 129 >protein:vir:99424 Length: 360 # NCBI annotation: hypothetical protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919080;genbank:gi:119757038;genbank:GeneID:4606077 Probab=99.50 E-value=9e-15 Score=97.58 Aligned_cols=295 Identities=9% Similarity=0.055 Sum_probs=176.8 Q ss_pred hhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEE Q lcl|NC_016164. 529 AFEREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAI 608 (836) Q Consensus 529 ~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~ 608 (836) +.. .+.+....... ......+ +.+.++.++.++++.....|++...+.+++.+.. +.++.......+ T Consensus 1 ~~~----~~~~~~~~n~~------~~~i~k~--~it~~~l~~g~L~p~~a~~Fl~~v~~~t~iL~~~-r~~~~~s~~~ei 67 (360) T protein:vir:99 1 MSS----NSTIDSVRNQN------MNSLSQK--DIGLAELDGFQLPVDVTEEFLERMQKGVQILGMA-DTMTLARLEMEV 67 (360) T ss_pred Ccc----hhHHHHHhhhH------HHHHHhh--hccccccCceeecHHHHHHHHHHHhhccchhhhc-ceeecccccccc Confidence 111 11111110000 0111111 1222233445566677888999999999999884 555766666777 Q ss_pred EEecCCceeee-eccCcccc-cccccceeEEe-eeeeeeeeehhHHHHHhcch----hHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_016164. 609 PRQTGAATAYW-VAEGGDPT-ESQPSVDQVAL-VAKTLGAYTEFSRRLMLQSS----IDVEQMVRTELATVIALEIDRAA 681 (836) Q Consensus 609 p~~~~~~~a~~-v~Eg~~~~-~~~~~~~~it~-~~~t~~~~i~ISrelL~ds~----~~l~~~i~~~l~~a~a~~~d~~i 681 (836) ++..-+..... -.|++..+ ..+++...+.+ ..+++-....++.+.+.+.. ..+.+.|.+.|++++++-++... T Consensus 68 ~kig~G~r~~r~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~~i~~~~~~~n~~~~~~~f~~~i~~~~ae~~~~Dle~l~ 147 (360) T protein:vir:99 68 PQFGVPRLSGHTRDEEGSRTENSEAESGSVKFNATDKSYYILVEPKRDALKNTHYGPDQFGDYIVDQFIERYGNDLGLMG 147 (360) T ss_pred cccccceeeccccccCCCCCcCCcCccccCccccccceeeEeechHHHHHhhhhcccchhHHHHHHHHHHHHHHHHHHHH Confidence 66655443322 22443332 24455555555 34555566677777766542 24568999999999999999999 Q ss_pred HhhcCCcc--------c-----cccccccc--ccccccccc------------------------------cchhHHHHH Q lcl|NC_016164. 682 LYGLGSNS--------Q-----PEGLKFVT--GINTENFGA------------------------------TNPTYVELV 716 (836) Q Consensus 682 l~G~Gt~~--------~-----p~Gi~~~~--~~~~~t~aa------------------------------~~~t~~~l~ 716 (836) ++|+.... . ..|++..+ ++..+..++ ...+..-+. T Consensus 148 ~~g~~ds~d~~~~~~~d~fl~~~dGwlKka~~~~~~id~a~d~t~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~lf~ 227 (360) T protein:vir:99 148 IRAGASSGNLQSIGGAAELDNTFKGWIARAEGDAQSVDDAGDSTRIGLEDTATADADSMPSIANTDGSGNPQPVDTSLFN 227 (360) T ss_pred hhccchhcccccCcccchhhhhhHHHHHHhhcccchhhccccccccccccccccccccchhhhccccccccccchHHHHH Confidence 98875421 1 24554332 111110000 112333467 Q ss_pred HHHHHHhhhccccC--ccEEEecHHHHHHHHH-Hhhcc---CccccccCCCCeecceeeEeeCccccceEEEEehhceEE Q lcl|NC_016164. 717 SMESKVAADNADIG--AMSYLTNSTLYGGFKT-TEKAT---STAQFVLEPGGTVNGYNVVRSNQVANGDVFFGVWNQMIM 790 (836) Q Consensus 717 ~a~~~l~~~~~~~~--~~~~vmnp~~~~~L~~-lkd~~---g~~~~~~~~~~~l~G~pVv~s~~~~~~~i~~gD~s~~~i 790 (836) +++..|...|.... +.+|+|++......+. +.+-. |-..+.....-+.+|+||+..+.+|++.++|-+++++.+ T Consensus 228 ~~~~~Lp~kyr~~~~~~~~~~~s~~~~~~yr~~L~~R~t~LGd~~l~g~~~~~~~Gipi~~v~~~pd~~~mlT~p~NLi~ 307 (360) T protein:vir:99 228 ETIQTLDSRYRESDAYSPVLMTSPNQVQSYTMSLTEREDPLGSAVIFGDSDITPFSYDLVGVNGFPDEYMMFTDPNNLAF 307 (360) T ss_pred HHHHhcchhhhcCcccceEEEccCchHHHHHHHHhccCcccchhheecccccccceeeeEEcCCCCCCceEEeccCceeE Confidence 88889988876543 4489999987655443 32222 222223233335689999999999999999999999999 Q ss_pred EeecceEEEEeccccc-ccCc--EEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 791 GMWGALDIQVNPYALD-KSGS--VRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 791 ~~~~~l~i~~~~~~~~-~~~~--~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) +.+..+++....+... .... +.+....++|+.+.+++|.|++++== T Consensus 308 g~~~~iri~~~~e~~~~~~~~~~~~~~~~~~~D~~iee~~Av~~vt~~~ 356 (360) T protein:vir:99 308 GLYEEMELDQSTDTDKVHEQRLHSRNWLEGQFDFQIKEQQAGVLVTDLE 356 (360) T ss_pred EeeeeeEEeecccchhhhhhceeeeEEEEEEeeEEEEecccEEEEecCC Confidence 9999998875433222 1122 34445678999999999999988533 No 130 >protein:vir:103886 Length: 302 # NCBI annotation: putative major head subunit protein # Family: family:all:776 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938242;genbank:gi:38229147;genbank:GeneID:2648201 Probab=99.49 E-value=5.5e-15 Score=98.74 Aligned_cols=268 Identities=7% Similarity=-0.057 Sum_probs=157.8 Q ss_pred hhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEecCCceeeeeccCccccc Q lcl|NC_016164. 549 ILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQTGAATAYWVAEGGDPTE 628 (836) Q Consensus 549 ~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~ 628 (836) +. .++..-. .+-..+...+........+..+..+++.+.+++..++......+.+..+ .++++. T Consensus 1 m~-------------it~~~l~-~l~~~~~~~~~~~y~~a~~~~~~~a~~~~sdf~~~~~~~lg~~p~l~e~--~Ge~~~ 64 (302) T protein:vir:10 1 ML-------------INKQSLN-AAFVAIKTIFNNAFAAAPTTWQKIAMEVPSNTSSNDYKWLSTFPKMRRW--IGAKVV 64 (302) T ss_pred Cc-------------ccHHHHH-HHHHHHHHHHHHHHHhhhhhhhceeeecCCCcceeeceecCCCCCcccc--ccceee Confidence 00 0000000 0111223334444444444444445666778888888888887776443 377888 Q ss_pred ccccceeEEeeeeeeeeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCc-----ccccccccccccccc Q lcl|NC_016164. 629 SQPSVDQVALVAKTLGAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSN-----SQPEGLKFVTGINTE 703 (836) Q Consensus 629 ~~~~~~~it~~~~t~~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~-----~~p~Gi~~~~~~~~~ 703 (836) .++.....+++.++|+..+.|||++|+||++++...+.+.|+++.++.+++.++.-...+ ...+-+|++.|.... T Consensus 65 ~~l~~~~~~i~~~~~g~~v~i~R~~i~nDdlg~~~~~~~~~G~aaa~~~~~lv~~~L~~g~~~~~~DG~~fF~~dH~~g~ 144 (302) T protein:vir:10 65 KNLKAYKYVVENEDFEATVEVDRNDIEDDQIGIYSPQAKMAGYSAAQLPDELVYEAVNGAFTKPCFDGQYFIDTDHPVGD 144 (302) T ss_pred ccccccceeEEeecccceecccHHhhcccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCCcccCCcceecccccccc Confidence 899999999999999999999999999999999999999999999999999877543221 122346666553221 Q ss_pred c-----------ccccchhHHHHHHHHHHHhhhccc------cCccEEEecHHHHHHHHHHhhccCccccccCCCCeecc Q lcl|NC_016164. 704 N-----------FGATNPTYVELVSMESKVAADNAD------IGAMSYLTNSTLYGGFKTTEKATSTAQFVLEPGGTVNG 766 (836) Q Consensus 704 t-----------~aa~~~t~~~l~~a~~~l~~~~~~------~~~~~~vmnp~~~~~L~~lkd~~g~~~~~~~~~~~l~G 766 (836) . .....++.+.+.+++.+|..+... ..+..++..|......+.+-.. ++ +..+..+.+.| T Consensus 145 ~~~~N~g~~~~~~~~~~l~~~~~~aa~~am~~~k~~~G~~L~i~P~~LiVp~~le~~A~~ll~~-~~--~~~g~~Np~~g 221 (302) T protein:vir:10 145 ASVSNKGTAPLSNASQAAAKAGYGAARTAMKKFKDEEGRSLNVSPNVLLVGPALEDVAKMLLTN-PK--LADNTPNPYVG 221 (302) T ss_pred cccccccchhhhhcccccchHHHHHHHHHHHHHhhhcccccccCCCEEEecchhHHHHHHHhhc-cc--cCCCCcceecc Confidence 1 112234556677777777665432 3455677777666555544221 11 22233344444 Q ss_pred -eeeEeeCccccc-eEEE-EehhceEEEeecce-EEEEecccccccCcEEEEEEEEeccEEEcccceE--------EEee Q lcl|NC_016164. 767 -YNVVRSNQVANG-DVFF-GVWNQMIMGMWGAL-DIQVNPYALDKSGSVRVTALQDVDVAVRHPEAFC--------RGND 834 (836) Q Consensus 767 -~pVv~s~~~~~~-~i~~-gD~s~~~i~~~~~l-~i~~~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~--------~l~~ 834 (836) ..+++++.+..+ .||+ .|.+.+......+. .........|..+.+.++.+.++++.-+--.++. ..+. T Consensus 222 ~~~~vv~p~L~s~~aWyL~a~~~~i~~~~l~g~~~P~~~~~~~~~~dgv~~k~~~d~Gvd~R~~~G~~~wq~a~~s~g~~ 301 (302) T protein:vir:10 222 TAELVVDGRIESDTAWFLLDTTKPVKPFIFQPRKQPEFVSQVNLDSDDVFNLRKLKFGAEARAAAGYGFWQLAYGSTGTG 301 (302) T ss_pred ceEEEEeeccCCCCceEEEecCCccceEEEcCccccEEEeccCCCCCceEEEEEEEEeeeeeeecchhhhhhhhccCccC Confidence 467788887654 4554 45665543333222 2233334568888899988888875222211111 1122 Q ss_pred c Q lcl|NC_016164. 835 N 835 (836) Q Consensus 835 A 835 (836) | T Consensus 302 ~ 302 (302) T protein:vir:10 302 A 302 (302) T ss_pred C Confidence 2 No 131 >protein:vir:94622 Length: 341 # NCBI annotation: PfWMP4_37 # Family: family:all:2203 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762667;genbank:gi:115304375;genbank:GeneID:5142322 Probab=99.44 E-value=1.3e-14 Score=96.63 Aligned_cols=282 Identities=8% Similarity=-0.005 Sum_probs=169.1 Q ss_pred hhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeee---cCCceEEEEEecCCceeeeeccCcccccc Q lcl|NC_016164. 553 NDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLT---GLQGPVAIPRQTGAATAYWVAEGGDPTES 629 (836) Q Consensus 553 ~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~---~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~ 629 (836) ..+........-+++.-...+|+.+...+++.+.....+.++.. ... ..+..+++|+.. .+.+..+.+++.++.. T Consensus 1 ~~~~~~~~~~~~~t~~v~~fipei~s~~i~~~l~~~~v~~~~~~-d~~~~~~~Gdtv~ip~~g-~~~~~d~~~~~~i~~~ 78 (341) T protein:vir:94 1 MALGNTITGPSINTQRGQQFIPEQWLSEVQMFRKAKMLDTSVVK-TWGAQVKKGDTFHVPRIS-ELGVEDKATDVPVGVQ 78 (341) T ss_pred CcchhhhccccccchhHHHHHHHHHHHHHHHHHHhhcchhhccc-cccccccCCceEEEeccC-cceeeeecCCCccccc Confidence 00111101111111112223699999999999998888877643 222 224578898764 5667778888888888 Q ss_pred cccceeEEeeeeee-eeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCC-cccccccccccccccccccc Q lcl|NC_016164. 630 QPSVDQVALVAKTL-GAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGS-NSQPEGLKFVTGINTENFGA 707 (836) Q Consensus 630 ~~~~~~it~~~~t~-~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt-~~~p~Gi~~~~~~~~~t~aa 707 (836) .++...+++.+.+. ..-+.|+..-...+..++...+.+.++++++++.|..++..... ...+.+..........+..+ T Consensus 79 ~~~~~~~~itiD~~~~~~~~i~d~d~~~~~~d~~~~~~~~~~~aLA~~~D~~i~~~~a~~~~~~~~~~~~~~~~~~t~~~ 158 (341) T protein:vir:94 79 PVNDTDFVITVDTDRTTAVALDDLLEIQASYDLRAPYLEAMGYALAKDMTGSILGLRAAVQNTASQNVFSSSNGAITGNG 158 (341) T ss_pred cccCceEEEEEeeeeecceeechHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCccccCccccccCch Confidence 88888888888554 45567777555556778999999999999999999887743211 11111111111111122233 Q ss_pred cchhHHHHHHHHHHHhhhccccCccEEEecHHHHHHHHHHhhc-----cCccccccCCCCeecceeeEeeCccccceEEE Q lcl|NC_016164. 708 TNPTYVELVSMESKVAADNADIGAMSYLTNSTLYGGFKTTEKA-----TSTAQFVLEPGGTVNGYNVVRSNQVANGDVFF 782 (836) Q Consensus 708 ~~~t~~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~-----~g~~~~~~~~~~~l~G~pVv~s~~~~~~~i~~ 782 (836) ...+++.|.++...|..++.....-.++++|..+..|...... .|...+..+.-+++.|++|+.|+.+|.+.... T Consensus 159 ~~~~~~~i~~a~~~Lde~~VP~~gR~lvv~P~~~~~Ll~~~~~~~~~~~g~~~l~~G~ig~i~G~~V~~Sn~lp~~~~~~ 238 (341) T protein:vir:94 159 QAFSFAVFLAARRLLLEADVPEEKIVLLISPGQESALFTIPQFISKDFINNAPIAQGQIGSLMGVRVIRTSLIGNNSATG 238 (341) T ss_pred hhhhHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHhhchhhhhhhccccchhheeeeeeEeceEEEEecccccccccc Confidence 4467889999999998887766667889999999988653221 22223444445689999999999998653210 Q ss_pred ---------------------------Eehhce--EEEeecceE--EEEec-------------cccc--ccCcEEEEEE Q lcl|NC_016164. 783 ---------------------------GVWNQM--IMGMWGALD--IQVNP-------------YALD--KSGSVRVTAL 816 (836) Q Consensus 783 ---------------------------gD~s~~--~i~~~~~l~--i~~~~-------------~~~~--~~~~~~~r~~ 816 (836) +++..+ .++.+..+- -..++ +..| .+....+++. T Consensus 239 ~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~gl~~~~~av~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~ 318 (341) T protein:vir:94 239 WRNGAPTIAPAEATPGFTGSRYLPKQDSFTSLPATFTGNSRPVHTAVMCHMDWAAAVVSKAPRVTQSFENREQVWLMVGR 318 (341) T ss_pred ccccccceecccccccccccccccccccccccEEEEEEecccccceeeecchhhhccccccccccccchhhhhhhhhhhh Confidence 011111 111111110 00010 0111 1233456677 Q ss_pred EEeccEEEcccceEEEeecC Q lcl|NC_016164. 817 QDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 817 ~r~d~~v~~p~Af~~l~~A~ 836 (836) .-+|.+++||++.+.++.+= T Consensus 319 ~~~G~~~lrp~~~v~~~~~~ 338 (341) T protein:vir:94 319 QAYGARLYRPLHAVNIHTTG 338 (341) T ss_pred hhhcccccCcceeEEEecCc Confidence 78899999999988877444 No 132 >protein:vir:80180 Length: 381 # NCBI annotation: capsid protein # Family: family:all:2203 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285797;genbank:gi:148747831;genbank:GeneID:5220456 Probab=99.40 E-value=8e-14 Score=92.37 Aligned_cols=280 Identities=14% Similarity=0.062 Sum_probs=162.9 Q ss_pred HHHHhhhhhhhhhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhccee-eec-CCceEEEEEecCCc Q lcl|NC_016164. 538 TAQRMGVTPRGILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTM-LTG-LQGPVAIPRQTGAA 615 (836) Q Consensus 538 ~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~-~~~-~~~~~~~p~~~~~~ 615 (836) .+.-.| ..+. ....-..+.....+|+.+...+++.+.+...+..+..+. ... ....+++|+.. .+ T Consensus 1 ~~~~~~--~~~~----------~~~~~~~t~~~~fiPev~s~~v~~~l~~~lv~~~l~~~~~~~~~~GdTV~ip~~g-~~ 67 (381) T protein:vir:80 1 MATIQG--TGGY----------KGSAVDLSNVQVFIPEVWSSEVRMFRDQKFAALEATKKIPFEGKKGDLIHIPNIS-RA 67 (381) T ss_pred Cceecc--cccc----------cCcccchhhHHhhhhHHHHHHHHHHHHHhhhhhhccccccceeecCceEEeeccC-cc Confidence 000000 0000 011111122234569999999999999988888775442 222 23478888875 56 Q ss_pred eeeeeccCcccccccccceeEEeeeeeee-eeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCC-ccccc- Q lcl|NC_016164. 616 TAYWVAEGGDPTESQPSVDQVALVAKTLG-AYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGS-NSQPE- 692 (836) Q Consensus 616 ~a~~v~Eg~~~~~~~~~~~~it~~~~t~~-~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt-~~~p~- 692 (836) .+..+.++++++...++...+++++.++- ..+.|+..-...+..++.+.+.+.++.++++..|+.++..... ...+. T Consensus 68 ~a~d~~~g~~i~~~~~~~~~~~itID~~~~~~~~Idd~D~~~~~~D~~~~~~~~~~~aLA~~~D~~i~~~~~~~~~~~~~ 147 (381) T protein:vir:80 68 AVYDKQPQTPVNLQARTDSEFTFTVTKYKESSFMIEDIVNTQASYTLRQYYTKEAGYALARDMDNFALAHRAVINAFPSQ 147 (381) T ss_pred eeeeecCCCcccccccCCceEEEEEeeeeecceeechHHHHhhccChHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc Confidence 77788899888888888888888885543 3467777555556678999999999999999999988743211 11111 Q ss_pred -------ccccccccccccccccchhHHHHHHHHHHHhhhccccCccEEEecHHHHHHHHHHhhcc-----CccccccCC Q lcl|NC_016164. 693 -------GLKFVTGINTENFGATNPTYVELVSMESKVAADNADIGAMSYLTNSTLYGGFKTTEKAT-----STAQFVLEP 760 (836) Q Consensus 693 -------Gi~~~~~~~~~t~aa~~~t~~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~~-----g~~~~~~~~ 760 (836) ++-........+......+++.|.++...|..++....+-+++++|..+..|....... +...+..+. T Consensus 148 ~~~t~~~~i~~~~~~~~~t~~~~~~t~~~i~~a~~~Lde~~VP~egR~lvv~P~~~~~Ll~~~~~~~ad~~~~~~l~~G~ 227 (381) T protein:vir:80 148 RIYSYDTTLGDGTVNAHLTGTPAPLTYAALLLAKQKLDEADVPQEGRIVMVSPAQYIDLLSINQFISVDFSQVKPVTSGV 227 (381) T ss_pred ccccccccccccccccccccchhhHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhhchhhhhhhhccchhhhcee Confidence 11111111122233344678999999999988877666678999999999886543221 122234444 Q ss_pred CCeecceeeEeeCccccceEE-----EEehhceEEEeecceEEEEecc-cccccCcEEEEEEEEeccEEEcc-cceEEEe Q lcl|NC_016164. 761 GGTVNGYNVVRSNQVANGDVF-----FGVWNQMIMGMWGALDIQVNPY-ALDKSGSVRVTALQDVDVAVRHP-EAFCRGN 833 (836) Q Consensus 761 ~~~l~G~pVv~s~~~~~~~i~-----~gD~s~~~i~~~~~l~i~~~~~-~~~~~~~~~~r~~~r~d~~v~~p-~Af~~l~ 833 (836) -++|+|++|+.|+.+|.+.+. +|-+... ...+ ...++ ..|......++....+|.++... ..+-..+ T Consensus 228 Ig~i~G~~Vv~Sn~lp~~~~t~~~~~agap~~~----~~~~--~~~~~~g~~s~~a~av~~~k~yd~~~~~~~~~~~~~~ 301 (381) T protein:vir:80 228 VGTILGMEVIVTTQIGINSLTGYVNGQGAPTQP----TPGV--LGSPYLPDQAGTANVVNTGSASDLAVSLSYFGLPVFS 301 (381) T ss_pred eeEEcceEEEeecccccccccceeeeccccccc----cccc--cccccccccccceeeeeeeeeeceeeeeeeccceeee Confidence 578999999999999965321 1111100 0000 01111 12333445566666666666332 2233222 Q ss_pred ecC Q lcl|NC_016164. 834 DNL 836 (836) Q Consensus 834 ~A~ 836 (836) .|. T Consensus 302 g~~ 304 (381) T protein:vir:80 302 GAG 304 (381) T ss_pred cce Confidence 222 No 133 >protein:vir:78739 Length: 332 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285448;genbank:gi:148724482;genbank:GeneID:5220210 Probab=99.27 E-value=1.1e-12 Score=86.13 Aligned_cols=286 Identities=10% Similarity=-0.017 Sum_probs=158.8 Q ss_pred HHHHHHhhhhhhhhhhhhhhhhhhhhhcccccccc---cccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEec Q lcl|NC_016164. 536 EATAQRMGVTPRGILAPNDVLHRDLVVDTASAAGD---LVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQT 612 (836) Q Consensus 536 ~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~---~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~ 612 (836) -.++ ..+..+..... +...+++. .+.=+.+.++++......+.++.+.....-..+..+.+++. T Consensus 1 ~~~~-------~~~~~~~~~~~-----~~~~~~~d~~~al~le~~~geV~~~f~~~s~~~~~~~~r~i~~G~tv~i~~i- 67 (332) T protein:vir:78 1 MTTL-------SNFSLPNQANG-----GARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFT- 67 (332) T ss_pred Cccc-------ccccCCccccC-----CccccccccchhhhhhhhhhhHHHHHHHHhhhhhccccccccccceEEEEec- Confidence 0000 00000111000 01111121 23347888999999988888888765443334567888877 Q ss_pred CCceeeeeccCccccc-ccccceeEEeeeee--eeeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHh----hc Q lcl|NC_016164. 613 GAATAYWVAEGGDPTE-SQPSVDQVALVAKT--LGAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALY----GL 685 (836) Q Consensus 613 ~~~~a~~v~Eg~~~~~-~~~~~~~it~~~~t--~~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~----G~ 685 (836) +..+++....|..... .+++-++.++.+.+ |.. +.|..---.++..++.+.+.++.+.++++..|..++. +. T Consensus 68 g~~~~~~~~~g~~l~~~~~~~~~~~~l~ID~~ky~~-~~VddiD~~q~~~dl~~~~~~~~g~aLA~~~D~~i~~~l~~aa 146 (332) T protein:vir:78 68 GKLSAGYHTPGTPIVGDAGIKANEKTLVMDDLLVSS-QFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKAS 146 (332) T ss_pred cceeEeeecCCCCCCCCCCCCCceEEEEEehhhhhH-HHHHhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhh Confidence 4556666666665533 24555555566554 333 2332211122445788999999999999999987653 21 Q ss_pred CCcccccccccccccccccc--cccchhHHHHHHHHHHHhhhccccCccEEEecHHHHHHHHHHhhcc-------C-ccc Q lcl|NC_016164. 686 GSNSQPEGLKFVTGINTENF--GATNPTYVELVSMESKVAADNADIGAMSYLTNSTLYGGFKTTEKAT-------S-TAQ 755 (836) Q Consensus 686 Gt~~~p~Gi~~~~~~~~~t~--aa~~~t~~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~~-------g-~~~ 755 (836) .++....+......+..... ..+...++.|.++...|..++.....-.++++|..|..|...++.. + ... T Consensus 147 ~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~d~~~~n~~~~~~~~~ 226 (332) T protein:vir:78 147 AEASPVTGEPGGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGD 226 (332) T ss_pred cccCcccccccccccccCCccccCHHHHHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHHhhcCceeeeeeccccccc Confidence 11111122111111111111 1122356788999999988887766667888999998886543311 1 122 Q ss_pred cccCC-CCeecceeeEeeCccccce--------------EEEEehhce--EEEeecce--------EEEEec-ccccccC Q lcl|NC_016164. 756 FVLEP-GGTVNGYNVVRSNQVANGD--------------VFFGVWNQM--IMGMWGAL--------DIQVNP-YALDKSG 809 (836) Q Consensus 756 ~~~~~-~~~l~G~pVv~s~~~~~~~--------------i~~gD~s~~--~i~~~~~l--------~i~~~~-~~~~~~~ 809 (836) +..+. -++++|++|+.|+.+|... .|-|+|+.. .++.+..+ .+.... +..-.+- T Consensus 227 ~~~g~~i~~i~G~~V~~Sn~lp~~~g~~~~~~~~~~~~n~~~~~~~~~~~~~~h~~a~~~v~~~~~~~~~t~~~~~~~~~ 306 (332) T protein:vir:78 227 MNSGKGLYSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQ 306 (332) T ss_pred eecceeeeEEeeeEEEecCccccCcccccccccccccccccccccccceEEeecccceeeeeeeccchhhhhcccchhhh Confidence 23332 3578999999999998532 244555541 22222221 221111 1111112 Q ss_pred cEEEEEEEEeccEEEcccceEEEeec Q lcl|NC_016164. 810 SVRVTALQDVDVAVRHPEAFCRGNDN 835 (836) Q Consensus 810 ~~~~r~~~r~d~~v~~p~Af~~l~~A 835 (836) ...++..+-+|.+++||++++.++.| T Consensus 307 ~d~i~~~~~~G~~v~rPe~~v~l~~a 332 (332) T protein:vir:78 307 GDLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) T ss_pred HhhhhhhhhhcCceecccceEEEeeC Confidence 34677778899999999999999999 No 134 >protein:vir:8885 Length: 347 # NCBI annotation: major capsid protein A # Family: family:all:975 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813774;genbank:gi:29366729;genbank:GeneID:1258837 Probab=99.22 E-value=2.1e-12 Score=84.65 Aligned_cols=285 Identities=12% Similarity=0.070 Sum_probs=159.8 Q ss_pred HHHHhhhhhhhhhhhhhhhhhhhhhcccccccc--cccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEecCCc Q lcl|NC_016164. 538 TAQRMGVTPRGILAPNDVLHRDLVVDTASAAGD--LVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQTGAA 615 (836) Q Consensus 538 ~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~--~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~~~~ 615 (836) .+...+ ........+.....+. .+.-+.+.+++.......+.++.+.....-..+..+.+|+. +.. T Consensus 1 ~a~~~~-----------~~~~~~~~g~~~~~~d~~al~ie~~~geV~~~f~~~s~~~~~~~~r~i~~G~sv~~~~i-G~~ 68 (347) T protein:vir:88 1 MANATG-----------GQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVM-GRT 68 (347) T ss_pred CCCccc-----------chhhhccCCCCccccchHHHHHHHHHHHHHHHHHHHhhhhhccccccccCcceEEEeee-cce Confidence 000000 0000001111111222 23447888899888888888888765543345667888866 444 Q ss_pred eeeeeccCccccc--ccccceeEEeeeeee-eeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhc--CC--- Q lcl|NC_016164. 616 TAYWVAEGGDPTE--SQPSVDQVALVAKTL-GAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGL--GS--- 687 (836) Q Consensus 616 ~a~~v~Eg~~~~~--~~~~~~~it~~~~t~-~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~--Gt--- 687 (836) ++.....|..... ..+..+++++.+.++ ...+.|.+.--.....++.+.+.+++++++++..|+.++.-. ++ T Consensus 69 ~~~~~~~g~~l~~~~~~~~~~~~~i~ID~~~y~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLA~~~D~~i~~~l~~~a~~~ 148 (347) T protein:vir:88 69 KGYYLAPGENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLP 148 (347) T ss_pred eeeeeccccCCCCCCCCCccceEEEEEechhhhhhhhhhHHHHhhcCCchHHHHHHHHHHHHHHHHHHHHHHHHHhhccc Confidence 5555666665543 356677777777665 233344432222344578889999999999999999876321 11 Q ss_pred ---cccccccccccccccccc-------cccchhHHHHHHHHHHHhhhccccCccEEEecHHHHHHHHHHhhcc-----C Q lcl|NC_016164. 688 ---NSQPEGLKFVTGINTENF-------GATNPTYVELVSMESKVAADNADIGAMSYLTNSTLYGGFKTTEKAT-----S 752 (836) Q Consensus 688 ---~~~p~Gi~~~~~~~~~t~-------aa~~~t~~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~~-----g 752 (836) +..+.|+-.....+..+. ......++.|.++...|..++.....-+++++|..|..|......+ + T Consensus 149 ~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~~~~~ 228 (347) T protein:vir:88 149 AASNENIAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAA 228 (347) T ss_pred cccccccCCccccccccccccccccchhhhHHHHHHHHHHHHHHHhhcCCCCCCCEEEeCHHHHHHHhcchhhhhhhhcc Confidence 111233222211111111 1111236788889989988887777789999999998875533222 1 Q ss_pred ccccccCCCCeecceeeEeeCccccce---E----------------------EEEehhce-EE-Eee--------cceE Q lcl|NC_016164. 753 TAQFVLEPGGTVNGYNVVRSNQVANGD---V----------------------FFGVWNQM-IM-GMW--------GALD 797 (836) Q Consensus 753 ~~~~~~~~~~~l~G~pVv~s~~~~~~~---i----------------------~~gD~s~~-~i-~~~--------~~l~ 797 (836) ...+.++..+++.|++|+.|+.+|.+. . +.+|++.. .+ +.+ .++. T Consensus 229 ~~~~~~G~vg~i~G~~V~~s~nlp~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~d~~~~~~l~~~~~a~g~v~~~d~~ 308 (347) T protein:vir:88 229 LIDPETGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMA 308 (347) T ss_pred ccchhcceeeeeccceEEEeecccccccccccccccccccccccccccccccccccccCcEEEEEechhhhhheecccce Confidence 122344445689999999999998421 1 22334321 11 111 1112 Q ss_pred EEEecccccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 798 IQVNPYALDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 798 i~~~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) ++.. ..-.+-...+++.+-+|.+++||++.+.++-.- T Consensus 309 ~e~~--r~~~~~~d~i~~~~~~G~~~~rPe~a~~~~~~~ 345 (347) T protein:vir:88 309 LERA--RRPEFQADQIIGKYAMGHGGLRPEAAGALVFTP 345 (347) T ss_pred eeee--echhhHHHHhhhhhhhcCceeccceEEEEEeCC Confidence 2221 111223347888899999999999887766433 No 135 >protein:vir:2201 Length: 345 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041998;swissprot:sw:p19726;genbank:gi:9627469;goa:P19726;uniprot:P19726;genbank:GeneID:1261026 Probab=99.22 E-value=2.5e-12 Score=84.15 Aligned_cols=283 Identities=14% Similarity=0.095 Sum_probs=160.3 Q ss_pred hhhhhhhhhhhhhhccccc--ccc--cccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEecCCceeeeeccCc Q lcl|NC_016164. 549 ILAPNDVLHRDLVVDTASA--AGD--LVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQTGAATAYWVAEGG 624 (836) Q Consensus 549 ~~~~~~~~~~a~~~~~~~~--~g~--~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~ 624 (836) +......... ........ .+. .+.-+.+.+++.......+.++++...+.-..+..+.+|+. +..++.....|. T Consensus 1 ~~~~~~~~~~-~~~~~~~~~~~~~~~al~le~f~geV~~~f~~~s~~~~~~~~r~i~~gks~~~~~i-G~~~~~~~~~G~ 78 (345) T protein:vir:22 1 MASMTGGQQM-GTNQGKGVVAAGDKLALFLKVFGGEVLTAFARTSVTTSRHMVRSISSGKSAQFPVL-GRTQAAYLAPGE 78 (345) T ss_pred Ccccccchhc-ccccccccccCCchhHHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEEeee-cceEEEeeecCC Confidence 0000000000 00001110 111 24447888999998888899998866554445678888876 667777787787 Q ss_pred ccccc--cccceeEEeee--eeeeeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcC--------Cccccc Q lcl|NC_016164. 625 DPTES--QPSVDQVALVA--KTLGAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLG--------SNSQPE 692 (836) Q Consensus 625 ~~~~~--~~~~~~it~~~--~t~~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~G--------t~~~p~ 692 (836) +.... .+...+.++.+ .+|... .|..---.++..++.+.+.++++.++++..|+.++.-.. .+..|. T Consensus 79 ~l~~~~~~~~~~e~~ltID~~~y~~~-~VddiD~~q~~~D~r~~~s~~~G~aLA~~~D~~i~~~l~k~a~~~~~~~~~~~ 157 (345) T protein:vir:22 79 NLDDKRKDIKHTEKVITIDGLLTADV-LIYDIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAGLCNVESKYNENIE 157 (345) T ss_pred CCCCCCCCcccceEEEEecchhhhhh-hHhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccc Confidence 76543 35556644444 444432 222211122456789999999999999999987763111 112333 Q ss_pred ccccccccccc--------cccccchhHHHHHHHHHHHhhhccccCccEEEecHHHHHHHHHHhhccCc-----cccccC Q lcl|NC_016164. 693 GLKFVTGINTE--------NFGATNPTYVELVSMESKVAADNADIGAMSYLTNSTLYGGFKTTEKATST-----AQFVLE 759 (836) Q Consensus 693 Gi~~~~~~~~~--------t~aa~~~t~~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~~g~-----~~~~~~ 759 (836) |+-+....... ....+...++.|..+...|..++.....-+++++|..|..|..-+..+.. ..+..+ T Consensus 158 ~~~~~~~~~~~~~g~~~t~~~~~~~~~~~ai~~a~~~Lde~~VP~~~R~~vv~P~~y~~Ll~~~~~~~~~~~~~~~~~~G 237 (345) T protein:vir:22 158 GLGTATVIETTQNKAALTDQVALGKEIIAALTKARAALTKNYVPAADRVFYCDPDSYSAILAALMPNAANYAALIDPEKG 237 (345) T ss_pred ccccccccccccccccccccccCHHHHHHHHHHHHHHhhhcCCCccCCEEEeChHHHHHHhccccccccccccccccccc Confidence 32221111110 11111234778888888888888777778899999999988654332211 112233 Q ss_pred CCCeecceeeEeeCccccce-----------------------E---------EEEehhceEEEeecceEEEEecccccc Q lcl|NC_016164. 760 PGGTVNGYNVVRSNQVANGD-----------------------V---------FFGVWNQMIMGMWGALDIQVNPYALDK 807 (836) Q Consensus 760 ~~~~l~G~pVv~s~~~~~~~-----------------------i---------~~gD~s~~~i~~~~~l~i~~~~~~~~~ 807 (836) .-++++|++|+.|+.+|.+. . +|.-.+.+..+...++.++..... . T Consensus 238 ~V~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~l~~h~~A~~~v~~~~~~~e~~r~~--~ 315 (345) T protein:vir:22 238 SIRNVMGFEVVEVPHLTAGGAGTAREGTTGQKHVFPANKGEGNVKVAKDNVIGLFMHRSAVGTVKLRDLALERARRA--N 315 (345) T ss_pred eEEEEeceEEEecccccccccCccccCcccccccccccccceeeeeccCceEEEEEehhheeeeeeecceeeeeech--h Confidence 34578999999999887421 0 111111111222222223222221 1 Q ss_pred cCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 808 SGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 808 ~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) +-...+++.+-+|.+++||++.+.++-=| T Consensus 316 ~~~d~I~~~~a~G~~vlRPeaa~~i~~~~ 344 (345) T protein:vir:22 316 FQADQIIAKYAMGHGGLRPEAAGAVVFKV 344 (345) T ss_pred HHHHHHHHHHhcCCcccccceeEEEEEee Confidence 22246777888999999999999988777 No 136 >protein:vir:94576 Length: 347 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919012;genbank:gi:119637776;genbank:GeneID:5179336 Probab=99.22 E-value=2.1e-12 Score=84.64 Aligned_cols=284 Identities=12% Similarity=0.083 Sum_probs=159.8 Q ss_pred hhhhhhhhhhhhhhhhhhhhhcccccccc--cccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEecCCceeee Q lcl|NC_016164. 542 MGVTPRGILAPNDVLHRDLVVDTASAAGD--LVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQTGAATAYW 619 (836) Q Consensus 542 ~g~~~~g~~~~~~~~~~a~~~~~~~~~g~--~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~~~~~a~~ 619 (836) +.....|... ....+....+|. .+.-+.+.+++.......+.++++..+..-..+..+.+|+. +..++.. T Consensus 1 ma~~~~~~~~-------~t~~g~~~~~~d~~al~ie~~~geV~~~f~~~s~~~~~~~~rti~~G~sv~~~~i-G~~~~~~ 72 (347) T protein:vir:94 1 MANMNGGQQM-------GKDQGKGMSAGDKLALFLKVFGGEVLTAFTRTSVTMNKHLVRSIQSGKSAQFPVL-GRTKAAY 72 (347) T ss_pred CCcccccccc-------ccccccCCcccchHHHHHHHHhHHHHHHHHHHHhhhhhhhheeccccceEEeeec-cceeEee Confidence 0000001000 000000111121 13447889999988888899998866543344667778755 5566677 Q ss_pred eccCccccc--ccccceeEEeeeeee-eeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhh----c----CCc Q lcl|NC_016164. 620 VAEGGDPTE--SQPSVDQVALVAKTL-GAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYG----L----GSN 688 (836) Q Consensus 620 v~Eg~~~~~--~~~~~~~it~~~~t~-~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G----~----Gt~ 688 (836) +..|.+... ..+..++.++.+.++ ...+.|..---.++..++.+.+.++++.++++..|+.++.- . .++ T Consensus 73 ~~~G~~l~~~~~~~~~~e~~ltID~~~y~~~~VddiD~~q~~~D~rs~~~~~~g~ALA~~~D~~i~~~l~~~a~~~~~~~ 152 (347) T protein:vir:94 73 LQPGENLDDKRKDMKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTAQLGESLAMAADGAVLAEMAKLCNLPTANN 152 (347) T ss_pred eecCcCCCCCcCCccccceEEEEcchhhhhhhhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc Confidence 777777644 356677777766654 22223322212234567889999999999999999877521 1 111 Q ss_pred cccccccccccccc-----cc---ccccchhHHHHHHHHHHHhhhccccCccEEEecHHHHHHHHHHhhcc-Ccc----c Q lcl|NC_016164. 689 SQPEGLKFVTGINT-----EN---FGATNPTYVELVSMESKVAADNADIGAMSYLTNSTLYGGFKTTEKAT-STA----Q 755 (836) Q Consensus 689 ~~p~Gi~~~~~~~~-----~t---~aa~~~t~~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~~-g~~----~ 755 (836) ..+.|......+.. .+ ...+...++.|.++...|..++....+-.++++|..|..|....+.+ +.+ . T Consensus 153 ~~~~g~~~~~~v~i~~~~~~~~~~~~~~~~~~d~i~~a~~~Lde~dVP~~~R~~vv~P~~y~~LLk~~~~~~~~~~~~~~ 232 (347) T protein:vir:94 153 ENIAGLGKAHVLEVGDQATLQGDQVKLGQAIIAQLTLARAKLTGNYVPSSDRVFYTTPDNYSAILAALMPNAANYQALID 232 (347) T ss_pred cccccCCcceeEeeeccccccccccccHHHHHHHHHHHHHHhhhcCCCCCCCEEEeChHHHHHHHHhhcccccccccccc Confidence 11222211111111 11 11122347788999999888887766778888999998776533322 222 1 Q ss_pred cccCCCCeecceeeEeeCccccce-------------------------EEEEehhce--EEEe--------ecceEEEE Q lcl|NC_016164. 756 FVLEPGGTVNGYNVVRSNQVANGD-------------------------VFFGVWNQM--IMGM--------WGALDIQV 800 (836) Q Consensus 756 ~~~~~~~~l~G~pVv~s~~~~~~~-------------------------i~~gD~s~~--~i~~--------~~~l~i~~ 800 (836) +.++.-+++.|++|+.|+++|... -|=+||+.- .++. -.++.++. T Consensus 233 ~~~G~V~~v~G~~V~~Sn~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~d~~~~~~l~~~~~A~~tv~~~~~~~e~ 312 (347) T protein:vir:94 233 PSTGSIRNVMGFEVIEVPHLTAGGAGDNRAEEGVAPTNQKHAFPDTASGDTRVALDNVVGLFNHRSAVGTVKLKDMALER 312 (347) T ss_pred cccceeEEeeceEEEEcCccccccCcccccccccccccccccccccccccccccccceEEEEechhhhhhhhhcccceee Confidence 233344578999999999998531 022333321 1111 12222222 Q ss_pred ecccccccCcEEEEEEEEeccEEEcccceE--EEeec Q lcl|NC_016164. 801 NPYALDKSGSVRVTALQDVDVAVRHPEAFC--RGNDN 835 (836) Q Consensus 801 ~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~--~l~~A 835 (836) .. .-.+-...+.+..-+|.+++||++.+ .++.| T Consensus 313 ~~--~~~~~~~~i~~~~a~G~g~~rPe~a~~i~~~~a 347 (347) T protein:vir:94 313 AR--RANFQADQIIAKYAMGHGGLRPEACGALVFKKA 347 (347) T ss_pred ee--chhhhhhhhhhhhhhcCcccccceeEEEEecCC Confidence 22 22333457788889999999998876 67788 No 137 >protein:vir:80213 Length: 334 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522884;genbank:gi:158345177;genbank:GeneID:5687476 Probab=99.22 E-value=1.3e-12 Score=85.73 Aligned_cols=284 Identities=14% Similarity=0.078 Sum_probs=161.4 Q ss_pred hhhhhhhhhhhhhhcccccccc-cccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEecCCceeeeeccCcccc Q lcl|NC_016164. 549 ILAPNDVLHRDLVVDTASAAGD-LVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQTGAATAYWVAEGGDPT 627 (836) Q Consensus 549 ~~~~~~~~~~a~~~~~~~~~g~-~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~ 627 (836) +..+... .-....-..+++. .+.=+.+.+++.......+.++++..++....+..+.+|+. +.+++....-|+++. T Consensus 1 m~~~~~~--~~t~~~~~~~~~~~~l~le~~~geV~~af~~~s~~~~~~~~r~i~~G~s~~~~~i-G~~~~~~~~~g~~l~ 77 (334) T protein:vir:80 1 MTYPAAN--THTRPGWGGANSDVSLHIEEHLGLVDASFMYSSKFASWMNVRSLRGTNQLRVDRV-GASTIAGRKAGEELV 77 (334) T ss_pred CCCCcCC--CccccccccccchheehhhhhhhHHHHHHHHhhhhhccceeeeccccceEEEeee-cceeeeeecCCCCCC Confidence 0000000 0000001111221 23338899999999988899998876655455678888866 666677777788887 Q ss_pred cccccceeEEeeeeee-eeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHh----hcCCcccc--------ccc Q lcl|NC_016164. 628 ESQPSVDQVALVAKTL-GAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALY----GLGSNSQP--------EGL 694 (836) Q Consensus 628 ~~~~~~~~it~~~~t~-~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~----G~Gt~~~p--------~Gi 694 (836) ...+..++.++.+.++ ...+.|..--=..+..++.+.+.++++.+++++.|+.++. +... ..| .|+ T Consensus 78 ~~~~~~~~~~l~ID~~l~~~~~VddiD~~q~~~D~rse~~~~~G~aLA~~~D~~~~~~l~kaa~~-~~~~~~~~~~~~G~ 156 (334) T protein:vir:80 78 VQKNVSDKLNLTVDTVLYARHFFDKFDEWTSNLDVRKETAREDGIALARQYDQACIIQLQKCGDF-LAPAHLKPAFHDGI 156 (334) T ss_pred CCCcccCceEEEEeeeeehhhhHhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh-cccccccccccCCc Confidence 7777777877777663 3333443322223456799999999999999999987652 2111 111 232 Q ss_pred ccccccccccc---cccchhHHHHHHHHHHHhhhcccc---CccEEEecHHHHHHHHHHhhccCc--------cccccCC Q lcl|NC_016164. 695 KFVTGINTENF---GATNPTYVELVSMESKVAADNADI---GAMSYLTNSTLYGGFKTTEKATST--------AQFVLEP 760 (836) Q Consensus 695 ~~~~~~~~~t~---aa~~~t~~~l~~a~~~l~~~~~~~---~~~~~vmnp~~~~~L~~lkd~~g~--------~~~~~~~ 760 (836) ......+..+. ......++.+..+...|..++... ..-+.+++|..|..|..-..-.++ ..+..+. T Consensus 157 ~~~~~~~g~~~~~~~~~~~l~~a~~~a~~~L~e~dvp~~~~~~R~~vv~P~~y~~Ll~~~r~~n~d~~~s~~~~~~~~g~ 236 (334) T protein:vir:80 157 LLPSTISGLAADAAADADVLVAAHRQGVEAMVFRDLGDQLMSEGVTLLDPVIFSFLLEHDRLMNVEFGAKEGGNSFVGGR 236 (334) T ss_pred ceeecccccccchhhhHHHHHHHHHHHHHHHHhcCCCCCcCCceEEEeChHHHHHHhcccccccceecccccccccccee Confidence 22211111111 111122345556666676666552 346899999999988654332211 1122233 Q ss_pred CCeecceeeEeeCccccce-----------EEEEehhceE-E-Eeecce------EEEEecccccccCcEEEEEEEEecc Q lcl|NC_016164. 761 GGTVNGYNVVRSNQVANGD-----------VFFGVWNQMI-M-GMWGAL------DIQVNPYALDKSGSVRVTALQDVDV 821 (836) Q Consensus 761 ~~~l~G~pVv~s~~~~~~~-----------i~~gD~s~~~-i-~~~~~l------~i~~~~~~~~~~~~~~~r~~~r~d~ 821 (836) -.+++|++|+.|+.+|... .+-|||+... + ..+..+ ++..+-+..-.+-...+.+.+-+|. T Consensus 237 i~~v~G~~V~~Sn~~P~~~~t~~~~g~~~~~~agd~t~~~~~~~~~~Al~t~~~~~~~~e~~~~~~~~~d~i~~~~a~G~ 316 (334) T protein:vir:80 237 IAMLNGVRVVETPRFPQSAITANALGADFNVTDAEVRRKMITFIPSMALISAQVHPVSAQFWEEKKDFGHYLDTFQSYNI 316 (334) T ss_pred EEEEeceEEEeecCCCCccccccccccccccccccccceEEEEEeCceEEEEEEeecceeeeechhhHHHHHHHHHHcCC Confidence 3578999999999999652 4566776432 1 122211 1111111111112224455567899 Q ss_pred EEEcccceEEEeecC Q lcl|NC_016164. 822 AVRHPEAFCRGNDNL 836 (836) Q Consensus 822 ~v~~p~Af~~l~~A~ 836 (836) +++||+++++++--+ T Consensus 317 g~lRPeaa~vv~~~~ 331 (334) T protein:vir:80 317 GQRRPDAVAVHDITV 331 (334) T ss_pred ceeccceEEEEEEee Confidence 999999999988777 No 138 >protein:vir:93858 Length: 400 # NCBI annotation: putative structural protein # Family: family:all:2417 # MgeID: mge:1479 # MgeName: 712 # Cross-refs: genbank:acc:YP_764266;genbank:gi:115315579;genbank:GeneID:5141552 Probab=99.20 E-value=8.2e-12 Score=81.37 Aligned_cols=381 Identities=12% Similarity=0.039 Sum_probs=177.9 Q ss_pred hhhhhhhhhhhhhhhh-hhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhHHHHHHHHhhhhhhHHHHhhhhhhh Q lcl|NC_016164. 420 AQAAADERSRVASITS-LCREHKADDLAQGLIESGASEADAMRSVLSEIAKRPAAQPATPAAPVRSAQPIAAGGGSADIG 498 (836) Q Consensus 420 ~~~~~~~~~~~~ei~a-l~~~~~l~e~a~eliee~~t~~e~~~~~l~~l~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~ 498 (836) .... ........+.+ +.+...+.+....+.... +..+.... ++.+.. .... ....... ... T Consensus 1 ~~~s-~~~~~k~~~~ek~~~~~~~~e~~~~lks~~-~g~~~~~~-~~~~~k----~~el--------~kT~Sel---~~e 62 (400) T protein:vir:93 1 MRIS-KRNMNKPDLIEKQNRLAELKENNVSLKSQI-SGFEVKNA-IEDLPK----VQEL--------EKTLSEN---SIE 62 (400) T ss_pred Cccc-ccccccchHHHHHHHHhhhhhhhhhhhhhh-hccchhhh-hhhchh----HHHH--------HHHHHHh---HHH Confidence 0000 00000000000 000011111000000000 00000000 000000 0000 0000000 000 Q ss_pred hhhhHHHHHHHhhhhhhhhhhhhhhhhhh--hhhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhccc-ccccccccch Q lcl|NC_016164. 499 LTDKEARSFSFVRAIRAQMMPGDRAAFEA--AAFEREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTA-SAAGDLVFTD 575 (836) Q Consensus 499 ~~~~~~~~~~~~~a~~a~~~~~~~~~~~~--~~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~-~~~g~~vvp~ 575 (836) ...++.... .+....+...++..+. .......++-+..-.|.. .+.....+...-.+. .+....++|. T Consensus 63 i~k~e~eln----~~~E~~Kgk~~mtefLkT~~A~~~fa~~l~~nsg~s-----d~knaW~A~l~E~gvt~td~n~iLP~ 133 (400) T protein:vir:93 63 IIKIENELN----AQEEKPKGKDKMTNFIESQNAVTEFFDVLKKNSGKS-----EIKNAWSAKLAENGVTITDTTFQLPR 133 (400) T ss_pred HHHHhhhhh----hhhhhcccchhHHHhhhhHHHHHHHHHHHHhhcCCc-----chhhhhhhhhhhcccccCCchhhcch Confidence 000000000 0000001111110000 001111111112222211 111111111111111 1222236677 Q ss_pred hhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEecCCceeee-eccCcccccccccceeEEeeeeeeeeeehhHHHHH Q lcl|NC_016164. 576 GRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQTGAATAYW-VAEGGDPTESQPSVDQVALVAKTLGAYTEFSRRLM 654 (836) Q Consensus 576 ~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~~~~~a~~-v~Eg~~~~~~~~~~~~it~~~~t~~~~i~ISrelL 654 (836) -+...|...++...++.++.. +...++ +-+........-+| ..-|.+++++++++..-++.|+-+.++..+..-.. T Consensus 134 ~il~aIq~al~~~~~~~~f~~--v~n~p~-l~V~~~~dt~~qa~gHk~G~~K~eq~~tl~~rtL~P~~VYk~~~la~~~~ 210 (400) T protein:vir:93 134 KLVESINTALLNTNPVFKVFH--VTNVGA-LLVSRSFDSANEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVK 210 (400) T ss_pred HHHHHHHHhhhccCCccccee--eecCCc-eeeecchhhhcccceeccCCcccceeeeeeeeccCHHHHHHHhhhhhhhh Confidence 777778888888888877532 222221 11111112222334 55678889999999999999998888877733222 Q ss_pred --hcchhHHHHHHHHHHHHHHHH-HHHHHHHhhcCCccc-----ccccccccccccccccccchhHHHHHHHHHHHhhhc Q lcl|NC_016164. 655 --LQSSIDVEQMVRTELATVIAL-EIDRAALYGLGSNSQ-----PEGLKFVTGINTENFGATNPTYVELVSMESKVAADN 726 (836) Q Consensus 655 --~ds~~~l~~~i~~~l~~a~a~-~~d~~il~G~Gt~~~-----p~Gi~~~~~~~~~t~aa~~~t~~~l~~a~~~l~~~~ 726 (836) .++...+.+||..+|..++-. ..+++++-|+|+|+- ..-|....+....+-.++...+.++..-+....... T Consensus 211 ~~~~tygaL~nYVm~EL~q~vI~k~Ve~Aii~GdG~Ngf~~~dk~t~Ik~I~~dt~kt~~a~~~~~qdl~E~~~d~~~~~ 290 (400) T protein:vir:93 211 RLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPT 290 (400) T ss_pred hccccHHHHHHHHHHHHHHHHHHHHhhhheeecccccccCCCcchhhhhhhhhhhhhhhhcCCccHHHHHHHHHhhhhhc Confidence 234456799999999999996 579999999998751 111222222222233334445666555433333332 Q ss_pred cccCccEEEecHHHHHHHHHHhhccCccccccCCCC----eeccee-eEeeCccccc-eEEEEehhceEEEeecceEEEE Q lcl|NC_016164. 727 ADIGAMSYLTNSTLYGGFKTTEKATSTAQFVLEPGG----TVNGYN-VVRSNQVANG-DVFFGVWNQMIMGMWGALDIQV 800 (836) Q Consensus 727 ~~~~~~~~vmnp~~~~~L~~lkd~~g~~~~~~~~~~----~l~G~p-Vv~s~~~~~~-~i~~gD~s~~~i~~~~~l~i~~ 800 (836) ......++++|..|+.|+.|++++|.+.|..+..+ +-+|+. .++....+.. ..++.|-..+ + ++..+.. T Consensus 291 -aad~~~Iv~s~d~~A~L~~lk~a~~~a~f~~~n~d~~IA~~fGv~~Lv~~Tr~~~~kp~V~VDek~~-i---~~~~~~t 365 (400) T protein:vir:93 291 -AGRRYLIVKAEDRKALLDELRQATANANVRIKNDDTEIASEVGVDEIIVYTGSKALKPTVLVDQKYH-I---DMQDLTK 365 (400) T ss_pred -cCCceeEEeccchHHHHHHhcCCcceeeeeeccccchhhhhcccceeeeeccCCCCCceeeeehhhh-c---cccCcee Confidence 23456899999999999999999999988554433 235553 4445555533 3444454433 2 2233333 Q ss_pred ecccccccCcEEEEEEEEeccEEEcccceEEEeec Q lcl|NC_016164. 801 NPYALDKSGSVRVTALQDVDVAVRHPEAFCRGNDN 835 (836) Q Consensus 801 ~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A 835 (836) .....+.+|+-.+.++..+++.+.-|.+-++++.| T Consensus 366 ~~sf~~~tNs~~ilvetlv~Gsi~~~N~~ay~~v~ 400 (400) T protein:vir:93 366 VDAFEWKTNSNMILVETLTSGHVETYNAGAVITVS 400 (400) T ss_pred ccceeeeeccceEEeeeeeccceecccceeeEeeC Confidence 44455677888889999999999999999999999 No 139 >protein:vir:3364 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523335;genbank:gi:17570826;genbank:GeneID:927448 Probab=99.19 E-value=2.3e-12 Score=84.41 Aligned_cols=287 Identities=11% Similarity=0.042 Sum_probs=153.6 Q ss_pred hhhhhhhhhhhhhhhhhhhhhcccccccc--cccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEecCCceeee Q lcl|NC_016164. 542 MGVTPRGILAPNDVLHRDLVVDTASAAGD--LVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQTGAATAYW 619 (836) Q Consensus 542 ~g~~~~g~~~~~~~~~~a~~~~~~~~~g~--~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~~~~~a~~ 619 (836) +.....+.. ...+ .+.....|. .+.-+.+.+++.......+.++++.....-..+..+.+++. +.+++.. T Consensus 1 ~~~~~~~~~----~~t~---~g~~~~~~~~~al~ie~~~g~V~~~f~~~s~~~~~v~~r~~~~G~sv~i~~i-G~~t~~~ 72 (347) T protein:vir:33 1 MANIQGGQQ----IGTN---QGKGQSAADKLALFLKVFGGEVLTAFARTSVTMPRHMLRSIASGKSAQFPVI-GRTKAAY 72 (347) T ss_pred CCCCccCcc----cccc---cccCCcccchHHHHHHHHHHHHHHHHHHHHhhhhhhccccccccceeEeeec-cceeeee Confidence 000000000 0000 000011111 12337888999888888888888866443334567777766 4555566 Q ss_pred eccCccccc--ccccceeEEeeeeeee-eeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhh-----cCCc--c Q lcl|NC_016164. 620 VAEGGDPTE--SQPSVDQVALVAKTLG-AYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYG-----LGSN--S 689 (836) Q Consensus 620 v~Eg~~~~~--~~~~~~~it~~~~t~~-~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G-----~Gt~--~ 689 (836) +..|..+.. ..++..+.++.+.++- ..+.|.+---.++..++.+.+.++++.++++..|+.++.- .... . T Consensus 73 ~~~g~~l~~~~~~~~~~e~~ltiD~~~y~~~~VddiD~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~~~~~ 152 (347) T protein:vir:33 73 LKPGENLDDKRKDIKHTEKVIHIDGLLTADVLIYDIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAGLVNLPDGSN 152 (347) T ss_pred ecCCCCCCCCCCCCccceEEEEechhhhhhHHHhhHHHHhcCCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccc Confidence 666776643 3355566556554332 1122222111224557888999999999999999987621 1110 0 Q ss_pred cccccccccccc----------cccccccchhHHHHHHHHHHHhhhccccCccEEEecHHHHHHHHHHhhcc-----Ccc Q lcl|NC_016164. 690 QPEGLKFVTGIN----------TENFGATNPTYVELVSMESKVAADNADIGAMSYLTNSTLYGGFKTTEKAT-----STA 754 (836) Q Consensus 690 ~p~Gi~~~~~~~----------~~t~aa~~~t~~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~~-----g~~ 754 (836) ...+.+...+.. ......+...++.|.++...|..++.....-+++++|..|..|....... +.. T Consensus 153 ~~~~~~~~~~~~~~~~~~tg~~~d~~~~a~~i~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~d~~~~~ 232 (347) T protein:vir:33 153 ENIEGLGKPTVLTLVKPTTGSLTDPVELGKAIIAQLTIARASLTKNYVPAADRTFYTTPDNYSAILAALMPNAANYQALL 232 (347) T ss_pred cccccccccccccccccccccccchhhhHHHHHHHHHHHHHHHhhcCCCccCcEEEeCHHHHHHHhcccccccccccccc Confidence 001111111110 00011112346778888888988887666778999999998886543322 222 Q ss_pred ccccCCCCeecceeeEeeCccccceE-------EE---------------Eehhce--EEEe------ecceEEEEeccc Q lcl|NC_016164. 755 QFVLEPGGTVNGYNVVRSNQVANGDV-------FF---------------GVWNQM--IMGM------WGALDIQVNPYA 804 (836) Q Consensus 755 ~~~~~~~~~l~G~pVv~s~~~~~~~i-------~~---------------gD~s~~--~i~~------~~~l~i~~~~~~ 804 (836) .+.++.-++++|++|+.|+.+|.+.+ +. ++|+.. .++. ...+.+.++.+. T Consensus 233 ~~~~G~V~~i~G~~V~~Sn~lp~~~~~~~~~~~~ag~~~~~~~~~~~~~~~a~~~~~gl~~h~~A~g~v~~~~~~~e~~r 312 (347) T protein:vir:33 233 DPERGTIRNVMGFEVVEVPHLTAGGAGDTREDAPADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKDLALERAR 312 (347) T ss_pred ccccceeEEEeceeEEEecccccCccccccccccccccccccCCcccceeccccceeeeeecchhheeeeeeceeeeecc Confidence 34444456899999999999986422 11 122111 1111 111112222222 Q ss_pred ccccCcEEEEEEEEeccEEEcccceEEEee-cC Q lcl|NC_016164. 805 LDKSGSVRVTALQDVDVAVRHPEAFCRGND-NL 836 (836) Q Consensus 805 ~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~-A~ 836 (836) .-.+-...+++.+.+|.+++||++.+.++- -| T Consensus 313 ~~~~~~d~i~~~~~~G~~vlrP~~av~i~~~~~ 345 (347) T protein:vir:33 313 RANYQADQIIAKYAMGHGGLRPEAAGAIVLPKV 345 (347) T ss_pred chhhhhHhhhhhhhcCCceecccceEEEecCCC Confidence 222333567777888999999999877731 11 No 140 >protein:vir:100057 Length: 375 # NCBI annotation: T7-like capsid protein # Family: family:all:975 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214206;genbank:gi:61806429;genbank:GeneID:3294737 Probab=99.18 E-value=2.1e-11 Score=79.18 Aligned_cols=291 Identities=9% Similarity=-0.003 Sum_probs=158.0 Q ss_pred HHHHHHHHhhhhhhhhhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEecC Q lcl|NC_016164. 534 VSEATAQRMGVTPRGILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQTG 613 (836) Q Consensus 534 ~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~~ 613 (836) ++..-..++|....+. +.... ..+.--.+.-+.+.+++.......+.++.+...+.-..+..+.+++. + T Consensus 1 ~~~~~~~~~~~~n~~t--------~~~~~--~~~~~~al~le~f~geV~~~f~~~si~~~~~~~rti~~Gksv~f~~i-G 69 (375) T protein:vir:10 1 MANANQVALGRSNLST--------GTGYG--GATDKYALYLKLFSGEMFKGFQHETIARDLVTKRTLKNGKSLQFIYT-G 69 (375) T ss_pred CccccccccCccccCC--------ccccc--cccchHHHHHHHHhHHHHHHHHHHHhhhccccccccccCceEEEEee-e Confidence 1111111111100000 00000 00011123447788889888888888888866544445667888877 5 Q ss_pred CceeeeeccCccccc---cccccee--EEeeeeeeeeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHh----h Q lcl|NC_016164. 614 AATAYWVAEGGDPTE---SQPSVDQ--VALVAKTLGAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALY----G 684 (836) Q Consensus 614 ~~~a~~v~Eg~~~~~---~~~~~~~--it~~~~t~~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~----G 684 (836) ..++....-|.++.. .+....+ +++.-.+|... .|..---.++..++.+.+.++++.++++..|+.++. + T Consensus 70 ~~t~~~~t~G~~i~~~~~~d~~~te~~l~ID~~~y~~~-~VdDiD~aqa~~Dlr~e~s~~~G~aLA~~~D~~i~~~l~ka 148 (375) T protein:vir:10 70 RMTSSFHTPGTPILGNADKAPPVAEKTIVMDDLLISSA-FVYDLDETLAHYELRGEISKKIGYALAEKYDRLIFRSITRG 148 (375) T ss_pred eeEEeeecCCcCcCCccccCCCCCceEEEecchhhhhh-hHhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 555655555555421 2333333 44443444432 222211123456789999999999999999987752 2 Q ss_pred cCCc----ccc---ccccc-c--cccccccccccchhHHHHHHHHHHHhhhccccCccEEEecHHHHHHHHHHhhcc--- Q lcl|NC_016164. 685 LGSN----SQP---EGLKF-V--TGINTENFGATNPTYVELVSMESKVAADNADIGAMSYLTNSTLYGGFKTTEKAT--- 751 (836) Q Consensus 685 ~Gt~----~~p---~Gi~~-~--~~~~~~t~aa~~~t~~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~~--- 751 (836) ..+. ..+ .|... . ++........+...++.|.++...|..++.....-+++++|..|..|...++.+ T Consensus 149 a~~~~p~~~~~~~~~Gg~~i~~~sg~~~~~~~ta~~~~~ai~~a~~~Lde~~VP~~~R~~vv~P~~y~~Ll~~~d~~~~~ 228 (375) T protein:vir:10 149 ARSASPVSATNFVEPGGTQIRVGSGTNESDAFTASALVNAFYDAAAAMDEKGVSSQGRCAVLNPRQYYALIQDIGSNGLV 228 (375) T ss_pred hhhccccccccccccCcceeeeccccccccccCHHHHHHHHHHHHHHHhhcCCCCCCCEEEeChHHHHHHHhcCCcccee Confidence 1111 000 11111 1 111111112233457889999999988888777778999999998886654432 Q ss_pred -----CccccccCCCCeecceeeEeeCccccceE-------------------------------------EEEeh---h Q lcl|NC_016164. 752 -----STAQFVLEPGGTVNGYNVVRSNQVANGDV-------------------------------------FFGVW---N 786 (836) Q Consensus 752 -----g~~~~~~~~~~~l~G~pVv~s~~~~~~~i-------------------------------------~~gD~---s 786 (836) +......+.-.++.|++|+.|+.+|...+ |-+|| + T Consensus 229 n~d~~~~~~~~~g~v~~i~Gv~V~~Sn~lP~~~~~~~~~g~~~~~~a~~~~~~~~~~~~~~~~~~~g~~~~y~~d~~~~~ 308 (375) T protein:vir:10 229 NRDVQGSALQSGNGVIEIAGIHIYKSMNIPFLGKYGVKYGGTTGETSPGNLGSHIGPTPENANATGGVNNDYGTNAELGA 308 (375) T ss_pred eecccccceeccceEEEEeceEEEEeccccccccccccccccccccchhhhhccccccCCcceeeccccccccccccccC Confidence 12222222224789999999999984321 22233 1 Q ss_pred c-e-EE--------EeecceEEEEec-ccccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 787 Q-M-IM--------GMWGALDIQVNP-YALDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 787 ~-~-~i--------~~~~~l~i~~~~-~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) . + .+ ..-.++.+++.. +..-.+....|.+.+=+|.++.||++.+.++..- T Consensus 309 ~~~~~~~~~~A~g~v~~~~~~~~~~~~~~~~~~q~~~i~~~~a~G~~~lrp~~av~l~~~~ 369 (375) T protein:vir:10 309 KSCGLIFQKEAAGVVEAIGPQVQVTNGDVSVIYQGDVILGRMAMGADYLNPAAAVELYIGA 369 (375) T ss_pred ceEEEEEchhheeeeeeeccccccccchhhheeeeeeeeeeeeeccCccCceeEEEEecCc Confidence 1 0 11 111233333321 1123445667888999999999999999988665 No 141 >protein:vir:10450 Length: 344 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848297;genbank:gi:30387487;genbank:GeneID:1733971 Probab=99.17 E-value=5.8e-12 Score=82.17 Aligned_cols=287 Identities=13% Similarity=0.055 Sum_probs=156.7 Q ss_pred hhhhhhhhhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEecCCceeeeec Q lcl|NC_016164. 542 MGVTPRGILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQTGAATAYWVA 621 (836) Q Consensus 542 ~g~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~~~~~a~~v~ 621 (836) +.....+.. ....+........+ --.+.-+.+.+++.......+.++++...+.-..+..+.+|+. +..++.... T Consensus 1 ma~~~~~~~---~n~~~~~~~~~~~~-~~al~ie~~~geV~~~f~~~s~~~~~~~~r~i~~g~s~~~~~i-G~~~~~~~~ 75 (344) T protein:vir:10 1 MANMTGGQQ---LGTNQGKDVMAAGD-KLALFLKVFGGEVLTAFARTSVTTSRHMVRSISSGKSAQFPVL-GRTQAAYLA 75 (344) T ss_pred Ccccccccc---CCcccCCccCCccc-hhHHHHHHHHHHHHHHHHHHhhhcccceeeeecccceEEEEee-ceeEEEeee Confidence 000000000 00000000000000 1112337888999988888899998866554445677888877 556677777 Q ss_pred cCcccccc--cccceeEEeeeee--eeeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhc----C----Ccc Q lcl|NC_016164. 622 EGGDPTES--QPSVDQVALVAKT--LGAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGL----G----SNS 689 (836) Q Consensus 622 Eg~~~~~~--~~~~~~it~~~~t--~~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~----G----t~~ 689 (836) .|++...+ .+.-++.++.+.+ |... .|..---.++..++.+.+.++++.++++..|+.++.-. . .+. T Consensus 76 ~G~~l~~t~~~~~~~e~~l~ID~~~y~~~-~VdDiD~~q~~~D~r~~~~~~~G~aLA~~~D~~i~~~la~~a~~~~~~~~ 154 (344) T protein:vir:10 76 PGENLDDIRKDIKHTEKVITIDGLLTADV-LIYDIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAGLCNVESQYNE 154 (344) T ss_pred cCCCCCCCCCCcccceEEEEEcchhhhhh-hhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccc Confidence 78777543 4566666666655 3332 22221112245678999999999999999998775321 1 122 Q ss_pred ccccccccccc----ccccc----cccchhHHHHHHHHHHHhhhccccCccEEEecHHHHHHHHHHhhccC-----cccc Q lcl|NC_016164. 690 QPEGLKFVTGI----NTENF----GATNPTYVELVSMESKVAADNADIGAMSYLTNSTLYGGFKTTEKATS-----TAQF 756 (836) Q Consensus 690 ~p~Gi~~~~~~----~~~t~----aa~~~t~~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~~g-----~~~~ 756 (836) .|.|+-+...+ ..... ..+...++.|.++...|..++.....-+++++|..|..|..-+..+. ...+ T Consensus 155 ~~~g~~~~~~~~~~~~~~~~t~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~~~~~~~~~ 234 (344) T protein:vir:10 155 NITGLGTATVIETTQDKTTLTDQVALGKEIIAALTKARAALTKNYVPSSDRVFYCDPDSYSAILAALMPNAANYAALIDP 234 (344) T ss_pred ccccccccceeecccccccccchhhhHHHHHHHHHHHHHHHhhcCCCccCCEEEeChHHHHHHhhcccccccccccccce Confidence 23322111101 11110 11122366788888888888877667788899999998865432221 1122 Q ss_pred ccCCCCeecceeeEeeCccccce------E---------------EEEehhceE----------EEeecceEEEEecccc Q lcl|NC_016164. 757 VLEPGGTVNGYNVVRSNQVANGD------V---------------FFGVWNQMI----------MGMWGALDIQVNPYAL 805 (836) Q Consensus 757 ~~~~~~~l~G~pVv~s~~~~~~~------i---------------~~gD~s~~~----------i~~~~~l~i~~~~~~~ 805 (836) ..+.-++++|++|+.|+.+|.+. . +.++++... .+...++.++.... T Consensus 235 ~~G~V~~v~G~~V~~Sn~lp~~~~~~~~~~~tg~~~~~~~~~~~~~~~~~s~~~~l~~h~~A~~~v~~~~~~~e~~r~-- 312 (344) T protein:vir:10 235 EKGSIRNVMGFEVVEVPHLTAGGAGTSREGTTGQKHAFPATKSGNDKVAKDNVIGLFMHRSAVGTVKLRDLALERARR-- 312 (344) T ss_pred eeeEEEEEeceEEEeccccccccCCcccccccCccccccCCcccceeeecceeEEEeechhhhhhhhhccceeecccc-- Confidence 23333568999999999998531 1 112333210 11111222222211 Q ss_pred cccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 806 DKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 806 ~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) -.+-...+++.+-+|.+++||++.+.++-+- T Consensus 313 ~~~~~d~i~g~~~~G~~vlRPe~a~~v~~~~ 343 (344) T protein:vir:10 313 ANFQADQIIAKYAMGHGGLRPEAAGAVVFKT 343 (344) T ss_pred hhHHHHHHHHHhhcccceecccceEEEEeec Confidence 1222246778888999999999886655444 No 142 >protein:vir:3136 Length: 322 # NCBI annotation: hypothetical protein # Family: family:all:11728 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640318;genbank:gi:21234405;genbank:GeneID:956058 Probab=99.15 E-value=7.9e-12 Score=81.45 Aligned_cols=270 Identities=13% Similarity=0.108 Sum_probs=163.6 Q ss_pred hhhccc-ccccccccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEecCCceeeeeccCcccccccccceeEEe Q lcl|NC_016164. 560 LVVDTA-SAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQTGAATAYWVAEGGDPTESQPSVDQVAL 638 (836) Q Consensus 560 ~~~~~~-~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~it~ 638 (836) ..+... +..-.+++|+.++..++..+........+..+...+.+..+.++.. +.+...-..+++.+....++..++++ T Consensus 1 ~~~~n~ts~~qafi~~EiWsa~il~~l~~~Lv~~~~~~~~d~g~GDtV~InsI-g~~tV~dY~~~~~i~~d~ltt~~~~l 79 (322) T protein:vir:31 1 MSTGNNTSNTQALIVSEIWADEIEDILHEKLLDVNIARVVDFPDGDKLTIPSV-GTPVVRSRPEQGDFTFDNLDTGEISI 79 (322) T ss_pred CCCCCCcccceEEeehhhhHHHHHHHhhhhhhhhhhhcccccCCCCeEEeccc-cccccccccCCCCcccccCCCceEEE Confidence 122222 2233446699999999988888777666544444456667888766 55666667777777666666666665 Q ss_pred eeee--eeeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhh--cCC-----cccccccccccccccccccccc Q lcl|NC_016164. 639 VAKT--LGAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYG--LGS-----NSQPEGLKFVTGINTENFGATN 709 (836) Q Consensus 639 ~~~t--~~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G--~Gt-----~~~p~Gi~~~~~~~~~t~aa~~ 709 (836) .+.+ |.++ .|+.+. .+...++.+...++.+++++...|..+..- +|. .+.|.-+-...+....+..... T Consensus 80 ~IDq~KYfaf-~VdDD~-~Qa~~dl~~~~~~~aa~ala~~~D~fva~lL~~gA~~~~~~~~p~vin~~~~~iv~~gt~~~ 157 (322) T protein:vir:31 80 ILRDEVYAGN-AISKKL-RQDSRWISNVGAMLPAEQARAIMERYQTDLLALGNAQFAGQNDPNVINGVPHRFVGTGTDQT 157 (322) T ss_pred EEehhhhhcc-ccchhH-HHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccCCcceecCCccceeccCCCch Confidence 5544 5543 477744 567889999999999999999888765321 111 0112111111122222333344 Q ss_pred hhHHHHHHHHHHHhhhccccCccEEEecHHHHHHHHHHh-----hccCccccc--cCC------CCeecceeeEeeCccc Q lcl|NC_016164. 710 PTYVELVSMESKVAADNADIGAMSYLTNSTLYGGFKTTE-----KATSTAQFV--LEP------GGTVNGYNVVRSNQVA 776 (836) Q Consensus 710 ~t~~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lk-----d~~g~~~~~--~~~------~~~l~G~pVv~s~~~~ 776 (836) ..|+.|+++..+|..++.....-.++++|..+..|..+. -.+++-.-+ .+. -++++|..|++|+.++ T Consensus 158 ~ay~~lv~l~~kLdkanVP~~gR~vVV~P~~~~~L~~i~~~~~l~~D~rf~~i~~sG~a~g~~~Vg~~~GF~V~~SN~l~ 237 (322) T protein:vir:31 158 MDVTDFSRVNYVMTQSKMPMGGMIGIIDPSVAHHLETITNISNISNNPRWEGIVESGIAPDMQFVRSVYGIDLFVSNLLA 237 (322) T ss_pred hhHHHHHHHHHHhccccCCCCCeEEEeCchhhhhhhhhhhhhhhhccccccccccccchhhHHHHHHHhceeeeeecccc Confidence 578999999999999988877778889999887664321 112221111 111 3678999999999987 Q ss_pred cce--EEEEeh---------hceEE----------Eeecce---EEEEecccccccCcEEEEEEEEeccEEEcccceEEE Q lcl|NC_016164. 777 NGD--VFFGVW---------NQMIM----------GMWGAL---DIQVNPYALDKSGSVRVTALQDVDVAVRHPEAFCRG 832 (836) Q Consensus 777 ~~~--i~~gD~---------s~~~i----------~~~~~l---~i~~~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l 832 (836) .+. ++.|.- +.+.. ..|..| +-..++ ....-.+|..+|+|.++.+|+.++.+ T Consensus 238 ~~~~~i~aG~d~~~t~ag~~n~f~~~~~~~~~~~~~~~~~l~~~e~~r~~----~~~~d~~~~~~~~g~g~~r~e~l~~~ 313 (322) T protein:vir:31 238 DANETINAGGDARSTTAGKCNMFMNVSDMGLLPFVVAWKEMPTTKSFIDD----YNDDLNTATTARWGNGLVRDENLVCV 313 (322) T ss_pred ccccccccCcccccccceeecccccccchhhhhhhhHhhhhhhhhcccCc----cccccceeeeeeecceeecccceEEE Confidence 442 333321 11111 111112 111111 22335688999999999999999887 Q ss_pred eecC Q lcl|NC_016164. 833 NDNL 836 (836) Q Consensus 833 ~~A~ 836 (836) .... T Consensus 314 ~a~~ 317 (322) T protein:vir:31 314 LANA 317 (322) T ss_pred Eecc Confidence 6555 No 143 >protein:vir:5974 Length: 324 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690674;genbank:geneid:6329212;genbank:gi:22855068;goa:Q38582;uniprot:Q38582;genbank:GeneID:955303 Probab=99.14 E-value=2.9e-11 Score=78.36 Aligned_cols=260 Identities=10% Similarity=-0.001 Sum_probs=159.2 Q ss_pred hhcccccccccccchhhHHHHHHHHHhhhhhhhhcce--------eee--cCCceEEEEEecCC-ceeeeeccCcccccc Q lcl|NC_016164. 561 VVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVT--------MLT--GLQGPVAIPRQTGA-ATAYWVAEGGDPTES 629 (836) Q Consensus 561 ~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~--------~~~--~~~~~~~~p~~~~~-~~a~~v~Eg~~~~~~ 629 (836) ++ .+....+++|+++...+...+.....|.+-+.. .+. ..+..+++|....- ..+..+.|+..++.. T Consensus 1 MA--~T~lsd~i~peVf~~yv~~~~~~~~~l~qSg~i~~~a~i~~~l~~~~~G~~i~~P~~~~l~Gd~~~v~~~~~i~~~ 78 (324) T protein:vir:59 1 MA--YTKISDVIVPELFNPYVINTTTQLSAFFQSGIAATDDELNALAKKAGGGSTLNMPYWNDLDGDSQVLNDTDDLVPQ 78 (324) T ss_pred CC--ceeeeceechhHHHHHHHhhhHHHHHHhhcccccccHHHHHHhhccCCCCEEEecccccCCCcccccCCCcccchh Confidence 22 223467789999988887776666666442210 111 12346788877653 567788999999988 Q ss_pred cccceeEEeeeeeeeeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCccccccccccc----ccccccc Q lcl|NC_016164. 630 QPSVDQVALVAKTLGAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVT----GINTENF 705 (836) Q Consensus 630 ~~~~~~it~~~~t~~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~----~~~~~t~ 705 (836) +++-++.....+..++.+.++.....-+.-+....+.++++...++..+..+|.-+ +|++... +...+++ T Consensus 79 ~l~t~~~~a~i~~~~k~~~~tD~a~~~sg~dp~~~i~~q~a~~~~~~~~~~lia~l------~g~~~~~~~~~~~~dvsa 152 (324) T protein:vir:59 79 KINAGQDKAVLILRGNAWSSHDLAATLSGSDPMQAIGSRVAAYWAREMQKIVFAEL------AGVFSNDDMKDNKLDISG 152 (324) T ss_pred hcccceeeEEEEeecCceeehhhhhhhccchHHHHHHHHHHHHHHHHHHHHHHHHH------HHhhhccccccceeeeec Confidence 98888888888888888888886554455567788999999999999988776532 1221111 1111122 Q ss_pred -cccchhHHHHHHHHHHHhhhccccCccEEEecHHHHHHHHHHhhccC-ccccccCCCCeecceeeEeeCccccc----- Q lcl|NC_016164. 706 -GATNPTYVELVSMESKVAADNADIGAMSYLTNSTLYGGFKTTEKATS-TAQFVLEPGGTVNGYNVVRSNQVANG----- 778 (836) Q Consensus 706 -aa~~~t~~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~~g-~~~~~~~~~~~l~G~pVv~s~~~~~~----- 778 (836) +...++++.|.++..+|-.... .-.+|+||+.++..|+...--+- ++.--...-++++|++|++++.+|.. T Consensus 153 ~~~~~~s~~~l~~A~~~~GD~~~--~~~~ivmhS~v~~~L~~~~li~~~~~s~~~~~i~~~~G~~VivdD~~p~~~~~~~ 230 (324) T protein:vir:59 153 TADGIYSAETFVDASYKLGDHES--LLTAIGMHSATMASAVKQDLIEFVKDSQSGIRFPTYMNKRVIVDDSMPVETLEDG 230 (324) T ss_pred cccceecHHHHHHHHHHhCCccc--CcEEEEEchHHHHHHHHhhhhhhccccccCceeeeecccEEEEeCCCCccccCCC Confidence 2234688899999998765432 45689999999999886532111 00000112356899999999999842 Q ss_pred -----eEEEEehhceEEEe-ecceEEEEecccccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 779 -----DVFFGVWNQMIMGM-WGALDIQVNPYALDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 779 -----~i~~gD~s~~~i~~-~~~l~i~~~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) .++|+. ..+.+.. ...+.++.++ +...+...+....++. ++|..+..-+.++ T Consensus 231 ~~~y~s~l~~~-GAi~~~~~~~~v~vE~dR--d~~~g~~~l~~r~~~~---~~p~G~s~~~~~~ 288 (324) T protein:vir:59 231 TKVFTSYLFGA-GALGYAEGQPEVPTETAR--NALGSQDILINRKHFV---LHPRGVKFTENAM 288 (324) T ss_pred CceEEEEEEec-CeEEEeecCCCcceeccc--CccccceEEEEeeEEE---eEeeeEEeccccc Confidence 133332 2232322 1223333332 2345666777777754 5555555544444 No 144 >protein:vir:94711 Length: 347 # NCBI annotation: capsid # Family: family:all:975 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338120;genbank:gi:77118198;genbank:GeneID:3707734 Probab=99.13 E-value=4.5e-12 Score=82.80 Aligned_cols=286 Identities=12% Similarity=0.089 Sum_probs=151.6 Q ss_pred HHHHHHHHhhhhhhhhhhhhhhhhhhhhhcccccccc--cccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEe Q lcl|NC_016164. 534 VSEATAQRMGVTPRGILAPNDVLHRDLVVDTASAAGD--LVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQ 611 (836) Q Consensus 534 ~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~--~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~ 611 (836) ++. ..+ .......+.....+. .+.-+.+.+++.......+.++++.....-..+..+.+|+. T Consensus 1 m~~---------~~~-------~~~~t~~g~~~~~~d~~al~ik~f~~eV~~~f~~~s~~~~~~~~r~i~~G~sv~i~~i 64 (347) T protein:vir:94 1 MAN---------VPG-------QKIGTDQGKGKSSSDALALFLKVFAGEVLTAFTRRSVTADKHIVRTIQNGKSAQFPVM 64 (347) T ss_pred CCC---------CCc-------cccccccccCCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccccceEEEecc Confidence 000 000 000000011111111 12235677778777777777787755443334567778777 Q ss_pred cCCceeeeeccCcccccc--cccceeEEeeeeeee-eeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhc--- Q lcl|NC_016164. 612 TGAATAYWVAEGGDPTES--QPSVDQVALVAKTLG-AYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGL--- 685 (836) Q Consensus 612 ~~~~~a~~v~Eg~~~~~~--~~~~~~it~~~~t~~-~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~--- 685 (836) +..++..+..|+..... ..+-.+.++.+.++- ..+.|..--=.+...++.+.+.++++.++++..|+.++.-. T Consensus 65 -G~~tv~~~t~G~~l~~~~~~~~~~e~~itID~~~~~~~~VddiD~~q~~~D~~~~~~~~~g~aLa~~~D~~i~~~~~~~ 143 (347) T protein:vir:94 65 -GRTSGVYLAPGERLSDKRKGIKHTEKVITIDGLLTADVMIFDIEDAMNHYDVAGEYSNQLGEALAIAADGAVLAEMAIL 143 (347) T ss_pred -cceeeeeecCCCCcCCCCCCCCcceEEEEecchhhhhHHhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 56666767777766432 345555555554442 11222221111234568889999999999999999775311 Q ss_pred ----CC-ccccccccccccccccccc-------ccchhHHHHHHHHHHHhhhccccCccEEEecHHHHHHHHHHhhccCc Q lcl|NC_016164. 686 ----GS-NSQPEGLKFVTGINTENFG-------ATNPTYVELVSMESKVAADNADIGAMSYLTNSTLYGGFKTTEKATST 753 (836) Q Consensus 686 ----Gt-~~~p~Gi~~~~~~~~~t~a-------a~~~t~~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~~g~ 753 (836) +. ...+.|+-....+...+.+ .....++.|.++...|..++.....-+++++|..|..|...+..+.. T Consensus 144 aa~~~~~~~~~~g~~~~s~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~~R~~vv~P~~~~~Ll~~~~~~~~ 223 (347) T protein:vir:94 144 CNLPAASNENIAGLGTASVLEVGKKADLDTPAKLGEAIIGQLTIARAKLTSNYVPAGDRYFYTTPDNYSAILAALMPNAA 223 (347) T ss_pred hccccccccccCCCcccceeeccccccccchhhhHHHHHHHHHHHHHHHhhcCCCCCCcEEEeCHHHHHHHhccchhhhh Confidence 11 1112232111111100000 01123566777888888887766677899999999887544332221 Q ss_pred -----cccccCCCCeecceeeEeeCccccce-----------E---------------EEEehhce--EEEeec------ Q lcl|NC_016164. 754 -----AQFVLEPGGTVNGYNVVRSNQVANGD-----------V---------------FFGVWNQM--IMGMWG------ 794 (836) Q Consensus 754 -----~~~~~~~~~~l~G~pVv~s~~~~~~~-----------i---------------~~gD~s~~--~i~~~~------ 794 (836) ..+..+.-++++|++|+.|+.+|.+. + +-+||+.. .++.+. T Consensus 224 ~~~~~~~~~~G~Vg~i~G~~V~~Sn~lp~~~~t~~~~~~~~~~~aG~~~~~~~~~~~~~~~~~~~~~~l~~h~~A~~~v~ 303 (347) T protein:vir:94 224 NYAALIDPETGNIRNVMGFVVVEVPHLVQGGAGETRGDDGITIASGQKHAFPATASSDVKVTMDNVVGLFSHRSAVGTVK 303 (347) T ss_pred hccccccccccceEEEeceEEEecCcccccccccccccCcceecCcccccccccchhhhcccccceeEEEeehhhhhhhh Confidence 12223334689999999999998421 1 11222221 111111 Q ss_pred ceEEEEecccccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 795 ALDIQVNPYALDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 795 ~l~i~~~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) -+.+....+..-.+-...+++.+-+|.+++||++.+.++..- T Consensus 304 ~~~~~~e~~r~~~~~~d~i~~~~~~G~~~~rP~~a~~~~~~~ 345 (347) T protein:vir:94 304 LRDLALERDRDVDAQGDLIVGKYAMGHGGLRPEAAGALVFSP 345 (347) T ss_pred cccccccchhchhhHHHHhhhhhhhcCcccccceeEEEEecC Confidence 111111111111222347888999999999999998888767 No 145 >protein:vir:1541 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052109;swissprot:trembl:q9t107;genbank:gi:9634035;uniprot:Q9T107;genbank:GeneID:1262383 Probab=99.11 E-value=1.2e-11 Score=80.38 Aligned_cols=285 Identities=11% Similarity=0.018 Sum_probs=150.2 Q ss_pred hhhhhhhhhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEecCCceeeeec Q lcl|NC_016164. 542 MGVTPRGILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQTGAATAYWVA 621 (836) Q Consensus 542 ~g~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~~~~~a~~v~ 621 (836) +.....+... ..+. ..+...+.-..+.-+.+.+++.......+.++.+.....-..+..+.+|+.. ..++.... T Consensus 1 ma~~~~~~~~----~t~~-~~~~~~~~~~a~~ie~f~g~V~~~f~~~s~~~~~~~~~~~~~G~sv~i~~ig-~~t~~~~~ 74 (347) T protein:vir:15 1 MANIQGGQQI----GTNQ-GKGQSAADKLALFLKVFGGEVLTAFARTSVTMPRHMLRSIASGKSAQFPVIG-RTKAAYLK 74 (347) T ss_pred CCccccCCcc----cccc-ccCCCcchHHHHHHHHHHHHHHHHHHHhhhhhhccccccccccceeEeeecc-ceeeeeec Confidence 0000000000 0000 0000000001133367788888888888888887654433445677887764 46666677 Q ss_pred cCccccc--ccccceeEEeeeeeeee-eehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhc--CCc-----ccc Q lcl|NC_016164. 622 EGGDPTE--SQPSVDQVALVAKTLGA-YTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGL--GSN-----SQP 691 (836) Q Consensus 622 Eg~~~~~--~~~~~~~it~~~~t~~~-~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~--Gt~-----~~p 691 (836) .|.+++. ..++..+.++.+.++-. .+.|.+---.++..++.+.+.++++.++++..|+.++.-. +.+ ..+ T Consensus 75 ~g~~l~~~~~~~~~~e~~ltID~~~~~~~~VddlD~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~~~~~~~ 154 (347) T protein:vir:15 75 PGENLDDKRKDIKHTEKVIHIDGLLTADVLIYDIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAGLVNLPDASNEN 154 (347) T ss_pred cCCCCCCCCCCCccceEEEEechhhhhhHHhhhHHHHhcCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc Confidence 7776644 33556666665544321 2223221122345578899999999999999999876321 100 011 Q ss_pred cccccccccc-cccccc---------cchhHHHHHHHHHHHhhhccccCccEEEecHHHHHHHHHHhhccC-----cccc Q lcl|NC_016164. 692 EGLKFVTGIN-TENFGA---------TNPTYVELVSMESKVAADNADIGAMSYLTNSTLYGGFKTTEKATS-----TAQF 756 (836) Q Consensus 692 ~Gi~~~~~~~-~~t~aa---------~~~t~~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~~g-----~~~~ 756 (836) .+.+....+. .....+ ...-++.+.++...|..++.....-.++++|..|..|....+... ...+ T Consensus 155 ~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~d~~~~a~~~Lde~~VP~~gR~~vv~P~~y~~LL~~~~~~~~d~~~~~~~ 234 (347) T protein:vir:15 155 IEGLGKPTVLTLVKPTTGDLTDPVELGKAIIAQLTIARASLTKNYVPAADRTFYTTPDNYSAILAALMPNAANYQALIDH 234 (347) T ss_pred ccccCccccccccccccccchhhhhHHHHHHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHhcccccccccccccccc Confidence 1000000000 001101 111255666677778777776666788899999998865443322 2223 Q ss_pred ccCCCCeecceeeEeeCccccceE-------E---------------EEehhce--E--------EEeecceEEEEeccc Q lcl|NC_016164. 757 VLEPGGTVNGYNVVRSNQVANGDV-------F---------------FGVWNQM--I--------MGMWGALDIQVNPYA 804 (836) Q Consensus 757 ~~~~~~~l~G~pVv~s~~~~~~~i-------~---------------~gD~s~~--~--------i~~~~~l~i~~~~~~ 804 (836) .++.-++++|++|+.|+.+|.... . -++|... . .+...++.+.... T Consensus 235 ~~G~Vg~i~G~~V~~Sn~lp~~~~t~~~~~~~~g~~~~~~~~~~~~~~~~f~~~~~l~~h~~A~g~v~~~~~~~e~~~-- 312 (347) T protein:vir:15 235 ERGTIRNVMGFEVVEVPHLTAGGAGDTREDAPADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKDLALERAR-- 312 (347) T ss_pred cceEEEEEeceEEEecccccccccccccccccccccccccccccceeeeccccceeeeeccceeeeeEeeceeeeecc-- Confidence 344446799999999999985321 1 1112110 1 1111222222222 Q ss_pred ccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 805 LDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 805 ~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) .-.+-...+++.+.+|.+++||++.+.+ +| T Consensus 313 ~~~~~~d~i~~~~~~G~~vlrP~~av~~--~~ 342 (347) T protein:vir:15 313 RANYQADQIIAKYAMGHGGLRPEAAGAI--VL 342 (347) T ss_pred cchhhhhhhehhhhcCCceeccccEEEE--ec Confidence 2223335677778889999999998777 44 No 146 >protein:vir:103323 Length: 364 # NCBI annotation: major capsid-like protein # Family: family:all:2806 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039668;genbank:gi:125999997;genbank:GeneID:4818399 Probab=99.11 E-value=1.5e-10 Score=74.39 Aligned_cols=281 Identities=14% Similarity=0.021 Sum_probs=154.6 Q ss_pred hhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEecCCceeeeeccCccccc Q lcl|NC_016164. 549 ILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQTGAATAYWVAEGGDPTE 628 (836) Q Consensus 549 ~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~ 628 (836) +..+.. ..+... .....-..+.-+.+.+++.+.....+.++++...+.-..+..+.+|+. +..+++...-|++..- T Consensus 1 ms~~n~-~t~~~~--~~~~~~~al~le~f~geV~taf~~~s~~~~~~~~rti~~gkS~q~~~i-G~~~~~~~~~G~~ld~ 76 (364) T protein:vir:10 1 MSNPNV-LTQPAV--SASGEVDSLLIEKFNNRVHEQYLKGENLLQWFDVQEVVGTNSVSNKYI-GETELQVLSPGKSPDA 76 (364) T ss_pred CCCccc-cccccc--ccccchhhhhhhhhhhhHHHHHHHHHhhcCcceeeeecccceEEeeee-eeeEEeeeccCcccCC Confidence 110000 000000 000111224447788888888888888888765554456678888887 4455555555555544 Q ss_pred ccccceeEEeeeeeeee-eehhHH-HHHhcchhH-HHHHHHHHHHHHHHHHHHHHHHh----hcCCcccc---ccccccc Q lcl|NC_016164. 629 SQPSVDQVALVAKTLGA-YTEFSR-RLMLQSSID-VEQMVRTELATVIALEIDRAALY----GLGSNSQP---EGLKFVT 698 (836) Q Consensus 629 ~~~~~~~it~~~~t~~~-~i~ISr-elL~ds~~~-l~~~i~~~l~~a~a~~~d~~il~----G~Gt~~~p---~Gi~~~~ 698 (836) ..+.-++.++.+.++-- ...|-. +-. ++.++ +.+.+..++++++++..|+.++. +.-++-.+ .++.... T Consensus 77 ~~~~~~k~~itID~ll~a~~~V~diDe~-q~~~D~vR~e~s~e~G~ALA~~~Dq~i~~~v~~aa~a~~~~~~~~~~~~~~ 155 (364) T protein:vir:10 77 SPTEFDKNRLVVDTTVIARNTVAHFHDV-QNDIDGLKSKLSVNQAKKLKKMEDSMVIQQLVLGGISNTEAIRKNPRVAGH 155 (364) T ss_pred CCcccCcEEEEecceeeechhhhhHHHH-hcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccCCcccCC Confidence 55666776777765431 122221 111 23455 68899999999999999987642 11111111 1111111 Q ss_pred c--cc-cccccccch----hHHHHHHHHHHHhhhccccCccEEEecHHHHHHHHHHhhcc-------CccccccCCCCee Q lcl|NC_016164. 699 G--IN-TENFGATNP----TYVELVSMESKVAADNADIGAMSYLTNSTLYGGFKTTEKAT-------STAQFVLEPGGTV 764 (836) Q Consensus 699 ~--~~-~~t~aa~~~----t~~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~~-------g~~~~~~~~~~~l 764 (836) + +. ..+...... -.+.|..+...|..++.....-+++++|..|..|..-..-- +...+..+....+ T Consensus 156 g~~i~~~~~a~~~~~~~~~l~~ai~~a~~~LdEkdVP~~~R~~vv~P~~y~~Ll~~~~lvn~d~~~~~~~~~~~G~v~~v 235 (364) T protein:vir:10 156 GFSIHIVGLASSFLTSPQYMMAAIEMAMEQQTEQEVDTSELCGLMPWTAFNCLRDADRIVDKSYTIAASDNTVDGFVLKS 235 (364) T ss_pred cceeeecccCcchhhhHHHHHHHHHHHHHHHhhcCCCccccEEEeChHHHHHHhcCCccccccccccCCCccccceeEEE Confidence 1 10 111111112 23445566677777777667789999999998876532211 2233444445678 Q ss_pred cceeeEeeCccccce---------------------E--EEEehhce--EEEee--------cceEEEEecccccccCcE Q lcl|NC_016164. 765 NGYNVVRSNQVANGD---------------------V--FFGVWNQM--IMGMW--------GALDIQVNPYALDKSGSV 811 (836) Q Consensus 765 ~G~pVv~s~~~~~~~---------------------i--~~gD~s~~--~i~~~--------~~l~i~~~~~~~~~~~~~ 811 (836) .|+||+.|+.+|... - ..+|++.. .++.+ .++..+... .-.+-.. T Consensus 236 ~Gv~Vv~Sn~lP~~~~~~~~t~~~t~h~ls~~~~g~~y~v~~d~~~~~~~~f~~~Al~tv~~~~~t~e~~~--~~~~~~~ 313 (364) T protein:vir:10 236 WNTPIVPSNRFPKLSDNTEGTGNTKHHKLSNAGNGNRYDVTAGQTSAQAVLFTQDALLVGRTISITGDIFY--EKKEKTW 313 (364) T ss_pred eceEEEeccccccccccccccccccccccccccCCcccccccccceeEEEEEecceEEEEEEecceeeeee--ccceeee Confidence 999999999998421 0 12444321 22222 222222221 2223344 Q ss_pred EEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 812 RVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 812 ~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) .+.+++-+|.+++||++++.++.+= T Consensus 314 ~ida~~a~G~g~lRPeaa~~i~~~~ 338 (364) T protein:vir:10 314 YIDTFLAEGAIPDRWEAVAVVTAAD 338 (364) T ss_pred eeeeehcccCcccCccceEEEEecC Confidence 5667777999999999999998666 No 147 >protein:vir:6324 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877471;genbank:gi:33300843;uniprot:Q7Y2D3;genbank:GeneID:1482613 Probab=99.04 E-value=1.1e-10 Score=75.26 Aligned_cols=284 Identities=14% Similarity=0.038 Sum_probs=156.3 Q ss_pred hhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEecCCceeeeeccCccccc Q lcl|NC_016164. 549 ILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQTGAATAYWVAEGGDPTE 628 (836) Q Consensus 549 ~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~ 628 (836) +..+.. ..+.......... .+.=+.+.+++.+.....+.++++...+.-..+..+.+|+. +..+++...-|.+... T Consensus 1 ms~~~~-~tr~~~~~s~~d~--al~le~f~geV~~af~~~s~~~~~~~~rti~~g~s~~~~~i-G~~~~~~~~pG~~l~~ 76 (335) T protein:vir:63 1 MSFLND-LTRPNYAGKNADV--DIHLEEHLGIVDKHFAYTSKFAPLMNIRDLRGSNVVRLDRL-GNVEAKGRRAGEELER 76 (335) T ss_pred CCCccc-chhhhcccccchh--heehhhhhhhHHHHHHhhhhhccccceeeeccceeEEEeee-eeeeeecccCCcCcCC Confidence 111110 1111111111111 13337889999999888999988866555555678888877 5566666766776665 Q ss_pred ccccceeEEeeeeeee-eeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHH----hhcCCc--ccccccccccccc Q lcl|NC_016164. 629 SQPSVDQVALVAKTLG-AYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAAL----YGLGSN--SQPEGLKFVTGIN 701 (836) Q Consensus 629 ~~~~~~~it~~~~t~~-~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il----~G~Gt~--~~p~Gi~~~~~~~ 701 (836) ..+..++.++.+.++- ....|-.---..+..++.+.+..++++++++..|+.++ .+.... ....+.++..+.. T Consensus 77 ~~~~~~k~~itVD~ll~a~~~I~dlDe~~~~yDvRse~s~e~G~aLA~~~D~~~~~~i~~aa~~~a~~~~~~~~~~G~~~ 156 (335) T protein:vir:63 77 SRVVNDKWNLTVDTLLYLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDLEDAFSPGVLE 156 (335) T ss_pred CCccccceEEEecceeechhhhhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccCCCcCCCcce Confidence 5566677777776643 22223222112245689999999999999999999764 232211 1111111111111 Q ss_pred cc--ccc----ccchhHHHHHHHHHHHhhhcccc---CccEEEecHHHHHHHHHHhhccCc--------cccccCCCCee Q lcl|NC_016164. 702 TE--NFG----ATNPTYVELVSMESKVAADNADI---GAMSYLTNSTLYGGFKTTEKATST--------AQFVLEPGGTV 764 (836) Q Consensus 702 ~~--t~a----a~~~t~~~l~~a~~~l~~~~~~~---~~~~~vmnp~~~~~L~~lkd~~g~--------~~~~~~~~~~l 764 (836) .. +.. ......+.+..+...|..++... ..-+.+++|..|..|..-+.--++ ..+.++.-..+ T Consensus 157 ~~~~tg~~~~~~~~~l~~a~~~a~~~L~e~dVP~~~~~dr~~vv~P~~y~~Ll~~~~l~n~~~~~s~~~~~~~~g~v~~v 236 (335) T protein:vir:63 157 KLDLTGLTAKQAADKIVRMHRRVVETFIDRDLGDAVYSEGLTPMSPRVFSLLLEHDKLMNVEYQATGATNDYVKSRVAIL 236 (335) T ss_pred eeeeccCcccccHHHHHHHHHHHHHHHHhccCCCcccCceEEEeChHHHHHHhccccccccccccccccccccCceeEEe Confidence 11 111 11112345556667777666543 236899999999988764332222 11333334578 Q ss_pred cceeeEeeCccccce-----------EEEEehhceE--EEeecc------eEEEEecccccccCcEEEEEEEEeccEEEc Q lcl|NC_016164. 765 NGYNVVRSNQVANGD-----------VFFGVWNQMI--MGMWGA------LDIQVNPYALDKSGSVRVTALQDVDVAVRH 825 (836) Q Consensus 765 ~G~pVv~s~~~~~~~-----------i~~gD~s~~~--i~~~~~------l~i~~~~~~~~~~~~~~~r~~~r~d~~v~~ 825 (836) .|+||+.|+.+|.+. .+-||+.... ++.+.- +.+..+-+..-.+-...+.+++-+|.+++| T Consensus 237 ~Gv~V~~sn~lP~~~~t~~~lg~a~n~~~~d~~~~~~~~~~~~Al~t~~~~~vt~e~~~~~~~~~~~i~~~~a~G~g~lR 316 (335) T protein:vir:63 237 NGVKVLETPRFATKAIAAHPLGRHFNVSAEESERQIALFLPSKTLITAQVAPVQAKLWEDNEKFSWVLDTFQMYNIGARR 316 (335) T ss_pred eceEEEeeccCCCCCcccccccccCCccccccceeEEEEEecceEEEEEEeecccceeeccchhhHHhHHHHHcCCcccc Confidence 999999999998543 3344554321 111111 111111111111123456666679999999 Q ss_pred ccceEEEee----cC Q lcl|NC_016164. 826 PEAFCRGND----NL 836 (836) Q Consensus 826 p~Af~~l~~----A~ 836 (836) |++.+.++- |+ T Consensus 317 Pe~a~~i~~tg~~~~ 331 (335) T protein:vir:63 317 PDTAGAIELKGIGAF 331 (335) T ss_pred cceEEEEEEcCCCce Confidence 999988873 22 No 148 >protein:vir:102944 Length: 330 # NCBI annotation: major head protein # Family: family:all:1522 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945286;genbank:gi:39653721;uniprot:Q708M6;genbank:GeneID:2672858 Probab=99.04 E-value=1.4e-10 Score=74.54 Aligned_cols=268 Identities=11% Similarity=0.062 Sum_probs=161.0 Q ss_pred hhcccccccccccchhhHHHHHHHHHhhhhhhhhcce--------eeecCCceEEEEEecCC-ceeeeeccCc-cccccc Q lcl|NC_016164. 561 VVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVT--------MLTGLQGPVAIPRQTGA-ATAYWVAEGG-DPTESQ 630 (836) Q Consensus 561 ~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~--------~~~~~~~~~~~p~~~~~-~~a~~v~Eg~-~~~~~~ 630 (836) +....+.-..++.|+++...+...+.....|.+-+.. .....+..+++|....- ..+..+.||. .++..+ T Consensus 1 Ma~~~T~l~d~i~pevf~~yv~~~~~~~~~l~qSG~i~~~~~i~~~~~~~G~~i~~P~~~~l~G~~~~~~dg~~~i~~~k 80 (330) T protein:vir:10 1 MANELTKILDTITPQQYNAYMQQYTAAKSAFVQSGIAVSDERVSKNITSGGLLVNMPFWNDLTGDSEVLGNGDKALETGK 80 (330) T ss_pred CCCCceEeeeeechhHHHHHHHHHhHHhhhhhhcccccccHHHHHHhhcCCCEEEecccccCCCcccccCCCccccchhh Confidence 3333345567889999988887777666655442211 11234567888887643 5667778885 588888 Q ss_pred ccceeEEeeeeeeeeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhc-C--Cc--ccccccccccccccccc Q lcl|NC_016164. 631 PSVDQVALVAKTLGAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGL-G--SN--SQPEGLKFVTGINTENF 705 (836) Q Consensus 631 ~~~~~it~~~~t~~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~-G--t~--~~p~Gi~~~~~~~~~t~ 705 (836) .+-++-....+..+..+.++.....-+.-+....+.++++....+..+..++.-. | .. ....+.+...+....+. T Consensus 81 i~t~~~~a~i~~~~k~~~~tD~a~~~~g~dp~~~i~~q~a~~w~~~~q~~lla~l~gvf~~~~~~~~~~~~~~~~~~~~~ 160 (330) T protein:vir:10 81 ITAGADIACVLYRGRGWAANELTGVVAGSDPVRAILNRIGAYWLREDQKALIATLNGIFATGTAGEKGALEETHVSDQSK 160 (330) T ss_pred cccceeEEEEEeecceeeehhhhhhhcchhHHHHHHHHHHHHhhhhHHHHHHHHHHhhhhhhhcccchhhhhhheecccc Confidence 8888888888888888888887655566677788999999988888877665422 1 00 01111122222222333 Q ss_pred cccchhHHHHHHHHHHHhhhccccCccEEEecHHHHHHHHHHhhccC-ccccccCCCCeecceeeEeeCccccce----- Q lcl|NC_016164. 706 GATNPTYVELVSMESKVAADNADIGAMSYLTNSTLYGGFKTTEKATS-TAQFVLEPGGTVNGYNVVRSNQVANGD----- 779 (836) Q Consensus 706 aa~~~t~~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~~g-~~~~~~~~~~~l~G~pVv~s~~~~~~~----- 779 (836) +.+.++.+.|.++..+|-.... .-.+|+||+.++..|+...--+- ++.-....-++++|++|++++.+|... T Consensus 161 ~~a~~s~~~l~~A~~~~GD~~~--~~~~ivmhS~v~~~L~~~~li~~~~~s~~~~~i~~~~G~~VivdD~~p~~~~~yt~ 238 (330) T protein:vir:10 161 ASTGIDAGMVLDAKQLLGDSAD--QVTAIAMHSAVYTKLQKDNLIQYIQPTTATINIPTYLGYRVIIDDGIAPTGDIYTS 238 (330) T ss_pred cccccCHHHHHHHHHHhccccc--cceEEEEcHHHHHHHHHhhhhhhhcccccCcccccccceEEEEeCCCCCCCCceeE Confidence 4455778889999888755432 35689999999998876431110 110011223678999999999998432 Q ss_pred EEEEehhceEEEee---cceEEEEecccccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 780 VFFGVWNQMIMGMW---GALDIQVNPYALDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 780 i~~gD~s~~~i~~~---~~l~i~~~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) ++|+ ...+.+... ..+.++.+ .+...++..+....++ +++|..+..-...+ T Consensus 239 yl~~-~GAi~~~~~~~~~~v~~Etd--Rd~~~g~~~l~~r~~~---~~hp~G~s~~~~~~ 292 (330) T protein:vir:10 239 YLFR-TGSIGLNTGNPSGLTTFETS--REAAKGNDMIYTRRAL---VMHPYGVKWTGAEV 292 (330) T ss_pred EEEe-cCceeeecccCCcccccccc--CCccccceEEEEeeEE---Eeeeeeeeeccccc Confidence 2333 222222221 11222222 2334566666666664 45566666554333 No 149 >protein:vir:78935 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522824;genbank:gi:158345059;genbank:GeneID:5687425 Probab=99.03 E-value=9.2e-11 Score=75.60 Aligned_cols=282 Identities=13% Similarity=0.050 Sum_probs=153.0 Q ss_pred hhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEecCCceeeeeccCccccc Q lcl|NC_016164. 549 ILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQTGAATAYWVAEGGDPTE 628 (836) Q Consensus 549 ~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~ 628 (836) +..+... .+.......... .+.=+.+.+++.+.....+.++++..++....+..+.+|+. +..++....-|.+..- T Consensus 1 ms~~~~~-t~~~~~~s~~d~--al~le~f~geV~~af~~~s~~~~~~~~rti~~g~s~~~~~i-G~~~~~~~~pG~~l~~ 76 (335) T protein:vir:78 1 MSFLNDL-TRPNYAGKNADV--DIHLEEHLGIVDKHFAYTSKFAPLMNIRDLRGSNVVRLDRL-GNVEAKGRRAGEELER 76 (335) T ss_pred CCccccc-cccccccccchh--hhhhhhhhhHHHHHHHHhhhhccccceeeeccceeEEEeee-eeeeecccccCcccCC Confidence 1111100 011111111111 23347889999999999999998876655556678888866 5556666666666655 Q ss_pred ccccceeEEeeeeeee-eeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHH----hhcCC--ccccc-----cccc Q lcl|NC_016164. 629 SQPSVDQVALVAKTLG-AYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAAL----YGLGS--NSQPE-----GLKF 696 (836) Q Consensus 629 ~~~~~~~it~~~~t~~-~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il----~G~Gt--~~~p~-----Gi~~ 696 (836) ..+..++.++.+.++- ....|-+--=..+..++.+.+.+++++++++..|+.++ .+... ..... |+.. T Consensus 77 ~~~~~~k~~itID~ll~a~~~VddlDe~~~~yDvR~e~s~~~G~aLA~~~Dq~~~~~l~~aa~~~a~~~~~~~~~~G~~~ 156 (335) T protein:vir:78 77 SRVVNDKWNLTVDTLLYLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDLEDAFSPGVLE 156 (335) T ss_pred CCcccCCeEEEecceeechhhHhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCCcCCCcce Confidence 5566677777776643 22223221112245689999999999999999999765 22221 11111 2211 Q ss_pred ccccccccc-cccchhHHHHHHHHHHHhhhccccC---ccEEEecHHHHHHHHHHhhccCc--------cccccCCCCee Q lcl|NC_016164. 697 VTGINTENF-GATNPTYVELVSMESKVAADNADIG---AMSYLTNSTLYGGFKTTEKATST--------AQFVLEPGGTV 764 (836) Q Consensus 697 ~~~~~~~t~-aa~~~t~~~l~~a~~~l~~~~~~~~---~~~~vmnp~~~~~L~~lkd~~g~--------~~~~~~~~~~l 764 (836) .......+. ..+..-.+.+.++...|...+.... .-+.+++|..|..|..-+..-.+ ..+.++.-..+ T Consensus 157 ~~~~tg~~~~~~~~~l~~a~~~a~~~l~ekdvP~~~~~~rv~vv~P~~y~~Ll~~~~l~n~~~~~s~~~~~~~~g~v~~v 236 (335) T protein:vir:78 157 KLDLTGLTAKEAAEKIVRMHRRVVETFIERDLGDAVYSEGLTPMSPRVFSLLLEHDKLMSVEYQATGATNDYVKSRVAIL 236 (335) T ss_pred eeeeccccccccHHHHHHHHHHHHHHHHhccCCCCCCCccEEEeChHHHHHHhcccccccccccccccccccccceeEEe Confidence 111111111 1111123344555555654443221 35789999999988764332222 12333444578 Q ss_pred cceeeEeeCccccce-----------EEEEehhc-eE-EEeec--------ceEEEEecccccccCcEEEEEEEEeccEE Q lcl|NC_016164. 765 NGYNVVRSNQVANGD-----------VFFGVWNQ-MI-MGMWG--------ALDIQVNPYALDKSGSVRVTALQDVDVAV 823 (836) Q Consensus 765 ~G~pVv~s~~~~~~~-----------i~~gD~s~-~~-i~~~~--------~l~i~~~~~~~~~~~~~~~r~~~r~d~~v 823 (836) +|+||+.|+.+|.+. .+-+|+.. .. ++... .+..+...+. .+-...+.+++-+|.++ T Consensus 237 ~Gv~V~~Sn~lP~~~~t~~~lg~a~n~~~~d~~~~~~~~~~~~Al~t~~~~~~~~e~~~~~--~~~~~~i~~~~a~G~g~ 314 (335) T protein:vir:78 237 NGVKVLETPRFATKAISAHPLGRHFNVSAEEAERQIALFLPSKTLITAQVAPVQAKLWEDH--DQFSWVLDTFQMYNIGA 314 (335) T ss_pred eceEEEeeccCCCCCCccccccccCCcccccccceEEEEEecceEEEEEEEecccceeecc--chhhHhhhHHHHcCCcc Confidence 999999999999543 22234432 11 12222 1221111111 11223556666799999 Q ss_pred EcccceEEEe----ecC Q lcl|NC_016164. 824 RHPEAFCRGN----DNL 836 (836) Q Consensus 824 ~~p~Af~~l~----~A~ 836 (836) +||++.+.++ -|+ T Consensus 315 lRPe~a~~i~~tg~~~~ 331 (335) T protein:vir:78 315 RRPDTAGAIELKGIEAF 331 (335) T ss_pred cCcceEEEEEecCCCcc Confidence 9999998887 233 No 150 >protein:vir:103285 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277465;genbank:gi:71834107;genbank:GeneID:3562396 Probab=98.97 E-value=2.3e-10 Score=73.38 Aligned_cols=272 Identities=14% Similarity=0.012 Sum_probs=161.8 Q ss_pred hhhcccccccccccc--hhhHHHHHHHHHhhhhhhhhcceeeec--CCceEEEEEecCCceeeeeccCc-ccccccccce Q lcl|NC_016164. 560 LVVDTASAAGDLVFT--DGRPGSFIELLRNRLALNTLGVTMLTG--LQGPVAIPRQTGAATAYWVAEGG-DPTESQPSVD 634 (836) Q Consensus 560 ~~~~~~~~~g~~vvp--~~~~~~ii~~l~~~~~l~~l~~~~~~~--~~~~~~~p~~~~~~~a~~v~Eg~-~~~~~~~~~~ 634 (836) .......++|.++.. +.+...+++.+.+....+++.....+. .-..+.+........+.|++.++ ..|..+...+ T Consensus 1 ~~~~~a~~~~~f~~~ql~~id~~v~e~~~~~l~~~~~i~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~v~~~~~ 80 (296) T protein:vir:10 1 MGVDKADAAGIWTVKQLTASLNKAYETEYDQNSVVNLFPVSNEIPGYAKYFEYPVFDGVGIAQIVADYTDDLPLVDALAT 80 (296) T ss_pred CcccchhhhHHHHHHHHHHHHHHHHhhhhcccccceecccccCCCCceeEEEeeeeeccCceeEeCCCccccceeeccce Confidence 000001112222221 123344555445444444443322111 12356666666677788887654 4788888999 Q ss_pred eEEeeeeeeeeeehhHHHHHhcc---hhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccccccccc---- Q lcl|NC_016164. 635 QVALVAKTLGAYTEFSRRLMLQS---SIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTENFGA---- 707 (836) Q Consensus 635 ~it~~~~t~~~~i~ISrelL~ds---~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~aa---- 707 (836) +....+..++..+.++.+=|..+ ..++...-....++++++.+|+.+|+|+.. ....||+|..++...+..+ T Consensus 81 ~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~ka~aA~~~~~~~~n~~~f~G~~~-~g~~GLlN~p~v~~~~~~~~W~~ 159 (296) T protein:vir:10 81 ERQGKVFRFGNAFLISIDEIKVGQATGQSLSTRKQSLAFEAHDKLLDKLVWSGSTA-HGIPSVFDYPNINNVVSGGSWSQ 159 (296) T ss_pred eEEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeeccc-ccceeEeecCCCccccccCCccC Confidence 99999999999999987666543 456788888888999999999999999764 3457999988765443322 Q ss_pred cchhHHHHHHHHHHHhhh-ccccCccEEEecHHHHHHHHHHhhccCcccc--c--cCCCCeecceeeEeeCccccce-EE Q lcl|NC_016164. 708 TNPTYVELVSMESKVAAD-NADIGAMSYLTNSTLYGGFKTTEKATSTAQF--V--LEPGGTVNGYNVVRSNQVANGD-VF 781 (836) Q Consensus 708 ~~~t~~~l~~a~~~l~~~-~~~~~~~~~vmnp~~~~~L~~lkd~~g~~~~--~--~~~~~~l~G~pVv~s~~~~~~~-i~ 781 (836) ..-.++||.+++..+..+ ++...+..++++|..+..|.......|.-.+ + +.++.++.+.|...+......+ ++ T Consensus 160 ~t~i~~Di~~~~~~l~~~s~g~~~p~~l~L~p~~~~~L~~~~~~~~~t~l~~ik~~~~~l~i~~~~~l~~a~~~g~~~~v 239 (296) T protein:vir:10 160 PTTAVSDITSLLDIIETSTNGQHRATHLLLPTTARRIMQNLVPGTSVSYGEFFRQNNSGVTVEFVQYLNDYNGTGTSAAI 239 (296) T ss_pred HHHHHHHHHHHHHHHHHhhCceecceeEEeCHHHHHHHhhccCCCCccHHHHHHHhcCCceEEEeeeeccCCCCcceEEE Confidence 223478999999887764 2345667899999999888655433332111 1 2234455555544433222122 22 Q ss_pred EEe--hhceEEEeecceEEEEecccccccCcEEEEEEEEec-cEEEcccceEEE---eec Q lcl|NC_016164. 782 FGV--WNQMIMGMWGALDIQVNPYALDKSGSVRVTALQDVD-VAVRHPEAFCRG---NDN 835 (836) Q Consensus 782 ~gD--~s~~~i~~~~~l~i~~~~~~~~~~~~~~~r~~~r~d-~~v~~p~Af~~l---~~A 835 (836) +-+ ...+.+..-..+. ..+ .....-.+.++...+++ +-+++|.||+.+ +.| T Consensus 240 ~~~~~~~~~~~~v~~~~~--~~~-~e~~~l~~~~~~~~~~~Gv~i~~P~ai~~~dGI~~~ 296 (296) T protein:vir:10 240 AYEKDPNNMAIEIPEATN--ALP-AQPKDLHFKIPVTSKATGLIVYRPLTMAVMKGITFA 296 (296) T ss_pred EEEcCCceEEEEcCccee--eec-ccccCceEEEeeEeeEEEEEEECCceeEEEeeeecC Confidence 222 3333333222222 222 12233446777788885 789999999998 678 No 151 >protein:vir:1583 Length: 351 # NCBI annotation: minor capsid protein # Family: family:all:1522 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695165;swissprot:trembl:o03966;genbank:gi:23455804;uniprot:O03966;genbank:GeneID:955561 Probab=98.93 E-value=8e-10 Score=70.44 Aligned_cols=266 Identities=10% Similarity=0.020 Sum_probs=155.7 Q ss_pred hhcccccccccccchhhHHHHHHHHHhhhhhhhhcce--------eeecCCceEEEEEecC-CceeeeeccCcccccccc Q lcl|NC_016164. 561 VVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVT--------MLTGLQGPVAIPRQTG-AATAYWVAEGGDPTESQP 631 (836) Q Consensus 561 ~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~--------~~~~~~~~~~~p~~~~-~~~a~~v~Eg~~~~~~~~ 631 (836) +. .+....+++|+++...+.+.......|.+-+.. .....+..+++|.... +..+..+.|+..++..++ T Consensus 1 MA--~T~lsd~i~PEvf~~yv~~~~~~~~~l~qSG~i~~~~~l~~~~~~~G~~it~P~~~~l~Gd~~~~~~~~~i~~~ki 78 (351) T protein:vir:15 1 MA--ETHLSDLIVPEVFGNYVVNQIIKTNRFVQSGILTPDPDLGPHLLEAGTRITVPFLNDLTGDPDNWTDSDDIDVNNL 78 (351) T ss_pred CC--ceeeeeeechhHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhcCCCEEEecccccCCCcccccCCCcccchhee Confidence 22 223467789999988887766666655442211 1123355788887764 357778899999998888 Q ss_pred cceeEEeeeeeeeeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhc-C--Cccccccccccccccccccccc Q lcl|NC_016164. 632 SVDQVALVAKTLGAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGL-G--SNSQPEGLKFVTGINTENFGAT 708 (836) Q Consensus 632 ~~~~it~~~~t~~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~-G--t~~~p~Gi~~~~~~~~~t~aa~ 708 (836) +-++-....+..+..+.++.....-+.-+....+.++|+...++..+..+|.-. | .+..... -+...+...+.+.. T Consensus 79 tt~~~~a~i~~~~kg~~~tD~a~~~sg~dp~~~i~~q~a~~w~~~~q~~lla~l~gv~~~~~~~~-~~~~d~t~~~~~~~ 157 (351) T protein:vir:15 79 TSGKQQGIKFYQTKAYGYTDLGTMISGAPVQETIGNRFAAFWQRADQKTLLSVLKGVMGVTKIAN-SKVYDQTKVSPSEP 157 (351) T ss_pred cccceeEEEEeeccceehhhhhHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhchhhcc-cceecccccccccc Confidence 888888888888888888886544445567778999999999998888776432 1 1111000 00111122233445 Q ss_pred chhHHHHHHHHHHHhhhccccCccEEEecHHHHHHHHHHhhccC-ccccccCCCCeecceeeEeeCccccc--------- Q lcl|NC_016164. 709 NPTYVELVSMESKVAADNADIGAMSYLTNSTLYGGFKTTEKATS-TAQFVLEPGGTVNGYNVVRSNQVANG--------- 778 (836) Q Consensus 709 ~~t~~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~~g-~~~~~~~~~~~l~G~pVv~s~~~~~~--------- 778 (836) .++++.|.++..++-..... .-.+|+||+..+..|+...--+- ++.--...-++++|++|++++.+|.. T Consensus 158 ~is~~~l~~A~~~~GD~~~~-~~~~ivmhS~v~~~L~~~~li~~~~~s~~~~~i~t~~G~~VivdD~~p~~~~~~~~~~y 236 (351) T protein:vir:15 158 MFGAKGFTGAIGLMGDLQDT-AFGAIAVNSATYSLMKVQGLIETIQPQNGATPFEAYNGLRIVLDDDIEIDLTDKTKPVS 236 (351) T ss_pred ccCHHHHHHHHHHhcccccc-ceEEEEEChHHHHHHHhhhhhhhccccccCcccceecceEEEEcCCCccccCCCCCcee Confidence 57889999999987553221 24789999999998875431110 00000112367899999999999842 Q ss_pred -eEEEEehhceEEEeec-ceEEEEecccccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 779 -DVFFGVWNQMIMGMWG-ALDIQVNPYALDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 779 -~i~~gD~s~~~i~~~~-~l~i~~~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) .++||. ..+.+.... .+++..++. ...|+..+..+.++ +++|..+..-+... T Consensus 237 tsyl~~~-GAi~~~~~~~~ve~~rd~~--~~~g~d~l~~r~~~---~~hp~G~s~~~~~~ 290 (351) T protein:vir:15 237 TSYIFAP-GAVRYSTNMRSTETKYDPL--INGGQDVIVQKRVG---TIHVAGTSIKASFS 290 (351) T ss_pred EEEEEec-ceeeeecCCcCcceeeccc--CCCCceEEEEeeee---eeeeeeeeeccccc Confidence 122322 222222211 133333333 23344445444443 46666665432211 No 152 >protein:vir:102655 Length: 322 # NCBI annotation: Hypothetical protein # Family: family:all:6384 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_052979;genbank:gi:50282923;genbank:GeneID:2948122 Probab=98.87 E-value=1.3e-09 Score=69.30 Aligned_cols=277 Identities=10% Similarity=-0.030 Sum_probs=144.2 Q ss_pred HHhhhhhhhhhhhhhhhhhhhhhcccccccccccchh----hHHHHHHHHHhh-hhhhhhcceeeecCCceEEEEEecCC Q lcl|NC_016164. 540 QRMGVTPRGILAPNDVLHRDLVVDTASAAGDLVFTDG----RPGSFIELLRNR-LALNTLGVTMLTGLQGPVAIPRQTGA 614 (836) Q Consensus 540 ~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~----~~~~ii~~l~~~-~~l~~l~~~~~~~~~~~~~~p~~~~~ 614 (836) ..++.-..+ ...-+. -+++. +.+.+.-.+.+. +.|++- .+......+...+ ..-+. T Consensus 1 ~~~~~~~~~---------------~~~Ms~--~i~~~fv~qy~~~v~~~~qq~~s~L~~t-V~~~~~~~~~~~~-~~~~~ 61 (322) T protein:vir:10 1 MKLNAIMSM---------------LPLIAG--DIDQAFVQTYETTLRILSQQKSAKLKQY-CQHKNESSESHNW-ETLAS 61 (322) T ss_pred Ccccceeee---------------eeeeec--hhhhHHHHHHHHHHHHHHHHhhhhhhcc-cccccccccccce-eeccc Confidence 000000000 000000 12332 333332222222 222221 1211111221111 11111 Q ss_pred ceeeeeccCc----------ccccccccceeEEeeeeeeeeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_016164. 615 ATAYWVAEGG----------DPTESQPSVDQVALVAKTLGAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYG 684 (836) Q Consensus 615 ~~a~~v~Eg~----------~~~~~~~~~~~it~~~~t~~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G 684 (836) ..+..++++. ..|......+........+...+.|.+.-......+..+...+..+.+++++.|..++.+ T Consensus 62 ~~~~~~~~~~~~~~~~d~~~dtp~~~~~~~~r~~~~~d~~~~~~VDd~D~~k~~~D~~~~~~~~~a~AL~R~~D~~I~~a 141 (322) T protein:vir:10 62 MDPDAVKRKRSRQQSADGTYPTPVNNKPFAKRRTNVDTYDTGHVVEQEDISQMLLDPNSALITSQAYAMARKTDDLIIAG 141 (322) T ss_pred ccccccccccccccccCcccCCCccccccceEEEeecccccceecchHHHHHhhcCchHHHHHHHHHHhhhHHHHHHHhh Confidence 2222222111 112222234444445555555567777665556677888999999999999999988864 Q ss_pred c-CCcc--cccccccccccccccccccchhHHHHHHHHHHHhhhcccc-CccEEEecHHHHHHHHHHhhccCc------c Q lcl|NC_016164. 685 L-GSNS--QPEGLKFVTGINTENFGATNPTYVELVSMESKVAADNADI-GAMSYLTNSTLYGGFKTTEKATST------A 754 (836) Q Consensus 685 ~-Gt~~--~p~Gi~~~~~~~~~t~aa~~~t~~~l~~a~~~l~~~~~~~-~~~~~vmnp~~~~~L~~lkd~~g~------~ 754 (836) . |... .+.+.........+..+...++++.|+++...|..+.... ++-.++++|..|..|......... . T Consensus 142 ~~g~a~~~~~gt~v~~~ss~~i~~g~~g~t~~kl~~a~~~l~~~dvp~d~~R~~vv~p~~~~~LL~d~~~ts~D~~~~~~ 221 (322) T protein:vir:10 142 AWKPASIKGTGQPVEFLATQEIGDGTKPISFDYVTEITERFLENEIEPEVSKVIVIGPTQARKLLQITEATSADYTSAMD 221 (322) T ss_pred hhccccccccccccccCCCcccccCccchhHHHHHHHHHHHHhcCCCCCCCeEEEeCHHHHHHHhcchhhhhhhcccchh Confidence 3 2211 1111111111122333445688999999999998887664 345788999999887654433221 1 Q ss_pred ccccCCCCeecceeeEeeCccccc------------------eEEEEehhceEEEeecceEEEEecccccccCcEEEEEE Q lcl|NC_016164. 755 QFVLEPGGTVNGYNVVRSNQVANG------------------DVFFGVWNQMIMGMWGALDIQVNPYALDKSGSVRVTAL 816 (836) Q Consensus 755 ~~~~~~~~~l~G~pVv~s~~~~~~------------------~i~~gD~s~~~i~~~~~l~i~~~~~~~~~~~~~~~r~~ 816 (836) .+..+..++++|+.++.++.+|.. ..++..-+.+.++.+..+....+.... ......+++. T Consensus 222 l~~~G~ig~~lGf~~i~s~~lp~~~~t~~~~~~~~~~~~~~~~~~a~~k~Av~~a~~~dv~~~i~~~~~-~~~a~~I~~~ 300 (322) T protein:vir:10 222 LQSKGIITNWMGYTWIVSTRLDKFDPTQWGMAAEDGPQGDEIWCIAMTDMALGYHSCKDIWTKVAEDPS-ASFAWRIYSA 300 (322) T ss_pred hhhcCeeeeeeeEEEEEeccCCccccccccccccCCCCccceeEEEEecCceeEEEeeeeeEEeeccCC-cchhhhhhhh Confidence 123355678999999999998832 133444455555555554444332211 1234567778 Q ss_pred EEeccEEEcccceEEEe--ecC Q lcl|NC_016164. 817 QDVDVAVRHPEAFCRGN--DNL 836 (836) Q Consensus 817 ~r~d~~v~~p~Af~~l~--~A~ 836 (836) +-+|..+++|+.++.+. ++| T Consensus 301 ~~~Ga~ri~~~gVv~i~~~e~~ 322 (322) T protein:vir:10 301 FTADCVRVEDEHIFKLRLKNSL 322 (322) T ss_pred hhhCceEeccCcEEEEEEeccC Confidence 89999999999998765 788 No 153 >protein:vir:97031 Length: 402 # NCBI annotation: 31 # Family: family:all:2806 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654132;genbank:gi:108862016;genbank:GeneID:5075980 Probab=98.87 E-value=1.5e-10 Score=74.41 Aligned_cols=282 Identities=14% Similarity=0.088 Sum_probs=151.8 Q ss_pred hhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEecCCceeeeeccCccccc Q lcl|NC_016164. 549 ILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQTGAATAYWVAEGGDPTE 628 (836) Q Consensus 549 ~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~ 628 (836) +..+.. ..+... .....-..+.-+.+.+++.+.....+.++++...+.-..+..+.+|+. +..+++...-|++..- T Consensus 1 Ms~~n~-~t~~~~--~~s~~~~al~le~f~geV~taF~~~si~~~~~~vrti~~GkS~qf~~i-G~~~a~y~~~G~~ldg 76 (402) T protein:vir:97 1 MSTPNT-LTNVAV--SASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYL-GETELQVLAPGQSPNA 76 (402) T ss_pred CCCccc-cccccc--ccccchhhhhhhhhhhhHHHHHHHHHhhcCcceeeeecccceEEEEEE-eeeEEeeeccccccCC Confidence 110000 000000 000111224447788888888888888888765554456678888887 4455555555555544 Q ss_pred ccccceeEEeeeeeeee-eehhHH--HHHhcchhH-HHHHHHHHHHHHHHHHHHHHHHh-----hc----CCcccccccc Q lcl|NC_016164. 629 SQPSVDQVALVAKTLGA-YTEFSR--RLMLQSSID-VEQMVRTELATVIALEIDRAALY-----GL----GSNSQPEGLK 695 (836) Q Consensus 629 ~~~~~~~it~~~~t~~~-~i~ISr--elL~ds~~~-l~~~i~~~l~~a~a~~~d~~il~-----G~----Gt~~~p~Gi~ 695 (836) ..+.-++..+.+.++=- ...|-+ +. ++..+ +.+.+.+++++++++..|+.++. +. +....|.+.- T Consensus 77 ~~~~~~k~~ItID~lL~a~~~V~diDea--q~~yD~vRse~s~e~G~ALA~~~Dq~ii~~i~~aa~a~t~~~~~~~~~~~ 154 (402) T protein:vir:97 77 TPTQADKNQLVIDTTVIARNTVAHIHDV--QGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKG 154 (402) T ss_pred CCcccccEEEEeCceeechhhhhhHHHH--HhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccCcccc Confidence 45556666666665421 122211 22 23455 67899999999999999996642 11 1111222222 Q ss_pred ccccccc-ccccccchh----HHHHHHHHHHHhhhccccCccEEEecHHHHHHHHHHhhc-------cCccccccCCCCe Q lcl|NC_016164. 696 FVTGINT-ENFGATNPT----YVELVSMESKVAADNADIGAMSYLTNSTLYGGFKTTEKA-------TSTAQFVLEPGGT 763 (836) Q Consensus 696 ~~~~~~~-~t~aa~~~t----~~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~-------~g~~~~~~~~~~~ 763 (836) +....+. .+......+ .+.+..+...|..++.....-+++++|..|..|..-.+- .+...+..+.-.. T Consensus 155 ~g~s~~~~~t~~~a~~~~~~l~~ai~~a~~~LdEkdVP~~dRv~vv~P~~y~~Ll~~~rl~n~d~~~~~~g~~~~G~v~~ 234 (402) T protein:vir:97 155 HGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGFVLS 234 (402) T ss_pred cccccccccccchhhcCHHHHHHHHHHHHHHHHhcCCCccccEEEeChHHHHHHhhcccccchhhccccCCccccceeEE Confidence 1111111 111111122 345556667777777776677999999999888754221 1223345555567 Q ss_pred ecceeeEeeCccccce---------------E--EEEehhc--eEEEeecceE------EEEecccccccCcEEEEEEEE Q lcl|NC_016164. 764 VNGYNVVRSNQVANGD---------------V--FFGVWNQ--MIMGMWGALD------IQVNPYALDKSGSVRVTALQD 818 (836) Q Consensus 764 l~G~pVv~s~~~~~~~---------------i--~~gD~s~--~~i~~~~~l~------i~~~~~~~~~~~~~~~r~~~r 818 (836) +.|++|+.|+.+|... . +-||++. ..++.+..+- +..+-+..-.+-...+.+++- T Consensus 235 v~Gv~Vv~SnnlP~~a~~it~~~ls~a~~G~~y~~t~d~t~~~~~~f~~~Av~tvk~~~vT~~~~~d~r~~~~~id~~~a 314 (402) T protein:vir:97 235 SYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEKTYYIDTFMA 314 (402) T ss_pred EeceEEEecCccccccccccccccccCCCCccCCcCcccceeEEEEEecceEEEEEeeccccchhhchhHHHHHHHHHHH Confidence 9999999999998531 1 1255543 2233332221 111111111122233555667 Q ss_pred eccEEEcccceEEE------eecC Q lcl|NC_016164. 819 VDVAVRHPEAFCRG------NDNL 836 (836) Q Consensus 819 ~d~~v~~p~Af~~l------~~A~ 836 (836) ++.+++||++..++ +.|. T Consensus 315 ~G~g~~RPeaa~vv~~~~~~t~~~ 338 (402) T protein:vir:97 315 EGAIPDRWEAVSVVTTKRDATTGD 338 (402) T ss_pred hCCcccCccceEEEEEeccccccc Confidence 89999999998777 3344 No 154 >protein:vir:99675 Length: 324 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249589;genbank:gi:68299740;genbank:GeneID:3799990 Probab=98.81 E-value=6.2e-10 Score=71.05 Aligned_cols=237 Identities=13% Similarity=0.107 Sum_probs=130.6 Q ss_pred cceeeecCCceEEEEEecCCceeeeeccCccccc--cccccee--EEeeeeeeeeeehhHHHHHhcchhHHHHHHHHHHH Q lcl|NC_016164. 595 GVTMLTGLQGPVAIPRQTGAATAYWVAEGGDPTE--SQPSVDQ--VALVAKTLGAYTEFSRRLMLQSSIDVEQMVRTELA 670 (836) Q Consensus 595 ~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~--~~~~~~~--it~~~~t~~~~i~ISrelL~ds~~~l~~~i~~~l~ 670 (836) ..|.+. .+..+.+|+. +..++....-|.++.. ..+.-.+ +++.-.+|....--.-+-. ++..++.+.+.++++ T Consensus 1 ~vr~i~-~g~s~~~~~i-G~~~~~~~~~G~~l~~~~~~~~~~e~~itID~~l~~~~~VdDiD~~-qa~~Dlr~e~s~~~G 77 (324) T protein:vir:99 1 MTRTIT-SGKSAQFPVM-GRTKARYLKQGQSLDDGREDIKHTEKVITIDGLLTTDVLIYDIEDA-MNHYDVRSEYSTQMG 77 (324) T ss_pred Ceeeee-cCceEEEeee-eeeEeccccCCCCcCCCcCCcCcccEEEEecchhhhhhhhhhHHHH-hcCccchhHHHHHHH Confidence 122233 3567888877 5566666666666533 3344444 4444444444321111112 245679999999999 Q ss_pred HHHHHHHHHHHHhh---c---CCccccccccc--------ccccccccccccchhHHHHHHHHHHHhhhccccCccEEEe Q lcl|NC_016164. 671 TVIALEIDRAALYG---L---GSNSQPEGLKF--------VTGINTENFGATNPTYVELVSMESKVAADNADIGAMSYLT 736 (836) Q Consensus 671 ~a~a~~~d~~il~G---~---Gt~~~p~Gi~~--------~~~~~~~t~aa~~~t~~~l~~a~~~l~~~~~~~~~~~~vm 736 (836) .++++..|+.++.- . .+.....+... ..+............++.|.++...|..++.....-.+++ T Consensus 78 ~aLA~~~Dq~i~~~~a~~~~~~a~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~dai~~a~~~Lde~~VP~~gR~~vv 157 (324) T protein:vir:99 78 EALAMAADVANYAEMAKLVNSRKETTNENIEGLGAASLVKITGKKEDPAKYGTQVIQALTYARAAFAKKYIPAGDRTFYT 157 (324) T ss_pred HHHHHHHHHHHHHHHHHhhhcccccccCCcccCCccceecccccccccccCHHHHHHHHHHHHHHHhhcCCCCCCCEEEe Confidence 99999999877522 1 11011111111 1111111111112346788888888988887777788999 Q ss_pred cHHHHHHHHHHhhcc-----CccccccCCCCeecceeeEeeCccccceE-------------------------EEEehh Q lcl|NC_016164. 737 NSTLYGGFKTTEKAT-----STAQFVLEPGGTVNGYNVVRSNQVANGDV-------------------------FFGVWN 786 (836) Q Consensus 737 np~~~~~L~~lkd~~-----g~~~~~~~~~~~l~G~pVv~s~~~~~~~i-------------------------~~gD~s 786 (836) +|..|..|..-+..+ +...+..+.-++++|++|+.|+.+|.... |-+|++ T Consensus 158 ~P~~y~~Ll~~~~~~~~~~~~~~~~~~G~V~~i~Gf~V~~Sn~lp~~~~t~~~~a~~~~~~~~~~~~~~~~~~ky~~d~~ 237 (324) T protein:vir:99 158 DPDTYSAILAALMPNAANYAALIDPETGNIRNVMGFEVVETPHMTAQMVTNPTDAFDGTGHIFPATGDSTTTGKMTVGAD 237 (324) T ss_pred ChHHHHHHhhcccccccccccccceecceEEEEeceEEEecCCccccccccccccccccccccccccccccccccccccC Confidence 999998775432222 12233444446789999999999985321 233433 Q ss_pred ce--EEEee--------cceEEEEecccccccCcEEEEEEEEeccEEEcccceEEEe--ec----C Q lcl|NC_016164. 787 QM--IMGMW--------GALDIQVNPYALDKSGSVRVTALQDVDVAVRHPEAFCRGN--DN----L 836 (836) Q Consensus 787 ~~--~i~~~--------~~l~i~~~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~--~A----~ 836 (836) .. .++.. ..+.++... .-.+-...++..+-++.+++||++.+.++ .- | T Consensus 238 ~~~gl~~~~~a~~tv~~~~~~~e~~~--~~~~~~d~i~~~~a~G~~~lRPe~a~~v~l~~~~~~~~ 301 (324) T protein:vir:99 238 NVVGLFVHRSAVATLKLKDMALERAR--RPEYQADQIIAKYAMGHGGLRPEAVGAIIFEDGETPAV 301 (324) T ss_pred ceeEEEEehhheEEEeeecceeccee--chhhHHHhhhhhhhhcCcccccceEEEEEEccCccccc Confidence 21 11111 112222221 11223456778888899999999886554 22 2 No 155 >protein:vir:99075 Length: 392 # NCBI annotation: gp30 # Family: family:all:10837 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655895;genbank:gi:109521467;genbank:GeneID:4158040 Probab=98.80 E-value=2.6e-09 Score=67.68 Aligned_cols=261 Identities=10% Similarity=0.039 Sum_probs=141.1 Q ss_pred cccccccchhhHHHHHHHHHhhhhhhhhcceeeecC-----CceEEEEEecCCceeee-----eccCcccccccccceeE Q lcl|NC_016164. 567 AAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGL-----QGPVAIPRQTGAATAYW-----VAEGGDPTESQPSVDQV 636 (836) Q Consensus 567 ~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~-----~~~~~~p~~~~~~~a~~-----v~Eg~~~~~~~~~~~~i 636 (836) -+-.++.|+.+...+++.|+...++.++..+-..+. +..+++|+... ..+.. .+.++.+...+.+...+ T Consensus 1 Ma~~~~~p~~~a~~~l~~l~~~lv~~~lv~~~~~~~~~~~~GdtV~i~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (392) T protein:vir:99 1 MANAFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAP-SRGHTRKLRGAGAERNLTVSDFTEDSF 79 (392) T ss_pred CccccccHHHHHHHHHHHHHhhccchhhhccccccccccCCCCeEEEeeccc-ccceeeeccccccCCcccccccccceE Confidence 122447899999999999999998888765543322 22477765433 22222 23455666667777777 Q ss_pred Eeeeee-eeeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccccccccccchhHHHH Q lcl|NC_016164. 637 ALVAKT-LGAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTENFGATNPTYVEL 715 (836) Q Consensus 637 t~~~~t-~~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~aa~~~t~~~l 715 (836) .+.+.+ .+.-+.|+.+-...+..++...+.+..+++++..++..++.-.... +.+. .. ..+.......|+.| T Consensus 80 ~~~id~~k~~~~~i~d~e~~~~~~~~~~~~~~~a~~ala~~vd~~i~~~~~~a--~~~~----~~-~~~~~~~~~~~~~i 152 (392) T protein:vir:99 80 PVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGA--PYEA----AG-AVHEVAPDEFFKGV 152 (392) T ss_pred EEEEeeeeecceeechHHHhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcc--cccc----cc-cccccChhhhHHHH Confidence 777733 3445667776655566678888889999999999998876432111 1000 00 01111223468899 Q ss_pred HHHHHHHhhhccccCccEEEecHHHHHHHHHHhhc-----cCc---cccccCCCCeecceeeEeeCccccceEEEEehhc Q lcl|NC_016164. 716 VSMESKVAADNADIGAMSYLTNSTLYGGFKTTEKA-----TST---AQFVLEPGGTVNGYNVVRSNQVANGDVFFGVWNQ 787 (836) Q Consensus 716 ~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~-----~g~---~~~~~~~~~~l~G~pVv~s~~~~~~~i~~gD~s~ 787 (836) .++...|..++... .-.++++|..+..|...... .|. ..+..+.-+++.|++|+.++.+|.+..+.+..+. T Consensus 153 ~~a~~~L~~~~vP~-~R~~vv~p~~~~~l~~~~~~~~~~~~g~~~~~~l~~G~vg~i~G~~v~~s~~~~~~t~~a~~~~a 231 (392) T protein:vir:99 153 NGARRALNELYIPQ-GRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYEIVESTLIPHGDAYLYHPTA 231 (392) T ss_pred HHHHHHHhhcCCCC-CCEEEEcHHHHHHHhcccceeecccccchhhhhhhcceeeeeeeeEEEeecccccccceeeeccc Confidence 99999998887765 45889999998887643111 111 1233455578999999999999987766554443 Q ss_pred eEEEeecce-----------------EE--EEecccccccCcEEEEEE-------EEeccEEEcccceEEEeecC Q lcl|NC_016164. 788 MIMGMWGAL-----------------DI--QVNPYALDKSGSVRVTAL-------QDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 788 ~~i~~~~~l-----------------~i--~~~~~~~~~~~~~~~r~~-------~r~d~~v~~p~Af~~l~~A~ 836 (836) +.+...... .. ..+....+..+...+... ...+.+......+......+ T Consensus 232 ~~~at~a~v~~~~~~~~~s~s~~~~v~~~~~~~~~~t~~s~~~~v~~~~g~~~v~~~~~~~~~~~~~~~~~~~~v 306 (392) T protein:vir:99 232 FIMATRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSI 306 (392) T ss_pred cccccccccccccccceeEEecccceecceeecccceeeccccccceeEEEEEEeeccccceeeeeeeeeeccee Confidence 322221111 00 000000011111110000 00000111111111000000 No 156 >protein:vir:107687 Length: 319 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003898;genbank:gi:45686314;genbank:GeneID:2773027 Probab=98.80 E-value=2.8e-09 Score=67.48 Aligned_cols=294 Identities=12% Similarity=-0.008 Sum_probs=159.2 Q ss_pred hhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhcccccccccccc--hhhHHHHHHHHHhhhh Q lcl|NC_016164. 513 IRAQMMPGDRAAFEAAAFEREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTASAAGDLVFT--DGRPGSFIELLRNRLA 590 (836) Q Consensus 513 ~~a~~~~~~~~~~~~~~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp--~~~~~~ii~~l~~~~~ 590 (836) .+ ......+. .. .....+...+ .. .....+.|.++.- +.+...+++...+... T Consensus 1 ~~-------~~~~~~~~-~~-~~~~~~~~~~-----------~~-----~da~~~~g~~~~~ql~~id~~v~e~~~~~l~ 55 (319) T protein:vir:10 1 MT-------TKKFDEAD-KS-NVEMYLIQAG-----------VK-----QDAAATMGIWTAQELHRIKSQSYEEDYPVGS 55 (319) T ss_pred CC-------CcchhHHh-hH-HHHHHHhhcc-----------ch-----hhhhhhhhhHHHHHHHHHHHHHHhhhhccee Confidence 00 00000000 00 0000000000 00 0000111211111 2333455666666555 Q ss_pred hhhhcceeeec--CCceEEEEEecCCceeeeeccCcc-cccccccceeEEeeeeeeeeeehhHHHHHhcc---hhHHHHH Q lcl|NC_016164. 591 LNTLGVTMLTG--LQGPVAIPRQTGAATAYWVAEGGD-PTESQPSVDQVALVAKTLGAYTEFSRRLMLQS---SIDVEQM 664 (836) Q Consensus 591 l~~l~~~~~~~--~~~~~~~p~~~~~~~a~~v~Eg~~-~~~~~~~~~~it~~~~t~~~~i~ISrelL~ds---~~~l~~~ 664 (836) .+++....... ....+.+........+.|++.++. +|..+..++.....+..++..+.++.+=|... ..++... T Consensus 56 ~~~~i~v~~~~~~~~~~~~~~~~~~~G~a~~~~d~~~dip~v~~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~ 135 (319) T protein:vir:10 56 ALRVFPVTTELSPTDKTFEYMTFDKVGTAQIIADYTDDLPLVDALGTSEFGKVFRLGNAYLISIDEIKAGQATGRPLSTR 135 (319) T ss_pred chhhcccccCCCCceEEEEeeeeccccceeeecCccccccceeccceeeEEEEEEEEeeeeecHHHHHHHHHhCCChHHH Confidence 55554322122 223556666666777888876544 78888899999999999999999987655543 4567788 Q ss_pred HHHHHHHHHHHHHHHHHHhhcCCccccccccccccccccccccc--------chhHHHHHHHHHHHhhh-ccccCccEEE Q lcl|NC_016164. 665 VRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTENFGAT--------NPTYVELVSMESKVAAD-NADIGAMSYL 735 (836) Q Consensus 665 i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~aa~--------~~t~~~l~~a~~~l~~~-~~~~~~~~~v 735 (836) -....++++++++|+.+|+|+.. ....||+|..++...+.+.. .-.+++|.+++.++..+ ++...+..++ T Consensus 136 k~~aA~~~~~~~~n~i~f~G~~~-~g~~GLlN~p~~~~~~~~~~~~~~t~t~~~i~~di~~~~~~l~~~s~g~~~p~~L~ 214 (319) T protein:vir:10 136 KASACQLAHDQLVNRLVFKGSAP-HKIVSVFNHPNITKITSGKWIDVSTMKPETAEAELTQAIETIETITRGQHRATNIL 214 (319) T ss_pred HHHHHHHHHHHhhceEEEeeccc-ccceeEEeCCCceeeecCCCCCccccCHHHHHHHHHHHHHHHHHhcCceeeceEEE Confidence 88888999999999999999764 34679999888655433211 12467888888888754 2334677899 Q ss_pred ecHHHHHHHHHHhhccCccc--cc--cCCCCeecceeeEeeCccccce-EEEE--ehhceEEEeecceEEEEeccccccc Q lcl|NC_016164. 736 TNSTLYGGFKTTEKATSTAQ--FV--LEPGGTVNGYNVVRSNQVANGD-VFFG--VWNQMIMGMWGALDIQVNPYALDKS 808 (836) Q Consensus 736 mnp~~~~~L~~lkd~~g~~~--~~--~~~~~~l~G~pVv~s~~~~~~~-i~~g--D~s~~~i~~~~~l~i~~~~~~~~~~ 808 (836) ++|+.+..|.......|... ++ +.++.+|.+.|..........+ +++- +...+.+..-..+. ..+- .... T Consensus 215 L~p~~~~~L~~~~~~~~~t~l~~lk~~~~~l~I~~~pel~~ag~~g~~~~v~y~~~~~~~~~~v~~~~~--~~~~-e~~~ 291 (319) T protein:vir:10 215 IPPSMRKVLAIRMPETTMSYLDYFKSQNSGIEIDSIAELEDIDGAGTKGVLVYEKNPMNMSIEIPEAFN--MLPA-QPKD 291 (319) T ss_pred ecHHHHHhhhcccCCCCeeHHHHHHHhcCCceEEEeeeecccCCCcceEEEEEecCCceEEEecCccee--eeee-eecC Confidence 99999999865443333211 11 1233455555554433222222 2222 12222222222222 2211 1112 Q ss_pred CcEEEEEEEEec-cEEEcccceEEEeec Q lcl|NC_016164. 809 GSVRVTALQDVD-VAVRHPEAFCRGNDN 835 (836) Q Consensus 809 ~~~~~r~~~r~d-~~v~~p~Af~~l~~A 835 (836) -.+.+....|++ +-+++|.||+.++-= T Consensus 292 l~~~~~~~~r~~Gv~i~~P~ai~~~dGI 319 (319) T protein:vir:10 292 LHFKVPCTSKCTGLTIYRPMTIVLITGV 319 (319) T ss_pred ceEEEeeeeeeEEEEEEccceeEeeecC Confidence 224455566665 578889999998855 No 157 >protein:vir:80068 Length: 301 # NCBI annotation: gp8 # Family: family:all:463 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468712;genbank:gi:157325292;genbank:GeneID:5601759 Probab=98.80 E-value=3.7e-09 Score=66.78 Aligned_cols=269 Identities=12% Similarity=0.040 Sum_probs=159.8 Q ss_pred ccccccccccc--chhhHHHHHHHHHhhhhhhhhccee--eecCCceEEEEEecCCceeeeeccCcc-cccccccceeEE Q lcl|NC_016164. 563 DTASAAGDLVF--TDGRPGSFIELLRNRLALNTLGVTM--LTGLQGPVAIPRQTGAATAYWVAEGGD-PTESQPSVDQVA 637 (836) Q Consensus 563 ~~~~~~g~~vv--p~~~~~~ii~~l~~~~~l~~l~~~~--~~~~~~~~~~p~~~~~~~a~~v~Eg~~-~~~~~~~~~~it 637 (836) ..+.+.|.++. -+.+...+++.+.+....+++.... .......+.+........+.+++.++. +|..+..++... T Consensus 1 ~~~~~~g~f~~~~l~~id~~v~e~~~~~l~~r~l~~v~~~~~~~~~~~~~~~~~~~G~~~~~~~~~~dip~~~~~~~~~~ 80 (301) T protein:vir:80 1 MQGKITATIEARDLQAIDNVIYEPKQEELTARSVFPQKFDVNEGAESYSFDVMTRSGAAKIIANGADDLPLVDVDMVRKS 80 (301) T ss_pred CCccccchhhHHHHHHHHHHHHHhhhhhhhhhhhcccccCCCCceEEEEEeeeccceeEEEecCcccccccccccceeEE Confidence 11112222221 1234456777777777777764322 222334566666667777888887654 688888899999 Q ss_pred eeeeeeeeeehhHHHHHhcc---hhHHHHHHHHHHHHHHHHHHHHHHHhhcCCccccccccccccccccccccc------ Q lcl|NC_016164. 638 LVAKTLGAYTEFSRRLMLQS---SIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTENFGAT------ 708 (836) Q Consensus 638 ~~~~t~~~~i~ISrelL~ds---~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~aa~------ 708 (836) ..+..++.-+.++.+=|... ..++...-....++++++++|+.+|+|+.. ....||+|..++.....++. T Consensus 81 ~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~f~G~~~-~g~~GLlN~p~~~~~~~~~~~~~~~~ 159 (301) T protein:vir:80 81 VPIYSIGIGLSYTIQDLRAARMQGTTVDAAKATTVRRAIAEKENSIAFRGEKK-YAIKGAFEATGIQIDVSPTTGVGNVS 159 (301) T ss_pred EEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEeeeccc-ccceeeecCCCcccccccCccccccc Confidence 99999999999888655543 456778888889999999999999999864 34689999887654322111 Q ss_pred -------chhHHHHHHHHHHHhhh-ccccCccEEEecHHHHHHHHHHhhccCccc----cc--cCCCCeecceeeEeeCc Q lcl|NC_016164. 709 -------NPTYVELVSMESKVAAD-NADIGAMSYLTNSTLYGGFKTTEKATSTAQ----FV--LEPGGTVNGYNVVRSNQ 774 (836) Q Consensus 709 -------~~t~~~l~~a~~~l~~~-~~~~~~~~~vmnp~~~~~L~~lkd~~g~~~----~~--~~~~~~l~G~pVv~s~~ 774 (836) .-.+++|.+++.++..+ +....+..++++|+.+..|......++.+. ++ +.+..+|.+.|...... T Consensus 160 ~w~~~t~~ei~~di~~~~~~l~~~s~g~~~p~~L~L~p~~~~~L~~~~~~~~~~~tvl~~l~~~~~~~~I~~~p~L~~~g 239 (301) T protein:vir:80 160 KWEKKTAEQIIDEIGEAHTKITVLPGYGTASLKLCLPPKQFELINKKRYSNEDSRSVLKVLQDNAWFSAIVRVPDLAGMG 239 (301) T ss_pred ccccCCHHHHHHHHHHHHHHHHHhcCceecccEEEecHHHHHhhhhccccCCCCeeHHHHHHHHcCcceEEEcceeccCC Confidence 12368899999988664 233466789999999999976543332221 11 12333444444444332 Q ss_pred cccceE--EEEe-hhceEEEeecceEEEEecccccccCcEEEEEEEEe-ccEEEcccceEEEeec Q lcl|NC_016164. 775 VANGDV--FFGV-WNQMIMGMWGALDIQVNPYALDKSGSVRVTALQDV-DVAVRHPEAFCRGNDN 835 (836) Q Consensus 775 ~~~~~i--~~gD-~s~~~i~~~~~l~i~~~~~~~~~~~~~~~r~~~r~-d~~v~~p~Af~~l~~A 835 (836) ....+. ++-+ ...+.+..-..+. ..+- ....-.+......|+ ++-+++|.||+.++-= T Consensus 240 ~~g~~~~v~~~~~~d~~~~~v~~~~~--~~~~-e~~~~~~~~~~~~r~~Gv~i~~P~ai~~~~GI 301 (301) T protein:vir:80 240 TAGSDSFAVIHDSNETAELIIPMDIT--RHPE-EYSFPRTKVPFEERTAGVVVRFPAAIVRVDGI 301 (301) T ss_pred CCcccEEEEEecCCcEEEEEecCcee--eecc-eecCceeEeeeeeeeEEEEEEccceEEEEecC Confidence 222222 2221 2222222222222 1111 111112334445666 5688999999998855 No 158 >protein:vir:7019 Length: 401 # NCBI annotation: major capsid protein # Family: family:all:2806 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853592;genbank:gi:31711674;genbank:GeneID:1481800 Probab=98.77 E-value=4.2e-10 Score=71.97 Aligned_cols=283 Identities=14% Similarity=0.094 Sum_probs=151.8 Q ss_pred hhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEecCCceeeeeccCccccc Q lcl|NC_016164. 549 ILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQTGAATAYWVAEGGDPTE 628 (836) Q Consensus 549 ~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~ 628 (836) +..+.. ..+.... ....--.+.=+.+.+++.......+.++.+...+.-..+..+.+|+. +..+++...-|++... T Consensus 1 Ms~~n~-~t~~~~~--~sg~~~al~Le~f~GeV~taF~~~si~~~~~~vRti~~gkS~qf~~~-G~s~~~~~~pG~~ld~ 76 (401) T protein:vir:70 1 MSTPNN-LTNVAVS--ASGEVDSLLIEKFNGKVNEQYLKGENIMSYFDVQTVTGTNTVSNKYL-GETELQVLAPGQSPAA 76 (401) T ss_pred CCCCcc-ccccccc--cccchhHhHHhHhcchHHHHHHHHhhhcccceeeeecccceEEEEEe-eeeEeeeecCCCCcCC Confidence 100000 0000000 00111224456778888888888888888766655556678888877 5566666666666655 Q ss_pred ccccceeEEeeeeeee-eeehhHH-HHHhcchhH-HHHHHHHHHHHHHHHHHHHHHHh-----hc----CCccccccccc Q lcl|NC_016164. 629 SQPSVDQVALVAKTLG-AYTEFSR-RLMLQSSID-VEQMVRTELATVIALEIDRAALY-----GL----GSNSQPEGLKF 696 (836) Q Consensus 629 ~~~~~~~it~~~~t~~-~~i~ISr-elL~ds~~~-l~~~i~~~l~~a~a~~~d~~il~-----G~----Gt~~~p~Gi~~ 696 (836) ..+.-++..+.+.++- ....|-. +-. ++.++ +.+.+.+++++++++..|+.++. |- +....|.|.-+ T Consensus 77 ~~~~~dK~~ItID~lL~a~~~V~dlDe~-q~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~aa~ana~~~~~~p~~~~~ 155 (401) T protein:vir:70 77 TSTQADKNQLVIDATVIARNTVAHLHDV-QGDIDSLKPKLATNQAKQLKRMEDEMLIQQMMLGGIANTQAKRTNPRVKGH 155 (401) T ss_pred CCcccccEEEEeCceeehhhhhhhHHHH-HhcccccchHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccCCCcCCC Confidence 5566666666665542 1222211 112 23455 67899999999999999986532 21 11122322211 Q ss_pred ccccccccc-----cccchhHHHHHHHHHHHhhhccccCccEEEecHHHHHHHHHHh-----hc--cCccccccCCCCee Q lcl|NC_016164. 697 VTGINTENF-----GATNPTYVELVSMESKVAADNADIGAMSYLTNSTLYGGFKTTE-----KA--TSTAQFVLEPGGTV 764 (836) Q Consensus 697 ~~~~~~~t~-----aa~~~t~~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lk-----d~--~g~~~~~~~~~~~l 764 (836) ...+..... ....--.+.+..+...|..++......++++.|..|..|.... +. .+.+.|..+....+ T Consensus 156 G~~i~v~~~~~~~~~~~~~l~~ai~dA~~~LdEkdVP~~r~vvl~pp~~Ys~Ll~~d~L~nrd~~~s~~g~~~~G~v~~v 235 (401) T protein:vir:70 156 GFSINVEVAEGEALVNPQYVMAAVEFALEQQLEQEVDISDVAILMPWRYFNVLRDADRIVDKTYTISQSGATIQGFTLSS 235 (401) T ss_pred ceEEeccccccccccCHHHHHHHHHHHHHHHHhcCCCccceEEEcCHHHHHHHHhcCcccchhhccccCCccccceEEEE Confidence 111111111 1111134456777777777777666567777777776664421 11 11223444444578 Q ss_pred cceeeEeeCccccce---------------EE--EEehhce--EEEeecceE------EEEecccccccCcEEEEEEEEe Q lcl|NC_016164. 765 NGYNVVRSNQVANGD---------------VF--FGVWNQM--IMGMWGALD------IQVNPYALDKSGSVRVTALQDV 819 (836) Q Consensus 765 ~G~pVv~s~~~~~~~---------------i~--~gD~s~~--~i~~~~~l~------i~~~~~~~~~~~~~~~r~~~r~ 819 (836) .|+||+.|+.+|.+. .| -||++.. .++.+..+- +..+-+.+-.+-...+-+++-+ T Consensus 236 aGv~Vv~SnnlP~~a~~it~~~ls~a~~G~~y~~~~d~s~~~~v~f~~~Av~tvk~~~lt~~~~~d~r~~~~~id~~~a~ 315 (401) T protein:vir:70 236 YNCPVIPSNRFPKYSQGQTHHLLSNEDNGYRYDPLPAMNGAIAVLFTADALLVGRSIDVTGDIFYEKKEKTYYIDTFMAE 315 (401) T ss_pred eceEEEeeccccccccccccccccccCCCccCCCCccccceeEEEEehhheEEEEeeccccchhhhhhhhHHHHHHHHHh Confidence 999999999998531 11 2566532 222222211 1111111112223345567788 Q ss_pred ccEEEcccceEEEeecC Q lcl|NC_016164. 820 DVAVRHPEAFCRGNDNL 836 (836) Q Consensus 820 d~~v~~p~Af~~l~~A~ 836 (836) +.+++||+|..+++-+- T Consensus 316 g~g~~RPeaa~vv~~k~ 332 (401) T protein:vir:70 316 GAIPDRWEAVSVVTTKR 332 (401) T ss_pred CCcccchhheEEEeecC Confidence 99999999998886444 No 159 >protein:vir:9927 Length: 295 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795689;genbank:gi:28876459;genbank:GeneID:1258000 Probab=98.75 E-value=2.5e-09 Score=67.76 Aligned_cols=256 Identities=9% Similarity=-0.028 Sum_probs=135.7 Q ss_pred hhccccccccccc-chhhHHHHHHHHHhhhhhhhhc--ceeeec-CCceEEEEEecCCceeeeeccCcccccccccce-- Q lcl|NC_016164. 561 VVDTASAAGDLVF-TDGRPGSFIELLRNRLALNTLG--VTMLTG-LQGPVAIPRQTGAATAYWVAEGGDPTESQPSVD-- 634 (836) Q Consensus 561 ~~~~~~~~g~~vv-p~~~~~~ii~~l~~~~~l~~l~--~~~~~~-~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~-- 634 (836) +.....+...-+. |+.+ +-+.+.-+....|.++. .|..|. ....+++|+..-...+.-|+||.++|.++++.. T Consensus 1 mAe~nlt~~~dL~~~~si-dfv~~f~~~i~~L~~~Lgi~r~~p~a~G~tIt~pK~~~tgda~dVaEGe~Iplskvt~~~~ 79 (295) T protein:vir:99 1 MAEKNLNTMADLGDIKSI-DFVNKFSKNINDLLKLLGVTRRETLTNDLKIQTYKWEVTLDQTDPGEGETIPLSKVTRTKD 79 (295) T ss_pred CCCcccccHhhccCceee-hhhHHhhhhHHHHHHHhccccccccccCCeEEeeeeeeecccccccCCcccchhhheeeee Confidence 1111111111121 2211 11222222222333321 233343 356899999988888999999999999999865 Q ss_pred -eEEeeeeeeeeeehhHHHHHhc-chhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccccccccccchhH Q lcl|NC_016164. 635 -QVALVAKTLGAYTEFSRRLMLQ-SSIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTENFGATNPTY 712 (836) Q Consensus 635 -~it~~~~t~~~~i~ISrelL~d-s~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~aa~~~t~ 712 (836) ..+++.+|+++.+ |.|++.. ..-+....-.++|..+++.+++..||.-..++. .+.+..+-...+ T Consensus 80 ~t~t~kikK~rK~t--TdEAIqlsGygdpvgead~qL~~~ia~kId~D~~~~lktat-----------~t~tg~~lq~a~ 146 (295) T protein:vir:99 80 KDYTVKWFKKRRAT--TAEAIARHGAARAITEADKRIMRELQNGIKDAFFTFLKTKP-----------TKVKGVGLQKAL 146 (295) T ss_pred eeeEEEeeeecccc--cHHHHHhcCCCchhHHHHHHHHHHHHHhhhHHHHHHhccCc-----------eeeehhhHHHHH Confidence 4778889998865 9999853 334577889999999999999999997654321 111111111133 Q ss_pred HHHHHHHHHHhhhccccCccEEEecHHHHHHHHHHhhcc-Ccc-ccccCCCCeeccee-eEeeCccccceEEEEehhceE Q lcl|NC_016164. 713 VELVSMESKVAADNADIGAMSYLTNSTLYGGFKTTEKAT-STA-QFVLEPGGTVNGYN-VVRSNQVANGDVFFGVWNQMI 789 (836) Q Consensus 713 ~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~~-g~~-~~~~~~~~~l~G~p-Vv~s~~~~~~~i~~gD~s~~~ 789 (836) +.+.+....+-..+ ..+.++++||.+...++.-..-+ .+. .|-..---.++|.. |+.|..+|.|.++.--..++. T Consensus 147 a~~~~al~~f~Ee~--~~~~V~FVnP~D~a~yl~~A~~~~~~a~~fG~~~L~nfLG~q~II~S~kv~~G~~~aT~~~Ni~ 224 (295) T protein:vir:99 147 SASWAKLATFNEFE--GSPLVSFVSPLDVANYLGDTKVGADASNVFGMTLLKNFLGMQNVIVMPSVPEGKIYSTAVENLV 224 (295) T ss_pred HHhhhhhhhccccc--CCceEEEEehHHHHHHHhccccccchhhhhhhhhhhhhhccceEEEcccCCCceEEEeeccceE Confidence 33333333322222 23568999999988876543222 111 01000001389997 999999999987765444443 Q ss_pred EE---ee-cceEEEEecccccccCcEEEEEEEE-------------eccE---EEcccceEEEeecC Q lcl|NC_016164. 790 MG---MW-GALDIQVNPYALDKSGSVRVTALQD-------------VDVA---VRHPEAFCRGNDNL 836 (836) Q Consensus 790 i~---~~-~~l~i~~~~~~~~~~~~~~~r~~~r-------------~d~~---v~~p~Af~~l~~A~ 836 (836) ++ .. +++.- -..+..|.+.+....+ +.+. +-++.++++.+.-. T Consensus 225 ~ay~~~~~g~l~~----~f~~~~D~tglIg~~h~~~~~~~t~et~~~~~~~lfpE~~dgiv~~tI~~ 287 (295) T protein:vir:99 225 FASLNVKGGDLGG----LFADFTDETGLIAAARNRQLSNLTYESVFFGANVLFAEIPEGVVEATIEA 287 (295) T ss_pred EEEecCCchhhhh----hhhhccCcccceEEEeccccceeeehhhhHhHHHhcccccceEEEEEEec Confidence 32 22 22221 1112222222222111 1111 22345666666533 No 160 >protein:vir:105645 Length: 400 # NCBI annotation: putative major capsid protein # Family: family:all:2806 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425009;genbank:gi:83571757;uniprot:Q2WC43;genbank:GeneID:3837286 Probab=98.74 E-value=1e-09 Score=69.94 Aligned_cols=282 Identities=15% Similarity=0.078 Sum_probs=154.5 Q ss_pred hhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEecCCceeeeeccCccccc Q lcl|NC_016164. 549 ILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQTGAATAYWVAEGGDPTE 628 (836) Q Consensus 549 ~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~ 628 (836) +..+. ...+.... ....--.+.=+.+.+++.......+.++.+...+.-..+..+.+|+. +..+++...-|.+..- T Consensus 1 Ms~~n-~~t~p~~~--gsg~~~aL~Le~f~GeV~taF~~~si~~~~~~vRtI~~gkS~qf~~l-G~s~a~y~~pG~~ldg 76 (400) T protein:vir:10 1 MSTPN-NLTNVAVS--ASGEVDSLLIEKFNGKVNEQYLKGENIMSYFDVQTVTGTNTVSNKYL-GETELQVLAPGQSPAA 76 (400) T ss_pred CCCCc-cccccccc--cccchhhhHHhHhcchHHHHHHHHhhhcccceeeeecccceEEEEEe-eeeEEeeecCCCCcCC Confidence 00000 00000000 00111224456788888888888888888766655556678888877 6667777777777654 Q ss_pred ccccceeEEeeeeeee-eeehhHH--HHHhcchhH-HHHHHHHHHHHHHHHHHHHHHHh----hcCCc-----ccccccc Q lcl|NC_016164. 629 SQPSVDQVALVAKTLG-AYTEFSR--RLMLQSSID-VEQMVRTELATVIALEIDRAALY----GLGSN-----SQPEGLK 695 (836) Q Consensus 629 ~~~~~~~it~~~~t~~-~~i~ISr--elL~ds~~~-l~~~i~~~l~~a~a~~~d~~il~----G~Gt~-----~~p~Gi~ 695 (836) ..+.-++..+.+.++= ....|-. +.+ +..+ +.+.+.+++++++++..|+.++. +.-++ +.|.|.- T Consensus 77 ~~~~~dk~~ItIDtLL~a~~~V~dlDd~q--~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~a~~a~t~~~~~~~~g~~ 154 (400) T protein:vir:10 77 TSTQADKNQLVIDATVIARNTVAHLHDVQ--GDIDSLKPKLATNQAKQLKKMEDEMLIQQMLLGGIANTQAKRTNPRVKG 154 (400) T ss_pred CCcccCcEEEEeCceeeecchhhhHHHHh--hccccccHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCCccc Confidence 5566666666665542 2222222 322 3455 78999999999999999986652 21111 1222332 Q ss_pred cccccccccccccc-hh----HHHHHHHHHHHhhhccccCccEEEecHHHHHHHHHHh-----hcc--CccccccCCCCe Q lcl|NC_016164. 696 FVTGINTENFGATN-PT----YVELVSMESKVAADNADIGAMSYLTNSTLYGGFKTTE-----KAT--STAQFVLEPGGT 763 (836) Q Consensus 696 ~~~~~~~~t~aa~~-~t----~~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lk-----d~~--g~~~~~~~~~~~ 763 (836) +...+...+..... .+ .+.+..+...|...+......++++.|..|..|..-. +.. +...+..+.... T Consensus 155 ~g~s~~v~~~~~~~~~~~~~l~~A~~~A~~~LdEkdVP~~d~vvl~pp~~Ys~Ll~~dkLvnrdf~~s~~g~~~~g~v~~ 234 (400) T protein:vir:10 155 HGFSVNVEVNEGEALVNPQYVMAAVEFALEQQLEQEVDISDVAILMPWRYFNVLRDADRIVDKSYTISQSGATIQGFVLS 234 (400) T ss_pred cccceeecccccccccCHHHHHHHHHHHHHHHHhcCCCccceEEEcCHHHHHHHHhCCcccchhccccCCCccccceEEE Confidence 22222111111111 11 2345566666666666555567778788887765421 111 112233333346 Q ss_pred ecceeeEeeCccccce---------------E--EEEehhce--EEEeecce------EEEEecccccccCcEEEEEEEE Q lcl|NC_016164. 764 VNGYNVVRSNQVANGD---------------V--FFGVWNQM--IMGMWGAL------DIQVNPYALDKSGSVRVTALQD 818 (836) Q Consensus 764 l~G~pVv~s~~~~~~~---------------i--~~gD~s~~--~i~~~~~l------~i~~~~~~~~~~~~~~~r~~~r 818 (836) +.|+||+.|+.+|... . +-||++.. .++.+..+ .+..+-+.+-.+-...+.+++- T Consensus 235 v~Gv~Iv~Sn~lP~~a~~~~~~~lS~a~~G~~y~~t~d~s~~~av~F~~sAv~tvk~~~lt~~~~~d~r~~~~~id~~~a 314 (400) T protein:vir:10 235 SYNCPVIPSNRFPKYSQGQKHHLLSNEDNGYRYDPIAEMNGAIAVLFTADALLVGRSIDVIGDIFYEKKEKTYYIDTFMS 314 (400) T ss_pred EeceEEEeeCcCCcccCcccccccccCCCCccCCccccccceeEEEEehhheEEEEeeccccccccchhhHHHHHHHHHH Confidence 8999999999998532 1 23666542 22222222 1111111122233445667778 Q ss_pred eccEEEcccceEEEeecC Q lcl|NC_016164. 819 VDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 819 ~d~~v~~p~Af~~l~~A~ 836 (836) ++.++.||++..+++-+= T Consensus 315 ~G~g~~RPeaa~vv~~~~ 332 (400) T protein:vir:10 315 EGAIPDRWEAVSVVTTKR 332 (400) T ss_pred hCCcccchhheEEEEecC Confidence 999999999999998665 No 161 >protein:vir:104342 Length: 314 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398971;genbank:gi:81343955;genbank:GeneID:3778874 Probab=98.71 E-value=3.7e-09 Score=66.83 Aligned_cols=290 Identities=13% Similarity=0.010 Sum_probs=155.1 Q ss_pred hhhhhhhhhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhcccccccccccc--hhhHHHHHHHHHhhhhhhhhcceee Q lcl|NC_016164. 522 RAAFEAAAFEREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTASAAGDLVFT--DGRPGSFIELLRNRLALNTLGVTML 599 (836) Q Consensus 522 ~~~~~~~~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp--~~~~~~ii~~l~~~~~l~~l~~~~~ 599 (836) .+..+.. . .......+.+++ ......+|.++.. +.+...+++...+....+++..... T Consensus 1 ~~~~~~~-~-~~~~~~~~~~~~------------------~~~~d~~~~fl~~ql~~id~~v~e~~~~~~~~~~~i~v~~ 60 (314) T protein:vir:10 1 MAIKFDA-E-QAKITTHLEQMG------------------VEKADAAGIWAVSQLTAALNRAYEKEYAENSVVNIFPVTN 60 (314) T ss_pred CccchHH-H-HHHHHHHHHhhc------------------ccchhhhHHHHHHHHHHHHHHHhhhhccccccceeecccc Confidence 0000000 0 000000000000 0001111222222 1233345555554444444432211 Q ss_pred --ecCCceEEEEEecCCceeeeeccCcc-cccccccceeEEeeeeeeeeeehhHHHHHhcc---hhHHHHHHHHHHHHHH Q lcl|NC_016164. 600 --TGLQGPVAIPRQTGAATAYWVAEGGD-PTESQPSVDQVALVAKTLGAYTEFSRRLMLQS---SIDVEQMVRTELATVI 673 (836) Q Consensus 600 --~~~~~~~~~p~~~~~~~a~~v~Eg~~-~~~~~~~~~~it~~~~t~~~~i~ISrelL~ds---~~~l~~~i~~~l~~a~ 673 (836) +..-..+.+........+.|++.++. +|..+..+++....+..++..+.+|.+=|... ..++...-....++++ T Consensus 61 ~~~~~~et~~~~~~e~~G~a~~~~d~~~dip~vd~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~ 140 (314) T protein:vir:10 61 EIPGHAKYFEYPEFDGVGIAQIIADYSDDLPLVDAFMTEKQGKVFRFGNAFLISTDEIKAGAATGQSLSARKQALAFEAH 140 (314) T ss_pred CCCCceeEEEeeeeccccceeeeCCcccccceeecccceeEEEEEEEEeeEEecHHHHHHHHHhCCChHHHHHHHHHHHH Confidence 11123566666767777888887544 78888899999999999999999987555543 4567788888889999 Q ss_pred HHHHHHHHHhhcCCcccccccccccccccccccccc----hhHHHHHHHHHHHhhhc-cccCccEEEecHHHHHHHHHHh Q lcl|NC_016164. 674 ALEIDRAALYGLGSNSQPEGLKFVTGINTENFGATN----PTYVELVSMESKVAADN-ADIGAMSYLTNSTLYGGFKTTE 748 (836) Q Consensus 674 a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~aa~~----~t~~~l~~a~~~l~~~~-~~~~~~~~vmnp~~~~~L~~lk 748 (836) ++.+|+.+|+|+.. ....||+|..+++..+.++.. -.++||.+++.++..+. +...+..++++|..+..|.... T Consensus 141 ~~~~n~i~f~G~~~-~g~~GLlN~p~v~~~~~~~~WaT~~ei~~Di~~~~~~l~~~s~g~~~p~~l~Lpp~~~~~L~~~~ 219 (314) T protein:vir:10 141 DNLLDKLVWSGSAP-HGIVSVFDQPNINNVVATPNWSVPQNAIDDVTAMIDAVESSTQGLHHVTDILLPASARRVMQGLV 219 (314) T ss_pred HHhhceEEEeeccc-ccceeEeecCCCccccCCCCcccHHHHHHHHHHHHHHHHHhcCccccceeEEecHHHHHhhcccc Confidence 99999999999754 346799998776544333222 23778899998887652 2345668999999988775443 Q ss_pred hccCccc--cc--cCCCCeecceeeEeeCccccceEEE---EehhceEEEeecceEEEEecccccccCcEEEEEEEEe-c Q lcl|NC_016164. 749 KATSTAQ--FV--LEPGGTVNGYNVVRSNQVANGDVFF---GVWNQMIMGMWGALDIQVNPYALDKSGSVRVTALQDV-D 820 (836) Q Consensus 749 d~~g~~~--~~--~~~~~~l~G~pVv~s~~~~~~~i~~---gD~s~~~i~~~~~l~i~~~~~~~~~~~~~~~r~~~r~-d 820 (836) +..|... ++ +.++-+|-+.|-..+......+.++ -+...+.+..-..+. ..+. ....-.+.+....|+ + T Consensus 220 ~~~~~tvl~~l~~n~~~l~I~~~~el~~ag~~g~~~~v~y~~~~~~~~~~vp~~~~--~l~~-e~~~~~~~~~~~~r~~G 296 (314) T protein:vir:10 220 PQTNLSYGELFTRNNPGLTIRFLQFLDNYDGAGGKAALAFEKSPLNMSIEIPEVTN--VLPA-QPKDLHFRYPVTSKATG 296 (314) T ss_pred cCCCccHHHHHHHhCCCcEEEEcccccccCCCcceEEEEEecCCcEEEEecCccce--eecc-eecCceEEEcceeeeEE Confidence 3223211 11 1233345555544433322222221 122222222212222 1111 111222444456676 4 Q ss_pred cEEEcccceEEE---eec Q lcl|NC_016164. 821 VAVRHPEAFCRG---NDN 835 (836) Q Consensus 821 ~~v~~p~Af~~l---~~A 835 (836) +-+++|.||+.+ +.| T Consensus 297 v~i~~P~ai~~~dGI~~~ 314 (314) T protein:vir:10 297 LIVYRPLTMAVIKGITFA 314 (314) T ss_pred EEEECcceeEeeeeeecC Confidence 678899999975 466 No 162 >protein:vir:79642 Length: 329 # NCBI annotation: HsbB # Family: family:all:463 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285525;genbank:gi:148734508;genbank:GeneID:5220000 Probab=98.64 E-value=1.1e-08 Score=64.18 Aligned_cols=300 Identities=10% Similarity=-0.049 Sum_probs=160.2 Q ss_pred hhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhcccccccccccc--hhhHHHHHHHHHhhhhhhhhccee--eecCCc Q lcl|NC_016164. 529 AFEREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTASAAGDLVFT--DGRPGSFIELLRNRLALNTLGVTM--LTGLQG 604 (836) Q Consensus 529 ~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp--~~~~~~ii~~l~~~~~l~~l~~~~--~~~~~~ 604 (836) +....+.+.. +.- ................+......+.++.. +.+...+++...+....+++.... .+.... T Consensus 1 ~~~~~~~~~~-~~d---~~~~~~~a~~~~~~~~~~~~~~~~~f~~~ql~~id~~v~e~~~~~l~~~~~i~i~~~~~~~~~ 76 (329) T protein:vir:79 1 MRGNIMSKEM-KYD---EFEANVIANHMQLRGAKNDASDMGIWTSQELHKIKAQAYEKEYPAGSALRVFPVTSELSDTDK 76 (329) T ss_pred Cccchhhhhh-ccc---hhhhhhHhhhcccccceeccchhhHHHHHHHHHHHHHHHhhhhcccchhhhcccccCCCCcee Confidence 0000000000 000 00000000000000011111111222221 223455666666666666654322 222223 Q ss_pred eEEEEEecCCceeeeeccC-cccccccccceeEEeeeeeeeeeehhHHHHHhcc---hhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_016164. 605 PVAIPRQTGAATAYWVAEG-GDPTESQPSVDQVALVAKTLGAYTEFSRRLMLQS---SIDVEQMVRTELATVIALEIDRA 680 (836) Q Consensus 605 ~~~~p~~~~~~~a~~v~Eg-~~~~~~~~~~~~it~~~~t~~~~i~ISrelL~ds---~~~l~~~i~~~l~~a~a~~~d~~ 680 (836) .+.+........+.|++.+ ...|..+..+.+....+..++..+.++.+=|..+ ..++...-....++++++.+|+. T Consensus 77 ~~t~~~~~~~G~a~~~~d~~~dip~vd~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i 156 (329) T protein:vir:79 77 TFEYQTFDKVGHAKIIADYTDDLSTVDALMTSEFGKVFRLGNAFLISIDEIKAGQRTGKSLSTRKANAAQNAHDQLVNHL 156 (329) T ss_pred EEEeeeeecceeeeeecCcccccceeecccceeEEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccE Confidence 5666677677778888764 4578888888888999999999988887655543 45677888888889999999999 Q ss_pred HHhhcCCccccccccccccccccccccc----------chhHHHHHHHHHHHhhhcc-ccCccEEEecHHHHHHHHHHhh Q lcl|NC_016164. 681 ALYGLGSNSQPEGLKFVTGINTENFGAT----------NPTYVELVSMESKVAADNA-DIGAMSYLTNSTLYGGFKTTEK 749 (836) Q Consensus 681 il~G~Gt~~~p~Gi~~~~~~~~~t~aa~----------~~t~~~l~~a~~~l~~~~~-~~~~~~~vmnp~~~~~L~~lkd 749 (836) +|+|+.. ....||+|..+++..+.++. .-.+++|.+++.++..+.. ...+..++++|..+..|..... T Consensus 157 ~f~G~~~-~g~~GLlN~p~v~~~~~~~~~~~~w~~kt~~ei~~di~~~~~~l~~~s~g~~~p~~L~Lpp~~~~~L~~~~~ 235 (329) T protein:vir:79 157 VFKGSKP-HKIISVFEHPNLTTINSAGWNNAAGTGKKPETAQDELEQAIEKIETLTNGQHRANMILIPPSMRKVLMVRMP 235 (329) T ss_pred EEeeccc-ccceeeecCCCccccccCCCCCccccccCHHHHHHHHHHHHHHHHHhcCceecccEEEecHHHHHHhhcccC Confidence 9999763 34579999888765443221 1246789999998877632 2456789999999988865443 Q ss_pred ccCccc--cc--cCCCCeecceeeEeeCccccce-EEEE--ehhceEEEeecceEEEEecccccccCcEEEEEEEEec-c Q lcl|NC_016164. 750 ATSTAQ--FV--LEPGGTVNGYNVVRSNQVANGD-VFFG--VWNQMIMGMWGALDIQVNPYALDKSGSVRVTALQDVD-V 821 (836) Q Consensus 750 ~~g~~~--~~--~~~~~~l~G~pVv~s~~~~~~~-i~~g--D~s~~~i~~~~~l~i~~~~~~~~~~~~~~~r~~~r~d-~ 821 (836) ..|.-. ++ +.++-+|-+.|-..+......+ +++- +...+.+..-..+. ..+- ....-.+.+....|++ + T Consensus 236 ~~~~tvl~~lk~~~~~l~I~~~~el~~ag~~g~~~~v~y~~~~~~~~~~vp~~~~--~l~~-q~~~~~~~v~~~~r~~Gv 312 (329) T protein:vir:79 236 ETTMSYLDYFKQQNGGITIESISELEDIDGAGTKAALVYEKDPMNMSIEIPEAFN--MLTA-QPKDLHFKVPCTSKCTGL 312 (329) T ss_pred CCCccHHHHHHHhCCCcEEEEcccccccCCCCceEEEEEecCCceEEEecCccee--eeec-eecCceEEEceeeeEEEE Confidence 333221 11 1233344444433332221112 2222 22223232222222 2221 1122224455566665 5 Q ss_pred EEEcccceEEEeecC Q lcl|NC_016164. 822 AVRHPEAFCRGNDNL 836 (836) Q Consensus 822 ~v~~p~Af~~l~~A~ 836 (836) -+++|.||+.++-=| T Consensus 313 ~i~~P~ai~~~dGI~ 327 (329) T protein:vir:79 313 TIYRPLTLVLIKGLV 327 (329) T ss_pred EEECcceeeeeeeee Confidence 788899999988666 No 163 >protein:vir:8843 Length: 317 # NCBI annotation: major head protein # Family: family:all:3919 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775251;genbank:gi:27476049;genbank:GeneID:2700597 Probab=98.61 E-value=1.5e-08 Score=63.47 Aligned_cols=273 Identities=10% Similarity=-0.046 Sum_probs=154.5 Q ss_pred hhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEecCC-ceeeeeccCccccccccccee- Q lcl|NC_016164. 558 RDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQTGA-ATAYWVAEGGDPTESQPSVDQ- 635 (836) Q Consensus 558 ~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~~~-~~a~~v~Eg~~~~~~~~~~~~- 635 (836) .+.-+.+..+....-.-+.+.+.|...-...+|+..+.-... ..+-.+.+....-. +...-..||++.+.....-.. T Consensus 1 ma~~~~~~~t~~~~g~~~dl~~~I~~isp~dTPf~S~i~~~~-a~~~~~~W~~d~l~~~~~~~~~EG~da~~~~~~~r~~ 79 (317) T protein:vir:88 1 MATPTNAVSTVEINGKREDLIDIIYNIAPYDTPFMSAIGKGV-ATAITHEWQTDELRQPGKNTRVEGEDATIKAGSFTTM 79 (317) T ss_pred CCccccceEeeeeeeeeechhhhheecCCccCcceeeecCce-ecccEEEEEeeecCCccccccccCcccccccccCCEE Confidence 111112222222223345566777766666777776644322 23335555543322 222334588877655433222 Q ss_pred EEeeeeeeeeeehhHHHHHhcchhH---HHHHHHHHHHHHHHHHHHHHHHhhcCC--------ccccccccccc------ Q lcl|NC_016164. 636 VALVAKTLGAYTEFSRRLMLQSSID---VEQMVRTELATVIALEIDRAALYGLGS--------NSQPEGLKFVT------ 698 (836) Q Consensus 636 it~~~~t~~~~i~ISrelL~ds~~~---l~~~i~~~l~~a~a~~~d~~il~G~Gt--------~~~p~Gi~~~~------ 698 (836) ..=-..-+...+.||.-+..-+..+ ..++=...-...+++-+|..+++|.-. ..+..||++.- T Consensus 80 ~~N~tQIf~k~v~VSgTa~av~~~G~~~ela~q~~kk~~EikrdmE~~li~g~~a~~~~~~t~~r~~~Gl~~~i~t~~~~ 159 (317) T protein:vir:88 80 LNNYCQISDETLQVTGTADRVKKAGRKNELAYQLAKKSKELKLDMEYALVGAPQAKVQRNTTTPGQMANIFAYYKTNGSL 159 (317) T ss_pred eccEEEEEEeEEEEeehhhhhhhcCccchhHHHHHHHHHHHHHHHHHHHhcCeeeccCCCCccchhhhhHHHHhccCcee Confidence 2222234445555555443322233 233333444455677888899988521 12445665321 Q ss_pred ---cc-------c-cccccccchhHHHHHHHHHHHhhhccccCccEEEecHHHHHHHHHHhhccCccccccCCCCee--- Q lcl|NC_016164. 699 ---GI-------N-TENFGATNPTYVELVSMESKVAADNADIGAMSYLTNSTLYGGFKTTEKATSTAQFVLEPGGTV--- 764 (836) Q Consensus 699 ---~~-------~-~~t~aa~~~t~~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~~g~~~~~~~~~~~l--- 764 (836) +. + ..+.....++.++|.+++.++=.+++. +..++++|.....|..+...++..........++ T Consensus 160 ~~~g~~~~~~~~~~~t~~t~~~lte~~l~~~l~~i~~~Gg~--~~~i~v~a~~k~~i~~~~~~~~~~i~~~~~~~~~g~~ 237 (317) T protein:vir:88 160 GANGVAPVGDGSNTGTAGDLRLLTEDMLLNASESIWRNGGQ--ANSIQTSSSIKKAISKNMKGRATEITLDASDNRIAQT 237 (317) T ss_pred ccCccccccCCCccccccccccccHHHHHHHHHHHHhcCCC--CCEEEeChHHHHHHHHHhcCCceeEEEcccCeEEEEE Confidence 11 0 111223347888999999998887654 3467899999988888754334333222221111 Q ss_pred -------cc-eeeEeeCccccceEEEEehhceEEEeecceEEEEecccccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 765 -------NG-YNVVRSNQVANGDVFFGVWNQMIMGMWGALDIQVNPYALDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 765 -------~G-~pVv~s~~~~~~~i~~gD~s~~~i~~~~~l~i~~~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) +| ++++.+..+|++.+++.|++.+.+..-..+....... ..+.....++..+++.+.+|.|.+++..-. T Consensus 238 v~~~~tdfG~v~ii~~r~lp~~~~~~~D~~~~~l~~Lr~~~~e~laK---tGd~~k~~i~~E~tLe~~N~~a~a~i~~l~ 314 (317) T protein:vir:88 238 VDVYESDFGKYTIRANRWFHENTLFVFDPKMHSLCYLRPFFQHELAK---TGDSEKRQLLVEYTFRVNNEKSGALIRDVV 314 (317) T ss_pred EEEEEeCCeEEEEEeCCCCCCCeEEEEcccccceeecccceeeccCC---CcccceeEEEEEEEEEEcCccceeEEEEec Confidence 22 3678888999999999999998776655543332222 225567778889999999999999988666 No 164 >protein:vir:108303 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:1412 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552282;genbank:gi:160700607;genbank:GeneID:5758819 Probab=98.56 E-value=1.2e-07 Score=58.54 Aligned_cols=257 Identities=14% Similarity=0.093 Sum_probs=137.7 Q ss_pred ccccccccccchhhHHHHHHHHHhhhhhhhhcceeeec----CCceEEEEEecCCceeeeeccCcccccccccceeEEee Q lcl|NC_016164. 564 TASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTG----LQGPVAIPRQTGAATAYWVAEGGDPTESQPSVDQVALV 639 (836) Q Consensus 564 ~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~----~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~it~~ 639 (836) -..-...++-|+.+...+++.+++..++.+++.+.... .+..+++|+-. ...+.++......+++..++.+. T Consensus 1 m~~~~N~~ltp~iia~~~l~~l~~~lV~~~lv~r~y~~e~~~~GDTV~I~vp~----~~~v~dg~~~~~~~~te~~v~l~ 76 (418) T protein:vir:10 1 MAVQDNNLLTDDVIAKEALRLLKNNLVMAKCVYRNYEKTFGKVGDTIRLKLPY----RVKSASGRTLVKQPMVDQTIPFK 76 (418) T ss_pred CCccccccccHHHHHHHHHHHHHHhccchhhhcCCCchHHhhCCCEEEEeeCC----ceeecccCCccccccccceEEEE Confidence 11112344558999999999999999988876653322 23477777632 23344565666667777887777 Q ss_pred eeee-eeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccccccccccchhHHHHHHH Q lcl|NC_016164. 640 AKTL-GAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTENFGATNPTYVELVSM 718 (836) Q Consensus 640 ~~t~-~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~aa~~~t~~~l~~a 718 (836) +.+. ..-+.|+.+-...+..++...+.+..+.+++..+|..++.-- . +.-+..+ +.......|+++.++ T Consensus 77 id~~k~~~~~itD~e~a~~~~d~~~~~l~~A~~aLA~~vD~~ia~l~-~-----~a~~~~g----t~gt~~~~~~~i~~a 146 (418) T protein:vir:10 77 IAYQEHVGLEYTVKDKTLDIMQFSERYLKSGMVQIANQIDRSLALTL-K-----KAFHSSG----TPGVRPGAFIDFANA 146 (418) T ss_pred EecccccceeechHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHH-h-----hcccccc----cCCcCcchHHHHHHH Confidence 7444 345677766555566788888889999999999998876321 1 1101111 111222358999999 Q ss_pred HHHHhhhccccC-ccEEEecHHHHHHHHHHhhcc----C-ccccccCCCCeecceeeEeeCccccceE--------EEEe Q lcl|NC_016164. 719 ESKVAADNADIG-AMSYLTNSTLYGGFKTTEKAT----S-TAQFVLEPGGTVNGYNVVRSNQVANGDV--------FFGV 784 (836) Q Consensus 719 ~~~l~~~~~~~~-~~~~vmnp~~~~~L~~lkd~~----g-~~~~~~~~~~~l~G~pVv~s~~~~~~~i--------~~gD 784 (836) ...|...+.... .-.++++|..+..|....... + ...|..+.-+++.|+.|+.|+.+|..+. +.|- T Consensus 147 ~~~Ld~~~VP~~G~R~lVv~P~~~~~L~~~~~~~~~~~~~~~~lr~G~IG~i~GF~V~~S~nip~~tag~~~~t~~v~ga 226 (418) T protein:vir:10 147 GAKQTTYAVPQDGMRHAVLDPFTCASLSDEVTKLFKESMVEQAYKMGYRGNVAAYEVYESQNLPKHTVGDHGGTPLVNGT 226 (418) T ss_pred HHHHHhcCCCCCCceEEEeCHHHHHHHhhhccccccccccchhhheeeeeeeeceEEEEecCCCcccccccccceeeecc Confidence 999988877653 357789999987775322111 1 1123445557899999999999985321 1111 Q ss_pred h-hceEEEeecce-----EEEEeccccc-------------ccCcEEEEEEEEeccEEEcccce-EEEeecC Q lcl|NC_016164. 785 W-NQMIMGMWGAL-----DIQVNPYALD-------------KSGSVRVTALQDVDVAVRHPEAF-CRGNDNL 836 (836) Q Consensus 785 ~-s~~~i~~~~~l-----~i~~~~~~~~-------------~~~~~~~r~~~r~d~~v~~p~Af-~~l~~A~ 836 (836) . ....+...++- .+...+...| ......|++...+.-. -..++ +.+.-|+ T Consensus 227 ~~~~~~~~~~~~t~s~~g~l~~Gd~~ti~gv~~v~~~t~~~~~~~~~f~V~~~~~~~--~~~~~tv~i~p~~ 296 (418) T protein:vir:10 227 VVNGDTVGFDGGTASTTGFLKAGDVITFGGVFGVNPQNYETTGLLQEFVVLEDVDTD--AGGAGSIKISPSL 296 (418) T ss_pred cccceeEEEeecceeeccceeeccEEEECceeecccccccccccceEEEEEeecccc--ccCcceeEecccc Confidence 0 11111111000 0011111000 0011122222221000 00000 1111111 No 165 >protein:vir:106647 Length: 303 # NCBI annotation: ORF011 # Family: family:all:1178 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239493;genbank:gi:66395226;genbank:GeneID:4555801 Probab=98.54 E-value=9.4e-09 Score=64.60 Aligned_cols=266 Identities=14% Similarity=0.004 Sum_probs=132.6 Q ss_pred hhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeecC-CceE---EEEEecCCceeeeeccCcccccccc Q lcl|NC_016164. 556 LHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGL-QGPV---AIPRQTGAATAYWVAEGGDPTESQP 631 (836) Q Consensus 556 ~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~-~~~~---~~p~~~~~~~a~~v~Eg~~~~~~~~ 631 (836) +.... ..+....-+......+.+.|-.-+..-..+... .|..|.. ...+ ++|.......+..|+||+.+|.+++ T Consensus 1 M~~e~-nl~~~~dL~~a~siDF~~~f~~~i~~L~~~LGv-~r~~pla~Gt~iktyK~~~~~y~gda~dVaEGe~Iplskv 78 (303) T protein:vir:10 1 MSAEN-NLINVEALGKAKSIDFANKLGVGLNKLFEALAI-QNKIPMNVGSALKQYRFKVEDSEKPNGDVAEGDVIPLTKV 78 (303) T ss_pred CCCCc-CCcchhhcccceeehhhhhhhhhHHHHHHHhhh-hccccccCCceeeeeeeeceeeccccccccCCcccchhhh Confidence 00000 000000111112222333332222211111111 2222322 2334 4444445577888999999999998 Q ss_pred cce---eEEeeeeeeeeeehhHHHHHh-cchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccccccccc Q lcl|NC_016164. 632 SVD---QVALVAKTLGAYTEFSRRLML-QSSIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTENFGA 707 (836) Q Consensus 632 ~~~---~it~~~~t~~~~i~ISrelL~-ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~aa 707 (836) +.. ..+++++|+++.+ |.|++. ...-+....-.++|..+++.+++..||.-..+.. .+. .. ... T Consensus 79 t~~~~~t~~~~~kK~rK~t--TdEAIqlsGyg~aVgetd~qL~~~Iq~kIdnd~~~~lktaT-----~t~----~~-t~~ 146 (303) T protein:vir:10 79 TREQVDITELQFAKYRKST--SAEAIQAHGYDLAINQTDNEMIKYVQKKFRAKFFETLKSAI-----ENG----KR-TNK 146 (303) T ss_pred eeeecceEEEEeecccccc--cHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhhcc-----ccc----cc-ccc Confidence 854 5788999999966 999985 3344577889999999999999999986543210 000 00 112 Q ss_pred cchhHHHHHHHHHHHh----hhccccCccEEEecHHHHHHHHHHhhccCc-cccccCCCCeecceeeEeeCccccceEEE Q lcl|NC_016164. 708 TNPTYVELVSMESKVA----ADNADIGAMSYLTNSTLYGGFKTTEKATST-AQFVLEPGGTVNGYNVVRSNQVANGDVFF 782 (836) Q Consensus 708 ~~~t~~~l~~a~~~l~----~~~~~~~~~~~vmnp~~~~~L~~lkd~~g~-~~~~~~~~~~l~G~pVv~s~~~~~~~i~~ 782 (836) ..++.+.|.+++.... ...-+....++++||.+...++.-..-..+ ..|-..---.++|..|+.|..+|.|.+|. T Consensus 147 t~~s~~glq~Al~~~~~kl~~~~ed~~~~V~FvNP~Daa~yl~~A~i~~~~t~fG~n~L~nfLG~~II~S~kv~~G~~~~ 226 (303) T protein:vir:10 147 TKLSAENLQGALSKGRANLSVLLDDEITPIAFVNPNDTAEYLANGFINSTGAQFGVNLLTPYVGVKIVEFADVPQGEVWM 226 (303) T ss_pred eeecHHHHHHHHHhhhhhccccccccccEEEEEchHHHHHHhhcCCcchhhhhhhhhhhhhhhcceEEEeccCCCceEEE Confidence 2345667777766442 112223456999999998887543222111 11100000128899999999999998876 Q ss_pred EehhceEE---EeecceEEEEecccccccCcEEEEEEEE----------eccE---EEcccceEEEeecC Q lcl|NC_016164. 783 GVWNQMIM---GMWGALDIQVNPYALDKSGSVRVTALQD----------VDVA---VRHPEAFCRGNDNL 836 (836) Q Consensus 783 gD~s~~~i---~~~~~l~i~~~~~~~~~~~~~~~r~~~r----------~d~~---v~~p~Af~~l~~A~ 836 (836) --..++.+ -..+.+..... .+-.++|.+++.-... +.+. +-++.++++.+..= T Consensus 227 T~~~Ni~~ay~~~~g~l~~~f~-~t~D~tglIGv~h~~~~~~~t~eT~~~~~~~lfpE~~dgiv~~ti~~ 295 (303) T protein:vir:10 227 TVAENLNVAYANPRGELSRAFA-FATDATGFVGVLHDIQPQRLTSDTIYASAISMFPENIDAVIKVTIKK 295 (303) T ss_pred eeccceEEEEecCchhhhhhhh-hccccccceEEEeccccceeeehhHhHhHHHhcccccceEEEEEEec Confidence 54444333 22332221111 1111222222211111 1111 22234555554411 No 166 >protein:vir:95318 Length: 328 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512264;genbank:gi:89152431;genbank:GeneID:3952987 Probab=98.36 E-value=4.2e-08 Score=61.03 Aligned_cols=227 Identities=15% Similarity=0.152 Sum_probs=133.7 Q ss_pred hhhhhhhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEecCCceeeeeccC Q lcl|NC_016164. 544 VTPRGILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQTGAATAYWVAEG 623 (836) Q Consensus 544 ~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg 623 (836) -...++..+.... ....+-|......|++.+.+.+++.....-........+.+.+.++-|.+.|..=+ T Consensus 1 m~~~~~~~~TL~e-----------~Akr~~~d~~~~~VIE~l~~~n~IL~~lpf~e~n~gt~~~~~v~~~LP~~~fR~lN 69 (328) T protein:vir:95 1 MAVKGLTALTLAD-----------WGKRVDPNGKVDKIIELLGQTNPILQDMPFVEGNLPTGHRTTIRSGLPSATWRLLN 69 (328) T ss_pred CCccccccccHHH-----------HHhhhCcchhHHHHHHHHhccchhHhhcceeecccCCcceeeEeeccCCceeeecC Confidence 0000000000000 01112244455678999999888877655433333445777888899999999999 Q ss_pred cccccccccceeEEeeeeeeeeeehhHHHHHhcch--hHHHHHHHHHHHHHHHHHHHHHHHhhcCCc--ccccccc---c Q lcl|NC_016164. 624 GDPTESQPSVDQVALVAKTLGAYTEFSRRLMLQSS--IDVEQMVRTELATVIALEIDRAALYGLGSN--SQPEGLK---F 696 (836) Q Consensus 624 ~~~~~~~~~~~~it~~~~t~~~~i~ISrelL~ds~--~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~--~~p~Gi~---~ 696 (836) +.++.++.++.+++..++-+++.+.|.+.+..-.. -.+...-.....+++.++....||||+.+. ..+.||. + T Consensus 70 ~g~~~s~~tt~q~t~~l~ilgg~~eVDr~la~~~Gn~~~~ra~q~~~~~ka~~~~~~~~~iyGdsa~~p~~F~GL~~R~~ 149 (328) T protein:vir:95 70 YGVQPSKSTTVQVTDSVGMLETYAEVDKSLADLNGNTAEFRLSEDRAFIEAMNQQMAQTLFYGDSSVNPQQFMGLSSRYS 149 (328) T ss_pred CccCcccceeEEEEEEEEEEecceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCccCChhhhcchhhhcC Confidence 99999999999999999999999999997765432 123344455677899999999999996432 1223331 1 Q ss_pred ccc----ccccc-------------------------------------------------------------------- Q lcl|NC_016164. 697 VTG----INTEN-------------------------------------------------------------------- 704 (836) Q Consensus 697 ~~~----~~~~t-------------------------------------------------------------------- 704 (836) ... .+.+. T Consensus 150 ~~s~~~a~qiidaGgtg~~~TSi~~v~~g~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~g~~y~~y~~~~~w~~Gl~i~d 229 (328) T protein:vir:95 150 SLSAGNAQNIIDAGGTGTDNTSIWLVVWGENTVHGIFPKGKKAGIQMEDKGQVTLEDANGGKYEGYRTHYKWDNGLALRD 229 (328) T ss_pred ccccccccceeecccCCCCceEEEEEEEcCCeEEEecccccccCceeeecCceeeecCCCCeeeEEEEEEEeeeeeEEcC Confidence 000 00000 Q ss_pred -----------ccc--cchhHHHH----HHHHHHHhhhccccCccEEEecHHHHHHHHHHhhccCcccc-ccCCC----C Q lcl|NC_016164. 705 -----------FGA--TNPTYVEL----VSMESKVAADNADIGAMSYLTNSTLYGGFKTTEKATSTAQF-VLEPG----G 762 (836) Q Consensus 705 -----------~aa--~~~t~~~l----~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~~g~~~~-~~~~~----~ 762 (836) .+. .....+++ ..++.+++ +...++.+|+||.+....|+...-.-++..+ ..... - T Consensus 230 ~r~vvrI~NId~~~l~~~~~~~~l~~lm~~a~~~ip--~~~~~~~~~y~n~~v~~~L~~q~~~~~n~~~~~~~~~g~~~t 307 (328) T protein:vir:95 230 WRYVVRIANIDVSNLSEPSSAANIAKLMVKALHRIP--NRGMGRPVFYMNRTVGQALDLQSLEKTSLAISVKETEGEWWT 307 (328) T ss_pred cccEEEEecCcccccccccChhhHHHHHHHHHHHhc--cCCCCcceeehhHHHHHHHHHHHhcCcceeeeeeccCCccee Confidence 000 00011222 33333332 3456788999999999999875333233222 12222 2 Q ss_pred eecceeeEeeCccccceEEEE Q lcl|NC_016164. 763 TVNGYNVVRSNQVANGDVFFG 783 (836) Q Consensus 763 ~l~G~pVv~s~~~~~~~i~~g 783 (836) .++|+||-.++.+-.+...+. T Consensus 308 ~~~gipir~~dai~~tE~~vv 328 (328) T protein:vir:95 308 SFRGVPIRETDALLETEARVV 328 (328) T ss_pred EECCeEEEEEeeeecCccccC Confidence 478999999998876543222 No 167 >protein:vir:9875 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795637;genbank:gi:28876404;genbank:GeneID:1257935 Probab=98.30 E-value=1.5e-07 Score=57.94 Aligned_cols=267 Identities=13% Similarity=0.069 Sum_probs=128.6 Q ss_pred hhhhhhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeecC-CceE-EEEEecCCceeeeecc Q lcl|NC_016164. 545 TPRGILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGL-QGPV-AIPRQTGAATAYWVAE 622 (836) Q Consensus 545 ~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~-~~~~-~~p~~~~~~~a~~v~E 622 (836) .......|+... +.++.-+......+.+.|-.-+..-..+... .|..|.. ...+ .+|...-...+.-|+| T Consensus 1 ~~~~~~~~e~nl-------t~~~dl~~~~siDf~~~f~~~i~~L~~~LGv-~r~~pla~GstIkt~k~~~y~gda~dVaE 72 (296) T protein:vir:98 1 MVTSRTYPEENL-------IKSTDLKYPITIDVTNKFQENISKLLEMLGV-TRKISVSEGMTLKTYAGYDVTLAEGNVPE 72 (296) T ss_pred CCCccccCcCCC-------cchhhhhhhhhhhhHHHHhhhHHHHHHHhhh-cccccccCCCEEeeccceeeeeccccccC Confidence 000111111110 0000001111222333332222222122221 2334433 3456 4455677778889999 Q ss_pred Cccccccccccee---EEeeeeeeeeeehhHHHHHhc-chhHHHHHHHHHHHHHHHHHHHHHHHhhcCCccccccccccc Q lcl|NC_016164. 623 GGDPTESQPSVDQ---VALVAKTLGAYTEFSRRLMLQ-SSIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVT 698 (836) Q Consensus 623 g~~~~~~~~~~~~---it~~~~t~~~~i~ISrelL~d-s~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~ 698 (836) |+++|.++++... .+++++|+++.+ |.|++.. ..-+....-.++|..+++.+++..||.-..+.. T Consensus 73 Ge~Iplskvt~~~~~t~t~~ikK~rK~t--TdEAIqlsGyg~aVgetd~qL~~~iq~kId~d~~t~LktaT--------- 141 (296) T protein:vir:98 73 GEVIPLSKVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGT--------- 141 (296) T ss_pred CcccchhhheeeecceEEEEeecccccc--CHHHHHhhcCCchhHHHHHHHHHHHHHhhhHHHHHHHhccc--------- Confidence 9999999998654 778889999885 9999853 344577889999999999999999987654321 Q ss_pred ccccccccccchh---HHHHHHHHHHHhhhccccCccEEEecHHHHHHHHHHhhccCccccccCCCC-eecceeeEeeCc Q lcl|NC_016164. 699 GINTENFGATNPT---YVELVSMESKVAADNADIGAMSYLTNSTLYGGFKTTEKATSTAQFVLEPGG-TVNGYNVVRSNQ 774 (836) Q Consensus 699 ~~~~~t~aa~~~t---~~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~~g~~~~~~~~~~-~l~G~pVv~s~~ 774 (836) .+....+..+. ...+.++...+... +....++++||.+...+..-..-.-+-.| -..-. .++|.-|+.|.. T Consensus 142 --~t~~~t~~~lQ~Ala~~~~~l~~~fede--d~~~~V~FVnP~D~a~ylg~a~it~qt~f-G~tyl~nfLG~~II~S~k 216 (296) T protein:vir:98 142 --GTQDALGAGLQGALASAWGKLQVLFEDY--GSERAIVFANSLDVAEYIAKAGITTQTAF-GLTYLVDFTGTVIISTND 216 (296) T ss_pred --ceeeechhhHHHHHHHHhhhhhhhcccc--CCCceEEEEehHHHHHHhcCCccchhhee-chhhhhhccccEEEEcCc Confidence 01111111110 11122222333222 12357899999987765422111001011 00111 278989999999 Q ss_pred cccceEEEEehhceEEEe---e-cceEEEEecccccccCcEEEEEEE----------EeccE---EEcccceEEE--eec Q lcl|NC_016164. 775 VANGDVFFGVWNQMIMGM---W-GALDIQVNPYALDKSGSVRVTALQ----------DVDVA---VRHPEAFCRG--NDN 835 (836) Q Consensus 775 ~~~~~i~~gD~s~~~i~~---~-~~l~i~~~~~~~~~~~~~~~r~~~----------r~d~~---v~~p~Af~~l--~~A 835 (836) +|.|.+|.--..++.++. + +.+.....-++ .++|.+++.-.. -+.+. +-++.++++. +.| T Consensus 217 V~~G~~~~T~~~Ni~~ay~~~~~~~l~~~f~~~~-d~tglIGv~h~~~~~~~t~eT~~~~~~~lfpE~~dgiv~~tI~~~ 295 (296) T protein:vir:98 217 VTKGEIWATVPENIIFAYINPNNSELAKEFNLYG-DPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKVTLTPG 295 (296) T ss_pred CCCceEEEeeecceEEEeecccccchhhhhcccc-ccccceEEEeccccceeeehhHhHhHHHhcccccceEEEEEecCC Confidence 999988765544443332 2 11221111111 112222221111 11111 2223445443 355 Q ss_pred C Q lcl|NC_016164. 836 L 836 (836) Q Consensus 836 ~ 836 (836) | T Consensus 296 ~ 296 (296) T protein:vir:98 296 V 296 (296) T ss_pred C Confidence 5 No 168 >protein:vir:7324 Length: 335 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848215;genbank:gi:30387386;genbank:GeneID:2641870 Probab=98.14 E-value=2.1e-07 Score=57.23 Aligned_cols=229 Identities=14% Similarity=0.083 Sum_probs=126.8 Q ss_pred hhhhhhhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEecCCceeeeeccC Q lcl|NC_016164. 544 VTPRGILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQTGAATAYWVAEG 623 (836) Q Consensus 544 ~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg 623 (836) -...+...+..... ...+.+......|++.+.+.+.+....+-.-............++-|.+.|..=+ T Consensus 1 m~~~~~~a~TL~E~-----------Akr~~~d~~~~~IIE~l~~tneIL~~lpf~e~N~~tg~~~~vrt~LP~~~fR~lN 69 (335) T protein:vir:73 1 MALIGQTLPSLLDI-----------YNRTDKNGRIARIVEQLAKTNDILTDAIYVPCNDGSKHKTTIRAGIPEPVWRRYN 69 (335) T ss_pred CCcCCCCchhHHHH-----------HhhcCcchhHHHHHHHHhcCchHHhhcchhcccCCcccceeEEEecCCchhhhcC Confidence 00000111111100 0111123344458888888887766543322122111223345677888898888 Q ss_pred cccccccccceeEEeeeeeeeeeehhHHHHHhcch--hHHHHHHHHHHHHHHHHHHHHHHHhhcCCc--cccccc---cc Q lcl|NC_016164. 624 GDPTESQPSVDQVALVAKTLGAYTEFSRRLMLQSS--IDVEQMVRTELATVIALEIDRAALYGLGSN--SQPEGL---KF 696 (836) Q Consensus 624 ~~~~~~~~~~~~it~~~~t~~~~i~ISrelL~ds~--~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~--~~p~Gi---~~ 696 (836) ..++.++.++.+++...+-+++.+.|.+.+..-.. -.+...-.....+++.++....||||+-+. ..+.|| ++ T Consensus 70 ~g~~~s~~tt~qvt~~l~ilgg~~eVDr~La~~~Gn~a~~ra~e~~~~ikam~q~~~~~~iyGDsa~~p~~FdGL~kR~~ 149 (335) T protein:vir:73 70 QGVQPTKTQTVPVTDTTGMLYDLGFVDKALADRSNNAAAFRVSENMGKLQGFNNKVARYSIYGNTDAEPEAFMGLAPRFN 149 (335) T ss_pred CccccccceEEEEEEEEEEecchhhhhHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhccCCcCCChhhccchhhhhc Confidence 88999999999999999999999999986654322 123444555678999999999999996431 122333 10 Q ss_pred cc-------ccccccc---------------------------------------------------------------- Q lcl|NC_016164. 697 VT-------GINTENF---------------------------------------------------------------- 705 (836) Q Consensus 697 ~~-------~~~~~t~---------------------------------------------------------------- 705 (836) .. +.+.+.+ T Consensus 150 ~~st~~a~~a~~iIdaGGtG~~~TSi~~v~wg~~~~~giyPkG~kaGl~~~d~g~~~~~d~~G~~y~~~~~~~~w~~Gl~ 229 (335) T protein:vir:73 150 TLSTSKAASAENVFSAGGSGSTNTSIWFMSWGENTAHMIYPEGMVAGFQHEDLGDDLVSDGNGGQFRAYRDEFKWDIGLS 229 (335) T ss_pred CccccccCcccceeeccccccCceEEEEEEEcCCeeEEEcccCccccceeeeccceeeecCCCCEEeEEEeeeeeeeeeE Confidence 00 0000000 Q ss_pred ---------------cc---cchhHHHHHHHH-HHH---hhhccccCccEEEecHHHHHHHHHHhhccCcccc-ccCCCC Q lcl|NC_016164. 706 ---------------GA---TNPTYVELVSME-SKV---AADNADIGAMSYLTNSTLYGGFKTTEKATSTAQF-VLEPGG 762 (836) Q Consensus 706 ---------------aa---~~~t~~~l~~a~-~~l---~~~~~~~~~~~~vmnp~~~~~L~~lkd~~g~~~~-~~~~~~ 762 (836) +. .+-+-++|.+++ .++ ...+...+..+|+||.+....|+......++..+ .....+ T Consensus 230 i~d~r~vvRI~NIdvs~l~~d~~~~~~l~~lmi~a~~~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~n~~l~~~~~~g 309 (335) T protein:vir:73 230 VRDWRSISRICNIDVTTLTKDASTGADLISMMVDAYYARDVAMLGDGKEVIYANKTIHAWLHKQAMNAKNVNLTIEEYGG 309 (335) T ss_pred EeCcccEEEEeecccccccccccchhhHHhhHHHHHHHHhccCCCCCceEEEechHHHHHHHHHHhccCceeeeeeccCC Confidence 00 000112333332 222 2234566778999999999999875443343333 222222 Q ss_pred ----eecceeeEeeCccccceE-EEE Q lcl|NC_016164. 763 ----TVNGYNVVRSNQVANGDV-FFG 783 (836) Q Consensus 763 ----~l~G~pVv~s~~~~~~~i-~~g 783 (836) .++|+||-.++.+-.+.- +.+ T Consensus 310 ~~~t~~~gipir~~Dail~tE~~v~~ 335 (335) T protein:vir:73 310 KKIVSFLGIPIRRVDAILNTESAVTA 335 (335) T ss_pred ceeEEECCeEEEEEeeeecCcccccC Confidence 478999999998876542 223 No 169 >protein:vir:80446 Length: 367 # NCBI annotation: BcepGomrgp07 # Family: family:all:1522 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210227;genbank:gi:146329919;genbank:GeneID:5123555 Probab=98.13 E-value=8.1e-07 Score=53.98 Aligned_cols=265 Identities=10% Similarity=0.020 Sum_probs=127.9 Q ss_pred hhhhcccccccccccchhhHHHHHHHHHhhhhhhhhccee--------eecCCceEEEEEecCC-ceeeeeccCc---cc Q lcl|NC_016164. 559 DLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTM--------LTGLQGPVAIPRQTGA-ATAYWVAEGG---DP 626 (836) Q Consensus 559 a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~--------~~~~~~~~~~p~~~~~-~~a~~v~Eg~---~~ 626 (836) .......+.-..+++|+.+...+.+...+.+.|.+-+... ....+..+++|....- .....+.+.. +. T Consensus 1 M~~~~~~T~l~Dii~pEvF~~Yv~~~~~e~~~l~qSGiv~~d~~l~~~~~~gG~~v~iPf~~~L~g~~~n~~~d~~~~~~ 80 (367) T protein:vir:80 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNVEA 80 (367) T ss_pred CcchhhhhhhhhccchhhhhHHHhhhhhhhhhhhhcceeecCHHHHHHhhcCCCEEEeeeeccCCCCccccCCCCCcccc Confidence 0000011122346778888777766665555555432211 1223446667765432 2222333332 22 Q ss_pred ccccccceeEEeeeeeeeeee---hhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHH---hhcCCc---ccc-----c Q lcl|NC_016164. 627 TESQPSVDQVALVAKTLGAYT---EFSRRLMLQSSIDVEQMVRTELATVIALEIDRAAL---YGLGSN---SQP-----E 692 (836) Q Consensus 627 ~~~~~~~~~it~~~~t~~~~i---~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il---~G~Gt~---~~p-----~ 692 (836) +..+.+-++....+...+..+ .++..+ .- -+....|.++++..-.+.....+| .|.-.. +.. . T Consensus 81 t~~kittg~~~a~v~~r~kaw~~~Dla~~l-sG--~dpm~~Ia~qva~yW~r~~q~~Lla~L~Gvf~~~~a~~~~~~~~~ 157 (367) T protein:vir:80 81 PIDGLGSGEMKTTKTWLNKAYGAMDLTAEL-AG--SNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTR 157 (367) T ss_pred cccccccchheeeeehhcccchhhhHHHHh-hC--chHHHHHHHHHHHHhhhhhHHHHHHHHHHhhccccccchhhhhhh Confidence 333333333333333333333 344333 22 245567888887666665555444 222110 000 0 Q ss_pred cccc-------ccccccccc----cccchhHHHHHHHHHHHhhhccccCccEEEecHHHHHHHHHHhhccCcccccc--- Q lcl|NC_016164. 693 GLKF-------VTGINTENF----GATNPTYVELVSMESKVAADNADIGAMSYLTNSTLYGGFKTTEKATSTAQFVL--- 758 (836) Q Consensus 693 Gi~~-------~~~~~~~t~----aa~~~t~~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~~g~~~~~~--- 758 (836) +.++ ..++..+++ +...++...+.++...|-.+.. .-..++||+..+..|+.++-- .|+. T Consensus 158 ~~~~a~~~~~~~~~~~Dis~~t~~~~~~~s~~~~~~A~~~lGD~~~--~l~~i~mHS~V~~~L~~~~li----~~i~~sd 231 (367) T protein:vir:80 158 GRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVG--SIAAIAVHSMVYKRMTNNDEI----EFIPDSK 231 (367) T ss_pred hccccccccccCceeeeeeccCCCccceecHHHHHHHHHHhccccc--cccEEEEchHHHHHHHhcccc----ccccCCC Confidence 0000 111122221 1234678889999887755443 346899999999988766421 1111 Q ss_pred --CCCCeecceeeEeeCccccc---------eEEEEehhceEEEee---cceEEEEecccccccCcEEEEEEEEeccEEE Q lcl|NC_016164. 759 --EPGGTVNGYNVVRSNQVANG---------DVFFGVWNQMIMGMW---GALDIQVNPYALDKSGSVRVTALQDVDVAVR 824 (836) Q Consensus 759 --~~~~~l~G~pVv~s~~~~~~---------~i~~gD~s~~~i~~~---~~l~i~~~~~~~~~~~~~~~r~~~r~d~~v~ 824 (836) ..-++++|++|++++.||.. .++||. ..+.++.. ...+...++-..--.|+-.+..+.+ .++ T Consensus 232 ~~~~i~ty~G~~VIvDD~~Pv~~~~a~~~yttYlfg~-GAi~~~~~~~~~~~E~~Rd~~~~~~gG~d~L~~Rr~---~~~ 307 (367) T protein:vir:80 232 GQLTIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGG-AAFGYADGAPQVPVAVGRRELRGNGSGLEYILERKE---WIV 307 (367) T ss_pred CccccceecceeEEEeCCCcccccCCCceEEEEEEec-ceeeecccCCccceecccchhhhcCCceEEEEeeee---EEe Confidence 12357899999999999942 133443 11212211 1223333332211234444544444 578 Q ss_pred cccceEEEeecC Q lcl|NC_016164. 825 HPEAFCRGNDNL 836 (836) Q Consensus 825 ~p~Af~~l~~A~ 836 (836) +|..|...+.++ T Consensus 308 hP~G~s~~~~~v 319 (367) T protein:vir:80 308 HPGGFNWLDADV 319 (367) T ss_pred ecceeeeccccc Confidence 888888776655 No 170 >protein:vir:103759 Length: 330 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024928;genbank:gi:48697198;genbank:GeneID:2846083 Probab=98.06 E-value=5.6e-07 Score=54.85 Aligned_cols=227 Identities=15% Similarity=0.129 Sum_probs=126.7 Q ss_pred hhhhhhhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEecCCceeeeeccC Q lcl|NC_016164. 544 VTPRGILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQTGAATAYWVAEG 623 (836) Q Consensus 544 ~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg 623 (836) -..-+...+... .....+.|......|++.+.+.+.+....+-.-............++-|.+.|..=+ T Consensus 1 m~~~~~~a~TL~-----------e~AKr~~~d~~~~~IIE~l~~tn~IL~~lpf~e~N~~tg~~t~vrt~LP~~~fR~lN 69 (330) T protein:vir:10 1 MATLSTNNPTMA-----------DVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTGLPTPTWRKLY 69 (330) T ss_pred CCcCCCCcccHH-----------HHHhhcCcchhHHHHHHHHhcCchHHhhcchhhccCCcccceeEEeecCCchhhhcC Confidence 000000000000 001112233444568888888887766543322112111223345677888898888 Q ss_pred cccccccccceeEEeeeeeeeeeehhHHHHHhcch--hHHHHHHHHHHHHHHHHHHHHHHHhhcCCc--cccccccc--- Q lcl|NC_016164. 624 GDPTESQPSVDQVALVAKTLGAYTEFSRRLMLQSS--IDVEQMVRTELATVIALEIDRAALYGLGSN--SQPEGLKF--- 696 (836) Q Consensus 624 ~~~~~~~~~~~~it~~~~t~~~~i~ISrelL~ds~--~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~--~~p~Gi~~--- 696 (836) ..++.++.++.+++...+-+++.+.|.+.+..-.. -++...-.....+++.++....||||+-+. ..+.||.. T Consensus 70 ~g~~~s~~tt~qvt~~l~ilgg~~eVDr~la~~~Gn~a~~ra~e~~~~ikam~q~~~~~~iyGD~a~~p~~F~GL~kR~~ 149 (330) T protein:vir:10 70 GGVLPNKSSTAQVTDNCGMLEAYAEVDKALADLNGNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYN 149 (330) T ss_pred CccccccceEEEEEEEeEEecchhhhhhHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCChhhccchhhhcC Confidence 88999999999999999999999999997765322 123344566688999999999999996431 12233310 Q ss_pred c----cccccc------------------------------------------c---c---------------------- Q lcl|NC_016164. 697 V----TGINTE------------------------------------------N---F---------------------- 705 (836) Q Consensus 697 ~----~~~~~~------------------------------------------t---~---------------------- 705 (836) . .+.+.+ + . T Consensus 150 ~~ta~~~~qvIdaGGtG~~~TSi~~v~wg~~~~~giyPkG~kaGl~~~d~g~~~~~~~dg~gg~y~~~~~~~~w~~Gl~i 229 (330) T protein:vir:10 150 SLSAENKDNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTL 229 (330) T ss_pred CCCCCchhheeeccccccCceEEEEEEEcCCeEEEEcccCccccceeeeccceeeecccCCCCceeEEeeeeeeeeeeEE Confidence 0 000000 0 0 Q ss_pred --------------cc--cchhHHHHHHHH----HHHhhhccccCccEEEecHHHHHHHHHH-hhccCccccccCCC--- Q lcl|NC_016164. 706 --------------GA--TNPTYVELVSME----SKVAADNADIGAMSYLTNSTLYGGFKTT-EKATSTAQFVLEPG--- 761 (836) Q Consensus 706 --------------aa--~~~t~~~l~~a~----~~l~~~~~~~~~~~~vmnp~~~~~L~~l-kd~~g~~~~~~~~~--- 761 (836) +. ......+|.+++ .+|+ +...++++|+||.+....|+.. .+.++...-..... T Consensus 230 ~d~r~vvRI~NIdvs~l~~~~~~~~li~lm~~A~~~ip--~~~~g~~~~y~n~~v~~~L~~q~~~k~n~~l~~~~~~g~~ 307 (330) T protein:vir:10 230 RDWRYVARVCNIDVSDLATSANAQALIKYMIMAAERIP--QLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGER 307 (330) T ss_pred eCcccEEEEeecccccCCCCccHHHHHHHHHHHHHhcc--CCCCCcceeeechHHHHHHHHHHhhcccceeeeeecCCee Confidence 00 000112333333 2232 3445778999999999999875 34433221111122 Q ss_pred -CeecceeeEeeCccccceEEEE Q lcl|NC_016164. 762 -GTVNGYNVVRSNQVANGDVFFG 783 (836) Q Consensus 762 -~~l~G~pVv~s~~~~~~~i~~g 783 (836) -.+.|+||-.++++-.+.-.+. T Consensus 308 ~t~~~gipir~~Dail~tE~~vv 330 (330) T protein:vir:10 308 VMTFDGIPVQRTDALLNTESRVV 330 (330) T ss_pred eEEECCeEEEEEeeeecCccccC Confidence 2478999999999876653222 No 171 >protein:vir:96792 Length: 315 # NCBI annotation: major capsid protein # Family: family:all:47 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224246;genbank:gi:62362381;genbank:GeneID:3345731 Probab=98.06 E-value=4.6e-06 Score=49.83 Aligned_cols=259 Identities=12% Similarity=0.042 Sum_probs=106.2 Q ss_pred hhcccccccccccchhhHHHHHHHHHhhhhhhhhcce--------eeecCCceEEEEEecCCceeeeeccCccccccccc Q lcl|NC_016164. 561 VVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVT--------MLTGLQGPVAIPRQTGAATAYWVAEGGDPTESQPS 632 (836) Q Consensus 561 ~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~--------~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~ 632 (836) ++.+. .+--.+..+.+...+++.+.+.....+.... .+.+++....+.+..+...-.-+.-.+......++ T Consensus 1 ~~~t~-~sdl~vfn~~~~~a~~e~~~~~~~~Fnaas~Gai~l~~~~~~GDf~~~~ff~i~~~~~~rnv~~~~~~t~~kit 79 (315) T protein:vir:96 1 MATTV-NSDLVIYNDTAQTAYLERNMDNLAVFNENSRAAIGLNSELIEGDLKLRSFYKVGGAIADRDVNSTATVAGTKIA 79 (315) T ss_pred Cceee-ecceeeehhhhhhhHHhhhHHHHHHhhhhcCCcccccccccccccccccccccccchhhcccCCCccccceecc Confidence 11111 1122345566666677776665444332111 11222222222221111111112222223222322 Q ss_pred -ceeEEeeeeeeeeeehh--HHHHHh---cchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCccccccccccccccccccc Q lcl|NC_016164. 633 -VDQVALVAKTLGAYTEF--SRRLML---QSSIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTENFG 706 (836) Q Consensus 633 -~~~it~~~~t~~~~i~I--SrelL~---ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~a 706 (836) ...+..++ ..+.-++ +...+. ++...+...|.+.+..+.-+..-...+.+.- +.+........+.+ T Consensus 80 ~~~dvaVk~--~~~~~~~~~~~~~~a~~g~dp~~~~~~i~~~~~~~~l~~~l~~~l~~~~------aai~~~t~~~~~~~ 151 (315) T protein:vir:96 80 ADEMVSVKV--PWKYGPYETTEEAFKRRARSPEEFSMLIGQDMADATMAGWIGYALNALQ------GAIGSNAGMNVSGE 151 (315) T ss_pred cccceeEEE--eecCCchhccHHHHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHhhhh------hhhccccccccccc Confidence 11222222 2232222 333333 2222233334444433333322222222221 11111111112233 Q ss_pred ccchhHHHHHHHHHHHhhhccccCccEEEecHHHHHHHHHHhhc------cCccccccCCCCeecceeeEeeCccccceE Q lcl|NC_016164. 707 ATNPTYVELVSMESKVAADNADIGAMSYLTNSTLYGGFKTTEKA------TSTAQFVLEPGGTVNGYNVVRSNQVANGDV 780 (836) Q Consensus 707 a~~~t~~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~------~g~~~~~~~~~~~l~G~pVv~s~~~~~~~i 780 (836) .+.++...+.++..+|-.+.. .-..|+||..++..|.. +.- .+........+. .+|++|++++.||...+ T Consensus 152 ~a~~~~~~l~dA~~klGD~~~--~l~~~vMHS~v~~~L~~-q~L~~~~~~~~~~~~~~~~~~-~lGkrViVdD~~P~~~~ 227 (315) T protein:vir:96 152 LATEGKKVLTKGLRTMGDKAS--SIAIWVMDSTSYFDIVD-EAIDNKLYEEAGVVVYGGTPG-TLGKPVLVTDQCPATKI 227 (315) T ss_pred ccccCHHHHHHHHHHhccccc--CeeEEEEchHHHHHHHH-hhhhhhcccccceeEecCcCc-ccccEEEEECCCCccee Confidence 455788889999888744432 34689999999988865 211 111111112233 45999999999997654 Q ss_pred EEEehhceEEEeecceEEEEecccccccCcEEEEEEEEecc-EEEcccceEEEeecC Q lcl|NC_016164. 781 FFGVWNQMIMGMWGALDIQVNPYALDKSGSVRVTALQDVDV-AVRHPEAFCRGNDNL 836 (836) Q Consensus 781 ~~gD~s~~~i~~~~~l~i~~~~~~~~~~~~~~~r~~~r~d~-~v~~p~Af~~l~~A~ 836 (836) |.--...+.+..-..+.....+ ..++-.+....|..+ -+++|..|..-+.+. T Consensus 228 ~gl~~GAi~~~~~~~~~~~~~~----~~g~e~l~~~~r~e~tf~l~p~G~sw~~~~~ 280 (315) T protein:vir:96 228 FGLVAGAVMITESQAPGMRSYQ----IDDQENLAIGFRAEGTANVEVLGYKWKTKTN 280 (315) T ss_pred eeeecceeeecCCCcccccccc----CCCcceeEEEEeeeeEeeeeeeeEEeecCCC Confidence 4311122222222221111111 112233333333333 367777777754444 No 172 >protein:vir:3525 Length: 423 # NCBI annotation: major head protein # Family: family:all:1412 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050985;genbank:gi:9633571;genbank:GeneID:1262318 Probab=98.05 E-value=3.5e-06 Score=50.48 Aligned_cols=257 Identities=7% Similarity=-0.038 Sum_probs=131.5 Q ss_pred hhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeecC------CceEEEEEecCCceeee-eccCcccccccccc Q lcl|NC_016164. 561 VVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGL------QGPVAIPRQTGAATAYW-VAEGGDPTESQPSV 633 (836) Q Consensus 561 ~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~------~~~~~~p~~~~~~~a~~-v~Eg~~~~~~~~~~ 633 (836) +..+. ...+|+.+...+++.+++..++.++..+...+. +..+++++-........ .+-+..+...+... T Consensus 1 MAN~l----lT~iP~iia~~al~~l~~~lV~~~lV~r~y~ge~~~a~~GDTV~I~~p~~~~v~d~~~~~~~~~~~~~~~e 76 (423) T protein:vir:35 1 MANNL----ESNISQIVLKKFLPGFMSDIVLCKTVDRQLLSGEINSNTGDSVSFKRPHQFKSERTETGDITGKDKNGLFS 76 (423) T ss_pred Cccch----hhhhHHHHHHHHHHHHHhhcccchhcccCCCcccccccCCCEEEEeeCCcceeecccCcCCCCcccccccc Confidence 11110 113589999999999999999988877655443 23566665433222222 11233444455666 Q ss_pred eeEEeeeeeeee-eehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccccccccccchhH Q lcl|NC_016164. 634 DQVALVAKTLGA-YTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTENFGATNPTY 712 (836) Q Consensus 634 ~~it~~~~t~~~-~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~aa~~~t~ 712 (836) .++.+.+.+... -+.++.+=...+..++..++... ..+++..++..++...-.+ ..+..+ +.+.....| T Consensus 77 ~~v~l~id~~k~~a~~v~d~e~~l~i~~~~~~l~~a-~~ala~~vd~~l~~~l~~~-----a~~~vg----t~~t~~~~~ 146 (423) T protein:vir:35 77 AKATGKVGKYITVAVEWTQIEEALKLNQLDQILSPI-HERMVTDLETELAHFMMNN-----GALSLG----SPNTAIKKW 146 (423) T ss_pred ceeeEEeccceeccceeCHHHHHhhHHHHHHHHHHH-HHHHHHHHHHHHHHHHhhc-----cccccc----cccCCcchH Confidence 676666655433 45666654444555676655555 4777888888776422111 011111 111112358 Q ss_pred HHHHHHHHHHhhhccccCccEEEecHHHHHHHHHH----hhcc--CccccccCC-CCeecceeeEeeCccccceE----- Q lcl|NC_016164. 713 VELVSMESKVAADNADIGAMSYLTNSTLYGGFKTT----EKAT--STAQFVLEP-GGTVNGYNVVRSNQVANGDV----- 780 (836) Q Consensus 713 ~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~l----kd~~--g~~~~~~~~-~~~l~G~pVv~s~~~~~~~i----- 780 (836) +++.++...|...+...+.-..+++|..+..|... ...+ +...+..+. .+++.|+.|+.|+.+|..+. T Consensus 147 ~~i~~a~~~Ld~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~alr~g~i~G~i~GFdv~~Snnvp~~T~gt~~~ 226 (423) T protein:vir:35 147 ADVAQTASFIKDIGIKTGENYAIMDPWSAQRLADAQSGLHAADQLVRTAWENAQISGNFGGIRALMSNGLASRKQGDFDG 226 (423) T ss_pred HHHHHHHHHHHHhcCCcCCCEEEeCHHHHHHHhccccceeccccchhHHHhhccceeeecceEEEEcCCCcccccccccc Confidence 99999999999988887777889999998776421 1111 112344443 47899999999999996432 Q ss_pred --EEE-eh--hceEEEee----cceEEEEeccccc--ccCcEEEEEEEEeccEEE--------------cccceEEEeec Q lcl|NC_016164. 781 --FFG-VW--NQMIMGMW----GALDIQVNPYALD--KSGSVRVTALQDVDVAVR--------------HPEAFCRGNDN 835 (836) Q Consensus 781 --~~g-D~--s~~~i~~~----~~l~i~~~~~~~~--~~~~~~~r~~~r~d~~v~--------------~p~Af~~l~~A 835 (836) ..+ -. ....+... .++...+....++ ..+.+.| .|+..+ ++.-|++...+ T Consensus 227 ~~~v~~a~~v~~~a~~~~~~~~~~~~~~~~~~~g~l~~GD~~t~-----aGv~~v~~~t~~~~~~~~t~~~~~~~V~~~~ 301 (423) T protein:vir:35 227 AITVKTAPNVDYLSVKDSYQFTVALTGATPSKTGFLKAGDQLKF-----TSTHWLNQQSKQTLYNGSTAMSFTATVLEET 301 (423) T ss_pred ceeeccccccccccccccccceeeeeeeeeccCCcEEecceEEe-----eeeeeccccccceeecccCCceeEEEEeccc Confidence 110 00 00011000 0111121111111 1122222 222222 22233333222 Q ss_pred C Q lcl|NC_016164. 836 L 836 (836) Q Consensus 836 ~ 836 (836) . T Consensus 302 ~ 302 (423) T protein:vir:35 302 N 302 (423) T ss_pred c Confidence 1 No 173 >protein:vir:1991 Length: 305 # NCBI annotation: major head subunit # Family: family:all:776 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050638;genbank:gi:9633525;genbank:GeneID:2636267 Probab=98.01 E-value=4.6e-07 Score=55.31 Aligned_cols=211 Identities=13% Similarity=0.053 Sum_probs=113.5 Q ss_pred hhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEecCCcee-eeeccCccccc Q lcl|NC_016164. 550 LAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQTGAATA-YWVAEGGDPTE 628 (836) Q Consensus 550 ~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~~~~~a-~~v~Eg~~~~~ 628 (836) +......-++ +-.-+...+...+....+..+..++.++.++..-++..+...|.. .|+ |+... T Consensus 1 M~i~~~~l~~-------------l~~~~~~~f~~~~~~a~~~~~~iA~~vpSt~~~~tY~wLg~fP~lrewi---Ger~i 64 (305) T protein:vir:19 1 MIVTPASIKA-------------LMTSWRKDFQGGLEDAPSQYNKIAMVVNSSTRSNTYGWLGKFPTLKEWV---GKRTI 64 (305) T ss_pred CccCHHHHHH-------------HHHHHHHHHHHHHhhcCcccceEEeEecCCCCcccccccccCCccchhh---cceee Confidence 0000000000 011133344444444455555556777888888888888888886 577 57788 Q ss_pred ccccceeEEeeeeeeeeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhc--CCc-c--ccccccccccccc- Q lcl|NC_016164. 629 SQPSVDQVALVAKTLGAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGL--GSN-S--QPEGLKFVTGINT- 702 (836) Q Consensus 629 ~~~~~~~it~~~~t~~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~--Gt~-~--~p~Gi~~~~~~~~- 702 (836) .++.....++..++|...+.|+|+.|+||.+++.+-+.++|+++.+...|..++.-+ |-+ . ..+.+|+++|... T Consensus 65 ~~l~~~~y~i~Nk~fe~tV~V~R~dIeDD~lG~y~p~~~~~G~~aa~~pd~lv~~lL~~Gf~~~cyDGq~FFdtDHpv~~ 144 (305) T protein:vir:19 65 QQMEAHGYSIANKTFEGTVGISRDDFEDDNLGIYAPIFQEMGRSAAVQPDELIFKLLKDGFTQPCYDGQNFFDKEHPVYP 144 (305) T ss_pred eeccccceeEeeccccceeccchhhccccccCchHHHHHHHHHHHhhchhhHHHHHHHhcCCccCCCCCcccCCCCCccc Confidence 889999999999999999999999999999999999999999999999998776432 211 1 2233555544221 Q ss_pred -ccccc-------------------------------------------------------------------------- Q lcl|NC_016164. 703 -ENFGA-------------------------------------------------------------------------- 707 (836) Q Consensus 703 -~t~aa-------------------------------------------------------------------------- 707 (836) ..+.+ T Consensus 145 ~~~~tg~~~~vsn~~~~~~~~g~~w~Lld~~~~ikP~I~Q~Rk~~~~~~~~~~~d~~vf~~~e~~ygvd~R~n~Gygfwq 224 (305) T protein:vir:19 145 NVDGTGSAVNTSNIVEQDSFSGLPFYLLDCSRAVKPLIFQERRKPELVARTRIDDDHVFMDNEFLFGASTRRAAGYGFWQ 224 (305) T ss_pred CCcccccccchhhhhcCCCCCCceeeeeecCCcceeEEEecccccceeeccCCCchhhhhhceeeeeeeeeeeccccchh Confidence 11111 Q ss_pred ------cchhHHHHHHHHHHHhhhccccCccEEEecHHHHHHHHHHhhccCccccc-----cCCC-CeecceeeEeeCcc Q lcl|NC_016164. 708 ------TNPTYVELVSMESKVAADNADIGAMSYLTNSTLYGGFKTTEKATSTAQFV-----LEPG-GTVNGYNVVRSNQV 775 (836) Q Consensus 708 ------~~~t~~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~~g~~~~~-----~~~~-~~l~G~pVv~s~~~ 775 (836) ++++.+.+.+++ .+++..|+..|+++-+ ..++ -......++.++.+ T Consensus 225 ~a~gS~~~Ls~~nl~aar----------------------~aM~~qk~d~G~pL~I~P~~LvVPp~LE~~A~qll~s~~i 282 (305) T protein:vir:19 225 MAVAVKGDLTLDNLWKGW----------------------QLMRSFEGDGGKKLGLKPTHIVVPVGLEKAAEQLLNRELF 282 (305) T ss_pred heecCCCCCCHHHHHHHH----------------------HHHHhhcCCCCceeeeecCeEEeCchhHHHHHHHHhhccc Confidence 112222333333 3444555544443211 0000 00011112222222 Q ss_pred ccceEEEEehhceEEEeecceEEEEeccc Q lcl|NC_016164. 776 ANGDVFFGVWNQMIMGMWGALDIQVNPYA 804 (836) Q Consensus 776 ~~~~i~~gD~s~~~i~~~~~l~i~~~~~~ 804 (836) +.+.. ...-...+-+++.++|+. T Consensus 283 ~~g~~------~~~Np~~g~~eliV~P~L 305 (305) T protein:vir:19 283 ADGNT------TVSNEMKGKLQLVVADYL 305 (305) T ss_pred CCccc------cccceecceEEEEecccC Confidence 22210 000111233555555554 No 174 >protein:vir:97331 Length: 319 # NCBI annotation: ORF011 # Family: family:all:701 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240611;genbank:gi:66396278;genbank:GeneID:5133687 Probab=97.96 E-value=9.1e-06 Score=48.23 Aligned_cols=278 Identities=12% Similarity=0.072 Sum_probs=132.1 Q ss_pred HHHHHHHHhhhhhhhhhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhh-h-cce-eeecCCceEEEEE Q lcl|NC_016164. 534 VSEATAQRMGVTPRGILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNT-L-GVT-MLTGLQGPVAIPR 610 (836) Q Consensus 534 ~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~-l-~~~-~~~~~~~~~~~p~ 610 (836) +.++. ....|... +.......-..........+-+...+ +.......+.. + +.+ ......+.+.+|+ T Consensus 1 ~~~~~-----~~~~~~~~----~~~~~~~~~~~~~nt~~l~~k~~~~L-D~~~~~~~~s~~~~~N~~~e~~gg~tVkIp~ 70 (319) T protein:vir:97 1 MNKTI-----KNATGMLK----LNLQHFANKSVEPGQTLLKNKHVGIL-ERVTAVNAYSTPALISNDAIFMEGRSFTVMK 70 (319) T ss_pred CCccc-----ccccceeE----eehhhhhccCCCcchHHHHHHHHHHH-HHHHHHhhhhhhcccCcceEeccCcEEEEee Confidence 00000 00001000 00000011111122223333344433 22222222221 1 121 2334567899998 Q ss_pred ecCCceeeeeccCcccccccccceeEEeeeee--eeeeehhHHHHHhcch--hHHHHHHHHHHHHHHHHHHHHHHHhhcC Q lcl|NC_016164. 611 QTGAATAYWVAEGGDPTESQPSVDQVALVAKT--LGAYTEFSRRLMLQSS--IDVEQMVRTELATVIALEIDRAALYGLG 686 (836) Q Consensus 611 ~~~~~~a~~v~Eg~~~~~~~~~~~~it~~~~t--~~~~i~ISrelL~ds~--~~l~~~i~~~l~~a~a~~~d~~il~G~G 686 (836) .+. +...-+.-++....++++.+..++.+.. +-.+ .|..--...+. +.+-..+.+.+...++-.+|...+...- T Consensus 71 i~~-~gl~DY~R~~g~~~g~vt~~~~t~tidqdR~~~F-~VD~~D~~Etn~~l~a~~i~~~~~~~~v~PEiDay~~skla 148 (319) T protein:vir:97 71 GDT-TELKDYKRNATNEFDHPKIEETTYFLDQEKYWGR-FVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLA 148 (319) T ss_pred ecc-cccccccCCCCcccCCcccceeEEEeeccccccc-ccchhhHhhhhchhhHHHHHHHHHHHHhhhhhhHHHHHHHH Confidence 876 4454555455555666666666655533 2221 11111111111 1222334444455555566665443322 Q ss_pred CcccccccccccccccccccccchhHHHHHHHHHHHhhhccccCccEEEecHHHHHHHHHHhhccC-----ccccccCCC Q lcl|NC_016164. 687 SNSQPEGLKFVTGINTENFGATNPTYVELVSMESKVAADNADIGAMSYLTNSTLYGGFKTTEKATS-----TAQFVLEPG 761 (836) Q Consensus 687 t~~~p~Gi~~~~~~~~~t~aa~~~t~~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~~g-----~~~~~~~~~ 761 (836) .+. +....+..+..-.|+.|.+++..|..++.. .+-.++++|..+..|..-..... ....+.+.- T Consensus 149 ~~a---------~~~~~~~~t~~n~y~~i~~a~~~Lde~~VP-~~Rvl~Vtp~~~~~L~~~~~f~~~~~~~~~~~~~g~V 218 (319) T protein:vir:97 149 RNK---------AKHLTVGTGSDAQYDAVLDVSVELDEIKAP-ENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGVQ 218 (319) T ss_pred hhc---------ccccccccCHHHHHHHHHHHHHHHHhcCCC-CCcEEEeCHHHHHHHHhhhhhhccccccccceeeeec Confidence 110 001111112234589999999999887765 46788999999998865432211 122344555 Q ss_pred CeecceeeEeeC--ccccceEEEEehhceEEEee-cceEEEEecccccccCcEEEEEEEEeccEEEcccceEEEe---ec Q lcl|NC_016164. 762 GTVNGYNVVRSN--QVANGDVFFGVWNQMIMGMW-GALDIQVNPYALDKSGSVRVTALQDVDVAVRHPEAFCRGN---DN 835 (836) Q Consensus 762 ~~l~G~pVv~s~--~~~~~~i~~gD~s~~~i~~~-~~l~i~~~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~---~A 835 (836) ++|.|++|+.++ .+.+..+++|..+.+..... ..+++.......| ...++....+|+.|.+|++..+.. .+ T Consensus 219 g~idG~~Vi~vps~~~k~in~i~~h~~A~~~~~k~~~~~~~~p~~~~~---a~~v~gr~y~d~~V~~~k~~~Iy~~~~~~ 295 (319) T protein:vir:97 219 GELDGFVIVKVPTKLLQGLQAIAVVGEVLASPIQADLAKTNSNIPGMF---GTLAEQLLYTGAFVPEHLQKYIFTIGGTE 295 (319) T ss_pred eeecCeEEEEecccccccceEEEEcCCeeeeeeeeeeeeccCCCcccc---ceeeeeeeeeeeEEeccccceEEEeecCC Confidence 789999998764 34455577776655433222 2233322112222 367888999999999998644433 22 Q ss_pred C Q lcl|NC_016164. 836 L 836 (836) Q Consensus 836 ~ 836 (836) . T Consensus 296 ~ 296 (319) T protein:vir:97 296 V 296 (319) T ss_pred c Confidence 2 No 175 >protein:vir:94800 Length: 319 # NCBI annotation: ORF012 # Family: family:all:701 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240536;genbank:gi:66396203;genbank:GeneID:5133580 Probab=97.96 E-value=9.1e-06 Score=48.23 Aligned_cols=278 Identities=12% Similarity=0.072 Sum_probs=132.1 Q ss_pred HHHHHHHHhhhhhhhhhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhh-h-cce-eeecCCceEEEEE Q lcl|NC_016164. 534 VSEATAQRMGVTPRGILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNT-L-GVT-MLTGLQGPVAIPR 610 (836) Q Consensus 534 ~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~-l-~~~-~~~~~~~~~~~p~ 610 (836) +.++. ....|... +.......-..........+-+...+ +.......+.. + +.+ ......+.+.+|+ T Consensus 1 ~~~~~-----~~~~~~~~----~~~~~~~~~~~~~nt~~l~~k~~~~L-D~~~~~~~~s~~~~~N~~~e~~gg~tVkIp~ 70 (319) T protein:vir:94 1 MNKTI-----KNATGMLK----LNLQHFANKSVEPGQTLLKNKHVGIL-ERVTAVNAYSTPALISNDAIFMEGRSFTVMK 70 (319) T ss_pred CCccc-----ccccceeE----eehhhhhccCCCcchHHHHHHHHHHH-HHHHHHhhhhhhcccCcceEeccCcEEEEee Confidence 00000 00001000 00000011111122223333344433 22222222221 1 121 2334567899998 Q ss_pred ecCCceeeeeccCcccccccccceeEEeeeee--eeeeehhHHHHHhcch--hHHHHHHHHHHHHHHHHHHHHHHHhhcC Q lcl|NC_016164. 611 QTGAATAYWVAEGGDPTESQPSVDQVALVAKT--LGAYTEFSRRLMLQSS--IDVEQMVRTELATVIALEIDRAALYGLG 686 (836) Q Consensus 611 ~~~~~~a~~v~Eg~~~~~~~~~~~~it~~~~t--~~~~i~ISrelL~ds~--~~l~~~i~~~l~~a~a~~~d~~il~G~G 686 (836) .+. +...-+.-++....++++.+..++.+.. +-.+ .|..--...+. +.+-..+.+.+...++-.+|...+...- T Consensus 71 i~~-~gl~DY~R~~g~~~g~vt~~~~t~tidqdR~~~F-~VD~~D~~Etn~~l~a~~i~~~~~~~~v~PEiDay~~skla 148 (319) T protein:vir:94 71 GDT-TELKDYKRNATNEFDHPKIEETTYFLDQEKYWGR-FVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLA 148 (319) T ss_pred ecc-cccccccCCCCcccCCcccceeEEEeeccccccc-ccchhhHhhhhchhhHHHHHHHHHHHHhhhhhhHHHHHHHH Confidence 876 4454555455555666666666655533 2221 11111111111 1222334444455555566665443322 Q ss_pred CcccccccccccccccccccccchhHHHHHHHHHHHhhhccccCccEEEecHHHHHHHHHHhhccC-----ccccccCCC Q lcl|NC_016164. 687 SNSQPEGLKFVTGINTENFGATNPTYVELVSMESKVAADNADIGAMSYLTNSTLYGGFKTTEKATS-----TAQFVLEPG 761 (836) Q Consensus 687 t~~~p~Gi~~~~~~~~~t~aa~~~t~~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~~g-----~~~~~~~~~ 761 (836) .+. +....+..+..-.|+.|.+++..|..++.. .+-.++++|..+..|..-..... ....+.+.- T Consensus 149 ~~a---------~~~~~~~~t~~n~y~~i~~a~~~Lde~~VP-~~Rvl~Vtp~~~~~L~~~~~f~~~~~~~~~~~~~g~V 218 (319) T protein:vir:94 149 RNK---------AKHLTVGTGSDAQYDAVLDVSVELDEIKAP-ENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGVQ 218 (319) T ss_pred hhc---------ccccccccCHHHHHHHHHHHHHHHHhcCCC-CCcEEEeCHHHHHHHHhhhhhhccccccccceeeeec Confidence 110 001111112234589999999999887765 46788999999998865432211 122344555 Q ss_pred CeecceeeEeeC--ccccceEEEEehhceEEEee-cceEEEEecccccccCcEEEEEEEEeccEEEcccceEEEe---ec Q lcl|NC_016164. 762 GTVNGYNVVRSN--QVANGDVFFGVWNQMIMGMW-GALDIQVNPYALDKSGSVRVTALQDVDVAVRHPEAFCRGN---DN 835 (836) Q Consensus 762 ~~l~G~pVv~s~--~~~~~~i~~gD~s~~~i~~~-~~l~i~~~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~---~A 835 (836) ++|.|++|+.++ .+.+..+++|..+.+..... ..+++.......| ...++....+|+.|.+|++..+.. .+ T Consensus 219 g~idG~~Vi~vps~~~k~in~i~~h~~A~~~~~k~~~~~~~~p~~~~~---a~~v~gr~y~d~~V~~~k~~~Iy~~~~~~ 295 (319) T protein:vir:94 219 GELDGFVIVKVPTKLLQGLQAIAVVGEVLASPIQADLAKTNSNIPGMF---GTLAEQLLYTGAFVPEHLQKYIFTIGGTE 295 (319) T ss_pred eeecCeEEEEecccccccceEEEEcCCeeeeeeeeeeeeccCCCcccc---ceeeeeeeeeeeEEeccccceEEEeecCC Confidence 789999998764 34455577776655433222 2233322112222 367888999999999998644433 22 Q ss_pred C Q lcl|NC_016164. 836 L 836 (836) Q Consensus 836 ~ 836 (836) . T Consensus 296 ~ 296 (319) T protein:vir:94 296 V 296 (319) T ss_pred c Confidence 2 No 176 >protein:vir:98525 Length: 331 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:1903 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996579;genbank:gi:45569510;genbank:GeneID:2767853 Probab=97.96 E-value=1.6e-06 Score=52.39 Aligned_cols=227 Identities=11% Similarity=0.072 Sum_probs=127.8 Q ss_pred hhhhhhhhhhhhhhhhhhhcccccccccccch-hhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEecCCceeeeecc Q lcl|NC_016164. 544 VTPRGILAPNDVLHRDLVVDTASAAGDLVFTD-GRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQTGAATAYWVAE 622 (836) Q Consensus 544 ~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~-~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~~~~~a~~v~E 622 (836) -...+...+...... ..+-|. .+...|++.+.+.+++....+-............+.++-|.+.|..= T Consensus 1 m~~~~~~~~TL~e~A-----------k~~~~~~~l~~~IIE~l~~tn~IL~~lpf~e~N~~t~~~~~vrt~LP~~~fR~l 69 (331) T protein:vir:98 1 MPTLSTTNPTLADVA-----------ARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWRKL 69 (331) T ss_pred CCccccCcccHHHHH-----------HhcCcchhHHHHHHHHHhcCchHHhhceeeeccCCccceeeEEeccCCchhhcc Confidence 000000111111100 001111 23345888888888877654433222222344556778889999988 Q ss_pred CcccccccccceeEEeeeeeeeeeehhHHHHHhcch--hHHHHHHHHHHHHHHHHHHHHHHHhhcCCc--ccccccc--- Q lcl|NC_016164. 623 GGDPTESQPSVDQVALVAKTLGAYTEFSRRLMLQSS--IDVEQMVRTELATVIALEIDRAALYGLGSN--SQPEGLK--- 695 (836) Q Consensus 623 g~~~~~~~~~~~~it~~~~t~~~~i~ISrelL~ds~--~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~--~~p~Gi~--- 695 (836) +..++.++.++.+++...+-+++.+.|.+.+..... -.+...-...+.+++.+.....||+|+-+. ..+.||. T Consensus 70 N~g~~~s~~tt~q~t~~l~ilgg~~eVDk~la~~~Gn~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~~p~~F~GL~kR~ 149 (331) T protein:vir:98 70 NYGVQPEKSRTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRF 149 (331) T ss_pred CCccCcccceeEEEEEEEEEeccceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccChhhhccchhhc Confidence 899999999999999999999999999998765422 223444566678899999999999987321 1122221 Q ss_pred cccc----ccccc------------------------------------------------------------------- Q lcl|NC_016164. 696 FVTG----INTEN------------------------------------------------------------------- 704 (836) Q Consensus 696 ~~~~----~~~~t------------------------------------------------------------------- 704 (836) +... .+.+. T Consensus 150 ~~~~a~~~~q~IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~G~~y~~y~~~~~w~~Gl~i~ 229 (331) T protein:vir:98 150 NSLSAENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLR 229 (331) T ss_pred cccccccccceeecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecCceeeecCCCCeeeEEEEEEEeeeeeEEc Confidence 0000 00000 Q ss_pred ------------cc---ccchhHHHHHH----HHHHHhhhccccCccEEEecHHHHHHHHHH-hhccCcc-ccccCCCC- Q lcl|NC_016164. 705 ------------FG---ATNPTYVELVS----MESKVAADNADIGAMSYLTNSTLYGGFKTT-EKATSTA-QFVLEPGG- 762 (836) Q Consensus 705 ------------~a---a~~~t~~~l~~----a~~~l~~~~~~~~~~~~vmnp~~~~~L~~l-kd~~g~~-~~~~~~~~- 762 (836) .+ ..+-+..++.+ +..+|+ +...++.+|+||.+....|+.. .+..... .......+ T Consensus 230 d~r~v~ri~NIdvs~l~~~~~~~~dl~~lm~~a~~~ip--~~~~~~~~~y~n~~v~~~L~~q~~~~~~~~~~~~~~~~g~ 307 (331) T protein:vir:98 230 DWRYVVRIANVDVSELTKNASAGADLIDLMTQAVELIP--NVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGK 307 (331) T ss_pred CcccEEEEeccchhccCCCcchhhhHHHHHHHHHHHhc--ccCCCCeEEEechHHHHHHHHHHhhccceeeeeeeecCCc Confidence 00 00001122222 333332 2446778999999999999875 3332222 22222222 Q ss_pred ---eecceeeEeeCccccceEEEE Q lcl|NC_016164. 763 ---TVNGYNVVRSNQVANGDVFFG 783 (836) Q Consensus 763 ---~l~G~pVv~s~~~~~~~i~~g 783 (836) .+.|+||-.++.+-.+...+. T Consensus 308 ~~t~~~gipir~~dai~~tE~~Vv 331 (331) T protein:vir:98 308 KVVAFDGIPCRRTDALLLTEARVV 331 (331) T ss_pred ceeEECCeeEEEeeeeecCccccC Confidence 478999999998876543222 No 177 >protein:vir:107388 Length: 331 # NCBI annotation: Bbp17 # Family: family:all:1903 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958686;genbank:gi:41179378;genbank:GeneID:2717182 Probab=97.96 E-value=1.6e-06 Score=52.39 Aligned_cols=227 Identities=11% Similarity=0.072 Sum_probs=127.8 Q ss_pred hhhhhhhhhhhhhhhhhhhcccccccccccch-hhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEecCCceeeeecc Q lcl|NC_016164. 544 VTPRGILAPNDVLHRDLVVDTASAAGDLVFTD-GRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQTGAATAYWVAE 622 (836) Q Consensus 544 ~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~-~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~~~~~a~~v~E 622 (836) -...+...+...... ..+-|. .+...|++.+.+.+++....+-............+.++-|.+.|..= T Consensus 1 m~~~~~~~~TL~e~A-----------k~~~~~~~l~~~IIE~l~~tn~IL~~lpf~e~N~~t~~~~~vrt~LP~~~fR~l 69 (331) T protein:vir:10 1 MPTLSTTNPTLADVA-----------ARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWRKL 69 (331) T ss_pred CCccccCcccHHHHH-----------HhcCcchhHHHHHHHHHhcCchHHhhceeeeccCCccceeeEEeccCCchhhcc Confidence 000000111111100 001111 23345888888888877654433222222344556778889999988 Q ss_pred CcccccccccceeEEeeeeeeeeeehhHHHHHhcch--hHHHHHHHHHHHHHHHHHHHHHHHhhcCCc--ccccccc--- Q lcl|NC_016164. 623 GGDPTESQPSVDQVALVAKTLGAYTEFSRRLMLQSS--IDVEQMVRTELATVIALEIDRAALYGLGSN--SQPEGLK--- 695 (836) Q Consensus 623 g~~~~~~~~~~~~it~~~~t~~~~i~ISrelL~ds~--~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~--~~p~Gi~--- 695 (836) +..++.++.++.+++...+-+++.+.|.+.+..... -.+...-...+.+++.+.....||+|+-+. ..+.||. T Consensus 70 N~g~~~s~~tt~q~t~~l~ilgg~~eVDk~la~~~Gn~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~~p~~F~GL~kR~ 149 (331) T protein:vir:10 70 NYGVQPEKSRTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRF 149 (331) T ss_pred CCccCcccceeEEEEEEEEEeccceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccChhhhccchhhc Confidence 899999999999999999999999999998765422 223444566678899999999999987321 1122221 Q ss_pred cccc----ccccc------------------------------------------------------------------- Q lcl|NC_016164. 696 FVTG----INTEN------------------------------------------------------------------- 704 (836) Q Consensus 696 ~~~~----~~~~t------------------------------------------------------------------- 704 (836) +... .+.+. T Consensus 150 ~~~~a~~~~q~IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~G~~y~~y~~~~~w~~Gl~i~ 229 (331) T protein:vir:10 150 NSLSAENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLR 229 (331) T ss_pred cccccccccceeecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecCceeeecCCCCeeeEEEEEEEeeeeeEEc Confidence 0000 00000 Q ss_pred ------------cc---ccchhHHHHHH----HHHHHhhhccccCccEEEecHHHHHHHHHH-hhccCcc-ccccCCCC- Q lcl|NC_016164. 705 ------------FG---ATNPTYVELVS----MESKVAADNADIGAMSYLTNSTLYGGFKTT-EKATSTA-QFVLEPGG- 762 (836) Q Consensus 705 ------------~a---a~~~t~~~l~~----a~~~l~~~~~~~~~~~~vmnp~~~~~L~~l-kd~~g~~-~~~~~~~~- 762 (836) .+ ..+-+..++.+ +..+|+ +...++.+|+||.+....|+.. .+..... .......+ T Consensus 230 d~r~v~ri~NIdvs~l~~~~~~~~dl~~lm~~a~~~ip--~~~~~~~~~y~n~~v~~~L~~q~~~~~~~~~~~~~~~~g~ 307 (331) T protein:vir:10 230 DWRYVVRIANVDVSELTKNASAGADLIDLMTQAVELIP--NVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGK 307 (331) T ss_pred CcccEEEEeccchhccCCCcchhhhHHHHHHHHHHHhc--ccCCCCeEEEechHHHHHHHHHHhhccceeeeeeeecCCc Confidence 00 00001122222 333332 2446778999999999999875 3332222 22222222 Q ss_pred ---eecceeeEeeCccccceEEEE Q lcl|NC_016164. 763 ---TVNGYNVVRSNQVANGDVFFG 783 (836) Q Consensus 763 ---~l~G~pVv~s~~~~~~~i~~g 783 (836) .+.|+||-.++.+-.+...+. T Consensus 308 ~~t~~~gipir~~dai~~tE~~Vv 331 (331) T protein:vir:10 308 KVVAFDGIPCRRTDALLLTEARVV 331 (331) T ss_pred ceeEECCeeEEEeeeeecCccccC Confidence 478999999998876543222 No 178 >protein:vir:107826 Length: 331 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:1903 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996627;genbank:gi:45580761;genbank:GeneID:2767902 Probab=97.96 E-value=1.6e-06 Score=52.39 Aligned_cols=227 Identities=11% Similarity=0.072 Sum_probs=127.8 Q ss_pred hhhhhhhhhhhhhhhhhhhcccccccccccch-hhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEecCCceeeeecc Q lcl|NC_016164. 544 VTPRGILAPNDVLHRDLVVDTASAAGDLVFTD-GRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQTGAATAYWVAE 622 (836) Q Consensus 544 ~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~-~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~~~~~a~~v~E 622 (836) -...+...+...... ..+-|. .+...|++.+.+.+++....+-............+.++-|.+.|..= T Consensus 1 m~~~~~~~~TL~e~A-----------k~~~~~~~l~~~IIE~l~~tn~IL~~lpf~e~N~~t~~~~~vrt~LP~~~fR~l 69 (331) T protein:vir:10 1 MPTLSTTNPTLADVA-----------ARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWRKL 69 (331) T ss_pred CCccccCcccHHHHH-----------HhcCcchhHHHHHHHHHhcCchHHhhceeeeccCCccceeeEEeccCCchhhcc Confidence 000000111111100 001111 23345888888888877654433222222344556778889999988 Q ss_pred CcccccccccceeEEeeeeeeeeeehhHHHHHhcch--hHHHHHHHHHHHHHHHHHHHHHHHhhcCCc--ccccccc--- Q lcl|NC_016164. 623 GGDPTESQPSVDQVALVAKTLGAYTEFSRRLMLQSS--IDVEQMVRTELATVIALEIDRAALYGLGSN--SQPEGLK--- 695 (836) Q Consensus 623 g~~~~~~~~~~~~it~~~~t~~~~i~ISrelL~ds~--~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~--~~p~Gi~--- 695 (836) +..++.++.++.+++...+-+++.+.|.+.+..... -.+...-...+.+++.+.....||+|+-+. ..+.||. T Consensus 70 N~g~~~s~~tt~q~t~~l~ilgg~~eVDk~la~~~Gn~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~~p~~F~GL~kR~ 149 (331) T protein:vir:10 70 NYGVQPEKSRTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRF 149 (331) T ss_pred CCccCcccceeEEEEEEEEEeccceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccChhhhccchhhc Confidence 899999999999999999999999999998765422 223444566678899999999999987321 1122221 Q ss_pred cccc----ccccc------------------------------------------------------------------- Q lcl|NC_016164. 696 FVTG----INTEN------------------------------------------------------------------- 704 (836) Q Consensus 696 ~~~~----~~~~t------------------------------------------------------------------- 704 (836) +... .+.+. T Consensus 150 ~~~~a~~~~q~IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~G~~y~~y~~~~~w~~Gl~i~ 229 (331) T protein:vir:10 150 NSLSAENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLR 229 (331) T ss_pred cccccccccceeecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecCceeeecCCCCeeeEEEEEEEeeeeeEEc Confidence 0000 00000 Q ss_pred ------------cc---ccchhHHHHHH----HHHHHhhhccccCccEEEecHHHHHHHHHH-hhccCcc-ccccCCCC- Q lcl|NC_016164. 705 ------------FG---ATNPTYVELVS----MESKVAADNADIGAMSYLTNSTLYGGFKTT-EKATSTA-QFVLEPGG- 762 (836) Q Consensus 705 ------------~a---a~~~t~~~l~~----a~~~l~~~~~~~~~~~~vmnp~~~~~L~~l-kd~~g~~-~~~~~~~~- 762 (836) .+ ..+-+..++.+ +..+|+ +...++.+|+||.+....|+.. .+..... .......+ T Consensus 230 d~r~v~ri~NIdvs~l~~~~~~~~dl~~lm~~a~~~ip--~~~~~~~~~y~n~~v~~~L~~q~~~~~~~~~~~~~~~~g~ 307 (331) T protein:vir:10 230 DWRYVVRIANVDVSELTKNASAGADLIDLMTQAVELIP--NVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGK 307 (331) T ss_pred CcccEEEEeccchhccCCCcchhhhHHHHHHHHHHHhc--ccCCCCeEEEechHHHHHHHHHHhhccceeeeeeeecCCc Confidence 00 00001122222 333332 2446778999999999999875 3332222 22222222 Q ss_pred ---eecceeeEeeCccccceEEEE Q lcl|NC_016164. 763 ---TVNGYNVVRSNQVANGDVFFG 783 (836) Q Consensus 763 ---~l~G~pVv~s~~~~~~~i~~g 783 (836) .+.|+||-.++.+-.+...+. T Consensus 308 ~~t~~~gipir~~dai~~tE~~Vv 331 (331) T protein:vir:10 308 KVVAFDGIPCRRTDALLLTEARVV 331 (331) T ss_pred ceeEECCeeEEEeeeeecCccccC Confidence 478999999998876543222 No 179 >protein:vir:105374 Length: 423 # NCBI annotation: gene 5 protein # Family: family:all:1412 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958181;genbank:gi:41057283;genbank:GeneID:2716621 Probab=97.93 E-value=1e-05 Score=47.97 Aligned_cols=261 Identities=9% Similarity=-0.044 Sum_probs=132.1 Q ss_pred hhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeecC------CceEEEEEecCCceeeee-ccCcccccccccc Q lcl|NC_016164. 561 VVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGL------QGPVAIPRQTGAATAYWV-AEGGDPTESQPSV 633 (836) Q Consensus 561 ~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~------~~~~~~p~~~~~~~a~~v-~Eg~~~~~~~~~~ 633 (836) +.. .-...+|+.+...+++.+++..++.++..+...+. +..+++++-......... ..+..+...++.. T Consensus 1 MaN----~llT~~p~iia~~aL~~l~~~lV~~~lVnr~y~~ef~~~k~GDTV~I~~p~~~~~~d~~~~~~~~~~~~dl~e 76 (423) T protein:vir:10 1 MPN----NLDSNVSQIVLKKFLPGFMSDLVLAKTVDRQLLAGEINSSTGDSVSFKRPHQFSSLRTPTGDISGQNKNNLIS 76 (423) T ss_pred Ccc----chhhhhHHHHHHHHHHHHHhhcccchhhcccCCCcccccccCCEEEEeeCCceeeeccCCccccccccCcccc Confidence 110 10112589999999999999999988876655433 235566544332222222 2333344556666 Q ss_pred eeEEeeeeeeee-eehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhc-CCcccccccccccccccccccccchh Q lcl|NC_016164. 634 DQVALVAKTLGA-YTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGL-GSNSQPEGLKFVTGINTENFGATNPT 711 (836) Q Consensus 634 ~~it~~~~t~~~-~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~-Gt~~~p~Gi~~~~~~~~~t~aa~~~t 711 (836) .++.+.+.+.-. -+.++.+=+..+..+++.+ .+.-.++++..+|..++... +.... ..+ +.+..... T Consensus 77 ~~v~l~id~~k~va~~v~d~E~~~~i~~~~~~-l~~A~~aLA~~vd~~ia~~~~~~~~~------~~g----t~~t~~~a 145 (423) T protein:vir:10 77 GKATGRVGNYITVAVEYQQLEEAIKLNQLEEI-LAPVRQRIVTDLETELAHFMMNNGAL------SLG----SPNTPITK 145 (423) T ss_pred ceeEEEeeceeeeeeeechHHHhcChhhHHHH-HHHHHHHHHHHHHHHHHHHHhhcccc------ccc----cCCcccch Confidence 666665544332 3455443333444556554 45556889999999876431 11110 111 11111135 Q ss_pred HHHHHHHHHHHhhhccccCccEEEecHHHHHHHHHH----hhc--cCccccccCC-CCeecceeeEeeCccccceEEEEe Q lcl|NC_016164. 712 YVELVSMESKVAADNADIGAMSYLTNSTLYGGFKTT----EKA--TSTAQFVLEP-GGTVNGYNVVRSNQVANGDVFFGV 784 (836) Q Consensus 712 ~~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~l----kd~--~g~~~~~~~~-~~~l~G~pVv~s~~~~~~~i~~gD 784 (836) |+++.++...|...+.....-..+++|..+..|... ... .+...|..+. .+++.|+.|+.|+.+|..+....- T Consensus 146 ~~~i~~a~~~Ld~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~alr~g~i~G~i~GFdv~~Snnip~~T~gt~~ 225 (423) T protein:vir:10 146 WSDVAQTASFLKDLGVNEGENYAVMDPWSAQRLADAQTGLHASDQLVRTAWENAQIPTNFGGIRALMSNGLASRTQGAFG 225 (423) T ss_pred HHHHHHHHHHHHhccCCcCCCEEEeChHHHHHHhccccceecccccchhhhhhccceeeecceEEEEeCCCccccccccc Confidence 889999999999888887778899999998776532 111 1122344443 378999999999999964322110 Q ss_pred hh-----ceEE-----EeecceEEE----Eecccccc--cCcEEEEE---EEEeccEEE------cccceEEEeecC Q lcl|NC_016164. 785 WN-----QMIM-----GMWGALDIQ----VNPYALDK--SGSVRVTA---LQDVDVAVR------HPEAFCRGNDNL 836 (836) Q Consensus 785 ~s-----~~~i-----~~~~~l~i~----~~~~~~~~--~~~~~~r~---~~r~d~~v~------~p~Af~~l~~A~ 836 (836) -+ ...+ ..-....+. +-..+.+. .+.+.|-. ..+....+. .+.-|++..++. T Consensus 226 ~t~~~~~~~~v~~~a~~~a~~~~~~~~~~~~~~~~~l~~GD~~t~aGv~~v~~~tk~~~~~~~t~~~~~~~v~a~~~ 302 (423) T protein:vir:10 226 GTLTVKTQPTVTYNAVKDSYQFTVTLTGATASVTGFLKAGDQVKFTNTYWLQQQTKQALYNGATPISFTATVTADAN 302 (423) T ss_pred cceeeeecceeccccccccceeeeeeeeccccccCceeecceEEecceeeecccccccccccccCcceEEEEEeeee Confidence 00 0000 000111111 11111111 11111111 111111111 334555554443 No 180 >protein:vir:95131 Length: 325 # NCBI annotation: hypothetical protein ORF010 # Family: family:all:47 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293417;genbank:gi:148912838;genbank:GeneID:5228206 Probab=97.91 E-value=1e-05 Score=47.93 Aligned_cols=263 Identities=12% Similarity=0.035 Sum_probs=104.9 Q ss_pred hhcccccccccccchhhHHHHHHHHHhhhhhhhhc---ceeeec---CCceEEEEEecC--Cc--eeeeeccCccccccc Q lcl|NC_016164. 561 VVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLG---VTMLTG---LQGPVAIPRQTG--AA--TAYWVAEGGDPTESQ 630 (836) Q Consensus 561 ~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~---~~~~~~---~~~~~~~p~~~~--~~--~a~~v~Eg~~~~~~~ 630 (836) ++.+. -.+..+.....+++.+.+.....+.. +.++.. .+.-+.+|-... +. ....+.+.+..+..+ T Consensus 1 m~lsD----~~vfN~~~~~a~~e~~~q~~~~fn~as~gai~l~~~~~~Gd~~~~pf~~~l~g~~~~~~~~~~~~~vt~~k 76 (325) T protein:vir:95 1 MALSD----LAVYSEYAYSAFSETLRQQVDLFNTATGGAIMLQSAAHQGDFSDVAFFAKVTGGLVRRRNAYGSGTVAEKV 76 (325) T ss_pred Cchhh----hhhhhhhhhhhhhhhhhhhHhhhhhcccceeEeccccccCceeeccccccccccccccccCCCCceeccce Confidence 00000 01123333344455544433332221 111111 111223333221 11 222334444444333 Q ss_pred cc-ceeEEeeeeeeeeee--hhHHHHHhcchh-HHHHHHHHHHHHHHHHHHHHHHHhhcCCc-cccccccccccccccc- Q lcl|NC_016164. 631 PS-VDQVALVAKTLGAYT--EFSRRLMLQSSI-DVEQMVRTELATVIALEIDRAALYGLGSN-SQPEGLKFVTGINTEN- 704 (836) Q Consensus 631 ~~-~~~it~~~~t~~~~i--~ISrelL~ds~~-~l~~~i~~~l~~a~a~~~d~~il~G~Gt~-~~p~Gi~~~~~~~~~t- 704 (836) .+ ...+...+..-.+.+ .++..+...+.+ .+...|...++++..+.+-..++.+.... .......+ .+...+ T Consensus 77 itt~~~~av~~~r~~g~~~~d~~~~~~g~~~~~~~~~~Ig~~~a~~~~~~~l~~~~~~l~~a~~~~~~~v~--dis~~~~ 154 (325) T protein:vir:95 77 LKHLVDTSVKVAAGTPPVRLDPGQFRWIQQNPEVAGAAMGQQLAVDTMADMLNVGLGSVYSALSQVSDVVY--DATANTD 154 (325) T ss_pred eccccceeeEEecccCcccccHHHHhhcCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccee--eeecccC Confidence 33 222333332222222 222222222222 22233333333333222222233222110 00011111 011111 Q ss_pred ccccchhHHHHHHHHHHHhhhccccCccEEEecHHHHHHHHHHhhccCccccccCC---CCeecceeeEeeCccccce-- Q lcl|NC_016164. 705 FGATNPTYVELVSMESKVAADNADIGAMSYLTNSTLYGGFKTTEKATSTAQFVLEP---GGTVNGYNVVRSNQVANGD-- 779 (836) Q Consensus 705 ~aa~~~t~~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~~g~~~~~~~~---~~~l~G~pVv~s~~~~~~~-- 779 (836) .....++...+.++..+|-.+.. .-..|+||..++..|....-.+.-..+.... -++.+|++|++++.+|... T Consensus 155 ~~~~~~s~~~l~~A~~klGD~~~--~l~~~~MHS~v~~~L~~~~L~~~~~~~~~~g~~~i~t~~G~~VIVdD~~p~~~~g 232 (325) T protein:vir:95 155 AADKLPTWNNLNNGQAKFGDQSS--QIAAWIMHSTPMHKLYGSNLTNGERLFTYGTVNVVRDPFGKLLVMTDSPNLFAAG 232 (325) T ss_pred cccccccHHHHHHHHHHhccccc--ceeEEEEchHHHHHHHHhhccccccccccCCcccccccCCcEEEEeCCCCCCCcc Confidence 11223578899999988744432 3468999999999998755443322222211 1467899999999988532 Q ss_pred -------EEEEehhceEEEeecceEEEEecccccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 780 -------VFFGVWNQMIMGMWGALDIQVNPYALDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 780 -------i~~gD~s~~~i~~~~~l~i~~~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) .+||. ..+.+.....+.....+...-+.....++.+.. -+++|..+..-+ +. T Consensus 233 ~~~~ytty~lg~-GAi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t---f~lhp~G~sw~~-s~ 291 (325) T protein:vir:95 233 TPNVYHILGLVP-GGVLIGQNNDFDANEETKNGDENIIRTYQAEWS---YNIGVKGFAWDK-AN 291 (325) T ss_pred CceeEEEEEEec-CeEEecCCCCccccccccCcccceeeeeeeeee---EEeecceeeeec-cc Confidence 12221 222233322222222222222233334443222 367888877733 33 No 181 >protein:vir:95451 Length: 313 # NCBI annotation: hypothetical protein ORF044 # Family: family:all:11728 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294637;genbank:gi:149408203;genbank:GeneID:5237018 Probab=97.88 E-value=3e-06 Score=50.86 Aligned_cols=271 Identities=12% Similarity=0.105 Sum_probs=161.4 Q ss_pred hcccccccccccchhhHHHHHHHHHhhhhhhhhcc-eeeecCCceEEEEEecCCceeeeeccCcccccccccceeEEeee Q lcl|NC_016164. 562 VDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGV-TMLTGLQGPVAIPRQTGAATAYWVAEGGDPTESQPSVDQVALVA 640 (836) Q Consensus 562 ~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~-~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~it~~~ 640 (836) ...++..-.+++.+.+++.|...|.+...=..+.. .+..+++..+.+|.. +++.+.--.|.++.....+..+++++.+ T Consensus 1 ~~~TSNT~A~I~SE~~s~~I~~~LH~~LL~~~~~R~V~DF~~G~~L~I~ti-Gs~~~~~~~E~~~~~~~~i~TGEIt~~i 79 (313) T protein:vir:95 1 MQLTSNTRAFIESEQYSKFILLNLHDGLLPETFYRNVSDFGSGETLHIKTI-GSVTLQEAEEDTPLIYNPIETGEITFQI 79 (313) T ss_pred CcccccchheehhhhHHHHHHHHhhccccchhhhhhhccCCCCCEEEeccc-CceeeeccccCCCeeecccccceEEEEE Confidence 33344555677888888877766655421111111 233455667777654 5667777788888888889999999999 Q ss_pred eeeeee-ehhHHHHHhcch--hHHHHHHHHHHHHHHHHHHHHHHHhhcCC-----cccccccccccccccccccccchhH Q lcl|NC_016164. 641 KTLGAY-TEFSRRLMLQSS--IDVEQMVRTELATVIALEIDRAALYGLGS-----NSQPEGLKFVTGINTENFGATNPTY 712 (836) Q Consensus 641 ~t~~~~-i~ISrelL~ds~--~~l~~~i~~~l~~a~a~~~d~~il~G~Gt-----~~~p~Gi~~~~~~~~~t~aa~~~t~ 712 (836) ..|++- ..||.+|-+|+- -++-+.+..+-++++....+..+|. +|. ++.|.-+-..++...-+........ T Consensus 80 ~~Y~G~A~~vt~~LR~D~~~I~~~~A~~~AE~~RAI~E~~~TD~L~-~G~~~FA~~~~P~~vNG~PH~~V~~~T~~~~~~ 158 (313) T protein:vir:95 80 TEYKGDAWYVTDDLREDGTDIDRLMAERAAESTRAIQETFETDFLK-TGAEYFAANPGPHNVNGFPHVIVSAETNGVFAL 158 (313) T ss_pred EeecCChhhhhhhhhhcchhHHHHhhhcchhhHHHHHHHHhhHHHh-hchhhhccCCCCcccccccceEEeccCCceehh Confidence 999875 479999877642 2344455555666666666666653 221 2233333223344444444455678 Q ss_pred HHHHHHHHHHhhhccccCccEEEecHHHHHHHHHHhhc----cCccccccCC---CC-----eecceeeEeeCcccc--- Q lcl|NC_016164. 713 VELVSMESKVAADNADIGAMSYLTNSTLYGGFKTTEKA----TSTAQFVLEP---GG-----TVNGYNVVRSNQVAN--- 777 (836) Q Consensus 713 ~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~----~g~~~~~~~~---~~-----~l~G~pVv~s~~~~~--- 777 (836) .++..+...+..++...+..++++.|.....|..+..- ...+.++... ++ .+.|..+.+|+.+.. T Consensus 159 ~~~~~~~~~~~~a~~P~~G~v~IvDP~~~~~L~~l~~It~~vt~~~k~I~ESG~A~~~~Fi~~~YG~Di~~SN~L~~AN~ 238 (313) T protein:vir:95 159 KHLIAMRLAFDKANVPAEGRVFIVDPVAEATLNGLVTITHDVTDFGKMILESGMARGQRFIMNLYGWDILTSNRLHVANY 238 (313) T ss_pred hHHHHhhhhhhhccCCccceEEEEcchhhhhhhhhheeecccccccceeeeccCCchhHHHHHHhhhhhhhhhhhhhccc Confidence 89999999999999999999999999998888776432 1223333221 11 367888887776542 Q ss_pred ------ceEEEEehhce--------EEEeecceEE-EEecccccccCcEEEEEEEEeccEEEcccce-EEEeecC Q lcl|NC_016164. 778 ------GDVFFGVWNQM--------IMGMWGALDI-QVNPYALDKSGSVRVTALQDVDVAVRHPEAF-CRGNDNL 836 (836) Q Consensus 778 ------~~i~~gD~s~~--------~i~~~~~l~i-~~~~~~~~~~~~~~~r~~~r~d~~v~~p~Af-~~l~~A~ 836 (836) +.-+.|++-.. +++-|..+-- ......+-..+.+.. .+|+++++.+-+-+ +++++|- T Consensus 239 ~D~~tT~~G~~~NlFM~i~D~~~~P~~~AWr~MP~s~~~~~~~~~~~~~~~--~~R~G~Gi~R~~~L~~~~~~A~ 311 (313) T protein:vir:95 239 NDGTTTGNGYVGNLFMCILDDQTKPIMGAWRRMPKSEGERNKDRARDEHVV--RCRYGFGIQRLDTLGLLATSAT 311 (313) T ss_pred cccccccCceeeeeeeeeecccccceeeeecccccccccccccccccccee--eeeecccceeecceeEEEeccc Confidence 22344443211 2333443311 111111222233444 55888888877665 5666777 No 182 >protein:vir:174 Length: 423 # NCBI annotation: capsid protein # Family: family:all:1412 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112079;genbank:gi:13559869;genbank:GeneID:920999 Probab=97.78 E-value=2e-05 Score=46.36 Aligned_cols=262 Identities=10% Similarity=-0.008 Sum_probs=127.3 Q ss_pred hhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeecCC------ceEEEEEecCCceeee-eccCcccccccccc Q lcl|NC_016164. 561 VVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQ------GPVAIPRQTGAATAYW-VAEGGDPTESQPSV 633 (836) Q Consensus 561 ~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~------~~~~~p~~~~~~~a~~-v~Eg~~~~~~~~~~ 633 (836) +..+ -...+|+.+...+++.+++..++.++..+...+.+ ..+++++-........ ...+..+...++.. T Consensus 1 MaN~----llT~ip~iia~~al~~l~~~lV~~~lVnr~y~~e~~~~k~GDTV~I~~p~~~~~~~~~~~~~~~~~~~~l~e 76 (423) T protein:vir:17 1 MPNN----LDSNVSQIVLKKFLPGFMSDLVLAKTVDRQLLAGEINSSTGDSVSFKRPHQFSSLRTPTGDISGQNKNNLIS 76 (423) T ss_pred Cccc----hhhhhHHHHHHHHHHHHHhhcccchhhcccCCcchhhcccCCEEEEeeCCcceeecccCcccCCcccCcccc Confidence 1111 01136899999999999999999888766554432 2566664332111111 12333344455666 Q ss_pred eeEEeeeeeeee-eehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccccccccccchhH Q lcl|NC_016164. 634 DQVALVAKTLGA-YTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTENFGATNPTY 712 (836) Q Consensus 634 ~~it~~~~t~~~-~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~aa~~~t~ 712 (836) .++.+.+.+.-. -+.++.+=+..+..+++.++ +.-.++++..+|..++... ....+ +..+. .+.....| T Consensus 77 ~~v~l~id~~k~va~~v~d~E~~~~i~~~~~~l-~~A~~aLA~~vd~~ia~~~-~~~a~----~~~gt----~~t~~~a~ 146 (423) T protein:vir:17 77 GKATGRVGNYITVAVEYQQLEEAIKLNQLEEIL-APVRQRIVTDLETELAHFM-MNNGA----LSLGS----PNTPITKW 146 (423) T ss_pred ceeEEEeeceeeeeeeecHHHHhcChhHHHHHH-HHHHHHHHHHHHHHHHHHH-hhccc----ccccc----CCcccccH Confidence 666655544332 34455543344455565544 4456889999998776431 11001 00111 11111248 Q ss_pred HHHHHHHHHHhhhccccCccEEEecHHHHHHHHHH----hhcc--CccccccCC-CCeecceeeEeeCccccceEE-EEe Q lcl|NC_016164. 713 VELVSMESKVAADNADIGAMSYLTNSTLYGGFKTT----EKAT--STAQFVLEP-GGTVNGYNVVRSNQVANGDVF-FGV 784 (836) Q Consensus 713 ~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~l----kd~~--g~~~~~~~~-~~~l~G~pVv~s~~~~~~~i~-~gD 784 (836) +++.++...|...+.....-..+++|..+..|... ...+ +...+..+. .+++.|+.|+.|+.+|..+.. ++- T Consensus 147 ~~i~~a~~~Ld~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~alr~g~i~G~i~GFdvy~Snnip~~T~gt~~~ 226 (423) T protein:vir:17 147 SDVAQTASFLKDLGVNEGENYAVMDPWSAQRLADAQTGLHASDQLVRTAWENAQIPTNFGGIRALMSNGLASRTQGAFGG 226 (423) T ss_pred HHHHHHHHHHHhccCCcCCCEEEeChHHHHHHhccccceecccccchHHHhhccceeeecceEEEEeCCCccccccceec Confidence 89999999999988887778899999998776532 1111 122344443 378999999999999954321 110 Q ss_pred h---------hceEEEe----ecceEEEEeccccc--ccCcEEEEE---EEEeccEE------EcccceEEEeec----- Q lcl|NC_016164. 785 W---------NQMIMGM----WGALDIQVNPYALD--KSGSVRVTA---LQDVDVAV------RHPEAFCRGNDN----- 835 (836) Q Consensus 785 ~---------s~~~i~~----~~~l~i~~~~~~~~--~~~~~~~r~---~~r~d~~v------~~p~Af~~l~~A----- 835 (836) . .+..... ..++...+...+++ ..+.+.|-. ..+....+ -...-|.+..++ T Consensus 227 t~~~~~~~~v~~~a~~~~~~~~~~~~~~~~~~~g~l~~GD~~t~aGv~~v~~~tk~v~~~~~t~~~~~~~v~~~~~~~a~ 306 (423) T protein:vir:17 227 TLTVKTQPTVTYNAVKDSYQFTVTLTGATTSVTGFLKAGDQVKFTNTYWLQQQTKQALYNGATPISFTATVTADANSDSS 306 (423) T ss_pred eeeecccccccccccccccceeeeeeeeeeeccCceeecceEEecceeeecccccccccccccccceEEEEEeccccccc Confidence 0 0000000 01111111111111 011111110 00000000 011222222111 Q ss_pred ----------C Q lcl|NC_016164. 836 ----------L 836 (836) Q Consensus 836 ----------~ 836 (836) . T Consensus 307 ~~~tv~i~p~~ 317 (423) T protein:vir:17 307 GDVTVTLSGVP 317 (423) T ss_pred CceEEEecCcc Confidence 0 No 183 >protein:vir:94989 Length: 349 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224029;genbank:gi:62327316;genbank:GeneID:5176817 Probab=97.75 E-value=2.2e-05 Score=46.11 Aligned_cols=261 Identities=11% Similarity=0.080 Sum_probs=125.3 Q ss_pred hhcccccccccccch--hhHHHHHHHHHhhhhhhhhccee--------eecCCceEEEEEecC-Cce--eeeeccC--cc Q lcl|NC_016164. 561 VVDTASAAGDLVFTD--GRPGSFIELLRNRLALNTLGVTM--------LTGLQGPVAIPRQTG-AAT--AYWVAEG--GD 625 (836) Q Consensus 561 ~~~~~~~~g~~vvp~--~~~~~ii~~l~~~~~l~~l~~~~--------~~~~~~~~~~p~~~~-~~~--a~~v~Eg--~~ 625 (836) ++ .+.-...++|+ .+...+.+...+.+.|.+-+... ....+..+++|.... ... ..+.+.. +. T Consensus 1 Ma--~T~l~D~iipe~~vf~~Yv~~~~~e~~~l~qSGii~~d~~l~~~~~~gG~~~~iPf~~~l~g~~e~n~~~dt~~~~ 78 (349) T protein:vir:94 1 MA--ITTIGNIVTGNIPVLASYMTEDPVEKTAFFNSGILTPTPYAAEIARGPSNIANLPFWKAIDTSIEPNYSNDVYQDI 78 (349) T ss_pred CC--ceEEeeeeccChHHHHHHHHHhHHHhhhhhhccceeccHHHHHHHhcCCCEEEeeeeecCCCCcccccCCCCcccc Confidence 11 12234567777 57777766666666666533221 112234566665532 111 1111111 12 Q ss_pred cccccccceeEEeeeeeeeee---ehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCccccccccccc---- Q lcl|NC_016164. 626 PTESQPSVDQVALVAKTLGAY---TEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVT---- 698 (836) Q Consensus 626 ~~~~~~~~~~it~~~~t~~~~---i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~---- 698 (836) .+.++.+-++......-.+.. ..++..+- -+ +....|.++++....+.....+|.- .+|+|... T Consensus 79 ~t~~kit~~~~~a~~~~r~kaw~~~Dla~~ls-G~--dpm~~Ia~~va~yW~r~~q~~Lia~------L~Gvf~~~~~~~ 149 (349) T protein:vir:94 79 ATPRAIQTGEMMARVAYLNEGFGQADLTVELT-SQ--NPLQSVASRLDNFWQRQAQRRLIAT------ALGLYNDNVSAT 149 (349) T ss_pred cccccccccceeeeeeeeccccchhHHHHHhh-Cc--hHHHHHHHHHHHHHhhHHHHHHHHH------HHhhhccccccc Confidence 222333322222222222222 23444432 22 4457788888877777666655431 12222211 Q ss_pred -------ccccccccccchhHHHHHHHHHHHhhhcc---ccCccEEEecHHHHHHHHHHhhccC-ccccccCCCCeecce Q lcl|NC_016164. 699 -------GINTENFGATNPTYVELVSMESKVAADNA---DIGAMSYLTNSTLYGGFKTTEKATS-TAQFVLEPGGTVNGY 767 (836) Q Consensus 699 -------~~~~~t~aa~~~t~~~l~~a~~~l~~~~~---~~~~~~~vmnp~~~~~L~~lkd~~g-~~~~~~~~~~~l~G~ 767 (836) .......+.+.++...+..+..+|-.+-. ...-..++||+.++..|+.++--.- ++.--...-++++|+ T Consensus 150 ~~~~~~~~~~~d~~~~a~~~~~~~~~A~~~~Gdaa~Gd~~~~lt~i~mHS~v~~~L~~~~li~~i~~s~~~~~i~ty~G~ 229 (349) T protein:vir:94 150 DAYHEQNDMVVDVSATSGFDAGAFIDATQTMGDALMGNGGEVLGAIAMHSFVYAQARKAQLIDFIRDAENNTMFATYQGY 229 (349) T ss_pred ccccccCceeEEecccCCCChhhHHHHHHHHHHHhccccccceeEEEEchHHHHHHHhcchhhhccCcccCcccceecCc Confidence 11111112233566677777766544311 1123689999999998876532110 000001123578999 Q ss_pred eeEeeCccccc---------eEEEEehhceEEEeec---ceEEEEecccccccCcEEEEEEEEeccEEEcccceEEEeec Q lcl|NC_016164. 768 NVVRSNQVANG---------DVFFGVWNQMIMGMWG---ALDIQVNPYALDKSGSVRVTALQDVDVAVRHPEAFCRGNDN 835 (836) Q Consensus 768 pVv~s~~~~~~---------~i~~gD~s~~~i~~~~---~l~i~~~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A 835 (836) +|++++.||.. ..+||. ..+.+..-+ .++...++-..-..++..+..+.++ +++|..+..-..+ T Consensus 230 ~VivDD~~Pv~~~g~~~~yttylfg~-GAi~~~~~~~~~~~E~~rd~~~g~~~G~d~L~~R~~~---~~hp~G~s~~~a~ 305 (349) T protein:vir:94 230 RVIVDDSMTVVGQDTSRKFISIIFGQ-GAIGYGEGNPEMPLEYEREASRANGGGVETLWTRKTW---LLHPFGYSFTSAV 305 (349) T ss_pred EEEEeCCCccccCCCCceEEEEEeec-ceEEeecCCCCcceeeecccccCCcceeEEEEEeeEE---Eeeeeeeeecccc Confidence 99999999841 134552 223333322 2344444433333456666666655 6677777776655 Q ss_pred C Q lcl|NC_016164. 836 L 836 (836) Q Consensus 836 ~ 836 (836) + T Consensus 306 v 306 (349) T protein:vir:94 306 I 306 (349) T ss_pred c Confidence 4 No 184 >protein:vir:78387 Length: 349 # NCBI annotation: putative coat protein # Family: family:all:1522 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110837;genbank:gi:134288598;genbank:GeneID:5179650 Probab=97.72 E-value=2.5e-05 Score=45.81 Aligned_cols=266 Identities=11% Similarity=0.066 Sum_probs=125.5 Q ss_pred hhcccccccccccch--hhHHHHHHHHHhhhhhhhhccee--------eecCCceEEEEEecC--C-ceeeeeccC--cc Q lcl|NC_016164. 561 VVDTASAAGDLVFTD--GRPGSFIELLRNRLALNTLGVTM--------LTGLQGPVAIPRQTG--A-ATAYWVAEG--GD 625 (836) Q Consensus 561 ~~~~~~~~g~~vvp~--~~~~~ii~~l~~~~~l~~l~~~~--------~~~~~~~~~~p~~~~--~-~~a~~v~Eg--~~ 625 (836) ++ .+.-..+++|+ .+...+.+...+.+.|.+-+... ....+..+++|.... + ....+...+ +. T Consensus 1 Ma--~T~l~D~iipe~~vf~~Yv~~~~~e~~~l~qSGii~~d~~l~~~~~~gG~~~~iPf~~~L~g~~e~nv~~D~~~~~ 78 (349) T protein:vir:78 1 MA--ITTIGDIVTGNIPVLASYMTEDPVEKTAFFDSGILTSTPYAAEIANGPSNIANLPFWKAIDTSIEPNYSNDVYQDI 78 (349) T ss_pred CC--ceEEeeeeccCHHHHHHHHHHhhHHhhhhhhccceeccHHHHHHhhcCCCEEEeeeeecCCCCcccccCCCCcccc Confidence 11 12234567777 57777766666666665533211 112244666665543 1 121111211 12 Q ss_pred cccccccceeEEeeeeeeeeee---hhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHh---hcCCcccccc---ccc Q lcl|NC_016164. 626 PTESQPSVDQVALVAKTLGAYT---EFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALY---GLGSNSQPEG---LKF 696 (836) Q Consensus 626 ~~~~~~~~~~it~~~~t~~~~i---~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~---G~Gt~~~p~G---i~~ 696 (836) .+..+.+-++........+... .++..+ .-+ +....|.++++....+.....++. |.-......+ .+. T Consensus 79 ~t~~kitt~~~~a~~~~r~kaw~~~Dla~~l-sG~--dpm~~Ia~~va~yW~r~~q~~Lia~L~Gvf~~~~~a~~~~~~~ 155 (349) T protein:vir:78 79 ATPRAIQTGEMMARVAYLNEGFGQADLTVEL-TSQ--NPLQSVASRLDNFWQRQAQRRLIATALGLYNDNVSATDAYHEQ 155 (349) T ss_pred cccccccccceeeeeeeeccccchhHHHHHh-hCc--hHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcccccccchhhhc Confidence 2333333333322233233333 334433 222 456778888887777666655442 2211100000 000 Q ss_pred ccccccccccccchhHHHHHHHHHHHhhhcc---ccCccEEEecHHHHHHHHHHhhccC-ccccccCCCCeecceeeEee Q lcl|NC_016164. 697 VTGINTENFGATNPTYVELVSMESKVAADNA---DIGAMSYLTNSTLYGGFKTTEKATS-TAQFVLEPGGTVNGYNVVRS 772 (836) Q Consensus 697 ~~~~~~~t~aa~~~t~~~l~~a~~~l~~~~~---~~~~~~~vmnp~~~~~L~~lkd~~g-~~~~~~~~~~~l~G~pVv~s 772 (836) ..++.. ..+.+.++...+..+...|-.+-. ...-..++||+.++..|+..+--.- ++.--...-++++|++|+++ T Consensus 156 ~~~t~d-~s~~a~~~~~~~~dA~~~lgda~~Gd~~~~lt~i~mHS~v~~~L~~~~li~~i~~s~~~~~i~ty~G~~VivD 234 (349) T protein:vir:78 156 NDMVVD-VSATLGFDAGAFIDATQTMGDALMGNGGEVLGAIAMHSFVYAQARKAQLIDFIRDAENNTMFATYQGYRVIVD 234 (349) T ss_pred ccceee-eccccCCChhhhhhhHHHHHHHhccccccceeEEEEchHHHHHHHhhhhhhhccCcccCcccceecCeEEEEe Confidence 011111 112223566777777666654311 1123689999999998876532110 00000112357899999999 Q ss_pred Cccccc---------eEEEEehhceEEEeec---ceEEEEecccccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 773 NQVANG---------DVFFGVWNQMIMGMWG---ALDIQVNPYALDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 773 ~~~~~~---------~i~~gD~s~~~i~~~~---~l~i~~~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) +.+|.. .++||. ..+.+..-+ .++...++-..-..++..+..+.++ +++|..+..-..++ T Consensus 235 D~~Pv~~~g~~~~yttylfg~-GAi~~~~~~~~~~~et~rd~~~g~~~G~d~l~~R~~~---~~hp~G~s~~~a~v 306 (349) T protein:vir:78 235 DSMTVVGQGAQRKFISIIFGQ-GAIGYGEGNPVMPLEYEREASRANGGGVETLWTRKTW---LLHPFGYRFTSAVI 306 (349) T ss_pred CCCccccCCCCceEEEEEeec-ceEEEccCCCccceeeecccccCCcceeEEEEEeeEE---Eeeeeeeeeccccc Confidence 999842 134542 223232222 2344444433333456666666665 56777776665544 No 185 >protein:vir:3643 Length: 336 # NCBI annotation: gp12 # Family: family:all:1653 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705638;genbank:gi:23752323;genbank:GeneID:955719 Probab=97.69 E-value=4.2e-06 Score=50.04 Aligned_cols=297 Identities=11% Similarity=0.023 Sum_probs=146.3 Q ss_pred HHHhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhhh------hhhhhhhhhhcccccccccccchhhHH- Q lcl|NC_016164. 507 FSFVRAIRAQMMPGDRAAFEAAAFEREVSEATAQRMGVTPRGILAP------NDVLHRDLVVDTASAAGDLVFTDGRPG- 579 (836) Q Consensus 507 ~~~~~a~~a~~~~~~~~~~~~~~~~~~~a~~~~~~~g~~~~g~~~~------~~~~~~a~~~~~~~~~g~~vvp~~~~~- 579 (836) +..... -..+++.|....+.... ...+..+.......+.+..-+|+.+.. T Consensus 1 ~~~~~~-----------------------~~~l~~~gi~~~~~~~~~~~~~~~~~~da~d~~~~~~~~~~~~~~~~l~~~ 57 (336) T protein:vir:36 1 MRDAQR-----------------------IQNLARAGVILPRSVQNVSTPLTEYAMDAADLSPHLSSTGSSGIPNYLTTY 57 (336) T ss_pred CchHHH-----------------------HHHHhhcCeeecchhhhhhhHHHHhhhhhhhccCccccCCCcchHHHHHHh Confidence 000000 11112223222211100 011111111111112222334555554 Q ss_pred ---HHHHHHHhhhhhhhhcceeeecC--CceEEEEEecCCceeeeeccCcccccccccceeEEeeeeeeeeeehhHH-HH Q lcl|NC_016164. 580 ---SFIELLRNRLALNTLGVTMLTGL--QGPVAIPRQTGAATAYWVAEGGDPTESQPSVDQVALVAKTLGAYTEFSR-RL 653 (836) Q Consensus 580 ---~ii~~l~~~~~l~~l~~~~~~~~--~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~it~~~~t~~~~i~ISr-el 653 (836) .+++.+.+......+......+. ...+.++.......+.+++-+...|..+......+.+++.++..+.++. |+ T Consensus 58 i~p~~~~~~~~~~~~~~l~pv~t~g~W~~~~~~~~~~e~~G~a~~ygd~~D~P~~d~~~~~~~~~v~~~~~g~~yg~~E~ 137 (336) T protein:vir:36 58 VDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTKVATYGDYSSDGDSGANINYPQRQSYFFQTWTRWGEREL 137 (336) T ss_pred hccceEeeecchhhhhhhccccccCCccceeEEEeeeeceeeEEEeeccCCCceeecccceeeeeEEEEEeeeeeCHHHH Confidence 23344444444444433221121 1355666666677788888888899999888888889999999999884 43 Q ss_pred Hhc--chhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccccccc-cc-----c--chhHHHHHHHHHHHh Q lcl|NC_016164. 654 MLQ--SSIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTENF-GA-----T--NPTYVELVSMESKVA 723 (836) Q Consensus 654 L~d--s~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~-aa-----~--~~t~~~l~~a~~~l~ 723 (836) -.- ...++.+.-.....+++.+++|...++|+.. ....|++|....+...+ ++ + .-.++||.+++..+. T Consensus 138 ~~Aa~~~~~l~~~Ka~aA~~ale~~~N~i~~~Gd~~-~~~yGllNdP~l~a~~t~~t~~~~~~t~~ei~~Di~~~~~~l~ 216 (336) T protein:vir:36 138 EMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAG-LENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVLQ 216 (336) T ss_pred HHHHHhCCCcHHHHHHHHHHHHHHhhCcEEEEeccc-cceEEEEecCCCccccccCCCcccccCHHHHHHHHHHHHHHHH Confidence 322 1235556667777777777777777788753 44578988766542111 11 1 224789999999998 Q ss_pred hhccc----cCccEEEecHHHHHHHHHHhhccCccc--cccCCCCeecceeeEeeCccccceEEEEehhceEEEeecc-- Q lcl|NC_016164. 724 ADNAD----IGAMSYLTNSTLYGGFKTTEKATSTAQ--FVLEPGGTVNGYNVVRSNQVANGDVFFGVWNQMIMGMWGA-- 795 (836) Q Consensus 724 ~~~~~----~~~~~~vmnp~~~~~L~~lkd~~g~~~--~~~~~~~~l~G~pVv~s~~~~~~~i~~gD~s~~~i~~~~~-- 795 (836) .+... ..+..++|.|..+..|... ...|... |+-.. +-++.++..+.+.... |+..++.+....+ T Consensus 217 ~qt~G~i~~~~~~tL~LP~~~~~~Ls~~-n~~g~Tvl~~lk~n---~Pnl~i~t~pEl~~a~---g~~~~l~~~~~~~~~ 289 (336) T protein:vir:36 217 TQSQGIITQEDVLRMGLPPTAMSDLSKT-NQYGLAAAAKLKDI---FPKLEFVTIPEYDTAS---GRLVQLWAPRVEGKD 289 (336) T ss_pred HhcCCeeeeccccEEEechHHHHhccCC-CccCccHHHHHHHh---cCccEEEEccccccCC---CceEEEEEEecCCCc Confidence 87532 2367899999988777432 2222111 11111 1112233333222110 1111111111111 Q ss_pred -eEEEEecc------cccccCcEEEEEEEEecc-EEEcccceEEEeec Q lcl|NC_016164. 796 -LDIQVNPY------ALDKSGSVRVTALQDVDV-AVRHPEAFCRGNDN 835 (836) Q Consensus 796 -l~i~~~~~------~~~~~~~~~~r~~~r~d~-~v~~p~Af~~l~~A 835 (836) .++. .+. .....-.+...+..|.++ .+++|.||+.++-= T Consensus 290 t~~~~-~p~~~~~l~vq~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:36 290 TATCG-FTEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) T ss_pred ceeee-cchhhhccceeecCceeEeccccceeeeeeeccchheeeecC Confidence 1111 110 011222345555666665 66779999987754 No 186 >protein:vir:101557 Length: 336 # NCBI annotation: gp12 # Family: family:all:1653 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958117;genbank:gi:41057663;genbank:GeneID:2716814 Probab=97.63 E-value=1.2e-05 Score=47.62 Aligned_cols=297 Identities=11% Similarity=0.023 Sum_probs=147.6 Q ss_pred HHHhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhh------hhhhhhhhhhhcccccccccccchhhHHH Q lcl|NC_016164. 507 FSFVRAIRAQMMPGDRAAFEAAAFEREVSEATAQRMGVTPRGILA------PNDVLHRDLVVDTASAAGDLVFTDGRPGS 580 (836) Q Consensus 507 ~~~~~a~~a~~~~~~~~~~~~~~~~~~~a~~~~~~~g~~~~g~~~------~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ 580 (836) +.....+ ..+++.|....+... ....+..+.......+.+...+|..+..- T Consensus 1 ~~~~~~~-----------------------~~l~~~gi~~~~~~~~~~~~~~~~~~da~d~~~~~~~~~~~~i~~~l~~~ 57 (336) T protein:vir:10 1 MRDAQRI-----------------------QNLARAGVILPRSVQNVSTPLTEYAMDAADLSPHLSSTGSSGIPNYLTTY 57 (336) T ss_pred CchHHHH-----------------------HHHhhcCeeecchhhhhhhhHHHhhhhhhhccCccccCCCchhHHHHHhh Confidence 0000000 111222222211100 00111111111222222333345544432 Q ss_pred ----HHHHHHhhhhhhhhcceeeecC--CceEEEEEecCCceeeeeccCcccccccccceeEEeeeeeeeeeehhHHHHH Q lcl|NC_016164. 581 ----FIELLRNRLALNTLGVTMLTGL--QGPVAIPRQTGAATAYWVAEGGDPTESQPSVDQVALVAKTLGAYTEFSRRLM 654 (836) Q Consensus 581 ----ii~~l~~~~~l~~l~~~~~~~~--~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~it~~~~t~~~~i~ISrelL 654 (836) +++.+.+......+......+. ...+.++.......+.+++-+...|..+......+.+++.++..+.++.+=+ T Consensus 58 i~p~~~~~~~~p~~a~~l~pv~t~g~W~~~~~~~~~~e~~G~a~~ygd~~D~P~~d~~~~~~~~~v~~~~~g~~yg~~El 137 (336) T protein:vir:10 58 VDPAVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTKVATYGDYSSDGDSGANINYPQRQSYFFQTWTRWGEREL 137 (336) T ss_pred cccceeeehhhhhhhhhhccccccCCccceeEEEeeeeceeeEEEeeccCCCceeecccceeeeeEEEEEeeeeeCHHHH Confidence 2333444444444432221121 1355666666677788888888899999888888889999999999985433 Q ss_pred hcc---hhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccccccc-cc-----c--chhHHHHHHHHHHHh Q lcl|NC_016164. 655 LQS---SIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTENF-GA-----T--NPTYVELVSMESKVA 723 (836) Q Consensus 655 ~ds---~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~-aa-----~--~~t~~~l~~a~~~l~ 723 (836) ... ..++.+.-.....+++.+++|...++|+.. ....|++|....+.... ++ + .-.++||.+++..|. T Consensus 138 ~~A~~~g~~l~~~Ka~aA~~ale~~~N~i~~~Gd~~-~~~yGllN~P~l~a~~t~~t~~~~~~t~eei~~Di~~~~~~l~ 216 (336) T protein:vir:10 138 EMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAG-LENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVLQ 216 (336) T ss_pred HHHHHhCCCcHHHHHHHHHHHHHHhhCcEEEEeccc-cceEEEEeCCCCccccccCCCcccccCHHHHHHHHHHHHHHHH Confidence 322 345666777777777788888777888753 44578988766542111 11 1 224789999999998 Q ss_pred hhccc----cCccEEEecHHHHHHHHHHhhccCccc--cccCCCCeecceeeEeeCccccceEEEEehhceEEEeecc-- Q lcl|NC_016164. 724 ADNAD----IGAMSYLTNSTLYGGFKTTEKATSTAQ--FVLEPGGTVNGYNVVRSNQVANGDVFFGVWNQMIMGMWGA-- 795 (836) Q Consensus 724 ~~~~~----~~~~~~vmnp~~~~~L~~lkd~~g~~~--~~~~~~~~l~G~pVv~s~~~~~~~i~~gD~s~~~i~~~~~-- 795 (836) .+..+ ..+..++|.|..+..|... ...|... |+-.. +-++.++..+.+.... |+..++.+....+ T Consensus 217 ~qs~G~i~~~~~~tL~LP~~~~~~Ls~~-n~~g~Tvl~~lk~n---~Pnl~i~t~pEl~~a~---G~~~~l~~~~~~~~~ 289 (336) T protein:vir:10 217 TQSQGIITQEDVLRMGLPPTAMSDLSKT-NQYGLAAAAKLKDI---FPKLEFVTIPEYDTAS---GRLVQLWAPRVEGKD 289 (336) T ss_pred HhcCCeecccCcceEEecHHHHHhccCC-CccCccHHHHHHHh---cCccEEEEccccccCC---CceEEEEEEecCCCc Confidence 86532 2367899999988777432 2222111 11111 1122233333322110 1111111111111 Q ss_pred -eEEEEecc------cccccCcEEEEEEEEecc-EEEcccceEEEeec Q lcl|NC_016164. 796 -LDIQVNPY------ALDKSGSVRVTALQDVDV-AVRHPEAFCRGNDN 835 (836) Q Consensus 796 -l~i~~~~~------~~~~~~~~~~r~~~r~d~-~v~~p~Af~~l~~A 835 (836) .++. .+. .....-.+...+..|.++ .+++|.||+.++-= T Consensus 290 t~~~~-~p~~~~~l~vq~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:10 290 TATCG-FTEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) T ss_pred ceeee-cchhhhccceeecCceeEeccccceeeeeeeccchheeeecC Confidence 1110 110 011222345555666665 66779999987754 No 187 >protein:vir:5255 Length: 304 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852760;genbank:gi:31544035;uniprot:Q7Y5U0;genbank:GeneID:2753552 Probab=97.61 E-value=1.2e-05 Score=47.52 Aligned_cols=265 Identities=8% Similarity=-0.014 Sum_probs=138.5 Q ss_pred ccccccccch--hhHHHHHHHHHhhhhhhhhcce--eeecCCceEEEEEecCCceee--eecc-CcccccccccceeEEe Q lcl|NC_016164. 566 SAAGDLVFTD--GRPGSFIELLRNRLALNTLGVT--MLTGLQGPVAIPRQTGAATAY--WVAE-GGDPTESQPSVDQVAL 638 (836) Q Consensus 566 ~~~g~~vvp~--~~~~~ii~~l~~~~~l~~l~~~--~~~~~~~~~~~p~~~~~~~a~--~v~E-g~~~~~~~~~~~~it~ 638 (836) -++..+++.+ .+...+.+...+....+++... ..+..-..+.+......+.+. |++- +..+|..+..+++... T Consensus 1 ~~~lafl~~qL~~id~~vye~~~~~~~~~~lipv~t~~~~~~~~~~~~~~d~~G~a~~~~i~~~a~dip~vd~~~~~~~~ 80 (304) T protein:vir:52 1 MSLLAYVKNGLTAVSKDIAETKYPEIVFPQFVYVDQQTAVGITEKLHYGADEHGSLDDGLITVGTSTLDQVEVGFTPTRS 80 (304) T ss_pred CchHHHHHHHHHHHhhhhhccccccchhhhhccccCCCCcccceEEEeeeeccCcccccccCCcCCccceeecccceeEE Confidence 1111111111 1112233322232233333221 111122345555555555555 7654 4668999999999999 Q ss_pred eeeeeeeeehhHHHHHhcc---hhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccccccccc-------- Q lcl|NC_016164. 639 VAKTLGAYTEFSRRLMLQS---SIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTENFGA-------- 707 (836) Q Consensus 639 ~~~t~~~~i~ISrelL~ds---~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~aa-------- 707 (836) .++.++.-+.+|.+=|..+ ..++.+.=.....+++...+|+.+|+|+-......||+|..++...+.++ T Consensus 81 ~i~~~~~~~~y~~~El~~a~~~g~~l~~~ka~aa~~a~~~~~n~v~~~Gd~~~~g~~GllN~p~v~~~~~~~~~a~~~w~ 160 (304) T protein:vir:52 81 YIVPWAKSVTWTKPELEQGKLLGLALNTAKIMALNKNAQQTLQKVAFLGHAKDSRLTGLLNNKSVEVYAIKGAAQNTKVQ 160 (304) T ss_pred EEEEEeeeeeecHHHHHHHHHhCCCcHHHHHHHHHHHHHhhhceEEEEeeccccceEEEEeCCCcceeeecCCccCCccc Confidence 9999999888877545432 23566666677777888888899999974323467999988776443221 Q ss_pred ---cchhHHHHHHHHHHHhhhccc-cCccEEEecHHHHHHHHHHhhccCcc-c--cccCCCCeecceee--EeeC--ccc Q lcl|NC_016164. 708 ---TNPTYVELVSMESKVAADNAD-IGAMSYLTNSTLYGGFKTTEKATSTA-Q--FVLEPGGTVNGYNV--VRSN--QVA 776 (836) Q Consensus 708 ---~~~t~~~l~~a~~~l~~~~~~-~~~~~~vmnp~~~~~L~~lkd~~g~~-~--~~~~~~~~l~G~pV--v~s~--~~~ 776 (836) ..-.+++|.+++.++..+... ..+..++|.|+.+..|....-.++.. . |+.....-..|.|+ ...+ ... T Consensus 161 ~~T~~eI~~di~~~~~~i~~~s~~~~~p~tl~Lpp~~~~~l~~~~~~~~~~Tvl~~l~~n~~~~~g~~l~I~~v~~~~~~ 240 (304) T protein:vir:52 161 AMDFDKAVAFFKEIFLKGMEKTKRIEAPNTFAIDSLDLAHLALVQRANTDTTALEFLTKHLSAAAGRQVAIKALPSNYGT 240 (304) T ss_pred cCCHHHHHHHHHHHHHHHHhccCceecCceEEeCHHHHHHHhhccCCCCCchHHHHHHHhcccccCCcceEEEecccccc Confidence 111356777888887665443 34568999999999886653333321 1 12111111233332 2111 111 Q ss_pred cc-----eEEEEeh--hceEEEeecceEEEEecccccccCc--EEEEEEEEecc-EEEcccceEEEee Q lcl|NC_016164. 777 NG-----DVFFGVW--NQMIMGMWGALDIQVNPYALDKSGS--VRVTALQDVDV-AVRHPEAFCRGND 834 (836) Q Consensus 777 ~~-----~i~~gD~--s~~~i~~~~~l~i~~~~~~~~~~~~--~~~r~~~r~d~-~v~~p~Af~~l~~ 834 (836) .+ .+++-+- ..+.+- -.+.+...+- ..+|. +.+=.+.|+++ .++.|.||+.++- T Consensus 241 ~g~~g~~r~vvY~~d~~~~~~~--vP~p~~~l~~--q~~~~~~~~vp~~~r~gGv~v~~P~a~~y~D~ 304 (304) T protein:vir:52 241 RVTDGKTRAMVYVNSKEHVIFD--VPMSPTVLDA--QPKGLLAFESGLRMAFGGVTFMEPDSALYVDY 304 (304) T ss_pred cCCCCceEEEEEecChhheEEe--cCccccccch--hhcCCceEEecceeeeeeEEEEccceeeeecC Confidence 11 1222222 112121 1111111111 12232 33335666666 7788999999999 No 188 >protein:vir:78558 Length: 336 # NCBI annotation: major capsid protein # Family: family:all:1653 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294848;genbank:gi:149882911;genbank:GeneID:5291029 Probab=97.42 E-value=2.4e-05 Score=45.91 Aligned_cols=297 Identities=12% Similarity=0.038 Sum_probs=147.8 Q ss_pred HHHhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhh------hhhhhhhhhhhcccccccccccchhhHH- Q lcl|NC_016164. 507 FSFVRAIRAQMMPGDRAAFEAAAFEREVSEATAQRMGVTPRGILA------PNDVLHRDLVVDTASAAGDLVFTDGRPG- 579 (836) Q Consensus 507 ~~~~~a~~a~~~~~~~~~~~~~~~~~~~a~~~~~~~g~~~~g~~~------~~~~~~~a~~~~~~~~~g~~vvp~~~~~- 579 (836) +.....+ ..+++.|....+... ....+..+.......+.+..-+|..+.. T Consensus 1 ~~~~~~~-----------------------~~l~~~gi~~~~~~~~~~~~~~~~a~da~d~~~~~~t~~~~g~~~~l~~~ 57 (336) T protein:vir:78 1 MRDAQRI-----------------------QNLARAGVILPRSVKNVSTPLAEYAMDAADLSPHLSSTGSSGIPNYLTTY 57 (336) T ss_pred CchHHHH-----------------------HHHhccCeecchhhhhhhHHHHHHHHhhhhhccccccCCCcchHHHHHHh Confidence 0000011 111122222211100 0011111111111122222224444443 Q ss_pred ---HHHHHHHhhhhhhhhcceeeecC--CceEEEEEecCCceeeeeccCcccccccccceeEEeeeeeeeeeehhHHHHH Q lcl|NC_016164. 580 ---SFIELLRNRLALNTLGVTMLTGL--QGPVAIPRQTGAATAYWVAEGGDPTESQPSVDQVALVAKTLGAYTEFSRRLM 654 (836) Q Consensus 580 ---~ii~~l~~~~~l~~l~~~~~~~~--~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~it~~~~t~~~~i~ISrelL 654 (836) .+++.+.+......+......+. ...+.++.......+.+++-+...|..+...+..+-+++.++..+.++.+=+ T Consensus 58 i~p~~~~~~~~~~~~~~l~~v~t~g~W~~~~~~~~~~e~~G~a~~ygd~~D~P~vd~~~~~~~~~v~~~~~g~~yg~~El 137 (336) T protein:vir:78 58 VDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTTVATYGDYSSDGDSGTNINYPQRQSYFFQTWTRWGEREL 137 (336) T ss_pred cccceeeehhhhhhhhhhcccccCCCccccEEEEeeeecceeeEEeecccCCCeeecceeeEEEEEEEEEeeeeecHHHH Confidence 22344444444444433222222 1466777777778888889888999999999999999999999999886544 Q ss_pred hcc---hhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccccccc-cc-----c--chhHHHHHHHHHHHh Q lcl|NC_016164. 655 LQS---SIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTENF-GA-----T--NPTYVELVSMESKVA 723 (836) Q Consensus 655 ~ds---~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~-aa-----~--~~t~~~l~~a~~~l~ 723 (836) ... ..++.+.-.....+++.+++|...++|+.. ....|++|....+.... ++ + .-.++||..++..+. T Consensus 138 ~~A~~~g~~l~~~Ka~aA~~ale~~~N~~~~~Gd~~-~~~~GllN~P~l~a~~t~~~~~w~~~T~~~I~~Di~~~~~~l~ 216 (336) T protein:vir:78 138 EMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAG-LENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQ 216 (336) T ss_pred HHHHHhCCCcHHHHHHHHHHHHHHhhCeEEEEeccc-cceEEEEeCCCCCcccccCcCcccccCHHHHHHHHHHHHHHHH Confidence 432 345666667777777777777777888753 45678998766543221 11 1 224678999999887 Q ss_pred hhcccc----CccEEEecHHHHHHHHHHhhccCccc--cccCCCCeecceeeEeeCccccceEEEEehhceEEEeecc-- Q lcl|NC_016164. 724 ADNADI----GAMSYLTNSTLYGGFKTTEKATSTAQ--FVLEPGGTVNGYNVVRSNQVANGDVFFGVWNQMIMGMWGA-- 795 (836) Q Consensus 724 ~~~~~~----~~~~~vmnp~~~~~L~~lkd~~g~~~--~~~~~~~~l~G~pVv~s~~~~~~~i~~gD~s~~~i~~~~~-- 795 (836) .+.... .+..++|.|..+..|... ...|... |+-.. .-++.++..+.+.... |+..+++..+..+ T Consensus 217 ~qt~g~~~~~~~~tL~Lp~~~~~~L~~~-n~~g~tv~~~lk~n---~Pnl~i~t~pel~~Ag---g~~~~~~~~~~~~~~ 289 (336) T protein:vir:78 217 TQSQGIITQEAVLHMGLPPTAMSDLSKT-NQYGLSAAAKLKEI---FPKLEFVTIPEYDTAS---GRLVQLWAPRVEGKD 289 (336) T ss_pred HhcCCeeeeccceEEEechHHHHhccCC-CccCccHHHHHHHh---cCccEEEEcccccccC---cceEEEEEeeccCCc Confidence 765321 245789999998888543 2222111 11111 1112333333322110 1111111111110 Q ss_pred -eEEEEecc------cccccCcEEEEEEEEecc-EEEcccceEEEeec Q lcl|NC_016164. 796 -LDIQVNPY------ALDKSGSVRVTALQDVDV-AVRHPEAFCRGNDN 835 (836) Q Consensus 796 -l~i~~~~~------~~~~~~~~~~r~~~r~d~-~v~~p~Af~~l~~A 835 (836) +++. .+. .......+......|.++ -+++|.||+.++-= T Consensus 290 t~~~~-~p~~f~~lpvq~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:78 290 TATCG-FTEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) T ss_pred ceeee-cchhhhccceeecCceeEeccccceeeeeeeccchheeeccC Confidence 1111 010 011223344555566655 66779999887744 No 189 >protein:vir:107120 Length: 329 # NCBI annotation: conserved phage protein # Family: family:all:701 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950606;genbank:gi:119953686;genbank:GeneID:4643129 Probab=97.38 E-value=7.9e-05 Score=43.07 Aligned_cols=290 Identities=9% Similarity=0.069 Sum_probs=132.7 Q ss_pred hhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhh-hcc-e Q lcl|NC_016164. 520 GDRAAFEAAAFEREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNT-LGV-T 597 (836) Q Consensus 520 ~~~~~~~~~~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~-l~~-~ 597 (836) .++. +. .....+.++.....|.-... +.. ...-..--+.+...+-+...+-+.+........ +.. . T Consensus 1 ~~~~--~~-~~~~~~~~~~~~~~~~~~~~-------~~~--~~~~~~~~nt~~l~~k~~~~LD~~~~~~~~s~~~~~N~~ 68 (329) T protein:vir:10 1 MDGI--FI-TGVKTMNKEIKNATGKLKLN-------LQH--FANKSVEPGDTLLKNKHVGILEKVTAANSYSAPAVISND 68 (329) T ss_pred CCce--EE-echhhhhhhhhcccceeEEe-------hhh--hcCCccCCchhHHHHHHHHHHHHHHHhhceeeeeecccc Confidence 0000 00 00011111111111110000 000 000000111122233333333222222211111 111 1 Q ss_pred eeecCCceEEEEEecCCceeeeeccCcccccccccceeEEeeeeeeeeeehhHHHHH--hcch--hHHHHHHHHHHHHHH Q lcl|NC_016164. 598 MLTGLQGPVAIPRQTGAATAYWVAEGGDPTESQPSVDQVALVAKTLGAYTEFSRRLM--LQSS--IDVEQMVRTELATVI 673 (836) Q Consensus 598 ~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~it~~~~t~~~~i~ISrelL--~ds~--~~l~~~i~~~l~~a~ 673 (836) ......+.+.+|+.+. +...-+.-++....+.++.+..++.+.. .+.+.+.=+-+ ..+. +.+...+...+...+ T Consensus 69 ~e~~~g~tVkIp~i~~-~gl~DY~R~~g~~~g~vt~~~~t~tidq-dR~~~F~VD~~D~dEtn~~l~a~~i~~~~~~~~v 146 (329) T protein:vir:10 69 AIFMQGRSFTVIKGDV-TELKDYKRNATNEFDHPQIQETTYFLDQ-EKYWGRFVDALDRRDTEGNIDINYVVAKQASEVV 146 (329) T ss_pred eeeccCcEEEEeeecc-cccccccCCCCccccccccceeEEEeec-ccceeeecchhhHhhhhhhhhHHHHHHHHHHHHh Confidence 2334567899998865 4444555445556666666666655543 22222221111 1111 122333444555666 Q ss_pred HHHHHHHHHhhcCCcccccccccccccccccccccchhHHHHHHHHHHHhhhccccCccEEEecHHHHHHHHHHhhcc-- Q lcl|NC_016164. 674 ALEIDRAALYGLGSNSQPEGLKFVTGINTENFGATNPTYVELVSMESKVAADNADIGAMSYLTNSTLYGGFKTTEKAT-- 751 (836) Q Consensus 674 a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~aa~~~t~~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~~-- 751 (836) +-.+|...+.-.-++. +....+..+..-.|+.|.+++..|..+... .+-.++++|..+..|....... T Consensus 147 ~pEiDay~~skla~~a---------~~~~~~~~t~~nay~~i~~a~~~Lde~~vp-~~Rvl~VtP~~~~~Lk~~~~f~~~ 216 (329) T protein:vir:10 147 APYLDNLRFATLARNK---------AKHLTVGSGADAQYDAVLDVSVELDEIGAG-ASRILFVTPKFYKGIKKFVIELPQ 216 (329) T ss_pred hhHHHHHHHHHHHhhc---------ccccccccCHHHHHHHHHHHHHHHHhcCCC-CCcEEEeCHHHHHHHHhhhhhhcc Confidence 6667765443221110 001111112233588999999999877544 4568889999999887532211 Q ss_pred --Cc-cccccCCCCeecceeeEeeCc--cccceEEEEehhceEEEe-ecceEEEEecccccccCcEEEEEEEEeccEEEc Q lcl|NC_016164. 752 --ST-AQFVLEPGGTVNGYNVVRSNQ--VANGDVFFGVWNQMIMGM-WGALDIQVNPYALDKSGSVRVTALQDVDVAVRH 825 (836) Q Consensus 752 --g~-~~~~~~~~~~l~G~pVv~s~~--~~~~~i~~gD~s~~~i~~-~~~l~i~~~~~~~~~~~~~~~r~~~r~d~~v~~ 825 (836) +. ...+.+.-++|.|++|+.++. +....+++|..+.+.... ...+++....... ....++....+|+.|.+ T Consensus 217 ~~~~~~~~~~g~Vg~idG~~Ii~vps~~~k~in~ii~~~~A~~~~~K~~~~~~~~p~~~~---~a~~v~gr~yyd~~V~~ 293 (329) T protein:vir:10 217 GDNRQQVLGKGVQGELDGFTIVKVPSKMLQGVEAMAVIGEVMASPIQANEAKLNSNVPGM---FGTLAEQMLYTGAFVPE 293 (329) T ss_pred ccccccceeeeeeeeecCeEEEEecCCcccceeEEEEcCCceeeeeeeeeeeeeCCCCcc---chheeeeeeeeeeEEEc Confidence 11 122344556899999987643 444456777765543322 2234443322222 33688889999999999 Q ss_pred ccceEEE---eecC Q lcl|NC_016164. 826 PEAFCRG---NDNL 836 (836) Q Consensus 826 p~Af~~l---~~A~ 836 (836) |++..+. +.|. T Consensus 294 ~k~~~I~~~~~~a~ 307 (329) T protein:vir:10 294 HLQKYIFTIGGKEV 307 (329) T ss_pred cccCEEEEecccCc Confidence 9864443 3333 No 190 >protein:vir:94070 Length: 339 # NCBI annotation: putative structural protein # Family: family:all:1653 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453625;genbank:gi:84662661;genbank:GeneID:5142580 Probab=97.28 E-value=7.5e-05 Score=43.19 Aligned_cols=301 Identities=10% Similarity=0.032 Sum_probs=142.8 Q ss_pred hhhhhhhhHHHHHHHhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhhh-------hhhhhhhhhhccccc Q lcl|NC_016164. 495 ADIGLTDKEARSFSFVRAIRAQMMPGDRAAFEAAAFEREVSEATAQRMGVTPRGILAP-------NDVLHRDLVVDTASA 567 (836) Q Consensus 495 ~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~~~~~a~~~~~~~g~~~~g~~~~-------~~~~~~a~~~~~~~~ 567 (836) -.+.+.... ..-+++.|....+.... ...+..+.......+ T Consensus 1 ~~~~~~~~~--------------------------------~~~l~~~g~~~~~~~~~~~~~~~~~~a~d~~~~~~~~~~ 48 (339) T protein:vir:94 1 MSINNDRTD--------------------------------IKQLEKVGIIFDGYSPKSISSEVSAYAMDAVNLTPTLQT 48 (339) T ss_pred CceechHHH--------------------------------HHHHHhhceeeccchhhhcchhhHhhhcccccccccccc Confidence 000000000 00111122211110000 001111100000111 Q ss_pred ccccc----cchhhHHHHHHHHHhhhhhhhhcceeeecC--CceEEEEEecCCceeeeeccCcccccccccceeEEeeee Q lcl|NC_016164. 568 AGDLV----FTDGRPGSFIELLRNRLALNTLGVTMLTGL--QGPVAIPRQTGAATAYWVAEGGDPTESQPSVDQVALVAK 641 (836) Q Consensus 568 ~g~~v----vp~~~~~~ii~~l~~~~~l~~l~~~~~~~~--~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~it~~~~ 641 (836) ....- ..+.+...+++.+.+....+.+......+. ...+.++.....+.+.+++.+...|..+...+...-++. T Consensus 49 ~~~~~i~a~~~~~i~~~vy~~~~~~~~~~~l~pv~t~g~w~~~t~~y~~~e~~G~a~~ygd~ad~Pl~~~~v~~~~~~v~ 128 (339) T protein:vir:94 49 TANAGIPAWMTTFVDRRVIDIQLAPMAAAKIFPEVKKGDWTTTYGVFIIAEPVGQVATYSDWSANGMSKANVNFESRQNY 128 (339) T ss_pred ccccchhhhhhhhhchhheeecccccchhhhcccccCCCCcccEEEEeeeecccceEEcccccCCCcccccceeeEEeEE Confidence 11111 223333445555666555555544332232 356888888888899999998888877755444444455 Q ss_pred eeeeeehhHHHHHhc---chhHHHHHHHHHHHHHHHHHHHHHHHhhcCCccccccccccccccccccc-c----cc--hh Q lcl|NC_016164. 642 TLGAYTEFSRRLMLQ---SSIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTENFG-A----TN--PT 711 (836) Q Consensus 642 t~~~~i~ISrelL~d---s~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~a-a----~~--~t 711 (836) .+...+.++..=+.. ...++.+.-.....+++.+.+|+..++|+.. ....||+|...+.....+ + .+ -. T Consensus 129 ~~~~g~~y~~~E~~~A~~~g~~l~~~Ka~aA~~al~~~~N~i~~~Gd~~-~~~~GLlN~P~l~~~v~~s~~Wa~kT~~eI 207 (339) T protein:vir:94 129 RYQTWTEYGDLEMATYGEAGIDYVARQEISASLVMAKFANSSYLLGVAG-IANYGLMNDPSLPAPVAATVNWATAAPEDI 207 (339) T ss_pred EEEEEEeecHHHHHHHHhhCCChHHHHHHHHHHHHHHhhceEEeeeecc-cceEEEEeCCCccccccCCCCcccCCHHHH Confidence 555444444332221 2345667777778888888888888888742 345899987766433221 1 11 23 Q ss_pred HHHHHHHHHHHhhhcccc----CccEEEecHHHHHHHHHHhhccCccc--cccCCCCeecceeeEeeCcccc---ceE-E Q lcl|NC_016164. 712 YVELVSMESKVAADNADI----GAMSYLTNSTLYGGFKTTEKATSTAQ--FVLEPGGTVNGYNVVRSNQVAN---GDV-F 781 (836) Q Consensus 712 ~~~l~~a~~~l~~~~~~~----~~~~~vmnp~~~~~L~~lkd~~g~~~--~~~~~~~~l~G~pVv~s~~~~~---~~i-~ 781 (836) ++||.+++.++..+.... .+..++|.|..+..|... ...|... |+-.. ..++.++..+.+.. +.. + T Consensus 208 ~~Di~~~~~~l~~~s~g~~~~~~~~~L~LP~~~~~~L~~~-n~~~~Tvl~~lk~n---~pnl~i~~~~el~~a~g~~~~~ 283 (339) T protein:vir:94 208 ANDVVAMVGRLISQSGGLITGQERMVMALAPSALNNVNRT-NNFGLSAGAKIAQT---YPNIQFVAVPEFDTASGRLVQL 283 (339) T ss_pred HHHHHHHHHHHHHhcCCeeeeccCcEEEecHHHHHhcccC-CcCCccHHHHHHHh---cCCcEEEEccccccCCCceEEE Confidence 678889999887764321 245799999999887643 2222211 11111 11233443333321 111 1 Q ss_pred EE-eh---hceEEEeecceEEEEecccccccCcEEEEEEEEe-ccEEEcccceEEEeec Q lcl|NC_016164. 782 FG-VW---NQMIMGMWGALDIQVNPYALDKSGSVRVTALQDV-DVAVRHPEAFCRGNDN 835 (836) Q Consensus 782 ~g-D~---s~~~i~~~~~l~i~~~~~~~~~~~~~~~r~~~r~-d~~v~~p~Af~~l~~A 835 (836) +. .. ....+..- +.+...+ .....-.+......|. |+.+++|.||+.++-= T Consensus 284 ~~~~~~~~~~~~~~~p--~~~~~lp-vq~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 339 (339) T protein:vir:94 284 WVPEVNGQPTGEVAFA--EKLRSHS-IERYSTTTRQKHSGATFGAVIYQPWAVTQELGV 339 (339) T ss_pred EEEeccCCcceEEEcc--hhhhccc-cEEcCceEEecceeeeeeEEEEccceeeeeecC Confidence 11 10 01111110 0111111 0112233555666674 4477889999987754 No 191 >protein:vir:105522 Length: 423 # NCBI annotation: phage major head protein # Family: family:all:1412 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516191;genbank:gi:89885994;genbank:GeneID:3964382 Probab=97.25 E-value=0.00012 Score=42.15 Aligned_cols=259 Identities=8% Similarity=-0.033 Sum_probs=124.8 Q ss_pred hhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeecC------CceEEEEEecCCceeeeeccCccc---ccccc Q lcl|NC_016164. 561 VVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGL------QGPVAIPRQTGAATAYWVAEGGDP---TESQP 631 (836) Q Consensus 561 ~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~------~~~~~~p~~~~~~~a~~v~Eg~~~---~~~~~ 631 (836) +. ..-..+.|+.+...+++.+++..++.++..+-..+. +..+++|+-..... .- .-+... ...++ T Consensus 1 MA----Nsl~~l~p~iia~~al~~l~~~lV~~~lV~r~y~~ef~~ak~GDTV~I~~P~~~~~-~d-~~~~~~t~~~~~~l 74 (423) T protein:vir:10 1 MA----NNLDANVSQIVLKKFLPGFMSDLVLCKTVDRQLLAGEINSSTGDSVSFKRPHQFKS-ER-TMDGDITGKSKNSL 74 (423) T ss_pred Cc----cccccccHHHHHHHHHHHHHhhcccchhhccCCCccccccccCCEEEEeeCCceee-ec-ccCcccCccccccc Confidence 11 111226799999999999999999998877655433 23555544321111 11 111111 12234 Q ss_pred cceeEEeeeeeeee-eehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccccccccccch Q lcl|NC_016164. 632 SVDQVALVAKTLGA-YTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTENFGATNP 710 (836) Q Consensus 632 ~~~~it~~~~t~~~-~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~aa~~~ 710 (836) ...++.+.+.+.-. -+.++.+=+..+..+++. +.+.-.++++..+|..+....... -+ +..+. .+.... T Consensus 75 ~e~~v~l~id~~k~~a~~v~d~E~~l~i~~~~~-~l~~A~~aLA~~vd~~ia~~~~~~-~~----~~vgt----~~t~~~ 144 (423) T protein:vir:10 75 ISAKATGEVGNYITVAVEYRQIEEALKLNQLDQ-ILVPINERMVTDLETELALFMMKH-GA----LSLGS----PNTPIK 144 (423) T ss_pred ccceEEEEecceeeeeeeeChHHHhcChhHHHH-HHHHHHHHHHHHHHHHHHHHhhhc-cc----ccccc----cccccc Confidence 45556666654432 345654433345556655 455557889999998875322110 00 11111 111112 Q ss_pred hHHHHHHHHHHHhhhccccCccEEEecHHHHHHHHH----Hhhcc--CccccccCC-CCeecceeeEeeCccccc---eE Q lcl|NC_016164. 711 TYVELVSMESKVAADNADIGAMSYLTNSTLYGGFKT----TEKAT--STAQFVLEP-GGTVNGYNVVRSNQVANG---DV 780 (836) Q Consensus 711 t~~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~----lkd~~--g~~~~~~~~-~~~l~G~pVv~s~~~~~~---~i 780 (836) .|+++.++...|...+...+.-..+++|..+..|.. +...+ +...+..+. .+++.|+.++.|+.+|.. +. T Consensus 145 a~~~~a~a~~~L~~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~alr~~~i~G~~~GFdi~~Sn~vp~~T~g~~ 224 (423) T protein:vir:10 145 KWSDVAQTASFLKDLGINSGENYAVMDPWAAQRLADAQSGLHVSEQLVRTAWENAQISGNFGGIRALMSNGLASRTQGAF 224 (423) T ss_pred cHHHHHHHHHHHhhccCCcCCCEEEeCHHHHHHHhhhhhhhccccccchHHHHhcccceeecceEEEEecCCcccccccc Confidence 478999999999888888777889999999887742 22211 222344443 479999999999999842 11 Q ss_pred -EEEehhceEEEeecc--------e---EEEEeccccc--ccCcEEE---EEEEEeccEE------EcccceEEEeec-- Q lcl|NC_016164. 781 -FFGVWNQMIMGMWGA--------L---DIQVNPYALD--KSGSVRV---TALQDVDVAV------RHPEAFCRGNDN-- 835 (836) Q Consensus 781 -~~gD~s~~~i~~~~~--------l---~i~~~~~~~~--~~~~~~~---r~~~r~d~~v------~~p~Af~~l~~A-- 835 (836) ..+-.+......... . ....+. ..+ ..+.+.| .+..++...+ -++.-|++.-++ T Consensus 225 ~ga~~~~~~~~vt~a~~~~~~~~~~~~~~~T~s~-~g~l~~GD~~t~aGv~~v~~~tk~~l~~~~~~~~~~~~V~~~~~~ 303 (423) T protein:vir:10 225 GGKLTVKGTPEVNYDSVKDSYAFTATLTGATASK-KGFLKVGDQLQFDDTHWLNQQSKQTLYNGASALSFTATVMEDANA 303 (423) T ss_pred cceeeeeeeeEEEecccccccccccceeecccee-ceeEEecceEeecceeeecccccceeecccCCcceEEEEEecccc Confidence 110111100000000 0 000000 000 0011111 0111111111 012223332222 Q ss_pred -------------C Q lcl|NC_016164. 836 -------------L 836 (836) Q Consensus 836 -------------~ 836 (836) + T Consensus 304 ~a~~~~tv~i~p~~ 317 (423) T protein:vir:10 304 HSSGDVTVKISGVP 317 (423) T ss_pred cccCceEEEecccc Confidence 1 No 192 >protein:vir:106734 Length: 336 # NCBI annotation: gp13 # Family: family:all:1653 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944321;genbank:gi:38638620;genbank:GeneID:2657363 Probab=96.97 E-value=8.6e-05 Score=42.87 Aligned_cols=297 Identities=11% Similarity=0.031 Sum_probs=142.8 Q ss_pred HHHhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhh------hhhhhhhhhhhcccccccccccchhhHHH Q lcl|NC_016164. 507 FSFVRAIRAQMMPGDRAAFEAAAFEREVSEATAQRMGVTPRGILA------PNDVLHRDLVVDTASAAGDLVFTDGRPGS 580 (836) Q Consensus 507 ~~~~~a~~a~~~~~~~~~~~~~~~~~~~a~~~~~~~g~~~~g~~~------~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ 580 (836) +.....+ ..+++.|....+... ....+..+.......+.+..-+|..+..- T Consensus 1 ~~~~~~~-----------------------~~l~~~gi~~~~~~~~~~~~~~~~a~da~d~~~~~~t~~~~g~~~~l~~~ 57 (336) T protein:vir:10 1 MRDAQRI-----------------------QNLARAGVILPRSVKNVSTPLAEYAMDAADLSPHLSSTGSSGIPNYLTTY 57 (336) T ss_pred CchHHHH-----------------------HHHhccCeecchhhhhhhHHHHHHHHhhhhhccccccCCCcchHHHHHhh Confidence 0000011 111122222211100 00111111111111222222244444432 Q ss_pred ----HHHHHHhhhhhhhhcceeeecC--CceEEEEEecCCceeeeeccCcccccccccceeEEeeeeeeeeeehhHHHHH Q lcl|NC_016164. 581 ----FIELLRNRLALNTLGVTMLTGL--QGPVAIPRQTGAATAYWVAEGGDPTESQPSVDQVALVAKTLGAYTEFSRRLM 654 (836) Q Consensus 581 ----ii~~l~~~~~l~~l~~~~~~~~--~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~it~~~~t~~~~i~ISrelL 654 (836) +++.+.+......+......++ ...+.++.......+.+.+.+...|..+...+...-+++.++..+.++.+=+ T Consensus 58 i~p~~~~~~~~~~~~~~l~~v~t~g~w~~~~~~~~~~e~~G~a~~ygd~~d~P~~d~~~~~~~~~v~~~~~g~~yg~~El 137 (336) T protein:vir:10 58 VDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTKVATYGDYSSDGDSGTNINYPQRQSYFFQTWTRWGEREL 137 (336) T ss_pred cCcceeeeeechhchhhhcccccCCCcceeeEEEEeeeeeeeEEEccccCCCcceeeeeeeeeeeEEEEEEEEeeCHHHH Confidence 2233333333333322211111 1345556666667777888888899999888888999999999998886544 Q ss_pred hcc---hhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccccccc-cc-----c--chhHHHHHHHHHHHh Q lcl|NC_016164. 655 LQS---SIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTENF-GA-----T--NPTYVELVSMESKVA 723 (836) Q Consensus 655 ~ds---~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~-aa-----~--~~t~~~l~~a~~~l~ 723 (836) ... ..++.+.-.....+++.+++|...++|+.. ....|++|....+.... ++ + .-.++||.+++..+. T Consensus 138 ~~A~~~g~~l~~~Ka~aA~~ale~~~N~~~~~Gd~~-~~~~GllN~P~l~a~~t~~~~~w~~~T~~eI~~Di~~~~~~l~ 216 (336) T protein:vir:10 138 EMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAG-LENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQ 216 (336) T ss_pred HHHHHhCCCcHHHHHHHHHHHHHHhhCeEEEEeecc-cceEEEeecCCCCcccccCcCcccccCHHHHHHHHHHHHHHHH Confidence 432 345666666777777777777777788753 34678998766643221 11 1 224678999999887 Q ss_pred hhcccc----CccEEEecHHHHHHHHHHhhccCccc--cccCCCCeecceeeEeeCccccceEEEEehhceEEEeecc-- Q lcl|NC_016164. 724 ADNADI----GAMSYLTNSTLYGGFKTTEKATSTAQ--FVLEPGGTVNGYNVVRSNQVANGDVFFGVWNQMIMGMWGA-- 795 (836) Q Consensus 724 ~~~~~~----~~~~~vmnp~~~~~L~~lkd~~g~~~--~~~~~~~~l~G~pVv~s~~~~~~~i~~gD~s~~~i~~~~~-- 795 (836) .+.... .+..++|.|..+..|... ...|... |+-.. .-++.++..+.+.... |+...++..+..+ T Consensus 217 ~qt~g~i~~~~~~tL~Lp~~~~~~L~~~-n~~g~tv~~~lk~n---~Pnl~i~t~pel~~Ag---g~~~~~~~~~~~~~~ 289 (336) T protein:vir:10 217 TQSQGIITQEAVLHMGLPPTAMSDLSKT-NQYGLSAAAKLKEI---FPKLEFVTIPEYDTAS---GRLVQLWAPRVEGKD 289 (336) T ss_pred HhcCCeeeeccceEEEechHHHHhccCC-CccCccHHHHHHHh---CCccEEEEcccccccC---CceEEEEEecccCCc Confidence 765321 245789999998888543 2222111 11111 1122343333332110 1111111111110 Q ss_pred -eEEEEeccc------ccccCcEEEEEEEEecc-EEEcccceEEEeec Q lcl|NC_016164. 796 -LDIQVNPYA------LDKSGSVRVTALQDVDV-AVRHPEAFCRGNDN 835 (836) Q Consensus 796 -l~i~~~~~~------~~~~~~~~~r~~~r~d~-~v~~p~Af~~l~~A 835 (836) +++. .|.. ....-.+......|.++ -+++|.||+.++-= T Consensus 290 t~~~~-~P~~f~~lpvq~~~~~~~v~~~~rt~Gv~i~rP~ai~~~~GI 336 (336) T protein:vir:10 290 TATCG-FTEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMLGV 336 (336) T ss_pred ceeee-cChhhhccceeecCceeEeccccceeeeeeeccchheeeccC Confidence 1111 0100 11122344455556655 66779999887744 No 193 >protein:vir:1781 Length: 221 # NCBI annotation: minor capsid protein # Family: family:all:975 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570347;genbank:gi:18640506;genbank:GeneID:932719 Probab=96.65 E-value=0.00042 Score=39.10 Aligned_cols=177 Identities=10% Similarity=-0.004 Sum_probs=88.7 Q ss_pred eeeeeeeehhHHHHHhc-----chhHHHHHHHHHHHHHHHHHHHHHHHh----hcCCccccc----cccccccccccccc Q lcl|NC_016164. 640 AKTLGAYTEFSRRLMLQ-----SSIDVEQMVRTELATVIALEIDRAALY----GLGSNSQPE----GLKFVTGINTENFG 706 (836) Q Consensus 640 ~~t~~~~i~ISrelL~d-----s~~~l~~~i~~~l~~a~a~~~d~~il~----G~Gt~~~p~----Gi~~~~~~~~~t~a 706 (836) +. -.-+|+-++.| +..++.+...+++++++++..|+.++. +..+. .|. |.....-....+. T Consensus 1 iD----~lL~a~~~VdDiD~aqa~~dvr~e~t~e~G~ALA~~~D~~i~~~~~~aA~~~-~p~~~~~~g~~~~~~a~~t~- 74 (221) T protein:vir:17 1 MD----DLLVASQFVYDLDEILAQWNTRSEISKQIGEALAIHYDERIARVLASASIAA-APVTGQDGGFSVNIGAGNTN- 74 (221) T ss_pred CC----cchhHHHHHHhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhc-CcccccccCcceeccccccC- Confidence 11 11233333332 456788999999999999999987753 22111 111 1111100011111 Q ss_pred ccchhHHHHHHHHHHHhhhccccCccEEEecHHHHHHHHHHhhc-------cCcc-ccccC-CCCeecceeeEeeCcccc Q lcl|NC_016164. 707 ATNPTYVELVSMESKVAADNADIGAMSYLTNSTLYGGFKTTEKA-------TSTA-QFVLE-PGGTVNGYNVVRSNQVAN 777 (836) Q Consensus 707 a~~~t~~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~-------~g~~-~~~~~-~~~~l~G~pVv~s~~~~~ 777 (836) .....++.|.++...|..++.....-.++++|..+..|..-.+. .+.. .+..+ .-+.+.|++|+.|+.+|. T Consensus 75 ~~~~l~dai~~a~~~LdekdVP~~gR~~vv~P~~y~~LL~~~d~~~~n~d~~~s~g~~~~g~~i~~v~G~~V~~SnnlP~ 154 (221) T protein:vir:17 75 NAQAIVDGFFEAAAVLDERSAPMDGRVAVLSPRQYYSLISSVDTNILNREIGNTQGDMNTGKGLYVNAGIRIYKSNVLAS 154 (221) T ss_pred CHHHHHHHHHHHHHHHhhcCCCCCCCEEEeCcHHHHHHHHhcCcceeeeecccccccccccceeeeecCcEEEEeccCCc Confidence 11223678888999998888877777888899877666432111 1111 12222 134689999999999996 Q ss_pred c--eEEEEehhceEEEeecceEEEEecccccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 778 G--DVFFGVWNQMIMGMWGALDIQVNPYALDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 778 ~--~i~~gD~s~~~i~~~~~l~i~~~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) . +-+..+...+.+ .......+.-+.. ..-+.+.+|+|+..+|-=. T Consensus 155 ~~gt~~~~~ag~~~~--------~~~~~~~yr~~fs------~~~glv~~~~Avgtvkl~~ 201 (221) T protein:vir:17 155 LYGTNLVTDPGDATT--------SGENNGSYRPAIT------DRAGLVFHKEAADTVEVLL 201 (221) T ss_pred ccccccccCCccccc--------ccccccccccccc------ceEEEEEcchheeeeeeec Confidence 3 212222111110 0000000000000 1114466677766665333 No 194 >protein:vir:95875 Length: 401 # NCBI annotation: major coat protein # Family: family:all:10944 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950534;genbank:gi:119952248;genbank:GeneID:5075702 Probab=96.42 E-value=0.00064 Score=38.10 Aligned_cols=286 Identities=13% Similarity=0.085 Sum_probs=134.6 Q ss_pred hhhhhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeec--CCceEEEEEecCCcee-eeecc Q lcl|NC_016164. 546 PRGILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTG--LQGPVAIPRQTGAATA-YWVAE 622 (836) Q Consensus 546 ~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~--~~~~~~~p~~~~~~~a-~~v~E 622 (836) +.....+... ..++...+.+.-.-.-.+....+.-..+...+.++......+ .+..+.+.+...-+.+ ....| T Consensus 1 ~~~~~a~~~~----~~~s~~g~~~~~~~t~y~~~k~L~~Aa~~lv~~~fA~~~piPkn~GkTIk~r~y~pl~~~~~pl~e 76 (401) T protein:vir:95 1 MLNYNAPTDG----QKSSIDGANSDQMQTFFWLKKAIITARKEQYFMPLASVTNMPKHYGKTIKVYEYVPLLDDRNINDQ 76 (401) T ss_pred CCccCCCccc----ccccccccccceeeehhhHHHHHhhhhhhhhhhhcccccccccccCCeEEEEecccccccccchhc Confidence 0000000000 001111111111111112222222333345566664432211 2234444444332222 12233 Q ss_pred Cccc----------------------------------ccccccceeEEeeeeeeeeeehhHHHHH-hcchhHHHHHHHH Q lcl|NC_016164. 623 GGDP----------------------------------TESQPSVDQVALVAKTLGAYTEFSRRLM-LQSSIDVEQMVRT 667 (836) Q Consensus 623 g~~~----------------------------------~~~~~~~~~it~~~~t~~~~i~ISrelL-~ds~~~l~~~i~~ 667 (836) |-.. ....++-..+..++++||.++.+|..++ .+.+..+...|.. T Consensus 77 Gv~a~G~~~~~g~~y~~~rdv~~it~~m~~~t~~~~rvn~v~~~~~d~~g~l~qyG~~~e~Td~~~dt~~D~~l~~h~s~ 156 (401) T protein:vir:95 77 GIDASGATIVNGNLYGSSKDIGNITSKLPLLTENGGRVNRVGFTRIAREGSIHKFGFFYEFTQESIDFDSDDGLMEHLSR 156 (401) T ss_pred CCCcccccccCccccccccccceeecccccccccccccccccceeeeeeeeeeeccCccchhhhhhhhhcchHHHHHHHH Confidence 3211 1112223345668999999999999764 3466777776644 Q ss_pred -HHHHHHHHHHH---HHHHhhcCCccccccccc-ccccccccccccchhHHHHHHHHHHHhhhccc-------------- Q lcl|NC_016164. 668 -ELATVIALEID---RAALYGLGSNSQPEGLKF-VTGINTENFGATNPTYVELVSMESKVAADNAD-------------- 728 (836) Q Consensus 668 -~l~~a~a~~~d---~~il~G~Gt~~~p~Gi~~-~~~~~~~t~aa~~~t~~~l~~a~~~l~~~~~~-------------- 728 (836) .|.-+..+.++ +.+|+.-++---+ |-.. .+.....+...+.++++++..+...|..+.+. T Consensus 157 ell~g~~~~t~d~i~~dll~ag~~viyA-g~ats~At~~~~~~~~t~vt~~~l~rl~~~L~~nRapk~t~~i~~s~~~dT 235 (401) T protein:vir:95 157 ELMNGATQITEAVLQKDLLAAAGTVLYA-GAATSDATITGEGSTPSVVSYKNLMRLDQILTENRTPTQTTIITGSRMIDT 235 (401) T ss_pred HHhhhhhhhHHHHHHHHHHhhcCeeecC-CccceeeeccccccccceechhHHHHHHHHHHhcccccchhhhhhhhccCc Confidence 44444444444 3455433210001 1111 11112223334557888988888877642211 Q ss_pred --cCcc-EEEecHHHHHHHHHHhhccCcccccc------------CCCCeecceeeEeeCccc--------cc------- Q lcl|NC_016164. 729 --IGAM-SYLTNSTLYGGFKTTEKATSTAQFVL------------EPGGTVNGYNVVRSNQVA--------NG------- 778 (836) Q Consensus 729 --~~~~-~~vmnp~~~~~L~~lkd~~g~~~~~~------------~~~~~l~G~pVv~s~~~~--------~~------- 778 (836) .+.+ +.++|+..-..|+.++|-.|.+.|+. +.-|.+-++++++++.+. ++ T Consensus 236 k~i~~s~va~~h~~L~~di~a~~D~~~~~~fi~v~kYa~~~~i~~gEiG~i~~vR~i~~p~~~~w~~ag~~a~~~~~~y~ 315 (401) T protein:vir:95 236 KVIGATRVMYVGSELVPELKAMKDLFGNKAFIETQHYADAGTIMNGEVGSIDKFRIIQVPEMLHWAGAGAQATGANPGYR 315 (401) T ss_pred cccccceEEEEecCchhHHHHHHHhcCCCCceehhhcCCccccccccccccCceeEEecccceeecCCcccccccccccc Confidence 1122 35679999999999988777766643 233567788888877632 10 Q ss_pred --------------eEEEEehhceEEEeecce-----EEEEecc------cccccCcEEEEE-EEEeccEEEcccceEEE Q lcl|NC_016164. 779 --------------DVFFGVWNQMIMGMWGAL-----DIQVNPY------ALDKSGSVRVTA-LQDVDVAVRHPEAFCRG 832 (836) Q Consensus 779 --------------~i~~gD~s~~~i~~~~~l-----~i~~~~~------~~~~~~~~~~r~-~~r~d~~v~~p~Af~~l 832 (836) .+++|+-+...+...++- .+.+..- .+..-||..+.. .+..++.+.+++-++.+ T Consensus 316 ~~~~~~gg~~dVyp~lV~G~dAf~~~~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQ~g~vgwK~~~a~~vL~~e~m~~i 395 (401) T protein:vir:95 316 TSMVSGQEHYDVYPMLVVGDDSFTSIGFQTDGKSLKFTVMTKMPGKETADRNDPYGETGFSSIKWYYGILVKRPERLALI 395 (401) T ss_pred cccccCCCcceeeeeeEEccccceecccccCCccccceeEeecCCcCCCCCCCcccceehhhhhhhhhhheeccceeEEE Confidence 134555443333333221 2222211 112223433332 23667789999999999 Q ss_pred eecC Q lcl|NC_016164. 833 NDNL 836 (836) Q Consensus 833 ~~A~ 836 (836) +.+- T Consensus 396 es~a 399 (401) T protein:vir:95 396 KTVA 399 (401) T ss_pred Eeec Confidence 8877 No 195 >protein:vir:96079 Length: 382 # NCBI annotation: hypothetical protein ORF023 # Family: family:all:1653 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294440;genbank:gi:149408337;genbank:GeneID:5237198 Probab=95.68 E-value=0.00083 Score=37.48 Aligned_cols=324 Identities=9% Similarity=-0.002 Sum_probs=137.5 Q ss_pred hhHHHHhhhhhhhhhhhHHHHHHHhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhhhh--hhhh------ Q lcl|NC_016164. 486 QPIAAGGGSADIGLTDKEARSFSFVRAIRAQMMPGDRAAFEAAAFEREVSEATAQRMGVTPRGILAPN--DVLH------ 557 (836) Q Consensus 486 ~~~~~~~~~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~~~~~a~~~~~~~g~~~~g~~~~~--~~~~------ 557 (836) .+.....+.... +...............-..+++.|....+..... ..+. T Consensus 1 ~~~~~~~~~~~~----------------------~~~~~~~~~~~~~~~~~~~l~~~gi~~~~~~~~~~~~~~~~~~~~~ 58 (382) T protein:vir:96 1 MSHISKTHSRLA----------------------GRHAKPFDLKNVTHEAVAALGRIGLVFDHAVVQDQIKALAKAGAFR 58 (382) T ss_pred CCCcceeeeecC----------------------CccccchhhhcccHHHHHHHhccccccCcccchhHhhhhhhhhhhh Confidence 000000000000 0000000000001111122233333332211000 0000 Q ss_pred --hhhhhc--ccccccccccchhhH----HHHHHHHHhhhhhhhhcceeeecC--CceEEEEEecCCceeeeeccCcccc Q lcl|NC_016164. 558 --RDLVVD--TASAAGDLVFTDGRP----GSFIELLRNRLALNTLGVTMLTGL--QGPVAIPRQTGAATAYWVAEGGDPT 627 (836) Q Consensus 558 --~a~~~~--~~~~~g~~vvp~~~~----~~ii~~l~~~~~l~~l~~~~~~~~--~~~~~~p~~~~~~~a~~v~Eg~~~~ 627 (836) .+.... +..+.++.=+|..+. ..+++.+.+......+......+. ...+.++.......+.+++-+...| T Consensus 59 ~~~amDa~~~~~~t~~~~g~p~~~l~~~~p~~~~~~~~p~~~~~l~pv~t~g~W~~~t~ty~~~e~~G~A~~ygd~~D~P 138 (382) T protein:vir:96 59 SGSAMDSNFTAPVTTPSIPTPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEPAGTAVEYGDHTNIP 138 (382) T ss_pred hhcccccccCCccccCCccHHHHHHhhhhhhhhhhhhhhhhhhhhccccccCCccceEEEEeeeecccceEEeecccCCC Confidence 111100 011111111243333 334455555555555543322222 1466777777778888888888888 Q ss_pred cccccceeEEeeeeeeeeeehhH-HHHHhcc--hhHHHHHHHHHHHHHHHHHHHHHHHhhc--CCccccccccccccccc Q lcl|NC_016164. 628 ESQPSVDQVALVAKTLGAYTEFS-RRLMLQS--SIDVEQMVRTELATVIALEIDRAALYGL--GSNSQPEGLKFVTGINT 702 (836) Q Consensus 628 ~~~~~~~~it~~~~t~~~~i~IS-relL~ds--~~~l~~~i~~~l~~a~a~~~d~~il~G~--Gt~~~p~Gi~~~~~~~~ 702 (836) ..+...+...-.+..++..+.++ .|+..-. ..++.+.-.....+++.+++|+..|+|+ |......||+|....+. T Consensus 139 l~d~~~~~~~r~v~~~~~g~~yg~lE~~rAa~~~~~l~~~Ka~aA~~ale~~~N~i~f~G~~~g~~~~~yGllNdP~l~a 218 (382) T protein:vir:96 139 LTSWNANFERRTIVRGELGLLVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPP 218 (382) T ss_pred ccccccceeEEEEEEEEEeeeecHHHHHHHHhhCCCcHHHHHHHHHHHHHHhhceEEEEeeecCcCcceEEEEeCCCccc Confidence 88777667677777777777774 4444321 3455566667777777888888889995 23334469998776542 Q ss_pred ccc------ccc--chhHHHHHHHHHHHhhhccc-----cCccEEEecHHHHHHHHHHhhccCccc--cccCCCCeecce Q lcl|NC_016164. 703 ENF------GAT--NPTYVELVSMESKVAADNAD-----IGAMSYLTNSTLYGGFKTTEKATSTAQ--FVLEPGGTVNGY 767 (836) Q Consensus 703 ~t~------aa~--~~t~~~l~~a~~~l~~~~~~-----~~~~~~vmnp~~~~~L~~lkd~~g~~~--~~~~~~~~l~G~ 767 (836) ... +.+ .-.++||..++..+..+... ..+..+++.|..+..|... ...|-.. |+-.. .-++ T Consensus 219 ~~t~a~~~Wa~kT~~eI~~Di~~l~~~i~~qt~G~~~~~~~~~~L~LP~~~~~~Ls~~-n~~g~Tvl~~lk~n---~Pnl 294 (382) T protein:vir:96 219 FQTPPSQGWATADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITMALATSKVDYLSVT-TPYGISVSDWIEQT---YPKM 294 (382) T ss_pred ccccCCCCcccccHHHHHHHHHHHHHHHHhccCCeeeecccceEEeechHHHhhcccc-CccCccHHHHHHHh---cCCc Confidence 211 111 12367888999988776531 1234688899888777432 1122111 11111 1122 Q ss_pred eeEeeCcccc--------ce--EEEEe-hhce-EEEeecceEEE-Eec-c-----cccccCcEEEEEEE-EeccEEEccc Q lcl|NC_016164. 768 NVVRSNQVAN--------GD--VFFGV-WNQM-IMGMWGALDIQ-VNP-Y-----ALDKSGSVRVTALQ-DVDVAVRHPE 827 (836) Q Consensus 768 pVv~s~~~~~--------~~--i~~gD-~s~~-~i~~~~~l~i~-~~~-~-----~~~~~~~~~~r~~~-r~d~~v~~p~ 827 (836) .++..+.+.. .. +++.+ +... ....-...... .-+ + .....-.+...... ..|+.+++|. T Consensus 295 ~i~t~peL~~a~~~g~g~~~~~~~~~~e~~~~~~~s~~~p~~f~q~~p~~~~~l~ve~~~~~~~~~~s~~t~Gv~i~~P~ 374 (382) T protein:vir:96 295 RIVSAPELSGVQMQGKTPEDALVLFVEEVDASVDGSTDGGSVFSQLVQSKFITLGVEKRAKSYVEDFSNGTAGALCKRPW 374 (382) T ss_pred EEEEccccccccCCCccceeEEEEecchhhhhcccccccCcceeccccceeeeccceeecceeEeccccceeeeEEEcch Confidence 2333222210 01 11111 0000 00000000000 000 0 00000011111112 2566788899 Q ss_pred ceEEEeec Q lcl|NC_016164. 828 AFCRGNDN 835 (836) Q Consensus 828 Af~~l~~A 835 (836) ||+.++-= T Consensus 375 ai~~~~GI 382 (382) T protein:vir:96 375 AVVRYLGI 382 (382) T ss_pred hhhhccCC Confidence 99887644 No 196 >protein:vir:107732 Length: 379 # NCBI annotation: gp23 # Family: family:all:1653 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024871;genbank:gi:48697513;genbank:GeneID:2948349 Probab=95.60 E-value=0.0018 Score=35.67 Aligned_cols=325 Identities=10% Similarity=-0.007 Sum_probs=145.0 Q ss_pred hhHHHHhhhhhhhhhhhHHHHHHHhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhhhhh-hhhhhhhhc- Q lcl|NC_016164. 486 QPIAAGGGSADIGLTDKEARSFSFVRAIRAQMMPGDRAAFEAAAFEREVSEATAQRMGVTPRGILAPND-VLHRDLVVD- 563 (836) Q Consensus 486 ~~~~~~~~~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~~~~~a~~~~~~~g~~~~g~~~~~~-~~~~a~~~~- 563 (836) .+.....+...... . .++.. ..+.......-..+++.|....+...... ....+.... T Consensus 1 ~~~~~~~~~~~~~~-----------~-~~~~~--------~~~~~~~~~~~~~l~~~gi~~~~~~~~~~~~~~~amd~~~ 60 (379) T protein:vir:10 1 MPQISKIHSSLNAR-----------Q-MTQMV--------MDSADVTLDNLKHLESYGIHLNGRKNKLFELMQFAMDSND 60 (379) T ss_pred CCCcceeeeecCcc-----------c-cchhh--------hccccccHHHHHHHHhcCccccchhhhhhhhhhhhhcccc Confidence 00000000000000 0 00000 00000000001112233333332111000 000111100 Q ss_pred ---------ccccccccccchhhH---HHHHHHHHhhhhhhhhcceeeecCC--ceEEEEEecCCceeeeeccCcccccc Q lcl|NC_016164. 564 ---------TASAAGDLVFTDGRP---GSFIELLRNRLALNTLGVTMLTGLQ--GPVAIPRQTGAATAYWVAEGGDPTES 629 (836) Q Consensus 564 ---------~~~~~g~~vvp~~~~---~~ii~~l~~~~~l~~l~~~~~~~~~--~~~~~p~~~~~~~a~~v~Eg~~~~~~ 629 (836) ...+.+..-+|+.+. ..+++.+-......++......+.- ..+.++.......+.+++-+...|.. T Consensus 61 ~~~~~~~~~~l~~~~~~g~~~~l~~~~p~~i~~~tap~~a~~l~pv~t~g~W~~~~~~~~v~e~~G~A~~ygd~~d~pl~ 140 (379) T protein:vir:10 61 IGPIPTPLSPLSPVSIPGLIQFLQNWLPGHVRILTAVREADEFLGLSTVGQWDDEQIVQRVLEGLGTAQPYTDGGNMALM 140 (379) T ss_pred ccccccccCccccccccchHHHHHhhcchHHHHHhhhhhhhhhcccccCCCceeeeEEEeeeeeeeeeEEeccccCCCee Confidence 000111111233222 3455555555444444332222221 35566666667778888888888888 Q ss_pred cccceeEEeeeeeeeeeehhHHHHHhc---chhHHHHHHHHHHHHHHHHHHHHHHHhhcCC-ccccccccccccccccc- Q lcl|NC_016164. 630 QPSVDQVALVAKTLGAYTEFSRRLMLQ---SSIDVEQMVRTELATVIALEIDRAALYGLGS-NSQPEGLKFVTGINTEN- 704 (836) Q Consensus 630 ~~~~~~it~~~~t~~~~i~ISrelL~d---s~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt-~~~p~Gi~~~~~~~~~t- 704 (836) +...+...-.++.++..+.++.+=+.. ...++.+.-.....+++.+.+|+..|+|... +....|++|...++... T Consensus 141 d~~~~~~~r~v~~~~~g~~yg~~El~~Aa~~g~~l~~~Ka~aA~~ale~~~N~i~f~G~~d~~~~~yGllNdP~l~a~~t 220 (379) T protein:vir:10 141 SWTPTFETRTVVRFEAGLQVAPLEEARSSRVQVSSADEKRAMVGEALEVQRNRVAFYGYNDGSGRTFGFLNDPNLPAYVA 220 (379) T ss_pred eeeeeeeeeeeEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCCCcceEEEEeCCCCccccc Confidence 877777777778888777776543332 2346777788888888888888889999642 34456999877654221 Q ss_pred ---cccc---------chhHHHHHHHHHHHhhhcccc-----CccEEEecHHHHHHHHHHhhccCccc--cccCCCCeec Q lcl|NC_016164. 705 ---FGAT---------NPTYVELVSMESKVAADNADI-----GAMSYLTNSTLYGGFKTTEKATSTAQ--FVLEPGGTVN 765 (836) Q Consensus 705 ---~aa~---------~~t~~~l~~a~~~l~~~~~~~-----~~~~~vmnp~~~~~L~~lkd~~g~~~--~~~~~~~~l~ 765 (836) .+++ .-.++||..++..+..+.... .+..+++.|..+..|... ...|... |+-.. .- T Consensus 221 ~atg~~~~t~Wa~kT~~eI~~Di~~~~~~l~~qs~g~~~~~~~~~tL~LP~~~~~~L~~~-n~~g~Tvl~~lk~n---~P 296 (379) T protein:vir:10 221 VPNGAGGSPLWAQKTTLEIIADLRNGLTALQVQSMGRIKSNKTPITIGIPNAYENYITTP-TELGYSVAQYMRES---YP 296 (379) T ss_pred ccCCcccccccccCCHHHHHHHHHHHHHHHHHhhCCeecccccceeEEecHHHHHhhccc-cccCccHHHHHHHh---cC Confidence 1111 113578888888877653321 233788999988888643 1112111 11111 11 Q ss_pred ceeeEeeCcccc----ce--EEEEeh-hceEEEeecceEEEEeccc------ccccCcEEEEEEEEecc-EEEcccceEE Q lcl|NC_016164. 766 GYNVVRSNQVAN----GD--VFFGVW-NQMIMGMWGALDIQVNPYA------LDKSGSVRVTALQDVDV-AVRHPEAFCR 831 (836) Q Consensus 766 G~pVv~s~~~~~----~~--i~~gD~-s~~~i~~~~~l~i~~~~~~------~~~~~~~~~r~~~r~d~-~v~~p~Af~~ 831 (836) ++.++..+.+.. ++ .++.+- .........-+. ...+.. ....-.+......|.++ .+++|.||+. T Consensus 297 nl~i~t~pEL~~aggg~~~~~~~~~~~~~~~t~~~~~~~-~~~p~k~~~l~ve~~~~~~~~~~~~rt~Gv~ir~P~Ai~~ 375 (379) T protein:vir:10 297 NVTFVSAPELNDANGGSSAIYYYADAVENNGTDDGRTWL-QVVPTKMFTLGVEKKIKGYAEGYTNATAGAMLKRPFATYR 375 (379) T ss_pred CcEEEEcccccccCCCccEEEEEeeccCCCccCCcceEE-EecchhhhhccceecCceeEeccccceeeeeeecchhhhe Confidence 223333333211 11 122210 100000000000 001110 01112233444455554 6778999999 Q ss_pred Eeec Q lcl|NC_016164. 832 GNDN 835 (836) Q Consensus 832 l~~A 835 (836) ++-| T Consensus 376 ~~G~ 379 (379) T protein:vir:10 376 QTGA 379 (379) T ss_pred ecCC Confidence 9999 No 197 >protein:vir:79008 Length: 299 # NCBI annotation: putative main capsid protein # Family: family:all:701 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110725;genbank:gi:134287342;genbank:GeneID:4955182 Probab=95.29 E-value=0.0024 Score=34.98 Aligned_cols=259 Identities=8% Similarity=-0.009 Sum_probs=117.7 Q ss_pred hhcccccccccccchhhHHHHHHHHHhhhhhhhhcce-----eeecCCceEEEEEecCCceeeeeccCc-ccccccc--c Q lcl|NC_016164. 561 VVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVT-----MLTGLQGPVAIPRQTGAATAYWVAEGG-DPTESQP--S 632 (836) Q Consensus 561 ~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~-----~~~~~~~~~~~p~~~~~~~a~~v~Eg~-~~~~~~~--~ 632 (836) ++ .+-..+.++..+.+.+...+....|+.. +.......+.+|+.+.. ...-..-++ -...+.. + T Consensus 1 MA-------~~n~a~~~~~~Ld~~~~~~l~~~~L~~~~~~~~v~~~gg~tVkI~~i~~~-gl~DY~R~~~g~~~g~~~~~ 72 (299) T protein:vir:79 1 MA-------ALNYAKEYSNVLAQAYPYTLNFGDLYATPNNGRYRWTGSKTIEIPTISTT-GRVDSNRDTIAVAQRNYDNA 72 (299) T ss_pred Cc-------cchhHHHHHHHHHHHHHhhceeeeeccCcccceeeecCCCEEEEeccccc-cccccccCCCcccccccCcc Confidence 11 0112356777777777777666555432 11123457999988653 333332221 2233333 4 Q ss_pred ceeEEeeeeeeeeeehhHHHHHhcc--hhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccccccccccch Q lcl|NC_016164. 633 VDQVALVAKTLGAYTEFSRRLMLQS--SIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTENFGATNP 710 (836) Q Consensus 633 ~~~it~~~~t~~~~i~ISrelL~ds--~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~aa~~~ 710 (836) +...++.-.++-.+. |..--...+ ...+...+.+.....++-.+|...++..-++....| .....+..+..- T Consensus 73 ~~t~~ldqdr~~~f~-vD~~Dvdet~~~~~~a~v~~~~~~~~v~pEiDay~~skl~~~a~~~g-----~~~~~~~~T~~n 146 (299) T protein:vir:79 73 WEPKVLTNQRKWSTL-VHPADINQTNYVASIGNITKVYNEEQKFPEMDAYCISKIYADWTALG-----NTADTTVLTTTN 146 (299) T ss_pred eeEEEeeccccceec-cchhhHHHHhhhhHHHHHHHHHHHHHhhhHhhHHHHHHHHHhhhhcC-----CcccccccCHHH Confidence 444444444443331 110000111 111222233333344444556554443211111000 000111111223 Q ss_pred hHHHHHHHHHHHhhhccccCccEEEecHHHHHHHHHHhhc----cC--ccccccCCCCeecceeeEe--eCccccc---- Q lcl|NC_016164. 711 TYVELVSMESKVAADNADIGAMSYLTNSTLYGGFKTTEKA----TS--TAQFVLEPGGTVNGYNVVR--SNQVANG---- 778 (836) Q Consensus 711 t~~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~----~g--~~~~~~~~~~~l~G~pVv~--s~~~~~~---- 778 (836) .|+.|.+++..|..++....+..++++|..+..|.....- +. ....+.+.-+.|.|++|+. ++.++.. T Consensus 147 ~y~~i~~~~~~lde~~vP~~~rvl~vtp~~~~~L~~~~~f~k~~~~~~~~~~~~g~Vg~idG~~Ii~Vps~r~~t~~~~~ 226 (299) T protein:vir:79 147 VLEVFDKLMEKMTEARVPENGRILYVTPVVNTLIKNAKEIQRTVNIKDAGTSLNRQTTDIDTVKIIKVPSNLMKTAYDFT 226 (299) T ss_pred HHHHHHHHHHHHHhcCCCCCCeEEEeCHHHHHHHhhchhhhcccccccccceeeeeeeeecceEEEEechhhcCccceec Confidence 4788999999999887766678899999999988754321 11 1123444456899999986 3444421 Q ss_pred ------------eEEEEehhceEEEee--cceEEEEecccccccCcEEEEEEEEeccEEEcc--cce-EEEeecC Q lcl|NC_016164. 779 ------------DVFFGVWNQMIMGMW--GALDIQVNPYALDKSGSVRVTALQDVDVAVRHP--EAF-CRGNDNL 836 (836) Q Consensus 779 ------------~i~~gD~s~~~i~~~--~~l~i~~~~~~~~~~~~~~~r~~~r~d~~v~~p--~Af-~~l~~A~ 836 (836) .++++..+.. +... ..+.+ ..|...-..+ .-+.-+.++|.-|.+. .++ +..+.|= T Consensus 227 ~G~~~~~~ak~in~ii~~~~a~-~~~~K~~~~~~-~~P~~~~~~~-~~~~~r~y~d~~v~~nk~~~i~~~~~~a~ 298 (299) T protein:vir:79 227 TGWKVGAGAKQIFMSLVHPSAI-ITPVSYQFSKL-DEPTAVTEGK-YFYFEESFEDVFILNKKADAIQFVVEGAG 298 (299) T ss_pred cCccccCcccccceEEEcCCee-eeeEeeeeEEe-ecCCCCCccc-eeeeeeeeeeeeeeccccCeEEEEeeecC Confidence 1333433222 2111 12222 2233221212 2233455566655553 233 4444444 No 198 >protein:vir:270 Length: 341 # NCBI annotation: putative major capsid protein # Family: family:all:201 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536650;genbank:gi:17975128;genbank:GeneID:929084 Probab=93.96 E-value=0.0058 Score=32.83 Aligned_cols=291 Identities=12% Similarity=0.054 Sum_probs=130.1 Q ss_pred hhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhh Q lcl|NC_016164. 513 IRAQMMPGDRAAFEAAAFEREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALN 592 (836) Q Consensus 513 ~~a~~~~~~~~~~~~~~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~ 592 (836) +... ...........+....++..|. . ..+..+.|-| .....+...+.+.+.+. T Consensus 1 m~~~------m~~~tr~~~~~y~~~~A~~ngv------------------~-~~~~~FsV~P-~v~q~L~~~i~ess~FL 54 (341) T protein:vir:27 1 MSQI------LTQSAREYMDNFAQQLAKSYGV------------------S-NVAELFNVSP-QLETKLRAAITESAEFL 54 (341) T ss_pred Cccc------ccHHHHHHHHHHHHHHHHHcCc------------------c-cccceEeecH-HHHHHHHHHHHhhHHhh Confidence 0000 0000000011111111111110 0 0111222334 35566778888888887 Q ss_pred hhcceeeec-CCceEEEEEecCCceeeeeccCcccccccccceeEEeeeeeeeeeehhHHHHHhcc-----hhHHHHHHH Q lcl|NC_016164. 593 TLGVTMLTG-LQGPVAIPRQTGAATAYWVAEGGDPTESQPSVDQVALVAKTLGAYTEFSRRLMLQS-----SIDVEQMVR 666 (836) Q Consensus 593 ~l~~~~~~~-~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~it~~~~t~~~~i~ISrelL~ds-----~~~l~~~i~ 666 (836) +....+... ..|..-. ...+++-+.-+.-+ ..+. ++..+...|.....---+.|+.+.|... .+++...+. T Consensus 55 ~~Invv~V~e~~Ge~v~-lg~~g~iagrtdt~-R~~r-~~~l~~~~Y~c~qtn~dt~i~y~~lDaWA~~g~~~dF~~r~~ 131 (341) T protein:vir:27 55 KMITVTTVDQIEGQVVD-VGVSGLYTGRKAGG-RFTK-QVGVGGHKYKLAETDSCAAITWAMLCQWANQGGRDQFMKHLT 131 (341) T ss_pred hcCccccccceeeeEee-cccccceeeccCCC-ceec-ccccCCcceEEEEeeeeeeecHHHHHHHHhcCCChHHHHHHH Confidence 765433222 2222111 11122223322221 2221 2345555555555554555666655432 367778888 Q ss_pred HHHHHHHHHHHHHHHHhhcCCc------ccc------ccccc----cccccc----ccccccchhHHHHHHHHHH----- Q lcl|NC_016164. 667 TELATVIALEIDRAALYGLGSN------SQP------EGLKF----VTGINT----ENFGATNPTYVELVSMESK----- 721 (836) Q Consensus 667 ~~l~~a~a~~~d~~il~G~Gt~------~~p------~Gi~~----~~~~~~----~t~aa~~~t~~~l~~a~~~----- 721 (836) +.+.++++.-.-...++|.-.. ..| +|.+. .+.... .......-+|.+|-+|... T Consensus 132 ~~i~~~~ALD~i~IGfnGts~A~~Td~~anPllqDVNkGWlQ~~Re~a~~rVl~~~~~~~g~~gdy~nLDAlV~D~~~~l 211 (341) T protein:vir:27 132 EFSNQMFALDIMRIGWNGVSAEADTDPSANPLGQDVNEGWIAFVKNRKASQVVDVDVYFDETNGDYRTLDAMASDIINNQ 211 (341) T ss_pred HHHHHHHhhhhhhhcccceeeccCCChhhcccccccchhHHHHHHhhcccceeccceeeccCCCccccHHHHHHHHHhcc Confidence 8888887776666667875411 123 23221 001000 1111222345555554443 Q ss_pred HhhhccccCccEEEecHHHHHH-HHHHhhccCccc-ccc--CCCCeecceeeEeeCccccceEEEEehhceEEEeecce- Q lcl|NC_016164. 722 VAADNADIGAMSYLTNSTLYGG-FKTTEKATSTAQ-FVL--EPGGTVNGYNVVRSNQVANGDVFFGVWNQMIMGMWGAL- 796 (836) Q Consensus 722 l~~~~~~~~~~~~vmnp~~~~~-L~~lkd~~g~~~-~~~--~~~~~l~G~pVv~s~~~~~~~i~~gD~s~~~i~~~~~l- 796 (836) |...+.+.+..+.++..+.+.. --.+-.....+. -+. .-..++-|+|.+..+.+|.+.+++--++++.|....|- T Consensus 212 I~~~~~~d~dLVvivG~dLla~k~~~l~n~~~~ptE~~Aa~~i~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~gs~ 291 (341) T protein:vir:27 212 IHPMFRNDPRLTVFVGSGLIGAAQAKLYDKADKPSEQIAAQKLDKTIAGRPAYVPPFLPDNAMVVTIPENLQVLTQHGTA 291 (341) T ss_pred cChHHhcCCCEEEEEchhhhhhhhhhhhccCCCCHHHHHHHHHHHhhCCCeEEEccccCCCceEEeeccceEEEEecCcE Confidence 3445555566788888776542 112222211110 000 01257899999999999999999999998877654442 Q ss_pred E--EEEecc----cccccCcEEEEEEEEeccEE-EcccceEEEeecC Q lcl|NC_016164. 797 D--IQVNPY----ALDKSGSVRVTALQDVDVAV-RHPEAFCRGNDNL 836 (836) Q Consensus 797 ~--i~~~~~----~~~~~~~~~~r~~~r~d~~v-~~p~Af~~l~~A~ 836 (836) . +.-.+. ..+.+ .|.++. +++.. .+-.-+.+-+.|+ T Consensus 292 RR~~~d~p~r~rie~yes---~YvVEd-yg~~~~~~~~~vkl~~~~~ 334 (341) T protein:vir:27 292 QRKAKHESDRKRSKTHTG---AWKVTQ-WVCWKRSPLTTQKKSTSAL 334 (341) T ss_pred EEEEEeccccccccchhh---hheeeh-hhhhhhccccccccCcccc Confidence 2 111221 12222 344433 34322 2222222233444 No 199 >protein:vir:99576 Length: 388 # NCBI annotation: hypothetical protein # Family: family:all:1653 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039801;genbank:gi:126011051;genbank:GeneID:4818271 Probab=93.88 E-value=0.0012 Score=36.50 Aligned_cols=330 Identities=11% Similarity=0.020 Sum_probs=139.0 Q ss_pred hhhhhhHHHHhhhhhhhhhhhHHHHHHHhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhhhhh------h Q lcl|NC_016164. 482 VRSAQPIAAGGGSADIGLTDKEARSFSFVRAIRAQMMPGDRAAFEAAAFEREVSEATAQRMGVTPRGILAPND------V 555 (836) Q Consensus 482 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~~~~~a~~~~~~~g~~~~g~~~~~~------~ 555 (836) +.+ ....+...... ..++ ..-+.....-......-..+++.|....+...... . T Consensus 1 ~~~----~~~~~~~~~~~------------~~~~----~~~~~~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~ 60 (388) T protein:vir:99 1 MKQ----LSKVHQSLAGR------------SVRA----FDMANGKADYRLTDMAVRELKKFGLVFDHATVKRQIELLHEG 60 (388) T ss_pred CCC----ccceeeecCCc------------ccch----hhhhcCCcceeeechhhHhhhhcceeccCccchhhhhhhhhh Confidence 000 00000000000 0000 00000000000000011112333333322111100 0 Q ss_pred h--hhhhhh--cccccccccccchhhHH----HHHHHHHhhhhhhhhcceeeecC--CceEEEEEecCCceeeeeccCcc Q lcl|NC_016164. 556 L--HRDLVV--DTASAAGDLVFTDGRPG----SFIELLRNRLALNTLGVTMLTGL--QGPVAIPRQTGAATAYWVAEGGD 625 (836) Q Consensus 556 ~--~~a~~~--~~~~~~g~~vvp~~~~~----~ii~~l~~~~~l~~l~~~~~~~~--~~~~~~p~~~~~~~a~~v~Eg~~ 625 (836) . ..+... .+..+.++.=+|..+.. .|++.+.+.....++......+. ...+.++.......+.+++-+.. T Consensus 61 ~~~~~a~da~~~~~~t~~~~gip~~~~~~~~p~~~~~~~~p~~~~~l~pv~t~g~W~~~~~~f~v~e~~G~A~~ygd~~D 140 (388) T protein:vir:99 61 GVATQAFDSAYVAPTTQASIPTPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVEPAGTAMEYGDLTN 140 (388) T ss_pred hhhhcccCcccccccccCcccHHHHHhhhhccceeeeeechhhhhhhccccccCCccceeEEEeeeecceeEEEeecccC Confidence 0 000000 01111122212333333 23333334434333332222221 13566777777778888888888 Q ss_pred cccccccceeEEeeeeeeeeeehhHHHHHhc---chhHHHHHHHHHHHHHHHHHHHHHHHhhc-CCc-cccccccccccc Q lcl|NC_016164. 626 PTESQPSVDQVALVAKTLGAYTEFSRRLMLQ---SSIDVEQMVRTELATVIALEIDRAALYGL-GSN-SQPEGLKFVTGI 700 (836) Q Consensus 626 ~~~~~~~~~~it~~~~t~~~~i~ISrelL~d---s~~~l~~~i~~~l~~a~a~~~d~~il~G~-Gt~-~~p~Gi~~~~~~ 700 (836) .|..+...+...-.++.++..+.++.+=+.. ...++.+.-.....+++.+++|+..|+|. |.+ ....|++|.+.. T Consensus 141 ~Pl~d~~~~~~~r~v~~~~~g~~yg~~El~~A~~~g~~l~~~Ka~AA~~ale~~~N~i~f~G~~g~~~~~~yGllNdP~l 220 (388) T protein:vir:99 141 IPLSSWNVNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSL 220 (388) T ss_pred CCceeccceeeeeeEEEEEeeeeecHHHHHHHHhhCCCcHHHHHHHHHHHHHhhhceEEEEeecCCCccceEEEeeCCCc Confidence 8888877777777778888777777543332 23456677777788888888888888885 222 235688887654 Q ss_pred cccccc----------cc--chhHHHHHHHHHHHhhhcccc-----CccEEEecHHHHHHHHHHhhccCccc--cccCCC Q lcl|NC_016164. 701 NTENFG----------AT--NPTYVELVSMESKVAADNADI-----GAMSYLTNSTLYGGFKTTEKATSTAQ--FVLEPG 761 (836) Q Consensus 701 ~~~t~a----------a~--~~t~~~l~~a~~~l~~~~~~~-----~~~~~vmnp~~~~~L~~lkd~~g~~~--~~~~~~ 761 (836) .....+ .+ .-.++||..++..+..+.... .+..+++.|..+..|... ...|... ++-.. T Consensus 221 ~a~v~at~~~~~~~Wa~kT~~eI~~Di~~~~~~i~~qs~g~~~~~~~~~tL~LP~~~~~~Ls~~-n~~g~Tvl~~lk~n- 298 (388) T protein:vir:99 221 LPAIASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVV-TDLGISVRDWLKQT- 298 (388) T ss_pred ccccccccCCcCcccccCCHHHHHHHHHHHHHHHHHhcCCeeeecccceEEEechHHHHhcccc-CcCCccHHHHHHHh- Confidence 322111 11 123678899999987765321 233688899888887433 2222111 11111 Q ss_pred CeecceeeEeeCccc------cceE--EEEeh-hceEEE-eecceE-EEEeccc------ccccCcEEEEEEEEe-ccEE Q lcl|NC_016164. 762 GTVNGYNVVRSNQVA------NGDV--FFGVW-NQMIMG-MWGALD-IQVNPYA------LDKSGSVRVTALQDV-DVAV 823 (836) Q Consensus 762 ~~l~G~pVv~s~~~~------~~~i--~~gD~-s~~~i~-~~~~l~-i~~~~~~------~~~~~~~~~r~~~r~-d~~v 823 (836) .-++.++..+.+. .+.. ++.+- ...... ..+... ....+.. ....-.+......|. |+.+ T Consensus 299 --~Pnl~i~t~pEl~~a~~tgg~~~~~~~~~~~~~~~~~~~~~~~t~~~~~p~~~~~l~vq~~~~~~~~~~~~rt~Gv~i 376 (388) T protein:vir:99 299 --YPRVRVMSAPELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVTLGVEKRVKNYVEAYSNATAGVML 376 (388) T ss_pred --cCCcEEEEecccccccccCCceeEEEEecccccccccCccCcceeEEecccccccccceecCceeEeccccceeeeEE Confidence 1122333332221 1111 11110 000000 000000 0000110 001112233333444 4577 Q ss_pred EcccceEEEeec Q lcl|NC_016164. 824 RHPEAFCRGNDN 835 (836) Q Consensus 824 ~~p~Af~~l~~A 835 (836) ++|.||+.++-= T Consensus 377 r~P~Ai~~~~GI 388 (388) T protein:vir:99 377 KRPWAVVRLIGL 388 (388) T ss_pred eccchhheeccC Confidence 789999887744 No 200 >protein:vir:1153 Length: 338 # NCBI annotation: predicted major capsid protein # Family: family:all:201 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490602;genbank:gi:17313222;genbank:GeneID:927319 Probab=93.68 E-value=0.0067 Score=32.50 Aligned_cols=291 Identities=10% Similarity=0.016 Sum_probs=131.3 Q ss_pred hhhhhhhhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeec- Q lcl|NC_016164. 523 AAFEAAAFEREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTG- 601 (836) Q Consensus 523 ~~~~~~~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~- 601 (836) ...........+....++..| ....+..+.+.+.....+...+.+.+.+.+....+... T Consensus 1 M~~~tr~~~~~y~~~~A~~ng--------------------v~~~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e 60 (338) T protein:vir:11 1 MRNETRKQFDAYLAQLAKLNG--------------------VNSAVQTFAVEPSVQQKLEQRIQESSEFLKQINVYGVDE 60 (338) T ss_pred CCHHHHHHHHHHHHHHHHHhC--------------------CCcccceeeeCHHHHHHHHHHHHHHHHhhccCceecccc Confidence 000000000111111111111 01112222233445566778888888887765543222 Q ss_pred CCceEEEEEecCCceeeeec--cCcc-cccccccceeEEeeeeeeeeeehhHHHHHhcc--hhHHHHHHHHHHHHHHHHH Q lcl|NC_016164. 602 LQGPVAIPRQTGAATAYWVA--EGGD-PTESQPSVDQVALVAKTLGAYTEFSRRLMLQS--SIDVEQMVRTELATVIALE 676 (836) Q Consensus 602 ~~~~~~~p~~~~~~~a~~v~--Eg~~-~~~~~~~~~~it~~~~t~~~~i~ISrelL~ds--~~~l~~~i~~~l~~a~a~~ 676 (836) ..|..-. ...+++-+.-+. .+++ .|..-.+.+.-.|.....---+.|+.+.|... .+++...+.+.+.++++.- T Consensus 61 ~~Ge~v~-lg~~g~iagrtdT~~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~k~~ALD 139 (338) T protein:vir:11 61 LQGEKIG-IGVSGTIASRTDTTGDGVRKPRDVSALDNQRYECKHTDFDTAITYAMLDAWAKFPEFQALLRDAILKRQALD 139 (338) T ss_pred eeeeEee-eccCccccccccCCCCCccccccccccCCCccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHHHHHHHhhc Confidence 2222221 122233333222 1222 22222244555566655555556666666542 3577788888888877765 Q ss_pred HHHHHHhhcCCc------cccc------ccc------------cc-ccccccccc-ccchhHHHHHHHHHH-----Hhhh Q lcl|NC_016164. 677 IDRAALYGLGSN------SQPE------GLK------------FV-TGINTENFG-ATNPTYVELVSMESK-----VAAD 725 (836) Q Consensus 677 ~d~~il~G~Gt~------~~p~------Gi~------------~~-~~~~~~t~a-a~~~t~~~l~~a~~~-----l~~~ 725 (836) .-...++|.-.. ..|. |.+ +. .....+... +..-+|.+|-++... |... T Consensus 140 ~i~IGfnG~s~A~~Td~~~nPllqDVNkGWlQ~~Re~ap~rv~~~~~~~~~i~i~~g~~gdy~nLDalV~d~~~~lI~~~ 219 (338) T protein:vir:11 140 RLMIGFNGTSAAATTNRAANPLLQDVNIGWFQQYRNNAPARVLKEGKTTGKVVVGNGADADYKNLDALVFDVVSSLIDPW 219 (338) T ss_pred hhhhcccceeeccCCChhhCcCccccchhHHHHHHhhhhhhhhhcccccceeeecCCCCCccccHHHHHHHHHhccCChH Confidence 555667775411 1232 321 10 000111111 111235555444433 3445 Q ss_pred ccccCccEEEecHHHHHH-HHHHhhccCccc--c---ccCCCCeecceeeEeeCccccceEEEEehhceEEEeecce-EE Q lcl|NC_016164. 726 NADIGAMSYLTNSTLYGG-FKTTEKATSTAQ--F---VLEPGGTVNGYNVVRSNQVANGDVFFGVWNQMIMGMWGAL-DI 798 (836) Q Consensus 726 ~~~~~~~~~vmnp~~~~~-L~~lkd~~g~~~--~---~~~~~~~l~G~pVv~s~~~~~~~i~~gD~s~~~i~~~~~l-~i 798 (836) +.+.+..+.++....+.. -..+......+. . +.....++-|+|.+..+.+|.+.+++--++++.|....+- .- T Consensus 220 ~~~d~dLVvivG~dLladk~~~l~n~~~~ptE~~Aa~~~~s~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~gs~RR 299 (338) T protein:vir:11 220 HRRDPGLVVILGRELVHDKYFPMVNKDQPATEKIATDLILSQKRMGGLPPVEVPYVPEKGLMVTTLKNLSLYWQIGGRRR 299 (338) T ss_pred HhcCCCEEEEEchhhhHHHHhHHHhcCCChHHHHHHHHHHHhhhhCCceeEEccccCCCceEEeeccccEEEEecCcEEE Confidence 565666788888876542 112222222221 0 0112357899999999999999999999998877655442 21 Q ss_pred EEecccccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 799 QVNPYALDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 799 ~~~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) ...+. -+.+++.-.-..--+..|-++.+++.+.+-- T Consensus 300 ~~~d~--p~r~rie~y~s~Ne~YvVEd~~~~a~ieni~ 335 (338) T protein:vir:11 300 YLKEV--PEKNRIENYESSNDAYVVEDYGLGCLVENIE 335 (338) T ss_pred EEEec--cccccccchhhhccceeeeccccEEEeecce Confidence 11111 1122222222222233344444444443222 No 201 >protein:vir:99311 Length: 463 # NCBI annotation: putative capsid protein # Family: family:all:2450 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024474;genbank:gi:48696433;genbank:GeneID:2948039 Probab=92.57 E-value=0.011 Score=31.35 Aligned_cols=300 Identities=13% Similarity=0.122 Sum_probs=135.7 Q ss_pred hhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHh---hhh Q lcl|NC_016164. 514 RAQMMPGDRAAFEAAAFEREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRN---RLA 590 (836) Q Consensus 514 ~a~~~~~~~~~~~~~~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~---~~~ 590 (836) ....+ ..............+++.+.+.. |. ..+..+..+++.+--+.+...+..+... ..- T Consensus 1 ~~~~~---~~~~~~~~~~~~~~e~~~KS~~t---g~----------g~~p~~q~~~~AlR~EsL~~~i~~Lt~~~~~f~~ 64 (463) T protein:vir:99 1 MTIEK---NLSDVQQKYADQFQEDVVKSFQT---GY----------GITPDTQIDAGALRREILDDQITMLTWTNEDLIF 64 (463) T ss_pred CCccc---ccchHHHHHHhhhhHHHHHHhhc---CC----------ccCCccccCcchhhhhhhhhhhheeeecccchhh Confidence 00000 00000001111122222222110 00 0011112223322222222222111110 111 Q ss_pred hhhhcceeeecCCceEE-EEEecCCceeeeeccCcccccccccceeEEeeeeeeeeeehhHHHH-HhcchhHHHHHHHHH Q lcl|NC_016164. 591 LNTLGVTMLTGLQGPVA-IPRQTGAATAYWVAEGGDPTESQPSVDQVALVAKTLGAYTEFSRRL-MLQSSIDVEQMVRTE 668 (836) Q Consensus 591 l~~l~~~~~~~~~~~~~-~p~~~~~~~a~~v~Eg~~~~~~~~~~~~it~~~~t~~~~i~ISrel-L~ds~~~l~~~i~~~ 668 (836) +..+..+.....-..+. +....+.....+++|++..+.+++++.+.+..++=++....+|.-+ |.++..+......+. T Consensus 65 ~~~i~k~~a~STV~~y~~~~~~G~~g~~~f~~E~g~~~~~d~~~~Rr~~~~K~l~~~~~VS~~~~l~n~~~d~~~~~~~d 144 (463) T protein:vir:99 65 YRDISRRPAQSTVVKYDQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTED 144 (463) T ss_pred hhhcCCchhhhhhhhheeeeccCccccccccccccccccCCCceEEEEEEeeeeehhhhhhhHHHhhcccccHHHHHHHH Confidence 11111111111111111 1223334667889999999999999999999999999888777743 667777888888999 Q ss_pred HHHHHHHHHHHHHHhhcCC---cc-----cccccccc-cccccccccccchhHHHHHHHHHHHhhhccccCccEEEecHH Q lcl|NC_016164. 669 LATVIALEIDRAALYGLGS---NS-----QPEGLKFV-TGINTENFGATNPTYVELVSMESKVAADNADIGAMSYLTNST 739 (836) Q Consensus 669 l~~a~a~~~d~~il~G~Gt---~~-----~p~Gi~~~-~~~~~~t~aa~~~t~~~l~~a~~~l~~~~~~~~~~~~vmnp~ 739 (836) -...++..++.+.|+|+-. .+ +..||.+- +.-+...+-+..++.+.|..+-..+..+++ .+.-++|+.. T Consensus 145 ai~~ia~tiE~a~FyGds~l~~~~~~~gleFDGl~~lId~enviDarG~~Ls~~~ln~Aa~~i~~~fG--t~TD~~lp~~ 222 (463) T protein:vir:99 145 AIAVVAKTIEWASFYGDASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAAVRIGKGFG--TATDAYMPIG 222 (463) T ss_pred HHHHHHHHHHHHHhhhhhccCCCcCccccchhhhhhhcCCCCeeecCCCcccHHHHhhhhhhhhcccC--ChhheecchH Confidence 9999999999999998632 12 34555432 233445555667777777777666655543 3456788888 Q ss_pred HHHHHHHHhhccCcccccc-CCCCeecceee--EeeCc--c--c-----cceEEE--------EehhceEEEeecceEEE Q lcl|NC_016164. 740 LYGGFKTTEKATSTAQFVL-EPGGTVNGYNV--VRSNQ--V--A-----NGDVFF--------GVWNQMIMGMWGALDIQ 799 (836) Q Consensus 740 ~~~~L~~lkd~~g~~~~~~-~~~~~l~G~pV--v~s~~--~--~-----~~~i~~--------gD~s~~~i~~~~~l~i~ 799 (836) +.+.|..-.-. ....+.. +++....|+|| +.+.. + . .+..++ ++|....+ ..++. T Consensus 223 vka~f~~~~l~-~qrv~~~~N~~~~~~G~~v~~f~s~~G~I~L~~s~~m~~~~il~~~~~~~p~ap~~~~~----tatv~ 297 (463) T protein:vir:99 223 VHADFVNSILG-RQMQLMQDNSGNVNTGYSVNGFYSSRGFIKLHGSTVMENELILDESLQPLPNAPQPAKV----TATVE 297 (463) T ss_pred HHHHHHHHhcC-ceEEEEcCCCCceeeeeeccceeeeeeeeeeCCceecCCcccccchhhcCCCCccCcee----EEEEe Confidence 88877643221 1111221 22222334443 11110 0 0 000011 01110000 00111 Q ss_pred Eeccc----ccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 800 VNPYA----LDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 800 ~~~~~----~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) ..... ........|++...-+.+=-.|+.++-.+.|- T Consensus 298 ~~~~~~~~~~~~~a~~~Y~vv~~s~~geS~pS~ivtaT~a~ 338 (463) T protein:vir:99 298 TKQKGAFENEEDRAGLSYKVVVNSDDAQSAPSEEVTATVSN 338 (463) T ss_pred eccCCCCCCcccccceEEEEEEECCCCCcccchheeeeeee Confidence 11110 01223345555554444444455555555442 No 202 >protein:vir:95603 Length: 463 # NCBI annotation: ORF016 # Family: family:all:2450 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240903;genbank:gi:66394965;genbank:GeneID:5132544 Probab=92.57 E-value=0.011 Score=31.35 Aligned_cols=300 Identities=13% Similarity=0.122 Sum_probs=135.7 Q ss_pred hhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHh---hhh Q lcl|NC_016164. 514 RAQMMPGDRAAFEAAAFEREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRN---RLA 590 (836) Q Consensus 514 ~a~~~~~~~~~~~~~~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~---~~~ 590 (836) ....+ ..............+++.+.+.. |. ..+..+..+++.+--+.+...+..+... ..- T Consensus 1 ~~~~~---~~~~~~~~~~~~~~e~~~KS~~t---g~----------g~~p~~q~~~~AlR~EsL~~~i~~Lt~~~~~f~~ 64 (463) T protein:vir:95 1 MTIEK---NLSDVQQKYADQFQEDVVKSFQT---GY----------GITPDTQIDAGALRREILDDQITMLTWTNEDLIF 64 (463) T ss_pred CCccc---ccchHHHHHHhhhhHHHHHHhhc---CC----------ccCCccccCcchhhhhhhhhhhheeeecccchhh Confidence 00000 00000001111122222222110 00 0011112223322222222222111110 111 Q ss_pred hhhhcceeeecCCceEE-EEEecCCceeeeeccCcccccccccceeEEeeeeeeeeeehhHHHH-HhcchhHHHHHHHHH Q lcl|NC_016164. 591 LNTLGVTMLTGLQGPVA-IPRQTGAATAYWVAEGGDPTESQPSVDQVALVAKTLGAYTEFSRRL-MLQSSIDVEQMVRTE 668 (836) Q Consensus 591 l~~l~~~~~~~~~~~~~-~p~~~~~~~a~~v~Eg~~~~~~~~~~~~it~~~~t~~~~i~ISrel-L~ds~~~l~~~i~~~ 668 (836) +..+..+.....-..+. +....+.....+++|++..+.+++++.+.+..++=++....+|.-+ |.++..+......+. T Consensus 65 ~~~i~k~~a~STV~~y~~~~~~G~~g~~~f~~E~g~~~~~d~~~~Rr~~~~K~l~~~~~VS~~~~l~n~~~d~~~~~~~d 144 (463) T protein:vir:95 65 YRDISRRPAQSTVVKYDQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTED 144 (463) T ss_pred hhhcCCchhhhhhhhheeeeccCccccccccccccccccCCCceEEEEEEeeeeehhhhhhhHHHhhcccccHHHHHHHH Confidence 11111111111111111 1223334667889999999999999999999999999888777743 667777888888999 Q ss_pred HHHHHHHHHHHHHHhhcCC---cc-----cccccccc-cccccccccccchhHHHHHHHHHHHhhhccccCccEEEecHH Q lcl|NC_016164. 669 LATVIALEIDRAALYGLGS---NS-----QPEGLKFV-TGINTENFGATNPTYVELVSMESKVAADNADIGAMSYLTNST 739 (836) Q Consensus 669 l~~a~a~~~d~~il~G~Gt---~~-----~p~Gi~~~-~~~~~~t~aa~~~t~~~l~~a~~~l~~~~~~~~~~~~vmnp~ 739 (836) -...++..++.+.|+|+-. .+ +..||.+- +.-+...+-+..++.+.|..+-..+..+++ .+.-++|+.. T Consensus 145 ai~~ia~tiE~a~FyGds~l~~~~~~~gleFDGl~~lId~enviDarG~~Ls~~~ln~Aa~~i~~~fG--t~TD~~lp~~ 222 (463) T protein:vir:95 145 AIAVVAKTIEWASFYGDASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAAVRIGKGFG--TATDAYMPIG 222 (463) T ss_pred HHHHHHHHHHHHHhhhhhccCCCcCccccchhhhhhhcCCCCeeecCCCcccHHHHhhhhhhhhcccC--ChhheecchH Confidence 9999999999999998632 12 34555432 233445555667777777777666655543 3456788888 Q ss_pred HHHHHHHHhhccCcccccc-CCCCeecceee--EeeCc--c--c-----cceEEE--------EehhceEEEeecceEEE Q lcl|NC_016164. 740 LYGGFKTTEKATSTAQFVL-EPGGTVNGYNV--VRSNQ--V--A-----NGDVFF--------GVWNQMIMGMWGALDIQ 799 (836) Q Consensus 740 ~~~~L~~lkd~~g~~~~~~-~~~~~l~G~pV--v~s~~--~--~-----~~~i~~--------gD~s~~~i~~~~~l~i~ 799 (836) +.+.|..-.-. ....+.. +++....|+|| +.+.. + . .+..++ ++|....+ ..++. T Consensus 223 vka~f~~~~l~-~qrv~~~~N~~~~~~G~~v~~f~s~~G~I~L~~s~~m~~~~il~~~~~~~p~ap~~~~~----tatv~ 297 (463) T protein:vir:95 223 VHADFVNSILG-RQMQLMQDNSGNVNTGYSVNGFYSSRGFIKLHGSTVMENELILDESLQPLPNAPQPAKV----TATVE 297 (463) T ss_pred HHHHHHHHhcC-ceEEEEcCCCCceeeeeeccceeeeeeeeeeCCceecCCcccccchhhcCCCCccCcee----EEEEe Confidence 88877643221 1111221 22222334443 11110 0 0 000011 01110000 00111 Q ss_pred Eeccc----ccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 800 VNPYA----LDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 800 ~~~~~----~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) ..... ........|++...-+.+=-.|+.++-.+.|- T Consensus 298 ~~~~~~~~~~~~~a~~~Y~vv~~s~~geS~pS~ivtaT~a~ 338 (463) T protein:vir:95 298 TKQKGAFENEEDRAGLSYKVVVNSDDAQSAPSEEVTATVSN 338 (463) T ss_pred eccCCCCCCcccccceEEEEEEECCCCCcccchheeeeeee Confidence 11110 01223345555554444444455555555442 No 203 >protein:vir:99228 Length: 304 # NCBI annotation: hypothetical protein # Family: family:all:776 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950457;genbank:gi:119953658;genbank:GeneID:4643088 Probab=92.52 E-value=0.0064 Score=32.63 Aligned_cols=210 Identities=11% Similarity=0.060 Sum_probs=98.3 Q ss_pred hhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEecCCcee-eeeccCcccc Q lcl|NC_016164. 549 ILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQTGAATA-YWVAEGGDPT 627 (836) Q Consensus 549 ~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~~~~~a-~~v~Eg~~~~ 627 (836) +.......-. .+-.-+...|...+....+..+..+..++..+..=++.-+...|.. .|++|= . T Consensus 1 M~ii~~~~L~-------------~l~~~~~~~f~~~~~~a~~~~~~iA~~VpSt~~~~~Y~WLg~~P~mreWiG~r---~ 64 (304) T protein:vir:99 1 MAIITPALIS-------------ALKTSFQKHFQDALATAPSTYLQVATVIPSTTASNTYGWLGQFPKLREWIGQR---V 64 (304) T ss_pred CCccCHHHHH-------------HHHHHHHHHHHHHHhhcCcccceeEeEeecCccccccchhcccccchhhhhhh---h Confidence 0000000000 0011133344444444445455556677777666666666666665 678653 4 Q ss_pred cccccceeEEeeeeeeeeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCC---cccccc--cccccccc- Q lcl|NC_016164. 628 ESQPSVDQVALVAKTLGAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGS---NSQPEG--LKFVTGIN- 701 (836) Q Consensus 628 ~~~~~~~~it~~~~t~~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt---~~~p~G--i~~~~~~~- 701 (836) ...+.-...++.-++|-..+.|.|.-++||.+++..-+.++|+++.+..=|..++.-... .....| +|.++|.. T Consensus 65 i~~l~~~~y~I~Nk~fE~Tv~V~R~dIEDD~~Giy~p~~~~~G~~aa~~Pd~lvf~lL~~Gf~t~CyDGq~FFdtDHpv~ 144 (304) T protein:vir:99 65 IKDMAAQGYQITNKLFESTVGVKRTDIEDDNLGVYGPLMQEMGRAAGAHPDELVFALLKAGNANLCYDGQNFFDTDHPVY 144 (304) T ss_pred hhhhhhccceeeccccccccccccccccccccCchHHHHHHHHHHHhcCchhhHHHHHHhhhcccCCCcccccccCCccc Confidence 455566667788889999999999999999999999999999999998777655422110 000000 11111100 Q ss_pred -------------cc-c--------------------------------------------------------------- Q lcl|NC_016164. 702 -------------TE-N--------------------------------------------------------------- 704 (836) Q Consensus 702 -------------~~-t--------------------------------------------------------------- 704 (836) +. . T Consensus 145 ~~~dg~g~~~~vsn~~~~~~~~g~~w~Lld~~r~iKP~I~Q~Rk~~~~~~~~~~~d~~Vf~~~e~~yGvd~R~n~GygfW 224 (304) T protein:vir:99 145 PNVDGTGTATTVSNLFAPAADPGAAWYLLDTSRSLKPLIYQERMKPSFTSMTKEDDEQVFMADEYRYGVRSRCNVGFGFW 224 (304) T ss_pred ccccccCcccccceeccCCCCCCCcEEEEeCCCCccceeeeccccceeeeccCCCchhhhhhcceeEeeeeeeccchhhh Confidence 00 0 Q ss_pred ----ccccchhHHHHHHHHHHHhhhccccC------ccEEEecHHHHHHHHHHhhccCccccccCCCCeecceeeEeeCc Q lcl|NC_016164. 705 ----FGATNPTYVELVSMESKVAADNADIG------AMSYLTNSTLYGGFKTTEKATSTAQFVLEPGGTVNGYNVVRSNQ 774 (836) Q Consensus 705 ----~aa~~~t~~~l~~a~~~l~~~~~~~~------~~~~vmnp~~~~~L~~lkd~~g~~~~~~~~~~~l~G~pVv~s~~ 774 (836) ++.++++.+.+.+++.+|+.+....+ +..+++.|+....-+. ++.++. T Consensus 225 QlA~gS~a~Lt~~nl~aAr~aMr~qk~d~G~pL~I~P~~LvVPp~LE~aA~~----------------------ll~a~~ 282 (304) T protein:vir:99 225 QLAAMSTEELNTANFEKVYDAMRNQKADGGRPLDIRPNLLVVPTTLRSKAKE----------------------VVGVQR 282 (304) T ss_pred hhhhhcCCCcChHHHHHHHHHHHhhcCCCCceeccccCeEEecchHHHHHHH----------------------HHhhhc Confidence 01122333334444444443332211 1222333322222111 111111 Q ss_pred cccceEEEEehhceEEEeecceEEEEecccc Q lcl|NC_016164. 775 VANGDVFFGVWNQMIMGMWGALDIQVNPYAL 805 (836) Q Consensus 775 ~~~~~i~~gD~s~~~i~~~~~l~i~~~~~~~ 805 (836) ++.+ +. -..++-+++.++|+.. T Consensus 283 ~~~G-----~t----Np~~g~~eliV~P~Ld 304 (304) T protein:vir:99 283 LANG-----AD----NPNFELVQVLDTAWLN 304 (304) T ss_pred cCCC-----Cc----ceecceEEEEeecccC Confidence 1111 10 1112334444554443 No 204 >protein:vir:104011 Length: 337 # NCBI annotation: P2 family phage major capsid protein # Family: family:all:201 # MgeID: mge:1665 # MgeName: phi52237 # Cross-refs: genbank:acc:YP_293748;genbank:gi:72537718;genbank:GeneID:3608142 Probab=91.98 E-value=0.013 Score=30.86 Aligned_cols=289 Identities=10% Similarity=0.016 Sum_probs=132.0 Q ss_pred hhhhhhhhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeee-c Q lcl|NC_016164. 523 AAFEAAAFEREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLT-G 601 (836) Q Consensus 523 ~~~~~~~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~-~ 601 (836) ...........+....++..|. . ..+..+.+-| .....+...+.+.+.+.+....+.. - T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv------------------~-~~~~~FsV~P-~v~q~L~~~i~ess~FL~~Invv~V~e 60 (337) T protein:vir:10 1 MRKETRQAYEKYAAQIAKLNDT------------------G-DVSKKFAVEP-TVQQRLETKMQESSEFLKRINVLPVTE 60 (337) T ss_pred CChHHHHHHHHHHHHHHHhcCh------------------h-hhcceeeecH-HHHHHHHHHHHHHHHhhccCceecccc Confidence 0000000111111111111111 0 0111222334 3555677788888888776554322 2 Q ss_pred CCceEEEEEecCCceeeeeccC--cccccccccceeEEeeeeeeeeeehhHHHHHhcc--hhHHHHHHHHHHHHHHHHHH Q lcl|NC_016164. 602 LQGPVAIPRQTGAATAYWVAEG--GDPTESQPSVDQVALVAKTLGAYTEFSRRLMLQS--SIDVEQMVRTELATVIALEI 677 (836) Q Consensus 602 ~~~~~~~p~~~~~~~a~~v~Eg--~~~~~~~~~~~~it~~~~t~~~~i~ISrelL~ds--~~~l~~~i~~~l~~a~a~~~ 677 (836) ..|..-. ...+++-+.-..-+ ...|..-...+.-.|.....---+.|+.+.|... .+++...+.+.+.++++.-. T Consensus 61 ~~Ge~v~-lg~~g~iagrt~t~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~~~~ALD~ 139 (337) T protein:vir:10 61 LEGEKLG-LSVSGPIASRTDTTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDR 139 (337) T ss_pred ceeeEEe-eccCcceeeeecCCCCccccccccccCCCccEEEEeeeeeeccHHHHHHHhcChhHHHHHHHHHHHHHhhch Confidence 2222222 12222333222221 1112223445555666666555566777766542 35777888888887777655 Q ss_pred HHHHHhhcCCc------cccc------cccc----cccc----------ccccccccchhHHHHHHHHHH-----Hhhhc Q lcl|NC_016164. 678 DRAALYGLGSN------SQPE------GLKF----VTGI----------NTENFGATNPTYVELVSMESK-----VAADN 726 (836) Q Consensus 678 d~~il~G~Gt~------~~p~------Gi~~----~~~~----------~~~t~aa~~~t~~~l~~a~~~-----l~~~~ 726 (836) -...++|.-.. ..|. |.+. .+.. ..+..+ ..-+|.+|-++... +...+ T Consensus 140 i~IGfnG~s~A~~Td~~~nPllqDVNkGWlQ~~Re~ap~rV~~~~~~~~~~i~iG-~~gdy~nLDalV~D~~~~lI~~~~ 218 (337) T protein:vir:10 140 IMIGWNGVKAAATTDRQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVG-KAGDYENLDALVMDIVSSMIDPWF 218 (337) T ss_pred hhhcccceeeccCCChhhCcCccccchhHHHHHHhcchhhhhccccccCcceeec-CCCCcccHHHHHHHHHhccCChHH Confidence 55667775411 1232 2221 0000 011111 11145555554443 34455 Q ss_pred cccCccEEEecHHHHHH-HHHHhhccCccc--c---ccCCCCeecceeeEeeCccccceEEEEehhceEEEeecc-eEEE Q lcl|NC_016164. 727 ADIGAMSYLTNSTLYGG-FKTTEKATSTAQ--F---VLEPGGTVNGYNVVRSNQVANGDVFFGVWNQMIMGMWGA-LDIQ 799 (836) Q Consensus 727 ~~~~~~~~vmnp~~~~~-L~~lkd~~g~~~--~---~~~~~~~l~G~pVv~s~~~~~~~i~~gD~s~~~i~~~~~-l~i~ 799 (836) .+.+..+.++....+.. -..+....+.+. . +.....++-|+|.+..+.+|.+.+++--|+++.|....| ..-. T Consensus 219 ~~d~~LVvivG~dLladk~~~l~n~~~~ptE~~Aa~~i~s~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~gs~RR~ 298 (337) T protein:vir:10 219 QEDTGLVVICGRELLHDKYFPIVNATQAPTERLAADLIVSQKRIGNLPAVRVPFFPKRALMVTKLSNLSIYYQEGARRRT 298 (337) T ss_pred hcCCCEEEEEchhhhhHHhhHHhccCCCcHHHHHHHHHHHhhhhCCceeEEccccCCCceEEeechhcEEEEecCcEEEE Confidence 66667788888877652 112222222221 0 011125789999999999999999999999887765544 2211 Q ss_pred EecccccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 800 VNPYALDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 800 ~~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) ..+. -+++.+.-.-..--+..|-++..++.+. .+ T Consensus 299 ~~d~--p~r~rie~y~s~Ne~YvVEd~~~~a~ie-nI 332 (337) T protein:vir:10 299 LKEV--PERDRIENYESSNDAYVVEDFGCGCVAE-NI 332 (337) T ss_pred EEEc--cccccccchhhccceeeeeccccEEEEe-ce Confidence 1111 1122222111122233334444444333 12 No 205 >protein:vir:102823 Length: 470 # NCBI annotation: major structural protein # Family: family:all:2450 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874086;genbank:gi:118197693;genbank:GeneID:4496015 Probab=91.93 E-value=0.014 Score=30.82 Aligned_cols=287 Identities=12% Similarity=0.062 Sum_probs=120.7 Q ss_pred hhhhhhhhHHHHHHHHHHhhh-hhhhhhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeecC Q lcl|NC_016164. 524 AFEAAAFEREVSEATAQRMGV-TPRGILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGL 602 (836) Q Consensus 524 ~~~~~~~~~~~a~~~~~~~g~-~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~ 602 (836) ..+.... ..-.+..+.+.. -..|...+...+.......+-.. -.+ .-+..+..+..... T Consensus 1 ~~~~~~~--~~~~a~~~al~~a~~~g~AlR~EsLd~~l~~lt~~~-~~f-----------------tf~~~i~k~~a~ST 60 (470) T protein:vir:10 1 MPYEHLK--HLDEATLKALNAAGQVAESLEREDLEPEVTQLNVLD-TPL-----------------TDLLSKNAVKAKAY 60 (470) T ss_pred CChhHhh--hhhHHHHHHHHHhhhcchhhhhhhhccceeEeeecC-ccc-----------------hhhhhcCCchhhhH Confidence 0000000 000111111100 00010111111111111111000 000 01111111111111 Q ss_pred CceEEE-EEecCCceeeeeccCcccccccccceeEEeeeeeeeeeehhHHHH---HhcchhHHHHHHHHHHHHHHHHHHH Q lcl|NC_016164. 603 QGPVAI-PRQTGAATAYWVAEGGDPTESQPSVDQVALVAKTLGAYTEFSRRL---MLQSSIDVEQMVRTELATVIALEID 678 (836) Q Consensus 603 ~~~~~~-p~~~~~~~a~~v~Eg~~~~~~~~~~~~it~~~~t~~~~i~ISrel---L~ds~~~l~~~i~~~l~~a~a~~~d 678 (836) -..+.. ....+........|++-.+.+++++.+.+..++=++....+|.-+ +.+...+++..+.+.--..++.+++ T Consensus 61 V~ey~~~~~rhG~~g~s~~~E~~l~~~~d~~~~Rr~v~~K~l~~~~~VT~~a~~~~~n~v~d~~~~~~~dai~~ia~tiE 140 (470) T protein:vir:10 61 EHEYNVVTARHDKIGYAAFREGGLPRTVEVNVVRRRIRPMLVGHRITVTELATRTTQNGVMQIDELVKREKMIAVANEFE 140 (470) T ss_pred hhhhhhhccccccccceeecccccCccCCCceEEEEEEEEEEeecchhhhhhhhhhhccccchHHHHHHHHHHHHHHHHH Confidence 111111 011122222345899999999999999999999999998888764 4455668888888888889999999 Q ss_pred HHHHhhcC-----Cc-----cccccccccc----ccccccccccchhHHHHHHHHHHHhhhccccCccEEEecHHHHHHH Q lcl|NC_016164. 679 RAALYGLG-----SN-----SQPEGLKFVT----GINTENFGATNPTYVELVSMESKVAADNADIGAMSYLTNSTLYGGF 744 (836) Q Consensus 679 ~~il~G~G-----t~-----~~p~Gi~~~~----~~~~~t~aa~~~t~~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L 744 (836) .++|+|+- .. -+..||.+.- ..+...+-+..++.+.|..+-..+......-.+.-++|+..+.+.| T Consensus 141 ~a~FyGDs~l~s~~~g~~~gleFDGl~~lId~~~~~NViDarG~~Ls~~~L~~aa~~I~~~~~fGt~TD~~lp~~vka~f 220 (470) T protein:vir:10 141 YLAFYGDNLLGDDVPGSPNNLQQDGIINIIKRGAPQNVLDAGGRPLSIDLLWEAESRVVSTQAFANPTAVFISYVDKLNL 220 (470) T ss_pred hhhhhhccccccccCcccCceeccchhhhccCCCCccccccCCCCccHHHHHHHHhhhcccccccChhhhccchhHHHHH Confidence 99999953 11 2456775422 2355566667778888888877776422223455678888888877 Q ss_pred HHHhhccCccccccCCCCeecceee--EeeCc--cc-cceEEEEehhceE---EEee------cceEEEEecccccccCc Q lcl|NC_016164. 745 KTTEKATSTAQFVLEPGGTVNGYNV--VRSNQ--VA-NGDVFFGVWNQMI---MGMW------GALDIQVNPYALDKSGS 810 (836) Q Consensus 745 ~~lkd~~g~~~~~~~~~~~l~G~pV--v~s~~--~~-~~~i~~gD~s~~~---i~~~------~~l~i~~~~~~~~~~~~ 810 (836) ..-....-+-....+++.-..|+|| +++-. +. .+..++.++.... +... -.+...++.... . T Consensus 221 ~~~~~~~qRv~~~~N~~~~~~G~~v~~f~sa~G~I~L~~s~~m~~~~k~~p~~l~~~v~~~aAP~~~~tv~~t~~----~ 296 (470) T protein:vir:10 221 QASFYQISRVMTTADRRAGLLGADAQSYIGVRGEHSLYPSQFLGDFHKFNPARFGAEVGDFAAPSNSWTVSTTDN----F 296 (470) T ss_pred HHhhcCceEEEEecCCCceeeeeeccceeeeeeeeeecccccccchhhcCcccCCcccCCcccCceeEEeecCCC----c Confidence 6554432222222222334467665 22211 00 1112222221110 0000 001111111100 0 Q ss_pred EEEEEEEEeccEEEcccceEE--EeecC Q lcl|NC_016164. 811 VRVTALQDVDVAVRHPEAFCR--GNDNL 836 (836) Q Consensus 811 ~~~r~~~r~d~~v~~p~Af~~--l~~A~ 836 (836) +.+ ... -..+...++-+.. .+.+. T Consensus 297 ~a~-~~~-sk~g~~~~~~v~sy~y~v~~ 322 (470) T protein:vir:10 297 VTL-PYN-SGLGDPANTTVYSYAFKAAN 322 (470) T ss_pred eee-ccc-CCCCcccCcceeEEEEEEEE Confidence 000 000 0000001111101 11111 No 206 >protein:vir:79171 Length: 337 # NCBI annotation: gp2, phage major capsid protein, P2 family # Family: family:all:201 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111033;genbank:gi:134288740;genbank:GeneID:4960690 Probab=91.66 E-value=0.015 Score=30.61 Aligned_cols=289 Identities=9% Similarity=0.004 Sum_probs=131.7 Q ss_pred hhhhhhhhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeee-c Q lcl|NC_016164. 523 AAFEAAAFEREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLT-G 601 (836) Q Consensus 523 ~~~~~~~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~-~ 601 (836) ...........+....++..|.. ..+..+.+-| .....+...+.+.+.+.+....+.. - T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~-------------------~~~~~FsV~P-~v~q~L~~~i~ess~FL~~Invv~V~e 60 (337) T protein:vir:79 1 MRKETRQAYEKYAAQIAKLNDTG-------------------DVSKKFAVEP-TVQQRLETKMQESSEFLKRINVLPVTE 60 (337) T ss_pred CChHHHHHHHHHHHHHHHhcChh-------------------hhcceeeecH-HHHHHHHHHHHHHHHhhccCceecccc Confidence 00000001111111111111110 0111222334 3455677788888888776554322 2 Q ss_pred CCceEEEEEecCCceeeeeccCc--ccccccccceeEEeeeeeeeeeehhHHHHHhcc--hhHHHHHHHHHHHHHHHHHH Q lcl|NC_016164. 602 LQGPVAIPRQTGAATAYWVAEGG--DPTESQPSVDQVALVAKTLGAYTEFSRRLMLQS--SIDVEQMVRTELATVIALEI 677 (836) Q Consensus 602 ~~~~~~~p~~~~~~~a~~v~Eg~--~~~~~~~~~~~it~~~~t~~~~i~ISrelL~ds--~~~l~~~i~~~l~~a~a~~~ 677 (836) ..|..-. ...+++-+.-..-+. ..|..-...+.-.|.....---+.|+.+.|... .+++...+.+.+.++++.-. T Consensus 61 ~~Ge~v~-lg~~g~iagrt~t~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~~~~ALD~ 139 (337) T protein:vir:79 61 LEGEKLG-LSVSGPIASRTDTTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDAWAKFADFQQRIRDVILNQGALDR 139 (337) T ss_pred ceeeEEe-eccCcceeeeecCCCCccccccccccCCCccEEEEeeeeeeccHHHHHHHhcChhHHHHHHHHHHHHHhhch Confidence 2222222 122223332222211 112223445555666666555566777766542 35777788888877777655 Q ss_pred HHHHHhhcCCc------cccc------cccc----cccc----------ccccccccchhHHHHHHHHHH-----Hhhhc Q lcl|NC_016164. 678 DRAALYGLGSN------SQPE------GLKF----VTGI----------NTENFGATNPTYVELVSMESK-----VAADN 726 (836) Q Consensus 678 d~~il~G~Gt~------~~p~------Gi~~----~~~~----------~~~t~aa~~~t~~~l~~a~~~-----l~~~~ 726 (836) -...++|.-.. ..|. |.+. .+.. ..+..+ ..-+|.+|-++... +...+ T Consensus 140 i~IGfnG~s~A~~Td~~~nPllqDVNkGWlQ~~Re~ap~rV~~~~~~~~~~i~iG-~~gdy~nLDalV~D~~~~lI~~~~ 218 (337) T protein:vir:79 140 IMIGWNGVKAAATTDRQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVG-KAGDYENLDALVMDIVSSMIDPWF 218 (337) T ss_pred hhhcccceeeccCCChhhCcCccccchhHHHHHHhcchhhhhccccccCcceeec-CCCCcccHHHHHHHHHhccCChHH Confidence 55667775411 1232 2221 0000 011111 11245555554443 34455 Q ss_pred cccCccEEEecHHHHHH-HHHHhhccCccc--c---ccCCCCeecceeeEeeCccccceEEEEehhceEEEeecc-eEEE Q lcl|NC_016164. 727 ADIGAMSYLTNSTLYGG-FKTTEKATSTAQ--F---VLEPGGTVNGYNVVRSNQVANGDVFFGVWNQMIMGMWGA-LDIQ 799 (836) Q Consensus 727 ~~~~~~~~vmnp~~~~~-L~~lkd~~g~~~--~---~~~~~~~l~G~pVv~s~~~~~~~i~~gD~s~~~i~~~~~-l~i~ 799 (836) .+.+..+.++..+.+.. -..+....+.+. . +.....++-|+|.+..+.+|.+.+++--|+++.|....| ..-. T Consensus 219 ~~d~~LVvivG~dLladk~~~l~n~~~~ptE~~Aa~~i~s~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~gs~RR~ 298 (337) T protein:vir:79 219 QEDTGLVAICGRELLHDKYFPIVNATQAPTERLAADLIVSQKRIGNLPAVRVPFFPKRALMVTKLSNLSIYYQEGARRRT 298 (337) T ss_pred hcCCCEEEEEchhhhhHHhhHHhccCCCcHHHHHHHHHHHhhhhCCceeEEccccCCCceEEeechhcEEEEecCcEEEE Confidence 66667788888877652 112222222221 0 011125789999999999999999999999887765544 2211 Q ss_pred EecccccccCcEEEEEEEEeccEEEcccceEEEe-----ec Q lcl|NC_016164. 800 VNPYALDKSGSVRVTALQDVDVAVRHPEAFCRGN-----DN 835 (836) Q Consensus 800 ~~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~-----~A 835 (836) ..+. -+++++.-.-..--+..|-++..++.+. .| T Consensus 299 ~~d~--p~r~rie~y~s~Ne~YvVEd~~~~a~ienI~~~~a 337 (337) T protein:vir:79 299 LKEV--PERDRIENYESSNDAYVVEDFGCGCVAENIELAAA 337 (337) T ss_pred EEEc--cccccccchhhccceeeeeccccEEEEeceeecCC Confidence 1111 1122222111122233333444444333 12 No 207 >protein:vir:78920 Length: 290 # NCBI annotation: Cps # Family: family:all:701 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468846;genbank:gi:157325479;genbank:GeneID:5601917 Probab=91.23 E-value=0.017 Score=30.31 Aligned_cols=255 Identities=10% Similarity=0.041 Sum_probs=121.0 Q ss_pred hhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeee-cCCceEEEEEecCCceeeeeccCcccccccccceeEEee Q lcl|NC_016164. 561 VVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLT-GLQGPVAIPRQTGAATAYWVAEGGDPTESQPSVDQVALV 639 (836) Q Consensus 561 ~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~-~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~it~~ 639 (836) ++ +-..+.++..+.+.+........+...... .....+.+|+.+. ....-..-++.+..++++.+..++. T Consensus 1 Ma--------in~a~~~~~~Ld~~~~~~~~t~~l~~~~~~~~ggktVkI~~i~~-~gl~DY~R~~g~~~g~v~~~~et~t 71 (290) T protein:vir:78 1 MA--------INYVDKYGKELDQKLVFGTYTNELETPNLLWLDAKTFKIQTITT-TGLKAHTRNKGYNEGSASNTNKSYT 71 (290) T ss_pred Cc--------hhHHHHHHHHHHHHHHhhheeeeccccceeeccCCEEEEeeecc-CcccccccCCCcccCccccceeeEE Confidence 00 001123444555555555444444332222 2346799998864 4444444444455555554444443 Q ss_pred --eeeeeeeehhHHHHHh----cchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccccccccccchhHH Q lcl|NC_016164. 640 --AKTLGAYTEFSRRLML----QSSIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTENFGATNPTYV 713 (836) Q Consensus 640 --~~t~~~~i~ISrelL~----ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~aa~~~t~~ 713 (836) -.++-.+ .| +-+. +-...+...+.+...+.++-.+|...+.-.-+.....+ .....+ .+..-.++ T Consensus 72 l~qdR~~~F-~v--D~~DvDEt~~~~~~~nv~~ef~~~~v~PEiDayr~skla~~a~~~~-----~~~~~t-~t~~n~~~ 142 (290) T protein:vir:78 72 IDFDRDVEF-FV--DVMDVDETGQALSAANVTKEFNSRHAGPEMDAYRFSKLATAAKTNS-----NSVAEE-ITKDNVFT 142 (290) T ss_pred eecccccee-ec--cccchhHHhhhhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhhhccC-----cccccc-cCHHHHHH Confidence 3333222 11 1111 11233444555556666666777654432211110001 011111 12223577 Q ss_pred HHHHHHHHHhhhccccCccEEEecHHHHHHHHHHhhccCc-------cccccCCCCeecceeeEeeCc---cc------c Q lcl|NC_016164. 714 ELVSMESKVAADNADIGAMSYLTNSTLYGGFKTTEKATST-------AQFVLEPGGTVNGYNVVRSNQ---VA------N 777 (836) Q Consensus 714 ~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~~g~-------~~~~~~~~~~l~G~pVv~s~~---~~------~ 777 (836) .|.+++.+|... ...+..++++|..+..|+....-... ...+...-+.|.|++|+..+. +. + T Consensus 143 ~i~~~~~~ldev--p~~~rvl~vtp~~~~lL~~~~~f~r~~~~~~~~~~~i~~~V~~idG~~ii~vps~~r~~t~~~f~~ 220 (290) T protein:vir:78 143 KLKAAIRKVKKY--GTQNLVMYVSPDVMAALELSDDFVRAINVQNIGPSSIETRITAIDGTRIVEVEAEDRFYDTFDFTD 220 (290) T ss_pred HHHHHHHHHHhc--CCCCeEEEECHHHHHHHhhChhhhccccccccccccccceeeeecCcEEEEecccchhhhhhhhcc Confidence 888888888653 35677899999999988643222110 112344456799999987542 11 1 Q ss_pred c----------eEEEEehhceEEEeecceEEEE-ecccccccCcEEEEEEEEeccEEEcc-cceEEEeecC Q lcl|NC_016164. 778 G----------DVFFGVWNQMIMGMWGALDIQV-NPYALDKSGSVRVTALQDVDVAVRHP-EAFCRGNDNL 836 (836) Q Consensus 778 ~----------~i~~gD~s~~~i~~~~~l~i~~-~~~~~~~~~~~~~r~~~r~d~~v~~p-~Af~~l~~A~ 836 (836) | .+++...+.. +.....-.+.. .|...-..+...+.-+.++|.-|.+. ..-+.+..+| T Consensus 221 G~~~~~~ak~in~ii~~~~a~-i~~~K~~~~~~~~P~~~~~~d~~~~~~r~y~d~~v~~nk~~~i~~~~~~ 290 (290) T protein:vir:78 221 GYKPAAGAKKLNFLLVNKGSV-VGGAKHASIYLHAPGSVGQGDGWLYQYRVYHDIFVLDQQKDGVIASTEV 290 (290) T ss_pred cccccCCccceeEEEEcCCce-eeeeeeeEEEeeCCCCCcCcceeeeeeeeeeeeeeeccccCeeEEEeeC Confidence 1 1333333222 11111112211 34443334556666677777777765 4456667777 No 208 >protein:vir:98566 Length: 355 # NCBI annotation: gp5 # Family: family:all:201 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958060;genbank:gi:41057357;genbank:GeneID:2744237 Probab=91.01 E-value=0.018 Score=30.16 Aligned_cols=291 Identities=10% Similarity=0.057 Sum_probs=128.3 Q ss_pred hhhhhhhhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeee-c Q lcl|NC_016164. 523 AAFEAAAFEREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLT-G 601 (836) Q Consensus 523 ~~~~~~~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~-~ 601 (836) ...........+....++..|.... ..+-.+.|-|. ....+...+.+.+.+.+....+.. - T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~-----------------~~~~~FsV~P~-v~q~L~~~i~ess~FL~~INvv~V~e 62 (355) T protein:vir:98 1 MRPETRFKFNAYLTRVAELNNISTD-----------------DVSKKFTVEPS-VTQTLMNTVQASSAFLKTINILPVAE 62 (355) T ss_pred CChHHHHHHHHHHHHHHHHhCCChh-----------------HccceeecCHH-HHHHHHHHHHHHHHHhhcCceecccc Confidence 0000000111111111111111100 00112223343 455677788888888776554322 2 Q ss_pred CCceEEEEEecCCceeeeeccC--cc-cccccccceeEEeeeeeeeeeehhHHHHHhcc--hhHHHHHHHHHHHHHHHHH Q lcl|NC_016164. 602 LQGPVAIPRQTGAATAYWVAEG--GD-PTESQPSVDQVALVAKTLGAYTEFSRRLMLQS--SIDVEQMVRTELATVIALE 676 (836) Q Consensus 602 ~~~~~~~p~~~~~~~a~~v~Eg--~~-~~~~~~~~~~it~~~~t~~~~i~ISrelL~ds--~~~l~~~i~~~l~~a~a~~ 676 (836) ..|..-. ...+++-+.-+.-+ .+ .+..-...+.-.|.....---+.|+.+.|... .+++...+.+.+.++++.- T Consensus 63 ~~Ge~i~-lgv~g~iagrtdT~~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~k~~ALD 141 (355) T protein:vir:98 63 MKGEKIG-VGVTGTIASTTDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALD 141 (355) T ss_pred ceeeEee-eccCccccccccCCCCCCcccccccccCCCccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHHHHHHHhhc Confidence 2232222 12223333322211 11 12222344555566655555556666666542 3567777777777777665 Q ss_pred HHHHHHhhcCC----c--ccc------ccccc----ccc--c-------------ccccccccchhHHHHHHHHHH---- Q lcl|NC_016164. 677 IDRAALYGLGS----N--SQP------EGLKF----VTG--I-------------NTENFGATNPTYVELVSMESK---- 721 (836) Q Consensus 677 ~d~~il~G~Gt----~--~~p------~Gi~~----~~~--~-------------~~~t~aa~~~t~~~l~~a~~~---- 721 (836) .-...++|.-. + ..| +|.+. .+. + ..+..+ ..-+|.+|-++... T Consensus 142 ~i~IGfNG~s~A~~Td~~~nPllqDVNkGWlQ~~Re~ap~~v~~~~~~~~~~~~~~~i~~G-~~gdy~NLDAlV~D~~~~ 220 (355) T protein:vir:98 142 LIMAGFNGTTRADTSDRTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVG-KNGDYENIDALVMDATNN 220 (355) T ss_pred hhhhcccceeeeccCChhhCcCccccchhHHHHHHhcchhhhhhhhcccCccccccceeeC-CCCCcccHHHHHHHHHhc Confidence 55566777541 1 123 23221 000 0 001111 11235555444443 Q ss_pred -HhhhccccCccEEEecHHHHHH-HHHHhhccCccc--c---ccCCCCeecceeeEeeCccccceEEEEehhceEEEeec Q lcl|NC_016164. 722 -VAADNADIGAMSYLTNSTLYGG-FKTTEKATSTAQ--F---VLEPGGTVNGYNVVRSNQVANGDVFFGVWNQMIMGMWG 794 (836) Q Consensus 722 -l~~~~~~~~~~~~vmnp~~~~~-L~~lkd~~g~~~--~---~~~~~~~l~G~pVv~s~~~~~~~i~~gD~s~~~i~~~~ 794 (836) |...+.+.+..+.++..+.+.. --.+......+. . +.....++-|+|.+..+.+|.+.+++--++++.|.... T Consensus 221 lI~~~~~~d~dLVvivG~dLla~k~~~l~n~~~~ptE~~Aa~~i~s~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~ 300 (355) T protein:vir:98 221 LIDEVYQDDPNLVAIVGRKLLADKYFPLVNKQQENSESLAADIIISQKRIGNLPAVRVPYFPANAVLVTTLENLSIYFMD 300 (355) T ss_pred cCChHHhcCCCEEEEEchhhhHHHhhhHhhccCCcHHHHHHHHHHHhhhhCCceeEEccccCCCceEEeeccccEEEEec Confidence 3444566666788888876542 122222222221 0 11123578999999999999999999999888776544 Q ss_pred c-eEEEEecccccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 795 A-LDIQVNPYALDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 795 ~-l~i~~~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) + ..-...+. -+++++.-.-..--+..|-++..++.+. ++ T Consensus 301 gs~RR~~~d~--p~r~rie~y~s~Ne~YvVEd~~~~a~ie-nI 340 (355) T protein:vir:98 301 ESHRRSIDEN--PKKDRVENYESMNIDYVVEVYAAGCLLE-NI 340 (355) T ss_pred CcEEEEEEec--cccccccchhhhcceeeeeccccEEEee-ce Confidence 4 22111111 1111111111111222233333333322 12 No 209 >protein:vir:1829 Length: 355 # NCBI annotation: major capsid protein # Family: family:all:201 # MgeID: mge:324 # MgeName: 186 # Cross-refs: genbank:acc:NP_052253;genbank:gi:9634060;genbank:GeneID:1262428 Probab=90.87 E-value=0.019 Score=30.07 Aligned_cols=291 Identities=12% Similarity=0.077 Sum_probs=130.2 Q ss_pred hhhhhhhhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeee-c Q lcl|NC_016164. 523 AAFEAAAFEREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLT-G 601 (836) Q Consensus 523 ~~~~~~~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~-~ 601 (836) ...........+....++..|... + ..+..+.+-| .....+...+.+.+.+.+....+.. - T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~------------~-----~~~~~Fsv~P-~v~q~L~~~i~ess~FL~~INvv~V~e 62 (355) T protein:vir:18 1 MRQETRFKFNAYLTQLAKLNGISV------------D-----DVSKKFTVEP-SVTQTLMNTVQASSAFLQMINILPVAE 62 (355) T ss_pred CChHHHHHHHHHHHHHHHHhCCCh------------h-----HccceeccCH-HHHHHHHHHHHHHHHHhhcCceecccc Confidence 000000111111112121111110 0 0011222333 3555677888888888776554422 2 Q ss_pred CCceEEEEEecCCceeeeeccC--cc-cccccccceeEEeeeeeeeeeehhHHHHHhcc--hhHHHHHHHHHHHHHHHHH Q lcl|NC_016164. 602 LQGPVAIPRQTGAATAYWVAEG--GD-PTESQPSVDQVALVAKTLGAYTEFSRRLMLQS--SIDVEQMVRTELATVIALE 676 (836) Q Consensus 602 ~~~~~~~p~~~~~~~a~~v~Eg--~~-~~~~~~~~~~it~~~~t~~~~i~ISrelL~ds--~~~l~~~i~~~l~~a~a~~ 676 (836) ..|..-. ...+++-+.-+.-+ .+ .+......+.-.|.....---+.|+.+.|... .+++...+.+.+.++++.- T Consensus 63 ~~Ge~i~-lgv~g~iagrtdT~~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~k~~ALD 141 (355) T protein:vir:18 63 MKGEKIG-VGVTGTIASTTDTSGDKERQTADFTALESNKYECNQINFDFHLTYKRLDLWARFQDFQRRIRDAIVQRQALD 141 (355) T ss_pred ceeeEEe-eccCcceeeccccCCCCCcccccccccCCCccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHHHHHHHhhc Confidence 2222222 22223333332211 11 12222344555666666555556666666542 3567777777777777665 Q ss_pred HHHHHHhhcCC----c--ccc------ccccc----ccc--c-------------ccccccccchhHHHHHHHHHH---- Q lcl|NC_016164. 677 IDRAALYGLGS----N--SQP------EGLKF----VTG--I-------------NTENFGATNPTYVELVSMESK---- 721 (836) Q Consensus 677 ~d~~il~G~Gt----~--~~p------~Gi~~----~~~--~-------------~~~t~aa~~~t~~~l~~a~~~---- 721 (836) .-...++|.-. + ..| +|.+. .+. + ..+..+ ..-+|.+|-++... T Consensus 142 ~i~IGfNG~s~A~~Td~~~nPllqDVNkGWlQ~~Re~ap~rV~~~~~~~~~~~~~~~i~~G-~~gdy~NLDAlV~d~~~~ 220 (355) T protein:vir:18 142 FIMAGFNGTTRADTSDRVKNPMLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVG-KNGDYENLDALVMDGTNT 220 (355) T ss_pred hhhhcccceeeeccCChhhCcCccccchhHHHHHHhcchhhhhccccccccccccceeeec-CCCCcccHHHHHHHHHhc Confidence 55566777541 1 123 23221 000 0 001111 11245555554443 Q ss_pred -HhhhccccCccEEEecHHHHHH-HHHHhhccCcccc-c----cCCCCeecceeeEeeCccccceEEEEehhceEEEeec Q lcl|NC_016164. 722 -VAADNADIGAMSYLTNSTLYGG-FKTTEKATSTAQF-V----LEPGGTVNGYNVVRSNQVANGDVFFGVWNQMIMGMWG 794 (836) Q Consensus 722 -l~~~~~~~~~~~~vmnp~~~~~-L~~lkd~~g~~~~-~----~~~~~~l~G~pVv~s~~~~~~~i~~gD~s~~~i~~~~ 794 (836) |...+.+.+..+.++..+.+.. --.+....+.+.= + .....++-|+|.+..+.+|.+.+++--|+++.|.... T Consensus 221 lI~~~~~~d~dLVvivG~dLla~k~~~l~n~~~~ptE~~Aa~~i~s~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~ 300 (355) T protein:vir:18 221 LIDEIYQDDPKLVAIVGRKLLADKYFPLVNKQQENTESLAADIIISQKRIGNLPAVRVPYFPANAVFVTTLENLSIYFMD 300 (355) T ss_pred cCChHHhcCCCEEEEEchhhhHHHHhHHhhccCChHHHHHHHHHHHHHhhCCceeEEccccCCCceEEeeccccEEEEec Confidence 3444566666788888876542 1222222232210 1 1113578999999999999999999999988776544 Q ss_pred c-eEEEEecccccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 795 A-LDIQVNPYALDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 795 ~-l~i~~~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) + ..-...+. -+++.+.-.-..--+..|-++..++.+. ++ T Consensus 301 gs~RR~~~d~--p~r~rie~y~s~Ne~YvVEd~~~~a~ie-ni 340 (355) T protein:vir:18 301 ESHRRSIDEN--PKKDRVENYESMNIDYVVEAYAAGCLLE-NI 340 (355) T ss_pred CcEEEEEEec--cccccccchhhhcceeeeeccccEEEEe-ee Confidence 4 22111111 1112221111112222333333333333 12 No 210 >protein:vir:96666 Length: 462 # NCBI annotation: ORF016 # Family: family:all:2450 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238545;genbank:gi:66391271;genbank:GeneID:5130448 Probab=90.17 E-value=0.022 Score=29.64 Aligned_cols=302 Identities=13% Similarity=0.111 Sum_probs=126.8 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHh-- Q lcl|NC_016164. 510 VRAIRAQMMPGDRAAFEAAAFEREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRN-- 587 (836) Q Consensus 510 ~~a~~a~~~~~~~~~~~~~~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~-- 587 (836) +.... +...... .......+++.+.+. .|. ..+..+..+++.+--+.+...+..+-.. T Consensus 1 ~~~~~---~~~~~~~----~~~~~~~e~~~KS~~---tg~----------g~~p~~q~~~gAlR~esL~~~i~~Lt~~~~ 60 (462) T protein:vir:96 1 MHKDT---NLTAEQN----KYADKFQEEVMKSYQ---TGY----------GITPDTQVDAGALRREILDDQITMLTWTQD 60 (462) T ss_pred Ccccc---ccchhhh----hhhchhhHHHHHHHh---cCC----------CcCCccccccchhhhhhhhhhhheeeeccc Confidence 00000 0000000 000001112221111 000 0001111222222222222222111110 Q ss_pred -hhhhhhhcceeeecCCceEE-EEEecCCceeeeeccCcccccccccceeEEeeeeeeeeeehhHHHH-HhcchhHHHHH Q lcl|NC_016164. 588 -RLALNTLGVTMLTGLQGPVA-IPRQTGAATAYWVAEGGDPTESQPSVDQVALVAKTLGAYTEFSRRL-MLQSSIDVEQM 664 (836) Q Consensus 588 -~~~l~~l~~~~~~~~~~~~~-~p~~~~~~~a~~v~Eg~~~~~~~~~~~~it~~~~t~~~~i~ISrel-L~ds~~~l~~~ 664 (836) ..-+..+..+.....-..+. +.......-..++.|++..+.+++.+.+.+..++=++....+|-.+ |.++..+..+. T Consensus 61 ~~~~~~~i~k~~a~sTv~~y~~~~~~G~~g~~~f~~E~g~~~~~d~~~~R~~~~~k~l~~t~~vsi~~tl~n~~~d~~~~ 140 (462) T protein:vir:96 61 DLIFYREISRRPAQSTVQKYDVYLRHGNVGHSRFVREVGVAPVSDPNIRQKTVEMKYVSDTKNLSIASTLVNNIQDPMQI 140 (462) T ss_pred chhhhhhcCCchhhhhhhhheeeeccCccccccccccccccccCCCceEEEEEEEEEEeeeeeechhhhhccchhhHHHH Confidence 01111121111111111111 1223334667889999999999999999999999999877777654 56677788888 Q ss_pred HHHHHHHHHHHHHHHHHHhhcCC---c-----ccccccccc-cccccccccccchhHHHHHHHHHHHhhhccccCccEEE Q lcl|NC_016164. 665 VRTELATVIALEIDRAALYGLGS---N-----SQPEGLKFV-TGINTENFGATNPTYVELVSMESKVAADNADIGAMSYL 735 (836) Q Consensus 665 i~~~l~~a~a~~~d~~il~G~Gt---~-----~~p~Gi~~~-~~~~~~t~aa~~~t~~~l~~a~~~l~~~~~~~~~~~~v 735 (836) ..+.-...++..++.+.|+|+-. + -+..||.+- +.-+...+-+..++.+.|..+-..+..+++ .+.-++ T Consensus 141 ~~~dai~~~a~tiE~a~Fygds~l~~~~~~~gleFDGl~~lI~~~NViDarG~~Ls~~~ln~aa~~i~~~fG--t~TD~~ 218 (462) T protein:vir:96 141 LTEDAIAVVAKTIEWASFYGDASLTADPTGQGLEFDGLAKLIDKDNVIDAKGESLTETLLNRSAVLIGKSFG--TATDAY 218 (462) T ss_pred HHHHHHHHHHHHHHHHHhhhhcccCCCccccccchhhhhhhcCCCceeecCCCCccHHHHhhhhhhcccccC--Chhhee Confidence 88888999999999999999632 1 234555432 344556666777787777766655554443 345678 Q ss_pred ecHHHHHHHHHHhhccCcccccc-CCCCeecceee--EeeCc--c--c-----cceEEEE----ehhceEEEeecceEEE Q lcl|NC_016164. 736 TNSTLYGGFKTTEKATSTAQFVL-EPGGTVNGYNV--VRSNQ--V--A-----NGDVFFG----VWNQMIMGMWGALDIQ 799 (836) Q Consensus 736 mnp~~~~~L~~lkd~~g~~~~~~-~~~~~l~G~pV--v~s~~--~--~-----~~~i~~g----D~s~~~i~~~~~l~i~ 799 (836) |+..+.+.|..-.-. .+..+.. +++....|++| +.+.. + . .+..+++ ++..+ ..-..+... T Consensus 219 ~p~~v~a~f~~~~l~-~qrv~~~~n~g~~~~G~~v~~f~s~~G~I~L~~s~~m~~~~i~~~~~~~~p~a--p~~~~vsaT 295 (462) T protein:vir:96 219 MPIGVHADFVNSVLG-RQMQLMQDNSGNVNAGYNVQGFYSSRGFIKLHGSTVMENELILDESLQPLPNA--PQPATVKAT 295 (462) T ss_pred cchHHHHHHHHhhcC-ceEEEEcCCCCceeeeeeccceeeeeeeeeeCCceecCcccccccccccCCCC--CCCCceeEE Confidence 888888777643221 1111111 11112334333 11100 0 0 0000000 00000 000000000 Q ss_pred --Eeccccc----ccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 800 --VNPYALD----KSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 800 --~~~~~~~----~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) ......| ...++.|++...-.-+=--|...+-++.|= T Consensus 296 v~t~~~g~f~~~~d~~~y~Y~V~avs~dgeS~PS~~VtaTva~ 338 (462) T protein:vir:96 296 VETGKKGLFTDEHDRAELTYKVVVNSDDAQSAPSEAVTATVNN 338 (462) T ss_pred EEeCCCCCCCCccCceeEEEEEEEECCCCccccceeeEeeeec Confidence 0000000 112333333333222222233333333221 No 211 >protein:vir:79712 Length: 285 # NCBI annotation: major capsid protein gp34 # Family: family:all:701 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285883;genbank:gi:148750840;genbank:GeneID:5220414 Probab=89.75 E-value=0.025 Score=29.41 Aligned_cols=254 Identities=12% Similarity=0.088 Sum_probs=112.7 Q ss_pred cccccchhhHHHHHHHHHhhhhhhhhcce-----eeecCCceEEEEEecCCceeeeeccCcccccccccceeEEeeeeee Q lcl|NC_016164. 569 GDLVFTDGRPGSFIELLRNRLALNTLGVT-----MLTGLQGPVAIPRQTGAATAYWVAEGGDPTESQPSVDQVALVAKTL 643 (836) Q Consensus 569 g~~vvp~~~~~~ii~~l~~~~~l~~l~~~-----~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~it~~~~t~ 643 (836) -.+...+.+...+.+.+........+... +.....+.+++|+..+...+.-..-+.-.+.++.+.+..++++..= T Consensus 1 Main~~~k~~~~ld~~~~~~~~~~~l~~~~n~~~~~~~gak~VkIp~ist~~gl~dY~R~~g~~~g~v~~~~et~tl~~D 80 (285) T protein:vir:79 1 MTVVLDSKDLARIDEEYKADSQVWSYLTGGNGVTQRFRGHNEVRINKLSGFVDATAYKRGQDNARKTISVGKETVKLTHE 80 (285) T ss_pred CcchhhHHHHHHHHHHHHHhhhhhhhcccCCcceeEecCCCEEEEeeecccccccccccccCccccccceeeeEEEeecc Confidence 00111223444454555444444444221 2233456899998854333333443333455555444444433211 Q ss_pred -eeeehhHHHHHhcc---hhHHHHHHHHHHH-HHHHHHHHHHHHhhcCCcccccccccccccccccccccchhHHHHHHH Q lcl|NC_016164. 644 -GAYTEFSRRLMLQS---SIDVEQMVRTELA-TVIALEIDRAALYGLGSNSQPEGLKFVTGINTENFGATNPTYVELVSM 718 (836) Q Consensus 644 -~~~i~ISrelL~ds---~~~l~~~i~~~l~-~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~aa~~~t~~~l~~a 718 (836) +.-+.|. -+..+ ... .+.|..++. ..+.=.+|...++-.-++. | ...+.+. +..-.++.|.++ T Consensus 81 R~~~f~iD--~mDvdEn~~~~-~~ni~~ef~~~~vvPEiDayrfskla~~a---~-----~~~~~~~-T~~nv~~~i~~~ 148 (285) T protein:vir:79 81 DWFGYDLD--QFDMDENGAYT-VENVVREHNKMITIPHRDKVAVQKLFDSA---A-----KKATDSI-TKDNALDAYDTA 148 (285) T ss_pred ccceeccc--ccchhhhhhhh-HHHHHHHHHhhhhcchhhHHHHHHHHhhc---c-----ccccccc-CHHHHHHHHHHH Confidence 1111111 11111 111 122222222 2222244433332211100 0 0011111 122247888999 Q ss_pred HHHHhhhccccCccEEEecHHHHHHHHHHhhccCcc----cc----ccCCCCeecc-eeeEee--Cccccc------eEE Q lcl|NC_016164. 719 ESKVAADNADIGAMSYLTNSTLYGGFKTTEKATSTA----QF----VLEPGGTVNG-YNVVRS--NQVANG------DVF 781 (836) Q Consensus 719 ~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~~g~~----~~----~~~~~~~l~G-~pVv~s--~~~~~~------~i~ 781 (836) +.+|...... ++.+++|+|.++..|+.-+.-.... .+ +...-+.|.| .|++.. +.+... .++ T Consensus 149 ~~~lde~~vp-~~rvl~vTp~~~~~Lk~s~~~~r~~~~~~~~~~~~i~~~V~~lDg~v~ii~Vps~r~kt~~~~k~Infi 227 (285) T protein:vir:79 149 EAYMFDNEVP-GGFVMFVSSAYYTALKQSAAVTRTFSTDGTMVINGIDRRVAQLDGGVPIVRVSSDRLKGLGITNHVNFI 227 (285) T ss_pred HHHHHHcCCC-CceEEEEChHHHHHHHhhhhhheecccccceeccceeeeeccccceeEEEEcchhhccCcCcchhccEE Confidence 9999887655 5678899999999887554321110 01 1222346788 788764 455431 134 Q ss_pred EEehhceEEEeecceE-EEEecccccccCcEEEEEEEEeccEEEccc--c-eEEEeecC Q lcl|NC_016164. 782 FGVWNQMIMGMWGALD-IQVNPYALDKSGSVRVTALQDVDVAVRHPE--A-FCRGNDNL 836 (836) Q Consensus 782 ~gD~s~~~i~~~~~l~-i~~~~~~~~~~~~~~~r~~~r~d~~v~~p~--A-f~~l~~A~ 836 (836) +...+. .+.....-. ...+|...-..+...+.-+.++|.-|.+.+ + ++..+.|| T Consensus 228 iv~~~a-~i~~~K~~~~~~f~P~~~~~~d~~~~~~R~Y~d~fv~~nk~~~Iy~~~~a~~ 285 (285) T protein:vir:79 228 LTPLSA-IAPIVKYDSVSVIDPSTDRSGNRWTIKGLSYYDAIVLDNAKKGIYVAATAGV 285 (285) T ss_pred EecCce-eccceeeeeeEeECCCCCCCcceeeeeeeeeeeeeehhhccceeeeeecccC Confidence 444332 222222111 223444443334556666667777666642 3 45556667 No 212 >protein:vir:79246 Length: 304 # NCBI annotation: conserved hypothetical protein # Family: family:all:776 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469162;genbank:gi:157835004;genbank:GeneID:5648827 Probab=88.23 E-value=0.0087 Score=31.89 Aligned_cols=218 Identities=8% Similarity=-0.023 Sum_probs=114.3 Q ss_pred hhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEecCCcee-eeeccCcccc Q lcl|NC_016164. 549 ILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQTGAATA-YWVAEGGDPT 627 (836) Q Consensus 549 ~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~~~~~a-~~v~Eg~~~~ 627 (836) +.......-.+ +-.-+...|...+....+..+..+..++..+..=++.-+...|.. .|++|= . T Consensus 1 M~ii~~~~L~~-------------l~~~~~~~f~~~~~~a~~~~~~iA~~VpSt~~~~tY~WLg~~P~mreWiG~r---~ 64 (304) T protein:vir:79 1 MAIITPALISA-------------LKTSFQKHFQDALATAPSTYLQVATVIPSTTASNTYGWLGQFPKLREWIGQR---V 64 (304) T ss_pred CCccCHHHHHH-------------HHHHHHHHHHHHHhhcCcccceeEeEeecCccccccchhcccccchhhhhhh---h Confidence 00000000000 011133344444444444445556667777666666666666665 678653 4 Q ss_pred cccccceeEEeeeeeeeeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCC---cccccc--ccccccccc Q lcl|NC_016164. 628 ESQPSVDQVALVAKTLGAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGS---NSQPEG--LKFVTGINT 702 (836) Q Consensus 628 ~~~~~~~~it~~~~t~~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt---~~~p~G--i~~~~~~~~ 702 (836) ...+.-...++.-++|-..+.|.|+-++||.+++..-+.++|+++.+..=|..++.-... .....| +|.++|... T Consensus 65 i~~l~~~~y~I~Nk~fE~Tv~V~R~dIEDD~~Giy~p~~~~~G~~aa~~Pd~lvf~lL~~Gf~t~CyDGq~FFdtDHpv~ 144 (304) T protein:vir:79 65 IKDMAAQGYQITNKLFESTVGVKRTDIEDDNLGVYGPLMQEMGRAAGAHPDELVFALLKAGNANLCYDGQNFFDTDHPVY 144 (304) T ss_pred hhhhhhccceeeccccccceeeccccccccccCchHHHHHHHHHHHhcCchhhHHHHHHhhhcccCCCcccccccCCccc Confidence 455566667788889999999999999999999999999999999998877766533221 111122 343333210 Q ss_pred --ccccccchhHHHHHHHHHHHhhhccccCccEEEecHHHHHHHHHHhhccCccccccCCCCeecceeeEeeCccccceE Q lcl|NC_016164. 703 --ENFGATNPTYVELVSMESKVAADNADIGAMSYLTNSTLYGGFKTTEKATSTAQFVLEPGGTVNGYNVVRSNQVANGDV 780 (836) Q Consensus 703 --~t~aa~~~t~~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~~g~~~~~~~~~~~l~G~pVv~s~~~~~~~i 780 (836) ....+...+..+ +..+ +..+...+ T Consensus 145 ~~~d~~g~~~~vsn-----------------------------------------~~~~-------------~~~~g~~w 170 (304) T protein:vir:79 145 PNVDGTGTATTVSN-----------------------------------------LFAP-------------AADPGAAW 170 (304) T ss_pred ccccccccccccee-----------------------------------------eccC-------------CCCCCCeE Confidence 000000000000 0000 00112234 Q ss_pred EEEehh----ceEEEeecceEEEEe----cccccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 781 FFGVWN----QMIMGMWGALDIQVN----PYALDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 781 ~~gD~s----~~~i~~~~~l~i~~~----~~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) ++.|-+ -+++-.|...++... ++.-|.++...|-+..|++++--...-...-+.+| T Consensus 171 ~LlD~sr~iKP~I~Q~Rk~~~~~~~~~~~d~~Vf~~~e~~yGvd~R~n~GygfWQlA~gS~a~L 234 (304) T protein:vir:79 171 YLLDTSRSLKPLIYQERMKPSFTSLTKEDNEQVFMADEYVYGVRSRCNVGFGFWQLAAMSTEEL 234 (304) T ss_pred EEEeCCCcccceeeeccccceeeecCCCCchhhhhhcceEEeeeeeeccchhhhhhhhhcCCcc Confidence 555543 234444544444321 22346777888888888877665544333334455 No 213 >protein:vir:98856 Length: 343 # NCBI annotation: hypothetical protein # Family: family:all:201 # MgeID: mge:1495 # MgeName: F108 # Cross-refs: genbank:acc:YP_654732;genbank:gi:109302917;genbank:GeneID:4156061 Probab=88.17 E-value=0.034 Score=28.63 Aligned_cols=296 Identities=11% Similarity=0.034 Sum_probs=122.8 Q ss_pred hhhhhhhhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeecC Q lcl|NC_016164. 523 AAFEAAAFEREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGL 602 (836) Q Consensus 523 ~~~~~~~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~ 602 (836) ...........+....++..|.... ....+..+.+.+.....+...+.+.+.+.+....+.... T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~----------------~~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~q 64 (343) T protein:vir:98 1 MNKTAQELFYSLIGDAAEYYGANPA----------------LALAGKQFSIEAPKESVLLGAIQQRSNFLEKINCVFSER 64 (343) T ss_pred CChHHHHHHHHHHHHHHHHhCCccc----------------hhccCceeeecHHHHHHHHHHHHHHHHHhhcCceecchh Confidence 0000001111111122222111100 001111122333345667778888888877655433322 Q ss_pred CceEEEEEecCCceeeeeccCcccccccccceeEEeeeeeeeeeehhHHHHHhcc--hhH-HHHHHHHHHHHHHHHHHHH Q lcl|NC_016164. 603 QGPVAIPRQTGAATAYWVAEGGDPTESQPSVDQVALVAKTLGAYTEFSRRLMLQS--SID-VEQMVRTELATVIALEIDR 679 (836) Q Consensus 603 ~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~it~~~~t~~~~i~ISrelL~ds--~~~-l~~~i~~~l~~a~a~~~d~ 679 (836) -+.-......++....-....+...... +.+.-.|.....---+.|+.+.|... .++ +...+...+.++++.-.=. T Consensus 65 ~~g~v~~~~~sg~~t~r~~t~~~~~~~~-~~~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~deF~~r~~~~i~~~~ALD~i~ 143 (343) T protein:vir:98 65 YQRAIDLRSNRKRHYGAHDRRTPIQQRW-TRQVMSMNVSRQIQACLIPWAKLDQWGHLKDKFASLYAEFVQNQIALDMIK 143 (343) T ss_pred hcceEEEeecCccccCccccCCCccccc-cCCCCccEEEEeeeeeeccHHHHHHhhcChhHHHHHHHHHHHHHHhhccce Confidence 2222222222222222111111111000 01111244444333445666665443 244 6666666666666554444 Q ss_pred HHHhhcCCc---cccc------cccc----ccc--c--------ccccccccchhHHHHHHHHH----HHhhhccccCcc Q lcl|NC_016164. 680 AALYGLGSN---SQPE------GLKF----VTG--I--------NTENFGATNPTYVELVSMES----KVAADNADIGAM 732 (836) Q Consensus 680 ~il~G~Gt~---~~p~------Gi~~----~~~--~--------~~~t~aa~~~t~~~l~~a~~----~l~~~~~~~~~~ 732 (836) ..++|.-.. ..|. |.+. .+. + .....+.+ -+|.+|-++.. .|...+.+.+.. T Consensus 144 IGfNGts~A~~T~nPllqDVN~GWLQ~~Re~ap~rVm~~~~~~~~~~~~G~g-gdy~NLDalV~D~~~~I~~~~~~d~dL 222 (343) T protein:vir:98 144 IGFYGTSVGTDTSDPNLADVNKGWIQFVRENKATQILTQGATSGEIRLFGEG-ADYVNLDELAYDLKQGLDARHRDAGDL 222 (343) T ss_pred ecccceeeccCCCCcchhhcchHHHHHHHhcchhhhhccceeccceeEecCC-CCcccHHHHHHHHHhcCchHHhcCCCE Confidence 556775321 2342 2211 000 0 00011111 13544444443 344445555667 Q ss_pred EEEecHHHHHHHH-HHhhccCc-cc--c---ccCCCCeecceeeEeeCccccceEEEEehhceEEEeecc-eEEEE--ec Q lcl|NC_016164. 733 SYLTNSTLYGGFK-TTEKATST-AQ--F---VLEPGGTVNGYNVVRSNQVANGDVFFGVWNQMIMGMWGA-LDIQV--NP 802 (836) Q Consensus 733 ~~vmnp~~~~~L~-~lkd~~g~-~~--~---~~~~~~~l~G~pVv~s~~~~~~~i~~gD~s~~~i~~~~~-l~i~~--~~ 802 (836) +.++..+....-. .+....++ +. . +.....++-|+|.+..+++|.+.+++--|+++.|....| ..-.. .+ T Consensus 223 VvivG~dLla~~~~~l~n~~~~~ptEk~Aa~~~~~~k~iGGl~a~~~PfFP~~~llVT~L~NLsIY~Q~gs~RR~~~d~p 302 (343) T protein:vir:98 223 VFLVGADLVAKEASLVYKGNGLIATEKAALNTHDLMKSFGGMPAMIVPNMPPRAAIVTSLSNLSIYTQEGSMRRGMKDDD 302 (343) T ss_pred EEEEchhhhhhhhhhhhhhcCCChHHHHHHHHHHHHHhhCCCeeEEccccCCCceEEeeccccEEEEecCcEEEEEEecc Confidence 8888887654321 22222222 10 0 011235789999999999999999999998887765443 22211 12 Q ss_pred c----cccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 803 Y----ALDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 803 ~----~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) . ..+.+-...|.++..--++++.--.|...+.+= T Consensus 303 ~r~rie~y~s~Ne~YvVEd~~~~a~iE~i~v~~~~~~g 340 (343) T protein:vir:98 303 DKKAVRDSYYRNEAYAVEDCGKFMAVDFTKVKLSSGKG 340 (343) T ss_pred ccccccchhhhcceeeeeccccEEEeeeeeeeecCCCC Confidence 1 112222233333333333333333333332211 No 214 >protein:vir:63741 Length: 468 # NCBI annotation: Cps # Family: family:all:2450 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547622;genbank:GeneID:3783474 Probab=87.19 E-value=0.04 Score=28.22 Aligned_cols=292 Identities=12% Similarity=0.063 Sum_probs=122.2 Q ss_pred hhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhh---hh Q lcl|NC_016164. 514 RAQMMPGDRAAFEAAAFEREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNR---LA 590 (836) Q Consensus 514 ~a~~~~~~~~~~~~~~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~---~~ 590 (836) .+..+..+.. .-.........+.+.+- .|. ..+..+...|+.+--+.+...+..+.... .- T Consensus 1 ~~~~~~~~~~---~~~~~~~~~e~~~Ks~~---agy----------~~~p~~q~~~~AlR~EsL~~~i~~L~~~~~~f~~ 64 (468) T protein:vir:63 1 MPKNNKEEEV---KEVNLNSVQEDALKSFT---TGY----------GITPDTQTDAGALRREFLDDQISMLTWTENDLTF 64 (468) T ss_pred CCCCcchhhc---cccChhHHHHHHHHHHH---cCc----------ccCCccccCcchhhhhhhhhhhheeeecccchhh Confidence 0100100000 00111111122222110 000 01111122233332222222222111111 11 Q ss_pred hhhhcceeeecCCceEE-EEEecCCceeeeeccCcccccccccceeEEeeeeeeeeeehhHHHH-HhcchhHHHHHHHHH Q lcl|NC_016164. 591 LNTLGVTMLTGLQGPVA-IPRQTGAATAYWVAEGGDPTESQPSVDQVALVAKTLGAYTEFSRRL-MLQSSIDVEQMVRTE 668 (836) Q Consensus 591 l~~l~~~~~~~~~~~~~-~p~~~~~~~a~~v~Eg~~~~~~~~~~~~it~~~~t~~~~i~ISrel-L~ds~~~l~~~i~~~ 668 (836) +..+..+.....-..+. +.......-..++.|++..+.+++++.+.+..++=++....+|.-+ +.++..+......+. T Consensus 65 ~~di~k~~a~stv~~y~~~~~~G~~g~~~f~~E~g~~~~~~~~~~r~~~~~k~l~~~~~vs~~~~l~n~i~d~~~~~~~~ 144 (468) T protein:vir:63 65 YKDIAKKPATSTVAKYDVYMQHGKVGHTRFTREIGVAPVSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDD 144 (468) T ss_pred hhhcccchhhhhhhhheeeeccCccccccccccccccccCCCceEEEEEEeeeeeeeeeehhhhhhhcchhhHHHHHHHH Confidence 11111111111111111 1223334667889999999999999999999999999866666543 456666777888888 Q ss_pred HHHHHHHHHHHHHHhhcCC----cc-----cccccccc-cccccccccccchhHHHHHHHHHHHhhhccccCccEEEecH Q lcl|NC_016164. 669 LATVIALEIDRAALYGLGS----NS-----QPEGLKFV-TGINTENFGATNPTYVELVSMESKVAADNADIGAMSYLTNS 738 (836) Q Consensus 669 l~~a~a~~~d~~il~G~Gt----~~-----~p~Gi~~~-~~~~~~t~aa~~~t~~~l~~a~~~l~~~~~~~~~~~~vmnp 738 (836) -...++..++.+.|+|+-. ++ +..||+.. +.-+..+.-+..++.+.|..+...+...++ ...-++|+. T Consensus 145 ai~~~a~tiE~a~FyGds~l~~s~~~~~glqfDGi~~li~~enviDa~G~~ls~~~lneaa~~i~~gfG--~~td~~~~~ 222 (468) T protein:vir:63 145 AIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQDNVHDARGASLTESLLNQAAVMISKGYG--TPTDAYMPV 222 (468) T ss_pred HHHHHHHHHHHHhhhcccccccCCCccccccccceeEEecCCceeccCCCccCHHHHHHHhhhcccccc--Chhhhhcch Confidence 8899999999999998742 12 34555532 233445555556676666666554443332 344567777 Q ss_pred HHHHHHHH-HhhccCcccc-ccCCCCeecceeeEeeCccc-cc------eEEEEehhceEEEeecceEEEEecccccccC Q lcl|NC_016164. 739 TLYGGFKT-TEKATSTAQF-VLEPGGTVNGYNVVRSNQVA-NG------DVFFGVWNQMIMGMWGALDIQVNPYALDKSG 809 (836) Q Consensus 739 ~~~~~L~~-lkd~~g~~~~-~~~~~~~l~G~pVv~s~~~~-~~------~i~~gD~s~~~i~~~~~l~i~~~~~~~~~~~ 809 (836) .+.+.|.. ... .+..+ ......-..|+||- -.+. -| ..+++|..-.. -...+.... ..-. T Consensus 223 ~v~a~~~~~~L~--~q~~v~~~n~~~~~~G~~v~--g~~sa~G~I~l~gs~il~~~~~l~-~~~~~~~~A------psp~ 291 (468) T protein:vir:63 223 GVQADFVNQQLS--KQTQLVRDNGNNVSVGFNIQ--GFHSARGFIKLHGSTVMENEQILD-ERILALPTA------PQPA 291 (468) T ss_pred hHHhhhhhhhcC--ceEEEEcCCCCceeeeeccc--ceecceeeeeecCceeeccccCCC-ccccccccc------ccCC Confidence 77665522 211 11222 22223345566661 1111 11 22333322110 000000000 0011 Q ss_pred cEEEEEEEEeccEE-EcccceEEEeecC Q lcl|NC_016164. 810 SVRVTALQDVDVAV-RHPEAFCRGNDNL 836 (836) Q Consensus 810 ~~~~r~~~r~d~~v-~~p~Af~~l~~A~ 836 (836) .+. +..-.+.+- ...+.-+..+.++ T Consensus 292 ~vs--aT~~~~~~g~~~~~~~a~y~Y~v 317 (468) T protein:vir:63 292 KVT--ATQEAGKKGQFRAEDLAAHEYKV 317 (468) T ss_pred ccc--eeeecccCCcccCCCcceEEEEE Confidence 111 111111110 0111111122222 No 215 >protein:vir:80835 Length: 464 # NCBI annotation: putative major capsid protein # Family: family:all:2450 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504125;genbank:gi:158079312;genbank:GeneID:5666484 Probab=86.55 E-value=0.045 Score=27.98 Aligned_cols=297 Identities=13% Similarity=0.096 Sum_probs=118.0 Q ss_pred hhhhHHHHHHHhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhcccccccccccchhhH Q lcl|NC_016164. 499 LTDKEARSFSFVRAIRAQMMPGDRAAFEAAAFEREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTASAAGDLVFTDGRP 578 (836) Q Consensus 499 ~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~ 578 (836) ++. ..+.... ......+.++.+- .|. ..+..+...++.+--+.+. T Consensus 1 ~~~--------------------~~n~~~~--~~~~~e~~~Ks~t---tgy----------~~~p~~q~~~~AlRrEsL~ 45 (464) T protein:vir:80 1 MTE--------------------KKNTERQ--LTSVQEEVIKGFT---TGY----------GITPESQTDAAALRREFLD 45 (464) T ss_pred CCc--------------------chhhHhh--cCcccHHHHHHHH---hCC----------ccCcccccCcchhhhhhhh Confidence 000 0000000 0000011111110 000 0111112223333222222 Q ss_pred HHHHHHHHh---hhhhhhhcceeeecCCceEE-EEEecCCceeeeeccCcccccccccceeEEeeeeeeee--eehhHHH Q lcl|NC_016164. 579 GSFIELLRN---RLALNTLGVTMLTGLQGPVA-IPRQTGAATAYWVAEGGDPTESQPSVDQVALVAKTLGA--YTEFSRR 652 (836) Q Consensus 579 ~~ii~~l~~---~~~l~~l~~~~~~~~~~~~~-~p~~~~~~~a~~v~Eg~~~~~~~~~~~~it~~~~t~~~--~i~ISre 652 (836) ..+..+-.. ..-+..+..+.....-..+. +.......-..++.|++..+.+++++.+.+..++-++. .+.|-.. T Consensus 46 ~~i~~Lt~~~~~f~f~~di~k~~a~STV~~y~~~~~~G~~g~~~f~~E~g~~~~~d~~~~Rr~~~~Kfl~~~r~vsia~~ 125 (464) T protein:vir:80 46 DQITMLTWADGDLSFYRDITKRPATSTVAKYDVYLAHGRVGHTRFTREIGVAPISDPNLRQKTVNMKYVSDTKNMSIATG 125 (464) T ss_pred hhhheeeecccchhhhhhcCCchhhhhhhhhheeeccCccccccccccccccccCCCceEEEEEEeeeeecceeeeeehh Confidence 222111110 11111221111111111111 11223345678899999999999999999988885553 3333333 Q ss_pred HHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCC---c------ccccccccc-cccccccccccchhHHHHHHHHHHH Q lcl|NC_016164. 653 LMLQSSIDVEQMVRTELATVIALEIDRAALYGLGS---N------SQPEGLKFV-TGINTENFGATNPTYVELVSMESKV 722 (836) Q Consensus 653 lL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt---~------~~p~Gi~~~-~~~~~~t~aa~~~t~~~l~~a~~~l 722 (836) |.++..+......+.-...++..++.+.|+|+-. + -+..||.+- +.-+...+-+..++.+.|..+-..+ T Consensus 126 -lvn~~~d~~~~~~~dai~~va~tiE~a~FyGds~l~~~~~~~~gleFDGl~~lI~~~NViDarG~~Ls~~~ln~Aa~~i 204 (464) T protein:vir:80 126 -LVNNIEDPMRILTDDAISVVAKTIEWASFYGDSDLSENPDAGSGLEFDGLAKLIDKHNVLDAKGASLTEALLNQASVLV 204 (464) T ss_pred -hhcchhhHHHHHHHHHHHHHHHHHHHHHhhhccccCCCCCCccccchhhhHhhcCCCceeecCCCCcCHHHHhhhhhhh Confidence 3455556677777777788899999999998632 1 144565432 3445666667778877777766666 Q ss_pred hhhccccCccEEEecHHHHHHH-HHHhhccCcccccc-CCCCeecceeeE--eeCccc---cceEEEEehhceEEEee-- Q lcl|NC_016164. 723 AADNADIGAMSYLTNSTLYGGF-KTTEKATSTAQFVL-EPGGTVNGYNVV--RSNQVA---NGDVFFGVWNQMIMGMW-- 793 (836) Q Consensus 723 ~~~~~~~~~~~~vmnp~~~~~L-~~lkd~~g~~~~~~-~~~~~l~G~pVv--~s~~~~---~~~i~~gD~s~~~i~~~-- 793 (836) ..+++ .+.-++|+..+.+.+ ..... .+..++. ...+...|++|- .+..-. .+..++.++. +....+ T Consensus 205 ~~~fG--t~TD~~lp~~v~a~f~n~~l~--~q~~~~~~n~~~~~~G~~v~~f~sa~G~i~L~~s~~m~~~~-~ld~~~~~ 279 (464) T protein:vir:80 205 GKGYG--TPTDAYMPIGVQADFVNQQLD--RQVQVISDNGQNATMGFNVKGFNSARGFIRLHGSTVMELEQ-ILDENRMQ 279 (464) T ss_pred hcccC--ChhhcccchhHHHHHHhhhcC--ceeEEEcCCCCcceeeeecccccccccceeccCccccCccc-cccccccc Confidence 55443 345677877777554 33332 2222222 222234565541 111100 0111111111 000000 Q ss_pred -------cceEEEEecccccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 794 -------GALDIQVNPYALDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 794 -------~~l~i~~~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) ..+...+++...-.-+.-...+...+-+.+.+..+=......+ T Consensus 280 ~~~apaapsvt~tv~~~~~g~f~~~~~~~~~~Ykv~~vn~~GeS~ps~~~ 329 (464) T protein:vir:80 280 LPNAPQKATVKATLEAGTKGKFRDEDLTIDTEYKVVVVSDDAESAPSDVA 329 (464) T ss_pred CCCCcCCceeEEEecCCcccCCccccccceeEEEEEEECCCCccccceee Confidence 0011111111000000001111111222222222111111111 No 216 >protein:vir:79157 Length: 339 # NCBI annotation: P2 family phage major capsid protein # Family: family:all:201 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165257;genbank:gi:145708082;genbank:GeneID:5247168 Probab=86.34 E-value=0.046 Score=27.90 Aligned_cols=290 Identities=9% Similarity=0.024 Sum_probs=128.7 Q ss_pred hhhhhhhhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeee-c Q lcl|NC_016164. 523 AAFEAAAFEREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLT-G 601 (836) Q Consensus 523 ~~~~~~~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~-~ 601 (836) ...........+....++..|. ...+..+.+.+.....+...+.+.+.+.+....+.. - T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv--------------------~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e 60 (339) T protein:vir:79 1 MRNDTRRLFAAYKAAIAKLNGV--------------------ERVDEKFSVAPSVQQKLETKVQESSDFLKSINFYGVPE 60 (339) T ss_pred CChHHHHHHHHHHHHHHHHhCc--------------------ccccceeeecHHHHHHHHHHHHHHHHHhccCccccccc Confidence 0000000111111111111111 011112223333555677778888888776554322 2 Q ss_pred CCceEEEEEecCCceeeeecc-Cccccccc-ccceeEEeeeeeeeeeehhHHHHHhcc--hhHHHHHHHHHHHHHHHHHH Q lcl|NC_016164. 602 LQGPVAIPRQTGAATAYWVAE-GGDPTESQ-PSVDQVALVAKTLGAYTEFSRRLMLQS--SIDVEQMVRTELATVIALEI 677 (836) Q Consensus 602 ~~~~~~~p~~~~~~~a~~v~E-g~~~~~~~-~~~~~it~~~~t~~~~i~ISrelL~ds--~~~l~~~i~~~l~~a~a~~~ 677 (836) ..|..-. ...+++-+.-+.- +.+....+ .+++.-.|.....---+.|+.+.|... .+++...+...+.++++.-. T Consensus 61 ~~Ge~v~-lg~~g~iagrtdt~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~ 139 (339) T protein:vir:79 61 QEGEKIG-LGVSGPVASTTDTTQQDRETSDISTMDGRRYRCEQTNSDTHITYQKLDAWAKFADFQTRIRDAIIKRQALDR 139 (339) T ss_pred ceeeEEe-eccCcceeecccCCCCCcccccccccCCCccEEEEeeeeceecHHHHHHHhcChhHHHHHHHHHHHHHhhcc Confidence 2222221 1222222322211 11222222 344555566655555556666666532 34677777777777766554 Q ss_pred HHHHHhhcCCc------ccc------cccc------------ccc--ccccccccccchhHHHHHHHHHH-----Hhhhc Q lcl|NC_016164. 678 DRAALYGLGSN------SQP------EGLK------------FVT--GINTENFGATNPTYVELVSMESK-----VAADN 726 (836) Q Consensus 678 d~~il~G~Gt~------~~p------~Gi~------------~~~--~~~~~t~aa~~~t~~~l~~a~~~-----l~~~~ 726 (836) =...++|.-.. ..| +|.+ +.. ....+...+..-+|.+|-++... +...+ T Consensus 140 i~IGfNGts~A~~Td~~~nPllqDVN~GWlQ~~Re~ap~rV~~~g~~~s~~i~~~G~ggdy~NLDalV~d~~~~lId~~~ 219 (339) T protein:vir:79 140 IMIGFNGVSRAATSDRVANPMLQDVNKGWLQNLREQAPQRVMKEGKAAAGKITVGGAGADYGNLDALVYDITNHLVEPWY 219 (339) T ss_pred ceecccceeeecCCChhhCcCccccchhHHHHHHhhhhhhhhccceeccceeEeccCCCCcccHHHHHHHHHhccCChHH Confidence 45556775321 122 2321 100 00111111112245555544443 34455 Q ss_pred cccCccEEEecHHHHHH-HHHHhhccCccc--c---ccCCCCeecceeeEeeCccccceEEEEehhceEEEeecc-eEEE Q lcl|NC_016164. 727 ADIGAMSYLTNSTLYGG-FKTTEKATSTAQ--F---VLEPGGTVNGYNVVRSNQVANGDVFFGVWNQMIMGMWGA-LDIQ 799 (836) Q Consensus 727 ~~~~~~~~vmnp~~~~~-L~~lkd~~g~~~--~---~~~~~~~l~G~pVv~s~~~~~~~i~~gD~s~~~i~~~~~-l~i~ 799 (836) .+.+..+.++....+.. -..+-.....+. . +.....++-|+|.+..+.+|.+.+++--|+++.|....| ..-. T Consensus 220 ~~d~dLVvivG~dLla~k~~~l~n~~~~ptE~~Aa~~i~s~k~iGGl~a~~~PfFP~~~llVT~L~NLsIY~Q~gs~RR~ 299 (339) T protein:vir:79 220 AEDPDLVVVCGRNLLSDKYFPLVNRDRDPVQQIAADLIISQKRIGNLPAIRVPYFPANGLLVTRLDNLSIYYQEGGRRRT 299 (339) T ss_pred hcCCCEEEEEchhhhhhHhhhHhhcCCChHHHHHHHHHHHhhhhCCceeEEccccCCCceEEeechhcEEEEecCcEEEE Confidence 66666788888877642 112222222221 0 111235789999999999999999999999887765443 2211 Q ss_pred E--ecc----cccccCcEEEEEEEEeccEEEcccceEEEeec Q lcl|NC_016164. 800 V--NPY----ALDKSGSVRVTALQDVDVAVRHPEAFCRGNDN 835 (836) Q Consensus 800 ~--~~~----~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A 835 (836) . .+. ..+..-...|.++..--++.+. -+.+...| T Consensus 300 ~~d~p~r~rie~y~s~Ne~YvVEd~~~~a~iE--ni~~~~aa 339 (339) T protein:vir:79 300 ILDNAKRDRIENYESSNDAYVIEDLACAAMAE--NIALAAAA 339 (339) T ss_pred EEeccccccccchhhccceeeeeccccEEEee--eeecccCC Confidence 1 121 1122222233333333333332 23333333 No 217 >protein:vir:78777 Length: 358 # NCBI annotation: putative major capsid protein # Family: family:all:201 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285647;genbank:gi:148727153;genbank:GeneID:5220125 Probab=84.78 E-value=0.058 Score=27.37 Aligned_cols=295 Identities=11% Similarity=0.073 Sum_probs=128.8 Q ss_pred hhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhh Q lcl|NC_016164. 513 IRAQMMPGDRAAFEAAAFEREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALN 592 (836) Q Consensus 513 ~~a~~~~~~~~~~~~~~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~ 592 (836) +... ...........+....++..|... ...+-.+.+.+.....+...+.+.+.+. T Consensus 1 m~~~------M~~~tr~~~~~y~~~~A~~ngv~~------------------~~~~~~Fsv~p~v~q~L~~~i~ess~FL 56 (358) T protein:vir:78 1 MSQT------LTVQAEQRLNKYCDALAKAYGIDI------------------SKLDKQFSVTGPVETTLRSALLASVEFL 56 (358) T ss_pred Cccc------ccHHHHHHHHHHHHHHHHHhCCCh------------------hHccceeeeChHHHHHHHHHHHHHHHHh Confidence 0000 000000001111111111111100 0011122233335556777788888887 Q ss_pred hhcceeee-cCCceEEEEEecCCceeeeeccCcccccccccceeEEeeeeeeeeeehhHHHHHhcch-----hHHHHHHH Q lcl|NC_016164. 593 TLGVTMLT-GLQGPVAIPRQTGAATAYWVAEGGDPTESQPSVDQVALVAKTLGAYTEFSRRLMLQSS-----IDVEQMVR 666 (836) Q Consensus 593 ~l~~~~~~-~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~it~~~~t~~~~i~ISrelL~ds~-----~~l~~~i~ 666 (836) +....+.. -..|.. +....+++-+.-..-+ .+......+.-.|.....---+.|+.+.|...+ .++...+. T Consensus 57 ~~INvv~V~e~~Ge~-v~lg~~g~iagrt~tr--~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~f~~~~dF~~r~~ 133 (358) T protein:vir:78 57 GLITCLDVDQIKGQV-VQVGVGQLYTGRKKGG--RFKGKVGVDGNTYELTETDSCASLDWATLCTWANAGSEGEFIKLVG 133 (358) T ss_pred hcCcccccccceeeE-EeecCCcccceecCCC--ccccccccCCCccEEEEeceeeeccHHHHHHHHhCCChhHHHHHHH Confidence 76554322 222322 2222223333332221 222333445555555555555666777665432 25777777 Q ss_pred HHHHHHHHHHHHHHHHhhcCCc------ccc------ccccc-----ccc--------ccccccc-ccchhHHHHHHHHH Q lcl|NC_016164. 667 TELATVIALEIDRAALYGLGSN------SQP------EGLKF-----VTG--------INTENFG-ATNPTYVELVSMES 720 (836) Q Consensus 667 ~~l~~a~a~~~d~~il~G~Gt~------~~p------~Gi~~-----~~~--------~~~~t~a-a~~~t~~~l~~a~~ 720 (836) ..+.++++.-.-...++|+-.. ..| +|.+. .++ ...+... ++.-+|.+|-.+.. T Consensus 134 ~~i~~~~ALD~i~IGfNGts~A~~Td~~~nPllqDVN~GWlQ~~Re~a~~~v~~~~~~~~~i~ig~g~~Gdy~NLDalV~ 213 (358) T protein:vir:78 134 EFVNKAFALDMLRVGWNGVSAADDTDPTANPLGQDVNKGWHQLAREWKGGSQIIKAAAGEKIYFDPDGKGEYKTLDEMAS 213 (358) T ss_pred HHHHHHHhhccceecccceeeccCCChhhCcCccccchHHHHHHHhhchhhhhccccccCceeecCCCCCccccHHHHHH Confidence 7777777665555567775321 122 22211 000 0001011 11124555544444 Q ss_pred H-----HhhhccccCccEEEecHHHHHH-HHHHhhccCcccc-cc--CCCCeecceeeEeeCccccceEEEEehhceEEE Q lcl|NC_016164. 721 K-----VAADNADIGAMSYLTNSTLYGG-FKTTEKATSTAQF-VL--EPGGTVNGYNVVRSNQVANGDVFFGVWNQMIMG 791 (836) Q Consensus 721 ~-----l~~~~~~~~~~~~vmnp~~~~~-L~~lkd~~g~~~~-~~--~~~~~l~G~pVv~s~~~~~~~i~~gD~s~~~i~ 791 (836) . |...+.+.+..+.++..+.+.. --.+-...+.+.= +. .-..++-|+|.+..+++|.+.+++--|+++.|. T Consensus 214 D~~~~lI~~~~~~d~dLVvivG~dLla~k~~~l~n~~~~pTE~~Aa~~i~k~iGGlpa~~~PfFP~~~ilVT~L~NLsIY 293 (358) T protein:vir:78 214 DLINTTIDPLFQQDPRLVVLVGTDLVAAAQAKLYSEATKPSEQIAAQQLAKSIAGRKAYIPPFFPGKRMVVTTLDNLHCY 293 (358) T ss_pred HHHhccCChHHhcCCCEEEEEchhhhhHHhhhHhhcCCCcHHHHHHHHHHHHhCCCeEEEccccCCCceEEeeccccEEE Confidence 3 3445566666788888887642 1222222222210 00 011578999999999999999999999888776 Q ss_pred eecc-eEEEEecccccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 792 MWGA-LDIQVNPYALDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 792 ~~~~-l~i~~~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) ...| ..-...+. -+++.+.-.-..--+..|=++..++.+.+.- T Consensus 294 ~Q~gs~RR~~~d~--p~r~riE~y~s~Ne~YvVEd~~~~a~iE~i~ 337 (358) T protein:vir:78 294 TQRGTRKRKADDN--QDSKSFDNQYWRMEGYALGEHKAYGGFEEAD 337 (358) T ss_pred EecCcEEEEEEec--cccccccchhhhcceeeeeccccEEEEeeee Confidence 5443 22111111 1112222111222233334444444433221 No 218 >protein:vir:100331 Length: 342 # NCBI annotation: major capsid protein N # Family: family:all:201 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655472;genbank:gi:109289940;genbank:GeneID:4157374 Probab=84.08 E-value=0.063 Score=27.16 Aligned_cols=294 Identities=10% Similarity=0.060 Sum_probs=129.0 Q ss_pred hhhhhhhhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeec- Q lcl|NC_016164. 523 AAFEAAAFEREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTG- 601 (836) Q Consensus 523 ~~~~~~~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~- 601 (836) ...........+....++..|.... ....+-.+.+.+.....+...+.+.+.+.+....+... T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~----------------~~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e 64 (342) T protein:vir:10 1 MKDLTLEKYNAYLARQAELNNLPFN----------------ALATGIKFTVQPSVQQKLYEKVRESSDFLKSISFVFVDE 64 (342) T ss_pred CChHHHHHHHHHHHHHHHHhCCChh----------------HccccceeecChHHHHHHHHHHHHHHHHhccCccccccc Confidence 0000000011111111111111100 00111112233335556777788888887765543222 Q ss_pred CCceEEEEEecCCceeeeeccC--ccc-ccccccceeEEeeeeeeeeeehhHHHHHhcc--hhHHHHHHHHHHHHHHHHH Q lcl|NC_016164. 602 LQGPVAIPRQTGAATAYWVAEG--GDP-TESQPSVDQVALVAKTLGAYTEFSRRLMLQS--SIDVEQMVRTELATVIALE 676 (836) Q Consensus 602 ~~~~~~~p~~~~~~~a~~v~Eg--~~~-~~~~~~~~~it~~~~t~~~~i~ISrelL~ds--~~~l~~~i~~~l~~a~a~~ 676 (836) ..|.. +-...+++-+.-+.-. .+. +..-..++.-.|.....---+.|+.+.|... .+++...+...+.++++.- T Consensus 65 ~~Ge~-i~lg~~g~iagrtdT~~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD 143 (342) T protein:vir:10 65 QTGET-LGLDSAHTVASTTDTSGDGERKTTSIAKLVKQTYHCQQINFDTHINYKQLDMWAKFPDFQQKVANVAAKQRKRD 143 (342) T ss_pred ceeeE-EecccCcccccccccCCCCCcccccccccCCCccEEEEeeecccccHHHHHHHhcChhHHHHHHHHHHHHHhhc Confidence 22222 2122223333332211 122 2222345555666666555566666666542 3467777777777776655 Q ss_pred HHHHHHhhcCCc------ccc------cccccc----c-------c--cccccccccchhHHHHHHHHH----H-Hhhhc Q lcl|NC_016164. 677 IDRAALYGLGSN------SQP------EGLKFV----T-------G--INTENFGATNPTYVELVSMES----K-VAADN 726 (836) Q Consensus 677 ~d~~il~G~Gt~------~~p------~Gi~~~----~-------~--~~~~t~aa~~~t~~~l~~a~~----~-l~~~~ 726 (836) .=...++|.-.. ..| +|.+.. + + ...+..+. .-+|.+|-++.. . +...+ T Consensus 144 ~i~IGfNGts~A~~Td~~~nPllqDVN~GWlQ~~Re~ap~rv~~~~~~~~~i~iG~-~gdy~NLDalV~D~~~~lI~~~~ 222 (342) T protein:vir:10 144 LIMIGFNGTSRAATSDRNSNPLLQDVAKGWLQKMREDAKERVMNGESTDNQVLVGK-GQEYANLDALVMDATEELIDEWH 222 (342) T ss_pred cceecccceeeccCCChhhCcCccccchHHHHHHHhhhhhhhcccceeccceeecC-CCCcccHHHHHHHHHhccCChHH Confidence 445556775321 122 232210 0 0 01111111 124554444443 3 34555 Q ss_pred cccCccEEEecHHHHHH-HHHHhhccCccc--c---ccCCCCeecceeeEeeCccccceEEEEehhceEEEeecc-eEEE Q lcl|NC_016164. 727 ADIGAMSYLTNSTLYGG-FKTTEKATSTAQ--F---VLEPGGTVNGYNVVRSNQVANGDVFFGVWNQMIMGMWGA-LDIQ 799 (836) Q Consensus 727 ~~~~~~~~vmnp~~~~~-L~~lkd~~g~~~--~---~~~~~~~l~G~pVv~s~~~~~~~i~~gD~s~~~i~~~~~-l~i~ 799 (836) .+.+..+.++....+.. -..+....+.+. . +.....++-|+|.+..+.+|.+.+++--|+++.|....| ..-. T Consensus 223 ~~d~dLVvivG~dLladk~~~l~n~~~~ptE~~Aa~~i~s~k~iGGl~a~~~PfFP~~~ilVT~L~NLsIY~Q~gs~RR~ 302 (342) T protein:vir:10 223 RDDTDLVVITGRKLLADKYFPIVNQQNAPTEELAADIVISQKRIGGLKAVRVPFFPANAILITKLENLAIYVQEGTTRKH 302 (342) T ss_pred hcCCCEEEEEchhhhHHHHHHHHhcCCChHHHHHHHHHHhhhhhcCceeEEccccCCCceEEeeccccEEEEecCcEEEE Confidence 66667788888887652 112222222221 0 111235789999999999999999999898877755443 2211 Q ss_pred EecccccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 800 VNPYALDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 800 ~~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) ..+. -+++++.-.-..--+..|-++.+++.+.+-- T Consensus 303 ~~d~--p~r~rie~y~s~Ne~YvVEd~~~~a~iE~i~ 337 (342) T protein:vir:10 303 IENV--PKKDRIETYESENIDYVVEDYGCAALIENIT 337 (342) T ss_pred EEec--cccccccchhhhccceeeeccccEEEeecce Confidence 1111 1112222111122233333444444433211 No 219 >protein:vir:80491 Length: 467 # NCBI annotation: Cps # Family: family:all:2450 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468466;genbank:gi:157325041;genbank:GeneID:5601449 Probab=84.07 E-value=0.063 Score=27.15 Aligned_cols=291 Identities=11% Similarity=0.067 Sum_probs=119.6 Q ss_pred hhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhh---hh Q lcl|NC_016164. 514 RAQMMPGDRAAFEAAAFEREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNR---LA 590 (836) Q Consensus 514 ~a~~~~~~~~~~~~~~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~---~~ 590 (836) .+..+..+-.............+++....+ .+..+...|+.+--+.+...+..+.... .- T Consensus 1 ~~~~~~~~~~~~n~~~~~e~~~Ks~~agy~-----------------~~p~tq~~~~AlR~EsL~~~i~~Lt~~~~~f~~ 63 (467) T protein:vir:80 1 MPKNNKEEVKEVNLNSVQEDALKSFTTGYG-----------------ITPDTQTDAGALRREFLDDQISMLTWTENDLTF 63 (467) T ss_pred CCCcchhhhhhcccccCHHHHHHHHHcccc-----------------cCCccccCcchhhhhhhhhhhheeeccccchhh Confidence 000000000000000001111111110000 0111112222222222222221111100 01 Q ss_pred hhhhcceeeecCCceEE-EEEecCCceeeeeccCcccccccccceeEEeeeeeeeeeehhHHHH-HhcchhHHHHHHHHH Q lcl|NC_016164. 591 LNTLGVTMLTGLQGPVA-IPRQTGAATAYWVAEGGDPTESQPSVDQVALVAKTLGAYTEFSRRL-MLQSSIDVEQMVRTE 668 (836) Q Consensus 591 l~~l~~~~~~~~~~~~~-~p~~~~~~~a~~v~Eg~~~~~~~~~~~~it~~~~t~~~~i~ISrel-L~ds~~~l~~~i~~~ 668 (836) +..+..+.....-..+. +.......-..++.|++..+.+++++.+.+..++=++....+|..+ +.++..+......+. T Consensus 64 ~~di~k~~a~stv~~y~~~~~~G~~g~~~f~~E~g~~~~~~~~~~r~~~~~k~l~~~~~vs~~~~l~n~i~d~~~~~~~~ 143 (467) T protein:vir:80 64 YKDIAKKPATSTVAKYDVYMQHGKVGHTRFTREIGVAPVSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDD 143 (467) T ss_pred hhhcccchhhhhhhhheeeeccCccccccccccccccccCCCceEEEEEEeeeeeeeeeehhhhhhhcchhhHHHHHHHH Confidence 11111111111111111 1123334667889999999999999999999999999866666543 456666777888888 Q ss_pred HHHHHHHHHHHHHHhhcCC----cc-----cccccccc-cccccccccccchhHHHHHHHHHHHhhhccccCccEEEecH Q lcl|NC_016164. 669 LATVIALEIDRAALYGLGS----NS-----QPEGLKFV-TGINTENFGATNPTYVELVSMESKVAADNADIGAMSYLTNS 738 (836) Q Consensus 669 l~~a~a~~~d~~il~G~Gt----~~-----~p~Gi~~~-~~~~~~t~aa~~~t~~~l~~a~~~l~~~~~~~~~~~~vmnp 738 (836) -...++..++.+.|+|+-. ++ +..||+.. +.-+..+.-+..++.+.|..+...+...++ ...-++|+. T Consensus 144 ai~~~a~tiE~a~FyGds~l~~s~~~~~glqfDGi~~li~~enviDa~G~~ls~~~lneaa~~i~~gfG--~~td~~~p~ 221 (467) T protein:vir:80 144 AIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQDNVHDARGASLTESLLNQAAVMISKGYG--TPTDAYMPV 221 (467) T ss_pred HHHHHHHHHHHHhhhcccccccCCCccccccccceeEEecCCceeccCCCccCHHHHHHHhhhcccccc--Chhhhhcch Confidence 8899999999999998742 12 34555532 233445555556676666666554443332 344567777 Q ss_pred HHHHHHHH-HhhccCcccc-ccCCCCeecceeeEeeCccc-cc------eEEEEehhceEEEeecceEEEEecccccccC Q lcl|NC_016164. 739 TLYGGFKT-TEKATSTAQF-VLEPGGTVNGYNVVRSNQVA-NG------DVFFGVWNQMIMGMWGALDIQVNPYALDKSG 809 (836) Q Consensus 739 ~~~~~L~~-lkd~~g~~~~-~~~~~~~l~G~pVv~s~~~~-~~------~i~~gD~s~~~i~~~~~l~i~~~~~~~~~~~ 809 (836) .+.+.|.. ... .+..+ ......-..|+||- -.+. -| ..+++|..-.. -...+.... ..-. T Consensus 222 ~v~a~~~~~~L~--~q~~v~~~n~~~~~~G~~v~--g~~sa~G~I~l~gs~il~~~~~l~-~~~~~~~~A------psp~ 290 (467) T protein:vir:80 222 GVQADFVNQQLS--KQTQLVRDNGNNVSVGFNIQ--GFHSARGFIKLHGSTVMENEQILD-ERILALPTA------PQPA 290 (467) T ss_pred hHHhhhhhhhcC--ceEEEEcCCCCceeeeeccc--ceecceeeeeecCceeeccccCCC-ccccccccc------ccCC Confidence 77665522 211 11222 22223345566661 1111 11 22333322110 000000000 0011 Q ss_pred cEEEEEEEEeccEE-EcccceEEEeecC Q lcl|NC_016164. 810 SVRVTALQDVDVAV-RHPEAFCRGNDNL 836 (836) Q Consensus 810 ~~~~r~~~r~d~~v-~~p~Af~~l~~A~ 836 (836) .+. +..-.+.+- ...+.-+..+.++ T Consensus 291 ~vs--aT~~~~~~g~~~~~~~a~y~Y~v 316 (467) T protein:vir:80 291 KVT--ATQEAGKKGQFRAEDLAAHEYKV 316 (467) T ss_pred ccc--eeeecccCCcccCCCcceEEEEE Confidence 111 111111110 0111111122222 No 220 >protein:vir:93966 Length: 400 # NCBI annotation: structural protein # Family: family:all:2417 # MgeID: mge:1487 # MgeName: jj50 # Cross-refs: genbank:acc:YP_764320;genbank:gi:115315634;genbank:GeneID:5176553 Probab=83.54 E-value=0.068 Score=27.00 Aligned_cols=380 Identities=12% Similarity=0.039 Sum_probs=118.8 Q ss_pred hhhhhhhhhhhhhhhh-hhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhHHHHHHHHhhhhhhHHHHhhhhhhh Q lcl|NC_016164. 420 AQAAADERSRVASITS-LCREHKADDLAQGLIESGASEADAMRSVLSEIAKRPAAQPATPAAPVRSAQPIAAGGGSADIG 498 (836) Q Consensus 420 ~~~~~~~~~~~~ei~a-l~~~~~l~e~a~eliee~~t~~e~~~~~l~~l~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~ 498 (836) .... ........+.+ +.+...+.+....+.... +..+.... ++.+.. ..... ........... T Consensus 1 mriS-~~~~~K~~l~EK~~~~a~~~E~~~~LKS~~-~G~evkna-iedl~K----~~EL~--------~TlS~~~iEI~- 64 (400) T protein:vir:93 1 MRIS-KRNMNKPDLIEKQNRLAELKENNVSLKSQI-SGFEVKNA-IEDLPK----VQELE--------KTLSENSIEII- 64 (400) T ss_pred Cccc-ccccccchHHHHHHHHhhhhhhhhhhhhhh-hcchhhhh-hhhchh----HHHHH--------HhHhhcchhhh- Confidence 0000 00000000000 000011111000000000 00000000 000000 00000 00000000000 Q ss_pred hhhhHHHHHHHhhhhhhhhhhhhhhhhh-hhhhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhh-hhcccccccccccchh Q lcl|NC_016164. 499 LTDKEARSFSFVRAIRAQMMPGDRAAFE-AAAFEREVSEATAQRMGVTPRGILAPNDVLHRDL-VVDTASAAGDLVFTDG 576 (836) Q Consensus 499 ~~~~~~~~~~~~~a~~a~~~~~~~~~~~-~~~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~-~~~~~~~~g~~vvp~~ 576 (836) .....+.+......+.... ..-...+.+.++.+-+. ...|............ -.+.+.+.-...+|.- T Consensus 65 ---------~~en~LNa~~E~~KGK~kMt~~i~sq~A~~eF~~vL~-~N~G~S~~k~AW~A~L~E~GVtiTD~~~~LP~~ 134 (400) T protein:vir:93 65 ---------KIENELNAQEEKPKGKDKMTNFIESQNAVTEFFDVLK-KNSGKSEIKNAWSAKLAENGVTITDTTFQLPRK 134 (400) T ss_pred ---------hhhhhhhhhhhhhhhhHHHHHHHhhHHHHHHHHHHHh-ccCCchhhhhhhhhhHhhcCcceeccchhccHH Confidence 0000000000000000000 00000111122221111 1111111111111111 1112222333455655 Q ss_pred hHHHHHHHHHhhhhhhhhcceeeecCCceEEEEE-ecCCceeeeeccCcccccccccceeEEeeeeeeeeeehhHH--HH Q lcl|NC_016164. 577 RPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPR-QTGAATAYWVAEGGDPTESQPSVDQVALVAKTLGAYTEFSR--RL 653 (836) Q Consensus 577 ~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~-~~~~~~a~~v~Eg~~~~~~~~~~~~it~~~~t~~~~i~ISr--el 653 (836) +.-.|...+....++.+..- ++.. +.+-+.+ ..+...+.....|..+++...+|.--++.+..+.....+-. +- T Consensus 135 lv~sI~~A~~n~n~v~~vfH--VT~~-~~~~V~~s~~s~~~Aq~HkdGqTK~eqa~~~~~~Tl~~~~VY~~~S~Ae~~K~ 211 (400) T protein:vir:93 135 LVESINTALLNTNPVFKVFH--VTNV-GALLVSRSFDSANEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKR 211 (400) T ss_pred HHHHHHHhhhccCcceeeee--eccc-hhhhHHhhhhhhhhhhhhccCCccccceeeeeeechhHHHHHHHHHHHHHHHH Confidence 55555555555555544211 1111 1111111 11122344445666777777777666666655555444422 11 Q ss_pred HhcchhHHHHHHHHHHHHHHH-HHHHHHHHhhcCCccc-c-ccc---cccccccccccccc-chhHHHHHHHHHHHhhhc Q lcl|NC_016164. 654 MLQSSIDVEQMVRTELATVIA-LEIDRAALYGLGSNSQ-P-EGL---KFVTGINTENFGAT-NPTYVELVSMESKVAADN 726 (836) Q Consensus 654 L~ds~~~l~~~i~~~l~~a~a-~~~d~~il~G~Gt~~~-p-~Gi---~~~~~~~~~t~aa~-~~t~~~l~~a~~~l~~~~ 726 (836) +..+...++.+|..+|..++. +..+.+++-|+|+|+- . .-+ .......+.+.+++ .+-.+.|-.+..-+++.- T Consensus 212 ~~~sYsel~N~i~~ELtQ~~vnk~Vd~AlV~GDG~N~f~~~DK~advK~I~~~Ttkaksagktpfadaieeavdfvrpta 291 (400) T protein:vir:93 212 LQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTA 291 (400) T ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHHhhhheecCCCCccchhhHHHHHHHHHHhhhhhhcCCCchhHHHHHHHhhhccCC Confidence 333445679999999999998 8899999999998751 1 111 11111222222222 233344555555444322 Q ss_pred cccCccEEEecHHH-HHHHHHHhhccCcccc-ccCCCCee---ccee-eEe-eCccccceEEEEehhceEEEeecceEEE Q lcl|NC_016164. 727 ADIGAMSYLTNSTL-YGGFKTTEKATSTAQF-VLEPGGTV---NGYN-VVR-SNQVANGDVFFGVWNQMIMGMWGALDIQ 799 (836) Q Consensus 727 ~~~~~~~~vmnp~~-~~~L~~lkd~~g~~~~-~~~~~~~l---~G~p-Vv~-s~~~~~~~i~~gD~s~~~i~~~~~l~i~ 799 (836) +...+++.... ...|..++.+..+... +-.....+ .|+. +++ +..-+-..-++.|.+ |++-+. + +. T Consensus 292 ---grrylivktedrkalldelrqatanahvriknddaeiasevgvdeiivytgskalkptvlvdqk-yhidmq-d--lt 364 (400) T protein:vir:93 292 ---GRRYLIVKTEDRKALLDELRQATANAHVRIKNDDAEIASEVGVDEIIVYTGSKALKPTVLVDQK-YHIDMQ-D--LT 364 (400) T ss_pred ---CceEEEEeccchHHHHHHHHhhccccceEeecchhhhhhhcCcceeeeeeccccccceeeeccc-cccchh-h--hh Confidence 12234444333 4445566655443322 11111111 1322 111 111111122233322 222111 0 00 Q ss_pred EecccccccCcEEEEEEEEeccEEEcccceEEEeec Q lcl|NC_016164. 800 VNPYALDKSGSVRVTALQDVDVAVRHPEAFCRGNDN 835 (836) Q Consensus 800 ~~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A 835 (836) .-+-..|.+|.-.+.++.--.+-+---.|=++++.. T Consensus 365 kvdafewktnsnmilvetltsghvetynagavitvs 400 (400) T protein:vir:93 365 KVDAFEWKTNSNMILVETLTSGHVETYNAGAVITVS 400 (400) T ss_pred hhhhheeccCCceEEEeecccCcceeeccceeEeeC Confidence 011112233332233333222222222222222222 No 221 >protein:vir:3746 Length: 336 # NCBI annotation: orf15 # Family: family:all:201 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043487;genbank:gi:9628622;genbank:GeneID:1261135 Probab=82.78 E-value=0.074 Score=26.78 Aligned_cols=290 Identities=9% Similarity=0.035 Sum_probs=122.6 Q ss_pred hhhhhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeec-CCc Q lcl|NC_016164. 526 EAAAFEREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTG-LQG 604 (836) Q Consensus 526 ~~~~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~-~~~ 604 (836) ........+....++..|.... ....+..+.+.+.....+...+.+.+.+.+....+... ..| T Consensus 1 mtr~~~~~y~~~~A~~ngv~~a----------------~~~~~~~Fsv~P~v~q~L~~~i~ess~FL~~INvv~V~e~~G 64 (336) T protein:vir:37 1 MNKQAYYALAAALAKHFNQPLD----------------SVLRGESFALKAPEAALLGENIQQRSDFLKQINMIQVAHTKG 64 (336) T ss_pred CcHHHHHHHHHHHHHHhCCChh----------------hhccCceeecCHHHHHHHHHHHHHHHHHhhcCceeecccccc Confidence 0000011111111111111100 00011122333445566778888888887765543222 222 Q ss_pred eEEEEEecCCceeeeeccCcccccccccceeEEeeeeeeeeeehhHHHHHhcch--hHHHHHH-HHHHHHHHHHHHHHHH Q lcl|NC_016164. 605 PVAIPRQTGAATAYWVAEGGDPTESQPSVDQVALVAKTLGAYTEFSRRLMLQSS--IDVEQMV-RTELATVIALEIDRAA 681 (836) Q Consensus 605 ~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~it~~~~t~~~~i~ISrelL~ds~--~~l~~~i-~~~l~~a~a~~~d~~i 681 (836) ..-. ...+++-+.-..-+ ....++..+.-.|.....---+.|+.+.|...+ ++.+..+ ...+.++++.-.-... T Consensus 65 e~v~-lg~~g~iagrtdt~--R~~~~~~l~~~~Y~c~qTn~dt~i~y~~LD~WA~~~df~~~~~~~~~~r~iALD~i~IG 141 (336) T protein:vir:37 65 QKLF-GATEKGVTGRKQTG--RNLANLDHTQNGFELAETDSGIIVPWALFDSFAIFKDRLVELYSEYFQNQVALDILQIG 141 (336) T ss_pred eEee-eccCcccccccCCC--ccccccCcCCcccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHHHHHHHHhhchhhhc Confidence 2221 11122222222211 111223455555665555555667777765432 3333222 2223333333333344 Q ss_pred HhhcCC---ccccc------cccc----ccccc----------cccccccchhHHHHH----HHHHHHhhhccccCccEE Q lcl|NC_016164. 682 LYGLGS---NSQPE------GLKF----VTGIN----------TENFGATNPTYVELV----SMESKVAADNADIGAMSY 734 (836) Q Consensus 682 l~G~Gt---~~~p~------Gi~~----~~~~~----------~~t~aa~~~t~~~l~----~a~~~l~~~~~~~~~~~~ 734 (836) ++|.-. ...|. |.+. .+... .+...+..-+|.+|- +++..+...+.+.+..+. T Consensus 142 fnG~s~A~~TdnPllqDVNkGWlQ~~Re~a~~~v~~~~~~~~g~i~~~G~~gdy~NLDalV~D~~~~I~~~~~~d~dLVv 221 (336) T protein:vir:37 142 WNGQSVADNTTKADLSDVNKGWLKLLQEQRAANFMTESTKSSGKITIFGDNADYANLDDLAFDLKQGLDFRHQNRNDLVF 221 (336) T ss_pred ccceeeccCCCCCcccccchhHHHHHHhccchhhcccccccCCceEEecCCCCcccHHHHHHHHHhcCchHHhcCCCeEE Confidence 677431 12343 2211 00000 000101112344443 344445555666667788 Q ss_pred EecHHHHHH-HHHHhhccC-ccc--c---ccCCCCeecceeeEeeCccccceEEEEehhceEEEeecce-EEEEeccccc Q lcl|NC_016164. 735 LTNSTLYGG-FKTTEKATS-TAQ--F---VLEPGGTVNGYNVVRSNQVANGDVFFGVWNQMIMGMWGAL-DIQVNPYALD 806 (836) Q Consensus 735 vmnp~~~~~-L~~lkd~~g-~~~--~---~~~~~~~l~G~pVv~s~~~~~~~i~~gD~s~~~i~~~~~l-~i~~~~~~~~ 806 (836) ++..+.+.. ...+-..++ +|. . +.....++-|+|.+..+.+|.+.+++--++++.|....|- .-...+. - T Consensus 222 ivG~dLla~~~~~l~~~~~~~PtE~~Aa~~~~~~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~--p 299 (336) T protein:vir:37 222 LVGADLVSKETKLIQQKHGLTPTEKAALGSHNLMGSFGGMNAITPPNFPARAAAVTTLKNLSVYTEAESVRRSLRND--E 299 (336) T ss_pred EEchhhhhhhhhhhhhhcCCCHHHHHHHHHHHHHHhhCCceeEEccccCCCceEEeechhcEEEEecCcEEEEEEEc--c Confidence 888776532 112222222 221 0 0112357899999999999999999999998877655442 2111111 1 Q ss_pred ccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 807 KSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 807 ~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) +++++.-.-..--+..|-++..++.+.+.= T Consensus 300 ~r~rie~y~s~Ne~YvVEd~~~~a~iE~i~ 329 (336) T protein:vir:37 300 DKKGLVTSYYRQEGYVVEDLGLMTAIDHTK 329 (336) T ss_pred ccccccchhhhcceeeeeccccEEEeeeee Confidence 122222222222233344444444443222 No 222 >protein:vir:2016 Length: 357 # NCBI annotation: gpN # Family: family:all:201 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046760;genbank:gi:9630331;genbank:GeneID:1261541 Probab=82.54 E-value=0.076 Score=26.72 Aligned_cols=292 Identities=10% Similarity=0.044 Sum_probs=126.9 Q ss_pred hhhhhhhhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeee-c Q lcl|NC_016164. 523 AAFEAAAFEREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLT-G 601 (836) Q Consensus 523 ~~~~~~~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~-~ 601 (836) ...........+....++..|.... ..+-.+.|-|. ....+...+.+.+.+.+....+.. - T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~-----------------d~~~~FsV~P~-v~q~L~~~i~ess~FL~~INvv~V~e 62 (357) T protein:vir:20 1 MRQETRFKFNAYLSRVAELNGIDAG-----------------DVSKKFTVEPS-VTQTLMNTMQESSDFLTRINIVPVSE 62 (357) T ss_pred CChHHHHHHHHHHHHHHHHhCCChH-----------------HhcceeecCHH-HHHHHHHHHHHHHHHhccCCcccccc Confidence 0000001111111111111111100 00111223333 455677778888888776554322 2 Q ss_pred CCceEEEEEecCCceeeeec--cCccccccc-ccceeEEeeeeeeeeeehhHHHHHhcc--hhHHHHHHHHHHHHHHHHH Q lcl|NC_016164. 602 LQGPVAIPRQTGAATAYWVA--EGGDPTESQ-PSVDQVALVAKTLGAYTEFSRRLMLQS--SIDVEQMVRTELATVIALE 676 (836) Q Consensus 602 ~~~~~~~p~~~~~~~a~~v~--Eg~~~~~~~-~~~~~it~~~~t~~~~i~ISrelL~ds--~~~l~~~i~~~l~~a~a~~ 676 (836) ..|..-. ...+++-+.-+. -+.+....+ ..++.-.|.....---+.|+.+.|... .+++...+...+.++++.- T Consensus 63 ~~Ge~i~-lg~~g~iagrtdT~~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD 141 (357) T protein:vir:20 63 MKGEKIG-IGVTGSIASTTDTAGGTERQPKDFSKLASNKYECDQINFDFYIRYKTLDLWARYQDFQLRIRNAIIKRQSLD 141 (357) T ss_pred ceeeEEe-cccCccccccccCCCCCCcccccccccCCCccEEEEeeecccccHHHHHHHhcChhHHHHHHHHHHHHHhhc Confidence 2222221 122223333221 111222222 244555566655555556666666532 2466677777777666654 Q ss_pred HHHHHHhhcCCc------ccc------cccc------------cc----ccc--c-cccccccchhHHHHHHHHH----H Q lcl|NC_016164. 677 IDRAALYGLGSN------SQP------EGLK------------FV----TGI--N-TENFGATNPTYVELVSMES----K 721 (836) Q Consensus 677 ~d~~il~G~Gt~------~~p------~Gi~------------~~----~~~--~-~~t~aa~~~t~~~l~~a~~----~ 721 (836) .=...++|.-.. ..| +|.+ +. .+. . .+..+ ..-+|.+|-.+.. . T Consensus 142 ~i~IGfNGts~A~~Td~~~nPllqDVN~GWlQ~~Re~ap~rVm~~~~~~~g~~~~~~i~~G-~~gdy~NLDalV~D~~~~ 220 (357) T protein:vir:20 142 FIMAGFNGVKRAETSDRSSNPMLQDVAVGWLQKYRNEAPARVMSKVTDEEGRTTSEVIRVG-KGGDYASLDALVMDATNN 220 (357) T ss_pred cceecccceeeeccCChhhCcCccccchhHHHHHHhhchhhhhccccccccccccceeeec-CCCCcccHHHHHHHHHhc Confidence 444556775321 122 2322 10 000 0 01111 1124555544443 2 Q ss_pred -HhhhccccCccEEEecHHHHHH-HHHHhhccCccc--c---ccCCCCeecceeeEeeCccccceEEEEehhceEEEeec Q lcl|NC_016164. 722 -VAADNADIGAMSYLTNSTLYGG-FKTTEKATSTAQ--F---VLEPGGTVNGYNVVRSNQVANGDVFFGVWNQMIMGMWG 794 (836) Q Consensus 722 -l~~~~~~~~~~~~vmnp~~~~~-L~~lkd~~g~~~--~---~~~~~~~l~G~pVv~s~~~~~~~i~~gD~s~~~i~~~~ 794 (836) |...+.+.+..+.++....+.. --.+....+.+. . +.....++-|+|.+..+++|.+.+++--|+++.|.... T Consensus 221 lI~~~~~~d~dLVvivG~dLla~k~~~l~n~~~~ptE~~Aa~~i~s~k~iGGl~a~~~PfFP~~~ilVT~L~NLsIY~Q~ 300 (357) T protein:vir:20 221 LIEPWYQEDPDLVVIVGRQLLADKYFPIVNKEQDNSEMLAADVIISQKRIGNLPAVRVPYFPADAMLITKLENLSIYYMD 300 (357) T ss_pred cCChHHhcCCCEEEEEchhhhhhhhhhHhhccCChHHHHHHHHHHHhhhhCCceeEEccccCCCceEEeeccccEEEEec Confidence 3445566667788888877642 122222222221 1 11123578999999999999999999999887776544 Q ss_pred c-eEEEEecccccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 795 A-LDIQVNPYALDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 795 ~-l~i~~~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) | ..-...+. -.++++.-.-..--+..|=++..++.+.+.- T Consensus 301 gs~RR~~~d~--p~r~riE~y~s~Ne~YvVEd~~~~a~iE~i~ 341 (357) T protein:vir:20 301 DSHRRVIEEN--PKLDRVENYESMNIDYVVEDYAAGCLVEKIK 341 (357) T ss_pred CcEEEEEEec--cccccccchhhhcceeeeeccccEEEeeeee Confidence 3 22111111 1112221111112222333333333333211 No 223 >protein:vir:348 Length: 321 # NCBI annotation: major virion structural protein # Family: family:all:3198 # MgeID: mge:9 # MgeName: Mx8 # Cross-refs: genbank:acc:NP_203462;genbank:gi:15320618;genbank:GeneID:921734 Probab=82.13 E-value=0.08 Score=26.61 Aligned_cols=275 Identities=12% Similarity=0.051 Sum_probs=128.4 Q ss_pred hhhhhhhhhhhhhhcccccccccccchh-hHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEecC-Cceeeeec-cCcc Q lcl|NC_016164. 549 ILAPNDVLHRDLVVDTASAAGDLVFTDG-RPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQTG-AATAYWVA-EGGD 625 (836) Q Consensus 549 ~~~~~~~~~~a~~~~~~~~~g~~vvp~~-~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~~-~~~a~~v~-Eg~~ 625 (836) +..+.... ....+ ...-.+.+ +... -.+.++..|+.. +...+.....++..|-... ..++.|.. +..- T Consensus 1 mp~~~lse-l~t~t-l~~rs~~~-~D~v~~~n~LL~~L~~k------G~~~~~~gg~~I~~~l~y~~~s~~~wy~Gyd~l 71 (321) T protein:vir:34 1 MPFPNISD-IITTT-IESRSGVI-ADNVTKNNAILARLAKR------GKPRLVSGGYTILEELSFSGNSNGGWYSGYDVL 71 (321) T ss_pred CCCchHHH-HHHHH-HHhhcchh-hhhhhcccHHHHHHHhc------CcccccCCCeeEEEEEeeccCcceeEEEeeeee Confidence 10000000 00000 00000000 0000 111222222222 2222334445666666655 77788864 3322 Q ss_pred cccccccceeEEeeeeeeeeeehhHH-HHHhcchh-HHHHHHHHHHH---HHHHHHHHHHHHhhcCCc---cccccc--- Q lcl|NC_016164. 626 PTESQPSVDQVALVAKTLGAYTEFSR-RLMLQSSI-DVEQMVRTELA---TVIALEIDRAALYGLGSN---SQPEGL--- 694 (836) Q Consensus 626 ~~~~~~~~~~it~~~~t~~~~i~ISr-elL~ds~~-~l~~~i~~~l~---~a~a~~~d~~il~G~Gt~---~~p~Gi--- 694 (836) ...-.-.+..-.+..+.++.-+.||- |+|.++.- .+..++...|. +.+...++..++ .+|++ .+..|| T Consensus 72 ~~~p~d~~~~Aef~wk~aa~~~~isg~e~l~n~g~~~~idll~~~~~~ae~t~~n~l~~~l~-sdGTa~g~~~i~GL~~l 150 (321) T protein:vir:34 72 PTAPQDVISSAEYALKQYAVPVVISGLEMLQNSGKEAQLDLLEARMNVAEATMANDISAALY-GDGTAFGGRAINGLDGA 150 (321) T ss_pred ccchhhhccccccchhheeEeeEEehhHHhhccchHHHHHHHHHHHHHHHHHHHhhhhHhhh-ccccccccchhhhhhhh Confidence 23334467888889999988888876 55666543 34444444443 333444444544 44442 233343 Q ss_pred cccc-cccc---------------ccccccchhHHHHHHHHHHHhhhcc--ccCccEEEecHHHHHHHHHHhhccCcccc Q lcl|NC_016164. 695 KFVT-GINT---------------ENFGATNPTYVELVSMESKVAADNA--DIGAMSYLTNSTLYGGFKTTEKATSTAQF 756 (836) Q Consensus 695 ~~~~-~~~~---------------~t~aa~~~t~~~l~~a~~~l~~~~~--~~~~~~~vmnp~~~~~L~~lkd~~g~~~~ 756 (836) ...+ +..+ .+-.++..|..++..++.++-.+-. ...+..|++....|.....-...--| + T Consensus 151 v~~~p~tGtvGGIdra~~~~WRn~~~d~~~~~t~~tl~~~m~~~w~~~~Rg~~~PDlii~~~~~y~~y~~s~q~~qR--~ 228 (321) T protein:vir:34 151 VPVDPTVGTYGGINRALWPFWRSQVEDMAAVATINTIQPAMTKLWSRCVRGADMPDLIMSGNDAWTTYSNSLQVLQR--F 228 (321) T ss_pred cccCCCCceeccccccchhhhhhhhhhhhhcccHHHHHHHHHHHHHhhccCCCCccEEEechHHHHHHHHhhheeee--e Confidence 2211 1111 1112222345556666555433321 12456888888888766553332222 2 Q ss_pred ccCC-------CCeecceeeEeeC----ccccceEEEEehhceEEEeecceEEEE-ecccc--cccCcEEEEEEEEeccE Q lcl|NC_016164. 757 VLEP-------GGTVNGYNVVRSN----QVANGDVFFGVWNQMIMGMWGALDIQV-NPYAL--DKSGSVRVTALQDVDVA 822 (836) Q Consensus 757 ~~~~-------~~~l~G~pVv~s~----~~~~~~i~~gD~s~~~i~~~~~l~i~~-~~~~~--~~~~~~~~r~~~r~d~~ 822 (836) .... +=...|..|+..+ .+|++..||-|-+.+.+....+-.+.. .|... +.++.+.-.+.++.... T Consensus 229 ~~~~~a~~Gf~~Lky~~~div~D~~~g~~~pan~~yfiNT~yl~~r~h~~~~~~pi~p~r~~~~NqdA~~q~I~~~GnL~ 308 (321) T protein:vir:34 229 TSAEEANLGFRSLKFLSTDVVLDGGIGGFAGANTMYFLNTKYLHFRPHKDRNMVPLSPSRRAAFNQDAEAQILAWAGNLT 308 (321) T ss_pred cccccccccceeeeeeeEEEEEeCCCCCCccccceeeeecceEEEEEcCCCceeecCcccccccchhHHhhhhhhhheee Confidence 1111 1246778888877 578899999998888776554433322 22211 12233333334455555 Q ss_pred EEcccceEEEeec Q lcl|NC_016164. 823 VRHPEAFCRGNDN 835 (836) Q Consensus 823 v~~p~Af~~l~~A 835 (836) +-++.+=.++++- T Consensus 309 ~sn~~~~~vL~~~ 321 (321) T protein:vir:34 309 CSGAQFQGRLIAE 321 (321) T ss_pred eecccceeEEeeC Confidence 5566665666555 No 224 >protein:vir:105464 Length: 346 # NCBI annotation: putative phage major capsid protein # Family: family:all:701 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529874;genbank:gi:90592614;genbank:GeneID:3974528 Probab=81.04 E-value=0.089 Score=26.34 Aligned_cols=259 Identities=10% Similarity=-0.007 Sum_probs=107.7 Q ss_pred hhcccccccccccchhhHHHHHHHHHhhhhhhh-h-----cceeeecCCceEEEEEecCCceeeeec-cCcccccccccc Q lcl|NC_016164. 561 VVDTASAAGDLVFTDGRPGSFIELLRNRLALNT-L-----GVTMLTGLQGPVAIPRQTGAATAYWVA-EGGDPTESQPSV 633 (836) Q Consensus 561 ~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~-l-----~~~~~~~~~~~~~~p~~~~~~~a~~v~-Eg~~~~~~~~~~ 633 (836) ++ +-....+...+.+.+........ + ...+.......+.+|+.+....+.-+. -++-...++++. T Consensus 1 Ma--------inya~~~~~~Ld~~~~~~~lts~~l~~~~~~~~v~~~ggktVkIp~is~tsGl~DY~R~~g~~~~g~v~~ 72 (346) T protein:vir:10 1 MT--------INYAEKYQAAVQQAFYDGHLYSAELWNSPSNSIIKFDGAKHIKVPRLEITSGRKDRQRRTITTPVANYSN 72 (346) T ss_pred Cc--------chhHHHHHHHHHHHHHhhhccchhhcccccccceEecCCCEEEEEEeeeecccccccccCCccccccccc Confidence 00 00112233333333333221111 1 111222345689999886332233232 222222244444 Q ss_pred eeEEeee--eeeeeeehhHHHHHhcch----hHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccccccccc Q lcl|NC_016164. 634 DQVALVA--KTLGAYTEFSRRLMLQSS----IDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTENFGA 707 (836) Q Consensus 634 ~~it~~~--~t~~~~i~ISrelL~ds~----~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~aa 707 (836) +..++++ .+.-.+ .|. -+.-+. ..+...+.......+.=.+|...|+-.-+.... .+..+... +.-+ T Consensus 73 ~~et~tl~qDR~~~F-~vD--~mDvDETn~~~~~anv~~ef~r~~vvPEiDayrfskLa~~a~~---~~~~~~~~-~a~T 145 (346) T protein:vir:10 73 DWDSYELKNERYWST-LVD--PSDIDETNMVVSLANITKQFNLDSKMPEKDRYMFSHLYSGKEA---AHDGGITT-NTLD 145 (346) T ss_pred ceeEEEeecccccee-ccc--ccchHHHHHHhHHHHHHHHHHHHhhcchhhHHHHHHHHHhhhh---hccccccc-cccC Confidence 3333333 222221 111 111011 112222222222333334454433221100000 00000111 1112 Q ss_pred cchhHHHHHHHHHHHhhhccccCccEEEecHHHHHHHHHHhhccC-----ccccccCCCCeecceeeEe--eCcccc--- Q lcl|NC_016164. 708 TNPTYVELVSMESKVAADNADIGAMSYLTNSTLYGGFKTTEKATS-----TAQFVLEPGGTVNGYNVVR--SNQVAN--- 777 (836) Q Consensus 708 ~~~t~~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~~g-----~~~~~~~~~~~l~G~pVv~--s~~~~~--- 777 (836) ..-.++.|.+++.+|..+.....+.+++++|..+..|+.-..-.. +..-+...-+.|.|+||+. ++.+.. T Consensus 146 ~~ni~~~i~~~~~~lde~~vp~~~rvl~vTp~~~~lLk~s~~f~k~~~v~~~~~i~~~V~siDGv~Ii~VPs~r~~t~~~ 225 (346) T protein:vir:10 146 EKNILPAFDNMMLDFDEARIPSTNRILYVTPKTNAILKRAEAMNRALTLKDPNNIQRTVYSLDDVTIRVVPSDLMQTAYD 225 (346) T ss_pred HHHHHHHHHHHHHHHHHccCCCCCeEEEECHHHHHHHhhchhheeccccccccccceeeeeecCeEEEEcchhhcccchh Confidence 223578899999999887776677899999999998865432211 1111344456799999976 344431 Q ss_pred ---c----------eEEEEehhceEEEeecceEEEE-ecccccccCcEEEEEEEEeccEEEcccc---eEEEeecC Q lcl|NC_016164. 778 ---G----------DVFFGVWNQMIMGMWGALDIQV-NPYALDKSGSVRVTALQDVDVAVRHPEA---FCRGNDNL 836 (836) Q Consensus 778 ---~----------~i~~gD~s~~~i~~~~~l~i~~-~~~~~~~~~~~~~r~~~r~d~~v~~p~A---f~~l~~A~ 836 (836) | .+++...+ ..+.....-.+.. .|. .-..|...+.-+.++|.-|.+.+. ++.++.|- T Consensus 226 f~~G~~~~t~ak~INfiiv~~~-A~ia~~K~~~~~if~P~-~~~~g~~l~~~R~Y~D~fv~~nk~~~Iyv~~~~a~ 299 (346) T protein:vir:10 226 FSDGSKIIDTAKQIEMFLIYNG-VQIAPEKYSFVGFDQPS-AATSGNYLYYEQSYDDVLLLNTKTKGIQFVVSDKP 299 (346) T ss_pred hccCccccCCccceeEEEECCc-eeeeeeeeeeeEeeCCC-CCcccceeeeeeeeeeeeeeccccceEEEeeeccc Confidence 1 12333322 2222222112221 222 223444556666677776666433 34455555 No 225 >protein:vir:861 Length: 318 # NCBI annotation: putative minor structural protein # Family: family:all:2417 # MgeID: mge:18 # MgeName: bIL170 # Cross-refs: genbank:acc:NP_047120;genbank:gi:9630573;genbank:GeneID:1261764 Probab=80.80 E-value=0.092 Score=26.28 Aligned_cols=299 Identities=14% Similarity=0.052 Sum_probs=113.2 Q ss_pred HhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhh-hhcccccccccccchhhHHHHHHHHHh Q lcl|NC_016164. 509 FVRAIRAQMMPGDRAAFEAAAFEREVSEATAQRMGVTPRGILAPNDVLHRDL-VVDTASAAGDLVFTDGRPGSFIELLRN 587 (836) Q Consensus 509 ~~~a~~a~~~~~~~~~~~~~~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~-~~~~~~~~g~~vvp~~~~~~ii~~l~~ 587 (836) +...+.. .+.+.++.+-+ ....|............ -.+.+.+.-...+|.-+.-.|...+.. T Consensus 1 mtn~ies----------------q~A~~eF~~vL-~~N~G~S~~k~AW~A~L~E~GVtiTD~~~~LP~~lv~sI~~A~~n 63 (318) T protein:vir:86 1 MTNFIES----------------QNAVTEFFDVL-KKNSGKSEIKNAWNAKLAENGVTITDTTFQLPRKLVESINTALLN 63 (318) T ss_pred Ccchhhh----------------hHHHHHHHHHH-hccCCchhhhhhhhhhhhhcCceeeccchhccHHHHHHHHHhhhc Confidence 0000000 00111111111 00111111111111111 111122233345566555556555555 Q ss_pred hhhhhhhcceeeecCCceEEEE-EecCCceeeeeccCcccccccccceeEEeeeeeeeeeehhHHHH---HhcchhHHHH Q lcl|NC_016164. 588 RLALNTLGVTMLTGLQGPVAIP-RQTGAATAYWVAEGGDPTESQPSVDQVALVAKTLGAYTEFSRRL---MLQSSIDVEQ 663 (836) Q Consensus 588 ~~~l~~l~~~~~~~~~~~~~~p-~~~~~~~a~~v~Eg~~~~~~~~~~~~it~~~~t~~~~i~ISrel---L~ds~~~l~~ 663 (836) ..++.+..- ++..+ .+-.. ...+...+.....|..+++...+|.--++.+..+.....+- ++ +..+...++. T Consensus 64 ~n~v~~vfH--VT~~~-~~~V~~s~~s~AeAq~HkdGqTK~eqa~~~~~~Tl~~~~VY~~~S~A-e~~K~~~~sYsel~N 139 (318) T protein:vir:86 64 TNPVFKVFH--VTNVG-ALLVSRSFDSSAEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLA-ERVKRLQMSYSELYN 139 (318) T ss_pred cCcceeeee--eccch-hhhhhhhhhhhhhhhhhccCCccccceeeeeeechhHHHHHHHHHHH-HHHHHhhhhHHHHHH Confidence 555544211 11111 11111 11223445555667777777777776666665555544442 22 3334456799 Q ss_pred HHHHHHHHHHH-HHHHHHHHhhcCCccc-c-ccc---ccccccccccccccc-hhHHHHHHHHHHHhhhccccCccEEEe Q lcl|NC_016164. 664 MVRTELATVIA-LEIDRAALYGLGSNSQ-P-EGL---KFVTGINTENFGATN-PTYVELVSMESKVAADNADIGAMSYLT 736 (836) Q Consensus 664 ~i~~~l~~a~a-~~~d~~il~G~Gt~~~-p-~Gi---~~~~~~~~~t~aa~~-~t~~~l~~a~~~l~~~~~~~~~~~~vm 736 (836) +|..+|..++. +..+.+++-|+|+|+- . .-+ .......+.+.+++. +-...|-.+..-+++.- +...+++ T Consensus 140 ~i~~ELtQ~~vnk~Vd~AlV~GDG~N~f~~~DK~advK~I~k~Ttkaksagttpfanaieeavdfvrpta---grryliv 216 (318) T protein:vir:86 140 LIVAELTQAIVNKIVDLALVEGDGSNGFKSIDKEADVKKIKKITTKAKSAGTTPFANAIEEAVDFVRPTA---GRRYLIV 216 (318) T ss_pred HHHHHHHHHHHHHHHHhhheeecCCCCccchhhHHHHHHHHHHhhhhhccCCCchhhHHHHHHhhhccCC---CceEEEE Confidence 99999999998 8899999999998751 1 111 111112222222222 22233444444443321 2223444 Q ss_pred cHHH-HHHHHHHhhccCccccccCCCCe-e---ccee-eEe-eCccccceEEEEehhceEEEeecceEEEEecccccccC Q lcl|NC_016164. 737 NSTL-YGGFKTTEKATSTAQFVLEPGGT-V---NGYN-VVR-SNQVANGDVFFGVWNQMIMGMWGALDIQVNPYALDKSG 809 (836) Q Consensus 737 np~~-~~~L~~lkd~~g~~~~~~~~~~~-l---~G~p-Vv~-s~~~~~~~i~~gD~s~~~i~~~~~l~i~~~~~~~~~~~ 809 (836) .... ...|..++.+..+........++ + .|.. +++ +..-+-..-++.|.+ |++-+. + +..-+-..|.+| T Consensus 217 kaedrkalldelrqatanahvriknddteiasevgvdeiivytgskalkptvlvdqk-yhidmq-d--ltkvdafewktn 292 (318) T protein:vir:86 217 KAEDRKALLDELRQATANAHVRIKNDDTEIASEVGVDEIIVYTGSKALKPTVLVDQK-YHIDMQ-D--LTKVDAFEWKTN 292 (318) T ss_pred eecchHHHHHHHHhhcccceeEEeccchhhhhhcCcceeeeeeccccccceeeeccc-eecchh-h--hhhhhcceeccC Confidence 4443 34455566554433221111111 1 1222 111 111111122233322 222111 0 000111122333 Q ss_pred cEEEEEEEEeccEEEcccceEEEeec Q lcl|NC_016164. 810 SVRVTALQDVDVAVRHPEAFCRGNDN 835 (836) Q Consensus 810 ~~~~r~~~r~d~~v~~p~Af~~l~~A 835 (836) .-.+.++.--.+-+---.|=++++.. T Consensus 293 snmilvetltsghvetynagavitvs 318 (318) T protein:vir:86 293 SNMILVETLTSGHVETYNAGAVITVS 318 (318) T ss_pred CceEEEeecccCcceeecCceeEEeC Confidence 32233333222222222222222222 No 226 >protein:vir:6061 Length: 357 # NCBI annotation: gpN # Family: family:all:201 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878202;genbank:gi:33438901;genbank:GeneID:1457736 Probab=80.75 E-value=0.092 Score=26.27 Aligned_cols=292 Identities=10% Similarity=0.039 Sum_probs=126.7 Q ss_pred hhhhhhhhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeee-c Q lcl|NC_016164. 523 AAFEAAAFEREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLT-G 601 (836) Q Consensus 523 ~~~~~~~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~-~ 601 (836) ...........+....++..|.... ..+-.+.|-|. ....+...+.+.+.+.+....+.. - T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~-----------------d~~~~FsV~P~-v~q~L~~~i~ess~FL~~INvv~V~e 62 (357) T protein:vir:60 1 MRQETRFKFNAYLSRVAELNGIDAG-----------------DVSKKFTVEPS-VTQTLMNTMQESSDFLTRINIVPVSE 62 (357) T ss_pred CChHHHHHHHHHHHHHHHHhCCChH-----------------HhcceeecCHH-HHHHHHHHHHHHHHHhccCCcccccc Confidence 0000001111111111111111100 00111223333 455677778888888776554322 2 Q ss_pred CCceEEEEEecCCceeeeecc--Cccccccc-ccceeEEeeeeeeeeeehhHHHHHhcc--hhHHHHHHHHHHHHHHHHH Q lcl|NC_016164. 602 LQGPVAIPRQTGAATAYWVAE--GGDPTESQ-PSVDQVALVAKTLGAYTEFSRRLMLQS--SIDVEQMVRTELATVIALE 676 (836) Q Consensus 602 ~~~~~~~p~~~~~~~a~~v~E--g~~~~~~~-~~~~~it~~~~t~~~~i~ISrelL~ds--~~~l~~~i~~~l~~a~a~~ 676 (836) ..|..-. ...+++-+.-+.- +.+....+ ..++.-.|.....---+.|+.+.|... .+++...+...+.++++.- T Consensus 63 ~~Ge~i~-lg~~g~iagrtdT~~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD 141 (357) T protein:vir:60 63 MKGEKIG-IGVTGSIASTTDTAGGTERQPKDFSKLASNKYECDQINFDFYIRYKTLDLWARYQDFQLRVRNAIIKRQSLD 141 (357) T ss_pred ceeeEEe-cccCcccccccccCCCCCcccccccccCCCccEEEEeeeeccccHHHHHHHhcChhHHHHHHHHHHHHHhhc Confidence 2222221 1222233332211 11221122 344555566655555556666666532 2466677777777766654 Q ss_pred HHHHHHhhcCCc------ccc------ccccc----cc------------cc--c-cccccccchhHHHHHHHHH----H Q lcl|NC_016164. 677 IDRAALYGLGSN------SQP------EGLKF----VT------------GI--N-TENFGATNPTYVELVSMES----K 721 (836) Q Consensus 677 ~d~~il~G~Gt~------~~p------~Gi~~----~~------------~~--~-~~t~aa~~~t~~~l~~a~~----~ 721 (836) .=...++|.-.. ..| +|.+. .+ +. . .+..+ ..-+|.+|-.+.. . T Consensus 142 ~i~IGfNGts~A~~Td~~~nPllqDVN~GWlQ~~Re~ap~rVm~~~~~~~g~~~~~~i~~G-~~gdy~NLDalV~D~~~~ 220 (357) T protein:vir:60 142 LIMAGFNGVRRAETSDRSSNQMLQDVAVGWLQKYRNEAPARVMSKVTDEEGHTTSEVIRVG-KGGDYASLDALVMDATNN 220 (357) T ss_pred cceecccceeeeccCChhhCcCccccchhHHHHHHhhchhhhhccccccCCccccceeeec-CCCCcccHHHHHHHHHhc Confidence 444556775321 122 23221 00 00 0 01111 1124555544443 2 Q ss_pred -HhhhccccCccEEEecHHHHHH-HHHHhhccCccc--c---ccCCCCeecceeeEeeCccccceEEEEehhceEEEeec Q lcl|NC_016164. 722 -VAADNADIGAMSYLTNSTLYGG-FKTTEKATSTAQ--F---VLEPGGTVNGYNVVRSNQVANGDVFFGVWNQMIMGMWG 794 (836) Q Consensus 722 -l~~~~~~~~~~~~vmnp~~~~~-L~~lkd~~g~~~--~---~~~~~~~l~G~pVv~s~~~~~~~i~~gD~s~~~i~~~~ 794 (836) |...+.+.+..+.++....+.. -..+....+.+. . +.....++-|+|.+..+++|.+.+++--|+++.|.... T Consensus 221 lI~~~~~~d~dLVvivG~dLla~k~~~l~n~~~~pTE~~Aa~~i~s~k~iGGl~a~~~PfFP~~~llVT~L~NLsIY~Q~ 300 (357) T protein:vir:60 221 LIEPWYQEDPDLVVIVGRQLLADKYFPIVNREQDNSEMLAADVIISQKRIGNLPAVRVPYFPADAMLITKLENLSIYYMD 300 (357) T ss_pred cCChHHhcCCCEEEEEchhhhhHHhhhHhhcCCChHHHHHHHHHHHhhhhcCcceEEccccCCCceEEeeccccEEEEec Confidence 3445566667788888877642 112222222221 0 11123578999999999999999999999887775544 Q ss_pred c-eEEEEecccccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 795 A-LDIQVNPYALDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 795 ~-l~i~~~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) | ..-...+. -.++++.-.-..--+..|-++..++.+.+.- T Consensus 301 gs~RR~~~d~--p~r~riE~y~s~Ne~YvVEd~~~~a~iE~i~ 341 (357) T protein:vir:60 301 DSHRRVIEEN--PKLDRVENYESMNIDYVVEDYAAGCLVEKIK 341 (357) T ss_pred CcEEEEEEec--cccccccchhhhcceeeeeccccEEEeeeee Confidence 3 22111111 1112221111112222333333333333211 No 227 >protein:vir:100851 Length: 514 # NCBI annotation: hypothetical protein # Family: family:all:2450 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164744;genbank:gi:56693157;genbank:GeneID:3197484 Probab=80.61 E-value=0.093 Score=26.23 Aligned_cols=318 Identities=13% Similarity=0.082 Sum_probs=121.3 Q ss_pred HHHHhhhhhhhhhhhHHHHHHHhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhh-----hh Q lcl|NC_016164. 488 IAAGGGSADIGLTDKEARSFSFVRAIRAQMMPGDRAAFEAAAFEREVSEATAQRMGVTPRGILAPNDVLHRDL-----VV 562 (836) Q Consensus 488 ~~~~~~~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~-----~~ 562 (836) ...+....+ .. +...-.++.+..+.... .+.++.. .+..+...+. .. T Consensus 1 ~~~~~~~~~-------~~--------~~~~~~~~~~~~~~~~~-----~~~~~~~--------~~~~~~k~a~t~gy~~~ 52 (514) T protein:vir:10 1 MYTQDKTKD-------IM--------KKSFFGGDRAVAFDTNK-----EDILNEN--------LPENVKKSAFTAGHSIT 52 (514) T ss_pred CCccchhhH-------HH--------hhhhcccceeeeecCcH-----HHHHHHh--------cchhhhhhhhccccccC Confidence 000000000 00 00000111111111110 0000000 0111111100 11 Q ss_pred cccccccccccchhhHHHHHHHHH---hhhhhhhhcceeeecCCceEE-EEEecCCceeeeeccCcccccccccceeEEe Q lcl|NC_016164. 563 DTASAAGDLVFTDGRPGSFIELLR---NRLALNTLGVTMLTGLQGPVA-IPRQTGAATAYWVAEGGDPTESQPSVDQVAL 638 (836) Q Consensus 563 ~~~~~~g~~vvp~~~~~~ii~~l~---~~~~l~~l~~~~~~~~~~~~~-~p~~~~~~~a~~v~Eg~~~~~~~~~~~~it~ 638 (836) ....++|+.+--+.+...+..+.. +..-+..+..+.....-..+. +.......-..++.|++-.+..++.+.+.++ T Consensus 53 ~~~~t~gaAlR~EsLd~~l~~Lt~~~~~ftf~~~i~k~~a~STV~ey~~~~~~G~~G~~~f~~E~gi~~~~d~~~~rk~~ 132 (514) T protein:vir:10 53 PDTQTDGAANRIESLNRDLKVTTWGERDFTLYNDIAKQPVDNTVLKYTQYYSHGRTGHSLFQPEIGIGDVNNPNERQRTI 132 (514) T ss_pred CccccCccchhhhhhccceeEeeecCcchhhhhhcCCchhhHHHhhhhhhcccCcccccccccccccCcCCCcceEEEEE Confidence 112222332222222221110000 001111111111111001111 1122234467889999999999999999999 Q ss_pred eeeeeeeeehhHHHH-HhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCC--------cccccccccc-ccccccccccc Q lcl|NC_016164. 639 VAKTLGAYTEFSRRL-MLQSSIDVEQMVRTELATVIALEIDRAALYGLGS--------NSQPEGLKFV-TGINTENFGAT 708 (836) Q Consensus 639 ~~~t~~~~i~ISrel-L~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt--------~~~p~Gi~~~-~~~~~~t~aa~ 708 (836) .++-++....+|..+ |.|+..+......+.-...++..++.+.|+|+-. +.+..||.+- ..-+.+.+-+. T Consensus 133 ~~k~l~~~~~vS~~~~l~n~i~d~~~~~~~dai~~ia~tiE~a~FyGDs~L~s~~~~~gleFDGl~~lI~~~NvIDarG~ 212 (514) T protein:vir:10 133 NIKYIVDTHVTSIALQRANTIVDSLKVQEYAAISTVIKTDEWAMFYGDADLTSGQKGEGLQFDGLFKLIAPENHIDLRGG 212 (514) T ss_pred eeeeeeeeeeeeehhhhccchhhHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccCcchhhhHHHhhcCCCeEecCCC Confidence 998888766666543 4677778888888888899999999999988632 1344666543 34455566666 Q ss_pred chhHHHHHHHHHHHhhhccccCccEEEecHHHHHHHHHHhhccCccccccCC-CCeecceee--EeeCc--cc-cceEEE Q lcl|NC_016164. 709 NPTYVELVSMESKVAADNADIGAMSYLTNSTLYGGFKTTEKATSTAQFVLEP-GGTVNGYNV--VRSNQ--VA-NGDVFF 782 (836) Q Consensus 709 ~~t~~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~~g~~~~~~~~-~~~l~G~pV--v~s~~--~~-~~~i~~ 782 (836) .++.+.|..+-..+...+ -.+.-++|+..+.+.+..-....-+ .++... ++-..|++| +.+.. +. .+..++ T Consensus 213 ~Ls~~~ln~aA~~i~~gf--Gt~TD~ylp~~vka~f~~~~~~~qR-V~~~~n~~~~~~G~~v~~f~s~~G~I~L~gs~im 289 (514) T protein:vir:10 213 RLSPAALNMAARKIGEGF--GTPTDAYMPIGIKADFVNQHLNGQR-VMLPGQTGGMTTGLDIDKFLSAHGSIRIQGSTIM 289 (514) T ss_pred CccHHHHhhhhhhhhccc--CChhheeCchHHHHHHhhcccCcce-EEeecCccceeeeeeccceeEeccceeecCCeee Confidence 777666665554443333 3455678888887766543322111 112211 111223222 00000 00 000011 Q ss_pred EehhceEEEe--------ecceEEEEecc--------cc----------cccC-cEEEEEEEEeccEEEcccceEEEeec Q lcl|NC_016164. 783 GVWNQMIMGM--------WGALDIQVNPY--------AL----------DKSG-SVRVTALQDVDVAVRHPEAFCRGNDN 835 (836) Q Consensus 783 gD~s~~~i~~--------~~~l~i~~~~~--------~~----------~~~~-~~~~r~~~r~d~~v~~p~Af~~l~~A 835 (836) +....+.... ...+.+.+.+. .. .+-+ ...|++...-.-+--.|..++..+.| T Consensus 290 ~~~n~L~~~~~~~~~Ap~~~~va~svT~~~~g~~~~ad~t~~~g~~~~~~~~g~~~sYaVv~~n~~GeS~ps~~vtaT~a 369 (514) T protein:vir:10 290 DSDNKLDFDRPVSPTAPTAPQLSATVTPDGGGLWHEADKTDSKGEVILNKEVGVEQSYVAVMVSRHGDSRPSLVQTATPT 369 (514) T ss_pred cccccCccCCccCCcCCCCCcceEEEecCcccccCcccccccccccccccccceeEEEEEEEECCCCcccccceeeeeee Confidence 1000000000 00000111000 00 0000 11122222222222223333322222 Q ss_pred C Q lcl|NC_016164. 836 L 836 (836) Q Consensus 836 ~ 836 (836) = T Consensus 370 ~ 370 (514) T protein:vir:10 370 K 370 (514) T ss_pred c Confidence 1 No 228 >protein:vir:5694 Length: 357 # NCBI annotation: gpN # Family: family:all:201 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839853;genbank:gi:30065708;genbank:GeneID:1260602 Probab=80.27 E-value=0.096 Score=26.15 Aligned_cols=292 Identities=11% Similarity=0.045 Sum_probs=127.2 Q ss_pred hhhhhhhhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeee-c Q lcl|NC_016164. 523 AAFEAAAFEREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLT-G 601 (836) Q Consensus 523 ~~~~~~~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~-~ 601 (836) ...........+....++..|.... ..+-.+.|-|. ....+...+.+.+.+.+....+.. - T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~-----------------d~~~~FsV~P~-v~q~L~~~i~ess~FL~~INvv~V~e 62 (357) T protein:vir:56 1 MRQETRFKFNAYLSRVAELNGIDAG-----------------DVSKKFTVEPS-VTQTLMNTMQESSDFLTRINIVPVSE 62 (357) T ss_pred CChHHHHHHHHHHHHHHHHhCCChH-----------------HhcceeecCHH-HHHHHHHHHHHHHHHhccCCcccccc Confidence 0000001111111111111111100 00111223333 455677778888888776554322 2 Q ss_pred CCceEEEEEecCCceeeeec--cCccccccc-ccceeEEeeeeeeeeeehhHHHHHhcc--hhHHHHHHHHHHHHHHHHH Q lcl|NC_016164. 602 LQGPVAIPRQTGAATAYWVA--EGGDPTESQ-PSVDQVALVAKTLGAYTEFSRRLMLQS--SIDVEQMVRTELATVIALE 676 (836) Q Consensus 602 ~~~~~~~p~~~~~~~a~~v~--Eg~~~~~~~-~~~~~it~~~~t~~~~i~ISrelL~ds--~~~l~~~i~~~l~~a~a~~ 676 (836) ..|..-. ...+++-+.-+. -+.+....+ ..++.-.|.....---+.|+.+.|... .+++...+...+.++++.- T Consensus 63 ~~Ge~i~-lg~~g~iagrtdT~~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD 141 (357) T protein:vir:56 63 MKGEKIG-IGVTGSIASTTDTAGGTERQPKDFSKLASNKYECDQINFDFYIRYKTLDLWARYQDFQLRVRNAIIKRQSLD 141 (357) T ss_pred ceeeEEe-cccCccccccccCCCCCCcccccccccCCCccEEEEeeecccccHHHHHHHhcChhHHHHHHHHHHHHHhhc Confidence 2222221 122223333221 111222222 244555566655555556666666532 2466677777777666654 Q ss_pred HHHHHHhhcCCc------ccc------ccccc----cc------------cc--c-cccccccchhHHHHHHHHHH---- Q lcl|NC_016164. 677 IDRAALYGLGSN------SQP------EGLKF----VT------------GI--N-TENFGATNPTYVELVSMESK---- 721 (836) Q Consensus 677 ~d~~il~G~Gt~------~~p------~Gi~~----~~------------~~--~-~~t~aa~~~t~~~l~~a~~~---- 721 (836) .=...++|.-.. ..| +|.+. .+ +. . .+..+ ..-+|.+|-++... T Consensus 142 ~i~IGfNGts~A~~Td~~~nPllqDVN~GWlQ~~Re~ap~rVm~~~~~~~g~~~~~~i~~G-~~gdy~NLDalV~D~~~~ 220 (357) T protein:vir:56 142 FIMAGFNGVKRAETSDRSSNPMLQDVAVGWLQKYRNEAPARVMSKVTDEEGHTTSEVIRVG-KGGDYASLDALVMDATNN 220 (357) T ss_pred cceecccceeeeccCChhhCcCccccchhHHHHHHhhchhhhhccccccCCccccceeeec-CCCCcccHHHHHHHHHhc Confidence 444556775321 122 23221 00 00 0 01111 11245555444432 Q ss_pred -HhhhccccCccEEEecHHHHHH-HHHHhhccCccc--c---ccCCCCeecceeeEeeCccccceEEEEehhceEEEeec Q lcl|NC_016164. 722 -VAADNADIGAMSYLTNSTLYGG-FKTTEKATSTAQ--F---VLEPGGTVNGYNVVRSNQVANGDVFFGVWNQMIMGMWG 794 (836) Q Consensus 722 -l~~~~~~~~~~~~vmnp~~~~~-L~~lkd~~g~~~--~---~~~~~~~l~G~pVv~s~~~~~~~i~~gD~s~~~i~~~~ 794 (836) |...+.+.+..+.++....+.. --.+....+.+. . +.....++-|+|.+..+++|.+.+++--|+++.|.... T Consensus 221 lI~~~~~~d~dLVvivG~dLla~k~~~l~n~~~~pTE~~Aa~~i~s~k~iGGl~a~~~PfFP~~~llVT~L~NLsIY~Q~ 300 (357) T protein:vir:56 221 LIEPWYQEDPDLVVIVGRQLLADKYFPIVNKEQDNSEMLAADVIISQKRIGNLPAVRVPYFPADAMLITKLENLSIYYMD 300 (357) T ss_pred cCChHHhcCCCEEEEEchhhhhhhhhhHhhccCChHHHHHHHHHHHhhhhCCceeEEccccCCCceEEeeccccEEEEec Confidence 3445566667788888877642 122222222221 1 11123578999999999999999999999887776544 Q ss_pred c-eEEEEecccccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 795 A-LDIQVNPYALDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 795 ~-l~i~~~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) | ..-...+. -.++++.-.-..--+..|=++..++.+.+.- T Consensus 301 gs~RR~~~d~--p~r~riE~y~s~Ne~YvVEd~~~~a~iE~i~ 341 (357) T protein:vir:56 301 DSHRRVIEEN--PKLDRVENYESMNIDYVVEDYAAGCLVEKIK 341 (357) T ss_pred CcEEEEEEec--cccccccchhhhcceeeeeccccEEEeeeee Confidence 3 22111111 1112221111122222333333333333222 No 229 >protein:vir:3783 Length: 336 # NCBI annotation: capsid # Family: family:all:201 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536823;genbank:gi:17981832;genbank:GeneID:929211 Probab=74.41 E-value=0.16 Score=24.97 Aligned_cols=290 Identities=9% Similarity=0.020 Sum_probs=121.2 Q ss_pred hhhhhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeec-CCc Q lcl|NC_016164. 526 EAAAFEREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTG-LQG 604 (836) Q Consensus 526 ~~~~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~-~~~ 604 (836) ........+....++..|.... ....+..+.+.+.....+...+.+.+.+.+....+... ..| T Consensus 1 mtr~~~~~y~~~~A~~ngv~~a----------------~~~~~~~Fsv~P~v~q~L~~~i~ess~FL~~INvv~V~e~~G 64 (336) T protein:vir:37 1 MNKQAYYALAAALAKHFNQPLD----------------SVLRGESFALKAPEAALLGENIQQRSDFLKGINMVQVAHTKG 64 (336) T ss_pred CcHHHHHHHHHHHHHHhCCChh----------------hhcccceeecCHHHHHHHHHHHHHHHHHhhcCceeecccccc Confidence 0000011111111111111100 00011122334445566778888888887765543222 222 Q ss_pred eEEEEEecCCceeeeeccCcccccccccceeEEeeeeeeeeeehhHHHHHhcch--hHHHHHH-HHHHHHHHHHHHHHHH Q lcl|NC_016164. 605 PVAIPRQTGAATAYWVAEGGDPTESQPSVDQVALVAKTLGAYTEFSRRLMLQSS--IDVEQMV-RTELATVIALEIDRAA 681 (836) Q Consensus 605 ~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~it~~~~t~~~~i~ISrelL~ds~--~~l~~~i-~~~l~~a~a~~~d~~i 681 (836) ..-. ...+++-+.-..-+... .....+.-.|.....---+.|+.+.|...+ ++.+..+ ...+.++++.-.-... T Consensus 65 e~v~-lg~~g~iagrtdt~r~r--~~~~l~~~~Y~c~qTn~dt~i~y~~LD~WA~~~d~~~~~~~~~~~r~iALD~i~IG 141 (336) T protein:vir:37 65 TKLF-GATEKGVTGRKQTGRNL--ATLDHSQNGYELSETDSGILVNWSLFDSFAIFKDRLVELYSEYFQNQVALDILQIG 141 (336) T ss_pred eEEe-eccCcccccccCCCCCc--cccCCCCCccEEEEeeeeeeccHHHHHHHhcChhHHHHHHHHHHHHHHhcchhhhc Confidence 2221 11122222222222111 122344445555555555567777765432 3333222 2223333333333344 Q ss_pred HhhcCC---cccccc------ccc----ccccc----------cccccccchhHHHHH----HHHHHHhhhccccCccEE Q lcl|NC_016164. 682 LYGLGS---NSQPEG------LKF----VTGIN----------TENFGATNPTYVELV----SMESKVAADNADIGAMSY 734 (836) Q Consensus 682 l~G~Gt---~~~p~G------i~~----~~~~~----------~~t~aa~~~t~~~l~----~a~~~l~~~~~~~~~~~~ 734 (836) ++|.-. ...|.+ .+. .+... .+...+..-+|.+|- +++..+...+.+.+..+. T Consensus 142 fnG~s~A~~TdnPllqDVNkGWlQ~~Re~a~~~v~~~~~~~~g~i~~~G~~gdy~NLDalV~D~~~~I~~~~~~d~dLVv 221 (336) T protein:vir:37 142 WNGQSVATNTTKTDLSDVNKGWLKLLQEQRAANFMTESTKSSGKITIFGDNADYANLDDLAFDLKQGLDFRHQNRNDLVF 221 (336) T ss_pred ccceeeccCCCCccccccchhHHHHHHhccchhhcccccccCCceEEecCCCCcccHHHHHHHHHhccchHHhcCCCeEE Confidence 566431 123432 211 00000 000101112344444 344445555666667788 Q ss_pred EecHHHHHH-HHHHhhccC-ccc--c---ccCCCCeecceeeEeeCccccceEEEEehhceEEEeecce-EEEEeccccc Q lcl|NC_016164. 735 LTNSTLYGG-FKTTEKATS-TAQ--F---VLEPGGTVNGYNVVRSNQVANGDVFFGVWNQMIMGMWGAL-DIQVNPYALD 806 (836) Q Consensus 735 vmnp~~~~~-L~~lkd~~g-~~~--~---~~~~~~~l~G~pVv~s~~~~~~~i~~gD~s~~~i~~~~~l-~i~~~~~~~~ 806 (836) ++..+.+.. ...+-..++ +|. . +.....++-|+|.+..+.+|.+.+++--++++.|....|- .-...+. - T Consensus 222 ivG~dLla~~~~~l~~~~~~~PtE~~Aa~~~~~~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~--p 299 (336) T protein:vir:37 222 LVGADLVSKETKLIQQKHGLTPTEKAALGSHNLMGSFGGMNAITPPNFPARAAAVTTLKNLSVYTEAESVRRSLRND--E 299 (336) T ss_pred EEchhhhhhhhhhhhhhcCCCHHHHHHHHHHHHHHhhCCceEEEccccCCCceEEeeccccEEEEecCcEEEEEEEc--c Confidence 888776532 112222222 221 0 0112357899999999999999999999998877655442 2111111 1 Q ss_pred ccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 807 KSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 807 ~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) +++++.-.-..--+..|-++..++.+.+.= T Consensus 300 ~r~rie~y~s~Ne~YvVEd~~~~a~iE~i~ 329 (336) T protein:vir:37 300 DKKGLVTSYYRQEGYVVEDLGLMTAIDHTK 329 (336) T ss_pred ccccccchhhhcceeeeeccccEEEeeeee Confidence 122222222222333444444444444322 No 230 >protein:vir:78186 Length: 337 # NCBI annotation: gp2, phage major capsid protein, P2 family # Family: family:all:201 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111152;genbank:gi:134288735;genbank:GeneID:4960646 Probab=73.51 E-value=0.17 Score=24.81 Aligned_cols=289 Identities=9% Similarity=0.032 Sum_probs=127.9 Q ss_pred hhhhhhhhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeee-c Q lcl|NC_016164. 523 AAFEAAAFEREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLT-G 601 (836) Q Consensus 523 ~~~~~~~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~-~ 601 (836) ...........+....++..|. ...+..+.+.+.....+...+.+.+.+.+....+.. - T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv--------------------~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e 60 (337) T protein:vir:78 1 MRKETRQAYEKYAAQIAKLNDT--------------------GDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTE 60 (337) T ss_pred CChHHHHHHHHHHHHHHHhcCh--------------------hhhcceeecChHHHHHHHHHHHHHHHHhccCCcccccc Confidence 0000000111111111111111 011112223333555677778888888776554322 2 Q ss_pred CCceEEEEEecCCceeeeeccC-cc-cccccccceeEEeeeeeeeeeehhHHHHHhcc--hhHHHHHHHHHHHHHHHHHH Q lcl|NC_016164. 602 LQGPVAIPRQTGAATAYWVAEG-GD-PTESQPSVDQVALVAKTLGAYTEFSRRLMLQS--SIDVEQMVRTELATVIALEI 677 (836) Q Consensus 602 ~~~~~~~p~~~~~~~a~~v~Eg-~~-~~~~~~~~~~it~~~~t~~~~i~ISrelL~ds--~~~l~~~i~~~l~~a~a~~~ 677 (836) ..|..-. ...+++-+.-..-+ .+ .|..-...+.-.|.....---+.|+.+.|... .+++...+...+.++++.-. T Consensus 61 ~~Ge~v~-lg~~g~iagrtdt~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~ 139 (337) T protein:vir:78 61 LEGEKLG-LSVSGPIASRTDTTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDR 139 (337) T ss_pred ceeeEEe-cccCcceeeeecCCCcccccccccccCCCccEEEEeceecccCHHHHHHHhcChhHHHHHHHHHHHHHhhcc Confidence 2222221 11222223222211 11 12222345555555555555556666666542 34677777777777766554 Q ss_pred HHHHHhhcCCc------ccc------cccc------------cccc--cccccccccchhHHHHHHHHHH-----Hhhhc Q lcl|NC_016164. 678 DRAALYGLGSN------SQP------EGLK------------FVTG--INTENFGATNPTYVELVSMESK-----VAADN 726 (836) Q Consensus 678 d~~il~G~Gt~------~~p------~Gi~------------~~~~--~~~~t~aa~~~t~~~l~~a~~~-----l~~~~ 726 (836) =...++|.-.. ..| +|.+ +... ...+..+ ..-+|.+|-++... +...+ T Consensus 140 i~IGfNGts~A~~Td~~~nPllqDVN~GWlQ~~Re~ap~rVl~~~~~~~~~i~iG-~~gdy~NLDalV~d~~~~lI~~~~ 218 (337) T protein:vir:78 140 IMIGWNGVKAAATTDRQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLIG-KAGDYENLDALVMDIVSSMIDPWF 218 (337) T ss_pred ceecccceeeccCCChhhCcCccccchHHHHHHHhcchhhhhccccccCCceeec-CCCCcccHHHHHHHHHhccCChHH Confidence 45556775321 122 2221 1100 0011111 11245555444433 34455 Q ss_pred cccCccEEEecHHHHHH-HHHHhhccCccc--c---ccCCCCeecceeeEeeCccccceEEEEehhceEEEeecc-eEEE Q lcl|NC_016164. 727 ADIGAMSYLTNSTLYGG-FKTTEKATSTAQ--F---VLEPGGTVNGYNVVRSNQVANGDVFFGVWNQMIMGMWGA-LDIQ 799 (836) Q Consensus 727 ~~~~~~~~vmnp~~~~~-L~~lkd~~g~~~--~---~~~~~~~l~G~pVv~s~~~~~~~i~~gD~s~~~i~~~~~-l~i~ 799 (836) .+.+..+.++....+.. -..+....+.+. . +.....++-|+|.+..+.+|.+.+++--|+++.|....| ..-. T Consensus 219 ~~d~dLVvivG~dLladk~~~l~n~~~~ptE~~Aa~~i~s~k~iGGl~a~~~PfFP~~~ilVT~L~NLsIY~Q~gs~RR~ 298 (337) T protein:vir:78 219 QEDTGLVVICGRELLHDKYFPIVNATQAPTERLAADLIVSQKRIGNLPAVRVPFFPKRALMVTKLSNLSIYYQEGARRRT 298 (337) T ss_pred hcCCCEEEEEchhhhHHHHHHHHhcCCCcHHHHHHHHHHHhhhhcCcceEEccccCCCceEEeechhcEEEEecCcEEEE Confidence 66667788888887652 112222222221 0 011235789999999999999999999999887765443 2221 Q ss_pred EecccccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 800 VNPYALDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 800 ~~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) ..+. -+++.+.-.-..--+..|-++..++.+. .+ T Consensus 299 ~~d~--p~r~rie~y~s~Ne~YvVEd~~~~a~iE-nI 332 (337) T protein:vir:78 299 LKEV--PERDRIENYESSNDAYVVEDFGCGCVAE-NI 332 (337) T ss_pred EEec--cccccccchhhccceeeeeccccEEEEe-ce Confidence 1111 1122222111122233333444444333 12 No 231 >protein:vir:101039 Length: 529 # NCBI annotation: major capsid protein # Family: family:all:364 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932516;genbank:gi:37651642;genbank:GeneID:2610532 Probab=73.49 E-value=0.17 Score=24.81 Aligned_cols=412 Identities=12% Similarity=0.066 Sum_probs=125.3 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhh----hhhhh----h Q lcl|NC_016164. 383 ATSSSGPPGAAAATVAPLSHNDNNHMDSSTIDMEAVRAQAAADERSRVASITSLCREHKADDLAQG----LIESG----A 454 (836) Q Consensus 383 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~ei~al~~~~~l~e~a~e----liee~----~ 454 (836) +....+...+++.+.. . .+-..+|...-+++-...+.+. +.++. . T Consensus 1 ~~~~~~~l~~kw~p~l-----~----------------------~~~~~~i~~~~~~~~~a~l~enq~~~~~~~~~~~~~ 53 (529) T protein:vir:10 1 MSLKNKEILNKWTPLL-----E----------------------GEGLPEIAGKNKQALVAQILEAQEKDSKSDPVYRDD 53 (529) T ss_pred CcccHHHHHHHhHHHh-----c----------------------CCccchhccchhhhhhhhhhhhhHHHHhhccccchh Confidence 0000000111110000 0 0000111110000000000000 00000 0 Q ss_pred hHHHHHHHHHHHhhhh--hhhHHHHHHHHhhhhhhHHHHhhhhhhhhhhhHHHHHHHhh--hhhhhhhhhhhhhhhhhhh Q lcl|NC_016164. 455 SEADAMRSVLSEIAKR--PAAQPATPAAPVRSAQPIAAGGGSADIGLTDKEARSFSFVR--AIRAQMMPGDRAAFEAAAF 530 (836) Q Consensus 455 t~~e~~~~~l~~l~~~--~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--a~~a~~~~~~~~~~~~~~~ 530 (836) .+.+.....+.+..-. ..-......... . ........-....+.++.....-... .+-+...|. T Consensus 54 ~~~e~~~~~l~~~~~~~~~~~~~~~i~est-~-t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPT---------- 121 (529) T protein:vir:10 54 KLIEAFGQSLMEAEVAGDHGYDPTNIAAGQ-S-SGAITNIGPAVIGMVRRAIPSLIAFDIAGVQPMTGPT---------- 121 (529) T ss_pred hhhhhhhcccchhhcccccccccccccccc-c-cccccccCchhhhhHHHHHhhhhhheeeeeecCCchh---------- Confidence 0000000000000000 000000000000 0 00000000000011110000000000 000000000 Q ss_pred hHHHHHHHHHHhhhhhh---------hhhhhhhhhhhhhhhcccccccccccc-----hhhH-HHHHHHHH--hhhhhhh Q lcl|NC_016164. 531 EREVSEATAQRMGVTPR---------GILAPNDVLHRDLVVDTASAAGDLVFT-----DGRP-GSFIELLR--NRLALNT 593 (836) Q Consensus 531 ~~~~a~~~~~~~g~~~~---------g~~~~~~~~~~a~~~~~~~~~g~~vvp-----~~~~-~~ii~~l~--~~~~l~~ 593 (836) ....++..+++.... ....+......+..........+...+ ..+. ....+.+. ..+.+.. T Consensus 122 --GLIFAMRsrY~~~~~~~~~~eaf~~~y~Pda~~sga~~~ga~~~~~~~~~~~~t~~~~~a~~~g~ea~f~ea~t~fs~ 199 (529) T protein:vir:10 122 --GQVFALRSVYGKDPLAAGAKEAFHPMYAPDAWHSSLATKGATTTTDGTPFAKLTAGQAIAEGDIVGHFFYESGTAFLQ 199 (529) T ss_pred --hhhhhhheeecCCccccccccccccccccccccccccccccccccCccccccccccccccccCcceeeeecccceecc Confidence 000000001100000 000000000000000000000000000 0000 00000000 0000000 Q ss_pred h--------cceee-----------ecCCceEEEEEecCCceeeee-----ccCcccccccccceeEEeeeeeeeeeehh Q lcl|NC_016164. 594 L--------GVTML-----------TGLQGPVAIPRQTGAATAYWV-----AEGGDPTESQPSVDQVALVAKTLGAYTEF 649 (836) Q Consensus 594 l--------~~~~~-----------~~~~~~~~~p~~~~~~~a~~v-----~Eg~~~~~~~~~~~~it~~~~t~~~~i~I 649 (836) . +.... ......+.+.....+..+.-. .-+.++++-.++++++++.+++-+-.... T Consensus 200 ~~~g~~~~~g~~~~~~~~~~~~~~~~a~~~~~~~~~Gm~Ta~aEaL~~~g~ss~~~f~EMaFsIeK~tVtAKSRaLKAEY 279 (529) T protein:vir:10 200 NVSGASVTVGTNETGEALDKLINAAIGEGKLAEIAEGMATSIAELRQGFNGSNDNPWNEMSFRIDKQTVEAKSRQLKAQY 279 (529) T ss_pred cccccccccCccccCcccccccccccccccccccccccchhhhhccccCCCcccccccceeeEEEEEEEeeeccceeccc Confidence 0 00000 000011111111011111111 01234678888999999999999999999 Q ss_pred HHHHHhc----chhHHHHHHHHHHHHHHHHHHHHHHHhhcCC------------cccccccccccccccccccccc---- Q lcl|NC_016164. 650 SRRLMLQ----SSIDVEQMVRTELATVIALEIDRAALYGLGS------------NSQPEGLKFVTGINTENFGATN---- 709 (836) Q Consensus 650 SrelL~d----s~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt------------~~~p~Gi~~~~~~~~~t~aa~~---- 709 (836) |-||.+| -.+++++.|.+.|+..|...||+.||.-.-+ ++...|++.......+..+-.. T Consensus 280 TiELAQDLKAVHGLDAEtELsNILStEImlEINReii~~l~~~a~~~k~~g~~~~~~~~Gv~d~~~~~~~~~~~~~~e~~ 359 (529) T protein:vir:10 280 SIELAQDLRAVHGMDADSELNGILANEVMLEINREVIDWINYTAQVGKSGWTKTDGSASGVFDFQDPIDVRGARWAGESY 359 (529) T ss_pred cHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHhHhhhhhhhhcccccccccccceeecccCccccccchHHHHH Confidence 9999886 2468999999999999999999988743221 1123344433221111100000 Q ss_pred -hhHHHHHHHHHHHhhhccccCccEEEecHHHHHHHHHH--hhccC----cccc-ccCCC----Ceec-ceeeEeeCccc Q lcl|NC_016164. 710 -PTYVELVSMESKVAADNADIGAMSYLTNSTLYGGFKTT--EKATS----TAQF-VLEPG----GTVN-GYNVVRSNQVA 776 (836) Q Consensus 710 -~t~~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~l--kd~~g----~~~~-~~~~~----~~l~-G~pVv~s~~~~ 776 (836) -.+-.+.++-..+..+.++.....++++++....|... .+... ...+ ..... |.|. ||+|++.++.+ T Consensus 360 k~L~~~i~~~an~I~~~T~rg~~n~vi~S~~Va~~L~~~~~~~~~~~~~~~sg~~~d~~~~~~~G~l~~~~~vy~D~y~~ 439 (529) T protein:vir:10 360 KALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALALIDTNISPAAQGMASGLNADTTKGVFAGILGGRYKVYIDQYAR 439 (529) T ss_pred HHHHHHHHHHHHHHHHhhccccceEEEEchHHHHHHHhhhhhccccccccccccccccCCceEEEEecCceEEEecCCCC Confidence 01223444444555555544567889999988887632 21111 0111 11111 3343 57999999988 Q ss_pred cceEEEEehh--ce--EEEeecceEEEEecccccccCcEEEEEEEEeccEEEcccce-------EEEeec---------- Q lcl|NC_016164. 777 NGDVFFGVWN--QM--IMGMWGALDIQVNPYALDKSGSVRVTALQDVDVAVRHPEAF-------CRGNDN---------- 835 (836) Q Consensus 777 ~~~i~~gD~s--~~--~i~~~~~l~i~~~~~~~~~~~~~~~r~~~r~d~~v~~p~Af-------~~l~~A---------- 835 (836) ..-+++|--. .+ -++.--++.+..-+..+-.+-|=++-...|+++. .+|=+. ..+.+. T Consensus 440 ~dy~~vG~KG~~~~~~glfy~PYv~l~~~~~~dp~sfqP~~g~~tRY~l~-~NP~~~~~~~~~~~r~~~g~~~~~~ag~n 518 (529) T protein:vir:10 440 QDYFTMGYRGANNLDAGIYYCPYVALTPLRGSDPKNFQPVMGFKTRYAIG-VNPFAESRTQAPQGRITSGMPGVNSVGKN 518 (529) T ss_pred cceEEEEEeCCcccccceeeccccccccccccCCCcccceeeeeeeecee-ecCccccccccccccccCCcchhhhcCcc Confidence 7766666421 01 1111111221111111222223333344455542 233111 000000 Q ss_pred -C Q lcl|NC_016164. 836 -L 836 (836) Q Consensus 836 -~ 836 (836) + T Consensus 519 ~~ 520 (529) T protein:vir:10 519 AY 520 (529) T ss_pred ce Confidence 0 No 232 >protein:vir:1663 Length: 393 # NCBI annotation: unknown # Family: family:all:2417 # MgeID: mge:34 # MgeName: sk1 # Cross-refs: genbank:acc:NP_044952;genbank:gi:9629659;genbank:GeneID:1261309 Probab=69.58 E-value=0.22 Score=24.18 Aligned_cols=374 Identities=13% Similarity=0.069 Sum_probs=116.6 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhHHHHHHHHhhhhhhHHHHh Q lcl|NC_016164. 413 IDMEAVRAQAAADERSRVASITSLCREHKADDLAQGLIESGASEADAMRSVLSEIAKRPAAQPATPAAPVRSAQPIAAGG 492 (836) Q Consensus 413 ~~~e~~~~~~~~~~~~~~~ei~al~~~~~l~e~a~eliee~~t~~e~~~~~l~~l~~~~~a~~~~~~~~~~~~~~~~~~~ 492 (836) +..+. ..+...+..+..+ -.-.+. ...+..+....+ +.+.....-....+.-..+-......-+ T Consensus 1 mnkpd-----liekqnrlaelke---------nnvslk-sqisgfevknai-edl~K~~ELe~TlSe~~iEI~k~en~LN 64 (393) T protein:vir:16 1 MNKPD-----LIEKQNRLAELKE---------NNVSLK-SQISGFEVKNAI-EDLPKVQELEKTLSENSIEIIKIENELN 64 (393) T ss_pred CCCcc-----hhhhhhhhhhhhh---------cccchh-hhccchhhhhhh-hhchhHHHHHHhHhhcchhhhhhhhhhh Confidence 00000 0000001111100 000000 000000000000 0000000000000000000000000000 Q ss_pred hhhhhhhhhhHHHHHHHhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhh-hhccccccccc Q lcl|NC_016164. 493 GSADIGLTDKEARSFSFVRAIRAQMMPGDRAAFEAAAFEREVSEATAQRMGVTPRGILAPNDVLHRDL-VVDTASAAGDL 571 (836) Q Consensus 493 ~~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~-~~~~~~~~g~~ 571 (836) .. .+.......+...+.. .+.+.++.+-+. ...|............ -.+.+.+.-.. T Consensus 65 ~~-----eE~~KGK~kMt~~ies----------------q~A~~eF~~vL~-~N~G~S~~k~AW~A~L~E~GVtiTD~~~ 122 (393) T protein:vir:16 65 AQ-----EEKPKGKDKMTNFIES----------------QNAVTEFFDVLK-KNSGKSEIKNAWSAKLAENGVTITDTTF 122 (393) T ss_pred hh-----hhcchhhHHHHHHHhh----------------HHHHHHHHHHHh-ccCCchhhhhhhhhhHhhcCcceeccch Confidence 00 0000000000011100 111111111110 1111111111111111 11112223334 Q ss_pred ccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEE-ecCCceeeeeccCcccccccccceeEEeeeeeeeeeehhH Q lcl|NC_016164. 572 VFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPR-QTGAATAYWVAEGGDPTESQPSVDQVALVAKTLGAYTEFS 650 (836) Q Consensus 572 vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~-~~~~~~a~~v~Eg~~~~~~~~~~~~it~~~~t~~~~i~IS 650 (836) .+|.-+.-.|...+....++.+..- ++.. +.+-+.+ ..+...+.....|..+++...+|.--++.+..+.....+- T Consensus 123 ~LP~~lv~sI~~A~~n~n~v~~vfH--VT~~-~~~~V~~s~~s~~eAq~HkdGqTK~eqa~~~~~~Tl~~~~VY~~~S~A 199 (393) T protein:vir:16 123 QLPRKLVESINTALLNTNPVFKVFH--VTNV-GALLVSRSFDSANEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLA 199 (393) T ss_pred hccHHHHHHHHHhhhccCcceeeee--eccc-hhhhHHhhhhhhhhhhhhccCCccccceeeeeeechhHHHHHHHHHHH Confidence 5565555555555555555544211 1111 1111111 1112234444566667777777766666665555544442 Q ss_pred HHH---HhcchhHHHHHHHHHHHHHHH-HHHHHHHHhhcCCccc-c-ccc---cccccccccccccc-chhHHHHHHHHH Q lcl|NC_016164. 651 RRL---MLQSSIDVEQMVRTELATVIA-LEIDRAALYGLGSNSQ-P-EGL---KFVTGINTENFGAT-NPTYVELVSMES 720 (836) Q Consensus 651 rel---L~ds~~~l~~~i~~~l~~a~a-~~~d~~il~G~Gt~~~-p-~Gi---~~~~~~~~~t~aa~-~~t~~~l~~a~~ 720 (836) ++ +..+...++.+|..+|+.++. +..+.+++-|+|+|+- . .-+ .......+.+.+++ .+-.+.|-.+.. T Consensus 200 -e~~K~~~~sYsel~N~i~~ELtQ~~vnk~Vd~AlV~GDG~N~f~~~DK~advK~I~k~Ttkaksagktpfadaieeavd 278 (393) T protein:vir:16 200 -ERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVD 278 (393) T ss_pred -HHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHhhhheecCCCCccchhhHHHHHHHHHHhhhhhhcCCCchhHHHHHHHh Confidence 22 233444679999999999998 8899999999998751 1 111 11111222222222 233344555555 Q ss_pred HHhhhccccCccEEEecHHH-HHHHHHHhhccCccccccCCCCe-e---ccee-eEe-eCccccceEEEEehhceEEEee Q lcl|NC_016164. 721 KVAADNADIGAMSYLTNSTL-YGGFKTTEKATSTAQFVLEPGGT-V---NGYN-VVR-SNQVANGDVFFGVWNQMIMGMW 793 (836) Q Consensus 721 ~l~~~~~~~~~~~~vmnp~~-~~~L~~lkd~~g~~~~~~~~~~~-l---~G~p-Vv~-s~~~~~~~i~~gD~s~~~i~~~ 793 (836) -+++.- +...+++.... ...|..++.+..+.....-..++ + .|+. +++ +..-+-..-++.|.+ |++-+. T Consensus 279 fvrpta---grrylivktedrkalldelrqatananvriknddteiasevgvdeiivytgskalkptvlvdqk-yhidmq 354 (393) T protein:vir:16 279 FVRPTA---GRRYLIVKTEDRKALLDELRQATANANVRIKNDDTEIASEVGVDEIIVYTGSKALKPTVLVDQK-YHIDMQ 354 (393) T ss_pred hhccCC---CceEEEEeccchHHHHHHHHhhhccCceeeeccchhhhhhcCcceeeeeeccccccceeeeccc-cccchh Confidence 444322 12234444333 34455565544333221111111 1 1222 111 111111122233322 222111 Q ss_pred cceEEEEecccccccCcEEEEEEEEeccEEEcccceEEEeec Q lcl|NC_016164. 794 GALDIQVNPYALDKSGSVRVTALQDVDVAVRHPEAFCRGNDN 835 (836) Q Consensus 794 ~~l~i~~~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A 835 (836) + +..-+-..|.+|.-.+.++.--.+-+---.|=++++.. T Consensus 355 -d--ltkvdafewktnsnmilvetltsghvetynagavitvs 393 (393) T protein:vir:16 355 -D--LTKVDAFEWKTNSNMILVETLTSGHVETYNAGAVITVS 393 (393) T ss_pred -h--hhhhhhheeccCCceEEEeecccCcceeeccceeEeeC Confidence 0 00011112233332333333222222222222222222 No 233 >protein:vir:78090 Length: 302 # NCBI annotation: Cps # Family: family:all:701 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468790;genbank:gi:157325371;genbank:GeneID:5601852 Probab=63.44 E-value=0.32 Score=23.32 Aligned_cols=258 Identities=11% Similarity=0.038 Sum_probs=104.4 Q ss_pred hhcccccccccccchhhHHHHHHHHHhhhhhhhhc--c-eeeecCCceEEEEEecC----CceeeeeccCcccccccccc Q lcl|NC_016164. 561 VVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLG--V-TMLTGLQGPVAIPRQTG----AATAYWVAEGGDPTESQPSV 633 (836) Q Consensus 561 ~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~--~-~~~~~~~~~~~~p~~~~----~~~a~~v~Eg~~~~~~~~~~ 633 (836) +. ..+-..+.+...+.+.....+....|. . .+.-.....+++|+.+- +..+.-..-++-...++++. T Consensus 1 Ma------ntl~ya~~~~~~Ld~~~~~~~~t~~l~~~~~~v~~~Gak~vkIp~is~~~~~TsGl~dy~R~~g~~~g~v~~ 74 (302) T protein:vir:78 1 MA------NSLALAQIYQDNIDKAIAVNSKSAFLEANPNNVQYNGGNTIKIADISFGSGTTGDLKAYNRSTGFTQGSVTL 74 (302) T ss_pred CC------chhHHHHHHHHHHHHHHHhhhceeecccCCceEEEecCcEEEEEEEEeeccccccccccccccCccccceee Confidence 00 000111233333433333333222221 1 12223456788888762 22333344344444555555 Q ss_pred eeEEeeeee-eeeeehhHHHHHhcchhHHH---HHHHHHH-HHHHHHHHHHHHHhhcCCcccccccccccccccccc--c Q lcl|NC_016164. 634 DQVALVAKT-LGAYTEFSRRLMLQSSIDVE---QMVRTEL-ATVIALEIDRAALYGLGSNSQPEGLKFVTGINTENF--G 706 (836) Q Consensus 634 ~~it~~~~t-~~~~i~ISrelL~ds~~~l~---~~i~~~l-~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~--a 706 (836) +..++++.. -+..+.|. -+..+..+.. +.|..++ ...+.=.+|...++-.-+.....+ +....+. . T Consensus 75 ~~et~tlt~DR~~~f~vD--~mDvdETn~~~~~ani~~ef~r~~vvPEiDayrfskla~~a~~~~-----~~~~~~~~~~ 147 (302) T protein:vir:78 75 AWSDYTLDYDLAQSFQID--AMDVDETKNLATVGNVLSEYQRTKIVPAIDKYRFTKLANDGTGVG-----GVIDLSKPDA 147 (302) T ss_pred eeeeEEeeeccceeeecc--ccchhhhhhhhHHHHHHHHHHHhhhcchhhHHHHHHHHHhhhccC-----ccccccccch Confidence 444433311 11222221 1211111211 2233332 233333455443432211000000 0000111 1 Q ss_pred ccchhHHHHHHHHHHHhhhccccCccEEEecHHHHHHHHHHhhccCcc-------ccccCCCCeecceeeEeeC--cccc Q lcl|NC_016164. 707 ATNPTYVELVSMESKVAADNADIGAMSYLTNSTLYGGFKTTEKATSTA-------QFVLEPGGTVNGYNVVRSN--QVAN 777 (836) Q Consensus 707 a~~~t~~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~~g~~-------~~~~~~~~~l~G~pVv~s~--~~~~ 777 (836) ...-.++.|..++..|..+ ++.+++|+|.++..|+..+..+..- .-+...-..|.|+||+..+ .+.. T Consensus 148 t~~nvl~~i~~~~~~~~e~----~~~vl~vtp~~~~~Lk~a~~~~~~~~~~~~~~~~i~~~V~~lDgv~Ii~VPs~r~~t 223 (302) T protein:vir:78 148 SAQALMGDIATAMELVDDS----NQLILVTSPTTLAGLLNTALIRESKNTQVLRRGEVDTKITFIQDVEVLQVPSEYLYD 223 (302) T ss_pred hHHHHHHHHHHHHHHhhcc----CCeEEEEChHHHHHHhcchhhccceeccccccccccceeeeecccEEEEchhhhccc Confidence 1122345666666666553 4678999999998887533221110 1122334568899987643 3321 Q ss_pred c----------------eEEEEehhceEEEeecceEEEE-ecccccccCcEEEEEEEEeccEEEccc--c-eEEEeecC Q lcl|NC_016164. 778 G----------------DVFFGVWNQMIMGMWGALDIQV-NPYALDKSGSVRVTALQDVDVAVRHPE--A-FCRGNDNL 836 (836) Q Consensus 778 ~----------------~i~~gD~s~~~i~~~~~l~i~~-~~~~~~~~~~~~~r~~~r~d~~v~~p~--A-f~~l~~A~ 836 (836) . .+++...+ ..+.....-.+.. .|...-..+...+.-+.++|.-|.+.+ + ++..+.|+ T Consensus 224 ~~~f~~G~~~~~~ak~INfiiv~~~-a~ia~~K~~~~~if~P~~~~~gd~~l~~~R~Y~D~fV~~nk~~gI~~~~~~~~ 301 (302) T protein:vir:78 224 KVAPKVGVPDYTGAKKIPYMIFKRD-APTGIVKTDKVRVFEPDTNQSADAYKVDLRLYHDLIVPKNQRPGIIKASFGTI 301 (302) T ss_pred ceeccCCccccCCccceeEEEECCC-eeeeeeeeeeeEeeCCCCCCCcceeeeeeeeEeeeeeeccccCeEEEeecccc Confidence 1 12222222 2222111111111 343333334455666667777666643 2 56667777 No 234 >protein:vir:101811 Length: 529 # NCBI annotation: gp23 # Family: family:all:364 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238888;genbank:gi:66391963;genbank:GeneID:3416638 Probab=58.68 E-value=0.41 Score=22.72 Aligned_cols=408 Identities=12% Similarity=0.070 Sum_probs=122.4 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhh----hhhhhh----h Q lcl|NC_016164. 383 ATSSSGPPGAAAATVAPLSHNDNNHMDSSTIDMEAVRAQAAADERSRVASITSLCREHKADDLAQ----GLIESG----A 454 (836) Q Consensus 383 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~ei~al~~~~~l~e~a~----eliee~----~ 454 (836) +....+...+++.+.. . .+-..+|...-+++-...+.+ .+.++. . T Consensus 1 ~~~~~~~l~~kw~p~l-----~----------------------~~~~~~i~~~~~~~~~a~l~enq~~~~~~~~~~~~~ 53 (529) T protein:vir:10 1 MSLKNKEILNKWTPLL-----E----------------------GEGLPEIAGKNKQALVAQILEAQEKDSKSDPVYRDD 53 (529) T ss_pred CccchHHHHHHhhHhh-----c----------------------CCccchhccchhhhhhhhhhhhhHHHHhcccccchh Confidence 0000000011110000 0 000011111000000000000 000000 0 Q ss_pred hHHHHHHHHHHHhhh--hhhhHHHHHHHHhhhhhhHHHHhhhhhhhhhhhHHHHHHHh--hhhhhhhhhhhhhhhhhhhh Q lcl|NC_016164. 455 SEADAMRSVLSEIAK--RPAAQPATPAAPVRSAQPIAAGGGSADIGLTDKEARSFSFV--RAIRAQMMPGDRAAFEAAAF 530 (836) Q Consensus 455 t~~e~~~~~l~~l~~--~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~a~~a~~~~~~~~~~~~~~~ 530 (836) .+.+.....+.+..- ...-.......... ........-....+.++.....-.. -.+-+...|.. T Consensus 54 ~~~e~~~~~l~e~~~~~~~~~~~~~i~~st~--t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTG--------- 122 (529) T protein:vir:10 54 KLIEAFGQSLMEAEVAGDHGYDPTNIAAGQS--SGAITNIGPAVIGMVRRAIPSLIAFDIAGVQPMTGPTG--------- 122 (529) T ss_pred hhhhhhhccchhhcccccccccccccccccc--cccccccCchhhhhHHHHHHhhhhhhhheeccCCchhh--------- Confidence 000000000000000 00000000000000 0000000000001111100000000 00000000000 Q ss_pred hHHHHHHHHHHhhhhhh---------hhhhhhhhhhhhhhhccccc-----ccccccchhhH-HHHHHHHH--hhhhhhh Q lcl|NC_016164. 531 EREVSEATAQRMGVTPR---------GILAPNDVLHRDLVVDTASA-----AGDLVFTDGRP-GSFIELLR--NRLALNT 593 (836) Q Consensus 531 ~~~~a~~~~~~~g~~~~---------g~~~~~~~~~~a~~~~~~~~-----~g~~vvp~~~~-~~ii~~l~--~~~~l~~ 593 (836) ...++..+++.... ....+......+......+. .........+. ....+.+. ..+.+.. T Consensus 123 ---LIFAMRsrY~~~~~~~~~~eaf~~~~~pda~~sga~~~ga~t~~~~t~~~~~ta~~~~a~g~g~ea~f~ea~t~fs~ 199 (529) T protein:vir:10 123 ---QVFALRSVYGKDPLAAGAKEAFHPMYAPDAWHSSLATKGATTTTDGTPFAKLTAGQAIAEGDIVGHFFYESGTAFLQ 199 (529) T ss_pred ---hhheeeeeecCCcccccccccccccccccccccccccccccccccccccccccccccccccccceeeecccCceeec Confidence 00000000000000 00000000000000000000 00000000000 00000000 0000000 Q ss_pred hc--ceeee-----------------cCCceEEEEEecCCceeeee-----ccCcccccccccceeEEeeeeeeeeeehh Q lcl|NC_016164. 594 LG--VTMLT-----------------GLQGPVAIPRQTGAATAYWV-----AEGGDPTESQPSVDQVALVAKTLGAYTEF 649 (836) Q Consensus 594 l~--~~~~~-----------------~~~~~~~~p~~~~~~~a~~v-----~Eg~~~~~~~~~~~~it~~~~t~~~~i~I 649 (836) .. ..... .....+.+...-.+..+.-. .-+.++++-.++++++++.+++-+-.... T Consensus 200 ~~~g~~~~~g~~~t~~~~~~~~~~~~a~~~~~~~~~GmsTa~aEaL~~~ggss~~~f~EMaFsIeK~tVtAKSRaLKAEY 279 (529) T protein:vir:10 200 NVSGASVTVGTNETGEALDKLINAAIGEGKLAEIAEGMATSIAELRQGFNGSNDNPWNEMSFRIDKQTVEAKSRQLKAQY 279 (529) T ss_pred cccccccccCccccCcccccccccccccccccccccchhhhhhhccccCCCcccccccceeeEEEEEEEeeeccceeccc Confidence 00 00000 00011111111011111100 01234677888999999999999999999 Q ss_pred HHHHHhc----chhHHHHHHHHHHHHHHHHHHHHHHHhhcCC------------cccccccccccccccccccccc---- Q lcl|NC_016164. 650 SRRLMLQ----SSIDVEQMVRTELATVIALEIDRAALYGLGS------------NSQPEGLKFVTGINTENFGATN---- 709 (836) Q Consensus 650 SrelL~d----s~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt------------~~~p~Gi~~~~~~~~~t~aa~~---- 709 (836) |-||.+| -.+++++.|.+.|+..|...||+.||.-.-+ ++.-.|++.......+..+-.. T Consensus 280 TiELAQDLKAVHGLDAEtELsNILStEImlEINReii~~l~~~a~~~~~~~~~~~~~~~Gv~d~~~~~~~~~~~~~~e~~ 359 (529) T protein:vir:10 280 SIELAQDLRAVHGMDADSELNGILANEVMLEINREVIDWINYTAQVGKSGWTKTDGSASGVFDFQDPIDVRGARWAGESY 359 (529) T ss_pred cHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHHHhhhhhhhccccccccccccceeecccCccccccchHHHHH Confidence 9999886 2468999999999999999999988743221 1122344433221111100000 Q ss_pred -hhHHHHHHHHHHHhhhccccCccEEEecHHHHHHHHHH--hhc-cC----ccccccCCC----Ceec-ceeeEeeCccc Q lcl|NC_016164. 710 -PTYVELVSMESKVAADNADIGAMSYLTNSTLYGGFKTT--EKA-TS----TAQFVLEPG----GTVN-GYNVVRSNQVA 776 (836) Q Consensus 710 -~t~~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~l--kd~-~g----~~~~~~~~~----~~l~-G~pVv~s~~~~ 776 (836) -.+-.+.++-..+..+.++.....++++++....|... ... .+ ++.-..... |.|. ||+|++.++.+ T Consensus 360 ~~L~~~i~~~an~I~~~T~rg~~n~vi~S~~Va~~L~~~~~~~~~~~~~~~sg~~~d~~~~~~~G~l~~~~~vy~D~y~~ 439 (529) T protein:vir:10 360 KALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALALIDTNISPAAQGMASGLNADTTKGVFAGILGGRYKVYIDQYAR 439 (529) T ss_pred HHHHHHHHHHHHHHHHhhccccceEEEEchHHHHHHHhhcccccccccccccccccccCCceEEEEecCceEEEecCCCC Confidence 01223444444555555544567889999988887632 111 11 111111111 3343 57899999888 Q ss_pred cceEEEEehhceEEEeecceEEEEeccc--------ccccCcEEEEEEEEeccEEEcccceE-------EEeec------ Q lcl|NC_016164. 777 NGDVFFGVWNQMIMGMWGALDIQVNPYA--------LDKSGSVRVTALQDVDVAVRHPEAFC-------RGNDN------ 835 (836) Q Consensus 777 ~~~i~~gD~s~~~i~~~~~l~i~~~~~~--------~~~~~~~~~r~~~r~d~~v~~p~Af~-------~l~~A------ 835 (836) ..-+++|--..-. ... .+...||+ +-.+-|=++-...|+++. .+|=+.- .+.+. T Consensus 440 ~dy~~vG~KG~~~--~~~--glfy~PYv~l~~~~~~dp~sfqP~~g~~tRY~l~-~NP~~~~~~~~~~~r~~~g~~~~~~ 514 (529) T protein:vir:10 440 QDYFTMGYRGANN--LDA--GIYYCPYVALTPLRGFDPKNFQPVMGFKTRYAIG-VNPFAESRTQAPQGRITSGMPGVNS 514 (529) T ss_pred cceEEEEEeCCcc--ccc--ceeeccccccccccccCCCcccceeeeeeeecee-ecCccccccccccccccCCcchhhh Confidence 7766665421000 000 11222332 112222233333444442 2331110 00000 Q ss_pred -----C Q lcl|NC_016164. 836 -----L 836 (836) Q Consensus 836 -----~ 836 (836) + T Consensus 515 ag~n~~ 520 (529) T protein:vir:10 515 VGKNAY 520 (529) T ss_pred cCccce Confidence 0 No 235 >protein:vir:102335 Length: 312 # NCBI annotation: putative capsid protein # Family: family:all:701 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529560;genbank:gi:90592716;genbank:GeneID:3974467 Probab=52.76 E-value=0.55 Score=22.02 Aligned_cols=261 Identities=7% Similarity=-0.000 Sum_probs=108.0 Q ss_pred hhcccccccccccchhhHHHHHHHHHhhhhhhhhc--c-eeeecCCceEEEEEecCCceeeeecc--Cccccccccccee Q lcl|NC_016164. 561 VVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLG--V-TMLTGLQGPVAIPRQTGAATAYWVAE--GGDPTESQPSVDQ 635 (836) Q Consensus 561 ~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~--~-~~~~~~~~~~~~p~~~~~~~a~~v~E--g~~~~~~~~~~~~ 635 (836) +. ..+-..+.+...+-+.+...+....|. . .+.-.....+.+|+.+... +.-..- +..+..++++.+. T Consensus 1 Ma------ntl~ya~~~~~~LD~~~~~~~~s~~l~~~~~~v~~~ggktVkIp~i~~~g-l~DY~R~~g~~~~~g~v~~~~ 73 (312) T protein:vir:10 1 MA------NTLAYGQVLQQGLDKQATQELLTGWMDSNAKQIKYEGGKEVKIGKLSTDG-LGDYSRGSANAYVGGDVKFEY 73 (312) T ss_pred CC------cchhHHHHHHHHHHHHHHhhhccccccCCCceEEEecCcEEEEEeeeccc-ccccccccCCccccccccccc Confidence 00 001112333333333333322111111 1 1222345788888876432 222332 2234445555544 Q ss_pred EEeee--eeeeeeehhHHHHHhcch----hHHHHHHHHHHHHHHHHHHHHHHHhhcCCccccc-cccccccccccccccc Q lcl|NC_016164. 636 VALVA--KTLGAYTEFSRRLMLQSS----IDVEQMVRTELATVIALEIDRAALYGLGSNSQPE-GLKFVTGINTENFGAT 708 (836) Q Consensus 636 it~~~--~t~~~~i~ISrelL~ds~----~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~-Gi~~~~~~~~~t~aa~ 708 (836) .++++ .+.- -+.|. -+..+. ..+...+.......+.=.+|...++-.-+..... +.-+ .......+. T Consensus 74 et~tl~qDR~~-~F~vD--~mDvDETn~~~s~anv~~ef~r~~vvPEiDayrfskla~~a~~~~~~~~---~~~~~~~T~ 147 (312) T protein:vir:10 74 ETKTMTQDRGR-KFTLD--AMDVDETNFLVTATTVMGEFQRLKVIPEIDAYRLSRLATIAIGIKGDTN---VEYSYSVNS 147 (312) T ss_pred eeEEeeecccc-eeecc--ccchhhHhhHHHHHHHHHHHHHhhhcchhhHHHHHHHHhhhhccccccc---cccccccCH Confidence 44433 2222 12221 111111 1222223333334444456665553221110000 0000 000011122 Q ss_pred chhHHHHHHHHHHHhhhccccCccEEEecHHHHHHHHHHhhcc-----CccccccCCCCeecceeeEeeC--cccc---- Q lcl|NC_016164. 709 NPTYVELVSMESKVAADNADIGAMSYLTNSTLYGGFKTTEKAT-----STAQFVLEPGGTVNGYNVVRSN--QVAN---- 777 (836) Q Consensus 709 ~~t~~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~~-----g~~~~~~~~~~~l~G~pVv~s~--~~~~---- 777 (836) .-.++.|..++..|..+... .+.+++|+|..+..|+.-.... .....+...-+.|.|+||+..+ .+.. T Consensus 148 ~ni~~~i~~~~~~lde~~vp-~~rvl~vTp~~~~lLk~~~~~~~~~~~~~~~~i~~~V~~iDgv~Ii~VPs~r~~t~~~f 226 (312) T protein:vir:10 148 STIINKIKTGIKIIRENGYN-GPLVCHLTYDSMFAIEEKVLEKLTAVTFAQGGIQTQVPSIDGCALIKTPQNRMYSSILL 226 (312) T ss_pred HHHHHHHHHHHHHHHHccCC-CceEEEeChHHHHHHhhhhhceecccccccceeeeeeeeecccEEEEchhhhccceeee Confidence 23477888889999887654 5778999999987777521110 0011122333578999988643 2210 Q ss_pred --c-------------------eEEEEehhceEEEeecceEEEE-ecccccccCcEEEEEEEEeccEEEcc--cce-EEE Q lcl|NC_016164. 778 --G-------------------DVFFGVWNQMIMGMWGALDIQV-NPYALDKSGSVRVTALQDVDVAVRHP--EAF-CRG 832 (836) Q Consensus 778 --~-------------------~i~~gD~s~~~i~~~~~l~i~~-~~~~~~~~~~~~~r~~~r~d~~v~~p--~Af-~~l 832 (836) | .+++...+ ..+.....-.+.. .|...-..+...+.-+.++|.-|.+. .++ +.. T Consensus 227 ~dG~t~~~~~gg~~~~~~ak~INfiiv~~~-a~i~~~K~~~~~if~P~~~~~~d~~~~~~R~Y~D~fv~~nk~~~Iyv~~ 305 (312) T protein:vir:10 227 NDGTTSNQTAGGYLKGTKALDTNFIIAPVD-VPLAITKQDKMRIFDPETNQTANAWSMDYRRYHDLWVTDNKANSVYANF 305 (312) T ss_pred ccCcccccccCceeecCcccccceEEeCCc-eeeceeeeeeeeeeCCCCCCCcceeeeeeeeeeeeeeeccccCeEEEEe Confidence 0 01222222 1111111111111 33333333445566666677766664 333 566 Q ss_pred eecC Q lcl|NC_016164. 833 NDNL 836 (836) Q Consensus 833 ~~A~ 836 (836) +.|= T Consensus 306 k~a~ 309 (312) T protein:vir:10 306 KDAK 309 (312) T ss_pred eccc Confidence 6666 No 236 >protein:vir:106998 Length: 468 # NCBI annotation: major capsid protein gp23 # Family: family:all:364 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195142;genbank:gi:58532919;uniprot:Q5GQN0;genbank:GeneID:3260495 Probab=42.92 E-value=0.87 Score=20.93 Aligned_cols=328 Identities=13% Similarity=0.100 Sum_probs=122.6 Q ss_pred HHHHHHHhhhhhhHHHHhhhhhhhhhhhHHHHHHHhhhhhhhhhhhhhhhhhhhhhhHHHHHHHH------------HHh Q lcl|NC_016164. 475 PATPAAPVRSAQPIAAGGGSADIGLTDKEARSFSFVRAIRAQMMPGDRAAFEAAAFEREVSEATA------------QRM 542 (836) Q Consensus 475 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~~~~~a~~~~------------~~~ 542 (836) ..+.+...++=.+.........+.... ++.......+...... ..+ T Consensus 1 ~~~~e~l~~kW~plLe~~~~~~i~~~~----------------------k~~i~a~llENQe~~~~~~~~~~~~~~~~~~ 58 (468) T protein:vir:10 1 MFNAEHLQEKWSPVLNHGEAPAIGDRY----------------------KRAVTSVLLENQERFLREERGMLNEVAVNSL 58 (468) T ss_pred CcchHHHHHhhhHhhcCCccchhccch----------------------hhhhhhhhhhhHHHHHhccccccchhhHhhc Confidence 000000000000000000000000000 0000001111111100 001 Q ss_pred hhhhhhhhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcce-eeecCCceEEEEE-----ecC--- Q lcl|NC_016164. 543 GVTPRGILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVT-MLTGLQGPVAIPR-----QTG--- 613 (836) Q Consensus 543 g~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~-~~~~~~~~~~~p~-----~~~--- 613 (836) +. +...+.........+..... .-|..+. +.++..+..+..+++.. ..++..+-+.-.+ ..+ T Consensus 59 ~~---~~~~~~n~~~~~~~t~~v~~----~~P~Li~--l~RRa~p~LIa~DIwGVQPMTgPTGLIFAmRsrY~n~~g~EA 129 (468) T protein:vir:10 59 GA---GTIAPAGSALGSANTGGLAG----FDPVLIS--LVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEA 129 (468) T ss_pred CC---cccchhhhhhhhcccccccc----cCchhhh--hHHHHHhhhhhhhceeeecCCccceeeeEEEEEecCCCCccc Confidence 10 00000011110000111111 1122211 22222233333343221 1111111111000 000 Q ss_pred -----------------------------------C--------------------ceeeeecc-CcccccccccceeEE Q lcl|NC_016164. 614 -----------------------------------A--------------------ATAYWVAE-GGDPTESQPSVDQVA 637 (836) Q Consensus 614 -----------------------------------~--------------------~~a~~v~E-g~~~~~~~~~~~~it 637 (836) . ..+...++ +.++++-.+++++++ T Consensus 130 f~nEadt~fSg~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~~~~~~g~gMsTa~aE~lG~~~~~f~EMaFsIeK~t 209 (468) T protein:vir:10 130 LFNEPDTGFTGGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLERMGEANRLFREMSFSIEKTS 209 (468) T ss_pred eeccccccccccccccccccccccccccccCCCCCcccccccccccccccccccchHHHhhcCCCCcccceeeeEEEEEE Confidence 0 00000111 223566777888999 Q ss_pred eeeeeeeeeehhHHHHHhc----chhHHHHHHHHHHHHHHHHHHHHHHHhhcCC--------cccccccccccccccccc Q lcl|NC_016164. 638 LVAKTLGAYTEFSRRLMLQ----SSIDVEQMVRTELATVIALEIDRAALYGLGS--------NSQPEGLKFVTGINTENF 705 (836) Q Consensus 638 ~~~~t~~~~i~ISrelL~d----s~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt--------~~~p~Gi~~~~~~~~~t~ 705 (836) +.+++-+-...+|-||.+| -.+++++.|.+.|+..+...+|+.|+.-.-+ +....|++.........+ T Consensus 210 VtAKSRaLKAeYTiELAQDLKAiHGLDAEtELaNILStEImlEINReii~~l~~va~~~k~~g~~~~Gv~d~~~~~~~rw 289 (468) T protein:vir:10 210 VTAQSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKGAQNNVANAGIFDLDVDSNGRW 289 (468) T ss_pred EeeeccceeccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhcHHHHHhHhhhhhheecccccccccccccccccchh Confidence 9999998888999998876 3468999999999999999999988754221 122344443222111111 Q ss_pred cccchhHH----HHHHHHHHHhhhccccCccEEEecHHHHHHHHH---HhhccC---cccc----ccCCC----Ceec-c Q lcl|NC_016164. 706 GATNPTYV----ELVSMESKVAADNADIGAMSYLTNSTLYGGFKT---TEKATS---TAQF----VLEPG----GTVN-G 766 (836) Q Consensus 706 aa~~~t~~----~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~---lkd~~g---~~~~----~~~~~----~~l~-G 766 (836) +. -.+. .|.+....+...........+++++.....|.. +....+ +... ....+ |.|. | T Consensus 290 ~~--e~~k~L~~~i~~ean~i~~~T~rg~gn~ii~S~~Va~~L~~sG~l~~~~~~~~~~~~~~~~~D~tg~~~~G~l~~r 367 (468) T protein:vir:10 290 SV--EKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAGGPSIGEVDDTGNLAVGTINGR 367 (468) T ss_pred HH--HHHHHHHHHHHHHHHHHHHhhccccccEEEechhHHHHHhhcCcceecccccccccccccccccCcceEEEEecCc Confidence 10 0111 123333444444444556789999999888875 221111 1100 01111 2333 5 Q ss_pred eeeEeeCcccc----ceEEEEehhc----eEEEeecceEEEEecccccccCcEEEEEEEEeccEEEcccc---------- Q lcl|NC_016164. 767 YNVVRSNQVAN----GDVFFGVWNQ----MIMGMWGALDIQVNPYALDKSGSVRVTALQDVDVAVRHPEA---------- 828 (836) Q Consensus 767 ~pVv~s~~~~~----~~i~~gD~s~----~~i~~~~~l~i~~~~~~~~~~~~~~~r~~~r~d~~v~~p~A---------- 828 (836) |+|++.+++.. +-+++|--.. --++.--++.+..-+..+-.+-|=++-...|+++ ..+|=+ T Consensus 368 ~~vy~D~Ya~~~s~~dY~~vG~KG~~~~d~glfyaPYv~l~~~~~~dp~sfqP~~g~~tRY~l-~~NP~~~~~~~~~g~~ 446 (468) T protein:vir:10 368 IKVFVDPYAANLSDKHYYVIGYKGTSPYDAGLFYCPYVPLQMVRSIDPNTFQPKIGFKTRYGM-VSNPFVTTNGLYNGTP 446 (468) T ss_pred eEEEEccccccCCccceEEEEEecCcceeceeeeccccccccccccCCCcccceeeeeeeece-eecccceeccccCCCc Confidence 78888876542 3333332100 0111112222222222222333334444556655 234422 Q ss_pred ----eEEEeecC Q lcl|NC_016164. 829 ----FCRGNDNL 836 (836) Q Consensus 829 ----f~~l~~A~ 836 (836) ..+.+++- T Consensus 447 ~~~~~~~~~N~y 458 (468) T protein:vir:10 447 DGEALTPNANMY 458 (468) T ss_pred ccccccccccce Confidence 12222222 No 237 >protein:vir:79078 Length: 307 # NCBI annotation: gp8 # Family: family:all:908 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111208;genbank:gi:134288798;genbank:GeneID:4960752 Probab=42.43 E-value=0.88 Score=20.87 Aligned_cols=265 Identities=9% Similarity=-0.051 Sum_probs=109.2 Q ss_pred hhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEecCCceee----eeccCcccccccc-cce Q lcl|NC_016164. 560 LVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQTGAATAY----WVAEGGDPTESQP-SVD 634 (836) Q Consensus 560 ~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~~~~~a~----~v~Eg~~~~~~~~-~~~ 634 (836) ++ ......++.+.+....+..-.+. .+.....-+++......++++... -.+. ..+-++......+ .++ T Consensus 1 m~----~~~~~~~~dp~LT~~A~gy~n~~-~Iad~lfP~vpV~~~~~k~~~f~~-e~f~~~~t~ra~~~~~~~v~~~~~~ 74 (307) T protein:vir:79 1 MG----RLSKLRIVDPVLTNLAIGYTNAE-FIGQTLMPVVEVEKEGGKIPKFGK-ESFRLYQTERALRAKSNRMNPEDID 74 (307) T ss_pred CC----CCCCCcccCHHHHHHHhhccchh-hhhhhcCCcccccccccceeeecc-ccccccccccccCCCcceeeeeccc Confidence 00 00111122333333222222222 121111112222223333333321 0010 1112222222222 233 Q ss_pred eEEeeeeeeeeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhh--cCCc--ccccccccccccccccccccch Q lcl|NC_016164. 635 QVALVAKTLGAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYG--LGSN--SQPEGLKFVTGINTENFGATNP 710 (836) Q Consensus 635 ~it~~~~t~~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G--~Gt~--~~p~Gi~~~~~~~~~t~aa~~~ 710 (836) ..++.+...+-..+|....-..+.+++.+.-...+...+.+.++..+-.- +..+ ...+..++.+. .-+. ++.- T Consensus 75 ~~~~~~~~~~l~~~id~r~~~~~~~~~~~~Av~~l~d~I~l~~E~~~A~l~~~~~~y~~~~k~tLsgt~--~Wsd-~~sD 151 (307) T protein:vir:79 75 SVDVNLDEHDLEYPIDYREDQESAFPLEQAAVQTATDAIQLRREKMIADLSQNPSSYAAGNKKQLSATE--KFTA-ANSD 151 (307) T ss_pred cccccccccchhhcccchhcCCCCCCHHHHHHHHHHHHHHhHHHHHHHHHhccccccCCCceEEEccCc--ccCC-CCCC Confidence 34444444443344444333334555666666666666666555432111 1111 11111111111 0111 1222 Q ss_pred hHHHHHHHHHHHhhhccccCccEEEecHHHHHHHHH----HhhccCcc--ccccCCCCeeccee-eEeeCcc-----c-- Q lcl|NC_016164. 711 TYVELVSMESKVAADNADIGAMSYLTNSTLYGGFKT----TEKATSTA--QFVLEPGGTVNGYN-VVRSNQV-----A-- 776 (836) Q Consensus 711 t~~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~----lkd~~g~~--~~~~~~~~~l~G~p-Vv~s~~~-----~-- 776 (836) -+.+|.+++.++....+ ..+..++|.+..|..|+. ++.-.++. ..-...-..++|+. |.+.... + T Consensus 152 Pi~di~~~~~ai~~~~g-~~Pn~~vlg~~a~~~l~~h~~i~~~lk~~~~g~it~~~la~l~~v~~V~vg~a~y~~~~~~~ 230 (307) T protein:vir:79 152 PVGVIEDGKEAIRTKIG-RRPNTMVIGASAYKTLKAHPQLIEKIKYSMKGIVTVDLLKEIFEVENIAVGEAIYADDKDRF 230 (307) T ss_pred cHHHHHHHHHHHHHhhC-CccceEEeCHHHHHHHhcCHHHHHHhcCccccccCHHHHHHHhCceeEEEeeeeeecccccc Confidence 35788888888876654 456789999999987753 12222221 11111112344443 3221111 0 Q ss_pred ----cceEEEE--------------ehhceEEEeecceEEEEecccccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 777 ----NGDVFFG--------------VWNQMIMGMWGALDIQVNPYALDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 777 ----~~~i~~g--------------D~s~~~i~~~~~l~i~~~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) .+.+++. .++.-+.+.+.|.-+ ++.+.. ..+.-.+|+.....-.+.-|.+=..+++|+ T Consensus 231 ~~iw~~~~~l~y~~~~~~~~~~~~~~ps~Gyt~~~~g~~~-~d~~~~-~~~~~~vrv~~~~~~~i~~~~~G~li~~~v 306 (307) T protein:vir:79 231 TDIWGANIVLAYVPLQRGGQQRTPYEPSYGYTLRKKGNPV-VDTRIE-DGKLELVRATDIFRPYLLGADAGYLISGIN 306 (307) T ss_pred hhcCCCceEEEecccccCCCCCcccccccceeEEecCceE-EecccC-CCceeEEeecccccceeeccccchhhccCC Confidence 1112221 122222333333221 233222 345567888888888899999989999999 No 238 >protein:vir:80986 Length: 528 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469506;genbank:gi:157311463;genbank:GeneID:5602119 Probab=41.51 E-value=0.92 Score=20.77 Aligned_cols=346 Identities=11% Similarity=0.009 Sum_probs=119.9 Q ss_pred hhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhHHH-HHHHHhhhhhhHHHHhhhhhhhhhhhHHHHHHHhhhhhhhh Q lcl|NC_016164. 439 EHKADDLAQGLIESGASEADAMRSVLSEIAKRPAAQPA-TPAAPVRSAQPIAAGGGSADIGLTDKEARSFSFVRAIRAQM 517 (836) Q Consensus 439 ~~~l~e~a~eliee~~t~~e~~~~~l~~l~~~~~a~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~a~~ 517 (836) .+. ...+.+-=.-+|+... ..+... .........- ++...+. ..++...... T Consensus 1 ~~~-----------~~~l~~kw~p~l~~~~--~~~i~~~~~~~~~a~ll----enq~~~~-~~~~~~~~~~--------- 53 (528) T protein:vir:80 1 MKT-----------TKELMEKWSPLLENEK--LPEIATASKQKLVAKIL----ESQEADF-AVDPIYKDEK--------- 53 (528) T ss_pred Ccc-----------hHHHHHhhhHhhcCCc--cchhcchhhhhhhhhhh----hhhhHHh-hccccccchH--------- Confidence 000 0000000011111000 000000 0000000000 0000000 0000000000 Q ss_pred hhhhhhhhhhhhhhHHHHHHHHHHhhh-hhhhhhhhhhhhhhh-hhhcccccccccccchhhHHHHHHHHHhhhhhhhhc Q lcl|NC_016164. 518 MPGDRAAFEAAAFEREVSEATAQRMGV-TPRGILAPNDVLHRD-LVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLG 595 (836) Q Consensus 518 ~~~~~~~~~~~~~~~~~a~~~~~~~g~-~~~g~~~~~~~~~~a-~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~ 595 (836) ...++...+.. ...+...-....... ..+......+ |.++ . +.++..+..+..+++ T Consensus 54 ----------------~~~~~~~~l~ea~~~~~~~~~~~~i~es~~t~~v~~~~----P~Li-~-lvRra~p~LIa~DIw 111 (528) T protein:vir:80 54 ----------------VVEAFGGFIAEAEVAGDHGYDASQIAAGQTTGAITNVG----PAVI-G-MVRRAIPNLIAFDIC 111 (528) T ss_pred ----------------HHHhhhhhccccccccccCCccccccccccccccccCC----chhh-h-HHHHHHhhhhhhhhh Confidence 00000000000 000000000000000 0000000111 1111 0 111111222222221 Q ss_pred ceeeecC--CceEEEEE--------------------------------------------------------------- Q lcl|NC_016164. 596 VTMLTGL--QGPVAIPR--------------------------------------------------------------- 610 (836) Q Consensus 596 ~~~~~~~--~~~~~~p~--------------------------------------------------------------- 610 (836) . +.|+. ++-+.-.+ T Consensus 112 G-VQPMTgPTGLIFAMRsrY~~~~~~~~~~ea~~~~~~~da~fS~~~t~~~a~~~ea~t~fs~~~~~~~~~~G~~~~~t~ 190 (528) T protein:vir:80 112 G-VQPMSTPTSQIFAIRSVYGPNPLASQAKEAFHPMYAPDAFHSSLAAKGAAVGSPTGTPFAKLAIGTQIEAGDIVHHTF 190 (528) T ss_pred e-eccCCchhhhheeeeeeecCCccccccccccccccccccccccccccccccccccccccccccccccccccceecccc Confidence 1 11110 00000000 Q ss_pred --------------------------------ec--------CCceeeeecc---------CcccccccccceeEEeeee Q lcl|NC_016164. 611 --------------------------------QT--------GAATAYWVAE---------GGDPTESQPSVDQVALVAK 641 (836) Q Consensus 611 --------------------------------~~--------~~~~a~~v~E---------g~~~~~~~~~~~~it~~~~ 641 (836) .. +.....-.+| +.++++-.++++++++.++ T Consensus 191 ~~tg~~~~~~~~~~~~~~~~~gt~~~~~~~~~~~~~~~~~~~~~Gm~Ta~AE~le~lg~ss~~~f~EMaFsIEKvTVtAK 270 (528) T protein:vir:80 191 AETGIAYLQNVTAEQVTPTKAGSESEDEVVMKLMEEGKLAEIAFGMATSIAEIQEGFNGSSNNPWAEMSMRIDKQVVEAK 270 (528) T ss_pred ccccccccccccccccCccccCCcccccccccccccccccccccccchhhhhhhcccCCCccccccceeeEEEEEEEeee Confidence 00 0000001122 1235677788899999999 Q ss_pred eeeeeehhHHHHHhc----chhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcc------------cccccccccccccccc Q lcl|NC_016164. 642 TLGAYTEFSRRLMLQ----SSIDVEQMVRTELATVIALEIDRAALYGLGSNS------------QPEGLKFVTGINTENF 705 (836) Q Consensus 642 t~~~~i~ISrelL~d----s~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~------------~p~Gi~~~~~~~~~t~ 705 (836) +-+-....|=||.+| -.+++++.|.+.|+..|...||+.||.-..+.. .+.|++.-........ T Consensus 271 SRaLKAEYTiELAQDLKAIHGLDAEtELaNILStEImlEINReii~~i~~~a~~~~~~~t~~~~~~~G~~dl~~~~d~~g 350 (528) T protein:vir:80 271 SRQLKARYSIEVAQDLRAVHGMDADAELNAILANEVLLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRG 350 (528) T ss_pred ccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhheeeeeeeeeeeccccccceeeccccccccc Confidence 999888999998876 257899999999999999999999974321111 1233333221111111 Q ss_pred cccch-----hHHHHHHHHHHHhhhccccCccEEEecHHHHHHHHHHh-----hccCc-cccc-cCC----CCeec-cee Q lcl|NC_016164. 706 GATNP-----TYVELVSMESKVAADNADIGAMSYLTNSTLYGGFKTTE-----KATST-AQFV-LEP----GGTVN-GYN 768 (836) Q Consensus 706 aa~~~-----t~~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lk-----d~~g~-~~~~-~~~----~~~l~-G~p 768 (836) +-... .+-.|.++-..+..+.++.....++++++....|...= ...|- ..+. ... .|.|. ||+ T Consensus 351 ~r~~~e~~k~L~~~i~~~an~I~~~T~~~~gn~vi~S~~Va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~ 430 (528) T protein:vir:80 351 ARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILASADQGISLAMQGAAKGLNTDTTKAVFAGVLAGKYK 430 (528) T ss_pred cchhHHHHHHHHHHHHHHHHHHHHhhccccccEEEEchHHHHHHhhccccccccccccccccccCCCCceEEEEecCceE Confidence 00001 11224444455555555445578999999988876531 11111 1111 111 13343 578 Q ss_pred eEeeCccccceEEEEehhc--e--EEEeecceEEEEecccccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 769 VVRSNQVANGDVFFGVWNQ--M--IMGMWGALDIQVNPYALDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 769 Vv~s~~~~~~~i~~gD~s~--~--~i~~~~~l~i~~~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) |++.++.+.+-+++|--.. + -++.--++.+..-.-.+-.+-|=++-...|+++ ..+|=+- ..+++- T Consensus 431 vy~D~y~~~dy~~vG~KG~~~~~~glfy~PYv~l~~~~~~dp~sfqP~~g~~tRY~l-~~NP~~~-~~~~~~ 500 (528) T protein:vir:80 431 VFIDQYARQDYFTVGYKGDNEMDAGIYYAPYVALTPLRATDPQSFHPVLGFKTRYGI-GINPFAD-SKSQAP 500 (528) T ss_pred EEecCCCCcceEEEEEeCCcccccceeecccccceeeEeeCCccccceeeeeeeece-eecCccc-ccCCcc Confidence 9999998877666664211 0 011111111110000111222223333445554 2333111 111111 No 239 >protein:vir:100603 Length: 529 # NCBI annotation: gp23 precursor of major head subunit # Family: family:all:364 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656387;genbank:gi:109290138;genbank:GeneID:4156581 Probab=35.89 E-value=1.2 Score=20.14 Aligned_cols=347 Identities=12% Similarity=0.071 Sum_probs=123.5 Q ss_pred hhhhhhhHHHHHHHHhhhhhhHHHHhhhhhhhhhhhHHHHHHHhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhh Q lcl|NC_016164. 467 IAKRPAAQPATPAAPVRSAQPIAAGGGSADIGLTDKEARSFSFVRAIRAQMMPGDRAAFEAAAFEREVSEATAQRMGVTP 546 (836) Q Consensus 467 l~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~~~~~a~~~~~~~g~~~ 546 (836) +.-..++ ..++=.+.........+....+......+.. ...+.....+.++ .....+.+.+.++... T Consensus 1 ~~~~~~~-------l~~kw~p~l~~~~~~~i~~~~~~~~~a~l~e---nq~~~~~~~~~~~---~~~~~e~~~~~l~e~~ 67 (529) T protein:vir:10 1 MSLKTKE-------ILNKWTPLLEGEGLPEIAGKNKQALVAQILE---AQEKDSKTDPVYR---DDKLIEAFGQSLMEAE 67 (529) T ss_pred CccchHH-------HHHHhhHhhcCCccchhcchhhhhhhhhhhh---hHHHHhhcccccc---hhhhhhhhhhccchhh Confidence 0000000 0000000000000000000000000000000 0000000000000 0011111111111000 Q ss_pred -hhhhhhhhhhhh-hhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeecC--Cce-----EEE--------- Q lcl|NC_016164. 547 -RGILAPNDVLHR-DLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGL--QGP-----VAI--------- 608 (836) Q Consensus 547 -~g~~~~~~~~~~-a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~--~~~-----~~~--------- 608 (836) .+.........+ ...+....... |.++ . +.++..+..+..+++. +.|+. ++- ..+ T Consensus 68 ~~~~~~~~~~~ia~s~~t~~v~~~~----P~Li-~-lvRra~p~LIa~DIwG-VQPMTgPTGLIFAMRsrY~~~~~~~~g 140 (529) T protein:vir:10 68 VAGDHGYDPTNIAAGQSSGAITNIG----PAVI-G-MVRRAIPSLIAFDIAG-VQPMTGPTGQVFALRSVYGKDPLAAGA 140 (529) T ss_pred ccccccccccccccccccccccccc----chhh-h-hHHHHHHhHHhhhhhe-eccCCchhhhhhhheeeecCCcCCCcc Confidence 000000000000 00000011111 1111 1 1111112222222211 00000 000 000 Q ss_pred ---------------------------------------------------------E--EecC---------------- Q lcl|NC_016164. 609 ---------------------------------------------------------P--RQTG---------------- 613 (836) Q Consensus 609 ---------------------------------------------------------p--~~~~---------------- 613 (836) . ...+ T Consensus 141 ~eaf~~~~e~dt~~SG~~~~~~~~~~~~~~~~~~t~~~a~~~~~~~~~~~nea~t~~s~~~tg~~~~~g~~~tg~~~~~~ 220 (529) T protein:vir:10 141 KEAFHPMYAPDAWHSGLAAKGATTSSDGTPFAALTAGQAVATGDIVYHFFYESGSAYLQNVTGGNVTVGTNETGAALDAL 220 (529) T ss_pred cccccccccccccccccccccccccccccccccccccceeeccccceeeecccccccccccccccccccccccCCccccc Confidence 0 0000 Q ss_pred --------------Cceeeeecc---------CcccccccccceeEEeeeeeeeeeehhHHHHHhc----chhHHHHHHH Q lcl|NC_016164. 614 --------------AATAYWVAE---------GGDPTESQPSVDQVALVAKTLGAYTEFSRRLMLQ----SSIDVEQMVR 666 (836) Q Consensus 614 --------------~~~a~~v~E---------g~~~~~~~~~~~~it~~~~t~~~~i~ISrelL~d----s~~~l~~~i~ 666 (836) .+...-.+| +..+++-.++++++++.+++-+-....|=||.+| -.+++++.|. T Consensus 221 ~~~~~a~~~~~~~~~gmsTa~aEal~~~g~ss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAvHGLDAEtELs 300 (529) T protein:vir:10 221 VSAKIAAGELAEIAEGMATSIAELRQGFNGTTDNPWNEMSFRIDKQTVEAKSRQLKAQYSIELAQDLRAVHGMDADSELN 300 (529) T ss_pred cccccccccccccccccchhhhhccccCCCCccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHH Confidence 000000112 1235666788899999999999889999999876 2578999999 Q ss_pred HHHHHHHHHHHHHHHHhhcCC------------cccccccccccccccccccccch-----hHHHHHHHHHHHhhhcccc Q lcl|NC_016164. 667 TELATVIALEIDRAALYGLGS------------NSQPEGLKFVTGINTENFGATNP-----TYVELVSMESKVAADNADI 729 (836) Q Consensus 667 ~~l~~a~a~~~d~~il~G~Gt------------~~~p~Gi~~~~~~~~~t~aa~~~-----t~~~l~~a~~~l~~~~~~~ 729 (836) +.|+..|...||+.||.-.-. .+...|++..........+--.. .+-.+.++-..+..+.++. T Consensus 301 NILStEImlEINReii~~i~~~a~~~~~g~~~~~~~~~gv~d~~~~~d~~~~~~~~e~~~~L~~~i~~~an~I~~~T~rg 380 (529) T protein:vir:10 301 GILANEVMLEINREVIDWINYTAQVGKSGWTQTVGSAAGVFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRG 380 (529) T ss_pred HHHHHHHHHHhhHHHHHHhhhhceeeeeeeeccccccccceeccccccccccchhHHHHHHHHHHHHHHHHHHHHhhccc Confidence 999999999999999862111 11223444322111100000001 1223344444555555544 Q ss_pred CccEEEecHHHHHHHHHH--hhccCc----ccc-ccCCC----Ceec-ceeeEeeCccccceEEEEehh--ce--EEEee Q lcl|NC_016164. 730 GAMSYLTNSTLYGGFKTT--EKATST----AQF-VLEPG----GTVN-GYNVVRSNQVANGDVFFGVWN--QM--IMGMW 793 (836) Q Consensus 730 ~~~~~vmnp~~~~~L~~l--kd~~g~----~~~-~~~~~----~~l~-G~pVv~s~~~~~~~i~~gD~s--~~--~i~~~ 793 (836) ....++++++....|... .+..+. ..+ ..... +.|. ||+|++.++.+..-+++|--. .+ -++.- T Consensus 381 ~~n~vi~S~~Va~~L~~~~~~~~~~~~~~~sg~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfy~ 460 (529) T protein:vir:10 381 AGNFIIASRNVVSALALVDAGITPAAQGMASGLNADTTKGVFAGVLGGRYKVYIDQYARQDYFTMGYRGANNLDAGIYYC 460 (529) T ss_pred cceEEEEchHHHHHHhhhccccccccccccccceeecCCceEEEEecCceEEEecCCCCcceEEEEEeCCcccccceeec Confidence 567889999998888632 221111 111 11111 3343 578999999887766666421 01 11111 Q ss_pred cceEEEEecccccccCcEEEEEEEEeccEEEcccceEEE-eecC Q lcl|NC_016164. 794 GALDIQVNPYALDKSGSVRVTALQDVDVAVRHPEAFCRG-NDNL 836 (836) Q Consensus 794 ~~l~i~~~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l-~~A~ 836 (836) -++.+..-+..+-.+-|=++-...|+++ ..+| |... +++. T Consensus 461 PYv~l~~~~~~dp~sfqP~~g~~tRY~l-~~NP--~~~~~~~~~ 501 (529) T protein:vir:10 461 PYVALTPLRGSDPKNFQPVMGFKTRYAI-GVNP--FAESRTQAP 501 (529) T ss_pred cccccccccccCCCcccceeeeeeeece-eecC--ccccccccc Confidence 1222211111222223334444445555 3344 2211 1211 No 240 >protein:vir:107947 Length: 519 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595301;genbank:gi:161622607;genbank:GeneID:5783666 Probab=35.73 E-value=1.2 Score=20.12 Aligned_cols=345 Identities=13% Similarity=0.066 Sum_probs=124.8 Q ss_pred hhhhhhhHHHHHHHHhhhhhhHHHHhhhhhhhhhhhHHHHHHHhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhh Q lcl|NC_016164. 467 IAKRPAAQPATPAAPVRSAQPIAAGGGSADIGLTDKEARSFSFVRAIRAQMMPGDRAAFEAAAFEREVSEATAQRMGVTP 546 (836) Q Consensus 467 l~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~~~~~a~~~~~~~g~~~ 546 (836) +..+ ...+.=.+.........+....+..--.....-+....... ...........+..-..... T Consensus 1 ~~~~---------~l~~kw~p~l~~~~~~~i~~~~~~~i~~~~~en~~~~~~~~------~~~~~~~~~~~~~~~l~e~~ 65 (519) T protein:vir:10 1 MKKN---------ALVQKWSALLENEALPEIVGASKQAIIAKIFENQEQDILTA------PEYRDEKISEAFGSFLTEAE 65 (519) T ss_pred Cchh---------HHHHHhHHhhcccccchhhhhhhHHHHHHHHHHHHHHhhhc------ccccchHHHHHHhhhcchhc Confidence 1000 00000000000000011100000000000000000000000 00000000111110000000 Q ss_pred -hhhhhhhhhhhhhhhhcccccccccccchhhHHHHHHHH---HhhhhhhhhcceeeecC--Cce-----EEEEEec--- Q lcl|NC_016164. 547 -RGILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELL---RNRLALNTLGVTMLTGL--QGP-----VAIPRQT--- 612 (836) Q Consensus 547 -~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l---~~~~~l~~l~~~~~~~~--~~~-----~~~p~~~--- 612 (836) .+...-. .... ...+ +++.+ . .+...++.+. -+..+..+++. +.|+. ++- ..+.... T Consensus 66 ~~~~~~~~-~t~i--~~~~--~t~~v--~-~~~P~l~~l~rRa~p~LIa~DIwG-VQPMTgPTGLIFAMRsrY~n~~~~~ 136 (519) T protein:vir:10 66 IGGDHGYD-ATNI--AAGQ--TSGAV--T-QIGPAVMGMVRRAIPHLIAFDICG-VQPLNNPTGQVFALRAVYGKDPIAA 136 (519) T ss_pred cCCccccC-cccc--cccc--ccccc--c-ccchhHHHHHHHHHHhhhhhhhhe-eecCCchhhhhheeeeeecCCcccc Confidence 0000000 0000 0000 00110 0 1111222222 22222333321 11111 000 0000000 Q ss_pred --------------------------------------------------------------CC---------------- Q lcl|NC_016164. 613 --------------------------------------------------------------GA---------------- 614 (836) Q Consensus 613 --------------------------------------------------------------~~---------------- 614 (836) +. T Consensus 137 ~g~ea~~~~nEadt~fSG~~~~~~~~~~~~~~~~~~g~~~~~~~~~s~~~~~~~~~~~t~~ag~t~~~~~~~a~~~~~~~ 216 (519) T protein:vir:10 137 GAKEAFHPMYAPNAMFSGQGAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDAAKLDAAVTALVEA 216 (519) T ss_pred ccccccccccccccccCccccccccccccccccccccccccccccccccceeccccccccCCCCcCcccccccccccccc Confidence 00 Q ss_pred ceeee--------ecc---------CcccccccccceeEEeeeeeeeeeehhHHHHHhc----chhHHHHHHHHHHHHHH Q lcl|NC_016164. 615 ATAYW--------VAE---------GGDPTESQPSVDQVALVAKTLGAYTEFSRRLMLQ----SSIDVEQMVRTELATVI 673 (836) Q Consensus 615 ~~a~~--------v~E---------g~~~~~~~~~~~~it~~~~t~~~~i~ISrelL~d----s~~~l~~~i~~~l~~a~ 673 (836) ..... .+| +..+++-.++++++++.+++-+-....|=||.+| -.+++++.|.+.|+..| T Consensus 217 ~~~~~~~~gmsTa~aEal~~lggss~~~f~EMaFsIeKvTVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELaNILSTEI 296 (519) T protein:vir:10 217 GQLAEIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAELSGILATEI 296 (519) T ss_pred ccccccccccccchhhccccCCCccccchhhhceeEEEEEEeeecccccccccHHHHHHHHHhcCCChHHHHHHHHHHHH Confidence 00001 112 1135567778899999999998888999998876 25789999999999999 Q ss_pred HHHHHHHHHhhcCCcc------------cccccccccccccccccccc-----hhHHHHHHHHHHHhhhccccCccEEEe Q lcl|NC_016164. 674 ALEIDRAALYGLGSNS------------QPEGLKFVTGINTENFGATN-----PTYVELVSMESKVAADNADIGAMSYLT 736 (836) Q Consensus 674 a~~~d~~il~G~Gt~~------------~p~Gi~~~~~~~~~t~aa~~-----~t~~~l~~a~~~l~~~~~~~~~~~~vm 736 (836) ...||+.||.-..... .-.|++..........+--. -.+-.+.++-..+..+.++.....+++ T Consensus 297 mlEINReii~~i~~sa~~~~~g~t~~~~~~aGv~d~~~~~d~~~~rw~~e~~k~L~~~i~~~an~I~~~T~r~~gn~ii~ 376 (519) T protein:vir:10 297 MLEINREVIDWINYSAQVGKSGMTNTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAAEIARQTGRGAGNFIIA 376 (519) T ss_pred HHHhhHHHHhhhhhhhhcceeecccCcccccceeecccccccccchHHHHHHHHHHHHHHHHHHHHHHhhccccccEEEE Confidence 9999999985321111 11244432111110000000 012234445555555555555578999 Q ss_pred cHHHHHHHHHHhh--c-cC---cccccc-CCC----Ceec-ceeeEeeCccccceEEEEehhc--e--EEEeecceEEEE Q lcl|NC_016164. 737 NSTLYGGFKTTEK--A-TS---TAQFVL-EPG----GTVN-GYNVVRSNQVANGDVFFGVWNQ--M--IMGMWGALDIQV 800 (836) Q Consensus 737 np~~~~~L~~lkd--~-~g---~~~~~~-~~~----~~l~-G~pVv~s~~~~~~~i~~gD~s~--~--~i~~~~~l~i~~ 800 (836) +++....|...-. . .+ +..+.. ... |.|. ||+|++.+..+.+-+++|--.. + -++.--++.+.. T Consensus 377 S~~Va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l~~ 456 (519) T protein:vir:10 377 SRNVVNVLAAVDTSVSYAAQGLGQGFNVDTTKAVFAGVLGGKYRVYIDQYARSDYFTIGYKGSNEMDAGIYYAPYVALTP 456 (519) T ss_pred chHHHHHHhhccchhccccccccccccccCCCceEEEEecCceEEEecCCCCcceEEEEEecCcccccceeecccccccc Confidence 9998887765421 0 00 011111 111 3443 5799999998877666654210 0 111111222111 Q ss_pred ecccccccCcEEEEEEEEeccEEEcccce-------EEEeecC Q lcl|NC_016164. 801 NPYALDKSGSVRVTALQDVDVAVRHPEAF-------CRGNDNL 836 (836) Q Consensus 801 ~~~~~~~~~~~~~r~~~r~d~~v~~p~Af-------~~l~~A~ 836 (836) -+..+-.+-|=++-...|+++ ..+|=+- +++.+.. T Consensus 457 ~~~~dp~sfqP~~g~~tRY~l-~~NP~~~~~~~~~~~~i~~g~ 498 (519) T protein:vir:10 457 LRGSDPKNFQPVMGFKTRYGI-GINPFADPAAQAPTKRIQNGM 498 (519) T ss_pred ccccCCccccceeeeeeeece-eecCcccccccCccceeccCc Confidence 111222233334444456655 3344211 1111221 No 241 >protein:vir:107882 Length: 307 # NCBI annotation: gp34 # Family: family:all:908 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024707;genbank:gi:48696944;genbank:GeneID:2845970 Probab=35.00 E-value=1.3 Score=20.04 Aligned_cols=264 Identities=8% Similarity=-0.044 Sum_probs=110.6 Q ss_pred hhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeecCCceEEEEEecCCceeeee-----ccCccccccccc-c Q lcl|NC_016164. 560 LVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGLQGPVAIPRQTGAATAYWV-----AEGGDPTESQPS-V 633 (836) Q Consensus 560 ~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~~~~~~~p~~~~~~~a~~v-----~Eg~~~~~~~~~-~ 633 (836) +++ .....++.+.+...-+..-.+..+--.+.+ +++......++++.. ..+..+ +-++.....++. + T Consensus 1 m~~----~~~~~~~dp~LT~~A~gy~n~~~ia~~l~P-~vpv~~~~~k~~~f~--~eaF~~~~t~r~~~~~~~~v~~~~~ 73 (307) T protein:vir:10 1 MGR----LSKLRIVDPVLTNLAIGYTNAEFIGQSLMP-VVEVEKEGGKIPKFG--KESFRLYKTERALRARSNRMNPEDL 73 (307) T ss_pred CCC----CCCCcccChhHHHHHHhhcchhhhhhhcCC-cccccccccceeeEC--cccccchhhhcccCCCcceeecccc Confidence 000 011122333344433333323222222222 223233334444442 111111 112222222221 2 Q ss_pred eeEEeeeeeeeeeehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHh--hcCCc--ccccccccccccccccccccc Q lcl|NC_016164. 634 DQVALVAKTLGAYTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALY--GLGSN--SQPEGLKFVTGINTENFGATN 709 (836) Q Consensus 634 ~~it~~~~t~~~~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~--G~Gt~--~~p~Gi~~~~~~~~~t~aa~~ 709 (836) +..+......+-..++..+.-..+..+..+.....+...+.+..+..+-. -+..+ ...+..++.+. .-+. ++. T Consensus 74 ~~~~~~~~~~~L~~~id~r~~~~~~~~~~~~av~~l~d~I~l~~E~~~A~l~~~~~~y~~~~k~tLsGt~--~Wsd-~~s 150 (307) T protein:vir:10 74 GSIDIVLDEHDLEYPIDYREDQESAFPLEQAAVQTATEAIQLRREKMVADLAQNPNSYAGGNKKQLSATE--KFTA-AGS 150 (307) T ss_pred cccccccccccccccCChhhcCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCccccCCCceEEecccc--ccCC-CCC Confidence 22233333333333444443334455666777777777666665543211 01111 01112222111 1111 122 Q ss_pred hhHHHHHHHHHHHhhhccccCccEEEecHHHHHHHHH----HhhccCcc--ccccCCCCeeccee-eEeeCcc------- Q lcl|NC_016164. 710 PTYVELVSMESKVAADNADIGAMSYLTNSTLYGGFKT----TEKATSTA--QFVLEPGGTVNGYN-VVRSNQV------- 775 (836) Q Consensus 710 ~t~~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~----lkd~~g~~--~~~~~~~~~l~G~p-Vv~s~~~------- 775 (836) --+.+|.+++.++....+ ..+..++|.+..|.+|+. ++.-.++. ..-...-..++|.. |++.... T Consensus 151 DPi~di~~~~~ai~~~~g-~~Pn~~vlg~~a~~al~~hp~i~e~lk~~~~g~it~~~la~ll~v~~i~vg~a~~~~~~~~ 229 (307) T protein:vir:10 151 DPVGVIEDGKEAIRTKIG-RRPNTMVIGASAYKTLKAHPQLIEKIKYSMKGIVTVDLLKEIFEVENIAVGEAIYADDKDR 229 (307) T ss_pred CcHHHHHHHHHHHHhhhC-CccceEEeCHHHHHHHhcCHHHHHHhCCccccccCHHHHHHHhCceeEEEeeeeeeccCCc Confidence 235788888888876654 456789999999987753 12112221 11000011234432 2221110 Q ss_pred ----ccceEEEE--------------ehhceEEEeecceEEEEecccccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 776 ----ANGDVFFG--------------VWNQMIMGMWGALDIQVNPYALDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 776 ----~~~~i~~g--------------D~s~~~i~~~~~l~i~~~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) -.+.+++. .++.-+++.+.+..+. +++.. ..+...+|+...+.-.+.-+.|=.++++|+ T Consensus 230 ~~~iw~~~~vl~yv~~~~~~~~~~~~epsfGyT~~~~g~~~~-d~~~~-~~~~~~~r~~~~~~~~i~~~~~G~li~~~~ 306 (307) T protein:vir:10 230 FTDIWGANIVLAYVPLQRGGQQRTPYEPSYGYTLRKKGNPVV-DTRIE-DGKLELVRSTDIFRPYLLGADAGYLISGIN 306 (307) T ss_pred cceeCCCceEEEecccccCCCCCcccccccceeEEEcCCeEe-eceec-CCceeEEeccccccceeecccccceeccCC Confidence 01122221 1232223333443222 22222 345667888888888889999999999999 No 242 >protein:vir:103463 Length: 521 # NCBI annotation: major head subunit precursor # Family: family:all:364 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803115;genbank:gi:116326395;genbank:GeneID:4405492 Probab=33.66 E-value=1.3 Score=19.88 Aligned_cols=346 Identities=13% Similarity=0.046 Sum_probs=125.0 Q ss_pred hhhhhhhHHHHHHHHhhhhhhHHHHhhhhhhhhhhhHHHHHHHhhhhhhhhhhhhh-hhhhh-hhhhHHHHHHHHHHhhh Q lcl|NC_016164. 467 IAKRPAAQPATPAAPVRSAQPIAAGGGSADIGLTDKEARSFSFVRAIRAQMMPGDR-AAFEA-AAFEREVSEATAQRMGV 544 (836) Q Consensus 467 l~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~a~~~~~~~-~~~~~-~~~~~~~a~~~~~~~g~ 544 (836) +.-. ..+...+.=.+.........+... .+.+.+...-... ..... ..........+...+.. T Consensus 1 ~~~~------~~~~l~~kw~p~l~~~~~~~i~~~---------~~~~~a~~~enq~~~~~~~~~~~~~~~~~~~~~~l~e 65 (521) T protein:vir:10 1 MTIK------TKAELLNKWKPLLEGEGLPEIANS---------KQAIIAKIFENQEKDFQTAPEYKDEKIAQAFGSFLTE 65 (521) T ss_pred CCcc------hhHHHHHhhhhhhccCCCCccccc---------hhhhhhhhhhhhhhhhhhccccchhHHHHHHhhhhhh Confidence 1000 000000000010000000011000 0001000000000 00000 00000111111111100 Q ss_pred -hhhhhhhhhhhhhh-hhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeecC--Cce-----EEEEEec--- Q lcl|NC_016164. 545 -TPRGILAPNDVLHR-DLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGL--QGP-----VAIPRQT--- 612 (836) Q Consensus 545 -~~~g~~~~~~~~~~-a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~--~~~-----~~~p~~~--- 612 (836) ...+.........+ ...+...... -|.++ . +.++..+..+..+++. +.|+. .+- ..+.... T Consensus 66 ~~~~~~~~~~~~~i~es~~t~~v~~~----~P~Li-~-lvRra~p~LIa~DIwG-VQPMTgPTGLIFAMRsrY~~q~~~~ 138 (521) T protein:vir:10 66 AEIGGDHGYNATNIAAGQTSGAVTQI----GPAVM-G-MVRRAIPNLIAFDICG-VQPMNSPTGQVFALRAVYGKDPIAA 138 (521) T ss_pred hcccCccccccccccccccccccccC----Cchhh-h-HHHHHHhhhhhhhcee-eccCCchhhhheeeeeeccCCcccc Confidence 00000000000000 0001111111 12111 1 1222222222333321 11110 000 0000000 Q ss_pred -------------------------------------------------------C-----------------------C Q lcl|NC_016164. 613 -------------------------------------------------------G-----------------------A 614 (836) Q Consensus 613 -------------------------------------------------------~-----------------------~ 614 (836) . . T Consensus 139 ~g~eaf~~~~~ada~fSG~~~at~~s~~~~~~~~~~Gd~~~~~~~~~g~~~~~~~~~~t~~~t~~d~~~~~~~~~~~~~~ 218 (521) T protein:vir:10 139 GAKEAFHPMYGPDAMFSGQGAAKKFAALAASTQTTVGDIYTHFFQDTGTVYLQASAQVTISSTADDAAKLDAEIKKQMEA 218 (521) T ss_pred ccccccchhccccccccccccccccccccccccccccccccccccccccceecccccccCCCcccccccccccccccccc Confidence 0 0 Q ss_pred ceeeee--------cc---------CcccccccccceeEEeeeeeeeeeehhHHHHHhc----chhHHHHHHHHHHHHHH Q lcl|NC_016164. 615 ATAYWV--------AE---------GGDPTESQPSVDQVALVAKTLGAYTEFSRRLMLQ----SSIDVEQMVRTELATVI 673 (836) Q Consensus 615 ~~a~~v--------~E---------g~~~~~~~~~~~~it~~~~t~~~~i~ISrelL~d----s~~~l~~~i~~~l~~a~ 673 (836) .....+ +| +..+++-.++++++++.+++-+-....|=||.+| -.+++++.|.+.|+..| T Consensus 219 ~~~y~~~~GmsTa~aEal~~~g~ss~~~f~EMaFsIeKvtVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELaNILSTEI 298 (521) T protein:vir:10 219 GALVEIAEGMATSIAELQESFNGSTDNPWNEMGFRIDKQVIEAKSRQLKAAYSIELAQDLRAVHGMDADAELSGILATEI 298 (521) T ss_pred cceeecccccchhhHhhhccCCCCccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHH Confidence 000001 11 1235667778889999999998888999999876 25789999999999999 Q ss_pred HHHHHHHHHhhcCC------------cccccccccccccccccccccch-----hHHHHHHHHHHHhhhccccCccEEEe Q lcl|NC_016164. 674 ALEIDRAALYGLGS------------NSQPEGLKFVTGINTENFGATNP-----TYVELVSMESKVAADNADIGAMSYLT 736 (836) Q Consensus 674 a~~~d~~il~G~Gt------------~~~p~Gi~~~~~~~~~t~aa~~~-----t~~~l~~a~~~l~~~~~~~~~~~~vm 736 (836) ...+|+.||.-.-. ++.+.|+|..........+-... .+-.|.+....+..+.+......+++ T Consensus 299 mlEINReii~~i~~sa~~~~~g~t~~~~~~~G~~d~~~~~d~~~~~~~~e~~k~L~~~i~~~an~i~~~T~r~~~n~~i~ 378 (521) T protein:vir:10 299 MLEINREVVDWINYSAQVGKSGMTLTPGSKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAVEIARQTGRGEGNFIIA 378 (521) T ss_pred HHHhhHHHhhhhhheeeeeeeeeeeccCccccceecccccccccchHHHHHHHHHHHHHHHHHHHHHHhcccccceEEEE Confidence 99999999843211 11233444322111111000000 12233344445555555455678899 Q ss_pred cHHHHHHHHHHh--h-c--cC-ccccccCC-C----Ceec-ceeeEeeCccccceEEEEehhc--e--EEEeecceEEEE Q lcl|NC_016164. 737 NSTLYGGFKTTE--K-A--TS-TAQFVLEP-G----GTVN-GYNVVRSNQVANGDVFFGVWNQ--M--IMGMWGALDIQV 800 (836) Q Consensus 737 np~~~~~L~~lk--d-~--~g-~~~~~~~~-~----~~l~-G~pVv~s~~~~~~~i~~gD~s~--~--~i~~~~~l~i~~ 800 (836) +++....|...- + . .| ...+.... + +.|. ||+|++.++.+.+-+++|--.. + -++.--++.+.. T Consensus 379 S~~Va~~L~~~~~~~~~~~~~~~~g~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l~~ 458 (521) T protein:vir:10 379 SRNVVNVLASVDTGISYAAQGLATGFNTDTTKSVFAGVLGGKYRVYIDQYAKQDYFTVGYKGPNEMDAGIYYAPYVALTP 458 (521) T ss_pred chHHHHHHhhcccccccccccccccccccCCCceEEEEecCceEEEecCCCCcceEEEEEeCCcccccceeecccccccc Confidence 999988887531 1 1 11 11222211 2 3443 5789999998877666664211 0 111111122111 Q ss_pred ecccccccCcEEEEEEEEeccEEEcccceEEEeecC Q lcl|NC_016164. 801 NPYALDKSGSVRVTALQDVDVAVRHPEAFCRGNDNL 836 (836) Q Consensus 801 ~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A~ 836 (836) -+..+-.+-|=++-...|+++ ..+|-+- ..+++- T Consensus 459 ~~~~dp~sfqP~~g~~tRY~l-~~NP~~~-~~~~~~ 492 (521) T protein:vir:10 459 LRGSDPKNFQPVMGFKTRYGI-GINPFAE-SAAQAP 492 (521) T ss_pred ccccCCccccceeeeeeeece-eecCccc-ccCCcc Confidence 111222223334444445555 2334111 111110 No 243 >protein:vir:99523 Length: 311 # NCBI annotation: putative protein # Family: family:all:701 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958538;genbank:gi:41179320;genbank:GeneID:2717161 Probab=32.95 E-value=1.4 Score=19.80 Aligned_cols=266 Identities=9% Similarity=0.042 Sum_probs=104.7 Q ss_pred ccccccc--cccchhhHHHHHHHHHhhhhhhhhcc--eeeecCCceEEEEEecCCceeeeeccCcccccccccceeEEee Q lcl|NC_016164. 564 TASAAGD--LVFTDGRPGSFIELLRNRLALNTLGV--TMLTGLQGPVAIPRQTGAATAYWVAEGGDPTESQPSVDQVALV 639 (836) Q Consensus 564 ~~~~~g~--~vvp~~~~~~ii~~l~~~~~l~~l~~--~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~it~~ 639 (836) .+..+.. +-..+.+...+.+.+.....-..+.. ...-.....+++|+.+.. .+.-..-+.-...++++.+..+++ T Consensus 1 ~~~~an~mAlnya~~~~~~Ld~~~~~~~~t~~l~~~~~~~~~Gak~VkIp~i~~~-gl~dY~R~~g~~~g~v~~~~et~t 79 (311) T protein:vir:99 1 MPTDAETRGFNYVTKDGNLLDQKITAGLFTAALGTPEVDLVNGGRSFTLKTISTS-GLKDHTRGKGFNSGTISDEKTIYT 79 (311) T ss_pred CCCcchhhHHHHHHHHHHHHHHHHHhhhcccceecCchheeecCCEEEEEeeeec-cccccccccCccccceeeeeeEEE Confidence 0000000 00112223333222222221111111 111234568888888742 222232222234455554444444 Q ss_pred eeee-eeeehhHHHHHhcchhHH---HHHHHHHHH-HHHHHHHHHHHHhhc-----CCcccccc-ccccccccccccccc Q lcl|NC_016164. 640 AKTL-GAYTEFSRRLMLQSSIDV---EQMVRTELA-TVIALEIDRAALYGL-----GSNSQPEG-LKFVTGINTENFGAT 708 (836) Q Consensus 640 ~~t~-~~~i~ISrelL~ds~~~l---~~~i~~~l~-~a~a~~~d~~il~G~-----Gt~~~p~G-i~~~~~~~~~t~aa~ 708 (836) +..= +..+.|. -+.-+..+. .+.|...+. ..+.=.+|...++-. +..+...+ -+..++......-.. T Consensus 80 l~~DR~~~f~vD--~mDvdETn~~~~~ani~~~f~r~~vvPEiDayrfskla~~a~~~~~~~~~~~~~~~~~~~~~~lt~ 157 (311) T protein:vir:99 80 MGQDRDVEFYLD--RQDVDETDNELAMANISNVFITEHVQPELDSYRFSKIATSFDNLDGTDTEGTLLAKTHKTEETLDE 157 (311) T ss_pred eeeccceeeecc--hhchhhhhhhhHHHHHHHHHHHhhhcchhhHHHHHHHHhhhhcccccccchhhhccccccccccCH Confidence 3211 1122221 111111111 122333332 233334454333211 11111100 000001111111111 Q ss_pred chhHHHHHHHHHHHhhhccccCccEEEecHHHHHHHHHHhhccC-------ccccccCCCCeecceeeEee---Ccccc- Q lcl|NC_016164. 709 NPTYVELVSMESKVAADNADIGAMSYLTNSTLYGGFKTTEKATS-------TAQFVLEPGGTVNGYNVVRS---NQVAN- 777 (836) Q Consensus 709 ~~t~~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lkd~~g-------~~~~~~~~~~~l~G~pVv~s---~~~~~- 777 (836) .-.++.|..++..|... ...+.+++|+|..+..|+..+.-.. ...-+...-+.|.|+||+.. +.+.. T Consensus 158 ~nvl~~l~~~~~~~~~v--~~~~rvl~vTp~~~~lLk~~~~~~r~~~~~~~~~~~i~~~V~~lDgv~Ii~V~ps~r~~t~ 235 (311) T protein:vir:99 158 TNAYSQLKTGIGKVRKY--GTQNLVGYVSSEVMDALERSKEFTRNITNQNVGTTALESRITSIDGVQLIEVYESNRFMTK 235 (311) T ss_pred HHHHHHHHHHHHHHHhc--CCCCeEEEEChHHHHHHhhchhhheeeecccccccccccccceecCeEEEEecCchhhcch Confidence 12367777788777653 2456789999999988775332111 01113334467899987654 33321 Q ss_pred -----c----------eEEEEehhceEEEeecceEEEE-ecccccccCcEEEEEEEEeccEEEcc--cc-eEEEeec Q lcl|NC_016164. 778 -----G----------DVFFGVWNQMIMGMWGALDIQV-NPYALDKSGSVRVTALQDVDVAVRHP--EA-FCRGNDN 835 (836) Q Consensus 778 -----~----------~i~~gD~s~~~i~~~~~l~i~~-~~~~~~~~~~~~~r~~~r~d~~v~~p--~A-f~~l~~A 835 (836) | .+++...+ ..+.....-.+.. .|...-..+...+.-+.++|.-|.+. .+ ++..+.| T Consensus 236 ~~ft~G~~~~~~ak~INfiiv~~~-a~i~~~K~~~v~~f~P~~~~~gd~~l~~~R~Y~D~fv~~nk~~~Iyv~~k~A 311 (311) T protein:vir:99 236 YDFTDGAKPTEDAKAINFLVVAKP-AVISIVKENAVFLFAPGQHTDGDGYLYQNRLYHDLFIKKHKRDGIFVSVKKA 311 (311) T ss_pred hhhcCCccccCcccccceEEeCCC-eeeeeeeeeeeeeeCCCCCCCcceeeeeeeeeeeeeeeccccCeEEEeeecC Confidence 1 12333322 2222222212222 23333333446666666777777764 33 4777888 No 244 >protein:vir:106286 Length: 534 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944113;genbank:gi:38640157;genbank:GeneID:2658034 Probab=31.66 E-value=1.5 Score=19.65 Aligned_cols=346 Identities=14% Similarity=0.099 Sum_probs=120.4 Q ss_pred hhhhhhhHHHHHHHHhhhhhhHHHHhhhhhhhhhhhHHHHHHHhhhhhhhhhhhhhhhhhhhhhhHHH-HHHHHHHhhhh Q lcl|NC_016164. 467 IAKRPAAQPATPAAPVRSAQPIAAGGGSADIGLTDKEARSFSFVRAIRAQMMPGDRAAFEAAAFEREV-SEATAQRMGVT 545 (836) Q Consensus 467 l~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~~~~~-a~~~~~~~g~~ 545 (836) +..+ ...++=.+.........+....+ +.+.+...-.. ............ -....+.++.- T Consensus 1 ~~~~---------~l~~kw~p~l~~~~~~~i~~~~~--------~~~~a~l~enq-~~~~~~~~~~~~~~~~~~~~~~~~ 62 (534) T protein:vir:10 1 MSKK---------SLLKKWQPLVESEGMPAIASMKR--------KDIVARIFENQ-DEDIAHNEGGVYTDQVVVNSMVDV 62 (534) T ss_pred Cchh---------HHHHHhHHhhcCCccccccchhh--------hhhhhhhhhhH-HHHHhhhcccccchhhhhhhhhcc Confidence 1000 00000000000000000000000 00000000000 000000000000 00000011100 Q ss_pred hhhhhhhhhhhhhh-----------hhhcccccccccccchhhHHHHHHHHHhhhhhhhhcce-eeecCCceEEEEE--- Q lcl|NC_016164. 546 PRGILAPNDVLHRD-----------LVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVT-MLTGLQGPVAIPR--- 610 (836) Q Consensus 546 ~~g~~~~~~~~~~a-----------~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~-~~~~~~~~~~~p~--- 610 (836) .... ......+.. ...++.+..-...-|.++ . +.++..+..+..+++.. ..++..+-+.-.+ T Consensus 63 ~~~~-~~~~l~ea~~~~~~g~~~~~ia~s~~s~~v~~~~P~Li-~-lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY 139 (534) T protein:vir:10 63 KGRI-EEARLAEANIGGDHGYDATKIASGETSGSITNVGPAVM-G-LVRRAIPQLIAFDICGVQPMTSSTGQVFTLRAIY 139 (534) T ss_pred ccch-hhccccccccccccccccccccccccccccccccchhh-h-HHHHHHHhhhhhhhheeccCCchhhhheeeeeee Confidence 0000 000000000 000000000000112111 1 12222222223333211 0111111000000 Q ss_pred --ec-------------------------------------------------------CC------------------- Q lcl|NC_016164. 611 --QT-------------------------------------------------------GA------------------- 614 (836) Q Consensus 611 --~~-------------------------------------------------------~~------------------- 614 (836) .. +. T Consensus 140 ~n~~~~~s~~EAf~ne~~adt~fSG~~~a~~~~~~~~~~a~~~g~~~~~~~~~~t~~~~Gt~~~~~~~~~~v~~~~~~~~ 219 (534) T protein:vir:10 140 GGNSQDANAREAFHPTYGPDADFSGRGAAQDIAVFVRGTAVASGAFAKLHIEAATGVQAGTKTVQFIKDYAVDALPADQT 219 (534) T ss_pred cCCCCCccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccCCcc Confidence 00 00 Q ss_pred --------------------ceeeeecc-----C----cccccccccceeEEeeeeeeeeeehhHHHHHhc----chhHH Q lcl|NC_016164. 615 --------------------ATAYWVAE-----G----GDPTESQPSVDQVALVAKTLGAYTEFSRRLMLQ----SSIDV 661 (836) Q Consensus 615 --------------------~~a~~v~E-----g----~~~~~~~~~~~~it~~~~t~~~~i~ISrelL~d----s~~~l 661 (836) +-..-.+| + .++++-.++++++++.+++-+-...+|=||.+| -.+++ T Consensus 220 ~ag~~~~~~~~~~~~y~~~~gm~Ta~AE~lg~~ggs~~~~f~EMsFsIdKvtVtAKSRaLKAEYTiELAQDLKAIHGLDA 299 (534) T protein:vir:10 220 EAGLAYKWLLANGYAVETSSAMATAFAELQQGFNGSADNEWNEMSFRIDKQVVEAKSRQLKAQYSIEMAQDLRAVHGLDA 299 (534) T ss_pred ccccccccccccccceecccccchhhHhhhccCCCCcccchhhcceEEEEEEEeeeccceeccccHHHHHHHHHhcCCCh Confidence 00000011 1 135666788899999999999889999999876 24789 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhcCCc------------ccccccccccccccccccccchhHH-------HHHHHHHHH Q lcl|NC_016164. 662 EQMVRTELATVIALEIDRAALYGLGSN------------SQPEGLKFVTGINTENFGATNPTYV-------ELVSMESKV 722 (836) Q Consensus 662 ~~~i~~~l~~a~a~~~d~~il~G~Gt~------------~~p~Gi~~~~~~~~~t~aa~~~t~~-------~l~~a~~~l 722 (836) ++.|.+.|+..|...||+.||.-.-+. ..-.|++........ .++-...+ .|.++-..+ T Consensus 300 EtELsNILSTEImlEINReii~~l~~~a~~~k~~~~~~~~~~~G~~d~~~~~~~--~~~~~~~e~~~~L~~~i~~~an~i 377 (534) T protein:vir:10 300 DSELSSILANEIMHEINREMVLWINATAKVGKTGWTNMHGGKAGVFDFQDTKDI--RGARWAGESYKALVVQIDKEANEI 377 (534) T ss_pred HHHHHHHHHHHHHHHhhHHHHHHHhhhhheeecccccccccccceeeeeccccc--cchhHHHHHHHHHHHHHHHHHHHH Confidence 999999999999999999987542211 111233322111110 00111111 233333344 Q ss_pred hhhccccCccEEEecHHHHHHHHHH--hhc---cC-ccccc-cCC----CCeec-ceeeEeeCccccceEEEEehhc--e Q lcl|NC_016164. 723 AADNADIGAMSYLTNSTLYGGFKTT--EKA---TS-TAQFV-LEP----GGTVN-GYNVVRSNQVANGDVFFGVWNQ--M 788 (836) Q Consensus 723 ~~~~~~~~~~~~vmnp~~~~~L~~l--kd~---~g-~~~~~-~~~----~~~l~-G~pVv~s~~~~~~~i~~gD~s~--~ 788 (836) ..+.++.....++++++....|... .+. .| ...+. ... .|+|. ||+|++.++.+.+-+++|--.. + T Consensus 378 ~~~T~rg~~n~~v~S~~Va~~L~~~g~l~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~ 457 (534) T protein:vir:10 378 ARQTGRGQGNFIICSRNVAAALGHTDMLMTPAVMGANTTMNTDTTSSLFAGVLAGKYRVYIDQYAVEDYFTVGYKGASEM 457 (534) T ss_pred HHhhccccccEEEEchhHHHHHhhccchhccccccccccccccCCCceEEEEecCceEEEecCCCCcceEEEEEeCCccc Confidence 4444444566889999988887542 110 01 00111 111 23443 5799999998877666654211 0 Q ss_pred --EEEeecceEEEEecccccccCcEEEEEEEEeccEEEcccc-------eEEEeecC Q lcl|NC_016164. 789 --IMGMWGALDIQVNPYALDKSGSVRVTALQDVDVAVRHPEA-------FCRGNDNL 836 (836) Q Consensus 789 --~i~~~~~l~i~~~~~~~~~~~~~~~r~~~r~d~~v~~p~A-------f~~l~~A~ 836 (836) -++.--++.+..-+..+-.+-|=++-...|+++. .+|=+ +.++.+.. T Consensus 458 ~~glfyaPYv~l~~~~~~dp~sfqP~~g~~tRY~l~-~NP~~~~~~~~~~~~i~~g~ 513 (534) T protein:vir:10 458 DAGLYYCPYVALTPLRGTDPKNFQPVLGFKTRYGVK-LHPMADATQNKGFAKISNGM 513 (534) T ss_pred ccceeeccccccccccccCCccccceeeeeeeecee-ecCcccccCCccccccccCC Confidence 1111112222111112222233344444455542 23311 12222221 No 245 >protein:vir:6901 Length: 522 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861877;genbank:gi:32453668;genbank:GeneID:1494303 Probab=31.20 E-value=1.5 Score=19.59 Aligned_cols=347 Identities=13% Similarity=0.054 Sum_probs=127.4 Q ss_pred hhhhhhhHHHHHHHHhhhhhhHHHHhhhhhhhhhhhHHHHHHHhhhhhhhhhhhh-hhhh-hhhhhhHHHHHHHHHHhhh Q lcl|NC_016164. 467 IAKRPAAQPATPAAPVRSAQPIAAGGGSADIGLTDKEARSFSFVRAIRAQMMPGD-RAAF-EAAAFEREVSEATAQRMGV 544 (836) Q Consensus 467 l~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~a~~~~~~-~~~~-~~~~~~~~~a~~~~~~~g~ 544 (836) +.. ....+...++=.+.........+... .+.+.+...-.. +... ...........++...+.. T Consensus 1 ~~~-----~~~~e~l~~kw~p~l~~~~~~~~~~~---------~~~~~a~l~enq~~~~~~~~~~~~~~~~~~~~~~l~e 66 (522) T protein:vir:69 1 MTT-----IKTKAQLVDKWKELLEGEGLPEIANS---------KQAIIAKIFENQEKDFEVSPEYKDEKIAQAFGSFLTE 66 (522) T ss_pred CCc-----cchHHHHHHhhHHHhcCCCCCccccc---------hhhhhhhhhhhhhHHhhcccccchhHHHHhhhhhhhh Confidence 100 00000000000000000000000000 000000000000 0000 0000000111111111100 Q ss_pred -hhhhhhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeecC--Cc-----eEEEEEe----- Q lcl|NC_016164. 545 -TPRGILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGL--QG-----PVAIPRQ----- 611 (836) Q Consensus 545 -~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~--~~-----~~~~p~~----- 611 (836) ...|.........+ .++.+..-.-.-|..+. ++++..+..+..+++. +.|+. .+ ...+... T Consensus 67 a~~~~~~~~~~~~i~---es~~t~~v~~~~P~li~--lvrRa~p~LIa~DIwG-VQPMTgPTGLIFAMRsrY~~q~~~~~ 140 (522) T protein:vir:69 67 AEIGGDHGYNAQNIA---AGQTSGAVTQIGPAVMG--MVRRAIPNLIAFDICG-VQPMNSPTGQVFALRAVYGKDPIAAG 140 (522) T ss_pred hccccccCCCccccc---ccccccccccccchHHH--HHHHHHhhhhhhhcee-eccCCchhhhheeeeeeccCCcccCc Confidence 00011100000000 00000000011122111 1222222222233221 11110 00 0000000 Q ss_pred -----------------------------------------------------cC------------------------- Q lcl|NC_016164. 612 -----------------------------------------------------TG------------------------- 613 (836) Q Consensus 612 -----------------------------------------------------~~------------------------- 613 (836) .. T Consensus 141 ~~eaf~~~neadt~fSG~~~~t~~~~~~~~~~t~~G~~~~~~~~~~gt~~~~~~a~~t~~~t~~~~~~~~~ai~s~~~~~ 220 (522) T protein:vir:69 141 AKEAFHPMYAPDAMFSGQGAAKKFPALAASTQTKVGDIYTHFFQETGTVYLQASAQVTISSSADDAAKLDAEIIKQMEAG 220 (522) T ss_pred cccccccccccccccccccccccccccccccccccccccccccccccceeeecccCCcCCCCCcccccccchhccccccc Confidence 00 Q ss_pred ------Cceeeeecc---------CcccccccccceeEEeeeeeeeeeehhHHHHHhc----chhHHHHHHHHHHHHHHH Q lcl|NC_016164. 614 ------AATAYWVAE---------GGDPTESQPSVDQVALVAKTLGAYTEFSRRLMLQ----SSIDVEQMVRTELATVIA 674 (836) Q Consensus 614 ------~~~a~~v~E---------g~~~~~~~~~~~~it~~~~t~~~~i~ISrelL~d----s~~~l~~~i~~~l~~a~a 674 (836) ..-..-.+| +..+++-.++++++++.+++-+-....|=||.+| -.+++++.|.+.|+..|. T Consensus 221 ~~y~~g~GmsTa~aEal~~lggss~~~f~EMaFsIeKvTVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELaNILSTEIm 300 (522) T protein:vir:69 221 ALVEIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKAAYSIELAQDLRAVHGMDADAELSGILATEIM 300 (522) T ss_pred cceeeccccchhhhhhcccCCCCcccchhhhcceEeeEEEeeecccccccccHHHHHHHHHhcCCChHHHHHHHHHHHHH Confidence 000000122 1235677788899999999999999999999876 257899999999999999 Q ss_pred HHHHHHHHhhcCCcc------------cccccccccccccccccccch-----hHHHHHHHHHHHhhhccccCccEEEec Q lcl|NC_016164. 675 LEIDRAALYGLGSNS------------QPEGLKFVTGINTENFGATNP-----TYVELVSMESKVAADNADIGAMSYLTN 737 (836) Q Consensus 675 ~~~d~~il~G~Gt~~------------~p~Gi~~~~~~~~~t~aa~~~-----t~~~l~~a~~~l~~~~~~~~~~~~vmn 737 (836) ..||+.||.-.-... .+.|++.........++-... .+-.|.++-..+..+.+......++++ T Consensus 301 lEINReii~~i~~sa~~~~~g~t~~~~~~~Gv~Dl~~~~~~~~~rw~~e~~k~L~~~i~~~an~i~~~T~rg~~n~~i~S 380 (522) T protein:vir:69 301 LEINREVVDWINYSAQVGKSGMTNIVGSKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAVEIARQTGRGEGNFIIAS 380 (522) T ss_pred HHhhHHHHhhhhhhheeeccccccccccccceeecccccccccchhHHHHHHHHHHHHHHHHHHHHHhcccccccEEEEc Confidence 999999984321111 123443322211111100000 122344444555555554456789999 Q ss_pred HHHHHHHHHHh--h---ccC-ccccccCC-C----Ceec-ceeeEeeCccccceEEEEehhc--e--EEEeecceEEEEe Q lcl|NC_016164. 738 STLYGGFKTTE--K---ATS-TAQFVLEP-G----GTVN-GYNVVRSNQVANGDVFFGVWNQ--M--IMGMWGALDIQVN 801 (836) Q Consensus 738 p~~~~~L~~lk--d---~~g-~~~~~~~~-~----~~l~-G~pVv~s~~~~~~~i~~gD~s~--~--~i~~~~~l~i~~~ 801 (836) ++....|...- + +.| ...+.... + +.|. ||+|++.++.+.+-+++|--.. + -++.--++.+..- T Consensus 381 ~~Va~~L~~~~~~~~~~~~~~~~g~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l~~~ 460 (522) T protein:vir:69 381 RNVVNVLASVDTGISYAAQGLASGFNTDTTKSVFAGVLGGKYRVYIDQYAKQDYFTVGYKGANEMDAGIYYAPYVALTPL 460 (522) T ss_pred hhHHHHHhhcccccccccccccccccccCCCceEEEEecCceEEEecCCCCcceEEEEEeCCcccccceeeccccccccc Confidence 99988886431 1 111 12222222 1 3443 5789999998877666664211 0 1111112221111 Q ss_pred cccccccCcEEEEEEEEeccEEEcccceEE-Eeec--------C Q lcl|NC_016164. 802 PYALDKSGSVRVTALQDVDVAVRHPEAFCR-GNDN--------L 836 (836) Q Consensus 802 ~~~~~~~~~~~~r~~~r~d~~v~~p~Af~~-l~~A--------~ 836 (836) +..+-.+-|=++-...|+++ ..+| |+. .+++ - T Consensus 461 ~~~dp~sfqP~~g~~tRY~l-~vNP--~~~~~~~~~~~ri~~g~ 501 (522) T protein:vir:69 461 RGSDPKNFQPVMGFKTRYGI-GVNP--FAESSLQAPGARIQSGM 501 (522) T ss_pred cccCCccccceeeeeeeece-eecC--cccccCCcccceeeccc Confidence 11222233334444456555 2334 211 2222 1 No 246 >protein:vir:93696 Length: 364 # NCBI annotation: Bcep22gp55 # Family: family:all:974 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944284;genbank:gi:38640361;genbank:GeneID:2658350 Probab=29.83 E-value=1.6 Score=19.43 Aligned_cols=269 Identities=11% Similarity=0.038 Sum_probs=110.8 Q ss_pred hhcccccccccccchhhHHHHHHHHHhhhhhhh-hcceeeecCCceEEEEEec-------------CCceeeeeccCc-- Q lcl|NC_016164. 561 VVDTASAAGDLVFTDGRPGSFIELLRNRLALNT-LGVTMLTGLQGPVAIPRQT-------------GAATAYWVAEGG-- 624 (836) Q Consensus 561 ~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~-l~~~~~~~~~~~~~~p~~~-------------~~~~a~~v~Eg~-- 624 (836) ++.+....+.......++..+.......+++.+ +.- ++....+...... .......|-++. T Consensus 1 Ma~T~~~~~~p~a~~~ws~~l~~~~~~~s~f~~~l~G---~~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~g~gv~Gd~~l 77 (364) T protein:vir:93 1 MSQTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIG---TSENAVIQRKTELESDAGDRITFDLSVHLRGKPTYGDARV 77 (364) T ss_pred CceeccCcCCHHHHHHHHHHHHHHHHhhCcccccccc---CCCCCcEEEeeecCCCCCceEEeeeeeecccCCcccCcee Confidence 223333333333333444444333333333322 211 0111111111100 000111221111 Q ss_pred ccccccccceeEEeeeeeeeeeeh----hHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHH-hhcCCcc---------c Q lcl|NC_016164. 625 DPTESQPSVDQVALVAKTLGAYTE----FSRRLMLQSSIDVEQMVRTELATVIALEIDRAAL-YGLGSNS---------Q 690 (836) Q Consensus 625 ~~~~~~~~~~~it~~~~t~~~~i~----ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il-~G~Gt~~---------~ 690 (836) +-.+..++|..-++.+.-+..-+. +|+| .+..++...-+..|..-++...+..++ +..|+-+ . T Consensus 78 eGnee~L~~~~~~i~idq~r~~V~~~g~ms~q---Rt~~dlr~~ar~~L~~w~~~~~d~~~f~~laGarg~~~~~~~~~~ 154 (364) T protein:vir:93 78 EGKEESLRFYQDEVRIDQVRHSVSAGGRMSRK---RTVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFIETPD 154 (364) T ss_pred eccccceeEEeeEEEEeeccccccccCchhhh---hhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccC Confidence 113444556555555555554444 4433 356678888888999999998888654 3333211 1 Q ss_pred ccccc-c------------ccc-cccccc-cccchhHHHHHHHHHHHhhhccc--------------cCccEEEecHHHH Q lcl|NC_016164. 691 PEGLK-F------------VTG-INTENF-GATNPTYVELVSMESKVAADNAD--------------IGAMSYLTNSTLY 741 (836) Q Consensus 691 p~Gi~-~------------~~~-~~~~t~-aa~~~t~~~l~~a~~~l~~~~~~--------------~~~~~~vmnp~~~ 741 (836) +.+.. + ..+ ....+. ++..++++-|.++...+...... ...-+++|||..+ T Consensus 155 ~~~~~~N~v~aPt~~r~~~~~~at~~~~l~stD~~sl~~id~a~~~a~~~~~~~~~~~~~~Pv~~~g~~~yV~~l~p~q~ 234 (364) T protein:vir:93 155 FTGYAGNPLDAPDVDHLLYGGVATSKASLAATDIMAPLVIEKAVEKAAMMQAENPDVANMVPVSIDGDDHYVCVMSEYQA 234 (364) T ss_pred cccccccccCCCCCCcEEeccccCchhhccccccccHHHHHHHHHHHHHhCCCCCCCcccceeEecCcceeEEEEcchhh Confidence 11111 0 000 011111 22345677777776655443111 0123788999988 Q ss_pred HHHHHHhhc--------------cCccccccCCCCeecceeeEeeCcccc-------c------eEEEEehhce-EEEee Q lcl|NC_016164. 742 GGFKTTEKA--------------TSTAQFVLEPGGTVNGYNVVRSNQVAN-------G------DVFFGVWNQM-IMGMW 793 (836) Q Consensus 742 ~~L~~lkd~--------------~g~~~~~~~~~~~l~G~pVv~s~~~~~-------~------~i~~gD~s~~-~i~~~ 793 (836) ..|++..+. ..+|+|. +.-+.+.|+.|+....+.. + .+++|--... .++-- T Consensus 235 ~~Lr~~t~~~w~d~qk~A~~~~g~~nPlF~-G~~gm~ngvii~~~~~vi~~~~~~~~~~v~~~ralllGaQA~~~a~g~~ 313 (364) T protein:vir:93 235 TDMRTAAGGTWIDFQKAAAAAEGRNNPIFK-GGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYGTA 313 (364) T ss_pred hhhhhcCCHHHHHHHHHhhhcccccCCcee-cCeeeEcCeEEeccCCcccccccccCccccchhhheecceeeEEEeecC Confidence 888643321 1122332 3345677887766555431 1 1344433321 22223 Q ss_pred cceEEEEecccccccCcEEEEEEEEeccEEEc----ccceEEEeecC Q lcl|NC_016164. 794 GALDIQVNPYALDKSGSVRVTALQDVDVAVRH----PEAFCRGNDNL 836 (836) Q Consensus 794 ~~l~i~~~~~~~~~~~~~~~r~~~r~d~~v~~----p~Af~~l~~A~ 836 (836) +|+...|..+...-.+.+.+-+..-+|++-.+ .-.+..+..|+ T Consensus 314 ~g~~~~w~Ee~~D~gn~~~i~~~~i~G~kK~rF~~~DfGvi~idtaa 360 (364) T protein:vir:93 314 NGLRFDWEETVKDYGNEPAIAAGFIAGMKKARFNNKDFGVISIDTAA 360 (364) T ss_pred CCCCceeeecccCCCCchhhhhhhHhhhhhcccCCccceEEEecccc Confidence 44444444433222233333332222222111 12233333333 No 247 >protein:vir:78148 Length: 123 # NCBI annotation: hypothetical protein # Family: family:all:4955 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294802;genbank:gi:149882823;genbank:GeneID:5309176 Probab=28.93 E-value=1.7 Score=19.32 Aligned_cols=102 Identities=10% Similarity=0.018 Sum_probs=53.8 Q ss_pred EecHHHHHHHHHHhhc-------cCccccccCCCCeecceeeEeeCccccceEEEEe------hhc-------eEEEeec Q lcl|NC_016164. 735 LTNSTLYGGFKTTEKA-------TSTAQFVLEPGGTVNGYNVVRSNQVANGDVFFGV------WNQ-------MIMGMWG 794 (836) Q Consensus 735 vmnp~~~~~L~~lkd~-------~g~~~~~~~~~~~l~G~pVv~s~~~~~~~i~~gD------~s~-------~~i~~~~ 794 (836) +++...|..+...... +.++.|--.-+-+++|+.-++++++|.++.++-| |.+ |.-.... T Consensus 1 vvsdlqfA~~~g~~v~~~aLpRE~aNp~ltG~lpV~~~GltWl~tpnlpg~~a~vlDst~lGgmaDE~l~~Pgya~~~~~ 80 (123) T protein:vir:78 1 MLSGAQFAKLIGILVDDKALPREQANIVLTGSLPVSAYGLTWVTSRHITGTDPWLFDVEQLGGMADEKLLSPEFAPAGNT 80 (123) T ss_pred CcchhhHHHHhcchhcccccccccCCceEecCcceeeeceeeeecCCCCCCccceeehhhhccccccccCCCcccCCCCc Confidence 2222222222111111 1122221122335677777899999988765544 321 2222333 Q ss_pred ceEEEEecccccccCcEEEEEEEEeccEEEcccceEEEee-cC Q lcl|NC_016164. 795 ALDIQVNPYALDKSGSVRVTALQDVDVAVRHPEAFCRGND-NL 836 (836) Q Consensus 795 ~l~i~~~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~-A~ 836 (836) |+++....+..-.+++..+|+..-----|.-|.|.++++- -| T Consensus 81 Gvevkt~Red~~~nD~yriRaRRvTvpiv~EP~Agv~ltg~g~ 123 (123) T protein:vir:78 81 GVEASTERAHQGVKDGYLVRGRRNTVAVVTEPMAGVRLTGTGL 123 (123) T ss_pred ceeEEeeccccCCCCceEEeeeecceeEEecCccceEEeeecC Confidence 5565555544444788888887555556778999998884 34 No 248 >protein:vir:2106 Length: 430 # NCBI annotation: coat protein # Family: family:all:1412 # MgeID: mge:46 # MgeName: P22 # Cross-refs: genbank:acc:NP_059630;genbank:gi:9635538;genbank:GeneID:1262831 Probab=27.80 E-value=1.8 Score=19.17 Aligned_cols=260 Identities=13% Similarity=0.084 Sum_probs=104.0 Q ss_pred hhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeec------CCceEEEEEecCCceeeeeccCcccc--ccccc Q lcl|NC_016164. 561 VVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTG------LQGPVAIPRQTGAATAYWVAEGGDPT--ESQPS 632 (836) Q Consensus 561 ~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~------~~~~~~~p~~~~~~~a~~v~Eg~~~~--~~~~~ 632 (836) +... .+. ++. ...+++++.+.+..++.+.+.+..+. .+..+++|........ .|.... -.... T Consensus 1 Ma~~---~~~-~lt-i~~~eal~~~~n~lV~a~~~~~~r~~d~~~~r~Gdti~ip~p~~~~~~----~G~~~t~~~~~~~ 71 (430) T protein:vir:21 1 MALN---EGQ-IVT-LAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQ----EGWDLTDKATGLL 71 (430) T ss_pred Cccc---cch-hhH-HHHHHHHHHhhhhhhhhhhhhccCCchhhhhcccceEEeecccccccc----ccccccCCCccce Confidence 1110 111 222 23378889999988888765433222 1234555544322222 221111 12345 Q ss_pred ceeEEeeeeeeee-eehhHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccccccccccchh Q lcl|NC_016164. 633 VDQVALVAKTLGA-YTEFSRRLMLQSSIDVEQMVRTELATVIALEIDRAALYGLGSNSQPEGLKFVTGINTENFGATNPT 711 (836) Q Consensus 633 ~~~it~~~~t~~~-~i~ISrelL~ds~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~~p~Gi~~~~~~~~~t~aa~~~t 711 (836) ..++.+++.+.-. .+.+|-+-|.+.. ..+.+|...+ ++++..+|..+++.-...+ .++... ...+...+... T Consensus 72 e~~v~~~~~~~~~V~~~~~~kEl~~~~-~~er~l~pAm-~~LA~~Vd~dl~~~~~~~~---~~v~~~--~~~t~~~~~~~ 144 (430) T protein:vir:21 72 ELNVAVNMGEPDNDFFQLRADDLRDET-AYRRRIQSAA-RKLANNVELKVANMAAEMG---SLVITS--PDAIGTNTADA 144 (430) T ss_pred eeeEeEEEeeeccceEEeehhHhcChh-hHHHHHHHHH-HHHHHHHHHHHHHHhhhhh---hccccc--cCCCCCCCCcc Confidence 5666666666543 2334433344322 2355555555 8899999988875421110 011000 00011122234 Q ss_pred HHHHHHHHHHHhhhccccC-ccEEEecHHHHHHHHH-Hh---hcc--CccccccCCCC-eeccee-eEeeCccccce--- Q lcl|NC_016164. 712 YVELVSMESKVAADNADIG-AMSYLTNSTLYGGFKT-TE---KAT--STAQFVLEPGG-TVNGYN-VVRSNQVANGD--- 779 (836) Q Consensus 712 ~~~l~~a~~~l~~~~~~~~-~~~~vmnp~~~~~L~~-lk---d~~--g~~~~~~~~~~-~l~G~p-Vv~s~~~~~~~--- 779 (836) |.++..+...|.......+ .-..+++|..+..|.. +. .+. +...|..+.-+ .+.|+. +..++.+|..+ T Consensus 145 ~~~~A~a~~~L~~~~vP~~~~R~~~~~p~~~~~l~~~l~~~~~~~~~~~~A~r~g~i~r~~~Gfd~~~~s~~~~~~t~gt 224 (430) T protein:vir:21 145 WNFVADAEEIMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKST 224 (430) T ss_pred hhhHHHHHHHHHHhcCCCCCCcEEEeChHHHHHHhhhhccccccccchhHHHhhcccccccchhhhhhhcCCcccccCcc Confidence 6777777777766655552 4578999999876633 21 111 11123333223 255664 44555554211 Q ss_pred ---E-EEEeh--h--ceEEE--------eecceEEEEeccccccc---------------------CcEEEEEEEEeccE Q lcl|NC_016164. 780 ---V-FFGVW--N--QMIMG--------MWGALDIQVNPYALDKS---------------------GSVRVTALQDVDVA 822 (836) Q Consensus 780 ---i-~~gD~--s--~~~i~--------~~~~l~i~~~~~~~~~~---------------------~~~~~r~~~r~d~~ 822 (836) + +-|-. + .+.+. +.....+..+-...... ..-.|++....+.. T Consensus 225 ~t~~tv~gA~~~~~~~~tv~~~g~~~~~d~~~~~it~s~tg~l~~GD~ftiaGV~~v~~itk~~~~~l~qf~V~a~~~~t 304 (430) T protein:vir:21 225 ATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGMKRGDKISFAGVKFLGQMAKNVLAQDATFSVVRVVDGT 304 (430) T ss_pred CcCceeccccccccccceeccccccccccccceeeeeecccceecccEEEecceeeeccccccccCCcceEEEEEecCCc Confidence 1 00100 0 00000 00000011110000000 11123333332221 Q ss_pred --EEcc--------------cceEEEeecC Q lcl|NC_016164. 823 --VRHP--------------EAFCRGNDNL 836 (836) Q Consensus 823 --v~~p--------------~Af~~l~~A~ 836 (836) .+-| .++.-++.++ T Consensus 305 tv~I~Pai~~~~~~~~~~~~~~y~nVsasp 334 (430) T protein:vir:21 305 HVEITPKPVALDDVSLSPEQRAYANVNTSL 334 (430) T ss_pred eeEEeecccccccccccccccccceecccc Confidence 0001 1111111111 No 249 >protein:vir:6601 Length: 528 # NCBI annotation: major capsid protein # Family: family:all:364 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891732;genbank:gi:33620668;genbank:GeneID:1725275 Probab=27.65 E-value=1.8 Score=19.15 Aligned_cols=348 Identities=11% Similarity=-0.001 Sum_probs=119.2 Q ss_pred hhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhHHH-HHHHHhhhhhhHHHHhhhhhhhhhhhHHHHHHHhhhhhhhh Q lcl|NC_016164. 439 EHKADDLAQGLIESGASEADAMRSVLSEIAKRPAAQPA-TPAAPVRSAQPIAAGGGSADIGLTDKEARSFSFVRAIRAQM 517 (836) Q Consensus 439 ~~~l~e~a~eliee~~t~~e~~~~~l~~l~~~~~a~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~a~~ 517 (836) .+. ...+.+--.-+|+... ..+... .........-. +...... .++........... T Consensus 1 ~~~-----------~~~l~~kw~p~l~~~~--~~~i~~~~~~~~~a~l~e----nq~~~~~-~~~~~~~~~~~~~~---- 58 (528) T protein:vir:66 1 MKT-----------TKELMEKWSPLLENEK--LPEIATASKQKLVAKILE----SQEADFA-VDPIYKDEKVVEAF---- 58 (528) T ss_pred Ccc-----------hHHHHHHhHHhhcCCC--cchhcchhhhhhhhhhhh----hhHHHhh-cccchhhHHHHHhh---- Confidence 110 0001111111111100 000000 00000000000 0000000 00000000000000 Q ss_pred hhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcce Q lcl|NC_016164. 518 MPGDRAAFEAAAFEREVSEATAQRMGVTPRGILAPNDVLHRDLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVT 597 (836) Q Consensus 518 ~~~~~~~~~~~~~~~~~a~~~~~~~g~~~~g~~~~~~~~~~a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~ 597 (836) ..++.-. ...+.............+++..+ -.-|..+. ++++.-+..+..+++. T Consensus 59 ------------------~~~l~ea--~~~~~~~~~~~~i~es~~t~~v~---~~~P~Li~--lvRRa~p~LIa~DIwG- 112 (528) T protein:vir:66 59 ------------------GGFIAEA--EVAGDHGYDASQIAAGQTTGAIT---NVGPAVIG--MVRRAIPNLIAFDICG- 112 (528) T ss_pred ------------------hhhhhhh--cccccccccchhccccccccccc---cCchhHHH--HHHHHHHhhhhhhhhe- Confidence 0000000 00000000000000000000000 00011100 1111111111122211 Q ss_pred eeecCC----------------------------------------------------------------ceEEEEE--- Q lcl|NC_016164. 598 MLTGLQ----------------------------------------------------------------GPVAIPR--- 610 (836) Q Consensus 598 ~~~~~~----------------------------------------------------------------~~~~~p~--- 610 (836) +.|+.. +...+.. T Consensus 113 VQPMTgPTGlIFAmRs~Y~~~~~~~~~~eAfh~~~g~ea~fsea~t~~a~~gGpTGliFAm~s~y~s~~~g~ea~~nea~ 192 (528) T protein:vir:66 113 VQPMSTPTSQIFAIRSVYGGDPLKSGAREAFHPMYAPDAFHSSLAAKEATVGSPTGTAFAKLTLSQAITAGDIVYHTFAE 192 (528) T ss_pred eecCCchhhhheeeeeeecCCcccccccccccccccccccccccccccccccCCccceeecccccccccccceeeecccc Confidence 011100 0000000 Q ss_pred ------------------------------ecCCceee--------eecc---------CcccccccccceeEEeeeeee Q lcl|NC_016164. 611 ------------------------------QTGAATAY--------WVAE---------GGDPTESQPSVDQVALVAKTL 643 (836) Q Consensus 611 ------------------------------~~~~~~a~--------~v~E---------g~~~~~~~~~~~~it~~~~t~ 643 (836) ......+. -.+| +.++++-.++++++++.+++- T Consensus 193 t~fs~~~~~~~~~~~~~~~g~~~g~~~~~~~~a~~~~~~~~~Gm~Ta~aEale~lg~~s~~~f~EMaFsIeK~tVtAKSR 272 (528) T protein:vir:66 193 TGIAYLQNVTGDSVTPQKVGSESEDEVVMKLIEEGKLAEIAFGMATSIAEIQEGFNGSSNNPWAEMSMRIDKQVVEAKSR 272 (528) T ss_pred cceeeeccccccccccCcccccccccccccccccccceecccccchhhhhhhcccCCCcccchhhcceEEEeEEEEeecc Confidence 00000000 1112 123567778889999999999 Q ss_pred eeeehhHHHHHhc----chhHHHHHHHHHHHHHHHHHHHHHHHhhcCCcc------------cccccccccccccccccc Q lcl|NC_016164. 644 GAYTEFSRRLMLQ----SSIDVEQMVRTELATVIALEIDRAALYGLGSNS------------QPEGLKFVTGINTENFGA 707 (836) Q Consensus 644 ~~~i~ISrelL~d----s~~~l~~~i~~~l~~a~a~~~d~~il~G~Gt~~------------~p~Gi~~~~~~~~~t~aa 707 (836) +-....|-||.+| -.+++++.|.+.|...|...||+.||.-..+.. .+.|++.........++- T Consensus 273 aLKAEYTiELAQDLKAIHGLDAEtELsNILStEImlEINREii~~i~~~a~~~~~~~t~~~~~~aG~~dl~~~~d~~g~r 352 (528) T protein:vir:66 273 QLKARYSIEVAQDLRAVHGMDADAELNAILANEVLLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGAR 352 (528) T ss_pred ceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhheeeeeeeeeeeccccccceeecccccccccch Confidence 9999999999886 257899999999999999999999974321111 122333322111111100 Q ss_pred cch-----hHHHHHHHHHHHhhhccccCccEEEecHHHHHHHHHHh-----hccCcc-cc-ccCC----CCeec-ceeeE Q lcl|NC_016164. 708 TNP-----TYVELVSMESKVAADNADIGAMSYLTNSTLYGGFKTTE-----KATSTA-QF-VLEP----GGTVN-GYNVV 770 (836) Q Consensus 708 ~~~-----t~~~l~~a~~~l~~~~~~~~~~~~vmnp~~~~~L~~lk-----d~~g~~-~~-~~~~----~~~l~-G~pVv 770 (836) ... .+-.|.++-..+..+.++.....++++++....|...= +..|-. .+ .... .|.|. ||+|+ T Consensus 353 w~~e~~k~L~~~i~~~an~I~~~T~r~~gn~vi~S~~Va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy 432 (528) T protein:vir:66 353 WAGESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILASADQGISLAMQGAAKGLNTDTTKAVFAGVLAGKYKVF 432 (528) T ss_pred hHHHHHHHHHHHHHHHHHHHHHhhccccccEEEEchHHHHHHhhccccccccccccccccccCCCCceeEEEecCceEEE Confidence 000 11224444455555555445578999999988876531 111111 11 1111 13444 57899 Q ss_pred eeCccccceEEEEehhc--e--EEEeecceEEEEecccccccCcEEEEEEEEeccEEEcccceEEEeec---C Q lcl|NC_016164. 771 RSNQVANGDVFFGVWNQ--M--IMGMWGALDIQVNPYALDKSGSVRVTALQDVDVAVRHPEAFCRGNDN---L 836 (836) Q Consensus 771 ~s~~~~~~~i~~gD~s~--~--~i~~~~~l~i~~~~~~~~~~~~~~~r~~~r~d~~v~~p~Af~~l~~A---~ 836 (836) +.+..+.+-+++|--.. + -++.--++.+..-.-.+-.+-|=++-...|+++ ..+|=+-.. +++ . T Consensus 433 ~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g~~tRY~l-~vNP~~~~~-~~~~~~r 503 (528) T protein:vir:66 433 IDQYARQDYFTVGYKGDNEMDAGIYYAPYVALTPLRATDPQSFHPVLGFKTRYGI-GINPFADSK-SQEPSAR 503 (528) T ss_pred ecCCCCcceEEEEEeCCcccccceeecccccceeeEeeCCccccceeeeeeeece-eecCccccc-Ccccccc Confidence 99998877666664211 0 011111111110000111222223333445544 233311111 010 0 No 250 >protein:vir:7214 Length: 521 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049787;genbank:gi:9632597;genbank:GeneID:1258751 Probab=23.49 E-value=2.3 Score=18.60 Aligned_cols=347 Identities=13% Similarity=0.066 Sum_probs=123.6 Q ss_pred hhhhhhhHHHHHHHHhhhhhhHHHHhhhhhhhhhhhHHHHHHHhhhhhhhhhhhh-hhhhhhh-hhhHHHHHHHHHHhhh Q lcl|NC_016164. 467 IAKRPAAQPATPAAPVRSAQPIAAGGGSADIGLTDKEARSFSFVRAIRAQMMPGD-RAAFEAA-AFEREVSEATAQRMGV 544 (836) Q Consensus 467 l~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~a~~~~~~-~~~~~~~-~~~~~~a~~~~~~~g~ 544 (836) +.-.. .+...+.=.+.........+... .+.+.+...-.. +...... ........++...+.. T Consensus 1 ~~~~~------~~~l~~kw~p~l~~~~~~~i~~~---------~~~~~a~~~enq~~~~~~~~~~~~~~~~~~~~~~l~e 65 (521) T protein:vir:72 1 MTIKT------KAELLNKWKPLLEGEGLPEIANS---------KQAIIAKIFENQEKDFQTAPEYKDEKIAQAFGSFLTE 65 (521) T ss_pred CCcch------hHHHHHhhhhhhccCCCCccccc---------hhhhhhhhhhhhhhhhhhcccccchHHHHHHhhhhhh Confidence 11000 00001110011000000011000 000100000000 0000000 0000011111111100 Q ss_pred hh-hhhhhhhhhhhh-hhhhcccccccccccchhhHHHHHHHHHhhhhhhhhcceeeecC--Cc-----eEEEEE----- Q lcl|NC_016164. 545 TP-RGILAPNDVLHR-DLVVDTASAAGDLVFTDGRPGSFIELLRNRLALNTLGVTMLTGL--QG-----PVAIPR----- 610 (836) Q Consensus 545 ~~-~g~~~~~~~~~~-a~~~~~~~~~g~~vvp~~~~~~ii~~l~~~~~l~~l~~~~~~~~--~~-----~~~~p~----- 610 (836) .. .+.........+ ...+...... -|.++ . +.++..+..+..+++. +.|+. ++ ...+.. T Consensus 66 ~~~~~~~~~~~~~iaes~~t~~v~~~----~P~Li-~-lvRra~p~LIa~DIwG-VQPMTgPTGLIFAMRsrY~~q~~~~ 138 (521) T protein:vir:72 66 AEIGGDHGYNATNIAAGQTSGAVTQI----GPAVM-G-MVRRAIPNLIAFDICG-VQPMNSPTGQVFALRAVYGKDPVAA 138 (521) T ss_pred hcccCccccCcccccccccccccccC----Cchhh-h-HHHHHHhhhhhhhcee-eccCCchhhhheeeeeeecCCCCCc Confidence 00 000000000000 0001111111 11111 1 1121222222223211 11110 00 000000 Q ss_pred ------------------------------------------------------------ecCCc--------------- Q lcl|NC_016164. 611 ------------------------------------------------------------QTGAA--------------- 615 (836) Q Consensus 611 ------------------------------------------------------------~~~~~--------------- 615 (836) ..+.. T Consensus 139 ~g~ea~~~e~~~da~fSG~~~~~~~~~~~~~~~~a~Gd~~~~~~~~~gt~~~~~~~~~~~~~g~t~~~~t~~~v~~~~~a 218 (521) T protein:vir:72 139 GAKEAFHPMYGPDAMFSGQGAAKKFPALAASTQTTVGDIYTHFFQETGTVYLQASVQVTIDAGATDAAKLDAEIKKQMEA 218 (521) T ss_pred ccccccchhcccccccccccccccccccccccccccccccccccccccccccccccccccCCCCCCcccccccccccccc Confidence 00000 Q ss_pred -eee--------eecc---------CcccccccccceeEEeeeeeeeeeehhHHHHHhc----chhHHHHHHHHHHHHHH Q lcl|NC_016164. 616 -TAY--------WVAE---------GGDPTESQPSVDQVALVAKTLGAYTEFSRRLMLQ----SSIDVEQMVRTELATVI 673 (836) Q Consensus 616 -~a~--------~v~E---------g~~~~~~~~~~~~it~~~~t~~~~i~ISrelL~d----s~~~l~~~i~~~l~~a~ 673 (836) ... -.+| +..+++-.++++++++.+++-+-...+|-||.+| -.+++++.|.+.|+..| T Consensus 219 ~~~y~~g~gm~Ta~aEal~~~g~ss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELaNILSTEI 298 (521) T protein:vir:72 219 GALVEIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKAAYSIELAQDLRAVHGMDADAELSGILATEI 298 (521) T ss_pred CceeeeecccchhhhhhhcccCCcccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHH Confidence 000 0112 1224566677789999999988888999998876 25789999999999999 Q ss_pred HHHHHHHHHhhcCC------------cccccccccccccccccccccch-----hHHHHHHHHHHHhhhccccCccEEEe Q lcl|NC_016164. 674 ALEIDRAALYGLGS------------NSQPEGLKFVTGINTENFGATNP-----TYVELVSMESKVAADNADIGAMSYLT 736 (836) Q Consensus 674 a~~~d~~il~G~Gt------------~~~p~Gi~~~~~~~~~t~aa~~~-----t~~~l~~a~~~l~~~~~~~~~~~~vm 736 (836) ...||+.||.-.-. ++.+.|+|..........+-... .+-.|.+....+..+.+......+++ T Consensus 299 mlEINReii~~i~~sa~~g~~g~t~~~~~~~G~~d~~~~~d~~~~~~~~e~~k~L~~~i~~~an~i~~~T~r~~~n~~i~ 378 (521) T protein:vir:72 299 MLEINREVVDWINYSAQVGKSGMTLTPGSKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAVEIARQTGRGEGNFIIA 378 (521) T ss_pred HHHhhHHHhhhhhheeeeeeeeeeeccCccccceecccccccccchHHHHHHHHHHHHHHHHHHHHHHhcccccceEEEE Confidence 99999999843211 11233444322111111000000 12233344445555555455678899 Q ss_pred cHHHHHHHHHHh--h-c--cC-ccccccCC-C----Cee-cceeeEeeCccccceEEEEehhc--e--EEEeecceEEEE Q lcl|NC_016164. 737 NSTLYGGFKTTE--K-A--TS-TAQFVLEP-G----GTV-NGYNVVRSNQVANGDVFFGVWNQ--M--IMGMWGALDIQV 800 (836) Q Consensus 737 np~~~~~L~~lk--d-~--~g-~~~~~~~~-~----~~l-~G~pVv~s~~~~~~~i~~gD~s~--~--~i~~~~~l~i~~ 800 (836) +++....|...- + . .| ...+.... + |.| .||+|++.++.+.+-+++|--.. + -++.--++.+.. T Consensus 379 S~~Va~~L~~~~~~~~~~~~~~~~g~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l~~ 458 (521) T protein:vir:72 379 SRNVVNVLASVDTGISYAAQGLATGFSTDTTKSVFAGVLGGKYRVYIDQYAKQDYFTVGYKGPNEMDAGIYYAPYVALTP 458 (521) T ss_pred chHHHHHHhhcccccccccccccccccccCCCceEEEEccCceEEEecCCCCcceEEEEEeCCcccccceeecccccccc Confidence 999988887431 1 1 11 11222221 1 233 35799999998877666664211 0 011111122111 Q ss_pred ecccccccCcEEEEEEEEeccEEEcccce-------EEEeecC Q lcl|NC_016164. 801 NPYALDKSGSVRVTALQDVDVAVRHPEAF-------CRGNDNL 836 (836) Q Consensus 801 ~~~~~~~~~~~~~r~~~r~d~~v~~p~Af-------~~l~~A~ 836 (836) -+..+-.+-|=++-...|+++. .+|=+- ++++..- T Consensus 459 ~~~~dp~sfqP~~g~~tRY~l~-~NP~~~~~~~~~a~~i~~~~ 500 (521) T protein:vir:72 459 LRGSDPKNFQPVMGFKTRYGIG-INPFAESAAQAPASRIQSGM 500 (521) T ss_pred ccccCCccccceeeeeeeecee-ecCcccccCcccceeecCcC Confidence 1111222223333344455542 233111 1111111 Done!